CN111065408A - 免疫原性组合物 - Google Patents

免疫原性组合物 Download PDF

Info

Publication number
CN111065408A
CN111065408A CN201880057887.4A CN201880057887A CN111065408A CN 111065408 A CN111065408 A CN 111065408A CN 201880057887 A CN201880057887 A CN 201880057887A CN 111065408 A CN111065408 A CN 111065408A
Authority
CN
China
Prior art keywords
seq
nucleotide sequence
polypeptide
amino acid
amino acids
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201880057887.4A
Other languages
English (en)
Inventor
J·J·宾德
H·K·赵
P·J·科克尔
D·J·福尔克纳
S·古鲁
M·M·A·马丁尼茨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Pfizer Inc
Original Assignee
Pfizer Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Pfizer Inc filed Critical Pfizer Inc
Publication of CN111065408A publication Critical patent/CN111065408A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/705Receptors; Cell surface antigens; Cell surface determinants
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/12Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
    • C12N9/1241Nucleotidyltransferases (2.7.7)
    • C12N9/1276RNA-directed DNA polymerase (2.7.7.49), i.e. reverse transcriptase or telomerase
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y207/00Transferases transferring phosphorus-containing groups (2.7)
    • C12Y207/07Nucleotidyltransferases (2.7.7)
    • C12Y207/07049RNA-directed DNA polymerase (2.7.7.49), i.e. telomerase or reverse-transcriptase
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K2039/51Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
    • A61K2039/525Virus
    • A61K2039/5256Virus expressing foreign proteins
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K2039/51Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
    • A61K2039/53DNA (RNA) vaccination
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K2039/54Medicinal preparations containing antigens or antibodies characterised by the route of administration
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K2039/545Medicinal preparations containing antigens or antibodies characterised by the dose, timing or administration schedule
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K2039/555Medicinal preparations containing antigens or antibodies characterised by a specific combination antigen/adjuvant
    • A61K2039/55511Organic adjuvants
    • A61K2039/55555Liposomes; Vesicles, e.g. nanoparticles; Spheres, e.g. nanospheres; Polymers
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K2039/70Multivalent vaccine
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K2300/00Mixtures or combinations of active ingredients, wherein at least one active ingredient is fully defined in groups A61K31/00 - A61K41/00
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K39/0005Vertebrate antigens
    • A61K39/0011Cancer antigens
    • A61K39/001154Enzymes
    • A61K39/001157Telomerase or TERT [telomerase reverse transcriptase]
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K39/0005Vertebrate antigens
    • A61K39/0011Cancer antigens
    • A61K39/001169Tumor associated carbohydrates
    • A61K39/00117Mucins, e.g. MUC-1
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K39/0005Vertebrate antigens
    • A61K39/0011Cancer antigens
    • A61K39/00118Cancer antigens from embryonic or fetal origin
    • A61K39/001182Carcinoembryonic antigen [CEA]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2710/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
    • C12N2710/00011Details
    • C12N2710/10011Adenoviridae
    • C12N2710/10311Mastadenovirus, e.g. human or simian adenoviruses
    • C12N2710/10341Use of virus, viral particle or viral elements as a vector
    • C12N2710/10343Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Zoology (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Wood Science & Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Medicinal Chemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Biomedical Technology (AREA)
  • Biotechnology (AREA)
  • Immunology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Epidemiology (AREA)
  • Veterinary Medicine (AREA)
  • Public Health (AREA)
  • Animal Behavior & Ethology (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Cell Biology (AREA)
  • Toxicology (AREA)
  • Biophysics (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
  • Peptides Or Proteins (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
  • Oncology (AREA)
  • Mycology (AREA)
  • Medicines Containing Material From Animals Or Micro-Organisms (AREA)
  • Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
  • Pregnancy & Childbirth (AREA)
  • Reproductive Health (AREA)
  • Gynecology & Obstetrics (AREA)
  • Developmental Biology & Embryology (AREA)
  • Enzymes And Modification Thereof (AREA)

Abstract

本公开提供:(a)分离的免疫原性CEA多肽;(b)编码(i)免疫原性CEA多肽、(ii)免疫原性CEA多肽与免疫原性MUC1多肽、(iii)免疫原性CEA多肽与免疫原性TERT多肽或(iv)免疫原性CEA多肽、免疫原性MUC1多肽及免疫原性TERT多肽的分离的核酸分子;(c)包含分离的核酸分子的组合物;及(d)有关使用所述免疫原性CEA多肽、核酸分子及组合物的方法。

Description

免疫原性组合物
相关申请参考
本申请请求2017年7月11日申请的美国临时申请号62/531,227及2018年6月7日申请的美国临时申请号62/682,044的优先权。前述申请的每一个的全部内容通过参考并入本文。
序列表参考
本申请与电子格式序列表一起提交。序列表以.txt格式的文件提供,命名为“PC72354A_FF_SeqList_ST25.txt”,建立在2018年6月8日,大小为963KB。该.txt文件中所含的序列表是本说明书的部分,其全部内容通过参考并入本文。
技术领域
本发明概括而言系有关免疫疗法且特别是针对治疗或预防赘生性病症的疫苗及方法。
先前技术
癌症是全世界死亡的主要原因。它们可能发生在多种器官及组织,如胰脏、乳房、肺、胃、结肠及直肠。在美国,胰腺癌是癌症死亡的第四最常见原因。胰腺癌可能发生在胰脏的外分泌或内分泌部分。外分泌癌症包括(1)胰腺癌,其是目前最常见类型;(2)腺泡细胞癌,其占外分泌胰腺癌的5%;(3)囊腺癌,其占胰腺癌的1%及(4)其他罕见型癌症,例如胰母细胞瘤、腺鳞癌、印戒细胞癌、肝样癌、胶体癌、未分化癌及具破骨细胞样巨细胞的未分化癌。
乳腺癌(BrC)是美国女性间另一常见癌症及女性癌症死亡的第二主要原因。根据各种肿瘤标记,如雌激素受体(ER)、黄体酮受体(PR)及人表皮生长因子受体2(HER2),乳腺癌可分为主要亚型,例如(1)激素受体阳性癌(其中癌细胞含有雌激素受体或黄体酮受体);(2)激素受体阴性癌(其中癌细胞不具雌激素受体或黄体酮受体);(3)HER2/neu阳性癌(其中具过量HER2/neu蛋白或额外拷贝的HER2/neu基因的癌症);(4)HER2/neu阴性癌症(其中癌症不具过量HER2/neu);(5)三阴性癌症(其中乳腺癌细胞不具雌激素受体、黄体酮受体或过量HER2)及(6)三阳性癌症(其中癌症为雌激素受体阳性、黄体酮受体阳性及具有过多HER2)。
肺癌占所有癌症死亡的四分之一以上且为全球性癌症相关死亡的主要原因。大约85%病例在组织学上被分类为非小细胞肺癌(NSCLC)。NSCLC可进一步分类为数个亚型,如鳞状细胞(表皮样)癌、腺癌、巨细胞(未分化)癌、腺鳞癌及肉瘤样癌。肺癌的第二常见类型为小细胞肺癌(SCLC),占所有肺癌的约10%至15%。
胃癌(GaC)是世界上癌症相关死亡的第三最常见原因。约90-95%的胃癌为腺癌;其他较少见类型包括淋巴瘤、GIST及类癌瘤。
在美国,结肠直肠癌(CRC)也是癌症相关死亡的主要原因。腺癌是CRC的最常见类型,占结肠直肠癌的超过95%。其他较少见的CRC类型包括类癌瘤、胃肠道基质肿瘤(GIST)、淋巴瘤及肉瘤。
癌症管理的传统疗法在循环癌症及实体癌症选择性组别的管理上获得了成功。然而,许多类型癌症对传统方法有抗性。近年来,探索了癌症的免疫疗法,特别是癌症疫苗及抗体疗法。癌症免疫疗法的一种方法涉及施用免疫原以产生针对标靶癌细胞上的肿瘤相关抗原(TAA)的活性全身性免疫应答。虽然已鉴定大量肿瘤相关抗原且这些抗原许多已进行了探索作为治疗或预防癌症的基于病毒、细菌、蛋白质、肽或DNA的疫苗;但是迄今多数临床试验仍无法产生治疗产品。因此,存在对于可用于治疗或预防癌症的免疫原或疫苗的需求。
本公开涉及衍生自肿瘤相关抗原MUC1、CEA或TERT的免疫原性多肽;编码这些免疫原性多肽的核酸分子;包含这些免疫原性多肽或核酸分子的组合物(如疫苗)与所述多肽、核酸分子及组合物的用途。
人黏蛋白1蛋白质(MUC1;也称为episialin、PEM、H23Ag、EMA、CA15-3及MCA)是在简单上皮细胞及腺上皮细胞顶面表达的多形性跨膜糖蛋白。MUC1基因编码包含信号肽序列的单一多肽链前体。该信号肽序列在翻译后立即被移除且该MUC1前体余留部分进一步被切割成两个肽片段:较长的N端亚单位(MUC1-N或MUC1α)及较短的C端亚单位(MUC1-C或MUC1β)。成熟的MUC1包含经通过穏定氢键相关联的MUC1-N及MUC1-C。MUC1-N是胞外结构域,含有具20个氨基酸残基的可变数目串联重复序列(VNTR),不同个体的重复序列数从20至125个不等。由可变数目串联重复序列组成的MUC1蛋白区域在本公开中亦称为“VNTR区域”。MUC1-C含有短的胞外区域(大约53个氨基酸)、跨膜结构域(大约28个氨基酸)及细胞质尾部(大约72个氨基酸)。MUC1细胞质尾部(MUC1-CT)含有高度保守的丝氨酸及酪氨酸残基,其由生长因子受体及细胞内激酶磷酸化。人MUC1以由不同类型的MUC1 RNA可变剪接产生的多种同种型存在。全长人MUC1同种型1蛋白前体(同种型1,Uniprot P15941-1)的氨基酸序列提供于SEQ ID NO:1(“MUC1参考多肽”)。迄今已经报导了人MUC-1的至少16种其他同种型(UniprotP15941-2至P15941-17),相较于同种型1的序列,其包括各种插入、缺失或取代。这些同种型称为同种型2、3、4、5、6、Y、8、9、F、Y-LSP、S2、M6、ZD、T10、E2及J13(分别为Uniprot P15941-2至P15941-17)。该全长人MUC1同种型1前体蛋白由1255个氨基酸组成,其包括氨基酸1-23的信号肽序列。成熟MUC1蛋白的MUC1-N及MUC1-C结构域分别由氨基酸24-1097及1098-1255组成。
癌胚抗原相关的细胞黏附分子(也称为CEACAM)是免疫球蛋白(Ig)超家族中的一组糖蛋白。在结构上,CEACAM组由单个N端结构域及最多六个二硫键连接的内部结构域(类似于C2型Ig结构域)组成。该组含有12种蛋白质(CEACAM1、3-8、16、18-21),其中数种,如CEACAM1、CEACAM5及CEACAM6,在如黑色素瘤、肺癌、直肠结肠癌及胰腺癌等多种癌症中,已被考虑作为有效的临床标记及有希望的治疗标靶。已在大多数人体癌症中发现CEACAM5的过表达,其在本文以及本领域也称为CEA。CEACAM5表达为702-氨基酸前体蛋白,其由:(1)信号肽(氨基酸1-34);(2)N结构域(氨基酸35-144);(3)包含称为A1(氨基酸146-237)、B1(氨基酸238-322)、A2(氨基酸324-415)及B2(氨基酸416-498)、A3(氨基酸502-593)与B3(氨基酸594-677)六个恒定C2样结构域的三个重复单元;及(4)前肽(氨基酸686-702)组成。所述信号肽在运输至细胞表面期间从成熟蛋白质被切下。全长人CEA前体蛋白的氨基酸序列可从UniProt获得(登记号P06731)且亦示于本文SEQ ID NO:2(“CEA参考多肽”)中。
端粒酶反转录酶(或TERT)是端粒酶的催化性组成部分,其是负责通过添加端粒重复TTAGGG维持端粒末端的核糖核蛋白聚合酶。除了TERT之外,端粒酶也包含作为该端粒重复模板用的RNA组成部分。人TERT基因编码1132个氨基酸的蛋白质。存在由可变剪接产生的数种人TERT同种型。同种型1、同种型2、同种型3及同种型4的氨基酸序列可在Uniprot获得(<www.uniprot.org>;Uniprot识别符分别为O14746-1、O14746-2、O14746-3及O14746-4)。人全长TERT同种型1蛋白(同种型1,Genbank AAD30037,Uniprot O14746-1)的氨基酸序列也提供于本文SEQ ID NO:3(“TERT参考多肽”)中。相较于TERT同种型1(O14746-1),同种型2(O14746-2)具氨基酸764-807的置换(STLTDLQPYM...LNEASSGLFD→LRPVPGDPAG...AGRAAPAFGG)与C端氨基酸808-1132的缺失,同种型3(O14746-3)具氨基酸885-947的缺失,及同种型4(O14746-4)具氨基酸711-722与808-1132的缺失及氨基酸764-807的置换(STLTDLQPYM...LNEASSGLFD→LRPVPGDPAG...AGRAAPAFGG)。
发明内容
在一些方面中,本公开提供衍生自肿瘤相关抗原(TAA)MUC1、CEA及TERT的分离的免疫原性多肽,举例而言,其可用于引起体内(例如在包括人的动物中)的免疫应答,或用为药物组合物(包括疫苗)的组分,用于治疗癌症。
在其他方面中,本公开提供核酸分子(亦称为“抗原构建体”),其每一种编码本公开提供的一或多种免疫原性多肽。在一些实施方案中,本公开提供多抗原核酸构建体,其每一种编码二、三或多种免疫原性TAA多肽。
本公开还提供含有本公开提供的一或多种核酸分子的载体。所述载体可用于克隆或表达由核酸分子编码的免疫原性TAA多肽,或用于递送组合物中的核酸分子(例如疫苗)至宿主细胞或宿主动物或人。在一方面中,本公开还提供含有本公开提供的一或多种核酸分子的载体,其用为疫苗或用于疫苗。
在一些进一步的方面中,本公开提供包含一或多种免疫原性多肽、编码免疫原性TAA多肽的分离的抗原构建体,或含有编码一或多种免疫原性TAA多肽的抗原构建体的载体或质粒的组合物。在一些实施方案中,该组合物是用于在哺乳动物(例如小鼠、狗、猴子或人)中引起针对TAA的免疫应答的免疫原性组合物。在一些实施方案中,该组合物是可用于免疫哺乳动物(如人)、抑制异常细胞增殖、提供针对癌症的进展的保护(用作预防剂)或用于治疗(用作治疗剂)与TAA过表达相关的病症(例如癌症,特别是胰腺癌、卵巢癌、肺癌、直肠结肠癌、胃癌及乳腺癌)的疫苗组合物。
在一些进一步的方面中,本公开提供编码一或多种免疫原性TAA多肽的分离的核酸分子或含有编码如本文公开的一或多种免疫原性TAA多肽的核酸分子的载体(如病毒载体及质粒载体),用于在哺乳动物(例如小鼠、狗、猴子或人)中引起针对TAA的免疫应答的方法。在一些进一步的方面中,本公开提供编码一或多种免疫原性TAA多肽的分离的核酸分子或含有编码如本文公开的一或多种免疫原性TAA多肽的核酸分子的载体(如病毒载体及质粒载体),用于抑制哺乳动物中异常细胞增殖的方法。在一些进一步的方面中,本公开提供编码一或多种免疫原性TAA多肽的分离的核酸分子或含有编码如本文公开的一或多种免疫原性TAA多肽的核酸分子的载体(如病毒载体及质粒载体),用于在哺乳动物中针对癌症的进展提供保护,以治疗癌症,或治疗与TAA过表达相关病症的方法。
在一些进一步的方面中,本公开提供编码一或多种免疫原性TAA多肽的分离的核酸分子或含有编码如本文公开的一或多种免疫原性TAA多肽的核酸分子的载体或质粒,用作抗癌剂。在一些特定方面中,所述癌症为胰腺癌、卵巢癌、肺癌、直肠结肠癌、胃癌或乳腺癌。
在其他方面中,本公开提供使用免疫原性TAA多肽、分离的核酸分子及组合物的方法。在一些实施方案中,本公开提供在哺乳动物(特别是人)中引起针对TAA的免疫应答的方法,包括给所述哺乳动物施用有效量的本发明提供的对标靶TAA具免疫原性的多肽、有效量的编码所述免疫原性多肽的分离的核酸分子或包含所述免疫原性多肽或编码所述免疫原性多肽的分离的核酸分子的组合物。所述多肽或核酸组合物可与一或多种佐剂或免疫调节剂一起使用。
在其他方面中,本公开提供本文公开的用作药剂的免疫原性TAA多肽、分离的核酸分子及组合物。所述多肽或核酸组合物可与一或多种佐剂或免疫调节剂一起使用。
在本发明的一方面中,涵盖下述实施方案,其每一个由编号的项目叙述:
1.一种抗原构建体,其包含如本文公开的编码免疫原性CEA多肽的核苷酸序列。
2.根据第1项目的抗原构建体,进一步包含编码如本文公开的免疫原性MUC1多肽的核苷酸序列。
3.根据第1或2项目的抗原构建体,进一步包含编码如本文公开的免疫原性TERT多肽的核苷酸序列。
4.根据第1项目的抗原构建体,进一步包含编码如本文公开的免疫原性MUC1多肽的核苷酸序列及编码如本文公开的免疫原性TERT多肽的核苷酸序列。
5.根据第2、3或4项目中任一项目的抗原构建体,进一步包含如本文公开的间隔子核苷酸序列。
6.根据第5项目的抗原构建体,其中该间隔子核苷酸序列编码2A肽。
7.根据第5项目的抗原构建体,其中该间隔子核苷酸序列编码选自EMC2A、ERA2A、ERB2A及T2A的2A肽。
8.根据第1至7项目中任一项目的抗原构建体,其中该免疫原性CEA多肽选自:
(1)包含SEQ ID NO:2的氨基酸2-702、SEQ ID NO:2的氨基酸323-702或SEQ IDNO:2的氨基酸323-677或由SEQ ID NO:2的氨基酸2-702、SEQ ID NO:2的氨基酸323-702或SEQ ID NO:2的氨基酸323-677组成的多肽;
(2)包含SEQ ID NO:15的氨基酸序列或SEQ ID NO:15的氨基酸4-704或由SEQ IDNO:15的氨基酸序列或SEQ ID NO:15的氨基酸4-704组成的多肽;
(3)包含SEQ ID NO:17的氨基酸序列或SEQ ID NO:17的氨基酸4-526或由SEQ IDNO:17的氨基酸序列或SEQ ID NO:17的氨基酸4-526组成的多肽;
(4)包含SEQ ID NO:19的氨基酸序列或SEQ ID NO:19的氨基酸4-468或由SEQ IDNO:19的氨基酸序列或SEQ ID NO:19的氨基酸4-468组成的多肽;或
(5)多肽,其是上述(1)至(4)中任一多肽的功能性变体。
9.根据第3至8项目中任一项目的抗原构建体,其中该免疫原性TERT多肽选自:
(1)包含SEQ ID NO:9的氨基酸序列或SEQ ID NO:9的氨基酸2-893的多肽;
(2)包含SEQ ID NO:11的氨基酸序列或SEQ ID NO:11的氨基酸3-791的多肽;
(3)包含SEQ ID NO:13的氨基酸序列或SEQ ID NO:13的氨基酸4-594的多肽;及
(4)多肽,其是上述(1)至(3)中任一多肽的功能性变体。
10.根据第2及4至9项目中任一项目的抗原构建体,其中该免疫原性MUC1多肽选自:
(1)包含SEQ ID NO:5的氨基酸序列或SEQ ID NO:5的氨基酸4-537的多肽;
(2)包含SEQ ID NO:7的氨基酸序列或SEQ ID NO:7的氨基酸4-517的多肽;及
(3)上述(1)或(2)的多肽的功能性变体。
11.根据第1至10项目中任一项目的抗原构建体,其包含编码选自以下的氨基酸序列的核苷酸序列:
(1)SEQ ID NO:31的氨基酸序列或包含SEQ ID NO:31的氨基酸4-1088的氨基酸序列;
(2)SEQ ID NO:33的氨基酸序列或包含SEQ ID NO:33的氨基酸4-1081的氨基酸序列;
(3)SEQ ID NO:35的氨基酸序列或包含SEQ ID NO:35的氨基酸4-1085的氨基酸序列;
(4)SEQ ID NO:37的氨基酸序列或包含SEQ ID NO:37的氨基酸4-1030的氨基酸序列;
(5)SEQ ID NO:39的氨基酸序列或包含SEQ ID NO:39的氨基酸4-1381的氨基酸序列;及
(6)SEQ ID NO:41的氨基酸序列或包含SEQ ID NO:41的氨基酸4-1441的氨基酸序列。
12.根据第1至11项目中任一项目的抗原构建体,其包含选自以下的核苷酸序列:
(1)SEQ ID NO:30的核苷酸序列或包含SEQ ID NO:30的核苷酸10-3264的核苷酸序列;
(2)SEQ ID NO:32的核苷酸序列或包含SEQ ID NO:32的核苷酸10-3243的核苷酸序列;
(3)SEQ ID NO:34的核苷酸序列或包含SEQ ID NO:34的核苷酸10-3255的核苷酸序列;
(4)SEQ ID NO:36的核苷酸序列或包含SEQ ID NO:36的核苷酸10-3090的核苷酸序列;
(5)SEQ ID NO:38的核苷酸序列或包含SEQ ID NO:38的核苷酸10-4143的核苷酸序列;
(6)SEQ ID NO:40的核苷酸序列或包含SEQ ID NO:40的核苷酸10-4323的核苷酸序列;及
(7)核苷酸序列,其是上述(1)至(6)中任一核苷酸序列的简并变体。
13.根据第1至12项目中任一项目的抗原构建体,其包含编码选自以下的氨基酸序列的核苷酸序列:
(1)SEQ ID NO:43的氨基酸序列或包含SEQ ID NO:43的氨基酸4-2003的氨基酸序列;
(2)SEQ ID NO:45的氨基酸序列或包含SEQ ID NO:45的氨基酸4-2001的氨基酸序列;
(3)SEQ ID NO:47的氨基酸序列或包含SEQ ID NO:47的氨基酸4-2008的氨基酸序列;
(4)SEQ ID NO:49的氨基酸序列或包含SEQ ID NO:49的氨基酸4-1996的氨基酸序列;
(5)SEQ ID NO:51的氨基酸序列或包含SEQ ID NO:51的氨基酸4-1943的氨基酸序列;及
(6)SEQ ID NO:53的氨基酸序列或包含SEQ ID NO:53的氨基酸4-1943的氨基酸序列。
14.根据第1至13项目中任一项目的抗原构建体,其包含选自由下列所组成群组的核苷酸序列:
(1)SEQ ID NO:42的核苷酸序列或包含SEQ ID NO:42的核苷酸10-6009的核苷酸序列;
(2)SEQ ID NO:44的核苷酸序列或包含SEQ ID NO:44的核苷酸10-6003的核苷酸序列;
(3)SEQ ID NO:46的核苷酸序列或包含SEQ ID NO:46的核苷酸10-6024的核苷酸序列;
(4)SEQ ID NO:48的核苷酸序列或包含SEQ ID NO:48的核苷酸10-5988的核苷酸序列;
(5)SEQ ID NO:50的核苷酸序列或包含SEQ ID NO:50的核苷酸10-5829的核苷酸序列;
(6)SEQ ID NO:52的核苷酸序列或包含SEQ ID NO:52的核苷酸10-5829的核苷酸序列;及
(7)核苷酸序列,其是上述(1)至(6)中任一核苷酸序列的简并变体。
15.根据第1至14项目中任一项目的抗原构建体,其包含:
(1)SEQ ID NO:87、88、89、90、91及92中任一核苷酸序列;或
(2)SEQ ID NO:87、88、89、90、91及92中任一核苷酸序列的简并变体。
16.一种药物组合物,其包含:(i)根据第1至15项目中任一项目的抗原构建体和(ii)药学上可接受的载剂。
17.根据第16项目的药物组合物,其是疫苗。
18.一种治疗在需要治疗的人中的癌症的方法,该方法包括给人施用有效量的根据第16或17项目的药物组合物。
19.根据第18项目的方法,其中所述癌症过表达选自MUC1、CEA或TERT的一或多种肿瘤相关抗原。
20.根据第18项目的方法,其中所述癌症为胰腺癌、卵巢癌、乳腺癌、胃癌、肺癌或结肠直肠癌。
21.根据第18项目的方法,其中所述癌症为三阴性乳腺癌、雌激素受体阳性乳腺癌或HER2阳性乳腺癌。
22.根据第18项目的方法,该方法进一步包括给患者施用有效量的免疫调节剂。
23.根据第22项目的方法,其中所述免疫调节剂是CTLA-4抑制剂、IDO1抑制剂、PD-1抑制剂或PD-L1抑制剂。
24.根据第18项目的方法,该方法进一步包括给人施用佐剂。
25.一种载体,其包含根据第1至15项目中任一项目的抗原构建体。
26.根据第25项目的载体,其是质粒载体。
27.根据第26项目的载体,其包含SEQ ID NO:57、59、61、63、65、67、69、70、71、72、73及74中任一核苷酸序列。
28.根据第25项目的载体,其是病毒载体。
29.根据第28项目的载体,其包含SEQ ID NO:58、60、62、64、66及68中任一核苷酸序列。
30.(1)根据第1至15项目中任一项目的抗原构建体、(2)根据第16或17项目的药物组合物或(3)根据第25至29项目中任一项目的载体作为药剂的用途。
31.根据第30项目之用途,其中该药剂用于治疗癌症。
32.(1)根据第1至14项目中任一项目的抗原构建体或(2)根据第25至29项目中任一项目的载体在制备用于治疗癌症的药剂的用途。
附图说明
图1.描述携带三抗原构建体的AdC68载体(即,称为载体AdC68Y-1424、AdC68Y-1425、AdC68Y-1426、AdC68Y-1427、AdC68Y-1428及AdC68Y-1429)的结构的图式。从Genbank参考序列AC_000011.1设计E1及E3缺失的AdC68载体骨架。转基因开放阅读框插入E1区域,在CMV立早增强子/启动子及SV40多腺苷酸终止子之间。tet操緃子序列插入启动子之后。
发明详述
A.定义
术语“佐剂”是指,在施用于宿主哺乳动物(例如人)时,能在所述宿主中增强、加速或延长由疫苗或免疫原引起抗原特异性免疫应答的物质。
术语“激动剂”是指促进(诱导、引起、增强或增加)另一分子(例如受体)的活性的物质。术语激动剂涵盖结合受体的物质及促进受体功能而不与其结合的物质。
术语“拮抗剂”或“抑制剂”是指部分或完全封闭、抑制或中和另一分子或受体的生物学活性的物质。
术语“抗原”是指在引入宿主哺乳动物(直接或表达,例如,如在DNA疫苗中)时,能被宿主哺乳动物的免疫系统识别,例如与抗体或T细胞上的抗原受体结合的物质。抗原可为蛋白质或蛋白片段、碳水化合物、神经节苷脂、半抗原或核酸。当物质能与免疫系统的抗原识别分子(例如抗体或T细胞抗原受体)特异性相互作用时,则称该物质是“抗原性的”。术语“肿瘤相关抗原”或“TAA”是指肿瘤细胞特异性表达,或相较于相同组织类型的非肿瘤细胞,肿瘤细胞以较高频率或密度表达的抗原。TAA可以是宿主不正常表达的分子,或者突变、截短、错误折迭或者异常表达的宿主正常表达的分子。TAA的实例包括CEA、TERT及MUC1。
术语“共同施用”是指给相同个体施用两种或多种物质作为治疗方案的部分。所述两种或多种物质可以涵盖于单一配制物中从而同时施用。替代地,所述两种或多种物质可在不同的物理配制物中且分别施用于个体(依序或同时)。“同时地施用”或“同时施用”是指第一物质的施用与第二物质的施用在时间上相互重迭,而“依序地施用”或“依序施用”意指第一物质的施用与第二物质的施用在时间上不相互重迭。
术语“细胞溶质的”或“细胞质的”是指在编码特定多肽的核苷酸序列经宿主细胞表达后,预期表达的多肽保留在宿主细胞内。
术语“简并变体”是指具有碱基取代但编码相同多肽或氨基酸序列的核酸序列。
术语“有效量”是指给哺乳动物施用的足以在哺乳动物中引起期望作用的量。
术语氨基酸序列的或免疫原性TAA多肽(统称为“参考多肽”)“功能性变体”是指包含90%至100%的参考多肽氨基酸数量的氨基酸序列或多肽,参考多肽氨基酸序列具有低在100%但高于95%的相同性,并拥有与参考多肽相同或类似的免疫原性特性。
术语“相同”是指分别有确切相同的核苷酸或氨基酸序列的两种或多种核酸或两种或多种多肽。术语“相同性百分比”描述两种或多种核酸或多肽间相似性的水平。当通过生物信息学软件比对两个序列时,将序列间确切的核苷酸/氨基酸配对数乘以100,并除以比对区的长度(包括空隙),计算“相同性百分比”。举例而言,比对时展现10个误配的两条100个氨基酸长的多肽将为90%相同。
术语“免疫效应细胞增强剂”或“IEC增强剂”是指能增加和/或增强哺乳动物的一或多种类型的免疫效应细胞的数量、质量和/或功能的物质。免疫效应细胞的实例包括细胞溶解性树突细胞、CD8 T细胞、CD4 T细胞、NK细胞及B细胞。
术语“免疫调节剂”是指能改变(例如抑制、减少、增加、增强或刺激)哺乳动物的先天性、体液或细胞免疫系统的任何组分的运转或功能的物质。因此,“免疫调节剂”涵盖如本文所定义的“免疫效应细胞增强子”以及影响哺乳动物免疫系统的任何其他组分的物质。
术语“免疫应答”是指宿主哺乳动物的适应性免疫系统对特定物质(例如抗原或免疫原)的任何可检测的应答,包括细胞介导的免疫应答(如由T细胞介导的应答,例如抗原特异性T细胞及免疫系统的非特异性细胞)与体液免疫应答(如由B细胞介导的应答,例如抗体的产生及分泌至血浆、淋巴和/或组织液中)。免疫应答的实例包括细胞因子(如,Th1、Th2或Th17型细胞因子)或趋化因子释放的改变(如,增加)、巨噬细胞活化、树突细胞活化、T细胞(如,CD4+或CD8+T细胞)活化、诱导B细胞应答(如,抗体产生)、诱导细胞毒性T淋巴细胞(CTL)太多及免疫系统细胞(如,T细胞与B细胞)的扩增(如,细胞群的生长)。
术语“免疫原性的”或“免疫原性”是指,在宿主哺乳动物中,无论单独或连接载剂,佐剂存在或不存在下,物质在施用于宿主哺乳动物(例如人)后,导致、引起、刺激或诱导免疫应答或改进、增强、增加或延长先前存在的免疫应答的能力。该等物质称为“免疫原”。
术语“免疫原性组合物”是指有免疫原性的组合物。
术语“免疫原性MUC1多肽”是指针对人天然MUC1蛋白或针对表达人天然MUC1蛋白的细胞具免疫原性的多肽。该多肽可具有与人天然MUC1蛋白相同的氨基酸序列或相较于人天然MUC1蛋白的氨基酸序列显示一或多个突变。
术语“免疫原性CEA多肽”是指针对人天然CEA蛋白或针对表达人天然CEA蛋白的细胞具免疫原性且相较于人天然CEA蛋白的氨基酸序列显示一或多个突变(例如缺失一或多种氨基酸)的多肽。
术语“免疫原性TERT多肽”是指针对人天然TERT蛋白或针对表达人天然TERT蛋白的细胞具免疫原性的多肽。该多肽可具有与人天然TERT蛋白相同的氨基酸序列或相较于人天然TERT蛋白的氨基酸序列显示一或多个突变。
术语“免疫原性TAA多肽”是指各如上文定义的“免疫原性CEA多肽”、“免疫原性MUC1多肽”或“免疫原性TERT多肽”。
术语“免疫抑制细胞抑制剂”或“ISC抑制剂”是指能减少和/或抑制哺乳动物免疫抑制细胞的数量和/或功能的物质。免疫抑制细胞的实例包括调控性T细胞(“Treg”)、骨髓衍生性抑制细胞及肿瘤相关的巨噬细胞。
术语“哺乳动物”是指哺乳动物纲的任何动物物种。哺乳动物的实例包括:人;非人灵长类动物例如猴子;实验动物例如大鼠、小鼠、天竺鼠;家畜例如,猫、狗、兔、牛、绵羊、山羊、马及猪;与圈养野生动物例如狮、虎、象等。
术语“膜结合”是指编码特定多肽的核苷酸序列由宿主细胞表达后,所表达的多肽与细胞膜结合、附着或关联。
术语“赘生性病症”是指其中细胞以异常高且不受控制的速率增殖的状况,该速率超过周围正常组织且与周围正常组织不协调。其通常导致实体病灶或肿块,称为“肿瘤”。此术语涵盖良性与恶性赘生性病症。术语“恶性赘生性病症”,在本公开中与可与“癌症”互换使用,是指特征在在肿瘤细胞扩散到体内其他位置的能力(称为“转移”)的赘生性病症。“良性赘生性病症”是指其中肿瘤细胞缺乏转移能力的赘生性病症。
术语“突变”是指相较在参考蛋白质或多肽的氨基酸序列,蛋白质或多肽的氨基酸序列中氨基酸残基的缺失、添加或取代。
术语“药物组合物”是指适于给个体(例如人患者)施用以引起期望的生理、药理或治疗效果的固态或液态组合物。除含有一或多种活性组分外,药物组合物可含有一或多种药学上可接受的赋形剂。
术语“药学上可接受的赋形剂”是指药物组合物(例如疫苗)中,除了活性组分(如,抗原、抗原编码核酸、免疫调节剂或佐剂)以外的物质,其与活性组分相容且不对所施用的个体引起显著的不利作用。
在药物组合物上下文中使用的术语“赋形剂”是指通常不具医药性质,为了简化药物产品的制造和/或促进活性药物物质的稳定、递送与吸收目的而包含在组合物中的物质。术语“药学上可接受的赋形剂”是指药物组合物(例如疫苗组合物)中的赋形剂,其与组合物中的活性组分(如,抗原或免疫原、编码抗原的核酸、免疫调节剂或佐剂)相容且不对所施用的个体引起显著的不利作用。
术语“肽”、“多肽”及“蛋白质”在本文中可互换使用,是指通过肽键连接在一起的聚合形式的氨基酸。它们可以是任何长度且可包括编码的与非编码的氨基酸,经化学或生物化学修饰或衍生化的氨基酸。
术语“预防(preventing或prevent)”是指(a)阻止病症发生,(b)延迟病症的发病或病症的症状的发病或(c)最小化病症的发生率或影响。
术语“分泌的”在多肽的上下文中是指编码多肽的核苷酸序列通过宿主细胞表达后,所表达的多肽分泌到宿主细胞外。
用于描述免疫调节剂(例如蛋白激酶抑制剂)的量时,术语“次最佳剂量”是指免疫调节剂的剂量低于单独给患者施用该免疫调节剂时对所治疗疾病产生期望治疗效果所需的最小量。
术语“治疗”是指消除病症、降低病症的严重性或减少病症症状的严重性或发生频率。
术语“疫苗”是指施用于哺乳动物(例如人)以针对一或多种特定抗原引起保护性免疫应答的免疫原性组合物。疫苗的主要活性组分为免疫原。含免疫原性多肽作为免疫原的疫苗亦称为“肽疫苗”。不含免疫原性多肽而含编码免疫原性多肽的核酸分子的疫苗称为“DNA疫苗”或“RNA疫苗”(取决于可能的情况)。递送DNA或RNA疫苗进入宿主细胞后,宿主细胞会表达由核酸分子编码的免疫原性多肽,产生保护性免疫应答。DNA或RNA疫苗中的核酸分子可为裸露的核酸、质粒或病毒载体的形式,或任何其他适于递送核酸的形式。
术语“载体”是指能将外来核酸分子转运或转移到宿主细胞中的核酸分子或经改良的微生物。所述外来核酸分子称为“插入序列”或“转基因”。载体通常由插入序列与用作载体骨架的较大序列构成。根据载体的结构或起源,载体的主要类型包括质粒载体、黏粒载体、噬菌体载体(例如λ噬菌体)、病毒载体(例如腺病毒载体)、人工染色体及细菌载体。
B.免疫原性TAA多肽
在一些方面中,本公开提供分离的免疫原性TAA多肽,其可用于,例如,体内(例如在包括人的动物中)或体外引起免疫应答、活化效应T细胞或产生对TAA具特异性的抗体或用作用于治疗癌症如胰腺癌、肺癌、结肠直肠癌、胃癌或乳腺癌的药物组合物(包括疫苗)的组分。
这些免疫原性TAA多肽可根据本公开通过本领域已知的方法制备。所述多肽引起免疫应答的能力可在体外分析或体内分析中测量。用于测定多肽或DNA构建体引起免疫应答能力的体外分析是本领域已知的。所述体外分析的实例是测量该多肽或表达多肽的核酸刺激T细胞应答的能力,如美国专利7,387,882(其公开并入本申请)中所描述。该分析方法包括下述步骤:(1)使培养物中的抗原呈递细胞与抗原接触,从而使该抗原可被抗原呈递细胞吸收并加工,产生一或多种经加工的抗原;(2)在足以使T细胞对一或多种经处理的抗原应答的条件下使抗原呈递细胞与T细胞接触;(3)测定T细胞是否对所述一或多种经处理的抗原应答。所用T细胞可为CD8+T细胞或CD4+T细胞。T细胞应答可利用测量一或多种细胞因子(例如干扰素-γ及介白素-2)的释放,与抗原呈递细胞(肿瘤细胞)的溶解测定。B细胞应答可利用测量抗体的产生测定。
B-1.免疫原性MUC1多肽
在一方面中,本公开提供通过在人天然MUC1蛋白引入一或多个突变的衍生自人天然MUC1的免疫原性MUC1多肽。所述突变的实例包括缺失MUC1蛋白VNTR区域中20个氨基酸的串联重复序列的一些,而非全部;缺失全部或部分信号肽序列及缺失在MUC1同种型中发现的非一致性氨基酸序列的氨基酸。因此,在一些实施方案中,免疫原性MUC1多肽包含(1)人MUC1蛋白的20个氨基酸的串联重复序列(3至30个)的氨基酸序列及(2)VNTR区域侧翼的人MUC1蛋白的氨基酸序列。在一些特定实施方案中,免疫原性MUC1多肽包含(1)人MUC1的5至25个串联重复序列的氨基酸序列及(2)VNTR区域侧翼的人MUC1蛋白的氨基酸序列。在一些实施方案中,免疫原性MUC1多肽由(1)人MUC1蛋白20个氨基酸的串联重复序列(3至30个)的氨基酸序列及(2)VNTR区域侧翼的人MUC1蛋白的氨基酸序列组成。在一些特定实施方案中,免疫原性MUC1多肽由(1)人MUC1的5至25个串联重复序列的氨基酸序列及(2)VNTR区域侧翼的人MUC1蛋白的氨基酸序列组成。在一些进一步实施方案中,免疫原性MUC1多肽是胞质形式(或“cMUC1”)。术语“胞质形式”是指缺少人天然MUC1蛋白的全部或部分分泌序列(氨基酸1-23;也称为“信号肽序列”)的免疫原性MUC1多肽。缺失分泌序列的氨基酸被预期在细胞中表达时阻止该多肽进入分泌途径。在一些其他实施方案中,免疫原性MUC1多肽是膜结合形式。免疫原性MUC1多肽可由本领域已知或未来发现的任何人MUC1同种型氨基酸序列衍生、构建或制备,包括,例如,Uniprot同种型1、2、3、4、5、6、Y、8、9、F、Y-LSP、S2、M6、ZD、T10、E2及J13(分别为Uniprot P15941-1至P15941-17)。在一些实施方案中,免疫原性MUC1多肽包含为人MUC1同种型1一部分的氨基酸序列,其中人MUC1同种型1的氨基酸序列示于SEQ ID NO:1。在一些实施方案中,免疫原性MUC1多肽由人MUC1同种型1一部分的氨基酸序列组成,其中人MUC1同种型1的氨基酸序列示于SEQ ID NO:1。在特定实施方案中,免疫原性MUC1多肽包含SEQ ID NO:1氨基酸序列的氨基酸22-225及946-1255。在一些其他特定实施方案中,本公开提供选自以下的免疫原性MUC1多肽:
(1)包含SEQ ID NO:5的氨基酸序列或由SEQ ID NO:5的氨基酸序列组成的多肽(质粒1027多肽);
(2)包含SEQ ID NO:5的氨基酸4-537或由SEQ ID NO:5的氨基酸4-537组成的多肽;
(3)包含SEQ ID NO:5的氨基酸24-537或由SEQ ID NO:5的氨基酸24-537组成的多肽;
(4)包含SEQ ID NO:7的氨基酸序列或由SEQ ID NO:7的氨基酸序列组成的多肽(质粒1197多肽);
(5)包含SEQ ID NO:7的氨基酸4-517或由SEQ ID NO:7的氨基酸4-517组成的多肽;
(6)包含SEQ ID NO:7的氨基酸4-517或由SEQ ID NO:7的氨基酸4-517组成的多肽,其中在SEQ ID NO:7中,位置513的氨基酸为T;及
(7)上述(1)至(6)中任一多肽的功能性变体。
在一些特定实施方案中,免疫原性MUC1多肽包含SEQ ID NO:5(质粒1027多肽)或SEQ ID NO:7(质粒1197多肽)的氨基酸序列。在一些特定实施方案中,免疫原性MUC1多肽由SEQ ID NO:5(质粒1027多肽)或SEQ ID NO:7(质粒1197多肽)的氨基酸序列组成。
在一方面中,本发明提供本文公开的任一免疫原性MUC1多肽的功能性变体。
B-2.免疫原性TERT多肽
在另一方面中,本公开提供通过缺失TERT蛋白的多达600个N端氨基酸而衍生自人TERT蛋白的免疫原性TERT多肽。因此,免疫原性TERT多肽可包含从任何人TERT蛋白同种型位置601开始的C端氨基酸序列。在一些实施方案中,免疫原性TERT多肽包含示于SEQ IDNO:3的TERT同种型1的氨基酸序列,其中缺少从TERT同种型1的氨基酸序列的N端(氨基端)起的多达约600个氨基酸。免疫原性TERT多肽中可缺少从TERT同种型1N端计至多600个的任何数量的氨基酸。举例而言,SEQ ID NO:3的TERT同种型1位置1至位置50、100、50、200、250、300、350、400、450、500、550或600的N端氨基酸可不存在免疫原性TERT多肽中。因此,免疫原性TERT多肽可包含SEQ ID NO:3的氨基酸51-1132、101-1132、151-1132、201-1132、251-1132、301-1132、351-1132、401-1132、451-1132、501-1132或551-1132。在一实施方案中,免疫原性TERT多肽包含SEQ ID NO:3的氨基酸601-1132的氨基酸序列。在另一实施方案中,本公开提供包含SEQ ID NO:3的氨基酸241-1132的氨基酸序列的免疫原性TERT多肽。
免疫原性TERT多肽可由SEQ ID NO:3的氨基酸51-1132、101-1132、151-1132、201-1132、251-1132、301-1132、351-1132、401-1132、451-1132、501-1132或551-1132组成。在一实施方案中,免疫原性TERT多肽由SEQ ID NO:3的氨基酸序列的氨基酸601-1132组成。在另一实施方案中,本公开提供由SEQ ID NO:3的氨基酸序列的氨基酸241-1132组成的免疫原性TERT多肽。
免疫原性TERT多肽亦可由其他TERT同种型构建。当免疫原性TERT多肽构建自C端截短的TERT同种型(例如同种型2、3或4),优选从该蛋白质N端缺失较少氨基酸。
在一些进一步实施方案中,免疫原性TERT多肽进一步包含使TERT催化结构域失活的一或多种氨基酸突变。所述氨基酸突变的实例包括SEQ ID NO:3位置712以丙氨酸取代天冬氨酸(D712A)及SEQ ID NO:3位置713以异亮氨酸取代缬氨酸(V713I)。在一些实施方案中,免疫原性TERT多肽包含突变D712A及V713I。在一实施方案中,所述突变包括SEQ ID NO:3位置712天冬氨酸的取代和/或SEQ ID NO:3位置713的缬氨酸取代(V713I),其中所述突变使TERT催化结构域失活。在另一实施方案中,该突变由SEQ ID NO:3位置712天冬氨酸的取代和/或SEQ ID NO:3位置713的缬氨酸取代(V713I)组成,其中所述突变使TERT催化结构域失活。在另一实施方案中,该突变由SEQ ID NO:3位置712以丙氨酸取代天冬氨酸(D712A)和/或SEQ ID NO:3位置713以异亮氨酸取代缬氨酸(V713I)组成。
在一些特定实施方案中,本公开提供选自以下的免疫原性TERT多肽:
(1)包含SEQ ID NO:9的氨基酸序列(质粒1112多肽)或SEQ ID NO:9的氨基酸2-893或由SEQ ID NO:9的氨基酸序列(质粒1112多肽)或SEQ ID NO:9的氨基酸2-893组成的多肽;
(2)包含SEQ ID NO:11的氨基酸序列(质粒1326多肽)或SEQ ID NO:11的氨基酸3-791或由SEQ ID NO:11的氨基酸序列(质粒1326多肽)或SEQ ID NO:11的氨基酸3-791组成的多肽;
(3)包含SEQ ID NO:13的氨基酸序列(质粒1330多肽)或SEQ ID NO:13的氨基酸4-594或由SEQ ID NO:13的氨基酸序列(质粒1330多肽)或SEQ ID NO:13的氨基酸4-594组成的多肽;或
(4)多肽,其为上述(1)至(3)中任一多肽的功能性变体。
在一方面中,本发明提供本文公开的任一免疫原性TERT多肽的功能性变体。
B-3.免疫原性CEA多肽
在另一方面中,本公开提供通过在人天然CEA前体蛋白引入一或多种突变的衍生自人天然CEA的分离的免疫原性CEA多肽。所引入的突变实例包括缺失一、二、三、四或五个C2样结构域;缺失全部或部分信号肽序列及缺失该前肽的一些或所有氨基酸。因此,在一些实施方案中,本公开提供的免疫性CEA多肽包含(1)N结构域的氨基酸序列及(2)人CEA蛋白的C2样结构域(1至5个)的氨基酸序列。在一些特定实施方案中,免疫原性CEA多肽包含(1)至少四个(例如A2、B2、A3及B3)C2样结构域的氨基酸序列与(2)N结构域的氨基酸序列。在一些进一步实施方案中,免疫原性CEA多肽为胞质形式(或“cCEA”)。术语“胞质形式”是指缺少人天然CEA前体蛋白的全部或部分信号肽序列(氨基酸1-34)的免疫原性CEA多肽。缺失信号序列的氨基酸被预期在细胞中表达时阻止所述多肽进入分泌途径。在一些其他实施方案中,免疫原性CEA多肽为膜结合形式(或“mCEA”)。免疫原性mCEA多肽包括信号肽的氨基酸且,被宿主细胞表达后,仍保持与宿主细胞的膜结合或者相关联。
本公开提供的免疫原性CEA多肽可由本领域已知或未来发现的任何人CEA同种型氨基酸序列衍生、构建或制备。在一些实施方案中,免疫原性CEA多肽包含具SEQ ID NO:2的氨基酸序列的人CEA同种型1前体蛋白一部分的氨基酸序列。
在一些特定实施方案中,本公开提供下述任一免疫原性CEA多肽:
(1)包含SEQ ID NO:2的氨基酸2-702、SEQ ID NO:2的氨基酸323-702或SEQ IDNO:2的氨基酸323-677的多肽;
(2)由SEQ ID NO:2的氨基酸2-702、SEQ ID NO:2的氨基酸323-702或SEQ ID NO:2的氨基酸323-677组成的多肽;
(3)包含SEQ ID NO:15的氨基酸(由质粒1361编码的氨基酸序列)或SEQ ID NO:15的氨基酸4-704的多肽;
(4)由SEQ ID NO:15的氨基酸(由质粒1361编码的氨基酸序列)或SEQ ID NO:15的氨基酸4-704组成的多肽;
(5)包含SEQ ID NO:17的氨基酸序列(由质粒1386编码的氨基酸序列)或SEQ IDNO:17的氨基酸4-526的多肽;
(6)由SEQ ID NO:17的氨基酸序列(由质粒1386编码的氨基酸序列)或SEQ ID NO:17的氨基酸4-526组成的多肽;
(7)包含SEQ ID NO:19的氨基酸序列(由质粒1390编码的氨基酸序列)或SEQ IDNO:19的氨基酸4-468的多肽;
(8)由SEQ ID NO:19的氨基酸序列(由质粒1390编码的氨基酸序列)或SEQ ID NO:19的氨基酸4-468组成的多肽;或
(9)多肽,其为上述(1)至(8)中任一多肽功能性变体。
在一方面中,本发明提供本文公开的任一免疫原性TERT多肽的功能性变体。
C.编码一或多种免疫原性TAA多肽的抗原构建体
在一些方面中,本公开提供编码一、二、三或多种不同免疫原性TAA多肽的分离的核酸分子。所述核酸分子在本公开中亦称为“抗原构建体”。只编码一种免疫原性TAA多肽的核酸分子在本文中亦称为“单抗原构建体”,而编码一种以上免疫原性TAA多肽的核酸分子亦称为“多抗原构建体”。编码两种不同免疫原性TAA多肽的核酸分子亦称为“双抗原构建体”,而编码三种不同免疫原性TAA多肽的核酸分子亦称为“三抗原构建体”。所述核酸分子可为脱氧核糖核酸(DNA)或核糖核酸(RNA)。因此,所述核酸分子可包含本文公开的核苷酸序列,其中胸腺嘧啶(T)亦可为尿嘧啶(U),其反映DNA及RNA化学结构间的差异。关于对应在本公开中DNA核苷酸序列的RNA核苷酸序列,术语“对应”是指RNA的核苷酸序列除了DNA核苷酸序列中的胸腺嘧啶核苷(T)被RNA核苷酸序列中的尿嘧啶(U)替换外,与DNA的参考核苷酸序列相同。核酸分子可以是经修饰形式、单链或双链形式,或线性或环状形式。
抗原构建体,包括DNA及RNA构建体二者,可根据本公开使用本领域已知的方法制备。在下文进一步叙述制造单抗原构建体及多抗原构建体的方法。另外,已确立,将mRNA注射入宿主细胞中导致所编码的蛋白质表达及免疫应答。通过使用本领域已知的各种元件/系统(例如UTR's、PolyA、加帽系统及密码子优化),可稳定地产生体外转录的mRNA且可有效地翻译所编码的蛋白质。另外,溶酶体或核内体靶向信号与mRNA编码的多肽融合可增强T细胞免疫应答。mRNA可未经配制或通过EP或配制在脂质或其他赋形剂中递送。
C-1CEA单抗原构建体
在一些实施方案中,本公开提供编码上文叙述的任一免疫原性CEA多肽的抗原构建体。
在一些特定实施方案中,所述抗原构建体编码选自以下的免疫原性CEA多肽:
(1)包含SEQ ID NO:2的氨基酸2-702、SEQ ID NO:2的氨基酸323-702或SEQ IDNO:2的氨基酸323-677的多肽;
(2)包含SEQ ID NO:15的氨基酸(由质粒1361编码的氨基酸序列)或SEQ ID NO:15的氨基酸4-704的多肽;
(3)包含SEQ ID NO:17的氨基酸序列(由质粒1386编码的氨基酸序列)或SEQ IDNO:17的氨基酸4-526的多肽;
(4)包含SEQ ID NO:19的序列(由质粒1390编码的氨基酸序列)或SEQ ID NO:19的氨基酸4-468的多肽;或
(5)多肽,其为上述(1)至(4)的任一多肽的功能性变体。
在一些特定实施方案中,抗原构建体编码选自以下的免疫原性CEA多肽:
(1)由SEQ ID NO:2的氨基酸2-702、SEQ ID NO:2的氨基酸323-702或SEQ ID NO:2的氨基酸323-677组成的多肽;
(2)由SEQ ID NO:15的氨基酸(由质粒1361编码的氨基酸序列)或SEQ ID NO:15的氨基酸4-704组成的多肽;
(3)由SEQ ID NO:17的氨基酸序列(由质粒1386编码的氨基酸序列)或SEQ ID NO:17的氨基酸4-526组成的多肽;
(4)由这些SEQ ID NO:19的序列(由质粒1390编码的氨基酸序列)或SEQ ID NO:19的氨基酸4-468组成的多肽;或
(5)多肽,其为上述(1)至(4)的任一多肽的功能性变体。
在一些特定实施方案中,本公开提供抗原构建体,其为DNA且包含选自以下的核苷酸序列:
(1)SEQ ID NO:14的核苷酸序列(质粒1361开放阅读框)或包含SEQ ID NO:14的核苷酸10-2112的核苷酸序列;
(2)SEQ ID NO:16的核苷酸序列(质粒1386开放阅读框)或包含SEQ ID NO:16的核苷酸10-1578的核苷酸序列;
(3)SEQ ID NO:18的核苷酸序列(质粒1390开放阅读框)或包含SEQ ID NO:18的核苷酸10-1404的核苷酸序列;及
(4)核苷酸序列,其为(1)至(3)的核苷酸序列的简并变体。
在一些其他特定实施方案中,本公开提供抗原构建体,其为DNA且由选自以下的核苷酸序列组成:
(1)SEQ ID NO:14的核苷酸序列(质粒1361开放阅读框)或由SEQ ID NO:14的核苷酸10-2112组成的核苷酸序列;
(2)SEQ ID NO:16的核苷酸序列(质粒1386开放阅读框)或由SEQ ID NO:16的核苷酸10-1578组成的核苷酸序列;
(3)SEQ ID NO:18的核苷酸序列(质粒1390开放阅读框)或由SEQ ID NO:18的核苷酸10-1404组成的核苷酸序列;及
(4)核苷酸序列,其为(1)至(3)的核苷酸序列的简并变体的。在一些其他特定实施方案中,本公开提供抗原构建体,其为RNA且包含对应于选自以下的核苷酸序列的核苷酸序列:
(1)SEQ ID NO:14的核苷酸序列(质粒1361开放阅读框)或包含SEQ ID NO:14的核苷酸10-2112的核苷酸序列;
(2)SEQ ID NO:16的核苷酸序列(质粒1386开放阅读框)或包含SEQ ID NO:16的核苷酸10-1578的核苷酸序列;
(3)SEQ ID NO:18的核苷酸序列(质粒1390开放阅读框)或包含SEQ ID NO:18的核苷酸10-1404的核苷酸序列;及
(4)核苷酸序列,其为(1)至(3)的核苷酸序列的简并变体。
C-2.多抗原构建体
在另一方面中,本公开提供各编码二、三或多种不同免疫原性TAA多肽的抗原构建体。
本领域已知构建用于从单一核酸共表达两种或多种多肽的载体(本领域也称为“多顺反子载体”)的方法及技术。本公开提供的多抗原构建体可根据本公开使用所述技术制备。举例而言,多抗原构建体可利用并入多个独立启动子至单一质粒中构建(Huang,Y.,Z.Chen,et al.(2008).“Design,construction,and characterization of a dual-promoter multigenic DNA vaccine directed against an HIV-1subtype C/B'recombinant.”J Acquir Immune Defic Syndr 47(4):403-411;Xu,K.,Z.Y.Ling,et al.(2011).“Broad humoral and cellular immunity elicited by a bivalent DNAvaccine encoding HA and NP genes from an H5N1 virus.”Viral Immunol 24(1):45-56)。质粒可经工程化以携带多个表达盒,各个表达盒由a)真核启动子,用于启动RNA聚合酶依赖性转录,有或无增强子元件,b)标靶抗原编码基因及c)转录终止子序列组成。递送质粒至经转染的细胞核后,将从各启动子开始转录,导致产生各自编码一种标靶抗原的单独mRNA。所述mRNA将独立翻译,从而产生所需抗原。
本公开提供的多抗原构建体亦可通过使用病毒2A肽构建(Szymczak,A.L.andD.A.Vignali(2005).“Development of 2A peptide-based strategies in the designof multicistronic vectors”,Expert Opin Biol Ther 5(5):627-638;de Felipe,P.,G.A.Luke,et al.(2006).“E unum pluribus:multiple proteins from a self-processing polyprotein”,Trends Biotechnol 24(2):68-75;Luke,G.A.,P.de Felipe,et al.(2008).“Occurrence,function and evolutionary origins of'2A-like'sequences in virus genomes”,J Gen Virol 89(Pt 4):1036-1042;Ibrahimi,A.,G.Vande Velde,et al.(2009).“Highly efficient multicistronic lentiviralvectors with peptide 2A sequences”,Hum Gene Ther 20(8):845-860;Kim,J.H.,S.R.Lee,et al.(2011).“High cleavage efficiency of a 2A peptide derived fromporcine teschovirus-1in human cell lines,zebrafish and mice”,PLoS One6(4):e18556)。这些肽,亦称为裂解盒或CHYSEL(顺式作用水解酶元件),约20个氨基酸长,具高度保守的羧基端D-V/I-EXNPGP基序。这些肽在自然界中罕见,最常见于例如口蹄疫病毒(FMDV)、马鼻炎A病毒(ERAV)、马鼻炎B病毒(ERBV)、脑心肌炎病毒(EMCV)、猪捷申病毒(PTV)及Thosea asigna病毒(TAV)的病毒(Luke,G.A.,P.de Felipe,et al.(2008).“Occurrence,function and evolutionary origins of'2A-like'sequences in virusgenomes”,J Gen Virol 89(Pt 4):1036-1042)。这些肽的一些氨基酸序列提供于表17。使用基于2A的多抗原表达策略,将编码多种标靶抗原的基因在单一开放阅读框(ORF)中连接在一起,由编码病毒2A肽的序列隔开。可将整个开放阅读框转入具单一启动子及终止子的载体中。将构建体递送至宿主细胞后,编码多种抗原的mRNA将被转录且翻译为单一多蛋白。在翻译2A肽的过程中,核糖体跳过C端甘氨酸及脯氨酸之间的键。核糖体跳跃扮演类似共翻译自动催化“裂解”的作用,其将2A肽上游肽序列从下游肽序列释出。在两种蛋白抗原间并入2A肽可导致在上游多肽C端添加~20个氨基酸及在下游蛋白质N端添加1个氨基酸(脯氨酸)。在此方法的改编中,蛋白酶裂解位点可并入2A盒的N端,使得普遍存在的蛋白酶从上游蛋白质裂解所述盒(Fang,J.,S.Yi,et al.(2007).“An antibody delivery system forregulated expression of therapeutic levels of monoclonal antibodies in vivo”,Mol Ther 15(6):1153-1159)。可用于构建本公开多抗原构建体的特定2A-肽序列的实例包括公开于Andrea L.Szymczak&Darrio AA Vignali:Development of 2A peptide-basedstrategies in the design of multicistronic vectors.Expert Opinion Biol.Ther.(2005)5(5)627-638以及国际专利申请WO2015/063674中的那些,其公开的内容通过参考并入本文。
可用于构建多抗原构建体的另一方法涉及使用内部核糖体进入位点或IRES。内部核糖体进入位点是在特定RNA分子5'非翻译区发现的RNA元件(Bonnal,S.,C.Boutonnet,etal.(2003).“IRESdb:the Internal Ribosome Entry Site database”,Nucleic AcidsRes 31(1):427-428)。其吸引真核核糖体至RNA以促进下游开放阅读框的翻译。不同于正常细胞的7-甲基鸟苷帽依赖性翻译,IRES介导的翻译可在远在RNA分子内的AUG密码子处启动。可开发此高效率方法以在多顺反子载体中使用(Bochkov,Y.A.and A.C.Palmenberg(2006).“Translational efficiency of EMCV IRES in bicistronic vectors isdependent upon IRES sequence and gene location”,Biotechniques41(3):283-284,286,288)。通常,将两个转基因插入启动子与转录终止子间的载体中,作为由IRES分开的两个独立开放阅读框。在递送构建体至宿主细胞后,将转录编码两个转基因的单一长转录本。第一个ORF将以传统的帽依赖性方式翻译,在IRES上游的终止密码子处停止。第二个ORF将使用IRES以非帽依赖性方式翻译。以此方式,可从具单一表达盒的载体转录单一mRNA,产生两个独立蛋白质。IRES序列的实例包括脊髓灰质炎病毒(PV)IRES、脑心肌炎病毒(EMCV)IRES、口蹄疫病毒(FMDV)IRES、甲型肝炎病毒IRES、乙型肝炎病毒IRES、卡波西氏(Kaposi's)肉瘤相关疱疹病毒(KSHV)IRES及典型猪瘟病毒IRES。EMCV IRES的核苷酸序列公开于WO2013/165754(图3)中并示在本公开的SEQ ID NO:93中。最小的EMCV IRES元件排除SEQID NO:93核苷酸序列3'端的15个核苷酸(其代表EMCV L蛋白最前面5个密码子)。
在本公开中,插入核酸分子开放阅读框(ORF)中两个编码序列或转基因间并起作用以容许源自核酸分子的二独立基因产物共表达或翻译的核苷酸序列系称为“间隔子核苷酸序列”。可用于多抗原构建体的特定间隔子核苷酸序列的实例包括真核启动子、编码2A肽的核苷酸序列及内部核糖体进入位点(IRES)序列。特定2A肽的实例包括急性蜜蜂麻痹病毒(ABP2A)、蟋蟀麻痹病毒(CrP2A)、马鼻炎A病毒(ERA2A)、马鼻炎B病毒(ERB2A)、脑心肌炎病毒(EMC2A)、口蹄疫病毒(FMD2A或F2A)、人轮状病毒(HT2A)、感染性蚕软腐病病毒病毒(IF2A)、猪捷申病毒(PT2A或P2A)、猪轮状病毒(PR2A)及Thosea asigna病毒(T2A、TA2A或TAV2A)。
在一些方面中,本公开提供抗原构建体,其包含(i)编码免疫原性CEA多肽的至少一种编码核苷酸序列及(ii)编码一或多种其他免疫原性TAA多肽(例如免疫原性TERT多肽、免疫原性MUC1多肽、免疫原性MSLN多肽、免疫原性PSA多肽、免疫原性PSMA多肽或免疫原性PSCA多肽)的一或多种核苷酸序列。
在一些实施方案中,本公开提供抗原构建体,其包含(i)编码免疫原性CEA多肽的至少一种编码核苷酸序列及(ii)编码免疫原性TERT多肽或者免疫原性MUC1多肽的至少一种编码核苷酸序列。编码免疫原性CEA多肽的核苷酸序列可以在另一编码核苷酸序列的上游或下游。所述构建体可进一步在所述编码核苷酸序列之间包含间隔子核苷酸序列。双抗原构建体的结构示于式(I)及式(II):
TAA-SPACER-CEA (I)
CEA-SPACER-TAA (II)
其中在式(I)及(II)的每一个中:(i)CEA表示编码免疫原性CEA多肽的核苷酸序列;(ii)TAA表示编码免疫原性MUC1多肽或者免疫原性TERT多肽的核苷酸序列及(iii)SPACER为间隔子核苷酸序列且可不存在。可包括于双抗原构建体的间隔子核苷酸序列的实例包括编码口蹄疫病毒2A肽(FMD2A或FMDV2A)、马鼻炎A病毒2A肽(ERA2A)、马鼻炎B病毒2A肽(ERB2A)、脑心肌炎病毒2A肽(EMC2A或EMCV2A)、猪捷申病毒2A肽(PT2A)及Thosea asigna病毒2A肽(T2A、TA2A或TAV2A)。在一些实施方案中,所述抗原构建体编码上文叙述的任一免疫原性CEA多肽。在一些实施方案中,所述抗原构建体编码上文叙述的任一免疫原性TERT多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性MUC1多肽。
在一些其他方面中,本公开提供多抗原构建体,其包含(i)编码免疫原性CEA多肽的至少一种编码核苷酸序列、(ii)编码免疫原性MUC1多肽的至少一种编码核苷酸序列及(iii)编码免疫原性TERT多肽的至少一种编码核苷酸序列。在一些实施方案中,多抗原构建体进一步包含间隔子核苷酸序列。多抗原构建体的结构示于式(III):
TAA1-SPACER1-TAA2-SPACER2-TAA3(III)
其中在式(III)中:(i)TAA1、TAA2及TAA3各表示编码选自免疫原性MUC1多肽、免疫原性CEA多肽及免疫原性TERT多肽的免疫原性TAA多肽的核苷酸序列,其中TAA1、TAA2及TAA3编码不同免疫原性TAA多肽;且(ii)SPACER1及SPACER2各表示间隔子核苷酸序列,其中(a)SPACER1及SPACER2可相同或不同且(b)SPACER1及SPACER2之一或二者可不存在。在一些实施方案中,SPACER1及SPACER2,独立地,为编码2A肽的核苷酸序列或编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1及SPACER2为编码2A肽的核苷酸序列。在一些实施方案中,SPACER1及SPACER2为编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1为编码2A肽的核苷酸序列及SPACER2为编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1为编码GGSGG的核苷酸序列及SPACER2为编码2A肽的核苷酸序列。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性CEA多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性TERT多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性MUC1多肽。
在一些实施方案中,本公开提供式(III)的多抗原构建体,其中在式(III)中:(i)TAA1为编码免疫原性MUC1多肽的核苷酸序列;(ii)TAA2为编码免疫原性CEA多肽的核苷酸序列;及(iii)TAA3为编码免疫原性TERT多肽的核苷酸序列。在一些实施方案中,SPACER1及SPACER2,独立地,为编码2A肽的核苷酸序列或编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1及SPACER2为编码2A肽的核苷酸序列。在一些实施方案中,SPACER1及SPACER2为编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1为编码2A肽的核苷酸序列及SPACER2为编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1为编码GGSGG的核苷酸序列及SPACER2为编码2A肽的核苷酸序列。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性CEA多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性TERT多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性MUC1多肽。
在一些其他实施方案中,本公开提供式(III)的多抗原构建体,其中在式(III)中:(i)TAA1为编码免疫原性MUC1多肽的核苷酸序列;(ii)TAA2为编码免疫原性TERT多肽的核苷酸序列;及(iii)TAA3为编码免疫原性CEA多肽的核苷酸序列。在一些实施方案中,SPACER1及SPACER2,独立地,为编码2A肽的核苷酸序列或编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1及SPACER2为编码2A肽的核苷酸序列。在一些实施方案中,SPACER1及SPACER2为编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1为编码2A肽的核苷酸序列及SPACER2为编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1为编码GGSGG的核苷酸序列及SPACER2为编码2A肽的核苷酸序列。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性CEA多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性TERT多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性MUC1多肽。
在其他实施方案中,本公开提供式(III)的多抗原构建体,其中在式(III)中:(i)TAA1为编码免疫原性CEA多肽的核苷酸序列;(ii)TAA2为编码免疫原性TERT多肽的核苷酸序列;及(iii)TAA3为编码免疫原性MUC1多肽的核苷酸序列。在一些实施方案中,SPACER1及SPACER2,独立地,为编码2A肽的核苷酸序列或编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1及SPACER2为编码2A肽的核苷酸序列。在一些实施方案中,SPACER1及SPACER2为编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1为编码2A肽的核苷酸序列及SPACER2为编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1为编码GGSGG的核苷酸序列及SPACER2为编码2A肽的核苷酸序列。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性CEA多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性TERT多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性MUC1多肽。
在一些进一步实施方案中,本公开提供式(III)的多抗原构建体,其中在式(III)中:(i)TAA1为编码免疫原性CEA多肽的核苷酸序列;(ii)TAA2为编码免疫原性MUC1多肽的核苷酸序列;及(iii)TAA3为编码免疫原性TERT多肽的核苷酸序列。在一些实施方案中,SPACER1及SPACER2,独立地,为编码2A肽的核苷酸序列或编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1及SPACER2为编码2A肽的核苷酸序列。在一些实施方案中,SPACER1及SPACER2为编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1为编码2A肽的核苷酸序列及SPACER2为编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1为编码GGSGG的核苷酸序列及SPACER2为编码2A肽的核苷酸序列。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性CEA多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性TERT多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性MUC1多肽。
在又其他实施方案中,本公开提供式(III)的多抗原构建体,其中在式(III)中:(i)TAA1为编码免疫原性TERT多肽的核苷酸序列;(ii)TAA2为编码免疫原性MUC1多肽的核苷酸序列;及(iii)TAA3为编码免疫原性CEA多肽的核苷酸序列。在一些实施方案中,SPACER1及SPACER2,独立地,为编码2A肽的核苷酸序列或编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1及SPACER2为编码2A肽的核苷酸序列。在一些实施方案中,SPACER1及SPACER2为编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1为编码2A肽的核苷酸序列及SPACER2为编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1为编码GGSGG的核苷酸序列及SPACER2为编码2A肽的核苷酸序列。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性CEA多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性TERT多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性MUC1多肽。
在又其他实施方案中,本公开提供式(III)的多抗原构建体,其中在式(III)中:(i)TAA1为编码免疫原性TERT多肽的核苷酸序列;(ii)TAA2为编码免疫原性CEA多肽的核苷酸序列;及(iii)TAA3为编码免疫原性MUC1多肽的核苷酸序列。在一些实施方案中,SPACER1及SPACER2,独立地,为编码2A肽的核苷酸序列或编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1及SPACER2为编码2A肽的核苷酸序列。在一些实施方案中,SPACER1及SPACER2为编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1为编码2A肽的核苷酸序列及SPACER2为编码GGSGG的核苷酸序列。在一些实施方案中,SPACER1为编码GGSGG的核苷酸序列及SPACER2为编码2A肽的核苷酸序列。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性CEA多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性TERT多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性MUC1多肽。
在一些特定实施方案中,本公开提供选自以下的式的多抗原构建体:
(1)MUC1-2A-CEA-2A-TERT (IV)
(2)MUC1-2A-TERT-2A-CEA (V)
(3)CEA-2A-MUC1-2A-TERT(VI)
(4)CEA-2A-TERT-2A-MUC1(VII)
(5)TERT-2A-MUC1-2A-CEA(VIII)
(6)TERT-2A-CEA-2A-MUC1(IX)
其中在式(IV)-(IX)的每一个中:(i)MUC1、CEA及TERT分别表示编码免疫原性MUC1多肽、免疫原性CEA多肽及免疫原性TERT多肽的核苷酸序列,和(ii)2A为编码2A肽的核苷酸序列。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性CEA多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性TERT多肽。在一些实施方案中,抗原构建体编码上文叙述的任一免疫原性MUC1多肽。
由多抗原构建体(包括双抗原构建体及三抗原构建体)编码的免疫原性CEA多肽、免疫原性MUC1多肽及免疫原性TERT多肽,可以是膜结合形式或胞质形式。在一些特定实施方案中,免疫原性TAA多肽是胞质形式。
在一些实施方案中,由多抗原构建体编码的免疫原性CEA多肽包含(1)N结构域的氨基酸序列及(2)人CEA蛋白C样结构域(1、2、3、4或5个)的氨基酸序列。在一些特定实施方案中,免疫原性CEA多肽包含(1)至少四个C样结构域(例如A2、B2、A3及B3)的氨基酸序列及(2)N结构域的氨基酸序列。在一些进一步实施方案中,免疫原性CEA多肽是胞质形式(或“cCEA”)或膜结合形式(或“mCEA”)。
在一些特定实施方案中,由多抗原构建体编码的免疫原性CEA多肽包含选自在下的氨基酸序列:
(1)包含(i)SEQ ID NO:2的氨基酸323-677或(ii)SEQ ID NO:2的氨基酸35-144及323-677或由其组成的氨基酸序列;
(2)包含(i)SEQ ID NO:2的氨基酸323-702或(ii)SEQ ID NO:2的氨基酸2-144及323-702或由其组成的氨基酸序列;
(3)SEQ ID NO:17的氨基酸序列(由质粒1386编码的氨基酸序列(mCEA))或SEQ IDNO:17的氨基酸4-526;
(4)SEQ ID NO:19的氨基酸序列(由质粒1390编码的氨基酸序列(cCEA))或SEQ IDNO:19的氨基酸4-468;或
(5)上述(1)至(4)中任一氨基酸序列的功能性变体。
在一些特定实施方案中,由多抗原构建体编码的免疫原性CEA多肽由选自以下的氨基酸序列组成:
(1)包含(i)SEQ ID NO:2的氨基酸323-677或(ii)SEQ ID NO:2的氨基酸35-144及323-677或由SEQ ID NO:2的氨基酸323-677或(ii)SEQ ID NO:2的氨基酸35-144及323-677组成的氨基酸序列;
(2)包含(i)SEQ ID NO:2的氨基酸323-702或(ii)SEQ ID NO:2的氨基酸2-144及323-702或由(i)SEQ ID NO:2的氨基酸323-702或(ii)SEQ ID NO:2的氨基酸2-144及323-702组成的氨基酸序列;
(3)SEQ ID NO:17的氨基酸序列(由质粒1386编码的氨基酸序列(mCEA))或SEQ IDNO:17的氨基酸4-526;
(4)SEQ ID NO:19的氨基酸序列(由质粒1390编码的氨基酸序列(cCEA))或SEQ IDNO:19的氨基酸4-468;或
(5)上述(1)至(4)中任一氨基酸序列的功能性变体。
在一些特定实施方案中,多抗原构建体为DNA且包含(1)SEQ ID NO:14的核苷酸序列、(2)SEQ ID NO:16的核苷酸序列、(3)SEQ ID NO:18的核苷酸序列或(4)SEQ ID NO:14、16或18核苷酸序列的简并变体。在一些其他特定实施方案中,多抗原构建体为RNA且包含对应于(1)SEQ ID NO:14的核苷酸序列、(2)SEQ ID NO:16的核苷酸序列、(3)SEQ ID NO:18的核苷酸序列或(4)SEQ ID NO:14、16或18核苷酸序列的简并变体的核苷酸序列。
在一些实施方案中,由多抗原构建体编码的免疫原性MUC1多肽包含(1)人MUC1蛋白的20个氨基酸的串联重复序列(3至30个)的氨基酸序列及(2)VNTR区域侧翼的人MUC1蛋白的氨基酸序列。在一些特定实施方案中,由多抗原构建体编码的免疫原性MUC1多肽包含选自以下的氨基酸序列:
(1)SEQ ID NO:5的氨基酸序列(质粒1027多肽);
(2)包含SEQ ID NO:5的氨基酸4-537的氨基酸序列;
(3)包含SEQ ID NO:5的氨基酸24-537的氨基酸序列;
(4)SEQ ID NO:7的氨基酸序列(质粒1197多肽);
(5)包含SEQ ID NO:7氨基酸4-517的氨基酸序列;及
(6)包含SEQ ID NO:7氨基酸4-517的氨基酸序列,附带条件为位置513的氨基酸为T。
在一些实施方案中,由多抗原构建体编码的免疫原性MUC1多肽由(1)人MUC1蛋白20个氨基酸的串联重复序列(3至30个)的氨基酸序列及(2)VNTR区域侧翼的人MUC1蛋白的氨基酸序列组成。在一些特定实施方案中,由多抗原构建体编码的免疫原性MUC1多肽由选自以下的氨基酸序列组成:
(1)SEQ ID NO:5的氨基酸序列(质粒1027多肽);
(2)包含SEQ ID NO:5的氨基酸4-537的氨基酸序列;
(3)包含SEQ ID NO:5的氨基酸24-537的氨基酸序列;
(4)SEQ ID NO:7的氨基酸序列(质粒1197多肽);
(5)包含SEQ ID NO:7的氨基酸4-517的氨基酸序列;及
(6)包含SEQ ID NO:7的氨基酸4-517的氨基酸序列,附带条件为位置513的氨基酸为T。
在一些特定实施方案中,由多抗原构建体编码的免疫原性MUC1多肽由选自由以下的氨基酸序列组成:
(1)由SEQ ID NO:5的氨基酸4-537组成的氨基酸序列;
(2)由SEQ ID NO:5的氨基酸24-537组成的氨基酸序列;
(3)由SEQ ID NO:7的氨基酸4-517组成的氨基酸序列;及
(4)由SEQ ID NO:7的氨基酸4-517组成的氨基酸序列,附带条件为位置513的氨基酸为T。
在一些特定实施方案中,多抗原构建体为DNA且包含(1)SEQ ID NO:4的核苷酸序列、(2)SEQ ID NO:6的核苷酸序列或(3)SEQ ID NO:4或6核苷酸序列的简并变体。在一些其他特定实施方案中,多抗原构建体为RNA且包含对应于(1)SEQ ID NO:4的核苷酸序列、(2)SEQ ID NO:6的核苷酸序列或(3)SEQ ID NO:4或6的核苷酸序列的简并变体的核苷酸序列。
由多抗原构建体编码的免疫原性TERT多肽可为全长TERT蛋白或TERT蛋白的任何截短或突变形式。全长TERT蛋白被预期比截短形式产生更强的免疫应答。然而,取决于所选择递送构建体的特定载体,该载体可能未具携带编码全长TERT蛋白基因的能力。因此,可能从该蛋白缺失一些氨基酸使得转基因适合特定载体。缺失氨基酸可从TERT蛋白序列(例如源自SEQ ID NO:3的TERT蛋白)的N端、C端或任何地方进行。可进行额外的缺失以去除核定位信号,从而使多肽成为细胞质多肽,增加接近细胞抗原加工/呈递机制的机会。在一些实施方案中,免疫原性TERT多肽(例如SEQ ID NO:3的TERT蛋白)没有TERT蛋白N端直至位置200、300、400、500或600的氨基酸。
在一些特定实施方案中,SEQ ID NO:3的TERT蛋白的N端氨基酸1-343(TERT343)、1-240(TERT240)或1-541(TERT541)不存在。因此,在一实施方案中,由本发明多抗原构建体编码的免疫原性TERT多肽的氨基酸序列为下述任一氨基酸序列:
(1)包含SEQ ID NO:3的氨基酸51-1132,且缺少SEQ ID NO:3氨基酸1至50的氨基酸序列;
(2)包含SEQ ID NO:3的氨基酸101-1132,且缺少SEQ ID NO:3氨基酸1至100的氨基酸序列;
(3)包含SEQ ID NO:3的氨基酸151-1132,且缺少SEQ ID NO:3氨基酸1至150的氨基酸序列;
(4)包含SEQ ID NO:3的氨基酸201-1132,且缺少SEQ ID NO:3氨基酸1至200的氨基酸序列;
(5)包含SEQ ID NO:3的氨基酸241-1132,且缺少SEQ ID NO:3氨基酸1至240的氨基酸序列;
(6)包含SEQ ID NO:3的氨基酸301-1132,且缺少SEQ ID NO:3氨基酸1至300的氨基酸序列;
(7)包含SEQ ID NO:3的氨基酸351-1132,且缺少SEQ ID NO:3氨基酸1至350的氨基酸序列;
(8)包含SEQ ID NO:3的氨基酸401-1132,且缺少SEQ ID NO:3氨基酸1至400的氨基酸序列;
(9)包含SEQ ID NO:3的氨基酸451-1132,且缺少SEQ ID NO:3氨基酸1至450的氨基酸序列;
(10)包含SEQ ID NO:3的氨基酸501-1132,且缺少SEQ ID NO:3氨基酸1至500的氨基酸序列;
(11)包含SEQ ID NO:3的氨基酸551-1132,且缺少SEQ ID NO:3氨基酸1至550的氨基酸序列;或
(12)包含SEQ ID NO:3氨基酸601-1132,且缺少SEQ ID NO:3氨基酸1-600的氨基酸序列。
在一实施方案中,由本发明多抗原构建体编码的免疫原性TERT多肽的氨基酸序列为下述任一氨基酸序列:
(1)由SEQ ID NO:3的氨基酸51-1132、101-1132、151-1132、201-1132、251-1132、301-1132、351-1132、401-1132、451-1132、501-1132或551-1132组成的氨基酸序列;
(2)由SEQ ID NO:3的氨基酸601-1132组成的氨基酸序列;
(3)由SEQ ID NO:3的氨基酸542-1132组成的氨基酸序列;
(4)由SEQ ID NO:3的氨基酸344-1132组成的氨基酸序列;
(5)由SEQ ID NO:3的氨基酸241-1132组成的氨基酸序列。
可引入额外的氨基酸突变使得TERT催化结构域失活。所述突变的实例包括在SEQID NO:3的位置712的天冬氨酸的取代,例如D712A,及在SEQ ID NO:3的位置713的缬氨酸的取代,例如V713I。因此,在一实施方案中,多抗原构建体编码的免疫原性TERT多肽由上文公开的任何TERT多肽组成,其中在对应于SEQ ID NO:3的位置712的天冬氨酸被取代和/或在对应于SEQ ID NO:3的位置713的缬氨酸被取代,且其中所述突变使TERT催化结构域失活。在一实施方案中,所述突变由在对应于SEQ ID NO:3的位置712的天冬氨酸的取代及在对应于SEQ ID NO:3的位置713的缬氨酸的取代组成,其中所述突变使TERT催化结构域失活。在一实施方案中,该突变由在对应于SEQ ID NO:3的位置712以丙氨酸取代天冬氨酸(D712A)及在对应于SEQ ID NO:3的位置713以异亮氨酸取代缬氨酸(V713I)组成。
在一些特定实施方案中,由多抗原构建体编码的免疫原性TERT多肽包含选自以下的氨基酸序列:
(1)SEQ ID NO:9的氨基酸序列(质粒1112多肽)或包含SEQ ID NO:9的氨基酸2-893的氨基酸序列;
(2)SEQ ID NO:11的氨基酸序列(质粒1326多肽)或包含SEQ ID NO:11的氨基酸4-791的氨基酸序列;
(3)SEQ ID NO:13的氨基酸序列(质粒1330多肽)或包含SEQ ID NO:13的氨基酸4-594的氨基酸序列;或
(4)氨基酸序列,其为上述(1)至(3)中任一氨基酸序列功能性变体。
在一些特定实施方案中,由多抗原构建体编码的免疫原性TERT多肽由选自以下的氨基酸序列组成:
(1)SEQ ID NO:9的氨基酸序列(质粒1112多肽)或包含SEQ ID NO:9的氨基酸2-893的氨基酸序列;
(2)SEQ ID NO:11的氨基酸序列(质粒1326多肽)或包含SEQ ID NO:11的氨基酸4-791的氨基酸序列;
(3)SEQ ID NO:13的氨基酸序列(质粒1330多肽)或包含SEQ ID NO:13的氨基酸4-594的氨基酸序列;或
(4)氨基酸序列,其为上述(1)至(3)中任一氨基酸序列功能性变体。
在一些特定实施方案中,由多抗原构建体编码的免疫原性TERT多肽由选自以下的氨基酸序列组成:
(1)由SEQ ID NO:9的氨基酸2-893组成的氨基酸序列;
(2)由SEQ ID NO:11的氨基酸4-791组成的氨基酸序列;
(3)由SEQ ID NO:13的氨基酸4-594组成的氨基酸序列;或
(4)氨基酸序列,其为上述(1)至(3)中任一氨基酸序列功能性变体。
在一些特定实施方案中,多抗原构建体为DNA且包含(1)SEQ ID NO:8的核苷酸序列、(2)SEQ ID NO:10的核苷酸序列、(3)SEQ ID NO:12的核苷酸序列或(4)SEQ ID NO:8、SEQ ID NO:10或SEQ ID NO:12核苷酸序列的简并变体。在一些其他特定实施方案中,多抗原构建体为RNA且包含对应于(1)SEQ ID NO:8核苷酸序列、(2)SEQ ID NO:10核苷酸序列、(3)SEQ ID NO:12核苷酸序列或(4)SEQ ID NO:8、10或12核苷酸序列的简并变体的核苷酸序列。
在一些特定实施方案中,本公开提供多抗原构建体,其包含(i)编码免疫原性CEA多肽的至少一种核苷酸序列及(ii)编码免疫原性MUC1多肽或者免疫原性TERT多肽的至少一种核苷酸序列,其中该多抗原构建体编码的氨基酸序列包括:
(1)SEQ ID NO:31的氨基酸序列或SEQ ID NO:31的氨基酸4-1088;
(2)SEQ ID NO:33的氨基酸序列或SEQ ID NO:33的氨基酸4-1081;
(3)SEQ ID NO:35的氨基酸序列或SEQ ID NO:35的氨基酸4-1085;
(4)SEQ ID NO:37的氨基酸序列或SEQ ID NO:37的氨基酸4-1030;
(5)SEQ ID NO:39的氨基酸序列或SEQ ID NO:39的氨基酸4-1381;或
(6)SEQ ID NO:41的氨基酸序列或SEQ ID NO:41的氨基酸4-1441。
在一些特定实施方案中,本公开提供多抗原构建体,其包含(i)编码免疫原性CEA多肽的至少一种核苷酸序列及(ii)编码免疫原性MUC1多肽或者免疫原性TERT多肽的至少一种核苷酸序列,其中该多抗原构建体编码由以下组成的氨基酸序列:
(1)SEQ ID NO:31的氨基酸序列或SEQ ID NO:31的氨基酸4-1088;
(2)SEQ ID NO:33的氨基酸序列或SEQ ID NO:33的氨基酸4-1081;
(3)SEQ ID NO:35的氨基酸序列或SEQ ID NO:35的氨基酸4-1085;
(4)SEQ ID NO:37的氨基酸序列或SEQ ID NO:37的氨基酸4-1030;
(5)SEQ ID NO:39的氨基酸序列或SEQ ID NO:39的氨基酸4-1381;或
(6)SEQ ID NO:41的氨基酸序列或SEQ ID NO:41的氨基酸4-1441。
在一些特定实施方案中,本公开提供多抗原构建体,其为DNA且包含(i)编码免疫原性CEA多肽的至少一种核苷酸序列及(ii)编码免疫原性MUC1多肽或者免疫原性TERT多肽的至少一种核苷酸序列,其中该多抗原构建体包含选自以下的核苷酸序列:
(1)SEQ ID NO:30的核苷酸序列或包含SEQ ID NO:30的核苷酸10-3264的核苷酸序列;
(2)SEQ ID NO:32的核苷酸序列或包含SEQ ID NO:32的核苷酸10-3243的核苷酸序列;
(3)SEQ ID NO:34的核苷酸序列或包含SEQ ID NO:34的核苷酸10-3255的核苷酸序列;
(4)SEQ ID NO:36的核苷酸序列或包含SEQ ID NO:36的核苷酸10-3090的核苷酸序列;
(5)SEQ ID NO:38的核苷酸序列或包含SEQ ID NO:38的核苷酸10-4143的核苷酸序列;
(6)SEQ ID NO:40的核苷酸序列或包含SEQ ID NO:40的核苷酸10-4323的核苷酸序列;或
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
在一些特定实施方案中,本公开提供多抗原构建体,其为DNA且包含(i)编码免疫原性CEA多肽的至少一种核苷酸序列及(ii)编码免疫原性MUC1多肽或者免疫原性TERT多肽的至少一种核苷酸序列,其中该多抗原构建体包含选自以下的核苷酸序列:
(1)由SEQ ID NO:30的核苷酸10-3264组成的核苷酸序列;
(2)由SEQ ID NO:32的核苷酸10-3243组成的核苷酸序列;
(3)由SEQ ID NO:34的核苷酸10-3255组成的核苷酸序列;
(4)由SEQ ID NO:36的核苷酸10-3090组成的核苷酸序列;
(5)由SEQ ID NO:38的核苷酸10-4143组成的核苷酸序列;
(6)由SEQ ID NO:40的核苷酸10-4323组成的核苷酸序列;或
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
在一些其他特定实施方案中,本公开提供多抗原构建体,其为RNA(例如mRNA)且包含(i)编码免疫原性CEA多肽的至少一种核苷酸序列及(ii)编码免疫原性MUC1多肽或者免疫原性TERT多肽的至少一种核苷酸序列,其中该多抗原构建体包含对应于选自以下的核苷酸序列的核苷酸序列:
(1)SEQ ID NO:30的核苷酸序列或包含SEQ ID NO:30的核苷酸10-3264的核苷酸序列;
(2)SEQ ID NO:32的核苷酸序列或包含SEQ ID NO:32的核苷酸10-3243的核苷酸序列;
(3)SEQ ID NO:34的核苷酸序列或包含SEQ ID NO:34的核苷酸10-3255的核苷酸序列;
(4)SEQ ID NO:36的核苷酸序列或包含SEQ ID NO:36的核苷酸10-3090的核苷酸序列;
(5)SEQ ID NO:38的核苷酸序列或包含SEQ ID NO:38的核苷酸10-4143的核苷酸序列;
(6)SEQ ID NO:40的核苷酸序列或包含SEQ ID NO:40的核苷酸10-4323的核苷酸序列;或
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
在一些其他特定实施方案中,本公开提供多抗原构建体,其为RNA(例如mRNA)且包含(i)编码免疫原性CEA多肽的至少一种核苷酸序列及(ii)编码免疫原性MUC1多肽或者免疫原性TERT多肽的至少一种核苷酸序列,其中该多抗原构建体包含对应于选自以下的核苷酸序列的核苷酸序列:
(1)由SEQ ID NO:30的核苷酸10-3264组成的核苷酸序列;
(2)由SEQ ID NO:32的核苷酸10-3243组成的核苷酸序列;
(3)由SEQ ID NO:34的核苷酸10-3255组成的核苷酸序列;
(4)由SEQ ID NO:36的核苷酸10-3090组成的核苷酸序列;
(5)由SEQ ID NO:38的核苷酸10-4143组成的核苷酸序列;
(6)由SEQ ID NO:40的核苷酸10-4323组成的核苷酸序列;或
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
在一些其他实施方案中,本公开提供多抗原构建体,其包含(1)编码免疫原性CEA多肽的至少一种核苷酸序列、(2)编码免疫原性MUC1多肽的至少一种核苷酸序列及(3)编码免疫原性TERT多肽的至少一种核苷酸序列,其中该多抗原构建体包含编码选自以下的氨基酸序列的核苷酸序列:
(1)SEQ ID NO:43的氨基酸序列或包含SEQ ID NO:43的氨基酸4-2003的氨基酸序列;
(2)SEQ ID NO:45的氨基酸序列或包含SEQ ID NO:45的氨基酸4-2001的氨基酸序列;
(3)SEQ ID NO:47的氨基酸序列或包含SEQ ID NO:47的氨基酸4-2008的氨基酸序列;
(4)SEQ ID NO:49的氨基酸序列或包含SEQ ID NO:49的氨基酸4-1996的氨基酸序列;
(5)SEQ ID NO:51的氨基酸序列或包含SEQ ID NO:51的氨基酸4-1943的氨基酸序列;或
(6)SEQ ID NO:53的氨基酸序列或包含SEQ ID NO:53的氨基酸4-1943的氨基酸序列。
在一些其他实施方案中,本公开提供多抗原构建体,其包含(1)编码免疫原性CEA多肽的至少一种核苷酸序列、(2)编码免疫原性MUC1多肽的至少一种核苷酸序列及(3)编码免疫原性TERT多肽的至少一种核苷酸序列,其中该多抗原构建体包含编码选自以下的氨基酸序列的核苷酸序列:
(1)由SEQ ID NO:43的氨基酸4-2003组成的氨基酸序列;
(2)由SEQ ID NO:45的氨基酸4-2001组成的氨基酸序列;
(3)由SEQ ID NO:47的氨基酸4-2008组成的氨基酸序列;
(4)由SEQ ID NO:49的氨基酸4-1996组成的氨基酸序列;
(5)由SEQ ID NO:51的氨基酸4-1943组成的氨基酸序列;或
(6)由SEQ ID NO:53的氨基酸4-1943组成的氨基酸序列。
在一些特定实施方案中,本公开提供多抗原构建体,其包含(1)编码免疫原性CEA多肽的至少一种核苷酸序列、(2)编码免疫原性MUC1多肽的至少一种核苷酸序列及(3)编码免疫原性TERT多肽的至少一种核苷酸序列,其中该多抗原构建体为DNA且包含选自以下的核苷酸序列:
(1)SEQ ID NO:42的核苷酸序列或包含SEQ ID NO:42的核苷酸10-6009的核苷酸序列;
(2)SEQ ID NO:44的核苷酸序列或包含SEQ ID NO:44的核苷酸10-6003的核苷酸序列;
(3)SEQ ID NO:46的核苷酸序列或包含SEQ ID NO:46的核苷酸10-6024的核苷酸序列;
(4)SEQ ID NO:48的核苷酸序列或包含SEQ ID NO:48核苷酸10-5988的核苷酸序列;
(5)SEQ ID NO:50的核苷酸序列或包含SEQ ID NO:50的核苷酸10-5829的核苷酸序列;
(6)SEQ ID NO:52的核苷酸序列或包含SEQ ID NO:52的核苷酸10-5829的核苷酸序列;及
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
在一些特定实施方案中,本公开提供多抗原构建体,其包含(1)编码免疫原性CEA多肽的至少一种核苷酸序列、(2)编码免疫原性MUC1多肽的至少一种核苷酸序列及(3)编码免疫原性TERT多肽的至少一种核苷酸序列,其中该多抗原构建体为DNA且包含选自以下的核苷酸序列:
(1)由SEQ ID NO:42的核苷酸10-6009组成的核苷酸序列;
(2)由SEQ ID NO:44的核苷酸10-6003组成的核苷酸序列;
(3)由SEQ ID NO:46的核苷酸10-6024组成的核苷酸序列;
(4)由SEQ ID NO:48的核苷酸10-5988组成的核苷酸序列;
(5)由SEQ ID NO:50的核苷酸10-5829组成的核苷酸序列;
(6)由SEQ ID NO:52的核苷酸10-5829组成的核苷酸序列;及
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
在一些其他特定实施方案中,本公开提供多抗原构建体,其中该多抗原构建体为RNA(例如mRNA)且包含对应于选自以下的核苷酸序列的核苷酸序列:
(1)SEQ ID NO:42的核苷酸序列或包含SEQ ID NO:42的核苷酸10-6009的核苷酸序列;
(2)SEQ ID NO:44的核苷酸序列或包含SEQ ID NO:44的核苷酸10-6003的核苷酸序列;
(3)SEQ ID NO:46的核苷酸序列或包含SEQ ID NO:46的核苷酸10-6024的核苷酸序列;
(4)SEQ ID NO:48的核苷酸序列或包含SEQ ID NO:48核苷酸10-5988的核苷酸序列;
(5)SEQ ID NO:50的核苷酸序列或包含SEQ ID NO:50的核苷酸10-5829的核苷酸序列;
(6)SEQ ID NO:52的核苷酸序列或包含SEQ ID NO:52的核苷酸10-5829的核苷酸序列;及
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
在一些其他特定实施方案中,本公开提供多抗原构建体,其中该多抗原构建体为RNA(例如mRNA)且包含对应于选自以下的核苷酸序列的核苷酸序列:
(1)由SEQ ID NO:42的核苷酸10-6009组成的核苷酸序列;
(2)由SEQ ID NO:44的核苷酸10-6003组成的核苷酸序列;
(3)由SEQ ID NO:46的核苷酸10-6024组成的核苷酸序列;
(4)由SEQ ID NO:48的核苷酸10-5988组成的核苷酸序列;
(5)由SEQ ID NO:50的核苷酸10-5829组成的核苷酸序列;
(6)由SEQ ID NO:52的核苷酸10-5829组成的核苷酸序列;及
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
在其他特定实施方案中,本公开提供包含(1)编码免疫原性CEA多肽的至少一种核苷酸序列、(2)编码免疫原性MUC1多肽的至少一种核苷酸序列及(3)编码免疫原性TERT多肽的至少一种核苷酸序列的多抗原构建体,其中该多抗原构建体为RNA(例如mRNA)且包含选自以下的核苷酸序列:
(1)SEQ ID NO:87的核苷酸序列;
(2)SEQ ID NO:88的核苷酸序列;
(3)SEQ ID NO:89的核苷酸序列;
(4)SEQ ID NO:90的核苷酸序列;
(5)SEQ ID NO:91的核苷酸序列;
(6)SEQ ID NO:92的核苷酸序列;及
(7)SEQ ID NO:87、SEQ ID NO:88、SEQ ID NO:89、SEQ ID NO:90、SEQ ID NO:91或SEQ ID NO:92任一核苷酸序列的简并变体。
在又其他特定实施方案中,本公开提供包含(1)编码免疫原性CEA多肽的至少一种核苷酸序列、(2)编码免疫原性MUC1多肽的至少一种核苷酸序列及(3)编码免疫原性TERT多肽的至少一种核苷酸序列的多抗原构建体,其中该多抗原构建体为RNA(例如mRNA)且由选自以下的核苷酸序列组成:
(1)SEQ ID NO:87的核苷酸序列;
(2)SEQ ID NO:88的核苷酸序列;
(3)SEQ ID NO:89的核苷酸序列;
(4)SEQ ID NO:90的核苷酸序列;
(5)SEQ ID NO:91的核苷酸序列;
(6)SEQ ID NO:92的核苷酸序列;及
(7)SEQ ID NO:87、SEQ ID NO:88、SEQ ID NO:89、SEQ ID NO:90、SEQ ID NO:91或SEQ ID NO:92任一核苷酸序列的简并变体。
D.含有抗原构建体的载体
本发明的另一方面涉及含有一或多种本公开提供的任一抗原构建体的载体,包括单抗原构建体、双抗原构建体、三抗原构建体及其他多抗原构建体。载体用于克隆或表达由抗原构建体编码的免疫原性TAA多肽或用于递送组合物(例如疫苗)中的抗原构建体至宿主细胞或宿主动物(例如人)。
可制备各式各样载体以包含及表达本公开提供的抗原构建体,例如质粒载体、黏粒载体、噬菌体载体及病毒载体。除了亦称为开放阅读框(ORF)的转基因插入序列(即,本公开提供的单抗原构建体或多抗原构建体)之外,载体的结构通常包含赋予或促进表达的其他组件或元件,例如复制起点、多克隆位点及可选择标记。
在一些实施方案中,本公开提供含有本公开提供的抗原构建体的质粒载体。适当质粒载体的实例包括pBR325、pUC18、pSKF、pET23D及pGB-2。质粒载体的其他实例以及构建这些载体的方法描述于U.S.Pat.No.5,589,466、5,688,688及5,814,482。包含单抗原构建体、双抗原构建体或三抗原构建体的特定的例示性质粒载体的构建亦描述于本公开。
在一些特定实施方案中,本公开提供包含SEQ ID NO:54、55、56、57、59、61、63、65、67、69、70、71、72、73及74的任一核苷酸序列的质粒载体。
在其他实施方案中,本发明提供从病毒(包括DNA病毒及RNA病毒(反转录病毒))构建的载体(即,病毒载体)。可用以构建载体的DNA病毒实例包括单纯疱疹病毒、细小病毒、痘苗病毒及腺病毒。可用以构建载体的RNA病毒实例包括α病毒、黄病毒、瘟病毒、流感病毒、狂犬病病毒及水疱病毒。源自各种病毒的载体的构建是本领域已知的。反转录病毒载体的实例描述于U.S.Pat.Nos.5,716,613、5,716,832及5,817,491。可从α病毒产生的载体的实例描述于U.S.Pat.Nos.5,091,309、5,843,723及5,789,245。其他载体的实例包括:(1)痘病毒,例如金丝雀痘病病毒或痘苗病毒(U.S.Pat.Nos.4,603,112、4,769,330及5,017,487;WO89/01973);(2)SV40(Mulligan et al.,Nature 277:108-114,1979);(3)疱疹(Kit,Adv.Exp.Med.Biol.215:219-236,1989;U.S.Pat.No.5,288,641)及(4)慢病毒例如HIV(Poznansky,J.Virol.65:532-536,1991)。
在一些特定实施方案中,本公开提供衍生自非人灵长类腺病毒(例如猿猴腺病毒)的腺病毒载体。这样的腺病毒载体的实例,以及其制备,描述于PCT申请公告案WO2005/071093及WO2010/086189,且包括以猿猴腺病毒构建的非复制性载体,例如ChAd3、ChAd4、ChAd5、ChAd7、ChAd8、ChAd9、ChAd10、ChAd11、ChAd16、ChAd17、ChAd19、ChAd20、ChAd22、ChAd24、ChAd26、ChAd30、ChAd31、ChAd37、ChAd38、ChAd44、ChAd63、ChAd68、ChAd82、ChAd55、ChAd73、ChAd83、ChAd146、ChAd147、PanAd1、Pan Ad2与Pan Ad3,及以腺病毒Ad4或Ad7构建的具有复制能力的载体。优选的是,在以猿猴腺病毒构建腺病毒载体中,通过缺失或突变使得来自病毒基因组区域的一或多个早期基因缺失或使其不具有功能,所述基因组区域选自E1A、E1B、E2A、E2B、E3及E4。在特定实施方案中,该载体以ChAd68构建。黑猩猩腺病毒ChAd68在文献中亦称为猿猴腺病毒25、C68、AdC68、Chad68、SAdV25、PanAd9或Pan9。构建源自ChAd68的载体用于表达多抗原构建体的方法描述于国际专利申请公开WO2015/063647中。表达载体通常包括一或多种与要表达的核酸序列可操作连接的控制元件。术语“控制元件”总的来说是指启动子区域、多聚腺苷酸化信号、转录终止序列、上游调控结构域、复制起始点、内部核糖体进入位点(“IRES”)、增强子等,其总的来说提供受体细胞中编码序列的复制、转录及翻译。只要所选择的编码序列能在适当宿主细胞中复制、转录及翻译,不是所有这些控制元件都需要总是存在。根据本领域技术人员知道的一些因素(例如特定的宿主细胞及其他载体组件的来源或结构)选择控制元件。为了增强免疫原性TAA多肽的表达,可在编码免疫原性TAA多肽序列的上游提供Kozak序列。对于脊椎动物,已知的Kozak序列为(GCC)NCCATGG,其中N为A或G且GCC较不保守。可使用的示例性Kozak序列包括GAACATGG、ACCAUGG及ACCATGG。
在一些实施方案中,载体包含编码(i)至少一种免疫原性CEA多肽及(ii)至少一种免疫原性MUC1多肽或至少一种免疫原性TERT多肽的多抗原构建体。载体可为DNA质粒载体、DNA病毒载体、RNA质粒载体或RNA病毒载体。在一些特定实施方案中,载体为DNA载体且包含含有选自以下的核苷酸序列的多抗原构建体:
(1)SEQ ID NO:30的核苷酸序列或包含SEQ ID NO:30的核苷酸10-3264的核苷酸序列;
(2)SEQ ID NO:32的核苷酸序列或包含SEQ ID NO:32的核苷酸10-3243的核苷酸序列;
(3)SEQ ID NO:34的核苷酸序列或包含SEQ ID NO:34的核苷酸10-3255的核苷酸序列;
(4)SEQ ID NO:36的核苷酸序列或包含SEQ ID NO:36的核苷酸10-3090的核苷酸序列;
(5)SEQ ID NO:38的核苷酸序列或包含SEQ ID NO:38的核苷酸10-4143的核苷酸序列;
(6)SEQ ID NO:40的核苷酸序列或包含SEQ ID NO:40的核苷酸10-4323的核苷酸序列;及
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
在一些其他特定实施方案中,本公开提供RNA载体,其包含对应于选自以下的核苷酸序列的核苷酸序列:
(1)SEQ ID NO:30的核苷酸序列或包含SEQ ID NO:30的核苷酸10-3264的核苷酸序列;
(2)SEQ ID NO:32的核苷酸序列或包含SEQ ID NO:32的核苷酸10-3243的核苷酸序列;
(3)SEQ ID NO:34的核苷酸序列或包含SEQ ID NO:34的核苷酸10-3255的核苷酸序列;
(4)SEQ ID NO:36的核苷酸序列或包含SEQ ID NO:36的核苷酸10-3090的核苷酸序列;
(5)SEQ ID NO:38的核苷酸序列或包含SEQ ID NO:38的核苷酸10-4143的核苷酸序列;
(6)SEQ ID NO:40的核苷酸序列或包含SEQ ID NO:40的核苷酸10-4323的核苷酸序列;及
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
在一些其他实施方案中,载体含有编码(i)至少一种免疫原性MUC1多肽、(ii)至少一种免疫原性CEA多肽及(iii)至少一种免疫原性TERT多肽的多抗原构建体。该载体可为DNA质粒载体、DNA病毒载体、RNA质粒载体或RNA病毒载体。在一些特定实施方案中,本公开提供DNA载体,其包括含有选自以下的核苷酸序列的多抗原构建体:
(1)SEQ ID NO:42的核苷酸序列或包含SEQ ID NO:42的核苷酸10-6009的核苷酸序列;
(2)SEQ ID NO:44的核苷酸序列或包含SEQ ID NO:44的核苷酸10-6003的核苷酸序列;
(3)SEQ ID NO:46的核苷酸序列或包含SEQ ID NO:46的核苷酸10-6024的核苷酸序列;
(4)SEQ ID NO:48的核苷酸序列或包含SEQ ID NO:48的核苷酸10-5988的核苷酸序列;
(5)SEQ ID NO:50的核苷酸序列或包含SEQ ID NO:50的核苷酸10-5829的核苷酸序列;或
(6)SEQ ID NO:52的核苷酸序列或包含SEQ ID NO:52的核苷酸10-5829的核苷酸序列;及
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
在一些其他特定实施方案中,本公开提供RNA载体,其包含对应于选自由下列所组成群组的核苷酸序列的核苷酸序列:
(1)SEQ ID NO:42的核苷酸序列或包含SEQ ID NO:42的核苷酸10-6009的核苷酸序列;
(2)SEQ ID NO:44的核苷酸序列或包含SEQ ID NO:44的核苷酸10-6003的核苷酸序列;
(3)SEQ ID NO:46的核苷酸序列或包含SEQ ID NO:46的核苷酸10-6024的核苷酸序列;
(4)SEQ ID NO:48的核苷酸序列或包含SEQ ID NO:48的核苷酸10-5988的核苷酸序列;
(5)SEQ ID NO:50的核苷酸序列或包含SEQ ID NO:50的核苷酸10-5829的核苷酸序列;或
(6)SEQ ID NO:52的核苷酸序列或包含SEQ ID NO:52的核苷酸10-5829的核苷酸序列;及
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
在一些特定实施方案中,本公开提供包含SEQ ID NO:58、60、62、64、66及68的任一核苷酸序列的DNA病毒载体。在一些其他特定实施方案中,本公开提供包含SEQ ID NO:57、59、61、63、65、67、69、70、71、72、73及74的任一核苷酸序列的DNA质粒载体。
在一些特定实施方案中,该载体为DNA载体且包含含有选自以下的核苷酸序列的多抗原构建体:
(1)由SEQ ID NO:30的核苷酸10-3264组成的核苷酸序列;
(2)由SEQ ID NO:32的核苷酸10-3243组成的核苷酸序列;
(3)由SEQ ID NO:34的核苷酸10-3255组成的核苷酸序列;
(4)由SEQ ID NO:36的核苷酸10-3090组成的核苷酸序列;
(5)由SEQ ID NO:38的核苷酸10-4143组成的核苷酸序列;
(6)由SEQ ID NO:40的核苷酸10-4323组成的核苷酸序列;及
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
在一些其他特定实施方案中,本公开提供RNA载体,其包含对应于选自由下列所组成群组的核苷酸序列的核苷酸序列:
(1)由SEQ ID NO:30的核苷酸10-3264组成的核苷酸序列;
(2)由SEQ ID NO:32的核苷酸10-3243组成的核苷酸序列;
(3)由SEQ ID NO:34的核苷酸10-3255组成的核苷酸序列;
(4)由SEQ ID NO:36的核苷酸10-3090组成的核苷酸序列;
(5)由SEQ ID NO:38的核苷酸10-4143组成的核苷酸序列;
(6)由SEQ ID NO:40的核苷酸10-4323组成的核苷酸序列;及
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
在一些其他实施方案中,载体含有编码(i)至少一种免疫原性MUC1多肽、(ii)至少一种免疫原性CEA多肽及(iii)至少一种免疫原性TERT多肽的多抗原构建体。该载体可为DNA质粒载体、DNA病毒载体、RNA质粒载体或RNA病毒载体。在一些特定实施方案中,本公开提供DNA载体,其包括含有选自以下的核苷酸序列的多抗原构建体:
(1)由SEQ ID NO:42的核苷酸10-6009组成的核苷酸序列;
(2)由SEQ ID NO:44的核苷酸10-6003组成的核苷酸序列;
(3)由SEQ ID NO:46的核苷酸10-6024组成的核苷酸序列;
(4)由SEQ ID NO:48的核苷酸10-5988组成的核苷酸序列;
(5)由SEQ ID NO:50的核苷酸10-5829组成的核苷酸序列;或
(6)由SEQ ID NO:52的核苷酸10-5829组成的核苷酸序列;及
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
在一些特定实施方案中,本公开提供由SEQ ID NO:58、60、62、64、66及68的任一核苷酸序列组成的DNA病毒载体。在一些其他特定实施方案中,本公开提供由SEQ ID NO:57、59、61、63、65、67、69、70、71、72、73及74的任一核苷酸序列组成的DNA质粒载体。
在一些其他特定实施方案中,本公开提供RNA载体,其包含对应于选自以下的核苷酸序列的核苷酸序列:
(1)由SEQ ID NO:42的核苷酸10-6009组成的核苷酸序列;
(2)由SEQ ID NO:44的核苷酸10-6003组成的核苷酸序列;
(3)由SEQ ID NO:46的核苷酸10-6024组成的核苷酸序列;
(4)由SEQ ID NO:48的核苷酸10-5988组成的核苷酸序列;
(5)由SEQ ID NO:50的核苷酸10-5829组成的核苷酸序列;
(6)由SEQ ID NO:52的核苷酸10-5829组成的核苷酸序列;及
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
E.包含抗原构建体或载体的组合物
本公开还提供组合物,其包含本公开提供的分离的核酸分子(即,抗原构建体)或载体。组合物可只包含一种单独的抗原构建体,例如双抗原构建体或三抗原构建体。其也可包含两种或多种不同的单独抗原构建体,例如单抗原构建体及双抗原构建体的组合或编码不同免疫原性TAA多肽的三或多种单抗原构建体的组合。所述组合物可用于引起针对TAA蛋白的体外免疫应答或哺乳动物(包括人)的体内免疫应答。在一些实施方案中,该组合物为免疫原性组合物或药物组合物。在一些特定实施方案中,该组合物为施用于人以用于(1)抑制异常细胞增殖、针对癌症的进展提供保护(用作预防剂)、(2)治疗与TAA过表达相关的癌症(用作治疗剂)或(3)引起针对特定人TAA(例如CEA、MUC1及TERT)免疫应答的疫苗组合物。
在一些实施方案中,本公开所提供的组合物包含多抗原构建体或包含多抗原构建体的载体,其中该多抗原构建体编码两种或多种免疫原性TAA多肽。举例而言,多抗原构建体可编码下述任一组合中的两种或多种免疫原性TAA多肽:
(1)免疫原性CEA多肽及免疫原性MUC1多肽;
(2)免疫原性CEA多肽及免疫原性TERT多肽;及
(3)免疫原性CEA多肽、免疫原性MUC1多肽及免疫原性TERT多肽。
在一些特定实施方案中,本公开所提供的组合物包含双抗原构建体或包含双抗原构建体的载体,其中该双抗原构建体包含选自以下的核苷酸序列:
(1)编码SEQ ID NO:31的氨基酸序列或SEQ ID NO:31的氨基酸4-1088的核苷酸序列;
(2)编码SEQ ID NO:33的氨基酸序列或SEQ ID NO:33的氨基酸4-1081的核苷酸序列;
(3)编码SEQ ID NO:35的氨基酸序列或SEQ ID NO:35的氨基酸4-1085的核苷酸序列;
(4)编码SEQ ID NO:37的氨基酸序列或SEQ ID NO:37的氨基酸4-1030的核苷酸序列;
(5)编码SEQ ID NO:39的氨基酸序列或SEQ ID NO:39的氨基酸4-1381的核苷酸序列;
(6)编码SEQ ID NO:41的氨基酸序列或SEQ ID NO:41的氨基酸4-1441的核苷酸序列;
(7)SEQ ID NO:30的核苷酸序列或包含SEQ ID NO:30的核苷酸10-3264的核苷酸序列;
(8)SEQ ID NO:32的核苷酸序列或包含SEQ ID NO:32的核苷酸10-3243的核苷酸序列;
(9)SEQ ID NO:34的核苷酸序列或包含SEQ ID NO:34的核苷酸10-3255的核苷酸序列;
(10)SEQ ID NO:36的核苷酸序列或包含SEQ ID NO:36的核苷酸10-3090的核苷酸序列;
(11)SEQ ID NO:38的核苷酸序列或包含SEQ ID NO:38的核苷酸10-4143的核苷酸序列;
(12)SEQ ID NO:40的核苷酸序列或包含SEQ ID NO:40的核苷酸10-4323的核苷酸序列;及
(13)核苷酸序列,其为上述(1)至(12)的任一核苷酸序列的简并变体。
在一些其他特定实施方案中,本公开所提供的组合物包含(1)三抗原构建体或(2)包含三抗原构建体的载体,其中该三抗原构建体包含选自以下的核苷酸序列:
(1)编码SEQ ID NO:43的氨基酸序列或SEQ ID NO:43的氨基酸4-2003的核苷酸序列;
(2)编码SEQ ID NO:45的氨基酸序列或SEQ ID NO:45的氨基酸4-2001的核苷酸序列;
(3)编码SEQ ID NO:47的氨基酸序列或SEQ ID NO:47的氨基酸4-2008的核苷酸序列;
(4)编码SEQ ID NO:49的氨基酸序列或SEQ ID NO:49的氨基酸4-1996的核苷酸序列;
(5)编码SEQ ID NO:51的氨基酸序列或SEQ ID NO:51的氨基酸4-1943的核苷酸序列;
(6)编码SEQ ID NO:53的氨基酸序列或SEQ ID NO:53的氨基酸4-1943的核苷酸序列;
(7)SEQ ID NO:42的核苷酸序列或包含SEQ ID NO:42的核苷酸10-6009的核苷酸序列;
(8)SEQ ID NO:44的核苷酸序列或包含SEQ ID NO:44的核苷酸10-6003的核苷酸序列;
(9)SEQ ID NO:46的核苷酸序列或包含SEQ ID NO:46的核苷酸10-6024的核苷酸序列;
(10)SEQ ID NO:48的核苷酸序列或包含SEQ ID NO:48的核苷酸10-5988的核苷酸序列;
(11)SEQ ID NO:50的核苷酸序列或包含SEQ ID NO:50的核苷酸10-5829的核苷酸序列;
(12)SEQ ID NO:52的核苷酸序列或包含SEQ ID NO:52的核苷酸10-5829的核苷酸序列;及
(13)核苷酸序列,其为上述(1)至(12)的任一核苷酸序列的简并变体。
在一些其他特定实施方案中,本公开所提供的组合物包含三抗原构建体或包含三抗原构建体的载体,其中该三抗原构建体包含选自以下的核苷酸序列:
(1)由SEQ ID NO:42的核苷酸10-6009组成的核苷酸序列;
(2)由SEQ ID NO:44的核苷酸10-6003组成的核苷酸序列;
(3)由SEQ ID NO:46的核苷酸10-6024组成的核苷酸序列;
(4)由SEQ ID NO:48的核苷酸10-5988组成的核苷酸序列;
(5)由SEQ ID NO:50的核苷酸10-5829组成的核苷酸序列;
(6)由SEQ ID NO:52的核苷酸10-5829组成的核苷酸序列;及
(7)核苷酸序列,其为上述(1)至(6)的任一核苷酸序列的简并变体。
在其他特定实施方案中,本公开所提供的组合物包含RNA三抗原构建体或包含RNA三抗原构建体的载体,其中该三抗原构建体包含对应在(1)SEQ ID NO:42、44、46、48、50、52的任一序列或(2)为SEQ ID NO:42、44、46、48、50、52的任一核苷酸序列的简并变体的核苷酸序列。
在其他特定实施方案中,本公开所提供的组合物包含RNA三抗原构建体或包含RNA三抗原构建体的载体,其中该三抗原构建体由对应于(1)SEQ ID NO:42、44、46、48、50、52的任一序列的核苷酸序列或(2)为SEQ ID NO:42、44、46、48、50、52的任一核苷酸序列的简并变体的核苷酸序列组成。
在其他特定实施方案中,本公开所提供的组合物包含三抗原构建体或包含三抗原构建体的载体,其中该三抗原构建体包含(1)SEQ ID NO:87、88、89、90、91及92中任一核苷酸序列或(2)SEQ ID NO:87、88、89、90、91及92中任一核苷酸序列的简并变体。在一些其他特定实施方案中,本公开提供包含质粒的组合物,其中该质粒包含SEQ ID NO:57、59、61、63、65及67的任一核苷酸序列。在又其他特定实施方案中,本公开提供包含载体的组合物,其中该载体包含SEQ ID NO:58、60、62、64、66及68的任一核苷酸序列。
在其他特定实施方案中,本公开所提供的组合物包含三抗原构建体或包含三抗原构建体的载体,其中该三抗原构建体由(1)SEQ ID NO:87、88、89、90、91及92的任一核苷酸序列或(2)SEQ ID NO:87、88、89、90、91及92的任一核苷酸序列的简并变体组成。在一些其他特定实施方案中,本公开提供包含质粒的组合物,其中该质粒由SEQ ID NO:57、59、61、63、65及67的任一核苷酸序列组成。在其他特定实施方案中,本公开提供包含载体的组合物,其中该载体由SEQ ID NO:58、60、62、64、66及68的任一核苷酸序列组成。
组合物,例如药物组合物或疫苗组合物,可进一步包含药学上可接受的赋形剂。适用于核酸组合物(包括DNA疫苗及RNA疫苗组合物)的药学上可接受的赋形剂是本领域技术人员熟知的。此类赋形剂可为水性或非水性溶液、悬浮液及乳液。非水性赋形剂的实例包括丙二醇、聚乙二醇、植物油(例如橄榄油)及注射用有机酯(例如油酸乙酯)。水性赋形剂的实例包括水、醇性/水性溶液、乳液或悬浮液,包括盐水及缓冲介质。适当的赋形剂也包括有助于细胞摄取多核苷酸分子的物质。此类物质的实例为(i)修饰细胞渗透性的化学制品,例如布比卡因(bupivacaine);(ii)用于封装多核苷酸的脂质体或病毒颗粒;或(iii)其本身与多核苷酸结合的阳离子脂质或二氧化硅、金或钨微粒。阴离子及中性脂质体为本领域公知(关在制造脂质体方法的详细说明参见例如,Liposomes:A Practical Approach,RPC NewEd,IRL press(1990))且可用于递送大范围的产品(包括多核苷酸)。
本公开提供的免疫原性组合物、药物组合物或疫苗组合物可与一或多种免疫调节剂结合或组合使用。该组合物亦可与一或多种佐剂结合或组合使用。另外,该组合物可与一或多种免疫调节剂及一或多种佐剂结合或组合使用。免疫调节剂或佐剂可与抗原构建体或载体分开配制,或其可为相同组合物配制物的一部分。因此,在一些实施方案中,本公开提供药物组合物,其包含(1)由本公开提供的抗原构建体或含此类抗原构建体的载体及(2)免疫调节剂。在一些进一步实施方案中,该药物组合物进一步包含佐剂。免疫调节剂及佐剂的实例在下文中提供。
组合物,包括疫苗组合物,可制备成任何适当剂型,例如液态形式(如,溶液、悬浮液或乳液)及固态形式(如,胶囊、片剂或粉剂),且可利用本领域技术人员已知的方法制备。
F.抗原构建体、载体及组合物的用途
在其他方面中,本公开提供(1)抗原构建体、载体及组合物作为药剂的用途;(2)抗原构建体、载体及组合物用于制造引起针对TAA的免疫应答、抑制异常细胞增殖或治疗癌症的药剂的用途及(3)使用抗原构建体、载体及组合物的方法;其中该抗原构建体、载体及组合物如上文中所述。
在一方面中,本公开提供使用(1)编码一或多种免疫原性TAA多肽的抗原构建体、(2)含有所述抗原构建体的载体或(3)含有所述抗原构建体或载体的组合物以在哺乳动物(例如人)中引起针对TAA的免疫应答的用途。在一些实施方案中,本公开提供在哺乳动物(特别是人)中引起针对TAA的免疫应答的方法,该方法包括给该哺乳动物施用有效量的包含(1)编码一或多种免疫原性TAA多肽的抗原构建体或(2)含有编码一或多种免疫原性TAA多肽抗原构建体的载体的组合物。在一些实施方案中,本公开提供在哺乳动物(特别是人)中引起针对CEA的免疫应答的方法,其包括给该哺乳动物施用有效量的包含本公开提供的抗原构建体的组合物,其中该抗原构建体包含(1)编码免疫原性CEA多肽的至少一种核苷酸序列及(2)编码免疫原性MUC1多肽或免疫原性TERT多肽的至少一种核苷酸序列。在一些其他实施方案中,本公开提供在哺乳动物(特别是人)中引起针对MUC1的免疫应答的方法,其包括给该哺乳动物施用有效量的包含本公开提供的抗原构建体的组合物,其中该抗原构建体包含(1)编码免疫原性MUC1多肽的至少一种核苷酸序列及(2)编码免疫原性CEA多肽或免疫原性TERT多肽的至少一种核苷酸序列。在一些进一步实施方案中,本公开提供在哺乳动物(特别是人)中引起针对TERT的免疫应答的方法,其包括给该哺乳动物施用有效量的包含本公开提供的抗原构建体的组合物,其中该抗原构建体包含(1)编码免疫原性TERT多肽的至少一种核苷酸序列及(2)编码免疫原性MUC1多肽或免疫原性CEA多肽的至少一种核苷酸序列。
在另一方面中,本公开提供使用(1)编码一或多种免疫原性TAA多肽的抗原构建体、(2)含所述抗原构建体的载体或(3)含所述抗原构建体或载体的组合物以在哺乳动物(例如人)中抑制异常细胞增殖的用途。在一些实施方案中,本公开提供在哺乳动物(特别是人)中抑制异常细胞增殖的方法,包括给该哺乳动物施用有效量的包含(1)编码一或多种免疫原性TAA多肽的抗原构建体或(2)含有编码一或多种免疫原性TAA多肽抗原构建体的载体的组合物,其中该异常细胞增殖与肿瘤相关抗原CEA、MUC1或TERT过表达有关。异常细胞增殖可能在人任何器官或组织例如乳房、胃、卵巢、肺、膀胱、大肠(例如结肠及直肠)、肾脏、胰脏及前列腺中。在一些实施方案中,该方法系用于抑制乳房、卵巢、胰脏、结肠、肺、胃及直肠中的异常细胞增殖。所施用的组合物中的抗原构建体或载体编码衍生自过表达的肿瘤相关抗原或对其具免疫原性的至少一种免疫原性多肽。抗原构建体可为单抗原构建体或多抗原构建体,例如双抗原构建体或三抗原构建体。在一些特定实施方案中,组合物包含编码免疫原性CEA多肽、免疫原性MUC1多肽及免疫原性TERT多肽的三抗原构建体。
在又一方面中,本公开提供(1)编码一或多种免疫原性TAA多肽的抗原构建体、(2)含所述抗原构建体的载体或(3)含所述抗原构建体或载体的组合物作为用于治疗哺乳动物(特别是人)中癌症的药剂的用途。在一些实施方案中,本公开提供治疗人中癌症的方法,其中该癌症与一或多种肿瘤相关抗原CEA、MUC1或TERT的过表达有关。该方法包括给人施用有效量的组合物,其包含编码至少一种免疫原性多肽的抗原构建体,该免疫原性多肽衍生自特定癌症中过表达的肿瘤相关抗原或对所述抗原具有免疫原性。抗原构建体可为单抗原构建体或多抗原构建体,例如双抗原构建体或三抗原构建体。在一些特定实施方案中,组合物包含编码免疫原性CEA多肽、免疫原性MUC1多肽及免疫原性TERT多肽的三抗原构建体。过表达肿瘤相关抗原MUC1、CEA、和/或TERT的任何癌症可利用本公开提供的方法治疗。癌症的实例包括乳腺癌、卵巢癌、肺癌(例如小细胞肺癌及非小细胞肺癌)、结肠直肠癌、胃癌及胰腺癌。在一些特定实施方案中,本公开提供治疗人体癌症的方法,其包括给人施用有效量的包含三抗原构建体的组合物,其中该癌症为(1)乳腺癌,例如雌激素受体和/或黄体酮受体阳性乳腺癌、HER2阳性乳腺癌或三阴性乳腺癌;(2)肺癌,例如NSCLC或SCLC;(3)胃癌;(4)胰腺癌;或(5)结肠直肠癌。
在一些特定实施方案中,本公开提供引起针对TAA免疫应答的方法、抑制异常细胞增殖的方法或治疗哺乳动物(特别是人)癌症的方法,该方法包括给该哺乳动物施用有效量的包含多抗原构建体或包含多抗原构建体的载体的组合物,其中该多抗原构建体包含编码SEQ ID NO:43、45、47、49、51及53的任一氨基酸序列的核苷酸序列。在其他特定实施方案中,本公开提供引起针对TAA免疫应答的方法、抑制异常细胞增殖的方法或治疗哺乳动物(特别是人)癌症的方法,该方法包括给该哺乳动物施用有效量的包含多抗原构建体的组合物,其中该多抗原构建体包含SEQ ID NO:42、44、46、48、50、52及87至92的任一核苷酸序列。在其他特定实施方案中,本公开提供引起针对TAA免疫应答的方法、抑制异常细胞增殖的方法或治疗哺乳动物(特别是人)癌症的方法,该方法包括给该哺乳动物施用有效量的包含载体的组合物,其中该载体包含SEQ ID NO:57至68的任一核苷酸序列。
组合物可通过本领域已知的一些适当方法施用于哺乳动物(包括人)。适当方法的实例包括:(1)肌内、皮内、表皮内或皮下施用,(2)口服施用,及(3)局部涂敷(例如眼部、鼻腔及阴道内涂敷)。核酸疫苗组合物(特别是含DNA质粒的组合物)的皮内或表皮内施用的一种特定方法是基因枪递送,其使用由PowderMed销售的颗粒介导的表皮递送(PMEDTM)的疫苗递送装置进行。PMED为给动物或人施用疫苗的无针法。PMED系统涉及使DNA沉淀于微观金颗粒上,然后由氦气推进到表皮中。将涂覆DNA的金颗粒递送到表皮的APC及角质细胞中,一旦进入这些细胞的核中,DNA就从金洗脱出来且变得具有转录活性,产生所编码的蛋白质。用于肌内施用核酸疫苗的另一特定方法涉及电穿孔。电穿孔使用受控制的电脉冲在细胞膜中产生暂时的细孔,其帮助细胞摄取注射到肌肉中的核酸疫苗。当CpG及核酸疫苗组合使用时,可使CpG与核酸疫苗在一种配制物中一起配制并利用电穿孔肌内施用该配制物。
在特定方法中施用组合物的有效量可由本领域技术人员容易地决定且将取决于一些因素。在治疗癌症(例如胰腺癌、卵巢癌、肺癌、结肠直肠癌、胃癌及乳腺癌)的方法中,决定有效量可能考虑的因素包括待治疗的个体(包括个体的免疫状态及健康状况)、待治疗癌症的严重性或阶段、所表达的特异性免疫原性TAA多肽、期望保护或治疗的程度、施用方法及计划与所使用的其他治疗剂(如佐剂或免疫调节剂)。配制及递送方法为确定引起有效免疫应答所需核酸剂量的关键因素。举例而言,当疫苗配制为水性溶液且通过皮下注射针注射或气动注射施用时,疫苗中核酸的有效量可在每剂2μg-10mg的范围内,而当核酸制备为涂覆的金颗粒且使用基因枪技术递送时,则仅需要每剂16ng-16μg。利用电穿孔的疫苗中核酸剂量范围通常在每剂0.5-10mg的范围内。在核酸疫苗与CpG以共配制物利用电穿孔一起施用的情形下,核酸疫苗的剂量可在每剂0.5-5mg的范围内且CpG的剂量通常在每剂0.05mg-5mg的范围内,例如每人每剂0.05、0.2、0.6或1.2mg。
本公开提供的疫苗组合物可用于初免增强策略以诱导强健且持久的免疫应答。根据重复注射相同免疫原性构建体的初免及增强疫苗接种规程为众所周知。一般而言,第一剂疫苗可能无法产生保护性免疫,仅“初免”免疫系统。第二、第三或随后的剂量(“增强”)后逐渐产生保护性免疫应答。根据常规技术进行增强,且可就施用计划、施用途径、佐剂选择、剂量及当与另一疫苗一起施用时的潜在顺序,凭经验进一步优化。在一实施方案中,在常规的均质性(homologous)初免增强策略中使用疫苗组合物,其中在初免以及增强剂量上都施用动物相同疫苗。举例而言,在初始剂量(“初免”)以及随后剂量(“增强”)中均施用含质粒载体的相同疫苗组合物。在另一实施方案中,在异质性(heterologous)初免增强疫苗接种中使用疫苗组合物,其中在预定的时间间隔下施用表达相同免疫原性TAA多肽的不同类型疫苗。例如,以初免剂量的质粒载体形式及以增强剂量的病毒载体形式施用抗原构建体,或反之亦然。
疫苗组合物可与一或多种佐剂一起使用。适当佐剂的实例包括:(1)水包油型乳液配制物,例如MF59及AS03;(2)皂素佐剂,例如QS21及
Figure BDA0002402017140000541
(Commonwealth SerumLaboratories,Australia);(3)弗氏完全佐剂(CFA)及弗氏不完全佐剂(IFA);(4)细胞因子,例如介白素(如IL-1、IL-2、IL-4、IL-5、IL-6、IL-7、IL-12)、干扰素(如γ干扰素)、巨噬细胞群落刺激因子(M-CSF)及肿瘤坏死因子(TNF);(5)单磷酰脂质A(MPL)或3-O-去酰基MPL(3dMPL);(6)包含CpG基序的寡核苷酸及(7)金属盐,包括铝盐(明矾),例如磷酸铝及氢氧化铝。
进一步,为了治疗哺乳动物(包括人)中包括癌症的赘生性病症,可将组合物与一或多种免疫调节剂组合施用。免疫调节剂可为免疫抑制细胞抑制剂(ISC抑制剂)或免疫效应细胞增强剂(IEC增强剂)。进一步,一或多种ISC抑制剂可与一或多种IEC增强剂组合使用。免疫调节剂可利用任何适当方法及途径施用,包括(1)全身性施用例如静脉内、肌内或口服施用;及(2)局部施用例如皮内及皮下施用。在适当或合适的情况下,局部施用通常优在全身性施用。任何免疫调节剂的局部施用可在哺乳动物身体的任何适于药物局部施用的位置进行;然而,更优选这些免疫调节剂在靠近疫苗排出(draining)的淋巴结附近局部施用。
组合物,例如疫苗,可与所用的任何或所有免疫调节剂同时或依序施用。同样地,当使用两种或多种免疫调节剂时,其可彼此关连地同时或依序施用。在一些实施方案中,疫苗与一免疫调节剂彼此关连地同时(如,在混合物中)施用,但与一或多种另外的免疫调节剂则彼此关连地依序施用。疫苗及免疫调节剂的共同施用可包括其中施用疫苗及至少一种免疫调节剂使得各自同时存在于施用部位(例如疫苗排出的淋巴结),即使抗原与免疫调节剂未同时施用的情况。疫苗与免疫调节剂的共同施用亦可包括其中疫苗或免疫调节剂从施用部位清除,但清除疫苗或免疫调节剂的至少一种细胞效应在施用部位(例如疫苗排出的淋巴结)存留,至少直到将一或多种另外的免疫调节剂施用施用部位的情况。在核酸疫苗及CpG组合施用的情况中,疫苗及CpG可包含在单一配制物中且利用任何适当方法一起施用。在一些实施方案中,共配制物(混合物)中的核酸疫苗及CpG利用肌内注射组合电穿孔施用。
在一些实施方案中,免疫调节剂为ISC抑制剂。ISC抑制剂的实例包括(1)蛋白激酶抑制剂,例如伊马替尼(imatinib)、索拉非尼(sorafenib)、拉帕替尼(lapatinib)、BIRB-796及AZD-1152、AMG706、凡德他尼(Zactima,ZD6474)、MP-412、索拉非尼(BAY 43-9006)、达沙替尼(dasatinib)、CEP-701(来他替尼(lestaurtinib))、XL647、XL999、Tykerb(拉帕替尼)、MLN518(前称CT53518)、PKC412、ST1571、AEE 788、OSI-930、OSI-817、苹果酸苏尼替尼(sunitinib)(Sutent)、阿西替尼(axitinib)(AG-013736)、厄洛替尼(erlotinib)、吉非替尼(gefitinib)、阿西替尼、博舒替尼(bosutinib)、替昔罗莫司(temsirolismus)及尼罗替尼(nilotinib)(AMN107)。在一些特定实施方案中,蛋白激酶抑制剂为酪氨酸激酶抑制剂,包括苏尼替尼、索拉非尼或苏尼替尼或索拉非尼的药学上可接受的盐或衍生物(例如苹果酸盐或甲苯磺酸盐);(2)环氧合酶-2(COX-2)抑制剂,例如塞来昔布(celecoxib)及罗非昔布(rofecoxib);(3)五型磷酸二酯酶(PDE5)抑制剂,例如阿伐那非(avanafil)、罗地那非(lodenafil)、米罗那非(mirodenafil)、西地那非(sildenafil)、他达拉非(tadalafil)、伐地那非(vardenafil)、乌地那非(udenafil)、扎普司特(zaprinast);(4)DNA交联剂,例如环磷酰胺;(5)PARP抑制剂,例如塔拉佐帕尼(talazoparib)及(6)CDK抑制剂,如帕博塞克(palbocyclib)。
在一些实施方案中,与核酸组合物组合使用的免疫调节剂为IEC增强剂。可一起使用两种或多种IEC增强剂。可使用的IEC增强剂的实例包括:(1)TNFR激动剂,例如OX40、4-1BB(例如BMS-663513)、GITR(例如TRX518)及CD40(例如CD40激动型抗体)的激动剂;(2)CTLA-4抑制剂,例如伊匹单抗(Ipilimumab)及德美利姆单抗(Tremelimumab);(3)TLR激动剂,例如
CpG 7909(5'TCGTCGTTTTGTCGTTTTGTCGTT3')、CpG 24555(5'TCGTCGTTTTTCGGTGCTTTT3'及CpG 10103(5'TCGTCGTTTTTCGGTCGTTTT3');(4)程序性细胞死亡蛋白1(PD-1)抑制剂,例如纳武单抗(nivolumab)及派姆单抗(pembrolizumab);(5)PD-L1抑制剂,例如阿替珠单抗(atezolizumab)、度伐单抗(durvalumab)、阿维单抗(avelumab);及(6)IDO1抑制剂。
在一些实施方案中,IEC增强剂为CD40激动剂抗体,其可为人、人源化或部分人嵌合抗CD40抗体。特异性CD40激动剂抗体的实例包括G28-5、mAb89、EA-5或S2C6单克隆抗体及CP870,893。CP-870,893是经临床研究作为抗肿瘤疗法的完全人激动性CD40单克隆抗体(mAb)。CP870,893的结构及制备公开于WO2003041070(其中抗体通过内部命名的“21.4.1”鉴定且抗体重链及轻链氨基酸序列分别示于SEQ ID NO:40及SEQ ID NO:41)。为了与本公开组合物组合使用,CP-870,893可通过任何适当途径(例如皮内、皮下或肌内注射)施用。CP870893的有效量通常在0.01-0.25mg/kg的范围内。在一些实施方案中,CP870893以0.05-0.1mg/kg的量施用。
在一些其他实施方案中,IEC增强剂为CTLA-4抑制剂,例如伊匹单抗及德美利姆单抗。以YERVOY销售的伊匹单抗(也称为MEX-010或MDX-101)为人抗人CTLA-4抗体。伊匹单抗亦可以其CAS登录编号477202-00-9提及且在PCT公开号WO 01/14424中公开为抗体10DI。德美利姆单抗(也称为CP-675,206)为完全人IgG2单克隆抗体且具有CAS编号745013-59-6。德美利姆单抗公开于美国专利号6,682,736中,其全部内容通过参考并入本文,其被命名为抗体11.2.1且其重链及轻链氨基酸序列分别示于SEQ ID NO:42及43。为了与本公开所提供组合物组合使用,德美利姆单抗可局部(特别是皮内或皮下)施用。皮内或皮下施用德美利姆单抗的有效量一般在每人每剂5-200mg的范围内。在一些实施方案中,德美利姆单抗的有效量在每人每剂10-150mg的范围内。在一些特定实施方案中,德美利姆单抗的有效量为每人约10、25、50、75、100、125、150、175或200mg/剂。
在一些其他实施方案中,免疫调节剂为PD-1抑制剂或PD-L1抑制剂。PD-1抑制剂的实例包括纳武单抗(商品名Opdivo)、派姆单抗(商品名Keytruda)、RN888(抗PD-1抗体)、皮地利珠单抗(pidilizumab)(Cure Tech)、AMP-224(GSK)、AMP-514(GSK)及PDR001(Novartis)。PD-L1抑制剂的实例包括阿替珠单抗(PD-L1特异性单抗;商品名Tecentriq)、度伐单抗(PD-L1特异性单抗;商品名Imfinzi)、阿维单抗(PD-L1特异性单抗;商品名Bavencio)、及BMS-936559(BMS)。亦参见Okazaki T et al.,International Immunology(2007);19,7:813-824及Sunshine J et al.,Curr Opin Pharmacol.2015Aug;23:32-8。在一些特定实施方案中,PD-1抑制剂为RN888。RN888为特异性结合PD-1的单克隆抗体。RN888公开于国际专利申请开WO2016/092419中,其中该抗体经鉴定为具有SEQ ID NO:29的全长重链氨基酸序列及SEQ ID NO:39的全长轻链氨基酸序列的mAb7。
在其他实施方案中,免疫调节剂为吲哚胺2,3-二加氧酶1(亦为所谓“IDO1”)的抑制剂。发现IDO1将免疫细胞功能调节为抑制性表型,因此认为其部分导致肿瘤逃避宿主免疫监视。该酶降解必需氨基酸色氨酸成为犬尿氨酸及其他代谢物。发现这些代谢物及缺乏色氨酸导致抑制效应T细胞功能及调节性T细胞的分化加强。IDO1抑制剂可为大分子,例如抗体,或小分子,例如化合物。
在一些特定实施方案中,本公开所提供的多肽或核酸组合物与公开于WO2010/005958的1,2,5-恶二唑衍生的IDO1抑制剂组合使用。特定的1,2,5-恶二唑衍生的IDO1抑制剂的实例包括下列化合物:
4-({2-((氨基磺酰基)氨基)乙基}氨基)-N-(3-溴-4-氟苯基)-N'-羟基-l,2,5-恶二唑-3-甲脒(carboximidamide);
4-({2-((氨基磺酰基)氨基)乙基}氨基)-N-(3-氯-4-氟苯基)-N'-羟基-1,2,5-恶二唑3-甲脒;
4-({2-((氨基磺酰基)氨基)乙基}氨基)-N-(4-氟-3-(三氟甲基)苯基)-N'-羟基-1,2,5-恶二唑-3-甲脒;
4-({2-((氨基磺酰基)氨基)乙基}氨基)-N'-羟基-N-(3-(三氟甲基)苯基)-1,2,5-恶二唑-3-甲脒;
4-({2-((氨基磺酰基)氨基)乙基}氨基)-N-(3-氰基-4-氟苯基)-N'-羟基-1,2,5-恶二唑3-甲脒;
4-({2-((氨基磺酰基)氨基)乙基}氨基)-N-((4-溴-2-呋喃基)甲基)-N'-羟基-1,2,5-恶二唑-3-甲脒;或
4-({2-((氨基磺酰基)氨基)乙基}氨基)-N-((4-氯-2-呋喃基)甲基)-N'-羟基-1,2,5-恶二唑-3-甲脒。
1,2,5-恶二唑衍生的IDO1抑制剂通常每天口服施用一次或两次且口服施用的有效量通常在每一患者每剂25mg-1000mg的范围内(例如25mg、50mg、100mg、200mg、300mg、400mg、500mg、600mg、700mg、800mg或1000mg)。在特定实施方案中,本公开所提供的多肽或核酸组合物以每剂量25mg或50mg每天口服施用两次与4-({2-((氨基磺酰基)氨基)乙基}氨基)-N-(3-溴-4-氟苯基)-N'-羟基-l,2,5-恶二唑-3-甲脒组合使用。该1,2,5-恶二唑衍生物可如美国专利案No.8,088,803中所述合成,其全部内容通过参考并入本文。
在一些其他特定实施方案中,本公开提供的多肽或核酸组合物与公开于WO2015/173764中的吡咯烷-2,5-二酮衍生的IDO1抑制剂组合使用。特定吡咯烷-2,5-二酮衍生的抑制剂的实例包括下述化合物:
3-(5-氟-1H-吲哚-3-基)吡咯烷-2,5-二酮;
(3-2H)-3-(5-氟-1H-吲哚-3-基)吡咯烷-2,5-二酮;
(-)-(R)-3-(5-氟-1H-吲哚-3-基)吡咯烷-2,5-二酮;
3-(1H-吲哚-3-基)吡咯烷-2,5-二酮;
(-)-(R)-3-(1H-吲哚-3-基)吡咯烷-2,5-二酮;
3-(5-氯-1H-吲哚-3-基)吡咯烷-2,5-二酮;
(-)-(R)-3-(5-氯-1H-吲哚-3-基)吡咯烷-2,5-二酮;
3-(5-溴-1H-吲哚-3-基)吡咯烷-2,5-二酮;
3-(5,6-二氟-1H-吲哚-3-基)吡咯烷-2,5-二酮;及
3-(6-氯-1H-吲哚-3-基)吡咯烷-2,5-二酮。
吡咯啶-2,5-二酮衍生的IDO1抑制剂通常每天口服施用一次或两次且口服施用的有效量通常在每一患者每剂50mg-1000mg的范围内(例如125mg、250mg、500mg、750mg或1000mg)。在特定实施方案中,本公开所提供的多肽或核酸组合物以每一患者每剂125-100mg每天口服施用一次与3-(5-氟-1H-吲哚-3-基)吡咯烷-2,5-二酮组合使用。吡咯烷-2,5-二酮衍生物可如美国专利申请公告案US2015329525中所述合成,其全部内容通过参考并入本文。
G.实施例
提供下述实施例以说明本发明的特定实施方案。其不意在解释为以任何方式对本发明构成局限。从上述讨论及这些实施例,本领域技术人员可确定本发明的必要特征,且在不离开其精神与范围下,可进行本发明的各种变化与修饰以使其适应各种用法及条件。
实施例1.构建含有单抗原构建体或多抗原构建体的质粒
实施例1说明含有单抗原构建体、双抗原构建体或三抗原构建体的质粒载体的构建。除非另行说明,否则提及MUC1、CEA及TERT蛋白的氨基酸位置或残基分别指如SEQ IDNO:1中所示的人MUC1同种型1前体蛋白的氨基酸序列、如SEQ ID NO:2中所示的人癌胚抗原(CEA)同种型1前体蛋白的氨基酸序列及如SEQ ID NO:3中所示的人TERT同种型1前体蛋白的氨基酸序列。表16中提供质粒构建中所使用的一些引物的结构。
1A.含有单抗原构建体的质粒
质粒1027(MUC1)。使用基因合成及限制片段交换技术产生质粒1027。将具有5X串联重复序列VNTR区的人MUC1的氨基酸序列提交GeneArt以供基因优化及合成。为表达、合成及克隆优化编码该多肽的基因。该MUC-1开放阅读框通过以NheI及BglII酶切从GeneArt载体切除并插入被同样酶切的质粒pPJV7563中。质粒1027的开放阅读框(ORF)核苷酸序列示于SEQ ID NO:4。由质粒1027编码的氨基酸序列示于SEQ ID NO:5。
质粒1361(CEA)。使用基因合成、PCR及无缝克隆技术构建质粒1361。首先,为了表达在DNA2.0将编码CEA参考序列的基因进行密码子优化。使用引物ID1361-1362_PCRF及ID1361-1362_PCRR,利用PCR扩增编码氨基酸2-702的序列。通过无缝克隆将扩增子克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1361的开放阅读框核苷酸序列示于SEQ ID NO:14。由质粒1361编码的氨基酸序列示于SEQ ID NO:15。
质粒1386(mCEA)。使用PCR及无缝克隆技术构建编码膜结合免疫原性CEA多肽(mCEA)的质粒1386。首先,使用引物f pmed CEA SS及r CEA D1利用PCR从质粒1361扩增编码CEA的氨基酸2-144的基因片段。其次,使用引物f CEA D1-D4及r pmed CEA GPI,利用PCR从质粒1361扩增编码CEA的氨基酸323-702的基因片段。连接扩增子且通过无缝克隆将其克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1386的开放阅读框核苷酸序列示于SEQ ID NO:16。由质粒1386编码的氨基酸序列示于SEQ ID NO:17。
质粒1390(cCEA)。使用PCR及无缝克隆技术构建编码胞质免疫原性CEA多肽(cCEA)的质粒1390。首先,使用引物f pmed CEA D1及r CEA D1利用PCR从质粒1361扩增编码CEA的氨基酸35-144的基因片段。其次,使用引物f CEA D1-D4及r pmed CEA D7利用PCR从质粒1361扩增编码CEA的氨基酸323-677的基因片段。连接扩增子且通过无缝克隆将其克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1390的开放阅读框核苷酸序列示于SEQ ID NO:18。由质粒1390编码的氨基酸序列示于SEQ ID NO:19。
质粒1065(全长TERT D712A/V713I)。使用基因合成及限制片段交换技术产生质粒1065。将经设计以失活酶活性的具两个突变(D712A/V713I)的人TERT的氨基酸序列提交DNA2.0进行基因优化及合成。为了表达、合成及克隆优化编码该多肽的基因。该TERT开放阅读框通过以NheI及BglII酶切从DNA2.0载体切除并插入被同样酶切的质粒pPJV7563中。由质粒1065编码的氨基酸序列示于SEQ ID NO:81。质粒1065的开放阅读框(ORF)核苷酸序列示于SEQ ID NO:82。
质粒1112(TERT240)。使用PCR及无缝克隆技术构建质粒1112。首先,使用引物fpmed TERT 241G及r TERT co#pMed利用PCR从质粒1065扩增编码TERT的氨基酸241-1132的基因。通过无缝克隆将扩增子克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1112的开放阅读框核苷酸序列示于SEQ ID NO:8。由质粒1112编码的氨基酸序列示于SEQ ID NO:9。
质粒1197(cMUC1)。使用PCR及无缝克隆技术构建编码胞质免疫原性MUC1多肽(cMUC1)的质粒1197。首先,使用引物ID1197F及ID1197R利用PCR从质粒1027扩增编码MUC1的氨基酸22-225、946-1255的基因。通过无缝克隆将扩增子克隆入pPJV7563的Nhe I/BglII位点中。质粒1197的开放阅读框核苷酸序列示于SEQ ID NO:6。由质粒1197编码的氨基酸序列示于SEQ ID NO:7。
质粒1326(TERT343)。使用PCR及无缝克隆技术构建质粒1326。首先,使用引物TertΔ343-F及Tert-R利用PCR从质粒1112扩增编码TERT的氨基酸344-1132的基因。通过无缝克隆将扩增子克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1326的开放阅读框核苷酸序列示于SEQ ID NO:10。由质粒1326编码的氨基酸序列示于SEQ ID NO:11。
质粒1330(TERT541)。使用PCR及无缝克隆技术构建质粒1330。首先,使用引物TertΔ541-F及Tert-R利用PCR从质粒1112扩增编码TERT的氨基酸542-1132的基因。通过无缝克隆将扩增子克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1330的开放阅读框核苷酸序列示于SEQ ID NO:12。由质粒1330编码的氨基酸序列示于SEQ ID NO:13。
1B.含有双抗原构建体的质粒
质粒1269(Muc1-Tert240)。使用PCR及无缝克隆技术构建质粒1269。首先,使用引物f tg link Ter240及r pmed Bgl Ter240利用PCR从质粒1112扩增编码人端粒酶的氨基酸241-1132的基因。使用引物f pmed Nhe Muc及r link muc利用PCR从质粒1027扩增编码人黏蛋白-1的氨基酸2-225、946-1255的基因。PCR导致在Tert的5'端及Muc1的3'端添加重迭的GGSGG接头。使扩增子混合在一起并通过无缝克隆将其克隆入pPJV7563的Nhe I/BglII位点中。质粒1269的开放阅读框核苷酸序列示于SEQ ID NO:20。由质粒1269编码的氨基酸序列示于SEQ ID NO:21。
质粒1270(Muc1-ERB2A-Tert240)。使用PCR及无缝克隆技术构建质粒1270。首先,使用引物f2 ERBV2A、f1 ERBV2A Ter240及r pmed Bgl Ter240利用PCR从质粒1112扩增编码人端粒酶的氨基酸241-1132的基因。使用引物f pmed Nhe Muc及r ERB2A Bamh Muc利用PCR从质粒1027扩增编码人黏蛋白-1的氨基酸2-225、946-1255的基因。PCR导致在Tert的5'端及Muc1的3'端添加重迭的ERBV 2A序列。使扩增子混合在一起并通过无缝克隆将其克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1270的开放阅读框核苷酸序列示于SEQ ID NO:22。由质粒1270编码的氨基酸序列示于SEQ ID NO:23。
质粒1271(Tert240-ERB2A-Muc1)。使用PCR及无缝克隆技术构建质粒1271。首先,使用引物f pmed Nhe Ter240及r ERB2A Bamh Ter240利用PCR从质粒1112扩增编码人端粒酶的氨基酸241-1132的基因。使用引物f2 ERBV2A、f1 ERBV2A Muc及r pmed Bgl Muc利用PCR从质粒1027扩增编码人黏蛋白-1的氨基酸2-225、946-1255的基因。PCR导致在Tert的3'端及Muc1的5'端添加重迭的ERBV 2A序列。使扩增子混合在一起并通过无缝克隆将其克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1271的开放阅读框核苷酸序列示于SEQ ID NO:24。由质粒1271编码的氨基酸序列示于SEQ ID NO:25。
质粒1286(cMuc1-ERB2A-Tert240)。使用PCR及无缝克隆技术构建质粒1286。首先,使用引物f2 ERBV2A、f1 ERBV2A Ter240及r pmed Bgl Ter240利用PCR从质粒1112扩增编码人端粒酶的氨基酸241-1132的基因。使用引物f pmed Nhe cytMuc及r ERB2A Bamh Muc利用PCR从质粒1197扩增编码人黏蛋白-1的氨基酸22-225、946-1255的基因。PCR导致在Tert的5'端及Muc1的3'端添加重迭的ERBV 2A序列。使扩增子混合在一起并通过无缝克隆将其克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1286的开放阅读框核苷酸序列示于SEQID NO:26。由质粒1286编码的氨基酸序列示于SEQ ID NO:27。
质粒1287(Tert240-ERB2A-cMuc1)。使用PCR及无缝克隆技术构建质粒1287。首先,使用引物f pmed Nhe Ter240及r ERB2A Bamh Ter240利用PCR从质粒1112扩增编码人端粒酶的氨基酸241-1132的基因。使用引物f2 ERBV2A、f1 ERBV2A cMuc及r pmed Bgl Muc利用PCR从质粒1197扩增编码人黏蛋白-1的氨基酸22-225、946-1255的基因。PCR导致在Tert的3'端及Muc1的5'端重迭的ERBV 2A序列。使扩增子混合在一起并通过无缝克隆将其克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1287的开放阅读框核苷酸序列示于SEQ ID NO:28。由质粒1287编码的氨基酸序列示于SEQ ID NO:29。
质粒1409(Muc1-EMC2A-mCEA)。使用PCR及无缝克隆技术构建质粒1409。首先,使用引物f pmed Nhe Muc及r EM2A Bamh Muc利用PCR从质粒1027扩增编码人黏蛋白-1的氨基酸2-225、946-1255的基因。使用引物f2 EMCV2A、1EMC2a CEAss及r pmed CEA GPI利用PCR从质粒1386扩增编码CEA的氨基酸2-144、323-702的基因。PCR导致在CEA的5'端及Muc1的3'端添加重迭的EMCV 2A序列。使扩增子混合在一起并通过无缝克隆将其克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1409的开放阅读框核苷酸序列示于SEQ ID NO:30。由质粒1409编码的氨基酸序列示于SEQ ID NO:31。
质粒1410(mCEA-T2A-Muc1)。使用PCR及无缝克隆技术构建质粒1410。首先,使用引物f pmed CEA SS及r T2A CEA利用PCR从质粒1386扩增编码CEA的氨基酸2-144、323-702的基因。使用引物f2 T2A 63,f1 T2a Muc及r pmed Bgl Muc利用PCR从质粒1027扩增编码人黏蛋白-1的氨基酸2-225、946-1255的基因。PCR导致在CEA的3'端及Muc1的5'端添加重迭的T2A序列。使扩增子混合在一起并通过无缝克隆将其克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1410的开放阅读框核苷酸序列示于在SEQ ID NO:32。由质粒1410编码的氨基酸序列示于SEQ ID NO:33。
质粒1411(mCEA-Furin-T2A-Muc1)。使用PCR及无缝克隆技术构建质粒1411。首先,使用引物f pmed CEA SS及r T2A弗林蛋白酶CEA利用PCR从质粒1386扩增编码CEA的氨基酸2-144、323-702的基因。使用引物f2 T2A 63、f1 T2a Muc及r pmed Bgl Muc利用PCR从质粒1027扩增编码人黏蛋白-1的氨基酸2-225、946-1255的基因。PCR导致在CEA的3'端与Muc1的5'端添加重迭的弗林蛋白酶切割位点及T2A序列。使扩增子混合在一起并通过无缝克隆将其克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1411的开放阅读框核苷酸序列示于SEQ IDNO:34。由质粒1411编码的氨基酸序列示于SEQ ID NO:35。
质粒1431(Muc1-EMC2A-cCEA)。使用PCR及无缝克隆技术构建质粒1431。首先,使用引物f pmed Nhe Muc及r EM2A Bamh Muc利用PCR从质粒1027扩增编码人黏蛋白-1的氨基酸2-225、946-1255之基因。使用引物f2 EMCV2A、f EMC2a CEA d1及r pmed CEA D7利用PCR从质粒1390扩增编码CEA的氨基酸35-144、323-677之基因。PCR导致于CEA之5'端及Muc1之3'端添加重迭之EMCV 2A序列。使扩增子混合在一起并通过无缝克隆将其克隆入pPJV7563之Nhe I/Bgl II位点中。质粒1431之开放阅读框核苷酸序列示于SEQ ID NO:36。由质粒1431编码之氨基酸序列示于SEQ ID NO:37。
质粒1432(cCEA-T2A-Tert240)。使用PCR及无缝克隆技术构建质粒1432。首先,使用引物f pmed CEA D1及r T2a CEA D7利用PCR从质粒1390扩增编码CEA的氨基酸35-144、323-677的基因。使用引物f2T2A 63、f1 T2A Tert240及r pmed Bgl Ter240利用PCR从质粒1112扩增编码人端粒酶的氨基酸241-1132的基因。PCR导致在Tert的5'端及CEA的3'端添加重迭的TAV 2A序列。使扩增子混合在一起并通过无缝克隆将其克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1432的开放阅读框核苷酸序列示于SEQ ID NO:38。由质粒1432编码的氨基酸序列示于SEQ ID NO:39。
质粒1440(Tert240-ERA2A-mCEA)。使用PCR及无缝克隆技术构建质粒1440。首先,使用引物f pmed Nhe tert240及r ERA2A Tert利用PCR从质粒1112扩增编码人端粒酶的氨基酸241-1132的基因。使用引物f2ERAV2A、f1 ERA2A ssCEA及r pmed CEA GPI利用PCR从质粒1386扩增编码CEA的氨基酸2-144、323-702的基因。PCR导致在Tert的3'端及CEA的5'端添加重迭的ERAV 2A序列。使扩增子混合在一起并通过无缝克隆将其克隆入pPJV7563的NheI/Bgl II位点中。质粒1440的开放阅读框核苷酸序列示于SEQ ID NO:40。由质粒1440编码的氨基酸序列示于SEQ ID NO:41。
1C.含有三抗原构建体的质粒
质粒1424(Muc1-ERB2A-Tert240-ERA2A-mCEA)。使用PCR及无缝克隆技术构建质粒1424。首先,使用引物f pmed Nhe Muc及r tert 1602-1579利用PCR从质粒1270扩增编码人黏蛋白-1的氨基酸2-225、946-1255、ERBV 2A肽及人Tert240氨基端一半的基因。使用引物ftert 1584-1607及r pmed CEA GPI利用PCR从质粒1440扩增编码Tert240羧基端一半、ERAV2A肽及人CEA的氨基酸2-144、323-702的基因。部分重迭的扩增子以Dpn I切开,混合在一起,并通过无缝克隆将其克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1424的开放阅读框核苷酸序列示于SEQ ID NO:42。由质粒1424编码的氨基酸序列示于SEQ ID NO:43。
质粒1425(mCEA-T2A-Muc1-ERB2A-Tert240)。使用PCR及无缝克隆技术构建质粒1425。首先,使用引物f pmed CEA SS及r muc 986-963利用PCR从质粒1410扩增编码人CEA氨基酸的2-144、323-702、TAV 2A肽及人黏蛋白-1氨基端一半的基因。使用引物f Muc 960-983及r pmed Bgl Ter240利用PCR从质粒1270扩增编码人黏蛋白-1羧基端一半、ERBV 2A肽及人端粒酶的氨基酸241-1132的基因。部分重迭的扩增子以Dpn I切开,混合在一起,并通过无缝克隆将其克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1425的开放阅读框核苷酸序列示于SEQ ID NO:44。由质粒1425编码的氨基酸序列示于SEQ ID NO:45。
质粒1426(Tert240-ERB2A-Muc1-EMC2A-mCEA)。使用PCR及无缝克隆技术构建质粒1426。首先,使用引物f pmed Nhe Ter240及r muc 986-963利用PCR从质粒1271扩增编码人端粒酶的氨基酸241-1132、ERBV 2A肽及人黏蛋白-1氨基端一半的基因。使用引物f Muc960-983及r pmed CEA GPI利用PCR从质粒1409扩增编码人黏蛋白-1羧基端一半、EMCV 2A肽及CEA的氨基酸2-144、323-702的基因。部分重迭的扩增子以Dpn I切开,混合在一起,并通过无缝克隆将其克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1426的开放阅读框核苷酸序列示于SEQ ID NO:46。由质粒1426编码的氨基酸序列示于SEQ ID NO:47。
质粒1427(Tert240-ERA2A-mCEA-T2A-Muc1)。使用PCR及无缝克隆技术构建质粒1427。首先,使用引物f pmed Nhe Ter240及R CEA SR2利用PCR从质粒1440扩增编码人端粒酶的氨基酸241-1132、ERAV 2A肽及mCEA氨基端一半的基因。使用引物f cCEA 562-592及rpmed Bgl Muc利用PCR从质粒1410扩增编码mCEA羧基端一半、TAV 2A肽及人黏蛋白-1的氨基酸2-225、946-1255的基因。部分重迭的扩增子以Dpn I切开,混合在一起,并通过无缝克隆将其克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1427的开放阅读框核苷酸序列示于SEQ ID NO:48。由质粒1427编码的氨基酸序列示于SEQ ID NO:49。
质粒1428(Muc1-EMC2A-cCEA-T2A-Tert240)。使用PCR及无缝克隆技术构建质粒1428。首先,使用引物f pmed Nhe Muc及r cCEA 849-820,利用PCR从质粒1431扩增编码人黏蛋白-1的氨基酸2-225、946-1255、EMCV 2A肽及cCEA氨基端一半的基因。使用引物f CEA833-855及r pmed Bgl Ter240,利用PCR从质粒1432扩增编码cCEA羧基端一半、TAV 2A肽及人端粒酶的氨基酸241-1132的基因。部分重迭的扩增子以Dpn I切开,混合在一起,并通过无缝克隆将其克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1428的开放阅读框核苷酸序列示于SEQ ID NO:50。由质粒1428编码的氨基酸序列示于SEQ ID NO:51。
质粒1429(cCEA-T2A-Tert240-ERB2A-Muc1)。使用PCR及无缝克隆技术构建质粒1429。首先,使用引物f pmed CEA D1及r tert 1602-1579利用PCR从质粒1432扩增编码人CEA氨基酸35-144、323-677、TAV 2A肽及人Tert240氨基端一半的基因。使用引物f tert1584-1607及r pmed Bgl Muc利用PCR从质粒1271扩增编码人Tert240羧基端一半、ERBV 2A肽及人黏蛋白-1的氨基酸2-225、946-1255的基因。部分重迭的扩增子以Dpn I切开,混合在一起,并通过无缝克隆将其克隆入pPJV7563的Nhe I/Bgl II位点中。质粒1429的开放阅读框核苷酸序列示于SEQ ID NO:52。由质粒1429编码的氨基酸序列示于SEQ ID NO:53。
1D.载体构建
此实施例说明携带多抗原构建体载体的构建。如在国际专利申请公告案WO2015/063647中所述,从黑猩猩腺病毒AdC68基因组序列构建携带如1424、1425、1426、1427、1428及1429各质粒所携带的相同三抗原构建体(开放阅读框)的载体。这些载体分别称为AdC68Y-1424、AdC68Y-1425、AdC68Y-1426、AdC68Y-1427、AdC68Y-1428及AdC68Y-1429。图1提供这些载体的结构。
AdC68的全长基因组序列可从Genbank得到,登录编号为AC_000011.1,也在WO2015/063647中提供。利用计算机设计,经工程化在病毒中引入E1及E3缺失造成复制缺陷并创造转基因插入空间的无转基因的AdC68骨架(“空载体”)。碱基456-3256及27476-31831缺失的载体AdC68Y,经工程化成为具有比先前AdC68载体改进的生长特性。利用体外寡核苷酸合成(oligo synthesis)及随后重组介导的中间组装,以多阶段程序生物化学地合成空载体,作为大肠杆菌(E.coli)和/或酵母中的人工染色体。分别使用引物对Muc1-20bp-F-98/mCEA-20bp-R-100、Y-mCEA-S2/Y-Tert-A2、Y-Tert-S/Y-CEA-A、Y-Tert-S/Y-MUC-A、Y-MUC-S2/Y-Tert-A2及cCEA-20bp-F-106/Muc1-20BP-R-108利用PCR从质粒1424、1425、1426、1427、1428及1429扩增编码各种免疫原性TAA多肽的开放阅读框。然后将扩增子插入空载体骨架中。利用以PacI酶切从人造细菌染色体释放重组病毒基因组,并将线性化的核酸转染入E1补充的(complimenting)贴壁的HEK293细胞系中。在可看见的细胞病变效应及腺病毒病灶形成后,立即通过多回合冷冻/解冻从细胞释放病毒收获培养物。利用标准技术扩增及纯化病毒。
实施例2.MUC1单抗原构建体的免疫原性
在HLA-A2/DR1小鼠中的研究
研究设计。使用PMED法,以DNA构建体质粒1027(其编码SEQ ID NO:5的膜结合免疫原性MUC1多肽)或质粒1197(其编码SEQ ID NO:7的胞质免疫原性MUC1多肽)在第0天初免12只混合性别的HLA-A2/DR1小鼠并在第14天增强。第21天牺牲小鼠,并以干扰素-γ(IFN-γ)ELISpot及细胞内细胞因子染色(ICS)分析评估脾细胞的MUC1特异性细胞免疫原性。
颗粒介导的表皮递送(PMED)。PMED为给动物或患者施用疫苗的无针方法。PMED系统涉及使DNA沉淀于精微金颗粒上,然后由氦气推进到表皮中。ND10是单次使用装置,使用源自内部圆筒的加压氦气递送金颗粒,而X15是重复器递送装置,使用通过高压软管与X15连在一起的外部氦气罐递送金颗粒。此二装置在研究中均被用以递送MUC1 DNA质粒。金颗粒直径通常为1-3μm并将颗粒配制成每1mg金颗粒含有2μg抗原DNA质粒。(Sharpe,M.etal.:P.Protection of mice from H5N1 influenza challenge by prophylactic DNAvaccination using particle mediated epidermal delivery.Vaccine,2007,25(34):6392-98;Roberts LK,et al.:Clinical safety and efficacy of a powderedHepatitis B nucleic acid vaccine delivered to the epidermis by a commercialprototype device.Vaccine,2005;23(40):4867-78)。
IFN-γELISpot分析。在IFN-γELISpot板中,使源自单个动物的脾细胞与单个Ag特异肽(每种肽2-10μg/ml、每孔2.5-5e5个细胞)或15聚体Ag特异性肽的池(pool)(重迭11个氨基酸,包含全部Ag特异性氨基酸序列;参见表15;每种肽2-5μg/ml,每孔1.25-5e5个细胞)三重复共温育。在37℃、5%CO2温育所述板~16小时,然后依照厂商指示进行洗涤及显影。以CTL读取计计数IFN-γ斑点形成细胞(SFC)的数量。计算三重复的平均值并减去不含肽的阴性对照孔的应答。然后将SFC计数标准化以描述每1e6个脾细胞的应答。表中的抗原特异性应答表示Ag特异性肽或肽池应答的总和。
ICS分析。在U形底96孔板组织培养板中,使源自单个动物的脾细胞及H-2b-、HLA-A2-或HLA-A24-限制性Ag特异肽(每种肽5-10μg/ml,每孔1-2e6个细胞)或15聚体Ag特异性肽的池(重迭11个氨基酸,包含全部Ag特异性氨基酸序列;参见表15;每种肽2-5μg/ml、每孔1-2e6个脾细胞)共温育。在37℃、5%CO2温育所述板~16小时。然后将细胞染色以检测源自CD8+T细胞的细胞内IFN-γ表达并固定。在流式细胞仪上取得细胞。数据以每只动物在扣除不含肽的阴性对照孔中所得应答后的肽Ag-或肽池Ag特异性IFN-γ+CD8+T细胞的频率呈现。
夹心ELISA分析。使用Tecan Evo、Biomek FxPBioTek 405Select TS自动化仪器进行标准夹心ELISA分析。使用在1X PBS中的1.0μg/mL的人MUC1或人CEA蛋白(抗原),以25μl/孔涂覆在384孔微量板(平底孔、高结合力),并在4℃温育过夜。第二天早上,在室温下,以在含0.05%Tween 20的PBS(PBS-T)中的5%FBS封闭所述板1小时。在96U形底孔板中,以在PBS-T中的1/100起始稀释度制备小鼠血清。在PBS-T中,Tecan Evo在9个稀释增量点上进行1/2对数系列稀释,随后从96孔板将25μl/孔经稀释的血清冲压(stamping)至384孔板。室温下,在振荡器上以600RPM温育该等384孔板1小时,然后,使用BioTek EL 405Select TS板洗涤器,以PBS-T洗涤诸板4次。将小鼠抗IgG-HRP二抗稀释至适当稀释度并通过Biomek FxP以25μl/孔冲压至384孔板中,然后在室温下在振荡器上以600RPM温育1小时,随后重复洗涤5次。使用Biomek FxP,将板以25μl/孔的RT TMB基质冲压并在室温下在黑暗中温育30分钟,随后利用25μl/孔1N H2SO4的冲压停止酶促反应。在Molecular Devices的Spectramax340PC/384Plus上,在波长450nm读取诸板。报导数据为OD1.0时的计算效价,检测极限为99.0。在各板中使用抗原特异性的市售单克隆抗体作为阳性对照以追踪板对板的变异表达;以经不相关疫苗接种的小鼠血清作为阴性对照,只有PBS-T的孔用以监测非特异性结合背景值。表中的效价表示从个别动物所引起的抗原特异性IgG效价。
结果。表1显示源自分别与衍生自MUC1肽文库的肽池(参见表15)或MUC1肽aa516-530一起培养的HLA-A2/DR1脾细胞的ELISpot及ICS数据。第3栏的数值表示以MUC1肽池再刺激及扣除背景值后的IFN-γ斑点#/106个脾细胞。第4栏的数值表示以MUC1肽aa516-530再刺激及扣除背景值后CD8+T细胞为IFN-γ+的频率。阳性应答定义为SFC>100且IFN-γ+CD8+T细胞的频率>0.1%。如表1所示,以全长膜结合(质粒1027)及胞质(质粒1197)MUC1构建体制备的免疫原性MUC1多肽能诱导MUC1特异性T细胞应答,包括HLA-A2限制性MUC1肽aa516-530特异性CD8+T细胞应答。胞质MUC1抗原形式诱导最高量级(magnitude)的T细胞应答。重要的是,衍生自癌症患者的针对MUC1肽aa516-530的T细胞应答已显示与体外抗肿瘤效力相互关联(Jochems C et al.,Cancer Immunol Immunother(2014)63:161-174),证明提升针对此特异性抗原决定区d细胞应答的重要性。
表1.HLA-A2/DR1小鼠中由单抗原MUC1 DNA构建体(质粒1027和质粒1197)诱导的T细胞应答
Figure BDA0002402017140000691
在HLA-A24小鼠中的研究
研究设计。通过PMED施用,以DNA构建体质粒1027,在第0天初免混合性别的HLA-A24小鼠并在第14、28及42天增强。第21天牺牲小鼠并评估脾细胞的MUC1特异性细胞免疫原性(ELISpot)。
结果。表2显示源自与衍生自MUC1肽文库的肽池(参见表15)一起培养的HLA-A24脾细胞的ELISpot数据。第3栏的数值表示以MUC1肽池再刺激及扣除背景值后的IFN-γ斑点#/106个脾细胞。呈粗体字的数值指示至少有1个测试的肽池太多而无法计数,因此真实数字至少为所述值。阳性应答定义为SFC>100。如表2所示,膜结合MUC1构建体能诱导MUC1特异性细胞应答。
表2.HLA-A24小鼠中由编码人天然全长膜结合MUC1抗原的单抗原DNA构建体质粒1027诱导的T细胞应答
Figure BDA0002402017140000701
在猴子中的研究
研究设计。通过双侧肌内注射(总共1mL)2e11个病毒颗粒,以编码胞质型MUC1多肽(由质粒1197编码的相同多肽)或全长膜结合MUC1多肽(由质粒1027编码的相同多肽)的腺病毒载体AdC68W初免14只源自中国的食蟹猴(cynomolgus macaques)。29天后,通过电穿孔由肌内双侧递送质粒1197或质粒1027(总共2mL)以增强动物。在第1天(32mg)及第29天(50mg)皮下施用抗CTLA-4。最后一次免疫14天后,进行动物采血并分离PBMC及血清以分别评估MUC1特异性细胞(ELISpot、ICS)及体液(ELISA)应答。在此及本公开的其他实施例中所用的腺病毒载体AdC68W是根据国际专利申请WO2015/063647中所述方法从黑猩猩腺病毒AdC68构建。
NHP特异性免疫分析
ELISpot分析。在IFN-γELISpot板中,使源自单个动物的PBMC及15聚体Ag特异性肽的池(重迭11个氨基酸,包含全部Ag特异性氨基酸序列),每种肽2μg/ml,每孔4e5个细胞,二重复共温育(参见表15)。在37℃、5%CO2温育所述板~16小时,然后依照厂商指示进行洗涤及显影。以CTL读取计计数IFN-γ斑点形成细胞(SFC)的数量。计算二重复的平均值并减去不含肽的阴性对照孔的应答。然后将SFC计数标准化以描述每1e6个PBMC的应答。表中的抗原特异性应答表示Ag特异性肽池应答的总和。
ICS分析。在U形底96孔板组织培养板中,使源自单个动物的PBMC及15聚体MUC1肽的池(重迭11个氨基酸,包含全部天然全长MUC1氨基酸序列;参见表15),各肽2μg/ml,每孔1.5-2e6个PBMC共温育。在37℃、5%CO2温育所述板~16小时,然后染色以检测源自CD8+T细胞的细胞内IFN-γ表达。固定后,在流式细胞仪上取得细胞。结果以每只单个动物扣除不含肽的阴性对照孔中所得到的应答并标准化至1e6个CD8+T细胞后的MUC1、CEA或TERT-特异性的IFN-γ+CD8+T细胞的数值呈现。
夹心ELISA分析。使用Tecan Evo、Biomek FxPBioTek 405Select TS自动化仪器进行标准夹心ELISA分析。使用在1X PBS中的1.0μg/mL人MUC1或人CEA蛋白(抗原),以25μl/孔涂覆在384孔微量板(平底孔、高结合力),并在4℃温育过夜。第二天早上,在室温下,以在含0.05%Tween 20的PBS(PBS-T)中的5%FBS封闭该等板1小时。在96U形底孔板中,以1/100起始稀释度在PBS-T中制备源自中国的食蟹猴血清。在PBS-T中,Tecan Evo在9个稀释增量点上进行1/2对数系列稀释,随后从96孔板将25μl/孔经稀释的血清冲压至384孔板。室温下,在振荡器上以600RPM温育该等384孔板1小时,然后,使用BioTek EL 405Select TS板洗涤器,以PBS-T洗涤各板4次。将与食蟹猴IgG交叉反应的恒河猴抗IgG-HRP二抗稀释至适当稀释度并通过Biomek FxP以25μl/孔冲压至384孔板中,然后在室温下在振荡器上以600RPM温育1小时,随后重复洗涤5次。使用Biomek FxP,将板以25μl/孔的RT TMB基质冲压并在室温下在黑暗中温育30分钟,随后利用25μl/孔1N H2SO4的冲压停止酶促反应。在MolecularDevices的Spectramax 340PC/384Plus上,在波长450nm读取各板。报导数据为OD 1.0时的计算效价,检测极限为99.0。在各板中使用抗原特异性的市售单克隆抗体作为阳性对照以追踪板对板的变异表达;以经不相关疫苗接种的小鼠血清作为阴性对照,只有PBS-T的孔用以监测非特异性结合背景值。表中的效价表示从个别动物所引起的抗原特异性IgG效价。
结果。表3显示源自与衍生自MUC1肽文库的肽池(表15)一起培养的源自中国食蟹猴PBMC的ELISpot及ICS数据,及源自中国的食蟹猴血清的ELISA数据。第3栏的数值表示以MUC1肽池再刺激及扣除背景值后的IFN-γ斑点#/106个PBMC。第4栏的数值表示以MUC1肽池再刺激及扣除背景值后的IFN-γ+CD8+T细胞#/106个CD8+T细胞。第5栏的数值表示抗MUC1IgG效价(光密度(O.D)=1,检测极限(L.O.D)=99.0)。阳性应答定义为SFC>50、IFN-γ+CD8+T细胞数/1e6个CD8+T细胞>50且IgG效价>99。如表3所示,以胞质(质粒1197)及天然全长膜结合(质粒1027)MUC1构建体制备的免疫原性MUC1多肽能诱导MUC1特异性T及B细胞应答。天然全长膜结合MUC1构建体(质粒1027)显示诱导整体最佳MUC1特异性细胞及体液应答。
表3.中国源的食蟹猴中由单抗原腺病毒AdC68W载体和单抗原DNA构建体(质粒1197;质粒1027)诱导的T细胞和B细胞应答
Figure BDA0002402017140000721
实施例3.CEA单抗原构建体的免疫原性
在Pasteur(HLA-A2/DR1)小鼠中的免疫应答研究
研究设计。通过电穿孔,在第0天以携带编码人膜结合(质粒1386)或胞质CEA多肽(质粒1390)的单抗原构建体的质粒初免混合性别的HLA-A2/DR1小鼠并在第14天增强。7天后在IFN-γELISpot及ICS分析中测量抗原特异性T细胞应答。
结果。表4显示HLA-A2/DR1脾细胞的ELISpot及ICS数据,所述皮细胞与衍生自CEA肽文库的肽池(亦参见表15)一起培养,该CEA肽文库由aa1-699(用于以构建体1386所免疫小鼠)及的aa37-679(移除信号序列及GPI序列)(用于以质粒1390所免疫小鼠)组成。第3及第4栏的数值分别表示在以相关的CEA肽池再刺激及扣除背景值后所引起的IFN-γ+斑点#/106个脾细胞及IFN-γ+CD8+T细胞的频率。表5显示与CEA肽aa693-701一起培养的HLA-A2/DR1脾细胞的ELISpot数据。阳性应答定义为SFC>100且IFN-γ+CD8+T细胞的频率>0.1%。如表4所示,以上文实施例1A所述膜结合(质粒1386)及胞质(质粒1390)CEA构建体制备的免疫原性CEA多肽能诱导CEA特异性T细胞应答。膜结合及胞质CEA抗原形式二者均诱导可相较强度的CEA特异性T细胞应答。如表5所示,以膜结合构建体1386免疫可诱导针对CEA肽aa693-701的HLA-A2限制性T细胞应答,其已在文献中显示由HLA-A2加工及呈递(Conforti A etal.,J Immunother(2009)32:744-754)。
表4.HLA-A2/DR1小鼠中由编码人膜结合或人胞质的CEA多肽的单抗原DNA构建体(质粒1386和1390)诱导的T细胞应答
Figure BDA0002402017140000731
Figure BDA0002402017140000741
表5.HLA-A2小鼠中由编码人膜结合的CEA多肽的单抗原DNA构建体(质粒1386;mCEA)诱导的HLA-A2-限制性CEA肽aa693-701-特异性T细胞应答
Figure BDA0002402017140000742
在HLA-A24小鼠中的免疫应答研究
研究设计。通过DNA电穿孔法,以人膜结合(质粒1386)或胞质CEA(质粒1390)DNA构建体在第0天初免16只混合性别的HLA-A24小鼠并在第14天增强。最后一次免疫7天后,在IFN-γELISpot及ICS分析中测量CEA特异性T细胞应答。
结果。表6显示与衍生自CEA肽文库的肽池(亦参见表15)一起培养的HLA-A24脾细胞的ELISpot及ICS数据。第3栏的数值表示,在用涵盖aa1-699的CEA肽池再刺激及扣除背景值后的IFN-γ斑点#/106个脾细胞。第4栏的数值表示,在用涵盖aa37-679的CEA肽池再刺激及扣除背景值后CD8+T细胞为IFN-γ+的频率。阳性应答定义为SFC>100且IFN-γ+CD8+T细胞的频率>0.1%。呈粗体字的数值指示至少有1个测试的肽池太多而无法计数,因此真实数字至少为所述值。如表6所示,以膜结合(质粒1386)及胞质CEA(质粒1390)构建体制备的免疫原性CEA多肽,如通过ELISpot所测量,能诱导可相较的CEA特异性细胞应答。然而,通过ICS测量,用胞质CEA构建体(质粒1390)疫苗接种,诱导较强的CEA特异性的IFN-γ+CD8+T细胞应答。
表6.HLA-A24小鼠中由单抗原DNA构建体诱导的T细胞应答
Figure BDA0002402017140000751
实施例4.TERT单抗原构建体的免疫原性
在HLA-A2/DR1小鼠上中的免疫应答研究
研究设计。通过肌内注射(50μl)1e10个病毒颗粒,以编码截短的(Δ240)胞质型免疫原性TERT多肽的AdC68W腺病毒载体(质粒1112)初免6只混合性别的HLA-A2/DR1小鼠。28天后,通过电穿孔(2x20μl)由肌内双侧递送编码截短的(Δ240)胞质型TERT抗原的50μgDNA(质粒1112)以增强动物。7天后在IFN-γELISpot及ICS分析中测量抗原特异性T细胞应答。
结果。表7显示源自分别与衍生自TERT肽文库的肽池(亦参见表15)或TERT肽aa861-875一起培养的HLA-A2/DR1脾细胞的ELISpot及ICS数据。第3栏的数值表示以TERT肽池再刺激及扣除背景值后的IFN-γ斑点#/106个脾细胞。第4栏的数值表示以TERT肽aa861-875再刺激及扣除背景值后CD8+T细胞为IFN-γ+的频率。阳性应答定义为SFC>100且IFN-γ+CD8+T细胞的频率>0.1%。如表7所示,以截短的(Δ240)胞质型TERT构建体制备的免疫原性TERT多肽能诱导HLA-A2-限制性的TERT特异性CD8+T细胞应答。
表7.HLA-A2/DR1小鼠中由单抗原腺病毒AdC68W和单抗原DNA构建体(质粒1112),编码人截短的(Δ240)胞质TERT抗原,诱导的T细胞应答
Figure BDA0002402017140000761
在HLA-A24小鼠中的免疫应答研究
研究设计。通过双侧肌内注射(各胫骨前肌50μl)总共1e10个病毒颗粒,以编码截短的(Δ240)胞质型TERT多肽(由质粒1112编码的相同多肽)的AdC68W腺病毒载体初免8只混合性别的HLA-A24小鼠。14天后,通过电穿孔(2×20μl)由肌内双侧递送编码截短的(Δ240)胞质型TERT多肽的50μg DNA(质粒1112)以增强动物。7天后在IFN-γELISpot及ICS分析中测量抗原特异性T细胞应答。
结果。表8显示源自分别与衍生自TERT肽文库的肽池(亦参见表15)或TERT肽aa841-855一起培养的HLA-A24脾细胞的IFN-γELISpot及ICS数据。第3栏的数值表示以TERT肽池再刺激及扣除背景值后的IFN-γ斑点#/106个脾细胞。第4栏的数值表示以TERT肽aa841-855再刺激及扣除背景值后CD8+T细胞为IFN-γ+的频率。呈粗体字的数值指示至少有1个测试的肽池太多而无法计数,因此真实数字至少为所述值。阳性应答定义为SFC>100且IFN-γ+CD8+T细胞的频率>0.1%。如表8所示,以截短的(Δ240)胞质型TERT构建体(质粒1112)制备的免疫原性TERT多肽能诱导HLA-A24-限制性的TERT特异性CD8+T细胞应答。
表8.HLA-A24小鼠中由单抗原腺病毒AdC68W和单抗原DNA构建体(质粒1112),编码人截短的(Δ240)胞质的TERT抗原,诱导的T细胞应答
Figure BDA0002402017140000771
在猴子中的免疫应答研究
研究设计。通过双侧肌内注射(总共1mL)2e11个病毒颗粒,以编码截短的(Δ240)胞质型TERT抗原的AdC68W腺病毒载体(质粒1112)初免8只源自中国的食蟹猴。30及64天后,通过电穿孔由肌内双侧递送(总共2mL)编码截短的(Δ240)胞质TERT抗原的DNA(质粒1112)以增强动物。在第1天(32mg)、第31天(50mg)及第65天(75mg)皮下施用抗CTLA-4。最后一次免疫14天后,进行动物采血并分离PBMC以评估TERT特异性细胞(ELISpot、ICS)应答。
结果。表9显示源自与衍生自TERT肽文库的肽池(亦参见表15)一起培养的源自中国的食蟹猴PBMC的ELISpot及ICS数据。第3栏的数值表示以TERT肽池再刺激及扣除背景值后的IFN-γ斑点#/106个脾细胞。第4栏的数值表示以TERT肽池再刺激及扣除背景值后的IFN-γ+CD8+T细胞#/106个CD8+T细胞。阳性应答定义为SFC>50且IFN-γ+CD8+T细胞数/1e6个CD8+T细胞>50。如表9所示,以截短的(Δ240)胞质型TERT构建体(质粒1112)制备的免疫原性TERT多肽能诱导TERT特异性T细胞应答。
表9.中国源的食蟹猴中由TERT单抗原腺病毒AdC68W和TERT单抗原DNA构建体(质粒1112)诱导的T细胞应答
Figure BDA0002402017140000772
Figure BDA0002402017140000781
实施例5.双抗原构建体的免疫原性
在猴子中的免疫应答研究
研究设计。通过双侧肌内注射(总共1mL)2e11个病毒颗粒,以编码人天然全长膜结合MUC1(MUC1)及人截短的(Δ240)胞质TERT(TERTΔ240)多肽(质粒1270、1271及1269)的双抗原腺病毒AdC68W载体初免24只源自中国的食蟹猴。30及64天后,通过电穿孔由肌内双侧递送(总共2mL)编码相同两种抗原的双抗原DNA构建体(质粒1270、1271及1269)以增强动物。在第1天(32mg)、第31天(50mg)及第65天(75mg)皮下施用抗CTLA-4。最后一次免疫14天后,进行动物采血且分离PBMC及血清以分别评估MUC1及TERT特异性细胞(ELISpot、ICS)与MUC1特异性体液(ELISA)应答。总共,评估共表达两种抗原的三种不同的双抗原疫苗构建体:a)MUC1-2A-TERTΔ240(质粒1270),AdC68W载体及DNA质粒,编码通过2A肽连接MUC1与TERT;b)TERTΔ240-2A-MUC1(质粒1271),AdC68W载体及DNA质粒,编码通过2A肽连接TERT与MUC1;c)MUC1-TERTΔ240(质粒1269),AdC68W载体及DNA质粒,编码MUC1-TERT融合蛋白。
结果。表10显示源自与衍生自MUC1及TERT肽文库的肽池(亦参见表15)一起培养的源自中国的食蟹猴PBMC的ELISpot及ICS数据,及源自中国的食蟹猴血清的ELISA数据。阳性应答定义为SFC>50、IFN-γ+CD8+T细胞数/1e6个CD8+T细胞>50及IgG效价>99。第3及6栏的数值分别表示以MUC1及TERT肽池再刺激及扣除背景值后的IFN-γ斑点#/106个脾细胞。呈粗体字的数值指示至少有1个测试的肽池太多而无法计数,因此真实数字至少为所述值。第4及7栏的数值分别表示以MUC1肽池及TERT肽池再刺激及扣除背景值后的IFN-γ+CD8+T细胞#/106个CD8+T细胞。第5栏的数值表示抗MUC1的IgG效价(光密度(O.D)=1、检测极限(L.O.D)=99.0)。如表10所示,以MUC1及TERT表达双抗原构建体(质粒1270、1271及1269)制备的免疫原性MUC1及TERT多肽能诱导MUC1及TERT特异性T细胞应答与MUC1特异性B细胞应答。编码MUC1-TERT融合蛋白的双抗原构建体1269显示诱导最强的整体MUC1特异性细胞应答;对照之下,双抗原构建体质粒1271(TERT-2A-MUC1)显示诱导最强的整体TERT特异性细胞应答。三种双抗原构建体全部显示诱导相当的MUC1特异性体液应答。
表10.中国源的食蟹猴中由双抗原腺病毒AdC68W和单抗原DNA构建体(质粒1270、1271和1269),编码免疫原性MUC1和TERT多肽,诱导的免疫应答
Figure BDA0002402017140000791
Figure BDA0002402017140000801
实施例6.三抗原构建体的免疫原性
实施例6说明携带表达人天然全长膜结合MUC1多肽(MUC1)、人膜结合或胞质CEA多肽(mCEA或cCEA)及人截短的(Δ240)胞质TERT多肽(TERTΔ240)的三抗原构建体的质粒及腺病毒载体引起对全部三种经编码癌抗原的Ag特异性的T及B细胞应答的能力。
在C57BL/6J小鼠中使用DNA电穿孔法的免疫应答研究
研究设计。以编码人MUC1、mCEA或cCEA及TERTΔ240的三抗原DNA构建体免疫48只C57BL/6J母小鼠。在初免/增强疗法中,伴随电穿孔由肌内双侧递送(各胫骨前肌总共20μl)三抗原DNA疫苗(50μg),各疫苗接种间间隔两周。最后一次免疫7天后,分别在IFN-γELISpot分析及ELISA分析中测量MUC1、CEA及TERT特异性细胞应答与MUC1及CEA特异性体液应答。总共,使用六种不同的携带各编码由2A肽连接的三种TAA多肽的三抗原DNA构建体的质粒如下:MUC1-2A-TERTΔ240-2A-mCEA(质粒1424)、mCEA-2A-MUC1-2A-TERTΔ240(质粒1425)、TERTΔ240-2A-MUC1-2A-mCEA(质粒1426)、TERTΔ240-2A-mCEA-2A-MUC1(质粒1427)、MUC1-2A-cCEA-2A-TERTΔ240(质粒1428)、cCEA-2A-TERTΔ240-2A-MUC1(质粒1429)。
结果。表11A-C显示与衍生自MUC1、CEA及TERT肽文库的肽池(亦参见表15)一起培养的C57BL/6J脾细胞的ELISpot数据、与TERT肽aa1025-1039一起培养的C57BL/6J脾细胞的ICS数据及C57BL/6J小鼠血清的ELISA数据。阳性应答定义为SFC>100、IFN-γ+CD8+T细胞的频率>0.1%及IgG效价>99。表11A-C第3栏的数值分别表示在以MUC1、CEA或TERT肽池再刺激及扣除背景值后的IFN-γ斑点#/106个脾细胞。呈粗体字的数值指示至少有1个测试的肽池太多而无法计数,因此真实数字至少为所述值。表11A-B第4栏的数值分别表示抗MUC1及CEAIgG效价(光密度(O.D)=1、检测极限(L.O.D)=99.0)。表11C第4栏的数值表示以TERT特异性肽TERT aa1025-1039再刺激及扣除背景值后CD8+T细胞为IFN-γ+的频率。如表11A-C所示,以表达MUC1、CEA及TERT的三抗原构建体制备的免疫原性MUC1、CEA及TERT多肽能诱导针对全部三种抗原的T细胞应答及针对MUC1的B细胞应答。相比之下,尽管含mCEA的三抗原构建体(质粒1424-1427)能诱导针对CEA的B细胞应答,然而含cCEA的三抗原构建体(质粒1428-1429)诱导较弱或者不具CEA特异性的B细胞应答。
表11A.C57BL/6J小鼠中由编码人天然全长膜结合MUC1、人膜结合或胞质CEA和人截短的(Δ240)胞质TERT多肽的三抗原DNA构建体(质粒1424-1429)诱导的MUC1-特异性T细胞和B细胞应答
Figure BDA0002402017140000811
Figure BDA0002402017140000821
表11B.C57BL/6J小鼠中由编码人天然全长膜结合MUC1、人膜结合或胞质CEA和人截短的(Δ240)胞质TERT多肽的三抗原DNA构建体(质粒1424-1429)诱导的CEA-特异性T细胞和B细胞应答
Figure BDA0002402017140000822
Figure BDA0002402017140000831
表11C.C57BL/6J小鼠中由编码人天然全长膜结合MUC1、人膜结合或胞质CEA和人截短的(Δ240)胞质TERT多肽的三抗原DNA构建体(质粒1424-1429)诱导的TERT-特异性T细胞和B细胞应答
Figure BDA0002402017140000832
Figure BDA0002402017140000841
Figure BDA0002402017140000851
在C57BL/6J小鼠中使用腺病毒载体的免疫应答的研究
研究设计。通过肌内注射(各胫骨前肌50μl)1e10个病毒颗粒,以编码人MUC1、mCEA或cCEA及TERTΔ240的三抗原腺病毒载体初免48只C57BL/6J母小鼠。14天后,伴随电穿孔由肌内双侧递送(各胫骨前肌20μl)三抗原DNA构建体(50μg)以增强动物。最后一次免疫7天后,分别在IFN-γELISpot及ICS分析与ELISA分析中测量MUC1、CEA及TERT特异性细胞应答与MUC1及mCEA特异性体液应答。总共,使用六种三抗原腺病毒及编码由2A肽连接的MUC1、mCEA或cCEA及TERTΔ240的DNA构建体如下:MUC1-2A-TERTΔ240-2A-mCEA(质粒1424)、mCEA-2A-MUC1-2A-TERTΔ240(质粒1425)、TERTΔ240-2A-MUC1-2A-mCEA(质粒1426)、TERTΔ240-2A-mCEA-2A-MUC1(质粒1427)、MUC1-2A-cCEA-2A-TERTΔ240(质粒1428)、cCEA-2A-TERTΔ240-2A-MUC1(质粒1429)。
结果。表12A-C显示与衍生自MUC1、CEA及TERT肽文库的肽池(亦参见表15)一起培养的C57BL/6J脾细胞的ELISpot数据、与TERT肽aa1025-1039一起培养的C57BL/6J脾细胞的ICS数据及源自C57BL/6J小鼠血清的ELISA数据。阳性应答定义为SFC>100、IFN-γ+CD8+T细胞的频率>0.1%及IgG效价>99。表12A-C第3栏数的值分别表示在以MUC1、CEA或TERT肽池再刺激及扣除背景值后的IFN-γ斑点#/106个脾细胞。呈粗体字的数值指示至少有1个测试的肽池太多而无法计数,因此真实数字至少为所述值。表12C第4栏的数值表示以TERT特异性肽TERT aa1025-1039再刺激及扣除背景值后的IFN-γ+CD8+T细胞#/106个CD8+T细胞。表12A-B第4栏的数值分别表示抗MUC1及抗CEA的IgG效价(光密度(O.D)=1、检测极限(L.O.D)=99.0)。如表12A-C所示,以表达MUC1、CEA及TERT的三抗原构建体制备的免疫原性MUC1、CEA及TERT多肽能诱导针对全部三种抗原的T细胞应答及针对MUC1的B细胞应答。相比之下,尽管含mCEA的三抗原构建体(质粒1424-1427)能诱导针对CEA的B细胞应答,然而含cCEA的三抗原构建体(质粒1428-1429)诱导较弱或者不具CEA特异性的B细胞应答。
表12A.C57BL/6J小鼠中由三抗原腺病毒AdC68Y和三抗原DNA构建体(质粒1424-1429),编码人天然全长膜结合MUC1、人膜结合或胞质CEA和人截短的(Δ240)胞质TERT多肽,诱导的MUC1-特异性T细胞和B细胞应答
Figure BDA0002402017140000861
Figure BDA0002402017140000871
表12B.C57BL/6J小鼠中由三抗原腺病毒AdC68Y和三抗原DNA构建体(质粒1424-1429),编码人天然全长膜结合MUC1、人膜结合或胞质CEA和人截短的(Δ240)胞质TERT多肽,诱导的CEA-特异性T细胞和B细胞应答
Figure BDA0002402017140000872
Figure BDA0002402017140000881
表12C.C57BL/6J小鼠中由三抗原腺病毒AdC68Y和三抗原DNA构建体(质粒1424-1429),编码人天然全长膜结合MUC1、人膜结合或胞质CEA和人截短的(Δ240)胞质TERT多肽,诱导的TERT-特异性T细胞和B细胞应答
Figure BDA0002402017140000882
Figure BDA0002402017140000891
在HLA-A24小鼠中的免疫应答的研究
研究设计。通过肌内注射(各胫骨前肌50μl)1e10个病毒颗粒,以编码人MUC1、mCEA或cCEA及TERTΔ240的腺病毒AdC68Y三抗原构建体(质粒1426:TERTΔ240-2A-MUC1-2A-mCEA或质粒1428:MUC1-2A-cCEA-2A-TERTΔ240)初免16只混合性别的HLA-A24小鼠。14天后,以编码相同三种抗原的50μg三抗原DNA构建体(质粒1426或1428)由肌内增强动物(伴随电穿孔递送20μl至各胫骨前肌)。最后一次免疫7天后,在IFN-γELISpot分析中测量HLA-A24-限制性MUC1特异性细胞应答。
结果。表13显示与MUC1肽aa524-532一起培养的HLA-A24脾细胞的ELISpot数据。阳性应答定义为SFC>50。第3栏的数值表示以MUC1肽aa524-532再刺激及扣除背景值后的IFN-γ斑点#/106个脾细胞。如表13所示,以表达MUC1、CEA及TERT的三抗原构建体1426及1428制备的免疫原性MUC1多肽能诱导HLA-A24-限制性MUC1肽aa524-532特异性CD8+T细胞应答。重要的是,衍生自癌症患者的针对此特异性MUC1肽的T细胞应答已显示与体外抗肿瘤效力相关联(Jochems C et al.,Cancer Immunol Immunother(2014)63:161-174),证明提升针对此特异性抗原决定区细胞应答的重要性。
表13.HLA-A24小鼠中由三抗原腺病毒和DNA构建体质粒1426(TERTΔ240-2A-MUC1-2A-mCEA)和质粒1428(MUC1-2A-cCEA-2A-TERTΔ240),编码人天然全长膜结合MUC1、人膜结合或胞质CEA和人截短的(Δ240)胞质TERT多肽,诱导的HLA-A24-限制性MUC1肽aa524-532-特异性T细胞应答
Figure BDA0002402017140000901
在猴子中的免疫应答研究
研究设计。在第1天,通过双侧肌内注射(总共1mL)2e11个病毒颗粒,以编码人天然全长膜结合MUC1(MUC1)、人膜结合或胞质CEA(mCEA或cCEA)及人截短的(Δ240)胞质型TERT(TERTΔ240)抗原的AdC68Y腺病毒载体初免42只源自中国的食蟹猴。第30天及第57天,通过电穿孔由肌内双侧递送编码相同三种抗原的DNA(总共2mL)以增强动物。在第1天(32mg)、第30天(50mg)及第57天(75mg)皮下施用抗CTLA-4。最后一次免疫15天后,进行动物采血且分离PBMC及血清以分别评估MUC1、CEA及TERT特异性细胞(ELISpot、ICS)与MUC1及mCEA特异性体液(ELISA)应答。总共,评估六种编码由2A肽连接的MUC1、mCEA或cCEA及TERTΔ240的三抗原腺病毒及DNA构建体如下:MUC1-2A-TERTΔ240-2A-mCEA(质粒1424)、mCEA-2A-MUC1-2A-TERTΔ240(质粒1425)、TERTΔ240-2A-MUC1-2A-mCEA(质粒1426)、TERTΔ240-2A-mCEA-2A-MUC1(质粒1427)、MUC1-2A-cCEA-2A-TERTΔ240(质粒1428)、cCEA-2A-TERTΔ240-2A-MUC1(质粒1429)。
结果。表14A、14B及14C显示源自与衍生自MUC1、CEA及TERT肽文库的肽池(亦参见表15)一起培养的源自中国的食蟹猴PBMC的ELISpot及ICS数据,及源自中国的食蟹猴血清的ELISA数据。阳性应答定义为SFC>50、IFN-γ+CD8+T细胞数/1e6个CD8+T细胞>50及IgG效价>99。表14A-C第3栏的数值分别表示以MUC1、CEA或TERT肽池再刺激及扣除背景值后的IFN-γ斑点#/106个脾细胞。呈粗体字的数值指示至少有1个测试的肽池太多而无法计数,因此真实数字至少为所述值。表14A-C第4栏的数值分别表示以MUC1、CEA或TERT肽池再刺激及扣除背景值后的IFN-γ+CD8+T细胞#/106个CD8+T细胞。表14A-B第5栏的数值分别表示抗MUC1及抗CEA的IgG效价(光密度(O.D)=1、检测极限(L.O.D)=99.0)。如表14A-C所示,以表达MUC1、CEA及TERT的三重Ag构建体制备的免疫原性MUC1、CEA及TERT多肽能诱导细胞针对全部三种抗原的细胞应答及针对MUC1的体液应答。然而,含mCEA的三抗原构建体比含cCEA的诱导更强的CEA特异性B细胞应答。
表14A.中国源食蟹猴中由三抗原腺病毒AdC68Y和DNA构建体(质粒1424-1429),编码人天然全长膜结合MUC1、人膜结合或胞质CEA和人截短的(Δ240)胞质TERT多肽,诱导的MUC1-特异性T细胞和B细胞应答
Figure BDA0002402017140000921
Figure BDA0002402017140000931
表14B.中国源食蟹猴中由三抗原腺病毒AdC68Y和DNA构建体(质粒1424-1429),编码人天然全长膜结合MUC1、人膜结合或胞质CEA和人截短的(Δ240)胞质TERT多肽,诱导的CEA-特异性T细胞和B细胞应答
Figure BDA0002402017140000932
Figure BDA0002402017140000941
表14C.中国源食蟹猴中由三抗原腺病毒AdC68Y和DNA构建体(质粒1424-1429),编码人天然全长膜结合MUC1、人膜结合或胞质CEA和人截短的(Δ240)胞质TERT多肽,诱导的TERT-特异性T细胞和B细胞应答
Figure BDA0002402017140000942
Figure BDA0002402017140000951
表15.衍生自人肿瘤相关抗原(TAA)MUC1、CEA和TERT的肽池
Figure BDA0002402017140000952
Figure BDA0002402017140000961
表16.用于质粒构建的引物
Figure BDA0002402017140000962
Figure BDA0002402017140000971
Figure BDA0002402017140000981
表17.2A-肽序列
Figure BDA0002402017140000982
Figure BDA0002402017140000991
表18.序列索引
Figure BDA0002402017140000992
Figure BDA0002402017140001001
Figure BDA0002402017140001011
原始序列表(部分)
SEQ ID NO:42.质粒1424ORF(核苷酸序列)
atggctagcacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctgggatccggcacaatcctgtctgagggcgccaccaacttcagcctgctgaaactggccggcgacgtggaactgaaccctggccctggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccgggacgcaccaggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcgagggagcgttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccagaccgccacggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaaggaacagcttcggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggacggatccggccagtgcaccaattacgccctgctgaagctggccggcgacgtggaatctaaccctggccctgaatcgccaagcgcaccccctcatcggtggtgcatcccttggcaacgcctcctcctgaccgcctcactgctgactttctggaacccgccgaccaccgcaaagctgaccattgagagcactcccttcaacgtggctgaggggaaggaggtgctgctcctggtgcacaatctgccccagcacctgttcgggtactcctggtacaagggagaacgcgtggacgggaaccggcagatcataggctacgtcatcggaacccagcaggccacacccggtccagcgtacagcggccgggagattatctacccgaacgcctccctgctgatccaaaacatcatccagaacgacaccggtttctacactctgcacgtgattaagtcagatctggtcaacgaagaggccaccggccaattcagggtgtaccccgaactccctaagccgttcatcacctcgaacaacagcaacccggtcgaggatgaagatgcggtggccttgacgtgcgaacctgagatccagaacaccacctacttgtggtgggtgaacaatcagagcctgccagtctccccacgactccagctgtcgaacgacaacaggaccctgactttgctgtccgtgactcggaacgacgtgggcccttatgaatgcggtatccagaacaagctgtccgtggaccacagcgaccctgtgatcctgaacgtcctttacgggccggacgaccccaccatttccccgtcgtacacttactaccggccgggcgtgaacctgtccctgtcgtgccacgctgcctccaatccgccggcccagtactcctggctcatcgacggaaacatccagcagcacacccaagaactgttcatctccaacattaccgagaaaaactcgggactttacacctgtcaagccaacaattccgccagcggccactcccgcaccactgtcaaaactatcactgtgtccgccgaactcccgaagcccagcatcagctccaacaactcgaagcccgtggaggataaggacgctgtcgcgttcacctgtgaaccagaggcacagaataccacctacctttggtgggtcaacggacagtccctgcctgtctcaccgagactgcagctgtcaaacgggaataggactctgaccttgtttaacgtcacccggaacgacgcccgggcctacgtgtgcggcatccagaactccgtgagcgcaaaccggtctgacccagtgaccctggatgtgctgtacggccccgacactccgatcatttcaccccccgattcatcctacctgtccggcgctaacctcaacctctcatgccactccgcatccaaccccagcccgcaatattcgtggcgcattaacggaattcctcagcaacatacccaggtcctgttcattgcgaagatcacccctaacaacaacggaacctacgcctgctttgtgtcaaacctggccactggtagaaacaactccatcgtgaagtccattaccgtgtcggcgtccggaacttccccgggcctgagcgccggcgccaccgtgggaattatgatcggcgtgctcgtgggagtggccctgatc
SEQ ID NO:43.质粒1424多肽
MASTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTPPAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPALGSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLASHSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQCRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLSYTNPAVAAASANLGSGTILSEGATNFSLLKLAGDVELNPGPGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILDGSGQCTNYALLKLAGDVESNPGPESPSAPPHRWCIPWQRLLLTASLLTFWNPPTTAKLTIESTPFNVAEGKEVLLLVHNLPQHLFGYSWYKGERVDGNRQIIGYVIGTQQATPGPAYSGREIIYPNASLLIQNIIQNDTGFYTLHVIKSDLVNEEATGQFRVYPELPKPFITSNNSNPVEDEDAVALTCEPEIQNTTYLWWVNNQSLPVSPRLQLSNDNRTLTLLSVTRNDVGPYECGIQNKLSVDHSDPVILNVLYGPDDPTISPSYTYYRPGVNLSLSCHAASNPPAQYSWLIDGNIQQHTQELFISNITEKNSGLYTCQANNSASGHSRTTVKTITVSAELPKPSISSNNSKPVEDKDAVAFTCEPEAQNTTYLWWVNGQSLPVSPRLQLSNGNRTLTLFNVTRNDARAYVCGIQNSVSANRSDPVTLDVLYGPDTPIISPPDSSYLSGANLNLSCHSASNPSPQYSWRINGIPQQHTQVLFIAKITPNNNGTYACFVSNLATGRNNSIVKSITVSASGTSPGLSAGATVGIMIGVLVGVALI
SEQ ID NO:44.质粒1425ORF(核苷酸序列)
atggctagcgaatcgccaagcgcaccccctcatcggtggtgcatcccttggcaacgcctcctcctgaccgcctcactgctgactttctggaacccgccgaccaccgcaaagctgaccattgagagcactcccttcaacgtggctgaggggaaggaggtgctgctcctggtgcacaatctgccccagcacctgttcgggtactcctggtacaagggagaacgcgtggacgggaaccggcagatcataggctacgtcatcggaacccagcaggccacacccggtccagcgtacagcggccgggagattatctacccgaacgcctccctgctgatccaaaacatcatccagaacgacaccggtttctacactctgcacgtgattaagtcagatctggtcaacgaagaggccaccggccaattcagggtgtaccccgaactccctaagccgttcatcacctcgaacaacagcaacccggtcgaggatgaagatgcggtggccttgacgtgcgaacctgagatccagaacaccacctacttgtggtgggtgaacaatcagagcctgccagtctccccacgactccagctgtcgaacgacaacaggaccctgactttgctgtccgtgactcggaacgacgtgggcccttatgaatgcggtatccagaacaagctgtccgtggaccacagcgaccctgtgatcctgaacgtcctttacgggccggacgaccccaccatttccccgtcgtacacttactaccggccgggcgtgaacctgtccctgtcgtgccacgctgcctccaatccgccggcccagtactcctggctcatcgacggaaacatccagcagcacacccaagaactgttcatctccaacattaccgagaaaaactcgggactttacacctgtcaagccaacaattccgccagcggccactcccgcaccactgtcaaaactatcactgtgtccgccgaactcccgaagcccagcatcagctccaacaactcgaagcccgtggaggataaggacgctgtcgcgttcacctgtgaaccagaggcacagaataccacctacctttggtgggtcaacggacagtccctgcctgtctcaccgagactgcagctgtcaaacgggaataggactctgaccttgtttaacgtcacccggaacgacgcccgggcctacgtgtgcggcatccagaactccgtgagcgcaaaccggtctgacccagtgaccctggatgtgctgtacggccccgacactccgatcatttcaccccccgattcatcctacctgtccggcgctaacctcaacctctcatgccactccgcatccaaccccagcccgcaatattcgtggcgcattaacggaattcctcagcaacatacccaggtcctgttcattgcgaagatcacccctaacaacaacggaacctacgcctgctttgtgtcaaacctggccactggtagaaacaactccatcgtgaagtccattaccgtgtcggcgtccggaacttccccgggcctgagcgccggcgccaccgtgggaattatgatcggcgtgctcgtgggagtggccctgatcggatccggcgagggcagaggcagcctgctgacatgtggcgacgtggaagagaaccctggccccacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctgggatccggcacaatcctgtctgagggcgccaccaacttcagcctgctgaaactggccggcgacgtggaactgaaccctggccctggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccgggacgcaccaggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcgagggagcgttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccagaccgccacggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaaggaacagcttcggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggac
SEQ ID NO:45.质粒1425多肽
MASESPSAPPHRWCIPWQRLLLTASLLTFWNPPTTAKLTIESTPFNVAEGKEVLLLVHNLPQHLFGYSWYKGERVDGNRQIIGYVIGTQQATPGPAYSGREIIYPNASLLIQNIIQNDTGFYTLHVIKSDLVNEEATGQFRVYPELPKPFITSNNSNPVEDEDAVALTCEPEIQNTTYLWWVNNQSLPVSPRLQLSNDNRTLTLLSVTRNDVGPYECGIQNKLSVDHSDPVILNVLYGPDDPTISPSYTYYRPGVNLSLSCHAASNPPAQYSWLIDGNIQQHTQELFISNITEKNSGLYTCQANNSASGHSRTTVKTITVSAELPKPSISSNNSKPVEDKDAVAFTCEPEAQNTTYLWWVNGQSLPVSPRLQLSNGNRTLTLFNVTRNDARAYVCGIQNSVSANRSDPVTLDVLYGPDTPIISPPDSSYLSGANLNLSCHSASNPSPQYSWRINGIPQQHTQVLFIAKITPNNNGTYACFVSNLATGRNNSIVKSITVSASGTSPGLSAGATVGIMIGVLVGVALIGSGEGRGSLLTCGDVEENPGPTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTPPAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPALGSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLASHSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQCRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLSYTNPAVAAASANLGSGTILSEGATNFSLLKLAGDVELNPGPGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILD
SEQ ID NO:46.质粒1426ORF(核苷酸序列)
atggctagcggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccgggacgcaccaggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcgagggagcgttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccagaccgccacggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaaggaacagcttcggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggacggatccggcacaatcctgtctgagggcgccaccaacttcagcctgctgaaactggccggcgacgtggaactgaaccctggccctacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctgggatccggcagaatcttcaacgcccactacgccggctacttcgccgacctgctgatccacgacatcgagacaaaccctggccccgaatcgccaagcgcaccccctcatcggtggtgcatcccttggcaacgcctcctcctgaccgcctcactgctgactttctggaacccgccgaccaccgcaaagctgaccattgagagcactcccttcaacgtggctgaggggaaggaggtgctgctcctggtgcacaatctgccccagcacctgttcgggtactcctggtacaagggagaacgcgtggacgggaaccggcagatcataggctacgtcatcggaacccagcaggccacacccggtccagcgtacagcggccgggagattatctacccgaacgcctccctgctgatccaaaacatcatccagaacgacaccggtttctacactctgcacgtgattaagtcagatctggtcaacgaagaggccaccggccaattcagggtgtaccccgaactccctaagccgttcatcacctcgaacaacagcaacccggtcgaggatgaagatgcggtggccttgacgtgcgaacctgagatccagaacaccacctacttgtggtgggtgaacaatcagagcctgccagtctccccacgactccagctgtcgaacgacaacaggaccctgactttgctgtccgtgactcggaacgacgtgggcccttatgaatgcggtatccagaacaagctgtccgtggaccacagcgaccctgtgatcctgaacgtcctttacgggccggacgaccccaccatttccccgtcgtacacttactaccggccgggcgtgaacctgtccctgtcgtgccacgctgcctccaatccgccggcccagtactcctggctcatcgacggaaacatccagcagcacacccaagaactgttcatctccaacattaccgagaaaaactcgggactttacacctgtcaagccaacaattccgccagcggccactcccgcaccactgtcaaaactatcactgtgtccgccgaactcccgaagcccagcatcagctccaacaactcgaagcccgtggaggataaggacgctgtcgcgttcacctgtgaaccagaggcacagaataccacctacctttggtgggtcaacggacagtccctgcctgtctcaccgagactgcagctgtcaaacgggaataggactctgaccttgtttaacgtcacccggaacgacgcccgggcctacgtgtgcggcatccagaactccgtgagcgcaaaccggtctgacccagtgaccctggatgtgctgtacggccccgacactccgatcatttcaccccccgattcatcctacctgtccggcgctaacctcaacctctcatgccactccgcatccaaccccagcccgcaatattcgtggcgcattaacggaattcctcagcaacatacccaggtcctgttcattgcgaagatcacccctaacaacaacggaacctacgcctgctttgtgtcaaacctggccactggtagaaacaactccatcgtgaagtccattaccgtgtcggcgtccggaacttccccgggcctgagcgccggcgccaccgtgggaattatgatcggcgtgctcgtgggagtggccctgatc
SEQ ID NO:47.质粒1426多肽
MASGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILDGSGTILSEGATNFSLLKLAGDVELNPGPTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTPPAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPALGSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLASHSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQCRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLSYTNPAVAAASANLGSGRIFNAHYAGYFADLLIHDIETNPGPESPSAPPHRWCIPWQRLLLTASLLTFWNPPTTAKLTIESTPFNVAEGKEVLLLVHNLPQHLFGYSWYKGERVDGNRQIIGYVIGTQQATPGPAYSGREIIYPNASLLIQNIIQNDTGFYTLHVIKSDLVNEEATGQFRVYPELPKPFITSNNSNPVEDEDAVALTCEPEIQNTTYLWWVNNQSLPVSPRLQLSNDNRTLTLLSVTRNDVGPYECGIQNKLSVDHSDPVILNVLYGPDDPTISPSYTYYRPGVNLSLSCHAASNPPAQYSWLIDGNIQQHTQELFISNITEKNSGLYTCQANNSASGHSRTTVKTITVSAELPKPSISSNNSKPVEDKDAVAFTCEPEAQNTTYLWWVNGQSLPVSPRLQLSNGNRTLTLFNVTRNDARAYVCGIQNSVSANRSDPVTLDVLYGPDTPIISPPDSSYLSGANLNLSCHSASNPSPQYSWRINGIPQQHTQVLFIAKITPNNNGTYACFVSNLATGRNNSIVKSITVSASGTSPGLSAGATVGIMIGVLVGVALI
SEQ ID NO:48.质粒1427ORF(核苷酸序列)
atggctagcggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccgggacgcaccaggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcgagggagcgttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccagaccgccacggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaaggaacagcttcggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggacggatccggccagtgcaccaattacgccctgctgaagctggccggcgacgtggaatctaaccctggccctgaatcgccaagcgcaccccctcatcggtggtgcatcccttggcaacgcctcctcctgaccgcctcactgctgactttctggaacccgccgaccaccgcaaagctgaccattgagagcactcccttcaacgtggctgaggggaaggaggtgctgctcctggtgcacaatctgccccagcacctgttcgggtactcctggtacaagggagaacgcgtggacgggaaccggcagatcataggctacgtcatcggaacccagcaggccacacccggtccagcgtacagcggccgggagattatctacccgaacgcctccctgctgatccaaaacatcatccagaacgacaccggtttctacactctgcacgtgattaagtcagatctggtcaacgaagaggccaccggccaattcagggtgtaccccgaactccctaagccgttcatcacctcgaacaacagcaacccggtcgaggatgaagatgcggtggccttgacgtgcgaacctgagatccagaacaccacctacttgtggtgggtgaacaatcagagcctgccagtctccccacgactccagctgtcgaacgacaacaggaccctgactttgctgtccgtgactcggaacgacgtgggcccttatgaatgcggtatccagaacaagctgtccgtggaccacagcgaccctgtgatcctgaacgtcctttacgggccggacgaccccaccatttccccgtcgtacacttactaccggccgggcgtgaacctgtccctgtcgtgccacgctgcctccaatccgccggcccagtactcctggctcatcgacggaaacatccagcagcacacccaagaactgttcatctccaacattaccgagaaaaactcgggactttacacctgtcaagccaacaattccgccagcggccactcccgcaccactgtcaaaactatcactgtgtccgccgaactcccgaagcccagcatcagctccaacaactcgaagcccgtggaggataaggacgctgtcgcgttcacctgtgaaccagaggcacagaataccacctacctttggtgggtcaacggacagtccctgcctgtctcaccgagactgcagctgtcaaacgggaataggactctgaccttgtttaacgtcacccggaacgacgcccgggcctacgtgtgcggcatccagaactccgtgagcgcaaaccggtctgacccagtgaccctggatgtgctgtacggccccgacactccgatcatttcaccccccgattcatcctacctgtccggcgctaacctcaacctctcatgccactccgcatccaaccccagcccgcaatattcgtggcgcattaacggaattcctcagcaacatacccaggtcctgttcattgcgaagatcacccctaacaacaacggaacctacgcctgctttgtgtcaaacctggccactggtagaaacaactccatcgtgaagtccattaccgtgtcggcgtccggaacttccccgggcctgagcgccggcgccaccgtgggaattatgatcggcgtgctcgtgggagtggccctgatcggatccggcgagggcagaggcagcctgctgacatgtggcgacgtggaagagaaccctggccccacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctg
SEQ ID NO:49.质粒1427多肽
MASGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILDGSGQCTNYALLKLAGDVESNPGPESPSAPPHRWCIPWQRLLLTASLLTFWNPPTTAKLTIESTPFNVAEGKEVLLLVHNLPQHLFGYSWYKGERVDGNRQIIGYVIGTQQATPGPAYSGREIIYPNASLLIQNIIQNDTGFYTLHVIKSDLVNEEATGQFRVYPELPKPFITSNNSNPVEDEDAVALTCEPEIQNTTYLWWVNNQSLPVSPRLQLSNDNRTLTLLSVTRNDVGPYECGIQNKLSVDHSDPVILNVLYGPDDPTISPSYTYYRPGVNLSLSCHAASNPPAQYSWLIDGNIQQHTQELFISNITEKNSGLYTCQANNSASGHSRTTVKTITVSAELPKPSISSNNSKPVEDKDAVAFTCEPEAQNTTYLWWVNGQSLPVSPRLQLSNGNRTLTLFNVTRNDARAYVCGIQNSVSANRSDPVTLDVLYGPDTPIISPPDSSYLSGANLNLSCHSASNPSPQYSWRINGIPQQHTQVLFIAKITPNNNGTYACFVSNLATGRNNSIVKSITVSASGTSPGLSAGATVGIMIGVLVGVALIGSGEGRGSLLTCGDVEENPGPTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTPPAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPALGSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLASHSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQCRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLSYTNPAVAAASANL
SEQ ID NO:50.质粒1428ORF(核苷酸序列)
atggctagcacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctgggatccggcagaatcttcaacgcccactacgccggctacttcgccgacctgctgatccacgacatcgagacaaaccctggccccaagctgaccattgagagcactcccttcaacgtggctgaggggaaggaggtgctgctcctggtgcacaatctgccccagcacctgttcgggtactcctggtacaagggagaacgcgtggacgggaaccggcagatcataggctacgtcatcggaacccagcaggccacacccggtccagcgtacagcggccgggagattatctacccgaacgcctccctgctgatccaaaacatcatccagaacgacaccggtttctacactctgcacgtgattaagtcagatctggtcaacgaagaggccaccggccaattcagggtgtaccccgaactccctaagccgttcatcacctcgaacaacagcaacccggtcgaggatgaagatgcggtggccttgacgtgcgaacctgagatccagaacaccacctacttgtggtgggtgaacaatcagagcctgccagtctccccacgactccagctgtcgaacgacaacaggaccctgactttgctgtccgtgactcggaacgacgtgggcccttatgaatgcggtatccagaacaagctgtccgtggaccacagcgaccctgtgatcctgaacgtcctttacgggccggacgaccccaccatttccccgtcgtacacttactaccggccgggcgtgaacctgtccctgtcgtgccacgctgcctccaatccgccggcccagtactcctggctcatcgacggaaacatccagcagcacacccaagaactgttcatctccaacattaccgagaaaaactcgggactttacacctgtcaagccaacaattccgccagcggccactcccgcaccactgtcaaaactatcactgtgtccgccgaactcccgaagcccagcatcagctccaacaactcgaagcccgtggaggataaggacgctgtcgcgttcacctgtgaaccagaggcacagaataccacctacctttggtgggtcaacggacagtccctgcctgtctcaccgagactgcagctgtcaaacgggaataggactctgaccttgtttaacgtcacccggaacgacgcccgggcctacgtgtgcggcatccagaactccgtgagcgcaaaccggtctgacccagtgaccctggatgtgctgtacggccccgacactccgatcatttcaccccccgattcatcctacctgtccggcgctaacctcaacctctcatgccactccgcatccaaccccagcccgcaatattcgtggcgcattaacggaattcctcagcaacatacccaggtcctgttcattgcgaagatcacccctaacaacaacggaacctacgcctgctttgtgtcaaacctggccactggtagaaacaactccatcgtgaagtccattaccgtgtcggcgtccggatccggcgagggcagaggcagcctgctgacatgtggcgacgtggaagagaaccctggccccggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccgggacgcaccaggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcgagggagcgttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccagaccgccacggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaaggaacagcttcggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggac
SEQ ID NO:51.质粒1428多肽
MASTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTPPAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPALGSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLASHSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQCRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLSYTNPAVAAASANLGSGRIFNAHYAGYFADLLIHDIETNPGPKLTIESTPFNVAEGKEVLLLVHNLPQHLFGYSWYKGERVDGNRQIIGYVIGTQQATPGPAYSGREIIYPNASLLIQNIIQNDTGFYTLHVIKSDLVNEEATGQFRVYPELPKPFITSNNSNPVEDEDAVALTCEPEIQNTTYLWWVNNQSLPVSPRLQLSNDNRTLTLLSVTRNDVGPYECGIQNKLSVDHSDPVILNVLYGPDDPTISPSYTYYRPGVNLSLSCHAASNPPAQYSWLIDGNIQQHTQELFISNITEKNSGLYTCQANNSASGHSRTTVKTITVSAELPKPSISSNNSKPVEDKDAVAFTCEPEAQNTTYLWWVNGQSLPVSPRLQLSNGNRTLTLFNVTRNDARAYVCGIQNSVSANRSDPVTLDVLYGPDTPIISPPDSSYLSGANLNLSCHSASNPSPQYSWRINGIPQQHTQVLFIAKITPNNNGTYACFVSNLATGRNNSIVKSITVSASGSGEGRGSLLTCGDVEENPGPGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILD
SEQ ID NO:52.质粒1429ORF(核苷酸序列)
atggctagcaagctgaccattgagagcactcccttcaacgtggctgaggggaaggaggtgctgctcctggtgcacaatctgccccagcacctgttcgggtactcctggtacaagggagaacgcgtggacgggaaccggcagatcataggctacgtcatcggaacccagcaggccacacccggtccagcgtacagcggccgggagattatctacccgaacgcctccctgctgatccaaaacatcatccagaacgacaccggtttctacactctgcacgtgattaagtcagatctggtcaacgaagaggccaccggccaattcagggtgtaccccgaactccctaagccgttcatcacctcgaacaacagcaacccggtcgaggatgaagatgcggtggccttgacgtgcgaacctgagatccagaacaccacctacttgtggtgggtgaacaatcagagcctgccagtctccccacgactccagctgtcgaacgacaacaggaccctgactttgctgtccgtgactcggaacgacgtgggcccttatgaatgcggtatccagaacaagctgtccgtggaccacagcgaccctgtgatcctgaacgtcctttacgggccggacgaccccaccatttccccgtcgtacacttactaccggccgggcgtgaacctgtccctgtcgtgccacgctgcctccaatccgccggcccagtactcctggctcatcgacggaaacatccagcagcacacccaagaactgttcatctccaacattaccgagaaaaactcgggactttacacctgtcaagccaacaattccgccagcggccactcccgcaccactgtcaaaactatcactgtgtccgccgaactcccgaagcccagcatcagctccaacaactcgaagcccgtggaggataaggacgctgtcgcgttcacctgtgaaccagaggcacagaataccacctacctttggtgggtcaacggacagtccctgcctgtctcaccgagactgcagctgtcaaacgggaataggactctgaccttgtttaacgtcacccggaacgacgcccgggcctacgtgtgcggcatccagaactccgtgagcgcaaaccggtctgacccagtgaccctggatgtgctgtacggccccgacactccgatcatttcaccccccgattcatcctacctgtccggcgctaacctcaacctctcatgccactccgcatccaaccccagcccgcaatattcgtggcgcattaacggaattcctcagcaacatacccaggtcctgttcattgcgaagatcacccctaacaacaacggaacctacgcctgctttgtgtcaaacctggccactggtagaaacaactccatcgtgaagtccattaccgtgtcggcgtccggatccggcgagggcagaggcagcctgctgacatgtggcgacgtggaagagaaccctggccccggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccgggacgcaccaggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcgagggagcgttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccagaccgccacggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaaggaacagcttcggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggacggatccggcacaatcctgtctgagggcgccaccaacttcagcctgctgaaactggccggcgacgtggaactgaaccctggccctacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctg
SEQ ID NO:53.质粒1429多肽
MASKLTIESTPFNVAEGKEVLLLVHNLPQHLFGYSWYKGERVDGNRQIIGYVIGTQQATPGPAYSGREIIYPNASLLIQNIIQNDTGFYTLHVIKSDLVNEEATGQFRVYPELPKPFITSNNSNPVEDEDAVALTCEPEIQNTTYLWWVNNQSLPVSPRLQLSNDNRTLTLLSVTRNDVGPYECGIQNKLSVDHSDPVILNVLYGPDDPTISPSYTYYRPGVNLSLSCHAASNPPAQYSWLIDGNIQQHTQELFISNITEKNSGLYTCQANNSASGHSRTTVKTITVSAELPKPSISSNNSKPVEDKDAVAFTCEPEAQNTTYLWWVNGQSLPVSPRLQLSNGNRTLTLFNVTRNDARAYVCGIQNSVSANRSDPVTLDVLYGPDTPIISPPDSSYLSGANLNLSCHSASNPSPQYSWRINGIPQQHTQVLFIAKITPNNNGTYACFVSNLATGRNNSIVKSITVSASGSGEGRGSLLTCGDVEENPGPGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVAITGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILDGSGTILSEGATNFSLLKLAGDVELNPGPTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVPVTRPALGSTTPPAHDVTSAPDNKPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPALGSTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHHSDTPTTLASHSTKTDASSTHHSSVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEMFLQIYKQGGFLGLSNIKFRPGSVVVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTISDVSVSDVPFPFSAQSGAGVPGWGIALLVLVCVLVALAIVYLIALAVCQCRRKNYGQLDIFPARDTYHPMSEYPTYHTHGRYVPPSSTDRSPYEKVSAGNGGSSLSYTNPAVAAASANL
SEQ ID NO:65.质粒1428完整载体(核苷酸序列)
ggcgtaatgctctgccagtgttacaaccaattaaccaattctgattagaaaaactcatcgagcatcaaatgaaactgcaatttattcatatcaggattatcaataccatatttttgaaaaagccgtttctgtaatgaaggagaaaactcaccgaggcagttccataggatggcaagatcctggtatcggtctgcgattccgactcgtccaacatcaatacaacctattaatttcccctcgtcaaaaataaggttatcaagtgagaaatcaccatgagtgacgactgaatccggtgagaatggcaaaagcttatgcatttctttccagacttgttcaacaggccagccattacgctcgtcatcaaaatcactcgcatcaaccaaaccgttattcattcgtgattgcgcctgagcgagacgaaatacgcgatcgctgttaaaaggacaattacaaacaggaatcaaatgcaaccggcgcaggaacactgccagcgcatcaacaatattttcacctgaatcaggatattcttctaatacctggaatgctgttttcccggggatcgcagtggtgagtaaccatgcatcatcaggagtacggataaaatgcttgatggtcggaagaggcataaattccgtcagccagtttagtctgaccatctcatctgtaacatcattggcaacgctacctttgccatgtttcagaaacaactctggcgcatcgggcttcccatacaatcgatagattgtcgcacctgattgcccgacattatcgcgagcccatttatacccatataaatcagcatccatgttggaatttaatcgcggcctcgagcaagacgtttcccgttgaatatggctcataacaccccttgtattactgtttatgtaagcagacaggtcgacaatattggctattggccattgcatacgttgtatctatatcataatatgtacatttatattggctcatgtccaatatgaccgccatgttgacattgattattgactagttattaatagtaatcaattacggggtcattagttcatagcccatatatggagttccgcgttacataacttacggtaaatggcccgcctggctgaccgcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatagggactttccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcagtacatcaagtgtatcatatgccaagtccgccccctattgacgtcaatgacggtaaatggcccgcctggcattatgcccagtacatgaccttacgggactttcctacttggcagtacatctacgtattagtcatcgctattaccatggtgatgcggttttggcagtacaccaatgggcgtggatagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaaatcaacgggactttccaaaatgtcgtaataaccccgccccgttgacgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagctcgtttagtgaaccgtcagatcgcctggagacgccatccacgctgttttgacctccatagaagacaccgggaccgatccagcctccgcggccgggaacggtgcattggaacgcggattccccgtgccaagagtgactcaccgtccggatctcagcaagcaggtatgtactctccagggtgggcctggcttccccagtcaagactccagggatttgagggacgctgtgggctcttctcttacatgtaccttttgcttgcctcaaccctgactatcttccaggtcaggatcccagagtcaggggtctgtattttcctgctggtggctccagttcaggaacagtaaaccctgctccgaatattgcctctcacatctcgtcaatctccgcgaggactggggaccctgtgacgaacatggctagcacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctgggatccggcagaatcttcaacgcccactacgccggctacttcgccgacctgctgatccacgacatcgagacaaaccctggccccaagctgaccattgagagcactcccttcaacgtggctgaggggaaggaggtgctgctcctggtgcacaatctgccccagcacctgttcgggtactcctggtacaagggagaacgcgtggacgggaaccggcagatcataggctacgtcatcggaacccagcaggccacacccggtccagcgtacagcggccgggagattatctacccgaacgcctccctgctgatccaaaacatcatccagaacgacaccggtttctacactctgcacgtgattaagtcagatctggtcaacgaagaggccaccggccaattcagggtgtaccccgaactccctaagccgttcatcacctcgaacaacagcaacccggtcgaggatgaagatgcggtggccttgacgtgcgaacctgagatccagaacaccacctacttgtggtgggtgaacaatcagagcctgccagtctccccacgactccagctgtcgaacgacaacaggaccctgactttgctgtccgtgactcggaacgacgtgggcccttatgaatgcggtatccagaacaagctgtccgtggaccacagcgaccctgtgatcctgaacgtcctttacgggccggacgaccccaccatttccccgtcgtacacttactaccggccgggcgtgaacctgtccctgtcgtgccacgctgcctccaatccgccggcccagtactcctggctcatcgacggaaacatccagcagcacacccaagaactgttcatctccaacattaccgagaaaaactcgggactttacacctgtcaagccaacaattccgccagcggccactcccgcaccactgtcaaaactatcactgtgtccgccgaactcccgaagcccagcatcagctccaacaactcgaagcccgtggaggataaggacgctgtcgcgttcacctgtgaaccagaggcacagaataccacctacctttggtgggtcaacggacagtccctgcctgtctcaccgagactgcagctgtcaaacgggaataggactctgaccttgtttaacgtcacccggaacgacgcccgggcctacgtgtgcggcatccagaactccgtgagcgcaaaccggtctgacccagtgaccctggatgtgctgtacggccccgacactccgatcatttcaccccccgattcatcctacctgtccggcgctaacctcaacctctcatgccactccgcatccaaccccagcccgcaatattcgtggcgcattaacggaattcctcagcaacatacccaggtcctgttcattgcgaagatcacccctaacaacaacggaacctacgcctgctttgtgtcaaacctggccactggtagaaacaactccatcgtgaagtccattaccgtgtcggcgtccggatccggcgagggcagaggcagcctgctgacatgtggcgacgtggaagagaaccctggccccggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccgggacgcaccaggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcgagggagcgttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccagaccgccacggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaaggaacagcttcggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggactgaagatctgggccctaacaaaacaaaaagatggggttattccctaaacttcatgggttacgtaattggaagttgggggacattgccacaagatcatattgtacaaaagatcaaacactgttttagaaaacttcctgtaaacaggcctattgattggaaagtatgtcaaaggattgtgggtcttttgggctttgctgctccatttacacaatgtggatatcctgccttaatgcctttgtatgcatgtatacaagctaaacaggctttcactttctcgccaacttacaaggcctttctaagtaaacagtacatgaacctttaccccgttgctcggcaacggcctggtctgtgccaagtgtttgctgacgcaacccccactggctggggcttggccataggccatcagcgcatgcgtggaacctttgtggctcctctgccgatccatactgcggaactcctagccgcttgttttgctcgcagccggtctggagcaaagctcataggaactgacaattctgtcgtcctctcgcggaaatatacatcgtttcgatctacgtatgatctttttccctctgccaaaaattatggggacatcatgaagccccttgagcatctgacttctggctaataaaggaaatttattttcattgcaatagtgtgttggaattttttgtgtctctcactcggaaggaattctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtccgcctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggcctaactacggctacactagaagaacagtatttggtatctgcgctctgctgaagccagttaccttcggaaaaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgttaagggattttggtcatgagattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaatcaatctaaagtatatatgagtaaacttggtctgacagttaccaatgcttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgcctgactc
SEQ ID NO:66.AdC68Y 1428完整载体(核苷酸序列)
ccatcttcaataatatacctcaaactttttgtgcgcgttaatatgcaaatgaggcgtttgaatttggggaggaagggcggtgattggtcgagggatgagcgaccgttaggggcggggcgagtgacgttttgatgacgtggttgcgaggaggagccagtttgcaagttctcgtgggaaaagtgacgtcaaacgaggtgtggtttgaacacggaaatactcaattttcccgcgctctctgacaggaaatgaggtgtttctgggcggatgcaagtgaaaacgggccattttcgcgcgaaaactgaatgaggaagtgaaaatctgagtaatttcgcgtttatggcagggaggagtatttgccgagggccgagtagactttgaccgattacgtgggggtttcgattaccgtgtttttcacctaaatttccgcgtacggtgtcaaagtccggtgtttttactactgtaatagtaatcaattacggggtcattagttcatagcccatatatggagttccgcgttacataacttacggtaaatggcccgcctggctgaccgcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatagggactttccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcagtacatcaagtgtatcatatgccaagtacgccccctattgacgtcaatgacggtaaatggcccgcctggcattatgcccagtacatgaccttatgggactttcctacttggcagtacatctacgtattagtcatcgctattaccatggtgatgcggttttggcagtacatcaatgggcgtggatagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagctgtccctatcagtgatagagatctccctatcagtgatagagagtttagtgaaccgtcagatccgctagggtaccgcgatCACCatggctagcacccctggaacccagagccccttcttccttctgctgctgctgaccgtgctgactgtcgtgacaggctctggccacgccagctctacacctggcggcgagaaagagacaagcgccacccagagaagcagcgtgccaagcagcaccgagaagaacgccgtgtccatgaccagctccgtgctgagcagccactctcctggcagcggcagcagcacaacacagggccaggatgtgacactggcccctgccacagaacctgcctctggatctgccgccacctggggacaggacgtgacaagcgtgccagtgaccagacctgccctgggctctacaacaccccctgcccacgatgtgaccagcgcccctgataacaagcctgcccctggaagcacagcccctccagctcatggcgtgacctctgccccagataccagaccagccccaggatctacagccccacccgcacacggcgtgacaagtgcccctgacacaagacccgctccaggctctactgctcctcctgcccatggcgtgacaagcgctcccgatacaaggccagctcctggctccacagcaccaccagcacatggcgtgacatcagctcccgacactagacctgctcccggatcaaccgctccaccagctcacggcgtgaccagcgcacctgataccagacctgctctgggaagcaccgcccctcccgtgcacaatgtgacatctgcttccggcagcgccagcggctctgcctctacactggtgcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccttcagcatccctagccaccacagcgacacccctaccacactggccagccactccaccaagaccgatgcctctagcacccaccactccagcgtgccccctctgaccagcagcaaccacagcacaagcccccagctgtctaccggcgtctcattcttctttctgtccttccacatcagcaacctgcagttcaacagcagcctggaagatcccagcaccgactactaccaggaactgcagcgggatatcagcgagatgttcctgcaaatctacaagcagggcggcttcctgggcctgagcaacatcaagttcagacccggcagcgtggtggtgcagctgaccctggctttccgggaaggcaccatcaacgtgcacgacgtggaaacccagttcaaccagtacaagaccgaggccgccagccggtacaacctgaccatctccgatgtgtccgtgtccgacgtgcccttcccattctctgcccagtctggcgcaggcgtgccaggatggggaattgctctgctggtgctcgtgtgcgtgctggtggccctggccatcgtgtatctgattgccctggccgtgtgccagtgccggcggaagaattacggccagctggacatcttccccgccagagacacctaccaccccatgagcgagtaccccacataccacacccacggcagatacgtgccacccagctccaccgacagatccccctacgagaaagtgtctgccggcaacggcggcagctccctgagctacacaaatcctgccgtggccgctgcctccgccaacctgggatccggcagaatcttcaacgcccactacgccggctacttcgccgacctgctgatccacgacatcgagacaaaccctggccccaagctgaccattgagagcactcccttcaacgtggctgaggggaaggaggtgctgctcctggtgcacaatctgccccagcacctgttcgggtactcctggtacaagggagaacgcgtggacgggaaccggcagatcataggctacgtcatcggaacccagcaggccacacccggtccagcgtacagcggccgggagattatctacccgaacgcctccctgctgatccaaaacatcatccagaacgacaccggtttctacactctgcacgtgattaagtcagatctggtcaacgaagaggccaccggccaattcagggtgtaccccgaactccctaagccgttcatcacctcgaacaacagcaacccggtcgaggatgaagatgcggtggccttgacgtgcgaacctgagatccagaacaccacctacttgtggtgggtgaacaatcagagcctgccagtctccccacgactccagctgtcgaacgacaacaggaccctgactttgctgtccgtgactcggaacgacgtgggcccttatgaatgcggtatccagaacaagctgtccgtggaccacagcgaccctgtgatcctgaacgtcctttacgggccggacgaccccaccatttccccgtcgtacacttactaccggccgggcgtgaacctgtccctgtcgtgccacgctgcctccaatccgccggcccagtactcctggctcatcgacggaaacatccagcagcacacccaagaactgttcatctccaacattaccgagaaaaactcgggactttacacctgtcaagccaacaattccgccagcggccactcccgcaccactgtcaaaactatcactgtgtccgccgaactcccgaagcccagcatcagctccaacaactcgaagcccgtggaggataaggacgctgtcgcgttcacctgtgaaccagaggcacagaataccacctacctttggtgggtcaacggacagtccctgcctgtctcaccgagactgcagctgtcaaacgggaataggactctgaccttgtttaacgtcacccggaacgacgcccgggcctacgtgtgcggcatccagaactccgtgagcgcaaaccggtctgacccagtgaccctggatgtgctgtacggccccgacactccgatcatttcaccccccgattcatcctacctgtccggcgctaacctcaacctctcatgccactccgcatccaaccccagcccgcaatattcgtggcgcattaacggaattcctcagcaacatacccaggtcctgttcattgcgaagatcacccctaacaacaacggaacctacgcctgctttgtgtcaaacctggccactggtagaaacaactccatcgtgaagtccattaccgtgtcggcgtccggatccggcgagggcagaggcagcctgctgacatgtggcgacgtggaagagaaccctggccccggagctgccccggagccggagaggacccccgttggccagggatcgtgggcccatccgggacgcaccaggggaccatccgacaggggattctgtgtggtgtcaccggccaggccagcagaagaggcaaccagcctcgagggagcgttgtctggaaccagacattcccacccgtcggtgggccggcagcaccacgcgggaccaccgtccacttccagaccgccacggccatgggacaccccttgcccgcctgtgtatgccgagactaaacacttcctgtactcatccggagacaaggaacagcttcggccgtccttcctcctgtcgtcgctcagaccgagcctgaccggagcacgcagattggtggaaactatcttccttgggtcacgtccgtggatgccaggtaccccacggcgcctcccgcgcctcccacagagatactggcagatgcggcctctgttcctggaattgctgggaaaccacgctcagtgcccgtacggagtcctgctcaagactcactgccctctgagggcggcggtcactccggcggccggagtgtgcgcacgggagaagccccagggaagcgtggcagctccggaagaggaggacaccgatccgcgccgcctcgtgcaacttctgcgccagcactcctcgccctggcaagtctacgggttcgtccgcgcctgcctgcgccgcctggtgccgcctgggctctggggttcccggcataacgagcgccgcttcctgagaaatactaagaagtttatctcacttggaaaacatgccaagttgtcgctgcaagaactcacgtggaagatgtcagtccgcgattgcgcctggctgcgccgctcgccgggcgtcgggtgtgttccagctgcagaacaccgcctgagagaagaaattctggccaaatttctgcattggctgatgtcagtgtacgtggtcgagctgctgcgctcctttttctacgtcactgagactacctttcaaaagaaccgcctgttcttctaccgcaaatctgtgtggagcaagctgcagtcaatcggcattcgccagcatctgaagagggtgcagctgcgggaactttccgaggcagaagtccgccagcaccgggaggcccggccggcgcttctcacgtcgcgtctgagattcatcccaaagcccgacgggctgaggcctatcgtcaacatggattacgtcgtgggcgctcgcacctttcgccgtgaaaagcgggccgaacgcttgacctcacgggtgaaggccctcttctccgtgctgaactacgagagagcaagacggcctggcctgctgggagcttcggtgctgggactggacgatatccaccgggcttggcggacctttgttctccgggtgagagcccaagaccctccgccggaactgtacttcgtgaaggtggcgatcaccggagcctatgatactattccgcaagatcgactcaccgaagtcatcgcctcgatcatcaaaccgcagaacacttactgcgtcaggcggtacgccgtggtccagaaggccgcgcatggccacgtgagaaaggcgttcaagtcgcacgtgtccactctcaccgacctccagccttacatgaggcaattcgttgcgcatttgcaagagacttcgcccctgagagatgcggtggtcatcgagcagagctccagcctgaacgaagcgagcagcggtctgtttgacgtgttcctccgcttcatgtgtcatcacgcggtgcgaatcaggggaaaatcatacgtgcagtgccagggaatcccacaaggcagcattctgtcgactctcttgtgttccctttgctacggcgatatggaaaacaagctgttcgctgggatcagacgggacgggttgctgctcagactggtggacgacttcctgctggtgactccgcacctcactcacgccaaaacctttctccgcactctggtgaggggagtgccagaatacggctgtgtggtcaatctccggaaaactgtggtgaatttccctgtcgaggatgaggcactcggaggaaccgcatttgtccaaatgccagcacatggcctgttcccatggtgcggtctgctgctggacacccgaactcttgaagtgcagtccgactactccagctatgcccggacgagcatccgcgccagcctcactttcaatcgcggctttaaggccggacgaaacatgcgcagaaagcttttcggagtcctccggcttaaatgccattcgctctttctcgatctccaagtcaattcgctgcagaccgtgtgcacgaacatctacaagatcctgctgctccaagcctaccggttccacgcttgcgtgcttcagctgccgtttcaccaacaggtgtggaagaacccgaccttctttctgcgggtcattagcgatactgcctccctgtgttactcaatcctcaaggcaaagaacgccggaatgtcgctgggtgcgaaaggagccgcgggacctcttcctagcgaagcggtgcagtggctctgccaccaggctttcctcctgaagctgaccaggcacagagtgacctacgtcccgctgctgggctcgctgcgcactgcacagacccagctgtctagaaaactccccggcaccaccctgaccgctctggaagccgccgccaacccagcattgccgtcagatttcaagaccatcttggacTGAcgcaCctcgagctgatcataatcagccataccacatttgtagaggttttacttgctttaaaaaacctcccacacctccccctgaacctgaaacataaaatgaatgcaattgttgttgttaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttaccaggtgccgagcctgcgagtgcggagggaagcatgccaggttccagcccgtgtgtgtggatgtgacggaggacctgcgacccgatcatttggtgttgccctgcaccgggacggagttcggttccagcggggaagaatctgactagagtgagtagtgttctggggcgggggaggacctgcatgagggccagaataactgaaatctgtgcttttctgtgtgttgcagcagcatgagcggaagcggctcctttgagggaggggtattcagcccttatctgacggggcgtctcccctcctgggcgggagtgcgtcagaatgtgatgggatccacggtggacggccggcccgtgcagcccgcgaactcttcaaccctgacctatgcaaccctgagctcttcgtcgttggacgcagctgccgccgcagctgctgcatctgccgccagcgccgtgcgcggaatggccatgggcgccggctactacggcactctggtggccaactcgagttccaccaataatcccgccagcctgaacgaggagaagctgttgctgctgatggcccagctcgaggccttgacccagcgcctgggcgagctgacccagcaggtggctcagctgcaggagcagacgcgggccgcggttgccacggtgaaatccaaataaaaaatgaatcaataaataaacggagacggttgttgattttaacacagagtctgaatctttatttgatttttcgcgcgcggtaggccctggaccaccggtctcgatcattgagcacccggtggatcttttccaggacccggtagaggtgggcttggatgttgaggtacatgggcatgagcccgtcccgggggtggaggtagctccattgcagggcctcgtgctcgggggtggtgttgtaaatcacccagtcatagcaggggcgcagggcatggtgttgcacaatatctttgaggaggagactgatggccacgggcagccctttggtgtaggtgtttacaaatctgttgagctgggagggatgcatgcggggggagatgaggtgcatcttggcctggatcttgagattggcgatgttaccgcccagatcccgcctggggttcatgttgtgcaggaccaccagcacggtgtatccggtgcacttggggaatttatcatgcaacttggaagggaaggcgtgaaagaatttggcgacgcctttgtgcccgcccaggttttccatgcactcatccatgatgatggcgatgggcccgtgggcggcggcctgggcaaagacgtttcgggggtcggacacatcatagttgtggtcctgggtgaggtcatcataggccattttaatgaatttggggcggagggtgccggactgggggacaaaggtaccctcgatcccgggggcgtagttcccctcacagatctgcatctcccaggctttgagctcggagggggggatcatgtccacctgcggggcgataaagaacacggtttccggggcgggggagatgagctgggccgaaagcaagttccggagcagctgggacttgccgcagccggtggggccgtagatgaccccgatgaccggctgcaggtggtagttgagggagagacagctgccgtcctcccggaggaggggggccacctcgttcatcatctcgcgcacgtgcatgttctcgcgcaccagttccgccaggaggcgctctccccccagggataggagctcctggagcgaggcgaagtttttcagcggcttgagtccgtcggccatgggcattttggagagggtttgttgcaagagttccaggcggtcccagagctcggtgatgtgctctacggcatctcgatccagcagacctcctcgtttcgcgggttgggacggctgcgggagtagggcaccagacgatgggcgtccagcgcagccagggtccggtccttccagggtcgcagcgtccgcgtcagggtggtctccgtcacggtgaaggggtgcgcgccgggctgggcgcttgcgagggtgcgcttcaggctcatccggctggtcgaaaaccgctcccgatcggcgccctgcgcgtcggccaggtagcaattgaccatgagttcgtagttgagcgcctcggccgcgtggcctttggcgcggagcttacctttggaagtctgcccgcaggcgggacagaggagggacttgagggcgtagagcttgggggcgaggaagacggactcgggggcgtaggcgtccgcgccgcagtgggcgcagacggtctcgcactccacgagccaggtgaggtcgggctggtcggggtcaaaaaccagtttcccgccgttctttttgatgcgtttcttacctttggtctccatgagctcgtgtccccgctgggtgacaaagaggctgtccgtgtccccgtagaccgactttatgggccggtcctcgagcggtgtgccgcggtcctcctcgtagaggaaccccgcccactccgagacgaaagcccgggtccaggccagcacgaaggaggccacgtgggacgggtagcggtcgttgtccaccagcgggtccaccttttccagggtatgcaaacacatgtccccctcgtccacatccaggaaggtgattggcttgtaagtgtaggccacgtgaccgggggtcccggccgggggggtataaaagggtgcgggtccctgctcgtcctcactgtcttccggatcgctgtccaggagcgccagctgttggggtaggtattccctctcgaaggcgggcatgacctcggcactcaggttgtcagtttctagaaacgaggaggatttgatattgacggtgccggcggagatgcctttcaagagcccctcgtccatctggtcagaaaagacgatctttttgttgtcgagcttggtggcgaaggagccgtagagggcgttggagaggagcttggcgatggagcgcatggtctggtttttttccttgtcggcgcgctccttggcggcgatgttgagctgcacgtactcgcgcgccacgcacttccattcggggaagacggtggtcagctcgtcgggcacgattctgacctgccagccccgattatgcagggtgatgaggtccacactggtggccacctcgccgcgcaggggctcattagtccagcagaggcgtccgcccttgcgcgagcagaaggggggcagggggtccagcatgacctcgtcgggggggtcggcatcgatggtgaagatgccgggcaggaggtcggggtcaaagtagctgatggaagtggccagatcgtccagggcagcttgccattcgcgcacggccagcgcgcgctcgtagggactgaggggcgtgccccagggcatgggatgggtaagcgcggaggcgtacatgccgcagatgtcgtagacgtagaggggctcctcgaggatgccgatgtaggtggggtagcagcgccccccgcggatgctggcgcgcacgtagtcatacagctcgtgcgagggggcgaggagccccgggcccaggttggtgcgactgggcttttcggcgcggtagacgatctggcggaaaatggcatgcgagttggaggagatggtgggcctttggaagatgttgaagtgggcgtggggcagtccgaccgagtcgcggatgaagtgggcgtaggagtcttgcagcttggcgacgagctcggcggtgactaggacgtccagagcgcagtagtcgagggtctcctggatgatgtcatacttgagctgtcccttttgtttccacagctcgcggttgagaaggaactcttcgcggtccttccagtactcttcgagggggaacccgtcctgatctgcacggtaagagcctagcatgtagaactggttgacggccttgtaggcgcagcagcccttctccacggggagggcgtaggcctgggcggccttgcgcagggaggtgtgcgtgagggcgaaagtgtccctgaccatgaccttgaggaactggtgcttgaagtcgatatcgtcgcagcccccctgctcccagagctggaagtccgtgcgcttcttgtaggcggggttgggcaaagcgaaagtaacatcgttgaagaggatcttgcccgcgcggggcataaagttgcgagtgatgcggaaaggttggggcacctcggcccggttgttgatgacctgggcggcgagcacgatctcgtcgaagccgttgatgttgtggcccacgatgtagagttccacgaatcgcggacggcccttgacgtggggcagtttcttgagctcctcgtaggtgagctcgtcggggtcgctgagcccgtgctgctcgagcgcccagtcggcgagatgggggttggcgcggaggaaggaagtccagagatccacggccagggcggtttgcagacggtcccggtactgacggaactgctgcccgacggccattttttcgggggtgacgcagtagaaggtgcgggggtccccgtgccagcgatcccatttgagctggagggcgagatcgagggcgagctcgacgagccggtcgtccccggagagtttcatgaccagcatgaaggggacgagctgcttgccgaaggaccccatccaggtgtaggtttccacatcgtaggtgaggaagagcctttcggtgcgaggatgcgagccgatggggaagaactggatctcctgccaccaattggaggaatggctgttgatgtgatggaagtagaaatgccgacggcgcgccgaacactcgtgcttgtgtttatacaagcggccacagtgctcgcaacgctgcacgggatgcacgtgctgcacgagctgtacctgagttcctttgacgaggaatttcagtgggaagtggagtcgtggcgcctgcatctcgtgctgtactacgtcgtggtggtcggcctggccctcttctgcctcgatggtggtcatgctgacgagcccgcgcgggaggcaggtccagacctcggcgcgagcgggtcggagagcgaggacgagggcgcgcaggccggagctgtccagggtcctgagacgctgcggagtcaggtcagtgggcagcggcggcgcgcggttgacttgcaggagtttttccagggcgcgcgggaggtccagatggtacttgatctccaccgcgccattggtggcgacgtcgatggcttgcagggtcccgtgcccctggggtgtgaccaccgtcccccgtttcttcttgggcggctggggcgacgggggcggtgcctcttccatggttagaagcggcggcgaggacgcgcgccgggcggcaggggcggctcggggcccggaggcaggggcggcaggggcacgtcggcgccgcgcgcgggtaggttctggtactgcgcccggagaagactggcgtgagcgacgacgcgacggttgacgtcctggatctgacgcctctgggtgaaggccacgggacccgtgagtttgaacctgaaagagagttcgacagaatcaatctcggtatcgttgacggcggcctgccgcaggatctcttgcacgtcgcccgagttgtcctggtaggcgatctcggtcatgaactgctcgatctcctcctcttgaaggtctccgcggccggcgcgctccacggtggccgcgaggtcgttggagatgcggcccatgagctgcgagaaggcgttcatgcccgcctcgttccagacgcggctgtagaccacgacgccctcgggatcgcGggcgcgcatgaccacctgggcgaggttgagctccacgtggcgcgtgaagaccgcgtagttgcagaggcgctggtagaggtagttgagcgtggtggcgatgtgctcggtgacgaagaaatacatgatccagcggcggagcggcatctcgctgacgtcgcccagcgcctccaaacgttccatggcctcgtaaaagtccacggcgaagttgaaaaactgggagttgcgcgccgagacggtcaactcctcctccagaagacggatgagctcggcgatggtggcgcgcacctcgcgctcgaaggcccccgggagttcctccacttcctcttcttcctcctccactaacatctcttctacttcctcctcaggcggcagtggtggcgggggagggggcctgcgtcgccggcggcgcacgggcagacggtcgatgaagcgctcgatggtctcgccgcgccggcgtcgcatggtctcggtgacggcgcgcccgtcctcgcggggccgcagcgtgaagacgccgccgcgcatctccaggtggccgggggggtccccgttgggcagggagagggcgctgacgatgcatcttatcaattgccccgtagggactccgcgcaaggacctgagcgtctcgagatccacgggatctgaaaaccgctgaacgaaggcttcgagccagtcgcagtcgcaaggtaggctgagcacggtttcttctggcgggtcatgttggttgggagcggggcgggcgatgctgctggtgatgaagttgaaataggcggttctgagacggcggatggtggcgaggagcaccaggtctttgggcccggcttgctggatgcgcagacggtcggccatgccccaggcgtggtcctgacacctggccaggtccttgtagtagtcctgcatgagccgctccacgggcacctcctcctcgcccgcgcggccgtgcatgcgcgtgagcccgaagccgcgctggggctggacgagcgccaggtcggcgacgacgcgctcggcgaggatggcttgctggatctgggtgagggtggtctggaagtcatcaaagtcgacgaagcggtggtaggctccggtgttgatggtgtaggagcagttggccatgacggaccagttgacggtctggtggcccggacgcacgagctcgtggtacttgaggcgcgagtaggcgcgcgtgtcgaagatgtagtcgttgcaggtgcgcaccaggtactggtagccgatgaggaagtgcggcggcggctggcggtagagcggccatcgctcggtggcgggggcgccgggcgcgaggtcctcgagcatggtgcggtggtagccgtagatgtacctggacatccaggtgatgccggcggcggtggtggaggcgcgcgggaactcgcggacgcggttccagatgttgcgcagcggcaggaagtagttcatggtgggcacggtctggcccgtgaggcgcgcgcagtcgtggatgctctatacgggcaaaaacgaaagcggtcagcggctcgactccgtggcctggaggctaagcgaacgggttgggctgcgcgtgtaccccggttcgaatctcgaatcaggctggagccgcagctaacgtggtattggcactcccgtctcgacccaagcctgcaccaaccctccaggatacggaggcgggtcgttttgcaacttttttttggaggccggatgagactagtaagcgcggaaagcggccgaccgcgatggctcgctgccgtagtctggagaagaatcgccagggttgcgttgcggtgtgccccggttcgaggccggccggattccgcggctaacgagggcgtggctgccccgtcgtttccaagaccccatagccagccgacttctccagttacggagcgagcccctcttttgttttgtttgtttttgccagatgcatcccgtactgcggcagatgcgcccccaccaccctccaccgcaacaacagccccctccacagccggcgcttctgcccccgccccagcagcaacttccagccacgaccgccgcggccgccgtgagcggggctggacagagttatgatcaccagctggccttggaagagggcgaggggctggcgcgcctgggggcgtcgtcgccggagcggcacccgcgcgtgcagatgaaaagggacgctcgcgaggcctacgtgcccaagcagaacctgttcagagacaggagcggcgaggagcccgaggagatgcgcgcggcccggttccacgcggggcgggagctgcggcgcggcctggaccgaaagagggtgctgagggacgaggatttcgaggcggacgagctgacggggatcagccccgcgcgcgcgcacgtggccgcggccaacctggtcacggcgtacgagcagaccgtgaaggaggagagcaacttccaaaaatccttcaacaaccacgtgcgcaccctgatcgcgcgcgaggaggtgaccctgggcctgatgcacctgtgggacctgctggaggccatcgtgcagaaccccaccagcaagccgctgacggcgcagctgttcctggtggtgcagcatagtcgggacaacgaagcgttcagggaggcgctgctgaatatcaccgagcccgagggccgctggctcctggacctggtgaacattctgcagagcatcgtggtgcaggagcgcgggctgccgctgtccgagaagctggcggccatcaacttctcggtgctgagtttgggcaagtactacgctaggaagatctacaagaccccgtacgtgcccatagacaaggaggtgaagatcgacgggttttacatgcgcatgaccctgaaagtgctgaccctgagcgacgatctgggggtgtaccgcaacgacaggatgcaccgtgcggtgagcgccagcaggcggcgcgagctgagcgaccaggagctgatgcatagtctgcagcgggccctgaccggggccgggaccgagggggagagctactttgacatgggcgcggacctgcactggcagcccagccgccgggccttggaggcggcggcaggaccctacgtagaagaggtggacgatgaggtggacgaggagggcgagtacctggaagactgatggcgcgaccgtatttttgctagatgcaacaacaacagccacctcctgatcccgcgatgcgggcggcgctgcagagccagccgtccggcattaactcctcggacgattggacccaggccatgcaacgcatcatggcgctgacgacccgcaaccccgaagcctttagacagcagccccaggccaaccggctctcggccatcctggaggccgtggtgccctcgcgctccaaccccacgcacgagaaggtcctggccatcgtgaacgcgctggtggagaacaaggccatccgcggcgacgaggccggcctggtgtacaacgcgctgctggagcgcgtggcccgctacaacagcaccaacgtgcagaccaacctggaccgcatggtgaccgacgtgcgcgaggccgtggcccagcgcgagcggttccaccgcgagtccaacctgggatccatggtggcgctgaacgccttcctcagcacccagcccgccaacgtgccccggggccaggaggactacaccaacttcatcagcgccctgcgcctgatggtgaccgaggtgccccagagcgaggtgtaccagtccgggccggactacttcttccagaccagtcgccagggcttgcagaccgtgaacctgagccaggctttcaagaacttgcagggcctgtggggcgtgcaggccccggtcggggaccgcgcgacggtgtcgagcctgctgacgccgaactcgcgcctgctgctgctgctggtggcccccttcacggacagcggcagcatcaaccgcaactcgtacctgggctacctgattaacctgtaccgcgaggccatcggccaggcgcacgtggacgagcagacctaccaggagatcacccacgtgagccgcgccctgggccaggacgacccgggcaacctggaagccaccctgaactttttgctgaccaaccggtcgcagaagatcccgccccagtacgcgctcagcaccgaggaggagcgcatcctgcgttacgtgcagcagagcgtgggcctgttcctgatgcaggagggggccacccccagcgccgcgctcgacatgaccgcgcgcaacatggagcccagcatgtacgccagcaaccgcccgttcatcaataaactgatggactacttgcatcgggcggccgccatgaactctgactatttcaccaacgccatcctgaatccccactggctcccgccgccggggttctacacgggcgagtacgacatgcccgaccccaatgacgggttcctgtgggacgatgtggacagcagcgtgttctccccccgaccgggtgctaacgagcgccccttgtggaagaaggaaggcagcgaccgacgcccgtcctcggcgctgtccggccgcgagggtgctgccgcggcggtgcccgaggccgccagtcctttcccgagcttgcccttctcgctgaacagtatccgcagcagcgagctgggcaggatcacgcgcccgcgcttgctgggcgaagaggagtacttgaatgactcgctgttgagacccgagcgggagaagaacttccccaataacgggatagaaagcctggtggacaagatgagccgctggaagacgtatgcgcaggagcacagggacgatccccgggcgtcgcagggggccacgagccggggcagcgccgcccgtaaacgccggtggcacgacaggcagcggggacagatgtgggacgatgaggactccgccgacgacagcagcgtgttggacttgggtgggagtggtaacccgttcgctcacctgcgcccccgtatcgggcgcatgatgtaagagaaaccgaaaataaatgatactcaccaaggccatggcgaccagcgtgcgttcgtttcttctctgttgttgttgtatctagtatgatgaggcgtgcgtacccggagggtcctcctccctcgtacgagagcgtgatgcagcaggcgatggcggcggcggcgatgcagcccccgctggaggctccttacgtgcccccgcggtacctggcgcctacggaggggcggaacagcattcgttactcggagctggcacccttgtacgataccacccggttgtacctggtggacaacaagtcggcggacatcgcctcgctgaactaccagaacgaccacagcaacttcctgaccaccgtggtgcagaacaatgacttcacccccacggaggccagcacccagaccatcaactttgacgagcgctcgcggtggggcggccagctgaaaaccatcatgcacaccaacatgcccaacgtgaacgagttcatgtacagcaacaagttcaaggcgcgggtgatggtctcccgcaagacccccaatggggtgacagtgacagaggattatgatggtagtcaggatgagctgaagtatgaatgggtggaatttgagctgcccgaaggcaacttctcggtgaccatgaccatcgacctgatgaacaacgccatcatcgacaattacttggcggtggggcggcagaacggggtgctggagagcgacatcggcgtgaagttcgacactaggaacttcaggctgggctgggaccccgtgaccgagctggtcatgcccggggtgtacaccaacgaggctttccatcccgatattgtcttgctgcccggctgcggggtggacttcaccgagagccgcctcagcaacctgctgggcattcgcaagaggcagcccttccaggaaggcttccagatcatgtacgaggatctggaggggggcaacatccccgcgctcctggatgtcgacgcctatgagaaaagcaaggaggatgcagcagctgaagcaactgcagccgtagctaccgcctctaccgaggtcaggggcgataattttgcaagcgccgcagcagtggcagcggccgaggcggctgaaaccgaaagtaagatagtcattcagccggtggagaaggatagcaagaacaggagctacaacgtactaccggacaagataaacaccgcctaccgcagctggtacctagcctacaactatggcgaccccgagaagggcgtgcgctcctggacgctgctcaccacctcggacgtcacctgcggcgtggagcaagtctactggtcgctgcccgacatgatgcaagacccggtcaccttccgctccacgcgtcaagttagcaactacccggtggtgggcgccgagctcctgcccgtctactccaagagcttcttcaacgagcaggccgtctactcgcagcagctgcgcgccttcacctcgcttacgcacgtcttcaaccgcttccccgagaaccagatcctcgtccgcccgcccgcgcccaccattaccaccgtcagtgaaaacgttcctgctctcacagatcacgggaccctgccgctgcgcagcagtatccggggagtccagcgcgtgaccgttactgacgccagacgccgcacctgcccctacgtctacaaggccctgggcatagtcgcgccgcgcgtcctctcgagccgcaccttctaaatgtccattctcatctcgcccagtaataacaccggttggggcctgcgcgcgcccagcaagatgtacggaggcgctcgccaacgctccacgcaacaccccgtgcgcgtgcgcgggcacttccgcgctccctggggcgccctcaagggccgcgtgcggtcgcgcaccaccgtcgacgacgtgatcgaccaggtggtggccgacgcgcgcaactacacccccgccgccgcgcccgtctccaccgtggacgccgtcatcgacagcgtggtggcCgacgcgcgccggtacgcccgcgccaagagccggcggcggcgcatcgcccggcggcaccggagcacccccgccatgcgcgcggcgcgagccttgctgcgcagggccaggcgcacgggacgcagggccatgctcagggcggccagacgcgcggcttcaggcgccagcgccggcaggacccggagacgcgcggccacggcggcggcagcggccatcgccagcatgtcccgcccgcggcgagggaacgtgtactgggtgcgcgacgccgccaccggtgtgcgcgtgcccgtgcgcacccgcccccctcgcacttgaagatgttcacttcgcgatgttgatgtgtcccagcggcgaggaggatgtccaagcgcaaattcaaggaagagatgctccaggtcatcgcgcctgagatctacggccctgcggtggtgaaggaggaaagaaagccccgcaaaatcaagcgggtcaaaaaggacaaaaaggaagaagaaagtgatgtggacggattggtggagtttgtgcgcgagttcgccccccggcggcgcgtgcagtggcgcgggcggaaggtgcaaccggtgctgagacccggcaccaccgtggtcttcacgcccggcgagcgctccggcaccgcttccaagcgctcctacgacgaggtgtacggggatgatgatattctggagcaggcggccgagcgcctgggcgagtttgcttacggcaagcgcagccgttccgcaccgaaggaagaggcggtgtccatcccgctggaccacggcaaccccacgccgagcctcaagcccgtgaccttgcagcaggtgctgccgaccgcggcgccgcgccgggggttcaagcgcgagggcgaggatctgtaccccaccatgcagctgatggtgcccaagcgccagaagctggaagacgtgctggagaccatgaaggtggacccggacgtgcagcccgaggtcaaggtgcggcccatcaagcaggtggccccgggcctgggcgtgcagaccgtggacatcaagattcccacggagcccatggaaacgcagaccgagcccatgatcaagcccagcaccagcaccatggaggtgcagacggatccctggatgccatcggctcctagtcgaagaccccggcgcaagtacggcgcggccagcctgctgatgcccaactacgcgctgcatccttccatcatccccacgccgggctaccgcggcacgcgcttctaccgcggtcataccagcagccgccgccgcaagaccaccactcgccgccgccgtcgccgcaccgccgctgcaaccacccctgccgccctggtgcggagagtgtaccgccgcggccgcgcacctctgaccctgccgcgcgcgcgctaccacccgagcatcgccatttaaactttcgccTgctttgcagatcaatggccctcacatgccgccttcgcgttcccattacgggctaccgaggaagaaaaccgcgccgtagaaggctggcggggaacgggatgcgtcgccaccaccaccggcggcggcgcgccatcagcaagcggttggggggaggcttcctgcccgcgctgatccccatcatcgccgcggcgatcggggcgatccccggcattgcttccgtggcggtgcaggcctctcagcgccactgagacacacttggaaacatcttgtaataaaccAatggactctgacgctcctggtcctgtgatgtgttttcgtagacagatggaagacatcaatttttcgtccctggctccgcgacacggcacgcggccgttcatgggcacctggagcgacatcggcaccagccaactgaacgggggcgccttcaattggagcagtctctggagcgggcttaagaatttcgggtccacgcttaaaacctatggcagcaaggcgtggaacagcaccacagggcaggcgctgagggataagctgaaagagcagaacttccagcagaaggtggtcgatgggctcgcctcgggcatcaacggggtggtggacctggccaaccaggccgtgcagcggcagatcaacagccgcctggacccggtgccgcccgccggctccgtggagatgccgcaggtggaggaggagctgcctcccctggacaagcggggcgagaagcgaccccgccccgatgcggaggagacgctgctgacgcacacggacgagccgcccccgtacgaggaggcggtgaaactgggtctgcccaccacgcggcccatcgcgcccctggccaccggggtgctgaaacccgaaaagcccgcgaccctggacttgcctcctccccagccttcccgcccctctacagtggctaagcccctgccgccggtggccgtggcccgcgcgcgacccgggggcaccgcccgccctcatgcgaactggcagagcactctgaacagcatcgtgggtctgggagtgcagagtgtgaagcgccgccgctgctattaaacctaccgtagcgcttaacttgcttgtctgtgtgtgtatgtattatgtcgccgccgccgctgtccaccagaaggaggagtgaagaggcgcgtcgccgagttgcaagatggccaccccatcgatgctgccccagtgggcgtacatgcacatcgccggacaggacgcttcggagtacctgagtccgggtctggtgcagtttgcccgcgccacagacacctacttcagtctggggaacaagtttaggaaccccacggtggcgcccacgcacgatgtgaccaccgaccgcagccagcggctgacgctgcgcttcgtgcccgtggaccgcgaggacaacacctactcgtacaaagtgcgctacacgctggccgtgggcgacaaccgcgtgctggacatggccagcacctactttgacatccgcggcgtgctggatcggggccctagcttcaaaccctactccggcaccgcctacaacagtctggcccccaagggagcacccaacacttgtcagtggacatataaagccgatggtgaaactgccacagaaaaaacctatacatatggaaatgcacccgtgcagggcattaacatcacaaaagatggtattcaacttggaactgacaccgatgatcagccaatctacgcagataaaacctatcagcctgaacctcaagtgggtgatgctgaatggcatgacatcactggtactgatgaaaagtatggaggcagagctcttaagcctgataccaaaatgaagccttgttatggttcttttgccaagcctactaataaagaaggaggtcaggcaaatgtgaaaacaggaacaggcactactaaagaatatgacatagacatggctttctttgacaacagaagtgcggctgctgctggcctagctccagaaattgttttgtatactgaaaatgtggatttggaaactccagatacccatattgtatacaaagcaggcacagatgacagcagctcttctattaatttgggtcagcaagccatgcccaacagacctaactacattggtttcagagacaactttatcgggctcatgtactacaacagcactggcaatatgggggtgctggccggtcaggcttctcagctgaatgctgtggttgacttgcaagacagaaacaccgagctgtcctaccagctcttgcttgactctctgggtgacagaacccggtatttcagtatgtggaatcaggcggtggacagctatgatcctgatgtgcgcattattgaaaatcatggtgtggaggatgaacttcccaactattgtttccctctggatgctgttggcagaacagatacttatcagggaattaaggctaatggaactgatcaaaccacatggaccaaagatgacagtgtcaatgatgctaatgagataggcaagggtaatccattcgccatggaaatcaacatccaagccaacctgtggaggaacttcctctacgccaacgtggccctgtacctgcccgactcttacaagtacacgccggccaatgttaccctgcccaccaacaccaacacctacgattacatgaacggccgggtggtggcgccctcgctggtggactcctacatcaacatcggggcgcgctggtcgctggatcccatggacaacgtgaaccccttcaaccaccaccgcaatgcggggctgcgctaccgctccatgctcctgggcaacgggcgctacgtgcccttccacatccaggtgccccagaaatttttcgccatcaagagcctcctgctcctgcccgggtcctacacctacgagtggaacttccgcaaggacgtcaacatgatcctgcagagctccctcggcaacgacctgcgcacggacggggcctccatctccttcaccagcatcaacctctacgccaccttcttccccatggcgcacaacacggcctccacgctcgaggccatgctgcgcaacgacaccaacgaccagtccttcaacgactacctctcggcggccaacatgctctaccccatcccggccaacgccaccaacgtgcccatctccatcccctcgcgcaactgggccgccttccgcggctggtccttcacgcgtctcaagaccaaggagacgccctcgctgggctccgggttcgacccctacttcgtctactcgggctccatcccctacctcgacggcaccttctacctcaaccacaccttcaagaaggtctccatcaccttcgactcctccgtcagctggcccggcaacgaccggctcctgacgcccaacgagttcgaaatcaagcgcaccgtcgacggcgagggctacaacgtggcccagtgcaacatgaccaaggactggttcctggtccagatgctggcccactacaacatcggctaccagggcttctacgtgcccgagggctacaaggaccgcatgtactccttcttccgcaacttccagcccatgagccgccaggtggtggacgaggtcaactacaaggactaccaggccgtcaccctggcctaccagcacaacaactcgggcttcgtcggctacctcgcgcccaccatgcgccagggccagccctaccccgccaactacccctacccgctcatcggcaagagcgccgtcaccagcgtcacccagaaaaagttcctctgcgacagggtcatgtggcgcatccccttctccagcaacttcatgtccatgggcgcgctcaccgacctcggccagaacatgctctatgccaactccgcccacgcgctagacatgaatttcgaagtcgaccccatggatgagtccacccttctctatgttgtcttcgaagtcttcgacgtcgtccgagtgcaccagccccaccgcggcgtcatcgaggccgtctacctgcgcacccccttctcggccggtaacgccaccacctaagctcttgcttcttgcaagccatggccgcgggctccggcgagcaggagctcagggccatcatccgcgacctgggctgcgggccctacttcctgggcaccttcgataagcgcttcccgggattcatggccccgcacaagctggcctgcgccatcgtcaacacggccggccgcgagaccgggggcgagcactggctggccttcgcctggaacccgcgctcgaacacctgctacctcttcgaccccttcgggttctcggacgagcgcctcaagcagatctaccagttcgagtacgagggcctgctgcgccgcagcgccctggccaccgaggaccgctgcgtcaccctggaaaagtccacccagaccgtgcagggtccgcgctcggccgcctgcgggctcttctgctgcatgttcctgcacgccttcgtgcactggcccgaccgccccatggacaagaaccccaccatgaacttgctgacgggggtgcccaacggcatgctccagtcgccccaggtggaacccaccctgcgccgcaaccaggaggcgctctaccgcttcctcaactcccactccgcctactttcgctcccaccgcgcgcgcatcgagaaggccaccgccttcgaccgcatgaatcaagacatgtaaaccgtgtgtgtatgttaaatgtctttaataaacagcactttcatgttacacatgcatctgagatgatttatttagaaatcgaaagggttctgccgggtctcggcatggcccgcgggcagggacacgttgcggaactggtacttggccagccacttgaactcggggatcagcagtttgggcagcggggtgtcggggaaggagtcggtccacagcttccgcgtcagttgcagggcgcccagcaggtcgggcgcggagatcttgaaatcgcagttgggacccgcgttctgcgcgcgggagttgcggtacacggggttgcagcactggaacaccatcagggccgggtgcttcacgctcgccagcaccgtcgcgtcggtgatgctctccacgtcgaggtcctcggcgttggccatcccgaagggggtcatcttgcaggtctgccttcccatggtgggcacgcacccgggcttgtggttgcaatcgcagtgcagggggatcagcatcatctgggcctggtcggcgttcatccccgggtacatggccttcatgaaagcctccaattgcctgaacgcctgctgggccttggctccctcggtgaagaagaccccgcaggacttgctagagaactggttggtggcgcacccggcgtcgtgcacgcagcagcgcgcgtcgttgttggccagctgcaccacgctgcgcccccagcggttctgggtgatcttggcccggtcggggttctccttcagcgcgcgctgcccgttctcgctcgccacatccatctcgatcatgtgctccttctggatcatggtggtcccgtgcaggcaccgcagcttgccctcggcctcggtgcacccgtgcagccacagcgcgcacccggtgcactcccagttcttgtgggcgatctgggaatgcgcgtgcacgaagccctgcaggaagcggcccatcatggtggtcagggtcttgttgctagtgaaggtcagcggaatgccgcggtgctcctcgttgatgtacaggtggcagatgcggcggtacacctcgccctgctcgggcatcagctggaagttggctttcaggtcggtctccacgcggtagcggtccatcagcatagtcatgatttccatacccttctcccaggccgagacgatgggcaggctcatagggttcttcaccatcatcttagcgctagcagccgcggccagggggtcgctctcgtccagggtctcaaagctccgcttgccgtccttctcggtgatccgcaccggggggtagctgaagcccacggccgccagctcctcctcggcctgtctttcgtcctcgctgtcctggctgacgtcctgcaggaccacatgcttggtcttgcggggtttcttcttgggcggcagcggcggcggagatgttggagatggcgagggggagcgcgagttctcgctcaccactactatctcttcctcttcttggtccgaggccacgcggcggtaggtatgtctcttcgggggcagaggcggaggcgacgggctctcgccgccgcgacttggcggatggctggcagagccccttccgcgttcgggggtgcgctcccggcggcgctctgactgacttcctccgcggccggccattgtgttctcctagggaggaacaacaagcatggagactcagccatcgccaacctcgccatctgcccccaccgccgacgagaagcagcagcagcagaatgaaagcttaaccgccccgccgcccagccccgccacctccgacgcggccgtcccagacatgcaagagatggaggaatccatcgagattgacctgggctatgtgacgcccgcggagcacgaggaggagctggcagtgcgcttttcacaagaagagatacaccaagaacagccagagcaggaagcagagaatgagcagagtcaggctgggctcgagcatgacggcgactacctccacctgagcgggggggaggacgcgctcatcaagcatctggcccggcaggccaccatcgtcaaggatgcgctgctcgaccgcaccgaggtgcccctcagcgtggaggagctcagccgcgcctacgagttgaacctcttctcgccgcgcgtgccccccaagcgccagcccaatggcacctgcgagcccaacccgcgcctcaacttctacccggtcttcgcggtgcccgaggccctggccacctaccacatctttttcaagaaccaaaagatccccgtctcctgccgcgccaaccgcacccgcgccgacgcccttttcaacctgggtcccggcgcccgcctacctgatatcgcctccttggaagaggttcccaagatcttcgagggtctgggcagcgacgagactcgggccgcgaacgctctgcaaggagaaggaggagagcatgagcaccacagcgccctggtcgagttggaaggcgacaacgcgcggctggcggtgctcaaacgcacggtcgagctgacccatttcgcctacccggctctgaacctgccccccaaagtcatgagcgcggtcatggaccaggtgctcatcaagcgcgcgtcgcccatctccgaggacgagggcatgcaagactccgaggagggcaagcccgtggtcagcgacgagcagctggcccggtggctgggtcctaatgctagtccccagagtttggaagagcggcgcaaactcatgatggccgtggtcctggtgaccgtggagctggagtgcctgcgccgcttcttcgccgacgcggagaccctgcgcaaggtcgaggagaacctgcactacctcttcaggcacgggttcgtgcgccaggcctgcaagatctccaacgtggagctgaccaacctggtctcctacatgggcatcttgcacgagaaccgcctggggcagaacgtgctgcacaccaccctgcgcggggaggcccggcgcgactacatccgcgactgcgtctacctctacctctgccacacctggcagacgggcatgggcgtgtggcagcagtgtctggaggagcagaacctgaaagagctctgcaagctcctgcagaagaacctcaagggtctgtggaccgggttcgacgagcgcaccaccgcctcggacctggccgacctcattttccccgagcgcctcaggctgacgctgcgcaacggcctgcccgactttatgagccaaagcatgttgcaaaactttcgctctttcatcctcgaacgctccggaatcctgcccgccacctgctccgcgctgccctcggacttcgtgccgctgaccttccgcgagtgccccccgccgctgtggagccactgctacctgctgcgcctggccaactacctggcctaccactcggacgtgatcgaggacgtcagcggcgagggcctgctcgagtgccactgccgctgcaacctctgcacgccgcaccgctccctggcctgcaacccccagctgctgagcgagacccagatcatcggcaccttcgagttgcaagggcccagcgaaggcgagggttcagccgccaaggggggtctgaaactcaccccggggctgtggacctcggcctacttgcgcaagttcgtgcccgaggactaccatcccttcgagatcaggttctacgaggaccaatcccatccgcccaaggccgagctgtcggcctgcgtcatcacccagggggcgatcctggcccaattgcaagccatccagaaatcccgccaagaattcttgctgaaaaagggccgcggggtctacctcgacccccagaccggtgaggagctcaaccccggcttcccccaggatgccccgaggaaacaagaagctgaaagtggagctgccgcccgtggaggatttggaggaagactgggagaacagcagtcaggcagaggaggaggagatggaggaagactgggacagcactcaggcagaggaggacagcctgcaagacagtctggaggaagacgaggaggaggcagaggaggaggtggaagaagcagccgccgccagaccgtcgtcctcggcgggggagaaagcaagcagcacggataccatctccgctccgggtcggggtcccgctcgaccacacagtagatgggacgagaccggacgattcccgaaccccaccacccagaccggtaagaaggagcggcagggatacaagtcctggcgggggcacaaaaacgccatcgtctcctgcttgcaggcctgcgggggcaacatctccttcacccggcgctacctgctcttccaccgcggggtgaactttccccgcaacatcttgcattactaccgtcacctccacagcccctactacttccaagaagaggcagcagcagcagaaaaagaccagcagaaaaccagcagctagaaaatccacagcggcggcagcaggtggactgaggatcgcggcgaacgagccggcgcaaacccgggagctgaggaaccggatctttcccaccctctatgccatcttccagcagagtcgggggcaggagcaggaactgaaagtcaagaaccgttctctgcgctcgctcacccgcagttgtctgtatcacaagagcgaagaccaacttcagcgcactctcgaggacgccgaggctctcttcaacaagtactgcgcgctcactcttaaagagtagcccgcgcccgcccagtcgcagaaaaaggcgggaattacgtcacctgtgcccttcgccctagccgcctccacccatcatcatgagcaaagagattcccacgccttacatgtggagctaccagccccagatgggcctggccgccggtgccgcccaggactactccacccgcatgaattggctcagcgccgggcccgcgatgatctcacgggtgaatgacatccgcgcccaccgaaaccagatactcctagaacagtcagcgctcaccgccacgccccgcaatcacctcaatccgcgtaattggcccgccgccctggtgtaccaggaaattccccagcccacgaccgtactacttccgcgagacgcccaggccgaagtccagctgactaactcaggtgtccagctggcgggcggcgccaccctgtgtcgtcaccgccccgctcagggtataaagcggctggtgatccggggcagaggcacacagctcaacgacgaggtggtgagctcttcgctgggtctgcgacctgacggagtcttccaactcgccggatcggggagatcttccttcacgcctcgtcaggccgtcctgactttggagagttcgtcctcgcagccccgctcgggtggcatcggcactctccagttcgtggaggagttcactccctcggtctacttcaaccccttctccggctcccccggccactacccggacgagttcatcccgaacttcgacgccatcagcgagtcggtggacggctacgattgaatgtcccatggtggcgcagctgacctagctcggcttcgacacctggaccactgccgccgcttccgctgcttcgctcgggatctcgccgagtttgcctactttgagctgcccgaggagcaccctcagggcccggcccacggagtgcggatcgtcgtcgaagggggcctcgactcccacctgcttcggatcttcagccagcgtccgatcctggtcgagcgcgagcaaggacagacccttctgactctgtactgcatctgcaaccaccccggcctgcatgaaagtctttgttgtctgctgtgtactgagtataataaaagctgagatcagcgactactccggacttccgtgtgtTTAAACtcacccccttatccagtgaaataaagatcatattgatgatgattttacagaaataaaaaataatcatttgatttgaaataaagatacaatcatattgatgatttgagtttaacaaaaaaataaagaatcacttacttgaaatctgataccaggtctctgtccatgttttctgccaacaccacttcactcccctcttcccagctctggtactgcaggccccggcgggctgcaaacttcctccacacgctgaaggggatgtcaaattcctcctgtccctcaatcttcattttatcttctatcagatgtccaaaaagcgcgtccgggtggatgatgacttcgaccccgtctacccctacgatgcagacaacgcaccgaccgtgcccttcatcaacccccccttcgtctcttcagatggattccaagagaagcccctgggggtgttgtccctgcgactggccgaccccgtcaccaccaagaacggggaaatcaccctcaagctgggagagggggtggacctcgattcctcgggaaaactcatctccaacacggccaccaaggccgccgcccctctcagtttttccaacaacaccatttcccttaacatggatcaccccttttacactaaagatggaaaattatccttacaagtttctccaccattaaatatactgagaacaagcattctaaacacactagctttaggttttggatcaggtttaggactccgtggctctgccttggcagtacagttagtctctccacttacatttgatactgatggaaacataaagcttaccttagacagaggtttgcatgttacaacaggagatgcaattgaaagcaacataagctgggctaaaggtttaaaatttgaagatggagccatagcaaccaacattggaaatgggttagagtttggaagcagtagtacagaaacaggtgttgatgatgcttacccaatccaagttaaacttggatctggccttagctttgacagtacaggagccataatggctggtaacaaagaagacgataaactcactttgtggacaacacctgatccatcaccaaactgtcaaatactcgcagaaaatgatgcaaaactaacactttgcttgactaaatgtggtagtcaaatactggccactgtgtcagtcttagttgtaggaagtggaaacctaaaccccattactggcaccgtaagcagtgctcaggtgtttctacgttttgatgcaaacggtgttcttttaacagaacattctacactaaaaaaatactgggggtataggcagggagatagcatagatggcactccatataccaatgctgtaggattcatgcccaatttaaaagcttatccaaagtcacaaagttctactactaaaaataatatagtagggcaagtatacatgaatggagatgtttcaaaacctatgcttctcactataaccctcaatggtactgatgacagcaacagtacatattcaatgtcattttcatacacctggactaatggaagctatgttggagcaacatttggggctaactcttataccttctcatacatcgcccaagaatgaacactgtatcccaccctgcatgccaacccttcccaccccactctgtggaacaaactctgaaacacaaaataaaataaagttcaagtgttttattgattcaacagttttacaggattcgagcagttatttttcctccaccctcccaggacatggaatacaccaccctctccccccgcacagccttgaacatctgaatgccattggtgatggacatgcttttggtctccacgttccacacagtttcagagcgagccagtctcgggtcggtcagggagatgaaaccctccgggcactcccgcatctgcacctcacagctcaacagctgaggattgtcctcggtggtcgggatcacggttatctggaagaagcagaagagcggcggtgggaatcatagtccgcgaacgggatcggccggtggtgtcgcatcaggccccgcagcagtcgctgccgccgccgctccgtcaagctgctgctcagggggtccgggtccagggactccctcagcatgatgcccacggccctcagcatcagtcgtctggtgcggcgggcgcagcagcgcatgcggatctcgctcaggtcgctgcagtacgtgcaacacagaaccaccaggttgttcaacagtccatagttcaacacgctccagccgaaactcatcgcgggaaggatgctacccacgtggccgtcgtaccagatcctcaggtaaatcaagtggtgccccctccagaacacgctgcccacgtacatgatctccttgggcatgtggcggttcaccacctcccggtaccacatcaccctctggttgaacatgcagccccggatgatcctgcggaaccacagggccagcaccgccccgcccgccatgcagcgaagagaccccgggtcccggcaatggcaatggaggacccaccgctcgtacccgtggatcatctgggagctgaacaagtctatgttggcacagcacaggcatatgctcatgcatctcttcagcactctcaactcctcgggggtcaaaaccatatcccagggcacggggaactcttgcaggacagcgaaccccgcagaacagggcaatcctcgcacagaacttacattgtgcatggacagggtatcgcaatcaggcagcaccgggtgatcctccaccagagaagcgcgggtctcggtctcctcacagcgtggtaagggggccggccgatacgggtgatggcgggacgcggctgatcgtgttcgcgaccgtgtcatgatgcagttgctttcggacattttcgtacttgctgtagcagaacctggtccgggcgctgcacaccgatcgccggcggcggtctcggcgcttggaacgctcggtgttgaaattgtaaaacagccactctctcagaccgtgcagcagatctagggcctcaggagtgatgaagatcccatcatgcctgatggctctgatcacatcgaccaccgtggaatgggccagacccagccagatgatgcaattttgttgggtttcggtgacggcgggggagggaagaacaggaagaaccatgattaacttttaatccaaacggtctcggagtacttcaaaatgaagatcgcggagatggcacctctcgcccccgctgtgttggtggaaaataacagccaggtcaaaggtgatacggttctcgagatgttccacggtggcttccagcaaagcctccacgcgcacatccagaaacaagacaatagcgaaagcgggagggttctctaattcctcaatcatcatgttacactcctgcaccatccccagataattttcatttttccagccttgaatgattcgaactagttcCtgaggtaaatccaagccagccatgataaagagctcgcgcagagcgccctccaccggcattcttaagcacaccctcataattccaagatattctgctcctggttcacctgcagcagattgacaagcggaatatcaaaatctctgccgcgatccctgagctcctccctcagcaataactgtaagtactctttcatatcctctccgaaatttttagccataggaccaccaggaataagattagggcaagccacagtacagataaaccgaagtcctccccagtgagcattgccaaatgcaagactgctataagcatgctggctagacccggtgatatcttccagataactggacagaaaatcgcccaggcaatttttaagaaaatcaacaaaagaaaaatcctccaggtggacgtttagagcctcgggaacaacgatgaagtaaatgcaagcggtgcgttccagcatggttagttagctgatctgtagaaaaaacaaaaatgaacattaaaccatgctagcctggcgaacaggtgggtaaatcgttctctccagcaccaggcaggccacggggtctccggcgcgaccctcgtaaaaattgtcgctatgattgaaaaccatcacagagagacgttcccggtggccggcgtgaatgattcgacaagatgaatacacccccggaacattggcgtccgcgagtgaaaaaaagcgcccgaggaagcaataaggcactacaatgctcagtctcaagtccagcaaagcgatgccatgcggatgaagcacaaaattctcaggtgcgtacaaaatgtaattactcccctcctgcacaggcagcaaagcccccgatccctccaggtacacatacaaagcctcagcgtccatagcttaccgagcagcagcacacaacaggcgcaagagtcagagaaaggctgagctctaacctgtccacccgctctctgctcaatatatagcccagatctacactgacgtaaaggccaaagtctaaaaatacccgccaaataatcacacacgcccagcacacgcccagaaaccggtgacacactcaaaaaaatacgcgcacttcctcaaacgcccaaaactgccgtcatttccgggttcccacgctacgtcatcaaaacacgactttcaaattccgtcgaccgttaaaaacgtcacccgccccgcccctaacggtcgcccgtctctcagccaatcagcgccccgcatccccaaattcaaacacctcatttgcatattaacgcgcacaaaaagtttgaggtatattattgatgatgg
SEQ ID NO:87.质粒1424ORF(RNA)
auggcuagcaccccuggaacccagagccccuucuuccuucugcugcugcugaccgugcugacugucgugacaggcucuggccacgccagcucuacaccuggcggcgagaaagagacaagcgccacccagagaagcagcgugccaagcagcaccgagaagaacgccguguccaugaccagcuccgugcugagcagccacucuccuggcagcggcagcagcacaacacagggccaggaugugacacuggccccugccacagaaccugccucuggaucugccgccaccuggggacaggacgugacaagcgugccagugaccagaccugcccugggcucuacaacacccccugcccacgaugugaccagcgccccugauaacaagccugccccuggaagcacagccccuccagcucauggcgugaccucugccccagauaccagaccagccccaggaucuacagccccacccgcacacggcgugacaagugccccugacacaagacccgcuccaggcucuacugcuccuccugcccauggcgugacaagcgcucccgauacaaggccagcuccuggcuccacagcaccaccagcacauggcgugacaucagcucccgacacuagaccugcucccggaucaaccgcuccaccagcucacggcgugaccagcgcaccugauaccagaccugcucugggaagcaccgccccucccgugcacaaugugacaucugcuuccggcagcgccagcggcucugccucuacacuggugcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccuucagcaucccuagccaccacagcgacaccccuaccacacuggccagccacuccaccaagaccgaugccucuagcacccaccacuccagcgugcccccucugaccagcagcaaccacagcacaagcccccagcugucuaccggcgucucauucuucuuucuguccuuccacaucagcaaccugcaguucaacagcagccuggaagaucccagcaccgacuacuaccaggaacugcagcgggauaucagcgagauguuccugcaaaucuacaagcagggcggcuuccugggccugagcaacaucaaguucagacccggcagcgugguggugcagcugacccuggcuuuccgggaaggcaccaucaacgugcacgacguggaaacccaguucaaccaguacaagaccgaggccgccagccgguacaaccugaccaucuccgauguguccguguccgacgugcccuucccauucucugcccagucuggcgcaggcgugccaggauggggaauugcucugcuggugcucgugugcgugcugguggcccuggccaucguguaucugauugcccuggccgugugccagugccggcggaagaauuacggccagcuggacaucuuccccgccagagacaccuaccaccccaugagcgaguaccccacauaccacacccacggcagauacgugccacccagcuccaccgacagaucccccuacgagaaagugucugccggcaacggcggcagcucccugagcuacacaaauccugccguggccgcugccuccgccaaccugggauccggcacaauccugucugagggcgccaccaacuucagccugcugaaacuggccggcgacguggaacugaacccuggcccuggagcugccccggagccggagaggacccccguuggccagggaucgugggcccauccgggacgcaccaggggaccauccgacaggggauucuguguggugucaccggccaggccagcagaagaggcaaccagccucgagggagcguugucuggaaccagacauucccacccgucggugggccggcagcaccacgcgggaccaccguccacuuccagaccgccacggccaugggacaccccuugcccgccuguguaugccgagacuaaacacuuccuguacucauccggagacaaggaacagcuucggccguccuuccuccugucgucgcucagaccgagccugaccggagcacgcagauugguggaaacuaucuuccuugggucacguccguggaugccagguaccccacggcgccucccgcgccucccacagagauacuggcagaugcggccucuguuccuggaauugcugggaaaccacgcucagugcccguacggaguccugcucaagacucacugcccucugagggcggcggucacuccggcggccggagugugcgcacgggagaagccccagggaagcguggcagcuccggaagaggaggacaccgauccgcgccgccucgugcaacuucugcgccagcacuccucgcccuggcaagucuacggguucguccgcgccugccugcgccgccuggugccgccugggcucugggguucccggcauaacgagcgccgcuuccugagaaauacuaagaaguuuaucucacuuggaaaacaugccaaguugucgcugcaagaacucacguggaagaugucaguccgcgauugcgccuggcugcgccgcucgccgggcgucggguguguuccagcugcagaacaccgccugagagaagaaauucuggccaaauuucugcauuggcugaugucaguguacguggucgagcugcugcgcuccuuuuucuacgucacugagacuaccuuucaaaagaaccgccuguucuucuaccgcaaaucuguguggagcaagcugcagucaaucggcauucgccagcaucugaagagggugcagcugcgggaacuuuccgaggcagaaguccgccagcaccgggaggcccggccggcgcuucucacgucgcgucugagauucaucccaaagcccgacgggcugaggccuaucgucaacauggauuacgucgugggcgcucgcaccuuucgccgugaaaagcgggccgaacgcuugaccucacgggugaaggcccucuucuccgugcugaacuacgagagagcaagacggccuggccugcugggagcuucggugcugggacuggacgauauccaccgggcuuggcggaccuuuguucuccgggugagagcccaagacccuccgccggaacuguacuucgugaagguggcgaucaccggagccuaugauacuauuccgcaagaucgacucaccgaagucaucgccucgaucaucaaaccgcagaacacuuacugcgucaggcgguacgccgugguccagaaggccgcgcauggccacgugagaaaggcguucaagucgcacguguccacucucaccgaccuccagccuuacaugaggcaauucguugcgcauuugcaagagacuucgccccugagagaugcgguggucaucgagcagagcuccagccugaacgaagcgagcagcggucuguuugacguguuccuccgcuucaugugucaucacgcggugcgaaucaggggaaaaucauacgugcagugccagggaaucccacaaggcagcauucugucgacucucuuguguucccuuugcuacggcgauauggaaaacaagcuguucgcugggaucagacgggacggguugcugcucagacugguggacgacuuccugcuggugacuccgcaccucacucacgccaaaaccuuucuccgcacucuggugaggggagugccagaauacggcuguguggucaaucuccggaaaacuguggugaauuucccugucgaggaugaggcacucggaggaaccgcauuuguccaaaugccagcacauggccuguucccauggugcggucugcugcuggacacccgaacucuugaagugcaguccgacuacuccagcuaugcccggacgagcauccgcgccagccucacuuucaaucgcggcuuuaaggccggacgaaacaugcgcagaaagcuuuucggaguccuccggcuuaaaugccauucgcucuuucucgaucuccaagucaauucgcugcagaccgugugcacgaacaucuacaagauccugcugcuccaagccuaccgguuccacgcuugcgugcuucagcugccguuucaccaacagguguggaagaacccgaccuucuuucugcgggucauuagcgauacugccucccuguguuacucaauccucaaggcaaagaacgccggaaugucgcugggugcgaaaggagccgcgggaccucuuccuagcgaagcggugcaguggcucugccaccaggcuuuccuccugaagcugaccaggcacagagugaccuacgucccgcugcugggcucgcugcgcacugcacagacccagcugucuagaaaacuccccggcaccacccugaccgcucuggaagccgccgccaacccagcauugccgucagauuucaagaccaucuuggacggauccggccagugcaccaauuacgcccugcugaagcuggccggcgacguggaaucuaacccuggcccugaaucgccaagcgcacccccucaucgguggugcaucccuuggcaacgccuccuccugaccgccucacugcugacuuucuggaacccgccgaccaccgcaaagcugaccauugagagcacucccuucaacguggcugaggggaaggaggugcugcuccuggugcacaaucugccccagcaccuguucggguacuccugguacaagggagaacgcguggacgggaaccggcagaucauaggcuacgucaucggaacccagcaggccacacccgguccagcguacagcggccgggagauuaucuacccgaacgccucccugcugauccaaaacaucauccagaacgacaccgguuucuacacucugcacgugauuaagucagaucuggucaacgaagaggccaccggccaauucaggguguaccccgaacucccuaagccguucaucaccucgaacaacagcaacccggucgaggaugaagaugcgguggccuugacgugcgaaccugagauccagaacaccaccuacuuguggugggugaacaaucagagccugccagucuccccacgacuccagcugucgaacgacaacaggacccugacuuugcuguccgugacucggaacgacgugggcccuuaugaaugcgguauccagaacaagcuguccguggaccacagcgacccugugauccugaacguccuuuacgggccggacgaccccaccauuuccccgucguacacuuacuaccggccgggcgugaaccugucccugucgugccacgcugccuccaauccgccggcccaguacuccuggcucaucgacggaaacauccagcagcacacccaagaacuguucaucuccaacauuaccgagaaaaacucgggacuuuacaccugucaagccaacaauuccgccagcggccacucccgcaccacugucaaaacuaucacuguguccgccgaacucccgaagcccagcaucagcuccaacaacucgaagcccguggaggauaaggacgcugucgcguucaccugugaaccagaggcacagaauaccaccuaccuuuggugggucaacggacagucccugccugucucaccgagacugcagcugucaaacgggaauaggacucugaccuuguuuaacgucacccggaacgacgcccgggccuacgugugcggcauccagaacuccgugagcgcaaaccggucugacccagugacccuggaugugcuguacggccccgacacuccgaucauuucaccccccgauucauccuaccuguccggcgcuaaccucaaccucucaugccacuccgcauccaaccccagcccgcaauauucguggcgcauuaacggaauuccucagcaacauacccagguccuguucauugcgaagaucaccccuaacaacaacggaaccuacgccugcuuugugucaaaccuggccacugguagaaacaacuccaucgugaaguccauuaccgugucggcguccggaacuuccccgggccugagcgccggcgccaccgugggaauuaugaucggcgugcucgugggaguggcccugauc
SEQ ID NO:88.质粒1425ORF(RNA)
auggcuagcgaaucgccaagcgcacccccucaucgguggugcaucccuuggcaacgccuccuccugaccgccucacugcugacuuucuggaacccgccgaccaccgcaaagcugaccauugagagcacucccuucaacguggcugaggggaaggaggugcugcuccuggugcacaaucugccccagcaccuguucggguacuccugguacaagggagaacgcguggacgggaaccggcagaucauaggcuacgucaucggaacccagcaggccacacccgguccagcguacagcggccgggagauuaucuacccgaacgccucccugcugauccaaaacaucauccagaacgacaccgguuucuacacucugcacgugauuaagucagaucuggucaacgaagaggccaccggccaauucaggguguaccccgaacucccuaagccguucaucaccucgaacaacagcaacccggucgaggaugaagaugcgguggccuugacgugcgaaccugagauccagaacaccaccuacuuguggugggugaacaaucagagccugccagucuccccacgacuccagcugucgaacgacaacaggacccugacuuugcuguccgugacucggaacgacgugggcccuuaugaaugcgguauccagaacaagcuguccguggaccacagcgacccugugauccugaacguccuuuacgggccggacgaccccaccauuuccccgucguacacuuacuaccggccgggcgugaaccugucccugucgugccacgcugccuccaauccgccggcccaguacuccuggcucaucgacggaaacauccagcagcacacccaagaacuguucaucuccaacauuaccgagaaaaacucgggacuuuacaccugucaagccaacaauuccgccagcggccacucccgcaccacugucaaaacuaucacuguguccgccgaacucccgaagcccagcaucagcuccaacaacucgaagcccguggaggauaaggacgcugucgcguucaccugugaaccagaggcacagaauaccaccuaccuuuggugggucaacggacagucccugccugucucaccgagacugcagcugucaaacgggaauaggacucugaccuuguuuaacgucacccggaacgacgcccgggccuacgugugcggcauccagaacuccgugagcgcaaaccggucugacccagugacccuggaugugcuguacggccccgacacuccgaucauuucaccccccgauucauccuaccuguccggcgcuaaccucaaccucucaugccacuccgcauccaaccccagcccgcaauauucguggcgcauuaacggaauuccucagcaacauacccagguccuguucauugcgaagaucaccccuaacaacaacggaaccuacgccugcuuugugucaaaccuggccacugguagaaacaacuccaucgugaaguccauuaccgugucggcguccggaacuuccccgggccugagcgccggcgccaccgugggaauuaugaucggcgugcucgugggaguggcccugaucggauccggcgagggcagaggcagccugcugacauguggcgacguggaagagaacccuggccccaccccuggaacccagagccccuucuuccuucugcugcugcugaccgugcugacugucgugacaggcucuggccacgccagcucuacaccuggcggcgagaaagagacaagcgccacccagagaagcagcgugccaagcagcaccgagaagaacgccguguccaugaccagcuccgugcugagcagccacucuccuggcagcggcagcagcacaacacagggccaggaugugacacuggccccugccacagaaccugccucuggaucugccgccaccuggggacaggacgugacaagcgugccagugaccagaccugcccugggcucuacaacacccccugcccacgaugugaccagcgccccugauaacaagccugccccuggaagcacagccccuccagcucauggcgugaccucugccccagauaccagaccagccccaggaucuacagccccacccgcacacggcgugacaagugccccugacacaagacccgcuccaggcucuacugcuccuccugcccauggcgugacaagcgcucccgauacaaggccagcuccuggcuccacagcaccaccagcacauggcgugacaucagcucccgacacuagaccugcucccggaucaaccgcuccaccagcucacggcgugaccagcgcaccugauaccagaccugcucugggaagcaccgccccucccgugcacaaugugacaucugcuuccggcagcgccagcggcucugccucuacacuggugcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccuucagcaucccuagccaccacagcgacaccccuaccacacuggccagccacuccaccaagaccgaugccucuagcacccaccacuccagcgugcccccucugaccagcagcaaccacagcacaagcccccagcugucuaccggcgucucauucuucuuucuguccuuccacaucagcaaccugcaguucaacagcagccuggaagaucccagcaccgacuacuaccaggaacugcagcgggauaucagcgagauguuccugcaaaucuacaagcagggcggcuuccugggccugagcaacaucaaguucagacccggcagcgugguggugcagcugacccuggcuuuccgggaaggcaccaucaacgugcacgacguggaaacccaguucaaccaguacaagaccgaggccgccagccgguacaaccugaccaucuccgauguguccguguccgacgugcccuucccauucucugcccagucuggcgcaggcgugccaggauggggaauugcucugcuggugcucgugugcgugcugguggcccuggccaucguguaucugauugcccuggccgugugccagugccggcggaagaauuacggccagcuggacaucuuccccgccagagacaccuaccaccccaugagcgaguaccccacauaccacacccacggcagauacgugccacccagcuccaccgacagaucccccuacgagaaagugucugccggcaacggcggcagcucccugagcuacacaaauccugccguggccgcugccuccgccaaccugggauccggcacaauccugucugagggcgccaccaacuucagccugcugaaacuggccggcgacguggaacugaacccuggcccuggagcugccccggagccggagaggacccccguuggccagggaucgugggcccauccgggacgcaccaggggaccauccgacaggggauucuguguggugucaccggccaggccagcagaagaggcaaccagccucgagggagcguugucuggaaccagacauucccacccgucggugggccggcagcaccacgcgggaccaccguccacuuccagaccgccacggccaugggacaccccuugcccgccuguguaugccgagacuaaacacuuccuguacucauccggagacaaggaacagcuucggccguccuuccuccugucgucgcucagaccgagccugaccggagcacgcagauugguggaaacuaucuuccuugggucacguccguggaugccagguaccccacggcgccucccgcgccucccacagagauacuggcagaugcggccucuguuccuggaauugcugggaaaccacgcucagugcccguacggaguccugcucaagacucacugcccucugagggcggcggucacuccggcggccggagugugcgcacgggagaagccccagggaagcguggcagcuccggaagaggaggacaccgauccgcgccgccucgugcaacuucugcgccagcacuccucgcccuggcaagucuacggguucguccgcgccugccugcgccgccuggugccgccugggcucugggguucccggcauaacgagcgccgcuuccugagaaauacuaagaaguuuaucucacuuggaaaacaugccaaguugucgcugcaagaacucacguggaagaugucaguccgcgauugcgccuggcugcgccgcucgccgggcgucggguguguuccagcugcagaacaccgccugagagaagaaauucuggccaaauuucugcauuggcugaugucaguguacguggucgagcugcugcgcuccuuuuucuacgucacugagacuaccuuucaaaagaaccgccuguucuucuaccgcaaaucuguguggagcaagcugcagucaaucggcauucgccagcaucugaagagggugcagcugcgggaacuuuccgaggcagaaguccgccagcaccgggaggcccggccggcgcuucucacgucgcgucugagauucaucccaaagcccgacgggcugaggccuaucgucaacauggauuacgucgugggcgcucgcaccuuucgccgugaaaagcgggccgaacgcuugaccucacgggugaaggcccucuucuccgugcugaacuacgagagagcaagacggccuggccugcugggagcuucggugcugggacuggacgauauccaccgggcuuggcggaccuuuguucuccgggugagagcccaagacccuccgccggaacuguacuucgugaagguggcgaucaccggagccuaugauacuauuccgcaagaucgacucaccgaagucaucgccucgaucaucaaaccgcagaacacuuacugcgucaggcgguacgccgugguccagaaggccgcgcauggccacgugagaaaggcguucaagucgcacguguccacucucaccgaccuccagccuuacaugaggcaauucguugcgcauuugcaagagacuucgccccugagagaugcgguggucaucgagcagagcuccagccugaacgaagcgagcagcggucuguuugacguguuccuccgcuucaugugucaucacgcggugcgaaucaggggaaaaucauacgugcagugccagggaaucccacaaggcagcauucugucgacucucuuguguucccuuugcuacggcgauauggaaaacaagcuguucgcugggaucagacgggacggguugcugcucagacugguggacgacuuccugcuggugacuccgcaccucacucacgccaaaaccuuucuccgcacucuggugaggggagugccagaauacggcuguguggucaaucuccggaaaacuguggugaauuucccugucgaggaugaggcacucggaggaaccgcauuuguccaaaugccagcacauggccuguucccauggugcggucugcugcuggacacccgaacucuugaagugcaguccgacuacuccagcuaugcccggacgagcauccgcgccagccucacuuucaaucgcggcuuuaaggccggacgaaacaugcgcagaaagcuuuucggaguccuccggcuuaaaugccauucgcucuuucucgaucuccaagucaauucgcugcagaccgugugcacgaacaucuacaagauccugcugcuccaagccuaccgguuccacgcuugcgugcuucagcugccguuucaccaacagguguggaagaacccgaccuucuuucugcgggucauuagcgauacugccucccuguguuacucaauccucaaggcaaagaacgccggaaugucgcugggugcgaaaggagccgcgggaccucuuccuagcgaagcggugcaguggcucugccaccaggcuuuccuccugaagcugaccaggcacagagugaccuacgucccgcugcugggcucgcugcgcacugcacagacccagcugucuagaaaacuccccggcaccacccugaccgcucuggaagccgccgccaacccagcauugccgucagauuucaagaccaucuuggac
SEQ ID NO:89.质粒1426ORF(RNA)
auggcuagcggagcugccccggagccggagaggacccccguuggccagggaucgugggcccauccgggacgcaccaggggaccauccgacaggggauucuguguggugucaccggccaggccagcagaagaggcaaccagccucgagggagcguugucuggaaccagacauucccacccgucggugggccggcagcaccacgcgggaccaccguccacuuccagaccgccacggccaugggacaccccuugcccgccuguguaugccgagacuaaacacuuccuguacucauccggagacaaggaacagcuucggccguccuuccuccugucgucgcucagaccgagccugaccggagcacgcagauugguggaaacuaucuuccuugggucacguccguggaugccagguaccccacggcgccucccgcgccucccacagagauacuggcagaugcggccucuguuccuggaauugcugggaaaccacgcucagugcccguacggaguccugcucaagacucacugcccucugagggcggcggucacuccggcggccggagugugcgcacgggagaagccccagggaagcguggcagcuccggaagaggaggacaccgauccgcgccgccucgugcaacuucugcgccagcacuccucgcccuggcaagucuacggguucguccgcgccugccugcgccgccuggugccgccugggcucugggguucccggcauaacgagcgccgcuuccugagaaauacuaagaaguuuaucucacuuggaaaacaugccaaguugucgcugcaagaacucacguggaagaugucaguccgcgauugcgccuggcugcgccgcucgccgggcgucggguguguuccagcugcagaacaccgccugagagaagaaauucuggccaaauuucugcauuggcugaugucaguguacguggucgagcugcugcgcuccuuuuucuacgucacugagacuaccuuucaaaagaaccgccuguucuucuaccgcaaaucuguguggagcaagcugcagucaaucggcauucgccagcaucugaagagggugcagcugcgggaacuuuccgaggcagaaguccgccagcaccgggaggcccggccggcgcuucucacgucgcgucugagauucaucccaaagcccgacgggcugaggccuaucgucaacauggauuacgucgugggcgcucgcaccuuucgccgugaaaagcgggccgaacgcuugaccucacgggugaaggcccucuucuccgugcugaacuacgagagagcaagacggccuggccugcugggagcuucggugcugggacuggacgauauccaccgggcuuggcggaccuuuguucuccgggugagagcccaagacccuccgccggaacuguacuucgugaagguggcgaucaccggagccuaugauacuauuccgcaagaucgacucaccgaagucaucgccucgaucaucaaaccgcagaacacuuacugcgucaggcgguacgccgugguccagaaggccgcgcauggccacgugagaaaggcguucaagucgcacguguccacucucaccgaccuccagccuuacaugaggcaauucguugcgcauuugcaagagacuucgccccugagagaugcgguggucaucgagcagagcuccagccugaacgaagcgagcagcggucuguuugacguguuccuccgcuucaugugucaucacgcggugcgaaucaggggaaaaucauacgugcagugccagggaaucccacaaggcagcauucugucgacucucuuguguucccuuugcuacggcgauauggaaaacaagcuguucgcugggaucagacgggacggguugcugcucagacugguggacgacuuccugcuggugacuccgcaccucacucacgccaaaaccuuucuccgcacucuggugaggggagugccagaauacggcuguguggucaaucuccggaaaacuguggugaauuucccugucgaggaugaggcacucggaggaaccgcauuuguccaaaugccagcacauggccuguucccauggugcggucugcugcuggacacccgaacucuugaagugcaguccgacuacuccagcuaugcccggacgagcauccgcgccagccucacuuucaaucgcggcuuuaaggccggacgaaacaugcgcagaaagcuuuucggaguccuccggcuuaaaugccauucgcucuuucucgaucuccaagucaauucgcugcagaccgugugcacgaacaucuacaagauccugcugcuccaagccuaccgguuccacgcuugcgugcuucagcugccguuucaccaacagguguggaagaacccgaccuucuuucugcgggucauuagcgauacugccucccuguguuacucaauccucaaggcaaagaacgccggaaugucgcugggugcgaaaggagccgcgggaccucuuccuagcgaagcggugcaguggcucugccaccaggcuuuccuccugaagcugaccaggcacagagugaccuacgucccgcugcugggcucgcugcgcacugcacagacccagcugucuagaaaacuccccggcaccacccugaccgcucuggaagccgccgccaacccagcauugccgucagauuucaagaccaucuuggacggauccggcacaauccugucugagggcgccaccaacuucagccugcugaaacuggccggcgacguggaacugaacccuggcccuaccccuggaacccagagccccuucuuccuucugcugcugcugaccgugcugacugucgugacaggcucuggccacgccagcucuacaccuggcggcgagaaagagacaagcgccacccagagaagcagcgugccaagcagcaccgagaagaacgccguguccaugaccagcuccgugcugagcagccacucuccuggcagcggcagcagcacaacacagggccaggaugugacacuggccccugccacagaaccugccucuggaucugccgccaccuggggacaggacgugacaagcgugccagugaccagaccugcccugggcucuacaacacccccugcccacgaugugaccagcgccccugauaacaagccugccccuggaagcacagccccuccagcucauggcgugaccucugccccagauaccagaccagccccaggaucuacagccccacccgcacacggcgugacaagugccccugacacaagacccgcuccaggcucuacugcuccuccugcccauggcgugacaagcgcucccgauacaaggccagcuccuggcuccacagcaccaccagcacauggcgugacaucagcucccgacacuagaccugcucccggaucaaccgcuccaccagcucacggcgugaccagcgcaccugauaccagaccugcucugggaagcaccgccccucccgugcacaaugugacaucugcuuccggcagcgccagcggcucugccucuacacuggugcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccuucagcaucccuagccaccacagcgacaccccuaccacacuggccagccacuccaccaagaccgaugccucuagcacccaccacuccagcgugcccccucugaccagcagcaaccacagcacaagcccccagcugucuaccggcgucucauucuucuuucuguccuuccacaucagcaaccugcaguucaacagcagccuggaagaucccagcaccgacuacuaccaggaacugcagcgggauaucagcgagauguuccugcaaaucuacaagcagggcggcuuccugggccugagcaacaucaaguucagacccggcagcgugguggugcagcugacccuggcuuuccgggaaggcaccaucaacgugcacgacguggaaacccaguucaaccaguacaagaccgaggccgccagccgguacaaccugaccaucuccgauguguccguguccgacgugcccuucccauucucugcccagucuggcgcaggcgugccaggauggggaauugcucugcuggugcucgugugcgugcugguggcccuggccaucguguaucugauugcccuggccgugugccagugccggcggaagaauuacggccagcuggacaucuuccccgccagagacaccuaccaccccaugagcgaguaccccacauaccacacccacggcagauacgugccacccagcuccaccgacagaucccccuacgagaaagugucugccggcaacggcggcagcucccugagcuacacaaauccugccguggccgcugccuccgccaaccugggauccggcagaaucuucaacgcccacuacgccggcuacuucgccgaccugcugauccacgacaucgagacaaacccuggccccgaaucgccaagcgcacccccucaucgguggugcaucccuuggcaacgccuccuccugaccgccucacugcugacuuucuggaacccgccgaccaccgcaaagcugaccauugagagcacucccuucaacguggcugaggggaaggaggugcugcuccuggugcacaaucugccccagcaccuguucggguacuccugguacaagggagaacgcguggacgggaaccggcagaucauaggcuacgucaucggaacccagcaggccacacccgguccagcguacagcggccgggagauuaucuacccgaacgccucccugcugauccaaaacaucauccagaacgacaccgguuucuacacucugcacgugauuaagucagaucuggucaacgaagaggccaccggccaauucaggguguaccccgaacucccuaagccguucaucaccucgaacaacagcaacccggucgaggaugaagaugcgguggccuugacgugcgaaccugagauccagaacaccaccuacuuguggugggugaacaaucagagccugccagucuccccacgacuccagcugucgaacgacaacaggacccugacuuugcuguccgugacucggaacgacgugggcccuuaugaaugcgguauccagaacaagcuguccguggaccacagcgacccugugauccugaacguccuuuacgggccggacgaccccaccauuuccccgucguacacuuacuaccggccgggcgugaaccugucccugucgugccacgcugccuccaauccgccggcccaguacuccuggcucaucgacggaaacauccagcagcacacccaagaacuguucaucuccaacauuaccgagaaaaacucgggacuuuacaccugucaagccaacaauuccgccagcggccacucccgcaccacugucaaaacuaucacuguguccgccgaacucccgaagcccagcaucagcuccaacaacucgaagcccguggaggauaaggacgcugucgcguucaccugugaaccagaggcacagaauaccaccuaccuuuggugggucaacggacagucccugccugucucaccgagacugcagcugucaaacgggaauaggacucugaccuuguuuaacgucacccggaacgacgcccgggccuacgugugcggcauccagaacuccgugagcgcaaaccggucugacccagugacccuggaugugcuguacggccccgacacuccgaucauuucaccccccgauucauccuaccuguccggcgcuaaccucaaccucucaugccacuccgcauccaaccccagcccgcaauauucguggcgcauuaacggaauuccucagcaacauacccagguccuguucauugcgaagaucaccccuaacaacaacggaaccuacgccugcuuugugucaaaccuggccacugguagaaacaacuccaucgugaaguccauuaccgugucggcguccggaacuuccccgggccugagcgccggcgccaccgugggaauuaugaucggcgugcucgugggaguggcccugauc
SEQ ID NO:90.质粒1427ORF(RNA)
auggcuagcggagcugccccggagccggagaggacccccguuggccagggaucgugggcccauccgggacgcaccaggggaccauccgacaggggauucuguguggugucaccggccaggccagcagaagaggcaaccagccucgagggagcguugucuggaaccagacauucccacccgucggugggccggcagcaccacgcgggaccaccguccacuuccagaccgccacggccaugggacaccccuugcccgccuguguaugccgagacuaaacacuuccuguacucauccggagacaaggaacagcuucggccguccuuccuccugucgucgcucagaccgagccugaccggagcacgcagauugguggaaacuaucuuccuugggucacguccguggaugccagguaccccacggcgccucccgcgccucccacagagauacuggcagaugcggccucuguuccuggaauugcugggaaaccacgcucagugcccguacggaguccugcucaagacucacugcccucugagggcggcggucacuccggcggccggagugugcgcacgggagaagccccagggaagcguggcagcuccggaagaggaggacaccgauccgcgccgccucgugcaacuucugcgccagcacuccucgcccuggcaagucuacggguucguccgcgccugccugcgccgccuggugccgccugggcucugggguucccggcauaacgagcgccgcuuccugagaaauacuaagaaguuuaucucacuuggaaaacaugccaaguugucgcugcaagaacucacguggaagaugucaguccgcgauugcgccuggcugcgccgcucgccgggcgucggguguguuccagcugcagaacaccgccugagagaagaaauucuggccaaauuucugcauuggcugaugucaguguacguggucgagcugcugcgcuccuuuuucuacgucacugagacuaccuuucaaaagaaccgccuguucuucuaccgcaaaucuguguggagcaagcugcagucaaucggcauucgccagcaucugaagagggugcagcugcgggaacuuuccgaggcagaaguccgccagcaccgggaggcccggccggcgcuucucacgucgcgucugagauucaucccaaagcccgacgggcugaggccuaucgucaacauggauuacgucgugggcgcucgcaccuuucgccgugaaaagcgggccgaacgcuugaccucacgggugaaggcccucuucuccgugcugaacuacgagagagcaagacggccuggccugcugggagcuucggugcugggacuggacgauauccaccgggcuuggcggaccuuuguucuccgggugagagcccaagacccuccgccggaacuguacuucgugaagguggcgaucaccggagccuaugauacuauuccgcaagaucgacucaccgaagucaucgccucgaucaucaaaccgcagaacacuuacugcgucaggcgguacgccgugguccagaaggccgcgcauggccacgugagaaaggcguucaagucgcacguguccacucucaccgaccuccagccuuacaugaggcaauucguugcgcauuugcaagagacuucgccccugagagaugcgguggucaucgagcagagcuccagccugaacgaagcgagcagcggucuguuugacguguuccuccgcuucaugugucaucacgcggugcgaaucaggggaaaaucauacgugcagugccagggaaucccacaaggcagcauucugucgacucucuuguguucccuuugcuacggcgauauggaaaacaagcuguucgcugggaucagacgggacggguugcugcucagacugguggacgacuuccugcuggugacuccgcaccucacucacgccaaaaccuuucuccgcacucuggugaggggagugccagaauacggcuguguggucaaucuccggaaaacuguggugaauuucccugucgaggaugaggcacucggaggaaccgcauuuguccaaaugccagcacauggccuguucccauggugcggucugcugcuggacacccgaacucuugaagugcaguccgacuacuccagcuaugcccggacgagcauccgcgccagccucacuuucaaucgcggcuuuaaggccggacgaaacaugcgcagaaagcuuuucggaguccuccggcuuaaaugccauucgcucuuucucgaucuccaagucaauucgcugcagaccgugugcacgaacaucuacaagauccugcugcuccaagccuaccgguuccacgcuugcgugcuucagcugccguuucaccaacagguguggaagaacccgaccuucuuucugcgggucauuagcgauacugccucccuguguuacucaauccucaaggcaaagaacgccggaaugucgcugggugcgaaaggagccgcgggaccucuuccuagcgaagcggugcaguggcucugccaccaggcuuuccuccugaagcugaccaggcacagagugaccuacgucccgcugcugggcucgcugcgcacugcacagacccagcugucuagaaaacuccccggcaccacccugaccgcucuggaagccgccgccaacccagcauugccgucagauuucaagaccaucuuggacggauccggccagugcaccaauuacgcccugcugaagcuggccggcgacguggaaucuaacccuggcccugaaucgccaagcgcacccccucaucgguggugcaucccuuggcaacgccuccuccugaccgccucacugcugacuuucuggaacccgccgaccaccgcaaagcugaccauugagagcacucccuucaacguggcugaggggaaggaggugcugcuccuggugcacaaucugccccagcaccuguucggguacuccugguacaagggagaacgcguggacgggaaccggcagaucauaggcuacgucaucggaacccagcaggccacacccgguccagcguacagcggccgggagauuaucuacccgaacgccucccugcugauccaaaacaucauccagaacgacaccgguuucuacacucugcacgugauuaagucagaucuggucaacgaagaggccaccggccaauucaggguguaccccgaacucccuaagccguucaucaccucgaacaacagcaacccggucgaggaugaagaugcgguggccuugacgugcgaaccugagauccagaacaccaccuacuuguggugggugaacaaucagagccugccagucuccccacgacuccagcugucgaacgacaacaggacccugacuuugcuguccgugacucggaacgacgugggcccuuaugaaugcgguauccagaacaagcuguccguggaccacagcgacccugugauccugaacguccuuuacgggccggacgaccccaccauuuccccgucguacacuuacuaccggccgggcgugaaccugucccugucgugccacgcugccuccaauccgccggcccaguacuccuggcucaucgacggaaacauccagcagcacacccaagaacuguucaucuccaacauuaccgagaaaaacucgggacuuuacaccugucaagccaacaauuccgccagcggccacucccgcaccacugucaaaacuaucacuguguccgccgaacucccgaagcccagcaucagcuccaacaacucgaagcccguggaggauaaggacgcugucgcguucaccugugaaccagaggcacagaauaccaccuaccuuuggugggucaacggacagucccugccugucucaccgagacugcagcugucaaacgggaauaggacucugaccuuguuuaacgucacccggaacgacgcccgggccuacgugugcggcauccagaacuccgugagcgcaaaccggucugacccagugacccuggaugugcuguacggccccgacacuccgaucauuucaccccccgauucauccuaccuguccggcgcuaaccucaaccucucaugccacuccgcauccaaccccagcccgcaauauucguggcgcauuaacggaauuccucagcaacauacccagguccuguucauugcgaagaucaccccuaacaacaacggaaccuacgccugcuuugugucaaaccuggccacugguagaaacaacuccaucgugaaguccauuaccgugucggcguccggaacuuccccgggccugagcgccggcgccaccgugggaauuaugaucggcgugcucgugggaguggcccugaucggauccggcgagggcagaggcagccugcugacauguggcgacguggaagagaacccuggccccaccccuggaacccagagccccuucuuccuucugcugcugcugaccgugcugacugucgugacaggcucuggccacgccagcucuacaccuggcggcgagaaagagacaagcgccacccagagaagcagcgugccaagcagcaccgagaagaacgccguguccaugaccagcuccgugcugagcagccacucuccuggcagcggcagcagcacaacacagggccaggaugugacacuggccccugccacagaaccugccucuggaucugccgccaccuggggacaggacgugacaagcgugccagugaccagaccugcccugggcucuacaacacccccugcccacgaugugaccagcgccccugauaacaagccugccccuggaagcacagccccuccagcucauggcgugaccucugccccagauaccagaccagccccaggaucuacagccccacccgcacacggcgugacaagugccccugacacaagacccgcuccaggcucuacugcuccuccugcccauggcgugacaagcgcucccgauacaaggccagcuccuggcuccacagcaccaccagcacauggcgugacaucagcucccgacacuagaccugcucccggaucaaccgcuccaccagcucacggcgugaccagcgcaccugauaccagaccugcucugggaagcaccgccccucccgugcacaaugugacaucugcuuccggcagcgccagcggcucugccucuacacuggugcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccuucagcaucccuagccaccacagcgacaccccuaccacacuggccagccacuccaccaagaccgaugccucuagcacccaccacuccagcgugcccccucugaccagcagcaaccacagcacaagcccccagcugucuaccggcgucucauucuucuuucuguccuuccacaucagcaaccugcaguucaacagcagccuggaagaucccagcaccgacuacuaccaggaacugcagcgggauaucagcgagauguuccugcaaaucuacaagcagggcggcuuccugggccugagcaacaucaaguucagacccggcagcgugguggugcagcugacccuggcuuuccgggaaggcaccaucaacgugcacgacguggaaacccaguucaaccaguacaagaccgaggccgccagccgguacaaccugaccaucuccgauguguccguguccgacgugcccuucccauucucugcccagucuggcgcaggcgugccaggauggggaauugcucugcuggugcucgugugcgugcugguggcccuggccaucguguaucugauugcccuggccgugugccagugccggcggaagaauuacggccagcuggacaucuuccccgccagagacaccuaccaccccaugagcgaguaccccacauaccacacccacggcagauacgugccacccagcuccaccgacagaucccccuacgagaaagugucugccggcaacggcggcagcucccugagcuacacaaauccugccguggccgcugccuccgccaaccug
SEQ ID NO:91.质粒1428ORF(RNA)
auggcuagcaccccuggaacccagagccccuucuuccuucugcugcugcugaccgugcugacugucgugacaggcucuggccacgccagcucuacaccuggcggcgagaaagagacaagcgccacccagagaagcagcgugccaagcagcaccgagaagaacgccguguccaugaccagcuccgugcugagcagccacucuccuggcagcggcagcagcacaacacagggccaggaugugacacuggccccugccacagaaccugccucuggaucugccgccaccuggggacaggacgugacaagcgugccagugaccagaccugcccugggcucuacaacacccccugcccacgaugugaccagcgccccugauaacaagccugccccuggaagcacagccccuccagcucauggcgugaccucugccccagauaccagaccagccccaggaucuacagccccacccgcacacggcgugacaagugccccugacacaagacccgcuccaggcucuacugcuccuccugcccauggcgugacaagcgcucccgauacaaggccagcuccuggcuccacagcaccaccagcacauggcgugacaucagcucccgacacuagaccugcucccggaucaaccgcuccaccagcucacggcgugaccagcgcaccugauaccagaccugcucugggaagcaccgccccucccgugcacaaugugacaucugcuuccggcagcgccagcggcucugccucuacacuggugcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccuucagcaucccuagccaccacagcgacaccccuaccacacuggccagccacuccaccaagaccgaugccucuagcacccaccacuccagcgugcccccucugaccagcagcaaccacagcacaagcccccagcugucuaccggcgucucauucuucuuucuguccuuccacaucagcaaccugcaguucaacagcagccuggaagaucccagcaccgacuacuaccaggaacugcagcgggauaucagcgagauguuccugcaaaucuacaagcagggcggcuuccugggccugagcaacaucaaguucagacccggcagcgugguggugcagcugacccuggcuuuccgggaaggcaccaucaacgugcacgacguggaaacccaguucaaccaguacaagaccgaggccgccagccgguacaaccugaccaucuccgauguguccguguccgacgugcccuucccauucucugcccagucuggcgcaggcgugccaggauggggaauugcucugcuggugcucgugugcgugcugguggcccuggccaucguguaucugauugcccuggccgugugccagugccggcggaagaauuacggccagcuggacaucuuccccgccagagacaccuaccaccccaugagcgaguaccccacauaccacacccacggcagauacgugccacccagcuccaccgacagaucccccuacgagaaagugucugccggcaacggcggcagcucccugagcuacacaaauccugccguggccgcugccuccgccaaccugggauccggcagaaucuucaacgcccacuacgccggcuacuucgccgaccugcugauccacgacaucgagacaaacccuggccccaagcugaccauugagagcacucccuucaacguggcugaggggaaggaggugcugcuccuggugcacaaucugccccagcaccuguucggguacuccugguacaagggagaacgcguggacgggaaccggcagaucauaggcuacgucaucggaacccagcaggccacacccgguccagcguacagcggccgggagauuaucuacccgaacgccucccugcugauccaaaacaucauccagaacgacaccgguuucuacacucugcacgugauuaagucagaucuggucaacgaagaggccaccggccaauucaggguguaccccgaacucccuaagccguucaucaccucgaacaacagcaacccggucgaggaugaagaugcgguggccuugacgugcgaaccugagauccagaacaccaccuacuuguggugggugaacaaucagagccugccagucuccccacgacuccagcugucgaacgacaacaggacccugacuuugcuguccgugacucggaacgacgugggcccuuaugaaugcgguauccagaacaagcuguccguggaccacagcgacccugugauccugaacguccuuuacgggccggacgaccccaccauuuccccgucguacacuuacuaccggccgggcgugaaccugucccugucgugccacgcugccuccaauccgccggcccaguacuccuggcucaucgacggaaacauccagcagcacacccaagaacuguucaucuccaacauuaccgagaaaaacucgggacuuuacaccugucaagccaacaauuccgccagcggccacucccgcaccacugucaaaacuaucacuguguccgccgaacucccgaagcccagcaucagcuccaacaacucgaagcccguggaggauaaggacgcugucgcguucaccugugaaccagaggcacagaauaccaccuaccuuuggugggucaacggacagucccugccugucucaccgagacugcagcugucaaacgggaauaggacucugaccuuguuuaacgucacccggaacgacgcccgggccuacgugugcggcauccagaacuccgugagcgcaaaccggucugacccagugacccuggaugugcuguacggccccgacacuccgaucauuucaccccccgauucauccuaccuguccggcgcuaaccucaaccucucaugccacuccgcauccaaccccagcccgcaauauucguggcgcauuaacggaauuccucagcaacauacccagguccuguucauugcgaagaucaccccuaacaacaacggaaccuacgccugcuuugugucaaaccuggccacugguagaaacaacuccaucgugaaguccauuaccgugucggcguccggauccggcgagggcagaggcagccugcugacauguggcgacguggaagagaacccuggccccggagcugccccggagccggagaggacccccguuggccagggaucgugggcccauccgggacgcaccaggggaccauccgacaggggauucuguguggugucaccggccaggccagcagaagaggcaaccagccucgagggagcguugucuggaaccagacauucccacccgucggugggccggcagcaccacgcgggaccaccguccacuuccagaccgccacggccaugggacaccccuugcccgccuguguaugccgagacuaaacacuuccuguacucauccggagacaaggaacagcuucggccguccuuccuccugucgucgcucagaccgagccugaccggagcacgcagauugguggaaacuaucuuccuugggucacguccguggaugccagguaccccacggcgccucccgcgccucccacagagauacuggcagaugcggccucuguuccuggaauugcugggaaaccacgcucagugcccguacggaguccugcucaagacucacugcccucugagggcggcggucacuccggcggccggagugugcgcacgggagaagccccagggaagcguggcagcuccggaagaggaggacaccgauccgcgccgccucgugcaacuucugcgccagcacuccucgcccuggcaagucuacggguucguccgcgccugccugcgccgccuggugccgccugggcucugggguucccggcauaacgagcgccgcuuccugagaaauacuaagaaguuuaucucacuuggaaaacaugccaaguugucgcugcaagaacucacguggaagaugucaguccgcgauugcgccuggcugcgccgcucgccgggcgucggguguguuccagcugcagaacaccgccugagagaagaaauucuggccaaauuucugcauuggcugaugucaguguacguggucgagcugcugcgcuccuuuuucuacgucacugagacuaccuuucaaaagaaccgccuguucuucuaccgcaaaucuguguggagcaagcugcagucaaucggcauucgccagcaucugaagagggugcagcugcgggaacuuuccgaggcagaaguccgccagcaccgggaggcccggccggcgcuucucacgucgcgucugagauucaucccaaagcccgacgggcugaggccuaucgucaacauggauuacgucgugggcgcucgcaccuuucgccgugaaaagcgggccgaacgcuugaccucacgggugaaggcccucuucuccgugcugaacuacgagagagcaagacggccuggccugcugggagcuucggugcugggacuggacgauauccaccgggcuuggcggaccuuuguucuccgggugagagcccaagacccuccgccggaacuguacuucgugaagguggcgaucaccggagccuaugauacuauuccgcaagaucgacucaccgaagucaucgccucgaucaucaaaccgcagaacacuuacugcgucaggcgguacgccgugguccagaaggccgcgcauggccacgugagaaaggcguucaagucgcacguguccacucucaccgaccuccagccuuacaugaggcaauucguugcgcauuugcaagagacuucgccccugagagaugcgguggucaucgagcagagcuccagccugaacgaagcgagcagcggucuguuugacguguuccuccgcuucaugugucaucacgcggugcgaaucaggggaaaaucauacgugcagugccagggaaucccacaaggcagcauucugucgacucucuuguguucccuuugcuacggcgauauggaaaacaagcuguucgcugggaucagacgggacggguugcugcucagacugguggacgacuuccugcuggugacuccgcaccucacucacgccaaaaccuuucuccgcacucuggugaggggagugccagaauacggcuguguggucaaucuccggaaaacuguggugaauuucccugucgaggaugaggcacucggaggaaccgcauuuguccaaaugccagcacauggccuguucccauggugcggucugcugcuggacacccgaacucuugaagugcaguccgacuacuccagcuaugcccggacgagcauccgcgccagccucacuuucaaucgcggcuuuaaggccggacgaaacaugcgcagaaagcuuuucggaguccuccggcuuaaaugccauucgcucuuucucgaucuccaagucaauucgcugcagaccgugugcacgaacaucuacaagauccugcugcuccaagccuaccgguuccacgcuugcgugcuucagcugccguuucaccaacagguguggaagaacccgaccuucuuucugcgggucauuagcgauacugccucccuguguuacucaauccucaaggcaaagaacgccggaaugucgcugggugcgaaaggagccgcgggaccucuuccuagcgaagcggugcaguggcucugccaccaggcuuuccuccugaagcugaccaggcacagagugaccuacgucccgcugcugggcucgcugcgcacugcacagacccagcugucuagaaaacuccccggcaccacccugaccgcucuggaagccgccgccaacccagcauugccgucagauuucaagaccaucuuggac
SEQ ID NO:92.质粒1429ORF(RNA)
auggcuagcaagcugaccauugagagcacucccuucaacguggcugaggggaaggaggugcugcuccuggugcacaaucugccccagcaccuguucggguacuccugguacaagggagaacgcguggacgggaaccggcagaucauaggcuacgucaucggaacccagcaggccacacccgguccagcguacagcggccgggagauuaucuacccgaacgccucccugcugauccaaaacaucauccagaacgacaccgguuucuacacucugcacgugauuaagucagaucuggucaacgaagaggccaccggccaauucaggguguaccccgaacucccuaagccguucaucaccucgaacaacagcaacccggucgaggaugaagaugcgguggccuugacgugcgaaccugagauccagaacaccaccuacuuguggugggugaacaaucagagccugccagucuccccacgacuccagcugucgaacgacaacaggacccugacuuugcuguccgugacucggaacgacgugggcccuuaugaaugcgguauccagaacaagcuguccguggaccacagcgacccugugauccugaacguccuuuacgggccggacgaccccaccauuuccccgucguacacuuacuaccggccgggcgugaaccugucccugucgugccacgcugccuccaauccgccggcccaguacuccuggcucaucgacggaaacauccagcagcacacccaagaacuguucaucuccaacauuaccgagaaaaacucgggacuuuacaccugucaagccaacaauuccgccagcggccacucccgcaccacugucaaaacuaucacuguguccgccgaacucccgaagcccagcaucagcuccaacaacucgaagcccguggaggauaaggacgcugucgcguucaccugugaaccagaggcacagaauaccaccuaccuuuggugggucaacggacagucccugccugucucaccgagacugcagcugucaaacgggaauaggacucugaccuuguuuaacgucacccggaacgacgcccgggccuacgugugcggcauccagaacuccgugagcgcaaaccggucugacccagugacccuggaugugcuguacggccccgacacuccgaucauuucaccccccgauucauccuaccuguccggcgcuaaccucaaccucucaugccacuccgcauccaaccccagcccgcaauauucguggcgcauuaacggaauuccucagcaacauacccagguccuguucauugcgaagaucaccccuaacaacaacggaaccuacgccugcuuugugucaaaccuggccacugguagaaacaacuccaucgugaaguccauuaccgugucggcguccggauccggcgagggcagaggcagccugcugacauguggcgacguggaagagaacccuggccccggagcugccccggagccggagaggacccccguuggccagggaucgugggcccauccgggacgcaccaggggaccauccgacaggggauucuguguggugucaccggccaggccagcagaagaggcaaccagccucgagggagcguugucuggaaccagacauucccacccgucggugggccggcagcaccacgcgggaccaccguccacuuccagaccgccacggccaugggacaccccuugcccgccuguguaugccgagacuaaacacuuccuguacucauccggagacaaggaacagcuucggccguccuuccuccugucgucgcucagaccgagccugaccggagcacgcagauugguggaaacuaucuuccuugggucacguccguggaugccagguaccccacggcgccucccgcgccucccacagagauacuggcagaugcggccucuguuccuggaauugcugggaaaccacgcucagugcccguacggaguccugcucaagacucacugcccucugagggcggcggucacuccggcggccggagugugcgcacgggagaagccccagggaagcguggcagcuccggaagaggaggacaccgauccgcgccgccucgugcaacuucugcgccagcacuccucgcccuggcaagucuacggguucguccgcgccugccugcgccgccuggugccgccugggcucugggguucccggcauaacgagcgccgcuuccugagaaauacuaagaaguuuaucucacuuggaaaacaugccaaguugucgcugcaagaacucacguggaagaugucaguccgcgauugcgccuggcugcgccgcucgccgggcgucggguguguuccagcugcagaacaccgccugagagaagaaauucuggccaaauuucugcauuggcugaugucaguguacguggucgagcugcugcgcuccuuuuucuacgucacugagacuaccuuucaaaagaaccgccuguucuucuaccgcaaaucuguguggagcaagcugcagucaaucggcauucgccagcaucugaagagggugcagcugcgggaacuuuccgaggcagaaguccgccagcaccgggaggcccggccggcgcuucucacgucgcgucugagauucaucccaaagcccgacgggcugaggccuaucgucaacauggauuacgucgugggcgcucgcaccuuucgccgugaaaagcgggccgaacgcuugaccucacgggugaaggcccucuucuccgugcugaacuacgagagagcaagacggccuggccugcugggagcuucggugcugggacuggacgauauccaccgggcuuggcggaccuuuguucuccgggugagagcccaagacccuccgccggaacuguacuucgugaagguggcgaucaccggagccuaugauacuauuccgcaagaucgacucaccgaagucaucgccucgaucaucaaaccgcagaacacuuacugcgucaggcgguacgccgugguccagaaggccgcgcauggccacgugagaaaggcguucaagucgcacguguccacucucaccgaccuccagccuuacaugaggcaauucguugcgcauuugcaagagacuucgccccugagagaugcgguggucaucgagcagagcuccagccugaacgaagcgagcagcggucuguuugacguguuccuccgcuucaugugucaucacgcggugcgaaucaggggaaaaucauacgugcagugccagggaaucccacaaggcagcauucugucgacucucuuguguucccuuugcuacggcgauauggaaaacaagcuguucgcugggaucagacgggacggguugcugcucagacugguggacgacuuccugcuggugacuccgcaccucacucacgccaaaaccuuucuccgcacucuggugaggggagugccagaauacggcuguguggucaaucuccggaaaacuguggugaauuucccugucgaggaugaggcacucggaggaaccgcauuuguccaaaugccagcacauggccuguucccauggugcggucugcugcuggacacccgaacucuugaagugcaguccgacuacuccagcuaugcccggacgagcauccgcgccagccucacuuucaaucgcggcuuuaaggccggacgaaacaugcgcagaaagcuuuucggaguccuccggcuuaaaugccauucgcucuuucucgaucuccaagucaauucgcugcagaccgugugcacgaacaucuacaagauccugcugcuccaagccuaccgguuccacgcuugcgugcuucagcugccguuucaccaacagguguggaagaacccgaccuucuuucugcgggucauuagcgauacugccucccuguguuacucaauccucaaggcaaagaacgccggaaugucgcugggugcgaaaggagccgcgggaccucuuccuagcgaagcggugcaguggcucugccaccaggcuuuccuccugaagcugaccaggcacagagugaccuacgucccgcugcugggcucgcugcgcacugcacagacccagcugucuagaaaacuccccggcaccacccugaccgcucuggaagccgccgccaacccagcauugccgucagauuucaagaccaucuuggacggauccggcacaauccugucugagggcgccaccaacuucagccugcugaaacuggccggcgacguggaacugaacccuggcccuaccccuggaacccagagccccuucuuccuucugcugcugcugaccgugcugacugucgugacaggcucuggccacgccagcucuacaccuggcggcgagaaagagacaagcgccacccagagaagcagcgugccaagcagcaccgagaagaacgccguguccaugaccagcuccgugcugagcagccacucuccuggcagcggcagcagcacaacacagggccaggaugugacacuggccccugccacagaaccugccucuggaucugccgccaccuggggacaggacgugacaagcgugccagugaccagaccugcccugggcucuacaacacccccugcccacgaugugaccagcgccccugauaacaagccugccccuggaagcacagccccuccagcucauggcgugaccucugccccagauaccagaccagccccaggaucuacagccccacccgcacacggcgugacaagugccccugacacaagacccgcuccaggcucuacugcuccuccugcccauggcgugacaagcgcucccgauacaaggccagcuccuggcuccacagcaccaccagcacauggcgugacaucagcucccgacacuagaccugcucccggaucaaccgcuccaccagcucacggcgugaccagcgcaccugauaccagaccugcucugggaagcaccgccccucccgugcacaaugugacaucugcuuccggcagcgccagcggcucugccucuacacuggugcacaacggcaccagcgccagagccacaacaaccccagccagcaagagcacccccuucagcaucccuagccaccacagcgacaccccuaccacacuggccagccacuccaccaagaccgaugccucuagcacccaccacuccagcgugcccccucugaccagcagcaaccacagcacaagcccccagcugucuaccggcgucucauucuucuuucuguccuuccacaucagcaaccugcaguucaacagcagccuggaagaucccagcaccgacuacuaccaggaacugcagcgggauaucagcgagauguuccugcaaaucuacaagcagggcggcuuccugggccugagcaacaucaaguucagacccggcagcgugguggugcagcugacccuggcuuuccgggaaggcaccaucaacgugcacgacguggaaacccaguucaaccaguacaagaccgaggccgccagccgguacaaccugaccaucuccgauguguccguguccgacgugcccuucccauucucugcccagucuggcgcaggcgugccaggauggggaauugcucugcuggugcucgugugcgugcugguggcccuggccaucguguaucugauugcccuggccgugugccagugccggcggaagaauuacggccagcuggacaucuuccccgccagagacaccuaccaccccaugagcgaguaccccacauaccacacccacggcagauacgugccacccagcuccaccgacagaucccccuacgagaaagugucugccggcaacggcggcagcucccugagcuacacaaauccugccguggccgcugccuccgccaaccug
序列表
<110> 辉瑞公司
<120> 免疫原性组合物
<130> PC72354A
<150> 62/682,044
<151> 2018-06-07
<150> 62/531,227
<151> 2017-07-11
<160> 93
<170> PatentIn version 3.5
<210> 1
<211> 1255
<212> PRT
<213> Homo sapiens
<400> 1
Met Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr
1 5 10 15
Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly
20 25 30
Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser Ser
35 40 45
Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His
50 55 60
Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr Leu
65 70 75 80
Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln
85 90 95
Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr
100 105 110
Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro
115 120 125
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
130 135 140
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
145 150 155 160
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
165 170 175
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
180 185 190
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
195 200 205
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
210 215 220
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
225 230 235 240
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
245 250 255
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
260 265 270
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
275 280 285
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
290 295 300
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
305 310 315 320
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
325 330 335
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
340 345 350
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
355 360 365
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
370 375 380
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
385 390 395 400
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
405 410 415
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
420 425 430
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
435 440 445
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
450 455 460
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
465 470 475 480
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
485 490 495
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
500 505 510
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
515 520 525
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
530 535 540
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
545 550 555 560
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
565 570 575
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
580 585 590
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
595 600 605
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
610 615 620
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
625 630 635 640
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
645 650 655
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
660 665 670
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
675 680 685
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
690 695 700
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
705 710 715 720
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
725 730 735
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
740 745 750
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
755 760 765
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
770 775 780
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
785 790 795 800
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
805 810 815
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
820 825 830
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
835 840 845
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
850 855 860
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
865 870 875 880
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
885 890 895
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
900 905 910
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
915 920 925
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Asn
930 935 940
Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr Ser
945 950 955 960
Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His Asn Gly
965 970 975
Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro Phe
980 985 990
Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala Ser His
995 1000 1005
Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val Pro
1010 1015 1020
Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
1025 1030 1035
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln
1040 1045 1050
Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu
1055 1060 1065
Leu Gln Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln
1070 1075 1080
Gly Gly Phe Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser
1085 1090 1095
Val Val Val Gln Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn
1100 1105 1110
Val His Asp Val Glu Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala
1115 1120 1125
Ala Ser Arg Tyr Asn Leu Thr Ile Ser Asp Val Ser Val Ser Asp
1130 1135 1140
Val Pro Phe Pro Phe Ser Ala Gln Ser Gly Ala Gly Val Pro Gly
1145 1150 1155
Trp Gly Ile Ala Leu Leu Val Leu Val Cys Val Leu Val Ala Leu
1160 1165 1170
Ala Ile Val Tyr Leu Ile Ala Leu Ala Val Cys Gln Cys Arg Arg
1175 1180 1185
Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro Ala Arg Asp Thr Tyr
1190 1195 1200
His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr His Gly Arg Tyr
1205 1210 1215
Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu Lys Val Ser
1220 1225 1230
Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro Ala Val
1235 1240 1245
Ala Ala Thr Ser Ala Asn Leu
1250 1255
<210> 2
<211> 702
<212> PRT
<213> Homo sapiens
<400> 2
Met Glu Ser Pro Ser Ala Pro Pro His Arg Trp Cys Ile Pro Trp Gln
1 5 10 15
Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro Pro Thr
20 25 30
Thr Ala Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala Glu Gly
35 40 45
Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His Leu Phe Gly
50 55 60
Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gln Ile Ile
65 70 75 80
Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala Tyr Ser
85 90 95
Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln Asn Ile
100 105 110
Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile Lys Ser Asp
115 120 125
Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr Pro Glu Leu
130 135 140
Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro Val Glu Asp Lys
145 150 155 160
Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Thr Gln Asp Ala Thr Tyr
165 170 175
Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg Leu Gln
180 185 190
Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn Val Thr Arg Asn
195 200 205
Asp Thr Ala Ser Tyr Lys Cys Glu Thr Gln Asn Pro Val Ser Ala Arg
210 215 220
Arg Ser Asp Ser Val Ile Leu Asn Val Leu Tyr Gly Pro Asp Ala Pro
225 230 235 240
Thr Ile Ser Pro Leu Asn Thr Ser Tyr Arg Ser Gly Glu Asn Leu Asn
245 250 255
Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser Trp Phe
260 265 270
Val Asn Gly Thr Phe Gln Gln Ser Thr Gln Glu Leu Phe Ile Pro Asn
275 280 285
Ile Thr Val Asn Asn Ser Gly Ser Tyr Thr Cys Gln Ala His Asn Ser
290 295 300
Asp Thr Gly Leu Asn Arg Thr Thr Val Thr Thr Ile Thr Val Tyr Ala
305 310 315 320
Glu Pro Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro Val Glu
325 330 335
Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln Asn Thr
340 345 350
Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg
355 360 365
Leu Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr
370 375 380
Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Lys Leu Ser
385 390 395 400
Val Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro Asp
405 410 415
Asp Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn
420 425 430
Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser
435 440 445
Trp Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe Ile
450 455 460
Ser Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala Asn
465 470 475 480
Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile Thr Val
485 490 495
Ser Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro
500 505 510
Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gln
515 520 525
Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val Ser
530 535 540
Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn
545 550 555 560
Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln Asn Ser
565 570 575
Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly
580 585 590
Pro Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly
595 600 605
Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gln
610 615 620
Tyr Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr Gln Val Leu
625 630 635 640
Phe Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe
645 650 655
Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys Ser Ile
660 665 670
Thr Val Ser Ala Ser Gly Thr Ser Pro Gly Leu Ser Ala Gly Ala Thr
675 680 685
Val Gly Ile Met Ile Gly Val Leu Val Gly Val Ala Leu Ile
690 695 700
<210> 3
<211> 1132
<212> PRT
<213> Homo sapiens
<400> 3
Met Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser
1 5 10 15
His Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly
20 25 30
Pro Gln Gly Trp Arg Leu Val Gln Arg Gly Asp Pro Ala Ala Phe Arg
35 40 45
Ala Leu Val Ala Gln Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro
50 55 60
Pro Pro Ala Ala Pro Ser Phe Arg Gln Val Ser Cys Leu Lys Glu Leu
65 70 75 80
Val Ala Arg Val Leu Gln Arg Leu Cys Glu Arg Gly Ala Lys Asn Val
85 90 95
Leu Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro
100 105 110
Glu Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro Asn Thr Val Thr
115 120 125
Asp Ala Leu Arg Gly Ser Gly Ala Trp Gly Leu Leu Leu Arg Arg Val
130 135 140
Gly Asp Asp Val Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val
145 150 155 160
Leu Val Ala Pro Ser Cys Ala Tyr Gln Val Cys Gly Pro Pro Leu Tyr
165 170 175
Gln Leu Gly Ala Ala Thr Gln Ala Arg Pro Pro Pro His Ala Ser Gly
180 185 190
Pro Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn His Ser Val Arg
195 200 205
Glu Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg
210 215 220
Gly Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro Arg Arg
225 230 235 240
Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln Gly Ser Trp
245 250 255
Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val
260 265 270
Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala
275 280 285
Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gln His His
290 295 300
Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro
305 310 315 320
Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly
325 330 335
Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro
340 345 350
Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe Leu Gly Ser
355 360 365
Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gln
370 375 380
Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His
385 390 395 400
Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg
405 410 415
Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gln
420 425 430
Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu
435 440 445
Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val Tyr Gly Phe
450 455 460
Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser
465 470 475 480
Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe Ile Ser
485 490 495
Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr Trp Lys Met
500 505 510
Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys
515 520 525
Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu Ala Lys Phe
530 535 540
Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe
545 550 555 560
Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu Phe Phe Tyr
565 570 575
Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile Arg Gln His
580 585 590
Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gln
595 600 605
His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe Ile
610 615 620
Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp Tyr Val Val
625 630 635 640
Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser
645 650 655
Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg
660 665 670
Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp Ile His Arg
675 680 685
Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp Pro Pro Pro
690 695 700
Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr Ile
705 710 715 720
Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile Lys Pro Gln
725 730 735
Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys Ala Ala His
740 745 750
Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp
755 760 765
Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln Glu Thr Ser
770 775 780
Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser Leu Asn Glu
785 790 795 800
Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His
805 810 815
Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln Gly Ile Pro
820 825 830
Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp
835 840 845
Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu Leu
850 855 860
Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala
865 870 875 880
Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys
885 890 895
Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu
900 905 910
Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His Gly Leu Phe
915 920 925
Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gln Ser
930 935 940
Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe
945 950 955 960
Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly
965 970 975
Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val Asn
980 985 990
Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln
995 1000 1005
Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe His Gln
1010 1015 1020
Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser Asp
1025 1030 1035
Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly
1040 1045 1050
Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu
1055 1060 1065
Ala Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr
1070 1075 1080
Arg His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr
1085 1090 1095
Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr
1100 1105 1110
Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys
1115 1120 1125
Thr Ile Leu Asp
1130
<210> 4
<211> 1611
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 4
atggctagca cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 60
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 180
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 240
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 300
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 360
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 420
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 480
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 540
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 600
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 660
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 720
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 840
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 900
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 960
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 1020
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1080
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1140
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1200
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1260
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1320
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1380
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1440
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1500
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1560
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct g 1611
<210> 5
<211> 537
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 5
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1 5 10 15
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
20 25 30
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
35 40 45
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
50 55 60
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
65 70 75 80
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
85 90 95
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
100 105 110
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
210 215 220
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
225 230 235 240
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
245 250 255
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
260 265 270
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
275 280 285
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
290 295 300
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
305 310 315 320
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
325 330 335
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
340 345 350
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
355 360 365
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
370 375 380
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
385 390 395 400
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
405 410 415
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
420 425 430
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
435 440 445
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
450 455 460
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro
465 470 475 480
Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
485 490 495
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
500 505 510
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
515 520 525
Ala Val Ala Ala Ala Ser Ala Asn Leu
530 535
<210> 6
<211> 1551
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 6
atggctagca caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 60
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 120
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 180
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 240
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 300
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 360
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 420
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 480
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 540
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 600
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 660
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 720
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 780
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 840
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 900
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 960
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1020
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1080
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1140
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1200
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1260
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1320
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1380
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1440
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1500
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct g 1551
<210> 7
<211> 517
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 7
Met Ala Ser Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly Gly Glu
1 5 10 15
Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser Ser Thr Glu
20 25 30
Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His Ser Pro
35 40 45
Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr Leu Ala Pro
50 55 60
Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln Asp Val
65 70 75 80
Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro Pro
85 90 95
Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser
100 105 110
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr Ser Ala Ser
210 215 220
Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His Asn Gly Thr Ser
225 230 235 240
Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro Phe Ser Ile
245 250 255
Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala Ser His Ser Thr
260 265 270
Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val Pro Pro Leu Thr
275 280 285
Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr Gly Val Ser Phe
290 295 300
Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe Asn Ser Ser Leu
305 310 315 320
Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser
325 330 335
Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser
340 345 350
Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu Ala
355 360 365
Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe Asn
370 375 380
Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser Asp
385 390 395 400
Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser Gly Ala
405 410 415
Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu Val Cys Val Leu
420 425 430
Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala Val Cys Gln Cys
435 440 445
Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro Ala Arg Asp Thr
450 455 460
Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr His Gly Arg Tyr
465 470 475 480
Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu Lys Val Ser Ala
485 490 495
Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro Ala Val Ala Ala
500 505 510
Ala Ser Ala Asn Leu
515
<210> 8
<211> 2679
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 8
atgggagctg ccccggagcc ggagaggacc cccgttggcc agggatcgtg ggcccatccg 60
ggacgcacca ggggaccatc cgacagggga ttctgtgtgg tgtcaccggc caggccagca 120
gaagaggcaa ccagcctcga gggagcgttg tctggaacca gacattccca cccgtcggtg 180
ggccggcagc accacgcggg accaccgtcc acttccagac cgccacggcc atgggacacc 240
ccttgcccgc ctgtgtatgc cgagactaaa cacttcctgt actcatccgg agacaaggaa 300
cagcttcggc cgtccttcct cctgtcgtcg ctcagaccga gcctgaccgg agcacgcaga 360
ttggtggaaa ctatcttcct tgggtcacgt ccgtggatgc caggtacccc acggcgcctc 420
ccgcgcctcc cacagagata ctggcagatg cggcctctgt tcctggaatt gctgggaaac 480
cacgctcagt gcccgtacgg agtcctgctc aagactcact gccctctgag ggcggcggtc 540
actccggcgg ccggagtgtg cgcacgggag aagccccagg gaagcgtggc agctccggaa 600
gaggaggaca ccgatccgcg ccgcctcgtg caacttctgc gccagcactc ctcgccctgg 660
caagtctacg ggttcgtccg cgcctgcctg cgccgcctgg tgccgcctgg gctctggggt 720
tcccggcata acgagcgccg cttcctgaga aatactaaga agtttatctc acttggaaaa 780
catgccaagt tgtcgctgca agaactcacg tggaagatgt cagtccgcga ttgcgcctgg 840
ctgcgccgct cgccgggcgt cgggtgtgtt ccagctgcag aacaccgcct gagagaagaa 900
attctggcca aatttctgca ttggctgatg tcagtgtacg tggtcgagct gctgcgctcc 960
tttttctacg tcactgagac tacctttcaa aagaaccgcc tgttcttcta ccgcaaatct 1020
gtgtggagca agctgcagtc aatcggcatt cgccagcatc tgaagagggt gcagctgcgg 1080
gaactttccg aggcagaagt ccgccagcac cgggaggccc ggccggcgct tctcacgtcg 1140
cgtctgagat tcatcccaaa gcccgacggg ctgaggccta tcgtcaacat ggattacgtc 1200
gtgggcgctc gcacctttcg ccgtgaaaag cgggccgaac gcttgacctc acgggtgaag 1260
gccctcttct ccgtgctgaa ctacgagaga gcaagacggc ctggcctgct gggagcttcg 1320
gtgctgggac tggacgatat ccaccgggct tggcggacct ttgttctccg ggtgagagcc 1380
caagaccctc cgccggaact gtacttcgtg aaggtggcga tcaccggagc ctatgatact 1440
attccgcaag atcgactcac cgaagtcatc gcctcgatca tcaaaccgca gaacacttac 1500
tgcgtcaggc ggtacgccgt ggtccagaag gccgcgcatg gccacgtgag aaaggcgttc 1560
aagtcgcacg tgtccactct caccgacctc cagccttaca tgaggcaatt cgttgcgcat 1620
ttgcaagaga cttcgcccct gagagatgcg gtggtcatcg agcagagctc cagcctgaac 1680
gaagcgagca gcggtctgtt tgacgtgttc ctccgcttca tgtgtcatca cgcggtgcga 1740
atcaggggaa aatcatacgt gcagtgccag ggaatcccac aaggcagcat tctgtcgact 1800
ctcttgtgtt ccctttgcta cggcgatatg gaaaacaagc tgttcgctgg gatcagacgg 1860
gacgggttgc tgctcagact ggtggacgac ttcctgctgg tgactccgca cctcactcac 1920
gccaaaacct ttctccgcac tctggtgagg ggagtgccag aatacggctg tgtggtcaat 1980
ctccggaaaa ctgtggtgaa tttccctgtc gaggatgagg cactcggagg aaccgcattt 2040
gtccaaatgc cagcacatgg cctgttccca tggtgcggtc tgctgctgga cacccgaact 2100
cttgaagtgc agtccgacta ctccagctat gcccggacga gcatccgcgc cagcctcact 2160
ttcaatcgcg gctttaaggc cggacgaaac atgcgcagaa agcttttcgg agtcctccgg 2220
cttaaatgcc attcgctctt tctcgatctc caagtcaatt cgctgcagac cgtgtgcacg 2280
aacatctaca agatcctgct gctccaagcc taccggttcc acgcttgcgt gcttcagctg 2340
ccgtttcacc aacaggtgtg gaagaacccg accttctttc tgcgggtcat tagcgatact 2400
gcctccctgt gttactcaat cctcaaggca aagaacgccg gaatgtcgct gggtgcgaaa 2460
ggagccgcgg gacctcttcc tagcgaagcg gtgcagtggc tctgccacca ggctttcctc 2520
ctgaagctga ccaggcacag agtgacctac gtcccgctgc tgggctcgct gcgcactgca 2580
cagacccagc tgtctagaaa actccccggc accaccctga ccgctctgga agccgccgcc 2640
aacccagcat tgccgtcaga tttcaagacc atcttggac 2679
<210> 9
<211> 893
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 9
Met Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln Gly Ser
1 5 10 15
Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys
20 25 30
Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly
35 40 45
Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gln His
50 55 60
His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr
65 70 75 80
Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser
85 90 95
Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg
100 105 110
Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe Leu Gly
115 120 125
Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro
130 135 140
Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn
145 150 155 160
His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu
165 170 175
Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro
180 185 190
Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg
195 200 205
Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val Tyr Gly
210 215 220
Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly
225 230 235 240
Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe Ile
245 250 255
Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr Trp Lys
260 265 270
Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly
275 280 285
Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu Ala Lys
290 295 300
Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser
305 310 315 320
Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu Phe Phe
325 330 335
Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile Arg Gln
340 345 350
His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu Val Arg
355 360 365
Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe
370 375 380
Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp Tyr Val
385 390 395 400
Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr
405 410 415
Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg
420 425 430
Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp Ile His
435 440 445
Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp Pro Pro
450 455 460
Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr Asp Thr
465 470 475 480
Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile Lys Pro
485 490 495
Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys Ala Ala
500 505 510
His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr
515 520 525
Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln Glu Thr
530 535 540
Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser Leu Asn
545 550 555 560
Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His
565 570 575
His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln Gly Ile
580 585 590
Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly
595 600 605
Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu
610 615 620
Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His
625 630 635 640
Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly
645 650 655
Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp
660 665 670
Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His Gly Leu
675 680 685
Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gln
690 695 700
Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr
705 710 715 720
Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe
725 730 735
Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val
740 745 750
Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu Leu
755 760 765
Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe His Gln
770 775 780
Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser Asp Thr
785 790 795 800
Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly Met Ser
805 810 815
Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gln
820 825 830
Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val
835 840 845
Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr Gln Leu
850 855 860
Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala
865 870 875 880
Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp
885 890
<210> 10
<211> 2373
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 10
atggctagct tcctcctgtc gtcgctcaga ccgagcctga ccggagcacg cagattggtg 60
gaaactatct tccttgggtc acgtccgtgg atgccaggta ccccacggcg cctcccgcgc 120
ctcccacaga gatactggca gatgcggcct ctgttcctgg aattgctggg aaaccacgct 180
cagtgcccgt acggagtcct gctcaagact cactgccctc tgagggcggc ggtcactccg 240
gcggccggag tgtgcgcacg ggagaagccc cagggaagcg tggcagctcc ggaagaggag 300
gacaccgatc cgcgccgcct cgtgcaactt ctgcgccagc actcctcgcc ctggcaagtc 360
tacgggttcg tccgcgcctg cctgcgccgc ctggtgccgc ctgggctctg gggttcccgg 420
cataacgagc gccgcttcct gagaaatact aagaagttta tctcacttgg aaaacatgcc 480
aagttgtcgc tgcaagaact cacgtggaag atgtcagtcc gcgattgcgc ctggctgcgc 540
cgctcgccgg gcgtcgggtg tgttccagct gcagaacacc gcctgagaga agaaattctg 600
gccaaatttc tgcattggct gatgtcagtg tacgtggtcg agctgctgcg ctcctttttc 660
tacgtcactg agactacctt tcaaaagaac cgcctgttct tctaccgcaa atctgtgtgg 720
agcaagctgc agtcaatcgg cattcgccag catctgaaga gggtgcagct gcgggaactt 780
tccgaggcag aagtccgcca gcaccgggag gcccggccgg cgcttctcac gtcgcgtctg 840
agattcatcc caaagcccga cgggctgagg cctatcgtca acatggatta cgtcgtgggc 900
gctcgcacct ttcgccgtga aaagcgggcc gaacgcttga cctcacgggt gaaggccctc 960
ttctccgtgc tgaactacga gagagcaaga cggcctggcc tgctgggagc ttcggtgctg 1020
ggactggacg atatccaccg ggcttggcgg acctttgttc tccgggtgag agcccaagac 1080
cctccgccgg aactgtactt cgtgaaggtg gcgatcaccg gagcctatga tactattccg 1140
caagatcgac tcaccgaagt catcgcctcg atcatcaaac cgcagaacac ttactgcgtc 1200
aggcggtacg ccgtggtcca gaaggccgcg catggccacg tgagaaaggc gttcaagtcg 1260
cacgtgtcca ctctcaccga cctccagcct tacatgaggc aattcgttgc gcatttgcaa 1320
gagacttcgc ccctgagaga tgcggtggtc atcgagcaga gctccagcct gaacgaagcg 1380
agcagcggtc tgtttgacgt gttcctccgc ttcatgtgtc atcacgcggt gcgaatcagg 1440
ggaaaatcat acgtgcagtg ccagggaatc ccacaaggca gcattctgtc gactctcttg 1500
tgttcccttt gctacggcga tatggaaaac aagctgttcg ctgggatcag acgggacggg 1560
ttgctgctca gactggtgga cgacttcctg ctggtgactc cgcacctcac tcacgccaaa 1620
acctttctcc gcactctggt gaggggagtg ccagaatacg gctgtgtggt caatctccgg 1680
aaaactgtgg tgaatttccc tgtcgaggat gaggcactcg gaggaaccgc atttgtccaa 1740
atgccagcac atggcctgtt cccatggtgc ggtctgctgc tggacacccg aactcttgaa 1800
gtgcagtccg actactccag ctatgcccgg acgagcatcc gcgccagcct cactttcaat 1860
cgcggcttta aggccggacg aaacatgcgc agaaagcttt tcggagtcct ccggcttaaa 1920
tgccattcgc tctttctcga tctccaagtc aattcgctgc agaccgtgtg cacgaacatc 1980
tacaagatcc tgctgctcca agcctaccgg ttccacgctt gcgtgcttca gctgccgttt 2040
caccaacagg tgtggaagaa cccgaccttc tttctgcggg tcattagcga tactgcctcc 2100
ctgtgttact caatcctcaa ggcaaagaac gccggaatgt cgctgggtgc gaaaggagcc 2160
gcgggacctc ttcctagcga agcggtgcag tggctctgcc accaggcttt cctcctgaag 2220
ctgaccaggc acagagtgac ctacgtcccg ctgctgggct cgctgcgcac tgcacagacc 2280
cagctgtcta gaaaactccc cggcaccacc ctgaccgctc tggaagccgc cgccaaccca 2340
gcattgccgt cagatttcaa gaccatcttg gac 2373
<210> 11
<211> 791
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 11
Met Ala Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala
1 5 10 15
Arg Arg Leu Val Glu Thr Ile Phe Leu Gly Ser Arg Pro Trp Met Pro
20 25 30
Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met
35 40 45
Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His Ala Gln Cys Pro Tyr
50 55 60
Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg Ala Ala Val Thr Pro
65 70 75 80
Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gln Gly Ser Val Ala Ala
85 90 95
Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu Val Gln Leu Leu Arg
100 105 110
Gln His Ser Ser Pro Trp Gln Val Tyr Gly Phe Val Arg Ala Cys Leu
115 120 125
Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg His Asn Glu Arg
130 135 140
Arg Phe Leu Arg Asn Thr Lys Lys Phe Ile Ser Leu Gly Lys His Ala
145 150 155 160
Lys Leu Ser Leu Gln Glu Leu Thr Trp Lys Met Ser Val Arg Asp Cys
165 170 175
Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys Val Pro Ala Ala Glu
180 185 190
His Arg Leu Arg Glu Glu Ile Leu Ala Lys Phe Leu His Trp Leu Met
195 200 205
Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu
210 215 220
Thr Thr Phe Gln Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp
225 230 235 240
Ser Lys Leu Gln Ser Ile Gly Ile Arg Gln His Leu Lys Arg Val Gln
245 250 255
Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gln His Arg Glu Ala Arg
260 265 270
Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly
275 280 285
Leu Arg Pro Ile Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe
290 295 300
Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu
305 310 315 320
Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly
325 330 335
Ala Ser Val Leu Gly Leu Asp Asp Ile His Arg Ala Trp Arg Thr Phe
340 345 350
Val Leu Arg Val Arg Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val
355 360 365
Lys Val Ala Ile Thr Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg Leu
370 375 380
Thr Glu Val Ile Ala Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys Val
385 390 395 400
Arg Arg Tyr Ala Val Val Gln Lys Ala Ala His Gly His Val Arg Lys
405 410 415
Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met
420 425 430
Arg Gln Phe Val Ala His Leu Gln Glu Thr Ser Pro Leu Arg Asp Ala
435 440 445
Val Val Ile Glu Gln Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly Leu
450 455 460
Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala Val Arg Ile Arg
465 470 475 480
Gly Lys Ser Tyr Val Gln Cys Gln Gly Ile Pro Gln Gly Ser Ile Leu
485 490 495
Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu
500 505 510
Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp
515 520 525
Phe Leu Leu Val Thr Pro His Leu Thr His Ala Lys Thr Phe Leu Arg
530 535 540
Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg
545 550 555 560
Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr
565 570 575
Ala Phe Val Gln Met Pro Ala His Gly Leu Phe Pro Trp Cys Gly Leu
580 585 590
Leu Leu Asp Thr Arg Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr
595 600 605
Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys
610 615 620
Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val Leu Arg Leu Lys
625 630 635 640
Cys His Ser Leu Phe Leu Asp Leu Gln Val Asn Ser Leu Gln Thr Val
645 650 655
Cys Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln Ala Tyr Arg Phe His
660 665 670
Ala Cys Val Leu Gln Leu Pro Phe His Gln Gln Val Trp Lys Asn Pro
675 680 685
Thr Phe Phe Leu Arg Val Ile Ser Asp Thr Ala Ser Leu Cys Tyr Ser
690 695 700
Ile Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly Ala Lys Gly Ala
705 710 715 720
Ala Gly Pro Leu Pro Ser Glu Ala Val Gln Trp Leu Cys His Gln Ala
725 730 735
Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr Val Pro Leu Leu
740 745 750
Gly Ser Leu Arg Thr Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly
755 760 765
Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser
770 775 780
Asp Phe Lys Thr Ile Leu Asp
785 790
<210> 12
<211> 1782
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 12
atggctagcg ccaaatttct gcattggctg atgtcagtgt acgtggtcga gctgctgcgc 60
tcctttttct acgtcactga gactaccttt caaaagaacc gcctgttctt ctaccgcaaa 120
tctgtgtgga gcaagctgca gtcaatcggc attcgccagc atctgaagag ggtgcagctg 180
cgggaacttt ccgaggcaga agtccgccag caccgggagg cccggccggc gcttctcacg 240
tcgcgtctga gattcatccc aaagcccgac gggctgaggc ctatcgtcaa catggattac 300
gtcgtgggcg ctcgcacctt tcgccgtgaa aagcgggccg aacgcttgac ctcacgggtg 360
aaggccctct tctccgtgct gaactacgag agagcaagac ggcctggcct gctgggagct 420
tcggtgctgg gactggacga tatccaccgg gcttggcgga cctttgttct ccgggtgaga 480
gcccaagacc ctccgccgga actgtacttc gtgaaggtgg cgatcaccgg agcctatgat 540
actattccgc aagatcgact caccgaagtc atcgcctcga tcatcaaacc gcagaacact 600
tactgcgtca ggcggtacgc cgtggtccag aaggccgcgc atggccacgt gagaaaggcg 660
ttcaagtcgc acgtgtccac tctcaccgac ctccagcctt acatgaggca attcgttgcg 720
catttgcaag agacttcgcc cctgagagat gcggtggtca tcgagcagag ctccagcctg 780
aacgaagcga gcagcggtct gtttgacgtg ttcctccgct tcatgtgtca tcacgcggtg 840
cgaatcaggg gaaaatcata cgtgcagtgc cagggaatcc cacaaggcag cattctgtcg 900
actctcttgt gttccctttg ctacggcgat atggaaaaca agctgttcgc tgggatcaga 960
cgggacgggt tgctgctcag actggtggac gacttcctgc tggtgactcc gcacctcact 1020
cacgccaaaa cctttctccg cactctggtg aggggagtgc cagaatacgg ctgtgtggtc 1080
aatctccgga aaactgtggt gaatttccct gtcgaggatg aggcactcgg aggaaccgca 1140
tttgtccaaa tgccagcaca tggcctgttc ccatggtgcg gtctgctgct ggacacccga 1200
actcttgaag tgcagtccga ctactccagc tatgcccgga cgagcatccg cgccagcctc 1260
actttcaatc gcggctttaa ggccggacga aacatgcgca gaaagctttt cggagtcctc 1320
cggcttaaat gccattcgct ctttctcgat ctccaagtca attcgctgca gaccgtgtgc 1380
acgaacatct acaagatcct gctgctccaa gcctaccggt tccacgcttg cgtgcttcag 1440
ctgccgtttc accaacaggt gtggaagaac ccgaccttct ttctgcgggt cattagcgat 1500
actgcctccc tgtgttactc aatcctcaag gcaaagaacg ccggaatgtc gctgggtgcg 1560
aaaggagccg cgggacctct tcctagcgaa gcggtgcagt ggctctgcca ccaggctttc 1620
ctcctgaagc tgaccaggca cagagtgacc tacgtcccgc tgctgggctc gctgcgcact 1680
gcacagaccc agctgtctag aaaactcccc ggcaccaccc tgaccgctct ggaagccgcc 1740
gccaacccag cattgccgtc agatttcaag accatcttgg ac 1782
<210> 13
<211> 594
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 13
Met Ala Ser Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val
1 5 10 15
Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys
20 25 30
Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser
35 40 45
Ile Gly Ile Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser
50 55 60
Glu Ala Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr
65 70 75 80
Ser Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val
85 90 95
Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg
100 105 110
Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn
115 120 125
Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly
130 135 140
Leu Asp Asp Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg
145 150 155 160
Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr
165 170 175
Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala
180 185 190
Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val
195 200 205
Val Gln Lys Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His
210 215 220
Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala
225 230 235 240
His Leu Gln Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln
245 250 255
Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu
260 265 270
Arg Phe Met Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val
275 280 285
Gln Cys Gln Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys
290 295 300
Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg
305 310 315 320
Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr
325 330 335
Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly
340 345 350
Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn
355 360 365
Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met
370 375 380
Pro Ala His Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg
385 390 395 400
Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile
405 410 415
Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met
420 425 430
Arg Arg Lys Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe
435 440 445
Leu Asp Leu Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr
450 455 460
Lys Ile Leu Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln
465 470 475 480
Leu Pro Phe His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg
485 490 495
Val Ile Ser Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys
500 505 510
Asn Ala Gly Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro
515 520 525
Ser Glu Ala Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu
530 535 540
Thr Arg His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr
545 550 555 560
Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala
565 570 575
Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile
580 585 590
Leu Asp
<210> 14
<211> 2112
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 14
atggctagcg aatcgccaag cgcaccccct catcggtggt gcatcccttg gcaacgcctc 60
ctcctgaccg cctcactgct gactttctgg aacccgccga ccaccgcaaa gctgaccatt 120
gagagcactc ccttcaacgt ggctgagggg aaggaggtgc tgctcctggt gcacaatctg 180
ccccagcacc tgttcgggta ctcctggtac aagggagaac gcgtggacgg gaaccggcag 240
atcataggct acgtcatcgg aacccagcag gccacacccg gtccagcgta cagcggccgg 300
gagattatct acccgaacgc ctccctgctg atccaaaaca tcatccagaa cgacaccggt 360
ttctacactc tgcacgtgat taagtcagat ctggtcaacg aagaggccac cggccaattc 420
agggtgtacc ccgaactccc aaagccgtcc atttcaagca acaactccaa gccggtggag 480
gacaaagacg ccgtggcctt cacttgtgaa cctgaaaccc aggacgccac ttacctttgg 540
tgggtgaaca accagtcgct ccccgtgtcg ccgaggctgc agctcagcaa cggaaacaga 600
acgctgaccc tcttcaatgt gacccgcaat gataccgcct cctataagtg cgaaacccag 660
aatccggtgt ccgcccggcg ctcggatagc gtgattctga acgtgctcta cggccctgac 720
gcccccacta tctcccctct gaacacttcc taccggtccg gagagaacct gaacctgagc 780
tgccacgcgg cgtccaaccc gcccgcccag tacagctggt tcgtgaatgg gacgttccag 840
cagtccaccc aggagctgtt tatccctaac attaccgtca acaactctgg atcgtacaca 900
tgccaagcgc ataactcgga cactgggctt aacagaacca ccgtgacaac catcactgtg 960
tatgcggaac ctcctaagcc gttcatcacc tcgaacaaca gcaacccggt cgaggatgaa 1020
gatgcggtgg ccttgacgtg cgaacctgag atccagaaca ccacctactt gtggtgggtg 1080
aacaatcaga gcctgccagt ctccccacga ctccagctgt cgaacgacaa caggaccctg 1140
actttgctgt ccgtgactcg gaacgacgtg ggcccttatg aatgcggtat ccagaacaag 1200
ctgtccgtgg accacagcga ccctgtgatc ctgaacgtcc tttacgggcc ggacgacccc 1260
accatttccc cgtcgtacac ttactaccgg ccgggcgtga acctgtccct gtcgtgccac 1320
gctgcctcca atccgccggc ccagtactcc tggctcatcg acggaaacat ccagcagcac 1380
acccaagaac tgttcatctc caacattacc gagaaaaact cgggacttta cacctgtcaa 1440
gccaacaatt ccgccagcgg ccactcccgc accactgtca aaactatcac tgtgtccgcc 1500
gaactcccga agcccagcat cagctccaac aactcgaagc ccgtggagga taaggacgct 1560
gtcgcgttca cctgtgaacc agaggcacag aataccacct acctttggtg ggtcaacgga 1620
cagtccctgc ctgtctcacc gagactgcag ctgtcaaacg ggaataggac tctgaccttg 1680
tttaacgtca cccggaacga cgcccgggcc tacgtgtgcg gcatccagaa ctccgtgagc 1740
gcaaaccggt ctgacccagt gaccctggat gtgctgtacg gccccgacac tccgatcatt 1800
tcaccccccg attcatccta cctgtccggc gctaacctca acctctcatg ccactccgca 1860
tccaacccca gcccgcaata ttcgtggcgc attaacggaa ttcctcagca acatacccag 1920
gtcctgttca ttgcgaagat cacccctaac aacaacggaa cctacgcctg ctttgtgtca 1980
aacctggcca ctggtagaaa caactccatc gtgaagtcca ttaccgtgtc ggcgtccgga 2040
acttccccgg gcctgagcgc cggcgccacc gtgggaatta tgatcggcgt gctcgtggga 2100
gtggccctga tc 2112
<210> 15
<211> 704
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 15
Met Ala Ser Glu Ser Pro Ser Ala Pro Pro His Arg Trp Cys Ile Pro
1 5 10 15
Trp Gln Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro
20 25 30
Pro Thr Thr Ala Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala
35 40 45
Glu Gly Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His Leu
50 55 60
Phe Gly Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gln
65 70 75 80
Ile Ile Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala
85 90 95
Tyr Ser Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln
100 105 110
Asn Ile Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile Lys
115 120 125
Ser Asp Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr Pro
130 135 140
Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro Val Glu
145 150 155 160
Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Thr Gln Asp Ala
165 170 175
Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg
180 185 190
Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn Val Thr
195 200 205
Arg Asn Asp Thr Ala Ser Tyr Lys Cys Glu Thr Gln Asn Pro Val Ser
210 215 220
Ala Arg Arg Ser Asp Ser Val Ile Leu Asn Val Leu Tyr Gly Pro Asp
225 230 235 240
Ala Pro Thr Ile Ser Pro Leu Asn Thr Ser Tyr Arg Ser Gly Glu Asn
245 250 255
Leu Asn Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser
260 265 270
Trp Phe Val Asn Gly Thr Phe Gln Gln Ser Thr Gln Glu Leu Phe Ile
275 280 285
Pro Asn Ile Thr Val Asn Asn Ser Gly Ser Tyr Thr Cys Gln Ala His
290 295 300
Asn Ser Asp Thr Gly Leu Asn Arg Thr Thr Val Thr Thr Ile Thr Val
305 310 315 320
Tyr Ala Glu Pro Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro
325 330 335
Val Glu Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln
340 345 350
Asn Thr Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser
355 360 365
Pro Arg Leu Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser
370 375 380
Val Thr Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Lys
385 390 395 400
Leu Ser Val Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly
405 410 415
Pro Asp Asp Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly
420 425 430
Val Asn Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln
435 440 445
Tyr Ser Trp Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu
450 455 460
Phe Ile Ser Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln
465 470 475 480
Ala Asn Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile
485 490 495
Thr Val Ser Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser
500 505 510
Lys Pro Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu
515 520 525
Ala Gln Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro
530 535 540
Val Ser Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu
545 550 555 560
Phe Asn Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln
565 570 575
Asn Ser Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu
580 585 590
Tyr Gly Pro Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu
595 600 605
Ser Gly Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser
610 615 620
Pro Gln Tyr Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr Gln
625 630 635 640
Val Leu Phe Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala
645 650 655
Cys Phe Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys
660 665 670
Ser Ile Thr Val Ser Ala Ser Gly Thr Ser Pro Gly Leu Ser Ala Gly
675 680 685
Ala Thr Val Gly Ile Met Ile Gly Val Leu Val Gly Val Ala Leu Ile
690 695 700
<210> 16
<211> 1578
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 16
atggctagcg aatcgccaag cgcaccccct catcggtggt gcatcccttg gcaacgcctc 60
ctcctgaccg cctcactgct gactttctgg aacccgccga ccaccgcaaa gctgaccatt 120
gagagcactc ccttcaacgt ggctgagggg aaggaggtgc tgctcctggt gcacaatctg 180
ccccagcacc tgttcgggta ctcctggtac aagggagaac gcgtggacgg gaaccggcag 240
atcataggct acgtcatcgg aacccagcag gccacacccg gtccagcgta cagcggccgg 300
gagattatct acccgaacgc ctccctgctg atccaaaaca tcatccagaa cgacaccggt 360
ttctacactc tgcacgtgat taagtcagat ctggtcaacg aagaggccac cggccaattc 420
agggtgtacc ccgaactccc taagccgttc atcacctcga acaacagcaa cccggtcgag 480
gatgaagatg cggtggcctt gacgtgcgaa cctgagatcc agaacaccac ctacttgtgg 540
tgggtgaaca atcagagcct gccagtctcc ccacgactcc agctgtcgaa cgacaacagg 600
accctgactt tgctgtccgt gactcggaac gacgtgggcc cttatgaatg cggtatccag 660
aacaagctgt ccgtggacca cagcgaccct gtgatcctga acgtccttta cgggccggac 720
gaccccacca tttccccgtc gtacacttac taccggccgg gcgtgaacct gtccctgtcg 780
tgccacgctg cctccaatcc gccggcccag tactcctggc tcatcgacgg aaacatccag 840
cagcacaccc aagaactgtt catctccaac attaccgaga aaaactcggg actttacacc 900
tgtcaagcca acaattccgc cagcggccac tcccgcacca ctgtcaaaac tatcactgtg 960
tccgccgaac tcccgaagcc cagcatcagc tccaacaact cgaagcccgt ggaggataag 1020
gacgctgtcg cgttcacctg tgaaccagag gcacagaata ccacctacct ttggtgggtc 1080
aacggacagt ccctgcctgt ctcaccgaga ctgcagctgt caaacgggaa taggactctg 1140
accttgttta acgtcacccg gaacgacgcc cgggcctacg tgtgcggcat ccagaactcc 1200
gtgagcgcaa accggtctga cccagtgacc ctggatgtgc tgtacggccc cgacactccg 1260
atcatttcac cccccgattc atcctacctg tccggcgcta acctcaacct ctcatgccac 1320
tccgcatcca accccagccc gcaatattcg tggcgcatta acggaattcc tcagcaacat 1380
acccaggtcc tgttcattgc gaagatcacc cctaacaaca acggaaccta cgcctgcttt 1440
gtgtcaaacc tggccactgg tagaaacaac tccatcgtga agtccattac cgtgtcggcg 1500
tccggaactt ccccgggcct gagcgccggc gccaccgtgg gaattatgat cggcgtgctc 1560
gtgggagtgg ccctgatc 1578
<210> 17
<211> 526
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 17
Met Ala Ser Glu Ser Pro Ser Ala Pro Pro His Arg Trp Cys Ile Pro
1 5 10 15
Trp Gln Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro
20 25 30
Pro Thr Thr Ala Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala
35 40 45
Glu Gly Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His Leu
50 55 60
Phe Gly Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gln
65 70 75 80
Ile Ile Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala
85 90 95
Tyr Ser Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln
100 105 110
Asn Ile Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile Lys
115 120 125
Ser Asp Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr Pro
130 135 140
Glu Leu Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro Val Glu
145 150 155 160
Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln Asn Thr
165 170 175
Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg
180 185 190
Leu Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr
195 200 205
Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Lys Leu Ser
210 215 220
Val Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro Asp
225 230 235 240
Asp Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn
245 250 255
Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser
260 265 270
Trp Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe Ile
275 280 285
Ser Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala Asn
290 295 300
Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile Thr Val
305 310 315 320
Ser Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro
325 330 335
Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gln
340 345 350
Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val Ser
355 360 365
Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn
370 375 380
Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln Asn Ser
385 390 395 400
Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly
405 410 415
Pro Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly
420 425 430
Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gln
435 440 445
Tyr Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr Gln Val Leu
450 455 460
Phe Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe
465 470 475 480
Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys Ser Ile
485 490 495
Thr Val Ser Ala Ser Gly Thr Ser Pro Gly Leu Ser Ala Gly Ala Thr
500 505 510
Val Gly Ile Met Ile Gly Val Leu Val Gly Val Ala Leu Ile
515 520 525
<210> 18
<211> 1404
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 18
atggctagca agctgaccat tgagagcact cccttcaacg tggctgaggg gaaggaggtg 60
ctgctcctgg tgcacaatct gccccagcac ctgttcgggt actcctggta caagggagaa 120
cgcgtggacg ggaaccggca gatcataggc tacgtcatcg gaacccagca ggccacaccc 180
ggtccagcgt acagcggccg ggagattatc tacccgaacg cctccctgct gatccaaaac 240
atcatccaga acgacaccgg tttctacact ctgcacgtga ttaagtcaga tctggtcaac 300
gaagaggcca ccggccaatt cagggtgtac cccgaactcc ctaagccgtt catcacctcg 360
aacaacagca acccggtcga ggatgaagat gcggtggcct tgacgtgcga acctgagatc 420
cagaacacca cctacttgtg gtgggtgaac aatcagagcc tgccagtctc cccacgactc 480
cagctgtcga acgacaacag gaccctgact ttgctgtccg tgactcggaa cgacgtgggc 540
ccttatgaat gcggtatcca gaacaagctg tccgtggacc acagcgaccc tgtgatcctg 600
aacgtccttt acgggccgga cgaccccacc atttccccgt cgtacactta ctaccggccg 660
ggcgtgaacc tgtccctgtc gtgccacgct gcctccaatc cgccggccca gtactcctgg 720
ctcatcgacg gaaacatcca gcagcacacc caagaactgt tcatctccaa cattaccgag 780
aaaaactcgg gactttacac ctgtcaagcc aacaattccg ccagcggcca ctcccgcacc 840
actgtcaaaa ctatcactgt gtccgccgaa ctcccgaagc ccagcatcag ctccaacaac 900
tcgaagcccg tggaggataa ggacgctgtc gcgttcacct gtgaaccaga ggcacagaat 960
accacctacc tttggtgggt caacggacag tccctgcctg tctcaccgag actgcagctg 1020
tcaaacggga ataggactct gaccttgttt aacgtcaccc ggaacgacgc ccgggcctac 1080
gtgtgcggca tccagaactc cgtgagcgca aaccggtctg acccagtgac cctggatgtg 1140
ctgtacggcc ccgacactcc gatcatttca ccccccgatt catcctacct gtccggcgct 1200
aacctcaacc tctcatgcca ctccgcatcc aaccccagcc cgcaatattc gtggcgcatt 1260
aacggaattc ctcagcaaca tacccaggtc ctgttcattg cgaagatcac ccctaacaac 1320
aacggaacct acgcctgctt tgtgtcaaac ctggccactg gtagaaacaa ctccatcgtg 1380
aagtccatta ccgtgtcggc gtcc 1404
<210> 19
<211> 468
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 19
Met Ala Ser Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala Glu
1 5 10 15
Gly Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His Leu Phe
20 25 30
Gly Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gln Ile
35 40 45
Ile Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala Tyr
50 55 60
Ser Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln Asn
65 70 75 80
Ile Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile Lys Ser
85 90 95
Asp Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr Pro Glu
100 105 110
Leu Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro Val Glu Asp
115 120 125
Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln Asn Thr Thr
130 135 140
Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg Leu
145 150 155 160
Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr Arg
165 170 175
Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Lys Leu Ser Val
180 185 190
Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro Asp Asp
195 200 205
Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn Leu
210 215 220
Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser Trp
225 230 235 240
Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe Ile Ser
245 250 255
Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala Asn Asn
260 265 270
Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile Thr Val Ser
275 280 285
Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro Val
290 295 300
Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gln Asn
305 310 315 320
Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val Ser Pro
325 330 335
Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn Val
340 345 350
Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln Asn Ser Val
355 360 365
Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly Pro
370 375 380
Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly Ala
385 390 395 400
Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gln Tyr
405 410 415
Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr Gln Val Leu Phe
420 425 430
Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe Val
435 440 445
Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys Ser Ile Thr
450 455 460
Val Ser Ala Ser
465
<210> 20
<211> 4302
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 20
atggctagca cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 60
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 180
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 240
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 300
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 360
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 420
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 480
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 540
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 600
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 660
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 720
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 840
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 900
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 960
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 1020
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1080
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1140
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1200
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1260
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1320
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1380
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1440
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1500
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1560
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggaggctcc 1620
ggcggaggag ctgccccgga gccggagagg acccccgttg gccagggatc gtgggcccat 1680
ccgggacgca ccaggggacc atccgacagg ggattctgtg tggtgtcacc ggccaggcca 1740
gcagaagagg caaccagcct cgagggagcg ttgtctggaa ccagacattc ccacccgtcg 1800
gtgggccggc agcaccacgc gggaccaccg tccacttcca gaccgccacg gccatgggac 1860
accccttgcc cgcctgtgta tgccgagact aaacacttcc tgtactcatc cggagacaag 1920
gaacagcttc ggccgtcctt cctcctgtcg tcgctcagac cgagcctgac cggagcacgc 1980
agattggtgg aaactatctt ccttgggtca cgtccgtgga tgccaggtac cccacggcgc 2040
ctcccgcgcc tcccacagag atactggcag atgcggcctc tgttcctgga attgctggga 2100
aaccacgctc agtgcccgta cggagtcctg ctcaagactc actgccctct gagggcggcg 2160
gtcactccgg cggccggagt gtgcgcacgg gagaagcccc agggaagcgt ggcagctccg 2220
gaagaggagg acaccgatcc gcgccgcctc gtgcaacttc tgcgccagca ctcctcgccc 2280
tggcaagtct acgggttcgt ccgcgcctgc ctgcgccgcc tggtgccgcc tgggctctgg 2340
ggttcccggc ataacgagcg ccgcttcctg agaaatacta agaagtttat ctcacttgga 2400
aaacatgcca agttgtcgct gcaagaactc acgtggaaga tgtcagtccg cgattgcgcc 2460
tggctgcgcc gctcgccggg cgtcgggtgt gttccagctg cagaacaccg cctgagagaa 2520
gaaattctgg ccaaatttct gcattggctg atgtcagtgt acgtggtcga gctgctgcgc 2580
tcctttttct acgtcactga gactaccttt caaaagaacc gcctgttctt ctaccgcaaa 2640
tctgtgtgga gcaagctgca gtcaatcggc attcgccagc atctgaagag ggtgcagctg 2700
cgggaacttt ccgaggcaga agtccgccag caccgggagg cccggccggc gcttctcacg 2760
tcgcgtctga gattcatccc aaagcccgac gggctgaggc ctatcgtcaa catggattac 2820
gtcgtgggcg ctcgcacctt tcgccgtgaa aagcgggccg aacgcttgac ctcacgggtg 2880
aaggccctct tctccgtgct gaactacgag agagcaagac ggcctggcct gctgggagct 2940
tcggtgctgg gactggacga tatccaccgg gcttggcgga cctttgttct ccgggtgaga 3000
gcccaagacc ctccgccgga actgtacttc gtgaaggtgg cgatcaccgg agcctatgat 3060
actattccgc aagatcgact caccgaagtc atcgcctcga tcatcaaacc gcagaacact 3120
tactgcgtca ggcggtacgc cgtggtccag aaggccgcgc atggccacgt gagaaaggcg 3180
ttcaagtcgc acgtgtccac tctcaccgac ctccagcctt acatgaggca attcgttgcg 3240
catttgcaag agacttcgcc cctgagagat gcggtggtca tcgagcagag ctccagcctg 3300
aacgaagcga gcagcggtct gtttgacgtg ttcctccgct tcatgtgtca tcacgcggtg 3360
cgaatcaggg gaaaatcata cgtgcagtgc cagggaatcc cacaaggcag cattctgtcg 3420
actctcttgt gttccctttg ctacggcgat atggaaaaca agctgttcgc tgggatcaga 3480
cgggacgggt tgctgctcag actggtggac gacttcctgc tggtgactcc gcacctcact 3540
cacgccaaaa cctttctccg cactctggtg aggggagtgc cagaatacgg ctgtgtggtc 3600
aatctccgga aaactgtggt gaatttccct gtcgaggatg aggcactcgg aggaaccgca 3660
tttgtccaaa tgccagcaca tggcctgttc ccatggtgcg gtctgctgct ggacacccga 3720
actcttgaag tgcagtccga ctactccagc tatgcccgga cgagcatccg cgccagcctc 3780
actttcaatc gcggctttaa ggccggacga aacatgcgca gaaagctttt cggagtcctc 3840
cggcttaaat gccattcgct ctttctcgat ctccaagtca attcgctgca gaccgtgtgc 3900
acgaacatct acaagatcct gctgctccaa gcctaccggt tccacgcttg cgtgcttcag 3960
ctgccgtttc accaacaggt gtggaagaac ccgaccttct ttctgcgggt cattagcgat 4020
actgcctccc tgtgttactc aatcctcaag gcaaagaacg ccggaatgtc gctgggtgcg 4080
aaaggagccg cgggacctct tcctagcgaa gcggtgcagt ggctctgcca ccaggctttc 4140
ctcctgaagc tgaccaggca cagagtgacc tacgtcccgc tgctgggctc gctgcgcact 4200
gcacagaccc agctgtctag aaaactcccc ggcaccaccc tgaccgctct ggaagccgcc 4260
gccaacccag cattgccgtc agatttcaag accatcttgg ac 4302
<210> 21
<211> 1434
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 21
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1 5 10 15
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
20 25 30
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
35 40 45
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
50 55 60
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
65 70 75 80
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
85 90 95
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
100 105 110
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
210 215 220
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
225 230 235 240
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
245 250 255
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
260 265 270
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
275 280 285
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
290 295 300
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
305 310 315 320
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
325 330 335
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
340 345 350
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
355 360 365
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
370 375 380
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
385 390 395 400
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
405 410 415
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
420 425 430
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
435 440 445
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
450 455 460
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro
465 470 475 480
Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
485 490 495
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
500 505 510
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
515 520 525
Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Gly Ser Gly Gly Gly Ala
530 535 540
Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln Gly Ser Trp Ala His
545 550 555 560
Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val Val Ser
565 570 575
Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu Ser
580 585 590
Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gln His His Ala Gly
595 600 605
Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys Pro
610 615 620
Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly Asp Lys
625 630 635 640
Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser Leu
645 650 655
Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe Leu Gly Ser Arg Pro
660 665 670
Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gln Arg Tyr
675 680 685
Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His Ala Gln
690 695 700
Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg Ala Ala
705 710 715 720
Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gln Gly Ser
725 730 735
Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu Val Gln
740 745 750
Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val Tyr Gly Phe Val Arg
755 760 765
Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg His
770 775 780
Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe Ile Ser Leu Gly
785 790 795 800
Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr Trp Lys Met Ser Val
805 810 815
Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys Val Pro
820 825 830
Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu Ala Lys Phe Leu His
835 840 845
Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr
850 855 860
Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu Phe Phe Tyr Arg Lys
865 870 875 880
Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile Arg Gln His Leu Lys
885 890 895
Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gln His Arg
900 905 910
Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe Ile Pro Lys
915 920 925
Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp Tyr Val Val Gly Ala
930 935 940
Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val
945 950 955 960
Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly
965 970 975
Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp Ile His Arg Ala Trp
980 985 990
Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp Pro Pro Pro Glu Leu
995 1000 1005
Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr Asp Thr Ile Pro
1010 1015 1020
Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile Lys Pro Gln
1025 1030 1035
Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys Ala Ala
1040 1045 1050
His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu
1055 1060 1065
Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln
1070 1075 1080
Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser
1085 1090 1095
Ser Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg
1100 1105 1110
Phe Met Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val
1115 1120 1125
Gln Cys Gln Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu
1130 1135 1140
Cys Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly
1145 1150 1155
Ile Arg Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu
1160 1165 1170
Leu Val Thr Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr
1175 1180 1185
Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg
1190 1195 1200
Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala Leu Gly Gly
1205 1210 1215
Thr Ala Phe Val Gln Met Pro Ala His Gly Leu Phe Pro Trp Cys
1220 1225 1230
Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gln Ser Asp Tyr
1235 1240 1245
Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe Asn
1250 1255 1260
Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly
1265 1270 1275
Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val
1280 1285 1290
Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu
1295 1300 1305
Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe
1310 1315 1320
His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile
1325 1330 1335
Ser Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn
1340 1345 1350
Ala Gly Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro
1355 1360 1365
Ser Glu Ala Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys
1370 1375 1380
Leu Thr Arg His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu
1385 1390 1395
Arg Thr Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr
1400 1405 1410
Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp
1415 1420 1425
Phe Lys Thr Ile Leu Asp
1430
<210> 22
<211> 4371
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 22
atggctagca cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 60
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 180
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 240
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 300
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 360
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 420
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 480
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 540
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 600
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 660
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 720
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 840
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 900
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 960
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 1020
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1080
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1140
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1200
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1260
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1320
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1380
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1440
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1500
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1560
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggatccggc 1620
acaatcctgt ctgagggcgc caccaacttc agcctgctga aactggccgg cgacgtggaa 1680
ctgaaccctg gccctggagc tgccccggag ccggagagga cccccgttgg ccagggatcg 1740
tgggcccatc cgggacgcac caggggacca tccgacaggg gattctgtgt ggtgtcaccg 1800
gccaggccag cagaagaggc aaccagcctc gagggagcgt tgtctggaac cagacattcc 1860
cacccgtcgg tgggccggca gcaccacgcg ggaccaccgt ccacttccag accgccacgg 1920
ccatgggaca ccccttgccc gcctgtgtat gccgagacta aacacttcct gtactcatcc 1980
ggagacaagg aacagcttcg gccgtccttc ctcctgtcgt cgctcagacc gagcctgacc 2040
ggagcacgca gattggtgga aactatcttc cttgggtcac gtccgtggat gccaggtacc 2100
ccacggcgcc tcccgcgcct cccacagaga tactggcaga tgcggcctct gttcctggaa 2160
ttgctgggaa accacgctca gtgcccgtac ggagtcctgc tcaagactca ctgccctctg 2220
agggcggcgg tcactccggc ggccggagtg tgcgcacggg agaagcccca gggaagcgtg 2280
gcagctccgg aagaggagga caccgatccg cgccgcctcg tgcaacttct gcgccagcac 2340
tcctcgccct ggcaagtcta cgggttcgtc cgcgcctgcc tgcgccgcct ggtgccgcct 2400
gggctctggg gttcccggca taacgagcgc cgcttcctga gaaatactaa gaagtttatc 2460
tcacttggaa aacatgccaa gttgtcgctg caagaactca cgtggaagat gtcagtccgc 2520
gattgcgcct ggctgcgccg ctcgccgggc gtcgggtgtg ttccagctgc agaacaccgc 2580
ctgagagaag aaattctggc caaatttctg cattggctga tgtcagtgta cgtggtcgag 2640
ctgctgcgct cctttttcta cgtcactgag actacctttc aaaagaaccg cctgttcttc 2700
taccgcaaat ctgtgtggag caagctgcag tcaatcggca ttcgccagca tctgaagagg 2760
gtgcagctgc gggaactttc cgaggcagaa gtccgccagc accgggaggc ccggccggcg 2820
cttctcacgt cgcgtctgag attcatccca aagcccgacg ggctgaggcc tatcgtcaac 2880
atggattacg tcgtgggcgc tcgcaccttt cgccgtgaaa agcgggccga acgcttgacc 2940
tcacgggtga aggccctctt ctccgtgctg aactacgaga gagcaagacg gcctggcctg 3000
ctgggagctt cggtgctggg actggacgat atccaccggg cttggcggac ctttgttctc 3060
cgggtgagag cccaagaccc tccgccggaa ctgtacttcg tgaaggtggc gatcaccgga 3120
gcctatgata ctattccgca agatcgactc accgaagtca tcgcctcgat catcaaaccg 3180
cagaacactt actgcgtcag gcggtacgcc gtggtccaga aggccgcgca tggccacgtg 3240
agaaaggcgt tcaagtcgca cgtgtccact ctcaccgacc tccagcctta catgaggcaa 3300
ttcgttgcgc atttgcaaga gacttcgccc ctgagagatg cggtggtcat cgagcagagc 3360
tccagcctga acgaagcgag cagcggtctg tttgacgtgt tcctccgctt catgtgtcat 3420
cacgcggtgc gaatcagggg aaaatcatac gtgcagtgcc agggaatccc acaaggcagc 3480
attctgtcga ctctcttgtg ttccctttgc tacggcgata tggaaaacaa gctgttcgct 3540
gggatcagac gggacgggtt gctgctcaga ctggtggacg acttcctgct ggtgactccg 3600
cacctcactc acgccaaaac ctttctccgc actctggtga ggggagtgcc agaatacggc 3660
tgtgtggtca atctccggaa aactgtggtg aatttccctg tcgaggatga ggcactcgga 3720
ggaaccgcat ttgtccaaat gccagcacat ggcctgttcc catggtgcgg tctgctgctg 3780
gacacccgaa ctcttgaagt gcagtccgac tactccagct atgcccggac gagcatccgc 3840
gccagcctca ctttcaatcg cggctttaag gccggacgaa acatgcgcag aaagcttttc 3900
ggagtcctcc ggcttaaatg ccattcgctc tttctcgatc tccaagtcaa ttcgctgcag 3960
accgtgtgca cgaacatcta caagatcctg ctgctccaag cctaccggtt ccacgcttgc 4020
gtgcttcagc tgccgtttca ccaacaggtg tggaagaacc cgaccttctt tctgcgggtc 4080
attagcgata ctgcctccct gtgttactca atcctcaagg caaagaacgc cggaatgtcg 4140
ctgggtgcga aaggagccgc gggacctctt cctagcgaag cggtgcagtg gctctgccac 4200
caggctttcc tcctgaagct gaccaggcac agagtgacct acgtcccgct gctgggctcg 4260
ctgcgcactg cacagaccca gctgtctaga aaactccccg gcaccaccct gaccgctctg 4320
gaagccgccg ccaacccagc attgccgtca gatttcaaga ccatcttgga c 4371
<210> 23
<211> 1457
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 23
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1 5 10 15
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
20 25 30
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
35 40 45
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
50 55 60
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
65 70 75 80
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
85 90 95
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
100 105 110
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
210 215 220
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
225 230 235 240
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
245 250 255
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
260 265 270
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
275 280 285
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
290 295 300
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
305 310 315 320
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
325 330 335
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
340 345 350
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
355 360 365
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
370 375 380
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
385 390 395 400
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
405 410 415
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
420 425 430
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
435 440 445
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
450 455 460
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro
465 470 475 480
Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
485 490 495
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
500 505 510
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
515 520 525
Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Ser Gly Thr Ile Leu Ser
530 535 540
Glu Gly Ala Thr Asn Phe Ser Leu Leu Lys Leu Ala Gly Asp Val Glu
545 550 555 560
Leu Asn Pro Gly Pro Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val
565 570 575
Gly Gln Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp
580 585 590
Arg Gly Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr
595 600 605
Ser Leu Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val
610 615 620
Gly Arg Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg
625 630 635 640
Pro Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe
645 650 655
Leu Tyr Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu
660 665 670
Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr
675 680 685
Ile Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu
690 695 700
Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu
705 710 715 720
Leu Leu Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr
725 730 735
His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala
740 745 750
Arg Glu Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr
755 760 765
Asp Pro Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp
770 775 780
Gln Val Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro
785 790 795 800
Gly Leu Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr
805 810 815
Lys Lys Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu
820 825 830
Leu Thr Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser
835 840 845
Pro Gly Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu
850 855 860
Ile Leu Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu
865 870 875 880
Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn
885 890 895
Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile
900 905 910
Gly Ile Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu
915 920 925
Ala Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser
930 935 940
Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn
945 950 955 960
Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala
965 970 975
Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr
980 985 990
Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu
995 1000 1005
Asp Asp Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg
1010 1015 1020
Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile
1025 1030 1035
Thr Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val
1040 1045 1050
Ile Ala Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg
1055 1060 1065
Tyr Ala Val Val Gln Lys Ala Ala His Gly His Val Arg Lys Ala
1070 1075 1080
Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met
1085 1090 1095
Arg Gln Phe Val Ala His Leu Gln Glu Thr Ser Pro Leu Arg Asp
1100 1105 1110
Ala Val Val Ile Glu Gln Ser Ser Ser Leu Asn Glu Ala Ser Ser
1115 1120 1125
Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala Val
1130 1135 1140
Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln Gly Ile Pro Gln
1145 1150 1155
Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp
1160 1165 1170
Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu
1175 1180 1185
Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr
1190 1195 1200
His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu
1205 1210 1215
Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro
1220 1225 1230
Val Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro
1235 1240 1245
Ala His Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg
1250 1255 1260
Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser
1265 1270 1275
Ile Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg
1280 1285 1290
Asn Met Arg Arg Lys Leu Phe Gly Val Leu Arg Leu Lys Cys His
1295 1300 1305
Ser Leu Phe Leu Asp Leu Gln Val Asn Ser Leu Gln Thr Val Cys
1310 1315 1320
Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln Ala Tyr Arg Phe His
1325 1330 1335
Ala Cys Val Leu Gln Leu Pro Phe His Gln Gln Val Trp Lys Asn
1340 1345 1350
Pro Thr Phe Phe Leu Arg Val Ile Ser Asp Thr Ala Ser Leu Cys
1355 1360 1365
Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly Ala
1370 1375 1380
Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gln Trp Leu
1385 1390 1395
Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr
1400 1405 1410
Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr Gln Leu
1415 1420 1425
Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala
1430 1435 1440
Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp
1445 1450 1455
<210> 24
<211> 4371
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 24
atggctagcg gagctgcccc ggagccggag aggacccccg ttggccaggg atcgtgggcc 60
catccgggac gcaccagggg accatccgac aggggattct gtgtggtgtc accggccagg 120
ccagcagaag aggcaaccag cctcgaggga gcgttgtctg gaaccagaca ttcccacccg 180
tcggtgggcc ggcagcacca cgcgggacca ccgtccactt ccagaccgcc acggccatgg 240
gacacccctt gcccgcctgt gtatgccgag actaaacact tcctgtactc atccggagac 300
aaggaacagc ttcggccgtc cttcctcctg tcgtcgctca gaccgagcct gaccggagca 360
cgcagattgg tggaaactat cttccttggg tcacgtccgt ggatgccagg taccccacgg 420
cgcctcccgc gcctcccaca gagatactgg cagatgcggc ctctgttcct ggaattgctg 480
ggaaaccacg ctcagtgccc gtacggagtc ctgctcaaga ctcactgccc tctgagggcg 540
gcggtcactc cggcggccgg agtgtgcgca cgggagaagc cccagggaag cgtggcagct 600
ccggaagagg aggacaccga tccgcgccgc ctcgtgcaac ttctgcgcca gcactcctcg 660
ccctggcaag tctacgggtt cgtccgcgcc tgcctgcgcc gcctggtgcc gcctgggctc 720
tggggttccc ggcataacga gcgccgcttc ctgagaaata ctaagaagtt tatctcactt 780
ggaaaacatg ccaagttgtc gctgcaagaa ctcacgtgga agatgtcagt ccgcgattgc 840
gcctggctgc gccgctcgcc gggcgtcggg tgtgttccag ctgcagaaca ccgcctgaga 900
gaagaaattc tggccaaatt tctgcattgg ctgatgtcag tgtacgtggt cgagctgctg 960
cgctcctttt tctacgtcac tgagactacc tttcaaaaga accgcctgtt cttctaccgc 1020
aaatctgtgt ggagcaagct gcagtcaatc ggcattcgcc agcatctgaa gagggtgcag 1080
ctgcgggaac tttccgaggc agaagtccgc cagcaccggg aggcccggcc ggcgcttctc 1140
acgtcgcgtc tgagattcat cccaaagccc gacgggctga ggcctatcgt caacatggat 1200
tacgtcgtgg gcgctcgcac ctttcgccgt gaaaagcggg ccgaacgctt gacctcacgg 1260
gtgaaggccc tcttctccgt gctgaactac gagagagcaa gacggcctgg cctgctggga 1320
gcttcggtgc tgggactgga cgatatccac cgggcttggc ggacctttgt tctccgggtg 1380
agagcccaag accctccgcc ggaactgtac ttcgtgaagg tggcgatcac cggagcctat 1440
gatactattc cgcaagatcg actcaccgaa gtcatcgcct cgatcatcaa accgcagaac 1500
acttactgcg tcaggcggta cgccgtggtc cagaaggccg cgcatggcca cgtgagaaag 1560
gcgttcaagt cgcacgtgtc cactctcacc gacctccagc cttacatgag gcaattcgtt 1620
gcgcatttgc aagagacttc gcccctgaga gatgcggtgg tcatcgagca gagctccagc 1680
ctgaacgaag cgagcagcgg tctgtttgac gtgttcctcc gcttcatgtg tcatcacgcg 1740
gtgcgaatca ggggaaaatc atacgtgcag tgccagggaa tcccacaagg cagcattctg 1800
tcgactctct tgtgttccct ttgctacggc gatatggaaa acaagctgtt cgctgggatc 1860
agacgggacg ggttgctgct cagactggtg gacgacttcc tgctggtgac tccgcacctc 1920
actcacgcca aaacctttct ccgcactctg gtgaggggag tgccagaata cggctgtgtg 1980
gtcaatctcc ggaaaactgt ggtgaatttc cctgtcgagg atgaggcact cggaggaacc 2040
gcatttgtcc aaatgccagc acatggcctg ttcccatggt gcggtctgct gctggacacc 2100
cgaactcttg aagtgcagtc cgactactcc agctatgccc ggacgagcat ccgcgccagc 2160
ctcactttca atcgcggctt taaggccgga cgaaacatgc gcagaaagct tttcggagtc 2220
ctccggctta aatgccattc gctctttctc gatctccaag tcaattcgct gcagaccgtg 2280
tgcacgaaca tctacaagat cctgctgctc caagcctacc ggttccacgc ttgcgtgctt 2340
cagctgccgt ttcaccaaca ggtgtggaag aacccgacct tctttctgcg ggtcattagc 2400
gatactgcct ccctgtgtta ctcaatcctc aaggcaaaga acgccggaat gtcgctgggt 2460
gcgaaaggag ccgcgggacc tcttcctagc gaagcggtgc agtggctctg ccaccaggct 2520
ttcctcctga agctgaccag gcacagagtg acctacgtcc cgctgctggg ctcgctgcgc 2580
actgcacaga cccagctgtc tagaaaactc cccggcacca ccctgaccgc tctggaagcc 2640
gccgccaacc cagcattgcc gtcagatttc aagaccatct tggacggatc cggcacaatc 2700
ctgtctgagg gcgccaccaa cttcagcctg ctgaaactgg ccggcgacgt ggaactgaac 2760
cctggcccta cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 2820
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 2880
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 2940
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 3000
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 3060
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 3120
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 3180
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 3240
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 3300
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 3360
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 3420
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 3480
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 3540
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 3600
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 3660
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 3720
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 3780
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 3840
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 3900
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 3960
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 4020
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 4080
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 4140
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 4200
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 4260
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 4320
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct g 4371
<210> 25
<211> 1457
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 25
Met Ala Ser Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln
1 5 10 15
Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly
20 25 30
Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu
35 40 45
Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg
50 55 60
Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp
65 70 75 80
Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr
85 90 95
Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser
100 105 110
Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe
115 120 125
Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg
130 135 140
Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu
145 150 155 160
Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys
165 170 175
Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu
180 185 190
Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro
195 200 205
Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val
210 215 220
Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu
225 230 235 240
Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys
245 250 255
Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr
260 265 270
Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly
275 280 285
Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu
290 295 300
Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu
305 310 315 320
Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu
325 330 335
Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile
340 345 350
Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu
355 360 365
Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu
370 375 380
Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp
385 390 395 400
Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg
405 410 415
Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg
420 425 430
Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp
435 440 445
Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp
450 455 460
Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr
465 470 475 480
Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile
485 490 495
Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys
500 505 510
Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr
515 520 525
Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln
530 535 540
Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser
545 550 555 560
Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met
565 570 575
Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln
580 585 590
Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys
595 600 605
Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly
610 615 620
Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu
625 630 635 640
Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu
645 650 655
Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val
660 665 670
Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His
675 680 685
Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu
690 695 700
Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser
705 710 715 720
Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys
725 730 735
Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu
740 745 750
Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu
755 760 765
Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe
770 775 780
His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser
785 790 795 800
Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly
805 810 815
Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala
820 825 830
Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His
835 840 845
Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr
850 855 860
Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala
865 870 875 880
Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp Gly
885 890 895
Ser Gly Thr Ile Leu Ser Glu Gly Ala Thr Asn Phe Ser Leu Leu Lys
900 905 910
Leu Ala Gly Asp Val Glu Leu Asn Pro Gly Pro Thr Pro Gly Thr Gln
915 920 925
Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr Val Leu Thr Val Val Thr
930 935 940
Gly Ser Gly His Ala Ser Ser Thr Pro Gly Gly Glu Lys Glu Thr Ser
945 950 955 960
Ala Thr Gln Arg Ser Ser Val Pro Ser Ser Thr Glu Lys Asn Ala Val
965 970 975
Ser Met Thr Ser Ser Val Leu Ser Ser His Ser Pro Gly Ser Gly Ser
980 985 990
Ser Thr Thr Gln Gly Gln Asp Val Thr Leu Ala Pro Ala Thr Glu Pro
995 1000 1005
Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln Asp Val Thr Ser Val
1010 1015 1020
Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro Pro Ala His
1025 1030 1035
Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser Thr
1040 1045 1050
Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
1055 1060 1065
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala
1070 1075 1080
Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
1085 1090 1095
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr
1100 1105 1110
Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
1115 1120 1125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala
1130 1135 1140
Pro Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His
1145 1150 1155
Asn Val Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr
1160 1165 1170
Leu Val His Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala
1175 1180 1185
Ser Lys Ser Thr Pro Phe Ser Ile Pro Ser His His Ser Asp Thr
1190 1195 1200
Pro Thr Thr Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser Ser
1205 1210 1215
Thr His His Ser Ser Val Pro Pro Leu Thr Ser Ser Asn His Ser
1220 1225 1230
Thr Ser Pro Gln Leu Ser Thr Gly Val Ser Phe Phe Phe Leu Ser
1235 1240 1245
Phe His Ile Ser Asn Leu Gln Phe Asn Ser Ser Leu Glu Asp Pro
1250 1255 1260
Ser Thr Asp Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser Glu Met
1265 1270 1275
Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser Asn
1280 1285 1290
Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu Ala
1295 1300 1305
Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe
1310 1315 1320
Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile
1325 1330 1335
Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln
1340 1345 1350
Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
1355 1360 1365
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu
1370 1375 1380
Ala Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile
1385 1390 1395
Phe Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr
1400 1405 1410
Tyr His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg
1415 1420 1425
Ser Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu
1430 1435 1440
Ser Tyr Thr Asn Pro Ala Val Ala Ala Ala Ser Ala Asn Leu
1445 1450 1455
<210> 26
<211> 4311
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 26
atggctagca caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 60
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 120
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 180
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 240
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 300
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 360
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 420
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 480
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 540
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 600
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 660
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 720
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 780
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 840
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 900
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 960
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1020
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1080
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1140
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1200
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1260
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1320
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1380
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1440
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1500
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggatccggc 1560
acaatcctgt ctgagggcgc caccaacttc agcctgctga aactggccgg cgacgtggaa 1620
ctgaaccctg gccctggagc tgccccggag ccggagagga cccccgttgg ccagggatcg 1680
tgggcccatc cgggacgcac caggggacca tccgacaggg gattctgtgt ggtgtcaccg 1740
gccaggccag cagaagaggc aaccagcctc gagggagcgt tgtctggaac cagacattcc 1800
cacccgtcgg tgggccggca gcaccacgcg ggaccaccgt ccacttccag accgccacgg 1860
ccatgggaca ccccttgccc gcctgtgtat gccgagacta aacacttcct gtactcatcc 1920
ggagacaagg aacagcttcg gccgtccttc ctcctgtcgt cgctcagacc gagcctgacc 1980
ggagcacgca gattggtgga aactatcttc cttgggtcac gtccgtggat gccaggtacc 2040
ccacggcgcc tcccgcgcct cccacagaga tactggcaga tgcggcctct gttcctggaa 2100
ttgctgggaa accacgctca gtgcccgtac ggagtcctgc tcaagactca ctgccctctg 2160
agggcggcgg tcactccggc ggccggagtg tgcgcacggg agaagcccca gggaagcgtg 2220
gcagctccgg aagaggagga caccgatccg cgccgcctcg tgcaacttct gcgccagcac 2280
tcctcgccct ggcaagtcta cgggttcgtc cgcgcctgcc tgcgccgcct ggtgccgcct 2340
gggctctggg gttcccggca taacgagcgc cgcttcctga gaaatactaa gaagtttatc 2400
tcacttggaa aacatgccaa gttgtcgctg caagaactca cgtggaagat gtcagtccgc 2460
gattgcgcct ggctgcgccg ctcgccgggc gtcgggtgtg ttccagctgc agaacaccgc 2520
ctgagagaag aaattctggc caaatttctg cattggctga tgtcagtgta cgtggtcgag 2580
ctgctgcgct cctttttcta cgtcactgag actacctttc aaaagaaccg cctgttcttc 2640
taccgcaaat ctgtgtggag caagctgcag tcaatcggca ttcgccagca tctgaagagg 2700
gtgcagctgc gggaactttc cgaggcagaa gtccgccagc accgggaggc ccggccggcg 2760
cttctcacgt cgcgtctgag attcatccca aagcccgacg ggctgaggcc tatcgtcaac 2820
atggattacg tcgtgggcgc tcgcaccttt cgccgtgaaa agcgggccga acgcttgacc 2880
tcacgggtga aggccctctt ctccgtgctg aactacgaga gagcaagacg gcctggcctg 2940
ctgggagctt cggtgctggg actggacgat atccaccggg cttggcggac ctttgttctc 3000
cgggtgagag cccaagaccc tccgccggaa ctgtacttcg tgaaggtggc gatcaccgga 3060
gcctatgata ctattccgca agatcgactc accgaagtca tcgcctcgat catcaaaccg 3120
cagaacactt actgcgtcag gcggtacgcc gtggtccaga aggccgcgca tggccacgtg 3180
agaaaggcgt tcaagtcgca cgtgtccact ctcaccgacc tccagcctta catgaggcaa 3240
ttcgttgcgc atttgcaaga gacttcgccc ctgagagatg cggtggtcat cgagcagagc 3300
tccagcctga acgaagcgag cagcggtctg tttgacgtgt tcctccgctt catgtgtcat 3360
cacgcggtgc gaatcagggg aaaatcatac gtgcagtgcc agggaatccc acaaggcagc 3420
attctgtcga ctctcttgtg ttccctttgc tacggcgata tggaaaacaa gctgttcgct 3480
gggatcagac gggacgggtt gctgctcaga ctggtggacg acttcctgct ggtgactccg 3540
cacctcactc acgccaaaac ctttctccgc actctggtga ggggagtgcc agaatacggc 3600
tgtgtggtca atctccggaa aactgtggtg aatttccctg tcgaggatga ggcactcgga 3660
ggaaccgcat ttgtccaaat gccagcacat ggcctgttcc catggtgcgg tctgctgctg 3720
gacacccgaa ctcttgaagt gcagtccgac tactccagct atgcccggac gagcatccgc 3780
gccagcctca ctttcaatcg cggctttaag gccggacgaa acatgcgcag aaagcttttc 3840
ggagtcctcc ggcttaaatg ccattcgctc tttctcgatc tccaagtcaa ttcgctgcag 3900
accgtgtgca cgaacatcta caagatcctg ctgctccaag cctaccggtt ccacgcttgc 3960
gtgcttcagc tgccgtttca ccaacaggtg tggaagaacc cgaccttctt tctgcgggtc 4020
attagcgata ctgcctccct gtgttactca atcctcaagg caaagaacgc cggaatgtcg 4080
ctgggtgcga aaggagccgc gggacctctt cctagcgaag cggtgcagtg gctctgccac 4140
caggctttcc tcctgaagct gaccaggcac agagtgacct acgtcccgct gctgggctcg 4200
ctgcgcactg cacagaccca gctgtctaga aaactccccg gcaccaccct gaccgctctg 4260
gaagccgccg ccaacccagc attgccgtca gatttcaaga ccatcttgga c 4311
<210> 27
<211> 1437
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 27
Met Ala Ser Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly Gly Glu
1 5 10 15
Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser Ser Thr Glu
20 25 30
Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His Ser Pro
35 40 45
Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr Leu Ala Pro
50 55 60
Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln Asp Val
65 70 75 80
Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro Pro
85 90 95
Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser
100 105 110
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr Ser Ala Ser
210 215 220
Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His Asn Gly Thr Ser
225 230 235 240
Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro Phe Ser Ile
245 250 255
Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala Ser His Ser Thr
260 265 270
Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val Pro Pro Leu Thr
275 280 285
Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr Gly Val Ser Phe
290 295 300
Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe Asn Ser Ser Leu
305 310 315 320
Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser
325 330 335
Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser
340 345 350
Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu Ala
355 360 365
Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe Asn
370 375 380
Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser Asp
385 390 395 400
Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser Gly Ala
405 410 415
Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu Val Cys Val Leu
420 425 430
Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala Val Cys Gln Cys
435 440 445
Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro Ala Arg Asp Thr
450 455 460
Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr His Gly Arg Tyr
465 470 475 480
Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu Lys Val Ser Ala
485 490 495
Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro Ala Val Ala Ala
500 505 510
Ala Ser Ala Asn Leu Gly Ser Gly Thr Ile Leu Ser Glu Gly Ala Thr
515 520 525
Asn Phe Ser Leu Leu Lys Leu Ala Gly Asp Val Glu Leu Asn Pro Gly
530 535 540
Pro Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln Gly Ser
545 550 555 560
Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys
565 570 575
Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly
580 585 590
Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gln His
595 600 605
His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr
610 615 620
Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser
625 630 635 640
Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg
645 650 655
Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe Leu Gly
660 665 670
Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro
675 680 685
Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn
690 695 700
His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu
705 710 715 720
Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro
725 730 735
Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg
740 745 750
Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val Tyr Gly
755 760 765
Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly
770 775 780
Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe Ile
785 790 795 800
Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr Trp Lys
805 810 815
Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly
820 825 830
Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu Ala Lys
835 840 845
Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser
850 855 860
Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu Phe Phe
865 870 875 880
Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile Arg Gln
885 890 895
His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu Val Arg
900 905 910
Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe
915 920 925
Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp Tyr Val
930 935 940
Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr
945 950 955 960
Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg
965 970 975
Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp Ile His
980 985 990
Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp Pro Pro
995 1000 1005
Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr Asp
1010 1015 1020
Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile
1025 1030 1035
Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln
1040 1045 1050
Lys Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val
1055 1060 1065
Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala
1070 1075 1080
His Leu Gln Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu
1085 1090 1095
Gln Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val
1100 1105 1110
Phe Leu Arg Phe Met Cys His His Ala Val Arg Ile Arg Gly Lys
1115 1120 1125
Ser Tyr Val Gln Cys Gln Gly Ile Pro Gln Gly Ser Ile Leu Ser
1130 1135 1140
Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu
1145 1150 1155
Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu Leu Arg Leu Val Asp
1160 1165 1170
Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala Lys Thr Phe
1175 1180 1185
Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val Val
1190 1195 1200
Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala
1205 1210 1215
Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His Gly Leu Phe
1220 1225 1230
Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gln
1235 1240 1245
Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser Leu
1250 1255 1260
Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys
1265 1270 1275
Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp
1280 1285 1290
Leu Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys
1295 1300 1305
Ile Leu Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln
1310 1315 1320
Leu Pro Phe His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu
1325 1330 1335
Arg Val Ile Ser Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys
1340 1345 1350
Ala Lys Asn Ala Gly Met Ser Leu Gly Ala Lys Gly Ala Ala Gly
1355 1360 1365
Pro Leu Pro Ser Glu Ala Val Gln Trp Leu Cys His Gln Ala Phe
1370 1375 1380
Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr Val Pro Leu Leu
1385 1390 1395
Gly Ser Leu Arg Thr Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro
1400 1405 1410
Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu
1415 1420 1425
Pro Ser Asp Phe Lys Thr Ile Leu Asp
1430 1435
<210> 28
<211> 4311
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 28
atggctagcg gagctgcccc ggagccggag aggacccccg ttggccaggg atcgtgggcc 60
catccgggac gcaccagggg accatccgac aggggattct gtgtggtgtc accggccagg 120
ccagcagaag aggcaaccag cctcgaggga gcgttgtctg gaaccagaca ttcccacccg 180
tcggtgggcc ggcagcacca cgcgggacca ccgtccactt ccagaccgcc acggccatgg 240
gacacccctt gcccgcctgt gtatgccgag actaaacact tcctgtactc atccggagac 300
aaggaacagc ttcggccgtc cttcctcctg tcgtcgctca gaccgagcct gaccggagca 360
cgcagattgg tggaaactat cttccttggg tcacgtccgt ggatgccagg taccccacgg 420
cgcctcccgc gcctcccaca gagatactgg cagatgcggc ctctgttcct ggaattgctg 480
ggaaaccacg ctcagtgccc gtacggagtc ctgctcaaga ctcactgccc tctgagggcg 540
gcggtcactc cggcggccgg agtgtgcgca cgggagaagc cccagggaag cgtggcagct 600
ccggaagagg aggacaccga tccgcgccgc ctcgtgcaac ttctgcgcca gcactcctcg 660
ccctggcaag tctacgggtt cgtccgcgcc tgcctgcgcc gcctggtgcc gcctgggctc 720
tggggttccc ggcataacga gcgccgcttc ctgagaaata ctaagaagtt tatctcactt 780
ggaaaacatg ccaagttgtc gctgcaagaa ctcacgtgga agatgtcagt ccgcgattgc 840
gcctggctgc gccgctcgcc gggcgtcggg tgtgttccag ctgcagaaca ccgcctgaga 900
gaagaaattc tggccaaatt tctgcattgg ctgatgtcag tgtacgtggt cgagctgctg 960
cgctcctttt tctacgtcac tgagactacc tttcaaaaga accgcctgtt cttctaccgc 1020
aaatctgtgt ggagcaagct gcagtcaatc ggcattcgcc agcatctgaa gagggtgcag 1080
ctgcgggaac tttccgaggc agaagtccgc cagcaccggg aggcccggcc ggcgcttctc 1140
acgtcgcgtc tgagattcat cccaaagccc gacgggctga ggcctatcgt caacatggat 1200
tacgtcgtgg gcgctcgcac ctttcgccgt gaaaagcggg ccgaacgctt gacctcacgg 1260
gtgaaggccc tcttctccgt gctgaactac gagagagcaa gacggcctgg cctgctggga 1320
gcttcggtgc tgggactgga cgatatccac cgggcttggc ggacctttgt tctccgggtg 1380
agagcccaag accctccgcc ggaactgtac ttcgtgaagg tggcgatcac cggagcctat 1440
gatactattc cgcaagatcg actcaccgaa gtcatcgcct cgatcatcaa accgcagaac 1500
acttactgcg tcaggcggta cgccgtggtc cagaaggccg cgcatggcca cgtgagaaag 1560
gcgttcaagt cgcacgtgtc cactctcacc gacctccagc cttacatgag gcaattcgtt 1620
gcgcatttgc aagagacttc gcccctgaga gatgcggtgg tcatcgagca gagctccagc 1680
ctgaacgaag cgagcagcgg tctgtttgac gtgttcctcc gcttcatgtg tcatcacgcg 1740
gtgcgaatca ggggaaaatc atacgtgcag tgccagggaa tcccacaagg cagcattctg 1800
tcgactctct tgtgttccct ttgctacggc gatatggaaa acaagctgtt cgctgggatc 1860
agacgggacg ggttgctgct cagactggtg gacgacttcc tgctggtgac tccgcacctc 1920
actcacgcca aaacctttct ccgcactctg gtgaggggag tgccagaata cggctgtgtg 1980
gtcaatctcc ggaaaactgt ggtgaatttc cctgtcgagg atgaggcact cggaggaacc 2040
gcatttgtcc aaatgccagc acatggcctg ttcccatggt gcggtctgct gctggacacc 2100
cgaactcttg aagtgcagtc cgactactcc agctatgccc ggacgagcat ccgcgccagc 2160
ctcactttca atcgcggctt taaggccgga cgaaacatgc gcagaaagct tttcggagtc 2220
ctccggctta aatgccattc gctctttctc gatctccaag tcaattcgct gcagaccgtg 2280
tgcacgaaca tctacaagat cctgctgctc caagcctacc ggttccacgc ttgcgtgctt 2340
cagctgccgt ttcaccaaca ggtgtggaag aacccgacct tctttctgcg ggtcattagc 2400
gatactgcct ccctgtgtta ctcaatcctc aaggcaaaga acgccggaat gtcgctgggt 2460
gcgaaaggag ccgcgggacc tcttcctagc gaagcggtgc agtggctctg ccaccaggct 2520
ttcctcctga agctgaccag gcacagagtg acctacgtcc cgctgctggg ctcgctgcgc 2580
actgcacaga cccagctgtc tagaaaactc cccggcacca ccctgaccgc tctggaagcc 2640
gccgccaacc cagcattgcc gtcagatttc aagaccatct tggacggatc cggcacaatc 2700
ctgtctgagg gcgccaccaa cttcagcctg ctgaaactgg ccggcgacgt ggaactgaac 2760
cctggcccta caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 2820
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 2880
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 2940
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 3000
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 3060
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 3120
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 3180
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 3240
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 3300
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 3360
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 3420
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 3480
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 3540
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 3600
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 3660
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 3720
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 3780
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 3840
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 3900
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 3960
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 4020
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 4080
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 4140
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 4200
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 4260
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct g 4311
<210> 29
<211> 1437
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 29
Met Ala Ser Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln
1 5 10 15
Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly
20 25 30
Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu
35 40 45
Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg
50 55 60
Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp
65 70 75 80
Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr
85 90 95
Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser
100 105 110
Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe
115 120 125
Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg
130 135 140
Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu
145 150 155 160
Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys
165 170 175
Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu
180 185 190
Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro
195 200 205
Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val
210 215 220
Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu
225 230 235 240
Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys
245 250 255
Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr
260 265 270
Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly
275 280 285
Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu
290 295 300
Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu
305 310 315 320
Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu
325 330 335
Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile
340 345 350
Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu
355 360 365
Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu
370 375 380
Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp
385 390 395 400
Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg
405 410 415
Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg
420 425 430
Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp
435 440 445
Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp
450 455 460
Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr
465 470 475 480
Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile
485 490 495
Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys
500 505 510
Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr
515 520 525
Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln
530 535 540
Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser
545 550 555 560
Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met
565 570 575
Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln
580 585 590
Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys
595 600 605
Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly
610 615 620
Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu
625 630 635 640
Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu
645 650 655
Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val
660 665 670
Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His
675 680 685
Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu
690 695 700
Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser
705 710 715 720
Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys
725 730 735
Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu
740 745 750
Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu
755 760 765
Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe
770 775 780
His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser
785 790 795 800
Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly
805 810 815
Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala
820 825 830
Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His
835 840 845
Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr
850 855 860
Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala
865 870 875 880
Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp Gly
885 890 895
Ser Gly Thr Ile Leu Ser Glu Gly Ala Thr Asn Phe Ser Leu Leu Lys
900 905 910
Leu Ala Gly Asp Val Glu Leu Asn Pro Gly Pro Thr Gly Ser Gly His
915 920 925
Ala Ser Ser Thr Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg
930 935 940
Ser Ser Val Pro Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser
945 950 955 960
Ser Val Leu Ser Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln
965 970 975
Gly Gln Asp Val Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser
980 985 990
Ala Ala Thr Trp Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro
995 1000 1005
Ala Leu Gly Ser Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala
1010 1015 1020
Pro Asp Asn Lys Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
1025 1030 1035
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr
1040 1045 1050
Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
1055 1060 1065
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala
1070 1075 1080
Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
1085 1090 1095
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr
1100 1105 1110
Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
1115 1120 1125
Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr Ser Ala
1130 1135 1140
Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His Asn Gly
1145 1150 1155
Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro
1160 1165 1170
Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
1175 1180 1185
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser
1190 1195 1200
Val Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu
1205 1210 1215
Ser Thr Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn
1220 1225 1230
Leu Gln Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr
1235 1240 1245
Gln Glu Leu Gln Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr
1250 1255 1260
Lys Gln Gly Gly Phe Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro
1265 1270 1275
Gly Ser Val Val Val Gln Leu Thr Leu Ala Phe Arg Glu Gly Thr
1280 1285 1290
Ile Asn Val His Asp Val Glu Thr Gln Phe Asn Gln Tyr Lys Thr
1295 1300 1305
Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser Asp Val Ser Val
1310 1315 1320
Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser Gly Ala Gly Val
1325 1330 1335
Pro Gly Trp Gly Ile Ala Leu Leu Val Leu Val Cys Val Leu Val
1340 1345 1350
Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala Val Cys Gln Cys
1355 1360 1365
Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro Ala Arg Asp
1370 1375 1380
Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr His Gly
1385 1390 1395
Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu Lys
1400 1405 1410
Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
1415 1420 1425
Ala Val Ala Ala Ala Ser Ala Asn Leu
1430 1435
<210> 30
<211> 3264
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 30
atggctagca cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 60
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 180
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 240
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 300
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 360
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 420
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 480
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 540
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 600
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 660
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 720
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 840
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 900
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 960
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 1020
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1080
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1140
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1200
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1260
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1320
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1380
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1440
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1500
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1560
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggatccggc 1620
agaatcttca acgcccacta cgccggctac ttcgccgacc tgctgatcca cgacatcgag 1680
acaaaccctg gccccgaatc gccaagcgca ccccctcatc ggtggtgcat cccttggcaa 1740
cgcctcctcc tgaccgcctc actgctgact ttctggaacc cgccgaccac cgcaaagctg 1800
accattgaga gcactccctt caacgtggct gaggggaagg aggtgctgct cctggtgcac 1860
aatctgcccc agcacctgtt cgggtactcc tggtacaagg gagaacgcgt ggacgggaac 1920
cggcagatca taggctacgt catcggaacc cagcaggcca cacccggtcc agcgtacagc 1980
ggccgggaga ttatctaccc gaacgcctcc ctgctgatcc aaaacatcat ccagaacgac 2040
accggtttct acactctgca cgtgattaag tcagatctgg tcaacgaaga ggccaccggc 2100
caattcaggg tgtaccccga actccctaag ccgttcatca cctcgaacaa cagcaacccg 2160
gtcgaggatg aagatgcggt ggccttgacg tgcgaacctg agatccagaa caccacctac 2220
ttgtggtggg tgaacaatca gagcctgcca gtctccccac gactccagct gtcgaacgac 2280
aacaggaccc tgactttgct gtccgtgact cggaacgacg tgggccctta tgaatgcggt 2340
atccagaaca agctgtccgt ggaccacagc gaccctgtga tcctgaacgt cctttacggg 2400
ccggacgacc ccaccatttc cccgtcgtac acttactacc ggccgggcgt gaacctgtcc 2460
ctgtcgtgcc acgctgcctc caatccgccg gcccagtact cctggctcat cgacggaaac 2520
atccagcagc acacccaaga actgttcatc tccaacatta ccgagaaaaa ctcgggactt 2580
tacacctgtc aagccaacaa ttccgccagc ggccactccc gcaccactgt caaaactatc 2640
actgtgtccg ccgaactccc gaagcccagc atcagctcca acaactcgaa gcccgtggag 2700
gataaggacg ctgtcgcgtt cacctgtgaa ccagaggcac agaataccac ctacctttgg 2760
tgggtcaacg gacagtccct gcctgtctca ccgagactgc agctgtcaaa cgggaatagg 2820
actctgacct tgtttaacgt cacccggaac gacgcccggg cctacgtgtg cggcatccag 2880
aactccgtga gcgcaaaccg gtctgaccca gtgaccctgg atgtgctgta cggccccgac 2940
actccgatca tttcaccccc cgattcatcc tacctgtccg gcgctaacct caacctctca 3000
tgccactccg catccaaccc cagcccgcaa tattcgtggc gcattaacgg aattcctcag 3060
caacataccc aggtcctgtt cattgcgaag atcaccccta acaacaacgg aacctacgcc 3120
tgctttgtgt caaacctggc cactggtaga aacaactcca tcgtgaagtc cattaccgtg 3180
tcggcgtccg gaacttcccc gggcctgagc gccggcgcca ccgtgggaat tatgatcggc 3240
gtgctcgtgg gagtggccct gatc 3264
<210> 31
<211> 1088
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 31
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1 5 10 15
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
20 25 30
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
35 40 45
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
50 55 60
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
65 70 75 80
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
85 90 95
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
100 105 110
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
210 215 220
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
225 230 235 240
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
245 250 255
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
260 265 270
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
275 280 285
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
290 295 300
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
305 310 315 320
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
325 330 335
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
340 345 350
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
355 360 365
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
370 375 380
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
385 390 395 400
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
405 410 415
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
420 425 430
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
435 440 445
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
450 455 460
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro
465 470 475 480
Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
485 490 495
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
500 505 510
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
515 520 525
Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Ser Gly Arg Ile Phe Asn
530 535 540
Ala His Tyr Ala Gly Tyr Phe Ala Asp Leu Leu Ile His Asp Ile Glu
545 550 555 560
Thr Asn Pro Gly Pro Glu Ser Pro Ser Ala Pro Pro His Arg Trp Cys
565 570 575
Ile Pro Trp Gln Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp
580 585 590
Asn Pro Pro Thr Thr Ala Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn
595 600 605
Val Ala Glu Gly Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln
610 615 620
His Leu Phe Gly Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn
625 630 635 640
Arg Gln Ile Ile Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly
645 650 655
Pro Ala Tyr Ser Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu
660 665 670
Ile Gln Asn Ile Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val
675 680 685
Ile Lys Ser Asp Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val
690 695 700
Tyr Pro Glu Leu Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro
705 710 715 720
Val Glu Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln
725 730 735
Asn Thr Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser
740 745 750
Pro Arg Leu Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser
755 760 765
Val Thr Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Lys
770 775 780
Leu Ser Val Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly
785 790 795 800
Pro Asp Asp Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly
805 810 815
Val Asn Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln
820 825 830
Tyr Ser Trp Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu
835 840 845
Phe Ile Ser Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln
850 855 860
Ala Asn Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile
865 870 875 880
Thr Val Ser Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser
885 890 895
Lys Pro Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu
900 905 910
Ala Gln Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro
915 920 925
Val Ser Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu
930 935 940
Phe Asn Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln
945 950 955 960
Asn Ser Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu
965 970 975
Tyr Gly Pro Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu
980 985 990
Ser Gly Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser
995 1000 1005
Pro Gln Tyr Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr
1010 1015 1020
Gln Val Leu Phe Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr
1025 1030 1035
Tyr Ala Cys Phe Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser
1040 1045 1050
Ile Val Lys Ser Ile Thr Val Ser Ala Ser Gly Thr Ser Pro Gly
1055 1060 1065
Leu Ser Ala Gly Ala Thr Val Gly Ile Met Ile Gly Val Leu Val
1070 1075 1080
Gly Val Ala Leu Ile
1085
<210> 32
<211> 3243
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 32
atggctagcg aatcgccaag cgcaccccct catcggtggt gcatcccttg gcaacgcctc 60
ctcctgaccg cctcactgct gactttctgg aacccgccga ccaccgcaaa gctgaccatt 120
gagagcactc ccttcaacgt ggctgagggg aaggaggtgc tgctcctggt gcacaatctg 180
ccccagcacc tgttcgggta ctcctggtac aagggagaac gcgtggacgg gaaccggcag 240
atcataggct acgtcatcgg aacccagcag gccacacccg gtccagcgta cagcggccgg 300
gagattatct acccgaacgc ctccctgctg atccaaaaca tcatccagaa cgacaccggt 360
ttctacactc tgcacgtgat taagtcagat ctggtcaacg aagaggccac cggccaattc 420
agggtgtacc ccgaactccc taagccgttc atcacctcga acaacagcaa cccggtcgag 480
gatgaagatg cggtggcctt gacgtgcgaa cctgagatcc agaacaccac ctacttgtgg 540
tgggtgaaca atcagagcct gccagtctcc ccacgactcc agctgtcgaa cgacaacagg 600
accctgactt tgctgtccgt gactcggaac gacgtgggcc cttatgaatg cggtatccag 660
aacaagctgt ccgtggacca cagcgaccct gtgatcctga acgtccttta cgggccggac 720
gaccccacca tttccccgtc gtacacttac taccggccgg gcgtgaacct gtccctgtcg 780
tgccacgctg cctccaatcc gccggcccag tactcctggc tcatcgacgg aaacatccag 840
cagcacaccc aagaactgtt catctccaac attaccgaga aaaactcggg actttacacc 900
tgtcaagcca acaattccgc cagcggccac tcccgcacca ctgtcaaaac tatcactgtg 960
tccgccgaac tcccgaagcc cagcatcagc tccaacaact cgaagcccgt ggaggataag 1020
gacgctgtcg cgttcacctg tgaaccagag gcacagaata ccacctacct ttggtgggtc 1080
aacggacagt ccctgcctgt ctcaccgaga ctgcagctgt caaacgggaa taggactctg 1140
accttgttta acgtcacccg gaacgacgcc cgggcctacg tgtgcggcat ccagaactcc 1200
gtgagcgcaa accggtctga cccagtgacc ctggatgtgc tgtacggccc cgacactccg 1260
atcatttcac cccccgattc atcctacctg tccggcgcta acctcaacct ctcatgccac 1320
tccgcatcca accccagccc gcaatattcg tggcgcatta acggaattcc tcagcaacat 1380
acccaggtcc tgttcattgc gaagatcacc cctaacaaca acggaaccta cgcctgcttt 1440
gtgtcaaacc tggccactgg tagaaacaac tccatcgtga agtccattac cgtgtcggcg 1500
tccggaactt ccccgggcct gagcgccggc gccaccgtgg gaattatgat cggcgtgctc 1560
gtgggagtgg ccctgatcgg atccggcgag ggcagaggca gcctgctgac atgtggcgac 1620
gtggaagaga accctggccc cacccctgga acccagagcc ccttcttcct tctgctgctg 1680
ctgaccgtgc tgactgtcgt gacaggctct ggccacgcca gctctacacc tggcggcgag 1740
aaagagacaa gcgccaccca gagaagcagc gtgccaagca gcaccgagaa gaacgccgtg 1800
tccatgacca gctccgtgct gagcagccac tctcctggca gcggcagcag cacaacacag 1860
ggccaggatg tgacactggc ccctgccaca gaacctgcct ctggatctgc cgccacctgg 1920
ggacaggacg tgacaagcgt gccagtgacc agacctgccc tgggctctac aacaccccct 1980
gcccacgatg tgaccagcgc ccctgataac aagcctgccc ctggaagcac agcccctcca 2040
gctcatggcg tgacctctgc cccagatacc agaccagccc caggatctac agccccaccc 2100
gcacacggcg tgacaagtgc ccctgacaca agacccgctc caggctctac tgctcctcct 2160
gcccatggcg tgacaagcgc tcccgataca aggccagctc ctggctccac agcaccacca 2220
gcacatggcg tgacatcagc tcccgacact agacctgctc ccggatcaac cgctccacca 2280
gctcacggcg tgaccagcgc acctgatacc agacctgctc tgggaagcac cgcccctccc 2340
gtgcacaatg tgacatctgc ttccggcagc gccagcggct ctgcctctac actggtgcac 2400
aacggcacca gcgccagagc cacaacaacc ccagccagca agagcacccc cttcagcatc 2460
cctagccacc acagcgacac ccctaccaca ctggccagcc actccaccaa gaccgatgcc 2520
tctagcaccc accactccag cgtgccccct ctgaccagca gcaaccacag cacaagcccc 2580
cagctgtcta ccggcgtctc attcttcttt ctgtccttcc acatcagcaa cctgcagttc 2640
aacagcagcc tggaagatcc cagcaccgac tactaccagg aactgcagcg ggatatcagc 2700
gagatgttcc tgcaaatcta caagcagggc ggcttcctgg gcctgagcaa catcaagttc 2760
agacccggca gcgtggtggt gcagctgacc ctggctttcc gggaaggcac catcaacgtg 2820
cacgacgtgg aaacccagtt caaccagtac aagaccgagg ccgccagccg gtacaacctg 2880
accatctccg atgtgtccgt gtccgacgtg cccttcccat tctctgccca gtctggcgca 2940
ggcgtgccag gatggggaat tgctctgctg gtgctcgtgt gcgtgctggt ggccctggcc 3000
atcgtgtatc tgattgccct ggccgtgtgc cagtgccggc ggaagaatta cggccagctg 3060
gacatcttcc ccgccagaga cacctaccac cccatgagcg agtaccccac ataccacacc 3120
cacggcagat acgtgccacc cagctccacc gacagatccc cctacgagaa agtgtctgcc 3180
ggcaacggcg gcagctccct gagctacaca aatcctgccg tggccgctgc ctccgccaac 3240
ctg 3243
<210> 33
<211> 1081
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 33
Met Ala Ser Glu Ser Pro Ser Ala Pro Pro His Arg Trp Cys Ile Pro
1 5 10 15
Trp Gln Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro
20 25 30
Pro Thr Thr Ala Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala
35 40 45
Glu Gly Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His Leu
50 55 60
Phe Gly Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gln
65 70 75 80
Ile Ile Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala
85 90 95
Tyr Ser Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln
100 105 110
Asn Ile Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile Lys
115 120 125
Ser Asp Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr Pro
130 135 140
Glu Leu Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro Val Glu
145 150 155 160
Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln Asn Thr
165 170 175
Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg
180 185 190
Leu Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr
195 200 205
Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Lys Leu Ser
210 215 220
Val Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro Asp
225 230 235 240
Asp Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn
245 250 255
Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser
260 265 270
Trp Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe Ile
275 280 285
Ser Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala Asn
290 295 300
Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile Thr Val
305 310 315 320
Ser Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro
325 330 335
Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gln
340 345 350
Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val Ser
355 360 365
Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn
370 375 380
Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln Asn Ser
385 390 395 400
Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly
405 410 415
Pro Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly
420 425 430
Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gln
435 440 445
Tyr Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr Gln Val Leu
450 455 460
Phe Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe
465 470 475 480
Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys Ser Ile
485 490 495
Thr Val Ser Ala Ser Gly Thr Ser Pro Gly Leu Ser Ala Gly Ala Thr
500 505 510
Val Gly Ile Met Ile Gly Val Leu Val Gly Val Ala Leu Ile Gly Ser
515 520 525
Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn
530 535 540
Pro Gly Pro Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
545 550 555 560
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
565 570 575
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
580 585 590
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
595 600 605
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
610 615 620
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
625 630 635 640
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
645 650 655
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
660 665 670
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
675 680 685
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
690 695 700
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
705 710 715 720
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
725 730 735
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
740 745 750
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
755 760 765
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
770 775 780
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
785 790 795 800
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
805 810 815
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
820 825 830
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
835 840 845
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
850 855 860
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
865 870 875 880
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
885 890 895
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
900 905 910
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
915 920 925
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
930 935 940
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
945 950 955 960
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
965 970 975
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
980 985 990
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
995 1000 1005
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe
1010 1015 1020
Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr
1025 1030 1035
His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser
1040 1045 1050
Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser
1055 1060 1065
Tyr Thr Asn Pro Ala Val Ala Ala Ala Ser Ala Asn Leu
1070 1075 1080
<210> 34
<211> 3255
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 34
atggctagcg aatcgccaag cgcaccccct catcggtggt gcatcccttg gcaacgcctc 60
ctcctgaccg cctcactgct gactttctgg aacccgccga ccaccgcaaa gctgaccatt 120
gagagcactc ccttcaacgt ggctgagggg aaggaggtgc tgctcctggt gcacaatctg 180
ccccagcacc tgttcgggta ctcctggtac aagggagaac gcgtggacgg gaaccggcag 240
atcataggct acgtcatcgg aacccagcag gccacacccg gtccagcgta cagcggccgg 300
gagattatct acccgaacgc ctccctgctg atccaaaaca tcatccagaa cgacaccggt 360
ttctacactc tgcacgtgat taagtcagat ctggtcaacg aagaggccac cggccaattc 420
agggtgtacc ccgaactccc taagccgttc atcacctcga acaacagcaa cccggtcgag 480
gatgaagatg cggtggcctt gacgtgcgaa cctgagatcc agaacaccac ctacttgtgg 540
tgggtgaaca atcagagcct gccagtctcc ccacgactcc agctgtcgaa cgacaacagg 600
accctgactt tgctgtccgt gactcggaac gacgtgggcc cttatgaatg cggtatccag 660
aacaagctgt ccgtggacca cagcgaccct gtgatcctga acgtccttta cgggccggac 720
gaccccacca tttccccgtc gtacacttac taccggccgg gcgtgaacct gtccctgtcg 780
tgccacgctg cctccaatcc gccggcccag tactcctggc tcatcgacgg aaacatccag 840
cagcacaccc aagaactgtt catctccaac attaccgaga aaaactcggg actttacacc 900
tgtcaagcca acaattccgc cagcggccac tcccgcacca ctgtcaaaac tatcactgtg 960
tccgccgaac tcccgaagcc cagcatcagc tccaacaact cgaagcccgt ggaggataag 1020
gacgctgtcg cgttcacctg tgaaccagag gcacagaata ccacctacct ttggtgggtc 1080
aacggacagt ccctgcctgt ctcaccgaga ctgcagctgt caaacgggaa taggactctg 1140
accttgttta acgtcacccg gaacgacgcc cgggcctacg tgtgcggcat ccagaactcc 1200
gtgagcgcaa accggtctga cccagtgacc ctggatgtgc tgtacggccc cgacactccg 1260
atcatttcac cccccgattc atcctacctg tccggcgcta acctcaacct ctcatgccac 1320
tccgcatcca accccagccc gcaatattcg tggcgcatta acggaattcc tcagcaacat 1380
acccaggtcc tgttcattgc gaagatcacc cctaacaaca acggaaccta cgcctgcttt 1440
gtgtcaaacc tggccactgg tagaaacaac tccatcgtga agtccattac cgtgtcggcg 1500
tccggaactt ccccgggcct gagcgccggc gccaccgtgg gaattatgat cggcgtgctc 1560
gtgggagtgg ccctgatcag gaagagaaga ggatccggcg agggcagagg cagcctgctg 1620
acatgtggcg acgtggaaga gaaccctggc cccacccctg gaacccagag ccccttcttc 1680
cttctgctgc tgctgaccgt gctgactgtc gtgacaggct ctggccacgc cagctctaca 1740
cctggcggcg agaaagagac aagcgccacc cagagaagca gcgtgccaag cagcaccgag 1800
aagaacgccg tgtccatgac cagctccgtg ctgagcagcc actctcctgg cagcggcagc 1860
agcacaacac agggccagga tgtgacactg gcccctgcca cagaacctgc ctctggatct 1920
gccgccacct ggggacagga cgtgacaagc gtgccagtga ccagacctgc cctgggctct 1980
acaacacccc ctgcccacga tgtgaccagc gcccctgata acaagcctgc ccctggaagc 2040
acagcccctc cagctcatgg cgtgacctct gccccagata ccagaccagc cccaggatct 2100
acagccccac ccgcacacgg cgtgacaagt gcccctgaca caagacccgc tccaggctct 2160
actgctcctc ctgcccatgg cgtgacaagc gctcccgata caaggccagc tcctggctcc 2220
acagcaccac cagcacatgg cgtgacatca gctcccgaca ctagacctgc tcccggatca 2280
accgctccac cagctcacgg cgtgaccagc gcacctgata ccagacctgc tctgggaagc 2340
accgcccctc ccgtgcacaa tgtgacatct gcttccggca gcgccagcgg ctctgcctct 2400
acactggtgc acaacggcac cagcgccaga gccacaacaa ccccagccag caagagcacc 2460
cccttcagca tccctagcca ccacagcgac acccctacca cactggccag ccactccacc 2520
aagaccgatg cctctagcac ccaccactcc agcgtgcccc ctctgaccag cagcaaccac 2580
agcacaagcc cccagctgtc taccggcgtc tcattcttct ttctgtcctt ccacatcagc 2640
aacctgcagt tcaacagcag cctggaagat cccagcaccg actactacca ggaactgcag 2700
cgggatatca gcgagatgtt cctgcaaatc tacaagcagg gcggcttcct gggcctgagc 2760
aacatcaagt tcagacccgg cagcgtggtg gtgcagctga ccctggcttt ccgggaaggc 2820
accatcaacg tgcacgacgt ggaaacccag ttcaaccagt acaagaccga ggccgccagc 2880
cggtacaacc tgaccatctc cgatgtgtcc gtgtccgacg tgcccttccc attctctgcc 2940
cagtctggcg caggcgtgcc aggatgggga attgctctgc tggtgctcgt gtgcgtgctg 3000
gtggccctgg ccatcgtgta tctgattgcc ctggccgtgt gccagtgccg gcggaagaat 3060
tacggccagc tggacatctt ccccgccaga gacacctacc accccatgag cgagtacccc 3120
acataccaca cccacggcag atacgtgcca cccagctcca ccgacagatc cccctacgag 3180
aaagtgtctg ccggcaacgg cggcagctcc ctgagctaca caaatcctgc cgtggccgct 3240
gcctccgcca acctg 3255
<210> 35
<211> 1085
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 35
Met Ala Ser Glu Ser Pro Ser Ala Pro Pro His Arg Trp Cys Ile Pro
1 5 10 15
Trp Gln Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro
20 25 30
Pro Thr Thr Ala Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala
35 40 45
Glu Gly Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His Leu
50 55 60
Phe Gly Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gln
65 70 75 80
Ile Ile Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala
85 90 95
Tyr Ser Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln
100 105 110
Asn Ile Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile Lys
115 120 125
Ser Asp Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr Pro
130 135 140
Glu Leu Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro Val Glu
145 150 155 160
Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln Asn Thr
165 170 175
Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg
180 185 190
Leu Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr
195 200 205
Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Lys Leu Ser
210 215 220
Val Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro Asp
225 230 235 240
Asp Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn
245 250 255
Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser
260 265 270
Trp Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe Ile
275 280 285
Ser Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala Asn
290 295 300
Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile Thr Val
305 310 315 320
Ser Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro
325 330 335
Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gln
340 345 350
Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val Ser
355 360 365
Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn
370 375 380
Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln Asn Ser
385 390 395 400
Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly
405 410 415
Pro Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly
420 425 430
Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gln
435 440 445
Tyr Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr Gln Val Leu
450 455 460
Phe Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe
465 470 475 480
Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys Ser Ile
485 490 495
Thr Val Ser Ala Ser Gly Thr Ser Pro Gly Leu Ser Ala Gly Ala Thr
500 505 510
Val Gly Ile Met Ile Gly Val Leu Val Gly Val Ala Leu Ile Arg Lys
515 520 525
Arg Arg Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp
530 535 540
Val Glu Glu Asn Pro Gly Pro Thr Pro Gly Thr Gln Ser Pro Phe Phe
545 550 555 560
Leu Leu Leu Leu Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His
565 570 575
Ala Ser Ser Thr Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg
580 585 590
Ser Ser Val Pro Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser
595 600 605
Ser Val Leu Ser Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln
610 615 620
Gly Gln Asp Val Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser
625 630 635 640
Ala Ala Thr Trp Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro
645 650 655
Ala Leu Gly Ser Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro
660 665 670
Asp Asn Lys Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
675 680 685
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
690 695 700
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
705 710 715 720
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
725 730 735
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
740 745 750
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
755 760 765
Thr Ser Ala Pro Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro
770 775 780
Val His Asn Val Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser
785 790 795 800
Thr Leu Val His Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala
805 810 815
Ser Lys Ser Thr Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro
820 825 830
Thr Thr Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His
835 840 845
His Ser Ser Val Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro
850 855 860
Gln Leu Ser Thr Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser
865 870 875 880
Asn Leu Gln Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr
885 890 895
Gln Glu Leu Gln Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys
900 905 910
Gln Gly Gly Phe Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser
915 920 925
Val Val Val Gln Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val
930 935 940
His Asp Val Glu Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser
945 950 955 960
Arg Tyr Asn Leu Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe
965 970 975
Pro Phe Ser Ala Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala
980 985 990
Leu Leu Val Leu Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu
995 1000 1005
Ile Ala Leu Ala Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln
1010 1015 1020
Leu Asp Ile Phe Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu
1025 1030 1035
Tyr Pro Thr Tyr His Thr His Gly Arg Tyr Val Pro Pro Ser Ser
1040 1045 1050
Thr Asp Arg Ser Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly
1055 1060 1065
Ser Ser Leu Ser Tyr Thr Asn Pro Ala Val Ala Ala Ala Ser Ala
1070 1075 1080
Asn Leu
1085
<210> 36
<211> 3090
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 36
atggctagca cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 60
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 180
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 240
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 300
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 360
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 420
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 480
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 540
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 600
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 660
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 720
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 840
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 900
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 960
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 1020
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1080
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1140
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1200
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1260
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1320
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1380
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1440
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1500
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1560
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggatccggc 1620
agaatcttca acgcccacta cgccggctac ttcgccgacc tgctgatcca cgacatcgag 1680
acaaaccctg gccccaagct gaccattgag agcactccct tcaacgtggc tgaggggaag 1740
gaggtgctgc tcctggtgca caatctgccc cagcacctgt tcgggtactc ctggtacaag 1800
ggagaacgcg tggacgggaa ccggcagatc ataggctacg tcatcggaac ccagcaggcc 1860
acacccggtc cagcgtacag cggccgggag attatctacc cgaacgcctc cctgctgatc 1920
caaaacatca tccagaacga caccggtttc tacactctgc acgtgattaa gtcagatctg 1980
gtcaacgaag aggccaccgg ccaattcagg gtgtaccccg aactccctaa gccgttcatc 2040
acctcgaaca acagcaaccc ggtcgaggat gaagatgcgg tggccttgac gtgcgaacct 2100
gagatccaga acaccaccta cttgtggtgg gtgaacaatc agagcctgcc agtctcccca 2160
cgactccagc tgtcgaacga caacaggacc ctgactttgc tgtccgtgac tcggaacgac 2220
gtgggccctt atgaatgcgg tatccagaac aagctgtccg tggaccacag cgaccctgtg 2280
atcctgaacg tcctttacgg gccggacgac cccaccattt ccccgtcgta cacttactac 2340
cggccgggcg tgaacctgtc cctgtcgtgc cacgctgcct ccaatccgcc ggcccagtac 2400
tcctggctca tcgacggaaa catccagcag cacacccaag aactgttcat ctccaacatt 2460
accgagaaaa actcgggact ttacacctgt caagccaaca attccgccag cggccactcc 2520
cgcaccactg tcaaaactat cactgtgtcc gccgaactcc cgaagcccag catcagctcc 2580
aacaactcga agcccgtgga ggataaggac gctgtcgcgt tcacctgtga accagaggca 2640
cagaatacca cctacctttg gtgggtcaac ggacagtccc tgcctgtctc accgagactg 2700
cagctgtcaa acgggaatag gactctgacc ttgtttaacg tcacccggaa cgacgcccgg 2760
gcctacgtgt gcggcatcca gaactccgtg agcgcaaacc ggtctgaccc agtgaccctg 2820
gatgtgctgt acggccccga cactccgatc atttcacccc ccgattcatc ctacctgtcc 2880
ggcgctaacc tcaacctctc atgccactcc gcatccaacc ccagcccgca atattcgtgg 2940
cgcattaacg gaattcctca gcaacatacc caggtcctgt tcattgcgaa gatcacccct 3000
aacaacaacg gaacctacgc ctgctttgtg tcaaacctgg ccactggtag aaacaactcc 3060
atcgtgaagt ccattaccgt gtcggcgtcc 3090
<210> 37
<211> 1030
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 37
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1 5 10 15
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
20 25 30
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
35 40 45
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
50 55 60
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
65 70 75 80
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
85 90 95
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
100 105 110
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
210 215 220
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
225 230 235 240
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
245 250 255
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
260 265 270
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
275 280 285
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
290 295 300
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
305 310 315 320
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
325 330 335
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
340 345 350
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
355 360 365
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
370 375 380
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
385 390 395 400
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
405 410 415
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
420 425 430
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
435 440 445
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
450 455 460
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro
465 470 475 480
Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
485 490 495
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
500 505 510
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
515 520 525
Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Ser Gly Arg Ile Phe Asn
530 535 540
Ala His Tyr Ala Gly Tyr Phe Ala Asp Leu Leu Ile His Asp Ile Glu
545 550 555 560
Thr Asn Pro Gly Pro Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val
565 570 575
Ala Glu Gly Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His
580 585 590
Leu Phe Gly Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg
595 600 605
Gln Ile Ile Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro
610 615 620
Ala Tyr Ser Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile
625 630 635 640
Gln Asn Ile Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile
645 650 655
Lys Ser Asp Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr
660 665 670
Pro Glu Leu Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro Val
675 680 685
Glu Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln Asn
690 695 700
Thr Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro
705 710 715 720
Arg Leu Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val
725 730 735
Thr Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Lys Leu
740 745 750
Ser Val Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro
755 760 765
Asp Asp Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val
770 775 780
Asn Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr
785 790 795 800
Ser Trp Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe
805 810 815
Ile Ser Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala
820 825 830
Asn Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile Thr
835 840 845
Val Ser Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys
850 855 860
Pro Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala
865 870 875 880
Gln Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val
885 890 895
Ser Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe
900 905 910
Asn Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln Asn
915 920 925
Ser Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr
930 935 940
Gly Pro Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu Ser
945 950 955 960
Gly Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro
965 970 975
Gln Tyr Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr Gln Val
980 985 990
Leu Phe Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys
995 1000 1005
Phe Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys
1010 1015 1020
Ser Ile Thr Val Ser Ala Ser
1025 1030
<210> 38
<211> 4143
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 38
atggctagca agctgaccat tgagagcact cccttcaacg tggctgaggg gaaggaggtg 60
ctgctcctgg tgcacaatct gccccagcac ctgttcgggt actcctggta caagggagaa 120
cgcgtggacg ggaaccggca gatcataggc tacgtcatcg gaacccagca ggccacaccc 180
ggtccagcgt acagcggccg ggagattatc tacccgaacg cctccctgct gatccaaaac 240
atcatccaga acgacaccgg tttctacact ctgcacgtga ttaagtcaga tctggtcaac 300
gaagaggcca ccggccaatt cagggtgtac cccgaactcc ctaagccgtt catcacctcg 360
aacaacagca acccggtcga ggatgaagat gcggtggcct tgacgtgcga acctgagatc 420
cagaacacca cctacttgtg gtgggtgaac aatcagagcc tgccagtctc cccacgactc 480
cagctgtcga acgacaacag gaccctgact ttgctgtccg tgactcggaa cgacgtgggc 540
ccttatgaat gcggtatcca gaacaagctg tccgtggacc acagcgaccc tgtgatcctg 600
aacgtccttt acgggccgga cgaccccacc atttccccgt cgtacactta ctaccggccg 660
ggcgtgaacc tgtccctgtc gtgccacgct gcctccaatc cgccggccca gtactcctgg 720
ctcatcgacg gaaacatcca gcagcacacc caagaactgt tcatctccaa cattaccgag 780
aaaaactcgg gactttacac ctgtcaagcc aacaattccg ccagcggcca ctcccgcacc 840
actgtcaaaa ctatcactgt gtccgccgaa ctcccgaagc ccagcatcag ctccaacaac 900
tcgaagcccg tggaggataa ggacgctgtc gcgttcacct gtgaaccaga ggcacagaat 960
accacctacc tttggtgggt caacggacag tccctgcctg tctcaccgag actgcagctg 1020
tcaaacggga ataggactct gaccttgttt aacgtcaccc ggaacgacgc ccgggcctac 1080
gtgtgcggca tccagaactc cgtgagcgca aaccggtctg acccagtgac cctggatgtg 1140
ctgtacggcc ccgacactcc gatcatttca ccccccgatt catcctacct gtccggcgct 1200
aacctcaacc tctcatgcca ctccgcatcc aaccccagcc cgcaatattc gtggcgcatt 1260
aacggaattc ctcagcaaca tacccaggtc ctgttcattg cgaagatcac ccctaacaac 1320
aacggaacct acgcctgctt tgtgtcaaac ctggccactg gtagaaacaa ctccatcgtg 1380
aagtccatta ccgtgtcggc gtccggatcc ggcgagggca gaggcagcct gctgacatgt 1440
ggcgacgtgg aagagaaccc tggccccgga gctgccccgg agccggagag gacccccgtt 1500
ggccagggat cgtgggccca tccgggacgc accaggggac catccgacag gggattctgt 1560
gtggtgtcac cggccaggcc agcagaagag gcaaccagcc tcgagggagc gttgtctgga 1620
accagacatt cccacccgtc ggtgggccgg cagcaccacg cgggaccacc gtccacttcc 1680
agaccgccac ggccatggga caccccttgc ccgcctgtgt atgccgagac taaacacttc 1740
ctgtactcat ccggagacaa ggaacagctt cggccgtcct tcctcctgtc gtcgctcaga 1800
ccgagcctga ccggagcacg cagattggtg gaaactatct tccttgggtc acgtccgtgg 1860
atgccaggta ccccacggcg cctcccgcgc ctcccacaga gatactggca gatgcggcct 1920
ctgttcctgg aattgctggg aaaccacgct cagtgcccgt acggagtcct gctcaagact 1980
cactgccctc tgagggcggc ggtcactccg gcggccggag tgtgcgcacg ggagaagccc 2040
cagggaagcg tggcagctcc ggaagaggag gacaccgatc cgcgccgcct cgtgcaactt 2100
ctgcgccagc actcctcgcc ctggcaagtc tacgggttcg tccgcgcctg cctgcgccgc 2160
ctggtgccgc ctgggctctg gggttcccgg cataacgagc gccgcttcct gagaaatact 2220
aagaagttta tctcacttgg aaaacatgcc aagttgtcgc tgcaagaact cacgtggaag 2280
atgtcagtcc gcgattgcgc ctggctgcgc cgctcgccgg gcgtcgggtg tgttccagct 2340
gcagaacacc gcctgagaga agaaattctg gccaaatttc tgcattggct gatgtcagtg 2400
tacgtggtcg agctgctgcg ctcctttttc tacgtcactg agactacctt tcaaaagaac 2460
cgcctgttct tctaccgcaa atctgtgtgg agcaagctgc agtcaatcgg cattcgccag 2520
catctgaaga gggtgcagct gcgggaactt tccgaggcag aagtccgcca gcaccgggag 2580
gcccggccgg cgcttctcac gtcgcgtctg agattcatcc caaagcccga cgggctgagg 2640
cctatcgtca acatggatta cgtcgtgggc gctcgcacct ttcgccgtga aaagcgggcc 2700
gaacgcttga cctcacgggt gaaggccctc ttctccgtgc tgaactacga gagagcaaga 2760
cggcctggcc tgctgggagc ttcggtgctg ggactggacg atatccaccg ggcttggcgg 2820
acctttgttc tccgggtgag agcccaagac cctccgccgg aactgtactt cgtgaaggtg 2880
gcgatcaccg gagcctatga tactattccg caagatcgac tcaccgaagt catcgcctcg 2940
atcatcaaac cgcagaacac ttactgcgtc aggcggtacg ccgtggtcca gaaggccgcg 3000
catggccacg tgagaaaggc gttcaagtcg cacgtgtcca ctctcaccga cctccagcct 3060
tacatgaggc aattcgttgc gcatttgcaa gagacttcgc ccctgagaga tgcggtggtc 3120
atcgagcaga gctccagcct gaacgaagcg agcagcggtc tgtttgacgt gttcctccgc 3180
ttcatgtgtc atcacgcggt gcgaatcagg ggaaaatcat acgtgcagtg ccagggaatc 3240
ccacaaggca gcattctgtc gactctcttg tgttcccttt gctacggcga tatggaaaac 3300
aagctgttcg ctgggatcag acgggacggg ttgctgctca gactggtgga cgacttcctg 3360
ctggtgactc cgcacctcac tcacgccaaa acctttctcc gcactctggt gaggggagtg 3420
ccagaatacg gctgtgtggt caatctccgg aaaactgtgg tgaatttccc tgtcgaggat 3480
gaggcactcg gaggaaccgc atttgtccaa atgccagcac atggcctgtt cccatggtgc 3540
ggtctgctgc tggacacccg aactcttgaa gtgcagtccg actactccag ctatgcccgg 3600
acgagcatcc gcgccagcct cactttcaat cgcggcttta aggccggacg aaacatgcgc 3660
agaaagcttt tcggagtcct ccggcttaaa tgccattcgc tctttctcga tctccaagtc 3720
aattcgctgc agaccgtgtg cacgaacatc tacaagatcc tgctgctcca agcctaccgg 3780
ttccacgctt gcgtgcttca gctgccgttt caccaacagg tgtggaagaa cccgaccttc 3840
tttctgcggg tcattagcga tactgcctcc ctgtgttact caatcctcaa ggcaaagaac 3900
gccggaatgt cgctgggtgc gaaaggagcc gcgggacctc ttcctagcga agcggtgcag 3960
tggctctgcc accaggcttt cctcctgaag ctgaccaggc acagagtgac ctacgtcccg 4020
ctgctgggct cgctgcgcac tgcacagacc cagctgtcta gaaaactccc cggcaccacc 4080
ctgaccgctc tggaagccgc cgccaaccca gcattgccgt cagatttcaa gaccatcttg 4140
gac 4143
<210> 39
<211> 1381
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 39
Met Ala Ser Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala Glu
1 5 10 15
Gly Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His Leu Phe
20 25 30
Gly Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gln Ile
35 40 45
Ile Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala Tyr
50 55 60
Ser Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln Asn
65 70 75 80
Ile Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile Lys Ser
85 90 95
Asp Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr Pro Glu
100 105 110
Leu Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro Val Glu Asp
115 120 125
Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln Asn Thr Thr
130 135 140
Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg Leu
145 150 155 160
Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr Arg
165 170 175
Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Lys Leu Ser Val
180 185 190
Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro Asp Asp
195 200 205
Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn Leu
210 215 220
Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser Trp
225 230 235 240
Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe Ile Ser
245 250 255
Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala Asn Asn
260 265 270
Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile Thr Val Ser
275 280 285
Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro Val
290 295 300
Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gln Asn
305 310 315 320
Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val Ser Pro
325 330 335
Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn Val
340 345 350
Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln Asn Ser Val
355 360 365
Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly Pro
370 375 380
Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly Ala
385 390 395 400
Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gln Tyr
405 410 415
Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr Gln Val Leu Phe
420 425 430
Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe Val
435 440 445
Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys Ser Ile Thr
450 455 460
Val Ser Ala Ser Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys
465 470 475 480
Gly Asp Val Glu Glu Asn Pro Gly Pro Gly Ala Ala Pro Glu Pro Glu
485 490 495
Arg Thr Pro Val Gly Gln Gly Ser Trp Ala His Pro Gly Arg Thr Arg
500 505 510
Gly Pro Ser Asp Arg Gly Phe Cys Val Val Ser Pro Ala Arg Pro Ala
515 520 525
Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu Ser Gly Thr Arg His Ser
530 535 540
His Pro Ser Val Gly Arg Gln His His Ala Gly Pro Pro Ser Thr Ser
545 550 555 560
Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu
565 570 575
Thr Lys His Phe Leu Tyr Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro
580 585 590
Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg
595 600 605
Leu Val Glu Thr Ile Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr
610 615 620
Pro Arg Arg Leu Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro
625 630 635 640
Leu Phe Leu Glu Leu Leu Gly Asn His Ala Gln Cys Pro Tyr Gly Val
645 650 655
Leu Leu Lys Thr His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala Ala
660 665 670
Gly Val Cys Ala Arg Glu Lys Pro Gln Gly Ser Val Ala Ala Pro Glu
675 680 685
Glu Glu Asp Thr Asp Pro Arg Arg Leu Val Gln Leu Leu Arg Gln His
690 695 700
Ser Ser Pro Trp Gln Val Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg
705 710 715 720
Leu Val Pro Pro Gly Leu Trp Gly Ser Arg His Asn Glu Arg Arg Phe
725 730 735
Leu Arg Asn Thr Lys Lys Phe Ile Ser Leu Gly Lys His Ala Lys Leu
740 745 750
Ser Leu Gln Glu Leu Thr Trp Lys Met Ser Val Arg Asp Cys Ala Trp
755 760 765
Leu Arg Arg Ser Pro Gly Val Gly Cys Val Pro Ala Ala Glu His Arg
770 775 780
Leu Arg Glu Glu Ile Leu Ala Lys Phe Leu His Trp Leu Met Ser Val
785 790 795 800
Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr
805 810 815
Phe Gln Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys
820 825 830
Leu Gln Ser Ile Gly Ile Arg Gln His Leu Lys Arg Val Gln Leu Arg
835 840 845
Glu Leu Ser Glu Ala Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala
850 855 860
Leu Leu Thr Ser Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg
865 870 875 880
Pro Ile Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg
885 890 895
Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser
900 905 910
Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser
915 920 925
Val Leu Gly Leu Asp Asp Ile His Arg Ala Trp Arg Thr Phe Val Leu
930 935 940
Arg Val Arg Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val Lys Val
945 950 955 960
Ala Ile Thr Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu
965 970 975
Val Ile Ala Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg
980 985 990
Tyr Ala Val Val Gln Lys Ala Ala His Gly His Val Arg Lys Ala Phe
995 1000 1005
Lys Ser His Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met Arg
1010 1015 1020
Gln Phe Val Ala His Leu Gln Glu Thr Ser Pro Leu Arg Asp Ala
1025 1030 1035
Val Val Ile Glu Gln Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly
1040 1045 1050
Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala Val Arg
1055 1060 1065
Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln Gly Ile Pro Gln Gly
1070 1075 1080
Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met
1085 1090 1095
Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu Leu
1100 1105 1110
Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His
1115 1120 1125
Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr
1130 1135 1140
Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val
1145 1150 1155
Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala
1160 1165 1170
His Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr
1175 1180 1185
Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile
1190 1195 1200
Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn
1205 1210 1215
Met Arg Arg Lys Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser
1220 1225 1230
Leu Phe Leu Asp Leu Gln Val Asn Ser Leu Gln Thr Val Cys Thr
1235 1240 1245
Asn Ile Tyr Lys Ile Leu Leu Leu Gln Ala Tyr Arg Phe His Ala
1250 1255 1260
Cys Val Leu Gln Leu Pro Phe His Gln Gln Val Trp Lys Asn Pro
1265 1270 1275
Thr Phe Phe Leu Arg Val Ile Ser Asp Thr Ala Ser Leu Cys Tyr
1280 1285 1290
Ser Ile Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly Ala Lys
1295 1300 1305
Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gln Trp Leu Cys
1310 1315 1320
His Gln Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr
1325 1330 1335
Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr Gln Leu Ser
1340 1345 1350
Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala
1355 1360 1365
Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp
1370 1375 1380
<210> 40
<211> 4323
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 40
atggctagcg gagctgcccc ggagccggag aggacccccg ttggccaggg atcgtgggcc 60
catccgggac gcaccagggg accatccgac aggggattct gtgtggtgtc accggccagg 120
ccagcagaag aggcaaccag cctcgaggga gcgttgtctg gaaccagaca ttcccacccg 180
tcggtgggcc ggcagcacca cgcgggacca ccgtccactt ccagaccgcc acggccatgg 240
gacacccctt gcccgcctgt gtatgccgag actaaacact tcctgtactc atccggagac 300
aaggaacagc ttcggccgtc cttcctcctg tcgtcgctca gaccgagcct gaccggagca 360
cgcagattgg tggaaactat cttccttggg tcacgtccgt ggatgccagg taccccacgg 420
cgcctcccgc gcctcccaca gagatactgg cagatgcggc ctctgttcct ggaattgctg 480
ggaaaccacg ctcagtgccc gtacggagtc ctgctcaaga ctcactgccc tctgagggcg 540
gcggtcactc cggcggccgg agtgtgcgca cgggagaagc cccagggaag cgtggcagct 600
ccggaagagg aggacaccga tccgcgccgc ctcgtgcaac ttctgcgcca gcactcctcg 660
ccctggcaag tctacgggtt cgtccgcgcc tgcctgcgcc gcctggtgcc gcctgggctc 720
tggggttccc ggcataacga gcgccgcttc ctgagaaata ctaagaagtt tatctcactt 780
ggaaaacatg ccaagttgtc gctgcaagaa ctcacgtgga agatgtcagt ccgcgattgc 840
gcctggctgc gccgctcgcc gggcgtcggg tgtgttccag ctgcagaaca ccgcctgaga 900
gaagaaattc tggccaaatt tctgcattgg ctgatgtcag tgtacgtggt cgagctgctg 960
cgctcctttt tctacgtcac tgagactacc tttcaaaaga accgcctgtt cttctaccgc 1020
aaatctgtgt ggagcaagct gcagtcaatc ggcattcgcc agcatctgaa gagggtgcag 1080
ctgcgggaac tttccgaggc agaagtccgc cagcaccggg aggcccggcc ggcgcttctc 1140
acgtcgcgtc tgagattcat cccaaagccc gacgggctga ggcctatcgt caacatggat 1200
tacgtcgtgg gcgctcgcac ctttcgccgt gaaaagcggg ccgaacgctt gacctcacgg 1260
gtgaaggccc tcttctccgt gctgaactac gagagagcaa gacggcctgg cctgctggga 1320
gcttcggtgc tgggactgga cgatatccac cgggcttggc ggacctttgt tctccgggtg 1380
agagcccaag accctccgcc ggaactgtac ttcgtgaagg tggcgatcac cggagcctat 1440
gatactattc cgcaagatcg actcaccgaa gtcatcgcct cgatcatcaa accgcagaac 1500
acttactgcg tcaggcggta cgccgtggtc cagaaggccg cgcatggcca cgtgagaaag 1560
gcgttcaagt cgcacgtgtc cactctcacc gacctccagc cttacatgag gcaattcgtt 1620
gcgcatttgc aagagacttc gcccctgaga gatgcggtgg tcatcgagca gagctccagc 1680
ctgaacgaag cgagcagcgg tctgtttgac gtgttcctcc gcttcatgtg tcatcacgcg 1740
gtgcgaatca ggggaaaatc atacgtgcag tgccagggaa tcccacaagg cagcattctg 1800
tcgactctct tgtgttccct ttgctacggc gatatggaaa acaagctgtt cgctgggatc 1860
agacgggacg ggttgctgct cagactggtg gacgacttcc tgctggtgac tccgcacctc 1920
actcacgcca aaacctttct ccgcactctg gtgaggggag tgccagaata cggctgtgtg 1980
gtcaatctcc ggaaaactgt ggtgaatttc cctgtcgagg atgaggcact cggaggaacc 2040
gcatttgtcc aaatgccagc acatggcctg ttcccatggt gcggtctgct gctggacacc 2100
cgaactcttg aagtgcagtc cgactactcc agctatgccc ggacgagcat ccgcgccagc 2160
ctcactttca atcgcggctt taaggccgga cgaaacatgc gcagaaagct tttcggagtc 2220
ctccggctta aatgccattc gctctttctc gatctccaag tcaattcgct gcagaccgtg 2280
tgcacgaaca tctacaagat cctgctgctc caagcctacc ggttccacgc ttgcgtgctt 2340
cagctgccgt ttcaccaaca ggtgtggaag aacccgacct tctttctgcg ggtcattagc 2400
gatactgcct ccctgtgtta ctcaatcctc aaggcaaaga acgccggaat gtcgctgggt 2460
gcgaaaggag ccgcgggacc tcttcctagc gaagcggtgc agtggctctg ccaccaggct 2520
ttcctcctga agctgaccag gcacagagtg acctacgtcc cgctgctggg ctcgctgcgc 2580
actgcacaga cccagctgtc tagaaaactc cccggcacca ccctgaccgc tctggaagcc 2640
gccgccaacc cagcattgcc gtcagatttc aagaccatct tggacggatc cggccagtgc 2700
accaattacg ccctgctgaa gctggccggc gacgtggaat ctaaccctgg ccctgaatcg 2760
ccaagcgcac cccctcatcg gtggtgcatc ccttggcaac gcctcctcct gaccgcctca 2820
ctgctgactt tctggaaccc gccgaccacc gcaaagctga ccattgagag cactcccttc 2880
aacgtggctg aggggaagga ggtgctgctc ctggtgcaca atctgcccca gcacctgttc 2940
gggtactcct ggtacaaggg agaacgcgtg gacgggaacc ggcagatcat aggctacgtc 3000
atcggaaccc agcaggccac acccggtcca gcgtacagcg gccgggagat tatctacccg 3060
aacgcctccc tgctgatcca aaacatcatc cagaacgaca ccggtttcta cactctgcac 3120
gtgattaagt cagatctggt caacgaagag gccaccggcc aattcagggt gtaccccgaa 3180
ctccctaagc cgttcatcac ctcgaacaac agcaacccgg tcgaggatga agatgcggtg 3240
gccttgacgt gcgaacctga gatccagaac accacctact tgtggtgggt gaacaatcag 3300
agcctgccag tctccccacg actccagctg tcgaacgaca acaggaccct gactttgctg 3360
tccgtgactc ggaacgacgt gggcccttat gaatgcggta tccagaacaa gctgtccgtg 3420
gaccacagcg accctgtgat cctgaacgtc ctttacgggc cggacgaccc caccatttcc 3480
ccgtcgtaca cttactaccg gccgggcgtg aacctgtccc tgtcgtgcca cgctgcctcc 3540
aatccgccgg cccagtactc ctggctcatc gacggaaaca tccagcagca cacccaagaa 3600
ctgttcatct ccaacattac cgagaaaaac tcgggacttt acacctgtca agccaacaat 3660
tccgccagcg gccactcccg caccactgtc aaaactatca ctgtgtccgc cgaactcccg 3720
aagcccagca tcagctccaa caactcgaag cccgtggagg ataaggacgc tgtcgcgttc 3780
acctgtgaac cagaggcaca gaataccacc tacctttggt gggtcaacgg acagtccctg 3840
cctgtctcac cgagactgca gctgtcaaac gggaatagga ctctgacctt gtttaacgtc 3900
acccggaacg acgcccgggc ctacgtgtgc ggcatccaga actccgtgag cgcaaaccgg 3960
tctgacccag tgaccctgga tgtgctgtac ggccccgaca ctccgatcat ttcacccccc 4020
gattcatcct acctgtccgg cgctaacctc aacctctcat gccactccgc atccaacccc 4080
agcccgcaat attcgtggcg cattaacgga attcctcagc aacataccca ggtcctgttc 4140
attgcgaaga tcacccctaa caacaacgga acctacgcct gctttgtgtc aaacctggcc 4200
actggtagaa acaactccat cgtgaagtcc attaccgtgt cggcgtccgg aacttccccg 4260
ggcctgagcg ccggcgccac cgtgggaatt atgatcggcg tgctcgtggg agtggccctg 4320
atc 4323
<210> 41
<211> 1441
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 41
Met Ala Ser Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln
1 5 10 15
Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly
20 25 30
Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu
35 40 45
Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg
50 55 60
Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp
65 70 75 80
Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr
85 90 95
Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser
100 105 110
Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe
115 120 125
Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg
130 135 140
Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu
145 150 155 160
Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys
165 170 175
Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu
180 185 190
Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro
195 200 205
Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val
210 215 220
Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu
225 230 235 240
Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys
245 250 255
Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr
260 265 270
Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly
275 280 285
Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu
290 295 300
Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu
305 310 315 320
Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu
325 330 335
Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile
340 345 350
Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu
355 360 365
Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu
370 375 380
Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp
385 390 395 400
Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg
405 410 415
Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg
420 425 430
Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp
435 440 445
Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp
450 455 460
Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr
465 470 475 480
Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile
485 490 495
Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys
500 505 510
Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr
515 520 525
Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln
530 535 540
Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser
545 550 555 560
Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met
565 570 575
Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln
580 585 590
Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys
595 600 605
Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly
610 615 620
Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu
625 630 635 640
Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu
645 650 655
Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val
660 665 670
Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His
675 680 685
Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu
690 695 700
Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser
705 710 715 720
Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys
725 730 735
Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu
740 745 750
Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu
755 760 765
Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe
770 775 780
His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser
785 790 795 800
Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly
805 810 815
Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala
820 825 830
Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His
835 840 845
Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr
850 855 860
Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala
865 870 875 880
Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp Gly
885 890 895
Ser Gly Gln Cys Thr Asn Tyr Ala Leu Leu Lys Leu Ala Gly Asp Val
900 905 910
Glu Ser Asn Pro Gly Pro Glu Ser Pro Ser Ala Pro Pro His Arg Trp
915 920 925
Cys Ile Pro Trp Gln Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe
930 935 940
Trp Asn Pro Pro Thr Thr Ala Lys Leu Thr Ile Glu Ser Thr Pro Phe
945 950 955 960
Asn Val Ala Glu Gly Lys Glu Val Leu Leu Leu Val His Asn Leu Pro
965 970 975
Gln His Leu Phe Gly Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly
980 985 990
Asn Arg Gln Ile Ile Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro
995 1000 1005
Gly Pro Ala Tyr Ser Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser
1010 1015 1020
Leu Leu Ile Gln Asn Ile Ile Gln Asn Asp Thr Gly Phe Tyr Thr
1025 1030 1035
Leu His Val Ile Lys Ser Asp Leu Val Asn Glu Glu Ala Thr Gly
1040 1045 1050
Gln Phe Arg Val Tyr Pro Glu Leu Pro Lys Pro Phe Ile Thr Ser
1055 1060 1065
Asn Asn Ser Asn Pro Val Glu Asp Glu Asp Ala Val Ala Leu Thr
1070 1075 1080
Cys Glu Pro Glu Ile Gln Asn Thr Thr Tyr Leu Trp Trp Val Asn
1085 1090 1095
Asn Gln Ser Leu Pro Val Ser Pro Arg Leu Gln Leu Ser Asn Asp
1100 1105 1110
Asn Arg Thr Leu Thr Leu Leu Ser Val Thr Arg Asn Asp Val Gly
1115 1120 1125
Pro Tyr Glu Cys Gly Ile Gln Asn Lys Leu Ser Val Asp His Ser
1130 1135 1140
Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro Asp Asp Pro Thr
1145 1150 1155
Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn Leu Ser
1160 1165 1170
Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser Trp
1175 1180 1185
Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe Ile
1190 1195 1200
Ser Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala
1205 1210 1215
Asn Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile
1220 1225 1230
Thr Val Ser Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn
1235 1240 1245
Ser Lys Pro Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu
1250 1255 1260
Pro Glu Ala Gln Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln
1265 1270 1275
Ser Leu Pro Val Ser Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg
1280 1285 1290
Thr Leu Thr Leu Phe Asn Val Thr Arg Asn Asp Ala Arg Ala Tyr
1295 1300 1305
Val Cys Gly Ile Gln Asn Ser Val Ser Ala Asn Arg Ser Asp Pro
1310 1315 1320
Val Thr Leu Asp Val Leu Tyr Gly Pro Asp Thr Pro Ile Ile Ser
1325 1330 1335
Pro Pro Asp Ser Ser Tyr Leu Ser Gly Ala Asn Leu Asn Leu Ser
1340 1345 1350
Cys His Ser Ala Ser Asn Pro Ser Pro Gln Tyr Ser Trp Arg Ile
1355 1360 1365
Asn Gly Ile Pro Gln Gln His Thr Gln Val Leu Phe Ile Ala Lys
1370 1375 1380
Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe Val Ser Asn
1385 1390 1395
Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys Ser Ile Thr Val
1400 1405 1410
Ser Ala Ser Gly Thr Ser Pro Gly Leu Ser Ala Gly Ala Thr Val
1415 1420 1425
Gly Ile Met Ile Gly Val Leu Val Gly Val Ala Leu Ile
1430 1435 1440
<210> 42
<211> 6009
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 42
atggctagca cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 60
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 180
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 240
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 300
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 360
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 420
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 480
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 540
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 600
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 660
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 720
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 840
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 900
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 960
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 1020
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1080
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1140
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1200
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1260
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1320
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1380
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1440
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1500
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1560
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggatccggc 1620
acaatcctgt ctgagggcgc caccaacttc agcctgctga aactggccgg cgacgtggaa 1680
ctgaaccctg gccctggagc tgccccggag ccggagagga cccccgttgg ccagggatcg 1740
tgggcccatc cgggacgcac caggggacca tccgacaggg gattctgtgt ggtgtcaccg 1800
gccaggccag cagaagaggc aaccagcctc gagggagcgt tgtctggaac cagacattcc 1860
cacccgtcgg tgggccggca gcaccacgcg ggaccaccgt ccacttccag accgccacgg 1920
ccatgggaca ccccttgccc gcctgtgtat gccgagacta aacacttcct gtactcatcc 1980
ggagacaagg aacagcttcg gccgtccttc ctcctgtcgt cgctcagacc gagcctgacc 2040
ggagcacgca gattggtgga aactatcttc cttgggtcac gtccgtggat gccaggtacc 2100
ccacggcgcc tcccgcgcct cccacagaga tactggcaga tgcggcctct gttcctggaa 2160
ttgctgggaa accacgctca gtgcccgtac ggagtcctgc tcaagactca ctgccctctg 2220
agggcggcgg tcactccggc ggccggagtg tgcgcacggg agaagcccca gggaagcgtg 2280
gcagctccgg aagaggagga caccgatccg cgccgcctcg tgcaacttct gcgccagcac 2340
tcctcgccct ggcaagtcta cgggttcgtc cgcgcctgcc tgcgccgcct ggtgccgcct 2400
gggctctggg gttcccggca taacgagcgc cgcttcctga gaaatactaa gaagtttatc 2460
tcacttggaa aacatgccaa gttgtcgctg caagaactca cgtggaagat gtcagtccgc 2520
gattgcgcct ggctgcgccg ctcgccgggc gtcgggtgtg ttccagctgc agaacaccgc 2580
ctgagagaag aaattctggc caaatttctg cattggctga tgtcagtgta cgtggtcgag 2640
ctgctgcgct cctttttcta cgtcactgag actacctttc aaaagaaccg cctgttcttc 2700
taccgcaaat ctgtgtggag caagctgcag tcaatcggca ttcgccagca tctgaagagg 2760
gtgcagctgc gggaactttc cgaggcagaa gtccgccagc accgggaggc ccggccggcg 2820
cttctcacgt cgcgtctgag attcatccca aagcccgacg ggctgaggcc tatcgtcaac 2880
atggattacg tcgtgggcgc tcgcaccttt cgccgtgaaa agcgggccga acgcttgacc 2940
tcacgggtga aggccctctt ctccgtgctg aactacgaga gagcaagacg gcctggcctg 3000
ctgggagctt cggtgctggg actggacgat atccaccggg cttggcggac ctttgttctc 3060
cgggtgagag cccaagaccc tccgccggaa ctgtacttcg tgaaggtggc gatcaccgga 3120
gcctatgata ctattccgca agatcgactc accgaagtca tcgcctcgat catcaaaccg 3180
cagaacactt actgcgtcag gcggtacgcc gtggtccaga aggccgcgca tggccacgtg 3240
agaaaggcgt tcaagtcgca cgtgtccact ctcaccgacc tccagcctta catgaggcaa 3300
ttcgttgcgc atttgcaaga gacttcgccc ctgagagatg cggtggtcat cgagcagagc 3360
tccagcctga acgaagcgag cagcggtctg tttgacgtgt tcctccgctt catgtgtcat 3420
cacgcggtgc gaatcagggg aaaatcatac gtgcagtgcc agggaatccc acaaggcagc 3480
attctgtcga ctctcttgtg ttccctttgc tacggcgata tggaaaacaa gctgttcgct 3540
gggatcagac gggacgggtt gctgctcaga ctggtggacg acttcctgct ggtgactccg 3600
cacctcactc acgccaaaac ctttctccgc actctggtga ggggagtgcc agaatacggc 3660
tgtgtggtca atctccggaa aactgtggtg aatttccctg tcgaggatga ggcactcgga 3720
ggaaccgcat ttgtccaaat gccagcacat ggcctgttcc catggtgcgg tctgctgctg 3780
gacacccgaa ctcttgaagt gcagtccgac tactccagct atgcccggac gagcatccgc 3840
gccagcctca ctttcaatcg cggctttaag gccggacgaa acatgcgcag aaagcttttc 3900
ggagtcctcc ggcttaaatg ccattcgctc tttctcgatc tccaagtcaa ttcgctgcag 3960
accgtgtgca cgaacatcta caagatcctg ctgctccaag cctaccggtt ccacgcttgc 4020
gtgcttcagc tgccgtttca ccaacaggtg tggaagaacc cgaccttctt tctgcgggtc 4080
attagcgata ctgcctccct gtgttactca atcctcaagg caaagaacgc cggaatgtcg 4140
ctgggtgcga aaggagccgc gggacctctt cctagcgaag cggtgcagtg gctctgccac 4200
caggctttcc tcctgaagct gaccaggcac agagtgacct acgtcccgct gctgggctcg 4260
ctgcgcactg cacagaccca gctgtctaga aaactccccg gcaccaccct gaccgctctg 4320
gaagccgccg ccaacccagc attgccgtca gatttcaaga ccatcttgga cggatccggc 4380
cagtgcacca attacgccct gctgaagctg gccggcgacg tggaatctaa ccctggccct 4440
gaatcgccaa gcgcaccccc tcatcggtgg tgcatccctt ggcaacgcct cctcctgacc 4500
gcctcactgc tgactttctg gaacccgccg accaccgcaa agctgaccat tgagagcact 4560
cccttcaacg tggctgaggg gaaggaggtg ctgctcctgg tgcacaatct gccccagcac 4620
ctgttcgggt actcctggta caagggagaa cgcgtggacg ggaaccggca gatcataggc 4680
tacgtcatcg gaacccagca ggccacaccc ggtccagcgt acagcggccg ggagattatc 4740
tacccgaacg cctccctgct gatccaaaac atcatccaga acgacaccgg tttctacact 4800
ctgcacgtga ttaagtcaga tctggtcaac gaagaggcca ccggccaatt cagggtgtac 4860
cccgaactcc ctaagccgtt catcacctcg aacaacagca acccggtcga ggatgaagat 4920
gcggtggcct tgacgtgcga acctgagatc cagaacacca cctacttgtg gtgggtgaac 4980
aatcagagcc tgccagtctc cccacgactc cagctgtcga acgacaacag gaccctgact 5040
ttgctgtccg tgactcggaa cgacgtgggc ccttatgaat gcggtatcca gaacaagctg 5100
tccgtggacc acagcgaccc tgtgatcctg aacgtccttt acgggccgga cgaccccacc 5160
atttccccgt cgtacactta ctaccggccg ggcgtgaacc tgtccctgtc gtgccacgct 5220
gcctccaatc cgccggccca gtactcctgg ctcatcgacg gaaacatcca gcagcacacc 5280
caagaactgt tcatctccaa cattaccgag aaaaactcgg gactttacac ctgtcaagcc 5340
aacaattccg ccagcggcca ctcccgcacc actgtcaaaa ctatcactgt gtccgccgaa 5400
ctcccgaagc ccagcatcag ctccaacaac tcgaagcccg tggaggataa ggacgctgtc 5460
gcgttcacct gtgaaccaga ggcacagaat accacctacc tttggtgggt caacggacag 5520
tccctgcctg tctcaccgag actgcagctg tcaaacggga ataggactct gaccttgttt 5580
aacgtcaccc ggaacgacgc ccgggcctac gtgtgcggca tccagaactc cgtgagcgca 5640
aaccggtctg acccagtgac cctggatgtg ctgtacggcc ccgacactcc gatcatttca 5700
ccccccgatt catcctacct gtccggcgct aacctcaacc tctcatgcca ctccgcatcc 5760
aaccccagcc cgcaatattc gtggcgcatt aacggaattc ctcagcaaca tacccaggtc 5820
ctgttcattg cgaagatcac ccctaacaac aacggaacct acgcctgctt tgtgtcaaac 5880
ctggccactg gtagaaacaa ctccatcgtg aagtccatta ccgtgtcggc gtccggaact 5940
tccccgggcc tgagcgccgg cgccaccgtg ggaattatga tcggcgtgct cgtgggagtg 6000
gccctgatc 6009
<210> 43
<211> 2003
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 43
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1 5 10 15
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
20 25 30
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
35 40 45
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
50 55 60
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
65 70 75 80
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
85 90 95
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
100 105 110
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
210 215 220
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
225 230 235 240
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
245 250 255
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
260 265 270
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
275 280 285
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
290 295 300
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
305 310 315 320
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
325 330 335
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
340 345 350
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
355 360 365
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
370 375 380
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
385 390 395 400
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
405 410 415
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
420 425 430
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
435 440 445
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
450 455 460
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro
465 470 475 480
Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
485 490 495
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
500 505 510
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
515 520 525
Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Ser Gly Thr Ile Leu Ser
530 535 540
Glu Gly Ala Thr Asn Phe Ser Leu Leu Lys Leu Ala Gly Asp Val Glu
545 550 555 560
Leu Asn Pro Gly Pro Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val
565 570 575
Gly Gln Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp
580 585 590
Arg Gly Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr
595 600 605
Ser Leu Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val
610 615 620
Gly Arg Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg
625 630 635 640
Pro Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe
645 650 655
Leu Tyr Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu
660 665 670
Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr
675 680 685
Ile Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu
690 695 700
Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu
705 710 715 720
Leu Leu Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr
725 730 735
His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala
740 745 750
Arg Glu Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr
755 760 765
Asp Pro Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp
770 775 780
Gln Val Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro
785 790 795 800
Gly Leu Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr
805 810 815
Lys Lys Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu
820 825 830
Leu Thr Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser
835 840 845
Pro Gly Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu
850 855 860
Ile Leu Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu
865 870 875 880
Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn
885 890 895
Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile
900 905 910
Gly Ile Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu
915 920 925
Ala Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser
930 935 940
Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn
945 950 955 960
Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala
965 970 975
Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr
980 985 990
Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu
995 1000 1005
Asp Asp Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg
1010 1015 1020
Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile
1025 1030 1035
Thr Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val
1040 1045 1050
Ile Ala Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg
1055 1060 1065
Tyr Ala Val Val Gln Lys Ala Ala His Gly His Val Arg Lys Ala
1070 1075 1080
Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met
1085 1090 1095
Arg Gln Phe Val Ala His Leu Gln Glu Thr Ser Pro Leu Arg Asp
1100 1105 1110
Ala Val Val Ile Glu Gln Ser Ser Ser Leu Asn Glu Ala Ser Ser
1115 1120 1125
Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala Val
1130 1135 1140
Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln Gly Ile Pro Gln
1145 1150 1155
Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp
1160 1165 1170
Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu
1175 1180 1185
Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr
1190 1195 1200
His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu
1205 1210 1215
Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro
1220 1225 1230
Val Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro
1235 1240 1245
Ala His Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg
1250 1255 1260
Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser
1265 1270 1275
Ile Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg
1280 1285 1290
Asn Met Arg Arg Lys Leu Phe Gly Val Leu Arg Leu Lys Cys His
1295 1300 1305
Ser Leu Phe Leu Asp Leu Gln Val Asn Ser Leu Gln Thr Val Cys
1310 1315 1320
Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln Ala Tyr Arg Phe His
1325 1330 1335
Ala Cys Val Leu Gln Leu Pro Phe His Gln Gln Val Trp Lys Asn
1340 1345 1350
Pro Thr Phe Phe Leu Arg Val Ile Ser Asp Thr Ala Ser Leu Cys
1355 1360 1365
Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly Ala
1370 1375 1380
Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gln Trp Leu
1385 1390 1395
Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr
1400 1405 1410
Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr Gln Leu
1415 1420 1425
Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala
1430 1435 1440
Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp Gly
1445 1450 1455
Ser Gly Gln Cys Thr Asn Tyr Ala Leu Leu Lys Leu Ala Gly Asp
1460 1465 1470
Val Glu Ser Asn Pro Gly Pro Glu Ser Pro Ser Ala Pro Pro His
1475 1480 1485
Arg Trp Cys Ile Pro Trp Gln Arg Leu Leu Leu Thr Ala Ser Leu
1490 1495 1500
Leu Thr Phe Trp Asn Pro Pro Thr Thr Ala Lys Leu Thr Ile Glu
1505 1510 1515
Ser Thr Pro Phe Asn Val Ala Glu Gly Lys Glu Val Leu Leu Leu
1520 1525 1530
Val His Asn Leu Pro Gln His Leu Phe Gly Tyr Ser Trp Tyr Lys
1535 1540 1545
Gly Glu Arg Val Asp Gly Asn Arg Gln Ile Ile Gly Tyr Val Ile
1550 1555 1560
Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala Tyr Ser Gly Arg Glu
1565 1570 1575
Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln Asn Ile Ile Gln
1580 1585 1590
Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile Lys Ser Asp Leu
1595 1600 1605
Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr Pro Glu Leu
1610 1615 1620
Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro Val Glu Asp
1625 1630 1635
Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln Asn Thr
1640 1645 1650
Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro
1655 1660 1665
Arg Leu Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser
1670 1675 1680
Val Thr Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn
1685 1690 1695
Lys Leu Ser Val Asp His Ser Asp Pro Val Ile Leu Asn Val Leu
1700 1705 1710
Tyr Gly Pro Asp Asp Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr
1715 1720 1725
Arg Pro Gly Val Asn Leu Ser Leu Ser Cys His Ala Ala Ser Asn
1730 1735 1740
Pro Pro Ala Gln Tyr Ser Trp Leu Ile Asp Gly Asn Ile Gln Gln
1745 1750 1755
His Thr Gln Glu Leu Phe Ile Ser Asn Ile Thr Glu Lys Asn Ser
1760 1765 1770
Gly Leu Tyr Thr Cys Gln Ala Asn Asn Ser Ala Ser Gly His Ser
1775 1780 1785
Arg Thr Thr Val Lys Thr Ile Thr Val Ser Ala Glu Leu Pro Lys
1790 1795 1800
Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro Val Glu Asp Lys Asp
1805 1810 1815
Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gln Asn Thr Thr Tyr
1820 1825 1830
Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val Ser Pro Arg Leu
1835 1840 1845
Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn Val Thr
1850 1855 1860
Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln Asn Ser Val
1865 1870 1875
Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly
1880 1885 1890
Pro Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu Ser
1895 1900 1905
Gly Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser
1910 1915 1920
Pro Gln Tyr Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr
1925 1930 1935
Gln Val Leu Phe Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr
1940 1945 1950
Tyr Ala Cys Phe Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser
1955 1960 1965
Ile Val Lys Ser Ile Thr Val Ser Ala Ser Gly Thr Ser Pro Gly
1970 1975 1980
Leu Ser Ala Gly Ala Thr Val Gly Ile Met Ile Gly Val Leu Val
1985 1990 1995
Gly Val Ala Leu Ile
2000
<210> 44
<211> 6003
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 44
atggctagcg aatcgccaag cgcaccccct catcggtggt gcatcccttg gcaacgcctc 60
ctcctgaccg cctcactgct gactttctgg aacccgccga ccaccgcaaa gctgaccatt 120
gagagcactc ccttcaacgt ggctgagggg aaggaggtgc tgctcctggt gcacaatctg 180
ccccagcacc tgttcgggta ctcctggtac aagggagaac gcgtggacgg gaaccggcag 240
atcataggct acgtcatcgg aacccagcag gccacacccg gtccagcgta cagcggccgg 300
gagattatct acccgaacgc ctccctgctg atccaaaaca tcatccagaa cgacaccggt 360
ttctacactc tgcacgtgat taagtcagat ctggtcaacg aagaggccac cggccaattc 420
agggtgtacc ccgaactccc taagccgttc atcacctcga acaacagcaa cccggtcgag 480
gatgaagatg cggtggcctt gacgtgcgaa cctgagatcc agaacaccac ctacttgtgg 540
tgggtgaaca atcagagcct gccagtctcc ccacgactcc agctgtcgaa cgacaacagg 600
accctgactt tgctgtccgt gactcggaac gacgtgggcc cttatgaatg cggtatccag 660
aacaagctgt ccgtggacca cagcgaccct gtgatcctga acgtccttta cgggccggac 720
gaccccacca tttccccgtc gtacacttac taccggccgg gcgtgaacct gtccctgtcg 780
tgccacgctg cctccaatcc gccggcccag tactcctggc tcatcgacgg aaacatccag 840
cagcacaccc aagaactgtt catctccaac attaccgaga aaaactcggg actttacacc 900
tgtcaagcca acaattccgc cagcggccac tcccgcacca ctgtcaaaac tatcactgtg 960
tccgccgaac tcccgaagcc cagcatcagc tccaacaact cgaagcccgt ggaggataag 1020
gacgctgtcg cgttcacctg tgaaccagag gcacagaata ccacctacct ttggtgggtc 1080
aacggacagt ccctgcctgt ctcaccgaga ctgcagctgt caaacgggaa taggactctg 1140
accttgttta acgtcacccg gaacgacgcc cgggcctacg tgtgcggcat ccagaactcc 1200
gtgagcgcaa accggtctga cccagtgacc ctggatgtgc tgtacggccc cgacactccg 1260
atcatttcac cccccgattc atcctacctg tccggcgcta acctcaacct ctcatgccac 1320
tccgcatcca accccagccc gcaatattcg tggcgcatta acggaattcc tcagcaacat 1380
acccaggtcc tgttcattgc gaagatcacc cctaacaaca acggaaccta cgcctgcttt 1440
gtgtcaaacc tggccactgg tagaaacaac tccatcgtga agtccattac cgtgtcggcg 1500
tccggaactt ccccgggcct gagcgccggc gccaccgtgg gaattatgat cggcgtgctc 1560
gtgggagtgg ccctgatcgg atccggcgag ggcagaggca gcctgctgac atgtggcgac 1620
gtggaagaga accctggccc cacccctgga acccagagcc ccttcttcct tctgctgctg 1680
ctgaccgtgc tgactgtcgt gacaggctct ggccacgcca gctctacacc tggcggcgag 1740
aaagagacaa gcgccaccca gagaagcagc gtgccaagca gcaccgagaa gaacgccgtg 1800
tccatgacca gctccgtgct gagcagccac tctcctggca gcggcagcag cacaacacag 1860
ggccaggatg tgacactggc ccctgccaca gaacctgcct ctggatctgc cgccacctgg 1920
ggacaggacg tgacaagcgt gccagtgacc agacctgccc tgggctctac aacaccccct 1980
gcccacgatg tgaccagcgc ccctgataac aagcctgccc ctggaagcac agcccctcca 2040
gctcatggcg tgacctctgc cccagatacc agaccagccc caggatctac agccccaccc 2100
gcacacggcg tgacaagtgc ccctgacaca agacccgctc caggctctac tgctcctcct 2160
gcccatggcg tgacaagcgc tcccgataca aggccagctc ctggctccac agcaccacca 2220
gcacatggcg tgacatcagc tcccgacact agacctgctc ccggatcaac cgctccacca 2280
gctcacggcg tgaccagcgc acctgatacc agacctgctc tgggaagcac cgcccctccc 2340
gtgcacaatg tgacatctgc ttccggcagc gccagcggct ctgcctctac actggtgcac 2400
aacggcacca gcgccagagc cacaacaacc ccagccagca agagcacccc cttcagcatc 2460
cctagccacc acagcgacac ccctaccaca ctggccagcc actccaccaa gaccgatgcc 2520
tctagcaccc accactccag cgtgccccct ctgaccagca gcaaccacag cacaagcccc 2580
cagctgtcta ccggcgtctc attcttcttt ctgtccttcc acatcagcaa cctgcagttc 2640
aacagcagcc tggaagatcc cagcaccgac tactaccagg aactgcagcg ggatatcagc 2700
gagatgttcc tgcaaatcta caagcagggc ggcttcctgg gcctgagcaa catcaagttc 2760
agacccggca gcgtggtggt gcagctgacc ctggctttcc gggaaggcac catcaacgtg 2820
cacgacgtgg aaacccagtt caaccagtac aagaccgagg ccgccagccg gtacaacctg 2880
accatctccg atgtgtccgt gtccgacgtg cccttcccat tctctgccca gtctggcgca 2940
ggcgtgccag gatggggaat tgctctgctg gtgctcgtgt gcgtgctggt ggccctggcc 3000
atcgtgtatc tgattgccct ggccgtgtgc cagtgccggc ggaagaatta cggccagctg 3060
gacatcttcc ccgccagaga cacctaccac cccatgagcg agtaccccac ataccacacc 3120
cacggcagat acgtgccacc cagctccacc gacagatccc cctacgagaa agtgtctgcc 3180
ggcaacggcg gcagctccct gagctacaca aatcctgccg tggccgctgc ctccgccaac 3240
ctgggatccg gcacaatcct gtctgagggc gccaccaact tcagcctgct gaaactggcc 3300
ggcgacgtgg aactgaaccc tggccctgga gctgccccgg agccggagag gacccccgtt 3360
ggccagggat cgtgggccca tccgggacgc accaggggac catccgacag gggattctgt 3420
gtggtgtcac cggccaggcc agcagaagag gcaaccagcc tcgagggagc gttgtctgga 3480
accagacatt cccacccgtc ggtgggccgg cagcaccacg cgggaccacc gtccacttcc 3540
agaccgccac ggccatggga caccccttgc ccgcctgtgt atgccgagac taaacacttc 3600
ctgtactcat ccggagacaa ggaacagctt cggccgtcct tcctcctgtc gtcgctcaga 3660
ccgagcctga ccggagcacg cagattggtg gaaactatct tccttgggtc acgtccgtgg 3720
atgccaggta ccccacggcg cctcccgcgc ctcccacaga gatactggca gatgcggcct 3780
ctgttcctgg aattgctggg aaaccacgct cagtgcccgt acggagtcct gctcaagact 3840
cactgccctc tgagggcggc ggtcactccg gcggccggag tgtgcgcacg ggagaagccc 3900
cagggaagcg tggcagctcc ggaagaggag gacaccgatc cgcgccgcct cgtgcaactt 3960
ctgcgccagc actcctcgcc ctggcaagtc tacgggttcg tccgcgcctg cctgcgccgc 4020
ctggtgccgc ctgggctctg gggttcccgg cataacgagc gccgcttcct gagaaatact 4080
aagaagttta tctcacttgg aaaacatgcc aagttgtcgc tgcaagaact cacgtggaag 4140
atgtcagtcc gcgattgcgc ctggctgcgc cgctcgccgg gcgtcgggtg tgttccagct 4200
gcagaacacc gcctgagaga agaaattctg gccaaatttc tgcattggct gatgtcagtg 4260
tacgtggtcg agctgctgcg ctcctttttc tacgtcactg agactacctt tcaaaagaac 4320
cgcctgttct tctaccgcaa atctgtgtgg agcaagctgc agtcaatcgg cattcgccag 4380
catctgaaga gggtgcagct gcgggaactt tccgaggcag aagtccgcca gcaccgggag 4440
gcccggccgg cgcttctcac gtcgcgtctg agattcatcc caaagcccga cgggctgagg 4500
cctatcgtca acatggatta cgtcgtgggc gctcgcacct ttcgccgtga aaagcgggcc 4560
gaacgcttga cctcacgggt gaaggccctc ttctccgtgc tgaactacga gagagcaaga 4620
cggcctggcc tgctgggagc ttcggtgctg ggactggacg atatccaccg ggcttggcgg 4680
acctttgttc tccgggtgag agcccaagac cctccgccgg aactgtactt cgtgaaggtg 4740
gcgatcaccg gagcctatga tactattccg caagatcgac tcaccgaagt catcgcctcg 4800
atcatcaaac cgcagaacac ttactgcgtc aggcggtacg ccgtggtcca gaaggccgcg 4860
catggccacg tgagaaaggc gttcaagtcg cacgtgtcca ctctcaccga cctccagcct 4920
tacatgaggc aattcgttgc gcatttgcaa gagacttcgc ccctgagaga tgcggtggtc 4980
atcgagcaga gctccagcct gaacgaagcg agcagcggtc tgtttgacgt gttcctccgc 5040
ttcatgtgtc atcacgcggt gcgaatcagg ggaaaatcat acgtgcagtg ccagggaatc 5100
ccacaaggca gcattctgtc gactctcttg tgttcccttt gctacggcga tatggaaaac 5160
aagctgttcg ctgggatcag acgggacggg ttgctgctca gactggtgga cgacttcctg 5220
ctggtgactc cgcacctcac tcacgccaaa acctttctcc gcactctggt gaggggagtg 5280
ccagaatacg gctgtgtggt caatctccgg aaaactgtgg tgaatttccc tgtcgaggat 5340
gaggcactcg gaggaaccgc atttgtccaa atgccagcac atggcctgtt cccatggtgc 5400
ggtctgctgc tggacacccg aactcttgaa gtgcagtccg actactccag ctatgcccgg 5460
acgagcatcc gcgccagcct cactttcaat cgcggcttta aggccggacg aaacatgcgc 5520
agaaagcttt tcggagtcct ccggcttaaa tgccattcgc tctttctcga tctccaagtc 5580
aattcgctgc agaccgtgtg cacgaacatc tacaagatcc tgctgctcca agcctaccgg 5640
ttccacgctt gcgtgcttca gctgccgttt caccaacagg tgtggaagaa cccgaccttc 5700
tttctgcggg tcattagcga tactgcctcc ctgtgttact caatcctcaa ggcaaagaac 5760
gccggaatgt cgctgggtgc gaaaggagcc gcgggacctc ttcctagcga agcggtgcag 5820
tggctctgcc accaggcttt cctcctgaag ctgaccaggc acagagtgac ctacgtcccg 5880
ctgctgggct cgctgcgcac tgcacagacc cagctgtcta gaaaactccc cggcaccacc 5940
ctgaccgctc tggaagccgc cgccaaccca gcattgccgt cagatttcaa gaccatcttg 6000
gac 6003
<210> 45
<211> 2001
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 45
Met Ala Ser Glu Ser Pro Ser Ala Pro Pro His Arg Trp Cys Ile Pro
1 5 10 15
Trp Gln Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro
20 25 30
Pro Thr Thr Ala Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala
35 40 45
Glu Gly Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His Leu
50 55 60
Phe Gly Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gln
65 70 75 80
Ile Ile Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala
85 90 95
Tyr Ser Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln
100 105 110
Asn Ile Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile Lys
115 120 125
Ser Asp Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr Pro
130 135 140
Glu Leu Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro Val Glu
145 150 155 160
Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln Asn Thr
165 170 175
Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg
180 185 190
Leu Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr
195 200 205
Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Lys Leu Ser
210 215 220
Val Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro Asp
225 230 235 240
Asp Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn
245 250 255
Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser
260 265 270
Trp Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe Ile
275 280 285
Ser Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala Asn
290 295 300
Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile Thr Val
305 310 315 320
Ser Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro
325 330 335
Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gln
340 345 350
Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val Ser
355 360 365
Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn
370 375 380
Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln Asn Ser
385 390 395 400
Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly
405 410 415
Pro Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly
420 425 430
Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gln
435 440 445
Tyr Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr Gln Val Leu
450 455 460
Phe Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe
465 470 475 480
Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys Ser Ile
485 490 495
Thr Val Ser Ala Ser Gly Thr Ser Pro Gly Leu Ser Ala Gly Ala Thr
500 505 510
Val Gly Ile Met Ile Gly Val Leu Val Gly Val Ala Leu Ile Gly Ser
515 520 525
Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn
530 535 540
Pro Gly Pro Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
545 550 555 560
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
565 570 575
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
580 585 590
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
595 600 605
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
610 615 620
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
625 630 635 640
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
645 650 655
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
660 665 670
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
675 680 685
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
690 695 700
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
705 710 715 720
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
725 730 735
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
740 745 750
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
755 760 765
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
770 775 780
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
785 790 795 800
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
805 810 815
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
820 825 830
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
835 840 845
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
850 855 860
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
865 870 875 880
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
885 890 895
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
900 905 910
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
915 920 925
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
930 935 940
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
945 950 955 960
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
965 970 975
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
980 985 990
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
995 1000 1005
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe
1010 1015 1020
Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr
1025 1030 1035
His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser
1040 1045 1050
Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser
1055 1060 1065
Tyr Thr Asn Pro Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Ser
1070 1075 1080
Gly Thr Ile Leu Ser Glu Gly Ala Thr Asn Phe Ser Leu Leu Lys
1085 1090 1095
Leu Ala Gly Asp Val Glu Leu Asn Pro Gly Pro Gly Ala Ala Pro
1100 1105 1110
Glu Pro Glu Arg Thr Pro Val Gly Gln Gly Ser Trp Ala His Pro
1115 1120 1125
Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val Val Ser
1130 1135 1140
Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu
1145 1150 1155
Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gln His His
1160 1165 1170
Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr
1175 1180 1185
Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser
1190 1195 1200
Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser
1205 1210 1215
Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile
1220 1225 1230
Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu
1235 1240 1245
Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu
1250 1255 1260
Glu Leu Leu Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu
1265 1270 1275
Lys Thr His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly
1280 1285 1290
Val Cys Ala Arg Glu Lys Pro Gln Gly Ser Val Ala Ala Pro Glu
1295 1300 1305
Glu Glu Asp Thr Asp Pro Arg Arg Leu Val Gln Leu Leu Arg Gln
1310 1315 1320
His Ser Ser Pro Trp Gln Val Tyr Gly Phe Val Arg Ala Cys Leu
1325 1330 1335
Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg His Asn Glu
1340 1345 1350
Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe Ile Ser Leu Gly Lys
1355 1360 1365
His Ala Lys Leu Ser Leu Gln Glu Leu Thr Trp Lys Met Ser Val
1370 1375 1380
Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys Val
1385 1390 1395
Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu Ala Lys Phe
1400 1405 1410
Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser
1415 1420 1425
Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu Phe
1430 1435 1440
Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile
1445 1450 1455
Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala
1460 1465 1470
Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser
1475 1480 1485
Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val
1490 1495 1500
Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys
1505 1510 1515
Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val
1520 1525 1530
Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser
1535 1540 1545
Val Leu Gly Leu Asp Asp Ile His Arg Ala Trp Arg Thr Phe Val
1550 1555 1560
Leu Arg Val Arg Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val
1565 1570 1575
Lys Val Ala Ile Thr Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg
1580 1585 1590
Leu Thr Glu Val Ile Ala Ser Ile Ile Lys Pro Gln Asn Thr Tyr
1595 1600 1605
Cys Val Arg Arg Tyr Ala Val Val Gln Lys Ala Ala His Gly His
1610 1615 1620
Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu
1625 1630 1635
Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln Glu Thr Ser
1640 1645 1650
Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser Leu Asn
1655 1660 1665
Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys
1670 1675 1680
His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln
1685 1690 1695
Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu
1700 1705 1710
Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg
1715 1720 1725
Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr
1730 1735 1740
Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg
1745 1750 1755
Gly Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val
1760 1765 1770
Val Asn Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe
1775 1780 1785
Val Gln Met Pro Ala His Gly Leu Phe Pro Trp Cys Gly Leu Leu
1790 1795 1800
Leu Asp Thr Arg Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr
1805 1810 1815
Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe
1820 1825 1830
Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val Leu Arg
1835 1840 1845
Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val Asn Ser Leu
1850 1855 1860
Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln Ala
1865 1870 1875
Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe His Gln Gln
1880 1885 1890
Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser Asp Thr
1895 1900 1905
Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly Met
1910 1915 1920
Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala
1925 1930 1935
Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg
1940 1945 1950
His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala
1955 1960 1965
Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala
1970 1975 1980
Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr
1985 1990 1995
Ile Leu Asp
2000
<210> 46
<211> 6024
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 46
atggctagcg gagctgcccc ggagccggag aggacccccg ttggccaggg atcgtgggcc 60
catccgggac gcaccagggg accatccgac aggggattct gtgtggtgtc accggccagg 120
ccagcagaag aggcaaccag cctcgaggga gcgttgtctg gaaccagaca ttcccacccg 180
tcggtgggcc ggcagcacca cgcgggacca ccgtccactt ccagaccgcc acggccatgg 240
gacacccctt gcccgcctgt gtatgccgag actaaacact tcctgtactc atccggagac 300
aaggaacagc ttcggccgtc cttcctcctg tcgtcgctca gaccgagcct gaccggagca 360
cgcagattgg tggaaactat cttccttggg tcacgtccgt ggatgccagg taccccacgg 420
cgcctcccgc gcctcccaca gagatactgg cagatgcggc ctctgttcct ggaattgctg 480
ggaaaccacg ctcagtgccc gtacggagtc ctgctcaaga ctcactgccc tctgagggcg 540
gcggtcactc cggcggccgg agtgtgcgca cgggagaagc cccagggaag cgtggcagct 600
ccggaagagg aggacaccga tccgcgccgc ctcgtgcaac ttctgcgcca gcactcctcg 660
ccctggcaag tctacgggtt cgtccgcgcc tgcctgcgcc gcctggtgcc gcctgggctc 720
tggggttccc ggcataacga gcgccgcttc ctgagaaata ctaagaagtt tatctcactt 780
ggaaaacatg ccaagttgtc gctgcaagaa ctcacgtgga agatgtcagt ccgcgattgc 840
gcctggctgc gccgctcgcc gggcgtcggg tgtgttccag ctgcagaaca ccgcctgaga 900
gaagaaattc tggccaaatt tctgcattgg ctgatgtcag tgtacgtggt cgagctgctg 960
cgctcctttt tctacgtcac tgagactacc tttcaaaaga accgcctgtt cttctaccgc 1020
aaatctgtgt ggagcaagct gcagtcaatc ggcattcgcc agcatctgaa gagggtgcag 1080
ctgcgggaac tttccgaggc agaagtccgc cagcaccggg aggcccggcc ggcgcttctc 1140
acgtcgcgtc tgagattcat cccaaagccc gacgggctga ggcctatcgt caacatggat 1200
tacgtcgtgg gcgctcgcac ctttcgccgt gaaaagcggg ccgaacgctt gacctcacgg 1260
gtgaaggccc tcttctccgt gctgaactac gagagagcaa gacggcctgg cctgctggga 1320
gcttcggtgc tgggactgga cgatatccac cgggcttggc ggacctttgt tctccgggtg 1380
agagcccaag accctccgcc ggaactgtac ttcgtgaagg tggcgatcac cggagcctat 1440
gatactattc cgcaagatcg actcaccgaa gtcatcgcct cgatcatcaa accgcagaac 1500
acttactgcg tcaggcggta cgccgtggtc cagaaggccg cgcatggcca cgtgagaaag 1560
gcgttcaagt cgcacgtgtc cactctcacc gacctccagc cttacatgag gcaattcgtt 1620
gcgcatttgc aagagacttc gcccctgaga gatgcggtgg tcatcgagca gagctccagc 1680
ctgaacgaag cgagcagcgg tctgtttgac gtgttcctcc gcttcatgtg tcatcacgcg 1740
gtgcgaatca ggggaaaatc atacgtgcag tgccagggaa tcccacaagg cagcattctg 1800
tcgactctct tgtgttccct ttgctacggc gatatggaaa acaagctgtt cgctgggatc 1860
agacgggacg ggttgctgct cagactggtg gacgacttcc tgctggtgac tccgcacctc 1920
actcacgcca aaacctttct ccgcactctg gtgaggggag tgccagaata cggctgtgtg 1980
gtcaatctcc ggaaaactgt ggtgaatttc cctgtcgagg atgaggcact cggaggaacc 2040
gcatttgtcc aaatgccagc acatggcctg ttcccatggt gcggtctgct gctggacacc 2100
cgaactcttg aagtgcagtc cgactactcc agctatgccc ggacgagcat ccgcgccagc 2160
ctcactttca atcgcggctt taaggccgga cgaaacatgc gcagaaagct tttcggagtc 2220
ctccggctta aatgccattc gctctttctc gatctccaag tcaattcgct gcagaccgtg 2280
tgcacgaaca tctacaagat cctgctgctc caagcctacc ggttccacgc ttgcgtgctt 2340
cagctgccgt ttcaccaaca ggtgtggaag aacccgacct tctttctgcg ggtcattagc 2400
gatactgcct ccctgtgtta ctcaatcctc aaggcaaaga acgccggaat gtcgctgggt 2460
gcgaaaggag ccgcgggacc tcttcctagc gaagcggtgc agtggctctg ccaccaggct 2520
ttcctcctga agctgaccag gcacagagtg acctacgtcc cgctgctggg ctcgctgcgc 2580
actgcacaga cccagctgtc tagaaaactc cccggcacca ccctgaccgc tctggaagcc 2640
gccgccaacc cagcattgcc gtcagatttc aagaccatct tggacggatc cggcacaatc 2700
ctgtctgagg gcgccaccaa cttcagcctg ctgaaactgg ccggcgacgt ggaactgaac 2760
cctggcccta cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 2820
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 2880
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 2940
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 3000
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 3060
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 3120
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 3180
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 3240
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 3300
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 3360
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 3420
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 3480
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 3540
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 3600
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 3660
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 3720
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 3780
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 3840
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 3900
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 3960
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 4020
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 4080
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 4140
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 4200
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 4260
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 4320
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggatccggc 4380
agaatcttca acgcccacta cgccggctac ttcgccgacc tgctgatcca cgacatcgag 4440
acaaaccctg gccccgaatc gccaagcgca ccccctcatc ggtggtgcat cccttggcaa 4500
cgcctcctcc tgaccgcctc actgctgact ttctggaacc cgccgaccac cgcaaagctg 4560
accattgaga gcactccctt caacgtggct gaggggaagg aggtgctgct cctggtgcac 4620
aatctgcccc agcacctgtt cgggtactcc tggtacaagg gagaacgcgt ggacgggaac 4680
cggcagatca taggctacgt catcggaacc cagcaggcca cacccggtcc agcgtacagc 4740
ggccgggaga ttatctaccc gaacgcctcc ctgctgatcc aaaacatcat ccagaacgac 4800
accggtttct acactctgca cgtgattaag tcagatctgg tcaacgaaga ggccaccggc 4860
caattcaggg tgtaccccga actccctaag ccgttcatca cctcgaacaa cagcaacccg 4920
gtcgaggatg aagatgcggt ggccttgacg tgcgaacctg agatccagaa caccacctac 4980
ttgtggtggg tgaacaatca gagcctgcca gtctccccac gactccagct gtcgaacgac 5040
aacaggaccc tgactttgct gtccgtgact cggaacgacg tgggccctta tgaatgcggt 5100
atccagaaca agctgtccgt ggaccacagc gaccctgtga tcctgaacgt cctttacggg 5160
ccggacgacc ccaccatttc cccgtcgtac acttactacc ggccgggcgt gaacctgtcc 5220
ctgtcgtgcc acgctgcctc caatccgccg gcccagtact cctggctcat cgacggaaac 5280
atccagcagc acacccaaga actgttcatc tccaacatta ccgagaaaaa ctcgggactt 5340
tacacctgtc aagccaacaa ttccgccagc ggccactccc gcaccactgt caaaactatc 5400
actgtgtccg ccgaactccc gaagcccagc atcagctcca acaactcgaa gcccgtggag 5460
gataaggacg ctgtcgcgtt cacctgtgaa ccagaggcac agaataccac ctacctttgg 5520
tgggtcaacg gacagtccct gcctgtctca ccgagactgc agctgtcaaa cgggaatagg 5580
actctgacct tgtttaacgt cacccggaac gacgcccggg cctacgtgtg cggcatccag 5640
aactccgtga gcgcaaaccg gtctgaccca gtgaccctgg atgtgctgta cggccccgac 5700
actccgatca tttcaccccc cgattcatcc tacctgtccg gcgctaacct caacctctca 5760
tgccactccg catccaaccc cagcccgcaa tattcgtggc gcattaacgg aattcctcag 5820
caacataccc aggtcctgtt cattgcgaag atcaccccta acaacaacgg aacctacgcc 5880
tgctttgtgt caaacctggc cactggtaga aacaactcca tcgtgaagtc cattaccgtg 5940
tcggcgtccg gaacttcccc gggcctgagc gccggcgcca ccgtgggaat tatgatcggc 6000
gtgctcgtgg gagtggccct gatc 6024
<210> 47
<211> 2008
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 47
Met Ala Ser Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln
1 5 10 15
Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly
20 25 30
Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu
35 40 45
Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg
50 55 60
Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp
65 70 75 80
Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr
85 90 95
Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser
100 105 110
Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe
115 120 125
Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg
130 135 140
Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu
145 150 155 160
Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys
165 170 175
Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu
180 185 190
Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro
195 200 205
Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val
210 215 220
Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu
225 230 235 240
Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys
245 250 255
Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr
260 265 270
Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly
275 280 285
Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu
290 295 300
Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu
305 310 315 320
Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu
325 330 335
Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile
340 345 350
Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu
355 360 365
Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu
370 375 380
Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp
385 390 395 400
Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg
405 410 415
Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg
420 425 430
Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp
435 440 445
Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp
450 455 460
Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr
465 470 475 480
Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile
485 490 495
Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys
500 505 510
Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr
515 520 525
Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln
530 535 540
Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser
545 550 555 560
Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met
565 570 575
Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln
580 585 590
Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys
595 600 605
Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly
610 615 620
Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu
625 630 635 640
Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu
645 650 655
Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val
660 665 670
Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His
675 680 685
Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu
690 695 700
Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser
705 710 715 720
Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys
725 730 735
Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu
740 745 750
Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu
755 760 765
Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe
770 775 780
His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser
785 790 795 800
Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly
805 810 815
Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala
820 825 830
Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His
835 840 845
Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr
850 855 860
Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala
865 870 875 880
Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp Gly
885 890 895
Ser Gly Thr Ile Leu Ser Glu Gly Ala Thr Asn Phe Ser Leu Leu Lys
900 905 910
Leu Ala Gly Asp Val Glu Leu Asn Pro Gly Pro Thr Pro Gly Thr Gln
915 920 925
Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr Val Leu Thr Val Val Thr
930 935 940
Gly Ser Gly His Ala Ser Ser Thr Pro Gly Gly Glu Lys Glu Thr Ser
945 950 955 960
Ala Thr Gln Arg Ser Ser Val Pro Ser Ser Thr Glu Lys Asn Ala Val
965 970 975
Ser Met Thr Ser Ser Val Leu Ser Ser His Ser Pro Gly Ser Gly Ser
980 985 990
Ser Thr Thr Gln Gly Gln Asp Val Thr Leu Ala Pro Ala Thr Glu Pro
995 1000 1005
Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln Asp Val Thr Ser Val
1010 1015 1020
Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro Pro Ala His
1025 1030 1035
Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser Thr
1040 1045 1050
Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
1055 1060 1065
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala
1070 1075 1080
Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
1085 1090 1095
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr
1100 1105 1110
Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
1115 1120 1125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala
1130 1135 1140
Pro Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His
1145 1150 1155
Asn Val Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr
1160 1165 1170
Leu Val His Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala
1175 1180 1185
Ser Lys Ser Thr Pro Phe Ser Ile Pro Ser His His Ser Asp Thr
1190 1195 1200
Pro Thr Thr Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser Ser
1205 1210 1215
Thr His His Ser Ser Val Pro Pro Leu Thr Ser Ser Asn His Ser
1220 1225 1230
Thr Ser Pro Gln Leu Ser Thr Gly Val Ser Phe Phe Phe Leu Ser
1235 1240 1245
Phe His Ile Ser Asn Leu Gln Phe Asn Ser Ser Leu Glu Asp Pro
1250 1255 1260
Ser Thr Asp Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser Glu Met
1265 1270 1275
Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser Asn
1280 1285 1290
Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu Ala
1295 1300 1305
Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe
1310 1315 1320
Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile
1325 1330 1335
Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln
1340 1345 1350
Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
1355 1360 1365
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu
1370 1375 1380
Ala Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile
1385 1390 1395
Phe Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr
1400 1405 1410
Tyr His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg
1415 1420 1425
Ser Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu
1430 1435 1440
Ser Tyr Thr Asn Pro Ala Val Ala Ala Ala Ser Ala Asn Leu Gly
1445 1450 1455
Ser Gly Arg Ile Phe Asn Ala His Tyr Ala Gly Tyr Phe Ala Asp
1460 1465 1470
Leu Leu Ile His Asp Ile Glu Thr Asn Pro Gly Pro Glu Ser Pro
1475 1480 1485
Ser Ala Pro Pro His Arg Trp Cys Ile Pro Trp Gln Arg Leu Leu
1490 1495 1500
Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro Pro Thr Thr Ala
1505 1510 1515
Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala Glu Gly Lys
1520 1525 1530
Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His Leu Phe Gly
1535 1540 1545
Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gln Ile
1550 1555 1560
Ile Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala
1565 1570 1575
Tyr Ser Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile
1580 1585 1590
Gln Asn Ile Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val
1595 1600 1605
Ile Lys Ser Asp Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg
1610 1615 1620
Val Tyr Pro Glu Leu Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser
1625 1630 1635
Asn Pro Val Glu Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro
1640 1645 1650
Glu Ile Gln Asn Thr Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser
1655 1660 1665
Leu Pro Val Ser Pro Arg Leu Gln Leu Ser Asn Asp Asn Arg Thr
1670 1675 1680
Leu Thr Leu Leu Ser Val Thr Arg Asn Asp Val Gly Pro Tyr Glu
1685 1690 1695
Cys Gly Ile Gln Asn Lys Leu Ser Val Asp His Ser Asp Pro Val
1700 1705 1710
Ile Leu Asn Val Leu Tyr Gly Pro Asp Asp Pro Thr Ile Ser Pro
1715 1720 1725
Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn Leu Ser Leu Ser Cys
1730 1735 1740
His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser Trp Leu Ile Asp
1745 1750 1755
Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe Ile Ser Asn Ile
1760 1765 1770
Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala Asn Asn Ser
1775 1780 1785
Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile Thr Val Ser
1790 1795 1800
Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro
1805 1810 1815
Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala
1820 1825 1830
Gln Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro
1835 1840 1845
Val Ser Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr
1850 1855 1860
Leu Phe Asn Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly
1865 1870 1875
Ile Gln Asn Ser Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu
1880 1885 1890
Asp Val Leu Tyr Gly Pro Asp Thr Pro Ile Ile Ser Pro Pro Asp
1895 1900 1905
Ser Ser Tyr Leu Ser Gly Ala Asn Leu Asn Leu Ser Cys His Ser
1910 1915 1920
Ala Ser Asn Pro Ser Pro Gln Tyr Ser Trp Arg Ile Asn Gly Ile
1925 1930 1935
Pro Gln Gln His Thr Gln Val Leu Phe Ile Ala Lys Ile Thr Pro
1940 1945 1950
Asn Asn Asn Gly Thr Tyr Ala Cys Phe Val Ser Asn Leu Ala Thr
1955 1960 1965
Gly Arg Asn Asn Ser Ile Val Lys Ser Ile Thr Val Ser Ala Ser
1970 1975 1980
Gly Thr Ser Pro Gly Leu Ser Ala Gly Ala Thr Val Gly Ile Met
1985 1990 1995
Ile Gly Val Leu Val Gly Val Ala Leu Ile
2000 2005
<210> 48
<211> 5988
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 48
atggctagcg gagctgcccc ggagccggag aggacccccg ttggccaggg atcgtgggcc 60
catccgggac gcaccagggg accatccgac aggggattct gtgtggtgtc accggccagg 120
ccagcagaag aggcaaccag cctcgaggga gcgttgtctg gaaccagaca ttcccacccg 180
tcggtgggcc ggcagcacca cgcgggacca ccgtccactt ccagaccgcc acggccatgg 240
gacacccctt gcccgcctgt gtatgccgag actaaacact tcctgtactc atccggagac 300
aaggaacagc ttcggccgtc cttcctcctg tcgtcgctca gaccgagcct gaccggagca 360
cgcagattgg tggaaactat cttccttggg tcacgtccgt ggatgccagg taccccacgg 420
cgcctcccgc gcctcccaca gagatactgg cagatgcggc ctctgttcct ggaattgctg 480
ggaaaccacg ctcagtgccc gtacggagtc ctgctcaaga ctcactgccc tctgagggcg 540
gcggtcactc cggcggccgg agtgtgcgca cgggagaagc cccagggaag cgtggcagct 600
ccggaagagg aggacaccga tccgcgccgc ctcgtgcaac ttctgcgcca gcactcctcg 660
ccctggcaag tctacgggtt cgtccgcgcc tgcctgcgcc gcctggtgcc gcctgggctc 720
tggggttccc ggcataacga gcgccgcttc ctgagaaata ctaagaagtt tatctcactt 780
ggaaaacatg ccaagttgtc gctgcaagaa ctcacgtgga agatgtcagt ccgcgattgc 840
gcctggctgc gccgctcgcc gggcgtcggg tgtgttccag ctgcagaaca ccgcctgaga 900
gaagaaattc tggccaaatt tctgcattgg ctgatgtcag tgtacgtggt cgagctgctg 960
cgctcctttt tctacgtcac tgagactacc tttcaaaaga accgcctgtt cttctaccgc 1020
aaatctgtgt ggagcaagct gcagtcaatc ggcattcgcc agcatctgaa gagggtgcag 1080
ctgcgggaac tttccgaggc agaagtccgc cagcaccggg aggcccggcc ggcgcttctc 1140
acgtcgcgtc tgagattcat cccaaagccc gacgggctga ggcctatcgt caacatggat 1200
tacgtcgtgg gcgctcgcac ctttcgccgt gaaaagcggg ccgaacgctt gacctcacgg 1260
gtgaaggccc tcttctccgt gctgaactac gagagagcaa gacggcctgg cctgctggga 1320
gcttcggtgc tgggactgga cgatatccac cgggcttggc ggacctttgt tctccgggtg 1380
agagcccaag accctccgcc ggaactgtac ttcgtgaagg tggcgatcac cggagcctat 1440
gatactattc cgcaagatcg actcaccgaa gtcatcgcct cgatcatcaa accgcagaac 1500
acttactgcg tcaggcggta cgccgtggtc cagaaggccg cgcatggcca cgtgagaaag 1560
gcgttcaagt cgcacgtgtc cactctcacc gacctccagc cttacatgag gcaattcgtt 1620
gcgcatttgc aagagacttc gcccctgaga gatgcggtgg tcatcgagca gagctccagc 1680
ctgaacgaag cgagcagcgg tctgtttgac gtgttcctcc gcttcatgtg tcatcacgcg 1740
gtgcgaatca ggggaaaatc atacgtgcag tgccagggaa tcccacaagg cagcattctg 1800
tcgactctct tgtgttccct ttgctacggc gatatggaaa acaagctgtt cgctgggatc 1860
agacgggacg ggttgctgct cagactggtg gacgacttcc tgctggtgac tccgcacctc 1920
actcacgcca aaacctttct ccgcactctg gtgaggggag tgccagaata cggctgtgtg 1980
gtcaatctcc ggaaaactgt ggtgaatttc cctgtcgagg atgaggcact cggaggaacc 2040
gcatttgtcc aaatgccagc acatggcctg ttcccatggt gcggtctgct gctggacacc 2100
cgaactcttg aagtgcagtc cgactactcc agctatgccc ggacgagcat ccgcgccagc 2160
ctcactttca atcgcggctt taaggccgga cgaaacatgc gcagaaagct tttcggagtc 2220
ctccggctta aatgccattc gctctttctc gatctccaag tcaattcgct gcagaccgtg 2280
tgcacgaaca tctacaagat cctgctgctc caagcctacc ggttccacgc ttgcgtgctt 2340
cagctgccgt ttcaccaaca ggtgtggaag aacccgacct tctttctgcg ggtcattagc 2400
gatactgcct ccctgtgtta ctcaatcctc aaggcaaaga acgccggaat gtcgctgggt 2460
gcgaaaggag ccgcgggacc tcttcctagc gaagcggtgc agtggctctg ccaccaggct 2520
ttcctcctga agctgaccag gcacagagtg acctacgtcc cgctgctggg ctcgctgcgc 2580
actgcacaga cccagctgtc tagaaaactc cccggcacca ccctgaccgc tctggaagcc 2640
gccgccaacc cagcattgcc gtcagatttc aagaccatct tggacggatc cggccagtgc 2700
accaattacg ccctgctgaa gctggccggc gacgtggaat ctaaccctgg ccctgaatcg 2760
ccaagcgcac cccctcatcg gtggtgcatc ccttggcaac gcctcctcct gaccgcctca 2820
ctgctgactt tctggaaccc gccgaccacc gcaaagctga ccattgagag cactcccttc 2880
aacgtggctg aggggaagga ggtgctgctc ctggtgcaca atctgcccca gcacctgttc 2940
gggtactcct ggtacaaggg agaacgcgtg gacgggaacc ggcagatcat aggctacgtc 3000
atcggaaccc agcaggccac acccggtcca gcgtacagcg gccgggagat tatctacccg 3060
aacgcctccc tgctgatcca aaacatcatc cagaacgaca ccggtttcta cactctgcac 3120
gtgattaagt cagatctggt caacgaagag gccaccggcc aattcagggt gtaccccgaa 3180
ctccctaagc cgttcatcac ctcgaacaac agcaacccgg tcgaggatga agatgcggtg 3240
gccttgacgt gcgaacctga gatccagaac accacctact tgtggtgggt gaacaatcag 3300
agcctgccag tctccccacg actccagctg tcgaacgaca acaggaccct gactttgctg 3360
tccgtgactc ggaacgacgt gggcccttat gaatgcggta tccagaacaa gctgtccgtg 3420
gaccacagcg accctgtgat cctgaacgtc ctttacgggc cggacgaccc caccatttcc 3480
ccgtcgtaca cttactaccg gccgggcgtg aacctgtccc tgtcgtgcca cgctgcctcc 3540
aatccgccgg cccagtactc ctggctcatc gacggaaaca tccagcagca cacccaagaa 3600
ctgttcatct ccaacattac cgagaaaaac tcgggacttt acacctgtca agccaacaat 3660
tccgccagcg gccactcccg caccactgtc aaaactatca ctgtgtccgc cgaactcccg 3720
aagcccagca tcagctccaa caactcgaag cccgtggagg ataaggacgc tgtcgcgttc 3780
acctgtgaac cagaggcaca gaataccacc tacctttggt gggtcaacgg acagtccctg 3840
cctgtctcac cgagactgca gctgtcaaac gggaatagga ctctgacctt gtttaacgtc 3900
acccggaacg acgcccgggc ctacgtgtgc ggcatccaga actccgtgag cgcaaaccgg 3960
tctgacccag tgaccctgga tgtgctgtac ggccccgaca ctccgatcat ttcacccccc 4020
gattcatcct acctgtccgg cgctaacctc aacctctcat gccactccgc atccaacccc 4080
agcccgcaat attcgtggcg cattaacgga attcctcagc aacataccca ggtcctgttc 4140
attgcgaaga tcacccctaa caacaacgga acctacgcct gctttgtgtc aaacctggcc 4200
actggtagaa acaactccat cgtgaagtcc attaccgtgt cggcgtccgg aacttccccg 4260
ggcctgagcg ccggcgccac cgtgggaatt atgatcggcg tgctcgtggg agtggccctg 4320
atcggatccg gcgagggcag aggcagcctg ctgacatgtg gcgacgtgga agagaaccct 4380
ggccccaccc ctggaaccca gagccccttc ttccttctgc tgctgctgac cgtgctgact 4440
gtcgtgacag gctctggcca cgccagctct acacctggcg gcgagaaaga gacaagcgcc 4500
acccagagaa gcagcgtgcc aagcagcacc gagaagaacg ccgtgtccat gaccagctcc 4560
gtgctgagca gccactctcc tggcagcggc agcagcacaa cacagggcca ggatgtgaca 4620
ctggcccctg ccacagaacc tgcctctgga tctgccgcca cctggggaca ggacgtgaca 4680
agcgtgccag tgaccagacc tgccctgggc tctacaacac cccctgccca cgatgtgacc 4740
agcgcccctg ataacaagcc tgcccctgga agcacagccc ctccagctca tggcgtgacc 4800
tctgccccag ataccagacc agccccagga tctacagccc cacccgcaca cggcgtgaca 4860
agtgcccctg acacaagacc cgctccaggc tctactgctc ctcctgccca tggcgtgaca 4920
agcgctcccg atacaaggcc agctcctggc tccacagcac caccagcaca tggcgtgaca 4980
tcagctcccg acactagacc tgctcccgga tcaaccgctc caccagctca cggcgtgacc 5040
agcgcacctg ataccagacc tgctctggga agcaccgccc ctcccgtgca caatgtgaca 5100
tctgcttccg gcagcgccag cggctctgcc tctacactgg tgcacaacgg caccagcgcc 5160
agagccacaa caaccccagc cagcaagagc acccccttca gcatccctag ccaccacagc 5220
gacaccccta ccacactggc cagccactcc accaagaccg atgcctctag cacccaccac 5280
tccagcgtgc cccctctgac cagcagcaac cacagcacaa gcccccagct gtctaccggc 5340
gtctcattct tctttctgtc cttccacatc agcaacctgc agttcaacag cagcctggaa 5400
gatcccagca ccgactacta ccaggaactg cagcgggata tcagcgagat gttcctgcaa 5460
atctacaagc agggcggctt cctgggcctg agcaacatca agttcagacc cggcagcgtg 5520
gtggtgcagc tgaccctggc tttccgggaa ggcaccatca acgtgcacga cgtggaaacc 5580
cagttcaacc agtacaagac cgaggccgcc agccggtaca acctgaccat ctccgatgtg 5640
tccgtgtccg acgtgccctt cccattctct gcccagtctg gcgcaggcgt gccaggatgg 5700
ggaattgctc tgctggtgct cgtgtgcgtg ctggtggccc tggccatcgt gtatctgatt 5760
gccctggccg tgtgccagtg ccggcggaag aattacggcc agctggacat cttccccgcc 5820
agagacacct accaccccat gagcgagtac cccacatacc acacccacgg cagatacgtg 5880
ccacccagct ccaccgacag atccccctac gagaaagtgt ctgccggcaa cggcggcagc 5940
tccctgagct acacaaatcc tgccgtggcc gctgcctccg ccaacctg 5988
<210> 49
<211> 1996
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 49
Met Ala Ser Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln
1 5 10 15
Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly
20 25 30
Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu
35 40 45
Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg
50 55 60
Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp
65 70 75 80
Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr
85 90 95
Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser
100 105 110
Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe
115 120 125
Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg
130 135 140
Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu
145 150 155 160
Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys
165 170 175
Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu
180 185 190
Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro
195 200 205
Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val
210 215 220
Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu
225 230 235 240
Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys
245 250 255
Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr
260 265 270
Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly
275 280 285
Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu
290 295 300
Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu
305 310 315 320
Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu
325 330 335
Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile
340 345 350
Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu
355 360 365
Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu
370 375 380
Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp
385 390 395 400
Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg
405 410 415
Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg
420 425 430
Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp
435 440 445
Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp
450 455 460
Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr
465 470 475 480
Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile
485 490 495
Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys
500 505 510
Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr
515 520 525
Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln
530 535 540
Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser
545 550 555 560
Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met
565 570 575
Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln
580 585 590
Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys
595 600 605
Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly
610 615 620
Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu
625 630 635 640
Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu
645 650 655
Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val
660 665 670
Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His
675 680 685
Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu
690 695 700
Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser
705 710 715 720
Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys
725 730 735
Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu
740 745 750
Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu
755 760 765
Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe
770 775 780
His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser
785 790 795 800
Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly
805 810 815
Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala
820 825 830
Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His
835 840 845
Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr
850 855 860
Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala
865 870 875 880
Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp Gly
885 890 895
Ser Gly Gln Cys Thr Asn Tyr Ala Leu Leu Lys Leu Ala Gly Asp Val
900 905 910
Glu Ser Asn Pro Gly Pro Glu Ser Pro Ser Ala Pro Pro His Arg Trp
915 920 925
Cys Ile Pro Trp Gln Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe
930 935 940
Trp Asn Pro Pro Thr Thr Ala Lys Leu Thr Ile Glu Ser Thr Pro Phe
945 950 955 960
Asn Val Ala Glu Gly Lys Glu Val Leu Leu Leu Val His Asn Leu Pro
965 970 975
Gln His Leu Phe Gly Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly
980 985 990
Asn Arg Gln Ile Ile Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro
995 1000 1005
Gly Pro Ala Tyr Ser Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser
1010 1015 1020
Leu Leu Ile Gln Asn Ile Ile Gln Asn Asp Thr Gly Phe Tyr Thr
1025 1030 1035
Leu His Val Ile Lys Ser Asp Leu Val Asn Glu Glu Ala Thr Gly
1040 1045 1050
Gln Phe Arg Val Tyr Pro Glu Leu Pro Lys Pro Phe Ile Thr Ser
1055 1060 1065
Asn Asn Ser Asn Pro Val Glu Asp Glu Asp Ala Val Ala Leu Thr
1070 1075 1080
Cys Glu Pro Glu Ile Gln Asn Thr Thr Tyr Leu Trp Trp Val Asn
1085 1090 1095
Asn Gln Ser Leu Pro Val Ser Pro Arg Leu Gln Leu Ser Asn Asp
1100 1105 1110
Asn Arg Thr Leu Thr Leu Leu Ser Val Thr Arg Asn Asp Val Gly
1115 1120 1125
Pro Tyr Glu Cys Gly Ile Gln Asn Lys Leu Ser Val Asp His Ser
1130 1135 1140
Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro Asp Asp Pro Thr
1145 1150 1155
Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn Leu Ser
1160 1165 1170
Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser Trp
1175 1180 1185
Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe Ile
1190 1195 1200
Ser Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala
1205 1210 1215
Asn Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile
1220 1225 1230
Thr Val Ser Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn
1235 1240 1245
Ser Lys Pro Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu
1250 1255 1260
Pro Glu Ala Gln Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln
1265 1270 1275
Ser Leu Pro Val Ser Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg
1280 1285 1290
Thr Leu Thr Leu Phe Asn Val Thr Arg Asn Asp Ala Arg Ala Tyr
1295 1300 1305
Val Cys Gly Ile Gln Asn Ser Val Ser Ala Asn Arg Ser Asp Pro
1310 1315 1320
Val Thr Leu Asp Val Leu Tyr Gly Pro Asp Thr Pro Ile Ile Ser
1325 1330 1335
Pro Pro Asp Ser Ser Tyr Leu Ser Gly Ala Asn Leu Asn Leu Ser
1340 1345 1350
Cys His Ser Ala Ser Asn Pro Ser Pro Gln Tyr Ser Trp Arg Ile
1355 1360 1365
Asn Gly Ile Pro Gln Gln His Thr Gln Val Leu Phe Ile Ala Lys
1370 1375 1380
Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe Val Ser Asn
1385 1390 1395
Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys Ser Ile Thr Val
1400 1405 1410
Ser Ala Ser Gly Thr Ser Pro Gly Leu Ser Ala Gly Ala Thr Val
1415 1420 1425
Gly Ile Met Ile Gly Val Leu Val Gly Val Ala Leu Ile Gly Ser
1430 1435 1440
Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu
1445 1450 1455
Asn Pro Gly Pro Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu
1460 1465 1470
Leu Leu Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala
1475 1480 1485
Ser Ser Thr Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg
1490 1495 1500
Ser Ser Val Pro Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr
1505 1510 1515
Ser Ser Val Leu Ser Ser His Ser Pro Gly Ser Gly Ser Ser Thr
1520 1525 1530
Thr Gln Gly Gln Asp Val Thr Leu Ala Pro Ala Thr Glu Pro Ala
1535 1540 1545
Ser Gly Ser Ala Ala Thr Trp Gly Gln Asp Val Thr Ser Val Pro
1550 1555 1560
Val Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro Pro Ala His Asp
1565 1570 1575
Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser Thr Ala
1580 1585 1590
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala
1595 1600 1605
Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
1610 1615 1620
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly
1625 1630 1635
Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
1640 1645 1650
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala
1655 1660 1665
Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
1670 1675 1680
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn
1685 1690 1695
Val Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu
1700 1705 1710
Val His Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser
1715 1720 1725
Lys Ser Thr Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro
1730 1735 1740
Thr Thr Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr
1745 1750 1755
His His Ser Ser Val Pro Pro Leu Thr Ser Ser Asn His Ser Thr
1760 1765 1770
Ser Pro Gln Leu Ser Thr Gly Val Ser Phe Phe Phe Leu Ser Phe
1775 1780 1785
His Ile Ser Asn Leu Gln Phe Asn Ser Ser Leu Glu Asp Pro Ser
1790 1795 1800
Thr Asp Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser Glu Met Phe
1805 1810 1815
Leu Gln Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser Asn Ile
1820 1825 1830
Lys Phe Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu Ala Phe
1835 1840 1845
Arg Glu Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe Asn
1850 1855 1860
Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser
1865 1870 1875
Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser
1880 1885 1890
Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu Val
1895 1900 1905
Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
1910 1915 1920
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe
1925 1930 1935
Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr
1940 1945 1950
His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser
1955 1960 1965
Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser
1970 1975 1980
Tyr Thr Asn Pro Ala Val Ala Ala Ala Ser Ala Asn Leu
1985 1990 1995
<210> 50
<211> 5829
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 50
atggctagca cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 60
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 180
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 240
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 300
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 360
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 420
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 480
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 540
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 600
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 660
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 720
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 840
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 900
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 960
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 1020
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1080
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1140
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1200
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1260
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1320
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1380
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1440
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1500
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1560
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggatccggc 1620
agaatcttca acgcccacta cgccggctac ttcgccgacc tgctgatcca cgacatcgag 1680
acaaaccctg gccccaagct gaccattgag agcactccct tcaacgtggc tgaggggaag 1740
gaggtgctgc tcctggtgca caatctgccc cagcacctgt tcgggtactc ctggtacaag 1800
ggagaacgcg tggacgggaa ccggcagatc ataggctacg tcatcggaac ccagcaggcc 1860
acacccggtc cagcgtacag cggccgggag attatctacc cgaacgcctc cctgctgatc 1920
caaaacatca tccagaacga caccggtttc tacactctgc acgtgattaa gtcagatctg 1980
gtcaacgaag aggccaccgg ccaattcagg gtgtaccccg aactccctaa gccgttcatc 2040
acctcgaaca acagcaaccc ggtcgaggat gaagatgcgg tggccttgac gtgcgaacct 2100
gagatccaga acaccaccta cttgtggtgg gtgaacaatc agagcctgcc agtctcccca 2160
cgactccagc tgtcgaacga caacaggacc ctgactttgc tgtccgtgac tcggaacgac 2220
gtgggccctt atgaatgcgg tatccagaac aagctgtccg tggaccacag cgaccctgtg 2280
atcctgaacg tcctttacgg gccggacgac cccaccattt ccccgtcgta cacttactac 2340
cggccgggcg tgaacctgtc cctgtcgtgc cacgctgcct ccaatccgcc ggcccagtac 2400
tcctggctca tcgacggaaa catccagcag cacacccaag aactgttcat ctccaacatt 2460
accgagaaaa actcgggact ttacacctgt caagccaaca attccgccag cggccactcc 2520
cgcaccactg tcaaaactat cactgtgtcc gccgaactcc cgaagcccag catcagctcc 2580
aacaactcga agcccgtgga ggataaggac gctgtcgcgt tcacctgtga accagaggca 2640
cagaatacca cctacctttg gtgggtcaac ggacagtccc tgcctgtctc accgagactg 2700
cagctgtcaa acgggaatag gactctgacc ttgtttaacg tcacccggaa cgacgcccgg 2760
gcctacgtgt gcggcatcca gaactccgtg agcgcaaacc ggtctgaccc agtgaccctg 2820
gatgtgctgt acggccccga cactccgatc atttcacccc ccgattcatc ctacctgtcc 2880
ggcgctaacc tcaacctctc atgccactcc gcatccaacc ccagcccgca atattcgtgg 2940
cgcattaacg gaattcctca gcaacatacc caggtcctgt tcattgcgaa gatcacccct 3000
aacaacaacg gaacctacgc ctgctttgtg tcaaacctgg ccactggtag aaacaactcc 3060
atcgtgaagt ccattaccgt gtcggcgtcc ggatccggcg agggcagagg cagcctgctg 3120
acatgtggcg acgtggaaga gaaccctggc cccggagctg ccccggagcc ggagaggacc 3180
cccgttggcc agggatcgtg ggcccatccg ggacgcacca ggggaccatc cgacagggga 3240
ttctgtgtgg tgtcaccggc caggccagca gaagaggcaa ccagcctcga gggagcgttg 3300
tctggaacca gacattccca cccgtcggtg ggccggcagc accacgcggg accaccgtcc 3360
acttccagac cgccacggcc atgggacacc ccttgcccgc ctgtgtatgc cgagactaaa 3420
cacttcctgt actcatccgg agacaaggaa cagcttcggc cgtccttcct cctgtcgtcg 3480
ctcagaccga gcctgaccgg agcacgcaga ttggtggaaa ctatcttcct tgggtcacgt 3540
ccgtggatgc caggtacccc acggcgcctc ccgcgcctcc cacagagata ctggcagatg 3600
cggcctctgt tcctggaatt gctgggaaac cacgctcagt gcccgtacgg agtcctgctc 3660
aagactcact gccctctgag ggcggcggtc actccggcgg ccggagtgtg cgcacgggag 3720
aagccccagg gaagcgtggc agctccggaa gaggaggaca ccgatccgcg ccgcctcgtg 3780
caacttctgc gccagcactc ctcgccctgg caagtctacg ggttcgtccg cgcctgcctg 3840
cgccgcctgg tgccgcctgg gctctggggt tcccggcata acgagcgccg cttcctgaga 3900
aatactaaga agtttatctc acttggaaaa catgccaagt tgtcgctgca agaactcacg 3960
tggaagatgt cagtccgcga ttgcgcctgg ctgcgccgct cgccgggcgt cgggtgtgtt 4020
ccagctgcag aacaccgcct gagagaagaa attctggcca aatttctgca ttggctgatg 4080
tcagtgtacg tggtcgagct gctgcgctcc tttttctacg tcactgagac tacctttcaa 4140
aagaaccgcc tgttcttcta ccgcaaatct gtgtggagca agctgcagtc aatcggcatt 4200
cgccagcatc tgaagagggt gcagctgcgg gaactttccg aggcagaagt ccgccagcac 4260
cgggaggccc ggccggcgct tctcacgtcg cgtctgagat tcatcccaaa gcccgacggg 4320
ctgaggccta tcgtcaacat ggattacgtc gtgggcgctc gcacctttcg ccgtgaaaag 4380
cgggccgaac gcttgacctc acgggtgaag gccctcttct ccgtgctgaa ctacgagaga 4440
gcaagacggc ctggcctgct gggagcttcg gtgctgggac tggacgatat ccaccgggct 4500
tggcggacct ttgttctccg ggtgagagcc caagaccctc cgccggaact gtacttcgtg 4560
aaggtggcga tcaccggagc ctatgatact attccgcaag atcgactcac cgaagtcatc 4620
gcctcgatca tcaaaccgca gaacacttac tgcgtcaggc ggtacgccgt ggtccagaag 4680
gccgcgcatg gccacgtgag aaaggcgttc aagtcgcacg tgtccactct caccgacctc 4740
cagccttaca tgaggcaatt cgttgcgcat ttgcaagaga cttcgcccct gagagatgcg 4800
gtggtcatcg agcagagctc cagcctgaac gaagcgagca gcggtctgtt tgacgtgttc 4860
ctccgcttca tgtgtcatca cgcggtgcga atcaggggaa aatcatacgt gcagtgccag 4920
ggaatcccac aaggcagcat tctgtcgact ctcttgtgtt ccctttgcta cggcgatatg 4980
gaaaacaagc tgttcgctgg gatcagacgg gacgggttgc tgctcagact ggtggacgac 5040
ttcctgctgg tgactccgca cctcactcac gccaaaacct ttctccgcac tctggtgagg 5100
ggagtgccag aatacggctg tgtggtcaat ctccggaaaa ctgtggtgaa tttccctgtc 5160
gaggatgagg cactcggagg aaccgcattt gtccaaatgc cagcacatgg cctgttccca 5220
tggtgcggtc tgctgctgga cacccgaact cttgaagtgc agtccgacta ctccagctat 5280
gcccggacga gcatccgcgc cagcctcact ttcaatcgcg gctttaaggc cggacgaaac 5340
atgcgcagaa agcttttcgg agtcctccgg cttaaatgcc attcgctctt tctcgatctc 5400
caagtcaatt cgctgcagac cgtgtgcacg aacatctaca agatcctgct gctccaagcc 5460
taccggttcc acgcttgcgt gcttcagctg ccgtttcacc aacaggtgtg gaagaacccg 5520
accttctttc tgcgggtcat tagcgatact gcctccctgt gttactcaat cctcaaggca 5580
aagaacgccg gaatgtcgct gggtgcgaaa ggagccgcgg gacctcttcc tagcgaagcg 5640
gtgcagtggc tctgccacca ggctttcctc ctgaagctga ccaggcacag agtgacctac 5700
gtcccgctgc tgggctcgct gcgcactgca cagacccagc tgtctagaaa actccccggc 5760
accaccctga ccgctctgga agccgccgcc aacccagcat tgccgtcaga tttcaagacc 5820
atcttggac 5829
<210> 51
<211> 1943
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 51
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1 5 10 15
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
20 25 30
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
35 40 45
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
50 55 60
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
65 70 75 80
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
85 90 95
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
100 105 110
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
210 215 220
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
225 230 235 240
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
245 250 255
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
260 265 270
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
275 280 285
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
290 295 300
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
305 310 315 320
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
325 330 335
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
340 345 350
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
355 360 365
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
370 375 380
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
385 390 395 400
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
405 410 415
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
420 425 430
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
435 440 445
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
450 455 460
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro
465 470 475 480
Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
485 490 495
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
500 505 510
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
515 520 525
Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Ser Gly Arg Ile Phe Asn
530 535 540
Ala His Tyr Ala Gly Tyr Phe Ala Asp Leu Leu Ile His Asp Ile Glu
545 550 555 560
Thr Asn Pro Gly Pro Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val
565 570 575
Ala Glu Gly Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His
580 585 590
Leu Phe Gly Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg
595 600 605
Gln Ile Ile Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro
610 615 620
Ala Tyr Ser Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile
625 630 635 640
Gln Asn Ile Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile
645 650 655
Lys Ser Asp Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr
660 665 670
Pro Glu Leu Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro Val
675 680 685
Glu Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln Asn
690 695 700
Thr Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro
705 710 715 720
Arg Leu Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val
725 730 735
Thr Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Lys Leu
740 745 750
Ser Val Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro
755 760 765
Asp Asp Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val
770 775 780
Asn Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr
785 790 795 800
Ser Trp Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe
805 810 815
Ile Ser Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala
820 825 830
Asn Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile Thr
835 840 845
Val Ser Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys
850 855 860
Pro Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala
865 870 875 880
Gln Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val
885 890 895
Ser Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe
900 905 910
Asn Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln Asn
915 920 925
Ser Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr
930 935 940
Gly Pro Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu Ser
945 950 955 960
Gly Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro
965 970 975
Gln Tyr Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr Gln Val
980 985 990
Leu Phe Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys
995 1000 1005
Phe Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys
1010 1015 1020
Ser Ile Thr Val Ser Ala Ser Gly Ser Gly Glu Gly Arg Gly Ser
1025 1030 1035
Leu Leu Thr Cys Gly Asp Val Glu Glu Asn Pro Gly Pro Gly Ala
1040 1045 1050
Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln Gly Ser Trp Ala
1055 1060 1065
His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val
1070 1075 1080
Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly
1085 1090 1095
Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gln
1100 1105 1110
His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp
1115 1120 1125
Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu
1130 1135 1140
Tyr Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu
1145 1150 1155
Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu
1160 1165 1170
Thr Ile Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg
1175 1180 1185
Arg Leu Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu
1190 1195 1200
Phe Leu Glu Leu Leu Gly Asn His Ala Gln Cys Pro Tyr Gly Val
1205 1210 1215
Leu Leu Lys Thr His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala
1220 1225 1230
Ala Gly Val Cys Ala Arg Glu Lys Pro Gln Gly Ser Val Ala Ala
1235 1240 1245
Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu Val Gln Leu Leu
1250 1255 1260
Arg Gln His Ser Ser Pro Trp Gln Val Tyr Gly Phe Val Arg Ala
1265 1270 1275
Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg His
1280 1285 1290
Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe Ile Ser Leu
1295 1300 1305
Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr Trp Lys Met
1310 1315 1320
Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly
1325 1330 1335
Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu Ala
1340 1345 1350
Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu
1355 1360 1365
Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg
1370 1375 1380
Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile
1385 1390 1395
Gly Ile Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser
1400 1405 1410
Glu Ala Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu
1415 1420 1425
Thr Ser Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro
1430 1435 1440
Ile Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg
1445 1450 1455
Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe
1460 1465 1470
Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly
1475 1480 1485
Ala Ser Val Leu Gly Leu Asp Asp Ile His Arg Ala Trp Arg Thr
1490 1495 1500
Phe Val Leu Arg Val Arg Ala Gln Asp Pro Pro Pro Glu Leu Tyr
1505 1510 1515
Phe Val Lys Val Ala Ile Thr Gly Ala Tyr Asp Thr Ile Pro Gln
1520 1525 1530
Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile Lys Pro Gln Asn
1535 1540 1545
Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys Ala Ala His
1550 1555 1560
Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr
1565 1570 1575
Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln Glu
1580 1585 1590
Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser
1595 1600 1605
Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe
1610 1615 1620
Met Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln
1625 1630 1635
Cys Gln Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys
1640 1645 1650
Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile
1655 1660 1665
Arg Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu
1670 1675 1680
Val Thr Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr Leu
1685 1690 1695
Val Arg Gly Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg Lys
1700 1705 1710
Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr
1715 1720 1725
Ala Phe Val Gln Met Pro Ala His Gly Leu Phe Pro Trp Cys Gly
1730 1735 1740
Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gln Ser Asp Tyr Ser
1745 1750 1755
Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe Asn Arg
1760 1765 1770
Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val
1775 1780 1785
Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val Asn
1790 1795 1800
Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu Leu
1805 1810 1815
Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe His
1820 1825 1830
Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser
1835 1840 1845
Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala
1850 1855 1860
Gly Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser
1865 1870 1875
Glu Ala Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu
1880 1885 1890
Thr Arg His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg
1895 1900 1905
Thr Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu
1910 1915 1920
Thr Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe
1925 1930 1935
Lys Thr Ile Leu Asp
1940
<210> 52
<211> 5829
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 52
atggctagca agctgaccat tgagagcact cccttcaacg tggctgaggg gaaggaggtg 60
ctgctcctgg tgcacaatct gccccagcac ctgttcgggt actcctggta caagggagaa 120
cgcgtggacg ggaaccggca gatcataggc tacgtcatcg gaacccagca ggccacaccc 180
ggtccagcgt acagcggccg ggagattatc tacccgaacg cctccctgct gatccaaaac 240
atcatccaga acgacaccgg tttctacact ctgcacgtga ttaagtcaga tctggtcaac 300
gaagaggcca ccggccaatt cagggtgtac cccgaactcc ctaagccgtt catcacctcg 360
aacaacagca acccggtcga ggatgaagat gcggtggcct tgacgtgcga acctgagatc 420
cagaacacca cctacttgtg gtgggtgaac aatcagagcc tgccagtctc cccacgactc 480
cagctgtcga acgacaacag gaccctgact ttgctgtccg tgactcggaa cgacgtgggc 540
ccttatgaat gcggtatcca gaacaagctg tccgtggacc acagcgaccc tgtgatcctg 600
aacgtccttt acgggccgga cgaccccacc atttccccgt cgtacactta ctaccggccg 660
ggcgtgaacc tgtccctgtc gtgccacgct gcctccaatc cgccggccca gtactcctgg 720
ctcatcgacg gaaacatcca gcagcacacc caagaactgt tcatctccaa cattaccgag 780
aaaaactcgg gactttacac ctgtcaagcc aacaattccg ccagcggcca ctcccgcacc 840
actgtcaaaa ctatcactgt gtccgccgaa ctcccgaagc ccagcatcag ctccaacaac 900
tcgaagcccg tggaggataa ggacgctgtc gcgttcacct gtgaaccaga ggcacagaat 960
accacctacc tttggtgggt caacggacag tccctgcctg tctcaccgag actgcagctg 1020
tcaaacggga ataggactct gaccttgttt aacgtcaccc ggaacgacgc ccgggcctac 1080
gtgtgcggca tccagaactc cgtgagcgca aaccggtctg acccagtgac cctggatgtg 1140
ctgtacggcc ccgacactcc gatcatttca ccccccgatt catcctacct gtccggcgct 1200
aacctcaacc tctcatgcca ctccgcatcc aaccccagcc cgcaatattc gtggcgcatt 1260
aacggaattc ctcagcaaca tacccaggtc ctgttcattg cgaagatcac ccctaacaac 1320
aacggaacct acgcctgctt tgtgtcaaac ctggccactg gtagaaacaa ctccatcgtg 1380
aagtccatta ccgtgtcggc gtccggatcc ggcgagggca gaggcagcct gctgacatgt 1440
ggcgacgtgg aagagaaccc tggccccgga gctgccccgg agccggagag gacccccgtt 1500
ggccagggat cgtgggccca tccgggacgc accaggggac catccgacag gggattctgt 1560
gtggtgtcac cggccaggcc agcagaagag gcaaccagcc tcgagggagc gttgtctgga 1620
accagacatt cccacccgtc ggtgggccgg cagcaccacg cgggaccacc gtccacttcc 1680
agaccgccac ggccatggga caccccttgc ccgcctgtgt atgccgagac taaacacttc 1740
ctgtactcat ccggagacaa ggaacagctt cggccgtcct tcctcctgtc gtcgctcaga 1800
ccgagcctga ccggagcacg cagattggtg gaaactatct tccttgggtc acgtccgtgg 1860
atgccaggta ccccacggcg cctcccgcgc ctcccacaga gatactggca gatgcggcct 1920
ctgttcctgg aattgctggg aaaccacgct cagtgcccgt acggagtcct gctcaagact 1980
cactgccctc tgagggcggc ggtcactccg gcggccggag tgtgcgcacg ggagaagccc 2040
cagggaagcg tggcagctcc ggaagaggag gacaccgatc cgcgccgcct cgtgcaactt 2100
ctgcgccagc actcctcgcc ctggcaagtc tacgggttcg tccgcgcctg cctgcgccgc 2160
ctggtgccgc ctgggctctg gggttcccgg cataacgagc gccgcttcct gagaaatact 2220
aagaagttta tctcacttgg aaaacatgcc aagttgtcgc tgcaagaact cacgtggaag 2280
atgtcagtcc gcgattgcgc ctggctgcgc cgctcgccgg gcgtcgggtg tgttccagct 2340
gcagaacacc gcctgagaga agaaattctg gccaaatttc tgcattggct gatgtcagtg 2400
tacgtggtcg agctgctgcg ctcctttttc tacgtcactg agactacctt tcaaaagaac 2460
cgcctgttct tctaccgcaa atctgtgtgg agcaagctgc agtcaatcgg cattcgccag 2520
catctgaaga gggtgcagct gcgggaactt tccgaggcag aagtccgcca gcaccgggag 2580
gcccggccgg cgcttctcac gtcgcgtctg agattcatcc caaagcccga cgggctgagg 2640
cctatcgtca acatggatta cgtcgtgggc gctcgcacct ttcgccgtga aaagcgggcc 2700
gaacgcttga cctcacgggt gaaggccctc ttctccgtgc tgaactacga gagagcaaga 2760
cggcctggcc tgctgggagc ttcggtgctg ggactggacg atatccaccg ggcttggcgg 2820
acctttgttc tccgggtgag agcccaagac cctccgccgg aactgtactt cgtgaaggtg 2880
gcgatcaccg gagcctatga tactattccg caagatcgac tcaccgaagt catcgcctcg 2940
atcatcaaac cgcagaacac ttactgcgtc aggcggtacg ccgtggtcca gaaggccgcg 3000
catggccacg tgagaaaggc gttcaagtcg cacgtgtcca ctctcaccga cctccagcct 3060
tacatgaggc aattcgttgc gcatttgcaa gagacttcgc ccctgagaga tgcggtggtc 3120
atcgagcaga gctccagcct gaacgaagcg agcagcggtc tgtttgacgt gttcctccgc 3180
ttcatgtgtc atcacgcggt gcgaatcagg ggaaaatcat acgtgcagtg ccagggaatc 3240
ccacaaggca gcattctgtc gactctcttg tgttcccttt gctacggcga tatggaaaac 3300
aagctgttcg ctgggatcag acgggacggg ttgctgctca gactggtgga cgacttcctg 3360
ctggtgactc cgcacctcac tcacgccaaa acctttctcc gcactctggt gaggggagtg 3420
ccagaatacg gctgtgtggt caatctccgg aaaactgtgg tgaatttccc tgtcgaggat 3480
gaggcactcg gaggaaccgc atttgtccaa atgccagcac atggcctgtt cccatggtgc 3540
ggtctgctgc tggacacccg aactcttgaa gtgcagtccg actactccag ctatgcccgg 3600
acgagcatcc gcgccagcct cactttcaat cgcggcttta aggccggacg aaacatgcgc 3660
agaaagcttt tcggagtcct ccggcttaaa tgccattcgc tctttctcga tctccaagtc 3720
aattcgctgc agaccgtgtg cacgaacatc tacaagatcc tgctgctcca agcctaccgg 3780
ttccacgctt gcgtgcttca gctgccgttt caccaacagg tgtggaagaa cccgaccttc 3840
tttctgcggg tcattagcga tactgcctcc ctgtgttact caatcctcaa ggcaaagaac 3900
gccggaatgt cgctgggtgc gaaaggagcc gcgggacctc ttcctagcga agcggtgcag 3960
tggctctgcc accaggcttt cctcctgaag ctgaccaggc acagagtgac ctacgtcccg 4020
ctgctgggct cgctgcgcac tgcacagacc cagctgtcta gaaaactccc cggcaccacc 4080
ctgaccgctc tggaagccgc cgccaaccca gcattgccgt cagatttcaa gaccatcttg 4140
gacggatccg gcacaatcct gtctgagggc gccaccaact tcagcctgct gaaactggcc 4200
ggcgacgtgg aactgaaccc tggccctacc cctggaaccc agagcccctt cttccttctg 4260
ctgctgctga ccgtgctgac tgtcgtgaca ggctctggcc acgccagctc tacacctggc 4320
ggcgagaaag agacaagcgc cacccagaga agcagcgtgc caagcagcac cgagaagaac 4380
gccgtgtcca tgaccagctc cgtgctgagc agccactctc ctggcagcgg cagcagcaca 4440
acacagggcc aggatgtgac actggcccct gccacagaac ctgcctctgg atctgccgcc 4500
acctggggac aggacgtgac aagcgtgcca gtgaccagac ctgccctggg ctctacaaca 4560
ccccctgccc acgatgtgac cagcgcccct gataacaagc ctgcccctgg aagcacagcc 4620
cctccagctc atggcgtgac ctctgcccca gataccagac cagccccagg atctacagcc 4680
ccacccgcac acggcgtgac aagtgcccct gacacaagac ccgctccagg ctctactgct 4740
cctcctgccc atggcgtgac aagcgctccc gatacaaggc cagctcctgg ctccacagca 4800
ccaccagcac atggcgtgac atcagctccc gacactagac ctgctcccgg atcaaccgct 4860
ccaccagctc acggcgtgac cagcgcacct gataccagac ctgctctggg aagcaccgcc 4920
cctcccgtgc acaatgtgac atctgcttcc ggcagcgcca gcggctctgc ctctacactg 4980
gtgcacaacg gcaccagcgc cagagccaca acaaccccag ccagcaagag cacccccttc 5040
agcatcccta gccaccacag cgacacccct accacactgg ccagccactc caccaagacc 5100
gatgcctcta gcacccacca ctccagcgtg ccccctctga ccagcagcaa ccacagcaca 5160
agcccccagc tgtctaccgg cgtctcattc ttctttctgt ccttccacat cagcaacctg 5220
cagttcaaca gcagcctgga agatcccagc accgactact accaggaact gcagcgggat 5280
atcagcgaga tgttcctgca aatctacaag cagggcggct tcctgggcct gagcaacatc 5340
aagttcagac ccggcagcgt ggtggtgcag ctgaccctgg ctttccggga aggcaccatc 5400
aacgtgcacg acgtggaaac ccagttcaac cagtacaaga ccgaggccgc cagccggtac 5460
aacctgacca tctccgatgt gtccgtgtcc gacgtgccct tcccattctc tgcccagtct 5520
ggcgcaggcg tgccaggatg gggaattgct ctgctggtgc tcgtgtgcgt gctggtggcc 5580
ctggccatcg tgtatctgat tgccctggcc gtgtgccagt gccggcggaa gaattacggc 5640
cagctggaca tcttccccgc cagagacacc taccacccca tgagcgagta ccccacatac 5700
cacacccacg gcagatacgt gccacccagc tccaccgaca gatcccccta cgagaaagtg 5760
tctgccggca acggcggcag ctccctgagc tacacaaatc ctgccgtggc cgctgcctcc 5820
gccaacctg 5829
<210> 53
<211> 1943
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 53
Met Ala Ser Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala Glu
1 5 10 15
Gly Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His Leu Phe
20 25 30
Gly Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gln Ile
35 40 45
Ile Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala Tyr
50 55 60
Ser Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln Asn
65 70 75 80
Ile Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile Lys Ser
85 90 95
Asp Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr Pro Glu
100 105 110
Leu Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro Val Glu Asp
115 120 125
Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln Asn Thr Thr
130 135 140
Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg Leu
145 150 155 160
Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr Arg
165 170 175
Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Lys Leu Ser Val
180 185 190
Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro Asp Asp
195 200 205
Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn Leu
210 215 220
Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser Trp
225 230 235 240
Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe Ile Ser
245 250 255
Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala Asn Asn
260 265 270
Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile Thr Val Ser
275 280 285
Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro Val
290 295 300
Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gln Asn
305 310 315 320
Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val Ser Pro
325 330 335
Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn Val
340 345 350
Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln Asn Ser Val
355 360 365
Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly Pro
370 375 380
Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly Ala
385 390 395 400
Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gln Tyr
405 410 415
Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr Gln Val Leu Phe
420 425 430
Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe Val
435 440 445
Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys Ser Ile Thr
450 455 460
Val Ser Ala Ser Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys
465 470 475 480
Gly Asp Val Glu Glu Asn Pro Gly Pro Gly Ala Ala Pro Glu Pro Glu
485 490 495
Arg Thr Pro Val Gly Gln Gly Ser Trp Ala His Pro Gly Arg Thr Arg
500 505 510
Gly Pro Ser Asp Arg Gly Phe Cys Val Val Ser Pro Ala Arg Pro Ala
515 520 525
Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu Ser Gly Thr Arg His Ser
530 535 540
His Pro Ser Val Gly Arg Gln His His Ala Gly Pro Pro Ser Thr Ser
545 550 555 560
Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu
565 570 575
Thr Lys His Phe Leu Tyr Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro
580 585 590
Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg
595 600 605
Leu Val Glu Thr Ile Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr
610 615 620
Pro Arg Arg Leu Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro
625 630 635 640
Leu Phe Leu Glu Leu Leu Gly Asn His Ala Gln Cys Pro Tyr Gly Val
645 650 655
Leu Leu Lys Thr His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala Ala
660 665 670
Gly Val Cys Ala Arg Glu Lys Pro Gln Gly Ser Val Ala Ala Pro Glu
675 680 685
Glu Glu Asp Thr Asp Pro Arg Arg Leu Val Gln Leu Leu Arg Gln His
690 695 700
Ser Ser Pro Trp Gln Val Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg
705 710 715 720
Leu Val Pro Pro Gly Leu Trp Gly Ser Arg His Asn Glu Arg Arg Phe
725 730 735
Leu Arg Asn Thr Lys Lys Phe Ile Ser Leu Gly Lys His Ala Lys Leu
740 745 750
Ser Leu Gln Glu Leu Thr Trp Lys Met Ser Val Arg Asp Cys Ala Trp
755 760 765
Leu Arg Arg Ser Pro Gly Val Gly Cys Val Pro Ala Ala Glu His Arg
770 775 780
Leu Arg Glu Glu Ile Leu Ala Lys Phe Leu His Trp Leu Met Ser Val
785 790 795 800
Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr
805 810 815
Phe Gln Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys
820 825 830
Leu Gln Ser Ile Gly Ile Arg Gln His Leu Lys Arg Val Gln Leu Arg
835 840 845
Glu Leu Ser Glu Ala Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala
850 855 860
Leu Leu Thr Ser Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg
865 870 875 880
Pro Ile Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg
885 890 895
Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser
900 905 910
Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser
915 920 925
Val Leu Gly Leu Asp Asp Ile His Arg Ala Trp Arg Thr Phe Val Leu
930 935 940
Arg Val Arg Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val Lys Val
945 950 955 960
Ala Ile Thr Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu
965 970 975
Val Ile Ala Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg
980 985 990
Tyr Ala Val Val Gln Lys Ala Ala His Gly His Val Arg Lys Ala Phe
995 1000 1005
Lys Ser His Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met Arg
1010 1015 1020
Gln Phe Val Ala His Leu Gln Glu Thr Ser Pro Leu Arg Asp Ala
1025 1030 1035
Val Val Ile Glu Gln Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly
1040 1045 1050
Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala Val Arg
1055 1060 1065
Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln Gly Ile Pro Gln Gly
1070 1075 1080
Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met
1085 1090 1095
Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu Leu
1100 1105 1110
Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His
1115 1120 1125
Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr
1130 1135 1140
Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val
1145 1150 1155
Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala
1160 1165 1170
His Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr
1175 1180 1185
Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile
1190 1195 1200
Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn
1205 1210 1215
Met Arg Arg Lys Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser
1220 1225 1230
Leu Phe Leu Asp Leu Gln Val Asn Ser Leu Gln Thr Val Cys Thr
1235 1240 1245
Asn Ile Tyr Lys Ile Leu Leu Leu Gln Ala Tyr Arg Phe His Ala
1250 1255 1260
Cys Val Leu Gln Leu Pro Phe His Gln Gln Val Trp Lys Asn Pro
1265 1270 1275
Thr Phe Phe Leu Arg Val Ile Ser Asp Thr Ala Ser Leu Cys Tyr
1280 1285 1290
Ser Ile Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly Ala Lys
1295 1300 1305
Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gln Trp Leu Cys
1310 1315 1320
His Gln Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr
1325 1330 1335
Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr Gln Leu Ser
1340 1345 1350
Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala
1355 1360 1365
Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp Gly Ser
1370 1375 1380
Gly Thr Ile Leu Ser Glu Gly Ala Thr Asn Phe Ser Leu Leu Lys
1385 1390 1395
Leu Ala Gly Asp Val Glu Leu Asn Pro Gly Pro Thr Pro Gly Thr
1400 1405 1410
Gln Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr Val Leu Thr Val
1415 1420 1425
Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly Gly Glu Lys
1430 1435 1440
Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser Ser Thr Glu
1445 1450 1455
Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His Ser
1460 1465 1470
Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr Leu
1475 1480 1485
Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly
1490 1495 1500
Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
1505 1510 1515
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys
1520 1525 1530
Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
1535 1540 1545
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala
1550 1555 1560
His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
1565 1570 1575
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg
1580 1585 1590
Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
1595 1600 1605
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala
1610 1615 1620
His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Leu Gly Ser
1625 1630 1635
Thr Ala Pro Pro Val His Asn Val Thr Ser Ala Ser Gly Ser Ala
1640 1645 1650
Ser Gly Ser Ala Ser Thr Leu Val His Asn Gly Thr Ser Ala Arg
1655 1660 1665
Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro Phe Ser Ile Pro
1670 1675 1680
Ser His His Ser Asp Thr Pro Thr Thr Leu Ala Ser His Ser Thr
1685 1690 1695
Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val Pro Pro Leu
1700 1705 1710
Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr Gly Val
1715 1720 1725
Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe Asn
1730 1735 1740
Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
1745 1750 1755
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly
1760 1765 1770
Phe Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val
1775 1780 1785
Val Gln Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His
1790 1795 1800
Asp Val Glu Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser
1805 1810 1815
Arg Tyr Asn Leu Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro
1820 1825 1830
Phe Pro Phe Ser Ala Gln Ser Gly Ala Gly Val Pro Gly Trp Gly
1835 1840 1845
Ile Ala Leu Leu Val Leu Val Cys Val Leu Val Ala Leu Ala Ile
1850 1855 1860
Val Tyr Leu Ile Ala Leu Ala Val Cys Gln Cys Arg Arg Lys Asn
1865 1870 1875
Tyr Gly Gln Leu Asp Ile Phe Pro Ala Arg Asp Thr Tyr His Pro
1880 1885 1890
Met Ser Glu Tyr Pro Thr Tyr His Thr His Gly Arg Tyr Val Pro
1895 1900 1905
Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu Lys Val Ser Ala Gly
1910 1915 1920
Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro Ala Val Ala Ala
1925 1930 1935
Ala Ser Ala Asn Leu
1940
<210> 54
<211> 5859
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 54
ggcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 60
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 120
agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 180
tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg 240
tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat 300
ggcaaaagct tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca 360
tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 420
aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcaaatgcaa ccggcgcagg 480
aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg 540
aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 600
aaatgcttga tggtcggaag aggcataaat tccgtcagcc agtttagtct gaccatctca 660
tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg 720
ggcttcccat acaatcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat 780
ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga gcaagacgtt 840
tcccgttgaa tatggctcat aacacccctt gtattactgt ttatgtaagc agacaggtcg 900
acaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg tacatttata 960
ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt tattaatagt 1020
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 1080
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 1140
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 1200
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt ccgcccccta 1260
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttacggg 1320
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 1380
tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 1440
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 1500
gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 1560
atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 1620
ttgacctcca tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 1680
gaacgcggat tccccgtgcc aagagtgact caccgtccgg atctcagcaa gcaggtatgt 1740
actctccagg gtgggcctgg cttccccagt caagactcca gggatttgag ggacgctgtg 1800
ggctcttctc ttacatgtac cttttgcttg cctcaaccct gactatcttc caggtcagga 1860
tcccagagtc aggggtctgt attttcctgc tggtggctcc agttcaggaa cagtaaaccc 1920
tgctccgaat attgcctctc acatctcgtc aatctccgcg aggactgggg accctgtgac 1980
gaacatggct agcgaatcgc caagcgcacc ccctcatcgg tggtgcatcc cttggcaacg 2040
cctcctcctg accgcctcac tgctgacttt ctggaacccg ccgaccaccg caaagctgac 2100
cattgagagc actcccttca acgtggctga ggggaaggag gtgctgctcc tggtgcacaa 2160
tctgccccag cacctgttcg ggtactcctg gtacaaggga gaacgcgtgg acgggaaccg 2220
gcagatcata ggctacgtca tcggaaccca gcaggccaca cccggtccag cgtacagcgg 2280
ccgggagatt atctacccga acgcctccct gctgatccaa aacatcatcc agaacgacac 2340
cggtttctac actctgcacg tgattaagtc agatctggtc aacgaagagg ccaccggcca 2400
attcagggtg taccccgaac tcccaaagcc gtccatttca agcaacaact ccaagccggt 2460
ggaggacaaa gacgccgtgg ccttcacttg tgaacctgaa acccaggacg ccacttacct 2520
ttggtgggtg aacaaccagt cgctccccgt gtcgccgagg ctgcagctca gcaacggaaa 2580
cagaacgctg accctcttca atgtgacccg caatgatacc gcctcctata agtgcgaaac 2640
ccagaatccg gtgtccgccc ggcgctcgga tagcgtgatt ctgaacgtgc tctacggccc 2700
tgacgccccc actatctccc ctctgaacac ttcctaccgg tccggagaga acctgaacct 2760
gagctgccac gcggcgtcca acccgcccgc ccagtacagc tggttcgtga atgggacgtt 2820
ccagcagtcc acccaggagc tgtttatccc taacattacc gtcaacaact ctggatcgta 2880
cacatgccaa gcgcataact cggacactgg gcttaacaga accaccgtga caaccatcac 2940
tgtgtatgcg gaacctccta agccgttcat cacctcgaac aacagcaacc cggtcgagga 3000
tgaagatgcg gtggccttga cgtgcgaacc tgagatccag aacaccacct acttgtggtg 3060
ggtgaacaat cagagcctgc cagtctcccc acgactccag ctgtcgaacg acaacaggac 3120
cctgactttg ctgtccgtga ctcggaacga cgtgggccct tatgaatgcg gtatccagaa 3180
caagctgtcc gtggaccaca gcgaccctgt gatcctgaac gtcctttacg ggccggacga 3240
ccccaccatt tccccgtcgt acacttacta ccggccgggc gtgaacctgt ccctgtcgtg 3300
ccacgctgcc tccaatccgc cggcccagta ctcctggctc atcgacggaa acatccagca 3360
gcacacccaa gaactgttca tctccaacat taccgagaaa aactcgggac tttacacctg 3420
tcaagccaac aattccgcca gcggccactc ccgcaccact gtcaaaacta tcactgtgtc 3480
cgccgaactc ccgaagccca gcatcagctc caacaactcg aagcccgtgg aggataagga 3540
cgctgtcgcg ttcacctgtg aaccagaggc acagaatacc acctaccttt ggtgggtcaa 3600
cggacagtcc ctgcctgtct caccgagact gcagctgtca aacgggaata ggactctgac 3660
cttgtttaac gtcacccgga acgacgcccg ggcctacgtg tgcggcatcc agaactccgt 3720
gagcgcaaac cggtctgacc cagtgaccct ggatgtgctg tacggccccg acactccgat 3780
catttcaccc cccgattcat cctacctgtc cggcgctaac ctcaacctct catgccactc 3840
cgcatccaac cccagcccgc aatattcgtg gcgcattaac ggaattcctc agcaacatac 3900
ccaggtcctg ttcattgcga agatcacccc taacaacaac ggaacctacg cctgctttgt 3960
gtcaaacctg gccactggta gaaacaactc catcgtgaag tccattaccg tgtcggcgtc 4020
cggaacttcc ccgggcctga gcgccggcgc caccgtggga attatgatcg gcgtgctcgt 4080
gggagtggcc ctgatctgaa gatctgggcc ctaacaaaac aaaaagatgg ggttattccc 4140
taaacttcat gggttacgta attggaagtt gggggacatt gccacaagat catattgtac 4200
aaaagatcaa acactgtttt agaaaacttc ctgtaaacag gcctattgat tggaaagtat 4260
gtcaaaggat tgtgggtctt ttgggctttg ctgctccatt tacacaatgt ggatatcctg 4320
ccttaatgcc tttgtatgca tgtatacaag ctaaacaggc tttcactttc tcgccaactt 4380
acaaggcctt tctaagtaaa cagtacatga acctttaccc cgttgctcgg caacggcctg 4440
gtctgtgcca agtgtttgct gacgcaaccc ccactggctg gggcttggcc ataggccatc 4500
agcgcatgcg tggaaccttt gtggctcctc tgccgatcca tactgcggaa ctcctagccg 4560
cttgttttgc tcgcagccgg tctggagcaa agctcatagg aactgacaat tctgtcgtcc 4620
tctcgcggaa atatacatcg tttcgatcta cgtatgatct ttttccctct gccaaaaatt 4680
atggggacat catgaagccc cttgagcatc tgacttctgg ctaataaagg aaatttattt 4740
tcattgcaat agtgtgttgg aattttttgt gtctctcact cggaaggaat tctgcattaa 4800
tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg 4860
ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag 4920
gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa 4980
ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc 5040
cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca 5100
ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg 5160
accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct 5220
catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt 5280
gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag 5340
tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc 5400
agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac 5460
actagaagaa cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga 5520
gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc 5580
aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg 5640
gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca 5700
aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt 5760
atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca 5820
gcgatctgtc tatttcgttc atccatagtt gcctgactc 5859
<210> 55
<211> 5151
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 55
ggcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 60
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 120
agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 180
tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg 240
tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat 300
ggcaaaagct tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca 360
tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 420
aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcaaatgcaa ccggcgcagg 480
aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg 540
aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 600
aaatgcttga tggtcggaag aggcataaat tccgtcagcc agtttagtct gaccatctca 660
tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg 720
ggcttcccat acaatcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat 780
ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga gcaagacgtt 840
tcccgttgaa tatggctcat aacacccctt gtattactgt ttatgtaagc agacaggtcg 900
acaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg tacatttata 960
ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt tattaatagt 1020
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 1080
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 1140
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 1200
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt ccgcccccta 1260
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttacggg 1320
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 1380
tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 1440
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 1500
gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 1560
atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 1620
ttgacctcca tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 1680
gaacgcggat tccccgtgcc aagagtgact caccgtccgg atctcagcaa gcaggtatgt 1740
actctccagg gtgggcctgg cttccccagt caagactcca gggatttgag ggacgctgtg 1800
ggctcttctc ttacatgtac cttttgcttg cctcaaccct gactatcttc caggtcagga 1860
tcccagagtc aggggtctgt attttcctgc tggtggctcc agttcaggaa cagtaaaccc 1920
tgctccgaat attgcctctc acatctcgtc aatctccgcg aggactgggg accctgtgac 1980
gaacatggct agcaagctga ccattgagag cactcccttc aacgtggctg aggggaagga 2040
ggtgctgctc ctggtgcaca atctgcccca gcacctgttc gggtactcct ggtacaaggg 2100
agaacgcgtg gacgggaacc ggcagatcat aggctacgtc atcggaaccc agcaggccac 2160
acccggtcca gcgtacagcg gccgggagat tatctacccg aacgcctccc tgctgatcca 2220
aaacatcatc cagaacgaca ccggtttcta cactctgcac gtgattaagt cagatctggt 2280
caacgaagag gccaccggcc aattcagggt gtaccccgaa ctccctaagc cgttcatcac 2340
ctcgaacaac agcaacccgg tcgaggatga agatgcggtg gccttgacgt gcgaacctga 2400
gatccagaac accacctact tgtggtgggt gaacaatcag agcctgccag tctccccacg 2460
actccagctg tcgaacgaca acaggaccct gactttgctg tccgtgactc ggaacgacgt 2520
gggcccttat gaatgcggta tccagaacaa gctgtccgtg gaccacagcg accctgtgat 2580
cctgaacgtc ctttacgggc cggacgaccc caccatttcc ccgtcgtaca cttactaccg 2640
gccgggcgtg aacctgtccc tgtcgtgcca cgctgcctcc aatccgccgg cccagtactc 2700
ctggctcatc gacggaaaca tccagcagca cacccaagaa ctgttcatct ccaacattac 2760
cgagaaaaac tcgggacttt acacctgtca agccaacaat tccgccagcg gccactcccg 2820
caccactgtc aaaactatca ctgtgtccgc cgaactcccg aagcccagca tcagctccaa 2880
caactcgaag cccgtggagg ataaggacgc tgtcgcgttc acctgtgaac cagaggcaca 2940
gaataccacc tacctttggt gggtcaacgg acagtccctg cctgtctcac cgagactgca 3000
gctgtcaaac gggaatagga ctctgacctt gtttaacgtc acccggaacg acgcccgggc 3060
ctacgtgtgc ggcatccaga actccgtgag cgcaaaccgg tctgacccag tgaccctgga 3120
tgtgctgtac ggccccgaca ctccgatcat ttcacccccc gattcatcct acctgtccgg 3180
cgctaacctc aacctctcat gccactccgc atccaacccc agcccgcaat attcgtggcg 3240
cattaacgga attcctcagc aacataccca ggtcctgttc attgcgaaga tcacccctaa 3300
caacaacgga acctacgcct gctttgtgtc aaacctggcc actggtagaa acaactccat 3360
cgtgaagtcc attaccgtgt cggcgtcctg aagatctggg ccctaacaaa acaaaaagat 3420
ggggttattc cctaaacttc atgggttacg taattggaag ttgggggaca ttgccacaag 3480
atcatattgt acaaaagatc aaacactgtt ttagaaaact tcctgtaaac aggcctattg 3540
attggaaagt atgtcaaagg attgtgggtc ttttgggctt tgctgctcca tttacacaat 3600
gtggatatcc tgccttaatg cctttgtatg catgtataca agctaaacag gctttcactt 3660
tctcgccaac ttacaaggcc tttctaagta aacagtacat gaacctttac cccgttgctc 3720
ggcaacggcc tggtctgtgc caagtgtttg ctgacgcaac ccccactggc tggggcttgg 3780
ccataggcca tcagcgcatg cgtggaacct ttgtggctcc tctgccgatc catactgcgg 3840
aactcctagc cgcttgtttt gctcgcagcc ggtctggagc aaagctcata ggaactgaca 3900
attctgtcgt cctctcgcgg aaatatacat cgtttcgatc tacgtatgat ctttttccct 3960
ctgccaaaaa ttatggggac atcatgaagc cccttgagca tctgacttct ggctaataaa 4020
ggaaatttat tttcattgca atagtgtgtt ggaatttttt gtgtctctca ctcggaagga 4080
attctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct 4140
tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca 4200
gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac 4260
atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt 4320
ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg 4380
cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc 4440
tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc 4500
gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc 4560
aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac 4620
tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt 4680
aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct 4740
aactacggct acactagaag aacagtattt ggtatctgcg ctctgctgaa gccagttacc 4800
ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt 4860
ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg 4920
atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc 4980
atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa 5040
tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag 5100
gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact c 5151
<210> 56
<211> 5325
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 56
ggcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 60
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 120
agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 180
tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg 240
tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat 300
ggcaaaagct tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca 360
tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 420
aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcaaatgcaa ccggcgcagg 480
aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg 540
aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 600
aaatgcttga tggtcggaag aggcataaat tccgtcagcc agtttagtct gaccatctca 660
tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg 720
ggcttcccat acaatcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat 780
ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga gcaagacgtt 840
tcccgttgaa tatggctcat aacacccctt gtattactgt ttatgtaagc agacaggtcg 900
acaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg tacatttata 960
ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt tattaatagt 1020
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 1080
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 1140
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 1200
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt ccgcccccta 1260
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttacggg 1320
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 1380
tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 1440
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 1500
gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 1560
atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 1620
ttgacctcca tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 1680
gaacgcggat tccccgtgcc aagagtgact caccgtccgg atctcagcaa gcaggtatgt 1740
actctccagg gtgggcctgg cttccccagt caagactcca gggatttgag ggacgctgtg 1800
ggctcttctc ttacatgtac cttttgcttg cctcaaccct gactatcttc caggtcagga 1860
tcccagagtc aggggtctgt attttcctgc tggtggctcc agttcaggaa cagtaaaccc 1920
tgctccgaat attgcctctc acatctcgtc aatctccgcg aggactgggg accctgtgac 1980
gaacatggct agcgaatcgc caagcgcacc ccctcatcgg tggtgcatcc cttggcaacg 2040
cctcctcctg accgcctcac tgctgacttt ctggaacccg ccgaccaccg caaagctgac 2100
cattgagagc actcccttca acgtggctga ggggaaggag gtgctgctcc tggtgcacaa 2160
tctgccccag cacctgttcg ggtactcctg gtacaaggga gaacgcgtgg acgggaaccg 2220
gcagatcata ggctacgtca tcggaaccca gcaggccaca cccggtccag cgtacagcgg 2280
ccgggagatt atctacccga acgcctccct gctgatccaa aacatcatcc agaacgacac 2340
cggtttctac actctgcacg tgattaagtc agatctggtc aacgaagagg ccaccggcca 2400
attcagggtg taccccgaac tccctaagcc gttcatcacc tcgaacaaca gcaacccggt 2460
cgaggatgaa gatgcggtgg ccttgacgtg cgaacctgag atccagaaca ccacctactt 2520
gtggtgggtg aacaatcaga gcctgccagt ctccccacga ctccagctgt cgaacgacaa 2580
caggaccctg actttgctgt ccgtgactcg gaacgacgtg ggcccttatg aatgcggtat 2640
ccagaacaag ctgtccgtgg accacagcga ccctgtgatc ctgaacgtcc tttacgggcc 2700
ggacgacccc accatttccc cgtcgtacac ttactaccgg ccgggcgtga acctgtccct 2760
gtcgtgccac gctgcctcca atccgccggc ccagtactcc tggctcatcg acggaaacat 2820
ccagcagcac acccaagaac tgttcatctc caacattacc gagaaaaact cgggacttta 2880
cacctgtcaa gccaacaatt ccgccagcgg ccactcccgc accactgtca aaactatcac 2940
tgtgtccgcc gaactcccga agcccagcat cagctccaac aactcgaagc ccgtggagga 3000
taaggacgct gtcgcgttca cctgtgaacc agaggcacag aataccacct acctttggtg 3060
ggtcaacgga cagtccctgc ctgtctcacc gagactgcag ctgtcaaacg ggaataggac 3120
tctgaccttg tttaacgtca cccggaacga cgcccgggcc tacgtgtgcg gcatccagaa 3180
ctccgtgagc gcaaaccggt ctgacccagt gaccctggat gtgctgtacg gccccgacac 3240
tccgatcatt tcaccccccg attcatccta cctgtccggc gctaacctca acctctcatg 3300
ccactccgca tccaacccca gcccgcaata ttcgtggcgc attaacggaa ttcctcagca 3360
acatacccag gtcctgttca ttgcgaagat cacccctaac aacaacggaa cctacgcctg 3420
ctttgtgtca aacctggcca ctggtagaaa caactccatc gtgaagtcca ttaccgtgtc 3480
ggcgtccgga acttccccgg gcctgagcgc cggcgccacc gtgggaatta tgatcggcgt 3540
gctcgtggga gtggccctga tctgaagatc tgggccctaa caaaacaaaa agatggggtt 3600
attccctaaa cttcatgggt tacgtaattg gaagttgggg gacattgcca caagatcata 3660
ttgtacaaaa gatcaaacac tgttttagaa aacttcctgt aaacaggcct attgattgga 3720
aagtatgtca aaggattgtg ggtcttttgg gctttgctgc tccatttaca caatgtggat 3780
atcctgcctt aatgcctttg tatgcatgta tacaagctaa acaggctttc actttctcgc 3840
caacttacaa ggcctttcta agtaaacagt acatgaacct ttaccccgtt gctcggcaac 3900
ggcctggtct gtgccaagtg tttgctgacg caacccccac tggctggggc ttggccatag 3960
gccatcagcg catgcgtgga acctttgtgg ctcctctgcc gatccatact gcggaactcc 4020
tagccgcttg ttttgctcgc agccggtctg gagcaaagct cataggaact gacaattctg 4080
tcgtcctctc gcggaaatat acatcgtttc gatctacgta tgatcttttt ccctctgcca 4140
aaaattatgg ggacatcatg aagccccttg agcatctgac ttctggctaa taaaggaaat 4200
ttattttcat tgcaatagtg tgttggaatt ttttgtgtct ctcactcgga aggaattctg 4260
cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct 4320
tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac 4380
tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga 4440
gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat 4500
aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac 4560
ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct 4620
gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg 4680
ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg 4740
ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt 4800
cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg 4860
attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac 4920
ggctacacta gaagaacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga 4980
aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt 5040
gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt 5100
tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga 5160
ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc 5220
taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct 5280
atctcagcga tctgtctatt tcgttcatcc atagttgcct gactc 5325
<210> 57
<211> 9756
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 57
ggcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 60
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 120
agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 180
tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg 240
tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat 300
ggcaaaagct tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca 360
tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 420
aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcaaatgcaa ccggcgcagg 480
aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg 540
aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 600
aaatgcttga tggtcggaag aggcataaat tccgtcagcc agtttagtct gaccatctca 660
tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg 720
ggcttcccat acaatcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat 780
ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga gcaagacgtt 840
tcccgttgaa tatggctcat aacacccctt gtattactgt ttatgtaagc agacaggtcg 900
acaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg tacatttata 960
ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt tattaatagt 1020
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 1080
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 1140
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 1200
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt ccgcccccta 1260
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttacggg 1320
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 1380
tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 1440
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 1500
gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 1560
atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 1620
ttgacctcca tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 1680
gaacgcggat tccccgtgcc aagagtgact caccgtccgg atctcagcaa gcaggtatgt 1740
actctccagg gtgggcctgg cttccccagt caagactcca gggatttgag ggacgctgtg 1800
ggctcttctc ttacatgtac cttttgcttg cctcaaccct gactatcttc caggtcagga 1860
tcccagagtc aggggtctgt attttcctgc tggtggctcc agttcaggaa cagtaaaccc 1920
tgctccgaat attgcctctc acatctcgtc aatctccgcg aggactgggg accctgtgac 1980
gaacatggct agcacccctg gaacccagag ccccttcttc cttctgctgc tgctgaccgt 2040
gctgactgtc gtgacaggct ctggccacgc cagctctaca cctggcggcg agaaagagac 2100
aagcgccacc cagagaagca gcgtgccaag cagcaccgag aagaacgccg tgtccatgac 2160
cagctccgtg ctgagcagcc actctcctgg cagcggcagc agcacaacac agggccagga 2220
tgtgacactg gcccctgcca cagaacctgc ctctggatct gccgccacct ggggacagga 2280
cgtgacaagc gtgccagtga ccagacctgc cctgggctct acaacacccc ctgcccacga 2340
tgtgaccagc gcccctgata acaagcctgc ccctggaagc acagcccctc cagctcatgg 2400
cgtgacctct gccccagata ccagaccagc cccaggatct acagccccac ccgcacacgg 2460
cgtgacaagt gcccctgaca caagacccgc tccaggctct actgctcctc ctgcccatgg 2520
cgtgacaagc gctcccgata caaggccagc tcctggctcc acagcaccac cagcacatgg 2580
cgtgacatca gctcccgaca ctagacctgc tcccggatca accgctccac cagctcacgg 2640
cgtgaccagc gcacctgata ccagacctgc tctgggaagc accgcccctc ccgtgcacaa 2700
tgtgacatct gcttccggca gcgccagcgg ctctgcctct acactggtgc acaacggcac 2760
cagcgccaga gccacaacaa ccccagccag caagagcacc cccttcagca tccctagcca 2820
ccacagcgac acccctacca cactggccag ccactccacc aagaccgatg cctctagcac 2880
ccaccactcc agcgtgcccc ctctgaccag cagcaaccac agcacaagcc cccagctgtc 2940
taccggcgtc tcattcttct ttctgtcctt ccacatcagc aacctgcagt tcaacagcag 3000
cctggaagat cccagcaccg actactacca ggaactgcag cgggatatca gcgagatgtt 3060
cctgcaaatc tacaagcagg gcggcttcct gggcctgagc aacatcaagt tcagacccgg 3120
cagcgtggtg gtgcagctga ccctggcttt ccgggaaggc accatcaacg tgcacgacgt 3180
ggaaacccag ttcaaccagt acaagaccga ggccgccagc cggtacaacc tgaccatctc 3240
cgatgtgtcc gtgtccgacg tgcccttccc attctctgcc cagtctggcg caggcgtgcc 3300
aggatgggga attgctctgc tggtgctcgt gtgcgtgctg gtggccctgg ccatcgtgta 3360
tctgattgcc ctggccgtgt gccagtgccg gcggaagaat tacggccagc tggacatctt 3420
ccccgccaga gacacctacc accccatgag cgagtacccc acataccaca cccacggcag 3480
atacgtgcca cccagctcca ccgacagatc cccctacgag aaagtgtctg ccggcaacgg 3540
cggcagctcc ctgagctaca caaatcctgc cgtggccgct gcctccgcca acctgggatc 3600
cggcacaatc ctgtctgagg gcgccaccaa cttcagcctg ctgaaactgg ccggcgacgt 3660
ggaactgaac cctggccctg gagctgcccc ggagccggag aggacccccg ttggccaggg 3720
atcgtgggcc catccgggac gcaccagggg accatccgac aggggattct gtgtggtgtc 3780
accggccagg ccagcagaag aggcaaccag cctcgaggga gcgttgtctg gaaccagaca 3840
ttcccacccg tcggtgggcc ggcagcacca cgcgggacca ccgtccactt ccagaccgcc 3900
acggccatgg gacacccctt gcccgcctgt gtatgccgag actaaacact tcctgtactc 3960
atccggagac aaggaacagc ttcggccgtc cttcctcctg tcgtcgctca gaccgagcct 4020
gaccggagca cgcagattgg tggaaactat cttccttggg tcacgtccgt ggatgccagg 4080
taccccacgg cgcctcccgc gcctcccaca gagatactgg cagatgcggc ctctgttcct 4140
ggaattgctg ggaaaccacg ctcagtgccc gtacggagtc ctgctcaaga ctcactgccc 4200
tctgagggcg gcggtcactc cggcggccgg agtgtgcgca cgggagaagc cccagggaag 4260
cgtggcagct ccggaagagg aggacaccga tccgcgccgc ctcgtgcaac ttctgcgcca 4320
gcactcctcg ccctggcaag tctacgggtt cgtccgcgcc tgcctgcgcc gcctggtgcc 4380
gcctgggctc tggggttccc ggcataacga gcgccgcttc ctgagaaata ctaagaagtt 4440
tatctcactt ggaaaacatg ccaagttgtc gctgcaagaa ctcacgtgga agatgtcagt 4500
ccgcgattgc gcctggctgc gccgctcgcc gggcgtcggg tgtgttccag ctgcagaaca 4560
ccgcctgaga gaagaaattc tggccaaatt tctgcattgg ctgatgtcag tgtacgtggt 4620
cgagctgctg cgctcctttt tctacgtcac tgagactacc tttcaaaaga accgcctgtt 4680
cttctaccgc aaatctgtgt ggagcaagct gcagtcaatc ggcattcgcc agcatctgaa 4740
gagggtgcag ctgcgggaac tttccgaggc agaagtccgc cagcaccggg aggcccggcc 4800
ggcgcttctc acgtcgcgtc tgagattcat cccaaagccc gacgggctga ggcctatcgt 4860
caacatggat tacgtcgtgg gcgctcgcac ctttcgccgt gaaaagcggg ccgaacgctt 4920
gacctcacgg gtgaaggccc tcttctccgt gctgaactac gagagagcaa gacggcctgg 4980
cctgctggga gcttcggtgc tgggactgga cgatatccac cgggcttggc ggacctttgt 5040
tctccgggtg agagcccaag accctccgcc ggaactgtac ttcgtgaagg tggcgatcac 5100
cggagcctat gatactattc cgcaagatcg actcaccgaa gtcatcgcct cgatcatcaa 5160
accgcagaac acttactgcg tcaggcggta cgccgtggtc cagaaggccg cgcatggcca 5220
cgtgagaaag gcgttcaagt cgcacgtgtc cactctcacc gacctccagc cttacatgag 5280
gcaattcgtt gcgcatttgc aagagacttc gcccctgaga gatgcggtgg tcatcgagca 5340
gagctccagc ctgaacgaag cgagcagcgg tctgtttgac gtgttcctcc gcttcatgtg 5400
tcatcacgcg gtgcgaatca ggggaaaatc atacgtgcag tgccagggaa tcccacaagg 5460
cagcattctg tcgactctct tgtgttccct ttgctacggc gatatggaaa acaagctgtt 5520
cgctgggatc agacgggacg ggttgctgct cagactggtg gacgacttcc tgctggtgac 5580
tccgcacctc actcacgcca aaacctttct ccgcactctg gtgaggggag tgccagaata 5640
cggctgtgtg gtcaatctcc ggaaaactgt ggtgaatttc cctgtcgagg atgaggcact 5700
cggaggaacc gcatttgtcc aaatgccagc acatggcctg ttcccatggt gcggtctgct 5760
gctggacacc cgaactcttg aagtgcagtc cgactactcc agctatgccc ggacgagcat 5820
ccgcgccagc ctcactttca atcgcggctt taaggccgga cgaaacatgc gcagaaagct 5880
tttcggagtc ctccggctta aatgccattc gctctttctc gatctccaag tcaattcgct 5940
gcagaccgtg tgcacgaaca tctacaagat cctgctgctc caagcctacc ggttccacgc 6000
ttgcgtgctt cagctgccgt ttcaccaaca ggtgtggaag aacccgacct tctttctgcg 6060
ggtcattagc gatactgcct ccctgtgtta ctcaatcctc aaggcaaaga acgccggaat 6120
gtcgctgggt gcgaaaggag ccgcgggacc tcttcctagc gaagcggtgc agtggctctg 6180
ccaccaggct ttcctcctga agctgaccag gcacagagtg acctacgtcc cgctgctggg 6240
ctcgctgcgc actgcacaga cccagctgtc tagaaaactc cccggcacca ccctgaccgc 6300
tctggaagcc gccgccaacc cagcattgcc gtcagatttc aagaccatct tggacggatc 6360
cggccagtgc accaattacg ccctgctgaa gctggccggc gacgtggaat ctaaccctgg 6420
ccctgaatcg ccaagcgcac cccctcatcg gtggtgcatc ccttggcaac gcctcctcct 6480
gaccgcctca ctgctgactt tctggaaccc gccgaccacc gcaaagctga ccattgagag 6540
cactcccttc aacgtggctg aggggaagga ggtgctgctc ctggtgcaca atctgcccca 6600
gcacctgttc gggtactcct ggtacaaggg agaacgcgtg gacgggaacc ggcagatcat 6660
aggctacgtc atcggaaccc agcaggccac acccggtcca gcgtacagcg gccgggagat 6720
tatctacccg aacgcctccc tgctgatcca aaacatcatc cagaacgaca ccggtttcta 6780
cactctgcac gtgattaagt cagatctggt caacgaagag gccaccggcc aattcagggt 6840
gtaccccgaa ctccctaagc cgttcatcac ctcgaacaac agcaacccgg tcgaggatga 6900
agatgcggtg gccttgacgt gcgaacctga gatccagaac accacctact tgtggtgggt 6960
gaacaatcag agcctgccag tctccccacg actccagctg tcgaacgaca acaggaccct 7020
gactttgctg tccgtgactc ggaacgacgt gggcccttat gaatgcggta tccagaacaa 7080
gctgtccgtg gaccacagcg accctgtgat cctgaacgtc ctttacgggc cggacgaccc 7140
caccatttcc ccgtcgtaca cttactaccg gccgggcgtg aacctgtccc tgtcgtgcca 7200
cgctgcctcc aatccgccgg cccagtactc ctggctcatc gacggaaaca tccagcagca 7260
cacccaagaa ctgttcatct ccaacattac cgagaaaaac tcgggacttt acacctgtca 7320
agccaacaat tccgccagcg gccactcccg caccactgtc aaaactatca ctgtgtccgc 7380
cgaactcccg aagcccagca tcagctccaa caactcgaag cccgtggagg ataaggacgc 7440
tgtcgcgttc acctgtgaac cagaggcaca gaataccacc tacctttggt gggtcaacgg 7500
acagtccctg cctgtctcac cgagactgca gctgtcaaac gggaatagga ctctgacctt 7560
gtttaacgtc acccggaacg acgcccgggc ctacgtgtgc ggcatccaga actccgtgag 7620
cgcaaaccgg tctgacccag tgaccctgga tgtgctgtac ggccccgaca ctccgatcat 7680
ttcacccccc gattcatcct acctgtccgg cgctaacctc aacctctcat gccactccgc 7740
atccaacccc agcccgcaat attcgtggcg cattaacgga attcctcagc aacataccca 7800
ggtcctgttc attgcgaaga tcacccctaa caacaacgga acctacgcct gctttgtgtc 7860
aaacctggcc actggtagaa acaactccat cgtgaagtcc attaccgtgt cggcgtccgg 7920
aacttccccg ggcctgagcg ccggcgccac cgtgggaatt atgatcggcg tgctcgtggg 7980
agtggccctg atctgaagat ctgggcccta acaaaacaaa aagatggggt tattccctaa 8040
acttcatggg ttacgtaatt ggaagttggg ggacattgcc acaagatcat attgtacaaa 8100
agatcaaaca ctgttttaga aaacttcctg taaacaggcc tattgattgg aaagtatgtc 8160
aaaggattgt gggtcttttg ggctttgctg ctccatttac acaatgtgga tatcctgcct 8220
taatgccttt gtatgcatgt atacaagcta aacaggcttt cactttctcg ccaacttaca 8280
aggcctttct aagtaaacag tacatgaacc tttaccccgt tgctcggcaa cggcctggtc 8340
tgtgccaagt gtttgctgac gcaaccccca ctggctgggg cttggccata ggccatcagc 8400
gcatgcgtgg aacctttgtg gctcctctgc cgatccatac tgcggaactc ctagccgctt 8460
gttttgctcg cagccggtct ggagcaaagc tcataggaac tgacaattct gtcgtcctct 8520
cgcggaaata tacatcgttt cgatctacgt atgatctttt tccctctgcc aaaaattatg 8580
gggacatcat gaagcccctt gagcatctga cttctggcta ataaaggaaa tttattttca 8640
ttgcaatagt gtgttggaat tttttgtgtc tctcactcgg aaggaattct gcattaatga 8700
atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc 8760
actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg 8820
gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc 8880
cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc 8940
ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga 9000
ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc 9060
ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat 9120
agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg 9180
cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc 9240
aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga 9300
gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact 9360
agaagaacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt 9420
ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag 9480
cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg 9540
tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa 9600
aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata 9660
tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg 9720
atctgtctat ttcgttcatc catagttgcc tgactc 9756
<210> 58
<211> 36268
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 58
ccatcttcaa taatatacct caaacttttt gtgcgcgtta atatgcaaat gaggcgtttg 60
aatttgggga ggaagggcgg tgattggtcg agggatgagc gaccgttagg ggcggggcga 120
gtgacgtttt gatgacgtgg ttgcgaggag gagccagttt gcaagttctc gtgggaaaag 180
tgacgtcaaa cgaggtgtgg tttgaacacg gaaatactca attttcccgc gctctctgac 240
aggaaatgag gtgtttctgg gcggatgcaa gtgaaaacgg gccattttcg cgcgaaaact 300
gaatgaggaa gtgaaaatct gagtaatttc gcgtttatgg cagggaggag tatttgccga 360
gggccgagta gactttgacc gattacgtgg gggtttcgat taccgtgttt ttcacctaaa 420
tttccgcgta cggtgtcaaa gtccggtgtt tttactactg taatagtaat caattacggg 480
gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc 540
gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat 600
agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc 660
ccacttggca gtacatcaag tgtatcatat gccaagtacg ccccctattg acgtcaatga 720
cggtaaatgg cccgcctggc attatgccca gtacatgacc ttatgggact ttcctacttg 780
gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacat 840
caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt 900
caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactc 960
cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg gaggtctata taagcagagc 1020
tgtccctatc agtgatagag atctccctat cagtgataga gagtttagtg aaccgtcaga 1080
tccgctaggg taccgcgatc accatggcta gcacccctgg aacccagagc cccttcttcc 1140
ttctgctgct gctgaccgtg ctgactgtcg tgacaggctc tggccacgcc agctctacac 1200
ctggcggcga gaaagagaca agcgccaccc agagaagcag cgtgccaagc agcaccgaga 1260
agaacgccgt gtccatgacc agctccgtgc tgagcagcca ctctcctggc agcggcagca 1320
gcacaacaca gggccaggat gtgacactgg cccctgccac agaacctgcc tctggatctg 1380
ccgccacctg gggacaggac gtgacaagcg tgccagtgac cagacctgcc ctgggctcta 1440
caacaccccc tgcccacgat gtgaccagcg cccctgataa caagcctgcc cctggaagca 1500
cagcccctcc agctcatggc gtgacctctg ccccagatac cagaccagcc ccaggatcta 1560
cagccccacc cgcacacggc gtgacaagtg cccctgacac aagacccgct ccaggctcta 1620
ctgctcctcc tgcccatggc gtgacaagcg ctcccgatac aaggccagct cctggctcca 1680
cagcaccacc agcacatggc gtgacatcag ctcccgacac tagacctgct cccggatcaa 1740
ccgctccacc agctcacggc gtgaccagcg cacctgatac cagacctgct ctgggaagca 1800
ccgcccctcc cgtgcacaat gtgacatctg cttccggcag cgccagcggc tctgcctcta 1860
cactggtgca caacggcacc agcgccagag ccacaacaac cccagccagc aagagcaccc 1920
ccttcagcat ccctagccac cacagcgaca cccctaccac actggccagc cactccacca 1980
agaccgatgc ctctagcacc caccactcca gcgtgccccc tctgaccagc agcaaccaca 2040
gcacaagccc ccagctgtct accggcgtct cattcttctt tctgtccttc cacatcagca 2100
acctgcagtt caacagcagc ctggaagatc ccagcaccga ctactaccag gaactgcagc 2160
gggatatcag cgagatgttc ctgcaaatct acaagcaggg cggcttcctg ggcctgagca 2220
acatcaagtt cagacccggc agcgtggtgg tgcagctgac cctggctttc cgggaaggca 2280
ccatcaacgt gcacgacgtg gaaacccagt tcaaccagta caagaccgag gccgccagcc 2340
ggtacaacct gaccatctcc gatgtgtccg tgtccgacgt gcccttccca ttctctgccc 2400
agtctggcgc aggcgtgcca ggatggggaa ttgctctgct ggtgctcgtg tgcgtgctgg 2460
tggccctggc catcgtgtat ctgattgccc tggccgtgtg ccagtgccgg cggaagaatt 2520
acggccagct ggacatcttc cccgccagag acacctacca ccccatgagc gagtacccca 2580
cataccacac ccacggcaga tacgtgccac ccagctccac cgacagatcc ccctacgaga 2640
aagtgtctgc cggcaacggc ggcagctccc tgagctacac aaatcctgcc gtggccgctg 2700
cctccgccaa cctgggatcc ggcacaatcc tgtctgaggg cgccaccaac ttcagcctgc 2760
tgaaactggc cggcgacgtg gaactgaacc ctggccctgg agctgccccg gagccggaga 2820
ggacccccgt tggccaggga tcgtgggccc atccgggacg caccagggga ccatccgaca 2880
ggggattctg tgtggtgtca ccggccaggc cagcagaaga ggcaaccagc ctcgagggag 2940
cgttgtctgg aaccagacat tcccacccgt cggtgggccg gcagcaccac gcgggaccac 3000
cgtccacttc cagaccgcca cggccatggg acaccccttg cccgcctgtg tatgccgaga 3060
ctaaacactt cctgtactca tccggagaca aggaacagct tcggccgtcc ttcctcctgt 3120
cgtcgctcag accgagcctg accggagcac gcagattggt ggaaactatc ttccttgggt 3180
cacgtccgtg gatgccaggt accccacggc gcctcccgcg cctcccacag agatactggc 3240
agatgcggcc tctgttcctg gaattgctgg gaaaccacgc tcagtgcccg tacggagtcc 3300
tgctcaagac tcactgccct ctgagggcgg cggtcactcc ggcggccgga gtgtgcgcac 3360
gggagaagcc ccagggaagc gtggcagctc cggaagagga ggacaccgat ccgcgccgcc 3420
tcgtgcaact tctgcgccag cactcctcgc cctggcaagt ctacgggttc gtccgcgcct 3480
gcctgcgccg cctggtgccg cctgggctct ggggttcccg gcataacgag cgccgcttcc 3540
tgagaaatac taagaagttt atctcacttg gaaaacatgc caagttgtcg ctgcaagaac 3600
tcacgtggaa gatgtcagtc cgcgattgcg cctggctgcg ccgctcgccg ggcgtcgggt 3660
gtgttccagc tgcagaacac cgcctgagag aagaaattct ggccaaattt ctgcattggc 3720
tgatgtcagt gtacgtggtc gagctgctgc gctccttttt ctacgtcact gagactacct 3780
ttcaaaagaa ccgcctgttc ttctaccgca aatctgtgtg gagcaagctg cagtcaatcg 3840
gcattcgcca gcatctgaag agggtgcagc tgcgggaact ttccgaggca gaagtccgcc 3900
agcaccggga ggcccggccg gcgcttctca cgtcgcgtct gagattcatc ccaaagcccg 3960
acgggctgag gcctatcgtc aacatggatt acgtcgtggg cgctcgcacc tttcgccgtg 4020
aaaagcgggc cgaacgcttg acctcacggg tgaaggccct cttctccgtg ctgaactacg 4080
agagagcaag acggcctggc ctgctgggag cttcggtgct gggactggac gatatccacc 4140
gggcttggcg gacctttgtt ctccgggtga gagcccaaga ccctccgccg gaactgtact 4200
tcgtgaaggt ggcgatcacc ggagcctatg atactattcc gcaagatcga ctcaccgaag 4260
tcatcgcctc gatcatcaaa ccgcagaaca cttactgcgt caggcggtac gccgtggtcc 4320
agaaggccgc gcatggccac gtgagaaagg cgttcaagtc gcacgtgtcc actctcaccg 4380
acctccagcc ttacatgagg caattcgttg cgcatttgca agagacttcg cccctgagag 4440
atgcggtggt catcgagcag agctccagcc tgaacgaagc gagcagcggt ctgtttgacg 4500
tgttcctccg cttcatgtgt catcacgcgg tgcgaatcag gggaaaatca tacgtgcagt 4560
gccagggaat cccacaaggc agcattctgt cgactctctt gtgttccctt tgctacggcg 4620
atatggaaaa caagctgttc gctgggatca gacgggacgg gttgctgctc agactggtgg 4680
acgacttcct gctggtgact ccgcacctca ctcacgccaa aacctttctc cgcactctgg 4740
tgaggggagt gccagaatac ggctgtgtgg tcaatctccg gaaaactgtg gtgaatttcc 4800
ctgtcgagga tgaggcactc ggaggaaccg catttgtcca aatgccagca catggcctgt 4860
tcccatggtg cggtctgctg ctggacaccc gaactcttga agtgcagtcc gactactcca 4920
gctatgcccg gacgagcatc cgcgccagcc tcactttcaa tcgcggcttt aaggccggac 4980
gaaacatgcg cagaaagctt ttcggagtcc tccggcttaa atgccattcg ctctttctcg 5040
atctccaagt caattcgctg cagaccgtgt gcacgaacat ctacaagatc ctgctgctcc 5100
aagcctaccg gttccacgct tgcgtgcttc agctgccgtt tcaccaacag gtgtggaaga 5160
acccgacctt ctttctgcgg gtcattagcg atactgcctc cctgtgttac tcaatcctca 5220
aggcaaagaa cgccggaatg tcgctgggtg cgaaaggagc cgcgggacct cttcctagcg 5280
aagcggtgca gtggctctgc caccaggctt tcctcctgaa gctgaccagg cacagagtga 5340
cctacgtccc gctgctgggc tcgctgcgca ctgcacagac ccagctgtct agaaaactcc 5400
ccggcaccac cctgaccgct ctggaagccg ccgccaaccc agcattgccg tcagatttca 5460
agaccatctt ggacggatcc ggccagtgca ccaattacgc cctgctgaag ctggccggcg 5520
acgtggaatc taaccctggc cctgaatcgc caagcgcacc ccctcatcgg tggtgcatcc 5580
cttggcaacg cctcctcctg accgcctcac tgctgacttt ctggaacccg ccgaccaccg 5640
caaagctgac cattgagagc actcccttca acgtggctga ggggaaggag gtgctgctcc 5700
tggtgcacaa tctgccccag cacctgttcg ggtactcctg gtacaaggga gaacgcgtgg 5760
acgggaaccg gcagatcata ggctacgtca tcggaaccca gcaggccaca cccggtccag 5820
cgtacagcgg ccgggagatt atctacccga acgcctccct gctgatccaa aacatcatcc 5880
agaacgacac cggtttctac actctgcacg tgattaagtc agatctggtc aacgaagagg 5940
ccaccggcca attcagggtg taccccgaac tccctaagcc gttcatcacc tcgaacaaca 6000
gcaacccggt cgaggatgaa gatgcggtgg ccttgacgtg cgaacctgag atccagaaca 6060
ccacctactt gtggtgggtg aacaatcaga gcctgccagt ctccccacga ctccagctgt 6120
cgaacgacaa caggaccctg actttgctgt ccgtgactcg gaacgacgtg ggcccttatg 6180
aatgcggtat ccagaacaag ctgtccgtgg accacagcga ccctgtgatc ctgaacgtcc 6240
tttacgggcc ggacgacccc accatttccc cgtcgtacac ttactaccgg ccgggcgtga 6300
acctgtccct gtcgtgccac gctgcctcca atccgccggc ccagtactcc tggctcatcg 6360
acggaaacat ccagcagcac acccaagaac tgttcatctc caacattacc gagaaaaact 6420
cgggacttta cacctgtcaa gccaacaatt ccgccagcgg ccactcccgc accactgtca 6480
aaactatcac tgtgtccgcc gaactcccga agcccagcat cagctccaac aactcgaagc 6540
ccgtggagga taaggacgct gtcgcgttca cctgtgaacc agaggcacag aataccacct 6600
acctttggtg ggtcaacgga cagtccctgc ctgtctcacc gagactgcag ctgtcaaacg 6660
ggaataggac tctgaccttg tttaacgtca cccggaacga cgcccgggcc tacgtgtgcg 6720
gcatccagaa ctccgtgagc gcaaaccggt ctgacccagt gaccctggat gtgctgtacg 6780
gccccgacac tccgatcatt tcaccccccg attcatccta cctgtccggc gctaacctca 6840
acctctcatg ccactccgca tccaacccca gcccgcaata ttcgtggcgc attaacggaa 6900
ttcctcagca acatacccag gtcctgttca ttgcgaagat cacccctaac aacaacggaa 6960
cctacgcctg ctttgtgtca aacctggcca ctggtagaaa caactccatc gtgaagtcca 7020
ttaccgtgtc ggcgtccgga acttccccgg gcctgagcgc cggcgccacc gtgggaatta 7080
tgatcggcgt gctcgtggga gtggccctga tctgacgcac ctcgagctga tcataatcag 7140
ccataccaca tttgtagagg ttttacttgc tttaaaaaac ctcccacacc tccccctgaa 7200
cctgaaacat aaaatgaatg caattgttgt tgttaacttg tttattgcag cttataatgg 7260
ttacaaataa agcaatagca tcacaaattt cacaaataaa gcattttttt cactgcattc 7320
tagttgtggt ttgtccaaac tcatcaatgt atcttaccag gtgccgagcc tgcgagtgcg 7380
gagggaagca tgccaggttc cagcccgtgt gtgtggatgt gacggaggac ctgcgacccg 7440
atcatttggt gttgccctgc accgggacgg agttcggttc cagcggggaa gaatctgact 7500
agagtgagta gtgttctggg gcgggggagg acctgcatga gggccagaat aactgaaatc 7560
tgtgcttttc tgtgtgttgc agcagcatga gcggaagcgg ctcctttgag ggaggggtat 7620
tcagccctta tctgacgggg cgtctcccct cctgggcggg agtgcgtcag aatgtgatgg 7680
gatccacggt ggacggccgg cccgtgcagc ccgcgaactc ttcaaccctg acctatgcaa 7740
ccctgagctc ttcgtcgttg gacgcagctg ccgccgcagc tgctgcatct gccgccagcg 7800
ccgtgcgcgg aatggccatg ggcgccggct actacggcac tctggtggcc aactcgagtt 7860
ccaccaataa tcccgccagc ctgaacgagg agaagctgtt gctgctgatg gcccagctcg 7920
aggccttgac ccagcgcctg ggcgagctga cccagcaggt ggctcagctg caggagcaga 7980
cgcgggccgc ggttgccacg gtgaaatcca aataaaaaat gaatcaataa ataaacggag 8040
acggttgttg attttaacac agagtctgaa tctttatttg atttttcgcg cgcggtaggc 8100
cctggaccac cggtctcgat cattgagcac ccggtggatc ttttccagga cccggtagag 8160
gtgggcttgg atgttgaggt acatgggcat gagcccgtcc cgggggtgga ggtagctcca 8220
ttgcagggcc tcgtgctcgg gggtggtgtt gtaaatcacc cagtcatagc aggggcgcag 8280
ggcatggtgt tgcacaatat ctttgaggag gagactgatg gccacgggca gccctttggt 8340
gtaggtgttt acaaatctgt tgagctggga gggatgcatg cggggggaga tgaggtgcat 8400
cttggcctgg atcttgagat tggcgatgtt accgcccaga tcccgcctgg ggttcatgtt 8460
gtgcaggacc accagcacgg tgtatccggt gcacttgggg aatttatcat gcaacttgga 8520
agggaaggcg tgaaagaatt tggcgacgcc tttgtgcccg cccaggtttt ccatgcactc 8580
atccatgatg atggcgatgg gcccgtgggc ggcggcctgg gcaaagacgt ttcgggggtc 8640
ggacacatca tagttgtggt cctgggtgag gtcatcatag gccattttaa tgaatttggg 8700
gcggagggtg ccggactggg ggacaaaggt accctcgatc ccgggggcgt agttcccctc 8760
acagatctgc atctcccagg ctttgagctc ggaggggggg atcatgtcca cctgcggggc 8820
gataaagaac acggtttccg gggcggggga gatgagctgg gccgaaagca agttccggag 8880
cagctgggac ttgccgcagc cggtggggcc gtagatgacc ccgatgaccg gctgcaggtg 8940
gtagttgagg gagagacagc tgccgtcctc ccggaggagg ggggccacct cgttcatcat 9000
ctcgcgcacg tgcatgttct cgcgcaccag ttccgccagg aggcgctctc cccccaggga 9060
taggagctcc tggagcgagg cgaagttttt cagcggcttg agtccgtcgg ccatgggcat 9120
tttggagagg gtttgttgca agagttccag gcggtcccag agctcggtga tgtgctctac 9180
ggcatctcga tccagcagac ctcctcgttt cgcgggttgg gacggctgcg ggagtagggc 9240
accagacgat gggcgtccag cgcagccagg gtccggtcct tccagggtcg cagcgtccgc 9300
gtcagggtgg tctccgtcac ggtgaagggg tgcgcgccgg gctgggcgct tgcgagggtg 9360
cgcttcaggc tcatccggct ggtcgaaaac cgctcccgat cggcgccctg cgcgtcggcc 9420
aggtagcaat tgaccatgag ttcgtagttg agcgcctcgg ccgcgtggcc tttggcgcgg 9480
agcttacctt tggaagtctg cccgcaggcg ggacagagga gggacttgag ggcgtagagc 9540
ttgggggcga ggaagacgga ctcgggggcg taggcgtccg cgccgcagtg ggcgcagacg 9600
gtctcgcact ccacgagcca ggtgaggtcg ggctggtcgg ggtcaaaaac cagtttcccg 9660
ccgttctttt tgatgcgttt cttacctttg gtctccatga gctcgtgtcc ccgctgggtg 9720
acaaagaggc tgtccgtgtc cccgtagacc gactttatgg gccggtcctc gagcggtgtg 9780
ccgcggtcct cctcgtagag gaaccccgcc cactccgaga cgaaagcccg ggtccaggcc 9840
agcacgaagg aggccacgtg ggacgggtag cggtcgttgt ccaccagcgg gtccaccttt 9900
tccagggtat gcaaacacat gtccccctcg tccacatcca ggaaggtgat tggcttgtaa 9960
gtgtaggcca cgtgaccggg ggtcccggcc gggggggtat aaaagggtgc gggtccctgc 10020
tcgtcctcac tgtcttccgg atcgctgtcc aggagcgcca gctgttgggg taggtattcc 10080
ctctcgaagg cgggcatgac ctcggcactc aggttgtcag tttctagaaa cgaggaggat 10140
ttgatattga cggtgccggc ggagatgcct ttcaagagcc cctcgtccat ctggtcagaa 10200
aagacgatct ttttgttgtc gagcttggtg gcgaaggagc cgtagagggc gttggagagg 10260
agcttggcga tggagcgcat ggtctggttt ttttccttgt cggcgcgctc cttggcggcg 10320
atgttgagct gcacgtactc gcgcgccacg cacttccatt cggggaagac ggtggtcagc 10380
tcgtcgggca cgattctgac ctgccagccc cgattatgca gggtgatgag gtccacactg 10440
gtggccacct cgccgcgcag gggctcatta gtccagcaga ggcgtccgcc cttgcgcgag 10500
cagaaggggg gcagggggtc cagcatgacc tcgtcggggg ggtcggcatc gatggtgaag 10560
atgccgggca ggaggtcggg gtcaaagtag ctgatggaag tggccagatc gtccagggca 10620
gcttgccatt cgcgcacggc cagcgcgcgc tcgtagggac tgaggggcgt gccccagggc 10680
atgggatggg taagcgcgga ggcgtacatg ccgcagatgt cgtagacgta gaggggctcc 10740
tcgaggatgc cgatgtaggt ggggtagcag cgccccccgc ggatgctggc gcgcacgtag 10800
tcatacagct cgtgcgaggg ggcgaggagc cccgggccca ggttggtgcg actgggcttt 10860
tcggcgcggt agacgatctg gcggaaaatg gcatgcgagt tggaggagat ggtgggcctt 10920
tggaagatgt tgaagtgggc gtggggcagt ccgaccgagt cgcggatgaa gtgggcgtag 10980
gagtcttgca gcttggcgac gagctcggcg gtgactagga cgtccagagc gcagtagtcg 11040
agggtctcct ggatgatgtc atacttgagc tgtccctttt gtttccacag ctcgcggttg 11100
agaaggaact cttcgcggtc cttccagtac tcttcgaggg ggaacccgtc ctgatctgca 11160
cggtaagagc ctagcatgta gaactggttg acggccttgt aggcgcagca gcccttctcc 11220
acggggaggg cgtaggcctg ggcggccttg cgcagggagg tgtgcgtgag ggcgaaagtg 11280
tccctgacca tgaccttgag gaactggtgc ttgaagtcga tatcgtcgca gcccccctgc 11340
tcccagagct ggaagtccgt gcgcttcttg taggcggggt tgggcaaagc gaaagtaaca 11400
tcgttgaaga ggatcttgcc cgcgcggggc ataaagttgc gagtgatgcg gaaaggttgg 11460
ggcacctcgg cccggttgtt gatgacctgg gcggcgagca cgatctcgtc gaagccgttg 11520
atgttgtggc ccacgatgta gagttccacg aatcgcggac ggcccttgac gtggggcagt 11580
ttcttgagct cctcgtaggt gagctcgtcg gggtcgctga gcccgtgctg ctcgagcgcc 11640
cagtcggcga gatgggggtt ggcgcggagg aaggaagtcc agagatccac ggccagggcg 11700
gtttgcagac ggtcccggta ctgacggaac tgctgcccga cggccatttt ttcgggggtg 11760
acgcagtaga aggtgcgggg gtccccgtgc cagcgatccc atttgagctg gagggcgaga 11820
tcgagggcga gctcgacgag ccggtcgtcc ccggagagtt tcatgaccag catgaagggg 11880
acgagctgct tgccgaagga ccccatccag gtgtaggttt ccacatcgta ggtgaggaag 11940
agcctttcgg tgcgaggatg cgagccgatg gggaagaact ggatctcctg ccaccaattg 12000
gaggaatggc tgttgatgtg atggaagtag aaatgccgac ggcgcgccga acactcgtgc 12060
ttgtgtttat acaagcggcc acagtgctcg caacgctgca cgggatgcac gtgctgcacg 12120
agctgtacct gagttccttt gacgaggaat ttcagtggga agtggagtcg tggcgcctgc 12180
atctcgtgct gtactacgtc gtggtggtcg gcctggccct cttctgcctc gatggtggtc 12240
atgctgacga gcccgcgcgg gaggcaggtc cagacctcgg cgcgagcggg tcggagagcg 12300
aggacgaggg cgcgcaggcc ggagctgtcc agggtcctga gacgctgcgg agtcaggtca 12360
gtgggcagcg gcggcgcgcg gttgacttgc aggagttttt ccagggcgcg cgggaggtcc 12420
agatggtact tgatctccac cgcgccattg gtggcgacgt cgatggcttg cagggtcccg 12480
tgcccctggg gtgtgaccac cgtcccccgt ttcttcttgg gcggctgggg cgacgggggc 12540
ggtgcctctt ccatggttag aagcggcggc gaggacgcgc gccgggcggc aggggcggct 12600
cggggcccgg aggcaggggc ggcaggggca cgtcggcgcc gcgcgcgggt aggttctggt 12660
actgcgcccg gagaagactg gcgtgagcga cgacgcgacg gttgacgtcc tggatctgac 12720
gcctctgggt gaaggccacg ggacccgtga gtttgaacct gaaagagagt tcgacagaat 12780
caatctcggt atcgttgacg gcggcctgcc gcaggatctc ttgcacgtcg cccgagttgt 12840
cctggtaggc gatctcggtc atgaactgct cgatctcctc ctcttgaagg tctccgcggc 12900
cggcgcgctc cacggtggcc gcgaggtcgt tggagatgcg gcccatgagc tgcgagaagg 12960
cgttcatgcc cgcctcgttc cagacgcggc tgtagaccac gacgccctcg ggatcgcggg 13020
cgcgcatgac cacctgggcg aggttgagct ccacgtggcg cgtgaagacc gcgtagttgc 13080
agaggcgctg gtagaggtag ttgagcgtgg tggcgatgtg ctcggtgacg aagaaataca 13140
tgatccagcg gcggagcggc atctcgctga cgtcgcccag cgcctccaaa cgttccatgg 13200
cctcgtaaaa gtccacggcg aagttgaaaa actgggagtt gcgcgccgag acggtcaact 13260
cctcctccag aagacggatg agctcggcga tggtggcgcg cacctcgcgc tcgaaggccc 13320
ccgggagttc ctccacttcc tcttcttcct cctccactaa catctcttct acttcctcct 13380
caggcggcag tggtggcggg ggagggggcc tgcgtcgccg gcggcgcacg ggcagacggt 13440
cgatgaagcg ctcgatggtc tcgccgcgcc ggcgtcgcat ggtctcggtg acggcgcgcc 13500
cgtcctcgcg gggccgcagc gtgaagacgc cgccgcgcat ctccaggtgg ccgggggggt 13560
ccccgttggg cagggagagg gcgctgacga tgcatcttat caattgcccc gtagggactc 13620
cgcgcaagga cctgagcgtc tcgagatcca cgggatctga aaaccgctga acgaaggctt 13680
cgagccagtc gcagtcgcaa ggtaggctga gcacggtttc ttctggcggg tcatgttggt 13740
tgggagcggg gcgggcgatg ctgctggtga tgaagttgaa ataggcggtt ctgagacggc 13800
ggatggtggc gaggagcacc aggtctttgg gcccggcttg ctggatgcgc agacggtcgg 13860
ccatgcccca ggcgtggtcc tgacacctgg ccaggtcctt gtagtagtcc tgcatgagcc 13920
gctccacggg cacctcctcc tcgcccgcgc ggccgtgcat gcgcgtgagc ccgaagccgc 13980
gctggggctg gacgagcgcc aggtcggcga cgacgcgctc ggcgaggatg gcttgctgga 14040
tctgggtgag ggtggtctgg aagtcatcaa agtcgacgaa gcggtggtag gctccggtgt 14100
tgatggtgta ggagcagttg gccatgacgg accagttgac ggtctggtgg cccggacgca 14160
cgagctcgtg gtacttgagg cgcgagtagg cgcgcgtgtc gaagatgtag tcgttgcagg 14220
tgcgcaccag gtactggtag ccgatgagga agtgcggcgg cggctggcgg tagagcggcc 14280
atcgctcggt ggcgggggcg ccgggcgcga ggtcctcgag catggtgcgg tggtagccgt 14340
agatgtacct ggacatccag gtgatgccgg cggcggtggt ggaggcgcgc gggaactcgc 14400
ggacgcggtt ccagatgttg cgcagcggca ggaagtagtt catggtgggc acggtctggc 14460
ccgtgaggcg cgcgcagtcg tggatgctct atacgggcaa aaacgaaagc ggtcagcggc 14520
tcgactccgt ggcctggagg ctaagcgaac gggttgggct gcgcgtgtac cccggttcga 14580
atctcgaatc aggctggagc cgcagctaac gtggtattgg cactcccgtc tcgacccaag 14640
cctgcaccaa ccctccagga tacggaggcg ggtcgttttg caactttttt ttggaggccg 14700
gatgagacta gtaagcgcgg aaagcggccg accgcgatgg ctcgctgccg tagtctggag 14760
aagaatcgcc agggttgcgt tgcggtgtgc cccggttcga ggccggccgg attccgcggc 14820
taacgagggc gtggctgccc cgtcgtttcc aagaccccat agccagccga cttctccagt 14880
tacggagcga gcccctcttt tgttttgttt gtttttgcca gatgcatccc gtactgcggc 14940
agatgcgccc ccaccaccct ccaccgcaac aacagccccc tccacagccg gcgcttctgc 15000
ccccgcccca gcagcaactt ccagccacga ccgccgcggc cgccgtgagc ggggctggac 15060
agagttatga tcaccagctg gccttggaag agggcgaggg gctggcgcgc ctgggggcgt 15120
cgtcgccgga gcggcacccg cgcgtgcaga tgaaaaggga cgctcgcgag gcctacgtgc 15180
ccaagcagaa cctgttcaga gacaggagcg gcgaggagcc cgaggagatg cgcgcggccc 15240
ggttccacgc ggggcgggag ctgcggcgcg gcctggaccg aaagagggtg ctgagggacg 15300
aggatttcga ggcggacgag ctgacgggga tcagccccgc gcgcgcgcac gtggccgcgg 15360
ccaacctggt cacggcgtac gagcagaccg tgaaggagga gagcaacttc caaaaatcct 15420
tcaacaacca cgtgcgcacc ctgatcgcgc gcgaggaggt gaccctgggc ctgatgcacc 15480
tgtgggacct gctggaggcc atcgtgcaga accccaccag caagccgctg acggcgcagc 15540
tgttcctggt ggtgcagcat agtcgggaca acgaagcgtt cagggaggcg ctgctgaata 15600
tcaccgagcc cgagggccgc tggctcctgg acctggtgaa cattctgcag agcatcgtgg 15660
tgcaggagcg cgggctgccg ctgtccgaga agctggcggc catcaacttc tcggtgctga 15720
gtttgggcaa gtactacgct aggaagatct acaagacccc gtacgtgccc atagacaagg 15780
aggtgaagat cgacgggttt tacatgcgca tgaccctgaa agtgctgacc ctgagcgacg 15840
atctgggggt gtaccgcaac gacaggatgc accgtgcggt gagcgccagc aggcggcgcg 15900
agctgagcga ccaggagctg atgcatagtc tgcagcgggc cctgaccggg gccgggaccg 15960
agggggagag ctactttgac atgggcgcgg acctgcactg gcagcccagc cgccgggcct 16020
tggaggcggc ggcaggaccc tacgtagaag aggtggacga tgaggtggac gaggagggcg 16080
agtacctgga agactgatgg cgcgaccgta tttttgctag atgcaacaac aacagccacc 16140
tcctgatccc gcgatgcggg cggcgctgca gagccagccg tccggcatta actcctcgga 16200
cgattggacc caggccatgc aacgcatcat ggcgctgacg acccgcaacc ccgaagcctt 16260
tagacagcag ccccaggcca accggctctc ggccatcctg gaggccgtgg tgccctcgcg 16320
ctccaacccc acgcacgaga aggtcctggc catcgtgaac gcgctggtgg agaacaaggc 16380
catccgcggc gacgaggccg gcctggtgta caacgcgctg ctggagcgcg tggcccgcta 16440
caacagcacc aacgtgcaga ccaacctgga ccgcatggtg accgacgtgc gcgaggccgt 16500
ggcccagcgc gagcggttcc accgcgagtc caacctggga tccatggtgg cgctgaacgc 16560
cttcctcagc acccagcccg ccaacgtgcc ccggggccag gaggactaca ccaacttcat 16620
cagcgccctg cgcctgatgg tgaccgaggt gccccagagc gaggtgtacc agtccgggcc 16680
ggactacttc ttccagacca gtcgccaggg cttgcagacc gtgaacctga gccaggcttt 16740
caagaacttg cagggcctgt ggggcgtgca ggccccggtc ggggaccgcg cgacggtgtc 16800
gagcctgctg acgccgaact cgcgcctgct gctgctgctg gtggccccct tcacggacag 16860
cggcagcatc aaccgcaact cgtacctggg ctacctgatt aacctgtacc gcgaggccat 16920
cggccaggcg cacgtggacg agcagaccta ccaggagatc acccacgtga gccgcgccct 16980
gggccaggac gacccgggca acctggaagc caccctgaac tttttgctga ccaaccggtc 17040
gcagaagatc ccgccccagt acgcgctcag caccgaggag gagcgcatcc tgcgttacgt 17100
gcagcagagc gtgggcctgt tcctgatgca ggagggggcc acccccagcg ccgcgctcga 17160
catgaccgcg cgcaacatgg agcccagcat gtacgccagc aaccgcccgt tcatcaataa 17220
actgatggac tacttgcatc gggcggccgc catgaactct gactatttca ccaacgccat 17280
cctgaatccc cactggctcc cgccgccggg gttctacacg ggcgagtacg acatgcccga 17340
ccccaatgac gggttcctgt gggacgatgt ggacagcagc gtgttctccc cccgaccggg 17400
tgctaacgag cgccccttgt ggaagaagga aggcagcgac cgacgcccgt cctcggcgct 17460
gtccggccgc gagggtgctg ccgcggcggt gcccgaggcc gccagtcctt tcccgagctt 17520
gcccttctcg ctgaacagta tccgcagcag cgagctgggc aggatcacgc gcccgcgctt 17580
gctgggcgaa gaggagtact tgaatgactc gctgttgaga cccgagcggg agaagaactt 17640
ccccaataac gggatagaaa gcctggtgga caagatgagc cgctggaaga cgtatgcgca 17700
ggagcacagg gacgatcccc gggcgtcgca gggggccacg agccggggca gcgccgcccg 17760
taaacgccgg tggcacgaca ggcagcgggg acagatgtgg gacgatgagg actccgccga 17820
cgacagcagc gtgttggact tgggtgggag tggtaacccg ttcgctcacc tgcgcccccg 17880
tatcgggcgc atgatgtaag agaaaccgaa aataaatgat actcaccaag gccatggcga 17940
ccagcgtgcg ttcgtttctt ctctgttgtt gttgtatcta gtatgatgag gcgtgcgtac 18000
ccggagggtc ctcctccctc gtacgagagc gtgatgcagc aggcgatggc ggcggcggcg 18060
atgcagcccc cgctggaggc tccttacgtg cccccgcggt acctggcgcc tacggagggg 18120
cggaacagca ttcgttactc ggagctggca cccttgtacg ataccacccg gttgtacctg 18180
gtggacaaca agtcggcgga catcgcctcg ctgaactacc agaacgacca cagcaacttc 18240
ctgaccaccg tggtgcagaa caatgacttc acccccacgg aggccagcac ccagaccatc 18300
aactttgacg agcgctcgcg gtggggcggc cagctgaaaa ccatcatgca caccaacatg 18360
cccaacgtga acgagttcat gtacagcaac aagttcaagg cgcgggtgat ggtctcccgc 18420
aagaccccca atggggtgac agtgacagag gattatgatg gtagtcagga tgagctgaag 18480
tatgaatggg tggaatttga gctgcccgaa ggcaacttct cggtgaccat gaccatcgac 18540
ctgatgaaca acgccatcat cgacaattac ttggcggtgg ggcggcagaa cggggtgctg 18600
gagagcgaca tcggcgtgaa gttcgacact aggaacttca ggctgggctg ggaccccgtg 18660
accgagctgg tcatgcccgg ggtgtacacc aacgaggctt tccatcccga tattgtcttg 18720
ctgcccggct gcggggtgga cttcaccgag agccgcctca gcaacctgct gggcattcgc 18780
aagaggcagc ccttccagga aggcttccag atcatgtacg aggatctgga ggggggcaac 18840
atccccgcgc tcctggatgt cgacgcctat gagaaaagca aggaggatgc agcagctgaa 18900
gcaactgcag ccgtagctac cgcctctacc gaggtcaggg gcgataattt tgcaagcgcc 18960
gcagcagtgg cagcggccga ggcggctgaa accgaaagta agatagtcat tcagccggtg 19020
gagaaggata gcaagaacag gagctacaac gtactaccgg acaagataaa caccgcctac 19080
cgcagctggt acctagccta caactatggc gaccccgaga agggcgtgcg ctcctggacg 19140
ctgctcacca cctcggacgt cacctgcggc gtggagcaag tctactggtc gctgcccgac 19200
atgatgcaag acccggtcac cttccgctcc acgcgtcaag ttagcaacta cccggtggtg 19260
ggcgccgagc tcctgcccgt ctactccaag agcttcttca acgagcaggc cgtctactcg 19320
cagcagctgc gcgccttcac ctcgcttacg cacgtcttca accgcttccc cgagaaccag 19380
atcctcgtcc gcccgcccgc gcccaccatt accaccgtca gtgaaaacgt tcctgctctc 19440
acagatcacg ggaccctgcc gctgcgcagc agtatccggg gagtccagcg cgtgaccgtt 19500
actgacgcca gacgccgcac ctgcccctac gtctacaagg ccctgggcat agtcgcgccg 19560
cgcgtcctct cgagccgcac cttctaaatg tccattctca tctcgcccag taataacacc 19620
ggttggggcc tgcgcgcgcc cagcaagatg tacggaggcg ctcgccaacg ctccacgcaa 19680
caccccgtgc gcgtgcgcgg gcacttccgc gctccctggg gcgccctcaa gggccgcgtg 19740
cggtcgcgca ccaccgtcga cgacgtgatc gaccaggtgg tggccgacgc gcgcaactac 19800
acccccgccg ccgcgcccgt ctccaccgtg gacgccgtca tcgacagcgt ggtggccgac 19860
gcgcgccggt acgcccgcgc caagagccgg cggcggcgca tcgcccggcg gcaccggagc 19920
acccccgcca tgcgcgcggc gcgagccttg ctgcgcaggg ccaggcgcac gggacgcagg 19980
gccatgctca gggcggccag acgcgcggct tcaggcgcca gcgccggcag gacccggaga 20040
cgcgcggcca cggcggcggc agcggccatc gccagcatgt cccgcccgcg gcgagggaac 20100
gtgtactggg tgcgcgacgc cgccaccggt gtgcgcgtgc ccgtgcgcac ccgcccccct 20160
cgcacttgaa gatgttcact tcgcgatgtt gatgtgtccc agcggcgagg aggatgtcca 20220
agcgcaaatt caaggaagag atgctccagg tcatcgcgcc tgagatctac ggccctgcgg 20280
tggtgaagga ggaaagaaag ccccgcaaaa tcaagcgggt caaaaaggac aaaaaggaag 20340
aagaaagtga tgtggacgga ttggtggagt ttgtgcgcga gttcgccccc cggcggcgcg 20400
tgcagtggcg cgggcggaag gtgcaaccgg tgctgagacc cggcaccacc gtggtcttca 20460
cgcccggcga gcgctccggc accgcttcca agcgctccta cgacgaggtg tacggggatg 20520
atgatattct ggagcaggcg gccgagcgcc tgggcgagtt tgcttacggc aagcgcagcc 20580
gttccgcacc gaaggaagag gcggtgtcca tcccgctgga ccacggcaac cccacgccga 20640
gcctcaagcc cgtgaccttg cagcaggtgc tgccgaccgc ggcgccgcgc cgggggttca 20700
agcgcgaggg cgaggatctg taccccacca tgcagctgat ggtgcccaag cgccagaagc 20760
tggaagacgt gctggagacc atgaaggtgg acccggacgt gcagcccgag gtcaaggtgc 20820
ggcccatcaa gcaggtggcc ccgggcctgg gcgtgcagac cgtggacatc aagattccca 20880
cggagcccat ggaaacgcag accgagccca tgatcaagcc cagcaccagc accatggagg 20940
tgcagacgga tccctggatg ccatcggctc ctagtcgaag accccggcgc aagtacggcg 21000
cggccagcct gctgatgccc aactacgcgc tgcatccttc catcatcccc acgccgggct 21060
accgcggcac gcgcttctac cgcggtcata ccagcagccg ccgccgcaag accaccactc 21120
gccgccgccg tcgccgcacc gccgctgcaa ccacccctgc cgccctggtg cggagagtgt 21180
accgccgcgg ccgcgcacct ctgaccctgc cgcgcgcgcg ctaccacccg agcatcgcca 21240
tttaaacttt cgcctgcttt gcagatcaat ggccctcaca tgccgccttc gcgttcccat 21300
tacgggctac cgaggaagaa aaccgcgccg tagaaggctg gcggggaacg ggatgcgtcg 21360
ccaccaccac cggcggcggc gcgccatcag caagcggttg gggggaggct tcctgcccgc 21420
gctgatcccc atcatcgccg cggcgatcgg ggcgatcccc ggcattgctt ccgtggcggt 21480
gcaggcctct cagcgccact gagacacact tggaaacatc ttgtaataaa ccaatggact 21540
ctgacgctcc tggtcctgtg atgtgttttc gtagacagat ggaagacatc aatttttcgt 21600
ccctggctcc gcgacacggc acgcggccgt tcatgggcac ctggagcgac atcggcacca 21660
gccaactgaa cgggggcgcc ttcaattgga gcagtctctg gagcgggctt aagaatttcg 21720
ggtccacgct taaaacctat ggcagcaagg cgtggaacag caccacaggg caggcgctga 21780
gggataagct gaaagagcag aacttccagc agaaggtggt cgatgggctc gcctcgggca 21840
tcaacggggt ggtggacctg gccaaccagg ccgtgcagcg gcagatcaac agccgcctgg 21900
acccggtgcc gcccgccggc tccgtggaga tgccgcaggt ggaggaggag ctgcctcccc 21960
tggacaagcg gggcgagaag cgaccccgcc ccgatgcgga ggagacgctg ctgacgcaca 22020
cggacgagcc gcccccgtac gaggaggcgg tgaaactggg tctgcccacc acgcggccca 22080
tcgcgcccct ggccaccggg gtgctgaaac ccgaaaagcc cgcgaccctg gacttgcctc 22140
ctccccagcc ttcccgcccc tctacagtgg ctaagcccct gccgccggtg gccgtggccc 22200
gcgcgcgacc cgggggcacc gcccgccctc atgcgaactg gcagagcact ctgaacagca 22260
tcgtgggtct gggagtgcag agtgtgaagc gccgccgctg ctattaaacc taccgtagcg 22320
cttaacttgc ttgtctgtgt gtgtatgtat tatgtcgccg ccgccgctgt ccaccagaag 22380
gaggagtgaa gaggcgcgtc gccgagttgc aagatggcca ccccatcgat gctgccccag 22440
tgggcgtaca tgcacatcgc cggacaggac gcttcggagt acctgagtcc gggtctggtg 22500
cagtttgccc gcgccacaga cacctacttc agtctgggga acaagtttag gaaccccacg 22560
gtggcgccca cgcacgatgt gaccaccgac cgcagccagc ggctgacgct gcgcttcgtg 22620
cccgtggacc gcgaggacaa cacctactcg tacaaagtgc gctacacgct ggccgtgggc 22680
gacaaccgcg tgctggacat ggccagcacc tactttgaca tccgcggcgt gctggatcgg 22740
ggccctagct tcaaacccta ctccggcacc gcctacaaca gtctggcccc caagggagca 22800
cccaacactt gtcagtggac atataaagcc gatggtgaaa ctgccacaga aaaaacctat 22860
acatatggaa atgcacccgt gcagggcatt aacatcacaa aagatggtat tcaacttgga 22920
actgacaccg atgatcagcc aatctacgca gataaaacct atcagcctga acctcaagtg 22980
ggtgatgctg aatggcatga catcactggt actgatgaaa agtatggagg cagagctctt 23040
aagcctgata ccaaaatgaa gccttgttat ggttcttttg ccaagcctac taataaagaa 23100
ggaggtcagg caaatgtgaa aacaggaaca ggcactacta aagaatatga catagacatg 23160
gctttctttg acaacagaag tgcggctgct gctggcctag ctccagaaat tgttttgtat 23220
actgaaaatg tggatttgga aactccagat acccatattg tatacaaagc aggcacagat 23280
gacagcagct cttctattaa tttgggtcag caagccatgc ccaacagacc taactacatt 23340
ggtttcagag acaactttat cgggctcatg tactacaaca gcactggcaa tatgggggtg 23400
ctggccggtc aggcttctca gctgaatgct gtggttgact tgcaagacag aaacaccgag 23460
ctgtcctacc agctcttgct tgactctctg ggtgacagaa cccggtattt cagtatgtgg 23520
aatcaggcgg tggacagcta tgatcctgat gtgcgcatta ttgaaaatca tggtgtggag 23580
gatgaacttc ccaactattg tttccctctg gatgctgttg gcagaacaga tacttatcag 23640
ggaattaagg ctaatggaac tgatcaaacc acatggacca aagatgacag tgtcaatgat 23700
gctaatgaga taggcaaggg taatccattc gccatggaaa tcaacatcca agccaacctg 23760
tggaggaact tcctctacgc caacgtggcc ctgtacctgc ccgactctta caagtacacg 23820
ccggccaatg ttaccctgcc caccaacacc aacacctacg attacatgaa cggccgggtg 23880
gtggcgccct cgctggtgga ctcctacatc aacatcgggg cgcgctggtc gctggatccc 23940
atggacaacg tgaacccctt caaccaccac cgcaatgcgg ggctgcgcta ccgctccatg 24000
ctcctgggca acgggcgcta cgtgcccttc cacatccagg tgccccagaa atttttcgcc 24060
atcaagagcc tcctgctcct gcccgggtcc tacacctacg agtggaactt ccgcaaggac 24120
gtcaacatga tcctgcagag ctccctcggc aacgacctgc gcacggacgg ggcctccatc 24180
tccttcacca gcatcaacct ctacgccacc ttcttcccca tggcgcacaa cacggcctcc 24240
acgctcgagg ccatgctgcg caacgacacc aacgaccagt ccttcaacga ctacctctcg 24300
gcggccaaca tgctctaccc catcccggcc aacgccacca acgtgcccat ctccatcccc 24360
tcgcgcaact gggccgcctt ccgcggctgg tccttcacgc gtctcaagac caaggagacg 24420
ccctcgctgg gctccgggtt cgacccctac ttcgtctact cgggctccat cccctacctc 24480
gacggcacct tctacctcaa ccacaccttc aagaaggtct ccatcacctt cgactcctcc 24540
gtcagctggc ccggcaacga ccggctcctg acgcccaacg agttcgaaat caagcgcacc 24600
gtcgacggcg agggctacaa cgtggcccag tgcaacatga ccaaggactg gttcctggtc 24660
cagatgctgg cccactacaa catcggctac cagggcttct acgtgcccga gggctacaag 24720
gaccgcatgt actccttctt ccgcaacttc cagcccatga gccgccaggt ggtggacgag 24780
gtcaactaca aggactacca ggccgtcacc ctggcctacc agcacaacaa ctcgggcttc 24840
gtcggctacc tcgcgcccac catgcgccag ggccagccct accccgccaa ctacccctac 24900
ccgctcatcg gcaagagcgc cgtcaccagc gtcacccaga aaaagttcct ctgcgacagg 24960
gtcatgtggc gcatcccctt ctccagcaac ttcatgtcca tgggcgcgct caccgacctc 25020
ggccagaaca tgctctatgc caactccgcc cacgcgctag acatgaattt cgaagtcgac 25080
cccatggatg agtccaccct tctctatgtt gtcttcgaag tcttcgacgt cgtccgagtg 25140
caccagcccc accgcggcgt catcgaggcc gtctacctgc gcaccccctt ctcggccggt 25200
aacgccacca cctaagctct tgcttcttgc aagccatggc cgcgggctcc ggcgagcagg 25260
agctcagggc catcatccgc gacctgggct gcgggcccta cttcctgggc accttcgata 25320
agcgcttccc gggattcatg gccccgcaca agctggcctg cgccatcgtc aacacggccg 25380
gccgcgagac cgggggcgag cactggctgg ccttcgcctg gaacccgcgc tcgaacacct 25440
gctacctctt cgaccccttc gggttctcgg acgagcgcct caagcagatc taccagttcg 25500
agtacgaggg cctgctgcgc cgcagcgccc tggccaccga ggaccgctgc gtcaccctgg 25560
aaaagtccac ccagaccgtg cagggtccgc gctcggccgc ctgcgggctc ttctgctgca 25620
tgttcctgca cgccttcgtg cactggcccg accgccccat ggacaagaac cccaccatga 25680
acttgctgac gggggtgccc aacggcatgc tccagtcgcc ccaggtggaa cccaccctgc 25740
gccgcaacca ggaggcgctc taccgcttcc tcaactccca ctccgcctac tttcgctccc 25800
accgcgcgcg catcgagaag gccaccgcct tcgaccgcat gaatcaagac atgtaaaccg 25860
tgtgtgtatg ttaaatgtct ttaataaaca gcactttcat gttacacatg catctgagat 25920
gatttattta gaaatcgaaa gggttctgcc gggtctcggc atggcccgcg ggcagggaca 25980
cgttgcggaa ctggtacttg gccagccact tgaactcggg gatcagcagt ttgggcagcg 26040
gggtgtcggg gaaggagtcg gtccacagct tccgcgtcag ttgcagggcg cccagcaggt 26100
cgggcgcgga gatcttgaaa tcgcagttgg gacccgcgtt ctgcgcgcgg gagttgcggt 26160
acacggggtt gcagcactgg aacaccatca gggccgggtg cttcacgctc gccagcaccg 26220
tcgcgtcggt gatgctctcc acgtcgaggt cctcggcgtt ggccatcccg aagggggtca 26280
tcttgcaggt ctgccttccc atggtgggca cgcacccggg cttgtggttg caatcgcagt 26340
gcagggggat cagcatcatc tgggcctggt cggcgttcat ccccgggtac atggccttca 26400
tgaaagcctc caattgcctg aacgcctgct gggccttggc tccctcggtg aagaagaccc 26460
cgcaggactt gctagagaac tggttggtgg cgcacccggc gtcgtgcacg cagcagcgcg 26520
cgtcgttgtt ggccagctgc accacgctgc gcccccagcg gttctgggtg atcttggccc 26580
ggtcggggtt ctccttcagc gcgcgctgcc cgttctcgct cgccacatcc atctcgatca 26640
tgtgctcctt ctggatcatg gtggtcccgt gcaggcaccg cagcttgccc tcggcctcgg 26700
tgcacccgtg cagccacagc gcgcacccgg tgcactccca gttcttgtgg gcgatctggg 26760
aatgcgcgtg cacgaagccc tgcaggaagc ggcccatcat ggtggtcagg gtcttgttgc 26820
tagtgaaggt cagcggaatg ccgcggtgct cctcgttgat gtacaggtgg cagatgcggc 26880
ggtacacctc gccctgctcg ggcatcagct ggaagttggc tttcaggtcg gtctccacgc 26940
ggtagcggtc catcagcata gtcatgattt ccataccctt ctcccaggcc gagacgatgg 27000
gcaggctcat agggttcttc accatcatct tagcgctagc agccgcggcc agggggtcgc 27060
tctcgtccag ggtctcaaag ctccgcttgc cgtccttctc ggtgatccgc accggggggt 27120
agctgaagcc cacggccgcc agctcctcct cggcctgtct ttcgtcctcg ctgtcctggc 27180
tgacgtcctg caggaccaca tgcttggtct tgcggggttt cttcttgggc ggcagcggcg 27240
gcggagatgt tggagatggc gagggggagc gcgagttctc gctcaccact actatctctt 27300
cctcttcttg gtccgaggcc acgcggcggt aggtatgtct cttcgggggc agaggcggag 27360
gcgacgggct ctcgccgccg cgacttggcg gatggctggc agagcccctt ccgcgttcgg 27420
gggtgcgctc ccggcggcgc tctgactgac ttcctccgcg gccggccatt gtgttctcct 27480
agggaggaac aacaagcatg gagactcagc catcgccaac ctcgccatct gcccccaccg 27540
ccgacgagaa gcagcagcag cagaatgaaa gcttaaccgc cccgccgccc agccccgcca 27600
cctccgacgc ggccgtccca gacatgcaag agatggagga atccatcgag attgacctgg 27660
gctatgtgac gcccgcggag cacgaggagg agctggcagt gcgcttttca caagaagaga 27720
tacaccaaga acagccagag caggaagcag agaatgagca gagtcaggct gggctcgagc 27780
atgacggcga ctacctccac ctgagcgggg gggaggacgc gctcatcaag catctggccc 27840
ggcaggccac catcgtcaag gatgcgctgc tcgaccgcac cgaggtgccc ctcagcgtgg 27900
aggagctcag ccgcgcctac gagttgaacc tcttctcgcc gcgcgtgccc cccaagcgcc 27960
agcccaatgg cacctgcgag cccaacccgc gcctcaactt ctacccggtc ttcgcggtgc 28020
ccgaggccct ggccacctac cacatctttt tcaagaacca aaagatcccc gtctcctgcc 28080
gcgccaaccg cacccgcgcc gacgcccttt tcaacctggg tcccggcgcc cgcctacctg 28140
atatcgcctc cttggaagag gttcccaaga tcttcgaggg tctgggcagc gacgagactc 28200
gggccgcgaa cgctctgcaa ggagaaggag gagagcatga gcaccacagc gccctggtcg 28260
agttggaagg cgacaacgcg cggctggcgg tgctcaaacg cacggtcgag ctgacccatt 28320
tcgcctaccc ggctctgaac ctgcccccca aagtcatgag cgcggtcatg gaccaggtgc 28380
tcatcaagcg cgcgtcgccc atctccgagg acgagggcat gcaagactcc gaggagggca 28440
agcccgtggt cagcgacgag cagctggccc ggtggctggg tcctaatgct agtccccaga 28500
gtttggaaga gcggcgcaaa ctcatgatgg ccgtggtcct ggtgaccgtg gagctggagt 28560
gcctgcgccg cttcttcgcc gacgcggaga ccctgcgcaa ggtcgaggag aacctgcact 28620
acctcttcag gcacgggttc gtgcgccagg cctgcaagat ctccaacgtg gagctgacca 28680
acctggtctc ctacatgggc atcttgcacg agaaccgcct ggggcagaac gtgctgcaca 28740
ccaccctgcg cggggaggcc cggcgcgact acatccgcga ctgcgtctac ctctacctct 28800
gccacacctg gcagacgggc atgggcgtgt ggcagcagtg tctggaggag cagaacctga 28860
aagagctctg caagctcctg cagaagaacc tcaagggtct gtggaccggg ttcgacgagc 28920
gcaccaccgc ctcggacctg gccgacctca ttttccccga gcgcctcagg ctgacgctgc 28980
gcaacggcct gcccgacttt atgagccaaa gcatgttgca aaactttcgc tctttcatcc 29040
tcgaacgctc cggaatcctg cccgccacct gctccgcgct gccctcggac ttcgtgccgc 29100
tgaccttccg cgagtgcccc ccgccgctgt ggagccactg ctacctgctg cgcctggcca 29160
actacctggc ctaccactcg gacgtgatcg aggacgtcag cggcgagggc ctgctcgagt 29220
gccactgccg ctgcaacctc tgcacgccgc accgctccct ggcctgcaac ccccagctgc 29280
tgagcgagac ccagatcatc ggcaccttcg agttgcaagg gcccagcgaa ggcgagggtt 29340
cagccgccaa ggggggtctg aaactcaccc cggggctgtg gacctcggcc tacttgcgca 29400
agttcgtgcc cgaggactac catcccttcg agatcaggtt ctacgaggac caatcccatc 29460
cgcccaaggc cgagctgtcg gcctgcgtca tcacccaggg ggcgatcctg gcccaattgc 29520
aagccatcca gaaatcccgc caagaattct tgctgaaaaa gggccgcggg gtctacctcg 29580
acccccagac cggtgaggag ctcaaccccg gcttccccca ggatgccccg aggaaacaag 29640
aagctgaaag tggagctgcc gcccgtggag gatttggagg aagactggga gaacagcagt 29700
caggcagagg aggaggagat ggaggaagac tgggacagca ctcaggcaga ggaggacagc 29760
ctgcaagaca gtctggagga agacgaggag gaggcagagg aggaggtgga agaagcagcc 29820
gccgccagac cgtcgtcctc ggcgggggag aaagcaagca gcacggatac catctccgct 29880
ccgggtcggg gtcccgctcg accacacagt agatgggacg agaccggacg attcccgaac 29940
cccaccaccc agaccggtaa gaaggagcgg cagggataca agtcctggcg ggggcacaaa 30000
aacgccatcg tctcctgctt gcaggcctgc gggggcaaca tctccttcac ccggcgctac 30060
ctgctcttcc accgcggggt gaactttccc cgcaacatct tgcattacta ccgtcacctc 30120
cacagcccct actacttcca agaagaggca gcagcagcag aaaaagacca gcagaaaacc 30180
agcagctaga aaatccacag cggcggcagc aggtggactg aggatcgcgg cgaacgagcc 30240
ggcgcaaacc cgggagctga ggaaccggat ctttcccacc ctctatgcca tcttccagca 30300
gagtcggggg caggagcagg aactgaaagt caagaaccgt tctctgcgct cgctcacccg 30360
cagttgtctg tatcacaaga gcgaagacca acttcagcgc actctcgagg acgccgaggc 30420
tctcttcaac aagtactgcg cgctcactct taaagagtag cccgcgcccg cccagtcgca 30480
gaaaaaggcg ggaattacgt cacctgtgcc cttcgcccta gccgcctcca cccatcatca 30540
tgagcaaaga gattcccacg ccttacatgt ggagctacca gccccagatg ggcctggccg 30600
ccggtgccgc ccaggactac tccacccgca tgaattggct cagcgccggg cccgcgatga 30660
tctcacgggt gaatgacatc cgcgcccacc gaaaccagat actcctagaa cagtcagcgc 30720
tcaccgccac gccccgcaat cacctcaatc cgcgtaattg gcccgccgcc ctggtgtacc 30780
aggaaattcc ccagcccacg accgtactac ttccgcgaga cgcccaggcc gaagtccagc 30840
tgactaactc aggtgtccag ctggcgggcg gcgccaccct gtgtcgtcac cgccccgctc 30900
agggtataaa gcggctggtg atccggggca gaggcacaca gctcaacgac gaggtggtga 30960
gctcttcgct gggtctgcga cctgacggag tcttccaact cgccggatcg gggagatctt 31020
ccttcacgcc tcgtcaggcc gtcctgactt tggagagttc gtcctcgcag ccccgctcgg 31080
gtggcatcgg cactctccag ttcgtggagg agttcactcc ctcggtctac ttcaacccct 31140
tctccggctc ccccggccac tacccggacg agttcatccc gaacttcgac gccatcagcg 31200
agtcggtgga cggctacgat tgaatgtccc atggtggcgc agctgaccta gctcggcttc 31260
gacacctgga ccactgccgc cgcttccgct gcttcgctcg ggatctcgcc gagtttgcct 31320
actttgagct gcccgaggag caccctcagg gcccggccca cggagtgcgg atcgtcgtcg 31380
aagggggcct cgactcccac ctgcttcgga tcttcagcca gcgtccgatc ctggtcgagc 31440
gcgagcaagg acagaccctt ctgactctgt actgcatctg caaccacccc ggcctgcatg 31500
aaagtctttg ttgtctgctg tgtactgagt ataataaaag ctgagatcag cgactactcc 31560
ggacttccgt gtgtttaaac tcaccccctt atccagtgaa ataaagatca tattgatgat 31620
gattttacag aaataaaaaa taatcatttg atttgaaata aagatacaat catattgatg 31680
atttgagttt aacaaaaaaa taaagaatca cttacttgaa atctgatacc aggtctctgt 31740
ccatgttttc tgccaacacc acttcactcc cctcttccca gctctggtac tgcaggcccc 31800
ggcgggctgc aaacttcctc cacacgctga aggggatgtc aaattcctcc tgtccctcaa 31860
tcttcatttt atcttctatc agatgtccaa aaagcgcgtc cgggtggatg atgacttcga 31920
ccccgtctac ccctacgatg cagacaacgc accgaccgtg cccttcatca accccccctt 31980
cgtctcttca gatggattcc aagagaagcc cctgggggtg ttgtccctgc gactggccga 32040
ccccgtcacc accaagaacg gggaaatcac cctcaagctg ggagaggggg tggacctcga 32100
ttcctcggga aaactcatct ccaacacggc caccaaggcc gccgcccctc tcagtttttc 32160
caacaacacc atttccctta acatggatca ccccttttac actaaagatg gaaaattatc 32220
cttacaagtt tctccaccat taaatatact gagaacaagc attctaaaca cactagcttt 32280
aggttttgga tcaggtttag gactccgtgg ctctgccttg gcagtacagt tagtctctcc 32340
acttacattt gatactgatg gaaacataaa gcttacctta gacagaggtt tgcatgttac 32400
aacaggagat gcaattgaaa gcaacataag ctgggctaaa ggtttaaaat ttgaagatgg 32460
agccatagca accaacattg gaaatgggtt agagtttgga agcagtagta cagaaacagg 32520
tgttgatgat gcttacccaa tccaagttaa acttggatct ggccttagct ttgacagtac 32580
aggagccata atggctggta acaaagaaga cgataaactc actttgtgga caacacctga 32640
tccatcacca aactgtcaaa tactcgcaga aaatgatgca aaactaacac tttgcttgac 32700
taaatgtggt agtcaaatac tggccactgt gtcagtctta gttgtaggaa gtggaaacct 32760
aaaccccatt actggcaccg taagcagtgc tcaggtgttt ctacgttttg atgcaaacgg 32820
tgttctttta acagaacatt ctacactaaa aaaatactgg gggtataggc agggagatag 32880
catagatggc actccatata ccaatgctgt aggattcatg cccaatttaa aagcttatcc 32940
aaagtcacaa agttctacta ctaaaaataa tatagtaggg caagtataca tgaatggaga 33000
tgtttcaaaa cctatgcttc tcactataac cctcaatggt actgatgaca gcaacagtac 33060
atattcaatg tcattttcat acacctggac taatggaagc tatgttggag caacatttgg 33120
ggctaactct tataccttct catacatcgc ccaagaatga acactgtatc ccaccctgca 33180
tgccaaccct tcccacccca ctctgtggaa caaactctga aacacaaaat aaaataaagt 33240
tcaagtgttt tattgattca acagttttac aggattcgag cagttatttt tcctccaccc 33300
tcccaggaca tggaatacac caccctctcc ccccgcacag ccttgaacat ctgaatgcca 33360
ttggtgatgg acatgctttt ggtctccacg ttccacacag tttcagagcg agccagtctc 33420
gggtcggtca gggagatgaa accctccggg cactcccgca tctgcacctc acagctcaac 33480
agctgaggat tgtcctcggt ggtcgggatc acggttatct ggaagaagca gaagagcggc 33540
ggtgggaatc atagtccgcg aacgggatcg gccggtggtg tcgcatcagg ccccgcagca 33600
gtcgctgccg ccgccgctcc gtcaagctgc tgctcagggg gtccgggtcc agggactccc 33660
tcagcatgat gcccacggcc ctcagcatca gtcgtctggt gcggcgggcg cagcagcgca 33720
tgcggatctc gctcaggtcg ctgcagtacg tgcaacacag aaccaccagg ttgttcaaca 33780
gtccatagtt caacacgctc cagccgaaac tcatcgcggg aaggatgcta cccacgtggc 33840
cgtcgtacca gatcctcagg taaatcaagt ggtgccccct ccagaacacg ctgcccacgt 33900
acatgatctc cttgggcatg tggcggttca ccacctcccg gtaccacatc accctctggt 33960
tgaacatgca gccccggatg atcctgcgga accacagggc cagcaccgcc ccgcccgcca 34020
tgcagcgaag agaccccggg tcccggcaat ggcaatggag gacccaccgc tcgtacccgt 34080
ggatcatctg ggagctgaac aagtctatgt tggcacagca caggcatatg ctcatgcatc 34140
tcttcagcac tctcaactcc tcgggggtca aaaccatatc ccagggcacg gggaactctt 34200
gcaggacagc gaaccccgca gaacagggca atcctcgcac agaacttaca ttgtgcatgg 34260
acagggtatc gcaatcaggc agcaccgggt gatcctccac cagagaagcg cgggtctcgg 34320
tctcctcaca gcgtggtaag ggggccggcc gatacgggtg atggcgggac gcggctgatc 34380
gtgttcgcga ccgtgtcatg atgcagttgc tttcggacat tttcgtactt gctgtagcag 34440
aacctggtcc gggcgctgca caccgatcgc cggcggcggt ctcggcgctt ggaacgctcg 34500
gtgttgaaat tgtaaaacag ccactctctc agaccgtgca gcagatctag ggcctcagga 34560
gtgatgaaga tcccatcatg cctgatggct ctgatcacat cgaccaccgt ggaatgggcc 34620
agacccagcc agatgatgca attttgttgg gtttcggtga cggcggggga gggaagaaca 34680
ggaagaacca tgattaactt ttaatccaaa cggtctcgga gtacttcaaa atgaagatcg 34740
cggagatggc acctctcgcc cccgctgtgt tggtggaaaa taacagccag gtcaaaggtg 34800
atacggttct cgagatgttc cacggtggct tccagcaaag cctccacgcg cacatccaga 34860
aacaagacaa tagcgaaagc gggagggttc tctaattcct caatcatcat gttacactcc 34920
tgcaccatcc ccagataatt ttcatttttc cagccttgaa tgattcgaac tagttcctga 34980
ggtaaatcca agccagccat gataaagagc tcgcgcagag cgccctccac cggcattctt 35040
aagcacaccc tcataattcc aagatattct gctcctggtt cacctgcagc agattgacaa 35100
gcggaatatc aaaatctctg ccgcgatccc tgagctcctc cctcagcaat aactgtaagt 35160
actctttcat atcctctccg aaatttttag ccataggacc accaggaata agattagggc 35220
aagccacagt acagataaac cgaagtcctc cccagtgagc attgccaaat gcaagactgc 35280
tataagcatg ctggctagac ccggtgatat cttccagata actggacaga aaatcgccca 35340
ggcaattttt aagaaaatca acaaaagaaa aatcctccag gtggacgttt agagcctcgg 35400
gaacaacgat gaagtaaatg caagcggtgc gttccagcat ggttagttag ctgatctgta 35460
gaaaaaacaa aaatgaacat taaaccatgc tagcctggcg aacaggtggg taaatcgttc 35520
tctccagcac caggcaggcc acggggtctc cggcgcgacc ctcgtaaaaa ttgtcgctat 35580
gattgaaaac catcacagag agacgttccc ggtggccggc gtgaatgatt cgacaagatg 35640
aatacacccc cggaacattg gcgtccgcga gtgaaaaaaa gcgcccgagg aagcaataag 35700
gcactacaat gctcagtctc aagtccagca aagcgatgcc atgcggatga agcacaaaat 35760
tctcaggtgc gtacaaaatg taattactcc cctcctgcac aggcagcaaa gcccccgatc 35820
cctccaggta cacatacaaa gcctcagcgt ccatagctta ccgagcagca gcacacaaca 35880
ggcgcaagag tcagagaaag gctgagctct aacctgtcca cccgctctct gctcaatata 35940
tagcccagat ctacactgac gtaaaggcca aagtctaaaa atacccgcca aataatcaca 36000
cacgcccagc acacgcccag aaaccggtga cacactcaaa aaaatacgcg cacttcctca 36060
aacgcccaaa actgccgtca tttccgggtt cccacgctac gtcatcaaaa cacgactttc 36120
aaattccgtc gaccgttaaa aacgtcaccc gccccgcccc taacggtcgc ccgtctctca 36180
gccaatcagc gccccgcatc cccaaattca aacacctcat ttgcatatta acgcgcacaa 36240
aaagtttgag gtatattatt gatgatgg 36268
<210> 59
<211> 9750
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 59
ggcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 60
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 120
agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 180
tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg 240
tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat 300
ggcaaaagct tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca 360
tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 420
aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcaaatgcaa ccggcgcagg 480
aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg 540
aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 600
aaatgcttga tggtcggaag aggcataaat tccgtcagcc agtttagtct gaccatctca 660
tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg 720
ggcttcccat acaatcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat 780
ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga gcaagacgtt 840
tcccgttgaa tatggctcat aacacccctt gtattactgt ttatgtaagc agacaggtcg 900
acaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg tacatttata 960
ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt tattaatagt 1020
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 1080
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 1140
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 1200
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt ccgcccccta 1260
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttacggg 1320
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 1380
tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 1440
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 1500
gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 1560
atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 1620
ttgacctcca tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 1680
gaacgcggat tccccgtgcc aagagtgact caccgtccgg atctcagcaa gcaggtatgt 1740
actctccagg gtgggcctgg cttccccagt caagactcca gggatttgag ggacgctgtg 1800
ggctcttctc ttacatgtac cttttgcttg cctcaaccct gactatcttc caggtcagga 1860
tcccagagtc aggggtctgt attttcctgc tggtggctcc agttcaggaa cagtaaaccc 1920
tgctccgaat attgcctctc acatctcgtc aatctccgcg aggactgggg accctgtgac 1980
gaacatggct agcgaatcgc caagcgcacc ccctcatcgg tggtgcatcc cttggcaacg 2040
cctcctcctg accgcctcac tgctgacttt ctggaacccg ccgaccaccg caaagctgac 2100
cattgagagc actcccttca acgtggctga ggggaaggag gtgctgctcc tggtgcacaa 2160
tctgccccag cacctgttcg ggtactcctg gtacaaggga gaacgcgtgg acgggaaccg 2220
gcagatcata ggctacgtca tcggaaccca gcaggccaca cccggtccag cgtacagcgg 2280
ccgggagatt atctacccga acgcctccct gctgatccaa aacatcatcc agaacgacac 2340
cggtttctac actctgcacg tgattaagtc agatctggtc aacgaagagg ccaccggcca 2400
attcagggtg taccccgaac tccctaagcc gttcatcacc tcgaacaaca gcaacccggt 2460
cgaggatgaa gatgcggtgg ccttgacgtg cgaacctgag atccagaaca ccacctactt 2520
gtggtgggtg aacaatcaga gcctgccagt ctccccacga ctccagctgt cgaacgacaa 2580
caggaccctg actttgctgt ccgtgactcg gaacgacgtg ggcccttatg aatgcggtat 2640
ccagaacaag ctgtccgtgg accacagcga ccctgtgatc ctgaacgtcc tttacgggcc 2700
ggacgacccc accatttccc cgtcgtacac ttactaccgg ccgggcgtga acctgtccct 2760
gtcgtgccac gctgcctcca atccgccggc ccagtactcc tggctcatcg acggaaacat 2820
ccagcagcac acccaagaac tgttcatctc caacattacc gagaaaaact cgggacttta 2880
cacctgtcaa gccaacaatt ccgccagcgg ccactcccgc accactgtca aaactatcac 2940
tgtgtccgcc gaactcccga agcccagcat cagctccaac aactcgaagc ccgtggagga 3000
taaggacgct gtcgcgttca cctgtgaacc agaggcacag aataccacct acctttggtg 3060
ggtcaacgga cagtccctgc ctgtctcacc gagactgcag ctgtcaaacg ggaataggac 3120
tctgaccttg tttaacgtca cccggaacga cgcccgggcc tacgtgtgcg gcatccagaa 3180
ctccgtgagc gcaaaccggt ctgacccagt gaccctggat gtgctgtacg gccccgacac 3240
tccgatcatt tcaccccccg attcatccta cctgtccggc gctaacctca acctctcatg 3300
ccactccgca tccaacccca gcccgcaata ttcgtggcgc attaacggaa ttcctcagca 3360
acatacccag gtcctgttca ttgcgaagat cacccctaac aacaacggaa cctacgcctg 3420
ctttgtgtca aacctggcca ctggtagaaa caactccatc gtgaagtcca ttaccgtgtc 3480
ggcgtccgga acttccccgg gcctgagcgc cggcgccacc gtgggaatta tgatcggcgt 3540
gctcgtggga gtggccctga tcggatccgg cgagggcaga ggcagcctgc tgacatgtgg 3600
cgacgtggaa gagaaccctg gccccacccc tggaacccag agccccttct tccttctgct 3660
gctgctgacc gtgctgactg tcgtgacagg ctctggccac gccagctcta cacctggcgg 3720
cgagaaagag acaagcgcca cccagagaag cagcgtgcca agcagcaccg agaagaacgc 3780
cgtgtccatg accagctccg tgctgagcag ccactctcct ggcagcggca gcagcacaac 3840
acagggccag gatgtgacac tggcccctgc cacagaacct gcctctggat ctgccgccac 3900
ctggggacag gacgtgacaa gcgtgccagt gaccagacct gccctgggct ctacaacacc 3960
ccctgcccac gatgtgacca gcgcccctga taacaagcct gcccctggaa gcacagcccc 4020
tccagctcat ggcgtgacct ctgccccaga taccagacca gccccaggat ctacagcccc 4080
acccgcacac ggcgtgacaa gtgcccctga cacaagaccc gctccaggct ctactgctcc 4140
tcctgcccat ggcgtgacaa gcgctcccga tacaaggcca gctcctggct ccacagcacc 4200
accagcacat ggcgtgacat cagctcccga cactagacct gctcccggat caaccgctcc 4260
accagctcac ggcgtgacca gcgcacctga taccagacct gctctgggaa gcaccgcccc 4320
tcccgtgcac aatgtgacat ctgcttccgg cagcgccagc ggctctgcct ctacactggt 4380
gcacaacggc accagcgcca gagccacaac aaccccagcc agcaagagca cccccttcag 4440
catccctagc caccacagcg acacccctac cacactggcc agccactcca ccaagaccga 4500
tgcctctagc acccaccact ccagcgtgcc ccctctgacc agcagcaacc acagcacaag 4560
cccccagctg tctaccggcg tctcattctt ctttctgtcc ttccacatca gcaacctgca 4620
gttcaacagc agcctggaag atcccagcac cgactactac caggaactgc agcgggatat 4680
cagcgagatg ttcctgcaaa tctacaagca gggcggcttc ctgggcctga gcaacatcaa 4740
gttcagaccc ggcagcgtgg tggtgcagct gaccctggct ttccgggaag gcaccatcaa 4800
cgtgcacgac gtggaaaccc agttcaacca gtacaagacc gaggccgcca gccggtacaa 4860
cctgaccatc tccgatgtgt ccgtgtccga cgtgcccttc ccattctctg cccagtctgg 4920
cgcaggcgtg ccaggatggg gaattgctct gctggtgctc gtgtgcgtgc tggtggccct 4980
ggccatcgtg tatctgattg ccctggccgt gtgccagtgc cggcggaaga attacggcca 5040
gctggacatc ttccccgcca gagacaccta ccaccccatg agcgagtacc ccacatacca 5100
cacccacggc agatacgtgc cacccagctc caccgacaga tccccctacg agaaagtgtc 5160
tgccggcaac ggcggcagct ccctgagcta cacaaatcct gccgtggccg ctgcctccgc 5220
caacctggga tccggcacaa tcctgtctga gggcgccacc aacttcagcc tgctgaaact 5280
ggccggcgac gtggaactga accctggccc tggagctgcc ccggagccgg agaggacccc 5340
cgttggccag ggatcgtggg cccatccggg acgcaccagg ggaccatccg acaggggatt 5400
ctgtgtggtg tcaccggcca ggccagcaga agaggcaacc agcctcgagg gagcgttgtc 5460
tggaaccaga cattcccacc cgtcggtggg ccggcagcac cacgcgggac caccgtccac 5520
ttccagaccg ccacggccat gggacacccc ttgcccgcct gtgtatgccg agactaaaca 5580
cttcctgtac tcatccggag acaaggaaca gcttcggccg tccttcctcc tgtcgtcgct 5640
cagaccgagc ctgaccggag cacgcagatt ggtggaaact atcttccttg ggtcacgtcc 5700
gtggatgcca ggtaccccac ggcgcctccc gcgcctccca cagagatact ggcagatgcg 5760
gcctctgttc ctggaattgc tgggaaacca cgctcagtgc ccgtacggag tcctgctcaa 5820
gactcactgc cctctgaggg cggcggtcac tccggcggcc ggagtgtgcg cacgggagaa 5880
gccccaggga agcgtggcag ctccggaaga ggaggacacc gatccgcgcc gcctcgtgca 5940
acttctgcgc cagcactcct cgccctggca agtctacggg ttcgtccgcg cctgcctgcg 6000
ccgcctggtg ccgcctgggc tctggggttc ccggcataac gagcgccgct tcctgagaaa 6060
tactaagaag tttatctcac ttggaaaaca tgccaagttg tcgctgcaag aactcacgtg 6120
gaagatgtca gtccgcgatt gcgcctggct gcgccgctcg ccgggcgtcg ggtgtgttcc 6180
agctgcagaa caccgcctga gagaagaaat tctggccaaa tttctgcatt ggctgatgtc 6240
agtgtacgtg gtcgagctgc tgcgctcctt tttctacgtc actgagacta cctttcaaaa 6300
gaaccgcctg ttcttctacc gcaaatctgt gtggagcaag ctgcagtcaa tcggcattcg 6360
ccagcatctg aagagggtgc agctgcggga actttccgag gcagaagtcc gccagcaccg 6420
ggaggcccgg ccggcgcttc tcacgtcgcg tctgagattc atcccaaagc ccgacgggct 6480
gaggcctatc gtcaacatgg attacgtcgt gggcgctcgc acctttcgcc gtgaaaagcg 6540
ggccgaacgc ttgacctcac gggtgaaggc cctcttctcc gtgctgaact acgagagagc 6600
aagacggcct ggcctgctgg gagcttcggt gctgggactg gacgatatcc accgggcttg 6660
gcggaccttt gttctccggg tgagagccca agaccctccg ccggaactgt acttcgtgaa 6720
ggtggcgatc accggagcct atgatactat tccgcaagat cgactcaccg aagtcatcgc 6780
ctcgatcatc aaaccgcaga acacttactg cgtcaggcgg tacgccgtgg tccagaaggc 6840
cgcgcatggc cacgtgagaa aggcgttcaa gtcgcacgtg tccactctca ccgacctcca 6900
gccttacatg aggcaattcg ttgcgcattt gcaagagact tcgcccctga gagatgcggt 6960
ggtcatcgag cagagctcca gcctgaacga agcgagcagc ggtctgtttg acgtgttcct 7020
ccgcttcatg tgtcatcacg cggtgcgaat caggggaaaa tcatacgtgc agtgccaggg 7080
aatcccacaa ggcagcattc tgtcgactct cttgtgttcc ctttgctacg gcgatatgga 7140
aaacaagctg ttcgctggga tcagacggga cgggttgctg ctcagactgg tggacgactt 7200
cctgctggtg actccgcacc tcactcacgc caaaaccttt ctccgcactc tggtgagggg 7260
agtgccagaa tacggctgtg tggtcaatct ccggaaaact gtggtgaatt tccctgtcga 7320
ggatgaggca ctcggaggaa ccgcatttgt ccaaatgcca gcacatggcc tgttcccatg 7380
gtgcggtctg ctgctggaca cccgaactct tgaagtgcag tccgactact ccagctatgc 7440
ccggacgagc atccgcgcca gcctcacttt caatcgcggc tttaaggccg gacgaaacat 7500
gcgcagaaag cttttcggag tcctccggct taaatgccat tcgctctttc tcgatctcca 7560
agtcaattcg ctgcagaccg tgtgcacgaa catctacaag atcctgctgc tccaagccta 7620
ccggttccac gcttgcgtgc ttcagctgcc gtttcaccaa caggtgtgga agaacccgac 7680
cttctttctg cgggtcatta gcgatactgc ctccctgtgt tactcaatcc tcaaggcaaa 7740
gaacgccgga atgtcgctgg gtgcgaaagg agccgcggga cctcttccta gcgaagcggt 7800
gcagtggctc tgccaccagg ctttcctcct gaagctgacc aggcacagag tgacctacgt 7860
cccgctgctg ggctcgctgc gcactgcaca gacccagctg tctagaaaac tccccggcac 7920
caccctgacc gctctggaag ccgccgccaa cccagcattg ccgtcagatt tcaagaccat 7980
cttggactga agatctgggc cctaacaaaa caaaaagatg gggttattcc ctaaacttca 8040
tgggttacgt aattggaagt tgggggacat tgccacaaga tcatattgta caaaagatca 8100
aacactgttt tagaaaactt cctgtaaaca ggcctattga ttggaaagta tgtcaaagga 8160
ttgtgggtct tttgggcttt gctgctccat ttacacaatg tggatatcct gccttaatgc 8220
ctttgtatgc atgtatacaa gctaaacagg ctttcacttt ctcgccaact tacaaggcct 8280
ttctaagtaa acagtacatg aacctttacc ccgttgctcg gcaacggcct ggtctgtgcc 8340
aagtgtttgc tgacgcaacc cccactggct ggggcttggc cataggccat cagcgcatgc 8400
gtggaacctt tgtggctcct ctgccgatcc atactgcgga actcctagcc gcttgttttg 8460
ctcgcagccg gtctggagca aagctcatag gaactgacaa ttctgtcgtc ctctcgcgga 8520
aatatacatc gtttcgatct acgtatgatc tttttccctc tgccaaaaat tatggggaca 8580
tcatgaagcc ccttgagcat ctgacttctg gctaataaag gaaatttatt ttcattgcaa 8640
tagtgtgttg gaattttttg tgtctctcac tcggaaggaa ttctgcatta atgaatcggc 8700
caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac 8760
tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata 8820
cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa 8880
aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct 8940
gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa 9000
agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg 9060
cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca 9120
cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa 9180
ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg 9240
gtaagacacg acttatcgcc actggcagca gccactggta acaggattag cagagcgagg 9300
tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga 9360
acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc 9420
tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag 9480
attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac 9540
gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc 9600
ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag 9660
taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt 9720
ctatttcgtt catccatagt tgcctgactc 9750
<210> 60
<211> 36262
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 60
ccatcttcaa taatatacct caaacttttt gtgcgcgtta atatgcaaat gaggcgtttg 60
aatttgggga ggaagggcgg tgattggtcg agggatgagc gaccgttagg ggcggggcga 120
gtgacgtttt gatgacgtgg ttgcgaggag gagccagttt gcaagttctc gtgggaaaag 180
tgacgtcaaa cgaggtgtgg tttgaacacg gaaatactca attttcccgc gctctctgac 240
aggaaatgag gtgtttctgg gcggatgcaa gtgaaaacgg gccattttcg cgcgaaaact 300
gaatgaggaa gtgaaaatct gagtaatttc gcgtttatgg cagggaggag tatttgccga 360
gggccgagta gactttgacc gattacgtgg gggtttcgat taccgtgttt ttcacctaaa 420
tttccgcgta cggtgtcaaa gtccggtgtt tttactactg taatagtaat caattacggg 480
gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc 540
gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat 600
agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc 660
ccacttggca gtacatcaag tgtatcatat gccaagtacg ccccctattg acgtcaatga 720
cggtaaatgg cccgcctggc attatgccca gtacatgacc ttatgggact ttcctacttg 780
gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacat 840
caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt 900
caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactc 960
cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg gaggtctata taagcagagc 1020
tgtccctatc agtgatagag atctccctat cagtgataga gagtttagtg aaccgtcaga 1080
tccgctaggg taccgcgatc accatggcta gcgaatcgcc aagcgcaccc cctcatcggt 1140
ggtgcatccc ttggcaacgc ctcctcctga ccgcctcact gctgactttc tggaacccgc 1200
cgaccaccgc aaagctgacc attgagagca ctcccttcaa cgtggctgag gggaaggagg 1260
tgctgctcct ggtgcacaat ctgccccagc acctgttcgg gtactcctgg tacaagggag 1320
aacgcgtgga cgggaaccgg cagatcatag gctacgtcat cggaacccag caggccacac 1380
ccggtccagc gtacagcggc cgggagatta tctacccgaa cgcctccctg ctgatccaaa 1440
acatcatcca gaacgacacc ggtttctaca ctctgcacgt gattaagtca gatctggtca 1500
acgaagaggc caccggccaa ttcagggtgt accccgaact ccctaagccg ttcatcacct 1560
cgaacaacag caacccggtc gaggatgaag atgcggtggc cttgacgtgc gaacctgaga 1620
tccagaacac cacctacttg tggtgggtga acaatcagag cctgccagtc tccccacgac 1680
tccagctgtc gaacgacaac aggaccctga ctttgctgtc cgtgactcgg aacgacgtgg 1740
gcccttatga atgcggtatc cagaacaagc tgtccgtgga ccacagcgac cctgtgatcc 1800
tgaacgtcct ttacgggccg gacgacccca ccatttcccc gtcgtacact tactaccggc 1860
cgggcgtgaa cctgtccctg tcgtgccacg ctgcctccaa tccgccggcc cagtactcct 1920
ggctcatcga cggaaacatc cagcagcaca cccaagaact gttcatctcc aacattaccg 1980
agaaaaactc gggactttac acctgtcaag ccaacaattc cgccagcggc cactcccgca 2040
ccactgtcaa aactatcact gtgtccgccg aactcccgaa gcccagcatc agctccaaca 2100
actcgaagcc cgtggaggat aaggacgctg tcgcgttcac ctgtgaacca gaggcacaga 2160
ataccaccta cctttggtgg gtcaacggac agtccctgcc tgtctcaccg agactgcagc 2220
tgtcaaacgg gaataggact ctgaccttgt ttaacgtcac ccggaacgac gcccgggcct 2280
acgtgtgcgg catccagaac tccgtgagcg caaaccggtc tgacccagtg accctggatg 2340
tgctgtacgg ccccgacact ccgatcattt caccccccga ttcatcctac ctgtccggcg 2400
ctaacctcaa cctctcatgc cactccgcat ccaaccccag cccgcaatat tcgtggcgca 2460
ttaacggaat tcctcagcaa catacccagg tcctgttcat tgcgaagatc acccctaaca 2520
acaacggaac ctacgcctgc tttgtgtcaa acctggccac tggtagaaac aactccatcg 2580
tgaagtccat taccgtgtcg gcgtccggaa cttccccggg cctgagcgcc ggcgccaccg 2640
tgggaattat gatcggcgtg ctcgtgggag tggccctgat cggatccggc gagggcagag 2700
gcagcctgct gacatgtggc gacgtggaag agaaccctgg ccccacccct ggaacccaga 2760
gccccttctt ccttctgctg ctgctgaccg tgctgactgt cgtgacaggc tctggccacg 2820
ccagctctac acctggcggc gagaaagaga caagcgccac ccagagaagc agcgtgccaa 2880
gcagcaccga gaagaacgcc gtgtccatga ccagctccgt gctgagcagc cactctcctg 2940
gcagcggcag cagcacaaca cagggccagg atgtgacact ggcccctgcc acagaacctg 3000
cctctggatc tgccgccacc tggggacagg acgtgacaag cgtgccagtg accagacctg 3060
ccctgggctc tacaacaccc cctgcccacg atgtgaccag cgcccctgat aacaagcctg 3120
cccctggaag cacagcccct ccagctcatg gcgtgacctc tgccccagat accagaccag 3180
ccccaggatc tacagcccca cccgcacacg gcgtgacaag tgcccctgac acaagacccg 3240
ctccaggctc tactgctcct cctgcccatg gcgtgacaag cgctcccgat acaaggccag 3300
ctcctggctc cacagcacca ccagcacatg gcgtgacatc agctcccgac actagacctg 3360
ctcccggatc aaccgctcca ccagctcacg gcgtgaccag cgcacctgat accagacctg 3420
ctctgggaag caccgcccct cccgtgcaca atgtgacatc tgcttccggc agcgccagcg 3480
gctctgcctc tacactggtg cacaacggca ccagcgccag agccacaaca accccagcca 3540
gcaagagcac ccccttcagc atccctagcc accacagcga cacccctacc acactggcca 3600
gccactccac caagaccgat gcctctagca cccaccactc cagcgtgccc cctctgacca 3660
gcagcaacca cagcacaagc ccccagctgt ctaccggcgt ctcattcttc tttctgtcct 3720
tccacatcag caacctgcag ttcaacagca gcctggaaga tcccagcacc gactactacc 3780
aggaactgca gcgggatatc agcgagatgt tcctgcaaat ctacaagcag ggcggcttcc 3840
tgggcctgag caacatcaag ttcagacccg gcagcgtggt ggtgcagctg accctggctt 3900
tccgggaagg caccatcaac gtgcacgacg tggaaaccca gttcaaccag tacaagaccg 3960
aggccgccag ccggtacaac ctgaccatct ccgatgtgtc cgtgtccgac gtgcccttcc 4020
cattctctgc ccagtctggc gcaggcgtgc caggatgggg aattgctctg ctggtgctcg 4080
tgtgcgtgct ggtggccctg gccatcgtgt atctgattgc cctggccgtg tgccagtgcc 4140
ggcggaagaa ttacggccag ctggacatct tccccgccag agacacctac caccccatga 4200
gcgagtaccc cacataccac acccacggca gatacgtgcc acccagctcc accgacagat 4260
ccccctacga gaaagtgtct gccggcaacg gcggcagctc cctgagctac acaaatcctg 4320
ccgtggccgc tgcctccgcc aacctgggat ccggcacaat cctgtctgag ggcgccacca 4380
acttcagcct gctgaaactg gccggcgacg tggaactgaa ccctggccct ggagctgccc 4440
cggagccgga gaggaccccc gttggccagg gatcgtgggc ccatccggga cgcaccaggg 4500
gaccatccga caggggattc tgtgtggtgt caccggccag gccagcagaa gaggcaacca 4560
gcctcgaggg agcgttgtct ggaaccagac attcccaccc gtcggtgggc cggcagcacc 4620
acgcgggacc accgtccact tccagaccgc cacggccatg ggacacccct tgcccgcctg 4680
tgtatgccga gactaaacac ttcctgtact catccggaga caaggaacag cttcggccgt 4740
ccttcctcct gtcgtcgctc agaccgagcc tgaccggagc acgcagattg gtggaaacta 4800
tcttccttgg gtcacgtccg tggatgccag gtaccccacg gcgcctcccg cgcctcccac 4860
agagatactg gcagatgcgg cctctgttcc tggaattgct gggaaaccac gctcagtgcc 4920
cgtacggagt cctgctcaag actcactgcc ctctgagggc ggcggtcact ccggcggccg 4980
gagtgtgcgc acgggagaag ccccagggaa gcgtggcagc tccggaagag gaggacaccg 5040
atccgcgccg cctcgtgcaa cttctgcgcc agcactcctc gccctggcaa gtctacgggt 5100
tcgtccgcgc ctgcctgcgc cgcctggtgc cgcctgggct ctggggttcc cggcataacg 5160
agcgccgctt cctgagaaat actaagaagt ttatctcact tggaaaacat gccaagttgt 5220
cgctgcaaga actcacgtgg aagatgtcag tccgcgattg cgcctggctg cgccgctcgc 5280
cgggcgtcgg gtgtgttcca gctgcagaac accgcctgag agaagaaatt ctggccaaat 5340
ttctgcattg gctgatgtca gtgtacgtgg tcgagctgct gcgctccttt ttctacgtca 5400
ctgagactac ctttcaaaag aaccgcctgt tcttctaccg caaatctgtg tggagcaagc 5460
tgcagtcaat cggcattcgc cagcatctga agagggtgca gctgcgggaa ctttccgagg 5520
cagaagtccg ccagcaccgg gaggcccggc cggcgcttct cacgtcgcgt ctgagattca 5580
tcccaaagcc cgacgggctg aggcctatcg tcaacatgga ttacgtcgtg ggcgctcgca 5640
cctttcgccg tgaaaagcgg gccgaacgct tgacctcacg ggtgaaggcc ctcttctccg 5700
tgctgaacta cgagagagca agacggcctg gcctgctggg agcttcggtg ctgggactgg 5760
acgatatcca ccgggcttgg cggacctttg ttctccgggt gagagcccaa gaccctccgc 5820
cggaactgta cttcgtgaag gtggcgatca ccggagccta tgatactatt ccgcaagatc 5880
gactcaccga agtcatcgcc tcgatcatca aaccgcagaa cacttactgc gtcaggcggt 5940
acgccgtggt ccagaaggcc gcgcatggcc acgtgagaaa ggcgttcaag tcgcacgtgt 6000
ccactctcac cgacctccag ccttacatga ggcaattcgt tgcgcatttg caagagactt 6060
cgcccctgag agatgcggtg gtcatcgagc agagctccag cctgaacgaa gcgagcagcg 6120
gtctgtttga cgtgttcctc cgcttcatgt gtcatcacgc ggtgcgaatc aggggaaaat 6180
catacgtgca gtgccaggga atcccacaag gcagcattct gtcgactctc ttgtgttccc 6240
tttgctacgg cgatatggaa aacaagctgt tcgctgggat cagacgggac gggttgctgc 6300
tcagactggt ggacgacttc ctgctggtga ctccgcacct cactcacgcc aaaacctttc 6360
tccgcactct ggtgagggga gtgccagaat acggctgtgt ggtcaatctc cggaaaactg 6420
tggtgaattt ccctgtcgag gatgaggcac tcggaggaac cgcatttgtc caaatgccag 6480
cacatggcct gttcccatgg tgcggtctgc tgctggacac ccgaactctt gaagtgcagt 6540
ccgactactc cagctatgcc cggacgagca tccgcgccag cctcactttc aatcgcggct 6600
ttaaggccgg acgaaacatg cgcagaaagc ttttcggagt cctccggctt aaatgccatt 6660
cgctctttct cgatctccaa gtcaattcgc tgcagaccgt gtgcacgaac atctacaaga 6720
tcctgctgct ccaagcctac cggttccacg cttgcgtgct tcagctgccg tttcaccaac 6780
aggtgtggaa gaacccgacc ttctttctgc gggtcattag cgatactgcc tccctgtgtt 6840
actcaatcct caaggcaaag aacgccggaa tgtcgctggg tgcgaaagga gccgcgggac 6900
ctcttcctag cgaagcggtg cagtggctct gccaccaggc tttcctcctg aagctgacca 6960
ggcacagagt gacctacgtc ccgctgctgg gctcgctgcg cactgcacag acccagctgt 7020
ctagaaaact ccccggcacc accctgaccg ctctggaagc cgccgccaac ccagcattgc 7080
cgtcagattt caagaccatc ttggactgac gcacctcgag ctgatcataa tcagccatac 7140
cacatttgta gaggttttac ttgctttaaa aaacctccca cacctccccc tgaacctgaa 7200
acataaaatg aatgcaattg ttgttgttaa cttgtttatt gcagcttata atggttacaa 7260
ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 7320
tggtttgtcc aaactcatca atgtatctta ccaggtgccg agcctgcgag tgcggaggga 7380
agcatgccag gttccagccc gtgtgtgtgg atgtgacgga ggacctgcga cccgatcatt 7440
tggtgttgcc ctgcaccggg acggagttcg gttccagcgg ggaagaatct gactagagtg 7500
agtagtgttc tggggcgggg gaggacctgc atgagggcca gaataactga aatctgtgct 7560
tttctgtgtg ttgcagcagc atgagcggaa gcggctcctt tgagggaggg gtattcagcc 7620
cttatctgac ggggcgtctc ccctcctggg cgggagtgcg tcagaatgtg atgggatcca 7680
cggtggacgg ccggcccgtg cagcccgcga actcttcaac cctgacctat gcaaccctga 7740
gctcttcgtc gttggacgca gctgccgccg cagctgctgc atctgccgcc agcgccgtgc 7800
gcggaatggc catgggcgcc ggctactacg gcactctggt ggccaactcg agttccacca 7860
ataatcccgc cagcctgaac gaggagaagc tgttgctgct gatggcccag ctcgaggcct 7920
tgacccagcg cctgggcgag ctgacccagc aggtggctca gctgcaggag cagacgcggg 7980
ccgcggttgc cacggtgaaa tccaaataaa aaatgaatca ataaataaac ggagacggtt 8040
gttgatttta acacagagtc tgaatcttta tttgattttt cgcgcgcggt aggccctgga 8100
ccaccggtct cgatcattga gcacccggtg gatcttttcc aggacccggt agaggtgggc 8160
ttggatgttg aggtacatgg gcatgagccc gtcccggggg tggaggtagc tccattgcag 8220
ggcctcgtgc tcgggggtgg tgttgtaaat cacccagtca tagcaggggc gcagggcatg 8280
gtgttgcaca atatctttga ggaggagact gatggccacg ggcagccctt tggtgtaggt 8340
gtttacaaat ctgttgagct gggagggatg catgcggggg gagatgaggt gcatcttggc 8400
ctggatcttg agattggcga tgttaccgcc cagatcccgc ctggggttca tgttgtgcag 8460
gaccaccagc acggtgtatc cggtgcactt ggggaattta tcatgcaact tggaagggaa 8520
ggcgtgaaag aatttggcga cgcctttgtg cccgcccagg ttttccatgc actcatccat 8580
gatgatggcg atgggcccgt gggcggcggc ctgggcaaag acgtttcggg ggtcggacac 8640
atcatagttg tggtcctggg tgaggtcatc ataggccatt ttaatgaatt tggggcggag 8700
ggtgccggac tgggggacaa aggtaccctc gatcccgggg gcgtagttcc cctcacagat 8760
ctgcatctcc caggctttga gctcggaggg ggggatcatg tccacctgcg gggcgataaa 8820
gaacacggtt tccggggcgg gggagatgag ctgggccgaa agcaagttcc ggagcagctg 8880
ggacttgccg cagccggtgg ggccgtagat gaccccgatg accggctgca ggtggtagtt 8940
gagggagaga cagctgccgt cctcccggag gaggggggcc acctcgttca tcatctcgcg 9000
cacgtgcatg ttctcgcgca ccagttccgc caggaggcgc tctcccccca gggataggag 9060
ctcctggagc gaggcgaagt ttttcagcgg cttgagtccg tcggccatgg gcattttgga 9120
gagggtttgt tgcaagagtt ccaggcggtc ccagagctcg gtgatgtgct ctacggcatc 9180
tcgatccagc agacctcctc gtttcgcggg ttgggacggc tgcgggagta gggcaccaga 9240
cgatgggcgt ccagcgcagc cagggtccgg tccttccagg gtcgcagcgt ccgcgtcagg 9300
gtggtctccg tcacggtgaa ggggtgcgcg ccgggctggg cgcttgcgag ggtgcgcttc 9360
aggctcatcc ggctggtcga aaaccgctcc cgatcggcgc cctgcgcgtc ggccaggtag 9420
caattgacca tgagttcgta gttgagcgcc tcggccgcgt ggcctttggc gcggagctta 9480
cctttggaag tctgcccgca ggcgggacag aggagggact tgagggcgta gagcttgggg 9540
gcgaggaaga cggactcggg ggcgtaggcg tccgcgccgc agtgggcgca gacggtctcg 9600
cactccacga gccaggtgag gtcgggctgg tcggggtcaa aaaccagttt cccgccgttc 9660
tttttgatgc gtttcttacc tttggtctcc atgagctcgt gtccccgctg ggtgacaaag 9720
aggctgtccg tgtccccgta gaccgacttt atgggccggt cctcgagcgg tgtgccgcgg 9780
tcctcctcgt agaggaaccc cgcccactcc gagacgaaag cccgggtcca ggccagcacg 9840
aaggaggcca cgtgggacgg gtagcggtcg ttgtccacca gcgggtccac cttttccagg 9900
gtatgcaaac acatgtcccc ctcgtccaca tccaggaagg tgattggctt gtaagtgtag 9960
gccacgtgac cgggggtccc ggccgggggg gtataaaagg gtgcgggtcc ctgctcgtcc 10020
tcactgtctt ccggatcgct gtccaggagc gccagctgtt ggggtaggta ttccctctcg 10080
aaggcgggca tgacctcggc actcaggttg tcagtttcta gaaacgagga ggatttgata 10140
ttgacggtgc cggcggagat gcctttcaag agcccctcgt ccatctggtc agaaaagacg 10200
atctttttgt tgtcgagctt ggtggcgaag gagccgtaga gggcgttgga gaggagcttg 10260
gcgatggagc gcatggtctg gtttttttcc ttgtcggcgc gctccttggc ggcgatgttg 10320
agctgcacgt actcgcgcgc cacgcacttc cattcgggga agacggtggt cagctcgtcg 10380
ggcacgattc tgacctgcca gccccgatta tgcagggtga tgaggtccac actggtggcc 10440
acctcgccgc gcaggggctc attagtccag cagaggcgtc cgcccttgcg cgagcagaag 10500
gggggcaggg ggtccagcat gacctcgtcg ggggggtcgg catcgatggt gaagatgccg 10560
ggcaggaggt cggggtcaaa gtagctgatg gaagtggcca gatcgtccag ggcagcttgc 10620
cattcgcgca cggccagcgc gcgctcgtag ggactgaggg gcgtgcccca gggcatggga 10680
tgggtaagcg cggaggcgta catgccgcag atgtcgtaga cgtagagggg ctcctcgagg 10740
atgccgatgt aggtggggta gcagcgcccc ccgcggatgc tggcgcgcac gtagtcatac 10800
agctcgtgcg agggggcgag gagccccggg cccaggttgg tgcgactggg cttttcggcg 10860
cggtagacga tctggcggaa aatggcatgc gagttggagg agatggtggg cctttggaag 10920
atgttgaagt gggcgtgggg cagtccgacc gagtcgcgga tgaagtgggc gtaggagtct 10980
tgcagcttgg cgacgagctc ggcggtgact aggacgtcca gagcgcagta gtcgagggtc 11040
tcctggatga tgtcatactt gagctgtccc ttttgtttcc acagctcgcg gttgagaagg 11100
aactcttcgc ggtccttcca gtactcttcg agggggaacc cgtcctgatc tgcacggtaa 11160
gagcctagca tgtagaactg gttgacggcc ttgtaggcgc agcagccctt ctccacgggg 11220
agggcgtagg cctgggcggc cttgcgcagg gaggtgtgcg tgagggcgaa agtgtccctg 11280
accatgacct tgaggaactg gtgcttgaag tcgatatcgt cgcagccccc ctgctcccag 11340
agctggaagt ccgtgcgctt cttgtaggcg gggttgggca aagcgaaagt aacatcgttg 11400
aagaggatct tgcccgcgcg gggcataaag ttgcgagtga tgcggaaagg ttggggcacc 11460
tcggcccggt tgttgatgac ctgggcggcg agcacgatct cgtcgaagcc gttgatgttg 11520
tggcccacga tgtagagttc cacgaatcgc ggacggccct tgacgtgggg cagtttcttg 11580
agctcctcgt aggtgagctc gtcggggtcg ctgagcccgt gctgctcgag cgcccagtcg 11640
gcgagatggg ggttggcgcg gaggaaggaa gtccagagat ccacggccag ggcggtttgc 11700
agacggtccc ggtactgacg gaactgctgc ccgacggcca ttttttcggg ggtgacgcag 11760
tagaaggtgc gggggtcccc gtgccagcga tcccatttga gctggagggc gagatcgagg 11820
gcgagctcga cgagccggtc gtccccggag agtttcatga ccagcatgaa ggggacgagc 11880
tgcttgccga aggaccccat ccaggtgtag gtttccacat cgtaggtgag gaagagcctt 11940
tcggtgcgag gatgcgagcc gatggggaag aactggatct cctgccacca attggaggaa 12000
tggctgttga tgtgatggaa gtagaaatgc cgacggcgcg ccgaacactc gtgcttgtgt 12060
ttatacaagc ggccacagtg ctcgcaacgc tgcacgggat gcacgtgctg cacgagctgt 12120
acctgagttc ctttgacgag gaatttcagt gggaagtgga gtcgtggcgc ctgcatctcg 12180
tgctgtacta cgtcgtggtg gtcggcctgg ccctcttctg cctcgatggt ggtcatgctg 12240
acgagcccgc gcgggaggca ggtccagacc tcggcgcgag cgggtcggag agcgaggacg 12300
agggcgcgca ggccggagct gtccagggtc ctgagacgct gcggagtcag gtcagtgggc 12360
agcggcggcg cgcggttgac ttgcaggagt ttttccaggg cgcgcgggag gtccagatgg 12420
tacttgatct ccaccgcgcc attggtggcg acgtcgatgg cttgcagggt cccgtgcccc 12480
tggggtgtga ccaccgtccc ccgtttcttc ttgggcggct ggggcgacgg gggcggtgcc 12540
tcttccatgg ttagaagcgg cggcgaggac gcgcgccggg cggcaggggc ggctcggggc 12600
ccggaggcag gggcggcagg ggcacgtcgg cgccgcgcgc gggtaggttc tggtactgcg 12660
cccggagaag actggcgtga gcgacgacgc gacggttgac gtcctggatc tgacgcctct 12720
gggtgaaggc cacgggaccc gtgagtttga acctgaaaga gagttcgaca gaatcaatct 12780
cggtatcgtt gacggcggcc tgccgcagga tctcttgcac gtcgcccgag ttgtcctggt 12840
aggcgatctc ggtcatgaac tgctcgatct cctcctcttg aaggtctccg cggccggcgc 12900
gctccacggt ggccgcgagg tcgttggaga tgcggcccat gagctgcgag aaggcgttca 12960
tgcccgcctc gttccagacg cggctgtaga ccacgacgcc ctcgggatcg cgggcgcgca 13020
tgaccacctg ggcgaggttg agctccacgt ggcgcgtgaa gaccgcgtag ttgcagaggc 13080
gctggtagag gtagttgagc gtggtggcga tgtgctcggt gacgaagaaa tacatgatcc 13140
agcggcggag cggcatctcg ctgacgtcgc ccagcgcctc caaacgttcc atggcctcgt 13200
aaaagtccac ggcgaagttg aaaaactggg agttgcgcgc cgagacggtc aactcctcct 13260
ccagaagacg gatgagctcg gcgatggtgg cgcgcacctc gcgctcgaag gcccccggga 13320
gttcctccac ttcctcttct tcctcctcca ctaacatctc ttctacttcc tcctcaggcg 13380
gcagtggtgg cgggggaggg ggcctgcgtc gccggcggcg cacgggcaga cggtcgatga 13440
agcgctcgat ggtctcgccg cgccggcgtc gcatggtctc ggtgacggcg cgcccgtcct 13500
cgcggggccg cagcgtgaag acgccgccgc gcatctccag gtggccgggg gggtccccgt 13560
tgggcaggga gagggcgctg acgatgcatc ttatcaattg ccccgtaggg actccgcgca 13620
aggacctgag cgtctcgaga tccacgggat ctgaaaaccg ctgaacgaag gcttcgagcc 13680
agtcgcagtc gcaaggtagg ctgagcacgg tttcttctgg cgggtcatgt tggttgggag 13740
cggggcgggc gatgctgctg gtgatgaagt tgaaataggc ggttctgaga cggcggatgg 13800
tggcgaggag caccaggtct ttgggcccgg cttgctggat gcgcagacgg tcggccatgc 13860
cccaggcgtg gtcctgacac ctggccaggt ccttgtagta gtcctgcatg agccgctcca 13920
cgggcacctc ctcctcgccc gcgcggccgt gcatgcgcgt gagcccgaag ccgcgctggg 13980
gctggacgag cgccaggtcg gcgacgacgc gctcggcgag gatggcttgc tggatctggg 14040
tgagggtggt ctggaagtca tcaaagtcga cgaagcggtg gtaggctccg gtgttgatgg 14100
tgtaggagca gttggccatg acggaccagt tgacggtctg gtggcccgga cgcacgagct 14160
cgtggtactt gaggcgcgag taggcgcgcg tgtcgaagat gtagtcgttg caggtgcgca 14220
ccaggtactg gtagccgatg aggaagtgcg gcggcggctg gcggtagagc ggccatcgct 14280
cggtggcggg ggcgccgggc gcgaggtcct cgagcatggt gcggtggtag ccgtagatgt 14340
acctggacat ccaggtgatg ccggcggcgg tggtggaggc gcgcgggaac tcgcggacgc 14400
ggttccagat gttgcgcagc ggcaggaagt agttcatggt gggcacggtc tggcccgtga 14460
ggcgcgcgca gtcgtggatg ctctatacgg gcaaaaacga aagcggtcag cggctcgact 14520
ccgtggcctg gaggctaagc gaacgggttg ggctgcgcgt gtaccccggt tcgaatctcg 14580
aatcaggctg gagccgcagc taacgtggta ttggcactcc cgtctcgacc caagcctgca 14640
ccaaccctcc aggatacgga ggcgggtcgt tttgcaactt ttttttggag gccggatgag 14700
actagtaagc gcggaaagcg gccgaccgcg atggctcgct gccgtagtct ggagaagaat 14760
cgccagggtt gcgttgcggt gtgccccggt tcgaggccgg ccggattccg cggctaacga 14820
gggcgtggct gccccgtcgt ttccaagacc ccatagccag ccgacttctc cagttacgga 14880
gcgagcccct cttttgtttt gtttgttttt gccagatgca tcccgtactg cggcagatgc 14940
gcccccacca ccctccaccg caacaacagc cccctccaca gccggcgctt ctgcccccgc 15000
cccagcagca acttccagcc acgaccgccg cggccgccgt gagcggggct ggacagagtt 15060
atgatcacca gctggccttg gaagagggcg aggggctggc gcgcctgggg gcgtcgtcgc 15120
cggagcggca cccgcgcgtg cagatgaaaa gggacgctcg cgaggcctac gtgcccaagc 15180
agaacctgtt cagagacagg agcggcgagg agcccgagga gatgcgcgcg gcccggttcc 15240
acgcggggcg ggagctgcgg cgcggcctgg accgaaagag ggtgctgagg gacgaggatt 15300
tcgaggcgga cgagctgacg gggatcagcc ccgcgcgcgc gcacgtggcc gcggccaacc 15360
tggtcacggc gtacgagcag accgtgaagg aggagagcaa cttccaaaaa tccttcaaca 15420
accacgtgcg caccctgatc gcgcgcgagg aggtgaccct gggcctgatg cacctgtggg 15480
acctgctgga ggccatcgtg cagaacccca ccagcaagcc gctgacggcg cagctgttcc 15540
tggtggtgca gcatagtcgg gacaacgaag cgttcaggga ggcgctgctg aatatcaccg 15600
agcccgaggg ccgctggctc ctggacctgg tgaacattct gcagagcatc gtggtgcagg 15660
agcgcgggct gccgctgtcc gagaagctgg cggccatcaa cttctcggtg ctgagtttgg 15720
gcaagtacta cgctaggaag atctacaaga ccccgtacgt gcccatagac aaggaggtga 15780
agatcgacgg gttttacatg cgcatgaccc tgaaagtgct gaccctgagc gacgatctgg 15840
gggtgtaccg caacgacagg atgcaccgtg cggtgagcgc cagcaggcgg cgcgagctga 15900
gcgaccagga gctgatgcat agtctgcagc gggccctgac cggggccggg accgaggggg 15960
agagctactt tgacatgggc gcggacctgc actggcagcc cagccgccgg gccttggagg 16020
cggcggcagg accctacgta gaagaggtgg acgatgaggt ggacgaggag ggcgagtacc 16080
tggaagactg atggcgcgac cgtatttttg ctagatgcaa caacaacagc cacctcctga 16140
tcccgcgatg cgggcggcgc tgcagagcca gccgtccggc attaactcct cggacgattg 16200
gacccaggcc atgcaacgca tcatggcgct gacgacccgc aaccccgaag cctttagaca 16260
gcagccccag gccaaccggc tctcggccat cctggaggcc gtggtgccct cgcgctccaa 16320
ccccacgcac gagaaggtcc tggccatcgt gaacgcgctg gtggagaaca aggccatccg 16380
cggcgacgag gccggcctgg tgtacaacgc gctgctggag cgcgtggccc gctacaacag 16440
caccaacgtg cagaccaacc tggaccgcat ggtgaccgac gtgcgcgagg ccgtggccca 16500
gcgcgagcgg ttccaccgcg agtccaacct gggatccatg gtggcgctga acgccttcct 16560
cagcacccag cccgccaacg tgccccgggg ccaggaggac tacaccaact tcatcagcgc 16620
cctgcgcctg atggtgaccg aggtgcccca gagcgaggtg taccagtccg ggccggacta 16680
cttcttccag accagtcgcc agggcttgca gaccgtgaac ctgagccagg ctttcaagaa 16740
cttgcagggc ctgtggggcg tgcaggcccc ggtcggggac cgcgcgacgg tgtcgagcct 16800
gctgacgccg aactcgcgcc tgctgctgct gctggtggcc cccttcacgg acagcggcag 16860
catcaaccgc aactcgtacc tgggctacct gattaacctg taccgcgagg ccatcggcca 16920
ggcgcacgtg gacgagcaga cctaccagga gatcacccac gtgagccgcg ccctgggcca 16980
ggacgacccg ggcaacctgg aagccaccct gaactttttg ctgaccaacc ggtcgcagaa 17040
gatcccgccc cagtacgcgc tcagcaccga ggaggagcgc atcctgcgtt acgtgcagca 17100
gagcgtgggc ctgttcctga tgcaggaggg ggccaccccc agcgccgcgc tcgacatgac 17160
cgcgcgcaac atggagccca gcatgtacgc cagcaaccgc ccgttcatca ataaactgat 17220
ggactacttg catcgggcgg ccgccatgaa ctctgactat ttcaccaacg ccatcctgaa 17280
tccccactgg ctcccgccgc cggggttcta cacgggcgag tacgacatgc ccgaccccaa 17340
tgacgggttc ctgtgggacg atgtggacag cagcgtgttc tccccccgac cgggtgctaa 17400
cgagcgcccc ttgtggaaga aggaaggcag cgaccgacgc ccgtcctcgg cgctgtccgg 17460
ccgcgagggt gctgccgcgg cggtgcccga ggccgccagt cctttcccga gcttgccctt 17520
ctcgctgaac agtatccgca gcagcgagct gggcaggatc acgcgcccgc gcttgctggg 17580
cgaagaggag tacttgaatg actcgctgtt gagacccgag cgggagaaga acttccccaa 17640
taacgggata gaaagcctgg tggacaagat gagccgctgg aagacgtatg cgcaggagca 17700
cagggacgat ccccgggcgt cgcagggggc cacgagccgg ggcagcgccg cccgtaaacg 17760
ccggtggcac gacaggcagc ggggacagat gtgggacgat gaggactccg ccgacgacag 17820
cagcgtgttg gacttgggtg ggagtggtaa cccgttcgct cacctgcgcc cccgtatcgg 17880
gcgcatgatg taagagaaac cgaaaataaa tgatactcac caaggccatg gcgaccagcg 17940
tgcgttcgtt tcttctctgt tgttgttgta tctagtatga tgaggcgtgc gtacccggag 18000
ggtcctcctc cctcgtacga gagcgtgatg cagcaggcga tggcggcggc ggcgatgcag 18060
cccccgctgg aggctcctta cgtgcccccg cggtacctgg cgcctacgga ggggcggaac 18120
agcattcgtt actcggagct ggcacccttg tacgatacca cccggttgta cctggtggac 18180
aacaagtcgg cggacatcgc ctcgctgaac taccagaacg accacagcaa cttcctgacc 18240
accgtggtgc agaacaatga cttcaccccc acggaggcca gcacccagac catcaacttt 18300
gacgagcgct cgcggtgggg cggccagctg aaaaccatca tgcacaccaa catgcccaac 18360
gtgaacgagt tcatgtacag caacaagttc aaggcgcggg tgatggtctc ccgcaagacc 18420
cccaatgggg tgacagtgac agaggattat gatggtagtc aggatgagct gaagtatgaa 18480
tgggtggaat ttgagctgcc cgaaggcaac ttctcggtga ccatgaccat cgacctgatg 18540
aacaacgcca tcatcgacaa ttacttggcg gtggggcggc agaacggggt gctggagagc 18600
gacatcggcg tgaagttcga cactaggaac ttcaggctgg gctgggaccc cgtgaccgag 18660
ctggtcatgc ccggggtgta caccaacgag gctttccatc ccgatattgt cttgctgccc 18720
ggctgcgggg tggacttcac cgagagccgc ctcagcaacc tgctgggcat tcgcaagagg 18780
cagcccttcc aggaaggctt ccagatcatg tacgaggatc tggagggggg caacatcccc 18840
gcgctcctgg atgtcgacgc ctatgagaaa agcaaggagg atgcagcagc tgaagcaact 18900
gcagccgtag ctaccgcctc taccgaggtc aggggcgata attttgcaag cgccgcagca 18960
gtggcagcgg ccgaggcggc tgaaaccgaa agtaagatag tcattcagcc ggtggagaag 19020
gatagcaaga acaggagcta caacgtacta ccggacaaga taaacaccgc ctaccgcagc 19080
tggtacctag cctacaacta tggcgacccc gagaagggcg tgcgctcctg gacgctgctc 19140
accacctcgg acgtcacctg cggcgtggag caagtctact ggtcgctgcc cgacatgatg 19200
caagacccgg tcaccttccg ctccacgcgt caagttagca actacccggt ggtgggcgcc 19260
gagctcctgc ccgtctactc caagagcttc ttcaacgagc aggccgtcta ctcgcagcag 19320
ctgcgcgcct tcacctcgct tacgcacgtc ttcaaccgct tccccgagaa ccagatcctc 19380
gtccgcccgc ccgcgcccac cattaccacc gtcagtgaaa acgttcctgc tctcacagat 19440
cacgggaccc tgccgctgcg cagcagtatc cggggagtcc agcgcgtgac cgttactgac 19500
gccagacgcc gcacctgccc ctacgtctac aaggccctgg gcatagtcgc gccgcgcgtc 19560
ctctcgagcc gcaccttcta aatgtccatt ctcatctcgc ccagtaataa caccggttgg 19620
ggcctgcgcg cgcccagcaa gatgtacgga ggcgctcgcc aacgctccac gcaacacccc 19680
gtgcgcgtgc gcgggcactt ccgcgctccc tggggcgccc tcaagggccg cgtgcggtcg 19740
cgcaccaccg tcgacgacgt gatcgaccag gtggtggccg acgcgcgcaa ctacaccccc 19800
gccgccgcgc ccgtctccac cgtggacgcc gtcatcgaca gcgtggtggc cgacgcgcgc 19860
cggtacgccc gcgccaagag ccggcggcgg cgcatcgccc ggcggcaccg gagcaccccc 19920
gccatgcgcg cggcgcgagc cttgctgcgc agggccaggc gcacgggacg cagggccatg 19980
ctcagggcgg ccagacgcgc ggcttcaggc gccagcgccg gcaggacccg gagacgcgcg 20040
gccacggcgg cggcagcggc catcgccagc atgtcccgcc cgcggcgagg gaacgtgtac 20100
tgggtgcgcg acgccgccac cggtgtgcgc gtgcccgtgc gcacccgccc ccctcgcact 20160
tgaagatgtt cacttcgcga tgttgatgtg tcccagcggc gaggaggatg tccaagcgca 20220
aattcaagga agagatgctc caggtcatcg cgcctgagat ctacggccct gcggtggtga 20280
aggaggaaag aaagccccgc aaaatcaagc gggtcaaaaa ggacaaaaag gaagaagaaa 20340
gtgatgtgga cggattggtg gagtttgtgc gcgagttcgc cccccggcgg cgcgtgcagt 20400
ggcgcgggcg gaaggtgcaa ccggtgctga gacccggcac caccgtggtc ttcacgcccg 20460
gcgagcgctc cggcaccgct tccaagcgct cctacgacga ggtgtacggg gatgatgata 20520
ttctggagca ggcggccgag cgcctgggcg agtttgctta cggcaagcgc agccgttccg 20580
caccgaagga agaggcggtg tccatcccgc tggaccacgg caaccccacg ccgagcctca 20640
agcccgtgac cttgcagcag gtgctgccga ccgcggcgcc gcgccggggg ttcaagcgcg 20700
agggcgagga tctgtacccc accatgcagc tgatggtgcc caagcgccag aagctggaag 20760
acgtgctgga gaccatgaag gtggacccgg acgtgcagcc cgaggtcaag gtgcggccca 20820
tcaagcaggt ggccccgggc ctgggcgtgc agaccgtgga catcaagatt cccacggagc 20880
ccatggaaac gcagaccgag cccatgatca agcccagcac cagcaccatg gaggtgcaga 20940
cggatccctg gatgccatcg gctcctagtc gaagaccccg gcgcaagtac ggcgcggcca 21000
gcctgctgat gcccaactac gcgctgcatc cttccatcat ccccacgccg ggctaccgcg 21060
gcacgcgctt ctaccgcggt cataccagca gccgccgccg caagaccacc actcgccgcc 21120
gccgtcgccg caccgccgct gcaaccaccc ctgccgccct ggtgcggaga gtgtaccgcc 21180
gcggccgcgc acctctgacc ctgccgcgcg cgcgctacca cccgagcatc gccatttaaa 21240
ctttcgcctg ctttgcagat caatggccct cacatgccgc cttcgcgttc ccattacggg 21300
ctaccgagga agaaaaccgc gccgtagaag gctggcgggg aacgggatgc gtcgccacca 21360
ccaccggcgg cggcgcgcca tcagcaagcg gttgggggga ggcttcctgc ccgcgctgat 21420
ccccatcatc gccgcggcga tcggggcgat ccccggcatt gcttccgtgg cggtgcaggc 21480
ctctcagcgc cactgagaca cacttggaaa catcttgtaa taaaccaatg gactctgacg 21540
ctcctggtcc tgtgatgtgt tttcgtagac agatggaaga catcaatttt tcgtccctgg 21600
ctccgcgaca cggcacgcgg ccgttcatgg gcacctggag cgacatcggc accagccaac 21660
tgaacggggg cgccttcaat tggagcagtc tctggagcgg gcttaagaat ttcgggtcca 21720
cgcttaaaac ctatggcagc aaggcgtgga acagcaccac agggcaggcg ctgagggata 21780
agctgaaaga gcagaacttc cagcagaagg tggtcgatgg gctcgcctcg ggcatcaacg 21840
gggtggtgga cctggccaac caggccgtgc agcggcagat caacagccgc ctggacccgg 21900
tgccgcccgc cggctccgtg gagatgccgc aggtggagga ggagctgcct cccctggaca 21960
agcggggcga gaagcgaccc cgccccgatg cggaggagac gctgctgacg cacacggacg 22020
agccgccccc gtacgaggag gcggtgaaac tgggtctgcc caccacgcgg cccatcgcgc 22080
ccctggccac cggggtgctg aaacccgaaa agcccgcgac cctggacttg cctcctcccc 22140
agccttcccg cccctctaca gtggctaagc ccctgccgcc ggtggccgtg gcccgcgcgc 22200
gacccggggg caccgcccgc cctcatgcga actggcagag cactctgaac agcatcgtgg 22260
gtctgggagt gcagagtgtg aagcgccgcc gctgctatta aacctaccgt agcgcttaac 22320
ttgcttgtct gtgtgtgtat gtattatgtc gccgccgccg ctgtccacca gaaggaggag 22380
tgaagaggcg cgtcgccgag ttgcaagatg gccaccccat cgatgctgcc ccagtgggcg 22440
tacatgcaca tcgccggaca ggacgcttcg gagtacctga gtccgggtct ggtgcagttt 22500
gcccgcgcca cagacaccta cttcagtctg gggaacaagt ttaggaaccc cacggtggcg 22560
cccacgcacg atgtgaccac cgaccgcagc cagcggctga cgctgcgctt cgtgcccgtg 22620
gaccgcgagg acaacaccta ctcgtacaaa gtgcgctaca cgctggccgt gggcgacaac 22680
cgcgtgctgg acatggccag cacctacttt gacatccgcg gcgtgctgga tcggggccct 22740
agcttcaaac cctactccgg caccgcctac aacagtctgg cccccaaggg agcacccaac 22800
acttgtcagt ggacatataa agccgatggt gaaactgcca cagaaaaaac ctatacatat 22860
ggaaatgcac ccgtgcaggg cattaacatc acaaaagatg gtattcaact tggaactgac 22920
accgatgatc agccaatcta cgcagataaa acctatcagc ctgaacctca agtgggtgat 22980
gctgaatggc atgacatcac tggtactgat gaaaagtatg gaggcagagc tcttaagcct 23040
gataccaaaa tgaagccttg ttatggttct tttgccaagc ctactaataa agaaggaggt 23100
caggcaaatg tgaaaacagg aacaggcact actaaagaat atgacataga catggctttc 23160
tttgacaaca gaagtgcggc tgctgctggc ctagctccag aaattgtttt gtatactgaa 23220
aatgtggatt tggaaactcc agatacccat attgtataca aagcaggcac agatgacagc 23280
agctcttcta ttaatttggg tcagcaagcc atgcccaaca gacctaacta cattggtttc 23340
agagacaact ttatcgggct catgtactac aacagcactg gcaatatggg ggtgctggcc 23400
ggtcaggctt ctcagctgaa tgctgtggtt gacttgcaag acagaaacac cgagctgtcc 23460
taccagctct tgcttgactc tctgggtgac agaacccggt atttcagtat gtggaatcag 23520
gcggtggaca gctatgatcc tgatgtgcgc attattgaaa atcatggtgt ggaggatgaa 23580
cttcccaact attgtttccc tctggatgct gttggcagaa cagatactta tcagggaatt 23640
aaggctaatg gaactgatca aaccacatgg accaaagatg acagtgtcaa tgatgctaat 23700
gagataggca agggtaatcc attcgccatg gaaatcaaca tccaagccaa cctgtggagg 23760
aacttcctct acgccaacgt ggccctgtac ctgcccgact cttacaagta cacgccggcc 23820
aatgttaccc tgcccaccaa caccaacacc tacgattaca tgaacggccg ggtggtggcg 23880
ccctcgctgg tggactccta catcaacatc ggggcgcgct ggtcgctgga tcccatggac 23940
aacgtgaacc ccttcaacca ccaccgcaat gcggggctgc gctaccgctc catgctcctg 24000
ggcaacgggc gctacgtgcc cttccacatc caggtgcccc agaaattttt cgccatcaag 24060
agcctcctgc tcctgcccgg gtcctacacc tacgagtgga acttccgcaa ggacgtcaac 24120
atgatcctgc agagctccct cggcaacgac ctgcgcacgg acggggcctc catctccttc 24180
accagcatca acctctacgc caccttcttc cccatggcgc acaacacggc ctccacgctc 24240
gaggccatgc tgcgcaacga caccaacgac cagtccttca acgactacct ctcggcggcc 24300
aacatgctct accccatccc ggccaacgcc accaacgtgc ccatctccat cccctcgcgc 24360
aactgggccg ccttccgcgg ctggtccttc acgcgtctca agaccaagga gacgccctcg 24420
ctgggctccg ggttcgaccc ctacttcgtc tactcgggct ccatccccta cctcgacggc 24480
accttctacc tcaaccacac cttcaagaag gtctccatca ccttcgactc ctccgtcagc 24540
tggcccggca acgaccggct cctgacgccc aacgagttcg aaatcaagcg caccgtcgac 24600
ggcgagggct acaacgtggc ccagtgcaac atgaccaagg actggttcct ggtccagatg 24660
ctggcccact acaacatcgg ctaccagggc ttctacgtgc ccgagggcta caaggaccgc 24720
atgtactcct tcttccgcaa cttccagccc atgagccgcc aggtggtgga cgaggtcaac 24780
tacaaggact accaggccgt caccctggcc taccagcaca acaactcggg cttcgtcggc 24840
tacctcgcgc ccaccatgcg ccagggccag ccctaccccg ccaactaccc ctacccgctc 24900
atcggcaaga gcgccgtcac cagcgtcacc cagaaaaagt tcctctgcga cagggtcatg 24960
tggcgcatcc ccttctccag caacttcatg tccatgggcg cgctcaccga cctcggccag 25020
aacatgctct atgccaactc cgcccacgcg ctagacatga atttcgaagt cgaccccatg 25080
gatgagtcca cccttctcta tgttgtcttc gaagtcttcg acgtcgtccg agtgcaccag 25140
ccccaccgcg gcgtcatcga ggccgtctac ctgcgcaccc ccttctcggc cggtaacgcc 25200
accacctaag ctcttgcttc ttgcaagcca tggccgcggg ctccggcgag caggagctca 25260
gggccatcat ccgcgacctg ggctgcgggc cctacttcct gggcaccttc gataagcgct 25320
tcccgggatt catggccccg cacaagctgg cctgcgccat cgtcaacacg gccggccgcg 25380
agaccggggg cgagcactgg ctggccttcg cctggaaccc gcgctcgaac acctgctacc 25440
tcttcgaccc cttcgggttc tcggacgagc gcctcaagca gatctaccag ttcgagtacg 25500
agggcctgct gcgccgcagc gccctggcca ccgaggaccg ctgcgtcacc ctggaaaagt 25560
ccacccagac cgtgcagggt ccgcgctcgg ccgcctgcgg gctcttctgc tgcatgttcc 25620
tgcacgcctt cgtgcactgg cccgaccgcc ccatggacaa gaaccccacc atgaacttgc 25680
tgacgggggt gcccaacggc atgctccagt cgccccaggt ggaacccacc ctgcgccgca 25740
accaggaggc gctctaccgc ttcctcaact cccactccgc ctactttcgc tcccaccgcg 25800
cgcgcatcga gaaggccacc gccttcgacc gcatgaatca agacatgtaa accgtgtgtg 25860
tatgttaaat gtctttaata aacagcactt tcatgttaca catgcatctg agatgattta 25920
tttagaaatc gaaagggttc tgccgggtct cggcatggcc cgcgggcagg gacacgttgc 25980
ggaactggta cttggccagc cacttgaact cggggatcag cagtttgggc agcggggtgt 26040
cggggaagga gtcggtccac agcttccgcg tcagttgcag ggcgcccagc aggtcgggcg 26100
cggagatctt gaaatcgcag ttgggacccg cgttctgcgc gcgggagttg cggtacacgg 26160
ggttgcagca ctggaacacc atcagggccg ggtgcttcac gctcgccagc accgtcgcgt 26220
cggtgatgct ctccacgtcg aggtcctcgg cgttggccat cccgaagggg gtcatcttgc 26280
aggtctgcct tcccatggtg ggcacgcacc cgggcttgtg gttgcaatcg cagtgcaggg 26340
ggatcagcat catctgggcc tggtcggcgt tcatccccgg gtacatggcc ttcatgaaag 26400
cctccaattg cctgaacgcc tgctgggcct tggctccctc ggtgaagaag accccgcagg 26460
acttgctaga gaactggttg gtggcgcacc cggcgtcgtg cacgcagcag cgcgcgtcgt 26520
tgttggccag ctgcaccacg ctgcgccccc agcggttctg ggtgatcttg gcccggtcgg 26580
ggttctcctt cagcgcgcgc tgcccgttct cgctcgccac atccatctcg atcatgtgct 26640
ccttctggat catggtggtc ccgtgcaggc accgcagctt gccctcggcc tcggtgcacc 26700
cgtgcagcca cagcgcgcac ccggtgcact cccagttctt gtgggcgatc tgggaatgcg 26760
cgtgcacgaa gccctgcagg aagcggccca tcatggtggt cagggtcttg ttgctagtga 26820
aggtcagcgg aatgccgcgg tgctcctcgt tgatgtacag gtggcagatg cggcggtaca 26880
cctcgccctg ctcgggcatc agctggaagt tggctttcag gtcggtctcc acgcggtagc 26940
ggtccatcag catagtcatg atttccatac ccttctccca ggccgagacg atgggcaggc 27000
tcatagggtt cttcaccatc atcttagcgc tagcagccgc ggccaggggg tcgctctcgt 27060
ccagggtctc aaagctccgc ttgccgtcct tctcggtgat ccgcaccggg gggtagctga 27120
agcccacggc cgccagctcc tcctcggcct gtctttcgtc ctcgctgtcc tggctgacgt 27180
cctgcaggac cacatgcttg gtcttgcggg gtttcttctt gggcggcagc ggcggcggag 27240
atgttggaga tggcgagggg gagcgcgagt tctcgctcac cactactatc tcttcctctt 27300
cttggtccga ggccacgcgg cggtaggtat gtctcttcgg gggcagaggc ggaggcgacg 27360
ggctctcgcc gccgcgactt ggcggatggc tggcagagcc ccttccgcgt tcgggggtgc 27420
gctcccggcg gcgctctgac tgacttcctc cgcggccggc cattgtgttc tcctagggag 27480
gaacaacaag catggagact cagccatcgc caacctcgcc atctgccccc accgccgacg 27540
agaagcagca gcagcagaat gaaagcttaa ccgccccgcc gcccagcccc gccacctccg 27600
acgcggccgt cccagacatg caagagatgg aggaatccat cgagattgac ctgggctatg 27660
tgacgcccgc ggagcacgag gaggagctgg cagtgcgctt ttcacaagaa gagatacacc 27720
aagaacagcc agagcaggaa gcagagaatg agcagagtca ggctgggctc gagcatgacg 27780
gcgactacct ccacctgagc gggggggagg acgcgctcat caagcatctg gcccggcagg 27840
ccaccatcgt caaggatgcg ctgctcgacc gcaccgaggt gcccctcagc gtggaggagc 27900
tcagccgcgc ctacgagttg aacctcttct cgccgcgcgt gccccccaag cgccagccca 27960
atggcacctg cgagcccaac ccgcgcctca acttctaccc ggtcttcgcg gtgcccgagg 28020
ccctggccac ctaccacatc tttttcaaga accaaaagat ccccgtctcc tgccgcgcca 28080
accgcacccg cgccgacgcc cttttcaacc tgggtcccgg cgcccgccta cctgatatcg 28140
cctccttgga agaggttccc aagatcttcg agggtctggg cagcgacgag actcgggccg 28200
cgaacgctct gcaaggagaa ggaggagagc atgagcacca cagcgccctg gtcgagttgg 28260
aaggcgacaa cgcgcggctg gcggtgctca aacgcacggt cgagctgacc catttcgcct 28320
acccggctct gaacctgccc cccaaagtca tgagcgcggt catggaccag gtgctcatca 28380
agcgcgcgtc gcccatctcc gaggacgagg gcatgcaaga ctccgaggag ggcaagcccg 28440
tggtcagcga cgagcagctg gcccggtggc tgggtcctaa tgctagtccc cagagtttgg 28500
aagagcggcg caaactcatg atggccgtgg tcctggtgac cgtggagctg gagtgcctgc 28560
gccgcttctt cgccgacgcg gagaccctgc gcaaggtcga ggagaacctg cactacctct 28620
tcaggcacgg gttcgtgcgc caggcctgca agatctccaa cgtggagctg accaacctgg 28680
tctcctacat gggcatcttg cacgagaacc gcctggggca gaacgtgctg cacaccaccc 28740
tgcgcgggga ggcccggcgc gactacatcc gcgactgcgt ctacctctac ctctgccaca 28800
cctggcagac gggcatgggc gtgtggcagc agtgtctgga ggagcagaac ctgaaagagc 28860
tctgcaagct cctgcagaag aacctcaagg gtctgtggac cgggttcgac gagcgcacca 28920
ccgcctcgga cctggccgac ctcattttcc ccgagcgcct caggctgacg ctgcgcaacg 28980
gcctgcccga ctttatgagc caaagcatgt tgcaaaactt tcgctctttc atcctcgaac 29040
gctccggaat cctgcccgcc acctgctccg cgctgccctc ggacttcgtg ccgctgacct 29100
tccgcgagtg ccccccgccg ctgtggagcc actgctacct gctgcgcctg gccaactacc 29160
tggcctacca ctcggacgtg atcgaggacg tcagcggcga gggcctgctc gagtgccact 29220
gccgctgcaa cctctgcacg ccgcaccgct ccctggcctg caacccccag ctgctgagcg 29280
agacccagat catcggcacc ttcgagttgc aagggcccag cgaaggcgag ggttcagccg 29340
ccaagggggg tctgaaactc accccggggc tgtggacctc ggcctacttg cgcaagttcg 29400
tgcccgagga ctaccatccc ttcgagatca ggttctacga ggaccaatcc catccgccca 29460
aggccgagct gtcggcctgc gtcatcaccc agggggcgat cctggcccaa ttgcaagcca 29520
tccagaaatc ccgccaagaa ttcttgctga aaaagggccg cggggtctac ctcgaccccc 29580
agaccggtga ggagctcaac cccggcttcc cccaggatgc cccgaggaaa caagaagctg 29640
aaagtggagc tgccgcccgt ggaggatttg gaggaagact gggagaacag cagtcaggca 29700
gaggaggagg agatggagga agactgggac agcactcagg cagaggagga cagcctgcaa 29760
gacagtctgg aggaagacga ggaggaggca gaggaggagg tggaagaagc agccgccgcc 29820
agaccgtcgt cctcggcggg ggagaaagca agcagcacgg ataccatctc cgctccgggt 29880
cggggtcccg ctcgaccaca cagtagatgg gacgagaccg gacgattccc gaaccccacc 29940
acccagaccg gtaagaagga gcggcaggga tacaagtcct ggcgggggca caaaaacgcc 30000
atcgtctcct gcttgcaggc ctgcgggggc aacatctcct tcacccggcg ctacctgctc 30060
ttccaccgcg gggtgaactt tccccgcaac atcttgcatt actaccgtca cctccacagc 30120
ccctactact tccaagaaga ggcagcagca gcagaaaaag accagcagaa aaccagcagc 30180
tagaaaatcc acagcggcgg cagcaggtgg actgaggatc gcggcgaacg agccggcgca 30240
aacccgggag ctgaggaacc ggatctttcc caccctctat gccatcttcc agcagagtcg 30300
ggggcaggag caggaactga aagtcaagaa ccgttctctg cgctcgctca cccgcagttg 30360
tctgtatcac aagagcgaag accaacttca gcgcactctc gaggacgccg aggctctctt 30420
caacaagtac tgcgcgctca ctcttaaaga gtagcccgcg cccgcccagt cgcagaaaaa 30480
ggcgggaatt acgtcacctg tgcccttcgc cctagccgcc tccacccatc atcatgagca 30540
aagagattcc cacgccttac atgtggagct accagcccca gatgggcctg gccgccggtg 30600
ccgcccagga ctactccacc cgcatgaatt ggctcagcgc cgggcccgcg atgatctcac 30660
gggtgaatga catccgcgcc caccgaaacc agatactcct agaacagtca gcgctcaccg 30720
ccacgccccg caatcacctc aatccgcgta attggcccgc cgccctggtg taccaggaaa 30780
ttccccagcc cacgaccgta ctacttccgc gagacgccca ggccgaagtc cagctgacta 30840
actcaggtgt ccagctggcg ggcggcgcca ccctgtgtcg tcaccgcccc gctcagggta 30900
taaagcggct ggtgatccgg ggcagaggca cacagctcaa cgacgaggtg gtgagctctt 30960
cgctgggtct gcgacctgac ggagtcttcc aactcgccgg atcggggaga tcttccttca 31020
cgcctcgtca ggccgtcctg actttggaga gttcgtcctc gcagccccgc tcgggtggca 31080
tcggcactct ccagttcgtg gaggagttca ctccctcggt ctacttcaac cccttctccg 31140
gctcccccgg ccactacccg gacgagttca tcccgaactt cgacgccatc agcgagtcgg 31200
tggacggcta cgattgaatg tcccatggtg gcgcagctga cctagctcgg cttcgacacc 31260
tggaccactg ccgccgcttc cgctgcttcg ctcgggatct cgccgagttt gcctactttg 31320
agctgcccga ggagcaccct cagggcccgg cccacggagt gcggatcgtc gtcgaagggg 31380
gcctcgactc ccacctgctt cggatcttca gccagcgtcc gatcctggtc gagcgcgagc 31440
aaggacagac ccttctgact ctgtactgca tctgcaacca ccccggcctg catgaaagtc 31500
tttgttgtct gctgtgtact gagtataata aaagctgaga tcagcgacta ctccggactt 31560
ccgtgtgttt aaactcaccc ccttatccag tgaaataaag atcatattga tgatgatttt 31620
acagaaataa aaaataatca tttgatttga aataaagata caatcatatt gatgatttga 31680
gtttaacaaa aaaataaaga atcacttact tgaaatctga taccaggtct ctgtccatgt 31740
tttctgccaa caccacttca ctcccctctt cccagctctg gtactgcagg ccccggcggg 31800
ctgcaaactt cctccacacg ctgaagggga tgtcaaattc ctcctgtccc tcaatcttca 31860
ttttatcttc tatcagatgt ccaaaaagcg cgtccgggtg gatgatgact tcgaccccgt 31920
ctacccctac gatgcagaca acgcaccgac cgtgcccttc atcaaccccc ccttcgtctc 31980
ttcagatgga ttccaagaga agcccctggg ggtgttgtcc ctgcgactgg ccgaccccgt 32040
caccaccaag aacggggaaa tcaccctcaa gctgggagag ggggtggacc tcgattcctc 32100
gggaaaactc atctccaaca cggccaccaa ggccgccgcc cctctcagtt tttccaacaa 32160
caccatttcc cttaacatgg atcacccctt ttacactaaa gatggaaaat tatccttaca 32220
agtttctcca ccattaaata tactgagaac aagcattcta aacacactag ctttaggttt 32280
tggatcaggt ttaggactcc gtggctctgc cttggcagta cagttagtct ctccacttac 32340
atttgatact gatggaaaca taaagcttac cttagacaga ggtttgcatg ttacaacagg 32400
agatgcaatt gaaagcaaca taagctgggc taaaggttta aaatttgaag atggagccat 32460
agcaaccaac attggaaatg ggttagagtt tggaagcagt agtacagaaa caggtgttga 32520
tgatgcttac ccaatccaag ttaaacttgg atctggcctt agctttgaca gtacaggagc 32580
cataatggct ggtaacaaag aagacgataa actcactttg tggacaacac ctgatccatc 32640
accaaactgt caaatactcg cagaaaatga tgcaaaacta acactttgct tgactaaatg 32700
tggtagtcaa atactggcca ctgtgtcagt cttagttgta ggaagtggaa acctaaaccc 32760
cattactggc accgtaagca gtgctcaggt gtttctacgt tttgatgcaa acggtgttct 32820
tttaacagaa cattctacac taaaaaaata ctgggggtat aggcagggag atagcataga 32880
tggcactcca tataccaatg ctgtaggatt catgcccaat ttaaaagctt atccaaagtc 32940
acaaagttct actactaaaa ataatatagt agggcaagta tacatgaatg gagatgtttc 33000
aaaacctatg cttctcacta taaccctcaa tggtactgat gacagcaaca gtacatattc 33060
aatgtcattt tcatacacct ggactaatgg aagctatgtt ggagcaacat ttggggctaa 33120
ctcttatacc ttctcataca tcgcccaaga atgaacactg tatcccaccc tgcatgccaa 33180
cccttcccac cccactctgt ggaacaaact ctgaaacaca aaataaaata aagttcaagt 33240
gttttattga ttcaacagtt ttacaggatt cgagcagtta tttttcctcc accctcccag 33300
gacatggaat acaccaccct ctccccccgc acagccttga acatctgaat gccattggtg 33360
atggacatgc ttttggtctc cacgttccac acagtttcag agcgagccag tctcgggtcg 33420
gtcagggaga tgaaaccctc cgggcactcc cgcatctgca cctcacagct caacagctga 33480
ggattgtcct cggtggtcgg gatcacggtt atctggaaga agcagaagag cggcggtggg 33540
aatcatagtc cgcgaacggg atcggccggt ggtgtcgcat caggccccgc agcagtcgct 33600
gccgccgccg ctccgtcaag ctgctgctca gggggtccgg gtccagggac tccctcagca 33660
tgatgcccac ggccctcagc atcagtcgtc tggtgcggcg ggcgcagcag cgcatgcgga 33720
tctcgctcag gtcgctgcag tacgtgcaac acagaaccac caggttgttc aacagtccat 33780
agttcaacac gctccagccg aaactcatcg cgggaaggat gctacccacg tggccgtcgt 33840
accagatcct caggtaaatc aagtggtgcc ccctccagaa cacgctgccc acgtacatga 33900
tctccttggg catgtggcgg ttcaccacct cccggtacca catcaccctc tggttgaaca 33960
tgcagccccg gatgatcctg cggaaccaca gggccagcac cgccccgccc gccatgcagc 34020
gaagagaccc cgggtcccgg caatggcaat ggaggaccca ccgctcgtac ccgtggatca 34080
tctgggagct gaacaagtct atgttggcac agcacaggca tatgctcatg catctcttca 34140
gcactctcaa ctcctcgggg gtcaaaacca tatcccaggg cacggggaac tcttgcagga 34200
cagcgaaccc cgcagaacag ggcaatcctc gcacagaact tacattgtgc atggacaggg 34260
tatcgcaatc aggcagcacc gggtgatcct ccaccagaga agcgcgggtc tcggtctcct 34320
cacagcgtgg taagggggcc ggccgatacg ggtgatggcg ggacgcggct gatcgtgttc 34380
gcgaccgtgt catgatgcag ttgctttcgg acattttcgt acttgctgta gcagaacctg 34440
gtccgggcgc tgcacaccga tcgccggcgg cggtctcggc gcttggaacg ctcggtgttg 34500
aaattgtaaa acagccactc tctcagaccg tgcagcagat ctagggcctc aggagtgatg 34560
aagatcccat catgcctgat ggctctgatc acatcgacca ccgtggaatg ggccagaccc 34620
agccagatga tgcaattttg ttgggtttcg gtgacggcgg gggagggaag aacaggaaga 34680
accatgatta acttttaatc caaacggtct cggagtactt caaaatgaag atcgcggaga 34740
tggcacctct cgcccccgct gtgttggtgg aaaataacag ccaggtcaaa ggtgatacgg 34800
ttctcgagat gttccacggt ggcttccagc aaagcctcca cgcgcacatc cagaaacaag 34860
acaatagcga aagcgggagg gttctctaat tcctcaatca tcatgttaca ctcctgcacc 34920
atccccagat aattttcatt tttccagcct tgaatgattc gaactagttc ctgaggtaaa 34980
tccaagccag ccatgataaa gagctcgcgc agagcgccct ccaccggcat tcttaagcac 35040
accctcataa ttccaagata ttctgctcct ggttcacctg cagcagattg acaagcggaa 35100
tatcaaaatc tctgccgcga tccctgagct cctccctcag caataactgt aagtactctt 35160
tcatatcctc tccgaaattt ttagccatag gaccaccagg aataagatta gggcaagcca 35220
cagtacagat aaaccgaagt cctccccagt gagcattgcc aaatgcaaga ctgctataag 35280
catgctggct agacccggtg atatcttcca gataactgga cagaaaatcg cccaggcaat 35340
ttttaagaaa atcaacaaaa gaaaaatcct ccaggtggac gtttagagcc tcgggaacaa 35400
cgatgaagta aatgcaagcg gtgcgttcca gcatggttag ttagctgatc tgtagaaaaa 35460
acaaaaatga acattaaacc atgctagcct ggcgaacagg tgggtaaatc gttctctcca 35520
gcaccaggca ggccacgggg tctccggcgc gaccctcgta aaaattgtcg ctatgattga 35580
aaaccatcac agagagacgt tcccggtggc cggcgtgaat gattcgacaa gatgaataca 35640
cccccggaac attggcgtcc gcgagtgaaa aaaagcgccc gaggaagcaa taaggcacta 35700
caatgctcag tctcaagtcc agcaaagcga tgccatgcgg atgaagcaca aaattctcag 35760
gtgcgtacaa aatgtaatta ctcccctcct gcacaggcag caaagccccc gatccctcca 35820
ggtacacata caaagcctca gcgtccatag cttaccgagc agcagcacac aacaggcgca 35880
agagtcagag aaaggctgag ctctaacctg tccacccgct ctctgctcaa tatatagccc 35940
agatctacac tgacgtaaag gccaaagtct aaaaataccc gccaaataat cacacacgcc 36000
cagcacacgc ccagaaaccg gtgacacact caaaaaaata cgcgcacttc ctcaaacgcc 36060
caaaactgcc gtcatttccg ggttcccacg ctacgtcatc aaaacacgac tttcaaattc 36120
cgtcgaccgt taaaaacgtc acccgccccg cccctaacgg tcgcccgtct ctcagccaat 36180
cagcgccccg catccccaaa ttcaaacacc tcatttgcat attaacgcgc acaaaaagtt 36240
tgaggtatat tattgatgat gg 36262
<210> 61
<211> 9771
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 61
ggcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 60
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 120
agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 180
tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg 240
tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat 300
ggcaaaagct tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca 360
tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 420
aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcaaatgcaa ccggcgcagg 480
aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg 540
aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 600
aaatgcttga tggtcggaag aggcataaat tccgtcagcc agtttagtct gaccatctca 660
tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg 720
ggcttcccat acaatcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat 780
ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga gcaagacgtt 840
tcccgttgaa tatggctcat aacacccctt gtattactgt ttatgtaagc agacaggtcg 900
acaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg tacatttata 960
ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt tattaatagt 1020
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 1080
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 1140
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 1200
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt ccgcccccta 1260
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttacggg 1320
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 1380
tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 1440
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 1500
gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 1560
atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 1620
ttgacctcca tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 1680
gaacgcggat tccccgtgcc aagagtgact caccgtccgg atctcagcaa gcaggtatgt 1740
actctccagg gtgggcctgg cttccccagt caagactcca gggatttgag ggacgctgtg 1800
ggctcttctc ttacatgtac cttttgcttg cctcaaccct gactatcttc caggtcagga 1860
tcccagagtc aggggtctgt attttcctgc tggtggctcc agttcaggaa cagtaaaccc 1920
tgctccgaat attgcctctc acatctcgtc aatctccgcg aggactgggg accctgtgac 1980
gaacatggct agcggagctg ccccggagcc ggagaggacc cccgttggcc agggatcgtg 2040
ggcccatccg ggacgcacca ggggaccatc cgacagggga ttctgtgtgg tgtcaccggc 2100
caggccagca gaagaggcaa ccagcctcga gggagcgttg tctggaacca gacattccca 2160
cccgtcggtg ggccggcagc accacgcggg accaccgtcc acttccagac cgccacggcc 2220
atgggacacc ccttgcccgc ctgtgtatgc cgagactaaa cacttcctgt actcatccgg 2280
agacaaggaa cagcttcggc cgtccttcct cctgtcgtcg ctcagaccga gcctgaccgg 2340
agcacgcaga ttggtggaaa ctatcttcct tgggtcacgt ccgtggatgc caggtacccc 2400
acggcgcctc ccgcgcctcc cacagagata ctggcagatg cggcctctgt tcctggaatt 2460
gctgggaaac cacgctcagt gcccgtacgg agtcctgctc aagactcact gccctctgag 2520
ggcggcggtc actccggcgg ccggagtgtg cgcacgggag aagccccagg gaagcgtggc 2580
agctccggaa gaggaggaca ccgatccgcg ccgcctcgtg caacttctgc gccagcactc 2640
ctcgccctgg caagtctacg ggttcgtccg cgcctgcctg cgccgcctgg tgccgcctgg 2700
gctctggggt tcccggcata acgagcgccg cttcctgaga aatactaaga agtttatctc 2760
acttggaaaa catgccaagt tgtcgctgca agaactcacg tggaagatgt cagtccgcga 2820
ttgcgcctgg ctgcgccgct cgccgggcgt cgggtgtgtt ccagctgcag aacaccgcct 2880
gagagaagaa attctggcca aatttctgca ttggctgatg tcagtgtacg tggtcgagct 2940
gctgcgctcc tttttctacg tcactgagac tacctttcaa aagaaccgcc tgttcttcta 3000
ccgcaaatct gtgtggagca agctgcagtc aatcggcatt cgccagcatc tgaagagggt 3060
gcagctgcgg gaactttccg aggcagaagt ccgccagcac cgggaggccc ggccggcgct 3120
tctcacgtcg cgtctgagat tcatcccaaa gcccgacggg ctgaggccta tcgtcaacat 3180
ggattacgtc gtgggcgctc gcacctttcg ccgtgaaaag cgggccgaac gcttgacctc 3240
acgggtgaag gccctcttct ccgtgctgaa ctacgagaga gcaagacggc ctggcctgct 3300
gggagcttcg gtgctgggac tggacgatat ccaccgggct tggcggacct ttgttctccg 3360
ggtgagagcc caagaccctc cgccggaact gtacttcgtg aaggtggcga tcaccggagc 3420
ctatgatact attccgcaag atcgactcac cgaagtcatc gcctcgatca tcaaaccgca 3480
gaacacttac tgcgtcaggc ggtacgccgt ggtccagaag gccgcgcatg gccacgtgag 3540
aaaggcgttc aagtcgcacg tgtccactct caccgacctc cagccttaca tgaggcaatt 3600
cgttgcgcat ttgcaagaga cttcgcccct gagagatgcg gtggtcatcg agcagagctc 3660
cagcctgaac gaagcgagca gcggtctgtt tgacgtgttc ctccgcttca tgtgtcatca 3720
cgcggtgcga atcaggggaa aatcatacgt gcagtgccag ggaatcccac aaggcagcat 3780
tctgtcgact ctcttgtgtt ccctttgcta cggcgatatg gaaaacaagc tgttcgctgg 3840
gatcagacgg gacgggttgc tgctcagact ggtggacgac ttcctgctgg tgactccgca 3900
cctcactcac gccaaaacct ttctccgcac tctggtgagg ggagtgccag aatacggctg 3960
tgtggtcaat ctccggaaaa ctgtggtgaa tttccctgtc gaggatgagg cactcggagg 4020
aaccgcattt gtccaaatgc cagcacatgg cctgttccca tggtgcggtc tgctgctgga 4080
cacccgaact cttgaagtgc agtccgacta ctccagctat gcccggacga gcatccgcgc 4140
cagcctcact ttcaatcgcg gctttaaggc cggacgaaac atgcgcagaa agcttttcgg 4200
agtcctccgg cttaaatgcc attcgctctt tctcgatctc caagtcaatt cgctgcagac 4260
cgtgtgcacg aacatctaca agatcctgct gctccaagcc taccggttcc acgcttgcgt 4320
gcttcagctg ccgtttcacc aacaggtgtg gaagaacccg accttctttc tgcgggtcat 4380
tagcgatact gcctccctgt gttactcaat cctcaaggca aagaacgccg gaatgtcgct 4440
gggtgcgaaa ggagccgcgg gacctcttcc tagcgaagcg gtgcagtggc tctgccacca 4500
ggctttcctc ctgaagctga ccaggcacag agtgacctac gtcccgctgc tgggctcgct 4560
gcgcactgca cagacccagc tgtctagaaa actccccggc accaccctga ccgctctgga 4620
agccgccgcc aacccagcat tgccgtcaga tttcaagacc atcttggacg gatccggcac 4680
aatcctgtct gagggcgcca ccaacttcag cctgctgaaa ctggccggcg acgtggaact 4740
gaaccctggc cctacccctg gaacccagag ccccttcttc cttctgctgc tgctgaccgt 4800
gctgactgtc gtgacaggct ctggccacgc cagctctaca cctggcggcg agaaagagac 4860
aagcgccacc cagagaagca gcgtgccaag cagcaccgag aagaacgccg tgtccatgac 4920
cagctccgtg ctgagcagcc actctcctgg cagcggcagc agcacaacac agggccagga 4980
tgtgacactg gcccctgcca cagaacctgc ctctggatct gccgccacct ggggacagga 5040
cgtgacaagc gtgccagtga ccagacctgc cctgggctct acaacacccc ctgcccacga 5100
tgtgaccagc gcccctgata acaagcctgc ccctggaagc acagcccctc cagctcatgg 5160
cgtgacctct gccccagata ccagaccagc cccaggatct acagccccac ccgcacacgg 5220
cgtgacaagt gcccctgaca caagacccgc tccaggctct actgctcctc ctgcccatgg 5280
cgtgacaagc gctcccgata caaggccagc tcctggctcc acagcaccac cagcacatgg 5340
cgtgacatca gctcccgaca ctagacctgc tcccggatca accgctccac cagctcacgg 5400
cgtgaccagc gcacctgata ccagacctgc tctgggaagc accgcccctc ccgtgcacaa 5460
tgtgacatct gcttccggca gcgccagcgg ctctgcctct acactggtgc acaacggcac 5520
cagcgccaga gccacaacaa ccccagccag caagagcacc cccttcagca tccctagcca 5580
ccacagcgac acccctacca cactggccag ccactccacc aagaccgatg cctctagcac 5640
ccaccactcc agcgtgcccc ctctgaccag cagcaaccac agcacaagcc cccagctgtc 5700
taccggcgtc tcattcttct ttctgtcctt ccacatcagc aacctgcagt tcaacagcag 5760
cctggaagat cccagcaccg actactacca ggaactgcag cgggatatca gcgagatgtt 5820
cctgcaaatc tacaagcagg gcggcttcct gggcctgagc aacatcaagt tcagacccgg 5880
cagcgtggtg gtgcagctga ccctggcttt ccgggaaggc accatcaacg tgcacgacgt 5940
ggaaacccag ttcaaccagt acaagaccga ggccgccagc cggtacaacc tgaccatctc 6000
cgatgtgtcc gtgtccgacg tgcccttccc attctctgcc cagtctggcg caggcgtgcc 6060
aggatgggga attgctctgc tggtgctcgt gtgcgtgctg gtggccctgg ccatcgtgta 6120
tctgattgcc ctggccgtgt gccagtgccg gcggaagaat tacggccagc tggacatctt 6180
ccccgccaga gacacctacc accccatgag cgagtacccc acataccaca cccacggcag 6240
atacgtgcca cccagctcca ccgacagatc cccctacgag aaagtgtctg ccggcaacgg 6300
cggcagctcc ctgagctaca caaatcctgc cgtggccgct gcctccgcca acctgggatc 6360
cggcagaatc ttcaacgccc actacgccgg ctacttcgcc gacctgctga tccacgacat 6420
cgagacaaac cctggccccg aatcgccaag cgcaccccct catcggtggt gcatcccttg 6480
gcaacgcctc ctcctgaccg cctcactgct gactttctgg aacccgccga ccaccgcaaa 6540
gctgaccatt gagagcactc ccttcaacgt ggctgagggg aaggaggtgc tgctcctggt 6600
gcacaatctg ccccagcacc tgttcgggta ctcctggtac aagggagaac gcgtggacgg 6660
gaaccggcag atcataggct acgtcatcgg aacccagcag gccacacccg gtccagcgta 6720
cagcggccgg gagattatct acccgaacgc ctccctgctg atccaaaaca tcatccagaa 6780
cgacaccggt ttctacactc tgcacgtgat taagtcagat ctggtcaacg aagaggccac 6840
cggccaattc agggtgtacc ccgaactccc taagccgttc atcacctcga acaacagcaa 6900
cccggtcgag gatgaagatg cggtggcctt gacgtgcgaa cctgagatcc agaacaccac 6960
ctacttgtgg tgggtgaaca atcagagcct gccagtctcc ccacgactcc agctgtcgaa 7020
cgacaacagg accctgactt tgctgtccgt gactcggaac gacgtgggcc cttatgaatg 7080
cggtatccag aacaagctgt ccgtggacca cagcgaccct gtgatcctga acgtccttta 7140
cgggccggac gaccccacca tttccccgtc gtacacttac taccggccgg gcgtgaacct 7200
gtccctgtcg tgccacgctg cctccaatcc gccggcccag tactcctggc tcatcgacgg 7260
aaacatccag cagcacaccc aagaactgtt catctccaac attaccgaga aaaactcggg 7320
actttacacc tgtcaagcca acaattccgc cagcggccac tcccgcacca ctgtcaaaac 7380
tatcactgtg tccgccgaac tcccgaagcc cagcatcagc tccaacaact cgaagcccgt 7440
ggaggataag gacgctgtcg cgttcacctg tgaaccagag gcacagaata ccacctacct 7500
ttggtgggtc aacggacagt ccctgcctgt ctcaccgaga ctgcagctgt caaacgggaa 7560
taggactctg accttgttta acgtcacccg gaacgacgcc cgggcctacg tgtgcggcat 7620
ccagaactcc gtgagcgcaa accggtctga cccagtgacc ctggatgtgc tgtacggccc 7680
cgacactccg atcatttcac cccccgattc atcctacctg tccggcgcta acctcaacct 7740
ctcatgccac tccgcatcca accccagccc gcaatattcg tggcgcatta acggaattcc 7800
tcagcaacat acccaggtcc tgttcattgc gaagatcacc cctaacaaca acggaaccta 7860
cgcctgcttt gtgtcaaacc tggccactgg tagaaacaac tccatcgtga agtccattac 7920
cgtgtcggcg tccggaactt ccccgggcct gagcgccggc gccaccgtgg gaattatgat 7980
cggcgtgctc gtgggagtgg ccctgatctg aagatctggg ccctaacaaa acaaaaagat 8040
ggggttattc cctaaacttc atgggttacg taattggaag ttgggggaca ttgccacaag 8100
atcatattgt acaaaagatc aaacactgtt ttagaaaact tcctgtaaac aggcctattg 8160
attggaaagt atgtcaaagg attgtgggtc ttttgggctt tgctgctcca tttacacaat 8220
gtggatatcc tgccttaatg cctttgtatg catgtataca agctaaacag gctttcactt 8280
tctcgccaac ttacaaggcc tttctaagta aacagtacat gaacctttac cccgttgctc 8340
ggcaacggcc tggtctgtgc caagtgtttg ctgacgcaac ccccactggc tggggcttgg 8400
ccataggcca tcagcgcatg cgtggaacct ttgtggctcc tctgccgatc catactgcgg 8460
aactcctagc cgcttgtttt gctcgcagcc ggtctggagc aaagctcata ggaactgaca 8520
attctgtcgt cctctcgcgg aaatatacat cgtttcgatc tacgtatgat ctttttccct 8580
ctgccaaaaa ttatggggac atcatgaagc cccttgagca tctgacttct ggctaataaa 8640
ggaaatttat tttcattgca atagtgtgtt ggaatttttt gtgtctctca ctcggaagga 8700
attctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct 8760
tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca 8820
gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac 8880
atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt 8940
ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg 9000
cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc 9060
tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc 9120
gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc 9180
aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac 9240
tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt 9300
aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct 9360
aactacggct acactagaag aacagtattt ggtatctgcg ctctgctgaa gccagttacc 9420
ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt 9480
ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg 9540
atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc 9600
atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa 9660
tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag 9720
gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact c 9771
<210> 62
<211> 36283
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 62
ccatcttcaa taatatacct caaacttttt gtgcgcgtta atatgcaaat gaggcgtttg 60
aatttgggga ggaagggcgg tgattggtcg agggatgagc gaccgttagg ggcggggcga 120
gtgacgtttt gatgacgtgg ttgcgaggag gagccagttt gcaagttctc gtgggaaaag 180
tgacgtcaaa cgaggtgtgg tttgaacacg gaaatactca attttcccgc gctctctgac 240
aggaaatgag gtgtttctgg gcggatgcaa gtgaaaacgg gccattttcg cgcgaaaact 300
gaatgaggaa gtgaaaatct gagtaatttc gcgtttatgg cagggaggag tatttgccga 360
gggccgagta gactttgacc gattacgtgg gggtttcgat taccgtgttt ttcacctaaa 420
tttccgcgta cggtgtcaaa gtccggtgtt tttactactg taatagtaat caattacggg 480
gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc 540
gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat 600
agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc 660
ccacttggca gtacatcaag tgtatcatat gccaagtacg ccccctattg acgtcaatga 720
cggtaaatgg cccgcctggc attatgccca gtacatgacc ttatgggact ttcctacttg 780
gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacat 840
caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt 900
caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactc 960
cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg gaggtctata taagcagagc 1020
tgtccctatc agtgatagag atctccctat cagtgataga gagtttagtg aaccgtcaga 1080
tccgctaggg taccgcgatc accatggcta gcggagctgc cccggagccg gagaggaccc 1140
ccgttggcca gggatcgtgg gcccatccgg gacgcaccag gggaccatcc gacaggggat 1200
tctgtgtggt gtcaccggcc aggccagcag aagaggcaac cagcctcgag ggagcgttgt 1260
ctggaaccag acattcccac ccgtcggtgg gccggcagca ccacgcggga ccaccgtcca 1320
cttccagacc gccacggcca tgggacaccc cttgcccgcc tgtgtatgcc gagactaaac 1380
acttcctgta ctcatccgga gacaaggaac agcttcggcc gtccttcctc ctgtcgtcgc 1440
tcagaccgag cctgaccgga gcacgcagat tggtggaaac tatcttcctt gggtcacgtc 1500
cgtggatgcc aggtacccca cggcgcctcc cgcgcctccc acagagatac tggcagatgc 1560
ggcctctgtt cctggaattg ctgggaaacc acgctcagtg cccgtacgga gtcctgctca 1620
agactcactg ccctctgagg gcggcggtca ctccggcggc cggagtgtgc gcacgggaga 1680
agccccaggg aagcgtggca gctccggaag aggaggacac cgatccgcgc cgcctcgtgc 1740
aacttctgcg ccagcactcc tcgccctggc aagtctacgg gttcgtccgc gcctgcctgc 1800
gccgcctggt gccgcctggg ctctggggtt cccggcataa cgagcgccgc ttcctgagaa 1860
atactaagaa gtttatctca cttggaaaac atgccaagtt gtcgctgcaa gaactcacgt 1920
ggaagatgtc agtccgcgat tgcgcctggc tgcgccgctc gccgggcgtc gggtgtgttc 1980
cagctgcaga acaccgcctg agagaagaaa ttctggccaa atttctgcat tggctgatgt 2040
cagtgtacgt ggtcgagctg ctgcgctcct ttttctacgt cactgagact acctttcaaa 2100
agaaccgcct gttcttctac cgcaaatctg tgtggagcaa gctgcagtca atcggcattc 2160
gccagcatct gaagagggtg cagctgcggg aactttccga ggcagaagtc cgccagcacc 2220
gggaggcccg gccggcgctt ctcacgtcgc gtctgagatt catcccaaag cccgacgggc 2280
tgaggcctat cgtcaacatg gattacgtcg tgggcgctcg cacctttcgc cgtgaaaagc 2340
gggccgaacg cttgacctca cgggtgaagg ccctcttctc cgtgctgaac tacgagagag 2400
caagacggcc tggcctgctg ggagcttcgg tgctgggact ggacgatatc caccgggctt 2460
ggcggacctt tgttctccgg gtgagagccc aagaccctcc gccggaactg tacttcgtga 2520
aggtggcgat caccggagcc tatgatacta ttccgcaaga tcgactcacc gaagtcatcg 2580
cctcgatcat caaaccgcag aacacttact gcgtcaggcg gtacgccgtg gtccagaagg 2640
ccgcgcatgg ccacgtgaga aaggcgttca agtcgcacgt gtccactctc accgacctcc 2700
agccttacat gaggcaattc gttgcgcatt tgcaagagac ttcgcccctg agagatgcgg 2760
tggtcatcga gcagagctcc agcctgaacg aagcgagcag cggtctgttt gacgtgttcc 2820
tccgcttcat gtgtcatcac gcggtgcgaa tcaggggaaa atcatacgtg cagtgccagg 2880
gaatcccaca aggcagcatt ctgtcgactc tcttgtgttc cctttgctac ggcgatatgg 2940
aaaacaagct gttcgctggg atcagacggg acgggttgct gctcagactg gtggacgact 3000
tcctgctggt gactccgcac ctcactcacg ccaaaacctt tctccgcact ctggtgaggg 3060
gagtgccaga atacggctgt gtggtcaatc tccggaaaac tgtggtgaat ttccctgtcg 3120
aggatgaggc actcggagga accgcatttg tccaaatgcc agcacatggc ctgttcccat 3180
ggtgcggtct gctgctggac acccgaactc ttgaagtgca gtccgactac tccagctatg 3240
cccggacgag catccgcgcc agcctcactt tcaatcgcgg ctttaaggcc ggacgaaaca 3300
tgcgcagaaa gcttttcgga gtcctccggc ttaaatgcca ttcgctcttt ctcgatctcc 3360
aagtcaattc gctgcagacc gtgtgcacga acatctacaa gatcctgctg ctccaagcct 3420
accggttcca cgcttgcgtg cttcagctgc cgtttcacca acaggtgtgg aagaacccga 3480
ccttctttct gcgggtcatt agcgatactg cctccctgtg ttactcaatc ctcaaggcaa 3540
agaacgccgg aatgtcgctg ggtgcgaaag gagccgcggg acctcttcct agcgaagcgg 3600
tgcagtggct ctgccaccag gctttcctcc tgaagctgac caggcacaga gtgacctacg 3660
tcccgctgct gggctcgctg cgcactgcac agacccagct gtctagaaaa ctccccggca 3720
ccaccctgac cgctctggaa gccgccgcca acccagcatt gccgtcagat ttcaagacca 3780
tcttggacgg atccggcaca atcctgtctg agggcgccac caacttcagc ctgctgaaac 3840
tggccggcga cgtggaactg aaccctggcc ctacccctgg aacccagagc cccttcttcc 3900
ttctgctgct gctgaccgtg ctgactgtcg tgacaggctc tggccacgcc agctctacac 3960
ctggcggcga gaaagagaca agcgccaccc agagaagcag cgtgccaagc agcaccgaga 4020
agaacgccgt gtccatgacc agctccgtgc tgagcagcca ctctcctggc agcggcagca 4080
gcacaacaca gggccaggat gtgacactgg cccctgccac agaacctgcc tctggatctg 4140
ccgccacctg gggacaggac gtgacaagcg tgccagtgac cagacctgcc ctgggctcta 4200
caacaccccc tgcccacgat gtgaccagcg cccctgataa caagcctgcc cctggaagca 4260
cagcccctcc agctcatggc gtgacctctg ccccagatac cagaccagcc ccaggatcta 4320
cagccccacc cgcacacggc gtgacaagtg cccctgacac aagacccgct ccaggctcta 4380
ctgctcctcc tgcccatggc gtgacaagcg ctcccgatac aaggccagct cctggctcca 4440
cagcaccacc agcacatggc gtgacatcag ctcccgacac tagacctgct cccggatcaa 4500
ccgctccacc agctcacggc gtgaccagcg cacctgatac cagacctgct ctgggaagca 4560
ccgcccctcc cgtgcacaat gtgacatctg cttccggcag cgccagcggc tctgcctcta 4620
cactggtgca caacggcacc agcgccagag ccacaacaac cccagccagc aagagcaccc 4680
ccttcagcat ccctagccac cacagcgaca cccctaccac actggccagc cactccacca 4740
agaccgatgc ctctagcacc caccactcca gcgtgccccc tctgaccagc agcaaccaca 4800
gcacaagccc ccagctgtct accggcgtct cattcttctt tctgtccttc cacatcagca 4860
acctgcagtt caacagcagc ctggaagatc ccagcaccga ctactaccag gaactgcagc 4920
gggatatcag cgagatgttc ctgcaaatct acaagcaggg cggcttcctg ggcctgagca 4980
acatcaagtt cagacccggc agcgtggtgg tgcagctgac cctggctttc cgggaaggca 5040
ccatcaacgt gcacgacgtg gaaacccagt tcaaccagta caagaccgag gccgccagcc 5100
ggtacaacct gaccatctcc gatgtgtccg tgtccgacgt gcccttccca ttctctgccc 5160
agtctggcgc aggcgtgcca ggatggggaa ttgctctgct ggtgctcgtg tgcgtgctgg 5220
tggccctggc catcgtgtat ctgattgccc tggccgtgtg ccagtgccgg cggaagaatt 5280
acggccagct ggacatcttc cccgccagag acacctacca ccccatgagc gagtacccca 5340
cataccacac ccacggcaga tacgtgccac ccagctccac cgacagatcc ccctacgaga 5400
aagtgtctgc cggcaacggc ggcagctccc tgagctacac aaatcctgcc gtggccgctg 5460
cctccgccaa cctgggatcc ggcagaatct tcaacgccca ctacgccggc tacttcgccg 5520
acctgctgat ccacgacatc gagacaaacc ctggccccga atcgccaagc gcaccccctc 5580
atcggtggtg catcccttgg caacgcctcc tcctgaccgc ctcactgctg actttctgga 5640
acccgccgac caccgcaaag ctgaccattg agagcactcc cttcaacgtg gctgagggga 5700
aggaggtgct gctcctggtg cacaatctgc cccagcacct gttcgggtac tcctggtaca 5760
agggagaacg cgtggacggg aaccggcaga tcataggcta cgtcatcgga acccagcagg 5820
ccacacccgg tccagcgtac agcggccggg agattatcta cccgaacgcc tccctgctga 5880
tccaaaacat catccagaac gacaccggtt tctacactct gcacgtgatt aagtcagatc 5940
tggtcaacga agaggccacc ggccaattca gggtgtaccc cgaactccct aagccgttca 6000
tcacctcgaa caacagcaac ccggtcgagg atgaagatgc ggtggccttg acgtgcgaac 6060
ctgagatcca gaacaccacc tacttgtggt gggtgaacaa tcagagcctg ccagtctccc 6120
cacgactcca gctgtcgaac gacaacagga ccctgacttt gctgtccgtg actcggaacg 6180
acgtgggccc ttatgaatgc ggtatccaga acaagctgtc cgtggaccac agcgaccctg 6240
tgatcctgaa cgtcctttac gggccggacg accccaccat ttccccgtcg tacacttact 6300
accggccggg cgtgaacctg tccctgtcgt gccacgctgc ctccaatccg ccggcccagt 6360
actcctggct catcgacgga aacatccagc agcacaccca agaactgttc atctccaaca 6420
ttaccgagaa aaactcggga ctttacacct gtcaagccaa caattccgcc agcggccact 6480
cccgcaccac tgtcaaaact atcactgtgt ccgccgaact cccgaagccc agcatcagct 6540
ccaacaactc gaagcccgtg gaggataagg acgctgtcgc gttcacctgt gaaccagagg 6600
cacagaatac cacctacctt tggtgggtca acggacagtc cctgcctgtc tcaccgagac 6660
tgcagctgtc aaacgggaat aggactctga ccttgtttaa cgtcacccgg aacgacgccc 6720
gggcctacgt gtgcggcatc cagaactccg tgagcgcaaa ccggtctgac ccagtgaccc 6780
tggatgtgct gtacggcccc gacactccga tcatttcacc ccccgattca tcctacctgt 6840
ccggcgctaa cctcaacctc tcatgccact ccgcatccaa ccccagcccg caatattcgt 6900
ggcgcattaa cggaattcct cagcaacata cccaggtcct gttcattgcg aagatcaccc 6960
ctaacaacaa cggaacctac gcctgctttg tgtcaaacct ggccactggt agaaacaact 7020
ccatcgtgaa gtccattacc gtgtcggcgt ccggaacttc cccgggcctg agcgccggcg 7080
ccaccgtggg aattatgatc ggcgtgctcg tgggagtggc cctgatctga cgcacctcga 7140
gctgatcata atcagccata ccacatttgt agaggtttta cttgctttaa aaaacctccc 7200
acacctcccc ctgaacctga aacataaaat gaatgcaatt gttgttgtta acttgtttat 7260
tgcagcttat aatggttaca aataaagcaa tagcatcaca aatttcacaa ataaagcatt 7320
tttttcactg cattctagtt gtggtttgtc caaactcatc aatgtatctt accaggtgcc 7380
gagcctgcga gtgcggaggg aagcatgcca ggttccagcc cgtgtgtgtg gatgtgacgg 7440
aggacctgcg acccgatcat ttggtgttgc cctgcaccgg gacggagttc ggttccagcg 7500
gggaagaatc tgactagagt gagtagtgtt ctggggcggg ggaggacctg catgagggcc 7560
agaataactg aaatctgtgc ttttctgtgt gttgcagcag catgagcgga agcggctcct 7620
ttgagggagg ggtattcagc ccttatctga cggggcgtct cccctcctgg gcgggagtgc 7680
gtcagaatgt gatgggatcc acggtggacg gccggcccgt gcagcccgcg aactcttcaa 7740
ccctgaccta tgcaaccctg agctcttcgt cgttggacgc agctgccgcc gcagctgctg 7800
catctgccgc cagcgccgtg cgcggaatgg ccatgggcgc cggctactac ggcactctgg 7860
tggccaactc gagttccacc aataatcccg ccagcctgaa cgaggagaag ctgttgctgc 7920
tgatggccca gctcgaggcc ttgacccagc gcctgggcga gctgacccag caggtggctc 7980
agctgcagga gcagacgcgg gccgcggttg ccacggtgaa atccaaataa aaaatgaatc 8040
aataaataaa cggagacggt tgttgatttt aacacagagt ctgaatcttt atttgatttt 8100
tcgcgcgcgg taggccctgg accaccggtc tcgatcattg agcacccggt ggatcttttc 8160
caggacccgg tagaggtggg cttggatgtt gaggtacatg ggcatgagcc cgtcccgggg 8220
gtggaggtag ctccattgca gggcctcgtg ctcgggggtg gtgttgtaaa tcacccagtc 8280
atagcagggg cgcagggcat ggtgttgcac aatatctttg aggaggagac tgatggccac 8340
gggcagccct ttggtgtagg tgtttacaaa tctgttgagc tgggagggat gcatgcgggg 8400
ggagatgagg tgcatcttgg cctggatctt gagattggcg atgttaccgc ccagatcccg 8460
cctggggttc atgttgtgca ggaccaccag cacggtgtat ccggtgcact tggggaattt 8520
atcatgcaac ttggaaggga aggcgtgaaa gaatttggcg acgcctttgt gcccgcccag 8580
gttttccatg cactcatcca tgatgatggc gatgggcccg tgggcggcgg cctgggcaaa 8640
gacgtttcgg gggtcggaca catcatagtt gtggtcctgg gtgaggtcat cataggccat 8700
tttaatgaat ttggggcgga gggtgccgga ctgggggaca aaggtaccct cgatcccggg 8760
ggcgtagttc ccctcacaga tctgcatctc ccaggctttg agctcggagg gggggatcat 8820
gtccacctgc ggggcgataa agaacacggt ttccggggcg ggggagatga gctgggccga 8880
aagcaagttc cggagcagct gggacttgcc gcagccggtg gggccgtaga tgaccccgat 8940
gaccggctgc aggtggtagt tgagggagag acagctgccg tcctcccgga ggaggggggc 9000
cacctcgttc atcatctcgc gcacgtgcat gttctcgcgc accagttccg ccaggaggcg 9060
ctctcccccc agggatagga gctcctggag cgaggcgaag tttttcagcg gcttgagtcc 9120
gtcggccatg ggcattttgg agagggtttg ttgcaagagt tccaggcggt cccagagctc 9180
ggtgatgtgc tctacggcat ctcgatccag cagacctcct cgtttcgcgg gttgggacgg 9240
ctgcgggagt agggcaccag acgatgggcg tccagcgcag ccagggtccg gtccttccag 9300
ggtcgcagcg tccgcgtcag ggtggtctcc gtcacggtga aggggtgcgc gccgggctgg 9360
gcgcttgcga gggtgcgctt caggctcatc cggctggtcg aaaaccgctc ccgatcggcg 9420
ccctgcgcgt cggccaggta gcaattgacc atgagttcgt agttgagcgc ctcggccgcg 9480
tggcctttgg cgcggagctt acctttggaa gtctgcccgc aggcgggaca gaggagggac 9540
ttgagggcgt agagcttggg ggcgaggaag acggactcgg gggcgtaggc gtccgcgccg 9600
cagtgggcgc agacggtctc gcactccacg agccaggtga ggtcgggctg gtcggggtca 9660
aaaaccagtt tcccgccgtt ctttttgatg cgtttcttac ctttggtctc catgagctcg 9720
tgtccccgct gggtgacaaa gaggctgtcc gtgtccccgt agaccgactt tatgggccgg 9780
tcctcgagcg gtgtgccgcg gtcctcctcg tagaggaacc ccgcccactc cgagacgaaa 9840
gcccgggtcc aggccagcac gaaggaggcc acgtgggacg ggtagcggtc gttgtccacc 9900
agcgggtcca ccttttccag ggtatgcaaa cacatgtccc cctcgtccac atccaggaag 9960
gtgattggct tgtaagtgta ggccacgtga ccgggggtcc cggccggggg ggtataaaag 10020
ggtgcgggtc cctgctcgtc ctcactgtct tccggatcgc tgtccaggag cgccagctgt 10080
tggggtaggt attccctctc gaaggcgggc atgacctcgg cactcaggtt gtcagtttct 10140
agaaacgagg aggatttgat attgacggtg ccggcggaga tgcctttcaa gagcccctcg 10200
tccatctggt cagaaaagac gatctttttg ttgtcgagct tggtggcgaa ggagccgtag 10260
agggcgttgg agaggagctt ggcgatggag cgcatggtct ggtttttttc cttgtcggcg 10320
cgctccttgg cggcgatgtt gagctgcacg tactcgcgcg ccacgcactt ccattcgggg 10380
aagacggtgg tcagctcgtc gggcacgatt ctgacctgcc agccccgatt atgcagggtg 10440
atgaggtcca cactggtggc cacctcgccg cgcaggggct cattagtcca gcagaggcgt 10500
ccgcccttgc gcgagcagaa ggggggcagg gggtccagca tgacctcgtc gggggggtcg 10560
gcatcgatgg tgaagatgcc gggcaggagg tcggggtcaa agtagctgat ggaagtggcc 10620
agatcgtcca gggcagcttg ccattcgcgc acggccagcg cgcgctcgta gggactgagg 10680
ggcgtgcccc agggcatggg atgggtaagc gcggaggcgt acatgccgca gatgtcgtag 10740
acgtagaggg gctcctcgag gatgccgatg taggtggggt agcagcgccc cccgcggatg 10800
ctggcgcgca cgtagtcata cagctcgtgc gagggggcga ggagccccgg gcccaggttg 10860
gtgcgactgg gcttttcggc gcggtagacg atctggcgga aaatggcatg cgagttggag 10920
gagatggtgg gcctttggaa gatgttgaag tgggcgtggg gcagtccgac cgagtcgcgg 10980
atgaagtggg cgtaggagtc ttgcagcttg gcgacgagct cggcggtgac taggacgtcc 11040
agagcgcagt agtcgagggt ctcctggatg atgtcatact tgagctgtcc cttttgtttc 11100
cacagctcgc ggttgagaag gaactcttcg cggtccttcc agtactcttc gagggggaac 11160
ccgtcctgat ctgcacggta agagcctagc atgtagaact ggttgacggc cttgtaggcg 11220
cagcagccct tctccacggg gagggcgtag gcctgggcgg ccttgcgcag ggaggtgtgc 11280
gtgagggcga aagtgtccct gaccatgacc ttgaggaact ggtgcttgaa gtcgatatcg 11340
tcgcagcccc cctgctccca gagctggaag tccgtgcgct tcttgtaggc ggggttgggc 11400
aaagcgaaag taacatcgtt gaagaggatc ttgcccgcgc ggggcataaa gttgcgagtg 11460
atgcggaaag gttggggcac ctcggcccgg ttgttgatga cctgggcggc gagcacgatc 11520
tcgtcgaagc cgttgatgtt gtggcccacg atgtagagtt ccacgaatcg cggacggccc 11580
ttgacgtggg gcagtttctt gagctcctcg taggtgagct cgtcggggtc gctgagcccg 11640
tgctgctcga gcgcccagtc ggcgagatgg gggttggcgc ggaggaagga agtccagaga 11700
tccacggcca gggcggtttg cagacggtcc cggtactgac ggaactgctg cccgacggcc 11760
attttttcgg gggtgacgca gtagaaggtg cgggggtccc cgtgccagcg atcccatttg 11820
agctggaggg cgagatcgag ggcgagctcg acgagccggt cgtccccgga gagtttcatg 11880
accagcatga aggggacgag ctgcttgccg aaggacccca tccaggtgta ggtttccaca 11940
tcgtaggtga ggaagagcct ttcggtgcga ggatgcgagc cgatggggaa gaactggatc 12000
tcctgccacc aattggagga atggctgttg atgtgatgga agtagaaatg ccgacggcgc 12060
gccgaacact cgtgcttgtg tttatacaag cggccacagt gctcgcaacg ctgcacggga 12120
tgcacgtgct gcacgagctg tacctgagtt cctttgacga ggaatttcag tgggaagtgg 12180
agtcgtggcg cctgcatctc gtgctgtact acgtcgtggt ggtcggcctg gccctcttct 12240
gcctcgatgg tggtcatgct gacgagcccg cgcgggaggc aggtccagac ctcggcgcga 12300
gcgggtcgga gagcgaggac gagggcgcgc aggccggagc tgtccagggt cctgagacgc 12360
tgcggagtca ggtcagtggg cagcggcggc gcgcggttga cttgcaggag tttttccagg 12420
gcgcgcggga ggtccagatg gtacttgatc tccaccgcgc cattggtggc gacgtcgatg 12480
gcttgcaggg tcccgtgccc ctggggtgtg accaccgtcc cccgtttctt cttgggcggc 12540
tggggcgacg ggggcggtgc ctcttccatg gttagaagcg gcggcgagga cgcgcgccgg 12600
gcggcagggg cggctcgggg cccggaggca ggggcggcag gggcacgtcg gcgccgcgcg 12660
cgggtaggtt ctggtactgc gcccggagaa gactggcgtg agcgacgacg cgacggttga 12720
cgtcctggat ctgacgcctc tgggtgaagg ccacgggacc cgtgagtttg aacctgaaag 12780
agagttcgac agaatcaatc tcggtatcgt tgacggcggc ctgccgcagg atctcttgca 12840
cgtcgcccga gttgtcctgg taggcgatct cggtcatgaa ctgctcgatc tcctcctctt 12900
gaaggtctcc gcggccggcg cgctccacgg tggccgcgag gtcgttggag atgcggccca 12960
tgagctgcga gaaggcgttc atgcccgcct cgttccagac gcggctgtag accacgacgc 13020
cctcgggatc gcgggcgcgc atgaccacct gggcgaggtt gagctccacg tggcgcgtga 13080
agaccgcgta gttgcagagg cgctggtaga ggtagttgag cgtggtggcg atgtgctcgg 13140
tgacgaagaa atacatgatc cagcggcgga gcggcatctc gctgacgtcg cccagcgcct 13200
ccaaacgttc catggcctcg taaaagtcca cggcgaagtt gaaaaactgg gagttgcgcg 13260
ccgagacggt caactcctcc tccagaagac ggatgagctc ggcgatggtg gcgcgcacct 13320
cgcgctcgaa ggcccccggg agttcctcca cttcctcttc ttcctcctcc actaacatct 13380
cttctacttc ctcctcaggc ggcagtggtg gcgggggagg gggcctgcgt cgccggcggc 13440
gcacgggcag acggtcgatg aagcgctcga tggtctcgcc gcgccggcgt cgcatggtct 13500
cggtgacggc gcgcccgtcc tcgcggggcc gcagcgtgaa gacgccgccg cgcatctcca 13560
ggtggccggg ggggtccccg ttgggcaggg agagggcgct gacgatgcat cttatcaatt 13620
gccccgtagg gactccgcgc aaggacctga gcgtctcgag atccacggga tctgaaaacc 13680
gctgaacgaa ggcttcgagc cagtcgcagt cgcaaggtag gctgagcacg gtttcttctg 13740
gcgggtcatg ttggttggga gcggggcggg cgatgctgct ggtgatgaag ttgaaatagg 13800
cggttctgag acggcggatg gtggcgagga gcaccaggtc tttgggcccg gcttgctgga 13860
tgcgcagacg gtcggccatg ccccaggcgt ggtcctgaca cctggccagg tccttgtagt 13920
agtcctgcat gagccgctcc acgggcacct cctcctcgcc cgcgcggccg tgcatgcgcg 13980
tgagcccgaa gccgcgctgg ggctggacga gcgccaggtc ggcgacgacg cgctcggcga 14040
ggatggcttg ctggatctgg gtgagggtgg tctggaagtc atcaaagtcg acgaagcggt 14100
ggtaggctcc ggtgttgatg gtgtaggagc agttggccat gacggaccag ttgacggtct 14160
ggtggcccgg acgcacgagc tcgtggtact tgaggcgcga gtaggcgcgc gtgtcgaaga 14220
tgtagtcgtt gcaggtgcgc accaggtact ggtagccgat gaggaagtgc ggcggcggct 14280
ggcggtagag cggccatcgc tcggtggcgg gggcgccggg cgcgaggtcc tcgagcatgg 14340
tgcggtggta gccgtagatg tacctggaca tccaggtgat gccggcggcg gtggtggagg 14400
cgcgcgggaa ctcgcggacg cggttccaga tgttgcgcag cggcaggaag tagttcatgg 14460
tgggcacggt ctggcccgtg aggcgcgcgc agtcgtggat gctctatacg ggcaaaaacg 14520
aaagcggtca gcggctcgac tccgtggcct ggaggctaag cgaacgggtt gggctgcgcg 14580
tgtaccccgg ttcgaatctc gaatcaggct ggagccgcag ctaacgtggt attggcactc 14640
ccgtctcgac ccaagcctgc accaaccctc caggatacgg aggcgggtcg ttttgcaact 14700
tttttttgga ggccggatga gactagtaag cgcggaaagc ggccgaccgc gatggctcgc 14760
tgccgtagtc tggagaagaa tcgccagggt tgcgttgcgg tgtgccccgg ttcgaggccg 14820
gccggattcc gcggctaacg agggcgtggc tgccccgtcg tttccaagac cccatagcca 14880
gccgacttct ccagttacgg agcgagcccc tcttttgttt tgtttgtttt tgccagatgc 14940
atcccgtact gcggcagatg cgcccccacc accctccacc gcaacaacag ccccctccac 15000
agccggcgct tctgcccccg ccccagcagc aacttccagc cacgaccgcc gcggccgccg 15060
tgagcggggc tggacagagt tatgatcacc agctggcctt ggaagagggc gaggggctgg 15120
cgcgcctggg ggcgtcgtcg ccggagcggc acccgcgcgt gcagatgaaa agggacgctc 15180
gcgaggccta cgtgcccaag cagaacctgt tcagagacag gagcggcgag gagcccgagg 15240
agatgcgcgc ggcccggttc cacgcggggc gggagctgcg gcgcggcctg gaccgaaaga 15300
gggtgctgag ggacgaggat ttcgaggcgg acgagctgac ggggatcagc cccgcgcgcg 15360
cgcacgtggc cgcggccaac ctggtcacgg cgtacgagca gaccgtgaag gaggagagca 15420
acttccaaaa atccttcaac aaccacgtgc gcaccctgat cgcgcgcgag gaggtgaccc 15480
tgggcctgat gcacctgtgg gacctgctgg aggccatcgt gcagaacccc accagcaagc 15540
cgctgacggc gcagctgttc ctggtggtgc agcatagtcg ggacaacgaa gcgttcaggg 15600
aggcgctgct gaatatcacc gagcccgagg gccgctggct cctggacctg gtgaacattc 15660
tgcagagcat cgtggtgcag gagcgcgggc tgccgctgtc cgagaagctg gcggccatca 15720
acttctcggt gctgagtttg ggcaagtact acgctaggaa gatctacaag accccgtacg 15780
tgcccataga caaggaggtg aagatcgacg ggttttacat gcgcatgacc ctgaaagtgc 15840
tgaccctgag cgacgatctg ggggtgtacc gcaacgacag gatgcaccgt gcggtgagcg 15900
ccagcaggcg gcgcgagctg agcgaccagg agctgatgca tagtctgcag cgggccctga 15960
ccggggccgg gaccgagggg gagagctact ttgacatggg cgcggacctg cactggcagc 16020
ccagccgccg ggccttggag gcggcggcag gaccctacgt agaagaggtg gacgatgagg 16080
tggacgagga gggcgagtac ctggaagact gatggcgcga ccgtattttt gctagatgca 16140
acaacaacag ccacctcctg atcccgcgat gcgggcggcg ctgcagagcc agccgtccgg 16200
cattaactcc tcggacgatt ggacccaggc catgcaacgc atcatggcgc tgacgacccg 16260
caaccccgaa gcctttagac agcagcccca ggccaaccgg ctctcggcca tcctggaggc 16320
cgtggtgccc tcgcgctcca accccacgca cgagaaggtc ctggccatcg tgaacgcgct 16380
ggtggagaac aaggccatcc gcggcgacga ggccggcctg gtgtacaacg cgctgctgga 16440
gcgcgtggcc cgctacaaca gcaccaacgt gcagaccaac ctggaccgca tggtgaccga 16500
cgtgcgcgag gccgtggccc agcgcgagcg gttccaccgc gagtccaacc tgggatccat 16560
ggtggcgctg aacgccttcc tcagcaccca gcccgccaac gtgccccggg gccaggagga 16620
ctacaccaac ttcatcagcg ccctgcgcct gatggtgacc gaggtgcccc agagcgaggt 16680
gtaccagtcc gggccggact acttcttcca gaccagtcgc cagggcttgc agaccgtgaa 16740
cctgagccag gctttcaaga acttgcaggg cctgtggggc gtgcaggccc cggtcgggga 16800
ccgcgcgacg gtgtcgagcc tgctgacgcc gaactcgcgc ctgctgctgc tgctggtggc 16860
ccccttcacg gacagcggca gcatcaaccg caactcgtac ctgggctacc tgattaacct 16920
gtaccgcgag gccatcggcc aggcgcacgt ggacgagcag acctaccagg agatcaccca 16980
cgtgagccgc gccctgggcc aggacgaccc gggcaacctg gaagccaccc tgaacttttt 17040
gctgaccaac cggtcgcaga agatcccgcc ccagtacgcg ctcagcaccg aggaggagcg 17100
catcctgcgt tacgtgcagc agagcgtggg cctgttcctg atgcaggagg gggccacccc 17160
cagcgccgcg ctcgacatga ccgcgcgcaa catggagccc agcatgtacg ccagcaaccg 17220
cccgttcatc aataaactga tggactactt gcatcgggcg gccgccatga actctgacta 17280
tttcaccaac gccatcctga atccccactg gctcccgccg ccggggttct acacgggcga 17340
gtacgacatg cccgacccca atgacgggtt cctgtgggac gatgtggaca gcagcgtgtt 17400
ctccccccga ccgggtgcta acgagcgccc cttgtggaag aaggaaggca gcgaccgacg 17460
cccgtcctcg gcgctgtccg gccgcgaggg tgctgccgcg gcggtgcccg aggccgccag 17520
tcctttcccg agcttgccct tctcgctgaa cagtatccgc agcagcgagc tgggcaggat 17580
cacgcgcccg cgcttgctgg gcgaagagga gtacttgaat gactcgctgt tgagacccga 17640
gcgggagaag aacttcccca ataacgggat agaaagcctg gtggacaaga tgagccgctg 17700
gaagacgtat gcgcaggagc acagggacga tccccgggcg tcgcaggggg ccacgagccg 17760
gggcagcgcc gcccgtaaac gccggtggca cgacaggcag cggggacaga tgtgggacga 17820
tgaggactcc gccgacgaca gcagcgtgtt ggacttgggt gggagtggta acccgttcgc 17880
tcacctgcgc ccccgtatcg ggcgcatgat gtaagagaaa ccgaaaataa atgatactca 17940
ccaaggccat ggcgaccagc gtgcgttcgt ttcttctctg ttgttgttgt atctagtatg 18000
atgaggcgtg cgtacccgga gggtcctcct ccctcgtacg agagcgtgat gcagcaggcg 18060
atggcggcgg cggcgatgca gcccccgctg gaggctcctt acgtgccccc gcggtacctg 18120
gcgcctacgg aggggcggaa cagcattcgt tactcggagc tggcaccctt gtacgatacc 18180
acccggttgt acctggtgga caacaagtcg gcggacatcg cctcgctgaa ctaccagaac 18240
gaccacagca acttcctgac caccgtggtg cagaacaatg acttcacccc cacggaggcc 18300
agcacccaga ccatcaactt tgacgagcgc tcgcggtggg gcggccagct gaaaaccatc 18360
atgcacacca acatgcccaa cgtgaacgag ttcatgtaca gcaacaagtt caaggcgcgg 18420
gtgatggtct cccgcaagac ccccaatggg gtgacagtga cagaggatta tgatggtagt 18480
caggatgagc tgaagtatga atgggtggaa tttgagctgc ccgaaggcaa cttctcggtg 18540
accatgacca tcgacctgat gaacaacgcc atcatcgaca attacttggc ggtggggcgg 18600
cagaacgggg tgctggagag cgacatcggc gtgaagttcg acactaggaa cttcaggctg 18660
ggctgggacc ccgtgaccga gctggtcatg cccggggtgt acaccaacga ggctttccat 18720
cccgatattg tcttgctgcc cggctgcggg gtggacttca ccgagagccg cctcagcaac 18780
ctgctgggca ttcgcaagag gcagcccttc caggaaggct tccagatcat gtacgaggat 18840
ctggaggggg gcaacatccc cgcgctcctg gatgtcgacg cctatgagaa aagcaaggag 18900
gatgcagcag ctgaagcaac tgcagccgta gctaccgcct ctaccgaggt caggggcgat 18960
aattttgcaa gcgccgcagc agtggcagcg gccgaggcgg ctgaaaccga aagtaagata 19020
gtcattcagc cggtggagaa ggatagcaag aacaggagct acaacgtact accggacaag 19080
ataaacaccg cctaccgcag ctggtaccta gcctacaact atggcgaccc cgagaagggc 19140
gtgcgctcct ggacgctgct caccacctcg gacgtcacct gcggcgtgga gcaagtctac 19200
tggtcgctgc ccgacatgat gcaagacccg gtcaccttcc gctccacgcg tcaagttagc 19260
aactacccgg tggtgggcgc cgagctcctg cccgtctact ccaagagctt cttcaacgag 19320
caggccgtct actcgcagca gctgcgcgcc ttcacctcgc ttacgcacgt cttcaaccgc 19380
ttccccgaga accagatcct cgtccgcccg cccgcgccca ccattaccac cgtcagtgaa 19440
aacgttcctg ctctcacaga tcacgggacc ctgccgctgc gcagcagtat ccggggagtc 19500
cagcgcgtga ccgttactga cgccagacgc cgcacctgcc cctacgtcta caaggccctg 19560
ggcatagtcg cgccgcgcgt cctctcgagc cgcaccttct aaatgtccat tctcatctcg 19620
cccagtaata acaccggttg gggcctgcgc gcgcccagca agatgtacgg aggcgctcgc 19680
caacgctcca cgcaacaccc cgtgcgcgtg cgcgggcact tccgcgctcc ctggggcgcc 19740
ctcaagggcc gcgtgcggtc gcgcaccacc gtcgacgacg tgatcgacca ggtggtggcc 19800
gacgcgcgca actacacccc cgccgccgcg cccgtctcca ccgtggacgc cgtcatcgac 19860
agcgtggtgg ccgacgcgcg ccggtacgcc cgcgccaaga gccggcggcg gcgcatcgcc 19920
cggcggcacc ggagcacccc cgccatgcgc gcggcgcgag ccttgctgcg cagggccagg 19980
cgcacgggac gcagggccat gctcagggcg gccagacgcg cggcttcagg cgccagcgcc 20040
ggcaggaccc ggagacgcgc ggccacggcg gcggcagcgg ccatcgccag catgtcccgc 20100
ccgcggcgag ggaacgtgta ctgggtgcgc gacgccgcca ccggtgtgcg cgtgcccgtg 20160
cgcacccgcc cccctcgcac ttgaagatgt tcacttcgcg atgttgatgt gtcccagcgg 20220
cgaggaggat gtccaagcgc aaattcaagg aagagatgct ccaggtcatc gcgcctgaga 20280
tctacggccc tgcggtggtg aaggaggaaa gaaagccccg caaaatcaag cgggtcaaaa 20340
aggacaaaaa ggaagaagaa agtgatgtgg acggattggt ggagtttgtg cgcgagttcg 20400
ccccccggcg gcgcgtgcag tggcgcgggc ggaaggtgca accggtgctg agacccggca 20460
ccaccgtggt cttcacgccc ggcgagcgct ccggcaccgc ttccaagcgc tcctacgacg 20520
aggtgtacgg ggatgatgat attctggagc aggcggccga gcgcctgggc gagtttgctt 20580
acggcaagcg cagccgttcc gcaccgaagg aagaggcggt gtccatcccg ctggaccacg 20640
gcaaccccac gccgagcctc aagcccgtga ccttgcagca ggtgctgccg accgcggcgc 20700
cgcgccgggg gttcaagcgc gagggcgagg atctgtaccc caccatgcag ctgatggtgc 20760
ccaagcgcca gaagctggaa gacgtgctgg agaccatgaa ggtggacccg gacgtgcagc 20820
ccgaggtcaa ggtgcggccc atcaagcagg tggccccggg cctgggcgtg cagaccgtgg 20880
acatcaagat tcccacggag cccatggaaa cgcagaccga gcccatgatc aagcccagca 20940
ccagcaccat ggaggtgcag acggatccct ggatgccatc ggctcctagt cgaagacccc 21000
ggcgcaagta cggcgcggcc agcctgctga tgcccaacta cgcgctgcat ccttccatca 21060
tccccacgcc gggctaccgc ggcacgcgct tctaccgcgg tcataccagc agccgccgcc 21120
gcaagaccac cactcgccgc cgccgtcgcc gcaccgccgc tgcaaccacc cctgccgccc 21180
tggtgcggag agtgtaccgc cgcggccgcg cacctctgac cctgccgcgc gcgcgctacc 21240
acccgagcat cgccatttaa actttcgcct gctttgcaga tcaatggccc tcacatgccg 21300
ccttcgcgtt cccattacgg gctaccgagg aagaaaaccg cgccgtagaa ggctggcggg 21360
gaacgggatg cgtcgccacc accaccggcg gcggcgcgcc atcagcaagc ggttgggggg 21420
aggcttcctg cccgcgctga tccccatcat cgccgcggcg atcggggcga tccccggcat 21480
tgcttccgtg gcggtgcagg cctctcagcg ccactgagac acacttggaa acatcttgta 21540
ataaaccaat ggactctgac gctcctggtc ctgtgatgtg ttttcgtaga cagatggaag 21600
acatcaattt ttcgtccctg gctccgcgac acggcacgcg gccgttcatg ggcacctgga 21660
gcgacatcgg caccagccaa ctgaacgggg gcgccttcaa ttggagcagt ctctggagcg 21720
ggcttaagaa tttcgggtcc acgcttaaaa cctatggcag caaggcgtgg aacagcacca 21780
cagggcaggc gctgagggat aagctgaaag agcagaactt ccagcagaag gtggtcgatg 21840
ggctcgcctc gggcatcaac ggggtggtgg acctggccaa ccaggccgtg cagcggcaga 21900
tcaacagccg cctggacccg gtgccgcccg ccggctccgt ggagatgccg caggtggagg 21960
aggagctgcc tcccctggac aagcggggcg agaagcgacc ccgccccgat gcggaggaga 22020
cgctgctgac gcacacggac gagccgcccc cgtacgagga ggcggtgaaa ctgggtctgc 22080
ccaccacgcg gcccatcgcg cccctggcca ccggggtgct gaaacccgaa aagcccgcga 22140
ccctggactt gcctcctccc cagccttccc gcccctctac agtggctaag cccctgccgc 22200
cggtggccgt ggcccgcgcg cgacccgggg gcaccgcccg ccctcatgcg aactggcaga 22260
gcactctgaa cagcatcgtg ggtctgggag tgcagagtgt gaagcgccgc cgctgctatt 22320
aaacctaccg tagcgcttaa cttgcttgtc tgtgtgtgta tgtattatgt cgccgccgcc 22380
gctgtccacc agaaggagga gtgaagaggc gcgtcgccga gttgcaagat ggccacccca 22440
tcgatgctgc cccagtgggc gtacatgcac atcgccggac aggacgcttc ggagtacctg 22500
agtccgggtc tggtgcagtt tgcccgcgcc acagacacct acttcagtct ggggaacaag 22560
tttaggaacc ccacggtggc gcccacgcac gatgtgacca ccgaccgcag ccagcggctg 22620
acgctgcgct tcgtgcccgt ggaccgcgag gacaacacct actcgtacaa agtgcgctac 22680
acgctggccg tgggcgacaa ccgcgtgctg gacatggcca gcacctactt tgacatccgc 22740
ggcgtgctgg atcggggccc tagcttcaaa ccctactccg gcaccgccta caacagtctg 22800
gcccccaagg gagcacccaa cacttgtcag tggacatata aagccgatgg tgaaactgcc 22860
acagaaaaaa cctatacata tggaaatgca cccgtgcagg gcattaacat cacaaaagat 22920
ggtattcaac ttggaactga caccgatgat cagccaatct acgcagataa aacctatcag 22980
cctgaacctc aagtgggtga tgctgaatgg catgacatca ctggtactga tgaaaagtat 23040
ggaggcagag ctcttaagcc tgataccaaa atgaagcctt gttatggttc ttttgccaag 23100
cctactaata aagaaggagg tcaggcaaat gtgaaaacag gaacaggcac tactaaagaa 23160
tatgacatag acatggcttt ctttgacaac agaagtgcgg ctgctgctgg cctagctcca 23220
gaaattgttt tgtatactga aaatgtggat ttggaaactc cagataccca tattgtatac 23280
aaagcaggca cagatgacag cagctcttct attaatttgg gtcagcaagc catgcccaac 23340
agacctaact acattggttt cagagacaac tttatcgggc tcatgtacta caacagcact 23400
ggcaatatgg gggtgctggc cggtcaggct tctcagctga atgctgtggt tgacttgcaa 23460
gacagaaaca ccgagctgtc ctaccagctc ttgcttgact ctctgggtga cagaacccgg 23520
tatttcagta tgtggaatca ggcggtggac agctatgatc ctgatgtgcg cattattgaa 23580
aatcatggtg tggaggatga acttcccaac tattgtttcc ctctggatgc tgttggcaga 23640
acagatactt atcagggaat taaggctaat ggaactgatc aaaccacatg gaccaaagat 23700
gacagtgtca atgatgctaa tgagataggc aagggtaatc cattcgccat ggaaatcaac 23760
atccaagcca acctgtggag gaacttcctc tacgccaacg tggccctgta cctgcccgac 23820
tcttacaagt acacgccggc caatgttacc ctgcccacca acaccaacac ctacgattac 23880
atgaacggcc gggtggtggc gccctcgctg gtggactcct acatcaacat cggggcgcgc 23940
tggtcgctgg atcccatgga caacgtgaac cccttcaacc accaccgcaa tgcggggctg 24000
cgctaccgct ccatgctcct gggcaacggg cgctacgtgc ccttccacat ccaggtgccc 24060
cagaaatttt tcgccatcaa gagcctcctg ctcctgcccg ggtcctacac ctacgagtgg 24120
aacttccgca aggacgtcaa catgatcctg cagagctccc tcggcaacga cctgcgcacg 24180
gacggggcct ccatctcctt caccagcatc aacctctacg ccaccttctt ccccatggcg 24240
cacaacacgg cctccacgct cgaggccatg ctgcgcaacg acaccaacga ccagtccttc 24300
aacgactacc tctcggcggc caacatgctc taccccatcc cggccaacgc caccaacgtg 24360
cccatctcca tcccctcgcg caactgggcc gccttccgcg gctggtcctt cacgcgtctc 24420
aagaccaagg agacgccctc gctgggctcc gggttcgacc cctacttcgt ctactcgggc 24480
tccatcccct acctcgacgg caccttctac ctcaaccaca ccttcaagaa ggtctccatc 24540
accttcgact cctccgtcag ctggcccggc aacgaccggc tcctgacgcc caacgagttc 24600
gaaatcaagc gcaccgtcga cggcgagggc tacaacgtgg cccagtgcaa catgaccaag 24660
gactggttcc tggtccagat gctggcccac tacaacatcg gctaccaggg cttctacgtg 24720
cccgagggct acaaggaccg catgtactcc ttcttccgca acttccagcc catgagccgc 24780
caggtggtgg acgaggtcaa ctacaaggac taccaggccg tcaccctggc ctaccagcac 24840
aacaactcgg gcttcgtcgg ctacctcgcg cccaccatgc gccagggcca gccctacccc 24900
gccaactacc cctacccgct catcggcaag agcgccgtca ccagcgtcac ccagaaaaag 24960
ttcctctgcg acagggtcat gtggcgcatc cccttctcca gcaacttcat gtccatgggc 25020
gcgctcaccg acctcggcca gaacatgctc tatgccaact ccgcccacgc gctagacatg 25080
aatttcgaag tcgaccccat ggatgagtcc acccttctct atgttgtctt cgaagtcttc 25140
gacgtcgtcc gagtgcacca gccccaccgc ggcgtcatcg aggccgtcta cctgcgcacc 25200
cccttctcgg ccggtaacgc caccacctaa gctcttgctt cttgcaagcc atggccgcgg 25260
gctccggcga gcaggagctc agggccatca tccgcgacct gggctgcggg ccctacttcc 25320
tgggcacctt cgataagcgc ttcccgggat tcatggcccc gcacaagctg gcctgcgcca 25380
tcgtcaacac ggccggccgc gagaccgggg gcgagcactg gctggccttc gcctggaacc 25440
cgcgctcgaa cacctgctac ctcttcgacc ccttcgggtt ctcggacgag cgcctcaagc 25500
agatctacca gttcgagtac gagggcctgc tgcgccgcag cgccctggcc accgaggacc 25560
gctgcgtcac cctggaaaag tccacccaga ccgtgcaggg tccgcgctcg gccgcctgcg 25620
ggctcttctg ctgcatgttc ctgcacgcct tcgtgcactg gcccgaccgc cccatggaca 25680
agaaccccac catgaacttg ctgacggggg tgcccaacgg catgctccag tcgccccagg 25740
tggaacccac cctgcgccgc aaccaggagg cgctctaccg cttcctcaac tcccactccg 25800
cctactttcg ctcccaccgc gcgcgcatcg agaaggccac cgccttcgac cgcatgaatc 25860
aagacatgta aaccgtgtgt gtatgttaaa tgtctttaat aaacagcact ttcatgttac 25920
acatgcatct gagatgattt atttagaaat cgaaagggtt ctgccgggtc tcggcatggc 25980
ccgcgggcag ggacacgttg cggaactggt acttggccag ccacttgaac tcggggatca 26040
gcagtttggg cagcggggtg tcggggaagg agtcggtcca cagcttccgc gtcagttgca 26100
gggcgcccag caggtcgggc gcggagatct tgaaatcgca gttgggaccc gcgttctgcg 26160
cgcgggagtt gcggtacacg gggttgcagc actggaacac catcagggcc gggtgcttca 26220
cgctcgccag caccgtcgcg tcggtgatgc tctccacgtc gaggtcctcg gcgttggcca 26280
tcccgaaggg ggtcatcttg caggtctgcc ttcccatggt gggcacgcac ccgggcttgt 26340
ggttgcaatc gcagtgcagg gggatcagca tcatctgggc ctggtcggcg ttcatccccg 26400
ggtacatggc cttcatgaaa gcctccaatt gcctgaacgc ctgctgggcc ttggctccct 26460
cggtgaagaa gaccccgcag gacttgctag agaactggtt ggtggcgcac ccggcgtcgt 26520
gcacgcagca gcgcgcgtcg ttgttggcca gctgcaccac gctgcgcccc cagcggttct 26580
gggtgatctt ggcccggtcg gggttctcct tcagcgcgcg ctgcccgttc tcgctcgcca 26640
catccatctc gatcatgtgc tccttctgga tcatggtggt cccgtgcagg caccgcagct 26700
tgccctcggc ctcggtgcac ccgtgcagcc acagcgcgca cccggtgcac tcccagttct 26760
tgtgggcgat ctgggaatgc gcgtgcacga agccctgcag gaagcggccc atcatggtgg 26820
tcagggtctt gttgctagtg aaggtcagcg gaatgccgcg gtgctcctcg ttgatgtaca 26880
ggtggcagat gcggcggtac acctcgccct gctcgggcat cagctggaag ttggctttca 26940
ggtcggtctc cacgcggtag cggtccatca gcatagtcat gatttccata cccttctccc 27000
aggccgagac gatgggcagg ctcatagggt tcttcaccat catcttagcg ctagcagccg 27060
cggccagggg gtcgctctcg tccagggtct caaagctccg cttgccgtcc ttctcggtga 27120
tccgcaccgg ggggtagctg aagcccacgg ccgccagctc ctcctcggcc tgtctttcgt 27180
cctcgctgtc ctggctgacg tcctgcagga ccacatgctt ggtcttgcgg ggtttcttct 27240
tgggcggcag cggcggcgga gatgttggag atggcgaggg ggagcgcgag ttctcgctca 27300
ccactactat ctcttcctct tcttggtccg aggccacgcg gcggtaggta tgtctcttcg 27360
ggggcagagg cggaggcgac gggctctcgc cgccgcgact tggcggatgg ctggcagagc 27420
cccttccgcg ttcgggggtg cgctcccggc ggcgctctga ctgacttcct ccgcggccgg 27480
ccattgtgtt ctcctaggga ggaacaacaa gcatggagac tcagccatcg ccaacctcgc 27540
catctgcccc caccgccgac gagaagcagc agcagcagaa tgaaagctta accgccccgc 27600
cgcccagccc cgccacctcc gacgcggccg tcccagacat gcaagagatg gaggaatcca 27660
tcgagattga cctgggctat gtgacgcccg cggagcacga ggaggagctg gcagtgcgct 27720
tttcacaaga agagatacac caagaacagc cagagcagga agcagagaat gagcagagtc 27780
aggctgggct cgagcatgac ggcgactacc tccacctgag cgggggggag gacgcgctca 27840
tcaagcatct ggcccggcag gccaccatcg tcaaggatgc gctgctcgac cgcaccgagg 27900
tgcccctcag cgtggaggag ctcagccgcg cctacgagtt gaacctcttc tcgccgcgcg 27960
tgccccccaa gcgccagccc aatggcacct gcgagcccaa cccgcgcctc aacttctacc 28020
cggtcttcgc ggtgcccgag gccctggcca cctaccacat ctttttcaag aaccaaaaga 28080
tccccgtctc ctgccgcgcc aaccgcaccc gcgccgacgc ccttttcaac ctgggtcccg 28140
gcgcccgcct acctgatatc gcctccttgg aagaggttcc caagatcttc gagggtctgg 28200
gcagcgacga gactcgggcc gcgaacgctc tgcaaggaga aggaggagag catgagcacc 28260
acagcgccct ggtcgagttg gaaggcgaca acgcgcggct ggcggtgctc aaacgcacgg 28320
tcgagctgac ccatttcgcc tacccggctc tgaacctgcc ccccaaagtc atgagcgcgg 28380
tcatggacca ggtgctcatc aagcgcgcgt cgcccatctc cgaggacgag ggcatgcaag 28440
actccgagga gggcaagccc gtggtcagcg acgagcagct ggcccggtgg ctgggtccta 28500
atgctagtcc ccagagtttg gaagagcggc gcaaactcat gatggccgtg gtcctggtga 28560
ccgtggagct ggagtgcctg cgccgcttct tcgccgacgc ggagaccctg cgcaaggtcg 28620
aggagaacct gcactacctc ttcaggcacg ggttcgtgcg ccaggcctgc aagatctcca 28680
acgtggagct gaccaacctg gtctcctaca tgggcatctt gcacgagaac cgcctggggc 28740
agaacgtgct gcacaccacc ctgcgcgggg aggcccggcg cgactacatc cgcgactgcg 28800
tctacctcta cctctgccac acctggcaga cgggcatggg cgtgtggcag cagtgtctgg 28860
aggagcagaa cctgaaagag ctctgcaagc tcctgcagaa gaacctcaag ggtctgtgga 28920
ccgggttcga cgagcgcacc accgcctcgg acctggccga cctcattttc cccgagcgcc 28980
tcaggctgac gctgcgcaac ggcctgcccg actttatgag ccaaagcatg ttgcaaaact 29040
ttcgctcttt catcctcgaa cgctccggaa tcctgcccgc cacctgctcc gcgctgccct 29100
cggacttcgt gccgctgacc ttccgcgagt gccccccgcc gctgtggagc cactgctacc 29160
tgctgcgcct ggccaactac ctggcctacc actcggacgt gatcgaggac gtcagcggcg 29220
agggcctgct cgagtgccac tgccgctgca acctctgcac gccgcaccgc tccctggcct 29280
gcaaccccca gctgctgagc gagacccaga tcatcggcac cttcgagttg caagggccca 29340
gcgaaggcga gggttcagcc gccaaggggg gtctgaaact caccccgggg ctgtggacct 29400
cggcctactt gcgcaagttc gtgcccgagg actaccatcc cttcgagatc aggttctacg 29460
aggaccaatc ccatccgccc aaggccgagc tgtcggcctg cgtcatcacc cagggggcga 29520
tcctggccca attgcaagcc atccagaaat cccgccaaga attcttgctg aaaaagggcc 29580
gcggggtcta cctcgacccc cagaccggtg aggagctcaa ccccggcttc ccccaggatg 29640
ccccgaggaa acaagaagct gaaagtggag ctgccgcccg tggaggattt ggaggaagac 29700
tgggagaaca gcagtcaggc agaggaggag gagatggagg aagactggga cagcactcag 29760
gcagaggagg acagcctgca agacagtctg gaggaagacg aggaggaggc agaggaggag 29820
gtggaagaag cagccgccgc cagaccgtcg tcctcggcgg gggagaaagc aagcagcacg 29880
gataccatct ccgctccggg tcggggtccc gctcgaccac acagtagatg ggacgagacc 29940
ggacgattcc cgaaccccac cacccagacc ggtaagaagg agcggcaggg atacaagtcc 30000
tggcgggggc acaaaaacgc catcgtctcc tgcttgcagg cctgcggggg caacatctcc 30060
ttcacccggc gctacctgct cttccaccgc ggggtgaact ttccccgcaa catcttgcat 30120
tactaccgtc acctccacag cccctactac ttccaagaag aggcagcagc agcagaaaaa 30180
gaccagcaga aaaccagcag ctagaaaatc cacagcggcg gcagcaggtg gactgaggat 30240
cgcggcgaac gagccggcgc aaacccggga gctgaggaac cggatctttc ccaccctcta 30300
tgccatcttc cagcagagtc gggggcagga gcaggaactg aaagtcaaga accgttctct 30360
gcgctcgctc acccgcagtt gtctgtatca caagagcgaa gaccaacttc agcgcactct 30420
cgaggacgcc gaggctctct tcaacaagta ctgcgcgctc actcttaaag agtagcccgc 30480
gcccgcccag tcgcagaaaa aggcgggaat tacgtcacct gtgcccttcg ccctagccgc 30540
ctccacccat catcatgagc aaagagattc ccacgcctta catgtggagc taccagcccc 30600
agatgggcct ggccgccggt gccgcccagg actactccac ccgcatgaat tggctcagcg 30660
ccgggcccgc gatgatctca cgggtgaatg acatccgcgc ccaccgaaac cagatactcc 30720
tagaacagtc agcgctcacc gccacgcccc gcaatcacct caatccgcgt aattggcccg 30780
ccgccctggt gtaccaggaa attccccagc ccacgaccgt actacttccg cgagacgccc 30840
aggccgaagt ccagctgact aactcaggtg tccagctggc gggcggcgcc accctgtgtc 30900
gtcaccgccc cgctcagggt ataaagcggc tggtgatccg gggcagaggc acacagctca 30960
acgacgaggt ggtgagctct tcgctgggtc tgcgacctga cggagtcttc caactcgccg 31020
gatcggggag atcttccttc acgcctcgtc aggccgtcct gactttggag agttcgtcct 31080
cgcagccccg ctcgggtggc atcggcactc tccagttcgt ggaggagttc actccctcgg 31140
tctacttcaa ccccttctcc ggctcccccg gccactaccc ggacgagttc atcccgaact 31200
tcgacgccat cagcgagtcg gtggacggct acgattgaat gtcccatggt ggcgcagctg 31260
acctagctcg gcttcgacac ctggaccact gccgccgctt ccgctgcttc gctcgggatc 31320
tcgccgagtt tgcctacttt gagctgcccg aggagcaccc tcagggcccg gcccacggag 31380
tgcggatcgt cgtcgaaggg ggcctcgact cccacctgct tcggatcttc agccagcgtc 31440
cgatcctggt cgagcgcgag caaggacaga cccttctgac tctgtactgc atctgcaacc 31500
accccggcct gcatgaaagt ctttgttgtc tgctgtgtac tgagtataat aaaagctgag 31560
atcagcgact actccggact tccgtgtgtt taaactcacc cccttatcca gtgaaataaa 31620
gatcatattg atgatgattt tacagaaata aaaaataatc atttgatttg aaataaagat 31680
acaatcatat tgatgatttg agtttaacaa aaaaataaag aatcacttac ttgaaatctg 31740
ataccaggtc tctgtccatg ttttctgcca acaccacttc actcccctct tcccagctct 31800
ggtactgcag gccccggcgg gctgcaaact tcctccacac gctgaagggg atgtcaaatt 31860
cctcctgtcc ctcaatcttc attttatctt ctatcagatg tccaaaaagc gcgtccgggt 31920
ggatgatgac ttcgaccccg tctaccccta cgatgcagac aacgcaccga ccgtgccctt 31980
catcaacccc cccttcgtct cttcagatgg attccaagag aagcccctgg gggtgttgtc 32040
cctgcgactg gccgaccccg tcaccaccaa gaacggggaa atcaccctca agctgggaga 32100
gggggtggac ctcgattcct cgggaaaact catctccaac acggccacca aggccgccgc 32160
ccctctcagt ttttccaaca acaccatttc ccttaacatg gatcacccct tttacactaa 32220
agatggaaaa ttatccttac aagtttctcc accattaaat atactgagaa caagcattct 32280
aaacacacta gctttaggtt ttggatcagg tttaggactc cgtggctctg ccttggcagt 32340
acagttagtc tctccactta catttgatac tgatggaaac ataaagctta ccttagacag 32400
aggtttgcat gttacaacag gagatgcaat tgaaagcaac ataagctggg ctaaaggttt 32460
aaaatttgaa gatggagcca tagcaaccaa cattggaaat gggttagagt ttggaagcag 32520
tagtacagaa acaggtgttg atgatgctta cccaatccaa gttaaacttg gatctggcct 32580
tagctttgac agtacaggag ccataatggc tggtaacaaa gaagacgata aactcacttt 32640
gtggacaaca cctgatccat caccaaactg tcaaatactc gcagaaaatg atgcaaaact 32700
aacactttgc ttgactaaat gtggtagtca aatactggcc actgtgtcag tcttagttgt 32760
aggaagtgga aacctaaacc ccattactgg caccgtaagc agtgctcagg tgtttctacg 32820
ttttgatgca aacggtgttc ttttaacaga acattctaca ctaaaaaaat actgggggta 32880
taggcaggga gatagcatag atggcactcc atataccaat gctgtaggat tcatgcccaa 32940
tttaaaagct tatccaaagt cacaaagttc tactactaaa aataatatag tagggcaagt 33000
atacatgaat ggagatgttt caaaacctat gcttctcact ataaccctca atggtactga 33060
tgacagcaac agtacatatt caatgtcatt ttcatacacc tggactaatg gaagctatgt 33120
tggagcaaca tttggggcta actcttatac cttctcatac atcgcccaag aatgaacact 33180
gtatcccacc ctgcatgcca acccttccca ccccactctg tggaacaaac tctgaaacac 33240
aaaataaaat aaagttcaag tgttttattg attcaacagt tttacaggat tcgagcagtt 33300
atttttcctc caccctccca ggacatggaa tacaccaccc tctccccccg cacagccttg 33360
aacatctgaa tgccattggt gatggacatg cttttggtct ccacgttcca cacagtttca 33420
gagcgagcca gtctcgggtc ggtcagggag atgaaaccct ccgggcactc ccgcatctgc 33480
acctcacagc tcaacagctg aggattgtcc tcggtggtcg ggatcacggt tatctggaag 33540
aagcagaaga gcggcggtgg gaatcatagt ccgcgaacgg gatcggccgg tggtgtcgca 33600
tcaggccccg cagcagtcgc tgccgccgcc gctccgtcaa gctgctgctc agggggtccg 33660
ggtccaggga ctccctcagc atgatgccca cggccctcag catcagtcgt ctggtgcggc 33720
gggcgcagca gcgcatgcgg atctcgctca ggtcgctgca gtacgtgcaa cacagaacca 33780
ccaggttgtt caacagtcca tagttcaaca cgctccagcc gaaactcatc gcgggaagga 33840
tgctacccac gtggccgtcg taccagatcc tcaggtaaat caagtggtgc cccctccaga 33900
acacgctgcc cacgtacatg atctccttgg gcatgtggcg gttcaccacc tcccggtacc 33960
acatcaccct ctggttgaac atgcagcccc ggatgatcct gcggaaccac agggccagca 34020
ccgccccgcc cgccatgcag cgaagagacc ccgggtcccg gcaatggcaa tggaggaccc 34080
accgctcgta cccgtggatc atctgggagc tgaacaagtc tatgttggca cagcacaggc 34140
atatgctcat gcatctcttc agcactctca actcctcggg ggtcaaaacc atatcccagg 34200
gcacggggaa ctcttgcagg acagcgaacc ccgcagaaca gggcaatcct cgcacagaac 34260
ttacattgtg catggacagg gtatcgcaat caggcagcac cgggtgatcc tccaccagag 34320
aagcgcgggt ctcggtctcc tcacagcgtg gtaagggggc cggccgatac gggtgatggc 34380
gggacgcggc tgatcgtgtt cgcgaccgtg tcatgatgca gttgctttcg gacattttcg 34440
tacttgctgt agcagaacct ggtccgggcg ctgcacaccg atcgccggcg gcggtctcgg 34500
cgcttggaac gctcggtgtt gaaattgtaa aacagccact ctctcagacc gtgcagcaga 34560
tctagggcct caggagtgat gaagatccca tcatgcctga tggctctgat cacatcgacc 34620
accgtggaat gggccagacc cagccagatg atgcaatttt gttgggtttc ggtgacggcg 34680
ggggagggaa gaacaggaag aaccatgatt aacttttaat ccaaacggtc tcggagtact 34740
tcaaaatgaa gatcgcggag atggcacctc tcgcccccgc tgtgttggtg gaaaataaca 34800
gccaggtcaa aggtgatacg gttctcgaga tgttccacgg tggcttccag caaagcctcc 34860
acgcgcacat ccagaaacaa gacaatagcg aaagcgggag ggttctctaa ttcctcaatc 34920
atcatgttac actcctgcac catccccaga taattttcat ttttccagcc ttgaatgatt 34980
cgaactagtt cctgaggtaa atccaagcca gccatgataa agagctcgcg cagagcgccc 35040
tccaccggca ttcttaagca caccctcata attccaagat attctgctcc tggttcacct 35100
gcagcagatt gacaagcgga atatcaaaat ctctgccgcg atccctgagc tcctccctca 35160
gcaataactg taagtactct ttcatatcct ctccgaaatt tttagccata ggaccaccag 35220
gaataagatt agggcaagcc acagtacaga taaaccgaag tcctccccag tgagcattgc 35280
caaatgcaag actgctataa gcatgctggc tagacccggt gatatcttcc agataactgg 35340
acagaaaatc gcccaggcaa tttttaagaa aatcaacaaa agaaaaatcc tccaggtgga 35400
cgtttagagc ctcgggaaca acgatgaagt aaatgcaagc ggtgcgttcc agcatggtta 35460
gttagctgat ctgtagaaaa aacaaaaatg aacattaaac catgctagcc tggcgaacag 35520
gtgggtaaat cgttctctcc agcaccaggc aggccacggg gtctccggcg cgaccctcgt 35580
aaaaattgtc gctatgattg aaaaccatca cagagagacg ttcccggtgg ccggcgtgaa 35640
tgattcgaca agatgaatac acccccggaa cattggcgtc cgcgagtgaa aaaaagcgcc 35700
cgaggaagca ataaggcact acaatgctca gtctcaagtc cagcaaagcg atgccatgcg 35760
gatgaagcac aaaattctca ggtgcgtaca aaatgtaatt actcccctcc tgcacaggca 35820
gcaaagcccc cgatccctcc aggtacacat acaaagcctc agcgtccata gcttaccgag 35880
cagcagcaca caacaggcgc aagagtcaga gaaaggctga gctctaacct gtccacccgc 35940
tctctgctca atatatagcc cagatctaca ctgacgtaaa ggccaaagtc taaaaatacc 36000
cgccaaataa tcacacacgc ccagcacacg cccagaaacc ggtgacacac tcaaaaaaat 36060
acgcgcactt cctcaaacgc ccaaaactgc cgtcatttcc gggttcccac gctacgtcat 36120
caaaacacga ctttcaaatt ccgtcgaccg ttaaaaacgt cacccgcccc gcccctaacg 36180
gtcgcccgtc tctcagccaa tcagcgcccc gcatccccaa attcaaacac ctcatttgca 36240
tattaacgcg cacaaaaagt ttgaggtata ttattgatga tgg 36283
<210> 63
<211> 9735
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 63
ggcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 60
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 120
agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 180
tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg 240
tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat 300
ggcaaaagct tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca 360
tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 420
aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcaaatgcaa ccggcgcagg 480
aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg 540
aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 600
aaatgcttga tggtcggaag aggcataaat tccgtcagcc agtttagtct gaccatctca 660
tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg 720
ggcttcccat acaatcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat 780
ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga gcaagacgtt 840
tcccgttgaa tatggctcat aacacccctt gtattactgt ttatgtaagc agacaggtcg 900
acaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg tacatttata 960
ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt tattaatagt 1020
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 1080
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 1140
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 1200
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt ccgcccccta 1260
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttacggg 1320
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 1380
tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 1440
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 1500
gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 1560
atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 1620
ttgacctcca tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 1680
gaacgcggat tccccgtgcc aagagtgact caccgtccgg atctcagcaa gcaggtatgt 1740
actctccagg gtgggcctgg cttccccagt caagactcca gggatttgag ggacgctgtg 1800
ggctcttctc ttacatgtac cttttgcttg cctcaaccct gactatcttc caggtcagga 1860
tcccagagtc aggggtctgt attttcctgc tggtggctcc agttcaggaa cagtaaaccc 1920
tgctccgaat attgcctctc acatctcgtc aatctccgcg aggactgggg accctgtgac 1980
gaacatggct agcggagctg ccccggagcc ggagaggacc cccgttggcc agggatcgtg 2040
ggcccatccg ggacgcacca ggggaccatc cgacagggga ttctgtgtgg tgtcaccggc 2100
caggccagca gaagaggcaa ccagcctcga gggagcgttg tctggaacca gacattccca 2160
cccgtcggtg ggccggcagc accacgcggg accaccgtcc acttccagac cgccacggcc 2220
atgggacacc ccttgcccgc ctgtgtatgc cgagactaaa cacttcctgt actcatccgg 2280
agacaaggaa cagcttcggc cgtccttcct cctgtcgtcg ctcagaccga gcctgaccgg 2340
agcacgcaga ttggtggaaa ctatcttcct tgggtcacgt ccgtggatgc caggtacccc 2400
acggcgcctc ccgcgcctcc cacagagata ctggcagatg cggcctctgt tcctggaatt 2460
gctgggaaac cacgctcagt gcccgtacgg agtcctgctc aagactcact gccctctgag 2520
ggcggcggtc actccggcgg ccggagtgtg cgcacgggag aagccccagg gaagcgtggc 2580
agctccggaa gaggaggaca ccgatccgcg ccgcctcgtg caacttctgc gccagcactc 2640
ctcgccctgg caagtctacg ggttcgtccg cgcctgcctg cgccgcctgg tgccgcctgg 2700
gctctggggt tcccggcata acgagcgccg cttcctgaga aatactaaga agtttatctc 2760
acttggaaaa catgccaagt tgtcgctgca agaactcacg tggaagatgt cagtccgcga 2820
ttgcgcctgg ctgcgccgct cgccgggcgt cgggtgtgtt ccagctgcag aacaccgcct 2880
gagagaagaa attctggcca aatttctgca ttggctgatg tcagtgtacg tggtcgagct 2940
gctgcgctcc tttttctacg tcactgagac tacctttcaa aagaaccgcc tgttcttcta 3000
ccgcaaatct gtgtggagca agctgcagtc aatcggcatt cgccagcatc tgaagagggt 3060
gcagctgcgg gaactttccg aggcagaagt ccgccagcac cgggaggccc ggccggcgct 3120
tctcacgtcg cgtctgagat tcatcccaaa gcccgacggg ctgaggccta tcgtcaacat 3180
ggattacgtc gtgggcgctc gcacctttcg ccgtgaaaag cgggccgaac gcttgacctc 3240
acgggtgaag gccctcttct ccgtgctgaa ctacgagaga gcaagacggc ctggcctgct 3300
gggagcttcg gtgctgggac tggacgatat ccaccgggct tggcggacct ttgttctccg 3360
ggtgagagcc caagaccctc cgccggaact gtacttcgtg aaggtggcga tcaccggagc 3420
ctatgatact attccgcaag atcgactcac cgaagtcatc gcctcgatca tcaaaccgca 3480
gaacacttac tgcgtcaggc ggtacgccgt ggtccagaag gccgcgcatg gccacgtgag 3540
aaaggcgttc aagtcgcacg tgtccactct caccgacctc cagccttaca tgaggcaatt 3600
cgttgcgcat ttgcaagaga cttcgcccct gagagatgcg gtggtcatcg agcagagctc 3660
cagcctgaac gaagcgagca gcggtctgtt tgacgtgttc ctccgcttca tgtgtcatca 3720
cgcggtgcga atcaggggaa aatcatacgt gcagtgccag ggaatcccac aaggcagcat 3780
tctgtcgact ctcttgtgtt ccctttgcta cggcgatatg gaaaacaagc tgttcgctgg 3840
gatcagacgg gacgggttgc tgctcagact ggtggacgac ttcctgctgg tgactccgca 3900
cctcactcac gccaaaacct ttctccgcac tctggtgagg ggagtgccag aatacggctg 3960
tgtggtcaat ctccggaaaa ctgtggtgaa tttccctgtc gaggatgagg cactcggagg 4020
aaccgcattt gtccaaatgc cagcacatgg cctgttccca tggtgcggtc tgctgctgga 4080
cacccgaact cttgaagtgc agtccgacta ctccagctat gcccggacga gcatccgcgc 4140
cagcctcact ttcaatcgcg gctttaaggc cggacgaaac atgcgcagaa agcttttcgg 4200
agtcctccgg cttaaatgcc attcgctctt tctcgatctc caagtcaatt cgctgcagac 4260
cgtgtgcacg aacatctaca agatcctgct gctccaagcc taccggttcc acgcttgcgt 4320
gcttcagctg ccgtttcacc aacaggtgtg gaagaacccg accttctttc tgcgggtcat 4380
tagcgatact gcctccctgt gttactcaat cctcaaggca aagaacgccg gaatgtcgct 4440
gggtgcgaaa ggagccgcgg gacctcttcc tagcgaagcg gtgcagtggc tctgccacca 4500
ggctttcctc ctgaagctga ccaggcacag agtgacctac gtcccgctgc tgggctcgct 4560
gcgcactgca cagacccagc tgtctagaaa actccccggc accaccctga ccgctctgga 4620
agccgccgcc aacccagcat tgccgtcaga tttcaagacc atcttggacg gatccggcca 4680
gtgcaccaat tacgccctgc tgaagctggc cggcgacgtg gaatctaacc ctggccctga 4740
atcgccaagc gcaccccctc atcggtggtg catcccttgg caacgcctcc tcctgaccgc 4800
ctcactgctg actttctgga acccgccgac caccgcaaag ctgaccattg agagcactcc 4860
cttcaacgtg gctgagggga aggaggtgct gctcctggtg cacaatctgc cccagcacct 4920
gttcgggtac tcctggtaca agggagaacg cgtggacggg aaccggcaga tcataggcta 4980
cgtcatcgga acccagcagg ccacacccgg tccagcgtac agcggccggg agattatcta 5040
cccgaacgcc tccctgctga tccaaaacat catccagaac gacaccggtt tctacactct 5100
gcacgtgatt aagtcagatc tggtcaacga agaggccacc ggccaattca gggtgtaccc 5160
cgaactccct aagccgttca tcacctcgaa caacagcaac ccggtcgagg atgaagatgc 5220
ggtggccttg acgtgcgaac ctgagatcca gaacaccacc tacttgtggt gggtgaacaa 5280
tcagagcctg ccagtctccc cacgactcca gctgtcgaac gacaacagga ccctgacttt 5340
gctgtccgtg actcggaacg acgtgggccc ttatgaatgc ggtatccaga acaagctgtc 5400
cgtggaccac agcgaccctg tgatcctgaa cgtcctttac gggccggacg accccaccat 5460
ttccccgtcg tacacttact accggccggg cgtgaacctg tccctgtcgt gccacgctgc 5520
ctccaatccg ccggcccagt actcctggct catcgacgga aacatccagc agcacaccca 5580
agaactgttc atctccaaca ttaccgagaa aaactcggga ctttacacct gtcaagccaa 5640
caattccgcc agcggccact cccgcaccac tgtcaaaact atcactgtgt ccgccgaact 5700
cccgaagccc agcatcagct ccaacaactc gaagcccgtg gaggataagg acgctgtcgc 5760
gttcacctgt gaaccagagg cacagaatac cacctacctt tggtgggtca acggacagtc 5820
cctgcctgtc tcaccgagac tgcagctgtc aaacgggaat aggactctga ccttgtttaa 5880
cgtcacccgg aacgacgccc gggcctacgt gtgcggcatc cagaactccg tgagcgcaaa 5940
ccggtctgac ccagtgaccc tggatgtgct gtacggcccc gacactccga tcatttcacc 6000
ccccgattca tcctacctgt ccggcgctaa cctcaacctc tcatgccact ccgcatccaa 6060
ccccagcccg caatattcgt ggcgcattaa cggaattcct cagcaacata cccaggtcct 6120
gttcattgcg aagatcaccc ctaacaacaa cggaacctac gcctgctttg tgtcaaacct 6180
ggccactggt agaaacaact ccatcgtgaa gtccattacc gtgtcggcgt ccggaacttc 6240
cccgggcctg agcgccggcg ccaccgtggg aattatgatc ggcgtgctcg tgggagtggc 6300
cctgatcgga tccggcgagg gcagaggcag cctgctgaca tgtggcgacg tggaagagaa 6360
ccctggcccc acccctggaa cccagagccc cttcttcctt ctgctgctgc tgaccgtgct 6420
gactgtcgtg acaggctctg gccacgccag ctctacacct ggcggcgaga aagagacaag 6480
cgccacccag agaagcagcg tgccaagcag caccgagaag aacgccgtgt ccatgaccag 6540
ctccgtgctg agcagccact ctcctggcag cggcagcagc acaacacagg gccaggatgt 6600
gacactggcc cctgccacag aacctgcctc tggatctgcc gccacctggg gacaggacgt 6660
gacaagcgtg ccagtgacca gacctgccct gggctctaca acaccccctg cccacgatgt 6720
gaccagcgcc cctgataaca agcctgcccc tggaagcaca gcccctccag ctcatggcgt 6780
gacctctgcc ccagatacca gaccagcccc aggatctaca gccccacccg cacacggcgt 6840
gacaagtgcc cctgacacaa gacccgctcc aggctctact gctcctcctg cccatggcgt 6900
gacaagcgct cccgatacaa ggccagctcc tggctccaca gcaccaccag cacatggcgt 6960
gacatcagct cccgacacta gacctgctcc cggatcaacc gctccaccag ctcacggcgt 7020
gaccagcgca cctgatacca gacctgctct gggaagcacc gcccctcccg tgcacaatgt 7080
gacatctgct tccggcagcg ccagcggctc tgcctctaca ctggtgcaca acggcaccag 7140
cgccagagcc acaacaaccc cagccagcaa gagcaccccc ttcagcatcc ctagccacca 7200
cagcgacacc cctaccacac tggccagcca ctccaccaag accgatgcct ctagcaccca 7260
ccactccagc gtgccccctc tgaccagcag caaccacagc acaagccccc agctgtctac 7320
cggcgtctca ttcttctttc tgtccttcca catcagcaac ctgcagttca acagcagcct 7380
ggaagatccc agcaccgact actaccagga actgcagcgg gatatcagcg agatgttcct 7440
gcaaatctac aagcagggcg gcttcctggg cctgagcaac atcaagttca gacccggcag 7500
cgtggtggtg cagctgaccc tggctttccg ggaaggcacc atcaacgtgc acgacgtgga 7560
aacccagttc aaccagtaca agaccgaggc cgccagccgg tacaacctga ccatctccga 7620
tgtgtccgtg tccgacgtgc ccttcccatt ctctgcccag tctggcgcag gcgtgccagg 7680
atggggaatt gctctgctgg tgctcgtgtg cgtgctggtg gccctggcca tcgtgtatct 7740
gattgccctg gccgtgtgcc agtgccggcg gaagaattac ggccagctgg acatcttccc 7800
cgccagagac acctaccacc ccatgagcga gtaccccaca taccacaccc acggcagata 7860
cgtgccaccc agctccaccg acagatcccc ctacgagaaa gtgtctgccg gcaacggcgg 7920
cagctccctg agctacacaa atcctgccgt ggccgctgcc tccgccaacc tgtgaagatc 7980
tgggccctaa caaaacaaaa agatggggtt attccctaaa cttcatgggt tacgtaattg 8040
gaagttgggg gacattgcca caagatcata ttgtacaaaa gatcaaacac tgttttagaa 8100
aacttcctgt aaacaggcct attgattgga aagtatgtca aaggattgtg ggtcttttgg 8160
gctttgctgc tccatttaca caatgtggat atcctgcctt aatgcctttg tatgcatgta 8220
tacaagctaa acaggctttc actttctcgc caacttacaa ggcctttcta agtaaacagt 8280
acatgaacct ttaccccgtt gctcggcaac ggcctggtct gtgccaagtg tttgctgacg 8340
caacccccac tggctggggc ttggccatag gccatcagcg catgcgtgga acctttgtgg 8400
ctcctctgcc gatccatact gcggaactcc tagccgcttg ttttgctcgc agccggtctg 8460
gagcaaagct cataggaact gacaattctg tcgtcctctc gcggaaatat acatcgtttc 8520
gatctacgta tgatcttttt ccctctgcca aaaattatgg ggacatcatg aagccccttg 8580
agcatctgac ttctggctaa taaaggaaat ttattttcat tgcaatagtg tgttggaatt 8640
ttttgtgtct ctcactcgga aggaattctg cattaatgaa tcggccaacg cgcggggaga 8700
ggcggtttgc gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc 8760
gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa 8820
tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt 8880
aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa 8940
aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt 9000
ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg 9060
tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc 9120
agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc 9180
gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta 9240
tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct 9300
acagagttct tgaagtggtg gcctaactac ggctacacta gaagaacagt atttggtatc 9360
tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa 9420
caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa 9480
aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa 9540
aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt 9600
ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac 9660
agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc 9720
atagttgcct gactc 9735
<210> 64
<211> 36247
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 64
ccatcttcaa taatatacct caaacttttt gtgcgcgtta atatgcaaat gaggcgtttg 60
aatttgggga ggaagggcgg tgattggtcg agggatgagc gaccgttagg ggcggggcga 120
gtgacgtttt gatgacgtgg ttgcgaggag gagccagttt gcaagttctc gtgggaaaag 180
tgacgtcaaa cgaggtgtgg tttgaacacg gaaatactca attttcccgc gctctctgac 240
aggaaatgag gtgtttctgg gcggatgcaa gtgaaaacgg gccattttcg cgcgaaaact 300
gaatgaggaa gtgaaaatct gagtaatttc gcgtttatgg cagggaggag tatttgccga 360
gggccgagta gactttgacc gattacgtgg gggtttcgat taccgtgttt ttcacctaaa 420
tttccgcgta cggtgtcaaa gtccggtgtt tttactactg taatagtaat caattacggg 480
gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc 540
gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat 600
agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc 660
ccacttggca gtacatcaag tgtatcatat gccaagtacg ccccctattg acgtcaatga 720
cggtaaatgg cccgcctggc attatgccca gtacatgacc ttatgggact ttcctacttg 780
gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacat 840
caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt 900
caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactc 960
cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg gaggtctata taagcagagc 1020
tgtccctatc agtgatagag atctccctat cagtgataga gagtttagtg aaccgtcaga 1080
tccgctaggg taccgcgatc accatggcta gcggagctgc cccggagccg gagaggaccc 1140
ccgttggcca gggatcgtgg gcccatccgg gacgcaccag gggaccatcc gacaggggat 1200
tctgtgtggt gtcaccggcc aggccagcag aagaggcaac cagcctcgag ggagcgttgt 1260
ctggaaccag acattcccac ccgtcggtgg gccggcagca ccacgcggga ccaccgtcca 1320
cttccagacc gccacggcca tgggacaccc cttgcccgcc tgtgtatgcc gagactaaac 1380
acttcctgta ctcatccgga gacaaggaac agcttcggcc gtccttcctc ctgtcgtcgc 1440
tcagaccgag cctgaccgga gcacgcagat tggtggaaac tatcttcctt gggtcacgtc 1500
cgtggatgcc aggtacccca cggcgcctcc cgcgcctccc acagagatac tggcagatgc 1560
ggcctctgtt cctggaattg ctgggaaacc acgctcagtg cccgtacgga gtcctgctca 1620
agactcactg ccctctgagg gcggcggtca ctccggcggc cggagtgtgc gcacgggaga 1680
agccccaggg aagcgtggca gctccggaag aggaggacac cgatccgcgc cgcctcgtgc 1740
aacttctgcg ccagcactcc tcgccctggc aagtctacgg gttcgtccgc gcctgcctgc 1800
gccgcctggt gccgcctggg ctctggggtt cccggcataa cgagcgccgc ttcctgagaa 1860
atactaagaa gtttatctca cttggaaaac atgccaagtt gtcgctgcaa gaactcacgt 1920
ggaagatgtc agtccgcgat tgcgcctggc tgcgccgctc gccgggcgtc gggtgtgttc 1980
cagctgcaga acaccgcctg agagaagaaa ttctggccaa atttctgcat tggctgatgt 2040
cagtgtacgt ggtcgagctg ctgcgctcct ttttctacgt cactgagact acctttcaaa 2100
agaaccgcct gttcttctac cgcaaatctg tgtggagcaa gctgcagtca atcggcattc 2160
gccagcatct gaagagggtg cagctgcggg aactttccga ggcagaagtc cgccagcacc 2220
gggaggcccg gccggcgctt ctcacgtcgc gtctgagatt catcccaaag cccgacgggc 2280
tgaggcctat cgtcaacatg gattacgtcg tgggcgctcg cacctttcgc cgtgaaaagc 2340
gggccgaacg cttgacctca cgggtgaagg ccctcttctc cgtgctgaac tacgagagag 2400
caagacggcc tggcctgctg ggagcttcgg tgctgggact ggacgatatc caccgggctt 2460
ggcggacctt tgttctccgg gtgagagccc aagaccctcc gccggaactg tacttcgtga 2520
aggtggcgat caccggagcc tatgatacta ttccgcaaga tcgactcacc gaagtcatcg 2580
cctcgatcat caaaccgcag aacacttact gcgtcaggcg gtacgccgtg gtccagaagg 2640
ccgcgcatgg ccacgtgaga aaggcgttca agtcgcacgt gtccactctc accgacctcc 2700
agccttacat gaggcaattc gttgcgcatt tgcaagagac ttcgcccctg agagatgcgg 2760
tggtcatcga gcagagctcc agcctgaacg aagcgagcag cggtctgttt gacgtgttcc 2820
tccgcttcat gtgtcatcac gcggtgcgaa tcaggggaaa atcatacgtg cagtgccagg 2880
gaatcccaca aggcagcatt ctgtcgactc tcttgtgttc cctttgctac ggcgatatgg 2940
aaaacaagct gttcgctggg atcagacggg acgggttgct gctcagactg gtggacgact 3000
tcctgctggt gactccgcac ctcactcacg ccaaaacctt tctccgcact ctggtgaggg 3060
gagtgccaga atacggctgt gtggtcaatc tccggaaaac tgtggtgaat ttccctgtcg 3120
aggatgaggc actcggagga accgcatttg tccaaatgcc agcacatggc ctgttcccat 3180
ggtgcggtct gctgctggac acccgaactc ttgaagtgca gtccgactac tccagctatg 3240
cccggacgag catccgcgcc agcctcactt tcaatcgcgg ctttaaggcc ggacgaaaca 3300
tgcgcagaaa gcttttcgga gtcctccggc ttaaatgcca ttcgctcttt ctcgatctcc 3360
aagtcaattc gctgcagacc gtgtgcacga acatctacaa gatcctgctg ctccaagcct 3420
accggttcca cgcttgcgtg cttcagctgc cgtttcacca acaggtgtgg aagaacccga 3480
ccttctttct gcgggtcatt agcgatactg cctccctgtg ttactcaatc ctcaaggcaa 3540
agaacgccgg aatgtcgctg ggtgcgaaag gagccgcggg acctcttcct agcgaagcgg 3600
tgcagtggct ctgccaccag gctttcctcc tgaagctgac caggcacaga gtgacctacg 3660
tcccgctgct gggctcgctg cgcactgcac agacccagct gtctagaaaa ctccccggca 3720
ccaccctgac cgctctggaa gccgccgcca acccagcatt gccgtcagat ttcaagacca 3780
tcttggacgg atccggccag tgcaccaatt acgccctgct gaagctggcc ggcgacgtgg 3840
aatctaaccc tggccctgaa tcgccaagcg caccccctca tcggtggtgc atcccttggc 3900
aacgcctcct cctgaccgcc tcactgctga ctttctggaa cccgccgacc accgcaaagc 3960
tgaccattga gagcactccc ttcaacgtgg ctgaggggaa ggaggtgctg ctcctggtgc 4020
acaatctgcc ccagcacctg ttcgggtact cctggtacaa gggagaacgc gtggacggga 4080
accggcagat cataggctac gtcatcggaa cccagcaggc cacacccggt ccagcgtaca 4140
gcggccggga gattatctac ccgaacgcct ccctgctgat ccaaaacatc atccagaacg 4200
acaccggttt ctacactctg cacgtgatta agtcagatct ggtcaacgaa gaggccaccg 4260
gccaattcag ggtgtacccc gaactcccta agccgttcat cacctcgaac aacagcaacc 4320
cggtcgagga tgaagatgcg gtggccttga cgtgcgaacc tgagatccag aacaccacct 4380
acttgtggtg ggtgaacaat cagagcctgc cagtctcccc acgactccag ctgtcgaacg 4440
acaacaggac cctgactttg ctgtccgtga ctcggaacga cgtgggccct tatgaatgcg 4500
gtatccagaa caagctgtcc gtggaccaca gcgaccctgt gatcctgaac gtcctttacg 4560
ggccggacga ccccaccatt tccccgtcgt acacttacta ccggccgggc gtgaacctgt 4620
ccctgtcgtg ccacgctgcc tccaatccgc cggcccagta ctcctggctc atcgacggaa 4680
acatccagca gcacacccaa gaactgttca tctccaacat taccgagaaa aactcgggac 4740
tttacacctg tcaagccaac aattccgcca gcggccactc ccgcaccact gtcaaaacta 4800
tcactgtgtc cgccgaactc ccgaagccca gcatcagctc caacaactcg aagcccgtgg 4860
aggataagga cgctgtcgcg ttcacctgtg aaccagaggc acagaatacc acctaccttt 4920
ggtgggtcaa cggacagtcc ctgcctgtct caccgagact gcagctgtca aacgggaata 4980
ggactctgac cttgtttaac gtcacccgga acgacgcccg ggcctacgtg tgcggcatcc 5040
agaactccgt gagcgcaaac cggtctgacc cagtgaccct ggatgtgctg tacggccccg 5100
acactccgat catttcaccc cccgattcat cctacctgtc cggcgctaac ctcaacctct 5160
catgccactc cgcatccaac cccagcccgc aatattcgtg gcgcattaac ggaattcctc 5220
agcaacatac ccaggtcctg ttcattgcga agatcacccc taacaacaac ggaacctacg 5280
cctgctttgt gtcaaacctg gccactggta gaaacaactc catcgtgaag tccattaccg 5340
tgtcggcgtc cggaacttcc ccgggcctga gcgccggcgc caccgtggga attatgatcg 5400
gcgtgctcgt gggagtggcc ctgatcggat ccggcgaggg cagaggcagc ctgctgacat 5460
gtggcgacgt ggaagagaac cctggcccca cccctggaac ccagagcccc ttcttccttc 5520
tgctgctgct gaccgtgctg actgtcgtga caggctctgg ccacgccagc tctacacctg 5580
gcggcgagaa agagacaagc gccacccaga gaagcagcgt gccaagcagc accgagaaga 5640
acgccgtgtc catgaccagc tccgtgctga gcagccactc tcctggcagc ggcagcagca 5700
caacacaggg ccaggatgtg acactggccc ctgccacaga acctgcctct ggatctgccg 5760
ccacctgggg acaggacgtg acaagcgtgc cagtgaccag acctgccctg ggctctacaa 5820
caccccctgc ccacgatgtg accagcgccc ctgataacaa gcctgcccct ggaagcacag 5880
cccctccagc tcatggcgtg acctctgccc cagataccag accagcccca ggatctacag 5940
ccccacccgc acacggcgtg acaagtgccc ctgacacaag acccgctcca ggctctactg 6000
ctcctcctgc ccatggcgtg acaagcgctc ccgatacaag gccagctcct ggctccacag 6060
caccaccagc acatggcgtg acatcagctc ccgacactag acctgctccc ggatcaaccg 6120
ctccaccagc tcacggcgtg accagcgcac ctgataccag acctgctctg ggaagcaccg 6180
cccctcccgt gcacaatgtg acatctgctt ccggcagcgc cagcggctct gcctctacac 6240
tggtgcacaa cggcaccagc gccagagcca caacaacccc agccagcaag agcaccccct 6300
tcagcatccc tagccaccac agcgacaccc ctaccacact ggccagccac tccaccaaga 6360
ccgatgcctc tagcacccac cactccagcg tgccccctct gaccagcagc aaccacagca 6420
caagccccca gctgtctacc ggcgtctcat tcttctttct gtccttccac atcagcaacc 6480
tgcagttcaa cagcagcctg gaagatccca gcaccgacta ctaccaggaa ctgcagcggg 6540
atatcagcga gatgttcctg caaatctaca agcagggcgg cttcctgggc ctgagcaaca 6600
tcaagttcag acccggcagc gtggtggtgc agctgaccct ggctttccgg gaaggcacca 6660
tcaacgtgca cgacgtggaa acccagttca accagtacaa gaccgaggcc gccagccggt 6720
acaacctgac catctccgat gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt 6780
ctggcgcagg cgtgccagga tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg 6840
ccctggccat cgtgtatctg attgccctgg ccgtgtgcca gtgccggcgg aagaattacg 6900
gccagctgga catcttcccc gccagagaca cctaccaccc catgagcgag taccccacat 6960
accacaccca cggcagatac gtgccaccca gctccaccga cagatccccc tacgagaaag 7020
tgtctgccgg caacggcggc agctccctga gctacacaaa tcctgccgtg gccgctgcct 7080
ccgccaacct gtgacgcacc tcgagctgat cataatcagc cataccacat ttgtagaggt 7140
tttacttgct ttaaaaaacc tcccacacct ccccctgaac ctgaaacata aaatgaatgc 7200
aattgttgtt gttaacttgt ttattgcagc ttataatggt tacaaataaa gcaatagcat 7260
cacaaatttc acaaataaag catttttttc actgcattct agttgtggtt tgtccaaact 7320
catcaatgta tcttaccagg tgccgagcct gcgagtgcgg agggaagcat gccaggttcc 7380
agcccgtgtg tgtggatgtg acggaggacc tgcgacccga tcatttggtg ttgccctgca 7440
ccgggacgga gttcggttcc agcggggaag aatctgacta gagtgagtag tgttctgggg 7500
cgggggagga cctgcatgag ggccagaata actgaaatct gtgcttttct gtgtgttgca 7560
gcagcatgag cggaagcggc tcctttgagg gaggggtatt cagcccttat ctgacggggc 7620
gtctcccctc ctgggcggga gtgcgtcaga atgtgatggg atccacggtg gacggccggc 7680
ccgtgcagcc cgcgaactct tcaaccctga cctatgcaac cctgagctct tcgtcgttgg 7740
acgcagctgc cgccgcagct gctgcatctg ccgccagcgc cgtgcgcgga atggccatgg 7800
gcgccggcta ctacggcact ctggtggcca actcgagttc caccaataat cccgccagcc 7860
tgaacgagga gaagctgttg ctgctgatgg cccagctcga ggccttgacc cagcgcctgg 7920
gcgagctgac ccagcaggtg gctcagctgc aggagcagac gcgggccgcg gttgccacgg 7980
tgaaatccaa ataaaaaatg aatcaataaa taaacggaga cggttgttga ttttaacaca 8040
gagtctgaat ctttatttga tttttcgcgc gcggtaggcc ctggaccacc ggtctcgatc 8100
attgagcacc cggtggatct tttccaggac ccggtagagg tgggcttgga tgttgaggta 8160
catgggcatg agcccgtccc gggggtggag gtagctccat tgcagggcct cgtgctcggg 8220
ggtggtgttg taaatcaccc agtcatagca ggggcgcagg gcatggtgtt gcacaatatc 8280
tttgaggagg agactgatgg ccacgggcag ccctttggtg taggtgttta caaatctgtt 8340
gagctgggag ggatgcatgc ggggggagat gaggtgcatc ttggcctgga tcttgagatt 8400
ggcgatgtta ccgcccagat cccgcctggg gttcatgttg tgcaggacca ccagcacggt 8460
gtatccggtg cacttgggga atttatcatg caacttggaa gggaaggcgt gaaagaattt 8520
ggcgacgcct ttgtgcccgc ccaggttttc catgcactca tccatgatga tggcgatggg 8580
cccgtgggcg gcggcctggg caaagacgtt tcgggggtcg gacacatcat agttgtggtc 8640
ctgggtgagg tcatcatagg ccattttaat gaatttgggg cggagggtgc cggactgggg 8700
gacaaaggta ccctcgatcc cgggggcgta gttcccctca cagatctgca tctcccaggc 8760
tttgagctcg gaggggggga tcatgtccac ctgcggggcg ataaagaaca cggtttccgg 8820
ggcgggggag atgagctggg ccgaaagcaa gttccggagc agctgggact tgccgcagcc 8880
ggtggggccg tagatgaccc cgatgaccgg ctgcaggtgg tagttgaggg agagacagct 8940
gccgtcctcc cggaggaggg gggccacctc gttcatcatc tcgcgcacgt gcatgttctc 9000
gcgcaccagt tccgccagga ggcgctctcc ccccagggat aggagctcct ggagcgaggc 9060
gaagtttttc agcggcttga gtccgtcggc catgggcatt ttggagaggg tttgttgcaa 9120
gagttccagg cggtcccaga gctcggtgat gtgctctacg gcatctcgat ccagcagacc 9180
tcctcgtttc gcgggttggg acggctgcgg gagtagggca ccagacgatg ggcgtccagc 9240
gcagccaggg tccggtcctt ccagggtcgc agcgtccgcg tcagggtggt ctccgtcacg 9300
gtgaaggggt gcgcgccggg ctgggcgctt gcgagggtgc gcttcaggct catccggctg 9360
gtcgaaaacc gctcccgatc ggcgccctgc gcgtcggcca ggtagcaatt gaccatgagt 9420
tcgtagttga gcgcctcggc cgcgtggcct ttggcgcgga gcttaccttt ggaagtctgc 9480
ccgcaggcgg gacagaggag ggacttgagg gcgtagagct tgggggcgag gaagacggac 9540
tcgggggcgt aggcgtccgc gccgcagtgg gcgcagacgg tctcgcactc cacgagccag 9600
gtgaggtcgg gctggtcggg gtcaaaaacc agtttcccgc cgttcttttt gatgcgtttc 9660
ttacctttgg tctccatgag ctcgtgtccc cgctgggtga caaagaggct gtccgtgtcc 9720
ccgtagaccg actttatggg ccggtcctcg agcggtgtgc cgcggtcctc ctcgtagagg 9780
aaccccgccc actccgagac gaaagcccgg gtccaggcca gcacgaagga ggccacgtgg 9840
gacgggtagc ggtcgttgtc caccagcggg tccacctttt ccagggtatg caaacacatg 9900
tccccctcgt ccacatccag gaaggtgatt ggcttgtaag tgtaggccac gtgaccgggg 9960
gtcccggccg ggggggtata aaagggtgcg ggtccctgct cgtcctcact gtcttccgga 10020
tcgctgtcca ggagcgccag ctgttggggt aggtattccc tctcgaaggc gggcatgacc 10080
tcggcactca ggttgtcagt ttctagaaac gaggaggatt tgatattgac ggtgccggcg 10140
gagatgcctt tcaagagccc ctcgtccatc tggtcagaaa agacgatctt tttgttgtcg 10200
agcttggtgg cgaaggagcc gtagagggcg ttggagagga gcttggcgat ggagcgcatg 10260
gtctggtttt tttccttgtc ggcgcgctcc ttggcggcga tgttgagctg cacgtactcg 10320
cgcgccacgc acttccattc ggggaagacg gtggtcagct cgtcgggcac gattctgacc 10380
tgccagcccc gattatgcag ggtgatgagg tccacactgg tggccacctc gccgcgcagg 10440
ggctcattag tccagcagag gcgtccgccc ttgcgcgagc agaagggggg cagggggtcc 10500
agcatgacct cgtcgggggg gtcggcatcg atggtgaaga tgccgggcag gaggtcgggg 10560
tcaaagtagc tgatggaagt ggccagatcg tccagggcag cttgccattc gcgcacggcc 10620
agcgcgcgct cgtagggact gaggggcgtg ccccagggca tgggatgggt aagcgcggag 10680
gcgtacatgc cgcagatgtc gtagacgtag aggggctcct cgaggatgcc gatgtaggtg 10740
gggtagcagc gccccccgcg gatgctggcg cgcacgtagt catacagctc gtgcgagggg 10800
gcgaggagcc ccgggcccag gttggtgcga ctgggctttt cggcgcggta gacgatctgg 10860
cggaaaatgg catgcgagtt ggaggagatg gtgggccttt ggaagatgtt gaagtgggcg 10920
tggggcagtc cgaccgagtc gcggatgaag tgggcgtagg agtcttgcag cttggcgacg 10980
agctcggcgg tgactaggac gtccagagcg cagtagtcga gggtctcctg gatgatgtca 11040
tacttgagct gtcccttttg tttccacagc tcgcggttga gaaggaactc ttcgcggtcc 11100
ttccagtact cttcgagggg gaacccgtcc tgatctgcac ggtaagagcc tagcatgtag 11160
aactggttga cggccttgta ggcgcagcag cccttctcca cggggagggc gtaggcctgg 11220
gcggccttgc gcagggaggt gtgcgtgagg gcgaaagtgt ccctgaccat gaccttgagg 11280
aactggtgct tgaagtcgat atcgtcgcag cccccctgct cccagagctg gaagtccgtg 11340
cgcttcttgt aggcggggtt gggcaaagcg aaagtaacat cgttgaagag gatcttgccc 11400
gcgcggggca taaagttgcg agtgatgcgg aaaggttggg gcacctcggc ccggttgttg 11460
atgacctggg cggcgagcac gatctcgtcg aagccgttga tgttgtggcc cacgatgtag 11520
agttccacga atcgcggacg gcccttgacg tggggcagtt tcttgagctc ctcgtaggtg 11580
agctcgtcgg ggtcgctgag cccgtgctgc tcgagcgccc agtcggcgag atgggggttg 11640
gcgcggagga aggaagtcca gagatccacg gccagggcgg tttgcagacg gtcccggtac 11700
tgacggaact gctgcccgac ggccattttt tcgggggtga cgcagtagaa ggtgcggggg 11760
tccccgtgcc agcgatccca tttgagctgg agggcgagat cgagggcgag ctcgacgagc 11820
cggtcgtccc cggagagttt catgaccagc atgaagggga cgagctgctt gccgaaggac 11880
cccatccagg tgtaggtttc cacatcgtag gtgaggaaga gcctttcggt gcgaggatgc 11940
gagccgatgg ggaagaactg gatctcctgc caccaattgg aggaatggct gttgatgtga 12000
tggaagtaga aatgccgacg gcgcgccgaa cactcgtgct tgtgtttata caagcggcca 12060
cagtgctcgc aacgctgcac gggatgcacg tgctgcacga gctgtacctg agttcctttg 12120
acgaggaatt tcagtgggaa gtggagtcgt ggcgcctgca tctcgtgctg tactacgtcg 12180
tggtggtcgg cctggccctc ttctgcctcg atggtggtca tgctgacgag cccgcgcggg 12240
aggcaggtcc agacctcggc gcgagcgggt cggagagcga ggacgagggc gcgcaggccg 12300
gagctgtcca gggtcctgag acgctgcgga gtcaggtcag tgggcagcgg cggcgcgcgg 12360
ttgacttgca ggagtttttc cagggcgcgc gggaggtcca gatggtactt gatctccacc 12420
gcgccattgg tggcgacgtc gatggcttgc agggtcccgt gcccctgggg tgtgaccacc 12480
gtcccccgtt tcttcttggg cggctggggc gacgggggcg gtgcctcttc catggttaga 12540
agcggcggcg aggacgcgcg ccgggcggca ggggcggctc ggggcccgga ggcaggggcg 12600
gcaggggcac gtcggcgccg cgcgcgggta ggttctggta ctgcgcccgg agaagactgg 12660
cgtgagcgac gacgcgacgg ttgacgtcct ggatctgacg cctctgggtg aaggccacgg 12720
gacccgtgag tttgaacctg aaagagagtt cgacagaatc aatctcggta tcgttgacgg 12780
cggcctgccg caggatctct tgcacgtcgc ccgagttgtc ctggtaggcg atctcggtca 12840
tgaactgctc gatctcctcc tcttgaaggt ctccgcggcc ggcgcgctcc acggtggccg 12900
cgaggtcgtt ggagatgcgg cccatgagct gcgagaaggc gttcatgccc gcctcgttcc 12960
agacgcggct gtagaccacg acgccctcgg gatcgcgggc gcgcatgacc acctgggcga 13020
ggttgagctc cacgtggcgc gtgaagaccg cgtagttgca gaggcgctgg tagaggtagt 13080
tgagcgtggt ggcgatgtgc tcggtgacga agaaatacat gatccagcgg cggagcggca 13140
tctcgctgac gtcgcccagc gcctccaaac gttccatggc ctcgtaaaag tccacggcga 13200
agttgaaaaa ctgggagttg cgcgccgaga cggtcaactc ctcctccaga agacggatga 13260
gctcggcgat ggtggcgcgc acctcgcgct cgaaggcccc cgggagttcc tccacttcct 13320
cttcttcctc ctccactaac atctcttcta cttcctcctc aggcggcagt ggtggcgggg 13380
gagggggcct gcgtcgccgg cggcgcacgg gcagacggtc gatgaagcgc tcgatggtct 13440
cgccgcgccg gcgtcgcatg gtctcggtga cggcgcgccc gtcctcgcgg ggccgcagcg 13500
tgaagacgcc gccgcgcatc tccaggtggc cgggggggtc cccgttgggc agggagaggg 13560
cgctgacgat gcatcttatc aattgccccg tagggactcc gcgcaaggac ctgagcgtct 13620
cgagatccac gggatctgaa aaccgctgaa cgaaggcttc gagccagtcg cagtcgcaag 13680
gtaggctgag cacggtttct tctggcgggt catgttggtt gggagcgggg cgggcgatgc 13740
tgctggtgat gaagttgaaa taggcggttc tgagacggcg gatggtggcg aggagcacca 13800
ggtctttggg cccggcttgc tggatgcgca gacggtcggc catgccccag gcgtggtcct 13860
gacacctggc caggtccttg tagtagtcct gcatgagccg ctccacgggc acctcctcct 13920
cgcccgcgcg gccgtgcatg cgcgtgagcc cgaagccgcg ctggggctgg acgagcgcca 13980
ggtcggcgac gacgcgctcg gcgaggatgg cttgctggat ctgggtgagg gtggtctgga 14040
agtcatcaaa gtcgacgaag cggtggtagg ctccggtgtt gatggtgtag gagcagttgg 14100
ccatgacgga ccagttgacg gtctggtggc ccggacgcac gagctcgtgg tacttgaggc 14160
gcgagtaggc gcgcgtgtcg aagatgtagt cgttgcaggt gcgcaccagg tactggtagc 14220
cgatgaggaa gtgcggcggc ggctggcggt agagcggcca tcgctcggtg gcgggggcgc 14280
cgggcgcgag gtcctcgagc atggtgcggt ggtagccgta gatgtacctg gacatccagg 14340
tgatgccggc ggcggtggtg gaggcgcgcg ggaactcgcg gacgcggttc cagatgttgc 14400
gcagcggcag gaagtagttc atggtgggca cggtctggcc cgtgaggcgc gcgcagtcgt 14460
ggatgctcta tacgggcaaa aacgaaagcg gtcagcggct cgactccgtg gcctggaggc 14520
taagcgaacg ggttgggctg cgcgtgtacc ccggttcgaa tctcgaatca ggctggagcc 14580
gcagctaacg tggtattggc actcccgtct cgacccaagc ctgcaccaac cctccaggat 14640
acggaggcgg gtcgttttgc aacttttttt tggaggccgg atgagactag taagcgcgga 14700
aagcggccga ccgcgatggc tcgctgccgt agtctggaga agaatcgcca gggttgcgtt 14760
gcggtgtgcc ccggttcgag gccggccgga ttccgcggct aacgagggcg tggctgcccc 14820
gtcgtttcca agaccccata gccagccgac ttctccagtt acggagcgag cccctctttt 14880
gttttgtttg tttttgccag atgcatcccg tactgcggca gatgcgcccc caccaccctc 14940
caccgcaaca acagccccct ccacagccgg cgcttctgcc cccgccccag cagcaacttc 15000
cagccacgac cgccgcggcc gccgtgagcg gggctggaca gagttatgat caccagctgg 15060
ccttggaaga gggcgagggg ctggcgcgcc tgggggcgtc gtcgccggag cggcacccgc 15120
gcgtgcagat gaaaagggac gctcgcgagg cctacgtgcc caagcagaac ctgttcagag 15180
acaggagcgg cgaggagccc gaggagatgc gcgcggcccg gttccacgcg gggcgggagc 15240
tgcggcgcgg cctggaccga aagagggtgc tgagggacga ggatttcgag gcggacgagc 15300
tgacggggat cagccccgcg cgcgcgcacg tggccgcggc caacctggtc acggcgtacg 15360
agcagaccgt gaaggaggag agcaacttcc aaaaatcctt caacaaccac gtgcgcaccc 15420
tgatcgcgcg cgaggaggtg accctgggcc tgatgcacct gtgggacctg ctggaggcca 15480
tcgtgcagaa ccccaccagc aagccgctga cggcgcagct gttcctggtg gtgcagcata 15540
gtcgggacaa cgaagcgttc agggaggcgc tgctgaatat caccgagccc gagggccgct 15600
ggctcctgga cctggtgaac attctgcaga gcatcgtggt gcaggagcgc gggctgccgc 15660
tgtccgagaa gctggcggcc atcaacttct cggtgctgag tttgggcaag tactacgcta 15720
ggaagatcta caagaccccg tacgtgccca tagacaagga ggtgaagatc gacgggtttt 15780
acatgcgcat gaccctgaaa gtgctgaccc tgagcgacga tctgggggtg taccgcaacg 15840
acaggatgca ccgtgcggtg agcgccagca ggcggcgcga gctgagcgac caggagctga 15900
tgcatagtct gcagcgggcc ctgaccgggg ccgggaccga gggggagagc tactttgaca 15960
tgggcgcgga cctgcactgg cagcccagcc gccgggcctt ggaggcggcg gcaggaccct 16020
acgtagaaga ggtggacgat gaggtggacg aggagggcga gtacctggaa gactgatggc 16080
gcgaccgtat ttttgctaga tgcaacaaca acagccacct cctgatcccg cgatgcgggc 16140
ggcgctgcag agccagccgt ccggcattaa ctcctcggac gattggaccc aggccatgca 16200
acgcatcatg gcgctgacga cccgcaaccc cgaagccttt agacagcagc cccaggccaa 16260
ccggctctcg gccatcctgg aggccgtggt gccctcgcgc tccaacccca cgcacgagaa 16320
ggtcctggcc atcgtgaacg cgctggtgga gaacaaggcc atccgcggcg acgaggccgg 16380
cctggtgtac aacgcgctgc tggagcgcgt ggcccgctac aacagcacca acgtgcagac 16440
caacctggac cgcatggtga ccgacgtgcg cgaggccgtg gcccagcgcg agcggttcca 16500
ccgcgagtcc aacctgggat ccatggtggc gctgaacgcc ttcctcagca cccagcccgc 16560
caacgtgccc cggggccagg aggactacac caacttcatc agcgccctgc gcctgatggt 16620
gaccgaggtg ccccagagcg aggtgtacca gtccgggccg gactacttct tccagaccag 16680
tcgccagggc ttgcagaccg tgaacctgag ccaggctttc aagaacttgc agggcctgtg 16740
gggcgtgcag gccccggtcg gggaccgcgc gacggtgtcg agcctgctga cgccgaactc 16800
gcgcctgctg ctgctgctgg tggccccctt cacggacagc ggcagcatca accgcaactc 16860
gtacctgggc tacctgatta acctgtaccg cgaggccatc ggccaggcgc acgtggacga 16920
gcagacctac caggagatca cccacgtgag ccgcgccctg ggccaggacg acccgggcaa 16980
cctggaagcc accctgaact ttttgctgac caaccggtcg cagaagatcc cgccccagta 17040
cgcgctcagc accgaggagg agcgcatcct gcgttacgtg cagcagagcg tgggcctgtt 17100
cctgatgcag gagggggcca cccccagcgc cgcgctcgac atgaccgcgc gcaacatgga 17160
gcccagcatg tacgccagca accgcccgtt catcaataaa ctgatggact acttgcatcg 17220
ggcggccgcc atgaactctg actatttcac caacgccatc ctgaatcccc actggctccc 17280
gccgccgggg ttctacacgg gcgagtacga catgcccgac cccaatgacg ggttcctgtg 17340
ggacgatgtg gacagcagcg tgttctcccc ccgaccgggt gctaacgagc gccccttgtg 17400
gaagaaggaa ggcagcgacc gacgcccgtc ctcggcgctg tccggccgcg agggtgctgc 17460
cgcggcggtg cccgaggccg ccagtccttt cccgagcttg cccttctcgc tgaacagtat 17520
ccgcagcagc gagctgggca ggatcacgcg cccgcgcttg ctgggcgaag aggagtactt 17580
gaatgactcg ctgttgagac ccgagcggga gaagaacttc cccaataacg ggatagaaag 17640
cctggtggac aagatgagcc gctggaagac gtatgcgcag gagcacaggg acgatccccg 17700
ggcgtcgcag ggggccacga gccggggcag cgccgcccgt aaacgccggt ggcacgacag 17760
gcagcgggga cagatgtggg acgatgagga ctccgccgac gacagcagcg tgttggactt 17820
gggtgggagt ggtaacccgt tcgctcacct gcgcccccgt atcgggcgca tgatgtaaga 17880
gaaaccgaaa ataaatgata ctcaccaagg ccatggcgac cagcgtgcgt tcgtttcttc 17940
tctgttgttg ttgtatctag tatgatgagg cgtgcgtacc cggagggtcc tcctccctcg 18000
tacgagagcg tgatgcagca ggcgatggcg gcggcggcga tgcagccccc gctggaggct 18060
ccttacgtgc ccccgcggta cctggcgcct acggaggggc ggaacagcat tcgttactcg 18120
gagctggcac ccttgtacga taccacccgg ttgtacctgg tggacaacaa gtcggcggac 18180
atcgcctcgc tgaactacca gaacgaccac agcaacttcc tgaccaccgt ggtgcagaac 18240
aatgacttca cccccacgga ggccagcacc cagaccatca actttgacga gcgctcgcgg 18300
tggggcggcc agctgaaaac catcatgcac accaacatgc ccaacgtgaa cgagttcatg 18360
tacagcaaca agttcaaggc gcgggtgatg gtctcccgca agacccccaa tggggtgaca 18420
gtgacagagg attatgatgg tagtcaggat gagctgaagt atgaatgggt ggaatttgag 18480
ctgcccgaag gcaacttctc ggtgaccatg accatcgacc tgatgaacaa cgccatcatc 18540
gacaattact tggcggtggg gcggcagaac ggggtgctgg agagcgacat cggcgtgaag 18600
ttcgacacta ggaacttcag gctgggctgg gaccccgtga ccgagctggt catgcccggg 18660
gtgtacacca acgaggcttt ccatcccgat attgtcttgc tgcccggctg cggggtggac 18720
ttcaccgaga gccgcctcag caacctgctg ggcattcgca agaggcagcc cttccaggaa 18780
ggcttccaga tcatgtacga ggatctggag gggggcaaca tccccgcgct cctggatgtc 18840
gacgcctatg agaaaagcaa ggaggatgca gcagctgaag caactgcagc cgtagctacc 18900
gcctctaccg aggtcagggg cgataatttt gcaagcgccg cagcagtggc agcggccgag 18960
gcggctgaaa ccgaaagtaa gatagtcatt cagccggtgg agaaggatag caagaacagg 19020
agctacaacg tactaccgga caagataaac accgcctacc gcagctggta cctagcctac 19080
aactatggcg accccgagaa gggcgtgcgc tcctggacgc tgctcaccac ctcggacgtc 19140
acctgcggcg tggagcaagt ctactggtcg ctgcccgaca tgatgcaaga cccggtcacc 19200
ttccgctcca cgcgtcaagt tagcaactac ccggtggtgg gcgccgagct cctgcccgtc 19260
tactccaaga gcttcttcaa cgagcaggcc gtctactcgc agcagctgcg cgccttcacc 19320
tcgcttacgc acgtcttcaa ccgcttcccc gagaaccaga tcctcgtccg cccgcccgcg 19380
cccaccatta ccaccgtcag tgaaaacgtt cctgctctca cagatcacgg gaccctgccg 19440
ctgcgcagca gtatccgggg agtccagcgc gtgaccgtta ctgacgccag acgccgcacc 19500
tgcccctacg tctacaaggc cctgggcata gtcgcgccgc gcgtcctctc gagccgcacc 19560
ttctaaatgt ccattctcat ctcgcccagt aataacaccg gttggggcct gcgcgcgccc 19620
agcaagatgt acggaggcgc tcgccaacgc tccacgcaac accccgtgcg cgtgcgcggg 19680
cacttccgcg ctccctgggg cgccctcaag ggccgcgtgc ggtcgcgcac caccgtcgac 19740
gacgtgatcg accaggtggt ggccgacgcg cgcaactaca cccccgccgc cgcgcccgtc 19800
tccaccgtgg acgccgtcat cgacagcgtg gtggccgacg cgcgccggta cgcccgcgcc 19860
aagagccggc ggcggcgcat cgcccggcgg caccggagca cccccgccat gcgcgcggcg 19920
cgagccttgc tgcgcagggc caggcgcacg ggacgcaggg ccatgctcag ggcggccaga 19980
cgcgcggctt caggcgccag cgccggcagg acccggagac gcgcggccac ggcggcggca 20040
gcggccatcg ccagcatgtc ccgcccgcgg cgagggaacg tgtactgggt gcgcgacgcc 20100
gccaccggtg tgcgcgtgcc cgtgcgcacc cgcccccctc gcacttgaag atgttcactt 20160
cgcgatgttg atgtgtccca gcggcgagga ggatgtccaa gcgcaaattc aaggaagaga 20220
tgctccaggt catcgcgcct gagatctacg gccctgcggt ggtgaaggag gaaagaaagc 20280
cccgcaaaat caagcgggtc aaaaaggaca aaaaggaaga agaaagtgat gtggacggat 20340
tggtggagtt tgtgcgcgag ttcgcccccc ggcggcgcgt gcagtggcgc gggcggaagg 20400
tgcaaccggt gctgagaccc ggcaccaccg tggtcttcac gcccggcgag cgctccggca 20460
ccgcttccaa gcgctcctac gacgaggtgt acggggatga tgatattctg gagcaggcgg 20520
ccgagcgcct gggcgagttt gcttacggca agcgcagccg ttccgcaccg aaggaagagg 20580
cggtgtccat cccgctggac cacggcaacc ccacgccgag cctcaagccc gtgaccttgc 20640
agcaggtgct gccgaccgcg gcgccgcgcc gggggttcaa gcgcgagggc gaggatctgt 20700
accccaccat gcagctgatg gtgcccaagc gccagaagct ggaagacgtg ctggagacca 20760
tgaaggtgga cccggacgtg cagcccgagg tcaaggtgcg gcccatcaag caggtggccc 20820
cgggcctggg cgtgcagacc gtggacatca agattcccac ggagcccatg gaaacgcaga 20880
ccgagcccat gatcaagccc agcaccagca ccatggaggt gcagacggat ccctggatgc 20940
catcggctcc tagtcgaaga ccccggcgca agtacggcgc ggccagcctg ctgatgccca 21000
actacgcgct gcatccttcc atcatcccca cgccgggcta ccgcggcacg cgcttctacc 21060
gcggtcatac cagcagccgc cgccgcaaga ccaccactcg ccgccgccgt cgccgcaccg 21120
ccgctgcaac cacccctgcc gccctggtgc ggagagtgta ccgccgcggc cgcgcacctc 21180
tgaccctgcc gcgcgcgcgc taccacccga gcatcgccat ttaaactttc gcctgctttg 21240
cagatcaatg gccctcacat gccgccttcg cgttcccatt acgggctacc gaggaagaaa 21300
accgcgccgt agaaggctgg cggggaacgg gatgcgtcgc caccaccacc ggcggcggcg 21360
cgccatcagc aagcggttgg ggggaggctt cctgcccgcg ctgatcccca tcatcgccgc 21420
ggcgatcggg gcgatccccg gcattgcttc cgtggcggtg caggcctctc agcgccactg 21480
agacacactt ggaaacatct tgtaataaac caatggactc tgacgctcct ggtcctgtga 21540
tgtgttttcg tagacagatg gaagacatca atttttcgtc cctggctccg cgacacggca 21600
cgcggccgtt catgggcacc tggagcgaca tcggcaccag ccaactgaac gggggcgcct 21660
tcaattggag cagtctctgg agcgggctta agaatttcgg gtccacgctt aaaacctatg 21720
gcagcaaggc gtggaacagc accacagggc aggcgctgag ggataagctg aaagagcaga 21780
acttccagca gaaggtggtc gatgggctcg cctcgggcat caacggggtg gtggacctgg 21840
ccaaccaggc cgtgcagcgg cagatcaaca gccgcctgga cccggtgccg cccgccggct 21900
ccgtggagat gccgcaggtg gaggaggagc tgcctcccct ggacaagcgg ggcgagaagc 21960
gaccccgccc cgatgcggag gagacgctgc tgacgcacac ggacgagccg cccccgtacg 22020
aggaggcggt gaaactgggt ctgcccacca cgcggcccat cgcgcccctg gccaccgggg 22080
tgctgaaacc cgaaaagccc gcgaccctgg acttgcctcc tccccagcct tcccgcccct 22140
ctacagtggc taagcccctg ccgccggtgg ccgtggcccg cgcgcgaccc gggggcaccg 22200
cccgccctca tgcgaactgg cagagcactc tgaacagcat cgtgggtctg ggagtgcaga 22260
gtgtgaagcg ccgccgctgc tattaaacct accgtagcgc ttaacttgct tgtctgtgtg 22320
tgtatgtatt atgtcgccgc cgccgctgtc caccagaagg aggagtgaag aggcgcgtcg 22380
ccgagttgca agatggccac cccatcgatg ctgccccagt gggcgtacat gcacatcgcc 22440
ggacaggacg cttcggagta cctgagtccg ggtctggtgc agtttgcccg cgccacagac 22500
acctacttca gtctggggaa caagtttagg aaccccacgg tggcgcccac gcacgatgtg 22560
accaccgacc gcagccagcg gctgacgctg cgcttcgtgc ccgtggaccg cgaggacaac 22620
acctactcgt acaaagtgcg ctacacgctg gccgtgggcg acaaccgcgt gctggacatg 22680
gccagcacct actttgacat ccgcggcgtg ctggatcggg gccctagctt caaaccctac 22740
tccggcaccg cctacaacag tctggccccc aagggagcac ccaacacttg tcagtggaca 22800
tataaagccg atggtgaaac tgccacagaa aaaacctata catatggaaa tgcacccgtg 22860
cagggcatta acatcacaaa agatggtatt caacttggaa ctgacaccga tgatcagcca 22920
atctacgcag ataaaaccta tcagcctgaa cctcaagtgg gtgatgctga atggcatgac 22980
atcactggta ctgatgaaaa gtatggaggc agagctctta agcctgatac caaaatgaag 23040
ccttgttatg gttcttttgc caagcctact aataaagaag gaggtcaggc aaatgtgaaa 23100
acaggaacag gcactactaa agaatatgac atagacatgg ctttctttga caacagaagt 23160
gcggctgctg ctggcctagc tccagaaatt gttttgtata ctgaaaatgt ggatttggaa 23220
actccagata cccatattgt atacaaagca ggcacagatg acagcagctc ttctattaat 23280
ttgggtcagc aagccatgcc caacagacct aactacattg gtttcagaga caactttatc 23340
gggctcatgt actacaacag cactggcaat atgggggtgc tggccggtca ggcttctcag 23400
ctgaatgctg tggttgactt gcaagacaga aacaccgagc tgtcctacca gctcttgctt 23460
gactctctgg gtgacagaac ccggtatttc agtatgtgga atcaggcggt ggacagctat 23520
gatcctgatg tgcgcattat tgaaaatcat ggtgtggagg atgaacttcc caactattgt 23580
ttccctctgg atgctgttgg cagaacagat acttatcagg gaattaaggc taatggaact 23640
gatcaaacca catggaccaa agatgacagt gtcaatgatg ctaatgagat aggcaagggt 23700
aatccattcg ccatggaaat caacatccaa gccaacctgt ggaggaactt cctctacgcc 23760
aacgtggccc tgtacctgcc cgactcttac aagtacacgc cggccaatgt taccctgccc 23820
accaacacca acacctacga ttacatgaac ggccgggtgg tggcgccctc gctggtggac 23880
tcctacatca acatcggggc gcgctggtcg ctggatccca tggacaacgt gaaccccttc 23940
aaccaccacc gcaatgcggg gctgcgctac cgctccatgc tcctgggcaa cgggcgctac 24000
gtgcccttcc acatccaggt gccccagaaa tttttcgcca tcaagagcct cctgctcctg 24060
cccgggtcct acacctacga gtggaacttc cgcaaggacg tcaacatgat cctgcagagc 24120
tccctcggca acgacctgcg cacggacggg gcctccatct ccttcaccag catcaacctc 24180
tacgccacct tcttccccat ggcgcacaac acggcctcca cgctcgaggc catgctgcgc 24240
aacgacacca acgaccagtc cttcaacgac tacctctcgg cggccaacat gctctacccc 24300
atcccggcca acgccaccaa cgtgcccatc tccatcccct cgcgcaactg ggccgccttc 24360
cgcggctggt ccttcacgcg tctcaagacc aaggagacgc cctcgctggg ctccgggttc 24420
gacccctact tcgtctactc gggctccatc ccctacctcg acggcacctt ctacctcaac 24480
cacaccttca agaaggtctc catcaccttc gactcctccg tcagctggcc cggcaacgac 24540
cggctcctga cgcccaacga gttcgaaatc aagcgcaccg tcgacggcga gggctacaac 24600
gtggcccagt gcaacatgac caaggactgg ttcctggtcc agatgctggc ccactacaac 24660
atcggctacc agggcttcta cgtgcccgag ggctacaagg accgcatgta ctccttcttc 24720
cgcaacttcc agcccatgag ccgccaggtg gtggacgagg tcaactacaa ggactaccag 24780
gccgtcaccc tggcctacca gcacaacaac tcgggcttcg tcggctacct cgcgcccacc 24840
atgcgccagg gccagcccta ccccgccaac tacccctacc cgctcatcgg caagagcgcc 24900
gtcaccagcg tcacccagaa aaagttcctc tgcgacaggg tcatgtggcg catccccttc 24960
tccagcaact tcatgtccat gggcgcgctc accgacctcg gccagaacat gctctatgcc 25020
aactccgccc acgcgctaga catgaatttc gaagtcgacc ccatggatga gtccaccctt 25080
ctctatgttg tcttcgaagt cttcgacgtc gtccgagtgc accagcccca ccgcggcgtc 25140
atcgaggccg tctacctgcg cacccccttc tcggccggta acgccaccac ctaagctctt 25200
gcttcttgca agccatggcc gcgggctccg gcgagcagga gctcagggcc atcatccgcg 25260
acctgggctg cgggccctac ttcctgggca ccttcgataa gcgcttcccg ggattcatgg 25320
ccccgcacaa gctggcctgc gccatcgtca acacggccgg ccgcgagacc gggggcgagc 25380
actggctggc cttcgcctgg aacccgcgct cgaacacctg ctacctcttc gaccccttcg 25440
ggttctcgga cgagcgcctc aagcagatct accagttcga gtacgagggc ctgctgcgcc 25500
gcagcgccct ggccaccgag gaccgctgcg tcaccctgga aaagtccacc cagaccgtgc 25560
agggtccgcg ctcggccgcc tgcgggctct tctgctgcat gttcctgcac gccttcgtgc 25620
actggcccga ccgccccatg gacaagaacc ccaccatgaa cttgctgacg ggggtgccca 25680
acggcatgct ccagtcgccc caggtggaac ccaccctgcg ccgcaaccag gaggcgctct 25740
accgcttcct caactcccac tccgcctact ttcgctccca ccgcgcgcgc atcgagaagg 25800
ccaccgcctt cgaccgcatg aatcaagaca tgtaaaccgt gtgtgtatgt taaatgtctt 25860
taataaacag cactttcatg ttacacatgc atctgagatg atttatttag aaatcgaaag 25920
ggttctgccg ggtctcggca tggcccgcgg gcagggacac gttgcggaac tggtacttgg 25980
ccagccactt gaactcgggg atcagcagtt tgggcagcgg ggtgtcgggg aaggagtcgg 26040
tccacagctt ccgcgtcagt tgcagggcgc ccagcaggtc gggcgcggag atcttgaaat 26100
cgcagttggg acccgcgttc tgcgcgcggg agttgcggta cacggggttg cagcactgga 26160
acaccatcag ggccgggtgc ttcacgctcg ccagcaccgt cgcgtcggtg atgctctcca 26220
cgtcgaggtc ctcggcgttg gccatcccga agggggtcat cttgcaggtc tgccttccca 26280
tggtgggcac gcacccgggc ttgtggttgc aatcgcagtg cagggggatc agcatcatct 26340
gggcctggtc ggcgttcatc cccgggtaca tggccttcat gaaagcctcc aattgcctga 26400
acgcctgctg ggccttggct ccctcggtga agaagacccc gcaggacttg ctagagaact 26460
ggttggtggc gcacccggcg tcgtgcacgc agcagcgcgc gtcgttgttg gccagctgca 26520
ccacgctgcg cccccagcgg ttctgggtga tcttggcccg gtcggggttc tccttcagcg 26580
cgcgctgccc gttctcgctc gccacatcca tctcgatcat gtgctccttc tggatcatgg 26640
tggtcccgtg caggcaccgc agcttgccct cggcctcggt gcacccgtgc agccacagcg 26700
cgcacccggt gcactcccag ttcttgtggg cgatctggga atgcgcgtgc acgaagccct 26760
gcaggaagcg gcccatcatg gtggtcaggg tcttgttgct agtgaaggtc agcggaatgc 26820
cgcggtgctc ctcgttgatg tacaggtggc agatgcggcg gtacacctcg ccctgctcgg 26880
gcatcagctg gaagttggct ttcaggtcgg tctccacgcg gtagcggtcc atcagcatag 26940
tcatgatttc catacccttc tcccaggccg agacgatggg caggctcata gggttcttca 27000
ccatcatctt agcgctagca gccgcggcca gggggtcgct ctcgtccagg gtctcaaagc 27060
tccgcttgcc gtccttctcg gtgatccgca ccggggggta gctgaagccc acggccgcca 27120
gctcctcctc ggcctgtctt tcgtcctcgc tgtcctggct gacgtcctgc aggaccacat 27180
gcttggtctt gcggggtttc ttcttgggcg gcagcggcgg cggagatgtt ggagatggcg 27240
agggggagcg cgagttctcg ctcaccacta ctatctcttc ctcttcttgg tccgaggcca 27300
cgcggcggta ggtatgtctc ttcgggggca gaggcggagg cgacgggctc tcgccgccgc 27360
gacttggcgg atggctggca gagccccttc cgcgttcggg ggtgcgctcc cggcggcgct 27420
ctgactgact tcctccgcgg ccggccattg tgttctccta gggaggaaca acaagcatgg 27480
agactcagcc atcgccaacc tcgccatctg cccccaccgc cgacgagaag cagcagcagc 27540
agaatgaaag cttaaccgcc ccgccgccca gccccgccac ctccgacgcg gccgtcccag 27600
acatgcaaga gatggaggaa tccatcgaga ttgacctggg ctatgtgacg cccgcggagc 27660
acgaggagga gctggcagtg cgcttttcac aagaagagat acaccaagaa cagccagagc 27720
aggaagcaga gaatgagcag agtcaggctg ggctcgagca tgacggcgac tacctccacc 27780
tgagcggggg ggaggacgcg ctcatcaagc atctggcccg gcaggccacc atcgtcaagg 27840
atgcgctgct cgaccgcacc gaggtgcccc tcagcgtgga ggagctcagc cgcgcctacg 27900
agttgaacct cttctcgccg cgcgtgcccc ccaagcgcca gcccaatggc acctgcgagc 27960
ccaacccgcg cctcaacttc tacccggtct tcgcggtgcc cgaggccctg gccacctacc 28020
acatcttttt caagaaccaa aagatccccg tctcctgccg cgccaaccgc acccgcgccg 28080
acgccctttt caacctgggt cccggcgccc gcctacctga tatcgcctcc ttggaagagg 28140
ttcccaagat cttcgagggt ctgggcagcg acgagactcg ggccgcgaac gctctgcaag 28200
gagaaggagg agagcatgag caccacagcg ccctggtcga gttggaaggc gacaacgcgc 28260
ggctggcggt gctcaaacgc acggtcgagc tgacccattt cgcctacccg gctctgaacc 28320
tgccccccaa agtcatgagc gcggtcatgg accaggtgct catcaagcgc gcgtcgccca 28380
tctccgagga cgagggcatg caagactccg aggagggcaa gcccgtggtc agcgacgagc 28440
agctggcccg gtggctgggt cctaatgcta gtccccagag tttggaagag cggcgcaaac 28500
tcatgatggc cgtggtcctg gtgaccgtgg agctggagtg cctgcgccgc ttcttcgccg 28560
acgcggagac cctgcgcaag gtcgaggaga acctgcacta cctcttcagg cacgggttcg 28620
tgcgccaggc ctgcaagatc tccaacgtgg agctgaccaa cctggtctcc tacatgggca 28680
tcttgcacga gaaccgcctg gggcagaacg tgctgcacac caccctgcgc ggggaggccc 28740
ggcgcgacta catccgcgac tgcgtctacc tctacctctg ccacacctgg cagacgggca 28800
tgggcgtgtg gcagcagtgt ctggaggagc agaacctgaa agagctctgc aagctcctgc 28860
agaagaacct caagggtctg tggaccgggt tcgacgagcg caccaccgcc tcggacctgg 28920
ccgacctcat tttccccgag cgcctcaggc tgacgctgcg caacggcctg cccgacttta 28980
tgagccaaag catgttgcaa aactttcgct ctttcatcct cgaacgctcc ggaatcctgc 29040
ccgccacctg ctccgcgctg ccctcggact tcgtgccgct gaccttccgc gagtgccccc 29100
cgccgctgtg gagccactgc tacctgctgc gcctggccaa ctacctggcc taccactcgg 29160
acgtgatcga ggacgtcagc ggcgagggcc tgctcgagtg ccactgccgc tgcaacctct 29220
gcacgccgca ccgctccctg gcctgcaacc cccagctgct gagcgagacc cagatcatcg 29280
gcaccttcga gttgcaaggg cccagcgaag gcgagggttc agccgccaag gggggtctga 29340
aactcacccc ggggctgtgg acctcggcct acttgcgcaa gttcgtgccc gaggactacc 29400
atcccttcga gatcaggttc tacgaggacc aatcccatcc gcccaaggcc gagctgtcgg 29460
cctgcgtcat cacccagggg gcgatcctgg cccaattgca agccatccag aaatcccgcc 29520
aagaattctt gctgaaaaag ggccgcgggg tctacctcga cccccagacc ggtgaggagc 29580
tcaaccccgg cttcccccag gatgccccga ggaaacaaga agctgaaagt ggagctgccg 29640
cccgtggagg atttggagga agactgggag aacagcagtc aggcagagga ggaggagatg 29700
gaggaagact gggacagcac tcaggcagag gaggacagcc tgcaagacag tctggaggaa 29760
gacgaggagg aggcagagga ggaggtggaa gaagcagccg ccgccagacc gtcgtcctcg 29820
gcgggggaga aagcaagcag cacggatacc atctccgctc cgggtcgggg tcccgctcga 29880
ccacacagta gatgggacga gaccggacga ttcccgaacc ccaccaccca gaccggtaag 29940
aaggagcggc agggatacaa gtcctggcgg gggcacaaaa acgccatcgt ctcctgcttg 30000
caggcctgcg ggggcaacat ctccttcacc cggcgctacc tgctcttcca ccgcggggtg 30060
aactttcccc gcaacatctt gcattactac cgtcacctcc acagccccta ctacttccaa 30120
gaagaggcag cagcagcaga aaaagaccag cagaaaacca gcagctagaa aatccacagc 30180
ggcggcagca ggtggactga ggatcgcggc gaacgagccg gcgcaaaccc gggagctgag 30240
gaaccggatc tttcccaccc tctatgccat cttccagcag agtcgggggc aggagcagga 30300
actgaaagtc aagaaccgtt ctctgcgctc gctcacccgc agttgtctgt atcacaagag 30360
cgaagaccaa cttcagcgca ctctcgagga cgccgaggct ctcttcaaca agtactgcgc 30420
gctcactctt aaagagtagc ccgcgcccgc ccagtcgcag aaaaaggcgg gaattacgtc 30480
acctgtgccc ttcgccctag ccgcctccac ccatcatcat gagcaaagag attcccacgc 30540
cttacatgtg gagctaccag ccccagatgg gcctggccgc cggtgccgcc caggactact 30600
ccacccgcat gaattggctc agcgccgggc ccgcgatgat ctcacgggtg aatgacatcc 30660
gcgcccaccg aaaccagata ctcctagaac agtcagcgct caccgccacg ccccgcaatc 30720
acctcaatcc gcgtaattgg cccgccgccc tggtgtacca ggaaattccc cagcccacga 30780
ccgtactact tccgcgagac gcccaggccg aagtccagct gactaactca ggtgtccagc 30840
tggcgggcgg cgccaccctg tgtcgtcacc gccccgctca gggtataaag cggctggtga 30900
tccggggcag aggcacacag ctcaacgacg aggtggtgag ctcttcgctg ggtctgcgac 30960
ctgacggagt cttccaactc gccggatcgg ggagatcttc cttcacgcct cgtcaggccg 31020
tcctgacttt ggagagttcg tcctcgcagc cccgctcggg tggcatcggc actctccagt 31080
tcgtggagga gttcactccc tcggtctact tcaacccctt ctccggctcc cccggccact 31140
acccggacga gttcatcccg aacttcgacg ccatcagcga gtcggtggac ggctacgatt 31200
gaatgtccca tggtggcgca gctgacctag ctcggcttcg acacctggac cactgccgcc 31260
gcttccgctg cttcgctcgg gatctcgccg agtttgccta ctttgagctg cccgaggagc 31320
accctcaggg cccggcccac ggagtgcgga tcgtcgtcga agggggcctc gactcccacc 31380
tgcttcggat cttcagccag cgtccgatcc tggtcgagcg cgagcaagga cagacccttc 31440
tgactctgta ctgcatctgc aaccaccccg gcctgcatga aagtctttgt tgtctgctgt 31500
gtactgagta taataaaagc tgagatcagc gactactccg gacttccgtg tgtttaaact 31560
caccccctta tccagtgaaa taaagatcat attgatgatg attttacaga aataaaaaat 31620
aatcatttga tttgaaataa agatacaatc atattgatga tttgagttta acaaaaaaat 31680
aaagaatcac ttacttgaaa tctgatacca ggtctctgtc catgttttct gccaacacca 31740
cttcactccc ctcttcccag ctctggtact gcaggccccg gcgggctgca aacttcctcc 31800
acacgctgaa ggggatgtca aattcctcct gtccctcaat cttcatttta tcttctatca 31860
gatgtccaaa aagcgcgtcc gggtggatga tgacttcgac cccgtctacc cctacgatgc 31920
agacaacgca ccgaccgtgc ccttcatcaa cccccccttc gtctcttcag atggattcca 31980
agagaagccc ctgggggtgt tgtccctgcg actggccgac cccgtcacca ccaagaacgg 32040
ggaaatcacc ctcaagctgg gagagggggt ggacctcgat tcctcgggaa aactcatctc 32100
caacacggcc accaaggccg ccgcccctct cagtttttcc aacaacacca tttcccttaa 32160
catggatcac cccttttaca ctaaagatgg aaaattatcc ttacaagttt ctccaccatt 32220
aaatatactg agaacaagca ttctaaacac actagcttta ggttttggat caggtttagg 32280
actccgtggc tctgccttgg cagtacagtt agtctctcca cttacatttg atactgatgg 32340
aaacataaag cttaccttag acagaggttt gcatgttaca acaggagatg caattgaaag 32400
caacataagc tgggctaaag gtttaaaatt tgaagatgga gccatagcaa ccaacattgg 32460
aaatgggtta gagtttggaa gcagtagtac agaaacaggt gttgatgatg cttacccaat 32520
ccaagttaaa cttggatctg gccttagctt tgacagtaca ggagccataa tggctggtaa 32580
caaagaagac gataaactca ctttgtggac aacacctgat ccatcaccaa actgtcaaat 32640
actcgcagaa aatgatgcaa aactaacact ttgcttgact aaatgtggta gtcaaatact 32700
ggccactgtg tcagtcttag ttgtaggaag tggaaaccta aaccccatta ctggcaccgt 32760
aagcagtgct caggtgtttc tacgttttga tgcaaacggt gttcttttaa cagaacattc 32820
tacactaaaa aaatactggg ggtataggca gggagatagc atagatggca ctccatatac 32880
caatgctgta ggattcatgc ccaatttaaa agcttatcca aagtcacaaa gttctactac 32940
taaaaataat atagtagggc aagtatacat gaatggagat gtttcaaaac ctatgcttct 33000
cactataacc ctcaatggta ctgatgacag caacagtaca tattcaatgt cattttcata 33060
cacctggact aatggaagct atgttggagc aacatttggg gctaactctt ataccttctc 33120
atacatcgcc caagaatgaa cactgtatcc caccctgcat gccaaccctt cccaccccac 33180
tctgtggaac aaactctgaa acacaaaata aaataaagtt caagtgtttt attgattcaa 33240
cagttttaca ggattcgagc agttattttt cctccaccct cccaggacat ggaatacacc 33300
accctctccc cccgcacagc cttgaacatc tgaatgccat tggtgatgga catgcttttg 33360
gtctccacgt tccacacagt ttcagagcga gccagtctcg ggtcggtcag ggagatgaaa 33420
ccctccgggc actcccgcat ctgcacctca cagctcaaca gctgaggatt gtcctcggtg 33480
gtcgggatca cggttatctg gaagaagcag aagagcggcg gtgggaatca tagtccgcga 33540
acgggatcgg ccggtggtgt cgcatcaggc cccgcagcag tcgctgccgc cgccgctccg 33600
tcaagctgct gctcaggggg tccgggtcca gggactccct cagcatgatg cccacggccc 33660
tcagcatcag tcgtctggtg cggcgggcgc agcagcgcat gcggatctcg ctcaggtcgc 33720
tgcagtacgt gcaacacaga accaccaggt tgttcaacag tccatagttc aacacgctcc 33780
agccgaaact catcgcggga aggatgctac ccacgtggcc gtcgtaccag atcctcaggt 33840
aaatcaagtg gtgccccctc cagaacacgc tgcccacgta catgatctcc ttgggcatgt 33900
ggcggttcac cacctcccgg taccacatca ccctctggtt gaacatgcag ccccggatga 33960
tcctgcggaa ccacagggcc agcaccgccc cgcccgccat gcagcgaaga gaccccgggt 34020
cccggcaatg gcaatggagg acccaccgct cgtacccgtg gatcatctgg gagctgaaca 34080
agtctatgtt ggcacagcac aggcatatgc tcatgcatct cttcagcact ctcaactcct 34140
cgggggtcaa aaccatatcc cagggcacgg ggaactcttg caggacagcg aaccccgcag 34200
aacagggcaa tcctcgcaca gaacttacat tgtgcatgga cagggtatcg caatcaggca 34260
gcaccgggtg atcctccacc agagaagcgc gggtctcggt ctcctcacag cgtggtaagg 34320
gggccggccg atacgggtga tggcgggacg cggctgatcg tgttcgcgac cgtgtcatga 34380
tgcagttgct ttcggacatt ttcgtacttg ctgtagcaga acctggtccg ggcgctgcac 34440
accgatcgcc ggcggcggtc tcggcgcttg gaacgctcgg tgttgaaatt gtaaaacagc 34500
cactctctca gaccgtgcag cagatctagg gcctcaggag tgatgaagat cccatcatgc 34560
ctgatggctc tgatcacatc gaccaccgtg gaatgggcca gacccagcca gatgatgcaa 34620
ttttgttggg tttcggtgac ggcgggggag ggaagaacag gaagaaccat gattaacttt 34680
taatccaaac ggtctcggag tacttcaaaa tgaagatcgc ggagatggca cctctcgccc 34740
ccgctgtgtt ggtggaaaat aacagccagg tcaaaggtga tacggttctc gagatgttcc 34800
acggtggctt ccagcaaagc ctccacgcgc acatccagaa acaagacaat agcgaaagcg 34860
ggagggttct ctaattcctc aatcatcatg ttacactcct gcaccatccc cagataattt 34920
tcatttttcc agccttgaat gattcgaact agttcctgag gtaaatccaa gccagccatg 34980
ataaagagct cgcgcagagc gccctccacc ggcattctta agcacaccct cataattcca 35040
agatattctg ctcctggttc acctgcagca gattgacaag cggaatatca aaatctctgc 35100
cgcgatccct gagctcctcc ctcagcaata actgtaagta ctctttcata tcctctccga 35160
aatttttagc cataggacca ccaggaataa gattagggca agccacagta cagataaacc 35220
gaagtcctcc ccagtgagca ttgccaaatg caagactgct ataagcatgc tggctagacc 35280
cggtgatatc ttccagataa ctggacagaa aatcgcccag gcaattttta agaaaatcaa 35340
caaaagaaaa atcctccagg tggacgttta gagcctcggg aacaacgatg aagtaaatgc 35400
aagcggtgcg ttccagcatg gttagttagc tgatctgtag aaaaaacaaa aatgaacatt 35460
aaaccatgct agcctggcga acaggtgggt aaatcgttct ctccagcacc aggcaggcca 35520
cggggtctcc ggcgcgaccc tcgtaaaaat tgtcgctatg attgaaaacc atcacagaga 35580
gacgttcccg gtggccggcg tgaatgattc gacaagatga atacaccccc ggaacattgg 35640
cgtccgcgag tgaaaaaaag cgcccgagga agcaataagg cactacaatg ctcagtctca 35700
agtccagcaa agcgatgcca tgcggatgaa gcacaaaatt ctcaggtgcg tacaaaatgt 35760
aattactccc ctcctgcaca ggcagcaaag cccccgatcc ctccaggtac acatacaaag 35820
cctcagcgtc catagcttac cgagcagcag cacacaacag gcgcaagagt cagagaaagg 35880
ctgagctcta acctgtccac ccgctctctg ctcaatatat agcccagatc tacactgacg 35940
taaaggccaa agtctaaaaa tacccgccaa ataatcacac acgcccagca cacgcccaga 36000
aaccggtgac acactcaaaa aaatacgcgc acttcctcaa acgcccaaaa ctgccgtcat 36060
ttccgggttc ccacgctacg tcatcaaaac acgactttca aattccgtcg accgttaaaa 36120
acgtcacccg ccccgcccct aacggtcgcc cgtctctcag ccaatcagcg ccccgcatcc 36180
ccaaattcaa acacctcatt tgcatattaa cgcgcacaaa aagtttgagg tatattattg 36240
atgatgg 36247
<210> 65
<211> 9576
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 65
ggcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 60
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 120
agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 180
tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg 240
tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat 300
ggcaaaagct tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca 360
tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 420
aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcaaatgcaa ccggcgcagg 480
aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg 540
aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 600
aaatgcttga tggtcggaag aggcataaat tccgtcagcc agtttagtct gaccatctca 660
tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg 720
ggcttcccat acaatcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat 780
ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga gcaagacgtt 840
tcccgttgaa tatggctcat aacacccctt gtattactgt ttatgtaagc agacaggtcg 900
acaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg tacatttata 960
ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt tattaatagt 1020
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 1080
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 1140
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 1200
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt ccgcccccta 1260
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttacggg 1320
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 1380
tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 1440
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 1500
gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 1560
atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 1620
ttgacctcca tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 1680
gaacgcggat tccccgtgcc aagagtgact caccgtccgg atctcagcaa gcaggtatgt 1740
actctccagg gtgggcctgg cttccccagt caagactcca gggatttgag ggacgctgtg 1800
ggctcttctc ttacatgtac cttttgcttg cctcaaccct gactatcttc caggtcagga 1860
tcccagagtc aggggtctgt attttcctgc tggtggctcc agttcaggaa cagtaaaccc 1920
tgctccgaat attgcctctc acatctcgtc aatctccgcg aggactgggg accctgtgac 1980
gaacatggct agcacccctg gaacccagag ccccttcttc cttctgctgc tgctgaccgt 2040
gctgactgtc gtgacaggct ctggccacgc cagctctaca cctggcggcg agaaagagac 2100
aagcgccacc cagagaagca gcgtgccaag cagcaccgag aagaacgccg tgtccatgac 2160
cagctccgtg ctgagcagcc actctcctgg cagcggcagc agcacaacac agggccagga 2220
tgtgacactg gcccctgcca cagaacctgc ctctggatct gccgccacct ggggacagga 2280
cgtgacaagc gtgccagtga ccagacctgc cctgggctct acaacacccc ctgcccacga 2340
tgtgaccagc gcccctgata acaagcctgc ccctggaagc acagcccctc cagctcatgg 2400
cgtgacctct gccccagata ccagaccagc cccaggatct acagccccac ccgcacacgg 2460
cgtgacaagt gcccctgaca caagacccgc tccaggctct actgctcctc ctgcccatgg 2520
cgtgacaagc gctcccgata caaggccagc tcctggctcc acagcaccac cagcacatgg 2580
cgtgacatca gctcccgaca ctagacctgc tcccggatca accgctccac cagctcacgg 2640
cgtgaccagc gcacctgata ccagacctgc tctgggaagc accgcccctc ccgtgcacaa 2700
tgtgacatct gcttccggca gcgccagcgg ctctgcctct acactggtgc acaacggcac 2760
cagcgccaga gccacaacaa ccccagccag caagagcacc cccttcagca tccctagcca 2820
ccacagcgac acccctacca cactggccag ccactccacc aagaccgatg cctctagcac 2880
ccaccactcc agcgtgcccc ctctgaccag cagcaaccac agcacaagcc cccagctgtc 2940
taccggcgtc tcattcttct ttctgtcctt ccacatcagc aacctgcagt tcaacagcag 3000
cctggaagat cccagcaccg actactacca ggaactgcag cgggatatca gcgagatgtt 3060
cctgcaaatc tacaagcagg gcggcttcct gggcctgagc aacatcaagt tcagacccgg 3120
cagcgtggtg gtgcagctga ccctggcttt ccgggaaggc accatcaacg tgcacgacgt 3180
ggaaacccag ttcaaccagt acaagaccga ggccgccagc cggtacaacc tgaccatctc 3240
cgatgtgtcc gtgtccgacg tgcccttccc attctctgcc cagtctggcg caggcgtgcc 3300
aggatgggga attgctctgc tggtgctcgt gtgcgtgctg gtggccctgg ccatcgtgta 3360
tctgattgcc ctggccgtgt gccagtgccg gcggaagaat tacggccagc tggacatctt 3420
ccccgccaga gacacctacc accccatgag cgagtacccc acataccaca cccacggcag 3480
atacgtgcca cccagctcca ccgacagatc cccctacgag aaagtgtctg ccggcaacgg 3540
cggcagctcc ctgagctaca caaatcctgc cgtggccgct gcctccgcca acctgggatc 3600
cggcagaatc ttcaacgccc actacgccgg ctacttcgcc gacctgctga tccacgacat 3660
cgagacaaac cctggcccca agctgaccat tgagagcact cccttcaacg tggctgaggg 3720
gaaggaggtg ctgctcctgg tgcacaatct gccccagcac ctgttcgggt actcctggta 3780
caagggagaa cgcgtggacg ggaaccggca gatcataggc tacgtcatcg gaacccagca 3840
ggccacaccc ggtccagcgt acagcggccg ggagattatc tacccgaacg cctccctgct 3900
gatccaaaac atcatccaga acgacaccgg tttctacact ctgcacgtga ttaagtcaga 3960
tctggtcaac gaagaggcca ccggccaatt cagggtgtac cccgaactcc ctaagccgtt 4020
catcacctcg aacaacagca acccggtcga ggatgaagat gcggtggcct tgacgtgcga 4080
acctgagatc cagaacacca cctacttgtg gtgggtgaac aatcagagcc tgccagtctc 4140
cccacgactc cagctgtcga acgacaacag gaccctgact ttgctgtccg tgactcggaa 4200
cgacgtgggc ccttatgaat gcggtatcca gaacaagctg tccgtggacc acagcgaccc 4260
tgtgatcctg aacgtccttt acgggccgga cgaccccacc atttccccgt cgtacactta 4320
ctaccggccg ggcgtgaacc tgtccctgtc gtgccacgct gcctccaatc cgccggccca 4380
gtactcctgg ctcatcgacg gaaacatcca gcagcacacc caagaactgt tcatctccaa 4440
cattaccgag aaaaactcgg gactttacac ctgtcaagcc aacaattccg ccagcggcca 4500
ctcccgcacc actgtcaaaa ctatcactgt gtccgccgaa ctcccgaagc ccagcatcag 4560
ctccaacaac tcgaagcccg tggaggataa ggacgctgtc gcgttcacct gtgaaccaga 4620
ggcacagaat accacctacc tttggtgggt caacggacag tccctgcctg tctcaccgag 4680
actgcagctg tcaaacggga ataggactct gaccttgttt aacgtcaccc ggaacgacgc 4740
ccgggcctac gtgtgcggca tccagaactc cgtgagcgca aaccggtctg acccagtgac 4800
cctggatgtg ctgtacggcc ccgacactcc gatcatttca ccccccgatt catcctacct 4860
gtccggcgct aacctcaacc tctcatgcca ctccgcatcc aaccccagcc cgcaatattc 4920
gtggcgcatt aacggaattc ctcagcaaca tacccaggtc ctgttcattg cgaagatcac 4980
ccctaacaac aacggaacct acgcctgctt tgtgtcaaac ctggccactg gtagaaacaa 5040
ctccatcgtg aagtccatta ccgtgtcggc gtccggatcc ggcgagggca gaggcagcct 5100
gctgacatgt ggcgacgtgg aagagaaccc tggccccgga gctgccccgg agccggagag 5160
gacccccgtt ggccagggat cgtgggccca tccgggacgc accaggggac catccgacag 5220
gggattctgt gtggtgtcac cggccaggcc agcagaagag gcaaccagcc tcgagggagc 5280
gttgtctgga accagacatt cccacccgtc ggtgggccgg cagcaccacg cgggaccacc 5340
gtccacttcc agaccgccac ggccatggga caccccttgc ccgcctgtgt atgccgagac 5400
taaacacttc ctgtactcat ccggagacaa ggaacagctt cggccgtcct tcctcctgtc 5460
gtcgctcaga ccgagcctga ccggagcacg cagattggtg gaaactatct tccttgggtc 5520
acgtccgtgg atgccaggta ccccacggcg cctcccgcgc ctcccacaga gatactggca 5580
gatgcggcct ctgttcctgg aattgctggg aaaccacgct cagtgcccgt acggagtcct 5640
gctcaagact cactgccctc tgagggcggc ggtcactccg gcggccggag tgtgcgcacg 5700
ggagaagccc cagggaagcg tggcagctcc ggaagaggag gacaccgatc cgcgccgcct 5760
cgtgcaactt ctgcgccagc actcctcgcc ctggcaagtc tacgggttcg tccgcgcctg 5820
cctgcgccgc ctggtgccgc ctgggctctg gggttcccgg cataacgagc gccgcttcct 5880
gagaaatact aagaagttta tctcacttgg aaaacatgcc aagttgtcgc tgcaagaact 5940
cacgtggaag atgtcagtcc gcgattgcgc ctggctgcgc cgctcgccgg gcgtcgggtg 6000
tgttccagct gcagaacacc gcctgagaga agaaattctg gccaaatttc tgcattggct 6060
gatgtcagtg tacgtggtcg agctgctgcg ctcctttttc tacgtcactg agactacctt 6120
tcaaaagaac cgcctgttct tctaccgcaa atctgtgtgg agcaagctgc agtcaatcgg 6180
cattcgccag catctgaaga gggtgcagct gcgggaactt tccgaggcag aagtccgcca 6240
gcaccgggag gcccggccgg cgcttctcac gtcgcgtctg agattcatcc caaagcccga 6300
cgggctgagg cctatcgtca acatggatta cgtcgtgggc gctcgcacct ttcgccgtga 6360
aaagcgggcc gaacgcttga cctcacgggt gaaggccctc ttctccgtgc tgaactacga 6420
gagagcaaga cggcctggcc tgctgggagc ttcggtgctg ggactggacg atatccaccg 6480
ggcttggcgg acctttgttc tccgggtgag agcccaagac cctccgccgg aactgtactt 6540
cgtgaaggtg gcgatcaccg gagcctatga tactattccg caagatcgac tcaccgaagt 6600
catcgcctcg atcatcaaac cgcagaacac ttactgcgtc aggcggtacg ccgtggtcca 6660
gaaggccgcg catggccacg tgagaaaggc gttcaagtcg cacgtgtcca ctctcaccga 6720
cctccagcct tacatgaggc aattcgttgc gcatttgcaa gagacttcgc ccctgagaga 6780
tgcggtggtc atcgagcaga gctccagcct gaacgaagcg agcagcggtc tgtttgacgt 6840
gttcctccgc ttcatgtgtc atcacgcggt gcgaatcagg ggaaaatcat acgtgcagtg 6900
ccagggaatc ccacaaggca gcattctgtc gactctcttg tgttcccttt gctacggcga 6960
tatggaaaac aagctgttcg ctgggatcag acgggacggg ttgctgctca gactggtgga 7020
cgacttcctg ctggtgactc cgcacctcac tcacgccaaa acctttctcc gcactctggt 7080
gaggggagtg ccagaatacg gctgtgtggt caatctccgg aaaactgtgg tgaatttccc 7140
tgtcgaggat gaggcactcg gaggaaccgc atttgtccaa atgccagcac atggcctgtt 7200
cccatggtgc ggtctgctgc tggacacccg aactcttgaa gtgcagtccg actactccag 7260
ctatgcccgg acgagcatcc gcgccagcct cactttcaat cgcggcttta aggccggacg 7320
aaacatgcgc agaaagcttt tcggagtcct ccggcttaaa tgccattcgc tctttctcga 7380
tctccaagtc aattcgctgc agaccgtgtg cacgaacatc tacaagatcc tgctgctcca 7440
agcctaccgg ttccacgctt gcgtgcttca gctgccgttt caccaacagg tgtggaagaa 7500
cccgaccttc tttctgcggg tcattagcga tactgcctcc ctgtgttact caatcctcaa 7560
ggcaaagaac gccggaatgt cgctgggtgc gaaaggagcc gcgggacctc ttcctagcga 7620
agcggtgcag tggctctgcc accaggcttt cctcctgaag ctgaccaggc acagagtgac 7680
ctacgtcccg ctgctgggct cgctgcgcac tgcacagacc cagctgtcta gaaaactccc 7740
cggcaccacc ctgaccgctc tggaagccgc cgccaaccca gcattgccgt cagatttcaa 7800
gaccatcttg gactgaagat ctgggcccta acaaaacaaa aagatggggt tattccctaa 7860
acttcatggg ttacgtaatt ggaagttggg ggacattgcc acaagatcat attgtacaaa 7920
agatcaaaca ctgttttaga aaacttcctg taaacaggcc tattgattgg aaagtatgtc 7980
aaaggattgt gggtcttttg ggctttgctg ctccatttac acaatgtgga tatcctgcct 8040
taatgccttt gtatgcatgt atacaagcta aacaggcttt cactttctcg ccaacttaca 8100
aggcctttct aagtaaacag tacatgaacc tttaccccgt tgctcggcaa cggcctggtc 8160
tgtgccaagt gtttgctgac gcaaccccca ctggctgggg cttggccata ggccatcagc 8220
gcatgcgtgg aacctttgtg gctcctctgc cgatccatac tgcggaactc ctagccgctt 8280
gttttgctcg cagccggtct ggagcaaagc tcataggaac tgacaattct gtcgtcctct 8340
cgcggaaata tacatcgttt cgatctacgt atgatctttt tccctctgcc aaaaattatg 8400
gggacatcat gaagcccctt gagcatctga cttctggcta ataaaggaaa tttattttca 8460
ttgcaatagt gtgttggaat tttttgtgtc tctcactcgg aaggaattct gcattaatga 8520
atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc 8580
actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg 8640
gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc 8700
cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc 8760
ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga 8820
ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc 8880
ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat 8940
agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg 9000
cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc 9060
aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga 9120
gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact 9180
agaagaacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt 9240
ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag 9300
cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg 9360
tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa 9420
aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata 9480
tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg 9540
atctgtctat ttcgttcatc catagttgcc tgactc 9576
<210> 66
<211> 36088
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 66
ccatcttcaa taatatacct caaacttttt gtgcgcgtta atatgcaaat gaggcgtttg 60
aatttgggga ggaagggcgg tgattggtcg agggatgagc gaccgttagg ggcggggcga 120
gtgacgtttt gatgacgtgg ttgcgaggag gagccagttt gcaagttctc gtgggaaaag 180
tgacgtcaaa cgaggtgtgg tttgaacacg gaaatactca attttcccgc gctctctgac 240
aggaaatgag gtgtttctgg gcggatgcaa gtgaaaacgg gccattttcg cgcgaaaact 300
gaatgaggaa gtgaaaatct gagtaatttc gcgtttatgg cagggaggag tatttgccga 360
gggccgagta gactttgacc gattacgtgg gggtttcgat taccgtgttt ttcacctaaa 420
tttccgcgta cggtgtcaaa gtccggtgtt tttactactg taatagtaat caattacggg 480
gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc 540
gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat 600
agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc 660
ccacttggca gtacatcaag tgtatcatat gccaagtacg ccccctattg acgtcaatga 720
cggtaaatgg cccgcctggc attatgccca gtacatgacc ttatgggact ttcctacttg 780
gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacat 840
caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt 900
caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactc 960
cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg gaggtctata taagcagagc 1020
tgtccctatc agtgatagag atctccctat cagtgataga gagtttagtg aaccgtcaga 1080
tccgctaggg taccgcgatc accatggcta gcacccctgg aacccagagc cccttcttcc 1140
ttctgctgct gctgaccgtg ctgactgtcg tgacaggctc tggccacgcc agctctacac 1200
ctggcggcga gaaagagaca agcgccaccc agagaagcag cgtgccaagc agcaccgaga 1260
agaacgccgt gtccatgacc agctccgtgc tgagcagcca ctctcctggc agcggcagca 1320
gcacaacaca gggccaggat gtgacactgg cccctgccac agaacctgcc tctggatctg 1380
ccgccacctg gggacaggac gtgacaagcg tgccagtgac cagacctgcc ctgggctcta 1440
caacaccccc tgcccacgat gtgaccagcg cccctgataa caagcctgcc cctggaagca 1500
cagcccctcc agctcatggc gtgacctctg ccccagatac cagaccagcc ccaggatcta 1560
cagccccacc cgcacacggc gtgacaagtg cccctgacac aagacccgct ccaggctcta 1620
ctgctcctcc tgcccatggc gtgacaagcg ctcccgatac aaggccagct cctggctcca 1680
cagcaccacc agcacatggc gtgacatcag ctcccgacac tagacctgct cccggatcaa 1740
ccgctccacc agctcacggc gtgaccagcg cacctgatac cagacctgct ctgggaagca 1800
ccgcccctcc cgtgcacaat gtgacatctg cttccggcag cgccagcggc tctgcctcta 1860
cactggtgca caacggcacc agcgccagag ccacaacaac cccagccagc aagagcaccc 1920
ccttcagcat ccctagccac cacagcgaca cccctaccac actggccagc cactccacca 1980
agaccgatgc ctctagcacc caccactcca gcgtgccccc tctgaccagc agcaaccaca 2040
gcacaagccc ccagctgtct accggcgtct cattcttctt tctgtccttc cacatcagca 2100
acctgcagtt caacagcagc ctggaagatc ccagcaccga ctactaccag gaactgcagc 2160
gggatatcag cgagatgttc ctgcaaatct acaagcaggg cggcttcctg ggcctgagca 2220
acatcaagtt cagacccggc agcgtggtgg tgcagctgac cctggctttc cgggaaggca 2280
ccatcaacgt gcacgacgtg gaaacccagt tcaaccagta caagaccgag gccgccagcc 2340
ggtacaacct gaccatctcc gatgtgtccg tgtccgacgt gcccttccca ttctctgccc 2400
agtctggcgc aggcgtgcca ggatggggaa ttgctctgct ggtgctcgtg tgcgtgctgg 2460
tggccctggc catcgtgtat ctgattgccc tggccgtgtg ccagtgccgg cggaagaatt 2520
acggccagct ggacatcttc cccgccagag acacctacca ccccatgagc gagtacccca 2580
cataccacac ccacggcaga tacgtgccac ccagctccac cgacagatcc ccctacgaga 2640
aagtgtctgc cggcaacggc ggcagctccc tgagctacac aaatcctgcc gtggccgctg 2700
cctccgccaa cctgggatcc ggcagaatct tcaacgccca ctacgccggc tacttcgccg 2760
acctgctgat ccacgacatc gagacaaacc ctggccccaa gctgaccatt gagagcactc 2820
ccttcaacgt ggctgagggg aaggaggtgc tgctcctggt gcacaatctg ccccagcacc 2880
tgttcgggta ctcctggtac aagggagaac gcgtggacgg gaaccggcag atcataggct 2940
acgtcatcgg aacccagcag gccacacccg gtccagcgta cagcggccgg gagattatct 3000
acccgaacgc ctccctgctg atccaaaaca tcatccagaa cgacaccggt ttctacactc 3060
tgcacgtgat taagtcagat ctggtcaacg aagaggccac cggccaattc agggtgtacc 3120
ccgaactccc taagccgttc atcacctcga acaacagcaa cccggtcgag gatgaagatg 3180
cggtggcctt gacgtgcgaa cctgagatcc agaacaccac ctacttgtgg tgggtgaaca 3240
atcagagcct gccagtctcc ccacgactcc agctgtcgaa cgacaacagg accctgactt 3300
tgctgtccgt gactcggaac gacgtgggcc cttatgaatg cggtatccag aacaagctgt 3360
ccgtggacca cagcgaccct gtgatcctga acgtccttta cgggccggac gaccccacca 3420
tttccccgtc gtacacttac taccggccgg gcgtgaacct gtccctgtcg tgccacgctg 3480
cctccaatcc gccggcccag tactcctggc tcatcgacgg aaacatccag cagcacaccc 3540
aagaactgtt catctccaac attaccgaga aaaactcggg actttacacc tgtcaagcca 3600
acaattccgc cagcggccac tcccgcacca ctgtcaaaac tatcactgtg tccgccgaac 3660
tcccgaagcc cagcatcagc tccaacaact cgaagcccgt ggaggataag gacgctgtcg 3720
cgttcacctg tgaaccagag gcacagaata ccacctacct ttggtgggtc aacggacagt 3780
ccctgcctgt ctcaccgaga ctgcagctgt caaacgggaa taggactctg accttgttta 3840
acgtcacccg gaacgacgcc cgggcctacg tgtgcggcat ccagaactcc gtgagcgcaa 3900
accggtctga cccagtgacc ctggatgtgc tgtacggccc cgacactccg atcatttcac 3960
cccccgattc atcctacctg tccggcgcta acctcaacct ctcatgccac tccgcatcca 4020
accccagccc gcaatattcg tggcgcatta acggaattcc tcagcaacat acccaggtcc 4080
tgttcattgc gaagatcacc cctaacaaca acggaaccta cgcctgcttt gtgtcaaacc 4140
tggccactgg tagaaacaac tccatcgtga agtccattac cgtgtcggcg tccggatccg 4200
gcgagggcag aggcagcctg ctgacatgtg gcgacgtgga agagaaccct ggccccggag 4260
ctgccccgga gccggagagg acccccgttg gccagggatc gtgggcccat ccgggacgca 4320
ccaggggacc atccgacagg ggattctgtg tggtgtcacc ggccaggcca gcagaagagg 4380
caaccagcct cgagggagcg ttgtctggaa ccagacattc ccacccgtcg gtgggccggc 4440
agcaccacgc gggaccaccg tccacttcca gaccgccacg gccatgggac accccttgcc 4500
cgcctgtgta tgccgagact aaacacttcc tgtactcatc cggagacaag gaacagcttc 4560
ggccgtcctt cctcctgtcg tcgctcagac cgagcctgac cggagcacgc agattggtgg 4620
aaactatctt ccttgggtca cgtccgtgga tgccaggtac cccacggcgc ctcccgcgcc 4680
tcccacagag atactggcag atgcggcctc tgttcctgga attgctggga aaccacgctc 4740
agtgcccgta cggagtcctg ctcaagactc actgccctct gagggcggcg gtcactccgg 4800
cggccggagt gtgcgcacgg gagaagcccc agggaagcgt ggcagctccg gaagaggagg 4860
acaccgatcc gcgccgcctc gtgcaacttc tgcgccagca ctcctcgccc tggcaagtct 4920
acgggttcgt ccgcgcctgc ctgcgccgcc tggtgccgcc tgggctctgg ggttcccggc 4980
ataacgagcg ccgcttcctg agaaatacta agaagtttat ctcacttgga aaacatgcca 5040
agttgtcgct gcaagaactc acgtggaaga tgtcagtccg cgattgcgcc tggctgcgcc 5100
gctcgccggg cgtcgggtgt gttccagctg cagaacaccg cctgagagaa gaaattctgg 5160
ccaaatttct gcattggctg atgtcagtgt acgtggtcga gctgctgcgc tcctttttct 5220
acgtcactga gactaccttt caaaagaacc gcctgttctt ctaccgcaaa tctgtgtgga 5280
gcaagctgca gtcaatcggc attcgccagc atctgaagag ggtgcagctg cgggaacttt 5340
ccgaggcaga agtccgccag caccgggagg cccggccggc gcttctcacg tcgcgtctga 5400
gattcatccc aaagcccgac gggctgaggc ctatcgtcaa catggattac gtcgtgggcg 5460
ctcgcacctt tcgccgtgaa aagcgggccg aacgcttgac ctcacgggtg aaggccctct 5520
tctccgtgct gaactacgag agagcaagac ggcctggcct gctgggagct tcggtgctgg 5580
gactggacga tatccaccgg gcttggcgga cctttgttct ccgggtgaga gcccaagacc 5640
ctccgccgga actgtacttc gtgaaggtgg cgatcaccgg agcctatgat actattccgc 5700
aagatcgact caccgaagtc atcgcctcga tcatcaaacc gcagaacact tactgcgtca 5760
ggcggtacgc cgtggtccag aaggccgcgc atggccacgt gagaaaggcg ttcaagtcgc 5820
acgtgtccac tctcaccgac ctccagcctt acatgaggca attcgttgcg catttgcaag 5880
agacttcgcc cctgagagat gcggtggtca tcgagcagag ctccagcctg aacgaagcga 5940
gcagcggtct gtttgacgtg ttcctccgct tcatgtgtca tcacgcggtg cgaatcaggg 6000
gaaaatcata cgtgcagtgc cagggaatcc cacaaggcag cattctgtcg actctcttgt 6060
gttccctttg ctacggcgat atggaaaaca agctgttcgc tgggatcaga cgggacgggt 6120
tgctgctcag actggtggac gacttcctgc tggtgactcc gcacctcact cacgccaaaa 6180
cctttctccg cactctggtg aggggagtgc cagaatacgg ctgtgtggtc aatctccgga 6240
aaactgtggt gaatttccct gtcgaggatg aggcactcgg aggaaccgca tttgtccaaa 6300
tgccagcaca tggcctgttc ccatggtgcg gtctgctgct ggacacccga actcttgaag 6360
tgcagtccga ctactccagc tatgcccgga cgagcatccg cgccagcctc actttcaatc 6420
gcggctttaa ggccggacga aacatgcgca gaaagctttt cggagtcctc cggcttaaat 6480
gccattcgct ctttctcgat ctccaagtca attcgctgca gaccgtgtgc acgaacatct 6540
acaagatcct gctgctccaa gcctaccggt tccacgcttg cgtgcttcag ctgccgtttc 6600
accaacaggt gtggaagaac ccgaccttct ttctgcgggt cattagcgat actgcctccc 6660
tgtgttactc aatcctcaag gcaaagaacg ccggaatgtc gctgggtgcg aaaggagccg 6720
cgggacctct tcctagcgaa gcggtgcagt ggctctgcca ccaggctttc ctcctgaagc 6780
tgaccaggca cagagtgacc tacgtcccgc tgctgggctc gctgcgcact gcacagaccc 6840
agctgtctag aaaactcccc ggcaccaccc tgaccgctct ggaagccgcc gccaacccag 6900
cattgccgtc agatttcaag accatcttgg actgacgcac ctcgagctga tcataatcag 6960
ccataccaca tttgtagagg ttttacttgc tttaaaaaac ctcccacacc tccccctgaa 7020
cctgaaacat aaaatgaatg caattgttgt tgttaacttg tttattgcag cttataatgg 7080
ttacaaataa agcaatagca tcacaaattt cacaaataaa gcattttttt cactgcattc 7140
tagttgtggt ttgtccaaac tcatcaatgt atcttaccag gtgccgagcc tgcgagtgcg 7200
gagggaagca tgccaggttc cagcccgtgt gtgtggatgt gacggaggac ctgcgacccg 7260
atcatttggt gttgccctgc accgggacgg agttcggttc cagcggggaa gaatctgact 7320
agagtgagta gtgttctggg gcgggggagg acctgcatga gggccagaat aactgaaatc 7380
tgtgcttttc tgtgtgttgc agcagcatga gcggaagcgg ctcctttgag ggaggggtat 7440
tcagccctta tctgacgggg cgtctcccct cctgggcggg agtgcgtcag aatgtgatgg 7500
gatccacggt ggacggccgg cccgtgcagc ccgcgaactc ttcaaccctg acctatgcaa 7560
ccctgagctc ttcgtcgttg gacgcagctg ccgccgcagc tgctgcatct gccgccagcg 7620
ccgtgcgcgg aatggccatg ggcgccggct actacggcac tctggtggcc aactcgagtt 7680
ccaccaataa tcccgccagc ctgaacgagg agaagctgtt gctgctgatg gcccagctcg 7740
aggccttgac ccagcgcctg ggcgagctga cccagcaggt ggctcagctg caggagcaga 7800
cgcgggccgc ggttgccacg gtgaaatcca aataaaaaat gaatcaataa ataaacggag 7860
acggttgttg attttaacac agagtctgaa tctttatttg atttttcgcg cgcggtaggc 7920
cctggaccac cggtctcgat cattgagcac ccggtggatc ttttccagga cccggtagag 7980
gtgggcttgg atgttgaggt acatgggcat gagcccgtcc cgggggtgga ggtagctcca 8040
ttgcagggcc tcgtgctcgg gggtggtgtt gtaaatcacc cagtcatagc aggggcgcag 8100
ggcatggtgt tgcacaatat ctttgaggag gagactgatg gccacgggca gccctttggt 8160
gtaggtgttt acaaatctgt tgagctggga gggatgcatg cggggggaga tgaggtgcat 8220
cttggcctgg atcttgagat tggcgatgtt accgcccaga tcccgcctgg ggttcatgtt 8280
gtgcaggacc accagcacgg tgtatccggt gcacttgggg aatttatcat gcaacttgga 8340
agggaaggcg tgaaagaatt tggcgacgcc tttgtgcccg cccaggtttt ccatgcactc 8400
atccatgatg atggcgatgg gcccgtgggc ggcggcctgg gcaaagacgt ttcgggggtc 8460
ggacacatca tagttgtggt cctgggtgag gtcatcatag gccattttaa tgaatttggg 8520
gcggagggtg ccggactggg ggacaaaggt accctcgatc ccgggggcgt agttcccctc 8580
acagatctgc atctcccagg ctttgagctc ggaggggggg atcatgtcca cctgcggggc 8640
gataaagaac acggtttccg gggcggggga gatgagctgg gccgaaagca agttccggag 8700
cagctgggac ttgccgcagc cggtggggcc gtagatgacc ccgatgaccg gctgcaggtg 8760
gtagttgagg gagagacagc tgccgtcctc ccggaggagg ggggccacct cgttcatcat 8820
ctcgcgcacg tgcatgttct cgcgcaccag ttccgccagg aggcgctctc cccccaggga 8880
taggagctcc tggagcgagg cgaagttttt cagcggcttg agtccgtcgg ccatgggcat 8940
tttggagagg gtttgttgca agagttccag gcggtcccag agctcggtga tgtgctctac 9000
ggcatctcga tccagcagac ctcctcgttt cgcgggttgg gacggctgcg ggagtagggc 9060
accagacgat gggcgtccag cgcagccagg gtccggtcct tccagggtcg cagcgtccgc 9120
gtcagggtgg tctccgtcac ggtgaagggg tgcgcgccgg gctgggcgct tgcgagggtg 9180
cgcttcaggc tcatccggct ggtcgaaaac cgctcccgat cggcgccctg cgcgtcggcc 9240
aggtagcaat tgaccatgag ttcgtagttg agcgcctcgg ccgcgtggcc tttggcgcgg 9300
agcttacctt tggaagtctg cccgcaggcg ggacagagga gggacttgag ggcgtagagc 9360
ttgggggcga ggaagacgga ctcgggggcg taggcgtccg cgccgcagtg ggcgcagacg 9420
gtctcgcact ccacgagcca ggtgaggtcg ggctggtcgg ggtcaaaaac cagtttcccg 9480
ccgttctttt tgatgcgttt cttacctttg gtctccatga gctcgtgtcc ccgctgggtg 9540
acaaagaggc tgtccgtgtc cccgtagacc gactttatgg gccggtcctc gagcggtgtg 9600
ccgcggtcct cctcgtagag gaaccccgcc cactccgaga cgaaagcccg ggtccaggcc 9660
agcacgaagg aggccacgtg ggacgggtag cggtcgttgt ccaccagcgg gtccaccttt 9720
tccagggtat gcaaacacat gtccccctcg tccacatcca ggaaggtgat tggcttgtaa 9780
gtgtaggcca cgtgaccggg ggtcccggcc gggggggtat aaaagggtgc gggtccctgc 9840
tcgtcctcac tgtcttccgg atcgctgtcc aggagcgcca gctgttgggg taggtattcc 9900
ctctcgaagg cgggcatgac ctcggcactc aggttgtcag tttctagaaa cgaggaggat 9960
ttgatattga cggtgccggc ggagatgcct ttcaagagcc cctcgtccat ctggtcagaa 10020
aagacgatct ttttgttgtc gagcttggtg gcgaaggagc cgtagagggc gttggagagg 10080
agcttggcga tggagcgcat ggtctggttt ttttccttgt cggcgcgctc cttggcggcg 10140
atgttgagct gcacgtactc gcgcgccacg cacttccatt cggggaagac ggtggtcagc 10200
tcgtcgggca cgattctgac ctgccagccc cgattatgca gggtgatgag gtccacactg 10260
gtggccacct cgccgcgcag gggctcatta gtccagcaga ggcgtccgcc cttgcgcgag 10320
cagaaggggg gcagggggtc cagcatgacc tcgtcggggg ggtcggcatc gatggtgaag 10380
atgccgggca ggaggtcggg gtcaaagtag ctgatggaag tggccagatc gtccagggca 10440
gcttgccatt cgcgcacggc cagcgcgcgc tcgtagggac tgaggggcgt gccccagggc 10500
atgggatggg taagcgcgga ggcgtacatg ccgcagatgt cgtagacgta gaggggctcc 10560
tcgaggatgc cgatgtaggt ggggtagcag cgccccccgc ggatgctggc gcgcacgtag 10620
tcatacagct cgtgcgaggg ggcgaggagc cccgggccca ggttggtgcg actgggcttt 10680
tcggcgcggt agacgatctg gcggaaaatg gcatgcgagt tggaggagat ggtgggcctt 10740
tggaagatgt tgaagtgggc gtggggcagt ccgaccgagt cgcggatgaa gtgggcgtag 10800
gagtcttgca gcttggcgac gagctcggcg gtgactagga cgtccagagc gcagtagtcg 10860
agggtctcct ggatgatgtc atacttgagc tgtccctttt gtttccacag ctcgcggttg 10920
agaaggaact cttcgcggtc cttccagtac tcttcgaggg ggaacccgtc ctgatctgca 10980
cggtaagagc ctagcatgta gaactggttg acggccttgt aggcgcagca gcccttctcc 11040
acggggaggg cgtaggcctg ggcggccttg cgcagggagg tgtgcgtgag ggcgaaagtg 11100
tccctgacca tgaccttgag gaactggtgc ttgaagtcga tatcgtcgca gcccccctgc 11160
tcccagagct ggaagtccgt gcgcttcttg taggcggggt tgggcaaagc gaaagtaaca 11220
tcgttgaaga ggatcttgcc cgcgcggggc ataaagttgc gagtgatgcg gaaaggttgg 11280
ggcacctcgg cccggttgtt gatgacctgg gcggcgagca cgatctcgtc gaagccgttg 11340
atgttgtggc ccacgatgta gagttccacg aatcgcggac ggcccttgac gtggggcagt 11400
ttcttgagct cctcgtaggt gagctcgtcg gggtcgctga gcccgtgctg ctcgagcgcc 11460
cagtcggcga gatgggggtt ggcgcggagg aaggaagtcc agagatccac ggccagggcg 11520
gtttgcagac ggtcccggta ctgacggaac tgctgcccga cggccatttt ttcgggggtg 11580
acgcagtaga aggtgcgggg gtccccgtgc cagcgatccc atttgagctg gagggcgaga 11640
tcgagggcga gctcgacgag ccggtcgtcc ccggagagtt tcatgaccag catgaagggg 11700
acgagctgct tgccgaagga ccccatccag gtgtaggttt ccacatcgta ggtgaggaag 11760
agcctttcgg tgcgaggatg cgagccgatg gggaagaact ggatctcctg ccaccaattg 11820
gaggaatggc tgttgatgtg atggaagtag aaatgccgac ggcgcgccga acactcgtgc 11880
ttgtgtttat acaagcggcc acagtgctcg caacgctgca cgggatgcac gtgctgcacg 11940
agctgtacct gagttccttt gacgaggaat ttcagtggga agtggagtcg tggcgcctgc 12000
atctcgtgct gtactacgtc gtggtggtcg gcctggccct cttctgcctc gatggtggtc 12060
atgctgacga gcccgcgcgg gaggcaggtc cagacctcgg cgcgagcggg tcggagagcg 12120
aggacgaggg cgcgcaggcc ggagctgtcc agggtcctga gacgctgcgg agtcaggtca 12180
gtgggcagcg gcggcgcgcg gttgacttgc aggagttttt ccagggcgcg cgggaggtcc 12240
agatggtact tgatctccac cgcgccattg gtggcgacgt cgatggcttg cagggtcccg 12300
tgcccctggg gtgtgaccac cgtcccccgt ttcttcttgg gcggctgggg cgacgggggc 12360
ggtgcctctt ccatggttag aagcggcggc gaggacgcgc gccgggcggc aggggcggct 12420
cggggcccgg aggcaggggc ggcaggggca cgtcggcgcc gcgcgcgggt aggttctggt 12480
actgcgcccg gagaagactg gcgtgagcga cgacgcgacg gttgacgtcc tggatctgac 12540
gcctctgggt gaaggccacg ggacccgtga gtttgaacct gaaagagagt tcgacagaat 12600
caatctcggt atcgttgacg gcggcctgcc gcaggatctc ttgcacgtcg cccgagttgt 12660
cctggtaggc gatctcggtc atgaactgct cgatctcctc ctcttgaagg tctccgcggc 12720
cggcgcgctc cacggtggcc gcgaggtcgt tggagatgcg gcccatgagc tgcgagaagg 12780
cgttcatgcc cgcctcgttc cagacgcggc tgtagaccac gacgccctcg ggatcgcggg 12840
cgcgcatgac cacctgggcg aggttgagct ccacgtggcg cgtgaagacc gcgtagttgc 12900
agaggcgctg gtagaggtag ttgagcgtgg tggcgatgtg ctcggtgacg aagaaataca 12960
tgatccagcg gcggagcggc atctcgctga cgtcgcccag cgcctccaaa cgttccatgg 13020
cctcgtaaaa gtccacggcg aagttgaaaa actgggagtt gcgcgccgag acggtcaact 13080
cctcctccag aagacggatg agctcggcga tggtggcgcg cacctcgcgc tcgaaggccc 13140
ccgggagttc ctccacttcc tcttcttcct cctccactaa catctcttct acttcctcct 13200
caggcggcag tggtggcggg ggagggggcc tgcgtcgccg gcggcgcacg ggcagacggt 13260
cgatgaagcg ctcgatggtc tcgccgcgcc ggcgtcgcat ggtctcggtg acggcgcgcc 13320
cgtcctcgcg gggccgcagc gtgaagacgc cgccgcgcat ctccaggtgg ccgggggggt 13380
ccccgttggg cagggagagg gcgctgacga tgcatcttat caattgcccc gtagggactc 13440
cgcgcaagga cctgagcgtc tcgagatcca cgggatctga aaaccgctga acgaaggctt 13500
cgagccagtc gcagtcgcaa ggtaggctga gcacggtttc ttctggcggg tcatgttggt 13560
tgggagcggg gcgggcgatg ctgctggtga tgaagttgaa ataggcggtt ctgagacggc 13620
ggatggtggc gaggagcacc aggtctttgg gcccggcttg ctggatgcgc agacggtcgg 13680
ccatgcccca ggcgtggtcc tgacacctgg ccaggtcctt gtagtagtcc tgcatgagcc 13740
gctccacggg cacctcctcc tcgcccgcgc ggccgtgcat gcgcgtgagc ccgaagccgc 13800
gctggggctg gacgagcgcc aggtcggcga cgacgcgctc ggcgaggatg gcttgctgga 13860
tctgggtgag ggtggtctgg aagtcatcaa agtcgacgaa gcggtggtag gctccggtgt 13920
tgatggtgta ggagcagttg gccatgacgg accagttgac ggtctggtgg cccggacgca 13980
cgagctcgtg gtacttgagg cgcgagtagg cgcgcgtgtc gaagatgtag tcgttgcagg 14040
tgcgcaccag gtactggtag ccgatgagga agtgcggcgg cggctggcgg tagagcggcc 14100
atcgctcggt ggcgggggcg ccgggcgcga ggtcctcgag catggtgcgg tggtagccgt 14160
agatgtacct ggacatccag gtgatgccgg cggcggtggt ggaggcgcgc gggaactcgc 14220
ggacgcggtt ccagatgttg cgcagcggca ggaagtagtt catggtgggc acggtctggc 14280
ccgtgaggcg cgcgcagtcg tggatgctct atacgggcaa aaacgaaagc ggtcagcggc 14340
tcgactccgt ggcctggagg ctaagcgaac gggttgggct gcgcgtgtac cccggttcga 14400
atctcgaatc aggctggagc cgcagctaac gtggtattgg cactcccgtc tcgacccaag 14460
cctgcaccaa ccctccagga tacggaggcg ggtcgttttg caactttttt ttggaggccg 14520
gatgagacta gtaagcgcgg aaagcggccg accgcgatgg ctcgctgccg tagtctggag 14580
aagaatcgcc agggttgcgt tgcggtgtgc cccggttcga ggccggccgg attccgcggc 14640
taacgagggc gtggctgccc cgtcgtttcc aagaccccat agccagccga cttctccagt 14700
tacggagcga gcccctcttt tgttttgttt gtttttgcca gatgcatccc gtactgcggc 14760
agatgcgccc ccaccaccct ccaccgcaac aacagccccc tccacagccg gcgcttctgc 14820
ccccgcccca gcagcaactt ccagccacga ccgccgcggc cgccgtgagc ggggctggac 14880
agagttatga tcaccagctg gccttggaag agggcgaggg gctggcgcgc ctgggggcgt 14940
cgtcgccgga gcggcacccg cgcgtgcaga tgaaaaggga cgctcgcgag gcctacgtgc 15000
ccaagcagaa cctgttcaga gacaggagcg gcgaggagcc cgaggagatg cgcgcggccc 15060
ggttccacgc ggggcgggag ctgcggcgcg gcctggaccg aaagagggtg ctgagggacg 15120
aggatttcga ggcggacgag ctgacgggga tcagccccgc gcgcgcgcac gtggccgcgg 15180
ccaacctggt cacggcgtac gagcagaccg tgaaggagga gagcaacttc caaaaatcct 15240
tcaacaacca cgtgcgcacc ctgatcgcgc gcgaggaggt gaccctgggc ctgatgcacc 15300
tgtgggacct gctggaggcc atcgtgcaga accccaccag caagccgctg acggcgcagc 15360
tgttcctggt ggtgcagcat agtcgggaca acgaagcgtt cagggaggcg ctgctgaata 15420
tcaccgagcc cgagggccgc tggctcctgg acctggtgaa cattctgcag agcatcgtgg 15480
tgcaggagcg cgggctgccg ctgtccgaga agctggcggc catcaacttc tcggtgctga 15540
gtttgggcaa gtactacgct aggaagatct acaagacccc gtacgtgccc atagacaagg 15600
aggtgaagat cgacgggttt tacatgcgca tgaccctgaa agtgctgacc ctgagcgacg 15660
atctgggggt gtaccgcaac gacaggatgc accgtgcggt gagcgccagc aggcggcgcg 15720
agctgagcga ccaggagctg atgcatagtc tgcagcgggc cctgaccggg gccgggaccg 15780
agggggagag ctactttgac atgggcgcgg acctgcactg gcagcccagc cgccgggcct 15840
tggaggcggc ggcaggaccc tacgtagaag aggtggacga tgaggtggac gaggagggcg 15900
agtacctgga agactgatgg cgcgaccgta tttttgctag atgcaacaac aacagccacc 15960
tcctgatccc gcgatgcggg cggcgctgca gagccagccg tccggcatta actcctcgga 16020
cgattggacc caggccatgc aacgcatcat ggcgctgacg acccgcaacc ccgaagcctt 16080
tagacagcag ccccaggcca accggctctc ggccatcctg gaggccgtgg tgccctcgcg 16140
ctccaacccc acgcacgaga aggtcctggc catcgtgaac gcgctggtgg agaacaaggc 16200
catccgcggc gacgaggccg gcctggtgta caacgcgctg ctggagcgcg tggcccgcta 16260
caacagcacc aacgtgcaga ccaacctgga ccgcatggtg accgacgtgc gcgaggccgt 16320
ggcccagcgc gagcggttcc accgcgagtc caacctggga tccatggtgg cgctgaacgc 16380
cttcctcagc acccagcccg ccaacgtgcc ccggggccag gaggactaca ccaacttcat 16440
cagcgccctg cgcctgatgg tgaccgaggt gccccagagc gaggtgtacc agtccgggcc 16500
ggactacttc ttccagacca gtcgccaggg cttgcagacc gtgaacctga gccaggcttt 16560
caagaacttg cagggcctgt ggggcgtgca ggccccggtc ggggaccgcg cgacggtgtc 16620
gagcctgctg acgccgaact cgcgcctgct gctgctgctg gtggccccct tcacggacag 16680
cggcagcatc aaccgcaact cgtacctggg ctacctgatt aacctgtacc gcgaggccat 16740
cggccaggcg cacgtggacg agcagaccta ccaggagatc acccacgtga gccgcgccct 16800
gggccaggac gacccgggca acctggaagc caccctgaac tttttgctga ccaaccggtc 16860
gcagaagatc ccgccccagt acgcgctcag caccgaggag gagcgcatcc tgcgttacgt 16920
gcagcagagc gtgggcctgt tcctgatgca ggagggggcc acccccagcg ccgcgctcga 16980
catgaccgcg cgcaacatgg agcccagcat gtacgccagc aaccgcccgt tcatcaataa 17040
actgatggac tacttgcatc gggcggccgc catgaactct gactatttca ccaacgccat 17100
cctgaatccc cactggctcc cgccgccggg gttctacacg ggcgagtacg acatgcccga 17160
ccccaatgac gggttcctgt gggacgatgt ggacagcagc gtgttctccc cccgaccggg 17220
tgctaacgag cgccccttgt ggaagaagga aggcagcgac cgacgcccgt cctcggcgct 17280
gtccggccgc gagggtgctg ccgcggcggt gcccgaggcc gccagtcctt tcccgagctt 17340
gcccttctcg ctgaacagta tccgcagcag cgagctgggc aggatcacgc gcccgcgctt 17400
gctgggcgaa gaggagtact tgaatgactc gctgttgaga cccgagcggg agaagaactt 17460
ccccaataac gggatagaaa gcctggtgga caagatgagc cgctggaaga cgtatgcgca 17520
ggagcacagg gacgatcccc gggcgtcgca gggggccacg agccggggca gcgccgcccg 17580
taaacgccgg tggcacgaca ggcagcgggg acagatgtgg gacgatgagg actccgccga 17640
cgacagcagc gtgttggact tgggtgggag tggtaacccg ttcgctcacc tgcgcccccg 17700
tatcgggcgc atgatgtaag agaaaccgaa aataaatgat actcaccaag gccatggcga 17760
ccagcgtgcg ttcgtttctt ctctgttgtt gttgtatcta gtatgatgag gcgtgcgtac 17820
ccggagggtc ctcctccctc gtacgagagc gtgatgcagc aggcgatggc ggcggcggcg 17880
atgcagcccc cgctggaggc tccttacgtg cccccgcggt acctggcgcc tacggagggg 17940
cggaacagca ttcgttactc ggagctggca cccttgtacg ataccacccg gttgtacctg 18000
gtggacaaca agtcggcgga catcgcctcg ctgaactacc agaacgacca cagcaacttc 18060
ctgaccaccg tggtgcagaa caatgacttc acccccacgg aggccagcac ccagaccatc 18120
aactttgacg agcgctcgcg gtggggcggc cagctgaaaa ccatcatgca caccaacatg 18180
cccaacgtga acgagttcat gtacagcaac aagttcaagg cgcgggtgat ggtctcccgc 18240
aagaccccca atggggtgac agtgacagag gattatgatg gtagtcagga tgagctgaag 18300
tatgaatggg tggaatttga gctgcccgaa ggcaacttct cggtgaccat gaccatcgac 18360
ctgatgaaca acgccatcat cgacaattac ttggcggtgg ggcggcagaa cggggtgctg 18420
gagagcgaca tcggcgtgaa gttcgacact aggaacttca ggctgggctg ggaccccgtg 18480
accgagctgg tcatgcccgg ggtgtacacc aacgaggctt tccatcccga tattgtcttg 18540
ctgcccggct gcggggtgga cttcaccgag agccgcctca gcaacctgct gggcattcgc 18600
aagaggcagc ccttccagga aggcttccag atcatgtacg aggatctgga ggggggcaac 18660
atccccgcgc tcctggatgt cgacgcctat gagaaaagca aggaggatgc agcagctgaa 18720
gcaactgcag ccgtagctac cgcctctacc gaggtcaggg gcgataattt tgcaagcgcc 18780
gcagcagtgg cagcggccga ggcggctgaa accgaaagta agatagtcat tcagccggtg 18840
gagaaggata gcaagaacag gagctacaac gtactaccgg acaagataaa caccgcctac 18900
cgcagctggt acctagccta caactatggc gaccccgaga agggcgtgcg ctcctggacg 18960
ctgctcacca cctcggacgt cacctgcggc gtggagcaag tctactggtc gctgcccgac 19020
atgatgcaag acccggtcac cttccgctcc acgcgtcaag ttagcaacta cccggtggtg 19080
ggcgccgagc tcctgcccgt ctactccaag agcttcttca acgagcaggc cgtctactcg 19140
cagcagctgc gcgccttcac ctcgcttacg cacgtcttca accgcttccc cgagaaccag 19200
atcctcgtcc gcccgcccgc gcccaccatt accaccgtca gtgaaaacgt tcctgctctc 19260
acagatcacg ggaccctgcc gctgcgcagc agtatccggg gagtccagcg cgtgaccgtt 19320
actgacgcca gacgccgcac ctgcccctac gtctacaagg ccctgggcat agtcgcgccg 19380
cgcgtcctct cgagccgcac cttctaaatg tccattctca tctcgcccag taataacacc 19440
ggttggggcc tgcgcgcgcc cagcaagatg tacggaggcg ctcgccaacg ctccacgcaa 19500
caccccgtgc gcgtgcgcgg gcacttccgc gctccctggg gcgccctcaa gggccgcgtg 19560
cggtcgcgca ccaccgtcga cgacgtgatc gaccaggtgg tggccgacgc gcgcaactac 19620
acccccgccg ccgcgcccgt ctccaccgtg gacgccgtca tcgacagcgt ggtggccgac 19680
gcgcgccggt acgcccgcgc caagagccgg cggcggcgca tcgcccggcg gcaccggagc 19740
acccccgcca tgcgcgcggc gcgagccttg ctgcgcaggg ccaggcgcac gggacgcagg 19800
gccatgctca gggcggccag acgcgcggct tcaggcgcca gcgccggcag gacccggaga 19860
cgcgcggcca cggcggcggc agcggccatc gccagcatgt cccgcccgcg gcgagggaac 19920
gtgtactggg tgcgcgacgc cgccaccggt gtgcgcgtgc ccgtgcgcac ccgcccccct 19980
cgcacttgaa gatgttcact tcgcgatgtt gatgtgtccc agcggcgagg aggatgtcca 20040
agcgcaaatt caaggaagag atgctccagg tcatcgcgcc tgagatctac ggccctgcgg 20100
tggtgaagga ggaaagaaag ccccgcaaaa tcaagcgggt caaaaaggac aaaaaggaag 20160
aagaaagtga tgtggacgga ttggtggagt ttgtgcgcga gttcgccccc cggcggcgcg 20220
tgcagtggcg cgggcggaag gtgcaaccgg tgctgagacc cggcaccacc gtggtcttca 20280
cgcccggcga gcgctccggc accgcttcca agcgctccta cgacgaggtg tacggggatg 20340
atgatattct ggagcaggcg gccgagcgcc tgggcgagtt tgcttacggc aagcgcagcc 20400
gttccgcacc gaaggaagag gcggtgtcca tcccgctgga ccacggcaac cccacgccga 20460
gcctcaagcc cgtgaccttg cagcaggtgc tgccgaccgc ggcgccgcgc cgggggttca 20520
agcgcgaggg cgaggatctg taccccacca tgcagctgat ggtgcccaag cgccagaagc 20580
tggaagacgt gctggagacc atgaaggtgg acccggacgt gcagcccgag gtcaaggtgc 20640
ggcccatcaa gcaggtggcc ccgggcctgg gcgtgcagac cgtggacatc aagattccca 20700
cggagcccat ggaaacgcag accgagccca tgatcaagcc cagcaccagc accatggagg 20760
tgcagacgga tccctggatg ccatcggctc ctagtcgaag accccggcgc aagtacggcg 20820
cggccagcct gctgatgccc aactacgcgc tgcatccttc catcatcccc acgccgggct 20880
accgcggcac gcgcttctac cgcggtcata ccagcagccg ccgccgcaag accaccactc 20940
gccgccgccg tcgccgcacc gccgctgcaa ccacccctgc cgccctggtg cggagagtgt 21000
accgccgcgg ccgcgcacct ctgaccctgc cgcgcgcgcg ctaccacccg agcatcgcca 21060
tttaaacttt cgcctgcttt gcagatcaat ggccctcaca tgccgccttc gcgttcccat 21120
tacgggctac cgaggaagaa aaccgcgccg tagaaggctg gcggggaacg ggatgcgtcg 21180
ccaccaccac cggcggcggc gcgccatcag caagcggttg gggggaggct tcctgcccgc 21240
gctgatcccc atcatcgccg cggcgatcgg ggcgatcccc ggcattgctt ccgtggcggt 21300
gcaggcctct cagcgccact gagacacact tggaaacatc ttgtaataaa ccaatggact 21360
ctgacgctcc tggtcctgtg atgtgttttc gtagacagat ggaagacatc aatttttcgt 21420
ccctggctcc gcgacacggc acgcggccgt tcatgggcac ctggagcgac atcggcacca 21480
gccaactgaa cgggggcgcc ttcaattgga gcagtctctg gagcgggctt aagaatttcg 21540
ggtccacgct taaaacctat ggcagcaagg cgtggaacag caccacaggg caggcgctga 21600
gggataagct gaaagagcag aacttccagc agaaggtggt cgatgggctc gcctcgggca 21660
tcaacggggt ggtggacctg gccaaccagg ccgtgcagcg gcagatcaac agccgcctgg 21720
acccggtgcc gcccgccggc tccgtggaga tgccgcaggt ggaggaggag ctgcctcccc 21780
tggacaagcg gggcgagaag cgaccccgcc ccgatgcgga ggagacgctg ctgacgcaca 21840
cggacgagcc gcccccgtac gaggaggcgg tgaaactggg tctgcccacc acgcggccca 21900
tcgcgcccct ggccaccggg gtgctgaaac ccgaaaagcc cgcgaccctg gacttgcctc 21960
ctccccagcc ttcccgcccc tctacagtgg ctaagcccct gccgccggtg gccgtggccc 22020
gcgcgcgacc cgggggcacc gcccgccctc atgcgaactg gcagagcact ctgaacagca 22080
tcgtgggtct gggagtgcag agtgtgaagc gccgccgctg ctattaaacc taccgtagcg 22140
cttaacttgc ttgtctgtgt gtgtatgtat tatgtcgccg ccgccgctgt ccaccagaag 22200
gaggagtgaa gaggcgcgtc gccgagttgc aagatggcca ccccatcgat gctgccccag 22260
tgggcgtaca tgcacatcgc cggacaggac gcttcggagt acctgagtcc gggtctggtg 22320
cagtttgccc gcgccacaga cacctacttc agtctgggga acaagtttag gaaccccacg 22380
gtggcgccca cgcacgatgt gaccaccgac cgcagccagc ggctgacgct gcgcttcgtg 22440
cccgtggacc gcgaggacaa cacctactcg tacaaagtgc gctacacgct ggccgtgggc 22500
gacaaccgcg tgctggacat ggccagcacc tactttgaca tccgcggcgt gctggatcgg 22560
ggccctagct tcaaacccta ctccggcacc gcctacaaca gtctggcccc caagggagca 22620
cccaacactt gtcagtggac atataaagcc gatggtgaaa ctgccacaga aaaaacctat 22680
acatatggaa atgcacccgt gcagggcatt aacatcacaa aagatggtat tcaacttgga 22740
actgacaccg atgatcagcc aatctacgca gataaaacct atcagcctga acctcaagtg 22800
ggtgatgctg aatggcatga catcactggt actgatgaaa agtatggagg cagagctctt 22860
aagcctgata ccaaaatgaa gccttgttat ggttcttttg ccaagcctac taataaagaa 22920
ggaggtcagg caaatgtgaa aacaggaaca ggcactacta aagaatatga catagacatg 22980
gctttctttg acaacagaag tgcggctgct gctggcctag ctccagaaat tgttttgtat 23040
actgaaaatg tggatttgga aactccagat acccatattg tatacaaagc aggcacagat 23100
gacagcagct cttctattaa tttgggtcag caagccatgc ccaacagacc taactacatt 23160
ggtttcagag acaactttat cgggctcatg tactacaaca gcactggcaa tatgggggtg 23220
ctggccggtc aggcttctca gctgaatgct gtggttgact tgcaagacag aaacaccgag 23280
ctgtcctacc agctcttgct tgactctctg ggtgacagaa cccggtattt cagtatgtgg 23340
aatcaggcgg tggacagcta tgatcctgat gtgcgcatta ttgaaaatca tggtgtggag 23400
gatgaacttc ccaactattg tttccctctg gatgctgttg gcagaacaga tacttatcag 23460
ggaattaagg ctaatggaac tgatcaaacc acatggacca aagatgacag tgtcaatgat 23520
gctaatgaga taggcaaggg taatccattc gccatggaaa tcaacatcca agccaacctg 23580
tggaggaact tcctctacgc caacgtggcc ctgtacctgc ccgactctta caagtacacg 23640
ccggccaatg ttaccctgcc caccaacacc aacacctacg attacatgaa cggccgggtg 23700
gtggcgccct cgctggtgga ctcctacatc aacatcgggg cgcgctggtc gctggatccc 23760
atggacaacg tgaacccctt caaccaccac cgcaatgcgg ggctgcgcta ccgctccatg 23820
ctcctgggca acgggcgcta cgtgcccttc cacatccagg tgccccagaa atttttcgcc 23880
atcaagagcc tcctgctcct gcccgggtcc tacacctacg agtggaactt ccgcaaggac 23940
gtcaacatga tcctgcagag ctccctcggc aacgacctgc gcacggacgg ggcctccatc 24000
tccttcacca gcatcaacct ctacgccacc ttcttcccca tggcgcacaa cacggcctcc 24060
acgctcgagg ccatgctgcg caacgacacc aacgaccagt ccttcaacga ctacctctcg 24120
gcggccaaca tgctctaccc catcccggcc aacgccacca acgtgcccat ctccatcccc 24180
tcgcgcaact gggccgcctt ccgcggctgg tccttcacgc gtctcaagac caaggagacg 24240
ccctcgctgg gctccgggtt cgacccctac ttcgtctact cgggctccat cccctacctc 24300
gacggcacct tctacctcaa ccacaccttc aagaaggtct ccatcacctt cgactcctcc 24360
gtcagctggc ccggcaacga ccggctcctg acgcccaacg agttcgaaat caagcgcacc 24420
gtcgacggcg agggctacaa cgtggcccag tgcaacatga ccaaggactg gttcctggtc 24480
cagatgctgg cccactacaa catcggctac cagggcttct acgtgcccga gggctacaag 24540
gaccgcatgt actccttctt ccgcaacttc cagcccatga gccgccaggt ggtggacgag 24600
gtcaactaca aggactacca ggccgtcacc ctggcctacc agcacaacaa ctcgggcttc 24660
gtcggctacc tcgcgcccac catgcgccag ggccagccct accccgccaa ctacccctac 24720
ccgctcatcg gcaagagcgc cgtcaccagc gtcacccaga aaaagttcct ctgcgacagg 24780
gtcatgtggc gcatcccctt ctccagcaac ttcatgtcca tgggcgcgct caccgacctc 24840
ggccagaaca tgctctatgc caactccgcc cacgcgctag acatgaattt cgaagtcgac 24900
cccatggatg agtccaccct tctctatgtt gtcttcgaag tcttcgacgt cgtccgagtg 24960
caccagcccc accgcggcgt catcgaggcc gtctacctgc gcaccccctt ctcggccggt 25020
aacgccacca cctaagctct tgcttcttgc aagccatggc cgcgggctcc ggcgagcagg 25080
agctcagggc catcatccgc gacctgggct gcgggcccta cttcctgggc accttcgata 25140
agcgcttccc gggattcatg gccccgcaca agctggcctg cgccatcgtc aacacggccg 25200
gccgcgagac cgggggcgag cactggctgg ccttcgcctg gaacccgcgc tcgaacacct 25260
gctacctctt cgaccccttc gggttctcgg acgagcgcct caagcagatc taccagttcg 25320
agtacgaggg cctgctgcgc cgcagcgccc tggccaccga ggaccgctgc gtcaccctgg 25380
aaaagtccac ccagaccgtg cagggtccgc gctcggccgc ctgcgggctc ttctgctgca 25440
tgttcctgca cgccttcgtg cactggcccg accgccccat ggacaagaac cccaccatga 25500
acttgctgac gggggtgccc aacggcatgc tccagtcgcc ccaggtggaa cccaccctgc 25560
gccgcaacca ggaggcgctc taccgcttcc tcaactccca ctccgcctac tttcgctccc 25620
accgcgcgcg catcgagaag gccaccgcct tcgaccgcat gaatcaagac atgtaaaccg 25680
tgtgtgtatg ttaaatgtct ttaataaaca gcactttcat gttacacatg catctgagat 25740
gatttattta gaaatcgaaa gggttctgcc gggtctcggc atggcccgcg ggcagggaca 25800
cgttgcggaa ctggtacttg gccagccact tgaactcggg gatcagcagt ttgggcagcg 25860
gggtgtcggg gaaggagtcg gtccacagct tccgcgtcag ttgcagggcg cccagcaggt 25920
cgggcgcgga gatcttgaaa tcgcagttgg gacccgcgtt ctgcgcgcgg gagttgcggt 25980
acacggggtt gcagcactgg aacaccatca gggccgggtg cttcacgctc gccagcaccg 26040
tcgcgtcggt gatgctctcc acgtcgaggt cctcggcgtt ggccatcccg aagggggtca 26100
tcttgcaggt ctgccttccc atggtgggca cgcacccggg cttgtggttg caatcgcagt 26160
gcagggggat cagcatcatc tgggcctggt cggcgttcat ccccgggtac atggccttca 26220
tgaaagcctc caattgcctg aacgcctgct gggccttggc tccctcggtg aagaagaccc 26280
cgcaggactt gctagagaac tggttggtgg cgcacccggc gtcgtgcacg cagcagcgcg 26340
cgtcgttgtt ggccagctgc accacgctgc gcccccagcg gttctgggtg atcttggccc 26400
ggtcggggtt ctccttcagc gcgcgctgcc cgttctcgct cgccacatcc atctcgatca 26460
tgtgctcctt ctggatcatg gtggtcccgt gcaggcaccg cagcttgccc tcggcctcgg 26520
tgcacccgtg cagccacagc gcgcacccgg tgcactccca gttcttgtgg gcgatctggg 26580
aatgcgcgtg cacgaagccc tgcaggaagc ggcccatcat ggtggtcagg gtcttgttgc 26640
tagtgaaggt cagcggaatg ccgcggtgct cctcgttgat gtacaggtgg cagatgcggc 26700
ggtacacctc gccctgctcg ggcatcagct ggaagttggc tttcaggtcg gtctccacgc 26760
ggtagcggtc catcagcata gtcatgattt ccataccctt ctcccaggcc gagacgatgg 26820
gcaggctcat agggttcttc accatcatct tagcgctagc agccgcggcc agggggtcgc 26880
tctcgtccag ggtctcaaag ctccgcttgc cgtccttctc ggtgatccgc accggggggt 26940
agctgaagcc cacggccgcc agctcctcct cggcctgtct ttcgtcctcg ctgtcctggc 27000
tgacgtcctg caggaccaca tgcttggtct tgcggggttt cttcttgggc ggcagcggcg 27060
gcggagatgt tggagatggc gagggggagc gcgagttctc gctcaccact actatctctt 27120
cctcttcttg gtccgaggcc acgcggcggt aggtatgtct cttcgggggc agaggcggag 27180
gcgacgggct ctcgccgccg cgacttggcg gatggctggc agagcccctt ccgcgttcgg 27240
gggtgcgctc ccggcggcgc tctgactgac ttcctccgcg gccggccatt gtgttctcct 27300
agggaggaac aacaagcatg gagactcagc catcgccaac ctcgccatct gcccccaccg 27360
ccgacgagaa gcagcagcag cagaatgaaa gcttaaccgc cccgccgccc agccccgcca 27420
cctccgacgc ggccgtccca gacatgcaag agatggagga atccatcgag attgacctgg 27480
gctatgtgac gcccgcggag cacgaggagg agctggcagt gcgcttttca caagaagaga 27540
tacaccaaga acagccagag caggaagcag agaatgagca gagtcaggct gggctcgagc 27600
atgacggcga ctacctccac ctgagcgggg gggaggacgc gctcatcaag catctggccc 27660
ggcaggccac catcgtcaag gatgcgctgc tcgaccgcac cgaggtgccc ctcagcgtgg 27720
aggagctcag ccgcgcctac gagttgaacc tcttctcgcc gcgcgtgccc cccaagcgcc 27780
agcccaatgg cacctgcgag cccaacccgc gcctcaactt ctacccggtc ttcgcggtgc 27840
ccgaggccct ggccacctac cacatctttt tcaagaacca aaagatcccc gtctcctgcc 27900
gcgccaaccg cacccgcgcc gacgcccttt tcaacctggg tcccggcgcc cgcctacctg 27960
atatcgcctc cttggaagag gttcccaaga tcttcgaggg tctgggcagc gacgagactc 28020
gggccgcgaa cgctctgcaa ggagaaggag gagagcatga gcaccacagc gccctggtcg 28080
agttggaagg cgacaacgcg cggctggcgg tgctcaaacg cacggtcgag ctgacccatt 28140
tcgcctaccc ggctctgaac ctgcccccca aagtcatgag cgcggtcatg gaccaggtgc 28200
tcatcaagcg cgcgtcgccc atctccgagg acgagggcat gcaagactcc gaggagggca 28260
agcccgtggt cagcgacgag cagctggccc ggtggctggg tcctaatgct agtccccaga 28320
gtttggaaga gcggcgcaaa ctcatgatgg ccgtggtcct ggtgaccgtg gagctggagt 28380
gcctgcgccg cttcttcgcc gacgcggaga ccctgcgcaa ggtcgaggag aacctgcact 28440
acctcttcag gcacgggttc gtgcgccagg cctgcaagat ctccaacgtg gagctgacca 28500
acctggtctc ctacatgggc atcttgcacg agaaccgcct ggggcagaac gtgctgcaca 28560
ccaccctgcg cggggaggcc cggcgcgact acatccgcga ctgcgtctac ctctacctct 28620
gccacacctg gcagacgggc atgggcgtgt ggcagcagtg tctggaggag cagaacctga 28680
aagagctctg caagctcctg cagaagaacc tcaagggtct gtggaccggg ttcgacgagc 28740
gcaccaccgc ctcggacctg gccgacctca ttttccccga gcgcctcagg ctgacgctgc 28800
gcaacggcct gcccgacttt atgagccaaa gcatgttgca aaactttcgc tctttcatcc 28860
tcgaacgctc cggaatcctg cccgccacct gctccgcgct gccctcggac ttcgtgccgc 28920
tgaccttccg cgagtgcccc ccgccgctgt ggagccactg ctacctgctg cgcctggcca 28980
actacctggc ctaccactcg gacgtgatcg aggacgtcag cggcgagggc ctgctcgagt 29040
gccactgccg ctgcaacctc tgcacgccgc accgctccct ggcctgcaac ccccagctgc 29100
tgagcgagac ccagatcatc ggcaccttcg agttgcaagg gcccagcgaa ggcgagggtt 29160
cagccgccaa ggggggtctg aaactcaccc cggggctgtg gacctcggcc tacttgcgca 29220
agttcgtgcc cgaggactac catcccttcg agatcaggtt ctacgaggac caatcccatc 29280
cgcccaaggc cgagctgtcg gcctgcgtca tcacccaggg ggcgatcctg gcccaattgc 29340
aagccatcca gaaatcccgc caagaattct tgctgaaaaa gggccgcggg gtctacctcg 29400
acccccagac cggtgaggag ctcaaccccg gcttccccca ggatgccccg aggaaacaag 29460
aagctgaaag tggagctgcc gcccgtggag gatttggagg aagactggga gaacagcagt 29520
caggcagagg aggaggagat ggaggaagac tgggacagca ctcaggcaga ggaggacagc 29580
ctgcaagaca gtctggagga agacgaggag gaggcagagg aggaggtgga agaagcagcc 29640
gccgccagac cgtcgtcctc ggcgggggag aaagcaagca gcacggatac catctccgct 29700
ccgggtcggg gtcccgctcg accacacagt agatgggacg agaccggacg attcccgaac 29760
cccaccaccc agaccggtaa gaaggagcgg cagggataca agtcctggcg ggggcacaaa 29820
aacgccatcg tctcctgctt gcaggcctgc gggggcaaca tctccttcac ccggcgctac 29880
ctgctcttcc accgcggggt gaactttccc cgcaacatct tgcattacta ccgtcacctc 29940
cacagcccct actacttcca agaagaggca gcagcagcag aaaaagacca gcagaaaacc 30000
agcagctaga aaatccacag cggcggcagc aggtggactg aggatcgcgg cgaacgagcc 30060
ggcgcaaacc cgggagctga ggaaccggat ctttcccacc ctctatgcca tcttccagca 30120
gagtcggggg caggagcagg aactgaaagt caagaaccgt tctctgcgct cgctcacccg 30180
cagttgtctg tatcacaaga gcgaagacca acttcagcgc actctcgagg acgccgaggc 30240
tctcttcaac aagtactgcg cgctcactct taaagagtag cccgcgcccg cccagtcgca 30300
gaaaaaggcg ggaattacgt cacctgtgcc cttcgcccta gccgcctcca cccatcatca 30360
tgagcaaaga gattcccacg ccttacatgt ggagctacca gccccagatg ggcctggccg 30420
ccggtgccgc ccaggactac tccacccgca tgaattggct cagcgccggg cccgcgatga 30480
tctcacgggt gaatgacatc cgcgcccacc gaaaccagat actcctagaa cagtcagcgc 30540
tcaccgccac gccccgcaat cacctcaatc cgcgtaattg gcccgccgcc ctggtgtacc 30600
aggaaattcc ccagcccacg accgtactac ttccgcgaga cgcccaggcc gaagtccagc 30660
tgactaactc aggtgtccag ctggcgggcg gcgccaccct gtgtcgtcac cgccccgctc 30720
agggtataaa gcggctggtg atccggggca gaggcacaca gctcaacgac gaggtggtga 30780
gctcttcgct gggtctgcga cctgacggag tcttccaact cgccggatcg gggagatctt 30840
ccttcacgcc tcgtcaggcc gtcctgactt tggagagttc gtcctcgcag ccccgctcgg 30900
gtggcatcgg cactctccag ttcgtggagg agttcactcc ctcggtctac ttcaacccct 30960
tctccggctc ccccggccac tacccggacg agttcatccc gaacttcgac gccatcagcg 31020
agtcggtgga cggctacgat tgaatgtccc atggtggcgc agctgaccta gctcggcttc 31080
gacacctgga ccactgccgc cgcttccgct gcttcgctcg ggatctcgcc gagtttgcct 31140
actttgagct gcccgaggag caccctcagg gcccggccca cggagtgcgg atcgtcgtcg 31200
aagggggcct cgactcccac ctgcttcgga tcttcagcca gcgtccgatc ctggtcgagc 31260
gcgagcaagg acagaccctt ctgactctgt actgcatctg caaccacccc ggcctgcatg 31320
aaagtctttg ttgtctgctg tgtactgagt ataataaaag ctgagatcag cgactactcc 31380
ggacttccgt gtgtttaaac tcaccccctt atccagtgaa ataaagatca tattgatgat 31440
gattttacag aaataaaaaa taatcatttg atttgaaata aagatacaat catattgatg 31500
atttgagttt aacaaaaaaa taaagaatca cttacttgaa atctgatacc aggtctctgt 31560
ccatgttttc tgccaacacc acttcactcc cctcttccca gctctggtac tgcaggcccc 31620
ggcgggctgc aaacttcctc cacacgctga aggggatgtc aaattcctcc tgtccctcaa 31680
tcttcatttt atcttctatc agatgtccaa aaagcgcgtc cgggtggatg atgacttcga 31740
ccccgtctac ccctacgatg cagacaacgc accgaccgtg cccttcatca accccccctt 31800
cgtctcttca gatggattcc aagagaagcc cctgggggtg ttgtccctgc gactggccga 31860
ccccgtcacc accaagaacg gggaaatcac cctcaagctg ggagaggggg tggacctcga 31920
ttcctcggga aaactcatct ccaacacggc caccaaggcc gccgcccctc tcagtttttc 31980
caacaacacc atttccctta acatggatca ccccttttac actaaagatg gaaaattatc 32040
cttacaagtt tctccaccat taaatatact gagaacaagc attctaaaca cactagcttt 32100
aggttttgga tcaggtttag gactccgtgg ctctgccttg gcagtacagt tagtctctcc 32160
acttacattt gatactgatg gaaacataaa gcttacctta gacagaggtt tgcatgttac 32220
aacaggagat gcaattgaaa gcaacataag ctgggctaaa ggtttaaaat ttgaagatgg 32280
agccatagca accaacattg gaaatgggtt agagtttgga agcagtagta cagaaacagg 32340
tgttgatgat gcttacccaa tccaagttaa acttggatct ggccttagct ttgacagtac 32400
aggagccata atggctggta acaaagaaga cgataaactc actttgtgga caacacctga 32460
tccatcacca aactgtcaaa tactcgcaga aaatgatgca aaactaacac tttgcttgac 32520
taaatgtggt agtcaaatac tggccactgt gtcagtctta gttgtaggaa gtggaaacct 32580
aaaccccatt actggcaccg taagcagtgc tcaggtgttt ctacgttttg atgcaaacgg 32640
tgttctttta acagaacatt ctacactaaa aaaatactgg gggtataggc agggagatag 32700
catagatggc actccatata ccaatgctgt aggattcatg cccaatttaa aagcttatcc 32760
aaagtcacaa agttctacta ctaaaaataa tatagtaggg caagtataca tgaatggaga 32820
tgtttcaaaa cctatgcttc tcactataac cctcaatggt actgatgaca gcaacagtac 32880
atattcaatg tcattttcat acacctggac taatggaagc tatgttggag caacatttgg 32940
ggctaactct tataccttct catacatcgc ccaagaatga acactgtatc ccaccctgca 33000
tgccaaccct tcccacccca ctctgtggaa caaactctga aacacaaaat aaaataaagt 33060
tcaagtgttt tattgattca acagttttac aggattcgag cagttatttt tcctccaccc 33120
tcccaggaca tggaatacac caccctctcc ccccgcacag ccttgaacat ctgaatgcca 33180
ttggtgatgg acatgctttt ggtctccacg ttccacacag tttcagagcg agccagtctc 33240
gggtcggtca gggagatgaa accctccggg cactcccgca tctgcacctc acagctcaac 33300
agctgaggat tgtcctcggt ggtcgggatc acggttatct ggaagaagca gaagagcggc 33360
ggtgggaatc atagtccgcg aacgggatcg gccggtggtg tcgcatcagg ccccgcagca 33420
gtcgctgccg ccgccgctcc gtcaagctgc tgctcagggg gtccgggtcc agggactccc 33480
tcagcatgat gcccacggcc ctcagcatca gtcgtctggt gcggcgggcg cagcagcgca 33540
tgcggatctc gctcaggtcg ctgcagtacg tgcaacacag aaccaccagg ttgttcaaca 33600
gtccatagtt caacacgctc cagccgaaac tcatcgcggg aaggatgcta cccacgtggc 33660
cgtcgtacca gatcctcagg taaatcaagt ggtgccccct ccagaacacg ctgcccacgt 33720
acatgatctc cttgggcatg tggcggttca ccacctcccg gtaccacatc accctctggt 33780
tgaacatgca gccccggatg atcctgcgga accacagggc cagcaccgcc ccgcccgcca 33840
tgcagcgaag agaccccggg tcccggcaat ggcaatggag gacccaccgc tcgtacccgt 33900
ggatcatctg ggagctgaac aagtctatgt tggcacagca caggcatatg ctcatgcatc 33960
tcttcagcac tctcaactcc tcgggggtca aaaccatatc ccagggcacg gggaactctt 34020
gcaggacagc gaaccccgca gaacagggca atcctcgcac agaacttaca ttgtgcatgg 34080
acagggtatc gcaatcaggc agcaccgggt gatcctccac cagagaagcg cgggtctcgg 34140
tctcctcaca gcgtggtaag ggggccggcc gatacgggtg atggcgggac gcggctgatc 34200
gtgttcgcga ccgtgtcatg atgcagttgc tttcggacat tttcgtactt gctgtagcag 34260
aacctggtcc gggcgctgca caccgatcgc cggcggcggt ctcggcgctt ggaacgctcg 34320
gtgttgaaat tgtaaaacag ccactctctc agaccgtgca gcagatctag ggcctcagga 34380
gtgatgaaga tcccatcatg cctgatggct ctgatcacat cgaccaccgt ggaatgggcc 34440
agacccagcc agatgatgca attttgttgg gtttcggtga cggcggggga gggaagaaca 34500
ggaagaacca tgattaactt ttaatccaaa cggtctcgga gtacttcaaa atgaagatcg 34560
cggagatggc acctctcgcc cccgctgtgt tggtggaaaa taacagccag gtcaaaggtg 34620
atacggttct cgagatgttc cacggtggct tccagcaaag cctccacgcg cacatccaga 34680
aacaagacaa tagcgaaagc gggagggttc tctaattcct caatcatcat gttacactcc 34740
tgcaccatcc ccagataatt ttcatttttc cagccttgaa tgattcgaac tagttcctga 34800
ggtaaatcca agccagccat gataaagagc tcgcgcagag cgccctccac cggcattctt 34860
aagcacaccc tcataattcc aagatattct gctcctggtt cacctgcagc agattgacaa 34920
gcggaatatc aaaatctctg ccgcgatccc tgagctcctc cctcagcaat aactgtaagt 34980
actctttcat atcctctccg aaatttttag ccataggacc accaggaata agattagggc 35040
aagccacagt acagataaac cgaagtcctc cccagtgagc attgccaaat gcaagactgc 35100
tataagcatg ctggctagac ccggtgatat cttccagata actggacaga aaatcgccca 35160
ggcaattttt aagaaaatca acaaaagaaa aatcctccag gtggacgttt agagcctcgg 35220
gaacaacgat gaagtaaatg caagcggtgc gttccagcat ggttagttag ctgatctgta 35280
gaaaaaacaa aaatgaacat taaaccatgc tagcctggcg aacaggtggg taaatcgttc 35340
tctccagcac caggcaggcc acggggtctc cggcgcgacc ctcgtaaaaa ttgtcgctat 35400
gattgaaaac catcacagag agacgttccc ggtggccggc gtgaatgatt cgacaagatg 35460
aatacacccc cggaacattg gcgtccgcga gtgaaaaaaa gcgcccgagg aagcaataag 35520
gcactacaat gctcagtctc aagtccagca aagcgatgcc atgcggatga agcacaaaat 35580
tctcaggtgc gtacaaaatg taattactcc cctcctgcac aggcagcaaa gcccccgatc 35640
cctccaggta cacatacaaa gcctcagcgt ccatagctta ccgagcagca gcacacaaca 35700
ggcgcaagag tcagagaaag gctgagctct aacctgtcca cccgctctct gctcaatata 35760
tagcccagat ctacactgac gtaaaggcca aagtctaaaa atacccgcca aataatcaca 35820
cacgcccagc acacgcccag aaaccggtga cacactcaaa aaaatacgcg cacttcctca 35880
aacgcccaaa actgccgtca tttccgggtt cccacgctac gtcatcaaaa cacgactttc 35940
aaattccgtc gaccgttaaa aacgtcaccc gccccgcccc taacggtcgc ccgtctctca 36000
gccaatcagc gccccgcatc cccaaattca aacacctcat ttgcatatta acgcgcacaa 36060
aaagtttgag gtatattatt gatgatgg 36088
<210> 67
<211> 9576
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 67
ggcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 60
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 120
agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 180
tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg 240
tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat 300
ggcaaaagct tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca 360
tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 420
aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcaaatgcaa ccggcgcagg 480
aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg 540
aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 600
aaatgcttga tggtcggaag aggcataaat tccgtcagcc agtttagtct gaccatctca 660
tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg 720
ggcttcccat acaatcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat 780
ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga gcaagacgtt 840
tcccgttgaa tatggctcat aacacccctt gtattactgt ttatgtaagc agacaggtcg 900
acaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg tacatttata 960
ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt tattaatagt 1020
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 1080
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 1140
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 1200
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt ccgcccccta 1260
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttacggg 1320
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 1380
tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 1440
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 1500
gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 1560
atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 1620
ttgacctcca tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 1680
gaacgcggat tccccgtgcc aagagtgact caccgtccgg atctcagcaa gcaggtatgt 1740
actctccagg gtgggcctgg cttccccagt caagactcca gggatttgag ggacgctgtg 1800
ggctcttctc ttacatgtac cttttgcttg cctcaaccct gactatcttc caggtcagga 1860
tcccagagtc aggggtctgt attttcctgc tggtggctcc agttcaggaa cagtaaaccc 1920
tgctccgaat attgcctctc acatctcgtc aatctccgcg aggactgggg accctgtgac 1980
gaacatggct agcaagctga ccattgagag cactcccttc aacgtggctg aggggaagga 2040
ggtgctgctc ctggtgcaca atctgcccca gcacctgttc gggtactcct ggtacaaggg 2100
agaacgcgtg gacgggaacc ggcagatcat aggctacgtc atcggaaccc agcaggccac 2160
acccggtcca gcgtacagcg gccgggagat tatctacccg aacgcctccc tgctgatcca 2220
aaacatcatc cagaacgaca ccggtttcta cactctgcac gtgattaagt cagatctggt 2280
caacgaagag gccaccggcc aattcagggt gtaccccgaa ctccctaagc cgttcatcac 2340
ctcgaacaac agcaacccgg tcgaggatga agatgcggtg gccttgacgt gcgaacctga 2400
gatccagaac accacctact tgtggtgggt gaacaatcag agcctgccag tctccccacg 2460
actccagctg tcgaacgaca acaggaccct gactttgctg tccgtgactc ggaacgacgt 2520
gggcccttat gaatgcggta tccagaacaa gctgtccgtg gaccacagcg accctgtgat 2580
cctgaacgtc ctttacgggc cggacgaccc caccatttcc ccgtcgtaca cttactaccg 2640
gccgggcgtg aacctgtccc tgtcgtgcca cgctgcctcc aatccgccgg cccagtactc 2700
ctggctcatc gacggaaaca tccagcagca cacccaagaa ctgttcatct ccaacattac 2760
cgagaaaaac tcgggacttt acacctgtca agccaacaat tccgccagcg gccactcccg 2820
caccactgtc aaaactatca ctgtgtccgc cgaactcccg aagcccagca tcagctccaa 2880
caactcgaag cccgtggagg ataaggacgc tgtcgcgttc acctgtgaac cagaggcaca 2940
gaataccacc tacctttggt gggtcaacgg acagtccctg cctgtctcac cgagactgca 3000
gctgtcaaac gggaatagga ctctgacctt gtttaacgtc acccggaacg acgcccgggc 3060
ctacgtgtgc ggcatccaga actccgtgag cgcaaaccgg tctgacccag tgaccctgga 3120
tgtgctgtac ggccccgaca ctccgatcat ttcacccccc gattcatcct acctgtccgg 3180
cgctaacctc aacctctcat gccactccgc atccaacccc agcccgcaat attcgtggcg 3240
cattaacgga attcctcagc aacataccca ggtcctgttc attgcgaaga tcacccctaa 3300
caacaacgga acctacgcct gctttgtgtc aaacctggcc actggtagaa acaactccat 3360
cgtgaagtcc attaccgtgt cggcgtccgg atccggcgag ggcagaggca gcctgctgac 3420
atgtggcgac gtggaagaga accctggccc cggagctgcc ccggagccgg agaggacccc 3480
cgttggccag ggatcgtggg cccatccggg acgcaccagg ggaccatccg acaggggatt 3540
ctgtgtggtg tcaccggcca ggccagcaga agaggcaacc agcctcgagg gagcgttgtc 3600
tggaaccaga cattcccacc cgtcggtggg ccggcagcac cacgcgggac caccgtccac 3660
ttccagaccg ccacggccat gggacacccc ttgcccgcct gtgtatgccg agactaaaca 3720
cttcctgtac tcatccggag acaaggaaca gcttcggccg tccttcctcc tgtcgtcgct 3780
cagaccgagc ctgaccggag cacgcagatt ggtggaaact atcttccttg ggtcacgtcc 3840
gtggatgcca ggtaccccac ggcgcctccc gcgcctccca cagagatact ggcagatgcg 3900
gcctctgttc ctggaattgc tgggaaacca cgctcagtgc ccgtacggag tcctgctcaa 3960
gactcactgc cctctgaggg cggcggtcac tccggcggcc ggagtgtgcg cacgggagaa 4020
gccccaggga agcgtggcag ctccggaaga ggaggacacc gatccgcgcc gcctcgtgca 4080
acttctgcgc cagcactcct cgccctggca agtctacggg ttcgtccgcg cctgcctgcg 4140
ccgcctggtg ccgcctgggc tctggggttc ccggcataac gagcgccgct tcctgagaaa 4200
tactaagaag tttatctcac ttggaaaaca tgccaagttg tcgctgcaag aactcacgtg 4260
gaagatgtca gtccgcgatt gcgcctggct gcgccgctcg ccgggcgtcg ggtgtgttcc 4320
agctgcagaa caccgcctga gagaagaaat tctggccaaa tttctgcatt ggctgatgtc 4380
agtgtacgtg gtcgagctgc tgcgctcctt tttctacgtc actgagacta cctttcaaaa 4440
gaaccgcctg ttcttctacc gcaaatctgt gtggagcaag ctgcagtcaa tcggcattcg 4500
ccagcatctg aagagggtgc agctgcggga actttccgag gcagaagtcc gccagcaccg 4560
ggaggcccgg ccggcgcttc tcacgtcgcg tctgagattc atcccaaagc ccgacgggct 4620
gaggcctatc gtcaacatgg attacgtcgt gggcgctcgc acctttcgcc gtgaaaagcg 4680
ggccgaacgc ttgacctcac gggtgaaggc cctcttctcc gtgctgaact acgagagagc 4740
aagacggcct ggcctgctgg gagcttcggt gctgggactg gacgatatcc accgggcttg 4800
gcggaccttt gttctccggg tgagagccca agaccctccg ccggaactgt acttcgtgaa 4860
ggtggcgatc accggagcct atgatactat tccgcaagat cgactcaccg aagtcatcgc 4920
ctcgatcatc aaaccgcaga acacttactg cgtcaggcgg tacgccgtgg tccagaaggc 4980
cgcgcatggc cacgtgagaa aggcgttcaa gtcgcacgtg tccactctca ccgacctcca 5040
gccttacatg aggcaattcg ttgcgcattt gcaagagact tcgcccctga gagatgcggt 5100
ggtcatcgag cagagctcca gcctgaacga agcgagcagc ggtctgtttg acgtgttcct 5160
ccgcttcatg tgtcatcacg cggtgcgaat caggggaaaa tcatacgtgc agtgccaggg 5220
aatcccacaa ggcagcattc tgtcgactct cttgtgttcc ctttgctacg gcgatatgga 5280
aaacaagctg ttcgctggga tcagacggga cgggttgctg ctcagactgg tggacgactt 5340
cctgctggtg actccgcacc tcactcacgc caaaaccttt ctccgcactc tggtgagggg 5400
agtgccagaa tacggctgtg tggtcaatct ccggaaaact gtggtgaatt tccctgtcga 5460
ggatgaggca ctcggaggaa ccgcatttgt ccaaatgcca gcacatggcc tgttcccatg 5520
gtgcggtctg ctgctggaca cccgaactct tgaagtgcag tccgactact ccagctatgc 5580
ccggacgagc atccgcgcca gcctcacttt caatcgcggc tttaaggccg gacgaaacat 5640
gcgcagaaag cttttcggag tcctccggct taaatgccat tcgctctttc tcgatctcca 5700
agtcaattcg ctgcagaccg tgtgcacgaa catctacaag atcctgctgc tccaagccta 5760
ccggttccac gcttgcgtgc ttcagctgcc gtttcaccaa caggtgtgga agaacccgac 5820
cttctttctg cgggtcatta gcgatactgc ctccctgtgt tactcaatcc tcaaggcaaa 5880
gaacgccgga atgtcgctgg gtgcgaaagg agccgcggga cctcttccta gcgaagcggt 5940
gcagtggctc tgccaccagg ctttcctcct gaagctgacc aggcacagag tgacctacgt 6000
cccgctgctg ggctcgctgc gcactgcaca gacccagctg tctagaaaac tccccggcac 6060
caccctgacc gctctggaag ccgccgccaa cccagcattg ccgtcagatt tcaagaccat 6120
cttggacgga tccggcacaa tcctgtctga gggcgccacc aacttcagcc tgctgaaact 6180
ggccggcgac gtggaactga accctggccc tacccctgga acccagagcc ccttcttcct 6240
tctgctgctg ctgaccgtgc tgactgtcgt gacaggctct ggccacgcca gctctacacc 6300
tggcggcgag aaagagacaa gcgccaccca gagaagcagc gtgccaagca gcaccgagaa 6360
gaacgccgtg tccatgacca gctccgtgct gagcagccac tctcctggca gcggcagcag 6420
cacaacacag ggccaggatg tgacactggc ccctgccaca gaacctgcct ctggatctgc 6480
cgccacctgg ggacaggacg tgacaagcgt gccagtgacc agacctgccc tgggctctac 6540
aacaccccct gcccacgatg tgaccagcgc ccctgataac aagcctgccc ctggaagcac 6600
agcccctcca gctcatggcg tgacctctgc cccagatacc agaccagccc caggatctac 6660
agccccaccc gcacacggcg tgacaagtgc ccctgacaca agacccgctc caggctctac 6720
tgctcctcct gcccatggcg tgacaagcgc tcccgataca aggccagctc ctggctccac 6780
agcaccacca gcacatggcg tgacatcagc tcccgacact agacctgctc ccggatcaac 6840
cgctccacca gctcacggcg tgaccagcgc acctgatacc agacctgctc tgggaagcac 6900
cgcccctccc gtgcacaatg tgacatctgc ttccggcagc gccagcggct ctgcctctac 6960
actggtgcac aacggcacca gcgccagagc cacaacaacc ccagccagca agagcacccc 7020
cttcagcatc cctagccacc acagcgacac ccctaccaca ctggccagcc actccaccaa 7080
gaccgatgcc tctagcaccc accactccag cgtgccccct ctgaccagca gcaaccacag 7140
cacaagcccc cagctgtcta ccggcgtctc attcttcttt ctgtccttcc acatcagcaa 7200
cctgcagttc aacagcagcc tggaagatcc cagcaccgac tactaccagg aactgcagcg 7260
ggatatcagc gagatgttcc tgcaaatcta caagcagggc ggcttcctgg gcctgagcaa 7320
catcaagttc agacccggca gcgtggtggt gcagctgacc ctggctttcc gggaaggcac 7380
catcaacgtg cacgacgtgg aaacccagtt caaccagtac aagaccgagg ccgccagccg 7440
gtacaacctg accatctccg atgtgtccgt gtccgacgtg cccttcccat tctctgccca 7500
gtctggcgca ggcgtgccag gatggggaat tgctctgctg gtgctcgtgt gcgtgctggt 7560
ggccctggcc atcgtgtatc tgattgccct ggccgtgtgc cagtgccggc ggaagaatta 7620
cggccagctg gacatcttcc ccgccagaga cacctaccac cccatgagcg agtaccccac 7680
ataccacacc cacggcagat acgtgccacc cagctccacc gacagatccc cctacgagaa 7740
agtgtctgcc ggcaacggcg gcagctccct gagctacaca aatcctgccg tggccgctgc 7800
ctccgccaac ctgtgaagat ctgggcccta acaaaacaaa aagatggggt tattccctaa 7860
acttcatggg ttacgtaatt ggaagttggg ggacattgcc acaagatcat attgtacaaa 7920
agatcaaaca ctgttttaga aaacttcctg taaacaggcc tattgattgg aaagtatgtc 7980
aaaggattgt gggtcttttg ggctttgctg ctccatttac acaatgtgga tatcctgcct 8040
taatgccttt gtatgcatgt atacaagcta aacaggcttt cactttctcg ccaacttaca 8100
aggcctttct aagtaaacag tacatgaacc tttaccccgt tgctcggcaa cggcctggtc 8160
tgtgccaagt gtttgctgac gcaaccccca ctggctgggg cttggccata ggccatcagc 8220
gcatgcgtgg aacctttgtg gctcctctgc cgatccatac tgcggaactc ctagccgctt 8280
gttttgctcg cagccggtct ggagcaaagc tcataggaac tgacaattct gtcgtcctct 8340
cgcggaaata tacatcgttt cgatctacgt atgatctttt tccctctgcc aaaaattatg 8400
gggacatcat gaagcccctt gagcatctga cttctggcta ataaaggaaa tttattttca 8460
ttgcaatagt gtgttggaat tttttgtgtc tctcactcgg aaggaattct gcattaatga 8520
atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc 8580
actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg 8640
gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc 8700
cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc 8760
ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga 8820
ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc 8880
ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat 8940
agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg 9000
cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc 9060
aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga 9120
gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact 9180
agaagaacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt 9240
ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag 9300
cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg 9360
tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa 9420
aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata 9480
tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg 9540
atctgtctat ttcgttcatc catagttgcc tgactc 9576
<210> 68
<211> 36088
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 68
ccatcttcaa taatatacct caaacttttt gtgcgcgtta atatgcaaat gaggcgtttg 60
aatttgggga ggaagggcgg tgattggtcg agggatgagc gaccgttagg ggcggggcga 120
gtgacgtttt gatgacgtgg ttgcgaggag gagccagttt gcaagttctc gtgggaaaag 180
tgacgtcaaa cgaggtgtgg tttgaacacg gaaatactca attttcccgc gctctctgac 240
aggaaatgag gtgtttctgg gcggatgcaa gtgaaaacgg gccattttcg cgcgaaaact 300
gaatgaggaa gtgaaaatct gagtaatttc gcgtttatgg cagggaggag tatttgccga 360
gggccgagta gactttgacc gattacgtgg gggtttcgat taccgtgttt ttcacctaaa 420
tttccgcgta cggtgtcaaa gtccggtgtt tttactactg taatagtaat caattacggg 480
gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc 540
gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat 600
agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc 660
ccacttggca gtacatcaag tgtatcatat gccaagtacg ccccctattg acgtcaatga 720
cggtaaatgg cccgcctggc attatgccca gtacatgacc ttatgggact ttcctacttg 780
gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacat 840
caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt 900
caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactc 960
cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg gaggtctata taagcagagc 1020
tgtccctatc agtgatagag atctccctat cagtgataga gagtttagtg aaccgtcaga 1080
tccgctaggg taccgcgatc accatggcta gcaagctgac cattgagagc actcccttca 1140
acgtggctga ggggaaggag gtgctgctcc tggtgcacaa tctgccccag cacctgttcg 1200
ggtactcctg gtacaaggga gaacgcgtgg acgggaaccg gcagatcata ggctacgtca 1260
tcggaaccca gcaggccaca cccggtccag cgtacagcgg ccgggagatt atctacccga 1320
acgcctccct gctgatccaa aacatcatcc agaacgacac cggtttctac actctgcacg 1380
tgattaagtc agatctggtc aacgaagagg ccaccggcca attcagggtg taccccgaac 1440
tccctaagcc gttcatcacc tcgaacaaca gcaacccggt cgaggatgaa gatgcggtgg 1500
ccttgacgtg cgaacctgag atccagaaca ccacctactt gtggtgggtg aacaatcaga 1560
gcctgccagt ctccccacga ctccagctgt cgaacgacaa caggaccctg actttgctgt 1620
ccgtgactcg gaacgacgtg ggcccttatg aatgcggtat ccagaacaag ctgtccgtgg 1680
accacagcga ccctgtgatc ctgaacgtcc tttacgggcc ggacgacccc accatttccc 1740
cgtcgtacac ttactaccgg ccgggcgtga acctgtccct gtcgtgccac gctgcctcca 1800
atccgccggc ccagtactcc tggctcatcg acggaaacat ccagcagcac acccaagaac 1860
tgttcatctc caacattacc gagaaaaact cgggacttta cacctgtcaa gccaacaatt 1920
ccgccagcgg ccactcccgc accactgtca aaactatcac tgtgtccgcc gaactcccga 1980
agcccagcat cagctccaac aactcgaagc ccgtggagga taaggacgct gtcgcgttca 2040
cctgtgaacc agaggcacag aataccacct acctttggtg ggtcaacgga cagtccctgc 2100
ctgtctcacc gagactgcag ctgtcaaacg ggaataggac tctgaccttg tttaacgtca 2160
cccggaacga cgcccgggcc tacgtgtgcg gcatccagaa ctccgtgagc gcaaaccggt 2220
ctgacccagt gaccctggat gtgctgtacg gccccgacac tccgatcatt tcaccccccg 2280
attcatccta cctgtccggc gctaacctca acctctcatg ccactccgca tccaacccca 2340
gcccgcaata ttcgtggcgc attaacggaa ttcctcagca acatacccag gtcctgttca 2400
ttgcgaagat cacccctaac aacaacggaa cctacgcctg ctttgtgtca aacctggcca 2460
ctggtagaaa caactccatc gtgaagtcca ttaccgtgtc ggcgtccgga tccggcgagg 2520
gcagaggcag cctgctgaca tgtggcgacg tggaagagaa ccctggcccc ggagctgccc 2580
cggagccgga gaggaccccc gttggccagg gatcgtgggc ccatccggga cgcaccaggg 2640
gaccatccga caggggattc tgtgtggtgt caccggccag gccagcagaa gaggcaacca 2700
gcctcgaggg agcgttgtct ggaaccagac attcccaccc gtcggtgggc cggcagcacc 2760
acgcgggacc accgtccact tccagaccgc cacggccatg ggacacccct tgcccgcctg 2820
tgtatgccga gactaaacac ttcctgtact catccggaga caaggaacag cttcggccgt 2880
ccttcctcct gtcgtcgctc agaccgagcc tgaccggagc acgcagattg gtggaaacta 2940
tcttccttgg gtcacgtccg tggatgccag gtaccccacg gcgcctcccg cgcctcccac 3000
agagatactg gcagatgcgg cctctgttcc tggaattgct gggaaaccac gctcagtgcc 3060
cgtacggagt cctgctcaag actcactgcc ctctgagggc ggcggtcact ccggcggccg 3120
gagtgtgcgc acgggagaag ccccagggaa gcgtggcagc tccggaagag gaggacaccg 3180
atccgcgccg cctcgtgcaa cttctgcgcc agcactcctc gccctggcaa gtctacgggt 3240
tcgtccgcgc ctgcctgcgc cgcctggtgc cgcctgggct ctggggttcc cggcataacg 3300
agcgccgctt cctgagaaat actaagaagt ttatctcact tggaaaacat gccaagttgt 3360
cgctgcaaga actcacgtgg aagatgtcag tccgcgattg cgcctggctg cgccgctcgc 3420
cgggcgtcgg gtgtgttcca gctgcagaac accgcctgag agaagaaatt ctggccaaat 3480
ttctgcattg gctgatgtca gtgtacgtgg tcgagctgct gcgctccttt ttctacgtca 3540
ctgagactac ctttcaaaag aaccgcctgt tcttctaccg caaatctgtg tggagcaagc 3600
tgcagtcaat cggcattcgc cagcatctga agagggtgca gctgcgggaa ctttccgagg 3660
cagaagtccg ccagcaccgg gaggcccggc cggcgcttct cacgtcgcgt ctgagattca 3720
tcccaaagcc cgacgggctg aggcctatcg tcaacatgga ttacgtcgtg ggcgctcgca 3780
cctttcgccg tgaaaagcgg gccgaacgct tgacctcacg ggtgaaggcc ctcttctccg 3840
tgctgaacta cgagagagca agacggcctg gcctgctggg agcttcggtg ctgggactgg 3900
acgatatcca ccgggcttgg cggacctttg ttctccgggt gagagcccaa gaccctccgc 3960
cggaactgta cttcgtgaag gtggcgatca ccggagccta tgatactatt ccgcaagatc 4020
gactcaccga agtcatcgcc tcgatcatca aaccgcagaa cacttactgc gtcaggcggt 4080
acgccgtggt ccagaaggcc gcgcatggcc acgtgagaaa ggcgttcaag tcgcacgtgt 4140
ccactctcac cgacctccag ccttacatga ggcaattcgt tgcgcatttg caagagactt 4200
cgcccctgag agatgcggtg gtcatcgagc agagctccag cctgaacgaa gcgagcagcg 4260
gtctgtttga cgtgttcctc cgcttcatgt gtcatcacgc ggtgcgaatc aggggaaaat 4320
catacgtgca gtgccaggga atcccacaag gcagcattct gtcgactctc ttgtgttccc 4380
tttgctacgg cgatatggaa aacaagctgt tcgctgggat cagacgggac gggttgctgc 4440
tcagactggt ggacgacttc ctgctggtga ctccgcacct cactcacgcc aaaacctttc 4500
tccgcactct ggtgagggga gtgccagaat acggctgtgt ggtcaatctc cggaaaactg 4560
tggtgaattt ccctgtcgag gatgaggcac tcggaggaac cgcatttgtc caaatgccag 4620
cacatggcct gttcccatgg tgcggtctgc tgctggacac ccgaactctt gaagtgcagt 4680
ccgactactc cagctatgcc cggacgagca tccgcgccag cctcactttc aatcgcggct 4740
ttaaggccgg acgaaacatg cgcagaaagc ttttcggagt cctccggctt aaatgccatt 4800
cgctctttct cgatctccaa gtcaattcgc tgcagaccgt gtgcacgaac atctacaaga 4860
tcctgctgct ccaagcctac cggttccacg cttgcgtgct tcagctgccg tttcaccaac 4920
aggtgtggaa gaacccgacc ttctttctgc gggtcattag cgatactgcc tccctgtgtt 4980
actcaatcct caaggcaaag aacgccggaa tgtcgctggg tgcgaaagga gccgcgggac 5040
ctcttcctag cgaagcggtg cagtggctct gccaccaggc tttcctcctg aagctgacca 5100
ggcacagagt gacctacgtc ccgctgctgg gctcgctgcg cactgcacag acccagctgt 5160
ctagaaaact ccccggcacc accctgaccg ctctggaagc cgccgccaac ccagcattgc 5220
cgtcagattt caagaccatc ttggacggat ccggcacaat cctgtctgag ggcgccacca 5280
acttcagcct gctgaaactg gccggcgacg tggaactgaa ccctggccct acccctggaa 5340
cccagagccc cttcttcctt ctgctgctgc tgaccgtgct gactgtcgtg acaggctctg 5400
gccacgccag ctctacacct ggcggcgaga aagagacaag cgccacccag agaagcagcg 5460
tgccaagcag caccgagaag aacgccgtgt ccatgaccag ctccgtgctg agcagccact 5520
ctcctggcag cggcagcagc acaacacagg gccaggatgt gacactggcc cctgccacag 5580
aacctgcctc tggatctgcc gccacctggg gacaggacgt gacaagcgtg ccagtgacca 5640
gacctgccct gggctctaca acaccccctg cccacgatgt gaccagcgcc cctgataaca 5700
agcctgcccc tggaagcaca gcccctccag ctcatggcgt gacctctgcc ccagatacca 5760
gaccagcccc aggatctaca gccccacccg cacacggcgt gacaagtgcc cctgacacaa 5820
gacccgctcc aggctctact gctcctcctg cccatggcgt gacaagcgct cccgatacaa 5880
ggccagctcc tggctccaca gcaccaccag cacatggcgt gacatcagct cccgacacta 5940
gacctgctcc cggatcaacc gctccaccag ctcacggcgt gaccagcgca cctgatacca 6000
gacctgctct gggaagcacc gcccctcccg tgcacaatgt gacatctgct tccggcagcg 6060
ccagcggctc tgcctctaca ctggtgcaca acggcaccag cgccagagcc acaacaaccc 6120
cagccagcaa gagcaccccc ttcagcatcc ctagccacca cagcgacacc cctaccacac 6180
tggccagcca ctccaccaag accgatgcct ctagcaccca ccactccagc gtgccccctc 6240
tgaccagcag caaccacagc acaagccccc agctgtctac cggcgtctca ttcttctttc 6300
tgtccttcca catcagcaac ctgcagttca acagcagcct ggaagatccc agcaccgact 6360
actaccagga actgcagcgg gatatcagcg agatgttcct gcaaatctac aagcagggcg 6420
gcttcctggg cctgagcaac atcaagttca gacccggcag cgtggtggtg cagctgaccc 6480
tggctttccg ggaaggcacc atcaacgtgc acgacgtgga aacccagttc aaccagtaca 6540
agaccgaggc cgccagccgg tacaacctga ccatctccga tgtgtccgtg tccgacgtgc 6600
ccttcccatt ctctgcccag tctggcgcag gcgtgccagg atggggaatt gctctgctgg 6660
tgctcgtgtg cgtgctggtg gccctggcca tcgtgtatct gattgccctg gccgtgtgcc 6720
agtgccggcg gaagaattac ggccagctgg acatcttccc cgccagagac acctaccacc 6780
ccatgagcga gtaccccaca taccacaccc acggcagata cgtgccaccc agctccaccg 6840
acagatcccc ctacgagaaa gtgtctgccg gcaacggcgg cagctccctg agctacacaa 6900
atcctgccgt ggccgctgcc tccgccaacc tgtgacgcac ctcgagctga tcataatcag 6960
ccataccaca tttgtagagg ttttacttgc tttaaaaaac ctcccacacc tccccctgaa 7020
cctgaaacat aaaatgaatg caattgttgt tgttaacttg tttattgcag cttataatgg 7080
ttacaaataa agcaatagca tcacaaattt cacaaataaa gcattttttt cactgcattc 7140
tagttgtggt ttgtccaaac tcatcaatgt atcttaccag gtgccgagcc tgcgagtgcg 7200
gagggaagca tgccaggttc cagcccgtgt gtgtggatgt gacggaggac ctgcgacccg 7260
atcatttggt gttgccctgc accgggacgg agttcggttc cagcggggaa gaatctgact 7320
agagtgagta gtgttctggg gcgggggagg acctgcatga gggccagaat aactgaaatc 7380
tgtgcttttc tgtgtgttgc agcagcatga gcggaagcgg ctcctttgag ggaggggtat 7440
tcagccctta tctgacgggg cgtctcccct cctgggcggg agtgcgtcag aatgtgatgg 7500
gatccacggt ggacggccgg cccgtgcagc ccgcgaactc ttcaaccctg acctatgcaa 7560
ccctgagctc ttcgtcgttg gacgcagctg ccgccgcagc tgctgcatct gccgccagcg 7620
ccgtgcgcgg aatggccatg ggcgccggct actacggcac tctggtggcc aactcgagtt 7680
ccaccaataa tcccgccagc ctgaacgagg agaagctgtt gctgctgatg gcccagctcg 7740
aggccttgac ccagcgcctg ggcgagctga cccagcaggt ggctcagctg caggagcaga 7800
cgcgggccgc ggttgccacg gtgaaatcca aataaaaaat gaatcaataa ataaacggag 7860
acggttgttg attttaacac agagtctgaa tctttatttg atttttcgcg cgcggtaggc 7920
cctggaccac cggtctcgat cattgagcac ccggtggatc ttttccagga cccggtagag 7980
gtgggcttgg atgttgaggt acatgggcat gagcccgtcc cgggggtgga ggtagctcca 8040
ttgcagggcc tcgtgctcgg gggtggtgtt gtaaatcacc cagtcatagc aggggcgcag 8100
ggcatggtgt tgcacaatat ctttgaggag gagactgatg gccacgggca gccctttggt 8160
gtaggtgttt acaaatctgt tgagctggga gggatgcatg cggggggaga tgaggtgcat 8220
cttggcctgg atcttgagat tggcgatgtt accgcccaga tcccgcctgg ggttcatgtt 8280
gtgcaggacc accagcacgg tgtatccggt gcacttgggg aatttatcat gcaacttgga 8340
agggaaggcg tgaaagaatt tggcgacgcc tttgtgcccg cccaggtttt ccatgcactc 8400
atccatgatg atggcgatgg gcccgtgggc ggcggcctgg gcaaagacgt ttcgggggtc 8460
ggacacatca tagttgtggt cctgggtgag gtcatcatag gccattttaa tgaatttggg 8520
gcggagggtg ccggactggg ggacaaaggt accctcgatc ccgggggcgt agttcccctc 8580
acagatctgc atctcccagg ctttgagctc ggaggggggg atcatgtcca cctgcggggc 8640
gataaagaac acggtttccg gggcggggga gatgagctgg gccgaaagca agttccggag 8700
cagctgggac ttgccgcagc cggtggggcc gtagatgacc ccgatgaccg gctgcaggtg 8760
gtagttgagg gagagacagc tgccgtcctc ccggaggagg ggggccacct cgttcatcat 8820
ctcgcgcacg tgcatgttct cgcgcaccag ttccgccagg aggcgctctc cccccaggga 8880
taggagctcc tggagcgagg cgaagttttt cagcggcttg agtccgtcgg ccatgggcat 8940
tttggagagg gtttgttgca agagttccag gcggtcccag agctcggtga tgtgctctac 9000
ggcatctcga tccagcagac ctcctcgttt cgcgggttgg gacggctgcg ggagtagggc 9060
accagacgat gggcgtccag cgcagccagg gtccggtcct tccagggtcg cagcgtccgc 9120
gtcagggtgg tctccgtcac ggtgaagggg tgcgcgccgg gctgggcgct tgcgagggtg 9180
cgcttcaggc tcatccggct ggtcgaaaac cgctcccgat cggcgccctg cgcgtcggcc 9240
aggtagcaat tgaccatgag ttcgtagttg agcgcctcgg ccgcgtggcc tttggcgcgg 9300
agcttacctt tggaagtctg cccgcaggcg ggacagagga gggacttgag ggcgtagagc 9360
ttgggggcga ggaagacgga ctcgggggcg taggcgtccg cgccgcagtg ggcgcagacg 9420
gtctcgcact ccacgagcca ggtgaggtcg ggctggtcgg ggtcaaaaac cagtttcccg 9480
ccgttctttt tgatgcgttt cttacctttg gtctccatga gctcgtgtcc ccgctgggtg 9540
acaaagaggc tgtccgtgtc cccgtagacc gactttatgg gccggtcctc gagcggtgtg 9600
ccgcggtcct cctcgtagag gaaccccgcc cactccgaga cgaaagcccg ggtccaggcc 9660
agcacgaagg aggccacgtg ggacgggtag cggtcgttgt ccaccagcgg gtccaccttt 9720
tccagggtat gcaaacacat gtccccctcg tccacatcca ggaaggtgat tggcttgtaa 9780
gtgtaggcca cgtgaccggg ggtcccggcc gggggggtat aaaagggtgc gggtccctgc 9840
tcgtcctcac tgtcttccgg atcgctgtcc aggagcgcca gctgttgggg taggtattcc 9900
ctctcgaagg cgggcatgac ctcggcactc aggttgtcag tttctagaaa cgaggaggat 9960
ttgatattga cggtgccggc ggagatgcct ttcaagagcc cctcgtccat ctggtcagaa 10020
aagacgatct ttttgttgtc gagcttggtg gcgaaggagc cgtagagggc gttggagagg 10080
agcttggcga tggagcgcat ggtctggttt ttttccttgt cggcgcgctc cttggcggcg 10140
atgttgagct gcacgtactc gcgcgccacg cacttccatt cggggaagac ggtggtcagc 10200
tcgtcgggca cgattctgac ctgccagccc cgattatgca gggtgatgag gtccacactg 10260
gtggccacct cgccgcgcag gggctcatta gtccagcaga ggcgtccgcc cttgcgcgag 10320
cagaaggggg gcagggggtc cagcatgacc tcgtcggggg ggtcggcatc gatggtgaag 10380
atgccgggca ggaggtcggg gtcaaagtag ctgatggaag tggccagatc gtccagggca 10440
gcttgccatt cgcgcacggc cagcgcgcgc tcgtagggac tgaggggcgt gccccagggc 10500
atgggatggg taagcgcgga ggcgtacatg ccgcagatgt cgtagacgta gaggggctcc 10560
tcgaggatgc cgatgtaggt ggggtagcag cgccccccgc ggatgctggc gcgcacgtag 10620
tcatacagct cgtgcgaggg ggcgaggagc cccgggccca ggttggtgcg actgggcttt 10680
tcggcgcggt agacgatctg gcggaaaatg gcatgcgagt tggaggagat ggtgggcctt 10740
tggaagatgt tgaagtgggc gtggggcagt ccgaccgagt cgcggatgaa gtgggcgtag 10800
gagtcttgca gcttggcgac gagctcggcg gtgactagga cgtccagagc gcagtagtcg 10860
agggtctcct ggatgatgtc atacttgagc tgtccctttt gtttccacag ctcgcggttg 10920
agaaggaact cttcgcggtc cttccagtac tcttcgaggg ggaacccgtc ctgatctgca 10980
cggtaagagc ctagcatgta gaactggttg acggccttgt aggcgcagca gcccttctcc 11040
acggggaggg cgtaggcctg ggcggccttg cgcagggagg tgtgcgtgag ggcgaaagtg 11100
tccctgacca tgaccttgag gaactggtgc ttgaagtcga tatcgtcgca gcccccctgc 11160
tcccagagct ggaagtccgt gcgcttcttg taggcggggt tgggcaaagc gaaagtaaca 11220
tcgttgaaga ggatcttgcc cgcgcggggc ataaagttgc gagtgatgcg gaaaggttgg 11280
ggcacctcgg cccggttgtt gatgacctgg gcggcgagca cgatctcgtc gaagccgttg 11340
atgttgtggc ccacgatgta gagttccacg aatcgcggac ggcccttgac gtggggcagt 11400
ttcttgagct cctcgtaggt gagctcgtcg gggtcgctga gcccgtgctg ctcgagcgcc 11460
cagtcggcga gatgggggtt ggcgcggagg aaggaagtcc agagatccac ggccagggcg 11520
gtttgcagac ggtcccggta ctgacggaac tgctgcccga cggccatttt ttcgggggtg 11580
acgcagtaga aggtgcgggg gtccccgtgc cagcgatccc atttgagctg gagggcgaga 11640
tcgagggcga gctcgacgag ccggtcgtcc ccggagagtt tcatgaccag catgaagggg 11700
acgagctgct tgccgaagga ccccatccag gtgtaggttt ccacatcgta ggtgaggaag 11760
agcctttcgg tgcgaggatg cgagccgatg gggaagaact ggatctcctg ccaccaattg 11820
gaggaatggc tgttgatgtg atggaagtag aaatgccgac ggcgcgccga acactcgtgc 11880
ttgtgtttat acaagcggcc acagtgctcg caacgctgca cgggatgcac gtgctgcacg 11940
agctgtacct gagttccttt gacgaggaat ttcagtggga agtggagtcg tggcgcctgc 12000
atctcgtgct gtactacgtc gtggtggtcg gcctggccct cttctgcctc gatggtggtc 12060
atgctgacga gcccgcgcgg gaggcaggtc cagacctcgg cgcgagcggg tcggagagcg 12120
aggacgaggg cgcgcaggcc ggagctgtcc agggtcctga gacgctgcgg agtcaggtca 12180
gtgggcagcg gcggcgcgcg gttgacttgc aggagttttt ccagggcgcg cgggaggtcc 12240
agatggtact tgatctccac cgcgccattg gtggcgacgt cgatggcttg cagggtcccg 12300
tgcccctggg gtgtgaccac cgtcccccgt ttcttcttgg gcggctgggg cgacgggggc 12360
ggtgcctctt ccatggttag aagcggcggc gaggacgcgc gccgggcggc aggggcggct 12420
cggggcccgg aggcaggggc ggcaggggca cgtcggcgcc gcgcgcgggt aggttctggt 12480
actgcgcccg gagaagactg gcgtgagcga cgacgcgacg gttgacgtcc tggatctgac 12540
gcctctgggt gaaggccacg ggacccgtga gtttgaacct gaaagagagt tcgacagaat 12600
caatctcggt atcgttgacg gcggcctgcc gcaggatctc ttgcacgtcg cccgagttgt 12660
cctggtaggc gatctcggtc atgaactgct cgatctcctc ctcttgaagg tctccgcggc 12720
cggcgcgctc cacggtggcc gcgaggtcgt tggagatgcg gcccatgagc tgcgagaagg 12780
cgttcatgcc cgcctcgttc cagacgcggc tgtagaccac gacgccctcg ggatcgcggg 12840
cgcgcatgac cacctgggcg aggttgagct ccacgtggcg cgtgaagacc gcgtagttgc 12900
agaggcgctg gtagaggtag ttgagcgtgg tggcgatgtg ctcggtgacg aagaaataca 12960
tgatccagcg gcggagcggc atctcgctga cgtcgcccag cgcctccaaa cgttccatgg 13020
cctcgtaaaa gtccacggcg aagttgaaaa actgggagtt gcgcgccgag acggtcaact 13080
cctcctccag aagacggatg agctcggcga tggtggcgcg cacctcgcgc tcgaaggccc 13140
ccgggagttc ctccacttcc tcttcttcct cctccactaa catctcttct acttcctcct 13200
caggcggcag tggtggcggg ggagggggcc tgcgtcgccg gcggcgcacg ggcagacggt 13260
cgatgaagcg ctcgatggtc tcgccgcgcc ggcgtcgcat ggtctcggtg acggcgcgcc 13320
cgtcctcgcg gggccgcagc gtgaagacgc cgccgcgcat ctccaggtgg ccgggggggt 13380
ccccgttggg cagggagagg gcgctgacga tgcatcttat caattgcccc gtagggactc 13440
cgcgcaagga cctgagcgtc tcgagatcca cgggatctga aaaccgctga acgaaggctt 13500
cgagccagtc gcagtcgcaa ggtaggctga gcacggtttc ttctggcggg tcatgttggt 13560
tgggagcggg gcgggcgatg ctgctggtga tgaagttgaa ataggcggtt ctgagacggc 13620
ggatggtggc gaggagcacc aggtctttgg gcccggcttg ctggatgcgc agacggtcgg 13680
ccatgcccca ggcgtggtcc tgacacctgg ccaggtcctt gtagtagtcc tgcatgagcc 13740
gctccacggg cacctcctcc tcgcccgcgc ggccgtgcat gcgcgtgagc ccgaagccgc 13800
gctggggctg gacgagcgcc aggtcggcga cgacgcgctc ggcgaggatg gcttgctgga 13860
tctgggtgag ggtggtctgg aagtcatcaa agtcgacgaa gcggtggtag gctccggtgt 13920
tgatggtgta ggagcagttg gccatgacgg accagttgac ggtctggtgg cccggacgca 13980
cgagctcgtg gtacttgagg cgcgagtagg cgcgcgtgtc gaagatgtag tcgttgcagg 14040
tgcgcaccag gtactggtag ccgatgagga agtgcggcgg cggctggcgg tagagcggcc 14100
atcgctcggt ggcgggggcg ccgggcgcga ggtcctcgag catggtgcgg tggtagccgt 14160
agatgtacct ggacatccag gtgatgccgg cggcggtggt ggaggcgcgc gggaactcgc 14220
ggacgcggtt ccagatgttg cgcagcggca ggaagtagtt catggtgggc acggtctggc 14280
ccgtgaggcg cgcgcagtcg tggatgctct atacgggcaa aaacgaaagc ggtcagcggc 14340
tcgactccgt ggcctggagg ctaagcgaac gggttgggct gcgcgtgtac cccggttcga 14400
atctcgaatc aggctggagc cgcagctaac gtggtattgg cactcccgtc tcgacccaag 14460
cctgcaccaa ccctccagga tacggaggcg ggtcgttttg caactttttt ttggaggccg 14520
gatgagacta gtaagcgcgg aaagcggccg accgcgatgg ctcgctgccg tagtctggag 14580
aagaatcgcc agggttgcgt tgcggtgtgc cccggttcga ggccggccgg attccgcggc 14640
taacgagggc gtggctgccc cgtcgtttcc aagaccccat agccagccga cttctccagt 14700
tacggagcga gcccctcttt tgttttgttt gtttttgcca gatgcatccc gtactgcggc 14760
agatgcgccc ccaccaccct ccaccgcaac aacagccccc tccacagccg gcgcttctgc 14820
ccccgcccca gcagcaactt ccagccacga ccgccgcggc cgccgtgagc ggggctggac 14880
agagttatga tcaccagctg gccttggaag agggcgaggg gctggcgcgc ctgggggcgt 14940
cgtcgccgga gcggcacccg cgcgtgcaga tgaaaaggga cgctcgcgag gcctacgtgc 15000
ccaagcagaa cctgttcaga gacaggagcg gcgaggagcc cgaggagatg cgcgcggccc 15060
ggttccacgc ggggcgggag ctgcggcgcg gcctggaccg aaagagggtg ctgagggacg 15120
aggatttcga ggcggacgag ctgacgggga tcagccccgc gcgcgcgcac gtggccgcgg 15180
ccaacctggt cacggcgtac gagcagaccg tgaaggagga gagcaacttc caaaaatcct 15240
tcaacaacca cgtgcgcacc ctgatcgcgc gcgaggaggt gaccctgggc ctgatgcacc 15300
tgtgggacct gctggaggcc atcgtgcaga accccaccag caagccgctg acggcgcagc 15360
tgttcctggt ggtgcagcat agtcgggaca acgaagcgtt cagggaggcg ctgctgaata 15420
tcaccgagcc cgagggccgc tggctcctgg acctggtgaa cattctgcag agcatcgtgg 15480
tgcaggagcg cgggctgccg ctgtccgaga agctggcggc catcaacttc tcggtgctga 15540
gtttgggcaa gtactacgct aggaagatct acaagacccc gtacgtgccc atagacaagg 15600
aggtgaagat cgacgggttt tacatgcgca tgaccctgaa agtgctgacc ctgagcgacg 15660
atctgggggt gtaccgcaac gacaggatgc accgtgcggt gagcgccagc aggcggcgcg 15720
agctgagcga ccaggagctg atgcatagtc tgcagcgggc cctgaccggg gccgggaccg 15780
agggggagag ctactttgac atgggcgcgg acctgcactg gcagcccagc cgccgggcct 15840
tggaggcggc ggcaggaccc tacgtagaag aggtggacga tgaggtggac gaggagggcg 15900
agtacctgga agactgatgg cgcgaccgta tttttgctag atgcaacaac aacagccacc 15960
tcctgatccc gcgatgcggg cggcgctgca gagccagccg tccggcatta actcctcgga 16020
cgattggacc caggccatgc aacgcatcat ggcgctgacg acccgcaacc ccgaagcctt 16080
tagacagcag ccccaggcca accggctctc ggccatcctg gaggccgtgg tgccctcgcg 16140
ctccaacccc acgcacgaga aggtcctggc catcgtgaac gcgctggtgg agaacaaggc 16200
catccgcggc gacgaggccg gcctggtgta caacgcgctg ctggagcgcg tggcccgcta 16260
caacagcacc aacgtgcaga ccaacctgga ccgcatggtg accgacgtgc gcgaggccgt 16320
ggcccagcgc gagcggttcc accgcgagtc caacctggga tccatggtgg cgctgaacgc 16380
cttcctcagc acccagcccg ccaacgtgcc ccggggccag gaggactaca ccaacttcat 16440
cagcgccctg cgcctgatgg tgaccgaggt gccccagagc gaggtgtacc agtccgggcc 16500
ggactacttc ttccagacca gtcgccaggg cttgcagacc gtgaacctga gccaggcttt 16560
caagaacttg cagggcctgt ggggcgtgca ggccccggtc ggggaccgcg cgacggtgtc 16620
gagcctgctg acgccgaact cgcgcctgct gctgctgctg gtggccccct tcacggacag 16680
cggcagcatc aaccgcaact cgtacctggg ctacctgatt aacctgtacc gcgaggccat 16740
cggccaggcg cacgtggacg agcagaccta ccaggagatc acccacgtga gccgcgccct 16800
gggccaggac gacccgggca acctggaagc caccctgaac tttttgctga ccaaccggtc 16860
gcagaagatc ccgccccagt acgcgctcag caccgaggag gagcgcatcc tgcgttacgt 16920
gcagcagagc gtgggcctgt tcctgatgca ggagggggcc acccccagcg ccgcgctcga 16980
catgaccgcg cgcaacatgg agcccagcat gtacgccagc aaccgcccgt tcatcaataa 17040
actgatggac tacttgcatc gggcggccgc catgaactct gactatttca ccaacgccat 17100
cctgaatccc cactggctcc cgccgccggg gttctacacg ggcgagtacg acatgcccga 17160
ccccaatgac gggttcctgt gggacgatgt ggacagcagc gtgttctccc cccgaccggg 17220
tgctaacgag cgccccttgt ggaagaagga aggcagcgac cgacgcccgt cctcggcgct 17280
gtccggccgc gagggtgctg ccgcggcggt gcccgaggcc gccagtcctt tcccgagctt 17340
gcccttctcg ctgaacagta tccgcagcag cgagctgggc aggatcacgc gcccgcgctt 17400
gctgggcgaa gaggagtact tgaatgactc gctgttgaga cccgagcggg agaagaactt 17460
ccccaataac gggatagaaa gcctggtgga caagatgagc cgctggaaga cgtatgcgca 17520
ggagcacagg gacgatcccc gggcgtcgca gggggccacg agccggggca gcgccgcccg 17580
taaacgccgg tggcacgaca ggcagcgggg acagatgtgg gacgatgagg actccgccga 17640
cgacagcagc gtgttggact tgggtgggag tggtaacccg ttcgctcacc tgcgcccccg 17700
tatcgggcgc atgatgtaag agaaaccgaa aataaatgat actcaccaag gccatggcga 17760
ccagcgtgcg ttcgtttctt ctctgttgtt gttgtatcta gtatgatgag gcgtgcgtac 17820
ccggagggtc ctcctccctc gtacgagagc gtgatgcagc aggcgatggc ggcggcggcg 17880
atgcagcccc cgctggaggc tccttacgtg cccccgcggt acctggcgcc tacggagggg 17940
cggaacagca ttcgttactc ggagctggca cccttgtacg ataccacccg gttgtacctg 18000
gtggacaaca agtcggcgga catcgcctcg ctgaactacc agaacgacca cagcaacttc 18060
ctgaccaccg tggtgcagaa caatgacttc acccccacgg aggccagcac ccagaccatc 18120
aactttgacg agcgctcgcg gtggggcggc cagctgaaaa ccatcatgca caccaacatg 18180
cccaacgtga acgagttcat gtacagcaac aagttcaagg cgcgggtgat ggtctcccgc 18240
aagaccccca atggggtgac agtgacagag gattatgatg gtagtcagga tgagctgaag 18300
tatgaatggg tggaatttga gctgcccgaa ggcaacttct cggtgaccat gaccatcgac 18360
ctgatgaaca acgccatcat cgacaattac ttggcggtgg ggcggcagaa cggggtgctg 18420
gagagcgaca tcggcgtgaa gttcgacact aggaacttca ggctgggctg ggaccccgtg 18480
accgagctgg tcatgcccgg ggtgtacacc aacgaggctt tccatcccga tattgtcttg 18540
ctgcccggct gcggggtgga cttcaccgag agccgcctca gcaacctgct gggcattcgc 18600
aagaggcagc ccttccagga aggcttccag atcatgtacg aggatctgga ggggggcaac 18660
atccccgcgc tcctggatgt cgacgcctat gagaaaagca aggaggatgc agcagctgaa 18720
gcaactgcag ccgtagctac cgcctctacc gaggtcaggg gcgataattt tgcaagcgcc 18780
gcagcagtgg cagcggccga ggcggctgaa accgaaagta agatagtcat tcagccggtg 18840
gagaaggata gcaagaacag gagctacaac gtactaccgg acaagataaa caccgcctac 18900
cgcagctggt acctagccta caactatggc gaccccgaga agggcgtgcg ctcctggacg 18960
ctgctcacca cctcggacgt cacctgcggc gtggagcaag tctactggtc gctgcccgac 19020
atgatgcaag acccggtcac cttccgctcc acgcgtcaag ttagcaacta cccggtggtg 19080
ggcgccgagc tcctgcccgt ctactccaag agcttcttca acgagcaggc cgtctactcg 19140
cagcagctgc gcgccttcac ctcgcttacg cacgtcttca accgcttccc cgagaaccag 19200
atcctcgtcc gcccgcccgc gcccaccatt accaccgtca gtgaaaacgt tcctgctctc 19260
acagatcacg ggaccctgcc gctgcgcagc agtatccggg gagtccagcg cgtgaccgtt 19320
actgacgcca gacgccgcac ctgcccctac gtctacaagg ccctgggcat agtcgcgccg 19380
cgcgtcctct cgagccgcac cttctaaatg tccattctca tctcgcccag taataacacc 19440
ggttggggcc tgcgcgcgcc cagcaagatg tacggaggcg ctcgccaacg ctccacgcaa 19500
caccccgtgc gcgtgcgcgg gcacttccgc gctccctggg gcgccctcaa gggccgcgtg 19560
cggtcgcgca ccaccgtcga cgacgtgatc gaccaggtgg tggccgacgc gcgcaactac 19620
acccccgccg ccgcgcccgt ctccaccgtg gacgccgtca tcgacagcgt ggtggccgac 19680
gcgcgccggt acgcccgcgc caagagccgg cggcggcgca tcgcccggcg gcaccggagc 19740
acccccgcca tgcgcgcggc gcgagccttg ctgcgcaggg ccaggcgcac gggacgcagg 19800
gccatgctca gggcggccag acgcgcggct tcaggcgcca gcgccggcag gacccggaga 19860
cgcgcggcca cggcggcggc agcggccatc gccagcatgt cccgcccgcg gcgagggaac 19920
gtgtactggg tgcgcgacgc cgccaccggt gtgcgcgtgc ccgtgcgcac ccgcccccct 19980
cgcacttgaa gatgttcact tcgcgatgtt gatgtgtccc agcggcgagg aggatgtcca 20040
agcgcaaatt caaggaagag atgctccagg tcatcgcgcc tgagatctac ggccctgcgg 20100
tggtgaagga ggaaagaaag ccccgcaaaa tcaagcgggt caaaaaggac aaaaaggaag 20160
aagaaagtga tgtggacgga ttggtggagt ttgtgcgcga gttcgccccc cggcggcgcg 20220
tgcagtggcg cgggcggaag gtgcaaccgg tgctgagacc cggcaccacc gtggtcttca 20280
cgcccggcga gcgctccggc accgcttcca agcgctccta cgacgaggtg tacggggatg 20340
atgatattct ggagcaggcg gccgagcgcc tgggcgagtt tgcttacggc aagcgcagcc 20400
gttccgcacc gaaggaagag gcggtgtcca tcccgctgga ccacggcaac cccacgccga 20460
gcctcaagcc cgtgaccttg cagcaggtgc tgccgaccgc ggcgccgcgc cgggggttca 20520
agcgcgaggg cgaggatctg taccccacca tgcagctgat ggtgcccaag cgccagaagc 20580
tggaagacgt gctggagacc atgaaggtgg acccggacgt gcagcccgag gtcaaggtgc 20640
ggcccatcaa gcaggtggcc ccgggcctgg gcgtgcagac cgtggacatc aagattccca 20700
cggagcccat ggaaacgcag accgagccca tgatcaagcc cagcaccagc accatggagg 20760
tgcagacgga tccctggatg ccatcggctc ctagtcgaag accccggcgc aagtacggcg 20820
cggccagcct gctgatgccc aactacgcgc tgcatccttc catcatcccc acgccgggct 20880
accgcggcac gcgcttctac cgcggtcata ccagcagccg ccgccgcaag accaccactc 20940
gccgccgccg tcgccgcacc gccgctgcaa ccacccctgc cgccctggtg cggagagtgt 21000
accgccgcgg ccgcgcacct ctgaccctgc cgcgcgcgcg ctaccacccg agcatcgcca 21060
tttaaacttt cgcctgcttt gcagatcaat ggccctcaca tgccgccttc gcgttcccat 21120
tacgggctac cgaggaagaa aaccgcgccg tagaaggctg gcggggaacg ggatgcgtcg 21180
ccaccaccac cggcggcggc gcgccatcag caagcggttg gggggaggct tcctgcccgc 21240
gctgatcccc atcatcgccg cggcgatcgg ggcgatcccc ggcattgctt ccgtggcggt 21300
gcaggcctct cagcgccact gagacacact tggaaacatc ttgtaataaa ccaatggact 21360
ctgacgctcc tggtcctgtg atgtgttttc gtagacagat ggaagacatc aatttttcgt 21420
ccctggctcc gcgacacggc acgcggccgt tcatgggcac ctggagcgac atcggcacca 21480
gccaactgaa cgggggcgcc ttcaattgga gcagtctctg gagcgggctt aagaatttcg 21540
ggtccacgct taaaacctat ggcagcaagg cgtggaacag caccacaggg caggcgctga 21600
gggataagct gaaagagcag aacttccagc agaaggtggt cgatgggctc gcctcgggca 21660
tcaacggggt ggtggacctg gccaaccagg ccgtgcagcg gcagatcaac agccgcctgg 21720
acccggtgcc gcccgccggc tccgtggaga tgccgcaggt ggaggaggag ctgcctcccc 21780
tggacaagcg gggcgagaag cgaccccgcc ccgatgcgga ggagacgctg ctgacgcaca 21840
cggacgagcc gcccccgtac gaggaggcgg tgaaactggg tctgcccacc acgcggccca 21900
tcgcgcccct ggccaccggg gtgctgaaac ccgaaaagcc cgcgaccctg gacttgcctc 21960
ctccccagcc ttcccgcccc tctacagtgg ctaagcccct gccgccggtg gccgtggccc 22020
gcgcgcgacc cgggggcacc gcccgccctc atgcgaactg gcagagcact ctgaacagca 22080
tcgtgggtct gggagtgcag agtgtgaagc gccgccgctg ctattaaacc taccgtagcg 22140
cttaacttgc ttgtctgtgt gtgtatgtat tatgtcgccg ccgccgctgt ccaccagaag 22200
gaggagtgaa gaggcgcgtc gccgagttgc aagatggcca ccccatcgat gctgccccag 22260
tgggcgtaca tgcacatcgc cggacaggac gcttcggagt acctgagtcc gggtctggtg 22320
cagtttgccc gcgccacaga cacctacttc agtctgggga acaagtttag gaaccccacg 22380
gtggcgccca cgcacgatgt gaccaccgac cgcagccagc ggctgacgct gcgcttcgtg 22440
cccgtggacc gcgaggacaa cacctactcg tacaaagtgc gctacacgct ggccgtgggc 22500
gacaaccgcg tgctggacat ggccagcacc tactttgaca tccgcggcgt gctggatcgg 22560
ggccctagct tcaaacccta ctccggcacc gcctacaaca gtctggcccc caagggagca 22620
cccaacactt gtcagtggac atataaagcc gatggtgaaa ctgccacaga aaaaacctat 22680
acatatggaa atgcacccgt gcagggcatt aacatcacaa aagatggtat tcaacttgga 22740
actgacaccg atgatcagcc aatctacgca gataaaacct atcagcctga acctcaagtg 22800
ggtgatgctg aatggcatga catcactggt actgatgaaa agtatggagg cagagctctt 22860
aagcctgata ccaaaatgaa gccttgttat ggttcttttg ccaagcctac taataaagaa 22920
ggaggtcagg caaatgtgaa aacaggaaca ggcactacta aagaatatga catagacatg 22980
gctttctttg acaacagaag tgcggctgct gctggcctag ctccagaaat tgttttgtat 23040
actgaaaatg tggatttgga aactccagat acccatattg tatacaaagc aggcacagat 23100
gacagcagct cttctattaa tttgggtcag caagccatgc ccaacagacc taactacatt 23160
ggtttcagag acaactttat cgggctcatg tactacaaca gcactggcaa tatgggggtg 23220
ctggccggtc aggcttctca gctgaatgct gtggttgact tgcaagacag aaacaccgag 23280
ctgtcctacc agctcttgct tgactctctg ggtgacagaa cccggtattt cagtatgtgg 23340
aatcaggcgg tggacagcta tgatcctgat gtgcgcatta ttgaaaatca tggtgtggag 23400
gatgaacttc ccaactattg tttccctctg gatgctgttg gcagaacaga tacttatcag 23460
ggaattaagg ctaatggaac tgatcaaacc acatggacca aagatgacag tgtcaatgat 23520
gctaatgaga taggcaaggg taatccattc gccatggaaa tcaacatcca agccaacctg 23580
tggaggaact tcctctacgc caacgtggcc ctgtacctgc ccgactctta caagtacacg 23640
ccggccaatg ttaccctgcc caccaacacc aacacctacg attacatgaa cggccgggtg 23700
gtggcgccct cgctggtgga ctcctacatc aacatcgggg cgcgctggtc gctggatccc 23760
atggacaacg tgaacccctt caaccaccac cgcaatgcgg ggctgcgcta ccgctccatg 23820
ctcctgggca acgggcgcta cgtgcccttc cacatccagg tgccccagaa atttttcgcc 23880
atcaagagcc tcctgctcct gcccgggtcc tacacctacg agtggaactt ccgcaaggac 23940
gtcaacatga tcctgcagag ctccctcggc aacgacctgc gcacggacgg ggcctccatc 24000
tccttcacca gcatcaacct ctacgccacc ttcttcccca tggcgcacaa cacggcctcc 24060
acgctcgagg ccatgctgcg caacgacacc aacgaccagt ccttcaacga ctacctctcg 24120
gcggccaaca tgctctaccc catcccggcc aacgccacca acgtgcccat ctccatcccc 24180
tcgcgcaact gggccgcctt ccgcggctgg tccttcacgc gtctcaagac caaggagacg 24240
ccctcgctgg gctccgggtt cgacccctac ttcgtctact cgggctccat cccctacctc 24300
gacggcacct tctacctcaa ccacaccttc aagaaggtct ccatcacctt cgactcctcc 24360
gtcagctggc ccggcaacga ccggctcctg acgcccaacg agttcgaaat caagcgcacc 24420
gtcgacggcg agggctacaa cgtggcccag tgcaacatga ccaaggactg gttcctggtc 24480
cagatgctgg cccactacaa catcggctac cagggcttct acgtgcccga gggctacaag 24540
gaccgcatgt actccttctt ccgcaacttc cagcccatga gccgccaggt ggtggacgag 24600
gtcaactaca aggactacca ggccgtcacc ctggcctacc agcacaacaa ctcgggcttc 24660
gtcggctacc tcgcgcccac catgcgccag ggccagccct accccgccaa ctacccctac 24720
ccgctcatcg gcaagagcgc cgtcaccagc gtcacccaga aaaagttcct ctgcgacagg 24780
gtcatgtggc gcatcccctt ctccagcaac ttcatgtcca tgggcgcgct caccgacctc 24840
ggccagaaca tgctctatgc caactccgcc cacgcgctag acatgaattt cgaagtcgac 24900
cccatggatg agtccaccct tctctatgtt gtcttcgaag tcttcgacgt cgtccgagtg 24960
caccagcccc accgcggcgt catcgaggcc gtctacctgc gcaccccctt ctcggccggt 25020
aacgccacca cctaagctct tgcttcttgc aagccatggc cgcgggctcc ggcgagcagg 25080
agctcagggc catcatccgc gacctgggct gcgggcccta cttcctgggc accttcgata 25140
agcgcttccc gggattcatg gccccgcaca agctggcctg cgccatcgtc aacacggccg 25200
gccgcgagac cgggggcgag cactggctgg ccttcgcctg gaacccgcgc tcgaacacct 25260
gctacctctt cgaccccttc gggttctcgg acgagcgcct caagcagatc taccagttcg 25320
agtacgaggg cctgctgcgc cgcagcgccc tggccaccga ggaccgctgc gtcaccctgg 25380
aaaagtccac ccagaccgtg cagggtccgc gctcggccgc ctgcgggctc ttctgctgca 25440
tgttcctgca cgccttcgtg cactggcccg accgccccat ggacaagaac cccaccatga 25500
acttgctgac gggggtgccc aacggcatgc tccagtcgcc ccaggtggaa cccaccctgc 25560
gccgcaacca ggaggcgctc taccgcttcc tcaactccca ctccgcctac tttcgctccc 25620
accgcgcgcg catcgagaag gccaccgcct tcgaccgcat gaatcaagac atgtaaaccg 25680
tgtgtgtatg ttaaatgtct ttaataaaca gcactttcat gttacacatg catctgagat 25740
gatttattta gaaatcgaaa gggttctgcc gggtctcggc atggcccgcg ggcagggaca 25800
cgttgcggaa ctggtacttg gccagccact tgaactcggg gatcagcagt ttgggcagcg 25860
gggtgtcggg gaaggagtcg gtccacagct tccgcgtcag ttgcagggcg cccagcaggt 25920
cgggcgcgga gatcttgaaa tcgcagttgg gacccgcgtt ctgcgcgcgg gagttgcggt 25980
acacggggtt gcagcactgg aacaccatca gggccgggtg cttcacgctc gccagcaccg 26040
tcgcgtcggt gatgctctcc acgtcgaggt cctcggcgtt ggccatcccg aagggggtca 26100
tcttgcaggt ctgccttccc atggtgggca cgcacccggg cttgtggttg caatcgcagt 26160
gcagggggat cagcatcatc tgggcctggt cggcgttcat ccccgggtac atggccttca 26220
tgaaagcctc caattgcctg aacgcctgct gggccttggc tccctcggtg aagaagaccc 26280
cgcaggactt gctagagaac tggttggtgg cgcacccggc gtcgtgcacg cagcagcgcg 26340
cgtcgttgtt ggccagctgc accacgctgc gcccccagcg gttctgggtg atcttggccc 26400
ggtcggggtt ctccttcagc gcgcgctgcc cgttctcgct cgccacatcc atctcgatca 26460
tgtgctcctt ctggatcatg gtggtcccgt gcaggcaccg cagcttgccc tcggcctcgg 26520
tgcacccgtg cagccacagc gcgcacccgg tgcactccca gttcttgtgg gcgatctggg 26580
aatgcgcgtg cacgaagccc tgcaggaagc ggcccatcat ggtggtcagg gtcttgttgc 26640
tagtgaaggt cagcggaatg ccgcggtgct cctcgttgat gtacaggtgg cagatgcggc 26700
ggtacacctc gccctgctcg ggcatcagct ggaagttggc tttcaggtcg gtctccacgc 26760
ggtagcggtc catcagcata gtcatgattt ccataccctt ctcccaggcc gagacgatgg 26820
gcaggctcat agggttcttc accatcatct tagcgctagc agccgcggcc agggggtcgc 26880
tctcgtccag ggtctcaaag ctccgcttgc cgtccttctc ggtgatccgc accggggggt 26940
agctgaagcc cacggccgcc agctcctcct cggcctgtct ttcgtcctcg ctgtcctggc 27000
tgacgtcctg caggaccaca tgcttggtct tgcggggttt cttcttgggc ggcagcggcg 27060
gcggagatgt tggagatggc gagggggagc gcgagttctc gctcaccact actatctctt 27120
cctcttcttg gtccgaggcc acgcggcggt aggtatgtct cttcgggggc agaggcggag 27180
gcgacgggct ctcgccgccg cgacttggcg gatggctggc agagcccctt ccgcgttcgg 27240
gggtgcgctc ccggcggcgc tctgactgac ttcctccgcg gccggccatt gtgttctcct 27300
agggaggaac aacaagcatg gagactcagc catcgccaac ctcgccatct gcccccaccg 27360
ccgacgagaa gcagcagcag cagaatgaaa gcttaaccgc cccgccgccc agccccgcca 27420
cctccgacgc ggccgtccca gacatgcaag agatggagga atccatcgag attgacctgg 27480
gctatgtgac gcccgcggag cacgaggagg agctggcagt gcgcttttca caagaagaga 27540
tacaccaaga acagccagag caggaagcag agaatgagca gagtcaggct gggctcgagc 27600
atgacggcga ctacctccac ctgagcgggg gggaggacgc gctcatcaag catctggccc 27660
ggcaggccac catcgtcaag gatgcgctgc tcgaccgcac cgaggtgccc ctcagcgtgg 27720
aggagctcag ccgcgcctac gagttgaacc tcttctcgcc gcgcgtgccc cccaagcgcc 27780
agcccaatgg cacctgcgag cccaacccgc gcctcaactt ctacccggtc ttcgcggtgc 27840
ccgaggccct ggccacctac cacatctttt tcaagaacca aaagatcccc gtctcctgcc 27900
gcgccaaccg cacccgcgcc gacgcccttt tcaacctggg tcccggcgcc cgcctacctg 27960
atatcgcctc cttggaagag gttcccaaga tcttcgaggg tctgggcagc gacgagactc 28020
gggccgcgaa cgctctgcaa ggagaaggag gagagcatga gcaccacagc gccctggtcg 28080
agttggaagg cgacaacgcg cggctggcgg tgctcaaacg cacggtcgag ctgacccatt 28140
tcgcctaccc ggctctgaac ctgcccccca aagtcatgag cgcggtcatg gaccaggtgc 28200
tcatcaagcg cgcgtcgccc atctccgagg acgagggcat gcaagactcc gaggagggca 28260
agcccgtggt cagcgacgag cagctggccc ggtggctggg tcctaatgct agtccccaga 28320
gtttggaaga gcggcgcaaa ctcatgatgg ccgtggtcct ggtgaccgtg gagctggagt 28380
gcctgcgccg cttcttcgcc gacgcggaga ccctgcgcaa ggtcgaggag aacctgcact 28440
acctcttcag gcacgggttc gtgcgccagg cctgcaagat ctccaacgtg gagctgacca 28500
acctggtctc ctacatgggc atcttgcacg agaaccgcct ggggcagaac gtgctgcaca 28560
ccaccctgcg cggggaggcc cggcgcgact acatccgcga ctgcgtctac ctctacctct 28620
gccacacctg gcagacgggc atgggcgtgt ggcagcagtg tctggaggag cagaacctga 28680
aagagctctg caagctcctg cagaagaacc tcaagggtct gtggaccggg ttcgacgagc 28740
gcaccaccgc ctcggacctg gccgacctca ttttccccga gcgcctcagg ctgacgctgc 28800
gcaacggcct gcccgacttt atgagccaaa gcatgttgca aaactttcgc tctttcatcc 28860
tcgaacgctc cggaatcctg cccgccacct gctccgcgct gccctcggac ttcgtgccgc 28920
tgaccttccg cgagtgcccc ccgccgctgt ggagccactg ctacctgctg cgcctggcca 28980
actacctggc ctaccactcg gacgtgatcg aggacgtcag cggcgagggc ctgctcgagt 29040
gccactgccg ctgcaacctc tgcacgccgc accgctccct ggcctgcaac ccccagctgc 29100
tgagcgagac ccagatcatc ggcaccttcg agttgcaagg gcccagcgaa ggcgagggtt 29160
cagccgccaa ggggggtctg aaactcaccc cggggctgtg gacctcggcc tacttgcgca 29220
agttcgtgcc cgaggactac catcccttcg agatcaggtt ctacgaggac caatcccatc 29280
cgcccaaggc cgagctgtcg gcctgcgtca tcacccaggg ggcgatcctg gcccaattgc 29340
aagccatcca gaaatcccgc caagaattct tgctgaaaaa gggccgcggg gtctacctcg 29400
acccccagac cggtgaggag ctcaaccccg gcttccccca ggatgccccg aggaaacaag 29460
aagctgaaag tggagctgcc gcccgtggag gatttggagg aagactggga gaacagcagt 29520
caggcagagg aggaggagat ggaggaagac tgggacagca ctcaggcaga ggaggacagc 29580
ctgcaagaca gtctggagga agacgaggag gaggcagagg aggaggtgga agaagcagcc 29640
gccgccagac cgtcgtcctc ggcgggggag aaagcaagca gcacggatac catctccgct 29700
ccgggtcggg gtcccgctcg accacacagt agatgggacg agaccggacg attcccgaac 29760
cccaccaccc agaccggtaa gaaggagcgg cagggataca agtcctggcg ggggcacaaa 29820
aacgccatcg tctcctgctt gcaggcctgc gggggcaaca tctccttcac ccggcgctac 29880
ctgctcttcc accgcggggt gaactttccc cgcaacatct tgcattacta ccgtcacctc 29940
cacagcccct actacttcca agaagaggca gcagcagcag aaaaagacca gcagaaaacc 30000
agcagctaga aaatccacag cggcggcagc aggtggactg aggatcgcgg cgaacgagcc 30060
ggcgcaaacc cgggagctga ggaaccggat ctttcccacc ctctatgcca tcttccagca 30120
gagtcggggg caggagcagg aactgaaagt caagaaccgt tctctgcgct cgctcacccg 30180
cagttgtctg tatcacaaga gcgaagacca acttcagcgc actctcgagg acgccgaggc 30240
tctcttcaac aagtactgcg cgctcactct taaagagtag cccgcgcccg cccagtcgca 30300
gaaaaaggcg ggaattacgt cacctgtgcc cttcgcccta gccgcctcca cccatcatca 30360
tgagcaaaga gattcccacg ccttacatgt ggagctacca gccccagatg ggcctggccg 30420
ccggtgccgc ccaggactac tccacccgca tgaattggct cagcgccggg cccgcgatga 30480
tctcacgggt gaatgacatc cgcgcccacc gaaaccagat actcctagaa cagtcagcgc 30540
tcaccgccac gccccgcaat cacctcaatc cgcgtaattg gcccgccgcc ctggtgtacc 30600
aggaaattcc ccagcccacg accgtactac ttccgcgaga cgcccaggcc gaagtccagc 30660
tgactaactc aggtgtccag ctggcgggcg gcgccaccct gtgtcgtcac cgccccgctc 30720
agggtataaa gcggctggtg atccggggca gaggcacaca gctcaacgac gaggtggtga 30780
gctcttcgct gggtctgcga cctgacggag tcttccaact cgccggatcg gggagatctt 30840
ccttcacgcc tcgtcaggcc gtcctgactt tggagagttc gtcctcgcag ccccgctcgg 30900
gtggcatcgg cactctccag ttcgtggagg agttcactcc ctcggtctac ttcaacccct 30960
tctccggctc ccccggccac tacccggacg agttcatccc gaacttcgac gccatcagcg 31020
agtcggtgga cggctacgat tgaatgtccc atggtggcgc agctgaccta gctcggcttc 31080
gacacctgga ccactgccgc cgcttccgct gcttcgctcg ggatctcgcc gagtttgcct 31140
actttgagct gcccgaggag caccctcagg gcccggccca cggagtgcgg atcgtcgtcg 31200
aagggggcct cgactcccac ctgcttcgga tcttcagcca gcgtccgatc ctggtcgagc 31260
gcgagcaagg acagaccctt ctgactctgt actgcatctg caaccacccc ggcctgcatg 31320
aaagtctttg ttgtctgctg tgtactgagt ataataaaag ctgagatcag cgactactcc 31380
ggacttccgt gtgtttaaac tcaccccctt atccagtgaa ataaagatca tattgatgat 31440
gattttacag aaataaaaaa taatcatttg atttgaaata aagatacaat catattgatg 31500
atttgagttt aacaaaaaaa taaagaatca cttacttgaa atctgatacc aggtctctgt 31560
ccatgttttc tgccaacacc acttcactcc cctcttccca gctctggtac tgcaggcccc 31620
ggcgggctgc aaacttcctc cacacgctga aggggatgtc aaattcctcc tgtccctcaa 31680
tcttcatttt atcttctatc agatgtccaa aaagcgcgtc cgggtggatg atgacttcga 31740
ccccgtctac ccctacgatg cagacaacgc accgaccgtg cccttcatca accccccctt 31800
cgtctcttca gatggattcc aagagaagcc cctgggggtg ttgtccctgc gactggccga 31860
ccccgtcacc accaagaacg gggaaatcac cctcaagctg ggagaggggg tggacctcga 31920
ttcctcggga aaactcatct ccaacacggc caccaaggcc gccgcccctc tcagtttttc 31980
caacaacacc atttccctta acatggatca ccccttttac actaaagatg gaaaattatc 32040
cttacaagtt tctccaccat taaatatact gagaacaagc attctaaaca cactagcttt 32100
aggttttgga tcaggtttag gactccgtgg ctctgccttg gcagtacagt tagtctctcc 32160
acttacattt gatactgatg gaaacataaa gcttacctta gacagaggtt tgcatgttac 32220
aacaggagat gcaattgaaa gcaacataag ctgggctaaa ggtttaaaat ttgaagatgg 32280
agccatagca accaacattg gaaatgggtt agagtttgga agcagtagta cagaaacagg 32340
tgttgatgat gcttacccaa tccaagttaa acttggatct ggccttagct ttgacagtac 32400
aggagccata atggctggta acaaagaaga cgataaactc actttgtgga caacacctga 32460
tccatcacca aactgtcaaa tactcgcaga aaatgatgca aaactaacac tttgcttgac 32520
taaatgtggt agtcaaatac tggccactgt gtcagtctta gttgtaggaa gtggaaacct 32580
aaaccccatt actggcaccg taagcagtgc tcaggtgttt ctacgttttg atgcaaacgg 32640
tgttctttta acagaacatt ctacactaaa aaaatactgg gggtataggc agggagatag 32700
catagatggc actccatata ccaatgctgt aggattcatg cccaatttaa aagcttatcc 32760
aaagtcacaa agttctacta ctaaaaataa tatagtaggg caagtataca tgaatggaga 32820
tgtttcaaaa cctatgcttc tcactataac cctcaatggt actgatgaca gcaacagtac 32880
atattcaatg tcattttcat acacctggac taatggaagc tatgttggag caacatttgg 32940
ggctaactct tataccttct catacatcgc ccaagaatga acactgtatc ccaccctgca 33000
tgccaaccct tcccacccca ctctgtggaa caaactctga aacacaaaat aaaataaagt 33060
tcaagtgttt tattgattca acagttttac aggattcgag cagttatttt tcctccaccc 33120
tcccaggaca tggaatacac caccctctcc ccccgcacag ccttgaacat ctgaatgcca 33180
ttggtgatgg acatgctttt ggtctccacg ttccacacag tttcagagcg agccagtctc 33240
gggtcggtca gggagatgaa accctccggg cactcccgca tctgcacctc acagctcaac 33300
agctgaggat tgtcctcggt ggtcgggatc acggttatct ggaagaagca gaagagcggc 33360
ggtgggaatc atagtccgcg aacgggatcg gccggtggtg tcgcatcagg ccccgcagca 33420
gtcgctgccg ccgccgctcc gtcaagctgc tgctcagggg gtccgggtcc agggactccc 33480
tcagcatgat gcccacggcc ctcagcatca gtcgtctggt gcggcgggcg cagcagcgca 33540
tgcggatctc gctcaggtcg ctgcagtacg tgcaacacag aaccaccagg ttgttcaaca 33600
gtccatagtt caacacgctc cagccgaaac tcatcgcggg aaggatgcta cccacgtggc 33660
cgtcgtacca gatcctcagg taaatcaagt ggtgccccct ccagaacacg ctgcccacgt 33720
acatgatctc cttgggcatg tggcggttca ccacctcccg gtaccacatc accctctggt 33780
tgaacatgca gccccggatg atcctgcgga accacagggc cagcaccgcc ccgcccgcca 33840
tgcagcgaag agaccccggg tcccggcaat ggcaatggag gacccaccgc tcgtacccgt 33900
ggatcatctg ggagctgaac aagtctatgt tggcacagca caggcatatg ctcatgcatc 33960
tcttcagcac tctcaactcc tcgggggtca aaaccatatc ccagggcacg gggaactctt 34020
gcaggacagc gaaccccgca gaacagggca atcctcgcac agaacttaca ttgtgcatgg 34080
acagggtatc gcaatcaggc agcaccgggt gatcctccac cagagaagcg cgggtctcgg 34140
tctcctcaca gcgtggtaag ggggccggcc gatacgggtg atggcgggac gcggctgatc 34200
gtgttcgcga ccgtgtcatg atgcagttgc tttcggacat tttcgtactt gctgtagcag 34260
aacctggtcc gggcgctgca caccgatcgc cggcggcggt ctcggcgctt ggaacgctcg 34320
gtgttgaaat tgtaaaacag ccactctctc agaccgtgca gcagatctag ggcctcagga 34380
gtgatgaaga tcccatcatg cctgatggct ctgatcacat cgaccaccgt ggaatgggcc 34440
agacccagcc agatgatgca attttgttgg gtttcggtga cggcggggga gggaagaaca 34500
ggaagaacca tgattaactt ttaatccaaa cggtctcgga gtacttcaaa atgaagatcg 34560
cggagatggc acctctcgcc cccgctgtgt tggtggaaaa taacagccag gtcaaaggtg 34620
atacggttct cgagatgttc cacggtggct tccagcaaag cctccacgcg cacatccaga 34680
aacaagacaa tagcgaaagc gggagggttc tctaattcct caatcatcat gttacactcc 34740
tgcaccatcc ccagataatt ttcatttttc cagccttgaa tgattcgaac tagttcctga 34800
ggtaaatcca agccagccat gataaagagc tcgcgcagag cgccctccac cggcattctt 34860
aagcacaccc tcataattcc aagatattct gctcctggtt cacctgcagc agattgacaa 34920
gcggaatatc aaaatctctg ccgcgatccc tgagctcctc cctcagcaat aactgtaagt 34980
actctttcat atcctctccg aaatttttag ccataggacc accaggaata agattagggc 35040
aagccacagt acagataaac cgaagtcctc cccagtgagc attgccaaat gcaagactgc 35100
tataagcatg ctggctagac ccggtgatat cttccagata actggacaga aaatcgccca 35160
ggcaattttt aagaaaatca acaaaagaaa aatcctccag gtggacgttt agagcctcgg 35220
gaacaacgat gaagtaaatg caagcggtgc gttccagcat ggttagttag ctgatctgta 35280
gaaaaaacaa aaatgaacat taaaccatgc tagcctggcg aacaggtggg taaatcgttc 35340
tctccagcac caggcaggcc acggggtctc cggcgcgacc ctcgtaaaaa ttgtcgctat 35400
gattgaaaac catcacagag agacgttccc ggtggccggc gtgaatgatt cgacaagatg 35460
aatacacccc cggaacattg gcgtccgcga gtgaaaaaaa gcgcccgagg aagcaataag 35520
gcactacaat gctcagtctc aagtccagca aagcgatgcc atgcggatga agcacaaaat 35580
tctcaggtgc gtacaaaatg taattactcc cctcctgcac aggcagcaaa gcccccgatc 35640
cctccaggta cacatacaaa gcctcagcgt ccatagctta ccgagcagca gcacacaaca 35700
ggcgcaagag tcagagaaag gctgagctct aacctgtcca cccgctctct gctcaatata 35760
tagcccagat ctacactgac gtaaaggcca aagtctaaaa atacccgcca aataatcaca 35820
cacgcccagc acacgcccag aaaccggtga cacactcaaa aaaatacgcg cacttcctca 35880
aacgcccaaa actgccgtca tttccgggtt cccacgctac gtcatcaaaa cacgactttc 35940
aaattccgtc gaccgttaaa aacgtcaccc gccccgcccc taacggtcgc ccgtctctca 36000
gccaatcagc gccccgcatc cccaaattca aacacctcat ttgcatatta acgcgcacaa 36060
aaagtttgag gtatattatt gatgatgg 36088
<210> 69
<211> 7011
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 69
ggcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 60
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 120
agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 180
tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg 240
tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat 300
ggcaaaagct tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca 360
tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 420
aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcaaatgcaa ccggcgcagg 480
aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg 540
aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 600
aaatgcttga tggtcggaag aggcataaat tccgtcagcc agtttagtct gaccatctca 660
tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg 720
ggcttcccat acaatcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat 780
ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga gcaagacgtt 840
tcccgttgaa tatggctcat aacacccctt gtattactgt ttatgtaagc agacaggtcg 900
acaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg tacatttata 960
ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt tattaatagt 1020
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 1080
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 1140
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 1200
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt ccgcccccta 1260
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttacggg 1320
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 1380
tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 1440
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 1500
gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 1560
atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 1620
ttgacctcca tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 1680
gaacgcggat tccccgtgcc aagagtgact caccgtccgg atctcagcaa gcaggtatgt 1740
actctccagg gtgggcctgg cttccccagt caagactcca gggatttgag ggacgctgtg 1800
ggctcttctc ttacatgtac cttttgcttg cctcaaccct gactatcttc caggtcagga 1860
tcccagagtc aggggtctgt attttcctgc tggtggctcc agttcaggaa cagtaaaccc 1920
tgctccgaat attgcctctc acatctcgtc aatctccgcg aggactgggg accctgtgac 1980
gaacatggct agcacccctg gaacccagag ccccttcttc cttctgctgc tgctgaccgt 2040
gctgactgtc gtgacaggct ctggccacgc cagctctaca cctggcggcg agaaagagac 2100
aagcgccacc cagagaagca gcgtgccaag cagcaccgag aagaacgccg tgtccatgac 2160
cagctccgtg ctgagcagcc actctcctgg cagcggcagc agcacaacac agggccagga 2220
tgtgacactg gcccctgcca cagaacctgc ctctggatct gccgccacct ggggacagga 2280
cgtgacaagc gtgccagtga ccagacctgc cctgggctct acaacacccc ctgcccacga 2340
tgtgaccagc gcccctgata acaagcctgc ccctggaagc acagcccctc cagctcatgg 2400
cgtgacctct gccccagata ccagaccagc cccaggatct acagccccac ccgcacacgg 2460
cgtgacaagt gcccctgaca caagacccgc tccaggctct actgctcctc ctgcccatgg 2520
cgtgacaagc gctcccgata caaggccagc tcctggctcc acagcaccac cagcacatgg 2580
cgtgacatca gctcccgaca ctagacctgc tcccggatca accgctccac cagctcacgg 2640
cgtgaccagc gcacctgata ccagacctgc tctgggaagc accgcccctc ccgtgcacaa 2700
tgtgacatct gcttccggca gcgccagcgg ctctgcctct acactggtgc acaacggcac 2760
cagcgccaga gccacaacaa ccccagccag caagagcacc cccttcagca tccctagcca 2820
ccacagcgac acccctacca cactggccag ccactccacc aagaccgatg cctctagcac 2880
ccaccactcc agcgtgcccc ctctgaccag cagcaaccac agcacaagcc cccagctgtc 2940
taccggcgtc tcattcttct ttctgtcctt ccacatcagc aacctgcagt tcaacagcag 3000
cctggaagat cccagcaccg actactacca ggaactgcag cgggatatca gcgagatgtt 3060
cctgcaaatc tacaagcagg gcggcttcct gggcctgagc aacatcaagt tcagacccgg 3120
cagcgtggtg gtgcagctga ccctggcttt ccgggaaggc accatcaacg tgcacgacgt 3180
ggaaacccag ttcaaccagt acaagaccga ggccgccagc cggtacaacc tgaccatctc 3240
cgatgtgtcc gtgtccgacg tgcccttccc attctctgcc cagtctggcg caggcgtgcc 3300
aggatgggga attgctctgc tggtgctcgt gtgcgtgctg gtggccctgg ccatcgtgta 3360
tctgattgcc ctggccgtgt gccagtgccg gcggaagaat tacggccagc tggacatctt 3420
ccccgccaga gacacctacc accccatgag cgagtacccc acataccaca cccacggcag 3480
atacgtgcca cccagctcca ccgacagatc cccctacgag aaagtgtctg ccggcaacgg 3540
cggcagctcc ctgagctaca caaatcctgc cgtggccgct gcctccgcca acctgggatc 3600
cggcagaatc ttcaacgccc actacgccgg ctacttcgcc gacctgctga tccacgacat 3660
cgagacaaac cctggccccg aatcgccaag cgcaccccct catcggtggt gcatcccttg 3720
gcaacgcctc ctcctgaccg cctcactgct gactttctgg aacccgccga ccaccgcaaa 3780
gctgaccatt gagagcactc ccttcaacgt ggctgagggg aaggaggtgc tgctcctggt 3840
gcacaatctg ccccagcacc tgttcgggta ctcctggtac aagggagaac gcgtggacgg 3900
gaaccggcag atcataggct acgtcatcgg aacccagcag gccacacccg gtccagcgta 3960
cagcggccgg gagattatct acccgaacgc ctccctgctg atccaaaaca tcatccagaa 4020
cgacaccggt ttctacactc tgcacgtgat taagtcagat ctggtcaacg aagaggccac 4080
cggccaattc agggtgtacc ccgaactccc taagccgttc atcacctcga acaacagcaa 4140
cccggtcgag gatgaagatg cggtggcctt gacgtgcgaa cctgagatcc agaacaccac 4200
ctacttgtgg tgggtgaaca atcagagcct gccagtctcc ccacgactcc agctgtcgaa 4260
cgacaacagg accctgactt tgctgtccgt gactcggaac gacgtgggcc cttatgaatg 4320
cggtatccag aacaagctgt ccgtggacca cagcgaccct gtgatcctga acgtccttta 4380
cgggccggac gaccccacca tttccccgtc gtacacttac taccggccgg gcgtgaacct 4440
gtccctgtcg tgccacgctg cctccaatcc gccggcccag tactcctggc tcatcgacgg 4500
aaacatccag cagcacaccc aagaactgtt catctccaac attaccgaga aaaactcggg 4560
actttacacc tgtcaagcca acaattccgc cagcggccac tcccgcacca ctgtcaaaac 4620
tatcactgtg tccgccgaac tcccgaagcc cagcatcagc tccaacaact cgaagcccgt 4680
ggaggataag gacgctgtcg cgttcacctg tgaaccagag gcacagaata ccacctacct 4740
ttggtgggtc aacggacagt ccctgcctgt ctcaccgaga ctgcagctgt caaacgggaa 4800
taggactctg accttgttta acgtcacccg gaacgacgcc cgggcctacg tgtgcggcat 4860
ccagaactcc gtgagcgcaa accggtctga cccagtgacc ctggatgtgc tgtacggccc 4920
cgacactccg atcatttcac cccccgattc atcctacctg tccggcgcta acctcaacct 4980
ctcatgccac tccgcatcca accccagccc gcaatattcg tggcgcatta acggaattcc 5040
tcagcaacat acccaggtcc tgttcattgc gaagatcacc cctaacaaca acggaaccta 5100
cgcctgcttt gtgtcaaacc tggccactgg tagaaacaac tccatcgtga agtccattac 5160
cgtgtcggcg tccggaactt ccccgggcct gagcgccggc gccaccgtgg gaattatgat 5220
cggcgtgctc gtgggagtgg ccctgatctg aagatctggg ccctaacaaa acaaaaagat 5280
ggggttattc cctaaacttc atgggttacg taattggaag ttgggggaca ttgccacaag 5340
atcatattgt acaaaagatc aaacactgtt ttagaaaact tcctgtaaac aggcctattg 5400
attggaaagt atgtcaaagg attgtgggtc ttttgggctt tgctgctcca tttacacaat 5460
gtggatatcc tgccttaatg cctttgtatg catgtataca agctaaacag gctttcactt 5520
tctcgccaac ttacaaggcc tttctaagta aacagtacat gaacctttac cccgttgctc 5580
ggcaacggcc tggtctgtgc caagtgtttg ctgacgcaac ccccactggc tggggcttgg 5640
ccataggcca tcagcgcatg cgtggaacct ttgtggctcc tctgccgatc catactgcgg 5700
aactcctagc cgcttgtttt gctcgcagcc ggtctggagc aaagctcata ggaactgaca 5760
attctgtcgt cctctcgcgg aaatatacat cgtttcgatc tacgtatgat ctttttccct 5820
ctgccaaaaa ttatggggac atcatgaagc cccttgagca tctgacttct ggctaataaa 5880
ggaaatttat tttcattgca atagtgtgtt ggaatttttt gtgtctctca ctcggaagga 5940
attctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct 6000
tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca 6060
gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac 6120
atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt 6180
ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg 6240
cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc 6300
tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc 6360
gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc 6420
aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac 6480
tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt 6540
aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct 6600
aactacggct acactagaag aacagtattt ggtatctgcg ctctgctgaa gccagttacc 6660
ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt 6720
ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg 6780
atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc 6840
atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa 6900
tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag 6960
gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact c 7011
<210> 70
<211> 6990
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 70
ggcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 60
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 120
agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 180
tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg 240
tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat 300
ggcaaaagct tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca 360
tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 420
aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcaaatgcaa ccggcgcagg 480
aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg 540
aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 600
aaatgcttga tggtcggaag aggcataaat tccgtcagcc agtttagtct gaccatctca 660
tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg 720
ggcttcccat acaatcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat 780
ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga gcaagacgtt 840
tcccgttgaa tatggctcat aacacccctt gtattactgt ttatgtaagc agacaggtcg 900
acaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg tacatttata 960
ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt tattaatagt 1020
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 1080
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 1140
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 1200
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt ccgcccccta 1260
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttacggg 1320
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 1380
tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 1440
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 1500
gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 1560
atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 1620
ttgacctcca tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 1680
gaacgcggat tccccgtgcc aagagtgact caccgtccgg atctcagcaa gcaggtatgt 1740
actctccagg gtgggcctgg cttccccagt caagactcca gggatttgag ggacgctgtg 1800
ggctcttctc ttacatgtac cttttgcttg cctcaaccct gactatcttc caggtcagga 1860
tcccagagtc aggggtctgt attttcctgc tggtggctcc agttcaggaa cagtaaaccc 1920
tgctccgaat attgcctctc acatctcgtc aatctccgcg aggactgggg accctgtgac 1980
gaacatggct agcgaatcgc caagcgcacc ccctcatcgg tggtgcatcc cttggcaacg 2040
cctcctcctg accgcctcac tgctgacttt ctggaacccg ccgaccaccg caaagctgac 2100
cattgagagc actcccttca acgtggctga ggggaaggag gtgctgctcc tggtgcacaa 2160
tctgccccag cacctgttcg ggtactcctg gtacaaggga gaacgcgtgg acgggaaccg 2220
gcagatcata ggctacgtca tcggaaccca gcaggccaca cccggtccag cgtacagcgg 2280
ccgggagatt atctacccga acgcctccct gctgatccaa aacatcatcc agaacgacac 2340
cggtttctac actctgcacg tgattaagtc agatctggtc aacgaagagg ccaccggcca 2400
attcagggtg taccccgaac tccctaagcc gttcatcacc tcgaacaaca gcaacccggt 2460
cgaggatgaa gatgcggtgg ccttgacgtg cgaacctgag atccagaaca ccacctactt 2520
gtggtgggtg aacaatcaga gcctgccagt ctccccacga ctccagctgt cgaacgacaa 2580
caggaccctg actttgctgt ccgtgactcg gaacgacgtg ggcccttatg aatgcggtat 2640
ccagaacaag ctgtccgtgg accacagcga ccctgtgatc ctgaacgtcc tttacgggcc 2700
ggacgacccc accatttccc cgtcgtacac ttactaccgg ccgggcgtga acctgtccct 2760
gtcgtgccac gctgcctcca atccgccggc ccagtactcc tggctcatcg acggaaacat 2820
ccagcagcac acccaagaac tgttcatctc caacattacc gagaaaaact cgggacttta 2880
cacctgtcaa gccaacaatt ccgccagcgg ccactcccgc accactgtca aaactatcac 2940
tgtgtccgcc gaactcccga agcccagcat cagctccaac aactcgaagc ccgtggagga 3000
taaggacgct gtcgcgttca cctgtgaacc agaggcacag aataccacct acctttggtg 3060
ggtcaacgga cagtccctgc ctgtctcacc gagactgcag ctgtcaaacg ggaataggac 3120
tctgaccttg tttaacgtca cccggaacga cgcccgggcc tacgtgtgcg gcatccagaa 3180
ctccgtgagc gcaaaccggt ctgacccagt gaccctggat gtgctgtacg gccccgacac 3240
tccgatcatt tcaccccccg attcatccta cctgtccggc gctaacctca acctctcatg 3300
ccactccgca tccaacccca gcccgcaata ttcgtggcgc attaacggaa ttcctcagca 3360
acatacccag gtcctgttca ttgcgaagat cacccctaac aacaacggaa cctacgcctg 3420
ctttgtgtca aacctggcca ctggtagaaa caactccatc gtgaagtcca ttaccgtgtc 3480
ggcgtccgga acttccccgg gcctgagcgc cggcgccacc gtgggaatta tgatcggcgt 3540
gctcgtggga gtggccctga tcggatccgg cgagggcaga ggcagcctgc tgacatgtgg 3600
cgacgtggaa gagaaccctg gccccacccc tggaacccag agccccttct tccttctgct 3660
gctgctgacc gtgctgactg tcgtgacagg ctctggccac gccagctcta cacctggcgg 3720
cgagaaagag acaagcgcca cccagagaag cagcgtgcca agcagcaccg agaagaacgc 3780
cgtgtccatg accagctccg tgctgagcag ccactctcct ggcagcggca gcagcacaac 3840
acagggccag gatgtgacac tggcccctgc cacagaacct gcctctggat ctgccgccac 3900
ctggggacag gacgtgacaa gcgtgccagt gaccagacct gccctgggct ctacaacacc 3960
ccctgcccac gatgtgacca gcgcccctga taacaagcct gcccctggaa gcacagcccc 4020
tccagctcat ggcgtgacct ctgccccaga taccagacca gccccaggat ctacagcccc 4080
acccgcacac ggcgtgacaa gtgcccctga cacaagaccc gctccaggct ctactgctcc 4140
tcctgcccat ggcgtgacaa gcgctcccga tacaaggcca gctcctggct ccacagcacc 4200
accagcacat ggcgtgacat cagctcccga cactagacct gctcccggat caaccgctcc 4260
accagctcac ggcgtgacca gcgcacctga taccagacct gctctgggaa gcaccgcccc 4320
tcccgtgcac aatgtgacat ctgcttccgg cagcgccagc ggctctgcct ctacactggt 4380
gcacaacggc accagcgcca gagccacaac aaccccagcc agcaagagca cccccttcag 4440
catccctagc caccacagcg acacccctac cacactggcc agccactcca ccaagaccga 4500
tgcctctagc acccaccact ccagcgtgcc ccctctgacc agcagcaacc acagcacaag 4560
cccccagctg tctaccggcg tctcattctt ctttctgtcc ttccacatca gcaacctgca 4620
gttcaacagc agcctggaag atcccagcac cgactactac caggaactgc agcgggatat 4680
cagcgagatg ttcctgcaaa tctacaagca gggcggcttc ctgggcctga gcaacatcaa 4740
gttcagaccc ggcagcgtgg tggtgcagct gaccctggct ttccgggaag gcaccatcaa 4800
cgtgcacgac gtggaaaccc agttcaacca gtacaagacc gaggccgcca gccggtacaa 4860
cctgaccatc tccgatgtgt ccgtgtccga cgtgcccttc ccattctctg cccagtctgg 4920
cgcaggcgtg ccaggatggg gaattgctct gctggtgctc gtgtgcgtgc tggtggccct 4980
ggccatcgtg tatctgattg ccctggccgt gtgccagtgc cggcggaaga attacggcca 5040
gctggacatc ttccccgcca gagacaccta ccaccccatg agcgagtacc ccacatacca 5100
cacccacggc agatacgtgc cacccagctc caccgacaga tccccctacg agaaagtgtc 5160
tgccggcaac ggcggcagct ccctgagcta cacaaatcct gccgtggccg ctgcctccgc 5220
caacctgtga agatctgggc cctaacaaaa caaaaagatg gggttattcc ctaaacttca 5280
tgggttacgt aattggaagt tgggggacat tgccacaaga tcatattgta caaaagatca 5340
aacactgttt tagaaaactt cctgtaaaca ggcctattga ttggaaagta tgtcaaagga 5400
ttgtgggtct tttgggcttt gctgctccat ttacacaatg tggatatcct gccttaatgc 5460
ctttgtatgc atgtatacaa gctaaacagg ctttcacttt ctcgccaact tacaaggcct 5520
ttctaagtaa acagtacatg aacctttacc ccgttgctcg gcaacggcct ggtctgtgcc 5580
aagtgtttgc tgacgcaacc cccactggct ggggcttggc cataggccat cagcgcatgc 5640
gtggaacctt tgtggctcct ctgccgatcc atactgcgga actcctagcc gcttgttttg 5700
ctcgcagccg gtctggagca aagctcatag gaactgacaa ttctgtcgtc ctctcgcgga 5760
aatatacatc gtttcgatct acgtatgatc tttttccctc tgccaaaaat tatggggaca 5820
tcatgaagcc ccttgagcat ctgacttctg gctaataaag gaaatttatt ttcattgcaa 5880
tagtgtgttg gaattttttg tgtctctcac tcggaaggaa ttctgcatta atgaatcggc 5940
caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac 6000
tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata 6060
cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa 6120
aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct 6180
gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa 6240
agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg 6300
cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca 6360
cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa 6420
ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg 6480
gtaagacacg acttatcgcc actggcagca gccactggta acaggattag cagagcgagg 6540
tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga 6600
acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc 6660
tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag 6720
attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac 6780
gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc 6840
ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag 6900
taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt 6960
ctatttcgtt catccatagt tgcctgactc 6990
<210> 71
<211> 7002
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 71
ggcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 60
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 120
agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 180
tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg 240
tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat 300
ggcaaaagct tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca 360
tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 420
aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcaaatgcaa ccggcgcagg 480
aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg 540
aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 600
aaatgcttga tggtcggaag aggcataaat tccgtcagcc agtttagtct gaccatctca 660
tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg 720
ggcttcccat acaatcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat 780
ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga gcaagacgtt 840
tcccgttgaa tatggctcat aacacccctt gtattactgt ttatgtaagc agacaggtcg 900
acaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg tacatttata 960
ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt tattaatagt 1020
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 1080
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 1140
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 1200
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt ccgcccccta 1260
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttacggg 1320
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 1380
tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 1440
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 1500
gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 1560
atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 1620
ttgacctcca tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 1680
gaacgcggat tccccgtgcc aagagtgact caccgtccgg atctcagcaa gcaggtatgt 1740
actctccagg gtgggcctgg cttccccagt caagactcca gggatttgag ggacgctgtg 1800
ggctcttctc ttacatgtac cttttgcttg cctcaaccct gactatcttc caggtcagga 1860
tcccagagtc aggggtctgt attttcctgc tggtggctcc agttcaggaa cagtaaaccc 1920
tgctccgaat attgcctctc acatctcgtc aatctccgcg aggactgggg accctgtgac 1980
gaacatggct agcgaatcgc caagcgcacc ccctcatcgg tggtgcatcc cttggcaacg 2040
cctcctcctg accgcctcac tgctgacttt ctggaacccg ccgaccaccg caaagctgac 2100
cattgagagc actcccttca acgtggctga ggggaaggag gtgctgctcc tggtgcacaa 2160
tctgccccag cacctgttcg ggtactcctg gtacaaggga gaacgcgtgg acgggaaccg 2220
gcagatcata ggctacgtca tcggaaccca gcaggccaca cccggtccag cgtacagcgg 2280
ccgggagatt atctacccga acgcctccct gctgatccaa aacatcatcc agaacgacac 2340
cggtttctac actctgcacg tgattaagtc agatctggtc aacgaagagg ccaccggcca 2400
attcagggtg taccccgaac tccctaagcc gttcatcacc tcgaacaaca gcaacccggt 2460
cgaggatgaa gatgcggtgg ccttgacgtg cgaacctgag atccagaaca ccacctactt 2520
gtggtgggtg aacaatcaga gcctgccagt ctccccacga ctccagctgt cgaacgacaa 2580
caggaccctg actttgctgt ccgtgactcg gaacgacgtg ggcccttatg aatgcggtat 2640
ccagaacaag ctgtccgtgg accacagcga ccctgtgatc ctgaacgtcc tttacgggcc 2700
ggacgacccc accatttccc cgtcgtacac ttactaccgg ccgggcgtga acctgtccct 2760
gtcgtgccac gctgcctcca atccgccggc ccagtactcc tggctcatcg acggaaacat 2820
ccagcagcac acccaagaac tgttcatctc caacattacc gagaaaaact cgggacttta 2880
cacctgtcaa gccaacaatt ccgccagcgg ccactcccgc accactgtca aaactatcac 2940
tgtgtccgcc gaactcccga agcccagcat cagctccaac aactcgaagc ccgtggagga 3000
taaggacgct gtcgcgttca cctgtgaacc agaggcacag aataccacct acctttggtg 3060
ggtcaacgga cagtccctgc ctgtctcacc gagactgcag ctgtcaaacg ggaataggac 3120
tctgaccttg tttaacgtca cccggaacga cgcccgggcc tacgtgtgcg gcatccagaa 3180
ctccgtgagc gcaaaccggt ctgacccagt gaccctggat gtgctgtacg gccccgacac 3240
tccgatcatt tcaccccccg attcatccta cctgtccggc gctaacctca acctctcatg 3300
ccactccgca tccaacccca gcccgcaata ttcgtggcgc attaacggaa ttcctcagca 3360
acatacccag gtcctgttca ttgcgaagat cacccctaac aacaacggaa cctacgcctg 3420
ctttgtgtca aacctggcca ctggtagaaa caactccatc gtgaagtcca ttaccgtgtc 3480
ggcgtccgga acttccccgg gcctgagcgc cggcgccacc gtgggaatta tgatcggcgt 3540
gctcgtggga gtggccctga tcaggaagag aagaggatcc ggcgagggca gaggcagcct 3600
gctgacatgt ggcgacgtgg aagagaaccc tggccccacc cctggaaccc agagcccctt 3660
cttccttctg ctgctgctga ccgtgctgac tgtcgtgaca ggctctggcc acgccagctc 3720
tacacctggc ggcgagaaag agacaagcgc cacccagaga agcagcgtgc caagcagcac 3780
cgagaagaac gccgtgtcca tgaccagctc cgtgctgagc agccactctc ctggcagcgg 3840
cagcagcaca acacagggcc aggatgtgac actggcccct gccacagaac ctgcctctgg 3900
atctgccgcc acctggggac aggacgtgac aagcgtgcca gtgaccagac ctgccctggg 3960
ctctacaaca ccccctgccc acgatgtgac cagcgcccct gataacaagc ctgcccctgg 4020
aagcacagcc cctccagctc atggcgtgac ctctgcccca gataccagac cagccccagg 4080
atctacagcc ccacccgcac acggcgtgac aagtgcccct gacacaagac ccgctccagg 4140
ctctactgct cctcctgccc atggcgtgac aagcgctccc gatacaaggc cagctcctgg 4200
ctccacagca ccaccagcac atggcgtgac atcagctccc gacactagac ctgctcccgg 4260
atcaaccgct ccaccagctc acggcgtgac cagcgcacct gataccagac ctgctctggg 4320
aagcaccgcc cctcccgtgc acaatgtgac atctgcttcc ggcagcgcca gcggctctgc 4380
ctctacactg gtgcacaacg gcaccagcgc cagagccaca acaaccccag ccagcaagag 4440
cacccccttc agcatcccta gccaccacag cgacacccct accacactgg ccagccactc 4500
caccaagacc gatgcctcta gcacccacca ctccagcgtg ccccctctga ccagcagcaa 4560
ccacagcaca agcccccagc tgtctaccgg cgtctcattc ttctttctgt ccttccacat 4620
cagcaacctg cagttcaaca gcagcctgga agatcccagc accgactact accaggaact 4680
gcagcgggat atcagcgaga tgttcctgca aatctacaag cagggcggct tcctgggcct 4740
gagcaacatc aagttcagac ccggcagcgt ggtggtgcag ctgaccctgg ctttccggga 4800
aggcaccatc aacgtgcacg acgtggaaac ccagttcaac cagtacaaga ccgaggccgc 4860
cagccggtac aacctgacca tctccgatgt gtccgtgtcc gacgtgccct tcccattctc 4920
tgcccagtct ggcgcaggcg tgccaggatg gggaattgct ctgctggtgc tcgtgtgcgt 4980
gctggtggcc ctggccatcg tgtatctgat tgccctggcc gtgtgccagt gccggcggaa 5040
gaattacggc cagctggaca tcttccccgc cagagacacc taccacccca tgagcgagta 5100
ccccacatac cacacccacg gcagatacgt gccacccagc tccaccgaca gatcccccta 5160
cgagaaagtg tctgccggca acggcggcag ctccctgagc tacacaaatc ctgccgtggc 5220
cgctgcctcc gccaacctgt gaagatctgg gccctaacaa aacaaaaaga tggggttatt 5280
ccctaaactt catgggttac gtaattggaa gttgggggac attgccacaa gatcatattg 5340
tacaaaagat caaacactgt tttagaaaac ttcctgtaaa caggcctatt gattggaaag 5400
tatgtcaaag gattgtgggt cttttgggct ttgctgctcc atttacacaa tgtggatatc 5460
ctgccttaat gcctttgtat gcatgtatac aagctaaaca ggctttcact ttctcgccaa 5520
cttacaaggc ctttctaagt aaacagtaca tgaaccttta ccccgttgct cggcaacggc 5580
ctggtctgtg ccaagtgttt gctgacgcaa cccccactgg ctggggcttg gccataggcc 5640
atcagcgcat gcgtggaacc tttgtggctc ctctgccgat ccatactgcg gaactcctag 5700
ccgcttgttt tgctcgcagc cggtctggag caaagctcat aggaactgac aattctgtcg 5760
tcctctcgcg gaaatataca tcgtttcgat ctacgtatga tctttttccc tctgccaaaa 5820
attatgggga catcatgaag ccccttgagc atctgacttc tggctaataa aggaaattta 5880
ttttcattgc aatagtgtgt tggaattttt tgtgtctctc actcggaagg aattctgcat 5940
taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc 6000
tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca 6060
aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca 6120
aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg 6180
ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 6240
acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 6300
ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 6360
tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc 6420
tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt 6480
gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt 6540
agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc 6600
tacactagaa gaacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa 6660
agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt 6720
tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct 6780
acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta 6840
tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa 6900
agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc 6960
tcagcgatct gtctatttcg ttcatccata gttgcctgac tc 7002
<210> 72
<211> 6837
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 72
ggcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 60
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 120
agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 180
tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg 240
tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat 300
ggcaaaagct tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca 360
tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 420
aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcaaatgcaa ccggcgcagg 480
aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg 540
aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 600
aaatgcttga tggtcggaag aggcataaat tccgtcagcc agtttagtct gaccatctca 660
tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg 720
ggcttcccat acaatcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat 780
ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga gcaagacgtt 840
tcccgttgaa tatggctcat aacacccctt gtattactgt ttatgtaagc agacaggtcg 900
acaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg tacatttata 960
ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt tattaatagt 1020
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 1080
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 1140
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 1200
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt ccgcccccta 1260
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttacggg 1320
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 1380
tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 1440
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 1500
gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 1560
atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 1620
ttgacctcca tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 1680
gaacgcggat tccccgtgcc aagagtgact caccgtccgg atctcagcaa gcaggtatgt 1740
actctccagg gtgggcctgg cttccccagt caagactcca gggatttgag ggacgctgtg 1800
ggctcttctc ttacatgtac cttttgcttg cctcaaccct gactatcttc caggtcagga 1860
tcccagagtc aggggtctgt attttcctgc tggtggctcc agttcaggaa cagtaaaccc 1920
tgctccgaat attgcctctc acatctcgtc aatctccgcg aggactgggg accctgtgac 1980
gaacatggct agcacccctg gaacccagag ccccttcttc cttctgctgc tgctgaccgt 2040
gctgactgtc gtgacaggct ctggccacgc cagctctaca cctggcggcg agaaagagac 2100
aagcgccacc cagagaagca gcgtgccaag cagcaccgag aagaacgccg tgtccatgac 2160
cagctccgtg ctgagcagcc actctcctgg cagcggcagc agcacaacac agggccagga 2220
tgtgacactg gcccctgcca cagaacctgc ctctggatct gccgccacct ggggacagga 2280
cgtgacaagc gtgccagtga ccagacctgc cctgggctct acaacacccc ctgcccacga 2340
tgtgaccagc gcccctgata acaagcctgc ccctggaagc acagcccctc cagctcatgg 2400
cgtgacctct gccccagata ccagaccagc cccaggatct acagccccac ccgcacacgg 2460
cgtgacaagt gcccctgaca caagacccgc tccaggctct actgctcctc ctgcccatgg 2520
cgtgacaagc gctcccgata caaggccagc tcctggctcc acagcaccac cagcacatgg 2580
cgtgacatca gctcccgaca ctagacctgc tcccggatca accgctccac cagctcacgg 2640
cgtgaccagc gcacctgata ccagacctgc tctgggaagc accgcccctc ccgtgcacaa 2700
tgtgacatct gcttccggca gcgccagcgg ctctgcctct acactggtgc acaacggcac 2760
cagcgccaga gccacaacaa ccccagccag caagagcacc cccttcagca tccctagcca 2820
ccacagcgac acccctacca cactggccag ccactccacc aagaccgatg cctctagcac 2880
ccaccactcc agcgtgcccc ctctgaccag cagcaaccac agcacaagcc cccagctgtc 2940
taccggcgtc tcattcttct ttctgtcctt ccacatcagc aacctgcagt tcaacagcag 3000
cctggaagat cccagcaccg actactacca ggaactgcag cgggatatca gcgagatgtt 3060
cctgcaaatc tacaagcagg gcggcttcct gggcctgagc aacatcaagt tcagacccgg 3120
cagcgtggtg gtgcagctga ccctggcttt ccgggaaggc accatcaacg tgcacgacgt 3180
ggaaacccag ttcaaccagt acaagaccga ggccgccagc cggtacaacc tgaccatctc 3240
cgatgtgtcc gtgtccgacg tgcccttccc attctctgcc cagtctggcg caggcgtgcc 3300
aggatgggga attgctctgc tggtgctcgt gtgcgtgctg gtggccctgg ccatcgtgta 3360
tctgattgcc ctggccgtgt gccagtgccg gcggaagaat tacggccagc tggacatctt 3420
ccccgccaga gacacctacc accccatgag cgagtacccc acataccaca cccacggcag 3480
atacgtgcca cccagctcca ccgacagatc cccctacgag aaagtgtctg ccggcaacgg 3540
cggcagctcc ctgagctaca caaatcctgc cgtggccgct gcctccgcca acctgggatc 3600
cggcagaatc ttcaacgccc actacgccgg ctacttcgcc gacctgctga tccacgacat 3660
cgagacaaac cctggcccca agctgaccat tgagagcact cccttcaacg tggctgaggg 3720
gaaggaggtg ctgctcctgg tgcacaatct gccccagcac ctgttcgggt actcctggta 3780
caagggagaa cgcgtggacg ggaaccggca gatcataggc tacgtcatcg gaacccagca 3840
ggccacaccc ggtccagcgt acagcggccg ggagattatc tacccgaacg cctccctgct 3900
gatccaaaac atcatccaga acgacaccgg tttctacact ctgcacgtga ttaagtcaga 3960
tctggtcaac gaagaggcca ccggccaatt cagggtgtac cccgaactcc ctaagccgtt 4020
catcacctcg aacaacagca acccggtcga ggatgaagat gcggtggcct tgacgtgcga 4080
acctgagatc cagaacacca cctacttgtg gtgggtgaac aatcagagcc tgccagtctc 4140
cccacgactc cagctgtcga acgacaacag gaccctgact ttgctgtccg tgactcggaa 4200
cgacgtgggc ccttatgaat gcggtatcca gaacaagctg tccgtggacc acagcgaccc 4260
tgtgatcctg aacgtccttt acgggccgga cgaccccacc atttccccgt cgtacactta 4320
ctaccggccg ggcgtgaacc tgtccctgtc gtgccacgct gcctccaatc cgccggccca 4380
gtactcctgg ctcatcgacg gaaacatcca gcagcacacc caagaactgt tcatctccaa 4440
cattaccgag aaaaactcgg gactttacac ctgtcaagcc aacaattccg ccagcggcca 4500
ctcccgcacc actgtcaaaa ctatcactgt gtccgccgaa ctcccgaagc ccagcatcag 4560
ctccaacaac tcgaagcccg tggaggataa ggacgctgtc gcgttcacct gtgaaccaga 4620
ggcacagaat accacctacc tttggtgggt caacggacag tccctgcctg tctcaccgag 4680
actgcagctg tcaaacggga ataggactct gaccttgttt aacgtcaccc ggaacgacgc 4740
ccgggcctac gtgtgcggca tccagaactc cgtgagcgca aaccggtctg acccagtgac 4800
cctggatgtg ctgtacggcc ccgacactcc gatcatttca ccccccgatt catcctacct 4860
gtccggcgct aacctcaacc tctcatgcca ctccgcatcc aaccccagcc cgcaatattc 4920
gtggcgcatt aacggaattc ctcagcaaca tacccaggtc ctgttcattg cgaagatcac 4980
ccctaacaac aacggaacct acgcctgctt tgtgtcaaac ctggccactg gtagaaacaa 5040
ctccatcgtg aagtccatta ccgtgtcggc gtcctgaaga tctgggccct aacaaaacaa 5100
aaagatgggg ttattcccta aacttcatgg gttacgtaat tggaagttgg gggacattgc 5160
cacaagatca tattgtacaa aagatcaaac actgttttag aaaacttcct gtaaacaggc 5220
ctattgattg gaaagtatgt caaaggattg tgggtctttt gggctttgct gctccattta 5280
cacaatgtgg atatcctgcc ttaatgcctt tgtatgcatg tatacaagct aaacaggctt 5340
tcactttctc gccaacttac aaggcctttc taagtaaaca gtacatgaac ctttaccccg 5400
ttgctcggca acggcctggt ctgtgccaag tgtttgctga cgcaaccccc actggctggg 5460
gcttggccat aggccatcag cgcatgcgtg gaacctttgt ggctcctctg ccgatccata 5520
ctgcggaact cctagccgct tgttttgctc gcagccggtc tggagcaaag ctcataggaa 5580
ctgacaattc tgtcgtcctc tcgcggaaat atacatcgtt tcgatctacg tatgatcttt 5640
ttccctctgc caaaaattat ggggacatca tgaagcccct tgagcatctg acttctggct 5700
aataaaggaa atttattttc attgcaatag tgtgttggaa ttttttgtgt ctctcactcg 5760
gaaggaattc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt gcgtattggg 5820
cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg 5880
gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga 5940
aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg 6000
gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag 6060
aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc 6120
gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg 6180
ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt 6240
cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc 6300
ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc 6360
actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg 6420
tggcctaact acggctacac tagaagaaca gtatttggta tctgcgctct gctgaagcca 6480
gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 6540
ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat 6600
cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt 6660
ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt 6720
tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc 6780
agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactc 6837
<210> 73
<211> 7890
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 73
ggcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 60
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 120
agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 180
tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg 240
tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat 300
ggcaaaagct tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca 360
tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 420
aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcaaatgcaa ccggcgcagg 480
aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg 540
aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 600
aaatgcttga tggtcggaag aggcataaat tccgtcagcc agtttagtct gaccatctca 660
tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg 720
ggcttcccat acaatcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat 780
ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga gcaagacgtt 840
tcccgttgaa tatggctcat aacacccctt gtattactgt ttatgtaagc agacaggtcg 900
acaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg tacatttata 960
ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt tattaatagt 1020
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 1080
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 1140
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 1200
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt ccgcccccta 1260
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttacggg 1320
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 1380
tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 1440
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 1500
gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 1560
atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 1620
ttgacctcca tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 1680
gaacgcggat tccccgtgcc aagagtgact caccgtccgg atctcagcaa gcaggtatgt 1740
actctccagg gtgggcctgg cttccccagt caagactcca gggatttgag ggacgctgtg 1800
ggctcttctc ttacatgtac cttttgcttg cctcaaccct gactatcttc caggtcagga 1860
tcccagagtc aggggtctgt attttcctgc tggtggctcc agttcaggaa cagtaaaccc 1920
tgctccgaat attgcctctc acatctcgtc aatctccgcg aggactgggg accctgtgac 1980
gaacatggct agcaagctga ccattgagag cactcccttc aacgtggctg aggggaagga 2040
ggtgctgctc ctggtgcaca atctgcccca gcacctgttc gggtactcct ggtacaaggg 2100
agaacgcgtg gacgggaacc ggcagatcat aggctacgtc atcggaaccc agcaggccac 2160
acccggtcca gcgtacagcg gccgggagat tatctacccg aacgcctccc tgctgatcca 2220
aaacatcatc cagaacgaca ccggtttcta cactctgcac gtgattaagt cagatctggt 2280
caacgaagag gccaccggcc aattcagggt gtaccccgaa ctccctaagc cgttcatcac 2340
ctcgaacaac agcaacccgg tcgaggatga agatgcggtg gccttgacgt gcgaacctga 2400
gatccagaac accacctact tgtggtgggt gaacaatcag agcctgccag tctccccacg 2460
actccagctg tcgaacgaca acaggaccct gactttgctg tccgtgactc ggaacgacgt 2520
gggcccttat gaatgcggta tccagaacaa gctgtccgtg gaccacagcg accctgtgat 2580
cctgaacgtc ctttacgggc cggacgaccc caccatttcc ccgtcgtaca cttactaccg 2640
gccgggcgtg aacctgtccc tgtcgtgcca cgctgcctcc aatccgccgg cccagtactc 2700
ctggctcatc gacggaaaca tccagcagca cacccaagaa ctgttcatct ccaacattac 2760
cgagaaaaac tcgggacttt acacctgtca agccaacaat tccgccagcg gccactcccg 2820
caccactgtc aaaactatca ctgtgtccgc cgaactcccg aagcccagca tcagctccaa 2880
caactcgaag cccgtggagg ataaggacgc tgtcgcgttc acctgtgaac cagaggcaca 2940
gaataccacc tacctttggt gggtcaacgg acagtccctg cctgtctcac cgagactgca 3000
gctgtcaaac gggaatagga ctctgacctt gtttaacgtc acccggaacg acgcccgggc 3060
ctacgtgtgc ggcatccaga actccgtgag cgcaaaccgg tctgacccag tgaccctgga 3120
tgtgctgtac ggccccgaca ctccgatcat ttcacccccc gattcatcct acctgtccgg 3180
cgctaacctc aacctctcat gccactccgc atccaacccc agcccgcaat attcgtggcg 3240
cattaacgga attcctcagc aacataccca ggtcctgttc attgcgaaga tcacccctaa 3300
caacaacgga acctacgcct gctttgtgtc aaacctggcc actggtagaa acaactccat 3360
cgtgaagtcc attaccgtgt cggcgtccgg atccggcgag ggcagaggca gcctgctgac 3420
atgtggcgac gtggaagaga accctggccc cggagctgcc ccggagccgg agaggacccc 3480
cgttggccag ggatcgtggg cccatccggg acgcaccagg ggaccatccg acaggggatt 3540
ctgtgtggtg tcaccggcca ggccagcaga agaggcaacc agcctcgagg gagcgttgtc 3600
tggaaccaga cattcccacc cgtcggtggg ccggcagcac cacgcgggac caccgtccac 3660
ttccagaccg ccacggccat gggacacccc ttgcccgcct gtgtatgccg agactaaaca 3720
cttcctgtac tcatccggag acaaggaaca gcttcggccg tccttcctcc tgtcgtcgct 3780
cagaccgagc ctgaccggag cacgcagatt ggtggaaact atcttccttg ggtcacgtcc 3840
gtggatgcca ggtaccccac ggcgcctccc gcgcctccca cagagatact ggcagatgcg 3900
gcctctgttc ctggaattgc tgggaaacca cgctcagtgc ccgtacggag tcctgctcaa 3960
gactcactgc cctctgaggg cggcggtcac tccggcggcc ggagtgtgcg cacgggagaa 4020
gccccaggga agcgtggcag ctccggaaga ggaggacacc gatccgcgcc gcctcgtgca 4080
acttctgcgc cagcactcct cgccctggca agtctacggg ttcgtccgcg cctgcctgcg 4140
ccgcctggtg ccgcctgggc tctggggttc ccggcataac gagcgccgct tcctgagaaa 4200
tactaagaag tttatctcac ttggaaaaca tgccaagttg tcgctgcaag aactcacgtg 4260
gaagatgtca gtccgcgatt gcgcctggct gcgccgctcg ccgggcgtcg ggtgtgttcc 4320
agctgcagaa caccgcctga gagaagaaat tctggccaaa tttctgcatt ggctgatgtc 4380
agtgtacgtg gtcgagctgc tgcgctcctt tttctacgtc actgagacta cctttcaaaa 4440
gaaccgcctg ttcttctacc gcaaatctgt gtggagcaag ctgcagtcaa tcggcattcg 4500
ccagcatctg aagagggtgc agctgcggga actttccgag gcagaagtcc gccagcaccg 4560
ggaggcccgg ccggcgcttc tcacgtcgcg tctgagattc atcccaaagc ccgacgggct 4620
gaggcctatc gtcaacatgg attacgtcgt gggcgctcgc acctttcgcc gtgaaaagcg 4680
ggccgaacgc ttgacctcac gggtgaaggc cctcttctcc gtgctgaact acgagagagc 4740
aagacggcct ggcctgctgg gagcttcggt gctgggactg gacgatatcc accgggcttg 4800
gcggaccttt gttctccggg tgagagccca agaccctccg ccggaactgt acttcgtgaa 4860
ggtggcgatc accggagcct atgatactat tccgcaagat cgactcaccg aagtcatcgc 4920
ctcgatcatc aaaccgcaga acacttactg cgtcaggcgg tacgccgtgg tccagaaggc 4980
cgcgcatggc cacgtgagaa aggcgttcaa gtcgcacgtg tccactctca ccgacctcca 5040
gccttacatg aggcaattcg ttgcgcattt gcaagagact tcgcccctga gagatgcggt 5100
ggtcatcgag cagagctcca gcctgaacga agcgagcagc ggtctgtttg acgtgttcct 5160
ccgcttcatg tgtcatcacg cggtgcgaat caggggaaaa tcatacgtgc agtgccaggg 5220
aatcccacaa ggcagcattc tgtcgactct cttgtgttcc ctttgctacg gcgatatgga 5280
aaacaagctg ttcgctggga tcagacggga cgggttgctg ctcagactgg tggacgactt 5340
cctgctggtg actccgcacc tcactcacgc caaaaccttt ctccgcactc tggtgagggg 5400
agtgccagaa tacggctgtg tggtcaatct ccggaaaact gtggtgaatt tccctgtcga 5460
ggatgaggca ctcggaggaa ccgcatttgt ccaaatgcca gcacatggcc tgttcccatg 5520
gtgcggtctg ctgctggaca cccgaactct tgaagtgcag tccgactact ccagctatgc 5580
ccggacgagc atccgcgcca gcctcacttt caatcgcggc tttaaggccg gacgaaacat 5640
gcgcagaaag cttttcggag tcctccggct taaatgccat tcgctctttc tcgatctcca 5700
agtcaattcg ctgcagaccg tgtgcacgaa catctacaag atcctgctgc tccaagccta 5760
ccggttccac gcttgcgtgc ttcagctgcc gtttcaccaa caggtgtgga agaacccgac 5820
cttctttctg cgggtcatta gcgatactgc ctccctgtgt tactcaatcc tcaaggcaaa 5880
gaacgccgga atgtcgctgg gtgcgaaagg agccgcggga cctcttccta gcgaagcggt 5940
gcagtggctc tgccaccagg ctttcctcct gaagctgacc aggcacagag tgacctacgt 6000
cccgctgctg ggctcgctgc gcactgcaca gacccagctg tctagaaaac tccccggcac 6060
caccctgacc gctctggaag ccgccgccaa cccagcattg ccgtcagatt tcaagaccat 6120
cttggactga agatctgggc cctaacaaaa caaaaagatg gggttattcc ctaaacttca 6180
tgggttacgt aattggaagt tgggggacat tgccacaaga tcatattgta caaaagatca 6240
aacactgttt tagaaaactt cctgtaaaca ggcctattga ttggaaagta tgtcaaagga 6300
ttgtgggtct tttgggcttt gctgctccat ttacacaatg tggatatcct gccttaatgc 6360
ctttgtatgc atgtatacaa gctaaacagg ctttcacttt ctcgccaact tacaaggcct 6420
ttctaagtaa acagtacatg aacctttacc ccgttgctcg gcaacggcct ggtctgtgcc 6480
aagtgtttgc tgacgcaacc cccactggct ggggcttggc cataggccat cagcgcatgc 6540
gtggaacctt tgtggctcct ctgccgatcc atactgcgga actcctagcc gcttgttttg 6600
ctcgcagccg gtctggagca aagctcatag gaactgacaa ttctgtcgtc ctctcgcgga 6660
aatatacatc gtttcgatct acgtatgatc tttttccctc tgccaaaaat tatggggaca 6720
tcatgaagcc ccttgagcat ctgacttctg gctaataaag gaaatttatt ttcattgcaa 6780
tagtgtgttg gaattttttg tgtctctcac tcggaaggaa ttctgcatta atgaatcggc 6840
caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac 6900
tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata 6960
cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa 7020
aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct 7080
gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa 7140
agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg 7200
cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca 7260
cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa 7320
ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg 7380
gtaagacacg acttatcgcc actggcagca gccactggta acaggattag cagagcgagg 7440
tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga 7500
acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc 7560
tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag 7620
attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac 7680
gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc 7740
ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag 7800
taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt 7860
ctatttcgtt catccatagt tgcctgactc 7890
<210> 74
<211> 8070
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 74
ggcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 60
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 120
agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 180
tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg 240
tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat 300
ggcaaaagct tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca 360
tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 420
aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcaaatgcaa ccggcgcagg 480
aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg 540
aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 600
aaatgcttga tggtcggaag aggcataaat tccgtcagcc agtttagtct gaccatctca 660
tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg 720
ggcttcccat acaatcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat 780
ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga gcaagacgtt 840
tcccgttgaa tatggctcat aacacccctt gtattactgt ttatgtaagc agacaggtcg 900
acaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg tacatttata 960
ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt tattaatagt 1020
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 1080
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 1140
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 1200
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt ccgcccccta 1260
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttacggg 1320
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 1380
tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 1440
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 1500
gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 1560
atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 1620
ttgacctcca tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 1680
gaacgcggat tccccgtgcc aagagtgact caccgtccgg atctcagcaa gcaggtatgt 1740
actctccagg gtgggcctgg cttccccagt caagactcca gggatttgag ggacgctgtg 1800
ggctcttctc ttacatgtac cttttgcttg cctcaaccct gactatcttc caggtcagga 1860
tcccagagtc aggggtctgt attttcctgc tggtggctcc agttcaggaa cagtaaaccc 1920
tgctccgaat attgcctctc acatctcgtc aatctccgcg aggactgggg accctgtgac 1980
gaacatggct agcggagctg ccccggagcc ggagaggacc cccgttggcc agggatcgtg 2040
ggcccatccg ggacgcacca ggggaccatc cgacagggga ttctgtgtgg tgtcaccggc 2100
caggccagca gaagaggcaa ccagcctcga gggagcgttg tctggaacca gacattccca 2160
cccgtcggtg ggccggcagc accacgcggg accaccgtcc acttccagac cgccacggcc 2220
atgggacacc ccttgcccgc ctgtgtatgc cgagactaaa cacttcctgt actcatccgg 2280
agacaaggaa cagcttcggc cgtccttcct cctgtcgtcg ctcagaccga gcctgaccgg 2340
agcacgcaga ttggtggaaa ctatcttcct tgggtcacgt ccgtggatgc caggtacccc 2400
acggcgcctc ccgcgcctcc cacagagata ctggcagatg cggcctctgt tcctggaatt 2460
gctgggaaac cacgctcagt gcccgtacgg agtcctgctc aagactcact gccctctgag 2520
ggcggcggtc actccggcgg ccggagtgtg cgcacgggag aagccccagg gaagcgtggc 2580
agctccggaa gaggaggaca ccgatccgcg ccgcctcgtg caacttctgc gccagcactc 2640
ctcgccctgg caagtctacg ggttcgtccg cgcctgcctg cgccgcctgg tgccgcctgg 2700
gctctggggt tcccggcata acgagcgccg cttcctgaga aatactaaga agtttatctc 2760
acttggaaaa catgccaagt tgtcgctgca agaactcacg tggaagatgt cagtccgcga 2820
ttgcgcctgg ctgcgccgct cgccgggcgt cgggtgtgtt ccagctgcag aacaccgcct 2880
gagagaagaa attctggcca aatttctgca ttggctgatg tcagtgtacg tggtcgagct 2940
gctgcgctcc tttttctacg tcactgagac tacctttcaa aagaaccgcc tgttcttcta 3000
ccgcaaatct gtgtggagca agctgcagtc aatcggcatt cgccagcatc tgaagagggt 3060
gcagctgcgg gaactttccg aggcagaagt ccgccagcac cgggaggccc ggccggcgct 3120
tctcacgtcg cgtctgagat tcatcccaaa gcccgacggg ctgaggccta tcgtcaacat 3180
ggattacgtc gtgggcgctc gcacctttcg ccgtgaaaag cgggccgaac gcttgacctc 3240
acgggtgaag gccctcttct ccgtgctgaa ctacgagaga gcaagacggc ctggcctgct 3300
gggagcttcg gtgctgggac tggacgatat ccaccgggct tggcggacct ttgttctccg 3360
ggtgagagcc caagaccctc cgccggaact gtacttcgtg aaggtggcga tcaccggagc 3420
ctatgatact attccgcaag atcgactcac cgaagtcatc gcctcgatca tcaaaccgca 3480
gaacacttac tgcgtcaggc ggtacgccgt ggtccagaag gccgcgcatg gccacgtgag 3540
aaaggcgttc aagtcgcacg tgtccactct caccgacctc cagccttaca tgaggcaatt 3600
cgttgcgcat ttgcaagaga cttcgcccct gagagatgcg gtggtcatcg agcagagctc 3660
cagcctgaac gaagcgagca gcggtctgtt tgacgtgttc ctccgcttca tgtgtcatca 3720
cgcggtgcga atcaggggaa aatcatacgt gcagtgccag ggaatcccac aaggcagcat 3780
tctgtcgact ctcttgtgtt ccctttgcta cggcgatatg gaaaacaagc tgttcgctgg 3840
gatcagacgg gacgggttgc tgctcagact ggtggacgac ttcctgctgg tgactccgca 3900
cctcactcac gccaaaacct ttctccgcac tctggtgagg ggagtgccag aatacggctg 3960
tgtggtcaat ctccggaaaa ctgtggtgaa tttccctgtc gaggatgagg cactcggagg 4020
aaccgcattt gtccaaatgc cagcacatgg cctgttccca tggtgcggtc tgctgctgga 4080
cacccgaact cttgaagtgc agtccgacta ctccagctat gcccggacga gcatccgcgc 4140
cagcctcact ttcaatcgcg gctttaaggc cggacgaaac atgcgcagaa agcttttcgg 4200
agtcctccgg cttaaatgcc attcgctctt tctcgatctc caagtcaatt cgctgcagac 4260
cgtgtgcacg aacatctaca agatcctgct gctccaagcc taccggttcc acgcttgcgt 4320
gcttcagctg ccgtttcacc aacaggtgtg gaagaacccg accttctttc tgcgggtcat 4380
tagcgatact gcctccctgt gttactcaat cctcaaggca aagaacgccg gaatgtcgct 4440
gggtgcgaaa ggagccgcgg gacctcttcc tagcgaagcg gtgcagtggc tctgccacca 4500
ggctttcctc ctgaagctga ccaggcacag agtgacctac gtcccgctgc tgggctcgct 4560
gcgcactgca cagacccagc tgtctagaaa actccccggc accaccctga ccgctctgga 4620
agccgccgcc aacccagcat tgccgtcaga tttcaagacc atcttggacg gatccggcca 4680
gtgcaccaat tacgccctgc tgaagctggc cggcgacgtg gaatctaacc ctggccctga 4740
atcgccaagc gcaccccctc atcggtggtg catcccttgg caacgcctcc tcctgaccgc 4800
ctcactgctg actttctgga acccgccgac caccgcaaag ctgaccattg agagcactcc 4860
cttcaacgtg gctgagggga aggaggtgct gctcctggtg cacaatctgc cccagcacct 4920
gttcgggtac tcctggtaca agggagaacg cgtggacggg aaccggcaga tcataggcta 4980
cgtcatcgga acccagcagg ccacacccgg tccagcgtac agcggccggg agattatcta 5040
cccgaacgcc tccctgctga tccaaaacat catccagaac gacaccggtt tctacactct 5100
gcacgtgatt aagtcagatc tggtcaacga agaggccacc ggccaattca gggtgtaccc 5160
cgaactccct aagccgttca tcacctcgaa caacagcaac ccggtcgagg atgaagatgc 5220
ggtggccttg acgtgcgaac ctgagatcca gaacaccacc tacttgtggt gggtgaacaa 5280
tcagagcctg ccagtctccc cacgactcca gctgtcgaac gacaacagga ccctgacttt 5340
gctgtccgtg actcggaacg acgtgggccc ttatgaatgc ggtatccaga acaagctgtc 5400
cgtggaccac agcgaccctg tgatcctgaa cgtcctttac gggccggacg accccaccat 5460
ttccccgtcg tacacttact accggccggg cgtgaacctg tccctgtcgt gccacgctgc 5520
ctccaatccg ccggcccagt actcctggct catcgacgga aacatccagc agcacaccca 5580
agaactgttc atctccaaca ttaccgagaa aaactcggga ctttacacct gtcaagccaa 5640
caattccgcc agcggccact cccgcaccac tgtcaaaact atcactgtgt ccgccgaact 5700
cccgaagccc agcatcagct ccaacaactc gaagcccgtg gaggataagg acgctgtcgc 5760
gttcacctgt gaaccagagg cacagaatac cacctacctt tggtgggtca acggacagtc 5820
cctgcctgtc tcaccgagac tgcagctgtc aaacgggaat aggactctga ccttgtttaa 5880
cgtcacccgg aacgacgccc gggcctacgt gtgcggcatc cagaactccg tgagcgcaaa 5940
ccggtctgac ccagtgaccc tggatgtgct gtacggcccc gacactccga tcatttcacc 6000
ccccgattca tcctacctgt ccggcgctaa cctcaacctc tcatgccact ccgcatccaa 6060
ccccagcccg caatattcgt ggcgcattaa cggaattcct cagcaacata cccaggtcct 6120
gttcattgcg aagatcaccc ctaacaacaa cggaacctac gcctgctttg tgtcaaacct 6180
ggccactggt agaaacaact ccatcgtgaa gtccattacc gtgtcggcgt ccggaacttc 6240
cccgggcctg agcgccggcg ccaccgtggg aattatgatc ggcgtgctcg tgggagtggc 6300
cctgatctga agatctgggc cctaacaaaa caaaaagatg gggttattcc ctaaacttca 6360
tgggttacgt aattggaagt tgggggacat tgccacaaga tcatattgta caaaagatca 6420
aacactgttt tagaaaactt cctgtaaaca ggcctattga ttggaaagta tgtcaaagga 6480
ttgtgggtct tttgggcttt gctgctccat ttacacaatg tggatatcct gccttaatgc 6540
ctttgtatgc atgtatacaa gctaaacagg ctttcacttt ctcgccaact tacaaggcct 6600
ttctaagtaa acagtacatg aacctttacc ccgttgctcg gcaacggcct ggtctgtgcc 6660
aagtgtttgc tgacgcaacc cccactggct ggggcttggc cataggccat cagcgcatgc 6720
gtggaacctt tgtggctcct ctgccgatcc atactgcgga actcctagcc gcttgttttg 6780
ctcgcagccg gtctggagca aagctcatag gaactgacaa ttctgtcgtc ctctcgcgga 6840
aatatacatc gtttcgatct acgtatgatc tttttccctc tgccaaaaat tatggggaca 6900
tcatgaagcc ccttgagcat ctgacttctg gctaataaag gaaatttatt ttcattgcaa 6960
tagtgtgttg gaattttttg tgtctctcac tcggaaggaa ttctgcatta atgaatcggc 7020
caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac 7080
tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata 7140
cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa 7200
aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct 7260
gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa 7320
agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg 7380
cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca 7440
cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa 7500
ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg 7560
gtaagacacg acttatcgcc actggcagca gccactggta acaggattag cagagcgagg 7620
tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga 7680
acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc 7740
tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag 7800
attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac 7860
gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc 7920
ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag 7980
taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt 8040
ctatttcgtt catccatagt tgcctgactc 8070
<210> 75
<211> 30252
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 75
ccatcttcaa taatatacct caaacttttt gtgcgcgtta atatgcaaat gaggcgtttg 60
aatttgggga ggaagggcgg tgattggtcg agggatgagc gaccgttagg ggcggggcga 120
gtgacgtttt gatgacgtgg ttgcgaggag gagccagttt gcaagttctc gtgggaaaag 180
tgacgtcaaa cgaggtgtgg tttgaacacg gaaatactca attttcccgc gctctctgac 240
aggaaatgag gtgtttctgg gcggatgcaa gtgaaaacgg gccattttcg cgcgaaaact 300
gaatgaggaa gtgaaaatct gagtaatttc gcgtttatgg cagggaggag tatttgccga 360
gggccgagta gactttgacc gattacgtgg gggtttcgat taccgtgttt ttcacctaaa 420
tttccgcgta cggtgtcaaa gtccggtgtt tttactactg taatagtaat caattacggg 480
gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc 540
gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat 600
agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc 660
ccacttggca gtacatcaag tgtatcatat gccaagtacg ccccctattg acgtcaatga 720
cggtaaatgg cccgcctggc attatgccca gtacatgacc ttatgggact ttcctacttg 780
gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacat 840
caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt 900
caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactc 960
cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg gaggtctata taagcagagc 1020
tgtccctatc agtgatagag atctccctat cagtgataga gagtttagtg aaccgtcaga 1080
tccgctaggg taccgcgatc gcacctcgag ctgatcataa tcagccatac cacatttgta 1140
gaggttttac ttgctttaaa aaacctccca cacctccccc tgaacctgaa acataaaatg 1200
aatgcaattg ttgttgttaa cttgtttatt gcagcttata atggttacaa ataaagcaat 1260
agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg tggtttgtcc 1320
aaactcatca atgtatctta ccaggtgccg agcctgcgag tgcggaggga agcatgccag 1380
gttccagccc gtgtgtgtgg atgtgacgga ggacctgcga cccgatcatt tggtgttgcc 1440
ctgcaccggg acggagttcg gttccagcgg ggaagaatct gactagagtg agtagtgttc 1500
tggggcgggg gaggacctgc atgagggcca gaataactga aatctgtgct tttctgtgtg 1560
ttgcagcagc atgagcggaa gcggctcctt tgagggaggg gtattcagcc cttatctgac 1620
ggggcgtctc ccctcctggg cgggagtgcg tcagaatgtg atgggatcca cggtggacgg 1680
ccggcccgtg cagcccgcga actcttcaac cctgacctat gcaaccctga gctcttcgtc 1740
gttggacgca gctgccgccg cagctgctgc atctgccgcc agcgccgtgc gcggaatggc 1800
catgggcgcc ggctactacg gcactctggt ggccaactcg agttccacca ataatcccgc 1860
cagcctgaac gaggagaagc tgttgctgct gatggcccag ctcgaggcct tgacccagcg 1920
cctgggcgag ctgacccagc aggtggctca gctgcaggag cagacgcggg ccgcggttgc 1980
cacggtgaaa tccaaataaa aaatgaatca ataaataaac ggagacggtt gttgatttta 2040
acacagagtc tgaatcttta tttgattttt cgcgcgcggt aggccctgga ccaccggtct 2100
cgatcattga gcacccggtg gatcttttcc aggacccggt agaggtgggc ttggatgttg 2160
aggtacatgg gcatgagccc gtcccggggg tggaggtagc tccattgcag ggcctcgtgc 2220
tcgggggtgg tgttgtaaat cacccagtca tagcaggggc gcagggcatg gtgttgcaca 2280
atatctttga ggaggagact gatggccacg ggcagccctt tggtgtaggt gtttacaaat 2340
ctgttgagct gggagggatg catgcggggg gagatgaggt gcatcttggc ctggatcttg 2400
agattggcga tgttaccgcc cagatcccgc ctggggttca tgttgtgcag gaccaccagc 2460
acggtgtatc cggtgcactt ggggaattta tcatgcaact tggaagggaa ggcgtgaaag 2520
aatttggcga cgcctttgtg cccgcccagg ttttccatgc actcatccat gatgatggcg 2580
atgggcccgt gggcggcggc ctgggcaaag acgtttcggg ggtcggacac atcatagttg 2640
tggtcctggg tgaggtcatc ataggccatt ttaatgaatt tggggcggag ggtgccggac 2700
tgggggacaa aggtaccctc gatcccgggg gcgtagttcc cctcacagat ctgcatctcc 2760
caggctttga gctcggaggg ggggatcatg tccacctgcg gggcgataaa gaacacggtt 2820
tccggggcgg gggagatgag ctgggccgaa agcaagttcc ggagcagctg ggacttgccg 2880
cagccggtgg ggccgtagat gaccccgatg accggctgca ggtggtagtt gagggagaga 2940
cagctgccgt cctcccggag gaggggggcc acctcgttca tcatctcgcg cacgtgcatg 3000
ttctcgcgca ccagttccgc caggaggcgc tctcccccca gggataggag ctcctggagc 3060
gaggcgaagt ttttcagcgg cttgagtccg tcggccatgg gcattttgga gagggtttgt 3120
tgcaagagtt ccaggcggtc ccagagctcg gtgatgtgct ctacggcatc tcgatccagc 3180
agacctcctc gtttcgcggg ttgggacggc tgcgggagta gggcaccaga cgatgggcgt 3240
ccagcgcagc cagggtccgg tccttccagg gtcgcagcgt ccgcgtcagg gtggtctccg 3300
tcacggtgaa ggggtgcgcg ccgggctggg cgcttgcgag ggtgcgcttc aggctcatcc 3360
ggctggtcga aaaccgctcc cgatcggcgc cctgcgcgtc ggccaggtag caattgacca 3420
tgagttcgta gttgagcgcc tcggccgcgt ggcctttggc gcggagctta cctttggaag 3480
tctgcccgca ggcgggacag aggagggact tgagggcgta gagcttgggg gcgaggaaga 3540
cggactcggg ggcgtaggcg tccgcgccgc agtgggcgca gacggtctcg cactccacga 3600
gccaggtgag gtcgggctgg tcggggtcaa aaaccagttt cccgccgttc tttttgatgc 3660
gtttcttacc tttggtctcc atgagctcgt gtccccgctg ggtgacaaag aggctgtccg 3720
tgtccccgta gaccgacttt atgggccggt cctcgagcgg tgtgccgcgg tcctcctcgt 3780
agaggaaccc cgcccactcc gagacgaaag cccgggtcca ggccagcacg aaggaggcca 3840
cgtgggacgg gtagcggtcg ttgtccacca gcgggtccac cttttccagg gtatgcaaac 3900
acatgtcccc ctcgtccaca tccaggaagg tgattggctt gtaagtgtag gccacgtgac 3960
cgggggtccc ggccgggggg gtataaaagg gtgcgggtcc ctgctcgtcc tcactgtctt 4020
ccggatcgct gtccaggagc gccagctgtt ggggtaggta ttccctctcg aaggcgggca 4080
tgacctcggc actcaggttg tcagtttcta gaaacgagga ggatttgata ttgacggtgc 4140
cggcggagat gcctttcaag agcccctcgt ccatctggtc agaaaagacg atctttttgt 4200
tgtcgagctt ggtggcgaag gagccgtaga gggcgttgga gaggagcttg gcgatggagc 4260
gcatggtctg gtttttttcc ttgtcggcgc gctccttggc ggcgatgttg agctgcacgt 4320
actcgcgcgc cacgcacttc cattcgggga agacggtggt cagctcgtcg ggcacgattc 4380
tgacctgcca gccccgatta tgcagggtga tgaggtccac actggtggcc acctcgccgc 4440
gcaggggctc attagtccag cagaggcgtc cgcccttgcg cgagcagaag gggggcaggg 4500
ggtccagcat gacctcgtcg ggggggtcgg catcgatggt gaagatgccg ggcaggaggt 4560
cggggtcaaa gtagctgatg gaagtggcca gatcgtccag ggcagcttgc cattcgcgca 4620
cggccagcgc gcgctcgtag ggactgaggg gcgtgcccca gggcatggga tgggtaagcg 4680
cggaggcgta catgccgcag atgtcgtaga cgtagagggg ctcctcgagg atgccgatgt 4740
aggtggggta gcagcgcccc ccgcggatgc tggcgcgcac gtagtcatac agctcgtgcg 4800
agggggcgag gagccccggg cccaggttgg tgcgactggg cttttcggcg cggtagacga 4860
tctggcggaa aatggcatgc gagttggagg agatggtggg cctttggaag atgttgaagt 4920
gggcgtgggg cagtccgacc gagtcgcgga tgaagtgggc gtaggagtct tgcagcttgg 4980
cgacgagctc ggcggtgact aggacgtcca gagcgcagta gtcgagggtc tcctggatga 5040
tgtcatactt gagctgtccc ttttgtttcc acagctcgcg gttgagaagg aactcttcgc 5100
ggtccttcca gtactcttcg agggggaacc cgtcctgatc tgcacggtaa gagcctagca 5160
tgtagaactg gttgacggcc ttgtaggcgc agcagccctt ctccacgggg agggcgtagg 5220
cctgggcggc cttgcgcagg gaggtgtgcg tgagggcgaa agtgtccctg accatgacct 5280
tgaggaactg gtgcttgaag tcgatatcgt cgcagccccc ctgctcccag agctggaagt 5340
ccgtgcgctt cttgtaggcg gggttgggca aagcgaaagt aacatcgttg aagaggatct 5400
tgcccgcgcg gggcataaag ttgcgagtga tgcggaaagg ttggggcacc tcggcccggt 5460
tgttgatgac ctgggcggcg agcacgatct cgtcgaagcc gttgatgttg tggcccacga 5520
tgtagagttc cacgaatcgc ggacggccct tgacgtgggg cagtttcttg agctcctcgt 5580
aggtgagctc gtcggggtcg ctgagcccgt gctgctcgag cgcccagtcg gcgagatggg 5640
ggttggcgcg gaggaaggaa gtccagagat ccacggccag ggcggtttgc agacggtccc 5700
ggtactgacg gaactgctgc ccgacggcca ttttttcggg ggtgacgcag tagaaggtgc 5760
gggggtcccc gtgccagcga tcccatttga gctggagggc gagatcgagg gcgagctcga 5820
cgagccggtc gtccccggag agtttcatga ccagcatgaa ggggacgagc tgcttgccga 5880
aggaccccat ccaggtgtag gtttccacat cgtaggtgag gaagagcctt tcggtgcgag 5940
gatgcgagcc gatggggaag aactggatct cctgccacca attggaggaa tggctgttga 6000
tgtgatggaa gtagaaatgc cgacggcgcg ccgaacactc gtgcttgtgt ttatacaagc 6060
ggccacagtg ctcgcaacgc tgcacgggat gcacgtgctg cacgagctgt acctgagttc 6120
ctttgacgag gaatttcagt gggaagtgga gtcgtggcgc ctgcatctcg tgctgtacta 6180
cgtcgtggtg gtcggcctgg ccctcttctg cctcgatggt ggtcatgctg acgagcccgc 6240
gcgggaggca ggtccagacc tcggcgcgag cgggtcggag agcgaggacg agggcgcgca 6300
ggccggagct gtccagggtc ctgagacgct gcggagtcag gtcagtgggc agcggcggcg 6360
cgcggttgac ttgcaggagt ttttccaggg cgcgcgggag gtccagatgg tacttgatct 6420
ccaccgcgcc attggtggcg acgtcgatgg cttgcagggt cccgtgcccc tggggtgtga 6480
ccaccgtccc ccgtttcttc ttgggcggct ggggcgacgg gggcggtgcc tcttccatgg 6540
ttagaagcgg cggcgaggac gcgcgccggg cggcaggggc ggctcggggc ccggaggcag 6600
gggcggcagg ggcacgtcgg cgccgcgcgc gggtaggttc tggtactgcg cccggagaag 6660
actggcgtga gcgacgacgc gacggttgac gtcctggatc tgacgcctct gggtgaaggc 6720
cacgggaccc gtgagtttga acctgaaaga gagttcgaca gaatcaatct cggtatcgtt 6780
gacggcggcc tgccgcagga tctcttgcac gtcgcccgag ttgtcctggt aggcgatctc 6840
ggtcatgaac tgctcgatct cctcctcttg aaggtctccg cggccggcgc gctccacggt 6900
ggccgcgagg tcgttggaga tgcggcccat gagctgcgag aaggcgttca tgcccgcctc 6960
gttccagacg cggctgtaga ccacgacgcc ctcgggatcg cgggcgcgca tgaccacctg 7020
ggcgaggttg agctccacgt ggcgcgtgaa gaccgcgtag ttgcagaggc gctggtagag 7080
gtagttgagc gtggtggcga tgtgctcggt gacgaagaaa tacatgatcc agcggcggag 7140
cggcatctcg ctgacgtcgc ccagcgcctc caaacgttcc atggcctcgt aaaagtccac 7200
ggcgaagttg aaaaactggg agttgcgcgc cgagacggtc aactcctcct ccagaagacg 7260
gatgagctcg gcgatggtgg cgcgcacctc gcgctcgaag gcccccggga gttcctccac 7320
ttcctcttct tcctcctcca ctaacatctc ttctacttcc tcctcaggcg gcagtggtgg 7380
cgggggaggg ggcctgcgtc gccggcggcg cacgggcaga cggtcgatga agcgctcgat 7440
ggtctcgccg cgccggcgtc gcatggtctc ggtgacggcg cgcccgtcct cgcggggccg 7500
cagcgtgaag acgccgccgc gcatctccag gtggccgggg gggtccccgt tgggcaggga 7560
gagggcgctg acgatgcatc ttatcaattg ccccgtaggg actccgcgca aggacctgag 7620
cgtctcgaga tccacgggat ctgaaaaccg ctgaacgaag gcttcgagcc agtcgcagtc 7680
gcaaggtagg ctgagcacgg tttcttctgg cgggtcatgt tggttgggag cggggcgggc 7740
gatgctgctg gtgatgaagt tgaaataggc ggttctgaga cggcggatgg tggcgaggag 7800
caccaggtct ttgggcccgg cttgctggat gcgcagacgg tcggccatgc cccaggcgtg 7860
gtcctgacac ctggccaggt ccttgtagta gtcctgcatg agccgctcca cgggcacctc 7920
ctcctcgccc gcgcggccgt gcatgcgcgt gagcccgaag ccgcgctggg gctggacgag 7980
cgccaggtcg gcgacgacgc gctcggcgag gatggcttgc tggatctggg tgagggtggt 8040
ctggaagtca tcaaagtcga cgaagcggtg gtaggctccg gtgttgatgg tgtaggagca 8100
gttggccatg acggaccagt tgacggtctg gtggcccgga cgcacgagct cgtggtactt 8160
gaggcgcgag taggcgcgcg tgtcgaagat gtagtcgttg caggtgcgca ccaggtactg 8220
gtagccgatg aggaagtgcg gcggcggctg gcggtagagc ggccatcgct cggtggcggg 8280
ggcgccgggc gcgaggtcct cgagcatggt gcggtggtag ccgtagatgt acctggacat 8340
ccaggtgatg ccggcggcgg tggtggaggc gcgcgggaac tcgcggacgc ggttccagat 8400
gttgcgcagc ggcaggaagt agttcatggt gggcacggtc tggcccgtga ggcgcgcgca 8460
gtcgtggatg ctctatacgg gcaaaaacga aagcggtcag cggctcgact ccgtggcctg 8520
gaggctaagc gaacgggttg ggctgcgcgt gtaccccggt tcgaatctcg aatcaggctg 8580
gagccgcagc taacgtggta ttggcactcc cgtctcgacc caagcctgca ccaaccctcc 8640
aggatacgga ggcgggtcgt tttgcaactt ttttttggag gccggatgag actagtaagc 8700
gcggaaagcg gccgaccgcg atggctcgct gccgtagtct ggagaagaat cgccagggtt 8760
gcgttgcggt gtgccccggt tcgaggccgg ccggattccg cggctaacga gggcgtggct 8820
gccccgtcgt ttccaagacc ccatagccag ccgacttctc cagttacgga gcgagcccct 8880
cttttgtttt gtttgttttt gccagatgca tcccgtactg cggcagatgc gcccccacca 8940
ccctccaccg caacaacagc cccctccaca gccggcgctt ctgcccccgc cccagcagca 9000
acttccagcc acgaccgccg cggccgccgt gagcggggct ggacagagtt atgatcacca 9060
gctggccttg gaagagggcg aggggctggc gcgcctgggg gcgtcgtcgc cggagcggca 9120
cccgcgcgtg cagatgaaaa gggacgctcg cgaggcctac gtgcccaagc agaacctgtt 9180
cagagacagg agcggcgagg agcccgagga gatgcgcgcg gcccggttcc acgcggggcg 9240
ggagctgcgg cgcggcctgg accgaaagag ggtgctgagg gacgaggatt tcgaggcgga 9300
cgagctgacg gggatcagcc ccgcgcgcgc gcacgtggcc gcggccaacc tggtcacggc 9360
gtacgagcag accgtgaagg aggagagcaa cttccaaaaa tccttcaaca accacgtgcg 9420
caccctgatc gcgcgcgagg aggtgaccct gggcctgatg cacctgtggg acctgctgga 9480
ggccatcgtg cagaacccca ccagcaagcc gctgacggcg cagctgttcc tggtggtgca 9540
gcatagtcgg gacaacgaag cgttcaggga ggcgctgctg aatatcaccg agcccgaggg 9600
ccgctggctc ctggacctgg tgaacattct gcagagcatc gtggtgcagg agcgcgggct 9660
gccgctgtcc gagaagctgg cggccatcaa cttctcggtg ctgagtttgg gcaagtacta 9720
cgctaggaag atctacaaga ccccgtacgt gcccatagac aaggaggtga agatcgacgg 9780
gttttacatg cgcatgaccc tgaaagtgct gaccctgagc gacgatctgg gggtgtaccg 9840
caacgacagg atgcaccgtg cggtgagcgc cagcaggcgg cgcgagctga gcgaccagga 9900
gctgatgcat agtctgcagc gggccctgac cggggccggg accgaggggg agagctactt 9960
tgacatgggc gcggacctgc actggcagcc cagccgccgg gccttggagg cggcggcagg 10020
accctacgta gaagaggtgg acgatgaggt ggacgaggag ggcgagtacc tggaagactg 10080
atggcgcgac cgtatttttg ctagatgcaa caacaacagc cacctcctga tcccgcgatg 10140
cgggcggcgc tgcagagcca gccgtccggc attaactcct cggacgattg gacccaggcc 10200
atgcaacgca tcatggcgct gacgacccgc aaccccgaag cctttagaca gcagccccag 10260
gccaaccggc tctcggccat cctggaggcc gtggtgccct cgcgctccaa ccccacgcac 10320
gagaaggtcc tggccatcgt gaacgcgctg gtggagaaca aggccatccg cggcgacgag 10380
gccggcctgg tgtacaacgc gctgctggag cgcgtggccc gctacaacag caccaacgtg 10440
cagaccaacc tggaccgcat ggtgaccgac gtgcgcgagg ccgtggccca gcgcgagcgg 10500
ttccaccgcg agtccaacct gggatccatg gtggcgctga acgccttcct cagcacccag 10560
cccgccaacg tgccccgggg ccaggaggac tacaccaact tcatcagcgc cctgcgcctg 10620
atggtgaccg aggtgcccca gagcgaggtg taccagtccg ggccggacta cttcttccag 10680
accagtcgcc agggcttgca gaccgtgaac ctgagccagg ctttcaagaa cttgcagggc 10740
ctgtggggcg tgcaggcccc ggtcggggac cgcgcgacgg tgtcgagcct gctgacgccg 10800
aactcgcgcc tgctgctgct gctggtggcc cccttcacgg acagcggcag catcaaccgc 10860
aactcgtacc tgggctacct gattaacctg taccgcgagg ccatcggcca ggcgcacgtg 10920
gacgagcaga cctaccagga gatcacccac gtgagccgcg ccctgggcca ggacgacccg 10980
ggcaacctgg aagccaccct gaactttttg ctgaccaacc ggtcgcagaa gatcccgccc 11040
cagtacgcgc tcagcaccga ggaggagcgc atcctgcgtt acgtgcagca gagcgtgggc 11100
ctgttcctga tgcaggaggg ggccaccccc agcgccgcgc tcgacatgac cgcgcgcaac 11160
atggagccca gcatgtacgc cagcaaccgc ccgttcatca ataaactgat ggactacttg 11220
catcgggcgg ccgccatgaa ctctgactat ttcaccaacg ccatcctgaa tccccactgg 11280
ctcccgccgc cggggttcta cacgggcgag tacgacatgc ccgaccccaa tgacgggttc 11340
ctgtgggacg atgtggacag cagcgtgttc tccccccgac cgggtgctaa cgagcgcccc 11400
ttgtggaaga aggaaggcag cgaccgacgc ccgtcctcgg cgctgtccgg ccgcgagggt 11460
gctgccgcgg cggtgcccga ggccgccagt cctttcccga gcttgccctt ctcgctgaac 11520
agtatccgca gcagcgagct gggcaggatc acgcgcccgc gcttgctggg cgaagaggag 11580
tacttgaatg actcgctgtt gagacccgag cgggagaaga acttccccaa taacgggata 11640
gaaagcctgg tggacaagat gagccgctgg aagacgtatg cgcaggagca cagggacgat 11700
ccccgggcgt cgcagggggc cacgagccgg ggcagcgccg cccgtaaacg ccggtggcac 11760
gacaggcagc ggggacagat gtgggacgat gaggactccg ccgacgacag cagcgtgttg 11820
gacttgggtg ggagtggtaa cccgttcgct cacctgcgcc cccgtatcgg gcgcatgatg 11880
taagagaaac cgaaaataaa tgatactcac caaggccatg gcgaccagcg tgcgttcgtt 11940
tcttctctgt tgttgttgta tctagtatga tgaggcgtgc gtacccggag ggtcctcctc 12000
cctcgtacga gagcgtgatg cagcaggcga tggcggcggc ggcgatgcag cccccgctgg 12060
aggctcctta cgtgcccccg cggtacctgg cgcctacgga ggggcggaac agcattcgtt 12120
actcggagct ggcacccttg tacgatacca cccggttgta cctggtggac aacaagtcgg 12180
cggacatcgc ctcgctgaac taccagaacg accacagcaa cttcctgacc accgtggtgc 12240
agaacaatga cttcaccccc acggaggcca gcacccagac catcaacttt gacgagcgct 12300
cgcggtgggg cggccagctg aaaaccatca tgcacaccaa catgcccaac gtgaacgagt 12360
tcatgtacag caacaagttc aaggcgcggg tgatggtctc ccgcaagacc cccaatgggg 12420
tgacagtgac agaggattat gatggtagtc aggatgagct gaagtatgaa tgggtggaat 12480
ttgagctgcc cgaaggcaac ttctcggtga ccatgaccat cgacctgatg aacaacgcca 12540
tcatcgacaa ttacttggcg gtggggcggc agaacggggt gctggagagc gacatcggcg 12600
tgaagttcga cactaggaac ttcaggctgg gctgggaccc cgtgaccgag ctggtcatgc 12660
ccggggtgta caccaacgag gctttccatc ccgatattgt cttgctgccc ggctgcgggg 12720
tggacttcac cgagagccgc ctcagcaacc tgctgggcat tcgcaagagg cagcccttcc 12780
aggaaggctt ccagatcatg tacgaggatc tggagggggg caacatcccc gcgctcctgg 12840
atgtcgacgc ctatgagaaa agcaaggagg atgcagcagc tgaagcaact gcagccgtag 12900
ctaccgcctc taccgaggtc aggggcgata attttgcaag cgccgcagca gtggcagcgg 12960
ccgaggcggc tgaaaccgaa agtaagatag tcattcagcc ggtggagaag gatagcaaga 13020
acaggagcta caacgtacta ccggacaaga taaacaccgc ctaccgcagc tggtacctag 13080
cctacaacta tggcgacccc gagaagggcg tgcgctcctg gacgctgctc accacctcgg 13140
acgtcacctg cggcgtggag caagtctact ggtcgctgcc cgacatgatg caagacccgg 13200
tcaccttccg ctccacgcgt caagttagca actacccggt ggtgggcgcc gagctcctgc 13260
ccgtctactc caagagcttc ttcaacgagc aggccgtcta ctcgcagcag ctgcgcgcct 13320
tcacctcgct tacgcacgtc ttcaaccgct tccccgagaa ccagatcctc gtccgcccgc 13380
ccgcgcccac cattaccacc gtcagtgaaa acgttcctgc tctcacagat cacgggaccc 13440
tgccgctgcg cagcagtatc cggggagtcc agcgcgtgac cgttactgac gccagacgcc 13500
gcacctgccc ctacgtctac aaggccctgg gcatagtcgc gccgcgcgtc ctctcgagcc 13560
gcaccttcta aatgtccatt ctcatctcgc ccagtaataa caccggttgg ggcctgcgcg 13620
cgcccagcaa gatgtacgga ggcgctcgcc aacgctccac gcaacacccc gtgcgcgtgc 13680
gcgggcactt ccgcgctccc tggggcgccc tcaagggccg cgtgcggtcg cgcaccaccg 13740
tcgacgacgt gatcgaccag gtggtggccg acgcgcgcaa ctacaccccc gccgccgcgc 13800
ccgtctccac cgtggacgcc gtcatcgaca gcgtggtggc cgacgcgcgc cggtacgccc 13860
gcgccaagag ccggcggcgg cgcatcgccc ggcggcaccg gagcaccccc gccatgcgcg 13920
cggcgcgagc cttgctgcgc agggccaggc gcacgggacg cagggccatg ctcagggcgg 13980
ccagacgcgc ggcttcaggc gccagcgccg gcaggacccg gagacgcgcg gccacggcgg 14040
cggcagcggc catcgccagc atgtcccgcc cgcggcgagg gaacgtgtac tgggtgcgcg 14100
acgccgccac cggtgtgcgc gtgcccgtgc gcacccgccc ccctcgcact tgaagatgtt 14160
cacttcgcga tgttgatgtg tcccagcggc gaggaggatg tccaagcgca aattcaagga 14220
agagatgctc caggtcatcg cgcctgagat ctacggccct gcggtggtga aggaggaaag 14280
aaagccccgc aaaatcaagc gggtcaaaaa ggacaaaaag gaagaagaaa gtgatgtgga 14340
cggattggtg gagtttgtgc gcgagttcgc cccccggcgg cgcgtgcagt ggcgcgggcg 14400
gaaggtgcaa ccggtgctga gacccggcac caccgtggtc ttcacgcccg gcgagcgctc 14460
cggcaccgct tccaagcgct cctacgacga ggtgtacggg gatgatgata ttctggagca 14520
ggcggccgag cgcctgggcg agtttgctta cggcaagcgc agccgttccg caccgaagga 14580
agaggcggtg tccatcccgc tggaccacgg caaccccacg ccgagcctca agcccgtgac 14640
cttgcagcag gtgctgccga ccgcggcgcc gcgccggggg ttcaagcgcg agggcgagga 14700
tctgtacccc accatgcagc tgatggtgcc caagcgccag aagctggaag acgtgctgga 14760
gaccatgaag gtggacccgg acgtgcagcc cgaggtcaag gtgcggccca tcaagcaggt 14820
ggccccgggc ctgggcgtgc agaccgtgga catcaagatt cccacggagc ccatggaaac 14880
gcagaccgag cccatgatca agcccagcac cagcaccatg gaggtgcaga cggatccctg 14940
gatgccatcg gctcctagtc gaagaccccg gcgcaagtac ggcgcggcca gcctgctgat 15000
gcccaactac gcgctgcatc cttccatcat ccccacgccg ggctaccgcg gcacgcgctt 15060
ctaccgcggt cataccagca gccgccgccg caagaccacc actcgccgcc gccgtcgccg 15120
caccgccgct gcaaccaccc ctgccgccct ggtgcggaga gtgtaccgcc gcggccgcgc 15180
acctctgacc ctgccgcgcg cgcgctacca cccgagcatc gccatttaaa ctttcgcctg 15240
ctttgcagat caatggccct cacatgccgc cttcgcgttc ccattacggg ctaccgagga 15300
agaaaaccgc gccgtagaag gctggcgggg aacgggatgc gtcgccacca ccaccggcgg 15360
cggcgcgcca tcagcaagcg gttgggggga ggcttcctgc ccgcgctgat ccccatcatc 15420
gccgcggcga tcggggcgat ccccggcatt gcttccgtgg cggtgcaggc ctctcagcgc 15480
cactgagaca cacttggaaa catcttgtaa taaaccaatg gactctgacg ctcctggtcc 15540
tgtgatgtgt tttcgtagac agatggaaga catcaatttt tcgtccctgg ctccgcgaca 15600
cggcacgcgg ccgttcatgg gcacctggag cgacatcggc accagccaac tgaacggggg 15660
cgccttcaat tggagcagtc tctggagcgg gcttaagaat ttcgggtcca cgcttaaaac 15720
ctatggcagc aaggcgtgga acagcaccac agggcaggcg ctgagggata agctgaaaga 15780
gcagaacttc cagcagaagg tggtcgatgg gctcgcctcg ggcatcaacg gggtggtgga 15840
cctggccaac caggccgtgc agcggcagat caacagccgc ctggacccgg tgccgcccgc 15900
cggctccgtg gagatgccgc aggtggagga ggagctgcct cccctggaca agcggggcga 15960
gaagcgaccc cgccccgatg cggaggagac gctgctgacg cacacggacg agccgccccc 16020
gtacgaggag gcggtgaaac tgggtctgcc caccacgcgg cccatcgcgc ccctggccac 16080
cggggtgctg aaacccgaaa agcccgcgac cctggacttg cctcctcccc agccttcccg 16140
cccctctaca gtggctaagc ccctgccgcc ggtggccgtg gcccgcgcgc gacccggggg 16200
caccgcccgc cctcatgcga actggcagag cactctgaac agcatcgtgg gtctgggagt 16260
gcagagtgtg aagcgccgcc gctgctatta aacctaccgt agcgcttaac ttgcttgtct 16320
gtgtgtgtat gtattatgtc gccgccgccg ctgtccacca gaaggaggag tgaagaggcg 16380
cgtcgccgag ttgcaagatg gccaccccat cgatgctgcc ccagtgggcg tacatgcaca 16440
tcgccggaca ggacgcttcg gagtacctga gtccgggtct ggtgcagttt gcccgcgcca 16500
cagacaccta cttcagtctg gggaacaagt ttaggaaccc cacggtggcg cccacgcacg 16560
atgtgaccac cgaccgcagc cagcggctga cgctgcgctt cgtgcccgtg gaccgcgagg 16620
acaacaccta ctcgtacaaa gtgcgctaca cgctggccgt gggcgacaac cgcgtgctgg 16680
acatggccag cacctacttt gacatccgcg gcgtgctgga tcggggccct agcttcaaac 16740
cctactccgg caccgcctac aacagtctgg cccccaaggg agcacccaac acttgtcagt 16800
ggacatataa agccgatggt gaaactgcca cagaaaaaac ctatacatat ggaaatgcac 16860
ccgtgcaggg cattaacatc acaaaagatg gtattcaact tggaactgac accgatgatc 16920
agccaatcta cgcagataaa acctatcagc ctgaacctca agtgggtgat gctgaatggc 16980
atgacatcac tggtactgat gaaaagtatg gaggcagagc tcttaagcct gataccaaaa 17040
tgaagccttg ttatggttct tttgccaagc ctactaataa agaaggaggt caggcaaatg 17100
tgaaaacagg aacaggcact actaaagaat atgacataga catggctttc tttgacaaca 17160
gaagtgcggc tgctgctggc ctagctccag aaattgtttt gtatactgaa aatgtggatt 17220
tggaaactcc agatacccat attgtataca aagcaggcac agatgacagc agctcttcta 17280
ttaatttggg tcagcaagcc atgcccaaca gacctaacta cattggtttc agagacaact 17340
ttatcgggct catgtactac aacagcactg gcaatatggg ggtgctggcc ggtcaggctt 17400
ctcagctgaa tgctgtggtt gacttgcaag acagaaacac cgagctgtcc taccagctct 17460
tgcttgactc tctgggtgac agaacccggt atttcagtat gtggaatcag gcggtggaca 17520
gctatgatcc tgatgtgcgc attattgaaa atcatggtgt ggaggatgaa cttcccaact 17580
attgtttccc tctggatgct gttggcagaa cagatactta tcagggaatt aaggctaatg 17640
gaactgatca aaccacatgg accaaagatg acagtgtcaa tgatgctaat gagataggca 17700
agggtaatcc attcgccatg gaaatcaaca tccaagccaa cctgtggagg aacttcctct 17760
acgccaacgt ggccctgtac ctgcccgact cttacaagta cacgccggcc aatgttaccc 17820
tgcccaccaa caccaacacc tacgattaca tgaacggccg ggtggtggcg ccctcgctgg 17880
tggactccta catcaacatc ggggcgcgct ggtcgctgga tcccatggac aacgtgaacc 17940
ccttcaacca ccaccgcaat gcggggctgc gctaccgctc catgctcctg ggcaacgggc 18000
gctacgtgcc cttccacatc caggtgcccc agaaattttt cgccatcaag agcctcctgc 18060
tcctgcccgg gtcctacacc tacgagtgga acttccgcaa ggacgtcaac atgatcctgc 18120
agagctccct cggcaacgac ctgcgcacgg acggggcctc catctccttc accagcatca 18180
acctctacgc caccttcttc cccatggcgc acaacacggc ctccacgctc gaggccatgc 18240
tgcgcaacga caccaacgac cagtccttca acgactacct ctcggcggcc aacatgctct 18300
accccatccc ggccaacgcc accaacgtgc ccatctccat cccctcgcgc aactgggccg 18360
ccttccgcgg ctggtccttc acgcgtctca agaccaagga gacgccctcg ctgggctccg 18420
ggttcgaccc ctacttcgtc tactcgggct ccatccccta cctcgacggc accttctacc 18480
tcaaccacac cttcaagaag gtctccatca ccttcgactc ctccgtcagc tggcccggca 18540
acgaccggct cctgacgccc aacgagttcg aaatcaagcg caccgtcgac ggcgagggct 18600
acaacgtggc ccagtgcaac atgaccaagg actggttcct ggtccagatg ctggcccact 18660
acaacatcgg ctaccagggc ttctacgtgc ccgagggcta caaggaccgc atgtactcct 18720
tcttccgcaa cttccagccc atgagccgcc aggtggtgga cgaggtcaac tacaaggact 18780
accaggccgt caccctggcc taccagcaca acaactcggg cttcgtcggc tacctcgcgc 18840
ccaccatgcg ccagggccag ccctaccccg ccaactaccc ctacccgctc atcggcaaga 18900
gcgccgtcac cagcgtcacc cagaaaaagt tcctctgcga cagggtcatg tggcgcatcc 18960
ccttctccag caacttcatg tccatgggcg cgctcaccga cctcggccag aacatgctct 19020
atgccaactc cgcccacgcg ctagacatga atttcgaagt cgaccccatg gatgagtcca 19080
cccttctcta tgttgtcttc gaagtcttcg acgtcgtccg agtgcaccag ccccaccgcg 19140
gcgtcatcga ggccgtctac ctgcgcaccc ccttctcggc cggtaacgcc accacctaag 19200
ctcttgcttc ttgcaagcca tggccgcggg ctccggcgag caggagctca gggccatcat 19260
ccgcgacctg ggctgcgggc cctacttcct gggcaccttc gataagcgct tcccgggatt 19320
catggccccg cacaagctgg cctgcgccat cgtcaacacg gccggccgcg agaccggggg 19380
cgagcactgg ctggccttcg cctggaaccc gcgctcgaac acctgctacc tcttcgaccc 19440
cttcgggttc tcggacgagc gcctcaagca gatctaccag ttcgagtacg agggcctgct 19500
gcgccgcagc gccctggcca ccgaggaccg ctgcgtcacc ctggaaaagt ccacccagac 19560
cgtgcagggt ccgcgctcgg ccgcctgcgg gctcttctgc tgcatgttcc tgcacgcctt 19620
cgtgcactgg cccgaccgcc ccatggacaa gaaccccacc atgaacttgc tgacgggggt 19680
gcccaacggc atgctccagt cgccccaggt ggaacccacc ctgcgccgca accaggaggc 19740
gctctaccgc ttcctcaact cccactccgc ctactttcgc tcccaccgcg cgcgcatcga 19800
gaaggccacc gccttcgacc gcatgaatca agacatgtaa accgtgtgtg tatgttaaat 19860
gtctttaata aacagcactt tcatgttaca catgcatctg agatgattta tttagaaatc 19920
gaaagggttc tgccgggtct cggcatggcc cgcgggcagg gacacgttgc ggaactggta 19980
cttggccagc cacttgaact cggggatcag cagtttgggc agcggggtgt cggggaagga 20040
gtcggtccac agcttccgcg tcagttgcag ggcgcccagc aggtcgggcg cggagatctt 20100
gaaatcgcag ttgggacccg cgttctgcgc gcgggagttg cggtacacgg ggttgcagca 20160
ctggaacacc atcagggccg ggtgcttcac gctcgccagc accgtcgcgt cggtgatgct 20220
ctccacgtcg aggtcctcgg cgttggccat cccgaagggg gtcatcttgc aggtctgcct 20280
tcccatggtg ggcacgcacc cgggcttgtg gttgcaatcg cagtgcaggg ggatcagcat 20340
catctgggcc tggtcggcgt tcatccccgg gtacatggcc ttcatgaaag cctccaattg 20400
cctgaacgcc tgctgggcct tggctccctc ggtgaagaag accccgcagg acttgctaga 20460
gaactggttg gtggcgcacc cggcgtcgtg cacgcagcag cgcgcgtcgt tgttggccag 20520
ctgcaccacg ctgcgccccc agcggttctg ggtgatcttg gcccggtcgg ggttctcctt 20580
cagcgcgcgc tgcccgttct cgctcgccac atccatctcg atcatgtgct ccttctggat 20640
catggtggtc ccgtgcaggc accgcagctt gccctcggcc tcggtgcacc cgtgcagcca 20700
cagcgcgcac ccggtgcact cccagttctt gtgggcgatc tgggaatgcg cgtgcacgaa 20760
gccctgcagg aagcggccca tcatggtggt cagggtcttg ttgctagtga aggtcagcgg 20820
aatgccgcgg tgctcctcgt tgatgtacag gtggcagatg cggcggtaca cctcgccctg 20880
ctcgggcatc agctggaagt tggctttcag gtcggtctcc acgcggtagc ggtccatcag 20940
catagtcatg atttccatac ccttctccca ggccgagacg atgggcaggc tcatagggtt 21000
cttcaccatc atcttagcgc tagcagccgc ggccaggggg tcgctctcgt ccagggtctc 21060
aaagctccgc ttgccgtcct tctcggtgat ccgcaccggg gggtagctga agcccacggc 21120
cgccagctcc tcctcggcct gtctttcgtc ctcgctgtcc tggctgacgt cctgcaggac 21180
cacatgcttg gtcttgcggg gtttcttctt gggcggcagc ggcggcggag atgttggaga 21240
tggcgagggg gagcgcgagt tctcgctcac cactactatc tcttcctctt cttggtccga 21300
ggccacgcgg cggtaggtat gtctcttcgg gggcagaggc ggaggcgacg ggctctcgcc 21360
gccgcgactt ggcggatggc tggcagagcc ccttccgcgt tcgggggtgc gctcccggcg 21420
gcgctctgac tgacttcctc cgcggccggc cattgtgttc tcctagggag gaacaacaag 21480
catggagact cagccatcgc caacctcgcc atctgccccc accgccgacg agaagcagca 21540
gcagcagaat gaaagcttaa ccgccccgcc gcccagcccc gccacctccg acgcggccgt 21600
cccagacatg caagagatgg aggaatccat cgagattgac ctgggctatg tgacgcccgc 21660
ggagcacgag gaggagctgg cagtgcgctt ttcacaagaa gagatacacc aagaacagcc 21720
agagcaggaa gcagagaatg agcagagtca ggctgggctc gagcatgacg gcgactacct 21780
ccacctgagc gggggggagg acgcgctcat caagcatctg gcccggcagg ccaccatcgt 21840
caaggatgcg ctgctcgacc gcaccgaggt gcccctcagc gtggaggagc tcagccgcgc 21900
ctacgagttg aacctcttct cgccgcgcgt gccccccaag cgccagccca atggcacctg 21960
cgagcccaac ccgcgcctca acttctaccc ggtcttcgcg gtgcccgagg ccctggccac 22020
ctaccacatc tttttcaaga accaaaagat ccccgtctcc tgccgcgcca accgcacccg 22080
cgccgacgcc cttttcaacc tgggtcccgg cgcccgccta cctgatatcg cctccttgga 22140
agaggttccc aagatcttcg agggtctggg cagcgacgag actcgggccg cgaacgctct 22200
gcaaggagaa ggaggagagc atgagcacca cagcgccctg gtcgagttgg aaggcgacaa 22260
cgcgcggctg gcggtgctca aacgcacggt cgagctgacc catttcgcct acccggctct 22320
gaacctgccc cccaaagtca tgagcgcggt catggaccag gtgctcatca agcgcgcgtc 22380
gcccatctcc gaggacgagg gcatgcaaga ctccgaggag ggcaagcccg tggtcagcga 22440
cgagcagctg gcccggtggc tgggtcctaa tgctagtccc cagagtttgg aagagcggcg 22500
caaactcatg atggccgtgg tcctggtgac cgtggagctg gagtgcctgc gccgcttctt 22560
cgccgacgcg gagaccctgc gcaaggtcga ggagaacctg cactacctct tcaggcacgg 22620
gttcgtgcgc caggcctgca agatctccaa cgtggagctg accaacctgg tctcctacat 22680
gggcatcttg cacgagaacc gcctggggca gaacgtgctg cacaccaccc tgcgcgggga 22740
ggcccggcgc gactacatcc gcgactgcgt ctacctctac ctctgccaca cctggcagac 22800
gggcatgggc gtgtggcagc agtgtctgga ggagcagaac ctgaaagagc tctgcaagct 22860
cctgcagaag aacctcaagg gtctgtggac cgggttcgac gagcgcacca ccgcctcgga 22920
cctggccgac ctcattttcc ccgagcgcct caggctgacg ctgcgcaacg gcctgcccga 22980
ctttatgagc caaagcatgt tgcaaaactt tcgctctttc atcctcgaac gctccggaat 23040
cctgcccgcc acctgctccg cgctgccctc ggacttcgtg ccgctgacct tccgcgagtg 23100
ccccccgccg ctgtggagcc actgctacct gctgcgcctg gccaactacc tggcctacca 23160
ctcggacgtg atcgaggacg tcagcggcga gggcctgctc gagtgccact gccgctgcaa 23220
cctctgcacg ccgcaccgct ccctggcctg caacccccag ctgctgagcg agacccagat 23280
catcggcacc ttcgagttgc aagggcccag cgaaggcgag ggttcagccg ccaagggggg 23340
tctgaaactc accccggggc tgtggacctc ggcctacttg cgcaagttcg tgcccgagga 23400
ctaccatccc ttcgagatca ggttctacga ggaccaatcc catccgccca aggccgagct 23460
gtcggcctgc gtcatcaccc agggggcgat cctggcccaa ttgcaagcca tccagaaatc 23520
ccgccaagaa ttcttgctga aaaagggccg cggggtctac ctcgaccccc agaccggtga 23580
ggagctcaac cccggcttcc cccaggatgc cccgaggaaa caagaagctg aaagtggagc 23640
tgccgcccgt ggaggatttg gaggaagact gggagaacag cagtcaggca gaggaggagg 23700
agatggagga agactgggac agcactcagg cagaggagga cagcctgcaa gacagtctgg 23760
aggaagacga ggaggaggca gaggaggagg tggaagaagc agccgccgcc agaccgtcgt 23820
cctcggcggg ggagaaagca agcagcacgg ataccatctc cgctccgggt cggggtcccg 23880
ctcgaccaca cagtagatgg gacgagaccg gacgattccc gaaccccacc acccagaccg 23940
gtaagaagga gcggcaggga tacaagtcct ggcgggggca caaaaacgcc atcgtctcct 24000
gcttgcaggc ctgcgggggc aacatctcct tcacccggcg ctacctgctc ttccaccgcg 24060
gggtgaactt tccccgcaac atcttgcatt actaccgtca cctccacagc ccctactact 24120
tccaagaaga ggcagcagca gcagaaaaag accagcagaa aaccagcagc tagaaaatcc 24180
acagcggcgg cagcaggtgg actgaggatc gcggcgaacg agccggcgca aacccgggag 24240
ctgaggaacc ggatctttcc caccctctat gccatcttcc agcagagtcg ggggcaggag 24300
caggaactga aagtcaagaa ccgttctctg cgctcgctca cccgcagttg tctgtatcac 24360
aagagcgaag accaacttca gcgcactctc gaggacgccg aggctctctt caacaagtac 24420
tgcgcgctca ctcttaaaga gtagcccgcg cccgcccagt cgcagaaaaa ggcgggaatt 24480
acgtcacctg tgcccttcgc cctagccgcc tccacccatc atcatgagca aagagattcc 24540
cacgccttac atgtggagct accagcccca gatgggcctg gccgccggtg ccgcccagga 24600
ctactccacc cgcatgaatt ggctcagcgc cgggcccgcg atgatctcac gggtgaatga 24660
catccgcgcc caccgaaacc agatactcct agaacagtca gcgctcaccg ccacgccccg 24720
caatcacctc aatccgcgta attggcccgc cgccctggtg taccaggaaa ttccccagcc 24780
cacgaccgta ctacttccgc gagacgccca ggccgaagtc cagctgacta actcaggtgt 24840
ccagctggcg ggcggcgcca ccctgtgtcg tcaccgcccc gctcagggta taaagcggct 24900
ggtgatccgg ggcagaggca cacagctcaa cgacgaggtg gtgagctctt cgctgggtct 24960
gcgacctgac ggagtcttcc aactcgccgg atcggggaga tcttccttca cgcctcgtca 25020
ggccgtcctg actttggaga gttcgtcctc gcagccccgc tcgggtggca tcggcactct 25080
ccagttcgtg gaggagttca ctccctcggt ctacttcaac cccttctccg gctcccccgg 25140
ccactacccg gacgagttca tcccgaactt cgacgccatc agcgagtcgg tggacggcta 25200
cgattgaatg tcccatggtg gcgcagctga cctagctcgg cttcgacacc tggaccactg 25260
ccgccgcttc cgctgcttcg ctcgggatct cgccgagttt gcctactttg agctgcccga 25320
ggagcaccct cagggcccgg cccacggagt gcggatcgtc gtcgaagggg gcctcgactc 25380
ccacctgctt cggatcttca gccagcgtcc gatcctggtc gagcgcgagc aaggacagac 25440
ccttctgact ctgtactgca tctgcaacca ccccggcctg catgaaagtc tttgttgtct 25500
gctgtgtact gagtataata aaagctgaga tcagcgacta ctccggactt ccgtgtgttt 25560
aaactcaccc ccttatccag tgaaataaag atcatattga tgatgatttt acagaaataa 25620
aaaataatca tttgatttga aataaagata caatcatatt gatgatttga gtttaacaaa 25680
aaaataaaga atcacttact tgaaatctga taccaggtct ctgtccatgt tttctgccaa 25740
caccacttca ctcccctctt cccagctctg gtactgcagg ccccggcggg ctgcaaactt 25800
cctccacacg ctgaagggga tgtcaaattc ctcctgtccc tcaatcttca ttttatcttc 25860
tatcagatgt ccaaaaagcg cgtccgggtg gatgatgact tcgaccccgt ctacccctac 25920
gatgcagaca acgcaccgac cgtgcccttc atcaaccccc ccttcgtctc ttcagatgga 25980
ttccaagaga agcccctggg ggtgttgtcc ctgcgactgg ccgaccccgt caccaccaag 26040
aacggggaaa tcaccctcaa gctgggagag ggggtggacc tcgattcctc gggaaaactc 26100
atctccaaca cggccaccaa ggccgccgcc cctctcagtt tttccaacaa caccatttcc 26160
cttaacatgg atcacccctt ttacactaaa gatggaaaat tatccttaca agtttctcca 26220
ccattaaata tactgagaac aagcattcta aacacactag ctttaggttt tggatcaggt 26280
ttaggactcc gtggctctgc cttggcagta cagttagtct ctccacttac atttgatact 26340
gatggaaaca taaagcttac cttagacaga ggtttgcatg ttacaacagg agatgcaatt 26400
gaaagcaaca taagctgggc taaaggttta aaatttgaag atggagccat agcaaccaac 26460
attggaaatg ggttagagtt tggaagcagt agtacagaaa caggtgttga tgatgcttac 26520
ccaatccaag ttaaacttgg atctggcctt agctttgaca gtacaggagc cataatggct 26580
ggtaacaaag aagacgataa actcactttg tggacaacac ctgatccatc accaaactgt 26640
caaatactcg cagaaaatga tgcaaaacta acactttgct tgactaaatg tggtagtcaa 26700
atactggcca ctgtgtcagt cttagttgta ggaagtggaa acctaaaccc cattactggc 26760
accgtaagca gtgctcaggt gtttctacgt tttgatgcaa acggtgttct tttaacagaa 26820
cattctacac taaaaaaata ctgggggtat aggcagggag atagcataga tggcactcca 26880
tataccaatg ctgtaggatt catgcccaat ttaaaagctt atccaaagtc acaaagttct 26940
actactaaaa ataatatagt agggcaagta tacatgaatg gagatgtttc aaaacctatg 27000
cttctcacta taaccctcaa tggtactgat gacagcaaca gtacatattc aatgtcattt 27060
tcatacacct ggactaatgg aagctatgtt ggagcaacat ttggggctaa ctcttatacc 27120
ttctcataca tcgcccaaga atgaacactg tatcccaccc tgcatgccaa cccttcccac 27180
cccactctgt ggaacaaact ctgaaacaca aaataaaata aagttcaagt gttttattga 27240
ttcaacagtt ttacaggatt cgagcagtta tttttcctcc accctcccag gacatggaat 27300
acaccaccct ctccccccgc acagccttga acatctgaat gccattggtg atggacatgc 27360
ttttggtctc cacgttccac acagtttcag agcgagccag tctcgggtcg gtcagggaga 27420
tgaaaccctc cgggcactcc cgcatctgca cctcacagct caacagctga ggattgtcct 27480
cggtggtcgg gatcacggtt atctggaaga agcagaagag cggcggtggg aatcatagtc 27540
cgcgaacggg atcggccggt ggtgtcgcat caggccccgc agcagtcgct gccgccgccg 27600
ctccgtcaag ctgctgctca gggggtccgg gtccagggac tccctcagca tgatgcccac 27660
ggccctcagc atcagtcgtc tggtgcggcg ggcgcagcag cgcatgcgga tctcgctcag 27720
gtcgctgcag tacgtgcaac acagaaccac caggttgttc aacagtccat agttcaacac 27780
gctccagccg aaactcatcg cgggaaggat gctacccacg tggccgtcgt accagatcct 27840
caggtaaatc aagtggtgcc ccctccagaa cacgctgccc acgtacatga tctccttggg 27900
catgtggcgg ttcaccacct cccggtacca catcaccctc tggttgaaca tgcagccccg 27960
gatgatcctg cggaaccaca gggccagcac cgccccgccc gccatgcagc gaagagaccc 28020
cgggtcccgg caatggcaat ggaggaccca ccgctcgtac ccgtggatca tctgggagct 28080
gaacaagtct atgttggcac agcacaggca tatgctcatg catctcttca gcactctcaa 28140
ctcctcgggg gtcaaaacca tatcccaggg cacggggaac tcttgcagga cagcgaaccc 28200
cgcagaacag ggcaatcctc gcacagaact tacattgtgc atggacaggg tatcgcaatc 28260
aggcagcacc gggtgatcct ccaccagaga agcgcgggtc tcggtctcct cacagcgtgg 28320
taagggggcc ggccgatacg ggtgatggcg ggacgcggct gatcgtgttc gcgaccgtgt 28380
catgatgcag ttgctttcgg acattttcgt acttgctgta gcagaacctg gtccgggcgc 28440
tgcacaccga tcgccggcgg cggtctcggc gcttggaacg ctcggtgttg aaattgtaaa 28500
acagccactc tctcagaccg tgcagcagat ctagggcctc aggagtgatg aagatcccat 28560
catgcctgat ggctctgatc acatcgacca ccgtggaatg ggccagaccc agccagatga 28620
tgcaattttg ttgggtttcg gtgacggcgg gggagggaag aacaggaaga accatgatta 28680
acttttaatc caaacggtct cggagtactt caaaatgaag atcgcggaga tggcacctct 28740
cgcccccgct gtgttggtgg aaaataacag ccaggtcaaa ggtgatacgg ttctcgagat 28800
gttccacggt ggcttccagc aaagcctcca cgcgcacatc cagaaacaag acaatagcga 28860
aagcgggagg gttctctaat tcctcaatca tcatgttaca ctcctgcacc atccccagat 28920
aattttcatt tttccagcct tgaatgattc gaactagttc ctgaggtaaa tccaagccag 28980
ccatgataaa gagctcgcgc agagcgccct ccaccggcat tcttaagcac accctcataa 29040
ttccaagata ttctgctcct ggttcacctg cagcagattg acaagcggaa tatcaaaatc 29100
tctgccgcga tccctgagct cctccctcag caataactgt aagtactctt tcatatcctc 29160
tccgaaattt ttagccatag gaccaccagg aataagatta gggcaagcca cagtacagat 29220
aaaccgaagt cctccccagt gagcattgcc aaatgcaaga ctgctataag catgctggct 29280
agacccggtg atatcttcca gataactgga cagaaaatcg cccaggcaat ttttaagaaa 29340
atcaacaaaa gaaaaatcct ccaggtggac gtttagagcc tcgggaacaa cgatgaagta 29400
aatgcaagcg gtgcgttcca gcatggttag ttagctgatc tgtagaaaaa acaaaaatga 29460
acattaaacc atgctagcct ggcgaacagg tgggtaaatc gttctctcca gcaccaggca 29520
ggccacgggg tctccggcgc gaccctcgta aaaattgtcg ctatgattga aaaccatcac 29580
agagagacgt tcccggtggc cggcgtgaat gattcgacaa gatgaataca cccccggaac 29640
attggcgtcc gcgagtgaaa aaaagcgccc gaggaagcaa taaggcacta caatgctcag 29700
tctcaagtcc agcaaagcga tgccatgcgg atgaagcaca aaattctcag gtgcgtacaa 29760
aatgtaatta ctcccctcct gcacaggcag caaagccccc gatccctcca ggtacacata 29820
caaagcctca gcgtccatag cttaccgagc agcagcacac aacaggcgca agagtcagag 29880
aaaggctgag ctctaacctg tccacccgct ctctgctcaa tatatagccc agatctacac 29940
tgacgtaaag gccaaagtct aaaaataccc gccaaataat cacacacgcc cagcacacgc 30000
ccagaaaccg gtgacacact caaaaaaata cgcgcacttc ctcaaacgcc caaaactgcc 30060
gtcatttccg ggttcccacg ctacgtcatc aaaacacgac tttcaaattc cgtcgaccgt 30120
taaaaacgtc acccgccccg cccctaacgg tcgcccgtct ctcagccaat cagcgccccg 30180
catccccaaa ttcaaacacc tcatttgcat attaacgcgc acaaaaagtt tgaggtatat 30240
tattgatgat gg 30252
<210> 76
<211> 19
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 76
His Tyr Ala Gly Tyr Phe Ala Asp Leu Leu Ile His Asp Ile Glu Thr
1 5 10 15
Asn Pro Gly
<210> 77
<211> 19
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 77
Gln Cys Thr Asn Tyr Ala Leu Leu Lys Leu Ala Gly Asp Val Glu Ser
1 5 10 15
Asn Pro Gly
<210> 78
<211> 19
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 78
Gly Ala Thr Asn Phe Ser Leu Leu Lys Leu Ala Gly Asp Val Glu Leu
1 5 10 15
Asn Pro Gly
<210> 79
<211> 18
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 79
Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn
1 5 10 15
Pro Gly
<210> 80
<211> 17
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 80
Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn Pro
1 5 10 15
Gly
<210> 81
<211> 1134
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 81
Met Ala Ser Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu
1 5 10 15
Arg Ser His Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg
20 25 30
Leu Gly Pro Gln Gly Trp Arg Leu Val Gln Arg Gly Asp Pro Ala Ala
35 40 45
Phe Arg Ala Leu Val Ala Gln Cys Leu Val Cys Val Pro Trp Asp Ala
50 55 60
Arg Pro Pro Pro Ala Ala Pro Ser Phe Arg Gln Val Ser Cys Leu Lys
65 70 75 80
Glu Leu Val Ala Arg Val Leu Gln Arg Leu Cys Glu Arg Gly Ala Lys
85 90 95
Asn Val Leu Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly
100 105 110
Pro Pro Glu Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro Asn Thr
115 120 125
Val Thr Asp Ala Leu Arg Gly Ser Gly Ala Trp Gly Leu Leu Leu Arg
130 135 140
Arg Val Gly Asp Asp Val Leu Val His Leu Leu Ala Arg Cys Ala Leu
145 150 155 160
Phe Val Leu Val Ala Pro Ser Cys Ala Tyr Gln Val Cys Gly Pro Pro
165 170 175
Leu Tyr Gln Leu Gly Ala Ala Thr Gln Ala Arg Pro Pro Pro His Ala
180 185 190
Ser Gly Pro Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn His Ser
195 200 205
Val Arg Glu Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg
210 215 220
Arg Arg Gly Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro
225 230 235 240
Arg Arg Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln Gly
245 250 255
Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe
260 265 270
Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu
275 280 285
Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gln
290 295 300
His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp
305 310 315 320
Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser
325 330 335
Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu
340 345 350
Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe Leu
355 360 365
Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu
370 375 380
Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu Gly
385 390 395 400
Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro
405 410 415
Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys
420 425 430
Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg
435 440 445
Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val Tyr
450 455 460
Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp
465 470 475 480
Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe
485 490 495
Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr Trp
500 505 510
Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val
515 520 525
Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu Ala
530 535 540
Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg
545 550 555 560
Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu Phe
565 570 575
Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile Arg
580 585 590
Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu Val
595 600 605
Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg
610 615 620
Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp Tyr
625 630 635 640
Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu
645 650 655
Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala
660 665 670
Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp Ile
675 680 685
His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp Pro
690 695 700
Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr Asp
705 710 715 720
Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile Lys
725 730 735
Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys Ala
740 745 750
Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu
755 760 765
Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln Glu
770 775 780
Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser Leu
785 790 795 800
Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys
805 810 815
His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln Gly
820 825 830
Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr
835 840 845
Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly Leu
850 855 860
Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr
865 870 875 880
His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr
885 890 895
Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu
900 905 910
Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His Gly
915 920 925
Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val
930 935 940
Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser Leu
945 950 955 960
Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu
965 970 975
Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln
980 985 990
Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu
995 1000 1005
Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe
1010 1015 1020
His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile
1025 1030 1035
Ser Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn
1040 1045 1050
Ala Gly Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro
1055 1060 1065
Ser Glu Ala Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys
1070 1075 1080
Leu Thr Arg His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu
1085 1090 1095
Arg Thr Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr
1100 1105 1110
Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp
1115 1120 1125
Phe Lys Thr Ile Leu Asp
1130
<210> 82
<211> 3402
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 82
atggctagcc cgcgcgctcc aagatgtcgg gccgtccgct cgctcctgag gtcgcattac 60
agagaagtgc tgcctttggc cacgttcgtg cgccggctcg gaccgcaggg atggcggctt 120
gtgcagcggg gcgacccggc tgccttccgc gctctcgtgg cgcaatgctt ggtgtgcgtt 180
ccatgggacg cacgccctcc ccctgcagcg ccctcgttcc gccaagtcag ctgcctgaag 240
gaactcgtcg ccagagtcct gcagagactg tgtgagagag gggcgaaaaa tgtgctcgcg 300
ttcggattcg cactgctgga tggagcaagg gggggtccgc cagaagcgtt cacgactagc 360
gtgcgctcct acctcccaaa tactgtgacc gacgccctcc gcggatcagg agcctggggc 420
ctccttttga ggcgggtggg cgatgacgtg ctggtgcacc tcctcgcgcg atgcgccctg 480
ttcgtgctcg tggccccgtc ctgcgcctac caggtctgcg gccccccgtt gtaccaactg 540
ggggccgcca cgcaggctcg gccgccacct catgcatccg gcccacggag gcgactcggt 600
tgtgaacggg cctggaacca ttcggtgcgg gaggctggtg ttccactggg actgcccgct 660
cctggtgcca gacgccgggg aggttcggcg tcacgctcgt tgccactgcc gaagcggccc 720
agacggggag ctgccccgga gccggagagg acccccgttg gccagggatc gtgggcccat 780
ccgggacgca ccaggggacc atccgacagg ggattctgtg tggtgtcacc ggccaggcca 840
gcagaagagg caaccagcct cgagggagcg ttgtctggaa ccagacattc ccacccgtcg 900
gtgggccggc agcaccacgc gggaccaccg tccacttcca gaccgccacg gccatgggac 960
accccttgcc cgcctgtgta tgccgagact aaacacttcc tgtactcatc cggagacaag 1020
gaacagcttc ggccgtcctt cctcctgtcg tcgctcagac cgagcctgac cggagcacgc 1080
agattggtgg aaactatctt ccttgggtca cgtccgtgga tgccaggtac cccacggcgc 1140
ctcccgcgcc tcccacagag atactggcag atgcggcctc tgttcctgga attgctggga 1200
aaccacgctc agtgcccgta cggagtcctg ctcaagactc actgccctct gagggcggcg 1260
gtcactccgg cggccggagt gtgcgcacgg gagaagcccc agggaagcgt ggcagctccg 1320
gaagaggagg acaccgatcc gcgccgcctc gtgcaacttc tgcgccagca ctcctcgccc 1380
tggcaagtct acgggttcgt ccgcgcctgc ctgcgccgcc tggtgccgcc tgggctctgg 1440
ggttcccggc ataacgagcg ccgcttcctg agaaatacta agaagtttat ctcacttgga 1500
aaacatgcca agttgtcgct gcaagaactc acgtggaaga tgtcagtccg cgattgcgcc 1560
tggctgcgcc gctcgccggg cgtcgggtgt gttccagctg cagaacaccg cctgagagaa 1620
gaaattctgg ccaaatttct gcattggctg atgtcagtgt acgtggtcga gctgctgcgc 1680
tcctttttct acgtcactga gactaccttt caaaagaacc gcctgttctt ctaccgcaaa 1740
tctgtgtgga gcaagctgca gtcaatcggc attcgccagc atctgaagag ggtgcagctg 1800
cgggaacttt ccgaggcaga agtccgccag caccgggagg cccggccggc gcttctcacg 1860
tcgcgtctga gattcatccc aaagcccgac gggctgaggc ctatcgtcaa catggattac 1920
gtcgtgggcg ctcgcacctt tcgccgtgaa aagcgggccg aacgcttgac ctcacgggtg 1980
aaggccctct tctccgtgct gaactacgag agagcaagac ggcctggcct gctgggagct 2040
tcggtgctgg gactggacga tatccaccgg gcttggcgga cctttgttct ccgggtgaga 2100
gcccaagacc ctccgccgga actgtacttc gtgaaggtgg cgatcaccgg agcctatgat 2160
actattccgc aagatcgact caccgaagtc atcgcctcga tcatcaaacc gcagaacact 2220
tactgcgtca ggcggtacgc cgtggtccag aaggccgcgc atggccacgt gagaaaggcg 2280
ttcaagtcgc acgtgtccac tctcaccgac ctccagcctt acatgaggca attcgttgcg 2340
catttgcaag agacttcgcc cctgagagat gcggtggtca tcgagcagag ctccagcctg 2400
aacgaagcga gcagcggtct gtttgacgtg ttcctccgct tcatgtgtca tcacgcggtg 2460
cgaatcaggg gaaaatcata cgtgcagtgc cagggaatcc cacaaggcag cattctgtcg 2520
actctcttgt gttccctttg ctacggcgat atggaaaaca agctgttcgc tgggatcaga 2580
cgggacgggt tgctgctcag actggtggac gacttcctgc tggtgactcc gcacctcact 2640
cacgccaaaa cctttctccg cactctggtg aggggagtgc cagaatacgg ctgtgtggtc 2700
aatctccgga aaactgtggt gaatttccct gtcgaggatg aggcactcgg aggaaccgca 2760
tttgtccaaa tgccagcaca tggcctgttc ccatggtgcg gtctgctgct ggacacccga 2820
actcttgaag tgcagtccga ctactccagc tatgcccgga cgagcatccg cgccagcctc 2880
actttcaatc gcggctttaa ggccggacga aacatgcgca gaaagctttt cggagtcctc 2940
cggcttaaat gccattcgct ctttctcgat ctccaagtca attcgctgca gaccgtgtgc 3000
acgaacatct acaagatcct gctgctccaa gcctaccggt tccacgcttg cgtgcttcag 3060
ctgccgtttc accaacaggt gtggaagaac ccgaccttct ttctgcgggt cattagcgat 3120
actgcctccc tgtgttactc aatcctcaag gcaaagaacg ccggaatgtc gctgggtgcg 3180
aaaggagccg cgggacctct tcctagcgaa gcggtgcagt ggctctgcca ccaggctttc 3240
ctcctgaagc tgaccaggca cagagtgacc tacgtcccgc tgctgggctc gctgcgcact 3300
gcacagaccc agctgtctag aaaactcccc ggcaccaccc tgaccgctct ggaagccgcc 3360
gccaacccag cattgccgtc agatttcaag accatcttgg ac 3402
<210> 83
<211> 7149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 83
ggcgtaatgc tctgccagtg ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg 60
agcatcaaat gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa 120
agccgtttct gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc 180
tggtatcggt ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg 240
tcaaaaataa ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat 300
ggcaaaagct tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca 360
tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgagacga 420
aatacgcgat cgctgttaaa aggacaatta caaacaggaa tcaaatgcaa ccggcgcagg 480
aacactgcca gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg 540
aatgctgttt tcccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata 600
aaatgcttga tggtcggaag aggcataaat tccgtcagcc agtttagtct gaccatctca 660
tctgtaacat cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg 720
ggcttcccat acaatcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat 780
ttatacccat ataaatcagc atccatgttg gaatttaatc gcggcctcga gcaagacgtt 840
tcccgttgaa tatggctcat aacacccctt gtattactgt ttatgtaagc agacaggtcg 900
acaatattgg ctattggcca ttgcatacgt tgtatctata tcataatatg tacatttata 960
ttggctcatg tccaatatga ccgccatgtt gacattgatt attgactagt tattaatagt 1020
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 1080
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 1140
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 1200
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt ccgcccccta 1260
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttacggg 1320
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 1380
tttggcagta caccaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 1440
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 1500
gtcgtaataa ccccgccccg ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct 1560
atataagcag agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 1620
ttgacctcca tagaagacac cgggaccgat ccagcctccg cggccgggaa cggtgcattg 1680
gaacgcggat tccccgtgcc aagagtgact caccgtccgg atctcagcaa gcaggtatgt 1740
actctccagg gtgggcctgg cttccccagt caagactcca gggatttgag ggacgctgtg 1800
ggctcttctc ttacatgtac cttttgcttg cctcaaccct gactatcttc caggtcagga 1860
tcccagagtc aggggtctgt attttcctgc tggtggctcc agttcaggaa cagtaaaccc 1920
tgctccgaat attgcctctc acatctcgtc aatctccgcg aggactgggg accctgtgac 1980
gaacatggct agcccgcgcg ctccaagatg tcgggccgtc cgctcgctcc tgaggtcgca 2040
ttacagagaa gtgctgcctt tggccacgtt cgtgcgccgg ctcggaccgc agggatggcg 2100
gcttgtgcag cggggcgacc cggctgcctt ccgcgctctc gtggcgcaat gcttggtgtg 2160
cgttccatgg gacgcacgcc ctccccctgc agcgccctcg ttccgccaag tcagctgcct 2220
gaaggaactc gtcgccagag tcctgcagag actgtgtgag agaggggcga aaaatgtgct 2280
cgcgttcgga ttcgcactgc tggatggagc aagggggggt ccgccagaag cgttcacgac 2340
tagcgtgcgc tcctacctcc caaatactgt gaccgacgcc ctccgcggat caggagcctg 2400
gggcctcctt ttgaggcggg tgggcgatga cgtgctggtg cacctcctcg cgcgatgcgc 2460
cctgttcgtg ctcgtggccc cgtcctgcgc ctaccaggtc tgcggccccc cgttgtacca 2520
actgggggcc gccacgcagg ctcggccgcc acctcatgca tccggcccac ggaggcgact 2580
cggttgtgaa cgggcctgga accattcggt gcgggaggct ggtgttccac tgggactgcc 2640
cgctcctggt gccagacgcc ggggaggttc ggcgtcacgc tcgttgccac tgccgaagcg 2700
gcccagacgg ggagctgccc cggagccgga gaggaccccc gttggccagg gatcgtgggc 2760
ccatccggga cgcaccaggg gaccatccga caggggattc tgtgtggtgt caccggccag 2820
gccagcagaa gaggcaacca gcctcgaggg agcgttgtct ggaaccagac attcccaccc 2880
gtcggtgggc cggcagcacc acgcgggacc accgtccact tccagaccgc cacggccatg 2940
ggacacccct tgcccgcctg tgtatgccga gactaaacac ttcctgtact catccggaga 3000
caaggaacag cttcggccgt ccttcctcct gtcgtcgctc agaccgagcc tgaccggagc 3060
acgcagattg gtggaaacta tcttccttgg gtcacgtccg tggatgccag gtaccccacg 3120
gcgcctcccg cgcctcccac agagatactg gcagatgcgg cctctgttcc tggaattgct 3180
gggaaaccac gctcagtgcc cgtacggagt cctgctcaag actcactgcc ctctgagggc 3240
ggcggtcact ccggcggccg gagtgtgcgc acgggagaag ccccagggaa gcgtggcagc 3300
tccggaagag gaggacaccg atccgcgccg cctcgtgcaa cttctgcgcc agcactcctc 3360
gccctggcaa gtctacgggt tcgtccgcgc ctgcctgcgc cgcctggtgc cgcctgggct 3420
ctggggttcc cggcataacg agcgccgctt cctgagaaat actaagaagt ttatctcact 3480
tggaaaacat gccaagttgt cgctgcaaga actcacgtgg aagatgtcag tccgcgattg 3540
cgcctggctg cgccgctcgc cgggcgtcgg gtgtgttcca gctgcagaac accgcctgag 3600
agaagaaatt ctggccaaat ttctgcattg gctgatgtca gtgtacgtgg tcgagctgct 3660
gcgctccttt ttctacgtca ctgagactac ctttcaaaag aaccgcctgt tcttctaccg 3720
caaatctgtg tggagcaagc tgcagtcaat cggcattcgc cagcatctga agagggtgca 3780
gctgcgggaa ctttccgagg cagaagtccg ccagcaccgg gaggcccggc cggcgcttct 3840
cacgtcgcgt ctgagattca tcccaaagcc cgacgggctg aggcctatcg tcaacatgga 3900
ttacgtcgtg ggcgctcgca cctttcgccg tgaaaagcgg gccgaacgct tgacctcacg 3960
ggtgaaggcc ctcttctccg tgctgaacta cgagagagca agacggcctg gcctgctggg 4020
agcttcggtg ctgggactgg acgatatcca ccgggcttgg cggacctttg ttctccgggt 4080
gagagcccaa gaccctccgc cggaactgta cttcgtgaag gtggcgatca ccggagccta 4140
tgatactatt ccgcaagatc gactcaccga agtcatcgcc tcgatcatca aaccgcagaa 4200
cacttactgc gtcaggcggt acgccgtggt ccagaaggcc gcgcatggcc acgtgagaaa 4260
ggcgttcaag tcgcacgtgt ccactctcac cgacctccag ccttacatga ggcaattcgt 4320
tgcgcatttg caagagactt cgcccctgag agatgcggtg gtcatcgagc agagctccag 4380
cctgaacgaa gcgagcagcg gtctgtttga cgtgttcctc cgcttcatgt gtcatcacgc 4440
ggtgcgaatc aggggaaaat catacgtgca gtgccaggga atcccacaag gcagcattct 4500
gtcgactctc ttgtgttccc tttgctacgg cgatatggaa aacaagctgt tcgctgggat 4560
cagacgggac gggttgctgc tcagactggt ggacgacttc ctgctggtga ctccgcacct 4620
cactcacgcc aaaacctttc tccgcactct ggtgagggga gtgccagaat acggctgtgt 4680
ggtcaatctc cggaaaactg tggtgaattt ccctgtcgag gatgaggcac tcggaggaac 4740
cgcatttgtc caaatgccag cacatggcct gttcccatgg tgcggtctgc tgctggacac 4800
ccgaactctt gaagtgcagt ccgactactc cagctatgcc cggacgagca tccgcgccag 4860
cctcactttc aatcgcggct ttaaggccgg acgaaacatg cgcagaaagc ttttcggagt 4920
cctccggctt aaatgccatt cgctctttct cgatctccaa gtcaattcgc tgcagaccgt 4980
gtgcacgaac atctacaaga tcctgctgct ccaagcctac cggttccacg cttgcgtgct 5040
tcagctgccg tttcaccaac aggtgtggaa gaacccgacc ttctttctgc gggtcattag 5100
cgatactgcc tccctgtgtt actcaatcct caaggcaaag aacgccggaa tgtcgctggg 5160
tgcgaaagga gccgcgggac ctcttcctag cgaagcggtg cagtggctct gccaccaggc 5220
tttcctcctg aagctgacca ggcacagagt gacctacgtc ccgctgctgg gctcgctgcg 5280
cactgcacag acccagctgt ctagaaaact ccccggcacc accctgaccg ctctggaagc 5340
cgccgccaac ccagcattgc cgtcagattt caagaccatc ttggactgaa gatctgggcc 5400
ctaacaaaac aaaaagatgg ggttattccc taaacttcat gggttacgta attggaagtt 5460
gggggacatt gccacaagat catattgtac aaaagatcaa acactgtttt agaaaacttc 5520
ctgtaaacag gcctattgat tggaaagtat gtcaaaggat tgtgggtctt ttgggctttg 5580
ctgctccatt tacacaatgt ggatatcctg ccttaatgcc tttgtatgca tgtatacaag 5640
ctaaacaggc tttcactttc tcgccaactt acaaggcctt tctaagtaaa cagtacatga 5700
acctttaccc cgttgctcgg caacggcctg gtctgtgcca agtgtttgct gacgcaaccc 5760
ccactggctg gggcttggcc ataggccatc agcgcatgcg tggaaccttt gtggctcctc 5820
tgccgatcca tactgcggaa ctcctagccg cttgttttgc tcgcagccgg tctggagcaa 5880
agctcatagg aactgacaat tctgtcgtcc tctcgcggaa atatacatcg tttcgatcta 5940
cgtatgatct ttttccctct gccaaaaatt atggggacat catgaagccc cttgagcatc 6000
tgacttctgg ctaataaagg aaatttattt tcattgcaat agtgtgttgg aattttttgt 6060
gtctctcact cggaaggaat tctgcattaa tgaatcggcc aacgcgcggg gagaggcggt 6120
ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg 6180
ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg 6240
gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag 6300
gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga 6360
cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct 6420
ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc 6480
tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg 6540
gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc 6600
tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca 6660
ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag 6720
ttcttgaagt ggtggcctaa ctacggctac actagaagaa cagtatttgg tatctgcgct 6780
ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc 6840
accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga 6900
tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca 6960
cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat 7020
taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac 7080
caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt 7140
gcctgactc 7149
<210> 84
<211> 1611
<212> RNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 84
auggcuagca ccccuggaac ccagagcccc uucuuccuuc ugcugcugcu gaccgugcug 60
acugucguga caggcucugg ccacgccagc ucuacaccug gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgu gccaagcagc accgagaaga acgccguguc caugaccagc 180
uccgugcuga gcagccacuc uccuggcagc ggcagcagca caacacaggg ccaggaugug 240
acacuggccc cugccacaga accugccucu ggaucugccg ccaccugggg acaggacgug 300
acaagcgugc cagugaccag accugcccug ggcucuacaa cacccccugc ccacgaugug 360
accagcgccc cugauaacaa gccugccccu ggaagcacag ccccuccagc ucauggcgug 420
accucugccc cagauaccag accagcccca ggaucuacag ccccacccgc acacggcgug 480
acaagugccc cugacacaag acccgcucca ggcucuacug cuccuccugc ccauggcgug 540
acaagcgcuc ccgauacaag gccagcuccu ggcuccacag caccaccagc acauggcgug 600
acaucagcuc ccgacacuag accugcuccc ggaucaaccg cuccaccagc ucacggcgug 660
accagcgcac cugauaccag accugcucug ggaagcaccg ccccucccgu gcacaaugug 720
acaucugcuu ccggcagcgc cagcggcucu gccucuacac uggugcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcacccccu ucagcauccc uagccaccac 840
agcgacaccc cuaccacacu ggccagccac uccaccaaga ccgaugccuc uagcacccac 900
cacuccagcg ugcccccucu gaccagcagc aaccacagca caagccccca gcugucuacc 960
ggcgucucau ucuucuuucu guccuuccac aucagcaacc ugcaguucaa cagcagccug 1020
gaagauccca gcaccgacua cuaccaggaa cugcagcggg auaucagcga gauguuccug 1080
caaaucuaca agcagggcgg cuuccugggc cugagcaaca ucaaguucag acccggcagc 1140
gugguggugc agcugacccu ggcuuuccgg gaaggcacca ucaacgugca cgacguggaa 1200
acccaguuca accaguacaa gaccgaggcc gccagccggu acaaccugac caucuccgau 1260
guguccgugu ccgacgugcc cuucccauuc ucugcccagu cuggcgcagg cgugccagga 1320
uggggaauug cucugcuggu gcucgugugc gugcuggugg cccuggccau cguguaucug 1380
auugcccugg ccgugugcca gugccggcgg aagaauuacg gccagcugga caucuucccc 1440
gccagagaca ccuaccaccc caugagcgag uaccccacau accacaccca cggcagauac 1500
gugccaccca gcuccaccga cagauccccc uacgagaaag ugucugccgg caacggcggc 1560
agcucccuga gcuacacaaa uccugccgug gccgcugccu ccgccaaccu g 1611
<210> 85
<211> 2679
<212> RNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 85
augggagcug ccccggagcc ggagaggacc cccguuggcc agggaucgug ggcccauccg 60
ggacgcacca ggggaccauc cgacagggga uucugugugg ugucaccggc caggccagca 120
gaagaggcaa ccagccucga gggagcguug ucuggaacca gacauuccca cccgucggug 180
ggccggcagc accacgcggg accaccgucc acuuccagac cgccacggcc augggacacc 240
ccuugcccgc cuguguaugc cgagacuaaa cacuuccugu acucauccgg agacaaggaa 300
cagcuucggc cguccuuccu ccugucgucg cucagaccga gccugaccgg agcacgcaga 360
uugguggaaa cuaucuuccu ugggucacgu ccguggaugc cagguacccc acggcgccuc 420
ccgcgccucc cacagagaua cuggcagaug cggccucugu uccuggaauu gcugggaaac 480
cacgcucagu gcccguacgg aguccugcuc aagacucacu gcccucugag ggcggcgguc 540
acuccggcgg ccggagugug cgcacgggag aagccccagg gaagcguggc agcuccggaa 600
gaggaggaca ccgauccgcg ccgccucgug caacuucugc gccagcacuc cucgcccugg 660
caagucuacg gguucguccg cgccugccug cgccgccugg ugccgccugg gcucuggggu 720
ucccggcaua acgagcgccg cuuccugaga aauacuaaga aguuuaucuc acuuggaaaa 780
caugccaagu ugucgcugca agaacucacg uggaagaugu caguccgcga uugcgccugg 840
cugcgccgcu cgccgggcgu cggguguguu ccagcugcag aacaccgccu gagagaagaa 900
auucuggcca aauuucugca uuggcugaug ucaguguacg uggucgagcu gcugcgcucc 960
uuuuucuacg ucacugagac uaccuuucaa aagaaccgcc uguucuucua ccgcaaaucu 1020
guguggagca agcugcaguc aaucggcauu cgccagcauc ugaagagggu gcagcugcgg 1080
gaacuuuccg aggcagaagu ccgccagcac cgggaggccc ggccggcgcu ucucacgucg 1140
cgucugagau ucaucccaaa gcccgacggg cugaggccua ucgucaacau ggauuacguc 1200
gugggcgcuc gcaccuuucg ccgugaaaag cgggccgaac gcuugaccuc acgggugaag 1260
gcccucuucu ccgugcugaa cuacgagaga gcaagacggc cuggccugcu gggagcuucg 1320
gugcugggac uggacgauau ccaccgggcu uggcggaccu uuguucuccg ggugagagcc 1380
caagacccuc cgccggaacu guacuucgug aagguggcga ucaccggagc cuaugauacu 1440
auuccgcaag aucgacucac cgaagucauc gccucgauca ucaaaccgca gaacacuuac 1500
ugcgucaggc gguacgccgu gguccagaag gccgcgcaug gccacgugag aaaggcguuc 1560
aagucgcacg uguccacucu caccgaccuc cagccuuaca ugaggcaauu cguugcgcau 1620
uugcaagaga cuucgccccu gagagaugcg guggucaucg agcagagcuc cagccugaac 1680
gaagcgagca gcggucuguu ugacguguuc cuccgcuuca ugugucauca cgcggugcga 1740
aucaggggaa aaucauacgu gcagugccag ggaaucccac aaggcagcau ucugucgacu 1800
cucuuguguu cccuuugcua cggcgauaug gaaaacaagc uguucgcugg gaucagacgg 1860
gacggguugc ugcucagacu gguggacgac uuccugcugg ugacuccgca ccucacucac 1920
gccaaaaccu uucuccgcac ucuggugagg ggagugccag aauacggcug uguggucaau 1980
cuccggaaaa cuguggugaa uuucccuguc gaggaugagg cacucggagg aaccgcauuu 2040
guccaaaugc cagcacaugg ccuguuccca uggugcgguc ugcugcugga cacccgaacu 2100
cuugaagugc aguccgacua cuccagcuau gcccggacga gcauccgcgc cagccucacu 2160
uucaaucgcg gcuuuaaggc cggacgaaac augcgcagaa agcuuuucgg aguccuccgg 2220
cuuaaaugcc auucgcucuu ucucgaucuc caagucaauu cgcugcagac cgugugcacg 2280
aacaucuaca agauccugcu gcuccaagcc uaccgguucc acgcuugcgu gcuucagcug 2340
ccguuucacc aacaggugug gaagaacccg accuucuuuc ugcgggucau uagcgauacu 2400
gccucccugu guuacucaau ccucaaggca aagaacgccg gaaugucgcu gggugcgaaa 2460
ggagccgcgg gaccucuucc uagcgaagcg gugcaguggc ucugccacca ggcuuuccuc 2520
cugaagcuga ccaggcacag agugaccuac gucccgcugc ugggcucgcu gcgcacugca 2580
cagacccagc ugucuagaaa acuccccggc accacccuga ccgcucugga agccgccgcc 2640
aacccagcau ugccgucaga uuucaagacc aucuuggac 2679
<210> 86
<211> 1404
<212> RNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 86
auggcuagca agcugaccau ugagagcacu cccuucaacg uggcugaggg gaaggaggug 60
cugcuccugg ugcacaaucu gccccagcac cuguucgggu acuccuggua caagggagaa 120
cgcguggacg ggaaccggca gaucauaggc uacgucaucg gaacccagca ggccacaccc 180
gguccagcgu acagcggccg ggagauuauc uacccgaacg ccucccugcu gauccaaaac 240
aucauccaga acgacaccgg uuucuacacu cugcacguga uuaagucaga ucuggucaac 300
gaagaggcca ccggccaauu caggguguac cccgaacucc cuaagccguu caucaccucg 360
aacaacagca acccggucga ggaugaagau gcgguggccu ugacgugcga accugagauc 420
cagaacacca ccuacuugug gugggugaac aaucagagcc ugccagucuc cccacgacuc 480
cagcugucga acgacaacag gacccugacu uugcuguccg ugacucggaa cgacgugggc 540
ccuuaugaau gcgguaucca gaacaagcug uccguggacc acagcgaccc ugugauccug 600
aacguccuuu acgggccgga cgaccccacc auuuccccgu cguacacuua cuaccggccg 660
ggcgugaacc ugucccuguc gugccacgcu gccuccaauc cgccggccca guacuccugg 720
cucaucgacg gaaacaucca gcagcacacc caagaacugu ucaucuccaa cauuaccgag 780
aaaaacucgg gacuuuacac cugucaagcc aacaauuccg ccagcggcca cucccgcacc 840
acugucaaaa cuaucacugu guccgccgaa cucccgaagc ccagcaucag cuccaacaac 900
ucgaagcccg uggaggauaa ggacgcuguc gcguucaccu gugaaccaga ggcacagaau 960
accaccuacc uuuggugggu caacggacag ucccugccug ucucaccgag acugcagcug 1020
ucaaacggga auaggacucu gaccuuguuu aacgucaccc ggaacgacgc ccgggccuac 1080
gugugcggca uccagaacuc cgugagcgca aaccggucug acccagugac ccuggaugug 1140
cuguacggcc ccgacacucc gaucauuuca ccccccgauu cauccuaccu guccggcgcu 1200
aaccucaacc ucucaugcca cuccgcaucc aaccccagcc cgcaauauuc guggcgcauu 1260
aacggaauuc cucagcaaca uacccagguc cuguucauug cgaagaucac cccuaacaac 1320
aacggaaccu acgccugcuu ugugucaaac cuggccacug guagaaacaa cuccaucgug 1380
aaguccauua ccgugucggc gucc 1404
<210> 87
<211> 6009
<212> RNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 87
auggcuagca ccccuggaac ccagagcccc uucuuccuuc ugcugcugcu gaccgugcug 60
acugucguga caggcucugg ccacgccagc ucuacaccug gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgu gccaagcagc accgagaaga acgccguguc caugaccagc 180
uccgugcuga gcagccacuc uccuggcagc ggcagcagca caacacaggg ccaggaugug 240
acacuggccc cugccacaga accugccucu ggaucugccg ccaccugggg acaggacgug 300
acaagcgugc cagugaccag accugcccug ggcucuacaa cacccccugc ccacgaugug 360
accagcgccc cugauaacaa gccugccccu ggaagcacag ccccuccagc ucauggcgug 420
accucugccc cagauaccag accagcccca ggaucuacag ccccacccgc acacggcgug 480
acaagugccc cugacacaag acccgcucca ggcucuacug cuccuccugc ccauggcgug 540
acaagcgcuc ccgauacaag gccagcuccu ggcuccacag caccaccagc acauggcgug 600
acaucagcuc ccgacacuag accugcuccc ggaucaaccg cuccaccagc ucacggcgug 660
accagcgcac cugauaccag accugcucug ggaagcaccg ccccucccgu gcacaaugug 720
acaucugcuu ccggcagcgc cagcggcucu gccucuacac uggugcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcacccccu ucagcauccc uagccaccac 840
agcgacaccc cuaccacacu ggccagccac uccaccaaga ccgaugccuc uagcacccac 900
cacuccagcg ugcccccucu gaccagcagc aaccacagca caagccccca gcugucuacc 960
ggcgucucau ucuucuuucu guccuuccac aucagcaacc ugcaguucaa cagcagccug 1020
gaagauccca gcaccgacua cuaccaggaa cugcagcggg auaucagcga gauguuccug 1080
caaaucuaca agcagggcgg cuuccugggc cugagcaaca ucaaguucag acccggcagc 1140
gugguggugc agcugacccu ggcuuuccgg gaaggcacca ucaacgugca cgacguggaa 1200
acccaguuca accaguacaa gaccgaggcc gccagccggu acaaccugac caucuccgau 1260
guguccgugu ccgacgugcc cuucccauuc ucugcccagu cuggcgcagg cgugccagga 1320
uggggaauug cucugcuggu gcucgugugc gugcuggugg cccuggccau cguguaucug 1380
auugcccugg ccgugugcca gugccggcgg aagaauuacg gccagcugga caucuucccc 1440
gccagagaca ccuaccaccc caugagcgag uaccccacau accacaccca cggcagauac 1500
gugccaccca gcuccaccga cagauccccc uacgagaaag ugucugccgg caacggcggc 1560
agcucccuga gcuacacaaa uccugccgug gccgcugccu ccgccaaccu gggauccggc 1620
acaauccugu cugagggcgc caccaacuuc agccugcuga aacuggccgg cgacguggaa 1680
cugaacccug gcccuggagc ugccccggag ccggagagga cccccguugg ccagggaucg 1740
ugggcccauc cgggacgcac caggggacca uccgacaggg gauucugugu ggugucaccg 1800
gccaggccag cagaagaggc aaccagccuc gagggagcgu ugucuggaac cagacauucc 1860
cacccgucgg ugggccggca gcaccacgcg ggaccaccgu ccacuuccag accgccacgg 1920
ccaugggaca ccccuugccc gccuguguau gccgagacua aacacuuccu guacucaucc 1980
ggagacaagg aacagcuucg gccguccuuc cuccugucgu cgcucagacc gagccugacc 2040
ggagcacgca gauuggugga aacuaucuuc cuugggucac guccguggau gccagguacc 2100
ccacggcgcc ucccgcgccu cccacagaga uacuggcaga ugcggccucu guuccuggaa 2160
uugcugggaa accacgcuca gugcccguac ggaguccugc ucaagacuca cugcccucug 2220
agggcggcgg ucacuccggc ggccggagug ugcgcacggg agaagcccca gggaagcgug 2280
gcagcuccgg aagaggagga caccgauccg cgccgccucg ugcaacuucu gcgccagcac 2340
uccucgcccu ggcaagucua cggguucguc cgcgccugcc ugcgccgccu ggugccgccu 2400
gggcucuggg guucccggca uaacgagcgc cgcuuccuga gaaauacuaa gaaguuuauc 2460
ucacuuggaa aacaugccaa guugucgcug caagaacuca cguggaagau gucaguccgc 2520
gauugcgccu ggcugcgccg cucgccgggc gucgggugug uuccagcugc agaacaccgc 2580
cugagagaag aaauucuggc caaauuucug cauuggcuga ugucagugua cguggucgag 2640
cugcugcgcu ccuuuuucua cgucacugag acuaccuuuc aaaagaaccg ccuguucuuc 2700
uaccgcaaau cuguguggag caagcugcag ucaaucggca uucgccagca ucugaagagg 2760
gugcagcugc gggaacuuuc cgaggcagaa guccgccagc accgggaggc ccggccggcg 2820
cuucucacgu cgcgucugag auucauccca aagcccgacg ggcugaggcc uaucgucaac 2880
auggauuacg ucgugggcgc ucgcaccuuu cgccgugaaa agcgggccga acgcuugacc 2940
ucacggguga aggcccucuu cuccgugcug aacuacgaga gagcaagacg gccuggccug 3000
cugggagcuu cggugcuggg acuggacgau auccaccggg cuuggcggac cuuuguucuc 3060
cgggugagag cccaagaccc uccgccggaa cuguacuucg ugaagguggc gaucaccgga 3120
gccuaugaua cuauuccgca agaucgacuc accgaaguca ucgccucgau caucaaaccg 3180
cagaacacuu acugcgucag gcgguacgcc gugguccaga aggccgcgca uggccacgug 3240
agaaaggcgu ucaagucgca cguguccacu cucaccgacc uccagccuua caugaggcaa 3300
uucguugcgc auuugcaaga gacuucgccc cugagagaug cgguggucau cgagcagagc 3360
uccagccuga acgaagcgag cagcggucug uuugacgugu uccuccgcuu caugugucau 3420
cacgcggugc gaaucagggg aaaaucauac gugcagugcc agggaauccc acaaggcagc 3480
auucugucga cucucuugug uucccuuugc uacggcgaua uggaaaacaa gcuguucgcu 3540
gggaucagac gggacggguu gcugcucaga cugguggacg acuuccugcu ggugacuccg 3600
caccucacuc acgccaaaac cuuucuccgc acucugguga ggggagugcc agaauacggc 3660
ugugugguca aucuccggaa aacuguggug aauuucccug ucgaggauga ggcacucgga 3720
ggaaccgcau uuguccaaau gccagcacau ggccuguucc cauggugcgg ucugcugcug 3780
gacacccgaa cucuugaagu gcaguccgac uacuccagcu augcccggac gagcauccgc 3840
gccagccuca cuuucaaucg cggcuuuaag gccggacgaa acaugcgcag aaagcuuuuc 3900
ggaguccucc ggcuuaaaug ccauucgcuc uuucucgauc uccaagucaa uucgcugcag 3960
accgugugca cgaacaucua caagauccug cugcuccaag ccuaccgguu ccacgcuugc 4020
gugcuucagc ugccguuuca ccaacaggug uggaagaacc cgaccuucuu ucugcggguc 4080
auuagcgaua cugccucccu guguuacuca auccucaagg caaagaacgc cggaaugucg 4140
cugggugcga aaggagccgc gggaccucuu ccuagcgaag cggugcagug gcucugccac 4200
caggcuuucc uccugaagcu gaccaggcac agagugaccu acgucccgcu gcugggcucg 4260
cugcgcacug cacagaccca gcugucuaga aaacuccccg gcaccacccu gaccgcucug 4320
gaagccgccg ccaacccagc auugccguca gauuucaaga ccaucuugga cggauccggc 4380
cagugcacca auuacgcccu gcugaagcug gccggcgacg uggaaucuaa cccuggcccu 4440
gaaucgccaa gcgcaccccc ucaucggugg ugcaucccuu ggcaacgccu ccuccugacc 4500
gccucacugc ugacuuucug gaacccgccg accaccgcaa agcugaccau ugagagcacu 4560
cccuucaacg uggcugaggg gaaggaggug cugcuccugg ugcacaaucu gccccagcac 4620
cuguucgggu acuccuggua caagggagaa cgcguggacg ggaaccggca gaucauaggc 4680
uacgucaucg gaacccagca ggccacaccc gguccagcgu acagcggccg ggagauuauc 4740
uacccgaacg ccucccugcu gauccaaaac aucauccaga acgacaccgg uuucuacacu 4800
cugcacguga uuaagucaga ucuggucaac gaagaggcca ccggccaauu caggguguac 4860
cccgaacucc cuaagccguu caucaccucg aacaacagca acccggucga ggaugaagau 4920
gcgguggccu ugacgugcga accugagauc cagaacacca ccuacuugug gugggugaac 4980
aaucagagcc ugccagucuc cccacgacuc cagcugucga acgacaacag gacccugacu 5040
uugcuguccg ugacucggaa cgacgugggc ccuuaugaau gcgguaucca gaacaagcug 5100
uccguggacc acagcgaccc ugugauccug aacguccuuu acgggccgga cgaccccacc 5160
auuuccccgu cguacacuua cuaccggccg ggcgugaacc ugucccuguc gugccacgcu 5220
gccuccaauc cgccggccca guacuccugg cucaucgacg gaaacaucca gcagcacacc 5280
caagaacugu ucaucuccaa cauuaccgag aaaaacucgg gacuuuacac cugucaagcc 5340
aacaauuccg ccagcggcca cucccgcacc acugucaaaa cuaucacugu guccgccgaa 5400
cucccgaagc ccagcaucag cuccaacaac ucgaagcccg uggaggauaa ggacgcuguc 5460
gcguucaccu gugaaccaga ggcacagaau accaccuacc uuuggugggu caacggacag 5520
ucccugccug ucucaccgag acugcagcug ucaaacggga auaggacucu gaccuuguuu 5580
aacgucaccc ggaacgacgc ccgggccuac gugugcggca uccagaacuc cgugagcgca 5640
aaccggucug acccagugac ccuggaugug cuguacggcc ccgacacucc gaucauuuca 5700
ccccccgauu cauccuaccu guccggcgcu aaccucaacc ucucaugcca cuccgcaucc 5760
aaccccagcc cgcaauauuc guggcgcauu aacggaauuc cucagcaaca uacccagguc 5820
cuguucauug cgaagaucac cccuaacaac aacggaaccu acgccugcuu ugugucaaac 5880
cuggccacug guagaaacaa cuccaucgug aaguccauua ccgugucggc guccggaacu 5940
uccccgggcc ugagcgccgg cgccaccgug ggaauuauga ucggcgugcu cgugggagug 6000
gcccugauc 6009
<210> 88
<211> 6003
<212> RNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 88
auggcuagcg aaucgccaag cgcacccccu caucgguggu gcaucccuug gcaacgccuc 60
cuccugaccg ccucacugcu gacuuucugg aacccgccga ccaccgcaaa gcugaccauu 120
gagagcacuc ccuucaacgu ggcugagggg aaggaggugc ugcuccuggu gcacaaucug 180
ccccagcacc uguucgggua cuccugguac aagggagaac gcguggacgg gaaccggcag 240
aucauaggcu acgucaucgg aacccagcag gccacacccg guccagcgua cagcggccgg 300
gagauuaucu acccgaacgc cucccugcug auccaaaaca ucauccagaa cgacaccggu 360
uucuacacuc ugcacgugau uaagucagau cuggucaacg aagaggccac cggccaauuc 420
aggguguacc ccgaacuccc uaagccguuc aucaccucga acaacagcaa cccggucgag 480
gaugaagaug cgguggccuu gacgugcgaa ccugagaucc agaacaccac cuacuugugg 540
ugggugaaca aucagagccu gccagucucc ccacgacucc agcugucgaa cgacaacagg 600
acccugacuu ugcuguccgu gacucggaac gacgugggcc cuuaugaaug cgguauccag 660
aacaagcugu ccguggacca cagcgacccu gugauccuga acguccuuua cgggccggac 720
gaccccacca uuuccccguc guacacuuac uaccggccgg gcgugaaccu gucccugucg 780
ugccacgcug ccuccaaucc gccggcccag uacuccuggc ucaucgacgg aaacauccag 840
cagcacaccc aagaacuguu caucuccaac auuaccgaga aaaacucggg acuuuacacc 900
ugucaagcca acaauuccgc cagcggccac ucccgcacca cugucaaaac uaucacugug 960
uccgccgaac ucccgaagcc cagcaucagc uccaacaacu cgaagcccgu ggaggauaag 1020
gacgcugucg cguucaccug ugaaccagag gcacagaaua ccaccuaccu uugguggguc 1080
aacggacagu cccugccugu cucaccgaga cugcagcugu caaacgggaa uaggacucug 1140
accuuguuua acgucacccg gaacgacgcc cgggccuacg ugugcggcau ccagaacucc 1200
gugagcgcaa accggucuga cccagugacc cuggaugugc uguacggccc cgacacuccg 1260
aucauuucac cccccgauuc auccuaccug uccggcgcua accucaaccu cucaugccac 1320
uccgcaucca accccagccc gcaauauucg uggcgcauua acggaauucc ucagcaacau 1380
acccaggucc uguucauugc gaagaucacc ccuaacaaca acggaaccua cgccugcuuu 1440
gugucaaacc uggccacugg uagaaacaac uccaucguga aguccauuac cgugucggcg 1500
uccggaacuu ccccgggccu gagcgccggc gccaccgugg gaauuaugau cggcgugcuc 1560
gugggagugg cccugaucgg auccggcgag ggcagaggca gccugcugac auguggcgac 1620
guggaagaga acccuggccc caccccugga acccagagcc ccuucuuccu ucugcugcug 1680
cugaccgugc ugacugucgu gacaggcucu ggccacgcca gcucuacacc uggcggcgag 1740
aaagagacaa gcgccaccca gagaagcagc gugccaagca gcaccgagaa gaacgccgug 1800
uccaugacca gcuccgugcu gagcagccac ucuccuggca gcggcagcag cacaacacag 1860
ggccaggaug ugacacuggc cccugccaca gaaccugccu cuggaucugc cgccaccugg 1920
ggacaggacg ugacaagcgu gccagugacc agaccugccc ugggcucuac aacacccccu 1980
gcccacgaug ugaccagcgc cccugauaac aagccugccc cuggaagcac agccccucca 2040
gcucauggcg ugaccucugc cccagauacc agaccagccc caggaucuac agccccaccc 2100
gcacacggcg ugacaagugc cccugacaca agacccgcuc caggcucuac ugcuccuccu 2160
gcccauggcg ugacaagcgc ucccgauaca aggccagcuc cuggcuccac agcaccacca 2220
gcacauggcg ugacaucagc ucccgacacu agaccugcuc ccggaucaac cgcuccacca 2280
gcucacggcg ugaccagcgc accugauacc agaccugcuc ugggaagcac cgccccuccc 2340
gugcacaaug ugacaucugc uuccggcagc gccagcggcu cugccucuac acuggugcac 2400
aacggcacca gcgccagagc cacaacaacc ccagccagca agagcacccc cuucagcauc 2460
ccuagccacc acagcgacac cccuaccaca cuggccagcc acuccaccaa gaccgaugcc 2520
ucuagcaccc accacuccag cgugcccccu cugaccagca gcaaccacag cacaagcccc 2580
cagcugucua ccggcgucuc auucuucuuu cuguccuucc acaucagcaa ccugcaguuc 2640
aacagcagcc uggaagaucc cagcaccgac uacuaccagg aacugcagcg ggauaucagc 2700
gagauguucc ugcaaaucua caagcagggc ggcuuccugg gccugagcaa caucaaguuc 2760
agacccggca gcgugguggu gcagcugacc cuggcuuucc gggaaggcac caucaacgug 2820
cacgacgugg aaacccaguu caaccaguac aagaccgagg ccgccagccg guacaaccug 2880
accaucuccg auguguccgu guccgacgug cccuucccau ucucugccca gucuggcgca 2940
ggcgugccag gauggggaau ugcucugcug gugcucgugu gcgugcuggu ggcccuggcc 3000
aucguguauc ugauugcccu ggccgugugc cagugccggc ggaagaauua cggccagcug 3060
gacaucuucc ccgccagaga caccuaccac cccaugagcg aguaccccac auaccacacc 3120
cacggcagau acgugccacc cagcuccacc gacagauccc ccuacgagaa agugucugcc 3180
ggcaacggcg gcagcucccu gagcuacaca aauccugccg uggccgcugc cuccgccaac 3240
cugggauccg gcacaauccu gucugagggc gccaccaacu ucagccugcu gaaacuggcc 3300
ggcgacgugg aacugaaccc uggcccugga gcugccccgg agccggagag gacccccguu 3360
ggccagggau cgugggccca uccgggacgc accaggggac cauccgacag gggauucugu 3420
guggugucac cggccaggcc agcagaagag gcaaccagcc ucgagggagc guugucugga 3480
accagacauu cccacccguc ggugggccgg cagcaccacg cgggaccacc guccacuucc 3540
agaccgccac ggccauggga caccccuugc ccgccugugu augccgagac uaaacacuuc 3600
cuguacucau ccggagacaa ggaacagcuu cggccguccu uccuccuguc gucgcucaga 3660
ccgagccuga ccggagcacg cagauuggug gaaacuaucu uccuuggguc acguccgugg 3720
augccaggua ccccacggcg ccucccgcgc cucccacaga gauacuggca gaugcggccu 3780
cuguuccugg aauugcuggg aaaccacgcu cagugcccgu acggaguccu gcucaagacu 3840
cacugcccuc ugagggcggc ggucacuccg gcggccggag ugugcgcacg ggagaagccc 3900
cagggaagcg uggcagcucc ggaagaggag gacaccgauc cgcgccgccu cgugcaacuu 3960
cugcgccagc acuccucgcc cuggcaaguc uacggguucg uccgcgccug ccugcgccgc 4020
cuggugccgc cugggcucug ggguucccgg cauaacgagc gccgcuuccu gagaaauacu 4080
aagaaguuua ucucacuugg aaaacaugcc aaguugucgc ugcaagaacu cacguggaag 4140
augucagucc gcgauugcgc cuggcugcgc cgcucgccgg gcgucgggug uguuccagcu 4200
gcagaacacc gccugagaga agaaauucug gccaaauuuc ugcauuggcu gaugucagug 4260
uacguggucg agcugcugcg cuccuuuuuc uacgucacug agacuaccuu ucaaaagaac 4320
cgccuguucu ucuaccgcaa aucugugugg agcaagcugc agucaaucgg cauucgccag 4380
caucugaaga gggugcagcu gcgggaacuu uccgaggcag aaguccgcca gcaccgggag 4440
gcccggccgg cgcuucucac gucgcgucug agauucaucc caaagcccga cgggcugagg 4500
ccuaucguca acauggauua cgucgugggc gcucgcaccu uucgccguga aaagcgggcc 4560
gaacgcuuga ccucacgggu gaaggcccuc uucuccgugc ugaacuacga gagagcaaga 4620
cggccuggcc ugcugggagc uucggugcug ggacuggacg auauccaccg ggcuuggcgg 4680
accuuuguuc uccgggugag agcccaagac ccuccgccgg aacuguacuu cgugaaggug 4740
gcgaucaccg gagccuauga uacuauuccg caagaucgac ucaccgaagu caucgccucg 4800
aucaucaaac cgcagaacac uuacugcguc aggcgguacg ccguggucca gaaggccgcg 4860
cauggccacg ugagaaaggc guucaagucg cacgugucca cucucaccga ccuccagccu 4920
uacaugaggc aauucguugc gcauuugcaa gagacuucgc cccugagaga ugcggugguc 4980
aucgagcaga gcuccagccu gaacgaagcg agcagcgguc uguuugacgu guuccuccgc 5040
uucauguguc aucacgcggu gcgaaucagg ggaaaaucau acgugcagug ccagggaauc 5100
ccacaaggca gcauucuguc gacucucuug uguucccuuu gcuacggcga uauggaaaac 5160
aagcuguucg cugggaucag acgggacggg uugcugcuca gacuggugga cgacuuccug 5220
cuggugacuc cgcaccucac ucacgccaaa accuuucucc gcacucuggu gaggggagug 5280
ccagaauacg gcuguguggu caaucuccgg aaaacugugg ugaauuuccc ugucgaggau 5340
gaggcacucg gaggaaccgc auuuguccaa augccagcac auggccuguu cccauggugc 5400
ggucugcugc uggacacccg aacucuugaa gugcaguccg acuacuccag cuaugcccgg 5460
acgagcaucc gcgccagccu cacuuucaau cgcggcuuua aggccggacg aaacaugcgc 5520
agaaagcuuu ucggaguccu ccggcuuaaa ugccauucgc ucuuucucga ucuccaaguc 5580
aauucgcugc agaccgugug cacgaacauc uacaagaucc ugcugcucca agccuaccgg 5640
uuccacgcuu gcgugcuuca gcugccguuu caccaacagg uguggaagaa cccgaccuuc 5700
uuucugcggg ucauuagcga uacugccucc cuguguuacu caauccucaa ggcaaagaac 5760
gccggaaugu cgcugggugc gaaaggagcc gcgggaccuc uuccuagcga agcggugcag 5820
uggcucugcc accaggcuuu ccuccugaag cugaccaggc acagagugac cuacgucccg 5880
cugcugggcu cgcugcgcac ugcacagacc cagcugucua gaaaacuccc cggcaccacc 5940
cugaccgcuc uggaagccgc cgccaaccca gcauugccgu cagauuucaa gaccaucuug 6000
gac 6003
<210> 89
<211> 6024
<212> RNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 89
auggcuagcg gagcugcccc ggagccggag aggacccccg uuggccaggg aucgugggcc 60
cauccgggac gcaccagggg accauccgac aggggauucu gugugguguc accggccagg 120
ccagcagaag aggcaaccag ccucgaggga gcguugucug gaaccagaca uucccacccg 180
ucggugggcc ggcagcacca cgcgggacca ccguccacuu ccagaccgcc acggccaugg 240
gacaccccuu gcccgccugu guaugccgag acuaaacacu uccuguacuc auccggagac 300
aaggaacagc uucggccguc cuuccuccug ucgucgcuca gaccgagccu gaccggagca 360
cgcagauugg uggaaacuau cuuccuuggg ucacguccgu ggaugccagg uaccccacgg 420
cgccucccgc gccucccaca gagauacugg cagaugcggc cucuguuccu ggaauugcug 480
ggaaaccacg cucagugccc guacggaguc cugcucaaga cucacugccc ucugagggcg 540
gcggucacuc cggcggccgg agugugcgca cgggagaagc cccagggaag cguggcagcu 600
ccggaagagg aggacaccga uccgcgccgc cucgugcaac uucugcgcca gcacuccucg 660
cccuggcaag ucuacggguu cguccgcgcc ugccugcgcc gccuggugcc gccugggcuc 720
ugggguuccc ggcauaacga gcgccgcuuc cugagaaaua cuaagaaguu uaucucacuu 780
ggaaaacaug ccaaguuguc gcugcaagaa cucacgugga agaugucagu ccgcgauugc 840
gccuggcugc gccgcucgcc gggcgucggg uguguuccag cugcagaaca ccgccugaga 900
gaagaaauuc uggccaaauu ucugcauugg cugaugucag uguacguggu cgagcugcug 960
cgcuccuuuu ucuacgucac ugagacuacc uuucaaaaga accgccuguu cuucuaccgc 1020
aaaucugugu ggagcaagcu gcagucaauc ggcauucgcc agcaucugaa gagggugcag 1080
cugcgggaac uuuccgaggc agaaguccgc cagcaccggg aggcccggcc ggcgcuucuc 1140
acgucgcguc ugagauucau cccaaagccc gacgggcuga ggccuaucgu caacauggau 1200
uacgucgugg gcgcucgcac cuuucgccgu gaaaagcggg ccgaacgcuu gaccucacgg 1260
gugaaggccc ucuucuccgu gcugaacuac gagagagcaa gacggccugg ccugcuggga 1320
gcuucggugc ugggacugga cgauauccac cgggcuuggc ggaccuuugu ucuccgggug 1380
agagcccaag acccuccgcc ggaacuguac uucgugaagg uggcgaucac cggagccuau 1440
gauacuauuc cgcaagaucg acucaccgaa gucaucgccu cgaucaucaa accgcagaac 1500
acuuacugcg ucaggcggua cgccgugguc cagaaggccg cgcauggcca cgugagaaag 1560
gcguucaagu cgcacguguc cacucucacc gaccuccagc cuuacaugag gcaauucguu 1620
gcgcauuugc aagagacuuc gccccugaga gaugcggugg ucaucgagca gagcuccagc 1680
cugaacgaag cgagcagcgg ucuguuugac guguuccucc gcuucaugug ucaucacgcg 1740
gugcgaauca ggggaaaauc auacgugcag ugccagggaa ucccacaagg cagcauucug 1800
ucgacucucu uguguucccu uugcuacggc gauauggaaa acaagcuguu cgcugggauc 1860
agacgggacg gguugcugcu cagacuggug gacgacuucc ugcuggugac uccgcaccuc 1920
acucacgcca aaaccuuucu ccgcacucug gugaggggag ugccagaaua cggcugugug 1980
gucaaucucc ggaaaacugu ggugaauuuc ccugucgagg augaggcacu cggaggaacc 2040
gcauuugucc aaaugccagc acauggccug uucccauggu gcggucugcu gcuggacacc 2100
cgaacucuug aagugcaguc cgacuacucc agcuaugccc ggacgagcau ccgcgccagc 2160
cucacuuuca aucgcggcuu uaaggccgga cgaaacaugc gcagaaagcu uuucggaguc 2220
cuccggcuua aaugccauuc gcucuuucuc gaucuccaag ucaauucgcu gcagaccgug 2280
ugcacgaaca ucuacaagau ccugcugcuc caagccuacc gguuccacgc uugcgugcuu 2340
cagcugccgu uucaccaaca gguguggaag aacccgaccu ucuuucugcg ggucauuagc 2400
gauacugccu cccuguguua cucaauccuc aaggcaaaga acgccggaau gucgcugggu 2460
gcgaaaggag ccgcgggacc ucuuccuagc gaagcggugc aguggcucug ccaccaggcu 2520
uuccuccuga agcugaccag gcacagagug accuacgucc cgcugcuggg cucgcugcgc 2580
acugcacaga cccagcuguc uagaaaacuc cccggcacca cccugaccgc ucuggaagcc 2640
gccgccaacc cagcauugcc gucagauuuc aagaccaucu uggacggauc cggcacaauc 2700
cugucugagg gcgccaccaa cuucagccug cugaaacugg ccggcgacgu ggaacugaac 2760
ccuggcccua ccccuggaac ccagagcccc uucuuccuuc ugcugcugcu gaccgugcug 2820
acugucguga caggcucugg ccacgccagc ucuacaccug gcggcgagaa agagacaagc 2880
gccacccaga gaagcagcgu gccaagcagc accgagaaga acgccguguc caugaccagc 2940
uccgugcuga gcagccacuc uccuggcagc ggcagcagca caacacaggg ccaggaugug 3000
acacuggccc cugccacaga accugccucu ggaucugccg ccaccugggg acaggacgug 3060
acaagcgugc cagugaccag accugcccug ggcucuacaa cacccccugc ccacgaugug 3120
accagcgccc cugauaacaa gccugccccu ggaagcacag ccccuccagc ucauggcgug 3180
accucugccc cagauaccag accagcccca ggaucuacag ccccacccgc acacggcgug 3240
acaagugccc cugacacaag acccgcucca ggcucuacug cuccuccugc ccauggcgug 3300
acaagcgcuc ccgauacaag gccagcuccu ggcuccacag caccaccagc acauggcgug 3360
acaucagcuc ccgacacuag accugcuccc ggaucaaccg cuccaccagc ucacggcgug 3420
accagcgcac cugauaccag accugcucug ggaagcaccg ccccucccgu gcacaaugug 3480
acaucugcuu ccggcagcgc cagcggcucu gccucuacac uggugcacaa cggcaccagc 3540
gccagagcca caacaacccc agccagcaag agcacccccu ucagcauccc uagccaccac 3600
agcgacaccc cuaccacacu ggccagccac uccaccaaga ccgaugccuc uagcacccac 3660
cacuccagcg ugcccccucu gaccagcagc aaccacagca caagccccca gcugucuacc 3720
ggcgucucau ucuucuuucu guccuuccac aucagcaacc ugcaguucaa cagcagccug 3780
gaagauccca gcaccgacua cuaccaggaa cugcagcggg auaucagcga gauguuccug 3840
caaaucuaca agcagggcgg cuuccugggc cugagcaaca ucaaguucag acccggcagc 3900
gugguggugc agcugacccu ggcuuuccgg gaaggcacca ucaacgugca cgacguggaa 3960
acccaguuca accaguacaa gaccgaggcc gccagccggu acaaccugac caucuccgau 4020
guguccgugu ccgacgugcc cuucccauuc ucugcccagu cuggcgcagg cgugccagga 4080
uggggaauug cucugcuggu gcucgugugc gugcuggugg cccuggccau cguguaucug 4140
auugcccugg ccgugugcca gugccggcgg aagaauuacg gccagcugga caucuucccc 4200
gccagagaca ccuaccaccc caugagcgag uaccccacau accacaccca cggcagauac 4260
gugccaccca gcuccaccga cagauccccc uacgagaaag ugucugccgg caacggcggc 4320
agcucccuga gcuacacaaa uccugccgug gccgcugccu ccgccaaccu gggauccggc 4380
agaaucuuca acgcccacua cgccggcuac uucgccgacc ugcugaucca cgacaucgag 4440
acaaacccug gccccgaauc gccaagcgca cccccucauc gguggugcau cccuuggcaa 4500
cgccuccucc ugaccgccuc acugcugacu uucuggaacc cgccgaccac cgcaaagcug 4560
accauugaga gcacucccuu caacguggcu gaggggaagg aggugcugcu ccuggugcac 4620
aaucugcccc agcaccuguu cggguacucc ugguacaagg gagaacgcgu ggacgggaac 4680
cggcagauca uaggcuacgu caucggaacc cagcaggcca cacccggucc agcguacagc 4740
ggccgggaga uuaucuaccc gaacgccucc cugcugaucc aaaacaucau ccagaacgac 4800
accgguuucu acacucugca cgugauuaag ucagaucugg ucaacgaaga ggccaccggc 4860
caauucaggg uguaccccga acucccuaag ccguucauca ccucgaacaa cagcaacccg 4920
gucgaggaug aagaugcggu ggccuugacg ugcgaaccug agauccagaa caccaccuac 4980
uugugguggg ugaacaauca gagccugcca gucuccccac gacuccagcu gucgaacgac 5040
aacaggaccc ugacuuugcu guccgugacu cggaacgacg ugggcccuua ugaaugcggu 5100
auccagaaca agcuguccgu ggaccacagc gacccuguga uccugaacgu ccuuuacggg 5160
ccggacgacc ccaccauuuc cccgucguac acuuacuacc ggccgggcgu gaaccugucc 5220
cugucgugcc acgcugccuc caauccgccg gcccaguacu ccuggcucau cgacggaaac 5280
auccagcagc acacccaaga acuguucauc uccaacauua ccgagaaaaa cucgggacuu 5340
uacaccuguc aagccaacaa uuccgccagc ggccacuccc gcaccacugu caaaacuauc 5400
acuguguccg ccgaacuccc gaagcccagc aucagcucca acaacucgaa gcccguggag 5460
gauaaggacg cugucgcguu caccugugaa ccagaggcac agaauaccac cuaccuuugg 5520
ugggucaacg gacagucccu gccugucuca ccgagacugc agcugucaaa cgggaauagg 5580
acucugaccu uguuuaacgu cacccggaac gacgcccggg ccuacgugug cggcauccag 5640
aacuccguga gcgcaaaccg gucugaccca gugacccugg augugcugua cggccccgac 5700
acuccgauca uuucaccccc cgauucaucc uaccuguccg gcgcuaaccu caaccucuca 5760
ugccacuccg cauccaaccc cagcccgcaa uauucguggc gcauuaacgg aauuccucag 5820
caacauaccc agguccuguu cauugcgaag aucaccccua acaacaacgg aaccuacgcc 5880
ugcuuugugu caaaccuggc cacugguaga aacaacucca ucgugaaguc cauuaccgug 5940
ucggcguccg gaacuucccc gggccugagc gccggcgcca ccgugggaau uaugaucggc 6000
gugcucgugg gaguggcccu gauc 6024
<210> 90
<211> 5988
<212> RNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 90
auggcuagcg gagcugcccc ggagccggag aggacccccg uuggccaggg aucgugggcc 60
cauccgggac gcaccagggg accauccgac aggggauucu gugugguguc accggccagg 120
ccagcagaag aggcaaccag ccucgaggga gcguugucug gaaccagaca uucccacccg 180
ucggugggcc ggcagcacca cgcgggacca ccguccacuu ccagaccgcc acggccaugg 240
gacaccccuu gcccgccugu guaugccgag acuaaacacu uccuguacuc auccggagac 300
aaggaacagc uucggccguc cuuccuccug ucgucgcuca gaccgagccu gaccggagca 360
cgcagauugg uggaaacuau cuuccuuggg ucacguccgu ggaugccagg uaccccacgg 420
cgccucccgc gccucccaca gagauacugg cagaugcggc cucuguuccu ggaauugcug 480
ggaaaccacg cucagugccc guacggaguc cugcucaaga cucacugccc ucugagggcg 540
gcggucacuc cggcggccgg agugugcgca cgggagaagc cccagggaag cguggcagcu 600
ccggaagagg aggacaccga uccgcgccgc cucgugcaac uucugcgcca gcacuccucg 660
cccuggcaag ucuacggguu cguccgcgcc ugccugcgcc gccuggugcc gccugggcuc 720
ugggguuccc ggcauaacga gcgccgcuuc cugagaaaua cuaagaaguu uaucucacuu 780
ggaaaacaug ccaaguuguc gcugcaagaa cucacgugga agaugucagu ccgcgauugc 840
gccuggcugc gccgcucgcc gggcgucggg uguguuccag cugcagaaca ccgccugaga 900
gaagaaauuc uggccaaauu ucugcauugg cugaugucag uguacguggu cgagcugcug 960
cgcuccuuuu ucuacgucac ugagacuacc uuucaaaaga accgccuguu cuucuaccgc 1020
aaaucugugu ggagcaagcu gcagucaauc ggcauucgcc agcaucugaa gagggugcag 1080
cugcgggaac uuuccgaggc agaaguccgc cagcaccggg aggcccggcc ggcgcuucuc 1140
acgucgcguc ugagauucau cccaaagccc gacgggcuga ggccuaucgu caacauggau 1200
uacgucgugg gcgcucgcac cuuucgccgu gaaaagcggg ccgaacgcuu gaccucacgg 1260
gugaaggccc ucuucuccgu gcugaacuac gagagagcaa gacggccugg ccugcuggga 1320
gcuucggugc ugggacugga cgauauccac cgggcuuggc ggaccuuugu ucuccgggug 1380
agagcccaag acccuccgcc ggaacuguac uucgugaagg uggcgaucac cggagccuau 1440
gauacuauuc cgcaagaucg acucaccgaa gucaucgccu cgaucaucaa accgcagaac 1500
acuuacugcg ucaggcggua cgccgugguc cagaaggccg cgcauggcca cgugagaaag 1560
gcguucaagu cgcacguguc cacucucacc gaccuccagc cuuacaugag gcaauucguu 1620
gcgcauuugc aagagacuuc gccccugaga gaugcggugg ucaucgagca gagcuccagc 1680
cugaacgaag cgagcagcgg ucuguuugac guguuccucc gcuucaugug ucaucacgcg 1740
gugcgaauca ggggaaaauc auacgugcag ugccagggaa ucccacaagg cagcauucug 1800
ucgacucucu uguguucccu uugcuacggc gauauggaaa acaagcuguu cgcugggauc 1860
agacgggacg gguugcugcu cagacuggug gacgacuucc ugcuggugac uccgcaccuc 1920
acucacgcca aaaccuuucu ccgcacucug gugaggggag ugccagaaua cggcugugug 1980
gucaaucucc ggaaaacugu ggugaauuuc ccugucgagg augaggcacu cggaggaacc 2040
gcauuugucc aaaugccagc acauggccug uucccauggu gcggucugcu gcuggacacc 2100
cgaacucuug aagugcaguc cgacuacucc agcuaugccc ggacgagcau ccgcgccagc 2160
cucacuuuca aucgcggcuu uaaggccgga cgaaacaugc gcagaaagcu uuucggaguc 2220
cuccggcuua aaugccauuc gcucuuucuc gaucuccaag ucaauucgcu gcagaccgug 2280
ugcacgaaca ucuacaagau ccugcugcuc caagccuacc gguuccacgc uugcgugcuu 2340
cagcugccgu uucaccaaca gguguggaag aacccgaccu ucuuucugcg ggucauuagc 2400
gauacugccu cccuguguua cucaauccuc aaggcaaaga acgccggaau gucgcugggu 2460
gcgaaaggag ccgcgggacc ucuuccuagc gaagcggugc aguggcucug ccaccaggcu 2520
uuccuccuga agcugaccag gcacagagug accuacgucc cgcugcuggg cucgcugcgc 2580
acugcacaga cccagcuguc uagaaaacuc cccggcacca cccugaccgc ucuggaagcc 2640
gccgccaacc cagcauugcc gucagauuuc aagaccaucu uggacggauc cggccagugc 2700
accaauuacg cccugcugaa gcuggccggc gacguggaau cuaacccugg cccugaaucg 2760
ccaagcgcac ccccucaucg guggugcauc ccuuggcaac gccuccuccu gaccgccuca 2820
cugcugacuu ucuggaaccc gccgaccacc gcaaagcuga ccauugagag cacucccuuc 2880
aacguggcug aggggaagga ggugcugcuc cuggugcaca aucugcccca gcaccuguuc 2940
ggguacuccu gguacaaggg agaacgcgug gacgggaacc ggcagaucau aggcuacguc 3000
aucggaaccc agcaggccac acccggucca gcguacagcg gccgggagau uaucuacccg 3060
aacgccuccc ugcugaucca aaacaucauc cagaacgaca ccgguuucua cacucugcac 3120
gugauuaagu cagaucuggu caacgaagag gccaccggcc aauucagggu guaccccgaa 3180
cucccuaagc cguucaucac cucgaacaac agcaacccgg ucgaggauga agaugcggug 3240
gccuugacgu gcgaaccuga gauccagaac accaccuacu uguggugggu gaacaaucag 3300
agccugccag ucuccccacg acuccagcug ucgaacgaca acaggacccu gacuuugcug 3360
uccgugacuc ggaacgacgu gggcccuuau gaaugcggua uccagaacaa gcuguccgug 3420
gaccacagcg acccugugau ccugaacguc cuuuacgggc cggacgaccc caccauuucc 3480
ccgucguaca cuuacuaccg gccgggcgug aaccuguccc ugucgugcca cgcugccucc 3540
aauccgccgg cccaguacuc cuggcucauc gacggaaaca uccagcagca cacccaagaa 3600
cuguucaucu ccaacauuac cgagaaaaac ucgggacuuu acaccuguca agccaacaau 3660
uccgccagcg gccacucccg caccacuguc aaaacuauca cuguguccgc cgaacucccg 3720
aagcccagca ucagcuccaa caacucgaag cccguggagg auaaggacgc ugucgcguuc 3780
accugugaac cagaggcaca gaauaccacc uaccuuuggu gggucaacgg acagucccug 3840
ccugucucac cgagacugca gcugucaaac gggaauagga cucugaccuu guuuaacguc 3900
acccggaacg acgcccgggc cuacgugugc ggcauccaga acuccgugag cgcaaaccgg 3960
ucugacccag ugacccugga ugugcuguac ggccccgaca cuccgaucau uucacccccc 4020
gauucauccu accuguccgg cgcuaaccuc aaccucucau gccacuccgc auccaacccc 4080
agcccgcaau auucguggcg cauuaacgga auuccucagc aacauaccca gguccuguuc 4140
auugcgaaga ucaccccuaa caacaacgga accuacgccu gcuuuguguc aaaccuggcc 4200
acugguagaa acaacuccau cgugaagucc auuaccgugu cggcguccgg aacuuccccg 4260
ggccugagcg ccggcgccac cgugggaauu augaucggcg ugcucguggg aguggcccug 4320
aucggauccg gcgagggcag aggcagccug cugacaugug gcgacgugga agagaacccu 4380
ggccccaccc cuggaaccca gagccccuuc uuccuucugc ugcugcugac cgugcugacu 4440
gucgugacag gcucuggcca cgccagcucu acaccuggcg gcgagaaaga gacaagcgcc 4500
acccagagaa gcagcgugcc aagcagcacc gagaagaacg ccguguccau gaccagcucc 4560
gugcugagca gccacucucc uggcagcggc agcagcacaa cacagggcca ggaugugaca 4620
cuggccccug ccacagaacc ugccucugga ucugccgcca ccuggggaca ggacgugaca 4680
agcgugccag ugaccagacc ugcccugggc ucuacaacac ccccugccca cgaugugacc 4740
agcgccccug auaacaagcc ugccccugga agcacagccc cuccagcuca uggcgugacc 4800
ucugccccag auaccagacc agccccagga ucuacagccc cacccgcaca cggcgugaca 4860
agugccccug acacaagacc cgcuccaggc ucuacugcuc cuccugccca uggcgugaca 4920
agcgcucccg auacaaggcc agcuccuggc uccacagcac caccagcaca uggcgugaca 4980
ucagcucccg acacuagacc ugcucccgga ucaaccgcuc caccagcuca cggcgugacc 5040
agcgcaccug auaccagacc ugcucuggga agcaccgccc cucccgugca caaugugaca 5100
ucugcuuccg gcagcgccag cggcucugcc ucuacacugg ugcacaacgg caccagcgcc 5160
agagccacaa caaccccagc cagcaagagc acccccuuca gcaucccuag ccaccacagc 5220
gacaccccua ccacacuggc cagccacucc accaagaccg augccucuag cacccaccac 5280
uccagcgugc ccccucugac cagcagcaac cacagcacaa gcccccagcu gucuaccggc 5340
gucucauucu ucuuucuguc cuuccacauc agcaaccugc aguucaacag cagccuggaa 5400
gaucccagca ccgacuacua ccaggaacug cagcgggaua ucagcgagau guuccugcaa 5460
aucuacaagc agggcggcuu ccugggccug agcaacauca aguucagacc cggcagcgug 5520
guggugcagc ugacccuggc uuuccgggaa ggcaccauca acgugcacga cguggaaacc 5580
caguucaacc aguacaagac cgaggccgcc agccgguaca accugaccau cuccgaugug 5640
uccguguccg acgugcccuu cccauucucu gcccagucug gcgcaggcgu gccaggaugg 5700
ggaauugcuc ugcuggugcu cgugugcgug cugguggccc uggccaucgu guaucugauu 5760
gcccuggccg ugugccagug ccggcggaag aauuacggcc agcuggacau cuuccccgcc 5820
agagacaccu accaccccau gagcgaguac cccacauacc acacccacgg cagauacgug 5880
ccacccagcu ccaccgacag aucccccuac gagaaagugu cugccggcaa cggcggcagc 5940
ucccugagcu acacaaaucc ugccguggcc gcugccuccg ccaaccug 5988
<210> 91
<211> 5829
<212> RNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 91
auggcuagca ccccuggaac ccagagcccc uucuuccuuc ugcugcugcu gaccgugcug 60
acugucguga caggcucugg ccacgccagc ucuacaccug gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgu gccaagcagc accgagaaga acgccguguc caugaccagc 180
uccgugcuga gcagccacuc uccuggcagc ggcagcagca caacacaggg ccaggaugug 240
acacuggccc cugccacaga accugccucu ggaucugccg ccaccugggg acaggacgug 300
acaagcgugc cagugaccag accugcccug ggcucuacaa cacccccugc ccacgaugug 360
accagcgccc cugauaacaa gccugccccu ggaagcacag ccccuccagc ucauggcgug 420
accucugccc cagauaccag accagcccca ggaucuacag ccccacccgc acacggcgug 480
acaagugccc cugacacaag acccgcucca ggcucuacug cuccuccugc ccauggcgug 540
acaagcgcuc ccgauacaag gccagcuccu ggcuccacag caccaccagc acauggcgug 600
acaucagcuc ccgacacuag accugcuccc ggaucaaccg cuccaccagc ucacggcgug 660
accagcgcac cugauaccag accugcucug ggaagcaccg ccccucccgu gcacaaugug 720
acaucugcuu ccggcagcgc cagcggcucu gccucuacac uggugcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcacccccu ucagcauccc uagccaccac 840
agcgacaccc cuaccacacu ggccagccac uccaccaaga ccgaugccuc uagcacccac 900
cacuccagcg ugcccccucu gaccagcagc aaccacagca caagccccca gcugucuacc 960
ggcgucucau ucuucuuucu guccuuccac aucagcaacc ugcaguucaa cagcagccug 1020
gaagauccca gcaccgacua cuaccaggaa cugcagcggg auaucagcga gauguuccug 1080
caaaucuaca agcagggcgg cuuccugggc cugagcaaca ucaaguucag acccggcagc 1140
gugguggugc agcugacccu ggcuuuccgg gaaggcacca ucaacgugca cgacguggaa 1200
acccaguuca accaguacaa gaccgaggcc gccagccggu acaaccugac caucuccgau 1260
guguccgugu ccgacgugcc cuucccauuc ucugcccagu cuggcgcagg cgugccagga 1320
uggggaauug cucugcuggu gcucgugugc gugcuggugg cccuggccau cguguaucug 1380
auugcccugg ccgugugcca gugccggcgg aagaauuacg gccagcugga caucuucccc 1440
gccagagaca ccuaccaccc caugagcgag uaccccacau accacaccca cggcagauac 1500
gugccaccca gcuccaccga cagauccccc uacgagaaag ugucugccgg caacggcggc 1560
agcucccuga gcuacacaaa uccugccgug gccgcugccu ccgccaaccu gggauccggc 1620
agaaucuuca acgcccacua cgccggcuac uucgccgacc ugcugaucca cgacaucgag 1680
acaaacccug gccccaagcu gaccauugag agcacucccu ucaacguggc ugaggggaag 1740
gaggugcugc uccuggugca caaucugccc cagcaccugu ucggguacuc cugguacaag 1800
ggagaacgcg uggacgggaa ccggcagauc auaggcuacg ucaucggaac ccagcaggcc 1860
acacccgguc cagcguacag cggccgggag auuaucuacc cgaacgccuc ccugcugauc 1920
caaaacauca uccagaacga caccgguuuc uacacucugc acgugauuaa gucagaucug 1980
gucaacgaag aggccaccgg ccaauucagg guguaccccg aacucccuaa gccguucauc 2040
accucgaaca acagcaaccc ggucgaggau gaagaugcgg uggccuugac gugcgaaccu 2100
gagauccaga acaccaccua cuuguggugg gugaacaauc agagccugcc agucucccca 2160
cgacuccagc ugucgaacga caacaggacc cugacuuugc uguccgugac ucggaacgac 2220
gugggcccuu augaaugcgg uauccagaac aagcuguccg uggaccacag cgacccugug 2280
auccugaacg uccuuuacgg gccggacgac cccaccauuu ccccgucgua cacuuacuac 2340
cggccgggcg ugaaccuguc ccugucgugc cacgcugccu ccaauccgcc ggcccaguac 2400
uccuggcuca ucgacggaaa cauccagcag cacacccaag aacuguucau cuccaacauu 2460
accgagaaaa acucgggacu uuacaccugu caagccaaca auuccgccag cggccacucc 2520
cgcaccacug ucaaaacuau cacugugucc gccgaacucc cgaagcccag caucagcucc 2580
aacaacucga agcccgugga ggauaaggac gcugucgcgu ucaccuguga accagaggca 2640
cagaauacca ccuaccuuug gugggucaac ggacaguccc ugccugucuc accgagacug 2700
cagcugucaa acgggaauag gacucugacc uuguuuaacg ucacccggaa cgacgcccgg 2760
gccuacgugu gcggcaucca gaacuccgug agcgcaaacc ggucugaccc agugacccug 2820
gaugugcugu acggccccga cacuccgauc auuucacccc ccgauucauc cuaccugucc 2880
ggcgcuaacc ucaaccucuc augccacucc gcauccaacc ccagcccgca auauucgugg 2940
cgcauuaacg gaauuccuca gcaacauacc cagguccugu ucauugcgaa gaucaccccu 3000
aacaacaacg gaaccuacgc cugcuuugug ucaaaccugg ccacugguag aaacaacucc 3060
aucgugaagu ccauuaccgu gucggcgucc ggauccggcg agggcagagg cagccugcug 3120
acauguggcg acguggaaga gaacccuggc cccggagcug ccccggagcc ggagaggacc 3180
cccguuggcc agggaucgug ggcccauccg ggacgcacca ggggaccauc cgacagggga 3240
uucugugugg ugucaccggc caggccagca gaagaggcaa ccagccucga gggagcguug 3300
ucuggaacca gacauuccca cccgucggug ggccggcagc accacgcggg accaccgucc 3360
acuuccagac cgccacggcc augggacacc ccuugcccgc cuguguaugc cgagacuaaa 3420
cacuuccugu acucauccgg agacaaggaa cagcuucggc cguccuuccu ccugucgucg 3480
cucagaccga gccugaccgg agcacgcaga uugguggaaa cuaucuuccu ugggucacgu 3540
ccguggaugc cagguacccc acggcgccuc ccgcgccucc cacagagaua cuggcagaug 3600
cggccucugu uccuggaauu gcugggaaac cacgcucagu gcccguacgg aguccugcuc 3660
aagacucacu gcccucugag ggcggcgguc acuccggcgg ccggagugug cgcacgggag 3720
aagccccagg gaagcguggc agcuccggaa gaggaggaca ccgauccgcg ccgccucgug 3780
caacuucugc gccagcacuc cucgcccugg caagucuacg gguucguccg cgccugccug 3840
cgccgccugg ugccgccugg gcucuggggu ucccggcaua acgagcgccg cuuccugaga 3900
aauacuaaga aguuuaucuc acuuggaaaa caugccaagu ugucgcugca agaacucacg 3960
uggaagaugu caguccgcga uugcgccugg cugcgccgcu cgccgggcgu cggguguguu 4020
ccagcugcag aacaccgccu gagagaagaa auucuggcca aauuucugca uuggcugaug 4080
ucaguguacg uggucgagcu gcugcgcucc uuuuucuacg ucacugagac uaccuuucaa 4140
aagaaccgcc uguucuucua ccgcaaaucu guguggagca agcugcaguc aaucggcauu 4200
cgccagcauc ugaagagggu gcagcugcgg gaacuuuccg aggcagaagu ccgccagcac 4260
cgggaggccc ggccggcgcu ucucacgucg cgucugagau ucaucccaaa gcccgacggg 4320
cugaggccua ucgucaacau ggauuacguc gugggcgcuc gcaccuuucg ccgugaaaag 4380
cgggccgaac gcuugaccuc acgggugaag gcccucuucu ccgugcugaa cuacgagaga 4440
gcaagacggc cuggccugcu gggagcuucg gugcugggac uggacgauau ccaccgggcu 4500
uggcggaccu uuguucuccg ggugagagcc caagacccuc cgccggaacu guacuucgug 4560
aagguggcga ucaccggagc cuaugauacu auuccgcaag aucgacucac cgaagucauc 4620
gccucgauca ucaaaccgca gaacacuuac ugcgucaggc gguacgccgu gguccagaag 4680
gccgcgcaug gccacgugag aaaggcguuc aagucgcacg uguccacucu caccgaccuc 4740
cagccuuaca ugaggcaauu cguugcgcau uugcaagaga cuucgccccu gagagaugcg 4800
guggucaucg agcagagcuc cagccugaac gaagcgagca gcggucuguu ugacguguuc 4860
cuccgcuuca ugugucauca cgcggugcga aucaggggaa aaucauacgu gcagugccag 4920
ggaaucccac aaggcagcau ucugucgacu cucuuguguu cccuuugcua cggcgauaug 4980
gaaaacaagc uguucgcugg gaucagacgg gacggguugc ugcucagacu gguggacgac 5040
uuccugcugg ugacuccgca ccucacucac gccaaaaccu uucuccgcac ucuggugagg 5100
ggagugccag aauacggcug uguggucaau cuccggaaaa cuguggugaa uuucccuguc 5160
gaggaugagg cacucggagg aaccgcauuu guccaaaugc cagcacaugg ccuguuccca 5220
uggugcgguc ugcugcugga cacccgaacu cuugaagugc aguccgacua cuccagcuau 5280
gcccggacga gcauccgcgc cagccucacu uucaaucgcg gcuuuaaggc cggacgaaac 5340
augcgcagaa agcuuuucgg aguccuccgg cuuaaaugcc auucgcucuu ucucgaucuc 5400
caagucaauu cgcugcagac cgugugcacg aacaucuaca agauccugcu gcuccaagcc 5460
uaccgguucc acgcuugcgu gcuucagcug ccguuucacc aacaggugug gaagaacccg 5520
accuucuuuc ugcgggucau uagcgauacu gccucccugu guuacucaau ccucaaggca 5580
aagaacgccg gaaugucgcu gggugcgaaa ggagccgcgg gaccucuucc uagcgaagcg 5640
gugcaguggc ucugccacca ggcuuuccuc cugaagcuga ccaggcacag agugaccuac 5700
gucccgcugc ugggcucgcu gcgcacugca cagacccagc ugucuagaaa acuccccggc 5760
accacccuga ccgcucugga agccgccgcc aacccagcau ugccgucaga uuucaagacc 5820
aucuuggac 5829
<210> 92
<211> 5829
<212> RNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 92
auggcuagca agcugaccau ugagagcacu cccuucaacg uggcugaggg gaaggaggug 60
cugcuccugg ugcacaaucu gccccagcac cuguucgggu acuccuggua caagggagaa 120
cgcguggacg ggaaccggca gaucauaggc uacgucaucg gaacccagca ggccacaccc 180
gguccagcgu acagcggccg ggagauuauc uacccgaacg ccucccugcu gauccaaaac 240
aucauccaga acgacaccgg uuucuacacu cugcacguga uuaagucaga ucuggucaac 300
gaagaggcca ccggccaauu caggguguac cccgaacucc cuaagccguu caucaccucg 360
aacaacagca acccggucga ggaugaagau gcgguggccu ugacgugcga accugagauc 420
cagaacacca ccuacuugug gugggugaac aaucagagcc ugccagucuc cccacgacuc 480
cagcugucga acgacaacag gacccugacu uugcuguccg ugacucggaa cgacgugggc 540
ccuuaugaau gcgguaucca gaacaagcug uccguggacc acagcgaccc ugugauccug 600
aacguccuuu acgggccgga cgaccccacc auuuccccgu cguacacuua cuaccggccg 660
ggcgugaacc ugucccuguc gugccacgcu gccuccaauc cgccggccca guacuccugg 720
cucaucgacg gaaacaucca gcagcacacc caagaacugu ucaucuccaa cauuaccgag 780
aaaaacucgg gacuuuacac cugucaagcc aacaauuccg ccagcggcca cucccgcacc 840
acugucaaaa cuaucacugu guccgccgaa cucccgaagc ccagcaucag cuccaacaac 900
ucgaagcccg uggaggauaa ggacgcuguc gcguucaccu gugaaccaga ggcacagaau 960
accaccuacc uuuggugggu caacggacag ucccugccug ucucaccgag acugcagcug 1020
ucaaacggga auaggacucu gaccuuguuu aacgucaccc ggaacgacgc ccgggccuac 1080
gugugcggca uccagaacuc cgugagcgca aaccggucug acccagugac ccuggaugug 1140
cuguacggcc ccgacacucc gaucauuuca ccccccgauu cauccuaccu guccggcgcu 1200
aaccucaacc ucucaugcca cuccgcaucc aaccccagcc cgcaauauuc guggcgcauu 1260
aacggaauuc cucagcaaca uacccagguc cuguucauug cgaagaucac cccuaacaac 1320
aacggaaccu acgccugcuu ugugucaaac cuggccacug guagaaacaa cuccaucgug 1380
aaguccauua ccgugucggc guccggaucc ggcgagggca gaggcagccu gcugacaugu 1440
ggcgacgugg aagagaaccc uggccccgga gcugccccgg agccggagag gacccccguu 1500
ggccagggau cgugggccca uccgggacgc accaggggac cauccgacag gggauucugu 1560
guggugucac cggccaggcc agcagaagag gcaaccagcc ucgagggagc guugucugga 1620
accagacauu cccacccguc ggugggccgg cagcaccacg cgggaccacc guccacuucc 1680
agaccgccac ggccauggga caccccuugc ccgccugugu augccgagac uaaacacuuc 1740
cuguacucau ccggagacaa ggaacagcuu cggccguccu uccuccuguc gucgcucaga 1800
ccgagccuga ccggagcacg cagauuggug gaaacuaucu uccuuggguc acguccgugg 1860
augccaggua ccccacggcg ccucccgcgc cucccacaga gauacuggca gaugcggccu 1920
cuguuccugg aauugcuggg aaaccacgcu cagugcccgu acggaguccu gcucaagacu 1980
cacugcccuc ugagggcggc ggucacuccg gcggccggag ugugcgcacg ggagaagccc 2040
cagggaagcg uggcagcucc ggaagaggag gacaccgauc cgcgccgccu cgugcaacuu 2100
cugcgccagc acuccucgcc cuggcaaguc uacggguucg uccgcgccug ccugcgccgc 2160
cuggugccgc cugggcucug ggguucccgg cauaacgagc gccgcuuccu gagaaauacu 2220
aagaaguuua ucucacuugg aaaacaugcc aaguugucgc ugcaagaacu cacguggaag 2280
augucagucc gcgauugcgc cuggcugcgc cgcucgccgg gcgucgggug uguuccagcu 2340
gcagaacacc gccugagaga agaaauucug gccaaauuuc ugcauuggcu gaugucagug 2400
uacguggucg agcugcugcg cuccuuuuuc uacgucacug agacuaccuu ucaaaagaac 2460
cgccuguucu ucuaccgcaa aucugugugg agcaagcugc agucaaucgg cauucgccag 2520
caucugaaga gggugcagcu gcgggaacuu uccgaggcag aaguccgcca gcaccgggag 2580
gcccggccgg cgcuucucac gucgcgucug agauucaucc caaagcccga cgggcugagg 2640
ccuaucguca acauggauua cgucgugggc gcucgcaccu uucgccguga aaagcgggcc 2700
gaacgcuuga ccucacgggu gaaggcccuc uucuccgugc ugaacuacga gagagcaaga 2760
cggccuggcc ugcugggagc uucggugcug ggacuggacg auauccaccg ggcuuggcgg 2820
accuuuguuc uccgggugag agcccaagac ccuccgccgg aacuguacuu cgugaaggug 2880
gcgaucaccg gagccuauga uacuauuccg caagaucgac ucaccgaagu caucgccucg 2940
aucaucaaac cgcagaacac uuacugcguc aggcgguacg ccguggucca gaaggccgcg 3000
cauggccacg ugagaaaggc guucaagucg cacgugucca cucucaccga ccuccagccu 3060
uacaugaggc aauucguugc gcauuugcaa gagacuucgc cccugagaga ugcggugguc 3120
aucgagcaga gcuccagccu gaacgaagcg agcagcgguc uguuugacgu guuccuccgc 3180
uucauguguc aucacgcggu gcgaaucagg ggaaaaucau acgugcagug ccagggaauc 3240
ccacaaggca gcauucuguc gacucucuug uguucccuuu gcuacggcga uauggaaaac 3300
aagcuguucg cugggaucag acgggacggg uugcugcuca gacuggugga cgacuuccug 3360
cuggugacuc cgcaccucac ucacgccaaa accuuucucc gcacucuggu gaggggagug 3420
ccagaauacg gcuguguggu caaucuccgg aaaacugugg ugaauuuccc ugucgaggau 3480
gaggcacucg gaggaaccgc auuuguccaa augccagcac auggccuguu cccauggugc 3540
ggucugcugc uggacacccg aacucuugaa gugcaguccg acuacuccag cuaugcccgg 3600
acgagcaucc gcgccagccu cacuuucaau cgcggcuuua aggccggacg aaacaugcgc 3660
agaaagcuuu ucggaguccu ccggcuuaaa ugccauucgc ucuuucucga ucuccaaguc 3720
aauucgcugc agaccgugug cacgaacauc uacaagaucc ugcugcucca agccuaccgg 3780
uuccacgcuu gcgugcuuca gcugccguuu caccaacagg uguggaagaa cccgaccuuc 3840
uuucugcggg ucauuagcga uacugccucc cuguguuacu caauccucaa ggcaaagaac 3900
gccggaaugu cgcugggugc gaaaggagcc gcgggaccuc uuccuagcga agcggugcag 3960
uggcucugcc accaggcuuu ccuccugaag cugaccaggc acagagugac cuacgucccg 4020
cugcugggcu cgcugcgcac ugcacagacc cagcugucua gaaaacuccc cggcaccacc 4080
cugaccgcuc uggaagccgc cgccaaccca gcauugccgu cagauuucaa gaccaucuug 4140
gacggauccg gcacaauccu gucugagggc gccaccaacu ucagccugcu gaaacuggcc 4200
ggcgacgugg aacugaaccc uggcccuacc ccuggaaccc agagccccuu cuuccuucug 4260
cugcugcuga ccgugcugac ugucgugaca ggcucuggcc acgccagcuc uacaccuggc 4320
ggcgagaaag agacaagcgc cacccagaga agcagcgugc caagcagcac cgagaagaac 4380
gccgugucca ugaccagcuc cgugcugagc agccacucuc cuggcagcgg cagcagcaca 4440
acacagggcc aggaugugac acuggccccu gccacagaac cugccucugg aucugccgcc 4500
accuggggac aggacgugac aagcgugcca gugaccagac cugcccuggg cucuacaaca 4560
cccccugccc acgaugugac cagcgccccu gauaacaagc cugccccugg aagcacagcc 4620
ccuccagcuc auggcgugac cucugcccca gauaccagac cagccccagg aucuacagcc 4680
ccacccgcac acggcgugac aagugccccu gacacaagac ccgcuccagg cucuacugcu 4740
ccuccugccc auggcgugac aagcgcuccc gauacaaggc cagcuccugg cuccacagca 4800
ccaccagcac auggcgugac aucagcuccc gacacuagac cugcucccgg aucaaccgcu 4860
ccaccagcuc acggcgugac cagcgcaccu gauaccagac cugcucuggg aagcaccgcc 4920
ccucccgugc acaaugugac aucugcuucc ggcagcgcca gcggcucugc cucuacacug 4980
gugcacaacg gcaccagcgc cagagccaca acaaccccag ccagcaagag cacccccuuc 5040
agcaucccua gccaccacag cgacaccccu accacacugg ccagccacuc caccaagacc 5100
gaugccucua gcacccacca cuccagcgug cccccucuga ccagcagcaa ccacagcaca 5160
agcccccagc ugucuaccgg cgucucauuc uucuuucugu ccuuccacau cagcaaccug 5220
caguucaaca gcagccugga agaucccagc accgacuacu accaggaacu gcagcgggau 5280
aucagcgaga uguuccugca aaucuacaag cagggcggcu uccugggccu gagcaacauc 5340
aaguucagac ccggcagcgu gguggugcag cugacccugg cuuuccggga aggcaccauc 5400
aacgugcacg acguggaaac ccaguucaac caguacaaga ccgaggccgc cagccgguac 5460
aaccugacca ucuccgaugu guccgugucc gacgugcccu ucccauucuc ugcccagucu 5520
ggcgcaggcg ugccaggaug gggaauugcu cugcuggugc ucgugugcgu gcugguggcc 5580
cuggccaucg uguaucugau ugcccuggcc gugugccagu gccggcggaa gaauuacggc 5640
cagcuggaca ucuuccccgc cagagacacc uaccacccca ugagcgagua ccccacauac 5700
cacacccacg gcagauacgu gccacccagc uccaccgaca gaucccccua cgagaaagug 5760
ucugccggca acggcggcag cucccugagc uacacaaauc cugccguggc cgcugccucc 5820
gccaaccug 5829
<210> 93
<211> 568
<212> RNA
<213> Encephalomyocarditis virus
<400> 93
uaacguuacu ggccgaagcc gcuuggaaua aggccggugu gcguuugucu auauguuauu 60
uuccaccaua uugccgucuu uuggcaaugu gagggcccgg aaaccuggcc cugucuucuu 120
gacgagcauu ccuagggguc uuuccccucu cgccaaagga augcaagguc uguugaaugu 180
cgugaaggaa gcaguuccuc uggaagcuuc uugaagacaa acaacgucug uagcgacccu 240
uugcaggcag cggaaccccc caccuggcga caggugccuc ugcggccaaa agccacgugu 300
auaagauaca ccugcaaagg cggcacaacc ccagugccac guugugaguu ggauaguugu 360
ggaaagaguc aaauggcucu ccucaagcgu auucaacaag gggcugaagg augcccagaa 420
gguaccccau uguaugggau cugaucuggg gccucggugc acaugcuuua cauguguuua 480
gucgagguua aaaaacgucu aggccccccg aaccacgggg acgugguuuu ccuuugaaaa 540
acacgaugau aauauggcca caaccaug 568

Claims (32)

1.一种抗原构建体,其包含编码免疫原性CEA多肽的核苷酸序列。
2.权利要求1的抗原构建体,其进一步包含编码免疫原性MUC1多肽的核苷酸序列。
3.权利要求1的抗原构建体,其进一步包含编码免疫原性TERT多肽的核苷酸序列。
4.权利要求1的抗原构建体,其进一步包含编码免疫原性MUC1多肽的核苷酸序列和编码免疫原性TERT多肽的核苷酸序列。
5.权利要求2、3或4任一项的抗原构建体,其进一步包含间隔子核苷酸序列。
6.权利要求5的抗原构建体,其中所述间隔子核苷酸序列编码2A肽。
7.权利要求5的抗原构建体,其中该间隔子核苷酸序列编码选自EMC2A、ERA2A、ERB2A及T2A的2A肽。
8.权利要求1-7中任一项的抗原构建体,其中所述免疫原性CEA多肽选自:
(1)多肽,其包含SEQ ID NO:2的氨基酸2-702、SEQ ID NO:2的氨基酸323-702或SEQ IDNO:2的氨基酸323-677,或由SEQ ID NO:2的氨基酸2-702、SEQ ID NO:2的氨基酸323-702或SEQ ID NO:2的氨基酸323-677组成;
(2)多肽,其包含SEQ ID NO:15的氨基酸序列或SEQ ID NO:15的氨基酸4-704,或由SEQID NO:15的氨基酸序列或SEQ ID NO:15的氨基酸4-704组成;
(3)多肽,其包含SEQ ID NO:17的氨基酸序列或SEQ ID NO:17的氨基酸4-526,或由SEQID NO:17的氨基酸序列或SEQ ID NO:17的氨基酸4-526组成;
(4)多肽,其包含SEQ ID NO:19的氨基酸序列或SEQ ID NO:19的氨基酸4-468,或由SEQID NO:19的氨基酸序列或SEQ ID NO:19的氨基酸4-468组成;及
(5)多肽,其是上述(1)至(4)中任一多肽的功能性变体。
9.权利要求3至8中任一项的抗原构建体,其中所述免疫原性TERT多肽选自:
(1)多肽,其包含SEQ ID NO:9的氨基酸序列或SEQ ID NO:9的氨基酸2-893;
(2)多肽,其包含SEQ ID NO:11的氨基酸序列或SEQ ID NO:11的氨基酸3-791;
(3)多肽,其包含SEQ ID NO:13的氨基酸序列或SEQ ID NO:13的氨基酸4-594;及
(4)多肽,其是上述(1)至(3)中任一多肽的功能性变体。
10.权利要求2及4至9中任一项的抗原构建体,其中所述免疫原性MUC1多肽选自:
(1)多肽,其包含SEQ ID NO:5的氨基酸序列或SEQ ID NO:5的氨基酸4-537;
(2)多肽,其包含SEQ ID NO:7的氨基酸序列或SEQ ID NO:7的氨基酸4-517;及
(3)多肽,其是上述(1)或(2)的多肽的功能性变体。
11.权利要求1的抗原构建体,其包含编码氨基酸序列的核苷酸序列,所述氨基酸序列选自:
(1)SEQ ID NO:31的氨基酸序列或包含SEQ ID NO:31的氨基酸4-1088的氨基酸序列;
(2)SEQ ID NO:33的氨基酸序列或包含SEQ ID NO:33的氨基酸4-1081的氨基酸序列;
(3)SEQ ID NO:35的氨基酸序列或包含SEQ ID NO:35的氨基酸4-1085的氨基酸序列;
(4)SEQ ID NO:37的氨基酸序列或包含SEQ ID NO:37的氨基酸4-1030的氨基酸序列;
(5)SEQ ID NO:39的氨基酸序列或包含SEQ ID NO:39的氨基酸4-1381的氨基酸序列;及
(6)SEQ ID NO:41的氨基酸序列或包含SEQ ID NO:41的氨基酸4-1441的氨基酸序列。
12.权利要求1的抗原构建体,其包含选自以下的核苷酸序列:
(1)SEQ ID NO:30的核苷酸序列或包含SEQ ID NO:30的核苷酸10-3264的核苷酸序列;
(2)SEQ ID NO:32的核苷酸序列或包含SEQ ID NO:32的核苷酸10-3243的核苷酸序列;
(3)SEQ ID NO:34的核苷酸序列或包含SEQ ID NO:34的核苷酸10-3255的核苷酸序列;
(4)SEQ ID NO:36的核苷酸序列或包含SEQ ID NO:36的核苷酸10-3090的核苷酸序列;
(5)SEQ ID NO:38的核苷酸序列或包含SEQ ID NO:38的核苷酸10-4143的核苷酸序列;
(6)SEQ ID NO:40的核苷酸序列或包含SEQ ID NO:40的核苷酸10-4323的核苷酸序列;及
(7)核苷酸序列,其是上述(1)至(6)中任一核苷酸序列的简并变体。
13.权利要求1的抗原构建体,其包含编码氨基酸序列的核苷酸序列,所述氨基酸序列选自:
(1)SEQ ID NO:43的氨基酸序列或包含SEQ ID NO:43的氨基酸4-2003的氨基酸序列;
(2)SEQ ID NO:45的氨基酸序列或包含SEQ ID NO:45的氨基酸4-2001的氨基酸序列;
(3)SEQ ID NO:47的氨基酸序列或包含SEQ ID NO:47的氨基酸4-2008的氨基酸序列;
(4)SEQ ID NO:49的氨基酸序列或包含SEQ ID NO:49的氨基酸4-1996的氨基酸序列;
(5)SEQ ID NO:51的氨基酸序列或包含SEQ ID NO:51的氨基酸4-1943的氨基酸序列;及
(6)SEQ ID NO:53的氨基酸序列或包含SEQ ID NO:53的氨基酸4-1943的氨基酸序列。
14.权利要求1的抗原构建体,其包含选自以下的核苷酸序列:
(1)SEQ ID NO:42的核苷酸序列或包含SEQ ID NO:42的核苷酸10-6009的核苷酸序列;
(2)SEQ ID NO:44的核苷酸序列或包含SEQ ID NO:44的核苷酸10-6003的核苷酸序列;
(3)SEQ ID NO:46的核苷酸序列或包含SEQ ID NO:46的核苷酸10-6024的核苷酸序列;
(4)SEQ ID NO:48的核苷酸序列或包含SEQ ID NO:48的核苷酸10-5988的核苷酸序列;
(5)SEQ ID NO:50的核苷酸序列或包含SEQ ID NO:50的核苷酸10-5829的核苷酸序列;
(6)SEQ ID NO:52的核苷酸序列或包含SEQ ID NO:52的核苷酸10-5829的核苷酸序列;及
(7)核苷酸序列,其是上述(1)至(6)中任一核苷酸序列的简并变体。
15.权利要求1的抗原构建体,其包含:
(1)SEQ ID NO:87、88、89、90、91及92中任一核苷酸序列;或
(2)SEQ ID NO:87、88、89、90、91及92中任一核苷酸序列的简并变体。
16.一种药物组合物,其包含(i)权利要求1-15中任一项的抗原构建体,和(ii)药学上可接受的载剂。
17.权利要求16的药物组合物,其是疫苗。
18.一种治疗需要治疗的人中的癌症的方法,其包括给人施用有效量的权利要求16或17的药物组合物。
19.权利要求18的方法,其中所述癌症过表达选自MUC1、CEA或TERT的一或多种肿瘤相关抗原。
20.权利要求18的方法,其中所述癌症是胰腺癌、卵巢癌、乳腺癌、胃癌、肺癌或结肠直肠癌。
21.权利要求18的方法,其中所述癌症是三阴性乳腺癌、雌激素受体阳性乳腺癌或HER2阳性乳腺癌。
22.权利要求18的方法,其进一步包括给患者施用有效量的免疫调节剂。
23.权利要求22的方法,其中所述免疫调节剂是CTLA-4抑制剂、IDO1抑制剂、PD-1抑制剂或PD-L1抑制剂。
24.权利要求18的方法,其进一步包括给人施用佐剂。
25.一种载体,其包含权利要求1-15中任一项的抗原构建体。
26.权利要求25的载体,其是质粒载体。
27.权利要求26的载体,其包含SEQ ID NO:57、59、61、63、65、67、69、70、71、72、73及74中的任一核苷酸序列。
28.权利要求25的载体,其是病毒载体。
29.权利要求28的载体,其包含SEQ ID NO:58、60、62、64、66及68中的任一核苷酸序列。
30.(1)权利要求1-15中任一项的抗原构建体,(2)权利要求16或17的药物组合物,或(3)权利要求25-29中任一项的载体用作药剂的用途。
31.权利要求30的用途,其中所述药剂用于治疗癌症。
32.(1)权利要求1-15中任一项的抗原构建体或(2)权利要求25-29中任一项的载体在制备用于治疗癌症的药剂中的用途。
CN201880057887.4A 2017-07-11 2018-07-03 免疫原性组合物 Pending CN111065408A (zh)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201762531227P 2017-07-11 2017-07-11
US62/531,227 2017-07-11
US201862682044P 2018-06-07 2018-06-07
US62/682,044 2018-06-07
PCT/IB2018/054926 WO2019012371A1 (en) 2017-07-11 2018-07-03 IMMUNOGENIC COMPOSITIONS COMPRISING CEA MUC1 AND TERT

Publications (1)

Publication Number Publication Date
CN111065408A true CN111065408A (zh) 2020-04-24

Family

ID=63720720

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201880057887.4A Pending CN111065408A (zh) 2017-07-11 2018-07-03 免疫原性组合物

Country Status (16)

Country Link
US (1) US20190016775A1 (zh)
EP (1) EP3651792A1 (zh)
JP (2) JP7028953B2 (zh)
KR (1) KR20200027551A (zh)
CN (1) CN111065408A (zh)
AU (1) AU2018300295A1 (zh)
BR (1) BR112020000413A2 (zh)
CA (1) CA3069363A1 (zh)
CO (1) CO2020000231A2 (zh)
IL (1) IL271917A (zh)
PE (1) PE20200613A1 (zh)
PH (1) PH12020500087A1 (zh)
RU (1) RU2020100072A (zh)
SG (1) SG11202000197PA (zh)
TW (1) TW201920674A (zh)
WO (1) WO2019012371A1 (zh)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102020201219A1 (de) 2020-01-31 2021-08-05 United Initiators Gmbh Transport- und Lagerbehälter für Peroxide
CN112552380B (zh) * 2020-12-10 2021-12-24 武汉博沃生物科技有限公司 一种SARS-CoV-2病毒的免疫原及其应用

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050059624A1 (en) * 2001-12-19 2005-03-17 Ingmar Hoerr Application of mRNA for use as a therapeutic against tumour diseases
WO2008043760A1 (en) * 2006-10-12 2008-04-17 Istituto Di Ricerche Di Biologia Molecolare P. Angeletti Spa Telomerase reverse transcriptase fusion protein, nucleotides encoding it, and uses thereof
JP2014161283A (ja) * 2013-02-26 2014-09-08 Shizuoka Prefecture Ceacam5遺伝子のスプライシングバリアント
CN104918958A (zh) * 2012-11-20 2015-09-16 赛诺菲 抗ceacam5抗体及其用途
CN105530952A (zh) * 2013-08-21 2016-04-27 库瑞瓦格股份公司 用于治疗肺癌的组合物和疫苗
WO2016112195A1 (en) * 2015-01-09 2016-07-14 Etubics Corporation Methods and compositions for combination immunotherapy

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4769330A (en) 1981-12-24 1988-09-06 Health Research, Incorporated Modified vaccinia virus and methods for making and using the same
US4603112A (en) 1981-12-24 1986-07-29 Health Research, Incorporated Modified vaccinia virus
US5288641A (en) 1984-06-04 1994-02-22 Arch Development Corporation Herpes Simplex virus as a vector
CA1341423C (en) 1984-10-31 2003-03-04 Paul A. Luciw Recombinant proteins of viruses associated with lymphadenopathy syndrome and/or acquired immune deficiency syndrome
GB8508845D0 (en) 1985-04-04 1985-05-09 Hoffmann La Roche Vaccinia dna
US5091309A (en) 1986-01-16 1992-02-25 Washington University Sindbis virus vectors
WO1989001973A2 (en) 1987-09-02 1989-03-09 Applied Biotechnology, Inc. Recombinant pox virus for immunization against tumor-associated antigens
US5591624A (en) 1988-03-21 1997-01-07 Chiron Viagene, Inc. Retroviral packaging cell lines
US5716826A (en) 1988-03-21 1998-02-10 Chiron Viagene, Inc. Recombinant retroviruses
US5703055A (en) 1989-03-21 1997-12-30 Wisconsin Alumni Research Foundation Generation of antibodies through lipid mediated DNA delivery
US5817491A (en) 1990-09-21 1998-10-06 The Regents Of The University Of California VSV G pseusdotyped retroviral vectors
US6015686A (en) 1993-09-15 2000-01-18 Chiron Viagene, Inc. Eukaryotic layered vector initiation systems
US6962790B1 (en) 1998-09-23 2005-11-08 University Of Massachusetts Medical Center Predictive assay for immune response
US6682736B1 (en) 1998-12-23 2004-01-27 Abgenix, Inc. Human monoclonal antibodies to CTLA-4
ES2282133T3 (es) 1999-08-24 2007-10-16 Medarex, Inc. Anticuerpos frente a la ctla-4 humano y sus usos.
TWI228718B (en) 2001-11-05 2005-03-01 Tdk Corp Manufacturing method and device of mold plate for information medium
PL1711518T3 (pl) 2004-01-23 2010-06-30 St Di Richerche Di Biologia Molecolare P Angeletti S P A Nośniki szczepionek pochodzące od szympansich adenowirusów
KR101927291B1 (ko) 2008-07-08 2018-12-10 인사이트 홀딩스 코포레이션 인돌아민 2,3-디옥시게나아제의 억제제로서의 1,2,5-옥사디아졸
BRPI1008018A2 (pt) 2009-02-02 2016-03-15 Okairos Ag ácidos nucleicos de adenovírus símio e sequências de aminoácidos, vetores contendo os mesmos e uso dos mesmos
US9128725B2 (en) 2012-05-04 2015-09-08 Apple Inc. Load-store dependency predictor content management
JP2016535746A (ja) 2013-10-28 2016-11-17 ピラマル エンタープライジーズ リミテッド 薬草組成物、その製造方法および使用
KR102006527B1 (ko) 2013-11-01 2019-08-02 화이자 인코포레이티드 전립선-연관 항원의 발현을 위한 벡터
EA201650031A1 (ru) 2014-05-15 2017-06-30 Итеос Терапьютик Производные пирролидин-2,5-диона, фармацевтические композиции и способы применения в качестве ингибиторов ido1
TWI595006B (zh) 2014-12-09 2017-08-11 禮納特神經系統科學公司 抗pd-1抗體類和使用彼等之方法

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050059624A1 (en) * 2001-12-19 2005-03-17 Ingmar Hoerr Application of mRNA for use as a therapeutic against tumour diseases
WO2008043760A1 (en) * 2006-10-12 2008-04-17 Istituto Di Ricerche Di Biologia Molecolare P. Angeletti Spa Telomerase reverse transcriptase fusion protein, nucleotides encoding it, and uses thereof
CN101522706A (zh) * 2006-10-12 2009-09-02 P.安杰莱蒂分子生物学研究所 端粒酶逆转录酶融合蛋白、编码它的核苷酸以及其用途
CN104918958A (zh) * 2012-11-20 2015-09-16 赛诺菲 抗ceacam5抗体及其用途
JP2014161283A (ja) * 2013-02-26 2014-09-08 Shizuoka Prefecture Ceacam5遺伝子のスプライシングバリアント
CN105530952A (zh) * 2013-08-21 2016-04-27 库瑞瓦格股份公司 用于治疗肺癌的组合物和疫苗
WO2016112195A1 (en) * 2015-01-09 2016-07-14 Etubics Corporation Methods and compositions for combination immunotherapy

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
ELIZABETH S. GABITZSCH等: "The generation and analyses of a novel combination of recombinant adenovirus vaccines targeting three tumor antigens as an immunotherapeutic" *
JAMES L. GULLEY等: "Pilot Study of Vaccination with Recombinant CEA-MUC-1-TRICOM Poxviral-Based Vaccines in Patients with Metastatic Carcinoma" *
SUSANNE M RITTIG等: "Intradermal Vaccinations With RNA Coding for TAA Generate CD8+ and CD4+ Immune Responses and Induce Clinical Benefit in Vaccinated Patients" *

Also Published As

Publication number Publication date
JP7028953B2 (ja) 2022-03-02
BR112020000413A2 (pt) 2020-07-21
RU2020100072A3 (zh) 2021-08-11
CO2020000231A2 (es) 2020-01-17
KR20200027551A (ko) 2020-03-12
IL271917A (en) 2020-02-27
PH12020500087A1 (en) 2020-09-14
SG11202000197PA (en) 2020-02-27
EP3651792A1 (en) 2020-05-20
AU2018300295A1 (en) 2020-01-23
CA3069363A1 (en) 2019-01-17
TW201920674A (zh) 2019-06-01
JP2022031653A (ja) 2022-02-22
RU2020100072A (ru) 2021-08-11
JP2020526202A (ja) 2020-08-31
PE20200613A1 (es) 2020-03-11
US20190016775A1 (en) 2019-01-17
WO2019012371A1 (en) 2019-01-17

Similar Documents

Publication Publication Date Title
KR102006527B1 (ko) 전립선-연관 항원의 발현을 위한 벡터
AU2020260485B2 (en) Gene therapies for lysosomal disorders
KR101728483B1 (ko) 전립선 관련된 항원 및 백신 기재 면역치료 요법
DK2753355T3 (en) ONCOLYTIC HERP SIMPLEX VIRUSES AND THERAPEUTIC APPLICATIONS THEREOF
KR20220141332A (ko) 홍역-벡터화된 covid-19 면역원성 조성물 및 백신
ES2388527T3 (es) Vacunas de VIH basadas en Env de múltiples clados de VIH
KR20150014505A (ko) 아과 e 원숭이 아데노바이러스 a1302, a1320, a1331 및 a1337 및 이것들의 사용
CN111295449A (zh) 腺病毒载体及其用途
DK2623594T3 (da) Antistof mod human prostaglandin-E2-receptor EP4
KR20230066360A (ko) 신경퇴행성 장애를 위한 유전자 요법
KR20220078607A (ko) 융합 단백질들을 이용한 tcr 재프로그래밍을 위한 조성물 및 방법들
KR20200083510A (ko) 아데노바이러스 및 이의 용도
CN111065408A (zh) 免疫原性组合物
KR20230031929A (ko) 고릴라 아데노바이러스 핵산 서열 및 아미노산 서열, 이들을 함유하는 벡터, 및 이의 용도
KR20210150486A (ko) 리소좀 장애에 대한 유전자 요법
KR102158923B1 (ko) 암 백신
TW202308669A (zh) 嵌合共刺激性受體、趨化激素受體及彼等於細胞免疫治療之用途
KR20210150487A (ko) 리소좀 장애를 위한 유전자 요법
JP2024073576A (ja) 改変アデノウイルス
CN113088530A (zh) 一种基于黑猩猩ChAd63型腺病毒的表达载体及其构建方法

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 40027086

Country of ref document: HK

WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20200424

WD01 Invention patent application deemed withdrawn after publication