CN115247179B - 一种聚酮化合物骨架及其后修饰物的生物合成基因簇及其应用 - Google Patents

一种聚酮化合物骨架及其后修饰物的生物合成基因簇及其应用 Download PDF

Info

Publication number
CN115247179B
CN115247179B CN202110446525.8A CN202110446525A CN115247179B CN 115247179 B CN115247179 B CN 115247179B CN 202110446525 A CN202110446525 A CN 202110446525A CN 115247179 B CN115247179 B CN 115247179B
Authority
CN
China
Prior art keywords
ala
gly
leu
val
thr
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110446525.8A
Other languages
English (en)
Other versions
CN115247179A (zh
Inventor
邵雷
彭传新
王晓婧
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai University of Medicine and Health Sciences
Original Assignee
Shanghai University of Medicine and Health Sciences
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai University of Medicine and Health Sciences filed Critical Shanghai University of Medicine and Health Sciences
Priority to CN202110446525.8A priority Critical patent/CN115247179B/zh
Publication of CN115247179A publication Critical patent/CN115247179A/zh
Application granted granted Critical
Publication of CN115247179B publication Critical patent/CN115247179B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/74Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0071Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
    • C12N9/0077Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14) with a reduced iron-sulfur protein as one donor (1.14.15)
    • C12N9/0081Cholesterol monooxygenase (cytochrome P 450scc)(1.14.15.6)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P17/00Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
    • C12P17/02Oxygen as only ring hetero atoms
    • C12P17/08Oxygen as only ring hetero atoms containing a hetero ring of at least seven ring members, e.g. zearalenone, macrolide aglycons
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y114/00Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
    • C12Y114/15Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with reduced iron-sulfur protein as one donor, and incorporation of one atom of oxygen (1.14.15)
    • C12Y114/15006Cholesterol monooxygenase (side-chain-cleaving) (1.14.15.6), i.e. cytochrome P450scc

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Biotechnology (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Medicinal Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Enzymes And Modification Thereof (AREA)

Abstract

本发明首次解析并确证了与聚酮化合物骨架以及Tubelactomicin生物合成有关的基因簇,所述基因簇可以进一步与其它生物合成酶基因共同构建聚酮化合物重组表达系统,用于提高Tubelactomicin的产量或者用于获得新的Tubelactomicin衍生物。

Description

一种聚酮化合物骨架及其后修饰物的生物合成基因簇及其 应用
技术领域
本发明属于微生物与基因工程领域,具体涉及一种聚酮化合物骨架及其后修饰物的生物合成基因簇及其应用。
背景技术
快速生长型非结核分枝杆菌(RGM,Rapidly growing mycobacteria)是指除结核分枝杆菌复合群和麻风分枝杆菌以外的快速生长类型分支杆菌。RGM可以引起肺部病变、淋巴结炎、皮肤软组织感染和骨骼系统病变等疾病,统称为快速生长分枝杆菌病。近年来,快速生长分枝杆菌病(RGM)的患病率迅速上升,且对于目前的临床抗菌药物普遍呈耐药性。广州市胸科医院分离到的180株RGM的药敏检测发现总体耐药率为100%,除了对克拉霉素和硫酸阿米卡星耐药率低于40%以外,对其他10种一二线抗菌药物的耐药率普遍高于80%。RGM患病率迅速上升,耐药性强,治疗困难,寻找新的治疗药物已成为临床重大需求和抗菌药物研究的难点之一。
由于RGM对于不同结构类别和抗菌机制的药物几乎全部耐药,从现有药物靶点和抗生素结构衍生物寻找抗RGM药物的困难较大。
Tubelactomicin是一种聚酮类天然化合物,于2000年被发现,属于16元环内酯抗生素,由土壤放线菌诺卡氏菌的发酵培养液中分离获得。Tubelactomicin A对快速生长型分枝杆菌显示出强大活性。TubelactomicinA对草分枝杆菌(MIC为0.2μg/ml),牛分枝杆菌(MIC为0.1μg/ml),耻垢分枝杆菌(MIC为0.1μg/ml)和偶发分枝杆菌(MIC为0.78μg/ml)等RGM表现出强大的抗菌活性。在100mg/kg静脉给药量的动物实验中,Tubelactomicin A也表现出良好的体内安全性。Tubelactomicin A与其它抗结核抗生素无交叉耐药性,表明其对快速生长型分枝杆菌的生长抑制作用具有新的机理。
诺卡氏菌产生的Tubelactomicin类化合物包含Tubelactomicin A、Tubelactomicin B、Tubelactomicin D、Tubelactomicin E组分。其中Tubelactomicin B、Tubelactomicin D、Tubelactomicin E组分为Tubelactomicin A生物合成途径中的中间体。Tubelactomicin D和Tubelactomicin E组分也具有一定的抗RGM活性。
组合生物合成是近20年发展起来的一种以药物发现为中心,通过基因工程改造来挖掘和生物合成新型复杂的化合物,获取新型抗生素的新方法。对于化学手段难以合成或者合成起来代价昂贵的复杂生物大分子化合物,组合生物合成技术尤其适用于这些化合物。通过生物信息学对比分析,确定基因簇中各个模块或者结构域的功能,结合微生物遗传学手段,将这些模块或者结构域进行进行排列组合,就可能产生不同功能基团的新型人为改造的天然产物。
聚酮类化合物是由细菌、真菌、放线菌或植物产生,以小分子羧酸为前体,经酰基转移酶和聚酮合成酶PKS催化合成大环骨架,再经由一系列后修饰操作形成的一类数目庞大、结构复杂的天然次级代谢产物。目前发现的PKS根据结构和机理上的差异主要分为三类:Ⅰ型PKS、Ⅱ型PKS、Ⅲ型PKS。相较于其他聚酮合成酶,Ⅰ型PKS是目前研究最深入,最广泛的一类聚酮合成酶,整个PKS运行类似于工厂流水线车间,非重复使用的各个催化功能域依次执行识别、加载、加工处理等一系列操作,聚酮链的生物合成顺序与结构功能域一一对应。Ⅰ型PKS主要包括酮基合成酶(Keto-synthase,KS)、酰基转移酶(Acyl Transferase,AT)、酰基载体蛋白(Acyl Carrier Protein,ACP)、烯醇还原酶(Enoyl Reductase,ER)、脱氢酶(Dehydratase,DH)、酮基还原酶(Ketoreducatse,KR)、转甲基酶(Methyltransferase,MT)、硫化氢解酶(Sulfhydrolase,SH)硫酯酶(Thioesterase,TE)等酶的功能域。聚酮碳链的构建时,需要各种催化结构域组合成不同模块参与反应,使得每个碳单位发生不同程度的还原,从而合成结构特异多样的聚酮化合物,这意味着聚酮化合物的结构具有一定的可塑性,聚酮合酶PKS所具有的可塑性使人们可以通过组合生物合成手段获取获得新结构或新活性的化合物。
发明内容
本发明首次对诺卡氏菌基因组进行了测序及分析,基于三代测序数据,对诺卡氏菌的655473条序列进行组装,得到初步组装结果;之后,将质量过滤的二代测序数据比对到组装结果上,对组装结果进一步校正,经过拼接组装及校正,得到了诺卡氏菌的基因组信息,一共含有10224019个碱基,基因组中总GC碱基百分比含量为65.47%。
完成组装后,对编码基因进行了预测,预测的编码基因数目为9106个,预测的编码基因总长度为9074577bp,最短基因为90bp,最长基因为71547bp,所有预测基因的总GC碱基百分比含量为65.90%。其中,基因组中含有133个碳水化合物结合模块基因(Carbohydrate-Binding Modules,CBM)、349个编码糖基转移酶(Glycosyl Transferases,GT)的基因、173个编码糖苷水解酶(Glycoside Hydrolases,GH)的基因、88个编码碳水化合物酯酶(Carbohydrate Esterases,CE)的基因、2个编码多糖裂解酶(PolysaccharideLyases,PL)的基因及36个辅助活动基因(Auxiliary Activities,AA)。进一步的,预测了诺卡氏菌种20个PKS基因,从众多候选PKS基因种最终确证了与Tubelactomicin聚酮链的合成有关的聚酮合成酶基因簇。
本发明具体技术方案如下:
一种聚酮化合物骨架的生物合成基因簇,包括:
编码酰基转移酶(Tub A)的基因,所述酰基转移酶(Tub A)的氨基酸序列如SEQ IDNO:2所示;
编码聚酮合成酶1(Tub B)的基因,所述聚酮合成酶1(Tub B)的氨基酸序列如SEQID NO:3所示;
编码聚酮合成酶2(Tub C)的基因,所述聚酮合成酶2(Tub C)的氨基酸序列如SEQID NO:4所示;
编码聚酮合成酶3(Tub D)的基因,所述聚酮合成酶3(Tub D)的氨基酸序列如SEQID NO:5所示;
编码聚酮合成酶4(Tub E)的基因,所述聚酮合成酶4(Tub E)的氨基酸序列如SEQID NO:6所示;
所述聚酮化合物骨架具有如下结构:
本发明所述生物合成基因簇中的各基因对应于诺卡氏菌的基因的第8079-8083位基因,如表1所示。
表1聚酮化合物骨架的生物合成基因簇信息
上述各酶包含的功能域如图1所示。
上述聚酮化合物骨架的生物合成基因簇中,编码酰基转移酶的基因序列为SEQ IDNO:1的序列第501~17702位所示的序列;编码聚酮合成酶1的基因序列为SEQ ID NO:1的序列第17719~22716位所示的序列;编码聚酮合成酶2的基因序列为SEQ ID NO:1的序列第22724~34054位所示的序列;编码聚酮合成酶3的基因序列为SEQ ID NO:1的序列第34178~49867位所示的序列;编码聚酮合成酶4酶的基因序列为SEQ ID NO:1的序列第49894~64248位所示的序列。
一个具体的方案,上述聚酮化合物骨架的生物合成基因簇序列为SEQ ID NO:1的序列第501~64248位所示的序列。
本发明另一目的在于提供一种聚酮化合物生物合成基因簇,包括本发明所述的聚酮化合物骨架的生物合成基因簇以及后修饰基因中的一种或几种,所述后修饰基因包括硫酯酶、细胞色素P450酶、铁氧化还原蛋白。
一个具体的方案,所述聚酮化合物为Tubelactomicin A、Tubelactomicin D或Tubelactomicin E,所述后修饰基因包括编码硫酯酶(Tub F)、一个或多个细胞色素P450酶(Tub G)和铁氧化还原蛋白(Tub H)的基因,硫酯酶氨基酸序列如SEQ ID NO:7所示,细胞色素P450酶氨基酸序列如SEQ ID NO:8所示铁氧化还原蛋白氨基酸序列如SEQ ID NO:9所示。
聚酮化合物骨架(Tubelactomicin B)以及Tubelactomicin A、Tubelactomicin D或Tubelactomicin E的生物合成途径如图2所示。
上述后修饰基因对应于诺卡氏菌的基因的第8084-8086位基因,如表2所示。
表2聚酮化合物骨架的后修饰基因信息
本发明在对上述生物合成基因簇的功能研究中发现,细胞色素P450酶基因能够促进聚酮化合物骨架环化的形成,进而提高Tubelactomicin A、Tubelactomicin D或Tubelactomicin E的产率。
上述聚酮化合物的生物合成基因簇中,编码硫酯酶的基因序列为SEQ ID NO:1的序列第66224~66985位所示的序列;编码细胞色素P450酶的基因序列为SEQ ID NO:1的序列第67135~68322位所示的序列;编码铁氧化还原蛋白的基因序列为SEQ ID NO:1的序列第68335~68529位所示的序列。
一个具体的方案,Tubelactomicin A的生物合成基因簇序列如SEQ ID NO:1所示。
本发明另一目的在于提供本发明所述的聚酮化合物骨架和/或聚酮化合物的生物合成基因簇在构建聚酮化合物重组表达系统中的应用。可通过对聚酮化合物骨架或聚酮化合物生物合成基因簇中的功能基因进行基因重组和/或基因敲除和/或基因置换,或者将其与一种或几种其它生物合成酶基因和/或生物调节基因联合构建聚酮化合物重组表达系统。
本发明另一目的在于提供一种Tubelactomicin A重组表达系统,包含本发明所述的聚酮化合物生物合成基因簇和一个或多个细胞色素P450酶基因的共表达载体,或者包含表达本发明所述的的聚酮化合物生物合成基因簇的载体和表达细胞色素P450酶基因的载体。
本发明另一目的在于提供一种Tubelactomicin A的生物合成方法,包括如下步骤:
(1)构建包含SEQ ID No:1所示的聚酮化合物生物合成基因簇和一个或多个P450酶基因的重组表达工程菌;
(2)种子培养:
种子培养基:2%半乳糖,2%糊精,1%甘油,1%大豆蛋白胨,0.5%玉米浆,0.2%硫酸铵,0.2%碳酸钙;
将步骤(1)构建的重组表达工程菌接种种子培养基,初始pH值7.4,180rpm转速,30℃培养3天;
(3)发酵培养:
发酵培养基:4%可溶性淀粉,1%黄豆饼粉,0.5%Yeast extract,0.4%碳酸钙;
将步骤(2)种子菌液接种发酵培养基,初始pH值7.4,7%接种量,180rpm转速,30℃培养6天。
(4)分离纯化发酵液,得到Tubelactomicin A。
优选的,纯化步骤包括:
(1)调节发酵液pH值1.0-3.0,优选2.0,加入HP20树脂(树脂事先使用甲醇溶胀过夜),吸附1-5h,优选2.5h,使用甲醇浸泡HP20,将浸出液浓缩,用二氯甲烷萃取,收集二氯甲烷相。
(2)硅胶柱层析,洗脱条件:依次使用二氯甲烷洗脱2个柱体积,使用体积比20:1的二氯甲烷:甲醇混合液洗脱5个柱体积。收集20:1的二氯甲烷:甲醇混合液的洗脱组分,浓缩得到Tubelactomicin A。
(3)可进一步使用HPLC进行检测洗脱液纯度。
HPLC分析色谱柱为Amchemteq ACI-C18,4.6×250mm,柱温为30℃,流速为1mL/min,检测波长为238nm,流动相为A相:1‰甲酸水溶液,B相乙腈,洗脱梯度如表3所示。
表3洗脱梯度
Tubelactomicin A的保留时间为24.9min。
本发明优点:
本发明首次解析并确证了Tubelactomicin生物合成有关的基因簇,所述聚酮合成酶基因簇通过基因重组/基因敲除/基因置换,得到Tubelactomicin及其衍生物,并可以进一步与其它生物合成酶基因和生物调节基因共同构建Tubelactomicin或其衍生物重组表达系统,用于提高Tubelactomicin的产量或者用于获得新的Tubelactomicin衍生物。
附图说明
图1为聚酮化合物骨架的生物合成基因簇各酶包含的功能域示意图。
图2为聚酮化合物骨架(Tubelactomicin B)以及Tubelactomicin A、D或E的生物合成途径。
图3为Tubelactomicin A HPLC色谱图。
图4为Tub D单交换敲除基因片段电泳图(M:Marker;1:Tub D单交换敲除基因片段)。
图5为pOJ260-Tub D单交换重组质粒及电泳图(图5a:pOJ260-Tub D单交换重组质粒图;
图5(b):pOJ260-Tub D单交换重组质粒电泳图)。
图6为pOJ260-Tub D单交换重组质粒酶切电泳图。
图7为pOJ260-Tub D单交换突变株电泳图。
图8为野生型DSM 44638以及pOJ260-Tub D单交换突变株表达Tubelactomicin A。
图9为Tub G基因片段PCR验证电泳结果(1:Tub基因片段;2:Marker)。
图10为pSET152-Tub G回补质粒及电泳图(图10a为pSET152-Tub G回补质粒示意图;图10b为pSET152-Tub G回补质粒PCR电泳图。1~3:pSET152-Tub G回补质粒;M:Marker)。
图11为pSET152-Tub G回补质粒酶切电泳图(1:pSET152-Tub G回补质粒酶切产物;M:Marker)。
图12为野生型DSM 44638以及DSM 44638-Tub G倍增突变株表达TubelactomicinA。
具体实施方式
以下通过实施例说明本发明的具体步骤,但不受实施例限制。
在本发明中所使用的术语,除非另有说明,一般具有本领域普通技术人员通常理解的含义。
下面结合具体实例并参照数据进一步详细描述本发明。应理解,这些实施例只是为了举例说明本发明,而非以任何方式限制本发明的范围。
在以下实施例中,未详细描述的各种过程和方法是本领域中公知的常规方法。
实施例1使用诺卡氏菌发酵制备Tubelactomicin A
(1)菌种发酵
按照2%半乳糖,2%糊精,1%甘油,1%大豆蛋白胨,0.5%玉米浆,0.2%硫酸铵,0.2%碳酸钙与去离子水配制种子培养基,调pH值至7.4,于121℃,高压蒸汽灭菌20min。接入诺卡氏菌Nocαrdia vinacea DSM44638(购于德国微生物菌种保藏中心DSMZ),180rpm转速,30℃培养3天;
按照4%可溶性淀粉,1%黄豆饼粉,0.5%Yeast extract,0.4%碳酸钙去离子水配制发酵培养基,将种子菌液接种发酵培养基,初始pH值7.4,7%接种量,180rpm转速,30℃培养6天。
(2)发酵液分离纯化
将培养的发酵液离心,取上清液调pH值到2左右,加入HP20树脂(树脂事先使用甲醇溶胀过夜),于摇床震荡吸附2.5h左右后,取出,静置沉淀树脂,将沉淀的树脂使用去离子水清洗,保留树脂。加入甲醇。将甲醇浸出液浓缩,用二氯甲烷萃取,收集二氯甲烷相。使用硅胶柱层析,依次使用二氯甲烷洗脱2个柱体积,使用体积比20:1的二氯甲烷:甲醇混合液洗脱5个柱体积。收集20:1的二氯甲烷:甲醇混合液的洗脱组分,浓缩得到TubelactomicinA。
(3)使用HPLC进行检测洗脱液,纯度。
HPLC分析色谱柱为Amchemteq ACI-C18,4.6×250mm,柱温为30℃,流速为1mL/min,检测波长为238nm,进样量根据处理方法和可能的含量选择10~25μL,流动相为1‰甲酸水和乙腈洗脱梯度如表4所示。
表4洗脱梯度
将纯化的样品送样进行LC-MS检测,样品质谱结果显示为一个分子质量为487.3085的加氢峰,其分子式为C29H42O6,这与Tubelactomicin A分子式相符合。
进一步检测Tubelactomicin A对耻垢分枝杆菌ATCC607的最小抑菌浓度(MIC)。结果显示,Tubelactomicin A对ATCC607有良好的抑菌活性,其MIC值测量值为0.125μg/mL。将Tubelactomicin A纯品使用氘代丙酮溶解后,进行400兆NMR测定,结果如下表5所示。
表5.Tubelactomicin A的1H and 13C NMR数据
实施例2聚酮化合物的生物合成基因簇与Tubelactomicin A的相关性验证
验证策略:采用单交换同源重组法,敲降诺卡氏菌DSM44638菌株聚酮化合物的生物合成基因簇中编码聚酮合成酶3(Tub D)的基因(诺卡氏菌DSM44638菌株全基因组第8082位基因),考察对Tubelactomicin A生物合成的影响。
(1)设计PCR扩增用引物序列如下:
pOJ260-Tub D_Dan_F:5’-cgacggccagtgccaagcttgaaccggttgtggtggtgg-3’(SEQID No:10);
pOJ260-Tub D_Dan_R:5’-tatgacatgattacgaattcttcggcagtgtctcgtggc-3’(SEQID No:11)。
以诺卡氏菌DSM44638基因组为模板,使用Prime star DNA聚合酶进行PCR扩增,PCR反应程序如下:Step 1:98℃,10s,Step 2:62℃,5s,Step 3:72℃,70s,以上三步重复30次,Step 4:72℃,5min。
对PCR扩增产物进行1%琼脂糖凝胶电泳分析,并回收扩增产物。PCR产物凝胶电泳图如图4所示(M:Marker;1:Tub D单交换敲除基因片段)。从图4中可以看出,扩增得到1100bp的Tub D基因片段,结果符合预期。
(2)酶切和连接
将步骤(1)获得Tub D基因条带进行琼脂糖凝胶电泳分离,切胶回收,测量胶回收浓度。将质粒pOJ260(Bierman M,Logan R,O′brien K,Seno ET,Rao RN,Schoner BE(1992)Plasmid cloning vectors for the conjugal transfer of DNA from Escherichiacoli to Streptomyces spp.Gene 116:43–49)使用HandⅢ酶和EcoRⅠ酶进行酶切。将酶切体系放置于37℃水浴锅,反应30min,将反应后的体系进行琼脂糖电泳,在凝胶成像仪的紫外照射下进行切胶,回收酶切产物。
将Tub D扩增产物和质粒pOJ260的酶切产物采用Infusion重组连接酶进行连接,获得pOJ260-Tub D单交换重组质粒。质粒示意图如图5a所示,电泳结果如图5b所示(1~3为:pOJ260-Tub D单交换重组质粒,M:Marker)。
将pOJ260-Tub D单交换重组质粒使用XhoⅠ/HindⅢ进行酶切,酶切得到2890bp和1633bp的两条条带(酶切电泳图如图6所示),结果符合预期。将酶切进行测序,测序结果正确。
(3)转化、筛选并鉴定
将步骤(2)获得的pOJ260-Tub D单交换重组质粒与诺卡氏菌DSM44638菌体混合,将混合物通过电击转化后,涂在含有安普霉素(100μg/ml)的MS琼脂培养基上,10天后,长出的菌落阳性即为转化子。
挑取阳性菌体作为模板,使用引物:PKS-P3:5’-tcggattcgactccctcacc-3’(SEQID No:12)/RV-M:5’-gagcggataacaatttcacacagg-3’(SEQ ID No:13)进行菌落PCR,PCR程序为:Step 1:98℃,10s,Step 2:62℃,5s,Step 3:72℃,90s,以上三步重复30次,Step 4:72℃,5min。
将扩增产物进行电泳,扩增出1274bp的基因片段,符合预期大小(如图7所示,泳道1:DSM 44638野生株;2:pOJ260-Tub D单交换质粒;3:DSM 44638-Tub D突变株;M:Marker)。
经PCR验证正确的突变株命名为DSM 44638-Tub D,将DSM 44638-Tub D突变菌株在不含抗生素的种子培养基中培养,并转接到发酵培养基进行发酵。发酵液经过1:1甲醇处理和浓缩处理后,进行HPLC检测发酵液中的Tubelactomicin A的发酵单位(发酵及分离纯化方法参照实施例1),结果如图8所示。结果显示突变株DSM 44638-Tub D失去产Tubelactomicin A的能力。结果表明,本发明所述的聚酮化合物骨架的生物合成基因簇是Tubelactomicin A生物合成所必需的。
实施例3高产Tubelactomicin重组表达菌株的构建
(1)设计PCR扩增用引物序列如下:
pSET_Tub G_F:5’-cgataagcttggatcattttgtccccaccgatagatagtc-3’(SEQ IDNo:14);
pSET_Tub G_R:5’-ggctgcaggtcgactcgagagaaaacagttgtcctgaataag-3’(SEQ IDNo:15)。
以诺卡氏菌DSM44638基因组为模板,使用Prime star DNA聚合酶进行PCR扩增,PCR反应条件为Step 1:98℃,10s,Step 2:62℃,5s,Step 3:72℃,70s,以上三步重复30次。扩增得到大小为1409bp;包含Tub G完整基因的DNA片段(如图9所示),结果符合预期。
(2)酶切和连接
将步骤(1)Tub G扩增的目的条带进行琼脂糖凝胶电泳分离,切胶回收,测量胶回收浓度。将模板质粒pSET152使用XbalⅠ内切酶和BamHⅠ内切酶进行酶切。将20μL的酶切体系放置于37℃水浴锅,反应30min,将反应后的体系进行琼脂糖电泳,在凝胶成像仪的紫外照射下进行切胶,回收酶切产物,线性模板条带大小为6195bp,结果符合预期。
将Tub G扩增产物和模板质粒pSET152的酶切产物采用Infusion重组连接酶进行连接,获得pSET152-Tub G回补质粒。质粒示意图如图10a所示,电泳结果如图10b所示(1~3为:pSET152-Tub G回补质粒,M:Marker)。
使用XhoⅠ/NcoⅠ对成功整合的pSET152-Tub G回补质粒进行酶切,酶切得到3927bp、2678bp和1222bp的三条条带(酶切电泳图如图11所示),结果符合预期。将酶切验证正确的送样测序,测序结果正确。
(3)转化、筛选并鉴定
将步骤(2)获得的pSET152-Tub G回补质粒与诺卡氏菌DSM 44638菌体混合,将混合物通过电击转化后,涂在含有安普霉素(100μg/ml)的MS琼脂培养基上,10天后,长出的菌落阳性即为转化子。
挑取具有安普霉素抗性的阳性转化子作为模板,使用pSET152质粒上的通用引物序列RV-M:5’-gagcggataacaatttcacacagg-3’(SEQ ID No:16)以及Tub G基因末端序列设计引物pSET_Tub G_JD_F1(5’-attgcattcgggtcagggga-3’(SEQ ID No:17),进行菌落PCR,PCR程序同实施例2。将扩增产物进行电泳,扩增得到797bp的基因片段,符合预期结果。将此条带测序后,结果与预期一致。将该菌株命名为DSM 44638-Tub G。
DSM 44638-Tub G表达株采用实施例1的方法发酵、分离、纯化,并计算Tubelactomicin A的产量,结果如图12所示。结果显示,DSM 44638-Tub G相对于野生型提高了约30%。结果表明,Tub G基因(P450)能够提高野生型菌DSM 44638 Tubelactomicin A的产量。
序列表
<110> 上海健康医学院
<120> 一种聚酮化合物骨架及其后修饰物的生物合成基因簇及其应用
<160> 17
<170> SIPOSequenceListing 1.0
<210> 1
<211> 69840
<212> DNA
<213> Nocardia vinacea
<400> 1
gaaacctatg tagggctcga ccgctacgag gtccgccgct acgacgcctg gtaccgccgc 60
cacatcgcta tggccatgct cgccggcatc tacctcgccg tcactgccgc gaaatcccaa 120
aagcattggc agcggcctca tcccactcac tatcggagaa attcgacgtt gttggcacac 180
ctgatcgcca tcaccgcacc cctcgatcgg gatccgacga tggtcgacct ggcgacgcgg 240
acatcaatac tcgcccacta ccagcgacgc cagcgcctac ataactaagc gcggctggag 300
tactagccat tgcactcaga ttatcgagga gcggatggac tatctgccga cgagatggct 360
acagcccaca tctattcccc gacgagcggt gacctccagt atcggcgaca ggagccctcg 420
tacgttaagt acgcactacg ggatccgtgt gggggccgtc cgcaagagcc gtcccccaca 480
gccatttcga tatctccaca tcagcttctt gagcgtccca accattcctc tatggcatca 540
acaaccttac cagcatcgct cgataccatc gaaaaatggt ccgaatctac caccgagagc 600
gtttgtgcat tcgaccaagg tttggcgatt attccgatat ttacaccctc aagttccagt 660
tcgaataccg gctcccgaca ctgcacaaag agcgcatcat ggtcttctgc gacgtagtcg 720
agcttcgata aaaaatccac ccaggtgata gtcgcggtta gactttcaaa gctaaggccg 780
ccttcgaatt cctccctgag agttctgttt gccatttgat ccatagggag atttttccct 840
acttccattt cgaaggtatc caacagcact ataccttcca gttttgacac acaagaccct 900
gcgagccgtt cagctgtggc atacgccaga tgcccagcag aagagtgccc aaccagaata 960
aatggcgcgt cgccaactac cccttctatc gatttagcta ccgattctat cgctacatca 1020
gacgattccg gcagcggttc tcccggctta aatcccaaca acgggacact ccacactcga 1080
cggcgtccac taaatatctt cgaaagatgt gcatgctcgg ccactccacc gaaaaccggt 1140
gtgcacacaa aaacaacctg cggcacagtt ggtccatcac atagcataac tggctgcgga 1200
cccgtcacgg cgtcagacga tgcaaaggtt tcccgaaggc atgtggcagc ccggagtaag 1260
tcatagcctg catccatttt tccagatcgc agcgccgccc taaagagccc gccaagtgta 1320
tccagaggac gcactttctc gaccgatgaa cgttcaatat caaactcgct cctcagatat 1380
tccgcaaagg aagcaatgtt ctcctgatca aatatcacca tgggagacaa ttcaattccg 1440
gtggctcgga cgaggtagtt tcttaattcg acggcggtga gggagtcgaa tccgagttcc 1500
tggaagttgc ggtcggcgtc gatcgcggtg gcgtcgtcgt gtccgagcac gatcgccacc 1560
tgaccacgca ccaactccag cagcaccttg acctgttcgg cctcgtccag gcccgacaag 1620
cgttgccgca gttgcgatcc cgccacccca gcacctgcgc taccggtgtt tccggcggca 1680
acacgtcggg cgttgggtac caggttgtgc agtatcggtg ccagcatccc cgcccgcgcc 1740
tgcgcggcca ggacggtggt gtcgaaccgc gctgccagca cggtagcgtg ctcggcggtc 1800
acggcggtgt cgaacatcgc cataccctgt gcctcggtca acgccagcat gccgccacgg 1860
ctcatgcgcg cggtatcgcc accatcgagg tgaccggtca tcccggtgcc cgatccccac 1920
aaaccccacg cgatcgacgt cgccgccaac cctcgagccc gccggtactc ggccaacccg 1980
tcgaggaact gattggccgc ggcatagttg ccctgaccgg gcgaacccag cacaccggcg 2040
gccgaggaat acaggacgaa catgcccaga tccaggccac gggtcagctc gtgcagatac 2100
cacgccgcat cggctttcac ggaaagcacc gtgtcgaggc gttgcggtgt cagcgacgcg 2160
atcacaccgt cgtcgagcac acccgccgca tgcaccaccc cgaccaaagg atcctcgtcc 2220
ggtaccgcgg ccagcagttg ctcgaccccg gcacgggtgg acacatcaca ggccaccacc 2280
gccacccgcg cacccgaacc ggtcaactcc tcgaccaact cacgcgcacc ctccgcagcc 2340
aaaccccggc gagaagccaa caccagcgac cgcacacccc gcacacccac cagatgccgg 2400
gccagaatcc gacccaaacc accggtacca ccggtcacca ccacagtccc ccgaccagca 2460
cccgacacgg tgtcatcggt atccgagata tccgatgccg tggtcttgtc gtcacgcccg 2520
gcagtacgga ccagtcgcgc gatgtgggcg ataccgtcac ggatcagaac ctgaggctcc 2580
cccaccgcca cagccaacga cacgatcccg gccacatcaa caccatcgga gccctcgatg 2640
tcggtgtcgg cgagcaggat ccggcccggt tcctccgact gcgccgaacg caccaaaccc 2700
cagattgtcg acgccgccgg atcgacccga tcacccgcgg tcgtggtgac cgccgcccgg 2760
gtgaggacca gcagcgtgct ggacgcgaac cgctgaccag tcgagaactc ctgcaacaca 2820
cccagcaccc gatggctgat cgcgtgcgcc ctgaccagca catcggtgtc gatttcaccg 2880
acggtgttgt tttcaccgtc acggcagtcg agcaccacca ccggtggcac cggatcgtcg 2940
gcggactcgt gctccaggtt gttccactcc acgaattcca cttcccgcag ttgcgcaggt 3000
atgggtgtgg gtgtccagtg cagggtgtgc agccggtctc cgccgtctgc tgcggtggtg 3060
agttggtcga gttggacggg gcgcagtgtc agtgatgcga tggtgaggac cggtagcccg 3120
tcggggtcgg tcacggtcac gcggaccgtg ttgtgcccga gtggggtgat tctggcgtgc 3180
acggtcgagg cgccgacggc atgcaattgc acgccttccc acgcgaacgg cagcaacgga 3240
cccgctgcgg cctccacccc ggccttccca ctggtgtcaa agccggtggt catggcgtgc 3300
aggacggcat cgagcagggc ggggtgtagt ccgtaatggt ttgcctggcc gccggtttcg 3360
gggagggtgg cttgcacgag ccagtcttct ccggtgcgcc agactgattc caggccttgg 3420
aacgcgggac cgtagccgta cccgtcctcg gcgagttgct ggtagaggct gctggtatcg 3480
gtgtgcaccg cacccgctgg tggccatgcc gccagcccgg catccacgtc gtgtggtgtt 3540
gtggtggtga ggttttcgac cggactctgg gtgtggagta ggccttgggc gttcaacacc 3600
cactcctggt ctcgggtttg ggagtacacc gacactgtgc gggtgccgga ggtttcgagc 3660
gcgccgacga ggacctggat ggcggtgccg ccctcggcgg gcagtgtcag gggtgcgagc 3720
aacgtcagtt cccgtatcgc cccgcatccg acctcgtcac cggcacggat caccagctcg 3780
accagcccgg tcccgggcag caacaccaca ccacccacgg cgtggtcggc cagccacgga 3840
tgagtctgca gcgacagccg gccggtcacg gtcacggccc cggtctccgg agacaccacc 3900
accgccccga tcaatggatg atcgagtgct gtttgcccca gcgagtcagg gttcccggat 3960
gcggtgaggg tgtcgagcca gtagcggcga tgctggaacg cgtaggaggg caacgcaacc 4020
cgggtcgcac cacggccgtc gaagatcggt gtccagtcga ccccggcacc ggccacatcg 4080
accattgcca gcgccgagag cagccggtcc agtccgccgt cgtcgcggcg cagtgatccg 4140
gtcacgacga tgtcgcgtgt tcgtgggccg gtttgttcgc cgagttcttc gatgcccggt 4200
gtcagcaccg ggtgtggtga ggcctccacg aacacggtgt ggccctcgct cagcagggtt 4260
tgcacggtcg cggcgaagtt cacggtgtcg cgcaggttgc ggaaccagta cccggcatcg 4320
agttcggtgg tgtcgagcag ggtgccggtc accgtggagt agaacgcgat tcgtgagggt 4380
cgtggagtga tggtggccag ttcctcgagc agtcgttgcc gcaaggactc gacctgtggt 4440
gaatgcgagg cgtagtcgac cgcgatccgg cggacctgca caccgtcgct ttcgcaggcg 4500
gccacgaacg cgtcgagttg ttcggtcgga cccgacacca ccgtggtggt gggcccgttg 4560
accgcggcca cggccagtcc cggcatgtcg gtcagtcgtt gctcgaccag tgtcgtcggc 4620
agcagcacgc tggccattcc gcctcgaccc gacagttccc gcaatgcccg cgaacgcaga 4680
atcaccacac gtgcggcgtc ttcgagcgac aacgccccgg ccacacaggc tgcggcgatc 4740
tcgccttgcg aatgccccag taccgcatcc gggacgacac cgaaggagcg ccatacctcg 4800
gcgagggaaa ccatcaccgc gaacagtgcc ggctgcacga catcgacccg ctcgagcagc 4860
atcggatcgg ccgtgccctg cagcacgtcg atcaacgacc actccaccaa gggcgcgaac 4920
acctcggcgc attcgagcac tttctgctcg aacaccaccg attcctgcaa cagacgcgca 4980
cccattccca gccactgcgc gccctgaccg gggaacacca acaccgtctt gcccacgcca 5040
ccagcaacac ccgagaccac acccggaccc gcgaacatgc cggcacccgc gagcatgccc 5100
ggttcgctat cgatcagtgc ttgcagtcgg gtcatcaact cctcacggtc ggcaccgacc 5160
agcaccgccc gatgctccaa ccgtgcccgg gtattgatca aagaccaccc cacgtccacc 5220
gcatccagcc ccggccgtgt cagcatccac tcctgcaacc ggcgcccctg cgcgagcaag 5280
ccctcaccgg tacgccccga caacacccac accaacccgc cagcagtgac cggcaccagc 5340
gactcggtac caggcaccgg ttcaacgacc gcaggtgctt gttcgaggat gacgtgcgcg 5400
ttggtgcccg agacgccgaa ggaggagatg cccgcgcgca gggggtgccc gttgcgaggc 5460
cacggctgtt cctgggtgag cagctcgact gtgccggtcg tccagtccac gtggctggag 5520
ggtgtgtcga cgtgcaaggt tttcggcagt gtctcgtggc gcatcgcctc gatcattttg 5580
atcaccccgg ccacaccggc ggcggcctgg gcgtggccga tattcgattt cagcgacccc 5640
agccacaagg gccggtcggg ttcacggttt tgcccgtagg tggccagcag ggcttgggcc 5700
tcgatcgggt caccgagggt ggtgccggtg ccgtgagcct caaccacatc gatcagatcg 5760
ggcgaaagac cggcgttggc caacgcgcgg cggatcaccc gctgctggga aggaccgttg 5820
ggggcggtca acccgttgga cgcaccgtcc tgattgaccg ccgaaccccg caccaccgcc 5880
aacacttgat ggccgtgttt gcgggcctcc gacagccgct ccacgaccag gatgccgacg 5940
ccttcggacc agcccgtccc gtcggcggcc tcggcgaacg gtttgcaccg gccatccggc 6000
gccagtcctc cctgccggga gaactccaca aatgcaccag gagtcgacat cactgtcacc 6060
ccgccgacca gcgccatccc gcactcgccg gcgcgcacgg cctgcacggc ttgatgcagg 6120
gcgaccagtg aggacgagca cgcggtgtcc accgacaccg cggggccctc caaccccaac 6180
acatacgaca cccggcccga caccacactt gaggtcgcgc cggtcagccg atagccctcg 6240
actccggcgt ccccgtctcc tcggcccaca ccgtaggact ggtcgctgac gccgatgaac 6300
acgccggtct cgctgccatg caacgaggtt gggtccacgc cggcgtcttc gagggcttcc 6360
cacaccgttt ccagcagcaa ccgctgctgc ggatccatcg caaccgcctc gcgcgggctg 6420
atcccgaaga accccgcatc gaacgagctt gcatcgtgga ggaacccgcc ctcacgcgta 6480
gaggacctag tacctatgtc ccaccctcgg tccagcggcc attgcgacac cacatcgcgg 6540
ccctgcacca gcagctccca caggtcctcc cgagacgaca cttctccggg gaagcgacag 6600
cccacaccca cgatcgcgat cggctcggcc gaatgaccca tcaccacaac cggttcggcg 6660
gccacggatg ctccggccag ctgttggtgg aggtgttcgg cgacggctct gggggtcggg 6720
tagtcgaagg taagagtggc cgggatcgcg acttcggtgg cggttttgag tcggttgcgg 6780
gcttcgacgg cggtgaggga gtcgaatccg agttcctgga agttgcggtc ggcgtcgatc 6840
gtggtggcgt cgtcgtgtcc gagcacgatc gccacctggg tctgcaccaa ctccagcagg 6900
atcttgacct gttcggcctc gtccaggccg gacaggcgtt gccgcagttg cgatcccgcc 6960
accccgctgg cgctgccggt gtctccggtg gctgcgcgtc gggcgttggg gacgagctgg 7020
tgcagtatcg gtgtcagcat ccccgcacgg gcctgcgcgg ccagggcggt ggtgtcgaac 7080
cgcgctgcga gcacggtggc gtgctcggcg gtaatggcgg tgtcgaacat cgccatgccc 7140
tgctcgtcgg tcatcgccag atagccgcca cggttcatgc gtgcggtgtc gccaccatcg 7200
aggtgaccgg tcatcccggt cgatgatccc cacaatcccc acgcgatcga ggtcgcgggc 7260
aggccttgag cacggcggtg ttcggccagt ccgtcgagga actgattggc cgcggcatag 7320
ttgccctgac cgggcgagcc cagcacaccg gcggtcgagg aatacatgac gaacatgccc 7380
agatccaggc cacgggtcag ctcgtgcaga taccacgccg catcggcttt cacggaaagc 7440
accgtgtcga ggcgttgcgg tgtcagcgac gcgatcacac cgtcgtcgag cacacccgcc 7500
gcatgcacca ccccgaccaa aggatcctcg tccggtaccg cggccagcag ttgctcgacc 7560
ccggcacggg tggacacatc acaggccacc accgccaccc gcgcacccga accggtcaac 7620
tcctcgacca actcacgcgc accctccgca gcgatacccc ggcgagaagc caacaccagc 7680
gaccgcacac cccgcacacc caccagatgc cgggccagaa tccgacccaa accaccggta 7740
ccaccggtca acaccacagt cccacgaccg gtatccgtcg tgtccgagac cacgggcagt 7800
gtcgaaacca cgggtagtgt gaggacgact ttgccgatgt gacgggtctg gctgaaatat 7860
cggaaggcct cgggtgcctg cctgatatcc catgcttgga tgggaatgga cttgagttcg 7920
ccgcgatcga aactggcggt gagctcggag agcatctgtt ggatacggtc ttccccggcc 7980
tcgaacatgt cgaaggcttg atagatcacc ccgggatact gggtagtgat cgcatcgctg 8040
tcacgcttgt cggtcttgcc catctcgagg aagtggcccc cgcgcggtag cagtcgcagc 8100
gacgcgtcga cgaaatcccc ggccaatgag ttcaagacga tgtccacccc gtgaccgtcg 8160
gtggccgaca agaattcgtc ttcgaagctc aacgtccgcg aattcgcgat gtgctggtcg 8220
tcgaaaccta tgccccgcaa cacatcccac ttgccactgc tggcagttgc gaagacttcc 8280
aggccccagc aacgtgccag ttggatcgcg gccattccta cgccgccggt cgccgcatgc 8340
accagcaggc gatcccccgg ctttgcatga gccaggtcca tgagcccgta gtaggccgtc 8400
aagaacacga ccggtaccgc tgcggcttgg gcgaatgacc accccgccgg catgtgcacg 8460
accagtcgat ggtcgacgat cacgaccggt cccactccac gaccggccaa ccccatgacc 8520
cggtcgccga cgctcagacc ctcgacgtct gcaccgacct cgacaatgac tcctgccagc 8580
tcggcaccca ccacagcgtc gtcatcgggg tacatgccca gcgcgatcag tacatcccgg 8640
aagttcaacc cggcagcccg cacggagatc cgcacctgcc ccgctgccag tggctgctcc 8700
gccaaggggt gactcaccag ggccaaacca tccagcacac ccttatccac agcagcgagc 8760
tgccatgccc ccgcgtcggg gatcgcaaga gtcccgcgcc cgggaccacg ggtcagtcgc 8820
gcgatgtggg cgataccgtc acggatcaga acctgaggct cccccaccgc cacagccaac 8880
gacacgatcc cggccacatc aacaccatcg gagccctcga tgtcggtgtc ggcgagcagg 8940
atccggcccg gttcctccga ctgcgccgaa cgcaccaaac cccagattgt cgacgccgcc 9000
ggatcgaccc gatcacccgc ggtcgtggtg accgccgccc gggtgaggac cagcagcgtg 9060
ctggacgcga accgctgacc agtcgagaac tcctgcaaca cacccagcac ccgatggctg 9120
atcgcgtgcg ccctgaccag cacatcggtg tcgatttcac cgacggtgtt gttttcaccg 9180
tcacggcagt cgagcaccac caccggtggc accggatcgt cggcggactc gtgctccagg 9240
ttgttccact ccacgaattc cacttcccgc agttgcgcag gtatgggtgt gggtgtccag 9300
tgcagggtgt gcagccggtc tccgccgtct gctgcggtgg tgagttggtc gagttggacg 9360
gggcgcagtg tcagtgatgc gatggtgagg accggttgcc cgtcggggtc ggtcacggtc 9420
acgcggaccg tgttgtgccc aagtggggtg attctggcgt gcacggtcga ggcgccgacg 9480
gcatggagct gcacgccttc ccacgcgaac ggcagcaacg gacccacact ggtatcggta 9540
tcggtgtcgg tatcgtggcc ggtggtcatg gcgtgcagga cggcatcgag cagggcgggg 9600
tgtagtccgt agtggtgtgc gtcgccgccg gtttcgggga ggcgggcttg cacgagccag 9660
tcttctccgg tgcgccagac tgattccagg ccttggaacg cgggaccgta gccgtacccg 9720
tcctcggcga gttgctggta gaggctgctg gtgtcggttc gggtggcgtt ttgtggtggc 9780
cataccgcca accccgtgtc cactggtgtt gtggtggtga ggttttcgac cggactctgg 9840
gtgtggagta ggccttgggc gttcaacacc cactcctggt ctcgggtttg ggagtacacc 9900
gacactgtgc gggtgccgga ggtttcgagc gcgccgacga ggacctggat ggcggtgccg 9960
ccctcggcgg gcagtgtcag gggtgcgagc aacgtcagtt cccgtatcgc cccgcatccg 10020
acctcgtcac cggcacggat caccagctcg accagcccgg tcccgggcag caacaccaca 10080
ccacccacgg cgtggtcggc cagccacgga tgagtctgca gcgacagccg gccggtcacg 10140
gtcacggccc cggtctccgg ggacaccacc accgccccga tcaatggatg atcgagtccc 10200
gacaatccca gcgagtcggg atcggtgttg ccggtgatcg tatcgagcca gtagcggcgg 10260
tgctggaaag cgtaggaagg caacggaacc cgggtcgcgc cgcggccgtg gaagatcggt 10320
gtccagtcga ttccggtgcc ggccacatcc agtcgggcca gtgccgagag cagggtcgtg 10380
tcctcgacac ggtccttacg cagcagcgaa gccacgacag cctccacacc gtccacggtg 10440
ggtttggtat ccacggcgtc gctggtggtg tgttggaggg tttcgtcgat gagtccggat 10500
aggccgccgt cggggcccat gatcacgtat cgggttgctc ccgccgtggt gagggtggtg 10560
attccgtcgg cgaaccgtac ggtgttgcgg acgtggtcga cccagtactg cggcgtggtc 10620
agcggtgaat ccgcctgttg ggcatcggtg ttcgggctgt cggtgttcgg gccggtgagt 10680
tggccgtcga ggttggagat gatcgggatg accggttggg tgtaggtgag ttcggtggcg 10740
atgcgggcga attcggccag catgggttcc atcgaggcgg agtggaacgc gtgggagacc 10800
cgcagccggt tgacctggta tccggcctgc cgtagttgtt gctcggtggt gtcgatcgcg 10860
tgctgggggc cggcgaggac gatcgattcg ggtccgttga ccgcggcgat ctcgacgaca 10920
ccgtcctcga tgctgtcacc gagcagggtg gtgatctggg tttcggaggc tcgcatggcg 10980
agcatggctc cgccggtggg gagttgctgc atcagccggg cgcgggcggc gaccagtacc 11040
gtcgcgtcct cgaggctcag tacaccggcc acggtcgcgg cggccagttc accgatggag 11100
tgtccggcca cgaaatccgg tcggacaccg aaggattcca gcaaccggaa cagcgcgata 11160
ccgacggcga acagtcctgt ctgggtgtag agggtcgctt gcagtgcctg ctcgtcgaca 11220
ccccacacca catcccgcag cgagcattcc agctgctgtt ccagcagtgc ggtggtctcg 11280
tcgaagctgg ccgcgaacac cgggaacgcc tcgtacaaac cagatcccat acccagcagc 11340
tgcgcgccct gaccggggaa cacgaacacc gtcttgccgc ggtcacgcga aacaccagcc 11400
gccacagcgg gatcaccgtc gatcaacccc tgcagtcggg tcatcaactc ctcacggtcg 11460
gcaccgacca gcaccgcccg atgctccaac cgtgcccggg tattgatcaa agaccacccc 11520
acgtccaccg catccagccc cgggcgcgcc agcatccact cgtgtagacg ccgtccctgc 11580
gcgagcaagc cttcaccggt acgccccgac accatccaca ccactgcatc ggacttcacc 11640
gcaggcaccg gatcggtgtc gggtgccgat gattcggtgt cgggtgtgac gggtggtgat 11700
tgttcgagga tgacatgcgc gttggtaccc gagacgccga aggcagacac ggccgcacga 11760
cgcggccggt cggcctcggt ggtccaggta cgagactcgg tgagtagttc gactgcgccc 11820
gatgtccagt cgacgtgggt ggtcggtgtg tcgatgtgga gggttttcgg tagtgtctgg 11880
tggcgtattg cctcgatcat tttgatcact ccggctacgc cggcggcgtt ttgggtgtgg 11940
ccgatgttgg atttcaggga gcccagccac aggggccggt cgggttcgcg gttttgcccg 12000
taggtggcca gcagggcttg ggcttcgatc gggtcaccga gggtggtgcc ggtgccgtgg 12060
gcttcgacca cgtccacttc cgtggccgag actccggcgt tggccagggc gcggcggatc 12120
acccgctgct gggaaggacc gttgggggcg gtcagcccgt tggacgcccc gtcctggttg 12180
accgccgagc cacgcaccac cgccaacact tgatgcccgt ggcgacgcgc gtcggagagt 12240
cgttccacga cgaggatgcc gacgccttcg gaccagcctg ttccgtcggc ggcttcggcg 12300
aaggatttgc agcgtccgtc gggtgccagt cctttctggc gggagaattc gatgaacgcg 12360
ccgggtgtgg ccagtaccgc gacgccgccg accagcgcca tcccgcactc accggcgcgc 12420
acggcttgca cggcttggtg cagggcgacc agcgaggacg agcacgcggt gtccaccgat 12480
accgcgggac cttccagtcc caacacatac gacactcggc ccgatacgac gctcgtggcg 12540
ccgccggtca gccggtatcc ctcgactccg gcgtcgccgt cgcttcggcc tatgccgtag 12600
gactggtcgc tgacgccgat gaatacgccg gtgtcgctgc cgcgcaacga gaccgggtcc 12660
accccggcgt cttcgagggc ttcccacacc gtttccagca gcaaccgctg ctgcggatcc 12720
atcgcaaccg cctcgcgcgg gctgatcccg aagaacccgg cgtcgaacaa gccggcgtcg 12780
tgaaggaacg cgccgtctcg cgtgtaggac ttaccggtca cccccggttc gggatcgaac 12840
aaccctgcat cccagccacg gtcgaccggc cattgcgaca ccacatcccg gccctgggcc 12900
acgacctgcc acagctcttc gcgtgaggac acgcctccgg ggaagcggca acccacaccc 12960
acgatcgcga tcggctcggc cgaatgaccc accaccacaa ccggttcggc ggccacggat 13020
gctccggcca gctgctggtg caggtgttcg gcgaccgccc ggggagtcgg atagtcgaag 13080
atcaacgtcg ccgcgaccgc caccccggtg gcggtcttga tccggttgcg ggcttcgacg 13140
gcggtgaggg agtcgaatcc caggtcccgg aagttgcggt cggcgtcgat cgcggtggcg 13200
tcgtcgtgtc cgagcacgat cgcgatctgg gcgcgcacca gatccagcag tacctggatc 13260
tgttcggttt cgcccagacc tgacaagcgt tgccgcagtg gcgatcccgt gttcccgctg 13320
tcgatagcac tgtcggtgtc ggcttgggca tcgggtatgt ctgtgatgag cagacgacgc 13380
cgcgacagcg tgtagtagac ggtgaactgt tgccagtcga tgtcggcgac ggtcaccagg 13440
gtttcgttgt tggcgactgc ttgactcaat gcttgcaggg ccagatcggg ttccatcaat 13500
cggatcccga gccgaccgaa gtattcggtg gttgtgccga tttcggtcat cccgccgcct 13560
gaccagccgc cccaggccag ggatgtggcg accagtccgc gcgatcgacg atcctgtgcc 13620
aggccgtcga ggtgggcgtt gcttgccgcg tactcggcga gtccggtacc gccccaggtc 13680
gcagcgcccg aggagaacag cacgaaggcg tccaggcggc gatcgccgag tagttcgtcc 13740
aaatgctgtg cgcccccgac tttcgccgcg gccgcggtgg tcatcgattc cgagtcgatc 13800
tcggtcaacg gccgctgatc gaccactccc gccgcgtgga tgacggcggt gagcgggatg 13860
ctgtcgttgt cgatcgtgga caacacggcg gccacgtcgc cgcgttcggc gatgtcggcg 13920
gccatgatcg tcactcgccc gccgagggca ctcagttcct gctccagctc gagcgcgccg 13980
ggagcctgcc gcccgcgacg gctcaccagt accacgtgtt ctgcgccgtt tgtgagcagc 14040
catcgtgcag catgtgcccc gattcccccg gtgccgccgg tgacgagcac tgtcccgcgg 14100
ggacgccaat gcttgccccg acctgaattg ggcaacggtg cccgcatcat ccgccgtccg 14160
tagacaccgc tttcgcggac ggctaactgg tcttctccgt cctcacgtga cagcaccgcc 14220
ggcagactgc gcaggatggt gtcgtcccac gcgttcggaa ggtcgatcag ccctccccac 14280
gactgcggga gttccagacc cgctacctgc cccaaccccc acatctgtga ctgggtggca 14340
tcgacggatc gatccgaagg acccacgatt actgcgccgc tggtcacgca ccacaacggg 14400
atctcggcgg ccgtctcgcg caatgccttg agcagccaca catttcccgc tacaccacga 14460
gagaccaggg gcgaatcgcc accgatcccg tcgttcaagg caatgagcga aacgacaccg 14520
cggaactcgt cccacgggcc tgctgattcg agcagatcgg ccatcgtttg ccgcgtcatc 14580
cgatcggcat caacctctag gcgctgagtc tccagacccg ccgctgtgaa tactccacag 14640
acttcatcgc ctatcgcggc gccggtcggg ctgaccacga gccatttccc cgacacgcgg 14700
acaggtttct cggccagcca cttccagccg attcgataac gccactgatc gatcaccgat 14760
tgggcgcgac gctgctgccg ccacgacgac aacaaaggcg atacttcacc gatggtgcag 14820
ccttcttcca ggcccagcgc ctcccagtcc tcacgagcga ccgcgtccca gaattcaccg 14880
tcgatgccat cgtcggcccc gaccgacgag cccagtgagt cggggtgtcc ggatgcggtg 14940
agggtgtcga gccagtagcg gcgatgctgg aacgcgtagg agggcaacgc aacccgggtc 15000
gcaccacggc cgtcgaagat cggtgtccag tcgaccccgg caccggccac atcgaccatt 15060
gccagcgccg agagcagccg gtccagtccg ccgtcgtcgc ggcgcagtga tccggtcacg 15120
acgatgtcgc gtgttcgtgg gccggtttgt tcgccgagtt cttcgatgcc cggtgtcagc 15180
accgggtgtg gtgaggcctc cacgaacacg gtgtggccct cgctcagcag ggtttgcacg 15240
gtcgcggcga agttcacggt gtcgcgcagg ttgcggaacc agtacccggc atcgagttcg 15300
gtggtgtcga gcagggtgcc ggtcaccgtg gagtagaacg cgattcgtga gggtcgtgga 15360
gtgatggtgg ccagttcctc gagcagtcgt tgccgcaagg actcgacctg tggtgaatgc 15420
gaggcgtagt cgaccgcgat ccggcggacc tgcacaccgt cgctttcgca ggcggccacg 15480
aacgcgtcga gttgttcggt cggacccgac accaccgtgg tggtgggccc gttgaccgcg 15540
gccacggcca gtcccggcat gtcggtcagt cgttgctcga ccagtgtcgt cggcagcagc 15600
acgctggcca ttccgcctcg acccgacagt tcccgcaatg cccgcgaacg cagaatcacc 15660
acacgtgcgg cgtcttcgag cgacaacgcc ccggccacac aggctgcggc gatctcgcct 15720
tgcgaatgcc ccagtaccgc atccgggacg acaccgaagg agcgccatac ctcggcgagg 15780
gaaaccatca ccgcgaacag tgccggctgc acgacatcga cccgctcgag cagcatcgga 15840
tcggccgtgc cctgcagcac gtcgatcaac gaccactcca ccaagggcgc gaacacctcg 15900
gcgcattcga gcactttctg ctcgaacacc accgattcct gcaacagacg cgcacccatt 15960
cccagccact gcgcgccctg accggggaac accaacaccg tcttgcccac gccaccagca 16020
acacccgaga ccacacccgg acccgcgaac atgccggcac ccgcgagcat gcccggttcg 16080
ctatcgatca gtgcttgcag tcgggtcatc aactcctcac ggtcggcacc gaccagcacc 16140
gcccgatgct ccaaccgtgc ccgggtattg atcaaagacc accccacgtc caccgcatcc 16200
agccccggcc gtgtcagcat ccactcctgc aaccggcgcc cctgcgcgag caagccttca 16260
ccggtacgcc ccgacaccat ccacaccact gcatcggact tcaccgcagg caccggatcg 16320
gtgtcgggtg ccgatgattc ggtgtcgggt gtgacgggtg gtgattgttc gaggatgaca 16380
tgcgcgttgg tacccgagat accgaaggag gacaccgccg cacgacgcgg ccggtcggcc 16440
tcgacagtcc aggcacgaga ctcggtcagc aattccaccg cgcccgcggt ccagtccacg 16500
tgggtggtgg gagtgtcgac gtgcaaggtt ttcggcagtg tctcgtggcg catcgcctcg 16560
atcattttga tcaccccggc cacaccggcg gcggcctggg cgtggccgat attcgatttc 16620
agcgacccca gccacaaggg ccggtcgggt tcacggtttt gcccgtaggt ggccagcagg 16680
gcttgggcct cgatcgggtc accgagggtg gtgccggtgc cgtgagcctc aaccacatcc 16740
acttccgtcg ccgctacccc ggcgttggcc agggcacggc ggatcacccg ctgctgggaa 16800
ggaccgttgg gggcggtcaa cccgttggac gcaccgtcct gattgaccgc cgaaccacgc 16860
accaccgcca acacttgatg cccgcggcga cgcgcgtcgg aaagccgctc cacgaccagg 16920
acaccgacgc cttcggacca gccggcaccg tcggcggcct cggcgaacga cttgcaccgg 16980
ccatccggcg ccagtccttt ctggcgggag aactcgacga acgtgtcggg tgtcgacatc 17040
accgtgacac cgccgaccaa cgccatcccg cactcaccgg cccgcacggc ctgcacggct 17100
tgatgcaggg cgaccagcga cgacgagcac gcggtgtcca ccgataccgc cgggccttcc 17160
aatcccagca cgtacgacac ccgacccgag acgaccgaac caccgaccgc gctggcgggg 17220
tagtcgtggt acatcacgcc catgaacacg ccggtgtcgc tgccgcgcaa cgagaccggg 17280
tccaccccgg cgtcttcgag ggcttcccac accgtttcca gcagcaaccg ctgctgcgga 17340
tccatcgcaa ccgcttcccg cgggctgatc ccgaagaacc cggcatcgaa caaccccgca 17400
tcgtgcagga acccaccctc acgcgtgtag gacctacccg ccacacccgg ttcgggatcg 17460
aacaaccccc catcccaacc ccgatccaac ggccactgcg acaccacatc ccggccctgg 17520
gccacgacct gccacagctc ttcgcgtgag gacactcctc cggggaagcg acagcccaca 17580
cccacgatcg cgatcggttc actcgaacga cccaccaata gctcattcga ttgccgtaaa 17640
gccaaattct ccttggcgga gacacgcaga gctttaacaa gttcttctat cgacaaagac 17700
atatctaatt ctttccgatc aattcatgtc gcgctgcatg atgaagcgca cgagactttc 17760
gggatccatt tcatctaccg cgtcttgcgt catatcaccg cccttggcga cagaattact 17820
tctatcctcc atggcaaatc tcaatagatg atcaaagatg cccgccgaac ggagtcgctc 17880
cattgacgca ttggcgagga attcactaac gacagcttcg tcttcacgag aagggccatt 17940
gaaatggtcg ttaccggcta atttctgatg gaggtgctcg gcgacggctc taggggtcgg 18000
gtagtcgaag atcaacgtcg ccgggaccgc gacttcggtg gcggtcttca agcgcttgcg 18060
tgcttcgatg gcggtgaggg agtcgaatcc gagttcctgg aagttgcggt cggcgtcgat 18120
cgcggtggcg tcgtcgtgtc cgagcacgat cgccacctga ccacgcacca actccagcag 18180
caccttgacc tgttcggcct cgtccaggcc cgacaagcgt tgccgcagtt gcgatcccgc 18240
caccccagca cctgcgctac cggtgtttcc ggcggcaaca cgtcgggcgt tgggtaccag 18300
gttgtgcagt atcggtgcca gcatccccgc ccgcgcctgc gcggccagga cggtggtgtc 18360
gaaccgcgct gccagcacgg tagcgtgctc ggcggtcacg gcggtgtcga acatcgccat 18420
accctgtgcc tcggtcaacg ccagcatgcc gccacggctc atgcgcgcgg tatcgccacc 18480
atcgaggtga ccggtcatcc cggtgcccga tccccacaaa ccccacgcga tcgacgtcgc 18540
cgccaaccct cgagcccgcc ggtactcggc caacccgtcg aggaactgat tggccgcggc 18600
atagttgccc tgaccgggcg aacccagcac accggcggtc gaggaataca tgacgaacat 18660
gcccagatcc aggccacggg tcagctcgtg cagataccac gccgcatcgg ctttcacgga 18720
aagcgccgtg tcgaggcgtt gcggtgtcag cgacgcgatc acaccgtcgt cgagcacacc 18780
cgccgcatgc accaccccga ccaaaggatc ctcgtccggt accgcggcca gcagttgctc 18840
gaccccggca cgggtggaca catcacaggc caccaccgcc acccgcgcac ccgaaccggt 18900
caactcctcg accaactcac gcgcaccctc cgcagccaaa ccccggcgag aagccaacac 18960
cagcgaccgc acaccccgca cacccaccag atgccgggcc agaatccgac ccaaaccacc 19020
ggtaccaccg gtcaccacca cagtcccccg accagcaccc gacacggtgt catcggtatc 19080
cgagatatcc gatgccgtgg tcttgtcgtc acgcccggca gtacggacca gtcgcgcgat 19140
gtgggcgata ccgtcacgga tcagaacctg aggctccccc accgccacag ccaacgacac 19200
gatcccggcc acatcaacac catcggagcc ctcgatgtcg gtgtcggcga gcaggatccg 19260
gcccggttcc tccgactgcg ccgaacgcac caaaccccag attgtcgacg ccgccggatc 19320
gacccgatca cccgcggtcg tggtgaccgc cgcccgggtg aggaccagca gcgtgctgga 19380
cgcgaaccgc tgaccagtcg agaactcctg caacacaccc agcacccgat ggctgatcgc 19440
gtgcgccctg accagcacat cggtgtcgat ttcaccgacg gtgttgtttt caccgtcacg 19500
gcagtcgagc accaccaccg gtggcaccgg atcgtcggcg gactcgtgct ccaggttgtt 19560
ccactccacg aattccactt cccgcagttg cgcaggtatg ggtgtgggtg tccagtgcag 19620
ggtgtgcagc cggtctccgc cgtctgctgc ggtggtgagt tggtcgagtt ggacggggcg 19680
cagtgtcagt gatgcgatgg tgaggaccgg tagcccgtcg gggtcggtca cggtcacgct 19740
gaccgtgttg tggccgtggg gggtgattct ggcgtgcacg gtcgaggcgc cgacggcatg 19800
gagctgcacg ccttcccacg cgaacggcag caacggaccc acactggtat cggtatcggt 19860
gtcggtatcg tggccggtgg tcatggcgtg caggacggca tcgagcaggg cggggtgtag 19920
tccgtagtgg tgtgcgtcgc cgccggtttc ggggaggcgg gcttgcacga gccagtcttc 19980
tccggtgcgc cagactgatt ccaggccttg gaacgcggga ccgtagccgt acccgtcctc 20040
ggcgagttgc tggtagaggc tgctggtgtc ggttcgggtg gcgttttgtg gtggccatac 20100
cgccaacccc gtgtccactg gtgttgtggt ggtgaggttt tcgaccggac tctgggtgtg 20160
gagtaggcct tgggcgttca acacccactc ctggtctcgg gtttgggagt acaccgacac 20220
tgtgcgggtg ccggaggttt cgagcgcgcc gacgaggacc tggatggcgg tgccgccctc 20280
ggcgggcagt gtcaggggtg cgagcaacgt cagttcccgt atcgccccgc atccgacctc 20340
gtcaccggca cggatcacca gctcgaccag cccggtcccg ggcagcaaca ccacaccacc 20400
cacggcgtgg tcggccagcc acggatgagt ctgcagcgac agccggccgg tcacggtcac 20460
ggccccggtc tccggggaca ccaccaccgc cccgatcaat ggatgatcga gtcccgacaa 20520
tcccagcgag tcgggatcgg tgttgccggt gatcgtatcg agccagtagc ggcggtgctg 20580
gaaagcgtag gaaggcaacg gaacccgggt cgcgccgcgg ccgtggaaga tcggtgtcca 20640
gtcgattccg gtgccggcca catccagtcg ggccagtgcc gagagcaggg tcgtgtcctc 20700
gacacggtcc ttacgcagca gcgaagccac gacagcctcc acaccgtcca cggtgggttt 20760
ggtatccacg gcgtcgctgg tggtgtgttg gagggtttcg tcgatgagtc cggataggcc 20820
gccgtcgggg cccatgatca cgtatcgggt tgctcccgcc gtggtgaggg tggtgattcc 20880
gtcggcgaac cgtacggtgt tgcggacgtg gtcgacccag tactgcggcg tggtcagcgg 20940
tgaatccgcc tgttgggcat cggtgttcgg gctgtcggtg ttcgggccgg tgagttggcc 21000
gtcgaggttg gagatgatcg ggatgaccgg ttgggtgtag gtgagttcgg tggcgatgcg 21060
ggcgaattcg gccagcatgg gttccatcga ggcggagtgg aacgcgtggg agacccgcag 21120
ccggttgacc tggtatccgg cctgccgtag ttgttgctcg gtggtgtcga tcgcgtgctg 21180
ggggccggcg aggacgatcg attcgggtcc gttgaccgcg gcgatctcga cgacaccgtc 21240
ctcgatgctg tcaccgagca gggtggtgat ctgggtttcg gaggctcgca tggcgagcat 21300
ggctccgccg gtggggagtt gctgcatcag ccgggcgcgg gcggcgacca gtaccgtcgc 21360
gtcctcgagg ctcagtacac cggccacggt cgcggcggcc agttcaccga tggagtgtcc 21420
ggccacgaaa tccggtcgga caccgaagga ttccagcaac cggaacagcg cgataccgac 21480
ggcgaacagt cctgtctggg tgtagagggt cgcttgcagt gcctgctcgt cgacacccca 21540
caccacatcc cgcagcgagc attccagctg ctgttccagc agtgcggtgg tctcgtcgaa 21600
gctggccgcg aacaccggga acgcctcgta caaaccagat cccataccca gcagctgcgc 21660
gccctgaccg gggaacacga acaccgtctt gccgcggtca cgcgaaacac cagccgccac 21720
agcgggatca ccgtcgatca acccctgcag tcgggtcatc aactcctcac ggtcggcacc 21780
gaccagcacc gcccgatgct ccaaccgtgc ccgggtattg atcaaagacc accccacgtc 21840
caccgcatcc agccccgggc gcgccagcat ccactcgtgt agacgccgtc cctgcgcgag 21900
caagccttca ccggtacgcc ccgacaccat ccacaccact gcatcggact tcaccgcagg 21960
caccggatcg gtgtcgggtg ccgatgattc ggtgtcgggt gtgacgggtg gtgattgttc 22020
gaggatgaca tgcgcgttgg tacccgagat accgaaggag gacaccgccg cacgacgcgg 22080
ccggtcggcc tcgacagtcc aggcacgaga ctcggtcagc aattccaccg cgcccgcggt 22140
ccagtccacg tgggtggtgg gagtgtcgac gtgcaaggtt ttcggcagtg tctcgtggcg 22200
catcgcctcg atcattttga tcaccccggc cacaccggcg gcggcctggg cgtggccgat 22260
attcgatttc agcgacccca gccacaaggg ccggtcgggt tcacggtttt gcccgtaggt 22320
ggccagcagg gcttgggcct cgatcgggtc accgagggtg gtgccggtgc cgtgagcctc 22380
aaccacatcc acttccgtcg ccgctacccc ggcgttggcc agggcacggc ggatcacccg 22440
ctgctgggaa ggaccgttgg gggcggtcaa cccgttggac gcaccgtcct gattgaccgc 22500
cgaaccacgc accaccgcca acacttgatg cccgcggcga cgcgcgtcgg aaagccgctc 22560
cacgaccagg acaccgacgc cttcggacca gccggcaccg tcggcggcct cggcgaacga 22620
cttgcaccgg ccatccggcg ccagtccttt ctggcgggag aactcgacga acgtgtcggg 22680
tgtcgacatc accgtgacac cgccgaccaa cgccatcccg cactcaccgg cccgcacggc 22740
ctgcacggct tgatgcaggg cgaccagcga cgacgagcac gcggtgtcca ccgataccgc 22800
cgggccttcc aatcccagca cgtacgacac ccgacccgag acgaccgaac caccgaccgc 22860
gctggcgggg tagtcgtggt acatcacgcc catgaacacg ccggtgtcgc tgccgcgcaa 22920
cgagaccggg tccaccccgg cgtcttcgag ggcttcccac accgtttcca gcagcaaccg 22980
ctgctgcgga tccatcgcaa ccgcttcccg cgggctgatc ccgaagaacc cggcatcgaa 23040
caccccgcat cgtgcaggaa cccaccctca cgcgtgtagg acctacccgc cacacccggt 23100
tcgggatcga acaacccccc atcccaaccc cgatccaacg gccactgcga caccacatcc 23160
cggcctgggc cacgacctgc cacagctctt cgcgtgagga cactcctccg gggaagcgac 23220
agcccacacc cacgatcgcg atcggctcgg ccgaatgacc caccaccaca accggttccg 23280
cggccacgga tgctccggcc agctgctggt gcaggtgttc ggcgacggct ctgggggtcg 23340
gatagtcgaa gatcaacgtc gccgcgaccg ccaccccggt agcagttttg agtcggttgc 23400
gggcttcgac ggcggtgagg gagtcgaatc cgagttcctg gaagttgcgg tcggcgtcga 23460
tcgcggtggc gtcgtcgtgt ccgagcacga tcgccacctg accacgcacc aactccagca 23520
gcaccttgac ctgttcggcc tcgtccaggc ccgacaagcg ttgccgcagt tgcgatcccg 23580
ccaccccagc acctgcgcta ccggtgtttc cggcggcaac acgtcgggcg ttgggtacca 23640
ggttgtgcag tatcggtgcc agcatccccg cccgcgcctg cgcggccagg acggtggtgt 23700
cgaaccgcgc tgccagcacg gtagcgtgct cggcggtcac ggcggtgtcg aacatcgcca 23760
taccctgtgc ctcggtcaac gccagcatgc cgccacggct catgcgcgcg gtatcgccac 23820
catcgaggtg accggtcatc ccggtgcccg atccccacaa accccacgcg atcgacgtcg 23880
ccgccaaccc tcgagcccgc cggtactcgg ccaacccgtc gaggaactga ttggccgcgg 23940
catagttgcc ctgaccgggc gaacccagca caccggcggc cgaggaatac aggacgaaca 24000
tgcccagatc caggccacgg gtcagctcgt gcagatacca cgccgcatcg gctttcacgg 24060
aaagcaccgt gtcgaggcgt tgcggtgtca gcgacgcgat cacaccgtcg tcgagcacac 24120
ccgccgcatg caccaccccg accaaaggat cctcgtccgg taccgcggcc agcagttgct 24180
cgaccccggc acgggtggac acatcacagg ccaccaccgc cacccgcgca cccgaaccgg 24240
tcaactcctc gaccaactca cgcgcaccct ccgcagccaa accccggcga gaagccaaca 24300
ccagcgaccg cacaccccgc acacccacca gatgccgggc cagaatccga cccaaaccac 24360
cggtaccacc ggtcaccacc acagtccccc gaccagcacc cgacacggtg tcatcggtat 24420
ccgagatatc cgatgccgtg gtcttgtcgt cacgcccggc agtacggacc agtcgcgcga 24480
tgtgggcgat accgtcacgg atcagaacct gaggctcccc caccgccaca gccaacgaca 24540
cgatcccggc cacatcaaca ccatcggagc cctcgatgtc ggtgtcggcg agcaggatcc 24600
ggcccggttc ctccgactgc gccgaacgca ccaaacccca gattgtcgac gccgccggat 24660
cgacccgatc acccgcggtc gtggtgaccg ccgcccgggt gaggaccagc agcgtgctgg 24720
acgcgaaccg ctgaccagtc gagaactcct gcaacacacc cagcacccga tggctgatcg 24780
cgtgcgccct ggccaacaca tcggtgccat ttccaccgtc gttgtcggtg tcgtttctac 24840
cgctacggca gtcgagcacc acgatctgtg gtgccgggtg gtcggtggac tcggcttgca 24900
gatcgtccca ctccacgaat gccacgtcct gcggctgtgt gggtgtggct gtccagtgca 24960
gggtgtggag ccggtctccg ctgcctgtcg cgatggccaa ctggtcgagt tggaccggtc 25020
gcagggtcag tgatgcgatg gtgaggaccg gtagcccgtc ggggtcggtc acggtcacgc 25080
ggaccgtgtt gtgcccgagt ggggtgattc tggcgtgcac ggtcgaggcg ccgacggcat 25140
gcaattgcac gccttcccac gcgaacggca gcaacggacc cgctgcggcc tccaccccgg 25200
ccttcccact ggtgtcaaag ccggtggtca tggcgtgcag gacggcatcg agcagggcgg 25260
ggtgtagtcc gtaatggttt gcctggccgc cggtttcggg gagggtggct tgcacgagcc 25320
agtcttctcc ggtgcgccag actgattcca ggccttggaa cgcgggaccg tagccgtacc 25380
cgtcctcggc gagttgctgg tagaggctgc tggtatcggt gtgcaccgca cccgctggtg 25440
gccatgccgc cagcccggca tccacgtcgt gtggtgttgt ggtggtgagg ttttcgaccg 25500
gactctgggt gtggagtagg ccttgggcgt tcaacaccca ctcctggtct cgggtttggg 25560
agtacaccga cactgtgcgg gtgccggagg tttcgagcgc gccgacgagg acctggatgg 25620
cggtgccgcc ctcggcgggc agtgtcaggg gtgcgagcaa cgtcagttcc cgtatcgccc 25680
cgcatccgac ctcgtcaccg gcacggatca ccagctcgac cagcccggtc ccgggcagca 25740
acaccacacc acccacggcg tggtcggcca gccacggatg agtctgcagc gacagccggc 25800
cggtcacggt cacggccccg gtctccggag acaccaccac cgccccgatc aatggatgat 25860
cgagtgctgt ttgccccagc gagtcagggt tcccggatgc ggtgagggtg tcgagccagt 25920
agcggcgatg ctggaacgcg taggagggca acgcaacccg ggtcgcacca cggccgtcga 25980
agatcggtgt ccagtcgacc ccggcaccgg ccacatcgac cattgccagc gccgagagca 26040
gccggtccag tccgccgtcg tcgcggcgca gtgatccggt cacgacgatg tcgcgtgttc 26100
gtgggccggt ttgttcgccg agttcttcga tgcccggtgt cagcaccggg tgtggtgagg 26160
cctccacgaa cacggtgtgg ccctcgctca gcagggtttg cacggtcgcg gcgaagttca 26220
cggtgtcgcg caggttgcgg aaccagtacc cggcatcgag ttcggtggtg tcgagcaggg 26280
tgccggtcac cgtggagtag aacgcgattc gtgagggtcg tggagtgatg gtggccagtt 26340
cctcgagcag tcgttgccgc aaggactcga cctgtggtga atgcgaggcg tagtcgaccg 26400
cgatccggcg gacctgcaca ccgtcgcttt cgcaggcggc cacgaacgcg tcgagctgtt 26460
cggtcagacc cgacaccacc gtggtggtgg gcccgttgac cgcggccacg gccagtcccg 26520
gcatgtcggt cagtcgttgc tcgaccagtg tcgtcggcag cagcacgctg gccattccgc 26580
ctcgacccga cagttcccgc aatgcccgcg aacgcagaat caccacacgt gcggcgtctt 26640
cgagcgacaa cgccccggcc acacaggctg cggcgatctc gccttgcgaa tgccccagta 26700
ccgcatccgg gacgacaccg aaggagcgcc atacctcggc gagggaaacc atcaccgcga 26760
acagtgccgg ctgcacgaca tcgacccgct cgagcagcat cggatcggcc gtgccctgca 26820
gcacgtcgat caacgaccac tccaccaagg gcgcgaacac ctcggcgcat tcgagcactt 26880
tctgctcgaa caccaccgat tcctgcaaca gacgcgcacc cattcccagc cactgcgcgc 26940
cctgaccggg gaacaccaac accgtcttgc ccacgccacc agcaacaccc gagaccacac 27000
ccggacccgc gaacatgccg gcacccgcga gcatgcccgg ttcgctatcg atcagtgctt 27060
gcagtcgggt catcaactcc tcacggtcgg caccgaccag caccgcccga tgctccaacc 27120
gtgcccgggt attgatcaaa gaccacccca cgtccaccgc atccagcccc ggccgtgtca 27180
gcatccactc ctgcaaccgg cgcccctgcg cgagcaagcc ctcaccggta cgccccgaca 27240
acacccacac caacccgcca gcagtgaccg gcaccagcga ctcggtacca ggcaccggtt 27300
caacgaccgc aggtgcttgt tcgaggatga cgtgcgcgtt ggtgcccgag acgccgaagg 27360
aggagatgcc cgcgcgcagg gggtgcccgt tgcgaggcca cggctgttcc tgggtgagca 27420
gctcgactgt gccggtcgtc cagtccacgt ggctggaggg tgtgtcgacg tgcaaggttt 27480
tcggcagtgt ctcgtggcgc atcgcctcga tcattttgat caccccggcc acaccggcgg 27540
cggcctgggc gtggccgata ttcgatttca gcgaccccag ccacaagggc cggtcgggtt 27600
cacggttttg cccgtaggtg gccagcaggg cttgggcctc gatcgggtca ccgagggtgg 27660
tgccggtgcc gtgagcctca accacatcca cttccgtcgc cgctaccccg gcgttggcca 27720
gggcacggcg gatcacccgc tgctgggaag gaccgttggg ggcggtcaac ccgttggacg 27780
caccgtcctg attgaccgcc gaaccacgca ccaccgccaa cacttgatgc ccgcggcgac 27840
gcgcgtcgga aagccgctcc acgaccagga caccgacgcc ttcggaccag ccggcaccgt 27900
cggcggcctc ggcgaacgac ttgcaccggc catccggcgc cagtcctttc tggcgggaga 27960
actcgacgaa cgtgtcgggt gtcgacatca ccgtgacacc gccgaccaac gccatcccgc 28020
actcaccggc ccgcacggcc tgcacggctt gatgcagggc gaccagcgac gacgagcacg 28080
cggtgtccac cgataccgcc gggccttcca atcccagcac gtacgacacc cgacccgaga 28140
cgaccgaacc accgaccgcg ctggcggggt agtcgtggta catcacgccc atgaacacgc 28200
cggtgtcgct gccgcgcaac gagaccgggt ccaccccggc gtcttcgagg gcttcccaca 28260
ccgtttccag cagcaaccgc tgctgcggat ccatcgcaac cgcttcccgc gggctgatcc 28320
cgaagaaccc ggcatcgaac aaccccgcat cgtgcaggaa cccaccctca cgcgtgtagg 28380
acctacccgc cacacccggt tcgggatcga acaacccccc atcccaaccc cgatccaacg 28440
gccactgcga caccacatcc cggccctggg ccacgacctg ccacagctct tcgcgtgagg 28500
acactcctcc ggggaagcga cagcccacac caacaatcgc gatcggctcg gccgaatgac 28560
ccaccaccac aaccggttcc gcggccacgg atgctccggc cagctgctgg tgcaggtgtt 28620
cggcgacggc tctgggggtc ggatagtcga agatcaacgt cgccgcgacc gccaccccgg 28680
tagcagtttt gagtcggttg cgggcttcga cggcggtgag ggagtcgaat ccgagttcct 28740
ggaagttgcg gtcggcgtcg atcgcggtgg cgtcgtcgtg tccgagcacg atcgccacct 28800
gaccacgcac caactccagc agcaccttga cctgttcggc ctcgtccagg cccgacaagc 28860
gttgccgcag ttgcgatccc gccaccccag cacctgcgct accggtgttt ccggcggcaa 28920
cacgtcgggc gttgggtacc aggttgtgca gtatcggtgc cagcatcccc gcccgcgcct 28980
gcgcggccag gacggtggtg tcgaaccgcg ctgccagcac ggtagcgtgc tcggcggtca 29040
cggcggtgtc gaacatcgcc ataccctgtg cctcggtcaa cgccagcatg ccgccacggc 29100
tcatgcgcgc ggtatcgcca ccatcgaggt gaccggtcat cccggtgccc gatccccaca 29160
aaccccacgc gatcgacgtc gccgccaacc ctcgagcccg ccggtactcg gccaacccgt 29220
cgaggaactg attggccgcg gcatagttgc cctgaccggg cgaacccagc acaccggcgg 29280
ccgaggaata caggacgaac atgcccagat ccaggccacg ggtcagctcg tgcagatacc 29340
acgccgcatc ggctttcacg gaaagcaccg tgtcgaggcg ttgcggtgtc agcgacgcga 29400
tcacaccgtc gtcgagcaca cccgccgcat gcaccacccc gaccaaagga tcctcgtccg 29460
gtaccgcggc cagcagttgc tcgaccccgg cacgggtgga cacatcacag gccaccaccg 29520
ccacccgcgc acccgaaccg gtcaactcct cgaccaactc acgcgcaccc tccgcagcca 29580
aaccccggcg agaagccaac accagcgacc gcacaccccg cacacccacc agatgccggg 29640
ccagaatccg acccaaacca ccggtaccac cggtcaccac cacagtcccc cgaccagcac 29700
ccgacacggt gtcatcggta tccgagatat ccgatgccgt ggtcttgtcg tcacgcccgg 29760
cagtacggac cagtcgcgcg atgtgggcga taccgtcacg gatcagaacc tgaggctccc 29820
ccaccgccac agccaacgac acgatcccgg ccacatcaac accatcggag ccctcgatgt 29880
cggtgtcggc gagcaggatc cggcccggtt cctccgactg cgccgaacgc accaaacccc 29940
agattgtcga cgccgccgga tcgacccgat cacccgcggt cgtggtgacc gccgcccggg 30000
tgaggaccag cagcgtgctg gacgcgaacc gctgaccagt cgagaactcc tgcaacacac 30060
ccagcacccg atggctgatc gcgtgcgccc tgaccagcac atcggtgtcg atttcaccga 30120
cggtgttgtt ttcaccgtca cggcagtcga gcaccaccac cggtggcacc ggatcgtcgg 30180
cggactcgtg ctccaggttg ttccactcca cgaattccac ttcccgcagt tgcgcaggta 30240
tgggtgtggg tgtccagtgc agggtgtgca gccggtctcc gccgtctgct gcggtggtga 30300
gttggtcgag ttggacgggg cgcagtgtca gtgatgcgat ggtgaggacc ggttgcccgt 30360
cggggtcggt cacggtcacg cggaccgtgt tgtgcccaag tggggtgatt ctggcgtgca 30420
cggtcgaggc gccgacggca tggagctgca cgccttccca cgcgaacggc agcaacggac 30480
ccacactggt atcggtatcg gtgtcggtat cgtggccggt ggtcatggcg tgcaggacgg 30540
catcgagcag ggcggggtgt agtccgtagt ggtgtgcgtc gccgccggtt tcggggaggc 30600
gggcttgcac gagccagtct tctccggtgc gccagactga ttccaggcct tggaacgcgg 30660
gaccgtagcc gtacccgtcc tcggcgagtt gctggtagag gctgctggtg tcggttcggg 30720
tggcgttttg tggtggccat accgccaacc ccgtgtccac tggtgttgtg gtggtgaggt 30780
tttcgaccgg actctgggtg tggagtaggc cttgggcgtt caacacccac tcctggtctc 30840
gggtttggga gtacaccgac actgtgcggg tgccggaggt ttcgagcgcg ccgacgagga 30900
cctggatggc ggtgccgccc tcggcgggca gtgtcagggg tgcgagcaac gtcagttccc 30960
gtatcgcccc gcatccgacc tcgtcaccgg cacggatcac cagctcgacc agcccggtcc 31020
cgggcagcaa caccacacca cccacggcgt ggtcggccag ccacggatga gtctgcagcg 31080
acagccggcc ggtcacggtc acggccccgg tctccgggga caccaccacc gccccgatca 31140
atggatgatc gagtcccgac aatcccagcg agtcgggatc ggtgttgccg gtgatcgtat 31200
cgagccagta gcggcggtgc tggaaagcgt aggaaggcaa cggaacccgg gtcgcgccgc 31260
ggccgtggaa gatcggtgtc cagtcgattc cggtgccggc cacatccagt cgggccagtg 31320
ccgagagcag ggtcgtgtcc tcgacacggt ccttacgcag cagcgaagcc acgacagcct 31380
ccacaccgtc cacggtgggt ttggtatcca cggcgtcgct ggtggtgtgt tggagggttt 31440
cgtcgatgag tccggatagg ccgccgtcgg ggcccatgat cacgtatcgg gttgctcccg 31500
ccgtggtgag ggtggtgatt ccgtcggcga accgtacggt gttgcggacg tggtcgaccc 31560
agtactgcgg cgtggtcagc ggtgaatccg cctgttgggc atcggtgttc gggctgtcgg 31620
tgttcgggcc ggtgagttgg ccgtcgaggt tggagatgat cgggatgacc ggttgggtgt 31680
aggtgagttc ggtggcgatg cgggcgaatt cggccagcat gggttccatc gaggcggagt 31740
ggaacgcgtg ggagacccgc agccggttga cctggtatcc ggcctgccgt agttgttgct 31800
cggtggtgtc gatcgcgtgc tgggggccgg cgaggacgat cgattcgggt ccgttgaccg 31860
cggcgatctc gacgacaccg tcctcgatgc tgtcaccgag cagggtggtg atctgggttt 31920
cggaggctcg catggcgagc atggctccgc cggtggggag ttgctgcatc agccgggcgc 31980
gggcggcgac cagtaccgtc gcgtcctcga ggctcagtac accggccacg gtcgcggcgg 32040
ccagttcacc gatggagtgt ccggccacga aatccggtcg gacaccgaag gattccagca 32100
accggaacag cgcgataccg acggcgaaca gtcctgtctg ggtgtagagg gtcgcttgca 32160
gtgcctgctc gtcgacaccc cacaccacat cccgcagcga gcattccagc tgctgttcca 32220
gcagtgcggt ggtctcgtcg aagctggccg cgaacaccgg gaacgcctcg tacaaaccag 32280
atcccatacc cagcagctgc gcgccctgac cggggaacac gaacaccgtc ttgccgcggt 32340
cacgcgaaac accagccgcc acagcgggat caccgtcgat caacccctgc agtcgggtca 32400
tcaactcctc acggtcggca ccgaccagca ccgcccgatg ctccaaccgt gcccgggtat 32460
tgatcaaaga ccaccccacg tccaccgcat ccagccccgg gcgcgccagc atccactcct 32520
gcaaccggcg cccctgcgcg agcagcccgt cactgctacg ccccgacaac acccacacca 32580
acccaccgga cttcaccaca ggcaccggat cggtgtcggg tgccgatgat tcggtgtcgg 32640
gcgcctcggt gtcgggtgtt tcggtcacgg gtggtgcttg ttcgaggatg acgtgcgcgt 32700
tggtgcccga gataccgaac gccgataccg ccgcacgacg cggccggtca gcctcgacag 32760
tccaggcacg agactcggtg agcagttcaa ctgcgccgat attccagtcg acgtgtgtgg 32820
tcggtgcgtc cacatgcagt gtccgaggca gtgtctcgtg gcgcatcgcc tcgatcatct 32880
tgatcacccc ggccacaccc gcggcggcct gggtatgacc gatattggac ttcagcgccc 32940
ccagccacaa gggccggtcc ggttcacggc cttgcccata agtggccaac aacgcctgcg 33000
cctcgatcgg gtcaccgaga gtggtgccgg tgccgtgggc ctcgaccaca tccacttccg 33060
tcgccgctac cccggcgttg gccagggcac ggcggatcac ccgctgctgg gaaggaccgt 33120
tgggggcggt caacccgttg gacgcaccgt cctgattgac cgccgaacca cgcaccaccg 33180
ccaacacttg atgcccgcgg cgacgcgcgt cggaaagccg ctccaccacc agaacaccga 33240
cgccttccga ccagcccgtc ccgtcggcgg cctcggcgaa cgacttgcac cggccgtccg 33300
gcgccagccc tttctggcgg gagaactcga cgaacgtgtc gggtaccgac atgaccgtca 33360
cgccgccgac cagcgccatc ccgcactcac cggcgcgcac ggcctgcacg gcttgatgca 33420
gggcgaccag cgaggacgag cacgcggtgt ccaccgacac cgccgggccc tccaacccca 33480
acacatacga cacccggccc gacaccacac tcgaagtcgc gccgatcagc cgatacccct 33540
cgactccggc gtcgccgtcg cttcggccta tgccgtagga ctggtcgctg acgccgatga 33600
acacgccggt ctcgctgccg cgcaacgaga ccgggtccac cccggcgtct tcgagggcct 33660
cccacaccgt ttccagcagc aaccgctgct gcggatccat cgcaaccgcc tcgcgcgggc 33720
tgatcccgaa gaacccggca tcgaacaacc ccgcatcgtg caggaaccca ccctcacgcg 33780
tgtaggactt acccaccaca cccggctcag gatcgaataa ccccgtatcc caaccccgat 33840
cgagtggcca ttgcgacagc acatctcgcc cctcggcaag aacctgccac agatcctccc 33900
gcgaggacac gcctcccggg aaatgacacc ccacgcccac aatcgcgacc ggctcgtggg 33960
cacgatattc aagctcatgc aatctttttc gcgtctctcg gagactgatt gccgtgcgct 34020
tcaaatactc cagggactgc gatccgtctg acataaaatt ctccgatatt gaagttattg 34080
tcgttatatt cttctataac cgactctaca aagctcgata acccctgatc cgaaatatca 34140
gccagggccc ggagtcatcg ccgtccgcac ccaataccta ttccagagag tcttccagac 34200
cttcaatacc gagctcattt tccaatatgc tgtatatctc gtccgcagag gcaaattcaa 34260
tatcgagatt atccatgctt gagcctggcc gcccgtagta ttttgaccag tcttccagga 34320
tcagattgaa atctgcttca agggaattct gagtagcatt actccagacc gtgctgcgga 34380
gggctatctt gagctgctcc agtgccgtag taggagtctc tgatgcgagg actaccggtt 34440
cgattactac aggcaactca gccaatcggt ggtgtaggtc ttcggcgact gctctggggg 34500
tcgggtagtc gaagatcaag gtcgccggga ccgccacccc cgtggcggtc ttcaagcgct 34560
tgcgtgcttc gatggcggtg agggagtcga atccgagttc ctggaagttg cggtcggcgt 34620
cgatcgcggt ggcgtcgtcg tgtccgagca cgatcgccac ctgaccacgc accaactcca 34680
gcagcacctt gacctgttcg gcctcgtcca ggcccgacaa gcgttgccgc agttgcgatc 34740
ccgccacccc agcacctgcg ctaccggtgt ttccggcggc aacacgtcgg gcgttgggta 34800
ccaggttgtg cagtatcggt gccagcatcc ccgcccgcgc ctgcgcggcc aggacggtgg 34860
tgtcgaaccg cgctgccagc acggtagcgt gctcggcggt cacggcggtg tcgaacatcg 34920
ccataccctg tgcctcggtc aacgccagca tgccgccacg gctcatgcgc gcggtatcgc 34980
caccatcgag gtgaccggtc atcccggtgc ccgatcccca caaaccccac gcgatcgacg 35040
tcgccgccaa ccctcgagcc cgccggtact cggccaaccc gtcgaggaac tgattggccg 35100
cggcatagtt gccctgaccg ggcgaaccca gcacaccggc ggtcgaggaa tacatgacga 35160
acatgcccag atccaggcca cgggtcagct cgtgcagata ccacgccgca tcggctttca 35220
cggaaagcac cgtgtcgagg cgttgcggtg tcagcgacgc gatcacaccg tcgtcgagca 35280
cacccgccgc atgcaccacc ccgaccaaag gatcctcgtc cggtaccgcg gccagcagtt 35340
gctcgacccc ggcacgggtg gacacatcac aggccaccac cgccacccgc gcacccgaac 35400
cggtcaactc ctcgaccaac tcacgcgcac cctccgcagc caaaccccgg cgagaagcca 35460
acaccagcga ccgcacaccc cgcacaccca ccagatgccg ggccagaatc cgacccaaac 35520
caccggtacc accggtcacc accacagtcc cccgaccagc acccgacacg gtgtcatcgg 35580
tatccgagat atccgatgcc gtggtcttgt cgtcacgccc ggcagtacgg accagtcgcg 35640
cgatgtgggc gataccgtca cggatcagaa cctgaggctc ccccaccgcc acagccaacg 35700
acacgatccc ggccacatca acaccatcgg agccctcgat gtcggtgtct gcgagcagga 35760
tccggcccgg ttcctccgac tgcgccgaac gcaccaaacc ccacaccgcc gacgccgccg 35820
gatcaacccc atcaccgttc accgcgaccg ctgaccgggt gaggaccagc aaggtgctgg 35880
aagcgaaccg ctgctgaaca gagaagtcct gcaacactgt cagcacccgg tggctgatgg 35940
cgtgcgccct tgccagcacg tcgatgccgt ttccaccgtc gttgtcggtg tcgtttccac 36000
cgctacggca gtcgagcacc acgaccgctg ggaccggttg gtcggtcgac tcggcttcta 36060
gggactcggc ttgcagatcg tcccattcgg caaacgccgc ttcccgcagt tgcgcaggca 36120
tgggtgtggg tgtccagtgc agggtgtgca gccggtctcc gccgtctgct gcggtggtga 36180
gttggtcgag ttggacgggg cgcagtgtca gtgatgcgat ggtgaggacc ggttgcccgt 36240
cggggtcggt cacggtcacg ctgaccgtgt tgtggccgtg gggggtgatt ctggcgtgca 36300
cggtcgaggc gccgacggca tggagctgca cgccttccca cgcgaacggc agcaacggac 36360
ccacactggt atcggtatcg gtgtcggtat cgtggccggt ggtcatggcg tgcaggacgg 36420
catcgagcag ggcggggtgt agtccgtagt ggtgtgcgtc gccgccggtt tcggggaggc 36480
gggcttgcac gagccagtct tctccggtgc gccagactga ttccaggcct tggaacgcgg 36540
gaccgtagcc gtacccgtcc tcggcgagtt gctggtagag gctgctggtg tcggttcggg 36600
tggcgttttg tggtggccat accgccaacc ccgtgtccac tggtgttgtg gtggtgaggt 36660
tttcgaccgg actctgggtg tggagtaggc cttgggcgtt caacacccac tcctggtctc 36720
gggtttggga gtacaccgac actgtgcggg tgccggaggt ttcgagcgcg ccgacgagga 36780
cctggatggc ggtgccgccc tcggcgggca gtgtcagggg tgcgagcaac gtcagttccc 36840
gtatcgcccc gcatccgacc tcgtcaccgg cacggatcac cagctcgacc agcccggtcc 36900
cgggcagcaa caccacacca cccacggcgt ggtcggccag ccacggatga gtctgcagcg 36960
acagccggcc ggtcacggtc acggccccgg tctccgggga caccaccacc gccccgatca 37020
atggatgatc gagtcccgac aatcccagcg agtcgggatc ggtgttgccg gtgatcgtat 37080
cgagccagta gcggcggtgc tggaaagcgt aggaaggcaa cggaacccgg gtcgcgccgc 37140
ggccgtggaa gatcggtgtc cagtcgattc cggtgccggc cacatccagt cgggccagtg 37200
ccgagagcag ggtcgtgtcc tcgacacggt ccttacgcag cagcgaagcc acgacagcct 37260
ccacaccgtc cacggtgggt ttggtatcca cggcgtcgct ggtggtgtgt tggagggttt 37320
cgtcgatgag tccggatagg ccgccgtcgg ggcccatgat cacgtatcgg gttgctcccg 37380
ccgtggtgag ggtggtgatt ccgtcggcga accgtacggt gttgcggacg tggtcgaccc 37440
agtactgcgg cgtggtcagc ggtgaatccg cctgttgggc atcggtgttc gggctgtcgg 37500
tgttcgggcc ggtgagttgg ccgtcgaggt tggagatgat cgggatgacc ggttgggtgt 37560
aggtgagttc ggtggcgatg cgggcgaatt cggccagcat gggttccatc gaggcggagt 37620
ggaacgcgtg ggagacccgc agccggttga cctggtatcc ggcctgccgt agttgttgct 37680
cggtggtgtc gatcgcgtgc tgggggccgg cgaggacgat cgattcgggt ccgttgaccg 37740
cggcgatctc gacgacaccg tcctcgatgc tgtcaccgag cagggtggtg atctgggttt 37800
cggaggctcg catggcgagc atggctccgc cggtggggag ttgctgcatc agccgggcgc 37860
gggcggcgac cagtaccgtc gcgtcctcga ggctcagtac accggccacg gtcgcggcgg 37920
ccagttcacc gatggagtgt ccggccacga aatccggtcg gacaccgaag gattccagca 37980
accggaacag cgcgataccg acggcgaaca gtcctgtctg ggtgtagagg gtcgcttgca 38040
gtgcctgctc gtcgacaccc cacaccacat cccgcagcga gcattccagc tgctgttcca 38100
gcagtgcggt ggtctcgtcg aagctggccg cgaacaccgg gaacgcctcg tacaaaccag 38160
atcccatacc cagcagctgc gcgccctgac cggggaacac gaacaccgtc ttgccgcggt 38220
cacgcgaaac accagccgcc acagcgggat caccgtcgat caacccctgc agtcgggtca 38280
tcaactcctc acggtcggca ccgaccagca ccgcccgatg ctccaaccgt gcccgggtat 38340
tgatcaaaga ccaccccacg tccaccgcat ccagccccgg gcgcgccagc atccactcgt 38400
gtagacgccg tccctgcgcg agcaagcctt caccggtacg ccccgacacc atccacacca 38460
ctgcatcgga cttcaccgca ggcaccggat cggtgtcggg tgccgatgat tcggtgtcgg 38520
gtgtgacggg tggtgattgt tcgaggatga catgcgcgtt ggtacccgag acgccgaagg 38580
cagacacggc cgcacgacgc ggccggtcgg cctcggtggt ccaggtacga gactcggtga 38640
gtagttcgac tgcgcccgat gtccagtcga cgtgggtggt cggtgtgtcg atgtggaggg 38700
ttttcggtag tgtctggtgg cgtattgcct cgatcatttt gatcactccg gctacgccgg 38760
cggcgttttg ggtgtggccg atgttggatt tcagggagcc cagccacagg ggccggtcgg 38820
gttcgcggtt ttgcccgtag gtggccagca gggcttgggc ttcgatcggg tcaccgaggg 38880
tggtgccggt gccgtgggct tcgaccacgt ccacttccgt ggccgagact ccggcgttgg 38940
ccagggcgcg gcggatcacc cgctgctggg aaggaccgtt gggggcggtc agcccgttgg 39000
acgccccgtc ctggttgacc gccgagccac gcaccaccgc caacacttga tgcccgtggc 39060
gacgcgcgtc ggagagtcgt tccacgacga ggatgccgac gccttcggac cagcctgttc 39120
cgtcggcggc ttcggcgaag gatttgcagc gtccgtcggg tgccagtcct ttctggcggg 39180
agaattcgat gaacgcgccg ggtgtggcca gtaccgcgac gccgccgacc agcgccatcc 39240
cgcactcacc ggcgcgcacg gcttgcacgg cttggtgcag ggcgaccagc gaggacgagc 39300
acgcggtgtc caccgatacc gcgggacctt ccagtcccaa cacatacgac actcggcccg 39360
atacgacgct cgtggcgccg ccggtcagcc ggtatccctc gactccggcg tcgccgtcgc 39420
ttcggcctat gccgtaggac tggtcgctga cgccgatgaa tacgccggtg tcgctgccgc 39480
gcaacgagac cgggtccacc ccggcgtctt cgagggcttc ccacaccgtt tccagcagca 39540
accgctgctg cggatccatc gcaaccgcct cgcgcgggct gatcccgaag aacccggcgt 39600
cgaacaagcc ggcgtcgtga aggaacgcgc cgtctcgcgt gtaggactta ccggtcaccc 39660
ccggttcggg atcgaacaac cctgcatccc agccacggtc gaccggccat tgcgacacca 39720
catcccggcc ctgggccacg acctgccaca gctcttcgcg tgaggacacg cctccgggga 39780
agcggcaacc cacacccacg atcgcgatcg gctcggccga atgacccacc accacaaccg 39840
gttcggcggc cacggatgct ccggccagct gctggtgcag gtgttcggcg accgcccggg 39900
gagtcggata gtcgaagatc aacgtcgccg cgaccgccac cccggtggcg gtcttgatcc 39960
ggttgcgggc ttcgacggcg gtgagggagt cgaatcccag gtcccggaag ttgcggtcgg 40020
cgtcgatcgc ggtggcgtcg tcgtgtccga gcacgatcgc gatctgggcg cgcaccagat 40080
ccagcagtac ctggatctgt tcggtttcgc ccagacctga caagcgttgc cgcagtggcg 40140
atcccgtgtt cccgctgtcg atagcactgt cggtgtcggc ttgggcatcg ggtatgtctg 40200
tgatgagcag acgacgccgc gacagcgtgt agtagacggt gaactgttgc cagtcgatgt 40260
cggcgacggt caccagggtt tcgttgttgg cgactgcttg actcaatgct tgcagggcca 40320
gatcgggttc catcaatcgg atcccgagcc gaccgaagta ttcggtggtt gtgccgattt 40380
cggtcatccc gccgcctgac cagccgcccc aggccaggga tgtggcgacc agtccgcgcg 40440
atcgacgatc ctgtgccagg ccgtcgaggt gggcgttgct tgccgcgtac tcggcgagtc 40500
cggtaccgcc ccaggtcgca gcgcccgagg agaacagcac gaaggcgtcc aggcggcgat 40560
cgccgagtag ttcgtccaaa tgctgtgcgc ccccgacttt cgccgcggcc gcggtggtca 40620
tcgattccga gtcgatctcg gtcaacggcc gctgatcgac cactcccgcc gcgtggatga 40680
cggcggtgag cgggatgctg tcgttgtcga tcgtggacaa cacggcggcc acgtcgccgc 40740
gttcggcgat gtcggcggcc atgatcgtca ctcgcccgcc gagggcactc agttcctgct 40800
ccagctcgag cgcgccggga gcctgccgcc cgcgacggct caccagtacc acgtgttctg 40860
cgccgtttgt gagcagccat cgtgcagcat gtgccccgat tcccccggtg ccgccggtga 40920
cgagcactgt cccgcgggga cgccaatgct tgccccgacc tgaattgggc aacggtgccc 40980
gcatcatccg ccgtccgtag acaccgcttt cgcggacggc taactggtct tctccgtcct 41040
cacgtgacag caccgccggc agactgcgca ggatggtgtc gtcccacgcg ttcggaaggt 41100
cgatcagccc tccccacgac tgcgggagtt ccagacccgc tacctgcccc aacccccaca 41160
tctgtgactg ggtggcatcg acggatcgat ccgaaggacc cacgattact gcgccgctgg 41220
tcacgcacca caacgggatc tcggcggccg tctcgcgcaa tgccttgagc agccacacat 41280
ttcccgctac accacgagag accaggggcg aatcgccacc gatcccgtcg ttcaaggcaa 41340
tgagcgaaac gacaccgcgg aactcgtccc acgggcctgc tgattcgagc agatcggcca 41400
tcgtttgccg cgtcatccga tcggcatcaa cctctaggcg ctgagtctcc agacccgccg 41460
ctgtgaatac tccacagact tcatcgccta tcgcggcgcc ggtcgggctg accacgagcc 41520
atttccccga cacgcggaca ggtttctcgg ccagccactt ccagccgatt cgataacgcc 41580
actgatcgat caccgattgg gcgcgacgct gctgccgcca cgacgacaac aaaggcgata 41640
cttcaccgat ggtgcagcct tcttccaggc ccagcgcctc ccagtcctca cgagcgaccg 41700
cgtcccagaa ttcaccgtcg atgccatcgt cggccccgac cgacgagccc agtgagtcgg 41760
ggtgtccgga tgcggtgagg gtgtcgagcc agtagcggcg atgctggaac gcgtaggagg 41820
gcaacgcaac ccgggtcgca ccacggccgt cgaagatcgg tgtccagtcg accccggcac 41880
cggccacatc gaccattgcc agcgccgaga gcagccggtc cagtccgccg tcgtcgcggc 41940
gcagtgatcc ggtcacgacg atgtcgcgtg ttcgtgggcc ggtttgttcg ccgagttctt 42000
cgatgcccgg tgtcagcacc gggtgtggtg aggcctccac gaacacggtg tggccctcgc 42060
tcagcagggt ttgcacggtc gcggcgaagt tcacggtgtc gcgcaggttg cggaaccagt 42120
acccggcatc gagttcggtg gtgtcgagca gggtgccggt caccgtggag tagaacgcga 42180
ttcgtgaggg tcgtggagtg atggtggcca gttcctcgag cagtcgttgc cgcaaggact 42240
cgacctgtgg tgaatgcgag gcgtagtcga ccgcgatccg gcggacctgc acaccgtcgc 42300
tttcgcaggc ggccacgaac gcgtcgagtt gttcggtcgg acccgacacc accgtggtgg 42360
tgggcccgtt gaccgcggcc acggccagtc ccggcatgtc ggtcagtcgt tgctcgacca 42420
gtgtcgtcgg cagcagcacg ctggccattc cgcctcgacc cgacagttcc cgcaatgccc 42480
gcgaacgcag aatcaccaca cgtgcggcgt cttcgagcga caacgccccg gccacacagg 42540
ctgcggcgat ctcgccttgc gaatgcccca gtaccgcatc cgggacgaca ccgaaggagc 42600
gccatacctc ggcgagggaa accatcaccg cgaacagtgc cggctgcacg acatcgaccc 42660
gctcgagcag catcggatcg gccgtgccct gcagcacgtc gatcaacgac cactccacca 42720
agggcgcgaa cacctcggcg cattcgagca ctttctgctc gaacaccacc gattcctgca 42780
acagacgcgc acccattccc agccactgcg cgccctgacc ggggaacacc aacaccgtct 42840
tgcccacgcc accagcaaca cccgagacca cacccggacc cgcgaacatg ccggcacccg 42900
cgagcatgcc cggttcgcta tcgatcagtg cttgcagtcg ggtcatcaac tcctcacggt 42960
cggcaccgac cagcaccgcc cgatgctcca accgtgcccg ggtattgatc aaagaccacc 43020
ccacgtccac cgcatccagc cccggccgtg tcagcatcca ctcctgcaac cggcgcccct 43080
gcgcgagcaa gccttcaccg gtacgccccg acaccatcca caccactgca tcggacttca 43140
ccgcaggcac cggatcggtg tcgggtgccg atgattcggt gtcgggtgtg acgggtggtg 43200
attgttcgag gatgacatgc gcgttggtac ccgagatacc gaaggaggac accgccgcac 43260
gacgcggccg gtcggcctcg acagtccagg cacgagactc ggtcagcaat tccaccgcgc 43320
ccgcggtcca gtccacgtgg gtggtgggag tgtcgacgtg caaggttttc ggcagtgtct 43380
cgtggcgcat cgcctcgatc attttgatca ccccggccac accggcggcg gcctgggcgt 43440
ggccgatatt cgatttcagc gaccccagcc acaagggccg gtcgggttca cggttttgcc 43500
cgtaggtggc cagcagggct tgggcctcga tcgggtcacc gagggtggtg ccggtgccgt 43560
gagcctcaac cacatccact tccgtcgccg ctaccccggc gttggccagg gcacggcgga 43620
tcacccgctg ctgggaagga ccgttggggg cggtcaaccc gttggacgca ccgtcctgat 43680
tgaccgccga accacgcacc accgccaaca cttgatgccc gcggcgacgc gcgtcggaaa 43740
gccgctccac gaccaggaca ccgacgcctt cggaccagcc ggcaccgtcg gcggcctcgg 43800
cgaacgactt gcaccggcca tccggcgcca gtcctttctg gcgggagaac tcgacgaacg 43860
tgtcgggtgt cgacatcacc gtgacaccgc cgaccaacgc catcccgcac tcaccggccc 43920
gcacggcctg cacggcttga tgcagggcga ccagcgacga cgagcacgcg gtgtccaccg 43980
ataccgccgg gccttccaat cccagcacgt acgacacccg acccgagacg accgaaccac 44040
cgaccgcgct ggcggggtag tcgtggtaca tcacgcccat gaacacgccg gtgtcgctgc 44100
cgcgcaacga gaccgggtcc accccggcgt cttcgagggc ttcccacacc gtttccagca 44160
gcaaccgctg ctgcggatcc atcgcaaccg cttcccgcgg gctgatcccg aagaacccgg 44220
catcgaacaa ccccgcatcg tgcaggaacc caccctcacg cgtgtaggac ctacccgcca 44280
cacccggttc gggatcgaac aaccccccat cccaaccccg atccaacggc cactgcgaca 44340
ccacatcccg gccctgggcc acgacctgcc acagctcttc gcgtgaggac actcctccgg 44400
ggaagcgaca gcccacacca acaatcgcga tcggctcggc cgaatgaccc accaccacaa 44460
ccggttccgc ggccacggat gctccggcca gctgctggtg caggtgttcg gcgacggctc 44520
tgggggtcgg atagtcgaag atcaacgtcg ccgcgaccgc caccccggta gcagttttga 44580
gtcggttgcg ggcttcgacg gcggtgaggg agtcgaatcc gagttcctgg aagttgcggt 44640
cggcgtcgat cgcggtggcg tcgtcgtgtc cgagcacgat cgccacctga ccacgcacca 44700
actccagcag caccttgacc tgttcggcct cgtccaggcc cgacaagcgt tgccgcagtt 44760
gcgatcccgc caccccagca cctgcgctac cggtgtttcc ggcggcaaca cgtcgggcgt 44820
tgggtaccag gttgtgcagt atcggtgcca gcatccccgc ccgcgcctgc gcggccagga 44880
cggtggtgtc gaaccgcgct gccagcacgg tagcgtgctc ggcggtcacg gcggtgtcga 44940
acatcgccat accctgtgcc tcggtcaacg ccagcatgcc gccacggctc atgcgcgcgg 45000
tatcgccacc atcgaggtga ccggtcatcc cggtgcccga tccccacaaa ccccacgcga 45060
tcgacgtcgc cgccaaccct cgagcccgcc ggtactcggc caacccgtcg aggaactgat 45120
tggccgcggc atagttgccc tgaccgggcg aacccagcac accggcggcc gaggaataca 45180
ggacgaacat gcccagatcc aggccacggg tcagctcgtg cagataccac gccgcatcgg 45240
ctttcacgga aagcaccgtg tcgaggcgtt gcggtgtcag cgacgcgatc acaccgtcgt 45300
cgagcacacc cgccgcatgc accaccccga ccaaaggatc ctcgtccggt accgcggcca 45360
gcagttgctc gaccccggca cgggtggaca catcacaggc caccaccgcc acccgcgcac 45420
ccgaaccggt caactcctcg accaactcac gcgcaccctc cgcagccaaa ccccggcgag 45480
aagccaacac cagcgaccgc acaccccgca cacccaccag atgccgggcc agaatccgac 45540
ccaaaccacc ggtaccaccg gtcaccacca cagtcccccg accagcaccc gacacggtgt 45600
catcggtatc cgagatatcc gatgccgtgg tcttgtcgtc acgcccggca gtacggacca 45660
gtcgcgcgat gtgggcgata ccgtcacgga tcagaacctg aggctccccc accgccacag 45720
ccaacgacac gatcccggcc acatcaacac catcggagcc ctcgatgtcg gtgtcggcga 45780
gcaggatccg gcccggttcc tccgactgcg ccgaacgcac caaaccccag attgtcgacg 45840
ccgccggatc gacccgatca cccgcggtcg tggtgaccgc cgcccgggtg aggaccagca 45900
gcgtgctgga cgcgaaccgc tgaccagtcg agaactcctg caacacaccc agcacccgat 45960
ggctgatcgc gtgcgccctg gccaacacat cggtgccatt tccaccgtcg ttgtcggtgt 46020
cgtttctacc gctacggcag tcgagcacca cgatctgtgg tgccgggtgg tcggtggact 46080
cggcttgcag atcgtcccac tccacgaatg ccacgtcctg cggctgtgtg ggtgtggctg 46140
tccagtgcag ggtgtggagc cggtctccgc tgcctgtcgc gatggccaac tggtcgagtt 46200
ggaccggtcg cagggtcagt gatgcgatgg tgaggaccgg tagcccgtcg gggtcggtca 46260
cggtcacgcg gaccgtgttg tgcccgagtg gggtgattct ggcgtgcacg gtcgaggcgc 46320
cgacggcatg caattgcacg ccttcccacg cgaacggcag caacggaccc gctgcggcct 46380
ccaccccggc cttcccactg gtgtcaaagc cggtggtcat ggcgtgcagg acggcatcga 46440
gcagggcggg gtgtagtccg taatggtttg cctggccgcc ggtttcgggg agggtggctt 46500
gcacgagcca gtcttctccg gtgcgccaga ctgattccag gccttggaac gcgggaccgt 46560
agccgtaccc gtcctcggcg agttgctggt agaggctgct ggtatcggtg tgcaccgcac 46620
ccgctggtgg ccatgccgcc agcccggcat ccacgtcgtg tggtgttgtg gtggtgaggt 46680
tttcgaccgg actctgggtg tggagtaggc cttgggcgtt caacacccac tcctggtctc 46740
gggtttggga gtacaccgac actgtgcggg tgccggaggt ttcgagcgcg ccgacgagga 46800
cctggatggc ggtgccgccc tcggcgggca gtgtcagggg tgcgagcaac gtcagttccc 46860
gtatcgcccc gcatccgacc tcgtcaccgg cacggatcac cagctcgacc agcccggtcc 46920
cgggcagcaa caccacacca cccacggcgt ggtcggccag ccacggatga gtctgcagcg 46980
acagccggcc ggtcacggtc acggccccgg tctccggaga caccaccacc gccccgatca 47040
atggatgatc gagtgctgtt tgccccagcg agtcagggtt cccggatgcg gtgagggtgt 47100
cgagccagta gcggcgatgc tggaacgcgt aggagggcaa cgcaacccgg gtcgcaccac 47160
ggccgtcgaa gatcggtgtc cagtcgaccc cggcaccggc cacatcgacc attgccagcg 47220
ccgagagcag ccggtccagt ccgccgtcgt cgcggcgcag tgatccggtc acgacgatgt 47280
cgcgtgttcg tgggccggtt tgttcgccga gttcttcgat gcccggtgtc agcaccgggt 47340
gtggtgaggc ctccacgaac acggtgtggc cctcgctcag cagggtttgc acggtcgcgg 47400
cgaagttcac ggtgtcgcgc aggttgcgga accagtaccc ggcatcgagt tcggtggtgt 47460
cgagcagggt gccggtcacc gtggagtaga acgcgattcg tgagggtcgt ggagtgatgg 47520
tggccagttc ctcgagcagt cgttgccgca aggactcgac ctgtggtgaa tgcgaggcgt 47580
agtcgaccgc gatccggcgg acctgcacac cgtcgctttc gcaggcggcc acgaacgcgt 47640
cgagttgttc ggtcagaccc gacaccaccg tggtggtggg cccgttgacc gcggccacgg 47700
ccagtcccgg catgtcggtc agtcgttgct cgaccagtgt cgtcggcagc agcacgctgg 47760
ccattccgcc tcgacccgac agttcccgca atgcccgcga acgcagaatc accacacgtg 47820
cggcgtcttc gagcgacaac gccccggcca cacaggctgc ggcgatctcg ccttgcgaat 47880
gccccagtac cgcatccggg acgacaccga aggagcgcca tacctcggcg agggaaacca 47940
tcaccgcgaa cagtgccggc tgcacgacat cgacccgctc gagcagcatc ggatcggccg 48000
tgccctgcag cacgtcgatc aacgaccact ccaccaaggg cgcgaacacc tcggcgcatt 48060
cgagcacttt ctgctcgaac accaccgatt cctgcaacag acgcgcaccc attcccagcc 48120
actgcgcgcc ctgaccgggg aacaccaaca ccgtcttgcc cacgccacca gcaacacccg 48180
agaccacacc cggacccgcg aacatgccgg cacccgcgag catgcccggt tcgctatcga 48240
tcagtgcttg cagtcgggtc atcaactcct cacggtcggc accgaccagc accgcccgat 48300
gctccaaccg tgcccgggta ttgatcaaag accaccccac gtccaccgca tccagccccg 48360
gccgtgtcag catccactcc tgcaaccggc gcccctgcgc gagcaagccc tcaccggtac 48420
gccccgacaa cacccacacc aacccgccag cagtgaccgg caccagcgac tcggtaccag 48480
gcaccggttc aacgaccgca ggtgcttgtt cgaggatgac gtgcgcgttg gtgcccgaga 48540
cgccgaagga ggagatgccc gcgcgcaggg ggtgcccgtt gcgaggccac ggctgttcct 48600
gggtgagcag ctcgactgtg ccggtcgtcc agtccacgtg gctggagggt gtgtcgacgt 48660
gcaaggtttt cggcagtgtc tcgtggcgca tcgcctcgat cattttgatc accccggcca 48720
caccggcggc ggcctgggcg tggccgatat tcgatttcag cgaccccagc cacaagggcc 48780
ggtcgggttc acggttttgc ccgtaggtgg ccagcagggc ttgggcctcg atcgggtcac 48840
cgagggtggt gccggtgccg tgagcctcaa ccacatcgat cagatcgggc gaaagaccgg 48900
cgttggccaa cgcgcggcgg atcacccgct gctgggaagg accgttgggg gcggtcaacc 48960
cgttggacgc accgtcctga ttgaccgccg aaccccgcac caccgccaac acttgatggc 49020
cgtgtttgcg ggcctccgac agccgctcca cgaccaggat gccgacgcct tcggaccagc 49080
ccgtcccatc agcagcctcc gcgaacgatt tgcaccggcc atccgaggcc agtcctcctt 49140
ggcgggagaa ctccacgaac atgctgggtg tcgacatcac cgtcgcgccg cctaccagtg 49200
ccatcccgca ctcgccgaca cgcacggctc gtacggcttg atgcaacgca accagcgagg 49260
acgagcacgc ggtatccacc gacacggccg ggccctccag ccccaacacg tacgacaccc 49320
ggcccgagac cacactcgag gtcgtgccgg tcagccggta tccctcgaca ccggcgtcgc 49380
cgtcgcttcg gcctatgccg taggactggt cgctgacgcc gatgaacacg ccggtgtcgc 49440
tgccgcgcaa cgagaccggg tccaccccgg cgtcttcgag ggcttcccac accgtttcca 49500
gcagcaaccg ctgctgcgga tccatcgcaa ccgcttcccg cgggctgatc ccgaagaacc 49560
cggcatcgaa caaccccgca tcgtgcagga acccaccctc acgcgtgtag gacctacccg 49620
ccacacccgg ttcgggatcg aacaaccccg catcccagcc acggtcgacc ggccactgcg 49680
acaccacatc ccgaccctgg gccacgacct cccacagctc ttcccgcgag gacacaccgc 49740
caggaagacg acagcccaca cccacaatcg cgatcggctc agacagcacg gcgtctaatc 49800
gctggcgagt ctccagcaat tcggatgtga cccacctcag attggcgaga agcctttcct 49860
cttccataag tctgtccttc ctatcgcaac aagctatacg ccggggtaac ccaactgagt 49920
ttgaataaaa ttcatgaggt catcggccga ggactctttt atgacttcct taactgccga 49980
attacccgag aggtcattgg gcctgacatc ccaatttttt tcaccgctct ttagcaccag 50040
cttcccgaaa ccgtcgagaa ggcttttcct atcagcaaca ctcaaattag cgtcggaaag 50100
cagagactca attcgataca agatttcatc tacgccacgt cgtccactcc ttccggccag 50160
ctgttggtgg aggtgttcgg cgacggctct gggggtcggg tagtcgaagg taagagtggc 50220
cgggatcgcg acttcggtgg cggttttgag tcggttgcgg gcttcgacgg cggtgaggga 50280
gtcgaatccg agttcctgga agttgcggtc ggcgtcgatc gtggtggcgt cgtcgtgtcc 50340
gagcacgatc gccacctggg tctgcaccaa ctccagcagg atcttgacct gttcggcctc 50400
gtccaggccg gacaggcgtt gccgcagttg cgatcccgcc accccgctgg cgctgccggt 50460
gtctccggtg gctgcgcgtc gggcgttggg gacgagctgg tgcagtatcg gtgtcagcat 50520
ccccgcacgg gcctgcgcgg ccagggcggt ggtgtcgaac cgcgctgcga gcacggtggc 50580
gtgctcggcg gtaatggcgg tgtcgaacat cgccatgccc tgctcgtcgg tcatcgccag 50640
atagccgcca cggttcatgc gtgcggtgtc gccaccatcg aggtgaccgg tcatcccggt 50700
cgatgatccc cacaatcccc acgcgatcga ggtcgcgggc aggccttgag cacggcggtg 50760
ttcggccagt ccgtcgagga actgattggc cgcggcatag ttgccctgac cgggcgagcc 50820
cagcacaccg gcggtcgagg aatacatgac gaacatgccc agatccaggc cacgggtcag 50880
ctcgtgcaga taccacgccg catcggcttt cacggaaagc accgtgtcga ggcgttgcgg 50940
tgtcagcgac gcgatcacac cgtcgtcgag cacacccgcc gcatgcacca ccccgaccaa 51000
aggatcctcg tccggtaccg cggccagcag ttgctcgacc ccggcacggg tggacacatc 51060
acaggccacc accgccaccc gcgcacccga accggtcaac tcctcgacca actcacgcgc 51120
accctccgca gcgatacccc ggcgagaagc caacaccagc gaccgcacac cccgcacacc 51180
caccagatgc cgggccagaa tccgacccaa accaccggta ccaccggtca acaccacagt 51240
cccacgaccg gtatccgtcg tgtccgagac cacgggcagt gtcgaaacca cgggtagtgt 51300
gaggacgact ttgccgatgt gacgggtctg gctgaaatat cggaaggcct cgggtgcctg 51360
cctgatatcc catgcttgga tgggaatgga cttgagttcg ccgcgatcga aactggcggt 51420
gagctcggag agcatctgtt ggatacggtc ttccccggcc tcgaacatgt cgaaggcttg 51480
atagatcacc ccgggatact gggtagtgat cgcatcgctg tcacgcttgt cggtcttgcc 51540
catctcgagg aagtggcccc cgcgcggtag cagtcgcagc gacgcgtcga cgaaatcccc 51600
ggccaatgag ttcaagacga tgtccacccc gtgaccgtcg gtggccgaca agaattcgtc 51660
ttcgaagctc aacgtccgcg aattcgcgat gtgctggtcg tcgaaaccta tgccccgcaa 51720
cacatcccac ttgccactgc tggcagttgc gaagacttcc aggccccagc aacgtgccag 51780
ttggatcgcg gccattccta cgccgccggt cgccgcatgc accagcaggc gatcccccgg 51840
ctttgcatga gccaggtcca tgagcccgta gtaggccgtc aagaacacga ccggtaccgc 51900
tgcggcttgg gcgaatgacc accccgccgg catgtgcacg accagtcgat ggtcgacgat 51960
cacgaccggt cccactccac gaccggccaa ccccatgacc cggtcgccga cgctcagacc 52020
ctcgacgtct gcaccgacct cgacaatgac tcctgccagc tcggcaccca ccacagcgtc 52080
gtcatcgggg tacatgccca gcgcgatcag tacatcccgg aagttcaacc cggcagcccg 52140
cacggagatc cgcacctgcc ccgctgccag tggctgctcc gccaaggggt gactcaccag 52200
ggccaaacca tccagcacac ccttatccac agcagcgagc tgccatgccc ccgcgtcggg 52260
gatcgcaaga gtcccgcgcc cgggaccacg ggtcagtcgc gcgatgtggg cgataccgtc 52320
acggatcaga acctgaggct cccccaccgc cacagccaac gacacgatcc cggccacatc 52380
aacaccatcg gagccctcga tgtcggtgtc ggcgagcagg atccggcccg gttcctccga 52440
ctgcgccgaa cgcaccaaac cccagattgt cgacgccgcc ggatcgaccc gatcacccgc 52500
ggtcgtggtg accgccgccc gggtgaggac cagcagcgtg ctggacgcga accgctgacc 52560
agtcgagaac tcctgcaaca cacccagcac ccgatggctg atcgcgtgcg ccctgaccag 52620
cacatcggtg ccgttttcct tatcacggca gtccagcact acgacctgtg gtgccggttg 52680
gtcggcggac tcagggtcgg tggactcggc ttgcagatcg gtccactcgg tatatgccac 52740
gtcctgcggc tgcctagatg tggtcgtggg tgtccagtgc acggtgagca gccgatcccc 52800
ggtacctgcc gcggctgtca gctggtctag ttgtgctgga cgcagtgtca gcgatccgat 52860
cgtcaggacc ggtcgacccg caggatcggt caccgtcacc tgtaccgtgt tgtggccgtg 52920
gggggtgatt ctggcgcgca cggtcgaggc cccgactgca tgcaactgca ctgcttccca 52980
tgcgaagggc agcaacggac ccgcactggt gtcgtggccg gtggtcatgg cgtgcaggac 53040
ggtatcgagc agggcggggt gtagtccgta gtggtgtgcg tcaccgccgg tttcggggag 53100
ggtggcttgc acgagccagt cctgtccggt gcgccagacc gattccaagc cttggaatgc 53160
gggaccgtag ccgtacccgt cctcggccag ttgctggtag aggctgctgg tgtcggcttg 53220
caccgcgccc gctggtggcc atgccgccaa ctcggtgtcc acgtcgtgtg gtgttgtggc 53280
gctctgggtt tggagcaggc cttgggcgtt caacacccac tcctggtctc gggtttgcga 53340
gtacaccgac accgtccgag tcccggacga ttcgagcgca ccgacgagga cctgcacggc 53400
ggtcccgccg tcggtcggca gtgtcagggg cgcgagcaac gtcagttccc gtaccacccc 53460
gcacccggcc tcgtcaccgg cacggatcac cagttccacc agcccggtcc ccggaaccaa 53520
caccacaccg cccacggcat ggtcggccag ccacggatgg gtgtgcagcg agagccggcc 53580
ggtcacggtc acggccccgg tctccgggga caccaccacc gccccgatca atggatgatc 53640
gagtcccgac aatcccagcg agtcgggatc ggtgttgccg gtgatcgtat cgagccagta 53700
gcggcggtgc tggaaagcgt aggaaggcaa cggaacccgg gtcgcgccgc ggccgtggaa 53760
gatcggtgtc cagtcgattc cggtgccggc cacatccagt cgggccagtg ccgagagcag 53820
ggtcgtgtcc tcgacacggt ccttacgcag cagcgaagcc acgacagcct ccacaccgtc 53880
cacggtgggt ttggtatcca cggcgtcgct ggtggtgtgt tggagggttt cgtcgatgag 53940
tccggatagg ccgccgtcgg ggcccatgat cacgtatcgg gttgctcccg ccgtggtgag 54000
ggtggtgatt ccgtcggcga accgtacggt gttgcggacg tggtcgaccc agtactgcgg 54060
cgtggtcagc ggtgaatccg cctgttgggc atcggtgttc gggctgtcgg tgttcgggcc 54120
ggtgagttgg ccgtcgaggt tggagatgat cgggatgacc ggttgggtgt aggtgagttc 54180
ggtggcgatg cgggcgaatt cggccagcat gggttccatc gaggcggagt ggaacgcgtg 54240
ggagacccgc agccggttga cctggtatcc ggcctgccgt agttgttgct cggtggtgtc 54300
gatcgcgtgc tgggggccgg cgaggacgat cgattcgggt ccgttgaccg cggcgatctc 54360
gacgacaccg tcctcgatgc tgtcaccgag cagggtggtg atctgggttt cggaggctcg 54420
catggcgagc atggctccgc cggtggggag ttgctgcatc agccgggcgc gggcggcgac 54480
cagtaccgtc gcgtcctcga ggctcagtac accggccacg gtcgcggcgg ccagttcacc 54540
gatggagtgt ccggccacga aatccggtcg gacaccgaag gattccagca accggaacag 54600
cgcgataccg acggcgaaca gtcctgtctg ggtgtagagg gtcgcttgca gtgcctgctc 54660
gtcgacaccc cacaccacat cccgcagcga gcattccagc tgctgttcca gcagtgcggt 54720
ggtctcgtcg aagctggccg cgaacaccgg gaacgcctcg tacaaaccag atcccatacc 54780
cagcagctgc gcgccctgac cggggaacac gaacaccgtc ttgccgcggt cacgcgaaac 54840
accagccgcc acagcgggat caccgtcgat caacccctgc agtcgggtca tcaactcctc 54900
acggtcggca ccgaccagca ccgcccgatg ctccaaccgt gcccgggtat tgatcaaaga 54960
ccaccccacg tccaccgcat ccagccccgg gcgcgccagc atccactcgt gtagacgccg 55020
tccctgcgcg agcaagcctt caccggtacg ccccgacacc atccacacca ctgcatcgga 55080
cttcaccgca ggcaccggat cggtgtcggg tgccgatgat tcggtgtcgg gtgtgacggg 55140
tggtgattgt tcgaggatga catgcgcgtt ggtacccgag ataccgaagg aggacaccgc 55200
cgcacgacgc ggccggtcgg cctcgacagt ccaggcacga gactcggtca gcaattccac 55260
cgcgcccgcg gtccagtcca cgtgggtggt gggagtgtcg atgtgcaagg ttttcggcag 55320
tgtctcgtgg cgcatcgcct cgatcatttt gatcaccccg gccacaccgg cggcggcctg 55380
ggcgtggccg atattcgatt tcagcgaccc cagccacaag ggccggtcgg gttcacggtt 55440
ttgcccgtag gtggccagca gggcttgggc ctcgatcggg tcaccgaggg tggtgccggt 55500
gccgtgagcc tcaaccacat cgatcagatc gggcgaaaga ccggcgttgg ccaacgcgcg 55560
gcggatcacc cgctgctggg aaggaccgtt gggggcggtc aacccgttgg acgcaccgtc 55620
ctgattgacc gccgaaccac gcaccaccgc caacacttga tgcccgcggc gacgcgcgtc 55680
ggaaagccgc tccacgacca ggacaccgac accttccgac cagcctgtcc catcagcagc 55740
ctcggcgaag gatttgcagc gcccgtcggc cgccagtcct ttctggcggg agaactcgat 55800
gaacgtgtcg ggtgtcgcca tgaccatcac accacccacc agcgccatcc cgcactcacc 55860
ggcacgcacg gcctgcacgg cttggtgcaa cgcgaccagc gacgacgagc acgcggtgtc 55920
caccgacacc gccggacctt ccagtcccag cacatacgac acccggcccg atacgacgct 55980
cgtggcgccg ccggtcagcc ggtatccctc gactccggcg tcgccgtcgc ttcggcctat 56040
gccgtaggac tggtcgctga cgccgatgaa tacgccggtg tcgctaccgc gcaacgagac 56100
cgggtccacc ccggcgtctt cgagggcttc ccacaccgtt tccagcagca accgctgctg 56160
cggatccatc gcaaccgctt cccgcgggct gatcccgaag aacccggcat cgaacaaccc 56220
cgcatcgtgc aggaacccac cctcacgcgt gtaggactta cccgccacac ccggttcggg 56280
atcgaacaac cccgcatccc aaccccgatc caacggccac tgcgacacca catcccggcc 56340
ctgggccacg acctgccaca gctcttcccg cgaggacaca ccgcccggga gacgacatcc 56400
cacacccacg atcgctatcg gctcggctga atcgcccacc acatccggtt cagcgacaac 56460
aggtgctccg gccagctgct ggtacagatg ttcagcaacc gccctgggag tcgggtaatc 56520
aaaagtaaga gtggcttgga ctgcgactcc ggtggtggtc ttgatccggt tgcgggcttc 56580
gaccgctgtc agggaatcga atccgagttc ctggaagttg cggtcggcat cgatcgcggt 56640
tatgtcgtcg tgcccgagca cgatcgctac gtccgcgcgc acaagatcca gcagtagctc 56700
gatctgttcg gtgtccttca ggcccgacaa gcgttgctgc agttgcgatc ccggcacccc 56760
gccgctggta ttgccgaccg cccggcgggc accgggaacc aggttgttca gtatcggcgc 56820
caataccccg gccctggcct gcgccgcaag cgcggtgatg tcgaaccgca ccgccaacac 56880
cgaggactgg tcttgagcga cagcggcgtt gaacatcgcc atgccctggt catcggtcaa 56940
cgccagcatg ccgccacggt tcatgcgtgc ggtgtcgcca ccatcgaggt gaccggtcat 57000
cccggtcgat gatccccaca aaccccacgc gatcgacgtc gccgccaacc ctcgagcccg 57060
ccggtactcg gccaacccgt cgaggaactg attggccgcc gcgtagttgc cctgaccggg 57120
cgaacccagc acaccggtga ccgaggagta catgacgaac atcgccacat ccagctcacg 57180
agtcagctcg tgcagatacc aggccgcatc cgccttcgcg gaaagtaccg tgtccaggcg 57240
ttgtggtgtc aacgatgcga tcacaccgtc gtcgagcaca cccgccgcat gcaccacccc 57300
gaccaaagga tcctcgtccg gtaccgcggc cagcagttgc tcgaccccgg cacgggtgga 57360
cacatcacag gccaccaccg ccacccgcgc acccgaaccg gtcaactcct cgaccaactc 57420
acgcgcaccc tccgcagcca aaccccggcg agaagccaac accagcgacc gcacaccccg 57480
cacacccacc agatgccggg ccaggatccg acccaaacca ccggtaccac cggtcaccac 57540
cacagtccca ccggcacccg acacggtcgc aacggcatcc gagatatccg atgccgtgcc 57600
cgtgtcgctg cgctcgggta cacggaccag tcgcgcggta tgcgcgatac cgtcgcggat 57660
cagtacctga ggctccccca ccgccacagc caacgacacg atctcggcca gatcggtgcc 57720
atcgatgccg tggatatctg tatcgaggag caagatccgg cccggatcct cgatttgggc 57780
cgaacgcacc aagccccaca ctgccgacgc cgccggatca accgccaggt cgatgcgttc 57840
agcagcctca tcgatgcggt cgcccgtcac agagaccgct gcccgggtga ggatcagcag 57900
cgtgctggaa gcgaaccgct gctgggtcga gaattcctgc agcacgccca gcacccgttg 57960
accggtggcg cgtgtcttgg ccagcatgtc ggccccgtca ccaacgctgg tgtcatgttc 58020
actctcacgg cagtcgagca ccaccaccgg aggcgtcggc ggccaaccga tgggttctgg 58080
gtcgagggac tccagttgca ggtcggtcca ttcggcaaat gacacttccc gcagttgcac 58140
ggttgtaggt gtccagtgca gggtgtgcag gcggtcctcg gtcgctgtcg cagtgaccag 58200
ctgggcgaat tggaccgagc gcagtgtcag cgatccgatg gtgaggaccg gtcgaccgtc 58260
gaggtcgaac acggttatgc gtaccgtgtt gtggccgtgg ggggtgattc tggcgcgcac 58320
ggtcgaggcc ccgactgcat gcaactgcac tgcttcccat gcgaagggca gcaacggacc 58380
cgcactggtg tcgtggccgg tggtcatggc gtgcaggacg gcatccagca gggcggggtg 58440
tagtccgtag tggtgtgcgt cgccgccggt ttcggggagg cgggcttgca cgagccagtc 58500
ttctccggtg cgccagactg attccaggcc ttggaacgcg ggaccgtagc cgtacccgtc 58560
ctcggcgagt tgctggtaga ggctgctggt gtcggttcgg gtggcgtttt gtggtggcca 58620
taccgccaac cccgtgtcca ctggtgttgt ggtggtgagg ttttcgaccg gactctgggt 58680
gtggagtagg ccttgggcgt tcaacaccca ctcctggtct cgggtttggg agtacaccga 58740
cactgtgcgg gtgccggagg tttcgagcgc gccgacgagg acctggatcg cggtgccgcc 58800
ctcggcgggc agtgtcaggg gcgcgagcaa cgtcagttcc cgtatcgccc cgcatccgac 58860
ctcgtcaccg gcacggatca ccagctcgac cagcccggtc ccgggcagca acaccacacc 58920
acccacggcg taatcggcca gccacggatg agtctgcagc gacagccggc cggtcacggt 58980
cacggccccg gtctccgggg acaccaccac cgccccgatc aatggatgat cgagtcccga 59040
caatcccagc gagtcgggat cggtgttgcc ggtgatcgta tcgagccagt agcggcggtg 59100
ctggaaagcg taggaaggca acacaacccg gctcgcgcca cggccatcga agaccggtgt 59160
ccagtcgatc ccggcaccgg ccacatccac cactgccagt gccgacaaga aggtcgtgtc 59220
ctcgacacga tccctacgca gcaaagacgt caccaccgtg tcggtcgtgt cggtgtcgct 59280
ggattggagg gtttcgtcga tgagtccgga taggccgccg tcggggccca tgatcacgta 59340
tcgggttgct cccgccgtgg tgagggtggt gattccgtcg gcgaaccgta cggtgttgcg 59400
gacgtggtcg acccagtact gcggcgtggt cagcggtgaa tccgcctgtt gggcatcggt 59460
gttcgggctg tcggtgttcg ggccggtgag ttggccgtcg aggttggaga tgatcgggat 59520
gaccggttgg gtgtaggtga gttcggtggc gatgcgggcg aattcggcca gcatgggttc 59580
catcgaggcg gagtggaacg cgtgggagac ccgcagccgg ttgacctggt atccggcctg 59640
ccgtagttgt tgctcggtgg tgtcgatcgc gtgctggggg ccggcgagga cgatcgattc 59700
gggtccgttg accgcggcga tctcgacgac accgtcctcg atgctgtcac cgagcagggt 59760
ggtgatctgg gtttcggagg ctcgcatggc gagcatggct ccgccggtgg ggagttgctg 59820
catcagccgg gcgcgggcgg cgaccagtac cgtcgcgtcc tcgaggctca gtacaccggc 59880
cacggtcgcg gcggccagtt caccgatgga gtgtccggcc acgaaatccg gtcggacacc 59940
gaaggattcc agcaaccgga acagcgcgat accgacggcg aacagtcctg tctgggtgta 60000
gagggtcgct tgcagtgcct gctcgtcgac accccacacc acatcccgca gcgagcattc 60060
cagctgctgt tccagcagtg cggtggtctc gtcgaagctg gccgcgaaca ccgggaacgc 60120
ctcgtacaaa ccagatccca tacccagcag ctgcgcgccc tgaccgggga acacgaacac 60180
cgtcttgccg cggtcacgcg aaacaccagc cgccacagcg ggatcaccgt cgatcaaccc 60240
ctgcagtcgg gtcatcaact cctcacggtc ggcaccgacc agcaccgccc gatgctccaa 60300
ccgtgcccgg gtattgatca aagaccaccc cacgtccacc gcatccagcc ccgggcgcgc 60360
cagcatccac tcgtgtagac gccgtccctg cgcgagcaag ccttcaccgg tacgccccga 60420
caccatccac accactgcat cggacttcac cgcaggcacc ggatcggtgt cgggtgtctc 60480
ggtgaccggt ggtgcttgtt cgaggatgac atgcgcgttg gtacccgaga taccgaagga 60540
ggacaccgcc gcacgacgcg gccggtcggc ctcgacagtc caggcacgag actcggtcag 60600
caattccacc gcgcccgcgg tccagtccac gtgggtggtg ggagtgtcga cgtgcaaggt 60660
tttcggcagt gtctcgtggc gcatcgcctc gatcattttg atcaccccgg ccacaccggc 60720
ggcggcctgg gcgtggccga tattcgattt cagcgacccc agccacaagg gccggtcggg 60780
ttcacggttt tgcccgtagg tggccagcag ggcttgggcc tcgatcgggt caccgagggt 60840
ggtgccggtg ccgtgagcct caaccacatc gatcagatcg ggcgaaagac cggcgttggc 60900
caacgcgcgg cggatcaccc gctgctggga aggaccgttg ggggcggtca acccgttgga 60960
cgcaccgtcc tgattgaccg ccgaaccccg caccaccgcc aacacttgat ggccgtgttt 61020
gcgggcctcc gacagccgct ccacgaccag gatgccgacg ccttcggacc agcccgtccc 61080
atcagcagcc tccgcgaacg atttgcaccg gccatccgag gccagtcctc cttggcggga 61140
gaactccacg aacatgctgg gcgtcgacat caccgtgaca ccgcctacca gtgccatccc 61200
gcactcgcca gcacgcacgg cttgcacggc ttgatgcagg gcgaccagcg aggacgagca 61260
cgcggtgtcc accgataccg cgggaccttc cagccccagc acatacgaca cccggcccga 61320
gaccacactc gaggtcgtgc cggtcagccg atatccctcg aaactgtcat ccacctcacc 61380
tcgaccaatt ccatacgcgt gatccgttac gccgatgaat acaccggtgt cgctgccgcg 61440
caacgagacc gggtccaccc cggcgtcttc gagggcttcc cacaccgttt ccagcagcaa 61500
ccgctgctgc ggatccatcg caaccgcttc ccgcgggctg atcccgaaga acccggcatc 61560
gaacaacccc gcatcgtgca ggaacccacc ctcacgcgtg taggacctac ccgccacacc 61620
cggttcggga tcgaacaacc ccgcatccca accccgatcc aacggccact gcgacaccac 61680
atcccggccc tgggccacga cctgccacag ctcttcgcgt gaggacactc ctccggggaa 61740
gcgacagccc acaccaacaa tcgcgatcgg ctcggccgaa tgacccatca ccacaactgg 61800
ttccgcggcc acggatgctc cggccagctg ctggtgcagg tgttcggcga ccgccctggg 61860
agtcgggtag tcgaagatca gggtggccga catcgccacc ccagtcagcc tatttagtcg 61920
gcgacggaac tctacggctc ccagtgaatc caatcccaaa tctcgaaaag aattatcttc 61980
gtcaaactcg tcgggactaa atttcccact aacagccgcg agttgatcgc gcaccgcatt 62040
ctttatcatc aaccactggt cgcccaaact tagctgggcc aactttctcg caaaaagatc 62100
ctgaacttct tccgaagtaa ttctcgacga agacgatgcg tgccgcccgc cttgctgtcg 62160
atcaaaggca taggtaggaa gctgtacccg cgatgcgccg cggccgtgga agatcggtgt 62220
ccagtcgatt ccggtgccgg ccacatccag tcgggccagt gccgagagca gggtcgtgtc 62280
ctcgacacgg tccttacgca gcagcgaagc cacgacagcc tccacaccgt ccacggtggg 62340
tttggtatcc acggcgtcgc tggtggtgtg ttggagggtt tcgtcgatga gtccggatag 62400
gccgccgtcg gggcccatga tcacgtatcg ggttgctccc gccgtggtga gggtggtgat 62460
tccgtcggcg aaccgtacgg tgttgcggac gtggtcgacc cagtactgcg gcgtggccaa 62520
cgccgagggt tgatcgtcgg gggttccggt gagttggccg tcgaggttgg agatgatcgg 62580
gatgaccggt tgggtgtagg tgagttcggt ggcgatgcgg gcgaattcgg ccagcatggg 62640
ttccatcgag gcggagtgga acgcgtggga gacccgcagc cggttgacct ggtatccggc 62700
ctgccgtagt tgttgctcgg tggtgtcgat cgcgtgctgg gggccggcga ggacgatcga 62760
ttcgggtccg ttgaccgcgg cgatctcgac gacaccgtcc tcgatgctgt caccgagcag 62820
ggtggtgatc tgggtttcgg aggctcgcat ggcgagcatg gctccgccgg tggggagttg 62880
ctgcatcagc cgggcgcggg cggcgaccag taccgtcgcg tcctcgaggc tcagtacacc 62940
ggccacggtc gcggcggcca gttcaccgat ggagtgtccg gccacgaaat ccggtcggac 63000
accgaaggat tccagcaacc ggaacagcgc gataccgacg gcgaacagtc ctgtctgggt 63060
gtagagggtc gcttgcagtg cctgctcgtc gacaccccac accacatccc gcagcgagca 63120
ttccagctgc tgttccagca gtgcggtggt ctcgtcgaag ctggccgcga acaccgggaa 63180
cgcctcgtac aaaccagatc ccatacccag cagctgcgcg ccctgaccgg ggaacacgaa 63240
caccgtcttg ccgcggtcac gcgaaacacc agccgccaca gcgggatcac cgtcgatcaa 63300
cccctgcagt cgggtcatca actcctcacg gtcggcaccg accagcaccg cccgatgctc 63360
caaccgtgcc cgggtattga tcaaagacca ccccacgtcc accgcatcca gccccggccg 63420
tgtcagcatc cactcctgca accggcgccc ctgcgcgagc agcccgtcac tgctacgccc 63480
cgacaacacc cacaccaacc catgagtggt gacagactct tgcaccagcc cgaaagcctg 63540
attggtgcgc gaatcagtgg ctccggctgc attgcccgct aggacaatgt gacaattcgt 63600
ccctcccata ccgaacgagg acaccccagc gacggccctg ctgtgatctg cagacaaatc 63660
caagcactga cgcaatacag ccacgccgga atcttcgata gaaatactcc ggttgggtgt 63720
ctcatagttc aaactcgaag gaattgtttc atgccacaac ccaagtgcta ctttaataaa 63780
gcccgctatc cccgccgccg cgtctaaatg cccaatattt gtctttaccg agccaacctg 63840
caaccatgct tcgtccgatc cgcgcccctt accaaacaca gaaactaggg ctgcggcctc 63900
cgccaagtcc cctgcaacag ttccggttcc atgcagctcg acataatgga cttgatcacc 63960
accgacgttg gcgcgccgaa gggcagacaa gaggagctct tcttgcgctg cgacactcgg 64020
agccatgaaa tccgtactgg caccatcttg gttgatcgcc ccaccaagaa taactgagta 64080
gatcctatca ccgtcatgca aagcctgggc taatggtttc aacaaaacca tcccaccacc 64140
ttcaccgggc acgaatccgt cagcgttcgc gtcaaaggcc ctgcattcgc cctgattgga 64200
gattacccca agactgtcca tcatctgcac atgactgccg gttaccatca agctaacccc 64260
gccagcgatc gcagcactac attcaccgcg acgtatactc tcacacgcca aatggacagc 64320
aaccagcgat gacgactgcg cactgtccag aaccaaactt ggacctgtaa agtcaaagaa 64380
gtgcgagacc ctattggcga tcatcgcacg gtgggtgcca gttgccgaat acgcgtccga 64440
atcgccgact cttatcgact caccaaattc gtcgcgcgat gcaccaacaa acacacctac 64500
agcgcggcgt ttcaatcttt ctggtgagat acctgcatct tcgatcgctt cccaacaaag 64560
ctcaagcacg agtaattgac gaggatctat cgtcgccgcc tcaaccggac ggatgtcaaa 64620
gaactcagca tcgaagccgt cgatgctact caaataccct gcaggtttgg gaaatatgta 64680
tctaccgcca gcgctgatag agctcggctg attaacggcg ctttgaccat ttaccaaaag 64740
ctgccagtac ttgtcgatgt tgtcagcacc cggaagttta cacgacaaac ccacaaccgc 64800
aatatcatca gcgctagcgc tatacatgag aaaagcctcc ccattcaaaa attctataag 64860
cagaatgatc ggcccatacc ggggtctgca atgaccgcgc cgagatccgc accgctagtc 64920
ggctgccagt gaaatgttcg tcagtgaaac actgctacag atagacggtt gcctaggccc 64980
tgagcctgtg cgccggctcc gttacagatc acgaccgctg gacaccgctc cccgaaccac 65040
gatctcaagc acgcccacgg ccgcccattt gggggtctct tccacgtcgc acaccagcga 65100
gcgattggga accgctccca tccgtggcgc cgaacccaca ccaacgtcag ataatcctga 65160
atagaggggc gacgatccag ggcggggaca cccgtccagg tcggtccaat caagcgagag 65220
catcgcgccg gtccgagtgc ccaggatcac cgtcctttgt atgcggcggt gcttgatcat 65280
gggcccgacc cacaacatcg cgataccggt ttaacaaact cggcgccagt caagattctc 65340
cgacgggaac gtgcactgca gctaccgcag acgcagctcg aactagccgc cgactttgac 65400
atctccaacc gattcgtcgc agacttcacc atcggaagta ccccgcttga tagcagctgg 65460
acgcttgagc ataaccgaac tccgggcctc ggacgctaac caaattggac tgcgatatcc 65520
gaacctcttc gaagtcccat acacgaattg cataccacaa caaatatatc ccccgatgac 65580
taaagacatc gccaatggac gatgttcacg agtacatcat caccggcacc acgcgtcaat 65640
agctgcgcta acgatctacg gccacacaga gttagctgag tcagtcttcg aacatgcaca 65700
cagacacccg aacacaacgg gatcgcactc cgactcgcaa cccatcgccc attagcgaca 65760
ccgctgatag aacgcgtttt ctggccaagc gtccagtatt gccgcaccgc gtcaccatgg 65820
agcgcacgag gatgggcaag cccggttggc tggcatccag attaagagct gctacagatt 65880
tgccatgcgc acgaacctcg tgatgtgaca tgagtcatag tgccagaggg gaccatcatc 65940
ggtcatagtg ccagaggcgg ggggtgatga gatgcactga ggcagaacag aacccggagc 66000
atcgtgctcg aaagctgcag cacccgttta ctcttgcgtt ttgcgccttc gagaactgtt 66060
tcagcgcagc catttacgat cacgacggga actccgccac gggaatcaac tgctcaagga 66120
acaacaccag tgactcggac cagatgcgtc cctcgtagcg ggccacttcg ccaccggcaa 66180
gacaacgccc tggcccggtc gatcgttgac tgaaaggacc cgattgagca cttcagccga 66240
gataagcttg tggttcagac ggttcaatcc gtcgccgact gcaagcagta gactcatatg 66300
tttcccccac gctggcggct ccgcgagctt ctttttgccg ctatccaggg cgatgtcccc 66360
cgaagttgaa gtgttgtctg tgcagtatcc gggaaggcag gatcgccgaa atgaacagcc 66420
agccggaagc atagctgctc tcgcagattc cattgcggac aatatttcac acttctcaga 66480
caagccgctg gccctattcg gccatagcat gggagcgatt ttagcctatg aagtgactcg 66540
aagaatatct atcaccaaca gcccaatcgc actgtttgcc tctggccgac gcgcgccgtc 66600
tcgctaccgc ccagagatcg cgcacacact gtcggacgag aaactgcttg aagagctcaa 66660
aatgctaggc ggtacagaca gccgcgcttt cgcggacaac gatattgttc gaatgatctt 66720
gcccgccgta cgcgcagact atcgagctat cgaaacatat ttttaccaac caggctccga 66780
ggtttccaca cccatatttg cccatattgg cgaccgggac cctcgagtaa cctttgacga 66840
agcaagcagt tggaaagaac atacttcgaa cagtttcgag cttcacacac ataccggcgg 66900
tcatttttat atagccgagc acaccaacag catcgccaca cacatccagc agaagctctc 66960
cgagcaccca atccgtccca gatagcgcaa ttagacgcac cacatcaaga ctattttgtc 67020
cccaccgata gatagtcgtc tgaagttccc agccgcagct tccacacttg cgaccgtgcc 67080
gattcggttc aggcagactt cgaaacccaa ttctcagcaa tggagccacc cacgatgagc 67140
gaagcacccg tcatcgcaac gcaattgccg accactcggt ccggccgttg ccccttcgac 67200
ccacccgccg ctctgacaga gatccgccag cgtgatcccc tgacccgaat gcaattcgcc 67260
aacggccacc agggctggct ggccaccggc cacaccgagg tccgcgcggt actctcggat 67320
cctcgcttta gcgcccgcca tgaacttcag cactatccgt acgccgacta cggacctatg 67380
ccgccggcac cggtgggcgc cctcgccggt atggacggcc cggaccaccg tcggtaccga 67440
aagctattga ccggcaagtt caccgtacga cgtatgcaac tgctcaccga gcggatcgag 67500
cagatcacta ctgagcatct cgatgcgatg gaaaagcatg gcggcccaat agatctggtg 67560
acggccttcg cacgtcccat tccggcgctg atgatctgcg aactgctggg tgtgcccagc 67620
tccgatcgca ctaccttcca ggagcatgcg aaaaaggcca gcgacgtcac ggccggccta 67680
gaggaacgtc tcgccgccta caccgccatc gtcgactacg tcgctgacct ggtgacggac 67740
aaacgtacgg ctcccaccga cgatctactc agcgacctga ccacgaccga cctcaccgac 67800
gaggagttgg ccggcatcgg cgccttcctg ctcggcgcag ggctagacac gacggcgaac 67860
atgctggcgc tgggcacatt cgcgctgctc actcaccccg agcagttggc ggcattgcgc 67920
tcggatccgg acctcaccga cagcgcggtg gaggagctga tgcgttacct gagcatcagc 67980
cacagcaccg cccgggccgc cctggaagac gtcgaactcg gcggcaaact gatccgggcc 68040
ggtgagacgg tcgcggtgtc catccagacc gccaaccgcg atccggcccg cttcgacaat 68100
cccgatgcac tcgacttgca ccgaaacaca gtcgggcacg tgggattcag ccacggcgcc 68160
catcaatgtc ttggccagca gctggcccgc gtcgagatgc gagttgcctt ccgtgcgttg 68220
gtgattcgct ttcccaatct gaagttggcg atccccgctc acgaggtgca gttgggaagc 68280
ggccagatct tcggcgtaaa ccaactgcct gtcagctggt aggagcggaa aaccatgaag 68340
ctcgttgtcg atcgcaaccg ctgcatcggc gctgggatgt gcgcgctgac ggcacctgcg 68400
ttgttcgacc aagatgatga cgatggactg gtgatcacac atgctgagac gccgactcct 68460
gaccaggagg gtgtcgtgcg cgaggcagtg gaggcctgcc catcgggcgc gcttcggacc 68520
gaggagtagg agcagcaagg cggcgggtat gccctcgaaa ggatggacag accagcggcc 68580
tgggaaagtg ttgccgccct ggtgagaaca tcggggcgac aaccttattc aggacaactg 68640
ttttctctcg gccctcatcg tcaaaggatt cgaatcacgg acgacagaac ctaggcgatt 68700
ctcgcacaca ctgagaatca ggatctatct atcggaagca acgtgagcac atcactttgg 68760
ccgcagtggg cgtgaagttt tcgacggcca gggattagcc agctttcctt gttgttgaag 68820
ggatttaatc cagtcctgag cgacatgatg gggaatgtcg tagtgcttag caactttggc 68880
ggtgctgccg agacgccagt aagttacacc gaaatccgat ggcgctcctg cccgttgctc 68940
tcgctgacgc tgcgcccgct gctgcgctcc cgcgagctta gaagcgggct cactgactgc 69000
agtgtcggcg tcgtcgatac tctctagcgg ggtcagcgga atacctgttt ctgccggtgg 69060
cgagacggtt tcactggtag aggacagctt cgatgccaag ttgagtcgcg acgggccacc 69120
cgacagcatg ttcgcggtgc gaacaaggag cggaaaatca atgcgggcca agtcttcagg 69180
aacataggcc tcgccggtag tctcggcacg tacctcggcc aatcgtactc cggatgcatc 69240
aatctctacg gttgcaacgg ccgctacgac gccggttctg gcgtcagaga tcgtgatcgt 69300
atagctgtct gtcattggct accgagtccg ttcaggctga cgatctagtt cttgtttgtg 69360
gaggaagttt ccacagatgg gtcaagcttg cgcaatcgcc ccatccatcc ttgagctgtg 69420
tgccgaggta cgccgaaatg ctttgcaacc gcggtgaccg tgcccagctg ctcgtacgtc 69480
gcacgaagct cgttgatatc gggcatcctc cgataggcgc gtccgccttc tccgtccgtg 69540
gagcgtggag ttgctgcctg atcagcaacc tctggcactg aagcgttttc gagcatcatc 69600
tgttccgatt gccccacagg cacctcggag gaagctgccg ccaccgcggc ccgcggaccc 69660
gaagacggaa accggcgaac cagggcggtc acaaccgcct cgatgtcgat gtcgcatacc 69720
tgctgggatg tcagaccagc agaacccgtc gtgctgatgg caataccggt gacacgcgta 69780
tctgatgtag aggcatcgac tcgtatcgtt agctgcggag catcgctgct atccccctga 69840
<210> 2
<211> 5733
<212> PRT
<213> Nocardia vinacea
<400> 2
Met Ser Leu Ser Ile Glu Glu Leu Val Lys Ala Leu Arg Val Ser Ala
1 5 10 15
Lys Glu Asn Leu Ala Leu Arg Gln Ser Asn Glu Leu Leu Val Gly Arg
20 25 30
Ser Ser Glu Pro Ile Ala Ile Val Gly Val Gly Cys Arg Phe Pro Gly
35 40 45
Gly Val Ser Ser Arg Glu Glu Leu Trp Gln Val Val Ala Gln Gly Arg
50 55 60
Asp Val Val Ser Gln Trp Pro Leu Asp Arg Gly Trp Asp Gly Gly Leu
65 70 75 80
Phe Asp Pro Glu Pro Gly Val Ala Gly Arg Ser Tyr Thr Arg Glu Gly
85 90 95
Gly Phe Leu His Asp Ala Gly Leu Phe Asp Ala Gly Phe Phe Gly Ile
100 105 110
Ser Pro Arg Glu Ala Val Ala Met Asp Pro Gln Gln Arg Leu Leu Leu
115 120 125
Glu Thr Val Trp Glu Ala Leu Glu Asp Ala Gly Val Asp Pro Val Ser
130 135 140
Leu Arg Gly Ser Asp Thr Gly Val Phe Met Gly Val Met Tyr His Asp
145 150 155 160
Tyr Pro Ala Ser Ala Val Gly Gly Ser Val Val Ser Gly Arg Val Ser
165 170 175
Tyr Val Leu Gly Leu Glu Gly Pro Ala Val Ser Val Asp Thr Ala Cys
180 185 190
Ser Ser Ser Leu Val Ala Leu His Gln Ala Val Gln Ala Val Arg Ala
195 200 205
Gly Glu Cys Gly Met Ala Leu Val Gly Gly Val Thr Val Met Ser Thr
210 215 220
Pro Asp Thr Phe Val Glu Phe Ser Arg Gln Lys Gly Leu Ala Pro Asp
225 230 235 240
Gly Arg Cys Lys Ser Phe Ala Glu Ala Ala Asp Gly Ala Gly Trp Ser
245 250 255
Glu Gly Val Gly Val Leu Val Val Glu Arg Leu Ser Asp Ala Arg Arg
260 265 270
Arg Gly His Gln Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gln
275 280 285
Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln
290 295 300
Arg Val Ile Arg Arg Ala Leu Ala Asn Ala Gly Val Ala Ala Thr Glu
305 310 315 320
Val Asp Val Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro
325 330 335
Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Gln Asn Arg Glu Pro
340 345 350
Asp Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn Ile Gly His Ala
355 360 365
Gln Ala Ala Ala Gly Val Ala Gly Val Ile Lys Met Ile Glu Ala Met
370 375 380
Arg His Glu Thr Leu Pro Lys Thr Leu His Val Asp Thr Pro Thr Thr
385 390 395 400
His Val Asp Trp Thr Ala Gly Ala Val Glu Leu Leu Thr Glu Ser Arg
405 410 415
Ala Trp Thr Val Glu Ala Asp Arg Pro Arg Arg Ala Ala Val Ser Ser
420 425 430
Phe Gly Ile Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln Ser Pro
435 440 445
Pro Val Thr Pro Asp Thr Glu Ser Ser Ala Pro Asp Thr Asp Pro Val
450 455 460
Pro Ala Val Lys Ser Asp Ala Val Val Trp Met Val Ser Gly Arg Thr
465 470 475 480
Gly Glu Gly Leu Leu Ala Gln Gly Arg Arg Leu Gln Glu Trp Met Leu
485 490 495
Thr Arg Pro Gly Leu Asp Ala Val Asp Val Gly Trp Ser Leu Ile Asn
500 505 510
Thr Arg Ala Arg Leu Glu His Arg Ala Val Leu Val Gly Ala Asp Arg
515 520 525
Glu Glu Leu Met Thr Arg Leu Gln Ala Leu Ile Asp Ser Glu Pro Gly
530 535 540
Met Leu Ala Gly Ala Gly Met Phe Ala Gly Pro Gly Val Val Ser Gly
545 550 555 560
Val Ala Gly Gly Val Gly Lys Thr Val Leu Val Phe Pro Gly Gln Gly
565 570 575
Ala Gln Trp Leu Gly Met Gly Ala Arg Leu Leu Gln Glu Ser Val Val
580 585 590
Phe Glu Gln Lys Val Leu Glu Cys Ala Glu Val Phe Ala Pro Leu Val
595 600 605
Glu Trp Ser Leu Ile Asp Val Leu Gln Gly Thr Ala Asp Pro Met Leu
610 615 620
Leu Glu Arg Val Asp Val Val Gln Pro Ala Leu Phe Ala Val Met Val
625 630 635 640
Ser Leu Ala Glu Val Trp Arg Ser Phe Gly Val Val Pro Asp Ala Val
645 650 655
Leu Gly His Ser Gln Gly Glu Ile Ala Ala Ala Cys Val Ala Gly Ala
660 665 670
Leu Ser Leu Glu Asp Ala Ala Arg Val Val Ile Leu Arg Ser Arg Ala
675 680 685
Leu Arg Glu Leu Ser Gly Arg Gly Gly Met Ala Ser Val Leu Leu Pro
690 695 700
Thr Thr Leu Val Glu Gln Arg Leu Thr Asp Met Pro Gly Leu Ala Val
705 710 715 720
Ala Ala Val Asn Gly Pro Thr Thr Thr Val Val Ser Gly Pro Thr Glu
725 730 735
Gln Leu Asp Ala Phe Val Ala Ala Cys Glu Ser Asp Gly Val Gln Val
740 745 750
Arg Arg Ile Ala Val Asp Tyr Ala Ser His Ser Pro Gln Val Glu Ser
755 760 765
Leu Arg Gln Arg Leu Leu Glu Glu Leu Ala Thr Ile Thr Pro Arg Pro
770 775 780
Ser Arg Ile Ala Phe Tyr Ser Thr Val Thr Gly Thr Leu Leu Asp Thr
785 790 795 800
Thr Glu Leu Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asp Thr Val
805 810 815
Asn Phe Ala Ala Thr Val Gln Thr Leu Leu Ser Glu Gly His Thr Val
820 825 830
Phe Val Glu Ala Ser Pro His Pro Val Leu Thr Pro Gly Ile Glu Glu
835 840 845
Leu Gly Glu Gln Thr Gly Pro Arg Thr Arg Asp Ile Val Val Thr Gly
850 855 860
Ser Leu Arg Arg Asp Asp Gly Gly Leu Asp Arg Leu Leu Ser Ala Leu
865 870 875 880
Ala Met Val Asp Val Ala Gly Ala Gly Val Asp Trp Thr Pro Ile Phe
885 890 895
Asp Gly Arg Gly Ala Thr Arg Val Ala Leu Pro Ser Tyr Ala Phe Gln
900 905 910
His Arg Arg Tyr Trp Leu Asp Thr Leu Thr Ala Ser Gly His Pro Asp
915 920 925
Ser Leu Gly Ser Ser Val Gly Ala Asp Asp Gly Ile Asp Gly Glu Phe
930 935 940
Trp Asp Ala Val Ala Arg Glu Asp Trp Glu Ala Leu Gly Leu Glu Glu
945 950 955 960
Gly Cys Thr Ile Gly Glu Val Ser Pro Leu Leu Ser Ser Trp Arg Gln
965 970 975
Gln Arg Arg Ala Gln Ser Val Ile Asp Gln Trp Arg Tyr Arg Ile Gly
980 985 990
Trp Lys Trp Leu Ala Glu Lys Pro Val Arg Val Ser Gly Lys Trp Leu
995 1000 1005
Val Val Ser Pro Thr Gly Ala Ala Ile Gly Asp Glu Val Cys Gly Val
1010 1015 1020
Phe Thr Ala Ala Gly Leu Glu Thr Gln Arg Leu Glu Val Asp Ala Asp
1025 1030 1035 1040
Arg Met Thr Arg Gln Thr Met Ala Asp Leu Leu Glu Ser Ala Gly Pro
1045 1050 1055
Trp Asp Glu Phe Arg Gly Val Val Ser Leu Ile Ala Leu Asn Asp Gly
1060 1065 1070
Ile Gly Gly Asp Ser Pro Leu Val Ser Arg Gly Val Ala Gly Asn Val
1075 1080 1085
Trp Leu Leu Lys Ala Leu Arg Glu Thr Ala Ala Glu Ile Pro Leu Trp
1090 1095 1100
Cys Val Thr Ser Gly Ala Val Ile Val Gly Pro Ser Asp Arg Ser Val
1105 1110 1115 1120
Asp Ala Thr Gln Ser Gln Met Trp Gly Leu Gly Gln Val Ala Gly Leu
1125 1130 1135
Glu Leu Pro Gln Ser Trp Gly Gly Leu Ile Asp Leu Pro Asn Ala Trp
1140 1145 1150
Asp Asp Thr Ile Leu Arg Ser Leu Pro Ala Val Leu Ser Arg Glu Asp
1155 1160 1165
Gly Glu Asp Gln Leu Ala Val Arg Glu Ser Gly Val Tyr Gly Arg Arg
1170 1175 1180
Met Met Arg Ala Pro Leu Pro Asn Ser Gly Arg Gly Lys His Trp Arg
1185 1190 1195 1200
Pro Arg Gly Thr Val Leu Val Thr Gly Gly Thr Gly Gly Ile Gly Ala
1205 1210 1215
His Ala Ala Arg Trp Leu Leu Thr Asn Gly Ala Glu His Val Val Leu
1220 1225 1230
Val Ser Arg Arg Gly Arg Gln Ala Pro Gly Ala Leu Glu Leu Glu Gln
1235 1240 1245
Glu Leu Ser Ala Leu Gly Gly Arg Val Thr Ile Met Ala Ala Asp Ile
1250 1255 1260
Ala Glu Arg Gly Asp Val Ala Ala Val Leu Ser Thr Ile Asp Asn Asp
1265 1270 1275 1280
Ser Ile Pro Leu Thr Ala Val Ile His Ala Ala Gly Val Val Asp Gln
1285 1290 1295
Arg Pro Leu Thr Glu Ile Asp Ser Glu Ser Met Thr Thr Ala Ala Ala
1300 1305 1310
Ala Lys Val Gly Gly Ala Gln His Leu Asp Glu Leu Leu Gly Asp Arg
1315 1320 1325
Arg Leu Asp Ala Phe Val Leu Phe Ser Ser Gly Ala Ala Thr Trp Gly
1330 1335 1340
Gly Thr Gly Leu Ala Glu Tyr Ala Ala Ser Asn Ala His Leu Asp Gly
1345 1350 1355 1360
Leu Ala Gln Asp Arg Arg Ser Arg Gly Leu Val Ala Thr Ser Leu Ala
1365 1370 1375
Trp Gly Gly Trp Ser Gly Gly Gly Met Thr Glu Ile Gly Thr Thr Thr
1380 1385 1390
Glu Tyr Phe Gly Arg Leu Gly Ile Arg Leu Met Glu Pro Asp Leu Ala
1395 1400 1405
Leu Gln Ala Leu Ser Gln Ala Val Ala Asn Asn Glu Thr Leu Val Thr
1410 1415 1420
Val Ala Asp Ile Asp Trp Gln Gln Phe Thr Val Tyr Tyr Thr Leu Ser
1425 1430 1435 1440
Arg Arg Arg Leu Leu Ile Thr Asp Ile Pro Asp Ala Gln Ala Asp Thr
1445 1450 1455
Asp Ser Ala Ile Asp Ser Gly Asn Thr Gly Ser Pro Leu Arg Gln Arg
1460 1465 1470
Leu Ser Gly Leu Gly Glu Thr Glu Gln Ile Gln Val Leu Leu Asp Leu
1475 1480 1485
Val Arg Ala Gln Ile Ala Ile Val Leu Gly His Asp Asp Ala Thr Ala
1490 1495 1500
Ile Asp Ala Asp Arg Asn Phe Arg Asp Leu Gly Phe Asp Ser Leu Thr
1505 1510 1515 1520
Ala Val Glu Ala Arg Asn Arg Ile Lys Thr Ala Thr Gly Val Ala Val
1525 1530 1535
Ala Ala Thr Leu Ile Phe Asp Tyr Pro Thr Pro Arg Ala Val Ala Glu
1540 1545 1550
His Leu His Gln Gln Leu Ala Gly Ala Ser Val Ala Ala Glu Pro Val
1555 1560 1565
Val Val Val Gly His Ser Ala Glu Pro Ile Ala Ile Val Gly Val Gly
1570 1575 1580
Cys Arg Phe Pro Gly Gly Val Ser Ser Arg Glu Glu Leu Trp Gln Val
1585 1590 1595 1600
Val Ala Gln Gly Arg Asp Val Val Ser Gln Trp Pro Val Asp Arg Gly
1605 1610 1615
Trp Asp Ala Gly Leu Phe Asp Pro Glu Pro Gly Val Thr Gly Lys Ser
1620 1625 1630
Tyr Thr Arg Asp Gly Ala Phe Leu His Asp Ala Gly Leu Phe Asp Ala
1635 1640 1645
Gly Phe Phe Gly Ile Ser Pro Arg Glu Ala Val Ala Met Asp Pro Gln
1650 1655 1660
Gln Arg Leu Leu Leu Glu Thr Val Trp Glu Ala Leu Glu Asp Ala Gly
1665 1670 1675 1680
Val Asp Pro Val Ser Leu Arg Gly Ser Asp Thr Gly Val Phe Ile Gly
1685 1690 1695
Val Ser Asp Gln Ser Tyr Gly Ile Gly Arg Ser Asp Gly Asp Ala Gly
1700 1705 1710
Val Glu Gly Tyr Arg Leu Thr Gly Gly Ala Thr Ser Val Val Ser Gly
1715 1720 1725
Arg Val Ser Tyr Val Leu Gly Leu Glu Gly Pro Ala Val Ser Val Asp
1730 1735 1740
Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Gln Ala Val Gln Ala
1745 1750 1755 1760
Val Arg Ala Gly Glu Cys Gly Met Ala Leu Val Gly Gly Val Ala Val
1765 1770 1775
Leu Ala Thr Pro Gly Ala Phe Ile Glu Phe Ser Arg Gln Lys Gly Leu
1780 1785 1790
Ala Pro Asp Gly Arg Cys Lys Ser Phe Ala Glu Ala Ala Asp Gly Thr
1795 1800 1805
Gly Trp Ser Glu Gly Val Gly Ile Leu Val Val Glu Arg Leu Ser Asp
1810 1815 1820
Ala Arg Arg His Gly His Gln Val Leu Ala Val Val Arg Gly Ser Ala
1825 1830 1835 1840
Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro
1845 1850 1855
Ser Gln Gln Arg Val Ile Arg Arg Ala Leu Ala Asn Ala Gly Val Ser
1860 1865 1870
Ala Thr Glu Val Asp Val Val Glu Ala His Gly Thr Gly Thr Thr Leu
1875 1880 1885
Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Gln Asn
1890 1895 1900
Arg Glu Pro Asp Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn Ile
1905 1910 1915 1920
Gly His Thr Gln Asn Ala Ala Gly Val Ala Gly Val Ile Lys Met Ile
1925 1930 1935
Glu Ala Ile Arg His Gln Thr Leu Pro Lys Thr Leu His Ile Asp Thr
1940 1945 1950
Pro Thr Thr His Val Asp Trp Thr Ser Gly Ala Val Glu Leu Leu Thr
1955 1960 1965
Glu Ser Arg Thr Trp Thr Thr Glu Ala Asp Arg Pro Arg Arg Ala Ala
1970 1975 1980
Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala His Val Ile Leu Glu
1985 1990 1995 2000
Gln Ser Pro Pro Val Thr Pro Asp Thr Glu Ser Ser Ala Pro Asp Thr
2005 2010 2015
Asp Pro Val Pro Ala Val Lys Ser Asp Ala Val Val Trp Met Val Ser
2020 2025 2030
Gly Arg Thr Gly Glu Gly Leu Leu Ala Gln Gly Arg Arg Leu His Glu
2035 2040 2045
Trp Met Leu Ala Arg Pro Gly Leu Asp Ala Val Asp Val Gly Trp Ser
2050 2055 2060
Leu Ile Asn Thr Arg Ala Arg Leu Glu His Arg Ala Val Leu Val Gly
2065 2070 2075 2080
Ala Asp Arg Glu Glu Leu Met Thr Arg Leu Gln Gly Leu Ile Asp Gly
2085 2090 2095
Asp Pro Ala Val Ala Ala Gly Val Ser Arg Asp Arg Gly Lys Thr Val
2100 2105 2110
Phe Val Phe Pro Gly Gln Gly Ala Gln Leu Leu Gly Met Gly Ser Gly
2115 2120 2125
Leu Tyr Glu Ala Phe Pro Val Phe Ala Ala Ser Phe Asp Glu Thr Thr
2130 2135 2140
Ala Leu Leu Glu Gln Gln Leu Glu Cys Ser Leu Arg Asp Val Val Trp
2145 2150 2155 2160
Gly Val Asp Glu Gln Ala Leu Gln Ala Thr Leu Tyr Thr Gln Thr Gly
2165 2170 2175
Leu Phe Ala Val Gly Ile Ala Leu Phe Arg Leu Leu Glu Ser Phe Gly
2180 2185 2190
Val Arg Pro Asp Phe Val Ala Gly His Ser Ile Gly Glu Leu Ala Ala
2195 2200 2205
Ala Thr Val Ala Gly Val Leu Ser Leu Glu Asp Ala Thr Val Leu Val
2210 2215 2220
Ala Ala Arg Ala Arg Leu Met Gln Gln Leu Pro Thr Gly Gly Ala Met
2225 2230 2235 2240
Leu Ala Met Arg Ala Ser Glu Thr Gln Ile Thr Thr Leu Leu Gly Asp
2245 2250 2255
Ser Ile Glu Asp Gly Val Val Glu Ile Ala Ala Val Asn Gly Pro Glu
2260 2265 2270
Ser Ile Val Leu Ala Gly Pro Gln His Ala Ile Asp Thr Thr Glu Gln
2275 2280 2285
Gln Leu Arg Gln Ala Gly Tyr Gln Val Asn Arg Leu Arg Val Ser His
2290 2295 2300
Ala Phe His Ser Ala Ser Met Glu Pro Met Leu Ala Glu Phe Ala Arg
2305 2310 2315 2320
Ile Ala Thr Glu Leu Thr Tyr Thr Gln Pro Val Ile Pro Ile Ile Ser
2325 2330 2335
Asn Leu Asp Gly Gln Leu Thr Gly Pro Asn Thr Asp Ser Pro Asn Thr
2340 2345 2350
Asp Ala Gln Gln Ala Asp Ser Pro Leu Thr Thr Pro Gln Tyr Trp Val
2355 2360 2365
Asp His Val Arg Asn Thr Val Arg Phe Ala Asp Gly Ile Thr Thr Leu
2370 2375 2380
Thr Thr Ala Gly Ala Thr Arg Tyr Val Ile Met Gly Pro Asp Gly Gly
2385 2390 2395 2400
Leu Ser Gly Leu Ile Asp Glu Thr Leu Gln His Thr Thr Ser Asp Ala
2405 2410 2415
Val Asp Thr Lys Pro Thr Val Asp Gly Val Glu Ala Val Val Ala Ser
2420 2425 2430
Leu Leu Arg Lys Asp Arg Val Glu Asp Thr Thr Leu Leu Ser Ala Leu
2435 2440 2445
Ala Arg Leu Asp Val Ala Gly Thr Gly Ile Asp Trp Thr Pro Ile Phe
2450 2455 2460
His Gly Arg Gly Ala Thr Arg Val Pro Leu Pro Ser Tyr Ala Phe Gln
2465 2470 2475 2480
His Arg Arg Tyr Trp Leu Asp Thr Ile Thr Gly Asn Thr Asp Pro Asp
2485 2490 2495
Ser Leu Gly Leu Ser Gly Leu Asp His Pro Leu Ile Gly Ala Val Val
2500 2505 2510
Val Ser Pro Glu Thr Gly Ala Val Thr Val Thr Gly Arg Leu Ser Leu
2515 2520 2525
Gln Thr His Pro Trp Leu Ala Asp His Ala Val Gly Gly Val Val Leu
2530 2535 2540
Leu Pro Gly Thr Gly Leu Val Glu Leu Val Ile Arg Ala Gly Asp Glu
2545 2550 2555 2560
Val Gly Cys Gly Ala Ile Arg Glu Leu Thr Leu Leu Ala Pro Leu Thr
2565 2570 2575
Leu Pro Ala Glu Gly Gly Thr Ala Ile Gln Val Leu Val Gly Ala Leu
2580 2585 2590
Glu Thr Ser Gly Thr Arg Thr Val Ser Val Tyr Ser Gln Thr Arg Asp
2595 2600 2605
Gln Glu Trp Val Leu Asn Ala Gln Gly Leu Leu His Thr Gln Ser Pro
2610 2615 2620
Val Glu Asn Leu Thr Thr Thr Thr Pro Val Asp Thr Gly Leu Ala Val
2625 2630 2635 2640
Trp Pro Pro Gln Asn Ala Thr Arg Thr Asp Thr Ser Ser Leu Tyr Gln
2645 2650 2655
Gln Leu Ala Glu Asp Gly Tyr Gly Tyr Gly Pro Ala Phe Gln Gly Leu
2660 2665 2670
Glu Ser Val Trp Arg Thr Gly Glu Asp Trp Leu Val Gln Ala Arg Leu
2675 2680 2685
Pro Glu Thr Gly Gly Asp Ala His His Tyr Gly Leu His Pro Ala Leu
2690 2695 2700
Leu Asp Ala Val Leu His Ala Met Thr Thr Gly His Asp Thr Asp Thr
2705 2710 2715 2720
Asp Thr Asp Thr Ser Val Gly Pro Leu Leu Pro Phe Ala Trp Glu Gly
2725 2730 2735
Val Gln Leu His Ala Val Gly Ala Ser Thr Val His Ala Arg Ile Thr
2740 2745 2750
Pro Leu Gly His Asn Thr Val Arg Val Thr Val Thr Asp Pro Asp Gly
2755 2760 2765
Gln Pro Val Leu Thr Ile Ala Ser Leu Thr Leu Arg Pro Val Gln Leu
2770 2775 2780
Asp Gln Leu Thr Thr Ala Ala Asp Gly Gly Asp Arg Leu His Thr Leu
2785 2790 2795 2800
His Trp Thr Pro Thr Pro Ile Pro Ala Gln Leu Arg Glu Val Glu Phe
2805 2810 2815
Val Glu Trp Asn Asn Leu Glu His Glu Ser Ala Asp Asp Pro Val Pro
2820 2825 2830
Pro Val Val Val Leu Asp Cys Arg Asp Gly Glu Asn Asn Thr Val Gly
2835 2840 2845
Glu Ile Asp Thr Asp Val Leu Val Arg Ala His Ala Ile Ser His Arg
2850 2855 2860
Val Leu Gly Val Leu Gln Glu Phe Ser Thr Gly Gln Arg Phe Ala Ser
2865 2870 2875 2880
Ser Thr Leu Leu Val Leu Thr Arg Ala Ala Val Thr Thr Thr Ala Gly
2885 2890 2895
Asp Arg Val Asp Pro Ala Ala Ser Thr Ile Trp Gly Leu Val Arg Ser
2900 2905 2910
Ala Gln Ser Glu Glu Pro Gly Arg Ile Leu Leu Ala Asp Thr Asp Ile
2915 2920 2925
Glu Gly Ser Asp Gly Val Asp Val Ala Gly Ile Val Ser Leu Ala Val
2930 2935 2940
Ala Val Gly Glu Pro Gln Val Leu Ile Arg Asp Gly Ile Ala His Ile
2945 2950 2955 2960
Ala Arg Leu Thr Arg Gly Pro Gly Arg Gly Thr Leu Ala Ile Pro Asp
2965 2970 2975
Ala Gly Ala Trp Gln Leu Ala Ala Val Asp Lys Gly Val Leu Asp Gly
2980 2985 2990
Leu Ala Leu Val Ser His Pro Leu Ala Glu Gln Pro Leu Ala Ala Gly
2995 3000 3005
Gln Val Arg Ile Ser Val Arg Ala Ala Gly Leu Asn Phe Arg Asp Val
3010 3015 3020
Leu Ile Ala Leu Gly Met Tyr Pro Asp Asp Asp Ala Val Val Gly Ala
3025 3030 3035 3040
Glu Leu Ala Gly Val Ile Val Glu Val Gly Ala Asp Val Glu Gly Leu
3045 3050 3055
Ser Val Gly Asp Arg Val Met Gly Leu Ala Gly Arg Gly Val Gly Pro
3060 3065 3070
Val Val Ile Val Asp His Arg Leu Val Val His Met Pro Ala Gly Trp
3075 3080 3085
Ser Phe Ala Gln Ala Ala Ala Val Pro Val Val Phe Leu Thr Ala Tyr
3090 3095 3100
Tyr Gly Leu Met Asp Leu Ala His Ala Lys Pro Gly Asp Arg Leu Leu
3105 3110 3115 3120
Val His Ala Ala Thr Gly Gly Val Gly Met Ala Ala Ile Gln Leu Ala
3125 3130 3135
Arg Cys Trp Gly Leu Glu Val Phe Ala Thr Ala Ser Ser Gly Lys Trp
3140 3145 3150
Asp Val Leu Arg Gly Ile Gly Phe Asp Asp Gln His Ile Ala Asn Ser
3155 3160 3165
Arg Thr Leu Ser Phe Glu Asp Glu Phe Leu Ser Ala Thr Asp Gly His
3170 3175 3180
Gly Val Asp Ile Val Leu Asn Ser Leu Ala Gly Asp Phe Val Asp Ala
3185 3190 3195 3200
Ser Leu Arg Leu Leu Pro Arg Gly Gly His Phe Leu Glu Met Gly Lys
3205 3210 3215
Thr Asp Lys Arg Asp Ser Asp Ala Ile Thr Thr Gln Tyr Pro Gly Val
3220 3225 3230
Ile Tyr Gln Ala Phe Asp Met Phe Glu Ala Gly Glu Asp Arg Ile Gln
3235 3240 3245
Gln Met Leu Ser Glu Leu Thr Ala Ser Phe Asp Arg Gly Glu Leu Lys
3250 3255 3260
Ser Ile Pro Ile Gln Ala Trp Asp Ile Arg Gln Ala Pro Glu Ala Phe
3265 3270 3275 3280
Arg Tyr Phe Ser Gln Thr Arg His Ile Gly Lys Val Val Leu Thr Leu
3285 3290 3295
Pro Val Val Ser Thr Leu Pro Val Val Ser Asp Thr Thr Asp Thr Gly
3300 3305 3310
Arg Gly Thr Val Val Leu Thr Gly Gly Thr Gly Gly Leu Gly Arg Ile
3315 3320 3325
Leu Ala Arg His Leu Val Gly Val Arg Gly Val Arg Ser Leu Val Leu
3330 3335 3340
Ala Ser Arg Arg Gly Ile Ala Ala Glu Gly Ala Arg Glu Leu Val Glu
3345 3350 3355 3360
Glu Leu Thr Gly Ser Gly Ala Arg Val Ala Val Val Ala Cys Asp Val
3365 3370 3375
Ser Thr Arg Ala Gly Val Glu Gln Leu Leu Ala Ala Val Pro Asp Glu
3380 3385 3390
Asp Pro Leu Val Gly Val Val His Ala Ala Gly Val Leu Asp Asp Gly
3395 3400 3405
Val Ile Ala Ser Leu Thr Pro Gln Arg Leu Asp Thr Val Leu Ser Val
3410 3415 3420
Lys Ala Asp Ala Ala Trp Tyr Leu His Glu Leu Thr Arg Gly Leu Asp
3425 3430 3435 3440
Leu Gly Met Phe Val Met Tyr Ser Ser Thr Ala Gly Val Leu Gly Ser
3445 3450 3455
Pro Gly Gln Gly Asn Tyr Ala Ala Ala Asn Gln Phe Leu Asp Gly Leu
3460 3465 3470
Ala Glu His Arg Arg Ala Gln Gly Leu Pro Ala Thr Ser Ile Ala Trp
3475 3480 3485
Gly Leu Trp Gly Ser Ser Thr Gly Met Thr Gly His Leu Asp Gly Gly
3490 3495 3500
Asp Thr Ala Arg Met Asn Arg Gly Gly Tyr Leu Ala Met Thr Asp Glu
3505 3510 3515 3520
Gln Gly Met Ala Met Phe Asp Thr Ala Ile Thr Ala Glu His Ala Thr
3525 3530 3535
Val Leu Ala Ala Arg Phe Asp Thr Thr Ala Leu Ala Ala Gln Ala Arg
3540 3545 3550
Ala Gly Met Leu Thr Pro Ile Leu His Gln Leu Val Pro Asn Ala Arg
3555 3560 3565
Arg Ala Ala Thr Gly Asp Thr Gly Ser Ala Ser Gly Val Ala Gly Ser
3570 3575 3580
Gln Leu Arg Gln Arg Leu Ser Gly Leu Asp Glu Ala Glu Gln Val Lys
3585 3590 3595 3600
Ile Leu Leu Glu Leu Val Gln Thr Gln Val Ala Ile Val Leu Gly His
3605 3610 3615
Asp Asp Ala Thr Thr Ile Asp Ala Asp Arg Asn Phe Gln Glu Leu Gly
3620 3625 3630
Phe Asp Ser Leu Thr Ala Val Glu Ala Arg Asn Arg Leu Lys Thr Ala
3635 3640 3645
Thr Glu Val Ala Ile Pro Ala Thr Leu Thr Phe Asp Tyr Pro Thr Pro
3650 3655 3660
Arg Ala Val Ala Glu His Leu His Gln Gln Leu Ala Gly Ala Ser Val
3665 3670 3675 3680
Ala Ala Glu Pro Val Val Val Met Gly His Ser Ala Glu Pro Ile Ala
3685 3690 3695
Ile Val Gly Val Gly Cys Arg Phe Pro Gly Glu Val Ser Ser Arg Glu
3700 3705 3710
Asp Leu Trp Glu Leu Leu Val Gln Gly Arg Asp Val Val Ser Gln Trp
3715 3720 3725
Pro Leu Asp Arg Gly Trp Asp Ile Gly Thr Arg Ser Ser Thr Arg Glu
3730 3735 3740
Gly Gly Phe Leu His Asp Ala Ser Ser Phe Asp Ala Gly Phe Phe Gly
3745 3750 3755 3760
Ile Ser Pro Arg Glu Ala Val Ala Met Asp Pro Gln Gln Arg Leu Leu
3765 3770 3775
Leu Glu Thr Val Trp Glu Ala Leu Glu Asp Ala Gly Val Asp Pro Thr
3780 3785 3790
Ser Leu His Gly Ser Glu Thr Gly Val Phe Ile Gly Val Ser Asp Gln
3795 3800 3805
Ser Tyr Gly Val Gly Arg Gly Asp Gly Asp Ala Gly Val Glu Gly Tyr
3810 3815 3820
Arg Leu Thr Gly Ala Thr Ser Ser Val Val Ser Gly Arg Val Ser Tyr
3825 3830 3835 3840
Val Leu Gly Leu Glu Gly Pro Ala Val Ser Val Asp Thr Ala Cys Ser
3845 3850 3855
Ser Ser Leu Val Ala Leu His Gln Ala Val Gln Ala Val Arg Ala Gly
3860 3865 3870
Glu Cys Gly Met Ala Leu Val Gly Gly Val Thr Val Met Ser Thr Pro
3875 3880 3885
Gly Ala Phe Val Glu Phe Ser Arg Gln Gly Gly Leu Ala Pro Asp Gly
3890 3895 3900
Arg Cys Lys Pro Phe Ala Glu Ala Ala Asp Gly Thr Gly Trp Ser Glu
3905 3910 3915 3920
Gly Val Gly Ile Leu Val Val Glu Arg Leu Ser Glu Ala Arg Lys His
3925 3930 3935
Gly His Gln Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gln Asp
3940 3945 3950
Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg
3955 3960 3965
Val Ile Arg Arg Ala Leu Ala Asn Ala Gly Leu Ser Pro Asp Leu Ile
3970 3975 3980
Asp Val Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile
3985 3990 3995 4000
Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Gln Asn Arg Glu Pro Asp
4005 4010 4015
Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn Ile Gly His Ala Gln
4020 4025 4030
Ala Ala Ala Gly Val Ala Gly Val Ile Lys Met Ile Glu Ala Met Arg
4035 4040 4045
His Glu Thr Leu Pro Lys Thr Leu His Val Asp Thr Pro Ser Ser His
4050 4055 4060
Val Asp Trp Thr Thr Gly Thr Val Glu Leu Leu Thr Gln Glu Gln Pro
4065 4070 4075 4080
Trp Pro Arg Asn Gly His Pro Leu Arg Ala Gly Ile Ser Ser Phe Gly
4085 4090 4095
Val Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln Ala Pro Ala Val
4100 4105 4110
Val Glu Pro Val Pro Gly Thr Glu Ser Leu Val Pro Val Thr Ala Gly
4115 4120 4125
Gly Leu Val Trp Val Leu Ser Gly Arg Thr Gly Glu Gly Leu Leu Ala
4130 4135 4140
Gln Gly Arg Arg Leu Gln Glu Trp Met Leu Thr Arg Pro Gly Leu Asp
4145 4150 4155 4160
Ala Val Asp Val Gly Trp Ser Leu Ile Asn Thr Arg Ala Arg Leu Glu
4165 4170 4175
His Arg Ala Val Leu Val Gly Ala Asp Arg Glu Glu Leu Met Thr Arg
4180 4185 4190
Leu Gln Ala Leu Ile Asp Ser Glu Pro Gly Met Leu Ala Gly Ala Gly
4195 4200 4205
Met Phe Ala Gly Pro Gly Val Val Ser Gly Val Ala Gly Gly Val Gly
4210 4215 4220
Lys Thr Val Leu Val Phe Pro Gly Gln Gly Ala Gln Trp Leu Gly Met
4225 4230 4235 4240
Gly Ala Arg Leu Leu Gln Glu Ser Val Val Phe Glu Gln Lys Val Leu
4245 4250 4255
Glu Cys Ala Glu Val Phe Ala Pro Leu Val Glu Trp Ser Leu Ile Asp
4260 4265 4270
Val Leu Gln Gly Thr Ala Asp Pro Met Leu Leu Glu Arg Val Asp Val
4275 4280 4285
Val Gln Pro Ala Leu Phe Ala Val Met Val Ser Leu Ala Glu Val Trp
4290 4295 4300
Arg Ser Phe Gly Val Val Pro Asp Ala Val Leu Gly His Ser Gln Gly
4305 4310 4315 4320
Glu Ile Ala Ala Ala Cys Val Ala Gly Ala Leu Ser Leu Glu Asp Ala
4325 4330 4335
Ala Arg Val Val Ile Leu Arg Ser Arg Ala Leu Arg Glu Leu Ser Gly
4340 4345 4350
Arg Gly Gly Met Ala Ser Val Leu Leu Pro Thr Thr Leu Val Glu Gln
4355 4360 4365
Arg Leu Thr Asp Met Pro Gly Leu Ala Val Ala Ala Val Asn Gly Pro
4370 4375 4380
Thr Thr Thr Val Val Ser Gly Pro Thr Glu Gln Leu Asp Ala Phe Val
4385 4390 4395 4400
Ala Ala Cys Glu Ser Asp Gly Val Gln Val Arg Arg Ile Ala Val Asp
4405 4410 4415
Tyr Ala Ser His Ser Pro Gln Val Glu Ser Leu Arg Gln Arg Leu Leu
4420 4425 4430
Glu Glu Leu Ala Thr Ile Thr Pro Arg Pro Ser Arg Ile Ala Phe Tyr
4435 4440 4445
Ser Thr Val Thr Gly Thr Leu Leu Asp Thr Thr Glu Leu Asp Ala Gly
4450 4455 4460
Tyr Trp Phe Arg Asn Leu Arg Asp Thr Val Asn Phe Ala Ala Thr Val
4465 4470 4475 4480
Gln Thr Leu Leu Ser Glu Gly His Thr Val Phe Val Glu Ala Ser Pro
4485 4490 4495
His Pro Val Leu Thr Pro Gly Ile Glu Glu Leu Gly Glu Gln Thr Gly
4500 4505 4510
Pro Arg Thr Arg Asp Ile Val Val Thr Gly Ser Leu Arg Arg Asp Asp
4515 4520 4525
Gly Gly Leu Asp Arg Leu Leu Ser Ala Leu Ala Met Val Asp Val Ala
4530 4535 4540
Gly Ala Gly Val Asp Trp Thr Pro Ile Phe Asp Gly Arg Gly Ala Thr
4545 4550 4555 4560
Arg Val Ala Leu Pro Ser Tyr Ala Phe Gln His Arg Arg Tyr Trp Leu
4565 4570 4575
Asp Thr Leu Thr Ala Ser Gly Asn Pro Asp Ser Leu Gly Gln Thr Ala
4580 4585 4590
Leu Asp His Pro Leu Ile Gly Ala Val Val Val Ser Pro Glu Thr Gly
4595 4600 4605
Ala Val Thr Val Thr Gly Arg Leu Ser Leu Gln Thr His Pro Trp Leu
4610 4615 4620
Ala Asp His Ala Val Gly Gly Val Val Leu Leu Pro Gly Thr Gly Leu
4625 4630 4635 4640
Val Glu Leu Val Ile Arg Ala Gly Asp Glu Val Gly Cys Gly Ala Ile
4645 4650 4655
Arg Glu Leu Thr Leu Leu Ala Pro Leu Thr Leu Pro Ala Glu Gly Gly
4660 4665 4670
Thr Ala Ile Gln Val Leu Val Gly Ala Leu Glu Thr Ser Gly Thr Arg
4675 4680 4685
Thr Val Ser Val Tyr Ser Gln Thr Arg Asp Gln Glu Trp Val Leu Asn
4690 4695 4700
Ala Gln Gly Leu Leu His Thr Gln Ser Pro Val Glu Asn Leu Thr Thr
4705 4710 4715 4720
Thr Thr Pro His Asp Val Asp Ala Gly Leu Ala Ala Trp Pro Pro Ala
4725 4730 4735
Gly Ala Val His Thr Asp Thr Ser Ser Leu Tyr Gln Gln Leu Ala Glu
4740 4745 4750
Asp Gly Tyr Gly Tyr Gly Pro Ala Phe Gln Gly Leu Glu Ser Val Trp
4755 4760 4765
Arg Thr Gly Glu Asp Trp Leu Val Gln Ala Thr Leu Pro Glu Thr Gly
4770 4775 4780
Gly Gln Ala Asn His Tyr Gly Leu His Pro Ala Leu Leu Asp Ala Val
4785 4790 4795 4800
Leu His Ala Met Thr Thr Gly Phe Asp Thr Ser Gly Lys Ala Gly Val
4805 4810 4815
Glu Ala Ala Ala Gly Pro Leu Leu Pro Phe Ala Trp Glu Gly Val Gln
4820 4825 4830
Leu His Ala Val Gly Ala Ser Thr Val His Ala Arg Ile Thr Pro Leu
4835 4840 4845
Gly His Asn Thr Val Arg Val Thr Val Thr Asp Pro Asp Gly Leu Pro
4850 4855 4860
Val Leu Thr Ile Ala Ser Leu Thr Leu Arg Pro Val Gln Leu Asp Gln
4865 4870 4875 4880
Leu Thr Thr Ala Ala Asp Gly Gly Asp Arg Leu His Thr Leu His Trp
4885 4890 4895
Thr Pro Thr Pro Ile Pro Ala Gln Leu Arg Glu Val Glu Phe Val Glu
4900 4905 4910
Trp Asn Asn Leu Glu His Glu Ser Ala Asp Asp Pro Val Pro Pro Val
4915 4920 4925
Val Val Leu Asp Cys Arg Asp Gly Glu Asn Asn Thr Val Gly Glu Ile
4930 4935 4940
Asp Thr Asp Val Leu Val Arg Ala His Ala Ile Ser His Arg Val Leu
4945 4950 4955 4960
Gly Val Leu Gln Glu Phe Ser Thr Gly Gln Arg Phe Ala Ser Ser Thr
4965 4970 4975
Leu Leu Val Leu Thr Arg Ala Ala Val Thr Thr Thr Ala Gly Asp Arg
4980 4985 4990
Val Asp Pro Ala Ala Ser Thr Ile Trp Gly Leu Val Arg Ser Ala Gln
4995 5000 5005
Ser Glu Glu Pro Gly Arg Ile Leu Leu Ala Asp Thr Asp Ile Glu Gly
5010 5015 5020
Ser Asp Gly Val Asp Val Ala Gly Ile Val Ser Leu Ala Val Ala Val
5025 5030 5035 5040
Gly Glu Pro Gln Val Leu Ile Arg Asp Gly Ile Ala His Ile Ala Arg
5045 5050 5055
Leu Val Arg Thr Ala Gly Arg Asp Asp Lys Thr Thr Ala Ser Asp Ile
5060 5065 5070
Ser Asp Thr Asp Asp Thr Val Ser Gly Ala Gly Arg Gly Thr Val Val
5075 5080 5085
Val Thr Gly Gly Thr Gly Gly Leu Gly Arg Ile Leu Ala Arg His Leu
5090 5095 5100
Val Gly Val Arg Gly Val Arg Ser Leu Val Leu Ala Ser Arg Arg Gly
5105 5110 5115 5120
Leu Ala Ala Glu Gly Ala Arg Glu Leu Val Glu Glu Leu Thr Gly Ser
5125 5130 5135
Gly Ala Arg Val Ala Val Val Ala Cys Asp Val Ser Thr Arg Ala Gly
5140 5145 5150
Val Glu Gln Leu Leu Ala Ala Val Pro Asp Glu Asp Pro Leu Val Gly
5155 5160 5165
Val Val His Ala Ala Gly Val Leu Asp Asp Gly Val Ile Ala Ser Leu
5170 5175 5180
Thr Pro Gln Arg Leu Asp Thr Val Leu Ser Val Lys Ala Asp Ala Ala
5185 5190 5195 5200
Trp Tyr Leu His Glu Leu Thr Arg Gly Leu Asp Leu Gly Met Phe Val
5205 5210 5215
Leu Tyr Ser Ser Ala Ala Gly Val Leu Gly Ser Pro Gly Gln Gly Asn
5220 5225 5230
Tyr Ala Ala Ala Asn Gln Phe Leu Asp Gly Leu Ala Glu Tyr Arg Arg
5235 5240 5245
Ala Arg Gly Leu Ala Ala Thr Ser Ile Ala Trp Gly Leu Trp Gly Ser
5250 5255 5260
Gly Thr Gly Met Thr Gly His Leu Asp Gly Gly Asp Thr Ala Arg Met
5265 5270 5275 5280
Ser Arg Gly Gly Met Leu Ala Leu Thr Glu Ala Gln Gly Met Ala Met
5285 5290 5295
Phe Asp Thr Ala Val Thr Ala Glu His Ala Thr Val Leu Ala Ala Arg
5300 5305 5310
Phe Asp Thr Thr Val Leu Ala Ala Gln Ala Arg Ala Gly Met Leu Ala
5315 5320 5325
Pro Ile Leu His Asn Leu Val Pro Asn Ala Arg Arg Val Ala Ala Gly
5330 5335 5340
Asn Thr Gly Ser Ala Gly Ala Gly Val Ala Gly Ser Gln Leu Arg Gln
5345 5350 5355 5360
Arg Leu Ser Gly Leu Asp Glu Ala Glu Gln Val Lys Val Leu Leu Glu
5365 5370 5375
Leu Val Arg Gly Gln Val Ala Ile Val Leu Gly His Asp Asp Ala Thr
5380 5385 5390
Ala Ile Asp Ala Asp Arg Asn Phe Gln Glu Leu Gly Phe Asp Ser Leu
5395 5400 5405
Thr Ala Val Glu Leu Arg Asn Tyr Leu Val Arg Ala Thr Gly Ile Glu
5410 5415 5420
Leu Ser Pro Met Val Ile Phe Asp Gln Glu Asn Ile Ala Ser Phe Ala
5425 5430 5435 5440
Glu Tyr Leu Arg Ser Glu Phe Asp Ile Glu Arg Ser Ser Val Glu Lys
5445 5450 5455
Val Arg Pro Leu Asp Thr Leu Gly Gly Leu Phe Arg Ala Ala Leu Arg
5460 5465 5470
Ser Gly Lys Met Asp Ala Gly Tyr Asp Leu Leu Arg Ala Ala Thr Cys
5475 5480 5485
Leu Arg Glu Thr Phe Ala Ser Ser Asp Ala Val Thr Gly Pro Gln Pro
5490 5495 5500
Val Met Leu Cys Asp Gly Pro Thr Val Pro Gln Val Val Phe Val Cys
5505 5510 5515 5520
Thr Pro Val Phe Gly Gly Val Ala Glu His Ala His Leu Ser Lys Ile
5525 5530 5535
Phe Ser Gly Arg Arg Arg Val Trp Ser Val Pro Leu Leu Gly Phe Lys
5540 5545 5550
Pro Gly Glu Pro Leu Pro Glu Ser Ser Asp Val Ala Ile Glu Ser Val
5555 5560 5565
Ala Lys Ser Ile Glu Gly Val Val Gly Asp Ala Pro Phe Ile Leu Val
5570 5575 5580
Gly His Ser Ser Ala Gly His Leu Ala Tyr Ala Thr Ala Glu Arg Leu
5585 5590 5595 5600
Ala Gly Ser Cys Val Ser Lys Leu Glu Gly Ile Val Leu Leu Asp Thr
5605 5610 5615
Phe Glu Met Glu Val Gly Lys Asn Leu Pro Met Asp Gln Met Ala Asn
5620 5625 5630
Arg Thr Leu Arg Glu Glu Phe Glu Gly Gly Leu Ser Phe Glu Ser Leu
5635 5640 5645
Thr Ala Thr Ile Thr Trp Val Asp Phe Leu Ser Lys Leu Asp Tyr Val
5650 5655 5660
Ala Glu Asp His Asp Ala Leu Phe Val Gln Cys Arg Glu Pro Val Phe
5665 5670 5675 5680
Glu Leu Glu Leu Glu Gly Val Asn Ile Gly Ile Ile Ala Lys Pro Trp
5685 5690 5695
Ser Asn Ala Gln Thr Leu Ser Val Val Asp Ser Asp His Phe Ser Met
5700 5705 5710
Val Ser Ser Asp Ala Gly Lys Val Val Asp Ala Ile Glu Glu Trp Leu
5715 5720 5725
Gly Arg Ser Arg Ser
5730
<210> 3
<211> 1665
<212> PRT
<213> Nocardia vinacea
<400> 3
Met Ala Leu Val Gly Gly Val Thr Val Met Ser Thr Pro Asp Thr Phe
1 5 10 15
Val Glu Phe Ser Arg Gln Lys Gly Leu Ala Pro Asp Gly Arg Cys Lys
20 25 30
Ser Phe Ala Glu Ala Ala Asp Gly Ala Gly Trp Ser Glu Gly Val Gly
35 40 45
Val Leu Val Val Glu Arg Leu Ser Asp Ala Arg Arg Arg Gly His Gln
50 55 60
Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser
65 70 75 80
Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Arg
85 90 95
Arg Ala Leu Ala Asn Ala Gly Val Ala Ala Thr Glu Val Asp Val Val
100 105 110
Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala Gln
115 120 125
Ala Leu Leu Ala Thr Tyr Gly Gln Asn Arg Glu Pro Asp Arg Pro Leu
130 135 140
Trp Leu Gly Ser Leu Lys Ser Asn Ile Gly His Ala Gln Ala Ala Ala
145 150 155 160
Gly Val Ala Gly Val Ile Lys Met Ile Glu Ala Met Arg His Glu Thr
165 170 175
Leu Pro Lys Thr Leu His Val Asp Thr Pro Thr Thr His Val Asp Trp
180 185 190
Thr Ala Gly Ala Val Glu Leu Leu Thr Glu Ser Arg Ala Trp Thr Val
195 200 205
Glu Ala Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly Ile Ser
210 215 220
Gly Thr Asn Ala His Val Ile Leu Glu Gln Ser Pro Pro Val Thr Pro
225 230 235 240
Asp Thr Glu Ser Ser Ala Pro Asp Thr Asp Pro Val Pro Ala Val Lys
245 250 255
Ser Asp Ala Val Val Trp Met Val Ser Gly Arg Thr Gly Glu Gly Leu
260 265 270
Leu Ala Gln Gly Arg Arg Leu His Glu Trp Met Leu Ala Arg Pro Gly
275 280 285
Leu Asp Ala Val Asp Val Gly Trp Ser Leu Ile Asn Thr Arg Ala Arg
290 295 300
Leu Glu His Arg Ala Val Leu Val Gly Ala Asp Arg Glu Glu Leu Met
305 310 315 320
Thr Arg Leu Gln Gly Leu Ile Asp Gly Asp Pro Ala Val Ala Ala Gly
325 330 335
Val Ser Arg Asp Arg Gly Lys Thr Val Phe Val Phe Pro Gly Gln Gly
340 345 350
Ala Gln Leu Leu Gly Met Gly Ser Gly Leu Tyr Glu Ala Phe Pro Val
355 360 365
Phe Ala Ala Ser Phe Asp Glu Thr Thr Ala Leu Leu Glu Gln Gln Leu
370 375 380
Glu Cys Ser Leu Arg Asp Val Val Trp Gly Val Asp Glu Gln Ala Leu
385 390 395 400
Gln Ala Thr Leu Tyr Thr Gln Thr Gly Leu Phe Ala Val Gly Ile Ala
405 410 415
Leu Phe Arg Leu Leu Glu Ser Phe Gly Val Arg Pro Asp Phe Val Ala
420 425 430
Gly His Ser Ile Gly Glu Leu Ala Ala Ala Thr Val Ala Gly Val Leu
435 440 445
Ser Leu Glu Asp Ala Thr Val Leu Val Ala Ala Arg Ala Arg Leu Met
450 455 460
Gln Gln Leu Pro Thr Gly Gly Ala Met Leu Ala Met Arg Ala Ser Glu
465 470 475 480
Thr Gln Ile Thr Thr Leu Leu Gly Asp Ser Ile Glu Asp Gly Val Val
485 490 495
Glu Ile Ala Ala Val Asn Gly Pro Glu Ser Ile Val Leu Ala Gly Pro
500 505 510
Gln His Ala Ile Asp Thr Thr Glu Gln Gln Leu Arg Gln Ala Gly Tyr
515 520 525
Gln Val Asn Arg Leu Arg Val Ser His Ala Phe His Ser Ala Ser Met
530 535 540
Glu Pro Met Leu Ala Glu Phe Ala Arg Ile Ala Thr Glu Leu Thr Tyr
545 550 555 560
Thr Gln Pro Val Ile Pro Ile Ile Ser Asn Leu Asp Gly Gln Leu Thr
565 570 575
Gly Pro Asn Thr Asp Ser Pro Asn Thr Asp Ala Gln Gln Ala Asp Ser
580 585 590
Pro Leu Thr Thr Pro Gln Tyr Trp Val Asp His Val Arg Asn Thr Val
595 600 605
Arg Phe Ala Asp Gly Ile Thr Thr Leu Thr Thr Ala Gly Ala Thr Arg
610 615 620
Tyr Val Ile Met Gly Pro Asp Gly Gly Leu Ser Gly Leu Ile Asp Glu
625 630 635 640
Thr Leu Gln His Thr Thr Ser Asp Ala Val Asp Thr Lys Pro Thr Val
645 650 655
Asp Gly Val Glu Ala Val Val Ala Ser Leu Leu Arg Lys Asp Arg Val
660 665 670
Glu Asp Thr Thr Leu Leu Ser Ala Leu Ala Arg Leu Asp Val Ala Gly
675 680 685
Thr Gly Ile Asp Trp Thr Pro Ile Phe His Gly Arg Gly Ala Thr Arg
690 695 700
Val Pro Leu Pro Ser Tyr Ala Phe Gln His Arg Arg Tyr Trp Leu Asp
705 710 715 720
Thr Ile Thr Gly Asn Thr Asp Pro Asp Ser Leu Gly Leu Ser Gly Leu
725 730 735
Asp His Pro Leu Ile Gly Ala Val Val Val Ser Pro Glu Thr Gly Ala
740 745 750
Val Thr Val Thr Gly Arg Leu Ser Leu Gln Thr His Pro Trp Leu Ala
755 760 765
Asp His Ala Val Gly Gly Val Val Leu Leu Pro Gly Thr Gly Leu Val
770 775 780
Glu Leu Val Ile Arg Ala Gly Asp Glu Val Gly Cys Gly Ala Ile Arg
785 790 795 800
Glu Leu Thr Leu Leu Ala Pro Leu Thr Leu Pro Ala Glu Gly Gly Thr
805 810 815
Ala Ile Gln Val Leu Val Gly Ala Leu Glu Thr Ser Gly Thr Arg Thr
820 825 830
Val Ser Val Tyr Ser Gln Thr Arg Asp Gln Glu Trp Val Leu Asn Ala
835 840 845
Gln Gly Leu Leu His Thr Gln Ser Pro Val Glu Asn Leu Thr Thr Thr
850 855 860
Thr Pro Val Asp Thr Gly Leu Ala Val Trp Pro Pro Gln Asn Ala Thr
865 870 875 880
Arg Thr Asp Thr Ser Ser Leu Tyr Gln Gln Leu Ala Glu Asp Gly Tyr
885 890 895
Gly Tyr Gly Pro Ala Phe Gln Gly Leu Glu Ser Val Trp Arg Thr Gly
900 905 910
Glu Asp Trp Leu Val Gln Ala Arg Leu Pro Glu Thr Gly Gly Asp Ala
915 920 925
His His Tyr Gly Leu His Pro Ala Leu Leu Asp Ala Val Leu His Ala
930 935 940
Met Thr Thr Gly His Asp Thr Asp Thr Asp Thr Asp Thr Ser Val Gly
945 950 955 960
Pro Leu Leu Pro Phe Ala Trp Glu Gly Val Gln Leu His Ala Val Gly
965 970 975
Ala Ser Thr Val His Ala Arg Ile Thr Pro His Gly His Asn Thr Val
980 985 990
Ser Val Thr Val Thr Asp Pro Asp Gly Leu Pro Val Leu Thr Ile Ala
995 1000 1005
Ser Leu Thr Leu Arg Pro Val Gln Leu Asp Gln Leu Thr Thr Ala Ala
1010 1015 1020
Asp Gly Gly Asp Arg Leu His Thr Leu His Trp Thr Pro Thr Pro Ile
1025 1030 1035 1040
Pro Ala Gln Leu Arg Glu Val Glu Phe Val Glu Trp Asn Asn Leu Glu
1045 1050 1055
His Glu Ser Ala Asp Asp Pro Val Pro Pro Val Val Val Leu Asp Cys
1060 1065 1070
Arg Asp Gly Glu Asn Asn Thr Val Gly Glu Ile Asp Thr Asp Val Leu
1075 1080 1085
Val Arg Ala His Ala Ile Ser His Arg Val Leu Gly Val Leu Gln Glu
1090 1095 1100
Phe Ser Thr Gly Gln Arg Phe Ala Ser Ser Thr Leu Leu Val Leu Thr
1105 1110 1115 1120
Arg Ala Ala Val Thr Thr Thr Ala Gly Asp Arg Val Asp Pro Ala Ala
1125 1130 1135
Ser Thr Ile Trp Gly Leu Val Arg Ser Ala Gln Ser Glu Glu Pro Gly
1140 1145 1150
Arg Ile Leu Leu Ala Asp Thr Asp Ile Glu Gly Ser Asp Gly Val Asp
1155 1160 1165
Val Ala Gly Ile Val Ser Leu Ala Val Ala Val Gly Glu Pro Gln Val
1170 1175 1180
Leu Ile Arg Asp Gly Ile Ala His Ile Ala Arg Leu Val Arg Thr Ala
1185 1190 1195 1200
Gly Arg Asp Asp Lys Thr Thr Ala Ser Asp Ile Ser Asp Thr Asp Asp
1205 1210 1215
Thr Val Ser Gly Ala Gly Arg Gly Thr Val Val Val Thr Gly Gly Thr
1220 1225 1230
Gly Gly Leu Gly Arg Ile Leu Ala Arg His Leu Val Gly Val Arg Gly
1235 1240 1245
Val Arg Ser Leu Val Leu Ala Ser Arg Arg Gly Leu Ala Ala Glu Gly
1250 1255 1260
Ala Arg Glu Leu Val Glu Glu Leu Thr Gly Ser Gly Ala Arg Val Ala
1265 1270 1275 1280
Val Val Ala Cys Asp Val Ser Thr Arg Ala Gly Val Glu Gln Leu Leu
1285 1290 1295
Ala Ala Val Pro Asp Glu Asp Pro Leu Val Gly Val Val His Ala Ala
1300 1305 1310
Gly Val Leu Asp Asp Gly Val Ile Ala Ser Leu Thr Pro Gln Arg Leu
1315 1320 1325
Asp Thr Ala Leu Ser Val Lys Ala Asp Ala Ala Trp Tyr Leu His Glu
1330 1335 1340
Leu Thr Arg Gly Leu Asp Leu Gly Met Phe Val Met Tyr Ser Ser Thr
1345 1350 1355 1360
Ala Gly Val Leu Gly Ser Pro Gly Gln Gly Asn Tyr Ala Ala Ala Asn
1365 1370 1375
Gln Phe Leu Asp Gly Leu Ala Glu Tyr Arg Arg Ala Arg Gly Leu Ala
1380 1385 1390
Ala Thr Ser Ile Ala Trp Gly Leu Trp Gly Ser Gly Thr Gly Met Thr
1395 1400 1405
Gly His Leu Asp Gly Gly Asp Thr Ala Arg Met Ser Arg Gly Gly Met
1410 1415 1420
Leu Ala Leu Thr Glu Ala Gln Gly Met Ala Met Phe Asp Thr Ala Val
1425 1430 1435 1440
Thr Ala Glu His Ala Thr Val Leu Ala Ala Arg Phe Asp Thr Thr Val
1445 1450 1455
Leu Ala Ala Gln Ala Arg Ala Gly Met Leu Ala Pro Ile Leu His Asn
1460 1465 1470
Leu Val Pro Asn Ala Arg Arg Val Ala Ala Gly Asn Thr Gly Ser Ala
1475 1480 1485
Gly Ala Gly Val Ala Gly Ser Gln Leu Arg Gln Arg Leu Ser Gly Leu
1490 1495 1500
Asp Glu Ala Glu Gln Val Lys Val Leu Leu Glu Leu Val Arg Gly Gln
1505 1510 1515 1520
Val Ala Ile Val Leu Gly His Asp Asp Ala Thr Ala Ile Asp Ala Asp
1525 1530 1535
Arg Asn Phe Gln Glu Leu Gly Phe Asp Ser Leu Thr Ala Ile Glu Ala
1540 1545 1550
Arg Lys Arg Leu Lys Thr Ala Thr Glu Val Ala Val Pro Ala Thr Leu
1555 1560 1565
Ile Phe Asp Tyr Pro Thr Pro Arg Ala Val Ala Glu His Leu His Gln
1570 1575 1580
Lys Leu Ala Gly Asn Asp His Phe Asn Gly Pro Ser Arg Glu Asp Glu
1585 1590 1595 1600
Ala Val Val Ser Glu Phe Leu Ala Asn Ala Ser Met Glu Arg Leu Arg
1605 1610 1615
Ser Ala Gly Ile Phe Asp His Leu Leu Arg Phe Ala Met Glu Asp Arg
1620 1625 1630
Ser Asn Ser Val Ala Lys Gly Gly Asp Met Thr Gln Asp Ala Val Asp
1635 1640 1645
Glu Met Asp Pro Glu Ser Leu Val Arg Phe Ile Met Gln Arg Asp Met
1650 1655 1660
Asn
1665
<210> 4
<211> 3776
<212> PRT
<213> Nocardia vinacea
<400> 4
Met Ser Asp Gly Ser Gln Ser Leu Glu Tyr Leu Lys Arg Thr Ala Ile
1 5 10 15
Ser Leu Arg Glu Thr Arg Lys Arg Leu His Glu Leu Glu Tyr Arg Ala
20 25 30
His Glu Pro Val Ala Ile Val Gly Val Gly Cys His Phe Pro Gly Gly
35 40 45
Val Ser Ser Arg Glu Asp Leu Trp Gln Val Leu Ala Glu Gly Arg Asp
50 55 60
Val Leu Ser Gln Trp Pro Leu Asp Arg Gly Trp Asp Thr Gly Leu Phe
65 70 75 80
Asp Pro Glu Pro Gly Val Val Gly Lys Ser Tyr Thr Arg Glu Gly Gly
85 90 95
Phe Leu His Asp Ala Gly Leu Phe Asp Ala Gly Phe Phe Gly Ile Ser
100 105 110
Pro Arg Glu Ala Val Ala Met Asp Pro Gln Gln Arg Leu Leu Leu Glu
115 120 125
Thr Val Trp Glu Ala Leu Glu Asp Ala Gly Val Asp Pro Val Ser Leu
130 135 140
Arg Gly Ser Glu Thr Gly Val Phe Ile Gly Val Ser Asp Gln Ser Tyr
145 150 155 160
Gly Ile Gly Arg Ser Asp Gly Asp Ala Gly Val Glu Gly Tyr Arg Leu
165 170 175
Ile Gly Ala Thr Ser Ser Val Val Ser Gly Arg Val Ser Tyr Val Leu
180 185 190
Gly Leu Glu Gly Pro Ala Val Ser Val Asp Thr Ala Cys Ser Ser Ser
195 200 205
Leu Val Ala Leu His Gln Ala Val Gln Ala Val Arg Ala Gly Glu Cys
210 215 220
Gly Met Ala Leu Val Gly Gly Val Thr Val Met Ser Val Pro Asp Thr
225 230 235 240
Phe Val Glu Phe Ser Arg Gln Lys Gly Leu Ala Pro Asp Gly Arg Cys
245 250 255
Lys Ser Phe Ala Glu Ala Ala Asp Gly Thr Gly Trp Ser Glu Gly Val
260 265 270
Gly Val Leu Val Val Glu Arg Leu Ser Asp Ala Arg Arg Arg Gly His
275 280 285
Gln Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala
290 295 300
Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile
305 310 315 320
Arg Arg Ala Leu Ala Asn Ala Gly Val Ala Ala Thr Glu Val Asp Val
325 330 335
Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala
340 345 350
Gln Ala Leu Leu Ala Thr Tyr Gly Gln Gly Arg Glu Pro Asp Arg Pro
355 360 365
Leu Trp Leu Gly Ala Leu Lys Ser Asn Ile Gly His Thr Gln Ala Ala
370 375 380
Ala Gly Val Ala Gly Val Ile Lys Met Ile Glu Ala Met Arg His Glu
385 390 395 400
Thr Leu Pro Arg Thr Leu His Val Asp Ala Pro Thr Thr His Val Asp
405 410 415
Trp Asn Ile Gly Ala Val Glu Leu Leu Thr Glu Ser Arg Ala Trp Thr
420 425 430
Val Glu Ala Asp Arg Pro Arg Arg Ala Ala Val Ser Ala Phe Gly Ile
435 440 445
Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln Ala Pro Pro Val Thr
450 455 460
Glu Thr Pro Asp Thr Glu Ala Pro Asp Thr Glu Ser Ser Ala Pro Asp
465 470 475 480
Thr Asp Pro Val Pro Val Val Lys Ser Gly Gly Leu Val Trp Val Leu
485 490 495
Ser Gly Arg Ser Ser Asp Gly Leu Leu Ala Gln Gly Arg Arg Leu Gln
500 505 510
Glu Trp Met Leu Ala Arg Pro Gly Leu Asp Ala Val Asp Val Gly Trp
515 520 525
Ser Leu Ile Asn Thr Arg Ala Arg Leu Glu His Arg Ala Val Leu Val
530 535 540
Gly Ala Asp Arg Glu Glu Leu Met Thr Arg Leu Gln Gly Leu Ile Asp
545 550 555 560
Gly Asp Pro Ala Val Ala Ala Gly Val Ser Arg Asp Arg Gly Lys Thr
565 570 575
Val Phe Val Phe Pro Gly Gln Gly Ala Gln Leu Leu Gly Met Gly Ser
580 585 590
Gly Leu Tyr Glu Ala Phe Pro Val Phe Ala Ala Ser Phe Asp Glu Thr
595 600 605
Thr Ala Leu Leu Glu Gln Gln Leu Glu Cys Ser Leu Arg Asp Val Val
610 615 620
Trp Gly Val Asp Glu Gln Ala Leu Gln Ala Thr Leu Tyr Thr Gln Thr
625 630 635 640
Gly Leu Phe Ala Val Gly Ile Ala Leu Phe Arg Leu Leu Glu Ser Phe
645 650 655
Gly Val Arg Pro Asp Phe Val Ala Gly His Ser Ile Gly Glu Leu Ala
660 665 670
Ala Ala Thr Val Ala Gly Val Leu Ser Leu Glu Asp Ala Thr Val Leu
675 680 685
Val Ala Ala Arg Ala Arg Leu Met Gln Gln Leu Pro Thr Gly Gly Ala
690 695 700
Met Leu Ala Met Arg Ala Ser Glu Thr Gln Ile Thr Thr Leu Leu Gly
705 710 715 720
Asp Ser Ile Glu Asp Gly Val Val Glu Ile Ala Ala Val Asn Gly Pro
725 730 735
Glu Ser Ile Val Leu Ala Gly Pro Gln His Ala Ile Asp Thr Thr Glu
740 745 750
Gln Gln Leu Arg Gln Ala Gly Tyr Gln Val Asn Arg Leu Arg Val Ser
755 760 765
His Ala Phe His Ser Ala Ser Met Glu Pro Met Leu Ala Glu Phe Ala
770 775 780
Arg Ile Ala Thr Glu Leu Thr Tyr Thr Gln Pro Val Ile Pro Ile Ile
785 790 795 800
Ser Asn Leu Asp Gly Gln Leu Thr Gly Pro Asn Thr Asp Ser Pro Asn
805 810 815
Thr Asp Ala Gln Gln Ala Asp Ser Pro Leu Thr Thr Pro Gln Tyr Trp
820 825 830
Val Asp His Val Arg Asn Thr Val Arg Phe Ala Asp Gly Ile Thr Thr
835 840 845
Leu Thr Thr Ala Gly Ala Thr Arg Tyr Val Ile Met Gly Pro Asp Gly
850 855 860
Gly Leu Ser Gly Leu Ile Asp Glu Thr Leu Gln His Thr Thr Ser Asp
865 870 875 880
Ala Val Asp Thr Lys Pro Thr Val Asp Gly Val Glu Ala Val Val Ala
885 890 895
Ser Leu Leu Arg Lys Asp Arg Val Glu Asp Thr Thr Leu Leu Ser Ala
900 905 910
Leu Ala Arg Leu Asp Val Ala Gly Thr Gly Ile Asp Trp Thr Pro Ile
915 920 925
Phe His Gly Arg Gly Ala Thr Arg Val Pro Leu Pro Ser Tyr Ala Phe
930 935 940
Gln His Arg Arg Tyr Trp Leu Asp Thr Ile Thr Gly Asn Thr Asp Pro
945 950 955 960
Asp Ser Leu Gly Leu Ser Gly Leu Asp His Pro Leu Ile Gly Ala Val
965 970 975
Val Val Ser Pro Glu Thr Gly Ala Val Thr Val Thr Gly Arg Leu Ser
980 985 990
Leu Gln Thr His Pro Trp Leu Ala Asp His Ala Val Gly Gly Val Val
995 1000 1005
Leu Leu Pro Gly Thr Gly Leu Val Glu Leu Val Ile Arg Ala Gly Asp
1010 1015 1020
Glu Val Gly Cys Gly Ala Ile Arg Glu Leu Thr Leu Leu Ala Pro Leu
1025 1030 1035 1040
Thr Leu Pro Ala Glu Gly Gly Thr Ala Ile Gln Val Leu Val Gly Ala
1045 1050 1055
Leu Glu Thr Ser Gly Thr Arg Thr Val Ser Val Tyr Ser Gln Thr Arg
1060 1065 1070
Asp Gln Glu Trp Val Leu Asn Ala Gln Gly Leu Leu His Thr Gln Ser
1075 1080 1085
Pro Val Glu Asn Leu Thr Thr Thr Thr Pro Val Asp Thr Gly Leu Ala
1090 1095 1100
Val Trp Pro Pro Gln Asn Ala Thr Arg Thr Asp Thr Ser Ser Leu Tyr
1105 1110 1115 1120
Gln Gln Leu Ala Glu Asp Gly Tyr Gly Tyr Gly Pro Ala Phe Gln Gly
1125 1130 1135
Leu Glu Ser Val Trp Arg Thr Gly Glu Asp Trp Leu Val Gln Ala Arg
1140 1145 1150
Leu Pro Glu Thr Gly Gly Asp Ala His His Tyr Gly Leu His Pro Ala
1155 1160 1165
Leu Leu Asp Ala Val Leu His Ala Met Thr Thr Gly His Asp Thr Asp
1170 1175 1180
Thr Asp Thr Asp Thr Ser Val Gly Pro Leu Leu Pro Phe Ala Trp Glu
1185 1190 1195 1200
Gly Val Gln Leu His Ala Val Gly Ala Ser Thr Val His Ala Arg Ile
1205 1210 1215
Thr Pro Leu Gly His Asn Thr Val Arg Val Thr Val Thr Asp Pro Asp
1220 1225 1230
Gly Gln Pro Val Leu Thr Ile Ala Ser Leu Thr Leu Arg Pro Val Gln
1235 1240 1245
Leu Asp Gln Leu Thr Thr Ala Ala Asp Gly Gly Asp Arg Leu His Thr
1250 1255 1260
Leu His Trp Thr Pro Thr Pro Ile Pro Ala Gln Leu Arg Glu Val Glu
1265 1270 1275 1280
Phe Val Glu Trp Asn Asn Leu Glu His Glu Ser Ala Asp Asp Pro Val
1285 1290 1295
Pro Pro Val Val Val Leu Asp Cys Arg Asp Gly Glu Asn Asn Thr Val
1300 1305 1310
Gly Glu Ile Asp Thr Asp Val Leu Val Arg Ala His Ala Ile Ser His
1315 1320 1325
Arg Val Leu Gly Val Leu Gln Glu Phe Ser Thr Gly Gln Arg Phe Ala
1330 1335 1340
Ser Ser Thr Leu Leu Val Leu Thr Arg Ala Ala Val Thr Thr Thr Ala
1345 1350 1355 1360
Gly Asp Arg Val Asp Pro Ala Ala Ser Thr Ile Trp Gly Leu Val Arg
1365 1370 1375
Ser Ala Gln Ser Glu Glu Pro Gly Arg Ile Leu Leu Ala Asp Thr Asp
1380 1385 1390
Ile Glu Gly Ser Asp Gly Val Asp Val Ala Gly Ile Val Ser Leu Ala
1395 1400 1405
Val Ala Val Gly Glu Pro Gln Val Leu Ile Arg Asp Gly Ile Ala His
1410 1415 1420
Ile Ala Arg Leu Val Arg Thr Ala Gly Arg Asp Asp Lys Thr Thr Ala
1425 1430 1435 1440
Ser Asp Ile Ser Asp Thr Asp Asp Thr Val Ser Gly Ala Gly Arg Gly
1445 1450 1455
Thr Val Val Val Thr Gly Gly Thr Gly Gly Leu Gly Arg Ile Leu Ala
1460 1465 1470
Arg His Leu Val Gly Val Arg Gly Val Arg Ser Leu Val Leu Ala Ser
1475 1480 1485
Arg Arg Gly Leu Ala Ala Glu Gly Ala Arg Glu Leu Val Glu Glu Leu
1490 1495 1500
Thr Gly Ser Gly Ala Arg Val Ala Val Val Ala Cys Asp Val Ser Thr
1505 1510 1515 1520
Arg Ala Gly Val Glu Gln Leu Leu Ala Ala Val Pro Asp Glu Asp Pro
1525 1530 1535
Leu Val Gly Val Val His Ala Ala Gly Val Leu Asp Asp Gly Val Ile
1540 1545 1550
Ala Ser Leu Thr Pro Gln Arg Leu Asp Thr Val Leu Ser Val Lys Ala
1555 1560 1565
Asp Ala Ala Trp Tyr Leu His Glu Leu Thr Arg Gly Leu Asp Leu Gly
1570 1575 1580
Met Phe Val Leu Tyr Ser Ser Ala Ala Gly Val Leu Gly Ser Pro Gly
1585 1590 1595 1600
Gln Gly Asn Tyr Ala Ala Ala Asn Gln Phe Leu Asp Gly Leu Ala Glu
1605 1610 1615
Tyr Arg Arg Ala Arg Gly Leu Ala Ala Thr Ser Ile Ala Trp Gly Leu
1620 1625 1630
Trp Gly Ser Gly Thr Gly Met Thr Gly His Leu Asp Gly Gly Asp Thr
1635 1640 1645
Ala Arg Met Ser Arg Gly Gly Met Leu Ala Leu Thr Glu Ala Gln Gly
1650 1655 1660
Met Ala Met Phe Asp Thr Ala Val Thr Ala Glu His Ala Thr Val Leu
1665 1670 1675 1680
Ala Ala Arg Phe Asp Thr Thr Val Leu Ala Ala Gln Ala Arg Ala Gly
1685 1690 1695
Met Leu Ala Pro Ile Leu His Asn Leu Val Pro Asn Ala Arg Arg Val
1700 1705 1710
Ala Ala Gly Asn Thr Gly Ser Ala Gly Ala Gly Val Ala Gly Ser Gln
1715 1720 1725
Leu Arg Gln Arg Leu Ser Gly Leu Asp Glu Ala Glu Gln Val Lys Val
1730 1735 1740
Leu Leu Glu Leu Val Arg Gly Gln Val Ala Ile Val Leu Gly His Asp
1745 1750 1755 1760
Asp Ala Thr Ala Ile Asp Ala Asp Arg Asn Phe Gln Glu Leu Gly Phe
1765 1770 1775
Asp Ser Leu Thr Ala Val Glu Ala Arg Asn Arg Leu Lys Thr Ala Thr
1780 1785 1790
Gly Val Ala Val Ala Ala Thr Leu Ile Phe Asp Tyr Pro Thr Pro Arg
1795 1800 1805
Ala Val Ala Glu His Leu His Gln Gln Leu Ala Gly Ala Ser Val Ala
1810 1815 1820
Ala Glu Pro Val Val Val Val Gly His Ser Ala Glu Pro Ile Ala Ile
1825 1830 1835 1840
Val Gly Val Gly Cys Arg Phe Pro Gly Gly Val Ser Ser Arg Glu Glu
1845 1850 1855
Leu Trp Gln Val Val Ala Gln Gly Arg Asp Val Val Ser Gln Trp Pro
1860 1865 1870
Leu Asp Arg Gly Trp Asp Gly Gly Leu Phe Asp Pro Glu Pro Gly Val
1875 1880 1885
Ala Gly Arg Ser Tyr Thr Arg Glu Gly Gly Phe Leu His Asp Ala Gly
1890 1895 1900
Leu Phe Asp Ala Gly Phe Phe Gly Ile Ser Pro Arg Glu Ala Val Ala
1905 1910 1915 1920
Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Thr Val Trp Glu Ala Leu
1925 1930 1935
Glu Asp Ala Gly Val Asp Pro Val Ser Leu Arg Gly Ser Asp Thr Gly
1940 1945 1950
Val Phe Met Gly Val Met Tyr His Asp Tyr Pro Ala Ser Ala Val Gly
1955 1960 1965
Gly Ser Val Val Ser Gly Arg Val Ser Tyr Val Leu Gly Leu Glu Gly
1970 1975 1980
Pro Ala Val Ser Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu
1985 1990 1995 2000
His Gln Ala Val Gln Ala Val Arg Ala Gly Glu Cys Gly Met Ala Leu
2005 2010 2015
Val Gly Gly Val Thr Val Met Ser Thr Pro Asp Thr Phe Val Glu Phe
2020 2025 2030
Ser Arg Gln Lys Gly Leu Ala Pro Asp Gly Arg Cys Lys Ser Phe Ala
2035 2040 2045
Glu Ala Ala Asp Gly Ala Gly Trp Ser Glu Gly Val Gly Val Leu Val
2050 2055 2060
Val Glu Arg Leu Ser Asp Ala Arg Arg Arg Gly His Gln Val Leu Ala
2065 2070 2075 2080
Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu
2085 2090 2095
Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Arg Arg Ala Leu
2100 2105 2110
Ala Asn Ala Gly Val Ala Ala Thr Glu Val Asp Val Val Glu Ala His
2115 2120 2125
Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu
2130 2135 2140
Ala Thr Tyr Gly Gln Asn Arg Glu Pro Asp Arg Pro Leu Trp Leu Gly
2145 2150 2155 2160
Ser Leu Lys Ser Asn Ile Gly His Ala Gln Ala Ala Ala Gly Val Ala
2165 2170 2175
Gly Val Ile Lys Met Ile Glu Ala Met Arg His Glu Thr Leu Pro Lys
2180 2185 2190
Thr Leu His Val Asp Thr Pro Ser Ser His Val Asp Trp Thr Thr Gly
2195 2200 2205
Thr Val Glu Leu Leu Thr Gln Glu Gln Pro Trp Pro Arg Asn Gly His
2210 2215 2220
Pro Leu Arg Ala Gly Ile Ser Ser Phe Gly Val Ser Gly Thr Asn Ala
2225 2230 2235 2240
His Val Ile Leu Glu Gln Ala Pro Ala Val Val Glu Pro Val Pro Gly
2245 2250 2255
Thr Glu Ser Leu Val Pro Val Thr Ala Gly Gly Leu Val Trp Val Leu
2260 2265 2270
Ser Gly Arg Thr Gly Glu Gly Leu Leu Ala Gln Gly Arg Arg Leu Gln
2275 2280 2285
Glu Trp Met Leu Thr Arg Pro Gly Leu Asp Ala Val Asp Val Gly Trp
2290 2295 2300
Ser Leu Ile Asn Thr Arg Ala Arg Leu Glu His Arg Ala Val Leu Val
2305 2310 2315 2320
Gly Ala Asp Arg Glu Glu Leu Met Thr Arg Leu Gln Ala Leu Ile Asp
2325 2330 2335
Ser Glu Pro Gly Met Leu Ala Gly Ala Gly Met Phe Ala Gly Pro Gly
2340 2345 2350
Val Val Ser Gly Val Ala Gly Gly Val Gly Lys Thr Val Leu Val Phe
2355 2360 2365
Pro Gly Gln Gly Ala Gln Trp Leu Gly Met Gly Ala Arg Leu Leu Gln
2370 2375 2380
Glu Ser Val Val Phe Glu Gln Lys Val Leu Glu Cys Ala Glu Val Phe
2385 2390 2395 2400
Ala Pro Leu Val Glu Trp Ser Leu Ile Asp Val Leu Gln Gly Thr Ala
2405 2410 2415
Asp Pro Met Leu Leu Glu Arg Val Asp Val Val Gln Pro Ala Leu Phe
2420 2425 2430
Ala Val Met Val Ser Leu Ala Glu Val Trp Arg Ser Phe Gly Val Val
2435 2440 2445
Pro Asp Ala Val Leu Gly His Ser Gln Gly Glu Ile Ala Ala Ala Cys
2450 2455 2460
Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Arg Val Val Ile Leu
2465 2470 2475 2480
Arg Ser Arg Ala Leu Arg Glu Leu Ser Gly Arg Gly Gly Met Ala Ser
2485 2490 2495
Val Leu Leu Pro Thr Thr Leu Val Glu Gln Arg Leu Thr Asp Met Pro
2500 2505 2510
Gly Leu Ala Val Ala Ala Val Asn Gly Pro Thr Thr Thr Val Val Ser
2515 2520 2525
Gly Leu Thr Glu Gln Leu Asp Ala Phe Val Ala Ala Cys Glu Ser Asp
2530 2535 2540
Gly Val Gln Val Arg Arg Ile Ala Val Asp Tyr Ala Ser His Ser Pro
2545 2550 2555 2560
Gln Val Glu Ser Leu Arg Gln Arg Leu Leu Glu Glu Leu Ala Thr Ile
2565 2570 2575
Thr Pro Arg Pro Ser Arg Ile Ala Phe Tyr Ser Thr Val Thr Gly Thr
2580 2585 2590
Leu Leu Asp Thr Thr Glu Leu Asp Ala Gly Tyr Trp Phe Arg Asn Leu
2595 2600 2605
Arg Asp Thr Val Asn Phe Ala Ala Thr Val Gln Thr Leu Leu Ser Glu
2610 2615 2620
Gly His Thr Val Phe Val Glu Ala Ser Pro His Pro Val Leu Thr Pro
2625 2630 2635 2640
Gly Ile Glu Glu Leu Gly Glu Gln Thr Gly Pro Arg Thr Arg Asp Ile
2645 2650 2655
Val Val Thr Gly Ser Leu Arg Arg Asp Asp Gly Gly Leu Asp Arg Leu
2660 2665 2670
Leu Ser Ala Leu Ala Met Val Asp Val Ala Gly Ala Gly Val Asp Trp
2675 2680 2685
Thr Pro Ile Phe Asp Gly Arg Gly Ala Thr Arg Val Ala Leu Pro Ser
2690 2695 2700
Tyr Ala Phe Gln His Arg Arg Tyr Trp Leu Asp Thr Leu Thr Ala Ser
2705 2710 2715 2720
Gly Asn Pro Asp Ser Leu Gly Gln Thr Ala Leu Asp His Pro Leu Ile
2725 2730 2735
Gly Ala Val Val Val Ser Pro Glu Thr Gly Ala Val Thr Val Thr Gly
2740 2745 2750
Arg Leu Ser Leu Gln Thr His Pro Trp Leu Ala Asp His Ala Val Gly
2755 2760 2765
Gly Val Val Leu Leu Pro Gly Thr Gly Leu Val Glu Leu Val Ile Arg
2770 2775 2780
Ala Gly Asp Glu Val Gly Cys Gly Ala Ile Arg Glu Leu Thr Leu Leu
2785 2790 2795 2800
Ala Pro Leu Thr Leu Pro Ala Glu Gly Gly Thr Ala Ile Gln Val Leu
2805 2810 2815
Val Gly Ala Leu Glu Thr Ser Gly Thr Arg Thr Val Ser Val Tyr Ser
2820 2825 2830
Gln Thr Arg Asp Gln Glu Trp Val Leu Asn Ala Gln Gly Leu Leu His
2835 2840 2845
Thr Gln Ser Pro Val Glu Asn Leu Thr Thr Thr Thr Pro His Asp Val
2850 2855 2860
Asp Ala Gly Leu Ala Ala Trp Pro Pro Ala Gly Ala Val His Thr Asp
2865 2870 2875 2880
Thr Ser Ser Leu Tyr Gln Gln Leu Ala Glu Asp Gly Tyr Gly Tyr Gly
2885 2890 2895
Pro Ala Phe Gln Gly Leu Glu Ser Val Trp Arg Thr Gly Glu Asp Trp
2900 2905 2910
Leu Val Gln Ala Thr Leu Pro Glu Thr Gly Gly Gln Ala Asn His Tyr
2915 2920 2925
Gly Leu His Pro Ala Leu Leu Asp Ala Val Leu His Ala Met Thr Thr
2930 2935 2940
Gly Phe Asp Thr Ser Gly Lys Ala Gly Val Glu Ala Ala Ala Gly Pro
2945 2950 2955 2960
Leu Leu Pro Phe Ala Trp Glu Gly Val Gln Leu His Ala Val Gly Ala
2965 2970 2975
Ser Thr Val His Ala Arg Ile Thr Pro Leu Gly His Asn Thr Val Arg
2980 2985 2990
Val Thr Val Thr Asp Pro Asp Gly Leu Pro Val Leu Thr Ile Ala Ser
2995 3000 3005
Leu Thr Leu Arg Pro Val Gln Leu Asp Gln Leu Ala Ile Ala Thr Gly
3010 3015 3020
Ser Gly Asp Arg Leu His Thr Leu His Trp Thr Ala Thr Pro Thr Gln
3025 3030 3035 3040
Pro Gln Asp Val Ala Phe Val Glu Trp Asp Asp Leu Gln Ala Glu Ser
3045 3050 3055
Thr Asp His Pro Ala Pro Gln Ile Val Val Leu Asp Cys Arg Ser Gly
3060 3065 3070
Arg Asn Asp Thr Asp Asn Asp Gly Gly Asn Gly Thr Asp Val Leu Ala
3075 3080 3085
Arg Ala His Ala Ile Ser His Arg Val Leu Gly Val Leu Gln Glu Phe
3090 3095 3100
Ser Thr Gly Gln Arg Phe Ala Ser Ser Thr Leu Leu Val Leu Thr Arg
3105 3110 3115 3120
Ala Ala Val Thr Thr Thr Ala Gly Asp Arg Val Asp Pro Ala Ala Ser
3125 3130 3135
Thr Ile Trp Gly Leu Val Arg Ser Ala Gln Ser Glu Glu Pro Gly Arg
3140 3145 3150
Ile Leu Leu Ala Asp Thr Asp Ile Glu Gly Ser Asp Gly Val Asp Val
3155 3160 3165
Ala Gly Ile Val Ser Leu Ala Val Ala Val Gly Glu Pro Gln Val Leu
3170 3175 3180
Ile Arg Asp Gly Ile Ala His Ile Ala Arg Leu Val Arg Thr Ala Gly
3185 3190 3195 3200
Arg Asp Asp Lys Thr Thr Ala Ser Asp Ile Ser Asp Thr Asp Asp Thr
3205 3210 3215
Val Ser Gly Ala Gly Arg Gly Thr Val Val Val Thr Gly Gly Thr Gly
3220 3225 3230
Gly Leu Gly Arg Ile Leu Ala Arg His Leu Val Gly Val Arg Gly Val
3235 3240 3245
Arg Ser Leu Val Leu Ala Ser Arg Arg Gly Leu Ala Ala Glu Gly Ala
3250 3255 3260
Arg Glu Leu Val Glu Glu Leu Thr Gly Ser Gly Ala Arg Val Ala Val
3265 3270 3275 3280
Val Ala Cys Asp Val Ser Thr Arg Ala Gly Val Glu Gln Leu Leu Ala
3285 3290 3295
Ala Val Pro Asp Glu Asp Pro Leu Val Gly Val Val His Ala Ala Gly
3300 3305 3310
Val Leu Asp Asp Gly Val Ile Ala Ser Leu Thr Pro Gln Arg Leu Asp
3315 3320 3325
Thr Val Leu Ser Val Lys Ala Asp Ala Ala Trp Tyr Leu His Glu Leu
3330 3335 3340
Thr Arg Gly Leu Asp Leu Gly Met Phe Val Leu Tyr Ser Ser Ala Ala
3345 3350 3355 3360
Gly Val Leu Gly Ser Pro Gly Gln Gly Asn Tyr Ala Ala Ala Asn Gln
3365 3370 3375
Phe Leu Asp Gly Leu Ala Glu Tyr Arg Arg Ala Arg Gly Leu Ala Ala
3380 3385 3390
Thr Ser Ile Ala Trp Gly Leu Trp Gly Ser Gly Thr Gly Met Thr Gly
3395 3400 3405
His Leu Asp Gly Gly Asp Thr Ala Arg Met Ser Arg Gly Gly Met Leu
3410 3415 3420
Ala Leu Thr Glu Ala Gln Gly Met Ala Met Phe Asp Thr Ala Val Thr
3425 3430 3435 3440
Ala Glu His Ala Thr Val Leu Ala Ala Arg Phe Asp Thr Thr Val Leu
3445 3450 3455
Ala Ala Gln Ala Arg Ala Gly Met Leu Ala Pro Ile Leu His Asn Leu
3460 3465 3470
Val Pro Asn Ala Arg Arg Val Ala Ala Gly Asn Thr Gly Ser Ala Gly
3475 3480 3485
Ala Gly Val Ala Gly Ser Gln Leu Arg Gln Arg Leu Ser Gly Leu Asp
3490 3495 3500
Glu Ala Glu Gln Val Lys Val Leu Leu Glu Leu Val Arg Gly Gln Val
3505 3510 3515 3520
Ala Ile Val Leu Gly His Asp Asp Ala Thr Ala Ile Asp Ala Asp Arg
3525 3530 3535
Asn Phe Gln Glu Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Ala Arg
3540 3545 3550
Asn Arg Leu Lys Thr Ala Thr Gly Val Ala Val Ala Ala Thr Leu Ile
3555 3560 3565
Phe Asp Tyr Pro Thr Pro Arg Ala Val Ala Glu His Leu His Gln Gln
3570 3575 3580
Leu Ala Gly Ala Ser Val Ala Ala Glu Pro Val Val Val Val Gly His
3585 3590 3595 3600
Ser Ala Glu Pro Ile Ala Ile Val Gly Val Gly Cys Arg Phe Pro Gly
3605 3610 3615
Gly Val Ser Ser Arg Glu Glu Leu Trp Gln Val Val Ala Gln Ala Gly
3620 3625 3630
Met Trp Cys Arg Ser Gly Arg Trp Ile Gly Val Gly Met Gly Gly Cys
3635 3640 3645
Ser Ile Pro Asn Arg Val Trp Arg Val Gly Pro Thr Arg Val Arg Val
3650 3655 3660
Gly Ser Cys Thr Met Arg Gly Val Arg Cys Arg Val Leu Arg Asp Gln
3665 3670 3675 3680
Pro Ala Gly Ser Gly Cys Asp Gly Ser Ala Ala Ala Val Ala Ala Gly
3685 3690 3695
Asn Gly Val Gly Ser Pro Arg Arg Arg Arg Gly Gly Pro Gly Leu Val
3700 3705 3710
Ala Arg Gln Arg His Arg Arg Val His Gly Arg Asp Val Pro Arg Leu
3715 3720 3725
Pro Arg Gln Arg Gly Arg Trp Phe Gly Arg Leu Gly Ser Gly Val Val
3730 3735 3740
Arg Ala Gly Ile Gly Arg Pro Gly Gly Ile Gly Gly His Arg Val Leu
3745 3750 3755 3760
Val Val Ala Gly Arg Pro Ala Ser Ser Arg Ala Gly Arg Ala Gly Arg
3765 3770 3775
<210> 5
<211> 5010
<212> PRT
<213> Nocardia vinacea
<400> 5
Met Glu Glu Glu Arg Leu Leu Ala Asn Leu Arg Trp Val Thr Ser Glu
1 5 10 15
Leu Leu Glu Thr Arg Gln Arg Leu Asp Ala Val Leu Ser Glu Pro Ile
20 25 30
Ala Ile Val Gly Val Gly Cys Arg Leu Pro Gly Gly Val Ser Ser Arg
35 40 45
Glu Glu Leu Trp Glu Val Val Ala Gln Gly Arg Asp Val Val Ser Gln
50 55 60
Trp Pro Val Asp Arg Gly Trp Asp Ala Gly Leu Phe Asp Pro Glu Pro
65 70 75 80
Gly Val Ala Gly Arg Ser Tyr Thr Arg Glu Gly Gly Phe Leu His Asp
85 90 95
Ala Gly Leu Phe Asp Ala Gly Phe Phe Gly Ile Ser Pro Arg Glu Ala
100 105 110
Val Ala Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Thr Val Trp Glu
115 120 125
Ala Leu Glu Asp Ala Gly Val Asp Pro Val Ser Leu Arg Gly Ser Asp
130 135 140
Thr Gly Val Phe Ile Gly Val Ser Asp Gln Ser Tyr Gly Ile Gly Arg
145 150 155 160
Ser Asp Gly Asp Ala Gly Val Glu Gly Tyr Arg Leu Thr Gly Thr Thr
165 170 175
Ser Ser Val Val Ser Gly Arg Val Ser Tyr Val Leu Gly Leu Glu Gly
180 185 190
Pro Ala Val Ser Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu
195 200 205
His Gln Ala Val Arg Ala Val Arg Val Gly Glu Cys Gly Met Ala Leu
210 215 220
Val Gly Gly Ala Thr Val Met Ser Thr Pro Ser Met Phe Val Glu Phe
225 230 235 240
Ser Arg Gln Gly Gly Leu Ala Ser Asp Gly Arg Cys Lys Ser Phe Ala
245 250 255
Glu Ala Ala Asp Gly Thr Gly Trp Ser Glu Gly Val Gly Ile Leu Val
260 265 270
Val Glu Arg Leu Ser Glu Ala Arg Lys His Gly His Gln Val Leu Ala
275 280 285
Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu
290 295 300
Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Arg Arg Ala Leu
305 310 315 320
Ala Asn Ala Gly Leu Ser Pro Asp Leu Ile Asp Val Val Glu Ala His
325 330 335
Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu
340 345 350
Ala Thr Tyr Gly Gln Asn Arg Glu Pro Asp Arg Pro Leu Trp Leu Gly
355 360 365
Ser Leu Lys Ser Asn Ile Gly His Ala Gln Ala Ala Ala Gly Val Ala
370 375 380
Gly Val Ile Lys Met Ile Glu Ala Met Arg His Glu Thr Leu Pro Lys
385 390 395 400
Thr Leu His Val Asp Thr Pro Ser Ser His Val Asp Trp Thr Thr Gly
405 410 415
Thr Val Glu Leu Leu Thr Gln Glu Gln Pro Trp Pro Arg Asn Gly His
420 425 430
Pro Leu Arg Ala Gly Ile Ser Ser Phe Gly Val Ser Gly Thr Asn Ala
435 440 445
His Val Ile Leu Glu Gln Ala Pro Ala Val Val Glu Pro Val Pro Gly
450 455 460
Thr Glu Ser Leu Val Pro Val Thr Ala Gly Gly Leu Val Trp Val Leu
465 470 475 480
Ser Gly Arg Thr Gly Glu Gly Leu Leu Ala Gln Gly Arg Arg Leu Gln
485 490 495
Glu Trp Met Leu Thr Arg Pro Gly Leu Asp Ala Val Asp Val Gly Trp
500 505 510
Ser Leu Ile Asn Thr Arg Ala Arg Leu Glu His Arg Ala Val Leu Val
515 520 525
Gly Ala Asp Arg Glu Glu Leu Met Thr Arg Leu Gln Ala Leu Ile Asp
530 535 540
Ser Glu Pro Gly Met Leu Ala Gly Ala Gly Met Phe Ala Gly Pro Gly
545 550 555 560
Val Val Ser Gly Val Ala Gly Gly Val Gly Lys Thr Val Leu Val Phe
565 570 575
Pro Gly Gln Gly Ala Gln Trp Leu Gly Met Gly Ala Arg Leu Leu Gln
580 585 590
Glu Ser Val Val Phe Glu Gln Lys Val Leu Glu Cys Ala Glu Val Phe
595 600 605
Ala Pro Leu Val Glu Trp Ser Leu Ile Asp Val Leu Gln Gly Thr Ala
610 615 620
Asp Pro Met Leu Leu Glu Arg Val Asp Val Val Gln Pro Ala Leu Phe
625 630 635 640
Ala Val Met Val Ser Leu Ala Glu Val Trp Arg Ser Phe Gly Val Val
645 650 655
Pro Asp Ala Val Leu Gly His Ser Gln Gly Glu Ile Ala Ala Ala Cys
660 665 670
Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Arg Val Val Ile Leu
675 680 685
Arg Ser Arg Ala Leu Arg Glu Leu Ser Gly Arg Gly Gly Met Ala Ser
690 695 700
Val Leu Leu Pro Thr Thr Leu Val Glu Gln Arg Leu Thr Asp Met Pro
705 710 715 720
Gly Leu Ala Val Ala Ala Val Asn Gly Pro Thr Thr Thr Val Val Ser
725 730 735
Gly Leu Thr Glu Gln Leu Asp Ala Phe Val Ala Ala Cys Glu Ser Asp
740 745 750
Gly Val Gln Val Arg Arg Ile Ala Val Asp Tyr Ala Ser His Ser Pro
755 760 765
Gln Val Glu Ser Leu Arg Gln Arg Leu Leu Glu Glu Leu Ala Thr Ile
770 775 780
Thr Pro Arg Pro Ser Arg Ile Ala Phe Tyr Ser Thr Val Thr Gly Thr
785 790 795 800
Leu Leu Asp Thr Thr Glu Leu Asp Ala Gly Tyr Trp Phe Arg Asn Leu
805 810 815
Arg Asp Thr Val Asn Phe Ala Ala Thr Val Gln Thr Leu Leu Ser Glu
820 825 830
Gly His Thr Val Phe Val Glu Ala Ser Pro His Pro Val Leu Thr Pro
835 840 845
Gly Ile Glu Glu Leu Gly Glu Gln Thr Gly Pro Arg Thr Arg Asp Ile
850 855 860
Val Val Thr Gly Ser Leu Arg Arg Asp Asp Gly Gly Leu Asp Arg Leu
865 870 875 880
Leu Ser Ala Leu Ala Met Val Asp Val Ala Gly Ala Gly Val Asp Trp
885 890 895
Thr Pro Ile Phe Asp Gly Arg Gly Ala Thr Arg Val Ala Leu Pro Ser
900 905 910
Tyr Ala Phe Gln His Arg Arg Tyr Trp Leu Asp Thr Leu Thr Ala Ser
915 920 925
Gly Asn Pro Asp Ser Leu Gly Gln Thr Ala Leu Asp His Pro Leu Ile
930 935 940
Gly Ala Val Val Val Ser Pro Glu Thr Gly Ala Val Thr Val Thr Gly
945 950 955 960
Arg Leu Ser Leu Gln Thr His Pro Trp Leu Ala Asp His Ala Val Gly
965 970 975
Gly Val Val Leu Leu Pro Gly Thr Gly Leu Val Glu Leu Val Ile Arg
980 985 990
Ala Gly Asp Glu Val Gly Cys Gly Ala Ile Arg Glu Leu Thr Leu Leu
995 1000 1005
Ala Pro Leu Thr Leu Pro Ala Glu Gly Gly Thr Ala Ile Gln Val Leu
1010 1015 1020
Val Gly Ala Leu Glu Thr Ser Gly Thr Arg Thr Val Ser Val Tyr Ser
1025 1030 1035 1040
Gln Thr Arg Asp Gln Glu Trp Val Leu Asn Ala Gln Gly Leu Leu His
1045 1050 1055
Thr Gln Ser Pro Val Glu Asn Leu Thr Thr Thr Thr Pro His Asp Val
1060 1065 1070
Asp Ala Gly Leu Ala Ala Trp Pro Pro Ala Gly Ala Val His Thr Asp
1075 1080 1085
Thr Ser Ser Leu Tyr Gln Gln Leu Ala Glu Asp Gly Tyr Gly Tyr Gly
1090 1095 1100
Pro Ala Phe Gln Gly Leu Glu Ser Val Trp Arg Thr Gly Glu Asp Trp
1105 1110 1115 1120
Leu Val Gln Ala Thr Leu Pro Glu Thr Gly Gly Gln Ala Asn His Tyr
1125 1130 1135
Gly Leu His Pro Ala Leu Leu Asp Ala Val Leu His Ala Met Thr Thr
1140 1145 1150
Gly Phe Asp Thr Ser Gly Lys Ala Gly Val Glu Ala Ala Ala Gly Pro
1155 1160 1165
Leu Leu Pro Phe Ala Trp Glu Gly Val Gln Leu His Ala Val Gly Ala
1170 1175 1180
Ser Thr Val His Ala Arg Ile Thr Pro Leu Gly His Asn Thr Val Arg
1185 1190 1195 1200
Val Thr Val Thr Asp Pro Asp Gly Leu Pro Val Leu Thr Ile Ala Ser
1205 1210 1215
Leu Thr Leu Arg Pro Val Gln Leu Asp Gln Leu Ala Ile Ala Thr Gly
1220 1225 1230
Ser Gly Asp Arg Leu His Thr Leu His Trp Thr Ala Thr Pro Thr Gln
1235 1240 1245
Pro Gln Asp Val Ala Phe Val Glu Trp Asp Asp Leu Gln Ala Glu Ser
1250 1255 1260
Thr Asp His Pro Ala Pro Gln Ile Val Val Leu Asp Cys Arg Ser Gly
1265 1270 1275 1280
Arg Asn Asp Thr Asp Asn Asp Gly Gly Asn Gly Thr Asp Val Leu Ala
1285 1290 1295
Arg Ala His Ala Ile Ser His Arg Val Leu Gly Val Leu Gln Glu Phe
1300 1305 1310
Ser Thr Gly Gln Arg Phe Ala Ser Ser Thr Leu Leu Val Leu Thr Arg
1315 1320 1325
Ala Ala Val Thr Thr Thr Ala Gly Asp Arg Val Asp Pro Ala Ala Ser
1330 1335 1340
Thr Ile Trp Gly Leu Val Arg Ser Ala Gln Ser Glu Glu Pro Gly Arg
1345 1350 1355 1360
Ile Leu Leu Ala Asp Thr Asp Ile Glu Gly Ser Asp Gly Val Asp Val
1365 1370 1375
Ala Gly Ile Val Ser Leu Ala Val Ala Val Gly Glu Pro Gln Val Leu
1380 1385 1390
Ile Arg Asp Gly Ile Ala His Ile Ala Arg Leu Val Arg Thr Ala Gly
1395 1400 1405
Arg Asp Asp Lys Thr Thr Ala Ser Asp Ile Ser Asp Thr Asp Asp Thr
1410 1415 1420
Val Ser Gly Ala Gly Arg Gly Thr Val Val Val Thr Gly Gly Thr Gly
1425 1430 1435 1440
Gly Leu Gly Arg Ile Leu Ala Arg His Leu Val Gly Val Arg Gly Val
1445 1450 1455
Arg Ser Leu Val Leu Ala Ser Arg Arg Gly Leu Ala Ala Glu Gly Ala
1460 1465 1470
Arg Glu Leu Val Glu Glu Leu Thr Gly Ser Gly Ala Arg Val Ala Val
1475 1480 1485
Val Ala Cys Asp Val Ser Thr Arg Ala Gly Val Glu Gln Leu Leu Ala
1490 1495 1500
Ala Val Pro Asp Glu Asp Pro Leu Val Gly Val Val His Ala Ala Gly
1505 1510 1515 1520
Val Leu Asp Asp Gly Val Ile Ala Ser Leu Thr Pro Gln Arg Leu Asp
1525 1530 1535
Thr Val Leu Ser Val Lys Ala Asp Ala Ala Trp Tyr Leu His Glu Leu
1540 1545 1550
Thr Arg Gly Leu Asp Leu Gly Met Phe Val Leu Tyr Ser Ser Ala Ala
1555 1560 1565
Gly Val Leu Gly Ser Pro Gly Gln Gly Asn Tyr Ala Ala Ala Asn Gln
1570 1575 1580
Phe Leu Asp Gly Leu Ala Glu Tyr Arg Arg Ala Arg Gly Leu Ala Ala
1585 1590 1595 1600
Thr Ser Ile Ala Trp Gly Leu Trp Gly Ser Gly Thr Gly Met Thr Gly
1605 1610 1615
His Leu Asp Gly Gly Asp Thr Ala Arg Met Ser Arg Gly Gly Met Leu
1620 1625 1630
Ala Leu Thr Glu Ala Gln Gly Met Ala Met Phe Asp Thr Ala Val Thr
1635 1640 1645
Ala Glu His Ala Thr Val Leu Ala Ala Arg Phe Asp Thr Thr Val Leu
1650 1655 1660
Ala Ala Gln Ala Arg Ala Gly Met Leu Ala Pro Ile Leu His Asn Leu
1665 1670 1675 1680
Val Pro Asn Ala Arg Arg Val Ala Ala Gly Asn Thr Gly Ser Ala Gly
1685 1690 1695
Ala Gly Val Ala Gly Ser Gln Leu Arg Gln Arg Leu Ser Gly Leu Asp
1700 1705 1710
Glu Ala Glu Gln Val Lys Val Leu Leu Glu Leu Val Arg Gly Gln Val
1715 1720 1725
Ala Ile Val Leu Gly His Asp Asp Ala Thr Ala Ile Asp Ala Asp Arg
1730 1735 1740
Asn Phe Gln Glu Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Ala Arg
1745 1750 1755 1760
Asn Arg Leu Lys Thr Ala Thr Gly Val Ala Val Ala Ala Thr Leu Ile
1765 1770 1775
Phe Asp Tyr Pro Thr Pro Arg Ala Val Ala Glu His Leu His Gln Gln
1780 1785 1790
Leu Ala Gly Ala Ser Val Ala Ala Glu Pro Val Val Val Val Gly His
1795 1800 1805
Ser Ala Glu Pro Ile Ala Ile Val Gly Val Gly Cys Arg Phe Pro Gly
1810 1815 1820
Gly Val Ser Ser Arg Glu Glu Leu Trp Gln Val Val Ala Gln Gly Arg
1825 1830 1835 1840
Asp Val Val Ser Gln Trp Pro Leu Asp Arg Gly Trp Asp Gly Gly Leu
1845 1850 1855
Phe Asp Pro Glu Pro Gly Val Ala Gly Arg Ser Tyr Thr Arg Glu Gly
1860 1865 1870
Gly Phe Leu His Asp Ala Gly Leu Phe Asp Ala Gly Phe Phe Gly Ile
1875 1880 1885
Ser Pro Arg Glu Ala Val Ala Met Asp Pro Gln Gln Arg Leu Leu Leu
1890 1895 1900
Glu Thr Val Trp Glu Ala Leu Glu Asp Ala Gly Val Asp Pro Val Ser
1905 1910 1915 1920
Leu Arg Gly Ser Asp Thr Gly Val Phe Met Gly Val Met Tyr His Asp
1925 1930 1935
Tyr Pro Ala Ser Ala Val Gly Gly Ser Val Val Ser Gly Arg Val Ser
1940 1945 1950
Tyr Val Leu Gly Leu Glu Gly Pro Ala Val Ser Val Asp Thr Ala Cys
1955 1960 1965
Ser Ser Ser Leu Val Ala Leu His Gln Ala Val Gln Ala Val Arg Ala
1970 1975 1980
Gly Glu Cys Gly Met Ala Leu Val Gly Gly Val Thr Val Met Ser Thr
1985 1990 1995 2000
Pro Asp Thr Phe Val Glu Phe Ser Arg Gln Lys Gly Leu Ala Pro Asp
2005 2010 2015
Gly Arg Cys Lys Ser Phe Ala Glu Ala Ala Asp Gly Ala Gly Trp Ser
2020 2025 2030
Glu Gly Val Gly Val Leu Val Val Glu Arg Leu Ser Asp Ala Arg Arg
2035 2040 2045
Arg Gly His Gln Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gln
2050 2055 2060
Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln
2065 2070 2075 2080
Arg Val Ile Arg Arg Ala Leu Ala Asn Ala Gly Val Ala Ala Thr Glu
2085 2090 2095
Val Asp Val Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro
2100 2105 2110
Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Gln Asn Arg Glu Pro
2115 2120 2125
Asp Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn Ile Gly His Ala
2130 2135 2140
Gln Ala Ala Ala Gly Val Ala Gly Val Ile Lys Met Ile Glu Ala Met
2145 2150 2155 2160
Arg His Glu Thr Leu Pro Lys Thr Leu His Val Asp Thr Pro Thr Thr
2165 2170 2175
His Val Asp Trp Thr Ala Gly Ala Val Glu Leu Leu Thr Glu Ser Arg
2180 2185 2190
Ala Trp Thr Val Glu Ala Asp Arg Pro Arg Arg Ala Ala Val Ser Ser
2195 2200 2205
Phe Gly Ile Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln Ser Pro
2210 2215 2220
Pro Val Thr Pro Asp Thr Glu Ser Ser Ala Pro Asp Thr Asp Pro Val
2225 2230 2235 2240
Pro Ala Val Lys Ser Asp Ala Val Val Trp Met Val Ser Gly Arg Thr
2245 2250 2255
Gly Glu Gly Leu Leu Ala Gln Gly Arg Arg Leu Gln Glu Trp Met Leu
2260 2265 2270
Thr Arg Pro Gly Leu Asp Ala Val Asp Val Gly Trp Ser Leu Ile Asn
2275 2280 2285
Thr Arg Ala Arg Leu Glu His Arg Ala Val Leu Val Gly Ala Asp Arg
2290 2295 2300
Glu Glu Leu Met Thr Arg Leu Gln Ala Leu Ile Asp Ser Glu Pro Gly
2305 2310 2315 2320
Met Leu Ala Gly Ala Gly Met Phe Ala Gly Pro Gly Val Val Ser Gly
2325 2330 2335
Val Ala Gly Gly Val Gly Lys Thr Val Leu Val Phe Pro Gly Gln Gly
2340 2345 2350
Ala Gln Trp Leu Gly Met Gly Ala Arg Leu Leu Gln Glu Ser Val Val
2355 2360 2365
Phe Glu Gln Lys Val Leu Glu Cys Ala Glu Val Phe Ala Pro Leu Val
2370 2375 2380
Glu Trp Ser Leu Ile Asp Val Leu Gln Gly Thr Ala Asp Pro Met Leu
2385 2390 2395 2400
Leu Glu Arg Val Asp Val Val Gln Pro Ala Leu Phe Ala Val Met Val
2405 2410 2415
Ser Leu Ala Glu Val Trp Arg Ser Phe Gly Val Val Pro Asp Ala Val
2420 2425 2430
Leu Gly His Ser Gln Gly Glu Ile Ala Ala Ala Cys Val Ala Gly Ala
2435 2440 2445
Leu Ser Leu Glu Asp Ala Ala Arg Val Val Ile Leu Arg Ser Arg Ala
2450 2455 2460
Leu Arg Glu Leu Ser Gly Arg Gly Gly Met Ala Ser Val Leu Leu Pro
2465 2470 2475 2480
Thr Thr Leu Val Glu Gln Arg Leu Thr Asp Met Pro Gly Leu Ala Val
2485 2490 2495
Ala Ala Val Asn Gly Pro Thr Thr Thr Val Val Ser Gly Pro Thr Glu
2500 2505 2510
Gln Leu Asp Ala Phe Val Ala Ala Cys Glu Ser Asp Gly Val Gln Val
2515 2520 2525
Arg Arg Ile Ala Val Asp Tyr Ala Ser His Ser Pro Gln Val Glu Ser
2530 2535 2540
Leu Arg Gln Arg Leu Leu Glu Glu Leu Ala Thr Ile Thr Pro Arg Pro
2545 2550 2555 2560
Ser Arg Ile Ala Phe Tyr Ser Thr Val Thr Gly Thr Leu Leu Asp Thr
2565 2570 2575
Thr Glu Leu Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asp Thr Val
2580 2585 2590
Asn Phe Ala Ala Thr Val Gln Thr Leu Leu Ser Glu Gly His Thr Val
2595 2600 2605
Phe Val Glu Ala Ser Pro His Pro Val Leu Thr Pro Gly Ile Glu Glu
2610 2615 2620
Leu Gly Glu Gln Thr Gly Pro Arg Thr Arg Asp Ile Val Val Thr Gly
2625 2630 2635 2640
Ser Leu Arg Arg Asp Asp Gly Gly Leu Asp Arg Leu Leu Ser Ala Leu
2645 2650 2655
Ala Met Val Asp Val Ala Gly Ala Gly Val Asp Trp Thr Pro Ile Phe
2660 2665 2670
Asp Gly Arg Gly Ala Thr Arg Val Ala Leu Pro Ser Tyr Ala Phe Gln
2675 2680 2685
His Arg Arg Tyr Trp Leu Asp Thr Leu Thr Ala Ser Gly His Pro Asp
2690 2695 2700
Ser Leu Gly Ser Ser Val Gly Ala Asp Asp Gly Ile Asp Gly Glu Phe
2705 2710 2715 2720
Trp Asp Ala Val Ala Arg Glu Asp Trp Glu Ala Leu Gly Leu Glu Glu
2725 2730 2735
Gly Cys Thr Ile Gly Glu Val Ser Pro Leu Leu Ser Ser Trp Arg Gln
2740 2745 2750
Gln Arg Arg Ala Gln Ser Val Ile Asp Gln Trp Arg Tyr Arg Ile Gly
2755 2760 2765
Trp Lys Trp Leu Ala Glu Lys Pro Val Arg Val Ser Gly Lys Trp Leu
2770 2775 2780
Val Val Ser Pro Thr Gly Ala Ala Ile Gly Asp Glu Val Cys Gly Val
2785 2790 2795 2800
Phe Thr Ala Ala Gly Leu Glu Thr Gln Arg Leu Glu Val Asp Ala Asp
2805 2810 2815
Arg Met Thr Arg Gln Thr Met Ala Asp Leu Leu Glu Ser Ala Gly Pro
2820 2825 2830
Trp Asp Glu Phe Arg Gly Val Val Ser Leu Ile Ala Leu Asn Asp Gly
2835 2840 2845
Ile Gly Gly Asp Ser Pro Leu Val Ser Arg Gly Val Ala Gly Asn Val
2850 2855 2860
Trp Leu Leu Lys Ala Leu Arg Glu Thr Ala Ala Glu Ile Pro Leu Trp
2865 2870 2875 2880
Cys Val Thr Ser Gly Ala Val Ile Val Gly Pro Ser Asp Arg Ser Val
2885 2890 2895
Asp Ala Thr Gln Ser Gln Met Trp Gly Leu Gly Gln Val Ala Gly Leu
2900 2905 2910
Glu Leu Pro Gln Ser Trp Gly Gly Leu Ile Asp Leu Pro Asn Ala Trp
2915 2920 2925
Asp Asp Thr Ile Leu Arg Ser Leu Pro Ala Val Leu Ser Arg Glu Asp
2930 2935 2940
Gly Glu Asp Gln Leu Ala Val Arg Glu Ser Gly Val Tyr Gly Arg Arg
2945 2950 2955 2960
Met Met Arg Ala Pro Leu Pro Asn Ser Gly Arg Gly Lys His Trp Arg
2965 2970 2975
Pro Arg Gly Thr Val Leu Val Thr Gly Gly Thr Gly Gly Ile Gly Ala
2980 2985 2990
His Ala Ala Arg Trp Leu Leu Thr Asn Gly Ala Glu His Val Val Leu
2995 3000 3005
Val Ser Arg Arg Gly Arg Gln Ala Pro Gly Ala Leu Glu Leu Glu Gln
3010 3015 3020
Glu Leu Ser Ala Leu Gly Gly Arg Val Thr Ile Met Ala Ala Asp Ile
3025 3030 3035 3040
Ala Glu Arg Gly Asp Val Ala Ala Val Leu Ser Thr Ile Asp Asn Asp
3045 3050 3055
Ser Ile Pro Leu Thr Ala Val Ile His Ala Ala Gly Val Val Asp Gln
3060 3065 3070
Arg Pro Leu Thr Glu Ile Asp Ser Glu Ser Met Thr Thr Ala Ala Ala
3075 3080 3085
Ala Lys Val Gly Gly Ala Gln His Leu Asp Glu Leu Leu Gly Asp Arg
3090 3095 3100
Arg Leu Asp Ala Phe Val Leu Phe Ser Ser Gly Ala Ala Thr Trp Gly
3105 3110 3115 3120
Gly Thr Gly Leu Ala Glu Tyr Ala Ala Ser Asn Ala His Leu Asp Gly
3125 3130 3135
Leu Ala Gln Asp Arg Arg Ser Arg Gly Leu Val Ala Thr Ser Leu Ala
3140 3145 3150
Trp Gly Gly Trp Ser Gly Gly Gly Met Thr Glu Ile Gly Thr Thr Thr
3155 3160 3165
Glu Tyr Phe Gly Arg Leu Gly Ile Arg Leu Met Glu Pro Asp Leu Ala
3170 3175 3180
Leu Gln Ala Leu Ser Gln Ala Val Ala Asn Asn Glu Thr Leu Val Thr
3185 3190 3195 3200
Val Ala Asp Ile Asp Trp Gln Gln Phe Thr Val Tyr Tyr Thr Leu Ser
3205 3210 3215
Arg Arg Arg Leu Leu Ile Thr Asp Ile Pro Asp Ala Gln Ala Asp Thr
3220 3225 3230
Asp Ser Ala Ile Asp Ser Gly Asn Thr Gly Ser Pro Leu Arg Gln Arg
3235 3240 3245
Leu Ser Gly Leu Gly Glu Thr Glu Gln Ile Gln Val Leu Leu Asp Leu
3250 3255 3260
Val Arg Ala Gln Ile Ala Ile Val Leu Gly His Asp Asp Ala Thr Ala
3265 3270 3275 3280
Ile Asp Ala Asp Arg Asn Phe Arg Asp Leu Gly Phe Asp Ser Leu Thr
3285 3290 3295
Ala Val Glu Ala Arg Asn Arg Ile Lys Thr Ala Thr Gly Val Ala Val
3300 3305 3310
Ala Ala Thr Leu Ile Phe Asp Tyr Pro Thr Pro Arg Ala Val Ala Glu
3315 3320 3325
His Leu His Gln Gln Leu Ala Gly Ala Ser Val Ala Ala Glu Pro Val
3330 3335 3340
Val Val Val Gly His Ser Ala Glu Pro Ile Ala Ile Val Gly Val Gly
3345 3350 3355 3360
Cys Arg Phe Pro Gly Gly Val Ser Ser Arg Glu Glu Leu Trp Gln Val
3365 3370 3375
Val Ala Gln Gly Arg Asp Val Val Ser Gln Trp Pro Val Asp Arg Gly
3380 3385 3390
Trp Asp Ala Gly Leu Phe Asp Pro Glu Pro Gly Val Thr Gly Lys Ser
3395 3400 3405
Tyr Thr Arg Asp Gly Ala Phe Leu His Asp Ala Gly Leu Phe Asp Ala
3410 3415 3420
Gly Phe Phe Gly Ile Ser Pro Arg Glu Ala Val Ala Met Asp Pro Gln
3425 3430 3435 3440
Gln Arg Leu Leu Leu Glu Thr Val Trp Glu Ala Leu Glu Asp Ala Gly
3445 3450 3455
Val Asp Pro Val Ser Leu Arg Gly Ser Asp Thr Gly Val Phe Ile Gly
3460 3465 3470
Val Ser Asp Gln Ser Tyr Gly Ile Gly Arg Ser Asp Gly Asp Ala Gly
3475 3480 3485
Val Glu Gly Tyr Arg Leu Thr Gly Gly Ala Thr Ser Val Val Ser Gly
3490 3495 3500
Arg Val Ser Tyr Val Leu Gly Leu Glu Gly Pro Ala Val Ser Val Asp
3505 3510 3515 3520
Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Gln Ala Val Gln Ala
3525 3530 3535
Val Arg Ala Gly Glu Cys Gly Met Ala Leu Val Gly Gly Val Ala Val
3540 3545 3550
Leu Ala Thr Pro Gly Ala Phe Ile Glu Phe Ser Arg Gln Lys Gly Leu
3555 3560 3565
Ala Pro Asp Gly Arg Cys Lys Ser Phe Ala Glu Ala Ala Asp Gly Thr
3570 3575 3580
Gly Trp Ser Glu Gly Val Gly Ile Leu Val Val Glu Arg Leu Ser Asp
3585 3590 3595 3600
Ala Arg Arg His Gly His Gln Val Leu Ala Val Val Arg Gly Ser Ala
3605 3610 3615
Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro
3620 3625 3630
Ser Gln Gln Arg Val Ile Arg Arg Ala Leu Ala Asn Ala Gly Val Ser
3635 3640 3645
Ala Thr Glu Val Asp Val Val Glu Ala His Gly Thr Gly Thr Thr Leu
3650 3655 3660
Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Gln Asn
3665 3670 3675 3680
Arg Glu Pro Asp Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn Ile
3685 3690 3695
Gly His Thr Gln Asn Ala Ala Gly Val Ala Gly Val Ile Lys Met Ile
3700 3705 3710
Glu Ala Ile Arg His Gln Thr Leu Pro Lys Thr Leu His Ile Asp Thr
3715 3720 3725
Pro Thr Thr His Val Asp Trp Thr Ser Gly Ala Val Glu Leu Leu Thr
3730 3735 3740
Glu Ser Arg Thr Trp Thr Thr Glu Ala Asp Arg Pro Arg Arg Ala Ala
3745 3750 3755 3760
Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala His Val Ile Leu Glu
3765 3770 3775
Gln Ser Pro Pro Val Thr Pro Asp Thr Glu Ser Ser Ala Pro Asp Thr
3780 3785 3790
Asp Pro Val Pro Ala Val Lys Ser Asp Ala Val Val Trp Met Val Ser
3795 3800 3805
Gly Arg Thr Gly Glu Gly Leu Leu Ala Gln Gly Arg Arg Leu His Glu
3810 3815 3820
Trp Met Leu Ala Arg Pro Gly Leu Asp Ala Val Asp Val Gly Trp Ser
3825 3830 3835 3840
Leu Ile Asn Thr Arg Ala Arg Leu Glu His Arg Ala Val Leu Val Gly
3845 3850 3855
Ala Asp Arg Glu Glu Leu Met Thr Arg Leu Gln Gly Leu Ile Asp Gly
3860 3865 3870
Asp Pro Ala Val Ala Ala Gly Val Ser Arg Asp Arg Gly Lys Thr Val
3875 3880 3885
Phe Val Phe Pro Gly Gln Gly Ala Gln Leu Leu Gly Met Gly Ser Gly
3890 3895 3900
Leu Tyr Glu Ala Phe Pro Val Phe Ala Ala Ser Phe Asp Glu Thr Thr
3905 3910 3915 3920
Ala Leu Leu Glu Gln Gln Leu Glu Cys Ser Leu Arg Asp Val Val Trp
3925 3930 3935
Gly Val Asp Glu Gln Ala Leu Gln Ala Thr Leu Tyr Thr Gln Thr Gly
3940 3945 3950
Leu Phe Ala Val Gly Ile Ala Leu Phe Arg Leu Leu Glu Ser Phe Gly
3955 3960 3965
Val Arg Pro Asp Phe Val Ala Gly His Ser Ile Gly Glu Leu Ala Ala
3970 3975 3980
Ala Thr Val Ala Gly Val Leu Ser Leu Glu Asp Ala Thr Val Leu Val
3985 3990 3995 4000
Ala Ala Arg Ala Arg Leu Met Gln Gln Leu Pro Thr Gly Gly Ala Met
4005 4010 4015
Leu Ala Met Arg Ala Ser Glu Thr Gln Ile Thr Thr Leu Leu Gly Asp
4020 4025 4030
Ser Ile Glu Asp Gly Val Val Glu Ile Ala Ala Val Asn Gly Pro Glu
4035 4040 4045
Ser Ile Val Leu Ala Gly Pro Gln His Ala Ile Asp Thr Thr Glu Gln
4050 4055 4060
Gln Leu Arg Gln Ala Gly Tyr Gln Val Asn Arg Leu Arg Val Ser His
4065 4070 4075 4080
Ala Phe His Ser Ala Ser Met Glu Pro Met Leu Ala Glu Phe Ala Arg
4085 4090 4095
Ile Ala Thr Glu Leu Thr Tyr Thr Gln Pro Val Ile Pro Ile Ile Ser
4100 4105 4110
Asn Leu Asp Gly Gln Leu Thr Gly Pro Asn Thr Asp Ser Pro Asn Thr
4115 4120 4125
Asp Ala Gln Gln Ala Asp Ser Pro Leu Thr Thr Pro Gln Tyr Trp Val
4130 4135 4140
Asp His Val Arg Asn Thr Val Arg Phe Ala Asp Gly Ile Thr Thr Leu
4145 4150 4155 4160
Thr Thr Ala Gly Ala Thr Arg Tyr Val Ile Met Gly Pro Asp Gly Gly
4165 4170 4175
Leu Ser Gly Leu Ile Asp Glu Thr Leu Gln His Thr Thr Ser Asp Ala
4180 4185 4190
Val Asp Thr Lys Pro Thr Val Asp Gly Val Glu Ala Val Val Ala Ser
4195 4200 4205
Leu Leu Arg Lys Asp Arg Val Glu Asp Thr Thr Leu Leu Ser Ala Leu
4210 4215 4220
Ala Arg Leu Asp Val Ala Gly Thr Gly Ile Asp Trp Thr Pro Ile Phe
4225 4230 4235 4240
His Gly Arg Gly Ala Thr Arg Val Pro Leu Pro Ser Tyr Ala Phe Gln
4245 4250 4255
His Arg Arg Tyr Trp Leu Asp Thr Ile Thr Gly Asn Thr Asp Pro Asp
4260 4265 4270
Ser Leu Gly Leu Ser Gly Leu Asp His Pro Leu Ile Gly Ala Val Val
4275 4280 4285
Val Ser Pro Glu Thr Gly Ala Val Thr Val Thr Gly Arg Leu Ser Leu
4290 4295 4300
Gln Thr His Pro Trp Leu Ala Asp His Ala Val Gly Gly Val Val Leu
4305 4310 4315 4320
Leu Pro Gly Thr Gly Leu Val Glu Leu Val Ile Arg Ala Gly Asp Glu
4325 4330 4335
Val Gly Cys Gly Ala Ile Arg Glu Leu Thr Leu Leu Ala Pro Leu Thr
4340 4345 4350
Leu Pro Ala Glu Gly Gly Thr Ala Ile Gln Val Leu Val Gly Ala Leu
4355 4360 4365
Glu Thr Ser Gly Thr Arg Thr Val Ser Val Tyr Ser Gln Thr Arg Asp
4370 4375 4380
Gln Glu Trp Val Leu Asn Ala Gln Gly Leu Leu His Thr Gln Ser Pro
4385 4390 4395 4400
Val Glu Asn Leu Thr Thr Thr Thr Pro Val Asp Thr Gly Leu Ala Val
4405 4410 4415
Trp Pro Pro Gln Asn Ala Thr Arg Thr Asp Thr Ser Ser Leu Tyr Gln
4420 4425 4430
Gln Leu Ala Glu Asp Gly Tyr Gly Tyr Gly Pro Ala Phe Gln Gly Leu
4435 4440 4445
Glu Ser Val Trp Arg Thr Gly Glu Asp Trp Leu Val Gln Ala Arg Leu
4450 4455 4460
Pro Glu Thr Gly Gly Asp Ala His His Tyr Gly Leu His Pro Ala Leu
4465 4470 4475 4480
Leu Asp Ala Val Leu His Ala Met Thr Thr Gly His Asp Thr Asp Thr
4485 4490 4495
Asp Thr Asp Thr Ser Val Gly Pro Leu Leu Pro Phe Ala Trp Glu Gly
4500 4505 4510
Val Gln Leu His Ala Val Gly Ala Ser Thr Val His Ala Arg Ile Thr
4515 4520 4525
Pro His Gly His Asn Thr Val Ser Val Thr Val Thr Asp Pro Asp Gly
4530 4535 4540
Gln Pro Val Leu Thr Ile Ala Ser Leu Thr Leu Arg Pro Val Gln Leu
4545 4550 4555 4560
Asp Gln Leu Thr Thr Ala Ala Asp Gly Gly Asp Arg Leu His Thr Leu
4565 4570 4575
His Trp Thr Pro Thr Pro Met Pro Ala Gln Leu Arg Glu Ala Ala Phe
4580 4585 4590
Ala Glu Trp Asp Asp Leu Gln Ala Glu Ser Leu Glu Ala Glu Ser Thr
4595 4600 4605
Asp Gln Pro Val Pro Ala Val Val Val Leu Asp Cys Arg Ser Gly Gly
4610 4615 4620
Asn Asp Thr Asp Asn Asp Gly Gly Asn Gly Ile Asp Val Leu Ala Arg
4625 4630 4635 4640
Ala His Ala Ile Ser His Arg Val Leu Thr Val Leu Gln Asp Phe Ser
4645 4650 4655
Val Gln Gln Arg Phe Ala Ser Ser Thr Leu Leu Val Leu Thr Arg Ser
4660 4665 4670
Ala Val Ala Val Asn Gly Asp Gly Val Asp Pro Ala Ala Ser Ala Val
4675 4680 4685
Trp Gly Leu Val Arg Ser Ala Gln Ser Glu Glu Pro Gly Arg Ile Leu
4690 4695 4700
Leu Ala Asp Thr Asp Ile Glu Gly Ser Asp Gly Val Asp Val Ala Gly
4705 4710 4715 4720
Ile Val Ser Leu Ala Val Ala Val Gly Glu Pro Gln Val Leu Ile Arg
4725 4730 4735
Asp Gly Ile Ala His Ile Ala Arg Leu Val Arg Thr Ala Gly Arg Asp
4740 4745 4750
Asp Lys Thr Thr Ala Ser Asp Ile Ser Asp Thr Asp Asp Thr Val Ser
4755 4760 4765
Gly Ala Gly Arg Gly Thr Val Val Val Thr Gly Gly Thr Gly Gly Leu
4770 4775 4780
Gly Arg Ile Leu Ala Arg His Leu Val Gly Val Arg Gly Val Arg Ser
4785 4790 4795 4800
Leu Val Leu Ala Ser Arg Arg Gly Leu Ala Ala Glu Gly Ala Arg Glu
4805 4810 4815
Leu Val Glu Glu Leu Thr Gly Ser Gly Ala Arg Val Ala Val Val Ala
4820 4825 4830
Cys Asp Val Ser Thr Arg Ala Gly Val Glu Gln Leu Leu Ala Ala Val
4835 4840 4845
Pro Asp Glu Asp Pro Leu Val Gly Val Val His Ala Ala Gly Val Leu
4850 4855 4860
Asp Asp Gly Val Ile Ala Ser Leu Thr Pro Gln Arg Leu Asp Thr Val
4865 4870 4875 4880
Leu Ser Val Lys Ala Asp Ala Ala Trp Tyr Leu His Glu Leu Thr Arg
4885 4890 4895
Gly Leu Asp Leu Gly Met Phe Val Met Tyr Ser Ser Thr Ala Gly Val
4900 4905 4910
Leu Gly Ser Pro Gly Gln Gly Asn Tyr Ala Ala Ala Asn Gln Phe Leu
4915 4920 4925
Asp Gly Leu Ala Glu Tyr Arg Arg Ala Arg Gly Leu Ala Ala Thr Ser
4930 4935 4940
Ile Ala Trp Gly Leu Trp Gly Ser Gly Thr Gly Met Thr Gly His Leu
4945 4950 4955 4960
Asp Gly Gly Asp Thr Ala Arg Met Ser Arg Gly Gly Met Leu Ala Leu
4965 4970 4975
Thr Glu Ala Gln Gly Met Ala Met Phe Asp Thr Ala Val Thr Ala Glu
4980 4985 4990
His Ala Thr Val Leu Ala Ala Arg Phe Asp Thr Thr Val Leu Ala Ala
4995 5000 5005
Gln Ala
5010
<210> 6
<211> 4784
<212> PRT
<213> Nocardia vinacea
<400> 6
Met Val Thr Gly Ser His Val Gln Met Met Asp Ser Leu Gly Val Ile
1 5 10 15
Ser Asn Gln Gly Glu Cys Arg Ala Phe Asp Ala Asn Ala Asp Gly Phe
20 25 30
Val Pro Gly Glu Gly Gly Gly Met Val Leu Leu Lys Pro Leu Ala Gln
35 40 45
Ala Leu His Asp Gly Asp Arg Ile Tyr Ser Val Ile Leu Gly Gly Ala
50 55 60
Ile Asn Gln Asp Gly Ala Ser Thr Asp Phe Met Ala Pro Ser Val Ala
65 70 75 80
Ala Gln Glu Glu Leu Leu Leu Ser Ala Leu Arg Arg Ala Asn Val Gly
85 90 95
Gly Asp Gln Val His Tyr Val Glu Leu His Gly Thr Gly Thr Val Ala
100 105 110
Gly Asp Leu Ala Glu Ala Ala Ala Leu Val Ser Val Phe Gly Lys Gly
115 120 125
Arg Gly Ser Asp Glu Ala Trp Leu Gln Val Gly Ser Val Lys Thr Asn
130 135 140
Ile Gly His Leu Asp Ala Ala Ala Gly Ile Ala Gly Phe Ile Lys Val
145 150 155 160
Ala Leu Gly Leu Trp His Glu Thr Ile Pro Ser Ser Leu Asn Tyr Glu
165 170 175
Thr Pro Asn Arg Ser Ile Ser Ile Glu Asp Ser Gly Val Ala Val Leu
180 185 190
Arg Gln Cys Leu Asp Leu Ser Ala Asp His Ser Arg Ala Val Ala Gly
195 200 205
Val Ser Ser Phe Gly Met Gly Gly Thr Asn Cys His Ile Val Leu Ala
210 215 220
Gly Asn Ala Ala Gly Ala Thr Asp Ser Arg Thr Asn Gln Ala Phe Gly
225 230 235 240
Leu Val Gln Glu Ser Val Thr Thr His Gly Leu Val Trp Val Leu Ser
245 250 255
Gly Arg Ser Ser Asp Gly Leu Leu Ala Gln Gly Arg Arg Leu Gln Glu
260 265 270
Trp Met Leu Thr Arg Pro Gly Leu Asp Ala Val Asp Val Gly Trp Ser
275 280 285
Leu Ile Asn Thr Arg Ala Arg Leu Glu His Arg Ala Val Leu Val Gly
290 295 300
Ala Asp Arg Glu Glu Leu Met Thr Arg Leu Gln Gly Leu Ile Asp Gly
305 310 315 320
Asp Pro Ala Val Ala Ala Gly Val Ser Arg Asp Arg Gly Lys Thr Val
325 330 335
Phe Val Phe Pro Gly Gln Gly Ala Gln Leu Leu Gly Met Gly Ser Gly
340 345 350
Leu Tyr Glu Ala Phe Pro Val Phe Ala Ala Ser Phe Asp Glu Thr Thr
355 360 365
Ala Leu Leu Glu Gln Gln Leu Glu Cys Ser Leu Arg Asp Val Val Trp
370 375 380
Gly Val Asp Glu Gln Ala Leu Gln Ala Thr Leu Tyr Thr Gln Thr Gly
385 390 395 400
Leu Phe Ala Val Gly Ile Ala Leu Phe Arg Leu Leu Glu Ser Phe Gly
405 410 415
Val Arg Pro Asp Phe Val Ala Gly His Ser Ile Gly Glu Leu Ala Ala
420 425 430
Ala Thr Val Ala Gly Val Leu Ser Leu Glu Asp Ala Thr Val Leu Val
435 440 445
Ala Ala Arg Ala Arg Leu Met Gln Gln Leu Pro Thr Gly Gly Ala Met
450 455 460
Leu Ala Met Arg Ala Ser Glu Thr Gln Ile Thr Thr Leu Leu Gly Asp
465 470 475 480
Ser Ile Glu Asp Gly Val Val Glu Ile Ala Ala Val Asn Gly Pro Glu
485 490 495
Ser Ile Val Leu Ala Gly Pro Gln His Ala Ile Asp Thr Thr Glu Gln
500 505 510
Gln Leu Arg Gln Ala Gly Tyr Gln Val Asn Arg Leu Arg Val Ser His
515 520 525
Ala Phe His Ser Ala Ser Met Glu Pro Met Leu Ala Glu Phe Ala Arg
530 535 540
Ile Ala Thr Glu Leu Thr Tyr Thr Gln Pro Val Ile Pro Ile Ile Ser
545 550 555 560
Asn Leu Asp Gly Gln Leu Thr Gly Thr Pro Asp Asp Gln Pro Ser Ala
565 570 575
Leu Ala Thr Pro Gln Tyr Trp Val Asp His Val Arg Asn Thr Val Arg
580 585 590
Phe Ala Asp Gly Ile Thr Thr Leu Thr Thr Ala Gly Ala Thr Arg Tyr
595 600 605
Val Ile Met Gly Pro Asp Gly Gly Leu Ser Gly Leu Ile Asp Glu Thr
610 615 620
Leu Gln His Thr Thr Ser Asp Ala Val Asp Thr Lys Pro Thr Val Asp
625 630 635 640
Gly Val Glu Ala Val Val Ala Ser Leu Leu Arg Lys Asp Arg Val Glu
645 650 655
Asp Thr Thr Leu Leu Ser Ala Leu Ala Arg Leu Asp Val Ala Gly Thr
660 665 670
Gly Ile Asp Trp Thr Pro Ile Phe His Gly Arg Gly Ala Ser Arg Val
675 680 685
Gln Leu Pro Thr Tyr Ala Phe Asp Arg Gln Gln Gly Gly Arg His Ala
690 695 700
Ser Ser Ser Ser Arg Ile Thr Ser Glu Glu Val Gln Asp Leu Phe Ala
705 710 715 720
Arg Lys Leu Ala Gln Leu Ser Leu Gly Asp Gln Trp Leu Met Ile Lys
725 730 735
Asn Ala Val Arg Asp Gln Leu Ala Ala Val Ser Gly Lys Phe Ser Pro
740 745 750
Asp Glu Phe Asp Glu Asp Asn Ser Phe Arg Asp Leu Gly Leu Asp Ser
755 760 765
Leu Gly Ala Val Glu Phe Arg Arg Arg Leu Asn Arg Leu Thr Gly Val
770 775 780
Ala Met Ser Ala Thr Leu Ile Phe Asp Tyr Pro Thr Pro Arg Ala Val
785 790 795 800
Ala Glu His Leu His Gln Gln Leu Ala Gly Ala Ser Val Ala Ala Glu
805 810 815
Pro Val Val Val Met Gly His Ser Ala Glu Pro Ile Ala Ile Val Gly
820 825 830
Val Gly Cys Arg Phe Pro Gly Gly Val Ser Ser Arg Glu Glu Leu Trp
835 840 845
Gln Val Val Ala Gln Gly Arg Asp Val Val Ser Gln Trp Pro Leu Asp
850 855 860
Arg Gly Trp Asp Ala Gly Leu Phe Asp Pro Glu Pro Gly Val Ala Gly
865 870 875 880
Arg Ser Tyr Thr Arg Glu Gly Gly Phe Leu His Asp Ala Gly Leu Phe
885 890 895
Asp Ala Gly Phe Phe Gly Ile Ser Pro Arg Glu Ala Val Ala Met Asp
900 905 910
Pro Gln Gln Arg Leu Leu Leu Glu Thr Val Trp Glu Ala Leu Glu Asp
915 920 925
Ala Gly Val Asp Pro Val Ser Leu Arg Gly Ser Asp Thr Gly Val Phe
930 935 940
Ile Gly Val Thr Asp His Ala Tyr Gly Ile Gly Arg Gly Glu Val Asp
945 950 955 960
Asp Ser Phe Glu Gly Tyr Arg Leu Thr Gly Thr Thr Ser Ser Val Val
965 970 975
Ser Gly Arg Val Ser Tyr Val Leu Gly Leu Glu Gly Pro Ala Val Ser
980 985 990
Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Gln Ala Val
995 1000 1005
Gln Ala Val Arg Ala Gly Glu Cys Gly Met Ala Leu Val Gly Gly Val
1010 1015 1020
Thr Val Met Ser Thr Pro Ser Met Phe Val Glu Phe Ser Arg Gln Gly
1025 1030 1035 1040
Gly Leu Ala Ser Asp Gly Arg Cys Lys Ser Phe Ala Glu Ala Ala Asp
1045 1050 1055
Gly Thr Gly Trp Ser Glu Gly Val Gly Ile Leu Val Val Glu Arg Leu
1060 1065 1070
Ser Glu Ala Arg Lys His Gly His Gln Val Leu Ala Val Val Arg Gly
1075 1080 1085
Ser Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn
1090 1095 1100
Gly Pro Ser Gln Gln Arg Val Ile Arg Arg Ala Leu Ala Asn Ala Gly
1105 1110 1115 1120
Leu Ser Pro Asp Leu Ile Asp Val Val Glu Ala His Gly Thr Gly Thr
1125 1130 1135
Thr Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly
1140 1145 1150
Gln Asn Arg Glu Pro Asp Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser
1155 1160 1165
Asn Ile Gly His Ala Gln Ala Ala Ala Gly Val Ala Gly Val Ile Lys
1170 1175 1180
Met Ile Glu Ala Met Arg His Glu Thr Leu Pro Lys Thr Leu His Val
1185 1190 1195 1200
Asp Thr Pro Thr Thr His Val Asp Trp Thr Ala Gly Ala Val Glu Leu
1205 1210 1215
Leu Thr Glu Ser Arg Ala Trp Thr Val Glu Ala Asp Arg Pro Arg Arg
1220 1225 1230
Ala Ala Val Ser Ser Phe Gly Ile Ser Gly Thr Asn Ala His Val Ile
1235 1240 1245
Leu Glu Gln Ala Pro Pro Val Thr Glu Thr Pro Asp Thr Asp Pro Val
1250 1255 1260
Pro Ala Val Lys Ser Asp Ala Val Val Trp Met Val Ser Gly Arg Thr
1265 1270 1275 1280
Gly Glu Gly Leu Leu Ala Gln Gly Arg Arg Leu His Glu Trp Met Leu
1285 1290 1295
Ala Arg Pro Gly Leu Asp Ala Val Asp Val Gly Trp Ser Leu Ile Asn
1300 1305 1310
Thr Arg Ala Arg Leu Glu His Arg Ala Val Leu Val Gly Ala Asp Arg
1315 1320 1325
Glu Glu Leu Met Thr Arg Leu Gln Gly Leu Ile Asp Gly Asp Pro Ala
1330 1335 1340
Val Ala Ala Gly Val Ser Arg Asp Arg Gly Lys Thr Val Phe Val Phe
1345 1350 1355 1360
Pro Gly Gln Gly Ala Gln Leu Leu Gly Met Gly Ser Gly Leu Tyr Glu
1365 1370 1375
Ala Phe Pro Val Phe Ala Ala Ser Phe Asp Glu Thr Thr Ala Leu Leu
1380 1385 1390
Glu Gln Gln Leu Glu Cys Ser Leu Arg Asp Val Val Trp Gly Val Asp
1395 1400 1405
Glu Gln Ala Leu Gln Ala Thr Leu Tyr Thr Gln Thr Gly Leu Phe Ala
1410 1415 1420
Val Gly Ile Ala Leu Phe Arg Leu Leu Glu Ser Phe Gly Val Arg Pro
1425 1430 1435 1440
Asp Phe Val Ala Gly His Ser Ile Gly Glu Leu Ala Ala Ala Thr Val
1445 1450 1455
Ala Gly Val Leu Ser Leu Glu Asp Ala Thr Val Leu Val Ala Ala Arg
1460 1465 1470
Ala Arg Leu Met Gln Gln Leu Pro Thr Gly Gly Ala Met Leu Ala Met
1475 1480 1485
Arg Ala Ser Glu Thr Gln Ile Thr Thr Leu Leu Gly Asp Ser Ile Glu
1490 1495 1500
Asp Gly Val Val Glu Ile Ala Ala Val Asn Gly Pro Glu Ser Ile Val
1505 1510 1515 1520
Leu Ala Gly Pro Gln His Ala Ile Asp Thr Thr Glu Gln Gln Leu Arg
1525 1530 1535
Gln Ala Gly Tyr Gln Val Asn Arg Leu Arg Val Ser His Ala Phe His
1540 1545 1550
Ser Ala Ser Met Glu Pro Met Leu Ala Glu Phe Ala Arg Ile Ala Thr
1555 1560 1565
Glu Leu Thr Tyr Thr Gln Pro Val Ile Pro Ile Ile Ser Asn Leu Asp
1570 1575 1580
Gly Gln Leu Thr Gly Pro Asn Thr Asp Ser Pro Asn Thr Asp Ala Gln
1585 1590 1595 1600
Gln Ala Asp Ser Pro Leu Thr Thr Pro Gln Tyr Trp Val Asp His Val
1605 1610 1615
Arg Asn Thr Val Arg Phe Ala Asp Gly Ile Thr Thr Leu Thr Thr Ala
1620 1625 1630
Gly Ala Thr Arg Tyr Val Ile Met Gly Pro Asp Gly Gly Leu Ser Gly
1635 1640 1645
Leu Ile Asp Glu Thr Leu Gln Ser Ser Asp Thr Asp Thr Thr Asp Thr
1650 1655 1660
Val Val Thr Ser Leu Leu Arg Arg Asp Arg Val Glu Asp Thr Thr Phe
1665 1670 1675 1680
Leu Ser Ala Leu Ala Val Val Asp Val Ala Gly Ala Gly Ile Asp Trp
1685 1690 1695
Thr Pro Val Phe Asp Gly Arg Gly Ala Ser Arg Val Val Leu Pro Ser
1700 1705 1710
Tyr Ala Phe Gln His Arg Arg Tyr Trp Leu Asp Thr Ile Thr Gly Asn
1715 1720 1725
Thr Asp Pro Asp Ser Leu Gly Leu Ser Gly Leu Asp His Pro Leu Ile
1730 1735 1740
Gly Ala Val Val Val Ser Pro Glu Thr Gly Ala Val Thr Val Thr Gly
1745 1750 1755 1760
Arg Leu Ser Leu Gln Thr His Pro Trp Leu Ala Asp Tyr Ala Val Gly
1765 1770 1775
Gly Val Val Leu Leu Pro Gly Thr Gly Leu Val Glu Leu Val Ile Arg
1780 1785 1790
Ala Gly Asp Glu Val Gly Cys Gly Ala Ile Arg Glu Leu Thr Leu Leu
1795 1800 1805
Ala Pro Leu Thr Leu Pro Ala Glu Gly Gly Thr Ala Ile Gln Val Leu
1810 1815 1820
Val Gly Ala Leu Glu Thr Ser Gly Thr Arg Thr Val Ser Val Tyr Ser
1825 1830 1835 1840
Gln Thr Arg Asp Gln Glu Trp Val Leu Asn Ala Gln Gly Leu Leu His
1845 1850 1855
Thr Gln Ser Pro Val Glu Asn Leu Thr Thr Thr Thr Pro Val Asp Thr
1860 1865 1870
Gly Leu Ala Val Trp Pro Pro Gln Asn Ala Thr Arg Thr Asp Thr Ser
1875 1880 1885
Ser Leu Tyr Gln Gln Leu Ala Glu Asp Gly Tyr Gly Tyr Gly Pro Ala
1890 1895 1900
Phe Gln Gly Leu Glu Ser Val Trp Arg Thr Gly Glu Asp Trp Leu Val
1905 1910 1915 1920
Gln Ala Arg Leu Pro Glu Thr Gly Gly Asp Ala His His Tyr Gly Leu
1925 1930 1935
His Pro Ala Leu Leu Asp Ala Val Leu His Ala Met Thr Thr Gly His
1940 1945 1950
Asp Thr Ser Ala Gly Pro Leu Leu Pro Phe Ala Trp Glu Ala Val Gln
1955 1960 1965
Leu His Ala Val Gly Ala Ser Thr Val Arg Ala Arg Ile Thr Pro His
1970 1975 1980
Gly His Asn Thr Val Arg Ile Thr Val Phe Asp Leu Asp Gly Arg Pro
1985 1990 1995 2000
Val Leu Thr Ile Gly Ser Leu Thr Leu Arg Ser Val Gln Phe Ala Gln
2005 2010 2015
Leu Val Thr Ala Thr Ala Thr Glu Asp Arg Leu His Thr Leu His Trp
2020 2025 2030
Thr Pro Thr Thr Val Gln Leu Arg Glu Val Ser Phe Ala Glu Trp Thr
2035 2040 2045
Asp Leu Gln Leu Glu Ser Leu Asp Pro Glu Pro Ile Gly Trp Pro Pro
2050 2055 2060
Thr Pro Pro Val Val Val Leu Asp Cys Arg Glu Ser Glu His Asp Thr
2065 2070 2075 2080
Ser Val Gly Asp Gly Ala Asp Met Leu Ala Lys Thr Arg Ala Thr Gly
2085 2090 2095
Gln Arg Val Leu Gly Val Leu Gln Glu Phe Ser Thr Gln Gln Arg Phe
2100 2105 2110
Ala Ser Ser Thr Leu Leu Ile Leu Thr Arg Ala Ala Val Ser Val Thr
2115 2120 2125
Gly Asp Arg Ile Asp Glu Ala Ala Glu Arg Ile Asp Leu Ala Val Asp
2130 2135 2140
Pro Ala Ala Ser Ala Val Trp Gly Leu Val Arg Ser Ala Gln Ile Glu
2145 2150 2155 2160
Asp Pro Gly Arg Ile Leu Leu Leu Asp Thr Asp Ile His Gly Ile Asp
2165 2170 2175
Gly Thr Asp Leu Ala Glu Ile Val Ser Leu Ala Val Ala Val Gly Glu
2180 2185 2190
Pro Gln Val Leu Ile Arg Asp Gly Ile Ala His Thr Ala Arg Leu Val
2195 2200 2205
Arg Val Pro Glu Arg Ser Asp Thr Gly Thr Ala Ser Asp Ile Ser Asp
2210 2215 2220
Ala Val Ala Thr Val Ser Gly Ala Gly Gly Thr Val Val Val Thr Gly
2225 2230 2235 2240
Gly Thr Gly Gly Leu Gly Arg Ile Leu Ala Arg His Leu Val Gly Val
2245 2250 2255
Arg Gly Val Arg Ser Leu Val Leu Ala Ser Arg Arg Gly Leu Ala Ala
2260 2265 2270
Glu Gly Ala Arg Glu Leu Val Glu Glu Leu Thr Gly Ser Gly Ala Arg
2275 2280 2285
Val Ala Val Val Ala Cys Asp Val Ser Thr Arg Ala Gly Val Glu Gln
2290 2295 2300
Leu Leu Ala Ala Val Pro Asp Glu Asp Pro Leu Val Gly Val Val His
2305 2310 2315 2320
Ala Ala Gly Val Leu Asp Asp Gly Val Ile Ala Ser Leu Thr Pro Gln
2325 2330 2335
Arg Leu Asp Thr Val Leu Ser Ala Lys Ala Asp Ala Ala Trp Tyr Leu
2340 2345 2350
His Glu Leu Thr Arg Glu Leu Asp Val Ala Met Phe Val Met Tyr Ser
2355 2360 2365
Ser Val Thr Gly Val Leu Gly Ser Pro Gly Gln Gly Asn Tyr Ala Ala
2370 2375 2380
Ala Asn Gln Phe Leu Asp Gly Leu Ala Glu Tyr Arg Arg Ala Arg Gly
2385 2390 2395 2400
Leu Ala Ala Thr Ser Ile Ala Trp Gly Leu Trp Gly Ser Ser Thr Gly
2405 2410 2415
Met Thr Gly His Leu Asp Gly Gly Asp Thr Ala Arg Met Asn Arg Gly
2420 2425 2430
Gly Met Leu Ala Leu Thr Asp Asp Gln Gly Met Ala Met Phe Asn Ala
2435 2440 2445
Ala Val Ala Gln Asp Gln Ser Ser Val Leu Ala Val Arg Phe Asp Ile
2450 2455 2460
Thr Ala Leu Ala Ala Gln Ala Arg Ala Gly Val Leu Ala Pro Ile Leu
2465 2470 2475 2480
Asn Asn Leu Val Pro Gly Ala Arg Arg Ala Val Gly Asn Thr Ser Gly
2485 2490 2495
Gly Val Pro Gly Ser Gln Leu Gln Gln Arg Leu Ser Gly Leu Lys Asp
2500 2505 2510
Thr Glu Gln Ile Glu Leu Leu Leu Asp Leu Val Arg Ala Asp Val Ala
2515 2520 2525
Ile Val Leu Gly His Asp Asp Ile Thr Ala Ile Asp Ala Asp Arg Asn
2530 2535 2540
Phe Gln Glu Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Ala Arg Asn
2545 2550 2555 2560
Arg Ile Lys Thr Thr Thr Gly Val Ala Val Gln Ala Thr Leu Thr Phe
2565 2570 2575
Asp Tyr Pro Thr Pro Arg Ala Val Ala Glu His Leu Tyr Gln Gln Leu
2580 2585 2590
Ala Gly Ala Pro Val Val Ala Glu Pro Asp Val Val Gly Asp Ser Ala
2595 2600 2605
Glu Pro Ile Ala Ile Val Gly Val Gly Cys Arg Leu Pro Gly Gly Val
2610 2615 2620
Ser Ser Arg Glu Glu Leu Trp Gln Val Val Ala Gln Gly Arg Asp Val
2625 2630 2635 2640
Val Ser Gln Trp Pro Leu Asp Arg Gly Trp Asp Ala Gly Leu Phe Asp
2645 2650 2655
Pro Glu Pro Gly Val Ala Gly Lys Ser Tyr Thr Arg Glu Gly Gly Phe
2660 2665 2670
Leu His Asp Ala Gly Leu Phe Asp Ala Gly Phe Phe Gly Ile Ser Pro
2675 2680 2685
Arg Glu Ala Val Ala Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Thr
2690 2695 2700
Val Trp Glu Ala Leu Glu Asp Ala Gly Val Asp Pro Val Ser Leu Arg
2705 2710 2715 2720
Gly Ser Asp Thr Gly Val Phe Ile Gly Val Ser Asp Gln Ser Tyr Gly
2725 2730 2735
Ile Gly Arg Ser Asp Gly Asp Ala Gly Val Glu Gly Tyr Arg Leu Thr
2740 2745 2750
Gly Gly Ala Thr Ser Val Val Ser Gly Arg Val Ser Tyr Val Leu Gly
2755 2760 2765
Leu Glu Gly Pro Ala Val Ser Val Asp Thr Ala Cys Ser Ser Ser Leu
2770 2775 2780
Val Ala Leu His Gln Ala Val Gln Ala Val Arg Ala Gly Glu Cys Gly
2785 2790 2795 2800
Met Ala Leu Val Gly Gly Val Met Val Met Ala Thr Pro Asp Thr Phe
2805 2810 2815
Ile Glu Phe Ser Arg Gln Lys Gly Leu Ala Ala Asp Gly Arg Cys Lys
2820 2825 2830
Ser Phe Ala Glu Ala Ala Asp Gly Thr Gly Trp Ser Glu Gly Val Gly
2835 2840 2845
Val Leu Val Val Glu Arg Leu Ser Asp Ala Arg Arg Arg Gly His Gln
2850 2855 2860
Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser
2865 2870 2875 2880
Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Arg
2885 2890 2895
Arg Ala Leu Ala Asn Ala Gly Leu Ser Pro Asp Leu Ile Asp Val Val
2900 2905 2910
Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala Gln
2915 2920 2925
Ala Leu Leu Ala Thr Tyr Gly Gln Asn Arg Glu Pro Asp Arg Pro Leu
2930 2935 2940
Trp Leu Gly Ser Leu Lys Ser Asn Ile Gly His Ala Gln Ala Ala Ala
2945 2950 2955 2960
Gly Val Ala Gly Val Ile Lys Met Ile Glu Ala Met Arg His Glu Thr
2965 2970 2975
Leu Pro Lys Thr Leu His Ile Asp Thr Pro Thr Thr His Val Asp Trp
2980 2985 2990
Thr Ala Gly Ala Val Glu Leu Leu Thr Glu Ser Arg Ala Trp Thr Val
2995 3000 3005
Glu Ala Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly Ile Ser
3010 3015 3020
Gly Thr Asn Ala His Val Ile Leu Glu Gln Ser Pro Pro Val Thr Pro
3025 3030 3035 3040
Asp Thr Glu Ser Ser Ala Pro Asp Thr Asp Pro Val Pro Ala Val Lys
3045 3050 3055
Ser Asp Ala Val Val Trp Met Val Ser Gly Arg Thr Gly Glu Gly Leu
3060 3065 3070
Leu Ala Gln Gly Arg Arg Leu His Glu Trp Met Leu Ala Arg Pro Gly
3075 3080 3085
Leu Asp Ala Val Asp Val Gly Trp Ser Leu Ile Asn Thr Arg Ala Arg
3090 3095 3100
Leu Glu His Arg Ala Val Leu Val Gly Ala Asp Arg Glu Glu Leu Met
3105 3110 3115 3120
Thr Arg Leu Gln Gly Leu Ile Asp Gly Asp Pro Ala Val Ala Ala Gly
3125 3130 3135
Val Ser Arg Asp Arg Gly Lys Thr Val Phe Val Phe Pro Gly Gln Gly
3140 3145 3150
Ala Gln Leu Leu Gly Met Gly Ser Gly Leu Tyr Glu Ala Phe Pro Val
3155 3160 3165
Phe Ala Ala Ser Phe Asp Glu Thr Thr Ala Leu Leu Glu Gln Gln Leu
3170 3175 3180
Glu Cys Ser Leu Arg Asp Val Val Trp Gly Val Asp Glu Gln Ala Leu
3185 3190 3195 3200
Gln Ala Thr Leu Tyr Thr Gln Thr Gly Leu Phe Ala Val Gly Ile Ala
3205 3210 3215
Leu Phe Arg Leu Leu Glu Ser Phe Gly Val Arg Pro Asp Phe Val Ala
3220 3225 3230
Gly His Ser Ile Gly Glu Leu Ala Ala Ala Thr Val Ala Gly Val Leu
3235 3240 3245
Ser Leu Glu Asp Ala Thr Val Leu Val Ala Ala Arg Ala Arg Leu Met
3250 3255 3260
Gln Gln Leu Pro Thr Gly Gly Ala Met Leu Ala Met Arg Ala Ser Glu
3265 3270 3275 3280
Thr Gln Ile Thr Thr Leu Leu Gly Asp Ser Ile Glu Asp Gly Val Val
3285 3290 3295
Glu Ile Ala Ala Val Asn Gly Pro Glu Ser Ile Val Leu Ala Gly Pro
3300 3305 3310
Gln His Ala Ile Asp Thr Thr Glu Gln Gln Leu Arg Gln Ala Gly Tyr
3315 3320 3325
Gln Val Asn Arg Leu Arg Val Ser His Ala Phe His Ser Ala Ser Met
3330 3335 3340
Glu Pro Met Leu Ala Glu Phe Ala Arg Ile Ala Thr Glu Leu Thr Tyr
3345 3350 3355 3360
Thr Gln Pro Val Ile Pro Ile Ile Ser Asn Leu Asp Gly Gln Leu Thr
3365 3370 3375
Gly Pro Asn Thr Asp Ser Pro Asn Thr Asp Ala Gln Gln Ala Asp Ser
3380 3385 3390
Pro Leu Thr Thr Pro Gln Tyr Trp Val Asp His Val Arg Asn Thr Val
3395 3400 3405
Arg Phe Ala Asp Gly Ile Thr Thr Leu Thr Thr Ala Gly Ala Thr Arg
3410 3415 3420
Tyr Val Ile Met Gly Pro Asp Gly Gly Leu Ser Gly Leu Ile Asp Glu
3425 3430 3435 3440
Thr Leu Gln His Thr Thr Ser Asp Ala Val Asp Thr Lys Pro Thr Val
3445 3450 3455
Asp Gly Val Glu Ala Val Val Ala Ser Leu Leu Arg Lys Asp Arg Val
3460 3465 3470
Glu Asp Thr Thr Leu Leu Ser Ala Leu Ala Arg Leu Asp Val Ala Gly
3475 3480 3485
Thr Gly Ile Asp Trp Thr Pro Ile Phe His Gly Arg Gly Ala Thr Arg
3490 3495 3500
Val Pro Leu Pro Ser Tyr Ala Phe Gln His Arg Arg Tyr Trp Leu Asp
3505 3510 3515 3520
Thr Ile Thr Gly Asn Thr Asp Pro Asp Ser Leu Gly Leu Ser Gly Leu
3525 3530 3535
Asp His Pro Leu Ile Gly Ala Val Val Val Ser Pro Glu Thr Gly Ala
3540 3545 3550
Val Thr Val Thr Gly Arg Leu Ser Leu His Thr His Pro Trp Leu Ala
3555 3560 3565
Asp His Ala Val Gly Gly Val Val Leu Val Pro Gly Thr Gly Leu Val
3570 3575 3580
Glu Leu Val Ile Arg Ala Gly Asp Glu Ala Gly Cys Gly Val Val Arg
3585 3590 3595 3600
Glu Leu Thr Leu Leu Ala Pro Leu Thr Leu Pro Thr Asp Gly Gly Thr
3605 3610 3615
Ala Val Gln Val Leu Val Gly Ala Leu Glu Ser Ser Gly Thr Arg Thr
3620 3625 3630
Val Ser Val Tyr Ser Gln Thr Arg Asp Gln Glu Trp Val Leu Asn Ala
3635 3640 3645
Gln Gly Leu Leu Gln Thr Gln Ser Ala Thr Thr Pro His Asp Val Asp
3650 3655 3660
Thr Glu Leu Ala Ala Trp Pro Pro Ala Gly Ala Val Gln Ala Asp Thr
3665 3670 3675 3680
Ser Ser Leu Tyr Gln Gln Leu Ala Glu Asp Gly Tyr Gly Tyr Gly Pro
3685 3690 3695
Ala Phe Gln Gly Leu Glu Ser Val Trp Arg Thr Gly Gln Asp Trp Leu
3700 3705 3710
Val Gln Ala Thr Leu Pro Glu Thr Gly Gly Asp Ala His His Tyr Gly
3715 3720 3725
Leu His Pro Ala Leu Leu Asp Thr Val Leu His Ala Met Thr Thr Gly
3730 3735 3740
His Asp Thr Ser Ala Gly Pro Leu Leu Pro Phe Ala Trp Glu Ala Val
3745 3750 3755 3760
Gln Leu His Ala Val Gly Ala Ser Thr Val Arg Ala Arg Ile Thr Pro
3765 3770 3775
His Gly His Asn Thr Val Gln Val Thr Val Thr Asp Pro Ala Gly Arg
3780 3785 3790
Pro Val Leu Thr Ile Gly Ser Leu Thr Leu Arg Pro Ala Gln Leu Asp
3795 3800 3805
Gln Leu Thr Ala Ala Ala Gly Thr Gly Asp Arg Leu Leu Thr Val His
3810 3815 3820
Trp Thr Pro Thr Thr Thr Ser Arg Gln Pro Gln Asp Val Ala Tyr Thr
3825 3830 3835 3840
Glu Trp Thr Asp Leu Gln Ala Glu Ser Thr Asp Pro Glu Ser Ala Asp
3845 3850 3855
Gln Pro Ala Pro Gln Val Val Val Leu Asp Cys Arg Asp Lys Glu Asn
3860 3865 3870
Gly Thr Asp Val Leu Val Arg Ala His Ala Ile Ser His Arg Val Leu
3875 3880 3885
Gly Val Leu Gln Glu Phe Ser Thr Gly Gln Arg Phe Ala Ser Ser Thr
3890 3895 3900
Leu Leu Val Leu Thr Arg Ala Ala Val Thr Thr Thr Ala Gly Asp Arg
3905 3910 3915 3920
Val Asp Pro Ala Ala Ser Thr Ile Trp Gly Leu Val Arg Ser Ala Gln
3925 3930 3935
Ser Glu Glu Pro Gly Arg Ile Leu Leu Ala Asp Thr Asp Ile Glu Gly
3940 3945 3950
Ser Asp Gly Val Asp Val Ala Gly Ile Val Ser Leu Ala Val Ala Val
3955 3960 3965
Gly Glu Pro Gln Val Leu Ile Arg Asp Gly Ile Ala His Ile Ala Arg
3970 3975 3980
Leu Thr Arg Gly Pro Gly Arg Gly Thr Leu Ala Ile Pro Asp Ala Gly
3985 3990 3995 4000
Ala Trp Gln Leu Ala Ala Val Asp Lys Gly Val Leu Asp Gly Leu Ala
4005 4010 4015
Leu Val Ser His Pro Leu Ala Glu Gln Pro Leu Ala Ala Gly Gln Val
4020 4025 4030
Arg Ile Ser Val Arg Ala Ala Gly Leu Asn Phe Arg Asp Val Leu Ile
4035 4040 4045
Ala Leu Gly Met Tyr Pro Asp Asp Asp Ala Val Val Gly Ala Glu Leu
4050 4055 4060
Ala Gly Val Ile Val Glu Val Gly Ala Asp Val Glu Gly Leu Ser Val
4065 4070 4075 4080
Gly Asp Arg Val Met Gly Leu Ala Gly Arg Gly Val Gly Pro Val Val
4085 4090 4095
Ile Val Asp His Arg Leu Val Val His Met Pro Ala Gly Trp Ser Phe
4100 4105 4110
Ala Gln Ala Ala Ala Val Pro Val Val Phe Leu Thr Ala Tyr Tyr Gly
4115 4120 4125
Leu Met Asp Leu Ala His Ala Lys Pro Gly Asp Arg Leu Leu Val His
4130 4135 4140
Ala Ala Thr Gly Gly Val Gly Met Ala Ala Ile Gln Leu Ala Arg Cys
4145 4150 4155 4160
Trp Gly Leu Glu Val Phe Ala Thr Ala Ser Ser Gly Lys Trp Asp Val
4165 4170 4175
Leu Arg Gly Ile Gly Phe Asp Asp Gln His Ile Ala Asn Ser Arg Thr
4180 4185 4190
Leu Ser Phe Glu Asp Glu Phe Leu Ser Ala Thr Asp Gly His Gly Val
4195 4200 4205
Asp Ile Val Leu Asn Ser Leu Ala Gly Asp Phe Val Asp Ala Ser Leu
4210 4215 4220
Arg Leu Leu Pro Arg Gly Gly His Phe Leu Glu Met Gly Lys Thr Asp
4225 4230 4235 4240
Lys Arg Asp Ser Asp Ala Ile Thr Thr Gln Tyr Pro Gly Val Ile Tyr
4245 4250 4255
Gln Ala Phe Asp Met Phe Glu Ala Gly Glu Asp Arg Ile Gln Gln Met
4260 4265 4270
Leu Ser Glu Leu Thr Ala Ser Phe Asp Arg Gly Glu Leu Lys Ser Ile
4275 4280 4285
Pro Ile Gln Ala Trp Asp Ile Arg Gln Ala Pro Glu Ala Phe Arg Tyr
4290 4295 4300
Phe Ser Gln Thr Arg His Ile Gly Lys Val Val Leu Thr Leu Pro Val
4305 4310 4315 4320
Val Ser Thr Leu Pro Val Val Ser Asp Thr Thr Asp Thr Gly Arg Gly
4325 4330 4335
Thr Val Val Leu Thr Gly Gly Thr Gly Gly Leu Gly Arg Ile Leu Ala
4340 4345 4350
Arg His Leu Val Gly Val Arg Gly Val Arg Ser Leu Val Leu Ala Ser
4355 4360 4365
Arg Arg Gly Ile Ala Ala Glu Gly Ala Arg Glu Leu Val Glu Glu Leu
4370 4375 4380
Thr Gly Ser Gly Ala Arg Val Ala Val Val Ala Cys Asp Val Ser Thr
4385 4390 4395 4400
Arg Ala Gly Val Glu Gln Leu Leu Ala Ala Val Pro Asp Glu Asp Pro
4405 4410 4415
Leu Val Gly Val Val His Ala Ala Gly Val Leu Asp Asp Gly Val Ile
4420 4425 4430
Ala Ser Leu Thr Pro Gln Arg Leu Asp Thr Val Leu Ser Val Lys Ala
4435 4440 4445
Asp Ala Ala Trp Tyr Leu His Glu Leu Thr Arg Gly Leu Asp Leu Gly
4450 4455 4460
Met Phe Val Met Tyr Ser Ser Thr Ala Gly Val Leu Gly Ser Pro Gly
4465 4470 4475 4480
Gln Gly Asn Tyr Ala Ala Ala Asn Gln Phe Leu Asp Gly Leu Ala Glu
4485 4490 4495
His Arg Arg Ala Gln Gly Leu Pro Ala Thr Ser Ile Ala Trp Gly Leu
4500 4505 4510
Trp Gly Ser Ser Thr Gly Met Thr Gly His Leu Asp Gly Gly Asp Thr
4515 4520 4525
Ala Arg Met Asn Arg Gly Gly Tyr Leu Ala Met Thr Asp Glu Gln Gly
4530 4535 4540
Met Ala Met Phe Asp Thr Ala Ile Thr Ala Glu His Ala Thr Val Leu
4545 4550 4555 4560
Ala Ala Arg Phe Asp Thr Thr Ala Leu Ala Ala Gln Ala Arg Ala Gly
4565 4570 4575
Met Leu Thr Pro Ile Leu His Gln Leu Val Pro Asn Ala Arg Arg Ala
4580 4585 4590
Ala Thr Gly Asp Thr Gly Ser Ala Ser Gly Val Ala Gly Ser Gln Leu
4595 4600 4605
Arg Gln Arg Leu Ser Gly Leu Asp Glu Ala Glu Gln Val Lys Ile Leu
4610 4615 4620
Leu Glu Leu Val Gln Thr Gln Val Ala Ile Val Leu Gly His Asp Asp
4625 4630 4635 4640
Ala Thr Thr Ile Asp Ala Asp Arg Asn Phe Gln Glu Leu Gly Phe Asp
4645 4650 4655
Ser Leu Thr Ala Val Glu Ala Arg Asn Arg Leu Lys Thr Ala Thr Glu
4660 4665 4670
Val Ala Ile Pro Ala Thr Leu Thr Phe Asp Tyr Pro Thr Pro Arg Ala
4675 4680 4685
Val Ala Glu His Leu His Gln Gln Leu Ala Gly Arg Ser Gly Arg Arg
4690 4695 4700
Gly Val Asp Glu Ile Leu Tyr Arg Ile Glu Ser Leu Leu Ser Asp Ala
4705 4710 4715 4720
Asn Leu Ser Val Ala Asp Arg Lys Ser Leu Leu Asp Gly Phe Gly Lys
4725 4730 4735
Leu Val Leu Lys Ser Gly Glu Lys Asn Trp Asp Val Arg Pro Asn Asp
4740 4745 4750
Leu Ser Gly Asn Ser Ala Val Lys Glu Val Ile Lys Glu Ser Ser Ala
4755 4760 4765
Asp Asp Leu Met Asn Phe Ile Gln Thr Gln Leu Gly Tyr Pro Gly Val
4770 4775 4780
<210> 7
<211> 253
<212> PRT
<213> Nocardia vinacea
<400> 7
Leu Ser Thr Ser Ala Glu Ile Ser Leu Trp Phe Arg Arg Phe Asn Pro
1 5 10 15
Ser Pro Thr Ala Ser Ser Arg Leu Ile Cys Phe Pro His Ala Gly Gly
20 25 30
Ser Ala Ser Phe Phe Leu Pro Leu Ser Arg Ala Met Ser Pro Glu Val
35 40 45
Glu Val Leu Ser Val Gln Tyr Pro Gly Arg Gln Asp Arg Arg Asn Glu
50 55 60
Gln Pro Ala Gly Ser Ile Ala Ala Leu Ala Asp Ser Ile Ala Asp Asn
65 70 75 80
Ile Ser His Phe Ser Asp Lys Pro Leu Ala Leu Phe Gly His Ser Met
85 90 95
Gly Ala Ile Leu Ala Tyr Glu Val Thr Arg Arg Ile Ser Ile Thr Asn
100 105 110
Ser Pro Ile Ala Leu Phe Ala Ser Gly Arg Arg Ala Pro Ser Arg Tyr
115 120 125
Arg Pro Glu Ile Ala His Thr Leu Ser Asp Glu Lys Leu Leu Glu Glu
130 135 140
Leu Lys Met Leu Gly Gly Thr Asp Ser Arg Ala Phe Ala Asp Asn Asp
145 150 155 160
Ile Val Arg Met Ile Leu Pro Ala Val Arg Ala Asp Tyr Arg Ala Ile
165 170 175
Glu Thr Tyr Phe Tyr Gln Pro Gly Ser Glu Val Ser Thr Pro Ile Phe
180 185 190
Ala His Ile Gly Asp Arg Asp Pro Arg Val Thr Phe Asp Glu Ala Ser
195 200 205
Ser Trp Lys Glu His Thr Ser Asn Ser Phe Glu Leu His Thr His Thr
210 215 220
Gly Gly His Phe Tyr Ile Ala Glu His Thr Asn Ser Ile Ala Thr His
225 230 235 240
Ile Gln Gln Lys Leu Ser Glu His Pro Ile Arg Pro Arg
245 250
<210> 8
<211> 395
<212> PRT
<213> Nocardia vinacea
<400> 8
Met Ser Glu Ala Pro Val Ile Ala Thr Gln Leu Pro Thr Thr Arg Ser
1 5 10 15
Gly Arg Cys Pro Phe Asp Pro Pro Ala Ala Leu Thr Glu Ile Arg Gln
20 25 30
Arg Asp Pro Leu Thr Arg Met Gln Phe Ala Asn Gly His Gln Gly Trp
35 40 45
Leu Ala Thr Gly His Thr Glu Val Arg Ala Val Leu Ser Asp Pro Arg
50 55 60
Phe Ser Ala Arg His Glu Leu Gln His Tyr Pro Tyr Ala Asp Tyr Gly
65 70 75 80
Pro Met Pro Pro Ala Pro Val Gly Ala Leu Ala Gly Met Asp Gly Pro
85 90 95
Asp His Arg Arg Tyr Arg Lys Leu Leu Thr Gly Lys Phe Thr Val Arg
100 105 110
Arg Met Gln Leu Leu Thr Glu Arg Ile Glu Gln Ile Thr Thr Glu His
115 120 125
Leu Asp Ala Met Glu Lys His Gly Gly Pro Ile Asp Leu Val Thr Ala
130 135 140
Phe Ala Arg Pro Ile Pro Ala Leu Met Ile Cys Glu Leu Leu Gly Val
145 150 155 160
Pro Ser Ser Asp Arg Thr Thr Phe Gln Glu His Ala Lys Lys Ala Ser
165 170 175
Asp Val Thr Ala Gly Leu Glu Glu Arg Leu Ala Ala Tyr Thr Ala Ile
180 185 190
Val Asp Tyr Val Ala Asp Leu Val Thr Asp Lys Arg Thr Ala Pro Thr
195 200 205
Asp Asp Leu Leu Ser Asp Leu Thr Thr Thr Asp Leu Thr Asp Glu Glu
210 215 220
Leu Ala Gly Ile Gly Ala Phe Leu Leu Gly Ala Gly Leu Asp Thr Thr
225 230 235 240
Ala Asn Met Leu Ala Leu Gly Thr Phe Ala Leu Leu Thr His Pro Glu
245 250 255
Gln Leu Ala Ala Leu Arg Ser Asp Pro Asp Leu Thr Asp Ser Ala Val
260 265 270
Glu Glu Leu Met Arg Tyr Leu Ser Ile Ser His Ser Thr Ala Arg Ala
275 280 285
Ala Leu Glu Asp Val Glu Leu Gly Gly Lys Leu Ile Arg Ala Gly Glu
290 295 300
Thr Val Ala Val Ser Ile Gln Thr Ala Asn Arg Asp Pro Ala Arg Phe
305 310 315 320
Asp Asn Pro Asp Ala Leu Asp Leu His Arg Asn Thr Val Gly His Val
325 330 335
Gly Phe Ser His Gly Ala His Gln Cys Leu Gly Gln Gln Leu Ala Arg
340 345 350
Val Glu Met Arg Val Ala Phe Arg Ala Leu Val Ile Arg Phe Pro Asn
355 360 365
Leu Lys Leu Ala Ile Pro Ala His Glu Val Gln Leu Gly Ser Gly Gln
370 375 380
Ile Phe Gly Val Asn Gln Leu Pro Val Ser Trp
385 390 395
<210> 9
<211> 64
<212> PRT
<213> Nocardia vinacea
<400> 9
Met Lys Leu Val Val Asp Arg Asn Arg Cys Ile Gly Ala Gly Met Cys
1 5 10 15
Ala Leu Thr Ala Pro Ala Leu Phe Asp Gln Asp Asp Asp Asp Gly Leu
20 25 30
Val Ile Thr His Ala Glu Thr Pro Thr Pro Asp Gln Glu Gly Val Val
35 40 45
Arg Glu Ala Val Glu Ala Cys Pro Ser Gly Ala Leu Arg Thr Glu Glu
50 55 60
<210> 10
<211> 39
<212> DNA
<213> Artificial Sequence
<400> 10
cgacggccag tgccaagctt gaaccggttg tggtggtgg 39
<210> 11
<211> 39
<212> DNA
<213> Artificial Sequence
<400> 11
tatgacatga ttacgaattc ttcggcagtg tctcgtggc 39
<210> 12
<211> 20
<212> DNA
<213> Artificial Sequence
<400> 12
tcggattcga ctccctcacc 20
<210> 13
<211> 24
<212> DNA
<213> Artificial Sequence
<400> 13
gagcggataa caatttcaca cagg 24
<210> 14
<211> 40
<212> DNA
<213> Artificial Sequence
<400> 14
cgataagctt ggatcatttt gtccccaccg atagatagtc 40
<210> 15
<211> 42
<212> DNA
<213> Artificial Sequence
<400> 15
ggctgcaggt cgactcgaga gaaaacagtt gtcctgaata ag 42
<210> 16
<211> 24
<212> DNA
<213> Artificial Sequence
<400> 16
gagcggataa caatttcaca cagg 24
<210> 17
<211> 20
<212> DNA
<213> Artificial Sequence
<400> 17
attgcattcg ggtcagggga 20

Claims (3)

1.一种聚酮化合物生物合成基因簇,其特征在于所述聚酮化合物为TubelactomicinA,所述生物合成基因簇序列如SEQ ID NO:1所示。
2.根据权利要求1所述的聚酮化合物生物合成基因簇在构建聚酮化合物重组表达系统中的应用,其特征在于对所述聚酮化合物生物合成基因簇中的功能基因进行基因敲除,或者将所述聚酮化合物生物合成基因簇与细胞色素P450酶基因联合构建聚酮化合物重组表达系统,细胞色素 P450 酶氨基酸序列如 SEQ ID NO:8 所示。
3.一种Tubelactomicin A的生物合成方法,其特征在于构建细胞色素P450酶的回补质粒,将其转化入诺卡氏菌DSM44638菌体,发酵培养,分离纯化发酵液,得到TubelactomicinA,细胞色素P450酶氨基酸序列如SEQ ID NO:8 所示。
CN202110446525.8A 2021-04-25 2021-04-25 一种聚酮化合物骨架及其后修饰物的生物合成基因簇及其应用 Active CN115247179B (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110446525.8A CN115247179B (zh) 2021-04-25 2021-04-25 一种聚酮化合物骨架及其后修饰物的生物合成基因簇及其应用

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110446525.8A CN115247179B (zh) 2021-04-25 2021-04-25 一种聚酮化合物骨架及其后修饰物的生物合成基因簇及其应用

Publications (2)

Publication Number Publication Date
CN115247179A CN115247179A (zh) 2022-10-28
CN115247179B true CN115247179B (zh) 2024-03-12

Family

ID=83697097

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110446525.8A Active CN115247179B (zh) 2021-04-25 2021-04-25 一种聚酮化合物骨架及其后修饰物的生物合成基因簇及其应用

Country Status (1)

Country Link
CN (1) CN115247179B (zh)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH11217382A (ja) * 1998-01-28 1999-08-10 Microbial Chem Res Found 抗生物質ツベラクトマイシンとその製造法
JP2001055386A (ja) * 1999-08-13 2001-02-27 Microbial Chem Res Found 抗生物質ツベラクトマイシンb、dおよびeとその製造法
CN101275141A (zh) * 2008-03-07 2008-10-01 中国科学院上海有机化学研究所 阿嗪霉素的生物合成基因簇
WO2010011882A1 (en) * 2008-07-25 2010-01-28 Wyeth Biosynthetic gene cluster for the production of a complex polyketide
CN101818158A (zh) * 2010-03-30 2010-09-01 中国科学院上海有机化学研究所 Fr901464的生物合成基因簇

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW200716744A (en) * 2005-05-26 2007-05-01 Eisai R&D Man Co Ltd Genetically modified microorganism and process for production of macrolide compound using the microorganism

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH11217382A (ja) * 1998-01-28 1999-08-10 Microbial Chem Res Found 抗生物質ツベラクトマイシンとその製造法
JP2001055386A (ja) * 1999-08-13 2001-02-27 Microbial Chem Res Found 抗生物質ツベラクトマイシンb、dおよびeとその製造法
CN101275141A (zh) * 2008-03-07 2008-10-01 中国科学院上海有机化学研究所 阿嗪霉素的生物合成基因簇
WO2010011882A1 (en) * 2008-07-25 2010-01-28 Wyeth Biosynthetic gene cluster for the production of a complex polyketide
CN101818158A (zh) * 2010-03-30 2010-09-01 中国科学院上海有机化学研究所 Fr901464的生物合成基因簇

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Natural Product Synthesis Featuring Intramolecular Diels-Alder Approaches-Total Syntheses of Tubelactomicins and Spiculoic Acid A;Tadano K;《Eur J Org Chem》;第2009卷(第26期);第4381-4394页 *
微生物次级代谢产物生物合成基因簇与药物创新;白林泉等;《中国抗生素杂志》;第31卷(第02期);第80-86页 *
链霉菌中高效生产聚酮化合物的研究方法及进展;姚永鹏等;《微生物学报》;第56卷(第03期);第418-428页 *

Also Published As

Publication number Publication date
CN115247179A (zh) 2022-10-28

Similar Documents

Publication Publication Date Title
CA2731760C (en) Pyripyropene a biosynthetic gene
US10047363B2 (en) NRPS-PKS gene cluster and its manipulation and utility
CN110218244B (zh) 化合物ilamycin F及其应用
AU2021201969B2 (en) Process
CN101275141A (zh) 阿嗪霉素的生物合成基因簇
CN110777155B (zh) 最小霉素生物合成基因簇、重组菌及其应用
KR101602195B1 (ko) 비천연항생물질의 제조방법
EP0929681B1 (en) Rifamycin biosynthesis gene cluster
WO2001051639A2 (en) Everninomicin biosynthetic genes
CN111117942B (zh) 一种产林可霉素的基因工程菌及其构建方法和应用
CN115247179B (zh) 一种聚酮化合物骨架及其后修饰物的生物合成基因簇及其应用
CN104928305B (zh) 一种大环内酰胺类化合物heronamides的生物合成基因簇及其应用
KR20020029767A (ko) 환상 뎁시펩티드 합성효소 및 그의 유전자 및 환상뎁시펩티드의 대량생산계
CN101586112A (zh) 诺丝七肽的生物合成基因簇
AU783603B2 (en) Transformant producing secondary metabolite modified with functional group and novel biosynthesis genes
CN104427870A (zh) Uk-2生物合成基因和使用其提高uk-2生产率的方法
US7670827B2 (en) Strain belonging to the genus Streptomyces and being capable of producing nemadictin and process for producing nemadictin using the strain
CN107164394B (zh) 一种非典型角环素类化合物nenestatin A的生物合成基因簇及其应用
CN110551739A (zh) 吡唑霉素生物合成基因簇、重组菌及其应用
CN101545000B (zh) 一种提高东方拟无枝酸菌发酵生产eco-0501的产量的方法
KR100861771B1 (ko) 발리다마이신 생합성을 위한 발리오론 합성효소 및 이의 제조방법
JP5335413B2 (ja) アスパラギン酸アミノトランスフェラーゼ遺伝子およびl−ホスフィノスリシンの製造方法
CN1509334B (zh) 生产pf1022物质衍生物的转化体及其制备方法以及新型生物合成基因
CA2354030A1 (en) Micromonospora echinospora genes encoding for biosynthesis of calicheamicin and self-resistance thereto
CN110129244B (zh) 链霉菌底盘菌株及其构建方法、在异源表达研究中的应用

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant