CN1190444C - 南昌霉素生物合成基因簇 - Google Patents
南昌霉素生物合成基因簇 Download PDFInfo
- Publication number
- CN1190444C CN1190444C CNB031149200A CN03114920A CN1190444C CN 1190444 C CN1190444 C CN 1190444C CN B031149200 A CNB031149200 A CN B031149200A CN 03114920 A CN03114920 A CN 03114920A CN 1190444 C CN1190444 C CN 1190444C
- Authority
- CN
- China
- Prior art keywords
- ala
- leu
- gly
- val
- arg
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Landscapes
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
一种作为聚醚类抗生素的南昌霉素生物合成基因簇属于基因技术领域。整个南昌霉素生物合成基因簇共30个基因,具体为:(1)聚酮合酶基因,即nanA1,nanA2,nanA3,nanA4,nanA5,nanA6,nanA7,nanA8,nanA9,nanA10,nanA11共11个基因;(2)南昌霉素的修饰基因,即nanE,nanI,nanO,nanP共4个基因;(3)南昌霉素脱氧糖的生物合成基因,即nanG1,nanG2,nanG3,nanG4,nanG5,nanM共6个基因;(4)南昌霉素的调节基因,即nanR1,nanR2,nanR3,nanR4,nanT1,nanT2,nanT3,nanT4,nanT5共9个基因。本发明所提供的基因及其蛋白质,抗体也可以用来查找和发展可用于医药,工业,农业的化合物或蛋白。
Description
技术领域
本发明涉及的是一种抗生素生物合成基因簇,特别是一种作为聚醚类抗生素的南昌霉素生物合成基因簇,属于基因技术领域。
背景技术
链霉菌及其近缘放线菌由于可以产生大量的天然抗生素而具有极其重要的应用价值。聚酮化合物是这些天然产物最多的类群之一,因其巨大的药用价值而被广泛地用于医用、兽用和农用等领域。例如抗细菌抗生素红霉素(erythromycin)、抗真菌抗生素两性霉素B(amphotericin B)、抗寄生虫抗生素阿维菌素(avermectin)、肿瘤抑制剂雷帕霉素(rapamycin)、抗肿瘤抗生素柔红霉素(daunarubicin)等。现有技术中还存在着一些技术问题需要解决,聚酮化合物是由聚酮合酶(PKS)催化形成的,以模块结构形式组成的PKS以类似脂肪酸生物合成的方式通过连续的缩合反应将一些简单的羧酸分子催化形成聚酮化合物。每一个模块在聚酮链形成过程中只负责一步缩合反应,它至少包含一个β-酮酯酰合成酶(KS)结构域,一个酰基转移酶(AT)结构域和一个酰基载体蛋白(ACP)结构域。此外,还可能包含一个β-酮酯酰还原酶(KR)结构域,一个脱水酶(DH)结构域和一个酯酰还原酶(ER)结构域,它们决定着加入的延伸单位的还原步骤。此外,还需要硫酯酶(TE)结构域的作用催化聚酮链的环化与释放。最后,还要经过羟基化、糖基化、甲基化和酰基化等修饰步骤。这些步骤对于大多数终产物的生物活性来说是至关重要的。
聚酮生物合成PKS的模块结构组成具有一定的可塑性,使得人们能够通过改变模块的数目、延伸模块的特异性或结构域的插入或失活等基因工程操作获得新的聚酮衍生物。聚醚抗生素是一类作用于线粒体阳离子转移的聚酮化合物,它常常具有抗微生物的活性,如抗革兰氏阳性细菌(包括分枝杆菌)和真菌。在兽药中作为一种生长促进剂也起着重要作用。聚醚抗生素的结构特点使之能够通过成环将氧原子集中在中心区域以结合合适的阳离子形成复合物,如结合钠离子的猎神霉素,侧链烷基基团然后跨越外膜。通过这种方式,阳离子便可穿越目标细胞膜,导致细胞的去极化和最终死亡。
发明内容
本发明根据背景技术中存在的不足和需要解决的技术问题,提供一种南昌霉素生物合成基因簇,其产生的一种聚醚类离子载体抗生素南昌霉素的化学结构和理化特性与猎神霉素(dianemycin)同质,对防治鸡球虫病有显著疗效,无毒副作用,同时还具有一定的增重效果,是一种理想的抗鸡球虫药。
本发明是通过以下的技术方案来实现的,本发明整个聚醚类离子载体抗生素生物合成基因簇共30个基因的核苷酸序列或互补序列(序列1)。其中11个用于编码聚酮合酶(PKS),它包含14个模块,共74个结构域,负责催化南昌霉素聚酮糖苷配基的生物合成。另有4个基因,即nanE,nanI,nanO,nanP编码参与南昌霉素生物合成修饰的蛋白,负责催化聚酮链的氧化、异构化和聚醚结构的形成。还有6个基因,即nanG1,nanG2,nanG3,nanG4,nanG5,nanM负责编码参与南昌霉素生物合成中脱氧糖(4-O-甲基-L-rhodinose)的合成及转移的蛋白。此外,在基因簇的两侧还存在着两组共9个调节基因,即nanR1,nanR2,nanR3,nanR4,nanT1,nanT2,nanT3,nanT4,nanT5,它们参与南昌霉素生物合成的调节。这些核苷酸序列是分别选自于序列1中的nanT1(2052-682),nanT2(2064-2879),nanT3(3117-3818),nanT4(4923-3724),nanT5(6106-4916),nanR1(7493-6768),nanR2(8334-7573),nanA1(8919-17627),nanA2(17642-24313),nanA3(24310-36408),nanA4(36429-48299),nanA5(48345-60284),nanA6(60305-65302),nanG5(66747-65365),nanM(66881-67798),nanG4(67823-68863),nanG3(68860-70164),nanG2(71140-70145),nanG1(72045-71137),nanA7(76990-72050),nanE(77864-76992),nanA10(78263-77949),nanO(79763-78327),nanI(80769-79828),nanA9(83192-80784),nanA11(89759-83196),nanA8(100138-89771),nanP(101467-100196),nanR3(101593-102600),nanR4(102600-103541)。
本发明还提供了一个编码膜整合型转移蛋白的核苷酸序列,由序列2中的氨基酸序列组成,命名为nanT1,基因的核苷酸序列选自于序列1中2052-682碱基。
本发明还提供了一个编码ABC转移因子蛋白的核苷酸序列,由序列3中的氨基酸序列组成,命名为nanT2,基因的核苷酸序列选自于序列1中2064-2879碱基。
本发明还提供了一个编码双组分反馈调节因子蛋白的核苷酸序列,由序列4中的氨基酸序列组成,命名为nanT3,基因的核苷酸序列选自于序列1中3117-3818碱基。
本发明还提供了一个编码化学受体蛋白的核苷酸序列,由序列5中的氨基酸序列组成,命名为nanT4,基因的核苷酸序列选自于序列1中4923-3724碱基。
本发明还提供了一个编码双组分传感器组氨酸激酶蛋白的核苷酸序列,由序列6中的氨基酸序列组成,命名为nanT5,基因的核苷酸序列选自于序列1中6106-4916碱基。
本发明还提供了一个编码调节蛋白的核苷酸序列,由序列7中的氨基酸序列组成,命名为nanR1,基因的核苷酸序列选自于序列1中7493-6768碱基。
本发明还提供了一个编码调节蛋白的核苷酸序列,由序列8中的氨基酸序列组成,命名为nanR2,基因的核苷酸序列选自于序列1中8334-7573碱基。
本发明还提供了编码包括KSQ,AT-L,ACP-L,KS1,AT1,DH1,KR1,ACP1聚酮合酶结构域的核苷酸序列,这些结构域由序列9中的氨基酸序列组成,命名为nanA1,基因的核苷酸序列选自于序列1中的8919-17627碱基。
本发明还提供了编码包括KS2,AT2,DH2,ER2,KR2,ACP2聚酮合酶结构域的核苷酸序列,这些结构域由序列10中的氨基酸序列组成,命名为nanA2,基因的核苷酸序列选自于序列1中的17642-24313碱基。
本发明还提供了编码包括KS3,AT3,DH3,KR3,ACP3,KS4,AT4,DH4,ER4,KR4,ACP4聚酮合酶结构域的核苷酸序列,这些结构域由序列11中的氨基酸序列组成,命名为nanA3,基因的核苷酸序列选自于序列1中的24310-36408碱基。
本发明还提供了编码包括KS5,AT5,DH5,KR5,ACP5,KS6,AT6,DH6,ER6,KR6,ACP6聚酮合酶结构域的核苷酸序列,这些结构域由序列12中的氨基酸序列组成,命名为nanA4,基因的核苷酸序列选自于序列1中的36249-48299碱基。
本发明还提供了编码包括KS7,AT7,DH7,KR7,ACP7,KS8,AT8,DH8,ER8,KR8,ACP8聚酮合酶结构域的核苷酸序列,这些结构域由序列13中的氨基酸序列组成,命名为nanA5,基因的核苷酸序列选自于序列1中的48345-60284碱基。
本发明还提供了编码包括KS9,AT9,KR9,ACP9聚酮合酶结构域的核苷酸序列,这些结构域由序列14中的氨基酸序列组成,命名为nanA6,基因的核苷酸序列选自于序列1中的60305-65302碱基。
本发明还提供了一个编码糖基转移酶蛋白的核苷酸序列,由序列15中的氨基酸序列组成,命名为nanG5,基因的核苷酸序列选自于序列1中66747-65365碱基。
本发明还提供了一个编码甲基转移酶蛋白的核苷酸序列,由序列16中的氨基酸序列组成,命名为nanM,基因的核苷酸序列选自于序列1中66881-67798碱基。
本发明还提供了一个编码NDP-D-葡萄糖-4,6-脱水酶,NDP-D-葡萄糖-4-异构酶,NDP-D-葡萄糖-4-还原酶蛋白的核苷酸序列,由序列17中的氨基酸序列组成,命名为nanG4,基因的核苷酸序列选自于序列1中67823-68863碱基。
本发明还提供了一个编码NDP-D-葡萄糖-3,4-脱水酶蛋白的核苷酸序列,由序列18中的氨基酸序列组成,命名为nanG3,基因的核苷酸序列选自于序列1中68860-70164碱基。
本发明还提供了一个编码dTDP-D-葡萄糖-4,6-脱水酶蛋白的核苷酸序列,由序列19中的氨基酸序列组成,命名为nanG2,基因的核苷酸序列选自于序列1中71140-70145碱基。
本发明还提供了一个编码葡萄糖-1-磷酸:TTP胸苷基转移酶蛋白的核苷酸序列,由序列20中的氨基酸序列组成,命名为nanG1,基因的核苷酸序列选自于序列1中72045-71137碱基。
本发明还提供了编码包括KS10,AT10,KR10,ACP10聚酮合酶结构域的核苷酸序列,这些结构域由序列21中的氨基酸序列组成,命名为nanA7,基因的核苷酸序列选自于序列1中的76990-72050碱基。
本发明还提供了一个编码环氧化物水解酶蛋白的核苷酸序列,由序列22中的氨基酸序列组成,命名为nanE,基因的核苷酸序列选自于序列1中77864-76992碱基。
本发明还提供了编码ACP聚酮合酶结构域的核苷酸序列,这个结构域由序列23中的氨基酸序列组成,命名为nanA10,基因的核苷酸序列选自于序列1中的78263-77949碱基。
本发明还提供了一个编码环氧化物酶蛋白的核苷酸序列,由序列24中的氨基酸序列组成,命名为nanO,基因的核苷酸序列选自于序列1中79763-78327碱基。
本发明还提供了一个编码酮甾异构酶蛋白的核苷酸序列,由序列25中的氨基酸序列组成,命名为nanI,基因的核苷酸序列选自于序列1中80769-79828碱基。
本发明还提供了编码包括KS13,AT13聚酮合酶结构域的核苷酸序列,这些结构域由序列26中的氨基酸序列组成,命名为nanA9,基因的核苷酸序列选自于序列1中的83192-80784碱基。
本发明还提供了编码包括KS14,AT14,DH14,ER14,KR14,ACP14,CR聚酮合酶结构域的核苷酸序列,这些结构域由序列27中的氨基酸序列组成,命名为nanA11,基因的核苷酸序列选自于序列1中的89759-83196碱基。
本发明还提供了编码包括KS11,AT11,KR11,ACP11,KS12,AT12,DH12,KR12,ACP12聚酮合酶结构域的核苷酸序列,这些结构域由序列28中的氨基酸序列组成,命名为nanA8,基因的核苷酸序列选自于序列1中的100138-89771碱基。
本发明还提供了一个编码细胞色素P450蛋白的核苷酸序列,由序列29中的氨基酸序列组成,命名为nanP,基因的核苷酸序列选自于序列1中101467-100196碱基。
本发明还提供了一个编码转录调节因子蛋白的核苷酸序列,由序列30中的氨基酸序列组成,命名为nanR3,基因的核苷酸序列选自于序列1中101593-102600碱基。
本发明还提供了一个编码转录调节因子蛋白的核苷酸序列,由序列31中的氨基酸序列组成,命名为nanR4,基因的核苷酸序列选自于序列1中102600-103541碱基。
本发明还提供了以至少来自于序列1聚酮合酶序列中的一个片段与来自于其它聚酮合酶基因簇的序列来构建重组载体以获得新型聚酮合酶的途径。
本发明还提供了在基因工程微生物体中提高南昌霉素产量的途径。
本发明还提供了得到至少包含部分序列1中DNA序列的重组DNA载体的途径。
本发明还提供了产生南昌霉素生物合成基因被打断或加倍的微生物体的途径,至少其中之一的基因包含有序列1中的核苷酸序列。
序列1的互补序列可依据DNA碱基互补原则随时得到。序列1的核苷酸序列或部分核苷酸序列可以通过聚合酶链式反应(PCR)或用合适的限制性内切酶酶切相应的DNA或使用其它合适的技术得到。通过本发明所提供的核苷酸序列或部分核苷酸序列,可利用聚合酶链式反应(PCR)的方法或包含本发明序列的DNA作为探针进行Southern杂交的方法,从其它生物体得到与南昌霉素生物合成基因相似的基因。
包含本发明所提供核苷酸序列或至少部分序列的克隆基因或DNA片段可以通过打断南昌霉素生物合成的一个或几个步骤而得到新的南昌霉素衍生物。包含DNA片段或基因可以用来提高南昌霉素或其衍生物的产量。
包含本发明所提供核苷酸序列或至少部分序列的克隆DNA可用来从南昌链霉菌基因组文库中定位更多的文库质粒。这些文库质粒至少包含有本发明中的部分序列,也包含有南昌链霉菌基因组中以前邻近区域未克隆的DNA。
本发明所提供的核苷酸序列可以被修饰或突变。这些途径包括插入或置换,聚合酶链式反应,错误介导聚合酶链式反应,位点特异性突变,不同序列的重新连接,或通过紫外线或化学试剂。
本发明所提供的核苷酸序列可以通过序列的不同部分或其它来源的同源序列进行直接进化(DNA shuffling)。
通过缺失或失活来自于相同或不同聚酮合酶系统的一个或多个聚酮合酶结构域,模块或基因,或增加一个或多个聚酮合酶结构域,模块或基因而产生新的聚酮化合物。
包含本发明的序列或至少部分序列的克隆基因可以通过合适的表达系统在外源宿主中表达以得到修饰的聚酮合酶或更高的生物活性或更高的产量。这些外源宿主包括链霉菌,大肠杆菌,芽孢杆菌,酵母,植物和动物等。
包含本发明的核苷酸序列或至少部分序列的片段或结构域或模块或基因可以用来构建聚酮合酶库或聚酮合酶衍生库或组合库。
南昌霉素生物合成修饰基因的核苷酸序列提供了通过缺失或改造这些修饰基因而得到南昌霉素衍生物的途径。
含有本发明的核苷酸序列或至少部分序列的基因或基因簇可以在异源宿主中表达并通过DNA芯片技术了解它们在宿主代谢链中的功能。
包含本发明的氨基酸序列或至少部分序列的多肽可能在去除或替代某个或某些氨基酸之后仍有生物活性甚至有新的生物学活性,或者提高了产量或优化蛋白动力学特征或其它致力于得到的性质。
通过合适的技术缺失,连接本发明中的氨基酸序列可以得到新的蛋白或酶,进而产生新的聚酮或相关联的产物。
本发明所提供的氨基酸序列可以用来分离需要的蛋白质并可以用于抗体制备。
本发明所提供的氨基酸序列提供了预测聚酮合酶三维结构的可能。
本发明具有实质性特点和显著的进步,本发明所提供的基因及其蛋白质,抗体也可以用来查找和发展可用于医药,工业,农业的化合物或蛋白。
附图说明
图1南昌霉素化学结构
如图1,聚酮链的编号顺序是从左到右,粗线表示用PKS催化1个乙酸起始单位,4个乙酸和10个丙酸延伸单位形成的结构单位。结合在C-19上的4-O-甲基-L-rhodinose脱氧糖的编号是从1‘至6’。
图2南昌霉素生物合成基因簇的结构组成
如图2,灰色箭头标记的ORF表示I型PKS基因。负责乙酸加载延伸单位的结构域标记为ATa,负责加载丙酸延伸单位的结构域标记为ATp。黑色箭头表示修饰基因、脱氧糖生物合成基因、调节基因。
图3南昌霉素聚酮生物合成模型
如图3,黑色粗线表示I型PKS蛋白亚基,黑色细线表示I型PKS模块(module)。黑色箭头代表修饰基因。
图4南昌霉素脱氧糖(4-O-甲基-L-rhodinose)生物合成途径
如图4,nanG1,nanG2,nanG3,nanG4,nanG5,nanM分别代表葡萄糖-1-磷酸:TTP胸苷基转移酶,dTDP-D-葡萄糖-4,6-脱水酶,NDP-D-葡萄糖-3,4-脱水酶,NDP-D-葡萄糖-4,6-脱水酶、NDP-D-葡萄糖-4-异构酶、NDP-D-葡萄糖-4-还原酶,糖基转移酶,甲基转移酶。
具体实施方式
以下结合图1、图2、图3和图4对本发明进一步详细描述
本发明中的整个南昌霉素生物合成基因簇共30个基因,具体为:
(1)聚酮合酶基因,即nanA1,nanA2,nanA3,nanA4,nanA5,nanA6,nanA7,nanA8,nanA9,nanA10,nanA11共11个基因;
(2)南昌霉素的修饰基因,即nanE,nanI,nanO,nanP共4个基因;
(3)南昌霉素脱氧糖的生物合成基因,即nanG1,nanG2,nanG3,nanG4,nanG5,nanM共6个基因;
(4)南昌霉素的调节基因,即nanR1,nanR2,nanR3,nanR4,nanT1,nanT2,nanT3,nanT4,nanT5共9个基因。
聚酮合酶基因:
以下是编码催化南昌链霉菌NS3226中聚醚类离子载体抗生素南昌霉素聚酮糖苷配基生物合成所需的11个I型聚酮合酶开放读码框,即nanA1,nanA2,nanA3,nanA4,nanA5,nanA6,nanA7,nanA8,nanA9,nanA10,nanA11的核苷酸序列或互补序列及其相应的氨基酸序列和11个开放读码框所包含的I型聚酮合酶模块或结构域,即酮基合成酶结构域(KS)、酰基转移酶结构域(AT)、酮基还原酶结构域(KR)、脱水酶结构域(DH)、烯酰基还原酶结构域(ER)、酰基载体蛋白结构域(ACP)、链释放结构域(CR)的核苷酸序列或互补序列及其相应的氨基酸序列。
序列1中有11个基因(nanA1-nanA11)是用于编码聚酮合酶,其中共包含14个模块,共有74个结构域,负责催化南昌霉素聚酮糖苷配基的生物合成(如:图2,图3)。
nanA1包含两个模块,即加载模块和模块1。加载模块含三个结构域:KSQ,AT-L,ACP-L,负责聚酮链的起始合成,催化引入一个乙酸作为合成起始单位并最终形成南昌霉素C29-C30碳链骨架。模块1含五个结构域:KS1,AT1,DH1,KR1,ACP1,负责催化引入一个丙酸延伸单位最终形成南昌霉素的C27-C28碳链骨架,并在C28上带有一个甲基支链。各结构域的氨基酸位置如表1所示。
表1 聚酮合酶基因nanA1所包含的结构域及其氨基酸位置
模块 结构域 在序列9中氨基酸位置
KSQ 21-447
加载模块 AT-L 557-844
ACP-L 959-1024
KS1 1047-1478
AT1 1585-1891
模块1 DH1 1954-2148
KR1 2491-2658
ACP1 2768-2835
nanA2包含一个模块,即模块2。模块2含六个结构域:KS2,AT2,DH2,ER2,KR2,ACP2,负责催化引入一个丙酸延伸单位最终形成南昌霉素的C25-C26碳链骨架,并在C26上带有一个甲基支链。各结构域的氨基酸位置如表2所示。
表2 聚酮合酶基因nanA2所包含的结构域及其氨基酸位置
模块 结构域 在序列10中氨基酸位置
KS2 34-460
AT2 575-878
模块2 DH2 933-1121
ER2 1445-1752
KR2 1761-1933
ACP2 2055-2122
nanA3包含两个模块,即模块3和模块4。模块3含五个结构域:KS3,AT3,DH3,KR3,ACP3,负责催化引入一个乙酸延伸单位最终形成南昌霉素的C23-C24碳链骨架。模块4含六个结构域KS4,AT4,DH4,ER4,KR4,ACP4,负责催化引入一个丙酸延伸单位最终形成南昌霉素的C21-C22碳链骨架,并在C22上带有一个甲基支链。各结构域的氨基酸位置如表3所示。
表3 聚酮合酶基因nanA3所包含的结构域及其氨基酸位置
模块 结构域 在序列11中氨基酸位置
KS3 42-466
AT3 575-887
模块3 DH3 953-1158
KR3 1495-1659
ACP3 1775-1842
KS4 1873-2301
AT4 2405-2711
DH4 2762-2949
模块4 ER4 3289-3582
KR4 3591-3797
ACP4 3877-3942
nanA4包含两个模块,即模块5和模块6。模块3含五个结构域:KS5,AT5,DH5,KR5,ACP5,负责催化引入一个丙酸延伸单位最终形成南昌霉素的C19-C20碳链骨架,并在C20上带有一个甲基支链。模块6含六个结构域KS6,AT6,DH6,ER6,KR6,ACP6,负责催化引入一个乙酸延伸单位最终形成南昌霉素的C17-C18碳链骨架。各结构域的氨基酸位置如表4所示。
表4 聚酮合酶基因nanA4所包含的结构域及其氨基酸位置
模块 结构域 在序列12中氨基酸位置
KS5 35-460
AT5 596-902
模块5 DH5 954-1144
KR5 1474-1632
ACP5 1732-1797
KS6 1822-2249
AT6 2359-2660
DH6 2715-2899
模块6 ER6 3196-3505
KR6 3514-3666
ACP6 3802-3867
nanA5包含两个模块,即模块7和模块8。模块7含五个结构域:KS7,AT7,DH7,KR7,ACP7,负责催化引入一个丙酸延伸单位最终形成南昌霉素的C15-C16碳链骨架,并在C16上带有一个甲基支链。模块8含六个结构域KS8,AT8,DH8,ER8,KR8,ACP8,负责催化引入一个乙酸延伸单位最终形成南昌霉素的C13-C14碳链骨架。各结构域的氨基酸位置如表5所示。
表5 聚酮合酶基因nanA5所包含的结构域及其氨基酸位置
模块 结构域 在序列13中氨基酸位置
KS7 34-458
AT7 583-889
模块7 DH7 943-1135
KR7 1449-1632
ACP7 1729-1796
KS8 1820-2247
AT8 2353-2655
DH8 2713-2898
模块8 ER8 3256-3537
KR8 3542-3725
ACP8 3827-3894
nanA6包含一个模块,即模块9。模块9含四个结构域:KS9,AT9,KR9,ACP9,负责催化引入一个乙酸延伸单位最终形成南昌霉素的C11-C12碳链骨架。各结构域的氨基酸位置如表6所示。
表6 聚酮合酶基因nanA6所包含的结构域及其氨基酸位置
模块 结构域 在序列14中氨基酸位置
KS9 35-461
AT9 564-870
模块9 KR9 1209-1367
ACP9 1514-1579
nanA7包含一个模块,即模块10。模块10含四个结构域:KS10,AT10,KR10,ACP10,负责催化引入一个丙酸延伸单位最终形成南昌霉素的C9-C10碳链骨架,并在C10上带有一个甲基支链。各结构域的氨基酸位置如表7所示。
表7 聚酮合酶基因nanA7所包含的结构域及其氨基酸位置
模块 结构域 在序列21中氨基酸位置
KS10 35-461
AT10 566-873
模块10 KR10 1215-1372
ACP10 1499-1564
nanA8包含两个模块,即模块11和模块12。模块11含四个结构域:KS11,AT11,KR11,ACP11,负责催化引入一个丙酸延伸单位最终形成南昌霉素的C7-C8碳链骨架,并在C8上带有一个甲基支链。模块12含五个结构域KS12,AT12,DH12,KR12,ACP12,负责催化引入一个丙酸延伸单位最终形成南昌霉素的C5-C6碳链骨架,并在C6上带有一个甲基支链。各结构域的氨基酸位置如表8所示。
表8 聚酮合酶基因nanA8所包含的结构域及其氨基酸位置
模块 结构域 在序列28中氨基酸位置
KS11 48-474
AT11 579-880
模块11 KR11 1223-1401
ACP11 1506-1571
KS12 1598-2025
AT12 2132-2433
模块12 DH12 2499-2685
KR12 3014-3197
ACP12 3304-3371
nanA9包含一个模块,即模块13。模块13含两个结构域:KS13和AT13,与nanA10的ACP结构域一起负责催化引入一个丙酸延伸单位最终形成南昌霉素的C3-C4碳链骨架,并在C4上带有一个甲基支链。各结构域的氨基酸位置如表9所示。
表9 聚酮合酶基因nanA9所包含的结构域及其氨基酸位置
模块 结构域 在序列26中氨基酸位置
模块13 KS13 43-469
AT13 578-685
nanA10包含一个结构域:ACP,它的氨基酸位置如表10所示。
表10 聚酮合酶基因nanA10所包含的结构域及其氨基酸位置
结构域 在序列23中氨基酸位置
ACP 17-97
nanA11包含一个模块,即模块14。模块14含七个结构域:KS14,AT14,DH14,ER14,KR14,ACP14,CR,前六个结构域负责催化引入一个丙酸延伸单位最终形成南昌霉素的C1-C2碳链骨架,并在C2上带有一个甲基支链。最后一个靠近C末端的CR结构域,在南昌霉素聚酮链合成的最后负责催化聚酮链从聚酮合酶上释放下来。各结构域的氨基酸位置如表11所示。
表11 聚酮合酶基因nanA11所包含的结构域及其氨基酸位置
模块 结构域 在序列27中氨基酸位置
KS14 76-499
AT14 599-902
DH14 954-1137
模块14 ER14 1484-1763
KR14 1770-1953
ACP14 2048-2115
CR 2146-2225
南昌霉素的修饰基因:
以下是编码参与南昌霉素聚酮链修饰的4个开放读码框,即nanE,nanI,nanO,nanP的核苷酸序列或互补序列及其相应的氨基酸序列。
序列1有中存在四个负责南昌霉素修饰的基因,即nanO,nanE,nanI和nanP。nanO和nanE(如:图2),分别编环氧化物酶和环氧化物水解酶,在南昌霉素霉素的生物合成中,分别催化双烯中间体的环氧化和双环氧化物的开环。NanI催化激活的双键由E构型向Z构型的转化。NanO负责催化C-30位点上甲基基团的氧化。它们的核苷酸、氨基酸位置及其功能如表12所示。
表12 南昌霉素的修饰基因的核苷酸、氨基酸位置及其功能
基因 序列1中碱基的位置 相应的氨基酸序列 功能
nanE 77864-76992 序列22 环氧化物水解酶
nanI 80769-79828 序列25 酮甾异构酶
nanO 79763-78327 序列24 环氧化物酶
nanP 101467-100196 序列29 细胞色素P450
南昌霉素脱氧糖(4-O-甲基-L-rhodinose)的生物合成基因:
以下是编码参与南昌霉素生物合成中脱氧糖(4-O-甲基-L-rhodinose)的合成及转移的6个开放读码框,即nanG1,nanG2,nanG3,nanG4,nanG5,nanM的核苷酸序列或互补序列及其相应的氨基酸序列。
序列1中存在六个催化脱氧糖(4-O-甲基-L-rhodinose)的生物合成和转移的基因,即nanG1,nanG2,nanG3,nanG4,nanG5,nanM(如:图2)。从D-葡萄糖-1-磷酸开始,通过NanG1(葡萄糖-1-磷酸:TTP胸苷基转移酶)的作用将其转变成dTDP-D-葡萄糖,随后又在dTDP-D-葡萄糖-4,6-脱水酶(NanG2)催化下转变成dTDP-4-酮基-6-脱氧-D-葡萄糖,dTDP-4-酮基-6-脱氧-D-葡萄糖再经NanG3的催化转变成dTDP-4-酮基-2.6-双脱氧-D-葡萄糖,接着产生dTDP-D-cinerulose,dTDP-L-cinerulose和dTDP-L-rhodinose的等一连串反应,这些反应均被多功能酶NanG4催化。最后,NanM将完成L-rhodinose的4-O-甲基化并在糖基转移酶(NanG5)作用下将4-O-甲基-L-rhodinose连接于糖苷配基而形成成熟的南昌霉素。整个南昌霉素脱氧糖(4-O-甲基-L-rhodinose)生物合成途径见图4。它们的核苷酸、氨基酸位置及其功能如表13所示。
表13 南昌霉素脱氧糖生物合成基因的核苷酸、氨基酸位置及其功能
基因 序列1中碱基的 相应的氨基酸序 功能
位置 列
nanG1 72045-71137 序列20 葡萄糖-1-磷酸:TTP胸苷基
转移酶
nanG2 71140-70145 序列19 dTDP-D-葡萄糖-4,6-脱水酶
nanG3 68860-70164 序列18 NDP-D-葡萄糖-3,4-脱水酶
nanG4 67823-68863 序列17 NDP-D-葡萄糖-4,6-脱水酶,
NDP-D-葡萄糖-4-异构酶,
NDP-D-葡萄糖-4-还原酶
nanG5 66747-65365 序列15 糖基转移酶
nanM 66881-67798 序列16 基转移酶
南昌霉素的调节基因:
以下是编码参与南昌霉素生物合成调节的9个开放读码框,即nanR1,nanR2,nanR3,nanR4,nanT1,nanT2,nanT3,nanT4,nanT5的核苷酸序列或互补序列及其相应的氨基酸序列。
序列1中存在九个调节基因,即nanR1,nanR2,nanR3,nanR4,nanT1,nanT2,nanT3,nanT4,nanT5(如:图2),它们参与南昌霉素生物合成的调节。它们的核苷酸、氨基酸位置及其功能如表14所示。
表14 南昌霉素的调节基因的核苷酸、氨基酸位置及其功能
基因 序列1中碱基的位置 相应的氨基酸 功能
序列
nanR1 7493~6768 序列7 调节蛋白
nanR2 8334~7573 序列8 调节蛋白
nanR3 101593~102600 序列30 转录调节因子
nanR4 102600~103541 序列31 转录调节因子
nanT1 2052~682 序列2 膜整合型转移蛋白
nanT2 2064~2879 序列3 ABC转移因子
nanT3 3117~3818 序列4 双组分反馈调节因子
nanT4 4923~3724 序列5 化学受体
nanT5 6106~4916 序列6 组分传感器组氨酸激酶
序列1为南昌霉素生物合成基因簇共30个基因的核苷酸序列或互补序列,全长132554个碱基,包含11个用于编码聚酮合酶的基因(nanA1-nanA11),4个参与南昌霉素生物合成修饰的基因(nanE,nanI,nanO,nanP),6个参与脱氧糖(4-O-甲基-L-rhodinose)的合成及转移的基因(nanG1,nanG2,nanG3,nanG4,nanG5,nanM)和9个调节基因(nanR1,nanR2,nanR3,nanR4,nanT1,nanT2,nanT3,nanT4,nanT5)。南昌霉素生物合成基因簇的结构组成见。
序列2为nanT1基因(序列1中2052-682碱基)编码的膜整合型转移蛋白(NanT1)的氨基酸序列。
序列3为nanT2基因(序列1中2064-2879碱基)编码的ABC转移因子(NanT2)的氨基酸序列。
序列4为nanT3基因(序列1中3117-3818碱基)编码的双组分反馈调节因子(NanT3)的氨基酸序列。
序列5为nanT4基因(序列1中4923-3724碱基)编码的化学受体蛋白(NanT4)的氨基酸序列。
序列6为nanT5基因(序列1中6106-4916碱基)编码的双组分传感器组氨酸激酶(NanT5)的氨基酸序列。
序列7为nanR1基因序列1中7493-6768碱基)编码的调节蛋白(NanR1)的氨基酸序列。
序列8为nanR2基因(序列1中8334-7573碱基)编码的调节蛋白(NanR2)的氨基酸序列。
序列9为nanA1基因(序列1中8919-17627碱基)编码的I型聚酮合酶(NanA1)的氨基酸序列。
序列10为nanA2基因(序列1中17642-24313碱基)编码的I型聚酮合酶(NanA2)的氨基酸序列。
序列11为nanA3基因(序列1中24310-36408碱基)编码的I型聚酮合酶(NanA3)的氨基酸序列。
序列12为nanA4基因(序列1中36249-48299碱基)编码的I型聚酮合酶(NanA4)的氨基酸序列。
序列13为nanA5基因(序列1中48345-60284碱基)编码的I型聚酮合酶(NanA5)的氨基酸序列。
序列14为nanA6基因(序列1中60305-65302碱基)编码的I型聚酮合酶(NanA6)的氨基酸序列。
序列15为nanG5基因(序列1中66747-65365碱基)编码的糖基转移酶(NanG5)的氨基酸序列。
序列16为nanM基因(序列1中66881-67798碱基)编码的甲基转移酶(NanM)的氨基酸序列。
序列17为nanG4基因(序列1中67823-68863碱基)编码的NDP-D-葡萄糖-4,6-脱水酶,NDP-D-葡萄糖-4-异构酶,NDP-D-葡萄糖-4-还原酶(NanG4)的氨基酸序列。
序列18为nanG3基因(序列1中68860-70164碱基)编码的NDP-D-葡萄糖-3,4-脱水酶(NanG3)的氨基酸序列。
序列19为nanG2基因(序列1中71140-70145碱基)编码的dTDP-D-葡萄糖-4,6-脱水酶(NanG2)的氨基酸序列。
序列20为nanG1基因(序列1中72045-71137碱基)编码的葡萄糖-1-磷酸:TTP胸苷基转移酶(NanG1)的氨基酸序列。
序列21为nanA7基因(序列1中76990-72050碱基)编码的I型聚酮合酶(NanA7)的氨基酸序列。
序列22为nanE基因(序列1中77864-76992碱基)编码的环氧化物水解酶(NanE)的氨基酸序列。
序列23为nanA10基因(序列1中78263-77949碱基)编码的酰基载体蛋白(NanA10)的氨基酸序列。
序列24为nanO基因(序列1中79763-78327碱基)编码的环氧化物酶(NanO)的氨基酸序列。
序列25为nanI基因(序列1中80769-79828碱基)编码的酮甾异构酶(NanI)的氨基酸序列。
序列26为nanA9基因(序列1中83192-80784碱基)编码的I型聚酮合酶(NanA9)的氨基酸序列。
序列27为nanA11基因(序列1中89759-83196碱基)编码的I型聚酮合酶(NanA11)的氨基酸序列。
序列28为nanA8基因(序列1中100138-89771碱基)编码的I型聚酮合酶(NanA8)的氨基酸序列。
序列29为nanP基因(序列1中101467-100196碱基)编码的细胞色素P450(NanP)的氨基酸序列。
序列30为nanR3基因(序列1中101593-102600碱基)编码的转录调节因子(NanR3)的氨基酸序列。
序列31为nanR4基因(序列1中102600-103541碱基)编码的转录调节因子(NanR4)的氨基酸序列。
以下进一步提供应用实例,这些实例只是阐明了得到和应用本发明所提供序列和要素的优选的途径,仅用做说明而并不限制本发明的应用范围。
应用实例1:
缺失南昌霉素生物合成基因簇中所有基因
缺失序列1中从nanT1序列中的一个BamHI位点(1035bp)到南昌霉素生物合成基因簇之外的另一个BamHI位点(125077bp)之间124042个碱基。它包括了整个南昌霉素生物合成基因簇。用于同源交换的两个大小分别为8.4kb和4.3kb天然同向片段分别来自于科斯质粒2G2和2F7,在这两个片段中间插入一个阿泊拉霉素抗性基因构建成基因置换载体pHZ1586。通过结合转移将pHZ1586导入南昌链霉菌NS3226并筛选得到基因置换菌株SYH-8,通过高压脉冲电泳和分子杂交验证了基因置换菌株的正确性,经过生物学活性和高效液相色谱分析证明了所得到的基因置换菌株中南昌霉素生物合成被阻断。此基因置换证明分离到的序列(序列1)确实是负责南昌霉素的生物合成。
这提供了一个通过完全敲除南昌链霉菌中南昌霉素的生物合成基因簇,以减少底物和能量竞争达到提高同类型其它抗生素产量的途径。还提供了一种获得染色体上大片段DNA敲除的技术手段。
应用实例2:
缺失南昌霉素部分聚酮合酶基因
缺失序列1中从nanA3序列中的一个BamHI位点(30332bp)到nanA5序列中的另一个BamHI位点(57743bp)之间27411个碱基。它包括了聚酮合酶基因nanA3中的部分KS4,AT4,DH4,ER4,KR4和ACP4结构域;包括了聚酮合酶基因nanA4;包括了聚酮合酶基因nanA5中的KS7,AT7,DH7,KR7,KS8,AT8,DH8结构域。用于同源交换的两个大小分别为4.8kb和5.8kb天然同向片段来自于科斯质粒11A8,在这两个片段中间插入一个阿泊拉霉素抗性基因构建成基因置换载体pHZ1553。通过结合转移将pHZ1553导入南昌链霉菌NS3226并筛选得到基因置换菌株SYH-5,通过分子杂交验证了基因置换菌株的正确性,经过生物学活性和高效液相色谱分析证明了所得到的基因置换菌株中南昌霉素生物合成被阻断。此基因置换证明这些序列对南昌霉素的生物合成是必需的。
这提供了一个通过置换南昌霉素生物合成基因簇中部分结构域而获得相应结构缩短了的新的南昌霉素衍生物。
应用实例3:
缺失南昌霉素脱氧糖生物合成基因、修饰基因和部分聚酮合酶基因
缺失序列1中从nanA5序列中的一个BamHI位点(57743bp)到nanA11序列中的另一个BamHI位点(83968bp)之间26225个碱基。它包括了聚酮合酶基因nanA5中的ER8,KR8和ACP8结构域;包括了六个催化脱氧糖(4-O-甲基-L-rhodinose)的生物合成和转移的基因,即nanG1,nanG2,nanG3,nanG4,nanG5,nanM;包括了三个负责南昌霉素修饰的基因,即nanO,nanE,nanI;包括了聚酮合酶基因nanA10;包括了聚酮合酶基因nanA11中的ACP14和CR结构域。用于同源交换的两个大小分别为4.5kb和7.3kb天然同向片段来自于科斯质粒18A2,在这两个片段中间插入一个阿泊拉霉素抗性基因构建成基因置换载体pHZ1581。通过结合转移将pHZ1581导入南昌链霉菌NS3226并筛选得到基因置换菌株SYH-1,通过分子杂交验证了基因置换菌株的正确性,经过生物学活性和高效液相色谱分析证明了所得到的基因置换菌株中南昌霉素生物合成被阻断。此基因置换证明这些序列对南昌霉素的生物合成是必需的。
这提供了一个通过改造南昌链霉菌中修饰酶基因、糖生物合成基因而得到相应结构改变的新的抗生素的途径。
以下根据本发明的内容提供基因序列:
序列列表:
SEQUENCE LISTING
<110>上海交通大学
江西农业大学
<120>南昌霉素生物合成基因簇
<160>31
<170>PatentIn version 3.1
<210>1
<211>132544
<212>DNA
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
tccggctgta cccgatgccg gtcggcgcgt ctcccggggc ggtgccgggg tcggcctggc 60
cgacggcggc cagccagagg gtggcgtcca gggctcgggc ccgtaccagc agctcccact 120
ggtcgagctt gccgggcccg gctccccagg aggccgccag cagtgtcgcg gccgccccgg 180
cgtccgcatg tgcccggaac agctccggga agcggacgtc gtagcaggtg gccagggaca 240
gccgtacgcc gtccacctcc atgacggtgg tccgcgagcc cgcggcgacg gtgtcggact 300
cgcggtagcc gaaggcgtcg tagaggtgga tcttgtcgta cgacgtctcg acgccggggc 360
cggtggccag cagggtgttg gtcaccttgc cgccgtccgc cggggtgaac atgccggcca 420
cgatgacggt gtcggtggcc ctggcgatct cccgtactcc gtcggcccac ggtccgttca 480
ggggttcggc gagcgctgcc agtggggtgc cgaagcaggc catcgaggcc tcggggaaga 540
cgacgacacg ggctccggcg tccgcggcgc gctgtgccca ttcctcgatg agccgcaggt 600
tcttcccggg gtccgggccg gtggtgagct ggctcagggc gatccgcata tttcctcctc 660
agtgggtcgg actgcagtgg gtcagacggc agtgggtcgg gcggcgtcgg ccgtggcctg 720
cgcggccggt gtggcgagct gggcgtcctc gtcgacggcg gcttcgggat cgcggcccag 780
gagcaccccc gcgagggaga cggccgcggc gacggcgatg tagagggcca gcggcagcca 840
gctgccgtag gcgctcagca gggtggtgaa cagcagcggg gcgatggcgc cgccgatgat 900
cccggccagg gtgtacgcca gcgatgagcc ggtgtagcgc agccggggcg agaactgctc 960
ggcgatgaag gcggcctgcg gcccgtacag gaaggagtgg atcacgaggc ccaccacgac 1020
gccgagcgcc agcgggatcc agctgccgcc gccgatcatc gggaagaaca ggaacggcca 1080
cacaccggcg gccaccgcgg cacagccgta cagcactcga cggttgatcc ggtcggagac 1140
ggcaccggcc agcggcatca ggaagacctg cagcgaggag ccgatgagga cggccgcgag 1200
ggcggaaccg cgcgacatgc ccagttctcc ggtggcgtac gtcaggacga agacggtgaa 1260
catggcgtac agcacgtcgg ggccgacgcg gcagaggatc gcggcgagca gggcgcgggg 1320
ctgggtggtg aagacctcgc ggatcggcgc ctcgggccgg gactgctcgg cctccatcgc 1380
cttgaagacc ggggtctcct ccagcttcgc cctgatccac aggccgaagg cgacgagggc 1440
gccggagagc aggaaggcca cgcgccagcc ccatgcctcg aactgcgcct cggtcagcag 1500
cgcgccgagg gcggccagga cgccgttggc caggaggttg cccgcgggcg ggcccacctg 1560
cgcggcggag gcgtagaaac cgcggcggcg cgagtcgccg aactcgctgg acagcagcac 1620
cgcgccgccc cactcgccgc cgacgcccac gccctgcgcg aaccgcagca cgaccagggc 1680
gatcggggcg gcgaccccga tcgtggagta cgtgggcagg acgccgatca ggaacgtggc 1740
cgcgccgatc aggaccagtg tggcgatcag caccttcttg cggccgatca catcgccgag 1800
gcgaccgaag acgatcccgc ccagcgggcg cgagacgtag ccgaccgcgt atgtggagaa 1860
ggccagcagc gtgccggtga gcgggtcctc cgaagggaag aagagatcgc cgaagaccag 1920
ggccgccgcg gcggagtaga cggcgaagtc gtaccactcc agggctgtgc cggtcaggct 1980
cgcgacaaag gcccgccgga cgccggagcg cctctcggac gccctcggat cgtccttggt 2040
ggtgtcgtgc atggttccca tccttgctcg ggggacatcg accttccggc atacttcttg 2100
tatacaggtc gtatgcgcaa gccctgcgca agaagttaac cgcggagaaa tccaccaccg 2160
cggaccgtca cccgaactct gggagtgccc ccatggcccg cctgaccttc gaactgccgg 2220
acggatcgac acgcgaggtc gacatcgtcc aggtcctcaa cgccgggtac gccgggcgca 2280
gccaggacga cgtcgccgca cacatcgccg aactcgccga gctcggcgta cccaccccct 2340
ccgtgacccc ggccctctac cccgtcgccc cctacctcgc ccagcagatc gaccgcgtcg 2400
cggtccagca ccggcgcacc tccggcgagg ccgagtgggc gctcgtcgtg gcgggcgacg 2460
gcgagctgct gctcacggcc gcctgcgacc acaccgaccg cgacctcgag gtgcacggcg 2520
tggcctggag caagaacgcc ggccccgacg tcctcgcccg ccgtgcctgg cggctcgccg 2580
acgtcgaacc acgtctggac gacctcaccc tgcgcgcctg ggtcacccgc gacggcaccg 2640
agaccgagat ccagcacggc accctcgccg aactcctcac ccccgcctac tgggtcgacg 2700
tcctgcgctc ccgcgacgcc ctcaccccag gcaccgtcct catctccggc accatcccca 2760
tgaccccggg tgtcgaccag ttcgccgaca cctggcgcgt ggaactgggc gaccccgcca 2820
ccggcgacac catccgcctc gcctacgacg tgcaccccat gccggaaccc atcggctgac 2880
gagccgctgc ggccgccgcg cgaagctgcg gccgcaggac cacacacgac ccggacatat 2940
gttcactttg tcgaacatgc gttcattaac ggaatctttc gccgcacttc cgcatccgac 3000
gtgtgaaacc cggggagtct ttgagcagga tcggctccga ttgcacggga accccaagca 3060
tgacgtgaag catcccagca agcaggtcca ttccccaaaa ctcggggcac gtcagtttgt 3120
ctactcgttg tacaggttct ctcacgaatg ggtgggtgag tgtggagtta ggaattcccc 3180
tcagcgtcgt gttggtcgat gatcaccccg tggtgcgcgc gggcatctcg gcgtggtgtg 3240
ccgccgccga cgcaccgatc agcgtggtcg ccgagggggc gaacgtgtcg gtggccctcc 3300
atggtccggg ccggggggcg gatgtcgtgg tcatggatct gttgttgcag aacgggcggc 3360
ccgcgtacga cgagttgcag gagctcgtgg cccaggagcg caaggtcgtc gtctacacca 3420
tgcgggacag ccaggatgcg gcgttgacct gcatggatct gggctcggct acctacatca 3480
ccaaggcgga ggggcagcgg cacctggtca aggcgatccg ggccgccgcc gaggacatcc 3540
cctatacacc cccgtccctg gccggcgcgt tcggcagcga cacccggcag agccgccccg 3600
tgctgtccgt acgggaggtg gaggtgctgg tggagtggtt ccagtccgag tcgaaggcgg 3660
tggtcgccca gagcctgggg atctcggagc gcacggtgaa cacctatctg gaccgggtgc 3720
gcatcaagta cgcgaacgcc ggccggccgg cgaccacgaa ggccaagctg gtggccaggg 3780
cggttcagga cggtctgata gctctggacg agctgtgatc cggcaggcgg cctccacgag 3840
gagacggtcc tcggcgtgga ccgtgcgtac ccggatcccg ttcgccttcg gctcgggcag 3900
ccggtcccgc agcgcgtcca ccacgacgcc cacccgcacc tgatccccgc cgcggaccac 3960
ggtgacgcgc gccgtctgcc cggcggcggc cagcgcggcg atgaccgggt ccaggagcgc 4020
gcggcgcagg ggcaggggca gttcgagggg gtcgccgcgg acggcgagtt gcacggtggt 4080
cccgttgcgc tcggcgacgt cgatacaggc gcgcatctcg tggaccagcg gatcgaacgt 4140
acagtcgctc tccgcgaaca gccgcctcag ccgggcggct tccagcgcgc aggcgtgccg 4200
cacctcgtcg tcctccgggt cgagggagcc gctggcgagc ccggtgagca gggggagcac 4260
ggtggcgttc agctcggcgt accggcggcg ctggtccgcc tggatctgcc gggcgactgc 4320
gatacgggtg cgtacccgct cctgggcccg cagggcctgg ccgatggccg cgctgctgtc 4380
gaggagcagc ttggcgccgg cggcgatggt catctggaag cagccggcgg acagggcgct 4440
cagggccatg ctcgccccga tggaccgcga gggcacgtcg acggtcagca acagggcggc 4500
ggtgacggcc agatgagcgc cgaggaaggc gccggacacc agcggtcctc ggtggccgag 4560
caggacgacg gtgaaccagc ccgcgagcga gtaggaccag tgcaaggagc tgacgaagta 4620
gtggtccgcc gggacctggc tggtggtggc caccgaggcg acgaaggtgc cggccagcgc 4680
gcatggcacc cagccgcgcg gccaggggcg cccgtcgagg ccgagaacca cgctgccgcc 4740
gaggatcagc gtgagcaggc cgtaggcggc cagggggatc caggggtgcc ggtattcgcc 4800
caggccgccc agcaggccgg gcagcgcgga cccgatgtgc agagtgacca ggatgaccag 4860
ccccccggcg cggatgccac ggagctggtg acgggtgatc agctgccgga agtcctcacc 4920
cacgcggcca ctccagccgc accgtcgtac cgtggccggg ccgggactcg acggtggcgc 4980
ggccgcccac gttccacatc cggccgtgga tcgaacgctt gatcccgatc cggtggcctg 5040
gaatgtgctc cgggtcgaac cccaccccgc tgtcggcgat ggtcacgacg atgacatggt 5100
ccacgcgctc cgccgtgacc cgtacggcat cgacgccagc gtaccgggcc acgttgcgca 5160
gcgcctcgcc gaggccgccg cgcagggctg ccgcgatggg gcgccaggtc gtgcccagtt 5220
ccacgtcgaa cctgttgatg acgcggaggg gatgcgcggc gatctcgtcc agcagttcgg 5280
cggccaggtc cacctccccc gccaccggct gctcggcgcg cagccggagg agatctctgg 5340
cggcctgggc gcgcagcgtc tccgcccgca tcgacgccca tggtgccgac gcgatcagca 5400
gggtcgcgca ggcggtgtcg tggagcgcgg ccaggtgttc ccgttccgcg gcgcagcggg 5460
cctccgccgc ctccccctgc tgccgcgccc gcgcggcccg gtcggcggcc tcgtcggccg 5520
cccggctctc gctgaacacc agccggacca cggcccgggc cagggcggtg ttcaccacca 5580
cccagatgac actgcgcagc acacctgacc actcgaggtt cgggtgcaac gcggtgccca 5640
ccacatccgc gcagcacagc cataacgtgg ccagaatccc ggtgagcggc ggctgcatca 5700
gctggtacgc cacggccgtg aagctgacgg acaccaggat ccaggtgccg gcatgcgtgg 5760
tctgcgcctc ggggatcgtg aaccgctggc tcagacacag cgccgtcatg accgtcagat 5820
ccagggcgag cagcggcctc tgccgcatcg ccgaggtggt ggcgagccgc agatggatcc 5880
agctccaggc gagtacggcc gccgcgacca ccgccgcggg cgccatctcg cccatcggca 5940
tggccgccac gcccagcacg gcgcaggcgc tcatcacccc ggcgcggccg tagagcgcca 6000
accgcctgtt gaacgcgacg tattcgcgcg cccaggggcc ggccgagccg ggaaggcgcg 6060
ccgaggccgg cggatcgata ccgggcgaca gtgagctaat cgacatgggt catccatcct 6120
cgacggccgt cggtgtgacg gggcgatagc ggaatcctca cgatccgtga gggaattgat 6180
gcctttttgt cggtcgaacg accctcacac cggatgaccc ggcgcctcta ggctggaccg 6240
agcgcagtgg cttaggggga gcagatcagg acggtctgga agcccaccca tgtcgggccc 6300
ccacaagccc tttgtgagca agaagtatgt cgcccatcgg tgcgctttcc gcctcatacc 6360
aaccttctct tgggtgtata gcgaagggaa tgcagccttt tatgtccgca tctgcgccca 6420
acgtcatcct gcgcggcggc cccacggagc tggccgatcc gcacacgcgg cacgtcgatg 6480
acaccgagtc gaccctgaag cttctcatcg gcaacgcgta cgaacacttc gaacccacag 6540
gccagttcgt ggagcaccac ggacagcatc tgcgggtctt cgagtggatc cgccggacct 6600
acgtcgccga gtaatcagcc aagtcagcca agtggccaac gggtcgattg gccaacaagc 6660
ggccaaagcc cgcaagggcc cggaacatcg ccggtgggtt cactcgcggc atgcgaggaa 6720
acccccggcg agctcgggcc caggtggcgt ccgcctggtg gattccgtca ggtcgaggcg 6780
agcaccgtgt gccggagctc cctgagcttc cggccgggtt cgagccccag ttccctgtcg 6840
agaacacgcc gggcggactc gtatatgcca agcgcctctg acttctggtc ggagcggtag 6900
agggccagca tcagctgccg atagaacgtc tcccggagcg ggaactcggc gaccagggaa 6960
cgtagtcggc cgacgagttc ccggtggcgg ccgagttcaa gatgcgcctc ggccgacatt 7020
tccgcacatt cgatgcgcat ctcgtccagc cgcttcagaa agccggccac gatcggcccg 7080
gtctggacgt tgccgaacgc cgggccccgc cacagcgcca gggcctgctc gaaacagagg 7140
gccgcctgcg ccggctcccc gtatctgacg cacttccggc cgtccgcgac cagctgctgg 7200
aagacatcga aatcgcattc gctggacgct ctgcgcagca tgtatccacc ggggcgggtg 7260
atgatgggat tctgctggcc cggccggctc aggaacttgc gaagccgcga aatgcagacg 7320
tagatgctcg cggtgtcgcg gcgcggcggc agctcgcccc agatctctct ggtgagctgt 7380
tccgccatca caacctgatc cgagcggatc agcaggacgg cgagaacggc agcgagtttc 7440
tgcgctctga gcggtgaacc gcccttttcg tcgactacac ggagaggtcc caaaatctcg 7500
taccgcacga attaaccccc tgctctgcta ttcctggata cgtgacgagt cggcggtgcg 7560
cacgggattc ggctatgcgg cctcgagttc cggggcgtcg acgagcacgg cgtgatgcaa 7620
ccgcttgaga gcggcggacg gctccagccc gagttcctcg ctcagcacgg cccgcgccga 7680
ctggtagaca tgcagggcgt cggccctgca gcccgagcgg tagagcgcca gcatgagttg 7740
ccggtagaac gcttcgttca gcggaagctc ctctatccgg gcatagaggt agctgacgaa 7800
ttcacggtgc cggccgagtt ccagtccggc ttccacgaac gactcgacgc attccagccg 7860
ggcctgctcg gcccagcata cgaagccgtt gacgatcgcc ccgtcgcgca ggtcttcgag 7920
gaccggcccc cgccagaact caagaccctc ctcgtaggcc tcgcaggcct catgggtgcg 7980
cccggcccgc tgtgcggccc gcccctggcg caccaggccc tggaagacat ggaagtccag 8040
ctcgtcgtcg cccagtcgca gcacataccc gctgcgagcg gtgacgatgg ggctctcggt 8100
gcggttgggg cgcctgagga acttgcggag ttgggacaca tacacatgga gcgccgccac 8160
ggcctgccgc ggtgcccggg taccccagat ctccttcacg agatcttcgc tgctcaccac 8220
ctcgtcggcc cgcacgagta atgcggccag cagcacctcg atcttccggg cgctaattga 8280
ggtgaccgaa gatcggtcaa ccacgcgcaa cgggcccagt atctcgtacc tcatgatacg 8340
cgaactcccc ccgttgccag cgtcttgtcg aaccaggagg gcgctccatt gtgatcgaat 8400
tcagccagca ttcgaaaaat tcaccccgat tttggaagcc tttttggtcg cgcttcagac 8460
ctcagcgtcg ttgcctaccc acgccaccaa cagcttagcc agatcgcggc cgcgtccgcg 8520
gatttcacac ggcggcaaca ttgcctcggt gagattttcg gtcaggcggg aggtggtctt 8580
ctggccatgg agatcgacgg cgttgctctg cgatccagtg atcgcagaga gggcaacatc 8640
aacgcatcct aaccgtgcgg cgcgcgccag tgtgagggtc aaaagaccca caaccaaatg 8700
ggcacatatc aggattcgcc gatcagtgac cgtcaccgta ccctgctgtt tcggcgacgg 8760
tcccccgcag gtcaccgatc cgatccgaag gatcgtcagg gcaagtttcg cctggcgtat 8820
acaaggcgct ccctagacat tctcggttag atcaacttta gaaaccattc ggagaatgag 8880
cgctgttcgg ggcatctcat aagccataac cagcggtttt gcgattcagg tatgggggtg 8940
tcatggccgg gagcagcccg agccacgcgc agcaggcgac gtctcccgtc gcgatcgtgg 9000
ggctggcctg ccggctgccc ggcgcacccg atcccgaggc gttctggcgc ctgctgcgcg 9060
cgggcgagaa cgccgtcgtg cccgtcccgg acagccgcct gccgaccgag cccggttccc 9120
cgccgtactt cgccggactt ctggagcacg ttgacacgtt cgacgccgga ttcttcggca 9180
tctcgccacg cgaggccacg gcgatggacc cgcagcaacg gctgatgctc gaactggcgt 9240
gggaggccat ggaggacgcg gggctcggcc ccaagaacct ggcggagcgc cggaccgcgg 9300
tgttcaccgg cgcgatctgg gacgactacg ccacgctcct ccaccggcgc cgccccaacg 9360
acatcgccat cacgcggcac acgatggcgg ggctgcaccg cggcctcatc gccaatcggg 9420
tctcccacct gctgggcctg cgcggaccga gcctcacggt ggacgcggcc cagtcctccg 9480
ggctcgtcgc cgtgcacctg gcctgcgaga gcctgcgccg gggcgaggcc gacctggcgc 9540
tcgcgggcgg ggtgaacctg atcctggccg aggagagcat gcgtatggcc gaggcccagt 9600
tccagggcct ctcgccggac ggccggtgct acaccttcga cgcgcgcgcc aacggcttcg 9660
tgcgcggcga gggcggcgga atggtgctgc tcaaaccgct ggccgcggcc gtcgccgacg 9720
gcgacccggt gtactgcgtg atcgagggca gcgcggtcaa caacgacggc gccaccgacg 9780
gcctgacccg gccgagcgcc gacgcccaga cggatgtggt gcggcaggcg tggcagcggg 9840
ccgcggtgtc gcccgccgaa ctccagtacg tcgaactgca cggcaccggt acgccggtcg 9900
gcgatccgat cgaagccgcc gcgctcggtg acgcgctcgg tgacgcattc gatgggtgcc 9960
atcgggaccg tgcggtggac gcgccgctgc gggtcggctc ggtcaagacc aacgtgggcc 10020
atctggaggc cgccgccggt atcgcgggcc tgctgaagac ggccctgagc atccaccacc 10080
ggcgacttcc gccgagtctc aacttcgcca ccccgaaccc cggcatcccg ctggccgaac 10140
tcggcctgcg cgtacagacc gcgttcgggc cctggcccga cgagcggcgg cggctcacgg 10200
ccggggtcag ctccttcggc atgggcggta ccaactgcca tgtcgtcctg gccgaaccgc 10260
ccgccccggc cgtgctgccg gaccggccca cggcgcccgg ggacgtgtcc gcgcgcaccg 10320
ccccgccggt catgccctgg gtcgtatcgg cggcgagtcc gaaggccctc acggcccagg 10380
ccgccgcgct gtacgaacac ctgcgcgcac accccgggct ccacccggtg gatatcggcc 10440
acgcgcttgc cacgacccgt acggcgttcc cgcaccgagc cgtcgtcctc ggcagggacg 10500
aagacgaact ggtaagccga ctcgacgcgt tggcctccga aacacagaca tcgggagtga 10560
tccgtggccg cgcgggcggg ggccgggtcg cgttcctgtt cagcggccag ggcagccagc 10620
gccccggcat gggccgcgaa ctgtacgcgg cgtacccggt cttcgccgac gcgctgcgcg 10680
aggtgtgcgc gcacctggac cccatgctcg acaccgacac cccgctgctg gacctcatgt 10740
tcgccgaggc cccaccggac ggcgagccgc cgctgaaccg gaccgcgtac acccagcccg 10800
cgctcttcgc catcgaggtg gccctctacc gcctggtgac ctcgtggggc gtgacccccg 10860
accacctgat gggccactcg gtcggcgaga tcacggccgc gcatgtggcc ggggtgctgt 10920
cgctgcccga cgcctgcacg ctggtggcgg cccggggccg cctcatgcag tccatcaccg 10980
cccccggggc gatggccgcc tggcaggcca ccgccgagga ggccgggcag gcgctggagg 11040
cgtacggcgg acgggtgggc ctcgcggccg tcaacgcacc cgcctccgtg gtgatctccg 11100
gcgaccgcga ggccgtggcg gaggcgaccg ccgcctggcg ggcgcggggc cgcaaggcca 11160
ccgtgctcaa ggtcagccac gccttccact ccccgcacct cgacggcatc ctcggggacc 11220
tgcgcacggt cgcggccggg ctgacattcg ccgcgcccgc catccccgtc gtatccaatc 11280
tgaccggcgg ggcggccacc gaggcccagt tgcgctcgcc cgactactgg gccgaccacg 11340
cccggcaggc cgtacggttc gacgccgggg tacggcacct gtgcgacgcg ggggtcgaca 11400
ccttcctcga actcggcccc gacgcctcgt tgaccggcat ggcgcgggag agcgccgcgg 11460
catgggccgg cgacgcgccc cgcccggtgg cggtcgcggt gcagcgccgc ggccgaccgg 11520
aggcgcagag cttcgtgtcg gcgatggccc aggcgcacgt acggggcgtc ggcgtcgact 11580
gggcggccgc gttcgcgggc cacgagaccc ggcgggcgcc cctgccgacg tacgccttcc 11640
agcgcgatcg ccactggcc gacgggctgg acgagcgggg ggcgaggcgg ccgagtacct 11700
cgccggtcgt gccgtcaacc gaccgtgagc cggtcgtcgt tgacgcgtcc cccgccgatc 11760
gcgcggcctc cccgggcgag ctgctggcgc tggtacgtac ccacgccgcc ctcgtactcg 11820
gacacaacag ccccgacggc atcgacccgg ccctgacctt caaacagctc ggcttcgact 11880
cgctggccgc cacggagctg agcgaacgcc tgagcgcggc cacggacacc gaactccccg 11940
ccaccctcac cttcgaccac ccgacgccga acgccgtggc ggcgtggctg cgcgcggcgc 12000
acgaggggca gcccaccgcc gccccgacgg cggcgacagg gccgtccatg gccgaggacc 12060
ccgtcgccgt ggtggccgtc agctgccgct accccggcgg ggtcgagtcg ggcgaggcgc 12120
tgtggcgcct ggtggacgaa ggggtcgacg ccgtgggcga gttccccggc gaccggggct 12180
gggatctggc ggagctgttc ggccgggcgc cggacgggag cggcggcagc gcgacgggcc 12240
ggggcggctt tctgtacggc gccggggact tcgacgcgga gttcttcggg atcagcccgc 12300
gcgaggcgct ggccatggat ccgcagcagc gcatcctgct cgaactctcc tgggaactgc 12360
tggaacgggc cgggatcccc ccggcctcgc tggccggcag cgcgaccggc gtctacgtcg 12420
gggcgacggc ggtggactac gggccgcggc tgcacgaggc caccgccgag ctcgacggac 12480
acctgctgac cggctccacg ccgagcgtgg cgtccggccg ggtggcctac gccctcggcc 12540
tggagggtcc ggcgctcacg gtggacacgg cgtgttcgtc gtcgctggtg gcgatgcacc 12600
tggccgcgca ggcgctgcgg cagggcgaat gcgatctggc gctggcgggc ggtgtgacgg 12660
tgatggccac accgggcatg ttcacctcct tctcccggca gcgcgggctg gcgcctgacg 12720
ggcggtgcaa gccgttcgcc gccgccgccg acggcacggg gtggagcgag ggcgccgggc 12780
tcgtcctgct cgaacgcctc tccgacgccc gccgcaacgg ccaccaggtg ctcgccgtga 12840
tccgcggctc ggcggtcaac caggacggcg cgagcaacgg cctgtccgcc cccaacggcc 12900
cctcccagca acgcgtcatc cgccaggccc tggccaacgc ccgcctcgaa cccgcggacg 12960
tggacgcggt cgaggcccat ggcacgggca cgacgctggg cgaccccatc gaggcccagg 13020
ccctcctggc cacgtacggc gggcaacgca ccgacgaccg gccgctgtgg ctgggctcca 13080
tcaagtcgaa catcggccac acccaggccg ccgccggcgt cgcgggcgtc atcaagatgg 13140
tgatggccct gcgccacggc cgcctcccgg ccagcctcca catcgacgcc cccagcccgc 13200
atatcgactg gtccgacggt acggtcaggc tcctcagcga gcccgtcgac tggcccggga 13260
ccgactggcc cggatccgac cggccccgcc gcgccgccgt ctcctccttc ggcatctccg 13320
gcaccaacgc ccacctcatc ctcgaacagg cacccgacca ccccgaaccg gagcccacca 13380
cctcgggcgg cgtggtgccc tgggtgctgt cggcccgtac cgccgacgcg ctccgcgccc 13440
aggccggccg cctggccgag tgggtcaccg ctggcgcccc gcgctccccg gcctccccga 13500
cctccccggc ctccccggcg gacgtggggt ggtcgctggc caccacgcgg tcggcggacc 13560
gccaccgagc cgtggttagc ggtacggacc gggacgagct gctgtccggg ctgcgcgcgg 13620
tggcggacgg cctcgcgccc gccgccgtct cggcgggggc ggcccccggc ccggtcatgg 13680
tgttccccgg acaggggtcg cagtggcgtg gcatgggcgt ggaactgctg gactcctccc 13740
ccgtcttcgc ggcccgcatg gccgcgtgcg aggcggcgct cggcgagttc gtcgactggt 13800
ccctcaccgc cgtgctgcgc ggcgcgccgg gcgcgcccga gccgtcccgg gtcgatgtcc 13860
tccagccgtg cctgtgggcg gtgatggtgt cgctcgccgc cgtctgggag agctacggcg 13920
tgacgcccac cgcggtcgtc ggacactcgc agggcgagat cgccgccgcg tgcgtcgccg 13980
ggggcctgtc gctccgggac ggcgccaggg tcgtggcgct gcgctcccaa gccctgcgcg 14040
ccctcgccgg ccacggcacc atggcctcgc tcgcgctgag cggcgcggag gccgagcgct 14100
tcctcgcgga cctgggtgcg gcggcggcac gggtgacggt ggcggtgttc aacgggccgt 14160
actccacggt ggtgtcgggg cccaccgacc aggtcgccgc ggtggtggcg gcctgtgagg 14220
cggcgggcca ccgggcccgc acgatcgacg tggactacgc ctcgcacggc ccgcaggtcg 14280
accggctcgc ggacacgatc cgcaccgact tggccgacct ctcccccggc gcctcggacg 14340
cggtgttcta ctccgccgtc accggcgccc ggcagccgac ggaggagctg gacgccgact 14400
actggttcac caatctgcgg cagccggtgc ggttcgcgtc ggcgatcgac gccctgctgg 14460
ccgccgggta ccgcgtgttc atcgaggtca gccctcatcc ggtgctgatc cccgcgctac 14520
gggagtgctt cgaggaggcc gaggtggcgg cggccaccgt gccgacgctg cgccgggacc 14580
aaggcggccc ggatcaggtg gcgcgcgccc tgggggacgg cttcgtggcg gggctcgcgg 14640
tcgactggag ccgctggttc gtcggcgacg ggcgtgaggc gggcgacgag ggccaccggc 14700
cgcgtacggt cgagctgccc acgtatccgt tccagcggcg gcggtactgg ctggcgccgg 14760
accacggccg ccgcgagggc cgcacggccg gggtcggaac ccggccggcc gggcacgccc 14820
tgctgtcctc ggcggtcgaa ctggccgacg gcggcctggt gctgagcggc cggcttcccg 14880
gcgacgcggc ctgggtgggc gcgcacaccg tggcgggcgt ccagttggtg cccggcgcgg 14940
tcctggtgga ctgggcgctg ctcgccgccg acgaggcggg cggggcgtcg ctggaggaac 15000
tgctgctgcg cgcgcccctg gagctgtccg ggccgtccgg gctgtccgag ccgtccgcgg 15060
gtgtgctggc gcaggtcgcg gtcggggcgc ctgacgagtc gggccgccgt gagctacgga 15120
tctcctcccg gcccgccgac gcgggcgcgg gcgagggctg gacgtgtcac gcggtggggt 15180
cgctggcccc gggcggcccg cccgctccgg cggacaccgg gaccgcgacc gtcccctggc 15240
cgcccgcggg cgccgaggcg ctggatccgg ccgggctgta cgagcgggcc gagcggcgcg 15300
gctacggcta cgggcccgcg ctgcggggcg ttgtggcgct gtggcgggac ggcgccgacc 15360
tggtggcgga cgtggcgctg cccgaggagg ccggtggcgg cggcgaaggc ggtgcggacg 15420
gcgacggtac ggccggattc ggcctgcacc ccgtcctgct ggatgccgcg ttgcagcccg 15480
cgctgctggc cgaaccggac ggcaccggtg gcgagggagc ggggccggaa gcccggttgt 15540
ggctgccgtt cgcgtggagc ggcgtccggc tgtgggcgac cggggcgcgc gccgcgcggg 15600
tccggctgtc gccgctggac gggggcggtg gggatgttgc ggacgagcgc gagctgcgga 15660
tcgaggtgtc cgacccgacc ggggctcccg tgctgagcgt cgcgtcggtg gtgctgcgcc 15720
cgcgcaccgt acggcaggtg cgcgaggcga gcggggccgc ggcgggtggt ctcttcgccc 15780
tcgactggac gccggtggcg cctcaggagc cgtccggcgc cgaggacgac gccgggtgtg 15840
tggccgtact gggcgaggcg ccgacggagc cgggcgtgga cgggtgccgg gacacctata 15900
cggatctccc ggcgctgctg gccgccctgg acgcgggtgc gcccttgccc tccgtggtga 15960
tgtggcggcc accggccgcc gaccccggcg ccgcacccga ggacgccgcc ctgtccgccg 16020
tacgaggcgt ggcggcggcg ctgcgcgcct gggtcgccga gccgcggctg acggtcagcc 16080
ggctcgcggt ggtcactcgg ggcgcggtcg cggcgggcgg cgccgagggc gagccggtgg 16140
acctggccgc ggcggcggcc tggggctgtg cccgtggtgt ccaggcggaa caccccgacc 16200
gcatcgtcct cgtcgacgtg gacgacgacg tcgacatggg cgcggacacc gacaccgaca 16260
tcggcgcggc ggcgggtctg gcggccgcgc tcggcgaacc acaggtcgcg ttgcgcgggg 16320
acacgctgct cgcgccacgc ctggcgcgtt ccgcggccac cccgggcggc gtggcattcg 16380
acccgaacgg cacggtcctc gtcacggact ccggaggccc gctcgccggt tcggtcgccg 16440
agcatctggt gcgcgccgag ggtgtccggc acctgctgct ggtccgcttc gagggcgcgg 16500
acggcgccta cgacacgtat gacaggcaag acgcccaggt gcacatggtc acggttgatc 16560
ctcgcgacac cgccgcgctg gaacgggtgg tggcgcaggt cgatccggcc catccgctga 16620
ccggcgtcgt ccatgtcgcg ggactgtccg ccgacatcga gacaagtggc gcggcgcgcg 16680
ggtgggccgt ggcggccggg gtggtgcggg ccctgcacca ggccaccgcc gcgctgccgt 16740
ccgtccggtt tgtgaccctg tccgacgccg cgacggcctg ggacggcccg gcggcccccg 16800
aaagggcggc ggccggtgcg ttctgcgcgg ccgtgacgga cgtacggcgc cgggccggac 16860
tgcacggcct ggacgtcgcg ttcgggccct gggccgccgc cgacgacgat ggcggcgccg 16920
attccggtgg gcgctggacc ggggtcctcg gcgccgaccg cgggctcgcg ctgctgcgcg 16980
cggcctgtcg cgccgaccgc ccgcggctgg tggcggccga catccgtacc cgagccctga 17040
ccgcccatcc ggcccacgag ctcccggccg cgctgcgcac gctcggcgcg agtgcgagtg 17100
cgagtgcggg cggccgggcg ccggtccgtc gggtcgccgc cgccgcgccc ggccgcacca 17160
ccgactgggc gagccggctg gtgggcctcg ggcccgccga acggcgccgc gccgtcctgg 17220
agctggtacg cgatcacgcc gccgccgtgc tcgggcagcc cgaccccaag gccgtacggg 17280
ccgacgcctc gttcaaggag ctgggcttcg actcggtgac cgccgtcgag ctgcgcgacc 17340
ggctggtggc ggtcggcggg ctgcggctgc ccgccgccgt cgtcttccgc cacccgacgc 17400
cggaggcgct ggcccaccgt atcgagcagc aactcgcgcc cgacgacacg aataacgcag 17460
ctatcacaga caacgcagac aacgcagcaa agagcaacgg caacagcaac ggcacggcgc 17520
tcgacgcggc ggacaagctc gcgtcggcca cggccgacga aatcctcgac ttcatcgaca 17580
acgagctcgg cgtgctctcc gaagcgcgtc cgcgcccctc caactgaagg caggtgacga 17640
catggtcagc gaggagaaac tggtcgagta cctgcgccgg gtcaccaccg agctgcatga 17700
cgccaggacc cggctgcggg agctggagga gggcgagcag gagccggtcg cggtggtcgg 17760
catggcctgc cgtttccccg gcggggtgcg gtcccccgag gacctgcgcc ggctcgtcct 17820
gtcgggcggc gacgcgatcg gcgacttccc caccgaccgc ggctgggacc tggacggtct 17880
gttccacccg gacccggcgc acttcggcac cagctatgtg agccagggcg gcttcctgta 17940
cgacgtcgac cggttcgacg cggggttctt cgggatcagc ccgcgcgagg cgctggcgat 18000
ggacccgcag cagcggctgc tcctcgagct gtcgtgggag gcgctggagt ccgccggggt 18060
ggtgccgggc gcgctgcgcg ccagccggac cggggtgtac gtgggggtgt ccagcgagga 18120
ctacatctcc gggctgccgc agatcccgga gggcttcgag ggctacgcca ccacgggcag 18180
cctcaccagt gtcatctccg gccgcgtcgc gtacaccttc ggtttcgagg gccccgcggt 18240
caccgtggac accgcgtgct cgtcgtcgat ggtggccatc catctggcgg ggcaggccct 18300
gcgacagggc gagtgttccc tcgcactggc gggcggtgtc acggtgctgt ccacgccgct 18360
gatgttcacc gagttctgcc ggcagcgggc gctgacgccc gacgcccggt gcaagccgtt 18420
cgccgccgcc gcggacggca ccggcttctc ggagggggcc ggactcctgc tgctggagcg 18480
gctgtccgac gcgcggcgca acggccacga ggtcctggcc gtcctgcggg gctcggccat 18540
caaccaggac ggcgcgagca acgggctgac cgcgcccaac gacgtcgccc aggagagtgt 18600
gatccgggac gcgttggcga gggccgggct gtccggcgcg gatgtggaca tggtcgaggc 18660
gcacgggacg ggtacccggc tgggcgaccc catcgaggct gaggcgctga tcgccacgta 18720
cggggcggac cgcccggcgg accggccgct gtacctcggc tcgatcaagt cgaacatcgg 18780
ccatacgcat gccgcggcgg gggtcgcggg cgcgatcaac accgtgatgg cgctgcggga 18840
cggcaagctg gccaggaccc tgcacatcga cgagccgacc cgccacgtgg actggagcgc 18900
gggcacggta cggctgctga cggacccgta cgactggccg gtggccgacc ggccgcggcg 18960
ggcggcggtg tcgtcgttcg gggtgtccgg caccaatgcg catgtgatcc tcgaacaggc 19020
cccggacgcc ggcgctcagc aggatgctcg gcaaaggggc ggcgacacgt tccacggcgt 19080
ggtcccctgg cccgtttcgg ggcgcaccga ggcggcgctg cgggaccagg ccgcacggct 19140
gggcgcgttc ctgacagcgg acggcgcgac ggcgaacggg gcggcgaccg gtggggtcgc 19200
cgacgtgggc tggtcgctgg cgatgcgtcg cacggcgttc gagcaccggg ccgtcgtggt 19260
cggccgcgac cggtcggacc ttctcgccgc gctcgaaggt ctcgcggccg acgagccggg 19320
ccccgcggtg gtgcgcgggg tggcggcgga cgtcggcgcg ggcccggtca tggtgttccc 19380
cgggcagggg tcgcagtggc tgggcatggg cgtggaactg ctggactcct cccccgtctt 19440
cgcggcgcgt atcgccgcct gcgaacgggc cctggccgcg catgtcgact ggtcgctgac 19500
cgatgtgctg cgcggcgcgc ggggcgcggc cgacatcggc cgggtcgatg tggtgcagcc 19560
ggtgctgtgg gcggtcatgg tgtcgctggc cgcggtgtgg gaggcgcacg gcgtacggcc 19620
gtcggccgtg gtgggccact cgcagggaga gatcgcggcg gcgtgtgtcg cgggcgcgat 19680
gacgctggag gacggcgccc gcgtggtggc gctgcgggcg cgggcgctgc gggcgctggc 19740
cggatacggc gccatggcct cgctgggctg cggcgtcgag gaaaccgagc ggctgaccgc 19800
cgtacacgcg ccggacgtgg cggtcgcggc ggtcaacggc ccgtcgtcca cggtggtgtc 19860
cgggccgtcc gagcaggtcg agaagctggt ggccgccgtg cgcgccgacg ggctgcgggc 19920
ccgcgcgatc gacgtggact acgcctcgca cggccctcaa gtggaccgta tcgccgacga 19980
gttggccgac gtactcgccg gggtgtccgg cgccgccacg gacaccgcct tctactcgac 20040
cgtgaccggc gcccgtatgg acgcctccgg tctcgacgcc ggctactggt tcaccaatct 20100
gcggcagccg gtgcggttcg ccgaggcggt ccaggcgctg ctcgacgccg attaccgggt 20160
gttcatcgag gtcagcgcgc atcccgtact gctgctcggc cttcaggagt gcttcgaggc 20220
ggcgggccga ccggccgtgg cgatcggcac gctccgccgg gacgagggcg gccccgagcg 20280
gctgtgccgg gcgctggccg aggcgcatgt cgcgggcgtg gcggtggact gggcgagctg 20340
gtacgccgat gggcccgcac ccgcggccgt accactgccg gcgtacgcct tccagcggga 20400
gcggtactgg ctgccggccg gtgccgggtc cggtcctggc gatgtcgcgg gtgccgggct 20460
caccgcggtc ggacacgcgc tgctcccggt gtccgtacgg ctggccgacg ggagcctggt 20520
gctcaccggg cggctgccgg aggcggcgcg ggccggctgg ctggccgaac acctcgtcgc 20580
ggacctcccg ttgctgcccg gcacggtgct ggtcgaatgg gtgctgcggg cggccgacga 20640
ggccggctgc ggtggtgtcg aggagctggc gctccaggtc ccggtggcgc ttcccgtatc 20700
cggcgggctg gtgatccagg tggtcgtgga cgccgccgag ggcgacggac gccgtccggt 20760
gcgggtccac tcccggcccg aggaggactc gggcgcgccg gacgcatggg tctgccatgt 20820
ctcgggcacg ctgctccccg gcgtggccgg gccggttccg ccgtccggtc cgggcggcgc 20880
gtggccgccg ccgggggcgc ggcccgcggc gatcgacggc ttctacgagc gggccgaggc 20940
cgcgggttac ggctacggcg cgttcttccg gggcctgacg aacgtatggc acgacggcga 21000
ggacacgctg gcggaggtgg tgctgcccaa ggaggcggcg gagcaggcgg gtggcttcgg 21060
tatccatccc gccctgctcg acgcggcgat gcagcccgta ctgctggccg gtcaactccg 21120
tcaatgcgct gccgcggcgg gcgcggacac ggcgtccggg accgtgctgc tcccgttcac 21180
ctggagcggg gtgcggctgt gggcgggtgg cgccacccgg ctccgcgtcc ggctgtcgcc 21240
gcggccggag gggctgcggg tgctgttggc cgatgccacg ggtgcccccg tactgaccgc 21300
ggacgcggtc gccctccgcg agacgggcgt ccagcagttg cgcgcgtcga gccgggtgcg 21360
ggggtcgcac gggctgttcg ccgtcgaatg ggtgccgccg ctgtccgcta cggcgggcgg 21420
gacggcgccc gcgacgctcg cggtgctggg cgacgacgcc ccggacctgg ccgatgccga 21480
ccgttacccc gacctcgacg cgctgttccg cgccgtggcc gacggcgccc ccgcaccgga 21540
tgtcgtcatc gcttcggtgc ggacgggcaa cgacccggcg gggtcggaca ccggcttggc 21600
caccgcccgc cggacgctga cgctggctca ggagtggctg gccgggagcg gggccgacgg 21660
tgcgcggctt gccgtggtca cccggtcggc gatacggacc ggggacgacg gccaggagcg 21720
ggtggtcccc tcggccgccg cggtgtgggg gctgatgcgc agcgcccaga cggaacaccc 21780
ggggcgtttc gtcctcatcg acgaagacac cgacagcacc gaaaacatcc tggaggccgt 21840
acgtacggac gaaccgcagc tcgcgctgcg cggcgggcgc gcgttggtgc cccggatggc 21900
ccgggtggac gccgaaccgg agctgacggc cccgtcgggc gagcgggcct ggcatgtggc 21960
ggccggcaag accggccccg acgatctcac ggcggtgccc agcccccggg cctccgcgcc 22020
cctcgcgccc ggccaggtgc ggatcgccgt acgcgccgcg gggctcaact tccgcgatgc 22080
gctgatcgcc ctggacatgt acccggacgc ctcggcgtcg atcggcagcg agggcgcggg 22140
ggtcgtgctc gaagtgtccg agggggtggc cggggtcgcc gtcggcgacc gtgtgatggg 22200
cctgttcaac gacgcgttcg gcccggtggc ggtggcggac gcccggatgg tcgccccggt 22260
gccggacggc tggagcttcc gggaggccgc ggcggccccg gtggccttcc tgaccgcgtg 22320
gtacggcctg gtggatcttg gcgggctgag ctccggcgag accgtggtga tccacggtgc 22380
ggccggtggc gtgggcatgg ccgccgtcca ggtcgcccgg cacctgggcg ccgaggtgtt 22440
cgcgacggcg agcccggcga aacacccggt gcttgagggc atgggcgtgg acgccgccca 22500
ccgagcgtcc tcgcgggatc tggggttcga ggcggcgttc tcgtcggcga ccggcggccg 22560
gggcgtcgat gtcgtcctca acagcctggc cggggagttc acggacgcct cgctgaggct 22620
tctcgcgccc ggcgggcggc tgatcgagat gggcaagacg gacgtacgcg atccggatca 22680
ggtcgcgcgc gagcactccg tggcgtaccg ggcgttcgat ctgatcgcgg acgccggccc 22740
ggagcgcatc gggcagttgc tggccgccct gggtgagcgg ttcgccgacg gcgcgttcac 22800
gcccctgccc gtcaccgggt ggcggctcgg ccaggccagg caggcgctgc ggcagctcag 22860
ccaggcccgg cacaccggga agctggtgct ggacgtggat cccgcacccg acccggacgg 22920
cacggtgctg atcacgggcg gcaccggcac tctcggcggt ctgatcgccg aacacctggt 22980
gcgctcgcgc ggggtgcgcc atctgctgct gctcagccgg cgcggcccgg acgccccggg 23040
cgccgaggag ctgacggccc ggctcaccga gctgggcgcg cgggtgcggg tggccgcggt 23100
ggacgtcggg gacgccaccg ccctgggcga ggcggtcgcg ggcgtggacc cggcgcatcc 23160
gctcaccggt gtcgtccacg cggcgggcgt ggtggccgac gcgatgctgc catcgcagga 23220
cgacgaacgc ctggtggcgg cctggtccgc caaggccgcg gcggcggccc gtctgcacga 23280
cgcgaccgcc gggctgccgc tgggcatgtt cgtgctgttc tcgtcgttcg cgtcgaccct 23340
cggcaccgcg gggcaggcca actacgcggc cgccaacgcg tactgcgacg ccctggtcga 23400
gcggcggcac gccgagggcc tgccgggcgt atcggtgagc tgggggctgt ggtccgccgc 23460
gagcgggctc accggagggc tgaccgaggc cgatgtcgcg cggatcgccc gccagggcat 23520
cgtcccgaac agcaccgagc agggatacga cctgttcgac gcggcgctcg gacacgggcg 23580
tcccgcgctg ctcgcgctga acctggacac ccgggcgctg gccgcgcagc ccgtcgcggc 23640
cctgcccgcg ccactgcgcg ccctggccgc cgacgcccag gcggcgggcg cccgctccgg 23700
cggcgcggcg gcgcggccga ccgcggcggc ggccgaggag cccgcggact gggctgcccg 23760
gttgcgcgcg ctcgccccgg ccgagcagcg gcgcctgctc accgacctgg tacgcaggca 23820
cgccgccacc gtgctcggcc acgccgaccc cgaggccgta ccggccgacg cggcgttcaa 23880
ggagctcgga ttcgactcgc tgaccgcggt cgagctgcgc aaccgcgtca ccgccgccac 23940
cgggctgcgg ctgcccgcca cggtcatctt cgactacccg gagcccgggg cgctggccga 24000
gcgtctgcgc accgaactgg cccccgagga gggggcatcg gcgacggcgc cggacctcta 24060
cgcgcccgtg ctcagccgac tcaccggcct ggaggagacc ctggcggcac tggcctcgtc 24120
cggtgtcaac ggcggtgtga atggcggagt ggccgacccg ggcgcggtga ccgcgcgtct 24180
ggagtcgctg ctggccgact ggaaggcggc ccacgccccg agccggaacg gcggcacggc 24240
ggccgaacgg ctcgaagccg cgacgaccga tcaggtcctc gacttcatcg acaaggaact 24300
cggagtgcaa tgagcggggg cgctgtgacc actgagacga acgaagagcg gctggtcgat 24360
tacctcaagc gggtcgccgc ggatctgcac gacacgcggg cgcggctgcg cgaggtggag 24420
gacggacagc gggaaccggt ggccatcgtg gcgatggcct gccgctaccc cggcgacgtc 24480
gcctcccccg aggatctgtg ggacctcgtc gccgcgcgcc ggcacgcgat gaccgcgttt 24540
ccggacaacc gcggctggga cctggagcgg ctgttccacc cggaccccga ccacccgggc 24600
accagctacg cccgcgaggg cggattcctg cacgacgccg acctgttcga cccggagttc 24660
ttcggcatca gcccccgcga ggcggcggcg gtcgacccgc agcagcggct gctgctcgaa 24720
gtggcctggg aggcgctgga gcgggcgggc atcgcgcccg ggtcgctcaa gggcgcgccg 24780
gtcggcgtgt acgcgggcac cgcgctgccc ggcttcggca ccccgcatat cgaccgggcc 24840
gcggagggct atctggtcac cggcaacgcg ccgagcgtgc tgtccggccg ggtggcgtac 24900
accctgggcc tggaggggcc cgcggtcacg gtggacacgg cgtgttcgtc gtcgctggtg 24960
gcgatgcatc tggccgcgca ggcgctgcgg cagggcgaat gcgagctcgc gctggcgggc 25020
ggtgtgacgg tcatgacgac gccttatgtg ttcacggagt tcgcccggca gcgcggcctg 25080
gcggcggaca gccggtgcaa ggcgttctcc aagggggccg acggcacggc gttcgccgag 25140
ggcgcggggc tgctggtgct ggagcggctg tcggacgccc agcgcaacgg acatcaggtg 25200
ctggccgtga tgcgcggttc ggcgatcaac caggacggtg cgagcaacgg gctgaccgcg 25260
cccaacggtc tggcgcagca gcgcgtcatc cggcaggcgc tggccggcgc ccggctctcc 25320
ccggcggatg tggatgtggt cgaggcgcat ggtacgggca ccacgctcgg tgacccgatc 25380
gaggcccagg cgctgctggc cacctacggt caggagcggc ccgagggccg gccgctgtgg 25440
ctgggcgcga tcaagccgaa cctcgggcat acgcagggcg cggcgggcgt ggccggcgtg 25500
atcaagatgg tgatggcgct gcggaacgcc tcgctgcccg ccctgctgca cgccgaccgg 25560
cccacgtccg tggtcgactg ggacggtggc gcggtacggc tgctggccga gccggtggcg 25620
tggccggccg gggaccggcg gcgccgggcc ggcgtctcgt ccttcggtat cagcgggacc 25680
aacgcgcatc tgatcctcga ggaggccccg ccacggccgg acacggagct gcccgcgagc 25740
gtgcctcagc ggccggaggg caccgtcgtg ccgtgggtgg tctccgcgcg gggcgcggtg 25800
tcgcttcgta cgcaggccgc ggcgctcgcc gagcacatgg ccgcgcaccc ggacaccccg 25860
gtcgacgcca tcggctggtc gctggccacc acacggtcgc cgctcgacca ccgcgcggtc 25920
gtgctgggcg cggaccgcgg ggagttgtcc gcccggctcg ccgacctcgc cgaggggcgg 25980
acccaccccg atgtgacgcg ggcgtccgcc ccggcgcggc tcggtggcag cgcgtttctc 26040
ttcaccgggc agggcagcca gcgccccggc atgggcgccc agctgtaccg ggcctatccg 26100
gcgttcgccg ccgcgttcga cgaggcgtgt gcggcgctcg acccccatct ggggcggtcg 26160
ctgctggagt tggtgttcgc accggccgac acagacggag acggagatgc ggacagggcg 26220
agcgcgctgg acgcgacgga ggtgacccag gccgcgctgt tcgcggtcga ggtcgccctg 26280
taccggctgg tcgagtcgtt cggtgtggtc cccggctatc tggcgggaca ctcggtcggc 26340
gaactggtcg ccgcgcatat cgccggggtg ctgtcgctgc cggacgccgc gcggctggtc 26400
gccgcccggg ggcggctgat gcaggcgctg cccgagggtg gcgcgatggt cgcgttggag 26460
gccgccgagg aggaggtggc gctgctgctc gccgggcggg cggaccaggt ggcgctggcg 26520
gccgtgaacg cgccgacgtc ggtggtggtc tcgggtgacg aggaggcggt cgaggagatc 26580
gcccgtacga tccgggagcg cggacaccgg accaggcggc tgcgggtcgg tcacgccttc 26640
cactcgccgc ggatcgaccc gatgctggag gagttccggc aggtcgcggc ctcactgacg 26700
tattcacagc cccggatcgc cgtcgtctcg aacgtcaccg gcgcgctggc cggggccgag 26760
cagctgtgcg accccgacta ctgggtgcgg catgcgcggc agccggtgcg gttccgcgac 26820
ggtatcgccg cgctgcgcgc ggagggtgtc acccgcttcc tggagctcgg cccggacgcc 26880
gtgctcaccg ccatggcgcg cgactgcctg acctcggagg cggccccgga ggtggccgcc 26940
ggggaatcgg cccaggatgc cggcacggcc tccggtccca gcgcgccggt gctcgccacg 27000
gtgttgcgca agggccgcga cgagccccgt acgctgctga ccgccctcgc ccagctccat 27060
gtggacggcg agagcgtcga cttctccgcc tccttcccgg ccaccaccca ggccaccgac 27120
ctgcccacct accgcttcga ccgccgccgg tactggcggg acgccccgca ggcggaggcg 27180
gacgtacggg ccgcggggct cgaggcgagc gaccatccgc tgctgcgcgc cgcgctcgaa 27240
ccggcggacg gcgggctgct gctgaccggc cggctgtcgc tgcgcggcca gccctggctg 27300
gccgaccacg ccatcgtgga cgcggtgccg ctgcccggca cgctcttcgt ggagctcgcc 27360
ctgcaggccg gggaacgggt cggctgcgac ctgatcgacg atctcaccct ggaggcgccg 27420
ctgctgctgc ccccggtggg tgcggtggac ctccaggtgg ccgtcggggc caccgacgcg 27480
gccggccgac gggccgtcac cgtctactca cgcccgtccg gcggcggttc ggagggttgg 27540
gagagttcgg cggacccggg tgtcgcggac gagccgggcg acggcccgta cggcccgtgg 27600
cgccggcatg ccacggccac tctgggcacc gcgccgaccg gtgtccccga gcccgccgcg 27660
tcaccggcgc agtggccgcc cgccggggcg gaggcgatcg acgtggccgg gctgtacgag 27720
cggctggccg ccgagggcta ccggtacgga ccggcgttca cggggttgcg caccgcgtgg 27780
cgggtcggcg aggagatgtt cgccgaggtg ggtctcgccc ccggtcagcg cggcgacggc 27840
ggggcgtacg ccgtccaccc cgcgctcctg gacgccgccc tgcacccgat cggcgcgctg 27900
ttcaccggcg aggaccaggc gggcggcgcg ccgggcacgg tccggctgcc gttctccttc 27960
ggcggcgtac ggctgctcgc gcgcggcgcg tcccggctgc gggtccggat cacgccgacc 28020
ggacctgaca ccgtgaccat gcggctgtcc gacgacaccg gcgccgaggt cgtggccgtc 28080
gactcgctca cgctgcgcac cgtttcggcg cagcggtggc ggtccggggc ggtcccggcg 28140
gaccggccgc tgtaccggct ggactgggat gccttcgcgc tgcccgccgc cgcgacaacg 28200
gcgccggacc gctgggccgt actcgccgcc gatgacacga acgccacgga caccgccacg 28260
gcctcgctgc ccgcggaaca cgtcgcgcgc caccccgacc tcgccgcgct gagcgcgtcg 28320
gtcgccgccg gggcccccgc ccccgacctg gtcgtcatgg cctgcctcgg tgccccctac 28380
gacacgtccg acgacggcga cgagccgccc tcccaggtgc gcaccgccac ccatcgtgtc 28440
ctcgcgcggc tgcgggagtg gctgacggac gacgccctcg ccgcctcccg tctggtggtc 28500
ctgactgcca aagccgtcgc cgccgacccg gccgacgcgc cgcccgatct ggcgggcgcg 28560
gcggtgctgg gcctgctgcg cgccgcccag gccgaacacc cggggcggat cgtgctggtg 28620
gacaccgacg gcatctcggc gtcccgggac gcgctggcgg cggccgtggc tgccgccgtg 28680
gccgccggtg agtggcagct ggcgctgcgc gacgggcgcg cgctggtgcc ccggctgatc 28740
ctggcgcacc ccgaccccga cgcggcgccg gtggtgctcg accccgacgg caccgtgctg 28800
gtgaccggcg gcaccgggtc gctcggccgg ctgctcgccc gccacctcgt cgaacatcac 28860
ggcgcccggc atctgctgct ggtgagccgg agcggaccgg ccgccgaggg catcgaggcg 28920
ttcgcggccg ggctcgcggc cgacgtacgg atcgagagct gcgacaccac cgaccccgag 28980
gcgctggccg ccctgctcgc caccgtcccc ggcgaacacc cgctgaccgc ggtcgtgcac 29040
accgcgggcg tgctcgacga cggcgtggtc acctcgctga cgcccgaaca gctcgacacc 29100
gtactggcgc ccaaggtgga cgcggcctgg cagttgcacc ggctgacccg gggcgcggac 29160
ctcgcctcgt tcgtcctgtt ctcgtccgcc gcctccgtgc tcggcagcgg cgggcagggc 29220
aactacggtg cggcgaacgc gttcctcaac gccctcgccg agcacatccg cgccgcgggc 29280
ggcccggcca cctcgctggc ctggggcctg tggggcgtgg acgaggggat gacggaacac 29340
ctggccaccg cggaccgggc ccggatggcg cgttcgggca ccgccgcgat gtccggggag 29400
gcgggcctgg cccggttcga cgcggcgctg gccaccgcgc tgcccgtact ggtcccggcc 29460
aggttcgacc tcgccgtgct gcgcgaacag gcggccggcg gcgcgctgcc tccgctgctg 29520
cgccgcctcg tacgcctgcc cgtccgtacg gccgcggcgg tggaggcgtc cccctcctgg 29580
gccgggcggc tcgccgggct gccggagacc gagcaggacc gggtgatcgg ggagctgata 29640
cgcgaccgga tcgccgcagt cctggcgcat ccggagcccg aaaccctcga actgggccgg 29700
acgttcgcgc agctcggcct ggactcgctg accgcgctgg agctgcgcaa cgcgatccat 29760
gaggcgacgg gggtccgcct cccggccacc gccatcttcg actacccgac ccccgagacc 29820
ctcgtcagcc atctgcgcac cgaactgctc ggcgccaccg cgaccaccgc ggccaccgcg 29880
ccgctgccgc cgggggccgg ggccccggcc cggtccggga gcgccgacga ccccgtcgtg 29940
atcgtcggca tggcctgcca ctacccgggc gatgtgcact cccctgacga gctgtggcgg 30000
ctggtggccg acggggtgga cgcgatcggg ccgttccccg aggaccgtgg ctgggatgtg 30060
gccgggctgt acgacccgga ccccgagcgc accggcaaga gctacaccag ggagggtggc 30120
ttcctgcccg aagcggccct gttcgacgcg gagttcttcg ggatcagccc ccgggaggcg 30180
ctggccaccg atccccagca gcggctgttg ctggagacgg cgtggcaggc gttcgagcac 30240
gcccgtatcg atccggcggc gctgcgcggc agccgtacgg cggtggtcac cggcatcatg 30300
tacgacgact acggcgcgcg gttcctgggc cggatcccgg aggggtacga gggccagatc 30360
atgaccgggt ccacgccgag cgtggcgtcc ggccgggtgg cctacacctt cgggctggag 30420
gggccgacgc tgaccgtgga cacggcgtgt tcgtcgtcgc tggtggcgat gcacctggcg 30480
gcgcaggcgc tgcgccaggg ggaatgcgac ctcgccctgg cgggcggtgt gacggtgatg 30540
gccacgccga acacgttcat cgagttctcg cgccagcgcg ggctggcgcc ggacgggcgg 30600
tgcaaaccgt tcgctgccgc cgccgacggc accgggtgga gtgagggcgc cgggctgctg 30660
gtgctggagc ggctgtcgga cgcgcgccgc aacgggcacc gggtgctcgc ggtcctgcgg 30720
ggttcggcgg tcaaccagga cggtgcgagc aacgggctga ccgcgcccaa cgggccgtcc 30780
cagcagcggg tgatcggcca ggccctggcc gcggcggggg tggatcccgc cggggtcgat 30840
gtggtggagg cccatggcac gggcacgatg ctgggcgacc ccatcgaggc ccaggccctg 30900
ctggccacat acggccggaa ccgccccgca gaacagcccc tatggctggg gtccatcaag 30960
tcgaacatcg gccacaccca ggccgccgcc ggggccgcgg gcatcatcaa gatggtcatg 31020
gccctgcgcc acggccggct gcccgccacc ctgcacgtgg acgagccgag cccgcatgtg 31080
gactgggcga gcggaagcgt ccggctcctc accgaggcca ccgactggcc ggaggccgac 31140
cggccccgcc gcgccgccgt ctcctctttc ggcatctccg gcaccaacgc ccacctcatc 31200
ctggaacagg cccctgacca gcccgaaccg ctcccggaac agtccgagtc agccaccgcg 31260
ggcggcatcg tgccgttcgt gctctccgcc cgtacggccg aggcgctgcg cgcccaggcc 31320
gcgaacctgg cggcgcggct cccctcggcc ggggtggcgg aggtgggctg gtcgctggcg 31380
accacccggt cggcgttcga gcaccgggcg gtgatcgtgg ccgaggaccg ggacgcgctg 31440
ctggccggtc tggagaagct ggccgccgac gagcccgatc cggcggtggt ggcggggacg 31500
gcgaccaccg cggccgcggg cccggtgttc gtcttcccgg gccaggggtc gcagtggcgc 31560
ggcatgggcg tggaactgct ggacacctcg ccggtgttcg cggcccggat cgcggagtgc 31620
gagcgggccc tggcgccgta cgtggactgg tcgctcaccg ccgtactgcg cggcagtgac 31680
accaccaccg acccgcaccg cgtggatgtc gtccagccca ccctctgggc cgtcatggtg 31740
tccctggccg ccctctggca acacctgggc atcgccccgg ccgccgtcat cggacactcc 31800
cagggcgaga tcgccgccgc ttgtgtggcg ggagccctca ccctcgacga cgccgccaag 31860
gtcgtggccc tgcgcagcca ggcactgcgc gccctcgccg gccacggcac catggcctcc 31920
ctcaccctcg gcgccgacga caccgccggg ctcctggagg agctgggcga gcgggccgac 31980
gacgtcaccg tggccgccgc gaacggcccc gtcaccaccg tcatctccgg tgcggtggag 32040
cagatcgcca ccgttctggc ggcggccgag gcgcacggcg cgcggacccg caccatcgat 32100
gtggactacg cctcccatgg gccccatgtc gaccgcatcc gcgaggacat cgtctccgcc 32160
ttgagcggcc tcgcgcccac cgcgtcggag gtcgccttct actcgacggt gaccgccgaa 32220
cgcctggaca ccgccgggct cgacgccgac tactggttca ccaacctccg ccgcccggtc 32280
cgcttcgccg acaccctcgc cactctcctc gcccatggac accggcactt catcgaggtc 32340
agcccgcacc ccgtgctgat ccccgggatg caggacggtt tcgaggcggc cgacgccgcc 32400
gccaccgccg tccccaccct ccgccgcgac cagggcgggc cccaccagct cgcccaggcc 32460
gtggcccgtg cgtacaccgc cggtctggcc atcgactggg ccccctggta tccggcccgg 32520
ccgtacacca ccgacctgcc cacctacccc ttccagcgcc gccgctactg gctcggcatg 32580
gacggcggac cgggcgatct gcgcagcgcc gggctcgtct cggtgtccca cgcccagatc 32640
ggcgccgcgg tcgaactggc cgacggcgga ctggtgatga ccggccggct gcccgccgcc 32700
gggagcggcg gctggctgga cgaccacgtc gtcgccgaca ctcccctggt gcccggcact 32760
gccctggtgg agtgggtgct gcgagcggcc gacgaggcgg ggtgcggcgg gatcgaggag 32820
ttggcgctcc atgtcccgat gacactgccc gcgtccggtg ggctgcggat ccaggtcgtg 32880
gcgtacgcgc cggacggcga cggccgccgc gaggtacggg tgcactcccg ccccgacgcg 32940
gaggacggct cgtcaccctg gacctgccac gccaccggac acctctcccc gaccgccccg 33000
ggcgccgcgg acccggcccc ggccggggtg tggccgccgc gggacgccga acaggtcgac 33060
gtggccgact tctacggccg cgccgaggcg atcggctacg gctacggacc ggccttccgg 33120
ggcctgaccg ccgcctggcg gcagggtgac gacctgctcg ccgaggtggt gctgccggag 33180
gcggcgcacg agggcgccga cgggttcgcc ctgcaccccg cgctgctcga cgcggccctg 33240
caccccctgg ccctggacgg gcagggggag gacgggcgga tgcggctgcc gttcgcctgg 33300
agcggtgtgt cgctgtgggc caccggcgct cgcgcggccc gggtccggat gtccccgctg 33360
gaacacggct tccggctggt ggtggccgat gcggcgggcc ggccggtgct cagcgccgaa 33420
tccgtggtcg tacgccccac cagcgcacgg cagttgcgcg acgcgggcgc gcggcgggtg 33480
gacgggctgt acgaggtggc gtgggtggcc ctgccgccgt cgtcggacac cgtcgctgag 33540
acggagacgc ggggcgtaga ggggtgggcg ctgctggacg gcggcccgct tccgttcgac 33600
ccgtcaaagg ccggatccct gcctcgccac gccgacatcg acgccctgct gaccgccccc 33660
gccctgccgt cgacagtgct cgtcggcgtt tccggccccg tgggcgccgc gggcgacgag 33720
aattcggccg agggcgcgtt ggccgtgacc accggtgtgc tgacgtcggc gcggcggtgg 33780
ctggagacac cggagctggc cgacgcccgg ctggtgctgg tcacccgggg cgcggtcgcg 33840
gctgcggaaa ccgacgacgg ccccgaccct gcggctgccg cggtgtgggg gctgctgcgc 33900
agcgcacagg cggagaaccc gggccgtttc ctgctgtgcg acatcgacga cggcgccggg 33960
cccgacgatg tgctcggcgc cgtcacacgg gcggtggccc tggacgagcc gcaggtggcg 34020
gtgcgcggtg agcgtgtgct gacgccccgt cttgagcggg cgggagccgc tgagttggta 34080
ccgccgccgg gtgagcctgc ctggcggctg tcggccgatg acaccggcac catcgacagc 34140
gtgtcggtgg tggcctgccc cgaggtgctg gagccgctgg cgcccgggca ggtgcggatc 34200
gcggtgcggg ccgcgggcat caacttccgc gatgtgctga tcgtgctcgg gatgtatccg 34260
gacgaggggg tgttccgggg cagcgagggc gcgggcgtgg tgctggacgt ggccgacgat 34320
gtgacgtcgg tggccgtcgg cgaccgtgtc ttcgggctgt tcgagggcgc cttcgggccc 34380
gtggccgtcg ccgacgcccg cgccgtcgta ccggtcccgc cggactggac cgaccagcag 34440
gccgcggccg tcccgacgac gttcctgacc gcctggtacg ggctggtgga cctggccggc 34500
ctcaaggcgg gcgagtcggt cctcatccat gccgccacgg gcggggtggg cacggcggcc 34560
gtgcagatcg cccgccacct gggcgccgag gtgtacgcca ccgccggccc cggcaaacac 34620
gccgtactgg aggcgatggg catcgacgag gcccaccgcg cctcctcccg cgacctggac 34680
ttcgaggacg ccttccgcac cgccaccggc ggccgcgggg tcgacgtgat cctcaacagc 34740
ctcgcgggcg agtacaccga cgcgtccttg cggctgttga ccggtggtgg gcgcttcatc 34800
gagatgggca agaccgacaa gcgtgacgcc gagcagatcg ccgacacgta ctccggcgtc 34860
cgctaccgct tctacgacct ggtgccggac gccggtctgg accgggtcgc ggagatgctg 34920
accacgctgg ccgggcactt cgcccagggt gttctggcgc cgccgccggt gcgggcgtgg 34980
ccgctgacgg aggcccggca ggcgctgcga cagatgagcc aggccaggca caccggcaaa 35040
tacgtcctgg acatgccccg tacgctcgac ccggacggca ccgtcctgat caccggcggc 35100
accggcaccc tcggcgccct cgtcgccgaa cacctcgtca ccaaccacca catcagccat 35160
ctccacctcc tcagccgccg cggacccgac gcccccggct ccgccgacct ggcggcccgc 35220
ctcaccgaac tcggcgccac cgtccgcatc accgccaccg acaccaccga cccgcaggcc 35280
ctccggcagg ctctggacac cgtcgaccgc gaccatccgc tcaccggtgt catccacgcc 35340
gcgggggcgc tggacgacgc ggtgctgacc gcgcagacac cggagcggct ggcgtcggtc 35400
tgggcggcga aggcgacggc cgcggccaat ctccaccggg ccaccaagga cctccccctg 35460
gccatgttcg tgatcttctc ctcggccgcc ggcaccctcg gcacaccggg gcaggccaac 35520
tacgccgccg ccaacgccta ctgcgacgcc ctcgccgtcc ggcgacggcg ggcggggctg 35580
cccgccacct ccatcgcctg ggggctgtgg gccgccacca gcgagatgac cgggcatctg 35640
gcggacgccg atctggcccg gatgagccgc accggcttca ccccgctggc caccccgatg 35700
gcgctggccc tgttcgacgc cgccgggcga cacggcgccg cgaccccgct cgcgctcgac 35760
ctcgatccgc gtacgctcgg cgcccagccg tccgacgccg tgcccgcggt gctgcgtacg 35820
gtcgccgccg cgggagcgcc ggtccggcgt accgccgccg tcgcacagtc caccgactgg 35880
gccggccgac tggccgcgct ctcggccgcc gagcgccacc gcgagctggt caacctggta 35940
cgtacgcatg cggcgacggt gctcggccac agcgaccccg cggcgctccg cgcggacacc 36000
tcgttcaagg agctggggtt cgactcgctg accgccgtcg agctgcgcaa ccggctctcg 36060
gccgcgaccg gtctgcggct gcccgccgcg ctcgtcttcg actatcccga cgcggagacc 36120
atggcccgct tcctggacca gaagctcgcc cccggggacc ggacggaggc ggcggcggtc 36180
gaccacctcg cccccgtcct gaacgacttg gcccggctgg agtccacctt gggcagccat 36240
gacgtggacg ggaaggcccg cgagacggtc gcgggccggc tccacgccct gctgtcccgg 36300
ctggagggga gcacggccag tgcggcggac atcgacggcg aggcgctgga gtccgcctcg 36360
gacgacgaga tgttcgcgct catcgaccag caactgggct cgtcctgacg agaagtgggg 36420
gaggaccaat gtcgtccacg gaagacaagc tgcgccagta cctcaagcgg gtcaccgtcg 36480
acctgggcga ggcccgagcg cgcctgcgca aagcggagca acgccaacac gagcccatcg 36540
ccatcacctc catggcctgc cgctaccccg gcggtgtcac ctcccccgaa accctgtggg 36600
agctggtcga cagccgcacc gacgccatcg gctccttccc ggccaaccgc ggctggaacc 36660
tggcctccct ctaccacccc gaccccgacc actccggcac cagttacgtc cgggacggcg 36720
gcttcgtcca cgacgccgat gaattcgacg cctccttctt caacatcagc ccgcgcgaag 36780
ccctcgccat ggacccgcag cagcggctcc tcctcgaaac cgcctgggaa ctcctcgaac 36840
gcgcccatat cgaccccacg gccctcaagg gcacccccac cggggtctac acgggctgtg 36900
gtgtgcccgg tttcggcaca ccacatatcg agcggagcgc cgagggcttc ctgctgaccg 36960
gcaacgcgct cagcgtggtg tccggccgta tcgccttcac cctcggcctg gagggcccgg 37020
cggtcacgct ggacacggcg tgctcgtcct cgctggtcgc gatgcacctg gcggtccagg 37080
cactgcggca gggcgaatgc gacctggcgc tggccggcgg ggtgacggtc atgtcgaccc 37140
cgaacgtgat cgtcgagttc tcccggcagc gcggcctgtc cccggacggg cggtgcaagc 37200
cgttcgccac ggcggccgac ggcacgggct tctccgaggg cgcgggcctg gtgctgctgg 37260
agcggctgtc ggacgccgag cgcaacgggc accaggtgct cgccgtcatc cggggtacgg 37320
ccgtcaacca ggacggcgcg agcaacgggc tgagcgcccc caacggaccc tcgcagcagc 37380
gggtgatccg ccaggcgctg gccaacgccg ggctggccac ggtcgaagtg gacgcggtgg 37440
aggcccatgg cacgggcacc acgctcggcg acccgatcga ggccgaggcg ctgctggcca 37500
cctacggtca ggagcggccg gaggaccgtc cgctgtggct ggggtcgatc aagtccaaca 37560
tcggccatac gcagggcgcc gccggggtcg cgggcgtcat caagatggtg atggccatgc 37620
gccacgcctc gctgcccgcc accctgcacg tcgacgagcc cacgtcgcat gtggactggg 37680
accggggcac ggtacggctg ctgaccgagc cggtcgactg gccgacggcc cccgaccggc 37740
cgcgccgggc cggtgtctcg gcgttcggca tctccggcac caacgcccac atcatcctgg 37800
aggaggcggg cctgcccacg gcggcggagg cggaggctgg gactgcggct gaggctggga 37860
ctgacgctgg ggctgggact gaggctgggg ctgaggacgc cgcaccggag gaggccaccg 37920
tagagccggc cctcctgggc ggcgtggcgc cctgggtggt ctcggcccgt acccaggagg 37980
ccctggccga ccaggcgcgc gggctggtcc gcgccgtcac cgacaccggc gccccggatg 38040
ccgtgcccgc cgaggtggcc tggtcgctgg ccaccacccg cgccacattc gaccaccgcg 38100
ccgtggtcac cggcaccgaa ctcgccgacc tcaccgccgc gctcgaggcc ctggccaccg 38160
gcggtgaaca cccccatctc gtccgaggca ccgccctcga cccgcaggcg ggcccggtcc 38220
tcgtcttccc cggccagggc tcccagtggc ccggtatggc cgtcgggctg ctggactcct 38280
cccccgcctt cgccacccgg atcgccgcct gcgaacaggc cctggccccg tatgtcgact 38340
ggtcactcac cgccgttctg cgcggcagtg acaccgcaac cgacccgcac cgcgtggatg 38400
tcatccagcc gaccctgtgg gccgtcatgg tgtccctggc gggcctgtgg caggacttcg 38460
gcatcacccc ggccgccgtc atcggacact cccagggcga gatcgccgcc gcttgtgtgg 38520
ccggggccct cagcctcgac gacgccgcca aggtggtggc actgcgcagc caggcgctac 38580
gcgccctcgc cggccacggg gccatggcct ccctcaccct cggcgccgag gacaccgccc 38640
gcgtcctgac cggactcggc ccggccgccg agggcgtcgc ggtcgccgcc cacaacggcc 38700
cccgctccac cgtcgtctcc gggccaccgg accagatcgc caccgtcctg gcggccgccg 38760
aggcacgagg ggcgcggacc cgcaccatcg acgtggacta cgcctcccac agcccgcacg 38820
tcgaccgcat ccgcgacacc atcctcgcgc aactcgccga tctggccccc gccgcgccca 38880
ccatcccctt ctactccacc gtcaccggcg agccgctggc cgacaccccg ctcgacgccg 38940
agtactggtt caccaacctc cgccagccgg tccgcttcac cgacaccctc accaccctcc 39000
tggaccacca gcaccgccac ttcatcgagg cgagccccca ccccgtcctc acccccggca 39060
tccaggacgc catcgacgac gccgagctcc ccgccaccac gatccccacc ctccgccgcg 39120
accacggcac cccccacgac ctcgcggacg ccctcgcgct cgcccacacc accggcctcg 39180
ccgtcgactg gcggccctgg tacgccacca ccccgcccgc caccaccgat ctgcccacct 39240
accccttcca gcggcagcgc tactggtcgg cggctggccg ccggaccggc gacgtcagcg 39300
cggccggtct gcggccggtg gaccatccgc agctgtccgc cgcgacgggc ctcgccgacg 39360
gcgggctcct gctcaccggg cgcctgccgg ccgccggtga cgcgggctgg ctgggcgagc 39420
acgagttcgc cgatgtggtc ctggtgccca gcacggccct ggtggagtgg acactgcggg 39480
cggccgacga ggcgggctgc ggcggcgtcg aggagctgac ccttgaggtc ccgctgaccc 39540
tgtccgccgc ctccgagcta cgggtccagg tcgtggtgga cgcccccgac gaggacggcc 39600
gacgcgccgt acgcgtctcc tcccaacccg cggtggacac cccggaccgg gccgacgggc 39660
aggacacctg gacctgccat gcgacgggca ccctcatggc cgccgcagcc gctgggaccg 39720
agctggcggg cgcctggcca ccggccggtg ccgaacctgt ggacctcacc aacctgtacg 39780
cacgcgccga ggccgccggc taccggtacg ggccgacgtt ccagggtgtc caggcggtgt 39840
ggcggcacgg cgccgacctg ctcgccgaag tggccctgga ccagggggcc gaggagggcg 39900
gcgacgagtt cggcatccat cccgcgctgc tcgaatgcgc ccttcacccg gtggcgctga 39960
ccgatacacc gcacgacgat acgccgctgg gcgatgcgga caccgacggt ccgctgtggc 40020
tgccgttcgc ctggaacggc gtatccctgc acgcgggcgg cgcgacgagc gtacgggtgc 40080
ggatcgggca gcgggggcag acggacaccg aaggccgtga gctgaccgtc gtggtcgcgg 40140
acccgaccgg cgccccggtg ctcaccgtgg actcggtcgc gctgcggccc gccgacggcg 40200
actggctgaa agccgcggaa cggcgttcca cggccgcgct gttcacggtg gaatggacac 40260
cgctgccgcc ccaggacagt cggcccgagc cggtcgaggc cgaggacggc tgggcgacac 40320
tcggcgcctc cggccccggg caccactacg ccgacctggc cgcgctgctg tcggcggcgg 40380
atggcgcaga gcccgcgccg cccgtggtcc tcgcctccgt aactccgacg gccgacaccg 40440
gggccgactc cgaggctgac acggatctgg ccaccgtacg gcgcacgttg gggctcatcc 40500
aggagtggct ggcggagccg gggctacggg actcccggct ggtgctgatc acctccggcg 40560
cgacctcggt cggcgacggc gacgggccgg tggagccggg gagcgctgct gtgttcggcc 40620
tggttcaggc ggtccaggcc gaacacccgg accgcttcat gctggtcgac gtgggcgccg 40680
acgcggacgc ggacggtgac ggagacggtg gcgagacgtt ggccgatgcc gtacgccggg 40740
ccatcgccgc ggacgaaccg cagatcgcgg tccgttcggg cgaggtgtcg gtcccccgcc 40800
tgctgcgcgc cgccgcccgg cccgacgagg gcacggcggt cgagctgtcc ggcggtacgg 40860
tggtggtgtc gggcgcgatg gaccatgtgt ccggcggcgc gatcgccgag cagctggtac 40920
gtgcgtacgg agccgaacgt ctgctgctcc tctcccaccc ggacgagcag gctcccgacc 40980
tggccgagcg gctgacggcg ctgggcgcgg cggttgaggt ggcggtggtc gatatcgccg 41040
accgcgccgc tctggcggag gttctggcgt ccgtaccgga cagccacccg ctggtgggcg 41100
tggtgcatct ggcgggggcg gccgacgagg ggccagtgga gtcgtggaac gacgggcgac 41160
tgtcgcgggc gtgggcgccg cgggccaccg gggcgtggca gctgcacacc ctcacccagg 41220
acctgccgct gcggatgttc gtggtgtgct ccgccgccgc tgacgtcacg ggcggccccg 41280
gccgggccgg atacgccgcc gccaacgccc acaccgacgc cctgatcgcc caccggcgcg 41340
ccgccgggct cccgggcacg ggtctggtct gggccctgga ggaagaggcc accgccgacg 41400
cctcgcgcct gttcgacgcg gcgttccatg ccgtacagcc gctggtcgtg gccgcggacc 41460
tggacaccgc ccggctgggc ccctcggccc ccgcgctgct gcgtgccctg gtccggccgg 41520
cccggcgccg cgccgcggaa cggcagtcgg cggcccacgc cctgacctcg cgcctcgccg 41580
gtctcgacaa ctccggacag cgcgaactgc tgctggatgt ggtgcgccag atggccgccg 41640
tggtcctggg ccactcctcc gacacggcga tccgggccga ggccgccttc aaggagctgg 41700
gcttcgactc cctcacggcc gtcgggctgc gcaaccgcct tgtcgacgcc accgggctgc 41760
ggctgccctc gaccctggtc ttcgactatc cgaccccccg ggccctggcc gaccacctcc 41820
tccagctcgt gaccagcacc gcgcccacca cttcgctccc ggtggggccg gcgcgggccg 41880
ccggcgcgga cgacgagccc atcgccgtgg tggcgatggc atgccgcttc cccggcgatg 41940
tgaccacgcc cgaggggctg tgggacctgg tcgcggcggg agagaacatc cgcggcccgt 42000
tccccaccaa ccgcggctgg gacctggcca acctcttcca ccccgacccc gaacaccccg 42060
gcacgaccta cgcctcgcaa ggcgcgttca tctacgacgc cgacggcttc gacgccgcgt 42120
tcttcggcat caacccccgc gaggcgctgg ccatcgaccc ccagcagcgc ctgatcctgg 42180
aaaccgcctg ggaagccctc gaacgcgccg gtatcgaccc acacaccctc aaggagagcc 42240
tgaccggcgt ctataccggc gtcatctacc acgactacgc cgccggtctg cccgcgagcg 42300
acccccggct ggacggctac accatgctgt ccagcatcgg cagcatcatc tccggccggg 42360
tggcgtacac cctcggcctg cagggcccgg ccgtcaccgt ggacaccgcc tgctcctcct 42420
ccctggtggc catgcacctc gccgcccagg cgctccgcca gggcgaatgc gatctggcgc 42480
tcgcgggcgg cgtgaccgtc atggccaccc ccgacccgtt caccgggttc tcgcgccagc 42540
gcggactggc cccggacggg cggtgcaagc cgttcgccgc cgccgccgac ggcaccagcc 42600
tgagcgaggg agccgggctc gtcgtgctcg aacggctctc cgacgcccgc cgcaacggcc 42660
atcaggtgct cgccgtactg cgcggatccg cgatcaacca ggacggcgcg agcaacgggc 42720
tgaccgcccc caacggcccc tcccagcaac gcgtcatcgg ccaggcactc gccaacgccg 42780
gcctcggccc cgccgacatc gacgccgtcg aggcccatgg caccggcacg acgctgggcg 42840
accccatcga ggcccaggcc ctgctggcca cctacggcca acaccgcgcc gacgaccggc 42900
cgctgtggct cggctcggtc aagtccaaca tcggccacac ccaggccgcc agcggggtcg 42960
tcggcgtgat caagatgatc atggcgatgc gccacggccg gctgcccgcc agcctccata 43020
tcgacgaacc cagcccgcat atcgactgga ccagcggaaa cgtccagctc ctcaccgaag 43080
ccatcgactg gcccgaggcc gaccggcccc gccgcgccgg tgtgtcctcc ttcggcgcct 43140
ccggtaccaa cgctcatgtg atcctggaag aggcgccccc accgcccgat cccgcgcccg 43200
aaccggccgc cgcgcccgcc atcgcgggcg gcgtggtgcc ctggccgctg tcggcgcggg 43260
acgagcaggc gctgcgcgag caggcctcgg cgctcgccga gcacctgggg accgacgacc 43320
gtgcgtctgt cgcggacgtg ggatggtcct tggccacgac acgggcgatg ttcgagcggc 43380
gggccgtgat cgtcggggag ggccgcgagg agatggccgc cgccctggag gcgctggccg 43440
acgggtcccc ccaccccggc ctgtccaccc tcggtggcac cgcttcggac acaccaggga 43500
agacggtgtg gctgttctcc ggacagggca gccagcgccc cggcatgggc gccgacctct 43560
accggcgttt ccccgtcttc gccgaggcct tcgacggggt ccgcgcgctc ctggaccccc 43620
acctcgatca cccgctcgcc gacgtggtct tcgccaccga ccccggacac ggcgacctga 43680
tccaccacac cacctacacc caggccgggc tcttcgccct ccacatcgcc ctcgcccgcc 43740
tgctgggcga catgggcctg gcgcccgacg cggtcgccgg acactccatc ggtgagatca 43800
gcgccgccca cctcgccggt gtgctctccc tggaggacgc cgcccaactc gtcgccgccc 43860
gcgccaccct catgggcggc ctcccgtccg gcggcgccat ggccaccgtg aacgccgacg 43920
aacaggagat caccgccacc ctggccgact atcccgacct ggccatcgcc gcgatgaaca 43980
cccccgcaca caccgtcgtc tccggccccg ccgaccaggt cgccgccctc accgccgcct 44040
ggcgggagcg aggccgcaag acccgcgccc tcccggtctc ccacgccttc cactcgcccc 44100
agatggagcc catcctcgtc cccttcaccg aggccatcgg ccacctggcc ttccacccgc 44160
cccgcatccc gctcatcagc aacctcaccg gcgagcccgc gggcgaggac atcgccacac 44220
ccgattactg ggcccgccac atccgcaggc ccgtccactt ccaccagagc atcacccacc 44280
tcgccgagga cacggccgtc ttcctggagc tgggccccgc tccggtactg acccacgccg 44340
tccaccacac cctccccgag gagaccacag ccaccgcgct cgccaccctc accggtaagc 44400
aacccgacgt gcccgccctg gcccacagcc tggcggcgct gcacaccagt cacgccccgg 44460
tggactggac gccctggttc cgcaccgatc cggcccccag gaccgtgggc ctgcccacct 44520
accgcttcca gcgccggccc tactggatcg ccccccgcgt ctccggcggc gccacccccg 44580
gcggtaccgg cctggaccac ccgctgctgg acaccgcggc ggctctggcg gacggcggca 44640
tggtgctcac cgggagcgtc cctccggccg accacgacag ctggctcacc gagcgcgcca 44700
tcgcgggaac cgtcgtgctg cccggcaccg ccctcctcga actcgccctc cgctgcgccg 44760
aggacacccg cagcccccac gtcgaggagc ttctcctcca ccaccccctc accctccacc 44820
ccaccgcaca tctcgatctg caggtcgtca tcggcgccgc cgatgacgac gcccgtcgca 44880
ccctccacct gtacacccgc ccccagtccg actcctcggc cgagtggacc cggcacgcca 44940
ccgccaccct gaccggcgaa cccactgacg accggccgcc ggcagaaggg gaagcggcct 45000
ggccgcctgc cggcgccgag cccgtcgacc tcaccggctt ctatgaccgc gccgcgtcga 45060
acggctatgc gtacgggccc tccctccgag ggcttcaggc gctgtggcgc cacggcgaag 45120
acctcctcgc cgacatcgcc ctgcccatgg ccgacgactc caccgacacc ctcgtcctcc 45180
accccgcact cctggacagc gcgctccacc ccctcctcgc cgtcatggac acctcgggcg 45240
accaggtctg gctgccgttc tcctggagcg gcgtcaccct gcacgccacc ggcgccaccc 45300
acgcccgtgt ccgcgtcacc ccccacgacg accacgagca ccgcatcgct ctcaccgaca 45360
ccgccggccg accgatcctc accgccaacg ccgtcgccgt ccggccgacc cgcctcgagg 45420
cgcctcaaca gccgctgtcc gaagggctgt tcagcctgga gtggacgccc gtgtcgacgc 45480
tggccgaccg gtccgacgcg gccgcgccga cgccgggggt ggtcctcgcc aaggcgccgg 45540
tcgcggaggg ggaaggcggt gagctggagg ccgtccagcg ggcgttgacc ctggtacagg 45600
actggctggc cgagccgcgg cccgacgatg cccggctggt ggtgatgacc cgcgacgccg 45660
tcgccgtgga cggggaagcg cacatcgatc ccgtagcggc ggccgtctgg ggcctgatcc 45720
gcagtgctca gacggagaac cccggccggt tcgtcctgct ggacagggac ctggacacag 45780
agctggacac ggatccggtc ctgggtcccg atgcccttgc cgaggcggac ggccgggtgg 45840
ccgaagcggt ccgctgcgcg ctggacctcg acgagtcgca ggtggcgctc cgcggcggcc 45900
gggttctggt accccggctc gtccgcgcga ccgcctcggc cacgctgcct ggccccgtcg 45960
accggcggaa ctggcggctg gaggccgcga ccccggccgg cgccgcgtcg ctcgacgccg 46020
tggcgccggt gccgttcccc gaggcggagg aggagccggc ggccgggcgc gtacgcatcg 46080
aggtgcgcgc ggccggcgtc accttccgcg atgtcctgat cgccaccggc ggggtgcccg 46140
acgagacccg cctcggcggc gaaggcgcgg gcgtcgtcct cgaggtgggc cccgacgtca 46200
ccgacgtggc accgggcgac cgtgtcatgg gcgtcttcga cggcgcgttc ggccgggtcg 46260
ccgacgccga cgcgcgcatg gtgacccgga tgccgaggac ctgggacttc acccgggcgg 46320
cgggcgtccc cgtggcgttt ctcaccgcct ggtacggcct ggtggagctc gcggacctca 46380
gggcgggcga gtcggtcctc atccacgcgg ccaccggcgg cgtcggcacg gcggcggtgc 46440
agatcgcccg ccacctgggc gccgatgcct acgccaccgc cgaccctgcc gagcaccatg 46500
tactggaggc gatgggcatc gacgaggccc accgcgcatc ctcccgcgac ctggacttcg 46560
agaacgcctt ccgcgccgcc accggcggcc gcggcgtcga cgtggtcctc aacagcctca 46620
ccggcgacca catcgacgac cgcaccgacg cgtccctgcg cctgctggcc gagggcggcc 46680
gcttccttga tcccggccgg gccgacgcac gcgatcccga acagctggcc aaggacttcc 46740
ctgccgtgga ctaccgcgtc tacgacctgg tcccggacgc cgggccggag cgcgtccagc 46800
cgatgctggc ggcgctggtg gcgctgttcg acgagggtgt cctggcaccg ctcccggtgc 46860
gggcgtggcc gctggccagg gcccgtcagg cgctgcgcca catgagccgg gccgagcaca 46920
ccggcaagct cgtgctgacc gtccctcccg ccctcgatcc cgacggcacc gttctgatca 46980
ccggtggcac cggcgtgctc gccggcctgg tcgccgaaca cctcgtcacc acccaccaca 47040
tcacccatct ccacctcctc agccgtcgcg gacccgacgc ccccggcgcc accgatctgg 47100
cgacccgcct cgccgaactg ggtgccaccg tccacatcac cgccgcggac gcctcggatc 47160
cgggggccct gcgccgggtc gtcgacgcca tcgatccgga ccatcccctc accggcgtcg 47220
tccacaccgc gggcatcgtc gaggacgccg tggtcacctc gcagaccccc gacaccctcc 47280
ggcgcgtctg gaccgccaag gcgacgagcg ccgccaacct ccaccaggcc accaagcacc 47340
tccccctggc catgttcacc ctgtactcct ccgtctccgg caccctcggc aaccccggac 47400
aggccaacta cgccgccgcc aacgcctact gcgacgccct cgccgcccag cggcagcacg 47460
ccggcctgcc ggccacctcc atcgcctggg gcctgtggtc caccgccagc gacatcaccg 47520
ggcaactcag ccaagccgat gtggcacgga tgggccgcgc cggcgtcagg gcgctggcca 47580
ccgaacacgc gctggcgctg ttcgacgcgg cacaccggca aggcgacccg caactcgtgg 47640
cgctcaatct cgacgtgccc gccctggccg cgcagccggt cgccatcctc ccggccgcac 47700
tgcgcggcct ggccacccgc tcgggcggga ccacccggcg cgccgccgcg gccgtgcagc 47760
gccccgacga ctggacgcgc cggctcgccg ggctcccgga ggccgaacag cggcagcagc 47820
tgctcacgct ggtacgcggc aacgccgcga ccgtgctcgg ccacgcggat tccgagcgcg 47880
tccgggagga ggccccgttc aaggatctcg gcttcgactc cctgaccggc gtcgagctgc 47940
gcaaccggct ctcggccgcg accgggctgc ggctgcccgc cgcgctcgtc ttcgacttcc 48000
cgtcggcgaa gtcgctggcc gattacctgc gcggccggct ggtggcggac ggggggtcgg 48060
cggcccaggc cggtgtcgac ccggtgctcg gcgagctggc gaggctggag tccacgctgt 48120
cggccctcga tctgcccgag gccgacgcgc gggcggtgac agatcggctt gagggtctgc 48180
tggcccaatg gaaggcggcg tccgcgccac cggccgagga caacgcggcc gaccggctca 48240
cgctcgccac cgccgacgag gtgctcgcgt tcatcgacaa cgaactcggc acctcgtgat 48300
ggctcatctg ctcgcttcga cgcgtatgaa ggtcgtgact acacatgccg gatgaagaga 48360
ggcttgtcga ctacctcaag cgtgtggcga cggacctgca cgacacgcgg cggcgcctgc 48420
gcgaggtcga ggagcggcac caggagccca tcgccatcac cgcgatgacc tgccggttcc 48480
ccggcggggt ggattcgccc gaggcgttgt gggacctggt ggcctcgggg ggcgacgtga 48540
tcgggccgtt ccccgcggac cgcggctggg atctggaggg cctgtaccac ccggaccccg 48600
accaccccgg cacgacgtac acgcgcgagg gcgggttcct gcgcgacgcc gacacgttcg 48660
actcgggttt cttcgagatc agcccccggg aggcgctggt catggacccg cagcagcgca 48720
agctgctcga ggtgacctgg gagttgttcg agcgcgccgg cctcgacgcg acgagcctgc 48780
ggggaagtcg gaccggcgtc ttcatcggcg cggcgaccat gggctcgggc acgccgagcg 48840
gccccgcccg caaggagtcc gaggggtatg tgggcgtcgc gcccagcatg ctgtcgggcc 48900
gcctgtcgta caccttcggc ctcgagggcc cctcgctgac ggtggagacg gcctgctccg 48960
cgtcgctggt ggcgatgcat caggggattc acgcgctgcg gcagggcgag tgcggcctgg 49020
cggtggtcgg cggggtgacc atcatgtcct cgcccgcggt gttcatcggc ttcgcccgcc 49080
agcgcggcct ggcgccgaac ggacgctgca agccgttcgc ggccggcgcc gacggcaccg 49140
gctggggcga gggggccggg ctcgtccttc tcgagcggct gtccgacgcc cgccgcaacg 49200
gccaccaggt gctcgccgtc atccggggct cggccgtcaa ccaggacggc gcgagcaatg 49260
gcttctccgc gcccaacggg ccttcgcagc agcgcgtcat ccgtcaggcg ctcctcaacg 49320
cccgcctgtc gtccgccgag gtggacgcgg tggaggcgca tggcacgggc acccggctcg 49380
gcgacccgat cgaggccgac gcgctgcacg cgacgtacgg gcagcggcgg ccggccgacc 49440
ggccgctgct gctgggctcg gtcaagtcga acatcggtca cccccaggcc gccgccggag 49500
tggcgggcgt gatcaagacg gtgatggcga tccggcacgg gctgttcccc gcgacgttgc 49560
acatcgacga gccgacgccg cacgtggact ggggctcggg ggccatacgt ctggtgaccg 49620
agccggtcga atggcccgag acggaccacc cgcgccgggc cggggtgtcc tcgttcggcg 49680
tctccggaac caacgcgcat gtgatcatcg agcaggcgcc cgacccggac acgagcgaga 49740
cggacgcggg cgagacggac gcgggcgaga cggcccggga cgcggagggg gcggcgccgg 49800
cgcggcaggc cgtggtggcg ggcggcgtgg tgccgtggat gctgtccgcg cgggacgagg 49860
cggcgctggc gcggcaggcg ctgcgtctcg ccgaggtggc ggagggcgac ccggccgccg 49920
acgtgacgga catgggctgg tcgctggcga cgacacgtgc ccggttcgag caccgggccg 49980
tggtcgttgg cacggaccgt gcgacgctgc tcgacggtct ggccaagctg gccgccgacg 50040
agcccgaccc cgcggtcgtc accgcgacgg cgggcccgat cggcgccggc ccggtgttcg 50100
tcttccccgg ccagggagcc cagtggccgg gcatggcccg tgaactcctc gactcctcac 50160
cggtgttcgc cgcccggatc gccgagtgcg agcgggctct ggggccgtac gtggactggt 50220
cgctgacgga ggtgctgcgc ggcacggatc cggcgacgga ccccggccgg gacgacgtca 50280
tccagcccgt gctgtgggcc atccacgtct cgctggcagc cgtctggcag agcttcggca 50340
tcaccccggc ggccgtcgtc gggcactccc agggcgaggt cgccgcggcg tgcgtcgccg 50400
gagccctcac cctcgacgac gccgccaagg tcatcgccct gcgcgtccag gcgcttcgcc 50460
cgctgatcgg gcacggtgcc atggcctccc tcagcctggg cgccgaggac accgcgcggc 50520
tcctcgccga gctcggcgcc gcggccgggg acgtcgccgt cgccgccgtc aacggcccgc 50580
acgccacggt cgtctcgggc tcccccgacc atctggacgc cgttctcgag acggcccggg 50640
agcgtggcgc acgcacccgc accatcgacg tcgagtacgc ctcccacggc ccccatgtgg 50700
accgcatccg cgacgacatc gtctccgccc tgcgtgatgt gacaccggtc gagtccgaga 50760
tcgcgttcta ctccaccgtc accgcggagc ggctgaacac cgctgaactc ggcaccgagt 50820
actggttcga caacctccgc cgaccggtcc gcttcgcgga tgccgtgggc cgtctgctgg 50880
ccgacgggta ccgggccttc atccagtgca acccgcaccc gatcctgtcc acgagcctgc 50940
aggacatctt cgaggagtcc ggcacccgcg ccgcctccct cgccaccctc cgccgcgacc 51000
acggcggcgc gcatcagctc gccctcgcgc tcgcccaggc ccacgccgcg ggggtcgagg 51060
tcgactggcg cccctggttc ccggccgacc gcacgccccg taccgtcgag ctgcccacgt 51120
accccttcca gggaaagcgt tactggatac ccgtcggcgg cagcggggcg ggggacgtgt 51180
ccgccgccgg tctgcgcgct gtcgaccatc cgctgctggc cgccgcggtg agcctgcccg 51240
acggcggcat ggtgctgacc ggacgcctct cggccacaac gggcgccggc tggctcgccg 51300
accacgtcgt cggcgacacc acgctgctgc ccggggcggc catggtcgag tgggcgttgc 51360
aggccgctca cgaggcgggc tgtgccgcgg tcgaggagct ggccctgcag accccgttcg 51420
tgctgcccgc ctcgggggcc ctgcgcgtac gcgtcgcggt ggggcccgcc gacgacgagg 51480
gcaggcggac ggtggacgtg tactcgcgcc ccgacgagct cgacaccgag acccccgacg 51540
gatgggtgtg ccacgccatg ggcgtgctgg cccccgaggc ccccgaggac cggaccgccc 51600
cgccggacgc cccggcggcc ccctggccac cgcggggggc ggaaccgctg gacgtcacgg 51660
acttctacga acgggccgcg gccggaggct acggctatgg ccccgcgttc cgcggcctca 51720
ccgcggcgtg gcgggacggc gccgatctgc tcgccgagat cgcgcttccc gaggcggccg 51780
gcgagggcgc cgaccggttc ggcatccatc cggcgctgtt ggacgccgcc acgcacccga 51840
cgatcctcgg cggcggccgg gaggacggct ctgacgccgg gcaggtgtgg ctgccgttcg 51900
cgtggagcgg tgtgtcgctg tgggccacgg gcgcccgccg ggttcgcgtg cgcatattcc 51960
cggaggacaa cgggcagcgg atctccctga ccgacgagac cggcgcgccg gtcctcgagg 52020
ccgcgtcggt cgccgcgcgc cccaccggcc tcgccgagct gcgtgccctc ggcgcgcgcg 52080
ccgccgaggg cctgttcgtc gtggactggg tgcccgcgcg aggcgggacg ggtgacgcac 52140
cgcccccgga tgacggcggt tgggccacgg tgggcggcgg tggcgtacgg ctcgccgggg 52200
tggcggacca cgccgacctg ggcgcgctgc tcgccgcggt ggacgacggg gccccggtgc 52260
cgacggtcgt cctccacccg gtgcccgcga cggccacgcc cgacgacggc ctcgccgccg 52320
tcgggggcgt actggccctg atacgggaat ggctggccga accgcggtgg ctggactccc 52380
ggttggtact ggtcacctcc gacgcggtgt cggccggtga cgacgagggt gcggtggacc 52440
cgggtggcgc ggccgtctgg ggtctggtcc gttccgtcca ggccgagcac ccgggccggt 52500
tcaccctgct cgacgtgggc ggcgacacgg atgccgacgc gggcgggggc gagagcctcg 52560
ccgaggcggt gcgccggtcc atcgacgcgg acgaaccgca ggtcgtcgtg cgcgcggccg 52620
gaacgctggt gccgcgtctg gtgcgcaccg ccccggcggc cgaggccgac accccggagc 52680
tgtccggtgg cacggtgctg gtgtcgggcg gaacgggtgt gctgggcgga gcggccgccg 52740
aacacctggt ccgcgcgcac ggcgtggagc gtgtgctgct cctctcccgt cgcggcccga 52800
acgcccccga agcggccgaa ctggtgcggc ggctcaccgc cctgggggcc caggtggacg 52860
tggcggcggt tgacgtggcc gaccgcgccg cgctggcgga gacgttgcgg acgatccccg 52920
acagccaccc gctgctcggc gtggtccact ccgccggggt gaccgacgac gccctggtgg 52980
agtcctggga cgccgaccgg ctcacccggg tgtgggagcc gaaagccaca ggcgcgtggc 53040
atctgcacac cctcacccgc gatctgcccc tgcggatgtt cgtggtgttc tcctccgccg 53100
ccggcgtcgt cggcaactca ggccaggccg gctacgccgc cgccaacgcc tgcaccgacg 53160
ccctgatcgc ccaccggcgc gccgccgggc tgcccgggac ctcggtcgcc tggacgctgt 53220
gggaacaggc ctccgccatg accgaacacc tcaccgaggc cgacctctcc cgcctcggca 53280
ccctcggcat gcgccccctg gcgacgtctc gtgccctcgg gctcctcgac gcggcgttgc 53340
acgtcaccca ccccgtggtg gtggccgccg acctcgacgc cacccgactg ggcccggaca 53400
gccccgccat gctgcgcgcc ctggccaggc ctgctcgccg ccgcgccatg gaacaccatg 53460
ccaccggccc tgcgctggcg ggccggctgg ccggtctcga cgccaccgcc cggcgcgatc 53520
tgctgctgca gacggtccgc cagatggtca cggtggtgct ggggcactcc tccgacgcgg 53580
cgatccgggc cgaggccgcg ttcaaggagc tgggcttcga ctcgctgacc gccgtcgaac 53640
tgcgcaaccg cctggccggt gccaccgggc tgcggctgcc cgcgaccctc gtcttcgact 53700
accccactcc cctggccctc gccgaccacc tcctggaacg cctgaccgcc accgcctccc 53760
ccgcgtcccc ccgggccgtt ccgtccaggg ccggggccgc cgacgagccg atcgcggtgg 53820
tgtcgatggc ctgccgcttc cccggcgggg tgaccacgcc agaggagctg tgggacctgg 53880
tcgcggcgga tcggcatgtg ctcggcccgt tccccaccaa ccgcggctgg gacctggcca 53940
acctgttcca ccccgacccc gaccaccccg gcaccaccta cgcctccgag ggcgcgttca 54000
tgtacgacgc cgacggcttc gacgccgcgt tcttcggcat caacccccgc gaggccctgg 54060
cgatggaccc gcagcagcgc gtcctgctgg aaacgtcctg ggaactgctg gaacgggccg 54120
gcatcgaccc ccacaccctc aaggacagtc tgaccggcgt ctacgccggc gtcatgtacc 54180
acgactacgg caacggcctg ccccccggcg acccccgact ggacggctac gccgggctct 54240
ccgggaccag cagcatcatc gccggccggg tggcgtacac cctcggcctc cagggccccg 54300
ccgtcaccgt cgacaccgcc tgctcctcct ccctggtcac catgcatctg gccgcccagg 54360
ccctgcggca gggcgagtgc gacctggcgc tggccggggg cgtgaccgtc ctggccaccc 54420
ccgacgtgtt caccgggttc tcccggcagc ggggtctggc gccggacggc cggtgcaagc 54480
cgttcgccgc cgcagccgac ggcaccggct tcggcgaggg cgtcggcctg gtgctgctgg 54540
agcggctgtc ggacgcgcac cgcaacgggc accgggtgct cgccgtcctg cgcggttcgg 54600
cggtcaacca ggacggcgcg agcaacggcc tgaccgcgcc caacgggccc gcccagcaac 54660
gcgtcatccg ccaggccctg gccggagcgg aactcgaccc ggcggatgtg gacgccgtgg 54720
aagcccatgg cacgggcacg acgctgggcg accccatcga ggcacaggcg ctcctggcca 54780
cctacggcca ggaccgtccg acggaccggc cgctgtggct cgggtccatc aagtccaaca 54840
tcggccacac ccaggccgcc gccggcgtcg ccggggtcat caagatgatc atggccatgg 54900
accacggccg gctgcccacc agcctccaca tcgacgaacc cagcccgcat atcgactgga 54960
ccaccggaaa cgtccgactc ctcaccgaac ccgccgactg gcccgcgacc gaccgccccc 55020
ggcgcgccgc cgtgtcctcc ttcggtgcct ccggcaccaa cgcgcacctc atcctcgaac 55080
aggccccgga ccgccccgga gacgggccgg cgggggaccg ccccgcgccc gtggccgtgg 55140
cctggccgct ctccgcccgc accgacgagg ccctgcggac cgtggccacg gccctcgccg 55200
accggctggg agccgacgac accacgccgg tcacggacgt ggggtggtcg ttggccacgg 55260
cccgggcgac attcgaacga cgagccgtga tcatcggaag cgaccgccag gagatgaccg 55320
ccgccctgga cgccctggcc cgggacctcc ctcacccgaa cctggtcgcc ccgctccccg 55380
tcgcgccccc ggcgggcgac acggtgtggc tgttctccgg gcaggggagc cagcggccgg 55440
gcatgggcgc cgaactccat gagcgtttcc ccgccttcgc cgacaccttc gacgagatct 55500
gcgcactgat cgacccccac ctcgaccacc ccctgcgcga catcgtcttc gccacccacc 55560
ccgaccacac cgacctcctg aaccacacca cctacaccca agccggactc ttcgccgtac 55620
aagtggcact cgcccggctc ctggaacact gcggcctacg acccgacacc gtgatcggcc 55680
actccatcgg cgagatcacc gccgcccaca tcgccggcgt cctctccctc caagacgcct 55740
gccacctcgt cgccaaccgc gccaccctcc tcggcaaact cccacccggc ggcgccatga 55800
ccgccatcga agccaccgcc gaagaaatca cccaaaccct caccccctac cacggccaag 55860
tcaccatcgc cgccctcaac gcccccacca gcaccgtcat ctcaggaccc gaagaactcg 55920
tggcccaact cacccgcaga tggaaagaac gcggccgcag aaccaaaaca ctcaccgtca 55980
gccacgcctt ccactccccc ctcatggaac ccgccctcaa cgacttccgc cacgccatcg 56040
accacctgac ctaccaccag cccaccatcc ccctcatcag caacctcacc ggcgaacccg 56100
caacccaaga catcgccacc cccgactact gggtgcgcca catccgccaa cccgtccact 56160
tccaccccgc catcacccac atcgcacccc acacagcggc cttcctcgaa atcggccccg 56220
acgccacact catccccgcc acacagaaca ccctcgacac actggagaac caacccacct 56280
ccgcacccca actgatcccc accctcaccc gcaaacaacc cgacacccaa gccctcgccc 56340
acgccctcgc ccgcctccac accctcaccc ccctcaactg gcacccctgg tacaccgacc 56400
aacccacccc caccaccatc gacctcccca cctacccctt ccaacacgag cgctactggc 56460
tcactcccac ccacgccggg cccaccaccc cgggcgccac cccgctcacc caccccttcc 56520
tcgccgccac cgcaccgctc gcggacggtg gcctcctgct gaccgggcag gtcccctccg 56580
ccgaccacgc gggctggcac accgagcaca ccatcgcggg cgccaccctc ctgcccgcca 56640
ccgccctcct cgagatcgcc ctccacgccg ccgaccacac caccaccccc cacatcgacg 56700
aactgatcct gcagcacccg ctcaccctcg accccagcca ccccctcgcg ctccaagcca 56760
tcgtcagccc tgctgatgac tccggtcacc gcgccctcca catctacaca cgcgcgccgt 56820
ccagccccac cgcggagtgg acccaccacg ccaccgccac cctcggcgga gagcccacgg 56880
ccgagcggcc cacgaccgaa gcggaggcgg cctggcctcc cccgggagcc aaagccgtcg 56940
acatcaccgg cttctacgac cgcgccgccg cggacggcta ccactacgga cccagttacc 57000
agggcctcca aaccgtctgg cgccagggcg aggacctcct cgccgacatc actctgccca 57060
cggccggcac gcccgaccac accaccgact ccctggccat ccatcccgcc ctcctcgacg 57120
ccgcgctcca cccgctcctc gccaccgccg acaacccgga tggcgagatc tggctcccct 57180
tcacctggag cggcgtcacc ctccacgcga ccggtgccac ccacgtccgc gcccgcatca 57240
ccccccaggg cgacaacgac taccgcctca ccctcaccga cgcgaccggc caaaccgtcc 57300
tcaccgccgg caccatcgcc tcccgccccc tcgacaccgc gcggctgcgg acgcgcgggc 57360
cgggtgacgg cctgtaccag gtgcggtgga cggcgatgcc gatcccggcc ggatcggcga 57420
ctgccgtggc ggacgactgg gcgatgctcg gggatgccgg gctcagggac ggcgggctcg 57480
ccgatgcggt cgcgccgctc gcgtcgtatc cggatgtcgc cgcactggtc gcggccatgg 57540
acgacggcac gcccgtgccg tcggtcgtcc tgaccggcct ggcacccgcg gacggcggtg 57600
acgccgatgt ggtggtggag gtgctgacga cggcgcggga atggctggcc gagccgcggc 57660
tcgccgagtc ccggctggtg gtcgtcaccc acgacgccgc cgtcgccgag gacaccgaca 57720
gcggcccgga cggcggcgat gtggatccgg tggccgcggg cgtgtggggg ctgatccgca 57780
gcgcccagtc ggagaacccc ggccggttca cgctgctcga cctcacccgg cgcgacgccg 57840
gtacggcacc ggatgtcgtg gaggtgctgc gcgcggccat ggacgcggac gagtggcagg 57900
tggccgtgcg cggcgggcgg gcgctggtac cccggctgac ggccgcggac gcggcggccg 57960
gcatcgtact gcccgtcggc gcgcctgcct ggcagctcgt catggcggac gagcgcgcgg 58020
gtacggtcga cggcttggcg cccgaggaat gcccggaggt gctggaaccg ctggcgcccg 58080
ggcaggtgcg catcgccgtc cgcgccgcgg gcgtcaactt ccgggacgtc atggtgaccc 58140
tcggcgtcgt gcccgaccgc cgcggcctgg gcggtgaggg cgcgggcatg gtgctcgacg 58200
tggccccgga cgtgacgtcg gtggccgtcg gcgaccgcgt catggggttg ttccagggct 58260
ccttcggccc catagccgtc gccgacgccc gcgccctggt gcctgtcccg ccgggctgga 58320
ccgaccggca ggcggcggcc gtgcccatcg cgttcctcac cgcctggtac gggctgatcg 58380
acctcgccgg cctcaaggcg ggcgagtcgg tcctcatcca cgccgccacc ggcggcgtcg 58440
gcacggccgc cgtgcagatc gcccgccacc tgggcgcggt gatctacgcc accgccagcc 58500
ccggcaagca cccgatgctg gaggcgatgg gcgtcgacga gacccaccgc gcctcctccc 58560
gcgacctgga cttcgagcac atcttccgcg ccgccaccgg gtccgagggc atggacgtgg 58620
tgctggactg cctggcgggc gagttcgtgg acgcgtcact gcggctgctg ggtcagggcg 58680
gccggttcat cgagatgggc aagaccgaca tccgtgaccc cgagcagatt gccgacacgc 58740
accccggcgt ccactaccgt tcgtacgacc tcgtctccga cgcgggtctc gaccgccttt 58800
cggagatgct gggtacgctc gccgacctct tcgcccaggg cgtgctcaca ccgcccccgg 58860
tccaggcatg gccgctggcc agggcccgcc aggcgctgcg ccacatgagc caggccaaac 58920
acaccggcaa gctcgtcctc gacatccctc ccgccctcga tcccgacggc accgtcctga 58980
tcaccggcgg caccggcacc ctcggcgccc tcatcgccga acacctcgtc accaaccacc 59040
acatcaccca cctgcacctc ctcagccgcc gcggacccga cgcccccggc gccgccgaac 59100
tgacggccca cctcaccgaa ctcggcgcca ccgtccacat caccgccacc gacaccaccg 59160
acccccacgc cctgcgccag gccctcgaca ccgtcgaccc ccgccacccc ctcaccgccg 59220
tcatccacac cgccggcatc gtcgacgacg ccgtgatcac cgcccagacc gccgacagcc 59280
tccaccgcgt ctgggccgcc aaggcgacga gcgccgccaa cctccaccag gccaccgagc 59340
acctccccct ggccatgttc gtgatcttct cctccgccgc cggtacgttc ggcagccccg 59400
ggcaggccaa ctacgccgcc gccaacgcct actgcgacgc gctcgccacc cggcgccggc 59460
acgccggcct gccggccacc tccatcgcct ggggcctgtg ggcggccacc agcggcatga 59520
ccggcggact caccgagatc gaccacgcga ggatgagccg gtcgggcatg gcaccgctgc 59580
cctcggagca cgccctggcg ctgttcgacg cggcgcacgg gctcggtgcg gcacgggtgc 59640
tcgccgcccg gctcgacctg gcgaggctgt ccgcccagcc gaccgaagcg ctgccgccgt 59700
tggtccgctc gctcacgggc accggccccc ggaccgcgcg gcgcagcgcg gcggctcccg 59760
tggccgacct gtccggccgg ctggcgtcga tggcccccgc cggacagctc gcgctcctgc 59820
tggatctggt ccggacccac gcggccaccg tgctggggca catggattcc ggcacggtgt 59880
cggcggacac ccccttcaag gatctcgggt tcgactcgct gacggcggtc gagctgcgca 59940
accggctcac cacggtgacg gggctgcggt tgtccgcggc gtcggtgttc cgctacccga 60000
cagccaccgc catggccgag cacctgcggg gcgagctgtg cccgacgggg gatgacacgg 60060
cgcagcccgt gctgcgggag ctggcacggc tcgaggcggc ggtgggcgag tcgaagccgg 60120
agggggagac cagtgcccag ctcgtcaagc ggttgcagac cctcttgtgg cggctcggtg 60180
acgaggccgc cgcggtcgat cacaccgtcg acggcgagga gttggagtcc gcctcggacg 60240
acgagatgtt cgcgctcatc gaccagcaac tgggctcgtc ctgaccgtca gcgagggaga 60300
agtcgtgtca accactgaag agaaactgcg ccagtacctc aagcgcgtca ccctcgacct 60360
cggccaggcc aagcaacgcc tgcgggaagc ggaagaacgc caccaggaac ccatcgccat 60420
caccgccatg gcctgccgct accccggcgg cgtcaggtca ccggaagccc tgtgggacct 60480
cgtcgccacc cgcaccgacg ccatcggccc tttcccgacc aaccgcggct gggacctgga 60540
gggcctgttt cacccggacc ccgaccacta cggcaccagc tacgtccggg aaggcggctt 60600
cctccacgac gccgagcggt tcgacgcctc cttcttcaac atcagccccc gcgaagccct 60660
cgccatggac ccccagcagc gagtgctcct ggagaccgcc tgggaactgc tggaacgcgc 60720
ccacatcgac ccccacagcc tcaaaggcac cctcaccggc gtctacacgg gggtgtcgag 60780
ccaggactat ctgtcgcgga taccgcggat tcccgagggt ttcgagggct atacggccac 60840
gggcggactc atgagcgtgg tgtccggccg cgtggcgtac acgctgggcc tggagggacc 60900
ggcggtcacg ctggacaccg cgtgctccgc gtcgctggtg gcgatgcacc tcgccgggca 60960
ggcgctgcgg cagggcgagt gcgacctggc cctggccggt ggtgtgaccg tgttcagcac 61020
gcccaccgcg tatgtggagt tctcccggca gcgggggttc gcaccggacg cccgctgcaa 61080
gccgttcgcc gccgcagccg acggcaccgg cttctccgag ggcgtgggcc tggtgctgct 61140
ggagcggctg tcggacgccc agcgccacgg gcgtcgtgtc ctcgccgtgc tgcgtggctc 61200
ggcggtcaac caggacggcg cgagcaatgg gctgtccgcg cccaacgacg ccgcgcagga 61260
gcgcgtgatc cggcaggcgc tggacagcgc ccggctcacc gccgaccagg tggacgcggt 61320
cgaggcgcac ggcaccggaa ccacgctcgg tgaccccatc gaggcgcagg cgctccttgc 61380
cacctacggc aaggagcgtt cggcggaccg gccgctgtgg ctggggtcgg tgaaatcgaa 61440
catcggccac acccacgcgg cggcgggcgt ggcgggcgtg atcaagatgg tgatggccat 61500
gcatcacggc cggctgcccg ccaccctgca cgtcgacgag ccgacctccc atgtggactg 61560
ggacacgggc acggtgcgac tgctgaccga gccggtcgac tggccgcggg gggaccgtcc 61620
gcgccgggca ggcgtgtcct cgttcggcat ctccggtacc aacgcccatg tgattctgga 61680
ggaggccgcc ctgccgcccg ccgccacggg cgccgagcgg ccgggagacc ggctcacgcc 61740
gtgggtggtg tcggcgcgcg gccaggccgc gctgcacgac caggcccggc ggctgctcga 61800
cgcgaccgtt gacggcgatc ccgaggcagt gggctggtcg ctggtcgcct cgcgtgcggt 61860
gttcgaccag cgggcggtga tcacgggccg ggacaccgaa acgctgcggg cgggtctggc 61920
cgcactggcg gccggggagg accatccggc gctggtgcga cgtgaggcgg gggtaccggc 61980
ctcggggtcg caggtgtggc tgttctccgg gcaggggagc cagcggccgg gcatgggcgc 62040
cgaactccat gagcgtttcc ccgccttcgc cgacaccttc gacgagatct gcgcactgat 62100
cgacccccac ctcgaccacc ccctgcgcga catcgtcttc gccacccacc ccgaccacac 62160
cgacctcctg aaccacacca cctacaccca agccggactc ttcgccgtac aagtggcact 62220
cgcccggctc ctggaacact gcggcctacg acccgacacc gtgatcggcc actccatcgg 62280
cgagatcacc gccgcccaca tcgccggcgt cctctccctc caagacgcct gccacctcgt 62340
cgccaaccgc gccaccctcc tcggcaaact cccacccggc ggcgccatga ccgccatcga 62400
agccaccgcc gaagaaatca cccaaaccct caccccctac cacggccaag tcaccatcgc 62460
cgccctcaac gcccccacca gcaccgtcat ctcaggaccc gaagaactcg tggcccaact 62520
cacccgcaga tggaaagaac gcggccgcag aaccaaaaca ctcaccgtca gccacgcctt 62580
ccactccccc ctcatggaac ccgccctcaa cgacttccgc cacgccatcg accacctgac 62640
ctaccaccag cccaccatcc ccctcatcag caacctcacc ggcgaacccg caacccaaga 62700
catcgccacc cccgactact gggtgcgcca catccgccaa cccgtccact tccaccccgc 62760
catcacccac atcgcacccc acacagccgt cttcctcgaa atcggccccg acgccacact 62820
catccccgcc acccaaaaca ccctcgacac cctcgacaaa caacccgcac acccacccca 62880
actgatcccc accctcaccc gcaaacaacc cgacacccaa gccctcgccc acgccctcgc 62940
ccgcctccac accctcaccc ccctcaactg gcacccctgg tacaccgacc aacccacccc 63000
caccaccatc gacctcccca cctacccctt ccaacgcgag cggtactggc tgcccgatgc 63060
cctcgcggac gccccgccgc cggaggcgga cgaggagcag gtccggttct ggaacgcggt 63120
ggaggcgcag gatctcccgg ctctgtccga cacgctgggc atcggcgagg aggacgggcg 63180
gcgctcgtcg ctcggcgcgg tgctgccgac gctgtcgcgc tggcaccagg aacgccatga 63240
gcgggcgacc gtgagctcct ggcgctatcg ggtgggctgg cgccacctgc cggacctcgg 63300
cccggcggcc gtggcggggc cgtggctgct ggtcgtgccg ccaaagggcg ccgacgcgtg 63360
ggccgacgcc tgtgagcggg ccctcactgc ggacggcggc gaggtgcggc ggctggtgac 63420
ggacgggcgg gccgatgtgg ccgagctcgc cgcgtcgctg cgggcgctgt acgccgaggg 63480
cccgtccccg gcgggggtgc tgtccctgct gcccctcgac gagcgtccgc acgaggcgtt 63540
ccccgccgtg accggcggtg tgacgggcac ccacgtcctg ctgcgggccc tgctggacgc 63600
cgagctcgac gcgccgctgt ggtgcgccac gcggggcgcg gtggcggtcg atgacgatga 63660
agcacctgag gccccggcgc aggctcaggt gtgggggctg ggccgggtgg ccgcgctgga 63720
gcatccgacg gcgtggggcg ggctggtcga cctgccggcc tcggtcgcgg acctcgctcc 63780
cgatctgctg tgcgccgtgc tggcggggcg gaacggggag gaccaggtgg cgctgcggcc 63840
ggccggggcc ttcgggcggc ggctgctgcc cgctccgctg gatgcccagg ccccggccca 63900
ggagcgggcg tggacgcccc gggatggcgt cctcgtcacc ggcggggtcg ccggagccgc 63960
ggccctcgtg gcgcggtggc tggccgcgga cggcaccaag cacatcgtgc tgctggcgcc 64020
ggacggcccg gccgcgcccg gtggtgcgga actggtggcc gagctggcgg agctgggcgc 64080
ggaggcgacg gtggtggacg gcgtgccgtc cgaaccgacg acccgccagg agctggccga 64140
ccggctcgcg gcctcgggtc tgcgggtccg cacggtcgtg cacgccgggg cgccgggcga 64200
ctgggcgccg ctggcagagc tcaccccaga cgagctggcc gaggcgctgt ccgacgccat 64260
ggggggagcc gatcggctcg cggagctgtg cggcctggag cccgacgacc cggtggtggt 64320
cttctcgtcc atcgccgcgg tctggggcgg tggcgggcac ggtgcccggg cggcggccga 64380
cgcgtatctc gacgcgtggg cgcggcggcg ccaggcggcg ggcggccatg tcgcccggct 64440
ggcgtggggc gtgtgggacg gctcggaaga cccggaggcg gccgaacgcg ccgagcgtca 64500
ggggcttttg gccctccacc cgacgcccgc gctggccgcg ctgcggcgga cgctcgatca 64560
cagcggtgac ggaaccgacc aaggcaccgt ccgggccggt gatgagggcg gcgaccggag 64620
tgacgtccac gcggtgatcg cggatgtgca ctgggaccgt ttcgtgcccc tgttcaccat 64680
ggcgcgtgcc agccgcctct tcgacgagat cccggcggcc cggcgggcgt ggcaggcggc 64740
actggactcc tcggacgacg agagttccga gagcctgacg gccctgaggg accggctggc 64800
ggcccagtcg ccccaggcgc gcaccggcac gctgctggcc ctcgtccgcg cccatgtggc 64860
cggggcgctg cgctacccgg cggcggagtc ggtcgatccg gagcagccct tcaaggagct 64920
gggtttcgac tccctggcgg ccgtcgagtt ccggaaccgg ctacggggcg ccatcggcct 64980
gacgctgccc gccaccctgg tcttcgacta tccgacgccg accgcgctgg ccgggtatct 65040
ggtctcgcag gtcctgccgg ccgagccggc ggacgaaccc gccgccgcgc atctcgacga 65100
gatcgaggcg acgctcgccg ccctggacgc ggacgacccg cgccgcgccg ggctgacgca 65160
ccgactgcgg ttgctgctgt ggcggtacgc cgacggggac gatgcgctcg aaccgcgcga 65220
ggagacgggc ggggacgacc tggaaacggc gtccgcggac gagatgttcg ccctcatcga 65280
ccgcgagttc ggggagtcct gagcacggcc gagcccctga ccccggcggg gtcaggggct 65340
cgtggcgatc gtgagccgga gcgatcagcg agaacggagc tcctccgtca gccgttccag 65400
ttcggggacc acctcgttgg gggacggctg agccagcacc tcctgacgga gccgctcggc 65460
acccgcccgg taggaggggt cttgcaccag cttcgccagg tgattccgga ttcgctttcc 65520
tgtcatgcgc tccggcggga tccacccacc cgcctgggcc tcttccagcc gtgcgcctcg 65580
caccgaggcg tcgggggcca tacggctgac catgagctgt gggacgccgt gcgccagcga 65640
gctgcagaag gcgggaagtc cgccgtggtg aatcaccgcc gcacaggacg gaatgacgag 65700
gtgcagcggg acgaaatcga caagcctggt gttctggggg acgtgcttca gtttttcctg 65760
gattctcccc ggcagggtca tcaccagttc catctcgaga tcggccatcg agtcgagcca 65820
gtcctgcagc tgctcaatgg agaccatctg gtagctggag acgggctgcg agctggacat 65880
gtcccacatg ctcatcccga atgtggcgag gacacgcggc gccggcggct cagggcggat 65940
ccagtccggc gcgacggcgc ggccgttgta cggcacatgg cggacggaga cctgcggcac 66000
attggctttg agcctcatgg agtccggcac ctggtcgatc gtgaactggc cggtcaccag 66060
ctcttcgcgg aactcgaaac cgtgtttctc cccccacgcg ccgagccatt cggccatggg 66120
gtcctcgcgg tccgccggct cctgctgcgc cagcagcttc aggaagtggc ggcgcatacg 66180
ggcctccacc tcgataccga tcggcatccg cgcgtgtgcc gcccccaccg cggcggcggc 66240
gaccgccccc gcgtggctca gccactccca gaccaccagg tcgggcttcc accagcggca 66300
gtacgcgacc agatcgtcga tcatcgaatc gttcacggcc ttggccactt cggtggaagt 66360
ggcgcagagc gacttcaggt agctcagcgg aaacatctct tcgcgattct cgcccagatc 66420
gatgggatgg tgaattccac cccatgtcgc ttcggatgct tccgtctgcg cttggtgcgc 66480
cttctccagg aagtgctcct ccgatccgac cgggaccgcc gtcagccctg actggacgat 66540
gacgtcggtg aggtcggggc cgctggcgac atgaacctcg tggcccgcgg tgcgcagggc 66600
ccaggccatc ggcacacaga actggaagtg ggtccgccaa ggaatggtga cgaacaggac 66660
acgcatggct ggcatctctc ctctctggtc accggcaatg gcaaccagtt cgctctctcg 66720
cccgtgcacc ttcgtcattg accgcactca cgtttaccct aacgacgcct ctaaagaggg 66780
tcttgcacgt gctcgaccgg cgttagtccg cctttagccg tgcccgagag actgcgtacc 66840
gccgggatgc gaccaacgcc gaacgtatga ggagagcggt atggctcagg gttttcaggc 66900
cagcctgcag tgggaacgca tcaatgaact ctgggttacc gaggaggcgt cggctgatct 66960
gaccggcttc aagtcggacc gccggaattt caatatcgcc ctgtgggatc cgacgaccaa 67020
cggaatcagg tatctgaggg ctctcgtcta cgagttggcc acgcggctga gcgacgacga 67080
ctggtcgaag atcgagaaag tccgcaaccg tgacgtggga gatccggtca ccgtccgata 67140
cgagggcagg accgtctgtc tggactatct ccaggcggcg ctcgagctcg gcttcatcga 67200
gaaggaactg gacctgggcg gcgcccgcgt cctggagatc ggcgccgggt acggccgtac 67260
gtgccatgcg atgctctcca actacgacct ggcctcctac accatcgtcg acctgaagaa 67320
caccctcggg ctgagcaggg cgtatcttcg cgaggtcctg gacgagaagc agttctccaa 67380
gatgaggttc gtccaggtgg aggacatcga cacggggctg ggcccggacg gcttcgacct 67440
gtgcgtcaac gtgcactcct tcacggagat gaccccggac accgtcaagg cgtatctgcg 67500
cctgatcgac gagaggtgcg gggcgttctt cgtgaagaac ccggtcggaa agttccggga 67560
caagagcatg gacggccacc agaagggcga ggaggccgtg cggctggcca tgcagaccgg 67620
gccgctgcgg caggtgctcg acatccatga cagccaggcc gtggcggccg ccgtgccggc 67680
cttcatcgag gcctatcagc ccggcgaggg atggacgtgc gccgccaaca cccgcggcat 67740
gccctggagt tacttctggc aggccctcta cacgaagacc ggcgacgacc ttcgatagcc 67800
cccgtcttcc caggaggaaa ccatggccgc ctctccggtg cccccgcccc gcggtgacga 67860
agcgctggcc ggcacgccgg tcctggtcct gggcggctcc ggctacctgg ggcggcacat 67920
atgctcggcg ttcggtgccg cgggcgccca ggtggtgccg gtctcgcgcg gcgcgcgcgg 67980
cggcgtggac ggtgacggct gccgctcggt gcgcctggat ctgaccgcgg ccgggcccga 68040
tgagctggcc cggctgtgcg ccggtacagg ggcgcgggtg ctggtgaacg cctcgggcgc 68100
ggtctggggc ggtggcgaac ggcagatggc cgaggccaac accgagctgg tcggccggct 68160
ggccggggcc gtcgcccgat tgccgggccg gccgcggctg attcacctgg gcagcgccta 68220
cgagtacggc ccggcccgcc cggggaccgc gatcgcggag gactggccgc ccgccccgac 68280
caccgtctac ggccgcacca agctgagcgg ctcgcaggcc gtccttcggg ccgccgccga 68340
gctcggtgtc gccggaaccg tgctgcgggt ctccgtcgcc tgcggcccgg gcgccccggt 68400
gagcagcctc gccggggcgg tggccgcgca cttggcggcc ggccgcgacg agttgcggct 68460
cgcgcccctg cgcgaccacc gcgacctggt ggacgtccgg gacgtggccg acgcggtggt 68520
cgcggcggcc gtggcaccgg tcgccgccgt caccggcaca gtcgtcaaca tcggcagcgg 68580
ccaggcggtg cccgtgcgcc ggctggtcga tctgatgatc gccctcagcg ggcgcccggt 68640
gcgtgtcatc gaggaccccg cgctgcgccg gacgcgttcc gacgcggcct ggcagcgact 68700
ggacatcgga cgtgcgcggc gcctgctggg ctgggcgccg cgccgaaccc tgcgggagtc 68760
actgcgcgat ctgctcgccg ccgtcggcgc gccgcagccc gcggcggtac gagcggcgac 68820
agcgatcgga ccccggaaca gtcatgggaa ggacagcaga tgagcgaacg ggtcgcgcgc 68880
atcctcgacg aggtgcgcaa gtaccaccag gacagccagg aggggcgtgg cttcatcccc 68940
ggggtcaccg agatctggcc ctcgggcgcg gtgctcgacg aggacgaccg ggtcgcgctg 69000
gtgcaggccg cgctggagat gcggatcgcc gcgggcaagc tctcccggaa gttcgagtcg 69060
gccttcgcgc gccggatgaa gcgccgcaag gcacacctga cgaactccgg gtcgtccgcg 69120
aacctcctcg cgatctccgc gctgacctcc cacctgctgg gcgagcgccg gctgcgcccg 69180
ggcgacgagg tcatcacggt ggcggcgagc ttccccacca ccgtcaaccc gatcctgcag 69240
aacgggcttg tcccggtcta tgtggacgtc gagttgggga cgtacaacgc cacggccgag 69300
cgggtggcgg aggcgatcgg gccgcgcacc cgggccatca tgatggcgca cacgctgggc 69360
aaccccttcc aggccacgga gatggcccgg ctggcgcagg accacgacct gattttgatc 69420
gaggacagct gcgacgccgt gggctccacc tatgacgggc ggccggccgg gacgttcggc 69480
gatctgacga ccgtcagctt ctaccccgcg caccatctga ccatgggtga gggcggctgc 69540
gtgctgacct cgaacctggt cctggcgcgg atcgtggagt cgctgcggga ctggggccgg 69600
gactgctggt gcgagccggg cgagagcgac acatgccgca agcggttcgg gtaccagatg 69660
ggcacgctgc cggccggcta cgaccacaag tacatcttct cccacatcgg gtacaacctg 69720
aagtcgaccg acctccaggc ggcgctcggg ctgacccagc tcgacaagct cgacgcgttc 69780
tgctcggccc gtcggagcaa ctggcggcgg ctgcgcgaag ggctggacgg gctgccgtgg 69840
ctgatcctgc cggaggcgac gccgcgctcc gatccgagct ggttcgggtt cgtcctgacc 69900
gtcgatccgc gggccccgtt cagccgcgcc gagctggtcg acttcctgga gtcccgcaag 69960
atcggcacac gtcggctgtt cgccgggaac ctgacgcgcc atcccgcgca tgccgaagcg 70020
ccacaccggg tgtgcggcga tctggccaac agcgacaccg tcaccgagca cacgttctgg 70080
gtcggcgtct atccagggct gaccgaggag atgatcgact tcatggtctc ctccatcacc 70140
gagttcatcg ggagccaccg gtgagcgcgg agcgtttgag cggctcccac caggggcggt 70200
tgtcccggta ccaggcgacc gtctcggcca gcccggccgc gaagtcgtgg cgcgggcggt 70260
agccgagttc gtccctggcc ttgtcgtcct ggaccgcgta gcgccggtcg tgccccttgc 70320
ggtcctcgac gtaccgtacg cggctccagc cggcgccgca ggcgtccaac agccgaccgg 70380
tcagctcccg gttggtcaac gcggtgccgc cgccgatgtt gtagatctcg ccgggcgatc 70440
ccttggtgcg caccagttcc acgccgcggc agtggtcctc cacgtgcagc cagtcgcgca 70500
cgttcagccc gtcgccgtac agcggcacgt cctcgccgtc gagcaggttg gtgatgaaca 70560
gcgggatgat cttttccggg aactggtggg gcccgtagtt gttgctgcac cgggtgacgc 70620
gtacgtccag cccatgggtg cggtggtagg cgagggcgat caggtcggag gcggccttgg 70680
aggcggcgta cggcgagttg gggtccagta cggcggtctc ggggctgaat ccggtctcga 70740
ccgagccgta gacctcgtcg gtggagacat ggacgaaacg ctccaccccg tggcgcaggg 70800
ccgcctgcag cagcgtgtgg gtgccctcga cattggtgcg cacgaacggg tcggcgccgg 70860
tgatggagcg gtcgacgtgg gattcggcgg cgaagtgcac cacttggttg gcccgcgcca 70920
tcagcgtgtc gaccaggtcg gcgtcgcaga tgtcgccgtg cacgaacgtc agccgggggt 70980
cggcggtgtc gaggttgctc agtgtccccg cgtaggtgag cttgtccagc acggtgacat 71040
gtgagacgtc ggggccccac tccgcgccgc cccgtgcgag gagcctgcgt gcataggtgg 71100
acccgatgaa ccccgcggca cctgtgacca ggagcttcat atctcctcca gaaagcgcgc 71160
gggcggcagg ccccgccggt acgagcgcgc ccggtccatg acgtactggc cgtagggtga 71220
gttgctcatc tcggtgccga gggtgtagca ctcgtcggct ccgatgaatc ccatgtacat 71280
ggcgatctcc tcgacgcagc cgagccgcgt gccttggtag tgctgaacgt cgcgcaccat 71340
acggcccgcg tccaggaggg tctcgtgggt gccggtgtcc agccaggtga cacccctgcc 71400
gagccacacc agatcggccg tgccctcctc gagataggcg cgcaggacgt ccgtgatctc 71460
cagttcgccc cgggccgagg gcaccaggcg cttggccacg tcgacgacgc cgccgtcgaa 71520
gaggtacagg ccgggaatgg ccaggttcga ccggggccgc tcgggcttct cctcgatgga 71580
cagcagcctg ccgcgctcgt cgatttcggc gacgccgaag tgccggggat cggcgacctc 71640
gtgcccgaac agcacgcatc cgcgcagccg ctgcacactt ctacggagca gcgcgggcag 71700
attcgcgccg tggaagaggt tgtcgcccag gatcagcgcg cactcctctc cgcgaatatg 71760
atccgcgccg atccggaagg cgtcggcgat tccccgcggt ttctcctggg aggcataaga 71820
gaggttcagc cccaggcgcc ggccgtctcc gaaaagctgg cggaattgag gcagctcggt 71880
ggggcgactg atgatcagga tatcccggat gcccgcgaac atgagaacgg acagcggata 71940
gtagatcatg ggtttgtcgt aaacgggaac caactgtttc gaccccgcca aggtgagggg 72000
ctgcagccgt gttccgttcc cgcccgcgag aatgatgccc ttcatcatgt caggacttcc 72060
cgaactcggt ggcgatcaaa tcgaatatgt cctccgccgt cgccgtttcc aggtcgctgt 72120
gggacccggt ggcgcgggcg gtgtcgctcc acttgtcggc caggagccgc agccgtgcgg 72180
cgatccggga gcaggcggcc tcgtcgacgg tgtccggggc gtgggcggcg tcccagccgt 72240
ccagttccga catgatcagc tgttcgcccg aggccgcgtc ggcgacgagc agcccgtgca 72300
gatggcgggc gagctcctcg gccgaggggt agtcgaacag gacggtggtg ggcagccgca 72360
gcccggtgcc ggcgttcagc cggtcccgca gttggacggc ggtgagcgag tcgaagccca 72420
gctcctggaa cggcttggtg gggggcacgg cgtccgcgtc ggcgtgaccg agcgtggtgg 72480
ccgcgtgcgc ctggacgtgt cgtacgagga tgtggtgctg ctcggccggc gggcagccgg 72540
cgagcttccg cctcagcggg tccgcctcgt cctccctggg ccgggtctcc tcggcgcgcg 72600
cctcggcggc ctccggaagg tcggagagca gcgggctcgg gcgcagcgcg gtgaagcccg 72660
tcaggaaacg ccgccagtcg atggcggcga ccacggcgtt ggtctcccgg cgttccaccg 72720
cctgctgaag ggaggcgatg gccaggccgg ggtcgagggg gtggatgccg cggcggcgga 72780
aggcgtcgac ctcgtcggcc gtggtggcca tgcccacttc gccccacagc ccccaggcca 72840
cggaggtggt cggcaggccg aggccgcggc ggtgctcggc gagggcgtcc aggtaggcgt 72900
tggcggcggc gtagccggcc tgctgcccgc tgccccagga cgcggagacc gaggagaaca 72960
ggatgaacgc ggagaggtcg aggtgccggg tcagctcgtg gaggtgccag gccgcgagcg 73020
ccttcggtgc cagcatgcgg ttcaggtgct cgtcgtccac gtcggcgatg aggtcctgct 73080
cgctgacgcc cgcggcgtgg atgacggcgg acagcgggtg ctcggcgggg atcgcgtcca 73140
gcacgccgcg catggcctcc gggtccgccg cgtcgcaggc ggcgatggtc accgcgacgc 73200
ccaggccgcg cagttcggcg gccagttccg cggcgccggg agcgttgggg ccacggcggc 73260
tggtcaggac cagttgggcg gccccgcgtt cggcggccca gcgggccagc agcgctccga 73320
cgcctccggt gccgccggtg atcagcacgc tgccccgggg ctgccacgcc tggtccgcgg 73380
tacgcgcgct cgtggtgtgg acgaggcggc gggccagcgc cccgccaccg gcccggatcg 73440
ccacttggtc ctcggcctgg ccgcccgtcg gcgcgccggc cagcagggcc gccagctgtg 73500
cctcggtccg ctcgtcgggc tcggccggca gatccaccag acccccccac cggtcggggt 73560
gttccagcgc ggccacccgg cccaaacccc aggtctgggc ctggaggggg tgcgacagcg 73620
ggtcgtcgtg gccggtggac accgcgccct gggtcacaca ccacagcggc gcggtgatcc 73680
ccgcgtcgcc gagcgcctgg agggccgcca cggtggcggc gagcccggcg ggcacggccg 73740
ggtggtcggc gcgggggcgc tcgtcgagcg cgagcaggct gaccaccccg gccggggcgt 73800
cgtcaccgcg caagcccgtc agccgttcgg cgagttcggc gcggctggtg acggaggggt 73860
cggcggggca gggccggacc tcggcgccgt gcccggtcag ggccccggtc acggtacgga 73920
ccgccgggtg gtcttcgtag ccggtgggga cgagcagcag ccaggtgctg ctcacggcgg 73980
gcggggcggg gtcgggcagc ggtgtccacc tgatctggtg gcgccatgac tccagggtgg 74040
tgcgttcgcg gtggcggcgg cgccaggtgg acagtacggg cagggcgggc cgcagcgcct 74100
cgatgtcctc ggccgtgccg tccggctcca gcacccgcgc cagcgcctcg gtgtcgaggt 74160
cctcgatggc gccccacagt tcggcctcgg ccgggtcctg cgcgccgtcg ccgcggaccg 74220
cacgggccgg gggcagccag tagcgctcgc ggtcgaacgc gtacgtgggc agggccgtcg 74280
tccggggggc gggcgggccg tcgaaccagg gccgccagtc caccggcacc ccggaggtga 74340
acgcctgcgc cagcgcggtg gccagctgtg cggggccgcc ctcgtcgcgg cgcagcgtgc 74400
ccatggccac cgcggcgacc tgggcggact cgaagcactg ctggagcgcg gggacgacca 74460
cggggtgggt gctgacctcg atgaacaccc ggtgccccgc ggtcagtagc gcgtcgaccg 74520
cctcggtgag acggacgggc ctgcgcaggt tcgtcagcca gtagcctgcg tccagggcgg 74580
tggcggtgtc catcgggccg ccggtcacgg tcgagtagaa cgccacatcc gtggcctgac 74640
cgccgactcc ggcgagacgg gccgtcacct cgtcggcgat ctcgtccacc tggggaccgt 74700
gcgaggcgta gtccacatcg atggtgcggg cccgcgcacc ggtcgcctcg caggcggcga 74760
cgacagcggc cacggcgtcg ggcgggccgg agaccacggt ggaggacggg ctgttcagcg 74820
ccgccacccc cacaccggcc gccttctcgc ccacctcggc gacgagccgc tcggccgtgt 74880
cggggtcgac gcccagggag gccatcgcgc cacggccggc cagcgcccgc agcgcctgcg 74940
agcgcagggc caccacccgg gccccgtcct ccagcgtgag tgcgcccgcg acacacgcgg 75000
ccgcgatctc gccctgggag tggccgacga ccgccgcggg ccgcacgccg tggtgggccc 75060
acagggcggc cagggagacc atcagcgccc acagcaccgg ctggaccaca tcgacgcggc 75120
cgaggtcggc ggcgccgtcg gcgccccgca acacctccgt cagggaccag tccacgtacg 75180
gggcgagcgc cgcctcgcac tccgcgatcc gggcggcgaa caccggtgag gcgtccagca 75240
gttcggcgcc catcccgggc cactgggagc cctgccccgg gaacacgagg accacccccg 75300
cgcccggcgc gcccggcgcg ccggtcacca cgtggggcgc cggctccccc gccgccagcg 75360
cggccagccg ctcccgcagc gtcgcgccat cggggccggt gacgaccgcc cggtgctcga 75420
acaccgagcg cgtggtcacc agcgaccagc ccaccgcccg cgggtcgccg tcggtcgccg 75480
cgtcgagcag ccgccgggcc tggcggcgca gcgcggccgc gctgcgcgcc gacagcaccc 75540
agggggtgac gagcgcggcg ggctcggcgg gctcggcgga ttcgttggca ggtggtgctg 75600
gttcgggggc ctcttccagg atcaggtgtg cgttggttcc cgagaaaccg aaggcggaca 75660
cgcccgcccg ccgggggcgc tcggcgcgcg gccactcgac ggcctggtcc agcagccgca 75720
ctccgccggt gtcccaggcc acatgcgggg tgggctcgtc gatgtgcagg gagcgcggca 75780
gcatcccgtg gcgcagcgcc atgaccatct tgatcacgcc ggcggcaccg gcggcgatct 75840
gggtgtggcc gatgttggac ttgatcgacc ccagccacag cggccgcccc tcgggccggt 75900
cctggccgta ggtggccagc agcgcctggg cctcgatggg atcgccgagg ctggtgcccg 75960
tcccgtgtgc ctccaccgcg tccacgtccg ccggcgcgag ccgggcgttg gccagcgcct 76020
gctggatgac gcgctgctgt gacggcccgt tcggcgcggt cagaccgttg ctggcaccgt 76080
cctcgttgac ggcggagcca cggatgaccg ccagcacctc gtggccgttg cggcgtgcgt 76140
ccgacagccg ctccaggagc accacaccgg cgccctcggc gaggttcatg ccgtcggcgt 76200
ccgccgagaa gggcttggca cggccgtccg gggccagggc gcgcagctcg ctgaacccga 76260
ccagcggtgc cggtgaggac atcacgtaca ccccaccggc cagcgccagc gtcacctctt 76320
cctcccgcag cgcctggcac gccaggtgga tggcgaccag cgccgaggag caggcggtgt 76380
ccagcgtcac cgcggcgccc tccaggccga gggtgtacga cacgcgtccg gcgaccacgc 76440
tcggcgagtt gccggtggtc agctggcccg ccgcgccctc gggcacctgg aaggagttga 76500
ggaagtagtc gaggccgtcg cagccgaggt aggtgccggt ggcgctggag cgcagggtgt 76560
gcgggtcgat gcgggcgtgc tcgacggctt cccaggccag ctccagcgcc agccgctgct 76620
gcggggccat ggtgaccgct tcgttcgggc tgatcccgaa gaacgcggcg tcgaaccggg 76680
cggcgtcgcg gaggaagccg ccctcgcgca catagctggt gcccggatgc ccgggttcgg 76740
ggtcgtagag cgcttcgagg tcccagcctc ggtcggcggg caggccgccg gtgccctcgc 76800
ggccggcggc gaccaggtcc cacaggtcct cgggggaccc cgcatcgccg gggaaacggc 76860
agccgatacc gatgatcgcg acgggttcgt ggcgggcgta ctcgacgtcc ttcagccggc 76920
gctgggcctc gcgcagatcg gccgtcacgc gcttgaggta gtcgaggagc ttcttgtcgt 76980
cgttcgccat gtcaggaccg ttccggcgcc ggcttgtcca cgaggcgcgc caggtcgagc 77040
aggtcgctcg ccacccgctc gggttcttcc acgtggaggt tgtggtcggt gtcgacgtac 77100
cagcgggatt ccgcctgcgg gatggccgcc tcggcctccg ccacccactt ccgggcgtag 77160
gtgccccagg actgggcgtg ggcgggcagg gcgacgagca gcagaacggg cgccgtgatg 77220
ccggggtacc agcgggcggg cgggtcctgc cacatgctgt ccgcgagcga catgtagtgc 77280
tccagcggca gccgctgcac gagcagcccg tccggccctt ccaccatgtc ggcgaggctc 77340
gcctcgacgg cggtctgtga ccacgtcggg tggagacccc gcaggagttc ccgcatggtc 77400
tcctccttga ttccggtgac gtcgcgccac catccggcga gctccacgct cttctcccgg 77460
gtgggggcgt aacgagaggt gtcgacgaag ccgatccagc cgccgtccac cagcgcgagg 77520
ccggcggcca gctcggggtg ctgggccgcg agccgcaccg cgaggtttcc gccccaggag 77580
tggccggcga acagcgcccc ggtcagccgc agctcgcggc agacggccac caggtccgcc 77640
acggcggtcg cgttgtcgta cccgtgctcc ggcaggtcgg agtcaccgtg tccgcgcatg 77700
tcgagcgcgt agacggggtg gccggcggcc gcgagcagat cggcgacctc gtcccacagc 77760
cgggcgttgg accccagccc gtggagcagc aggaacggcc ggccgtcgcg gcccggccgg 77820
tgccgcacgt tcaaatgcac tgcgtggtct accggaacgg acaacttcat ggcggggatc 77880
ccttgcctcc tgggagaggg gttgagacgg gtgaccgatc gagcgggatc gcggcgcctc 77940
ggcggaggct aggccgcatc cgcttcgtag gtctccgcga ggtacttgcc gagcagggtg 78000
gaggtggggt gttcgacgat cgcgacgagg gggacttcga gcccggtgtc gctcatcagg 78060
ctcttggcca gttgcacggc gctcagcgag ctcaggccgt tctccaggaa gttgctctcc 78120
tcgtccaggg cggggacgtc gaggacctcg gaggcgcggc ggagcacctg cgcggccagc 78180
agccgctggc gttcctccgg cgtcgcggac gcgagacgtg cgaccagctg ctcgtcgtga 78240
gtggcttcct cgttgggttc catggctgca cccttctcgg tgtcgtatcg ggctgacggg 78300
ggtacgtcgt ggtctggtgc ggtgggctag ccgagcgcgc ccttcgcggc gatgccggac 78360
ggcttgagcc cgaccatctc cagctcctcg gggtggaacg gcggttcgac gagcttcggg 78420
tggctggtgt cggtgcgcag caacgacagg aagcgggagt cgcccagggc cttctgcggc 78480
gcgttcaggc tcagcacgct ggtggtgacc tcgcacacct tcggcgagcg gatcgacttg 78540
ccggagagga agtcggagaa cttcagccgc tccgcgacgt ccgggccggt cagccgcggg 78600
tcgctggaga ggttgcggca gttgacgtac tggatgtcgt tcagcccggc catgatccag 78660
ggatcgtcca ccacctcgct cagcgcgcgc tggacctgcc gggtgccgcc ctcgccgaag 78720
ccgtcgcgtg ccagctcctt gtccagggcc tcggcggcac gggccgccga gctcatgccg 78780
tggccgtaga ccggattgaa cgccgccagc gagtcgccga ggatcagcag gccctccggc 78840
cactgcggca tgcgctcggg gtacatgcgc cggttggctc ccgagtgcga ctggaagatg 78900
gggctgatgg gctcggcgac gctcatgagt tcgctgacga tcgagtgccg caggaccttg 78960
gcgtagccgg tgaactcgtc ctcgtcggtg ggcagcgggg cgccgcgcgt gctggtcagg 79020
gtgaccagcc agcgaccgcc ctcctgcggg tagaccacgc cgaagcggcc cggttgccgg 79080
gtgagggggt cggcggcgac ctggaccgcg gggaagtttc cgtccgcgcc ctcgggcgcc 79140
ttgaacatgc gggtcgcgta gccgatgccg gcgtcgacca cgtcctgctc gacggccggc 79200
aggccgaggg ccgccagcca gtggcccagg cgcgaggcgc gccccgtggc gtccaccacc 79260
aggtcggcct ccagcagcct gggttcgccg ccgcccgcgt cccgcacccg caccccggtg 79320
accctggagc ggtccccggc cagctcgacg gcctcggtgc cgtgctccac ctcgatccgg 79380
ccgccggcga ggatcgcgtc acgcaccacc cagtcgagca gggggcgggt gcacatcatc 79440
gcgtacgcgg tggccgggaa gcggtactgc cagccccacg ccgtcagggt caccaggtcg 79500
ctctggaagc cgatccggcg ggcaccggcc gccagcagcc gttcggcggt gcccggcagc 79560
aggttctcga cgatgcccgc gccactggac cacaggacgt ggacgtggcg ggcctggggc 79620
tgtcccttac ggtggtgcgg tccgtcgggc aggatgtcac gctccaccac ggtcacggat 79680
tccaggtggc gggccagcac atgggcggtg agcatgccgg cccatccgcc gcccagcaca 79740
acggcacgtg tcggtgtggt catggcacca agtcctctcg gagcgggctc gtctgcgcgg 79800
acactggcct ttcgcccagg acgcccatca cgagggtgtc ccgaacggct taagtgcgtc 79860
tttccgttcc ggccgccgcg cggtgtccaa agccggctcc ggtgcggtgt cgtcgtcggg 79920
gaacccgatc gtcatgtcgg tgaggcccca gaacccctgg atgtgtgcgc ccagtcggtc 79980
ctcgttgacc tcgcagatag tgatcatgcg catgcgcatc ctggccggga cgtagacccg 80040
cgcgtccccc aggaccacca cgaaccgtcc gtccatcgac gtgacgagcc gccccagcat 80100
ctcgtgcacc ttgccctccg agccccggaa cgcgtgctcg cgcagggcgt ccttgccgcg 80160
cagcaacagg ccgcccaccg ggtcctcgaa gacgatgtcg tccgtgaaca gggcgacggt 80220
cccttccacg tccccctttt gcagcagccg gaggtagttg tggggcagct ccctgagggc 80280
cgcctcgtcg gggtcgatcg ccgggggccc gacgaacggc cctgacctgc cgtcgacgga 80340
ggtctcgagg tcgggtttgc cccagtacgc ccgcatgtcc tcgatgaggc cggactcgcc 80400
gacgcggagc agcagcgcgt agtggcagcg cagggcggtc ggctccgcgc cctcgggcgc 80460
gggcagccag ccgcggtcgg cgtacagcgg gccgcggggc cggtagtcca tgaccgcggt 80520
gatccgggac aggacgtgtt cgccgtcctg ccccacgacc gactccacga tcgtctcgtc 80580
gacgtttccg gcgactgcgc tctcgaagtg cgcgcgcagc gcctcgtggc cggcctggcc 80640
gccggccccg acgggggcct cgaacgtgac gtccttcgcg tacaggtcga gaagcccgtc 80700
gatgtcgccc tcgctgaggc gccggctgtg ttcgcgtgcg atcttcttgc gggtgcgctc 80760
gttgagcacc gtatctcctc tgcttaccgg tcgggccgta ccaggctcat gtcgaacgcg 80820
gcgcgccagt accgtccgcg ccggaaggcg taggtgggca gctcgacgcg gcgggcctcg 80880
ggcccgaaga ccgcccgcca gtccacggcg gtgccgcggg catgcgcccg ggcgaggccg 80940
gtggcggcgg aggtggcctc cgcgacgccg ggccgcagcg cggccaccgc gtgcgccccc 81000
gcgccgtcga ggagtccggc cacctcgtgg ccgcccagtt ccaggtagac cgtgccgtcc 81060
ccggccaggt cacggacccg ctgggcgaag tcccgcggcg ggtccggggc ctcgtgcggg 81120
ccggtcagcg ccgcgcacgc gtcgggcagt gccagtcgcc cgctgacgtg gtcttcggcg 81180
aggcggccga tgccgtggcc gcgcaccgca ccgggcctcg tcccccacga ctcgaccagt 81240
cggaacagcg ccacctggat cgcgaaggcc ctggccgggt cggggacggc cgccgaggtg 81300
gggtccgcgc cggtgagcat cgcgtcacgg gcggggaagc cgaggtgggc gtcgagcgcc 81360
gcgcagaccg tgtcgaacgc gtcggcgaac gccgggaagg cgtcatacag ctgcttgccg 81420
gtgccgggtg cgggggcgtg gcccgtgaac aggacgacga gccggggccg gcgtccgatc 81480
gcgccccggg cgacatgggg cgcagccctt ccctcggcca gtgcgcgcag tccgtccagc 81540
agctcctggc ggctctcgcc gacgaccacg ccgcggtgtt ccagcgcggc ccgggacgtc 81600
cccagcgaca ggccgacgtc ggccggccgg agcccgggcc gctcgtcgag gtgggccagc 81660
agacgggccg cctggtcgcg cagcgcgtcg gggtcgcagc cggagagcgg ccagagcgcc 81720
accccggcgc ttcccggctc tccttccgct tcgggtagcg ggggcggagg gtcgtcttcg 81780
ggcggcgcct gttcaacaat cacatgcgca ttggtgccgc cgaatccgaa cgcggacacc 81840
ccggaccgaa gtggccggtc agttcccggc cagggggtgg cagcggtgag cagccggatc 81900
cgaccccgct tccaggagat gtgcggcgac ggcacctcgg tgtgcagggt gcgcggcagg 81960
acgccgtggc gcatggcctg gaccgtcttg atcaccccgg ccacaccgct ggccccctgc 82020
gtgtggccga ggttggactt caccgccccc agcagcagcg ggcggtcggc cgggcggccc 82080
tcgccgtaga cggacgccag cgcctgggcc tccaccgcgt cactgagcag gccaccggtg 82140
ccctggccct cgaccgcgtc gatctcgtgc ggcaggagcc ccgcggcggc cagcgcctgc 82200
ctgatcagcc gctgctgggc ggggccgttc gacgcgctca cgccgttgtt ggtgccttcc 82260
tggccgatcg cgctgccacg gaccacggcg agcacgggac gcccgtgcgc gcgggcggtg 82320
gagagacgct ccaggagcac cacgcccgcc ccctcggcga aggcgatccc ctcggccccg 82380
tcggcgaacg agcggcagcg ggcagcggcg ggcagcccta tcccccggcc ggtcccgatg 82440
aacgtctccg ggctggacat cacggtgacg ccgccggcca gagccatgga acactcgccg 82500
ccgcgcagcg actggcaggc caggtgcagc gccaccaggg atccggagca ggcggtgtcg 82560
agggtggccg tgggcccttc gaggccgaag acggaggcga ggcggccgga cagcacggcg 82620
ctcgaaccgc cggaggcgat gtggccgagc agctcgtcgg ggacgtcctt caggtgcggc 82680
gtgtaatcct ggccggtggt gcccatgaag acgccacagc gccggccgcg gaccgaccgc 82740
gggtcgatcc cggcccgctc gaacgcctcc caggtggtct ccagcagcag ccgctgctga 82800
gggtccatca ccgaggcctc gcccggggag atcccgaacg gcgcggggtc gaacgcgccg 82860
gcgtcgctga ggaaacctcc ctcctcgacc gcgctgcgca cgtccgggtg ggcgtccggc 82920
ccgtacaggc cgtccaggtc ccagccccgg tcggcgggaa aggcgccgat ggcgtcccgc 82980
tcctcggtca gcagtgtcca caggtcgtcc ggcgactcga cacccccggg atagcggcag 83040
gccatcccgg tgatcgcgac gggctcccgg ggtgccgcgc gcagccggtc gttctccttg 83100
cgcagccgct ctgtctccag gagcgaggtg cgcagggcct cgacgacctt ggcgtcggcc 83160
actgtcatga cattcctccg aagggacggc acacgtcagg actcgtcgtt gcgcagggcc 83220
gcgttcacca gttcctcgag gtccatggag gcgatgtccg cggagtcggg ggtgtgggtt 83280
tcatcgcggg ttccttccac gggccgcgcc cccgtggtgt ccgcttcccc gggcccctcg 83340
cccagccgca gcagtgtctc cagcaggccg gactcctcca gccggcgcag cgggatggcg 83400
gccagcacct gccggacccg gctctcgcgc ggatcggtgg acgggccctc cggcaggccg 83460
gggggcgggc tcgggtcggc gggtgtactc gggtcggtgg gcggtcccag ctcggtgggc 83520
ggggcgatct gctcgcgcag gtgctcggcg agcgccacgg ggttggcgaa gtcgaagacg 83580
aacgtggcgg gcaggcgcag gccggtggcg gccgagagcc ggttgcgcag ctcgacggcg 83640
gtcagcgagt cgaagcccag ctccttgaac gtccggtcgg gggcgatctc gtcggccgag 83700
tcgtggccga ggacggccgc catgtgcgcg cgcaccgcct ccagcacaca gctgtcgcgt 83760
tccgcctccg gcagcgcggt gacccggtcc gtcagcgacg ggccgccggt gcccgccacg 83820
ccgctgtccg cgacccgccg cgccctggtg cggccggcgg ccgggccgcc gcgcgaggcc 83880
gcggtggact gccggatcgg gaccagcagc gcgtcaccca cggcgtacga ccggtcgaac 83940
agcgccaggg cctgctcggt gggcagcgga tccataccgg cctggctgat gcgctgccgg 84000
acaccgtcgt ccagctcacc agtgagcgcg ctggtctcct cccacattcc ccaggccagg 84060
gactgcgcgg gcagcccgcc ggcctggcgg tgctgggcca gggcgtccag gaacgtgctg 84120
gcggccgcgt agttggcctg cgaggggttg ccgtacaccc cgacgatgga ggagaacaac 84180
acgaacagcg ccaggtcggc gtcctgggtg agctcgtgca gatgcagcgc ggcctcggcc 84240
ttcggccgca ggacccgggc cagttgttcg ggggtcagcg cgtcgaccat gccgtcgtcg 84300
aggacgccgg cgcagtgcac caccccggtc agcgggtggg cgtcggggac gccggccagc 84360
acctcggcga gccgctcgcg gtcgccggcg tcgcaggccg tgacggtcac ggtggcgccc 84420
agcgcggcga ggtcctcgcg cagggcgggg gcgccgggtg cctcggggcc ccggcggctg 84480
gtcagcagca ggtgccgcac cccgtgtcgg gtgaccaggt ggcgggcgat cagaccgccc 84540
agggtgccgg tgccgccggt gaccaggacc gtgccgtccg ggtcgagcgg cctcggcagt 84600
gtcagcacga tcttcccgat atgagcggcc cggctcatca gccggaacgc gtcgcgcgcc 84660
cggcggatgt cgaaagcgcg gaccgcgggg tgggtcagct cgccgcccgc gaaccggccg 84720
aggatctcgg tcagcatgcg tccgatgccg tcctctcccg gctccctcag gtcgaggacg 84780
cggtagcgga ggccggggta cggctcggcg agctgctcgg ggtcgcggac atcggtcatg 84840
cccagctcca cgaaccggcc cttctcggcc agcaggcgca gcgaggcgtc ggtgtggtcg 84900
ccggtgaggc tgttgaggac cacgtcgacg ccgcggccgc cggtggcggc gcggaaggcg 84960
tcctcgaagt ccaggtcgcg ggaggaggcg cggtgggcct cgtcgatgcc catcgcctcc 85020
agcacatggt gcttaccggg gccggcggtg gcgtagacct cggcgcccag gtggcgggcg 85080
atctgcaccg ccgccgtgcc gacgccgcca gtggccgcgt ggatgaggac cgactcgccc 85140
gccttgagcc cggcgaggtc caccagcccg taccaggcgg tcacgtacgc gatgggcacg 85200
gccgccgcct gctcgtcggt ccagcccggc gggatcggca cgacggagcg ggcgtcggcc 85260
acggcggtcg agccgaaggc cccctggaac atgcccatca cgcggtcgcc gacggccacc 85320
gaggtgacgt cctcggccac ctcgagcacc acgcccgcgc cttcgctgcc gcggaacgcc 85380
ccgtcgcccg ggtacatgcc gagcgcgatc agcacatccc ggaagttgac gcccgcggcg 85440
cggacctgga cgcggacatg gccgggggcg ggcggcccgg gttcctccgc cgccaccagc 85500
cccaggttgt ccagcgtggc gggagcggcg atgtccagcc gccagtgggg ggtttccggg 85560
agggccaggg cgccgccgga caccgcgggg accaggcggg acacgcgcac cgtcccgtcc 85620
cggacgacga tctgcttctc accggcggcc accgcgacgg cgacggccgc ggcgaccgcc 85680
ttcggcagtt cgggcccgtc gaggtccacc agggccagcc ggtccgggtg ctcggactgc 85740
gcggagcgga tcaggcccca caccggggcg tgcacgaggt cggtgacgtc gtcgccctcg 85800
aaggcggcga ccgcgccgcg ggtgaccacc accagccggg aggcggcgaa ccgctcgtcg 85860
gcgagccagc gctggacggt gtgcagcacc tcgcccagcc ggcggcgcag cgcaccgggg 85920
aggtccccgt cgtccggcgc cggttcgcag ggcagcaggg tcaccggcgg ggcgggcagg 85980
ccgccgtcca ccgcggcggc gagcgcgtcg aggtcgccgt agcggcgcac tccgtagtag 86040
gtgggctcgc tgccggtggc gaccgcggtc cagtcgtcgg tggcggtgcc cggctcgggg 86100
tcggtgagcg gggtgtacgc cacgcggaag agcccgtcgc gcgtctcgtc gccgtcgctc 86160
gggagttgcc ccgccgtgaa cggccgggtc accagggact cgatcgagat caccgggtgt 86220
cccgaggcgt cggtggcgtg gacggacatg ccgtcgtcgc cgaccggcgc gagtctgacg 86280
cgcagcaact cggcgccggt ggcggccagc gacacgccgt tccaggcgaa cggcagcgcc 86340
atccggcctc cgctgtcgcc gcgcaccgcg ctcagtccgc tggcgtgcag cgcaccgtcc 86400
agcagggcgg ggtggatgcc gaacgcggcg gccttcccgc gctgctcctc cgggaggacg 86460
acctcggcga agacctccgt ctcgcgccgg tagacggcgc gcagcgagcg gaaggcgggg 86520
ccgtagcgga aaccgccctc ggcgagctgg tcgtagcagc cttccacggg cacgggccgg 86580
gcgccttcgg gcggccactg ggcgaggtcg aacggccggg cggccccgcc gggcgccaag 86640
gtgccggtga cgtgctggac ccacggccgg tcggacccgg acgggcgggc gtacccgccg 86700
agcgtacggc ggccgtcctc ggcggcgccg cccaccacga gctggaggtc gaaaccgccc 86760
tgctcgggga ggaccagcgg gctcgccagc accatttcct cgaggtggtc gcagcccacc 86820
tcgtcaccgg ccctgacggc cagttccacg aacacggcgc cgggcaccag caccgcgccg 86880
gccaccgcgt ggtcggccag ccatggatgg gtgtccaggc cgagccggcc ggtcatgacc 86940
acctcgtcgc cttcggcgag gagggtgacg gctcccagca gcgggtggtc ggccggcgtg 87000
aggccgagac ttgcggggtc gcccgccgcg ccggcggtgg catccagcca gtaacgccgc 87060
cgctggaacg cgtaggtggg caggtcggcg gggctggtgc cgtccgggaa cacccggctc 87120
cagtccaccg cgacgccatg ggtgtgcgcc tggccgagcg cgaggaggaa gcggtcccag 87180
ccgccctcgt cccggcgcag ggtgccgagg gccgtggcgt tggcgcccga ctggtccatg 87240
gtttcctgta cgccgatggt gaggaccggg tgcgggctgc actcgatgaa cacgccgtgc 87300
ccgtcggcca ggagcgcctc ggtggcctct tggaacctca cctgttggcg gagattggtg 87360
taccagtagc cggcgtccat ggtggtgccg tcgatacgcc gcccggtgac ggtcgagtac 87420
agcgggacgg tgcctgcctt cggctcgata cccgagaggg catcgagcag cgggccgcgc 87480
agcgcgtcga cgtgggcgca gtgcgaggcg tagtccaccg ggacgcgccg ggcgcgcacg 87540
ccatcggcct catagccctc cagcagctcc tcgagggcgt cggcgtcgcc ggagaccacg 87600
gtcgaggagg ggccgttgac ggcggccacg ctcagccgcc cgtcccagcg ctccaggtgt 87660
ttctccacca gcccgacggg ctgcgccacg gaggccatac cgccacggcc ggacagctcg 87720
ggaagcacct gggagcgcag caccacgacg cgggccgcgt cgtccaccga cagcacgccc 87780
gcgacacagg ccgccgcgat ctcgccctgg gagtggccca ccacggccgc gggctccaca 87840
ccgtgggaac gccacagctc ggccagggcc accatcaccg cccacagcac gggctggacc 87900
acgtccaccc ggtccagcga gggggcgccg ggatcggccc gcagcaccga ctccacggtg 87960
tagtcggtgt gccgttcgac cgcttcggcg cagcggcgga agtgttcggc gaacaccggt 88020
gaggcgtcgg ccagcgcgag ggccatgccg acccactgcg agccctggcc ggggaagacg 88080
aacgccgtcg cgccggggct caccgaaccc gtgacgacgt ggccggcggg ggtcccctcg 88140
gccagcgagc ccagcccggc caggagggtg tcgcggtcgg cggcgaccac ggcggcacgg 88200
tgcgactgcg cggcgcgcga gcccgccagg gcgaacccga cacgggtggg gctcgcctcc 88260
ggccgctcgg cgaggaagtc ccgcagccgg gcggcctggg cgcgcagcgc cgcctcgcca 88320
cgcccggaca ccagccacgg gaccaccggc gactcggcgg cgggggcggg ctcctcggcc 88380
ggcggggcct gttcgaggat gacgtgcgtg ttggtgccgc tcatgccgaa ggaggacaca 88440
ccggcccggc gcgggccgcc ggcctcgggc cacggccgcg cctcggtgag cagccgtacg 88500
gtgccctcgg accagtccac ctgatgggtg ggttcgtcca cgtgcagggt gcgcggcagg 88560
agtccgtggc gcagggcgtg caccatcttg atcaccccgg ccatgccggc ggcggcctgg 88620
gtgtgcccga tgttggactt gaccgagccc agccacagcg ggccgccgtc ggcgcgccgc 88680
ctgccgtagg cggcgatcag cgcctgcgcc tcgatggggt cgcccagcga ggtgccggtg 88740
ccgtgcgcct ccaccacgtc gatgtcccgg gcggacagct gggcgtcggc cagcgccgcg 88800
cggatgacac gttcctgggc ggggccgtgg ggcgcggtga ggccgttgga ggcgccgtcc 88860
tggttgatcg cggagccgcg gatcacggcg agcacggggt gcccgaggcg ttcggcgtcc 88920
gccagccgtt ccaccagcag gacgcccacg ccctcggcgg gcccgaagcc gtcggccgcc 88980
gccgcgaacg ccttgcaccg gccgtcggcg gcgagcccgt tcagcttgct gaactccaga 89040
taggtgccgg gggtcgccat cacgcacact ccgccggcca gcgccaggga gcactccccc 89100
ttgcgcagcg cctgcaccgc caggtgcagg gccaccccgg acgaggagca ggcggtgtcc 89160
accgtgaagg aggggccctc gaagccgaag gtgtaggaga tgcggcccga catcgagctc 89220
gcggccagcc cggtggccag gtacccggcg acgtcctgcg gcacgcccgc ggcgttgccc 89280
gcgtagtaga cggtgctgcc gcctacgaag acaccggtct tgctgccgcg cagtgaggtc 89340
aggtcgatgc gggcgcgctc gatggcctcc caggaggtct ccagcagcag gcgctgctgc 89400
gggtccatgg ccagcgcctc gcgcgggttg atgtcgaaga agtcggcgtc gaagtcggcg 89460
atgtcgtgga cgaatccgcc ctcgcgcaca taactggtgc cggtctcgcc cgagtcgttg 89520
aacatgccgg ggtcccagcc ccggtcgtcg ggcatggggg agatcacgtc gcggccctcg 89580
gacaccaggt cccacagggc ctccggagtg tccgtcccac cggggaaccg gcaggccatg 89640
ccgacgatcg cgatcggctg gcggtcccgg gcctccacct cgcggaggcg ctgctgagtg 89700
tcatgcaggt cggtcgccac ccgtttgagg tagtcgacca gctgttcctg gttcgccatg 89760
gcgtcctctt tcaggccctt cccaggtggg agtcgatgaa ctgcagtact tcgtcggccg 89820
aagcctcctg gagccgctcc gcgacgccat tgccctccgc agccgtgggt ttctggcggc 89880
accgcgtcag cagcgcttcg aggcgggctg tcacggcgtc ccgttccccg tcgtcgacga 89940
gcaggtcgac gtggtcctcc agccggtcga gttcctggat cagggcctcc gacccgctca 90000
gcccctcgga gaccagttgt tcccgcaagt gctcggcgag cagtgcgggg gtggggatgt 90060
cgaagacgag ggtggagggc agcgtggtgt cggtggcccg gctgagccgg ttgcgcagct 90120
cgaccgcggt cagcgagtcg aagccgatct ccttgaaggg ccgcgacgga tcgaccgcga 90180
ggacatcgcc gagcccgagc acggccgcca cgttgcgccg caccagctcc agcagggtgc 90240
ggctctgctc ggccgccgag aggcccgtca ggcgctccgc catcgtcagg tcgtccccgg 90300
tggcgggctg ccccgccgcg gcggcgcgcc gggcaggggc ctgcctcacc aggccgctga 90360
ggatgcgggg caggacacgg gcgtcggcga ggtcccgcag cgcttccggg tcgaggcggg 90420
cgggtaccag caggctttcg ggacggcgct cggccgcgtc gaacagggcc aggccctgct 90480
cggacgacat cgcccggata ccggagcgct ccatacgggc gatgtcgcgc tcgtccaggg 90540
tgccggtgat gccgctggtc tgttcccagt acccccagga gagcgactgg ccggggaacc 90600
cggcggcgcg gcggtcctgg gcgagcgcgt ccaggaaggc gttggacgcc gtgtagttgc 90660
tctggccgcc gacgccggtc actccgccgg tcgaggagaa cagcacgaag gttgcgaggt 90720
ccttctcgcg ggtcagctgg tgcaggttga ccgccgcgtc caccttgggc cgcagcaccg 90780
cgcgcagccg ctcgccggtg agggagggga tggtgccgtc gtcgatcacg ccggcggcgt 90840
ggatgacgcc ggtcagcggg tgcgccgccg ggatgtccgc cagcagctcc gccagggcgt 90900
cgcggtcggc ggcgtcgcag gcggcgatcg tcacctctgc gcccagggag gtcagttcgt 90960
cgcgcagctc ggcggcgccg ggcgcgtcgg cgccccgacg gctcgtcagc agcaggtgac 91020
gggcgccccg cgtggtgacc aggtggcggg cgaccacgcc gccgatcgtg ccggtgccgc 91080
cggtgatcag tacggtgcct tcgctgtcca gacgccggcc gggcaggtcc ggcggggtgg 91140
gggcggtgtc ctggccgata cgggccaggc gcgggacaac cacacggccc tggcgcaccg 91200
ccacctgcgg ctcacccgcc ccgagcgcga cggacagggc gccgggcagc agcgccgccg 91260
cgtccgggtc ggcgccgtcc aggtcgacca ggacgaaccg gtcgatgttc tccagttgcg 91320
cggagcgcac cagcccccac agggcggagt gcaccaggtc ggtgacccct tcgtcgccgc 91380
tcgtcgactg ggcgccgcgg gtgaccagca ccagccggga gtcggcgaac cgcccgtccg 91440
cgagccagcg ttgcacaacc ccgagcagcc gtgcggtctc gtcgtggacc gcggcggcga 91500
gtccggggtc gagggcggtc gctccgccgg cgaggggtgc gcagtccagc accatcaggt 91560
cggaaacggg ggcgtcgtcg tcgacgtacc gcgccgactc cgcgagatcc gtggtggtgc 91620
gcacggcatg cccggcgcgt tccagcgcgt tcgtgatgcc gaggccgtcg gcgcccacca 91680
cttcccacgc gcggtcggcg agcacctccg gcctgggctc ggcggagatc cactcgacgc 91740
ggtagagcga gtcgagctcg gaaccgcccc ggctcagctg ctccagggag accgggcgga 91800
agacgaccgc gtcggcggag gcgacctgac ggcccgaggg gtccgcgaca ctcagcgaca 91860
ccccgtcggc cgtttcgttg gggaccatac ggacgcgcag cgccgaggcg ccggtggcgt 91920
acagcgacac gccgttccag gtgaacggga gcagcatctg cccgtcgggc gggccgccct 91980
gcccgatgcc gccggcttgc agcgcggtgt ccagcagggc gggatgcagg ccgaaccgtc 92040
cggccccgcc gtgcagttcc tcggcgagcg cggcctcggc gaacacctcg tccccccggc 92100
gccaggcggc cttcaggccg cggaacgccg gcccgtagcc gtagcccgcc tccgcgagac 92160
cggcgtagag accgtccagg gacagcggct cggcgccggc cggcggccac tgggtgaagt 92220
cgaagtcgtc cggccgggca cctgtggtga gggtgccgct cgcgtgccgg gtccacggcc 92280
ggtcggcgtc gtcggggccg tcggcgcgcg cgtggacggt gaagccgcgg ccgccgtcct 92340
ggcgtggcgc ggagacgacg agctggagct ggagggcccc gtcctcgggg aggaccaagg 92400
gggtttccag gacgagttcc tcgacgcggt cgcagcccac ctcgtccccc gcccgcacgg 92460
cgaggtcgac gaacgccgtg ccgggcagga acaccgaacc ggccaccgcg tggtcggcca 92520
gccaggagtg tccggcggtg gagatccggc ccgtcagcaa cacctcctcg ctctcggcga 92580
cccggacggc ggcgccgagc agtgggtggt cggcggaggc gaggccgaga ctgtggggat 92640
cgccctggcc gccgatggtg gaggcgtcga tccagtagtg ctggtgttgg aacgcgtagg 92700
tggggaggtc caggacgggc ggggccgggg tggtgacgcc ggtggggaca tcaggagtgg 92760
tgtccacccc gtaccaggct ttccagttca cggacagacc cgcggtgaag gcctgcgcca 92820
aggcccgggc cagctgcacc ggaccaccct ggtcccgctg cagggtcgcc acggtggcca 92880
cagggacatc caacgcctgg gcggtctcct gaatgcccac gctcagcacc ggatgcggac 92940
tcgactccac cagcacccga tatccatcgc tcagcaacgc ctcaacggcc gtggcgaacc 93000
gcacctgccg acgcaggttc tccacccagt acgccgcgtc cagcgccgtg gactccaccc 93060
ggccaccggt caccgccgaa tagaacgcca cagccgtctc aaccggcgtg acctgcccca 93120
acacctcgat caactcatcg gcgatgtcat ccacctgcgg attgtgagac gcataatcca 93180
catccaccag ccgggcccgg ccgccggccg cctcaaccgc cgcgaccacc tccgccaccg 93240
ccaccggcgg cccggaaacc accgtcgaac ccggcccgtt caccgcggcc accaccaccc 93300
ccgcacggcc gccgatcgcc ttctcagcct cctcccgacc caccgccaac gacgccatcg 93360
ccccacgccc cgacaacacc cgcaacgccc gagaacgcaa agccaccacc cgcgcaccat 93420
cccgcaacgt aagcgccccg gccaccaccg cagcagcgat ctccccctgc gaatgcccca 93480
ccaccgccgc gggctcaaca ccatacgaac gccacacctc cgccaacgac accatcaccg 93540
cccacaacgc aggctgcacc acatccaccc gcgacacatc caccccatcc tcgccccgca 93600
acaccgccgt caacgaccac tccacaaacg gcgccaacgc cacctcacac tccgcgatcc 93660
gccccgcaaa caccggcgaa gcatccaaca actccacccc catcccccgc cactgaccac 93720
cctgccccgg aaacaccaac accgggccgg ccccgccgac caccggttcc ccggccacca 93780
cgccgcccga gggcagtcca tcggccaggg ccgtcagacc ggtcaacagc tccgtccggt 93840
cctggccgac caccaccgcg cggtggtcgt gcaccgcccg accggccacc agcgaccagc 93900
cgacctccgc cacctcgtcc acggtctcca ggacgaactc cgccagccgt ccggcctggg 93960
cccgcagccc cgcggccgac cgccccgaca ccacccacgg caccaccccg cccaccggcg 94020
ccgggccaca gtccacggga gcgtcgacgt acggatcctc cggagcctcc tccagaatca 94080
catgcgcgtt cgtcccggag atgccgaacg ccgagacgcc cgcccgccac ggacggtcca 94140
cctccggcca gtcggtcgcc tcgctgagca gtgcgaccgc cccgctgtcc cagtcgacat 94200
gcggcgtcgg cgcgtcgatg aacaggctgg ccggcagctc cccgtggcgc atggccatca 94260
ccatcttgat gacaccggcg gcgcccgcgg cggcctgggt gtggcctatg ttggacttga 94320
ccgaccccag ccacaacggc cgctccgccg accgtccctg accgtaggtc gccagcagcg 94380
cctgcgcctc gatcggatca cccagcgtcg tacccgtccc gtgcgcctcc accacatcca 94440
catccgccgg cgacagccgc gcgttcgcca gcgcctgccg gatcacccgc tgctgcgacg 94500
gaccgttcgg cgccgtcagc ccattgctcg caccgtcctg attgatcgcc gaaccacgca 94560
ccaccgccca caccttgtga ccattacgcc gcgcatccga caaccgctcc aacaacagca 94620
aacccacacc ctcgccccac cccgtgccgt ccgccgccgc cgcgaacgcc ttgcagcggc 94680
cgtccgccgc gagcccccgc tgccgcgaga actccacgaa ggcgcccggt gtggacatca 94740
ccgtcacccc acccgccaac gccagcgagc actcaccgcc gcgcagcccc cgcaccgccg 94800
agtgaagcgc caccagcgac gacgaacacc ccgtgtccac cgtcaccgcg ggaccctcca 94860
gccccagcac atacgacacc cgccccgaca ccacgcagcc gagattgcct gtgccggtgt 94920
acccctccat gtcgaggctg gagatcccga tgatcgacag gtagtcgaag atggtcgcgc 94980
cgacgtagac gccggtgtcg ctgccgcgca gggagccgtg gtcgatgccg gcgcgctcga 95040
acgtctccca ggcggtctcc agcaacagcc gctgctgggg atccatcccc accgcctcac 95100
gcggcgagat accgaagaag tccgcgtcga actcccccgc gtcgtagatg aatccgccct 95160
cgcgcacata gctggtgccc gggcggtccg ggtccgggtc gtacagcgtg tccaggcgcc 95220
agccgcggtc ggtcggaaac gccgacatcg cgtcccgccc ctctgtcacg aggtcccaga 95280
gctgttccgg gctggtcacc ccgcccggga accggcacgc catcccgacg atcgcgatcg 95340
gctcgtccgg ttccccgatc ggtcccgcga cggggacgac ggtggcgtcg gcgacagccg 95400
tctgcgtgcc ggagatctct ccgaggacat ggcgggccag gtcgtccgcc gtggggtagt 95460
cgaagatcag cgtggtgggc agcggcattc cggcgcttga gctgagcttg ttgcgcagct 95520
cgacggcggt cagcgagtcg aagcccatct cgaagaaggg cttcgttgcg gaaacggcgt 95580
ccggacccga gtggcccagc acgccggccg ccagggcgcg gatgtgctgc gagagtatct 95640
ggtgccgctg ttccggggcg gcggcggaca gttgcctgcg caacgggctg tcctcggacg 95700
gcgtctcctt ctcgaggagt tcgacggcct gggggatctc ggcgagcagc cggctggcgc 95760
gctgcccggc gaacgtcgcg gcgaaccggg cccagtcgaa gtcggccacc gtcagcgtcg 95820
tctcgccacg gcccagggcg tcgcccagca cccgcagcgc cagctccggc ggcatggcgg 95880
tcaggccccg gtcacggaag aaggtggcca ccgcctgatc ggcggccatg cccgcctcgc 95940
cccacggccc ccacgccacc gaaagccccg gcaggccctc ggcgcggcgg cgttcggcca 96000
gcgcgtccag atacgcgttg gccgccgcgt acgcgccctg ctggccgctg ccccagctgg 96060
ccgcgccgga ggagaacagc acgaacgccg acacgtccag gtcgcgcgtg agctcatcga 96120
gccagcgcgc gccccccgcc ttcgccgccg tcacctgcgc cacccggtac ggatccaggt 96180
cgcgcaccgg cgtgtagtcg ccgacaccgg cggcgtggaa gatcccacgc aggggatgct 96240
cttccggcac ccgtgccagc agcccggccg cctggtcccg gtccgcgaca tcgcaggcca 96300
ccacgtccac cgaggcgccg agcgcctcca gctccgccgt cagctcctcg gccccgggcg 96360
cggcagggcc gcgtcggccg gccacgatca catggcgcgc gccccgctcg accacccacc 96420
gtgcgacccg gccgcccaca ccgcccgtcc cgccggtgac caggaccgac ccctcggtcc 96480
gccactcctc gcccggcgca cctcccggtg ccgcgcgcag ccggcgcccc agcgcaccgt 96540
cggcgcgcaa agccacctgg tcctccgtgc cctcggccac caccgcggcc agccggctcg 96600
ccgtctcctc ctccgccgcg gccggaagat cgaccagtcc gccccaccgc ttcgggtact 96660
ccaacgccat cacgcgcccc agaccccaca cctgggcctg cgccgggctc accgggccgt 96720
cctcgccgac cgacaccccg ccgcgcgtca cacaccacag gcgcccgccg aagtccatct 96780
cgtccagtgc ctgaaccagg gcgatggtcc cggccagccc caccgacagg tgcgggtggt 96840
ccgggtgtac ggcttcgtca aggcccaacg cgctgatcac accggcgacc tcgggcccgc 96900
cctccagctg ccgcagacgc gtcgtgagcg gatcgcgcgc ctggtccccc ggcgccagcg 96960
cgagggtcgt gcaccgcgcc ccccggccct ccagcgcccg gacgaccgtc tcgaccgccg 97020
cgtcaccctc gagcttctcg ggcaccacca gcaaccatgt accgctcagc cggggggccg 97080
cgtccaagcc ggtgaccgtg gaccaggcgg tccggtagcg ccacccgtcg atccgggctc 97140
gatcccgtgt gcgctgacgc cactccgcca acaccggcaa cgccggggcc agcgcctgcc 97200
gcgcggtgtc ctccgccacg ttcagcgagg ccccgagccc ggcgagatcc cccgcttcga 97260
cagcgctcca gaagcgctcg tcgtcctccg agccctgctc cggagaactc ccccacctgc 97320
cgggcttcag ccagtagtgc tggtgttgga acgcgtaggt ggggaggtcc aggacgggcg 97380
gggccggggt gatggcgccg gtggggacat caggagtggt gtccaccccg taccaggctt 97440
tccagtccac ggacagaccc gcggtgaagg cctgcgccaa ggcccgggcc aactgcaccg 97500
caccaccctg atcccgctgc agggtcgcca cggtggccac agggacatcc aacgcctggg 97560
cggtctcctg gacacccaca ctcagcaccg gatgcggact cgactccacc agcacccgat 97620
atccatcgct cagcaacgcc tcaacagccg tcgcgaaccg cacctgccga cgcagattct 97680
ccacccagta cgccgcatcc agcgccgtgg actccacccg gccaccggtc accgccgaat 97740
agaacgccac agccgtctca accggcgtga cctgccccaa cacctcgatc aactcatcgg 97800
cgatgtcatc cacctgcgga ttgtgagacg cataatccac atccaccagc cgggcccggc 97860
cgccggccgc ctcaaccgcc gcgaccacct ccgccaccgc caccggcggc ccggaaacca 97920
ccgtcgaacc cggcccgttc accgcggcca ccaccacccc cgcacggccg ccgatcgcct 97980
tctcagcctc ctcccgaccc accgccaacg acgccatcgc cccacgcccc gacaacaccc 98040
gcaacgcccg agaacgcaaa gccaccaccc gcgcaccatc ccgcaacgta agcgccccgg 98100
ccaccaccgc agcagcgatc tccccctgcg aatgccccac caccgccgcg ggctcaacac 98160
catacgaacg ccacacctcc gccaacgaca ccatcaccgc ccacaacgca ggctgcacca 98220
catccacccg cgacacatcc accccatcct cgccccgcaa caccgccgtc aacgaccact 98280
ccacaaacgg cgccaacgcc acctcacact ccgcgatccg ccccgcaaac accggcgaag 98340
catccaacaa ctccaccccc atcccccgcc actgaccacc ctgccccgga aacaccaaca 98400
ccggcccagc acccccagcc accggatcca ccgacaccac accaccgaac ggcacaccct 98460
ccgccagggc ccgcaacccc gccaggagtt cattccggtc gtcgccgacg accacggccc 98520
ggtgttcgag catcgcccgg ccggccacca gcgaccaccc cacctccgcc acatcggcgg 98580
tggcctgctc ggcgaactcc gccagccgtc cggcctgcgc ccgcagcccc gcggccgacc 98640
ggccggagac cacccagggc accacgccgc ccggggtggt cggctccgcg ttttccccct 98700
ctgtcgacag tgcgggcgcc tcttcgagaa tcacgtgtgc gttggtgccg gagatgccga 98760
acgccgagac gcccgcccgc cgcggacggt ccacctccgg ccagtcggtc gcctcgctga 98820
gcagtgcgac cgccccgctg tcccagtcga catgcggcgt cggcgcgtcc acgtgcagcg 98880
tctgggggag caggccatgg cgcatggcca tcaccatctt gatgacaccc gccacaccgg 98940
cggcggcctg ggtgtgggcg aggttggact tgatcgaccc cagccacaac ggccgctccg 99000
ccgaccgccc ctgaccgtag gtcgccagca gcgcctgcgc ctcgatcgga tcacccagcg 99060
tcgtacccgt cccgtgcgcc tccaccacat ccacatccgc cggcgacagc cgcgcgttcg 99120
ccagcgcctg ccggatcacc cgctgctgcg acggaccgtt cggcgccgtc agcccattgc 99180
tcgcaccgtc ctgattgatc gccgaaccac gcaccaccgc ccacaccttg tgaccattac 99240
gccgcgcatc cgacaaccgc tccaacaaca gcaaacccac accctcgccc caccccgtgc 99300
cgtccgccgc ccccgcgaac gccttgcagc ggccgtccgc cgcgagcccc cgctgccgcg 99360
agaactccac gaaggcgccc ggcgtggcca tcaccgtcac cccacccgcc aacgccagcg 99420
agcactcacc gccgcgcaac gcctgcaccg ccgagtgaag cgccaccagc gacgacgaac 99480
accccgtgtc caccgtcacc gcgggaccct ccagccccaa cacatacgac acccgccccg 99540
acagcacaca cgcgaggttc ccggtagcgg cgtagccctc cacgtcactg ccggccatgg 99600
cggtgagcat catgtattcc tgcgatgtca caccggcgta gacgcccgtg ctgctcccca 99660
cgagaccgtg cgggtcgatt ccggcgtgtt cgaacgtctc ccaggcgacc tccagcaaca 99720
gccgctgctg gggatccatc gccaccgcct cacgcggcga gataccgaag aattccgcgt 99780
cgaaaccgcc cgcgccggac agaaaggcgc cccggcgcac ataactcgta cccgcgctgt 99840
ccggatcggg atcgaaaagg ttttccagat cccagccgcg gtcctccgga aactcggtca 99900
gcccttcgcc tccggccgcc accaactgcc acaagtcgtc cggattatcg accccgccgg 99960
gaaagcggca cgccattccg acgatcgcga tcggctcggg ttccgtggac tccgcttcgc 100020
gcagacgccg acgagtgtca cgcagttccg ccgtgaccca cttcaggtgg tcgagaagct 100080
tcgcctcgtc cttcgccatc gctggcgcct cctcttcgcg gggccgagat aagatcattc 100140
ttgacgggtt ctcttaagct gtcctaaccg acctcccgct ttcaccagga gtggttcaga 100200
aggtgcagct caactctttg aagccctcaa tgaagctcga tgccagccgg accggctcgc 100260
cctccacgcg gatgtccggc agtgaggcga ggagttcccg gagcatggcg atgatctcga 100320
tccgggcgag atgagcaccg aggcagaagt gcggccccac cgcgccgaac gacagatgcg 100380
ggttggggtc ccgtgtgatg tcgaaccggt acgcatcctc gaagaccttc tcgtcgtggt 100440
tggcggacca gtagaagagg aagatctcgt cgcccttgcg gaaccgatgg ccgttcatct 100500
cgcagtcgcc ggtcgcggtc cggcgcatcc agttgatggg cgtgcccacc cggaggatct 100560
cctcgacggc gcccttcgcg tggagttcga agtccgacag caggagttgc cgctgatcag 100620
gatgttcggt cagcagaacc agggcctggg cgatgacgtt gcgggtcgtc tccatgcccg 100680
cgttgatgag gaggatgaag aacgagacca gttcctggtt ggtcagttgc tctccgtctt 100740
cctgcacctg gaccagcttg gtgatcacat cggggccggg acgggcgagc cggtcctcgc 100800
ggagccgtcc gatgtagtct ccgaggtctc gcagggctcc cagcaccgcc gccgccatct 100860
tctccgggtc ggccgcgagt tcgggatcag cccctcccat gatcgtgttc gtccgctcga 100920
agaggaattc gtagtcttcc cccggaattc ccatcatggt gctgaggacg gcgatcggca 100980
tctccgccgc cgggcggatg aagtcgccgg ggccgcgctc gatgagctcg tccacgatcc 101040
gccgggccac ccgccgggac atggcgtcga acttgggcgc cattcctcgg ccgaacgacc 101100
gcgcgacgat gcgccgtaat ctggagtgct ccggattgtc catgttgatc atggagccat 101160
agaactcgtc catctccggc ggcaggatcg ccgtggcgcc ctccgaggag aagtcctgag 101220
ggcggcggct cacctcgcag atgtcggcgt gtttcgacag cgcgtaataa ccggatgcca 101280
acggccccca gggcagccgc ggcggaacga attccgggcc cggcagcgcg cgcagcttct 101340
cgaacacctc ggcccgctca tcacgcggcc gcagccagaa ggacggatcc atgaagtcgg 101400
gaggcctggt ggctttggcg gacgatgcgg gggtggcttc cgtaggagac acaacgcctc 101460
gattcaagct cggacaggta caaaaccggt ccgcgcaagt ctcgcaaccg cctcttaaga 101520
gttcctaaca caacggcagg ggacgccctg tttgacgccg gtggaacggg ctgtcaacct 101580
cggggcgtgc aggtgacgat tcgggacgtc gcgaaggcgt cgggggtgtc cccgtccacc 101640
gtgtcgcggg ccctggcgcc cggcggggcg gtgagccccg tgacgcgcga gcgggtacgg 101700
gccgccgccg accggctcgg ctaccagccc aaccaggcgg cgcgcgggct gatcaccggg 101760
cgtacgggcc atctgggcgt gatcgtcccg gatctgctga atcccttctt cgcggacatc 101820
tgcaaggggg tgcaggcgcg ggcccggggg cttggcctca cggtgttcgt gagcgacacc 101880
gagcgggacg agggcctgga gctggacgcc atccgcacac tggccccgca ggtggacggg 101940
atcgtgctgt gttcaccgca tctgagcggc gaggagctgg ggtcgctcgg ggacttcacc 102000
gacaagccga tcgtgctgct gcaccgcaag gagcccgggt tcggaagcgt cacggccgat 102060
ctggtggagg ggatgaccga cgcactgacc catctgcacg cgctgggcca ccgcaggatc 102120
gcgtatgtcg gcgggccgcg cagctcgtgg gcggcgcggg agcgggcggc cggtgtcgaa 102180
gcggtggccg cgtcggggtt ggtggagatc gtccaggtgg ggagcgtggc gccgcatttc 102240
gacggcgggg tgaccggggc cgccgatgtg gtgctggcca gcggggcgag tgcggtgctc 102300
gcgttcgacg acatcgtcgc gttcgggctg atcagccggt tcacggtgcg cggggtgcgg 102360
gtgccggagg agatgagcgt ggtggggtgt gacgatatcg ccctgtcggg gatggcggcg 102420
ccgccgctga ccacggtctc ggtgcccaag gcgcatggcg cgcgggccgc cgtcgatctg 102480
ctgtgccgca tactggccac cccggccgcc gagggcgagc agcctccgca gcgggtgctc 102540
cccacccatc tggtggtacg gggctcgacg gccgccctcg accggcggca gcgggcgtga 102600
tggatccact cgccggtctc ctggacgggc cgagggcccg gggggcgttt ctgctccgga 102660
tggtgatgga cccgccctgg tcggtacgga tcgaggaccg ggccccgctg tgcgtgatga 102720
ccgtggcgcg cggtgaggcg tgggtggtcc cggaccgggg cgaagcgcgg cgccttggcc 102780
cgggggatgt ggcggtcgtg cgcggccccg atccgtacac cgtggccggc gatccggcca 102840
ccgagcccca ggcgtggatc ctgccgggcg aggtgtgccg caccgccggg ggcgaggatc 102900
tggcggagcg gatggcgctg ggcgtacgca cctggggcaa cagcgaacac ggtgcgacga 102960
cgatgctggt cggcacgtac cggatggacg gggagatcag ccggcggctg ctggacgcgc 103020
tgcccccgct gctggtgctg gggcgtgagc gttgcgattc gccgttgctg ccgtggctcg 103080
gcgaggagat cgtcaaggag gaggcgggcc agacggccgt gctggaccgg ctgctggatc 103140
tgctgctgat ctcggtgctg cgggcgtggt tcgcgcggcc tgaggcgcgg gccccggcct 103200
ggtaccgggc gctgggcgac ccggtggtcg gccgcgcgct gcggctgctg cagaacaacc 103260
cgggccatcc gtggacggtg gcgctgctgg cggcggagac cgggatctcc cgtgcggtgc 103320
tggcccgccg gttcaccgag ctggtcggcg agccgccgat ggcgtatctg accggctggc 103380
ggctggatct ggccgccgat ctgctgcggg agccggacgc caccctcggg gcggtggccc 103440
ggcgggtcgg ctacggcagc tccttcgcgc tgagcgccgc cttcaagcgg gtgcgcgggg 103500
tcagcccgcg ggagcaccgc tccgcggcca gcgcagggtg acggcggggc cgcgagcgca 103560
gccgtcgggc ccgcagctga cggagcgttc gtacagcagg gccagccggc gggcggtgcg 103620
gatgatgtcg tagcgctcga cagcggcggg caccgggtgc cgtacggggc cggcggcggc 103680
ccgctcgcgc agggcgtcgg ccagggcggc ggtgccgggc ccgatgcggc gggcgccggg 103740
ggccgcgccg ggcggcagtt cgtcgagggc cgggcagacg ccgtgagaac cccgggagcc 103800
cggccgccag ggcttccagg gcggcgagcc cgaacgcctc ctccgcggac ggcgcgacga 103860
gcacatccat ggcggcaagc agcccgggga tgtccgcggg gcggtccccg gtcccggcca 103920
ccgccccgtc gcgctcgccg agcagccgga tccggtcggc caccccgagc caggcggcca 103980
ggtcgcgcag cgcggtgcgt tcgggtccgt ccccgaccaa caacagccgg gcctggggca 104040
gttcggcgac ggcccgcagc gccacatcga agcgcttggc cgccaccagc cggccgaccc 104100
cgccgacgac gaaggcgtgt gcggggatgc cgagacgggc ccgggcggcg gtgcgggcgg 104160
ccggggtgaa gctgaagtgg cgggcgtcca gcccgttggg caccaggtgg atgcggtggt 104220
tggcgacgcc ccagccgcgc agccggtcgg cgacggcggc cgacacggcg acggtcgccg 104280
agcccagtcg ttcggtggcg agatagcggg cgcgggtgcc gacggtgagc gggcggccgt 104340
cgatatggcg ctcacccagg ctgtgctcgg tggcgaccac ggcgccgaca cccgccagcc 104400
gggccgcgac ccggccgtgg acacaggccc ggtagagatg ggtgtgcacc aggtcgtagc 104460
cgccaccgcg gatgaaccgc gccagccggg ccgtccccat cgggtcgcgg acacccatcg 104520
gcagatgggt gacgggcaca ccgtccgccc ggatgccgtg gccggcggcc ccgggcgcgg 104580
tcagggtgag gacgtcacaa cgggcgggca ggtggcgcag cagcagccgc agttgctgtt 104640
cggcgccacc cgcgccgagg cgcgagatga tgtgcagcac tctcatcggg cgaccgggcc 104700
ccatcttccg tatatcgcac gctgaatcac atccgaccat gacagacggc gggatgtaaa 104760
tcacgtaaga cgcgcggcga taggccgtcg gagtggcggc ccgatcggca tatcggcgag 104820
gcgcgagggg gcgcgccgaa aatccgattt gcacagccat tccccagcgg attgttacct 104880
tctcatctcg cgcgatgaca tgacgaatgt aggccgtccg ggtcatcgcc gcgcgatatt 104940
gcgtcccgac gacccgcggg agtgcccaga tgtccgcacc tacccccgtc agagacacgg 105000
ccgagacggg agccgcgccc agcaccgctc cccctcccgc gcggcccgcg caccatgaga 105060
tggccccctg ccgccggggc ccgaccgaga gttatgagta ccagcggtac agccagctgg 105120
cgggcccgct gacccagccc ccgtccggcc gcccgtaccg ggtgcgctgc cggagcctgc 105180
tggcgcagga accgcaccgc gtccgcgcgg cgctgctgct gtgcgcggcg ccggtgtgct 105240
cggcgctgct gctgttgtgg ctgctgcagc cccagcactg ggtgcaccgc gacggtgtga 105300
cgggctggga gacggccgcg gaccggacga tgctgttctc gatcgcgctg atcgaggcgt 105360
tccgcctgtt caccgtggtc tccaacgcgc acgccacgct ggtggcccgc gacccggtgc 105420
cggtgaccgc cgagcccggg acccgggtgg cgttcctgac cagcttcgtc cccggcaagg 105480
agccgatcga catggtccgg gcgacgctgg agggggcgat gcaggtcaga cacgccgggg 105540
ggctcgacgt atggctgctc gacgagggaa acgacgagga ggccaaggag ctgtgcgcac 105600
ggctgggcgt acggcatttc tcccgcaagg gcatcgagaa atggaatcag cccaagggct 105660
ccttccgcgc caggaccaag cacgggaatt acaacgcctg gctggacgcc catggcgacg 105720
actatgaatt catggcctgt gtggacaccg atcatgtgcc gctgcccaac ttcctggagc 105780
gcatgatggg ttacttccgc gaccccgatg tcgccttcgt ggtcggtccg caggtgtacg 105840
gaaattacga tacggcggtc accaaggcgg ccgagagcca gcagttcctc ttccacgcgc 105900
tgatccagcg ggccggaaac gcctatggcg gcccgatgtt cgtcgggacc aacaacgcgg 105960
tgcgcatcgc cgcgctgcgg agcatcggcg ggctgtacga ctcgatcacc gaggacatgg 106020
ccaccggctt cgagctgcac cgccgccgca accccgtcac ccggaagaag tggcgctcgg 106080
tctacacccc ggacgtactg gccgtcggag agggcccgag cacctggacg gacttcttca 106140
cccagcagct gcgctggagc cgcggcacct acgagacggt gctcaagcag ttctggaagg 106200
ggccgttcac gctgtcgccg gggaagctgc tcaactactc cctgatgatc gcctactacc 106260
cgatgaccgc gctcaactgg atgctctctg ccctgtcctg cgcgctgttc ctggtgctcg 106320
gcgcgtccgg gatccatatc ccggcggaga tgtggatgat gctctacagc gacgccgcgg 106380
cgctccagat cggcctgtac atctggaacc ggcggcacaa cgtctccccg cacgaaccgg 106440
agggctccgg cggggtggcg gggatgctgc tctccgcgct gtccgccccg atctacgcca 106500
agtcgctgtg cgacgcgctg ctgcggcgca agagccggtt cgtggtcacg cccaagggcg 106560
actcctccag cgccgaccgg ctgtccacct tccgtatcca tctggtgtgg gccgcggtct 106620
tcggcggctc gctggtcgcc tcgctgttcc tgggccacac ccacgccgcg atgcgcacct 106680
gggcgatgct ggcgctggcg ctgtgtctgg cgccgatggc catctggtcc ggcgggctgc 106740
tgctcacacg gcgcggacgg cgcaccgccc cggcggtgcc gacggcccgc gccgaggccc 106800
actgaagctt gtggtcacct ggcgcgcaac ctggcgttac gggccaggtc gagcgcgtac 106860
tccgtccacc aacgacctgc gggcggaccg ccgttgcagg tcccgtcgga ctctcccgga 106920
cgcttgatcc acagatacgc gtcgatggcg gggtcgcccg tacgggtgct ggggggatcc 106980
cccagagccc gccccggcgg gttgcaccag ctctcctggc cgtcgctggc ggagcgcacg 107040
gcgcccatgt caccgctctt gccgtcgtcg ccctcgcccc tgacggaggt gagcgggccg 107100
ttgccgttgc ggctggtgtc gatgatgaag tggaccctgc cgagctggag ggacagccga 107160
cggccgtagg cgcgggtcac cgctgtcggc tggaagttcg agacgttcag ggcgaagccg 107220
tccgcctggt cgataccgct tttcatcagc ggctccacca gccgctcggt gtccttgatc 107280
caccccgggt tcccggcgtc cagatagatc cgggtggcgg gcatcgcgga gagcatggcg 107340
accgcgtcgc gcagcagggc cagccgcggc gcacgcagcg gcttgggaat gcagccgtcg 107400
accacctgag cgatcgcgtc gggctccaat acgacaatcg cccttcgctc gccgattccg 107460
gagacgatct cacggatcca gtccagatat tcggcgtcgc tgctcgcccc gccccgggag 107520
tatcttccgc agtcccggta cggaatgtga taggcgacca gcaccggcgt acggtgctcc 107580
cgcgctgcgg attcggtgat aaaccggacc cgttcgcgcg gatgcgggcc gagccattcg 107640
gcgatcggct cgtcggcgat ccgccgcatg agctccgcct ccgcgtaacg gcccgcgcgc 107700
cgctgctcac gggcctgccg ggcggccggg ttgtccgggt caacccatag cccgctccgc 107760
cggtccgacc cgcctgacgt gccgagacat ccgctcagcg cgaggagcgc ggcggcacat 107820
cccgcaagcg gcgcccaatg tcgcgcggtg gctctcccga gccctttctc ggccattcga 107880
gcacaccttt tccccggagt ttcccggcct cggaatcgac ggcacgtgtc gcatcatggc 107940
atcgggaacg ggcggcggct ttcaattcac gcttgtgttg ccccgccgcc atcggggcga 108000
atttcggaaa gggccgtgaa acttacccgt cgcggtcctc gtcggtccgc cagagcgggt 108060
gttcgcgggc cgcccaccta cggtcgaccg tcccggtgcg cagaccgcgc cgggcctccg 108120
ggtcgccgag cgccatgccg atgtgcccga gcagaacgat gccgacggcc agcgccagcc 108180
agtcgtggac gaacgtggcg ctgatacgcc agaccagggg ggccagatcg gtgaaccaca 108240
tcagcagccc ggtacccgcc atcaccagca cggcgccgct gatataggcg gcgtacagct 108300
tctgccccgc gttgaacttg cccgccgggc gcgccccctc gcgcgtgtcg cgccgcagca 108360
ccgcgcgcag ccagcggcgg tcgtgcgggc cgaagcggtt gaggcgggtc aggtcggcgc 108420
ggaaggcgcg ggagaccagg cccagcagcg tcgggaccgg ggtcaggatg ccggtccact 108480
cgtgcaccgt gaccaccagg gcgcggcggc ccacgagttc ggcgagctgc ggcagataca 108540
gacaggccgc gctcaccaca caggtgccca ggaggacggc ggtggtgcgg tggatccacc 108600
gctcggccgg ggtgaaccgg gccacgcggg ggcgggggcc ggtgcgggcg cgtgcggcgg 108660
ggcgtacggg ggcgggacca ccggggtgcg cgtcaggtcg tggcggcatc gtcgcgtccg 108720
ttcgacttgc cgacccaggc gtccacgtca tagccgcggt cctcccaata gccgggctgt 108780
acctcgtggg tgacctcgat accggagagc cacttcgccg acttgtagaa gtacatgggg 108840
gcgacgtaca gccgcaccgg gccgccgtgg gagtggccca gcggctcgtc ccgcatccgc 108900
agggcgacca ggacgtcgtc ccggcgggcc tgttcgaggg tcaggctctc gctgtacgcc 108960
ccgtcgaagc aggtgaagcg gacggccccg gcccccgggc ggaccccggc ggcgtccagc 109020
agcgtggcca ggcgtacgcc ctcgaaggag gtctgcggca cgcgccagcc ggtgacgcac 109080
tggacgtcgt ggacgatgcg gtgctggggc agcgcgcgga ggtcggcgag ccggtagccg 109140
gtgggtttgt cgacaagtcc gccgacggtc agccggtagt tctcggcgtc ttgtgcggga 109200
cggaggaggt caccgagtag tagcggaacc cgccgccgtt ggggagcagc ccggtcaccc 109260
cgatggggtc gttctccgcc gccacggcca ggaacttctc caggcggcgc tgcagccagg 109320
gcgcggcggc catgccgccc gcgcccaggg cgagcatgcc gaggatgacg cgccggccga 109380
cgggggtgcc gcgaccggcg gcgtccgggt cgggggtatc cgagccgggc gtgtccaggt 109440
cggcggccgg gcctggttct gtggcacccg ggtcagcggt gtccggatcg gcggcggggt 109500
gctccgcggc gccgggattg cggtccatgt ccctgatgag agcactccgg gggcggcggg 109560
cgccaggctc ttcggccttt cgtcagactt ccgtcagact tccgtcagaa gctggcgcac 109620
ccgctctgcc ccaccggccg ccctacccca ccgccttcgc cgccgcgcgg cccgccgtac 109680
ggccggagaa gagacagccg ccgaggaagg tgccctccag ggagcggtag ccgtggacgc 109740
ccccgccgcc gaacccggcc gcctcgcccg ccgcgtacac accgggcagc ggctccccgc 109800
cctcggccag cacccgcgcc gacaggtcgg tctccaggcc gcccagcgtc ttacgggtca 109860
ggatgttgag gcgcacggcg atcagcgggc ccgccttggg gtcgaggatg cggtgcgggg 109920
cggcggtgcg gatgaggcgg tcggcgaggt acttgcgggt gccgcgcagg gccgtgatct 109980
ggaggtcctt ggtgtagggg ttggcgatct cccggtcgcg ggcgacgatt tggcggcgca 110040
gctcggcctc gtcgatcagc ggttccttgg tgagttcgtt catccgccgt acgaggccgg 110100
ggagcgatcg ctcgacgatg aagtccgcgc cgtggtccat gaacgcctgg accgggcccg 110160
gggcgccgcc ccgcccgcgg ttcagcacat cgcggacgct cttgccggtc aggtccgggt 110220
tctgctcgga gccggagagc gcgaactcct tctcgatgat cttccgggtg aggacgaacc 110280
aggtgtagtc gtagccggtc cgcatgatgt gttcgagcgt gccgagggtg tcgaaaccgg 110340
ggaacagcgg cacgggaagc cgtctgccgc gggcgtcgaa ccacagcgag gaggggccgg 110400
gcaggatgcg gatggcgtgc cggggccaga tggggttcca gttctcgatg ccctcggtgt 110460
agtgccacat ccggtcgcgg ttgatcagcc gcgcgcccgc cgcctccgcg acgcccagca 110520
tcttgccgtc cacatgcgcg gggacgccgg agagcatccg ctcgggcggg gtgccaagac 110580
gctcgggcca gttctggcgg acgagttcgt ggttgccgcc gatcccgccg gaggtgatga 110640
tcaccgcttg ggcgcgcagt tcgaaggcgc cggtgacctc gcggctgctg ggggcgccgc 110700
gctcggcgtc gctgggctcc aggatctcgc cggtgacggt gtcgacggcg cccgcgctac 110760
gggacagtcc ggtcacccgg tgccggaagc gcagctggac caggccgcgg gcgaccgcgg 110820
cccgcacccg ccgctcgaag ggcgcgacga cgccggggcc ggtgccccag gtgacatgga 110880
agcgggggac ggagttgccg tgcccggtcg cgtcatagcc gccgcgctcg gcccagccga 110940
cgagcgggaa cagccgcagc ccctgggcgt gcagccaggc gcgcttctcc ccggccgcga 111000
agtccacgta cgcctcggcc cagcggcgcg gccagtggtc ctcctcacgg tcgaacccgg 111060
ccgtgcccag ccagtcctgc caggccagtt cgcggctgtc gcggatgcgc agccggcgct 111120
gctcgggcga gtcgacgagg aagaggccgc cgaaggacca gtgggcctgg ccgccgaagg 111180
attgttcggg ctcctggtcc agcaggatca ccttgcggcc cgcgtcggcg agttcggcgg 111240
tggccgccag gcccgcgagc cccgccccga ccacgatcac atccgcgtcg tacgccatgg 111300
tttctggtgt cccgttctgt caggagcgcg caggaaggcc ggtggccggg ccgtggtgcc 111360
cgggcaccgg tggtgtcggt cgatcctgcc taccggccgg tatggcgtca acccgccggg 111420
cgggcagcgc cccgctacgg gcttcacggc gcccggcagc gggcccccaa ctcccgcccg 111480
gcgcgccgac ctgtgtgacg taacgccaat aaaggggaca atcgatcgcc gagatggaca 111540
gcgggttcgg cgggcgggca tcatcaagca ccatggctga ccaccatcct tctccctcgc 111600
gaccgcgcag cgcggacaag cgtgtggtga gtcctacgct ggtcactccc ggccctcagc 111660
ccgtcgagct cgccgggtcc tgcgccgagg acgaaccgca ggtgctgcgg ggcgaggtgg 111720
tcggcccccg gcacgcccgc cgacgccccg tcagcgaccg gcgcaaggcc gcggccgggg 111780
cgattctgct ctccgccacc ggcgccgcca ccgcgctctt cctgatgctg ggcaaggacg 111840
gccgccaggc cgccgccgcg cccgcacacg acgcgccgtc gtcggcgccc gacgcgagcg 111900
cggacgacgc cgtcccggtg gccgacgcgg tggccggcgc caagccgctg ccgggcgcgt 111960
cgcgcaccgc cgagcccgtg gcggccccca gcgtcccgac cgcacagccc aaggccccgt 112020
ccacggcgcc caccaagccc ccggcgcgca cccagcagcc gacgcgcccc tcctggccgt 112080
cccagtgggg aggccagggg cggaccgccg aggactggca gcgggaccgg gaggaggcgg 112140
cccgctgggc gaaggagtgg gcggaccggt acggctccgg ccaagacggg tccggccatg 112200
acgggtccgg ccgcgggggg agccaccagg gcgggtcggg ctacccgggg tacccgggcg 112260
gagggcacgg aggacagtgg cgctgaccgc gccccgtact gcttgggccg ccgttctgct 112320
tggatggcgg catgtctgac aaggaaccgc cgcaggagcc gcgttccgcc gatgagctgc 112380
tggacatcgt ggacgagcgg gatcgggtgg tgggccaggc ccggcgagcc gacgccatgg 112440
cccgccggct gcgccatcgg accgtcttcg tgctgacccg ggacggtgcg gaccggatct 112500
tcgtacatcg ccgcaccgcc accaaactcg tcttcccttc cctgtacgac atgttcgtcg 112560
gcggtgtcgt cggcgcgggc gagtcgtacg acacggcggc gctgcgggag gccgaggagg 112620
agctgggggt atccgggctg ccggcgccga cgccgctgtt ctccttcctc tacgacacgc 112680
ccgagcacac ctggtggtcg cggatctacg aggtccgctg cgagctgccg gtcgccccgc 112740
aggccgagga gatcgcctgg cacgccttcc tggacgagga ggagctgggg cggcggctgg 112800
cggagtggga gtgggtgccg gacggcgcgg aggcatatcg gcggctgctg gagtggcgcc 112860
accagggccg gtctccggca ggtccggagg gtcctaacgg ctttccccgc gattagcccc 112920
ttgccatggc catgcgtgcc ggccggcacc gtgcccacgg ggaaggtgac cgcctcgggc 112980
aagccaggga tacggaaggc cacctggtgg gaccacaccg gccccctcgc cgtcgtcttc 113040
gcgggcggcc ccgacagcaa cgccggtgac ctcgtgctgc ctgtgcgtat cccgcagggg 113100
cccgggcagt gggcgcgagt ggaacacttc ctcgcaggcc ccggccgctg gcacaaggtg 113160
gaccttgtcc gccgacgcaa agccagtgcg ccgggcggct gggtgtatga ggcgcacctg 113220
atgattctcg gccccggcta caccgccccg gcggtacagc ggatgcggca gcgggccgcc 113280
gccctggacc ggatcggcgg ggtggacggc aacgtctcca acctctcgat cgtctccttc 113340
cccgccggcc tggaccccgc cgagggcgcc cccgcgtcca ccgagatcac cctcaccgac 113400
gccgagcgcg ccctgttgga gagacaggcg aagaagcggc gcggccgtgc ccgtgcgctg 113460
gaacgctccc gccgggccac gaacaccgcc cagtacgggc tgtcgaagaa gcagacccgc 113520
cgcgctgagc acagggccgc gaagggcttg cccgtcaaga ccgtgacggt tccaggcggt 113580
gcccgcgccg cccgtacgga cggggttccc aagcaggcat tccgccgcga ccggctgtcc 113640
gagggctacc ggaatctgcg cgcccgccag gccgaagccg ccgccagcgc cgccgaacac 113700
cgccgccacc gagcccgcgc cgtggcccgg gagatcatcg ccgcccacgg cgtgaacctc 113760
acggtcgagg actgcgacat ccgcacctgg taccggctgt ggggcaagca cctctcccag 113820
accacccctg gcatgctgat cgccgccctg gaccgggaat gccgggcagc gggcggacga 113880
ctcgtccgcg cctccacctg gtcaaccgcc ctttcccagc actgcctgtg cggccagcgg 113940
gtcaacaaga cgctacgcga ccgcgaacac aagtgcgtcg cctgcggcct ggtcggcaag 114000
cgtgacctcg tatccgcagc cctggccgcg ttcgtccgcc tcaccgacgt ggacgacccc 114060
aagaccgcgc acctgcacaa cgccatgtcc cggcacgcac agatcacata cggacaaggg 114120
ctggaagagg ccctgcgtga gtcaactaca ccgaacccga aaccggttcg ccggccggga 114180
cgcgtggcag tcccccacca gcgggagacc tctgctcacc gaaccgccgc aaggcggccc 114240
cgagcaaccc cggatgagac acgccccgcg cgtgaccacg ccggaaagcc cggacgccgc 114300
cccggctgcg acccgcagct cgccctctgg tgaattgcgg aacaagtctt agtaggcttg 114360
tcccatgatc gacgccaacc gcacgccgac ccggggcagg ctcgcctccg ccgtcgcgga 114420
tgtgcggctg tggttcgcgc ccgagcggct gcgcgacgag ggggagaccc ccgactaccg 114480
tttctccctc gccaacgaac gcaccttcct ggcctggctg cgcaccgcga tggcgctggt 114540
gggcgggggc ttcgcggtcg atcagttcct gccggacacg aacagcgcgc tgcggctcgc 114600
ggcggcgctc acgctgctgg cggggggcgc ggtgtgcgcc gtgcgggcgc tcaaccactg 114660
ggtccgctgt gagcgggcga tgcgccgcgg cgaggacctg ccggtctccc gcttcccggt 114720
cgtgctgggg ctggccgtcg gggtgatcgc cctggtgatg gtgggcctgg tcgcggtccg 114780
ctggggccgc tgagccgatg gccgggcccg actcgcccac ggggccgcag gacgaccccg 114840
agcgacccga tccccggtca cccgaatcgc ccgatccggg gtccgagggt ggtccctgga 114900
agcgggaccc cgggctgcag ccggagcgga cccggctggc ctggcggcgt acgacactgg 114960
cctgtgccgt cgccgcgtcg ctggccgcac ggcaggccct gcacaacggg accgggccgg 115020
tgtccgtgct ctgcgcggcg ctggccgcgc tggcctggct cggcttcctg ggtgtggcgg 115080
gccgccgcat ccgggcggtg tccgcgccgc gtccgcccgc actgtccccg cgggcggccg 115140
cggcggcggc aggctgcacc ctggcgctcg tggccttcgg cgccgcgctg acccgatgac 115200
gcgctgatct gccgaagccg gcccctaccc gaccgtgacg acgatcttgc cgcgggtgtg 115260
gccctcctcg ttgaggcggt gcgcgtcggc ggtcttctcc agggggaaga cggacgagac 115320
ctccacgccg agggcgccgc gctcggcgat gtcggtgagg tcctggaggt cgttcgggtc 115380
gggccgtacg aagacatagc ggccgccgag ggcgaggaca gtctggtcgg tgatggaggc 115440
cagccgcccg cccggggcca gcagcgggac ggattcggtg agcgcttcgc cgccgacggt 115500
gtcgaagacg gcgttcaccc ctccgggggc cagcgtccgc acccggtcgc ggagcccgtc 115560
cccgtaagtg atcggctcgg cgcccagctg gcgcaggtac tcgtggttgc gctcgctggc 115620
ggtgccgatg acgcgggcgc cgaggtgccg ggccagctgg acggccatgg agccgacgcc 115680
gcccgcggcg gcgtgcacca aaacggtgtc gccctcggcc acctccaaca ccttgaccag 115740
cgtctggtac gccgtgagac ccgccagggg aacgcccgcg gcctgcgtga acgtgagatt 115800
ccgcggcttg cgggcgagcg tacgcaaggg ggctgcgacg tattcggcga aggtgccccg 115860
ggacaggaag tcctcccgga cgtatccgat cacctcgtcg cccgccgtga actcggtcac 115920
cgccgggccg gtccgttcga cgacgcccgc gacgtcccag cccgggatga ccgggaagac 115980
cgcctccagg acggagtcca ggtagcccgc ccgggccttc cagtcgaccg ggttgacggc 116040
cgcggccctc accttgacca ggacggtgtc ggggccgacc ttgggctcgc cgacctcccc 116100
gtactccagg acctcggggc cgccgtagcg gcggtaggtg atggccttca tgggaggctc 116160
cttgcgttcc tgcgcggtgt gagggccgga cggcgagcgc ggccggagcc cgcgctcccg 116220
tcttcgacgc tcccgcgccc gccgacccgg cgcaaaccgg gcgggctagc ccacgctgaa 116280
gttgcggcag gcggagctgt cggacctctc gacggcactg cttccgacag ggatcgcctg 116340
gttacccacg ccgcccttca tcggactgcc tttgtcaact actcgtcgat aaatcgatga 116400
gcttgcacaa cgggcgtcac tggctgcgac acgtcgcttg cgtccagctc caccggagcg 116460
gagcaggggc gattgactgc gccccgcacc tatgcagtcg aacgtcccgt gtgaaaccgg 116520
gtccaccggc aaaaccggat ctgtgccttc ggagactccc gcatgggagt agcaaaatac 116580
cgcaatctgc gcagcgttga ccagctccgc cggccagagg ccgcgaacga ctcagcactg 116640
gactccatac cgactggtcg gcatcatgat gcgggaccgt ccgaccaccc gtcccgcgcc 116700
cgcccgcccg cagcaccctc aacggaggca gagacccgat gaactcaggc aatccgccag 116760
gactcgacct cgaacggctc cgcgcccatc tggaccgtga gcggccgggg ctggtgcgcg 116820
ggccgctgcg tgccaagctg atcgagggcg ggcggtcgaa tctcacctat acggtggacg 116880
acagcgccgc ccgctgggtc gtgcgccgcc caccgctggg ccatgtgctg gccaccgccc 116940
acgacatgcg gcgcgaacac cgggtgatcg gcgcgctgca cccgaccagc gtgcccgtgc 117000
cggagccggt gctgctgtgc gaggacgatt cggtcctcgg atcgccgttc tacgtcatgg 117060
agttcgtcga gggcacgccg taccgcacgg ccgaacagct caccccgctc ggcccggagc 117120
gcacccgccg tgtcgtgctc aacctggtgg acaccctggt cgagctgcac gccgtcgacc 117180
cggccgccgt cggtctcgcc gacttcgggc gcccggaggg gttcctggag cggcagctgc 117240
gccgctgggg caagcagctg gccgcctccc gcaaccgtga gctgcccgga atcgacgagc 117300
tgcacgaagc gctcggcaag gcgctgcccc tcgccgcttc ttctggcgcc cccgccgtgg 117360
tccatggcga ctaccggctg gacaacgtgc tcgtgggcgc cgatgaccag atcaaggcaa 117420
ttctcgactg ggagatgtcc acactcggcg atccgctgcc cgacctgggc ctgctggtga 117480
tgtacagcga gcagcaggag acgtccgact cgccgatcac cacgaccagc ggcgcccccg 117540
gccaccccgc gccgagcgag ctgatcgagc ggtacgcggc cggctcgggc cgcgatgtcg 117600
gcggcgtcgc ctggtacacg gcgttcgcgt acttcaagct cgccgtgatt ctggaaggca 117660
tccactaccg cttcaccctc ggccagaccg tcggcgccgg cttcgaccgc atcggcgaag 117720
tcgtcccggt gttcgtcgag agcggcctca ccacgctacg gaagggctga gacagccatg 117780
gacttcgcat tcgacgcccg aaccgaggag ctgcgggcca agctgctcgc cttcatggcg 117840
gagcacgtcc acccggccga ggcggtggcc gaggagcagc gcgcccggct ggactccccc 117900
tggctgaccc cgccggtcgt ccaggagctc aaggcggagg cgcgcaagca gggcctgtgg 117960
aatctgtttc tgcccgacgc cgagctgggc gcggggctga ccaatctcca gtacgccccg 118020
ctcgccgaga tcaccggcca cagcccccag ctggccccca cggccctgaa ctgcgccgcc 118080
cccgacaccg ggaacatgga ggtgctggcc cagttcggca gcgagcagca gcgcaagcag 118140
tggctggagc cgctgctcgc gggcgagatc cgctcggcgt tcgcgatgac cgagcccgag 118200
gtcgcctcgt ccgacgcgac caacatcgag acccggatcg agcgggacgg cgacgactac 118260
gtcatcaccg gccgcaagtg gtacatctcc ggggcgatga acccggactg caggatattc 118320
atcgtcatgg gcaagaccga tccggacggc cccgacctcc gccgtcagca gtccatggtg 118380
ctggtggagc gcgagaccga gggcgtcgag gtgcgccggg ccatgcgggt ctacggctac 118440
gaggaccact atcacggcgg ccacgccgag gtggtcttcc acggtgtgcg cgtcccggcc 118500
gccaatctga tcggggagga gggcggcggc ttcgccatcg cccaggcccg gctgggcccc 118560
ggccgtatcc accactgcat gcggctgatc ggcatggccg agcgcgcgat cgagctgatg 118620
tgccgccggg cggtgtcccg tacggccttc ggcaagccgc tcgcccggca gggccaggtc 118680
cacgcgtgga tcgcggacgc gcgggtggcc gtcgagcagc tgcggctgct ggtgctcaag 118740
acggcctggc tgatggacac cgtcggcaac caggcggcgc acaccgagat ccaggccatc 118800
aagatcgcca ccccgcgcac ggtcgtcgac atcctggacc aggcggtgca gctgcacggc 118860
gcgggcgggg tcagccagga cttcccgctg gccgagctgt gggccgcggc gcggacgctg 118920
cggctcgcgg acggaccgga cgaggtgcac cagcggtcgc tggcgcggcg ggagctgaag 118980
aagtacctct aaaaccccct gaaaaccgct gtcacggccg cagggcgcgc agcagcaggt 119040
cggcgaggtg gtcggcgacc tgctgggggc tgagcggtcc gtcggggcgg taccaggtgc 119100
ccaggtggtg gatcgagccg aagtggtagt ccaccaccag gtcggccggg gtgtccgagc 119160
tgaacacccc ggcccgctgg ccctcctcca ccagggcgcg gaaccgctcg tggtagcgcc 119220
gccgctccgc ccgtacctgc ttgtgcttct ccgggctgag atggtgcatg gaccggaaga 119280
agatcgtcgc gtcgtcgagg ttgtcgatgg tggtgacgac gacgtcggcc gccgcgtcgc 119340
gcagccgctg ctcgacgggc gcgtccgcgt ccgcgaaggc gtccaggcgc tcctgctgca 119400
gccgtagcac ccgtccgtag atctcgtgca gcagatcgtc cttggagccg aagtagtggt 119460
agagggcgcc cttggtgacc ccggccgcct cgacgatctc ctggacggag gtccggtcgt 119520
agccgcgctc cgcgaagagg cgggtggccg cggccagcaa gcgctgtggc accggtttcc 119580
cgtcaccgtc cctcatcctg gccatctcgt cgccacctgc ctctcgttgg tgccgggttc 119640
gctactcccg ggggctccgc gactcccgca gctcccgtct gaggatcttg ccactggccg 119700
tcttgggcag ctcgggcagg atctctatcc ggcgcggata cttgtacgcc gcgagccgct 119760
gcgcgcagta cgaggtcagc tcctccggcg ccacctcggc gcccggccgc aggctgatgt 119820
acgccttgac cgtctccccg cggtacgggt cggggacgcc gacgacggcc gcctcgcgca 119880
cggcggggtg ggtgtacagc acatcctcca cctcgcgcgg ccacaccttg aagcccgagg 119940
cgttgatcat gtccttcttg cggtcgacga cgtagagcca gccgtccggg tccatgaaac 120000
cgatgtcccc ggtgcgcagc tccccgtcgg gcagggccgc cgcgctcgcc tccggtgccc 120060
gccagtagcc gggcaccacc atggggccgc gcaccgcgat ctcgcccggc tcgccgaacg 120120
gcacgtcccg gccctcgtcg tcgatgatgc gcaccacgga gtcggggccc ggcaggccga 120180
ccgccagagt gcccgagacc gggtcgacgg gggcccgctt accgggcggg acgctggcgc 120240
agggggcggt gcactcggtc aggccgtagc cgttcttgag gtagaggccg aagtccgcct 120300
cgaaccgctc gaccagcgcg ggcggcaccg gggccccgcc cgaggaggcg agcaggaagg 120360
acgcgaagtg gtcgcgggtc accttcgggt gtgccctcag cgccatgaag gcggtggagg 120420
ggccgacggt gtacgcgggg cggtgctcgg cgaaggcatc gaggacgaca cccggctcga 120480
agcggtaggc catggcgagg gtcccggcgt ccgcgatgga ggtggacagc tcgcagacca 120540
tgccggtgat gtggaacagc ggcgcgagca cgaagatcgt ggacccgtcg ggcagctcgt 120600
gaccgatatg ttgccgctcg gcgttgtagg cgatgttgcc gtgggtgttc agggcgccct 120660
tgggggtgcc gctggtcccc gaggtgtagc tgatcagcgc gatgtcgtcg ggtcctggct 120720
ccggcacgtc cggccgcgcc gcgccgccgc gggccgccgc gagcagatcg tcggtgtcct 120780
cgggcgcggg gatgcgccca tgggtgagga cgcgggggtc gtcgcgggtc tggaagtcgt 120840
acgcgtcggc ggtgagcgcg atgcgtacgg aggaggcggc ggcggccgtt tcgcgcgcgt 120900
actcctccca ggcgtggccc gagcacacca gggcgctgac ctcggcgtcg gccaggatgt 120960
gaccgagctc gccccccttg tacatggggt tgaccggcac caccgtgccg cccgccttcc 121020
acgcgccgag cagggcgagc acgaagtgcg gggtgttctg caacatgatc gcgagccggt 121080
cgccgggcct gaagccccgc gcggcgagat gcccggcgat gccgtcggag agctcatcgg 121140
cctcccggta ggtcacctgc ccgtcgaagt aggcgaacgc ggcccggtcg ggcctggtgc 121200
gcacggcgtc ccggaaggcg tgcagcacgc tgggccgggg ccggatcggt tggcgctgca 121260
cctcggtgag ctgagctagc catggctggt cctggtatga ggtcatgagg gcggccctgt 121320
ctgctcggga gatgggggtg cgggggcgta cggatcagcc ggtggccccc attaacggct 121380
cgtttcgccg gcttcgacct tgcgctggag gtggttcatg tgcgtgagcc agcggtcggt 121440
gtcggtggcg cgggccgcgt agtactcggc gacctcgggg tgcggaagga tcaggaagcg 121500
ctcgtgcgcg atcccgtcga agagggcgtc ggcgacctgc tcgggctcga tcgccgtcgg 121560
aacgaggatg aggtcgccgg tggggccggt ggcgcgcagc atatcggtgc ggacgccctg 121620
cgggcagatc gcgtggacgc tgacgccgcg atggcggtag gtcgtggaca gccactcggc 121680
gaaggcgagc gcggcgtgct tggacacgct gtacggggcc gagccgacca tggtcagcag 121740
accggcggcg gagacggtgg agacgaagcg gccgctgccg cgctccagcc actcgggcag 121800
cagctcacgg gccgcccgga catgcgccat gacgttgacg tcccaggcgt ccgcccacag 121860
cagctcgtcg gcctcggggc cgccgatggg ggcgacaccg gcgttggcgc agtagatgtc 121920
gatgcccccg ccgagcgccg cccgcgcctc ggggaccaca tgggaggcat cgcccggcac 121980
ggccaccgcg ccgatctcgt cggcgacggc cttcgccttg tcggcgtcga tgtcgttgac 122040
cacgacccgg gcgccctcgg cggcgaagcg gcgggccagg gcggcaccga tgccgcctcc 122100
cgccccggtg accacggttc gcgctccacg tacggcgtct ccacgcacgg tatccacgca 122160
tgctccttcg gccgggggtc ggctgcgtcg ctcagccgcg tcgctcagct gctcggcaga 122220
ctaaccagtc ggtatgtgta aacggaaggg gcacgacggg gtacggaagg gaagcgccgg 122280
ggtgccgcgc ggacggggcg gccggatcca cttccggccg ccgggctcac ctcatagggc 122340
gaagcgagcg gtccgcggtc gtgacggctc cgaaggtttc tccggcaccc cgtggccacg 122400
tggcgtcgag gagaccgtct ctgtgcttac tcagcaaggc ttactcagca agcagggcca 122460
cggtctcctg tagccgctcg cacgcccagt cgtactgctc ggtgccggtg tgtgcccgga 122520
agttcgtctc gatgatctgg atcagagcga ggtgaaggtt gtacagccgc cgacgggtcc 122580
gctcggcggt ggtgaacggc cggtggccgt agccgcgaat gaaagccgtg gcgtccccgt 122640
aggcgctcga ctcactcccg gcgaagccga actccatcag cgggtcaccg tagaacgccc 122700
gctcgtggtc gattactgcc acgatccggc cgtcgcccac catgcagttg ccgggccaca 122760
ggtcccactc gacgaaccgg ggctcggtca cctcgtccaa cgaatcggcg tgagcggcca 122820
cgacttcgcg gatcacgtcg tagccgtggg gcaggtcgac gccgcgtcgc tcgccgtcgc 122880
gcagcaactc ctcgaccatc cgcaggaagg ccgcgcgcca cgtgctttcg cccggcccgg 122940
ccagcgggcc gaaggcggtg ccggggatgg tgttgagttc gcgggtgatc gcgccgagcg 123000
cctcgtcgta cgcgttgatc tccgttccgg ccacggtgtc cctgatggcg tggaggttgt 123060
ccgcatctat gtacgtcatg aagaagtagt cggcatcgca cacctcatgg ctctgatcgg 123120
cgaagtcgac ctctggcact ggaaccttcg tgtgctcccg gatcaaccgc agcgccgcca 123180
gttcggtggc catggcgccg cgctcgtagg tcatgacctc gacatcgggc ggcggtgcga 123240
tcttgaggac agcttgcgta ccggatcgca gccggatccg gtaggccacg ttgaaccagc 123300
cgtcgctcag ttcgctgacc cagtcctcgc ccgcgtcggg cacctcctcc gggccgtagg 123360
cgcgggcgac catagcgcga agcgcctcaa cgggctgccg gttctttgtg atgctctcca 123420
tcaaccctcc acccgaagca tgatcggacg gtagcaccag gacggagttg tgcactcggg 123480
tctcctgcga tgccttgcta ctgctgcggg cgggcaactg gggctgccgc agccggttcc 123540
aggtcgcgtc ctgttgggcg cggaggccgc gcccaggagc ccgcggctgg tttcggtggc 123600
gttcggggtc gggccgtcat cagggaagat ccgcggcgat acgcacggtg gttgaaccga 123660
tcaggcgcgt ggttttctgc tggtcgtgaa gagcacctcc aagcactcga aaacgcttga 123720
ggccgcgccg gccacggccg aggatgccgt gggccggatg gcgacccgcc cggtgggcag 123780
gctgctgtgg gagaactgcg tacagaccat cgcgtccgtc agcatgttcg gcttctacgc 123840
cctgacgaac gcctggttcg tcagccacgg ggtcggcaat caggcgatgg ccgcggtcaa 123900
cctcgtcgcc ccgctgctgc tcctgctcgg agccatgtcc accaccgtgg gcgcgggcgg 123960
cgcgacgctg ctctcccgcg cgctgggcgc cggcgacgag cgcgccgccg cccgcgcggc 124020
gggcaacgcc ttcacgctgt tctgggtcac cgcggccgtc accaccgtgg ccgggctggt 124080
cttcatcgac ccgctcctcg acgtgctggg cgcgcgcggc tcactgcgcg agtacgcccg 124140
gcagtacgcg gtgatcctgc tgtgcgggtc agtgtcctac accgggttct ccagcctggt 124200
gcgcgccgag ggccgggtcg ggttctccac tcggatgtgg atcggtgccc tcgtggtgca 124260
gatcgccctc gacccgctgc tgatcttcgg cctcgacctc ggggtgagcg gagctgccct 124320
gggcaccgtc ggcgggcaga ccatatccgc cgtgatgagc tggtggttct tcttccttcg 124380
ccgcgagcgg ccgtaccgcg tccgcctcgc cgacctgcgc ccacacgccg tgaccctgag 124440
cacgctggtg ggcatcgggc tgccgtcctt ctttgccaat atcggcctca ccgtgctcgc 124500
cgtactggtc aacagcaccc tcgcggtgac cgggggagtg atagcgctga ccgcgtacgt 124560
cgtctgctcc cgcctgcaga ccttcgccgt gatgccgcac acaggcatga gccaggggct 124620
gcagcccatt gtcagctaca acgccgggcg cggcctggac gcccgagtag cgcgaacccg 124680
ggcgctcgcg ctgcgggcct cgctcgtcta cggcgcggcc agcgcgatcc tgctgacggt 124740
cctcgccgaa ccgctcgtcg gcatcttcgt cagcggcggc ggagccgcgg aggaggccag 124800
gttcgcgctg tgcatcatcg ctgtcggcgg gatcgtcacg ggtatcgccc cgctgaccgc 124860
ggcctatttc caggctctgg gacgcccgct gcccgcctac gtcctgacca tcggcaccct 124920
cgcatcctgc tgaaggcgcc gctcgtcgtg gcgttcggcc tgatctgggg cacgcacggc 124980
gtatggttcg gcctcgccgc cggcgagttc gccaccgcgg cgagcgcgct gctgctgttg 125040
caccgcgccg accggcgcgc gccgccccgc ggcaccggat ccccccgctg cgtcctgaga 125100
cgtccgatga cgcgacgggc gaaaccggga ctccggccgc aggacaccga ccggcgaccg 125160
cgagcgagcc cgccggtcac gacggcccgg tcggttgccc gttactgaga cacgctccta 125220
gctgccgctc tccccgccgc ccgagccctc cccggcctcg cccgggttct tcatcacgct 125280
ccgccagcgg gcgttctcgc ccgccagggc gtcccacagg acgttgagcg ggtgggcggc 125340
gtcccatgcg gtgcgctccc caccgcgtac gaacaccccg accagcacgg ccagatccgg 125400
cccgaggtca tccgccagct ggagcgtccg cagccccgca ccgctctgca gccgcccgtc 125460
ggcgaccgct tcgccgatcc cctgggtcac cagcacacag ccgcgcagca cgcccgagga 125520
gagcagttcc aggccgtact gcacggtgtc gatctcggcc gcgatatgca gctcctggcg 125580
gtagcgggcg ccgaaccagc cgcgcagaaa gccgggtatc aggccgtcgg cggagaccgc 125640
gagcggcagc ccggggagtt cgcggaccga cagcccgggg cccggcagtt cgtcctcgga 125700
gaggttggtc accagggaga gcccgctgcg ccgccactcc atcacctcga agccgtccag 125760
ccggctgtcc ccgtcctcgg tggtcagcac ggagccgcag accaggtcga gttccttgga 125820
gccgagccgg tccagcagat cgcgggtgcg cacatgctcg accttgagct ccacaccacg 125880
gctctcgaag tcctcggtca ccagttccac ggcgcccagc agaaagccga gggtgtagcg 125940
ggtggagccc acgttcaacc ggctgccgag ccggcggcgg cagccgtgca ccgagtccag 126000
ccagtcctgc agcgtgccgc gggcgagttt ggccagggct tccccggtgg gggtgaagag 126060
cacgttacgg ctccggccgc gcttgagtac gagcggttca ccgcacagcg cgccgaagtt 126120
acggttcatg gtgtcgagtt gtttctgaac gctggactgt tcccgcccca gcagccgggc 126180
cgcgcccagt gcggtacccg cctcgtggac cgcgagcagc gtgcggagct ggtccatcgt 126240
ggtgtcgagc agcgccgtgg ggtactgcca ccgggggttt aagggcatac cggccttccg 126300
ctccgagcct acgaattccg gcttcgcagc atagtccaat gggctccggc aaactgatcc 126360
ggagaacagc ggcaattttc ttgcgatttg tattcgggaa ttcgacagac tctattccac 126420
aggggcgatt ccggatttcc cgtcgctgtc ggcggcgctc ggctgggccg cggggcttgc 126480
cagtgccacg cgctacacgc tggccgcggc ctcccgggac gcgagccgga ggtgcggcgg 126540
gtcggcgtgc tgtacacgct cgcctcccag ctggccgggg acagggcctc gctgcggggc 126600
ggcccgatcc gcgctgggga caacggggta cggcgtcagg cccggcccga taccggcccc 126660
cctcatgccc gagtgaccga gcactacatc tccgattgaa gctcagtcag gttttctcaa 126720
cggtgatcag ttccatatcc cgtgaagacg agggcttcgt tgccgctcac gagtcggtcg 126780
atggttcacg tcgggcggcc cgtcaggagg cttccgggcc gtgagcgcac ggagggacaa 126840
cgtcgccgat cacgctgacg cgcgtccaga gcccgtcacc gtcgtcgagg tcaccgcaca 126900
agtggaccgc cggccatccc cgggccagcg ccgggcgccc gtccccgtcc aggctgtcgc 126960
ccacaacgag gcacacggcc tgatcgcggg tgaacgcgtc gctaacggcc tggaaaaacc 127020
gcggatgcgg cttggcatag ccgacctccg aggagacgaa caccctggcg acgtgctgtt 127080
ccagaccgct gaggctcagc ttgagtcgtt gcaggcggga ggccccgttc gtgatcaacc 127140
agacctcgtg gttgcgagcc attcgctcca cgaaccccgc cgcccccggg aagggccgca 127200
ccagcgcctg ccgatgccgc gcgaactgcc gcgtggcatc ctcacgatcc gcgtctgcgc 127260
ccgctgttcg cagcacctcg ccccagacct ccacgcggta tcgctgcccc cacgtggccg 127320
cctgcggcac cgacgcgagc gtcagatccg cccaaagcgc ctcccaagag ctgatcccca 127380
ggtctctcag gcgctcgaag tacggccctg agcgccaatg agaccgggcg acatcgaaca 127440
cgctctgcac ggtctccgaa ggacgcgcta ctccgaccgc gtgcagagtg ccctccacgg 127500
ccgcccaggc ggcgcttcgg tccgggagca gtgtgtcgtc caggtccacg aggacgacga 127560
acctacttgt ctccacacaa cctcccgctg acaccgcgca ttgcagccga cctggcggcg 127620
gacaagcgcg aagcccctgg taggacaggc ctcaccacaa gatcatgtcc gtaattacca 127680
gaggcttcac gttggtcacc tatgttgcca cgctcgacgt accgcgccat gtcgtcgagt 127740
tcctcgccag cctgtcggcc gcccaccgac ggcggatcgg cacgccgaag ggctcccgcg 127800
cgctcgggcc gtttcgtcag gcagtagctg gtgctgcgct ggtcccgcga gcgggactgc 127860
gtgcactgcc tggcccgcga cgccggcgtt tcccaggcca ccggctaccg ctaccttcac 127920
gagggcatcg acgtcctcgg cgaccagacc ccgtgggtcc accgcagccc cagtactcca 127980
cttggaagta ccggggctgc ggcgttgcat gatcagctca gtttcgtatg ccacttccct 128040
cgggcgagga ttgtcgttcc gtccggcttg atgtacacct cggcgccgcg cacgggaagc 128100
tcaccctgcg tgagccagta ggtacggcac tcgtcgagca ggtcccacag gcgtcgcggc 128160
ccgccctggt gcacggttgg ccgatcgtcg ccgaccgcgg ttgcacgggc ccacgaacca 128220
tcctcgtgtg ccatgagtgc gacgcgtcgc tcgccgtcgt cctcatacga atgctcgata 128280
ccgggggcct tgaggctcag catcgaatcg aggtcccacg cgttcgccac gtcgacgatc 128340
ggataagggc cgagcgctac ctcgtcgccg tcctgctccg cggccttggt gagcacatcg 128400
ccgacaccgg gcgggtagtc atcgccggcc cgggcgtgca tgaaccctgc ccggtcccac 128460
tcgacgcgcc ccgaagcgct cccgtcccgg ttcttctccg cagtgatgag cagagaggtg 128520
ccggcgaggg tggtgacgag tcgccccccg gggcgtagca ccgtgagcca actcgcggga 128580
accgcgcgca cggcgaccgt ggcgacgatc cggtcgaaca tcgacgcggc gaatggaagc 128640
tcccccgtcg cgtcgacggc ctcgacggtg gggccgacgc ctgcctcggc gagacgcgag 128700
cgcgcggcct cgacgaggta ggggtcaaca tcgacgctcg tcacctgctc gtcgctgagg 128760
cgccgggcgg cgacggcggc cccgtacccg cttcccgtgc ccacgtcgag caggtagtcg 128820
ccgtcatcca tgcgggcgtg ctggaacatg cgcaccacca ggctcggcag ggttgccgag 128880
gaggtcggcc tgccgtgcgg ccggtcgctc ggggtggcct gatccgcatg caggggcccg 128940
acacgggtca cgagcgacgt gtcgcggtac gccgattcca tccagcggtc cgggtctgcg 129000
ggcccgtcga gcagttccca gccgtcggaa ccgcgccccc accatcgcgg gatgaacagg 129060
tgccgcggtg tggcggcgac ggccggcgcc catcgggagc gctggtgtgt tgcaacgcct 129120
gccagggcgg cggcgtggtg gggccagttc acgcgtggac tccaagagag ggggctgaca 129180
gctgctgagt aatccgtcag tgcggctcag cctctctcgg cttgtgtcgt gatggcgtga 129240
acctcgccga ctgggacgtg cgtcagggcg tgccctgccc ggtatggggc cgggtctttg 129300
cctccctgga tggatacggg tgttccttca tcgaatgcgt gacctttgtg tgtggtgttc 129360
gttgggtgcg gtgaggggat cttcgaaccg ggggaggcag tgacgtgcgg gggaagttcg 129420
ggccgcggtc cggctgtgtg gtgcaggcct acaggttcgc tctcgatccg aacgccgggc 129480
aagggcgggc gttgcgttcg cactgcggtg ccgcccgcgc cgcctataac tgggccgtga 129540
cctgggtgac cgcctcgtgg tggcagcgca aagccgaagc cacctacggc atcggcgagg 129600
aggagctcac cccgtggcgg ccgtggtcgc tgcccgcgct gcgcaaggag ttcaaccgga 129660
tcaagaccac cgacccgaga ttcgcccagt ggtgggagga gaactccaag gaggcgtaca 129720
gcaccggcct cgcgaacgcg gccgccgcgt tcgacaacta cgccaagtcc aagcagggca 129780
aacgcaaggg gcgccggatc ggtgtcccgc gccggaagcc gaagcggaag gcccgcctgg 129840
cctgccggtt caccaccggc acgatccgcc tcgagccgga cggccgacac ctcaccctgc 129900
cccggctggg cacaatccgc acccatgaac ccacccacaa gctcctcacc cgcatccagg 129960
ccggcacggc gcgcctcctg tccgcgaccg tccggcatga gcgcgggcgc tggttcgtct 130020
ccctccaggt ggagacagca cgggagatca tccgcgttgc ccggccggat gtggcggtgg 130080
ggatcgatct gggtgtcaag cacctggcgg tcctggccga cagttgtgga cagacacggt 130140
atgagccgaa cccgaagcac ctggacggcg ccctcaggct gctgaggctc cactcccgcc 130200
gcgtctgccg gcggcagggg cctgaccgca ggaccggccg gaagccgtcc aggcggtggg 130260
agagggccaa ccgcgagcgc aacaggctcc accaccgggt ggcgaacctg cgcgccaatg 130320
cgctgcacaa gttcaccacc cgtgtgcgcg ccgagtacgg cacggtggtg gtcgaagacc 130380
tcaacgtcgc cggaatgctc cgcaacaaac gactcgcccg ccacgtggcc gatgccgggt 130440
tcggggagat ccgccgccag ctcacctaca agggccaacg aaacgcctgc cccaccattg 130500
tggcggaccg ctggtacccc agctccaaga cctgctcgaa ctgcggcgcg gtgaaagcca 130560
agctgccgct gcacgtccga gtcttcacct gcgacgcctg cggcctggtc ctggaccggg 130620
acgagaacgc agggcacaac ctggtcgccc tcgtggctgc ctgcaccact ggtaccggag 130680
tggccggaga ccaggacacg ccaggcgtgt cgaagccccg tggagccgac cgtaagaccc 130740
gccgtcaacg ccccgaccgg aacaccggcc gaggcgggcg ggcaggtggc gcaagcccgc 130800
cgcacccgcg gcggaaggaa acggggaccg tcgtcaggac accgagcgca acccacgctc 130860
cggtgacacc gtcatggacc tttcagacgg aaacgtctgg aatgctgaga gtcactgaga 130920
ccttgagtaa cggaactcca cgacctgctg gtacgtcggc cggttctgcc agtgcacctt 130980
gtcgtgggtg acaccgccca gcgggcgctg gatgatcgag tcggcgcacc actggtcgcc 131040
cgccgagcag ttggcgtcgc cggggtagac ctccgtggcg ggtttggccg cggcctgctt 131100
gaggctggtc aggagggcgt cacggcacgc cgacaggtcg ccggcgccgc agtacttcct 131160
ggcgagcggg ccctggacat cctcgccgag gaccgcgcgc aggtccttgt cgacgtagga 131220
ccaccagccg tactggaagg aggacccggc atgcgacccg gtgggcccat ggccggccga 131280
gggcgcctcg tcgacggtca gattggcgcg cagcgcatcg tacagatcgg ctccgagctg 131340
aggtttgaac tcggcctcca ccagaagcgg ccaccaagcg tccatgatcc gtacggcgtc 131400
ggcgtgtcca taggtgtgcg aaccggcgga tgtctcacgg cgctgggcgc ccgccgtctg 131460
ccaggactcc agctgctgca cggccttggc cgcttgcgga tcggtgaccg gcgcgctgcg 131520
caggaccttc agcagctccg gcagcacgtc ctcgccgcgc aggtcggtga cggcggcctc 131580
ggccatggcg cgggtgaggg ccgcgcgggt caccccgccc ttcttcacca gggcgcgcac 131640
ccggtcgtcg aggaggttgc cgcggtgcac cgagccgttg ccgaagccgg ccgcggtgta 131700
gcccttggcc tgccggttgt tccagctgac gtagtagtcc tgatcgacgg actgggggtg 131760
ctgggcggcc ggggcgtagg cggcggtgtt gaccgtcggg tcgaagtccc gccactcgta 131820
cgccgcctcg cccttgatcg gaagcgaggg gtccacgtcc gccgcccgga ccaggttcag 131880
cccgctgttg tagtaggcgg tctgccggga gtcggcgtag aaccagttga aggtgtagtt 131940
gatgtgctgc gcggcgctct ggaaggactt ggcgtccttc acatagccgg ggtcgttgag 132000
catctggaag ccgacgatgg agtcggcctc gtggcggtag ctggcgcgca gcagggtgta 132060
ggcgacgggt ttgccgtcga ccgtcgcgcg atgggtcaca agaccgtact tggtgcggta 132120
gacctgcatc cggtacgagc cttcggcggt ggagtcggcg agcgtcggct tccaggcgtt 132180
gcggcgctcg atcttctcca tcggcaggca ggagccgtgg tagcggtagt acgtcgagtc 132240
cttcgtgggg gcagacccgt ccggcgtgca cagctcgacg gcgaaggtgt cggtgacgtc 132300
ctgggcggaa gtggtggcgc tccaggcgta gtcctggccg cggcccagct ggacgtacat 132360
ccccacgccc gcgaaggagg cgccgcgggc gctgatgccc ggaccctgga gctcctggag 132420
catcagcagc tgcggggcga aatagccggt ctgcgggccg aagacggcga cgggatggcc 132480
gctcgcggtg tatttgccgg agaccaggag cgcgttggac atcccgtgct tacggctgaa 132540
caga 132544
<210>2
<211>456
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met His Asp Thr Thr Lys Asp Asp Pro Arg Ala Ser Glu Arg Arg Ser
1 5 10 15
Gly Val Arg Arg Ala Phe Val Ala Ser Leu Thr Gly Thr Ala Leu Glu
20 25 30
Trp Tyr Asp Phe Ala Val Tyr Ser Ala Ala Ala Ala Leu Val Phe Gly
35 40 45
Asp Leu Phe Phe Pro Ser Glu Asp Pro Leu Thr Gly Thr Leu Leu Ala
50 55 60
Phe Ser Thr Tyr Ala Val Gly Tyr Val Ser Arg Pro Leu Gly Gly Ile
65 70 75 80
Val Phe Gly Arg Leu Gly Asp Val Ile Gly Arg Lys Lys Val Leu Ile
85 90 95
Ala Thr Leu Val Leu Ile Gly Ala Ala Thr Phe Leu Ile Gly Val Leu
100 105 110
Pro Thr Tyr Ser Thr Ile Gly Val Ala Ala Pro Ile Ala Leu Val Val
115 120 125
Leu Arg Phe Ala Gln Gly Val Gly Val Gly Gly Glu Trp Gly Gly Ala
130 135 140
Val Leu Leu Ser Ser Glu Phe Gly Asp Ser Arg Arg Arg Gly Phe Tyr
145 150 155 160
Ala Ser Ala Ala Gln Val Gly Pro Pro Ala Gly Asn Leu Leu Ala Asn
165 170 175
Gly Val Leu Ala Ala Leu Gly Ala Leu Leu Thr Glu Ala Gln Phe Glu
180 185 190
Ala Trp Gly Trp Arg Val Ala Phe Leu Leu Ser Gly Ala Leu Val Ala
195 200 205
Phe Gly Leu Trp Ile Arg Ala Lys Leu Glu Glu Thr Pro Val Phe Lys
210 215 220
Ala Met Glu Ala Glu Gln Ser Arg Pro Glu Ala Pro Ile Arg Glu Val
225 230 235 240
Phe Thr Thr Gln Pro Arg Ala Leu Leu Ala Ala Ile Leu Cys Arg Val
245 250 255
Gly Pro Asp Val Leu Tyr Ala Met Phe Thr Val Phe Val Leu Thr Tyr
260 265 270
Ala Thr Gly Glu Leu Gly Met Ser Arg Gly Ser Ala Leu Ala Ala Val
275 280 285
Leu Ile Gly Ser Ser Leu Gln Val Phe Leu Met Pro Leu Ala Gly Ala
290 295 300
Val Ser Asp Arg Ile Asn Arg Arg Val Leu Tyr Gly Cys Ala Ala Val
305 310 315 320
Ala Ala Gly Val Trp Pro Phe Leu Phe Phe Pro Met Ile Gly Gly Gly
325 330 335
Ser Trp Ile Pro Leu Ala Leu Gly Val Val Val Gly Leu Val Ile His
340 345 350
Ser Phe Leu Tyr Gly Pro Gln Ala Ala Phe Ile Ala Glu Gln Phe Ser
355 360 365
Pro Arg Leu Arg Tyr Thr Gly Ser Ser Leu Ala Tyr Thr Leu Ala Gly
370 375 380
Ile Ile Gly Gly Ala Ile Ala Pro Leu Leu Phe Thr Thr Leu Leu Ser
385 390 395 400
Ala Tyr Gly Ser Trp Leu Pro Leu Ala Leu Tyr Ile Ala Val Ala Ala
405 410 415
Ala Val Ser Leu Ala Gly Val Leu Leu Gly Arg Asp Pro Glu Ala Ala
420 425 430
Val Asp Glu Asp Ala Gln Leu Ala Thr Pro Ala Ala Gln Ala Thr Ala
435 440 445
Asp Ala Ala Arg Pro Thr Ala Val
450 455
<210>3
<211>271
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Leu Leu Gly Gly His Arg Pro Ser Gly Ile Leu Leu Val Tyr Arg Ser
1 5 10 15
Tyr Ala Gln Ala Leu Arg Lys Lys Leu Thr Ala Glu Lys Ser Thr Thr
20 25 30
Ala Asp Arg His Pro Asn Ser Gly Ser Ala Pro Met Ala Arg Leu Thr
35 40 45
Phe Glu Leu Pro Asp Gly Ser Thr Arg Glu Val Asp Ile Val Gln Val
50 55 60
Leu Asn Ala Gly Tyr Ala Gly Arg Ser Gln Asp Asp Val Ala Ala His
65 70 75 80
Ile Ala Glu Leu Ala Glu Leu Gly Val Pro Thr Pro Ser Val Thr Pro
85 90 95
Ala Leu Tyr Pro Val Ala Pro Tyr Leu Ala Gln Gln Ile Asp Arg Val
100 105 110
Ala Val Gln His Arg Arg Thr Ser Gly Glu Ala Glu Trp Ala Leu Val
115 120 125
Val Ala Gly Asp Gly Glu Leu Leu Leu Thr Ala Ala Cys Asp His Thr
130 135 140
Asp Arg Asp Leu Glu Val His Gly Val Ala Trp Ser Lys Asn Ala Gly
145 150 155 160
Pro Asp Val Leu Ala Arg Arg Ala Trp Arg Leu Ala Asp Val Glu Pro
165 170 175
Arg Leu Asp Asp Leu Thr Leu Arg Ala Trp Val Thr Arg Asp Gly Thr
180 185 190
Glu Thr Glu Ile Gln His Gly Thr Leu Ala Glu Leu Leu Thr Pro Ala
195 200 205
Tyr Trp Val Asp Val Leu Arg Ser Arg Asp Ala Leu Thr Pro Gly Thr
210 215 220
Val Leu Ile Ser Gly Thr Ile Pro Met Thr Pro Gly Val Asp Gln Phe
225 230 235 240
Ala Asp Thr Trp Arg Val Glu Leu Gly Asp Pro Ala Thr Gly Asp Thr
245 250 255
Ile Arg Leu Ala Tyr Asp Val His Pro Met Pro Glu Pro Ile Gly
260 265 270
<210>4
<211>233
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Leu Ser Thr Arg Cys Thr Gly Ser Leu Thr Asn Gly Trp Val Ser Val
1 5 10 15
Glu Leu Gly Ile Pro Leu Ser Val Val Leu Val Asp Asp His Pro Val
20 25 30
Val Arg Ala Gly Ile Ser Ala Trp Cys Ala Ala Ala Asp Ala Pro Ile
35 40 45
Ser Val Val Ala Glu Gly Ala Asn Val Ser Val Ala Leu His Gly Pro
50 55 60
Gly Arg Gly Ala Asp Val Val Val Met Asp Leu Leu Leu Gln Asn Gly
65 70 75 80
Arg Pro Ala Tyr Asp Glu Leu Gln Glu Leu Val Ala Gln Glu Arg Lys
85 90 95
Val Val Val Tyr Thr Met Arg Asp Ser Gln Asp Ala Ala Leu Thr Cys
100 105 110
Met Asp Leu Gly Ser Ala Thr Tyr Ile Thr Lys Ala Glu Gly Gln Arg
115 120 125
His Leu Val Lys Ala Ile Arg Ala Ala Ala Glu Asp Ile Pro Tyr Thr
130 135 140
Pro Pro Ser Leu Ala Gly Ala Phe Gly Ser Asp Thr Arg Gln Ser Arg
145 150 155 160
Pro Val Leu Ser Val Arg Glu Val Glu Val Leu Val Glu Trp Phe Gln
165 170 175
Ser Glu Ser Lys Ala Val Val Ala Gln Ser Leu Gly Ile Ser Glu Arg
180 185 190
Thr Val Asn Thr Tyr Leu Asp Arg Val Arg Ile Lys Tyr Ala Asn Ala
195 200 205
Gly Arg Pro Ala Thr Thr Lys Ala Lys Leu Val Ala Arg Ala Val Gln
210 215 220
Asp Gly Leu Ile Ala Leu Asp Glu Leu
225 230
<210>5
<211>399
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Val Gly Glu Asp Phe Arg Gln Leu Ile Thr Arg His Gln Leu Arg Gly
1 5 10 15
Ile Arg Ala Gly Gly Leu Val Ile Leu Val Thr Leu His Ile Gly Ser
20 25 30
Ala Leu Pro Gly Leu Leu Gly Gly Leu Gly Glu Tyr Arg His Pro Trp
35 40 45
Ile Pro Leu Ala Ala Tyr Gly Leu Leu Thr Leu Ile Leu Gly Gly Ser
50 55 60
Val Val Leu Gly Leu Asp Gly Arg Pro Trp Pro Arg Gly Trp Val Pro
65 70 75 80
Cys Ala Leu Ala Gly Thr Phe Val Ala Ser Val Ala Thr Thr Ser Gln
85 90 95
Val Pro Ala Asp His Tyr Phe Val Ser Ser Leu His Trp Ser Tyr Ser
100 105 110
Leu Ala Gly Trp Phe Thr Val Val Leu Leu Gly His Arg Gly Pro Leu
115 120 125
Val Ser Gly Ala Phe Leu Gly Ala His Leu Ala Val Thr Ala Ala Leu
130 135 140
Leu Leu Thr Val Asp Val Pro Ser Arg Ser Ile Gly Ala Ser Met Ala
145 150 155 160
Leu Ser Ala Leu Ser Ala Gly Cys Phe Gln Met Thr Ile Ala Ala Gly
165 170 175
Ala Lys Leu Leu Leu Asp Ser Ser Ala Ala Ile Gly Gln Ala Leu Arg
180 185 190
Ala Gln Glu Arg Val Arg Thr Arg Ile Ala Val Ala Arg Gln Ile Gln
195 200 205
Ala Asp Gln Arg Arg Arg Tyr Ala Glu Leu Asn Ala Thr Val Leu Pro
210 215 220
Leu Leu Thr Gly Leu Ala Ser Gly Ser Leu Asp Pro Glu Asp Asp Glu
225 230 235 240
Val Arg His Ala Cys Ala Leu Glu Ala Ala Arg Leu Arg Arg Leu Phe
245 250 255
Ala Glu Ser Asp Cys Thr Phe Asp Pro Leu Val His Glu Met Arg Ala
260 265 270
Cys Ile Asp Val Ala Glu Arg Asn Gly Thr Thr Val Gln Leu Ala Val
275 280 285
Arg Gly Asp Pro Leu Glu Leu Pro Leu Pro Leu Arg Arg Ala Leu Leu
290 295 300
Asp Pro Val Ile Ala Ala Leu Ala Ala Ala Gly Gln Thr Ala Arg Val
305 310 315 320
Thr Val Val Arg Gly Gly Asp Gln Val Arg Val Gly Val Val Val Asp
325 330 335
Ala Leu Arg Asp Arg Leu Pro Glu Pro Lys Ala Asn Gly Ile Arg Val
340 345 350
Arg Thr Val His Ala Glu Asp Arg Leu Leu Val Glu Ala Ala Cys Arg
355 360 365
Ile Thr Ala Arg Pro Glu Leu Ser Asp Arg Pro Glu Pro Pro Trp Pro
370 375 380
Pro Ala Trp Pro Ser Trp Ser Pro Ala Gly Arg Arg Ser Arg Thr
385 390 395
<210>6
<211>396
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Ser Ile Ser Ser Leu Ser Pro Gly Ile Asp Pro Pro Ala Ser Ala
1 5 10 15
Arg Leu Pro Gly Ser Ala Gly Pro Trp Ala Arg Glu Tyr Val Ala Phe
20 25 30
Asn Arg Arg Leu Ala Leu Tyr Gly Arg Ala Gly Val Met Ser Ala Cys
35 40 45
Ala Val Leu Gly Val Ala Ala Met Pro Met Gly Glu Met Ala Pro Ala
50 55 60
Ala Val Val Ala Ala Ala Val Leu Ala Trp Ser Trp Ile His Leu Arg
65 70 75 80
Leu Ala Thr Thr Ser Ala Met Arg Gln Arg Pro Leu Leu Ala Leu Asp
85 90 95
Leu Thr Val Met Thr Ala Leu Cys Leu Ser Gln Arg Phe Thr Ile Pro
100 105 110
Glu Ala Gln Thr Thr His Ala Gly Thr Trp Ile Leu Val Ser Val Ser
115 120 125
Phe Thr Ala Val Ala Tyr Gln Leu Met Gln Pro Pro Leu Thr Gly Ile
130 135 140
Leu Ala Thr Leu Trp Leu Cys Cys Ala Asp Val Val Gly Thr Ala Leu
145 150 155 160
His Pro Asn Leu Glu Trp Ser Gly Val Leu Arg Ser Val Ile Trp Val
165 170 175
Val Val Asn Thr Ala Leu Ala Arg Ala Val Val Arg Leu Val Phe Ser
180 185 190
Glu Ser Arg Ala Ala Asp Glu Ala Ala Asp Arg Ala Ala Arg Ala Arg
195 200 205
Gln Gln Gly Glu Ala Ala Glu Ala Arg Cys Ala Ala Glu Arg Glu His
210 215 220
Leu Ala Ala Leu His Asp Thr Ala Cys Ala Thr Leu Leu Ile Ala Ser
225 230 235 240
Ala Pro Trp Ala Ser Met Arg Ala Glu Thr Leu Arg Ala Gln Ala Ala
245 250 255
Arg Asp Leu Leu Arg Leu Arg Ala Glu Gln Pro Val Ala Gly Glu Val
260 265 270
Asp Leu Ala Ala Glu Leu Leu Asp Glu Ile Ala Ala His Pro Leu Arg
275 280 285
Val Ile Asn Arg Phe Asp Val Glu Leu Gly Thr Thr Trp Arg Pro Ile
290 295 300
Ala Ala Ala Leu Arg Gly Gly Leu Gly Glu Ala Leu Arg Asn Val Ala
305 310 315 320
Arg Tyr Ala Gly Val Asp Ala Val Arg Val Thr Ala Glu Arg Val Asp
325 330 335
His Val Ile Val Val Thr Ile Ala Asp Ser Gly Val Gly Phe Asp Pro
340 345 350
Glu His Ile Pro Gly His Arg Ile Gly Ile Lys Arg Ser Ile His Gly
355 360 365
Arg Met Trp Asn Val Gly Gly Arg Ala Thr Val Glu Ser Arg Pro Gly
370 375 380
His Gly Thr Thr Val Arg Leu Glu Trp Pro Arg Gly
385 390 395
<210>7
<211>241
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Leu Gly Pro Leu Arg Val Val Asp Glu Lys Gly Gly Ser Pro Leu Arg
1 5 10 15
Ala Gln Lys Leu Ala Ala Val Leu Ala Val Leu Leu Ile Arg Ser Asp
20 25 30
Gln Val Val Met Ala Glu Gln Leu Thr Arg Glu Ile Trp Gly Glu Leu
35 40 45
Pro Pro Arg Arg Asp Thr Ala Ser Ile Tyr Val Cys Ile Ser Arg Leu
50 55 60
Arg Lys Phe Leu Ser Arg Pro Gly Gln Gln Asn Pro Ile Ile Thr Arg
65 70 75 80
Pro Gly Gly Tyr Met Leu Arg Arg Ala Ser Ser Glu Cys Asp Phe Asp
85 90 95
Val Phe Gln Gln Leu Val Ala Asp Gly Arg Lys Cys Val Arg Tyr Gly
100 105 110
Glu Pro Ala Gln Ala Ala Leu Cys Phe Glu Gln Ala Leu Ala Leu Trp
115 120 125
Arg Gly Pro Ala Phe Gly Asn Val Gln Thr Gly Pro Ile Val Ala Gly
130 135 140
Phe Leu Lys Arg Leu Asp Glu Met Arg Ile Glu Cys Ala Glu Met Ser
145 150 155 160
Ala Glu Ala His Leu Glu Leu Gly Arg His Arg Glu Leu Val Gly Arg
165 170 175
Leu Arg Ser Leu Val Ala Glu Phe Pro Leu Arg Glu Thr Phe Tyr Arg
180 185 190
Gln Leu Met Leu Ala Leu Tyr Arg Ser Asp Gln Lys Ser Glu Ala Leu
195 200 205
Gly Ile Tyr Glu Ser Ala Arg Arg Val Leu Asp Arg Glu Leu Gly Leu
210 215 220
Glu Pro Gly Arg Lys Leu Arg Glu Leu Arg His Thr Val Leu Ala Ser
225 230 235 240
Thr
<210>8
<211>253
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Arg Tyr Glu Ile Leu Gly Pro Leu Arg Val Val Asp Arg Ser Ser
1 5 10 15
Val Thr Ser Ile Ser Ala Arg Lys Ile Glu Val Leu Leu Ala Ala Leu
20 25 30
Leu Val Arg Ala Asp Glu Val Val Ser Ser Glu Asp Leu Val Lys Glu
35 40 45
Ile Trp Gly Thr Arg Ala Pro Arg Gln Ala Val Ala Ala Leu His Val
50 55 60
Tyr Val Ser Gln Leu Arg Lys Phe Leu Arg Arg Pro Asn Arg Thr Glu
65 70 75 80
Ser Pro Ile Val Thr Ala Arg Ser Gly Tyr Val Leu Arg Leu Gly Asp
85 90 95
Asp Glu Leu Asp Phe His Val Phe Gln Gly Leu Val Arg Gln Gly Arg
100 105 110
Ala Ala Gln Arg Ala Gly Arg Thr His Glu Ala Cys Glu Ala Tyr Glu
115 120 125
Glu Gly Leu Glu Phe Trp Arg Gly Pro Val Leu Glu Asp Leu Arg Asp
130 135 140
Gly Ala Ile Val Asn Gly Phe Val Cys Trp Ala Glu Gln Ala Arg Leu
145 150 155 160
Glu Cys Val Glu Ser Phe Val Glu Ala Gly Leu Glu Leu Gly Arg His
165 170 175
Arg Glu Phe Val Ser Tyr Leu Tyr Ala Arg Ile Glu Glu Leu Pro Leu
180 185 190
Asn Glu Ala Phe Tyr Arg Gln Leu Met Leu Ala Leu Tyr Arg Ser Gly
195 200 205
Cys Arg Ala Asp Ala Leu His Val Tyr Gln Ser Ala Arg Ala Val Leu
210 215 220
Ser Glu Glu Leu Gly Leu Glu Pro Ser Ala Ala Leu Lys Arg Leu His
225 230 235 240
His Ala Val Leu Val Asp Ala Pro Glu Leu Glu Ala Ala
245 250
<210>9
<211>2902
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Leu Arg Phe Arg Tyr Gly Gly Val Met Ala Gly Ser Ser Pro Ser His
1 5 10 15
Ala Gln Gln Ala Thr Ser Pro Val Ala Ile Val Gly Leu Ala Cys Arg
20 25 30
Leu Pro Gly Ala Pro Asp Pro Glu Ala Phe Trp Arg Leu Leu Arg Ala
35 40 45
Gly Glu Asn Ala Val Val Pro Val Pro Asp Ser Arg Leu Pro Thr Glu
50 55 60
Pro Gly Ser Pro Pro Tyr Phe Ala Gly Leu Leu Glu His Val Asp Thr
65 70 75 80
Phe Asp Ala Gly Phe Phe Gly Ile Ser Pro Arg Glu Ala Thr Ala Met
85 90 95
Asp Pro Gln Gln Arg Leu Met Leu Glu Leu Ala Trp Glu Ala Met Glu
100 105 110
Asp Ala Gly Leu Gly Pro Lys Asn Leu Ala Glu Arg Arg Thr Ala Val
115 120 125
Phe Thr Gly Ala Ile Trp Asp Asp Tyr Ala Thr Leu Leu His Arg Arg
130 135 140
Arg Pro Asn Asp Ile Ala Ile Thr Arg His Thr Met Ala Gly Leu His
145 150 155 160
Arg Gly Leu Ile Ala Asn Arg Val Ser His Leu Leu Gly Leu Arg Gly
165 170 175
Pro Ser Leu Thr Val Asp Ala Ala Gln Ser Ser Gly Leu Val Ala Val
180 185 190
His Leu Ala Cys Glu Ser Leu Arg Arg Gly Glu Ala Asp Leu Ala Leu
195 200 205
Ala Gly Gly Val Asn Leu Ile Leu Ala Glu Glu Ser Met Arg Met Ala
210 215 220
Glu Ala Gln Phe Gln Gly Leu Ser Pro Asp Gly Arg Cys Tyr Thr Phe
225 230 235 240
Asp Ala Arg Ala Asn Gly Phe Val Arg Gly Glu Gly Gly Gly Met Val
245 250 255
Leu Leu Lys Pro Leu Ala Ala Ala Val Ala Asp Gly Asp Pro Val Tyr
260 265 270
Cys Val Ile Glu Gly Ser Ala Val Asn Asn Asp Gly Ala Thr Asp Gly
275 280 285
Leu Thr Arg Pro Ser Ala Asp Ala Gln Thr Asp Val Val Arg Gln Ala
290 295 300
Trp Gln Arg Ala Ala Val Ser Pro Ala Glu Leu Gln Tyr Val Glu Leu
305 310 315 320
His Gly Thr Gly Thr Pro Val Gly Asp Pro Ile Glu Ala Ala Ala Leu
325 330 335
Gly Asp Ala Leu Gly Asp Ala Phe Asp Gly Cys His Arg Asp Arg Ala
340 345 350
Val Asp Ala Pro Leu Arg Val Gly Ser Val Lys Thr Asn Val Gly His
355 360 365
Leu Glu Ala Ala Ala Gly Ile Ala Gly Leu Leu Lys Thr Ala Leu Ser
370 375 380
Ile His His Arg Arg Leu Pro Pro Ser Leu Asn Phe Ala Thr Pro Asn
385 390 395 400
Pro Gly Ile Pro Leu Ala Glu Leu Gly Leu Arg Val Gln Thr Ala Phe
405 410 415
Gly Pro Trp Pro Asp Glu Arg Arg Arg Leu Thr Ala Gly Val Ser Ser
420 425 430
Phe Gly Met Gly Gly Thr Asn Cys His Val Val Leu Ala Glu Pro Pro
435 440 445
Ala Pro Ala Val Leu Pro Asp Arg Pro Thr Ala Pro Gly Asp Val Ser
450 455 460
Ala Arg Thr Ala Pro Pro Val Met Pro Trp Val Val Ser Ala Ala Ser
465 470 475 480
Pro Lys Ala Leu Thr Ala Gln Ala Ala Ala Leu Tyr Glu His Leu Arg
485 490 495
Ala His Pro Gly Leu His Pro Val Asp Ile Gly His Ala Leu Ala Thr
500 505 510
Thr Arg Thr Ala Phe Pro His Arg Ala Val Val Leu Gly Arg Asp Glu
515 520 525
Asp Glu Leu Val Ser Arg Leu Asp Ala Leu Ala Ser Glu Thr Gln Thr
530 535 540
Ser Gly Val Ile Arg Gly Arg Ala Gly Gly Gly Arg Val Ala Phe Leu
545 550 555 560
Phe Ser Gly Gln Gly Ser Gln Arg Pro Gly Met Gly Arg Glu Leu Tyr
565 570 575
Ala Ala Tyr Pro Val Phe Ala Asp Ala Leu Arg Glu Val Cys Ala His
580 585 590
Leu Asp Pro Met Leu Asp Thr Asp Thr Pro Leu Leu Asp Leu Met Phe
595 600 605
Ala Glu Ala Pro Pro Asp Gly Glu Pro Pro Leu Asn Arg Thr Ala Tyr
610 615 620
Thr Gln Pro Ala Leu Phe Ala Ile Glu Val Ala Leu Tyr Arg Leu Val
625 630 635 640
Thr Ser Trp Gly Val Thr Pro Asp His Leu Met Gly His Ser Val Gly
645 650 655
Glu Ile Thr Ala Ala His Val Ala Gly Val Leu Ser Leu Pro Asp Ala
660 665 670
Cys Thr Leu Val Ala Ala Arg Gly Arg Leu Met Gln Ser Ile Thr Ala
675 680 685
Pro Gly Ala Met Ala Ala Trp Gln Ala Thr Ala Glu Glu Ala Gly Gln
690 695 700
Ala Leu Glu Ala Tyr Gly Gly Arg Val Gly Leu Ala Ala Val Asn Ala
705 710 715 720
Pro Ala Ser Val Val Ile Ser Gly Asp Arg Glu Ala Val Ala Glu Ala
725 730 735
Thr Ala Ala Trp Arg Ala Arg Gly Arg Lys Ala Thr Val Leu Lys Val
740 745 750
Ser His Ala Phe His Ser Pro His Leu Asp Gly Ile Leu Gly Asp Leu
755 760 765
Arg Thr Val Ala Ala Gly Leu Thr Phe Ala Ala Pro Ala Ile Pro Val
770 775 780
Val Ser Asn Leu Thr Gly Gly Ala Ala Thr Glu Ala Gln Leu Arg Ser
785 790 795 800
Pro Asp Tyr Trp Ala Asp His Ala Arg Gln Ala Val Arg Phe Asp Ala
805 810 815
Gly Val Arg His Leu Cys Asp Ala Gly Val Asp Thr Phe Leu Glu Leu
820 825 830
Gly Pro Asp Ala Ser Leu Thr Gly Met Ala Arg Glu Ser Ala Ala Ala
835 840 845
Trp Ala Gly Asp Ala Pro Arg Pro Val Ala Val Ala Val Gln Arg Arg
850 855 860
Gly Arg Pro Glu Ala Gln Ser Phe Val Ser Ala Met Ala Gln Ala His
865 870 875 880
Val Arg Gly Val Gly Val Asp Trp Ala Ala Ala Phe Ala Gly His Glu
885 890 895
Thr Arg Arg Ala Pro Leu Pro Thr Tyr Ala Phe Gln Arg Asp Arg His
900 905 910
Trp Pro Asp Gly Leu Asp Glu Arg Gly Ala Arg Arg Pro Ser Thr Ser
915 920 925
Pro Val Val Pro Ser Thr Asp Arg Glu Pro Val Val Val Asp Ala Ser
930 935 940
Pro Ala Asp Arg Ala Ala Ser Pro Gly Glu Leu Leu Ala Leu Val Arg
945 950 955 960
Thr His Ala Ala Leu Val Leu Gly His Asn Ser Pro Asp Gly Ile Asp
965 970 975
Pro Ala Leu Thr Phe Lys Gln Leu Gly Phe Asp Ser Leu Ala Ala Thr
980 985 990
Glu Leu Ser Glu Arg Leu Ser Ala Ala Thr Asp Thr Glu Leu Pro Ala
995 1000 1005
Thr Leu Thr Phe Asp His Pro Thr Pro Asn Ala Val Ala Ala Trp
1010 1015 1020
Leu Arg Ala Ala His Glu Gly Gln Pro Thr Ala Ala Pro Thr Ala
1025 1030 1035
Ala Thr Gly Pro Ser Met Ala Glu Asp Pro Val Ala Val Val Ala
1040 1045 1050
Val Ser Cys Arg Tyr Pro Gly Gly Val Glu Ser Gly Glu Ala Leu
1055 1060 1065
Trp Arg Leu Val Asp Glu Gly Val Asp Ala Val Gly Glu Phe Pro
1070 1075 1080
Gly Asp Arg Gly Trp Asp Leu Ala Glu Leu Phe Gly Arg Ala Pro
1085 1090 1095
Asp Gly Ser Gly Gly Ser Ala Thr Gly Arg Gly Gly Phe Leu Tyr
1100 1105 1110
Gly Ala Gly Asp Phe Asp Ala Glu Phe Phe Gly Ile Ser Pro Arg
1115 1120 1125
Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Ile Leu Leu Glu Leu
1130 1135 1140
Ser Trp Glu Leu Leu Glu Arg Ala Gly Ile Pro Pro Ala Ser Leu
1145 1150 1155
Ala Gly Ser Ala Thr Gly Val Tyr Val Gly Ala Thr Ala Val Asp
1160 1165 1170
Tyr Gly Pro Arg Leu His Glu Ala Thr Ala Glu Leu Asp Gly His
1175 1180 1185
Leu Leu Thr Gly Ser Thr Pro Ser Val Ala Ser Gly Arg Val Ala
1190 1195 1200
Tyr Ala Leu Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala
1205 1210 1215
Cys Ser Ser Ser Leu Val Ala Met His Leu Ala Ala Gln Ala Leu
1220 1225 1230
Arg Gln Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val
1235 1240 1245
Met Ala Thr Pro Gly Met Phe Thr Ser Phe Ser Arg Gln Arg Gly
1250 1255 1260
Leu Ala Pro Asp Gly Arg Cys Lys Pro Phe Ala Ala Ala Ala Asp
1265 1270 1275
Gly Thr Gly Trp Ser Glu Gly Ala Gly Leu Val Leu Leu Glu Arg
1280 1285 1290
Leu Ser Asp Ala Arg Arg Asn Gly His Gln Val Leu Ala Val Ile
1295 1300 1305
Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Ser
1310 1315 1320
Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Arg Gln Ala Leu
1325 1330 1335
Ala Asn Ala Arg Leu Glu Pro Ala Asp Val Asp Ala Val Glu Ala
1340 1345 1350
His Gly Thr Gly Thr Thr Leu GIy Asp Pro Ile Glu Ala Gln Ala
1355 1360 1365
Leu Leu Ala Thr Tyr Gly Gly Gln Arg Thr Asp Asp Arg Pro Leu
1370 1375 1380
Trp Leu Gly Ser Ile Lys Ser Asn Ile Gly His Thr Gln Ala Ala
1385 1390 1395
Ala Gly Val Ala Gly Val Ile Lys Met Val Met Ala Leu Arg His
1400 1405 1410
Gly Arg Leu Pro Ala Ser Leu His Ile Asp Ala Pro Ser Pro His
1415 1420 1425
Ile Asp Trp Ser Asp Gly Thr Val Arg Leu Leu Ser Glu Pro Val
1430 1435 1440
Asp Trp Pro Gly Thr Asp Trp Pro Gly Ser Asp Arg Pro Arg Arg
1445 1450 1455
Ala Ala Val Ser Ser Phe Gly Ile Ser Gly Thr Asn Ala His Leu
1460 1465 1470
Ile Leu Glu Gln Ala Pro Asp His Pro Glu Pro Glu Pro Thr Thr
1475 1480 1485
Ser Gly Gly Val Val Pro Trp Val Leu Ser Ala Arg Thr Ala Asp
1490 1495 1500
Ala Leu Arg Ala Gln Ala Gly Arg Leu Ala Glu Trp Val Thr Ala
1505 1510 1515
Gly Ala Pro Arg Ser Pro Ala Ser Pro Thr Ser Pro Ala Ser Pro
1520 1525 1530
Ala Asp Val Gly Trp Ser Leu Ala Thr Thr Arg Ser Ala Asp Arg
1535 1540 1545
His Arg Ala Val Val Ser Gly Thr Asp Arg Asp Glu Leu Leu Ser
1550 1555 1560
Gly Leu Arg Ala Val Ala Asp Gly Leu Ala Pro Ala Ala Val Ser
1565 1570 1575
Ala Gly Ala Ala Pro Gly Pro Val Met Val Phe Pro Gly Gln Gly
1580 1585 1590
Ser Gln Trp Arg Gly Met Gly Val Glu Leu Leu Asp Ser Ser Pro
1595 1600 1605
Val Phe Ala Ala Arg Met Ala Ala Cys Glu Ala Ala Leu Gly Glu
1610 1615 1620
Phe Val Asp Trp Ser Leu Thr Ala Val Leu Arg Gly Ala Pro Gly
1625 1630 1635
Ala Pro Glu Pro Ser Arg Val Asp Val Leu Gln Pro Cys Leu Trp
1640 1645 1650
Ala Val Met Val Ser Leu Ala Ala Val Trp Glu Ser Tyr Gly Val
1655 1660 1665
Thr Pro Thr Ala Val Val Gly His Ser Gln Gly Glu Ile Ala Ala
1670 1675 1680
Ala Cys Val Ala Gly Gly Leu Ser Leu Arg Asp Gly Ala Arg Val
1685 1690 1695
Val Ala Leu Arg Ser Gln Ala Leu Arg Ala Leu Ala Gly His Gly
1700 1705 1710
Thr Met Ala Ser Leu Ala Leu Ser Gly Ala Glu Ala Glu Arg Phe
1715 1720 1725
Leu Ala Asp Leu Gly Ala Ala Ala Ala Arg Val Thr Val Ala Val
1730 1735 1740
Phe Asn Gly Pro Tyr Ser Thr Val Val Ser Gly Pro Thr Asp Gln
1745 1750 1755
Val Ala Ala Val Val Ala Ala Cys Glu Ala Ala Gly His Arg Ala
1760 1765 1770
Arg Thr Ile Asp Val Asp Tyr Ala Ser His Gly Pro Gln Val Asp
1775 1780 1785
Arg Leu Ala Asp Thr Ile Arg Thr Asp Leu Ala Asp Leu Ser Pro
1790 1795 1800
Gly Ala Ser Asp Ala Val Phe Tyr Ser Ala Val Thr Gly Ala Arg
1805 1810 1815
Gln Pro Thr Glu Glu Leu Asp Ala Asp Tyr Trp Phe Thr Asn Leu
1820 1825 1830
Arg Gln Pro Val Arg Phe Ala Ser Ala Ile Asp Ala Leu Leu Ala
1835 1840 1845
Ala Gly Tyr Arg Val Phe Ile Glu Val Ser Pro His Pro Val Leu
1850 1855 1860
Ile Pro Ala Leu Arg Glu Cys Phe Glu Glu Ala Glu Val Ala Ala
1865 1870 1875
Ala Thr Val Pro Thr Leu Arg Arg Asp Gln Gly Gly Pro Asp Gln
1880 1885 1890
Val Ala Arg Ala Leu Gly Asp Gly Phe Val Ala Gly Leu Ala Val
1895 1900 1905
Asp Trp Ser Arg Trp Phe Val Gly Asp Gly Arg Glu Ala Gly Asp
1910 1915 1920
Glu Gly His Arg Pro Arg Thr Val Glu Leu Pro Thr Tyr Pro Phe
1925 1930 1935
Gln Arg Arg Arg Tyr Trp Leu Ala Pro Asp His Gly Arg Arg Glu
1940 1945 1950
Gly Arg Thr Ala Gly Val Gly Thr Arg Pro Ala Gly His Ala Leu
1955 1960 1965
Leu Ser Ser Ala Val Glu Leu Ala Asp Gly Gly Leu Val Leu Ser
1970 1975 1980
Gly Arg Leu Pro Gly Asp Ala Ala Trp Val Gly Ala His Thr Val
1985 1990 1995
Ala Gly Val Gln Leu Val Pro Gly Ala Val Leu Val Asp Trp Ala
2000 2005 2010
Leu Leu Ala Ala Asp Glu Ala Gly Gly Ala Ser Leu Glu Glu Leu
2015 2020 2025
Leu Leu Arg Ala Pro Leu Glu Leu Ser Gly Pro Ser Gly Leu Ser
2030 2035 2040
Glu Pro Ser Ala Gly Val Leu Ala Gln Val Ala Val Gly Ala Pro
2045 2050 2055
Asp Glu Ser Gly Arg Arg Glu Leu Arg Ile Ser Ser Arg Pro Ala
2060 2065 2070
Asp Ala Gly Ala Gly Glu Gly Trp Thr Cys His Ala Val Gly Ser
2075 2080 2085
Leu Ala Pro Gly Gly Pro Pro Ala Pro Ala Asp Thr Gly Thr Ala
2090 2095 2100
Thr Val Pro Trp Pro Pro Ala Gly Ala Glu Ala Leu Asp Pro Ala
2105 2110 2115
Gly Leu Tyr Glu Arg Ala Glu Arg Arg Gly Tyr Gly Tyr Gly Pro
2120 2125 2130
Ala Leu Arg Gly Val Val Ala Leu Trp Arg Asp Gly Ala Asp Leu
2135 2140 2145
Val Ala Asp Val Ala Leu Pro Glu Glu Ala Gly Gly Gly Gly Glu
2150 2155 2160
Gly Gly Ala Asp Gly Asp Gly Thr Ala Gly Phe Gly Leu His Pro
2165 2170 2175
Val Leu Leu Asp Ala Ala Leu Gln Pro Ala Leu Leu Ala Glu Pro
2180 2185 2190
Asp Gly Thr Gly Gly Glu Gly Ala Gly Pro Glu Ala Arg Leu Trp
2195 2200 2205
Leu Pro Phe Ala Trp Ser Gly Val Arg Leu Trp Ala Thr Gly Ala
2210 2215 2220
Arg Ala Ala Arg Val Arg Leu Ser Pro Leu Asp Gly Gly Gly Gly
2225 2230 2235
Asp Val Ala Asp Glu Arg Glu Leu Arg Ile Glu Val Ser Asp Pro
2240 2245 2250
Thr Gly Ala Pro Val Leu Ser Val Ala Ser Val Val Leu Arg Pro
2255 2260 2265
Arg Thr Val Arg Gln Val Arg Glu Ala Ser Gly Ala Ala Ala Gly
2270 2275 2280
Gly Leu Phe Ala Leu Asp Trp Thr Pro Val Ala Pro Gln Glu Pro
2285 2290 2295
Ser Gly Ala Glu Asp Asp Ala Gly Cys Val Ala Val Leu Gly Glu
2300 2305 2310
Ala Pro Thr Glu Pro Gly Val Asp Gly Cys Arg Asp Thr Tyr Thr
2315 2320 2325
Asp Leu Pro Ala Leu Leu Ala Ala Leu Asp Ala Gly Ala Pro Leu
2330 2335 2340
Pro Ser Val Val Met Trp Arg Pro Pro Ala Ala Asp Pro Gly Ala
2345 2350 2355
Ala Pro Glu Asp Ala Ala Leu Ser Ala Val Arg Gly Val Ala Ala
2360 2365 2370
Ala Leu Arg Ala Trp Val Ala Glu Pro Arg Leu Thr Val Ser Arg
2375 2380 2385
Leu Ala Val Val Thr Arg Gly Ala Val Ala Ala Gly Gly Ala Glu
2390 2395 2400
Gly Glu Pro Val Asp Leu Ala Ala Ala Ala Ala Trp Gly Cys Ala
2405 2410 2415
Arg Gly Val Gln Ala Glu His Pro Asp Arg Ile Val Leu Val Asp
2420 2425 2430
Val Asp Asp Asp Val Asp Met Gly Ala Asp Thr Asp Thr Asp Ile
2435 2440 2445
Gly Ala Ala Ala Gly Leu Ala Ala Ala Leu Gly Glu Pro Gln Val
2450 2455 2460
Ala Leu Arg Gly Asp Thr Leu Leu Ala Pro Arg Leu Ala Arg Ser
2465 2470 2475
Ala Ala Thr Pro Gly Gly Val Ala Phe Asp Pro Asn Gly Thr Val
2480 2485 2490
Leu Val Thr Asp Ser Gly Gly Pro Leu Ala Gly Ser Val Ala Glu
2495 2500 2505
His Leu Val Arg Ala Glu Gly Val Arg His Leu Leu Leu Val Arg
2510 2515 2520
Phe Glu Gly Ala Asp Gly Ala Tyr Asp Thr Tyr Asp Arg Gln Asp
2525 2530 2535
Ala Gln Val His Met Val Thr Val Asp Pro Arg Asp Thr Ala Ala
2540 2545 2550
Leu Glu Arg Val Val Ala Gln Val Asp Pro Ala His Pro Leu Thr
2555 2560 2565
Gly Val Val His Val Ala Gly Leu Ser Ala Asp Ile Glu Thr Ser
2570 2575 2580
Gly Ala Ala Arg Gly Trp Ala Val Ala Ala Gly Val Val Arg Ala
2585 2590 2595
Leu His Gln Ala Thr Ala Ala Leu Pro Ser Val Arg Phe Val Thr
2600 2605 2610
Leu Ser Asp Ala Ala Thr Ala Trp Asp Gly Pro Ala Ala Pro Glu
2615 2620 2625
Arg Ala Ala Ala Gly Ala Phe Cys Ala Ala Val Thr Asp Val Arg
2630 2635 2640
Arg Arg Ala Gly Leu His Gly Leu Asp Val Ala Phe Gly Pro Trp
2645 2650 2655
Ala Ala Ala Asp Asp Asp Gly Gly Ala Asp Ser Gly Gly Arg Trp
2660 2665 2670
Thr Gly Val Leu Gly Ala Asp Arg Gly Leu Ala Leu Leu Arg Ala
2675 2680 2685
Ala Cys Arg Ala Asp Arg Pro Arg Leu Val Ala Ala Asp Ile Arg
2690 2695 2700
Thr Arg Ala Leu Thr Ala His Pro Ala His Glu Leu Pro Ala Ala
2705 2710 2715
Leu Arg Thr Leu Gly Ala Ser Ala Ser Ala Ser Ala Gly Gly Arg
2720 2725 2730
Ala Pro Val Arg Arg Val Ala Ala Ala Ala Pro Gly Arg Thr Thr
2735 2740 2745
Asp Trp Ala Ser Arg Leu Val Gly Leu Gly Pro Ala Glu Arg Arg
2750 2755 2760
Arg Ala Val Leu Glu Leu Val Arg Asp His Ala Ala Ala Val Leu
2765 2770 2775
Gly Gln Pro Asp Pro Lys Ala Val Arg Ala Asp Ala Ser Phe Lys
2780 2785 2790
Glu Leu Gly Phe Asp Ser Val Thr Ala Val Glu Leu Arg Asp Arg
2795 2800 2805
Leu Val Ala Val Gly Gly Leu Arg Leu Pro Ala Ala Val Val Phe
2810 2815 2820
Arg His Pro Thr Pro Glu Ala Leu Ala His Arg Ile Glu Gln Gln
2825 2830 2835
Leu Ala Pro Asp Asp Thr Asn Asn Ala Ala Ile Thr Asp Asn Ala
2840 2845 2850
Asp Asn Ala Ala Lys Ser Asn Gly Asn Ser Asn Gly Thr Ala Leu
2855 2860 2865
Asp Ala Ala Asp Lys Leu Ala Ser Ala Thr Ala Asp Glu Ile Leu
2870 2875 2880
Asp Phe Ile Asp Asn Glu Leu Gly Val Leu Ser Glu Ala Arg Pro
2885 2890 2895
Arg Pro Ser Asn
2900
<210>10
<211>2223
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Val Ser Glu Glu Lys Leu Val Glu Tyr Leu Arg Arg Val Thr Thr
1 5 10 15
Glu Leu His Asp Ala Arg Thr Arg Leu Arg Glu Leu Glu Glu Gly Glu
20 25 30
Gln Glu Pro Val Ala Val Val Gly Met Ala Cys Arg Phe Pro Gly Gly
35 40 45
Val Arg Ser Pro Glu Asp Leu Arg Arg Leu Val Leu Ser Gly Gly Asp
50 55 60
Ala Ile Gly Asp Phe Pro Thr Asp Arg Gly Trp Asp Leu Asp Gly Leu
65 70 75 80
Phe His Pro Asp Pro Ala His Phe Gly Thr Ser Tyr Val Ser Gln Gly
85 90 95
Gly Phe Leu Tyr Asp Val Asp Arg Phe Asp Ala Gly Phe Phe Gly Ile
100 105 110
Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Leu Leu
115 120 125
Glu Leu Ser Trp Glu Ala Leu Glu Ser Ala Gly Val Val Pro Gly Ala
130 135 140
Leu Arg Ala Ser Arg Thr Gly Val Tyr Val Gly Val Ser Ser Glu Asp
145 150 155 160
Tyr Ile Ser Gly Leu Pro Gln Ile Pro Glu Gly Phe Glu Gly Tyr Ala
165 170 175
Thr Thr Gly Ser Leu Thr Ser Val Ile Ser Gly Arg Val Ala Tyr Thr
180 185 190
Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser
195 200 205
Ser Met Val Ala Ile His Leu Ala Gly Gln Ala Leu Arg Gln Gly Glu
210 215 220
Cys Ser Leu Ala Leu Ala Gly Gly Val Thr Val Leu Ser Thr Pro Leu
225 230 235 240
Met Phe Thr Glu Phe Cys Arg Gln Arg Ala Leu Thr Pro Asp Ala Arg
245 250 255
Cys Lys Pro Phe Ala Ala Ala Ala Asp Gly Thr Gly Phe Ser Glu Gly
260 265 270
Ala Gly Leu Leu Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly
275 280 285
His Glu Val Leu Ala Val Leu Arg Gly Ser Ala Ile Asn Gln Asp Gly
290 295 300
Ala Ser Asn Gly Leu Thr Ala Pro Asn Asp Val Ala Gln Glu Ser Val
305 310 315 320
Ile Arg Asp Ala Leu Ala Arg Ala Gly Leu Ser Gly Ala Asp Val Asp
325 330 335
Met Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro Ile Glu
340 345 350
Ala Glu Ala Leu Ile Ala Thr Tyr Gly Ala Asp Arg Pro Ala Asp Arg
355 360 365
Pro Leu Tyr Leu Gly Ser Ile Lys Ser Asn Ile Gly His Thr His Ala
370 375 380
Ala Ala Gly Val Ala Gly Ala Ile Asn Thr Val Met Ala Leu Arg Asp
385 390 395 400
Gly Lys Leu Ala Arg Thr Leu His Ile Asp Glu Pro Thr Arg His Val
405 410 415
Asp Trp Ser Ala Gly Thr Val Arg Leu Leu Thr Asp Pro Tyr Asp Trp
420 425 430
Pro Val Ala Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly Val
435 440 445
Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln Ala Pro Asp Ala Gly
450 455 460
Ala Gln Gln Asp Ala Arg Gln Arg Gly Gly Asp Thr Phe His Gly Val
465 470 475 480
Val Pro Trp Pro Val Ser Gly Arg Thr Glu Ala Ala Leu Arg Asp Gln
485 490 495
Ala Ala Arg Leu Gly Ala Phe Leu Thr Ala Asp Gly Ala Thr Ala Asn
500 505 510
Gly Ala Ala Thr Gly Gly Val Ala Asp Val Gly Trp Ser Leu Ala Met
515 520 525
Arg Arg Thr Ala Phe Glu His Arg Ala Val Val Val Gly Arg Asp Arg
530 535 540
Ser Asp Leu Leu Ala Ala Leu Glu Gly Leu Ala Ala Asp Glu Pro Gly
545 550 555 560
Pro Ala Val Val Arg Gly Val Ala Ala Asp Val Gly Ala Gly Pro Val
565 570 575
Met Val Phe Pro Gly Gln Gly Ser Gln Trp Leu Gly Met Gly Val Glu
580 585 590
Leu Leu Asp Ser Ser Pro Val Phe Ala Ala Arg Ile Ala Ala Cys Glu
595 600 605
Arg Ala Leu Ala Ala His Val Asp Trp Ser Leu Thr Asp Val Leu Arg
610 615 620
Gly Ala Arg Gly Ala Ala Asp Ile Gly Arg Val Asp Val Val Gln Pro
625 630 635 640
Val Leu Trp Ala Val Met Val Ser Leu Ala Ala Val Trp Glu Ala His
645 650 655
Gly Val Arg Pro Ser Ala Val Val Gly His Ser Gln Gly Glu Ile Ala
660 665 670
Ala Ala Cys Val Ala Gly Ala Met Thr Leu Glu Asp Gly Ala Arg Val
675 680 685
Val Ala Leu Arg Ala Arg Ala Leu Arg Ala Leu Ala Gly Tyr Gly Ala
690 695 700
Met Ala Ser Leu Gly Cys Gly Val Glu Glu Thr Glu Arg Leu Thr Ala
705 710 715 720
Val His Ala Pro Asp Val Ala Val Ala Ala Val Asn Gly Pro Ser Ser
725 730 735
Thr Val Val Ser Gly Pro Ser Glu Gln Val Glu Lys Leu Val Ala Ala
740 745 750
Val Arg Ala Asp Gly Leu Arg Ala Arg Ala Ile Asp Val Asp Tyr Ala
755 760 765
Ser His Gly Pro Gln Val Asp Arg Ile Ala Asp Glu Leu Ala Asp Val
770 775 780
Leu Ala Gly Val Ser Gly Ala Ala Thr Asp Thr Ala Phe Tyr Ser Thr
785 790 795 800
Val Thr Gly Ala Arg Met Asp Ala Ser Gly Leu Asp Ala Gly Tyr Trp
805 810 815
Phe Thr Asn Leu Arg Gln Pro Val Arg Phe Ala Glu Ala Val Gln Ala
820 825 830
Leu Leu Asp Ala Asp Tyr Arg Val Phe Ile Glu Val Ser Ala His Pro
835 840 845
Val Leu Leu Leu Gly Leu Gln Glu Cys Phe Glu Ala Ala Gly Arg Pro
850 855 860
Ala Val Ala Ile Gly Thr Leu Arg Arg Asp Glu Gly Gly Pro Glu Arg
865 870 875 880
Leu Cys Arg Ala Leu Ala Glu Ala His Val Ala Gly Val Ala Val Asp
885 890 895
Trp Ala Ser Trp Tyr Ala Asp Gly Pro Ala Pro Ala Ala Val Pro Leu
900 905 910
Pro Ala Tyr Ala Phe Gln Arg Glu Arg Tyr Trp Leu Pro Ala Gly Ala
915 920 925
Gly Ser Gly Pro Gly Asp Val Ala Gly Ala Gly Leu Thr Ala Val Gly
930 935 940
His Ala Leu Leu Pro Val Ser Val Arg Leu Ala Asp Gly Ser Leu Val
945 950 955 960
Leu Thr Gly Arg Leu Pro Glu Ala Ala Arg Ala Gly Trp Leu Ala Glu
965 970 975
His Leu Val Ala Asp Leu Pro Leu Leu Pro Gly Thr Val Leu Val Glu
980 985 990
Trp Val Leu Arg Ala Ala Asp Glu Ala Gly Cys Gly Gly Val Glu Glu
995 1000 1005
Leu Ala Leu Gln Val Pro Val Ala Leu Pro Val Ser Gly Gly Leu
1010 1015 1020
Val Ile Gln Val Val Val Asp Ala Ala Glu Gly Asp Gly Arg Arg
1025 1030 1035
Pro Val Arg Val His Ser Arg Pro Glu Glu Asp Ser Gly Ala Pro
1040 1045 1050
Asp Ala Trp Val Cys His Val Ser Gly Thr Leu Leu Pro Gly Val
1055 1060 1065
Ala Gly Pro Val Pro Pro Ser Gly Pro Gly Gly Ala Trp Pro Pro
1070 1075 1080
Pro Gly Ala Arg Pro Ala Ala Ile Asp Gly Phe Tyr Glu Arg Ala
1085 1090 1095
Glu Ala Ala Gly Tyr Gly Tyr Gly Ala Phe Phe Arg Gly Leu Thr
1100 1105 1110
Asn Val Trp His Asp Gly Glu Asp Thr Leu Ala Glu Val Val Leu
1115 1120 1125
Pro Lys Glu Ala Ala Glu Gln Ala Gly Gly Phe Gly Ile His Pro
1130 1135 1140
Ala Leu Leu Asp Ala Ala Met Gln Pro Val Leu Leu Ala Gly Gln
1145 1150 1155
Leu Arg Gln Cys Ala Ala Ala Ala Gly Ala Asp Thr Ala Ser Gly
1160 1165 1170
Thr Val Leu Leu Pro Phe Thr Trp Ser Gly Val Arg Leu Trp Ala
1175 1180 1185
Gly Gly Ala Thr Arg Leu Arg Val Arg Leu Ser Pro Arg Pro Glu
1190 1195 1200
Gly Leu Arg Val Leu Leu Ala Asp Ala Thr Gly Ala Pro Val Leu
1205 1210 1215
Thr Ala Asp Ala Val Ala Leu Arg Glu Thr Gly Val Gln Gln Leu
1220 1225 1230
Arg Ala Ser Ser Arg Val Arg Gly Ser His Gly Leu Phe Ala Val
1235 1240 1245
Glu Trp Val Pro Pro Leu Ser Ala Thr Ala Gly Gly Thr Ala Pro
1250 1255 1260
Ala Thr Leu Ala Val Leu Gly Asp Asp Ala Pro Asp Leu Ala Asp
1265 1270 1275
Ala Asp Arg Tyr Pro Asp Leu Asp Ala Leu Phe Arg Ala Val Ala
1280 1285 1290
Asp Gly Ala Pro Ala Pro Asp Val Val Ile Ala Ser Val Arg Thr
1295 1300 1305
Gly Asn Asp Pro Ala Gly Ser Asp Thr Gly Leu Ala Thr Ala Arg
1310 1315 1320
Arg Thr Leu Thr Leu Ala Gln Glu Trp Leu Ala Gly Ser Gly Ala
1325 1330 1335
Asp Gly Ala Arg Leu Ala Val Val Thr Arg Ser Ala Ile Arg Thr
1340 1345 1350
Gly Asp Asp Gly Gln Glu Arg Val Val Pro Ser Ala Ala Ala Val
1355 1360 1365
Trp Gly Leu Met Arg Ser Ala Gln Thr Glu His Pro Gly Arg Phe
1370 1375 1380
Val Leu Ile Asp Glu Asp Thr Asp Ser Thr Glu Asn Ile Leu Glu
1385 1390 1395
Ala Val Arg Thr Asp Glu Pro Gln Leu Ala Leu Arg Gly Gly Arg
1400 1405 1410
Ala Leu Val Pro Arg Met Ala Arg Val Asp Ala Glu Pro Glu Leu
1415 1420 1425
Thr Ala Pro Ser Gly Glu Arg Ala Trp His Val Ala Ala Gly Lys
1430 1435 1440
Thr Gly Pro Asp Asp Leu Thr Ala Val Pro Ser Pro Arg Ala Ser
1445 1450 1455
Ala Pro Leu Ala Pro Gly Gln Val Arg Ile Ala Val Arg Ala Ala
1460 1465 1470
Gly Leu Asn Phe Arg Asp Ala Leu Ile Ala Leu Asp Met Tyr Pro
1475 1480 1485
Asp Ala Ser Ala Ser Ile Gly Ser Glu Gly Ala Gly Val Val Leu
1490 1495 1500
Glu Val Ser Glu Gly Val Ala Gly Val Ala Val Gly Asp Arg Val
1505 1510 1515
Met Gly Leu Phe Asn Asp Ala Phe Gly Pro Val Ala Val Ala Asp
1520 1525 1530
Ala Arg Met Val Ala Pro Val Pro Asp Gly Trp Ser Phe Arg Glu
1535 1540 1545
Ala Ala Ala Ala Pro Val Ala Phe Leu Thr Ala Trp Tyr Gly Leu
1550 1555 1560
Val Asp Leu Gly Gly Leu Ser Ser Gly Glu Thr Val Val Ile His
1565 1570 1575
Gly Ala Ala Gly Gly Val Gly Met Ala Ala Val Gln Val Ala Arg
1580 1585 1590
His Leu Gly Ala Glu Val Phe Ala Thr Ala Ser Pro Ala Lys His
1595 1600 1605
Pro Val Leu Glu Gly Met Gly Val Asp Ala Ala His Arg Ala Ser
1610 1615 1620
Ser Arg Asp Leu Gly Phe Glu Ala Ala Phe Ser Ser Ala Thr Gly
1625 1630 1635
Gly Arg Gly Val Asp Val Val Leu Asn Ser Leu Ala Gly Glu Phe
1640 1645 1650
Thr Asp Ala Ser Leu Arg Leu Leu Ala Pro Gly Gly Arg Leu Ile
1655 1660 1665
Glu Met Gly Lys Thr Asp Val Arg Asp Pro Asp Gln Val Ala Arg
1670 1675 1680
Glu His Ser Val Ala Tyr Arg Ala Phe Asp Leu Ile Ala Asp Ala
1685 1690 1695
Gly Pro Glu Arg Ile Gly Gln Leu Leu Ala Ala Leu Gly Glu Arg
1700 1705 1710
Phe Ala Asp Gly Ala Phe Thr Pro Leu Pro Val Thr Gly Trp Arg
1715 1720 1725
Leu Gly Gln Ala Arg Gln Ala Leu Arg Gln Leu Ser Gln Ala Arg
1730 1735 1740
His Thr Gly Lys Leu Val Leu Asp Val Asp Pro Ala Pro Asp Pro
1745 1750 1755
Asp Gly Thr Val Leu Ile Thr Gly Gly Thr Gly Thr Leu Gly Gly
1760 1765 1770
Leu Ile Ala Glu His Leu Val Arg Ser Arg Gly Val Arg His Leu
1775 1780 1785
Leu Leu Leu Ser Arg Arg Gly Pro Asp Ala Pro Gly Ala Glu Glu
1790 1795 1800
Leu Thr Ala Arg Leu Thr Glu Leu Gly Ala Arg Val Arg Val Ala
1805 1810 1815
Ala Val Asp Val Gly Asp Ala Thr Ala Leu Gly Glu Ala Val Ala
1820 1825 1830
Gly Val Asp Pro Ala His Pro Leu Thr Gly Val Val His Ala Ala
1835 1840 1845
Gly Val Val Ala Asp Ala Met Leu Pro Ser Gln Asp Asp Glu Arg
1850 1855 1860
Leu Val Ala Ala Trp Ser Ala Lys Ala Ala Ala Ala Ala Arg Leu
1865 1870 1875
His Asp Ala Thr Ala Gly Leu Pro Leu Gly Met Phe Val Leu Phe
1880 1885 1890
Ser Ser Phe Ala Ser Thr Leu Gly Thr Ala Gly Gln Ala Asn Tyr
1895 1900 1905
Ala Ala Ala Asn Ala Tyr Cys Asp Ala Leu Val Glu Arg Arg His
1910 1915 1920
Ala Glu Gly Leu Pro Gly Val Ser Val Ser Trp Gly Leu Trp Ser
1925 1930 1935
Ala Ala Ser Gly Leu Thr Gly Gly Leu Thr Glu Ala Asp Val Ala
1940 1945 1950
Arg Ile Ala Arg Gln Gly Ile Val Pro Asn Ser Thr Glu Gln Gly
1955 1960 1965
Tyr Asp Leu Phe Asp Ala Ala Leu Gly His Gly Arg Pro Ala Leu
1970 1975 1980
Leu Ala Leu Asn Leu Asp Thr Arg Ala Leu Ala Ala Gln Pro Val
1985 1990 1995
Ala Ala Leu Pro Ala Pro Leu Arg Ala Leu Ala Ala Asp Ala Gln
2000 2005 2010
Ala Ala Gly Ala Arg Ser Gly Gly Ala Ala Ala Arg Pro Thr Ala
2015 2020 2025
Ala Ala Ala Glu Glu Pro Ala Asp Trp Ala Ala Arg Leu Arg Ala
2030 2035 2040
Leu Ala Pro Ala Glu Gln Arg Arg Leu Leu Thr Asp Leu Val Arg
2045 2050 2055
Arg His Ala Ala Thr Val Leu Gly His Ala Asp Pro Glu Ala Val
2060 2065 2070
Pro Ala Asp Ala Ala Phe Lys Glu Leu Gly Phe Asp Ser Leu Thr
2075 2080 2085
Ala Val Glu Leu Arg Asn Arg Val Thr Ala Ala Thr Gly Leu Arg
2090 2095 2100
Leu Pro Ala Thr Val Ile Phe Asp Tyr Pro Glu Pro Gly Ala Leu
2105 2110 2115
Ala Glu Arg Leu Arg Thr Glu Leu Ala Pro Glu Glu Gly Ala Ser
2120 2125 2130
Ala Thr Ala Pro Asp Leu Tyr Ala Pro Val Leu Ser Arg Leu Thr
2135 2140 2145
Gly Leu Glu Glu Thr Leu Ala Ala Leu Ala Ser Ser Gly Val Asn
2150 2155 2160
Gly Gly Val Asn Gly Gly Val Ala Asp Pro Gly Ala Val Thr Ala
2165 2170 2175
Arg Leu Glu Ser Leu Leu Ala Asp Trp Lys Ala Ala His Ala Pro
2180 2185 2190
Ser Arg Asn Gly Gly Thr Ala Ala Glu Arg Leu Glu Ala Ala Thr
2195 2200 2205
Thr Asp Gln Val Leu Asp Phe Ile Asp Lys Glu Leu Gly Val Gln
2210 2215 2220
<210>11
<211>4032
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Ser Gly Gly Ala Val Thr Thr Glu Thr Asn Glu Glu Arg Leu Val
1 5 10 15
Asp Tyr Leu Lys Arg Val Ala Ala Asp Leu His Asp Thr Arg Ala Arg
20 25 30
Leu Arg Glu Val Glu Asp Gly Gln Arg Glu Pro Val Ala Ile Val Ala
35 40 45
Met Ala Cys Arg Tyr Pro Gly Asp Val Ala Ser Pro Glu Asp Leu Trp
50 55 60
Asp Leu Val Ala Ala Arg Arg His Ala Met Thr Ala Phe Pro Asp Asn
65 70 75 80
Arg Gly Trp Asp Leu Glu Arg Leu Phe His Pro Asp Pro Asp His Pro
85 90 95
Gly Thr Ser Tyr Ala Arg Glu Gly Gly Phe Leu His Asp Ala Asp Leu
100 105 110
Phe Asp Pro Glu Phe Phe Gly Ile Ser Pro Arg Glu Ala Ala Ala Val
115 120 125
Asp Pro Gln Gln Arg Leu Leu Leu Glu Val Ala Trp Glu Ala Leu Glu
130 135 140
Arg Ala Gly Ile Ala Pro Gly Ser Leu Lys Gly Ala Pro Val Gly Val
145 150 155 160
Tyr Ala Gly Thr Ala Leu Pro Gly Phe Gly Thr Pro His Ile Asp Arg
165 170 175
Ala Ala Glu Gly Tyr Leu Val Thr Gly Asn Ala Pro Ser Val Leu Ser
180 185 190
Gly Arg Val Ala Tyr Thr Leu Gly Leu Glu Gly Pro Ala Val Thr Val
195 200 205
Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Met His Leu Ala Ala Gln
210 215 220
Ala Leu Arg Gln Gly Glu Cys Glu Leu Ala Leu Ala Gly Gly Val Thr
225 230 235 240
Val Met Thr Thr Pro Tyr Val Phe Thr Glu Phe Ala Arg Gln Arg Gly
245 250 255
Leu Ala Ala Asp Ser Arg Cys Lys Ala Phe Ser Lys Gly Ala Asp Gly
260 265 270
Thr Ala Phe Ala Glu Gly Ala Gly Leu Leu Val Leu Glu Arg Leu Ser
275 280 285
Asp Ala Gln Arg Asn Gly His Gln Val Leu Ala Val Met Arg Gly Ser
290 295 300
Ala Ile Asn Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly
305 310 315 320
Leu Ala Gln Gln Arg Val Ile Arg Gln Ala Leu Ala Gly Ala Arg Leu
325 330 335
Ser Pro Ala Asp Val Asp Val Val Glu Ala His Gly Thr Gly Thr Thr
340 345 350
Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Gln
355 360 365
Glu Arg Pro Glu Gly Arg Pro Leu Trp Leu Gly Ala Ile Lys Pro Asn
370 375 380
Leu Gly His Thr Gln Gly Ala Ala Gly Val Ala Gly Val Ile Lys Met
385 390 395 400
Val Met Ala Leu Arg Asn Ala Ser Leu Pro Ala Leu Leu His Ala Asp
405 410 415
Arg Pro Thr Ser Val Val Asp Trp Asp Gly Gly Ala Val Arg Leu Leu
420 425 430
Ala Glu Pro Val Ala Trp Pro Ala Gly Asp Arg Arg Arg Arg Ala Gly
435 440 445
Val Ser Ser Phe Gly Ile Ser Gly Thr Asn Ala His Leu Ile Leu Glu
450 455 460
Glu Ala Pro Pro Arg Pro Asp Thr Glu Leu Pro Ala Ser Val Pro Gln
465 470 475 480
Arg Pro Glu Gly Thr Val Val Pro Trp Val Val Ser Ala Arg Gly Ala
485 490 495
Val Ser Leu Arg Thr Gln Ala Ala Ala Leu Ala Glu His Met Ala Ala
500 505 510
His Pro Asp Thr Pro Val Asp Ala Ile Gly Trp Ser Leu Ala Thr Thr
515 520 525
Arg Ser Pro Leu Asp His Arg Ala Val Val Leu Gly Ala Asp Arg Gly
530 535 540
Glu Leu Ser Ala Arg Leu Ala Asp Leu Ala Glu Gly Arg Thr His Pro
545 550 555 560
Asp Val Thr Arg Ala Ser Ala Pro Ala Arg Leu Gly Gly Ser Ala Phe
565 570 575
Leu Phe Thr Gly Gln Gly Ser Gln Arg Pro Gly Met Gly Ala Gln Leu
580 585 590
Tyr Arg Ala Tyr Pro Ala Phe Ala Ala Ala Phe Asp Glu Ala Cys Ala
595 600 605
Ala Leu Asp Pro His Leu Gly Arg Ser Leu Leu Glu Leu Val Phe Ala
610 615 620
Pro Ala Asp Thr Asp Gly Asp Gly Asp Ala Asp Arg Ala Ser Ala Leu
625 630 635 640
Asp Ala Thr Glu Val Thr Gln Ala Ala Leu Phe Ala Val Glu Val Ala
645 650 655
Leu Tyr Arg Leu Val Glu Ser Phe Gly Val Val Pro Gly Tyr Leu Ala
660 665 670
Gly His Ser Val Gly Glu Leu Val Ala Ala His Ile Ala Gly Val Leu
675 680 685
Ser Leu Pro Asp Ala Ala Arg Leu Val Ala Ala Arg Gly Arg Leu Met
690 695 700
Gln Ala Leu Pro Glu Gly Gly Ala Met Val Ala Leu Glu Ala Ala Glu
705 710 715 720
Glu Glu Val Ala Leu Leu Leu Ala Gly Arg Ala Asp Gln Val Ala Leu
725 730 735
Ala Ala Val Asn Ala Pro Thr Ser Val Val Val Ser Gly Asp Glu Glu
740 745 750
Ala Val Glu Glu Ile Ala Arg Thr Ile Arg Glu Arg Gly His Arg Thr
755 760 765
Arg Arg Leu Arg Val Gly His Ala Phe His Ser Pro Arg Ile Asp Pro
770 775 780
Met Leu Glu Glu Phe Arg Gln Val Ala Ala Ser Leu Thr Tyr Ser Gln
785 790 795 800
Pro Arg Ile Ala Val Val Ser Asn Val Thr Gly Ala Leu Ala Gly Ala
805 810 815
Glu Gln Leu Cys Asp Pro Asp Tyr Trp Val Arg His Ala Arg Gln Pro
820 825 830
Val Arg Phe Arg Asp Gly Ile Ala Ala Leu Arg Ala Glu Gly Val Thr
835 840 845
Arg Phe Leu Glu Leu Gly Pro Asp Ala Val Leu Thr Ala Met Ala Arg
850 855 860
Asp Cys Leu Thr Ser Glu Ala Ala Pro Glu Val Ala Ala Gly Glu Ser
865 870 875 880
Ala Gln Asp Ala Gly Thr Ala Ser Gly Pro Ser Ala Pro Val Leu Ala
885 890 895
Thr Val Leu Arg Lys Gly Arg Asp Glu Pro Arg Thr Leu Leu Thr Ala
900 905 910
Leu Ala Gln Leu His Val Asp Gly Glu Ser Val Asp Phe Ser Ala Ser
915 920 925
Phe Pro Ala Thr Thr Gln Ala Thr Asp Leu Pro Thr Tyr Arg Phe Asp
930 935 940
Arg Arg Arg Tyr Trp Arg Asp Ala Pro Gln Ala Glu Ala Asp Val Arg
945 950 955 960
Ala Ala Gly Leu Glu Ala Ser Asp His Pro Leu Leu Arg Ala Ala Leu
965 970 975
Glu Pro Ala Asp Gly Gly Leu Leu Leu Thr Gly Arg Leu Ser Leu Arg
980 985 990
Gly Gln Pro Trp Leu Ala Asp His Ala Ile Val Asp Ala Val Pro Leu
995 1000 1005
Pro Gly Thr Leu Phe Val Glu Leu Ala Leu Gln Ala Gly Glu Arg
1010 1015 1020
Val Gly Cys Asp Leu Ile Asp Asp Leu Thr Leu Glu Ala Pro Leu
1025 1030 1035
Leu Leu Pro Pro Val Gly Ala Val Asp Leu Gln Val Ala Val Gly
1040 1045 1050
Ala Thr Asp Ala Ala Gly Arg Arg Ala Val Thr Val Tyr Ser Arg
1055 1060 1065
Pro Ser Gly Gly Gly Ser Glu Gly Trp Glu Ser Ser Ala Asp Pro
1070 1075 1080
Gly Val Ala Asp Glu Pro Gly Asp Gly Pro Tyr Gly Pro Trp Arg
1085 1090 1095
Arg His Ala Thr Ala Thr Leu Gly Thr Ala Pro Thr Gly Val Pro
1100 1105 1110
Glu Pro Ala Ala Ser Pro Ala Gln Trp Pro Pro Ala Gly Ala Glu
1115 1120 1125
Ala Ile Asp Val Ala Gly Leu Tyr Glu Arg Leu Ala Ala Glu Gly
1130 1135 1140
Tyr Arg Tyr Gly Pro Ala Phe Thr Gly Leu Arg Thr Ala Trp Arg
1145 1150 1155
Val Gly Glu Glu Met Phe Ala Glu Val Gly Leu Ala Pro Gly Gln
1160 1165 1170
Arg Gly Asp Gly Gly Ala Tyr Ala Val His Pro Ala Leu Leu Asp
1175 1180 1185
Ala Ala Leu His Pro Ile Gly Ala Leu Phe Thr Gly Glu Asp Gln
1190 1195 1200
Ala Gly Gly Ala Pro Gly Thr Val Arg Leu Pro Phe Ser Phe Gly
1205 1210 1215
Gly Val Arg Leu Leu Ala Arg Gly Ala Ser Arg Leu Arg Val Arg
1220 1225 1230
Ile Thr Pro Thr Gly Pro Asp Thr Val Thr Met Arg Leu Ser Asp
1235 1240 1245
Asp Thr Gly Ala Glu Val Val Ala Val Asp Ser Leu Thr Leu Arg
1250 1255 1260
Thr Val Ser Ala Gln Arg Trp Arg Ser Gly Ala Val Pro Ala Asp
1265 1270 1275
Arg Pro Leu Tyr Arg Leu Asp Trp Asp Ala Phe Ala Leu Pro Ala
1280 1285 1290
Ala Ala Thr Thr Ala Pro Asp Arg Trp Ala Val Leu Ala Ala Asp
1295 1300 1305
Asp Thr Asn Ala Thr Asp Thr Ala Thr Ala Ser Leu Pro Ala Glu
1310 1315 1320
His Val Ala Arg His Pro Asp Leu Ala Ala Leu Ser Ala Ser Val
1325 1330 1335
Ala Ala Gly Ala Pro Ala Pro Asp Leu Val Val Met Ala Cys Leu
1340 1345 1350
Gly Ala Pro Tyr Asp Thr Ser Asp Asp Gly Asp Glu Pro Pro Ser
1355 1360 1365
Gln Val Arg Thr Ala Thr His Arg Val Leu Ala Arg Leu Arg Glu
1370 1375 1380
Trp Leu Thr Asp Asp Ala Leu Ala Ala Ser Arg Leu Val Val Leu
1385 1390 1395
Thr Ala Lys Ala Val Ala Ala Asp Pro Ala Asp Ala Pro Pro Asp
1400 1405 1410
Leu Ala Gly Ala Ala Val Leu Gly Leu Leu Arg Ala Ala Gln Ala
1415 1420 1425
Glu His Pro Gly Arg Ile Val Leu Val Asp Thr Asp Gly Ile Ser
1430 1435 1440
Ala Ser Arg Asp Ala Leu Ala Ala Ala Val Ala Ala Ala Val Ala
1445 1450 1455
Ala Gly Glu Trp Gln Leu Ala Leu Arg Asp Gly Arg Ala Leu Val
1460 1465 1470
Pro Arg Leu Ile Leu Ala His Pro Asp Pro Asp Ala Ala Pro Val
1475 1480 1485
Val Leu Asp Pro Asp Gly Thr Val Leu Val Thr Gly Gly Thr Gly
1490 1495 1500
Ser Leu Gly Arg Leu Leu Ala Arg His Leu Val Glu His His Gly
1505 1510 1515
Ala Arg His Leu Leu Leu Val Ser Arg Ser Gly Pro Ala Ala Glu
1520 1525 1530
Gly Ile Glu Ala Phe Ala Ala Gly Leu Ala Ala Asp Val Arg Ile
1535 1540 1545
Glu Ser Cys Asp Thr Thr Asp Pro Glu Ala Leu Ala Ala Leu Leu
1550 1555 1560
Ala Thr Val Pro Gly Glu His Pro Leu Thr Ala Val Val His Thr
1565 1570 1575
Ala Gly Val Leu Asp Asp Gly Val Val Thr Ser Leu Thr Pro Glu
1580 1585 1590
Gln Leu Asp Thr Val Leu Ala Pro Lys Val Asp Ala Ala Trp Gln
1595 1600 1605
Leu His Arg Leu Thr Arg Gly Ala Asp Leu Ala Ser Phe Val Leu
1610 1615 1620
Phe Ser Ser Ala Ala Ser Val Leu Gly Ser Gly Gly Gln Gly Asn
1625 1630 1635
Tyr Gly Ala Ala Asn Ala Phe Leu Asn Ala Leu Ala Glu His Ile
1640 1645 1650
Arg Ala Ala Gly Gly Pro Ala Thr Ser Leu Ala Trp Gly Leu Trp
1655 1660 1665
Gly Val Asp Glu Gly Met Thr Glu His Leu Ala Thr Ala Asp Arg
1670 1675 1680
Ala Arg Met Ala Arg Ser Gly Thr Ala Ala Met Ser Gly Glu Ala
1685 1690 1695
Gly Leu Ala Arg Phe Asp Ala Ala Leu Ala Thr Ala Leu Pro Val
1700 1705 1710
Leu Val Pro Ala Arg Phe Asp Leu Ala Val Leu Arg Glu Gln Ala
1715 1720 1725
Ala Gly Gly Ala Leu Pro Pro Leu Leu Arg Arg Leu Val Arg Leu
1730 1735 1740
Pro Val Arg Thr Ala Ala Ala Val Glu Ala Ser Pro Ser Trp Ala
1745 1750 1755
Gly Arg Leu Ala Gly Leu Pro Glu Thr Glu Gln Asp Arg Val Ile
1760 1765 1770
Gly Glu Leu Ile Arg Asp Arg Ile Ala Ala Val Leu Ala His Pro
1775 1780 1785
Glu Pro Glu Thr Leu Glu Leu Gly Arg Thr Phe Ala Gln Leu Gly
1790 1795 1800
Leu Asp Ser Leu Thr Ala Leu Glu Leu Arg Asn Ala Ile His Glu
1805 1810 1815
Ala Thr Gly Val Arg Leu Pro Ala Thr Ala Ile Phe Asp Tyr Pro
1820 1825 1830
Thr Pro Glu Thr Leu Val Ser His Leu Arg Thr Glu Leu Leu Gly
1835 1840 1845
Ala Thr Ala Thr Thr Ala Ala Thr Ala Pro Leu Pro Pro Gly Ala
1850 1855 1860
Gly Ala Pro Ala Arg Ser Gly Ser Ala Asp Asp Pro Val Val Ile
1865 1870 1875
Val Gly Met Ala Cys His Tyr Pro Gly Asp Val His Ser Pro Asp
1880 1885 1890
Glu Leu Trp Arg Leu Val Ala Asp Gly Val Asp Ala Ile Gly Pro
1895 1900 1905
Phe Pro Glu Asp Arg Gly Trp Asp Val Ala Gly Leu Tyr Asp Pro
1910 1915 1920
Asp Pro Glu Arg Thr Gly Lys Ser Tyr Thr Arg Glu Gly Gly Phe
1925 1930 1935
Leu Pro Glu Ala Ala Leu Phe Asp Ala Glu Phe Phe Gly Ile Ser
1940 1945 1950
Pro Arg Glu Ala Leu Ala Thr Asp Pro Gln Gln Arg Leu Leu Leu
1955 1960 1965
Glu Thr Ala Trp Gln Ala Phe Glu His Ala Arg Ile Asp Pro Ala
1970 1975 1980
Ala Leu Arg Gly Ser Arg Thr Ala Val Val Thr Gly Ile Met Tyr
1985 1990 1995
Asp Asp Tyr Gly Ala Arg Phe Leu Gly Arg Ile Pro Glu Gly Tyr
2000 2005 2010
Glu Gly Gln Ile Met Thr Gly Ser Thr Pro Ser Val Ala Ser Gly
2015 2020 2025
Arg Val Ala Tyr Thr Phe Gly Leu Glu Gly Pro Thr Leu Thr Val
2030 2035 2040
Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Met His Leu Ala Ala
2045 2050 2055
Gln Ala Leu Arg Gln Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly
2060 2065 2070
Val Thr Val Met Ala Thr Pro Asn Thr Phe Ile Glu Phe Ser Arg
2075 2080 2085
Gln Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Pro Phe Ala Ala
2090 2095 2100
Ala Ala Asp Gly Thr Gly Trp Ser Glu Gly Ala Gly Leu Leu Val
2105 2110 2115
Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Arg Val Leu
2120 2125 2130
Ala Val Leu Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn
2135 2140 2145
Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Gly
2150 2155 2160
Gln Ala Leu Ala Ala Ala Gly Val Asp Pro Ala Gly Val Asp Val
2165 2170 2175
Val Glu Ala His Gly Thr Gly Thr Met Leu Gly Asp Pro Ile Glu
2180 2185 2190
Ala Gln Ala Leu Leu Ala Thr Tyr Gly Arg Asn Arg Pro Ala Glu
2195 2200 2205
Gln Pro Leu Trp Leu Gly Ser Ile Lys Ser Asn Ile Gly His Thr
2210 2215 2220
Gln Ala Ala Ala Gly Ala Ala Gly Ile Ile Lys Met Val Met Ala
2225 2230 2235
Leu Arg His Gly Arg Leu Pro Ala Thr Leu His Val Asp Glu Pro
2240 2245 2250
Ser Pro His Val Asp Trp Ala Ser Gly Ser Val Arg Leu Leu Thr
2255 2260 2265
Glu Ala Thr Asp Trp Pro Glu Ala Asp Arg Pro Arg Arg Ala Ala
2270 2275 2280
Val Ser Ser Phe Gly Ile Ser Gly Thr Asn Ala His Leu Ile Leu
2285 2290 2295
Glu Gln Ala Pro Asp Gln Pro Glu Pro Leu Pro Glu Gln Ser Glu
2300 2305 2310
Ser Ala Thr Ala Gly Gly Ile Val Pro Phe Val Leu Ser Ala Arg
2315 2320 2325
Thr Ala Glu Ala Leu Arg Ala Gln Ala Ala Asn Leu Ala Ala Arg
2330 2335 2340
Leu Pro Ser Ala Gly Val Ala Glu Val Gly Trp Ser Leu Ala Thr
2345 2350 2355
Thr Arg Ser Ala Phe Glu His Arg Ala Val Ile Val Ala Glu Asp
2360 2365 2370
Arg Asp Ala Leu Leu Ala Gly Leu Glu Lys Leu Ala Ala Asp Glu
2375 2380 2385
Pro Asp Pro Ala Val Val Ala Gly Thr Ala Thr Thr Ala Ala Ala
2390 2395 2400
Gly Pro Val Phe Val Phe Pro Gly Gln Gly Ser Gln Trp Arg Gly
2405 2410 2415
Met Gly Val Glu Leu Leu Asp Thr Ser Pro Val Phe Ala Ala Arg
2420 2425 2430
Ile Ala Glu Cys Glu Arg Ala Leu Ala Pro Tyr Val Asp Trp Ser
2435 2440 2445
Leu Thr Ala Val Leu Arg Gly Ser Asp Thr Thr Thr Asp Pro His
2450 2455 2460
Arg Val Asp Val Val Gln Pro Thr Leu Trp Ala Val Met Val Ser
2465 2470 2475
Leu Ala Ala Leu Trp Gln His Leu Gly Ile Ala Pro Ala Ala Val
2480 2485 2490
Ile Gly His Ser Gln Gly Glu Ile Ala Ala Ala Cys Val Ala Gly
2495 2500 2505
Ala Leu Thr Leu Asp Asp Ala Ala Lys Val Val Ala Leu Arg Ser
2510 2515 2520
Gln Ala Leu Arg Ala Leu Ala Gly His Gly Thr Met Ala Ser Leu
2525 2530 2535
Thr Leu Gly Ala Asp Asp Thr Ala Gly Leu Leu Glu Glu Leu Gly
2540 2545 2550
Glu Arg Ala Asp Asp Val Thr Val Ala Ala Ala Asn Gly Pro Val
2555 2560 2565
Thr Thr Val Ile Ser Gly Ala Val Glu Gln Ile Ala Thr Val Leu
2570 2575 2580
Ala Ala Ala Glu Ala His Gly Ala Arg Thr Arg Thr Ile Asp Val
2585 2590 2595
Asp Tyr Ala Ser His Gly Pro His Val Asp Arg Ile Arg Glu Asp
2600 2605 2610
Ile Val Ser Ala Leu Ser Gly Leu Ala Pro Thr Ala Ser Glu Val
2615 2620 2625
Ala Phe Tyr Ser Thr Val Thr Ala Glu Arg Leu Asp Thr Ala Gly
2630 2635 2640
Leu Asp Ala Asp Tyr Trp Phe Thr Asn Leu Arg Arg Pro Val Arg
2645 2650 2655
Phe Ala Asp Thr Leu Ala Thr Leu Leu Ala His Gly His Arg His
2660 2665 2670
Phe Ile Glu Val Ser Pro His Pro Val Leu Ile Pro Gly Met Gln
2675 2680 2685
Asp Gly Phe Glu Ala Ala Asp Ala Ala Ala Thr Ala Val Pro Thr
2690 2695 2700
Leu Arg Arg Asp Gln Gly Gly Pro His Gln Leu Ala Gln Ala Val
2705 2710 2715
Ala Arg Ala Tyr Thr Ala Gly Leu Ala Ile Asp Trp Ala Pro Trp
2720 2725 2730
Tyr Pro Ala Arg Pro Tyr Thr Thr Asp Leu Pro Thr Tyr Pro Phe
2735 2740 2745
Gln Arg Arg Arg Tyr Trp Leu Gly Met Asp Gly Gly Pro Gly Asp
2750 2755 2760
Leu Arg Ser Ala Gly Leu Val Ser Val Ser His Ala Gln Ile Gly
2765 2770 2775
Ala Ala Val Glu Leu Ala Asp Gly Gly Leu Val Met Thr Gly Arg
2780 2785 2790
Leu Pro Ala Ala Gly Ser Gly Gly Trp Leu Asp Asp His Val Val
2795 2800 2805
Ala Asp Thr Pro Leu Val Pro Gly Thr Ala Leu Val Glu Trp Val
2810 2815 2820
Leu Arg Ala Ala Asp Glu Ala Gly Cys Gly Gly Ile Glu Glu Leu
2825 2830 2835
Ala Leu His Val Pro Met Thr Leu Pro Ala Ser Gly Gly Leu Arg
2840 2845 2850
Ile Gln Val Val Ala Tyr Ala Pro Asp Gly Asp Gly Arg Arg Glu
2855 2860 2865
Val Arg Val His Ser Arg Pro Asp Ala Glu Asp Gly Ser Ser Pro
2870 2875 2880
Trp Thr Cys His Ala Thr Gly His Leu Ser Pro Thr Ala Pro Gly
2885 2890 2895
Ala Ala Asp Pro Ala Pro Ala Gly Val Trp Pro Pro Arg Asp Ala
2900 2905 2910
Glu Gln Val Asp Val Ala Asp Phe Tyr Gly Arg Ala Glu Ala Ile
2915 2920 2925
Gly Tyr Gly Tyr Gly Pro Ala Phe Arg Gly Leu Thr Ala Ala Trp
2930 2935 2940
Arg Gln Gly Asp Asp Leu Leu Ala Glu Val Val Leu Pro Glu Ala
2945 2950 2955
Ala His Glu Gly Ala Asp Gly Phe Ala Leu His Pro Ala Leu Leu
2960 2965 2970
Asp Ala Ala Leu His Pro Leu Ala Leu Asp Gly Gln Gly Glu Asp
2975 2980 2985
Gly Arg Met Arg Leu Pro Phe Ala Trp Ser Gly Val Ser Leu Trp
2990 2995 3000
Ala Thr Gly Ala Arg Ala Ala Arg Val Arg Met Ser Pro Leu Glu
3005 3010 3015
His Gly Phe Arg Leu Val Val Ala Asp Ala Ala Gly Arg Pro Val
3020 3025 3030
Leu Ser Ala Glu Ser Val Val Val Arg Pro Thr Ser Ala Arg Gln
3035 3040 3045
Leu Arg Asp Ala Gly Ala Arg Arg Val Asp Gly Leu Tyr Glu Val
3050 3055 3060
Ala Trp Val Ala Leu Pro Pro Ser Ser Asp Thr Val Ala Glu Thr
3065 3070 3075
Glu Thr Arg Gly Val Glu Gly Trp Ala Leu Leu Asp Gly Gly Pro
3080 3085 3090
Leu Pro Phe Asp Pro Ser Lys Ala Gly Ser Leu Pro Arg His Ala
3095 3100 3105
Asp Ile Asp Ala Leu Leu Thr Ala Pro Ala Leu Pro Ser Thr Val
3110 3115 3120
Leu Val Gly Val Ser Gly Pro Val Gly Ala Ala Gly Asp Glu Asn
3125 3130 3135
Ser Ala Glu Gly Ala Leu Ala Val Thr Thr Gly Val Leu Thr Ser
3140 3145 3150
Ala Arg Arg Trp Leu Glu Thr Pro Glu Leu Ala Asp Ala Arg Leu
3155 3160 3165
Val Leu Val Thr Arg Gly Ala Val Ala Ala Ala Glu Thr Asp Asp
3170 3175 3180
Gly Pro Asp Pro Ala Ala Ala Ala Val Trp Gly Leu Leu Arg Ser
3185 3190 3195
Ala Gln Ala Glu Asn Pro Gly Arg Phe Leu Leu Cys Asp Ile Asp
3200 3205 3210
Asp Gly Ala Gly Pro Asp Asp Val Leu Gly Ala Val Thr Arg Ala
3215 3220 3225
Val Ala Leu Asp Glu Pro Gln Val Ala Val Arg Gly Glu Arg Val
3230 3235 3240
Leu Thr Pro Arg Leu Glu Arg Ala Gly Ala Ala Glu Leu Val Pro
3245 3250 3255
Pro Pro Gly Glu Pro Ala Trp Arg Leu Ser Ala Asp Asp Thr Gly
3260 3265 3270
Thr Ile Asp Ser Val Ser Val Val Ala Cys Pro Glu Val Leu Glu
3275 3280 3285
Pro Leu Ala Pro Gly Gln Val Arg Ile Ala Val Arg Ala Ala Gly
3290 3295 3300
Ile Asn Phe Arg Asp Val Leu Ile Val Leu Gly Met Tyr Pro Asp
3305 3310 3315
Glu Gly Val Phe Arg Gly Ser Glu Gly Ala Gly Val Val Leu Asp
3320 3325 3330
Val Ala Asp Asp Val Thr Ser Val Ala Val Gly Asp Arg Val Phe
3335 3340 3345
Gly Leu Phe Glu Gly Ala Phe Gly Pro Val Ala Val Ala Asp Ala
3350 3355 3360
Arg Ala Val Val Pro Val Pro Pro Asp Trp Thr Asp Gln Gln Ala
3365 3370 3375
Ala Ala Val Pro Thr Thr Phe Leu Thr Ala Trp Tyr Gly Leu Val
3380 3385 3390
Asp Leu Ala Gly Leu Lys Ala Gly Glu Ser Val Leu Ile His Ala
3395 3400 3405
Ala Thr Gly Gly Val Gly Thr Ala Ala Val Gln Ile Ala Arg His
3410 3415 3420
Leu Gly Ala Glu Val Tyr Ala Thr Ala Gly Pro Gly Lys His Ala
3425 3430 3435
Val Leu Glu Ala Met Gly Ile Asp Glu Ala His Arg Ala Ser Ser
3440 3445 3450
Arg Asp Leu Asp Phe Glu Asp Ala Phe Arg Thr Ala Thr Gly Gly
3455 3460 3465
Arg Gly Val Asp Val Ile Leu Asn Ser Leu Ala Gly Glu Tyr Thr
3470 3475 3480
Asp Ala Ser Leu Arg Leu Leu Thr Gly Gly Gly Arg Phe Ile Glu
3485 3490 3495
Met Gly Lys Thr Asp Lys Arg Asp Ala Glu Gln Ile Ala Asp Thr
3500 3505 3510
Tyr Ser Gly Val Arg Tyr Arg Phe Tyr Asp Leu Val Pro Asp Ala
3515 3520 3525
Gly Leu Asp Arg Val Ala Glu Met Leu Thr Thr Leu Ala Gly His
3530 3535 3540
Phe Ala Gln Gly Val Leu Ala Pro Pro Pro Val Arg Ala Trp Pro
3545 3550 3555
Leu Thr Glu Ala Arg Gln Ala Leu Arg Gln Met Ser Gln Ala Arg
3560 3565 3570
His Thr Gly Lys Tyr Val Leu Asp Met Pro Arg Thr Leu Asp Pro
3575 3580 3585
Asp Gly Thr Val Leu Ile Thr Gly Gly Thr Gly Thr Leu Gly Ala
3590 3595 3600
Leu Val Ala Glu His Leu Val Thr Asn His His Ile Ser His Leu
3605 3610 3615
His Leu Leu Ser Arg Arg Gly Pro Asp Ala Pro Gly Ser Ala Asp
3620 3625 3630
Leu Ala Ala Arg Leu Thr Glu Leu Gly Ala Thr Val Arg Ile Thr
3635 3640 3645
Ala Thr Asp Thr Thr Asp Pro Gln Ala Leu Arg Gln Ala Leu Asp
3650 3655 3660
Thr Val Asp Arg Asp His Pro Leu Thr Gly Val Ile His Ala Ala
3665 3670 3675
Gly Ala Leu Asp Asp Ala Val Leu Thr Ala Gln Thr Pro Glu Arg
3680 3685 3690
Leu Ala Ser Val Trp Ala Ala Lys Ala Thr Ala Ala Ala Asn Leu
3695 3700 3705
His Arg Ala Thr Lys Asp Leu Pro Leu Ala Met Phe Val Ile Phe
3710 3715 3720
Ser Ser Ala Ala Gly Thr Leu Gly Thr Pro Gly Gln Ala Asn Tyr
3725 3730 3735
Ala Ala Ala Asn Ala Tyr Cys Asp Ala Leu Ala Val Arg Arg Arg
3740 3745 3750
Arg Ala Gly Leu Pro Ala Thr Ser Ile Ala Trp Gly Leu Trp Ala
3755 3760 3765
Ala Thr Ser Glu Met Thr Gly His Leu Ala Asp Ala Asp Leu Ala
3770 3775 3780
Arg Met Ser Arg Thr Gly Phe Thr Pro Leu Ala Thr Pro Met Ala
3785 3790 3795
Leu Ala Leu Phe Asp Ala Ala Gly Arg His Gly Ala Ala Thr Pro
3800 3805 3810
Leu Ala Leu Asp Leu Asp Pro Arg Thr Leu Gly Ala Gln Pro Ser
3815 3820 3825
Asp Ala Val Pro Ala Val Leu Arg Thr Val Ala Ala Ala Gly Ala
3830 3835 3840
Pro Val Arg Arg Thr Ala Ala Val Ala Gln Ser Thr Asp Trp Ala
3845 3850 3855
Gly Arg Leu Ala Ala Leu Ser Ala Ala Glu Arg His Arg Glu Leu
3860 3865 3870
Val Asn Leu Val Arg Thr His Ala Ala Thr Val Leu Gly His Ser
3875 3880 3885
Asp Pro Ala Ala Leu Arg Ala Asp Thr Ser Phe Lys Glu Leu Gly
3890 3895 3900
Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Ser Ala
3905 3910 3915
Ala Thr Gly Leu Arg Leu Pro Ala Ala Leu Val Phe Asp Tyr Pro
3920 3925 3930
Asp Ala Glu Thr Met Ala Arg Phe Leu Asp Gln Lys Leu Ala Pro
3935 3940 3945
Gly Asp Arg Thr Glu Ala Ala Ala Val Asp His Leu Ala Pro Val
3950 3955 3960
Leu Asn Asp Leu Ala Arg Leu Glu Ser Thr Leu Gly Ser His Asp
3965 3970 3975
Val Asp Gly Lys Ala Arg Glu Thr Val Ala Gly Arg Leu His Ala
3980 3985 3990
Leu Leu Ser Arg Leu Glu Gly Ser Thr Ala Ser Ala Ala Asp Ile
3995 4000 4005
Asp Gly Glu Ala Leu Glu Ser Ala Ser Asp Asp Glu Met Phe Ala
4010 4015 4020
Leu Ile Asp Gln Gln Leu Gly Ser Ser
4025 4030
<210>12
<211>3956
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Ser Ser Thr Glu Asp Lys Leu Arg Gln Tyr Leu Lys Arg Val Thr
1 5 10 15
Val Asp Leu Gly Glu Ala Arg Ala Arg Leu Arg Lys Ala Glu Gln Arg
20 25 30
Gln His Glu Pro Ile Ala Ile Thr Ser Met Ala Cys Arg Tyr Pro Gly
35 40 45
Gly Val Thr Ser Pro Glu Thr Leu Trp Glu Leu Val Asp Ser Arg Thr
50 55 60
Asp Ala Ile Gly Ser Phe Pro Ala Asn Arg Gly Trp Asn Leu Ala Ser
65 70 75 80
Leu Tyr His Pro Asp Pro Asp His Ser Gly Thr Ser Tyr Val Arg Asp
85 90 95
Gly Gly Phe Val His Asp Ala Asp Glu Phe Asp Ala Ser Phe Phe Asn
100 105 110
Ile Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Leu
115 120 125
Leu Glu Thr Ala Trp Glu Leu Leu Glu Arg Ala His Ile Asp Pro Thr
130 135 140
Ala Leu Lys Gly Thr Pro Thr Gly Val Tyr Thr Gly Cys Gly Val Pro
145 150 155 160
Gly Phe Gly Thr Pro His Ile Glu Arg Ser Ala Glu Gly Phe Leu Leu
165 170 175
Thr Gly Asn Ala Leu Ser Val Val Ser Gly Arg Ile Ala Phe Thr Leu
180 185 190
Gly Leu Glu Gly Pro Ala Val Thr Leu Asp Thr Ala Cys Ser Ser Ser
195 200 205
Leu Val Ala Met His Leu Ala Val Gln Ala Leu Arg Gln Gly Glu Cys
210 215 220
Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ser Thr Pro Asn Val
225 230 235 240
Ile Val Glu Phe Ser Arg Gln Arg Gly Leu Ser Pro Asp Gly Arg Cys
245 250 255
Lys Pro Phe Ala Thr Ala Ala Asp Gly Thr Gly Phe Ser Glu Gly Ala
260 265 270
Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Glu Arg Asn Gly His
275 280 285
Gln Val Leu Ala Val Ile Arg Gly Thr Ala Val Asn Gln Asp Gly Ala
290 295 300
Ser Asn Gly Leu Ser Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile
305 310 315 320
Arg Gln Ala Leu Ala Asn Ala Gly Leu Ala Thr Val Glu Val Asp Ala
325 330 335
Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala
340 345 350
Glu Ala Leu Leu Ala Thr Tyr Gly Gln Glu Arg Pro Glu Asp Arg Pro
355 360 365
Leu Trp Leu Gly Ser Ile Lys Ser Asn Ile Gly His Thr Gln Gly Ala
370 375 380
Ala Gly Val Ala Gly Val Ile Lys Met Val Met Ala Met Arg His Ala
385 390 395 400
Ser Leu Pro Ala Thr Leu His Val Asp Glu Pro Thr Ser His Val Asp
405 410 415
Trp Asp Arg Gly Thr Val Arg Leu Leu Thr Glu Pro Val Asp Trp Pro
420 425 430
Thr Ala Pro Asp Arg Pro Arg Arg Ala Gly Val Ser Ala Phe Gly Ile
435 440 445
Ser Gly Thr Asn Ala His Ile Ile Leu Glu Glu Ala Gly Leu Pro Thr
450 455 460
Ala Ala Glu Ala Glu Ala Gly Thr Ala Ala Glu Ala Gly Thr Asp Ala
465 470 475 480
Gly Ala Gly Thr Glu Ala Gly Ala Glu Asp Ala Ala Pro Glu Glu Ala
485 490 495
Thr Val Glu Pro Ala Leu Leu Gly Gly Val Ala Pro Trp Val Val Ser
500 505 510
Ala Arg Thr Gln Glu Ala Leu Ala Asp Gln Ala Arg Gly Leu Val Arg
515 520 525
Ala Val Thr Asp Thr Gly Ala Pro Asp Ala Val Pro Ala Glu Val Ala
530 535 540
Trp Ser Leu Ala Thr Thr Arg Ala Thr Phe Asp His Arg Ala Val Val
545 550 555 560
Thr Gly Thr Glu Leu Ala Asp Leu Thr Ala Ala Leu Glu Ala Leu Ala
565 570 575
Thr Gly Gly Glu His Pro His Leu Val Arg Gly Thr Ala Leu Asp Pro
580 585 590
Gln Ala Gly Pro Val Leu Val Phe Pro Gly Gln Gly Ser Gln Trp Pro
595 600 605
Gly Met Ala Val Gly Leu Leu Asp Ser Ser Pro Ala Phe Ala Thr Arg
610 615 620
Ile Ala Ala Cys Glu Gln Ala Leu Ala Pro Tyr Val Asp Trp Ser Leu
625 630 635 640
Thr Ala Val Leu Arg Gly Ser Asp Thr Ala Thr Asp Pro His Arg Val
645 650 655
Asp Val Ile Gln Pro Thr Leu Trp Ala Val Met Val Ser Leu Ala Gly
660 665 670
Leu Trp Gln Asp Phe Gly Ile Thr Pro Ala Ala Val Ile Gly His Ser
675 680 685
Gln Gly Glu Ile Ala Ala Ala Cys Val Ala Gly Ala Leu Ser Leu Asp
690 695 700
Asp Ala Ala Lys Val Val Ala Leu Arg Ser Gln Ala Leu Arg Ala Leu
705 710 715 720
Ala Gly His Gly Ala Met Ala Ser Leu Thr Leu Gly Ala Glu Asp Thr
725 730 735
Ala Arg Val Leu Thr Gly Leu Gly Pro Ala Ala Glu Gly Val Ala Val
740 745 750
Ala Ala His Asn Gly Pro Arg Ser Thr Val Val Ser Gly Pro Pro Asp
755 760 765
Gln Ile Ala Thr Val Leu Ala Ala Ala Glu Ala Arg Gly Ala Arg Thr
770 775 780
Arg Thr Ile Asp Val Asp Tyr Ala Ser His Ser Pro His Val Asp Arg
785 790 795 800
Ile Arg Asp Thr Ile Leu Ala Gln Leu Ala Asp Leu Ala Pro Ala Ala
805 810 815
Pro Thr Ile Pro Phe Tyr Ser Thr Val Thr Gly Glu Pro Leu Ala Asp
820 825 830
Thr Pro Leu Asp Ala Glu Tyr Trp Phe Thr Asn Leu Arg Gln Pro Val
835 840 845
Arg Phe Thr Asp Thr Leu Thr Thr Leu Leu Asp His Gln His Arg His
850 855 860
Phe Ile Glu Ala Ser Pro His Pro Val Leu Thr Pro Gly Ile Gln Asp
865 870 875 880
Ala Ile Asp Asp Ala Glu Leu Pro Ala Thr Thr Ile Pro Thr Leu Arg
885 890 895
Arg Asp His Gly Thr Pro His Asp Leu Ala Asp Ala Leu Ala Leu Ala
900 905 910
His Thr Thr Gly Leu Ala Val Asp Trp Arg Pro Trp Tyr Ala Thr Thr
915 920 925
Pro Pro Ala Thr Thr Asp Leu Pro Thr Tyr Pro Phe Gln Arg Gln Arg
930 935 940
Tyr Trp Ser Ala Ala Gly Arg Arg Thr Gly Asp Val Ser Ala Ala Gly
945 950 955 960
Leu Arg Pro Val Asp His Pro Gln Leu Ser Ala Ala Thr Gly Leu Ala
965 970 975
Asp Gly Gly Leu Leu Leu Thr Gly Arg Leu Pro Ala Ala Gly Asp Ala
980 985 990
Gly Trp Leu Gly Glu His Glu Phe Ala Asp Val Val Leu Val Pro Ser
995 1000 1005
Thr Ala Leu Val Glu Trp Thr Leu Arg Ala Ala Asp Glu Ala Gly
1010 1015 1020
Cys Gly Gly Val Glu Glu Leu Thr Leu Glu Val Pro Leu Thr Leu
1025 1030 1035
Ser Ala Ala Ser Glu Leu Arg Val Gln Val Val Val Asp Ala Pro
1040 1045 1050
Asp Glu Asp Gly Arg Arg Ala Val Arg Val Ser Ser Gln Pro Ala
1055 1060 1065
Val Asp Thr Pro Asp Arg Ala Asp Gly Gln Asp Thr Trp Thr Cys
1070 1075 1080
His Ala Thr Gly Thr Leu Met Ala Ala Ala Ala Ala Gly Thr Glu
1085 1090 1095
Leu Ala Gly Ala Trp Pro Pro Ala Gly Ala Glu Pro Val Asp Leu
1100 1105 1110
Thr Asn Leu Tyr Ala Arg Ala Glu Ala Ala Gly Tyr Arg Tyr Gly
1115 1120 1125
Pro Thr Phe Gln Gly Val Gln Ala Val Trp Arg His Gly Ala Asp
1130 1135 1140
Leu Leu Ala Glu Val Ala Leu Asp Gln Gly Ala Glu Glu Gly Gly
1145 1150 1155
Asp Glu Phe Gly Ile His Pro Ala Leu Leu Glu Cys Ala Leu His
1160 1165 1170
Pro Val Ala Leu Thr Asp Thr Pro His Asp Asp Thr Pro Leu Gly
1175 1180 1185
Asp Ala Asp Thr Asp Gly Pro Leu Trp Leu Pro Phe Ala Trp Asn
1190 1195 1200
Gly Val Ser Leu His Ala Gly Gly Ala Thr Ser Val Arg Val Arg
1205 1210 1215
Ile Gly Gln Arg Gly Gln Thr Asp Thr Glu Gly Arg Glu Leu Thr
1220 1225 1230
Val Val Val Ala Asp Pro Thr Gly Ala Pro Val Leu Thr Val Asp
1235 1240 1245
Ser Val Ala Leu Arg Pro Ala Asp Gly Asp Trp Leu Lys Ala Ala
1250 1255 1260
Glu Arg Arg Ser Thr Ala Ala Leu Phe Thr Val Glu Trp Thr Pro
1265 1270 1275
Leu Pro Pro Gln Asp Ser Arg Pro Glu Pro Val Glu Ala Glu Asp
1280 1285 1290
Gly Trp Ala Thr Leu Gly Ala Ser Gly Pro Gly His His Tyr Ala
1295 1300 1305
Asp Leu Ala Ala Leu Leu Ser Ala Ala Asp Gly Ala Glu Pro Ala
1310 1315 1320
Pro Pro Val Val Leu Ala Ser Val Thr Pro Thr Ala Asp Thr Gly
1325 1330 1335
Ala Asp Ser Glu Ala Asp Thr Asp Leu Ala Thr Val Arg Arg Thr
1340 1345 1350
Leu Gly Leu Ile Gln Glu Trp Leu Ala Glu Pro Gly Leu Arg Asp
1355 1360 1365
Ser Arg Leu Val Leu Ile Thr Ser Gly Ala Thr Ser Val Gly Asp
l370 1375 1380
Gly Asp Gly Pro Val Glu Pro Gly Ser Ala Ala Val Phe Gly Leu
1385 1390 1395
Val Gln Ala Val Gln Ala Glu His Pro Asp Arg Phe Met Leu Val
1400 1405 1410
Asp Val Gly Ala Asp Ala Asp Ala Asp Gly Asp Gly Asp Gly Gly
1415 1420 1425
Glu Thr Leu Ala Asp Ala Val Arg Arg Ala Ile Ala Ala Asp Glu
1430 1435 1440
Pro Gln Ile Ala Val Arg Ser Gly Glu Val Ser Val Pro Arg Leu
1445 1450 1455
Leu Arg Ala Ala Ala Arg Pro Asp Glu Gly Thr Ala Val Glu Leu
1460 1465 1470
Ser Gly Gly Thr Val Val Val Ser Gly Ala Met Asp His Val Ser
1475 1480 1485
Gly Gly Ala Ile Ala Glu Gln Leu Val Arg Ala Tyr Gly Ala Glu
1490 1495 1500
Arg Leu Leu Leu Leu Ser His Pro Asp Glu Gln Ala Pro Asp Leu
1505 1510 1515
Ala Glu Arg Leu Thr Ala Leu Gly Ala Ala Val Glu Val Ala Val
1520 1525 1530
Val Asp Ile Ala Asp Arg Ala Ala Leu Ala Glu Val Leu Ala Ser
1535 1540 1545
Val Pro Asp Ser His Pro Leu Val Gly Val Val His Leu Ala Gly
1550 1555 1560
Ala Ala Asp Glu Gly Pro Val Glu Ser Trp Asn Asp Gly Arg Leu
1565 1570 1575
Ser Arg Ala Trp Ala Pro Arg Ala Thr Gly Ala Trp Gln Leu His
1580 1585 1590
Thr Leu Thr Gln Asp Leu Pro Leu Arg Met Phe Val Val Cys Ser
1595 1600 1605
Ala Ala Ala Asp Val Thr Gly Gly Pro Gly Arg Ala Gly Tyr Ala
1610 1615 1620
Ala Ala Asn Ala His Thr Asp Ala Leu Ile Ala His Arg Arg Ala
1625 1630 1635
Ala Gly Leu Pro Gly Thr Gly Leu Val Trp Ala Leu Glu Glu Glu
1640 1645 1650
Ala Thr Ala Asp Ala Ser Arg Leu Phe Asp Ala Ala Phe His Ala
1655 1660 1665
Val Gln Pro Leu Val Val Ala Ala Asp Leu Asp Thr Ala Arg Leu
1670 1675 1680
Gly Pro Ser Ala Pro Ala Leu Leu Arg Ala Leu Val Arg Pro Ala
1685 1690 1695
Arg Arg Arg Ala Ala Glu Arg Gln Ser Ala Ala His Ala Leu Thr
1700 1705 1710
Ser Arg Leu Ala Gly Leu Asp Asn Ser Gly Gln Arg Glu Leu Leu
1715 1720 1725
Leu Asp Val Val Arg Gln Met Ala Ala Val Val Leu Gly His Ser
1730 1735 1740
Ser Asp Thr Ala Ile Arg Ala Glu Ala Ala Phe Lys Glu Leu Gly
1745 1750 1755
Phe Asp Ser Leu Thr Ala Val Gly Leu Arg Asn Arg Leu Val Asp
1760 1765 1770
Ala Thr Gly Leu Arg Leu Pro Ser Thr Leu Val Phe Asp Tyr Pro
1775 1780 1785
Thr Pro Arg Ala Leu Ala Asp His Leu Leu Gln Leu Val Thr Ser
1790 1795 1800
Thr Ala Pro Thr Thr Ser Leu Pro Val Gly Pro Ala Arg Ala Ala
1805 1810 1815
Gly Ala Asp Asp Glu Pro Ile Ala Val Val Ala Met Ala Cys Arg
1820 1825 1830
Phe Pro Gly Asp Val Thr Thr Pro Glu Gly Leu Trp Asp Leu Val
1835 1840 1845
Ala Ala Gly Glu Asn Ile Arg Gly Pro Phe Pro Thr Asn Arg Gly
1850 1855 1860
Trp Asp Leu Ala Asn Leu Phe His Pro Asp Pro Glu His Pro Gly
1865 1870 1875
Thr Thr Tyr Ala Ser Gln Gly Ala Phe Ile Tyr Asp Ala Asp Gly
1880 1885 1890
Phe Asp Ala Ala Phe Phe Gly Ile Asn Pro Arg Glu Ala Leu Ala
1895 1900 1905
Ile Asp Pro Gln Gln Arg Leu Ile Leu Glu Thr Ala Trp Glu Ala
1910 1915 1920
Leu Glu Arg Ala Gly Ile Asp Pro His Thr Leu Lys Glu Ser Leu
1925 1930 1935
Thr Gly Val Tyr Thr Gly Val Ile Tyr His Asp Tyr Ala Ala Gly
1940 1945 1950
Leu Pro Ala Ser Asp Pro Arg Leu Asp Gly Tyr Thr Met Leu Ser
1955 1960 1965
Ser Ile Gly Ser Ile Ile Ser Gly Arg Val Ala Tyr Thr Leu Gly
1970 1975 1980
Leu Gln Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser
1985 1990 1995
Leu Val Ala Met His Leu Ala Ala Gln Ala Leu Arg Gln Gly Glu
2000 2005 2010
Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro
2015 2020 2025
Asp Pro Phe Thr Gly Phe Ser Arg Gln Arg Gly Leu Ala Pro Asp
2030 2035 2040
Gly Arg Cys Lys Pro Phe Ala Ala Ala Ala Asp Gly Thr Ser Leu
2045 2050 2055
Ser Glu Gly Ala Gly Leu Val Val Leu Glu Arg Leu Ser Asp Ala
2060 2065 2070
Arg Arg Asn Gly His Gln Val Leu Ala Val Leu Arg Gly Ser Ala
2075 2080 2085
Ile Asn Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly
2090 2095 2100
Pro Ser Gln Gln Arg Val Ile Gly Gln Ala Leu Ala Asn Ala Gly
2105 2110 2115
Leu Gly Pro Ala Asp Ile Asp Ala Val Glu Ala His Gly Thr Gly
2120 2125 2130
Thr Thr Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr
2135 2140 2145
Tyr Gly Gln His Arg Ala Asp Asp Arg Pro Leu Trp Leu Gly Ser
2150 2155 2160
Val Lys Ser Asn Ile Gly His Thr Gln Ala Ala Ser Gly Val Val
2165 2170 2175
Gly Val Ile Lys Met Ile Met Ala Met Arg His Gly Arg Leu Pro
2180 2185 2190
Ala Ser Leu His Ile Asp Glu Pro Ser Pro His Ile Asp Trp Thr
2195 2200 2205
Ser Gly Asn Val Gln Leu Leu Thr Glu Ala Ile Asp Trp Pro Glu
2210 2215 2220
Ala Asp Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Ala Ser
2225 2230 2235
Gly Thr Asn Ala His Val Ile Leu Glu Glu Ala Pro Pro Pro Pro
2240 2245 2250
Asp Pro Ala Pro Glu Pro Ala Ala Ala Pro Ala Ile Ala Gly Gly
2255 2260 2265
Val Val Pro Trp Pro Leu Ser Ala Arg Asp Glu Gln Ala Leu Arg
2270 2275 2280
Glu Gln Ala Ser Ala Leu Ala Glu His Leu Gly Thr Asp Asp Arg
2285 2290 2295
Ala Ser Val Ala Asp Val Gly Trp Ser Leu Ala Thr Thr Arg Ala
2300 2305 2310
Met Phe Glu Arg Arg Ala Val Ile Val Gly Glu Gly Arg Glu Glu
2315 2320 2325
Met Ala Ala Ala Leu Glu Ala Leu Ala Asp Gly Ser Pro His Pro
2330 2335 2340
Gly Leu Ser Thr Leu Gly Gly Thr Ala Ser Asp Thr Pro Gly Lys
2345 2350 2355
Thr Val Trp Leu Phe Ser Gly Gln Gly Ser Gln Arg Pro Gly Met
2360 2365 2370
Gly Ala Asp Leu Tyr Arg Arg Phe Pro Val Phe Ala Glu Ala Phe
2375 2380 2385
Asp Gly Val Arg Ala Leu Leu Asp Pro His Leu Asp His Pro Leu
2390 2395 2400
Ala Asp Val Val Phe Ala Thr Asp Pro Gly His Gly Asp Leu Ile
2405 2410 2415
His His Thr Thr Tyr Thr Gln Ala Gly Leu Phe Ala Leu His Ile
2420 2425 2430
Ala Leu Ala Arg Leu Leu Gly Asp Met Gly Leu Ala Pro Asp Ala
2435 2440 2445
Val Ala Gly His Ser Ile Gly Glu Ile Ser Ala Ala His Leu Ala
2450 2455 2460
Gly Val Leu Ser Leu Glu Asp Ala Ala Gln Leu Val Ala Ala Arg
2465 2470 2475
Ala Thr Leu Met Gly Gly Leu Pro Ser Gly Gly Ala Met Ala Thr
2480 2485 2490
Val Asn Ala Asp Glu Gln Glu Ile Thr Ala Thr Leu Ala Asp Tyr
2495 2500 2505
Pro Asp Leu Ala Ile Ala Ala Met Asn Thr Pro Ala His Thr Val
2510 2515 2520
Val Ser Gly Pro Ala Asp Gln Val Ala Ala Leu Thr Ala Ala Trp
2525 2530 2535
Arg Glu Arg Gly Arg Lys Thr Arg Ala Leu Pro Val Ser His Ala
2540 2545 2550
Phe His Ser Pro Gln Met Glu Pro Ile Leu Val Pro Phe Thr Glu
2555 2560 2565
Ala Ile Gly His Leu Ala Phe His Pro Pro Arg Ile Pro Leu Ile
2570 2575 2580
Ser Asn Leu Thr Gly Glu Pro Ala Gly Glu Asp Ile Ala Thr Pro
2585 2590 2595
Asp Tyr Trp Ala Arg His Ile Arg Arg Pro Val His Phe His Gln
2600 2605 2610
Ser Ile Thr His Leu Ala Glu Asp Thr Ala Val Phe Leu Glu Leu
2615 2620 2625
Gly Pro Ala Pro Val Leu Thr His Ala Val His His Thr Leu Pro
2630 2635 2640
Glu Glu Thr Thr Ala Thr Ala Leu Ala Thr Leu Thr Gly Lys Gln
2645 2650 2655
Pro Asp Val Pro Ala Leu Ala His Ser Leu Ala Ala Leu His Thr
2660 2665 2670
Ser His Ala Pro Val Asp Trp Thr Pro Trp Phe Arg Thr Asp Pro
2675 2680 2685
Ala Pro Arg Thr Val Gly Leu Pro Thr Tyr Arg Phe Gln Arg Arg
2690 2695 2700
Pro Tyr Trp Ile Ala Pro Arg Val Ser Gly Gly Ala Thr Pro Gly
2705 2710 2715
Gly Thr Gly Leu Asp His Pro Leu Leu Asp Thr Ala Ala Ala Leu
2720 2725 2730
Ala Asp Gly Gly Met Val Leu Thr Gly Ser Val Pro Pro Ala Asp
2735 2740 2745
His Asp Ser Trp Leu Thr Glu Arg Ala Ile Ala Gly Thr Val Val
2750 2755 2760
Leu Pro Gly Thr Ala Leu Leu Glu Leu Ala Leu Arg Cys Ala Glu
2765 2770 2775
Asp Thr Arg Ser Pro His Val Glu Glu Leu Leu Leu His His Pro
2780 2785 2790
Leu Thr Leu His Pro Thr Ala His Leu Asp Leu Gln Val Val Ile
2795 2800 2805
Gly Ala Ala Asp Asp Asp Ala Arg Arg Thr Leu His Leu Tyr Thr
2810 2815 2820
Arg Pro Gln Ser Asp Ser Ser Ala Glu Trp Thr Arg His Ala Thr
2825 2830 2835
Ala Thr Leu Thr Gly Glu Pro Thr Asp Asp Arg Pro Pro Ala Glu
2840 2845 2850
Gly Glu Ala Ala Trp Pro Pro Ala Gly Ala Glu Pro Val Asp Leu
2855 2860 2865
Thr Gly Phe Tyr Asp Arg Ala Ala Ser Asn Gly Tyr Ala Tyr Gly
2870 2875 2880
Pro Ser Leu Arg Gly Leu Gln Ala Leu Trp Arg His Gly Glu Asp
2885 2890 2895
Leu Leu Ala Asp Ile Ala Leu Pro Met Ala Asp Asp Ser Thr Asp
2900 2905 2910
Thr Leu Val Leu His Pro Ala Leu Leu Asp Ser Ala Leu His Pro
2915 2920 2925
Leu Leu Ala Val Met Asp Thr Ser Gly Asp Gln Val Trp Leu Pro
2930 2935 2940
Phe Ser Trp Ser Gly Val Thr Leu His Ala Thr Gly Ala Thr His
2945 2950 2955
Ala Arg Val Arg Val Thr Pro His Asp Asp His Glu His Arg Ile
2960 2965 2970
Ala Leu Thr Asp Thr Ala Gly Arg Pro Ile Leu Thr Ala Asn Ala
2975 2980 2985
Val Ala Val Arg Pro Thr Arg Leu Glu Ala Pro Gln Gln Pro Leu
2990 2995 3000
Ser Glu Gly Leu Phe Ser Leu Glu Trp Thr Pro Val Ser Thr Leu
3005 3010 3015
Ala Asp Arg Ser Asp Ala Ala Ala Pro Thr Pro Gly Val Val Leu
3020 3025 3030
Ala Lys Ala Pro Val Ala Glu Gly Glu Gly Gly Glu Leu Glu Ala
3035 3040 3045
Val Gln Arg Ala Leu Thr Leu Val Gln Asp Trp Leu Ala Glu Pro
3050 3055 3060
Arg Pro Asp Asp Ala Arg Leu Val Val Met Thr Arg Asp Ala Val
3065 3070 3075
Ala Val Asp Gly Glu Ala His Ile Asp Pro Val Ala Ala Ala Val
3080 3085 3090
Trp Gly Leu Ile Arg Ser Ala Gln Thr Glu Asn Pro Gly Arg Phe
3095 3100 3105
Val Leu Leu Asp Arg Asp Leu Asp Thr Glu Leu Asp Thr Asp Pro
3110 3115 3120
Val Leu Gly Pro Asp Ala Leu Ala Glu Ala Asp Gly Arg Val Ala
3125 3130 3135
Glu Ala Val Arg Cys Ala Leu Asp Leu Asp Glu Ser Gln Val Ala
3140 3145 3150
Leu Arg Gly Gly Arg Val Leu Val Pro Arg Leu Val Arg Ala Thr
3155 3160 3165
Ala Ser Ala Thr Leu Pro Gly Pro Val Asp Arg Arg Asn Trp Arg
3170 3175 3180
Leu Glu Ala Ala Thr Pro Ala Gly Ala Ala Ser Leu Asp Ala Val
3185 3190 3195
Ala Pro Val Pro Phe Pro Glu Ala Glu Glu Glu Pro Ala Ala Gly
3200 3205 3210
Arg Val Arg Ile Glu Val Arg Ala Ala Gly Val Thr Phe Arg Asp
3215 3220 3225
Val Leu Ile Ala Thr Gly Gly Val Pro Asp Glu Thr Arg Leu Gly
3230 3235 3240
Gly Glu Gly Ala Gly Val Val Leu Glu Val Gly Pro Asp Val Thr
3245 3250 3255
Asp Val Ala Pro Gly Asp Arg Val Met Gly Val Phe Asp Gly Ala
3260 3265 3270
Phe Gly Arg Val Ala Asp Ala Asp Ala Arg Met Val Thr Arg Met
3275 3280 3285
Pro Arg Thr Trp Asp Phe Thr Arg Ala Ala Gly Val Pro Val Ala
3290 3295 3300
Phe Leu Thr Ala Trp Tyr Gly Leu Val Glu Leu Ala Asp Leu Arg
3305 3310 3315
Ala Gly Glu Ser Val Leu Ile His Ala Ala Thr Gly Gly Val Gly
3320 3325 3330
Thr Ala Ala Val Gln Ile Ala Arg His Leu Gly Ala Asp Ala Tyr
3335 3340 3345
Ala Thr Ala Asp Pro Ala Glu His His Val Leu Glu Ala Met Gly
3350 3355 3360
Ile Asp Glu Ala His Arg Ala Ser Ser Arg Asp Leu Asp Phe Glu
3365 3370 3375
Asn Ala Phe Arg Ala Ala Thr Gly Gly Arg Gly Val Asp Val Val
3380 3385 3390
Leu Asn Ser Leu Thr Gly Asp His Ile Asp Asp Arg Thr Asp Ala
3395 3400 3405
Ser Leu Arg Leu Leu Ala Glu Gly Gly Arg Phe Leu Asp Pro Gly
3410 3415 3420
Arg Ala Asp Ala Arg Asp Pro Glu Gln Leu Ala Lys Asp Phe Pro
3425 3430 3435
Ala Val Asp Tyr Arg Val Tyr Asp Leu Val Pro Asp Ala Gly Pro
3440 3445 3450
Glu Arg Val Gln Pro Met Leu Ala Ala Leu Val Ala Leu Phe Asp
3455 3460 3465
Glu Gly Val Leu Ala Pro Leu Pro Val Arg Ala Trp Pro Leu Ala
3470 3475 3480
Arg Ala Arg Gln Ala Leu Arg His Met Ser Arg Ala Glu His Thr
3485 3490 3495
Gly Lys Leu Val Leu Thr Val Pro Pro Ala Leu Asp Pro Asp Gly
3500 3505 3510
Thr Val Leu Ile Thr Gly Gly Thr Gly Val Leu Ala Gly Leu Val
3515 3520 3525
Ala Glu His Leu Val Thr Thr His His Ile Thr His Leu His Leu
3530 3535 3540
Leu Ser Arg Arg Gly Pro Asp Ala Pro Gly Ala Thr Asp Leu Ala
3545 3550 3555
Thr Arg Leu Ala Glu Leu Gly Ala Thr Val His Ile Thr Ala Ala
3560 3565 3570
Asp Ala Ser Asp Pro Gly Ala Leu Arg Arg Val Val Asp Ala Ile
3575 3580 3585
Asp Pro Asp His Pro Leu Thr Gly Val Val His Thr Ala Gly Ile
3590 3595 3600
Val Glu Asp Ala Val Val Thr Ser Gln Thr Pro Asp Thr Leu Arg
3605 3610 3615
Arg Val Trp Thr Ala Lys Ala Thr Ser Ala Ala Asn Leu His Gln
3620 3625 3630
Ala Thr Lys His Leu Pro Leu Ala Met Phe Thr Leu Tyr Ser Ser
3635 3640 3645
Val Ser Gly Thr Leu Gly Asn Pro Gly Gln Ala Asn Tyr Ala Ala
3650 3655 3660
Ala Asn Ala Tyr Cys Asp Ala Leu Ala Ala Gln Arg Gln His Ala
3665 3670 3675
Gly Leu Pro Ala Thr Ser Ile Ala Trp Gly Leu Trp Ser Thr Ala
3680 3685 3690
Ser Asp Ile Thr Gly Gln Leu Ser Gln Ala Asp Val Ala Arg Met
3695 3700 3705
Gly Arg Ala Gly Val Arg Ala Leu Ala Thr Glu His Ala Leu Ala
3710 3715 3720
Leu Phe Asp Ala Ala His Arg Gln Gly Asp Pro Gln Leu Val Ala
3725 3730 3735
Leu Asn Leu Asp Val Pro Ala Leu Ala Ala Gln Pro Val Ala Ile
3740 3745 3750
Leu Pro Ala Ala Leu Arg Gly Leu Ala Thr Arg Ser Gly Gly Thr
3755 3760 3765
Thr Arg Arg Ala Ala Ala Ala Val Gln Arg Pro Asp Asp Trp Thr
3770 3775 3780
Arg Arg Leu Ala Gly Leu Pro Glu Ala Glu Gln Arg Gln Gln Leu
3785 3790 3795
Leu Thr Leu Val Arg Gly Asn Ala Ala Thr Val Leu Gly His Ala
3800 3805 3810
Asp Ser Glu Arg Val Arg Glu Glu Ala Pro Phe Lys Asp Leu Gly
3815 3820 3825
Phe Asp Ser Leu Thr Gly Val Glu Leu Arg Asn Arg Leu Ser Ala
3830 3835 3840
Ala Thr Gly Leu Arg Leu Pro Ala Ala Leu Val Phe Asp Phe Pro
3845 3850 3855
Ser Ala Lys Ser Leu Ala Asp Tyr Leu Arg Gly Arg Leu Val Ala
3860 3865 3870
Asp Gly Gly Ser Ala Ala Gln Ala Gly Val Asp Pro Val Leu Gly
3875 3880 3885
Glu Leu Ala Arg Leu Glu Ser Thr Leu Ser Ala Leu Asp Leu Pro
3890 3895 3900
Glu Ala Asp Ala Arg Ala Val Thr Asp Arg Leu Glu Gly Leu Leu
3905 3910 3915
Ala Gln Trp Lys Ala Ala Ser Ala Pro Pro Ala Glu Asp Asn Ala
3920 3925 3930
Ala Asp Arg Leu Thr Leu Ala Thr Ala Asp Glu Val Leu Ala Phe
3935 3940 3945
Ile Asp Asn Glu Leu Gly Thr Ser
3950 3955
<210>13
<211>3979
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Pro Asp Glu Glu Arg Leu Val Asp Tyr Leu Lys Arg Val Ala Thr
1 5 10 15
Asp Leu His Asp Thr Arg Arg Arg Leu Arg Glu Val Glu Glu Arg His
20 25 30
Gln Glu Pro Ile Ala Ile Thr Ala Met Thr Cys Arg Phe Pro Gly Gly
35 40 45
Val Asp Ser Pro Glu Ala Leu Trp Asp Leu Val Ala Ser Gly Gly Asp
50 55 60
Val Ile Gly Pro Phe Pro Ala Asp Arg Gly Trp Asp Leu Glu Gly Leu
65 70 75 80
Tyr His Pro Asp Pro Asp His Pro Gly Thr Thr Tyr Thr Arg Glu Gly
85 90 95
Gly Phe Leu Arg Asp Ala Asp Thr Phe Asp Ser Gly Phe Phe Glu Ile
100 105 110
Ser Pro Arg Glu Ala Leu Val Met Asp Pro Gln Gln Arg Lys Leu Leu
115 120 125
Glu Val Thr Trp Glu Leu Phe Glu Arg Ala Gly Leu Asp Ala Thr Ser
130 135 140
Leu Arg Gly Ser Arg Thr Gly Val Phe Ile Gly Ala Ala Thr Met Gly
145 150 155 160
Ser Gly Thr Pro Ser Gly Pro Ala Arg Lys Glu Ser Glu Gly Tyr Val
165 170 175
Gly Val Ala Pro Ser Met Leu Ser Gly Arg Leu Ser Tyr Thr Phe Gly
180 185 190
Leu Glu Gly Pro Ser Leu Thr Val Glu Thr Ala Cys Ser Ala Ser Leu
195 200 205
Val Ala Met His Gln Gly Ile His Ala Leu Arg Gln Gly Glu Cys Gly
210 215 220
Leu Ala Val Val Gly Gly Val Thr Ile Met Ser Ser Pro Ala Val Phe
225 230 235 240
Ile Gly Phe Ala Arg Gln Arg Gly Leu Ala Pro Asn Gly Arg Cys Lys
245 250 255
Pro Phe Ala Ala Gly Ala Asp Gly Thr Gly Trp Gly Glu Gly Ala Gly
260 265 270
Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Gln
275 280 285
Val Leu Ala Val Ile Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser
290 295 300
Asn Gly Phe Ser Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Arg
305 310 315 320
Gln Ala Leu Leu Asn Ala Arg Leu Ser Ser Ala Glu Val Asp Ala Val
325 330 335
Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro Ile Glu Ala Asp
340 345 350
Ala Leu His Ala Thr Tyr Gly Gln Arg Arg Pro Ala Asp Arg Pro Leu
355 360 365
Leu Leu Gly Ser Val Lys Ser Asn Ile Gly His Pro Gln Ala Ala Ala
370 375 380
Gly Val Ala Gly Val Ile Lys Thr Val Met Ala Ile Arg His Gly Leu
385 390 395 400
Phe Pro Ala Thr Leu His Ile Asp Glu Pro Thr Pro His Val Asp Trp
405 410 415
Gly Ser Gly Ala Ile Arg Leu Val Thr Glu Pro Val Glu Trp Pro Glu
420 425 430
Thr Asp His Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Val Ser Gly
435 440 445
Thr Asn Ala His Val Ile Ile Glu Gln Ala Pro Asp Pro Asp Thr Ser
450 455 460
Glu Thr Asp Ala Gly Glu Thr Asp Ala Gly Glu Thr Ala Arg Asp Ala
465 470 475 480
Glu Gly Ala Ala Pro Ala Arg Gln Ala Val Val Ala Gly Gly Val Val
485 490 495
Pro Trp Met Leu Ser Ala Arg Asp Glu Ala Ala Leu Ala Arg Gln Ala
500 505 510
Leu Arg Leu Ala Glu Val Ala Glu Gly Asp Pro Ala Ala Asp Val Thr
515 520 525
Asp Met Gly Trp Ser Leu Ala Thr Thr Arg Ala Arg Phe Glu His Arg
530 535 540
Ala Val Val Val Gly Thr Asp Arg Ala Thr Leu Leu Asp Gly Leu Ala
545 550 555 560
Lys Leu Ala Ala Asp Glu Pro Asp Pro Ala Val Val Thr Ala Thr Ala
565 570 575
Gly Pro Ile Gly Ala Gly Pro Val Phe Val Phe Pro Gly Gln Gly Ala
580 585 590
Gln Trp Pro Gly Met Ala Arg Glu Leu Leu Asp Ser Ser Pro Val Phe
595 600 605
Ala Ala Arg Ile Ala Glu Cys Glu Arg Ala Leu Gly Pro Tyr Val Asp
610 615 620
Trp Ser Leu Thr Glu Val Leu Arg Gly Thr Asp Pro Ala Thr Asp Pro
625 630 635 640
Gly Arg Asp Asp Val Ile Gln Pro Val Leu Trp Ala Ile His Val Ser
645 650 655
Leu Ala Ala Val Trp Gln Ser Phe Gly Ile Thr Pro Ala Ala Val Val
660 665 670
Gly His Ser Gln Gly Glu Val Ala Ala Ala Cys Val Ala Gly Ala Leu
675 680 685
Thr Leu Asp Asp Ala Ala Lys Val Ile Ala Leu Arg Val Gln Ala Leu
690 695 700
Arg Pro Leu Ile Gly His Gly Ala Met Ala Ser Leu Ser Leu Gly Ala
705 710 715 720
Glu Asp Thr Ala Arg Leu Leu Ala Glu Leu Gly Ala Ala Ala Gly Asp
725 730 735
Val Ala Val Ala Ala Val Asn Gly Pro His Ala Thr Val Val Ser Gly
740 745 750
Ser Pro Asp His Leu Asp Ala Val Leu Glu Thr Ala Arg Glu Arg Gly
755 760 765
Ala Arg Thr Arg Thr Ile Asp Val Glu Tyr Ala Ser His Gly Pro His
770 775 780
Val Asp Arg Ile Arg Asp Asp Ile Val Ser Ala Leu Arg Asp Val Thr
785 790 795 800
Pro Val Glu Ser Glu Ile Ala Phe Tyr Ser Thr Val Thr Ala Glu Arg
805 810 815
Leu Asn Thr Ala Glu Leu Gly Thr Glu Tyr Trp Phe Asp Asn Leu Arg
820 825 830
Arg Pro Val Arg Phe Ala Asp Ala Val Gly Arg Leu Leu Ala Asp Gly
835 840 845
Tyr Arg Ala Phe Ile Gln Cys Asn Pro His Pro Ile Leu Ser Thr Ser
850 855 860
Leu Gln Asp Ile Phe Glu Glu Ser Gly Thr Arg Ala Ala Ser Leu Ala
865 870 875 880
Thr Leu Arg Arg Asp His Gly Gly Ala His Gln Leu Ala Leu Ala Leu
885 890 895
Ala Gln Ala His Ala Ala Gly Val Glu Val Asp Trp Arg Pro Trp Phe
900 905 910
Pro Ala Asp Arg Thr Pro Arg Thr Val Glu Leu Pro Thr Tyr Pro Phe
915 920 925
Gln Gly Lys Arg Tyr Trp Ile Pro Val Gly Gly Ser Gly Ala Gly Asp
930 935 940
Val Ser Ala Ala Gly Leu Arg Ala Val Asp His Pro Leu Leu Ala Ala
945 950 955 960
Ala Val Ser Leu Pro Asp Gly Gly Met Val Leu Thr Gly Arg Leu Ser
965 970 975
Ala Thr Thr Gly Ala Gly Trp Leu Ala Asp His Val Val Gly Asp Thr
980 985 990
Thr Leu Leu Pro Gly Ala Ala Met Val Glu Trp Ala Leu Gln Ala Ala
995 1000 1005
His Glu Ala Gly Cys Ala Ala Val Glu Glu Leu Ala Leu Gln Thr
1010 1015 1020
Pro Phe Val Leu Pro Ala Ser Gly Ala Leu Arg Val Arg Val Ala
1025 1030 1035
Val Gly Pro Ala Asp Asp Glu Gly Arg Arg Thr Val Asp Val Tyr
1040 1045 1050
Ser Arg Pro Asp Glu Leu Asp Thr Glu Thr Pro Asp Gly Trp Val
1055 1060 1065
Cys His Ala Met Gly Val Leu Ala Pro Glu Ala Pro Glu Asp Arg
1070 1075 1080
Thr Ala Pro Pro Asp Ala Pro Ala Ala Pro Trp Pro Pro Arg Gly
1085 1090 1095
Ala Glu Pro Leu Asp Val Thr Asp Phe Tyr Glu Arg Ala Ala Ala
1100 1105 1110
Gly Gly Tyr Gly Tyr Gly Pro Ala Phe Arg Gly Leu Thr Ala Ala
1115 1120 1125
Trp Arg Asp Gly Ala Asp Leu Leu Ala Glu Ile Ala Leu Pro Glu
1130 1135 1140
Ala Ala Gly Glu Gly Ala Asp Arg Phe Gly Ile His Pro Ala Leu
1145 1150 1155
Leu Asp Ala Ala Thr His Pro Thr Ile Leu Gly Gly Gly Arg Glu
1160 1165 1170
Asp Gly Ser Asp Ala Gly Gln Val Trp Leu Pro Phe Ala Trp Ser
1175 1180 1185
Gly Val Ser Leu Trp Ala Thr Gly Ala Arg Arg Val Arg Val Arg
1190 1195 1200
Ile Phe Pro Glu Asp Asn Gly Gln Arg Ile Ser Leu Thr Asp Glu
1205 1210 1215
Thr Gly Ala Pro Val Leu Glu Ala Ala Ser Val Ala Ala Arg Pro
1220 1225 1230
Thr Gly Leu Ala Glu Leu Arg Ala Leu Gly Ala Arg Ala Ala Glu
1235 1240 1245
Gly Leu Phe Val Val Asp Trp Val Pro Ala Arg Gly Gly Thr Gly
1250 1255 1260
Asp Ala Pro Pro Pro Asp Asp Gly Gly Trp Ala Thr Val Gly Gly
1265 1270 1275
Gly Gly Val Arg Leu Ala Gly Val Ala Asp His Ala Asp Leu Gly
1280 1285 1290
Ala Leu Leu Ala Ala Val Asp Asp Gly Ala Pro Val Pro Thr Val
1295 1300 1305
Val Leu His Pro Val Pro Ala Thr Ala Thr Pro Asp Asp Gly Leu
1310 1315 1320
Ala Ala Val Gly Gly Val Leu Ala Leu Ile Arg Glu Trp Leu Ala
1325 1330 1335
Glu Pro Arg Trp Leu Asp Ser Arg Leu Val Leu Val Thr Ser Asp
1340 1345 1350
Ala Val Ser Ala Gly Asp Asp Glu Gly Ala Val Asp Pro Gly Gly
1355 1360 1365
Ala Ala Val Trp Gly Leu Val Arg Ser Val Gln Ala Glu His Pro
1370 1375 1380
Gly Arg Phe Thr Leu Leu Asp Val Gly Gly Asp Thr Asp Ala Asp
1385 1390 1395
Ala Gly Gly Gly Glu Ser Leu Ala Glu Ala Val Arg Arg Ser Ile
1400 1405 1410
Asp Ala Asp Glu Pro Gln Val Val Val Arg Ala Ala Gly Thr Leu
1415 1420 1425
Val Pro Arg Leu Val Arg Thr Ala Pro Ala Ala Glu Ala Asp Thr
1430 1435 1440
Pro Glu Leu Ser Gly Gly Thr Val Leu Val Ser Gly Gly Thr Gly
1445 1450 1455
Val Leu Gly Gly Ala Ala Ala Glu His Leu Val Arg Ala His Gly
1460 1465 1470
Val Glu Arg Val Leu Leu Leu Ser Arg Arg Gly Pro Asn Ala Pro
1475 1480 1485
Glu Ala Ala Glu Leu Val Arg Arg Leu Thr Ala Leu Gly Ala Gln
1490 1495 1500
Val Asp Val Ala Ala Val Asp Val Ala Asp Arg Ala Ala Leu Ala
1505 1510 1515
Glu Thr Leu Arg Thr Ile Pro Asp Ser His Pro Leu Leu Gly Val
1520 1525 1530
Val His Ser Ala Gly Val Thr Asp Asp Ala Leu Val Glu Ser Trp
1535 1540 1545
Asp Ala Asp Arg Leu Thr Arg Val Trp Glu Pro Lys Ala Thr Gly
1550 1555 1560
Ala Trp His Leu His Thr Leu Thr Arg Asp Leu Pro Leu Arg Met
1565 1570 1575
Phe Val Val Phe Ser Ser Ala Ala Gly Val Val Gly Asn Ser Gly
1580 1585 1590
Gln Ala Gly Tyr Ala Ala Ala Asn Ala Cys Thr Asp Ala Leu Ile
1595 1600 1605
Ala His Arg Arg Ala Ala Gly Leu Pro Gly Thr Ser Val Ala Trp
1610 1615 1620
Thr Leu Trp Glu Gln Ala Ser Ala Met Thr Glu His Leu Thr Glu
1625 1630 1635
Ala Asp Leu Ser Arg Leu Gly Thr Leu Gly Met Arg Pro Leu Ala
1640 1645 1650
Thr Ser Arg Ala Leu Gly Leu Leu Asp Ala Ala Leu His Val Thr
1655 1660 1665
His Pro Val Val Val Ala Ala Asp Leu Asp Ala Thr Arg Leu Gly
1670 1675 1680
Pro Asp Ser Pro Ala Met Leu Arg Ala Leu Ala Arg Pro Ala Arg
1685 1690 1695
Arg Arg Ala Met Glu His His Ala Thr Gly Pro Ala Leu Ala Gly
1700 1705 1710
Arg Leu Ala Gly Leu Asp Ala Thr Ala Arg Arg Asp Leu Leu Leu
1715 1720 1725
Gln Thr Val Arg Gln Met Val Thr Val Val Leu Gly His Ser Ser
1730 1735 1740
Asp Ala Ala Ile Arg Ala Glu Ala Ala Phe Lys Glu Leu Gly Phe
1745 1750 1755
Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Ala Gly Ala
1760 1765 1770
Thr Gly Leu Arg Leu Pro Ala Thr Leu Val Phe Asp Tyr Pro Thr
1775 1780 1785
Pro Leu Ala Leu Ala Asp His Leu Leu Glu Arg Leu Thr Ala Thr
1790 1795 1800
Ala Ser Pro Ala Ser Pro Arg Ala Val Pro Ser Arg Ala Gly Ala
1805 1810 1815
Ala Asp Glu Pro Ile Ala Val Val Ser Met Ala Cys Arg Phe Pro
1820 1825 1830
Gly Gly Val Thr Thr Pro Glu Glu Leu Trp Asp Leu Val Ala Ala
1835 1840 1845
Asp Arg His Val Leu Gly Pro Phe Pro Thr Asn Arg Gly Trp Asp
1850 1855 1860
Leu Ala Ash Leu Phe His Pro Asp Pro Asp His Pro Gly Thr Thr
1865 1870 1875
Tyr Ala Ser Glu Gly Ala Phe Met Tyr Asp Ala Asp Gly Phe Asp
1880 1885 1890
Ala Ala Phe Phe Gly Ile Asn Pro Arg Glu Ala Leu Ala Met Asp
1895 1900 1905
Pro Gln Gln Arg Val Leu Leu Glu Thr Ser Trp Glu Leu Leu Glu
1910 1915 1920
Arg Ala Gly Ile Asp Pro His Thr Leu Lys Asp Ser Leu Thr Gly
1925 1930 1935
Val Tyr Ala Gly Val Met Tyr His Asp Tyr Gly Asn Gly Leu Pro
1940 1945 1950
Pro Gly Asp Pro Arg Leu Asp Gly Tyr Ala Gly Leu Ser Gly Thr
1955 1960 1965
Ser Ser Ile Ile Ala Gly Arg Val Ala Tyr Thr Leu Gly Leu Gln
1970 1975 1980
Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val
1985 1990 1995
Thr Met His Leu Ala Ala Gln Ala Leu Arg Gln Gly Glu Cys Asp
2000 2005 2010
Leu Ala Leu Ala Gly Gly Val Thr Val Leu Ala Thr Pro Asp Val
2015 2020 2025
Phe Thr Gly Phe Ser Arg Gln Arg Gly Leu Ala Pro Asp Gly Arg
2030 2035 2040
Cys Lys Pro Phe Ala Ala Ala Ala Asp Gly Thr Gly Phe Gly Glu
2045 2050 2055
Gly Val Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala His Arg
2060 2065 2070
Asn Gly His Arg Val Leu Ala Val Leu Arg Gly Ser Ala Val Asn
2075 2080 2085
Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ala
2090 2095 2100
Gln Gln Arg Val Ile Arg Gln Ala Leu Ala Gly Ala Glu Leu Asp
2105 2110 2115
Pro Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Thr
2120 2125 2130
Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly
2135 2140 2145
Gln Asp Arg Pro Thr Asp Arg Pro Leu Trp Leu Gly Ser Ile Lys
2150 2155 2160
Ser Asn Ile Gly His Thr Gln Ala Ala Ala Gly Val Ala Gly Val
2165 2170 2175
Ile Lys Met Ile Met Ala Met Asp His Gly Arg Leu Pro Thr Ser
2180 2185 2190
Leu His Ile Asp Glu Pro Ser Pro His Ile Asp Trp Thr Thr Gly
2195 2200 2205
Asn Val Arg Leu Leu Thr Glu Pro Ala Asp Trp Pro Ala Thr Asp
2210 2215 2220
Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly Ala Ser Gly Thr
2225 2230 2235
Asn Ala His Leu Ile Leu Glu Gln Ala Pro Asp Arg Pro Gly Asp
2240 2245 2250
Gly Pro Ala Gly Asp Arg Pro Ala Pro Val Ala Val Ala Trp Pro
2255 2260 2265
Leu Ser Ala Arg Thr Asp Glu Ala Leu Arg Thr Val Ala Thr Ala
2270 2275 2280
Leu Ala Asp Arg Leu Gly Ala Asp Asp Thr Thr Pro Val Thr Asp
2285 2290 2295
Val Gly Trp Ser Leu Ala Thr Ala Arg Ala Thr Phe Glu Arg Arg
2300 2305 2310
Ala Val Ile Ile Gly Ser Asp Arg Gln Glu Met Thr Ala Ala Leu
2315 2320 2325
Asp Ala Leu Ala Arg Asp Leu Pro His Pro Asn Leu Val Ala Pro
2330 2335 2340
Leu Pro Val Ala Pro Pro Ala Gly Asp Thr Val Trp Leu Phe Ser
2345 2350 2355
Gly Gln Gly Ser Gln Arg Pro Gly Met Gly Ala Glu Leu His Glu
2360 2365 2370
Arg Phe Pro Ala Phe Ala Asp Thr Phe Asp Glu Ile Cys Ala Leu
2375 2380 2385
Ile Asp Pro His Leu Asp His Pro Leu Arg Asp Ile Val Phe Ala
2390 2395 2400
Thr His Pro Asp His Thr Asp Leu Leu Asn His Thr Thr Tyr Thr
2405 2410 2415
Gln Ala Gly Leu Phe Ala Val Gln Val Ala Leu Ala Arg Leu Leu
2420 2425 2430
Glu His Cys Gly Leu Arg Pro Asp Thr Val Ile Gly His Ser Ile
2435 2440 2445
Gly Glu Ile Thr Ala Ala His Ile Ala Gly Val Leu Ser Leu Gln
2450 2455 2460
Asp Ala Cys His Leu Val Ala Asn Arg Ala Thr Leu Leu Gly Lys
2465 2470 2475
Leu Pro Pro Gly Gly Ala Met Thr Ala Ile Glu Ala Thr Ala Glu
2480 2485 2490
Glu Ile Thr Gln Thr Leu Thr Pro Tyr His Gly Gln Val Thr Ile
2495 2500 2505
Ala Ala Leu Asn Ala Pro Thr Ser Thr Val Ile Ser Gly Pro Glu
2510 2515 2520
Glu Leu Val Ala Gln Leu Thr Arg Arg Trp Lys Glu Arg Gly Arg
2525 2530 2535
Arg Thr Lys Thr Leu Thr Val Ser His Ala Phe His Ser Pro Leu
2540 2545 2550
Met Glu Pro Ala Leu Asn Asp Phe Arg His Ala Ile Asp His Leu
2555 2560 2565
Thr Tyr His Gln Pro Thr Ile Pro Leu Ile Ser Asn Leu Thr Gly
2570 2575 2580
Glu Pro Ala Thr Gln Asp Ile Ala Thr Pro Asp Tyr Trp Val Arg
2585 2590 2595
His Ile Arg Gln Pro Val His Phe His Pro Ala Ile Thr His Ile
2600 2605 2610
Ala Pro His Thr Ala Ala Phe Leu Glu Ile Gly Pro Asp Ala Thr
2615 2620 2625
Leu Ile Pro Ala Thr Gln Asn Thr Leu Asp Thr Leu Glu Asn Gln
2630 2635 2640
Pro Thr Ser Ala Pro Gln Leu Ile Pro Thr Leu Thr Arg Lys Gln
2645 2650 2655
Pro Asp Thr Gln Ala Leu Ala His Ala Leu Ala Arg Leu His Thr
2660 2665 2670
Leu Thr Pro Leu Asn Trp His Pro Trp Tyr Thr Asp Gln Pro Thr
2675 2680 2685
Pro Thr Thr Ile Asp Leu Pro Thr Tyr Pro Phe Gln His Glu Arg
2690 2695 2700
Tyr Trp Leu Thr Pro Thr His Ala Gly Pro Thr Thr Pro Gly Ala
2705 2710 2715
Thr Pro Leu Thr His Pro Phe Leu Ala Ala Thr Ala Pro Leu Ala
2720 2725 2730
Asp Gly Gly Leu Leu Leu Thr Gly Gln Val Pro Ser Ala Asp His
2735 2740 2745
Ala Gly Trp His Thr Glu His Thr Ile Ala Gly Ala Thr Leu Leu
2750 2755 2760
Pro Ala Thr Ala Leu Leu Glu Ile Ala Leu His Ala Ala Asp His
2765 2770 2775
Thr Thr Thr Pro His Ile Asp Glu Leu Ile Leu Gln His Pro Leu
2780 2785 2790
Thr Leu Asp Pro Ser His Pro Leu Ala Leu Gln Ala Ile Val Ser
2795 2800 2805
Pro Ala Asp Asp Ser Gly His Arg Ala Leu His Ile Tyr Thr Arg
2810 2815 2820
Ala Pro Ser Ser Pro Thr Ala Glu Trp Thr His His Ala Thr Ala
2825 2830 2835
Thr Leu Gly Gly Glu Pro Thr Ala Glu Arg Pro Thr Thr Glu Ala
2840 2845 2850
Glu Ala Ala Trp Pro Pro Pro Gly Ala Lys Ala Val Asp Ile Thr
2855 2860 2865
Gly Phe Tyr Asp Arg Ala Ala Ala Asp Gly Tyr His Tyr Gly Pro
2870 2875 2880
Ser Tyr Gln Gly Leu Gln Thr Val Trp Arg Gln Gly Glu Asp Leu
2885 2890 2895
Leu Ala Asp Ile Thr Leu Pro Thr Ala Gly Thr Pro Asp His Thr
2900 2905 2910
Thr Asp Ser Leu Ala Ile His Pro Ala Leu Leu Asp Ala Ala Leu
2915 2920 2925
His Pro Leu Leu Ala Thr Ala Asp Asn Pro Asp Gly Glu Ile Trp
2930 2935 2940
Leu Pro Phe Thr Trp Ser Gly Val Thr Leu His Ala Thr Gly Ala
2945 2950 2955
Thr His Val Arg Ala Arg Ile Thr Pro Gln Gly Asp Asn Asp Tyr
2960 2965 2970
Arg Leu Thr Leu Thr Asp Ala Thr Gly Gln Thr Val Leu Thr Ala
2975 2980 2985
Gly Thr Ile Ala Ser Arg Pro Leu Asp Thr Ala Arg Leu Arg Thr
2990 2995 3000
Arg Gly Pro Gly Asp Gly Leu Tyr Gln Val Arg Trp Thr Ala Met
3005 3010 3015
Pro Ile Pro Ala Gly Ser Ala Thr Ala Val Ala Asp Asp Trp Ala
3020 3025 3030
Met Leu Gly Asp Ala Gly Leu Arg Asp Gly Gly Leu Ala Asp Ala
3035 3040 3045
Val Ala Pro Leu Ala Ser Tyr Pro Asp Val Ala Ala Leu Val Ala
3050 3055 3060
Ala Met Asp Asp Gly Thr Pro Val Pro Ser Val Val Leu Thr Gly
3065 3070 3075
Leu Ala Pro Ala Asp Gly Gly Asp Ala Asp Val Val Val Glu Val
3080 3085 3090
Leu Thr Thr Ala Arg Glu Trp Leu Ala Glu Pro Arg Leu Ala Glu
3095 3100 3105
Ser Arg Leu Val Val Val Thr His Asp Ala Ala Val Ala Glu Asp
3110 3115 3120
Thr Asp Ser Gly Pro Asp Gly Gly Asp Val Asp Pro Val Ala Ala
3125 3130 3135
Gly Val Trp Gly Leu Ile Arg Ser Ala Gln Ser Glu Asn Pro Gly
3140 3145 3150
Arg Phe Thr Leu Leu Asp Leu Thr Arg Arg Asp Ala Gly Thr Ala
3155 3160 3165
Pro Asp Val Val Glu Val Leu Arg Ala Ala Met Asp Ala Asp Glu
3170 3175 3180
Trp Gln Val Ala Val Arg Gly Gly Arg Ala Leu Val Pro Arg Leu
3185 3190 3195
Thr Ala Ala Asp Ala Ala Ala Gly Ile Val Leu Pro Val Gly Ala
3200 3205 3210
Pro Ala Trp Gln Leu Val Met Ala Asp Glu Arg Ala Gly Thr Val
3215 3220 3225
Asp Gly Leu Ala Pro Glu Glu Cys Pro Glu Val Leu Glu Pro Leu
3230 3235 3240
Ala Pro Gly Gln Val Arg Ile Ala Val Arg Ala Ala Gly Val Asn
3245 3250 3255
Phe Arg Asp Val Met Val Thr Leu Gly Val Val Pro Asp Arg Arg
3260 3265 3270
Gly Leu Gly Gly Glu Gly Ala Gly Met Val Leu Asp Val Ala Pro
3275 3280 3285
Asp Val Thr Ser Val Ala Val Gly Asp Arg Val Met Gly Leu Phe
3290 3295 3300
Gln Gly Ser Phe Gly Pro Ile Ala Val Ala Asp Ala Arg Ala Leu
3305 3310 3315
Val Pro Val Pro Pro Gly Trp Thr Asp Arg Gln Ala Ala Ala Val
3320 3325 3330
Pro Ile Ala Phe Leu Thr Ala Trp Tyr Gly Leu Ile Asp Leu Ala
3335 3340 3345
Gly Leu Lys Ala Gly Glu Ser Val Leu Ile His Ala Ala Thr Gly
3350 3355 3360
Gly Val Gly Thr Ala Ala Val Gln Ile Ala Arg His Leu Gly Ala
3365 3370 3375
Val Ile Tyr Ala Thr Ala Ser Pro Gly Lys His Pro Met Leu Glu
3380 3385 3390
Ala Met Gly Val Asp Glu Thr His Arg Ala Ser Ser Arg Asp Leu
3395 3400 3405
Asp Phe Glu His Ile Phe Arg Ala Ala Thr Gly Ser Glu Gly Met
3410 3415 3420
Asp Val Val Leu Asp Cys Leu Ala Gly Glu Phe Val Asp Ala Ser
3425 3430 3435
Leu Arg Leu Leu Gly Gln Gly Gly Arg Phe Ile Glu Met Gly Lys
3440 3445 3450
Thr Asp Ile Arg Asp Pro Glu Gln Ile Ala Asp Thr His Pro Gly
3455 3460 3465
Val His Tyr Arg Ser Tyr Asp Leu Val Ser Asp Ala Gly Leu Asp
3470 3475 3480
Arg Leu Ser Glu Met Leu Gly Thr Leu Ala Asp Leu Phe Ala Gln
3485 3490 3495
Gly Val Leu Thr Pro Pro Pro Val Gln Ala Trp Pro Leu Ala Arg
3500 3505 3510
Ala Arg Gln Ala Leu Arg His Met Ser Gln Ala Lys His Thr Gly
3515 3520 3525
Lys Leu Val Leu Asp Ile Pro Pro Ala Leu Asp Pro Asp Gly Thr
3530 3535 3540
Val Leu Ile Thr Gly Gly Thr Gly Thr Leu Gly Ala Leu Ile Ala
3545 3550 3555
Glu His Leu Val Thr Asn His His Ile Thr His Leu His Leu Leu
3560 3565 3570
Ser Arg Arg Gly Pro Asp Ala Pro Gly Ala Ala Glu Leu Thr Ala
3575 3580 3585
His Leu Thr Glu Leu Gly Ala Thr Val His Ile Thr Ala Thr Asp
3590 3595 3600
Thr Thr Asp Pro His Ala Leu Arg Gln Ala Leu Asp Thr Val Asp
3605 3610 3615
Pro Arg His Pro Leu Thr Ala Val Ile His Thr Ala Gly Ile Val
3620 3625 3630
Asp Asp Ala Val Ile Thr Ala Gln Thr Ala Asp Ser Leu His Arg
3635 3640 3645
Val Trp Ala Ala Lys Ala Thr Ser Ala Ala Asn Leu His Gln Ala
3650 3655 3660
Thr Glu His Leu Pro Leu Ala Met Phe Val Ile Phe Ser Ser Ala
3665 3670 3675
Ala Gly Thr Phe Gly Ser Pro Gly Gln Ala Asn Tyr Ala Ala Ala
3680 3685 3690
Asn Ala Tyr Cys Asp Ala Leu Ala Thr Arg Arg Arg His Ala Gly
3695 3700 3705
Leu Pro Ala Thr Ser Ile Ala Trp Gly Leu Trp Ala Ala Thr Ser
3710 3715 3720
Gly Met Thr Gly Gly Leu Thr Glu Ile Asp His Ala Arg Met Ser
3725 3730 3735
Arg Ser Gly Met Ala Pro Leu Pro Ser Glu His Ala Leu Ala Leu
3740 3745 3750
Phe Asp Ala Ala His Gly Leu Gly Ala Ala Arg Val Leu Ala Ala
3755 3760 3765
Arg Leu Asp Leu Ala Arg Leu Ser Ala Gln Pro Thr Glu Ala Leu
3770 3775 3780
Pro Pro Leu Val Arg Ser Leu Thr Gly Thr Gly Pro Arg Thr Ala
3785 3790 3795
Arg Arg Ser Ala Ala Ala Pro Val Ala Asp Leu Ser Gly Arg Leu
3800 3805 3810
Ala Ser Met Ala Pro Ala Gly Gln Leu Ala Leu Leu Leu Asp Leu
3815 3820 3825
Val Arg Thr His Ala Ala Thr Val Leu Gly His Met Asp Ser Gly
3830 3835 3840
Thr Val Ser Ala Asp Thr Pro Phe Lys Asp Leu Gly Phe Asp Ser
3845 3850 3855
Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Thr Thr Val Thr Gly
3860 3865 3870
Leu Arg Leu Ser Ala Ala Ser Val Phe Arg Tyr Pro Thr Ala Thr
3875 3880 3885
Ala Met Ala Glu His Leu Arg Gly Glu Leu Cys Pro Thr Gly Asp
3890 3895 3900
Asp Thr Ala Gln Pro Val Leu Arg Glu Leu Ala Arg Leu Glu Ala
3905 3910 3915
Ala Val Gly Glu Ser Lys Pro Glu Gly Glu Thr Ser Ala Gln Leu
3920 3925 3930
Val Lys Arg Leu Gln Thr Leu Leu Trp Arg Leu Gly Asp Glu Ala
3935 3940 3945
Ala Ala Val Asp His Thr Val Asp Gly Glu Glu Leu Glu Ser Ala
3950 3955 3960
Ser Asp Asp Glu Met Phe Ala Leu Ile Asp Gln Gln Leu Gly Ser
3965 3970 3975
Ser
<210>14
<211>1665
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Val Ser Thr Thr Glu Glu Lys Leu Arg Gln Tyr Leu Lys Arg Val Thr
1 5 10 15
Leu Asp Leu Gly Gln Ala Lys Gln Arg Leu Arg Glu Ala Glu Glu Arg
20 25 30
His Gln Glu Pro Ile Ala Ile Thr Ala Met Ala Cys Arg Tyr Pro Gly
35 40 45
Gly Val Arg Ser Pro Glu Ala Leu Trp Asp Leu Val Ala Thr Arg Thr
50 55 60
Asp Ala Ile Gly Pro Phe Pro Thr Asn Arg Gly Trp Asp Leu Glu Gly
65 70 75 80
Leu Phe His Pro Asp Pro Asp His Tyr Gly Thr Ser Tyr Val Arg Glu
85 90 95
Gly Gly Phe Leu His Asp Ala Glu Arg Phe Asp Ala Ser Phe Phe Asn
100 105 110
Ile Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Val Leu
115 120 125
Leu Glu Thr Ala Trp Glu Leu Leu Glu Arg Ala His Ile Asp Pro His
130 135 140
Ser Leu Lys Gly Thr Leu Thr Gly Val Tyr Thr Gly Val Ser Ser Gln
145 150 155 160
Asp Tyr Leu Ser Arg Ile Pro Arg Ile Pro Glu Gly Phe Glu Gly Tyr
165 170 175
Thr Ala Thr Gly Gly Leu Met Ser Val Val Ser Gly Arg Val Ala Tyr
180 185 190
Thr Leu Gly Leu Glu Gly Pro Ala Val Thr Leu Asp Thr Ala Cys Ser
195 200 205
Ala Ser Leu Val Ala Met His Leu Ala Gly Gln Ala Leu Arg Gln Gly
210 215 220
Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Phe Ser Thr Pro
225 230 235 240
Thr Ala Tyr Val Glu Phe Ser Arg Gln Arg Gly Phe Ala Pro Asp Ala
245 250 255
Arg Cys Lys Pro Phe Ala Ala Ala Ala Asp Gly Thr Gly Phe Ser Glu
260 265 270
Gly Val Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Gln Arg His
275 280 285
Gly Arg Arg Val Leu Ala Val Leu Arg Gly Ser Ala Val Asn Gln Asp
290 295 300
Gly Ala Ser Asn Gly Leu Ser Ala Pro Asn Asp Ala Ala Gln Glu Arg
305 310 315 320
Val Ile Arg Gln Ala Leu Asp Ser Ala Arg Leu Thr Ala Asp Gln Val
325 330 335
Asp Ala Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile
340 345 350
Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Lys Glu Arg Ser Ala Asp
355 360 365
Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr His
370 375 380
Ala Ala Ala Gly Val Ala Gly Val Ile Lys Met Val Met Ala Met His
385 390 395 400
His Gly Arg Leu Pro Ala Thr Leu His Val Asp Glu Pro Thr Ser His
405 410 415
Val Asp Trp Asp Thr Gly Thr Val Arg Leu Leu Thr Glu Pro Val Asp
420 425 430
Trp Pro Arg Gly Asp Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly
435 440 445
Ile Ser Gly Thr Asn Ala His Val Ile Leu Glu Glu Ala Ala Leu Pro
450 455 460
Pro Ala Ala Thr Gly Ala Glu Arg Pro Gly Asp Arg Leu Thr Pro Trp
465 470 475 480
Val Val Ser Ala Arg Gly Gln Ala Ala Leu His Asp Gln Ala Arg Arg
485 490 495
Leu Leu Asp Ala Thr Val Asp Gly Asp Pro Glu Ala Val Gly Trp Ser
500 505 510
Leu Val Ala Ser Arg Ala Val Phe Asp Gln Arg Ala Val Ile Thr Gly
515 520 525
Arg Asp Thr Glu Thr Leu Arg Ala Gly Leu Ala Ala Leu Ala Ala Gly
530 535 540
Glu Asp His Pro Ala Leu Val Arg Arg Glu Ala Gly Val Pro Ala Ser
545 550 555 560
Gly Ser Gln Val Trp Leu Phe Ser Gly Gln Gly Ser Gln Arg Pro Gly
565 570 575
Met Gly Ala Glu Leu His Glu Arg Phe Pro Ala Phe Ala Asp Thr Phe
580 585 590
Asp Glu Ile Cys Ala Leu Ile Asp Pro His Leu Asp His Pro Leu Arg
595 600 605
Asp Ile Val Phe Ala Thr His Pro Asp His Thr Asp Leu Leu Asn His
610 615 620
Thr Thr Tyr Thr Gln Ala Gly Leu Phe Ala Val Gln Val Ala Leu Ala
625 630 635 640
Arg Leu Leu Glu His Cys Gly Leu Arg Pro Asp Thr Val Ile Gly His
645 650 655
Ser Ile Gly Glu Ile Thr Ala Ala His Ile Ala Gly Val Leu Ser Leu
660 665 670
Gln Asp Ala Cys His Leu Val Ala Asn Arg Ala Thr Leu Leu Gly Lys
675 680 685
Leu Pro Pro Gly Gly Ala Met Thr Ala Ile Glu Ala Thr Ala Glu Glu
690 695 700
Ile Thr Gln Thr Leu Thr Pro Tyr His Gly Gln Val Thr Ile Ala Ala
705 710 715 720
Leu Asn Ala Pro Thr Ser Thr Val Ile Ser Gly Pro Glu Glu Leu Val
725 730 735
Ala Gln Leu Thr Arg Arg Trp Lys Glu Arg Gly Arg Arg Thr Lys Thr
740 745 750
Leu Thr Val Ser His Ala Phe His Ser Pro Leu Met Glu Pro Ala Leu
755 760 765
Asn Asp Phe Arg His Ala Ile Asp His Leu Thr Tyr His Gln Pro Thr
770 775 780
Ile Pro Leu Ile Ser Asn Leu Thr Gly Glu Pro Ala Thr Gln Asp Ile
785 790 795 800
Ala Thr Pro Asp Tyr Trp Val Arg His Ile Arg Gln Pro Val His Phe
805 810 815
His Pro Ala Ile Thr His Ile Ala Pro His Thr Ala Val Phe Leu Glu
820 825 830
Ile Gly Pro Asp Ala Thr Leu Ile Pro Ala Thr Gln Asn Thr Leu Asp
835 840 845
Thr Leu Asp Lys Gln Pro Ala His Pro Pro Gln Leu Ile Pro Thr Leu
850 855 860
Thr Arg Lys Gln Pro Asp Thr Gln Ala Leu Ala His Ala Leu Ala Arg
865 870 875 880
Leu His Thr Leu Thr Pro Leu Asn Trp His Pro Trp Tyr Thr Asp Gln
885 890 895
Pro Thr Pro Thr Thr Ile Asp Leu Pro Thr Tyr Pro Phe Gln Arg Glu
900 905 910
Arg Tyr Trp Leu Pro Asp Ala Leu Ala Asp Ala Pro Pro Pro Glu Ala
915 920 925
Asp Glu Glu Gln Val Arg Phe Trp Asn Ala Val Glu Ala Gln Asp Leu
930 935 940
Pro Ala Leu Ser Asp Thr Leu Gly Ile Gly Glu Glu Asp Gly Arg Arg
945 950 955 960
Ser Ser Leu Gly Ala Val Leu Pro Thr Leu Ser Arg Trp His Gln Glu
965 970 975
Arg His Glu Arg Ala Thr Val Ser Ser Trp Arg Tyr Arg Val Gly Trp
980 985 990
Arg His Leu Pro Asp Leu Gly Pro Ala Ala Val Ala Gly Pro Trp Leu
995 1000 1005
Leu Val Val Pro Pro Lys Gly Ala Asp Ala Trp Ala Asp Ala Cys
1010 1015 1020
Glu Arg Ala Leu Thr Ala Asp Gly Gly Glu Val Arg Arg Leu Val
1025 1030 1035
Thr Asp Gly Arg Ala Asp Val Ala Glu Leu Ala Ala Ser Leu Arg
1040 1045 1050
Ala Leu Tyr Ala Glu Gly Pro Ser Pro Ala Gly Val Leu Ser Leu
1055 1060 1065
Leu Pro Leu Asp Glu Arg Pro His Glu Ala Phe Pro Ala Val Thr
1070 1075 1080
Gly Gly Val Thr Gly Thr His Val Leu Leu Arg Ala Leu Leu Asp
1085 1090 1095
Ala Glu Leu Asp Ala Pro Leu Trp Cys Ala Thr Arg Gly Ala Val
1100 1105 1110
Ala Val Asp Asp Asp Glu Ala Pro Glu Ala Pro Ala Gln Ala Gln
1115 1120 1125
Val Trp Gly Leu Gly Arg Val Ala Ala Leu Glu His Pro Thr Ala
1130 1135 1140
Trp Gly Gly Leu Val Asp Leu Pro Ala Ser Val Ala Asp Leu Ala
1145 1150 1155
Pro Asp Leu Leu Cys Ala Val Leu Ala Gly Arg Asn Gly Glu Asp
1160 1165 1170
Gln Val Ala Leu Arg Pro Ala Gly Ala Phe Gly Arg Arg Leu Leu
1175 1180 1185
Pro Ala Pro Leu Asp Ala Gln Ala Pro Ala Gln Glu Arg Ala Trp
1190 1195 1200
Thr Pro Arg Asp Gly Val Leu Val Thr Gly Gly Val Ala Gly Ala
1205 1210 1215
Ala Ala Leu Val Ala Arg Trp Leu Ala Ala Asp Gly Thr Lys His
1220 1225 1230
Ile Val Leu Leu Ala Pro Asp Gly Pro Ala Ala Pro Gly Gly Ala
1235 1240 1245
Glu Leu Val Ala Glu Leu Ala Glu Leu Gly Ala Glu Ala Thr Val
1250 1255 1260
Val Asp Gly Val Pro Ser Glu Pro Thr Thr Arg Gln Glu Leu Ala
1265 1270 1275
Asp Arg Leu Ala Ala Ser Gly Leu Arg Val Arg Thr Val Val His
1280 1285 1290
Ala Gly Ala Pro Gly Asp Trp Ala Pro Leu Ala Glu Leu Thr Pro
1295 1300 1305
Asp Glu Leu Ala Glu Ala Leu Ser Asp Ala Met Gly Gly Ala Asp
1310 1315 1320
Arg Leu Ala Glu Leu Cys Gly Leu Glu Pro Asp Asp Pro Val Val
1325 1330 1335
Val Phe Ser Ser Ile Ala Ala Val Trp Gly Gly Gly Gly His Gly
1340 1345 1350
Ala Arg Ala Ala Ala Asp Ala Tyr Leu Asp Ala Trp Ala Arg Arg
1355 1360 1365
Arg Gln Ala Ala Gly Gly His Val Ala Arg Leu Ala Trp Gly Val
1370 1375 1380
Trp Asp Gly Ser Glu Asp Pro Glu Ala Ala Glu Arg Ala Glu Arg
1385 1390 1395
Gln Gly Leu Leu Ala Leu His Pro Thr Pro Ala Leu Ala Ala Leu
1400 1405 1410
Arg Arg Thr Leu Asp His Ser Gly Asp Gly Thr Asp Gln Gly Thr
1415 1420 1425
Val Arg Ala Gly Asp Glu Gly Gly Asp Arg Ser Asp Val His Ala
1430 1435 1440
Val Ile Ala Asp Val His Trp Asp Arg Phe Val Pro Leu Phe Thr
1445 1450 1455
Met Ala Arg Ala Ser Arg Leu Phe Asp Glu Ile Pro Ala Ala Arg
1460 1465 1470
Arg Ala Trp Gln Ala Ala Leu Asp Ser Ser Asp Asp Glu Ser Ser
1475 1480 1485
Glu Ser Leu Thr Ala Leu Arg Asp Arg Leu Ala Ala Gln Ser Pro
1490 1495 1500
Gln Ala Arg Thr Gly Thr Leu Leu Ala Leu Val Arg Ala His Val
1505 1510 1515
Ala Gly Ala Leu Arg Tyr Pro Ala Ala Glu Ser Val Asp Pro Glu
1520 1525 1530
Gln Pro Phe Lys Glu Leu Gly Phe Asp Ser Leu Ala Ala Val Glu
1535 1540 1545
Phe Arg Asn Arg Leu Arg Gly Ala Ile Gly Leu Thr Leu Pro Ala
1550 1555 1560
Thr Leu Val Phe Asp Tyr Pro Thr Pro Thr Ala Leu Ala Gly Tyr
1565 1570 1575
Leu Val Ser Gln Val Leu Pro Ala Glu Pro Ala Asp Glu Pro Ala
1580 1585 1590
Ala Ala His Leu Asp Glu Ile Glu Ala Thr Leu Ala Ala Leu Asp
1595 1600 1605
Ala Asp Asp Pro Arg Arg Ala Gly Leu Thr His Arg Leu Arg Leu
1610 1615 1620
Leu Leu Trp Arg Tyr Ala Asp Gly Asp Asp Ala Leu Glu Pro Arg
1625 1630 1635
Glu Glu Thr Gly Gly Asp Asp Leu Glu Thr Ala Ser Ala Asp Glu
1640 1645 1650
Met Phe Ala Leu Ile Asp Arg Glu Phe Gly Glu Ser
1655 1660 1665
<210>15
<211>460
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Val Arg Ser Met Thr Lys Val His Gly Arg Glu Ser Glu Leu Val Ala
1 5 10 15
Ile Ala Gly Asp Gln Arg Gly Glu Met Pro Ala Met Arg Val Leu Phe
20 25 30
Val Thr Ile Pro Trp Arg Thr His Phe Gln Phe Cys Val Pro Met Ala
35 40 45
Trp Ala Leu Arg Thr Ala Gly His Glu Val His Val Ala Ser Gly Pro
50 55 60
Asp Leu Thr Asp Val Ile Val Gln Ser Gly Leu Thr Ala Val Pro Val
65 70 75 80
Gly Ser Glu Glu His Phe Leu Glu Lys Ala His Gln Ala Gln Thr Glu
85 90 95
Ala Ser Glu Ala Thr Trp Gly Gly Ile His His Pro Ile Asp Leu Gly
100 105 110
Glu Asn Arg Glu Glu Met Phe Pro Leu Ser Tyr Leu Lys Ser Leu Cys
115 120 125
Ala Thr Ser Thr Glu Val Ala Lys Ala Val Asn Asp Ser Met Ile Asp
130 135 140
Asp Leu Val Ala Tyr Cys Arg Trp Trp Lys Pro Asp Leu Val Val Trp
145 150 155 160
Glu Trp Leu Ser His Ala Gly Ala Val Ala Ala Ala Ala Val Gly Ala
165 170 175
Ala His Ala Arg Met Pro Ile Gly Ile Glu Val Glu Ala Arg Met Arg
180 185 190
Arg His Phe Leu Lys Leu Leu Ala Gln Gln Glu Pro Ala Asp Arg Glu
195 200 205
Asp Pro Met Ala Glu Trp Leu Gly Ala Trp Gly Glu Lys His Gly Phe
210 215 220
Glu Phe Arg Glu Glu Leu Val Thr Gly Gln Phe Thr Ile Asp Gln Val
225 230 235 240
Pro Asp Ser Met Arg Leu Lys Ala Asn Val Pro Gln Val Ser Val Arg
245 250 255
His Val Pro Tyr Asn Gly Arg Ala Val Ala Pro Asp Trp Ile Arg Pro
260 265 270
Glu Pro Pro Ala Pro Arg Val Leu Ala Thr Phe Gly Met Ser Met Trp
275 280 285
Asp Met Ser Ser Ser Gln Pro Val Ser Ser Tyr Gln Met Val Ser Ile
290 295 300
Glu Gln Leu Gln Asp Trp Leu Asp Ser Met Ala Asp Leu Glu Met Glu
305 310 315 320
Leu Val Met Thr Leu Pro Gly Arg Ile Gln Glu Lys Leu Lys His Val
325 330 335
Pro Gln Asn Thr Arg Leu Val Asp Phe Val Pro Leu His Leu Val Ile
340 345 350
Pro Ser Cys Ala Ala Val Ile His His Gly Gly Leu Pro Ala Phe Cys
355 360 365
Ser Ser Leu Ala His Gly Val Pro Gln Leu Met Val Ser Arg Met Ala
370 375 380
Pro Asp Ala Ser Val Arg Gly Ala Arg Leu Glu Glu Ala Gln Ala Gly
385 390 395 400
Gly Trp Ile Pro Pro Glu Arg Met Thr Gly Lys Arg Ile Arg Asn His
405 410 415
Leu Ala Lys Leu Val Gln Asp Pro Ser Tyr Arg Ala Gly Ala Glu Arg
420 425 430
Leu Arg Gln Glu Val Leu Ala Gln Pro Ser Pro Asn Glu Val Val Pro
435 440 445
Glu Leu Glu Arg Leu Thr Glu Glu Leu Arg Ser Arg
450 455 460
<210>16
<211>305
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Ala Gln Gly Phe Gln Ala Ser Leu Gln Trp Glu Arg Ile Asn Glu
1 5 10 15
Leu Trp Val Thr Glu Glu Ala Ser Ala Asp Leu Thr Gly Phe Lys Ser
20 25 30
Asp Arg Arg Asn Phe Asn Ile Ala Leu Trp Asp Pro Thr Thr Asn Gly
35 40 45
Ile Arg Tyr Leu Arg Ala Leu Val Tyr Glu Leu Ala Thr Arg Leu Ser
50 55 60
Asp Asp Asp Trp Ser Lys Ile Glu Lys Val Arg Asn Arg Asp Val Gly
65 70 75 80
Asp Pro Val Thr Val Arg Tyr Glu Gly Arg Thr Val Cys Leu Asp Tyr
85 90 95
Leu Gln Ala Ala Leu Glu Leu Gly Phe Ile Glu Lys Glu Leu Asp Leu
100 105 110
Gly Gly Ala Arg Val Leu Glu Ile Gly Ala Gly Tyr Gly Arg Thr Cys
115 120 125
His Ala Met Leu Ser Asn Tyr Asp Leu Ala Ser Tyr Thr Ile Val Asp
130 135 140
Leu Lys Asn Thr Leu Gly Leu Ser Arg Ala Tyr Leu Arg Glu Val Leu
145 150 155 160
Asp Glu Lys Gln Phe Ser Lys Met Arg Phe Val Gln Val Glu Asp Ile
165 170 175
Asp Thr Gly Leu Gly Pro Asp Gly Phe Asp Leu Cys Val Asn Val His
180 185 190
Ser Phe Thr Glu Met Thr Pro Asp Thr Val Lys Ala Tyr Leu Arg Leu
195 200 205
Ile Asp Glu Arg Cys Gly Ala Phe Phe Val Lys Asn Pro Val Gly Lys
210 215 220
Phe Arg Asp Lys Ser Met Asp Gly His Gln Lys Gly Glu Glu Ala Val
225 230 235 240
Arg Leu Ala Met Gln Thr Gly Pro Leu Arg Gln Val Leu Asp Ile His
245 250 255
Asp Ser Gln Ala Val Ala Ala Ala Val Pro Ala Phe Ile Glu Ala Tyr
260 265 270
Gln Pro Gly Glu Gly Trp Thr Cys Ala Ala Asn Thr Arg Gly Met Pro
275 280 285
Trp Ser Tyr Phe Trp Gln Ala Leu Tyr Thr Lys Thr Gly Asp Asp Leu
290 295 300
Arg
305
<210>17
<211>346
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Ala Ala Ser Pro Val Pro Pro Pro Arg Gly Asp Glu Ala Leu Ala
1 5 10 15
Gly Thr Pro Val Leu Val Leu Gly Gly Ser Gly Tyr Leu Gly Arg His
20 25 30
Ile Cys Ser Ala Phe Gly Ala Ala Gly Ala Gln Val Val Pro Val Ser
35 40 45
Arg Gly Ala Arg Gly Gly Val Asp Gly Asp Gly Cys Arg Ser Val Arg
50 55 60
Leu Asp Leu Thr Ala Ala Gly Pro Asp Glu Leu Ala Arg Leu Cys Ala
65 70 75 80
Gly Thr Gly Ala Arg Val Leu Val Asn Ala Ser Gly Ala Val Trp Gly
85 90 95
Gly Gly Glu Arg Gln Met Ala Glu Ala Asn Thr Glu Leu Val Gly Arg
100 105 110
Leu Ala Gly Ala Val Ala Arg Leu Pro Gly Arg Pro Arg Leu Ile His
115 120 125
Leu Gly Ser Ala Tyr Glu Tyr Gly Pro Ala Arg Pro Gly Thr Ala Ile
130 135 140
Ala Glu Asp Trp Pro Pro Ala Pro Thr Thr Val Tyr Gly Arg Thr Lys
145 150 155 160
Leu Ser Gly Ser Gln Ala Val Leu Arg Ala Ala Ala Glu Leu Gly Val
165 170 175
Ala Gly Thr Val Leu Arg Val Ser Val Ala Cys Gly Pro Gly Ala Pro
180 185 190
Val Ser Ser Leu Ala Gly Ala Val Ala Ala His Leu Ala Ala Gly Arg
195 200 205
Asp Glu Leu Arg Leu Ala Pro Leu Arg Asp His Arg Asp Leu Val Asp
210 215 220
Val Arg Asp Val Ala Asp Ala Val Val Ala Ala Ala Val Ala Pro Val
225 230 235 240
Ala Ala Val Thr Gly Thr Val Val Asn Ile Gly Ser Gly Gln Ala Val
245 250 255
Pro Val Arg Arg Leu Val Asp Leu Met Ile Ala Leu Ser Gly Arg Pro
260 265 270
Val Arg Val Ile Glu Asp Pro Ala Leu Arg Arg Thr Arg Ser Asp Ala
275 280 285
Ala Trp Gln Arg Leu Asp Ile Gly Arg Ala Arg Arg Leu Leu Gly Trp
290 295 300
Ala Pro Arg Arg Thr Leu Arg Glu Ser Leu Arg Asp Leu Leu Ala Ala
305 310 315 320
Val Gly Ala Pro Gln Pro Ala Ala Val Arg Ala Ala Thr Ala Ile Gly
325 330 335
Pro Arg Asn Ser His Gly Lys Asp Ser Arg
340 345
<210>18
<211>434
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Ser Glu Arg Val Ala Arg Ile Leu Asp Glu Val Arg Lys Tyr His
1 5 10 15
Gln Asp Ser Gln Glu Gly Arg Gly Phe Ile Pro Gly Val Thr Glu Ile
20 25 30
Trp Pro Ser Gly Ala Val Leu Asp Glu Asp Asp Arg Val Ala Leu Val
35 40 45
Gln Ala Ala Leu Glu Met Arg Ile Ala Ala Gly Lys Leu Ser Arg Lys
50 55 60
Phe Glu Ser Ala Phe Ala Arg Arg Met Lys Arg Arg Lys Ala His Leu
65 70 75 80
Thr Asn Ser Gly Ser Ser Ala Asn Leu Leu Ala Ile Ser Ala Leu Thr
85 90 95
Ser His Leu Leu Gly Glu Arg Arg Leu Arg Pro Gly Asp Glu Val Ile
100 105 110
Thr Val Ala Ala Ser Phe Pro Thr Thr Val Asn Pro Ile Leu Gln Asn
115 120 125
Gly Leu Val Pro Val Tyr Val Asp Val Glu Leu Gly Thr Tyr Asn Ala
130 135 140
Thr Ala Glu Arg Val Ala Glu Ala Ile Gly Pro Arg Thr Arg Ala Ile
145 150 155 160
Met Met Ala His Thr Leu Gly Asn Pro Phe Gln Ala Thr Glu Met Ala
165 170 175
Arg Leu Ala Gln Asp His Asp Leu Ile Leu Ile Glu Asp Ser Cys Asp
180 185 190
Ala Val Gly Ser Thr Tyr Asp Gly Arg Pro Ala Gly Thr Phe Gly Asp
195 200 205
Leu Thr Thr Val Ser Phe Tyr Pro Ala His His Leu Thr Met Gly Glu
210 215 220
Gly Gly Cys Val Leu Thr Ser Asn Leu Val Leu Ala Arg Ile Val Glu
225 230 235 240
Ser Leu Arg Asp Trp Gly Arg Asp Cys Trp Cys Glu Pro Gly Glu Ser
245 250 255
Asp Thr Cys Arg Lys Arg Phe Gly Tyr Gln Met Gly Thr Leu Pro Ala
260 265 270
Gly Tyr Asp His Lys Tyr Ile Phe Ser His Ile Gly Tyr Asn Leu Lys
275 280 285
Ser Thr Asp Leu Gln Ala Ala Leu Gly Leu Thr Gln Leu Asp Lys Leu
290 295 300
Asp Ala Phe Cys Ser Ala Arg Arg Ser Asn Trp Arg Arg Leu Arg Glu
305 310 315 320
Gly Leu Asp Gly Leu Pro Trp Leu Ile Leu Pro Glu Ala Thr Pro Arg
325 330 335
Ser Asp Pro Ser Trp Phe Gly Phe Val Leu Thr Val Asp Pro Arg Ala
340 345 350
Pro Phe Ser Arg Ala Glu Leu Val Asp Phe Leu Glu Ser Arg Lys Ile
355 360 365
Gly Thr Arg Arg Leu Phe Ala Gly Asn Leu Thr Arg His Pro Ala His
370 375 380
Ala Glu Ala Pro His Arg Val Cys Gly Asp Leu Ala Asn Ser Asp Thr
385 390 395 400
Val Thr Glu His Thr Phe Trp Val Gly Val Tyr Pro Gly Leu Thr Glu
405 410 415
Glu Met Ile Asp Phe Met Val Ser Ser Ile Thr Glu Phe Ile Gly Ser
420 425 430
His Arg
<210>19
<211>331
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Lys Leu Leu Val Thr Gly Ala Ala Gly Phe Ile Gly Ser Thr Tyr
1 5 10 15
Ala Arg Arg Leu Leu Ala Arg Gly Gly Ala Glu Trp Gly Pro Asp Val
20 25 30
Ser His Val Thr Val Leu Asp Lys Leu Thr Tyr Ala Gly Thr Leu Ser
35 40 45
Asn Leu Asp Thr Ala Asp Pro Arg Leu Thr Phe Val His Gly Asp Ile
50 55 60
Cys Asp Ala Asp Leu Val Asp Thr Leu Met Ala Arg Ala Asn Gln Val
65 70 75 80
Val His Phe Ala Ala Glu Ser His Val Asp Arg Ser Ile Thr Gly Ala
85 90 95
Asp Pro Phe Val Arg Thr Asn Val Glu Gly Thr His Thr Leu Leu Gln
100 105 110
Ala Ala Leu Arg His Gly Val Glu Arg Phe Val His Val Ser Thr Asp
115 120 125
Glu Val Tyr Gly Ser Val Glu Thr Gly Phe Ser Pro Glu Thr Ala Val
130 135 140
Leu Asp Pro Asn Ser Pro Tyr Ala Ala Ser Lys Ala Ala Ser Asp Leu
145 150 155 160
Ile Ala Leu Ala Tyr His Arg Thr His Gly Leu Asp Val Arg Val Thr
165 170 175
Arg Cys Ser Asn Asn Tyr Gly Pro His Gln Phe Pro Glu Lys Ile Ile
180 185 190
Pro Leu Phe Ile Thr Asn Leu Leu Asp Gly Glu Asp Val Pro Leu Tyr
195 200 205
Gly Asp Gly Leu Asn Val Arg Asp Trp Leu His Val Glu Asp His Cys
210 215 220
Arg Gly Val Glu Leu Val Arg Thr Lys Gly Ser Pro Gly Glu Ile Tyr
225 230 235 240
Asn Ile Gly Gly Gly Thr Ala Leu Thr Asn Arg Glu Leu Thr Gly Arg
245 250 255
Leu Leu Asp Ala Cys Gly Ala Gly Trp Ser Arg Val Arg Tyr Val Glu
260 265 270
Asp Arg Lys Gly His Asp Arg Arg Tyr Ala Val Gln Asp Asp Lys Ala
275 280 285
Arg Asp Glu Leu Gly Tyr Arg Pro Arg His Asp Phe Ala Ala Gly Leu
290 295 300
Ala Glu Thr Val Ala Trp Tyr Arg Asp Asn Arg Pro Trp Trp Glu Pro
305 310 315 320
Leu Lys Arg Ser Ala Leu Thr Gly Gly Ser Arg
325 330
<210>20
<211>302
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Lys Gly Ile Ile Leu Ala Gly Gly Asn Gly Thr Arg Leu Gln Pro
1 5 10 15
Leu Thr Leu Ala Gly Ser Lys Gln Leu Val Pro Val Tyr Asp Lys Pro
20 25 30
Met Ile Tyr Tyr Pro Leu Ser Val Leu Met Phe Ala Gly Ile Arg Asp
35 40 45
Ile Leu Ile Ile Ser Arg Pro Thr Glu Leu Pro Gln Phe Arg Gln Leu
50 55 60
Phe Gly Asp Gly Arg Arg Leu Gly Leu Asn Leu Ser Tyr Ala Ser Gln
65 70 75 80
Glu Lys Pro Arg Gly Ile Ala Asp Ala Phe Arg Ile Gly Ala Asp His
85 90 95
Ile Arg Gly Glu Glu Cys Ala Leu Ile Leu Gly Asp Asn Leu Phe His
100 105 110
Gly Ala Asn Leu Pro Ala Leu Leu Arg Arg Ser Val Gln Arg Leu Arg
115 120 125
Gly Cys Val Leu Phe Gly His Glu Val Ala Asp Pro Arg His Phe Gly
130 135 140
Val Ala Glu Ile Asp Glu Arg Gly Arg Leu Leu Ser Ile Glu Glu Lys
145 150 155 160
Pro Glu Arg Pro Arg Ser Asn Leu Ala Ile Pro Gly Leu Tyr Leu Phe
165 170 175
Asp Gly Gly Val Val Asp Val Ala Lys Arg Leu Val Pro Ser Ala Arg
180 185 190
Gly Glu Leu Glu Ile Thr Asp Val Leu Arg Ala Tyr Leu Glu Glu Gly
195 200 205
Thr Ala Asp Leu Val Trp Leu Gly Arg Gly Val Thr Trp Leu Asp Thr
210 215 220
Gly Thr His Glu Thr Leu Leu Asp Ala Gly Arg Met Val Arg Asp Val
225 230 235 240
Gln His Tyr Gln Gly Thr Arg Leu Gly Cys Val Glu Glu Ile Ala Met
245 250 255
Tyr Met Gly Phe Ile Gly Ala Asp Glu Cys Tyr Thr Leu Gly Thr Glu
260 265 270
Met Ser Asn Ser Pro Tyr Gly Gln Tyr Val Met Asp Arg Ala Arg Ser
275 280 285
Tyr Arg Arg Gly Leu Pro Pro Ala Arg Phe Leu Glu Glu Ile
290 295 300
<210>21
<211>1646
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Ala Asn Asp Asp Lys Lys Leu Leu Asp Tyr Leu Lys Arg Val Thr
1 5 10 15
Ala Asp Leu Arg Glu Ala Gln Arg Arg Leu Lys Asp Val Glu Tyr Ala
20 25 30
Arg His Glu Pro Val Ala Ile Ile Gly Ile Gly Cys Arg Phe Pro Gly
35 40 45
Asp Ala Gly Ser Pro Glu Asp Leu Trp Asp Leu Val Ala Ala Gly Arg
50 55 60
Glu Gly Thr Gly Gly Leu Pro Ala Asp Arg Gly Trp Asp Leu Glu Ala
65 70 75 80
Leu Tyr Asp Pro Glu Pro Gly His Pro Gly Thr Ser Tyr Val Arg Glu
85 90 95
Gly Gly Phe Leu Arg Asp Ala Ala Arg Phe Asp Ala Ala Phe Phe Gly
100 105 110
Ile Ser Pro Asn Glu Ala Val Thr Met Ala Pro Gln Gln Arg Leu Ala
115 120 125
Leu Glu Leu Ala Trp Glu Ala Val Glu His Ala Arg Ile Asp Pro His
130 135 140
Thr Leu Arg Ser Ser Ala Thr Gly Thr Tyr Leu Gly Cys Asp Gly Leu
145 150 155 160
Asp Tyr Phe Leu Asn Ser Phe Gln Val Pro Glu Gly Ala Ala Gly Gln
165 170 175
Leu Thr Thr Gly Asn Ser Pro Ser Val Val Ala Gly Arg Val Ser Tyr
180 185 190
Thr Leu Gly Leu Glu Gly Ala Ala Val Thr Leu Asp Thr Ala Cys Ser
195 200 205
Ser Ala Leu Val Ala Ile His Leu Ala Cys Gln Ala Leu Arg Glu Glu
210 215 220
Glu Val Thr Leu Ala Leu Ala Gly Gly Val Tyr Val Met Ser Ser Pro
225 230 235 240
Ala Pro Leu Val Gly Phe Ser Glu Leu Arg Ala Leu Ala Pro Asp Gly
245 250 255
Arg Ala Lys Pro Phe Ser Ala Asp Ala Asp Gly Met Asn Leu Ala Glu
260 265 270
Gly Ala Gly Val Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn
275 280 285
Gly His Glu Val Leu Ala Val Ile Arg Gly Ser Ala Val Asn Glu Asp
290 295 300
Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg
305 310 315 320
Val Ile Gln Gln Ala Leu Ala Asn Ala Arg Leu Ala Pro Ala Asp Val
325 330 335
Asp Ala Val Glu Ala His Gly Thr Gly Thr Ser Leu Gly Asp Pro Ile
340 345 350
Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Gln Asp Arg Pro Glu Gly
355 360 365
Arg Pro Leu Trp Leu Gly Ser Ile Lys Ser Asn Ile Gly His Thr Gln
370 375 380
Ile Ala Ala Gly Ala Ala Gly Val Ile Lys Met Val Met Ala Leu Arg
385 390 395 400
His Gly Met Leu Pro Arg Ser Leu His Ile Asp Glu Pro Thr Pro His
405 410 415
Val Ala Trp Asp Thr Gly Gly Val Arg Leu Leu Asp Gln Ala Val Glu
420 425 430
Trp Pro Arg Ala Glu Arg Pro Arg Arg Ala Gly Val Ser Ala Phe Gly
435 440 445
Phe Ser Gly Thr Asn Ala His Leu Ile Leu Glu Glu Ala Pro Glu Pro
450 455 460
Ala Pro Pro Ala Asn Glu Ser Ala Glu Pro Ala Glu Pro Ala Ala Leu
465 470 475 480
Val Thr Pro Trp Val Leu Ser Ala Arg Ser Ala Ala Ala Leu Arg Arg
485 490 495
Gln Ala Arg Arg Leu Leu Asp Ala Ala Thr Asp Gly Asp Pro Arg Ala
500 505 510
Val Gly Trp Ser Leu Val Thr Thr Arg Ser Val Phe Glu His Arg Ala
515 520 525
Val Val Thr Gly Pro Asp Gly Ala Thr Leu Arg Glu Arg Leu Ala Ala
530 535 540
Leu Ala Ala Gly Glu Pro Ala Pro His Val Val Thr Gly Ala Pro Gly
545 550 555 560
Ala Pro Gly Ala Gly Val Val Leu Val Phe Pro Gly Gln Gly Ser Gln
565 570 575
Trp Pro Gly Met Gly Ala Glu Leu Leu Asp Ala Ser Pro Val Phe Ala
580 585 590
Ala Arg Ile Ala Glu Cys Glu Ala Ala Leu Ala Pro Tyr Val Asp Trp
595 600 605
Ser Leu Thr Glu Val Leu Arg Gly Ala Asp Gly Ala Ala Asp Leu Gly
610 615 620
Arg Val Asp Val Val Gln Pro Val Leu Trp Ala Leu Met Val Ser Leu
625 630 635 640
Ala Ala Leu Trp Ala His His Gly Val Arg Pro Ala Ala Val Val Gly
645 650 655
His Ser Gln Gly Glu Ile Ala Ala Ala Cys Val Ala Gly Ala Leu Thr
660 665 670
Leu Glu Asp Gly Ala Arg Val Val Ala Leu Arg Ser Gln Ala Leu Arg
675 680 685
Ala Leu Ala Gly Arg Gly Ala Met Ala Ser Leu Gly Val Asp Pro Asp
690 695 700
Thr Ala Glu Arg Leu Val Ala Glu Val Gly Glu Lys Ala Ala Gly Val
705 710 715 720
Gly Val Ala Ala Leu Asn Ser Pro Ser Ser Thr Val Val Ser Gly Pro
725 730 735
Pro Asp Ala Val Ala Ala Val Val Ala Ala Cys Glu Ala Thr Gly Ala
740 745 750
Arg Ala Arg Thr Ile Asp Val Asp Tyr Ala Ser His Gly Pro Gln Val
755 760 765
Asp Glu Ile Ala Asp Glu Val Thr Ala Arg Leu Ala Gly Val Gly Gly
770 775 780
Gln Ala Thr Asp Val Ala Phe Tyr Ser Thr Val Thr Gly Gly Pro Met
785 790 795 800
Asp Thr Ala Thr Ala Leu Asp Ala Gly Tyr Trp Leu Thr Asn Leu Arg
805 810 815
Arg Pro Val Arg Leu Thr Glu Ala Val Asp Ala Leu Leu Thr Ala Gly
820 825 830
His Arg Val Phe Ile Glu Val Ser Thr His Pro Val Val Val Pro Ala
835 840 845
Leu Gln Gln Cys Phe Glu Ser Ala Gln Val Ala Ala Val Ala Met Gly
850 855 860
Thr Leu Arg Arg Asp Glu Gly Gly Pro Ala Gln Leu Ala Thr Ala Leu
865 870 875 880
Ala Gln Ala Phe Thr Ser Gly Val Pro Val Asp Trp Arg Pro Trp Phe
885 890 895
Asp Gly Pro Pro Ala Pro Arg Thr Thr Ala Leu Pro Thr Tyr Ala Phe
900 905 910
Asp Arg Glu Arg Tyr Trp Leu Pro Pro Ala Arg Ala Val Arg Gly Asp
915 920 925
Gly Ala Gln Asp Pro Ala Glu Ala Glu Leu Trp Gly Ala Ile Glu Asp
930 935 940
Leu Asp Thr Glu Ala Leu Ala Arg Val Leu Glu Pro Asp Gly Thr Ala
945 950 955 960
Glu Asp Ile Glu Ala Leu Arg Pro Ala Leu Pro Val Leu Ser Thr Trp
965 970 975
Arg Arg Arg His Arg Glu Arg Thr Thr Leu Glu Ser Trp Arg His Gln
980 985 990
Ile Arg Trp Thr Pro Leu Pro Asp Pro Ala Pro Pro Ala Val Ser Ser
995 1000 1005
Thr Trp Leu Leu Leu Val Pro Thr Gly Tyr Glu Asp His Pro Ala
1010 1015 1020
Val Arg Thr Val Thr Gly Ala Leu Thr Gly His Gly Ala Glu Val
1025 1030 1035
Arg Pro Cys Pro Ala Asp Pro Ser Val Thr Ser Arg Ala Glu Leu
1040 1045 1050
Ala Glu Arg Leu Thr Gly Leu Arg Gly Asp Asp Ala Pro Ala Gly
1055 1060 1065
Val Val Ser Leu Leu Ala Leu Asp Glu Arg Pro Arg Ala Asp His
1070 1075 1080
Pro Ala Val Pro Ala Gly Leu Ala Ala Thr Val Ala Ala Leu Gln
1085 1090 1095
Ala Leu Gly Asp Ala Gly Ile Thr Ala Pro Leu Trp Cys Val Thr
1100 1105 1110
Gln Gly Ala Val Ser Thr Gly His Asp Asp Pro Leu Ser His Pro
1115 1120 1125
Leu Gln Ala Gln Thr Trp Gly Leu Gly Arg Val Ala Ala Leu Glu
1130 1135 1140
His Pro Asp Arg Trp Gly Gly Leu Val Asp Leu Pro Ala Glu Pro
1145 1150 1155
Asp Glu Arg Thr Glu Ala Gln Leu Ala Ala Leu Leu Ala Gly Ala
1160 1165 1170
Pro Thr Gly Gly Gln Ala Glu Asp Gln Val Ala Ile Arg Ala Gly
1175 1180 1185
Gly Gly Ala Leu Ala Arg Arg Leu Val His Thr Thr Ser Ala Arg
1190 1195 1200
Thr Ala Asp Gln Ala Trp Gln Pro Arg Gly Ser Val Leu Ile Thr
1205 1210 1215
Gly Gly Thr Gly Gly Val Gly Ala Leu Leu Ala Arg Trp Ala Ala
1220 1225 1230
Glu Arg Gly Ala Ala Gln Leu Val Leu Thr Ser Arg Arg Gly Pro
1235 1240 1245
Asn Ala Pro Gly Ala Ala Glu Leu Ala Ala Glu Leu Arg Gly Leu
1250 1255 1260
Gly Val Ala Val Thr Ile Ala Ala Cys Asp Ala Ala Asp Pro Glu
1265 1270 1275
Ala Met Arg Gly Val Leu Asp Ala Ile Pro Ala Glu His Pro Leu
1280 1285 1290
Ser Ala Val Ile His Ala Ala Gly Val Ser Glu Gln Asp Leu Ile
1295 1300 1305
Ala Asp Val Asp Asp Glu His Leu Asn Arg Met Leu Ala Pro Lys
1310 1315 1320
Ala Leu Ala Ala Trp His Leu His Glu Leu Thr Arg His Leu Asp
1325 1330 1335
Leu Ser Ala Phe Ile Leu Phe Ser Ser Val Ser Ala Ser Trp Gly
1340 1345 1350
Ser Gly Gln Gln Ala Gly Tyr Ala Ala Ala Asn Ala Tyr Leu Asp
1355 1360 1365
Ala Leu Ala Glu His Arg Arg Gly Leu Gly Leu Pro Thr Thr Ser
1370 1375 1380
Val Ala Trp Gly Leu Trp Gly Glu Val Gly Met Ala Thr Thr Ala
1385 1390 1395
Asp Glu Val Asp Ala Phe Arg Arg Arg Gly Ile His Pro Leu Asp
1400 1405 1410
Pro Gly Leu Ala Ile Ala Ser Leu Gln Gln Ala Val Glu Arg Arg
1415 1420 1425
Glu Thr Asn Ala Val Val Ala Ala Ile Asp Trp Arg Arg Phe Leu
1430 1435 1440
Thr Gly Phe Thr Ala Leu Arg Pro Ser Pro Leu Leu Ser Asp Leu
1445 1450 1455
Pro Glu Ala Ala Glu Ala Arg Ala Glu Glu Thr Arg Pro Arg Glu
1460 1465 1470
Asp Glu Ala Asp Pro Leu Arg Arg Lys Leu Ala Gly Cys Pro Pro
1475 1480 1485
Ala Glu Gln His His Ile Leu Val Arg His Val Gln Ala His Ala
1490 1495 1500
Ala Thr Thr Leu Gly His Ala Asp Ala Asp Ala Val Pro Pro Thr
1505 1510 1515
Lys Pro Phe Gln Glu Leu Gly Phe Asp Ser Leu Thr Ala Val Gln
1520 1525 1530
Leu Arg Asp Arg Leu Asn Ala Gly Thr Gly Leu Arg Leu Pro Thr
1535 1540 1545
Thr Val Leu Phe Asp Tyr Pro Ser Ala Glu Glu Leu Ala Arg His
1550 1555 1560
Leu His Gly Leu Leu Val Ala Asp Ala Ala Ser Gly Glu Gln Leu
1565 1570 1575
Ile Met Ser Glu Leu Asp Gly Trp Asp Ala Ala His Ala Pro Asp
1580 1585 1590
Thr Val Asp Glu Ala Ala Cys Ser Arg Ile Ala Ala Arg Leu Arg
1595 1600 1605
Leu Leu Ala Asp Lys Trp Ser Asp Thr Ala Arg Ala Thr Gly Ser
1610 1615 1620
His Ser Asp Leu Glu Thr Ala Thr Ala Glu Asp Ile Phe Asp Leu
1625 1630 1635
Ile Ala Thr Glu Phe Gly Lys Ser
1640 1645
<210>22
<211>290
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Leu Ser Val Pro Val Asp His Ala Val His Leu Asn Val Arg His Arg
1 5 10 15
Pro Gly Arg Asp Gly Arg Pro Phe Leu Leu Leu His Gly Leu Gly Ser
20 25 30
Asn Ala Arg Leu Trp Asp Glu Val Ala Asp Leu Leu Ala Ala Ala Gly
35 40 45
His Pro Val Tyr Ala Leu Asp Met Arg Gly His Gly Asp Ser Asp Leu
50 55 60
Pro Glu His Gly Tyr Asp Asn Ala Thr Ala Val Ala Asp Leu Val Ala
65 70 75 80
Val Cys Arg Glu Leu Arg Leu Thr Gly Ala Leu Phe Ala Gly His Ser
85 90 95
Trp Gly Gly Asn Leu Ala Val Arg Leu Ala Ala Gln His Pro Glu Leu
100 105 110
Ala Ala Gly Leu Ala Leu Val Asp Gly Gly Trp Ile Gly Phe Val Asp
115 120 125
Thr Ser Arg Tyr Ala Pro Thr Arg Glu Lys Ser Val Glu Leu Ala Gly
130 135 140
Trp Trp Arg Asp Val Thr Gly Ile Lys Glu Glu Thr Met Arg Glu Leu
145 150 155 160
Leu Arg Gly Leu His Pro Thr Trp Ser Gln Thr Ala Val Glu Ala Ser
165 170 175
Leu Ala Asp Met Val Glu Gly Pro Asp Gly Leu Leu Val Gln Arg Leu
180 185 190
Pro Leu Glu His Tyr Met Ser Leu Ala Asp Ser Met Trp Gln Asp Pro
195 200 205
Pro Ala Arg Trp Tyr Pro Gly Ile Thr Ala Pro Val Leu Leu Leu Val
210 215 220
Ala Leu Pro Ala His Ala Gln Ser Trp Gly Thr Tyr Ala Arg Lys Trp
225 230 235 240
Val Ala Glu Ala Glu Ala Ala Ile Pro Gln Ala Glu Ser Arg Trp Tyr
245 250 255
Val Asp Thr Asp His Asn Leu His Val Glu Glu Pro Glu Arg Val Ala
260 265 270
Ser Asp Leu Leu Asp Leu Ala Arg Leu Val Asp Lys Pro Ala Pro Glu
275 280 285
Arg Ser
290
<210>23
<211>104
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Glu Pro Asn Glu Glu Ala Thr His Asp Glu Gln Leu Val Ala Arg
1 5 10 15
Leu Ala Ser Ala Thr Pro Glu Glu Arg Gln Arg Leu Leu Ala Ala Gln
20 25 30
Val Leu Arg Arg Ala Ser Glu Val Leu Asp Val Pro Ala Leu Asp Glu
35 40 45
Glu Ser Asn Phe Leu Glu Asn Gly Leu Ser Ser Leu Ser Ala Val Gln
50 55 60
Leu Ala Lys Ser Leu Met Ser Asp Thr Gly Leu Glu Val Pro Leu Val
65 70 75 80
Ala Ile Val Glu His Pro Thr Ser Thr Leu Leu Gly Lys Tyr Leu Ala
85 90 95
Glu Thr Tyr Glu Ala Asp Ala Ala
100
<210>24
<211>478
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Thr Thr Pro Thr Arg Ala Val Val Leu Gly Gly Gly Trp Ala Gly
1 5 10 15
Met Leu Thr Ala His Val Leu Ala Arg His Leu Glu Ser Val Thr Val
20 25 30
Val Glu Arg Asp Ile Leu Pro Asp Gly Pro His His Arg Lys Gly Gln
35 40 45
Pro Gln Ala Arg His Val His Val Leu Trp Ser Ser Gly Ala Gly Ile
50 55 60
Val Glu Asn Leu Leu Pro Gly Thr Ala Glu Arg Leu Leu Ala Ala Gly
65 70 75 80
Ala Arg Arg Ile Gly Phe Gln Ser Asp Leu Val Thr Leu Thr Ala Trp
85 90 95
Gly Trp Gln Tyr Arg Phe Pro Ala Thr Ala Tyr Ala Met Met Cys Thr
100 105 110
Arg Pro Leu Leu Asp Trp Val Val Arg Asp Ala Ile Leu Ala Gly Gly
115 120 125
Arg Ile Glu Val Glu His Gly Thr Glu Ala Val Glu Leu Ala Gly Asp
130 135 140
Arg Ser Arg Val Thr Gly Val Arg Val Arg Asp Ala Gly Gly Gly Glu
145 150 155 160
Pro Arg Leu Leu Glu Ala Asp Leu Val Val Asp Ala Thr Gly Arg Ala
165 170 175
Ser Arg Leu Gly His Trp Leu Ala Ala Leu Gly Leu Pro Ala Val Glu
180 185 190
Gln Asp Val Val Asp Ala Gly Ile Gly Tyr Ala Thr Arg Met Phe Lys
195 200 205
Ala Pro Glu Gly Ala Asp Gly Asn Phe Pro Ala Val Gln Val Ala Ala
210 215 220
Asp Pro Leu Thr Arg Gln Pro Gly Arg Phe Gly Val Val Tyr Pro Gln
225 230 235 240
Glu Gly Gly Arg Trp Leu Val Thr Leu Thr Ser Thr Arg Gly Ala Pro
245 250 255
Leu Pro Thr Asp Glu Asp Glu Phe Thr Gly Tyr Ala Lys Val Leu Arg
260 265 270
His Ser Ile Val Ser Glu Leu Met Ser Val Ala Glu Pro Ile Ser Pro
275 280 285
Ile Phe Gln Ser His Ser Gly Ala Asn Arg Arg Met Tyr Pro Glu Arg
290 295 300
Met Pro Gln Trp Pro Glu Gly Leu Leu Ile Leu Gly Asp Ser Leu Ala
305 310 315 320
Ala Phe Asn Pro Val Tyr Gly His Gly Met Ser Ser Ala Ala Arg Ala
325 330 335
Ala Glu Ala Leu Asp Lys Glu Leu Ala Arg Asp Gly Phe Gly Glu Gly
340 345 350
Gly Thr Arg Gln Val Gln Arg Ala Leu Ser Glu Val Val Asp Asp Pro
355 360 365
Trp Ile Met Ala Gly Leu Asn Asp Ile Gln Tyr Val Asn Cys Arg Asn
370 375 380
Leu Ser Ser Asp Pro Arg Leu Thr Gly Pro Asp Val Ala Glu Arg Leu
385 390 395 400
Lys Phe Ser Asp Phe Leu Ser Gly Lys Ser Ile Arg Ser Pro Lys Val
405 410 415
Cys Glu Val Thr Thr Ser Val Leu Ser Leu Asn Ala Pro Gln Lys Ala
420 425 430
Leu Gly Asp Ser Arg Phe Leu Ser Leu Leu Arg Thr Asp Thr Ser His
435 440 445
Pro Lys Leu Val Glu Pro Pro Phe His Pro Glu Glu Leu Glu Met Val
450 455 460
Gly Leu Lys Pro Ser Gly Ile Ala Ala Lys Gly Ala Leu Gly
465 470 475
<210>25
<211>313
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Val Leu Asn Glu Arg Thr Arg Lys Lys Ile Ala Arg Glu His Ser Arg
1 5 10 15
Arg Leu Ser Glu Gly Asp Ile Asp Gly Leu Leu Asp Leu Tyr Ala Lys
20 25 30
Asp Val Thr Phe Glu Ala Pro Val Gly Ala Gly Gly Gln Ala Gly His
35 40 45
Glu Ala Leu Arg Ala His Phe Glu Ser Ala Val Ala Gly Asn Val Asp
50 55 60
Glu Thr Ile Val Glu Ser Val Val Gly Gln Asp Gly Glu His Val Leu
65 70 75 80
Ser Arg Ile Thr Ala Val Met Asp Tyr Arg Pro Arg Gly Pro Leu Tyr
85 90 95
Ala Asp Arg Gly Trp Leu Pro Ala Pro Glu Gly Ala Glu Pro Thr Ala
100 105 110
Leu Arg Cys His Tyr Ala Leu Leu Leu Arg Val Gly Glu Ser Gly Leu
115 120 125
Ile Glu Asp Met Arg Ala Tyr Trp Gly Lys Pro Asp Leu Glu Thr Ser
130 135 140
Val Asp Gly Arg Ser Gly Pro Phe Val Gly Pro Pro Ala Ile Asp Pro
145 150 155 160
Asp Glu Ala Ala Leu Arg Glu Leu Pro His Asn Tyr Leu Arg Leu Leu
165 170 175
Gln Lys Gly Asp Val Glu Gly Thr Val Ala Leu Phe Thr Asp Asp Ile
180 185 190
Val Phe Glu Asp Pro Val Gly Gly Leu Leu Leu Arg Gly Lys Asp Ala
195 200 205
Leu Arg Glu His Ala Phe Arg Gly Ser Glu Gly Lys Val His Glu Met
210 215 220
Leu Gly Arg Leu Val Thr Ser Met Asp Gly Arg Phe Val Val Val Leu
225 230 235 240
Gly Asp Ala Arg Val Tyr Val Pro Ala Arg Met Arg Met Arg Met Ile
245 250 255
Thr Ile Cys Glu Val Asn Glu Asp Arg Leu Gly Ala His Ile Gln Gly
260 265 270
Phe Trp Gly Leu Thr Asp Met Thr Ile Gly Phe Pro Asp Asp Asp Thr
275 280 285
Ala Pro Glu Pro Ala Leu Asp Thr Ala Arg Arg Pro Glu Arg Lys Asp
290 295 300
Ala Leu Lys Pro Phe Gly Thr Pro Ser
305 310
<210>26
<211>802
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Val Pro Ser Leu Arg Arg Asn Val Met Thr Val Ala Asp Ala Lys Val
1 5 10 15
Val Glu Ala Leu Arg Thr Ser Leu Leu Glu Thr Glu Arg Leu Arg Lys
20 25 30
Glu Asn Asp Arg Leu Arg Ala Ala Pro Arg Glu Pro Val Ala Ile Thr
35 40 45
Gly Met Ala Cys Arg Tyr Pro Gly Gly Val Glu Ser Pro Asp Asp Leu
50 55 60
Trp Thr Leu Leu Thr Glu Glu Arg Asp Ala Ile Gly Ala Phe Pro Ala
65 70 75 80
Asp Arg Gly Trp Asp Leu Asp Gly Leu Tyr Gly Pro Asp Ala His Pro
85 90 95
Asp Val Arg Ser Ala Val Glu Glu Gly Gly Phe Leu Ser Asp Ala Gly
100 105 110
Ala Phe Asp Pro Ala Pro Phe Gly Ile Ser Pro Gly Glu Ala Ser Val
115 120 125
Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Thr Thr Trp Glu Ala Phe
130 135 140
Glu Arg Ala Gly Ile Asp Pro Arg Ser Val Arg Gly Arg Arg Cys Gly
145 150 155 160
Val Phe Met Gly Thr Thr Gly Gln Asp Tyr Thr Pro His Leu Lys Asp
165 170 175
Val Pro Asp Glu Leu Leu Gly His Ile Ala Ser Gly Gly Ser Ser Ala
180 185 190
Val Leu Ser Gly Arg Leu Ala Ser Val Phe Gly Leu Glu Gly Pro Thr
195 200 205
Ala Thr Leu Asp Thr Ala Cys Ser Gly Ser Leu Val Ala Leu His Leu
210 215 220
Ala Cys Gln Ser Leu Arg Gly Gly Glu Cys Ser Met Ala Leu Ala Gly
225 230 235 240
Gly Val Thr Val Met Ser Ser Pro Glu Thr Phe Ile Gly Thr Gly Arg
245 250 255
Gly Ile Gly Leu Pro Ala Ala Ala Arg Cys Arg Ser Phe Ala Asp Gly
260 265 270
Ala Glu Gly Ile Ala Phe Ala Glu Gly Ala Gly Val Val Leu Leu Glu
275 280 285
Arg Leu Ser Thr Ala Arg Ala His Gly Arg Pro Val Leu Ala Val Val
290 295 300
Arg Gly Ser Ala Ile Gly Gln Glu Gly Thr Asn Asn Gly Val Ser Ala
305 310 315 320
Ser Asn Gly Pro Ala Gln Gln Arg Leu Ile Arg Gln Ala Leu Ala Ala
325 330 335
Ala Gly Leu Leu Pro His Glu Ile Asp Ala Val Glu Gly Gln Gly Thr
340 345 350
Gly Gly Leu Leu Ser Asp Ala Val Glu Ala Gln Ala Leu Ala Ser Val
355 360 365
Tyr Gly Glu Gly Arg Pro Ala Asp Arg Pro Leu Leu Leu Gly Ala Val
370 375 380
Lys Ser Asn Leu Gly His Thr Gln Gly Ala Ser Gly Val Ala Gly Val
385 390 395 400
Ile Lys Thr Val Gln Ala Met Arg His Gly Val Leu Pro Arg Thr Leu
405 410 415
His Thr Glu Val Pro Ser Pro His Ile Ser Trp Lys Arg Gly Arg Ile
420 425 430
Arg Leu Leu Thr Ala Ala Thr Pro Trp Pro Gly Thr Asp Arg Pro Leu
435 440 445
Arg Ser Gly Val Ser Ala Phe Gly Phe Gly Gly Thr Asn Ala His Val
450 455 460
Ile Val Glu Gln Ala Pro Pro Glu Asp Asp Pro Pro Pro Pro Leu Pro
465 470 475 480
Glu Ala Glu Gly Glu Pro Gly Ser Ala Gly Val Ala Leu Trp Pro Leu
485 490 495
Ser Gly Cys Asp Pro Asp Ala Leu Arg Asp Gln Ala Ala Arg Leu Leu
500 505 510
Ala His Leu Asp Glu Arg Pro Gly Leu Arg Pro Ala Asp Val Gly Leu
515 520 525
Ser Leu Gly Thr Ser Arg Ala Ala Leu Glu His Arg Gly Val Val Val
530 535 540
Gly Glu Ser Arg Gln Glu Leu Leu Asp Gly Leu Arg Ala Leu Ala Glu
545 550 555 560
Gly Arg Ala Ala Pro His Val Ala Arg Gly Ala Ile Gly Arg Arg Pro
565 570 575
Arg Leu Val Val Leu Phe Thr Gly His Ala Pro Ala Pro Gly Thr Gly
580 585 590
Lys Gln Leu Tyr Asp Ala Phe Pro Ala Phe Ala Asp Ala Phe Asp Thr
595 600 605
Val Cys Ala Ala Leu Asp Ala His Leu Gly Phe Pro Ala Arg Asp Ala
610 615 620
Met Leu Thr Gly Ala Asp Pro Thr Ser Ala Ala Val Pro Asp Pro Ala
625 630 635 640
Arg Ala Phe Ala Ile Gln Val Ala Leu Phe Arg Leu Val Glu Ser Trp
645 650 655
Gly Thr Arg Pro Gly Ala Val Arg Gly His Gly Ile Gly Arg Leu Ala
660 665 670
Glu Asp His Val Ser Gly Arg Leu Ala Leu Pro Asp Ala Cys Ala Ala
675 680 685
Leu Thr Gly Pro His Glu Ala Pro Asp Pro Pro Arg Asp Phe Ala Gln
690 695 700
Arg Val Arg Asp Leu Ala Gly Asp Gly Thr Val Tyr Leu Glu Leu Gly
705 710 715 720
Gly His Glu Val Ala Gly Leu Leu Asp Gly Ala Gly Ala His Ala Val
725 730 735
Ala Ala Leu Arg Pro Gly Val Ala Glu Ala Thr Ser Ala Ala Thr Gly
740 745 750
Leu Ala Arg Ala His Ala Arg Gly Thr Ala Val Asp Trp Arg Ala Val
755 760 765
Phe Gly Pro Glu Ala Arg Arg Val Glu Leu Pro Thr Tyr Ala Phe Arg
770 775 780
Arg Gly Arg Tyr Trp Arg Ala Ala Phe Asp Met Ser Leu Val Arg Pro
785 790 795 800
Asp Arg
<210>27
<211>2187
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Ala Ash Gln Glu Gln Leu Val Asp Tyr Leu Lys Arg Val Ala Thr
1 5 10 15
Asp Leu His Asp Thr Gln Gln Arg Leu Arg Glu Val Glu Ala Arg Asp
20 25 30
Arg Gln Pro Ile Ala Ile Val Gly Met Ala Cys Arg Phe Pro Gly Gly
35 40 45
Thr Asp Thr Pro Glu Ala Leu Trp Asp Leu Val Ser Glu Gly Arg Asp
50 55 60
Val Ile Ser Pro Met Pro Asp Asp Arg Gly Trp Asp Pro Gly Met Phe
65 70 75 80
Asn Asp Ser Gly Glu Thr Gly Thr Ser Tyr Val Arg Glu Gly Gly Phe
85 90 95
Val His Asp Ile Ala Asp Phe Asp Ala Asp Phe Phe Asp Ile Asn Pro
100 105 110
Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Thr
115 120 125
Ser Trp Glu Ala Ile Glu Arg Ala Arg Ile Asp Leu Thr Ser Leu Arg
130 135 140
Gly Ser Lys Thr Gly Val Phe Val Gly Gly Ser Thr Val Tyr Tyr Ala
145 150 155 160
Gly Asn Ala Ala Gly Val Pro Gln Asp Val Ala Gly Tyr Leu Ala Thr
165 170 175
Gly Leu Ala Ala Ser Ser Met Ser Gly Arg Ile Ser Tyr Thr Phe Gly
180 185 190
Phe Glu Gly Pro Ser Phe Thr Val Asp Thr Ala Cys Ser Ser Ser Gly
195 200 205
Val Ala Leu His Leu Ala Val Gln Ala Leu Arg Lys Gly Glu Cys Ser
210 215 220
Leu Ala Leu Ala Gly Gly Val Cys Val Met Ala Thr Pro Gly Thr Tyr
225 230 235 240
Leu Glu Phe Ser Lys Leu Asn Gly Leu Ala Ala Asp Gly Arg Cys Lys
245 250 255
Ala Phe Ala Ala Ala Ala Asp Gly Phe Gly Pro Ala Glu Gly Val Gly
260 265 270
Val Leu Leu Val Glu Arg Leu Ala Asp Ala Glu Arg Leu Gly His Pro
275 280 285
Val Leu Ala Val Ile Arg Gly Ser Ala Ile Asn Gln Asp Gly Ala Ser
290 295 300
Asn Gly Leu Thr Ala Pro His Gly Pro Ala Gln Glu Arg Val Ile Arg
305 310 315 320
Ala Ala Leu Ala Asp Ala Gln Leu Ser Ala Arg Asp Ile Asp Val Val
325 330 335
Glu Ala His Gly Thr Gly Thr Ser Leu Gly Asp Pro Ile Glu Ala Gln
340 345 350
Ala Leu Ile Ala Ala Tyr Gly Arg Arg Arg Ala Asp Gly Gly Pro Leu
355 360 365
Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr Gln Ala Ala Ala
370 375 380
Gly Met Ala Gly Val Ile Lys Met Val His Ala Leu Arg His Gly Leu
385 390 395 400
Leu Pro Arg Thr Leu His Val Asp Glu Pro Thr His Gln Val Asp Trp
405 410 415
Ser Glu Gly Thr Val Arg Leu Leu Thr Glu Ala Arg Pro Trp Pro Glu
420 425 430
Ala Gly Gly Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Met Ser Gly
435 440 445
Thr Asn Thr His Val Ile Leu Glu Gln Ala Pro Pro Ala Glu Glu Pro
450 455 460
Ala Pro Ala Ala Glu Ser Pro Val Val Pro Trp Leu Val Ser Gly Arg
465 470 475 480
Gly Glu Ala Ala Leu Arg Ala Gln Ala Ala Arg Leu Arg Asp Phe Leu
485 490 495
Ala Glu Arg Pro Glu Ala Ser Pro Thr Arg Val Gly Phe Ala Leu Ala
500 505 510
Gly Ser Arg Ala Ala Gln Ser His Arg Ala Ala Val Val Ala Ala Asp
515 520 525
Arg Asp Thr Leu Leu Ala Gly Leu Gly Ser Leu Ala Glu Gly Thr Pro
530 535 540
Ala Gly His Val Val Thr Gly Ser Val Ser Pro Gly Ala Thr Ala Phe
545 550 555 560
Val Phe Pro Gly Gln Gly Ser Gln Trp Val Gly Met Ala Leu Ala Leu
565 570 575
Ala Asp Ala Ser Pro Val Phe Ala Glu His Phe Arg Arg Cys Ala Glu
580 585 590
Ala Val Glu Arg His Thr Asp Tyr Thr Val Glu Ser Val Leu Arg Ala
595 600 605
Asp Pro Gly Ala Pro Ser Leu Asp Arg Val Asp Val Val Gln Pro Val
610 615 620
Leu Trp Ala Val Met Val Ala Leu Ala Glu Leu Trp Arg Ser His Gly
625 630 635 640
Val Glu Pro Ala Ala Val Val Gly His Ser Gln Gly Glu Ile Ala Ala
645 650 655
Ala Cys Val Ala Gly Val Leu Ser Val Asp Asp Ala Ala Arg Val Val
660 665 670
Val Leu Arg Ser Gln Val Leu Pro Glu Leu Ser Gly Arg Gly Gly Met
675 680 685
Ala Ser Val Ala Gln Pro Val Gly Leu Val Glu Lys His Leu Glu Arg
690 695 700
Trp Asp Gly Arg Leu Ser Val Ala Ala Val Asn Gly Pro Ser Ser Thr
705 710 715 720
Val Val Ser Gly Asp Ala Asp Ala Leu Glu Glu Leu Leu Glu Gly Tyr
725 730 735
Glu Ala Asp Gly Val Arg Ala Arg Arg Val Pro Val Asp Tyr Ala Ser
740 745 750
His Cys Ala His Val Asp Ala Leu Arg Gly Pro Leu Leu Asp Ala Leu
755 760 765
Ser Gly Ile Glu Pro Lys Ala Gly Thr Val Pro Leu Tyr Ser Thr Val
770 775 780
Thr Gly Arg Arg Ile Asp Gly Thr Thr Met Asp Ala Gly Tyr Trp Tyr
785 790 795 800
Thr Asn Leu Arg Gln Gln Val Arg Phe Gln Glu Ala Thr Glu Ala Leu
805 810 815
Leu Ala Asp Gly His Gly Val Phe Ile Glu Cys Ser Pro His Pro Val
820 825 830
Leu Thr Ile Gly Val Gln Glu Thr Met Asp Gln Ser Gly Ala Asn Ala
835 840 845
Thr Ala Leu Gly Thr Leu Arg Arg Asp Glu Gly Gly Trp Asp Arg Phe
850 855 860
Leu Leu Ala Leu Gly Gln Ala His Thr His Gly Val Ala Val Asp Trp
865 870 875 880
Ser Arg Val Phe Pro Asp Gly Thr Ser Pro Ala Asp Leu Pro Thr Tyr
885 890 895
Ala Phe Gln Arg Arg Arg Tyr Trp Leu Asp Ala Thr Ala Gly Ala Ala
900 905 910
Gly Asp Pro Ala Ser Leu Gly Leu Thr Pro Ala Asp His Pro Leu Leu
915 920 925
Gly Ala Val Thr Leu Leu Ala Glu Gly Asp Glu Val Val Met Thr Gly
930 935 940
Arg Leu Gly Leu Asp Thr His Pro Trp Leu Ala Asp His Ala Val Ala
945 950 955 960
Gly Ala Val Leu Val Pro Gly Ala Val Phe Val Glu Leu Ala Val Arg
965 970 975
Ala Gly Asp Glu Val Gly Cys Asp His Leu Glu Glu Met Val Leu Ala
980 985 990
Ser Pro Leu Val Leu Pro Glu Gln Gly Gly Phe Asp Leu Gln Leu Val
995 1000 1005
Val Gly Gly Ala Ala Glu Asp Gly Arg Arg Thr Leu Gly Gly Tyr
1010 1015 1020
Ala Arg Pro Ser Gly Ser Asp Arg Pro Trp Val Gln His Val Thr
1025 1030 1035
Gly Thr Leu Ala Pro Gly Gly Ala Ala Arg Pro Phe Asp Leu Ala
1040 1045 1050
Gln Trp Pro Pro Glu Gly Ala Arg Pro Val Pro Val Glu Gly Cys
1055 1060 1065
Tyr Asp Gln Leu Ala Glu Gly Gly Phe Arg Tyr Gly Pro Ala Phe
1070 1075 1080
Arg Ser Leu Arg Ala Val Tyr Arg Arg Glu Thr Glu Val Phe Ala
1085 1090 1095
Glu Val Val Leu Pro Glu Glu Gln Arg Gly Lys Ala Ala Ala Phe
1100 1105 1110
Gly Ile His Pro Ala Leu Leu Asp Gly Ala Leu His Ala Ser Gly
1115 1120 1125
Leu Ser Ala Val Arg Gly Asp Ser Gly Gly Arg Met Ala Leu Pro
1130 1135 1140
Phe Ala Trp Asn Gly Val Ser Leu Ala Ala Thr Gly Ala Glu Leu
1145 1150 1155
Leu Arg Val Arg Leu Ala Pro Val Gly Asp Asp Gly Met Ser Val
1160 1165 1170
His Ala Thr Asp Ala Ser Gly His Pro Val Ile Ser Ile Glu Ser
1175 1180 1185
Leu Val Thr Arg Pro Phe Thr Ala Gly Gln Leu Pro Ser Asp Gly
1190 1195 1200
Asp Glu Thr Arg Asp Gly Leu Phe Arg Val Ala Tyr Thr Pro Leu
1205 1210 1215
Thr Asp Pro Glu Pro Gly Thr Ala Thr Asp Asp Trp Thr Ala Val
1220 1225 1230
Ala Thr Gly Ser Glu Pro Thr Tyr Tyr Gly Val Arg Arg Tyr Gly
1235 1240 1245
Asp Leu Asp Ala Leu Ala Ala Ala Val Asp Gly Gly Leu Pro Ala
1250 1255 1260
Pro Pro Val Thr Leu Leu Pro Cys Glu Pro Ala Pro Asp Asp Gly
1265 1270 1275
Asp Leu Pro Gly Ala Leu Arg Arg Arg Leu Gly Glu Val Leu His
1280 1285 1290
Thr Val Gln Arg Trp Leu Ala Asp Glu Arg Phe Ala Ala Ser Arg
1295 1300 1305
Leu Val Val Val Thr Arg Gly Ala Val Ala Ala Phe Glu Gly Asp
1310 1315 1320
Asp Val Thr Asp Leu Val His Ala Pro Val Trp Gly Leu Ile Arg
1325 1330 1335
Ser Ala Gln Ser Glu His Pro Asp Arg Leu Ala Leu Val Asp Leu
1340 1345 1350
Asp Gly Pro Glu Leu Pro Lys Ala Val Ala Ala Ala Val Ala Val
1355 1360 1365
Ala Val Ala Ala Gly Glu Lys Gln Ile Val Val Arg Asp Gly Thr
1370 1375 1380
Val Arg Val Ser Arg Leu Val Pro Ala Val Ser Gly Gly Ala Leu
1385 1390 1395
Ala Leu Pro Glu Thr Pro His Trp Arg Leu Asp Ile Ala Ala Pro
1400 1405 1410
Ala Thr Leu Asp Asn Leu Gly Leu Val Ala Ala Glu Glu Pro Gly
1415 1420 1425
Pro Pro Ala Pro Gly His Val Arg Val Gln Val Arg Ala Ala Gly
1430 1435 1440
Val Asn Phe Arg Asp Val Leu Ile Ala Leu Gly Met Tyr Pro Gly
1445 1450 1455
Asp Gly Ala Phe Arg Gly Ser Glu Gly Ala Gly Val Val Leu Glu
1460 1465 1470
Val Ala Glu Asp Val Thr Ser Val Ala Val Gly Asp Arg Val Met
1475 1480 1485
Gly Met Phe Gln Gly Ala Phe Gly Ser Thr Ala Val Ala Asp Ala
1490 1495 1500
Arg Ser Val Val Pro Ile Pro Pro Gly Trp Thr Asp Glu Gln Ala
1505 1510 1515
Ala Ala Val Pro Ile Ala Tyr Val Thr Ala Trp Tyr Gly Leu Val
1520 1525 1530
Asp Leu Ala Gly Leu Lys Ala Gly Glu Ser Val Leu Ile His Ala
1535 1540 1545
Ala Thr Gly Gly Val Gly Thr Ala Ala Val Gln Ile Ala Arg His
1550 1555 1560
Leu Gly Ala Glu Val Tyr Ala Thr Ala Gly Pro Gly Lys His His
1565 1570 1575
Val Leu Glu Ala Met Gly Ile Asp Glu Ala His Arg Ala Ser Ser
1580 1585 1590
Arg Asp Leu Asp Phe Glu Asp Ala Phe Arg Ala Ala Thr Gly Gly
1595 1600 1605
Arg Gly Val Asp Val Val Leu Asn Ser Leu Thr Gly Asp His Thr
1610 1615 1620
Asp Ala Ser Leu Arg Leu Leu Ala Glu Lys Gly Arg Phe Val Glu
1625 1630 1635
Leu Gly Met Thr Asp Val Arg Asp Pro Glu Gln Leu Ala Glu Pro
1640 1645 1650
Tyr Pro Gly Leu Arg Tyr Arg Val Leu Asp Leu Arg Glu Pro Gly
1655 1660 1665
Glu Asp Gly Ile Gly Arg Met Leu Thr Glu Ile Leu Gly Arg Phe
1670 1675 1680
Ala Gly Gly Glu Leu Thr His Pro Ala Val Arg Ala Phe Asp Ile
1685 1690 1695
Arg Arg Ala Arg Asp Ala Phe Arg Leu Met Ser Arg Ala Ala His
1700 1705 1710
Ile Gly Lys Ile Val Leu Thr Leu Pro Arg Pro Leu Asp Pro Asp
1715 1720 1725
Gly Thr Val Leu Val Thr Gly Gly Thr Gly Thr Leu Gly Gly Leu
1730 1735 1740
Ile Ala Arg His Leu Val Thr Arg His Gly Val Arg His Leu Leu
1745 1750 1755
Leu Thr Ser Arg Arg Gly Pro Glu Ala Pro Gly Ala Pro Ala Leu
1760 1765 1770
Arg Glu Asp Leu Ala Ala Leu Gly Ala Thr Val Thr Val Thr Ala
1775 1780 1785
Cys Asp Ala Gly Asp Arg Glu Arg Leu Ala Glu Val Leu Ala Gly
1790 1795 1800
Val Pro Asp Ala His Pro Leu Thr Gly Val Val His Cys Ala Gly
1805 1810 1815
Val Leu Asp Asp Gly Met Val Asp Ala Leu Thr Pro Glu Gln Leu
1820 1825 1830
Ala Arg Val Leu Arg Pro Lys Ala Glu Ala Ala Leu His Leu His
1835 1840 1845
Glu Leu Thr Gln Asp Ala Asp Leu Ala Leu Phe Val Leu Phe Ser
1850 1855 1860
Ser Ile Val Gly Val Tyr Gly Asn Pro Ser Gln Ala Asn Tyr Ala
1865 1870 1875
Ala Ala Ser Thr Phe Leu Asp Ala Leu Ala Gln His Arg Gln Ala
1880 1885 1890
Gly Gly Leu Pro Ala Gln Ser Leu Ala Trp Gly Met Trp Glu Glu
1895 1900 1905
Thr Ser Ala Leu Thr Gly Glu Leu Asp Asp Gly Val Arg Gln Arg
1910 1915 1920
Ile Ser Gln Ala Gly Met Asp Pro Leu Pro Thr Glu Gln Ala Leu
1925 1930 1935
Ala Leu Phe Asp Arg Ser Tyr Ala Val Gly Asp Ala Leu Leu Val
1940 1945 1950
Pro Ile Arg Gln Ser Thr Ala Ala Ser Arg Gly Gly Pro Ala Ala
1955 1960 1965
Gly Arg Thr Arg Ala Arg Arg Val Ala Asp Ser Gly Val Ala Gly
1970 1975 1980
Thr Gly Gly Pro Ser Leu Thr Asp Arg Val Thr Ala Leu Pro Glu
1985 1990 1995
Ala Glu Arg Asp Ser Cys Val Leu Glu Ala Val Arg Ala His Met
2000 2005 2010
Ala Ala Val Leu Gly His Asp Ser Ala Asp Glu Ile Ala Pro Asp
2015 2020 2025
Arg Thr Phe Lys Glu Leu Gly Phe Asp Ser Leu Thr Ala Val Glu
2030 2035 2040
Leu Arg Asn Arg Leu Ser Ala Ala Thr Gly Leu Arg Leu Pro Ala
2045 2050 2055
Thr Phe Val Phe Asp Phe Ala Asn Pro Val Ala Leu Ala Glu His
2060 2065 2070
Leu Arg Glu Gln Ile Ala Pro Pro Thr Glu Leu Gly Pro Pro Thr
2075 2080 2085
Asp Pro Ser Thr Pro Ala Asp Pro Ser Pro Pro Pro Gly Leu Pro
2090 2095 2100
Glu Gly Pro Ser Thr Asp Pro Arg Glu Ser Arg Val Arg Gln Val
2105 2110 2115
Leu Ala Ala Ile Pro Leu Arg Arg Leu Glu Glu Ser Gly Leu Leu
2120 2125 2130
Glu Thr Leu Leu Arg Leu Gly Glu Gly Pro Gly Glu Ala Asp Thr
2135 2140 2145
Thr Gly Ala Arg Pro Val Glu Gly Thr Arg Asp Glu Thr His Thr
2150 2155 2160
Pro Asp Ser Ala Asp Ile Ala Ser Met Asp Leu Glu Glu Leu Val
2165 2170 2175
Asn Ala Ala Leu Arg Asn Asp Glu Ser
2180 2185
<210>28
<211>3455
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Ile Leu Ser Arg Pro Arg Glu Glu Glu Ala Pro Ala Met Ala Lys
1 5 10 15
Asp Glu Ala Lys Leu Leu Asp His Leu Lys Trp Val Thr Ala Glu Leu
20 25 30
Arg Asp Thr Arg Arg Arg Leu Arg Glu Ala Glu Ser Thr Glu Pro Glu
35 40 45
Pro Ile Ala Ile Val Gly Met Ala Cys Arg Phe Pro Gly Gly Val Asp
50 55 60
Asn Pro Asp Asp Leu Trp Gln Leu Val Ala Ala Gly Gly Glu Gly Leu
65 70 75 80
Thr Glu Phe Pro Glu Asp Arg Gly Trp Asp Leu Glu Asn Leu Phe Asp
85 90 95
Pro Asp Pro Asp Ser Ala Gly Thr Ser Tyr Val Arg Arg Gly Ala Phe
100 105 110
Leu Ser Gly Ala Gly Gly Phe Asp Ala Glu Phe Phe Gly Ile Ser Pro
115 120 125
Arg Glu Ala Val Ala Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Val
130 135 140
Ala Trp Glu Thr Phe Glu His Ala Gly Ile Asp Pro His Gly Leu Val
145 150 155 160
Gly Ser Ser Thr Gly Val Tyr Ala Gly Val Thr Ser Gln Glu Tyr Met
165 170 175
Met Leu Thr Ala Met Ala Gly Ser Asp Val Glu Gly Tyr Ala Ala Thr
180 185 190
Gly Asn Leu Ala Cys Val Leu Ser Gly Arg Val Ser Tyr Val Leu Gly
195 200 205
Leu Glu Gly Pro Ala Val Thr Val Asp Thr Gly Cys Ser Ser Ser Leu
210 215 220
Val Ala Leu His Ser Ala Val Gln Ala Leu Arg Gly Gly Glu Cys Ser
225 230 235 240
Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Gly Ala Phe
245 250 255
Val Glu Phe Ser Arg Gln Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys
260 265 270
Ala Phe Ala Gly Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly Val Gly
275 280 285
Leu Leu Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Lys
290 295 300
Val Trp Ala Val Val Arg Gly Ser Ala Ile Asn Gln Asp Gly Ala Ser
305 310 315 320
Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Arg
325 330 335
Gln Ala Leu Ala Asn Ala Arg Leu Ser Pro Ala Asp Val Asp Val Val
340 345 350
Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala Gln
355 360 365
Ala Leu Leu Ala Thr Tyr Gly Gln Gly Arg Ser Ala Glu Arg Pro Leu
370 375 380
Trp Leu Gly Ser Ile Lys Ser Asn Leu Ala His Thr Gln Ala Ala Ala
385 390 395 400
Gly Val Ala Gly Val Ile Lys Met Val Met Ala Met Arg His Gly Leu
405 410 415
Leu Pro Gln Thr Leu His Val Asp Ala Pro Thr Pro His Val Asp Trp
420 425 430
Asp Ser Gly Ala Val Ala Leu Leu Ser Glu Ala Thr Asp Trp Pro Glu
435 440 445
Val Asp Arg Pro Arg Arg Ala Gly Val Ser Ala Phe Gly Ile Ser Gly
450 455 460
Thr Asn Ala His Val Ile Leu Glu Glu Ala Pro Ala Leu Ser Thr Glu
465 470 475 480
Gly Glu Asn Ala Glu Pro Thr Thr Pro Gly Gly Val Val Pro Trp Val
485 490 495
Val Ser Gly Arg Ser Ala Ala Gly Leu Arg Ala Gln Ala Gly Arg Leu
500 505 510
Ala Glu Phe Ala Glu Gln Ala Thr Ala Asp Val Ala Glu Val Gly Trp
515 520 525
Ser Leu Val Ala Gly Arg Ala Met Leu Glu His Arg Ala Val Val Val
530 535 540
Gly Asp Asp Arg Asn Glu Leu Leu Ala Gly Leu Arg Ala Leu Ala Glu
545 550 555 560
Gly Val Pro Phe Gly Gly Val Val Ser Val Asp Pro Val Ala Gly Gly
565 570 575
Ala Gly Pro Val Leu Val Phe Pro Gly Gln Gly Gly Gln Trp Arg Gly
580 585 590
Met Gly Val Glu Leu Leu Asp Ala Ser Pro Val Phe Ala Gly Arg Ile
595 600 605
Ala Glu Cys Glu Val Ala Leu Ala Pro Phe Val Glu Trp Ser Leu Thr
610 615 620
Ala Val Leu Arg Gly Glu Asp Gly Val Asp Val Ser Arg Val Asp Val
625 630 635 640
Val Gln Pro Ala Leu Trp Ala Val Met Val Ser Leu Ala Glu Val Trp
645 650 655
Arg Ser Tyr Gly Val Glu Pro Ala Ala Val Val Gly His Ser Gln Gly
660 665 670
Glu Ile Ala Ala Ala Val Val Ala Gly Ala Leu Thr Leu Arg Asp Gly
675 680 685
Ala Arg Val Val Ala Leu Arg Ser Arg Ala Leu Arg Val Leu Ser Gly
690 695 700
Arg Gly Ala Met Ala Ser Leu Ala Val Gly Arg Glu Glu Ala Glu Lys
705 710 715 720
Ala Ile Gly Gly Arg Ala Gly Val Val Val Ala Ala Val Asn Gly Pro
725 730 735
Gly Ser Thr Val Val Ser Gly Pro Pro Val Ala Val Ala Glu Val Val
740 745 750
Ala Ala Val Glu Ala Ala Gly Gly Arg Ala Arg Leu Val Asp Val Asp
755 760 765
Tyr Ala Ser His Asn Pro Gln Val Asp Asp Ile Ala Asp Glu Leu Ile
770 775 780
Glu Val Leu Gly Gln Val Thr Pro Val Glu Thr Ala Val Ala Phe Tyr
785 790 795 800
Ser Ala Val Thr Gly Gly Arg Val Glu Ser Thr Ala Leu Asp Ala Ala
805 810 815
Tyr Trp Val Glu Asn Leu Arg Arg Gln Val Arg Phe Ala Thr Ala Val
820 825 830
Glu Ala Leu Leu Ser Asp Gly Tyr Arg Val Leu Val Glu Ser Ser Pro
835 840 845
His Pro Val Leu Ser Val Gly Val Gln Glu Thr Ala Gln Ala Leu Asp
850 855 860
Val Pro Val Ala Thr Val Ala Thr Leu Gln Arg Asp Gln Gly Gly Ala
865 870 875 880
Val Gln Leu Ala Arg Ala Leu Ala Gln Ala Phe Thr Ala Gly Leu Ser
885 890 895
Val Asp Trp Lys Ala Trp Tyr Gly Val Asp Thr Thr Pro Asp Val Pro
900 905 910
Thr Gly Ala Ile Thr Pro Ala Pro Pro Val Leu Asp Leu Pro Thr Tyr
915 920 925
Ala Phe Gln His Gln His Tyr Trp Leu Lys Pro Gly Arg Trp Gly Ser
930 935 940
Ser Pro Glu Gln Gly Ser Glu Asp Asp Glu Arg Phe Trp Ser Ala Val
945 950 955 960
Glu Ala Gly Asp Leu Ala Gly Leu Gly Ala Ser Leu Asn Val Ala Glu
965 970 975
Asp Thr Ala Arg Gln Ala Leu Ala Pro Ala Leu Pro Val Leu Ala Glu
980 985 990
Trp Arg Gln Arg Thr Arg Asp Arg Ala Arg Ile Asp Gly Trp Arg Tyr
995 1000 1005
Arg Thr Ala Trp Ser Thr Val Thr Gly Leu Asp Ala Ala Pro Arg
1010 1015 1020
Leu Ser Gly Thr Trp Leu Leu Val Val Pro Glu Lys Leu Glu Gly
1025 1030 1035
Asp Ala Ala Val Glu Thr Val Val Arg Ala Leu Glu Gly Arg Gly
1040 1045 1050
Ala Arg Cys Thr Thr Leu Ala Leu Ala Pro Gly Asp Gln Ala Arg
1055 1060 1065
Asp Pro Leu Thr Thr Arg Leu Arg Gln Leu Glu Gly Gly Pro Glu
1070 1075 1080
Val Ala Gly Val Ile Ser Ala Leu Gly Leu Asp Glu Ala Val His
1085 1090 1095
Pro Asp His Pro His Leu Ser Val Gly Leu Ala Gly Thr Ile Ala
1100 1105 1110
Leu Val Gln Ala Leu Asp Glu Met Asp Phe Gly Gly Arg Leu Trp
1115 1120 1125
Cys Val Thr Arg Gly Gly Val Ser Val Gly Glu Asp Gly Pro Val
1130 1135 1140
Ser Pro Ala Gln Ala Gln Val Trp Gly Leu Gly Arg Val Met Ala
1145 1150 1155
Leu Glu Tyr Pro Lys Arg Trp Gly Gly Leu Val Asp Leu Pro Ala
1160 1165 1170
Ala Ala Glu Glu Glu Thr Ala Ser Arg Leu Ala Ala Val Val Ala
1175 1180 1185
Glu Gly Thr Glu Asp Gln Val Ala Leu Arg Ala Asp Gly Ala Leu
1190 1195 1200
Gly Arg Arg Leu Arg Ala Ala Pro Gly Gly Ala Pro Gly Glu Glu
1205 1210 1215
Trp Arg Thr Glu Gly Ser Val Leu Val Thr Gly Gly Thr Gly Gly
1220 1225 1230
Val Gly Gly Arg Val Ala Arg Trp Val Val Glu Arg Gly Ala Arg
1235 1240 1245
His Val Ile Val Ala Gly Arg Arg Gly Pro Ala Ala Pro Gly Ala
1250 1255 1260
Glu Glu Leu Thr Ala Glu Leu Glu Ala Leu Gly Ala Ser Val Asp
1265 1270 1275
Val Val Ala Cys Asp Val Ala Asp Arg Asp Gln Ala Ala Gly Leu
1280 1285 1290
Leu Ala Arg Val Pro Glu Glu His Pro Leu Arg Gly Ile Phe His
1295 1300 1305
Ala Ala Gly Val Gly Asp Tyr Thr Pro Val Arg Asp Leu Asp Pro
1310 1315 1320
Tyr Arg Val Ala Gln Val Thr Ala Ala Lys Ala Gly Gly Ala Arg
1325 1330 1335
Trp Leu Asp Glu Leu Thr Arg Asp Leu Asp Val Ser Ala Phe Val
1340 1345 1350
Leu Phe Ser Ser Gly Ala Ala Ser Trp Gly Ser Gly Gln Gln Gly
1355 1360 1365
Ala Tyr Ala Ala Ala Asn Ala Tyr Leu Asp Ala Leu Ala Glu Arg
1370 1375 1380
Arg Arg Ala Glu Gly Leu Pro Gly Leu Ser Val Ala Trp Gly Pro
1385 1390 1395
Trp Gly Glu Ala Gly Met Ala Ala Asp Gln Ala Val Ala Thr Phe
1400 1405 1410
Phe Arg Asp Arg Gly Leu Thr Ala Met Pro Pro Glu Leu Ala Leu
1415 1420 1425
Arg Val Leu Gly Asp Ala Leu Gly Arg Gly Glu Thr Thr Leu Thr
1430 1435 1440
Val Ala Asp Phe Asp Trp Ala Arg Phe Ala Ala Thr Phe Ala Gly
1445 1450 1455
Gln Arg Ala Ser Arg Leu Leu Ala Glu Ile Pro Gln Ala Val Glu
1460 1465 1470
Leu Leu Glu Lys Glu Thr Pro Ser Glu Asp Ser Pro Leu Arg Arg
1475 1480 1485
Gln Leu Ser Ala Ala Ala Pro Glu Gln Arg His Gln Ile Leu Ser
1490 1495 1500
Gln His Ile Arg Ala Leu Ala Ala Gly Val Leu Gly His Ser Gly
1505 1510 1515
Pro Asp Ala Val Ser Ala Thr Lys Pro Phe Phe Glu Met Gly Phe
1520 1525 1530
Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Lys Leu Ser Ser Ser
1535 1540 1545
Ala Gly Met Pro Leu Pro Thr Thr Leu Ile Phe Asp Tyr Pro Thr
1550 1555 1560
Ala Asp Asp Leu Ala Arg His Val Leu Gly Glu Ile Ser Gly Thr
1565 1570 1575
Gln Thr Ala Val Ala Asp Ala Thr Val Val Pro Val Ala Gly Pro
1580 1585 1590
Ile Gly Glu Pro Asp Glu Pro Ile Ala Ile Val Gly Met Ala Cys
1595 1600 1605
Arg Phe Pro Gly Gly Val Thr Ser Pro Glu Gln Leu Trp Asp Leu
1610 1615 1620
Val Thr Glu Gly Arg Asp Ala Met Ser Ala Phe Pro Thr Asp Arg
1625 1630 1635
Gly Trp Arg Leu Asp Thr Leu Tyr Asp Pro Asp Pro Asp Arg Pro
1640 1645 1650
Gly Thr Ser Tyr Val Arg Glu Gly Gly Phe Ile Tyr Asp Ala Gly
1655 1660 1665
Glu Phe Asp Ala Asp Phe Phe Gly Ile Ser Pro Arg Glu Ala Val
1670 1675 1680
Gly Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Thr Ala Trp Glu
1685 1690 1695
Thr Phe Glu Arg Ala Gly Ile Asp His Gly Ser Leu Arg Gly Ser
1700 1705 1710
Asp Thr Gly Val Tyr Val Gly Ala Thr Ile Phe Asp Tyr Leu Ser
1715 1720 1725
Ile Ile Gly Ile Ser Ser Leu Asp Met Glu Gly Tyr Thr Gly Thr
1730 1735 1740
Gly Asn Leu Gly Cys Val Val Ser Gly Arg Val Ser Tyr Val Leu
1745 1750 1755
Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Gly Cys Ser Ser
1760 1765 1770
Ser Leu Val Ala Leu His Ser Ala Val Arg Gly Leu Arg Gly Gly
1775 1780 1785
Glu Cys Ser Leu Ala Leu Ala Gly Gly Val Thr Val Met Ser Thr
1790 1795 1800
Pro Gly Ala Phe Val Glu Phe Ser Arg Gln Arg Gly Leu Ala Ala
1805 1810 1815
Asp Gly Arg Cys Lys Ala Phe Ala Ala Ala Ala Asp Gly Thr Gly
1820 1825 1830
Trp Gly Glu Gly Val Gly Leu Leu Leu Leu Glu Arg Leu Ser Asp
1835 1840 1845
Ala Arg Arg Asn Gly His Lys Val Trp Ala Val Val Arg Gly Ser
1850 1855 1860
Ala Ile Asn Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn
1865 1870 1875
Gly Pro Ser Gln Gln Arg Val Ile Arg Gln Ala Leu Ala Asn Ala
1880 1885 1890
Arg Leu Ser Pro Ala Asp Val Asp Val Val Glu Ala His Gly Thr
1895 1900 1905
Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala
1910 1915 1920
Thr Tyr Gly Gln Gly Arg Ser Ala Glu Arg Pro Leu Trp Leu Gly
1925 1930 1935
Ser Val Lys Ser Asn Ile Gly His Thr Gln Ala Ala Ala Gly Ala
1940 1945 1950
Ala Gly Val Ile Lys Met Val Met Ala Met Arg His Gly Glu Leu
1955 1960 1965
Pro Ala Ser Leu Phe Ile Asp Ala Pro Thr Pro His Val Asp Trp
1970 1975 1980
Asp Ser Gly Ala Val Ala Leu Leu Ser Glu Ala Thr Asp Trp Pro
1985 1990 1995
Glu Val Asp Arg Pro Trp Arg Ala Gly Val Ser Ala Phe Gly Ile
2000 2005 2010
Ser Gly Thr Asn Ala His Val Ile Leu Glu Glu Ala Pro Glu Asp
2015 2020 2025
Pro Tyr Val Asp Ala Pro Val Asp Cys Gly Pro Ala Pro Val Gly
2030 2035 2040
Gly Val Val Pro Trp Val Val Ser Gly Arg Ser Ala Ala Gly Leu
2045 2050 2055
Arg Ala Gln Ala Gly Arg Leu Ala Glu Phe Val Leu Glu Thr Val
2060 2065 2070
Asp Glu Val Ala Glu Val Gly Trp Ser Leu Val Ala Gly Arg Ala
2075 2080 2085
Val His Asp His Arg Ala Val Val Val Gly Gln Asp Arg Thr Glu
2090 2095 2100
Leu Leu Thr Gly Leu Thr Ala Leu Ala Asp Gly Leu Pro Ser Gly
2105 2110 2115
Gly Val Val Ala Gly Glu Pro Val Val Gly Gly Ala Gly Pro Val
2120 2125 2130
Leu Val Phe Pro Gly Gln Gly Gly Gln Trp Arg Gly Met Gly Val
2135 2140 2145
Glu Leu Leu Asp Ala Ser Pro Val Phe Ala Gly Arg Ile Ala Glu
2150 2155 2160
Cys Glu Val Ala Leu Ala Pro Phe Val Glu Trp Ser Leu Thr Ala
2165 2170 2175
Val Leu Arg Gly Glu Asp Gly Val Asp Val Ser Arg Val Asp Val
2180 2185 2190
Val Gln Pro Ala Leu Trp Ala Val Met Val Ser Leu Ala Glu Val
2195 2200 2205
Trp Arg Ser Tyr Gly Val Glu Pro Ala Ala Val Val Gly His Ser
2210 2215 2220
Gln Gly Glu Ile Ala Ala Ala Val Val Ala Gly Ala Leu Thr Leu
2225 2230 2235
Arg Asp Gly Ala Arg Val Val Ala Leu Arg Ser Arg Ala Leu Arg
2240 2245 2250
Val Leu Ser Gly Arg Gly Ala Met Ala Ser Leu Ala Val Gly Arg
2255 2260 2265
Glu Glu Ala Glu Lys Ala Ile Gly Gly Arg Ala Gly Val Val Val
2270 2275 2280
Ala Ala Val Asn Gly Pro Gly Ser Thr Val Val Ser Gly Pro Pro
2285 2290 2295
Val Ala Val Ala Glu Val Val Ala Ala Val Glu Ala Ala Gly Gly
2300 2305 2310
Arg Ala Arg Leu Val Asp Val Asp Tyr Ala Ser His Asn Pro Gln
2315 2320 2325
Val Asp Asp Ile Ala Asp Glu Leu Ile Glu Val Leu Gly Gln Val
2330 2335 2340
Thr Pro Val Glu Thr Ala Val Ala Phe Tyr Ser Ala Val Thr Gly
2345 2350 2355
Gly Arg Val Glu Ser Thr Ala Leu Asp Ala Ala Tyr Trp Val Glu
2360 2365 2370
Asn Leu Arg Arg Gln Val Arg Phe Ala Thr Ala Val Glu Ala Leu
2375 2380 2385
Leu Ser Asp Gly Tyr Arg Val Leu Val Glu Ser Ser Pro His Pro
2390 2395 2400
Val Leu Ser Val Gly Ile Gln Glu Thr Ala Gln Ala Leu Asp Val
2405 2410 2415
Pro Val Ala Thr Val Ala Thr Leu Gln Arg Asp Gln Gly Gly Pro
2420 2425 2430
Val Gln Leu Ala Arg Ala Leu Ala Gln Ala Phe Thr Ala Gly Leu
2435 2440 2445
Ser Val Asn Trp Lys Ala Trp Tyr Gly Val Asp Thr Thr Pro Asp
2450 2455 2460
Val Pro Thr Gly Val Thr Thr Pro Ala Pro Pro Val Leu Asp Leu
2465 2470 2475
Pro Thr Tyr Ala Phe Gln His Gln His Tyr Trp Ile Asp Ala Ser
2480 2485 2490
Thr Ile Gly Gly Gln Gly Asp Pro His Ser Leu Gly Leu Ala Ser
2495 2500 2505
Ala Asp His Pro Leu Leu Gly Ala Ala Val Arg Val Ala Glu Ser
2510 2515 2520
Glu Glu Val Leu Leu Thr Gly Arg Ile Ser Thr Ala Gly His Ser
2525 2530 2535
Trp Leu Ala Asp His Ala Val Ala Gly Ser Val Phe Leu Pro Gly
2540 2545 2550
Thr Ala Phe Val Asp Leu Ala Val Arg Ala Gly Asp Glu Val Gly
2555 2560 2565
Cys Asp Arg Val Glu Glu Leu Val Leu Glu Thr Pro Leu Val Leu
2570 2575 2580
Pro Glu Asp Gly Ala Leu Gln Leu Gln Leu Val Val Ser Ala Pro
2585 2590 2595
Arg Gln Asp Gly Gly Arg Gly Phe Thr Val His Ala Arg Ala Asp
2600 2605 2610
Gly Pro Asp Asp Ala Asp Arg Pro Trp Thr Arg His Ala Ser Gly
2615 2620 2625
Thr Leu Thr Thr Gly Ala Arg Pro Asp Asp Phe Asp Phe Thr Gln
2630 2635 2640
Trp Pro Pro Ala Gly Ala Glu Pro Leu Ser Leu Asp Gly Leu Tyr
2645 2650 2655
Ala Gly Leu Ala Glu Ala Gly Tyr Gly Tyr Gly Pro Ala Phe Arg
2660 2665 2670
Gly Leu Lys Ala Ala Trp Arg Arg Gly Asp Glu Val Phe Ala Glu
2675 2680 2685
Ala Ala Leu Ala Glu Glu Leu His Gly Gly Ala Gly Arg Phe Gly
2690 2695 2700
Leu His Pro Ala Leu Leu Asp Thr Ala Leu Gln Ala Gly Gly Ile
2705 2710 2715
Gly Gln Gly Gly Pro Pro Asp Gly Gln Met Leu Leu Pro Phe Thr
2720 2725 2730
Trp Asn Gly Val Ser Leu Tyr Ala Thr Gly Ala Ser Ala Leu Arg
2735 2740 2745
Val Arg Met Val Pro Asn Glu Thr Ala Asp Gly Val Ser Leu Ser
2750 2755 2760
Val Ala Asp Pro Ser Gly Arg Gln Val Ala Ser Ala Asp Ala Val
2765 2770 2775
Val Phe Arg Pro Val Ser Leu Glu Gln Leu Ser Arg Gly Gly Ser
2780 2785 2790
Glu Leu Asp Ser Leu Tyr Arg Val Glu Trp Ile Ser Ala Glu Pro
2795 2800 2805
Arg Pro Glu Val Leu Ala Asp Arg Ala Trp Glu Val Val Gly Ala
2810 2815 2820
Asp Gly Leu Gly Ile Thr Asn Ala Leu Glu Arg Ala Gly His Ala
2825 2830 2835
Val Arg Thr Thr Thr Asp Leu Ala Glu Ser Ala Arg Tyr Val Asp
2840 2845 2850
Asp Asp Ala Pro Val Ser Asp Leu Met Val Leu Asp Cys Ala Pro
2855 2860 2865
Leu Ala Gly Gly Ala Thr Ala Leu Asp Pro Gly Leu Ala Ala Ala
2870 2875 2880
Val His Asp Glu Thr Ala Arg Leu Leu Gly Val Val Gln Arg Trp
2885 2890 2895
Leu Ala Asp Gly Arg Phe Ala Asp Ser Arg Leu Val Leu Val Thr
2900 2905 2910
Arg Gly Ala Gln Ser Thr Ser Gly Asp Glu Gly Val Thr Asp Leu
2915 2920 2925
Val His Ser Ala Leu Trp Gly Leu Val Arg Ser Ala Gln Leu Glu
2930 2935 2940
Asn Ile Asp Arg Phe Val Leu Val Asp Leu Asp Gly Ala Asp Pro
2945 2950 2955
Asp Ala Ala Ala Leu Leu Pro Gly Ala Leu Ser Val Ala Leu Gly
2960 2965 2970
Ala Gly Glu Pro Gln Val Ala Val Arg Gln Gly Arg Val Val Val
2975 2980 2985
Pro Arg Leu Ala Arg Ile Gly Gln Asp Thr Ala Pro Thr Pro Pro
2990 2995 3000
Asp Leu Pro Gly Arg Arg Leu Asp Ser Glu Gly Thr Val Leu Ile
3005 3010 3015
Thr Gly Gly Thr Gly Thr Ile Gly Gly Val Val Ala Arg His Leu
3020 3025 3030
Val Thr Thr Arg Gly Ala Arg His Leu Leu Leu Thr Ser Arg Arg
3035 3040 3045
Gly Ala Asp Ala Pro Gly Ala Ala Glu Leu Arg Asp Glu Leu Thr
3050 3055 3060
Ser Leu Gly Ala Glu Val Thr Ile Ala Ala Cys Asp Ala Ala Asp
3065 3070 3075
Arg Asp Ala Leu Ala Glu Leu Leu Ala Asp Ile Pro Ala Ala His
3080 3085 3090
Pro Leu Thr Gly Val Ile His Ala Ala Gly Val Ile Asp Asp Gly
3095 3100 3105
Thr Ile Pro Ser Leu Thr Gly Glu Arg Leu Arg Ala Val Leu Arg
3110 3115 3120
Pro Lys Val Asp Ala Ala Val Asn Leu His Gln Leu Thr Arg Glu
3125 3130 3135
Lys Asp Leu Ala Thr Phe Val Leu Phe Ser Ser Thr Gly Gly Val
3140 3145 3150
Thr Gly Val Gly Gly Gln Ser Asn Tyr Thr Ala Ser Asn Ala Phe
3155 3160 3165
Leu Asp Ala Leu Ala Gln Asp Arg Arg Ala Ala Gly Phe Pro Gly
3170 3175 3180
Gln Ser Leu Ser Trp Gly Tyr Trp Glu Gln Thr Ser Gly Ile Thr
3185 3190 3195
Gly Thr Leu Asp Glu Arg Asp Ile Ala Arg Met Glu Arg Ser Gly
3200 3205 3210
Ile Arg Ala Met Ser Ser Glu Gln Gly Leu Ala Leu Phe Asp Ala
3215 3220 3225
Ala Glu Arg Arg Pro Glu Ser Leu Leu Val Pro Ala Arg Leu Asp
3230 3235 3240
Pro Glu Ala Leu Arg Asp Leu Ala Asp Ala Arg Val Leu Pro Arg
3245 3250 3255
Ile Leu Ser Gly Leu Val Arg Gln Ala Pro Ala Arg Arg Ala Ala
3260 3265 3270
Ala Ala Gly Gln Pro Ala Thr Gly Asp Asp Leu Thr Met Ala Glu
3275 3280 3285
Arg Leu Thr Gly Leu Ser Ala Ala Glu Gln Ser Arg Thr Leu Leu
3290 3295 3300
Glu Leu Val Arg Arg Asn Val Ala Ala Val Leu Gly Leu Gly Asp
3305 3310 3315
Val Leu Ala Val Asp Pro Ser Arg Pro Phe Lys Glu Ile Gly Phe
3320 3325 3330
Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Ser Arg Ala
3335 3340 3345
Thr Asp Thr Thr Leu Pro Ser Thr Leu Val Phe Asp Ile Pro Thr
3350 3355 3360
Pro Ala Leu Leu Ala Glu His Leu Arg Glu Gln Leu Val Ser Glu
3365 3370 3375
Gly Leu Ser Gly Ser Glu Ala Leu Ile Gln Glu Leu Asp Arg Leu
3380 3385 3390
Glu Asp His Val Asp Leu Leu Val Asp Asp Gly Glu Arg Asp Ala
3395 3400 3405
Val Thr Ala Arg Leu Glu Ala Leu Leu Thr Arg Cys Arg Gln Lys
3410 3415 3420
Pro Thr Ala Ala Glu Gly Asn Gly Val Ala Glu Arg Leu Gln Glu
3425 3430 3435
Ala Ser Ala Asp Glu Val Leu Gln Phe Ile Asp Ser His Leu Gly
3440 3445 3450
Arg Ala
3455
<210>29
<211>423
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Leu Asn Arg Gly Val Val Ser Pro Thr Glu Ala Thr Pro Ala Ser Ser
1 5 10 15
Ala Lys Ala Thr Arg Pro Pro Asp Phe Met Asp Pro Ser Phe Trp Leu
20 25 30
Arg Pro Arg Asp Glu Arg Ala Glu Val Phe Glu Lys Leu Arg Ala Leu
35 40 45
Pro Gly Pro Glu Phe Val Pro Pro Arg Leu Pro Trp Gly Pro Leu Ala
50 55 60
Ser Gly Tyr Tyr Ala Leu Ser Lys His Ala Asp Ile Cys Glu Val Ser
65 70 75 80
Arg Arg Pro Gln Asp Phe Ser Ser Glu Gly Ala Thr Ala Ile Leu Pro
85 90 95
Pro Glu Met Asp Glu Phe Tyr Gly Ser Met Ile Asn Met Asp Asn Pro
100 105 110
Glu His Ser Arg Leu Arg Arg Ile Val Ala Arg Ser Phe Gly Arg Gly
115 120 125
Met Ala Pro Lys Phe Asp Ala Met Ser Arg Arg Val Ala Arg Arg Ile
130 135 140
Val Asp Glu Leu Ile Glu Arg Gly Pro Gly Asp Phe Ile Arg Pro Ala
145 150 155 160
Ala Glu Met Pro Ile Ala Val Leu Ser Thr Met Met Gly Ile Pro Gly
165 170 175
Glu Asp Tyr Glu Phe Leu Phe Glu Arg Thr Asn Thr Ile Met Gly Gly
180 185 190
Ala Asp Pro Glu Leu Ala Ala Asp Pro Glu Lys Met Ala Ala Ala Val
195 200 205
Leu Gly Ala Leu Arg Asp Leu Gly Asp Tyr Ile Gly Arg Leu Arg Glu
210 215 220
Asp Arg Leu Ala Arg Pro Gly Pro Asp Val Ile Thr Lys Leu Val Gln
225 230 235 240
Val Gln Glu Asp Gly Glu Gln Leu Thr Asn Gln Glu Leu Val Ser Phe
245 250 255
Phe Ile Leu Leu Ile Asn Ala Gly Met Glu Thr Thr Arg Asn Val Ile
260 265 270
Ala Gln Ala Leu Val Leu Leu Thr Glu His Pro Asp Gln Arg Gln Leu
275 280 285
Leu Leu Ser Asp Phe Glu Leu His Ala Lys Gly Ala Val Glu Glu Ile
290 295 300
Leu Arg Val Gly Thr Pro Ile Asn Trp Met Arg Arg Thr Ala Thr Gly
305 310 315 320
Asp Cys Glu Met Asn Gly His Arg Phe Arg Lys Gly Asp Glu Ile Phe
325 330 335
Leu Phe Tyr Trp Ser Ala Asn His Asp Glu Lys Val Phe Glu Asp Ala
340 345 350
Tyr Arg Phe Asp Ile Thr Arg Asp Pro Asn Pro His Leu Ser Phe Gly
355 360 365
Ala Val Gly Pro His Phe Cys Leu Gly Ala His Leu Ala Arg Ile Glu
370 375 380
Ile Ile Ala Met Leu Arg Glu Leu Leu Ala Ser Leu Pro Asp Ile Arg
385 390 395 400
Val Glu Gly Glu Pro Val Arg Leu Ala Ser Ser Phe Ile Glu Gly Phe
405 410 415
Lys Glu Leu Ser Cys Thr Phe
420
<210>30
<211>335
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Val Thr Ile Arg Asp Val Ala Lys Ala Ser Gly Val Ser Pro Ser Thr
1 5 10 15
Val Ser Arg Ala Leu Ala Pro Gly Gly Ala Val Ser Pro Val Thr Arg
20 25 30
Glu Arg Val Arg Ala Ala Ala Asp Arg Leu Gly Tyr Gln Pro Asn Gln
35 40 45
Ala Ala Arg Gly Leu Ile Thr Gly Arg Thr Gly His Leu Gly Val Ile
50 55 60
Val Pro Asp Leu Leu Asn Pro Phe Phe Ala Asp Ile Cys Lys Gly Val
65 70 75 80
Gln Ala Arg Ala Arg Gly Leu Gly Leu Thr Val Phe Val Ser Asp Thr
85 90 95
Glu Arg Asp Glu Gly Leu Glu Leu Asp Ala Ile Arg Thr Leu Ala Pro
100 105 110
Gln Val Asp Gly Ile Val Leu Cys Ser Pro His Leu Ser Gly Glu Glu
115 120 125
Leu Gly Ser Leu Gly Asp Phe Thr Asp Lys Pro Ile Val Leu Leu His
130 135 140
Arg Lys Glu Pro Gly Phe Gly Ser Val Thr Ala Asp Leu Val Glu Gly
145 150 155 160
Met Thr Asp Ala Leu Thr His Leu His Ala Leu Gly His Arg Arg Ile
165 170 175
Ala Tyr Val Gly Gly Pro Arg Ser Ser Trp Ala Ala Arg Glu Arg Ala
180 185 190
Ala Gly Val Glu Ala Val Ala Ala Ser Gly Leu Val Glu Ile Val Gln
195 200 205
Val Gly Ser Val Ala Pro His Phe Asp Gly Gly Val Thr Gly Ala Ala
210 215 220
Asp Val Val Leu Ala Ser Gly Ala Ser Ala Val Leu Ala Phe Asp Asp
225 230 235 240
Ile Val Ala Phe Gly Leu Ile Ser Arg Phe Thr Val Arg Gly Val Arg
245 250 255
Val Pro Glu Glu Met Ser Val Val Gly Cys Asp Asp Ile Ala Leu Ser
260 265 270
Gly Met Ala Ala Pro Pro Leu Thr Thr Val Ser Val Pro Lys Ala His
275 280 285
Gly Ala Arg Ala Ala Val Asp Leu Leu Cys Arg Ile Leu Ala Thr Pro
290 295 300
Ala Ala Glu Gly Glu Gln Pro Pro Gln Arg Val Leu Pro Thr His Leu
305 310 315 320
Val Val Arg Gly Ser Thr Ala Ala Leu Asp Arg Arg Gln Arg Ala
325 330 335
<210>31
<211>313
<212>PRT
<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)
<400>1
Met Asp Pro Leu Ala Gly Leu Leu Asp Gly Pro Arg Ala Arg Gly Ala
1 5 10 15
Phe Leu Leu Arg Met Val Met Asp Pro Pro Trp Ser Val Arg Ile Glu
20 25 30
Asp Arg Ala Pro Leu Cys Val Met Thr Val Ala Arg Gly Glu Ala Trp
35 40 45
Val Val Pro Asp Arg Gly Glu Ala Arg Arg Leu Gly Pro Gly Asp Val
50 55 60
Ala Val Val Arg Gly Pro Asp Pro Tyr Thr Val Ala Gly Asp Pro Ala
65 70 75 80
Thr Glu Pro Gln Ala Trp Ile Leu Pro Gly Glu Val Cys Arg Thr Ala
85 90 95
Gly Gly Glu Asp Leu Ala Glu Arg Met Ala Leu Gly Val Arg Thr Trp
100 105 110
Gly Asn Ser Glu His Gly Ala Thr Thr Met Leu Val Gly Thr Tyr Arg
115 120 125
Met Asp Gly Glu Ile Ser Arg Arg Leu Leu Asp Ala Leu Pro Pro Leu
130 135 140
Leu Val Leu Gly Arg Glu Arg Cys Asp Ser Pro Leu Leu Pro Trp Leu
145 150 155 160
Gly Glu Glu Ile Val Lys Glu Glu Ala Gly Gln Thr Ala Val Leu Asp
165 170 175
Arg Leu Leu Asp Leu Leu Leu Ile Ser Val Leu Arg Ala Trp Phe Ala
180 185 190
Arg Pro Glu Ala Arg Ala Pro Ala Trp Tyr Arg Ala Leu Gly Asp Pro
195 200 205
Val Val Gly Arg Ala Leu Arg Leu Leu Gln Asn Asn Pro Gly His Pro
210 215 220
Trp Thr Val Ala Leu Leu Ala Ala Glu Thr Gly Ile Ser Arg Ala Val
225 230 235 240
Leu Ala Arg Arg Phe Thr Glu Leu Val Gly Glu Pro Pro Met Ala Tyr
245 250 255
Leu Thr Gly Trp Arg Leu Asp Leu Ala Ala Asp Leu Leu Arg Glu Pro
260 265 270
Asp Ala Thr Leu Gly Ala Val Ala Arg Arg Val Gly Tyr Gly Ser Ser
275 280 285
Phe Ala Leu Ser Ala Ala Phe Lys Arg Val Arg Gly Val Ser Pro Arg
290 295 300
Glu His Arg Ser Ala Ala Ser Ala Gly
305 310
Claims (2)
1、一种南昌霉素生物合成基因簇,其特征在于编码南昌霉素生物合成所涉及的30个基因,具体为:
(1)聚酮合酶基因,即nanA1,nanA2,nanA3,nanA4,nanA5,nanA6,nanA7,nanA8,nanA9,nanA10,nanA11共11个基因:
nanA1位于基因簇核苷酸序列第8919-17627个碱基处,长度为8709个碱基对,编码2902个氨基酸,合成聚酮合酶;
nanA2位于基因簇核苷酸序列第17642-24313个碱基处,长度为6672个碱基对,编码4032个氨基酸,合成聚酮合酶;
nanA3位于基因簇核苷酸序列第24310-36408个碱基处,长度为12099个碱基对,编码2902个氨基酸,合成聚酮合酶;
nanA4位于基因簇核苷酸序列第36429-48299个碱基处,长度为11871个碱基对,编码3956个氨基酸,合成聚酮合酶;
nanA5位于基因簇核苷酸序列第48345-60284个碱基处,长度为11940个碱基对,编码3979个氨基酸,合成聚酮合酶;
nanA6位于基因簇核苷酸序列第60305-65302个碱基处,长度为4998个碱基对,编码1665个氨基酸,合成聚酮合酶;
nanA7位于基因簇核苷酸序列第76990-72050个碱基处,长度为4941个碱基对,编码1646个氨基酸,合成聚酮合酶;
nanA8位于基因簇核苷酸序列第100138-89771个碱基处,长度为10368个碱基对,编码3455个氨基酸,合成聚酮合酶;
nanA9位于基因簇核苷酸序列第83192-80784个碱基处,长度为2409个碱基对,编码802个氨基酸,合成聚酮合酶;
nanA10位于基因簇核苷酸序列第78263-77949个碱基处,长度为315个碱基对,编码104个氨基酸,合成聚酮合酶;
nanA11位于基因簇核苷酸序列第89759-83196个碱基处,长度为6564个碱基对,编码2187个氨基酸,合成聚酮合酶;
(2)南昌霉素的修饰基因,即nanE,nanI,nanO,nanP共4个基因:
nanE位于基因簇核苷酸序列第77864-76992个碱基处,长度为873个碱基对,编码290个氨基酸,合成环氧化物水解酶;
nanI位于基因簇核苷酸序列第80769-79828个碱基处,长度为942个碱基对,编码313个氨基酸;合成酮甾异构酶;
nanO位于基因簇核苷酸序列第79763-7327个碱基处,长度为1437个碱基对,编码478个氨基酸;合成环氧化物酶;
nanP位于基因簇核苷酸序列第101467-100196个碱基处,长度为1272个碱基对,编码423个氨基酸;合成细胞色素P450;
(3)南昌霉素脱氧糖的生物合成基因,即nanG1,nanG2,nanG3,nanG4,nanG5,nanM共6个基因:
nanG1位于基因簇核苷酸序列第72045-71137个碱基处,长度为909个碱基对,编码302个氨基酸,合成葡萄糖-1-磷酸:TTP胸苷基转移酶;
nanG2位于基因簇核苷酸序列第71140-70145个碱基处,长度为996个碱基对,编码331个氨基酸,合成dTDP-D-葡萄糖-4,6-脱水酶;
nanG3位于基因簇核苷酸序列第68860-70164个碱基处,长度为1305个碱基对,编码434个氨基酸,合成NDP-D-葡萄糖-3,4-脱水酶;
nanG4位于基因簇核苷酸序列第67823-68863个碱基处,长度为1041个碱基对,编码346个氨基酸,合成NDP-D-葡萄糖-4,6-脱水酶,NDP-D-葡萄糖-4-异构酶,NDP-D-葡萄糖-4-还原酶;
nanG5位于基因簇核苷酸序列第66747-65365个碱基处,长度为1383个碱基对,编码460个氨基酸,合成糖基转移酶;
nanM位于基因簇核苷酸序列第66881-67798个碱基处,长度为918个碱基对,编码305个氨基酸,合成甲基转移酶;
(4)南昌霉素的调节基因,即nanR1,nanR2,nanR3,nanR4,nanT1,nanT2,nanT3,nanT4,nanT5共9个基因:
nanR1位于基因簇核苷酸序列第7493-6768个碱基处,长度为726个碱基对,编码241个氨基酸,合成调节蛋白;
nanR2位于基因簇核苷酸序列第8334-7573个碱基处,长度为762个碱基对,编码253个氨基酸,合成调节蛋白;
nanR3位于基因簇核苷酸序列第101593-102600个碱基处,长度为1008个碱基对,编码335个氨基酸,合成转录调节因子;
nanR4位于基因簇核苷酸序列第102600-103541个碱基处,长度为942个碱基对,编码313个氨基酸,合成转录调节因子;
nanT1位于基因簇核苷酸序列第2052-682个碱基处,长度为1731个碱基对,编码456个氨基酸,合成膜整合型转移蛋白;
nanT2位于基因簇核苷酸序列第2064-2879个碱基处,长度为816个碱基对,编码271个氨基酸,合成ABC转移因子;
nanT3位于基因簇核苷酸序列第3117-3818个碱基处,长度为702个碱基对,编码233个氨基酸,合成双组分反馈调节因子;
nanT4位于基因簇核苷酸序列第4923-3724个碱基处,长度为1200个碱基对,编码399个氨基酸,合成化学受体蛋白;
nanT5位于基因簇核苷酸序列第6106-4916个碱基处,长度为1191个碱基对,编码396个氨基酸,合成双组分传感器组氨酸激酶;
2、根据权利要求1所述的南昌霉素生物合成基因簇,其特征是,聚酮合酶基因所包含的模块或结构域为:即酮基合成酶结构域KS、酰基转移酶结构域AT、酮基还原酶结构域KR、脱水酶结构域DH、烯酰基还原酶结构域ER、酰基载体蛋白结构域ACP、链释放结构域CR。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB031149200A CN1190444C (zh) | 2003-01-16 | 2003-01-16 | 南昌霉素生物合成基因簇 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB031149200A CN1190444C (zh) | 2003-01-16 | 2003-01-16 | 南昌霉素生物合成基因簇 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1431220A CN1431220A (zh) | 2003-07-23 |
CN1190444C true CN1190444C (zh) | 2005-02-23 |
Family
ID=4790524
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB031149200A Expired - Fee Related CN1190444C (zh) | 2003-01-16 | 2003-01-16 | 南昌霉素生物合成基因簇 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1190444C (zh) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN100465277C (zh) * | 2005-07-01 | 2009-03-04 | 中国科学院上海有机化学研究所 | 氯丝菌素的生物合成基因簇及其应用 |
CN101812472B (zh) * | 2009-08-13 | 2011-07-20 | 上海交通大学 | 米多霉素生物合成基因簇 |
-
2003
- 2003-01-16 CN CNB031149200A patent/CN1190444C/zh not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CN1431220A (zh) | 2003-07-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DK2271666T3 (da) | Nrps-pks-gengruppe og dens manipulation og anvendelighed | |
CN1277843C (zh) | 分枝杆菌比较基因组学作为鉴定分枝杆菌病的诊断、预防或治疗靶的工具 | |
CN1977046A (zh) | 编码参与普拉地内酯生物合成的多肽的dna | |
CN1730657A (zh) | 氯丝菌素的生物合成基因簇及其应用 | |
CN1732264A (zh) | 产生疏螺旋体素的聚酮化合物合酶及其用途 | |
CN107868789B (zh) | 可利霉素生物合成基因簇 | |
CN1676607A (zh) | 为抗生素的生物合成从蓝灰链霉菌非产蓝亚种中克隆基因及其使用方法 | |
CN107794286B (zh) | 一种环脂肽类化合物生物合成基因簇及其激活方法与应用 | |
CN101691575B (zh) | 一种萨菲菌素的生物合成基因簇 | |
CN101818158B (zh) | Fr901464的生物合成基因簇 | |
CN111378008B (zh) | 脂肽类化合物Totopotensamides及其制备方法和应用 | |
CN111607603B (zh) | Hangtaimycin生物合成基因簇及其应用 | |
CN1190444C (zh) | 南昌霉素生物合成基因簇 | |
CN110857447B (zh) | 提高米尔贝霉素a3/a4或其衍生物产量的方法 | |
CN101063140A (zh) | 万古霉素生物合成基因簇 | |
KR101189475B1 (ko) | 삼원환 화합물의 생합성을 담당하는 유전자와 단백질 | |
CN114517175B (zh) | 基因工程菌及其应用 | |
CN106676115B (zh) | 2’-氯代喷司他丁和2’-氨基-2’-脱氧腺苷生物合成基因簇及其应用 | |
US20030175888A1 (en) | Discrete acyltransferases associated with type I polyketide synthases and methods of use | |
CN1257282C (zh) | 南寡霉素生物合成基因簇 | |
US20030171562A1 (en) | Genes and proteins for the biosynthesis of polyketides | |
CN1667123A (zh) | 负责fr-008聚酮抗生素生物合成的基因簇 | |
CN1507493A (zh) | 用于生产丁烯基-多杀菌素杀虫剂的生物合成基因 | |
CN112442507B (zh) | 马度米星化合物的生物合成基因簇及其应用 | |
US20030113874A1 (en) | Genes and proteins for the biosynthesis of rosaramicin |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C19 | Lapse of patent right due to non-payment of the annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |