CN1950395A - 杀虫毒素复合体融合蛋白 - Google Patents
杀虫毒素复合体融合蛋白 Download PDFInfo
- Publication number
- CN1950395A CN1950395A CN 200580014048 CN200580014048A CN1950395A CN 1950395 A CN1950395 A CN 1950395A CN 200580014048 CN200580014048 CN 200580014048 CN 200580014048 A CN200580014048 A CN 200580014048A CN 1950395 A CN1950395 A CN 1950395A
- Authority
- CN
- China
- Prior art keywords
- fusion
- seq
- category
- polypeptide
- protein
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明涉及杀虫毒素复合体(“TC”)融合蛋白及这些融合蛋白的编码多核苷酸。本发明也包括本发明的TC融合蛋白的编码多核苷酸及含有所述多核苷酸的载体。在一些实施方案中,本发明提供了包含融合在一起形成单个蛋白质的A类蛋白、B类蛋白和C类TC蛋白的融合蛋白。在另一些实施方案中,本发明提供了融合在一起的B类和C类TC蛋白的融合蛋白。在后面的实施方案中,该BC或者CB融合蛋白可用于增强或者加强“毒素A”或者A类蛋白的抗虫活性。迄今为止,没有期望此类融合在一起,会正确地发挥作用且保持其活性的融合蛋白。本发明有利地减少了转化植物需要的基因数目。所以,本发明也包括含有所述多核苷酸的植物、细胞(例如,细菌和植物细胞)和种子。所述植物产生本发明的融合蛋白,其赋予所述植物昆虫抗性。本发明也包括用本发明的融合蛋白控制害虫(优选是昆虫或者其他植物害虫)的方法。
Description
背景技术
每年控制害虫方面的花费为几十亿美元,另外还要因为这些害虫造成的破坏再损失几十亿美元。合成的有机化学杀虫剂一直是用于控制害虫的主要工具,而生物杀虫剂如来自苏云金芽孢杆菌(Bacillus thuringiensis(B.t.))的杀虫蛋白在一些领域发挥着重要的作用。通过转化B.t.杀虫蛋白基因来产生昆虫抗性植物的能力已经彻底改变了现代农业,凸现了杀虫蛋白及其基因的重要性和价值。
将两种不同的B.t.基因“排列在一起(stacked)”,以便一株植物产生两种不同类型的B.t.蛋白。已经完成了上述过程,以便增加植物的昆虫抗性谱,同时阻止对单一类型B.t.蛋白有抗性的昆虫的发展。与表达单个基因相比,表达多个基因相对更加复杂。在包括转基因植物的转基因真核生物的世代中普遍的是,单个蛋白质的编码区域以单个基因装配及导入,每个基因有一个单独一套的启动子和转录终止区。
已发现主要存在于光杆状菌属(Photorhabdus)和致病杆菌属(Xenorhabdus)的细菌中,以及于其他细菌物种如沙雷氏菌属(Serratia)、假单胞菌属(Pseudomonas)和类芽胞杆菌属(Paenibacillus)中的毒素复合体(TC)蛋白和基因是重要的、相对新的杀虫蛋白和基因的来源。至少有三种不同类型的TC蛋白。天然的A类TC蛋白大小为大约280kDa,且具有杀虫活性。B类TC蛋白(大约170kDa)和C类TC蛋白(大约112kDa)组合起来增强了A类TC蛋白的杀虫潜力,但在缺乏A类TC蛋白的情况下具有很小甚至没有杀虫活性。也就是说,B类和C类TC蛋白组合起来加强了A类TC蛋白的杀虫活性。关于本领域更详细的综述,参见例如US-2004-0208907和WO 2004/067727。单独A类TC蛋白具有杀虫活性,但该活性相对较低。当A类TC蛋白与B类和C类TC蛋白组合时,形成比单独的A类TC蛋白更有效的复合体。
尚未了解TC蛋白杀虫作用的确切机理。可能在杀害昆虫过程中蛋白质彼此相互作用和/或彼此装配。
发明概述
本发明涉及杀虫毒素复合体(“TC”)融合蛋白及编码这些融合蛋白的多核苷酸。在一些实施方案中,本发明提供了包括(以任意次序)融合在一起形成单个蛋白质的A类蛋白、B类蛋白和C类TC蛋白的融合蛋白。在其他一些实施方案中,本发明提供了融合在一起的B类TC蛋白和C类TC蛋白的融合蛋白。在后面的实施方案中,BC或者CB融合蛋白可用于增强或者加强“毒素A”或者A类蛋白的抗虫活性。
本发明部分地涉及本发明的融合蛋白具有与非融合蛋白相比相同水平的活性这一令人惊讶的发现。在一些情况下,本发明的融合蛋白甚至可以具有比单一(非融合)组分更好的活性。活性被保持在与非融合蛋白相同水平的发现十分令人惊讶。迄今为止,没有关于融合TC蛋白将会正确地发挥作用且保持其活性的预期(当融合在一起时)。这部分由于缺乏当这些蛋白质处于融合状态时,能否彼此正确地相互作用的有关知识。也没有先有动机去制备此类构建体和蛋白质。
本发明包括本发明的融合蛋白、编码该融合蛋白的多核苷酸及包括所述多核苷酸的载体。本发明也包括包括所述多核苷酸的植物、细胞(例如细菌和植物细胞)及种子。所述植物能产生本发明的融合蛋白,这赋予所述植物昆虫抗性。本发明包括将A类、B类和C类TC蛋白作为单一融合蛋白表达的转基因植物。本发明也包括将A类、B类和C类TC蛋白作为单一融合蛋白表达的转基因植物。本发明提供了通过在植物中表达有效量的本发明的融合蛋白使植物抗昆虫的方法。
本发明包括用本发明的融合蛋白抑制/控制害虫(优选是昆虫和其他植物害虫)的方法。本发明的方法包括保护植物免受昆虫破坏的方法,其中所述方法包括在植物中表达有效量的至少一种类型的本发明的融合蛋白,其中该融合蛋白作为单一融合蛋白产生。因此,本发明包括对保护植物免受昆虫破坏的方法的改进,其中所述方法包括在植物中表达有效量的毒素复合体(TC)A类、B类和C类TC蛋白,其中所述改进包括将至少两种所述蛋白质表达为单一融合蛋白。本发明的方法也包括保护植物免受昆虫破坏的方法,其中所述方法包括在植物中表达有效量的三种类型的TC蛋白,其中至少两种TC蛋白是从单一转录物翻译的。本发明的其他方法也包括喷射方法及其类似方法,这些方法在本领域是熟知的。在后面的情况下,本发明的改进包括提供本发明的融合蛋白为昆虫摄取,其中所述蛋白质应用于植物(或者植物附近)。
正如本文所讨论的,也正如受益于本公开后,对本领域的技术人员来说将是显而易见的,本发明提供了许多优点。例如,上面提及的方法提供了降低产生本发明的转基因、昆虫抗性植物所需要的“事件”的数量的优点。这些方法也为相互作用的蛋白质提供了时间和空间上的同步翻译,这对真核细胞而言尤其可贵。
附图简述
图1显示了TcdB2+TccC3或者8920融合蛋白与固定化XptA2的结合作用的表面等离子共振感应图。
序列简述
SEQ ID NO:1显示指定为pDAB8920的质粒中融合编码区盒的DNA序列。TcdB2、接头肽和TccC3的编码区分别由SEQ ID NO:1的核苷酸48-4469、4470-4511和4512-7394表示。
SEQ ID NO:2显示SEQ ID NO:1中的融合基因编码的“8920”多肽。TcdB2、接头肽和TccC3的氨基酸序列分别由SEQ ID NO:2的氨基酸1-1474、1475-1488和1489-2448表示。
SEQ ID NO:3显示tcdB2和tccC3编码区之间的连接寡核苷酸。
SEQ ID NO:4显示SEQ ID NO:3编码的多肽接头片段。
SEQ ID NO:5是B类TC蛋白TcdB1的氨基酸序列。
SEQ ID NO:6是B类TC蛋白TcdB2的氨基酸序列。
SEQ ID NO:7是B类TC蛋白TcaC的氨基酸序列。
SEQ ID NO:8是B类TC蛋白XptC1wi的氨基酸序列。
SEQ ID NO:9是B类TC蛋白XptB1xb的氨基酸序列。
SEQ ID NO:10是B类TC蛋白PptB11529的氨基酸序列。
SEQ ID NO:11是B类TC蛋白SepB的氨基酸序列。
SEQ ID NO:12是C类TC蛋白TccC1的氨基酸序列。
SEQ ID NO:13是C类TC蛋白TccC2的氨基酸序列。
SEQ ID NO:14是C类TC蛋白TccC3的氨基酸序列。
SEQ ID NO:15是C类TC蛋白TccC4的氨基酸序列。
SEQ ID NO:16是C类TC蛋白TccC5的氨基酸序列。
SEQ ID NO:17是C类TC蛋白XptB1wi的氨基酸序列。
SEQ ID NO:18是C类TC蛋白XptC1xb的氨基酸序列。
SEQ ID NO:19是类芽胞杆菌属ORF6(长)编码的C类TC蛋白PptC11529的备选(长)氨基酸序列。
SEQ ID NO:20是类芽胞杆菌属ORF6(短)编码的C类TC蛋白PptC11529的备选(短)氨基酸序列。
SEQ ID NO:21是C类TC蛋白SepC的氨基酸序列。
SEQ ID NO:22是A类TC蛋白XptA1wi的氨基酸序列。
SEQ ID NO:23是A类TC蛋白XptA2wi的氨基酸序列。
SEQ ID NO:24是A类TC蛋白TcbA的氨基酸序列。
SEQ ID NO:25是A类TC蛋白TcdA的氨基酸序列。
SEQ ID NO:26是A类TC蛋白TcdA2的氨基酸序列。
SEQ ID NO:27是A类TC蛋白TcdA4的氨基酸序列。
SEQ ID NO:28是编码B类TC蛋白TcdB1的天然核酸序列。
SEQ ID NO:29是编码B类TC蛋白TcdB2的天然核酸序列。
SEQ ID NO:30是编码B类TC蛋白TcaC的天然核酸序列。
SEQ ID NO:31是编码B类TC蛋白XptC1wi的天然核酸序列。
SEQ ID NO:32是编码B类TC蛋白XptB1xb的天然核酸序列。
SEQ ID NO:33是编码B类TC蛋白PptB11529的天然核酸序列。
SEQ ID NO:34是编码C类TC蛋白TccC1的天然核酸序列。
SEQ ID NO:35是编码C类TC蛋白TccC2的天然核酸序列。
SEQ ID NO:36是编码C类TC蛋白TccC3的天然核酸序列。
SEQ ID NO:37是编码C类TC蛋白TccC4的天然核酸序列。
SEQ ID NO:38是编码C类TC蛋白TccC5的天然核酸序列。
SEQ ID NO:39是编码C类TC蛋白XptB1wi的天然核酸序列。
SEQ ID NO:40是编码C类TC蛋白XptC1xb的天然核酸序列。
SEQ ID NO:41是编码C类TC蛋白PptC11529ORF6(长)的天然核酸序列。
SEQ ID NO:42是编码C类TC蛋白PptC11529ORF6(短)的天然核酸序列。
SEQ ID NO:43是优化在植物中表达的编码B类TC蛋白TcdB2的核酸序列。
SEQ ID NO:44是优化在植物中表达的编码C类TC蛋白TccC3的核酸序列。
SEQ ID NO:45是编码TcdB2/TccC3融合蛋白8563(也称为“8563”)的核酸序列。
SEQ ID NO:46是TcdB2/TccC3融合蛋白8563(也称为“8563”)的氨基酸序列。
SEQ ID NO:47是编码TcdB2/TccC3融合蛋白8564的核酸序列。
SEQ ID NO:48是TcdB2/TccC3融合蛋白8564的氨基酸序列。
SEQ ID NO:49是编码TcdB2/TccC3融合蛋白8940的核酸序列。
SEQ ID NO:50是TcdB2/TccC3融合蛋白8940的氨基酸序列。
SEQ ID NO:51是编码TcdB2/TccC3融合蛋白8920的核酸序列。
SEQ ID NO:52是TcdB2/TccC3融合蛋白8920的氨基酸序列。
SEQ ID NO:53是编码TcdB2/TccC3融合蛋白8921的核酸序列。
SEQ ID NO:54是TcdB2/TccC3融合蛋白8921的氨基酸序列。
SEQ ID NO:55是编码TcdB2/TccC3融合蛋白8923的核酸序列。
SEQ ID NO:56是TcdB2/TccC3融合蛋白8923的氨基酸序列。
SEQ ID NO:57是编码TcdB2/TccC3融合蛋白8951的核酸序列。
SEQ ID NO:58是TcdB2/TccC3融合蛋白8951的氨基酸序列。
SEQ ID NO:59是编码TcdB2/TccC3融合蛋白8811的核酸序列。
SEQ ID NO:60是TcdB2/TccC3融合蛋白8811的氨基酸序列。
SEQ ID NO:61是编码A类TC蛋白XptA1wi的天然核酸序列。
SEQ ID NO:62是编码A类TC蛋白XptA2wi的天然核酸序列。
SEQ ID NO:63是编码A类TC蛋白TcbA的天然核酸序列。
SEQ ID NO:64是编码A类TC蛋白TcdA的天然核酸序列。
SEQ ID NO:65是编码A类TC蛋白TcdA2的天然核酸序列。
SEQ ID NO:66是编码A类TC蛋白TcdA4的天然核酸序列。
SEQ ID NO:67是8836“BCA”三联融合多核苷酸序列。
SEQ ID NO:68是SEQ ID NO:67编码的8836“BCA”三联融合蛋白的氨基酸序列。
发明详述
本发明部分地涉及当毒素复合体(“TC”)蛋白融合(或者连接)在一起时保持其杀虫活性这一令人惊讶的发现。迄今为止,没有关于当此类融合蛋白融合在一起时可正确发挥作用且保持其活性的预期。正如本文所述,存在A类、B类和C类毒素复合体或者“TC”蛋白质。这些单个蛋白质也被称为本发明的融合蛋白的多肽组分。
因此,本发明包括杀虫TC融合蛋白和编码这些融合蛋白的多核苷酸。本发明在一些实施方案中提供了包括(以任意次序)融合或者连接在一起形成单一融合蛋白的A类、B类和C类TC蛋白(或者多肽)的融合蛋白。本发明在其他一些实施方案中提供了包括融合或者连接在一起的B类和C类TC蛋白的融合蛋白。在后面的实施方案中,BC或者CB融合蛋白可用于增强或者加强”毒素A”蛋白的抗虫活性。
本文所用的术语个体A类、B类和C类TC蛋白在本领域是已知的。此类蛋白质包括处于单独状态的(stand-alone)毒素(A类TC蛋白)和增效剂(B类和C类TC蛋白)。已知产生TC蛋白的细菌包括下述属中的细菌:光杆状菌属、致病杆菌属、类芽胞杆菌属、沙雷氏菌属和假单胞菌属。例如参见丁香假单胞菌(Pseudomonas syringae pv.Syringae)B728a(GenBank登录号gi:23470933和gi:23472543)。任何此类蛋白质都可以用作本发明的多肽组分。
正如上面在技术背景部分所讨论的,尽管单独的”毒素A”蛋白有一定的杀虫活性,但“A+B+C”复合体的高杀虫潜力对于TC蛋白的商业应用是更优选的。然而TC蛋白的作用机理仍然是未知的。同样地,A、B和C组分中的每一组分彼此如何(和是否)相互作用也是未知的。因此,无法预测本发明的融合蛋白是否允许这三个组分(在昆虫内脏中)正确地发挥作用。因此,令人惊讶地是融合的TC蛋白对控制昆虫是高度有效的。没有关于本发明的融合蛋白被靶昆虫摄取后将有活性(即毒性)的预期。本文表明本发明的融合蛋白在昆虫内脏中令人惊讶的发挥相当好的作用。
已经表明当A-、B-和C型TC多肽以本发明的融合蛋白产生时,仍然能物理上相互作用,产生活性ABC复合体,本发明的融合基因(其编码本发明的TC融合蛋白)能用于应对至少三种基因同步表达的技术挑战。以前没有尝试将A、B或者C组分中的任何组分融合在一起以应对这些挑战的启示。本发明的公开表明,现在可以通过实践本发明应对和缓解至少三种基因同步表达的技术挑战。这些技术挑战在真核生物如植物中更艰巨。在原核细胞中,相互作用的蛋白质的编码区一般按顺序排列,且转录成单个mRNA。这些编码区的连续翻译导致分开的蛋白质彼此在时间和物理上很邻近地合成,从而确保伙伴蛋白能有效地装配成复合体。真核细胞比原核细胞更大,结构上更复杂。真核细胞的基因组包含在细胞核中,mRNA必须被转运出细胞核,到达蛋白质合成发生的细胞质。在真核细胞中,相互作用的蛋白质通常由分离的基因和编码区编码,这可以导致mRNA和被编码蛋白质的非同步的生物合成。伙伴蛋白的装配因此受到时间和空间分离的影响;分离的蛋白质必须穿过其他蛋白质的环境并逃过胞内蛋白酶的降解,才能发现彼此。
尽管每一种被导入基因,在所得到的转基因生物中可能必然提供期望表型(例如,在转基因植物中一种基因可能赋予昆虫抗性和另一种基因可能赋予除草剂耐性),但通常不要求转基因表达的蛋白质质相互作用来产生期望表型。此类相互作用对工程改造是困难的。例如,由于非连接整合位点、构建体重排或者缺失及不同基因的非相容表达模式,转化引入多个基因会导致不期望的结果。
因此,本发明是在植物中表达三种相互作用的蛋白质的不可预期的方案。迄今为止还没有将融合TC蛋白用作三基因植物表达问题的解决方案的启示。
在了解了本发明公开内容的益处后,本发明提供的许多优点对本领域的技术人员而言是显而易见的。首先,本发明的多种融合蛋白在杀死或者抑制昆虫方面作用效果很好,或者甚至比单个对应物更好。已经表明本发明的融合蛋白将有效地杀死或者抑制靶昆虫,有很多与本发明使用融合TC基因的新颖方法相关的优点。本发明的一些其他优点如下。
本发明允许降低产生昆虫抗性转基因植物需要的独立表达的基因的数量(在优选实施方案中从三种TC基因降低为一种或者两种融合基因)。植物转化构建体大小的相应降低可增加转化频率和转基因的总回收率。因为仅仅需要一种或者两种独立基因,所以可增加转基因植物的稳定性。因为植物中活性复合体形成效率的增加,所以可提高回收活性转基因植物的概率。
本发明包括含有本发明的多核苷酸的载体。本发明也包括包含所述多核苷酸的植物、细胞(例如细菌和植物细胞)及种子。所述植物可以产生赋予所述植物昆虫抗性的本发明的融合蛋白。本发明也包括用本发明的融合蛋白控制害虫(优选是昆虫和其他植物害虫)的方法。
融合基因和蛋白质术语。在本申请中下述符号用于融合基因和融合蛋白。操纵子中存在的融合基因间的连锁,也就是转录连锁但编码分离的、不同的蛋白质的基因间的连锁,用连字号表示,例如tcdB2-tccC3。已经通过接头的方式连接在一起,从而将两种或者多种编码区融合为单个开放阅读框的基因间的连锁用斜线记号表示,例如tcdB2/tccC3。这样的融合基因编码的单个融合蛋白的连锁也用斜线记号表示,例如TcdB2/tccC3。使用不同接头融合起来的相同基因或者蛋白质用“V”标记区分。例如tcdB2/tccC3V1和tcdB2/tccC3V2表示使用不同接头融合的相同基因,其分别编码融合蛋白TcdB2/TccC3V1和TcdB2/TccC3V2。“+”符号也用于表示非融合组分,如当非融合B+C与融合B/C活性相比增强活性的情况下。
本文所用的术语“接头”和“接头序列”指用于将第一个蛋白质编码区与随后的、紧密相连的蛋白质编码区连接在一起的核苷酸,这样第一个和第二个(和/或随后的)蛋白质编码区在+1阅读框(如第一个蛋白质编码区的开放阅读框所定义)中形成单个较长的蛋白质编码区。因此此类接头或者接头序列不包括+1阅读框中的翻译终止密码子。作为接头或者接头序列的翻译结果,第一个蛋白质编码区编码的蛋白质通过一个或者多个氨基酸与第二个蛋白质编码区编码的蛋白质连接。由于多肽组分可直接连接,不需要接头序列,所以接头是任选的。
本文所用的“分离的”多核苷酸和/或蛋白质及“纯化的”蛋白质指与其在自然界中发现的其他相关联分子不再相关联时的那些分子。因此,涉及的“分离的”和/或“纯化的”的涵义表示本文描述的“人工”的参与。例如,置入植物中表达的本发明细菌蛋白质“基因”是一种“分离的多核苷酸”。同样地,本文例举的由植物产生的融合蛋白是一种“分离的蛋白质”。术语“连接到”也可用于表示“人工”的参与。也就是说,一种多肽组分可以是合成地连接或者“连锁”到另一种多肽组分上,从而形成本发明的融合蛋白。
“重组的”分子指已经重组的分子。当用于核酸分子时,该术语指包含通过分子生物技术的方法连接在一起的核酸序列的分子。当术语“重组的”用于蛋白质或者多肽时,指使用一种或者多种重组核酸分子产生的蛋白质分子。
当术语“异源的”用于核酸序列时,指连接到、或者被操作性连接到与其天然不连接的核酸序列或者与其天然在不同位置连接的核酸序列上。因此,术语“异源的”表示核酸分子已经使用基因工程改造,即通过人工干预来操作。因此,本发明的融合蛋白基因能有效地连接到异源启动子上(或者“转录调节区”,其指当转录调节区有效连接到目的序列上时,能调节或者调制目的核苷酸序列转录的核苷酸序列)。优选的异源启动子可以是植物启动子。当序列在功能上连接以使目的序列的转录被转录调节区介导或者调制时,启动子和/或转录调节区和目的序列(融合基因)是“有效连接的”。在一些实施方案中,为了有效地连接,转录调节区可位于与目的序列相同的链上。在一些实施方案中转录调节区可位于目的序列的5’端上。在这些实施方案中,转录调节区可直接是相关序列的5’端,或者在这些区域之间可以有插入序列。转录调节区和目的序列的有效连接可能要求适当的分子(如转基因激活蛋白质)与转录调节区结合,因此本发明包括或者体外或者体内提供这样的分子的实施方案。
本发明的融合蛋白和构建体。在一些实施方案中,本发明涉及编码B类TC蛋白的TC基因与编码C类TC蛋白的TC基因的融合,以便融合基因产生融合蛋白。融合可以是直接的,或者接头序列可以连接两个编码区域。本发明包括BC融合和CB融合,即编码序列可以以任意顺序融合。
本发明也包括编码A类TC蛋白的TC基因、编码B类TC蛋白的基因、和编码C类TC蛋白的TC基因的融合,以便融合基因产生融合蛋白。融合可以是直接的,或者接头序列可以连接两个编码区域。这三个组分可以任意次序融合,例如ABC、ACB、BAC、BCA、CAB或者CBA。
因此本发明包括A类/B类/C类TC融合蛋白、编码A类/B类/C类TC融合蛋白的多核苷酸、包括所述多核苷酸的载体和包含所述多核苷酸的植物、细胞(例如细菌和植物细胞)及种子。所述植物可以产生赋予所述植物昆虫抗性的本发明的融合蛋白。这些实施方案将在植物和其他生物中表达需要的转录控制序列的数量降低了三分之二,且消除了伴随分开的、完整基因转化的缺点。这些实施方案也提供了维持相互作用的蛋白质在物理和时间上同步翻译的办法,尤其是在真核细胞中。
本发明也包括B类/C类TC融合蛋白、编码B类/C类TC融合蛋白的多核苷酸、包括所述多核苷酸的载体和包含所述多核苷酸的植物、细胞(例如细菌和植物细胞)及种子。所述植物可以产生本发明的融合蛋白,当其与A类TC蛋白结合时赋予所述植物昆虫抗性。这些实施方案将在植物和其他生物中表达所需要的转录控制序列的数量降低了至少一半,且消除了伴随分开的、完整基因转化的缺点。这些实施方案也提供了维持相互作用的蛋白质在物理和时间上同步翻译的办法,尤其是在真核细胞中。
在一些情况下,融合编码区域的主要翻译产物在很大程度上保持完整,且含有与融合蛋白的被表达多肽组分目的活性。在其他情况下,主要翻译产物含有一个或者多个蛋白酶裂解位点,其被基因工程改造为位于独立多肽编码序列之间的多肽接头。当主要翻译产物暴露于适当的蛋白酶时,该蛋白酶裂解位点提供多肽组分的释放。
例如限制位点也能被基因工程改造到如接头中。在一个特别有代表性的实施方案中,XptA2和TcdB2蛋白质结构域之间的连接多肽片段编码SEQ ID NO:59中所示的多肽接头片段。该接头多肽长为9个氨基酸,含有侧翼为脯氨酸残基的带电和亲水性氨基酸。限制酶Avr II和SpeI的独特识别位点包括在相应的、编码寡核苷酸片段中。
对于下面给出的一些实施例,B类和C类TC蛋白的编码序列通过特别设计的接头连接。更具体地,该实施例描述了tcdB2(编码B类TC蛋白的基因)和tccC3(编码C类TC蛋白的基因)的编码区的融合。融合的B类/C类TC基因编码单一多肽。编码区通过编码接头肽的短寡核苷酸片断连接。接头肽进行基因工程改造以允许连接后的B类和C类TC蛋白的适当折叠,且提供位于融合的B类和C类蛋白之间可及的蛋白酶敏感位点(sensitive site)。下面公开了用于编码新颖TcdB2/TccC3 V1融合蛋白的基因的构建细节。
在这些实施例的其中一个中,含有融合TcdB2/TccC3V1蛋白质的裂解物在增效活性上比得上程序化表达非融合增效剂基因tcdB2和tccC3的细胞的裂解物。在这些实施例的另一个中,使用两个A类TC蛋白:TcdA(对鞘翅目昆虫有活性)和XptA2wi(对鳞翅目昆虫有活性)在生物检测中,测定程序化表达融合的编码区tcdB2/tccC3 V1的细胞裂解物。这表明含有融合TcdB2/TccC3 V1蛋白质的此类裂解物在增效活性上比得上程序化表达非融合增效剂基因tcdB2和tccC3的细胞的裂解物。
在另一个实施例中,A类、B类和C类TC蛋白的编码序列通过接头连接。该实施例描述了A类TC蛋白XptA2的编码区与上面描述的tcdB2/tccC3 V1融合物构成的融合物。含有XptA2/TcdB2/TccC3V1融合蛋白的裂解物表明了优异的功能活性。
融合蛋白的施用。可以用许多不同方式进行本发明。例如,植物可以进行基因工程改造以产生两种类型的A类TC蛋白和一种B类/C类融合蛋白。该植物的每个细胞、或者给定类型的组织中的每个细胞(如根或叶子)可以具有编码该两个A蛋白质和B类/C类融合蛋白的基因。或者,植物的不同细胞可以产生这些蛋白质中的一个(或者多个)。在这种情况下,当昆虫咬食植物组织时,该昆虫可能吃掉产生第一个A类TC蛋白的一个细胞,产生第二个A类TC蛋白的另一个细胞及产生B类/C类融合蛋白的另一个细胞。因此,重要的是植物(不必要是每一个植物细胞)产生本发明的两个A类TC蛋白和B类/C类融合蛋白,以便当昆虫吃掉植物组织时就吃到所有这四种蛋白质。
结合本发明,除了转基因植物之外,有许多施用目标害虫蛋白质的其他方式。喷雾应用在本领域是已知的。可以喷雾一些或者所有A类和B类/C类融合蛋白(植物可以产生一种或者多种蛋白质,其他蛋白质可以喷雾)。例如,土壤应用的多种类型的饵颗粒在本领域也是已知的,可以根据本发明使用。
不同的A类、B类和/或C类TC蛋白的多种组合现在可以以另人惊奇的新方式融合。本文阐明的一个实例表明使用TcdB2/TccC3融合物增强XptA2和TcdA的活性。在了解了本发明公开内容的益处后,使用这些和其他组合对本领域的技术人员而言是显而易见的。参见US-2004-0208907和WO 2004/067727。因此,本发明包括增效剂“混合对”的融合,如来自致病杆菌属的A类基因,与来自光杆状菌属的B类基因和来自致病杆菌属的C类基因的融合。也可以省略A类基因,因此本发明包括增效剂如来自光杆状菌属的B类基因和来自致病杆菌属的C类基因“混合对”的融合。因此,可以选择此类“毒素A”和/或增效剂的“异源”组合以最大化它们增强两个(例如)杀虫蛋白的能力。这就是说,可以发现,相比例如XptC1wi(B类)和XptB1wi(C类),对于给定用途,更期望TcdB1(B类)和XptB1wi(C类)的融合。同样地,本发明包括“ABC”型融合,其中A、B和/或C来自不同类型的生物。
本发明给本领域的技术人员提供了许多令人惊讶的优点。这些优点可以结合例如发明US-2004-0208907和WO 2004/067727来使用。在这些优点中,本领域的技术人员现在将能使用一对融合的增效剂来增强例如单独状态的致病杆菌属蛋白质毒素及例如单独状态的光杆状菌属蛋白质毒素的活性。(本领域的技术人员已知的是,致病杆菌属毒素蛋白质对控制鳞翅目昆虫更理想,而光杆状菌属毒素蛋白质往往对控制鞘翅目昆虫更理想)。这降低了转基因植物需要表达的基因(和转化事件)数量,实现更广谱目标害虫的有效控制。
本发明也包括使用转基因植物产生本发明的TC融合蛋白,组合例如一种或者多种苏云金芽孢杆菌(Bacillus thurigniensis)Cry蛋白质。同样,本发明的融合蛋白也能与其他杀虫毒素一起施用(例如通过喷雾应用)。
本发明TC融合蛋白的毒素复合体(TC)蛋白质组分。根据本发明的公开,本领域的技术人员现在可合理的期待根据本发明可以使用种类广泛的“A”、“B”和/或“C”组分,以及本发明不限于特别例证的实施方案。例如,在例证说明具体光杆状菌属A、B和/或C多肽的情况下,人们将知道其他光杆状菌属TC蛋白可以使用或者取代。同样地,可以使用相应的致病杆菌属TC多肽来取代例证的光杆状菌属多肽,以形成本发明的融合蛋白。例如参见US-2004-0208907和WO 2004/067727。
本发明提供了融合的TC蛋白。两个主要的实施方案是“BC”融合和“ABC”融合。然而应该注意到BC型融合包括C到B融合,“ABC”融合不限于A到B到C融合。下面更详细的讨论了多种其他可能的排列和方向。
取决于本发明选择使用的确切“B”和“C”组分,本发明的“BC”(或者“CB”)融合蛋白的分子量范围一般在大约220kDa到大约295kDa之间。例如,优选的重量在大约280-285kDa范围内。下面更详细讨论了本发明的BC融合蛋白(其可以加强A类毒素)的单一B和C组分可以以数种方式定义。
根据选择使用于本发明的确切A、B和C组分(以及接头,如果有的话),本发明的“ABC”融合蛋白(例如包括ACB融合)的分子量范围一般在大约450kDa到大约590kDa之间。例如,优选重量在大约560-565kDa范围内。下面更详细讨论了单一A、B和C组分可以以数种方式定义。
本文所用的“A类TC蛋白”是单独具有杀虫活性的230-290kDa TC蛋白,具有与选自XptA1wi(SEQ ID NO:22)、XptA2wi(SEQ ID NO:23)、TcbA(SEQ ID NO:24)、TcdA(SEQ ID NO:25)、TcdA2(SEQ ID NO:26)和TcdA4(SEQ ID NO:27)的序列有至少40%同一性的氨基酸序列。
除非另行说明,本文使用的序列同一性百分比和/或两个核酸的相似性使用Karlin和Altschul(1990),Proc.Natl.Acad.Sci.美国87:2264-2268的算法确定,该算法如Karlin和Altschul(1993),Proc.Natl.Acad.Sci.美国90:5873-5877进行改进。该算法整合到Altschul等人(1990),J.Mol.Biol.215:402-410的NBLAST和XBLAST程序中。BLAST核苷酸捡索用NBLAST程序进行,分值=100和字长=12。可如Altschul等人(1997),Nucl.Acids Res.25:3389-3402中所述使用缺口BLAST。当使用BLAST和缺口BLAST程序时,使用每个程序(NBLAST和XBLAST)的默认参数。参见NCBI/NIH网站。也可以使用上面的背景部分描述的Crickmore等人的方法和算法计算得分。
为了获得用于比较目的的缺口比对,应用Vector NTI Suite 8(InforMax,Inc,North Bethesda,MD,美国)的AlignX函数,其中使用默认参数。这些默认参数是:缺口开放罚分为15,缺口延伸罚分为6.66,缺口分离罚分范围为8。以该方式或者使用本领域熟知的其他技术,比对并比较两个或者多个序列。通过分析这样的比对,可鉴定本发明多肽的相对保守和非保守区域。这是用于例如评价通过修饰或者取代一个或者多个氨基酸残基来改变多肽序列是否可被预期为可以忍受的。
A类TC蛋白的实例,在本文中如SEQ ID NO:22-25中所示。这些实例包括来自光杆状菌属的TcbA和TcdA、来自致病杆菌属的XptA1和XptA2和来自嗜虫沙雷氏菌(Serratia entomophila)的SepA(GenBank登录号为AAG09642.1)。A类TC蛋白可以是例如大约230kDa(尤其是如果截短)、大约250-290kDa、大约260-285kDa和大约270kDa。已知A类TC蛋白TcdA,单独针对烟草天蛾(Manduca Sexta)是有活性的。
表1提供已知A类TC蛋白的序列同一性比较。这些比较表明40%序列同一性是定义A类TC蛋白的适当标准。
表I.A类TC蛋白的序列同一性比较 | |||||||
TcdA | TcdA2 | TcdA4 | TcbA | XptA1wi | XptA2wi | SepA | |
同一性百分比 | 同一性百分比 | 同一性百分比 | 同一性百分比 | 同一性百分比 | 同一性百分比 | 同一性百分比 | |
发光光杆状菌(Photorhabdus luminescens)A类 | |||||||
TcdA | 100.0 | 55.0 | 68.0 | 50.1 | 46.3 | 40.6 | 40.7 |
TcdA2 | 100.0 | 55.9 | 42.4 | 41.3 | 36.8 | 34.7 | |
TcdA4 | 100.0 | 49.4 | 44.4 | 38.7 | 38.7 | ||
TcbA | 100.0 | 43.7 | 40.8 | 40.2 | |||
嗜线虫致病杆菌(Xenorhabdus nematophilus)xwi A类 | |||||||
XptA1wi | 100.0 | 44.2 | 46.6 |
XptA2wi | 100.0 | 38.2 | |||||
嗜虫沙雷氏菌(Serratia entomophila)A类 | |||||||
SepA | 100.0 |
通过多核苷酸编码的蛋白质的编码多核苷酸可以定义和/或表征一些A类TC蛋白。可以通过这类多核苷酸与选自SEQ ID NO:61-66的核酸杂交(在严格条件下)的能力鉴定这类多核苷酸。用另一种方式来说,本发明的A类多肽组分可以由与选自SEQ ID NO:22-27的多肽的编码多核苷酸的互补分子杂交的多核苷酸编码。应该注意到,例如为了在植物中表达DNA序列可以被优化,同时注意到一定程度的变异在本发明的范围内。
A类TC蛋白的实例在本文中如SEQ ID NO:22-27中所示。这些实例包括来自光杆状菌属的TcbA和TcdA、来自致病杆菌属的XptA1和XptA2和来自嗜虫沙雷氏菌的SepA(GenBank登录号为AAG09642.1)。A类TC蛋白可以是例如大约230kDa(尤其是如果截短)、大约250-290kDa、大约260-285kDa和大约270kDa。已知A类TC蛋白TcdA单独针对烟草天蛾是有活性的。
除了那些在SEQ ID NO:22-27中具体鉴定的,A类TC蛋白包括例如:
1)从野生型生物获得的蛋白质;
2)突变得到的变体;
3)通过进行保守性氨基酸取代设计的变体;和
4)编码A类TC蛋白的多个不同序列的随机断裂和重装配产生的变体(DNA改组)。
例如,参见美国专利5,605,793。
编码A类TC蛋白的DNA序列可以是野生型序列、突变序列或者设计来表达预先确定的A类TC蛋白的合成序列。例如,避免多腺苷酸信号和使用植物优先的密码子尤其可用于设计在植物中高度表达DNA序列。已经公开了编码A类TC蛋白的植物优化核酸的实例,例如在美国专利6,590,142中。
本文所用的“B类TC蛋白”是具有与选自如下的序列有至少40%同一性的氨基酸序列的130-180kDa蛋白质:
TcdB1(SEQ ID NO:5),
TcdB2(SEQ ID NO:6),
TcaC(SEQ ID NO:7),
XptC1wi(SEQ ID NO:8),
XptB1xb(SEQ ID NO:9),
PptB11529(SEQ ID NO:10)和
Sep B(SEQ ID NO:11),
当与C类TC蛋白(如下定义)一起使用时,所述蛋白质能增加A类TC蛋白的毒性。
表II提供了已知B类TC蛋白的序列同一性比较。这些比较表明40%序列同一性是定义B类TC蛋白的适当标准。
表II.已知B类TC蛋白的序列同一性比较 | |||||||
TcdB1 | TcdB2 | TcaC | XptC1wi | XptB1xb | PptB1(Orf5) | SepB | |
同一性百分比 | 同一性百分比 | 同一性百分比 | 同一性百分比 | 同一性百分比 | 同一性百分比 | 同一性百分比 | |
发光光杆状菌B类 | |||||||
TcdB1 | 100.0 | 75.6 | 58.2 | 50.2 | 54.6 | 42.3 | 52.6 |
TcdB2 | 100.0 | 57.2 | 49.8 | 53.3 | 42.0 | 51.4 | |
TcaC | 100.0 | 51.6 | 59.8 | 42.6 | 50.1 | ||
嗜线虫致病杆菌xwi B类 | |||||||
XptC1wi | 100.0 | 53.2 | 40.7 | 47.8 | |||
伯氏致病杆菌(Xenorhabdus bovienii)B类 | |||||||
XptB1xb | 100.0 | 40.6 | 46.0 | ||||
类芽胞杆菌属物种(Paenibacillus spp)菌株1529B类 | |||||||
PptB1 | 100.0 | 38.7 |
(Orf5) | |||||||
嗜虫沙雷氏菌B类 | |||||||
SepB | 100.0 |
本发明的B类TC蛋白能够由在严格条件下与SEQ ID NO:28-33之一的核酸杂交的互补分子编码。用另一种方式来说,本发明的B类多肽组分可以由与选自SEQ ID NO:5-11的多肽的编码多核苷酸的互补分子杂交的多核苷酸编码。应该注意到,例如为了在植物中表达可以优化DNA序列,同时注意到一定程度的变异在本发明的范围内。
B类TC蛋白的实例在本文中如SEQ ID NO:5-11中所示。这些实例包括来自光杆状菌属的TcaC、TcdB1和TcdB2,来自致病杆菌属的XptC1wi和XptB1xb,来自类芽胞杆菌属的PptB11529(类芽胞杆菌属菌株DAS1529的ORF5的蛋白质产物)和来自嗜虫沙雷氏菌的SepB(GenBank登录号为AAG09643.1;本文再制备为SEQ ID NO:11)。B类TC蛋白大小一般在大约170kDa的范围内。B类TC蛋白的进一步实例来自丁香假单胞菌(Pseudomonas syringae pv.SyringaeB728)的TcaC同源物(GenBank登录号为gi23472544和gi23059431,和嗜线虫致病杆菌PO ORF268(由WO20/004855的图2的碱基258-1991编码)。优选的B类TC蛋白是TcdB2(SEQ ID NO:6)。B类TC蛋白可以是例如大约130-180kDa、大约140-170kDa、大约150-165kDa和大约155kDa。
除了那些在SEQ ID NO:5-11中具体别鉴定的,B类TC蛋白包括例如:
1)从野生型生物获得的蛋白质;
2)突变得到的变体;
3)通过进行保守氨基酸取代设计的变体;和
4)编码B类TC蛋白的多个不同序列的随机断裂和重装配产生的变体(DNA改组)。
例如,参见美国专利5,605,793。
编码B类TC蛋白的DNA序列可以是野生型序列、突变序列或者设计来表达预先确定的B类TC蛋白的合成序列。例如,避免多腺苷酸信号和使用植物优先的密码子尤其可用于设计在植物中高度表达的DNA序列。
本文所用的“C类TC蛋白”是具有与选自如下的序列有至少35%同一性的氨基酸序列的90-112kDa蛋白质:
TccC1(SEQ ID NO:12),
TccC2(SEQ ID NO:13),
TccC3(SEQ ID NO:14),
TccC4(SEQ ID NO:15),
TccC5(SEQ ID NO:16),
XptB1wi(SEQ ID NO:17),
XptC1xb(SEQ ID NO:18),
PptC1(长)(SEQ ID NO:19),
PptC1(短)(SEQ ID NO:20),和
Sep C(SEQ ID NO:21);
当与B类TC蛋白组合使用时,所述蛋白质能增加A类TC蛋白的毒性。
表III提供了已知C类TC蛋白的序列同一性比较。该比较表明35%序列同一性是定义C类TC蛋白的适当标准。
表III.已知C类TC蛋白的序列同一性比较 | ||||||||||
TccC1 | TccC2 | TccC3 | TccC4 | TccC5 | XptB1wi | XptC1xb | PptC1(Orf6长) | PptC1(Orf6短) | ScpC | |
同一性百分比 | 同一性百分比 | 同一性百分比 | 同一性百分比 | 同一性百分比 | 同一性百分比 | 同一性百分比 | 同一性百分比 | 同一性百分比 | 同一性百分比 | |
发光光杆状菌C类 | ||||||||||
TccC1 | 100.0 | 48.1 | 52.8 | 52.9 | 51.3 | 45.5 | 46.5 | 35.0 | 35.7 | 44.1 |
TccC2 | 100.0 | 52.5 | 53.7 | 61.4 | 44.1 | 47.2 | 35.3 | 36.1 | 46.1 | |
TccC3 | 100.0 | 59.5 | 58.4 | 46.0 | 48.1 | 35.4 | 36.1 | 46.6 | ||
TccC4 | 100.0 | 57.2 | 44.8 | 49.1 | 36.9 | 37.7 | 45.3 | |||
TccC5 | 100.0 | 45.6 | 48.7 | 35.2 | 36.0 | 44.9 | ||||
嗜线虫致病杆菌xwi C类 | ||||||||||
XptB1wi | 100.0 | 41.4 | 32.7 | 33.5 | 46.3 | |||||
伯氏致病杆菌C类 | ||||||||||
XptC1xb | 100.0 | 35.4 | 36.2 | 43.5 | ||||||
类芽胞杆菌属物种菌株1529C类 |
PptC1(Orf6长) | 100.0 | 97.6 | 34.9 | |||||||
PptC1(Orf6短) | 100.0 | 35.7 | ||||||||
嗜虫沙雷氏菌C类 | ||||||||||
SepC | 100.0 |
典型的C类TC蛋白由在严格条件下与SEQ ID NO:34-42之一的核酸杂交的多核苷酸编码。用另一种方式来说,本发明的C类多肽组分可以由与选自SEQ ID NO:12-21的多肽的编码多核苷酸的互补分子杂交的多核苷酸编码。应该注意到,例如为了在植物中表达可以优化DNA序列,同时注意到一定程度的变异在本发明的范围内。
C类TC蛋白的实例在本文中如SEQ ID NO:12-21中所示。这些实例包括来自光杆状菌属的TccC1和TccC3,来自致病杆菌属的XptB1wi和XptC1xb,来自类芽孢杆菌属的PptC11529(类芽孢杆菌属菌株DAS1529的ORF6的蛋白质产物)和来自嗜虫沙雷氏菌的SepC(GenBank登录号为AAG09644.1;本文再制备为SEQ ID NO:21)。该类中的蛋白质大小一般在大约112kDa的范围内。C类TC蛋白的其他实例来自丁香假单胞菌(Pseudomonas syringae pv.Syringae B728a)的TccC同源物(GenBank登录号为gi23470227、gi23472546、gi23472540、gi23472541、gi23468542、gi23472545、gi23058175、gi23058176、gi23059433、gi23059435和gi23059432)。优选的C类TC蛋白是TccC3(SEQ ID NO:14)。C类TC蛋白可以是例如大约90-120kDa、大约95-115kDa、大约1050-110kDa和大约105-107kDa。
除了那些在SEQ ID NO:12-21中具体鉴定的,C类TC蛋白包括例如:
1)从野生型生物获得的蛋白质;
2)突变得到的变体;
3)通过进行保守氨基酸取代设计的变体;和
4)多个不同的C类TC蛋白序列的随机断裂和重装配产生的变体(DNA改组)。
编码C类TC蛋白的DNA序列可以是野生型序列、突变序列或者设计来表达预先确定的C类TC蛋白的合成序列。例如,避免多腺苷酸信号和使用植物优先的密码子尤其可用于设计在植物中高度表达的DNA序列。
本发明使用的组分的一些其他实例(和它们彼此的相关性)包括:
A类蛋白
光杆状菌属TcdA毒素同源物 | ||
名称 | 参考 | 与W-14TcdA(GenBank登录号为AAF05542.1)的序列同一性 |
P.1.Hph2 | 美国6,281,413B1的SEQ ID NO:13 | ~93% |
P.1.Hph3 | 由美国6,281,413B1的SEQ IDNO:11的碱基2416到9909编码 | ~57% |
光杆状菌属TcbA毒素同源物 | ||
名称 | 参考序列 | 与W-14TcdA(GenBank登录号为AAF05542.1)的序列同一性 |
P.1.W-14TcbA | GenBank登录号为AAC38627.1(本文再制备为SEQ ID NO:24) | (与W-14TcdA有~50%序列同一性) |
致病杆菌属XptA1毒素同源物 | ||
名称 | 参考序列 | 与Xwi XptA1(本文公开为SEQ ID NO:22)的序列同一性 |
X.n XptA1 | GenBank登录号为CAC38401.1(AJ308438) | ~96% |
致病杆菌属XptA2毒素同源物 | ||
名称 | 参考序列 | 与Xwi XptA2(本文公 |
开为SEQ ID NO:23)的序列同一性 | ||
X.n XptA2 | GenBank登录号为CAC38404.1(AJ308438) | ~95% |
B类TC蛋白
光杆状菌属~170kDa增效剂 | ||
名称 | 鉴定人 | 与TcdB(GenBank登录号为AAL18487.1)的序列同一性 |
P.1.ORF2 | 美国6,281,413B1的SEQ ID NO:14 | ~93% |
P.1.ORF4 | 由美国6,281,413B1的SEQ IDNO:11的碱基9966到14633编码 | ~71% |
致病杆菌属大约170kDa增效剂 | ||
名称 | 鉴定人 | 与XptC1wi(本文公开为SEQ ID NO:8)的序列同一性 |
X.n.XptC 1 | GenBank登录号为CAC38403.1 | ~90% |
C类TC蛋白
光杆状菌属大约112kDa增效剂 | ||
名称 | 鉴定人 | 与TccC1A(GenBank登录号为AAC38630.1)的序列同一性 |
P.1.ORF5 | 美国6,281,413B1的SEQ ID NO:12 | ~51% |
致病杆菌属~112kDA增效剂 | ||
名称 | 鉴定人 | 与XptB1wi(本文公开为SEQ ID NO:17)的序列同一性 |
X.n.XptB1 | GenBank登录号为CAC38402 | ~96% |
X.nem.P2-ORF 2071 | 由WO 20/004855的图2的碱基2071到4929编码 | ~48% |
本文特别例证了可用于本发明融合蛋白中的某些A类、B类和C类TC蛋白。因为这些蛋白质仅仅是可用于本发明的蛋白质的例证,所以本发明显然应该包括与例证性蛋白质具有相同或者类似功能性的变体或者等价蛋白质(和编码其等价物的核苷酸序列)的使用。等价蛋白质将与例证性TC蛋白具有氨基酸相似性(和/或同源性)。可以根据较窄的同一性和/或相似性范围定义本发明的优选多核苷酸和蛋白质。例如与本文例证或者建议的序列相比,A、B和/或C类TC蛋白的同一性和/或相似性可以是40、41、42、43、44、45、46、47、48、49、50、51、52、53、54、55、56、57、58、59、60、61、62、63、64、65、66、67、68、69、70、71、72、73、74、75、76、77、78、79、80、81、82、83、84、85、86、87、88、89、90、91、92、93、94、95、96、97、98或者99%;与本文例证或者建议的序列相比而言,C类TC蛋白的同一性和/或相似性可以是35、36、37、38、39、40、41、42、43、44、45、46、47、48、49、50、51、52、53、54、55、56、57、58、59、60、61、62、63、64、65、66、67、68、69、70、71、72、73、74、75、76、77、78、79、80、81、82、83、84、85、86、87、88、89、90、91、92、93、94、95、96、97、98或者99%。上面列出的任何数可用于定义上限和下限。例如本发明融合蛋白的B类组分可定义为与给定的TcdB蛋白质具有50-90%的同一性。因此,与之前已知的TcdB蛋白质,包括本文特别例证的融合TcdB蛋白质(同样地,和具有PptB或者相应的致病杆菌属蛋白质)相比,TcdB样蛋白质(和/或tcdB样基因)可通过本文提供或者建议的任何用数字表示的同一性来定义。
氨基酸的同源性/相似性/同一性,一般(但不是必须)在决定蛋白质活性的蛋白质区域或者参与确定最终负责活性的三维构象的蛋白质区域中最高。在这点上,某些氨基酸取代是可接受的并且预计是可以忍受的。例如,这些取代可以是在蛋白质非关键活性的区域。可以使用蛋白质晶体结构分析和基于软件的蛋白质结构建模来确定蛋白质中能够被修饰的区域(使用定点诱变、改组等),以确实改变蛋白质的特性和/或提高蛋白质的功能性。
也可改变蛋白质的多种特性和三维特征,而不会负面影响蛋白质的毒素活性/功能性。保守氨基酸的取代预计能够忍受/不会不利地影响分子的三维构象。氨基酸可以分为下面的几类:非极性的、不带电荷的极性的、碱性的和酸性的。一类中的氨基酸被该类中的另一个氨基酸取代的这种保守取代处于本发明的范畴,条件是只要取代不为不利于所述化合物生物学活性的取代即可。
表IV提供了属于每一类别的氨基酸实例的列表。
表IV氨基酸类别 | |
氨基酸类别 | 氨基酸的例子 |
非极性的 | Ala、Val、Leu、Ile、Pro、Met、Phe、Trp |
不带电极性的 | Gly、Ser、Thr、Cys、Tyr、Asn、Gln |
酸性的 | Asp、Glu |
碱性的 | Lys、Arg、His |
在一些情况下,也可以进行非保守性取代。关键的因素是这种取代必须不明显损失蛋白质的功能/生物学/毒素活性。
使用本文提供的教导,可以从野生型或者重组细菌和/或其他野生型或重组生物,得到或者获得等效的A类、B类和/或C类TC蛋白和编码这些等效蛋白质的基因。例如其他芽胞杆菌、沙雷氏菌、类芽孢杆菌、光杆状菌和致病杆菌物种可用作分离源。
根据本发明可以用多种方法得到能够使用的蛋白质。例如,本文公开的蛋白质的抗体可以用来从混合物中鉴定和分离其他蛋白质。具体而言,针对蛋白质中最恒定和最能与其他蛋白质相区别的部分产生抗体。然后通过免疫沉淀、酶联免疫吸附测定(ELISA)或者免疫印迹,可以使用这些抗体特异地鉴定具有特征活性的等效蛋白质。使用标准的操作步骤可以容易地制备针对本文所公开的蛋白质、或者等效蛋白质、或者这些蛋白质片段的抗体。此类抗体是本发明的一个方面。本发明的蛋白质可以从多种来源的微生物中得到。
本领域技术人员可以清楚地认识到本发明的蛋白质(和基因)可以从多种来源中得到。“从”或者“自……获得”本发明的任何分离株的蛋白质,在此处指的意思或建议的意思是毒素(或者相似的毒素)可以从分离株或者诸如另一细菌菌株或者植物的其它一些来源得到。“来自……”也具有该含义并且包括从例如经过修饰用于在植物中进行表达的特定类型细菌中得到的蛋白质。本领域技术人员会清楚地认识到,一旦细菌基因和蛋白质被公开,就可以将植物进行工程改造以产生该蛋白质。使用本文公开的多核苷酸和/或氨基酸序列可以制备抗体制备物、核酸探针(DNA和RNA)等等,并且这些抗体制备物、核酸探针等等可以用来从其他(天然)来源中筛选和回收其他蛋白质基因。
通过使用例如寡核苷酸探针可以鉴定和得到在本发明中有用的蛋白质和基因。这些探针是能够依靠恰当标记检测的可检测核苷酸序列或者如国际申请序列号WO 93/16094中所述的制作成固有荧光性的。探针(和本发明的多核苷酸)可以是DNA、RNA或者PNA。除了腺嘌呤(A)、胞嘧啶(C)、鸟嘌呤(G)、胸腺嘧啶(T)和尿嘧啶(U;对于RNA分子)外,本发明的合成探针(和多核苷酸)还可以含有次黄嘌呤(能够与所有四种碱基配对的中性碱基;有时用于代替合成探针中所有四种碱基的混合物)。因此,当文中谈及合成的简并寡核苷酸时,并且通常使用“N”或者“n”时,“N”或者“n”可以是G、A、T、C或者次黄嘌呤。文中所使用的多义密码子,与截止本申请提交之时的标准的IUPAC命名规则一致(例如,R指A或者G,Y指C或者T等)。
如本领域所熟知,如果探针分子与核酸样品杂交,可以有理由认为探针和样品具有相当的同源性/相似性/同一性。优选地,通过本领域众所周知的技术,如在Keller,G. H.,M.M.Manak(1987)DNA Probes,StocktonPress,New York,NY,169-170页中所述,多核苷酸首先进行杂交,然后在低、中或者高严格条件下进行洗涤。例如,如本文所述,通过在室温下用2×SSC(标准柠檬酸盐)/0.1%SDS(十二烷基硫酸钠)第一次洗涤15分钟实现低严格条件。一般进行两次洗涤。然后通过降低盐浓度和/或升高温度可以实现较高的严格条件。例如,在上述的洗涤后接着是用0.1×SSC/0.1%SDS在室温下进行两次15分钟的洗涤,再其次是用0.1×SSC/0.1%SDS在55℃条件下进行每次30分钟的多次洗涤。这些温度可以与文中提出的其他杂交和洗涤操作步骤一起使用,且是本领域技术人员已知的(例如SSPE可以作为盐来代替SSC)。通过将50ml 20×SSC和5ml10%SDS加入到445ml水中来配制2×SSC/0.1%SDS。通过组合NaCl(175.3g/0.150M)、柠檬酸钠(88.2g/0.015M)和水,用10N NaOH调节pH至7.0,然后调节体积至1升来配制20×SSC。通过在50ml灭菌水中溶解10g SDS,然后稀释到100ml来配制10%SDS。
探针检测提供了一种以已知的方式确定杂交是否维持的方法。此种探针分析提供了一种鉴定本发明编码毒素的基因的快速方法。作为本发明探针使用的核苷酸片段可以使用DNA合成仪和标准方法进行合成。这些核苷酸序列还可以用作PCR引物,用于扩增本发明的基因。
用编码A类、B类和C类TC蛋白(例如SEQ ID NO:28-42和61-66)的给定野生型核酸杂交是一种可用于发现和/或确定A类、B类和C类TC蛋白的技术,该技术可用于本发明的融合蛋白。如本文所用,杂交的“严格”条件是指实现同当前申请人所采用的条件相同或者大约相同程度的杂交特异性的条件。具体而言,通过标准的方法(例如参见Maniatis,T.,E.F.Fritsch,J.Sambrook [1982]Molecular Cloning:A Laboratory Manual,ColdSpring Harbor Laboratory,Cold Spring Harbor,NY),采用32P-标记的基因特异的探针开展开展固相DNA杂交的Southern印迹。杂交和随后的洗涤通常在允许检测靶序列的条件下进行。对于双链DNA基因探针,在6×SSPE、5×Denhardt′s溶液、0.1%SDS、0.1mg/ml变性DNA中,在低于DNA杂交体解链温度(Tm)20-25℃条件下过夜杂交。在下面的公式中叙述了解链温度(Beltz,G. A.,K.A.Jacobs,T.H.Eickbush,P.T.Cherbas和F.C.Kafatos[1983]Methods of Enzymology,R.Wu,L.Grossman和K.Moldave[编著]Academic Press,New York 100:266-285):
1)Tm=81.5℃+16.6Log[Na+]+0.41(%G+C)-0.61(%甲酰胺)-600/碱基对中双链体的长度。
2)洗涤通常如下进行:
3)1×SSPE、0.1%SDS室温15分钟(低严格洗涤),两次。
4)在0.2×SSPE、0.1%SDS中,在Tm-20℃下洗涤15分钟(中等严格洗涤),一次。
对于寡核苷酸探针,在6×SSPE、5×Denhardt′s溶液、0.1%SDS、0.1mg/ml变性DNA中,在低于杂交体解链温度(Tm)10-20℃条件下过夜杂交。对于寡核苷酸探针的Tm,可以通过下式确定:Tm(℃)=2(T/A碱基对数目)+4(G/C碱基对数目)(Suggs,S.V.,T.Miyake,E.H.Kawashime,M.J.Johnson,K.Itakura和R.B.Wallace[1981]ICN-UCLA Symp.Dev.Biol. Using Purified Genes,D.D.Brown[编著],Academic Press,New York,23:683-693)。
洗涤通常如下进行:
1)1×SSPE、0.1%SDS室温15分钟(低严格洗涤),两次。
2)在0.2×SSPE、0.1%SDS中,在杂交温度下洗涤15分钟(中等严格洗涤),一次。
通常,改变盐和/或温度可以改变严格度。对于标记的>70或者大约如此碱基长度的DNA片段,可以使用下面的条件:
1)低:1或2×SSPE,室温
2)低:1或2×SSPE,42℃
3)中:0.2×或1×SSPE,65℃
4)高:0.1×SSPE,65℃。
双链体的形成和稳定性取决于杂合体两条链之间的基本互补性,并且如上指出能够耐受一定程度的错配。因此,本发明的探针序列包括所述序列的突变(单个和多重突变)、缺失、插入及其组合,其中所述突变、插入和缺失允许与有关的靶多核苷酸形成稳定杂交体。突变、插入和缺失能够以许多方式在目的多核苷酸序列内产生,并且这些方法是本领域普通技术人员熟知的。其他方法将来可以逐渐为人们所知。
PCR技术:聚合酶链式反应(PCR)是重复的、酶促的、使用引物触发的核酸序列合成。该过程是众所周知的并且本领域技术人员常常使用(参见Mullis,美国专利第4,683,195号、第4,683,202号和第4,800,159号;Saiki,Randall K.,Stephen Scharf, Fred Faloona,Kary B.Mullis,Glenn T.Horn,Henry A.Erlich,Norman Amheim[1985]“β-球蛋白质基因组序列的酶促扩增和限制性酶切位点分析用于诊断镰刀形红细胞贫血症”,Science 230:1350-1354)。PCR是基于目的DNA片段的酶促扩增,该DNA片段的两侧为可与靶序列的相对链杂交的两条寡核苷酸引物。引物用指向彼此的3′末端表示方向。模板热变性、引物退火至其互补序列和使用DNA聚合酶延伸退火的引物步骤的重复循环,导致由PCR引物5′末端所限定的片段的扩增。每条引物的延伸产物能够作为模板用于其他引物,因此每个循环基本上加倍前一循环中所产生的DNA片段的数量。这导致特定靶片段的指数积累,在几个小时内增加至几百万倍。通过使用热稳定DNA聚合酶,诸如从嗜热菌-嗜热水生菌(Thermus aquaticus)分离的Taq聚合酶,扩增过程能够完全自动化。其他能够使用的酶是本领域技术人员熟知的。
本发明的DNA序列能够用作PCR扩增的引物。在进行PCR扩增的过程中能够耐受引物与模板之间一定程度的错配。因此,作为例证的引物的突变、缺失和插入(特别是将核苷酸加至5′末端)包括在本发明的范围之内。通过本领域普通技术人员已知的方法能够在给定引物中产生突变、插入和缺失。
基因和蛋白质的修饰。根据本发明有用的基因和蛋白质不仅包括具体作为例证的全长序列,还包括这些序列、变体、突变体、嵌合体的部分、节段和/或片段(包括与全长分子相比的内部和/或末端缺失),及其融合物。本发明中使用的蛋白质可具有取代的氨基酸,只要它们保留此处具体作为例证的蛋白质的杀虫/功能活性。“变体”基因具有编码相同蛋白质或具有等效于例证蛋白质的功能性的等效蛋白质的核苷酸序列。术语“变体蛋白质”和“等效蛋白质”是指具有与作为例证的蛋白质相同或基本上相同的生物学/功能活性的蛋白质。如文中所使用,当谈及“等效”序列时,是指具有能够提高功能性或对功能性无不利影响的氨基酸取代、缺失、添加或插入的序列。该定义中还包括了保留功能性的片段。保留与作为例证的蛋白质的相应片段相同或相似功能的片段和其他等效物,也包括在本发明的范围之内。变化例如氨基酸取代或添加能够用于多种目的,例如增加(或降低)蛋白质的蛋白酶稳定性(而没有大大地/相当大地降低蛋白质的功能性)。
使用例如用于点突变的标准技术,可以容易地构建基因的变异。此外,例如美国专利第5,605,793号叙述了通过在随机断裂后利用DNA重装配产生额外的分子多样性的方法。变体基因能够用于产生变体蛋白质;重组宿主能够用于产生变体蛋白质。利用这些“基因改组(gene shuffling)”技术,能够构建包含本文作为例证的任意序列的任意5个、10个或20个连续残基(氨基酸或核苷酸)的等价基因和蛋白质。
利用商业可得的核酸外切酶或核酸内切酶,按照标准步骤能够制备全长基因的片段。例如,诸如Bal31的酶或定点诱变能够用于从这些基因的末端系统地切除核苷酸。同样地利用多种限制性内切酶可以获得编码活性片段的基因。蛋白酶可以用于直接获得这些蛋白质的活性片段。
如本文所述,TC蛋白可以被截短而仍然保留功能活性也在本发明的范围之内。“截短的蛋白质”意思是指蛋白质之一部分可以被切割并且切割之后仍然呈现活性。通过昆虫肠道内部或外部的蛋白酶能够实现切割。而且,使用分子生物学技术能够产生有效切割的蛋白质,其中编码所述蛋白质的DNA碱基通过用限制性内切酶消化或者使用技术人员可用的其他技术去除。截短之后,所述蛋白质能够在异源系统例如大肠杆菌、杆状病毒、基于植物的病毒系统、酵母及其类似系统中进行表达,然后置于本文所公开的昆虫测定法中确定活性。本领域中众所周知能够成功地产生截短的蛋白质,以至于虽然小于完整的全长序列,但是它们仍保留了功能活性。本领域中众所周知B.t.毒素能够以截短(核心毒素)的形式使用。例如参见Adang等人,Gene 36:289-300(1985),“Characterized full-length andtruncated plasmid clones of the crystal protein of Bacillus thuringgiensissubsp kurstaki HD-73and their toxicity to Manduca sexta (苏云金芽孢杆菌亚种kurstaki HD-73的晶体蛋白质的全长和截短的质粒克隆特性表征及其对烟草天蛾的毒性)”。保留杀虫活性的截短蛋白质的其他例子包括昆虫保幼激素酯酶(授予Regents of University of California的美国专利第5,674,485号)。文中所使用的术语“毒素”意思也包括功能上有活性的截短形式。
由于遗传密码的简并性/冗余性,多种不同的DNA序列可以编码本文公开的氨基酸序列。产生编码相同或者基本上相同毒素的备选DNA序列在受过本领域培训的技术人员的技术范围之内。这些变体DNA序列属于本发明的范畴。
优化序列用于在植物中表达:为了实现异源基因在植物中高表达,优选地是对所述基因重新进行改造使得它们能够在植物细胞(的胞质)中更有效地表达。玉米是此种植物的一种,优选地是在转化前将外源基因进行重新设计以提高它们在所述植物中的表达水平。因此,设计编码细菌毒素的基因的额外步骤是重新改造异源基因用于优化表达。例如可以在美国专利5,380,831中发现关于产生在植物中优化表达的合成基因的的指导。SEQID NO:43和44给出了编码B类TC蛋白TcdB2和C类TC蛋白TccC3的植物优化序列的实例。
功能、活性和实用性。本发明提供了可方便施用的功能蛋白质。本发明还提供递送具有功能活性并且对多数目的昆虫,优选地是鳞翅类昆虫有效的昆虫杀虫剂的方法。“功能活性”(或“有活性”),在本文指蛋白质作作为口服活性的控制昆虫的试剂发挥作用(单独或者与其它蛋白质相结合),这些蛋白质有毒性效应(单独或者与其他蛋白质组合),或者能破坏或者阻止昆虫生长和/或其进食,这可能引起或者不能引起昆虫的死亡。当通过转基因植物表达、配制的蛋白质组合物、可喷雾的蛋白质组合物、饵基质或者其他投递系统而投递,昆虫与“有效量的”本发明的“杀虫蛋白”接触时,结果通常是昆虫的死亡,昆虫的生长和/或繁殖的抑制,和/或在能产生昆虫可获得该蛋白质的来源(优选转基因植物)上阻止昆虫进食。因此,例如摄食有效量ABC融合蛋白的昆虫,举例而言可能被阻止进食、阻碍其生长、和/或被杀死。如果当本发明的“BC”融合蛋白与A类TC蛋白组合使用可增强其功能活性,那么本发明的“BC”融合蛋白有“功能性”或毒性活性。
对进食昆虫的完全致命性是优选的,但对获得功能活性不是必须的。如果昆虫远离该蛋白质或者停止进食,甚至如果这些效果是亚致死的,或者致命性被延迟或者不直接,该避免在一些应用中将是有用的。例如如果期望是抗昆虫性的转基因植物,昆虫不愿摄食该植物与对昆虫的致命毒性同样有用,因为最终目标是避免昆虫所导致的植物的损害。
功能活性转移到植物或细菌系统通常需要将编码毒素的氨基酸序列的核苷酸序列整合进入蛋白质表达载体上,该蛋白质表达载体对于载体所要居留的宿主细胞是恰当的。获得编码有功能活性的蛋白质的核酸序列的一种途经是如本文公开的使用从毒素的氨基酸序列推导的信息,从产生毒素的细菌物种中分离出天然遗传物质。例如如同下面详细讨论的,可以优化天然序列以便在植物中表达。也可以基于蛋白质序列设计优化的多核苷酸。
有许多TC蛋白搀入昆虫食物中的其他方法。例如本文公开的,可通过用蛋白质溶液喷雾食物的方式,用毒性蛋白质掺杂幼虫的食物源。另外,纯化的蛋白质可被基因工程改造到其他无害细菌中,然后在培养物中培养,或者用于食物源或者允许在期望根除昆虫的区域的土壤中存在。而且,该蛋白质可以被基因工程改造直接进入昆虫食物源中。例如许多昆虫幼虫的主要食物源是植物材料。因此可将编码毒素的基因转入植物材料中,这样所述植物材料就可表达目的毒素。
转基因宿主。可以将本发明的编码毒素复合体融合物的基因导入到广泛的多种微生物或植物宿主中。在优选的实施方案中,使用了转基因植物细胞和植物。优选的植物(和植物细胞)是谷类(玉米)、棉花、卡诺拉(canola)、向日葵和大豆。
在优选的实施方案中,融合基因的表达直接或间接导致融合蛋白在细胞内产生(和维持)。以这种方式植物表现出昆虫抗性。当害虫摄取转基因/重组/转化/转染的宿主细胞(或其内容物)时,害虫就会摄取毒素。这是导致害虫与毒素接触的优选方式。结果是对害虫的控制(杀死或致病)。对于吮吸害虫也可以用相似的方式进行控制。备选地,在靶害虫存在的地方可以使用适宜的微生物宿主,例如假单胞菌属如荧光假单胞菌(P.fluorescens),微生物可以在此处增殖并且被靶害虫摄取。可以将具有毒素基因的微生物在延长毒素活性和稳定细胞的条件下进行处理。从而经过处理的、保留毒素活性的细胞可以应用到靶害虫的环境中。本发明也包括施用产生少于所有三种类型TC多肽的细胞。例如,在一些实施方案中,本发明包括共同施用产生毒素A的细胞和产生本发明的BC融合蛋白的细胞。
如果通过适当的载体将毒素基因导入微生物宿主,并且将所述宿主以活的状态应用到环境中时,可以用到一些宿主微生物。选择了已知占据一种或多种目的作物的植物圈(叶面、叶际、根际和/或根面)的微生物宿主。将这些微生物进行选择以致于具有在具体环境中(作物和其他昆虫栖息地)与野生型微生物成功竞争的能力,提供能够表达多肽杀虫剂的基因的稳定维持和表达,并且希望提供使杀虫剂免受环境降解和失活的增强的保护。
已知大量的微生物栖息在广泛多样的重要作物的叶面(植物叶的表面)和/或根际(植物根周围的土壤)。这些微生物包括细菌、藻类和真菌。特别感兴趣的是微生物,诸如细菌,例如假单胞菌属、欧文氏菌属(Erwinia)、沙雷氏菌属、克雷伯氏菌属(Klebsiella)、黄单胞菌属(Xanthomonas)、链霉菌属(Streptomyces)、根瘤菌属(Rhizobium)、红假单胞菌属(Rhodopseudomonas)、Methylophilius、农杆菌属(Agrobacterium)、醋杆菌属(Acetobacter)、乳杆菌属(Lactobacillus)、节杆菌属(Arthrobacter)、固氮菌属(Azotobacter)、明串珠菌属(Leuconostoc)和产碱菌属(Alcaligenes);真菌,特别是酵母,例如酵母属(Saccharomyces)、隐球酵母属(Cryptococcus)、克鲁维酵母属(Kluyveromyces)、掷孢酵母属(Sporobolomyces)、红酵母属(Rhodotorula)和短梗霉属(Aureobasidium)。特别感兴趣的是植物圈细菌物种如丁香假单胞菌(Pseudomonas syringae)、荧光假单胞菌、粘质沙雷氏菌(Serratia marcescens)、木醋杆菌(Acetobacterxylinum)、根癌农杆菌(Agrobacterium tumefaciens)、球形红假单胞菌(Rhodopseudomonas spheroides)、野油菜黄单胞菌(Xanthomonascampestris)、苜蓿根瘤菌(Rhizobum melioti)、真养产碱菌(Alcaligenesentrophus)和维涅兰德固氮菌(Azotobacter vinlandii);和植物圈酵母物种如深红酵母(Rhodotorula rubra)、红酵母(R.glutinis)、海滨红酵母(R.marina)、橙黄红酵母(R.aurantiaca)、浅白隐球酵母(Cryptococcusalbidus)、流散隐球酵母(C.diffluens)、变黄罗伦隐球酵母(C.laurentii)、罗斯酵母(Saccharomyces rosei)、普地酵母(S.pretoriensis)、酿酒酵母(S.cerevisiae)、掷孢酵母类(Sporobolomyces roseus)、香气掷孢酵母(S.odorus)、佛地克鲁维酵母(Kluyveromyces veronae)和出芽短梗霉(Aureobasidium pollulans)。也感兴趣的是有颜色的微生物。
将基因插入以形成转基因宿主:本发明的一个方面是用表达本发明蛋白质的本发明多核苷酸转化/转染植物、植物细胞和其他宿主细胞。用此种方式转化的植物具有抗靶害虫攻击的抗性。
可以用广泛大量的方法在允许基因稳定维持和表达的条件下将编码蛋白质的基因导入靶宿主。这些方法对于本领域技术人员而言是熟知的并且在例如美国专利第5,135,867号中有所叙述。
例如,包含大肠杆菌复制系统和允许对转化细胞进行选择的标记物的大量克隆载体可用于将外来基因插入到高等植物中。这些载体包含例如pBR322、pUC系列、M13mp系列、pACYC184等等。相应地,可以将编码毒素的序列插入到载体的适当限制性位点。所得到的质粒用于转化入大肠杆菌。将大肠杆菌在适当的营养培养基中进行培养,然后收获和裂解。将质粒回收。通常实施序列分析、限制性分析、电泳和其他生物化学-分子生物学方法作为分析的方法。每一次操作之后,将所使用的DNA序列进行切割并且连接到下一个DNA序列中。可以将每一个质粒序列克隆到相同的或者其他的质粒中。根据目的基因插入植物的方法,其他的DNA序列可能是需要的。例如,如果Ti或者Ri质粒用于转化植物细胞,那么至少是Ti或者Ri质粒T-DNA的右边界,但常常是右边界和左边界,作为所插入基因的侧翼区而连接。对于T-DNA用于转化植物细胞的用途已经得以广泛研究并在EP 120516;Hoekema(1985):The Binary Plant VectorSystem,Offset-durkkerij Kanters B.V.,Alblasserdam,第5章;Fraley等人,Crit.Rev.Plant Sci.4:1-46;和An等人(1985)EMBO J.4:277-287中有叙述。
有大量的技术可以将DNA插入到植物宿主细胞。这些技术包括使用根癌农杆菌或者发根农杆菌(Agrobacterium rhizogenes)作为转化介质的T-DNA转化、融合、注射、生物弹射击法(biolistics)(微粒子轰击)或者电穿孔以及其他可能的方法。如果农杆菌属用于转化,必须将欲被插入的DNA克隆到特定的质粒中,即克隆到中间载体或者双元载体中。由于序列与T-DNA上的序列同源,通过同源重组可以将中间载体整合到Ti或者Ri质粒中。Ti或者Ri质粒还含有对于T-DNA转移必需的vir区域。中间载体不能够在农杆菌属中自我复制。通过辅助质粒(接合)的方法可以将中间载体转移入根癌农杆菌。双元载体可以在大肠杆菌和农杆菌属中进行自我复制。它们包含选择性标记基因和接头或者多接头,这些接头或者多接头被T-DNA右边界和左边界框起来。它们可以直接转化进入农杆菌属中(Holsters等人[1978]MoL Gen.Genet.163:181-187)。作为宿主细胞使用的农杆菌属要包含携带vir区域的质粒。vir区域对于将T-DNA转移进入植物细胞是必需的。可以包含额外的T-DNA。如此转化的细菌用于转化植物细胞。植物外植体可以有利地与根癌农杆菌或者发根农杆菌一起培养,以便将DNA转移进入植物细胞。在含有用于选择的抗生素或者杀生物剂的适当培养基中使被感染的植物材料(例如叶片、茎段、根,还可以是原生质体或者悬浮培养的细胞)再生出整株植物。然后对如此得到的植物检验插入DNA是否存在。在注射和电穿孔的情况下,对质粒没有特殊的要求。使用普通的质粒例如pUC衍生物是可能的。
通常,转化的细胞生长在植物体的内部。它们可以形成生殖细胞并且将转化的特性传给后代植物体。此种植物体可以以正常方式生长并可与转化有同样遗传因子或者其他遗传因子的植物体进行杂交。所得的杂交个体具有相应的表型特征。
在本发明的一些优选实施方案中,编码细菌毒素的基因从插入植物基因组中的转录单元进行表达。所述转录单元优选是能够稳定整合进入植物基因组并能够对表达mRNA编码蛋白质的经转化植物品系进行筛选的重组载体。
一旦插入的DNA被整合进入基因组,则它在基因组中是相对稳定的(并且不能够再次出来)。正常情况下它含有赋予转化的植物细胞抗杀生物剂或者抗生素的选择性标记,诸如卡那霉素、G418、博来霉素、潮霉素或者氯霉素及其他。各种所应用的标记因此应该允许对转化细胞进行选择而不是对不含有插入DNA的细胞进行选择。目的基因优选地通过组成型或者诱导型启动子在植物细胞中表达。一旦表达,mRNA可以翻译成蛋白质,从而将有关氨基酸掺入到蛋白质中。在植物细胞中表达编码毒素的基因可以在组成型启动子、组织特异型启动子或者诱导型启动子的控制之下。
存在一些用于将外来重组载体导入植物细胞并且得到稳定维持和表达所导入基因的植物的技术。此类技术包括将包被在微颗粒上的遗传物质直接导入细胞(Cornell的美国专利第4,945,050号和现在为DowAgroSciences,LLC的DowElanco的美国专利第5,141,131号)。此外,使用农杆菌属技术将植物进行转化,见University of Toledo的美国专利第5,177,010号;Texas A&M的美国专利第5,104,310号;欧洲专利申请0131624B1;Schilperoot的欧洲专利申请120516,159418B1和176,112;Schilperoot的美国专利申请第5,149,645号,第5,469,976号,第5,464,763号和第4,940,838号和第4,693,976号;Max Planck的欧洲专利申请116718,290799,320500;日本Tobacco的欧洲专利申请604662和627752和美国专利第5,591,616号;现在为Novartis的Ciba Geigy的欧洲专利申请0267159和0292435和美国专利第5,231,019号;Calgene的美国专利第5,463,174号和第4,762,785号和Agracetus的美国专利第5,004,863号和第5,159,135号。其他的转化技术包括whiskers技术。参见Zeneca的美国专利第5,302,523号和第5,464,765号。电穿孔技术也用来转化植物。参见Boyce Thompson Institute的WO 87/06614;Dekalb的美国专利号5,472,869和5,384,253和Plant Genetic Systems的WO 92/09696和WO93/21335。此外,病毒载体也可以用于产生表达目的蛋白质的转基因植物。例如,使用Mycogen Plant Science和现在为Novartis的Ciba-Giegy的美国专利号5,569,597以及Biosource的美国专利第5,589,367号和第5,316,93号1中叙述的方法可以用病毒载体转化单子叶植物。
如先前所提到,DNA构建体导入植物宿主的方式不是本发明的关键。可采用能够提供有效转化的任何方法。例如,本文叙述了植物细胞转化的多种方法,且包括使用Ti或者Ri质粒等进行农杆菌属介导的转化。在很多情况下,希望用于转化的构建体位于T-DNA边界的一侧或者两侧,更加优选是位于右边界。虽然在其他转化模式中可见T-DNA边界的使用,但当此构建体用根癌农杆菌或者毛根农杆菌作为转化模式时特别有用。当使用农杆菌属用于植物细胞转化时,欲被导入宿主的载体用来与存在于宿主中的T-DNA或者Ti或者Ri质粒进行同源重组。可以通过电穿孔、三亲交配和本领域技术人员已知的用于转化革兰氏阴性细菌的其他技术实施载体的导入。载体转化进入农杆菌宿主的方式不是本发明的关键。包含用于重组的T-DNA的Ti或者Ri质粒能够或者不能够导致菌瘿的形成,并且这不是本发明的重点,只要所述此宿主中存在vir基因即可。
在农杆菌属用于转化的一些情况下,位于T-DNA边界中的表达构建体将插入到广谱的载体中,例如在Ditta等人(PNAS美国(1980)77:7347-7351和EPO 0120515,本文引为参考)中所叙述的pRK2或其衍生物。表达构建体和T-DNA中包含如本文所述的一种或多种标记,所述标记允许对转化的农杆菌属和转化的植物细胞进行选择。采用的具体标记不是本发明的本质,优选的标记取决于所使用的宿主和构建体。
对于使用农杆菌转化植物细胞,将外植体和转化的农杆菌属植物结合并孵育足够的时间以便允许其转化。在转化后,通过用恰当的抗生素进行选择将农杆菌杀死,并且将植物细胞在恰当的选择培养基中培养。愈伤组织一旦形成,就可根据植物组织培养和植物再生领域内熟知的方法,通过采用适当的植物激素促进枝条的形成。然而,愈伤组织中间阶段并不总是必要。在枝条形成后,可以将此植物细胞转移至培养基中促进根的形成,从而完成植物的再生。然后将植物培养至结种子,并且所述种子可以用来建立未来的子代。不管转化的技术为何技术,编码细菌毒素的基因优选掺入到基因转移载体上,通过在载体中包含植物启动子调节元件和诸如Nos等的3′非翻译转录终止区,使得该载体适合用来在植物细胞中表达此种基因。
除了用于转化植物的技术有多种外,与外来基因相接触的组织类型同样也会变化。此种组织包括但不限于胚发生组织、I、II和III型愈伤组织、胚轴、分裂组织、根组织、在韧皮部表达的组织等等。使用本文叙述的恰当的技术,几乎所有植物组织在去分化过程中均能够被转化。
如上面所提到,如果期望的话,可使用多种选择标记。对具体标记的优先使用由本领域技术人员判断,但是任何下面的选择标记可以与本文没有列出的但能够作为选择标记起作用的其他基因一起使用。此种选择标记包括但不限于编码对抗生素卡那霉素、新霉素和G418抗性的转座子Tn5的氨基糖苷磷酸转移酶基因(Aph II)和编码对草甘膦、潮霉素、氨甲喋呤、膦丝菌素(双丙氨膦)、咪唑啉酮、磺酰脲和诸如氯磺隆(chlorsulfuron)、溴草腈(bromoxynil)、茅草枯等的三唑嘧啶(triazolopyrimidine)除草剂抗性或者耐受的那些基因。
除了选择标记以外,还可以使用报告基因。在一些例子中,报告基因可以与或者不与选择标记一起使用。报告基因是这样的基因,它一般不存在于受体生物体或者组织中,并且一般编码导致一些表型改变或者酶特性的蛋白质。此种基因的例子在K.Wising等人的Ann.Rev.Genetics,22,421(1988)中有所提供。优选的报告基因包括大肠杆菌uidA基因座的β-葡糖醛酸糖苷酶(GUS)、来源于大肠杆菌Tn9的氯霉素乙酰转移酶基因、来源于发生物荧光的水母维多利亚水母(Aequorea victoria)的绿色荧光蛋白质和来源于萤火虫北美萤火虫(Photifluspyralis)的荧光素酶基因。在该基因被导入受体细胞适当时间后进行检测报告基因表达的分析。如Jefferson等人(1987Biochem.Soc.Trans.15,17-19)所叙述,优选的此种方法是使用编码大肠杆菌uidA基因座的β-葡糖醛酸糖苷酶(GUS)的基因来鉴定转化的细胞。
除了植物启动子调节元件外,可以在植物细胞中有效使用来自多种来源的启动子调节元件来表达外来基因。例如可以使用例如细菌来源的启动子调节元件,诸如章鱼氨酸合酶启动子、胭脂氨酸合酶启动子、marmopine合酶启动子;病毒来源的启动子,诸如花椰菜花叶病毒(35S和19S)、35T(它是一个重新进行工程改造的35S启动子,参见美国专利第6,166,302号,特别是实施例7E)等等。植物启动子调节元件包括但不限于核酮糖-1,6-二磷酸(RUBP)羧化酶小亚基(ssu)、β-伴大豆球蛋白质(conglycinin)启动子、β-菜豆蛋白质启动子、ADH启动子、热休克启动子和组织特异性启动子。还可以存在其他元件诸如基质附着区、支架附着区、内含子、增强子、多聚腺苷酸化序列等等,并且这些元件可以促进转录效率或者DNA整合。虽然此类元件可以通过影响转录、mRNA的稳定性等等来提供更好的DNA表达或者功能,但它们对于DNA功能是必需或不是必需的。此类元件可以如愿地包含在DNA中,以便使已经转化的DNA在植物中有最佳的表现。一般的元件包括但不限于Adh-内含子1、Adh-内含子6、苜蓿花叶病毒衣壳蛋白质前导序列、玉米条纹病毒衣壳蛋白质前导序列以及本领域技术人员可得的其他元件。也可以使用组成型启动子调节元件从而指导基因在所有细胞类型和在所有的时间内持续表达(例如肌动蛋白质、泛素、CaMV 35S等)。组织特异型启动子调节元件负责基因在特异细胞或者组织类型诸如叶或者种子中的表达(例如玉米醇溶蛋白质、油质蛋白质、napin、ACP、球蛋白质等),并且也可以使用这些启动子。
启动子调节元件还可以是在植物发育的一定阶段活化的,也可以是在植物的组织和器官中活化的。此类启动子的例子包括但不限于花粉特异性的、胚特异性的、玉米穗丝特异性的、棉花纤维特异性的、根特异性的、种子胚乳特异性的的启动子调节元件等等。在一定情况下,使用诱导型启动子调节元件是所希望的,该元件负责对特异信号作出反应而使基因表达,其中所述特异信号诸如物理刺激(热休克基因)、光(RUBP羧化酶)、激素(Em)、代谢物、化学品和胁迫。可以使用在植物中有功能的其他目的转录和翻译元件。多种植物特异性的基因转移载体在本领域是已知的。
使用标准的分子生物学技术对本文所述的毒素进行克隆和测序。其他信息可以在Sambrook,J.,Fritsch,E.F.,和Maniatis,T.(1989),MolecularCloning,A Laboratory Manual,Cold Spring Harbor Press中找到,本文将其引为参考。
抗性管理:随着杀虫蛋白在转基因植物中使用的日益商业化,抗性管理成为一件需要考虑的事情。即在其产品中使用苏云金芽孢杆菌毒素的公司有很多,并且对发展成抗B.t.毒素的昆虫存在忧虑。对昆虫抗性管理的一个策略是将由致病杆菌属、光杆状菌属等产生的TC杀虫蛋白与诸如B.t.晶体毒素、来源于芽孢杆菌菌株的可溶性杀虫蛋白(参见例如WO 98/18932和WO 99/57282)或其他杀虫的毒素相结合。这种结合可以是配制成喷雾施用的制剂或者是分子的结合。可以用产生两种或者多种不同杀虫毒素的细菌基因转化植物(参见例如Gould,38Bioscience 26-33(1988)和美国专利第5,500,365号;同样地,欧洲专利申请0400246A1和美国专利第5,866,784号、第5,908,970号;和第6,172,281号也叙述了用两种B.t.晶体毒素进行植物转化)。产生含有多于一种抗昆虫基因的转基因植物的另一种方法是首先产生两种植物,每种植物含有一种抗昆虫的基因。然后使用常规的植物育种技术将这些植物杂交以产生含有多于一种抗昆虫基因的植物。因此,很显然文中所使用的短语“包含多核苷酸”意思是指至少一种多核苷酸(并且可能更多,它们互相接近或者不接近),除非另行指出。
制剂和其他递送系统:所配制的含有本发明细胞和/或蛋白质(包括含有本文所述基因的重组微生物)的诱饵颗粒可以施用于土壤。所配制的产品还可以在作物循环的后期阶段作为种子包被、根处理或者全植物处理而施用。对植物和土壤进行处理的细胞可以采用可湿性粉末、颗粒或者粉剂,通过与多种惰性物质诸如无机矿物质(层状硅酸盐、碳酸盐、硫酸盐、磷酸盐等)或者植物材料(弄成粉末的玉米芯、稻皮、胡桃壳等)混合施用。制剂可以包含展布剂-粘着剂佐剂、稳定剂、其他杀虫添加剂或者表面活性剂。液体制剂可以是基于水的或者非水的并且以诸如泡沫、凝胶、悬液、可乳化原液等应用。成分可以包含流变剂、表面活性剂、乳化剂、分散剂或者聚合体。
本领域技术人员应当理解,杀虫剂的浓度会变化很大,这取决于具体制剂的性质,特别是它是否是浓缩的或者是直接使用的。杀虫剂可以以至少1wt%并且可以以100wt%存在。干制剂可以具有大约1-95wt%的杀虫剂,而液体制剂通常在液相中具有大约1-60wt%的固体。制剂通常具有大约102-104个细胞/毫克。这些制剂可以以每公顷大约50毫克(液体的或干的)至1千克或更多量进行施用。
通过喷雾、撒粉、喷洒等可以将制剂施用于昆虫的环境例如土壤和植物叶上。
另一个递送方案是将毒素的遗传物质掺入到杆状病毒载体中。杆状病毒感染特定的昆虫宿主,包括欲用毒素靶向的那些昆虫宿主。可以将带有毒素表达构建体的感染性杆状病毒引入昆虫侵害的区域,从而使被感染的昆虫中毒或者被毒死。
已知昆虫病毒或者杆状病毒可以感染并且负面影响一些昆虫。病毒对昆虫的影响是慢性的,并且病毒不能够立即阻止昆虫的摄食。因此病毒不是最佳的昆虫害虫控制剂。然而将毒素基因结合到杆状病毒载体中提供了一种传递毒素的有效方式。除此之外,由于不同的杆状病毒对不同的昆虫是特异的,因此有可能使用特定的毒素去选择性靶向特定的破坏性昆虫害虫。对于毒素基因特别有用的载体是核型多角体病毒。已经对使用这种病毒的转移载体进行了叙述并且现在是用于转移外来基因进入昆虫的一种选择。可以将病毒-毒素基因重组体制作成口服转移的形式。杆状病毒通常通过中肠肠粘膜感染昆虫。插入到强的病毒衣壳蛋白质启动子后的毒素基因会表达并迅速杀死被感染昆虫。
除了用于本发明蛋白质毒素的昆虫病毒或杆状病毒或者转基因植物递送系统外,使用苏云金芽孢杆菌包封技术可以将蛋白质进行包封,该技术诸如但不限于美国专利第4,695,455号、第4,695,462号、第4,861,595号,这些在本文引为参考。用于本发明蛋白质毒素的另一个递送系统是将蛋白质配制成诱饵介质,该介质可以在地上和地下昆虫诱饵装置中使用。此种技术的例子包括但不限于PCT专利申请WO 93/23998,其在本文引为参考。
基于植物RNA病毒的系统也可以用于表达细菌毒素。在这种情况下,编码毒素的基因被插入到适当植物病毒的衣壳启动子区域,该病毒可感染目的宿主植物。然后毒素被表达,因此保护植物免受昆虫损害。基于植物RNA病毒的系统叙述于Mycogen Plant Sciences的美国专利第5,500,360号和Biosource Genetics Corp.的美国专利第5,316,931号和第5,589,367号中。
除了产生经转化的植物外,还有其他对细菌基因进行了工程改造的递送系统。例如通过将作为食物源而吸引昆虫的分子与毒素融合在一起而构建成蛋白质毒素。此种具有“内部”诱饵的毒性剂在实验室纯化后可包装在标准昆虫捕捉箱中。
突变体:通过本领域众所周知的操作步骤可以制成细菌分离株的突变体。例如通过对分离株进行甲基磺酸乙酯(EMS)诱变可以得到不产孢子的突变型。通过本领域众所周知的操作步骤使用紫外线和亚硝基胍可以产生突变型。
引入本文所提到或引用的所有专利、专利申请、临时申请和公开完整引为参考,在它们与本说明书的明确指导一致的程度之内均可作为参考。
下面是阐明用于实践本发明的步骤的实施例。这些实施例不应该被解释成对本发明的限制。除非另外注释,所有的百分数是以重量计,并且所有溶剂混合物比例是以体积计。
实施例1基因tcdB2/tccC3 V1的构建
通过特异合成的寡核苷酸片段连接毒素复合体增效剂或来自发光光杆状菌菌株W-14的协同因子基因tcdB2和tccC3的编码区。在多步骤方法中,使用标准分子生物学技术修饰tcdB2基因编码区的3’末端,以去除天然的翻译终止密码子,并且能使tcdB2编码区与其他编码区连接。同样地,对tccC3基因的编码区的5’末端进行基因工程改造以允许与其他编码区连接。然后在pET表达质粒载体(Novagen,Madison WI)中,用这样的方式将两个修正的编码区连接为一个单一的开放阅读框,以便维持适当的细菌转录和翻译信号。质粒命名为pDAB8920。所得的融合编码区盒子的DNA序列如SEQ ID NO:1中所示。TcdB2、接头肽和TccC3的编码区分别由SEQ ID NO:1的核苷酸48-4469、4470-4511和4512-7394表示。SEQ IDNO:2显示了由SEQ ID NO:1中的融合基因编码的多肽。TcdB2、接头肽和TccC3的氨基酸序列分别由SEQ ID NO:2的氨基酸1-1474、1475-1488和1489-2448表示。
tcdB2和tccC3编码区之间的连接寡核苷酸(SEQ ID NO:3)编码多肽接头片段(SEQ ID NO:4)。特别设计接头多肽以便包括几个发明特征。目的是通过非结构的、亲水的、柔性的多肽接头连接TcdB2和TccC3蛋白结构域。不期望此类接头抑制连接的TcdB2和TccC3蛋白的折叠。此外,接头区域被构建为易于蛋白酶解,从而允许TcdB2和TccC3蛋白分离。
特别地,脯氨酸残基基因改造为安排连接到TcdB2和TccC3的接头肽(SEQ ID NO:4)的每一端上。添加脯氨酸残基的目的是将弯曲导入到多肽骨架中,从而暴露它们之间的残基。将特有限制位点插入到与脯氨酸密码子(SEQ ID NO:3)邻近的连接寡核苷酸序列中。Bam HI限制位点编码氨基酸-甘氨酸和丝氨酸。已知甘氨酸能在多肽骨架内引入柔性,能抑制蛋白质内的二级结构。Stu I位点编码上面描述的脯氨酸和精氨酸。氨基酸-丝氨酸和精氨酸-是两种亲水残基。特有限制位点有助于引入额外的连接寡核苷酸。
从毒素复合体蛋白TcdB1中选择Bam HI和Stu I限制位点之间编码的氨基酸接头序列(SEQ ID NO:4的DNKGQTIRT)。因为该序列具有四个期望的特征,所以是优选的。首先,九个编码的氨基酸中的七个是亲水性残基(天冬氨酸(D))、天冬酰胺(N)、赖氨酸(K)、谷氨酰胺(Q)、苏氨酸(T)和精氨酸(R))。亲水性残基确保该片段位于融合蛋白的表面上,并暴露于极性溶剂。其次,序列中包括了推测由蛋白酶-胰蛋白酶裂解的两个位点(KG和RT)。第三,该片段含有已知抑制蛋白质的二级结构的残基(甘氨酸和天冬酰胺)。第四,该序列含有已知将柔性引入肽链的甘氨酸残基。
实施例2用于生物测定的蛋白质来源
A类TC蛋白TcdA和XptA2xwi以异源表达相应基因的荧光假单胞菌(Psudomonas fluorescens)培养物制备的纯化形式使用。B类和C类增效剂,TcdB2和TccC3及新的融合蛋白TcdB2/TccC3V1作为大肠杆菌裂解物的成分测试。通过与TcdB2和TccC3的纯化制备物比较,验证裂解物的用途。对编码TcdB2和TccC3蛋白的阅读框进行基因工程改造,以通过将tcdB2-tccC3顺序的双顺反子操纵子克隆到pET质粒(Novagen,MadisonWI)中而在大肠杆菌中表达。编码和产生分离的(非融合)TcdB2和TccC3蛋白的质粒命名为pDAB3093。每个编码区含有一个具有适当间隔的核糖体结合位点(相对于起始密码子)和终止信号。将一些基因的5′端DNA序列进行修饰以减少预测的RNA二级结构并因此而提高翻译。这些碱基变化是沉默的,不会导致被编码蛋白质中的氨基酸改变。
实施例3表达条件和裂解物制备
使用标准方法,将pET表达质粒(空载体,对照)、pDAB3090和pDAB8920转化进大肠杆菌T7表达菌株BL21(DE3)(Novagen,MadisonWI)中。将10-200个新鲜转化的菌落接种到250mL含有50μg/ml抗生素和75μM IPTG(异丙基-β-D-硫代半乳糖吡喃糖苷)的LB培养基中开始进行表达培养。将培养物于28℃下、180-200转/分钟生长24小时。通过在250ml Nalgene瓶中于4℃、3,400×g离心10分钟收集细胞。将沉淀悬浮于4-4.5mL Butterfield′s磷酸盐溶液(Hardy Diagnostics,SantaMaria,CA;0.3mM磷酸钾pH 7.2)。将悬浮的细胞转移至含有1mL 0.1mm直径玻璃珠(Biospec,Bartlesville,OK,目录号1107901)的50mL聚丙烯螺口离心管中。将细胞-玻璃珠混合液置于冰上冷却,然后将细胞通过超声破碎裂解,超声时使用Branson Sonifier 250(Danbury CT)在约20的输出条件下使用2mm探头进行两次45秒的脉冲,在两个脉冲之间要完全冷却。将裂解物转移至2mL Eppendorf管中并且在16,000×g下离心5分钟。收集上清液并且测量蛋白质的浓度。用H2O将Bio-Rad Protein DyeAssay Reagent以1∶5进行稀释,并且将1mL加入到10μl1∶10稀释的每一个样品中和加入到浓度为5、10、15、20和25μg/mL的牛血清白蛋白质(BSA)中。然后在Shimadzu UV160U分光光度计上(Kyoto,JP),用分光光度计法读取样品,测量在波长595nm下的光密度。然后根据BSA的标准曲线计算出每个样品中的蛋白质的量并且用磷酸盐缓冲液调整至3-6mg/mL。一般测定新鲜制备的裂解物,然而当储存于-70℃时没有观察到活性损失。
实施例4生物测定条件
在特别设计用于昆虫生物测定法的128-孔碟子(C-D International,Pitman,NJ)中对人工饵料上的新生幼虫进行昆虫生物测定。所测定的物种有南方玉米根虫、南方玉米根虫(diabrotica undecimpunctata howardii(Barber))、棉铃虫、美洲棉铃虫(Helicoverpa zea)(Boddie)、和甜菜粘虫、甜菜夜蛾(Spodoptera exigua)(
Hübner)。
生物测定法是通过在控制的环境条件下(28℃,大约40% r.h.,16:8[L:D])培养5天,此时记录处理的昆虫的总数、死亡昆虫数和存活昆虫的重量。
如下测定仅用粗裂解物或与加入的TcdA或者XptA2xwi毒素蛋白质一起的生物学活性。将对照培养物或者那些表达增效剂蛋白质的大肠杆菌粗裂解物(40μL)涂布到生物测定碟子的8个孔中的人工饵料表面。在每个孔中受处理食物的平均表面积为大约1.5cm2。将裂解物调整到3-5mg/mL总蛋白质,并且与或不与TcdA或者XptA2xwi一起应用。所加入的TcdA或者XptA2xwi是来自异源表达所述蛋白质的细菌培养物的高度纯化组分。食物中TcdA和XptA2xwi的终浓度分别为250ng/cm2和50ng/cm2。
实施例5生物测定结果
表V所示为与对照的细胞裂解物和程序表达非融合增效剂TcdB2+TccC3的细胞裂解物相比,进行程序表达融合蛋白TcdB2/TccC3V1的细胞的裂解物的生物测定结果。对这些数据的观察显示,当TcdA(鞘翅目昆虫毒素)和XptA2xwi(鳞翅目昆虫毒素)与对照裂解物混合时,影响可以忽略。应该注意到调节添加到裂解物中的TcdA和TccC3的量以突出TcdB2和TccC3编码基因的增效作用。来自含pDAB3093细胞的裂解物单独不会杀死昆虫。然而,当与TcdA或XptA2xwi混合时,在预期范围内出现显著死亡率。令人惊讶的是,程序产生融合蛋白TcdB2/TccC3V1的细胞裂解物表现出非融合增效剂的类似活性曲线。通过SDS-PAGE进行的多种裂解物的分析表明:pDAB8920样品中存在显著的~280kDa。该带的迁移与TcdB2/TccC3V1的预测分子量一致。在对照或者pDAB3093样品中没有检测到该带。这些结果表明质粒pDAB8920产生了新的融合蛋白TcdB2/TccC3V1,该蛋白质增强了昆虫毒素TcdA和XptA2的活性。
表V.鞘翅目物种和鳞翅目物种对大肠杆菌裂解物和纯化蛋白质的响应。每份重复中使用七到九个昆虫。数据是三个独立的重复。死亡率等级:0=0-20%;+=21-41%;++=41-60%;+++=61-80%;++++=81-100%。 | ||||
样品 | 昆虫物种 | |||
所测试的裂解物 | 南方玉米根虫 | 棉铃虫 | 甜菜粘虫 | |
pET | 对照 | 0 | 0 | 0 |
pET+TcdA | 对照 | 0 | 0 | 0 |
pET+XptA2 | 对照 | 0 | 0 | 0 |
pDAB3093 | TcdB2+TccC3 | 0 | 0 | 0 |
pDAB3093+TcdA | TcdB2+TccC3 | ++++ | 0 | 0 |
pDAB3093+XptA2 | TcdB2+TccC3 | 0 | ++++ | ++++ |
PDAB8920 | TcdB2/Tcc3V1 | 0 | 0 | 0 |
PDAB8920+TcdA | TcdB2/Tcc3V1 | ++++ | + | + |
PDAB8920+XptA2 | TcdB2/Tcc3V1 | 0 | ++++ | ++++ |
实施例6TcdB2+TccC3和TcdB2/TccC3V1与XptA2的结合
制备异源表达TcdB2+TccC3复合体的纯化样品和TcdB2/TccC3V1融合蛋白。使用BiaCore 3000仪器,通过表面等离振子共振(SPR)谱学,测量TcdB2+TccC3复合体(非融合)和TcdB2/TccC3 V1融合蛋白与XptA2的结合。简单来说,将在10mM,pH 4.8的乙酸钠中的高度纯化(0.05mg/ml)XptA2偶联到已经用N-羟基琥珀酰亚胺和N-乙基-N’-(二甲基氨丙基)碳二亚胺活化的CM4芯片上,以达到固定化2000共振单位(RU)。固定化后,用pH 8.5的1M盐酸乙醇胺封闭剩余的活性胺基团。通过在芯片上以30μL/分钟的流速流动200μL的100nM TcdB2+TccC3或者25nM 8920融合蛋白(溶解在10mM HEPES pH 7.4、150mM NaCl和0.005%表面活性剂P20中)测定与XptA2的结合。测定RU的变化,变化速率与非线性回归曲线拟合,以得到TcdB2+TccC3复合体或者TcdB2/TccC3V1融合蛋白与XptA2结合的速率。令人惊讶地,TcdB2/TccC3V1融合蛋白的结合速率(ka=1.03×106)比TcdB2+TccC3复合体的结合速率(ka=4.49×104)大至少20倍。也就是,TcdB2/TccC3V1融合蛋白比非融合TcdB2+TccC3复合体更快地结合XptA2。一旦结合,两种制剂都不容易从XptA2上解离。
实施例7其他的TcdB2/TccC3融合蛋白
使用标准分子生物学技术,构建TcdB2和TccC3编码区之间的其他融合基因。所有构建体在实施例1中描述的pET表达质粒中制备,具有适当的细菌转录和翻译信号。总共产生和测试了六种不同的TcdB2/TccC3融合。表VI显示了用于融合蛋白的TcdB2/接头/TccC3蛋白连接序列。为了清楚,融合蛋白将用来自编码质粒的数字标记称呼。例如,实施例1中描述的融合蛋白TcdB2/TccC3V1是由质粒pDAB8920编码,将被称为蛋白质8920(表VI)。除了单一融合蛋白(8563),所有融合蛋白含有TcdB2和TccC3的编码区全长。在已经删除编码最初的21个氨基酸的DNA的情形下,蛋白质8563含有TccC3编码区的截短形式。TcdB2和TccC3编码区之间的接头区域长度在编码0-93个氨基酸之间变化。表VI列出了融合蛋白表达质粒、基因和蛋白质名称、编码区、目的蛋白质片段和有关的SEQ ID NO。表VII列出了包括表VI中描述的融合蛋白的接头片段的连接物。下面给出了各种融合蛋白的简单描述。
表VI.融合蛋白序列信息 | DAS-118XC1 | |||||||||
质粒 | 基因名称 | DNASEQ IDNO: | 编码区(终止密码子除外;核苷酸) | 编码的融合蛋白 | 蛋白质SEQ IDNO: | TcdB2片段(AA残基) | 接头片段(AA残基) | TccC3片段(AA残基) | XptA2片段(AA残基) | |
pDAB8563 | 8563 | 45 | 48-7295 | 8563 | 46 | 1-1474 | 1475-1477 | 1478-2416 | NA | |
pDAB8564 | 8564 | 47 | 48-7349 | 8564 | 48 | 1-1474 | 无 | 1475-2434 | NA | |
pDAB8940 | 8940 | 49 | 48-7364 | 8940 | 50 | 1-1474 | 1475-1479 | 1480-2439 | NA | |
pDAB8920 | 8920 | 51 | 48-7391 | 8920 | 52 | 1-1474 | 1475-1488 | 1489-2448 | NA | |
pDAB8921 | 8921 | 53 | 48-7463 | 8921 | 54 | 1-1474 | 1475-1512 | 1513-2472 | NA | |
pDAB8923 | 8923 | 55 | 48-7628 | 8923 | 56 | 1-1474 | 1475-1567 | 1568-2527 | NA | |
pDAB8951 | 8951 | 57 | 21-7436 | 8951 | 58 | 999-2472 | 961-998 | 1-960 | NA | |
pDAB8811 | 8811 | 59 | 34-15018 | 8811 | 60 | 2548-4021 | XptA2/TcdB22539-2547TcdB2/TccC34022-4035 | 4036-4995 | 1-2538 |
表VII.融合蛋白结合的序列 | ||
蛋白质 | 接头大小(AA) | 不同毒素复合体融合蛋白的接头(下划线)和邻近的蛋白质序列 |
8563 | 3;21AATccC3deletion | TcdB2 >DENDTAAEVKKVK>Linker > PGS>TccC3 >GLIIRNIDF> |
8564 | 0 | TcdB2 >DENDTAAEVKKVKM>Linker NoneTccC3 >MKNIDPKLYQKTPTVSVYDNRGLIIRNIDF> |
8940 | 5 | TcdB2 >DENDTAAEVKKVKM>Linker > PGSRP>TccC3 >MKNIDPKLYQKTPTVSVYDNRGLIIRNIDF> |
8920 | 14 | TcdB2 >DENDTAAEVKKVKM>Linker > PGSDNKGQTIRTRP> |
TccC3 >MKNIDPKLYQKTPTVSVYDNRGLIIRNIDF> | ||
8921 | 38 | TcdB2 >DENDTAAEVKKVKM>Linker> PRLDRAADITTQNAHDSAIVALRQNIPTPAPLSLRSRP> |
TccC3 >MKNIDPKLYQKTPTVSVYDNRGLIIRNIDF> | ||
8923 | 93 | TcdB2 >DENDTAAEVKKVKM>Linker> PGSEAYADTHVYDPIGREIKVITAKGWFRRTLFTPWFTVNEDENDTA> |
Linker> AEVKKVKMPRLDRAADITTQNAHDSAIVALRQNIPTPAPLSLRSRP> | ||
TccC3 >MKNIDPKLYQKTPTVSVYDNRGLIIRNIDF> | ||
8951 | 38 | TccC3 >DAEISFLTTI PLKNVKPHKR>Linker> PRLDRAADITTQNAHDSAIVALRQNIPTPAPLSLRSRP> |
TcdB2 >MQNSQDFSITELSLPKGGGA> | ||
8811 | 9 | XptA2 >KALLESLSDIILHIRYTIRS>Linker > PRDRTRPTS> |
TcdB2 >MQNSQDFSITELSLPKGGGA> | ||
8811 | 14 | TcdB2 >WFTVNEDENDTAAEVKKVKM>Linker > PGSDNKGQTIRTRP> |
TccC3 >MKNIDPKLYQKTPTVSVYDN> |
质粒pDAB8563编码由通过三氨基酸接头(PGS)融合到截短TccC3编码区(删除了TccC3氨基酸1-21)的整个TcdB2编码区组成的融合蛋白8563。编码蛋白质8563的基因的DNA序列如SEQ ID NO:45中所示。蛋白质8563的氨基酸序列如SEQ DI NO:46中所示。
质粒pDAB8564编码融合蛋白8564,其由直接融合到TccC3的整个编码区上的整个TcdB2编码区组成。没有组成接头序列的其他氨基酸。编码蛋白质8564的基因的DNA序列如SEQ ID NO:47中所示。蛋白质8564的氨基酸序列如SEQ DI NO:48中所示。
质粒pDAB8940编码融合蛋白8940,其由通过五个氨基酸接头直接融合到TccC3的整个编码区上的整个TcdB2编码区组成。编码蛋白质8940的基因的DNA序列如SEQ ID NO:49中所示。蛋白质8940的氨基酸序列如SEQ DI NO:50中所示。
质粒pDAB8920编码融合蛋白8920,其由通过十四个氨基酸接头直接融合到TccC3的整个编码区上的整个TcdB2编码区组成。一部分接头序列(DNKGQTIRT)来自实施例1中描述的光杆状菌属蛋白TcdB1。编码蛋白质8920的基因的DNA序列如SEQ ID NO:51中所示。蛋白质8920的氨基酸序列如SEQ DI NO:52中所示。
质粒pDAB8921编码融合蛋白8921,其由通过38个氨基酸接头直接融合到TccC3的整个编码区上的整个TcdB2编码区组成。该接头序列的36个氨基酸(
PRLDRAADITTQNAHDSAIVALRQNIPTPAPLSLRS)来自光杆状菌属蛋白TcdA1。编码蛋白质8921的基因的DNA序列如SEQ IDNO:53中所示。蛋白质8921的氨基酸序列如SEQ DI NO:54中所示。
质粒pDAB8923编码融合蛋白8923,其由通过93个氨基酸接头直接融合到TccC3的整个编码区上的整个TcdB2编码区组成。该接头有三个片段。紧接TcdB2编码区的第一个片段是三个氨基酸的片段(PGS)。第二个片段是TcdB2的最后52个氨基酸的重复(EAYADTHVYDPIGREIKVITAKGWFRRTLFTPWFTVNEDENDTAAEVKKVKM)。第三个片段是上述蛋白质8921的38个氨基酸接头。编码蛋白质8923的基因的DNA序列如SEQ ID NO:55中所示。蛋白质8923的氨基酸序列如SEQ DI NO:56中所示。
实施例8TcdB2/TccC3融合蛋白8563、8564、8940、8920、8921和8923的表达和生物测定
表达条件和裂解制备如实施例3和4所述。用tri-甘氨酸SDS-PAGE(Cambrex,Walkersville MD)对融合蛋白分析,表明在预期的分子量范围内存在明显的考马斯蓝染色条带(约270-285kDa),这些高分子量条带在对照裂解物中不存在。采用有微小改变的上述方法对裂解物进行生物测定,用调节至67或者133ng/cm2的XptA2对裂解物进行测定。来自这些测定的结果表现为生长抑制。生长抑制百分数是如下计算:生长抑制(%)=100X(处理昆虫的平均重量)/对照昆虫的平均重量)。
表VIII.单独用大肠杆菌裂解物及其和纯化的XptA2蛋白质一起喂养的棉铃虫的生长。在每一次重复中使用8只昆虫/处理,重复2-3次生物测定法。生长抑制等级:0=0-20%;+=21-40%;++=41-60%;+++=61-80%;++++=81-100%。 | ||||
样品 | 所测试的裂解物 | XptA2浓度(ng/cm2) | ||
0 | 67 | 133 | ||
pET | 对照 | 0 | 0 | 0 |
pDAB3093 | TcdB2+TccC3 | 0 | ++++ | ++++ |
pDAB8563 | 8563 | 0 | + | ++ |
pDAB8564 | 8564 | 0 | ++++ | ++++ |
pDAB8940 | 8940 | 0 | ++++ | ++++ |
pDAB8920 | 8920 | 0 | ++++ | ++++ |
pDAB8921 | 8921 | 0 | ++++ | ++++ |
pDAB8923 | 8923 | 0 | ++++ | ++++ |
表VIII中所示的生物测定结果表明,当与XptA2结合时,TcdB2/TccC3融合蛋白具有高的增强活性。融合蛋白裂解物8764(0aa接头)、8940(5aa接头)、8920(14aa接头)、8921(38aa接头)和8923(93aa接头)在性质上是与非融合TcdB2+TccC3构成的3093裂解物等效的。8963(3aa接头,TccC3的21aa缺失)虽然表现出实质上的增强活性,但是显示出比其他裂解物更低的效力。
这些数据清楚地表明,由B型和C型增强剂组成的毒素复合体融合蛋白可以被融合产生新的融合蛋白。该融合蛋白增强A型蛋白质的抗虫活性。融合可以是全长蛋白质的缺失,或者可以是全长蛋白质直接融合或者通过长达93个氨基酸的接头融合。
实施例9融合蛋白8951TccC3/TcdB2的构建和测试
实施例1-8描述和记载了多种毒素复合体融合蛋白基因的构建和测试。由这些基因编码的融合蛋白由从氨基端到羧基端顺序的TcdB2/TccC3组成,用各种连接键连接起来。在本实施例中,描述了另外一种毒素复合体融合蛋白基因的构建。这种新构建体编码具有相反顺序的融合蛋白,即,从氨基端到羧基端顺序的TccC3/TcdB2。
编码阅读框的TccC3/TcdB2的基因构建是一个多步骤过程。在第一个步骤中,通过在tccC3基因的3’末端添加一个合成的DNA片段来修饰tccC3基因。该合成片段编码接头序列并提供特有的限制位点,以便允许在第二个步骤中提供TcdB2编码区的连接。在第三个步骤中,新构建的融合蛋白编码基因被转移至实施例1中描述的pET表达质粒中。所得到的表达质粒称为pDAB8951,编码该融合蛋白的基因称为8951,表示在SEQ IDSEQ:57中。由通过38个氨基酸接头融合到TcdB2上的TccC3组成的编码融合蛋白称为8951,表示在SEQ ID NO:58中。表VI中描述了TccC3、接头和TcdB2氨基酸片段。TccC3、接头和TcdB2之间的连接如表VII中所示。
实施例10融合蛋白8951的表达和生物测定
实施例8描述了融合蛋白8951的表达、裂解物制备和生物测定。实施例8中也描述了,SDS-PAGE分析表明了与融合蛋白的期望分子量相对应的考马斯蓝染色条带。表IX表明了两个大肠杆菌克隆表达的融合蛋白8951的生物测定结果。
表IX.单独用大肠杆菌裂解物及其和纯化的XptA2蛋白质一起喂养的棉铃虫的生长。对所测试的每一克隆重复两次生物测定法,在每一次重复中使用8只昆虫/处理。生长抑制等级:0=0-20%;+=21-40%;++=41-60%;+++=61-80%;++++=81-100% | ||||
样品 | 所测试的裂解物 | XptA2浓度(ng/cm2) | ||
0 | 200 | 400 | ||
pET | 对照 | 0 | 0 | 0 |
pDAB8951-1 | 8951克隆1 | 0 | ++++ | ++-+ |
pDAB8951-2 | 8951克隆2 | 0 | ++++ | ++-+ |
实施例11编码三联融合蛋白8811(XptA2/TcdB2/TccC3)的基因的构建
下面的实施例涉及三个编码区之间翻译融合的构建和测试。致病杆菌属蛋白质XptA2(A类蛋白)的编码区通过8920双融合(tcdB 2/tccC3)融合到光杆状菌属TcdB2(B类蛋白)和TccC3(C类蛋白)编码区上,以产生三联融合基因xptA2/tccB2/tccC3。该新的三联融合基因称为8811(SEQ ID NO:59),编码多肽8811(SEQ ID NO:60)。含有8811融合蛋白的裂解物表现出优秀的功能活性。本发明将植物和其他生物中表达所需要的转录控制序列的数量降低了三分之二,并且消除了与分离的完整基因转化相关的缺点。本发明也提供了维持相互作用的蛋白质物理上和时间上翻译同步机理,尤其是真核细胞中。
使用标准分子生物学技术,在多步骤过程中修饰毒素复合体毒素XptA2的编码区的3’末端。同样地,修饰8920编码区的5’末端。两个修饰后的编码区通过合成的核苷酸接头连接以产生单个开放阅读框。由XptA2、TcdB2和TccC3编码区组成的融合基因,在修饰的lac启动子控制下,通过基因工程进入大肠杆菌表达质粒。构建以这样的方式进行,以便维持适当的细菌转录和翻译信号。该质粒命名为pDAB8811。融合的编码区盒的DNA序列如SEQ ID NO:59中所示。该盒的长度为15036个核苷酸,含有XptA2(nts 34-7647)、XptA2/TcdB2接头肽(nts 7648-7674)、TcdB2(nts 7675-12096)、TcdB2/TccC3接头肽(nts 12907-12138)和TccC3(nts 12139-15018)的编码区。SEQ ID NO:59中的融合基因编码的多肽如SEQ ID NO:60中所示。预测该融合蛋白含有4995个氨基酸,具有XptA2(残基1-2538)、XptA2/TcdB2接头肽(残基2539-2547)、TcdB2(残基2548-4021)、TcdB2/TccC3接头肽(残基4022-4035)和TccC3(残基4036-4995)代表的片段。DNA和三联融合的蛋白质片段的概括示于表VI。表VII所示为两个接头(XptA2/TcdB2和TcdB2/TccC3)的氨基酸序列。
实施例12pDAB8811的表达条件和裂解物制备
使用标准方法将表达质粒pBT(2003年1月7日提交的美国申请序列号10/754,115中描述的空载体对照)、pDAB8812(仅含有XptA2编码区)和pDAB8811(含有8811编码区)转化进大肠杆菌表达菌株BL21(Novagen,Madison WI)。用10-200个新鲜转化菌落接种到200mL含有50μg/ml抗生素和75μM IPTG(异丙基-α-D-硫代半乳糖吡喃糖苷)的LB培养基中,起动表达培养。将培养物于28℃下、180-200转/分钟生长24小时。通过在250ml Nalgene瓶中于4℃、3,400×g离心10分钟收集细胞。将沉淀悬浮于4-4.5mL Butterfield′s磷酸盐溶液(Hardy Diagnostics,Santa Maria,CA;0.3mM磷酸钾pH 7.2)。将悬浮的细胞转移至含有1mL0.1mm直径玻璃珠(Biospec,Bartlesville,OK,目录号1107901)的50mL聚丙烯螺口离心管中。将细胞-玻璃珠混合液置于冰上冷却,然后将细胞通过超声破碎裂解,超声时使用Branson Sonifier 250(Danbury CT)在约30的输出条件下使用2mm探头进行两次45秒的脉冲,在两个脉冲之间要完全冷却。将裂解物转移至2mL Eppendorf管中并且在16,000×g下离心5分钟。采用上面描述的SDS-PAGE进行裂解物分析,表明-与三联融合8811蛋白质对应的8811裂解物中存在大于500kDa的考马斯蓝染色条带,这在对照组或XptA2裂解物中不存在。收集上清,用于生物测定。
实施例13三联融合8811裂解物的生物测定条件
在特别设计用于昆虫生物测定法的128-孔碟子(C-D International,Pitman,NJ)中对人工饵料上的新生幼虫进行昆虫生物测定。测定物种是棉铃虫、美洲棉铃虫(Boddie)。生物测定法是通过在控制的环境条件下(28℃,大约40% r.h.,16∶8[L∶D])培养5天,此时记录处理的昆虫的总数、死亡昆虫数和存活昆虫的重量。如下测定粗裂解物的生物活性。将对照培养物或者表达三联融合蛋白8811的那些培养物的粗大肠杆菌裂解物(40μL)涂布到生物测定碟的8个孔中的人工饵料的表面。每孔中被处理食物的平均表面积为大约1.5cm2。
实施例14三联融合8811裂解物的生物测定结果
表X表明了与对照裂解物(空载体)相比,仅程序化表达蛋白质XptA2或者表达融合蛋白8811的细胞裂解物的生物测定结果。这些数据表明程序化表达三联融合8811的细胞制备的裂解物严重限制了昆虫的生长。这些数据清楚地表明进行程序化表达三联融合蛋白8811的裂解物比仅程序化表达XptA2蛋白质的裂解物更加有效。
表X.棉铃虫(美洲棉铃虫(Boddie)对表达毒素复合体蛋白的大肠杆菌裂解物的响应。 | ||
样品 | 测试的裂解物 | 棉铃虫的生长抑制 |
pBT280 | 空载体对照 | 0 |
pDAB8812 | XptA2 | 0 |
pDAB8811克隆1 | 8811 | ++++ |
pDAB8811克隆2 | 8811 | ++++ |
pDAB8811克隆3 | 8811 | ++++ |
pDAB8811克隆4 | 8811 | ++++ |
对每一样品,测试两份独立表达培养物裂解物。每次测试使用8只昆虫。生长抑制等级:0=0-20%;+=21-40%;++=41-60%;+++=61-80%;++++=81-100%
实施例15TcdB2/TccC3融合蛋白与固定化XptA2的结合
为了确定多种TcdB2/TccC3融合蛋白与XptA2蛋白质相互作用的相对亲和性,使用上面描述的标准胺偶联技术将XptA2固定化到CM5芯片上。通过Biacore 3000SPR分光计测定表面等离振子体共振(SPR)来确定结合作用,结合水平以共振单位(RU)度量。大约5000RU的XptA2固定化到芯片上。从进行程序化表达融合蛋白8920、8921、8923和8940的大肠杆菌培养物制备裂解物。以1∶10稀释裂解物,该裂解物以30微升/分钟的速率在固定化XptA2蛋白质上流动200秒。此时,停止细胞裂解物的流动,在XptA2蛋白质上仅流动缓冲溶液。从细胞裂解物到缓冲液的转换仅允许结合的TcdB2/TccC3融合蛋白从XptA2上解离。从细胞裂解物转换为仅仅缓冲液之后,测定解离200秒,表示为在200秒的细胞裂解物流动之后立即测量的RU和仅流动缓冲溶液200秒之后测量的RU之间的差异。表XI所示为这些实验的结果。含有TcdB2/TccC3融合蛋白的所有四种可溶裂解物紧紧地结合到固定化的XptA2上,在733-836RU之间。在结合之后发生了非常小的解离(17.9-21.4RU)。
表XI. | ||
分析物 | 200秒之后的结合(RU) | 200秒之后的解离(RU) |
pET裂解物(对照组) | 23.6 | 14.1 |
8940(5aa接头) | 830.6 | 17.9 |
8920(14aa接头) | 836.4 | 19.0 |
8921(38aa接头) | 764.9 | 18.8 |
8923(93aa接头) | 733.0 | 21.4 |
实施例16比较性非融合和融合活性以及纯化的TcdB2/TccC3融合蛋白8920的结合研究
与非融合tcdB2+TccC3蛋白复合体相比,为了进行更加充分表征TcdB2/TccC3融合蛋白(本文称为8920蛋白质)的活性,从程序性异源表达这些蛋白质的细菌培养物纯化融合蛋白或复合体。然后用加入的A类蛋白(XptA2或TcdA)进行纯化样品的生物测定。此外,通过表面等离振子体共振测定这两份样品结合固定化XptA2的能力。
纯化产生8920融合蛋白或者TcdB2+TccC3复合体的重组大肠杆菌细胞的2升培养物培养过夜,离心细胞,-80℃冷冻细胞沉淀用于储存。在冷水中快速解冻细胞沉淀,悬浮在250mL的50mM Tris-HCl pH 8.0、0.10M NaCl、1mM DTT、10%甘油和溶菌酶(0.6mg/mL)中。加入少量玻璃珠(0.5mm,Biospec,Bartlesville,OK,目录号1107901),轻轻地振荡溶液以促进悬浮。然后通过超声处理,以最大输出功率(带有一个微型探针的Branson Sonifier Model 250),以大约50mL的批量破坏细胞,使用冰浴使裂解物保持在低温。然后以48,000×g在4℃下离心破裂细胞60分钟。收集上清,加入Sigma Chemical Company(St.Louis,MO;目录号P2714)的4.0mL的普遍蛋白酶抑制剂。用冷却的蒸馏水将溶液稀释2倍,然后上样到Q Sepharose XL阴离子交换柱(1.6cm×10cm)上。首先用250mL的25mM Tris-HCl,pH 8.0、+50mM NaCl洗涤结合的蛋白质,然后用50mM Tris HCl pH8.0+300mM NaCl(250mL)洗脱。用25mMTris-HCl,pH 8.0将洗脱的蛋白质溶液透析过夜,然后上样到Mono Q 10/10阴离子交换柱(1cm×10cm)上。用在15个柱体积的25mM Tris-HCl,pH 8.0中的0到500mM NaCl梯度2mL/分钟洗脱蛋白质,得到3mL组分。以大约120mM NaCl洗脱含有8920融合蛋白(或TcdB2+TccC3复合体)的组分。将这些组分合并、稀释并且再次上样到Mono Q 10/10柱上,用在25mM Tris-HCl,pH 8.0中的0到300mM NaCl的微弱梯度如前洗脱,但得到2mL组分。合并含有8920融合蛋白(或TcdB2+TccC3复合体)的组分,并且浓缩到大约1.0mL,上样到Superose 200大小的排阻柱(1.6cm×60cm)上,在50mM Tris-HCl,pH 8.0、100mM NaCl、5%甘油、0.05%Tween-20平衡。以1.0mL/分钟的流速洗脱蛋白质。合并与8920融合蛋白或者TcdB2+TccC3复合体相应的组分,通过SDS-PAGE分析以确定它们的身份和纯度。
昆虫生物测定这些研究中使用的棉铃虫(CEW,美洲棉铃虫(Boddie))是由North Carlina State University(Raleigh,NC)的昆虫饲养所以卵的形式提供。南方玉米根虫(SCR,Diabrotica undcimpunctatahowardi)卵由FrenchAg Research,Lamberton,MN或CropCharacteristics,Inc.,Farmington,MN提供。清洗这些卵,且保持在24℃和50%RH下,直到它们孵出。人工饵料含有2-4%粉状固体,如大豆面粉、酵母、麦胚、酪蛋白质、糖、维生素和胆固醇,它们悬浮在水介质中的1.0-2.0%溶解琼脂中。对于生物测定,蛋白质或者蛋白质复合体以3倍或4倍增量稀释到10mM,pH 7.0的磷酸钠缓冲液中,浓度范围是每cm2上500-0.48ng蛋白质,然后用于人工饵料的表面。每个浓度分别测试8个重复,通过将新出现的幼虫放置到经过处理的食物上,并且将试验保持在28℃下5天。在一些试验中,除了记录昆虫的死亡率或发育延缓外,在每个时间段的结束时测定幼虫的重量。死亡幼虫计分为零重量。
结合作用测定使用BiaCore 3000仪器,通过表面等离振子体共振(SPR),测定TcdB2+TccC3和8920融合蛋白与XptA2的结合。简单来说,将在10mM,pH 4.8的乙酸钠中的高度纯化(0.05mg/ml)XptA2偶联到已经用N-羟基琥珀酰亚胺和N-乙基-N’-(二甲基氨丙基)碳二亚胺活化的CM4芯片上(按照制造商的说明书),以达到固定化2000共振单位(RU)。在XptA2固定化后,用pH 8.5的1M盐酸乙醇胺封闭剩余的活性胺基团。通过在芯片上以30μL/分钟的流速流动200μL的100nM TcdB2+TccC3或者25nM 8920融合蛋白(溶解在10mM HEPES pH 7.4、150mM NaCl和0.005%表面活性剂P20中)测定与XptA2的结合。测定RU的变化,变化速率与非线性回归曲线拟合,以得到TcdB+TccC3或者8920融合蛋白与XptA2的结合速率。
生物测定结果表XII的A栏和B栏显示了与TcdB2+TccC3复合体相比,8920融合蛋白加强A类蛋白XptA2对CEW幼虫的增加效果。在这些实验中,XptA2的浓度保持恒定在250ng/cm2。A栏所示为XptA2+TcdB2+TccC3复合体的杀死/延缓活性。这些数据表明TcdB2+TccC3杀死/延缓活性显著降低到浓度低于7.8ng/cm2。相反,B栏显示XptA2+8920TcdB/TccC3融合蛋白复合体更有效的杀死/延缓活性。在该情况下,XptA2+8920组合以1.9ng/cm2的8920融合蛋白有效地引起所有测试幼虫的发育延缓。令人惊讶地,这些数据表明8920融合蛋白的有效性至少是非融合亲本蛋白质TcdB2+TccC3的4倍。
表XII.在存在浓度增加的纯化TcdB2+TccC3复合体(A栏)或者TcdB2/TccC3融合蛋白8920(B栏)的情况下,XptA2(250ng/cm2)的杀虫活性的证明 | |||||||
A栏 | B栏 | ||||||
TcdB2+TccC3的浓度 | 棉铃虫幼虫 | 8920融合物的浓度 | 棉铃虫幼虫 | ||||
(ng/cm2) | 死亡 | 发育延缓 | 总计 | (ng/cm2) | 死亡 | 发育延缓 | 总计 |
500 | 8 | 0 | 8 | 500 | 8 | 0 | 8 |
125 | 6 | 2 | 8 | 125 | 6 | 2 | 8 |
31.2 | 0 | 8 | 8 | 31.2 | 0 | 8 | 8 |
7.8 | 1 | 7 | 8 | 7.8 | 0 | 8 | 8 |
1.9 | 0 | 2 | 8 | 1.9 | 0 | 8 | 8 |
0.48 | 0 | 1 | 8 | 0.48 | 0 | 0 | 8 |
使用加入到500ng/cm2的TcdA(针对SCR测试)或者XptA2(针对CEW测试)中的多个浓度地8920融合蛋白进行针对SCR和CEW幼虫的进一步生物测定。结果如表XIII所示。这些数据清楚地表明,即使低浓度的8920融合蛋白在增强500ng/cm2的TcdA或XptA2中是非常有效的。
表XIII.TcdB2/TccC3融合蛋白8920增强TcdA(针对南方玉米根虫测定)和XptA2(针对棉铃虫测定)的有效性证明。多种浓度的8920加入到500ng/cm2的TcdA或XptA2中。给出了8只昆虫幼虫的总重量。 | ||||||||
8920融合物的浓度 | 南方玉米根虫幼虫 | 棉铃虫幼虫 | ||||||
(ng/cm2) | 死亡 | 发育延缓 | 总计 | 重量 | 死亡 | 发育延缓 | 总计 | 重量 |
300 | 8 | 0 | 8 | 0 | 8 | 0 | 8 | 0 |
100 | 8 | 0 | 8 | 0 | 8 | 0 | 8 | 0 |
33 | 8 | 0 | 8 | 0 | 8 | 0 | 8 | 0 |
11 | 7 | 1 | 8 | 0.01 | 3 | 5 | 8 | 0.8 |
3.7 | 1 | 0 | 8 | 0.5 | 2 | 6 | 8 | 0.9 |
1.2 | 2 | 0 | 8 | 0.6 | 2 | 6 | 8 | 1.8 |
0.4 | 0 | 0 | 8 | 1.4 | 4 | 4 | 8 | 0.9 |
0 | 0 | 0 | 8 | 2.2 | 0 | 3 | 8 | 84.3 |
结合结果:通过SPR将TcdB2+TccC3与XptA2的结合速率和TcdB2+TccC3与8920融合蛋白的结合速率比较。图1所示为读出图。8920融合蛋白的结合速率(ka=1.03×106)比TcdB2+TccC3的结合速率(ka=4.49×104)大至少20倍。一旦结合,两种蛋白质都不易从XptA2上解离。与XptA2+TcdB2+TccC3相比,8920融合蛋白与XptA2增加的结合速率期望增加XptA2+8920复合体的有效性。该期望与该实施例中上面所示的观察一致(表XIII),也就是,较低浓度的8920融合蛋白对增强A类蛋白XptA2是必须的。
实施例17编码三联融合蛋白8836(TcdB2/TccC3/XptA2xwi)的基因的构建
该实施例和实施例18-20涉及三个编码区之间的翻译融合的构建和测试。光杆状菌属TcdB2(B类蛋白)和TccC3(C类蛋白)的8920(tcdB2/tccC3)双融合的编码区进一步融合到致病杆菌属蛋白质XptA2xwi(A类蛋白)的编码区上,以产生三联融合基因tcdB2/tccC3/xptA2xwi。该新的三联融合基因称为8836(SEQ ID NO:67),编码多肽8836(SEQ IDNO:68)。该融合蛋白与上面的实施例14中描述的8811三联融合蛋白XptA2xwi/TcdB2/TccC3不同,不同之处在于与单个蛋白质对应的编码区的次序已经改变。含有8836融合蛋白的裂解物表现出良好的功能活性。本发明将植物和其他生物中表达所需要的转录控制序列的数量降低了三分之二,并且消除了与分离的完整基因转化相关的缺点。本发明也提供了维持相互作用的蛋白质物理上和时间上翻译同步机理,尤其是真核细胞中。此外,该实施例表明可以改变初级转录产物内A、B和C类蛋白对应的编码区的顺序,而不会干扰翻译出的融合蛋白的最终活性。
使用标准分子生物学技术,在多步骤过程中修饰毒素复合体A类蛋白XptA2xwi编码区的5’末端。同样地,修饰8920编码区的3’末端。两个修饰后的编码区通过合成的核苷酸接头连接以产生单个开放阅读框。由相连的TcdB2、TccC3和XptA2xwi编码区组成的融合基因通过基因工程改造为pET表达质粒载体(Novagen,Madison WI)中的单个开放阅读框。构建以这样的方式进行以便维持适当的细菌转录和翻译信号,所得的质粒命名为pDAB8836。融合的编码区盒的DNA序列如SEQ ID NO:67中所示。该盒的长度为15067个核苷酸,含有TcdB2(nts 48-4469)、TcdB2/TccC3接头肽(nts 4470-4511)、TccC3(nts 4512-7391)、TccC3/XptA2xwi接头肽(nts7392-7436)和XptA2xwi(nts 7437-15050)的编码区。SEQ ID NO:67中的融合基因编码的多肽在SEQ ID NO:68中所示。预测该融合蛋白含有5001个氨基酸,具有TcdB2(残基1-1474)、TcdB2/TccC3接头肽(残基1475-1488)、TccC3(残基1489-2448)、TccC3/XptA2xwi接头肽(残基2449-2463)和XptA2xwi(残基2464-5001)代表的片段。
实施例18pDAB8836的表达条件和裂解物制备
A类TC蛋白XptA2xwi以从异源表达该基因的荧光假单胞菌(Pseudomonasfluorescens)的培养物制备而来的纯化形式使用。使用标准方法将表达质粒pET(空载体对照)、pDAB8920和pDAB8836转化到大肠杆菌T7表达菌株BL21(DE3)Star中(Novagen,Carlsbad,CA)。将10-200个新鲜转化的菌落接种到250mL含有50μg/ml抗生素和75μM IPTG(异丙基-β-D-硫代半乳糖吡喃糖苷)的LB培养基中开始进行表达培养。将培养物于28℃下、180-200转/分钟生长24小时。通过4℃、5,000×g离心20分钟收集细胞。将沉淀悬浮于4-4.5mL Butterfield′s磷酸盐溶液(HardyDiagnostics,Santa Maria,CA;0.3mM磷酸钾pH 7.2)。将悬浮的细胞转移至含有1mL 0.1mm直径玻璃珠(Biospec,Bartlesville,OK,目录号1107901)的50mL聚丙烯螺口离心管中,置于冰上冷却,然后将细胞通过超声破碎裂解,超声时使用Branson Sonifier 250(Danbury CT)在约30的输出条件下使用2mm探头进行两次45秒的脉冲,在两个脉冲之间要完全冷却。将裂解物转移至2mL Eppendorf管中并且在16,000×g下离心5分钟。用如上所述的SDS-PAGE分析裂解物,表明考马斯蓝-染色条带大于8836裂解物中存在的500kDa,对应于三联融合8836蛋白质(计算出大小为560.6kDa)。在对照细胞的裂解物中不存在高分子量条带。
实施例19三联融合8836裂解物的生物测定条件
在特别设计用于昆虫生物测定法的128孔碟子(C-D International,Pitman,NJ)中对人工饵料上的新生棉铃虫幼虫(美洲棉铃虫(Boddie))进行昆虫生物测定。生物测定法是通过在控制的环境条件下(28℃,大约40%相对湿度,16小时∶8小时[光照∶黑暗])培养5天,此时记录处理的昆虫的总数、死亡昆虫数和存活昆虫的重量。
如下测定仅用粗裂解物或与加入的XptA2xwi毒素蛋白质一起的生物学活性。将对照培养物或者那些表达毒素复合物蛋白质的大肠杆菌粗裂解物(40μL)(浓度范围在12-17mg/mL之间)使用在生物测定碟的8个孔中的人工饵料的表面。每个孔中被处理食物的平均表面积为大约1.5cm2。空白载体对照和TcdB2/TccC3融合蛋白8920裂解物与或不与XptA2xwi一起使用。所加入的XptA2xwi是来自异源表达所述蛋白质的细菌培养物的高度纯化制备物。另外,将纯化的XptA2xwi与Butterfield’s磷酸盐溶液混合作为对照。食物中XptA2xwi的终浓度为250ng/cm2。
实施例20三联融合8836裂解物的生物测定结果
表XIV表明了对照裂解物、程序化表达TcdB2/TccC3融合蛋白8920的细胞裂解物和程序化表达三联TcdB2/TccC3/XptA2xwi融合蛋白8836的细胞裂解物的生物测定结果。对照裂解物和8920裂解物进行生物测定,加上或者减去纯化的XptA2xwi。这些数据表明含有或者不含XptA2xwi的对照裂解物对昆虫的影响很小。不加入XptA2xwi,仅含有TcdB2/TccC3融合蛋白8920的裂解物没有效果。然而,正如上面的实施例中所示,加入XptA2的8920裂解物是一种有效的昆虫生长抑制剂。不加入XptA2xwi,程序化表达三联TcdB2/TccC3/XptA2xwi融合蛋白8836的裂解物是非常有效的昆虫生长抑制剂。这些数据和上面的实施例14的数据表明,含有XptA2xwi、TcdB2和TccC3的三联融合蛋白是有功能的,并且是高度有效的。这些数据与实施例14的那些数据进一步表明这样一个令人惊讶的结果,无论融合蛋白内分离的蛋白质结构域的顺序,三联融合蛋白均有功能。
表XIV.棉铃虫(美洲棉铃虫(Boddie))对表达毒素复合体蛋白的大肠杆菌裂解物的响应 | ||
样品 | 测试的裂解物 | 棉铃虫的生长抑制 |
pET280 | 空载体对照 | + |
pET280+XptA2xwi | 空载体对照 | 0 |
纯化的XptA2xwi | XptA2xwi | 0 |
pDAB8920 | 8920(TcdB2/TccC3) | 0 |
pDAB8920+XptA2xwi | 8920(TcdB2/TccC3) | ++++ |
PDAB8836 | 8836(TcdB2/TccC3/XptA2) | ++++ |
每次测试使用8只昆虫。生长抑制等级:0=0-20%;+=21-40%;++=41-60%;+++=61-80%;++++=81-100%
序列表
<110>美国陶氏益农公司
<120>杀虫毒素复合体融合蛋白
<130>DAS-118XC1
<150>60/549,502
<151>2004-03-02
<150>60/549,516
<151>2004-03-02
<160>68
<170>PatentIn version 3.2
<210>1
<211>7409
<212>DNA
<213>发光光杆状菌(Photorhabdus luminescens)
<220>
<221>misc_feature
<222>(48)..(4469)
<223>TcdB2编码区
<220>
<221>misc_feature
<222>(4470)..(4511)
<223>接头编码区
<220>
<221>misc_feature
<222>(4512)..(7394)
<223>TccC3编码区
<400>1
tctagactga gtcgacgcac tactagtaac aaagaaggag atataccatg caaaattcac 60
aagattttag tattacggaa ctgtcactgc ccaaaggggg gggcgctatc acgggaatgg 120
gtgaagcatt aacccccact ggaccggatg gtatggccgc gctatctcta ccattgccta 180
tttctgccgg gcgcggttat gctcccgcat tcactctgaa ttacaacagc ggcgccggta 240
acagtccat t tggtctgggt tgggattgca acgttatgac tatccgccgc cgcacccatt 300
ttggcgtccc ccattatgac gaaaccgata cctttttggg gccagaaggc gaagtgctgg 360
tggtagcgga tcaacctcgc gacgaatcca cattacaggg tatcaattta ggcgccacct 420
ttaccgttac cggctaccgt tcccgtctgg aaagccattt cagccgattg gaatattggc 480
aacccaaaac aacaggtaaa acagattttt ggttgatata tagcccagat gggcaggtgc 540
atctactggg taaatcaccg caagcgcgga tcagcaaccc atcccaaacg acacaaacag 600
cacaatggct gctggaagcc tctgtatcat cacgtggcga acaaatttat tatcaatatc 660
gcgccgaaga tgacacaggt tgcgaagcag atgaaattac gcaccattta caggctacag 720
cgcaacgtta tttacacatc gtgtattacg gcaaccgtac agccagcgaa acattacccg 780
gtctggatgg cagcgcccca tcacaagcag actggttgtt ctatctggta tttgattacg 840
gcgaacgcag taacaacctg aaaacgccac cagcattttc gactacaggt agctggcttt 900
gccgtcagga ccgtttttcc cgttatgaat atggctttga gattcgtacc cgccgcttat 960
gccgtcaggt attgatgtac catcacctgc aagcactgga tagtaagata acagaacaca 1020
acggaccaac gctggtttca cgcctgatac tcaattacga cgaaagcgcg atagccagca 1080
cgctagtatt cgttcgccga gtgggacacg agcaagatgg taatgtcgtc accctgccgc 1140
cattagaatt ggcatatcag gatttttcac cgcgacatca cgctcactgg caaccaatgg 1200
atgtactggc aaacttcaat gccattcagc gctggcagct agtcgatcta aaaggcgaag 1260
gattacccgg cctgttatat caggataaag gcgcttggtg gtaccgctcc gcacagcgtc 1320
tgggcgaaat tggctcagat gccgtcactt gggaaaagat gcaaccttta tcggttattc 1380
cttctttgca aagtaatgcc tcgttggtgg atatcaatgg agacggccaa cttgactggg 1440
ttatcaccgg accgggatta cggggatatc atagtcaacg cccggatggc agttggacac 1500
gttttacccc actcaacgct ctgccggtgg aatacaccca tccacgcgcg caactcgcag 1560
atttaatggg agccgggcta tccgatttgg tgctgatcgg ccctaagagc gtgcgtttat 1620
atgccaatac ccgcgacggc tttgccaaag gaaaagatgt ggtgcaatcc ggtgatatca 1680
cactgccggt gccgggcgcc gatccacgta agttggtggc gtttagtgat gtattgggtt 1740
caggtcaagc ccatctggtt gaagtaagcg cgactaaagt cacctgctgg cctaatctgg 1800
ggcgcggacg ttttggtcaa cccattacct taccgggatt cagccagcca gcaaccgagt 1860
ttaacccggc tcaagtttat ctggccgatc tggatggcag cggtccaacg gatctgattt 1920
atgttcatac aaaccgtctg gatatcttcc tgaacaaaag tggcaatggc tttgctgaac 1980
cagtgacatt acgcttcccg gaaggtctgc gttttgatca tacctgtcag ttacaaatgg 2040
ccgatgtaca aggattaggc gtcgccagcc tgatactgag cgtgccgcat atgtctcccc 2100
atcactggcg ctgcgatctg accaacatga agccgtggtt actcaatgaa atgaacaaca 2160
atatgggggt ccatcacacc ttgcgttacc gcagttcctc ccaattctgg ctggatgaaa 2220
aagccgcggc gctgactacc ggacaaacac cggtttgcta tctccccttc ccgatccaca 2280
ccctatggca aacggaaaca gaagatgaaa tcagcggcaa caaattagtc acaacacttc 2340
gttatgctcg tggcgcatgg gacggacgcg agcgggaatt tcgcggattt ggttatgtag 2400
agcagacaga cagccatcaa ctggctcaag gcaacgcgcc agaacgtacg ccaccggcgc 2460
tgaccaaaaa ctggtatgcc accggactgc cggtgataga taacgcatta tcaaccgagt 2520
attggcgtga tgatcaggct tttgccggtt tctcaccgcg ctttacgact tggcaagata 2580
acaaagatgt cccgttaaca ccggaagatg ataacagtcg ttactggttc aaccgcgcgt 2640
tgaaaggtca actgctacgt agtgaactgt acggattgga cgatagtaca aataaacacg 2700
ttccctatac tgtcactgaa tttcgttcac aggtacgtcg attacagcat accgacagcc 2760
gataccctgt actttggtca tctgtagttg aaagccgcaa ctatcactac gaacgtatcg 2820
ccagcgaccc gcaatgcagt caaaatatta cgctatccag tgatcgattt ggtcagccgc 2880
taaaacagct ttcggtacag tacccgcgcc gccagcagcc agcaatcaat ctgtatcctg 2940
atacattgcc tgataagttg ttagccaaca gctatgatga ccaacaacgc caattacggc 3000
tcacctatca acaatccagt tggcatcacc tgaccaacaa taccgttcga gtattgggat 3060
taccggatag tacccgcagt gatatcttta cttatggcgc tgaaaatgtg cctgctggtg 3120
gtttaaatct ggaacttctg agtgataaaa atagcctgat cgcggacgat aaaccacgtg 3180
aatacctcgg tcagcaaaaa accgcttata ccgatggaca aaatacaacg ccgttgcaaa 3240
caccaacacg gcaagccctg attgccttta ccgaaacaac ggtattcaac cagtccacat 3300
tatcagcgtt taacggaagc atcccgtccg ataaattatc aacgacgctg gagcaagctg 3360
gatatcagca aacaaattat ctattccctc gcactggaga agataaagtt tgggtagccc 3420
atcacggcta taccgattat ggtacagcgg cacagttctg gcgcccgcaa aaacagagca 3480
acacccaact caccggtaaa atcaccctca tctgggatgc aaactattgc gttgtggtac 3540
aaacccggga tgctgctgga ctgacaacct cagccaaata tgactggcgt tttctgaccc 3600
cggtgcaact caccgatatc aatgacaatc agcaccttat cacactggat gcattgggcc 3660
gaccaatcac attgcgcttt tggggaactg aaaacggcaa gatgacaggt tattcctcac 3720
cggaaaaagc atcattttct ccaccatccg atgttaatgc cgctattgag ttaaaaaaac 3780
cgctccctgt agcacagtgt caggtctacg caccagaaag ctggatgcca gtattaagtc 3840
agaaaacctt caatcgactg gcagaacaag attggcaaaa gttatataac gcccgaatca 3900
tcaccgaaga tggacgtatc tgcacactgg cttatcgccg ctgggtacaa agccaaaagg 3960
caatccctca actcattagc ctgttaaaca acggaccccg tttacctcct cacagcctga 4020
cattgacgac ggatcgttat gatcacgatc ctgagcaaca gatccgtcaa caggtggtat 4080
tcagtgatgg ctttggccgc ttgctgcaag ccgctgcccg acatgaggca ggcatggccc 4140
ggcaacgcaa tgaagacggc tctttgatta taaatgtcca gcatactgag aaccgttggg 4200
cagtgactgg acgaacggaa tatgacaata aggggcaacc gatacgtacc tatcagccct 4260
atttcctcaa tgactggcga tacgtcagca atgatagtgc ccggcaggaa aaagaagctt 4320
atgcagatac ccatgtctat gatcccatag gtcgagaaat caaggttatc accgcaaaag 4380
gttggttccg tcgaaccttg ttcactccct ggtttactgt caatgaagat gaaaatgaca 4440
cagccgctga ggtgaagaag gtaaagatgc cgggatccga caacaagggt cagactatcc 4500
gcactaggcc tatgaaaaac atcgatccca aactttatca aaaaacccct actgtcagcg 4560
tttacgataa ccgtggtctg ataatccgta acatcgattt tcatcgtact accgcaaatg 4620
gtgatcccga tacccgtatt acccgccatc aatacgatat tcacggacac ctaaatcaaa 4680
gcatcgatcc gcgcctatat gaagccaagc aaaccaacaa tacgatcaaa cccaattttc 4740
tttggcagta tgatttgacc ggtaatcccc tatgtacaga gagcattgat gcaggtcgca 4800
ctgtcacctt gaatgatatt gaaggccgtc cgctactaac ggtgactgca acaggggtta 4860
tacaaactcg acaatatgaa acttcttccc tgcccggtcg tctgttatct gttgccgaac 4920
aaacacccga ggaaaaaaca tcccgtatca ccgaacgcct gatttgggct ggcaataccg 4980
aagcagagaa agaccataac cttgccggcc agtgcgtgcg tcactatgac acggcgggag 5040
ttacccggtt agagagttta tcactgaccg gtactgtttt atctcaatcc agccaactat 5100
tgatcgacac tcaagaggca aactggacag gtgataacga aaccgtctgg caaaacatgc 5160
tggctgatga catctacaca accctgagca ccttcgatgc caccggtgct ttactgactc 5220
agaccgatgc gaaagggaac attcagagac tggcttatga tgtggccggg cagctaaacg 5280
ggagctggct aacactcaaa ggccagacgg aacaagtgat tatcaaatcc ctgacctact 5340
ccgccgccgg acaaaaatta cgtgaggaac acggcaatga tgttatcacc gaatacagtt 5400
atgaaccgga aacccaacgg ctgatcggta tcaaaacccg ccgtccgtca gacactaaag 5460
tgctacaaga cctgcgctat gaatatgacc cggtaggcaa tgtcatcagc atccgtaatg 5520
acgcggaagc cacccgcttt tggcacaatc agaaagtgat gccggaaaac acttatacct 5580
acgattccct gtatcagctt atcagcgcca ccgggcgcga aatggcgaat ataggtcaac 5640
aaagtcacca atttccctca cccgctctac cttctgataa caacacctat accaactata 5700
cccgtactta tacttatgac cgtggcggca atctgaccaa aatccagcac agttcaccgg 5760
cgacgcaaaa caactacacc accaatatca cggtttcaaa tcgcagcaac cgcgcagtac 5820
tcagcacatt gaccgaagat ccggcgcaag tagatgcttt gtttgatgca ggcggacatc 5880
agaacacctt gatatcagga caaaacctga actggaatac tcgtggtgaa ctgcaacaag 5940
taacactggt taaacgggac aagggcgcca atgatgatcg ggaatggtat cgttatagcg 6000
gtgacggaag aaggatgtta aaaatcaatg aacagcaggc cagcaacaac gctcaaacac 6060
aacgtgtgac ttatttgccg aacttagaac ttcgtctaac acaaaacagc acggccacaa 6120
ccgaagattt gcaagttatc accgtaggcg aagcgggccg ggcacaggta cgagtattac 6180
attgggagag cggtaaaccg gaagatatcg acaataatca gttgcgttat agttacgata 6240
atcttatcgg ttccagtcaa cttgaattag atagcgaagg acaaattatc agtgaagaag 6300
aatattatcc ctatggtgga acagcattat gggccgccag gaatcagaca gaagccagtt 6360
ataaaactat ccgttattca ggcaaagagc gggatgccac cgggctatat tactacggct 6420
atcggtatta ccaaccgtgg ataggacggt ggttaagctc cgatccggca ggaacaatcg 6480
atgggctgaa tttatatcgg atggtgagga ataatccagt taccctcctt gatcctgatg 6540
gattaatgcc aacaattgca gaacgcatag cagcactaaa aaaaaataaa gtaacagact 6600
cagcgccttc gccagcaaat gccacaaacg tagcgataaa catccgcccg cctgtagcac 6660
caaaacctag cttaccgaaa gcatcaacga gtagccaacc aaccacacac cctatcggag 6720
ctgcaaacat aaaaccaacg acgtctgggt catctattgt tgctccattg agtccagtag 6780
gaaataaatc tacttctgaa atctctctgc cagaaagcgc tcaaagcagt tcttcaagca 6840
ctacctcgac aaatctacag aaaaaatcat ttactttata tagagcagat aacagatcct 6900
ttgaagaaat gcaaagtaaa ttccctgaag gatttaaagc ctggactcct ctagacacta 6960
agatggcaag gcaatttgct agtatcttta ttggtcagaa agatacatct aatttaccta 7020
aagaaacagt caagaacata agcacatggg gagcaaagcc aaaactaaaa gatctctcaa 7080
attacataaa atataccaag gacaaatcta cagtatgggt ttctactgca attaatactg 7140
aagcaggtgg acaaagctca ggggctccac tccataaaat tgatatggat ctctacgagt 7200
ttgccattga tggacaaaaa ctaaatccac taccggaggg tagaactaaa aacatggtac 7260
cttccctttt actcgacacc ccacaaatag agacatcatc catcattgca cttaatcatg 7320
gaccggtaaa tgatgcagaa atttcatttc tgacaacaat tccgcttaaa aatgtaaaac 7380
ctcataagag ataattaatc tgactcgag 7409
<210>2
<211>2448
<212>PRT
<213>发光光杆状菌
<220>
<221>misc_feature
<222>(1)..(1474)
<223>TcdB2
<220>
<221>misc_feature
<222>(1475)..(1488)
<223>接头
<220>
<221>misc_feature
<222>(1489)..(2248)
<223>TccC3
<400>2
Met Gln Asn Ser Gln Asp Phe Ser Ile Thr Glu Leu Ser Leu Pro Lys
1 5 10 15
Gly Gly Gly Ala Ile Thr Gly Met Gly Glu Ala Leu Thr Pro Thr Gly
20 25 30
Pro Asp Gly Met Ala Ala Leu Ser Leu Pro Leu Pro Ile Ser Ala Gly
35 40 45
Arg Gly Tyr Ala Pro Ala Phe Thr Leu Asn Tyr Asn Ser Gly Ala Gly
50 55 60
Asn Ser Pro Phe Gly Leu Gly Trp Asp Cys Asn Val Met Thr Ile Arg
65 70 75 80
Arg Arg Thr His Phe Gly Val Pro His Tyr Asp Glu Thr Asp Thr Phe
85 90 95
Leu Gly Pro Glu Gly Glu Val Leu Val Val Ala Asp Gln Pro Arg Asp
100 105 110
Glu Ser Thr Leu Gln Gly Ile Asn Leu Gly Ala Thr Phe Thr Val Thr
115 120 125
Gly Tyr Arg Ser Arg Leu Glu Ser His Phe Ser Arg Leu Glu Tyr Trp
130 135 140
Gln Pro Lys Thr Thr Gly Lys Thr Asp Phe Trp Leu Ile Tyr Ser Pro
145 150 155 160
Asp Gly Gln Val His Leu Leu Gly Lys Ser Pro Gln Ala Arg Ile Ser
165 170 175
Asn Pro Ser Gln Thr Thr Gln Thr Ala Gln Trp Leu Leu Glu Ala Ser
180 185 190
Val Ser Ser Arg Gly Glu Gln Ile Tyr Tyr Gln Tyr Arg Ala Glu Asp
195 200 205
Asp Thr Gly Cys Glu Ala Asp Glu Ile Thr His His Leu Gln Ala Thr
210 215 220
Ala Gln Arg Tyr Leu His Ile Val Tyr Tyr Gly Asn Arg Thr Ala Ser
225 230 235 240
Glu Thr Leu Pro Gly Leu Asp Gly Ser Ala Pro Ser Gln Ala Asp Trp
245 250 255
Leu Phe Tyr Leu Val Phe Asp Tyr Gly Glu Arg Ser Asn Asn Leu Lys
260 265 270
Thr Pro Pro Ala Phe Ser Thr Thr Gly Ser Trp Leu Cys Arg Gln Asp
275 280 285
Arg Phe Ser Arg Tyr Glu Tyr Gly Phe Glu Ile Arg Thr Arg Arg Leu
290 295 300
Cys Arg Gln Val Leu Met Tyr His His Leu Gln Ala Leu Asp Ser Lys
305 310 315 320
Ile Thr Glu His Asn Gly Pro Thr Leu Val Ser Arg Leu Ile Leu Asn
325 330 335
Tyr Asp Glu Ser Ala Ile Ala Ser Thr Leu Val Phe Val Arg Arg Val
340 345 350
Gly His Glu Gln Asp Gly Asn Val Val Thr Leu Pro Pro Leu Glu Leu
355 360 365
Ala Tyr Gln Asp Phe Ser Pro Arg His His Ala His Trp Gln Pro Met
370 375 380
Asp Val Leu Ala Asn Phe Asn Ala Ile Gln Arg Trp Gln Leu Val Asp
385 390 395 400
Leu Lys Gly Glu Gly Leu Pro Gly Leu Leu Tyr Gln Asp Lys Gly Ala
405 410 415
Trp Trp Tyr Arg Ser Ala Gln Arg Leu Gly Glu Ile Gly Ser Asp Ala
420 425 430
Val Thr Trp Glu Lys Met Gln Pro Leu Ser Val Ile Pro Ser Leu Gln
435 440 445
Ser Asn Ala Ser Leu Val Asp Ile Asn Gly Asp Gly Gln Leu Asp Trp
450 455 460
Val Ile Thr Gly Pro Gly Leu Arg Gly Tyr His Ser Gln Arg Pro Asp
465 470 475 480
Gly Ser Trp Thr Arg Phe Thr Pro Leu Asn Ala Leu Pro Val Glu Tyr
485 490 495
Thr His Pro Arg Ala Gln Leu Ala Asp Leu Met Gly Ala Gly Leu Ser
500 505 510
Asp Leu Val Leu Ile Gly Pro Lys Ser Val Arg Leu Tyr Ala Asn Thr
515 520 525
Arg Asp Gly Phe Ala Lys Gly Lys Asp Val Val Gln Ser Gly Asp Ile
530 535 540
Thr Leu Pro Val Pro Gly Ala Asp Pro Arg Lys Leu Val Ala Phe Ser
545 550 555 560
Asp Val Leu Gly Ser Gly Gln Ala His Leu Val Glu Val Ser Ala Thr
565 570 575
Lys Val Thr Cys Trp Pro Asn Leu Gly Arg Gly Arg Phe Gly Gln Pro
580 585 590
Ile Thr Leu Pro Gly Phe Ser Gln Pro Ala Thr Glu Phe Asn Pro Ala
595 600 605
Gln Val Tyr Leu Ala Asp Leu Asp Gly Ser Gly Pro Thr Asp Leu Ile
610 615 620
Tyr Val His Thr Asn Arg Leu Asp Ile Phe Leu Asn Lys Ser Gly Asn
625 630 635 640
Gly Phe Ala Glu Pro Val Thr Leu Arg Phe Pro Glu Gly Leu Arg Phe
645 650 655
Asp His Thr Cys Gln Leu Gln Met Ala Asp Val Gln Gly Leu Gly Val
660 665 670
Ala Ser Leu Ile Leu Ser Val Pro His Met Ser Pro His His Trp Arg
675 680 685
Cys Asp Leu Thr Asn Met Lys Pro Trp Leu Leu Asn Glu Met Asn Asn
690 695 700
Asn Met Gly Val His His Thr Leu Arg Tyr Arg Ser Ser Ser Gln Phe
705 710 715 720
Trp Leu Asp Glu Lys Ala Ala Ala Leu Thr Thr Gly Gln Thr Pro Val
725 730 735
Cys Tyr Leu Pro Phe Pro Ile His Thr Leu Trp Gln Thr Glu Thr Glu
740 745 750
Asp Glu Ile Ser Gly Asn Lys Leu Val Thr Thr Leu Arg Tyr Ala Arg
755 760 765
Gly Ala Trp Asp Gly Arg Glu Arg Glu Phe Arg Gly Phe Gly Tyr Val
770 775 780
Glu Gln Thr Asp Ser His Gln Leu Ala Gln Gly Asn Ala Pro Glu Arg
785 790 795 800
Thr Pro Pro Ala Leu Thr Lys Asn Trp Tyr Ala Thr Gly Leu Pro Val
805 810 815
Ile Asp Asn Ala Leu Ser Thr Glu Tyr Trp Arg Asp Asp Gln Ala Phe
820 825 830
Ala Gly Phe Ser Pro Arg Phe Thr Thr Trp Gln Asp Asn Lys Asp Val
835 840 845
Pro Leu Thr Pro Glu Asp Asp Asn Ser Arg Tyr Trp Phe Asn Arg Ala
850 855 860
Leu Lys Gly Gln Leu Leu Arg Ser Glu Leu Tyr Gly Leu Asp Asp Ser
865 870 875 880
Thr Asn Lys His Val Pro Tyr Thr Val Thr Glu Phe Arg Ser Gln Val
885 890 895
Arg Arg Leu Gln His Thr Asp Ser Arg Tyr Pro Val Leu Trp Ser Ser
900 905 910
Val Val Glu Ser Arg Asn Tyr His Tyr Glu Arg Ile Ala Ser Asp Pro
915 920 925
Gln Cys Ser Gln Asn Ile Thr Leu Ser Ser Asp Arg Phe Gly Gln Pro
930 935 940
Leu Lys Gln Leu Ser Val Gln Tyr Pro Arg Arg Gln Gln Pro Ala Ile
945 950 955 960
Asn Leu Tyr Pro Asp Thr Leu Pro Asp Lys Leu Leu Ala Asn Ser Tyr
965 970 975
Asp Asp Gln Gln Arg Gln Leu Arg Leu Thr Tyr Gln Gln Ser Ser Trp
980 985 990
His His Leu Thr Asn Asn Thr Val Arg Val Leu Gly Leu Pro Asp Ser
995 1000 1005
Thr Arg Ser Asp Ile Phe Thr Tyr Gly Ala Glu Asn Val Pro Ala
1010 1015 1020
Gly Gly Leu Asn Leu Glu Leu Leu Ser Asp Lys Asn Ser Leu Ile
1025 1030 1035
Ala Asp Asp Lys Pro Arg Glu Tyr Leu Gly Gln Gln Lys Thr Ala
1040 1045 1050
Tyr Thr Asp Gly Gln Asn Thr Thr Pro Leu Gln Thr Pro Thr Arg
1055 1060 1065
Gln Ala Leu Ile Ala Phe Thr Glu Thr Thr Val Phe Asn Gln Ser
1070 1075 1080
Thr Leu Ser Ala Phe Asn Gly Ser Ile Pro Ser Asp Lys Leu Ser
1085 1090 1095
Thr Thr Leu Glu Gln Ala Gly Tyr Gln Gln Thr Asn Tyr Leu Phe
1100 1105 1110
Pro Arg Thr Gly Glu Asp Lys Val Trp Val Ala His His Gly Tyr
1115 1120 1125
Thr Asp Tyr Gly Thr Ala Ala Gln Phe Trp Arg Pro Gln Lys Gln
1130 1135 1140
Ser Asn Thr Gln Leu Thr Gly Lys Ile Thr Leu Ile Trp Asp Ala
1145 1150 1155
Asn Tyr Cys Val Val Val Gln Thr Arg Asp Ala Ala Gly Leu Thr
1160 1165 1170
Thr Ser Ala Lys Tyr Asp Trp Arg Phe Leu Thr Pro Val Gln Leu
1175 1180 1185
Thr Asp Ile Asn Asp Asn Gln His Leu Ile Thr Leu Asp Ala Leu
1190 1195 1200
Gly Arg Pro Ile Thr Leu Arg Phe Trp Gly Thr Glu Asn Gly Lys
1205 1210 1215
Met Thr Gly Tyr Ser Ser Pro Glu Lys Ala Ser Phe Ser Pro Pro
1220 1225 1230
Ser Asp Val Asn Ala Ala Ile Glu Leu Lys Lys Pro Leu Pro Val
1235 1240 1245
Ala Gln Cys Gln Val Tyr Ala Pro Glu Ser Trp Met Pro Val Leu
1250 1255 1260
Ser Gln Lys Thr Phe Asn Arg Leu Ala Glu Gln Asp Trp Gln Lys
1265 1270 1275
Leu Tyr Asn Ala Arg Ile Ile Thr Glu Asp Gly Arg Ile Cys Thr
1280 1285 1290
Leu Ala Tyr Arg Arg Trp Val Gln Ser Gln Lys Ala Ile Pro Gln
1295 1300 1305
Leu Ile Ser Leu Leu Asn Asn Gly Pro Arg Leu Pro Pro His Ser
1310 1315 1320
Leu Thr Leu Thr Thr Asp Arg Tyr Asp His Asp Pro Glu Gln Gln
1325 1330 1335
Ile Arg Gln Gln Val Val Phe Ser Asp Gly Phe Gly Arg Leu Leu
1340 1345 1350
Gln Ala Ala Ala Arg His Glu Ala Gly Met Ala Arg Gln Arg Asn
1355 1360 1365
Glu Asp Gly Ser Leu Ile Ile Asn Val Gln His Thr Glu Asn Arg
1370 1375 1380
Trp Ala Val Thr Gly Arg Thr Glu Tyr Asp Asn Lys Gly Gln Pro
1385 1390 1395
Ile Arg Thr Tyr Gln Pro Tyr Phe Leu Asn Asp Trp Arg Tyr Val
1400 1405 1410
Ser Asn Asp Ser Ala Arg Gln Glu Lys Glu Ala Tyr Ala Asp Thr
1415 1420 1425
His Val Tyr Asp Pro Ile Gly Arg Glu Ile Lys Val Ile Thr Ala
1430 1435 1440
Lys Gly Trp Phe Arg Arg Thr Leu Phe Thr Pro Trp Phe Thr Val
1445 1450 1455
Asn Glu Asp Glu Asn Asp Thr Ala Ala Glu Val Lys Lys Val Lys
1460 1465 1470
Met Pro Gly Ser Asp Asn Lys Gly Gln Thr Ile Arg Thr Arg Pro
1475 1480 1485
Met Lys Asn Ile Asp Pro Lys Leu Tyr Gln Lys Thr Pro Thr Val
1490 1495 1500
Ser Val Tyr Asp Asn Arg Gly Leu Ile Ile Arg Asn Ile Asp Phe
1505 1510 1515
His Arg Thr Thr Ala Asn Gly Asp Pro Asp Thr Arg Ile Thr Arg
1520 1525 1530
His Gln Tyr Asp Ile His Gly His Leu Asn Gln Ser Ile Asp Pro
1535 1540 1545
Arg Leu Tyr Glu Ala Lys Gln Thr Asn Asn Thr Ile Lys Pro Asn
1550 1555 1560
Phe Leu Trp Gln Tyr Asp Leu Thr Gly Asn Pro Leu Cys Thr Glu
1565 1570 1575
Ser Ile Asp Ala Gly Arg Thr Val Thr Leu Asn Asp Ile Glu Gly
1580 1585 1590
Arg Pro Leu Leu Thr Val Thr Ala Thr Gly Val Ile Gln Thr Arg
1595 1600 1605
Gln Tyr Glu Thr Ser Ser Leu Pro Gly Arg Leu Leu Ser Val Ala
1610 1615 1620
Glu Gln Thr Pro Glu Glu Lys Thr Ser Arg Ile Thr Glu Arg Leu
1625 1630 1635
Ile Trp Ala Gly Asn Thr Glu Ala Glu Lys Asp His Asn Leu Ala
1640 1645 1650
Gly Gln Cys Val Arg His Tyr Asp Thr Ala Gly Val Thr Arg Leu
1655 1660 1665
Glu Ser Leu Ser Leu Thr Gly Thr Val Leu Ser Gln Ser Ser Gln
1670 1675 1680
Leu Leu Ile Asp Thr Gln Glu Ala Asn Trp Thr Gly Asp Asn Glu
1685 1690 1695
Thr Val Trp Gln Asn Met Leu Ala Asp Asp Ile Tyr Thr Thr Leu
1700 1705 1710
Ser Thr Phe Asp Ala Thr Gly Ala Leu Leu Thr Gln Thr Asp Ala
1715 1720 1725
Lys Gly Asn Ile Gln Arg Leu Ala Tyr Asp Val Ala Gly Gln Leu
1730 1735 1740
Asn Gly Ser Trp Leu Thr Leu Lys Gly Gln Thr Glu Gln Val Ile
1745 1750 1755
Ile Lys Ser Leu Thr Tyr Ser Ala Ala Gly Gln Lys Leu Arg Glu
1760 1765 1770
Glu His Gly Asn Asp Val Ile Thr Glu Tyr Ser Tyr Glu Pro Glu
1775 1780 1785
Thr Gln Arg Leu Ile Gly Ile Lys Thr Arg Arg Pro Ser Asp Thr
1790 1795 1800
Lys Val Leu Gln Asp Leu Arg Tyr Glu Tyr Asp Pro Val Gly Asn
1805 1810 1815
Val Ile Ser Ile Arg Asn Asp Ala Glu Ala Thr Arg Phe Trp His
1820 1825 1830
Asn Gln Lys Val Met Pro Glu Asn Thr Tyr Thr Tyr Asp Ser Leu
1835 1840 1845
Tyr Gln Leu Ile Ser Ala Thr Gly Arg Glu Met Ala Asn Ile Gly
1850 1855 1860
Gln Gln Ser His Gln Phe Pro Ser Pro Ala Leu Pro Ser Asp Asn
1865 1870 1875
Asn Thr Tyr Thr Asn Tyr Thr Arg Thr Tyr Thr Tyr Asp Arg Gly
1880 1885 1890
Gly Asn Leu Thr Lys Ile Gln His Ser Ser Pro Ala Thr Gln Asn
1895 1900 1905
Asn Tyr Thr Thr Asn Ile Thr Val Ser Asn Arg Ser Asn Arg Ala
1910 1915 1920
Val Leu Ser Thr Leu Thr Glu Asp Pro Ala Gln Val Asp Ala Leu
1925 1930 1935
Phe Asp Ala Gly Gly His Gln Asn Thr Leu Ile Ser Gly Gln Asn
1940 1945 1950
Leu Asn Trp Asn Thr Arg Gly Glu Leu Gln Gln Val Thr Leu Val
1955 1960 1965
Lys Arg Asp Lys Gly Ala Asn Asp Asp Arg Glu Trp Tyr Arg Tyr
1970 1975 1980
Ser Gly Asp Gly Arg Arg Met Leu Lys Ile Asn Glu Gln Gln Ala
1985 1990 1995
Ser Asn Asn Ala Gln Thr Gln Arg Val Thr Tyr Leu Pro Asn Leu
2000 2005 2010
Glu Leu Arg Leu Thr Gln Asn Ser Thr Ala Thr Thr Glu Asp Leu
2015 2020 2025
Gln Val Ile Thr Val Gly Glu Ala Gly Arg Ala Gln Val Arg Val
2030 2035 2040
Leu His Trp Glu Ser Gly Lys Pro Glu Asp Ile Asp Asn Asn Gln
2045 2050 2055
Leu Arg Tyr Ser Tyr Asp Asn Leu Ile Gly Ser Ser Gln Leu Glu
2060 2065 2070
Leu Asp Ser Glu Gly Gln Ile Ile Ser Glu Glu Glu Tyr Tyr Pro
2075 2080 2085
Tyr Gly Gly Thr Ala Leu Trp Ala Ala Arg Asn Gln Thr Glu Ala
2090 2095 2100
Ser Tyr Lys Thr Ile Arg Tyr Ser Gly Lys Glu Arg Asp Ala Thr
2105 2110 2115
Gly Leu Tyr Tyr Tyr Gly Tyr Arg Tyr Tyr Gln Pro Trp Ile Gly
2120 2125 2130
Arg Trp Leu Ser Ser Asp Pro Ala Gly Thr Ile Asp Gly Leu Asn
2135 2140 2145
Leu Tyr Arg Met Val Arg Asn Asn Pro Val Thr Leu Leu Asp Pro
2150 2155 2160
Asp Gly Leu Met Pro Thr Ile Ala Glu Arg Ile Ala Ala Leu Lys
2165 2170 2175
Lys Asn Lys Val Thr Asp Ser Ala Pro Ser Pro Ala Asn Ala Thr
2180 2185 2190
Asn Val Ala Ile Asn Ile Arg Pro Pro Val Ala Pro Lys Pro Ser
2195 2200 2205
Leu Pro Lys Ala Ser Thr Ser Ser Gln Pro Thr Thr His Pro Ile
2210 2215 2220
Gly Ala Ala Asn Ile Lys Pro Thr Thr Ser Gly Ser Ser Ile Val
2225 2230 2235
Ala Pro Leu Ser Pro Val Gly Asn Lys Ser Thr Ser Glu Ile Ser
2240 2245 2250
Leu Pro Glu Ser Ala Gln Ser Ser Ser Ser Ser Thr Thr Ser Thr
2255 2260 2265
Asn Leu Gln Lys Lys Ser Phe Thr Leu Tyr Arg Ala Asp Asn Arg
2270 2275 2280
Ser Phe Glu Glu Met Gln Ser Lys Phe Pro Glu Gly Phe Lys Ala
2285 2290 2295
Trp Thr Pro Leu Asp Thr Lys Met Ala Arg Gln Phe Ala Ser Ile
2300 2305 2310
Phe Ile Gly Gln Lys Asp Thr Ser Asn Leu Pro Lys Glu Thr Val
2315 2320 2325
Lys Asn Ile Ser Thr Trp Gly Ala Lys Pro Lys Leu Lys Asp Leu
2330 2335 2340
Ser Asn Tyr Ile Lys Tyr Thr Lys Asp Lys Ser Thr Val Trp Val
2345 2350 2355
Ser Thr Ala Ile Asn Thr Glu Ala Gly Gly Gln Ser Ser Gly Ala
2360 2365 2370
Pro Leu His Lys Ile Asp Met Asp Leu Tyr Glu Phe Ala Ile Asp
2375 2380 2385
Gly Gln Lys Leu Asn Pro Leu Pro Glu Gly Arg Thr Lys Asn Met
2390 2395 2400
Val Pro Ser Leu Leu Leu Asp Thr Pro Gln Ile Glu Thr Ser Ser
2405 2410 2415
Ile Ile Ala Leu Asn His Gly Pro Val Asn Asp Ala Glu Ile Ser
2420 2425 2430
Phe Leu Thr Thr Ile Pro Leu Lys Asn Val Lys Pro His Lys Arg
2435 2440 2445
<210>3
<211>42
<212>DNA
<213>人工序列
<220>
<223>tcdB2-tccC3接头的核苷酸序列
<400>3
ccgggatccg acaacaaggg tcagactatc cgcactaggc ct 42
<210>4
<211>14
<212>PRT
<213>人工序列
<220>
<223>TcdB2-TccC3接头编码的蛋白质序列
<400>4
Pro Gly Ser Asp Asn Lys Gly Gln Thr Ile Arg Thr Arg Pro
1 5 10
<210>5
<211>1476
<212>PRT
<213>发光光杆状菌
<400>5
Met Gln Asn Ser Gln Thr Phe Ser Val Thr Glu Leu Ser Leu Pro Lys
1 5 10 15
Gly Gly Gly Ala Ile Thr Gly Met Gly Glu Ala Leu Thr Pro Ala Gly
20 25 30
Pro Asp Gly Met Ala Ala Leu Ser Leu Pro Leu Pro Ile Ser Ala Gly
35 40 45
Arg Gly Tyr Ala Pro Ser Leu Thr Leu Asn Tyr Asn Ser 6ly Thr Gly
50 55 60
Asn Ser Pro Phe Gly Leu Gly Trp Asp Cys Gly Val Met Ala Ile Arg
65 70 75 80
Arg Arg Thr Ser Thr Gly Val Pro Asn Tyr Asp Glu Thr Asp Thr Phe
85 90 95
Leu Gly Pro Glu Gly Glu Val Leu Val Val Ala Leu Asn Glu Ala Gly
100 105 110
Gln Ala Asp Ile Arg Ser Glu Ser Ser Leu Gln Gly Ile Asn Leu Gly
115 120 125
Ala Thr Phe Thr Val Thr Cys Tyr Arg Ser Arg Leu Glu Ser His Phe
130 135 140
Asn Arg Leu Glu Tyr Trp Gln Pro Gln Thr Thr Gly Ala Thr Asp Phe
145 150 155 160
Trp Leu Ile Tyr Ser Pro Asp Gly Gln Val His Leu Leu Gly Lys Asn
165 170 175
Pro Gln Ala Arg Ile Ser Asn Pro Leu Asn Val Asn Gln Thr Ala Gln
180 185 190
Trp Leu Leu Glu Ala Ser Ile Ser Ser His Ser Glu Gln Ile Tyr Tyr
195 200 205
Gln Tyr Arg Ala Glu Asp Glu Ala Gly Cys Glu Thr Asp Glu Leu Ala
210 215 220
Ala His Pro Ser Ala Thr Val Gln Arg Tyr Leu Gln Thr Val His Tyr
225 230 235 240
Gly Asn Leu Thr Ala Ser Asp Val Phe Pro Thr Leu Asn Gly Asp Asp
245 250 255
Pro Leu Lys Ser Gly Trp Met Phe Cys Leu Val Phe Asp Tyr Gly Glu
260 265 270
Arg Lys Asn Ser Leu Ser Glu Met Pro Leu Phe Lys Ala Thr Gly Asn
275 280 285
Trp Leu Cys Arg Lys Asp Arg Phe Ser Arg Tyr Glu Tyr Gly Phe Glu
290 295 300
Leu Arg Thr Arg Arg Leu Cys Arg Gln Ile Leu Met Phe His Arg Leu
305 310 315 320
Gln Thr Leu Ser Gly Gln Ala Lys Gly Asp Asp Glu Pro Ala Leu Val
325 330 335
Ser Arg Leu Ile Leu Asp Tyr Asp Glu Asn Ala Met Val Ser Thr Leu
340 345 350
Val Ser Val Arg Arg Val Gly His Glu Asp Asn Asn Thr Val Thr Ala
355 360 365
Leu Pro Pro Leu Glu Leu Ala Tyr Gln Pro Phe Glu Pro Glu Gln Thr
370 375 380
Ala Leu Trp Gln Ser Met Asp Val Leu Ala Asn Phe Asn Thr Ile Gln
385 390 395 400
Arg Trp Gln Leu Leu Asp Leu Lys Gly Glu Gly Val Pro Gly Ile Leu
405 410 415
Tyr Gln Asp Arg Asn Gly Trp Trp Tyr Arg Ser Ala Gln Arg Gln Ala
420 425 430
Gly Glu Glu Met Asn Ala Val Thr Trp Gly Lys Met Gln Leu Leu Pro
435 440 445
Ile Thr Pro Ala Val Gln Asp Asn Ala Ser Leu Met Asp Ile Asn Gly
450 455 460
Asp Gly Gln Leu Asp Trp Val Ile Thr Gly Pro Gly Leu Arg Gly Tyr
465 470 475 480
His Ser Gln His Pro Asp Gly Ser Trp Thr Arg Phe Thr Pro Leu His
485 490 495
Ala Leu Pro Ile Glu Tyr Ser His Pro Arg Ala Gln Leu Ala Asp Leu
500 505 510
Met Gly Ala Gly Leu Ser Asp Leu Val Leu Ile Gly Pro Lys Ser Val
515 520 525
Arg Leu Tyr Val Asn Asn Arg Asp Gly Phe Thr Glu Gly Arg Asp Val
530 535 540
Val Gln Ser Gly Asp Ile Thr Leu Pro Leu Pro Gly Ala Asp Ala Arg
545 550 555 560
Lys Leu Val Ala Phe Ser Asp Val Leu Gly Ser Gly Gln Ala His Leu
565 570 575
Val Glu Val Ser Ala Thr Gln Val Thr Cys Trp Pro Asn Leu Gly His
580 585 590
Gly Arg Phe Gly Gln Pro Ile Val Leu Pro Gly Phe Ser Gln Ser Ala
595 600 605
Ala Ser Phe Asn Pro Asp Arg Val His Leu Ala Asp Leu Asp Gly Ser
610 615 620
Gly Pro Ala Asp Leu Ile Tyr Val His Ala Asp Arg Leu Asp Ile Phe
625 630 635 640
Ser Asn Glu Ser Gly Asn Gly Phe Ala Lys Pro Phe Thr Leu Ser Phe
645 650 655
Pro Asp Gly Leu Arg Phe Asp Asp Thr Cys Gln Leu Gln Val Ala Asp
660 665 670
Val Gln Gly Leu Gly Val Val Ser Leu Ile Leu Ser Val Pro His Met
675 680 685
Ala Pro His His Trp Arg Cys Asp Leu Thr Asn Ala Lys Pro Trp Leu
690 695 700
Leu Ser Glu Thr Asn Asn Asn Met Gly Ala Asn His Thr Leu His Tyr
705 710 715 720
Arg Ser Ser Val Gln Phe Trp Leu Asp Glu Lys Ala Ala Ala Leu Ala
725 730 735
Thr Gly Gln Thr Pro Val Cys Tyr Leu Pro Phe Pro Val His Thr Leu
740 745 750
Trp Gln Thr Glu Thr Glu Asp Glu Ile Ser Gly Asn Lys Leu Val Thr
755 760 765
Thr Leu Arg Tyr Ala His Gly Ala Trp Asp Gly Arg Glu Arg Glu Phe
770 775 780
Arg Gly Phe Gly Tyr Val Glu Gln Thr Asp Ser His Gln Leu Ala Gln
785 790 795 800
Gly Asn Ala Pro Glu Arg Thr Pro Pro Ala Leu Thr Lys Ser Trp Tyr
805 810 815
Ala Thr Gly Leu Pro Ala Val Asp Asn Ala Leu Ser Ala Gly Tyr Trp
820 825 830
Arg Gly Asp Lys Gln Ala Phe Ala Gly Phe Thr Pro Arg Phe Thr Leu
835 840 845
Trp Lys Glu Gly Lys Asp Val Pro Leu Thr Pro Glu Asp Asp His Asn
850 855 860
Leu Tyr Trp Leu Asn Arg Ala Leu Lys Gly Gln Pro Leu Arg Ser Glu
865 870 875 880
Leu Tyr Gly Leu Asp Gly Ser Ala Gln Gln Gln Ile Pro Tyr Thr Val
885 890 895
Thr Glu Ser Arg Pro Gln Val Arg Gln Leu Gln Asp Gly Ala Thr Val
900 905 910
Ser Pro Val Leu Trp Ala Ser Val Val Glu Ser Arg Ser Tyr His Tyr
915 920 925
Glu Arg Ile Ile Ser Asp Pro Gln Cys Asn Gln Asp Ile Thr Leu Ser
930 935 940
Ser Asp Leu Phe Gly Gln Pro Leu Lys Gln Val Ser Val Gln Tyr Pro
945 950 955 960
Arg Arg Asn Lys Pro Thr Thr Asn Pro Tyr Pro Asp Thr Leu Pro Asp
965 970 975
Thr Leu Phe Ala Ser Ser Tyr Asp Asp Gln Gln Gln Leu Leu Arg Leu
980 985 990
Thr Cys Arg Gln Ser Ser Trp His His Leu Ile Gly Asn Glu Leu Arg
995 1000 1005
Val Leu Gly Leu Pro Asp Gly Thr Arg Ser Asp Ala Phe Thr Tyr
1010 1015 1020
Asp Ala Lys Gln Val Pro Val Asp Gly Leu Asn Leu Glu Thr Leu
1025 1030 1035
Cys Ala Glu Asn Ser Leu Ile Ala Asp Asp Lys Pro Arg Glu Tyr
1040 1045 1050
Leu Asn Gln Gln Arg Thr Phe Tyr Thr Asp Gly Lys Asn Gln Thr
1055 1060 1065
Pro Leu Lys Thr Pro Thr Arg Gln Ala Leu Ile Ala Phe Thr Glu
1070 1075 1080
Thr Ala Val Leu Thr Glu Ser Leu Leu Ser Ala Phe Asp Gly Gly
1085 1090 1095
Ile Thr Pro Asp Glu Leu Pro Gly Ile Leu Thr Gln Ala Gly Tyr
1100 1105 1110
Gln Gln Glu Pro Tyr Leu Phe Pro Arg Thr Gly Glu Asn Lys Val
1115 1120 1125
Trp Val Ala Arg Gln Gly Tyr Thr Asp Tyr Gly Thr Glu Ala Gln
1130 1135 1140
Phe Trp Arg Pro Val Ala Gln Arg Asn Ser Leu Leu Thr Gly Lys
1145 1150 1155
Met Thr Leu Lys Trp Asp Thr His Tyr Cys Val Ile Thr Gln Thr
1160 1165 1170
Gln Asp Ala Ala Gly Leu Thr Val Ser Ala Asn Tyr Asp Trp Arg
1175 1180 1185
Phe Leu Thr Pro Thr Gln Leu Thr Asp Ile Asn Asp Asn Val His
1190 1195 1200
Leu Ile Thr Leu Asp Ala Leu Gly Arg Pro Val Thr Gln Arg Phe
1205 1210 1215
Trp Gly Ile Glu Ser Gly Val Ala Thr Gly Tyr Ser Ser Ser G1u
1220 1225 1230
Glu Lys Pro Phe Ser Pro Pro Asn Asp Ile Asp Thr Ala Ile Asn
1235 1240 1245
Leu Thr Gly Pro Leu Pro Val Ala Gln Cys Leu Val Tyr A1a Pro
1250 1255 1260
Asp Ser Trp Met Pro Leu Phe Ser Gln Glu Thr Phe Asn Thr Leu
1265 1270 1275
Thr Gln Glu Glu Gln Glu Thr Leu Arg Asp Ser Arg Ile Ile Thr
1280 1285 1290
Glu Asp Trp Arg Ile Cys Ala Leu Thr Arg Arg Arg Trp Leu Gln
1295 1300 1305
Ser Gln Lys Ile Ser Thr Pro Leu Val Lys Leu Leu Thr Asn Ser
1310 1315 1320
Ile Gly Leu Pro Pro His Asn Leu Thr Leu Thr Thr Asp Arg Tyr
1325 1330 1335
Asp Arg Asp Ser Glu Gln Gln Ile Arg Gln Gln Val Ala Phe Ser
1340 1345 1350
Asp Gly Phe Gly Arg Leu Leu Gln Ala Ser Val Arg His Glu Ala
1355 1360 1365
Gly Glu Ala Trp Gln Arg Asn Gln Asp Gly Ser Leu Val Thr Lys
1370 1375 1380
Val Glu Asn Thr Lys Thr Arg Trp Ala Val Thr Gly Arg Thr Glu
1385 1390 1395
Tyr Asp Asn Lys Gly Gln Thr Ile Arg Thr Tyr Gln Pro Tyr Phe
1400 1405 1410
Leu Asn Asp Trp Arg Tyr Val Ser Asp Asp Ser Ala Arg Lys Glu
1415 1420 1425
Ala Tyr Ala Asp Thr His Ile Tyr Asp Pro IIe Gly Arg Glu Ile
1430 1435 1440
Arg Val Ile Thr Ala Lys Gly Trp Leu Arg Gln Ser Gln Tyr Phe
1445 1450 1455
Pro Trp Phe Thr Val Ser Glu Asp Glu Asn Asp Thr Ala Ala Asp
1460 1465 1470
Ala Leu Val
1475
<210>6
<211>1474
<212>PRT
<213>发光光杆状菌
<400>6
Met Gln Asn Ser Gln Asp Phe Ser Ile Thr Glu Leu Ser Leu Pro Lys
1 5 10 15
Gly Gly Gly Ala Ile Thr Gly Met Gly Glu Ala Leu Thr Pro Thr Gly
20 25 30
Pro Asp Gly Met Ala Ala Leu Ser Leu Pro Leu Pro Ile Ser Ala Gly
35 40 45
Arg Gly Tyr Ala Pro Ala Phe Thr Leu Asn Tyr Asn Ser Gly Ala Gly
50 55 60
Asn Ser Pro Phe Gly Leu Gly Trp Asp Cys Asn Val Met Thr Ile Arg
65 70 75 80
Arg Arg Thr His Phe Gly Val Pro His Tyr Asp Glu Thr Asp Thr Phe
85 90 95
Leu Gly Pro Glu Gly Glu Val Leu Val Val Ala Asp Gln Pro Arg Asp
100 105 110
Glu Ser Thr Leu Gln Gly Ile Asn Leu Gly Ala Thr Phe Thr Val Thr
115 120 125
Gly Tyr Arg Ser Arg Leu Glu Ser His Phe Ser Arg Leu Glu Tyr Trp
130 135 140
Gln Pro Lys Thr Thr Gly Lys Thr Asp Phe Trp Leu Ile Tyr Ser Pro
145 150 155 160
Asp Gly Gln Val His Leu Leu Gly Lys Ser Pro Gln Ala Arg Ile Ser
165 170 175
Asn Pro Ser Gln Thr Thr Gln Thr Ala Gln Trp Leu Leu Glu Ala Ser
180 185 190
Val Ser Ser Arg Gly Glu Gln Ile Tyr Tyr Gln Tyr Arg Ala Glu Asp
195 200 205
Asp Thr Gly Cys Glu Ala Asp Glu Ile Thr His His Leu Gln Ala Thr
210 215 220
Ala Gln Arg Tyr Leu His Ile Val Tyr Tyr Gly Asn Arg Thr Ala Ser
225 230 235 240
Glu Thr Leu Pro Gly Leu Asp Gly Ser Ala Pro Ser Gln Ala Asp Trp
245 250 255
Leu Phe Tyr Leu Val Phe Asp Tyr Gly Glu Arg Ser Asn Asn Leu Lys
260 265 270
Thr Pro Pro Ala Phe Ser Thr Thr Gly Ser Trp Leu Cys Arg Gln Asp
275 280 285
Arg Phe Ser Arg Tyr Glu Tyr Gly Phe Glu Ile Arg Thr Arg Arg Leu
290 295 300
Cys Arg Gln Val Leu Met Tyr His His Leu Gln Ala Leu Asp Ser Lys
305 310 315 320
Ile Thr Glu His Asn Gly Pro Thr Leu Val Ser Arg Leu Ile Leu Asn
325 330 335
Tyr Asp Glu Ser Ala Ile Ala Ser Thr Leu Val Phe Val Arg Arg Val
340 345 350
Gly His Glu Gln Asp Gly Asn Val Val Thr Leu Pro Pro Leu Glu Leu
355 360 365
Ala Tyr Gln Asp Phe Ser Pro Arg His His Ala His Trp Gln Pro Met
370 375 380
Asp Val Leu Ala Asn Phe Asn Ala Ile Gln Arg Trp Gln Leu Val Asp
385 390 395 400
Leu Lys Gly Glu Gly Leu Pro Gly Leu Leu Tyr Gln Asp Lys Gly Ala
405 410 415
Trp Trp Tyr Arg Ser Ala Gln Arg Leu Gly Glu Ile Gly Ser Asp Ala
420 425 430
Val Thr Trp Glu Lys Met Gln Pro Leu Ser Val Ile Pro Ser Leu Gln
435 440 445
Ser Asn Ala Ser Leu Val Asp Ile Asn Gly Asp Gly Gln Leu Asp Trp
450 455 460
Val Ile Thr Gly Pro Gly Leu Arg Gly Tyr His Ser Gln Arg Pro Asp
465 470 475 480
Gly Ser Trp Thr Arg Phe Thr Pro Leu Asn Ala Leu Pro Val Glu Tyr
485 490 495
Thr His Pro Arg Ala Gln Leu Ala Asp Leu Met Gly Ala Gly Leu Ser
500 505 510
Asp Leu Val Leu Ile Gly Pro Lys Ser Val Arg Leu Tyr Ala Asn Thr
515 520 525
Arg Asp Gly Phe Ala Lys Gly Lys Asp Val Val Gln Ser Gly Asp Ile
530 535 540
Thr Leu Pro Val Pro Gly Ala Asp Pro Arg Lys Leu Val Ala Phe Ser
545 550 555 560
Asp Val Leu Gly Ser Gly Gln Ala His Leu Val Glu Val Ser Ala Thr
565 570 575
Lys Val Thr Cys Trp Pro Asn Leu Gly Arg Gly Arg Phe Gly Gln Pro
580 585 590
Ile Thr Leu Pro Gly Phe Ser Gln Pro Ala Thr Glu Phe Asn Pro Ala
595 600 605
Gln Val Tyr Leu Ala Asp Leu Asp Gly Ser Gly Pro Thr Asp Leu Ile
610 615 620
Tyr Val His Thr Asn Arg Leu Asp Ile Phe Leu Asn Lys Ser Gly Asn
625 630 635 640
Gly Phe Ala Glu Pro Val Thr Leu Arg Phe Pro Glu Gly Leu Arg Phe
645 650 655
Asp His Thr Cys Gln Leu Gln Met Ala Asp Val Gln Gly Leu Gly Val
660 665 670
Ala Ser Leu Ile Leu Ser Val Pro His Met Ser Pro His His Trp Arg
675 680 685
Cys Asp Leu Thr Asn Met Lys Pro Trp Leu Leu Asn Glu Met Asn Asn
690 695 700
Asn Met Gly Val His His Thr Leu Arg Tyr Arg Ser Ser Ser Gln Phe
705 710 715 720
Trp Leu Asp Glu Lys Ala Ala Ala Leu Thr Thr Gly Gln Thr Pro Val
725 730 735
Cys Tyr Leu Pro Phe Pro Ile His Thr Leu Trp Gln Thr Glu Thr Glu
740 745 750
Asp Glu Ile Ser Gly Asn Lys Leu Val Thr Thr Leu Arg Tyr Ala Arg
755 760 765
Gly Ala Trp Asp Gly Arg Glu Arg Glu Phe Arg Gly Phe Gly Tyr Val
770 775 780
Glu Gln Thr Asp Ser His Gln Leu Ala Gln Gly Asn Ala Pro Glu Arg
785 790 795 800
Thr Pro Pro Ala Leu Thr Lys Asn Trp Tyr Ala Thr Gly Leu Pro Val
805 810 815
Ile Asp Asn Ala Leu Ser Thr Glu Tyr Trp Arg Asp Asp Gln Ala Phe
820 825 830
Ala Gly Phe Ser Pro Arg Phe Thr Thr Trp Gln Asp Asn Lys Asp Val
835 840 845
Pro Leu Thr Pro Glu Asp Asp Asn Ser Arg Tyr Trp Phe Asn Arg Ala
850 855 860
Leu Lys Gly Gln Leu Leu Arg Ser Glu Leu Tyr Gly Leu Asp Asp Ser
865 870 875 880
Thr Asn Lys His Val Pro Tyr Thr Val Thr Glu Phe Arg Ser Gln Val
885 890 895
Arg Arg Leu Gln His Thr Asp Ser Arg Tyr Pro Val Leu Trp Ser Ser
900 905 910
Val Val Glu Ser Arg Asn Tyr His Tyr Glu Arg Ile Ala Ser Asp Pro
915 920 925
Gln Cys Ser Gln Asn Ile Thr Leu Ser Ser Asp Arg Phe Gly Gln Pro
930 935 940
Leu Lys Gln Leu Ser Val Gln Tyr Pro Arg Arg Gln Gln Pro Ala Ile
945 950 955 960
Asn Leu Tyr Pro Asp Thr Leu Pro Asp Lys Leu Leu Ala Asn Ser Tyr
965 970 975
Asp Asp Gln Gln Arg Gln Leu Arg Leu Thr Tyr Gln Gln Ser Ser Trp
980 985 990
His His Leu Thr Asn Asn Thr Val Arg Val Leu Gly Leu Pro Asp Ser
995 1000 1005
Thr Arg Ser Asp Ile Phe Thr Tyr Gly Ala Glu Asn Val Pro Ala
1010 1015 1020
Gly Gly Leu Asn Leu Glu Leu Leu Ser Asp Lys Asn Ser Leu Ile
1025 1030 1035
Ala Asp Asp Lys Pro Arg Glu Tyr Leu Gly Gln Gln Lys Thr Ala
1040 1045 1050
Tyr Thr Asp Gly Gln Asn Thr Thr Pro Leu Gln Thr Pro Thr Arg
1055 1060 1065
Gln Ala Leu Ile Ala Phe Thr Glu Thr Thr Val Phe Asn Gln Ser
1070 1075 1080
Thr Leu Ser Ala Phe Asn Gly Ser Ile Pro Ser Asp Lys Leu Ser
1085 1090 1095
Thr Thr Leu Glu Gln Ala Gly Tyr Gln Gln Thr Asn Tyr Leu Phe
1100 1105 1110
Pro Arg Thr Gly Glu Asp Lys Val Trp Val Ala His His Gly Tyr
1115 1120 1125
Thr Asp Tyr Gly Thr Ala Ala Gln Phe Trp Arg Pro Gln Lys Gln
1130 1135 1140
Ser Asn Thr Gln Leu Thr Gly Lys Ile Thr Leu Ile Trp Asp Ala
1145 1150 1155
Asn Tyr Cys Val Val Val Gln Thr Arg Asp Ala Ala Gly Leu Thr
1160 1165 1170
Thr Ser Ala Lys Tyr Asp Trp Arg Phe Leu Thr Pro Val Gln Leu
1175 1180 1185
Thr Asp Ile Asn Asp Asn Gln His Leu Ile Thr Leu Asp Ala Leu
1190 1195 1200
Gly Arg Pro Ile Thr Leu Arg Phe Trp Gly Thr Glu Asn Gly Lys
1205 1210 1215
Met Thr Gly Tyr Ser Ser Pro Glu Lys Ala Ser Phe Ser Pro Pro
1220 1225 1230
Ser Asp Val Asn Ala Ala Ile Glu Leu Lys Lys Pro Leu Pro Val
1235 1240 1245
Ala Gln Cys Gln Val Tyr Ala Pro Glu Ser Trp Met Pro Val Leu
1250 1255 1260
Ser Gln Lys Thr Phe Asn Arg Leu Ala Glu Gln Asp Trp Gln Lys
1265 1270 1275
Leu Tyr Asn Ala Arg Ile Ile Thr Glu Asp Gly Arg Ile Cys Thr
1280 1285 1290
Leu Ala Tyr Arg Arg Trp Val Gln Ser Gln Lys Ala Ile Pro Gln
1295 1300 1305
Leu Ile Ser Leu Leu Asn Asn Gly Pro Arg Leu Pro Pro His Ser
1310 1315 1320
Leu Thr Leu Thr Thr Asp Arg Tyr Asp His Asp Pro Glu Gln Gln
1325 1330 1335
Ile Arg Gln Gln Val Val Phe Ser Asp Gly Phe Gly Arg Leu Leu
1340 1345 1350
Gln Ala Ala Ala Arg His Glu Ala Gly Met Ala Arg Gln Arg Asn
1355 1360 1365
Glu Asp Gly Ser Leu Ile Ile Asn Val Gln His Thr Glu Asn Arg
1370 1375 1380
Trp Ala Val Thr Gly Arg Thr Glu Tyr Asp Asn Lys Gly Gln Pro
1385 1390 1395
Ile Arg Thr Tyr Gln Pro Tyr Phe Leu Asn Asp Trp Arg Tyr Val
1400 1405 1410
Ser Asn Asp Ser Ala Arg Gln Glu Lys Glu Ala Tyr Ala Asp Thr
1415 1420 1425
His Val Tyr Asp Pro Ile Gly Arg Glu Ile Lys Val Ile Thr Ala
1430 1435 1440
Lys Gly Trp Phe Arg Arg Thr Leu Phe Thr Pro Trp Phe Thr Val
1445 1450 1455
Asn Glu Asp Glu Asn Asp Thr Ala Ala Glu Val Lys Lys Val Lys
1460 1465 1470
Met
<210>7
<211>1485
<212>PRT
<213>发光光杆状菌菌株W14
<400>7
Met Gln Asp Ser Pro Glu Val Ser Ile Thr Thr Leu Ser Leu Pro Lys
1 5 10 15
Gly Gly Gly Ala Ile Asn Gly Met Gly Glu Ala Leu Asn Ala Ala Gly
20 25 30
Pro Asp Gly Met Ala Ser Leu Ser Leu Pro Leu Pro Leu Ser Thr Gly
35 40 45
Arg Gly Thr Ala Pro Gly Leu Ser Leu Ile Tyr Ser Asn Ser Ala Gly
50 55 60
Asn Gly Pro Phe Gly Ile Gly Trp Gln Cys Gly Val Met Ser Ile Ser
65 70 75 80
Arg Arg Thr Gln His Gly Ile Pro Gln Tyr Gly Asn Asp Asp Thr Phe
85 90 95
Leu Ser Pro Gln Gly Glu Val Met Asn Ile Ala Leu Asn Asp Gln Gly
100 105 110
Gln Pro Asp Ile Arg Gln Asp Val Lys Thr Leu Gln Gly Val Thr Leu
115 120 125
Pro Ile Ser Tyr Thr Val Thr Arg Tyr Gln Ala Arg Gln Ile Leu Asp
130 135 140
Phe Ser Lys Ile Glu Tyr Trp Gln Pro Ala Ser Gly Gln Glu Gly Arg
145 150 155 160
Ala Phe Trp Leu Ile Ser Ser Pro Asp Gly Gln Leu His Ile Leu Gly
165 170 175
Lys Thr Ala Gln Ala Cys Leu Ala Asn Pro Gln Asn Asp Gln Gln Ile
180 185 190
Ala Gln Trp Leu Leu Glu Glu Thr Val Thr Pro Ala Gly Glu His Val
195 200 205
Ser Tyr Gln Tyr Arg Ala Glu Asp Glu Ala His Cys Asp Asp Asn Glu
210 215 220
Lys Thr Ala His Pro Asn Val Thr Ala Gln Arg Tyr Leu Val Gln Val
225 230 235 240
Asn Tyr Gly Asn Ile Lys Pro Gln Ala Ser Leu Phe Val Leu Asp Asn
245 250 255
Ala Pro Pro Ala Pro Glu Glu Trp Leu Phe His Leu Val Phe Asp His
260 265 270
Gly Glu Arg Asp Thr Ser Leu His Thr Val Pro Thr Trp Asp Ala Gly
275 280 285
Thr Ala Gln Trp Ser Val Arg Pro Asp Ile Phe Ser Arg Tyr Glu Tyr
290 295 300
Gly Phe Glu Val Arg Thr Arg Arg Leu Cys Gln Gln Val Leu Met Phe
305 310 315 320
His Arg Thr Ala Leu Met Ala Gly Glu Ala Ser Thr Asn Asp Ala Pro
325 330 335
Glu Leu Val Gly Arg Leu Ile Leu Glu Tyr Asp Lys Asn Ala Ser Val
340 345 350
Thr Thr Leu Ile Thr Ile Arg Gln Leu Ser His Glu Ser Asp Gly Ser
355 360 365
Pro Val Thr Gln Pro Pro Leu Glu Leu Ala Trp Gln Arg Phe Asp Leu
370 375 380
Glu Lys Met Pro Thr Trp Gln Arg Phe Asp Ala Leu Asp Asn Phe Asn
385 390 395 400
Ser Gln Gln Arg Tyr Gln Leu Val Asp Leu Arg Gly Glu Gly Leu Pro
405 410 415
Gly Met Leu Tyr Gln Asp Arg Gly Ala Trp Trp Tyr Lys Ala Pro Gln
420 425 430
Arg Gln Glu Asp Gly Asp Ser Asn Ala Val Thr Tyr Asp Lys Ile Ala
435 440 445
Pro Leu Pro Thr Leu Pro Asn Leu Gln Asp Asn Ala Ser Leu Met Asp
450 455 460
Ile Asn Gly Asp Gly Gln Leu Asp Trp Val Val Thr Ala Ser Gly Ile
465 470 475 480
Arg Gly Tyr His Ser Gln Gln Pro Asp Gly Lys Trp Thr His Phe Thr
485 490 495
Pro Ile Asn Ala Leu Pro Val Glu Tyr Phe His Pro Ser Ile Gln Phe
500 505 510
Ala Asp Leu Thr Gly Ala Gly Leu Ser Asp Leu Val Leu Ile Gly Pro
515 520 525
Lys Ser Val Arg Leu Tyr Ala Asn Gln Arg Asn Gly Trp Arg Lys Gly
530 535 540
Glu Asp Val Pro Gln Ser Thr Gly Ile Thr Leu Pro Val Thr Gly Thr
545 550 555 560
Asp Ala Arg Lys Leu Val Ala Phe Ser Asp Met Leu Gly Ser Gly Gln
565 570 575
Gln His Leu Val Glu Ile Lys Ala Asn Arg Val Thr Cys Trp Pro Asn
580 585 590
Leu Gly His Gly Arg Phe Gly Gln Pro Leu Thr Leu Ser Gly Phe Ser
595 600 605
Gln Pro Glu Asn Ser Phe Asn Pro Glu Arg Leu Phe Leu Ala Asp Ile
610 615 620
Asp Gly Ser Gly Thr Thr Asp Leu Ile Tyr Ala Gln Ser Gly Ser Leu
625 630 635 640
Leu Ile Tyr Leu Asn Gln Ser Gly Asn Gln Phe Asp Ala Pro Leu Thr
645 650 655
Leu Ala Leu Pro Glu Gly Val Gln Phe Asp Asn Thr Cys Gln Leu Gln
660 665 670
Val Ala Asp Ile Gln Gly Leu Gly Ile Ala Ser Leu Ile Leu Thr Val
675 680 685
Pro His Ile Ala Pro His His Trp Arg Cys Asp Leu Ser Leu Thr Lys
690 695 700
Pro Trp Leu Leu Asn Val Met Asn Asn Asn Arg Gly Ala His His Thr
705 710 715 720
Leu His Tyr Arg Ser Ser Ala Gln Phe Trp Leu Asp Glu Lys Leu Gln
725 730 735
Leu Thr Lys Ala Gly Lys Ser Pro Ala Cys Tyr Leu Pro Phe Pro Met
740 745 750
His Leu Leu Trp Tyr Thr Glu Ile Gln Asp Glu Ile Ser Gly Asn Arg
755 760 765
Leu Thr Ser Glu Val Asn Tyr Ser His Gly Val Trp Asp Gly Lys Glu
770 775 780
Arg Glu Phe Arg Gly Phe Gly Cys Ile Lys Gln Thr Asp Thr Thr Thr
785 790 795 800
Phe Ser His Gly Thr Ala Pro Glu Gln Ala Ala Pro Ser Leu Ser Ile
805 810 815
Ser Trp Phe Ala Thr Gly Met Asp Glu Val Asp Ser Gln Leu Ala Thr
820 825 830
Glu Tyr Trp Gln Ala Asp Thr Gln Ala Tyr Ser Gly Phe Glu Thr Arg
835 840 845
Tyr Thr Val Trp Asp His Thr Asn Gln Thr Asp Gln Ala Phe Thr Pro
850 855 860
Asn Glu Thr Gln Arg Asn Trp Leu Thr Arg Ala Leu Lys Gly Gln Leu
865 870 875 880
Leu Arg Thr Glu Leu Tyr Gly Leu Asp Gly Thr Asp Lys Gln Thr Val
885 890 895
Pro Tyr Thr Val Ser Glu Ser Arg Tyr Gln Val Arg Ser Ile Pro Val
900 905 910
Asn Lys Glu Thr Glu Leu Ser Ala Trp Val Thr Ala Ile Glu Asn Arg
915 920 925
Ser Tyr His Tyr Glu Arg Ile Ile Thr Asp Pro Gln Phe Ser Gln Ser
930 935 940
Ile Lys Leu Gln His Asp Ile Phe Gly Gln Ser Leu Gln Ser Val Asp
945 950 955 960
Ile Ala Trp Pro Arg Arg Glu Lys Pro Ala Val Asn Pro Tyr Pro Pro
965 970 975
Thr Leu Pro Glu Thr Leu Phe Asp Ser Ser Tyr Asp Asp Gln Gln Gln
980 985 990
Leu Leu Arg Leu Val Arg Gln Lys Asn Ser Trp His His Leu Thr Asp
995 1000 1005
Gly Glu Asn Trp Arg Leu Gly Leu Pro Asn Ala Gln Arg Arg Asp
1010 1015 1020
Val Tyr Thr Tyr Asp Arg Ser Lys Ile Pro Thr Glu Gly Ile Ser
1025 1030 1035
Leu Glu Ile Leu Leu Lys Asp Asp Gly Leu Leu Ala Asp Glu Lys
1040 1045 1050
Ala Ala Val Tyr Leu Gly Gln Gln Gln Thr Phe Tyr Thr Ala Gly
1055 1060 1065
Gln Ala Glu Val Thr Leu Glu Lys Pro Thr Leu Gln Ala Leu Val
1070 1075 1080
Ala Phe Gln Glu Thr Ala Met Met Asp Asp Thr Ser Leu Gln Ala
1085 1090 1095
Tyr Glu Gly Val Ile Glu Glu Gln Glu Leu Asn Thr Ala Leu Thr
1100 1105 1110
Gln Ala Gly Tyr Gln Gln Val Ala Arg Leu Phe Asn Thr Arg Ser
1115 1120 1125
Glu Ser Pro Val Trp Ala Ala Arg Gln Gly Tyr Thr Asp Tyr Gly
1130 1135 1140
Asp Ala Ala Gln Phe Trp Arg Pro Gln Ala Gln Arg Asn Ser Leu
1145 1150 1155
Leu Thr Gly Lys Thr Thr Leu Thr Trp Asp Thr His His Cys Val
1160 1165 1170
Ile Ile Gln Thr Gln Asp Ala Ala Gly Leu Thr Thr Gln Ala His
1175 1180 1185
Tyr Asp Tyr Arg Phe Leu Thr Pro Val Gln Leu Thr Asp Ile Asn
1190 1195 1200
Asp Asn Gln His Ile Val Thr Leu Asp Ala Leu Gly Arg Val Thr
1205 1210 1215
Thr Ser Arg Phe Trp Gly Thr Glu Ala Gly Gln Ala Ala Gly Tyr
1220 1225 1230
Ser Asn Gln Pro Phe Thr Pro Pro Asp Ser Val Asp Lys Ala Leu
1235 1240 1245
Ala Leu Thr Gly Ala Leu Pro Val Ala Gln Cys Leu Val Tyr Ala
1250 1255 1260
Val Asp Ser Trp Met Pro Ser Leu Ser Leu Ser Gln Leu Ser Gln
1265 1270 1275
Ser Gln Glu Glu Ala Glu Ala Leu Trp Ala Gln Leu Arg Ala Ala
1280 1285 1290
His Met Ile Thr Glu Asp Gly Lys Val Cys Ala Leu Ser Gly Lys
1295 1300 1305
Arg Gly Thr Ser His Gln Asn Leu Thr Ile Gln Leu Ile Ser Leu
1310 1315 1320
Leu Ala Ser Ile Pro Arg Leu Pro Pro His Val Leu Gly Ile Thr
1325 1330 1335
Thr Asp Arg Tyr Asp Ser Asp Pro Gln Gln Gln His Gln Gln Thr
1340 1345 1350
Val Ser Phe Ser Asp Gly Phe Gly Arg Leu Leu Gln Ser Ser Ala
1355 1360 1365
Arg His Glu Ser Gly Asp Ala Trp Gln Arg Lys Glu Asp Gly Gly
1370 1375 1380
Leu Val Val Asp Ala Asn Gly Val Leu Val Ser Ala Pro Thr Asp
1385 1390 1395
Thr Arg Trp Ala Val Ser Gly Arg Thr Glu Tyr Asp Asp Lys Gly
1400 1405 1410
Gln Pro Val Arg Thr Tyr Gln Pro Tyr Phe Leu Asn Asp Trp Arg
1415 1420 1425
Tyr Val Ser Asp Asp Ser Ala Arg Asp Asp Leu Phe Ala Asp Thr
1430 1435 1440
His Leu Tyr Asp Pro Leu Gly Arg Glu Tyr Lys Val Ile Thr Ala
1445 1450 1455
Lys Lys Tyr Leu Arg Glu Lys Leu Tyr Thr Pro Trp Phe Ile Val
1460 1465 1470
Ser Glu Asp Glu Asn Asp Thr Ala Ser Arg Thr Pro
1475 1480 1485
<210>8
<211>1493
<212>PRT
<213>嗜线虫致病杆菌
<400>8
Met Gln Gly Ser Thr Pro Leu Lys Leu Glu Ile Pro Ser Leu Pro Ser
1 5 10 15
Gly Gly Gly Ser Leu Lys Gly Met Gly Glu Ala Leu Asn Ala Val Gly
20 25 30
Ala Glu Gly Gly Ala Ser Phe Ser Leu Pro Leu Pro Ile Ser Val Gly
35 40 45
Arg Gly Leu Val Pro Val Leu Ser Leu Asn Tyr Ser Ser Thr Ala Gly
50 55 60
Asn Gly Ser Phe Gly Met Gly Trp Gln Cys Gly Val Gly Phe Ile Ser
65 70 75 80
Leu Arg Thr Ala Lys Gly Val Pro His Tyr Thr Gly Gln Asp Glu Tyr
85 90 95
Leu Gly Pro Asp Gly Glu Val Leu Ser Ile Val Pro Asp Ser Gln Gly
100 105 110
Gln Pro Glu Gln Arg Thr Ala Thr Ser Leu Leu Gly Thr Val Leu Thr
115 120 125
Gln Pro His Thr Val Thr Arg Tyr Gln Ser Arg Val Ala Glu Lys Ile
130 135 140
Val Arg Leu Glu His Trp Gln Pro Gln Gln Arg Arg Glu Glu Glu Thr
l45 150 155 160
Ser Phe Trp Val Leu Phe Thr Ala Asp Gly Leu Val His Leu Phe Gly
165 170 175
Lys His His His Ala Arg Ile Ala Asp Pro Gln Asp Glu Thr Arg Ile
180 185 190
Ala Arg Trp Leu Met Glu Glu Thr Val Thr His Thr Gly Glu His Ile
195 200 205
Tyr Tyr His Tyr Arg Ala Glu Asp Asp Leu Asp Cys Asp Glu His Glu
210 215 220
Leu Ala Gln His Ser Gly Val Thr Ala Gln Arg Tyr Leu Ala Lys Val
225 230 235 240
Ser Tyr Gly Asn Thr Gln Pro Glu Thr Ala Phe Phe Ala Val Lys Ser
245 250 255
Gly Ile Pro Ala Asp Asn Asp Trp Leu Phe His Leu Val Phe Asp Tyr
260 265 270
Gly Glu Arg Ser Ser Ser Leu Asn Ser Val Pro Glu Phe Asn Val Ser
275 280 285
Glu Asn Asn Val Ser Glu Asn Asn Val Pro Glu Lys Trp Arg Cys Arg
290 295 300
Pro Asp Ser Phe Ser Arg Tyr Glu Tyr Gly Phe Glu Ile Arg Thr Arg
305 310 315 320
Arg Leu Cys Arg Gln Val Leu Met Phe His Gln Leu Lys Ala Leu Ala
325 330 335
Gly Glu Lys Val Ala Glu Glu Thr Pro Ala Leu Val Ser Arg Leu Ile
340 345 350
Leu Asp Tyr Asp Leu Asn Asn Lys Val Ser Leu Leu Gln Thr Ala Arg
355 360 365
Arg Leu Ala His Glu Thr Asp Gly Thr Pro Val Met Met Ser Pro Leu
370 375 380
Glu Met Asp Tyr Gln Arg Val Asn His Gly Val Asn Leu Asn Trp Gln
385 390 395 400
Ser Met Pro Gln Leu Glu Lys Met Asn Thr Leu Gln Pro Tyr Gln Leu
405 410 415
Val Asp Leu Tyr Gly Glu Gly Ile Ser Gly Val Leu Tyr Gln Asp Thr
420 425 430
Gln Lys Ala Trp Trp Tyr Arg Ala Pro Val Arg Asp Ile Thr Ala Glu
435 440 445
Gly Thr Asn Ala Val Thr Tyr Glu Glu Ala Lys Pro Leu Pro His Ile
450 455 460
Pro Ala Gln Gln Glu Ser Ala Met Leu Leu Asp Ile Asn Gly Asp Gly
465 470 475 480
Arg Leu Asp Trp Val Ile Thr Ala Ser Gly Leu Arg Gly Tyr His Thr
485 490 495
Met Ser Pro Glu Gly Glu Trp Thr Pro Phe Ile Pro Leu Ser Ala Val
500 505 510
Pro Met Glu Tyr Phe His Pro Gln Ala Lys Leu Ala Asp Ile Asp Gly
515 520 525
Ala Gly Leu Pro Asp Leu Ala Leu Ile Gly Pro Asn Ser Val Arg Val
530 535 540
Trp Ser Asn Asn Arg Ala Gly Trp Asp Arg Ala Gln Asp Val Ile His
545 550 555 560
Leu Ser Asp Met Pro Leu Pro Val Pro Gly Arg Asn Glu Arg His Leu
565 570 575
Val Ala Phe Ser Asp Met Thr Gly Ser Gly Gln Ser His Leu Val Glu
580 585 590
Val Thr Ala Asp Ser Val Arg Tyr Trp Pro Asn Leu Gly His Gly Lys
595 600 605
Phe Gly Glu Pro Leu Met Met Thr Gly Phe Gln Ile Ser Gly Glu Thr
610 615 620
Phe Asn Pro Asp Arg Leu Tyr Met Val Asp Ile Asp Gly Ser Gly Thr
625 630 635 640
Thr Asp Phe Ile Tyr Ala Arg Asn Thr Tyr Leu Glu Leu Tyr Ala Asn
645 650 655
Glu Ser Gly Asn His Phe Ala Glu Pro Gln Arg Ile Asp Leu Pro Asp
660 665 670
Gly Val Arg Phe Asp Asp Thr Cys Arg Leu Gln Ile Ala Asp Thr Gln
675 680 685
Gly Leu Gly Thr Ala Ser Ile Ile Leu Thr Ile Pro His Met Lys Val
690 695 700
Gln His Trp Arg Leu Asp Met Thr Ile Phe Lys Pro Trp Leu Leu Asn
705 710 715 720
Ala Val Asn Asn Asn Met Gly Thr Glu Thr Thr Leu Tyr Tyr Arg Ser
725 730 735
Ser Ala Gln Phe Trp Leu Asp Glu Lys Leu Gln Ala Ser Glu Ser Gly
740 745 750
Met Thr Val Val Ser Tyr Leu Pro Phe Pro Val His Val Leu Trp Arg
755 760 765
Thr Glu Val Leu Asp Glu Ile Ser Gly Asn Arg Leu Thr Ser His Tyr
770 775 780
His Tyr Ser His Gly Ala Trp Asp Gly Leu Glu Arg Glu Phe Arg Gly
785 790 795 800
Phe Gly Arg Val Thr Gln Thr Asp Ile Asp Ser Arg Ala Ser Ala Thr
805 810 815
Gln Gly Thr His Ala Glu Pro Pro Ala Pro Ser Arg Thr Val Asn Trp
820 825 830
Tyr Gly Thr Gly Val Arg Glu Val Asp Ile Leu Leu Pro Thr Glu Tyr
835 840 845
Trp Gln Gly Asp Gln Gln Ala Phe Pro His Phe Thr Pro Arg Phe Thr
850 855 860
Arg Tyr Asp Glu Lys Ser Gly Gly Asp Met Thr Val Thr Pro Ser Glu
865 870 875 880
Gln Glu Glu Tyr Trp Leu His Arg Ala Leu Lys Gly Gln Arg Leu Arg
885 890 895
Ser Glu Leu Tyr Gly Asp Asp Asp Ser Ile Leu Ala Gly Thr Pro Tyr
900 905 910
Ser Val Asp Glu Ser Arg Thr Gln Val Arg Leu Leu Pro Val Met Val
915 920 925
Ser Asp Val Pro Ala Val Leu Val Ser Val Ala Glu Ser Arg Gln Tyr
930 935 940
Arg Tyr Glu Arg Val Ala Thr Asp Pro Gln Cys Ser Gln Lys Ile Val
945 950 955 960
Leu Lys Ser Asp Ala Leu Gly Phe Pro Gln Asp Asn Leu Glu Ile Ala
965 970 975
Tyr Ser Arg Arg Pro Gln Pro Glu Phe Ser Pro Tyr Pro Asp Thr Leu
980 985 990
Pro Glu Thr Leu Phe Thr Ser Ser Phe Asp Glu Gln Gln Met Phe Leu
995 1000 1005
Arg Leu Thr Arg Gln Arg Ser Ser Tyr His His Leu Asn His Asp
1010 1015 1020
Asp Asn Thr Trp Ile Thr Gly Leu Met Asp Thr Ser Arg Ser Asp
1025 1030 1035
Ala Arg Ile Tyr Gln Ala Asp Lys Val Pro Asp Gly Gly Phe Ser
1040 1045 1050
Leu Glu Trp Phe Ser Ala Thr Gly Ala Gly Ala Leu Leu Leu Pro
1055 1060 1065
Asp Ala Ala Ala Asp Tyr Leu Gly His Gln Arg Val Ala Tyr Thr
1070 1075 1080
Gly Pro Glu Glu Gln Pro Ala Ile Pro Pro Leu Val Ala Tyr Ile
1085 1090 1095
Glu Thr Ala Glu Phe Asp Glu Arg Ser Leu Ala Ala Phe Glu Glu
1100 1105 1110
Val Met Asp Glu Gln Glu Leu Thr Lys Gln Leu Asn Asp Ala Gly
1115 1120 1125
Trp Asn Thr Ala Lys Val Pro Phe Ser Glu Lys Thr Asp Phe His
1130 1135 1140
Val Trp Val Gly Gln Lys Glu Phe Thr Glu Tyr Ala Gly Ala Asp
1145 1150 1155
Gly Phe Tyr Arg Pro Leu Val Gln Arg Glu Thr Lys Leu Thr Gly
1160 1165 1170
Lys Thr Thr Val Thr Trp Asp Ser His Tyr Cys Val Ile Thr Ala
1175 1180 1185
Thr Glu Asp Ala Ala Gly Leu Arg Met Gln Ala His Tyr Asp Tyr
1190 1195 1200
Arg Phe Met Val Ala Asp Asn Thr Thr Asp Val Asn Asp Asn Tyr
1205 1210 1215
His Thr Val Thr Phe Asp Ala Leu Gly Arg Val Thr Ser Phe Arg
1220 1225 1230
Phe Trp Gly Thr Glu Asn Gly Glu Lys Gln Gly Tyr Thr Pro Ala
1235 1240 1245
Glu Asn Glu Thr Val Pro Phe Ile Val Pro Thr Thr Val Asp Asp
1250 1255 1260
Ala Leu Ala Leu Lys Pro Gly Ile Pro Val Ala Gly Leu Met Val
1265 1270 1275
Tyr Ala Pro Leu Ser Trp Met Val Gln Ala Ser Phe Ser Asn Asp
1280 1285 1290
Gly Glu Leu Tyr Gly Glu Leu Lys Pro Ala Gly Ile Ile Thr Glu
1295 1300 1305
Asp Gly Tyr Leu Leu Ser Leu Ala Phe Arg Arg Trp Gln Gln Asn
1310 1315 1320
Asn Pro Ala Ala Ala Met Pro Lys Gln Val Asn Ser Gln Asn Pro
1325 1330 1335
Pro His Val Leu Ser Val Ile Thr Asp Arg Tyr Asp Ala Asp Pro
1340 1345 1350
Glu Gln Gln Leu Arg Gln Thr Phe Thr Phe Ser Asp Gly Phe Gly
1355 1360 1365
Arg Thr Leu Gln Thr Ala Val Arg His Glu Ser Gly Glu Ala Trp
1370 1375 1380
Val Arg Asp Glu Tyr Gly Ala Ile Val Ala Glu Asn His Gly Ala
1385 1390 1395
Pro Glu Thr Ala Met Thr Asp Phe Arg Trp Ala Val Ser Gly Arg
1400 1405 1410
Thr Glu Tyr Asp Gly Lys Gly Gln Ala Leu Arg Lys Tyr Gln Pro
1415 1420 1425
Tyr Phe Leu Asn Ser Trp Gln Tyr Val Ser Asp Asp Ser Ala Arg
1430 1435 1440
Gln Asp Ile Tyr Ala Asp Thr His Tyr Tyr Asp Pro Leu Gly Arg
1445 1450 1455
Glu Tyr Gln Val Ile Thr Ala Lys Gly Gly Phe Arg Arg Ser Leu
1460 1465 1470
Phe Thr Pro Trp Phe Val Val Asn Glu Asp Glu Asn Asp Thr Ala
1475 1480 1485
Gly Glu Met Thr Ala
1490
<210>9
<211>1506
<212>PRT
<213>伯氏致病杆菌(Xenorhabdus bovienii)
<400>9
Met Lys Gln Asp Ser Gln Asp Met Thr Val Thr Gln Leu Ser Leu Pro
1 5 10 15
Lys Gly Gly Gly Ala Ile Ser Gly Met Gly Asp Thr Ile Ser Asn Ala
20 25 30
Gly Pro Asp Gly Met Ala Ser Leu Ser Val Pro Leu Pro Ile Ser Ala
35 40 45
Gly Arg Gly Gly Ala Pro Asn Leu Ser Leu Asn Tyr Ser Ser Gly Ala
50 55 60
Gly Asn Gly Ser Phe Gly Ile Gly Trp Gln Ser Ser Thr Met Ala Ile
65 70 75 80
Ser Arg Arg Thr Gln His Gly Val Pro Gln Tyr His Gly Glu Asp Thr
85 90 95
Phe Leu Cys Pro Met Gly Glu Val Met Ala Val Ala Val Asn Gln Ser
100 105 110
Gly Gln Pro Asp Val Arg Lys Thr Asp Lys Leu Leu Gly Gly Gln Leu
115 120 125
Pro Val Thr Tyr Thr Val Thr Arg His Gln Pro Arg Asn Ile Gln His
130 135 140
Phe Ser Lys Leu Glu Tyr Trp Gln Pro Pro Thr Asp Val Glu Thr Thr
145 150 155 160
Pro Phe Trp Leu Met Tyr Ser Pro Asp Gly Gln Ile His Ile Phe Gly
165 170 175
Lys Thr Glu Gln Ala Gln Ile Ala Asn Pro Ala Glu Val Ser Gln Ile
180 185 190
Ala Gln Trp Leu Leu Glu Glu Thr Val Thr Pro Ala Gly Glu His Ile
195 200 205
Tyr Tyr Gln Tyr Arg Ala Glu Asp Asp Ile Gly Cys Asp Asp Ser Glu
210 215 220
Lys Asn Ala His Pro Asn Ala Ser Ala Gln Arg Tyr Leu Thr Gln Val
225 230 235 240
Asn Tyr Gly Asn Ile Thr Pro Glu Ser Ser Leu Leu Val Leu Lys Asn
245 250 255
Thr Pro Pro Ala Asp Asn Glu Trp Leu Phe His Leu Val Phe Asp Tyr
260 265 270
Gly Glu Arg Ala Gln Glu Ile Asn Thr Val Pro Pro Phe Lys Ala Pro
275 280 285
Ser Asn Asn Trp Lys Ile Arg Pro Asp Arg Phe Ser Arg Phe Glu Tyr
290 295 300
Gly Phe Glu Val Arg Thr Arg Arg Leu Cys Gln Gln Ile Leu Met Phe
305 310 315 320
His Arg Leu Lys Ser Leu Ala Gly Glu Gln Ile Asp Gly Glu Glu Ile
325 330 335
Pro Ala Leu Val Ala Arg Leu Leu Leu Ser Tyr Asp Leu Asn Asp Ser
340 345 350
Val Thr Thr Leu Thr Ala Ile Arg Gln Met Ala Tyr Glu Thr Asp Ala
355 360 365
Thr Leu Ile Ala Leu Pro Pro Leu Glu Phe Asp Tyr Gln Pro Phe Glu
370 375 380
Ala Lys Val Thr Gln Lys Trp Gln Glu Met Pro Gln Leu Ala Gly Leu
385 390 395 400
Asn Ala Gln Gln Pro Tyr Gln Leu Val Asp Leu Tyr Gly Glu Gly Ile
405 410 415
Ser Gly Ile Leu Tyr Gln Asp Arg Pro Gly Ala Trp Trp Tyr Gln Ala
420 425 430
Pro Ile Arg Gln Lys Asn Val Glu Asp Ile Asn Ala Val Thr Tyr Ser
435 440 445
Pro Ile Asn Pro Leu Pro Lys Ile Pro Ser Gln Gln Asp Arg Ala Thr
450 455 460
Leu Met Asp Ile Asp Gly Asp Gly His Leu Asp Trp Val Ile Ala Gly
465 470 475 480
Ala Gly Ile Gln Gly Arg Tyr Ser Met Gln Pro Asn Gly Glu Trp Thr
485 490 495
His Phe Ile Pro Ile Ser Ala Leu Pro Thr Glu Tyr Phe His Pro Gln
500 505 510
Ala Gln Leu Ala Asp Leu Val Gly Ala Gly Leu Ser Asp Leu Ala Leu
515 520 525
Ile Gly Pro Arg Ser Val Arg Leu Tyr Ala Asn Asp Arg Gly Asn Trp
530 535 540
Lys Ala Gly Ile Asn Val Met Pro Pro Asp Gly Val Asn Leu Pro Ile
545 550 555 560
Phe Gly Gly Asp Ala Ser Ser Leu Val Ala Phe Ser Asp Met Leu Gly
565 570 575
Ser Gly Gln Gln His Leu Val Glu Ile Ala Ala Gln Ser Val Lys Cys
580 585 590
Trp Pro Asn Leu Gly His Gly Arg Phe Gly Ala Ala Ile Leu Leu Pro
595 600 605
Gly Phe Ser Gln Pro Asn Gly Thr Phe Asn Ala Asn Gln Val Phe Leu
610 615 620
Ala Asp Ile Asp Gly Ser Gly Thr Ala Asp Ile Ile Tyr Ala His Ser
625 630 635 640
Thr Tyr Leu Asp Ile Tyr Leu Asn Glu Ser Gly Asn Arg Phe Ser Ala
645 650 655
Pro Val Arg Leu Asn Leu Pro Glu Gly Val Met Phe Asp Asn Thr Cys
660 665 670
Gln Leu Gln Val Ser Asp Ile Gln Gly Leu Gly Ala Ala Ser Ile Val
675 680 685
Leu Thr Val Pro His Met Thr Pro Arg His Trp Arg Tyr Asp Phe Thr
690 695 700
His Asn Lys Pro Trp Leu Leu Asn Val Ile Asn Asn Asn Arg Gly Ala
705 710 715 720
Glu Thr Thr Leu Phe Tyr Arg Ser Ser Ala Gln Phe Trp Leu Asp Glu
725 730 735
Lys Ser Gln Ile Glu Glu Leu Gly Lys Phe Ala Ala Ser Tyr Leu Pro
740 745 750
Phe Pro Ile His Leu Leu Trp Arg Asn Glu Ala Leu Asp Glu Ile Thr
755 760 765
Gly Asn Arg Leu Thr Lys Val Met Asn Tyr Ala His Gly Ala Trp Asp
770 775 780
Gly Arg Glu Arg Glu Phe Cys Gly Phe Gly Arg Val Thr Gln Ile Asp
785 790 795 800
Thr Asp Glu Phe Ala Lys Gly Thr Thr Glu Lys Ala Pro Asp Glu Asn
805 810 815
Ile Tyr Pro Ser Arg Ser Ile Ser Trp Phe Ala Thr Gly Leu Pro Glu
820 825 830
Val Asp Ser Gln Leu Pro Ala Glu Tyr Trp Arg Gly Asp Asp Gln Ala
835 840 845
Phe Ala Gly Phe Thr Pro Arg Phe Thr Arg Tyr Glu Lys Gly Asn Ala
850 855 860
Gly Gln Glu Gly Gln Asp Thr Pro Ile Lys Glu Pro Thr Glu Thr Glu
865 870 875 880
Ala Tyr Trp Leu Asn Arg Ala Met Lys Gly Gln Leu Leu Arg Ser Glu
885 890 895
Val Tyr Gly Asp Asp Lys Thr Glu Lys Ala Lys Ile Pro Tyr Thr Val
900 905 910
Thr Glu Ala Arg Cys Gln Val Arg Leu Ile Pro Ser Asn Asp Glu Ala
915 920 925
Ala Pro Ser Ser Trp Thr Ser Ile Ile Glu Asn Arg Ser Tyr His Tyr
930 935 940
Glu Arg Ile Val Val Asp Pro Ser Cys Lys Gln Gln Val Val Leu Lys
945 950 955 960
Ala Asp Glu Tyr Gly Phe Pro Leu Ala Lys Val Asp Ile Ala Tyr Pro
965 970 975
Arg Arg Asn Lys Pro Ala Gln Asn Pro Tyr Pro Asp Ser Leu Pro Asp
980 985 990
Thr Leu Phe Ala Asp Ser Tyr Asp Asp Gln Gln Lys Gln Leu Tyr Leu
995 1000 1005
Thr Lys Gln Gln Gln Ser Tyr Tyr His Leu Thr Gln Gln Asp Asp
1010 1015 1020
Trp Val Leu Gly Leu Thr Asp Ser Arg Tyr Ser Glu Val Tyr His
1025 1030 1035
Tyr Ala Gln Thr Asp Ala Gln Ser Asp Ile Pro Lys Ala Gly Leu
1040 1045 1050
Ile Leu Glu Asp Leu Leu Lys Val Asp Gly Leu Ile Gly Lys Asp
1055 1060 1065
Lys Thr Phe Ile Tyr Leu Gly Gln Gln Arg Val Ala Tyr Val Gly
1070 1075 1080
Gly Asp Ala Glu Lys Pro Thr Arg Gln Val Arg Val Ala Tyr Thr
1085 1090 1095
Glu Thr Ala Ala Phe Asp Asp Asn Ala Leu His Ala Phe Asp Gly
1100 1105 1110
Val Ile Ala Pro Asp Glu Leu Thr Gln Gln Leu Leu Ala Gly Gly
1115 1120 1125
Tyr Leu Leu Val Pro Gln Ile Ser Asp Val Ala Gly Ser Ser Glu
1130 1135 1140
Lys Val Trp Val Ala Arg Gln Gly Tyr Thr Glu Tyr Gly Ser Ala
1145 1150 1155
Ala Gln Phe Tyr Arg Pro Leu Ile Gln Arg Lys Ser Leu Leu Thr
1160 1165 1170
Gly Lys Tyr Thr Leu Ser Trp Asp Thr His Tyr Cys Val Val Val
1175 1180 1185
Lys Thr Glu Asp Gly Ala Gly Met Thr Thr Gln Ala Lys Tyr Asp
1190 1195 1200
Tyr Arg Phe Leu Leu Pro Ala Gln Leu Thr Asp Ile Asn Asp Asn
1205 1210 1215
Gln His Ile Val Thr Phe Asn Ala Leu Gly Gln Val Thr Ser Ser
1220 1225 1230
Arg Phe Trp Gly Thr Glu Asn Gly Lys Ile Ser Gly Tyr Ser Thr
1235 1240 1245
Pro Glu Ser Lys Pro Phe Thr Val Pro Asp Thr Val Glu Lys Ala
1250 1255 1260
Leu Ala Leu Gln Pro Thr Ile Pro Val Ser Gln Cys Asn Ile Tyr
1265 1270 1275
Val Pro Asp Ser Trp Met Arg Leu Leu Pro Gln Gln Ser Leu Thr
1280 1285 1290
Gly Gln Leu Lys Glu Gly Glu Thr Leu Trp Asn Ala Leu His Arg
1295 1300 1305
Ala Gly Val Val Thr Glu Asp Gly Leu Ile Cys Glu Leu Ala Tyr
1310 1315 1320
Arg Arg Trp Ile Lys Arg Gln Ala Thr Ser Ser Met Met Ala Val
1325 1330 1335
Thr Leu Gln Gln Ile Leu Ala Gln Thr Pro Arg Gln Pro Pro His
1340 1345 1350
Ala Met Thr Ile Thr Thr Asp Arg Tyr Asp Ser Asp Ser Gln Gln
1355 1360 1365
Gln Leu Arg Gln Ser Ile Val Leu Ser Asp Gly Phe Gly Arg Val
1370 1375 1380
Leu Gln Ser Ala Gln Arg His Glu Ala Gly Glu Ala Trp Gln Arg
1385 1390 1395
Ala Glu Asp Gly Ser Leu Val Val Asp Asn Thr Gly Lys Pro Val
1400 1405 1410
Val Ala Asn Thr Thr Thr Arg Trp Ala Val Ser Gly Arg Thr Glu
1415 1420 1425
Tyr Asp Gly Lys Gly Gln Ala Ile Arg Ala Tyr Leu Pro Tyr Tyr
1430 1435 1440
Leu Asn Asp Trp Arg Tyr Val Ser Asp Asp Ser Ala Arg Asp Asp
1445 1450 1455
Leu Tyr Ala Asp Thr His Phe Tyr Asp Pro Leu Gly Arg Glu Tyr
1460 1465 1470
Gln Val Lys Thr Ala Lys Gly Phe Trp Arg Glu Asn Met Phe Met
1475 1480 1485
Pro Trp Phe Val Val Asn Glu Asp Glu Asn Asp Thr Ala Ala Arg
1490 1495 1500
Leu Thr Ser
1505
<210>10
<211>1444
<212>PRT
<213>类芽胞杆菌属(Paenibacillus)菌株DAS1529
<400>10
Met Pro Gln Ser Ser Asn Ala Asp Ile Lys Leu Leu Ser Pro Ser Leu
1 5 10 15
Pro Lys Gly Gly Gly Ser Met Lys Gly Ile Glu Glu Asn Ile Ala Ala
20 25 30
Pro Gly Ser Asp Gly Met Ala Arg Cys Asn Val Pro Leu Pro Val Thr
35 40 45
Ser Gly Arg Tyr Ile Thr Pro Asp Ile Ser Leu Ser Tyr Ala Ser Gly
50 55 60
His Gly Asn Gly Ala Tyr Gly Met Gly Trp Thr Met Gly Val Met Ser
65 70 75 80
Ile Ser Arg Arg Thr Ser Arg Gly Thr Pro Ser Tyr Thr Ser Glu Asp
85 90 95
Gln Phe Leu Gly Pro Asp Gly Glu Val Leu Val Pro Glu Ser Asn Glu
100 105 110
Gln Gly Glu Ile Ile Thr Arg His Thr Asp Thr Ala Gln Gly Ile Pro
115 120 125
Leu Gly Glu Thr Phe Thr Val Thr Arg Tyr Phe Pro Arg Ile Glu Ser
130 135 140
Ala Phe His Leu Leu Glu Tyr Trp Glu Ala Gln Ala Gly Ser Ala Thr
145 150 155 160
Ala Ser Phe Trp Leu Ile His Ser Ala Asp Gly Val Leu His Cys Leu
165 170 175
Gly Lys Thr Ala Gln Ala Arg Ile Ala Ala Pro Asp Asp Ser Ala Lys
180 185 190
Ile Ala Glu Trp Leu Val Glu Glu Ser Val Ser Pro Phe Gly Glu His
195 200 205
Ile Tyr Tyr Gln Tyr Lys Glu Glu Asp Asn Gln Gly Val Asn Leu Glu
210 215 220
Glu Asp Asn His Gln Tyr Gly Ala Asn Arg Tyr Leu Lys Ser Ile Arg
225 230 235 240
Tyr Gly Asn Lys Val Ala Ser Pro Ser Leu Tyr Val Trp Lys Gly Glu
245 250 255
Ile Pro Ala Asp Gly Gln Trp Leu Tyr Ser Val Ile Leu Asp Tyr Gly
260 265 270
Glu Asn Asp Thr Ser Ala Asp Val Pro Pro Leu Tyr Thr Pro Gln Gly
275 280 285
Glu Trp Leu Val Arg Pro Asp Arg Phe Ser Arg Tyr Asp Tyr Gly Phe
290 295 300
Glu Val Arg Thr Cys Arg Leu Cys Arg Gln Val Leu Met Phe His Val
305 310 315 320
Phe Lys Glu Leu Gly Gly Glu Pro Ala Leu Val Trp Arg Met Gln Leu
325 330 335
Glu Tyr Asp Glu Asn Pro Ala Ala Ser Met Leu Ser Ala Val Arg Gln
340 345 350
Leu Ala Tyr Glu Ala Asp Gly Ala Ile Arg Ser Leu Pro Pro Leu Glu
355 360 365
Phe Asp Tyr Thr Pro Phe Gly Ile Glu Thr Thr Ala Asp Trp Gln Pro
370 375 380
Phe Leu Pro Val Pro Glu Trp Ala Asp Glu Glu His Tyr Gln Leu Val
385 390 395 400
Asp Leu Tyr Gly Glu Gly Ile Pro Gly Leu Leu Tyr Gln Asn Asn Asp
405 410 415
His Trp His Tyr Arg Ser Pro Ala Arg Gly Asp Thr Pro Asp Gly Ile
420 425 430
Ala Tyr Asn Ser Trp Arg Pro Leu Pro His Ile Pro Val Asn Ser Arg
435 440 445
Asn Gly Met Leu Met Asp Leu Asn Gly Asp Gly Tyr Leu Glu Trp Leu
450 455 460
Leu Ala Glu Pro Gly Val Ala Gly Arg Tyr Ser Met Asn Pro Asp Lys
465 470 475 480
Ser Trp Ser Gly Phe Val Pro Leu Gln Ala Leu Pro Thr Glu Phe Phe
485 490 495
His Pro Gln Ala Gln Leu Ala Asn Val Thr Gly Ser Gly Leu Thr Asp
500 505 510
Leu Val Met Ile Gly Pro Lys Ser Val Arg Phe Tyr Ala Gly Glu Glu
515 520 525
Ala Gly Phe Lys Arg Ala Cys Glu Val Trp Gln Gln Val Gly Ile Thr
530 535 540
Leu Pro Val Glu Arg Val Asp Lys Lys Glu Leu Val Ala Phe Ser Asp
545 550 555 560
Met Leu Gly Ser Gly Gln Ser His Leu Val Arg Ile Arg His Asp Gly
565 570 575
Val Thr Cys Trp Pro Asn Leu Gly Asn Gly Val Phe Gly Ala Pro Leu
580 585 590
Ala Leu His Gly Phe Thr Ala Ser Glu Arg Glu Phe Asn Pro Glu Arg
595 600 605
Val Tyr Leu Val Asp Leu Asp Gly Ser Gly Ala Ser Asp Ile Ile Tyr
610 615 620
Ala Ser Arg Asp Ala Leu Leu Ile Tyr Arg Asn Leu Ser Gly Asn Gly
625 630 635 640
Phe Ala Asp Pro Val Arg Val Pro Leu Pro Asp Gly Val Arg Phe Asp
645 650 655
Asn Leu Cys Arg Leu Leu Pro Ala Asp Ile Arg Gly Leu Gly Val Ala
660 665 670
Ser Leu Val Leu His Val Pro Tyr Met Ala Pro Arg Ser Trp Lys Leu
675 680 685
Asp Phe Phe Ala Ala Lys Pro Tyr Leu Leu Gln Thr Val Ser Asn Asn
690 695 700
Leu Gly Ala Ser Ser Ser Phe Trp Tyr Arg Ser Ser Thr Gln Tyr Trp
705 710 715 720
Leu Asp Glu Lys Gln Ala Ala Ser Ser Ala Val Ser Ala Leu Pro Phe
725 730 735
Pro Ile Asn Val Val Ser Asp Met His Thr Val Asp Glu Ile Ser Gly
740 745 750
Arg Thr Arg Thr Gln Lys Tyr Thr Tyr Arg His Gly Val Tyr Asp Arg
755 760 765
Thr Glu Lys Glu Phe Ala Gly Phe Gly Arg Ile Asp Thr Trp Glu Glu
770 775 780
Glu Arg Asp Ser Glu Gly Thr Leu Ser Val Ser Thr Pro Pro Val Leu
785 790 795 800
Thr Arg Thr Trp Tyr His Thr Gly Gln Lys Gln Asp Glu Glu Arg Ala
805 810 815
Val Gln Gln Tyr Trp Gln Gly Asp Pro Ala Ala Phe Gln Val Lys Pro
820 825 830
Val Arg Leu Thr Arg Phe Asp Ala Ala Ala Ala Gln Asp Leu Pro Leu
835 840 845
Asp Ser Asn Asn Gly Gln Gln Glu Tyr Trp Leu Tyr Arg Ser Leu Gln
850 855 860
Gly Met Pro Leu Arg Thr Glu Ile Phe Ala Gly Asp Val Gly Gly Ser
865 870 875 880
Pro Pro Tyr Gln Val Glu Ser Phe Arg Tyr Gln Val Arg Leu Val Gln
885 890 895
Ser Ile Asp Ser Glu Cys Val Ala Leu Pro Met Gln Leu Glu Gln Leu
900 905 910
Thr Tyr Asn Tyr Glu Gln Ile Ala Ser Asp Pro Gln Cys Ser Gln Gln
915 920 925
Ile Gln Gln Trp Phe Asp Glu Tyr Gly Val Ala Ala Gln Ser Val Thr
930 935 940
Ile Gln Tyr Pro Arg Arg Ala Gln Pro Glu Asp Asn Pro Tyr Pro Arg
945 950 955 960
Thr Leu Pro Asp Thr Ser Trp Ser Ser Ser Tyr Asp Ser Gln Gln Met
965 970 975
Leu Leu Arg Leu Thr Arg Gln Arg Gln Lys Ala Tyr His Leu Ala Asp
980 985 990
Pro Glu Gly Trp Arg Leu Asn Ile Pro His Gln Thr Arg Leu Asp Ala
995 1000 1005
Phe Ile Tyr Ser Ala Asp Ser Val Pro Ala Glu Gly Ile Ser Ala
1010 1015 1020
Glu Leu Leu Glu Val Asp Gly Thr Leu Arg Ser Ser Ala Leu Glu
1025 1030 1035
Gln Ala Tyr Gly Gly Gln Ser Glu Ile Ile Tyr Ala Gly Gly Gly
1040 1045 1050
Glu Pro Asp Leu Arg Ala Leu Val His Tyr Thr Arg Ser Ala Val
1055 1060 1065
Leu Asp Glu Asp Cys Leu Gln Ala Tyr Glu Gly Val Leu Ser Asp
1070 1075 1080
Ser Gln Leu Asn Ser Leu Leu Ala Ser Ser Gly Tyr Gln Arg Ser
1085 1090 1095
Ala Arg Ile Leu Gly Ser Gly Asp Glu Val Asp Ile Phe Val Ala
1100 1105 1110
Glu Gln Gly Phe Thr Arg Tyr Ala Asp Glu Pro Asn Phe Phe Arg
1115 1120 1125
Ile Leu Gly Gln Gln Ser Ser Leu Leu Ser Gly Glu Gln Val Leu
1130 1135 1140
Thr Trp Asp Asp Asn Phe Cys Ala Val Thr Ser Ile Glu Asp Ala
1145 1150 1155
Leu Gly Asn Gln Ile Gln Ile Ala Tyr Asp Tyr Arg Phe Val Glu
1160 1165 1170
Ala Ile GlnIle Thr Asp Thr Asn Asn Asn Val Asn Gln Val Ala
1175 1180 1185
Leu Asp Ala Leu Gly Arg Val Val Tyr Ser Arg Thr Trp Gly Thr
1190 1195 1200
Glu Glu Gly Ile Lys Thr Gly Phe Arg Pro Glu Val Glu Phe Ala
1205 1210 1215
Thr Pro Glu Thr Met Glu Gln Ala Leu Ala Leu Ala Ser Pro Leu
1220 1225 1230
Pro Val Ala Ser Cys Cys Val Tyr Asp Ala His Ser Trp Met Gly
1235 1240 1245
Thr Ile Thr Leu Ala Gln Leu Ser Glu Leu Val Pro Asp Ser Glu
1250 1255 1260
Lys Gln Trp Ser Phe Leu Ile Asp Asn Arg Leu Ile Met Pro Asp
1265 1270 1275
Gly Arg Ile Arg Ser Arg Gly Arg Asp Pro Trp Ser Leu His Arg
1280 1285 1290
Leu Leu Pro Pro Ala Val Gly Glu Leu Leu Ser Glu Ala Asp Arg
1295 1300 1305
Lys Pro Pro His Thr Val Ile Leu Ala Ala Asp Arg Tyr Pro Asp
1310 1315 1320
Asp Pro Ser Gln Gln Ile Gln Ala Ser Ile Val Phe Ser Asp Gly
1325 1330 1335
Phe Gly Arg Thr Ile Gln Thr Ala Lys Arg Glu Asp Thr Arg Trp
1340 1345 1350
Ala Ile Ala Glu Arg Val Asp Tyr Asp Gly Thr Gly Ala Val Ile
1355 1360 1365
Arg Ser Phe Gln Pro Phe Tyr Leu Asp Asp Trp Asn Tyr Val Gly
1370 1375 1380
Glu Glu Ala Val Ser Ser Ser Met Tyr Ala Thr Ile Tyr Tyr Tyr
1385 1390 1395
Asp Ala Leu Ala Arg Gln Leu Arg Met Val Asn Ala Lys Gly Tyr
1400 1405 1410
Glu Arg Arg Thr Ala Phe Tyr Pro Trp Phe Thr Val Asn Glu Asp
1415 1420 1425
Glu Asn Asp Thr Met Asp Ser Ser Leu Phe Ala Ser Pro Pro Ala
1430 1435 1440
Arg
<210>11
<211>1428
<212>PRT
<213>嗜虫沙雷氏菌(Serratia entomophila)
<400>11
Met Gln Asn His Gln Asp Met Ala Ile Thr Ala Pro Thr Leu Pro Ser
1 5 10 15
Gly Gly Gly Ala Val Thr Gly Leu Lys Gly Asp Ile Ala Ala Ala Gly
20 25 30
Pro Asp Gly Ala Ala Thr Leu Ser Ile Pro Leu Pro Val Ser Pro Gly
35 40 45
Arg Gly Tyr Ala Pro Thr Gly Ala Leu Asn Tyr His Ser Arg Ser Gly
50 55 60
Asn Gly Pro Phe Gly Ile Gly Trp Gly Ile Gly Gly Ala Ala Val Gln
65 70 75 80
Arg Arg Thr Arg Asn Gly Ala Pro Thr Tyr Asp Asp Thr Asp Glu Phe
85 90 95
Thr Gly Pro Asp Gly Glu Val Leu Val Pro Ala Leu Thr Ala Ala Gly
100 105 110
Thr Gln Glu Ala Arg Gln Ala Thr Ser Leu Leu Gly Ile Asn Pro Gly
115 120 125
Gly Ser Phe Asn Val Gln Val Tyr Arg Ser Arg Thr Glu Gly Ser Leu
130 135 140
Ser Arg Leu Glu Arg Trp Leu Pro Ala Asp Glu Thr Glu Thr Glu Phe
145 150 155 160
Trp Val Leu Tyr Thr Pro Asp Gly Gln Val Ala Leu Leu Gly Arg Asn
165 170 175
Ala Gln Ala Arg Ile Ser Asn Pro Thr Ala Pro Thr Gln Thr Ala Val
180 185 190
Trp Leu Met Glu Ser Ser Val Ser Leu Thr Gly Glu Gln Met Tyr Tyr
195 200 205
Gln Tyr Arg Ala Glu Asp Asp Asp Gly Cys Asp Glu Ala Glu Arg Asp
210 215 220
Ala His Pro Gln Ala Gly Ala Gln Arg Tyr Pro Val Ala Val Trp Tyr
225 230 235 240
Gly Asn Arg Gln Ala Ala Arg Thr Leu Pro Ala Leu Val Ser Thr Pro
245 250 255
Ser Met Asp Ser Trp Leu Phe Ile Leu Val Phe Asp Tyr Gly Glu Arg
260 265 270
Ser Ser Val Leu Ser Glu Ala Pro Ala Trp Gln Thr Pro Gly Ser Gly
275 280 285
Glu Trp Leu Cys Arg Gln Asp Cys Phe Ser Gly Tyr Glu Phe Gly Phe
290 295 300
Asn Leu Arg Thr Arg Arg Leu Cys Arg Gln Val Leu Met Phe His Tyr
305 310 315 320
Leu Gly Val Leu Ala Gly Ser Ser Gly Ala Asn Asp Ala Pro Ala Leu
325 330 335
Ile Ser Arg Leu Leu Leu Asp Tyr Arg Glu Ser Pro Ser Leu Ser Leu
340 345 350
Leu Glu Asn Val His Gln Val Ala Tyr Glu Ser Asp Gly Thr Ser Cys
355 360 365
Ala Leu Pro Ala Leu Ala Leu Gly Trp Gln Thr Phe Thr Pro Pro Thr
370 375 380
Leu Ser Ala Trp Gln Thr Arg Asp Asp Met Gly Lys Leu Ser Leu Leu
385 390 395 400
Gln Pro Tyr Gln Leu Val Asp Leu Asn Gly Glu Gly Val Val Gly Ile
405 410 415
Leu Tyr Gln Asp Ser Gly Ala Trp Trp Tyr Arg Glu Pro Val Arg Gln
420 425 430
Ser Gly Asp Asp Pro Asp Ala Val Thr Trp Gly Ala Ala Ala Ala Leu
435 440 445
Pro Thr Met Pro Ala Leu His Asn Ser Gly Ile Leu Ala Asp Leu Asn
450 455 460
Gly Asp Gly Arg Leu Glu Trp Val Val Thr Ala Pro Gly Val Ala Gly
465 470 475 480
Met Tyr Asp Arg Thr Pro Gly Arg Asp Trp Leu His Phe Thr Pro Leu
485 490 495
Ser Ala Leu Pro Val Glu Tyr Ala His Pro Lys Ala Val Leu Ala Asp
500 505 510
Ile Leu Gly Ala Gly Leu Thr Asp Met Val Leu Ile Gly Pro Arg Ser
515 520 525
Val Arg Leu Tyr Ser Gly Lys Asn Asp Gly Trp Asn Lys Gly Glu Thr
530 535 540
Val Gln Gln Thr Glu Arg Leu Thr Leu Pro Val Pro Gly Val Asp Pro
545 550 555 560
Arg Thr Leu Val Ala Phe Ser Asp Met Ala Gly Ser Gly Gln Gln His
565 570 575
Leu Thr Glu Val Arg Ala Asn Gly Val Arg Tyr Trp Pro Asn Leu Gly
580 585 590
His Gly Arg Phe Gly Gln Pro Val Asn Ile Pro Gly Phe Ser Gln Ser
595 600 605
Val Thr Thr Phe Asn Pro Asp Gln Ile Leu Leu Ala Asp Thr Asp Gly
610 615 620
Ser Gly Thr Thr Asp Leu Ile Tyr Ala Met Ser Asp Arg Leu Val Ile
625 630 635 640
Tyr Phe Asn Gln Ser Gly Asn Tyr Phe Ala Glu Pro His Thr Leu Leu
645 650 655
Leu Pro Lys Gly Val Arg Tyr Asp Arg Thr Cys Ser Leu Gln Val Ala
660 665 670
Asp Ile Gln Gly Leu Gly Val Pro Ser Leu Leu Leu Thr Val Pro His
675 680 685
Val Ala Pro His His Trp Val Cys His Leu Ser Ala Asp Lys Pro Trp
690 695 700
Leu Leu Asn Gly Met Asn Asn Asn Met Gly Ala Arg His Ala Leu His
705 710 715 720
Tyr Arg Ser Ser Val Gln Phe Trp Leu Asp Glu Lys Ala Glu Ala Leu
725 730 735
Ala Ala Gly Ser Ser Pro Ala Cys Tyr Leu Pro Phe Thr Leu His Thr
740 745 750
Leu Trp Arg Ser Val Val Gln Asp Glu Ile Thr Gly Asn Arg Leu Val
755 760 765
Ser Asp Val Leu Tyr Arg His Gly Val Trp Asp Gly Gln Glu Arg Glu
770 775 780
Phe Arg Gly Phe Gly Phe Val Glu Ile Arg Asp Thr Asp Thr Leu Ala
785 790 795 800
Ser Gln Gly Thr Ala Thr Glu Leu Ser Met Pro Ser Val Ser Arg Asn
805 810 815
Trp Tyr Ala Thr Gly Val Pro Ala Val Asp Glu Arg Leu Pro Glu Thr
820 825 830
Tyr Trp Gln Asn Asp Ala Ala Ala Phe Ala Asp Phe Ala Thr Arg Phe
835 840 845
Thr Val Gly Ser Gly Glu Asp Glu Gln Thr Tyr Thr Pro Asp Asp Ser
850 855 860
Lys Thr Phe Trp Leu Gln Arg Ala Leu Lys Gly Ile Leu Leu Arg Ser
865 870 875 880
Glu Leu Tyr Gly Ala Asp Gly Ser Ser Gln Ala Asp Ile Pro Tyr Ser
885 890 895
Val Thr Glu Ser Arg Pro Gln Val Arg Leu Val Glu Ala Asn Gly Asp
900 905 910
Tyr Pro Val Val Trp Pro Met Gly Ala Glu Ser Arg Thr Ser Val Tyr
915 920 925
Glu Arg Tyr His Asn Asp Pro Gln Cys Gln Gln Gln Ala Val Leu Leu
930 935 940
Ser Asp Glu Tyr Gly Phe Pro Leu Arg Gln Val Ser Val Asn Tyr Pro
945 950 955 960
Arg Arg Pro Pro Ser Ala Asp Asn Pro Tyr Pro Ala Ser Leu Pro Ala
965 970 975
Thr Leu Phe Ala Asn Ser Tyr Asp Glu Gln Gln Gln Ile Leu Arg Leu
980 985 990
Gly Leu Gln Gln Ser Ser Ala His His Leu Val Ser Leu Ser Glu Gly
995 1000 1005
His Trp Leu Leu Gly Leu Ala Glu Ala Ser Arg Asp Asp Val Phe
1010 1015 1020
Thr Tyr Ser Ala Asp Asn Val Pro Glu Gly Gly Leu Thr Leu Glu
1025 1030 1035
His Leu Leu Ala Pro Glu Ser Leu Val Ser Asp Ser Gln Val Gly
1040 1045 1050
Thr Leu Ala Gly Gln Gln Gln Val Trp Tyr Leu Asp Ser Gln Asp
1055 1060 1065
Val Ala Thr Val Ala Ala Pro Pro Leu Pro Pro Lys Val Ala Phe
1070 1075 1080
Ile Glu Thr Ala Val Leu Asp Glu Gly Met Val Ser Ser Leu Ala
1085 1090 1095
Ala Tyr Ile Val Asp Glu His Leu Glu Gln Ala Gly Tyr Arg Gln
1100 1105 1110
Ser Gly Tyr Leu Phe Pro Arg Gly Arg Glu Ala Glu Gln Ala Leu
1115 1120 1125
Trp Thr Gln Cys Gln Gly Tyr Val Thr Tyr Ala Gly Ala Glu His
1130 1135 1140
Phe Trp Leu Pro Leu Ser Phe Arg Asp Ser Met Leu Thr Gly Pro
1145 1150 1155
Val Thr Val Thr Arg Asp Ala Tyr Asp Cys Val Ile Thr Gln Trp
1160 1165 1170
Gln Asp Ala Ala Gly Ile Val Thr Thr Ala Asp Tyr Asp Trp Arg
1175 1180 1185
Phe Leu hr Pro Val Arg Val Thr Asp Pro Asn Asp Asn Leu Gln
1190 1195 1200
Ser Val Thr Leu Asp Ala Leu Gly Arg Val Thr Thr Leu Arg Phe
1205 1210 1215
Trp Gly Thr Glu Asn Gly Ile Ala Thr Gly Tyr Ser Asp Ala Thr
1220 1225 1230
Leu Ser Val Pro Asp Gly Ala Ala Ala Ala Leu Ala Leu Thr Ala
1235 1240 1245
Pro Leu Pro Val Ala Gln Cys Leu Val Tyr Val Thr Asp Ser Trp
1250 1255 1260
Gly Asp Asp Asp Asn Glu Lys Met Pro Pro His Val Val Val Leu
1265 1270 1275
Ala Thr Asp Arg Tyr Asp Ser Asp Thr Gly Gln Gln Val Arg Gln
1280 1285 1290
Gln Val Thr Phe Ser Asp Gly Phe Gly Arg Glu Leu Gln Ser Ala
1295 1300 1305
Thr Arg Gln Ala Glu Gly Asn Ala Trp Gln Arg Gly Arg Asp Gly
1310 1315 1320
Lys Leu Val Thr Ala Ser Asp Gly Leu Pro Val Thr Val Ala Thr
1325 1330 1335
Asn Phe Arg Trp Ala Val Thr Gly Arg Ala Glu Tyr Asp Asn Lys
1340 1345 1350
Gly Leu Pro Val Arg Val Tyr Gln Pro Tyr Phe Leu Asp Ser Trp
1355 1360 1365
Gln Tyr Val Ser Asp Asp Ser Ala Arg Gln Asp Leu Tyr Ala Asp
1370 1375 1380
Thr His Phe Tyr Asp Pro Thr Ala Arg Glu Trp Gln Val IIe Thr
1385 1390 1395
Ala Lys Gly Glu Arg Arg Gln Val Leu Tyr Thr Pro Trp Phe Val
1400 1405 1410
Val Ser Glu Asp Glu Asn Asp Thr Val Gly Leu Asn Asp Ala Ser
1415 1420 1425
<210>12
<211>1043
<212>PRT
<213>发光光杆状菌
<400>12
Met Ser Pro Ser Glu Thr Thr Leu Tyr Thr Gln Thr Pro Thr Val Ser
1 5 10 15
Val Leu Asp Asn Arg Gly Leu Ser Ile Arg Asp Ile Gly Phe His Arg
20 25 30
Ile Val Ile Gly Gly Asp Thr Asp Thr Arg Val Thr Arg His Gln Tyr
35 40 45
Asp Ala Arg Gly His Leu Asn Tyr Ser Ile Asp Pro Arg Leu Tyr Asp
50 55 60
Ala Lys Gln Ala Asp Asn Ser Val Lys Pro Asn Phe Val Trp Gln His
65 70 75 80
Asp Leu Ala Gly His Ala Leu Arg Thr Glu Ser Val Asp Ala Gly Arg
85 90 95
Thr Val Ala Leu Asn Asp Ile Glu Gly Arg Ser Val Met Thr Met Asn
100 105 110
Ala Thr Gly Val Arg Gln Thr Arg Arg Tyr Glu Gly Asn Thr Leu Pro
115 120 125
Gly Arg Leu Leu Ser Val Ser Glu Gln Val Phe Asn Gln Glu Ser Ala
130 135 140
Lys Val Thr Glu Arg Phe Ile Trp Ala Gly Asn Thr Thr Ser Glu Lys
145 150 155 160
Glu Tyr Asn Leu Ser Gly Leu Cys Ile Arg His Tyr Asp Thr Ala Gly
165 170 175
Val Thr Arg Leu Met Ser Gln Ser Leu Ala Gly Ala Met Leu Ser Gln
180 185 190
Ser His Gln Leu Leu Ala Glu Gly Gln Glu Ala Asn Trp Ser Gly Asp
195 200 205
Asp Glu Thr Val Trp Gln Gly Met Leu Ala Ser Glu Val Tyr Thr Thr
210 215 220
Gln Ser Thr Thr Asn Ala Ile Gly Ala Leu Leu Thr Gln Thr Asp Ala
225 230 235 240
Lys Gly Asn Ile Gln Arg Leu Ala Tyr Asp Ile Ala Gly Gln Leu Lys
245 250 255
Gly Ser Trp Leu Thr Val Lys Gly Gln Ser Glu Gln Val Ile Val Lys
260 265 270
Ser Leu Ser Trp Ser Ala Ala Gly His Lys Leu Arg Glu Glu His Gly
275 280 285
Asn Gly Val Val Thr Glu Tyr Ser Tyr Glu Pro Glu Thr Gln Arg Leu
290 295 300
Ile Gly Ile Thr Thr Arg Arg Ala Glu Gly Ser Gln Ser Gly Ala Arg
305 310 315 320
Val Leu Gln Asp Leu Arg Tyr Lys Tyr Asp Pro Val Gly Asn Val Ile
325 330 335
Ser Ile His Asn Asp Ala Glu Ala Thr Arg Phe Trp Arg Asn Gln Lys
340 345 350
Val Glu Pro Glu Asn Arg Tyr Val Tyr Asp Ser Leu Tyr Gln Leu Met
355 360 365
Ser Ala Thr Gly Arg Glu Met Ala Asn Ile Gly Gln Gln Ser Asn Gln
370 375 380
Leu Pro Ser Pro Val Ile Pro Val Pro Thr Asp Asp Ser Thr Tyr Thr
385 390 395 400
Asn Tyr Leu Arg Thr Tyr Thr Tyr Asp Arg Gly Gly Asn Leu Val Gln
405 410 415
Ile Arg His Ser Ser Pro Ala Thr Gln Asn Ser Tyr Thr Thr Asp Ile
420 425 430
Thr Val Ser Ser Arg Ser Asn Arg Ala Val Leu Ser Thr Leu Thr Thr
435 440 445
Asp Pro Thr Arg Val Asp Ala Leu Phe Asp Ser Gly Gly His Gln Lys
450 455 460
Met Leu Ile Pro Gly Gln Asn Leu Asp Trp Asn Ile Arg Gly Glu Leu
465 470 475 480
Gln Arg Val Thr Pro Val Ser Arg Glu Asn Ser Ser Asp Ser Glu Trp
485 490 495
Tyr Arg Tyr Ser Ser Asp Gly Met Arg Leu Leu Lys Val Ser Glu Gln
500 505 510
Gln Thr Gly Asn Ser Thr Gln Val Gln Arg Val Thr Tyr Leu Pro Gly
515 520 525
Leu Glu Leu Arg Thr Thr Gly Val Ala Asp Lys Thr Thr Glu Asp Leu
530 535 540
Gln Val Ile Thr Val Gly Glu Ala Gly Arg Ala Gln Val Arg Val Leu
545 550 555 560
His Trp Glu Ser Gly Lys Pro Thr Asp Ile Asp Asn Asn Gln Val Arg
565 570 575
Tyr Ser Tyr Asp Asn Leu Leu Gly Ser Ser Gln Leu Glu Leu Asp Ser
580 585 590
Glu Gly Gln Ile Leu Ser Gln Glu Glu Tyr Tyr Pro Tyr Gly Gly Thr
595 600 605
Ala Ile Trp Ala Ala Arg Asn Gln Thr Glu Ala Ser Tyr Lys Phe Ile
610 615 620
Arg Tyr Ser Gly Lys Glu Arg Asp Ala Thr Gly Leu Tyr Tyr Tyr Gly
625 630 635 640
Tyr Arg Tyr Tyr Gln Pro Trp Val Gly Arg Trp Leu Ser Ala Asp Pro
645 650 655
Ala Gly Thr Val Asp Gly Leu Asn Leu Tyr Arg Met Val Arg Asn Asn
660 665 670
Pro Ile Thr Leu Thr Asp His Asp Gly Leu Ala Pro Ser Pro Asn Arg
675 680 685
Asn Arg Asn Thr Phe Trp Phe Ala Ser Phe Leu Phe Arg Lys Pro Asp
690 695 700
Glu Gly Met Ser Ala Ser Met Arg Arg Gly Gln Lys Ile Gly Arg Ala
705 710 715 720
Ile Ala Gly Gly Ile Ala Ile Gly Gly Leu Ala Ala Thr Ile Ala Ala
725 730 735
Thr Ala Gly Ala Ala Ile Pro Val Ile Leu Gly Val Ala Ala Val Gly
740 745 750
Ala Gly Ile Gly Ala Leu Met Gly Tyr Asn Val Gly Ser Leu Leu Glu
755 760 765
Lys Gly Gly Ala Leu Leu Ala Arg Leu Val Gln Gly Lys Ser Thr Leu
770 775 780
Val Gln Ser Ala Ala Gly Ala Ala Ala Gly Ala Ser Ser Ala Ala Ala
785 790 795 800
Tyr Gly Ala Arg Ala Gln Gly Val Gly Val Ala Ser Ala Ala Gly Ala
805 810 815
Val Thr Gly Ala Val Gly Ser Trp Ile Asn Asn Ala Asp Arg Gly Ile
820 825 830
Gly Gly Ala Ile Gly Ala Gly Ser Ala Val Gly Thr Ile Asp Thr Met
835 840 845
Leu Gly Thr Ala Ser Thr Leu Thr His Glu Val Gly Ala Ala Ala Gly
850 855 860
Gly Ala Ala Gly Gly Met Ile Thr Gly Thr Gln Gly Ser Thr Arg Ala
865 870 875 880
Gly Ile His Ala Gly Ile Gly Thr Tyr Tyr Gly Ser Trp Ile Gly Phe
885 890 895
Gly Leu Asp Val Ala Ser Asn Pro Ala Gly His Leu Ala Asn Tyr Ala
900 905 910
Val Gly Tyr Ala Ala Gly Leu Gly Ala Glu Met Ala Val Asn Arg Ile
915 920 925
Met Gly Gly Gly Phe Leu Ser Arg Leu Leu Gly Arg Val Val Ser Pro
930 935 940
Tyr Ala Ala Gly Leu Ala Arg Gln Leu Val His Phe Ser Val Ala Arg
945 950 955 960
Pro Val Phe Glu Pro Ile Phe Ser Val Leu Gly Gly Leu Val Gly Gly
965 970 975
Ile Gly Thr Gly Leu His Arg Val Met Gly Arg Glu Ser Trp Ile Ser
980 985 990
Arg Ala Leu Ser Ala Ala Gly Ser Gly Ile Asp His Val Ala Gly Met
995 1000 1005
Ile Gly Asn Gln Ile Arg Gly Arg Val Leu Thr Thr Thr Gly Ile
1010 1015 1020
Ala Asn Ala Ile Asp Tyr Gly Thr Ser Ala Val Gly Ala Ala Arg
1025 1030 1035
Arg Val Phe Ser Leu
1040
<210>13
<211>915
<212>PRT
<213>发光光杆状菌
<400>13
Met Ser Ser Tyr Ash Ser Ala Ile Asp Gln Lys Thr Pro Ser Ile Lys
1 5 10 15
Val Leu Asp Asn Arg Lys Leu Asn Val Arg Thr Leu Glu Tyr Leu Arg
20 25 30
Thr Gln Ala Asp Glu Asn Ser Asp Glu Leu Ile Thr Phe Tyr Glu Phe
35 40 45
Asn Ile Pro Gly Phe Gln Val Lys Ser Thr Asp Pro Arg Lys Asn Lys
50 55 60
Asn Gln Ser Gly Pro Asn Phe Ile Arg Val Phe Asn Leu Ala Gly Gln
65 70 75 80
Val Leu Arg Glu Glu Ser Val Asp Ala Gly Arg Thr Ile Thr Leu Asn
85 90 95
Asp Ile Glu Ser Arg Pro Val Leu Ile Ile Asn Ala Thr Gly Val Arg
100 105 110
Gln Asn His Arg Tyr Glu Asp Asn Thr Leu Pro Gly Arg Leu Leu Ala
115 120 125
Ile Thr Glu Gln Val Gln Ala Gly Glu Lys Thr Thr Glu Arg Leu Ile
130 135 140
Trp Ala Gly Asn Thr Pro Gln Glu Lys Asp Tyr Asn Leu Ala Gly Gln
145 150 155 160
Cys Val Arg His Tyr Asp Thr Ala Gly Leu Thr Gln Leu Asn Ser Leu
165 170 175
Ser Leu Ala Gly Val Val Leu Ser Gln Ser Gln Gln Leu Leu Thr Asp
180 185 190
Asn Gln Asp Ala Asp Trp Thr Gly Glu Asp Gln Ser Leu Trp Gln Gln
195 200 205
Lys Leu Ser Ser Asp Val Tyr Ile Thr Gln Ser Asn Thr Asp Ala Thr
210 215 220
Gly Ala Leu Leu Thr Gln Thr Asp Ala Lys Gly Asn Ile Gln Arg Leu
225 230 235 240
Ala Tyr Asp Val Ala Gly Gln Leu Lys Gly Ser Trp Leu Thr Leu Lys
245 250 255
Gly Gln Ala Glu Gln Val Ile Ile Lys Ser Leu Thr Tyr Ser Ala Ala
260 265 270
Gly Gln Lys Leu Arg Glu Glu His Gly Asn Gly Ile Val Thr Glu Tyr
275 280 285
Ser Tyr Glu Pro Glu Thr Gln Arg Leu Ile Gly Ile Thr Thr Arg Arg
290 295 300
Pro Ser Asp Ala Lys Val Leu Gln Asp Leu Arg Tyr Gln Tyr Asp Pro
305 310 315 320
Val Gly Asn Val Ile Asn Ile Arg Asn Asp Ala Glu Ala Thr Arg Phe
325 330 335
Trp Arg Asn Gln Lys Val Ala Pro Glu Asn Ser Tyr Thr Tyr Asp Ser
340 345 350
Leu Tyr Gln Leu Ile Ser Ala Thr Gly Arg Glu Met Ala Asn Ile Gly
355 360 365
Gln Gln Asn Asn Gln Leu Pro Ser Pro Ala Leu Pro Ser Asp Asn Asn
370 375 380
Thr Tyr Thr Asn Tyr Thr Arg Ser Tyr Ser Tyr Asp His Ser Gly Asn
385 390 395 400
Leu Thr Gln Ile Arg His Ser Ser Pro Ala Thr Gln Asn Asn Tyr Thr
405 410 415
Val Ala Ile Thr Leu Ser Asn Arg Ser Asn Arg Gly Val Leu Ser Thr
420 425 430
Leu Thr Thr Asp Pro Asn Gln Val Asp Thr Leu Phe Asp Ala Gly Gly
435 440 445
His Gln Thr Ser Leu Leu Pro Gly Gln Thr Leu Ile Trp Thr Pro Arg
450 455 460
Gly Glu Leu Lys Gln Val Asn Asn Gly Pro Gly Asn Glu Trp Tyr Arg
465 470 475 480
Tyr Asp Ser Asn Gly Met Arg Gln Leu Lys Val Ser Glu Gln Pro Thr
485 490 495
Gln Asn Thr Thr Gln Gln Gln Arg Val Ile Tyr Leu Pro Gly Leu Glu
500 505 510
Leu Arg Thr Thr Gln Ser Asn Ala Thr Thr Thr Glu Glu Leu His Val
515 520 525
Ile Thr Leu Gly Glu Ala Gly Arg Ala Gln Val Arg Val Leu His Trp
530 535 540
Glu Ser Gly Lys Pro Glu Asp Val Asn Asn Asn Gln Leu Arg Tyr Ser
545 550 555 560
Tyr Asp Asn Leu Ile Gly Ser Ser Gln Leu Glu Leu Asp Asn Gln Gly
565 570 575
Gln Ile Ile Ser Glu Glu Glu Tyr Tyr Pro Phe Gly Gly Thr Ala Leu
580 585 590
Trp Ala Ala Asn Ser Gln Thr Glu Ala Ser Tyr Lys Thr Ile Arg Tyr
595 600 605
Ser Gly Lys Glu Arg Asp Ala Thr Gly Leu Tyr Tyr Tyr Gly Tyr Arg
610 615 620
Tyr Tyr Gln Pro Trp Ala Gly Arg Trp Leu Ser Ala Asp Pro Ala Gly
625 630 635 640
Thr Ile Asp Gly Leu Asn Leu Tyr Arg Met Val Arg Asn Asn Pro Val
645 650 655
Ser Leu Gln Asp Glu Asn Gly Leu Ala Pro Glu Lys Gly Lys Tyr Thr
660 665 670
Lys Glu Val Asn Phe Phe Asp Glu Leu Lys Phe Lys Leu Ala Ala Lys
675 680 685
Ser Ser His Val Val Lys Trp Asn Glu Lys Glu Ser Ser Tyr Thr Lys
690 695 700
Asn Lys Ser Leu Lys Val Val Arg Val Gly Asp Ser Asp Pro Ser Gly
705 710 715 720
Tyr Leu Leu Ser His Glu Glu Leu Leu Lys Gly Ile Glu Lys Ser Gln
725 730 735
Ile Ile Tyr Ser Arg Leu Glu Glu Asn Ser Ser Leu Ser Glu Lys Ser
740 745 750
Lys Thr Asn Leu Ser Leu Gly Ser Glu Ile Ser Gly Tyr Met Ala Arg
755 760 765
Thr Ile Gln Asp Thr Ile Ser Glu Tyr Ala Glu Glu His Lys Tyr Arg
770 775 780
Ser Asn His Pro Asp Phe Tyr Ser Glu Thr Asp Phe Phe Ala Leu Met
785 790 795 800
Asp Lys Ser Glu Lys Asn Asp Tyr Ser Gly Glu Arg Lys Ile Tyr Ala
805 810 815
Ala Met Glu Val Lys Val Tyr His Asp Leu Lys Asn Lys Gln Ser Glu
820 825 830
Leu His Val Asn Tyr Ala Leu Ala His Pro Tyr Thr Gln Leu Ser Asn
835 840 845
Glu Glu Arg Ala Leu Leu Gln Glu Thr Glu Pro Ala Ile Ala Ile Asp
850 855 860
Arg Glu Tyr Asn Phe Lys Gly Val Gly Lys Phe Leu Thr Met Lys Ala
865 870 875 880
Ile Lys Lys Ser Leu Lys Gly His Lys Ile Asn Arg Ile Ser Thr Glu
885 890 895
Ala Ile Asn Ile Arg Ser Ala Ala Ile Ala Glu Asn Leu Gly Met Arg
900 905 910
Arg Thr Ser
915
<210>14
<211>960
<212>PRT
<213>发光光杆状菌
<400>14
Met Lys Asn Ile Asp Pro Lys Leu Tyr Gln Lys Thr Pro Thr Val Ser
1 5 10 15
Val Tyr Asp Asn Arg Gly Leu Ile Ile Arg Asn Ile Asp Phe His Arg
20 25 30
Thr Thr Ala Asn Gly Asp Pro Asp Thr Arg Ile Thr Arg His Gln Tyr
35 40 45
Asp Ile His Gly His Leu Asn Gln Ser Ile Asp Pro Arg Leu Tyr Glu
50 55 60
Ala Lys Gln Thr Asn Asn Thr Ile Lys Pro Asn Phe Leu Trp Gln Tyr
65 70 75 80
Asp Leu Thr Gly Asn Pro Leu Cys Thr Glu Ser Ile Asp Ala Gly Arg
85 90 95
Thr Val Thr Leu Asn Asp Ile Glu Gly Arg Pro Leu Leu Thr Val Thr
100 105 110
Ala Thr Gly Val Ile Gln Thr Arg Gln Tyr Glu Thr Ser Ser Leu Pro
115 120 125
Gly Arg Leu Leu Ser Val Ala Glu Gln Thr Pro Glu Glu Lys Thr Ser
130 135 140
Arg Ile Thr Glu Arg Leu Ile Trp Ala Gly Asn Thr Glu Ala Glu Lys
145 150 155 160
Asp His Asn Leu Ala Gly Gln Cys Val Arg His Tyr Asp Thr Ala Gly
165 170 175
Val Thr Arg Leu Glu Ser Leu Ser Leu Thr Gly Thr Val Leu Ser Gln
180 185 190
Ser Ser Gln Leu Leu Ile Asp Thr Gln Glu Ala Asn Trp Thr Gly Asp
195 200 205
Asn Glu Thr Val Trp Gln Asn Met Leu Ala Asp Asp Ile Tyr Thr Thr
210 215 220
Leu Ser Thr Phe Asp Ala Thr Gly Ala Leu Leu Thr Gln Thr Asp Ala
225 230 235 240
Lys Gly Asn Ile Gln Arg Leu Ala Tyr Asp Val Ala Gly Gln Leu Asn
245 250 255
Gly Ser Trp Leu Thr Leu Lys Gly Gln Thr Glu Gln Val Ile Ile Lys
260 265 270
Ser Leu Thr Tyr Ser Ala Ala Gly Gln Lys Leu Arg Glu Glu His Gly
275 280 285
Asn Asp Val Ile Thr Glu Tyr Ser Tyr Glu Pro Glu Thr Gln Arg Leu
290 295 300
Ile Gly Ile Lys Thr Arg Arg Pro Ser Asp Thr Lys Val Leu Gln Asp
305 310 315 320
Leu Arg Tyr Glu Tyr Asp Pro Val Gly Asn Val Ile Ser Ile Arg Asn
325 330 335
Asp Ala Glu Ala Thr Arg Phe Trp His Asn Gln Lys Val Met Pro Glu
340 345 350
Asn Thr Tyr Thr Tyr Asp Ser Leu Tyr Gln Leu Ile Ser Ala Thr Gly
355 360 365
Arg Glu Met Ala Asn Ile Gly Gln Gln Ser His Gln Phe Pro Ser Pro
370 375 380
Ala Leu Pro Ser Asp Asn Asn Thr Tyr Thr Asn Tyr Thr Arg Thr Tyr
385 390 395 400
Thr Tyr Asp Arg Gly Gly Asn Leu Thr Lys Ile Gln His Ser Ser Pro
405 410 415
Ala Thr Gln Asn Asn Tyr Thr Thr Asn Ile Thr Val Ser Asn Arg Ser
420 425 430
Asn Arg Ala Val Leu Ser Thr Leu Thr Glu Asp Pro Ala Gln Val Asp
435 440 445
Ala Leu Phe Asp Ala Gly Gly His Gln Asn Thr Leu Ile Ser Gly Gln
450 455 460
Asn Leu Asn Trp Asn Thr Arg Gly Glu Leu Gln Gln Val Thr Leu Val
465 470 475 480
Lys Arg Asp Lys Gly Ala Asn Asp Asp Arg Glu Trp Tyr Arg Tyr Ser
485 490 495
Gly Asp Gly Arg Arg Met Leu Lys Ile Asn Glu Gln Gln Ala Ser Asn
500 505 510
Asn Ala Gln Thr Gln Arg Val Thr Tyr Leu Pro Asn Leu Glu Leu Arg
515 520 525
Leu Thr Gln Asn Ser Thr Ala Thr Thr Glu Asp Leu Gln Val Ile Thr
530 535 540
Val Gly Glu Ala Gly Arg Ala Gln Val Arg Val Leu His Trp Glu Ser
545 550 555 560
Gly Lys Pro Glu Asp Ile Asp Asn Asn Gln Leu Arg Tyr Ser Tyr Asp
565 570 575
Asn Leu Ile Gly Ser Ser Gln Leu Glu Leu Asp Ser Glu Gly Gln Ile
580 585 590
Ile Ser Glu Glu Glu Tyr Tyr Pro Tyr Gly Gly Thr Ala Leu Trp Ala
595 600 605
Ala Arg Asn Gln Thr Glu Ala Ser Tyr Lys Thr Ile Arg Tyr Ser Gly
610 615 620
Lys Glu Arg Asp Ala Thr Gly Leu Tyr Tyr Tyr Gly Tyr Arg Tyr Tyr
625 630 635 640
Gln Pro Trp Ile Gly Arg Trp Leu Ser Ser Asp Pro Ala Gly Thr Ile
645 650 655
Asp Gly Leu Asn Leu Tyr Arg Met Val Arg Asn Asn Pro Val Thr Leu
660 665 670
Leu Asp Pro Asp Gly Leu Met Pro Thr Ile Ala Glu Arg Ile Ala Ala
675 680 685
Leu Lys Lys Asn Lys Val Thr Asp Ser Ala Pro Ser Pro Ala Asn Ala
690 695 700
Thr Asn Val Ala Ile Asn Ile Arg Pro Pro Val Ala Pro Lys Pro Ser
705 710 715 720
Leu Pro Lys Ala Ser Thr Ser Ser Gln Pro Thr Thr His Pro Ile Gly
725 730 735
Ala Ala Asn Ile Lys Pro Thr Thr Ser Gly Ser Ser Ile Val Ala Pro
740 745 750
Leu Ser Pro Val Gly Asn Lys Ser Thr Ser Glu Ile Ser Leu Pro Glu
755 760 765
Ser Ala Gln Ser Ser Ser Ser Ser Thr Thr Ser Thr Asn Leu Gln Lys
770 775 780
Lys Ser Phe Thr Leu Tyr Arg Ala Asp Asn Arg Ser Phe Glu Glu Met
785 790 795 800
Gln Ser Lys Phe Pro Glu Gly Phe Lys Ala Trp Thr Pro Leu Asp Thr
805 810 815
Lys Met Ala Arg Gln Phe Ala Ser Ile Phe Ile Gly Gln Lys Asp Thr
820 825 830
Ser Asn Leu Pro Lys Glu Thr Val Lys Asn Ile Ser Thr Trp Gly Ala
835 840 845
Lys Pro Lys Leu Lys Asp Leu Ser Asn Tyr Ile Lys Tyr Thr Lys Asp
850 855 860
Lys Ser Thr Val Trp Val Ser Thr Ala Ile Asn Thr Glu Ala Gly Gly
865 870 875 880
Gln Ser Ser Gly Ala Pro Leu His Lys Ile Asp Met Asp Leu Tyr Glu
885 890 895
Phe Ala Ile Asp Gly Gln Lys Leu Asn Pro Leu Pro Glu Gly Arg Thr
900 905 910
Lys Asn Met Val Pro Ser Leu Leu Leu Asp Thr Pro Gln Ile Glu Thr
915 920 925
Ser Ser Ile Ile Ala Leu Asn His Gly Pro Val Asn Asp Ala Glu Ile
930 935 940
Ser Phe Leu Thr Thr Ile Pro Leu Lys Asn Val Lys Pro His Lys Arg
945 950 955 960
<210>15
<211>949
<212>PRT
<213>发光光杆状菌
<400>15
Met Lys Asn Ile Asp Pro Lys Leu Tyr Gln His Thr Pro Thr Val Asn
1 5 10 15
Val Tyr Asp Asn Arg Gly Leu Thr Ile Arg Asn Ile Asp Phe His Arg
20 25 30
Asp Val Ala Gly Gly Asp Thr Asp Thr Arg Ile Thr Arg His Gln Tyr
35 40 45
Asp Thr Arg Gly His Leu Ser Gln Ser Ile Asp Pro Arg Leu Tyr Asp
50 55 60
Ala Lys Gln Thr Asn Asn Ser Thr Asn Pro Asn Phe Leu Trp Gln Tyr
65 70 75 80
Asn Leu Thr Gly Asp Thr Leu Arg Thr Glu Ser Val Asp Ala Gly Arg
85 90 95
Thr Val Ala Leu Asn Asp Ile Glu Gly Arg Gln Val Leu Ile Val Thr
100 105 110
Ala Thr Gly Ala Ile Gln Thr Arg Gln Tyr Glu Ala Asn Thr Leu Pro
115 120 125
Gly Arg Leu Leu Ser Val Ser Glu Gln Ala Pro Gly Glu Gln Thr Pro
130 135 140
Arg Val Thr Glu His Phe Ile Trp Ala Gly Asn Thr Gln Ala Glu Lys
145 150 155 160
Asp His Asn Leu Ala Gly Gln Tyr Val Arg His Tyr Asp Thr Ala Gly
165 170 175
Val Thr Gln Leu Glu Ser Leu Ser Leu Thr Glu Asn Ile Leu Ser Gln
180 185 190
Ser Arg Gln Leu Leu Ala Asp Gly Gln Glu Ala Asp Trp Thr Gly Asn
195 200 205
Asp Glu Thr Leu Trp Gln Thr Lys Leu Asn Ser Glu Thr Tyr Thr Thr
210 215 220
Gln Ser Thr Phe Asp Ala Thr Gly Ala Leu Leu Thr Gln Thr Asp Ala
225 230 235 240
Lys Gly Asn Met Gln Arg Leu Ala Tyr Asn Val Ala Gly Gln Leu Gln
245 250 255
Gly Ser Trp Leu Thr Leu Lys Asn Gln Ser Glu Gln Val Ile Val Lys
260 265 270
Ser Leu Thr Tyr Ser Ala Ala Gly Gln Lys Leu Arg Glu Glu His Gly
275 280 285
Asn Gly Val Ile Thr Glu Tyr Ser Tyr Glu Pro Glu Thr Leu Arg Leu
290 295 300
Ile Gly Thr Thr Thr Arg Arg Gln Ser Asp Ser Lys Val Leu Gln Asp
305 310 315 320
Leu Arg Tyr Glu His Asp Pro Val Gly Asn Ile Ile Ser Val Arg Asn
325 330 335
Asp Ala Glu Ala Thr Arg Phe Trp Arg Asn Gln Lys Ile Val Pro Glu
340 345 350
Asn Thr Tyr Thr Tyr Asp Ser Leu Tyr Gln Leu Ile Ser Ala Thr Gly
355 360 365
Arg Glu Met Ala Asn Ile Gly Gln Gln Ser Asn Gln Leu Pro Ser Pro
370 375 380
Ile Ile Pro Leu Pro Thr Asp Glu Asn Ser Tyr Thr Asn Tyr Thr Arg
385 390 395 400
Ser Tyr Asn Tyr Asp Arg Gly Gly Asn Leu Val Gln Ile Arg His Ser
405 410 415
Ser Pro Ala Ala Gln Asn Asn Tyr Thr Thr Asp Ile Thr Val Ser Asn
420 425 430
Arg Ser Asn Arg Ala Val Leu Ser Ser Leu Thr Ser Asp Pro Thr Gln
435 440 445
Val Glu Ala Leu Phe Asp Ala Gly Gly His Gln Thr Lys Leu Leu Pro
450 455 460
Gly Gln Glu Leu Ser Trp Asn Thr Arg Gly Glu Leu Lys Gln Val Thr
465 470 475 480
Pro Val Ser Arg Glu Ser Ala Ser Asp Arg Glu Trp Tyr Arg Tyr Gly
485 490 495
Asn Asp Gly Met Arg Arg Leu Lys Val Ser Glu Gln Gln Thr Gly Asn
500 505 510
Ser Thr Gln Gln Gln Arg Val Thr Tyr Leu Pro Asp Leu Glu Leu Arg
515 520 525
Thr Thr Gln Asn Gly Thr Thr Thr Ser Glu Asp Leu His Ala Ile Thr
530 535 540
Val Gly Ala Ala Gly His Ala Gln Val Arg Val Leu His Trp Glu Thr
545 550 555 560
Thr Pro Pro Ala Gly Ile Asn Asn Asn Gln Leu Arg Tyr Ser Tyr Asp
565 570 575
Asn Leu Ile Gly Ser Ser Gln Leu Glu Leu Asp Asn Ala Gly Gln Ile
580 585 590
Ile Ser Gln Glu Glu Tyr Tyr Pro Phe Gly Gly Thr Ala Leu Trp Ala
595 600 605
Ala Arg Asn Gln Ile Glu Ala Ser Tyr Lys Ile Leu Arg Tyr Ser Gly
610 615 620
Lys Glu Arg Asp Ala Thr Gly Leu Tyr Tyr Tyr Gly Tyr Arg Tyr Tyr
625 630 635 640
Gln Pro Trp Val Gly Arg Trp Leu Ser Ala Asp Pro Ala Gly Thr Ile
645 650 655
Asp Gly Leu Asn Leu Tyr Arg Met Val Arg Asn Asn Pro Ser Thr Leu
660 665 670
Val Asp Ile Ser Gly Leu Ala Pro Thr Lys Tyr Asn Ile Pro Gly Phe
675 680 685
Asp Phe Asp Val Glu Ile Asp Glu Gln Lys Arg Ser Lys Leu Lys Pro
690 695 700
Thr Leu Ile Arg Ile Lys Asp Glu Phe Leu His Tyr Gly Pro Val Asp
705 710 715 720
Lys Leu Leu Glu Glu Lys Lys Pro Gly Leu Asn Val Pro Glu Glu Leu
725 730 735
Phe Asp Arg Gly Pro Ser Glu Asn Gly Val Ser Thr Leu Thr Phe Lys
740 745 750
Lys Asp Leu Pro Ile Ser Cys Ile Ser Asn Thr Glu Tyr Thr Leu Asp
755 760 765
Ile Leu Tyr Asn Lys His Glu Thr Lys Pro Phe Pro Tyr Glu Asn Glu
770 775 780
Ala Thr Val Gly Ala Asp Leu Gly ValIle Met Ser Val Glu Phe Gly
785 790 795 800
Asn Lys Ser Ile Gly Asn Ala Ser Asp Glu Asp Leu Lys Glu Glu His
805 810 815
Leu Pro Leu Gly Lys Ser Thr Met Asp Lys Thr Asp Leu Pro Asp Leu
820 825 830
Lys Gln Gly Leu Met Ile Ala Glu Lys Ile Lys Ser Gly Lys Gly Ala
835 840 845
Tyr Pro Phe His Phe Gly Ala Ala Ile Ala Val Val Tyr Gly Glu Asp
850 855 860
Lys Lys Val Ala Ala Ser Ile Leu Thr Asp Leu Ser Glu Pro Lys Arg
865 870 875 880
Asp Glu Gly Glu Tyr Leu Gln Ser Thr Arg Lys Val Ser Ala Met Phe
885 890 895
Ile Thr Asn Val Asn Glu Phe Arg Gly His Asp Tyr Pro Lys Ser Lys
900 905 910
Tyr Ser Ile Gly Leu Val Thr Ala Glu Lys Arg Gln Pro Val Ile Ser
915 920 925
Lys Lys Arg Ala Asn Pro Glu Glu Ala Pro Ser Ser Ser Arg Asn Lys
930 935 940
Lys Leu His Val His
945
<210>16
<211>938
<212>PRT
<213>发光光杆状菌菌株W14
<400>16
Met Glu Asn Ile Asp Pro Lys Leu Tyr His His Thr Pro Thr Val Ser
1 5 10 15
Val His Asp Asn Arg Gly Leu Ala Ile Arg Asn Ile Ser Phe His Arg
20 25 30
Thr Thr Ala Glu Ala Asn Thr Asp Thr Arg Ile Thr Arg His Gln Tyr
35 40 45
Asn Ala Gly Gly Tyr Leu Asn Gln Ser Ile Asp Pro Arg Leu Tyr Asp
50 55 60
Ala Lys Gln Thr Asn Asn Ala Val Gln Pro Asn Phe Ile Trp Arg His
65 70 75 80
Asn Leu Thr Gly Asn Ile Leu Arg Thr Glu Ser Val Asp Ala Gly Arg
85 90 95
Thr Ile Thr Leu Asn Asp Ile Glu Gly Arg Pro Val Leu Thr Ile Asn
100 105 110
Ala Ala Gly Val Arg Gln Asn His Arg Tyr Glu Asp Asn Thr Leu Pro
115 120 125
Gly Arg Leu Leu Ala Ile Ser Glu Gln Gly Gln Ala Glu Glu Lys Thr
130 135 140
Thr Glu Arg Leu Ile Trp Ala Gly Asn Thr Pro Gln Glu Lys Asp His
145 150 155 160
Asn Leu Ala Gly Gln Cys Val Arg His Tyr Asp Thr Ala Gly Leu Thr
165 170 175
Gln Leu Asn Ser Leu Ala Leu Thr Gly Ala Val Leu Ser Gln Ser Gln
180 185 190
Gln Leu Leu Thr Asp Asn Gln Asp Ala Asp Trp Thr Gly Glu Asp Gln
195 200 205
Ser Leu Trp Gln Gln Lys Leu Ser Ser Asp Val Tyr Ile Thr Gln Ser
210 215 220
Asn Thr Asp Ala Thr Gly Ala Leu Leu Thr Gln Thr Asp Ala Lys Gly
225 230 235 240
Asn Ile Gln Arg Leu Ala Tyr Asp Val Ala Gly Gln Leu Lys Gly Ser
245 250 255
Trp Leu Thr Leu Lys Gly Gln Ala Glu Gln Val Ile Ile Lys Ser Leu
260 265 270
Thr Tyr Ser Ala Ala Gly Gln Lys Leu Arg Glu Glu His Gly Asn Gly
275 280 285
Ile Val Thr Glu Tyr Ser Tyr Glu Pro Glu Thr Gln Arg Leu Ile Gly
290 295 300
Ile Thr Thr Arg Arg Pro Ser Asp Ala Lys Val Leu Gln Asp Leu Arg
305 310 315 320
Tyr Gln Tyr Asp Pro Val Gly Asn Val Ile Ser Ile Arg Asn Asp Ala
325 330 335
Glu Ala Thr Arg Phe Trp Arg Asn Gln Lys Val Ala Pro Glu Asn Ser
340 345 350
Tyr Thr Tyr Asp Ser Leu Tyr Gln Leu Ile Ser Ala Thr Gly Arg Glu
355 360 365
Met Ala Asn Ile Gly Gln Gln Ser Asn Gln Leu Pro Ser Pro Ala Leu
370 375 380
Pro Ser Asp Asn Asn Thr Tyr Thr Asn Tyr Thr Arg Thr Tyr Thr Tyr
385 390 395 400
Asp Arg Gly Gly Asn Leu Thr Lys Ile Gln His Ser Ser Pro Ala Ala
405 410 415
Gln Asn Asn Tyr Thr Thr Asp Ile Thr Val Ser Asn Arg Ser Asn Arg
420 425 430
Ala Val Leu Ser Thr Leu Thr Ala Asp Pro Thr Gln Val Asp Ala Leu
435 440 445
Phe Asp Ala Gly Gly His Gln Thr Ser Leu Leu Ser Gly Gln Val Leu
450 455 460
Thr Trp Thr Pro Arg Gly Glu Leu Lys Gln Ala Asn Asn Ser Ala Gly
465 470 475 480
Asn Glu Trp Tyr Arg Tyr Asp Ser Asn Gly Ile Arg Gln Leu Lys Val
485 490 495
Asn Glu Gln Gln Thr Gln Asn Ile Pro Gln Gln Gln Arg Val Thr Tyr
500 505 510
Leu Pro Gly Leu Glu Ile Arg Thr Thr Gln Asn Asn Ala Thr Thr Thr
515 520 525
Glu Glu Leu His Val Ile Thr Leu Gly Lys Ala Gly Arg Ala Gln Val
530 535 540
Arg Val Leu His Trp Glu Ser Gly Lys Pro Glu Asp Ile Asn Asn Asn
545 550 555 560
Gln Leu Arg Tyr Ser Tyr Asp Asn Leu Ile Gly Ser Ser Gln Leu Gln
565 570 575
Leu Asp Ser Asp Gly Gln Ile Ile Ser Glu Glu Glu Tyr Tyr Pro Phe
580 585 590
Gly Gly Thr Ala Leu Trp Ala Ala Arg Asn Gln Thr Glu Ala Ser Tyr
595 600 605
Lys Thr Ile Arg Tyr Ser Gly Lys Glu Arg Asp Val Thr Gly Leu Tyr
610 615 620
Tyr Tyr Gly Tyr Arg Tyr Tyr Gln Pro Trp Ala Gly Arg Trp Leu Gly
625 630 635 640
Ala Asp Pro Ala Gly Thr Ile Asp Gly Leu Asn Leu Tyr Arg Met Val
645 650 655
Arg Asn Asn Pro Val Thr Gln Phe Asp Val Gln Gly Leu Ser Pro Ala
660 665 670
Asn Arg Thr Glu Glu Ala Ile Ile Lys Gln Gly Ser Phe Thr Gly Met
675 680 685
Glu Glu Ala Val Tyr Lys Lys Met Ala Lys Pro Gln Thr Phe Lys Arg
690 695 700
Gln Arg Ala Ile Ala Ala Gln Thr Glu Gln Glu Ala His Glu Ser Leu
705 710 715 720
Thr Asn Asn Pro Ser Val Asp Ile Ser Pro Ile Lys Asn Tyr Thr Thr
725 730 735
Asp Ser Ser Gln Ile Asn Ala Ala Ile Arg Glu Asn Arg Ile Thr Pro
740 745 750
Ala Val Glu Ser Leu Asp Ala Thr Leu Ser Ser Leu Gln Asp Arg Gln
755 760 765
Met Arg Val Thr Tyr Arg Val Met Thr Tyr Val Asp Asn Ser Thr Pro
770 775 780
Ser Pro Trp His Ser Pro Gln Glu Gly Asn Ser Ile Asn Val Gly Asp
785 790 795 800
Ile Val Ser Asp Asn Ala Tyr Leu Ser Thr Ser Ala His Arg Gly Phe
805 810 815
Leu Asn Phe Val His Lys Lys Glu Thr Ser Glu Thr Arg Tyr Val Lys
820 825 830
Met Ala Phe Leu Thr Asn Ala Gly Val Asn Val Pro Ala Ala Ser Met
835 840 845
Tyr Asn Asn Ala Gly Glu Glu Gln Val Phe Lys Met Asp Leu Asn Asp
850 855 860
Ser Arg Lys Ser Leu Ala Glu Lys Leu Lys Leu Arg Val Ser Gly Pro
865 870 875 880
Gln Ser Gly Gln Ala Glu Ile Leu Leu Pro Arg Glu Thr Gln Phe Glu
885 890 895
Val Val Ser Met Lys His Gln Gly Arg Asp Thr Tyr Val Leu Leu Gln
900 905 910
Asp Ile Asn Gln Ser Ala Ala Thr His Arg Asn Val Arg Asn Thr Tyr
915 920 925
Thr Gly Asn Phe Lys Ser Ser Ser Ala Asn
930 935
<210>17
<211>1016
<212>PRT
<213>嗜线虫致病杆菌
<400>17
Met Lys Asn Phe Val His Ser Asn Thr Pro Ser Val Thr Val Leu Asp
1 5 10 15
Asn Arg Gly Gln Thr Val Arg Glu Ile Ala Trp Tyr Arg His Pro Asp
20 25 30
Thr Pro Gln Val Thr Asp Glu Arg Ile Thr Gly Tyr Gln Tyr Asp Ala
35 40 45
Gln Gly Ser Leu Thr Gln Ser Ile Asp Pro Arg Phe Tyr Glu Arg Gln
50 55 60
Gln Thr Ala Ser Asp Lys Asn Ala Ile Thr Pro Asn Leu Ile Leu Leu
65 70 75 80
Ser Ser Leu Ser Lys Lys Ala Leu Arg Thr Gln Ser Val Asp Ala Gly
85 90 95
Thr Arg Val Ala Leu His Asp Val Ala Gly Arg Pro Val Leu Ala Val
100 105 110
Ser Ala Asn Gly Val Ser Arg Thr Phe Gln Tyr Glu Ser Asp Asn Leu
115 120 125
Pro Gly Arg Leu Leu Thr Ile Thr Glu Gln Val Lys Gly Glu Asn Ala
130 135 140
Cys Ile Thr Glu Arg Leu Ile Trp Ser Gly Asn Thr Pro Ala Glu Lys
145 150 155 160
Gly Asn Asn Leu Ala Gly Gln Cys Val Val His Tyr Asp Pro Thr Gly
165 170 175
Met Asn Gln Thr Asn Ser Ile Ser Leu Thr Ser Ile Pro Leu Ser Ile
180 185 190
Thr Gln Gln Leu Leu Lys Asp Asp Ser Glu Ala Asp Trp His Gly Met
195 200 205
Asp Glu Ser Gly Trp Lys Asn Ala Leu Ala Pro Glu Ser Phe Thr Ser
210 215 220
Val Ser Thr Thr Asp Ala Thr Gly Thr Val Leu Thr Ser Thr Asp Ala
225 230 235 240
Ala Gly Asn Lys Gln Arg Ile Ala Tyr Asp Val Ala Gly Leu Leu Gln
245 250 255
Gly Ser Trp Leu Ala Leu Lys Gly Lys Gln Glu Gln Val Ile Val Lys
260 265 270
Ser Leu Thr Tyr Ser Ala Ala Ser Gln Lys Leu Arg Glu Glu His Gly
275 280 285
Asn Gly Ile Val Thr Thr Tyr Thr Tyr Glu Pro Glu Thr Gln Arg Val
290 295 300
Ile Gly Ile Lys Thr Glu Arg Pro Ser Gly His Ala Ala Gly Glu Lys
305 310 3l5 320
Ile Leu Gln Asn Leu Arg Tyr Glu Tyr Asp Pro Val Gly Asn Val Leu
325 330 335
Lys Ser Thr Asn Asp Ala Glu Ile Thr Arg Phe Trp Arg Asn Gln Lys
340 345 350
Ile Val Pro Glu Asn Thr Tyr Thr Tyr Asp Ser Leu Tyr Gln Leu Val
355 360 365
Ser Val Thr Gly Arg Glu Met Ala Asn Ile Gly Arg Gln Lys Asn Gln
370 375 380
Leu Pro Ile Pro Ala Leu Ile Asp Asn Asn Thr Tyr Thr Asn Tyr Ser
385 390 395 400
Arg Thr Tyr Asp Tyr Asp Arg Gly Gly Asn Leu Thr Arg Ile Arg His
405 410 415
Asn Ser Pro Ile Thr Gly Asn Asn Tyr Thr Thr Asn Met Thr Val Ser
420 425 430
Asp His Ser Asn Arg Ala Val Leu Glu Glu Leu Ala Gln Asp Pro Thr
435 440 445
Gln Val Asp Met Leu Phe Thr Pro Gly Gly His Gln Thr Arg Leu Val
450 455 460
Pro Gly Gln Asp Leu Phe Trp Thr Pro Arg Asp Glu Leu Gln Gln Val
465 470 475 480
Ile Leu Val Asn Arg Glu Asn Thr Thr Pro Asp Gln Glu Phe Tyr Arg
485 490 495
Tyr Asp Ala Asp Ser Gln Arg Val Ile Lys Thr His Ile Gln Lys Thr
500 505 510
Gly Asn Ser Glu Gln Ile Gln Arg Thr Leu Tyr Leu Pro Glu Leu Glu
515 520 525
Trp Arg Thr Thr Tyr Ser Gly Asn Thr Leu Lys Glu Phe Leu Gln Val
530 535 540
Ile Thr Val Gly Glu Ser Gly Gln Ala Gln Val Arg Val Leu His Trp
545 550 555 560
Glu Thr Gly Lys Pro Ala Asp Ile Ser Asn Asp Gln Leu Arg Tyr Ser
565 570 575
Tyr Gly Asn Leu Ile Gly Ser Ser Gly Leu Glu Leu Asp Ser Asp Gly
580 585 590
Gln Ile Ile Ser Gln Glu Glu Tyr Tyr Pro Tyr Gly Gly Thr Ala Val
595 600 605
Trp Ala Ala Arg Ser Gln Ser Glu Ala Asp Tyr Lys Thr Val Arg Tyr
610 615 620
Ser Gly Lys Glu Arg Asp Ala Thr Gly Leu Tyr Tyr Tyr Gly Tyr Arg
625 630 635 640
Tyr Tyr Gln Ser Trp Thr Gly Arg Trp Leu Ser Val Asp Pro Ala Gly
645 650 655
Glu Val Asp Gly Leu Asn Leu Phe Arg Met Cys Arg Asn Asn Pro Ile
660 665 670
Val Phe Ser Asp Ser Asp Gly Arg Phe Pro Gly Gln Gly Val Leu Ala
675 680 685
Trp Ile Gly Lys Lys Ala Tyr Arg Lys Ala Val Asn Ile Thr Thr Glu
690 695 700
His Leu Leu Glu Gln Gly Ala Ser Phe Asp Thr Phe Leu Lys Leu Asn
705 710 715 720
Arg Gly Leu Arg Thr Phe Val Leu Gly Val Gly Val Ala Ser Leu Gly
725 730 735
Val Lys Ala Ala Thr Ile Ala Gly Ala Ser Pro Trp Gly Ile Val Gly
740 745 750
Ala Ala Ile Gly Gly Phe Val Ser Gly Ala Val Met Gly Phe Phe Ala
755 760 765
Asn Asn Ile Ser Glu Lys Ile Gly Glu Val Leu Ser Tyr Leu Thr Arg
770 775 780
Lys Arg Ser Val Pro Val Gln Val Gly Ala Phe Val Val Thr Ser Leu
785 790 795 800
Val Thr Ser Ala Leu Phe Asn Ser Ser Ser Thr Gly Thr Ala Ile Ser
805 810 815
Ala Ala Thr Ala Val Thr Val Gly Gly Leu Met Ala Leu Ala Gly Glu
820 825 830
His Asn Thr Gly Met Ala Ile Ser Ile Ala Thr Pro Ala Gly Gln Gly
835 840 845
Thr Leu Asp Thr Leu Arg Pro Gly Asn Val Ser Ala Pro Glu Arg Leu
850 855 860
Gly Ala Leu Ser Gly Ala Ile Ile Gly Gly Ile Leu Leu Gly Arg His
865 870 875 880
Gln Gly Ser Ser Glu Leu Gly Glu Arg Ala Ala Ile Gly Ala Met Tyr
885 890 895
Gly Ala Arg Trp Gly Arg Ile Ile Gly Asn Leu Trp Asp Gly Pro Tyr
900 905 910
Arg Phe Ile Gly Arg Leu Leu Leu Arg Arg Gly Ile Ser Ser Ala Ile
915 920 925
Ser His Ala Val Ser Ser Arg Ser Trp Phe Gly Arg Met Ile Gly Glu
930 935 940
Ser Val Gly Arg Asn Ile Ser Glu Val Leu Leu Pro Tyr Ser Arg Thr
945 950 955 960
Pro Gly Glu Trp Val Gly Ala Ala Ile Gly Gly Thr Ala Ala Ala Ala
965 970 975
His His Ala Val Gly Gly Glu Val Ala Asn Ala Ala Ser Arg Val Thr
980 985 990
Trp Ser Gly Phe Lys Arg Ala Phe Asn Asn Phe Phe Phe Asn Ala Ser
995 1000 1005
Ala Arg His Asn Glu Ser Glu Ala
1010 1015
<210>18
<211>962
<212>PRT
<213>伯氏致病杆菌
<400>18
Met Asn Val Phe Asn Pro Thr Leu Tyr Ala Gly Thr Pro Thr Val Thr
1 5 10 15
Val Met Asp Asn Arg Gly Leu Ser Val Arg Asp Ile Ala Tyr His Arg
20 25 30
Thr Thr Ala Gly Glu Gln Ala Asp Thr Arg Ile Thr Arg His Gln Tyr
35 40 45
Ser Pro His Ash Phe Leu Ile Glu Ser Ile Asp Pro Arg Leu Phe Asp
50 55 60
Leu Gln Ser Gln Ser Thr Ile Lys Pro Asn Phe Thr Tyr Cys Pro Ala
65 70 75 80
Leu Lys Gly Asp Val Leu Arg Thr Glu Ser Val Asp Ala Gly Gln Thr
85 90 95
Val Ile Leu Ser Asp Ile Glu Gly Arg Pro Leu Leu Asn Ile Ser Ala
100 105 110
Met Gly Val Val Lys His Trp Gln Tyr Glu Glu Ser Thr Leu Pro Gly
115 120 125
Arg Leu Leu Ala Val Ser Glu Arg Lys Asn Glu Ala Ser Thr Pro Gln
130 135 140
Ile Ile Glu Arg Phe Ile Trp Ser Gly Asn Ser Pro Ser Glu Lys Asp
145 150 155 160
His Asn Leu Ala Gly Lys Tyr Leu Arg His Tyr Asp Thr Ala Gly Leu
165 170 175
Asn Gln Leu Asn Ala Val Ser Leu Thr Ser Val Asp Leu Ser Gln Ser
180 185 190
Arg Gln Leu Leu Gln Asp Asp Val Thr Ala Asp Trp Ser Gly Ser Asp
195 200 205
Glu Ser Gln Trp Lys Thr Arg Leu Ser Asn Asp Ile Phe Thr Thr Glu
210 215 220
Ile Thr Ala Asp Ala Val Gly Asn Phe Leu Thr Gln Asn Asp Ala Lys
225 230 235 240
Ser Asn Gln Gln Arg Leu Ser Tyr Asp Val Ala Gly Gln Leu Lys Ala
245 250 255
Ser Trp Leu Thr Ile Lys Gly Gln Asn Glu Gln Val Ile Val Asn Ser
260 265 270
Leu Thr Tyr Ser Ala Ala Gly Gln Lys Leu Arg Glu Glu Gln Gly Asn
275 280 285
Gly Val Val Thr Glu Tyr Ser Tyr Glu Ala Gln Thr Trp Arg Leu Ile
290 295 300
Gly Val Thr Ala Tyr Arg Gln Ser Asp Lys Lys Arg Leu Gln Asp Leu
305 310 315 320
Val Tyr Asn Tyr Asp Pro Val Gly Asn Leu Leu Asn Ile Arg Asn Asn
325 330 335
Ala Glu Ala Thr Arg Phe Trp Arg Asn Gln Ile Val Glu Pro Glu Asn
340 345 350
His Tyr Ala Tyr Asp Ser Leu Tyr Gln Leu Ile Ser Ala Ser Gly Arg
355 360 365
Glu Ile Ala Ser Ile Gly Gln Gln Gly Ser Arg Leu Pro Val Pro Ile
370 375 380
Ile Pro Leu Pro Ala Asn Asp Asp Val Tyr Thr Arg Tyr Thr Arg Thr
385 390 395 400
Tyr His Tyr Asp Arg Gly Gly Asn Leu Cys Gln Ile Arg His Cys Ala
405 410 415
Pro Ala Thr Asp Asn Lys Tyr Thr Thr Lys Ile Thr Val Ser Asn Arg
420 425 430
Ser Asn Arg Ala Val Trp Asp Thr Leu Thr Thr Asp Pro Ala Lys Val
435 440 445
Asp Thr Leu Phe Asp His Gly Gly His Gln Leu Gln Leu Gln Ser Gly
450 455 460
Gln Thr Leu Cys Trp Asn Tyr Arg Gly Glu Leu Gln Gln Ile Thr Lys
465 470 475 480
Ile Gln Arg Asp Glu Lys Pro Ala Asp Lys Glu Arg Tyr Arg Tyr Gly
485 490 495
Val Gly Ala Ala Arg Val Val Lys Ile Ser Thr Gln Gln Ala Gly Gly
500 505 510
Ser Ser His Val Gln Arg Val Val Tyr Leu Pro Gly Leu Glu Leu Arg
515 520 525
Thr Thr Gln His Asp Ala Thr Leu Ile Glu Asp Leu Gln Val Ile Ile
530 535 540
Met Gly Glu Ala Gly Arg Ala Gln Val Arg Val Leu His Trp Glu Ile
545 550 555 560
Pro Pro Pro Asp Asn Leu Asn Asn Asp Ser Leu Arg Tyr Ser Tyr Asp
565 570 575
Ser Leu Met Gly Ser Ser Gln Leu Glu Leu Asp Gly Ala Gly Gln Ile
580 585 590
Ile Thr Gln Glu Glu Tyr Tyr Pro Tyr Gly Gly Thr Ala Ile Trp Ala
595 600 605
Ala Arg Asn Gln Thr Glu Ala Asn Tyr Lys Thr Ile Arg Tyr Ser Gly
610 615 620
Lys Glu Arg Asp Ala Thr Gly Leu Tyr Tyr Tyr Gly His Arg Tyr Tyr
625 630 635 640
Gln Pro Trp Leu Gly Arg Trp Leu Ser Ala Asp Pro Ala Gly Thr Val
645 650 655
Asp Gly Leu Asn Leu Tyr Arg Met Val Arg Asn Asn Pro Ile Thr Tyr
660 665 670
Arg Asp Ala Asp Gly Leu Ala Pro Ile Gly Asp Lys Ile Ser Glu Gly
675 680 685
Ile Tyr Glu Pro Glu Leu Arg Val Gly Leu Glu Arg Asp Asp Pro Asn
690 695 700
Val Arg Asp Tyr Asp Arg Val Tyr Pro Asp Thr Ala Lys Thr Glu Met
705 710 715 720
Ile Glu Ala Thr Ala Thr Thr Ile Ala Pro Ser Gln Met Leu Ser Ala
725 730 735
His Ala Phe Ala Ser Val Pro Ile Leu Thr Asp Leu Phe Asn Pro Gln
740 745 750
Thr Ala Arg Leu Ser Gln Lys Thr Thr Asp Ile Val Leu Asn Thr Gln
755 760 765
Gly Gly Gly Asp Leu Ile Phe Thr Gly Met Asn Ile Lys Gly Lys Gly
770 775 780
Lys Glu Phe Asn Ala Leu Lys Ile Val Asp Thr Tyr Gly Gly Glu Met
785 790 795 800
Pro Asp Ser Lys Thr Ala Ile Ser Ala Tyr Trp Leu Pro Gln Gly Gly
805 810 815
Tyr Thr Asp Ile Pro Ile His Pro Thr Gly Ile Gln Lys Tyr Leu Phe
820 825 830
Thr Pro Ala Phe Ser Gly Cys Thr Leu Ala Val Asp Lys Leu Asn Glu
835 840 845
Asn Thr Leu Arg Ala Tyr His Val Glu Gly Ser Lys Glu Asp Ala Gln
850 855 860
Tyr Asn Asn Leu Ala Val Ala Ala His Gly Glu Gly Leu Val Met Ala
865 870 875 880
Met Glu Phe Pro Asp Tyr Gly Phe His Thr Asp Lys Thr Gly Gln Arg
885 890 895
Leu Arg Asn Thr Gln Gly Phe Ala Phe Met Ser Tyr Asn Gln Ser Gln
900 905 910
Lys Lys Trp Glu Ile His Tyr Gln Arg Gln Ala Leu Thr Ser Asn Thr
915 920 925
Gly Ile Met Asn Val Ser Ala Lys Asn Lys Ile Arg Leu Asn Ala Pro
930 935 940
Ser His Val Lys Asn Ser Ser Ile Lys Gly Thr Glu Ile Met Thr Thr
945 950 955 960
His Phe
<210>19
<211>953
<212>PRT
<213>类芽胞杆菌属菌株DASl529
<400>19
Met Lys Met Ile Pro Trp Thr His His Tyr Leu Leu His Arg Leu Arg
1 5 10 15
Gly Glu Met Glu Val Lys Pro Met Asn Thr Thr Ser Ile Tyr Arg Gly
20 25 30
Thr Pro Thr Ile Ser Val Val Asp Asn Arg Asn Leu Glu Ile Arg Ile
35 40 45
Leu Gln Tyr Asn Arg Ile Ala Ala Glu Asp Pro Ala Asp Glu Cys Ile
50 55 60
Leu Arg Asn Thr Tyr Thr Pro Leu Ser Tyr Leu Gly Ser Ser Met Asp
65 70 75 80
Pro Arg Leu Phe Ser Gln Tyr Gln Asp Asp Arg Gly Thr Pro Pro Asn
85 90 95
Ile Arg Thr Met Ala Ser Leu Arg Gly Glu Ala Leu Cys Ser Glu Ser
100 105 110
Val Asp Ala Gly Arg Lys Ala Glu Leu Phe Asp Ile Glu Gly Arg Pro
115 120 125
Val Trp Leu Ile Asp Ala Asn Gly Thr Glu Thr Thr Leu Glu Tyr Asp
130 135 140
Val Leu Gly Arg Pro Thr Ala Val Phe Glu Gln Gln Glu Gly Thr Asp
145 150 155 160
Ser Pro Gln Cys Arg Glu Arg Phe Ile Tyr Gly Glu Lys Glu Ala Asp
165 170 175
Ala Gln Ala Asn Asn Leu Arg Gly Gln Leu Val Arg His Tyr Asp Thr
180 185 190
Ala Gly Arg Ile Gln Thr Asp Ser Ile Ser Leu Ala Gly Leu Pro Leu
195 200 205
Arg Gln Ser Arg Gln Leu Leu Lys Asn Trp Asp Glu Pro Gly Asp Trp
210 215 220
Ser Met Asp Glu Glu Ser Ala Trp Ala Ser Leu Leu Ala Ala Glu Ala
225 230 235 240
Tyr Asp Thr Ser Trp Arg Tyr Asp Ala Gln Asp Arg Val Leu Ala Gln
245 250 255
Thr Asp Ala Lys Gly Asn Leu Gln Gln Leu Thr Tyr Asn Asp Ala Gly
260 265 270
Gln Pro Gln Ala Val Ser Leu Lys Leu Gln Gly Gln Ala Glu Gln Arg
275 280 285
Ile Trp Asn Arg Ile Glu Tyr Asn Ala Ala Gly Gln Val Asp Leu Ala
290 295 300
Glu Ala Gly Asn Gly Ile Val Thr Glu Tyr Thr Tyr Glu Glu Ser Thr
305 310 315 320
Gln Arg Leu Ile Arg Lys Lys Asp Ser Arg Gly Leu Ser Ser Gly Glu
325 330 335
Arg Glu Val Leu Gln Asp Tyr Arg Tyr Glu Tyr Asp Pro Val Gly Asn
340 345 350
Ile Leu Ser Ile Tyr Asn Glu Ala Glu Pro Val Arg Tyr Phe Arg Asn
355 360 365
Gln Ala Val Ala Pro Lys Arg Gln Tyr Ala Tyr Asp Ala Leu Tyr Gln
370 375 380
Leu Val Ser Ser Ser Gly Arg Glu Ser Asp Ala Leu Arg Gln Gln Thr
385 390 395 400
Ser Leu Pro Pro Leu Ile Thr Pro Ile Pro Leu Asp Asp Ser Gln Tyr
405 410 415
Val Asn Tyr Ala Glu Lys Tyr Ser Tyr Asp Gln Ala Gly Asn Leu Ile
420 425 430
Lys Leu Ser His Asn Gly Ala Ser Gln Tyr Thr Thr Asn Val Tyr Val
435 440 445
Asp Lys Ser Ser Asn Arg Gly Ile Trp Arg Gln Gly Glu Asp Ile Pro
450 455 460
Asp Ile Ala Ala Ser Phe Asp Arg Ala Gly Asn Gln Gln Ala Leu Phe
465 470 475 480
Pro Gly Arg Pro Leu Glu Trp Asp Thr Arg Asn Gln Leu Ser Arg Val
485 490 495
His Met Val Val Arg Glu Gly Gly Asp Asn Asp Trp Glu Gly Tyr Leu
500 505 510
Tyr Asp Ser Ser Gly Met Arg Ile Val Lys Arg Ser Thr Arg Lys Thr
515 520 525
Gln Thr Thr Thr Gln Thr Asp Thr Thr Leu Tyr Leu Pro Gly Leu Glu
530 535 540
Leu Arg Ile Arg Gln Thr Gly Asp Arg Val Thr Glu Ala Leu Gln Val
545 550 555 560
Ile Thr Val Asp Glu Gly Ala Gly Gln Val Arg Val Leu His Trp Glu
565 570 575
Asp Gly Thr Glu Pro Gly Gly Ile Ala Asn Asp Gln Tyr Arg Tyr Ser
580 585 590
Leu Asn Asp His Leu Thr Ser Ser Leu Leu Glu Val Asp Gly Gln Gly
595 600 605
Gln Ile Ile Ser Lys Glu Glu Phe Tyr Pro Tyr Gly Gly Thr Ala Leu
610 615 620
Trp Thr Ala Arg Ser Glu Val Glu Ala Ser Tyr Lys Thr Ile Arg Tyr
625 630 635 640
Ser Gly Lys Glu Arg Asp Ala Thr Gly Leu Tyr Tyr Tyr Gly His Arg
645 650 655
Tyr Tyr Met Pro Trp Leu Gly Arg Trp Leu Asn Pro Asp Pro Ala Gly
660 665 670
Met Val Asp Gly Leu Asn Leu Tyr Arg Met Val Arg Asn Asn Pro Ile
675 680 685
Gly Leu Met Asp Pro Asn Gly Asn Ala Pro Ile Asn Val Ala Asp Tyr
690 695 700
Ser Phe Val His Gly Asp Leu Val Tyr Gly Leu Ser Lys Glu Arg Gly
705 710 715 720
Arg Tyr Leu Lys Leu Phe Asn Pro Asn Phe Asn Met Glu Lys Ser Asp
725 730 735
Ser Pro Ala Met Val Ile Asp Gln Tyr Asn Asn Asn Val Ala Leu Ser
740 745 750
Ile Thr Asn Gln Tyr Lys Val Glu Glu Leu Met Lys Phe Gln Lys Asp
755 760 765
Pro Gln Lys Ala Ala Arg Lys Ile Lys Val Pro Glu Gly Asn Arg Leu
770 775 780
Ser Arg Asn Glu Asn Tyr Pro Leu Trp His Asp Tyr Ile Asn Ile Gly
785 790 795 800
Glu Ala Lys Ala Ala Phe Lys Ala Ser His Ile Phe Gln Glu Val Lys
805 810 815
Gly Asn Tyr Gly Lys Asp Tyr Tyr His Lys Leu Leu Leu Asp Arg Met
820 825 830
Ile Glu Ser Pro Leu Leu Trp Lys Arg Gly Ser Lys Leu Gly Leu Glu
835 840 845
Ile Ala Ala Thr Asn Gln Arg Thr Lys Ile His Phe Val Leu Asp Asn
850 855 860
Leu Asn Ile Glu Gln Val Val Thr Lys Glu Gly Ser Gly Gly Gln Ser
865 870 875 880
Ile Thr Ala Ser Glu Leu Arg Tyr Ile Tyr Arg Asn Arg Glu Arg Leu
885 890 895
Asn Gly Arg Val Ile Phe Tyr Arg Asn Asn Glu Arg Leu Asp Gln Ala
900 905 910
Pro Trp Gln Glu Asn Pro Asp Leu Trp Ser Lys Tyr Gln Pro Gly Leu
915 920 925
Arg Gln Ser Ser Ser Ser Arg Val Lys Glu Arg Gly Ile Gly Asn Phe
930 935 940
Phe Arg Arg Phe Ser Met Lys Arg Lys
945 950
<210>20
<211>930
<212>PRT
<213>类芽胞杆菌属菌株DAS1529
<400>20
Met Asn Thr Thr Ser Ile Tyr Arg Gly Thr Pro Thr Ile Ser Val Val
1 5 10 15
Asp Asn Arg Asn Leu Glu Ile Arg Ile Leu Gln Tyr Asn Arg Ile Ala
20 25 30
Ala Glu Asp Pro Ala Asp Glu Cys Ile Leu Arg Asn Thr Tyr Thr Pro
35 40 45
Leu Ser Tyr Leu Gly Ser Ser Met Asp Pro Arg Leu Phe Ser Gln Tyr
50 55 60
Gln Asp Asp Arg Gly Thr Pro Pro Asn Ile Arg Thr Met Ala Ser Leu
65 70 75 80
Arg Gly Glu Ala Leu Cys Ser Glu Ser Val Asp Ala Gly Arg Lys Ala
85 90 95
Glu Leu Phe Asp Ile Glu Gly Arg Pro Val Trp Leu Ile Asp Ala Asn
100 105 110
Gly Thr Glu Thr Thr Leu Glu Tyr Asp Val Leu Gly Arg Pro Thr Ala
115 120 125
Val Phe Glu Gln Gln Glu Gly Thr Asp Ser Pro Gln Cys Arg Glu Arg
130 135 140
Phe Ile Tyr Gly Glu Lys Glu Ala Asp Ala Gln Ala Asn Asn Leu Arg
145 150 155 160
Gly Gln Leu Val Arg His Tyr Asp Thr Ala Gly Arg Ile Gln Thr Asp
165 170 175
Ser Ile Ser Leu Ala Gly Leu Pro Leu Arg Gln Ser Arg Gln Leu Leu
180 185 190
Lys Asn Trp Asp Glu Pro Gly Asp Trp Ser Met Asp Glu Glu Ser Ala
195 200 205
Trp Ala Ser Leu Leu Ala Ala Glu Ala Tyr Asp Thr Ser Trp Arg Tyr
210 215 220
Asp Ala Gln Asp Arg Val Leu Ala Gln Thr Asp Ala Lys Gly Asn Leu
225 230 235 240
Gln Gln Leu Thr Tyr Asn Asp Ala Gly Gln Pro Gln Ala Val Ser Leu
245 250 255
Lys Leu Gln Gly Gln Ala Glu Gln Arg Ile Trp Asn Arg Ile Glu Tyr
260 265 270
Asn Ala Ala Gly Gln Val Asp Leu Ala Glu Ala Gly Asn Gly Ile Val
275 280 285
Thr Glu Tyr Thr Tyr Glu Glu Ser Thr Gln Arg Leu Ile Arg Lys Lys
290 295 300
Asp Ser Arg Gly Leu Ser Ser Gly Glu Arg Glu Val Leu Gln Asp Tyr
305 310 315 320
Arg Tyr Glu Tyr Asp Pro Val Gly Asn Ile Leu Ser Ile Tyr Asn Glu
325 330 335
Ala Glu Pro Val Arg Tyr Phe Arg Asn Gln Ala Val Ala Pro Lys Arg
340 345 350
Gln Tyr Ala Tyr Asp Ala Leu Tyr Gln Leu Val Ser Ser Ser Gly Arg
355 360 365
Glu Ser Asp Ala Leu Arg Gln Gln Thr Ser Leu Pro Pro Leu Ile Thr
370 375 380
Pro Ile Pro Leu Asp Asp Ser Gln Tyr Val Asn Tyr Ala Glu Lys Tyr
385 390 395 400
Ser Tyr Asp Gln Ala Gly Asn Leu Ile Lys Leu Ser His Asn Gly Ala
405 410 415
Ser Gln Tyr Thr Thr Asn Val Tyr Val Asp Lys Ser Ser Asn Arg Gly
420 425 430
Ile Trp Arg Gln Gly Glu Asp Ile Pro Asp Ile Ala Ala Ser Phe Asp
435 440 445
Arg Ala Gly Asn Gln Gln Ala Leu Phe Pro Gly Arg Pro Leu Glu Trp
450 455 460
Asp Thr Arg Asn Gln Leu Ser Arg Val His Met Val Val Arg Glu Gly
465 470 475 480
Gly Asp Asn Asp Trp Glu Gly Tyr Leu Tyr Asp Ser Ser Gly Met Arg
485 490 495
Ile Val Lys Arg Ser Thr Arg Lys Thr Gln Thr Thr Thr Gln Thr Asp
500 505 510
Thr Thr Leu Tyr Leu Pro Gly Leu Glu Leu Arg Ile Arg Gln Thr Gly
515 520 525
Asp Arg Val Thr Glu Ala Leu Gln Val Ile Thr Val Asp Glu Gly Ala
530 535 540
Gly Gln Val Arg Val Leu His Trp Glu Asp Gly Thr Glu Pro Gly Gly
545 550 555 560
Ile Ala Asn Asp Gln Tyr Arg Tyr Ser Leu Asn Asp His Leu Thr Ser
565 570 575
Ser Leu Leu Glu Val Asp Gly Gln Gly Gln Ile Ile Ser Lys Glu Glu
580 585 590
Phe Tyr Pro Tyr Gly Gly Thr Ala Leu Trp Thr Ala Arg Ser Glu Val
595 600 605
Glu Ala Ser Tyr Lys Thr Ile Arg Tyr Ser Gly Lys Glu Arg Asp Ala
610 615 620
Thr Gly Leu Tyr Tyr Tyr Gly His Arg Tyr Tyr Met Pro Trp Leu Gly
625 630 635 640
Arg Trp Leu Asn Pro Asp Pro Ala Gly Met Val Asp Gly Leu Asn Leu
645 650 655
Tyr Arg Met Val Arg Asn Asn Pro Ile Gly Leu Met Asp Pro Asn Gly
660 665 670
Asn Ala Pro Ile Asn Val Ala Asp Tyr Ser Phe Val His Gly Asp Leu
675 680 685
Val Tyr Gly Leu Ser Lys Glu Arg Gly Arg Tyr Leu Lys Leu Phe Asn
690 695 700
Pro Asn Phe Asn Met Glu Lys Ser Asp Ser Pro Ala Met Val Ile Asp
705 710 715 720
Gln Tyr Asn Asn Asn Val Ala Leu Ser Ile Thr Asn Gln Tyr Lys Val
725 730 735
Glu Glu Leu Met Lys Phe Gln Lys Asp Pro Gln Lys Ala Ala Arg Lys
740 745 750
Ile Lys Val Pro Glu Gly Asn Arg Leu Ser Arg Asn Glu Asn Tyr Pro
755 760 765
Leu Trp His Asp Tyr Ile Asn Ile Gly Glu Ala Lys Ala Ala Phe Lys
770 775 780
Ala Ser His Ile Phe Gln Glu Val Lys Gly Asn Tyr Gly Lys Asp Tyr
785 790 795 800
Tyr His Lys Leu Leu Leu Asp Arg Met Ile Glu Ser Pro Leu Leu Trp
805 810 815
Lys Arg Gly Ser Lys Leu Gly Leu Glu Ile Ala Ala Thr Asn Gln Arg
820 825 830
Thr Lys Ile His Phe Val Leu Asp Asn Leu Asn Ile Glu Gln Val Val
835 840 845
Thr Lys Glu Gly Ser Gly Gly Gln Ser Ile Thr Ala Ser Glu Leu Arg
850 855 860
Tyr Ile Tyr Arg Asn Arg Glu Arg Leu Asn Gly Arg Val Ile Phe Tyr
865 870 875 880
Arg Asn Asn Glu Arg Leu Asp Gln Ala Pro Trp Gln Glu Asn Pro Asp
885 890 895
Leu Trp Ser Lys Tyr Gln Pro Gly Leu Arg Gln Ser Ser Ser Ser Arg
900 905 910
Val Lys Glu Arg Gly Ile Gly Asn Phe Phe Arg Arg Phe Ser Met Lys
915 920 925
Arg Lys
930
<210>21
<211>973
<212>PRT
<213>嗜虫沙雷氏菌
<400>21
Met Ser Thr Ser Leu Phe Ser Ser Thr Pro Ser Val Ala Val Leu Asp
1 5 10 15
Asn Arg Gly Leu Leu Val Arg Glu Leu Gln Tyr Tyr Arg His Pro Asp
20 25 30
Thr Pro Glu Glu Thr Asp Glu Arg Ile Thr Cys His Gln His Asp Glu
35 40 45
Arg Gly Ser Leu Ser Gln Ser Ala Asp Pro Arg Leu His Ala Ala Gly
50 55 60
Leu Thr Asn Phe Thr Tyr Leu Asn Ser Leu Thr Gly Thr Val Leu Gln
65 70 75 80
Ser Val Ser Ala Asp Ala Gly Thr Ser Leu Glu Leu Ser Asp Ala Ala
85 90 95
Gly Arg Ala Phe Leu Ala Val Thr Gly Ala Gly Thr Glu Asp Ala Val
100 105 110
Thr Arg Thr Trp Gln Tyr Glu Asp Asp Thr Leu Pro Gly Arg Pro Leu
115 120 125
Ser Ile Thr Glu Gln Val Thr Gly Glu Ala Ala Gln Ile Thr Glu Arg
130 135 140
Phe Val Tyr Ala Gly Asn Thr Asp Ala Glu Lys Ile Leu Asn Leu Ala
145 150 155 160
Gly Gln Cys Val Ser His Tyr Asp Thr Ala Gly Leu Val Gln Thr Asp
165 170 175
Ser Ile Ala Leu Ser Gly Val Pro Leu Ala Val Thr Arg Gln Leu Leu
180 185 190
Pro Asp Ala Ala Gly Ala Asn Trp Met Gly Glu Asp Ala Ser Ala Trp
195 200 205
Asn Asp Leu Leu Asp Gly Glu Thr Phe Phe Thr Gln Thr His Ala Asp
210 215 220
Ala Thr Gly Ala Val Leu Ser Ile Thr Asp Ala Lys Gly Asn Leu Gln
225 230 235 240
Arg Val Ala Tyr Asp Val Ala Gly Leu Leu Ser Gly Ser Trp Leu Thr
245 250 255
Leu Lys Asp Gly Thr Glu Gln Val Ile Val Ala Ser Leu Thr Tyr Ser
260 265 270
Ala Ala Gly Lys Lys Leu Arg Glu Glu His Gly Asn Gly Val Val Thr
275 280 285
Ser Tyr Ile Tyr Glu Pro Glu Thr Gln Arg Leu Thr Gly Ile Lys Thr
290 295 300
Glu Arg Pro Ser Gly His Val Ala Gly Ala Lys Val Leu Gln Asp Leu
305 310 315 320
Arg Tyr Thr Tyr Asp Pro Val Gly Asn Val Leu Ser Val Asn Asn Asp
325 330 335
Ala Glu Glu Thr Arg Phe Trp Arg Asn Gln Lys Val Val Pro Glu Asn
340 345 350
Thr Tyr Ile Tyr Asp Ser Leu Tyr Gln Leu Val Ser Ala Thr Gly Arg
355 360 365
Glu Met Ala Asn Ala Gly Gln Gln Gly Asn Asp Leu Pro Ser Ala Thr
370 375 380
Ala Pro Leu Pro Thr Asp Ser Ser Ala Tyr Thr Asn Tyr Thr Arg Thr
385 390 395 400
Tyr Arg Tyr Asp Arg Gly Gly Asn Leu Thr Gln Met Arg His Ser Ala
405 410 415
Pro Ala Thr Asn Asn Asn Tyr Thr Thr Asp Ile Thr Val Ser Asp Arg
420 425 430
Ser Asn Arg Ala Val Leu Ser Thr Leu Ala Glu Val Pro Ser Asp Val
435 440 445
Asp Met Leu Phe Ser Ala Gly Gly His Gln Lys His Leu Gln Pro Gly
450 455 460
Gln Ala Leu Val Trp Thr Pro Arg Gly Glu Leu Gln Lys Val Thr Pro
465 470 475 480
Val Val Arg Asp Gly Gly Ala Asp Asp Ser Glu Ser Tyr Arg Tyr Asp
485 490 495
Ala Gly Ser Gln Arg Ile Ile Lys Thr Gly Thr Arg Gln Thr Gly Asn
500 505 510
Asn Val Gln Thr Gln Arg Val Val Tyr Leu Pro Gly Leu Glu Leu Arg
515 520 525
Ile Met Ala Asn Gly Val Thr Glu Lys Glu Ser Leu Gln Val Ile Thr
530 535 540
Val Gly Glu Ala Gly Arg Ala Gln Val Arg Val Leu His Trp Glu Ile
545 550 555 560
Gly Lys Pro Asp Asp Leu Asp Glu Asp Ser Val Arg Tyr Ser Tyr Asp
565 570 575
Asn Leu Val Gly Ser Ser Gln Leu Glu Leu Asp Arg Glu Gly Tyr Leu
580 585 590
Ile Ser Glu Glu Glu Phe Tyr Pro Tyr Gly Gly Thr Ala Val Leu Thr
595 600 605
Ala Arg Ser Glu Val Glu Ala Asp Tyr Lys Thr Ile Arg Tyr Ser Gly
610 615 620
Lys Glu Arg Asp Ala Thr Gly Leu Asp Tyr Tyr Gly Tyr Arg Tyr Tyr
625 630 635 640
Gln Pro Trp Ala Gly Arg Trp Leu Ser Thr Asp Pro Ala Gly Thr Val
645 650 655
Asp Gly Leu Asn Leu Phe Arg Met Val Arg Asn Asn Pro Val Thr Leu
660 665 670
Phe Asp Ser Asn Gly Arg Ile Ser Thr Gly Gln Glu Ala Arg Arg Leu
675 680 685
Val Gly Glu Ala Phe Val His Pro Leu His Met Pro Val Phe Glu Arg
690 695 700
Ile Ser Val Glu Arg Lys Ile Ser Met Ser Val Arg Glu Ala Gly Ile
705 710 715 720
Tyr Thr Ile Ser Ala Leu Gly Glu Gly Ala Ala Ala Lys Gly His Asn
725 730 735
Ile Leu Glu Lys Thr Ile Lys Pro Gly Ser Leu Lys Ala Ile Tyr Gly
740 745 750
Asp Lys Ala Glu Ser Ile Leu Gly Leu Ala Lys Arg Ser Gly Leu Val
755 760 765
Gly Arg Val Gly Gln Trp Asp Ala Ser Gly Val Arg Gly Ile Tyr Ala
770 775 780
His Asn Arg Pro Gly Gly Glu Asp Leu Val Tyr Pro Val Ser Leu Gln
785 790 795 800
Asn Thr Ser Ala Asn Glu Ile Val Asn Ala Trp Ile Lys Phe Lys Ile
805 810 815
Ile Thr Pro Tyr Thr Gly Asp Tyr Asp Met His Asp Ile Ile Lys Phe
820 825 830
Ser Asp Gly Lys Gly His Val Pro Thr Ala Glu Ser Ser Glu Glu Arg
835 840 845
Gly Val Lys Asp Leu Ile Asn Lys Gly Val Ala Glu Val Asp Pro Ser
850 855 860
Arg Pro Phe Glu Tyr Thr Ala Met Asn Val Ile Arg His Gly Pro Gln
865 870 875 880
Val Asn Phe Val Pro Tyr Met Trp Glu His Glu His Asp Lys Val Val
885 890 895
Asn Asp Asn Gly Tyr Leu Gly Val Val Ala Ser Pro Gly Pro Phe Pro
900 905 910
Val Ala Met Val His Gln Gly Glu Trp Thr Val Phe Asp Asn Ser Glu
915 920 925
Glu Leu Phe Asn Phe Tyr Lys Ser Thr Asn Thr Pro Leu Pro Glu His
930 935 940
Trp Ser Gln Asp Phe Met Asp Arg Gly Lys Gly Ile Val Ala Thr Pro
945 950 955 960
Arg His Ala Glu Leu Leu Asp Lys Arg Arg Val Met Tyr
965 970
<210>22
<211>2523
<212>PRT
<213>嗜线虫致病杆菌
<400>22
Met Ile Lys Val Asn Glu Leu Leu Asp Lys Ile Asn Arg Lys Arg Ser
1 5 10 15
Gly Asp Thr Leu Leu Leu Thr Asn Ile Ser Phe Met Ser Phe Ser Glu
20 25 30
Phe Arg His Arg Thr Ser Gly Thr Leu Thr Trp Arg Glu Thr Asp Phe
35 40 45
Leu Tyr Gln Gln Ala His Gln Glu Ser Lys Gln Asn Lys Leu Glu Glu
50 55 60
Leu Arg Ile Leu Ser Arg Ala Asn Pro Gln Leu Ala Asn Thr Thr Asn
65 70 75 80
Leu Asn Ile Thr Pro Ser Thr Leu Asn Asn Ser Tyr Asn Ser Trp Phe
85 90 95
Tyr Gly Arg Ala His Arg Phe Val Lys Pro Gly Ser Ile Ala Ser Ile
100 105 110
Phe Ser Pro Ala Ala Tyr Leu Thr Glu Leu Tyr Arg Glu Ala Lys Asp
115 120 125
Phe His Pro Asp Asn Ser Gln Tyr His Leu Asn Lys Arg Arg Pro Asp
130 135 140
Ile Ala Ser Leu Ala Leu Thr Gln Asn Asn Met Asp Glu Glu Ile Ser
145 150 155 160
Thr Leu Ser Leu Ser Asn Glu Leu Leu Leu His Asn Ile Gln Thr Leu
165 170 175
Glu Lys Thr Asp Tyr Asn Gly Val Met Lys Met Leu Ser Thr Tyr Arg
180 185 190
Gln Thr Gly Met Thr Pro Tyr His Leu Pro Tyr Glu Ser Ala Arg Gln
195 200 205
Ala Ile Leu Leu Gln Asp Lys Asn Leu Thr Ala Phe Ser Arg Asn Thr
210 215 220
Asp Val Ala Glu Leu Met Asp Pro Thr Ser Leu Leu Ala Ile Lys Thr
225 230 235 240
Asp Ile Ser Pro Glu Leu Tyr Gln Ile Leu Val Glu Glu Ile Thr Pro
245 250 255
Glu Asn Ser Thr Glu Leu Met Lys Lys Asn Phe Gly Thr Asp Asp Val
260 265 270
Leu Ile Phe Lys Ser Tyr Ala Ser Leu Ala Arg Tyr Tyr Asp Leu Ser
275 280 285
Tyr Asp Glu Leu Ser Leu Phe Val Asn Leu Ser Phe Gly Lys Lys Asn
290 295 300
Thr Asn Gln Gln Tyr Lys Asn Glu Gln Leu Ile Thr Leu Val Asn Asp
305 310 315 320
Gly Asn Asp Thr Ala Thr Ala Arg Leu Ile Lys Arg Thr Arg Lys Asp
325 330 335
Phe Tyr Asp Ser His Leu Asn Tyr Ala Glu Leu Ile Pro Ile Lys Glu
340 345 350
Asn Glu Tyr Lys Tyr Asn Phe Ser Val Lys Lys Thr Glu Pro Asp His
355 360 365
Leu Asp Phe Arg Leu Gln Asn Gly Asp Lys Glu Tyr Ile Tyr Gln Asp
370 375 380
Lys Asn Phe Val Pro Ile Ala Asn Thr His Tyr Ser Ile Pro Ile Lys
385 390 395 400
Leu Thr Thr Glu Gln Ile Thr Asn Gly Ile Thr Leu Arg Leu Trp Arg
405 410 415
Val Lys Pro Asn Pro Ser Asp Ala Ile Asn Ala Asn Ala Tyr Phe Lys
420 425 430
Met Met Glu Phe Pro Gly Asp Ile Phe Leu Leu Lys Leu Asn Lys Ala
435 440 445
Ile Arg Leu Tyr Lys Ala Thr Gly Ile Ser Pro Glu Asp Ile Trp Gln
450 455 460
Val Ile Glu Ser Ile Tyr Asp Asp Leu Thr Ile Asp Ser Asn Val Leu
465 470 475 480
Gly Lys Leu Phe Tyr Val Gln Tyr Tyr Met Gln His Tyr Asn Ile Ser
485 490 495
Val Ser Asp Ala Leu Val Leu Cys His Ser Asp Ile Ser Gln Tyr Ser
500 505 510
Thr Lys Gln Gln Pro Ser His Phe Thr Ile Leu Phe Asn Thr Pro Leu
515 520 525
Leu Asn Gly Gln Glu Phe Ser Ala Asp Asn Thr Lys Leu Asp Leu Thr
530 535 540
Pro Gly Glu Ser Lys Asn His Phe Tyr Leu Gly Ile Met Lys Arg Ala
545 550 555 560
Phe Arg Val Asn Asp Thr Glu Leu Tyr Thr Leu Trp Lys Leu Ala Asn
565 570 575
Gly Gly Thr Asn Pro Glu Phe Met Cys Ser Ile Glu Asn Leu Ser Leu
580 585 590
Leu Tyr Arg Val Arg Leu Leu Ala Asp Ile His His Leu Thr Val Asn
595 600 605
Glu Leu Ser Met Leu Leu Ser Val Ser Pro Tyr Val Asn Thr Lys Ile
610 615 620
Ala Leu Phe Ser Asp Thr Ala Leu Thr Gln Leu Ile Ser Phe Leu Phe
625 630 635 640
Gln Cys Thr Gln Trp Leu Thr Thr Gln Lys Trp Ser Val Ser Asp Val
645 650 655
Phe Leu Met Thr Thr Asp Asn Tyr Ser Thr Val Leu Thr Pro Asp Ile
660 665 670
Glu Asn Leu Ile Thr Thr Leu Ser Asn Gly Leu Ser Thr Leu Ser Leu
675 680 685
Gly Asp Asp Glu Leu Ile Arg Ala Ala Ala Pro Leu Ile Ala Ala Ser
690 695 700
Ile Gln Met Asp Ser Ala Lys Thr Ala Glu Thr Ile Leu Leu Trp Ile
705 710 715 720
Asn Gln Ile Lys Pro Gln Gly Leu Thr Phe Asp Asp Phe Met Ile Ile
725 730 735
Ala Ala Asn Arg Asp Arg Ser Glu Asn Glu Thr Ser Asn Met Val Ala
740 745 750
Phe Cys Gln Val Leu Gly Gln Leu Ser Leu Ile Val Arg Asn Ile Gly
755 760 765
Leu Ser Glu Asn Glu Leu Thr Leu Leu Val Thr Lys Pro Glu Lys Phe
770 775 780
Gln Ser Glu Thr Thr Ala Leu Gln His Asp Leu Pro Thr Leu Gln Ala
785 790 795 800
Leu Thr Arg Phe His Ala Val Ile Met Arg Cys Gly Ser Tyr Ala Thr
805 810 815
Glu Ile Leu Thr Ala Leu Glu Leu Gly Ala Leu Thr Ala Glu Gln Leu
820 825 830
Ala Val Ala Leu Lys Phe Asp Ala Gln Val Val Thr Gln Ala Leu Gln
835 840 845
Gln Thr Gly Leu Gly Val Asn Thr Phe Thr Asn Trp Arg Thr Ile Asp
850 855 860
Val Thr Leu Gln Trp Leu Asp Val Ala Ala Thr Leu Gly Ile Thr Pro
865 870 875 880
Asp Gly Val Ala Ala Leu Ile Lys Leu Lys Tyr Ile Gly Glu Pro Glu
885 890 895
Thr Pro Met Pro Thr Phe Asp Asp Trp Gln Ala Ala Ser Thr Leu Leu
900 905 910
Gln Ala Gly Leu Asn Ser Gln Gln Ser Asp Gln Leu Gln Ala Trp Leu
915 920 925
Asp Glu Ala Thr Thr Thr Ala Ala Ser Ala Tyr Tyr Ile Lys Asn Ser
930 935 940
Ala Pro Gln Gln Ile Lys Ser Arg Asp Glu Leu Tyr Ser Tyr Leu Leu
945 950 955 960
Ile Asp Asn Gln Val Ser Ala Gln Val Lys Thr Thr Arg Val Ala Glu
965 970 975
Ala Ile Ala Ser Ile Gln Leu Tyr Val Asn Arg Ala Leu Asn Asn Val
980 985 990
Glu Gly Lys Val Ser Lys Pro Val Lys Thr Arg Gln Phe Phe Cys Asp
995 1000 1005
Trp Glu Thr Tyr Asn Arg Arg Tyr Ser Thr Trp Ala Gly Val Ser
1010 1015 1020
Glu Leu Ala Tyr Tyr Pro Glu Asn Tyr Ile Asp Pro Thr Ile Arg
1025 1030 1035
Ile Gly Gln Thr Gly Met Met Asn Asn Leu Leu Gln Gln Leu Ser
1040 1045 1050
Gln Ser Gln Leu Asn Ile Asp Thr Val Glu Asp Ser Phe Lys Asn
1055 1060 1065
Tyr Leu Thr Ala Phe Glu Asp Val Ala Asn Leu Gln Val Ile Ser
1070 1075 1080
Gly Tyr His Asp Ser Ile Asn Val Asn Glu Gly Leu Thr Tyr Leu
1085 1090 1095
Ile Gly Tyr Ser Gln Thr Glu Pro Arg Ile Tyr Tyr Trp Arg Asn
1100 1105 1110
Val Asp His Gln Lys Cys Gln His Gly Gln Phe Ala Ala Asn Ala
1115 1120 1125
Trp Gly Glu Trp Lys Lys Ile Glu Ile Pro Ile Asn Val Trp Gln
1130 1135 1140
Glu Asn Ile Arg Pro Val Ile Tyr Lys Ser Arg Leu Tyr Leu Leu
1145 1150 1155
Trp Leu Glu Gln Lys Glu Leu Lys Asn Glu Ser Glu Asp Gly Lys
1160 1165 1170
Ile Asp Ile Thr Asp Tyr Ile Leu Lys Leu Ser His Ile Arg Tyr
1175 1180 1185
Asp Gly Ser Trp Ser Ser Pro Phe Asn Phe Asn Val Thr Asp Lys
1190 1195 1200
Ile Glu Asn Leu Ile Asn Lys Lys Ala Ser Ile Gly Met Tyr Cys
1205 1210 1215
Ser Ser Asp Tyr Glu Lys Asp Val Ile Ile Val Tyr Phe His Glu
1220 1225 1230
Lys Lys Asp Asn Tyr Ser Phe Asn Ser Leu Pro Ala Arg Glu Gly
1235 1240 1245
Met Thr Ile Asn Pro Asp Met Thr Leu Ser Ile Leu Thr Glu Asn
1250 1255 1260
Asp Leu Asp Ala Ile Val Lys Ser Thr Leu Ser Glu Leu Asp Thr
1265 1270 1275
Arg Thr Glu Tyr Lys Val Asn Asn Gln Phe Ala Thr Asp Tyr Leu
1280 1285 1290
Ala Glu Tyr Lys Glu Ser Ile Thr Thr Lys Asn Lys Leu Ala Ser
1295 1300 1305
Phe Thr Gly Asn Ile Phe Asp Leu Ser Tyr Ile Ser Pro Gly Asn
1310 1315 1320
Gly His Ile Asn Leu Thr Phe Asn Pro Ser Met Glu Ile Asn Phe
1325 1330 1335
Ser Lys Gly Asn Ile Tyr Asn Asp Glu Val Lys Tyr Leu Leu Ser
1340 1345 1350
Met Val Glu Asp Glu Thr Val Ile Leu Phe Asp Tyr Asp Arg His
1355 1360 1365
Asp Glu Met Leu Gly Lys Glu Glu Glu Val Phe His Tyr Gly Thr
1370 1375 1380
Leu Asp Phe Ile Ile Ser Ile Asp Leu Lys Asn Ala Glu Tyr Phe
1385 1390 1395
Arg Val Leu Met His Leu Arg Thr Lys Glu Lys Ile Pro Arg Lys
1400 1405 1410
Ser Glu Ile Gly Val Gly Ile Asn Tyr Asp Tyr Glu Ser Asn Asp
1415 1420 1425
Ala Glu Phe Lys Leu Asp Thr Asn Ile Val Leu Asp Trp Lys Asp
1430 1435 1440
Asn Thr Gly Val Trp His Thr Ile Cys Glu Ser Phe Thr Asn Asp
1445 1450 1455
Val Ser Ile Ile Asn Asn Met Gly Asn Ile Ala Ala Leu Phe Leu
1460 1465 1470
Arg Glu Asp Pro Cys Val Tyr Leu Cys Ser Ile Ala Thr Asp Ile
1475 1480 1485
Lys Ile Ala Ser Ser Met Ile Glu Gln Ile Gln Asp Lys Asn Ile
1490 1495 1500
Ser Phe Leu Leu Lys Asn Gly Ser Asp Ile Leu Val Glu Leu Asn
1505 1510 1515
Ala Glu Asp His Val Ala Ser Lys Pro Ser His Glu Ser Asp Pro
1520 1525 1530
Met Val Tyr Asp Phe Asn Gln Val Lys Val Asp Ile Glu Gly Tyr
1535 1540 1545
Asp Ile Pro Leu Val Ser Glu Phe Ile Ile Lys Gln Pro Asp Gly
1550 1555 1560
Gly Tyr Asn Asp Ile Val Ile Glu Ser Pro Ile His Ile Lys Leu
1565 1570 1575
Lys Ser Lys Asp Thr Ser Asn Val Ile Ser Leu His Lys Met Pro
1580 1585 1590
Ser Gly Thr Gln Tyr Met Gln Ile Gly Pro Tyr Arg Thr Arg Leu
1595 1600 1605
Asn Thr Leu Phe Ser Arg Lys Leu Ala Glu Arg Ala Asn Ile Gly
1610 1615 1620
Ile Asp Asn Val Leu Ser Met Glu Thr Gln Asn Leu Pro Glu Pro
1625 1630 1635
Gln Leu Gly Glu Gly Phe Tyr Ala Thr Phe Lys Leu Pro Pro Tyr
1640 1645 1650
Asn Lys Glu Glu His Gly Asp Glu Arg Trp Phe Lys Ile His Ile
1655 1660 1665
Gly Asn Ile Asp Gly Asn Ser Ala Arg Gln Pro Tyr Tyr Glu Gly
1670 1675 1680
Met Leu Ser Asp Ile Glu Thr Thr Val Thr Leu Phe Val Pro Tyr
1685 1690 1695
Ala Lys Gly Tyr Tyr Ile Arg Glu Gly Val Arg Leu Gly Val Gly
1700 1705 1710
Tyr Lys Lys Ile Ile Tyr Asp Lys Ser Trp Glu Ser Ala Phe Phe
1715 1720 1725
Tyr Phe Asp Glu Thr Lys Asn Gln Phe Ile Phe Ile Asn Asp Ala
1730 1735 1740
Asp His Asp Ser Gly Met Thr Gln Gln Gly Ile Val Lys Asn Ile
1745 1750 1755
Lys Lys Tyr Lys Gly Phe Ile His Val Val Val Met Lys Asn Asn
1760 1765 1770
Thr Glu Pro Met Asp Phe Asn Gly Ala Asn Ala Ile Tyr Phe Trp
1775 1780 1785
Glu Leu Phe Tyr Tyr Thr Pro Met Met Val Phe Gln Arg Leu Leu
1790 1795 1800
Gln Glu Gln Asn Phe Thr Glu Ser Thr Arg Trp Leu Arg Tyr Ile
1805 1810 1815
Trp Asn Pro Ala Gly Tyr Ser Val Gln Gly Glu Met Gln Asp Tyr
1820 1825 1830
Tyr Trp Asn Val Arg Pro Leu Glu Glu Asp Thr Ser Trp Asn Ala
1835 1840 1845
Asn Pro Leu Asp Ser Val Asp Pro Asp Ala Val Ala Gln His Asp
1850 1855 1860
Pro Met His Tyr Lys Val Ala Thr Phe Met Lys Met Leu Asp Leu
1865 1870 1875
Leu Ile Thr Arg Gly Asp Ser Ala Tyr Arg Gln Leu Glu Arg Asp
1880 1885 1890
Thr Leu Asn Glu Ala Lys Met Trp Tyr Val Gln Ala Leu Thr Leu
1895 1900 1905
Leu Gly Asp Glu Pro Tyr Phe Ser Leu Asp Asn Asp Trp Ser Glu
1910 1915 1920
Pro Arg Leu Glu Glu Ala Ala Ser Gln Thr Met Arg His His Tyr
1925 1930 1935
Gln His Lys Met Leu Gln Leu Arg Gln Arg Ala Ala Leu Pro Thr
1940 1945 1950
Lys Arg Thr Ala Asn Ser Leu Thr Ala Leu Phe Leu Pro Gln Ile
1955 1960 1965
Asn Lys Lys Leu Gln Gly Tyr Trp Gln Thr Leu Thr Gln Arg Leu
1970 1975 1980
Tyr Asn Leu Arg His Asn Leu Thr Ile Asp Gly Gln Pro Leu Ser
1985 1990 1995
Leu Ser Leu Tyr Ala Thr Pro Ala Asp Pro Ser Met Leu Leu Ser
2000 2005 2010
Ala Ala Ile Thr Ala Ser Gln Gly Gly Gly Asp Leu Pro His Ala
2015 2020 2025
Val Met Pro Met Tyr Arg Phe Pro Val Ile Leu Glu Asn Ala Lys
2030 2035 2040
Trp Gly Val Ser Gln LeuIle Gln Phe Gly Asn Thr Leu Leu Ser
2045 2050 2055
Ile Thr Glu Arg Gln Asp Ala Glu Ala Leu Ala Glu Ile Leu Gln
2060 2065 2070
Thr Gln Gly Ser Glu Leu Ala Leu Gln Ser Ile Lys Met Gln Asp
2075 2080 2085
Lys Val Met Ala Glu Ile Asp Ala Asp Lys Leu Ala Leu Gln Glu
2090 2095 2100
Ser Arg His Gly Ala Gln Ser Arg Phe Asp Ser Phe Asn Thr Leu
2105 2110 2115
Tyr Asp Glu Asp Val Asn Ala Gly Glu Lys Gln Ala Met Asp Leu
2120 2125 2130
Tyr Leu Ser Ser Ser Val Leu Ser Thr Ser Gly Thr Ala Leu His
2135 2140 2145
Met Ala Ala Ala Ala Ala Asp Leu Val Pro Asn Ile Tyr Gly Phe
2150 2155 2160
Ala Val Gly Gly Ser Arg Phe Gly Ala Leu Phe Asn Ala Ser Ala
2165 2170 2175
Ile Gly Ile Glu Ile Ser Ala Ser Ala Thr Arg Ile Ala Ala Asp
2180 2185 2190
Lys Ile Ser Gln Ser Glu Ile Tyr Arg Arg Arg Arg Gln Glu Trp
2195 2200 2205
Glu Ile Gln Arg Asn Asn Ala Glu Ala Glu Ile Lys Gln Ile Asp
2210 2215 2220
Ala Gln Leu Ala Thr Leu Ala Val Arg Arg Glu Ala Ala Val Leu
2225 2230 2235
Gln Lys Asn Tyr Leu Glu Thr Gln Gln Ala Gln Thr Gln Ala Gln
2240 2245 2250
Leu Ala Phe Leu Gln Ser Lys Phe Ser Asn Ala Ala Leu Tyr Asn
2255 2260 2265
Trp Leu Arg Gly Arg Leu Ser Ala Ile Tyr Tyr Gln Phe Tyr Asp
2270 2275 2280
Leu Ala Val Ser Leu Cys Leu Met Ala Glu Gln Thr Tyr Gln Tyr
2285 2290 2295
Glu Leu Asn Asn Ala Ala Ala His Phe Ile Lys Pro Gly Ala Trp
2300 2305 2310
His Gly Thr Tyr Ala Gly Leu Leu Ala Gly Glu Thr Leu Met Leu
2315 2320 2325
Asn Leu Ala Gln Met Glu Lys Ser Tyr Leu Glu Lys Asp Glu Arg
2330 2335 2340
Ala Leu Glu Val Thr Arg Thr Val Ser Leu Ala Glu Val Tyr Ala
2345 2350 2355
Gly Leu Thr Glu Asn Ser Phe Ile Leu Lys Asp Lys Val Thr Glu
2360 2365 2370
Leu Val Asn Ala Gly Glu Gly Ser Ala Gly Thr Thr Leu Asn Gly
2375 2380 2385
Leu Asn Val Glu Gly Thr Gln Leu Gln Ala Ser Leu Lys Leu Ser
2390 2395 2400
Asp Leu Asn Ile Ala Thr Asp Tyr Pro Asp Gly Leu Gly Asn Thr
2405 2410 2415
Arg Arg Ile Lys Gln Ile Ser Val Thr Leu Pro Ala Leu Leu Gly
2420 2425 2430
Pro Tyr Gln Asp Val Arg Ala Ile Leu Ser Tyr Gly Gly Ser Thr
2435 2440 2445
Met Met Pro Arg Gly Cys Lys Ala Ile Ala Ile Ser His Gly Met
2450 2455 2460
Asn Asp Ser Gly Gln Phe Gln Met Asp Phe Asn Asp Ala Lys Tyr
2465 2470 2475
Leu Pro Phe Glu Gly Leu Pro Val Ala Asp Thr Gly Thr Leu Thr
2480 2485 2490
Leu Ser Phe Pro Gly Ile Ser Gly Lys Gln Lys Ser Leu Leu Leu
2495 2500 2505
Ser Leu Ser Asp Ile Ile Leu His Ile Arg Tyr Thr Ile Arg Ser
2510 2515 2520
<210>23
<211>2538
<212>PRT
<213>嗜线虫致病杆菌
<400>23
Met Tyr Ser Thr Ala Val Leu Leu Asn Lys Ile Ser Pro Thr Arg Asp
1 5 10 15
Gly Gln Thr Met Thr Leu Ala Asp Leu Gln Tyr Leu Ser Phe Ser Glu
20 25 30
Leu Arg Lys Ile Phe Asp Asp Gln Leu Ser Trp Gly Glu Ala Arg His
35 40 45
Leu Tyr His Glu Thr Ile Glu Gln Lys Lys Asn Asn Arg Leu Leu Glu
50 55 60
Ala Arg Ile Phe Thr Arg Ala Asn Pro Gln Leu Ser Gly Ala Ile Arg
65 70 75 80
Leu Gly Ile Glu Arg Asp Ser Val Ser Arg Ser Tyr Asp Glu Met Phe
85 90 95
Gly Ala Arg Ser Ser Ser Phe Val Lys Pro Gly Ser Val Ala Ser Met
100 105 110
Phe Ser Pro Ala Gly Tyr Leu Thr Glu Leu Tyr Arg Glu Ala Lys Asp
115 120 125
Leu His Phe Ser Ser Ser Ala Tyr His Leu Asp Asn Arg Arg Pro Asp
130 135 140
Leu Ala Asp Leu Thr Leu Ser Gln Ser Asn Met Asp Thr Glu Ile Ser
145 150 155 160
Thr Leu Thr Leu Ser Asn Glu Leu Leu Leu Glu His Ile Thr Arg Lys
165 170 175
Thr Gly Gly Asp Ser Asp Ala Leu Met Glu Ser Leu Ser Thr Tyr Arg
180 185 190
Gln Ala Ile Asp Thr Pro Tyr His Gln Pro Tyr Glu Thr Ile Arg Gln
195 200 205
Val Ile Met Thr His Asp Ser Thr Leu Ser Ala Leu Ser Arg Asn Pro
210 215 220
Glu Val Met Gly Gln Ala Glu Gly Ala Ser Leu Leu Ala Ile Leu Ala
225 230 235 240
Asn Ile Ser Pro Glu Leu Tyr Asn Ile Leu Thr Glu Glu Ile Thr Glu
245 250 255
Lys Asn Ala Asp Ala Leu Phe Ala Gln Asn Phe Ser Glu Asn Ile Thr
260 265 270
Pro Glu Asn Phe Ala Ser Gln Ser Trp Ile Ala Lys Tyr Tyr Gly Leu
275 280 285
Glu Leu Ser Glu Val Gln Lys Tyr Leu Gly Met Leu Gln Asn Gly Tyr
290 295 300
Ser Asp Ser Thr Ser Ala Tyr Val Asp Asn Ile Ser Thr Gly Leu Val
305 310 315 320
Val Asn Asn Glu Ser Lys Leu Glu Ala Tyr Lys Ile Thr Arg Val Lys
325 330 335
Thr Asp Asp Tyr Asp Lys Asn Ile Asn Tyr Phe Asp Leu Met Tyr Glu
340 345 350
Gly Asn Asn Gln Phe Phe Ile Arg Ala Asn Phe Lys Val Ser Arg Glu
355 360 365
Phe Gly Ala Thr Leu Arg Lys Asn Ala Gly Pro Ser Gly Ile Val Gly
370 375 380
Ser Leu Ser Gly Pro Leu Ile Ala Asn Thr Asn Phe Lys Ser Asn Tyr
385 390 395 400
Leu Ser Asn Ile Ser Asp Ser Glu Tyr Lys Asn Gly Val Lys Ile Tyr
405 410 415
Ala Tyr Arg Tyr Thr Ser Ser Thr Ser Ala Thr Asn Gln Gly Gly Gly
420 425 430
Ile Phe Thr Phe Glu Ser Tyr Pro Leu Thr Ile Phe Ala Leu Lys Leu
435 440 445
Asn Lys Ala Ile Arg Leu Cys Leu Thr Ser Gly Leu Ser Pro Asn Glu
450 455 460
Leu Gln Thr Ile Val Arg Ser Asp Asn Ala Gln Gly Ile Ile Asn Asp
465 470 475 480
Ser Val Leu Thr Lys Val Phe Tyr Thr Leu Phe Tyr Ser His Arg Tyr
485 490 495
Ala Leu Ser Phe Asp Asp Ala Gln Val Leu Asn Gly Ser Val Ile Asn
500 505 510
Gln Tyr Ala Asp Asp Asp Ser Val Ser His Phe Asn Arg Leu Phe Asn
515 520 525
Thr Pro Pro Leu Lys Gly Lys Ile Phe Glu Ala Asp Gly Asn Thr Val
530 535 540
Ser Ile Asp Pro Asp Glu Glu Gln Ser Thr Phe Ala Arg Ser Ala Leu
545 550 555 560
Met Arg Gly Leu Gly Val Asn Ser Gly Glu Leu Tyr Gln Leu Gly Lys
565 570 575
Leu Ala Gly Val Leu Asp Ala Gln Asn Thr Ile Thr Leu Ser Val Phe
580 585 590
Val Ile Ser Ser Leu Tyr Arg Leu Thr Leu Leu Ala Arg Val His Gln
595 600 605
Leu Thr Val Asn Glu Leu Cys Met Leu Tyr Gly Leu Ser Pro Phe Asn
610 615 620
Gly Lys Thr Thr Ala Ser Leu Ser Ser Gly Glu Leu Pro Arg Leu Val
625 630 635 640
Ile Trp Leu Tyr Gln Val Thr Gln Trp Leu Thr Glu Ala Glu Ile Thr
645 650 655
Thr Glu Ala Ile Trp Leu Leu Cys Thr Pro Glu Phe Ser Gly Asn Ile
660 665 670
Ser Pro Glu Ile Ser Asn Leu Leu Asn Asn Leu Arg Pro Ser Ile Ser
675 680 685
Glu Asp Met Ala Gln Ser His Asn Arg Glu Leu Gln Ala Glu Ile Leu
690 695 700
Ala Pro Phe Ile Ala Ala Thr Leu His Leu Ala Ser Pro Asp Met Ala
705 710 715 720
Arg Tyr Ile Leu Leu Trp Thr Asp Asn Leu Arg Pro Gly Gly Leu Asp
725 730 735
Ile Ala Gly Phe Met Thr Leu Val Leu Lys Glu Ser Leu Asn Ala Asn
740 745 750
Glu Thr Thr Gln Leu Val Gln Phe Cys His Val Met Ala Gln Leu Ser
755 760 765
Leu Ser Val Gln Thr Leu Arg Leu Ser Glu Ala Glu Leu Ser Val Leu
770 775 780
Val Ile Ser Gly Phe Ala Val Leu Gly Ala Lys Asn Gln Pro Ala Gly
785 790 795 800
Gln His Asn Ile Asp Thr Leu Phe Ser Leu Tyr Arg Phe His Gln Trp
805 810 815
Ile Asn Gly Leu Gly Asn Pro Gly Ser Asp Thr Leu Asp Met Leu Arg
820 825 830
Gln Gln Thr Leu Thr Ala Asp Arg Leu Ala Ser Val Met Gly Leu Asp
835 840 845
Ile Ser Met Val Thr Gln Ala Met Val Ser Ala Gly Val Asn Gln Leu
850 855 860
Gln Cys Trp Gln Asp Ile Asn Thr Val Leu Gln Trp Ile Asp Val Ala
865 870 875 880
Ser Ala Leu His Thr Met Pro Ser Val Ile Arg Thr Leu Val Asn Ile
885 890 895
Arg Tyr Val Thr Ala Leu Asn Lys Ala Glu Ser Asn Leu Pro Ser Trp
900 905 910
Asp Glu Trp Gln Thr Leu Ala Glu Asn Met Glu Ala Gly Leu Ser Thr
915 920 925
Gln Gln Ala Gln Thr Leu Ala Asp Tyr Thr Ala Glu Arg Leu Ser Ser
930 935 940
Val Leu Cys Asn Trp Phe Leu Ala Asn Ile Gln Pro Glu Gly Val Ser
945 950 955 960
Leu His Ser Arg Asp Asp Leu Tyr Ser Tyr Phe Leu Ile Asp Asn Gln
965 970 975
Val Ser Ser Ala Ile Lys Thr Thr Arg Leu Ala Glu Ala Ile Ala Gly
980 985 990
Ile Gln Leu Tyr Ile Asn Arg Ala Leu Asn Arg Ile Glu Pro Asn Ala
995 1000 1005
Arg Ala Asp Val Ser Thr Arg Gln Phe Phe Thr Asp Trp Thr Val
1010 1015 1020
Asn Asn Arg Tyr Ser Thr Trp Gly Gly Val Ser Arg Leu Val Tyr
1025 1030 1035
Tyr Pro Glu Asn Tyr Ile Asp Pro Thr Gln Arg Ile Gly Gln Thr
1040 1045 1050
Arg Met Met Asp Glu Leu Leu Glu Asn Ile Ser Gln Ser Lys Leu
1055 1060 1065
Ser Arg Asp Thr Val Glu Asp Ala Phe Lys Thr Tyr Leu Thr Arg
1070 1075 1080
Phe Glu Thr Val Ala Asp Leu Lys Val Val Ser Ala Tyr His Asp
1085 1090 1095
Asn Val Asn Ser Asn Thr Gly Leu Thr Trp Phe Val Gly Gln Thr
1100 1105 1110
Arg Glu Asn Leu Pro Glu Tyr Tyr Trp Arg Asn Val Asp Ile Ser
1115 1120 1125
Arg Met Gln Ala Gly Glu Leu Ala Ala Asn Ala Trp Lys Glu Trp
1130 1135 1140
Thr Lys Ile Asp Thr Ala Val Asn Pro Tyr Lys Asp AlaIle Arg
1145 1150 1155
Pro Val Ile Phe Arg Glu Arg Leu His Leu Ile Trp Val Glu Lys
1160 1165 1170
Glu Glu Val Ala Lys Asn Gly Thr Asp Pro Val Glu Thr Tyr Asp
1175 1180 1185
Arg Phe Thr Leu Lys Leu Ala Phe Leu Arg His Asp Gly Ser Trp
1190 1195 1200
Ser Ala Pro Trp Ser Tyr Asp Ile Thr Thr Gln Val Glu Ala Val
1205 1210 1215
Thr Asp Lys Lys Pro Asp Thr Glu Arg Leu Ala Leu Ala Ala Ser
1220 1225 1230
Gly Phe Gln Gly Glu Asp Thr Leu Leu Val Phe Val Tyr Lys Thr
1235 1240 1245
Gly Lys Ser Tyr Ser Asp Phe Gly Gly Ser Asn Lys Asn Val Ala
1250 1255 1260
Gly Met Thr Ile Tyr Gly Asp Gly Ser Phe Lys Lys Met Glu Asn
1265 1270 1275
Thr Ala Leu Ser Arg Tyr Ser Gln Leu Lys Asn Thr Phe Asp Ile
1280 1285 1290
Ile His Thr Gln Gly Asn Asp Leu Val Arg Lys Ala Ser Tyr Arg
1295 1300 1305
Phe Ala Gln Asp Phe Glu Val Pro Ala Ser Leu Asn Met Gly Ser
1310 1315 1320
Ala Ile Gly Asp Asp Ser Leu Thr Val Met Glu Asn Gly Asn Ile
1325 1330 1335
Pro Gln Ile Thr Ser Lys Tyr Ser Ser Asp Asn Leu Ala Ile Thr
1340 1345 1350
Leu His Asn Ala Ala Phe Thr Val Arg Tyr Asp Gly Ser Gly Asn
1355 1360 1365
Val Ile Arg Asn Lys Gln Ile Ser Ala Met Lys Leu Thr Gly Val
1370 1375 1380
Asp Gly Lys Ser Gln Tyr Gly Asn Ala Phe Ile Ile Ala Asn Thr
1385 1390 1395
Val Lys His Tyr Gly Gly Tyr Ser Asp Leu Gly Gly Pro Ile Thr
1400 1405 1410
Val Tyr Asn Lys Thr Lys Asn Tyr Ile Ala Ser Val Gln Gly His
1415 1420 1425
Leu Met Asn Ala Asp Tyr Thr Arg Arg Leu Ile Leu Thr Pro Val
1430 1435 1440
Glu Asn Asn Tyr Tyr Ala Arg Leu Phe Glu Phe Pro Phe Ser Pro
1445 1450 1455
Asn Thr Ile Leu Asn Thr Val Phe Thr Val Gly Ser Asn Lys Thr
1460 1465 1470
Ser Asp Phe Lys Lys Cys Ser Tyr Ala Val Asp Gly Asn Asn Ser
1475 1480 1485
Gln Gly Phe Gln Ile Phe Ser Ser Tyr Gln Ser Ser Gly Trp Leu
1490 1495 1500
Asp Ile Asp Thr Gly Ile Asn Asn Thr Asp Ile Lys Ile Thr Val
1505 1510 1515
Met Ala Gly Ser Lys Thr His Thr Phe Thr Ala Ser Asp His Ile
1520 1525 1530
Ala Ser Leu Pro Ala Asn Ser Phe Asp Ala Met Pro Tyr Thr Phe
1535 1540 1545
Lys Pro Leu Glu Ile Asp Ala Ser Ser Leu Ala Phe Thr Asn Asn
1550 1555 1560
Ile Ala Pro Leu Asp Ile Val Phe Glu Thr Lys Ala Lys Asp Gly
1565 1570 1575
Arg Val Leu Gly Lys Ile Lys Gln Thr Leu Ser Val Lys Arg Val
1580 1585 1590
Asn Tyr Asn Pro Glu Asp Ile Leu Phe Leu Arg Glu Thr His Ser
1595 1600 1605
Gly Ala Gln Tyr Met Gln Leu Gly Val Tyr Arg Ile Arg Leu Asn
1610 1615 1620
Thr Leu Leu Ala Ser Gln Leu Val Ser Arg Ala Asn Thr Gly Ile
1625 1630 1635
Asp Thr Ile Leu Thr Met Glu Thr Gln Arg Leu Pro Glu Pro Pro
1640 1645 1650
Leu Gly Glu Gly Phe Phe Ala Asn Phe Val Leu Pro Lys Tyr Asp
1655 1660 1665
Pro Ala Glu His Gly Asp Glu Arg Trp Phe Lys Ile His Ile Gly
1670 1675 1680
Asn Val Gly Gly Asn Thr Gly Arg Gln Pro Tyr Tyr Ser Gly Met
1685 1690 1695
Leu Ser Asp Thr Ser Glu Thr Ser Met Thr Leu Phe Val Pro Tyr
1700 1705 1710
Ala Glu Gly Tyr Tyr Met His Glu Gly Val Arg Leu Gly Val Gly
1715 1720 1725
Tyr Gln Lys Ile Thr Tyr Asp Asn Thr Trp Glu Ser Ala Phe Phe
1730 1735 1740
Tyr Phe Asp Glu Thr Lys Gln Gln Phe Val Leu Ile Asn Asp Ala
1745 1750 1755
Asp His Asp Ser Gly Met Thr Gln Gln Gly Ile Val Lys Asn Ile
1760 1765 1770
Lys Lys Tyr Lys Gly Phe Leu Asn Val Ser Ile Ala Thr Gly Tyr
1775 1780 1785
Ser Ala Pro Met Asp Phe Asn Ser Ala Ser Ala Leu Tyr Tyr Trp
1790 1795 1800
Glu Leu Phe Tyr Tyr Thr Pro Met Met Cys Phe Gln Arg Leu Leu
1805 1810 1815
Gln Glu Lys Gln Phe Asp Glu Ala Thr Gln Trp Ile Asn Tyr Val
1820 1825 1830
Tyr Asn Pro Ala Gly Tyr Ile Val Asn Gly Glu Ile Ala Pro Trp
1835 1840 1845
Ile Trp Asn Cys Arg Pro Leu Glu Glu Thr Thr Ser Trp Asn Ala
1850 1855 1860
Asn Pro Leu Asp Ala Ile Asp Pro Asp Ala Val Ala Gln Asn Asp
1865 1870 1875
Pro Met His Tyr Lys Ile Ala Thr Phe Met Arg Leu Leu Asp Gln
1880 1885 1890
Leu Ile Leu Arg Gly Asp Met Ala Tyr Arg Glu Leu Thr Arg Asp
1895 1900 1905
Ala Leu Asn Glu Ala Lys Met Trp Tyr Val Arg Thr Leu Glu Leu
1910 1915 1920
Leu Gly Asp Glu Pro Glu Asp Tyr Gly Ser Gln Gln Trp Ala Ala
1925 1930 1935
Pro Ser Leu Ser Gly Ala Ala Ser Gln Thr Val Gln Ala Ala Tyr
1940 1945 1950
Gln Gln Asp Leu Thr Met Leu Gly Arg Gly Gly Val Ser Lys Asn
1955 1960 1965
Leu Arg Thr Ala Asn Ser Leu Val Gly Leu Phe Leu Pro Glu Tyr
1970 1975 1980
Asn Pro Ala Leu Thr Asp Tyr Trp Gln Thr Leu Arg Leu Arg Leu
1985 1990 1995
Phe Asn Leu Arg His Asn Leu Ser Ile Asp Gly Gln Pro Leu Ser
2000 2005 2010
Leu Ala Ile Tyr Ala Glu Pro Thr Asp Pro Lys Ala Leu Leu Thr
2015 2020 2025
Ser Met Val Gln Ala Ser Gln Gly Gly Ser Ala Val Leu Pro Gly
2030 2035 2040
Thr Leu Ser Leu Tyr Arg Phe Pro Val Met Leu Glu Arg Thr Arg
2045 2050 2055
Asn Leu Val Ala Gln Leu Thr Gln Phe Gly Thr Ser Leu Leu Ser
2060 2065 2070
Met Ala Glu His Asp Asp Ala Asp Glu Leu Thr Thr Leu Leu Leu
2075 2080 2085
Gln Gln Gly Met Glu Leu Ala Thr Gln Ser Ile Arg Ile Gln Gln
2090 2095 2100
Arg Thr Val Asp Glu Val Asp Ala Asp Ile Ala Val Leu Ala Glu
2105 2110 2115
Ser Arg Arg Ser Ala Gln Asn Arg Leu Glu Lys Tyr Gln Gln Leu
2120 2125 2130
Tyr Asp Glu Asp Ile Asn His Gly Glu Gln Arg Ala Met Ser Leu
2135 2140 2145
Leu Asp Ala Ala Ala Gly Gln Ser Leu Ala Gly Gln Val Leu Ser
2150 2155 2160
Ile Ala Glu Gly Val Ala Asp Leu Val Pro Asn Val Phe Gly Leu
2165 2170 2175
Ala Cys Gly Gly Ser Arg Trp Gly Ala Ala Leu Arg Ala Ser Ala
2180 2185 2190
Ser Val Met Ser Leu Ser Ala Thr Ala Ser Gln Tyr Ser Ala Asp
2195 2200 2205
Lys Ile Ser Arg Ser Glu Ala Tyr Arg Arg Arg Arg Gln Glu Trp
2210 2215 2220
Glu Ile Gln Arg Asp Asn Ala Asp Gly Glu Val Lys Gln Met Asp
2225 2230 2235
Ala Gln Leu Glu Ser Leu Lys Ile Arg Arg Glu Ala Ala Gln Met
2240 2245 2250
Gln Val Glu Tyr Gln Glu Thr Gln Gln Ala His Thr Gln Ala Gln
2255 2260 2265
Leu Glu Leu Leu Gln Arg Lys Phe Thr Asn Lys Ala Leu Tyr Ser
2270 2275 2280
Trp Met Arg Gly Lys Leu Ser Ala Ile Tyr Tyr Gln Phe Phe Asp
2285 2290 2295
Leu Thr Gln Ser Phe Cys Leu Met Ala Gln Glu Ala Leu Arg Arg
2300 2305 2310
Glu Leu Thr Asp Asn Gly Val Thr Phe Ile Arg Gly Gly Ala Trp
2315 2320 2325
Asn Gly Thr Thr Ala Gly Leu Met Ala Gly Glu Thr Leu Leu Leu
2330 2335 2340
Asn Leu Ala Glu Met Glu Lys Val Trp Leu Glu Arg Asp Glu Arg
2345 2350 2355
Ala Leu Glu Val Thr Arg Thr Val Ser Leu Ala Gln Phe Tyr Gln
2360 2365 2370
Ala Leu Ser Ser Asp Asn Phe Asn Leu Thr Glu Lys Leu Thr Gln
2375 2380 2385
Phe Leu Arg Glu Gly Lys Gly Asn Val Gly Ala Ser Gly Asn Glu
2390 2395 2400
Leu Lys Leu Ser Asn Arg Gln Ile Glu Ala Ser Val Arg Leu Ser
2405 2410 2415
Asp Leu Lys Ile Phe Ser Asp Tyr Pro Glu Ser Leu Gly Asn Thr
2420 2425 2430
Arg Gln Leu Lys Gln Val Ser Val Thr Leu Pro Ala Leu Val Gly
2435 2440 2445
Pro Tyr Glu Asp Ile Arg Ala Val Leu Asn Tyr Gly Gly Ser Ile
2450 2455 2460
Val Met Pro Arg Gly Cys Ser Ala Ile Ala Leu Ser His Gly Val
2465 2470 2475
Asn Asp Ser Gly Gln Phe Met Leu Asp Phe Asn Asp Ser Arg Tyr
2480 2485 2490
Leu Pro Phe Glu Gly Ile Ser Val Asn Asp Ser Gly Ser Leu Thr
2495 2500 2505
Leu Ser Phe Pro Asp Ala Thr Asp Arg Gln Lys Ala Leu Leu Glu
2510 2515 2520
Ser Leu Ser Asp Ile Ile Leu His Ile Arg Tyr Thr Ile Arg Ser
2525 2530 2535
<210>24
<211>2504
<212>PRT
<213>发光光杆状菌
<400>24
Met Gln Asn Ser Leu Ser Ser Thr Ile Asp Thr Ile Cys Gln Lys Leu
1 5 10 15
Gln Leu Thr Cys Pro Ala Glu Ile Ala Leu Tyr Pro Phe Asp Thr Phe
20 25 30
Arg Glu Lys Thr Arg Gly Met Val Asn Trp Gly Glu Ala Lys Arg Ile
35 40 45
Tyr Glu Ile Ala Gln Ala Glu Gln Asp Arg Asn Leu Leu His Glu Lys
50 55 60
Arg Ile Phe Ala Tyr Ala Asn Pro Leu Leu Lys Asn Ala Val Arg Leu
65 70 75 80
Gly Thr Arg Gln Met Leu Gly Phe Ile Gln Gly Tyr Ser Asp Leu Phe
85 90 95
Gly Asn Arg Ala Asp Asn Tyr Ala Ala Pro Gly Ser Val Ala Ser Met
100 105 110
Phe Ser Pro Ala Ala Tyr Leu Thr Glu Leu Tyr Arg Glu Ala Lys Asn
115 120 125
Leu His Asp Ser Ser Ser Ile Tyr Tyr Leu Asp Lys Arg Arg Pro Asp
130 135 140
Leu Ala Ser Leu Met Leu Ser Gln Lys Asn Met Asp Glu Glu Ile Ser
145 150 155 160
Thr Leu Ala Leu Ser Asn Glu Leu Cys Leu Ala Gly Ile Glu Thr Lys
165 170 175
Thr Gly Lys Ser Gln Asp Glu Val Met Asp Met Leu Ser Thr Tyr Arg
180 185 190
Leu Ser Gly Glu Thr Pro Tyr His His Ala Tyr Glu Thr Val Arg Glu
195 200 205
Ile Val His Glu Arg Asp Pro Gly Phe Arg His Leu Ser Gln Ala Pro
210 215 220
Ile Val Ala Ala Lys Leu Asp Pro Val Thr Leu Leu Gly Ile Ser Ser
225 230 235 240
His Ile Ser Pro Glu Leu Tyr Asn Leu Leu Ile Glu Glu Ile Pro Glu
245 250 255
Lys Asp Glu Ala Ala Leu Asp Thr Leu Tyr Lys Thr Asn Phe Gly Asp
260 265 270
Ile Thr Thr Ala Gln Leu Met Ser Pro Ser Tyr Leu Ala Arg Tyr Tyr
275 280 285
Gly Val Ser Pro Glu Asp Ile Ala Tyr Val Thr Thr Ser Leu Ser His
290 295 300
Val Gly Tyr Ser Ser Asp Ile Leu Val Ile Pro Leu Val Asp Gly Val
305 310 315 320
Gly Lys Met Glu Val Val Arg Val Thr Arg Thr Pro Ser Asp Asn Tyr
325 330 335
Thr Ser Gln Thr Asn Tyr Ile Glu Leu Tyr Pro Gln Gly Gly Asp Asn
340 345 350
Tyr Leu Ile Lys Tyr Asn Leu Ser Asn Ser Phe Gly Leu Asp Asp Phe
355 360 365
Tyr Leu Gln Tyr Lys Asp Gly Ser Ala Asp Trp Thr Glu Ile Ala His
370 375 380
Asn Pro Tyr Pro Asp Met Val Ile Asn Gln Lys Tyr Glu Ser Gln Ala
385 390 395 400
Thr Ile Lys Arg Ser Asp Ser Asp Asn Ile Leu SerIle Gly Leu Gln
405 410 415
Arg Trp His Ser Gly Ser Tyr Asn Phe Ala Ala Ala Asn Phe Lys Ile
420 425 430
Asp Gln Tyr Ser Pro Lys Ala Phe Leu Leu Lys Met Asn Lys Ala Ile
435 440 445
Arg Leu Leu Lys Ala Thr Gly Leu Ser Phe Ala Thr Leu Glu Arg Ile
450 455 460
Val Asp Ser Val Asn Ser Thr Lys Ser Ile Thr Val Glu Val Leu Asn
465 470 475 480
Lys Val Tyr Arg Val Lys Phe Tyr Ile Asp Arg Tyr Gly Ile Ser Glu
485 490 495
Glu Thr Ala Ala Ile Leu Ala Asn Ile Asn Ile Ser Gln Gln Ala Val
500 505 510
Gly Asn Gln Leu Ser Gln Phe Glu Gln Leu Phe Asn His Pro Pro Leu
515 520 525
Asn Gly Ile Arg Tyr Glu Ile Ser Glu Asp Asn Ser Lys His Leu Pro
530 535 540
Asn Pro Asp Leu Asn Leu Lys Pro Asp Ser Thr Gly Asp Asp Gln Arg
545 550 555 560
Lys Ala Val Leu Lys Arg Ala Phe Gln Val Asn Ala Ser Glu Leu Tyr
565 570 575
Gln Met Leu Leu Ile Thr Asp Arg Lys Glu Asp Gly Val Ile Lys Asn
580 585 590
Asn Leu Glu Asn Leu Ser Asp Leu Tyr Leu Val Ser Leu Leu Ala Gln
595 600 605
Ile His Asn Leu Thr Ile Ala Glu Leu Asn Ile Leu Leu Val Ile Cys
610 615 620
Gly Tyr Gly Asp Thr Asn Ile Tyr Gln Ile Thr Asp Asp Asn Leu Ala
625 630 635 640
Lys Ile Val Glu Thr Leu Leu Trp Ile Thr Gln Trp Leu Lys Thr Gln
645 650 655
Lys Trp Thr Val Thr Asp Leu Phe Leu Met Thr Thr Ala Thr Tyr Ser
660 665 670
Thr Thr Leu Thr Pro Glu Ile Ser Asn Leu Thr Ala Thr Leu Ser Ser
675 680 685
Thr Leu His Gly Lys Glu Ser Leu Ile Gly Glu Asp Leu Lys Arg Ala
690 695 700
Met Ala Pro Cys Phe Thr Ser Ala Leu His Leu Thr Ser Gln Glu Val
705 710 715 720
Ala Tyr Asp Leu Leu Leu Trp Ile Asp Gln Ile Gln Pro Ala Gln Ile
725 730 735
Thr Val Asp Gly Phe Trp Glu Glu Val Gln Thr Thr Pro Thr Ser Leu
740 745 750
Lys Val Ile Thr Phe Ala Gln Val Leu Ala Gln Leu Ser Leu Ile Tyr
755 760 765
Arg Arg Ile Gly Leu Ser Glu Thr Glu Leu Ser Leu Ile Val Thr Gln
770 775 780
Ser Ser Leu Leu Val Ala Gly Lys Ser Ile Leu Asp His Gly Leu Leu
785 790 795 800
Thr Leu Met Ala Leu Glu Gly Phe His Thr Trp Val Asn Gly Leu Gly
805 810 815
Gln His Ala Ser Leu Ile Leu Ala Ala Leu Lys Asp Gly Ala Leu Thr
820 825 830
Val Thr Asp Val Ala Gln Ala Met Asn Lys Glu Glu Ser Leu Leu Gln
835 840 845
Met Ala Ala Asn Gln Val Glu Lys Asp Leu Thr Lys Leu Thr Ser Trp
850 855 860
Thr Gln Ile Asp Ala Ile Leu Gln Trp Leu Gln Met Ser Ser Ala Leu
865 870 875 880
Ala Val Ser Pro Leu Asp Leu Ala Gly Met Met Ala Leu Lys Tyr Gly
885 890 895
Ile Asp His Asn Tyr Ala Ala Trp Gln Ala Ala Ala Ala Ala Leu Met
900 905 910
Ala Asp His Ala Asn Gln Ala Gln Lys Lys Leu Asp Glu Thr Phe Ser
915 920 925
Lys Ala Leu Cys Asn Tyr Tyr Ile Asn Ala Val Val Asp Ser Ala Ala
930 935 940
Gly Val Arg Asp Arg Asn Gly Leu Tyr Thr Tyr Leu Leu Ile Asp Asn
945 950 955 960
Gln Val Ser Ala Asp Val Ile Thr Ser Arg Ile Ala Glu Ala Ile Ala
965 970 975
Gly Ile Gln Leu Tyr Val Asn Arg Ala Leu Asn Arg Asp Glu Gly Gln
980 985 990
Leu Ala Ser Asp Val Ser Thr Arg Gln Phe Phe Thr Asp Trp Glu Arg
995 1000 1005
Tyr Asn Lys Arg Tyr Ser Thr Trp Ala Gly Val Ser Glu Leu Val
1010 1015 1020
Tyr Tyr Pro Glu Asn Tyr Val Asp Pro Thr Gln Arg Ile Gly Gln
1025 1030 1035
Thr Lys Met Met Asp Ala Leu Leu Gln Ser Ile Asn Gln Ser Gln
1040 1045 1050
Leu Asn Ala Asp Thr Val Glu Asp Ala Phe Lys Thr Tyr Leu Thr
1055 1060 1065
Ser Phe Glu Gln Val Ala Asn Leu Lys Val Ile Ser Ala Tyr His
1070 1075 1080
Asp Asn Val Asn Val Asp Gln Gly Leu Thr Tyr Phe Ile Gly Ile
1085 1090 1095
Asp Gln Ala Ala Pro Gly Thr Tyr Tyr Trp Arg Ser Val Asp His
1100 1105 1110
Ser Lys Cys Glu Asn Gly Lys Phe Ala Ala Asn Ala Trp Gly Glu
1115 1120 1125
Trp Asn Lys Ile Thr Cys Ala Val Asn Pro Trp Lys Asn Ile Ile
1130 1135 1140
Arg Pro Val Val Tyr Met Ser Arg Leu Tyr Leu Leu Trp Leu Glu
1145 1150 1155
Gln Gln Ser Lys Lys Ser Asp Asp Gly Lys Thr Thr Ile Tyr Gln
1160 1165 1170
Tyr Asn Leu Lys Leu Ala His Ile Arg Tyr Asp Gly Ser Trp Asn
1175 1180 1185
Thr Pro Phe Thr Phe Asp Val Thr Glu Lys Val Lys Asn Tyr Thr
1190 1195 1200
Ser Ser Thr Asp Ala Ala Glu Ser Leu Gly Leu Tyr Cys Thr Gly
1205 1210 1215
Tyr Gln Gly Glu Asp Thr Leu Leu Val Met Phe Tyr Ser Met Gln
1220 1225 1230
Ser Ser Tyr Ser Ser Tyr Thr Asp Asn Asn Ala Pro Val Thr Gly
1235 1240 1245
Leu Tyr Ile Phe Ala Asp Met Ser Ser Asp Asn Met Thr Asn Ala
1250 1255 1260
Gln Ala Thr Asn Tyr Trp Asn Asn Ser Tyr Pro Gln Phe Asp Thr
1265 1270 1275
Val Met Ala Asp Pro Asp Ser Asp Asn Lys Lys Val Ile Thr Arg
1280 1285 1290
Arg Val Asn Asn Arg Tyr Ala Glu Asp Tyr Glu Ile Pro Ser Ser
1295 1300 1305
Val Thr Ser Asn Ser Asn Tyr Ser Trp Gly Asp His Ser Leu Thr
1310 1315 1320
Met Leu Tyr Gly Gly Ser Val Pro Asn Ile Thr Phe Glu Ser Ala
1325 1330 1335
Ala Glu Asp Leu Arg Leu Ser Thr Asn Met Ala Leu Ser Ile Ile
1340 1345 1350
His Asn Gly Tyr Ala Gly Thr Arg Arg Ile Gln Cys Asn Leu Met
1355 1360 1365
Lys Gln Tyr Ala Ser Leu Gly Asp Lys Phe Ile Ile Tyr Asp Ser
1370 1375 1380
Ser Phe Asp Asp Ala Asn Arg Phe Asn Leu Val Pro Leu Phe Lys
1385 1390 1395
Phe Gly Lys Asp Glu Asn Ser Asp Asp Ser Ile Cys Ile Tyr Asn
1400 1405 1410
Glu Asn Pro Ser Ser Glu Asp Lys Lys Trp Tyr Phe Ser Ser Lys
1415 1420 1425
Asp Asp Asn Lys Thr Ala Asp Tyr Asn Gly Gly Thr Gln Cys Ile
1430 1435 1440
Asp Ala Gly Thr Ser Asn Lys Asp Phe Tyr Tyr Asn Leu Gln Glu
1445 1450 1455
Ile Glu Val Ile Ser Val Thr Gly Gly Tyr Trp Ser Ser Tyr Lys
1460 1465 1470
Ile Ser Asn Pro Ile Asn Ile Asn Thr Gly Ile Asp Ser Ala Lys
1475 1480 1485
Val Lys Val Thr Val Lys Ala Gly Gly Asp Asp Gln Ile Phe Thr
1490 1495 1500
Ala Asp Asn Ser Thr Tyr Val Pro Gln Gln Pro Ala Pro Ser Phe
1505 1510 1515
Glu Glu Met Ile Tyr Gln Phe Asn Asn Leu Thr Ile Asp Cys Lys
1520 1525 1530
Asn Leu Asn Phe Ile Asp Asn Gln Ala His Ile Glu Ile Asp Phe
1535 1540 1545
Thr Ala Thr Ala Gln Asp Gly Arg Phe Leu Gly Ala Glu Thr Phe
1550 1555 1560
Ile Ile Pro Val Thr Lys Lys Val Leu Gly Thr Glu Asn Val Ile
1565 1570 1575
Ala Leu Tyr Ser Glu Asn Asn Gly Val Gln Tyr Met Gln Ile Gly
1580 1585 1590
Ala Tyr Arg Thr Arg Leu Asn Thr Leu Phe Ala Gln Gln Leu Val
1595 1600 1605
Ser Arg Ala Asn Arg Gly Ile Asp Ala Val Leu Ser Met Glu Thr
1610 1615 1620
Gln Asn Ile Gln Glu Pro Gln Leu Gly Ala Gly Thr Tyr Val Gln
1625 1630 1635
Leu Val Leu Asp Lys Tyr Asp Glu Ser Ile His Gly Thr Asn Lys
1640 1645 1650
Ser Phe Ala Ile Glu Tyr Val Asp Ile Phe Lys Glu Asn Asp Ser
1655 1660 1665
Phe Val Ile Tyr Gln Gly Glu Leu Ser Glu Thr Ser Gln Thr Val
1670 1675 1680
Val Lys Val Phe Leu Ser Tyr Phe Ile Glu Ala Thr Gly Asn Lys
1685 1690 1695
Asn His Leu Trp Val Arg Ala Lys Tyr Gln Lys Glu Thr Thr Asp
1700 1705 1710
Lys Ile Leu Phe Asp Arg Thr Asp Glu Lys Asp Pro His Gly Trp
1715 1720 1725
Phe Leu Ser Asp Asp His Lys Thr Phe Ser Gly Leu Ser Ser Ala
1730 1735 1740
Gln Ala Leu Lys Asn Asp Ser Glu Pro Met Asp Phe Ser Gly Ala
1745 1750 1755
Asn Ala Leu Tyr Phe Trp Glu Leu Phe Tyr Tyr Thr Pro Met Met
1760 1765 1770
Met Ala His Arg Leu Leu Gln Glu Gln Asn Phe Asp Ala Ala Asn
1775 1780 1785
His Trp Phe Arg Tyr Val Trp Ser Pro Ser Gly Tyr Ile Val Asp
1790 1795 1800
Gly Lys Ile Ala Ile Tyr His Trp Asn Val Arg Pro Leu Glu Glu
1805 1810 1815
Asp Thr Ser Trp Asn Ala Gln Gln Leu Asp Ser Thr Asp Pro Asp
1820 1825 1830
Ala Val Ala Gln Asp Asp Pro Met His Tyr Lys Val Ala Thr Phe
1835 1840 1845
Met Ala Thr Leu Asp Leu Leu Met Ala Arg Gly Asp Ala Ala Tyr
1850 1855 1860
Arg Gln Leu Glu Arg Asp Thr Leu Ala Glu Ala Lys Met Trp Tyr
1865 1870 1875
Thr Gln Ala Leu Asn Leu Leu Gly Asp Glu Pro Gln Val Met Leu
1880 1885 1890
Ser Thr Thr Trp Ala Asn Pro Thr Leu Gly Asn Ala Ala Ser Lys
1895 1900 1905
Thr Thr Gln Gln Val Arg Gln Gln Val Leu Thr Gln Leu Arg Leu
1910 1915 1920
Asn Ser Arg Val Lys Thr Pro Leu Leu Gly Thr Ala Asn Ser Leu
1925 1930 1935
Thr Ala Leu Phe Leu Pro Gln Glu Asn Ser Lys Leu Lys Gly Tyr
1940 1945 1950
Trp Arg Thr Leu Ala Gln Arg Met Phe Asn Leu Arg His Asn Leu
1955 1960 1965
Ser Ile Asp Gly Gln Pro Leu Ser Leu Pro Leu Tyr Ala Lys Pro
1970 1975 1980
Ala Asp Pro Lys Ala Leu Leu Ser Ala Ala Val Ser Ala Ser Gln
1985 1990 1995
Gly Gly Ala Asp Leu Pro Lys Ala Pro Leu Thr Ile His Arg Phe
2000 2005 2010
Pro Gln Met Leu Glu Gly Ala Arg Gly Leu Val Asn Gln Leu Ile
2015 2020 2025
Gln Phe Gly Ser Ser Leu Leu Gly Tyr Ser Glu Arg Gln Asp Ala
2030 2035 2040
Glu Ala Met Ser Gln Leu Leu Gln Thr Gln Ala Ser Glu Leu Ile
2045 2050 2055
Leu Thr Ser Ile Arg Met Gln Asp Asn Gln Leu Ala Glu Leu Asp
2060 2065 2070
Ser Glu Lys Thr Ala Leu Gln Val Ser Leu Ala Gly Val Gln Gln
2075 2080 2085
Arg Phe Asp Ser Tyr Ser Gln Leu Tyr Glu Glu Asn Ile Asn Ala
2090 2095 2100
Gly Glu Gln Arg Ala Leu Ala Leu Arg Ser Glu Ser Ala Ile Glu
2105 2110 2115
Ser Gln Gly Ala Gln Ile Ser Arg Met Ala Gly Ala Gly Val Asp
2120 2125 2130
Met Ala Pro Asn Ile Phe Gly Leu Ala Asp Gly Gly Met His Tyr
2135 2140 2145
Gly Ala Ile Ala Tyr Ala Ile Ala Asp Gly Ile Glu Leu Ser Ala
2150 2155 2160
Ser Ala Lys Met Val Asp Ala Glu Lys Val Ala Gln Ser Glu Ile
2165 2170 2175
Tyr Arg Arg Arg Arg Gln Glu Trp Lys Ile Gln Arg Asp Asn Ala
2180 2185 2190
Gln Ala Glu Ile Asn Gln Leu Asn Ala Gln Leu Glu Ser Leu Ser
2195 2200 2205
Ile Arg Arg Glu Ala Ala Glu Met Gln Lys Glu Tyr Leu Lys Thr
2210 2215 2220
Gln Gln Ala Gln Ala Gln Ala Gln Leu Thr Phe Leu Arg Ser Lys
2225 2230 2235
Phe Ser Asn Gln Ala Leu Tyr Ser Trp Leu Arg Gly Arg Leu Ser
2240 2245 2250
Gly Ile Tyr Phe Gln Phe Tyr Asp Leu Ala Val Ser Arg Cys Leu
2255 2260 2265
Met Ala Glu Gln Ser Tyr Gln Trp Glu Ala Asn Asp Asn Ser Ile
2270 2275 2280
Ser Phe Val Lys Pro Gly Ala Trp Gln Gly Thr Tyr Ala Gly Leu
2285 2290 2295
Leu Cys Gly Glu Ala Leu Ile Gln Asn Leu Ala Gln Met Glu Glu
2300 2305 2310
Ala Tyr Leu Lys Trp Glu Ser Arg Ala Leu Glu Val Glu Arg Thr
2315 2320 2325
Val Ser Leu Ala Val Val Tyr Asp Ser Leu Glu Gly Asn Asp Arg
2330 2335 2340
Phe Asn Leu Ala Glu Gln Ile Pro Ala Leu Leu Asp Lys Gly Glu
2345 2350 2355
Gly Thr Ala Gly Thr Lys Glu Asn Gly Leu Ser Leu Ala Asn Ala
2360 2365 2370
Ile Leu Ser Ala Ser Val Lys Leu Ser Asp Leu Lys Leu Gly Thr
2375 2380 2385
Asp Tyr Pro Asp Ser Ile Val Gly Ser Asn Lys Val Arg Arg Ile
2390 2395 2400
Lys Gln Ile Ser Val Ser Leu Pro Ala Leu Val Gly Pro Tyr Gln
2405 2410 2415
Asp Val Gln Ala Met Leu Ser Tyr Gly Gly Ser Thr Gln Leu Pro
2420 2425 2430
Lys Gly Cys Ser Ala Leu Ala Val Ser His Gly Thr Asn Asp Ser
2435 2440 2445
Gly Gln Phe Gln Leu Asp Phe Asn Asp Gly Lys Tyr Leu Pro Phe
2450 2455 2460
Glu Gly Ile Ala Leu Asp Asp Gln Gly Thr Leu Asn Leu Gln Phe
2465 2470 2475
Pro Asn Ala Thr Asp Lys Gln Lys Ala Ile Leu Gln Thr Met Ser
2480 2485 2490
Asp Ile Ile Leu His Ile Arg Tyr Thr Ile Arg
2495 2500
<210>25
<211>2516
<212>PRT
<213>发光光杆状菌
<400>25
Met Asn Glu Ser Val Lys Glu Ile Pro Asp Val Leu Lys Ser Gln Cys
1 5 10 15
Gly Phe Asn Cys Leu Thr Asp Ile Ser His Ser Ser Phe Asn Glu Phe
20 25 30
Arg Gln Gln Val Ser Glu His Leu Ser Trp Ser Glu Thr His Asp Leu
35 40 45
Tyr His Asp Ala Gln Gln Ala Gln Lys Asp Asn Arg Leu Tyr Glu Ala
50 55 60
Arg Ile Leu Lys Arg Ala Asn Pro Gln Leu Gln Asn Ala Val His Leu
65 70 75 80
Ala Ile Leu Ala Pro Asn Ala Glu Leu Ile Gly Tyr Asn Asn Gln Phe
85 90 95
Ser Gly Arg Ala Ser Gln Tyr Val Ala Pro Gly Thr Val Ser Ser Met
100 105 110
Phe Ser Pro Ala Ala Tyr Leu Thr Glu Leu Tyr Arg Glu Ala Arg Asn
115 120 125
Leu His Ala Ser Asp Ser Val Tyr Tyr Leu Asp Thr Arg Arg Pro Asp
130 135 140
Leu Lys Ser Met Ala Leu Ser Gln Gln Asn Met Asp Ile Glu Leu Ser
145 150 155 160
Thr Leu Ser Leu Ser Asn Glu Leu Leu Leu Glu Ser Ile Lys Thr Glu
165 170 175
Ser Lys Leu Glu Asn Tyr Thr Lys Val Met Glu Met Leu Ser Thr Phe
180 185 190
Arg Pro Ser Gly Ala Thr Pro Tyr His Asp Ala Tyr Glu Asn Val Arg
195 200 205
Glu Val Ile Gln Leu Gln Asp Pro Gly Leu Glu Gln Leu Asn Ala Ser
210 215 220
Pro Ala Ile Ala Gly Leu Met His Gln Ala Ser Leu Leu Gly Ile Asn
225 230 235 240
Ala Ser Ile Ser Pro Glu Leu Phe Asn Ile Leu Thr Glu Glu Ile Thr
245 250 255
Glu Gly Asn Ala Glu Glu Leu Tyr Lys Lys Asn Phe Gly Asn Ile Glu
260 265 270
Pro Ala Ser Leu Ala Met Pro Glu Tyr Leu Lys Arg Tyr Tyr Asn Leu
275 280 285
Ser Asp Glu Glu Leu Ser Gln Phe Ile Gly Lys Ala Ser Asn Phe Gly
290 295 300
Gln Gln Glu Tyr Ser Asn Asn Gln Leu Ile Thr Pro Val Val Asn Ser
305 310 315 320
Ser Asp Gly Thr Val Lys Val Tyr Arg Ile Thr Arg Glu Tyr Thr Thr
325 330 335
Asn Ala Tyr Gln Met Asp Val Glu Leu Phe Pro Phe Gly Gly Glu Asn
340 345 350
Tyr Arg Leu Asp Tyr Lys Phe Lys Asn Phe Tyr Asn Ala Ser Tyr Leu
355 360 365
Ser Ile Lys Leu Asn Asp Lys Arg Glu Leu Val Arg Thr Glu Gly Ala
370 375 380
Pro Gln Val Asn Ile Glu Tyr Ser Ala Asn Ile Thr Leu Asn Thr Ala
385 390 395 400
Asp Ile Ser Gln Pro Phe Glu Ile Gly Leu Thr Arg Val Leu Pro Ser
405 410 415
Gly Ser Trp Ala Tyr Ala Ala Ala Lys Phe Thr Val Glu Glu Tyr Asn
420 425 430
Gln Tyr Ser Phe Leu Leu Lys Leu Asn Lys Ala Ile Arg Leu Ser Arg
435 440 445
Ala Thr Glu Leu Ser Pro Thr Ile Leu Glu Gly Ile Val Arg Ser Val
450 455 460
Asn Leu Gln Leu Asp Ile Asn Thr Asp Val Leu Gly Lys Val Phe Leu
465 470 475 480
Thr Lys Tyr Tyr Met Gln Arg Tyr Ala Ile His Ala Glu Thr Ala Leu
485 490 495
Ile Leu Cys Asn Ala Pro Ile Ser Gln Arg Ser Tyr Asp Asn Gln Pro
500 505 510
Ser Gln Phe Asp Arg Leu Phe Asn Thr Pro Leu Leu Asn Gly Gln Tyr
515 520 525
Phe Ser Thr Gly Asp Glu Glu Ile Asp Leu Asn Ser Gly Ser Thr Gly
530 535 540
Asp Trp Arg Lys Thr Ile Leu Lys Arg Ala Phe Asn Ile Asp Asp Val
545 550 555 560
Ser Leu Phe Arg Leu Leu Lys Ile Thr Asp His Asp Asn Lys Asp Gly
565 570 575
Lys Ile Lys Asn Asn Leu Lys Asn Leu Ser Asn Leu Tyr Ile Gly Lys
580 585 590
Leu Leu Ala Asp Ile His Gln Leu Thr Ile Asp Glu Leu Asp Leu Leu
595 600 605
Leu Ile Ala Val Gly Glu Gly Lys Thr Asn Leu Ser Ala Ile Ser Asp
610 615 620
Lys Gln Leu Ala Thr Leu Ile Arg Lys Leu Asn Thr Ile Thr Ser Trp
625 630 635 640
Leu His Thr Gln Lys Trp Ser Val Phe Gln Leu Phe Ile Met Thr Ser
645 650 655
Thr Ser Tyr Asn Lys Thr Leu Thr Pro Glu Ile Lys Asn Leu Leu Asp
660 665 670
Thr Val Tyr His Gly Leu Gln Gly Phe Asp Lys Asp Lys Ala Asp Leu
675 680 685
Leu His Val Met Ala Pro Tyr Ile Ala Ala Thr Leu Gln Leu Ser Ser
690 695 700
Glu Asn Val Ala His Ser Val Leu Leu Trp Ala Asp Lys Leu Gln Pro
705 710 715 720
Gly Asp Gly Ala Met Thr Ala Glu Lys Phe Trp Asp Trp Leu Asn Thr
725 730 735
Lys Tyr Thr Pro Gly Ser Ser Glu Ala Val Glu Thr Gln Glu His Ile
740 745 750
Val Gln Tyr Cys Gln Ala Leu Ala Gln Leu Glu Met Val Tyr His Ser
755 760 765
Thr Gly Ile Asn Glu Asn Ala Phe Arg Leu Phe Val Thr Lys Pro Glu
770 775 780
Met Phe Gly Ala Ala Thr Gly Ala Ala Pro Ala His Asp Ala Leu Ser
785 790 795 800
Leu Ile Met Leu Thr Arg Phe Ala Asp Trp Val Asn Ala Leu Gly Glu
805 810 815
Lys Ala Ser Ser Val Leu Ala Ala Phe Glu Ala Asn Ser Leu Thr Ala
820 825 830
Glu Gln Leu Ala Asp Ala Met Asn Leu Asp Ala Asn Leu Leu Leu Gln
835 840 845
Ala Ser Ile Gln Ala Gln Asn His Gln His Leu Pro Pro Val Thr Pro
850 855 860
Glu Asn Ala Phe Ser Cys Trp Thr Ser Ile Asn Thr Ile Leu Gln Trp
865 870 875 880
Val Asn Val Ala Gln Gln Leu Asn Val Ala Pro Gln Gly Val Ser Ala
885 890 895
Leu Val Gly Leu Asp Tyr Ile Gln Ser Met Lys Glu Thr Pro Thr Tyr
900 905 910
Ala Gln Trp Glu Asn Ala Ala Gly Val Leu Thr Ala Gly Leu Asn Ser
915 920 925
Gln Gln Ala Asn Thr Leu His Ala Phe Leu Asp Glu Ser Arg Ser Ala
930 935 940
Ala Leu Ser Thr Tyr Tyr Ile Arg Gln Val Ala Lys Ala Ala Ala Ala
945 950 955 960
Ile Lys Ser Arg Asp Asp Leu Tyr Gln Tyr Leu Leu Ile Asp Asn Gln
965 970 975
Val Ser Ala Ala Ile Lys Thr Thr Arg Ile Ala Glu Ala Ile Ala Ser
980 985 990
Ile Gln Leu Tyr Val Asn Arg Ala Leu Glu Asn Val Glu Glu Asn Ala
995 1000 1005
Asn Ser Gly Val Ile Ser Arg Gln Phe Phe Ile Asp Trp Asp Lys
1010 1015 1020
Tyr Asn Lys Arg Tyr Ser Thr Trp Ala Gly Val Ser Gln Leu Val
1025 1030 1035
Tyr Tyr Pro Glu Asn Tyr Ile Asp Pro Thr Met Arg Ile Gly Gln
1040 1045 1050
Thr Lys Met Met Asp Ala Leu Leu Gln Ser Val Ser Gln Ser Gln
1055 1060 1065
Leu Asn Ala Asp Thr Val Glu Asp Ala Phe Met Ser Tyr Leu Thr
1070 1075 1080
Ser Phe Glu Gln Val Ala Asn Leu Lys Val Ile Ser Ala Tyr His
1085 1090 1095
Asp Asn Ile Asn Asn Asp Gln Gly Leu Thr Tyr Phe Ile Gly Leu
1100 1105 1110
Ser Glu Thr Asp Ala Gly Glu Tyr Tyr Trp Arg Ser Val Asp His
1115 1120 1125
Ser Lys Phe Asn Asp Gly Lys Phe Ala Ala Asn Ala Trp Ser Glu
1130 1135 1140
Trp His Lys Ile Asp Cys Pro Ile Asn Pro Tyr Lys Ser Thr Ile
1145 1150 1155
Arg Pro Val Ile Tyr Lys Ser Arg Leu Tyr Leu Leu Trp Leu Glu
1160 1165 1170
Gln Lys Glu Ile Thr Lys Gln Thr Gly Asn Ser Lys Asp Gly Tyr
1175 1180 1185
Gln Thr Glu Thr Asp Tyr Arg Tyr Glu Leu Lys Leu Ala His Ile
1190 1195 1200
Arg Tyr Asp Gly Thr Trp Asn Thr Pro Ile Thr Phe Asp Val Asn
1205 1210 1215
Lys Lys Ile Ser Glu Leu Lys Leu Glu Lys Asn Arg Ala Pro Gly
1220 1225 1230
Leu Tyr Cys Ala Gly Tyr Gln Gly Glu Asp Thr Leu Leu Val Met
1235 1240 1245
Phe Tyr Asn Gln Gln Asp Thr Leu Asp Ser Tyr Lys Asn Ala Ser
1250 1255 1260
Met Gln Gly Leu Tyr Ile Phe Ala Asp Met Ala Ser Lys Asp Met
1265 1270 1275
Thr Pro Glu Gln Ser Asn Val Tyr Arg Asp Asn Ser Tyr Gln Gln
1280 1285 1290
Phe Asp Thr Asn Asn Val Arg Arg Val Asn Asn Arg Tyr Ala Glu
1295 1300 1305
Asp Tyr Glu Ile Pro Ser Ser Val Ser Ser Arg Lys Asp Tyr Gly
1310 1315 1320
Trp Gly Asp Tyr Tyr Leu Ser Met Val Tyr Asn Gly Asp Ile Pro
1325 1330 1335
Thr Ile Asn Tyr Lys Ala Ala Ser Ser Asp Leu Lys Ile Tyr Ile
1340 1345 1350
Ser Pro Lys Leu Arg Ile Ile His Asn Gly Tyr Glu Gly Gln Lys
1355 1360 1365
Arg Asn Gln Cys Asn Leu Met Asn Lys Tyr Gly Lys Leu Gly Asp
1370 1375 1380
Lys Phe Ile Val Tyr Thr Ser Leu Gly Val Asn Pro Asn Asn Ser
1385 1390 1395
Ser Asn Lys Leu Met Phe Tyr Pro Val Tyr Gln Tyr Ser Gly Asn
1400 1405 1410
Thr Ser Gly Leu Asn Gln Gly Arg Leu Leu Phe His Arg Asp Thr
1415 1420 1425
Thr Tyr Pro Ser Lys Val Glu Ala Trp Ile Pro Gly Ala Lys Arg
1430 1435 1440
Ser Leu Thr Asn Gln Asn Ala Ala Ile Gly Asp Asp Tyr Ala Thr
1445 1450 1455
Asp Ser Leu Asn Lys Pro Asp Asp Leu Lys Gln Tyr Ile Phe Met
1460 1465 1470
Thr Asp Ser Lys Gly Thr Ala Thr Asp Val Ser Gly Pro Val Glu
1475 1480 1485
Ile Asn Thr Ala Ile Ser Pro Ala Lys Val Gln Ile Ile Val Lys
1490 1495 1500
Ala Gly Gly Lys Glu Gln Thr Phe Thr Ala Asp Lys Asp Val Ser
1505 1510 1515
Ile Gln Pro Ser Pro Ser Phe Asp Glu Met Asn Tyr Gln Phe Asn
1520 1525 1530
Ala Leu Glu Ile Asp Gly Ser Gly Leu Asn Phe Ile Asn Asn Ser
1535 1540 1545
Ala Ser Ile Asp Val Thr Phe Thr Ala Phe Ala Glu Asp Gly Arg
1550 1555 1560
Lys Leu Gly Tyr Glu Ser Phe Ser Ile Pro Val Thr Leu Lys Val
1565 1570 1575
Ser Thr Asp Asn Ala Leu Thr Leu His His Asn Glu Asn Gly Ala
1580 1585 1590
Gln Tyr Met Gln Trp Gln Ser Tyr Arg Thr Arg Leu Asn Thr Leu
1595 1600 1605
Phe Ala Arg Gln Leu Val Ala Arg Ala Thr Thr Gly Ile Asp Thr
1610 1615 1620
Ile Leu Ser Met Glu Thr Gln Asn Ile Gln Glu Pro Gln Leu Gly
1625 1630 1635
Lys Gly Phe Tyr Ala Thr Phe Val Ile Pro Pro Tyr Asn Leu Ser
1640 1645 1650
Thr His Gly Asp Glu Arg Trp Phe Lys Leu Tyr Ile Lys His Val
1655 1660 1665
Val Asp Asn Asn Ser His Ile Ile Tyr Ser Gly Gln Leu Thr Asp
1670 1675 1680
Thr Asn Ile Asn Ile Thr Leu Phe Ile Pro Leu Asp Asp Val Pro
1685 1690 1695
Leu Asn Gln Asp Tyr His Ala Lys Val Tyr Met Thr Phe Lys Lys
1700 1705 1710
Ser Pro Ser Asp Gly Thr Trp Trp Gly Pro His Phe Val Arg Asp
1715 1720 1725
Asp Lys Gly Ile Val Thr Ile Asn Pro Lys Ser Ile Leu Thr His
1730 1735 1740
Phe Glu Ser Val Asn Val Leu Asn Asn Ile Ser Ser Glu Pro Met
1745 1750 1755
Asp Phe Ser Gly Ala Asn Ser Leu Tyr Phe Trp Glu Leu Phe Tyr
1760 1765 1770
Tyr Thr Pro Met Leu Val Ala Gln Arg Leu Leu His Glu Gln Asn
1775 1780 1785
Phe Asp Glu Ala Asn Arg Trp Leu Lys Tyr Val Trp Ser Pro Ser
1790 1795 1800
Gly Tyr Ile Val His Gly Gln Ile Gln Asn Tyr Gln Trp Asn Val
1805 1810 1815
Arg Pro Leu Leu Glu Asp Thr Ser Trp Asn Ser Asp Pro Leu Asp
1820 1825 1830
Ser Val Asp Pro Asp Ala Val Ala Gln His Asp Pro Met His Tyr
1835 1840 1845
Lys Val Ser Thr Phe Met Arg Thr Leu Asp Leu Leu Ile Ala Arg
1850 1855 1860
Gly Asp His Ala Tyr Arg Gln Leu Glu Arg Asp Thr Leu Asn Glu
1865 1870 1875
Ala Lys Met Trp Tyr Met Gln Ala Leu His Leu Leu Gly Asp Lys
1880 1885 1890
Pro Tyr Leu Pro Leu Ser Thr Thr Trp Ser Asp Pro Arg Leu Asp
1895 1900 1905
Arg Ala Ala Asp Ile Thr Thr Gln Asn Ala His Asp Ser Ala Ile
1910 1915 1920
Val Ala Leu Arg Gln Asn Ile Pro Thr Pro Ala Pro Leu Ser Leu
1925 1930 1935
Arg Ser Ala Asn Thr Leu Thr Asp Leu Phe Leu Pro Gln Ile Asn
1940 1945 1950
Glu Val Met Met Asn Tyr Trp Gln Thr Leu Ala Gln Arg Val Tyr
1955 1960 1965
Asn Leu Arg His Asn Leu Ser Ile Asp Gly Gln Pro Leu Tyr Leu
1970 1975 1980
Pro Ile Tyr Ala Thr Pro Ala Asp Pro Lys Ala Leu Leu Ser Ala
1985 1990 1995
Ala Val Ala Thr Ser Gln Gly Gly Gly Lys Leu Pro Glu Ser Phe
2000 2005 2010
Met Ser Leu Trp Arg Phe Pro His Met Leu Glu Asn Ala Arg Gly
2015 2020 2025
Met Val Ser Gln Leu Thr Gln Phe Gly Ser Thr Leu Gln Asn Ile
2030 2035 2040
Ile Glu Arg Gln Asp Ala Glu Ala Leu Asn Ala Leu Leu Gln Asn
2045 2050 2055
Gln Ala Ala Glu Leu Ile Leu Thr Asn Leu Ser Ile Gln Asp Lys
2060 2065 2070
Thr Ile Glu Glu Leu Asp Ala Glu Lys Thr Val Leu Glu Lys Ser
2075 2080 2085
Lys Ala Gly Ala Gln Ser Arg Phe Asp Ser Tyr Gly Lys Leu Tyr
2090 2095 2100
Asp Glu Asn Ile Asn Ala Gly Glu Asn Gln Ala Met Thr Leu Arg
2105 2110 2115
Ala Ser Ala Ala Gly Leu Thr Thr Ala Val Gln Ala Ser Arg Leu
2120 2125 2130
Ala Gly Ala Ala Ala Asp Leu Val Pro Asn Ile Phe Gly Phe Ala
2135 2140 2145
Gly Gly Gly Ser Arg Trp Gly Ala Ile Ala Glu Ala Thr Gly Tyr
2150 2155 2160
Val Met Glu Phe Ser Ala Asn Val Met Asn Thr Glu Ala Asp Lys
2165 2170 2175
Ile Ser Gln Ser Glu Thr Tyr Arg Arg Arg Arg Gln Glu Trp Glu
2180 2185 2190
Ile Gln Arg Asn Asn Ala Glu Ala Glu Leu Lys Gln Ile Asp Ala
2195 2200 2205
Gln Leu Lys Ser Leu Ala Val Arg Arg Glu Ala Ala Val Leu Gln
2210 2215 2220
Lys Thr Ser Leu Lys Thr Gln Gln Glu Gln Thr Gln Ser Gln Leu
2225 2230 2235
Ala Phe Leu Gln Arg Lys Phe Ser Asn Gln Ala Leu Tyr Asn Trp
2240 2245 2250
Leu Arg Gly Arg Leu Ala Ala Ile Tyr Phe Gln Phe Tyr Asp Leu
2255 2260 2265
Ala Val Ala Arg Cys Leu Met Ala Glu Gln Ala Tyr Arg Trp Glu
2270 2275 2280
Leu Asn Asp Asp Ser Ala Arg Phe Ile Lys Pro Gly Ala Trp Gln
2285 2290 2295
Gly Thr Tyr Ala Gly Leu Leu Ala Gly Glu Thr Leu Met Leu Ser
2300 2305 2310
Leu Ala Gln Met Glu Asp Ala His Leu Lys Arg Asp Lys Arg Ala
2315 2320 2325
Leu Glu Val Glu Arg Thr Val Ser Leu Ala Glu Val Tyr Ala Gly
2330 2335 2340
Leu Pro Lys Asp Asn Gly Pro Phe Ser Leu Ala Gln Glu Ile Asp
2345 2350 2355
Lys Leu Val Ser Gln Gly Ser Gly Ser Ala Gly Ser Gly Asn Asn
2360 2365 2370
Asn Leu Ala Phe Gly Ala Gly Thr Asp Thr Lys Thr Ser Leu Gln
2375 2380 2385
Ala Ser Val Ser Phe Ala Asp Leu Lys Ile Arg Glu Asp Tyr Pro
2390 2395 2400
Ala Ser Leu Gly Lys Ile Arg Arg Ile Lys Gln Ile Ser Val Thr
2405 2410 2415
Leu Pro Ala Leu Leu Gly Pro Tyr Gln Asp Val Gln Ala Ile Leu
2420 2425 2430
Ser Tyr Gly Asp Lys Ala Gly Leu Ala Asn Gly Cys Glu Ala Leu
2435 2440 2445
Ala Val Ser His Gly Met Asn Asp Ser Gly Gln Phe Gln Leu Asp
2450 2455 2460
Phe Asn Asp Gly Lys Phe Leu Pro Phe Glu Gly Ile Ala Ile Asp
2465 2470 2475
Gln Gly Thr Leu Thr Leu Ser Phe Pro Asn Ala Ser Met Pro Glu
2480 2485 2490
Lys Gly Lys Gln Ala Thr Met Leu Lys Thr Leu Asn Asp Ile Ile
2495 2500 2505
Leu His Ile Arg Tyr Thr Ile Lys
2510 2515
<210>26
<211>2499
<212>PRT
<213>发光光杆状菌
<400>26
Met Asn Thr Leu Lys Ser Glu Tyr Gln Gln Ala Leu Gly Ala Gly Phe
1 5 10 15
Asn Asn Leu Thr Asp Ile Cys His Leu Ser Phe Asp Glu Leu Arg Lys
20 25 30
Lys Val Lys Asp Lys Leu Ser Trp Ser Gln Thr Gln Ser Leu Tyr Leu
35 40 45
Glu Ala Gln Gln Val Gln Lys Asp Asn Leu Leu His Glu Ala Arg Ile
50 55 60
Leu Lys Arg Ala Asn Pro His Leu Gln Ser Ala Val His Leu Ala Leu
65 70 75 80
Thr Ala Pro His Ala Asp Gln Gln Gly Tyr Asn Ser Arg Phe Gly Asn
85 90 95
Arg Ala Ser Lys Tyr Ala Ala Pro Gly Ala Ile Ser Ser Met Phe Ser
100 105 110
Leu Ala Ala Tyr Leu Thr Glu Leu Tyr Arg Gln Ala Arg Asn Leu His
115 120 125
Ala Glu Gly Ser Ile Tyr His Leu Asp Thr Arg Arg Pro Asp Leu Lys
130 135 140
Ser Leu Val Leu Ser Gln Lys Asn Met Asn Thr Glu Ile Ser Thr Leu
145 150 155 160
Ser Leu Ser Asn Asn Met Leu Leu Asn Ser Ile Lys Thr Gln Pro Asn
165 170 175
Leu Asn Ser His Ala Lys Val Met Glu Lys Leu Ser Thr Phe Arg Thr
180 185 190
Ser Gly Ser Met Pro Tyr His Asp Ala Tyr Glu Ser Val Arg Lys Ile
195 200 205
Ile Gln Leu Gln Ala Pro Val Phe Glu Gln Ser Ser Thr Leu Thr Asp
210 215 220
Thr Pro Ile Thr Lys Leu Met Tyr Gln Ile Ser Leu Leu Gly Ile Asn
225 230 235 240
Ala Ser Val Ser Pro Glu Leu Phe Thr Ile Leu Thr Gln Lys Ile Lys
245 250 255
Pro Ala Thr Asn Ala Asp Asn Thr Asn Glu Leu Lys Lys Leu Tyr Lys
260 265 270
Lys Asn Phe Gly Glu Ile Lys Ser Ile Gln Met Ala Arg Ala Glu Tyr
275 280 285
Leu Lys Ser Tyr Tyr Asn Leu Thr Asp Lys Glu Leu Asn Gln Phe Ser
290 295 300
Lys Lys Ile Lys Gln Ile Asp Ser Leu Trp Asn Ile Gly Asp Glu Ile
305 310 315 320
Thr Gln Tyr His Leu Leu Lys Phe Asn Lys Ala Ile Asn Leu Ser Arg
325 330 335
Ser Thr Glu Leu Ser Pro Ile Ile Leu Asn Ser Ile Ala Ile Asp Ile
340 345 350
Leu Lys Lys Thr Pro Pro Glu Asp Asp Ser Asp Asn Pro Phe Arg Asp
355 360 365
Asp Pro Asp Tyr Leu Glu Ser Phe Gln Asp Leu Asp Leu Ser Asp Glu
370 375 380
Pro Asp Ile Asp Glu Asp Val Leu Arg Glu Ala Leu Arg Val Lys Asp
385 390 395 400
Tyr Met Gln Arg Tyr Gly Ile Asp Ala Glu Thr Ala Leu Ile Leu Cys
405 410 415
Lys Ala Pro Ile Ser Glu Asn Pro Ser His Pro Asp Leu Ser Lys Leu
420 425 430
Leu Ala Asp Ile His Gln Leu Thr Ile Asp Glu Leu Gly Val Leu Leu
435 440 445
Val Ala Ile Asp Glu Gly Lys Thr Asp Leu Ser Gln Ile Thr His Asp
450 455 460
Asn Leu Ala Val Leu Ile Ser Lys Leu Tyr Ser Val Thr Asn Trp Leu
465 470 475 480
Arg Thr Arg Lys Trp Ser Val Tyr Gln Leu Phe Val Met Thr Thr Asp
485 490 495
Lys Tyr Asn Lys Thr Leu Thr Pro Glu Ile Asn Asn Leu Leu Asp Thr
500 505 510
Val Tyr Asn Gly Leu Gln Asn Phe Tyr Lys Asp Asn Leu Leu Lys Ile
515 520 525
Lys Asp Asn Leu Leu Lys Ala Lys Glu Ser Leu Pro Glu Asp Lys Asp
530 535 540
Asn Leu Pro Lys Ala Glu Gln Tyr Leu Leu Glu Ala Glu Lys Tyr Leu
545 550 555 560
Leu Ala Ala Glu Lys Tyr Leu Leu Ala Ala Glu Lys Tyr Leu Leu Glu
565 570 575
Ala Asn Lys Asn Pro Leu Glu Ala Lys Lys Ala Leu Lys Glu Tyr Glu
580 585 590
Lys Asn Gln Glu Ala Tyr Glu Lys Asn Leu Lys Glu His Glu Lys Tyr
595 600 605
Leu Leu Lys Ala Gly Glu Asn Leu Pro Ala Ile Lys Glu Asn Leu Leu
610 615 620
Lys Ile Lys Glu Asn Leu Pro Lys Ala Ile Ser Pro Tyr Ile Ala Ala
625 630 635 640
Ala Leu Gln Leu Pro Ser Glu Asn Val Ala Leu Ser Val Leu Ala Trp
645 650 655
Ala Asp Lys Leu Asn Ser Gly Lys Glu Asn Lys Met Thr Ala Asp Ser
660 665 670
Phe Trp Asn Trp Leu Arg Lys Lys Pro Ile Glu Thr Gln Ser Lys Thr
675 680 685
Thr Glu Ala Thr Glu Ala Thr Glu Ala Thr Glu Ala Thr Glu Ala Thr
690 695 700
Glu Ala Thr Glu Lys Thr Thr Leu Ile Gln Gln Ala Val Gln Tyr Cys
705 710 715 720
Gln Cys Leu Ala Gln Leu Ala Leu Ile Tyr Arg Ser Thr Gly Leu Ser
725 730 735
Glu Ser Thr Leu Arg Leu Phe Val Thr Asn Pro Gln Ile Phe Gly Leu
740 745 750
Thr Ala Lys Thr Thr Ser Thr His Asn Val Leu Ser Leu Ile Met Leu
755 760 765
Thr Arg Phe Thr Asp Trp Val Asn Ser Leu Gly Glu Asn Ala Ser Ser
770 775 780
Val Leu Thr Glu Phe Glu Lys Gly Thr Leu Thr Ala Glu Leu Leu Ala
785 790 795 800
Asn Ala Met Asn Leu Asp Lys Asn Leu Leu Glu Gln Ala Ser Thr Gln
805 810 815
Ala Gln Ala Asp Phe Ser Asn Trp Pro Ser Ile Asp Asn Leu Leu Gln
820 825 830
Trp Ile Asn Ile Ser Arg Gln Leu Asn Ile Ser Pro Gln Gly Val Ser
835 840 845
Glu Leu Ala Lys Ile Leu Asp Ile Glu Ser Ser Thr Asn Tyr Ala Gln
850 855 860
Trp Glu Asn Val Ala Ser Ile Leu Thr Ala Gly Leu Asp Thr Gln Lys
865 870 875 880
Ala Asn Thr Leu His Ala Phe Leu Gly Glu Ser Arg Ser Thr Ala Leu
885 890 895
Ser Thr Tyr Tyr Ile Tyr Ser His Asn Gln Lys Asp Arg Glu Glu Arg
900 905 910
Lys His Thr Val Ile Lys Asp Arg Asp Asp Leu Tyr Gln Tyr Leu Leu
915 920 925
Ile Asp Asn Gln Val Ser Ala Ala Ile Lys Thr Thr Glu Ile Ala Glu
930 935 940
Ala Ile Ala Ser Ile Gln Leu Tyr Ile Asn Arg Ala Leu Lys Asn Met
945 950 955 960
Glu Gly Asp Thr Asp Thr Ser Val Thr Ser Arg Leu Phe Phe Thr Asn
965 970 975
Trp Asp Lys Tyr Asn Lys Arg Tyr Ser Thr Trp Ala Gly Ile Thr Lys
980 985 990
Leu Leu Tyr Tyr Pro Glu Asn Tyr Ile Asp Pro Thr Leu Arg Ile Gly
995 1000 1005
Gln Thr Lys Met Met Asp Thr Leu Leu Gln Ser Ile Ser Gln Ser
1010 1015 1020
Gln Leu Asn Thr Asp Thr Val Glu Asp Ala Phe Lys Ser Tyr Leu
1025 1030 1035
Thr Ser Phe Glu Gln Val Ala Asn Leu Glu Val Ile Ser Ala Tyr
1040 1045 1050
His Asp Asn Ile Asn Asn Asp Gln Gly Leu Thr Tyr Phe Ile Gly
1055 1060 1065
Arg Ser Lys Thr Glu Val Asn Gln Tyr Tyr Trp Arg Ser Val Asp
1070 1075 1080
His Asn Lys Phe Ser Glu Gly Lys Phe Pro Ala Asn Ala Trp Ser
1085 1090 1095
Glu Trp His Lys Ile Asp Cys Pro Ile Asn Pro Tyr Glu Asp Thr
1100 1105 1110
Ile Arg Pro Val Val Tyr Gln Ser Arg Leu Tyr Ile Ile Trp Leu
1115 1120 1125
Glu Gln Lys Lys Val Thr Asn Arg Ala Glu Gly Glu Ala Ile Lys
1130 1135 1140
Gln Gly Ser Lys Thr Thr Thr Ser Tyr His Tyr Glu Leu Lys Leu
1145 1150 1155
Ala His Ile Arg Tyr Asp Gly Thr Trp Asn Thr Pro Ile Thr Phe
1160 1165 1170
Asp Val Asp Glu Lys Ile Ser Gly Leu Asn Leu Glu Leu Asn Lys
1175 1180 1185
Ala Leu Gly Leu Tyr Cys Ala Ser Tyr Gln Gly Lys Asp Lys Leu
1190 1195 1200
Leu Val Met Phe Tyr Lys Lys Gln Glu Gln Leu Asn Asn Tyr Thr
1205 1210 1215
Glu Lys Thr Gly Asn Thr Tyr Thr Ala Pro Ile Lys Gly Leu Tyr
1220 1225 1230
Ile Thr Ser Asn Met Ser Pro Glu Glu Met Thr Pro Glu Ser Tyr
1235 1240 1245
Arg Leu Asn Ala His Lys Gln Phe Asp Thr Asn Asn Val Val Arg
1250 1255 1260
Val Asn Asn Arg Tyr Ala Glu Ser Tyr Glu Ile Pro Ser Ser Val
1265 1270 1275
Asn Ser Asn Asn Gly Tyr Asp Trp Gly Glu Gly Tyr Leu Ser Met
1280 1285 1290
Val Tyr Gly Gly Ser Ile Leu Ile Thr Arg Asp Pro Ser Asp Asn
1295 1300 1305
Ser Lys Ile Gln Ile Ser Pro Lys Leu Arg Ile Ile His Asn Gly
1310 1315 1320
Tyr Glu Gly Arg Gln Arg Asn Gln Cys Asn Leu Met Lys Lys Tyr
1325 1330 1335
Gly Lys Leu Gly Asp Lys Phe Ile Ile Tyr Thr Thr Leu Gly Ile
1340 1345 1350
Asn Pro Asn Asn Leu Ser Asn Lys Lys Leu Ile Tyr Pro Val Tyr
1355 1360 1365
Gln Tyr Glu Gly Asn Glu Ser Lys Leu Ser Gln Gly Arg Leu Leu
1370 1375 1380
Phe Tyr Arg Asp Ser Thr Thr Asn Phe Thr Arg Ala Trp Phe Pro
1385 1390 1395
Asn Leu Ser Ser Asp Ser Lys Glu Met Ser Ile Thr Thr Gly Gly
1400 1405 1410
Asn Ile Ser Gly Asn Tyr Gly Tyr Ile Asp Asn Lys His Ser Asp
1415 1420 1425
Asn Lys Pro Phe Glu Glu Tyr Phe Tyr Met Asp Asp His Gly Gly
1430 1435 1440
Ile Asp Thr Asp Val Ser Glu Pro Ile Phe Ile Asn Thr Lys Ile
1445 1450 1455
Gln Pro Ser Asn Val Lys Ile Ile Val Lys Thr Val Lys Asp Asp
1460 1465 1470
Gly Lys Leu Asp Ser Lys Pro Tyr Ile Ala Glu Asp Lys Val Ser
1475 1480 1485
Val Lys Pro Thr Pro Asn Phe Glu Glu Met Cys Tyr Gln Phe Asn
1490 1495 1500
Asn Leu Asp Gln Ile Asp Val Ser Thr Leu Val Phe Lys Asn Asn
1505 1510 1515
Glu Ala Ser Ile Asp Ile Thr Phe Thr Ala Ser Ala Asp Ala Phe
1520 1525 1530
Glu Ser Gly Lys Glu Gln Arg Asn Leu Gly Glu Glu His Phe Ser
1535 1540 1545
Ile Arg Ile Ile Lys Lys Ala Asn Val Asn Asp Val Leu Thr Leu
1550 1555 1560
His His Asp Pro Ser Gly Ala Gln Tyr Met Gln Trp Gly Ala Tyr
1565 1570 1575
Arg Thr Arg Leu Asn Thr Leu Phe Ala Arg Lys Leu Ile Ser Arg
1580 1585 1590
Ala Asn Ala Gly Ile Asp Thr Ile Leu Ser Met Glu Thr Gln Asn
1595 1600 1605
Ile Gln Glu Pro Gln Leu Gly Lys Gly Phe Tyr Val Asn Phe Thr
1610 1615 1620
Leu Pro Lys Tyr Asp Gln Asn Thr His Gly Asn Glu Arg Gln Phe
1625 1630 1635
Lys Ile His Ile Gly Asn Ile Ala Gly Asp Asn Thr Met Arg Pro
1640 1645 1650
Tyr Tyr Gln Gly Ile Leu Ala Asp Thr Glu Thr Ser Val Val Leu
1655 1660 1665
Phe Val Pro Tyr Glu Lys Gln Ser Tyr Thr Asn Glu Gly Val Arg
1670 1675 1680
Leu Gly Val Glu Tyr Lys Lys Val Ser Tyr Leu Gly Val Trp Glu
1685 1690 1695
Pro Ala Phe Phe Tyr Phe Asn Glu Ile Gln Gln Lys Phe Ile Leu
1700 1705 1710
Ile Asn Asp Ala Asp His Asn Ser Ala Met Thr Gln Ser Gly Glu
1715 1720 1725
Lys Thr Gly Ile Lys Lys Tyr Lys Gly Phe Leu Asp Val Ser Ile
1730 1735 1740
Leu Ile Asp His Gln His Thr Glu Pro Met Asp Phe Asn Gly Ala
1745 1750 1755
Asn Ser Leu Tyr Phe Trp Glu Leu Phe Tyr Tyr Thr Pro Met Leu
1760 1765 1770
Ile Ala Gln Arg Leu Leu His Glu Gln Asn Phe Asp Glu Ala Asn
1775 1780 1785
Arg Trp Leu Lys Tyr Val Trp Asn Pro Ser Gly His Ile Ala Asn
1790 1795 1800
Gly Gln Lys Gln His Pro His Asn Trp Asn Val Arg Pro Leu Gln
1805 1810 1815
Glu Asp Thr Ser Trp Asn Asp Asp Pro Leu Asp Thr Phe Asp Pro
1820 1825 1830
Asp Ala Ile Ala Gln His Asp Pro Met His Tyr Lys Val AlaThr
1835 1840 1845
Phe Met Cys Ala Leu Asp Leu Leu Ile Glu Gln Gly Asp Tyr Ala
1850 1855 1860
Tyr Arg Gln Leu Glu Arg Asp Thr Leu Ala Glu Ala Lys Met Trp
1865 1870 1875
Tyr Met Gln Ala Leu His Leu Leu Gly Asp Lys Pro His Leu Leu
1880 1885 1890
Leu Ser Ser Thr Trp Ser Asp Pro Glu Leu Lys Glu Ala Ala Asp
1895 1900 1905
Leu Glu Lys Gln Gln Ala His Ala Lys Ala Ile Ala Asp Leu Arg
1910 1915 1920
Gln Gly Gln Pro Lys Asp Gly Ser Asn Thr Asp Leu Phe Leu Pro
1925 1930 1935
Gln Val Asn Glu Val Met Leu Ser Tyr Trp Gln Lys Leu Glu Gln
1940 1945 1950
Arg Leu Tyr Asn Leu Arg His Asn Leu Ser Ile Asp Gly Gln Pro
1955 1960 1965
Leu His Leu Pro Ile Phe Ala Thr Pro Ala Asp Pro Lys Ala Leu
1970 1975 1980
Leu Ser Ala Ala Val Ala Ser Ser Gln Gly Gly Ser Asn Leu Pro
1985 1990 1995
Ser Glu Phe Ile Ser Val Trp Arg Phe Pro His Met Leu Glu Asn
2000 2005 20l0
Ala Arg Ser Met Val Ser Gln Leu Thr Gln Phe Gly Ser Thr Leu
2015 2020 2025
Gln Asn Ile Ile Glu Arg Gln Asp Ala Glu Ala Leu Asn Thr Leu
2030 2035 2040
Leu Gln Asn Gln Ala Ala Glu Leu Ile Leu Thr Asn Leu Ser Ile
2045 2050 2055
Gln Asp Lys Thr Ile Glu Glu Leu Asp Val Glu Lys Thr Val Leu
2060 2065 2070
Glu Lys Thr Arg Ala Gly Ala Lys Ser Arg Phe Asp Ser Tyr Ser
2075 2080 2085
Lys Phe Tyr Asp Glu Asp Ile Asn Ala Gly Glu Lys Gln Ala Met
2090 2095 2100
Ala Leu Arg Ala Ser Val Ala Gly Ile Ser Thr Ala Leu Gln Ala
2105 2110 2115
Ser His Leu Ala Gly Ala Ala Leu Asp Leu Ala Pro Asn Ile Phe
2120 2125 2130
Gly Phe Ala Asp Gly Gly Ser His Trp Gly Ala Ile Ala Gln Ala
2135 2140 2145
Thr Ser Asn Val Met Glu Phe Ser Ala Ser Val Met Ser Thr Glu
2150 2155 2160
Ala Asp Lys Ile Ser Gln Ser Glu Ala Tyr Arg Arg Arg Arg Gln
2165 2170 2175
Glu Trp Lys Ile Gln Arg Asn Asn Ala Asp Ala Glu Leu Lys Gln
2180 2185 2190
Ile Asp Ala Gln Leu Gln Ser Leu Val Val Arg Arg Glu Ala Ala
2195 2200 2205
Val Leu Gln Lys Thr Ser Leu Lys Thr Gln Gln Glu Gln Thr His
2210 2215 2220
Ala Gln Leu Thr Phe Leu Gln His Lys Phe Ser Asn Gln Ala Leu
2225 2230 2235
Tyr Asn Trp Leu Arg Gly Arg Leu Ser Ala Ile Tyr Phe Gln Phe
2240 2245 2250
Tyr Asp Leu Ala Val Ala Arg Cys Leu Met Ala Glu Met Ala Tyr
2255 2260 2265
Arg Trp Glu Thr Asn Asp Ala Ala Ala Arg Phe Ile Lys Pro Gly
2270 2275 2280
Ala Trp Gln Gly Thr His Ala Gly Leu Leu Ala Gly Glu Thr Leu
2285 2290 2295
Met Leu Asn Leu Ala Gln Met Glu Asp Ala His Leu Lys Gln Glu
2300 2305 2310
Gln Arg Val Leu Glu Val Glu Arg Thr Val Ser Leu Ala Glu Val
2315 2320 2325
Tyr Lys Glu Lys Gly Gln Phe Ser Leu Thr Lys Lys Ile Ala Glu
2330 2335 2340
Leu Val Asn Lys Lys Pro Asp Thr Thr Ser Ser Arg Asn Asn Thr
2345 2350 2355
Leu Asn Phe Gly Glu Gly Asn Ala Lys Thr Ser Leu Gln Ala Ser
2360 2365 2370
Ile Ser Leu Ala Asp Leu Gln Ile Arg His Asp Tyr Pro Glu Asn
2375 2380 2385
Ser Gly Ala Gly Asn Val Arg Arg Ile Lys Gln Ile Ser Val Thr
2390 2395 2400
Leu Pro Ala Leu Leu Gly Pro Tyr Gln Asp Val Gln Ala Ile Leu
2405 2410 2415
Ser Tyr Gly Gly Asp Ala Thr Gly Leu Ala Lys Gly Cys Lys Ala
2420 2425 2430
Leu Ala Val Ser His Gly Met Asn Asp Ser Gly Gln Phe Gln Leu
2435 2440 2445
Asp Phe Asn Asp Gly Lys Phe Leu Pro Phe Glu Gly Ile Glu Ile
2450 2455 2460
Asp Lys Gly Thr Leu Thr Leu Ser Phe Pro Asn Ala Thr Glu Lys
2465 2470 2475
Gln Lys Thr Met Leu Glu Ser Ile Ser Asp Ile Ile Leu His Ile
2480 2485 2490
Arg Tyr Thr Ile Arg Gln
2495
<210>27
<211>2381
<212>PRT
<213>发光光杆状菌
<400>27
Met Asn Ser Tyr Val Lys Glu Ile Pro Asp Val Leu Gln Ser Gln Tyr
1 5 10 15
Gly Ile Asn Cys Leu Thr Asp Ile Cys His Tyr Ser Phe Asn Glu Phe
20 25 30
Arg Gln Gln Val Ser Asp His Leu Ser Trp Ser Glu Thr Asn Arg Leu
35 40 45
Tyr Arg Asp Ala Gln Gln Glu Gln Lys Glu Asn Gln Leu Tyr Glu Ala
50 55 60
Arg Ile Leu Lys Arg Ala Asn Pro Gln Leu Gln Asn Ala Val His Leu
65 70 75 80
Gly Ile Thr Leu Pro His Ala Glu Leu Arg Gly Tyr Asn Ser Glu Phe
85 90 95
Gly Gly Arg Ala Ser Gln Tyr Val Ala Pro Gly Ser Val Ser Ser Met
100 105 110
Phe Ser Pro Ala Ala Tyr Leu Thr Glu Leu Tyr Arg Glu Ala Arg Asn
115 120 125
Leu His Ala Ser Asp Ser Val Tyr His Leu Asp Glu Arg Arg Pro Asp
130 135 140
Leu Gln Ser Met Thr Leu Ser Gln Gln Asn Met Asp Thr Glu Leu Ser
145 150 155 160
Thr Leu Ser Leu Ser Asn Glu Ile Leu Leu Lys Gly Ile Lys Ala Asn
165 170 175
Gln Ser Asn Leu Asp Ser Asp Thr Lys Val Met Glu Met Leu Ser Thr
180 185 190
Phe Arg Pro Ser Gly Thr Ile Pro Tyr His Asp Ala Tyr Glu Asn Val
195 200 205
Arg Lys Ala Ile Gln Leu Gln Asp Pro Lys Leu Glu Gln Phe Gln Lys
210 215 220
Ser Pro Ala Val Ala Gly Leu Met His Gln Ala Ser Leu Leu Gly Ile
225 230 235 240
Asn Asn Ser Ile Ser Pro Glu Leu Phe Asn Ile Leu Thr Glu Glu Ile
245 250 255
Thr Glu Ala Asn Ala Glu Ala Ile Tyr Lys Gln Asn Phe Gly Asp Ile
260 265 270
Asp Pro Ala Cys Leu Ala Met Pro Glu Tyr Leu Lys Ser Tyr Tyr Asn
275 280 285
Phe Ser Asp Glu Glu Leu Ser Gln Phe Ile Arg Lys Tyr Pro Asp Asn
290 295 300
Glu Leu Asn Thr Gln Lys Ile His Leu Leu Lys Ile Asn Lys Ile Ile
305 310 315 320
Leu Leu Ser Gln Ala Val Asn Leu Pro Phe Leu Lys Leu Asp Glu Ile
325 330 335
Ile Pro Glu Gln Asn Ile Thr Pro Thr Val Leu Gly Lys Ile Phe Leu
340 345 350
Val Lys Tyr Tyr Met Gln Lys Tyr Asn Ile Gly Thr Glu Thr Ala Leu
355 360 365
Ile Leu Cys Asn Asp Ser Ile Ser Gln Tyr Ser Tyr Ser Asn Gln Pro
370 375 380
Ser Gln Phe Asp Arg Leu Phe Asn Thr Ser Pro Leu Asn Gly Gln Tyr
385 390 395 400
Phe Val Ile Glu Asp Thr Asn Ile Asp Leu Ser Leu Asn Ser Thr Asp
405 410 415
Asn Trp His Lys Ala Val Leu Lys Arg Ala Phe Asn Val Asp Asp Ile
420 425 430
Ser Leu Tyr Arg Leu Leu His Ile Ala Asn His Asn Asn Thr Asp Gly
435 440 445
Lys Ile Ala Asn Asn Ile Lys Asn Leu Ser Asn Leu Tyr Met Thr Lys
450 455 460
Leu Leu Ala Asp Ile His Gln Leu Thr Ile Asp Glu Leu Tyr Leu Leu
465 470 475 480
Leu Ile Thr Ile Gly Glu Asp Lys Ile Asn Leu Tyr Asp Ile Asp Asp
485 490 495
Lys Glu Leu Glu Lys Leu Ile Asn Arg Leu Asp Thr Leu Ser Asn Trp
500 505 510
Leu His Thr Gln Lys Trp Ser Ile Tyr Gln Leu Phe Leu Met Thr Thr
515 520 525
Thr Asn Tyr Asp Lys Thr Leu Thr Pro Glu Ile Gln Asn Leu Leu Asp
530 535 540
Thr Val Tyr Asn Gly Leu Gln Asn Phe Asp Lys Asn Lys Thr Lys Leu
545 550 555 560
Leu Ala Ala Ile Ala Pro Tyr Ile Ala Ala Thr Leu Gln Leu Pro Ser
565 570 575
Glu Asn Val Ala His Ser Ile Leu Leu Trp Ala Asp Lys Ile Lys Pro
580 585 590
Ser Glu Asn Lys Ile Thr Ala Glu Lys Phe Trp Ile Trp Leu Gln Asn
595 600 605
Arg Asp Thr Thr Glu Leu Ser Lys Pro Pro Glu Met Gln Glu Gln Ile
610 615 620
Ile Gln Tyr Cys His Cys Leu Ala Gln Leu Thr Met Ile Tyr Arg Ser
625 630 635 640
Ser Gly Ile Asn Glu Asn Ala Phe Arg Leu Phe Ile Glu Lys Pro Thr
645 650 655
Ile Phe Gly Ile Pro Asp Glu Pro Asn Lys Ala Thr Pro Ala His Asn
660 665 670
Ala Pro Thr Leu Ile Ile Leu Thr Arg Phe Ala Asn Trp Val Asn Ser
675 680 685
Leu Gly Glu Lys Ala Ser Pro Ile Leu Thr Ala Phe Glu Asn Lys Thr
690 695 700
Leu Thr Ala Glu Lys Leu Ala Asn Ala Met Asn Leu Asp Ala Asn Leu
705 710 715 720
Leu Glu Gln Ala Ser Ile Gln Ala Gln Asn Tyr Lys Gln Val Thr Lys
725 730 735
Glu Asn Thr Phe Ser Asn Trp Gln Ser Ile Asp Ile Ile Leu Gln Trp
740 745 750
Thr Asn Ile Ala Ser Asn Leu Asn Ile Ser Pro Gln Gly Ile Ser Pro
755 760 765
Leu Ile Ala Leu Asp Tyr Ile Lys Pro Ala Gln Lys Thr Pro Thr Tyr
770 775 780
Ala Gln Trp Glu Asn Ala Ala Ile Ala Leu Thr Ala Gly Leu Asp Thr
785 790 795 800
Gln Gln Thr His Thr Leu His Val Phe Leu Asp Glu Ser Arg Ser Thr
805 810 815
Ala Leu Ser Asn Tyr Tyr Ile Gly Lys Val Ala Asn Arg Ala Ala Ser
820 825 830
Ile Lys Ser Arg Asp Asp Leu Tyr Gln Tyr Leu Leu Ile Asp Asn Gln
835 840 845
Val Ser Ala Glu Ile Lys Thr Thr Arg Ile Ala Glu Ala Ile Ala Ser
850 855 860
Ile Gln Leu Tyr Val Asn Arg Ala Leu Glu Asn Ile Glu Ile His Ala
865 870 875 880
Val Ser Asp Val Ile Thr Arg Gln Phe Phe Ile Asp Trp Asp Lys Tyr
885 890 895
Asn Lys Arg Tyr Ser Thr Trp Ala Gly Val Ser Gln Leu Val Tyr Tyr
900 905 910
Pro Glu Asn Tyr Ile Asp Pro Thr Met Arg Ile Gly Gln Thr Lys Met
915 920 925
Met Asp Thr Leu Leu Gln Ser Val Ser Gln Ser Gln Leu Asn Ala Asp
930 935 940
Thr Val Glu Asp Ala Phe Lys Ser Tyr Leu Thr Ser Phe Glu Gln Val
945 950 955 960
Ala Asn Leu Glu Val Ile Ser Ala Tyr His Asp Asn Val Asn Asn Asp
965 970 975
Gln Gly Leu Thr Tyr Phe Ile Gly Asn Ser Lys Thr Glu Val Asn Gln
980 985 990
Tyr Tyr Trp Arg Ser Val Asp His Ser Lys Phe Asn Asp Gly Lys Phe
995 1000 1005
Ala Ala Asn Ala Trp Ser Glu Trp His Lys Ile Asp Cys Ala Ile
1010 1015 1020
Asn Pro Tyr Gln Ser Thr Ile Arg Pro Val Ile Tyr Lys Ser Arg
1025 1030 1035
Leu Tyr LeuIle Trp Leu Glu Gln Lys Glu Thr Ala Lys Gln Lys
1040 1045 1050
Glu Asp Asn Lys Val Thr Thr Asp Tyr His Tyr Glu Leu Lys Leu
1055 1060 1065
Ala His Ile Arg Tyr Asp Gly Thr Trp Asn Val Pro Ile Thr Phe
1070 1075 1080
Asp Val Asp Glu Lys Ile Leu Ala Leu Glu Leu Thr Lys Ser Gln
1085 1090 1095
Ala Pro Gly Leu Tyr Cys Ala Gly Tyr Gln Gly Glu Asp Thr Leu
1100 1105 1110
Leu Ile Met Phe Tyr Arg Lys Lys Glu Lys Leu Asp Asp Tyr Lys
1115 1120 1125
Thr Ala Pro Met Gln Gly Phe Tyr Ile Phe Ser Asp Met Ser Ser
1130 1135 1140
Lys Asp Met Thr Asn Glu Gln Cys Asn Ser Tyr Arg Asp Asn Gly
1145 1150 1155
Tyr Thr His Phe Asp Thr Asn Ser Asp Thr Asn Ser Val Ile Arg
1160 1165 1170
Ile Asn Asn Arg Tyr Ala Glu Asp Tyr Glu Ile Pro Ser Leu Ile
1175 1180 1185
Asn His Ser Asn Ser His Asp Trp Gly Glu Tyr Asn Leu Ser Gln
1190 1195 1200
Val Tyr Gly Gly Asn Ile Val Ile Asn Tyr Lys Val Thr Ser Asn
1205 1210 1215
Asp Leu Lys Ile Tyr Ile Ser Pro Lys Leu Arg Ile Ile His Asp
1220 1225 1230
Gly Lys Glu Gly Arg Glu Arg Ile Gln Ser Asn Leu Ile Lys Lys
1235 1240 1245
Tyr Gly Lys Leu Gly Asp Lys Phe Ile Ile Tyr Thr Ser Leu Gly
1250 1255 1260
Ile Asn Pro Asn Asn Ser Ser Asn Arg Phe Met Phe Tyr Pro Val
1265 1270 1275
Tyr Gln Tyr Asn Gly Asn Thr Ser Gly Leu Ala Gln Gly Arg Leu
1280 1285 1290
Leu Phe His Arg Asp Thr Ser Tyr Ser Ser Lys Val Ala Ala Trp
1295 1300 1305
Ile Pro Gly Ala Gly Arg Ser Leu Ile Asn Glu Asn Ala Asn Ile
1310 1315 1320
Gly Asp Asp Cys Ala Glu Asp Ser Val Asn Lys Pro Asp Asp Leu
1325 1330 1335
Lys Gln Tyr Ile Tyr Met Thr Asp Ser Lys Gly Thr Ala Thr Asp
1340 1345 1350
Val Ser Gly Pro Val Asp Ile Asn Thr Ala Ile Ser Ser Glu Lys
1355 1360 1365
Val Gln Ile Thr Ile Lys Ala Gly Lys Glu Tyr Ser Leu Thr Ala
1370 1375 1380
Asn Lys Asp Val Ser Val Gln Pro Ser Pro Ser Phe Glu Glu Met
1385 1390 1395
Cys Tyr Gln Phe Asn Ala Leu Glu Ile Asp Gly Ser Asn Leu Asn
1400 1405 1410
Phe Thr Asn Asn Ser Ala Ser Ile Asp Val Thr Phe Thr Ala Leu
1415 1420 1425
Ala Asp Asp Gly Arg Lys Leu Gly Tyr Glu Ile Phe Asn Ile Pro
1430 1435 1440
Val Ile Gln Lys Val Lys Thr Asp Asn Ala Leu Thr Leu Phe His
1445 1450 1455
Asp Glu Asn Gly Ala Gln Tyr Met Gln Trp Gly Ala Tyr Arg Ile
1460 1465 1470
Arg Leu Asn Thr Leu Phe Ala Arg Gln Leu Val Glu Arg Ala Asn
1475 1480 1485
Thr Gly Ile Asp Thr Ile Leu Ser Met Glu Thr Gln AsnIle Gln
1490 1495 1500
Glu Pro Met Met Gly Ile Gly Ala Tyr Ile Glu Leu Ile Leu Asp
1505 1510 1515
Lys Tyr Asn Pro Asp Ile His Gly Thr Asn Lys Ser Phe Lys Ile
1520 1525 1530
Ile Tyr Gly Asp Ile Phe Lys Ala Gly Asp His Phe Pro Ile Tyr
1535 1540 1545
Gln Gly Ala Leu Ser Asp Ile Thr Gln Thr Thr Val Lys Leu Phe
1550 1555 1560
Leu Pro Arg Val Asp Asn Ala Tyr Gly Asn Lys Asn Asn Leu Tyr
1565 1570 1575
Val Tyr Ala Ala Tyr Gln Lys Val Glu Thr Asn Phe Ile Arg Phe
1580 1585 1590
Val Lys Glu Asp Asn Asn Lys Pro Ala Thr Phe Asp Thr Thr Tyr
1595 1600 1605
Lys Asn Gly Thr Phe Pro Gly Leu Ala Ser Ala Arg Val Ile Gln
1610 1615 1620
Thr Val Ser Glu Pro Met Asp Phe Ser Gly Ala Asn Ser Leu Tyr
1625 1630 1635
Phe Trp Glu Leu Phe Tyr Tyr Thr Pro Met Met Val Ala Gln Arg
1640 1645 1650
Leu Leu His Glu Gln Asn Phe Asp Glu Ala Asn Arg Trp Leu Lys
1655 1660 1665
Tyr Val Trp Ser Pro Ser Gly Tyr Ile Val Arg Gly Gln Ile Lys
1670 1675 1680
Asn Tyr His Trp Asn Val Arg Pro Leu Leu Glu Asn Thr Ser Trp
1685 1690 1695
Asn Ser Asp Pro Leu Asp Ser Val Asp Pro Asp Ala Val Ala Gln
1700 1705 17l0
His Asp Pro Met His Tyr Lys Val Ala Thr Phe Met Arg Thr Leu
1715 1720 1725
Asp Leu Leu Met Ala Arg Gly Asp His Ala Tyr Arg Gln Leu Glu
1730 1735 1740
Arg Asp Thr Leu Asn Glu Ala Lys Met Trp Tyr Met Gln Ala Leu
1745 1750 1755
His Leu Leu Gly Asn Lys Pro Tyr Leu Pro Leu Ser Ser Val Trp
1760 1765 1770
Asn Asp Pro Arg Leu Asp Asn Ala Ala Ala Thr Thr Thr Gln Lys
1775 1780 1785
Ala His Ala Tyr Ala Ile Thr Ser Leu Arg Gln Gly Thr Gln Thr
1790 1795 1800
Pro Ala Leu Leu Leu Arg Ser Ala Asn Thr Leu Thr Asp Leu Phe
1805 1810 1815
Leu Pro Gln Ile Asn Asp Val Met Leu Ser Tyr Trp Asn Lys Leu
1820 1825 1830
Glu Leu Arg Leu Tyr Asn Leu Arg His Asn Leu Ser Ile Asp Gly
1835 1840 1845
Gln Pro Leu His Leu Pro Ile Tyr Ala Thr Pro Ala Asp Pro Lys
1850 1855 1860
Ala Leu Leu Ser Ala Ala Val Ala Thr Ser Gln Gly Gly Gly Lys
1865 1870 1875
Leu Pro Glu Ser Phe Ile Ser Leu Trp Arg Phe Pro His Met Leu
1880 1885 1890
Glu Asn Ala Arg Ser Met Val Thr Gln Leu Ile Gln Phe Gly Ser
1895 1900 1905
Thr Leu Gln Asn Ile Ile Glu Arg Gln Asp Ala Glu Ser Leu Asn
1910 1915 1920
Ala Leu Leu Gln Asn Gln Ala Lys Glu Leu Ile Leu Thr Thr Leu
1925 1930 1935
Ser Ile Gln Asp Lys Thr Ile Glu Glu Ile Asp Ala Glu Lys Thr
1940 1945 1950
Val Leu Glu Lys Ser Lys Ala Gly Ala Lys Ser Arg Phe Asp Asn
1955 1960 1965
Tyr Ser Lys Leu Tyr Asp Glu Asp Val Asn Ala Gly Glu Arg Gln
1970 1975 1980
Ala Leu Asp Met Arg Ile Ala Ser Gln Ser Ile Thr Ser Gly Leu
1985 1990 1995
Lys Gly Leu His Met Ala Ala Ala Ala Leu Glu Met Val Pro Asn
2000 2005 2010
Ile Tyr Gly Phe Ala Val Gly Gly Thr Arg Tyr Gly Ala Ile Ala
2015 2020 2025
Asn Ala Ile Ala Ile Gly Gly Gly Ile Ala Ala Glu Gly Leu Leu
2030 2035 2040
Ile Glu Ala Glu Lys Val Ser Gln Ser Glu Ile Trp Arg Arg Arg
2045 2050 2055
Arg Gln Glu Trp Glu Ile Gln Arg Asn Asn Ala Glu Ala Glu Met
2060 2065 2070
Lys Gln Ile Asp Ala Gln Leu Lys Ser Leu Thr Val Arg Arg Glu
2075 2080 2085
Ala Ala Val Leu Gln Lys Thr Gly Leu Lys Thr Gln Gln Glu Gln
2090 2095 2100
Thr Gln Ala Gln Leu Ala Phe Leu Gln Arg Lys Phe Ser Asn Gln
2105 2110 2115
Ala Leu Tyr Asn Trp Leu Arg Gly Arg Leu Ala Ala Ile Tyr Phe
2120 2125 2130
Gln Phe Tyr Asp Leu Val Val Ala Arg Cys Leu Met Ala Glu Gln
2135 2140 2145
Ala Tyr Arg Trp Glu Thr Asn Asp Ser Ser Ala Arg Phe Ile Lys
2150 2155 2160
Pro Gly Ala Trp Gln Gly Thr Tyr Ala Gly Leu Leu Ala Gly Glu
2165 2170 2175
Thr Leu Met Leu Asn Leu Ala Gln Met Glu Asp Ala His Leu Lys
2180 2185 2190
Gln Glu Gln Arg Ala Leu Glu Val Glu Arg Thr Val Ser Leu Ala
2195 2200 2205
Gln Val Tyr Gln Ser Leu Gly Glu Lys Ser Phe Ala Leu Lys Asp
2210 2215 2220
Lys Ile Glu Ala Leu Leu Gln Gly Asp Lys Glu Thr Ser Ala Gly
2225 2230 2235
Asn Asp Gly Asn Gln Leu Lys Leu Thr Asn Asn Thr Leu Ser Ala
2240 2245 2250
Thr Leu Thr Leu Gln Asp Leu Lys Leu Lys Asp Asp Tyr Pro Glu
2255 2260 2265
Glu Met Gln Leu Gly Lys Thr Arg Arg Ile Lys Gln Ile Ser Val
2270 2275 2280
Ser Leu Pro Ala Leu Leu Gly Pro Tyr Gln Asp Val Gln Ala Val
2285 2290 2295
Leu Ser Tyr Gly Gly Asp Ala Thr Gly Leu Ala Lys Gly Cys Lys
2300 2305 2310
Ala Leu Ala Val Ser His Gly Leu Asn Asp Asn Gly Gln Phe Gln
2315 2320 2325
Leu Asp Phe Asn Asp Gly Lys Phe Leu Pro Phe Glu Gly Ile Asp
2330 2335 2340
Ile Asn Asp Lys Gly Thr Phe Thr Leu Ser Phe Pro Asn Ala Ala
2345 2350 2355
Ser Lys Gln Lys Asn Ile Leu Gln Met Leu Thr Asp Ile Ile Leu
2360 2365 2370
His Ile Arg Tyr Thr Ile Leu Glu
2375 2380
<210>28
<211>4431
<212>DNA
<213>发光光杆状菌
<400>28
atgcagaatt cacaaacatt cagtgttacc gagctgtcat tacccaaagg cggcggcgct 60
attaccggta tgggtgaagc attaacacca gccgggccgg atggtatggc cgccttatcc 120
ctgccattac ccatttccgc cgggcgtggt tacgcaccct cgctcactct gaattacaac 180
agtggaaccg gtaacagccc atttggtctc ggttgggact gcggcgtcat ggcaattcgt 240
cgtcgcacca gtaccggcgt accgaattac gatgaaaccg atacttttct ggggccggaa 300
ggtgaagtgt tggtcgtagc attaaatgag gcaggtcaag ctgatatccg cagtgaatcc 360
tcattgcagg gcatcaattt gggtgcgacc ttcaccgtta cctgttatcg ctcccgccta 420
gaaagccact ttaaccggtt ggaatactgg caaccccaaa caaccggcgc aaccgatttc 480
tggctgatat acagccccga cggacaggtc catttactgg gcaaaaatcc tcaggcacgt 540
atcagcaatc cactcaatgt taaccaaaca gcgcaatggc tgttggaagc ctcgatatca 600
tcccacagcg aacagattta ttatcaatat cgcgctgaag atgaagcagg ttgtgaaacc 660
gacgagctag cagcccaccc cagcgcaacc gttcagcgct acctgcaaac agtacattac 720
gggaacctga ccgccagcga cgtttttcct acactaaacg gagatgaccc acttaaatct 780
ggctggatgt tctgtttagt atttgactac ggtgagcgca aaaacagctt atctgaaatg 840
ccgctgttta aagccacagg caattggctt tgccgaaaag accgtttttc ccgttatgag 900
tacggttttg aattgcgtac tcgccgctta tgccgccaaa tactgatgtt tcaccgtcta 960
caaaccctat ctggtcaggc aaagggggat gatgaacctg cgctagtgtc gcgtctgata 1020
ctggattatg acgaaaacgc gatggtcagt acgctcgttt ctgtccgccg ggtaggccat 1080
gaggacaaca acacggttac cgcgctgcca ccactggaac tggcctatca gccttttgag 1140
ccagaacaaa ccgcactctg gcaatcaatg gatgtactgg caaatttcaa caccattcag 1200
cgctggcaac tgcttgacct gaaaggagaa ggcgtgcccg gcattctcta tcaggataga 1260
aatggctggt ggtatcgatc tgcccaacgt caggccgggg aagagatgaa tgcggtcacc 1320
tgggggaaaa tgcaactcct tcccatcaca ccagctgtgc aggataacgc ctcactgatg 1380
gatattaacg gtgacgggca actggactgg gtgattaccg ggccggggct aaggggctat 1440
cacagccaac acccggatgg cagttggacg cgttttacgc cattacatgc cctgccgata 1500
gaatattctc atcctcgcgc tcaacttgcc gatttaatgg gagccgggct gtccgattta 1560
gtgctaattg gtcccaaaag tgtgcgctta tatgtcaata accgtgatgg ttttaccgaa 1620
gggcgggatg tggtgcaatc cggtgatatc accctgccgc taccgggcgc cgatgcccgt 1680
aagttagtgg catttagtga cgtactgggt tcaggccaag cacatctggt tgaagttagt 1740
gcaactcaag tcacctgctg gccgaatctg gggcatggcc gttttggtca gccaatcgta 1800
ttgccgggat tcagccaatc tgccgccagt tttaatcctg atcgagttca tctggccgat 1860
ttggatggga gcggccctgc cgatttgatt tatgttcatg ctgaccgtct ggatattttc 1920
agcaatgaaa gtggcaacgg ttttgcaaaa ccattcacac tctcttttcc tgacggcctg 1980
cgttttgatg atacctgcca gttgcaagta gccgatgtac aagggttagg cgttgtcagc 2040
ctgatcctaa gcgtaccgca tatggcgcca catcattggc gctgcgatct gaccaacgcg 2100
aaaccgtggt tactcagtga aacgaacaac aatatggggg ccaatcacac cttgcattac 2160
cgtagctctg tccagttctg gctggatgaa aaagctgcgg cattggctac cggacaaaca 2220
ccggtctgtt acctgccctt cccggtccat accctttggc aaacagaaac cgaggatgaa 2280
atcagcggca ataagttagt gaccacgtta cgttatgctc acggcgcttg ggatggacgt 2340
gaacgggaat ttcgtggctt tggttatgtt gagcagacag acagccatca actcgctcaa 2400
ggcaatgcgc cggaacgtac accaccggca ctcaccaaaa gctggtatgc caccggatta 2460
cctgcggtag ataatgcgtt atccgccggg tattggcgtg gcgataagca agctttcgcc 2520
ggttttacgc cacgttttac tctctggaaa gagggcaaag atgttccact gacaccggaa 2580
gatgaccata atctatactg gttaaaccgg gcgctaaaag gtcagccact gcgtagtgaa 2640
ctctacgggc tggatggcag cgcacagcaa cagatcccct atacagtgac tgaatcccgt 2700
ccacaggtgc gccaattaca agatggcgcc accgtttccc cggtgctctg ggcctcagtc 2760
gtggaaagcc gtagttatca ctacgaacgt attatcagtg atccccagtg caatcaggat 2820
atcacgttgt ccagtgacct attcgggcaa ccactgaaac aggtttccgt acaatatccc 2880
cgccgcaaca aaccaacaac caatccgtat cccgataccc taccggatac gctgtttgcc 2940
agcagttatg acgatcaaca acagctattg cgattaacct gccgacaatc cagttggcac 3000
catcttattg gtaatgagct aagagtgttg ggattaccgg atggcacacg cagtgatgcc 3060
tttacttacg atgccaaaca ggtacctgtc gatggcttaa atctggaaac cctgtgtgct 3120
gaaaatagcc tgattgccga tgataaacct cgcgaatacc tcaatcagca acgaacgttc 3180
tataccgacg ggaaaaacca aacaccgctg aaaacaccga cacgacaagc gttaatcgcc 3240
tttaccgaaa cggcggtatt aacggaatct ctgttatccg cgtttgatgg cggtattacg 3300
ccagacgaat taccgggaat actgacacag gccggatacc aacaagagcc ttatctgttt 3360
ccacgcaccg gcgaaaacaa agtttgggta gcgcgtcaag gctataccga ttacgggacg 3420
gaagcacaat tttggcgtcc tgtcgcacaa cgtaacagcc tgttaaccgg gaaaatgacg 3480
ttaaaatggg atactcacta ttgtgtcatc acccaaaccc aagatgctgc cggcctcacc 3540
gtctcagcca attatgactg gcgttttctc acaccaacgc aactgactga catcaacgat 3600
aatgtgcatc tcatcacctt ggatgctctg ggacgccctg tcacgcaacg tttctggggg 3660
atcgaaagcg gtgtggcaac aggttactct tcatcagaag aaaaaccatt ctctccacca 3720
aacgatatcg ataccgctat taatctaacc ggaccactcc ctgtcgcaca gtgtctggtc 3780
tatgcaccgg acagttggat gccactattc agtcaagaaa ccttcaacac attaacgcag 3840
gaagagcagg agacgctgcg tgattcacgt attatcacgg aagattggcg tatttgcgca 3900
ctgactcgcc gccgttggct acaaagtcaa aagatcagta caccattagt taaactgtta 3960
accaacagca ttggtttacc tccccataac cttacgctga ccacagaccg ttatgaccgc 4020
gactctgagc agcaaattcg ccaacaagtc gcatttagtg atggttttgg ccgtctgcta 4080
caagcgtctg tacgacatga ggcaggcgaa gcctggcaac gtaaccaaga cggttctctg 4140
gtgacaaaag tggagaatac caaaacgcgt tgggcggtca cgggacgcac cgaatatgat 4200
aataaagggc aaacgatacg cacttatcag ccctatttcc tcaacgactg gcgatatgtc 4260
agtgatgaca gcgccagaaa agaagcctat gcggatactc atatttatga tccaattggg 4320
cgagaaatcc gggttattac tgcaaaaggc tggctgcgcc aaagccaata tttcccgtgg 4380
tttaccgtga gtgaggatga gaatgatacg gccgctgatg cgctggtgta a 4431
<210>29
<211>4425
<212>DNA
<213>发光光杆状菌
<400>29
atgcaaaatt cacaagattt tagtattacg gaactgtcac tgcccaaagg ggggggcgct 60
atcacgggaa tgggtgaagc attaaccccc actggaccgg atggtatggc cgcgctatct 120
ctaccattgc ctatttctgc cgggcgcggt tatgctcccg cattcactct gaattacaac 180
agcggcgccg gtaacagtcc atttggtctg ggttgggatt gcaacgttat gactatccgc 240
cgccgcaccc attttggcgt cccccattat gacgaaaccg ataccttttt ggggccagaa 300
ggcgaagtgc tggtggtagc ggatcaacct cgcgacgaat ccacattaca gggtatcaat 360
ttaggcgcca cctttaccgt taccggctac cgttcccgtc tggaaagcca tttcagccga 420
ttggaatatt ggcaacccaa aacaacaggt aaaacagatt tttggttgat atatagccca 480
gatgggcagg tgcatctact gggtaaatca ccgcaagcgc ggatcagcaa cccatcccaa 540
acgacacaaa cagcacaatg gctgctggaa gcctctgtat catcacgtgg cgaacaaatt 600
tattatcaat atcgcgccga agatgacaca ggttgcgaag cagatgaaat tacgcaccat 660
ttacaggcta cagcgcaacg ttatttacac atcgtgtatt acggcaaccg tacagccagc 720
gaaacattac ccggtctgga tggcagcgcc ccatcacaag cagactggtt gttctatctg 780
gtatttgatt acggcgaacg cagtaacaac ctgaaaacgc caccagcatt ttcgactaca 840
ggtagctggc tttgccgtca ggaccgtttt tcccgttatg aatatggctt tgagattcgt 900
acccgccgct tatgccgtca ggtattgatg taccatcacc tgcaagcact ggatagtaag 960
ataacagaac acaacggacc aacgctggtt tcacgcctga tactcaatta cgacgaaagc 1020
gcgatagcca gcacgctagt attcgttcgc cgagtgggac acgagcaaga tggtaatgtc 1080
gtcaccctgc cgccattaga attggcatat caggattttt caccgcgaca tcacgctcac 1140
tggcaaccaa tggatgtact ggcaaacttc aatgccattc agcgctggca gctagtcgat 1200
ctaaaaggcg aaggattacc cggcctgtta tatcaggata aaggcgcttg gtggtaccgc 1260
tccgcacagc gtctgggcga aattggctca gatgccgtca cttgggaaaa gatgcaacct 1320
ttatcggtta ttccttcttt gcaaagtaat gcctcgttgg tggatatcaa tggagacggc 1380
caacttgact gggttatcac cggaccggga ttacggggat atcatagtca acgcccggat 1440
ggcagttgga cacgttttac cccactcaac gctctgccgg tggaatacac ccatccacgc 1500
gcgcaactcg cagatttaat gggagccggg ctatccgatt tggtgctgat cggccctaag 1560
agcgtgcgtt tatatgccaa tacccgcgac ggctttgcca aaggaaaaga tgtggtgcaa 1620
tccggtgata tcacactgcc ggtgccgggc gccgatccac gtaagttggt ggcgtttagt 1680
gatgtattgg gttcaggtca agcccatctg gttgaagtaa gcgcgactaa agtcacctgc 1740
tggcctaatc tggggcgcgg acgttttggt caacccatta ccttaccggg attcagccag 1800
ccagcaaccg agtttaaccc ggctcaagtt tatctggccg atctggatgg cagcggtcca 1860
acggatctga tttatgttca tacaaaccgt ctggatatct tcctgaacaa aagtggcaat 1920
ggctttgctg aaccagtgac attacgcttc ccggaaggtc tgcgttttga tcatacctgt 1980
cagttacaaa tggccgatgt acaaggatta ggcgtcgcca gcctgatact gagcgtgccg 2040
catatgtctc cccatcactg gcgctgcgat ctgaccaaca tgaagccgtg gttactcaat 2100
gaaatgaaca acaatatggg ggtccatcac accttgcgtt accgcagttc ctcccaattc 2160
tggctggatg aaaaagccgc ggcgctgact accggacaaa caccggtttg ctatctcccc 2220
ttcccgatcc acaccctatg gcaaacggaa acagaagatg aaatcagcgg caacaaatta 2280
gtcacaacac ttcgttatgc tcgtggcgca tgggacggac gcgagcggga atttcgcgga 2340
tttggttatg tagagcagac agacagccat caactggctc aaggcaacgc gccagaacgt 2400
acgccaccgg cgctgaccaa aaactggtat gccaccggac tgccggtgat agataacgca 2460
ttatcaaccg agtattggcg tgatgatcag gcttttgccg gtttctcacc gcgctttacg 2520
acttggcaag ataacaaaga tgtcccgtta acaccggaag atgataacag tcgttactgg 2580
ttcaaccgcg cgttgaaagg tcaactgcta cgtagtgaac tgtacggatt ggacgatagt 2640
acaaataaac acgttcccta tactgtcact gaatttcgtt cacaggtacg tcgattacag 2700
cataccgaca gccgataccc tgtactttgg tcatctgtag ttgaaagccg caactatcac 2760
tacgaacgta tcgccagcga cccgcaatgc agtcaaaata ttacgctatc cagtgatcga 2820
tttggtcagc cgctaaaaca gctttcggta cagtacccgc gccgccagca gccagcaatc 2880
aatctgtatc ctgatacatt gcctgataag ttgttagcca acagctatga tgaccaacaa 2940
cgccaattac ggctcaccta tcaacaatcc agttggcatc acctgaccaa caataccgtt 3000
cgagtattgg gattaccgga tagtacccgc agtgatatct ttacttatgg cgctgaaaat 3060
gtgcctgctg gtggtttaaa tctggaactt ctgagtgata aaaatagcct gatcgcggac 3120
gataaaccac gtgaatacct cggtcagcaa aaaaccgctt ataccgatgg acaaaataca 3180
acgccgttgc aaacaccaac acggcaagcc ctgattgcct ttaccgaaac aacggtattc 3240
aaccagtcca cattatcagc gtttaacgga agcatcccgt ccgataaatt atcaacgacg 3300
ctggagcaag ctggatatca gcaaacaaat tatctattcc ctcgcactgg agaagataaa 3360
gtttgggtag cccatcacgg ctataccgat tatggtacag cggcacagtt ctggcgcccg 3420
caaaaacaga gcaacaccca actcaccggt aaaatcaccc tcatctggga tgcaaactat 3480
tgcgttgtgg tacaaacccg ggatgctgct ggactgacaa cctcagccaa atatgactgg 3540
cgttttctga ccccggtgca actcaccgat atcaatgaca atcagcacct tatcacactg 3600
gatgcattgg gccgaccaat cacattgcgc ttttggggaa ctgaaaacgg caagatgaca 3660
ggttattcct caccggaaaa agcatcattt tctccaccat ccgatgttaa tgccgctatt 3720
gagttaaaaa aaccgctccc tgtagcacag tgtcaggtct acgcaccaga aagctggatg 3780
ccagtattaa gtcagaaaac cttcaatcga ctggcagaac aagattggca aaagttatat 3840
aacgcccgaa tcatcaccga agatggacgt atctgcacac tggcttatcg ccgctgggta 3900
caaagccaaa aggcaatccc tcaactcatt agcctgttaa acaacggacc ccgtttacct 3960
cctcacagcc tgacattgac gacggatcgt tatgatcacg atcctgagca acagatccgt 4020
caacaggtgg tattcagtga tggctttggc cgcttgctgc aagccgctgc ccgacatgag 4080
gcaggcatgg cccggcaacg caatgaagac ggctctttga ttataaatgt ccagcatact 4140
gagaaccgtt gggcagtgac tggacgaacg gaatatgaca ataaggggca accgatacgt 4200
acctatcagc cctatttcct caatgactgg cgatacgtca gcaatgatag tgcccggcag 4260
gaaaaagaag cttatgcaga tacccatgtc tatgatccca taggtcgaga aatcaaggtt 4320
atcaccgcaa aaggttggtt ccgtcgaacc ttgttcactc cctggtttac tgtcaatgaa 4380
gatgaaaatg acacagccgc tgaggtgaag aaggtaaaga tgtaa 4425
<210>30
<211>4458
<212>DNA
<213>发光光杆状菌
<400>30
atgcaggatt caccagaagt atcgattaca acgctgtcac ttcccaaagg tggcggtgct 60
atcaatggca tgggagaagc actgaatgct gccggccctg atggaatggc ctccctatct 120
ctgccattac ccctttcgac cggcagaggg acggctcctg gattatcgct gatttacagc 180
aacagtgcag gtaatgggcc tttcggcatc ggctggcaat gcggtgttat gtccattagc 240
cgacgcaccc aacatggcat tccacaatac ggtaatgacg acacgttcct atccccacaa 300
ggcgaggtca tgaatatcgc cctgaatgac caagggcaac ctgatatccg tcaagacgtt 360
aaaacgctgc aaggcgttac cttgccaatt tcctataccg tgacccgcta tcaagcccgc 420
cagatcctgg atttcagtaa aatcgaatac tggcaacctg cctccggtca agaaggacgc 480
gctttctggc tgatatcgtc accggacggc caactacaca tcttagggaa aaccgcgcag 540
gcttgtctgg caaatccgca aaatgaccaa caaatcgccc agtggttgct ggaagaaact 600
gtgacgccag ccggtgaaca tgtcagctat caatatcgag ccgaagatga agcccattgt 660
gacgacaatg aaaaaaccgc tcatcccaat gttaccgcac agcgctatct ggtacaggtg 720
aactacggca acatcaaacc acaagccagc ctgttcgtac tggataacgc acctcccgca 780
ccggaagagt ggctgtttca tctggtcttt gaccacggtg agcgcgatac ctcacttcat 840
accgtgccaa catgggatgc aggtacagcg caatggtctg tacgcccgga tatcttctct 900
cgctatgaat atggttttga agtgcgtact cgccgcttat gtcaacaagt gctgatgttt 960
caccgcaccg cgctcatggc cggagaagcc agtaccaatg acgccccgga actggttgga 1020
cgcttaatac tggaatatga caaaaacgcc agcgtcacca cgttgattac catccgtcaa 1080
ttaagccatg aatcggacgg cagcccagtc acccagccac cactagaact agcctggcaa 1140
cggtttgatc tggagaaaat gccgacatgg caacgctttg acgcactaga taattttaac 1200
tcgcagcaac gttatcaact ggttgatctg cggggagaag ggttgccagg tatgctgtat 1260
caagatcgag gcgcttggtg gtataaagct ccgcaacgtc aggaagacgg agacagcaat 1320
gccgtcactt acgacaaaat cgccccactg cctaccctac ccaatttgca ggataatgcc 1380
tcattgatgg atatcaacgg agacggccaa ctggattggg ttgttaccgc ctccggtatt 1440
cgcggatacc atagtcagca acccgatgga aagtggacgc actttacgcc aatcaatgcc 1500
ttgcccgtgg aatattttca tccaagcatc cagttcgctg accttaccgg ggcaggctta 1560
tctgatttag tgttgatcgg gccgaaaagc gtgcgtctat atgccaacca gcgaaacggc 1620
tggcgtaaag gagaagatgt cccccaatcc acaggtatca ccctgcctgt cacagggacc 1680
gatgcccgca aactggtggc tttcagtgat atgctcggtt ccggtcaaca acatctggtg 1740
gaaatcaagg ctaatcgcgt cacctgttgg ccgaatctag ggcatggccg tttcggtcaa 1800
ccactaactc tgtcaggatt tagccagccc gaaaatagct tcaatcccga acggctgttt 1860
ctggcggata tcgacggctc cggcaccacc gaccttatct atgcgcaatc cggctctttg 1920
ctcatttatc tcaaccaaag tggtaatcag tttgatgccc cgttgacatt agcgttgcca 1980
gaaggcgtac aatttgacaa cacttgccaa cttcaagtcg ccgatattca gggattaggg 2040
atagccagct tgattctgac tgtgccacat atcgcgccac atcactggcg ttgtgacctg 2100
tcactgacca aaccctggtt gttgaatgta atgaacaata accggggcgc acatcacacg 2160
ctacattatc gtagttccgc gcaattctgg ttggatgaaa aattacagct caccaaagca 2220
ggcaaatctc cggcttgtta tctgccgttt ccaatgcatt tgctatggta taccgaaatt 2280
caggatgaaa tcagcggcaa ccggctcacc agtgaagtca actacagcca cggcgtctgg 2340
gatggtaaag agcgggaatt cagaggattt ggctgcatca aacagacaga taccacaacg 2400
ttttctcacg gcaccgcccc cgaacaggcg gcaccgtcgc tgagtattag ctggtttgcc 2460
accggcatgg atgaagtaga cagccaatta gctacggaat attggcaggc agacacgcaa 2520
gcttatagcg gatttgaaac ccgttatacc gtctgggatc acaccaacca gacagaccaa 2580
gcatttaccc ccaatgagac acaacgtaac tggctgacgc gagcgcttaa aggccaactg 2640
ctacgcactg agctctacgg tctggacgga acagataagc aaacagtgcc ttataccgtc 2700
agtgaatcgc gctatcaggt acgctctatt cccgtaaata aagaaactga attatctgcc 2760
tgggtgactg ctattgaaaa tcgcagctac cactatgaac gtatcatcac tgacccacag 2820
ttcagccaga gtatcaagtt gcaacacgat atctttggtc aatcactgca aagtgtcgat 2880
attgcctggc cgcgccgcga aaaaccagca gtgaatccct acccgcctac cctgccggaa 2940
acgctatttg acagcagcta tgatgatcaa caacaactat tacgtctggt gagacaaaaa 3000
aatagctggc atcacctgac tgatggggaa aactggcgat taggtttacc gaatgcacaa 3060
cgccgtgatg tttatactta tgaccggagc aaaattccaa ccgaagggat ttcccttgaa 3120
atcttgctga aagatgatgg cctgctagca gatgaaaaag cggccgttta tctgggacaa 3180
caacagacgt tttacaccgc cggtcaagcg gaagtcactc tagaaaaacc cacgttacaa 3240
gcactggtcg cgttccaaga aaccgccatg atggacgata cctcattaca ggcgtatgaa 3300
ggcgtgattg aagagcaaga gttgaatacc gcgctgacac aggccggtta tcagcaagtc 3360
gcgcggttgt ttaataccag atcagaaagc ccggtatggg cggcacggca aggttatacc 3420
gattacggtg acgccgcaca gttctggcgg cctcaggctc agcgtaactc gttgctgaca 3480
gggaaaacca cactgacctg ggatacccat cattgtgtaa taatacagac tcaagatgcc 3540
gctggattaa cgacgcaagc ccattacgat tatcgtttcc ttacaccggt acaactgaca 3600
gatattaatg ataatcaaca tattgtgact ctggacgcgc taggtcgcgt aaccaccagc 3660
cggttctggg gcacagaggc aggacaagcc gcaggctatt ccaaccagcc cttcacacca 3720
ccggactccg tagataaagc gctggcatta accggcgcac tccctgttgc ccaatgttta 3780
gtctatgccg ttgatagctg gatgccgtcg ttatctttgt ctcagctttc tcagtcacaa 3840
gaagaggcag aagcgctatg ggcgcaactg cgtgccgctc atatgattac cgaagatggg 3900
aaagtgtgtg cgttaagcgg gaaacgagga acaagccatc agaacctgac gattcaactt 3960
atttcgctat tggcaagtat tccccgttta ccgccacatg tactggggat caccactgat 4020
cgctatgata gcgatccgca acagcagcac caacagacgg tgagctttag tgacggtttt 4080
ggccggttac tccagagttc agctcgtcat gagtcaggtg atgcctggca acgtaaagag 4140
gatggcgggc tggtcgtgga tgcaaatggc gttctggtca gtgcccctac agacacccga 4200
tgggccgttt ccggtcgcac agaatatgac gacaaaggcc aacctgtgcg tacttatcaa 4260
ccctattttc taaatgactg gcgttacgtt agtgatgaca gcgcacgaga tgacctgttt 4320
gccgataccc acctttatga tccattggga cgggaataca aagtcatcac tgctaagaaa 4380
tatttgcgag aaaagctgta caccccgtgg tttattgtca gtgaggatga aaacgataca 4440
gcatcaagaa ccccatag 4458
<210>31
<211>4479
<212>DNA
<213>嗜线虫致病杆菌
<400>31
atgcagggtt caacaccttt gaaacttgaa ataccgtcat tgccctctgg gggcggatca 60
ctaaaaggaa tgggagaagc actcaatgcc gtcggagcgg aagggggagc gtcattttca 120
ctgcccttgc cgatctctgt cgggcgtggt ctggtgccgg tgctatcact gaattacagc 180
agtactgccg gcaatgggtc attcgggatg gggtggcaat gtggggttgg ttttatcagc 240
ctgcgtaccg ccaagggcgt tccgcactat acgggacaag atgagtatct cgggccggat 300
ggggaagtgt tgagtattgt gccggacagc caagggcaac cagagcaacg caccgcaacc 360
tcactgttgg ggacggttct gacacagccg catactgtta cccgctatca gtcccgcgtg 420
gcagaaaaaa tcgttcgttt agaacactgg cagccacagc agagacgtga ggaagagacg 480
tctttttggg tactttttac tgcggatggt ttagtgcacc tattcggtaa gcatcaccat 540
gcacgtattg ctgacccgca ggatgaaacc agaattgccc gctggctgat ggaggaaacc 600
gtcacgcata ccggggaaca tatttactat cactatcggg cagaagacga tcttgactgt 660
gatgagcatg aacttgctca gcattcaggt gttacggccc agcgttatct ggcaaaagtc 720
agctatggca atactcagcc ggaaaccgct tttttcgcgg taaaatcagg tattcctgct 780
gataatgact ggctgtttca tctggtattt gattacggtg agcgctcatc ttcgctgaac 840
tctgtacccg aattcaatgt gtcagaaaac aatgtgtctg aaaacaatgt gcctgaaaaa 900
tggcgttgtc gtccggacag tttctcccgc tatgaatatg ggtttgaaat tcgaacccgt 960
cgcttgtgtc gccaagttct gatgtttcat cagctgaaag cgctggcagg ggaaaaggtt 1020
gcagaagaaa caccggcgct ggtttcccgt cttattctgg attatgacct gaacaacaag 1080
gtttccttgc tgcaaacggc ccgcagactg gcccatgaaa cggacggtac gccagtgatg 1140
atgtccccgc tggaaatgga ttatcaacgt gttaatcatg gcgtgaatct gaactggcag 1200
tccatgccgc agttagaaaa aatgaacacg ttgcagccat accaattggt tgatttatat 1260
ggagaaggaa tttccggcgt actttatcag gatactcaga aagcctggtg gtaccgtgct 1320
ccggtacggg atatcactgc cgaaggaacg aatgcggtta cctatgagga ggccaaacca 1380
ctgccacata ttccggcaca acaggaaagc gcgatgttgt tggacatcaa tggtgacggg 1440
cgtctggatt gggtgattac ggcatcaggg ttacggggct accacaccat gtcaccggaa 1500
ggtgaatgga caccctttat tccattatcc gctgtgccaa tggaatattt ccatccgcag 1560
gcaaaactgg ctgatattga tggggctggg ctgcctgact tagcgcttat cgggccaaat 1620
agtgtacgtg tctggtcaaa taatcgggca ggatgggatc gcgctcagga tgtgattcat l680
ttgtcagata tgccactgcc ggttcccggc agaaatgagc gtcatcttgt cgcattcagt 1740
gatatgacag gctccgggca atcacatctg gtggaagtaa cggcagatag cgtgcgctac 1800
tggccgaacc tggggcatgg aaaatttggt gagcctctga tgatgacagg cttccagatt 1860
agcggggaaa cgtttaaccc cgacagactg tatatggtag acatagatgg ctcaggcacc 1920
accgatttta tttatgcccg caatacttac cttgaactct atgccaatga aagcggcaat 1980
cattttgctg aacctcagcg tattgatctg ccggatgggg tacgttttga tgatacttgt 2040
cggttacaaa tagcggatac acaaggatta gggactgcca gcattatttt gacgatcccc 2100
catatgaagg tgcagcactg gcgattggat atgaccatat tcaagccttg gctgctgaat 2160
gccgtcaata acaatatggg aacagaaacc acgctgtatt atcgcagctc tgcccagttc 2220
tggctggatg agaaattaca ggcttctgaa tccgggatga cggtggtcag ctacttaccg 2280
ttcccggtgc atgtgttgtg gcgcacggaa gtgctggatg aaatttccgg taaccgattg 2340
accagccatt atcattactc acatggtgcc tgggatggtc tggaacggga gtttcgtggt 2400
tttgggcggg tgacacaaac tgatattgat tcacgggcga gtgcgacaca ggggacacat 2460
gctgaaccac cggcaccttc gcgcacggtt aattggtacg gcactggcgt acgggaagtc 2520
gatattcttc tgcccacgga atattggcag ggggatcaac aggcatttcc ccattttacc 2580
ccacgcttta cccgttatga cgaaaaatcc ggtggtgata tgacggtcac gccgagcgaa 2640
caggaagaat actggttaca tcgagcctta aaaggacaac gtttacgcag tgagctgtat 2700
ggggatgatg attctatact ggccggtacg ccttattcag tggatgaatc ccgcacccaa 2760
gtacgtttgt taccggtgat ggtatcggac gtgcctgcgg tactggtttc ggtggccgaa 2820
tcccgccaat accgatatga acgggttgct accgatccac agtgcagcca aaagatcgtc 2880
cttaaatctg atgcgttagg atttccgcag gacaatcttg agattgccta ttcgagacgt 2940
ccacagcctg agttctcgcc ttatccggat accctgcccg aaacactttt caccagcagt 3000
ttcgacgaac agcagatgtt ccttcgtctg acacgccagc gttcttctta tcatcatctg 3060
aatcatgatg ataatacgtg gatcacaggg cttatggata cctcacgcag tgacgcacgt 3120
atttatcaag ccgataaagt gccggacggt ggattttccc ttgaatggtt ttctgccaca 3180
ggtgcaggag cattgttgtt gcctgatgcc gcagccgatt atctgggaca tcagcgtgta 3240
gcatataccg gtccagaaga acaacccgct attcctccgc tggtggcata cattgaaacc 3300
gcagagtttg atgaacgatc gttggcggct tttgaggagg tgatggatga gcaggagctg 3360
acaaaacagc tgaatgatgc gggctggaat acggcaaaag tgccgttcag tgaaaagaca 3420
gatttccatg tctgggtggg acaaaaggaa tttacagaat atgccggtgc agacggattc 3480
tatcggccat tggtgcaacg ggaaaccaag cttacaggta aaacgacagt cacgtgggat 3540
agccattact gtgttatcac cgcaacagag gatgcggctg gcctgcgtat gcaagcgcat 3600
tacgattatc gatttatggt tgcggataac accacagatg tcaatgataa ctatcacacc 3660
gtgacgtttg atgcactggg gagggtaacc agcttccgtt tctgggggac tgaaaacggt 3720
gaaaaacaag gatatacccc tgcggaaaat gaaactgtcc cctttattgt ccccacaacg 3780
gtggatgatg ctctggcatt gaaacccggt atacctgttg cagggctgat ggtttatgcc 3840
cctctgagct ggatggttca ggccagcttt tctaatgatg gggagcttta tggagagctg 3900
aaaccggctg ggatcatcac tgaagatggt tatctcctgt cgcttgcttt tcgccgctgg 3960
caacaaaata accctgccgc tgccatgcca aagcaagtca attcacagaa cccaccccat 4020
gtactgagtg tgatcaccga ccgctatgat gccgatccgg aacaacaatt acgtcaaacg 4080
tttacgttta gtgatggttt tgggcgaacc ttacaaacag ccgtacgcca tgaaagtggt 4140
gaagcctggg tacgtgatga gtatggagcc attgtggctg aaaatcatgg cgcgcctgaa 4200
acggcgatga cagatttccg ttgggcagtt tccggacgta cagaatatga cggaaaaggc 4260
caagccctgc gtaagtatca accgtatttc ctgaatagtt ggcagtacgt cagtgatgac 4320
agtgcccggc aggatatata tgccgatacc cattactatg atccgttggg gcgtgaatat 4380
caggttatca cggccaaagg cgggtttcgt cgatccttat tcactccctg gtttgtggtg 4440
aatgaagatg aaaatgacac tgccggtgaa atgacagca 4479
<210>32
<211>4521
<212>DNA
<213>伯氏致病杆菌
<400>32
atgaaacaag attcacagga catgacagta acacagctgt ccctgcccaa agggggcggt 60
gcgatcagtg gcatgggtga cactatcagc aatgcagggc cggatgggat ggcttcgctt 120
tccgtgcctt tgcctatctc tgccggtcgg gggggcgcac cgaatttatc cctgaactac 180
agtagcggag caggaaacgg gtcatttggt attggctggc aatccagtac catggctatc 240
agccgtcgta ctcaacatgg cgtaccgcaa tatcacggcg aagatacttt tttatgtccg 300
atgggagaag tgatggcggt tgccgtcaat cagagcgggc aacccgatgt gcgtaaaacc 360
gataaactat taggcgggca actgcctgtt acttataccg ttacgcgtca tcagcccaga 420
aatattcagc acttcagcaa acttgaatac tggcagcccc caacggatgt ggaaaccacg 480
cctttttggt taatgtattc acccgatgga caaattcaca ttttcggaaa aactgagcag 540
gctcagatcg ctaacccggc agaggtttca cagattgccc aatggctttt ggaagaaacc 600
gtaacaccag cgggagaaca catttattac cagtatcggg cagaagacga tatcggttgt 660
gatgacagcg aaaaaaatgc ccaccctaat gccagtgctc aacgttattt gactcaggtg 720
aactacggca atattacacc tgaatccagc ctgcttgtgc tgaagaatac gccaccggcg 780
gataacgaat ggctattcca tttggttttt gattatggtg aacgagcgca ggaaataaac 840
acggttcctc ctttcaaagc accttcaaac aactggaaaa tacggccaga ccgtttctcc 900
cgctttgaat atggttttga ggtgcgaacc cgccgcctgt gtcaacaaat tctgatgttc 960
catcgcctga aatcccttgc aggagaacag attgacggag aagaaatccc tgccttggtt 1020
gcccgtctgc ttctcagtta tgacctgaac gacagcgtga caacccttac cgccattcgg 1080
caaatggcgt atgaaactga cgcaacctta atcgctttac cgccactgga gtttgactat 1140
cagccctttg aggcaaaagt cacgcagaaa tggcaggaaa tgcctcaatt ggccggattg 1200
aatgcccaac aaccttacca actcgtcgat ctctatggtg aaggtatctc cggcatcttg 1260
tatcaggaca gacccggagc atggtggtat caggcaccga tccgtcagaa aaacgttgaa 1320
gatattaacg ctgtcaccta tagcccaata aaccccttac ctaagatccc cagccagcag 1380
gacagagcaa cgttgatgga tatcgacggt gatggacatc tggattgggt gatcgctggc 1440
gcaggtattc aggggcggta cagtatgcag ccgaatggag agtggacaca ctttattccc 1500
atttctgcac tgccaacaga atattttcat ccacaggcac aactggcgga tctggtgggg 1560
gccgggttat ctgatttagc gctgattggc cccagaagtg tgcgtttata tgccaacgac 1620
cgaggaaact ggaaagcggg tattaatgtt atgccacctg atggtgtgaa tttgccgata 1680
tttggtggtg atgccagcag tctggtcgca ttttctgaca tgttgggatc gggacagcag 1740
catttggtgg aaattgccgc tcagagcgtc aaatgctggc cgaatctagg acatggccgt 1800
tttggtgcgg ctattttgct gccggggttt agccagccga atggaacatt caatgctaac 1860
caagtttttc tggcagatat cgatggttcc ggcaccgccg acatcatcta tgcacacagt 1920
acgtatctgg atatttacct gaacgaaagc ggcaaccgtt tcagtgcacc cgttcggctt 1980
aatttgccgg aaggggtgat gtttgacaat acctgtcagt tacaggtgtc ggatattcaa 2040
ggattgggcg ctgccagcat tgtactgacc gtacctcata tgacaccgcg ccattggcgt 2100
tatgatttta ctcacaataa accttggctg ctcaatgtca tcaacaacaa tcgtggcgca 2160
gaaaccacgt tgttttaccg tagttctgcc caattctggc tggatgaaaa aagtcagatc 2220
gaagagctgg gaaaatttgc agcgagttat ctgcctttcc ccatacattt gttgtggcgc 2280
aatgaggcgc tggatgaaat tactggtaat cgactgacta aggtcatgaa ttatgcccac 2340
ggtgcatggg atggcagaga gagagaattt tgcggatttg gccgtgtaac gcaaattgat 2400
accgacgaat ttgccaaggg aaccacagag aaagcgccgg atgaaaatat ctatccttcc 2460
cgtagcataa gctggtttgc cacgggttta ccagaagtgg attctcaact tccggcagaa 2520
tactggcgtg gtgacgatca ggcatttgcc ggctttacac cgcgcttcac tcgttatgaa 2580
aaaggtaatg cggggcaaga ggggcaggat accccgatta aagaaccgac cgaaacagaa 2640
gcgtattggc ttaaccgcgc catgaaaggc caattactgc gcagtgaagt ctatggtgac 2700
gacaaaacag aaaaagctaa aattccgtac accgtcacag aagctcgctg tcaggtcaga 2760
ttaattccca gcaatgacga agccgcgccg tcgtcttgga cgtcgatcat tgaaaaccgc 2820
agttatcact atgagcgtat cgtcgtcgat ccgagttgca aacaacaggt cgtgctcaag 2880
gcggatgaat atggcttccc actggcaaaa gtagatatcg cctatccacg gcgcaataaa 2940
ccggcacaga acccttatcc ggattcgtta ccggatactc tgttcgccga tagctatgac 3000
gaccagcaaa aacagttata tctgacaaaa cagcagcaga gctattacca cctgacccag 3060
caggatgatt gggttctggg tttgacggat agccgataca gcgaagttta tcattatgcg 3120
caaactgacg ctcaaagtga catccccaag gcagggctga tattggaaga cctgctgaaa 3180
gttgacggcc tgataggtaa agacaagact tttatctatt tagggcagca gcgagtggct 3240
tatgtgggag gagatgcaga aaaaccgaca cgtcaggtgc gggtggctta tacagaaacc 3300
gctgcttttg atgacaatgc gctgcacgcc tttgatggcg tgattgcccc tgatgaactg 3360
acgcaacagt tgctggcggg tggatacctg ctcgtgccgc agatttctga tgtggcaggc 3420
agtagtgaaa aggtatgggt agctcggcag ggatacaccg aatacggcag tgctgctcaa 3480
ttctaccggc cactcatcca gcgcaaaagc ttgctgaccg gaaaatatac ccttagttgg 3540
gatacgcact attgtgtggt ggtaaaaacc gaagatggtg cgggaatgac cacgcaagcg 3600
aagtacgatt accgcttcct gcttccggcg caattgacag atatcaatga caaccagcac 3660
atcgtgacat ttaatgcatt ggggcaggtg acttccagcc gtttctgggg cacagaaaat 3720
ggcaaaataa gcggttactc gacgccggag agtaaaccgt tcacagtacc cgataccgtc 3780
gaaaaagccc ttgccttgca accgacgatc ccggtttcac agtgcaacat ttatgtgccg 3840
gatagttgga tgcggcttct gccccaacag tctctgactg gccagctaaa agagggggaa 3900
actttgtgga acgcattaca ccgggcgggt gtagtaacgg aagacggttt gatctgtgaa 3960
ctggcctatc gtcgttggat caaacgtcag gcaacgtctt caatgatggc cgtgacatta 4020
cagcaaatct tggctcagac tccacgacaa cctccgcatg ccatgacgat cacgacagat 4080
cgttatgaca gcgattctca gcagcaactt cggcagtcga tagtattgag tgatggtttt 4140
ggtcgcgtat tgcaaagcgc ccagcgtcat gaagcaggag aggcatggca gcgtgcagaa 4200
gatggttctt tggttgtcga taataccggt aaacccgttg ttgctaatac cacaacgcgc 4260
tgggcagtat ccggtcgcac agaatacgac ggcaaagggc aggcgatcag agcttacctg 4320
ccttattatc tcaatgattg gcgctatgtc agtgatgaca gcgcccggga tgacctgtac 4380
gccgataccc atttttacga tcctctgggg cgtgaatatc aggtaaaaac cgcgaaagga 4440
ttttggcgtg aaaacatgtt tatgccgtgg tttgtcgtca atgaagatga aaatgacaca 4500
gcagcacgtt taacatctta a 4521
<210>33
<211>4335
<212>DNA
<213>类芽胞杆菌属菌株DAS1529
<400>33
atgccacaat ctagcaatgc cgatatcaag ctattgtcgc catcgctgcc aaagggcggc 60
ggttccatga agggaatcga agaaaacatc gcggctcccg gctccgacgg catggcacgt 120
tgtaatgtgc cgctgccggt aacctccggc cgctatatta ctcctgatat aagcctgtcc 180
tatgcgagcg gccacggcaa cggcgcttat ggaatgggct ggacgatggg agtgatgagc 240
attagccgga gaacaagccg agggaccccc agttatacat ccgaagacca gttccttggt 300
ccggatgggg aggtgcttgt tccggaaagc aacgaacaag gggagatcat tacccgccac 360
accgatacgg cccaagggat accgttaggc gagacgttta cggttacacg ctattttccc 420
cggatcgaga gcgcttttca tttgctggaa tactgggaag cgcaagcagg aagcgcaaca 480
gcgtcgtttt ggcttattca ctctgccgat ggagtgctgc actgtctggg taaaactgct 540
caggcgagga tagccgcccc tgacgattcc gccaagatcg cagaatggct agtggaggag 600
tccgtctccc ccttcggaga gcatatttat taccaataca aagaagaaga caatcaaggc 660
gtgaatctgg aggaagacaa tcatcaatat ggggcgaacc gctatctgaa atcgattcgc 720
tatggaaata aggttgcctc tccttctctc tatgtctgga agggggaaat tccggcagac 780
ggccaatggc tgtattccgt tatcctggat tatggcgaga acgatacctc agcggatgtt 840
cctcccctat acacgcccca aggggagtgg ctggtgcgcc cggaccgttt ttcccgctat 900
gactacggat ttgaggtccg gacttgccgc ttgtgccgcc aggtcttgat gttccacgtc 960
tttaaggagc ttggcgggga gccggcgctg gtgtggcgga tgcagttgga atacgacgag 1020
aacccggcgg cgtccatgct gagcgcggtc cggcaattgg cttatgaagc agatggggcc 1080
attcgaagct tgccgccgct ggaattcgat tatactccat ttggcatcga gacaacggcc 1140
gattggcagc cttttctgcc tgtgcctgaa tgggcggatg aagaacatta tcagttggtc 1200
gatttgtacg gagaaggcat accgggctta ttatatcaga acaatgacca ctggcattat 1260
cgttcgcccg cccggggcga cacaccggac gggatcgcct ataacagctg gcggccgctt 1320
cctcatatcc ccgtgaactc ccggaacggg atgctgatgg atctgaatgg agacgggtat 1380
ctggaatggt tgcttgcgga acccggggtt gcggggcgct atagcatgaa cccggataag 1440
agctggtccg gttttgtgcc gctccaggca ctgccaacgg aattcttcca tccgcaggca 1500
cagcttgcca atgttaccgg atcgggttta accgacttgg ttatgatcgg tccgaagagc 1560
gtccggtttt atgccggaga agaagcgggc ttcaagcgcg catgtgaagt gtggcagcaa 1620
gtgggcatta ctttgcctgt ggaacgcgtg gataaaaagg aactggtggc attcagcgat 1680
atgctgggat cgggtcagtc tcatctggtg cgcatccggc atgatggcgt tacatgctgg 1740
cctaatctgg ggaacggcgt gttcggggcg ccgttggccc ttcacgggtt tacggcatcg 1800
gagcgggaat tcaatccgga acgtgtatat cttgtggacc ttgatggatc cggcgcttcc 1860
gatatcattt atgcttctcg tgacgctcta ctcatttacc gaaatctttc cggcaatggc 1920
tttgctgatc cggtgcgggt tccgctgcct gacggcgtgc ggtttgataa tctgtgccgg 1980
ctgctgcctg ccgatatccg cgggttaggt gtggccagtc tggtgctgca tgtaccttac 2040
atggcccccc gcagttggaa attagatttc tttgcggcga agccgtattt attgcaaacg 2100
gtcagcaaca atcttggagc ttccagctcg ttttggtacc gaagctccac ccagtattgg 2160
ctggatgaga aacaggcggc ctcatcggct gtctccgctt tgcccttccc gataaacgtg 2220
gtatcggata tgcacacggt ggacgaaatc agcggccgca ccaggactca gaagtatact 2280
taccgccatg gcgtgtatga ccggaccgaa aaggaatttg ccggattcgg ccgcattgac 2340
acatgggaag aggagcggga ttccgaagga accctgagcg tcagcactcc gcccgtgctg 2400
acgcggacct ggtatcatac cgggcaaaag caggatgagg agcgtgccgt gcagcaatat 2460
tggcaaggcg accctgcggc ttttcaggtt aaacccgtcc ggcttactcg attcgatgcg 2520
gcagcggccc aggatctgcc gctagattct aataatgggc agcaagaata ctggctgtac 2580
cgatcattac aagggatgcc gctgcggact gagatttttg cgggagatgt tggcgggtcg 2640
cctccttatc aggtagagag cttccgttat caagtgcgct tggtgcagag catcgattcg 2700
gaatgtgttg ccttgcccat gcagttggag cagcttacgt acaactatga gcaaatcgcc 2760
tctgatccgc agtgttcaca gcagatacag caatggttcg acgaatacgg cgtggcggca 2820
cagagtgtaa caatccaata tccgcgccgg gcacagccgg aggacaatcc gtaccctcgc 2880
acgctgccgg ataccagctg gagcagcagt tatgattcgc agcaaatgct gctgcggttg 2940
accaggcaaa ggcaaaaagc gtaccacctt gcagatcctg aaggctggcg cttgaatatt 3000
ccccatcaga cacgcctgga tgccttcatt tattctgctg acagcgtgcc cgccgaagga 3060
ataagcgccg agctgctgga ggtggacggc acgttacgat cttcggcgct ggaacaggct 3120
tatggcggcc agtcagagat catctatgcg ggcgggggcg aaccggattt gcgagccctg 3180
gtccattaca ccagaagcgc ggttcttgat gaagactgtt tacaagccta tgaaggcgta 3240
ctgagcgata gccaattgaa ctcgcttctt gcctcttccg gctatcaacg aagcgcaaga 3300
atattgggtt cgggcgatga agtggatatt tttgtcgcgg aacaaggatt tacccgttat 3360
gcggatgaac cgaatttttt ccgtattctg gggcaacaat cctctctctt gtccggggaa 3420
caagtattaa catgggatga taatttctgt gcggttacat ccatcgaaga cgcgcttggc 3480
aatcaaattc agattgcata tgattaccgc tttgtggagg ccatccagat taccgatacg 3540
aataataatg tgaatcaggt cgccctggat gctctcggcc gggtcgtata cagccggacc 3600
tggggcacgg aggaagggat aaagaccggc ttccgcccgg aggtggaatt cgcgacgccc 3660
gagacaatgg agcaggcgct tgccctggca tctcccttgc cggttgcatc ctgctgtgta 3720
tatgatgcgc atagctggat gggaacgata actcttgcac aactgtcaga gcttgttcca 3780
gatagtgaaa agcaatggtc gttcttgata gacaatcgct tgattatgcc ggacggcaga 3840
atcagatccc gcggtcggga tccatggtcg cttcaccggc tattgccgcc tgctgtgggc 3900
gaattgctga gcgaggcgga ccgtaaaccg ccgcatacgg taattttggc agcagatcgt 3960
tacccggatg acccatccca gcaaattcag gcgagcatcg tgtttagcga tggctttggg 4020
cgtacgatac aaactgctaa aagagaagat acccgatggg cgattgcgga acgggtggac 4080
tatgacggaa ccggagccgt aatccgcagc tttcagcctt tttatcttga cgactggaat 4140
tatgtgggcg aagaggctgt cagcagctct atgtacgcaa cgatctatta ttatgatgct 4200
ctggcacgac aattaaggat ggtcaacgct aaaggatatg agaggagaac tgctttttac 4260
ccatggttta cagtaaacga agatgaaaat gataccatgg actcatcatt atttgcttca 4320
ccgcctgcgc ggtga 4335
<210>34
<211>3132
<212>DNA
<213>发光光杆状菌
<400>34
atgagtccgt ctgagactac tctttatact caaaccccaa cagtcagcgt gttagataat 60
cgcggtctgt ccattcgtga tattggtttt caccgtattg taatcggggg ggatactgac 120
acccgcgtca cccgtcacca gtatgatgcc cgtggacacc tgaactacag tattgaccca 180
cgcttgtatg atgcaaagca ggctgataac tcagtaaagc ctaattttgt ctggcagcat 240
gatctggccg gtcatgccct gcggacagag agtgtcgatg ctggtcgtac tgttgcattg 300
aatgatattg aaggtcgttc ggtaatgaca atgaatgcga ccggtgttcg tcagacccgt 360
cgctatgaag gcaacacctt gcccggtcgc ttgttatctg tgagcgagca agttttcaac 420
caagagagtg ctaaagtgac agagcgcttt atctgggctg ggaatacaac ctcggagaaa 480
gagtataacc tctccggtct gtgtatacgc cactacgaca cagcgggagt gacccggttg 540
atgagtcagt cactggcggg cgccatgcta tcccaatctc accaattgct ggcggaaggg 600
caggaggcta actggagcgg tgacgacgaa actgtctggc agggaatgct ggcaagtgag 660
gtctatacga cacaaagtac cactaatgcc atcggggctt tactgaccca aaccgatgcg 720
aaaggcaata ttcagcgtct ggcttatgac attgccggtc agttaaaagg gagttggttg 780
acggtgaaag gccagagtga acaggtgatt gttaagtccc tgagctggtc agccgcaggt 840
cataaattgc gtgaagagca cggtaacggc gtggttacgg agtacagtta tgagccggaa 900
actcaacgtc tgataggtat caccacccgg cgtgccgaag ggagtcaatc aggagccaga 960
gtattgcagg atctacgcta taagtatgat ccggtgggga atgttatcag tatccataat 1020
gatgccgaag ctacccgctt ttggcgtaat cagaaagtgg agccggagaa tcgctatgtt 1080
tatgattctc tgtatcagct tatgagtgcg acagggcgtg aaatggctaa tatcggtcag 1140
caaagcaacc aacttccctc acccgttata cctgttccta ctgacgacag cacttatacc 1200
aattaccttc gtacctatac ttatgaccgt ggcggtaatt tggttcaaat ccgacacagt 1260
tcacccgcga ctcaaaatag ttacaccaca gatatcaccg tttcaagccg cagtaaccgg 1320
gcggtattga gtacattaac gacagatcca acccgagtgg atgcgctatt tgattccggc 1380
ggtcatcaga agatgttaat accggggcaa aatctggatt ggaatattcg gggtgaattg 1440
caacgagtca caccggtgag ccgtgaaaat agcagtgaca gtgaatggta tcgctatagc 1500
agtgatggca tgcggctgct aaaagtgagt gaacagcaga cgggcaacag tactcaagta 1560
caacgggtga cttatctgcc gggattagag ctacggacaa ctggggttgc agataaaaca 1620
accgaagatt tgcaggtgat tacggtaggt gaagcgggtc gcgcacaggt aagggtattg 1680
cactgggaaa gtggtaagcc gacagatatt gacaacaatc aggtgcgcta cagctacgat 1740
aatctgcttg gctccagcca gcttgaactg gatagcgaag ggcagattct cagtcaggaa 1800
gagtattatc cgtatggcgg tacggcgata tgggcggcga gaaatcagac agaagccagc 1860
tacaaattta ttcgttactc cggtaaagag cgggatgcca ctggattgta ttattacggc 1920
taccgttatt atcaaccttg ggtgggtcga tggttgagtg ctgatccggc gggaaccgtg 1980
gatgggctga atttgtaccg aatggtgagg aataacccca tcacattgac tgaccatgac 2040
ggattagcac cgtctccaaa tagaaatcga aatacatttt ggtttgcttc atttttgttt 2100
cgtaaacctg atgagggaat gtccgcgtca atgagacggg gacaaaaaat tggcagagcc 2160
attgccggcg ggattgcgat tggcggtctt gcggctacca ttgccgctac ggctggcgcg 2220
gctatccccg tcattctggg ggttgcggcc gtaggcgcgg ggattggcgc gttgatggga 2280
tataacgtcg gtagcctgct ggaaaaaggc ggggcattac ttgctcgact cgtacagggg 2340
aaatcgacgt tagtacagtc ggcggctggc gcggctgccg gagcgagttc agccgcggct 2400
tatggcgcac gggcacaagg tgtcggtgtt gcatcagccg ccggggcggt aacaggggct 2460
gtgggatcat ggataaataa tgctgatcgg gggattggcg gcgctattgg ggccgggagt 2520
gcggtaggca ccattgatac tatgttaggg actgcctcta cccttaccca tgaagtcggg 2580
gcagcggcgg gtggggcggc gggtgggatg atcaccggta cgcaagggag tactcgggca 2640
ggtatccatg ccggtattgg cacctattat ggctcctgga ttggttttgg tttagatgtc 2700
gctagtaacc ccgccggaca tttagcgaat tacgcagtgg gttatgccgc tggtttgggt 2760
gctgaaatgg ctgtcaacag aataatgggt ggtggatttt tgagtaggct cttaggccgg 2820
gttgtcagcc catatgccgc cggtttagcc agacaattag tacatttcag tgtcgccaga 2880
cctgtctttg agccgatatt tagtgttctc ggcgggcttg tcggtggtat tggaactggc 2940
ctgcacagag tgatgggaag agagagttgg atttccagag cgttaagtgc tgccggtagt 3000
ggtatagatc atgtcgctgg catgattggt aatcagatca gaggcagggt cttgaccaca 3060
accgggatcg ctaatgcgat agactatggc accagtgctg tgggagccgc acgacgagtt 3120
ttttctttgt aa 3132
<210>35
<211>2745
<212>DNA
<213>发光光杆状菌
<400>35
atgagcagtt acaattctgc aattgaccaa aagaccccct cgattaaggt attagataac 60
aggaaattaa atgtacgtac tttagaatat ctacgcactc aagctgacga aaacagtgat 120
gaattaatta cgttctatga gttcaatatt ccgggatttc aggtaaaaag caccgatcct 180
cgtaaaaata aaaaccagag cggcccaaat ttcattcgtg tctttaatct tgccggtcaa 240
gttttacgtg aagaaagtgt tgatgccggt cggactatta ccctcaatga tattgaaagt 300
cgcccggtgt tgatcatcaa tgcaaccggt gtccgccaaa accatcgtta tgaagataac 360
acccttcccg gtcgtctgct cgctatcacc gaacaagtac aggcaggaga gaaaacgacc 420
gaacgtctta tctgggccgg caatacgccg caagaaaaag attacaacct cgccggtcag 480
tgtgtccgcc attacgatac cgcgggactt actcaactca atagcctttc tctggctggc 540
gtcgtgctat cacaatctca acaactgctt accgataacc aggatgccga ctggacaggt 600
gaagaccaga gcctctggca acaaaaactg agtagtgatg tctatatcac ccaaagtaac 660
actgatgcca ccggggcttt actgacccag accgatgcca aaggcaacat tcagcggctg 720
gcctatgatg tggccgggca gctaaaaggg agttggttaa cactcaaagg tcaggcggaa 780
caggtgatta tcaaatcgct aacctactcc gccgccgggc aaaaattacg tgaagagcac 840
ggtaacggga ttgtcactga atacagctac gaaccggaaa cccaacggct tatcggcatt 900
accactcgcc gtccatcaga cgccaaggtg ttgcaagacc tacgctatca atatgaccca 960
gtaggcaatg tcattaatat ccgtaatgat gcggaagcca ctcgcttttg gcgcaatcag 1020
aaagtagccc cggagaatag ctatacctac gattccctgt atcagcttat cagcgccacc 1080
gggcgcgaaa tggccaatat cggtcagcaa aacaaccaac ttccctcccc tgcgctacct 1140
tctgacaaca atacctacac taactatact cgcagctaca gctatgatca cagtggtaat 1200
ctgacgcaaa ttcggcacag ctcgccagct acccagaaca actacaccgt ggctatcacc 1260
ctctcaaacc gcagcaatcg gggtgttctc agtacgctaa ccaccgatcc aaatcaagtg 1320
gatacgttgt ttgatgccgg tggtcaccaa accagtttat tacccggaca gacacttatc 1380
tggacaccac gaggagagtt aaagcaggtt aataatggcc cgggaaatga gtggtaccgc 1440
tacgacagca acggcatgag acaactgaaa gtgagtgaac agccaaccca gaatactacg 1500
cagcaacaac gggtaatcta tttgccggga ctggagctac gcacaaccca gagcaacgcc 1560
acaacaacgg aagagttaca cgttatcaca ctcggtgaag ccggtcgcgc acaggtacgg 1620
gtgttgcact gggagagcgg taagccagaa gatgtcaaca ataatcaact acgttacagc 1680
tacgataatc tgatcggctc cagccagctt gaactggaca accaaggaca aattatcagc 1740
gaggaagagt attatccatt tggcgggaca gcgctgtggg cagcaaacag ccaaacagaa 1800
gccagctata aaacgattcg ctattccggc aaagaacgag atgccaccgg gttgtattat 1860
tacggttatc gttattacca accgtgggcg ggcagatggt taagcgcgga cccggcagga 1920
accattgatg ggctgaatct ataccgaatg gtaagaaata atcctgtgag tttacaagat 1980
gaaaatggat tagcgccaga aaaagggaaa tataccaaag aggtaaattt ctttgatgaa 2040
ttaaaattca aattggcagc caaaagttca catgttgtca aatggaacga gaaagagagc 2100
agttatacaa aaaataaatc attgaaagtg gttcgtgtcg gtgattccga tccgtcgggt 2160
tatttgctaa gccacgaaga gttactaaaa ggtatagaaa aaagtcaaat catatatagc 2220
cgacttgaag aaaacagctc cctttcagaa aaatcaaaaa cgaatctttc tttaggatct 2280
gaaatatccg gttatatggc aagaaccata caagatacga tatcagaata tgccgaagag 2340
cataaatata gaagtaatca ccctgatttt tattcagaaa ccgatttctt tgcgttaatg 2400
gataaaagtg aaaaaaatga ttattccggt gaaagaaaaa tttatgcggc aatggaggtt 2460
aaggtttatc atgatttaaa aaataaacaa tcagaattac atgtcaacta tgcattggcc 2520
catccctata cgcaattgag taatgaagaa agagcgctgt tgcaagaaac agaacccgct 2580
attgcaatag atagagaata taatttcaaa ggtgttggca aattcctgac aatgaaagca 2640
attaaaaaat cattgaaagg acataaaatt aataggatat caacagaggc tattaatatt 2700
cgctctgcgg ctatcgctga gaatttagga atgcggagaa cttca 2745
<210>36
<211>2883
<212>DNA
<213>发光光杆状菌
<400>36
atgaaaaaca ttgatcccaa actttatcaa aaaaccccta ctgtcagcgt ttacgataac 60
cgtggtctga taatccgtaa catcgatttt catcgtacta ccgcaaatgg tgatcccgat 120
acccgtatta cccgccatca atacgatatt cacggacacc taaatcaaag catcgatccg 180
cgcctatatg aagccaagca aaccaacaat acgatcaaac ccaattttct ttggcagtat 240
gatttgaccg gtaatcccct atgtacagag agcattgatg caggtcgcac tgtcaccttg 300
aatgatattg aaggccgtcc gctactaacg gtgactgcaa caggggttat acaaactcga 360
caatatgaaa cttcttccct gcccggtcgt ctgttatctg ttgccgaaca aacacccgag 420
gaaaaaacat cccgtatcac cgaacgcctg atttgggctg gcaataccga agcagagaaa 480
gaccataacc ttgccggcca gtgcgtgcgt cactatgaca cggcgggagt tacccggtta 540
gagagtttat cactgaccgg tactgtttta tctcaatcca gccaactatt gatcgacact 600
caagaggcaa actggacagg tgataacgaa accgtctggc aaaacatgct ggctgatgac 660
atctacacaa ccctgagcac cttcgatgcc accggtgctt tactgactca gaccgatgcg 720
aaagggaaca ttcagagact ggcttatgat gtggccgggc agctaaacgg gagctggcta 780
acactcaaag gccagacgga acaagtgatt atcaaatccc tgacctactc cgccgccgga 840
caaaaattac gtgaggaaca cggcaatgat gttatcaccg aatacagtta tgaaccggaa 900
acccaacggc tgatcggtat caaaacccgc cgtccgtcag acactaaagt gctacaagac 960
ctgcgctatg aatatgaccc ggtaggcaat gtcatcagca tccgtaatga cgcggaagcc 1020
acccgctttt ggcacaatca gaaagtgatg ccggaaaaca cttataccta cgattccctg 1080
tatcagctta tcagcgccac cgggcgcgaa atggcgaata taggtcaaca aagtcaccaa 1140
tttccctcac ccgctctacc ttctgataac aacacctata ccaactatac ccgtacttat 1200
acttatgacc gtggcggcaa tctgaccaaa atccagcaca gttcaccggc gacgcaaaac 1260
aactacacca ccaatatcac ggtttcaaat cgcagcaacc gcgcagtact cagcacattg 1320
accgaagatc cggcgcaagt agatgctttg tttgatgcag gcggacatca gaacaccttg 1380
atatcaggac aaaacctgaa ctggaatact cgtggtgaac tgcaacaagt aacactggtt 1440
aaacgggaca agggcgccaa tgatgatcgg gaatggtatc gttatagcgg tgacggaaga 1500
aggatgttaa aaatcaatga acagcaggcc agcaacaacg ctcaaacaca acgtgtgact 1560
tatttgccga acttagaact tcgtctaaca caaaacagca cggccacaac cgaagatttg 1620
caagttatca ccgtaggcga agcgggccgg gcacaggtac gagtattaca ttgggagagc 1680
ggtaaaccgg aagatatcga caataatcag ttgcgttata gttacgataa tcttatcggt 1740
tccagtcaac ttgaattaga tagcgaagga caaattatca gtgaagaaga atattatccc 1800
tatggtggaa cagcattatg ggccgccagg aatcagacag aagccagtta taaaactatc 1860
cgttattcag gcaaagagcg ggatgccacc gggctatatt actacggcta tcggtattac 1920
caaccgtgga taggacggtg gttaagctcc gatccggcag gaacaatcga tgggctgaat 1980
ttatatcgga tggtgaggaa taatccagtt accctccttg atcctgatgg attaatgcca 2040
acaattgcag aacgcatagc agcactaaaa aaaaataaag taacagactc agcgccttcg 2100
ccagcaaatg ccacaaacgt agcgataaac atccgcccgc ctgtagcacc aaaacctagc 2160
ttaccgaaag catcaacgag tagccaacca accacacacc ctatcggagc tgcaaacata 2220
aaaccaacga cgtctgggtc atctattgtt gctccattga gtccagtagg aaataaatct 2280
acttctgaaa tctctctgcc agaaagcgct caaagcagtt cttcaagcac tacctcgaca 2340
aatctacaga aaaaatcatt tactttatat agagcagata acagatcctt tgaagaaatg 2400
caaagtaaat tccctgaagg atttaaagcc tggactcctc tagacactaa gatggcaagg 2460
caatttgcta gtatctttat tggtcagaaa gatacatcta atttacctaa agaaacagtc 2520
aagaacataa gcacatgggg agcaaagcca aaactaaaag atctctcaaa ttacataaaa 2580
tataccaagg acaaatctac agtatgggtt tctactgcaa ttaatactga agcaggtgga 2640
caaagctcag gggctccact ccataaaatt gatatggatc tctacgagtt tgccattgat 2700
ggacaaaaac taaatccact accggagggt agaactaaaa acatggtacc ttccctttta 2760
ctcgacaccc cacaaataga gacatcatcc atcattgcac ttaatcatgg accggtaaat 2820
gatgcagaaa tttcatttct gacaacaatt ccgcttaaaa atgtaaaacc tcataagaga 2880
taa 2883
<210>37
<211>2850
<212>DNA
<213>发光光杆状菌
<400>37
atgaaaaaca ttgacccaaa actttatcaa catacgccca ccgttaacgt ctacgataac 60
cgtggcctga ccattcgtaa catcgacttt caccgtgacg tcgcgggagg cgatacagat 120
actcgtatta cccgccacca atatgatacc cgaggacact tgagccaaag cattgatcca 180
cggctgtatg acgccaaaca aaccaataac tcgacaaacc ccaacttcct ctggcaatac 240
aatctcaccg gcgacacttt gcggacagaa agtgtcgatg ccggccgtac cgtagccctc 300
aatgatattg aaggccgtca agtgttgatt gtaaccgcaa ccggcgccat tcagacccga 360
caatatgaag ccaataccct gcccggtcgt ctattatccg taagtgaaca agcccccgga 420
gaacagactc cccgcgttac tgagcatttt atttgggctg gtaatacaca ggcggagaaa 480
gatcataatc ttgccggcca gtatgtgcgc cactacgaca cagcaggagt gacgcaactg 540
gaaagcctgt cattgacaga aaacatctta tctcaatccc gtcagttatt agccgacggt 600
caggaagcag actggacagg taacgatgaa accctctggc agaccaaact caatagcgaa 660
acttacacga cacaaagcac ctttgatgct accggcgctt tgctgaccca aaccgatgca 720
aaaggcaaca tgcaacgtct ggcttacaac gtggcaggac aattacaagg tagctggctg 780
acattgaaaa accaaagtga gcaagtcatt gtcaaatccc tgacctattc cgccgcaggc 840
cagaaattgc gtgaagaaca cggtaatggc gttatcactg aatacagcta tgaaccggaa 900
actctacgat tgatcggtac cactactcgc cgtcaatcag atagcaaggt gttacaagat 960
ctacgctatg aacatgatcc tgtagggaat attattagtg tccgtaatga tgcagaagcc 1020
acccgcttct ggcgcaatca gaaaatagtc cctgaaaata cctacaccta cgattccctg 1080
tatcagctta tcagtgcaac aggacgtgag atggctaaca tcggccagca aagcaaccaa 1140
cttccttcgc caatcatccc tcttcctact gatgaaaact catataccaa ctatactcgc 1200
agctataatt acgatcgcgg cggcaatttg gttcaaatcc ggcacagttc ccccgccgcc 1260
caaaataact acaccacaga tatcaccgtt tcgaatcgca gtaaccgggc agtgctgagt 1320
tcgctaacct cagacccaac acaggtggag gcactgtttg atgccggcgg acatcaaaca 1380
aaattgttac cggggcaaga gctgagttgg aatacacgag gtgaactaaa acaggtaacg 1440
ccagtcagtc gcgagagcgc cagcgatcgg gaatggtatc gttacggcaa cgacggcatg 1500
cgacggttaa aagtcagtga gcaacagact ggcaacagca cgcagcagca acgagtaact 1560
tatcttcccg atctggagct acgtacaaca caaaatggga ctactacatc agaagacctg 1620
catgctatta ccgtgggagc agcaggccac gcacaagtgc gagttctaca ctgggaaact 1680
acgccaccag ccggtatcaa taacaatcag cttcgctata gctatgataa tttgattggt 1740
tccagtcaac ttgaactgga taacgcagga caaattatca gtcaggaaga gtattatcca 1800
tttggcggca cagcattatg ggcagcaaga aaccaaatag aagccagcta caaaatcctc 1860
cgttactcag gtaaagaacg cgatgctacc gggctctatt attacggcta ccgctattat 1920
cagccgtggg ttggtaggtg gttaagcgcc gatccggctg gaacaatcga tggactgaat 1980
ctataccgga tggtgagaaa taatccgtca acactggttg atatttctgg gcttgcacct 2040
acgaaataca atattcccgg atttgacttt gatgtagaaa tagatgagca aaaaagatct 2100
aaattaaaac caacgttgat aagaatcaaa gatgaatttt tacattatgg tcctgtagat 2160
aagctgttag aagaaaaaaa acccggcctc aatgtaccag aggagctatt tgatagaggt 2220
ccatccgaga atggagtgtc aacattaact ttcaaaaaag acctaccgat aagttgtatt 2280
agcaacacag aatataccct tgatatctta tacaacaaac atgagactaa accattccct 2340
tacgaaaacg aagcaacagt tggcgcagat ctgggagtaa taatgtccgt ggagtttgga 2400
aataaatcaa taggtaatgc ctctgacgaa gatttaaaag aagaacatct cccattagga 2460
aaatccacaa tggataaaac agacctgcca gatttaaaac aagggctaat gatcgcggag 2520
aagataaaaa gtggaaaagg ggcatatcct tttcattttg gtgctgcaat agctgttgta 2580
tatggtgagg ataaaaaagt agccgcttca attctgacag atttatctga acctaaaaga 2640
gacgaaggcg agtatttgca aagtacgaga aaggtaagcg caatgtttat cacaaacgtc 2700
aatgaatttc gcggccatga ttacccaaaa agtaaatata gtatcggatt agttacagct 2760
gaaaaacgtc agccagtaat aagcaaaaaa cgtgcaaacc cggaagaggc cccttcatca 2820
tccagaaata aaaaattgca tgtacattaa 2850
<210>38
<211>2817
<212>DNA
<213>发光光杆状菌
<400>38
atggaaaaca ttgacccaaa actttatcac catacgccta ccgtcagtgt tcacgataac 60
cgtggactag ctatccgtaa tattagtttt caccgcacta ccgcagaagc aaataccgat 120
acccgtatta cccgccatca atataatgcc ggcggatatt tgaaccaaag cattgatcct 180
cgcctgtatg acgccaaaca gactaacaac gctgtacaac cgaattttat ctggcgacat 240
aatttgaccg gcaatatcct gcgaacagag agcgtcgatg ccggtcggac gattaccctc 300
aacgatattg aaggccgccc ggtgttgacc atcaatgcag ccggtgtccg gcaaaaccat 360
cgctacgaag ataacaccct gcccggtcgc ctgctcgcta tcagcgaaca aggacaggca 420
gaagagaaaa cgaccgagcg ccttatctgg gccggcaata cgccgcaaga aaaagaccac 480
aaccttgccg gtcagtgcgt ccgccattac gataccgcag gactcactca actcaacagc 540
cttgccctga ccggcgccgt tctatcacaa tctcaacaac tgcttaccga taaccaggat 600
gccgactgga caggtgaaga ccagagcctc tggcaacaaa aactgagtag tgatgtctat 660
atcacccaaa gtaacactga tgccaccggg gctttactga cccagaccga tgccaaaggc 720
aacattcagc ggctggccta tgatgtggcc gggcagctaa aagggagttg gttaacactc 780
aaaggtcagg cggaacaggt gattatcaaa tcgctaacct actccgccgc cgggcaaaaa 840
ttacgtgaag agcacggtaa cgggattgtc actgaataca gctacgaacc ggaaacccaa 900
cggcttatcg gcattaccac tcgccgtcca tcagacgcca aggtgttgca agacctacgc 960
tatcaatatg acccagtagg caatgtcatt agtatccgta atgatgcgga agccactcgc 1020
ttttggcgca atcagaaagt agccccggag aatagctata cctacgattc cctgtatcag 1080
cttatcagcg ccaccgggcg cgagatggcc aatatcggtc agcaaagcaa ccaacttccc 1140
tctccggcgc taccttctga taacaatacc tacaccaact atactcgcac ttatacttat 1200
gaccgtggcg gcaatttgac gaaaattcag catagttcac cagccgcgca aaataactac 1260
acgacggata taacggtttc aaatcgcagc aaccgcgcgg tactcagcac attgaccgca 1320
gatccaactc aagtcgatgc cttatttgat gcgggaggcc atcaaaccag cttgttatcc 1380
ggccaagttc taacttggac accgcgaggc gaattgaaac aagccaacaa tagcgcagga 1440
aatgagtggt atcgctacga tagcaacggc atacgccagc taaaagtgaa tgaacaacaa 1500
actcagaata tcccgcaaca acaaagggta acttatctac cggggctgga aatacgtaca 1560
acccagaaca acgccacaac aacagaagag ttacacgtta tcacactcgg taaagccggc 1620
cgcgcgcaag tccgagtatt gcattgggag agcggtaaac cagaagatat taataacaat 1680
cagcttcgtt acagctacga taatcttatt ggctccagcc aacttcaatt agatagcgac 1740
ggacaaatta tcagtgaaga agaatattat ccatttggtg gtacagcgct gtgggcggca 1800
aggaatcaaa ccgaagccag ctataaaacc attcgttatt ctggtaaaga gcgggatgtt 1860
accgggctgt attattatgg ctaccgttat taccaaccgt gggcgggcag atggttaggt 1920
gcagacccgg caggaaccat tgatggactg aatttatatc gcatggtgag aaataacccg 1980
gtgacgcaat ttgatgttca gggattatca ccggccaaca gaacagaaga agcgataata 2040
aaacagggtt cctttacggg aatggaagaa gctgtttata aaaaaatggc taaacctcaa 2100
actttcaaac gccaaagagc tatcgctgcc caaacagagc aagaagccca tgaatcattg 2160
accaacaacc ctagtgtaga tattagccca attaaaaact acaccacaga tagctcacaa 2220
attaatgccg cgataaggga aaatcgtatt acgccagcag tggaaagttt agacgccaca 2280
ttatcttccc tacaagatag acaaatgagg gtaacttatc gggtgatgac ctatgtagat 2340
aattccacgc catcgccttg gcactcgcca caggaaggaa atagtattaa tgttggtgat 2400
atcgtttcgg ataacgctta tttatcaaca tcggcccatc gtggttttct gaattttgtt 2460
cacaaaaaag aaaccagtga aactcgatac gtcaagatgg catttttaac gaatgcgggt 2520
gtcaatgtcc cagcagcatc tatgtataat aatgctggcg aggagcaagt atttaaaatg 2580
gatttaaacg attcaagaaa aagccttgct gaaaaattaa aactaagagt cagtggacca 2640
caatcgggac aagcggaaat attactacct agggaaacac agttcgaagt tgtttcaatg 2700
aaacatcaag gcagagatac ctatgtatta ttgcaagata ttaaccaatc cgcagccact 2760
catagaaatg tacgtaacac ttacaccggt aatttcaaat catccagtgc aaattaa 2817
<210>39
<211>3048
<212>DNA
<213>嗜线虫致病杆菌
<400>39
atgaagaatt tcgttcacag caatacgcca tccgtcaccg tactggacaa ccgtggtcag 60
acagtacgcg aaatagcctg gtatcggcac cccgatacac ctcaggtaac cgatgaacgc 120
atcaccggtt atcaatatga tgctcaagga tctctgactc agagtattga tccgcgattt 180
tatgaacgcc agcagacagc gagtgacaag aacgccatta cacccaatct tattctcttg 240
tcatcactca gtaagaaggc attgcgtacg caaagtgtgg atgccggaac ccgtgtcgcc 300
ctgcatgatg ttgccgggcg tcccgtttta gctgtcagcg ccaatggcgt tagccgaacg 360
tttcagtatg aaagtgataa ccttccggga cgattgctaa cgattaccga gcaggtaaaa 420
ggagagaacg cctgtatcac ggagcgattg atctggtcag gaaatacgcc ggcagaaaaa 480
ggcaataatc tggccggcca gtgcgtggtc cattatgatc ccaccggaat gaatcaaacc 540
aacagcatat cgttaaccag catacccttg tccatcacac agcaattact gaaagatgac 600
agcgaagccg attggcacgg tatggatgaa tctggctgga aaaacgcgct ggcgccggaa 660
agcttcactt ctgtcagcac aacggatgct accggcacgg tattaacgag tacagatgct 720
gccggaaaca agcaacgtat cgcctatgat gtggccggtc tgcttcaagg cagttggttg 780
gcgctgaagg ggaaacaaga acaagttatc gtgaaatccc tgacctattc ggctgccagc 840
cagaagctac gggaggaaca tggtaacggg atagtgacta catataccta tgaacccgag 900
acgcaacgag ttattggcat aaaaacagaa cgtccttccg gtcatgccgc tggggagaaa 960
attttacaaa acctgcgtta tgaatatgat cctgtcggaa atgtgctgaa atcaactaat 1020
gatgctgaaa ttacccgctt ttggcgcaac cagaaaattg taccggaaaa tacttacacc 1080
tatgacagcc tgtaccagct ggtttccgtc actgggcgtg aaatggcgaa tattggccga 1140
caaaaaaacc agttacccat ccccgctctg attgataaca atacttatac gaattactct 1200
cgcacttacg actatgatcg tgggggaaat ctgaccagaa ttcgccataa ttcaccgatc 1260
accggtaata actatacaac gaacatgacc gtttcagatc acagcaaccg ggctgtactg 1320
gaagagctgg cgcaagatcc cactcaggtg gatatgttgt tcacccccgg cgggcatcag 1380
acccggcttg ttcccggtca ggatcttttc tggacacccc gtgacgaatt gcaacaagtg 1440
atattggtca atagggaaaa tacgacgcct gatcaggaat tctaccgtta tgatgcagac 1500
agtcagcgtg tcattaagac tcatattcag aagacaggta acagtgagca aatacagcga 1560
acattatatt tgccagagct ggaatggcgc acgacatata gcggcaatac attaaaagag 1620
tttttgcagg tcatcactgt cggtgaatcg ggtcaggcac aagtgcgggt gctgcattgg 1680
gaaacaggca aaccggcgga tatcagcaat gatcagctgc gctacagtta tggcaacctg 1740
attggcagta gcgggctgga attggacagt gacgggcaga tcattagtca ggaagaatat 1800
tacccctatg ggggaaccgc cgtgtgggca gcccgaagtc agtcagaagc tgattacaaa 1860
accgtgcgtt attctggcaa agagcgggat gcaacagggt tgtattacta cggttatcgt 1920
tattatcaat cgtggacagg gcgatggttg agtgtagatc ctgccggtga ggtcgatggt 1980
ctcaatttgt tccgaatgtg caggaataac cccatcgttt tttctgattc tgatggtcgt 2040
ttccccggtc agggtgtcct tgcctggata gggaaaaaag cgtatcgaaa ggcagtcaac 2100
atcacgacag aacacctgct tgaacaaggc gcttcctttg atacgttctt gaaattaaac 2160
cgaggattgc gaacgtttgt tttgggtgtg ggggtagcaa gtctgggggt gaaggcggcc 2220
acgattgcag gagcgtcgcc ttgggggatt gtcggggctg ccattggtgg ttttgtctcc 2280
ggggcggtga tggggttttt cgcgaacaac atctcagaaa aaattgggga agttttaagt 2340
tatctgacgc gtaaacgttc tgttcctgtt caggttggcg cttttgttgt cacatcgctt 2400
gtgacgtctg cactatttaa cagctcttcg acaggtaccg ccatttccgc agcaacagcg 2460
gtcaccgttg gaggattaat ggctttagcc ggagagcata acacgggcat ggctatcagt 2520
attgccacac ccgccggaca aggtacgctg gatacgctca ggcccggtaa tgtcagcgcg 2580
ccagagcggt taggggcact atcaggcgca attattggcg gcatattact tggccgccat 2640
cagggaagtt ctgagctggg tgaacgggca gcgattggtg ctatgtatgg tgctcgatgg 2700
ggaaggatca ttggtaatct atgggatggc ccttatcggt ttatcggcag gttactgctc 2760
agaagaggca ttagctctgc catttcccac gctgtcagtt ccaggagctg gtttggccga 2820
atgataggag aaagtgtcgg gagaaatatt tctgaagtat tattacctta tagccgtaca 2880
cccggtgaat gggttggtgc agccattggc gggacagccg cggccgctca tcatgccgtt 2940
ggaggggaag ttgccaatgc cgctagccgg gttacctgga gcggctttaa gcgggctttt 3000
aataacttct tctttaacgc ctctgcacgt cataatgaat ccgaagca 3048
<210>40
<211>2889
<212>DNA
<213>伯氏致病杆菌
<400>40
atgaatgttt ttaatccaac tttatatgcc ggtacaccga ctgtcaccgt catggacaat 60
cgagggctgt cagtgcggga tattgcttat caccgtacaa cagcaggaga gcaggctgac 120
actcgcatca cccgccatca atacagtccc cataattttt taatcgagag cattgatcca 180
cgcctttttg atttgcaatc tcagagcacc ataaaaccta atttcaccta ctgtcctgcc 240
ttgaagggtg atgtcctacg gacagagagt gtggatgccg gacaaactgt cattttgagt 300
gacatcgaag gtcgtccgtt actgaatatc agtgcgatgg gtgtcgtcaa acactggcaa 360
tatgaagaga gtacattgcc ggggcgcttg ctcgctgtca gtgaacggaa gaatgaggct 420
tcaacacccc aaattattga acggtttatt tggtcgggaa atagcccatc agaaaaagat 480
cacaatttgg cgggaaaata tcttcgtcat tatgataccg ccggattaaa ccagcttaat 540
gctgtgtctc tgaccagcgt ggatctctca caatcccgtc agttattgca ggatgatgtc 600
acagcagatt ggagcggaag tgacgaatcc cagtggaaga cgcgactgag taacgacata 660
ttcacaaccg aaatcaccgc tgatgcggtt ggcaatttct tgactcagaa tgatgccaaa 720
agcaaccagc aacgattgtc ctatgatgtg gcagggcagt taaaggcaag ctggctgacg 780
ataaaaggcc agaatgagca ggtgatagtt aactccctga cttactccgc cgcagggcag 840
aaactgcgtg aagagcaggg taacggcgtt gtcactgaat actcctatga agcacaaacc 900
tggcgtttga taggtgtaac ggcttaccgt cagtcagata aaaaaagatt gcaggatctt 960
gtctataact atgatccggt cggtaatctc ctgaatattc gcaataatgc agaggcaacc 1020
cgtttctggc gtaatcagat agtagaacca gagaaccact atgcttatga ctcgctttat 1080
caactcatca gtgctagtgg tcgagaaatc gccagtatcg gtcagcaggg cagccggctg 1140
cctgtaccga ttattcctct tcctgccaat gacgatgttt atactcgcta cacccgcaca 1200
tatcactatg atcgcggtgg aaatctctgc cagatccggc attgcgctcc tgctacagat 1260
aataagtaca ccacaaagat caccgtatcg aatcgtagta atcgtgcagt atgggatacc 1320
ttgaccacag atcccgccaa agtggatacc ctgtttgatc atggagggca tcaacttcaa 1380
ctccagtcag gccagacttt atgttggaac tatcggggtg aactacagca aataacaaag 1440
atacagcgtg acgaaaaacc cgcagataaa gagcggtatc gctatggtgt tggggctgcg 1500
cgggtcgtga aaatcagcac acagcaggcg gggggaagca gccatgtgca gcgtgttgtt 1560
tatctgccgg ggttggaact acgcacaact cagcatgatg cgacattaat cgaagactta 1620
caggtgatta tcatgggtga agcaggacgt gctcaggtac gcgtacttca ttgggaaata 1680
ccaccaccgg ataatcttaa caatgactca ctgcgttaca gctacgatag tttgatgggt 1740
tccagtcagc ttgaattgga tggagcaggg cagattatta cgcaggaaga atactacccc 1800
tatggaggta cagcaatatg ggcggcaaga aaccagaccg aagccaatta caaaaccatt 1860
cgctactccg gcaaagagcg tgatgcgacg gggctttatt actacgggca ccgttattat 1920
cagccgtggc tagggcgctg gttgagcgca gatcccgccg gaaccgtgga cggactgaat 1980
ctatatcgaa tggtgaggaa taacccgatt acttaccggg atgcagatgg gcttgcgccg 2040
ataggcgata agatcagcga agggatttat gagcctgagt tgcgagttgg tcttgaacga 2100
gatgacccaa atgtcagaga ttatgaccgg gtttatcctg atacggccaa gacagagatg 2160
atcgaagcaa ctgcgaccac aattgctccc agtcaaatgt tatcggcgca tgcttttgca 2220
tctgtaccta tattgacaga tttgtttaat cctcaaacag caaggctttc tcaaaagaca 2280
acggatattg tattaaacac acaaggtgga ggcgatttaa tctttactgg catgaatatt 2340
aaaggtaagg gaaaagaatt taatgcatta aaaatcgttg atacttatgg cggagaaatg 2400
cctgatagca aaaccgctat ttcagcatat tggcttccgc aaggtgggta tactgatatt 2460
ccgatacatc cgactggaat acaaaagtat ttgtttacgc ctgcgtttag tggttgcact 2520
ctggcagtag ataagcttaa cgaaaataca ttacgggcgt atcacgtcga aggaagtaag 2580
gaagatgctc aatataataa tttagcagtt gcagcgcacg gagagggttt ggtcatggct 2640
atggaatttc ctgactatgg atttcataca gacaaaacag ggcaaagact aaggaacaca 2700
cagggatttg cgtttatgtc ctacaatcaa tcccagaaaa aatgggaaat tcattatcaa 2760
aggcaagcat tgacatcaaa caccggtatc atgaatgtta gtgctaaaaa caagattcga 2820
ttgaatgccc ccagtcatgt aaaaaatagc tcaatcaaag gaactgaaat aatgacgaca 2880
catttttaa 2889
<210>41
<211>2862
<212>DNA
<213>类芽胞杆菌属菌株DAS1529
<400>41
atgaaaatga taccgtggac tcaccattat ttgcttcacc gcctgcgcgg tgagatggag 60
gttaaaccta tgaacacaac gtccatatat aggggcacgc ctacgatttc agttgtggat 120
aaccggaact tggagattcg cattcttcag tataaccgta tcgcggctga agatccggca 180
gatgagtgta tcctgcggaa cacgtatacg ccgttaagct atcttggcag cagcatggat 240
ccccgtttgt tctcgcaata tcaggatgat cgcggaacac cgccgaatat acgaaccatg 300
gcttccctga gaggcgaagc gctgtgttcg gaaagtgtgg atgccggccg caaggcggag 360
ctttttgata tcgaggggcg gcccgtctgg cttatcgatg ccaacggcac agagacgact 420
ctcgaatatg atgtcttagg caggccaaca gccgtattcg agcaacagga aggtacggac 480
tccccccagt gcagggagcg gtttatttat ggtgagaagg aggcggatgc ccaggccaac 540
aatttgcgcg gacaactggt tcgccactac gataccgcgg gccggataca gaccgacagc 600
atctccttgg ctggactgcc gttgcgccaa agccgtcaac tgctgaaaaa ttgggatgaa 660
cctggcgact ggagtatgga tgaggaaagc gcctgggcct cgttgctggc tgccgaagct 720
tatgatacga gctggcggta tgacgcgcag gacagggtgc tcgcccaaac cgacgccaaa 780
gggaatctcc agcaactgac ttacaatgac gccggccagc cgcaggcggt cagcctcaag 840
ctgcaaggcc aagcggagca acggatttgg aaccggatcg agtacaacgc ggcgggtcaa 900
gtggatctcg ccgaagccgg gaatggaatc gtaacggaat atacttacga ggaaagcacg 960
cagcggttaa tccgaaaaaa agattcccgc ggactgtcct ccggggaaag agaagtgctg 1020
caggattatc gttatgaata tgatccggta ggcaatatcc tttctattta caatgaagcg 1080
gagccggttc gttatttccg caatcaggcc gttgctccga aaaggcaata tgcctacgat 1140
gccttgtatc agcttgtatc tagttcgggg cgggaatccg acgcgcttcg gcagcagacg 1200
tcgcttcctc ccttgatcac gcctatccct ctggacgata gccaatacgt caattacgct 1260
gaaaaataca gctatgatca ggcgggcaat ttaatcaagc ttagccataa cggggcaagt 1320
caatatacaa cgaatgtgta tgtggacaaa agctcaaacc gggggatttg gcggcaaggg 1380
gaagacatcc cggatatcgc ggcttccttt gacagagcag gcaatcaaca agctttattc 1440
ccggggagac cgttggaatg ggatacacgc aatcaattaa gccgtgtcca tatggtcgtg 1500
cgcgaaggcg gagacaacga ctgggaaggc tatctctatg acagctcggg aatgcgtatc 1560
gtaaaacgat ctacccgcaa aacacagaca acgacgcaaa cggatacgac cctctatttg 1620
ccgggcctgg agctgcgaat ccgccagacc ggggaccggg tcacggaagc attgcaggtc 1680
attaccgtgg atgagggagc gggacaagtg agggtgctgc actgggagga tggaaccgag 1740
ccgggcggca tcgccaatga tcagtaccgg tacagcctga acgatcatct tacctcctct 1800
ttattggaag ttgacgggca aggtcagatc attagtaagg aagaatttta tccctatggc 1860
ggcacagccc tgtggacagc ccggtcagag gtagaggcaa gctacaagac catccgctat 1920
tcaggcaaag agcgggatgc cacaggcctg tattattacg gacaccgcta ctatatgcca 1980
tggttgggtc gctggctgaa tccggacccg gccggaatgg tagatggact aaacctgtac 2040
cgtatggtca ggaacaatcc tataggactg atggatccga atgggaatgc gccaatcaac 2100
gtggcggatt atagcttcgt gcatggtgat ttagtttatg gtcttagtaa ggaaagagga 2160
agatatctaa agctatttaa tccaaacttt aatatggaaa aatcagactc tcctgctatg 2220
gttatagatc aatataataa taatgttgca ttgagtataa ctaaccaata taaagtagaa 2280
gaattgatga aatttcaaaa agacccacaa aaagccgcac ggaaaataaa ggttccagaa 2340
gggaatcgtt tatcgaggaa cgaaaattat cctttgtggc acgattatat taacattgga 2400
gaagctaaag ctgcatttaa ggcctctcat attttccaag aagtgaaggg gaattatggg 2460
aaagattatt atcataaatt attattagac agaatgatag aatcgccgtt gctgtggaaa 2520
cgaggcagca aactcgggct agaaatcgcc gctaccaatc agagaacaaa aatacacttt 2580
gttcttgaca atttaaatat cgagcaggtg gttacgaaag agggtagcgg cggtcagtca 2640
atcacagctt cggagctccg ttatatttat cgaaatcgcg aaagattgaa cgggcgtgtc 2700
attttctata gaaataatga aaggctagat caggctccat ggcaagaaaa tccggactta 2760
tggagcaaat atcaaccggg tcttagacaa agcagcagtt caagagtcaa agaacgaggg 2820
attgggaac tttttccgccg gttttcaatg aagagaaagt aa 2862
<210>42
<211>2793
<212>DNA
<213>类芽胞杆菌属菌株DAS1529
<400>42
atgaacacaa cgtccatata taggggcacg cctacgattt cagttgtgga taaccggaac 60
ttggagattc gcattcttca gtataaccgt atcgcggctg aagatccggc agatgagtgt 120
atcctgcgga acacgtatac gccgttaagc tatcttggca gcagcatgga tccccgtttg 180
ttctcgcaat atcaggatga tcgcggaaca ccgccgaata tacgaaccat ggcttccctg 240
agaggcgaag cgctgtgttc ggaaagtgtg gatgccggcc gcaaggcgga gctttttgat 300
atcgaggggc ggcccgtctg gcttatcgat gccaacggca cagagacgac tctcgaatat 360
gatgtcttag gcaggccaac agccgtattc gagcaacagg aaggtacgga ctccccccag 420
tgcagggagc ggtttattta tggtgagaag gaggcggatg cccaggccaa caatttgcgc 480
ggacaactgg ttcgccacta cgataccgcg ggccggatac agaccgacag catctccttg 540
gctggactgc cgttgcgcca aagccgtcaa ctgctgaaaa attgggatga acctggcgac 600
tggagtatgg atgaggaaag cgcctgggcc tcgttgctgg ctgccgaagc ttatgatacg 660
agctggcggt atgacgcgca ggacagggtg ctcgcccaaa ccgacgccaa agggaatctc 720
cagcaactga cttacaatga cgccggccag ccgcaggcgg tcagcctcaa gctgcaaggc 780
caagcggagc aacggatttg gaaccggatc gagtacaacg cggcgggtca agtggatctc 840
gccgaagccg ggaatggaat cgtaacggaa tatacttacg aggaaagcac gcagcggtta 900
atccgaaaaa aagattcccg cggactgtcc tccggggaaa gagaagtgct gcaggattat 960
cgttatgaat atgatccggt aggcaatatc ctttctattt acaatgaagc ggagccggtt 1020
cgttatttcc gcaatcaggc cgttgctccg aaaaggcaat atgcctacga tgccttgtat 1080
cagcttgtat ctagttcggg gcgggaatcc gacgcgcttc ggcagcagac gtcgcttcct 1140
cccttgatca cgcctatccc tctggacgat agccaatacg tcaattacgc tgaaaaatac 1200
agctatgatc aggcgggcaa tttaatcaag cttagccata acggggcaag tcaatataca 1260
acgaatgtgt atgtggacaa aagctcaaac cgggggattt ggcggcaagg ggaagacatc 1320
ccggatatcg cggcttcctt tgacagagca ggcaatcaac aagctttatt cccggggaga 1380
ccgttggaat gggatacacg caatcaatta agccgtgtcc atatggtcgt gcgcgaaggc 1440
ggagacaacg actgggaagg ctatctctat gacagctcgg gaatgcgtat cgtaaaacga 1500
tctacccgca aaacacagac aacgacgcaa acggatacga ccctctattt gccgggcctg 1560
gagctgcgaa tccgccagac cggggaccgg gtcacggaag cattgcaggt cattaccgtg 1620
gatgagggag cgggacaagt gagggtgctg cactgggagg atggaaccga gccgggcggc 1680
atcgccaatg atcagtaccg gtacagcctg aacgatcatc ttacctcctc tttattggaa 1740
gttgacgggc aaggtcagat cattagtaag gaagaatttt atccctatgg cggcacagcc 1800
ctgtggacag cccggtcaga ggtagaggca agctacaaga ccatccgcta ttcaggcaaa 1860
gagcgggatg ccacaggcct gtattattac ggacaccgct actatatgcc atggttgggt 1920
cgctggctga atccggaccc ggccggaatg gtagatggac taaacctgta ccgtatggtc 1980
aggaacaatc ctataggact gatggatccg aatgggaatg cgccaatcaa cgtggcggat 2040
tatagcttcg tgcatggtga tttagtttat ggtcttagta aggaaagagg aagatatcta 2100
aagctattta atccaaactt taatatggaa aaatcagact ctcctgctat ggttatagat 2160
caatataata ataatgttgc attgagtata actaaccaat ataaagtaga agaattgatg 2220
aaatttcaaa aagacccaca aaaagccgca cggaaaataa aggttccaga agggaatcgt 2280
ttatcgagga acgaaaatta tcctttgtgg cacgattata ttaacattgg agaagctaaa 2340
gctgcattta aggcctctca tattttccaa gaagtgaagg ggaattatgg gaaagattat 2400
tatcataaat tattattaga cagaatgata gaatcgccgt tgctgtggaa acgaggcagc 2460
aaactcgggc tagaaatcgc cgctaccaat cagagaacaa aaatacactt tgttcttgac 2520
aatttaaata tcgagcaggt ggttacgaaa gagggtagcg gcggtcagtc aatcacagct 2580
tcggagctcc gttatattta tcgaaatcgc gaaagattga acgggcgtgt cattttctat 2640
agaaataatg aaaggctaga tcaggctcca tggcaagaaa atccggactt atggagcaaa 2700
tatcaaccgg gtcttagaca aagcagcagt tcaagagtca aagaacgagg gattgggaac 2760
tttttccgcc ggttttcaat gaagagaaag tag 2793
<210>43
<211>4428
<212>DNA
<213>人工序列
<220>
<223>优化在植物中表达的编码B类TC蛋白TcdB2的核酸序列
<400>43
atggctcaaa attcccagga cttctccata actgaactca gcttgcccaa gggtggaggt 60
gccatcactg ggatgggtga ggccctcaca ccaactggcc ctgatgggat ggctgcactc 120
tccttgccct tgccaatctc cgctggaaga ggctatgcac ctgccttcac actcaactac 180
aactctggtg ctgggaactc accctttgga cttggctggg actgcaatgt gatgaccata 240
aggaggcgca cccactttgg tgttccacac tatgatgaga ctgacacctt ccttggacca 300
gaaggtgagg tgcttgttgt cgctgaccag ccacgtgatg agagcacctt gcaagggatc 360
aaccttggtg ccaccttcac agtgactggc taccgcagca gacttgagtc acacttcagc 420
agacttgagt attggcaacc caagaccact gggaagactg acttctggct catctactcc 480
cctgatggcc aagttcatct ccttgggaag tcaccacagg cacgcatctc caacccctca 540
caaaccacac agactgcaca gtggctcctt gaagcctcag tcagctcccg tggtgaacag 600
atctactacc agtacagagc tgaagatgac actggctgtg aggctgatga gataacccat 660
cacctccaag ccacagcaca acgctacctc cacattgtct actatgggaa caggactgcc 720
tcagagacac tccctggact tgatggctct gcaccctccc aagctgactg gttgttctac 780
cttgtctttg actatggtga acgctccaac aatttgaaga ccccacctgc cttcagcacc 840
actggctcct ggctctgccg tcaggaccgc ttcagcagat atgagtatgg ctttgagata 900
aggacaaggc gcctttgcag acaagtcttg atgtaccatc acctccaagc acttgactcc 960
aagatcactg aacacaatgg acccaccctt gtgagccgtc tcatcttgaa ctatgatgag 1020
tcagccattg cctccacact tgtctttgtc aggcgtgttg gccatgagca agatgggaat 1080
gttgtcaccc ttccaccctt ggaacttgcc taccaggact tcagccccag gcaccatgca 1140
cactggcagc caatggatgt ccttgccaac ttcaatgcca ttcaacgctg gcagcttgtt 1200
gacttgaagg gtgaaggcct cccaggactc ttgtaccaag acaagggtgc ctggtggtac 1260
cgcagcgcac agaggcttgg tgagattggc tctgatgctg tgacctggga gaagatgcag 1320
cccttgtcag tgattcccag cctccagagc aatgcctccc ttgttgacat caatggtgat 1380
gggcagcttg actgggtgat aacaggccct ggcttgagag gctaccacag ccagaggcct 1440
gatggctcct ggacacgctt cacccccttg aacgcactcc cagttgagta cacccatcca 1500
cgtgcccaac ttgctgactt gatgggtgct ggcctctctg accttgtctt gattggaccc 1560
aagtctgtta gactctatgc caacacacgt gatggctttg ccaaagggaa ggatgttgtc 1620
cagtctggtg acataacctt gcctgtgcct ggtgctgatc cacgcaaact tgttgccttc 1680
tcagatgttc ttggctctgg gcaagcacat cttgttgaag tctcagccac taaggttact 1740
tgttggccaa atttgggaag aggcagattc ggacagccta ttacactccc tggcttcagc 1800
caacctgcca ctgagttcaa ccctgcccaa gtctacttgg ctgaccttga tggctctgga 1860
cccactgatc tcatctatgt tcacaccaac aggcttgaca tcttcttgaa caagtctggg 1920
aatggcttcg ctgaacctgt gacattgcgc tttcctgaag gcctccgctt tgaccacacc 1980
tgccagctcc aaatggctga tgtgcaaggc cttggtgttg cctccttgat tctctctgtt 2040
ccacacatgt caccacatca ctggcgttgt gatctcacca acatgaagcc ctggctcttg 2100
aatgagatga acaacaacat gggtgttcat cacacattgc gctacagaag ctcaagccag 2160
ttctggcttg atgagaaggc tgccgcactc accactggcc agaccccagt ctgctacctc 2220
cccttcccca ttcacacctt gtggcagact gagactgaag atgagatttc tgggaacaaa 2280
cttgtgacca cattgcgcta tgcacgtggt gcctgggatg gacgtgagcg tgagttccgt 2340
ggctttggct acgttgagca gactgacagc catcagcttg cacaagggaa cgcaccagag 2400
cgcaccccac ctgcactcac caagaactgg tacgccacag gcctcccagt gattgacaat 2460
gcactcagca ctgagtactg gcgcgatgac caagcctttg ctggcttctc accccgcttc 2520
accacatggc aagacaacaa ggatgttccc ttgacccctg aagatgacaa ctcccgctac 2580
tggttcaacc gtgccctcaa gggccaactc ttgaggtctg aactctatgg acttgatgac 2640
agcaccaaca aacatgtccc ctacacagtg actgagttcc gctcccaagt gaggcgcctc 2700
cagcacactg acagccgtta ccctgtcttg tggtccagcg tcgttgagtc tagaaactac 2760
cactacgaga ggattgcctc tgatccacaa tgcagccaga acattacact ctccagcgac 2820
cgctttggac agccactcaa acaactctca gtgcagtacc caaggcgcca gcaacctgcc 2880
atcaacttgt accctgacac cctccctgac aaactccttg ccaactcata tgatgaccag 2940
caaaggcaac tcaggttgac ctaccaacag tccagctggc atcacttgac caacaacact 3000
gttcgtgtgc ttggactccc tgacagcaca cgctctgaca tcttcaccta tggtgctgag 3060
aatgtgcctg ctggtggcct caaccttgaa ctcttgtctg acaaaaattc cttgattgct 3120
gatgacaaac cacgtgagta ccttggacag cagaagacag cctacaccga tggccagaac 3180
accacaccac tccaaacccc caccaggcaa gccttgattg ccttcactga gaccacagtc 3240
ttcaaccagt caacactctc tgccttcaat gggagcatac cctctgacaa gttgtcaacc 3300
acacttgaac aggctggcta ccagcaaaca aactacctct ttcccaggac tggtgaagac 3360
aaggtctggg ttgcacatca cggctacact gactatggga cagctgcaca attctggagg 3420
cctcagaaac aatccaacac ccaactcact gggaagatca ccttgatttg ggatgctaac 3480
tactgtgtcg ttgtccagac tcgtgatgct gctggactca ccacatctgc caagtatgac 3540
tggcgctttc tcacccctgt gcaactcact gacatcaacg acaaccagca cttgataacc 3600
cttgatgcac ttggccgtcc aatcacattg cgcttctggg ggactgagaa tgggaagatg 3660
acaggctaca gctcccctga gaaggccagc ttctccccac cctctgatgt caatgctgcc 3720
attgaactca aaaaaccact ccctgttgca cagtgccaag tctatgcccc agagagctgg 3780
atgcctgtct tgagccaaaa gaccttcaac cgtcttgctg aacaggactg gcagaaactc 3840
tacaatgcac gcataatcac tgaagatgga cgcatctgca cacttgccta cagaaggtgg 3900
gtgcaatcac agaaggccat tccacaactc ataagcctct tgaacaatgg acccaggctc 3960
ccaccacaca gcttgacact caccactgac agatatgacc atgatccaga gcagcaaatc 4020
aggcaacagg ttgtcttctc agatggcttt ggacgtttgc tccaagctgc cgctcgccat 4080
gaagctggga tggcacgtca gaggaacgaa gatggctcct tgatcatcaa tgttcagcac 4140
actgagaaca ggtgggctgt gactggacgc actgagtatg acaacaaggg gcagccaatc 4200
aggacctacc agccctactt tctcaatgac tggagatatg tctccaatga ctcagcacgt 4260
caggagaagg aagcctatgc tgacacccat gtctatgatc caattggacg tgagatcaag 4320
gtgataactg ccaagggctg gttcagacgc acactcttca ccccctggtt cactgtgaac 4380
gaagatgaga atgacacagc tgctgaagtg aagaaggtca agatgtga 4428
<210>44
<211>2886
<212>DNA
<213>人工序列
<220>
<223>优化在植物中表达的编码C类TC蛋白TccC3的核酸序列
<400>44
atggccaaga acattgaccc aaaattgtac caaaagactc ccactgtgtc tgtttacgac 60
aatcgtggct tgatcataag aaacattgac ttccatagga caactgccaa tggagaccct 120
gacaccagaa tcaccaggca tcagtatgac attcatgggc acctcaacca gagcattgac 180
ccaagactct acgaagccaa gcagacaaac aacaccatca agcccaactt cctttggcag 240
tatgacttga ctggcaaccc tctctgcact gaatccatag atgctggccg cactgtcaca 300
ttgaatgaca ttgagggccg ccctcttctc actgtcactg ccactggtgt catccaaacc 360
aggcagtatg agaccagctc actccctggc cgccttctct ctgtggcaga acaaactccc 420
gaggaaaaga caagccgcat aactgagaga ctcatctggg ctggcaacac tgaagcagag 480
aaagaccaca acttggctgg gcagtgtgtg aggcactatg acactgctgg agtcacaagg 540
ttggagtcgt tgtccctcac tggcactgtc ctttcccagt ccagccaact cctcattgac 600
acccaagaag ccaactggac tggtgacaat gagactgttt ggcagaacat gttggctgat 660
gacatctaca ccacactctc aaccttcgat gccactggag cactcctcac acagactgat 720
gccaaaggca acatccaaag attggcctat gatgtggctg gccaactcaa tgggtcatgg 780
ctcaccctca agggacagac tgagcaagtg atcatcaagt ccctcacata ctctgctgct 840
gggcagaagt tgagagagga acatggcaat gatgtcataa ctgagtacag ctatgagcct 900
gagacccaga gactcattgg catcaagacc aggaggccct ctgacaccaa agttcttcaa 960
gatcttcgct atgagtatga ccctgtgggc aatgtgatct caatcaggaa tgatgcagaa 1020
gccacaaggt tctggcacaa tcagaaggtc atgcctgaga acacctacac ctatgacagc 1080
ctctaccaac tcatctctgc cactggccgt gagatggcca acattgggca acaatcacac 1140
cagtttccca gcccagcact cccatctgac aacaacacat acaccaacta caccagaaca 1200
tacacctatg atcgtggtgg caacctcaca aaaatccaac acagcagccc agccacacag 1260
aacaactaca ccaccaacat cactgtgagc aacaggagca acagagcagt cttgtcgaca 1320
ctcactgagg acccagcaca ggtggatgca ctctttgatg ctggtgggca ccagaacacc 1380
ctcatctctg gccagaatct caactggaac acccgtggtg agttgcaaca ggtgacattg 1440
gtcaagaggg acaagggagc caatgatgac agagaatggt accgctactc tggtgatggc 1500
cgccgcatgt tgaagatcaa tgagcaacag gcctccaaca atgcacagac ccagagggtc 1560
acatacttgc ccaaccttga gttgaggctc acccagaaca gcactgccac cactgaggac 1620
cttcaagtca tcactgtggg agaagccggc cgtgctcagg tcagggttct tcactgggag 1680
tctggcaaac ccgaggacat agacaacaac cagctcaggt acagctacga caatctcatt 1740
ggctcatccc agttggagtt ggactctgaa ggccaaatca tatctgagga agagtactac 1800
ccttatggag gcactgcact ttgggctgcc aggaaccaga ctgaggcaag ctacaagaca 1860
atcaggtact ctggcaagga gagggatgcc actggactct actactatgg ttaccgctac 1920
taccaacctt ggattggccg ctggctttcc tctgatcccg ctggcaccat tgatggtctc 1980
aacctctacc gcatggtcag gaacaaccct gtcacattgc ttgaccctga tggactcatg 2040
cccaccattg cagagaggat tgctgccttg aagaagaaca aggtcactga ctctgctccc 2100
agcccagcca atgccaccaa tgtggccatc aacataagac cacctgtggc tcccaagccc 2160
tcacttccca aggcttccac cagctcacaa cccacaactc accctattgg tgctgccaac 2220
atcaagccca ccacctctgg gtccagcatt gtggctcctc tcagccctgt gggcaacaag 2280
tcaacctctg agatcagcct tcccgagtct gctcagtcct cctcatcaag caccaccagc 2340
accaacctcc agaagaagtc cttcaccctc taccgtgctg acaacagatc ctttgaagag 2400
atgcagtcca agttcccaga gggcttcaag gcttggactc ctcttgacac aaaaatggcc 2460
agacaattcg cttccatctt catcgggcag aaggacacca gcaaccttcc caaggagact 2520
gtcaagaaca tctccacctg gggtgccaag cccaagctca aggacctttc caactacatc 2580
aagtacacca aggacaagtc cactgtttgg gtttccactg ccatcaatac tgaagctggt 2640
gggcagtcct ctggtgctcc actccacaaa attgacatgg acctctatga gttcgccatt 2700
gatgggcaga agctcaaccc actcccagag ggccgcacaa agaacatggt tccctccctc 2760
ctccttgaca ctccacagat tgagacctcc tccatcattg ctctcaatca cgggccagtg 2820
aacgatgctg agatctcatt cctcaccacc ataccactca aaaatgtcaa gccacataag 2880
aggtga 2886
<210>45
<211>7313
<212>DNA
<213>人工序列
<220>
<223>编码TcdB2/TccC3融合蛋白8563(pDAB8563)的核酸序列
<400>45
tctagactga gtcgacgcac tactagtaac aaagaaggag atataccatg caaaattcac 60
aagattttag tattacggaa ctgtcactgc ccaaaggggg gggcgctatc acgggaatgg 120
gtgaagcatt aacccccact ggaccggatg gtatggccgc gctatctcta ccattgccta 180
tttctgccgg gcgcggttat gctcccgcat tcactctgaa ttacaacagc ggcgccggta 240
acagtccatt tggtctgggt tgggattgca acgttatgac tatccgccgc cgcacccatt 300
ttggcgtccc ccattatgac gaaaccgata cctttttggg gccagaaggc gaagtgctgg 360
tggtagcgga tcaacctcgc gacgaatcca cattacaggg tatcaattta ggcgccacct 420
ttaccgttac cggctaccgt tcccgtctgg aaagccattt cagccgattg gaatattggc 480
aacccaaaac aacaggtaaa acagattttt ggttgatata tagcccagat gggcaggtgc 540
atctactggg taaatcaccg caagcgcgga tcagcaaccc atcccaaacg acacaaacag 600
cacaatggct gctggaagcc tctgtatcat cacgtggcga acaaatttat tatcaatatc 660
gcgccgaaga tgacacaggt tgcgaagcag atgaaattac gcaccattta caggctacag 720
cgcaacgtta tttacacatc gtgtattacg gcaaccgtac agccagcgaa acattacccg 780
gtctggatgg cagcgcccca tcacaagcag actggttgtt ctatctggta tttgattacg 840
gcgaacgcag taacaacctg aaaacgccac cagcattttc gactacaggt agctggcttt 900
gccgtcagga ccgtttttcc cgttatgaat atggctttga gattcgtacc cgccgcttat 960
gccgtcaggt attgatgtac catcacctgc aagcactgga tagtaagata acagaacaca 1020
acggaccaac gctggtttca cgcctgatac tcaattacga cgaaagcgcg atagccagca 1080
cgctagtatt cgttcgccga gtgggacacg agcaagatgg taatgtcgtc accctgccgc 1140
cattagaatt ggcatatcag gatttttcac cgcgacatca cgctcactgg caaccaatgg 1200
atgtactggc aaacttcaat gccattcagc gctggcagct agtcgatcta aaaggcgaag 1260
gattacccgg cctgttatat caggataaag gcgcttggtg gtaccgctcc gcacagcgtc 1320
tgggcgaaat tggctcagat gccgtcactt gggaaaagat gcaaccttta tcggttattc 1380
cttctttgca aagtaatgcc tcgttggtgg atatcaatgg agacggccaa cttgactggg 1440
ttatcaccgg accgggatta cggggatatc atagtcaacg cccggatggc agttggacac 1500
gttttacccc actcaacgct ctgccggtgg aatacaccca tccacgcgcg caactcgcag 1560
atttaatggg agccgggcta tccgatttgg tgctgatcgg ccctaagagc gtgcgtttat 1620
atgccaatac ccgcgacggc tttgccaaag gaaaagatgt ggtgcaatcc ggtgatatca 1680
cactgccggt gccgggcgcc gatccacgta agttggtggc gtttagtgat gtattgggtt 1740
caggtcaagc ccatctggtt gaagtaagcg cgactaaagt cacctgctgg cctaatctgg 1800
ggcgcggacg ttttggtcaa cccattacct taccgggatt cagccagcca gcaaccgagt 1860
ttaacccggc tcaagtttat ctggccgatc tggatggcag cggtccaacg gatctgattt 1920
atgttcatac aaaccgtctg gatatcttcc tgaacaaaag tggcaatggc tttgctgaac 1980
cagtgacatt acgcttcccg gaaggtctgc gttttgatca tacctgtcag ttacaaatgg 2040
ccgatgtaca aggattaggc gtcgccagcc tgatactgag cgtgccgcat atgtctcccc 2100
atcactggcg ctgcgatctg accaacatga agccgtggtt actcaatgaa atgaacaaca 2160
atatgggggt ccatcacacc ttgcgttacc gcagttcctc ccaattctgg ctggatgaaa 2220
aagccgcggc gctgactacc ggacaaacac cggtttgcta tctccccttc ccgatccaca 2280
ccctatggca aacggaaaca gaagatgaaa tcagcggcaa caaattagtc acaacacttc 2340
gttatgctcg tggcgcatgg gacggacgcg agcgggaatt tcgcggattt ggttatgtag 2400
agcagacaga cagccatcaa ctggctcaag gcaacgcgcc agaacgtacg ccaccggcgc 2460
tgaccaaaaa ctggtatgcc accggactgc cggtgataga taacgcatta tcaaccgagt 2520
attggcgtga tgatcaggct tttgccggtt tctcaccgcg ctttacgact tggcaagata 2580
acaaagatgt cccgttaaca ccggaagatg ataacagtcg ttactggttc aaccgcgcgt 2640
tgaaaggtca actgctacgt agtgaactgt acggattgga cgatagtaca aataaacacg 2700
ttccctatac tgtcactgaa tttcgttcac aggtacgtcg attacagcat accgacagcc 2760
gataccctgt actttggtca tctgtagttg aaagccgcaa ctatcactac gaacgtatcg 2820
ccagcgaccc gcaatgcagt caaaatatta cgctatccag tgatcgattt ggtcagccgc 2880
taaaacagct ttcggtacag tacccgcgcc gccagcagcc agcaatcaat ctgtatcctg 2940
atacattgcc tgataagttg ttagccaaca gctatgatga ccaacaacgc caattacggc 3000
tcacctatca acaatccagt tggcatcacc tgaccaacaa taccgttcga gtattgggat 3060
taccggatag tacccgcagt gatatcttta cttatggcgc tgaaaatgtg cctgctggtg 3120
gtttaaatct ggaacttctg agtgataaaa atagcctgat cgcggacgat aaaccacgtg 3180
aatacctcgg tcagcaaaaa accgcttata ccgatggaca aaatacaacg ccgttgcaaa 3240
caccaacacg gcaagccctg attgccttta ccgaaacaac ggtattcaac cagtccacat 3300
tatcagcgtt taacggaagc atcccgtccg ataaattatc aacgacgctg gagcaagctg 3360
gatatcagca aacaaattat ctattccctc gcactggaga agataaagtt tgggtagccc 3420
atcacggcta taccgattat ggtacagcgg cacagttctg gcgcccgcaa aaacagagca 3480
acacccaact caccggtaaa atcaccctca tctgggatgc aaactattgc gttgtggtac 3540
aaacccggga tgctgctgga ctgacaacct cagccaaata tgactggcgt tttctgaccc 3600
cggtgcaact caccgatatc aatgacaatc agcaccttat cacactggat gcattgggcc 3660
gaccaatcac attgcgcttt tggggaactg aaaacggcaa gatgacaggt tattcctcac 3720
cggaaaaagc atcattttct ccaccatccg atgttaatgc cgctattgag ttaaaaaaac 3780
cgctccctgt agcacagtgt caggtctacg caccagaaag ctggatgcca gtattaagtc 3840
agaaaacctt caatcgactg gcagaacaag attggcaaaa gttatataac gcccgaatca 3900
tcaccgaaga tggacgtatc tgcacactgg cttatcgccg ctgggtacaa agccaaaagg 3960
caatccctca actcattagc ctgttaaaca acggaccccg tttacctcct cacagcctga 4020
cattgacgac ggatcgttat gatcacgatc ctgagcaaca gatccgtcaa caggtggtat 4080
tcagtgatgg ctttggccgc ttgctgcaag ccgctgcccg acatgaggca ggcatggccc 4140
ggcaacgcaa tgaagacggc tctttgatta taaatgtcca gcatactgag aaccgttggg 4200
cagtgactgg acgaacggaa tatgacaata aggggcaacc gatacgtacc tatcagccct 4260
atttcctcaa tgactggcga tacgtcagca atgatagtgc ccggcaggaa aaagaagctt 4320
atgcagatac ccatgtctat gatcccatag gtcgagaaat caaggttatc accgcaaaag 4380
gttggttccg tcgaaccttg ttcactccct ggtttactgt caatgaagat gaaaatgaca 4440
cagccgctga ggtgaagaag gtaaagatgc cgggatccgg tctcataatc cgtaacatcg 4500
attttcatcg tactaccgca aatggtgatc ccgatacccg tattacccgc catcaatacg 4560
atattcacgg acacctaaat caaagcatcg atccgcgcct atatgaagcc aagcaaacca 4620
acaatacgat caaacccaat tttctttggc agtatgattt gaccggtaat cccctatgta 4680
cagagagcat tgatgcaggt cgcactgtca ccttgaatga tattgaaggc cgtccgctac 4740
taacggtgac tgcaacaggg gttatacaaa ctcgacaata tgaaacttct tccctgcccg 4800
gtcgtctgtt atctgttgcc gaacaaacac ccgaggaaaa aacatcccgt atcaccgaac 4860
gcctgatttg ggctggcaat accgaagcag agaaagacca taaccttgcc ggccagtgcg 4920
tgcgtcacta tgacacggcg ggagttaccc ggttagagag tttatcactg accggtactg 4980
ttttatctca atccagccaa ctattgatcg acactcaaga ggcaaactgg acaggtgata 5040
acgaaaccgt ctggcaaaac atgctggctg atgacatcta cacaaccctg agcaccttcg 5100
atgccaccgg tgctttactg actcagaccg atgcgaaagg gaacattcag agactggctt 5160
atgatgtggc cgggcagcta aacgggagct ggctaacact caaaggccag acggaacaag 5220
tgattatcaa atccctgacc tactccgccg ccggacaaaa attacgtgag gaacacggca 5280
atgatgttat caccgaatac agttatgaac cggaaaccca acggctgatc ggtatcaaaa 5340
cccgccgtcc gtcagacact aaagtgctac aagacctgcg ctatgaatat gacccggtag 5400
gcaatgtcat cagcatccgt aatgacgcgg aagccacccg cttttggcac aatcagaaag 5460
tgatgccgga aaacacttat acctacgatt ccctgtatca gcttatcagc gccaccgggc 5520
gcgaaatggc gaatataggt caacaaagtc accaatttcc ctcacccgct ctaccttctg 5580
ataacaacac ctataccaac tatacccgta cttatactta tgaccgtggc ggcaatctga 5640
ccaaaatcca gcacagttca ccggcgacgc aaaacaacta caccaccaat atcacggttt 5700
caaatcgcag caaccgcgca gtactcagca cattgaccga agatccggcg caagtagatg 5760
ctttgtttga tgcaggcgga catcagaaca ccttgatatc aggacaaaac ctgaactgga 5820
atactcgtgg tgaactgcaa caagtaacac tggttaaacg ggacaagggc gccaatgatg 5880
atcgggaatg gtatcgttat agcggtgacg gaagaaggat gttaaaaatc aatgaacagc 5940
aggccagcaa caacgctcaa acacaacgtg tgacttattt gccgaactta gaacttcgtc 6000
taacacaaaa cagcacggcc acaaccgaag atttgcaagt tatcaccgta ggcgaagcgg 6060
gccgggcaca ggtacgagta ttacattggg agagcggtaa accggaagat atcgacaata 6120
atcagttgcg ttatagttac gataatctta tcggttccag tcaacttgaa ttagatagcg 6180
aaggacaaat tatcagtgaa gaagaatatt atccctatgg tggaacagca ttatgggccg 6240
ccaggaatca gacagaagcc agttataaaa ctatccgtta ttcaggcaaa gagcgggatg 6300
ccaccgggct atattactac ggctatcggt attaccaacc gtggatagga cggtggttaa 6360
gctccgatcc ggcaggaaca atcgatgggc tgaatttata tcggatggtg aggaataatc 6420
cagttaccct ccttgatcct gatggattaa tgccaacaat tgcagaacgc atagcagcac 6480
taaaaaaaaa taaagtaaca gactcagcgc cttcgccagc aaatgccaca aacgtagcga 6540
taaacatccg cccgcctgta gcaccaaaac ctagcttacc gaaagcatca acgagtagcc 6600
aaccaaccac acaccctatc ggagctgcaa acataaaacc aacgacgtct gggtcatcta 6660
ttgttgctcc attgagtcca gtaggaaata aatctacttc tgaaatctct ctgccagaaa 6720
gcgctcaaag cagttcttca agcactacct cgacaaatct acagaaaaaa tcatttactt 6780
tatatagagc agataacaga tcctttgaag aaatgcaaag taaattccct gaaggattta 6840
aagcctggac tcctctagac actaagatgg caaggcaatt tgctagtatc tttattggtc 6900
agaaagatac atctaattta cctaaagaaa cagtcaagaa cataagcaca tggggagcaa 6960
agccaaaact aaaagatctc tcaaattaca taaaatatac caaggacaaa tctacagtat 7020
gggtttctac tgcaattaat actgaagcag gtggacaaag ctcaggggct ccactccata 7080
aaattgatat ggatctctac gagtttgcca ttgatggaca aaaactaaat ccactaccgg 7140
agggtagaac taaaaacatg gtaccttccc ttttactcga caccccacaa atagagacat 7200
catccatcat tgcacttaat catggaccgg taaatgatgc agaaatttca tttctgacaa 7260
caattccgct taaaaatgta aaacctcata agagataatt aatctgactc gag 7313
<210>46
<211>2416
<212>PRT
<213>人工序列
<220>
<223>TcdB2/TccC3融合蛋白pDAB8563
<400>46
Met Gln Asn Ser Gln Asp Phe Ser Ile Thr Glu Leu Ser Leu Pro Lys
1 5 10 15
Gly Gly Gly Ala Ile Thr Gly Met Gly Glu Ala Leu Thr Pro Thr Gly
20 25 30
Pro Asp Gly Met Ala Ala Leu Ser Leu Pro Leu Pro Ile Ser Ala Gly
35 40 45
Arg Gly Tyr Ala Pro Ala Phe Thr Leu Asn Tyr Asn Ser Gly Ala Gly
50 55 60
Asn Ser Pro Phe Gly Leu Gly Trp Asp Cys Asn Val Met Thr Ile Arg
65 70 75 80
Arg Arg Thr His Phe Gly Val Pro His Tyr Asp Glu Thr Asp Thr Phe
85 90 95
Leu Gly Pro Glu Gly Glu Val Leu Val Val Ala Asp Gln Pro Arg Asp
100 105 110
Glu Ser Thr Leu Gln Gly Ile Asn Leu Gly Ala Thr Phe Thr Val Thr
115 120 125
Gly Tyr Arg Ser Arg Leu Glu Ser His Phe Ser Arg Leu Glu Tyr Trp
130 135 140
Gln Pro Lys Thr Thr Gly Lys Thr Asp Phe Trp Leu Ile Tyr Ser Pro
145 150 155 160
Asp Gly Gln Val His Leu Leu Gly Lys Ser Pro Gln Ala Arg Ile Ser
165 170 175
Asn Pro Ser Gln Thr Thr Gln Thr Ala Gln Trp Leu Leu Glu Ala Ser
180 185 190
Val Ser Ser Arg Gly Glu Gln Ile Tyr Tyr Gln Tyr Arg Ala Glu Asp
195 200 205
Asp Thr Gly Cys Glu Ala Asp Glu Ile Thr His His Leu Gln Ala Thr
210 215 220
Ala Gln Arg Tyr Leu His Ile Val Tyr Tyr Gly Asn Arg Thr Ala Ser
225 230 235 240
Glu Thr Leu Pro Gly Leu Asp Gly Ser Ala Pro Ser Gln Ala Asp Trp
245 250 255
Leu Phe Tyr Leu Val Phe Asp Tyr Gly Glu Arg Ser Asn Asn Leu Lys
260 265 270
Thr Pro Pro Ala Phe Ser Thr Thr Gly Ser Trp Leu Cys Arg Gln Asp
275 280 285
Arg Phe Ser Arg Tyr Glu Tyr Gly Phe Glu Ile Arg Thr Arg Arg Leu
290 295 300
Cys Arg Gln Val Leu Met Tyr His His Leu Gln Ala Leu Asp Ser Lys
305 310 315 320
Ile Thr Glu His Asn Gly Pro Thr Leu Val Ser Arg Leu Ile Leu Asn
325 330 335
Tyr Asp Glu Ser Ala Ile Ala Ser Thr Leu Val Phe Val Arg Arg Val
340 345 350
Gly His Glu Gln Asp Gly Asn Val Val Thr Leu Pr0 Pro Leu Glu Leu
355 360 365
Ala Tyr Gln Asp Phe Ser Pro Arg His His Ala His Trp Gln Pro Met
370 375 380
Asp Val Leu Ala Asn Phe Asn Ala Ile Gln Arg Trp Gln Leu Val Asp
385 390 395 400
Leu Lys Gly Glu Gly Leu Pro Gly Leu Leu Tyr Gln Asp Lys Gly Ala
405 410 415
Trp Trp Tyr Arg Ser Ala Gln Arg Leu Gly Glu Ile Gly Ser Asp Ala
420 425 430
Val Thr Trp Glu Lys Met Gln Pro Leu Ser Val Ile Pro Ser Leu Gln
435 440 445
Ser Asn Ala Ser Leu Val Asp Ile Asn Gly Asp Gly Gln Leu Asp Trp
450 455 460
Val Ile Thr Gly Pro Gly Leu Arg Gly Tyr His Ser Gln Arg Pro Asp
465 470 475 480
Gly Ser Trp Thr Arg Phe Thr Pro Leu Asn Ala Leu Pro Val Glu Tyr
485 490 495
Thr His Pro Arg Ala Gln Leu Ala Asp Leu Met Gly Ala Gly Leu Ser
500 505 510
Asp Leu Val Leu Ile Gly Pro Lys Ser Val Arg Leu Tyr Ala Asn Thr
515 520 525
Arg Asp Gly Phe Ala Lys Gly Lys Asp Val Val Gln Ser Gly Asp Ile
530 535 540
Thr Leu Pro Val Pro Gly Ala Asp Pro Arg Lys Leu Val Ala Phe Ser
545 550 555 560
Asp Val Leu Gly Ser Gly Gln Ala His Leu Val Glu Val Ser Ala Thr
565 570 575
Lys Val Thr Cys Trp Pro Asn Leu Gly Arg Gly Arg Phe Gly Gln Pro
580 585 590
Ile Thr Leu Pro Gly Phe Ser Gln Pro Ala Thr Glu Phe Asn Pro Ala
595 600 605
Gln Val Tyr Leu Ala Asp Leu Asp Gly Ser Gly Pro Thr Asp Leu Ile
610 615 620
Tyr Val His Thr Asn Arg Leu Asp Ile Phe Leu Asn Lys Ser Gly Asn
625 630 635 640
Gly Phe Ala Glu Pro Val Thr Leu Arg Phe Pro Glu Gly Leu Arg Phe
645 650 655
Asp His Thr Cys Gln Leu Gln Met Ala Asp Val Gln Gly Leu Gly Val
660 665 670
Ala Ser Leu Ile Leu Ser Val Pro His Met Ser Pro His His Trp Arg
675 680 685
Cys Asp Leu Thr Asn Met Lys Pro Trp Leu Leu Asn Glu Met Asn Asn
690 695 700
Asn Met Gly Val His His Thr Leu Arg Tyr Arg Ser Ser Ser Gln Phe
705 710 715 720
Trp Leu Asp Glu Lys Ala Ala Ala Leu Thr Thr Gly Gln Thr Pro Val
725 730 735
Cys Tyr Leu Pro Phe Pro Ile His Thr Leu Trp Gln Thr Glu Thr Glu
740 745 750
Asp Glu Ile Ser Gly Asn Lys Leu Val Thr Thr Leu Arg Tyr Ala Arg
755 760 765
Gly Ala Trp Asp Gly Arg Glu Arg Glu Phe Arg Gly Phe Gly Tyr Val
770 775 780
Glu Gln Thr Asp Ser His Gln Leu Ala Gln Gly Asn Ala Pro Glu Arg
785 790 795 800
Thr Pro Pro Ala Leu Thr Lys Asn Trp Tyr Ala Thr Gly Leu Pro Val
805 810 815
Ile Asp Asn Ala Leu Ser Thr Glu Tyr Trp Arg Asp Asp Gln Ala Phe
820 825 830
Ala Gly Phe Ser Pro Arg Phe Thr Thr Trp Gln Asp Asn Lys Asp Val
835 840 845
Pro Leu Thr Pro Glu Asp Asp Asn Ser Arg Tyr Trp Phe Asn Arg Ala
850 855 860
Leu Lys Gly Gln Leu Leu Arg Ser Glu Leu Tyr Gly Leu Asp Asp Ser
865 870 875 880
Thr Asn Lys His Val Pro Tyr Thr Val Thr Glu Phe Arg Ser Gln Val
885 890 895
Arg Arg Leu Gln His Thr Asp Ser Arg Tyr Pro Val Leu Trp Ser Ser
900 905 910
Val Val Glu Ser Arg Asn Tyr His Tyr Glu Arg Ile Ala Ser Asp Pro
915 920 925
Gln Cys Ser Gln Asn Ile Thr Leu Ser Ser Asp Arg Phe Gly Gln Pro
930 935 940
Leu Lys Gln Leu Ser Val Gln Tyr Pro Arg Arg Gln Gln Pro Ala Ile
945 950 955 960
Asn Leu Tyr Pro Asp Thr Leu Pro Asp Lys Leu Leu Ala Asn Ser Tyr
965 970 975
Asp Asp Gln Gln Arg Gln Leu Arg Leu Thr Tyr Gln Gln Ser Ser Trp
980 985 990
His His Leu Thr Asn Asn Thr Val Arg Val Leu Gly Leu Pro Asp Ser
995 1000 1005
Thr Arg Ser Asp Ile Phe Thr Tyr Gly Ala Glu Asn Val Pro Ala
1010 1015 1020
Gly Gly Leu Asn Leu Glu Leu Leu Ser Asp Lys Asn Ser Leu Ile
1025 1030 1035
Ala Asp Asp Lys Pro Arg Glu Tyr Leu Gly Gln Gln Lys Thr Ala
1040 1045 1050
Tyr Thr Asp Gly Gln Asn Thr Thr Pro Leu Gln Thr Pro Thr Arg
1055 1060 1065
Gln Ala Leu Ile Ala Phe Thr Glu Thr Thr Val Phe Asn Gln Ser
1070 1075 1080
Thr Leu Ser Ala Phe Asn Gly Ser Ile Pro Ser Asp Lys Leu Ser
1085 1090 1095
Thr Thr Leu Glu Gln Ala Gly Tyr Gln Gln Thr Asn Tyr Leu Phe
1100 1105 1110
Pro Arg Thr Gly Glu Asp Lys Val Trp Val Ala His His Gly Tyr
1115 1120 1125
Thr Asp Tyr Gly Thr Ala Ala Gln Phe Trp Arg Pro Gln Lys Gln
1130 1135 1140
Ser Asn Thr Gln Leu Thr Gly Lys Ile Thr Leu Ile Trp Asp Ala
1145 1150 1155
Asn Tyr Cys Val Val Val Gln Thr Arg Asp Ala Ala Gly Leu Thr
1160 1165 1170
Thr Ser Ala Lys Tyr Asp Trp Arg Phe Leu Thr Pro Val Gln Leu
1175 1180 1185
Thr Asp Ile Asn Asp Asn Gln His Leu Ile Thr Leu Asp Ala Leu
1190 1195 1200
Gly Arg Pro Ile Thr Leu Arg Phe Trp Gly Thr Glu Asn Gly Lys
1205 1210 1215
Met Thr Gly Tyr Ser Ser Pro Glu Lys Ala Ser Phe Ser Pro Pro
1220 1225 1230
Ser Asp Val Asn Ala Ala Ile Glu Leu Lys Lys Pro Leu Pro Val
1235 1240 1245
Ala Gln Cys Gln Val Tyr Ala Pro Glu Ser Trp Met Pro Val Leu
1250 1255 1260
Ser Gln Lys Thr Phe Asn Arg Leu Ala Glu Gln Asp Trp Gln Lys
1265 1270 1275
Leu Tyr Asn Ala Arg Ile Ile Thr Glu Asp Gly Arg Ile Cys Thr
1280 1285 1290
Leu Ala Tyr Arg Arg Trp Val Gln Ser Gln Lys Ala Ile Pro Gln
1295 1300 1305
Leu Ile Ser Leu Leu Asn Asn Gly Pro Arg Leu Pro Pro His Ser
1310 1315 1320
Leu Thr Leu Thr Thr Asp Arg Tyr Asp His Asp Pro Glu Gln Gln
1325 1330 1335
Ile Arg Gln Gln Val Val Phe Ser Asp Gly Phe Gly Arg Leu Leu
1340 1345 1350
Gln Ala Ala Ala Arg His Glu Ala Gly Met Ala Arg Gln Arg Asn
1355 1360 1365
Glu Asp Gly Ser Leu Ile Ile Asn Val Gln His Thr Glu Asn Arg
1370 1375 1380
Trp Ala Val Thr Gly Arg Thr Glu Tyr Asp Asn Lys Gly Gln Pro
1385 1390 1395
Ile Arg Thr Tyr Gln Pro Tyr Phe Leu Asn Asp Trp Arg Tyr Val
1400 1405 1410
Ser Asn Asp Ser Ala Arg Gln Glu Lys Glu Ala Tyr Ala Asp Thr
1415 1420 1425
His Val Tyr Asp Pro Ile Gly Arg Glu Ile Lys Val Ile Thr Ala
1430 1435 1440
Lys Gly Trp Phe Arg Arg Thr Leu Phe Thr Pro Trp Phe Thr Val
1445 1450 1455
Asn Glu Asp Glu Asn Asp Thr Ala Ala Glu Val Lys Lys Val Lys
1460 1465 1470
Met Pro Gly Ser Gly Leu Ile Ile Arg Asn Ile Asp Phe His Arg
1475 1480 1485
Thr Thr Ala Asn Gly Asp Pro Asp Thr Arg Ile Thr Arg His Gln
1490 1495 1500
Tyr Asp Ile His Gly His Leu Asn Gln SerIle Asp Pro Arg Leu
1505 1510 1515
Tyr Glu Ala Lys Gln Thr Asn Asn Thr Ile Lys Pro Asn Phe Leu
1520 1525 1530
Trp Gln Tyr Asp Leu Thr Gly Asn Pro Leu Cys Thr Glu Ser Ile
1535 1540 1545
Asp Ala Gly Arg Thr Val Thr Leu Asn Asp Ile Glu Gly Arg Pro
1550 1555 1560
Leu Leu Thr Val Thr Ala Thr Gly Val Ile Gln Thr Arg Gln Tyr
1565 1570 1575
Glu Thr Ser Ser Leu Pro Gly Arg Leu Leu Ser Val Ala Glu Gln
1580 1585 1590
Thr Pro Glu Glu Lys Thr Ser Arg Ile Thr Glu Arg Leu Ile Trp
1595 1600 1605
Ala Gly Asn Thr Glu Ala Glu Lys Asp His Asn Leu Ala Gly Gln
1610 1615 1620
Cys Val Arg His Tyr Asp Thr Ala Gly Val Thr Arg Leu Glu Ser
1625 1630 1635
Leu Ser Leu Thr Gly Thr Val Leu Ser Gln Ser Ser Gln Leu Leu
1640 1645 1650
Ile Asp Thr Gln Glu Ala Asn Trp Thr Gly Asp Asn Glu Thr Val
1655 1660 1665
Trp Gln Asn Met Leu Ala Asp Asp Ile Tyr Thr Thr Leu Ser Thr
1670 1675 1680
Phe Asp Ala Thr Gly Ala Leu Leu Thr Gln Thr Asp Ala Lys Gly
1685 1690 1695
Asn Ile Gln Arg Leu Ala Tyr Asp Val Ala Gly Gln Leu Asn Gly
1700 1705 1710
Ser Trp Leu Thr Leu Lys Gly Gln Thr Glu Gln Val Ile Ile Lys
1715 1720 1725
Ser Leu Thr Tyr Ser Ala Ala Gly Gln Lys Leu Arg Glu Glu His
1730 1735 1740
Gly Asn Asp Val Ile Thr Glu Tyr Ser Tyr Glu Pro Glu Thr Gln
1745 1750 1755
Arg Leu Ile Gly Ile Lys Thr Arg Arg Pro Ser Asp Thr Lys Val
1760 1765 1770
Leu Gln Asp Leu Arg Tyr Glu Tyr Asp Pro Val Gly Asn Val Ile
1775 1780 1785
Ser Ile Arg Asn Asp Ala Glu Ala Thr Arg Phe Trp His Asn Gln
1790 1795 1800
Lys Val Met Pro Glu Asn Thr Tyr Thr Tyr Asp Ser Leu Tyr Gln
1805 1810 1815
Leu Ile Ser Ala Thr Gly Arg Glu Met Ala Asn Ile Gly Gln Gln
1820 1825 1830
Ser His Gln Phe Pro Ser Pro Ala Leu Pro Ser Asp Asn Asn Thr
1835 1840 1845
Tyr Thr Asn Tyr Thr Arg Thr Tyr Thr Tyr Asp Arg Gly Gly Asn
1850 1855 1860
Leu Thr Lys Ile Gln His Ser Ser Pro Ala Thr Gln Asn Asn Tyr
1865 1870 1875
Thr Thr Asn Ile Thr Val Ser Asn Arg Ser Asn Arg Ala Val Leu
1880 1885 1890
Ser Thr Leu Thr Glu Asp Pro Ala Gln Val Asp Ala Leu Phe Asp
1895 1900 1905
Ala Gly Gly His Gln Asn Thr Leu Ile Ser Gly Gln Asn Leu Asn
1910 1915 1920
Trp Asn Thr Arg Gly Glu Leu Gln Gln Val Thr Leu Val Lys Arg
1925 1930 1935
Asp Lys Gly Ala Asn Asp Asp Arg Glu Trp Tyr Arg Tyr Ser Gly
1940 1945 1950
Asp Gly Arg Arg Met Leu Lys Ile Asn Glu Gln Gln Ala Ser Asn
1955 1960 1965
Asn Ala Gln Thr Gln Arg Val Thr Tyr Leu Pro Asn Leu Glu Leu
1970 1975 1980
Arg Leu Thr Gln Asn Ser Thr Ala Thr Thr Glu Asp Leu Gln Val
1985 1990 1995
Ile Thr Val Gly Glu Ala Gly Arg Ala Gln Val Arg Val Leu His
2000 2005 2010
Trp Glu Ser Gly Lys Pro Glu Asp Ile Asp Asn Asn Gln Leu Arg
2015 2020 2025
Tyr Ser Tyr Asp Asn Leu Ile Gly Ser Ser Gln Leu Glu Leu Asp
2030 2035 2040
Ser Glu Gly Gln Ile Ile Ser Glu Glu Glu Tyr Tyr Pro Tyr Gly
2045 2050 2055
Gly Thr Ala Leu Trp Ala Ala Arg Asn Gln Thr Glu Ala Ser Tyr
2060 2065 2070
Lys Thr Ile Arg Tyr Ser Gly Lys Glu Arg Asp Ala Thr Gly Leu
2075 2080 2085
Tyr Tyr Tyr Gly Tyr Arg Tyr Tyr Gln Pro Trp Ile Gly Arg Trp
2090 2095 2100
Leu Ser Ser Asp Pro Ala Gly Thr Ile Asp Gly Leu Asn Leu Tyr
2105 2110 2115
Arg Met Val Arg Asn Asn Pro Val Thr Leu Leu Asp Pro Asp Gly
2120 2125 2130
Leu Met Pro Thr Ile Ala Glu Arg Ile Ala Ala Leu Lys Lys Asn
2135 2140 2145
Lys Val Thr Asp Ser Ala Pro Ser Pro Ala Asn Ala Thr Asn Val
2150 2155 2160
Ala Ile Asn Ile Arg Pro Pro Val Ala Pro Lys Pro Ser Leu Pro
2165 2170 2175
Lys Ala Ser Thr Ser Ser Gln Pro Thr Thr His Pro Ile Gly Ala
2180 2185 2190
Ala Asn Ile Lys Pro Thr Thr Ser Gly Ser Ser Ile Val Ala Pro
2195 2200 2205
Leu Ser Pro Val Gly Asn Lys Ser Thr Ser Glu Ile Ser Leu Pro
2210 2215 2220
Glu Ser Ala Gln Ser Ser Ser Ser Ser Thr Thr Ser Thr Asn Leu
2225 2230 2235
Gln Lys Lys Ser Phe Thr Leu Tyr Arg Ala Asp Asn Arg Ser Phe
2240 2245 2250
Glu Glu Met Gln Ser Lys Phe Pro Glu Gly Phe Lys Ala Trp Thr
2255 2260 2265
Pro Leu Asp Thr Lys Met Ala Arg Gln Phe Ala Ser Ile Phe Ile
2270 2275 2280
Gly Gln Lys Asp Thr Ser Asn Leu Pro Lys Glu Thr Val Lys Asn
2285 2290 2295
Ile Ser Thr Trp Gly Ala Lys Pro Lys Leu Lys Asp Leu Ser Asn
2300 2305 2310
Tyr Ile Lys Tyr Thr Lys Asp Lys Ser Thr Val Trp Val Ser Thr
2315 2320 2325
Ala Ile Asn Thr Glu Ala Gly Gly Gln Ser Ser Gly Ala Pro Leu
2330 2335 2340
His Lys Ile Asp Met Asp Leu Tyr Glu Phe Ala Ile Asp Gly Gln
2345 2350 2355
Lys Leu Asn Pro Leu Pro Glu Gly Arg Thr Lys Asn Met Val Pro
2360 2365 2370
Ser Leu Leu Leu Asp Thr Pro Gln Ile Glu Thr Ser Ser Ile Ile
2375 2380 2385
Ala Leu Asn His Gly Pro Val Asn Asp Ala Glu Ile Ser Phe Leu
2390 2395 2400
Thr Thr Ile Pro Leu Lys Asn Val Lys Pro His Lys Arg
2405 2410 2415
<210>47
<211>7367
<212>DNA
<213>人工序列
<220>
<223>编码TcdB2/TccC3融合蛋白pDAB8564的核酸序列
<400>47
tctagactga gtcgacgcac tactagtaac aaagaaggag atataccatg caaaattcac 60
aagattttag tattacggaa ctgtcactgc ccaaaggggg gggcgctatc acgggaatgg 120
gtgaagcatt aacccccact ggaccggatg gtatggccgc gctatctcta ccattgccta 180
tttctgccgg gcgcggttat gctcccgcat tcactctgaa ttacaacagc ggcgccggta 240
acagtccatt tggtctgggt tgggattgca acgttatgac tatccgccgc cgcacccatt 300
ttggcgtccc ccattatgac gaaaccgata cctttttggg gccagaaggc gaagtgctgg 360
tggtagcgga tcaacctcgc gacgaatcca cattacaggg tatcaattta ggcgccacct 420
ttaccgttac cggctaccgt tcccgtctgg aaagccattt cagccgattg gaatattggc 480
aacccaaaac aacaggtaaa acagattttt ggttgatata tagcccagat gggcaggtgc 540
atctactggg taaatcaccg caagcgcgga tcagcaaccc atcccaaacg acacaaacag 600
cacaatggct gctggaagcc tctgtatcat cacgtggcga acaaatttat tatcaatatc 660
gcgccgaaga tgacacaggt tgcgaagcag atgaaattac gcaccattta caggctacag 720
cgcaacgtta tttacacatc gtgtattacg gcaaccgtac agccagcgaa acattacccg 780
gtctggatgg cagcgcccca tcacaagcag actggttgtt ctatctggta tttgattacg 840
gcgaacgcag taacaacctg aaaacgccac cagcattttc gactacaggt agctggcttt 900
gccgtcagga ccgtttttcc cgttatgaat atggctttga gattcgtacc cgccgcttat 960
gccgtcaggt attgatgtac catcacctgc aagcactgga tagtaagata acagaacaca 1020
acggaccaac gctggtttca cgcctgatac tcaattacga cgaaagcgcg atagccagca 1080
cgctagtatt cgttcgccga gtgggacacg agcaagatgg taatgtcgtc accctgccgc 1140
cattagaatt ggcatatcag gatttttcac cgcgacatca cgctcactgg caaccaatgg 1200
atgtactggc aaacttcaat gccattcagc gctggcagct agtcgatcta aaaggcgaag 1260
gattacccgg cctgttatat caggataaag gcgcttggtg gtaccgctcc gcacagcgtc 1320
tgggcgaaat tggctcagat gccgtcactt gggaaaagat gcaaccttta tcggttattc 1380
cttctttgca aagtaatgcc tcgttggtgg atatcaatgg agacggccaa cttgactggg 1440
ttatcaccgg accgggatta cggggatatc atagtcaacg cccggatggc agttggacac 1500
gttttacccc actcaacgct ctgccggtgg aatacaccca tccacgcgcg caactcgcag 1560
atttaatggg agccgggcta tccgatttgg tgctgatcgg ccctaagagc gtgcgtttat 1620
atgccaatac ccgcgacggc tttgccaaag gaaaagatgt ggtgcaatcc ggtgatatca 1680
cactgccggt gccgggcgcc gatccacgta agttggtggc gtttagtgat gtattgggtt 1740
caggtcaagc ccatctggtt gaagtaagcg cgactaaagt cacctgctgg cctaatctgg 1800
ggcgcggacg ttttggtcaa cccattacct taccgggatt cagccagcca gcaaccgagt 1860
ttaacccggc tcaagtttat ctggccgatc tggatggcag cggtccaacg gatctgattt 1920
atgttcatac aaaccgtctg gatatcttcc tgaacaaaag tggcaatggc tttgctgaac 1980
cagtgacatt acgcttcccg gaaggtctgc gttttgatca tacctgtcag ttacaaatgg 2040
ccgatgtaca aggattaggc gtcgccagcc tgatactgag cgtgccgcat atgtctcccc 2100
atcactggcg ctgcgatctg accaacatga agccgtggtt actcaatgaa atgaacaaca 2160
atatgggggt ccatcacacc ttgcgttacc gcagttcctc ccaattctgg ctggatgaaa 2220
aagccgcggc gctgactacc ggacaaacac cggtttgcta tctccccttc ccgatccaca 2280
ccctatggca aacggaaaca gaagatgaaa tcagcggcaa caaattagtc acaacacttc 2340
gttatgctcg tggcgcatgg gacggacgcg agcgggaatt tcgcggattt ggttatgtag 2400
agcagacaga cagccatcaa ctggctcaag gcaacgcgcc agaacgtacg ccaccggcgc 2460
tgaccaaaaa ctggtatgcc accggactgc cggtgataga taacgcatta tcaaccgagt 2520
attggcgtga tgatcaggct tttgccggtt tctcaccgcg ctttacgact tggcaagata 2580
acaaagatgt cccgttaaca ccggaagatg ataacagtcg ttactggttc aaccgcgcgt 2640
tgaaaggtca actgctacgt agtgaactgt acggattgga cgatagtaca aataaacacg 2700
ttccctatac tgtcactgaa tttcgttcac aggtacgtcg attacagcat accgacagcc 2760
gataccctgt actttggtca tctgtagttg aaagccgcaa ctatcactac gaacgtatcg 2820
ccagcgaccc gcaatgcagt caaaatatta cgctatccag tgatcgattt ggtcagccgc 2880
taaaacagct ttcggtacag tacccgcgcc gccagcagcc agcaatcaat ctgtatcctg 2940
atacattgcc tgataagttg ttagccaaca gctatgatga ccaacaacgc caattacggc 3000
tcacctatca acaatccagt tggcatcacc tgaccaacaa taccgttcga gtattgggat 3060
taccggatag tacccgcagt gatatcttta cttatggcgc tgaaaatgtg cctgctggtg 3120
gtttaaatct ggaacttctg agtgataaaa atagcctgat cgcggacgat aaaccacgtg 3180
aatacctcgg tcagcaaaaa accgcttata ccgatggaca aaatacaacg ccgttgcaaa 3240
caccaacacg gcaagccctg attgccttta ccgaaacaac ggtattcaac cagtccacat 3300
tatcagcgtt taacggaagc atcccgtccg ataaattatc aacgacgctg gagcaagctg 3360
gatatcagca aacaaattat ctattccctc gcactggaga agataaagtt tgggtagccc 3420
atcacggcta taccgattat ggtacagcgg cacagttctg gcgcccgcaa aaacagagca 3480
acacccaact caccggtaaa atcaccctca tctgggatgc aaactattgc gttgtggtac 3540
aaacccggga tgctgctgga ctgacaacct cagccaaata tgactggcgt tttctgaccc 3600
cggtgcaact caccgatatc aatgacaatc agcaccttat cacactggat gcattgggcc 3660
gaccaatcac attgcgcttt tggggaactg aaaacggcaa gatgacaggt tattcctcac 3720
cggaaaaagc atcattttct ccaccatccg atgttaatgc cgctattgag ttaaaaaaac 3780
cgctccctgt agcacagtgt caggtctacg caccagaaag ctggatgcca gtattaagtc 3840
agaaaacctt caatcgactg gcagaacaag attggcaaaa gttatataac gcccgaatca 3900
tcaccgaaga tggacgtatc tgcacactgg cttatcgccg ctgggtacaa agccaaaagg 3960
caatccctca actcattagc ctgttaaaca acggaccccg tttacctcct cacagcctga 4020
cattgacgac ggatcgttat gatcacgatc ctgagcaaca gatccgtcaa caggtggtat 4080
tcagtgatgg ctttggccgc ttgctgcaag ccgctgcccg acatgaggca ggcatggccc 4140
ggcaacgcaa tgaagacggc tctttgatta taaatgtcca gcatactgag aaccgttggg 4200
cagtgactgg acgaacggaa tatgacaata aggggcaacc gatacgtacc tatcagccct 4260
atttcctcaa tgactggcga tacgtcagca atgatagtgc ccggcaggaa aaagaagctt 4320
atgcagatac ccatgtctat gatcccatag gtcgagaaat caaggttatc accgcaaaag 4380
gttggttccg tcgaaccttg ttcactccct ggtttactgt caatgaagat gaaaatgaca 4440
cagccgctga ggtgaagaag gtaaagatga tgaaaaacat cgatcccaaa ctttatcaaa 4500
aaacccctac tgtcagcgtt tacgataacc gtggtctcat aatccgtaac atcgattttc 4560
atcgtactac cgcaaatggt gatcccgata cccgtattac ccgccatcaa tacgatattc 4620
acggacacct aaatcaaagc atcgatccgc gcctatatga agccaagcaa accaacaata 4680
cgatcaaacc caattttctt tggcagtatg atttgaccgg taatccccta tgtacagaga 4740
gcattgatgc aggtcgcact gtcaccttga atgatattga aggccgtccg ctactaacgg 4800
tgactgcaac aggggttata caaactcgac aatatgaaac ttcttccctg cccggtcgtc 4860
tgttatctgt tgccgaacaa acacccgagg aaaaaacatc ccgtatcacc gaacgcctga 4920
tttgggctgg caataccgaa gcagagaaag accataacct tgccggccag tgcgtgcgtc 4980
actatgacac ggcgggagtt acccggttag agagtttatc actgaccggt actgttttat 5040
ctcaatccag ccaactattg atcgacactc aagaggcaaa ctggacaggt gataacgaaa 5100
ccgtctggca aaacatgctg gctgatgaca tctacacaac cctgagcacc ttcgatgcca 5160
ccggtgcttt actgactcag accgatgcga aagggaacat tcagagactg gcttatgatg 5220
tggccgggca gctaaacggg agctggctaa cactcaaagg ccagacggaa caagtgatta 5280
tcaaatccct gacctactcc gccgccggac aaaaattacg tgaggaacac ggcaatgatg 5340
ttatcaccga atacagttat gaaccggaaa cccaacggct gatcggtatc aaaacccgcc 5400
gtccgtcaga cactaaagtg ctacaagacc tgcgctatga atatgacccg gtaggcaatg 5460
tcatcagcat ccgtaatgac gcggaagcca cccgcttttg gcacaatcag aaagtgatgc 5520
cggaaaacac ttatacctac gattccctgt atcagcttat cagcgccacc gggcgcgaaa 5580
tggcgaatat aggtcaacaa agtcaccaat ttccctcacc cgctctacct tctgataaca 5640
acacctatac caactatacc cgtacttata cttatgaccg tggcggcaat ctgaccaaaa 5700
tccagcacag ttcaccggcg acgcaaaaca actacaccac caatatcacg gtttcaaatc 5760
gcagcaaccg cgcagtactc agcacattga ccgaagatcc ggcgcaagta gatgctttgt 5820
ttgatgcagg cggacatcag aacaccttga tatcaggaca aaacctgaac tggaatactc 5880
gtggtgaact gcaacaagta acactggtta aacgggacaa gggcgccaat gatgatcggg 5940
aatggtatcg ttatagcggt gacggaagaa ggatgttaaa aatcaatgaa cagcaggcca 6000
gcaacaacgc tcaaacacaa cgtgtgactt atttgccgaa cttagaactt cgtctaacac 6060
aaaacagcac ggccacaacc gaagatttgc aagttatcac cgtaggcgaa gcgggccggg 6120
cacaggtacg agtattacat tgggagagcg gtaaaccgga agatatcgac aataatcagt 6180
tgcgttatag ttacgataat cttatcggtt ccagtcaact tgaattagat agcgaaggac 6240
aaattatcag tgaagaagaa tattatccct atggtggaac agcattatgg gccgccagga 6300
atcagacaga agccagttat aaaactatcc gttattcagg caaagagcgg gatgccaccg 6360
ggctatatta ctacggctat cggtattacc aaccgtggat aggacggtgg ttaagctccg 6420
atccggcagg aacaatcgat gggctgaatt tatatcggat ggtgaggaat aatccagtta 6480
ccctccttga tcctgatgga ttaatgccaa caattgcaga acgcatagca gcactaaaaa 6540
aaaataaagt aacagactca gcgccttcgc cagcaaatgc cacaaacgta gcgataaaca 6600
tccgcccgcc tgtagcacca aaacctagct taccgaaagc atcaacgagt agccaaccaa 6660
ccacacaccc tatcggagct gcaaacataa aaccaacgac gtctgggtca tctattgttg 6720
ctccattgag tccagtagga aataaatcta cttctgaaat ctctctgcca gaaagcgctc 6780
aaagcagttc ttcaagcact acctcgacaa atctacagaa aaaatcattt actttatata 6840
gagcagataa cagatccttt gaagaaatgc aaagtaaatt ccctgaagga tttaaagcct 6900
ggactcctct agacactaag atggcaaggc aatttgctag tatctttatt ggtcagaaag 6960
atacatctaa tttacctaaa gaaacagtca agaacataag cacatgggga gcaaagccaa 7020
aactaaaaga tctctcaaat tacataaaat ataccaagga caaatctaca gtatgggttt 7080
ctactgcaat taatactgaa gcaggtggac aaagctcagg ggctccactc cataaaattg 7140
atatggatct ctacgagttt gccattgatg gacaaaaact aaatccacta ccggagggta 7200
gaactaaaaa catggtacct tcccttttac tcgacacccc acaaatagag acatcatcca 7260
tcattgcact taatcatgga ccggtaaatg atgcagaaat ttcatttctg acaacaattc 7320
cgcttaaaaa tgtaaaacct cataagagat aattaatctg actcgag 7367
<210>48
<211>2434
<212>PRT
<213>人工序列
<220>
<223>TcdB2/TccC3融合蛋白pDAB8564
<400>48
Met Gln Asn Ser Gln Asp Phe Ser Ile Thr Glu Leu Ser Leu Pro Lys
1 5 10 15
Gly Gly Gly Ala Ile Thr Gly Met Gly Glu Ala Leu Thr Pro Thr Gly
20 25 30
Pro Asp Gly Met Ala Ala Leu Ser Leu Pro Leu Pro Ile Ser Ala Gly
35 40 45
Arg Gly Tyr Ala Pro Ala Phe Thr Leu Asn Tyr Asn Ser Gly Ala Gly
50 55 60
Asn Ser Pro Phe Gly Leu Gly Trp Asp Cys Asn Val Met Thr Ile Arg
65 70 75 80
Arg Arg Thr His Phe Gly Val Pro His Tyr Asp Glu Thr Asp Thr Phe
85 90 95
Leu Gly Pro Glu Gly Glu Val Leu Val Val Ala Asp Gln Pro Arg Asp
100 105 110
Glu Ser Thr Leu Gln Gly Ile Asn Leu Gly Ala Thr Phe Thr Val Thr
115 120 125
Gly Tyr Arg Ser Arg Leu Glu Ser His Phe Ser Arg Leu Glu Tyr Trp
130 135 140
Gln Pro Lys Thr Thr Gly Lys Thr Asp Phe Trp Leu Ile Tyr Ser Pro
145 150 155 160
Asp Gly Gln Val His Leu Leu Gly Lys Ser Pro Gln Ala Arg Ile Ser
165 170 175
Asn Pro Ser Gln Thr Thr Gln Thr Ala Gln Trp Leu Leu Glu Ala Ser
180 185 190
Val Ser Ser Arg Gly Glu Gln Ile Tyr Tyr Gln Tyr Arg Ala Glu Asp
195 200 205
Asp Thr Gly Cys Glu Ala Asp Glu Ile Thr His His Leu Gln Ala Thr
210 215 220
Ala Gln Arg Tyr Leu His Ile Val Tyr Tyr Gly Asn Arg Thr Ala Ser
225 230 235 240
Glu Thr Leu Pro Gly Leu Asp Gly Ser Ala Pro Ser Gln Ala Asp Trp
245 250 255
Leu Phe Tyr Leu Val Phe Asp Tyr Gly Glu Arg Ser Asn Asn Leu Lys
260 265 270
Thr Pro Pro Ala Phe Ser Thr Thr Gly Ser Trp Leu Cys Arg Gln Asp
275 280 285
Arg Phe Ser Arg Tyr Glu Tyr Gly Phe Glu Ile Arg Thr Arg Arg Leu
290 295 300
Cys Arg Gln Val Leu Met Tyr His His Leu Gln Ala Leu Asp Ser Lys
305 310 315 320
Ile Thr Glu His Asn Gly Pro Thr Leu Val Ser Arg Leu Ile Leu Asn
325 330 335
Tyr Asp Glu Ser Ala Ile Ala Ser Thr Leu Val Phe Val Arg Arg Val
340 345 350
Gly His Glu Gln Asp Gly Asn Val Val Thr Leu Pro Pro Leu Glu Leu
355 360 365
Ala Tyr Gln Asp Phe Ser Pro Arg His His Ala His Trp Gln Pro Met
370 375 380
Asp Val Leu Ala Asn Phe Asn Ala Ile Gln Arg Trp Gln Leu Val Asp
385 390 395 400
Leu Lys Gly Glu Gly Leu Pro Gly Leu Leu Tyr Gln Asp Lys Gly Ala
405 410 415
Trp Trp Tyr Arg Ser Ala Gln Arg Leu Gly Glu Ile Gly Ser Asp Ala
420 425 430
Val Thr Trp Glu Lys Met Gln Pro Leu Ser Val Ile Pro Ser Leu Gln
435 440 445
Ser Asn Ala Ser Leu Val Asp Ile Asn Gly Asp Gly Gln Leu Asp Trp
450 455 460
Val Ile Thr Gly Pro Gly Leu Arg Gly Tyr His Ser Gln Arg Pro Asp
465 470 475 480
Gly Ser Trp Thr Arg Phe Thr Pro Leu Asn Ala Leu Pro Val Glu Tyr
485 490 495
Thr His Pro Arg Ala Gln Leu Ala Asp Leu Met Gly Ala Gly Leu Ser
500 505 510
Asp Leu Val Leu Ile Gly Pro Lys Ser Val Arg Leu Tyr Ala Asn Thr
515 520 525
Arg Asp Gly Phe Ala Lys Gly Lys Asp Val Val Gln Ser Gly Asp Ile
530 535 540
Thr Leu Pro Val Pro Gly Ala Asp Pro Arg Lys Leu Val Ala Phe Ser
545 550 555 560
Asp Val Leu Gly Ser Gly Gln Ala His Leu Val Glu Val Ser Ala Thr
565 570 575
Lys Val Thr Cys Trp Pro Asn Leu Gly Arg Gly Arg Phe Gly Gln Pro
580 585 590
Ile Thr Leu Pro Gly Phe Ser Gln Pro Ala Thr Glu Phe Asn Pro Ala
595 600 605
Gln Val Tyr Leu Ala Asp Leu Asp Gly Ser Gly Pro Thr Asp Leu Ile
610 615 620
Tyr Val His Thr Asn Arg Leu Asp Ile Phe Leu Asn Lys Ser Gly Asn
625 630 635 640
Gly Phe Ala Glu Pro Val Thr Leu Arg Phe Pro Glu Gly Leu Arg Phe
645 650 655
Asp His Thr Cys Gln Leu Gln Met Ala Asp Val Gln Gly Leu Gly Val
660 665 670
Ala Ser Leu Ile Leu Ser Val Pro His Met Ser Pro His His Trp Arg
675 680 685
Cys Asp Leu Thr Asn Met Lys Pro Trp Leu Leu Asn Glu Met Asn Asn
690 695 700
Asn Met Gly Val His His Thr Leu Arg Tyr Arg Ser Ser Ser Gln Phe
705 710 715 720
Trp Leu Asp Glu Lys Ala Ala Ala Leu Thr Thr Gly Gln Thr Pro Val
725 730 735
Cys Tyr Leu Pro Phe Pro Ile His Thr Leu Trp Gln Thr Glu Thr Glu
740 745 750
Asp Glu Ile Ser Gly Asn Lys Leu Val Thr Thr Leu Arg Tyr Ala Arg
755 760 765
Gly Ala Trp Asp Gly Arg Glu Arg Glu Phe Arg Gly Phe Gly Tyr Val
770 775 780
Glu Gln Thr Asp Ser His Gln Leu Ala Gln Gly Asn Ala Pro Glu Arg
785 790 795 800
Thr Pro Pro Ala Leu Thr Lys Asn Trp Tyr Ala Thr Gly Leu Pro Val
805 810 815
Ile Asp Asn Ala Leu Ser Thr Glu Tyr Trp Arg Asp Asp Gln Ala Phe
820 825 830
Ala Gly Phe Ser Pro Arg Phe Thr Thr Trp Gln Asp Asn Lys Asp Val
835 840 845
Pro Leu Thr Pro Glu Asp Asp Asn Ser Arg Tyr Trp Phe Asn Arg Ala
850 855 860
Leu Lys Gly Gln Leu Leu Arg Ser Glu Leu Tyr Gly Leu Asp Asp Ser
865 870 875 880
Thr Asn Lys His Val Pro Tyr Thr Val Thr Glu Phe Arg Ser Gln Val
885 890 895
Arg Arg Leu Gln His Thr Asp Ser Arg Tyr Pro Val Leu Trp Ser Ser
900 905 910
Val Val Glu Ser Arg Asn Tyr His Tyr Glu Arg Ile Ala Ser Asp Pro
915 920 925
Gln Cys Ser Gln Asn Ile Thr Leu Ser Ser Asp Arg Phe Gly Gln Pro
930 935 940
Leu Lys Gln Leu Ser Val Gln Tyr Pro Arg Arg Gln Gln Pro Ala Ile
945 950 955 960
Asn Leu Tyr Pro Asp Thr Leu Pro Asp Lys Leu Leu Ala Asn Ser Tyr
965 970 975
Asp Asp Gln Gln Arg Gln Leu Arg Leu Thr Tyr Gln Gln Ser Ser Trp
980 985 990
His His Leu Thr Asn Asn Thr Val Arg Val Leu Gly Leu Pro Asp Ser
995 1000 1005
Thr Arg Ser Asp Ile Phe Thr Tyr Gly Ala Glu Asn Val Pro Ala
1010 1015 1020
Gly Gly Leu Asn Leu Glu Leu Leu Ser Asp Lys Asn Ser Leu Ile
1025 1030 1035
Ala Asp Asp Lys Pro Arg Glu Tyr Leu Gly Gln Gln Lys Thr Ala
1040 1045 1050
Tyr Thr Asp Gly Gln Asn Thr Thr Pro Leu Gln Thr Pro Thr Arg
1055 1060 1065
Gln Ala Leu Ile Ala Phe Thr Glu Thr Thr Val Phe Asn Gln Ser
1070 1075 1080
Thr Leu Ser Ala Phe Asn Gly Ser Ile Pro Ser Asp Lys Leu Ser
1085 1090 1095
Thr Thr Leu Glu Gln Ala Gly Tyr Gln Gln Thr Asn Tyr Leu Phe
1100 1105 1110
Pro Arg Thr Gly Glu Asp Lys Val Trp Val Ala His His Gly Tyr
1115 1120 1125
Thr Asp Tyr Gly Thr Ala Ala Gln Phe Trp Arg Pro Gln Lys Gln
1130 1135 1140
Ser Asn Thr Gln Leu Thr Gly Lys Ile Thr Leu Ile Trp Asp Ala
1145 1150 1155
Asn Tyr Cys Val Val Val Gln Thr Arg Asp Ala Ala Gly Leu Thr
1160 1165 1170
Thr Ser Ala Lys Tyr Asp Trp Arg Phe Leu Thr Pro Val Gln Leu
1175 1180 1185
Thr Asp Ile Asn Asp Asn Gln His Leu Ile Thr Leu Asp Ala Leu
1190 1195 1200
Gly Arg Pro Ile Thr Leu Arg Phe Trp Gly Thr Glu Asn Gly Lys
1205 1210 1215
Met Thr Gly Tyr Ser Ser Pro Glu Lys Ala Ser Phe Ser Pro Pro
1220 1225 1230
Ser Asp Val Asn Ala Ala Ile Glu Leu Lys Lys Pro Leu Pro Val
1235 1240 1245
Ala Gln Cys Gln Val Tyr Ala Pro Glu Ser Trp Met Pro Val Leu
1250 1255 1260
Ser Gln Lys Thr Phe Asn Arg Leu Ala Glu Gln Asp Trp Gln Lys
1265 1270 1275
Leu Tyr Asn Ala Arg Ile Ile Thr Glu Asp Gly Arg Ile Cys Thr
1280 1285 1290
Leu Ala Tyr Arg Arg Trp Val Gln Ser Gln Lys Ala Ile Pro Gln
1295 1300 1305
Leu Ile Ser Leu Leu Asn Asn Gly Pro Arg Leu Pro Pro His Ser
1310 1315 1320
Leu Thr Leu Thr Thr Asp Arg Tyr Asp His Asp Pro Glu Gln Gln
1325 1330 1335
Ile Arg Gln Gln Val Val Phe Ser Asp Gly Phe Gly Arg Leu Leu
1340 1345 1350
Gln Ala Ala Ala Arg His Glu Ala Gly Met Ala Arg Gln Arg Asn
1355 1360 1365
Glu Asp Gly Ser Leu Ile Ile Asn Val Gln His Thr Glu Asn Arg
1370 1375 1380
Trp Ala Val Thr Gly Arg Thr Glu Tyr Asp Asn Lys Gly Gln Pro
1385 1390 1395
Ile Arg Thr Tyr Gln Pro Tyr Phe Leu Asn Asp Trp Arg Tyr Val
1400 1405 1410
Ser Asn Asp Ser Ala Arg Gln Glu Lys Glu Ala Tyr Ala Asp Thr
1415 1420 1425
His Val Tyr Asp Pro Ile Gly Arg Glu Ile Lys Val Ile Thr Ala
1430 1435 1440
Lys Gly Trp Phe Arg Arg Thr Leu Phe Thr Pro Trp Phe Thr Val
1445 1450 1455
Asn Glu Asp Glu Asn Asp Thr Ala Ala Glu Val Lys Lys Val Lys
1460 1465 1470
Met Met Lys Asn Ile Asp Pro Lys Leu Tyr Gln Lys Thr Pro Thr
1475 1480 1485
Val Ser Val Tyr Asp Asn Arg Gly Leu Ile Ile Arg Asn Ile Asp
1490 1495 1500
Phe His Arg Thr Thr Ala Asn Gly Asp Pro Asp Thr Arg Ile Thr
1505 1510 1515
Arg His Gln Tyr Asp Ile His Gly His Leu Asn Gln Ser Ile Asp
1520 1525 1530
Pro Arg Leu Tyr Glu Ala Lys Gln Thr Asn Asn Thr Ile Lys Pro
1535 1540 1545
Asn Phe Leu Trp Gln Tyr Asp Leu Thr Gly Asn Pro Leu Cys Thr
1550 1555 1560
Glu Ser Ile Asp Ala Gly Arg Thr Val Thr Leu Asn Asp Ile Glu
1565 1570 1575
Gly Arg Pro Leu Leu Thr Val Thr Ala Thr Gly Val Ile Gln Thr
1580 1585 1590
Arg Gln Tyr Glu Thr Ser Ser Leu Pro Gly Arg Leu Leu Ser Val
1595 1600 1605
Ala Glu Gln Thr Pro Glu Glu Lys Thr Ser Arg Ile Thr Glu Arg
1610 1615 1620
Leu Ile Trp Ala Gly Asn Thr Glu Ala Glu Lys Asp His Asn Leu
1625 1630 1635
Ala Gly Gln Cys Val Arg His Tyr Asp Thr Ala Gly Val Thr Arg
1640 1645 1650
Leu Glu Ser Leu Ser Leu Thr Gly Thr Val Leu Ser Gln Ser Ser
1655 1660 1665
Gln Leu Leu Ile Asp Thr Gln Glu Ala Asn Trp Thr Gly Asp Asn
1670 1675 1680
Glu Thr Val Trp Gln Asn Met Leu Ala Asp Asp Ile Tyr Thr Thr
1685 1690 1695
Leu Ser Thr Phe Asp Ala Thr Gly Ala Leu Leu Thr Gln Thr Asp
1700 1705 1710
Ala Lys Gly Asn Ile Gln Arg Leu Ala Tyr Asp Val Ala Gly Gln
1715 1720 1725
Leu Asn Gly Ser Trp Leu Thr Leu Lys Gly Gln Thr Glu Gln Val
1730 1735 1740
Ile Ile Lys Ser Leu Thr Tyr Ser Ala Ala Gly Gln Lys Leu Arg
1745 1750 1755
Glu Glu His Gly Asn Asp Val Ile Thr Glu Tyr Ser Tyr Glu Pro
1760 1765 1770
Glu Thr Gln Arg Leu Ile Gly Ile Lys Thr Arg Arg Pro Ser Asp
1775 1780 1785
Thr Lys Val Leu Gln Asp Leu Arg Tyr Glu Tyr Asp Pro Val Gly
1790 1795 1800
Asn Val Ile Ser Ile Arg Asn Asp Ala Glu Ala Thr Arg Phe Trp
1805 1810 1815
His Asn Gln Lys Val Met Pro Glu Asn Thr Tyr Thr Tyr Asp Ser
1820 1825 1830
Leu Tyr Gln Leu Ile Ser Ala Thr Gly Arg Glu Met Ala Asn Ile
1835 1840 1845
Gly Gln Gln Ser His Gln Phe Pro Ser Pro Ala Leu Pro Ser Asp
1850 1855 1860
Asn Asn Thr Tyr Thr Asn Tyr Thr Arg Thr Tyr Thr Tyr Asp Arg
1865 1870 1875
Gly Gly Asn Leu Thr Lys Ile Gln His Ser Ser Pro Ala Thr Gln
1880 1885 1890
Asn Asn Tyr Thr Thr Asn Ile Thr Val Ser Asn Arg Ser Asn Arg
1895 1900 1905
Ala Val Leu Ser Thr Leu Thr Glu Asp Pro Ala Gln Val Asp Ala
1910 1915 1920
Leu Phe Asp Ala Gly Gly His Gln Asn Thr Leu Ile Ser Gly Gln
1925 1930 1935
Asn Leu Asn Trp Asn Thr Arg Gly Glu Leu Gln Gln Val Thr Leu
1940 1945 1950
Val Lys Arg Asp Lys Gly Ala Asn Asp Asp Arg Glu Trp Tyr Arg
1955 1960 1965
Tyr Ser Gly Asp Gly Arg Arg Met Leu Lys Ile Asn Glu Gln Gln
1970 1975 1980
Ala Ser Asn Asn Ala Gln Thr Gln Arg Val Thr Tyr Leu Pro Asn
1985 1990 1995
Leu Glu Leu Arg Leu Thr Gln Asn Ser Thr Ala Thr Thr Glu Asp
2000 2005 2010
Leu Gln Val Ile Thr Val Gly Glu Ala Gly Arg Ala Gln Val Arg
2015 2020 2025
Val Leu His Trp Glu Ser Gly Lys Pro Glu Asp Ile Asp Asn Asn
2030 2035 2040
Gln Leu Arg Tyr Ser Tyr Asp Asn Leu Ile Gly Ser Ser Gln Leu
2045 2050 2055
Glu Leu Asp Ser Glu Gly Gln Ile Ile Ser Glu Glu Glu Tyr Tyr
2060 2065 2070
Pro Tyr Gly Gly Thr Ala Leu Trp Ala Ala Arg Asn Gln Thr Glu
2075 2080 2085
Ala Ser Tyr Lys Thr Ile Arg Tyr Ser Gly Lys Glu Arg Asp Ala
2090 2095 2100
Thr Gly Leu Tyr Tyr Tyr Gly Tyr Arg Tyr Tyr Gln Pro Trp Ile
2105 2110 2115
Gly Arg Trp Leu Ser Ser Asp Pro Ala Gly Thr Ile Asp Gly Leu
2120 2125 2130
Asn Leu Tyr Arg Met Val Arg Asn Asn Pro Val Thr Leu Leu Asp
2135 2140 2145
Pro Asp Gly Leu Met Pro Thr Ile Ala Glu Arg Ile Ala Ala Leu
2150 2155 2160
Lys Lys Asn Lys Val Thr Asp Ser Ala Pro Ser Pro Ala Asn Ala
2165 2170 2175
Thr Asn Val Ala Ile Asn Ile Arg Pro Pro Val Ala Pro Lys Pro
2180 2185 2190
Ser Leu Pro Lys Ala Ser Thr Ser Ser Gln Pro Thr Thr His Pro
2195 2200 2205
Ile Gly Ala Ala Asn Ile Lys Pro Thr Thr Ser Gly Ser Ser Ile
2210 2215 2220
Val Ala Pro Leu Ser Pro Val Gly Asn Lys Ser Thr Ser Glu Ile
2225 2230 2235
Ser Leu Pro Glu Ser Ala Gln Ser Ser Ser Ser Ser Thr Thr Ser
2240 2245 2250
Thr Asn Leu Gln Lys Lys Ser Phe Thr Leu Tyr Arg Ala Asp Asn
2255 2260 2265
Arg Ser Phe Glu Glu Met Gln Ser Lys Phe Pro Glu Gly Phe Lys
2270 2275 2280
Ala Trp Thr Pro Leu Asp Thr Lys Met Ala Arg Gln Phe Ala Ser
2285 2290 2295
Ile Phe Ile Gly Gln Lys Asp Thr Ser Asn Leu Pro Lys Glu Thr
2300 2305 2310
Val Lys Asn Ile Ser Thr Trp Gly Ala Lys Pro Lys Leu Lys Asp
2315 2320 2325
Leu Ser Ash Tyr Ile Lys Tyr Thr Lys Asp Lys Ser Thr Val Trp
2330 2335 2340
Val Ser Thr Ala Ile Asn Thr Glu Ala Gly Gly Gln Ser Ser Gly
2345 2350 2355
Ala Pro Leu His Lys Ile Asp Met Asp Leu Tyr Glu Phe Ala Ile
2360 2365 2370
Asp Gly Gln Lys Leu Asn Pro Leu Pro Glu Gly Arg Thr Lys Asn
2375 2380 2385
Met Val Pro Ser Leu Leu Leu Asp Thr Pro Gln Ile Glu Thr Ser
2390 2395 2400
Ser Ile Ile Ala Leu Asn His Gly Pro Val Asn Asp Ala Glu Ile
2405 2410 2415
Ser Phe Leu Thr Thr Ile Pro Leu Lys Asn Val Lys Pro His Lys
2420 2425 2430
Arg
<210>49
<211>7382
<212>DNA
<213>人工序列
<220>
<223>编码TcdB2/TccC3融合蛋白pDAB 8940的核酸序列
<400>49
tctagactga gtcgacgcac tactagtaac aaagaaggag atataccatg caaaattcac 60
aagattttag tattacggaa ctgtcactgc ccaaaggggg gggcgctatc acgggaatgg 120
gtgaagcatt aacccccact ggaccggatg gtatggccgc gctatctcta ccattgccta 180
tttctgccgg gcgcggttat gctcccgcat tcactctgaa ttacaacagc ggcgccggta 240
acagtccatt tggtctgggt tgggattgca acgttatgac tatccgccgc cgcacccatt 300
ttggcgtccc ccattatgac gaaaccgata cctttttggg gccagaaggc gaagtgctgg 360
tggtagcgga tcaacctcgc gacgaatcca cattacaggg tatcaattta ggcgccacct 420
ttaccgttac cggctaccgt tcccgtctgg aaagccattt cagccgattg gaatattggc 480
aacccaaaac aacaggtaaa acagattttt ggttgatata tagcccagat gggcaggtgc 540
atctactggg taaatcaccg caagcgcgga tcagcaaccc atcccaaacg acacaaacag 600
cacaatggct gctggaagcc tctgtatcat cacgtggcga acaaatttat tatcaatatc 660
gcgccgaaga tgacacaggt tgcgaagcag atgaaattac gcaccattta caggctacag 720
cgcaacgtta tttacacatc gtgtattacg gcaaccgtac agccagcgaa acattacccg 780
gtctggatgg cagcgcccca tcacaagcag actggttgtt ctatctggta tttgattacg 840
gcgaacgcag taacaacctg aaaacgccac cagcattttc gactacaggt agctggcttt 900
gccgtcagga ccgtttttcc cgttatgaat atggctttga gattcgtacc cgccgcttat 960
gccgtcaggt attgatgtac catcacctgc aagcactgga tagtaagata acagaacaca 1020
acggaccaac gctggtttca cgcctgatac tcaattacga cgaaagcgcg atagccagca 1080
cgctagtatt cgttcgccga gtgggacacg agcaagatgg taatgtcgtc accctgccgc 1140
cattagaatt ggcatatcag gatttttcac cgcgacatca cgctcactgg caaccaatgg 1200
atgtactggc aaacttcaat gccattcagc gctggcagct agtcgatcta aaaggcgaag 1260
gattacccgg cctgttatat caggataaag gcgcttggtg gtaccgctcc gcacagcgtc 1320
tgggcgaaat tggctcagat gccgtcactt gggaaaagat gcaaccttta tcggttattc 1380
cttctttgca aagtaatgcc tcgttggtgg atatcaatgg agacggccaa cttgactggg 1440
ttatcaccgg accgggatta cggggatatc atagtcaacg cccggatggc agttggacac 1500
gttttacccc actcaacgct ctgccggtgg aatacaccca tccacgcgcg caactcgcag 1560
atttaatggg agccgggcta tccgatttgg tgctgatcgg ccctaagagc gtgcgtttat 1620
atgccaatac ccgcgacggc tttgccaaag gaaaagatgt ggtgcaatcc ggtgatatca 1680
cactgccggt gccgggcgcc gatccacgta agttggtggc gtttagtgat gtattgggtt 1740
caggtcaagc ccatctggtt gaagtaagcg cgactaaagt cacctgctgg cctaatctgg 1800
ggcgcggacg ttttggtcaa cccattacct taccgggatt cagccagcca gcaaccgagt 1860
ttaacccggc tcaagtttat ctggccgatc tggatggcag cggtccaacg gatctgattt 1920
atgttcatac aaaccgtctg gatatcttcc tgaacaaaag tggcaatggc tttgctgaac 1980
cagtgacatt acgcttcccg gaaggtctgc gttttgatca tacctgtcag ttacaaatgg 2040
ccgatgtaca aggattaggc gtcgccagcc tgatactgag cgtgccgcat atgtctcccc 2100
atcactggcg ctgcgatctg accaacatga agccgtggtt actcaatgaa atgaacaaca 2160
atatgggggt ccatcacacc ttgcgttacc gcagttcctc ccaattctgg ctggatgaaa 2220
aagccgcggc gctgactacc ggacaaacac cggtttgcta tctccccttc ccgatccaca 2280
ccctatggca aacggaaaca gaagatgaaa tcagcggcaa caaattagtc acaacacttc 2340
gttatgctcg tggcgcatgg gacggacgcg agcgggaatt tcgcggattt ggttatgtag 2400
agcagacaga cagccatcaa ctggctcaag gcaacgcgcc agaacgtacg ccaccggcgc 2460
tgaccaaaaa ctggtatgcc accggactgc cggtgataga taacgcatta tcaaccgagt 2520
attggcgtga tgatcaggct tttgccggtt tctcaccgcg ctttacgact tggcaagata 2580
acaaagatgt cccgttaaca ccggaagatg ataacagtcg ttactggttc aaccgcgcgt 2640
tgaaaggtca actgctacgt agtgaactgt acggattgga cgatagtaca aataaacacg 2700
ttccctatac tgtcactgaa tttcgttcac aggtacgtcg attacagcat accgacagcc 2760
gataccctgt actttggtca tctgtagttg aaagccgcaa ctatcactac gaacgtatcg 2820
ccagcgaccc gcaatgcagt caaaatatta cgctatccag tgatcgattt ggtcagccgc 2880
taaaacagct ttcggtacag tacccgcgcc gccagcagcc agcaatcaat ctgtatcctg 2940
atacattgcc tgataagttg ttagccaaca gctatgatga ccaacaacgc caattacggc 3000
tcacctatca acaatccagt tggcatcacc tgaccaacaa taccgttcga gtattgggat 3060
taccggatag tacccgcagt gatatcttta cttatggcgc tgaaaatgtg cctgctggtg 3120
gtttaaatct ggaacttctg agtgataaaa atagcctgat cgcggacgat aaaccacgtg 3180
aatacctcgg tcagcaaaaa accgcttata ccgatggaca aaatacaacg ccgttgcaaa 3240
caccaacacg gcaagccctg attgccttta ccgaaacaac ggtattcaac cagtccacat 3300
tatcagcgtt taacggaagc atcccgtccg ataaattatc aacgacgctg gagcaagctg 3360
gatatcagca aacaaattat ctattccctc gcactggaga agataaagtt tgggtagccc 3420
atcacggcta taccgattat ggtacagcgg cacagttctg gcgcccgcaa aaacagagca 3480
acacccaact caccggtaaa atcaccctca tctgggatgc aaactattgc gttgtggtac 3540
aaacccggga tgctgctgga ctgacaacct cagccaaata tgactggcgt tttctgaccc 3600
cggtgcaact caccgatatc aatgacaatc agcaccttat cacactggat gcattgggcc 3660
gaccaatcac attgcgcttt tggggaactg aaaacggcaa gatgacaggt tattcctcac 3720
cggaaaaagc atcattttct ccaccatccg atgttaatgc cgctattgag ttaaaaaaac 3780
cgctccctgt agcacagtgt caggtctacg caccagaaag ctggatgcca gtattaagtc 3840
agaaaacctt caatcgactg gcagaacaag attggcaaaa gttatataac gcccgaatca 3900
tcaccgaaga tggacgtatc tgcacactgg cttatcgccg ctgggtacaa agccaaaagg 3960
caatccctca actcattagc ctgttaaaca acggaccccg tttacctcct cacagcctga 4020
cattgacgac ggatcgttat gatcacgatc ctgagcaaca gatccgtcaa caggtggtat 4080
tcagtgatgg ctttggccgc ttgctgcaag ccgctgcccg acatgaggca ggcatggccc 4140
ggcaacgcaa tgaagacggc tctttgatta taaatgtcca gcatactgag aaccgttggg 4200
cagtgactgg acgaacggaa tatgacaata aggggcaacc gatacgtacc tatcagccct 4260
atttcctcaa tgactggcga tacgtcagca atgatagtgc ccggcaggaa aaagaagctt 4320
atgcagatac ccatgtctat gatcccatag gtcgagaaat caaggttatc accgcaaaag 4380
gttggttccg tcgaaccttg ttcactccct ggtttactgt caatgaagat gaaaatgaca 4440
cagccgctga ggtgaagaag gtaaagatgc cgggatctag gcctatgaaa aacatcgatc 4500
ccaaacttta tcaaaaaacc cctactgtca gcgtttacga taaccgtggt ctgataatcc 4560
gtaacatcga ttttcatcgt actaccgcaa atggtgatcc cgatacccgt attacccgcc 4620
atcaatacga tattcacgga cacctaaatc aaagcatcga tccgcgccta tatgaagcca 4680
agcaaaccaa caatacgatc aaacccaatt ttctttggca gtatgatttg accggtaatc 4740
ccctatgtac agagagcatt gatgcaggtc gcactgtcac cttgaatgat attgaaggcc 4800
gtccgctact aacggtgact gcaacagggg ttatacaaac tcgacaatat gaaacttctt 4860
ccctgcccgg tcgtctgtta tctgttgccg aacaaacacc cgaggaaaaa acatcccgta 4920
tcaccgaacg cctgatttgg gctggcaata ccgaagcaga gaaagaccat aaccttgccg 4980
gccagtgcgt gcgtcactat gacacggcgg gagttacccg gttagagagt ttatcactga 5040
ccggtactgt tttatctcaa tccagccaac tattgatcga cactcaagag gcaaactgga 5100
caggtgataa cgaaaccgtc tggcaaaaca tgctggctga tgacatctac acaaccctga 5160
gcaccttcga tgccaccggt gctttactga ctcagaccga tgcgaaaggg aacattcaga 5220
gactggctta tgatgtggcc gggcagctaa acgggagctg gctaacactc aaaggccaga 5280
cggaacaagt gattatcaaa tccctgacct actccgccgc cggacaaaaa ttacgtgagg 5340
aacacggcaa tgatgttatc accgaataca gttatgaacc ggaaacccaa cggctgatcg 5400
gtatcaaaac ccgccgtccg tcagacacta aagtgctaca agacctgcgc tatgaatatg 5460
acccggtagg caatgtcatc agcatccgta atgacgcgga agccacccgc ttttggcaca 5520
atcagaaagt gatgccggaa aacacttata cctacgattc cctgtatcag cttatcagcg 5580
ccaccgggcg cgaaatggcg aatataggtc aacaaagtca ccaatttccc tcacccgctc 5640
taccttctga taacaacacc tataccaact atacccgtac ttatacttat gaccgtggcg 5700
gcaatctgac caaaatccag cacagttcac cggcgacgca aaacaactac accaccaata 5760
tcacggtttc aaatcgcagc aaccgcgcag tactcagcac attgaccgaa gatccggcgc 5820
aagtagatgc tttgtttgat gcaggcggac atcagaacac cttgatatca ggacaaaacc 5880
tgaactggaa tactcgtggt gaactgcaac aagtaacact ggttaaacgg gacaagggcg 5940
ccaatgatga tcgggaatgg tatcgttata gcggtgacgg aagaaggatg ttaaaaatca 6000
atgaacagca ggccagcaac aacgctcaaa cacaacgtgt gacttatttg ccgaacttag 6060
aacttcgtct aacacaaaac agcacggcca caaccgaaga tttgcaagtt atcaccgtag 6120
gcgaagcggg ccgggcacag gtacgagtat tacattggga gagcggtaaa ccggaagata 6180
tcgacaataa tcagttgcgt tatagttacg ataatcttat cggttccagt caacttgaat 6240
tagatagcga aggacaaatt atcagtgaag aagaatatta tccctatggt ggaacagcat 6300
tatgggccgc caggaatcag acagaagcca gttataaaac tatccgttat tcaggcaaag 6360
agcgggatgc caccgggcta tattactacg gctatcggta ttaccaaccg tggataggac 6420
ggtggttaag ctccgatccg gcaggaacaa tcgatgggct gaatttatat cggatggtga 6480
ggaataatcc agttaccctc cttgatcctg atggattaat gccaacaatt gcagaacgca 6540
tagcagcact aaaaaaaaat aaagtaacag actcagcgcc ttcgccagca aatgccacaa 6600
acgtagcgat aaacatccgc ccgcctgtag caccaaaacc tagcttaccg aaagcatcaa 6660
cgagtagcca accaaccaca caccctatcg gagctgcaaa cataaaacca acgacgtctg 6720
ggtcatctat tgttgctcca ttgagtccag taggaaataa atctacttct gaaatctctc 6780
tgccagaaag cgctcaaagc agttcttcaa gcactacctc gacaaatcta cagaaaaaat 6840
catttacttt atatagagca gataacagat cctttgaaga aatgcaaagt aaattccctg 6900
aaggatttaa agcctggact cctctagaca ctaagatggc aaggcaattt gctagtatct 6960
ttattggtca gaaagataca tctaatttac ctaaagaaac agtcaagaac ataagcacat 7020
ggggagcaaa gccaaaacta aaagatctct caaattacat aaaatatacc aaggacaaat 7080
ctacagtatg ggtttctact gcaattaata ctgaagcagg tggacaaagc tcaggggctc 7140
cactccataa aattgatatg gatctctacg agtttgccat tgatggacaa aaactaaatc 7200
cactaccgga gggtagaact aaaaacatgg taccttccct tttactcgac accccacaaa 7260
tagagacatc atccatcatt gcacttaatc atggaccggt aaatgatgca gaaatttcat 7320
ttctgacaac aattccgctt aaaaatgtaa aacctcataa gagataatta atctgactcg 7380
ag 7382
<210>50
<211>2439
<212>PRT
<213>人工序列
<220>
<223>TcdB2/TccC3融合蛋白pDAB8940
<400>50
Met Gln Asn Ser Gln Asp Phe Ser Ile Thr Glu Leu Ser Leu Pro Lys
1 5 10 15
Gly Gly Gly Ala Ile Thr Gly Met Gly Glu Ala Leu Thr Pro Thr Gly
20 25 30
Pro Asp Gly Met Ala Ala Leu Ser Leu Pro Leu Pro Ile Ser Ala Gly
35 40 45
Arg Gly Tyr Ala Pro Ala Phe Thr Leu Asn Tyr Asn Ser Gly Ala Gly
50 55 60
Asn Ser Pro Phe Gly Leu Gly Trp Asp Cys Asn Val Met Thr Ile Arg
65 70 75 80
Arg Arg Thr His Phe Gly Val Pro His Tyr Asp Glu Thr Asp Thr Phe
85 90 95
Leu Gly Pro Glu Gly Glu Val Leu Val Val Ala Asp Gln Pro Arg Asp
100 105 110
Glu Ser Thr Leu Gln Gly Ile Asn Leu Gly Ala Thr Phe Thr Val Thr
115 120 125
Gly Tyr Arg Ser Arg Leu Glu Ser His Phe Ser Arg Leu Glu Tyr Trp
130 135 140
Gln Pro Lys Thr Thr Gly Lys Thr Asp Phe Trp Leu Ile Tyr Ser Pro
145 150 155 160
Asp Gly Gln Val His Leu Leu Gly Lys Ser Pro Gln Ala Arg Ile Ser
165 170 175
Asn Pro Ser Gln Thr Thr Gln Thr Ala Gln Trp Leu Leu Glu Ala Ser
180 185 190
Val Ser Ser Arg Gly Glu Gln Ile Tyr Tyr Gln Tyr Arg Ala Glu Asp
195 200 205
Asp Thr Gly Cys Glu Ala Asp Glu Ile Thr His His Leu Gln Ala Thr
210 215 220
Ala Gln Arg Tyr Leu His Ile Val Tyr Tyr Gly Asn Arg Thr Ala Ser
225 230 235 240
Glu Thr Leu Pro Gly Leu Asp Gly Ser Ala Pro Ser Gln Ala Asp Trp
245 250 255
Leu Phe Tyr Leu Val Phe Asp Tyr Gly Glu Arg Ser Asn Asn Leu Lys
260 265 270
Thr Pro Pro Ala Phe Ser Thr Thr Gly Ser Trp Leu Cys Arg Gln Asp
275 280 285
Arg Phe Ser Arg Tyr Glu Tyr Gly Phe Glu Ile Arg Thr Arg Arg Leu
290 295 300
Cys Arg Gln Val Leu Met Tyr His His Leu Gln Ala Leu Asp Ser Lys
305 310 315 320
Ile Thr Glu His Asn Gly Pro Thr Leu Val Ser Arg Leu Ile Leu Asn
325 330 335
Tyr Asp Glu Ser Ala Ile Ala Ser Thr Leu Val Phe Val Arg Arg Val
340 345 350
Gly His Glu Gln Asp Gly Asn Val Val Thr Leu Pro Pro Leu Glu Leu
355 360 365
Ala Tyr Gln Asp Phe Ser Pro Arg His His Ala His Trp Gln Pro Met
370 375 380
Asp Val Leu Ala Asn Phe Asn Ala Ile Gln Arg Trp Gln Leu Val Asp
385 390 395 400
Leu Lys Gly Glu Gly Leu Pro Gly Leu Leu Tyr Gln Asp Lys Gly Ala
405 410 415
Trp Trp Tyr Arg Ser Ala Gln Arg Leu Gly Glu Ile Gly Ser Asp Ala
420 425 430
Val Thr Trp Glu Lys Met Gln Pro Leu Ser Val Ile Pro Ser Leu Gln
435 440 445
Ser Asn Ala Ser Leu Val Asp Ile Asn Gly Asp Gly Gln Leu Asp Trp
450 455 460
Val Ile Thr Gly Pro Gly Leu Arg Gly Tyr His Ser Gln Arg Pro Asp
465 470 475 480
Gly Ser Trp Thr Arg Phe Thr Pro Leu Asn Ala Leu Pro Val Glu Tyr
485 490 495
Thr His Pro Arg Ala Gln Leu Ala Asp Leu Met Gly Ala Gly Leu Ser
500 505 510
Asp Leu Val Leu Ile Gly Pro Lys Ser Val Arg Leu Tyr Ala Asn Thr
515 520 525
Arg Asp Gly Phe Ala Lys Gly Lys Asp Val Val Gln Ser Gly Asp Ile
530 535 540
Thr Leu Pro Val Pro Gly Ala Asp Pro Arg Lys Leu Val Ala Phe Ser
545 550 555 560
Asp Val Leu Gly Ser Gly Gln Ala His Leu Val Glu Val Ser Ala Thr
565 570 575
Lys Val Thr Cys Trp Pro Asn Leu Gly Arg Gly Arg Phe Gly Gln Pro
580 585 590
Ile Thr Leu Pro Gly Phe Ser Gln Pro Ala Thr Glu Phe Asn Pro Ala
595 600 605
Gln Val Tyr Leu Ala Asp Leu Asp Gly Ser Gly Pro Thr Asp Leu Ile
610 615 620
Tyr Val His Thr Asn Arg Leu Asp Ile Phe Leu Asn Lys Ser Gly Asn
625 630 635 640
Gly Phe Ala Glu Pro Val Thr Leu Arg Phe Pro Glu Gly Leu Arg Phe
645 650 655
Asp His Thr Cys Gln Leu Gln Met Ala Asp Val Gln Gly Leu Gly Val
660 665 670
Ala Ser Leu Ile Leu Ser Val Pro His Met Ser Pro His His Trp Arg
675 680 685
Cys Asp Leu Thr Asn Met Lys Pro Trp Leu Leu Asn Glu Met Asn Asn
690 695 700
Asn Met Gly Val His His Thr Leu Arg Tyr Arg Ser Ser Ser Gln Phe
705 710 715 720
Trp Leu Asp Glu Lys Ala Ala Ala Leu Thr Thr Gly Gln Thr Pro Val
725 730 735
Cys Tyr Leu Pro Phe Pro Ile His Thr Leu Trp Gln Thr Glu Thr Glu
740 745 750
Asp Glu Ile Ser Gly Asn Lys Leu Val Thr Thr Leu Arg Tyr Ala Arg
755 760 765
Gly Ala Trp Asp Gly Arg Glu Arg Glu Phe Arg Gly Phe Gly Tyr Val
770 775 780
Glu Gln Thr Asp Ser His Gln Leu Ala Gln Gly Asn Ala Pro Glu Arg
785 790 795 800
Thr Pro Pro Ala Leu Thr Lys Asn Trp Tyr Ala Thr Gly Leu Pro Val
805 810 815
Ile Asp Asn Ala Leu Ser Thr Glu Tyr Trp Arg Asp Asp Gln Ala Phe
820 825 830
Ala Gly Phe Ser Pro Arg Phe Thr Thr Trp Gln Asp Asn Lys Asp Val
835 840 845
Pro Leu Thr Pro Glu Asp Asp Asn Ser Arg Tyr Trp Phe Asn Arg Ala
850 855 860
Leu Lys Gly Gln Leu Leu Arg Ser Glu Leu Tyr Gly Leu Asp Asp Ser
865 870 875 880
Thr Asn Lys His Val Pro Tyr Thr Val Thr Glu Phe Arg Ser Gln Val
885 890 895
Arg Arg Leu Gln His Thr Asp Ser Arg Tyr Pro Val Leu Trp Ser Ser
900 905 910
Val Val Glu Ser Arg Asn Tyr His Tyr Glu Arg Ile Ala Ser Asp Pro
915 920 925
Gln Cys Ser Gln Asn Ile Thr Leu Ser Ser Asp Arg Phe Gly Gln Pro
930 935 940
Leu Lys Gln Leu Ser Val Gln Tyr Pro Arg Arg Gln Gln Pro Ala Ile
945 950 955 960
Asn Leu Tyr Pro Asp Thr Leu Pro Asp Lys Leu Leu Ala Asn Ser Tyr
965 970 975
Asp Asp Gln Gln Arg Gln Leu Arg Leu Thr Tyr Gln Gln Ser Ser Trp
980 985 990
His His Leu Thr Asn Asn Thr Val Arg Val Leu Gly Leu Pro Asp Ser
995 1000 1005
Thr Arg Ser Asp Ile Phe Thr Tyr Gly Ala Glu Asn Val Pro Ala
1010 1015 1020
Gly Gly Leu Asn Leu Glu Leu Leu Ser Asp Lys Asn Ser Leu Ile
1025 1030 1035
Ala Asp Asp Lys Pro Arg Glu Tyr Leu Gly Gln Gln Lys Thr Ala
1040 1045 1050
Tyr Thr Asp Gly Gln Asn Thr Thr Pro Leu Gln Thr Pro Thr Arg
1055 1060 1065
Gln Ala Leu Ile Ala Phe Thr Glu Thr Thr Val Phe Asn Gln Ser
1070 1075 1080
Thr Leu Ser Ala Phe Asn Gly Ser Ile Pro Ser Asp Lys Leu Ser
1085 1090 1095
Thr Thr Leu Glu Gln Ala Gly Tyr Gln Gln Thr Asn Tyr Leu Phe
1100 1105 1110
Pro Arg Thr Gly Glu Asp Lys Val Trp Val Ala His His Gly Tyr
1115 1120 1125
Thr Asp Tyr Gly Thr Ala Ala Gln Phe Trp Arg Pro Gln Lys Gln
1130 1135 1140
Ser Asn Thr Gln Leu Thr Gly Lys Ile Thr Leu Ile Trp Asp Ala
1145 1150 1155
Asn Tyr Cys Val Val Val Gln Thr Arg Asp Ala Ala Gly Leu Thr
1160 1165 1170
Thr Ser Ala Lys Tyr Asp Trp Arg Phe Leu Thr Pro Val Gln Leu
1175 1180 1185
Thr Asp Ile Asn Asp Asn Gln His Leu Ile Thr Leu Asp Ala Leu
1190 1195 1200
Gly Arg Pro Ile Thr Leu Arg Phe Trp Gly Thr Glu Asn Gly Lys
1205 1210 1215
Met Thr Gly Tyr Ser Ser Pro Glu Lys Ala Ser Phe Ser Pro Pro
1220 1225 1230
Ser Asp Val Asn Ala Ala Ile Glu Leu Lys Lys Pro Leu Pro Val
1235 1240 1245
Ala Gln Cys Gln Val Tyr Ala Pro Glu Ser Trp Met Pro Val Leu
1250 1255 1260
Ser Gln Lys Thr Phe Asn Arg Leu Ala Glu Gln Asp Trp Gln Lys
1265 1270 1275
Leu Tyr Asn Ala Arg Ile Ile Thr Glu Asp Gly Arg Ile Cys Thr
1280 1285 1290
Leu Ala Tyr Arg Arg Trp Val Gln Ser Gln Lys Ala Ile Pro Gln
1295 1300 1305
Leu Ile Ser Leu Leu Asn Asn Gly Pro Arg Leu Pro Pro His Ser
1310 1315 1320
Leu Thr Leu Thr Thr Asp Arg Tyr Asp His Asp Pro Glu Gln Gln
1325 1330 1335
Ile Arg Gln Gln Val Val Phe Ser Asp Gly Phe Gly Arg Leu Leu
1340 1345 1350
Gln Ala Ala Ala Arg His Glu Ala Gly Met Ala Arg Gln Arg Asn
1355 1360 1365
Glu Asp Gly Ser Leu Ile Ile Asn Val Gln His Thr Glu Asn Arg
1370 1375 1380
Trp Ala Val Thr Gly Arg Thr Glu Tyr Asp Asn Lys Gly Gln Pro
1385 1390 1395
Ile Arg Thr Tyr Gln Pro Tyr Phe Leu Asn Asp Trp Arg Tyr Val
1400 1405 1410
Ser Asn Asp Ser Ala Arg Gln Glu Lys Glu Ala Tyr Ala Asp Thr
1415 1420 1425
His Val Tyr Asp Pro Ile Gly Arg Glu Ile Lys Val Ile Thr Ala
1430 1435 1440
Lys Gly Trp Phe Arg Arg Thr Leu Phe Thr Pro Trp Phe Thr Val
1445 1450 1455
Asn Glu Asp Glu Asn Asp Thr Ala Ala Glu Val Lys Lys Val Lys
1460 1465 1470
Met Pro Gly Ser Arg Pro Met Lys Asn Ile Asp Pro Lys Leu Tyr
1475 1480 1485
Gln Lys Thr Pro Thr Val Ser Val Tyr Asp Asn Arg Gly Leu Ile
1490 1495 1500
Ile Arg Asn Ile Asp Phe His Arg Thr Thr Ala Asn Gly Asp Pro
1505 1510 1515
Asp Thr Arg Ile Thr Arg His Gln Tyr Asp Ile His Gly His Leu
1520 1525 1530
Asn Gln Ser Ile Asp Pro Arg Leu Tyr Glu Ala Lys Gln Thr Asn
1535 1540 1545
Asn Thr Ile Lys Pro Asn Phe Leu Trp Gln Tyr Asp Leu Thr Gly
1550 1555 1560
Asn Pro Leu Cys Thr Glu Ser Ile Asp Ala Gly Arg Thr Val Thr
1565 1570 1575
Leu Asn Asp Ile Glu Gly Arg Pro Leu Leu Thr Val Thr Ala Thr
1580 1585 1590
Gly Val Ile Gln Thr Arg Gln Tyr Glu Thr Ser Ser Leu Pro Gly
1595 1600 1605
Arg Leu Leu Ser Val Ala Glu Gln Thr Pro Glu Glu Lys Thr Ser
1610 1615 1620
Arg Ile Thr Glu Arg LeuIle Trp Ala Gly Asn Thr Glu Ala Glu
1625 1630 1635
Lys Asp His Asn Leu Ala Gly Gln Cys Val Arg His Tyr Asp Thr
1640 1645 1650
Ala Gly Val Thr Arg Leu Glu Ser Leu Ser Leu Thr Gly Thr Val
1655 1660 1665
Leu Ser Gln Ser Ser Gln Leu Leu Ile Asp Thr Gln Glu Ala Asn
1670 1675 1680
Trp Thr Gly Asp Asn Glu Thr Val Trp Gln Asn Met Leu Ala Asp
1685 1690 1695
Asp Ile Tyr Thr Thr Leu Ser Thr Phe Asp Ala Thr Gly Ala Leu
1700 1705 1710
Leu Thr Gln Thr Asp Ala Lys Gly Asn Ile Gln Arg Leu Ala Tyr
1715 1720 1725
Asp Val Ala Gly Gln Leu Asn Gly Ser Trp Leu Thr Leu Lys Gly
1730 1735 1740
Gln Thr Glu Gln Val Ile Ile Lys Ser Leu Thr Tyr Ser Ala Ala
1745 1750 1755
Gly Gln Lys Leu Arg Glu Glu His Gly Asn Asp Val Ile Thr Glu
1760 1765 1770
Tyr Ser Tyr Glu Pro Glu Thr Gln Arg Leu Ile Gly Ile Lys Thr
1775 1780 1785
Arg Arg Pro Ser Asp Thr Lys Val Leu Gln Asp Leu Arg Tyr Glu
1790 1795 1800
Tyr Asp Pro Val Gly Asn Val Ile Ser Ile Arg Asn Asp Ala Glu
1805 18l0 1815
Ala Thr Arg Phe Trp His Asn Gln Lys Val Met Pro Glu Asn Thr
1820 1825 1830
Tyr Thr Tyr Asp Ser Leu Tyr Gln Leu Ile Ser Ala Thr Gly Arg
1835 1840 1845
Glu Met Ala Asn Ile Gly Gln Gln Ser His Gln Phe Pro Ser Pro
1850 1855 1860
Ala Leu Pro Ser Asp Asn Asn Thr Tyr Thr Asn Tyr Thr Arg Thr
1865 1870 1875
Tyr Thr Tyr Asp Arg Gly Gly Asn Leu Thr Lys Ile Gln His Ser
1880 1885 1890
Ser Pro Ala Thr Gln Asn Asn Tyr Thr Thr Asn Ile Thr Val Ser
1895 1900 1905
Asn Arg Ser Asn Arg Ala Val Leu Ser Thr Leu Thr Glu Asp Pro
1910 1915 1920
Ala Gln Val Asp Ala Leu Phe Asp Ala Gly Gly His Gln Asn Thr
1925 1930 1935
Leu Ile Ser Gly Gln Asn Leu Asn Trp Asn Thr Arg Gly Glu Leu
1940 1945 1950
Gln Gln Val Thr Leu Val Lys Arg Asp Lys Gly Ala Asn Asp Asp
1955 1960 1965
Arg Glu Trp Tyr Arg Tyr Ser Gly Asp Gly Arg Arg Met Leu Lys
1970 1975 1980
Ile Asn Glu Gln Gln Ala Ser Asn Asn Ala Gln Thr Gln Arg Val
1985 1990 1995
Thr Tyr Leu Pro Asn Leu Glu Leu Arg Leu Thr Gln Asn Ser Thr
2000 2005 2010
Ala Thr Thr Glu Asp Leu Gln Val Ile Thr Val Gly Glu Ala Gly
2015 2020 2025
Arg Ala Gln Val Arg Val Leu His Trp Glu Ser Gly Lys Pro Glu
2030 2035 2040
Asp Ile Asp Asn Asn Gln Leu Arg Tyr Ser Tyr Asp Asn Leu Ile
2045 2050 2055
Gly Ser Ser Gln Leu Glu Leu Asp Ser Glu Gly Gln Ile Ile Ser
2060 2065 2070
Glu Glu Glu Tyr Tyr Pro Tyr Gly Gly Thr Ala Leu Trp Ala Ala
2075 2080 2085
Arg Asn Gln Thr Glu Ala Ser Tyr Lys Thr Ile Arg Tyr Ser Gly
2090 2095 2100
Lys Glu Arg Asp Ala Thr Gly Leu Tyr Tyr Tyr Gly Tyr Arg Tyr
2105 2110 2115
Tyr Gln Pro Trp Ile Gly Arg Trp Leu Ser Ser Asp Pro Ala Gly
2120 2125 2130
Thr Ile Asp Gly Leu Asn Leu Tyr Arg Met Val Arg Asn Asn Pro
2135 2140 2145
Val Thr Leu Leu Asp Pro Asp Gly Leu Met Pro Thr Ile Ala Glu
2150 2155 2160
Arg Ile Ala Ala Leu Lys Lys Asn Lys Val Thr Asp Ser Ala Pro
2165 2170 2175
Ser Pro Ala Asn Ala Thr Asn Val Ala Ile Asn Ile Arg Pro Pro
2180 2185 2190
Val Ala Pro Lys Pro Ser Leu Pro Lys Ala Ser Thr Ser Ser Gln
2195 2200 2205
Pro Thr Thr His Pro Ile Gly Ala Ala Asn Ile Lys Pro Thr Thr
2210 2215 2220
Ser Gly Ser Ser Ile Val Ala Pro Leu Ser Pro Val Gly Asn Lys
2225 2230 2235
Ser Thr Ser Glu Ile Ser Leu Pro Glu Ser Ala Gln Ser Ser Ser
2240 2245 2250
Ser Ser Thr Thr Ser Thr Asn Leu Gln Lys Lys Ser Phe Thr Leu
2255 2260 2265
Tyr Arg Ala Asp Asn Arg Ser Phe Glu Glu Met Gln Ser Lys Phe
2270 2275 2280
Pro Glu Gly Phe Lys Ala Trp Thr Pro Leu Asp Thr Lys Met Ala
2285 2290 2295
Arg Gln Phe Ala Ser Ile Phe Ile Gly Gln Lys Asp Thr Ser Asn
2300 2305 2310
Leu Pro Lys Glu Thr Val Lys Asn Ile Ser Thr Trp Gly Ala Lys
2315 2320 2325
Pro Lys Leu Lys Asp Leu Ser Asn Tyr Ile Lys Tyr Thr Lys Asp
2330 2335 2340
Lys Ser Thr Val Trp Val Ser Thr Ala Ile Asn Thr Glu Ala Gly
2345 2350 2355
Gly Gln Ser Ser Gly Ala Pro Leu His Lys Ile Asp Met Asp Leu
2360 2365 2370
Tyr Glu Phe Ala Ile Asp Gly Gln Lys Leu Asn Pro Leu Pro Glu
2375 2380 2385
Gly Arg Thr Lys Asn Met Val Pro Ser Leu Leu Leu Asp Thr Pro
2390 2395 2400
Gln Ile Glu Thr Ser Ser Ile Ile Ala Leu Asn His Gly Pro Val
2405 2410 2415
Asn Asp Ala Glu Ile Ser Phe Leu Thr Thr Ile Pro Leu Lys Asn
2420 2425 2430
Val Lys Pro His Lys Arg
2435
<210>51
<211>7409
<212>DNA
<213>人工序列
<220>
<223>编码TcdB2/TccC3融合蛋白pDAB 8920的核酸序列
<400>51
tctagactga gtcgacgcac tactagtaac aaagaaggag atataccatg caaaattcac 60
aagattttag tattacggaa ctgtcactgc ccaaaggggg gggcgctatc acgggaatgg 120
gtgaagcatt aacccccact ggaccggatg gtatggccgc gctatctcta ccattgccta 180
tttctgccgg gcgcggttat gctcccgcat tcactctgaa ttacaacagc ggcgccggta 240
acagtccatt tggtctgggt tgggattgca acgttatgac tatccgccgc cgcacccatt 300
ttggcgtccc ccattatgac gaaaccgata cctttttggg gccagaaggc gaagtgctgg 360
tggtagcgga tcaacctcgc gacgaatcca cattacaggg tatcaattta ggcgccacct 420
ttaccgttac cggctaccgt tcccgtctgg aaagccattt cagccgattg gaatattggc 480
aacccaaaac aacaggtaaa acagattttt ggttgatata tagcccagat gggcaggtgc 540
atctactggg taaatcaccg caagcgcgga tcagcaaccc atcccaaacg acacaaacag 600
cacaatggct gctggaagcc tctgtatcat cacgtggcga acaaatttat tatcaatatc 660
gcgccgaaga tgacacaggt tgcgaagcag atgaaattac gcaccattta caggctacag 720
cgcaacgtta tttacacatc gtgtattacg gcaaccgtac agccagcgaa acattacccg 780
gtctggatgg cagcgcccca tcacaagcag actggttgtt ctatctggta tttgattacg 840
gcgaacgcag taacaacctg aaaacgccac cagcattttc gactacaggt agctggcttt 900
gccgtcagga ccgtttttcc cgttatgaat atggctttga gattcgtacc cgccgcttat 960
gccgtcaggt attgatgtac catcacctgc aagcactgga tagtaagata acagaacaca 1020
acggaccaac gctggtttca cgcctgatac tcaattacga cgaaagcgcg atagccagca 1080
cgctagtatt cgttcgccga gtgggacacg agcaagatgg taatgtcgtc accctgccgc 1140
cattagaatt ggcatatcag gatttttcac cgcgacatca cgctcactgg caaccaatgg 1200
atgtactggc aaacttcaat gccattcagc gctggcagct agtcgatcta aaaggcgaag 1260
gattacccgg cctgttatat caggataaag gcgcttggtg gtaccgctcc gcacagcgtc 1320
tgggcgaaat tggctcagat gccgtcactt gggaaaagat gcaaccttta tcggttattc 1380
cttctttgca aagtaatgcc tcgttggtgg atatcaatgg agacggccaa cttgactggg 1440
ttatcaccgg accgggatta cggggatatc atagtcaacg cccggatggc agttggacac 1500
gttttacccc actcaacgct ctgccggtgg aatacaccca tccacgcgcg caactcgcag 1560
atttaatggg agccgggcta tccgatttgg tgctgatcgg ccctaagagc gtgcgtttat 1620
atgccaatac ccgcgacggc tttgccaaag gaaaagatgt ggtgcaatcc ggtgatatca 1680
cactgccggt gccgggcgcc gatccacgta agttggtggc gtttagtgat gtattgggtt 1740
caggtcaagc ccatctggtt gaagtaagcg cgactaaagt cacctgctgg cctaatctgg 1800
ggcgcggacg ttttggtcaa cccattacct taccgggatt cagccagcca gcaaccgagt 1860
ttaacccggc tcaagtttat ctggccgatc tggatggcag cggtccaacg gatctgattt 1920
atgttcatac aaaccgtctg gatatcttcc tgaacaaaag tggcaatggc tttgctgaac 1980
cagtgacatt acgcttcccg gaaggtctgc gttttgatca tacctgtcag ttacaaatgg 2040
ccgatgtaca aggattaggc gtcgccagcc tgatactgag cgtgccgcat atgtctcccc 2100
atcactggcg ctgcgatctg accaacatga agccgtggtt actcaatgaa atgaacaaca 2160
atatgggggt ccatcacacc ttgcgttacc gcagttcctc ccaattctgg ctggatgaaa 2220
aagccgcggc gctgactacc ggacaaacac cggtttgcta tctccccttc ccgatccaca 2280
ccctatggca aacggaaaca gaagatgaaa tcagcggcaa caaattagtc acaacacttc 2340
gttatgctcg tggcgcatgg gacggacgcg agcgggaatt tcgcggattt ggttatgtag 2400
agcagacaga cagccatcaa ctggctcaag gcaacgcgcc agaacgtacg ccaccggcgc 2460
tgaccaaaaa ctggtatgcc accggactgc cggtgataga taacgcatta tcaaccgagt 2520
attggcgtga tgatcaggct tttgccggtt tctcaccgcg ctttacgact tggcaagata 2580
acaaagatgt cccgttaaca ccggaagatg ataacagtcg ttactggttc aaccgcgcgt 2640
tgaaaggtca actgctacgt agtgaactgt acggattgga cgatagtaca aataaacacg 2700
ttccctatac tgtcactgaa tttcgttcac aggtacgtcg attacagcat accgacagcc 2760
gataccctgt actttggtca tctgtagttg aaagccgcaa ctatcactac gaacgtatcg 2820
ccagcgaccc gcaatgcagt caaaatatta cgctatccag tgatcgattt ggtcagccgc 2880
taaaacagct ttcggtacag tacccgcgcc gccagcagcc agcaatcaat ctgtatcctg 2940
atacattgcc tgataagttg ttagccaaca gctatgatga ccaacaacgc caattacggc 3000
tcacctatca acaatccagt tggcatcacc tgaccaacaa taccgttcga gtattgggat 3060
taccggatag tacccgcagt gatatcttta cttatggcgc tgaaaatgtg cctgctggtg 3120
gtttaaatct ggaacttctg agtgataaaa atagcctgat cgcggacgat aaaccacgtg 3180
aatacctcgg tcagcaaaaa accgcttata ccgatggaca aaatacaacg ccgttgcaaa 3240
caccaacacg gcaagccctg attgccttta ccgaaacaac ggtattcaac cagtccacat 3300
tatcagcgtt taacggaagc atcccgtccg ataaattatc aacgacgctg gagcaagctg 3360
gatatcagca aacaaattat ctattccctc gcactggaga agataaagtt tgggtagccc 3420
atcacggcta taccgattat ggtacagcgg cacagttctg gcgcccgcaa aaacagagca 3480
acacccaact caccggtaaa atcaccctca tctgggatgc aaactattgc gttgtggtac 3540
aaacccggga tgctgctgga ctgacaacct cagccaaata tgactggcgt tttctgaccc 3600
cggtgcaact caccgatatc aatgacaatc agcaccttat cacactggat gcattgggcc 3660
gaccaatcac attgcgcttt tggggaactg aaaacggcaa gatgacaggt tattcctcac 3720
cggaaaaagc atcattttct ccaccatccg atgttaatgc cgctattgag ttaaaaaaac 3780
cgctccctgt agcacagtgt caggtctacg caccagaaag ctggatgcca gtattaagtc 3840
agaaaacctt caatcgactg gcagaacaag attggcaaaa gttatataac gcccgaatca 3900
tcaccgaaga tggacgtatc tgcacactgg cttatcgccg ctgggtacaa agccaaaagg 3960
caatccctca actcattagc ctgttaaaca acggaccccg tttacctcct cacagcctga 4020
cattgacgac ggatcgttat gatcacgatc ctgagcaaca gatccgtcaa caggtggtat 4080
tcagtgatgg ctttggccgc ttgctgcaag ccgctgcccg acatgaggca ggcatggccc 4140
ggcaacgcaa tgaagacggc tctttgatta taaatgtcca gcatactgag aaccgttggg 4200
cagtgactgg acgaacggaa tatgacaata aggggcaacc gatacgtacc tatcagccct 4260
atttcctcaa tgactggcga tacgtcagca atgatagtgc ccggcaggaa aaagaagctt 4320
atgcagatac ccatgtctat gatcccatag gtcgagaaat caaggttatc accgcaaaag 4380
gttggttccg tcgaaccttg ttcactccct ggtttactgt caatgaagat gaaaatgaca 4440
cagccgctga ggtgaagaag gtaaagatgc cgggatccga caacaagggt cagactatcc 4500
gcactaggcc tatgaaaaac atcgatccca aactttatca aaaaacccct actgtcagcg 4560
tttacgataa ccgtggtctg ataatccgta acatcgattt tcatcgtact accgcaaatg 4620
gtgatcccga tacccgtatt acccgccatc aatacgatat tcacggacac ctaaatcaaa 4680
gcatcgatcc gcgcctatat gaagccaagc aaaccaacaa tacgatcaaa cccaattttc 4740
tttggcagta tgatttgacc ggtaatcccc tatgtacaga gagcattgat gcaggtcgca 4800
ctgtcacctt gaatgatatt gaaggccgtc cgctactaac ggtgactgca acaggggtta 4860
tacaaactcg acaatatgaa acttcttccc tgcccggtcg tctgttatct gttgccgaac 4920
aaacacccga ggaaaaaaca tcccgtatca ccgaacgcct gatttgggct ggcaataccg 4980
aagcagagaa agaccataac cttgccggcc agtgcgtgcg tcactatgac acggcgggag 5040
ttacccggtt agagagttta tcactgaccg gtactgtttt atctcaatcc agccaactat 5100
tgatcgacac tcaagaggca aactggacag gtgataacga aaccgtctgg caaaacatgc 5160
tggctgatga catctacaca accctgagca ccttcgatgc caccggtgct ttactgactc 5220
agaccgatgc gaaagggaac attcagagac tggcttatga tgtggccggg cagctaaacg 5280
ggagctggct aacactcaaa ggccagacgg aacaagtgat tatcaaatcc ctgacctact 5340
ccgccgccgg acaaaaatta cgtgaggaac acggcaatga tgttatcacc gaatacagtt 5400
atgaaccgga aacccaacgg ctgatcggta tcaaaacccg ccgtccgtca gacactaaag 5460
tgctacaaga cctgcgctat gaatatgacc cggtaggcaa tgtcatcagc atccgtaatg 5520
acgcggaagc cacccgcttt tggcacaatc agaaagtgat gccggaaaac acttatacct 5580
acgattccct gtatcagctt atcagcgcca ccgggcgcga aatggcgaat ataggtcaac 5640
aaagtcacca atttccctca cccgctctac cttctgataa caacacctat accaactata 5700
cccgtactta tacttatgac cgtggcggca atctgaccaa aatccagcac agttcaccgg 5760
cgacgcaaaa caactacacc accaatatca cggtttcaaa tcgcagcaac cgcgcagtac 5820
tcagcacatt gaccgaagat ccggcgcaag tagatgcttt gtttgatgca ggcggacatc 5880
agaacacctt gatatcagga caaaacctga actggaatac tcgtggtgaa ctgcaacaag 5940
taacactggt taaacgggac aagggcgcca atgatgatcg ggaatggtat cgttatagcg 6000
gtgacggaag aaggatgtta aaaatcaatg aacagcaggc cagcaacaac gctcaaacac 6060
aacgtgtgac ttatttgccg aacttagaac ttcgtctaac acaaaacagc acggccacaa 6120
ccgaagattt gcaagttatc accgtaggcg aagcgggccg ggcacaggta cgagtattac 6180
attgggagag cggtaaaccg gaagatatcg acaataatca gttgcgttat agttacgata 6240
atcttatcgg ttccagtcaa cttgaattag atagcgaagg acaaattatc agtgaagaag 6300
aatattatcc ctatggtgga acagcattat gggccgccag gaatcagaca gaagccagtt 6360
ataaaactat ccgttattca ggcaaagagc gggatgccac cgggctatat tactacggct 6420
atcggtatta ccaaccgtgg ataggacggt ggttaagctc cgatccggca ggaacaatcg 6480
atgggctgaa tttatatcgg atggtgagga ataatccagt taccctcctt gatcctgatg 6540
gattaatgcc aacaattgca gaacgcatag cagcactaaa aaaaaataaa gtaacagact 6600
cagcgccttc gccagcaaat gccacaaacg tagcgataaa catccgcccg cctgtagcac 6660
caaaacctag cttaccgaaa gcatcaacga gtagccaacc aaccacacac cctatcggag 6720
ctgcaaacat aaaaccaacg acgtctgggt catctattgt tgctccattg agtccagtag 6780
gaaataaatc tacttctgaa atctctctgc cagaaagcgc tcaaagcagt tcttcaagca 6840
ctacctcgac aaatctacag aaaaaatcat ttactttata tagagcagat aacagatcct 6900
ttgaagaaat gcaaagtaaa ttccctgaag gatttaaagc ctggactcct ctagacacta 6960
agatggcaag gcaatttgct agtatcttta ttggtcagaa agatacatct aatttaccta 7020
aagaaacagt caagaacata agcacatggg gagcaaagcc aaaactaaaa gatctctcaa 7080
attacataaa atataccaag gacaaatcta cagtatgggt ttctactgca attaatactg 7140
aagcaggtgg acaaagctca ggggctccac tccataaaat tgatatggat ctctacgagt 7200
ttgccattga tggacaaaaa ctaaatccac taccggaggg tagaactaaa aacatggtac 7260
cttccctttt actcgacacc ccacaaatag agacatcatc catcattgca cttaatcatg 7320
gaccggtaaa tgatgcagaa atttcatttc tgacaacaat tccgcttaaa aatgtaaaac 7380
ctcataagag ataattaatc tgactcgag 7409
<210> 52
<211> 2448
<212> PRT
<213> 人工序列
<220>
<223> TcdB2/TccC3融合蛋白pDAB8920
<400> 52
Met Gln Asn Ser Gln Asp Phe Ser Ile Thr Glu Leu Ser Leu Pro Lys
1 5 10 15
Gly Gly Gly Ala Ile Thr Gly Met Gly Glu Ala Leu Thr Pro Thr Gly
20 25 30
Pro Asp Gly Met Ala Ala Leu Ser Leu Pro Leu Pro Ile Ser Ala Gly
35 40 45
Arg Gly Tyr Ala Pro Ala Phe Thr Leu Asn Tyr Asn Ser Gly Ala Gly
50 55 60
Asn Ser Pro Phe Gly Leu Gly Trp Asp Cys Asn Val Met Thr Ile Arg
65 70 75 80
Arg Arg Thr His Phe Gly Val Pro His Tyr Asp Glu Thr Asp Thr Phe
85 90 95
Leu Gly Pro Glu Gly Glu Val Leu Val Val Ala Asp Gln Pro Arg Asp
100 105 110
Glu Ser Thr Leu Gln Gly Ile Asn Leu Gly Ala Thr Phe Thr Val Thr
115 120 125
Gly Tyr Arg Ser Arg Leu Glu Ser His Phe Ser Arg Leu Glu Tyr Trp
130 135 140
Gln Pro Lys Thr Thr Gly Lys Thr Asp Phe Trp Leu Ile Tyr Ser Pro
145 150 155 160
Asp Gly Gln Val His Leu Leu Gly Lys Ser Pro Gln Ala Arg Ile Ser
165 170 175
Asn Pro Ser Gln Thr Thr Gln Thr Ala Gln Trp Leu Leu Glu Ala Ser
180 185 190
Val Ser Ser Arg Gly Glu Gln Ile Tyr Tyr Gln Tyr Arg Ala Glu Asp
195 200 205
Asp Thr Gly Cys Glu Ala Asp Glu Ile Thr His His Leu Gln Ala Thr
210 215 220
Ala Gln Arg Tyr Leu His Ile Val Tyr Tyr Gly Asn Arg Thr Ala Ser
225 230 235 240
Glu Thr Leu Pro Gly Leu Asp Gly Ser Ala Pro Ser Gln Ala Asp Trp
245 250 255
Leu Phe Tyr Leu Val Phe Asp Tyr Gly Glu Arg Ser Asn Asn Leu Lys
260 265 270
Thr Pro Pro Ala Phe Ser Thr Thr Gly Ser Trp Leu Cys Arg Gln Asp
275 280 285
Arg Phe Ser Arg Tyr Glu Tyr Gly Phe Glu Ile Arg Thr Arg Arg Leu
290 295 300
Cys Arg Gln Val Leu Met Tyr His His Leu Gln Ala Leu Asp Ser Lys
305 310 315 320
Ile Thr Glu His Asn Gly Pro Thr Leu Val Ser Arg Leu Ile Leu Asn
325 330 335
Tyr Asp Glu Ser Ala Ile Ala Ser Thr Leu Val Phe Val Arg Arg Val
340 345 350
Gly His Glu Gln Asp Gly Asn Val Val Thr Leu Pro Pro Leu Glu Leu
355 360 365
Ala Tyr Gln Asp Phe Ser Pro Arg His His Ala His Trp Gln Pro Met
370 375 380
Asp Val Leu Ala Asn Phe Asn Ala Ile Gln Arg Trp Gln Leu Val Asp
385 390 395 400
Leu Lys Gly Glu Gly Leu Pro Gly Leu Leu Tyr Gln Asp Lys Gly Ala
405 410 415
Trp Trp Tyr Arg Ser Ala Gln Arg Leu Gly Glu Ile Gly Ser Asp Ala
420 425 430
Val Thr Trp Glu Lys Met Gln Pro Leu Ser Val Ile Pro Ser Leu Gln
435 440 445
Ser Asn Ala Ser Leu Val Asp Ile Asn Gly Asp Gly Gln Leu Asp Trp
450 455 460
Val Ile Thr Gly Pro Gly Leu Arg Gly Tyr His Ser Gln Arg Pro Asp
465 470 475 480
Gly Ser Trp Thr Arg Phe Thr Pro Leu Asn Ala Leu Pro Val Glu Tyr
485 490 495
Thr His Pro Arg Ala Gln Leu Ala Asp Leu Met Gly Ala Gly Leu Ser
500 505 510
Asp Leu Val Leu Ile Gly Pro Lys Ser Val Arg Leu Tyr Ala Asn Thr
515 520 525
Arg Asp Gly Phe Ala Lys Gly Lys Asp Val Val Gln Ser Gly Asp Ile
530 535 540
Thr Leu Pro Val Pro Gly Ala Asp Pro Arg Lys Leu Val Ala Phe Ser
545 550 555 560
Asp Val Leu Gly Ser Gly Gln Ala His Leu Val Glu Val Ser Ala Thr
565 570 575
Lys Val Thr Cys Trp Pro Asn Leu Gly Arg Gly Arg Phe Gly Gln Pro
580 585 590
Ile Thr Leu Pro Gly Phe Ser Gln Pro Ala Thr Glu Phe Asn Pro Ala
595 600 605
Gln Val Tyr Leu Ala Asp Leu Asp Gly Ser Gly Pro Thr Asp Leu Ile
610 615 620
Tyr Val His Thr Asn Arg Leu Asp Ile Phe Leu Asn Lys Ser Gly Asn
625 630 635 640
Gly Phe Ala Glu Pro Val Thr Leu Arg Phe Pro Glu Gly Leu Arg Phe
645 650 655
Asp His Thr Cys Gln Leu Gln Met Ala Asp Val Gln Gly Leu Gly Val
660 665 670
Ala Ser Leu Ile Leu Ser Val Pro His Met Ser Pro His His Trp Arg
675 680 685
Cys Asp Leu Thr Asn Met Lys Pro Trp Leu Leu Asn Glu Met Asn Asn
690 695 700
Asn Met Gly Val His His Thr Leu Arg Tyr Arg Ser Ser Ser Gln Phe
705 710 715 720
Trp Leu Asp Glu Lys Ala Ala Ala Leu Thr Thr Gly Gln Thr Pro Val
725 730 735
Cys Tyr Leu Pro Phe Pro Ile His Thr Leu Trp Gln Thr Glu Thr Glu
740 745 750
Asp Glu Ile Ser Gly Asn Lys Leu Val Thr Thr Leu Arg Tyr Ala Arg
755 760 765
Gly Ala Trp Asp Gly Arg Glu Arg Glu Phe Arg Gly Phe Gly Tyr Val
770 775 780
Glu Gln Thr Asp Ser His Gln Leu Ala Gln Gly Asn Ala Pro Glu Arg
785 790 795 800
Thr Pro Pro Ala Leu Thr Lys Asn Trp Tyr Ala Thr Gly Leu Pro Val
805 810 815
Ile Asp Asn Ala Leu Ser Thr Glu Tyr Trp Arg Asp Asp Gln Ala Phe
820 825 830
Ala Gly Phe Ser Pro Arg Phe Thr Thr Trp Gln Asp Asn Lys Asp Val
835 840 845
Pro Leu Thr Pro Glu Asp Asp Asn Ser Arg Tyr Trp Phe Asn Arg Ala
850 855 860
Leu Lys Gly Gln Leu Leu Arg Ser Glu Leu Tyr Gly Leu Asp Asp Ser
865 870 875 880
Thr Asn Lys His Val Pro Tyr Thr Val Thr Glu Phe Arg Ser Gln Val
885 890 895
Arg Arg Leu Gln His Thr Asp Ser Arg Tyr Pro Val Leu Trp Ser Ser
900 905 910
Val Val Glu Ser Arg Asn Tyr His Tyr Glu Arg Ile Ala Ser Asp Pro
915 920 925
Gln Cys Ser Gln Asn Ile Thr Leu Ser Ser Asp Arg Phe Gly Gln Pro
930 935 940
Leu Lys Gln Leu Ser Val Gln Tyr Pro Arg Arg Gln Gln Pro Ala Ile
945 950 955 960
Asn Leu Tyr Pro Asp Thr Leu Pro Asp Lys Leu Leu Ala Asn Ser Tyr
965 970 975
Asp Asp Gln Gln Arg Gln Leu Arg Leu Thr Tyr Gln Gln Ser Ser Trp
980 985 990
His His Leu Thr Asn Asn Thr Val Arg Val Leu Gly Leu Pro Asp Ser
995 1000 1005
Thr Arg Ser Asp Ile Phe Thr Tyr Gly Ala Glu Asn Val Pro Ala
1010 1015 1020
Gly Gly Leu Asn Leu Glu Leu Leu Ser Asp Lys Asn Ser Leu Ile
1025 1030 1035
Ala Asp Asp Lys Pro Arg Glu Tyr Leu Gly Gln Gln Lys Thr Ala
1040 1045 1050
Tyr Thr Asp Gly Gln Asn Thr Thr Pro Leu Gln Thr Pro Thr Arg
1055 1060 1065
Gln Ala Leu Ile Ala Phe Thr Glu Thr Thr Val Phe Asn Gln Ser
1070 1075 1080
Thr Leu Ser Ala Phe Asn Gly Ser Ile Pro Ser Asp Lys Leu Ser
1085 1090 1095
Thr Thr Leu Glu Gln Ala Gly Tyr Gln Gln Thr Asn Tyr Leu Phe
1100 1105 1110
Pro Arg Thr Gly Glu Asp Lys Val Trp Val Ala His His Gly Tyr
1115 1120 1125
Thr Asp Tyr Gly Thr Ala Ala Gln Phe Trp Arg Pro Gln Lys Gln
1130 1135 1140
Ser Asn Thr Gln Leu Thr Gly Lys Ile Thr Leu Ile Trp Asp Ala
1145 1150 1155
Asn Tyr Cys Val Val Val Gln Thr Arg Asp Ala Ala Gly Leu Thr
1160 1165 1170
Thr Ser Ala Lys Tyr Asp Trp Arg Phe Leu Thr Pro Val Gln Leu
1175 1180 1185
Thr Asp Ile Asn Asp Asn Gln His Leu Ile Thr Leu Asp Ala Leu
1190 1195 1200
Gly Arg Pro Ile Thr Leu Arg Phe Trp Gly Thr Glu Asn Gly Lys
1205 1210 1215
Met Thr Gly Tyr Ser Ser Pro Glu Lys Ala Ser Phe Ser Pro Pro
1220 1225 1230
Ser Asp Val Asn Ala Ala Ile Glu Leu Lys Lys Pro Leu Pro Val
1235 1240 1245
Ala Gln Cys Gln Val Tyr Ala Pro Glu Ser Trp Met Pro Val Leu
1250 1255 1260
Ser Gln Lys Thr Phe Asn Arg Leu Ala Glu Gln Asp Trp Gln Lys
1265 1170 1275
Leu Tyr Asn Ala Arg Ile Ile Thr Glu Asp Gly Arg Ile Cys Thr
1280 1285 1290
Leu Ala Tyr Arg Arg Trp Val Gln Ser Gln Lys Ala Ile Pro Gln
1295 1300 1305
Leu Ile Ser Leu Leu Asn Asn Gly Pro Arg Leu Pro Pro His Ser
1310 1315 1320
Leu Thr Leu Thr Thr Asp Arg Tyr Asp His Asp Pro Glu Gln Gln
1325 1330 1335
Ile Arg Gln Gln Val Val Phe Ser Asp Gly Phe Gly Arg Leu Leu
1340 1345 1350
Gln Ala Ala Ala Arg His Glu Ala Gly Met Ala Arg Gln Arg Asn
1355 1360 1365
Glu Asp Gly Ser Leu Ile Ile Asn Val Gln His Thr Glu Asn Arg
1370 1375 1380
Trp Ala Val Thr Gly Arg Thr Glu Tyr Asp Asn Lys Gly Gln Pro
1385 1390 1395
Ile Arg Thr Tyr Gln Pro Tyr Phe Leu Asn Asp Trp Arg Tyr Val
1400 1405 1410
Ser Asn Asp Ser Ala Arg Gln Glu Lys Glu Ala Tyr Ala Asp Thr
1415 1420 1425
His Val Tyr Asp Pro Ile Gly Arg Glu Ile Lys Val Ile Thr Ala
1430 1435 1440
Lys Gly Trp Phe Arg Arg Thr Leu Phe Thr Pro Trp Phe Thr Val
1445 1450 1455
Asn Glu Asp Glu Asn Asp Thr Ala Ala Glu Val Lys Lys Val Lys
1460 1465 1470
Met Pro Gly Ser Asp Asn Lys Gly Gln Thr Ile Arg Thr Arg Pro
1475 1480 1485
Met Lys Asn Ile Asp Pro Lys Leu Tyr Gln Lys Thr Pro Thr Val
1490 1495 1500
Ser Val Tyr Asp Asn Arg Gly Leu Ile Ile Arg Asn Ile Asp Phe
1505 1510 1515
His Arg Thr Thr Ala Asn Gly Asp Pro Asp Thr Arg Ile Thr Arg
1520 1525 1530
His Gln Tyr Asp Ile His Gly His Leu Asn Gln Ser Ile Asp Pro
1535 1540 1545
Arg Leu Tyr Glu Ala Lys Gln Thr Asn Asn Thr Ile Lys Pro Asn
1550 1555 1560
Phe Leu Trp Gln Tyr Asp Leu Thr Gly Asn Pro Leu Cys Thr Glu
1565 1570 1575
Ser Ile Asp Ala Gly Arg Thr Val Thr Leu Asn Asp Ile Glu Gly
1580 1585 1590
Arg Pro Leu Leu Thr Val Thr Ala Thr Gly Val Ile Gln Thr Arg
1595 1600 1605
Gln Tyr Glu Thr Ser Ser Leu Pro Gly Arg Leu Leu Ser Val Ala
1610 1615 1620
Glu Gln Thr Pro Glu Glu Lys Thr Ser Arg Ile Thr Glu Arg Leu
1625 1630 1635
Ile Trp Ala Gly Asn Thr Glu Ala Glu Lys Asp His Asn Leu Ala
1640 1645 1650
Gly Gln Cys Val Arg His Tyr Asp Thr Ala Gly Val Thr Arg Leu
1655 1660 1665
Glu Ser Leu Ser Leu Thr Gly Thr Val Leu Ser Gln Ser Ser Gln
1670 1675 1680
Leu Leu Ile Asp Thr Gln Glu Ala Asn Trp Thr Gly Asp Asn Glu
1685 1690 1695
Thr Val Trp Gln Asn Met Leu Ala Asp Asp Ile Tyr Thr Thr Leu
1700 1705 1710
Ser Thr Phe Asp Ala Thr Gly Ala Leu Leu Thr Gln Thr Asp Ala
1715 1720 1725
Lys Gly Asn Ile Gln Arg Leu Ala Tyr Asp Val Ala Gly Gln Leu
1730 1735 1740
Asn Gly Ser Trp Leu Thr Leu Lys Gly Gln Thr Glu Gln Val Ile
1745 1750 1755
Ile Lys Ser Leu Thr Tyr Ser Ala Ala Gly Gln Lys Leu Arg Glu
1760 1765 1770
Glu His Gly Asn Asp Val Ile Thr Glu Tyr Ser Tyr Glu Pro Glu
1775 1780 1785
Thr Gln Arg Leu Ile Gly Ile Lys Thr Arg Arg Pro Ser Asp Thr
1790 1795 1800
Lys Val Leu Gln Asp Leu Arg Tyr Glu Tyr Asp Pro Val Gly Asn
1805 1810 1815
Val Ile Ser Ile Arg Asn Asp Ala Glu Ala Thr Arg Phe Trp His
1820 1825 1830
Asn Gln Lys Val Met Pro Glu Asn Thr Tyr Thr Tyr Asp Ser Leu
1835 1840 1845
Tyr Gln Leu Ile Ser Ala Thr Gly Arg Glu Met Ala Asn Ile Gly
1850 1855 1860
Gln Gln Ser His Gln Phe Pro Ser Pro Ala Leu Pro Ser Asp Asn
1865 1870 1875
Asn Thr Tyr Thr Asn Tyr Thr Arg Thr Tyr Thr Tyr Asp Arg Gly
1880 1885 1890
Gly Asn Leu Thr Lys Ile Gln His Ser Ser Pro Ala Thr Gln Asn
1895 1900 1905
Asn Tyr Thr Thr Asn Ile Thr Val Ser Asn Arg Ser Asn Arg Ala
1910 1915 1920
Val Leu Ser Thr Leu Thr Glu Asp Pro Ala Gln Val Asp Ala Leu
1925 1930 1935
Phe Asp Ala Gly Gly His Gln Asn Thr Leu Ile Ser Gly Gln Asn
1940 1945 1950
Leu Asn Trp Asn Thr Arg Gly Glu Leu Gln Gln Val Thr Leu Val
1955 1960 1965
Lys Arg Asp Lys Gly Ala Asn Asp Asp Arg Glu Trp Tyr Arg Tyr
1970 1975 1980
Ser Gly Asp Gly Arg Arg Met Leu Lys Ile Asn Glu Gln Gln Ala
1985 1990 1995
Ser Asn Asn Ala Gln Thr Gln Arg Val Thr Tyr Leu Pro Asn Leu
2000 2005 2010
Glu Leu Arg Leu Thr Gln Asn Ser Thr Ala Thr Thr Glu Asp Leu
2015 2020 2025
Gln Val Ile Thr Val Gly Glu Ala Gly Arg Ala Gln Val Arg Val
2030 2035 2040
Leu His Trp Glu Ser Gly Lys Pro Glu Asp Ile Asp Asn Asn Gln
2045 2050 2055
Leu Arg Tyr Ser Tyr Asp Asn Leu Ile Gly Ser Ser Gln Leu Glu
2060 2065 2070
Leu Asp Ser Glu Gly Gln Ile Ile Ser Glu Glu Glu Tyr Tyr Pro
2075 2080 2085
Tyr Gly Gly Thr Ala Leu Trp Ala Ala Arg Asn Gln Thr Glu Ala
2090 2095 2100
Ser Tyr Lys Thr Ile Arg Tyr Ser Gly Lys Glu Arg Asp Ala Thr
2105 2110 2115
Gly Leu Tyr Tyr Tyr Gly Tyr Arg Tyr Tyr Gln Pro Trp Ile Gly
2120 2125 2130
Arg Trp Leu Ser Ser Asp Pro Ala Gly Thr Ile Asp Gly Leu Asn
2135 2140 2145
Leu Tyr Arg Met Val Arg Asn Asn Pro Val Thr Leu Leu Asp Pro
2150 2155 2160
Asp Gly Leu Met Pro Thr Ile Ala Glu Arg Ile Ala Ala Leu Lys
2165 2170 2175
Lys Asn Lys Val Thr Asp Ser Ala Pro Ser Pro Ala Asn Ala Thr
2180 2185 2190
Asn Val Ala Ile Asn Ile Arg Pro Pro Val Ala Pro Lys Pro Ser
2195 2200 2205
Leu Pro Lys Ala Ser Thr Ser Ser Gln Pro Thr Thr His Pro Ile
2210 2215 2220
Gly Ala Ala Asn Ile Lys Pro Thr Thr Ser Gly Ser Ser Ile Val
2225 2230 2235
Ala Pro Leu Ser Pro Val Gly Asn Lys Ser Thr Ser Glu Ile Ser
2240 2245 2250
Leu Pro Glu Ser Ala Gln Ser Ser Ser Ser Ser Thr Thr Ser Thr
2255 2260 2265
Asn Leu Gln Lys Lys Ser Phe Thr Leu Tyr Arg Ala Asp Asn Arg
2270 2275 2280
Ser Phe Glu Glu Met Gln Ser Lys Phe Pro Glu Gly Phe Lys Ala
2285 2290 2295
Trp Thr Pro Leu Asp Thr Lys Met Ala Arg Gln Phe Ala Ser Ile
2300 2305 2310
Phe Ile Gly Gln Lys Asp Thr Ser Asn Leu Pro Lys Glu Thr Val
2315 2320 2325
Lys Asn Ile Ser Thr Trp Gly Ala Lys Pro Lys Leu Lys Asp Leu
2330 2335 2340
Ser Asn Tyr Ile Lys Tyr Thr Lys Asp Lys Ser Thr Val Trp Val
2345 2350 2355
Ser Thr Ala Ile Asn Thr Glu Ala Gly Gly Gln Ser Ser Gly Ala
2360 2365 2370
Pro Leu His Lys Ile Asp Met Asp Leu Tyr Glu Phe Ala Ile Asp
2375 2380 2385
Gly Gln Lys Leu Asn Pro Leu Pro Glu Gly Arg Thr Lys Asn Met
2390 2395 2400
Val Pro Ser Leu Leu Leu Asp Thr Pro Gln Ile Glu Thr Ser Ser
2405 2410 2415
Ile Ile Ala Leu Asn His Gly Pro Val Asn Asp Ala Glu Ile Ser
2420 2425 2430
Phe Leu Thr Thr Ile Pro Leu Lys Asn Val Lys Pro His Lys Arg
2435 2440 2445
<210> 53
<211> 7481
<212> DNA
<213> 人工序列
<220>
<223> 编码TcdB2/TccC3融合蛋白pDAB 8921的核酸序列
<400> 53
tctagactga gtcgacgcac tactagtaac aaagaaggag atataccatg caaaattcac 60
aagattttag tattacggaa ctgtcactgc ccaaaggggg gggcgctatc acgggaatgg 120
gtgaagcatt aacccccact ggaccggatg gtatggccgc gctatctcta ccattgccta 180
tttctgccgg gcgcggttat gctcccgcat tcactctgaa ttacaacagc ggcgccggta 240
acagtccatt tggtctgggt tgggattgca acgttatgac tatccgccgc cgcacccatt 300
ttggcgtccc ccattatgac gaaaccgata cctttttggg gccagaaggc gaagtgctgg 360
tggtagcgga tcaacctcgc gacgaatcca cattacaggg tatcaattta ggcgccacct 420
ttaccgttac cggctaccgt tcccgtctgg aaagccattt cagccgattg gaatattggc 480
aacccaaaac aacaggtaaa acagattttt ggttgatata tagcccagat gggcaggtgc 540
atctactggg taaatcaccg caagcgcgga tcagcaaccc atcccaaacg acacaaacag 600
cacaatggct gctggaagcc tctgtatcat cacgtggcga acaaatttat tatcaatatc 660
gcgccgaaga tgacacaggt tgcgaagcag atgaaattac gcaccattta caggctacag 720
cgcaacgtta tttacacatc gtgtattacg gcaaccgtac agccagcgaa acattacccg 780
gtctggatgg cagcgcccca tcacaagcag actggttgtt ctatctggta tttgattacg 840
gcgaacgcag taacaacctg aaaacgccac cagcattttc gactacaggt agctggcttt 900
gccgtcagga ccgtttttcc cgttatgaat atggctttga gattcgtacc cgccgcttat 960
gccgtcaggt attgatgtac catcacctgc aagcactgga tagtaagata acagaacaca 1020
acggaccaac gctggtttca cgcctgatac tcaattacga cgaaagcgcg atagccagca 1080
cgctagtatt cgttcgccga gtgggacacg agcaagatgg taatgtcgtc accctgccgc 1140
cattagaatt ggcatatcag gatttttcac cgcgacatca cgctcactgg caaccaatgg 1200
atgtactggc aaacttcaat gccattcagc gctggcagct agtcgatcta aaaggcgaag 1260
gattacccgg cctgttatat caggataaag gcgcttggtg gtaccgctcc gcacagcgtc 1320
tgggcgaaat tggctcagat gccgtcactt gggaaaagat gcaaccttta tcggttattc 1380
cttctttgca aagtaatgcc tcgttggtgg atatcaatgg agacggccaa cttgactggg 1440
ttatcaccgg accgggatta cggggatatc atagtcaacg cccggatggc agttggacac 1500
gttttacccc actcaacgct ctgccggtgg aatacaccca tccacgcgcg caactcgcag 1560
atttaatggg agccgggcta tccgatttgg tgctgatcgg ccctaagagc gtgcgtttat 1620
atgccaatac ccgcgacggc tttgccaaag gaaaagatgt ggtgcaatcc ggtgatatca 1680
cactgccggt gccgggcgcc gatccacgta agttggtggc gtttagtgat gtattgggtt 1740
caggtcaagc ccatctggtt gaagtaagcg cgactaaagt cacctgctgg cctaatctgg 1800
ggcgcggacg ttttggtcaa cccattacct taccgggatt cagccagcca gcaaccgagt 1860
ttaacccggc tcaagtttat ctggccgatc tggatggcag cggtccaacg gatctgattt 1920
atgttcatac aaaccgtctg gatatcttcc tgaacaaaag tggcaatggc tttgctgaac 1980
cagtgacatt acgcttcccg gaaggtctgc gttttgatca tacctgtcag ttacaaatgg 2040
ccgatgtaca aggattaggc gtcgccagcc tgatactgag cgtgccgcat atgtctcccc 2100
atcactggcg ctgcgatctg accaacatga agccgtggtt actcaatgaa atgaacaaca 2160
atatgggggt ccatcacacc ttgcgttacc gcagttcctc ccaattctgg ctggatgaaa 2220
aagccgcggc gctgactacc ggacaaacac cggtttgcta tctccccttc ccgatccaca 2280
ccctatggca aacggaaaca gaagatgaaa tcagcggcaa caaattagtc acaacacttc 2340
gttatgctcg tggcgcatgg gacggacgcg agcgggaatt tcgcggattt ggttatgtag 2400
agcagacaga cagccatcaa ctggctcaag gcaacgcgcc agaacgtacg ccaccggcgc 2460
tgaccaaaaa ctggtatgcc accggactgc cggtgataga taacgcatta tcaaccgagt 2520
attggcgtga tgatcaggct tttgccggtt tctcaccgcg ctttacgact tggcaagata 2580
acaaagatgt cccgttaaca ccggaagatg ataacagtcg ttactggttc aaccgcgcgt 2640
tgaaaggtca actgctacgt agtgaactgt acggattgga cgatagtaca aataaacacg 2700
ttccctatac tgtcactgaa tttcgttcac aggtacgtcg attacagcat accgacagcc 2760
gataccctgt actttggtca tctgtagttg aaagccgcaa ctatcactac gaacgtatcg 2820
ccagcgaccc gcaatgcagt caaaatatta cgctatccag tgatcgattt ggtcagccgc 2880
taaaacagct ttcggtacag tacccgcgcc gccagcagcc agcaatcaat ctgtatcctg 2940
atacattgcc tgataagttg ttagccaaca gctatgatga ccaacaacgc caattacggc 3000
tcacctatca acaatccagt tggcatcacc tgaccaacaa taccgttcga gtattgggat 3060
taccggatag tacccgcagt gatatcttta cttatggcgc tgaaaatgtg cctgctggtg 3120
gtttaaatct ggaacttctg agtgataaaa atagcctgat cgcggacgat aaaccacgtg 3180
aatacctcgg tcagcaaaaa accgcttata ccgatggaca aaatacaacg ccgttgcaaa 3240
caccaacacg gcaagccctg attgccttta ccgaaacaac ggtattcaac cagtccacat 3300
tatcagcgtt taacggaagc atcccgtccg ataaattatc aacgacgctg gagcaagctg 3360
gatatcagca aacaaattat ctattccctc gcactggaga agataaagtt tgggtagccc 3420
atcacggcta taccgattat ggtacagcgg cacagttctg gcgcccgcaa aaacagagca 3480
acacccaact caccggtaaa atcaccctca tctgggatgc aaactattgc gttgtggtac 3540
aaacccggga tgctgctgga ctgacaacct cagccaaata tgactggcgt tttctgaccc 3600
cggtgcaact caccgatatc aatgacaatc agcaccttat cacactggat gcattgggcc 3660
gaccaatcac attgcgcttt tggggaactg aaaacggcaa gatgacaggt tattcctcac 3720
cggaaaaagc atcattttct ccaccatccg atgttaatgc cgctattgag ttaaaaaaac 3780
cgctccctgt agcacagtgt caggtctacg caccagaaag ctggatgcca gtattaagtc 3840
agaaaacctt caatcgactg gcagaacaag attggcaaaa gttatataac gcccgaatca 3900
tcaccgaaga tggacgtatc tgcacactgg cttatcgccg ctgggtacaa agccaaaagg 3960
caatccctca actcattagc ctgttaaaca acggaccccg tttacctcct cacagcctga 4020
cattgacgac ggatcgttat gatcacgatc ctgagcaaca gatccgtcaa caggtggtat 4080
tcagtgatgg ctttggccgc ttgctgcaag ccgctgcccg acatgaggca ggcatggccc 4140
ggcaacgcaa tgaagacggc tctttgatta taaatgtcca gcatactgag aaccgttggg 4200
cagtgactgg acgaacggaa tatgacaata aggggcaacc gatacgtacc tatcagccct 4260
atttcctcaa tgactggcga tacgtcagca atgatagtgc ccggcaggaa aaagaagctt 4320
atgcagatac ccatgtctat gatcccatag gtcgagaaat caaggttatc accgcaaaag 4380
gttggttccg tcgaaccttg ttcactccct ggtttactgt caatgaagat gaaaatgaca 4440
cagccgctga ggtgaagaag gtaaagatgc cacgtctgga ccgcgcagca gatatcacta 4500
cccaaaatgc tcacgacagc gcaattgtcg ctctgcgtca gaatattcct actccggcac 4560
ctctgtccct gcgcagcagg cctatgaaaa acatcgatcc caaactttat caaaaaaccc 4620
ctactgtcag cgtttacgat aaccgtggtc tgataatccg taacatcgat tttcatcgta 4680
ctaccgcaaa tggtgatccc gatacccgta ttacccgcca tcaatacgat attcacggac 4740
acctaaatca aagcatcgat ccgcgcctat atgaagccaa gcaaaccaac aatacgatca 4800
aacccaattt tctttggcag tatgatttga ccggtaatcc cctatgtaca gagagcattg 4860
atgcaggtcg cactgtcacc ttgaatgata ttgaaggccg tccgctacta acggtgactg 4920
caacaggggt tatacaaact cgacaatatg aaacttcttc cctgcccggt cgtctgttat 4980
ctgttgccga acaaacaccc gaggaaaaaa catcccgtat caccgaacgc ctgatttggg 5040
ctggcaatac cgaagcagag aaagaccata accttgccgg ccagtgcgtg cgtcactatg 5100
acacggcggg agttacccgg ttagagagtt tatcactgac cggtactgtt ttatctcaat 5160
ccagccaact attgatcgac actcaagagg caaactggac aggtgataac gaaaccgtct 5220
ggcaaaacat gctggctgat gacatctaca caaccctgag caccttcgat gccaccggtg 5280
ctttactgac tcagaccgat gcgaaaggga acattcagag actggcttat gatgtggccg 5340
ggcagctaaa cgggagctgg ctaacactca aaggccagac ggaacaagtg attatcaaat 5400
ccctgaccta ctccgccgcc ggacaaaaat tacgtgagga acacggcaat gatgttatca 5460
ccgaatacag ttatgaaccg gaaacccaac ggctgatcgg tatcaaaacc cgccgtccgt 5520
cagacactaa agtgctacaa gacctgcgct atgaatatga cccggtaggc aatgtcatca 5580
gcatccgtaa tgacgcggaa gccacccgct tttggcacaa tcagaaagtg atgccggaaa 5640
acacttatac ctacgattcc ctgtatcagc ttatcagcgc caccgggcgc gaaatggcga 5700
atataggtca acaaagtcac caatttccct cacccgctct accttctgat aacaacacct 5760
ataccaacta tacccgtact tatacttatg accgtggcgg caatctgacc aaaatccagc 5820
acagttcacc ggcgacgcaa aacaactaca ccaccaatat cacggtttca aatcgcagca 5880
accgcgcagt actcagcaca ttgaccgaag atccggcgca agtagatgct ttgtttgatg 5940
caggcggaca tcagaacacc ttgatatcag gacaaaacct gaactggaat actcgtggtg 6000
aactgcaaca agtaacactg gttaaacggg acaagggcgc caatgatgat cgggaatggt 6060
atcgttatag cggtgacgga agaaggatgt taaaaatcaa tgaacagcag gccagcaaca 6120
acgctcaaac acaacgtgtg acttatttgc cgaacttaga acttcgtcta acacaaaaca 6180
gcacggccac aaccgaagat ttgcaagtta tcaccgtagg cgaagcgggc cgggcacagg 6240
tacgagtatt acattgggag agcggtaaac cggaagatat cgacaataat cagttgcgtt 6300
atagttacga taatcttatc ggttccagtc aacttgaatt agatagcgaa ggacaaatta 6360
tcagtgaaga agaatattat ccctatggtg gaacagcatt atgggccgcc aggaatcaga 6420
cagaagccag ttataaaact atccgttatt caggcaaaga gcgggatgcc accgggctat 6480
attactacgg ctatcggtat taccaaccgt ggataggacg gtggttaagc tccgatccgg 6540
caggaacaat cgatgggctg aatttatatc ggatggtgag gaataatcca gttaccctcc 6600
ttgatcctga tggattaatg ccaacaattg cagaacgcat agcagcacta aaaaaaaata 6660
aagtaacaga ctcagcgcct tcgccagcaa atgccacaaa cgtagcgata aacatccgcc 6720
cgcctgtagc accaaaacct agcttaccga aagcatcaac gagtagccaa ccaaccacac 6780
accctatcgg agctgcaaac ataaaaccaa cgacgtctgg gtcatctatt gttgctccat 6840
tgagtccagt aggaaataaa tctacttctg aaatctctct gccagaaagc gctcaaagca 6900
gttcttcaag cactacctcg acaaatctac agaaaaaatc atttacttta tatagagcag 6960
ataacagatc ctttgaagaa atgcaaagta aattccctga aggatttaaa gcctggactc 7020
ctctagacac taagatggca aggcaatttg ctagtatctt tattggtcag aaagatacat 7080
ctaatttacc taaagaaaca gtcaagaaca taagcacatg gggagcaaag ccaaaactaa 7140
aagatctctc aaattacata aaatatacca aggacaaatc tacagtatgg gtttctactg 7200
caattaatac tgaagcaggt ggacaaagct caggggctcc actccataaa attgatatgg 7260
atctctacga gtttgccatt gatggacaaa aactaaatcc actaccggag ggtagaacta 7320
aaaacatggt accttccctt ttactcgaca ccccacaaat agagacatca tccatcattg 7380
cacttaatca tggaccggta aatgatgcag aaatttcatt tctgacaaca attccgctta 7440
aaaatgtaaa acctcataag agataattaa tctgactcga g 7481
<210> 54
<211> 2472
<212> PRT
<213> 人工序列
<220>
<223> TcdB2/TccC3融合蛋白pDAB8921
<400> 54
Met Gln Asn Ser Gln Asp Phe Ser Ile Thr Glu Leu Ser Leu Pro Lys
1 5 10 15
Gly Gly Gly Ala Ile Thr Gly Met Gly Glu Ala Leu Thr Pro Thr Gly
20 25 30
Pro Asp Gly Met Ala Ala Leu Ser Leu Pro Leu Pro Ile Ser Ala Gly
35 40 45
Arg Gly Tyr Ala Pro Ala Phe Thr Leu Asn Tyr Asn Ser Gly Ala Gly
50 55 60
Asn Ser Pro Phe Gly Leu Gly Trp Asp Cys Asn Val Met Thr Ile Arg
65 70 75 80
Arg Arg Thr His Phe Gly Val Pro His Tyr Asp Glu Thr Asp Thr Phe
85 90 95
Leu Gly Pro Glu Gly Glu Val Leu Val Val Ala Asp Gln Pro Arg Asp
100 105 110
Glu Ser Thr Leu Gln Gly Ile Asn Leu Gly Ala Thr Phe Thr Val Thr
115 120 125
Gly Tyr Arg Ser Arg Leu Glu Ser His Phe Ser Arg Leu Glu Tyr Trp
130 135 140
Gln Pro Lys Thr Thr Gly Lys Thr Asp Phe Trp Leu Ile Tyr Ser Pro
145 150 155 160
Asp Gly Gln Val His Leu Leu Gly Lys Ser Pro Gln Ala Arg Ile Ser
165 170 175
Asn Pro Ser Gln Thr Thr Gln Thr Ala Gln Trp Leu Leu Glu Ala Ser
180 185 190
Val Ser Ser Arg Gly Glu Gln Ile Tyr Tyr Gln Tyr Arg Ala Glu Asp
195 200 205
Asp Thr Gly Cys Glu Ala Asp Glu Ile Thr His His Leu Gln Ala Thr
210 215 220
Ala Gln Arg Tyr Leu His Ile Val Tyr Tyr Gly Asn Arg Thr Ala Ser
225 230 235 240
Glu Thr Leu Pro Gly Leu Asp Gly Ser Ala Pro Ser Gln Ala Asp Trp
245 250 255
Leu Phe Tyr Leu Val Phe Asp Tyr Gly Glu Arg Ser Asn Asn Leu Lys
260 265 270
Thr Pro Pro Ala Phe Ser Thr Thr Gly Ser Trp Leu Cys Arg Gln Asp
275 280 285
Arg Phe Ser Arg Tyr Glu Tyr Gly Phe Glu Ile Arg Thr Arg Arg Leu
290 295 300
Cys Arg Gln Val Leu Met Tyr His His Leu Gln Ala Leu Asp Ser Lys
305 310 315 320
Ile Thr Glu His Asn Gly Pro Thr Leu Val Ser Arg Leu Ile Leu Asn
325 330 335
Tyr Asp Glu Ser Ala Ile Ala Ser Thr Leu Val Phe Val Arg Arg Val
340 345 350
Gly His Glu Gln Asp Gly Asn Val Val Thr Leu Pro Pro Leu Glu Leu
355 360 365
Ala Tyr Gln Asp Phe Ser Pro Arg His His Ala His Trp Gln Pro Met
370 375 380
Asp Val Leu Ala Asn Phe Asn Ala Ile Gln Arg Trp Gln Leu Val Asp
385 390 395 400
Leu Lys Gly Glu Gly Leu Pro Gly Leu Leu Tyr Gln Asp Lys Gly Ala
405 410 415
Trp Trp Tyr Arg Ser Ala Gln Arg Leu Gly Glu Ile Gly Ser Asp Ala
420 425 430
Val Thr Trp Glu Lys Met Gln Pro Leu Ser Val Ile Pro Ser Leu Gln
435 440 445
Ser Asn Ala Ser Leu Val Asp Ile Asn Gly Asp Gly Gln Leu Asp Trp
450 455 460
Val Ile Thr Gly Pro Gly Leu Arg Gly Tyr His Ser Gln Arg Pro Asp
465 470 475 480
Gly Ser Trp Thr Arg Phe Thr Pro Leu Asn Ala Leu Pro Val Glu Tyr
485 490 495
Thr His Pro Arg Ala Gln Leu Ala Asp Leu Met Gly Ala Gly Leu Ser
500 505 510
Asp Leu Val Leu Ile Gly Pro Lys Ser Val Arg Leu Tyr Ala Asn Thr
515 520 525
Arg Asp Gly Phe Ala Lys Gly Lys Asp Val Val Gln Ser Gly Asp Ile
530 535 540
Thr Leu Pro Val Pro Gly Ala Asp Pro Arg Lys Leu Val Ala Phe Ser
545 550 555 560
Asp Val Leu Gly Ser Gly Gln Ala His Leu Val Glu Val Ser Ala Thr
565 570 575
Lys Val Thr Cys Trp Pro Asn Leu Gly Arg Gly Arg Phe Gly Gln Pro
580 585 590
Ile Thr Leu Pro Gly Phe Ser Gln Pro Ala Thr Glu Phe Asn Pro Ala
595 600 605
Gln Val Tyr Leu Ala Asp Leu Asp Gly Ser Gly Pro Thr Asp Leu Ile
610 615 620
Tyr Val His Thr Asn Arg Leu Asp Ile Phe Leu Asn Lys Ser Gly Asn
625 630 635 640
Gly Phe Ala Glu Pro Val Thr Leu Arg Phe Pro Glu Gly Leu Arg Phe
645 650 655
Asp His Thr Cys Gln Leu Gln Met Ala Asp Val Gln Gly Leu Gly Val
660 665 670
Ala Ser Leu Ile Leu Ser Val Pro His Met Ser Pro His His Trp Arg
675 680 685
Cys Asp Leu Thr Asn Met Lys Pro Trp Leu Leu Asn Glu Met Asn Asn
690 695 700
Asn Met Gly Val His His Thr Leu Arg Tyr Arg Ser Ser Ser Gln Phe
705 710 715 720
Trp Leu Asp Glu Lys Ala Ala Ala Leu Thr Thr Gly Gln Thr Pro Val
725 730 735
Cys Tyr Leu Pro Phe Pro Ile His Thr Leu Trp Gln Thr Glu Thr Glu
740 745 750
Asp Glu Ile Ser Gly Asn Lys Leu Val Thr Thr Leu Arg Tyr Ala Arg
755 760 765
Gly Ala Trp Asp Gly Arg Glu Arg Glu Phe Arg Gly Phe Gly Tyr Val
770 775 780
Glu Gln Thr Asp Ser His Gln Leu Ala Gln Gly Asn Ala Pro Glu Arg
785 790 795 800
Thr Pro Pro Ala Leu Thr Lys Asn Trp Tyr Ala Thr Gly Leu Pro Val
805 810 815
Ile Asp Asn Ala Leu Ser Thr Glu Tyr Trp Arg Asp Asp Gln Ala Phe
820 825 830
Ala Gly Phe Ser Pro Arg Phe Thr Thr Trp Gln Asp Asn Lys Asp Val
835 840 845
Pro Leu Thr Pro Glu Asp Asp Asn Ser Arg Tyr Trp Phe Asn Arg Ala
850 855 860
Leu Lys Gly Gln Leu Leu Arg Ser Glu Leu Tyr Gly Leu Asp Asp Ser
865 870 875 880
Thr Asn Lys His Val Pro Tyr Thr Val Thr Glu Phe Arg Ser Gln Val
885 890 895
Arg Arg Leu Gln His Thr Asp Ser Arg Tyr Pro Val Leu Trp Ser Ser
900 905 910
Val Val Glu Ser Arg Asn Tyr His Tyr Glu Arg Ile Ala Ser Asp Pro
915 920 925
Gln Cys Ser Gln Asn Ile Thr Leu Ser Ser Asp Arg Phe Gly Gln Pro
930 935 940
Leu Lys Gln Leu Ser Val Gln Tyr Pro Arg Arg Gln Gln Pro Ala Ile
945 950 955 960
Asn Leu Tyr Pro Asp Thr Leu Pro Asp Lys Leu Leu Ala Asn Ser Tyr
965 970 975
Asp Asp Gln Gln Arg Gln Leu Arg Leu Thr Tyr Gln Gln Ser Ser Trp
980 985 990
His His Leu Thr Asn Asn Thr Val Arg Val Leu Gly Leu Pro Asp Ser
995 1000 1005
Thr Arg Ser Asp Ile Phe Thr Tyr Gly Ala Glu Asn Val Pro Ala
1010 1015 1020
Gly Gly Leu Asn Leu Glu Leu Leu Ser Asp Lys Asn Ser Leu Ile
1025 1030 1035
Ala Asp Asp Lys Pro Arg Glu Tyr Leu Gly Gln Gln Lys Thr Ala
1040 1045 1050
Tyr Thr Asp Gly Gln Asn Thr Thr Pro Leu Gln Thr Pro Thr Arg
1055 1060 1065
Gln Ala Leu Ile Ala Phe Thr Glu Thr Thr Val Phe Asn Gln Ser
1070 1075 1080
Thr Leu Ser Ala Phe Asn Gly Ser Ile Pro Ser Asp Lys Leu Ser
1085 1090 1095
Thr Thr Leu Glu Gln Ala Gly Tyr Gln Gln Thr Asn Tyr Leu Phe
1100 1105 1110
Pro Arg Thr Gly Glu Asp Lys Val Trp Val Ala His His Gly Tyr
1115 1120 1125
Thr Asp Tyr Gly Thr Ala Ala Gln Phe Trp Arg Pro Gln Lys Gln
1130 1135 1140
Ser Asn Thr Gln Leu Thr Gly Lys Ile Thr Leu Ile Trp Asp Ala
1145 1150 1155
Asn Tyr Cys Val Val Val Gln Thr Arg Asp Ala Ala Gly Leu Thr
1160 1165 1170
Thr Ser Ala Lys Tyr Asp Trp Arg Phe Leu Thr Pro Val Gln Leu
1175 1180 1185
Thr Asp Ile Asn Asp Asn Gln His Leu Ile Thr Leu Asp Ala Leu
1190 1195 1200
Gly Arg Pro Ile Thr Leu Arg Phe Trp Gly Thr Glu Asn Gly Lys
1205 1210 1215
Met Thr Gly Tyr Ser Ser Pro Glu Lys Ala Ser Phe Ser Pro Pro
1220 1225 1230
Ser Asp Val Asn Ala Ala Ile Glu Leu Lys Lys Pro Leu Pro Val
1235 1240 1245
Ala Gln Cys Gln Val Tyr Ala Pro Glu Ser Trp Met Pro Val Leu
1250 1255 1260
Ser Gln Lys Thr Phe Asn Arg Leu Ala Glu Gln Asp Trp Gln Lys
1265 1270 1275
Leu Tyr Asn Ala Arg Ile Ile Thr Glu Asp Gly Arg Ile Cys Thr
1280 1285 1290
Leu Ala Tyr Arg Arg Trp Val Gln Ser Gln Lys Ala Ile Pro Gln
1295 1300 1305
Leu Ile Ser Leu Leu Asn Asn Gly Pro Arg Leu Pro Pro His Ser
1310 1315 1320
Leu Thr Leu Thr Thr Asp Arg Tyr Asp His Asp Pro Glu Gln Gln
1325 1330 1335
Ile Arg Gln Gln Val Val Phe Ser Asp Gly Phe Gly Arg Leu Leu
1340 1345 1350
Gln Ala Ala Ala Arg His Glu Ala Gly Met Ala Arg Gln Arg Asn
1355 1360 1365
Glu Asp Gly Ser Leu Ile Ile Asn Val Gln His Thr Glu Asn Arg
1370 1375 1380
Trp Ala Val Thr Gly Arg Thr Glu Tyr Asp Asn Lys Gly Gln Pro
1385 1390 1395
Ile Arg Thr Tyr Gln Pro Tyr Phe Leu Asn Asp Trp Arg Tyr Val
1400 1405 1410
Ser Asn Asp Ser Ala Arg Gln Glu Lys Glu Ala Tyr Ala Asp Thr
1415 1420 1425
His Val Tyr Asp Pro Ile Gly Arg Glu Ile Lys Val Ile Thr Ala
1430 1435 1440
Lys Gly Trp Phe Arg Arg Thr Leu Phe Thr Pro Trp Phe Thr Val
1445 1450 1455
Asn Glu Asp Glu Asn Asp Thr Ala Ala Glu Val Lys Lys Val Lys
1460 1465 1470
Met Pro Arg Leu Asp Arg Ala Ala Asp Ile Thr Thr Gln Asn Ala
1475 1480 1485
His Asp Ser Ala Ile Val Ala Leu Arg Gln Asn Ile Pro Thr Pro
1490 1495 1500
Ala Pro Leu Ser Leu Arg Ser Arg Pro Met Lys Asn Ile Asp Pro
1505 1510 1515
Lys Leu Tyr Gln Lys Thr Pro Thr Val Ser Val Tyr Asp Asn Arg
1520 1525 1530
Gly Leu Ile Ile Arg Asn Ile Asp Phe His Arg Thr Thr Ala Asn
1535 1540 1545
Gly Asp Pro Asp Thr Arg Ile Thr Arg His Gln Tyr Asp Ile His
1550 1555 1560
Gly His Leu Asn Gln Ser Ile Asp Pro Arg Leu Tyr Glu Ala Lys
1565 1570 1575
Gln Thr Asn Asn Thr Ile Lys Pro Asn Phe Leu Trp Gln Tyr Asp
1580 1585 1590
Leu Thr Gly Asn Pro Leu Cys Thr Glu Ser Ile Asp Ala Gly Arg
1595 1600 1605
Thr Val Thr Leu Asn Asp Ile Glu Gly Arg Pro Leu Leu Thr Val
1610 1615 1620
Thr Ala Thr Gly Val Ile Gln Thr Arg Gln Tyr Glu Thr Ser Ser
1625 1630 1635
Leu Pro Gly Arg Leu Leu Ser Val Ala Glu Gln Thr Pro Glu Glu
1640 1645 1650
Lys Thr Ser Arg Ile Thr Glu Arg Leu Ile Trp Ala Gly Asn Thr
1655 1660 1665
Glu Ala Glu Lys Asp His Asn Leu Ala Gly Gln Cys Val Arg His
1670 1675 1680
Tyr Asp Thr Ala Gly Val Thr Arg Leu Glu Ser Leu Ser Leu Thr
1685 1690 1695
Gly Thr Val Leu Ser Gln Ser Ser Gln Leu Leu Ile Asp Thr Gln
1700 1705 1710
Glu Ala Asn Trp Thr Gly Asp Asn Glu Thr Val Trp Gln Asn Met
1715 1720 1725
Leu Ala Asp Asp Ile Tyr Thr Thr Leu Ser Thr Phe Asp Ala Thr
1730 1735 1740
Gly Ala Leu Leu Thr Gln Thr Asp Ala Lys Gly Asn Ile Gln Arg
1745 1750 1755
Leu Ala Tyr Asp Val Ala Gly Gln Leu Asn Gly Ser Trp Leu Thr
1760 1765 1770
Leu Lys Gly Gln Thr Glu Gln Val Ile Ile Lys Ser Leu Thr Tyr
1775 1780 1785
Ser Ala Ala Gly Gln Lys Leu Arg Glu Glu His Gly Asn Asp Val
1790 1795 1800
Ile Thr Glu Tyr Ser Tyr Glu Pro Glu Thr Gln Arg Leu Ile Gly
1805 1810 1815
Ile Lys Thr Arg Arg Pro Ser Asp Thr Lys Val Leu Gln Asp Leu
1820 1825 1830
Arg Tyr Glu Tyr Asp Pro Val Gly Asn Val Ile Ser Ile Arg Asn
1835 1840 1845
Asp Ala Glu Ala Thr Arg Phe Trp His Asn Gln Lys Val Met Pro
1850 1855 1860
Glu Asn Thr Tyr Thr Tyr Asp Ser Leu Tyr Gln Leu Ile Ser Ala
1865 1870 1875
Thr Gly Arg Glu Met Ala Asn Ile Gly Gln Gln Ser His Gln Phe
1880 1885 1890
Pro Ser Pro Ala Leu Pro Ser Asp Asn Asn Thr Tyr Thr Asn Tyr
1895 1900 1905
Thr Arg Thr Tyr Thr Tyr Asp Arg Gly Gly Asn Leu Thr Lys Ile
1910 1915 1920
Gln His Ser Ser Pro Ala Thr Gln Asn Asn Tyr Thr Thr Asn Ile
1925 1930 1935
Thr Val Ser Asn Arg Ser Asn Arg Ala Val Leu Ser Thr Leu Thr
1940 1945 1950
Glu Asp Pro Ala Gln Val Asp Ala Leu Phe Asp Ala Gly Gly His
1955 1960 1965
Gln Asn Thr Leu Ile Ser Gly Gln Asn Leu Asn Trp Asn Thr Arg
1970 1975 1980
Gly Glu Leu Gln Gln Val Thr Leu Val Lys Arg Asp Lys Gly Ala
1985 1990 1995
Asn Asp Asp Arg Glu Trp Tyr Arg Tyr Ser Gly Asp Gly Arg Arg
2000 2005 2010
Met Leu Lys Ile Asn Glu Gln Gln Ala Ser Asn Asn Ala Gln Thr
2015 2020 2025
Gln Arg Val Thr Tyr Leu Pro Asn Leu Glu Leu Arg Leu Thr Gln
2030 2035 2040
Asn Ser Thr Ala Thr Thr Glu Asp Leu Gln Val Ile Thr Val Gly
2045 2050 2055
Glu Ala Gly Arg Ala Gln Val Arg Val Leu His Trp Glu Ser Gly
2060 2065 2070
Lys Pro Glu Asp Ile Asp Asn Asn Gln Leu Arg Tyr Ser Tyr Asp
2075 2080 2085
Asn Leu Ile Gly Ser Ser Gln Leu Glu Leu Asp Ser Glu Gly Gln
2090 2095 2100
Ile Ile Ser Glu Glu Glu Tyr Tyr Pro Tyr Gly Gly Thr Ala Leu
2105 2110 2115
Trp Ala Ala Arg Asn Gln Thr Glu Ala Ser Tyr Lys Thr Ile Arg
2120 2125 2130
Tyr Ser Gly Lys Glu Arg Asp Ala Thr Gly Leu Tyr Tyr Tyr Gly
2135 2140 2145
Tyr Arg Tyr Tyr Gln Pro Trp Ile Gly Arg Trp Leu Ser Ser Asp
2150 2155 2160
Pro Ala Gly Thr Ile Asp Gly Leu Asn Leu Tyr Arg Met Val Arg
2165 2170 2175
Asn Asn Pro Val Thr Leu Leu Asp Pro Asp Gly Leu Met Pro Thr
2180 2185 2190
Ile Ala Glu Arg Ile Ala Ala Leu Lys Lys Asn Lys Val Thr Asp
2195 2200 2205
Ser Ala Pro Ser Pro Ala Asn Ala Thr Asn Val Ala Ile Asn Ile
2210 2215 2220
Arg Pro Pro Val Ala Pro Lys Pro Ser Leu Pro Lys Ala Ser Thr
2225 2230 2235
Ser Ser Gln Pro Thr Thr His Pro Ile Gly Ala Ala Asn Ile Lys
2240 2245 2250
Pro Thr Thr Ser Gly Ser Ser Ile Val Ala Pro Leu Ser Pro Val
2255 2260 2265
Gly Asn Lys Ser Thr Ser Glu Ile Ser Leu Pro Glu Ser Ala Gln
2270 2275 2280
Ser Ser Ser Ser Ser Thr Thr Ser Thr Asn Leu Gln Lys Lys Ser
2285 2290 2295
Phe Thr Leu Tyr Arg Ala Asp Asn Arg Ser Phe Glu Glu Met Gln
2300 2305 2310
Ser Lys Phe Pro Glu Gly Phe Lys Ala Trp Thr Pro Leu Asp Thr
2315 2320 2325
Lys Met Ala Arg Gln Phe Ala Ser Ile Phe Ile Gly Gln Lys Asp
2330 2335 2340
Thr Ser Asn Leu Pro Lys Glu Thr Val Lys Asn Ile Ser Thr Trp
2345 2350 2355
Gly Ala Lys Pro Lys Leu Lys Asp Leu Ser Asn Tyr Ile Lys Tyr
2360 2365 2370
Thr Lys Asp Lys Ser Thr Val Trp Val Ser Thr Ala Ile Asn Thr
2375 2380 2385
Glu Ala Gly Gly Gln Ser Ser Gly Ala Pro Leu His Lys Ile Asp
2390 2395 2400
Met Asp Leu Tyr Glu Phe Ala Ile Asp Gly Gln Lys Leu Asn Pro
2405 2410 2415
Leu Pro Glu Gly Arg Thr Lys Asn Met Val Pro Ser Leu Leu Leu
2420 2425 2430
Asp Thr Pro Gln Ile Glu Thr Ser Ser Ile Ile Ala Leu Asn His
2435 2440 2445
Gly Pro Val Asn Asp Ala Glu Ile Ser Phe Leu Thr Thr Ile Pro
2450 2455 2460
Leu Lys Asn Val Lys Pro His Lys Arg
2465 2470
<210> 55
<211> 7646
<212> DNA
<213> 人工序列
<220>
<223> 编码TcdB2/TccC3融合蛋白pDAB 8923的核酸序列
<400> 55
tctagactga gtcgacgcac tactagtaac aaagaaggag atataccatg caaaattcac 60
aagattttag tattacggaa ctgtcactgc ccaaaggggg gggcgctatc acgggaatgg 120
gtgaagcatt aacccccact ggaccggatg gtatggccgc gctatctcta ccattgccta 180
tttctgccgg gcgcggttat gctcccgcat tcactctgaa ttacaacagc ggcgccggta 240
acagtccatt tggtctgggt tgggattgca acgttatgac tatccgccgc cgcacccatt 300
ttggcgtccc ccattatgac gaaaccgata cctttttggg gccagaaggc gaagtgctgg 360
tggtagcgga tcaacctcgc gacgaatcca cattacaggg tatcaattta ggcgccacct 420
ttaccgttac cggctaccgt tcccgtctgg aaagccattt cagccgattg gaatattggc 480
aacccaaaac aacaggtaaa acagattttt ggttgatata tagcccagat gggcaggtgc 540
atctactggg taaatcaccg caagcgcgga tcagcaaccc atcccaaacg acacaaacag 600
cacaatggct gctggaagcc tctgtatcat cacgtggcga acaaatttat tatcaatatc 660
gcgccgaaga tgacacaggt tgcgaagcag atgaaattac gcaccattta caggctacag 720
cgcaacgtta tttacacatc gtgtattacg gcaaccgtac agccagcgaa acattacccg 780
gtctggatgg cagcgcccca tcacaagcag actggttgtt ctatctggta tttgattacg 840
gcgaacgcag taacaacctg aaaacgccac cagcattttc gactacaggt agctggcttt 900
gccgtcagga ccgtttttcc cgttatgaat atggctttga gattcgtacc cgccgcttat 960
gccgtcaggt attgatgtac catcacctgc aagcactgga tagtaagata acagaacaca 1020
acggaccaac gctggtttca cgcctgatac tcaattacga cgaaagcgcg atagccagca 1080
cgctagtatt cgttcgccga gtgggacacg agcaagatgg taatgtcgtc accctgccgc 1140
cattagaatt ggcatatcag gatttttcac cgcgacatca cgctcactgg caaccaatgg 1200
atgtactggc aaacttcaat gccattcagc gctggcagct agtcgatcta aaaggcgaag 1260
gattacccgg cctgttatat caggataaag gcgcttggtg gtaccgctcc gcacagcgtc 1320
tgggcgaaat tggctcagat gccgtcactt gggaaaagat gcaaccttta tcggttattc 1380
cttctttgca aagtaatgcc tcgttggtgg atatcaatgg agacggccaa cttgactggg 1440
ttatcaccgg accgggatta cggggatatc atagtcaacg cccggatggc agttggacac 1500
gttttacccc actcaacgct ctgccggtgg aatacaccca tccacgcgcg caactcgcag 1560
atttaatggg agccgggcta tccgatttgg tgctgatcgg ccctaagagc gtgcgtttat 1620
atgccaatac ccgcgacggc tttgccaaag gaaaagatgt ggtgcaatcc ggtgatatca 1680
cactgccggt gccgggcgcc gatccacgta agttggtggc gtttagtgat gtattgggtt 1740
caggtcaagc ccatctggtt gaagtaagcg cgactaaagt cacctgctgg cctaatctgg 1800
ggcgcggacg ttttggtcaa cccattacct taccgggatt cagccagcca gcaaccgagt 1860
ttaacccggc tcaagtttat ctggccgatc tggatggcag cggtccaacg gatctgattt 1920
atgttcatac aaaccgtctg gatatcttcc tgaacaaaag tggcaatggc tttgctgaac 1980
cagtgacatt acgcttcccg gaaggtctgc gttttgatca tacctgtcag ttacaaatgg 2040
ccgatgtaca aggattaggc gtcgccagcc tgatactgag cgtgccgcat atgtctcccc 2100
atcactggcg ctgcgatctg accaacatga agccgtggtt actcaatgaa atgaacaaca 2160
atatgggggt ccatcacacc ttgcgttacc gcagttcctc ccaattctgg ctggatgaaa 2220
aagccgcggc gctgactacc ggacaaacac cggtttgcta tctccccttc ccgatccaca 2280
ccctatggca aacggaaaca gaagatgaaa tcagcggcaa caaattagtc acaacacttc 2340
gttatgctcg tggcgcatgg gacggacgcg agcgggaatt tcgcggattt ggttatgtag 2400
agcagacaga cagccatcaa ctggctcaag gcaacgcgcc agaacgtacg ccaccggcgc 2460
tgaccaaaaa ctggtatgcc accggactgc cggtgataga taacgcatta tcaaccgagt 2520
attggcgtga tgatcaggct tttgccggtt tctcaccgcg ctttacgact tggcaagata 2580
acaaagatgt cccgttaaca ccggaagatg ataacagtcg ttactggttc aaccgcgcgt 2640
tgaaaggtca actgctacgt agtgaactgt acggattgga cgatagtaca aataaacacg 2700
ttccctatac tgtcactgaa tttcgttcac aggtacgtcg attacagcat accgacagcc 2760
gataccctgt actttggtca tctgtagttg aaagccgcaa ctatcactac gaacgtatcg 2820
ccagcgaccc gcaatgcagt caaaatatta cgctatccag tgatcgattt ggtcagccgc 2880
taaaacagct ttcggtacag tacccgcgcc gccagcagcc agcaatcaat ctgtatcctg 2940
atacattgcc tgataagttg ttagccaaca gctatgatga ccaacaacgc caattacggc 3000
tcacctatca acaatccagt tggcatcacc tgaccaacaa taccgttcga gtattgggat 3060
taccggatag tacccgcagt gatatcttta cttatggcgc tgaaaatgtg cctgctggtg 3120
gtttaaatct ggaacttctg agtgataaaa atagcctgat cgcggacgat aaaccacgtg 3180
aatacctcgg tcagcaaaaa accgcttata ccgatggaca aaatacaacg ccgttgcaaa 3240
caccaacacg gcaagccctg attgccttta ccgaaacaac ggtattcaac cagtccacat 3300
tatcagcgtt taacggaagc atcccgtccg ataaattatc aacgacgctg gagcaagctg 3360
gatatcagca aacaaattat ctattccctc gcactggaga agataaagtt tgggtagccc 3420
atcacggcta taccgattat ggtacagcgg cacagttctg gcgcccgcaa aaacagagca 3480
acacccaact caccggtaaa atcaccctca tctgggatgc aaactattgc gttgtggtac 3540
aaacccggga tgctgctgga ctgacaacct cagccaaata tgactggcgt tttctgaccc 3600
cggtgcaact caccgatatc aatgacaatc agcaccttat cacactggat gcattgggcc 3660
gaccaatcac attgcgcttt tggggaactg aaaacggcaa gatgacaggt tattcctcac 3720
cggaaaaagc atcattttct ccaccatccg atgttaatgc cgctattgag ttaaaaaaac 3780
cgctccctgt agcacagtgt caggtctacg caccagaaag ctggatgcca gtattaagtc 3840
agaaaacctt caatcgactg gcagaacaag attggcaaaa gttatataac gcccgaatca 3900
tcaccgaaga tggacgtatc tgcacactgg cttatcgccg ctgggtacaa agccaaaagg 3960
caatccctca actcattagc ctgttaaaca acggaccccg tttacctcct cacagcctga 4020
cattgacgac ggatcgttat gatcacgatc ctgagcaaca gatccgtcaa caggtggtat 4080
tcagtgatgg ctttggccgc ttgctgcaag ccgctgcccg acatgaggca ggcatggccc 4140
ggcaacgcaa tgaagacggc tctttgatta taaatgtcca gcatactgag aaccgttggg 4200
cagtgactgg acgaacggaa tatgacaata aggggcaacc gatacgtacc tatcagccct 4260
atttcctcaa tgactggcga tacgtcagca atgatagtgc ccggcaggaa aaagaagctt 4320
atgcagatac ccatgtctat gatcccatag gtcgagaaat caaggttatc accgcaaaag 4380
gttggttccg tcgaaccttg ttcactccct ggtttactgt caatgaagat gaaaatgaca 4440
cagccgctga ggtgaagaag gtaaagatgc cgggatccga agcttatgca gatacccatg 4500
tctatgatcc cataggtcga gaaatcaagg ttatcaccgc aaaaggttgg ttccgtcgaa 4560
ccttgttcac tccctggttt actgtcaatg aagatgaaaa tgacacagcc gctgaggtga 4620
agaaggtaaa gatgccacgt ctggaccgcg cagcagatat cactacccaa aatgctcacg 4680
acagcgcaat tgtcgctctg cgtcagaata ttcctactcc ggcacctctg tccctgcgca 4740
gcaggcctat gaaaaacatc gatcccaaac tttatcaaaa aacccctact gtcagcgttt 4800
acgataaccg tggtctgata atccgtaaca tcgattttca tcgtactacc gcaaatggtg 4860
atcccgatac ccgtattacc cgccatcaat acgatattca cggacaccta aatcaaagca 4920
tcgatccgcg cctatatgaa gccaagcaaa ccaacaatac gatcaaaccc aattttcttt 4980
ggcagtatga tttgaccggt aatcccctat gtacagagag cattgatgca ggtcgcactg 5040
tcaccttgaa tgatattgaa ggccgtccgc tactaacggt gactgcaaca ggggttatac 5100
aaactcgaca atatgaaact tcttccctgc ccggtcgtct gttatctgtt gccgaacaaa 5160
cacccgagga aaaaacatcc cgtatcaccg aacgcctgat ttgggctggc aataccgaag 5220
cagagaaaga ccataacctt gccggccagt gcgtgcgtca ctatgacacg gcgggagtta 5280
cccggttaga gagtttatca ctgaccggta ctgttttatc tcaatccagc caactattga 5340
tcgacactca agaggcaaac tggacaggtg ataacgaaac cgtctggcaa aacatgctgg 5400
ctgatgacat ctacacaacc ctgagcacct tcgatgccac cggtgcttta ctgactcaga 5460
ccgatgcgaa agggaacatt cagagactgg cttatgatgt ggccgggcag ctaaacggga 5520
gctggctaac actcaaaggc cagacggaac aagtgattat caaatccctg acctactccg 5580
ccgccggaca aaaattacgt gaggaacacg gcaatgatgt tatcaccgaa tacagttatg 5640
aaccggaaac ccaacggctg atcggtatca aaacccgccg tccgtcagac actaaagtgc 5700
tacaagacct gcgctatgaa tatgacccgg taggcaatgt catcagcatc cgtaatgacg 5760
cggaagccac ccgcttttgg cacaatcaga aagtgatgcc ggaaaacact tatacctacg 5820
attccctgta tcagcttatc agcgccaccg ggcgcgaaat ggcgaatata ggtcaacaaa 5880
gtcaccaatt tccctcaccc gctctacctt ctgataacaa cacctatacc aactataccc 5940
gtacttatac ttatgaccgt ggcggcaatc tgaccaaaat ccagcacagt tcaccggcga 6000
cgcaaaacaa ctacaccacc aatatcacgg tttcaaatcg cagcaaccgc gcagtactca 6060
gcacattgac cgaagatccg gcgcaagtag atgctttgtt tgatgcaggc ggacatcaga 6120
acaccttgat atcaggacaa aacctgaact ggaatactcg tggtgaactg caacaagtaa 6180
cactggttaa acgggacaag ggcgccaatg atgatcggga atggtatcgt tatagcggtg 6240
acggaagaag gatgttaaaa atcaatgaac agcaggccag caacaacgct caaacacaac 6300
gtgtgactta tttgccgaac ttagaacttc gtctaacaca aaacagcacg gccacaaccg 6360
aagatttgca agttatcacc gtaggcgaag cgggccgggc acaggtacga gtattacatt 6420
gggagagcgg taaaccggaa gatatcgaca ataatcagtt gcgttatagt tacgataatc 6480
ttatcggttc cagtcaactt gaattagata gcgaaggaca aattatcagt gaagaagaat 6540
attatcccta tggtggaaca gcattatggg ccgccaggaa tcagacagaa gccagttata 6600
aaactatccg ttattcaggc aaagagcggg atgccaccgg gctatattac tacggctatc 6660
ggtattacca accgtggata ggacggtggt taagctccga tccggcagga acaatcgatg 6720
ggctgaattt atatcggatg gtgaggaata atccagttac cctccttgat cctgatggat 6780
taatgccaac aattgcagaa cgcatagcag cactaaaaaa aaataaagta acagactcag 6840
cgccttcgcc agcaaatgcc acaaacgtag cgataaacat ccgcccgcct gtagcaccaa 6900
aacctagctt accgaaagca tcaacgagta gccaaccaac cacacaccct atcggagctg 6960
caaacataaa accaacgacg tctgggtcat ctattgttgc tccattgagt ccagtaggaa 7020
ataaatctac ttctgaaatc tctctgccag aaagcgctca aagcagttct tcaagcacta 7080
cctcgacaaa tctacagaaa aaatcattta ctttatatag agcagataac agatcctttg 7140
aagaaatgca aagtaaattc cctgaaggat ttaaagcctg gactcctcta gacactaaga 7200
tggcaaggca atttgctagt atctttattg gtcagaaaga tacatctaat ttacctaaag 7260
aaacagtcaa gaacataagc acatggggag caaagccaaa actaaaagat ctctcaaatt 7320
acataaaata taccaaggac aaatctacag tatgggtttc tactgcaatt aatactgaag 7380
caggtggaca aagctcaggg gctccactcc ataaaattga tatggatctc tacgagtttg 7440
ccattgatgg acaaaaacta aatccactac cggagggtag aactaaaaac atggtacctt 7500
cccttttact cgacacccca caaatagaga catcatccat cattgcactt aatcatggac 7560
cggtaaatga tgcagaaatt tcatttctga caacaattcc gcttaaaaat gtaaaacctc 7620
ataagagata attaatctga ctcgag 7646
<210> 56
<211> 2527
<212> PRT
<213> 人工序列
<220>
<223> TcdB2/TccC3融合蛋白pDAB8923
<400> 56
Met Gln Asn Ser Gln Asp Phe Ser Ile Thr Glu Leu Ser Leu Pro Lys
1 5 10 15
Gly Gly Gly Ala Ile Thr Gly Met Gly Glu Ala Leu Thr Pro Thr Gly
20 25 30
Pro Asp Gly Met Ala Ala Leu Ser Leu Pro Leu Pro Ile Ser Ala Gly
35 40 45
Arg Gly Tyr Ala Pro Ala Phe Thr Leu Asn Tyr Asn Ser Gly Ala Gly
50 55 60
Asn Ser Pro Phe Gly Leu Gly Trp Asp Cys Asn Val Met Thr Ile Arg
65 70 75 80
Arg Arg Thr His Phe Gly Val Pro His Tyr Asp Glu Thr Asp Thr Phe
85 90 95
Leu Gly Pro Glu Gly Glu Val Leu Val Val Ala Asp Gln Pro Arg Asp
100 105 110
Glu Ser Thr Leu Gln Gly Ile Asn Leu Gly Ala Thr Phe Thr Val Thr
115 120 125
Gly Tyr Arg Ser Arg Leu Glu Ser His Phe Ser Arg Leu Glu Tyr Trp
130 135 140
Gln Pro Lys Thr Thr Gly Lys Thr Asp Phe Trp Leu Ile Tyr Ser Pro
145 150 155 160
Asp Gly Gln Val His Leu Leu Gly Lys Ser Pro Gln Ala Arg Ile Ser
165 170 175
Asn Pro Ser Gln Thr Thr Gln Thr Ala Gln Trp Leu Leu Glu Ala Ser
180 185 190
Val Ser Ser Arg Gly Glu Gln Ile Tyr Tyr Gln Tyr Arg Ala Glu Asp
195 200 205
Asp Thr Gly Cys Glu Ala Asp Glu Ile Thr His His Leu Gln Ala Thr
210 215 220
Ala Gln Arg Tyr Leu His Ile Val Tyr Tyr Gly Asn Arg Thr Ala Ser
225 230 235 240
Glu Thr Leu Pro Gly Leu Asp Gly Ser Ala Pro Ser Gln Ala Asp Trp
245 250 255
Leu Phe Tyr Leu Val Phe Asp Tyr Gly Glu Arg Ser Asn Asn Leu Lys
260 265 270
Thr Pro Pro Ala Phe Ser Thr Thr Gly Ser Trp Leu Cys Arg Gln Asp
275 280 285
Arg Phe Ser Arg Tyr Glu Tyr Gly Phe Glu Ile Arg Thr Arg Arg Leu
290 295 300
Cys Arg Gln Val Leu Met Tyr His His Leu Gln Ala Leu Asp Ser Lys
305 310 315 320
Ile Thr Glu His Asn Gly Pro Thr Leu Val Ser Arg Leu Ile Leu Asn
325 330 335
Tyr Asp Glu Ser Ala Ile Ala Ser Thr Leu Val Phe Val Arg Arg Val
340 345 350
Gly His Glu Gln Asp Gly Asn Val Val Thr Leu Pro Pro Leu Glu Leu
355 360 365
Ala Tyr Gln Asp Phe Ser Pro Arg His His Ala His Trp Gln Pro Met
370 375 380
Asp Val Leu Ala Asn Phe Asn Ala Ile Gln Arg Trp Gln Leu Val Asp
385 390 395 400
Leu Lys Gly Glu Gly Leu Pro Gly Leu Leu Tyr Gln Asp Lys Gly Ala
405 410 415
Trp Trp Tyr Arg Ser Ala Gln Arg Leu Gly Glu Ile Gly Ser Asp Ala
420 425 430
Val Thr Trp Glu Lys Met Gln Pro Leu Ser Val Ile Pro Ser Leu Gln
435 440 445
Ser Asn Ala Ser Leu Val Asp Ile Asn Gly Asp Gly Gln Leu Asp Trp
450 455 460
Val Ile Thr Gly Pro Gly Leu Arg Gly Tyr His Ser Gln Arg Pro Asp
465 470 475 480
Gly Ser Trp Thr Arg Phe Thr Pro Leu Asn Ala Leu Pro Val Glu Tyr
485 490 495
Thr His Pro Arg Ala Gln Leu Ala Asp Leu Met Gly Ala Gly Leu Ser
500 505 5l0
Asp Leu Val Leu Ile Gly Pro Lys Ser Val Arg Leu Tyr Ala Asn Thr
515 520 525
Arg Asp Gly Phe Ala Lys Gly Lys Asp Val Val Gln Ser Gly Asp Ile
530 535 540
Thr Leu Pro Val Pro Gly Ala Asp Pro Arg Lys Leu Val Ala Phe Ser
545 550 555 560
Asp Val Leu Gly Ser Gly Gln Ala His Leu Val Glu Val Ser Ala Thr
565 570 575
Lys Val Thr Cys Trp Pro Asn Leu Gly Arg Gly Arg Phe Gly Gln Pro
580 585 590
Ile Thr Leu Pro Gly Phe Ser Gln Pro Ala Thr Glu Phe Asn Pro Ala
595 600 605
Gln Val Tyr Leu Ala Asp Leu Asp Gly Ser Gly Pro Thr Asp Leu Ile
610 615 620
Tyr Val His Thr Asn Arg Leu Asp Ile Phe Leu Asn Lys Ser Gly Asn
625 630 635 640
Gly Phe Ala Glu Pro Val Thr Leu Arg Phe Pro Glu Gly Leu Arg Phe
645 650 655
Asp His Thr Cys Gln Leu Gln Met Ala Asp Val Gln Gly Leu Gly Val
660 665 670
Ala Ser Leu Ile Leu Ser Val Pro His Met Ser Pro His His Trp Arg
675 680 685
Cys Asp Leu Thr Asn Met Lys Pro Trp Leu Leu Asn Glu Met Asn Asn
690 695 700
Asn Met Gly Val His His Thr Leu Arg Tyr Arg Ser Ser Ser Gln Phe
705 710 715 720
Trp Leu Asp Glu Lys Ala Ala Ala Leu Thr Thr Gly Gln Thr Pro Val
725 730 735
Cys Tyr Leu Pro Phe Pro Ile His Thr Leu Trp Gln Thr Glu Thr Glu
740 745 750
Asp Glu Ile Ser Gly Asn Lys Leu Val Thr Thr Leu Arg Tyr Ala Arg
755 760 765
Gly Ala Trp Asp Gly Arg Glu Arg Glu Phe Arg Gly Phe Gly Tyr Val
770 775 780
Glu Gln Thr Asp Ser His Gln Leu Ala Gln Gly Asn Ala Pro Glu Arg
785 790 795 800
Thr Pro Pro Ala Leu Thr Lys Asn Trp Tyr Ala Thr Gly Leu Pro Val
805 810 815
Ile Asp Asn Ala Leu Ser Thr Glu Tyr Trp Arg Asp Asp Gln Ala Phe
820 825 830
Ala Gly Phe Ser Pro Arg Phe Thr Thr Trp Gln Asp Asn Lys Asp Val
835 840 845
Pro Leu Thr Pro Glu Asp Asp Asn Ser Arg Tyr Trp Phe Asn Arg Ala
850 855 860
Leu Lys Gly Gln Leu Leu Arg Ser Glu Leu Tyr Gly Leu Asp Asp Ser
865 870 875 880
Thr Asn Lys His Val Pro Tyr Thr Val Thr Glu Phe Arg Ser Gln Val
885 890 895
Arg Arg Leu Gln His Thr Asp Ser Arg Tyr Pro Val Leu Trp Ser Ser
900 905 910
Val Val Glu Ser Arg Asn Tyr His Tyr Glu Arg Ile Ala Ser Asp Pro
915 920 925
Gln Cys Ser Gln Asn Ile Thr Leu Ser Ser Asp Arg Phe Gly Gln Pro
930 935 940
Leu Lys Gln Leu Ser Val Gln Tyr Pro Arg Arg Gln Gln Pro Ala Ile
945 950 955 960
Asn Leu Tyr Pro Asp Thr Leu Pro Asp Lys Leu Leu Ala Asn Ser Tyr
965 970 975
Asp Asp Gln Gln Arg Gln Leu Arg Leu Thr Tyr Gln Gln Ser Ser Trp
980 985 990
His His Leu Thr Asn Asn Thr Val Arg Val Leu Gly Leu Pro Asp Ser
995 1000 1005
Thr Arg Ser Asp Ile Phe Thr Tyr Gly Ala Glu Asn Val Pro Ala
1010 1015 1020
Gly Gly Leu Asn Leu Glu Leu Leu Ser Asp Lys Asn Ser Leu Ile
1025 1030 1035
Ala Asp Asp Lys Pro Arg Glu Tyr Leu Gly Gln Gln Lys Thr Ala
1040 1045 1050
Tyr Thr Asp Gly Gln Asn Thr Thr Pro Leu Gln Thr Pro Thr Arg
1055 1060 1065
Gln Ala Leu Ile Ala Phe Thr Glu Thr Thr Val Phe Asn Gln Ser
1070 1075 1080
Thr Leu Ser Ala Phe Asn Gly Ser Ile Pro Ser Asp Lys Leu Ser
1085 1090 1095
Thr Thr Leu Glu Gln Ala Gly Tyr Gln Gln Thr Asn Tyr Leu Phe
1100 1105 1110
Pro Arg Thr Gly Glu Asp Lys Val Trp Val Ala His His Gly Tyr
1115 1120 1125
Thr Asp Tyr Gly Thr Ala Ala Gln Phe Trp Arg Pro Gln Lys Gln
1130 1135 1140
Ser Asn Thr Gln Leu Thr Gly Lys Ile Thr Leu Ile Trp Asp Ala
1145 1150 1155
Asn Tyr Cys Val Val Val Gln Thr Arg Asp Ala Ala Gly Leu Thr
1160 1165 1170
Thr Ser Ala Lys Tyr Asp Trp Arg Phe Leu Thr Pro Val Gln Leu
1175 1180 1185
Thr Asp Ile Asn Asp Asn Gln His Leu Ile Thr Leu Asp Ala Leu
1190 1195 1200
Gly Arg Pro Ile Thr Leu Arg Phe Trp Gly Thr Glu Asn Gly Lys
1205 1210 1215
Met Thr Gly Tyr Ser Ser Pro Glu Lys Ala Ser Phe Ser Pro Pro
1220 1225 1230
Ser Asp Val Asn Ala Ala Ile Glu Leu Lys Lys Pro Leu Pro Val
1235 1240 1245
Ala Gln Cys Gln Val Tyr Ala Pro Glu Ser Trp Met Pro Val Leu
1250 1255 1260
Ser Gln Lys Thr Phe Asn Arg Leu Ala Glu Gln Asp Trp Gln Lys
1265 1270 1275
Leu Tyr Asn Ala Arg Ile Ile Thr Glu Asp Gly Arg Ile Cys Thr
1280 1285 1290
Leu Ala Tyr Arg Arg Trp Val Gln Ser Gln Lys Ala Ile Pro Gln
1295 1300 1305
Leu Ile Ser Leu Leu Asn Asn Gly Pro Arg Leu Pro Pro His Ser
1310 1315 1320
Leu Thr Leu Thr Thr Asp Arg Tyr Asp His Asp Pro Glu Gln Gln
1325 1330 1335
Ile Arg Gln Gln Val Val Phe Ser Asp Gly Phe Gly Arg Leu Leu
1340 1345 1350
Gln Ala Ala Ala Arg His Glu Ala Gly Met Ala Arg Gln Arg Asn
1355 1360 1365
Glu Asp Gly Ser Leu Ile Ile Asn Val Gln His Thr Glu Asn Arg
1370 1375 1380
Trp Ala Val Thr Gly Arg Thr Glu Tyr Asp Asn Lys Gly Gln Pro
1385 1390 1395
Ile Arg Thr Tyr Gln Pro Tyr Phe Leu Asn Asp Trp Arg Tyr Val
1400 1405 1410
Ser Asn Asp Ser Ala Arg Gln Glu Lys Glu Ala Tyr Ala Asp Thr
1415 1420 1425
His Val Tyr Asp Pro Ile Gly Arg Glu Ile Lys Val Ile Thr Ala
1430 1435 1440
Lys Gly Trp Phe Arg Arg Thr Leu Phe Thr Pro Trp Phe Thr Val
1445 1450 1455
Asn Glu Asp Glu Asn Asp Thr Ala Ala Glu Val Lys Lys Val Lys
1460 1465 1470
Met Pro Gly Ser Glu Ala Tyr Ala Asp Thr His Val Tyr Asp Pro
1475 1480 1485
Ile Gly Arg Glu Ile Lys Val Ile Thr Ala Lys Gly Trp Phe Arg
1490 1495 1500
Arg Thr Leu Phe Thr Pro Trp Phe Thr Val Asn Glu Asp Glu Asn
1505 1510 1515
Asp Thr Ala Ala Glu Val Lys Lys Val Lys Met Pro Arg Leu Asp
1520 1525 1530
Arg Ala Ala Asp Ile Thr Thr Gln Asn Ala His Asp Ser Ala Ile
1535 1540 1545
Val Ala Leu Arg Gln Asn Ile Pro Thr Pro Ala Pro Leu Ser Leu
1550 1555 1560
Arg Ser Arg Pro Met Lys Asn Ile Asp Pro Lys Leu Tyr Gln Lys
1565 1570 1575
Thr Pro Thr Val Ser Val Tyr Asp Asn Arg Gly Leu Ile Ile Arg
1580 1585 1590
Asn Ile Asp Phe His Arg Thr Thr Ala Asn Gly Asp Pro Asp Thr
1595 1600 1605
Arg Ile Thr Arg His Gln Tyr Asp Ile His Gly His Leu Asn Gln
1610 1615 1620
Ser Ile Asp Pro Arg Leu Tyr Glu Ala Lys Gln Thr Asn Asn Thr
1625 1630 1635
Ile Lys Pro Asn Phe Leu Trp Gln Tyr Asp Leu Thr Gly Asn Pro
1640 1645 1650
Leu Cys Thr Glu Ser Ile Asp Ala Gly Arg Thr Val Thr Leu Asn
1655 1660 1665
Asp Ile Glu Gly Arg Pro Leu Leu Thr Val Thr Ala Thr Gly Val
1670 1675 1680
Ile Gln Thr Arg Gln Tyr Glu Thr Ser Ser Leu Pro Gly Arg Leu
1685 1690 1695
Leu Ser Val Ala Glu Gln Thr Pro Glu Glu Lys Thr Ser Arg Ile
1700 1705 1710
Thr Glu Arg Leu Ile Trp Ala Gly Asn Thr Glu Ala Glu Lys Asp
1715 1720 1725
His Asn Leu Ala Gly Gln Cys Val Arg His Tyr Asp Thr Ala Gly
1730 1735 1740
Val Thr Arg Leu Glu Ser Leu Ser Leu Thr Gly Thr Val Leu Ser
1745 1750 1755
Gln Ser Ser Gln Leu Leu Ile Asp Thr Gln Glu Ala Asn Trp Thr
1760 1765 1770
Gly Asp Asn Glu Thr Val Trp Gln Asn Met Leu Ala Asp Asp Ile
1775 1780 1785
Tyr Thr Thr Leu Ser Thr Phe Asp Ala Thr Gly Ala Leu Leu Thr
1790 1795 1800
Gln Thr Asp Ala Lys Gly Asn Ile Gln Arg Leu Ala Tyr Asp Val
1805 1810 1815
Ala Gly Gln Leu Asn Gly Ser Trp Leu Thr Leu Lys Gly Gln Thr
1820 1825 1830
Glu Gln Val Ile Ile Lys Ser Leu Thr Tyr Ser Ala Ala Gly Gln
1835 1840 1845
Lys Leu Arg Glu Glu His Gly Asn Asp Val Ile Thr Glu Tyr Ser
1850 1855 1860
Tyr Glu Pro Glu Thr Gln Arg Leu Ile Gly Ile Lys Thr Arg Arg
1865 1870 1875
Pro Ser Asp Thr Lys Val Leu Gln Asp Leu Arg Tyr Glu Tyr Asp
1880 1885 1890
Pro Val Gly Asn Val Ile Ser Ile Arg Asn Asp Ala Glu Ala Thr
1895 1900 1905
Arg Phe Trp His Asn Gln Lys Val Met Pro Glu Asn Thr Tyr Thr
1910 1915 1920
Tyr Asp Ser Leu Tyr Gln Leu Ile Ser Ala Thr Gly Arg Glu Met
1925 1930 1935
Ala Asn Ile Gly Gln Gln Ser His Gln Phe Pro Ser Pro Ala Leu
1940 1945 1950
Pro Ser Asp Asn Asn Thr Tyr Thr Asn Tyr Thr Arg Thr Tyr Thr
1955 1960 1965
Tyr Asp Arg Gly Gly Asn Leu Thr Lys Ile Gln His Ser Ser Pro
1970 1975 1980
Ala Thr Gln Asn Asn Tyr Thr Thr Asn Ile Thr Val Ser Asn Arg
1985 1990 1995
Ser Asn Arg Ala Val Leu Ser Thr Leu Thr Glu Asp Pro Ala Gln
2000 2005 2010
Val Asp Ala Leu Phe Asp Ala Gly Gly His Gln Asn Thr Leu Ile
2015 2020 2025
Ser Gly Gln Asn Leu Asn Trp Asn Thr Arg Gly Glu Leu Gln Gln
2030 2035 2040
Val Thr Leu Val Lys Arg Asp Lys Gly Ala Asn Asp Asp Arg Glu
2045 2050 2055
Trp Tyr Arg Tyr Ser Gly Asp Gly Arg Arg Met Leu Lys Ile Asn
2060 2065 2070
Glu Gln Gln Ala Ser Asn Asn Ala Gln Thr Gln Arg Val Thr Tyr
2075 2080 2085
Leu Pro Asn Leu Glu Leu Arg Leu Thr Gln Asn Ser Thr Ala Thr
2090 2095 2100
Thr Glu Asp Leu Gln Val Ile Thr Val Gly Glu Ala Gly Arg Ala
2105 2110 2115
Gln Val Arg Val Leu His Trp Glu Ser Gly Lys Pro Glu Asp Ile
2120 2125 2130
Asp Asn Asn Gln Leu Arg Tyr Ser Tyr Asp Asn Leu Ile Gly Ser
2135 2140 2145
Ser Gln Leu Glu Leu Asp Ser Glu Gly Gln Ile Ile Ser Glu Glu
2150 2155 2160
Glu Tyr Tyr Pro Tyr Gly Gly Thr Ala Leu Trp Ala Ala Arg Asn
2165 2170 2175
Gln Thr Glu Ala Ser Tyr Lys Thr Ile Arg Tyr Ser Gly Lys Glu
2180 2185 2190
Arg Asp Ala Thr Gly Leu Tyr Tyr Tyr Gly Tyr Arg Tyr Tyr Gln
2195 2200 2205
Pro Trp Ile Gly Arg Trp Leu Ser Ser Asp Pro Ala Gly Thr Ile
2210 2215 2220
Asp Gly Leu Asn Leu Tyr Arg Met Val Arg Asn Asn Pro Val Thr
2225 2230 2235
Leu Leu Asp Pro Asp Gly Leu Met Pro Thr Ile Ala Glu Arg Ile
2240 2245 2250
Ala Ala Leu Lys Lys Asn Lys Val Thr Asp Ser Ala Pro Ser Pro
2255 2260 2265
Ala Asn Ala Thr Asn Val Ala Ile Asn Ile Arg Pro Pro Val Ala
2270 2275 2280
Pro Lys Pro Ser Leu Pro Lys Ala Ser Thr Ser Ser Gln Pro Thr
2285 2290 2295
Thr His Pro Ile Gly Ala Ala Asn Ile Lys Pro Thr Thr Ser Gly
2300 2305 2310
Ser Ser Ile Val Ala Pro Leu Ser Pro Val Gly Asn Lys Ser Thr
2315 2320 2325
Ser Glu Ile Ser Leu Pro Glu Ser Ala Gln Ser Ser Ser Ser Ser
2330 2335 2340
Thr Thr Ser Thr Asn Leu Gln Lys Lys Ser Phe Thr Leu Tyr Arg
2345 2350 2355
Ala Asp Asn Arg Ser Phe Glu Glu Met Gln Ser Lys Phe Pro Glu
2360 2365 2370
Gly Phe Lys Ala Trp Thr Pro Leu Asp Thr Lys Met Ala Arg Gln
2375 2380 2385
Phe Ala Ser Ile Phe Ile Gly Gln Lys Asp Thr Ser Asn Leu Pro
2390 2395 2400
Lys Glu Thr Val Lys Asn Ile Ser Thr Trp Gly Ala Lys Pro Lys
2405 2410 2415
Leu Lys Asp Leu Ser Asn Tyr Ile Lys Tyr Thr Lys Asp Lys Ser
2420 2425 2430
Thr Val Trp Val Ser Thr Ala Ile Asn Thr Glu Ala Gly Gly Gln
2435 2440 2445
Ser Ser Gly Ala Pro Leu His Lys Ile Asp Met Asp Leu Tyr Glu
2450 2455 2460
Phe Ala Ile Asp Gly Gln Lys Leu Asn Pro Leu Pro Glu Gly Arg
2465 2470 2475
Thr Lys Asn Met Val Pro Ser Leu Leu Leu Asp Thr Pro Gln Ile
2480 2485 2490
Glu Thr Ser Ser Ile Ile Ala Leu Asn His Gly Pro Val Asn Asp
2495 2500 2505
Ala Glu Ile Ser Phe Leu Thr Thr Ile Pro Leu Lys Asn Val Lys
2510 2515 2520
Pro His Lys Arg
2525
<210> 57
<211> 7454
<212> DNA
<213> 人工序列
<220>
<223> 编码TcdB2/TccC3融合蛋白pDAB 8951的核酸序列
<400> 57
aaacaagaag gagatatacc atgaaaaaca tcgatcccaa actttatcaa aaaaccccta 60
ctgtcagcgt ttacgataac cgtggtctga taatccgtaa catcgatttt catcgtacta 120
ccgcaaatgg tgatcccgat acccgtatta cccgccatca atacgatatt cacggacacc 180
taaatcaaag catcgatccg cgcctatatg aagccaagca aaccaacaat acgatcaaac 240
ccaattttct ttggcagtat gatttgaccg gtaatcccct atgtacagag agcattgatg 300
caggtcgcac tgtcaccttg aatgatattg aaggccgtcc gctactaacg gtgactgcaa 360
caggggttat acaaactcga caatatgaaa cttcttccct gcccggtcgt ctgttatctg 420
ttgccgaaca aacacccgag gaaaaaacat cccgtatcac cgaacgcctg atttgggctg 480
gcaataccga agcagagaaa gaccataacc ttgccggcca gtgcgtgcgt cactatgaca 540
cggcgggagt tacccggtta gagagtttat cactgaccgg tactgtttta tctcaatcca 600
gccaactatt gatcgacact caagaggcaa actggacagg tgataacgaa accgtctggc 660
aaaacatgct ggctgatgac atctacacaa ccctgagcac cttcgatgcc accggtgctt 720
tactgactca gaccgatgcg aaagggaaca ttcagagact ggcttatgat gtggccgggc 780
agctaaacgg gagctggcta acactcaaag gccagacgga acaagtgatt atcaaatccc 840
tgacctactc cgccgccgga caaaaattac gtgaggaaca cggcaatgat gttatcaccg 900
aatacagtta tgaaccggaa acccaacggc tgatcggtat caaaacccgc cgtccgtcag 960
acactaaagt gctacaagac ctgcgctatg aatatgaccc ggtaggcaat gtcatcagca 1020
tccgtaatga cgcggaagcc acccgctttt ggcacaatca gaaagtgatg ccggaaaaca 1080
cttataccta cgattccctg tatcagctta tcagcgccac cgggcgcgaa atggcgaata 1140
taggtcaaca aagtcaccaa tttccctcac ccgctctacc ttctgataac aacacctata 1200
ccaactatac ccgtacttat acttatgacc gtggcggcaa tctgaccaaa atccagcaca 1260
gttcaccggc gacgcaaaac aactacacca ccaatatcac ggtttcaaat cgcagcaacc 1320
gcgcagtact cagcacattg accgaagatc cggcgcaagt agatgctttg tttgatgcag 1380
gcggacatca gaacaccttg atatcaggac aaaacctgaa ctggaatact cgtggtgaac 1440
tgcaacaagt aacactggtt aaacgggaca agggcgccaa tgatgatcgg gaatggtatc 1500
gttatagcgg tgacggaaga aggatgttaa aaatcaatga acagcaggcc agcaacaacg 1560
ctcaaacaca acgtgtgact tatttgccga acttagaact tcgtctaaca caaaacagca 1620
cggccacaac cgaagatttg caagttatca ccgtaggcga agcgggccgg gcacaggtac 1680
gagtattaca ttgggagagc ggtaaaccgg aagatatcga caataatcag ttgcgttata 1740
gttacgataa tcttatcggt tccagtcaac ttgaattaga tagcgaagga caaattatca 1800
gtgaagaaga atattatccc tatggtggaa cagcattatg ggccgccagg aatcagacag 1860
aagccagtta taaaactatc cgttattcag gcaaagagcg ggatgccacc gggctatatt 1920
actacggcta tcggtattac caaccgtgga taggacggtg gttaagctcc gatccggcag 1980
gaacaatcga tgggctgaat ttatatcgga tggtgaggaa taatccagtt accctccttg 2040
atcctgatgg attaatgcca acaattgcag aacgcatagc agcactaaaa aaaaataaag 2100
taacagactc agcgccttcg ccagcaaatg ccacaaacgt agcgataaac atccgcccgc 2160
ctgtagcacc aaaacctagc ttaccgaaag catcaacgag tagccaacca accacacacc 2220
ctatcggagc tgcaaacata aaaccaacga cgtctgggtc atctattgtt gctccattga 2280
gtccagtagg aaataaatct acttctgaaa tctctctgcc agaaagcgct caaagcagtt 2340
cttcaagcac tacctcgaca aatctacaga aaaaatcatt tactttatat agagcagata 2400
acagatcctt tgaagaaatg caaagtaaat tccctgaagg atttaaagcc tggactcctc 2460
tagacactaa gatggcaagg caatttgcta gtatctttat tggtcagaaa gatacatcta 2520
atttacctaa agaaacagtc aagaacataa gcacatgggg agcaaagcca aaactaaaag 2580
atctctcaaa ttacataaaa tataccaagg acaaatctac agtatgggtt tctactgcaa 2640
ttaatactga agcaggtgga caaagctcag gggctccact ccataaaatt gatatggatc 2700
tctacgagtt tgccattgat ggacaaaaac taaatccact accggagggt agaactaaaa 2760
acatggtacc ttccctttta ctcgacaccc cacaaataga gacatcatcc atcattgcac 2820
ttaatcatgg accggtaaat gatgcagaaa tttcatttct gacaacaatt ccgcttaaga 2880
atgtaaaacc tcataagaga ccacgtctgg accgcgcagc agatatcact acccaaaatg 2940
ctcacgacag cgcaattgtc gctctgcgtc agaatattcc tactccggca cctctgtccc 3000
tgcgcagcag gcctatgcaa aattcacaag attttagtat tacggagctc tcactgccca 3060
aagggggggg cgctatcacg ggaatgggtg aagcattaac ccccactgga ccggatggta 3120
tggccgcgct atctctacca ttgcctattt ctgccgggcg cggttatgct cccgcattca 3180
ctctgaatta caacagcggc gccggtaaca gtccatttgg tctgggttgg gattgcaacg 3240
ttatgactat ccgccgccgc acccattttg gcgtccccca ttatgacgaa accgatacct 3300
ttttggggcc agaaggcgaa gtgctggtgg tagcggatca acctcgcgac gaatccacat 3360
tacagggtat caatttaggc gccaccttta ccgttaccgg ctaccgttcc cgtctggaaa 3420
gccatttcag ccgattggaa tattggcaac ccaaaacaac aggtaaaaca gatttttggt 3480
tgatatatag cccagatggg caggtgcatc tactgggtaa atcaccgcaa gcgcggatca 3540
gcaacccatc ccaaacgaca caaacagcac aatggctgct ggaagcctct gtatcatcac 3600
gtggcgaaca aatttattat caatatcgcg ccgaagatga cacaggttgc gaagcagatg 3660
aaattacgca ccatttacag gctacagcgc aacgttattt acacatcgtg tattacggca 3720
accgtacagc cagcgaaaca ttacccggtc tggatggcag cgccccatca caagcagact 3780
ggttgttcta tctggtattt gattacggcg aacgcagtaa caacctgaaa acgccaccag 3840
cattttcgac tacaggtagc tggctttgcc gtcaggaccg tttttcccgt tatgaatatg 3900
gctttgagat tcgtacccgc cgcttatgcc gtcaggtatt gatgtaccat cacctgcaag 3960
cactggatag taagataaca gaacacaacg gaccaacgct ggtttcacgc ctgatactca 4020
attacgacga aagcgcgata gccagcacgc tagtattcgt tcgccgagtg ggacacgagc 4080
aagatggtaa tgtcgtcacc ctgccgccat tagaattggc atatcaggat ttttcaccgc 4140
gacatcacgc tcactggcaa ccaatggatg tactggcaaa cttcaatgcc attcagcgct 4200
ggcagctagt cgatctaaaa ggcgaaggat tacccggcct gttatatcag gataaaggcg 4260
cttggtggta ccgctccgca cagcgtctgg gcgaaattgg ctcagatgcc gtcacttggg 4320
aaaagatgca acctttatcg gttattcctt ctttgcaaag taatgcctcg ttggtggata 4380
tcaatggaga cggccaactt gactgggtta tcaccggacc gggattacgg ggatatcata 4440
gtcaacgccc ggatggcagt tggacacgtt ttaccccact caacgctctg ccggtggaat 4500
acacccatcc acgcgcgcaa ctcgcagatt taatgggagc cgggctatcc gatttggtgc 4560
tgatcggccc taagagcgtg cgtttatatg ccaatacccg cgacggcttt gccaaaggaa 4620
aagatgtggt gcaatccggt gatatcacac tgccggtgcc gggcgccgat ccacgtaagt 4680
tggtggcgtt tagtgatgta ttgggttcag gtcaagccca tctggttgaa gtaagcgcga 4740
ctaaagtcac ctgctggcct aatctggggc gcggacgttt tggtcaaccc attaccttac 4800
cgggattcag ccagccagca accgagttta acccggctca agtttatctg gccgatctgg 4860
atggcagcgg tccaacggat ctgatttatg ttcatacaaa ccgtctggat atcttcctga 4920
acaaaagtgg caatggcttt gctgaaccag tgacattacg cttcccggaa ggtctgcgtt 4980
ttgatcatac ctgtcagtta caaatggccg atgtacaagg attaggcgtc gccagcctga 5040
tactgagcgt gccgcatatg tctccccatc actggcgctg cgatctgacc aacatgaagc 5100
cgtggttact caatgaaatg aacaacaata tgggggtcca tcacaccttg cgttaccgca 5160
gttcctccca attctggctg gatgaaaaag ccgcggcgct gactaccgga caaacaccgg 5220
tttgctatct ccccttcccg atccacaccc tatggcaaac ggaaacagaa gatgaaatca 5280
gcggcaacaa attagtcaca acacttcgtt atgctcgtgg cgcatgggac ggacgcgagc 5340
gggaatttcg cggatttggt tatgtagagc agacagacag ccatcaactg gctcaaggca 5400
acgcgccaga acgtacgcca ccggcgctga ccaaaaactg gtatgccacc ggactgccgg 5460
tgatagataa cgcattatca accgagtatt ggcgtgatga tcaggctttt gccggtttct 5520
caccgcgctt tacgacttgg caagataaca aagatgtccc gttaacaccg gaagatgata 5580
acagtcgtta ctggttcaac cgcgcgttga aaggtcaact gctacgtagt gaactgtacg 5640
gattggacga tagtacaaat aaacacgttc cctatactgt cactgaattt cgttcacagg 5700
tacgtcgatt acagcatacc gacagccgat accctgtact ttggtcatct gtagttgaaa 5760
gccgcaacta tcactacgaa cgtatcgcca gcgacccgca atgcagtcaa aatattacgc 5820
tatccagtga tcgatttggt cagccgctaa aacagctttc ggtacagtac ccgcgccgcc 5880
agcagccagc aatcaatctg tatcctgata cattgcctga taagttgtta gccaacagct 5940
atgatgacca acaacgccaa ttacggctca cctatcaaca atccagttgg catcacctga 6000
ccaacaatac cgttcgagta ttgggattac cggatagtac ccgcagtgat atctttactt 6060
atggcgctga aaatgtgcct gctggtggtt taaatctgga acttctgagt gataaaaata 6120
gcctgatcgc ggacgataaa ccacgtgaat acctcggtca gcaaaaaacc gcttataccg 6180
atggacaaaa tacaacgccg ttgcaaacac caacacggca agccctgatt gcctttaccg 6240
aaacaacggt attcaaccag tccacattat cagcgtttaa cggaagcatc ccgtccgata 6300
aattatcaac gacgctggag caagctggat atcagcaaac aaattatcta ttccctcgca 6360
ctggagaaga taaagtttgg gtagcccatc acggctatac cgattatggt acagcggcac 6420
agttctggcg cccgcaaaaa cagagcaaca cccaactcac cggtaaaatc accctcatct 6480
gggatgcaaa ctattgcgtt gtggtacaaa cccgggatgc tgctggactg acaacctcag 6540
ccaaatatga ctggcgtttt ctgaccccgg tgcaactcac cgatatcaat gacaatcagc 6600
accttatcac actggatgca ttgggccgac caatcacatt gcgcttttgg ggaactgaaa 6660
acggcaagat gacaggttat tcctcaccgg aaaaagcatc attttctcca ccatccgatg 6720
ttaatgccgc tattgagtta aaaaaaccgc tccctgtagc acagtgtcag gtctacgcac 6780
cagaaagctg gatgccagta ttaagtcaga aaaccttcaa tcgactggca gaacaagatt 6840
ggcaaaagtt atataacgcc cgaatcatca ccgaagatgg acgtatctgc acactggctt 6900
atcgccgctg ggtacaaagc caaaaggcaa tccctcaact cattagcctg ttaaacaacg 6960
gaccccgttt acctcctcac agcctgacat tgacgacgga tcgttatgat cacgatcctg 7020
agcaacagat ccgtcaacag gtggtattca gtgatggctt tggccgcttg ctgcaagccg 7080
ctgcccgaca tgaggcaggc atggcccggc aacgcaatga agacggctct ttgattataa 7140
atgtccagca tactgagaac cgttgggcag tgactggacg aacggaatat gacaataagg 7200
ggcaaccgat acgtacctat cagccctatt tcctcaatga ctggcgatac gtcagcaatg 7260
atagtgcccg gcaggaaaaa gaagcttatg cagataccca tgtctatgat cccataggtc 7320
gagaaatcaa ggttatcacc gcaaaaggtt ggttccgtcg aaccttgttc actccctggt 7380
ttactgtcaa tgaagatgaa aatgacacag ccgctgaggt gaagaaggta aagatgtaat 7440
taatctgact cgag 7454
<210> 58
<211> 2472
<212> PRT
<213> 人工序列
<220>
<223> TcdB2/TccC3融合蛋白pDAB8951
<400> 58
Met Lys Asn Ile Asp Pro Lys Leu Tyr Gln Lys Thr Pro Thr Val Ser
1 5 10 15
Val Tyr Asp Asn Arg Gly Leu Ile Ile Arg Asn Ile Asp Phe His Arg
20 25 30
Thr Thr Ala Asn Gly Asp Pro Asp Thr Arg Ile Thr Arg His Gln Tyr
35 40 45
Asp Ile His Gly His Leu Asn Gln Ser Ile Asp Pro Arg Leu Tyr Glu
50 55 60
Ala Lys Gln Thr Asn Asn Thr Ile Lys Pro Asn Phe Leu Trp Gln Tyr
65 70 75 80
Asp Leu Thr Gly Asn Pro Leu Cys Thr Glu Ser Ile Asp Ala Gly Arg
85 90 95
Thr Val Thr Leu Asn Asp Ile Glu Gly Arg Pro Leu Leu Thr Val Thr
100 105 110
Ala Thr Gly Val Ile Gln Thr Arg Gln Tyr Glu Thr Ser Ser Leu Pro
115 120 125
Gly Arg Leu Leu Ser Val Ala Glu Gln Thr Pro Glu Glu Lys Thr Ser
130 135 140
Arg Ile Thr Glu Arg Leu Ile Trp Ala Gly Asn Thr Glu Ala Glu Lys
145 150 155 160
Asp His Asn Leu Ala Gly Gln Cys Val Arg His Tyr Asp Thr Ala Gly
165 170 175
Val Thr Arg Leu Glu Ser Leu Ser Leu Thr Gly Thr Val Leu Ser Gln
180 185 190
Ser Ser Gln Leu Leu Ile Asp Thr Gln Glu Ala Asn Trp Thr Gly Asp
195 200 205
Asn Glu Thr Val Trp Gln Asn Met Leu Ala Asp Asp Ile Tyr Thr Thr
210 215 220
Leu Ser Thr Phe Asp Ala Thr Gly Ala Leu Leu Thr Gln Thr Asp Ala
225 230 235 240
Lys Gly Asn Ile Gln Arg Leu Ala Tyr Asp Val Ala Gly Gln Leu Asn
245 250 255
Gly Ser Trp Leu Thr Leu Lys Gly Gln Thr Glu Gln Val Ile Ile Lys
260 265 270
Ser Leu Thr Tyr Ser Ala Ala Gly Gln Lys Leu Arg Glu Glu His Gly
275 280 285
Asn Asp Val Ile Thr Glu Tyr Ser Tyr Glu Pro Glu Thr Gln Arg Leu
290 295 300
Ile Gly Ile Lys Thr Arg Arg Pro Ser Asp Thr Lys Val Leu Gln Asp
305 310 315 320
Leu Arg Tyr Glu Tyr Asp Pro Val Gly Asn Val Ile Ser Ile Arg Asn
325 330 335
Asp Ala Glu Ala Thr Arg Phe Trp His Asn Gln Lys Val Met Pro Glu
340 345 350
Asn Thr Tyr Thr Tyr Asp Ser Leu Tyr Gln Leu Ile Ser Ala Thr Gly
355 360 365
Arg Glu Met Ala Asn Ile Gly Gln Gln Ser His Gln Phe Pro Ser Pro
370 375 380
Ala Leu Pro Ser Asp Asn Asn Thr Tyr Thr Asn Tyr Thr Arg Thr Tyr
385 390 395 400
Thr Tyr Asp Arg Gly Gly Asn Leu Thr Lys Ile Gln His Ser Ser Pro
405 410 415
Ala Thr Gln Asn Asn Tyr Thr Thr Asn Ile Thr Val Ser Asn Arg Ser
420 425 430
Asn Arg Ala Val Leu Ser Thr Leu Thr Glu Asp Pro Ala Gln Val Asp
435 440 445
Ala Leu Phe Asp Ala Gly Gly His Gln Asn Thr Leu Ile Ser Gly Gln
450 455 460
Asn Leu Asn Trp Asn Thr Arg Gly Glu Leu Gln Gln Val Thr Leu Val
465 470 475 480
Lys Arg Asp Lys Gly Ala Asn Asp Asp Arg Glu Trp Tyr Arg Tyr Ser
485 490 495
Gly Asp Gly Arg Arg Met Leu Lys Ile Asn Glu Gln Gln Ala Ser Asn
500 505 510
Asn Ala Gln Thr Gln Arg Val Thr Tyr Leu Pro Asn Leu Glu Leu Arg
515 520 525
Leu Thr Gln Asn Ser Thr Ala Thr Thr Glu Asp Leu Gln Val Ile Thr
530 535 540
Val Gly Glu Ala Gly Arg Ala Gln Val Arg Val Leu His Trp Glu Ser
545 550 555 560
Gly Lys Pro Glu Asp Ile Asp Asn Asn Gln Leu Arg Tyr Ser Tyr Asp
565 570 575
Asn Leu Ile Gly Ser Ser Gln Leu Glu Leu Asp Ser Glu Gly Gln Ile
580 585 590
Ile Ser Glu Glu Glu Tyr Tyr Pro Tyr Gly Gly Thr Ala Leu Trp Ala
595 600 605
Ala Arg Asn Gln Thr Glu Ala Ser Tyr Lys Thr Ile Arg Tyr Ser Gly
610 615 620
Lys Glu Arg Asp Ala Thr Gly Leu Tyr Tyr Tyr Gly Tyr Arg Tyr Tyr
625 630 635 640
Gln Pro Trp Ile Gly Arg Trp Leu Ser Ser Asp Pro Ala Gly Thr Ile
645 650 655
Asp Gly Leu Asn Leu Tyr Arg Met Val Arg Asn Asn Pro Val Thr Leu
660 665 670
Leu Asp Pro Asp Gly Leu Met Pro Thr Ile Ala Glu Arg Ile Ala Ala
675 680 685
Leu Lys Lys Asn Lys Val Thr Asp Ser Ala Pro Ser Pro Ala Asn Ala
690 695 700
Thr Asn Val Ala Ile Asn Ile Arg Pro Pro Val Ala Pro Lys Pro Ser
705 710 715 720
Leu Pro Lys Ala Ser Thr Ser Ser Gln Pro Thr Thr His Pro Ile Gly
725 730 735
Ala Ala Asn Ile Lys Pro Thr Thr Ser Gly Ser Ser Ile Val Ala Pro
740 745 750
Leu Ser Pro Val Gly Asn Lys Ser Thr Ser Glu Ile Ser Leu Pro Glu
755 760 765
Ser Ala Gln Ser Ser Ser Ser Ser Thr Thr Ser Thr Asn Leu Gln Lys
770 775 780
Lys Ser Phe Thr Leu Tyr Arg Ala Asp Asn Arg Ser Phe Glu Glu Met
785 790 795 800
Gln Ser Lys Phe Pro Glu Gly Phe Lys Ala Trp Thr Pro Leu Asp Thr
805 810 815
Lys Met Ala Arg Gln Phe Ala Ser Ile Phe Ile Gly Gln Lys Asp Thr
820 825 830
Ser Asn Leu Pro Lys Glu Thr Val Lys Asn Ile Ser Thr Trp Gly Ala
835 840 845
Lys Pro Lys Leu Lys Asp Leu Ser Asn Tyr Ile Lys Tyr Thr Lys Asp
850 855 860
Lys Ser Thr Val Trp Val Ser Thr Ala Ile Asn Thr Glu Ala Gly Gly
865 870 875 880
Gln Ser Ser Gly Ala Pro Leu His Lys Ile Asp Met Asp Leu Tyr Glu
885 890 895
Phe Ala Ile Asp Gly Gln Lys Leu Asn Pro Leu Pro Glu Gly Arg Thr
900 905 910
Lys Asn Met Val Pro Ser Leu Leu Leu Asp Thr Pro Gln Ile Glu Thr
915 920 925
Ser Ser Ile Ile Ala Leu Asn His Gly Pro Val Asn Asp Ala Glu Ile
930 935 940
Ser Phe Leu Thr Thr Ile Pro Leu Lys Asn Val Lys Pro His Lys Arg
945 950 955 960
Pro Arg Leu Asp Arg Ala Ala Asp Ile Thr Thr Gln Asn Ala His Asp
965 970 975
Ser Ala Ile Val Ala Leu Arg Gln Asn Ile Pro Thr Pro Ala Pro Leu
980 985 990
Ser Leu Arg Ser Arg Pro Met Gln Asn Ser Gln Asp Phe Ser Ile Thr
995 1000 1005
Glu Leu Ser Leu Pro Lys Gly Gly Gly Ala Ile Thr Gly Met Gly
1010 1015 1020
Glu Ala Leu Thr Pro Thr Gly Pro Asp Gly Met Ala Ala Leu Ser
1025 1030 1035
Leu Pro Leu Pro Ile Ser Ala Gly Arg Gly Tyr Ala Pro Ala Phe
1040 1045 1050
Thr Leu Asn Tyr Asn Ser Gly Ala Gly Asn Ser Pro Phe Gly Leu
1055 1060 1065
Gly Trp Asp Cys Asn Val Met Thr Ile Arg Arg Arg Thr His Phe
1070 1075 1080
Gly Val Pro His Tyr Asp Glu Thr Asp Thr Phe Leu Gly Pro Glu
1085 1090 1095
Gly Glu Val Leu Val Val Ala Asp Gln Pro Arg Asp Glu Ser Thr
1100 1105 1110
Leu Gln Gly Ile Asn Leu Gly Ala Thr Phe Thr Val Thr Gly Tyr
1115 1120 1125
Arg Ser Arg Leu Glu Ser His Phe Ser Arg Leu Glu Tyr Trp Gln
1130 1135 1140
Pro Lys Thr Thr Gly Lys Thr Asp Phe Trp Leu Ile Tyr Ser Pro
1145 1150 1155
Asp Gly Gln Val His Leu Leu Gly Lys Ser Pro Gln Ala Arg Ile
1160 1165 1170
Ser Asn Pro Ser Gln Thr Thr Gln Thr Ala Gln Trp Leu Leu Glu
1175 1180 1185
Ala Ser Val Ser Ser Arg Gly Glu Gln Ile Tyr Tyr Gln Tyr Arg
1190 1195 1200
Ala Glu Asp Asp Thr Gly Cys Glu Ala Asp Glu Ile Thr His His
1205 1210 1215
Leu Gln Ala Thr Ala Gln Arg Tyr Leu His Ile Val Tyr Tyr Gly
1220 1225 1230
Asn Arg Thr Ala Ser Glu Thr Leu Pro Gly Leu Asp Gly Ser Ala
1235 1240 1245
Pro Ser Gln Ala Asp Trp Leu Phe Tyr Leu Val Phe Asp Tyr Gly
1250 1255 1260
Glu Arg Ser Asn Asn Leu Lys Thr Pro Pro Ala Phe Ser Thr Thr
1265 1270 1275
Gly Ser Trp Leu Cys Arg Gln Asp Arg Phe Ser Arg Tyr Glu Tyr
1280 1285 1290
Gly Phe Glu Ile Arg Thr Arg Arg Leu Cys Arg Gln Val Leu Met
1295 1300 1305
Tyr His His Leu Gln Ala Leu Asp Ser Lys Ile Thr Glu His Asn
1310 1315 1320
Gly Pro Thr Leu Val Ser Arg Leu Ile Leu Asn Tyr Asp Glu Ser
1325 1330 1335
Ala Ile Ala Ser Thr Leu Val Phe Val Arg Arg Val Gly His Glu
1340 1345 1350
Gln Asp Gly Asn Val Val Thr Leu Pro Pro Leu Glu Leu Ala Tyr
1355 1360 1365
Gln Asp Phe Ser Pro Arg His His Ala His Trp Gln Pro Met Asp
1370 1375 1380
Val Leu Ala Asn Phe Asn Ala Ile Gln Arg Trp Gln Leu Val Asp
1385 1390 1395
Leu Lys Gly Glu Gly Leu Pro Gly Leu Leu Tyr Gln Asp Lys Gly
1400 1405 1410
Ala Trp Trp Tyr Arg Ser Ala Gln Arg Leu Gly Glu Ile Gly Ser
1415 1420 1425
Asp Ala Val Thr Trp Glu Lys Met Gln Pro Leu Ser Val Ile Pro
1430 1435 1440
Ser Leu Gln Ser Asn Ala Ser Leu Val Asp Ile Asn Gly Asp Gly
1445 1450 1455
Gln Leu Asp Trp Val Ile Thr Gly Pro Gly Leu Arg Gly Tyr His
1460 1465 1470
Ser Gln Arg Pro Asp Gly Ser Trp Thr Arg Phe Thr Pro Leu Asn
1475 1480 1485
Ala Leu Pro Val Glu Tyr Thr His Pro Arg Ala Gln Leu Ala Asp
1490 1495 1500
Leu Met Gly Ala Gly Leu Ser Asp Leu Val Leu Ile Gly Pro Lys
1505 1510 1515
Ser Val Arg Leu Tyr Ala Asn Thr Arg Asp Gly Phe Ala Lys Gly
1520 1525 1530
Lys Asp Val Val Gln Ser Gly Asp Ile Thr Leu Pro Val Pro Gly
1535 1540 1545
Ala Asp Pro Arg Lys Leu Val Ala Phe Ser Asp Val Leu Gly Ser
1550 1555 1560
Gly Gln Ala His Leu Val Glu Val Ser Ala Thr Lys Val Thr Cys
1565 1570 1575
Trp Pro Asn Leu Gly Arg Gly Arg Phe Gly Gln Pro Ile Thr Leu
1580 1585 1590
Pro Gly Phe Ser Gln Pro Ala Thr Glu Phe Asn Pro Ala Gln Val
1595 1600 1605
Tyr Leu Ala Asp Leu Asp Gly Ser Gly Pro Thr Asp Leu Ile Tyr
1610 1615 1620
Val His Thr Asn Arg Leu Asp Ile Phe Leu Asn Lys Ser Gly Asn
1625 1630 1635
Gly Phe Ala Glu Pro Val Thr Leu Arg Phe Pro Glu Gly Leu Arg
1640 1645 1650
Phe Asp His Thr Cys Gln Leu Gln Met Ala Asp Val Gln Gly Leu
1655 1660 1665
Gly Val Ala Ser Leu Ile Leu Ser Val Pro His Met Ser Pro His
1670 1675 1680
His Trp Arg Cys Asp Leu Thr Asn Met Lys Pro Trp Leu Leu Asn
1685 1690 1695
Glu Met Asn Asn Asn Met Gly Val His His Thr Leu Arg Tyr Arg
1700 1705 1710
Ser Ser Ser Gln Phe Trp Leu Asp Glu Lys Ala Ala Ala Leu Thr
1715 1720 1725
Thr Gly Gln Thr Pro Val Cys Tyr Leu Pro Phe Pro Ile His Thr
1730 1735 1740
Leu Trp Gln Thr Glu Thr Glu Asp Glu Ile Ser Gly Asn Lys Leu
1745 1750 1755
Val Thr Thr Leu Arg Tyr Ala Arg Gly Ala Trp Asp Gly Arg Glu
1760 1765 1770
Arg Glu Phe Arg Gly Phe Gly Tyr Val Glu Gln Thr Asp Ser His
1775 1780 1785
Gln Leu Ala Gln Gly Asn Ala Pro Glu Arg Thr Pro Pro Ala Leu
1790 1795 1800
Thr Lys Asn Trp Tyr Ala Thr Gly Leu Pro Val Ile Asp Asn Ala
1805 1810 1815
Leu Ser Thr Glu Tyr Trp Arg Asp Asp Gln Ala Phe Ala Gly Phe
1820 1825 1830
Ser Pro Arg Phe Thr Thr Trp Gln Asp Asn Lys Asp Val Pro Leu
1835 1840 1845
Thr Pro Glu Asp Asp Asn Ser Arg Tyr Trp Phe Asn Arg Ala Leu
1850 1855 1860
Lys Gly Gln Leu Leu Arg Ser Glu Leu Tyr Gly Leu Asp Asp Ser
1865 1870 1875
Thr Asn Lys His Val Pro Tyr Thr Val Thr Glu Phe Arg Ser Gln
1880 1885 1890
Val Arg Arg Leu Gln His Thr Asp Ser Arg Tyr Pro Val Leu Trp
1895 1900 1905
Ser Ser Val Val Glu Ser Arg Asn Tyr His Tyr Glu Arg Ile Ala
1910 1915 1920
Ser Asp Pro Gln Cys Ser Gln Asn Ile Thr Leu Ser Ser Asp Arg
1925 1930 1935
Phe Gly Gln Pro Leu Lys Gln Leu Ser Val Gln Tyr Pro Arg Arg
1940 1945 1950
Gln Gln Pro Ala Ile Asn Leu Tyr Pro Asp Thr Leu Pro Asp Lys
1955 1960 1965
Leu Leu Ala Asn Ser Tyr Asp Asp Gln Gln Arg Gln Leu Arg Leu
1970 1975 1980
Thr Tyr Gln Gln Ser Ser Trp His His Leu Thr Asn Asn Thr Val
1985 1990 1995
Arg Val Leu Gly Leu Pro Asp Ser Thr Arg Ser Asp Ile Phe Thr
2000 2005 2010
Tyr Gly Ala Glu Asn Val Pro Ala Gly Gly Leu Asn Leu Glu Leu
2015 2020 2025
Leu Ser Asp Lys Asn Ser Leu Ile Ala Asp Asp Lys Pro Arg Glu
2030 2035 2040
Tyr Leu Gly Gln Gln Lys Thr Ala Tyr Thr Asp Gly Gln Asn Thr
2045 2050 2055
Thr Pro Leu Gln Thr Pro Thr Arg Gln Ala Leu Ile Ala Phe Thr
2060 2065 2070
Glu Thr Thr Val Phe Asn Gln Ser Thr Leu Ser Ala Phe Asn Gly
2075 2080 2085
Ser Ile Pro Ser Asp Lys Leu Ser Thr Thr Leu Glu Gln Ala Gly
2090 2095 2100
Tyr Gln Gln Thr Asn Tyr Leu Phe Pro Arg Thr Gly Glu Asp Lys
2105 2110 2115
Val Trp Val Ala His His Gly Tyr Thr Asp Tyr Gly Thr Ala Ala
2120 2125 2130
Gln Phe Trp Arg Pro Gln Lys Gln Ser Asn Thr Gln Leu Thr Gly
2135 2140 2145
Lys Ile Thr Leu Ile Trp Asp Ala Asn Tyr Cys Val Val Val Gln
2150 2155 2160
Thr Arg Asp Ala Ala Gly Leu Thr Thr Ser Ala Lys Tyr Asp Trp
2165 2170 2175
Arg Phe Leu Thr Pro Val Gln Leu Thr Asp Ile Asn Asp Asn Gln
2180 2185 2190
His Leu Ile Thr Leu Asp Ala Leu Gly Arg Pro Ile Thr Leu Arg
2195 2200 2205
Phe Trp Gly Thr Glu Asn Gly Lys Met Thr Gly Tyr Ser Ser Pro
2210 2215 2220
Glu Lys Ala Ser Phe Ser Pro Pro Ser Asp Val Asn Ala Ala Ile
2225 2230 2235
Glu Leu Lys Lys Pro Leu Pro Val Ala Gln Cys Gln Val Tyr Ala
2240 2245 2250
Pro Glu Ser Trp Met Pro Val Leu Ser Gln Lys Thr Phe Asn Arg
2255 2260 2265
Leu Ala Glu Gln Asp Trp Gln Lys Leu Tyr Asn Ala Arg Ile Ile
2270 2275 2280
Thr Glu Asp Gly Arg Ile Cys Thr Leu Ala Tyr Arg Arg Trp Val
2285 2290 2295
Gln Ser Gln Lys Ala Ile Pro Gln Leu Ile Ser Leu Leu Asn Asn
2300 2305 2310
Gly Pro Arg Leu Pro Pro His Ser Leu Thr Leu Thr Thr Asp Arg
2315 2320 2325
Tyr Asp His Asp Pro Glu Gln Gln Ile Arg Gln Gln Val Val Phe
2330 2335 2340
Ser Asp Gly Phe Gly Arg Leu Leu Gln Ala Ala Ala Arg His Glu
2345 2350 2355
Ala Gly Met Ala Arg Gln Arg Asn Glu Asp Gly Ser Leu Ile Ile
2360 2365 2370
Asn Val Gln His Thr Glu Asn Arg Trp Ala Val Thr Gly Arg Thr
2375 2380 2385
Glu Tyr Asp Asn Lys Gly Gln Pro Ile Arg Thr Tyr Gln Pro Tyr
2390 2395 2400
Phe Leu Asn Asp Trp Arg Tyr Val Ser Asn Asp Ser Ala Arg Gln
2405 2410 2415
Glu Lys Glu Ala Tyr Ala Asp Thr His Val Tyr Asp Pro Ile Gly
2420 2425 2430
Arg Glu Ile Lys Val Ile Thr Ala Lys Gly Trp Phe Arg Arg Thr
2435 2440 2445
Leu Phe Thr Pro Trp Phe Thr Val Asn Glu Asp Glu Asn Asp Thr
2450 2455 2460
Ala Ala Glu Val Lys Lys Val Lys Met
2465 2470
<210> 59
<211> 15036
<212> DNA
<213> 人工序列
<220>
<223> 编码TcdB2/TccC3融合蛋白pDAB 8811的核酸序列
<400> 59
tctagacgtg cgtcgacaag aaggagatat accatgtata gcacggctgt attactcaat 60
aaaatcagtc ccactcgcga cggtcagacg atgactcttg cggatctgca atatttatcc 120
ttcagtgaac tgagaaaaat ctttgatgac cagctcagtt ggggagaggc tcgccatctc 180
tatcatgaaa ctatagagca gaaaaaaaat aatcgcttgc tggaagcgcg tatttttacc 240
cgtgccaacc cacaattatc cggtgctatc cgactcggta ttgaacgaga cagcgtttca 300
cgcagttatg atgaaatgtt tggtgcccgt tcttcttcct ttgtgaaacc gggttcagtg 360
gcttccatgt tttcaccggc tggctatctc accgaattgt atcgtgaagc gaaggactta 420
catttttcaa gctctgctta tcatcttgat aatcgccgtc cggatctggc tgatctgact 480
ctgagccaga gtaatatgga tacagaaatt tccaccctga cactgtctaa cgaactgttg 540
ctggagcata ttacccgcaa gaccggaggt gattcggacg cattgatgga gagcctgtca 600
acttaccgtc aggccattga taccccttac catcagcctt acgagactat ccgtcaggtc 660
attatgaccc atgacagtac actgtcagcg ctgtcccgta atcctgaggt gatggggcag 720
gcggaagggg cttcattact ggcgattctg gccaatattt ctccggagct ttataacatt 780
ttgaccgaag agattacgga aaagaacgct gatgctttat ttgcgcaaaa cttcagtgaa 840
aatatcacgc ccgaaaattt cgcgtcacaa tcatggatag ccaagtatta tggtcttgaa 900
ctttctgagg tgcaaaaata cctcgggatg ttgcagaatg gctattctga cagcacctct 960
gcttatgtgg ataatatctc aacgggttta gtggtcaata atgaaagtaa actcgaagct 1020
tacaaaataa cacgtgtaaa aacagatgat tatgataaaa atataaatta ctttgatttg 1080
atgtatgaag gaaataatca gttctttata cgtgctaatt ttaaggtatc aagagaattt 1140
ggggctactc ttagaaaaaa cgcagggcca agtggcattg tcggcagcct ttccggtcct 1200
ctaatagcca atacgaattt taaaagtaat tatctaagta acatatctga ttctgaatac 1260
aaaaacggtg taaagatata cgcctatcgc tatacgtctt ccaccagcgc cacaaatcag 1320
ggcggcggaa tattcacttt tgagtcttat cccctgacta tatttgcgct caaactgaat 1380
aaagccattc gcttgtgcct gactagcggg ctttcaccga atgaactgca aactatcgta 1440
cgcagtgaca atgcacaagg catcatcaac gactccgttc tgaccaaagt tttctatact 1500
ctgttctaca gtcaccgtta tgcactgagc tttgatgatg cacaggtact gaacggatcg 1560
gtcattaatc aatatgccga cgatgacagt gtcagtcatt ttaaccgtct ctttaataca 1620
ccgccgctga aagggaaaat ctttgaagcc gacggcaaca cggtcagcat tgatccggat 1680
gaagagcaat ctacctttgc ccgttcagcc ctgatgcgtg gtctgggggt caacagtggt 1740
gaactgtatc agttaggcaa actggcgggt gtgctggacg cccaaaatac catcacactt 1800
tctgtcttcg ttatctcttc actgtatcgc ctcacgttac tggcccgtgt ccatcagctg 1860
acggtcaatg aactgtgtat gctttatggt ctttcgccgt tcaatggcaa aacaacggct 1920
tctttgtctt ccggggagtt gccacggctg gttatctggc tgtatcaggt gacgcagtgg 1980
ctgactgagg cggaaatcac cactgaagcg atctggttat tatgtacgcc agagtttagc 2040
gggaatattt caccggaaat cagtaatctg ctcaataacc tccgaccgag tattagtgaa 2100
gatatggcac agagtcacaa tcgggagctg caggctgaaa ttctcgcgcc gtttattgct 2160
gcaacgctgc atctggcgtc accggatatg gcacggtata tcctgttgtg gaccgataac 2220
ctgcggccgg gtggcttaga tattgccggg tttatgacac tggtattgaa agagtcgtta 2280
aatgccaatg aaaccaccca attggtacaa ttctgccatg tgatggcaca gttatcgctt 2340
tccgtacaga cactgcgcct cagtgaagcg gagctatccg tgctggtcat ctccggattc 2400
gccgtgctgg gggcaaaaaa tcaacctgcc ggacagcaca atattgatac gctattctca 2460
ctctaccgat tccaccagtg gattaatggg ctgggcaatc ccggctctga cacgctggat 2520
atgctgcgcc agcagacact cacggccgac agactggcct ccgtgatggg gctggacatc 2580
agtatggtaa cgcaggccat ggtttccgcc ggcgtgaacc agcttcagtg ttggcaggat 2640
atcaacaccg tgttgcagtg gatagatgtg gcatcagcac tgcacacgat gccgtcggtt 2700
atccgtacgc tggtgaatat ccgttacgtg actgcattaa acaaagccga gtcgaatctg 2760
ccttcctggg atgagtggca gacactggca gaaaatatgg aagccggact cagtacacaa 2820
caggctcaga cgctggcgga ttataccgcg gagcgcttga gtagcgtgct gtgcaattgg 2880
tttctggcga atatccagcc agaaggggtg tccctgcaca gccgggatga cctgtacagc 2940
tatttcctga ttgataatca ggtctcttct gccataaaaa ccacccgact ggcagaggcc 3000
attgccggta ttcagctcta catcaaccgg gcgctgaatc ggatagagcc taatgcccgt 3060
gccgatgtgt caacccgcca gttttttacc gactggacgg tgaataaccg ttacagcacc 3120
tggggcgggg tgtcgcggct ggtttattat ccggaaaatt acattgaccc aacccagcgt 3180
atcgggcaga cccggatgat ggatgaactg ctggaaaata tcagccagag taaacttagc 3240
cgggacacag tggaggatgc ctttaaaact tacctgaccc gctttgaaac cgtggcggat 3300
ctgaaagttg tcagcgccta tcacgacaac gtcaacagca acaccggact gacctggttt 3360
gtcggccaaa cgcgggagaa cctgccggaa tactactggt gtaacgtgga tatatcacgg 3420
atgcaggcgg gtgaactggc cgccaatgcc tggaaagagt ggacgaagat tgatacagcg 3480
gtcaacccct acaaggatgc aatacgtccg gtcatactca gggaacgttt gcaccttatc 3540
tgggtagaaa aagaggaagt ggcgaaaaat ggtactgatc cggtggaaac ctgtgaccgt 3600
tttactctga aactggcgtt tctgcgtcat gatggcagtt ggagtgcccc ctggtcttac 3660
gatatcacaa cgcaggtgga ggcggtcact gacaaaaaac ctgacactga acggctggcg 3720
ctggccgcat caggctttca gggcgaggac actctgctgg tgtttgtcta caaaaccggg 3780
aagagttact cggattttgg cggcagcaat aaaaatgtgg caggcatgac catttacggc 3840
gatggctcct tcaaaaagat ggagaacaca gcactcagcc gttacagcca actgaaaaat 3900
acctttgata tcattcatac tcaaggcaac gacttggtaa gaaaggccag ctatcgtttc 3960
gcgcaggatt ttgaagtgcc tgcctcgttg aatatgggtt ctgccatcgg tgatgatagt 4020
ctgacggtga tggagaacgg gaatattccg cagataacca gtaaatactc cagcgataac 4080
cttgctatta cgctacataa cgccgctttc actgtcagat atgatggcag tggcaatgtc 4140
atcagaaaca aacaaatcag cgccatgaaa ctgacggggg tggatggaaa gtcccagtac 4200
ggcaatgcat ttatcatcgc aaataccgtt aaacattatg gcggttactc tgatctgggg 4260
gggccgatca ccgtttataa taaaacgaaa aactatattg catcagttca aggccacttg 4320
atgaacgcag attacactag gcgtttgatt ctaacaccag ttgaaaataa ttattatgcc 4380
agattgttcg agtttccatt ttctccaaac acaattttaa acaccgtttt cacggttggt 4440
agcaataaaa ccagtgattt taaaaagtgc agttatgctg ttgatggtaa taattctcag 4500
ggcttccaga tatttagttc ctatcaatca tccggctggc tggatattga tacaggcatt 4560
aacaataccg atatcaaaat tacggtgatg gctggcagta aaacccacac ctttacggcc 4620
agtgaccata ttgcttcctt gccggcaaac agttttgatg ctatgccgta cacctttaag 4680
ccactggaaa tcgatgcttc atcgttggcc tttaccaata atattgctcc tctggatatc 4740
gtttttgaga ccaaagccaa agacgggcga gtgctgggta agatcaagca aacattatcg 4800
gtgaaacggg taaattataa tccggaagat attctgtttc tgcgtgaaac tcattcgggt 4860
gcccaatata tgcagctcgg ggtgtatcgt attcgtctta ataccctgct ggcttctcaa 4920
ctggtatcca gagcaaacac gggcattgat actatcctga caatggaaac ccagcggtta 4980
ccggaacctc cgttgggaga aggcttcttt gccaactttg ttctgcctaa atatgaccct 5040
gctgaacatg gcgatgagcg gtggtttaaa atccatattg ggaatgttgg cggtaacacg 5100
ggaaggcagc cttattacag cggaatgtta tccgatacgt cggaaaccag tatgacactg 5160
tttgtccctt atgccgaagg gtattacatg catgaaggtg tcagattggg ggttggatac 5220
cagaaaatta cctatgacaa cacttgggaa tctgctttct tttattttga tgagacaaaa 5280
cagcaatttg tattaattaa cgatgctgat catgattcag gaatgacgca acaggggatc 5340
gtgaaaaata tcaagaaata caaaggattt ttgaatgttt ctatcgcaac gggctattcc 5400
gccccgatgg atttcaatag tgccagcgcc ctctattact gggaattgtt ctattacacc 5460
ccgatgatgt gcttccagcg tttgctacag gaaaaacaat tcgacgaagc cacacaatgg 5520
ataaactacg tctacaatcc cgccggctat atcgttaacg gagaaatcgc cccctggatc 5580
tggaactgcc ggccgctgga agagaccacc tcctggaatg ccaatccgct ggatgccatc 5640
gatccggatg ccgtcgccca aaatgaccca atgcactaca agattgccac ctttatgcgc 5700
ctgttggatc aacttattct gcgcggcgat atggcctatc gagaactgac ccgcgatgcg 5760
ttgaatgaag ccaaaatgtg gtatgtgcgt actttagaat tgctcggtga tgagccggag 5820
gattacggta gccaacagtg ggcagcaccg tccctttccg gggcggcgag tcaaaccgtg 5880
caggcggctt atcagcagga tcttacgatg ctgggccgtg gtggggtttc caagaatctc 5940
cgtaccgcta actcgttggt gggtttgttc ctgccggaat ataacccggc gctcaccgat 6000
tactggcaaa ccctgcgttt gcgcctgttt aacctgcgcc ataatctttc cattgacgga 6060
cagccgttat cgctggcgat ttacgccgag cctaccgatc cgaaagcgct gctcaccagt 6120
atggtacagg cctctcaggg cggtagtgca gtgctgcccg gcacattgtc gttataccgc 6180
ttcccggtga tgctggagcg gacccgcaat ctggtagcgc aattaaccca gttcggcacc 6240
tctctgctca gtatggcaga gcatgatgat gccgatgaac tcaccacgct gctactacag 6300
cagggtatgg aactggcgac acagagcatc cgtattcagc aacgaactgt cgatgaagtg 6360
gatgctgata ttgctgtatt ggcagagagc cgccgcagtg cacaaaatcg tctggaaaaa 6420
taccagcagc tgtatgacga ggatatcaac cacggagaac agcgggcaat gtcactgctt 6480
gatgcagcgg caggtcagtc tctggccggg caggtgcttt caatagcgga aggggtggcc 6540
gatttagtgc caaacgtgtt cggtttagct tgtggcggca gtcgttgggg ggcagcactg 6600
cgtgcttccg cctccgtgat gtcgctttct gccacagctt cccaatattc cgcagacaaa 6660
atcagccgtt cggaagccta ccgccgccgc cgtcaggagt gggaaattca gcgtgataat 6720
gctgacggtg aagtcaaaca aatggatgcc cagttggaaa gcctgaaaat ccgccgcgaa 6780
gcagcacaga tgcaggtgga atatcaggag acccagcagg cccatactca ggctcagtta 6840
gagctgttac agcgtaaatt cacaaacaaa gcgctttaca gttggatgcg cggcaagctg 6900
agtgctatct attaccagtt ctttgacctg acccagtcct tctgcctgat ggcacaggaa 6960
gcgctgcgcc gcgagctgac cgacaacggt gttaccttta tccggggtgg ggcctggaac 7020
ggtacgactg cgggtttgat ggcgggtgaa acgttgctgc tgaatctggc agaaatggaa 7080
aaagtctggc tggagcgtga tgagcgggca ctggaagtga cccgtaccgt ctcgttggca 7140
cagttctatc aggccttatc atcagacaac tttaatctga ccgaaaaact cacgcaattc 7200
ctgcgtgaag ggaaaggcaa cgtaggagct tccggcaatg aattaaaact cagtaaccgt 7260
cagatagaag cctcagtgcg attgtctgat ttgaaaattt tcagcgacta ccccgaaagc 7320
cttggcaata cccgtcagtt gaaacaggtg agtgtcacct tgccggcgct ggttgggccg 7380
tatgaagata ttcgggcggt gctgaattac gggggcagca tcgtcatgcc acgcggttgc 7440
agtgctattg ctctctccca cggcgtgaat gacagtggtc aatttatgct ggatttcaac 7500
gattcccgtt atctgccgtt tgaaggtatt tccgtgaatg acagcggcag cctgacgttg 7560
agtttcccgg atgcgactga tcggcagaaa gcgctgctgg agagcctgag cgatatcatt 7620
ctgcatatcc gctataccat tcgttctcct agggatcgta cccgtccaac tagtatgcaa 7680
aattcacaag attttagtat tacggaactg tcactgccca aagggggggg cgctatcacg 7740
ggaatgggtg aagcattaac ccccactgga ccggatggta tggccgcgct atctctacca 7800
ttgcctattt ctgccgggcg cggttatgct cccgcattca ctctgaatta caacagcggc 7860
gccggtaaca gtccatttgg tctgggttgg gattgcaacg ttatgactat ccgccgccgc 7920
acccattttg gcgtccccca ttatgacgaa accgatacct ttttggggcc agaaggcgaa 7980
gtgctggtgg tagcggatca acctcgcgac gaatccacat tacagggtat caatttaggc 8040
gccaccttta ccgttaccgg ctaccgttcc cgtctggaaa gccatttcag ccgattggaa 8100
tattggcaac ccaaaacaac aggtaaaaca gatttttggt tgatatatag cccagatggg 8160
caggtgcatc tactgggtaa atcaccgcaa gcgcggatca gcaacccatc ccaaacgaca 8220
caaacagcac aatggctgct ggaagcctct gtatcatcac gtggcgaaca aatttattat 8280
caatatcgcg ccgaagatga cacaggttgc gaagcagatg aaattacgca ccatttacag 8340
gctacagcgc aacgttattt acacatcgtg tattacggca accgtacagc cagcgaaaca 8400
ttacccggtc tggatggcag cgccccatca caagcagact ggttgttcta tctggtattt 8460
gattacggcg aacgcagtaa caacctgaaa acgccaccag cattttcgac tacaggtagc 8520
tggctttgcc gtcaggaccg tttttcccgt tatgaatatg gctttgagat tcgtacccgc 8580
cgcttatgcc gtcaggtatt gatgtaccat cacctgcaag cactggatag taagataaca 8640
gaacacaacg gaccaacgct ggtttcacgc ctgatactca attacgacga aagcgcgata 8700
gccagcacgc tagtattcgt tcgccgagtg ggacacgagc aagatggtaa tgtcgtcacc 8760
ctgccgccat tagaattggc atatcaggat ttttcaccgc gacatcacgc tcactggcaa 8820
ccaatggatg tactggcaaa cttcaatgcc attcagcgct ggcagctagt cgatctaaaa 8880
ggcgaaggat tacccggcct gttatatcag gataaaggcg cttggtggta ccgctccgca 8940
cagcgtctgg gcgaaattgg ctcagatgcc gtcacttggg aaaagatgca acctttatcg 9000
gttattcctt ctttgcaaag taatgcctcg ttggtggata tcaatggaga cggccaactt 9060
gactgggtta tcaccggacc gggattacgg ggatatcata gtcaacgccc ggatggcagt 9120
tggacacgtt ttaccccact caacgctctg ccggtggaat acacccatcc acgcgcgcaa 9180
ctcgcagatt taatgggagc cgggctatcc gatttggtgc tgatcggccc taagagcgtg 9240
cgtttatatg ccaatacccg cgacggcttt gccaaaggaa aagatgtggt gcaatccggt 9300
gatatcacac tgccggtgcc gggcgccgat ccacgtaagt tggtggcgtt tagtgatgta 9360
ttgggttcag gtcaagccca tctggttgaa gtaagcgcga ctaaagtcac ctgctggcct 9420
aatctggggc gcggacgttt tggtcaaccc attaccttac cgggattcag ccagccagca 9480
accgagttta acccggctca agtttatctg gccgatctgg atggcagcgg tccaacggat 9540
ctgatttatg ttcatacaaa ccgtctggat atcttcctga acaaaagtgg caatggcttt 9600
gctgaaccag tgacattacg cttcccggaa ggtctgcgtt ttgatcatac ctgtcagtta 9660
caaatggccg atgtacaagg attaggcgtc gccagcctga tactgagcgt gccgcatatg 9720
tctccccatc actggcgctg cgatctgacc aacatgaagc cgtggttact caatgaaatg 9780
aacaacaata tgggggtcca tcacaccttg cgttaccgca gttcctccca attctggctg 9840
gatgaaaaag ccgcggcgct gactaccgga caaacaccgg tttgctatct ccccttcccg 9900
atccacaccc tatggcaaac ggaaacagaa gatgaaatca gcggcaacaa attagtcaca 9960
acacttcgtt atgctcgtgg cgcatgggac ggacgcgagc gggaatttcg cggatttggt 10020
tatgtagagc agacagacag ccatcaactg gctcaaggca acgcgccaga acgtacgcca 10080
ccggcgctga ccaaaaactg gtatgccacc ggactgccgg tgatagataa cgcattatca 10140
accgagtatt ggcgtgatga tcaggctttt gccggtttct caccgcgctt tacgacttgg 10200
caagataaca aagatgtccc gttaacaccg gaagatgata acagtcgtta ctggttcaac 10260
cgcgcgttga aaggtcaact gctacgtagt gaactgtacg gattggacga tagtacaaat 10320
aaacacgttc cctatactgt cactgaattt cgttcacagg tacgtcgatt acagcatacc 10380
gacagccgat accctgtact ttggtcatct gtagttgaaa gccgcaacta tcactacgaa 10440
cgtatcgcca gcgacccgca atgcagtcaa aatattacgc tatccagtga tcgatttggt 10500
cagccgctaa aacagctttc ggtacagtac ccgcgccgcc agcagccagc aatcaatctg 10560
tatcctgata cattgcctga taagttgtta gccaacagct atgatgacca acaacgccaa 10620
ttacggctca cctatcaaca atccagttgg catcacctga ccaacaatac cgttcgagta 10680
ttgggattac cggatagtac ccgcagtgat atctttactt atggcgctga aaatgtgcct 10740
gctggtggtt taaatctgga acttctgagt gataaaaata gcctgatcgc ggacgataaa 10800
ccacgtgaat acctcggtca gcaaaaaacc gcttataccg atggacaaaa tacaacgccg 10860
ttgcaaacac caacacggca agccctgatt gcctttaccg aaacaacggt attcaaccag 10920
tccacattat cagcgtttaa cggaagcatc ccgtccgata aattatcaac gacgctggag 10980
caagctggat atcagcaaac aaattatcta ttccctcgca ctggagaaga taaagtttgg 11040
gtagcccatc acggctatac cgattatggt acagcggcac agttctggcg cccgcaaaaa 11100
cagagcaaca cccaactcac cggtaaaatc accctcatct gggatgcaaa ctattgcgtt 11160
gtggtacaaa cccgggatgc tgctggactg acaacctcag ccaaatatga ctggcgtttt 11220
ctgaccccgg tgcaactcac cgatatcaat gacaatcagc accttatcac actggatgca 11280
ttgggccgac caatcacatt gcgcttttgg ggaactgaaa acggcaagat gacaggttat 11340
tcctcaccgg aaaaagcatc attttctcca ccatccgatg ttaatgccgc tattgagtta 11400
aaaaaaccgc tccctgtagc acagtgtcag gtctacgcac cagaaagctg gatgccagta 11460
ttaagtcaga aaaccttcaa tcgactggca gaacaagatt ggcaaaagtt atataacgcc 11520
cgaatcatca ccgaagatgg acgtatctgc acactggctt atcgccgctg ggtacaaagc 11580
caaaaggcaa tccctcaact cattagcctg ttaaacaacg gaccccgttt acctcctcac 11640
agcctgacat tgacgacgga tcgttatgat cacgatcctg agcaacagat ccgtcaacag 11700
gtggtattca gtgatggctt tggccgcttg ctgcaagccg ctgcccgaca tgaggcaggc 11760
atggcccggc aacgcaatga agacggctct ttgattataa atgtccagca tactgagaac 11820
cgttgggcag tgactggacg aacggaatat gacaataagg ggcaaccgat acgtacctat 11880
cagccctatt tcctcaatga ctggcgatac gtcagcaatg atagtgcccg gcaggaaaaa 11940
gaagcttatg cagataccca tgtctatgat cccataggtc gagaaatcaa ggttatcacc 12000
gcaaaaggtt ggttccgtcg aaccttgttc actccctggt ttactgtcaa tgaagatgaa 12060
aatgacacag ccgctgaggt gaagaaggta aagatgccgg gatccgacaa caagggtcag 12120
actatccgca ctaggcctat gaaaaacatc gatcccaaac tttatcaaaa aacccctact 12180
gtcagcgttt acgataaccg tggtctgata atccgtaaca tcgattttca tcgtactacc 12240
gcaaatggtg atcccgatac ccgtattacc cgccatcaat acgatattca cggacaccta 12300
aatcaaagca tcgatccgcg cctatatgaa gccaagcaaa ccaacaatac gatcaaaccc 12360
aattttcttt ggcagtatga tttgaccggt aatcccctat gtacagagag cattgatgca 12420
ggtcgcactg tcaccttgaa tgatattgaa ggccgtccgc tactaacggt gactgcaaca 12480
ggggttatac aaactcgaca atatgaaact tcttccctgc ccggtcgtct gttatctgtt 12540
gccgaacaaa cacccgagga aaaaacatcc cgtatcaccg aacgcctgat ttgggctggc 12600
aataccgaag cagagaaaga ccataacctt gccggccagt gcgtgcgtca ctatgacacg 12660
gcgggagtta cccggttaga gagtttatca ctgaccggta ctgttttatc tcaatccagc 12720
caactattga tcgacactca agaggcaaac tggacaggtg ataacgaaac cgtctggcaa 12780
aacatgctgg ctgatgacat ctacacaacc ctgagcacct tcgatgccac cggtgcttta 12840
ctgactcaga ccgatgcgaa agggaacatt cagagactgg cttatgatgt ggccgggcag 12900
ctaaacggga gctggctaac actcaaaggc cagacggaac aagtgattat caaatccctg 12960
acctactccg ccgccggaca aaaattacgt gaggaacacg gcaatgatgt tatcaccgaa 13020
tacagttatg aaccggaaac ccaacggctg atcggtatca aaacccgccg tccgtcagac 13080
actaaagtgc tacaagacct gcgctatgaa tatgacccgg taggcaatgt catcagcatc 13140
cgtaatgacg cggaagccac ccgcttttgg cacaatcaga aagtgatgcc ggaaaacact 13200
tatacctacg attccctgta tcagcttatc agcgccaccg ggcgcgaaat ggcgaatata 13260
ggtcaacaaa gtcaccaatt tccctcaccc gctctacctt ctgataacaa cacctatacc 13320
aactataccc gtacttatac ttatgaccgt ggcggcaatc tgaccaaaat ccagcacagt 13380
tcaccggcga cgcaaaacaa ctacaccacc aatatcacgg tttcaaatcg cagcaaccgc 13440
gcagtactca gcacattgac cgaagatccg gcgcaagtag atgctttgtt tgatgcaggc 13500
ggacatcaga acaccttgat atcaggacaa aacctgaact ggaatactcg tggtgaactg 13560
caacaagtaa cactggttaa acgggacaag ggcgccaatg atgatcggga atggtatcgt 13620
tatagcggtg acggaagaag gatgttaaaa atcaatgaac agcaggccag caacaacgct 13680
caaacacaac gtgtgactta tttgccgaac ttagaacttc gtctaacaca aaacagcacg 13740
gccacaaccg aagatttgca agttatcacc gtaggcgaag cgggccgggc acaggtacga 13800
gtattacatt gggagagcgg taaaccggaa gatatcgaca ataatcagtt gcgttatagt 13860
tacgataatc ttatcggttc cagtcaactt gaattagata gcgaaggaca aattatcagt 13920
gaagaagaat attatcccta tggtggaaca gcattatggg ccgccaggaa tcagacagaa 13980
gccagttata aaactatccg ttattcaggc aaagagcggg atgccaccgg gctatattac 14040
tacggctatc ggtattacca accgtggata ggacggtggt taagctccga tccggcagga 14100
acaatcgatg ggctgaattt atatcggatg gtgaggaata atccagttac cctccttgat 14160
cctgatggat taatgccaac aattgcagaa cgcatagcag cactaaaaaa aaataaagta 14220
acagactcag cgccttcgcc agcaaatgcc acaaacgtag cgataaacat ccgcccgcct 14280
gtagcaccaa aacctagctt accgaaagca tcaacgagta gccaaccaac cacacaccct 14340
atcggagctg caaacataaa accaacgacg tctgggtcat ctattgttgc tccattgagt 14400
ccagtaggaa ataaatctac ttctgaaatc tctctgccag aaagcgctca aagcagttct 14460
tcaagcacta cctcgacaaa tctacagaaa aaatcattta ctttatatag agcagataac 14520
agatcctttg aagaaatgca aagtaaattc cctgaaggat ttaaagcctg gactcctcta 14580
gacactaaga tggcaaggca atttgctagt atctttattg gtcagaaaga tacatctaat 14640
ttacctaaag aaacagtcaa gaacataagc acatggggag caaagccaaa actaaaagat 14700
ctctcaaatt acataaaata taccaaggac aaatctacag tatgggtttc tactgcaatt 14760
aatactgaag caggtggaca aagctcaggg gctccactcc ataaaattga tatggatctc 14820
tacgagtttg ccattgatgg acaaaaacta aatccactac cggagggtag aactaaaaac 14880
atggtacctt cccttttact cgacacccca caaatagaga catcatccat cattgcactt 14940
aatcatggac cggtaaatga tgcagaaatt tcatttctga caacaattcc gcttaaaaat 15000
gtaaaacctc ataagagata attaatctga ctcgag 15036
<210> 60
<211> 4995
<212> PRT
<213> 人工序列
<220>
<223> TcdB2/TccC3融合蛋白pDAB8811
<400> 60
Met Tyr Ser Thr Ala Val Leu Leu Asn Lys Ile Ser Pro Thr Arg Asp
1 5 10 15
Gly Gln Thr Met Thr Leu Ala Asp Leu Gln Tyr Leu Ser Phe Ser Glu
20 25 30
Leu Arg Lys Ile Phe Asp Asp Gln Leu Ser Trp Gly Glu Ala Arg His
35 40 45
Leu Tyr His Glu Thr Ile Glu Gln Lys Lys Asn Asn Arg Leu Leu Glu
50 55 60
Ala Arg Ile Phe Thr Arg Ala Asn Pro Gln Leu Ser Gly Ala Ile Arg
65 70 75 80
Leu Gly Ile Glu Arg Asp Ser Val Ser Arg Ser Tyr Asp Glu Met Phe
85 90 95
Gly Ala Arg Ser Ser Ser Phe Val Lys Pro Gly Ser Val Ala Ser Met
100 105 110
Phe Ser Pro Ala Gly Tyr Leu Thr Glu Leu Tyr Arg Glu Ala Lys Asp
115 120 125
Leu His Phe Ser Ser Ser Ala Tyr His Leu Asp Asn Arg Arg Pro Asp
130 135 140
Leu Ala Asp Leu Thr Leu Ser Gln Ser Asn Met Asp Thr Glu Ile Ser
145 150 155 160
Thr Leu Thr Leu Ser Asn Glu Leu Leu Leu Glu His Ile Thr Arg Lys
165 170 175
Thr Gly Gly Asp Ser Asp Ala Leu Met Glu Ser Leu Ser Thr Tyr Arg
180 185 190
Gln Ala Ile Asp Thr Pro Tyr His Gln Pro Tyr Glu Thr Ile Arg Gln
195 200 205
Val Ile Met Thr His Asp Ser Thr Leu Ser Ala Leu Ser Arg Asn Pro
210 215 220
Glu Val Met Gly Gln Ala Glu Gly Ala Ser Leu Leu Ala Ile Leu Ala
225 230 235 240
Asn Ile Ser Pro Glu Leu Tyr Asn Ile Leu Thr Glu Glu Ile Thr Glu
245 250 255
Lys Asn Ala Asp Ala Leu Phe Ala Gln Asn Phe Ser Glu Asn Ile Thr
260 265 270
Pro Glu Asn Phe Ala Ser Gln Ser Trp Ile Ala Lys Tyr Tyr Gly Leu
275 280 285
Glu Leu Ser Glu Val Gln Lys Tyr Leu Gly Met Leu Gln Asn Gly Tyr
290 295 300
Ser Asp Ser Thr Ser Ala Tyr Val Asp Asn Ile Ser Thr Gly Leu Val
305 310 315 320
Val Asn Asn Glu Ser Lys Leu Glu Ala Tyr Lys Ile Thr Arg Val Lys
325 330 335
Thr Asp Asp Tyr Asp Lys Asn Ile Asn Tyr Phe Asp Leu Met Tyr Glu
340 345 350
Gly Asn Asn Gln Phe Phe Ile Arg Ala Asn Phe Lys Val Ser Arg Glu
355 360 365
Phe Gly Ala Thr Leu Arg Lys Asn Ala Gly Pro Ser Gly Ile Val Gly
370 375 380
Ser Leu Ser Gly Pro Leu Ile Ala Asn Thr Asn Phe Lys Ser Asn Tyr
385 390 395 400
Leu Ser Asn Ile Ser Asp Ser Glu Tyr Lys Asn Gly Val Lys Ile Tyr
405 410 415
Ala Tyr Arg Tyr Thr Ser Ser Thr Ser Ala Thr Asn Gln Gly Gly Gly
420 425 430
Ile Phe Thr Phe Glu Ser Tyr Pro Leu Thr Ile Phe Ala Leu Lys Leu
435 440 445
Asn Lys Ala Ile Arg Leu Cys Leu Thr Ser Gly Leu Ser Pro Asn Glu
450 455 460
Leu Gln Thr Ile Val Arg Ser Asp Asn Ala Gln Gly Ile Ile Asn Asp
465 470 475 480
Ser Val Leu Thr Lys Val Phe Tyr Thr Leu Phe Tyr Ser His Arg Tyr
485 490 495
Ala Leu Ser Phe Asp Asp Ala Gln Val Leu Asn Gly Ser Val Ile Asn
500 505 510
Gln Tyr Ala Asp Asp Asp Ser Val Ser His Phe Asn Arg Leu Phe Asn
515 520 525
Thr Pro Pro Leu Lys Gly Lys Ile Phe Glu Ala Asp Gly Asn Thr Val
530 535 540
Ser Ile Asp Pro Asp Glu Glu Gln Ser Thr Phe Ala Arg Ser Ala Leu
545 550 555 560
Met Arg Gly Leu Gly Val Asn Ser Gly Glu Leu Tyr Gln Leu Gly Lys
565 570 575
Leu Ala Gly Val Leu Asp Ala Gln Asn Thr Ile Thr Leu Ser Val Phe
580 585 590
Val Ile Ser Ser Leu Tyr Arg Leu Thr Leu Leu Ala Arg Val His Gln
595 600 605
Leu Thr Val Asn Glu Leu Cys Met Leu Tyr Gly Leu Ser Pro Phe Asn
610 615 620
Gly Lys Thr Thr Ala Ser Leu Ser Ser Gly Glu Leu Pro Arg Leu Val
625 630 635 640
Ile Trp Leu Tyr Gln Val Thr Gln Trp Leu Thr Glu Ala Glu Ile Thr
645 650 655
Thr Glu Ala Ile Trp Leu Leu Cys Thr Pro Glu Phe Ser Gly Asn Ile
660 665 670
Ser Pro Glu Ile Ser Asn Leu Leu Asn Asn Leu Arg Pro Ser Ile Ser
675 680 685
Glu Asp Met Ala Gln Ser His Asn Arg Glu Leu Gln Ala Glu Ile Leu
690 695 700
Ala Pro Phe Ile Ala Ala Thr Leu His Leu Ala Ser Pro Asp Met Ala
705 710 715 720
Arg Tyr Ile Leu Leu Trp Thr Asp Asn Leu Arg Pro Gly Gly Leu Asp
725 730 735
Ile Ala Gly Phe Met Thr Leu Val Leu Lys Glu Ser Leu Asn Ala Asn
740 745 750
Glu Thr Thr Gln Leu Val Gln Phe Cys His Val Met Ala Gln Leu Ser
755 760 765
Leu Ser Val Gln Thr Leu Arg Leu Ser Glu Ala Glu Leu Ser Val Leu
770 775 780
Val Ile Ser Gly Phe Ala Val Leu Gly Ala Lys Asn Gln Pro Ala Gly
785 790 795 800
Gln His Asn Ile Asp Thr Leu Phe Ser Leu Tyr Arg Phe His Gln Trp
805 810 815
Ile Asn Gly Leu Gly Asn Pro Gly Ser Asp Thr Leu Asp Met Leu Arg
820 825 830
Gln Gln Thr Leu Thr Ala Asp Arg Leu Ala Ser Val Met Gly Leu Asp
835 840 845
Ile Ser Met Val Thr Gln Ala Met Val Ser Ala Gly Val Asn Gln Leu
850 855 860
Gln Cys Trp Gln Asp Ile Asn Thr Val Leu Gln Trp Ile Asp Val Ala
865 870 875 880
Ser Ala Leu His Thr Met Pro Ser Val Ile Arg Thr Leu Val Asn Ile
885 890 895
Arg Tyr Val Thr Ala Leu Asn Lys Ala Glu Ser Asn Leu Pro Ser Trp
900 905 910
Asp Glu Trp Gln Thr Leu Ala Glu Asn Met Glu Ala Gly Leu Ser Thr
915 920 925
Gln Gln Ala Gln Thr Leu Ala Asp Tyr Thr Ala Glu Arg Leu Ser Ser
930 935 940
Val Leu Cys Asn Trp Phe Leu Ala Asn Ile Gln Pro Glu Gly Val Ser
945 950 955 960
Leu His Ser Arg Asp Asp Leu Tyr Ser Tyr Phe Leu Ile Asp Asn Gln
965 970 975
Val Ser Ser Ala Ile Lys Thr Thr Arg Leu Ala Glu Ala Ile Ala Gly
980 985 990
Ile Gln Leu Tyr Ile Asn Arg Ala Leu Asn Arg Ile Glu Pro Asn Ala
995 1000 1005
Arg Ala Asp Val Ser Thr Arg Gln Phe Phe Thr Asp Trp Thr Val
1010 1015 1020
Asn Asn Arg Tyr Ser Thr Trp Gly Gly Val Ser Arg Leu Val Tyr
1025 1030 1035
Tyr Pro Glu Asn Tyr Ile Asp Pro Thr Gln Arg Ile Gly Gln Thr
1040 1045 1050
Arg Met Met Asp Glu Leu Leu Glu Asn Ile Ser Gln Ser Lys Leu
1055 1060 1065
Ser Arg Asp Thr Val Glu Asp Ala Phe Lys Thr Tyr Leu Thr Arg
1070 1075 1080
Phe Glu Thr Val Ala Asp Leu Lys Val Val Ser Ala Tyr His Asp
1085 1090 1095
Asn Val Asn Ser Asn Thr Gly Leu Thr Trp Phe Val Gly Gln Thr
1100 1105 1110
Arg Glu Asn Leu Pro Glu Tyr Tyr Trp Cys Asn Val Asp Ile Ser
1115 1120 1125
Arg Met Gln Ala Gly Glu Leu Ala Ala Asn Ala Trp Lys Glu Trp
1130 1135 1140
Thr Lys Ile Asp Thr Ala Val Asn Pro Tyr Lys Asp Ala Ile Arg
1145 1150 1155
Pro Val Ile Leu Arg Glu Arg Leu His Leu Ile Trp Val Glu Lys
1160 1165 1170
Glu Glu Val Ala Lys Asn Gly Thr Asp Pro Val Glu Thr Cys Asp
1175 1180 1185
Arg Phe Thr Leu Lys Leu Ala Phe Leu Arg His Asp Gly Ser Trp
1190 1195 1200
Ser Ala Pro Trp Ser Tyr Asp Ile Thr Thr Gln Val Glu Ala Val
1205 1210 1215
Thr Asp Lys Lys Pro Asp Thr Glu Arg Leu Ala Leu Ala Ala Ser
1220 1225 1230
Gly Phe Gln Gly Glu Asp Thr Leu Leu Val Phe Val Tyr Lys Thr
1235 1240 1245
Gly Lys Ser Tyr Ser Asp Phe Gly Gly Ser Asn Lys Asn Val Ala
1250 1255 1260
Gly Met Thr Ile Tyr Gly Asp Gly Ser Phe Lys Lys Met Glu Asn
1265 1270 1275
Thr Ala Leu Ser Arg Tyr Ser Gln Leu Lys Asn Thr Phe Asp Ile
1280 1285 1290
Ile His Thr Gln Gly Asn Asp Leu Val Arg Lys Ala Ser Tyr Arg
1295 1300 1305
Phe Ala Gln Asp Phe Glu Val Pro Ala Ser Leu Asn Met Gly Ser
1310 1315 1320
Ala Ile Gly Asp Asp Ser Leu Thr Val Met Glu Asn Gly Asn Ile
1325 1330 1335
Pro Gln Ile Thr Ser Lys Tyr Ser Ser Asp Asn Leu Ala Ile Thr
1340 1345 1350
Leu His Asn Ala Ala Phe Thr Val Arg Tyr Asp Gly Ser Gly Asn
1355 1360 1365
Val Ile Arg Asn Lys Gln Ile Ser Ala Met Lys Leu Thr Gly Val
1370 1375 1380
Asp Gly Lys Ser Gln Tyr Gly Asn Ala Phe Ile Ile Ala Asn Thr
1385 1390 1395
Val Lys His Tyr Gly Gly Tyr Ser Asp Leu Gly Gly Pro Ile Thr
1400 1405 1410
Val Tyr Asn Lys Thr Lys Asn Tyr Ile Ala Ser Val Gln Gly His
1415 1420 1425
Leu Met Asn Ala Asp Tyr Thr Arg Arg Leu Ile Leu Thr Pro Val
1430 1435 1440
Glu Asn Asn Tyr Tyr Ala Arg Leu Phe Glu Phe Pro Phe Ser Pro
1445 1450 1455
Asn Thr Ile Leu Asn Thr Val Phe Thr Val Gly Ser Asn Lys Thr
1460 1465 1470
Ser Asp Phe Lys Lys Cys Ser Tyr Ala Val Asp Gly Asn Asn Ser
1475 1480 1485
Gln Gly Phe Gln Ile Phe Ser Ser Tyr Gln Ser Ser Gly Trp Leu
1490 1495 1500
Asp Ile Asp Thr Gly Ile Asn Asn Thr Asp Ile Lys Ile Thr Val
1505 1510 1515
Met Ala Gly Ser Lys Thr His Thr Phe Thr Ala Ser Asp His Ile
1520 1525 1530
Ala Ser Leu Pro Ala Asn Ser Phe Asp Ala Met Pro Tyr Thr Phe
1535 1540 1545
Lys Pro Leu Glu Ile Asp Ala Ser Ser Leu Ala Phe Thr Asn Asn
1550 1555 1560
Ile Ala Pro Leu Asp Ile Val Phe Glu Thr Lys Ala Lys Asp Gly
1565 1570 1575
Arg Val Leu Gly Lys Ile Lys Gln Thr Leu Ser Val Lys Arg Val
1580 1585 1590
Asn Tyr Asn Pro Glu Asp Ile Leu Phe Leu Arg Glu Thr His Ser
1595 1600 1605
Gly Ala Gln Tyr Met Gln Leu Gly Val Tyr Arg Ile Arg Leu Asn
1610 1615 1620
Thr Leu Leu Ala Ser Gln Leu Val Ser Arg Ala Asn Thr Gly Ile
1625 1630 1635
Asp Thr Ile Leu Thr Met Glu Thr Gln Arg Leu Pro Glu Pro Pro
1640 1645 1650
Leu Gly Glu Gly Phe Phe Ala Asn Phe Val Leu Pro Lys Tyr Asp
1655 1660 1665
Pro Ala Glu His Gly Asp Glu Arg Trp Phe Lys Ile His Ile Gly
1670 1675 1680
Asn Val Gly Gly Asn Thr Gly Arg Gln Pro Tyr Tyr Ser Gly Met
1685 1690 1695
Leu Ser Asp Thr Ser Glu Thr Ser Met Thr Leu Phe Val Pro Tyr
1700 1705 1710
Ala Glu Gly Tyr Tyr Met His Glu Gly Val Arg Leu Gly Val Gly
1715 1720 1725
Tyr Gln Lys Ile Thr Tyr Asp Asn Thr Trp Glu Ser Ala Phe Phe
1730 1735 1740
Tyr Phe Asp Glu Thr Lys Gln Gln Phe Val Leu Ile Asn Asp Ala
1745 1750 1755
Asp His Asp Ser Gly Met Thr Gln Gln Gly Ile Val Lys Asn Ile
1760 1765 1770
Lys Lys Tyr Lys Gly Phe Leu Asn Val Ser Ile Ala Thr Gly Tyr
1775 1780 1785
Ser Ala Pro Met Asp Phe Asn Ser Ala Ser Ala Leu Tyr Tyr Trp
1790 1795 1800
Glu Leu Phe Tyr Tyr Thr Pro Met Met Cys Phe Gln Arg Leu Leu
1805 1810 1815
Gln Glu Lys Gln Phe Asp Glu Ala Thr Gln Trp Ile Asn Tyr Val
1820 1825 1830
Tyr Asn Pro Ala Gly Tyr Ile Val Asn Gly Glu Ile Ala Pro Trp
1835 1840 1845
Ile Trp Asn Cys Arg Pro Leu Glu Glu Thr Thr Ser Trp Asn Ala
1850 1855 1860
Asn Pro Leu Asp AlaIle Asp Pro Asp Ala Val Ala Gln Asn Asp
1865 1870 1875
Pro Met His Tyr Lys Ile Ala Thr Phe Met Arg Leu Leu Asp Gln
1880 1885 1890
Leu Ile Leu Arg Gly Asp Met Ala Tyr Arg Glu Leu Thr Arg Asp
1895 1900 1905
Ala Leu Asn Glu Ala Lys Met Trp Tyr Val Arg Thr Leu Glu Leu
1910 1915 1920
Leu Gly Asp Glu Pro Glu Asp Tyr Gly Ser Gln Gln Trp Ala Ala
1925 1930 1935
Pro Ser Leu Ser Gly Ala Ala Ser Gln Thr Val Gln Ala Ala Tyr
1940 1945 1950
Gln Gln Asp Leu Thr Met Leu Gly Arg Gly Gly Val Ser Lys Asn
1955 1960 1965
Leu Arg Thr Ala Asn Ser Leu Val Gly Leu Phe Leu Pro Glu Tyr
1970 1975 1980
Asn Pro Ala Leu Thr Asp Tyr Trp Gln Thr Leu Arg Leu Arg Leu
1985 1990 1995
Phe Asn Leu Arg His Asn Leu Ser Ile Asp Gly Gln Pro Leu Ser
2000 2005 2010
Leu Ala Ile Tyr Ala Glu Pro Thr Asp Pro Lys Ala Leu Leu Thr
2015 2020 2025
Ser Met Val Gln Ala Ser Gln Gly Gly Ser Ala Val Leu Pro Gly
2030 2035 2040
Thr Leu Ser Leu Tyr Arg Phe Pro Val Met Leu Glu Arg Thr Arg
2045 2050 2055
Asn Leu Val Ala Gln Leu Thr Gln Phe Gly Thr Ser Leu Leu Ser
2060 2065 2070
Met Ala Glu His Asp Asp Ala Asp Glu Leu Thr Thr Leu Leu Leu
2075 2080 2085
Gln Gln Gly Met Glu Leu Ala Thr Gln Ser Ile Arg Ile Gln Gln
2090 2095 2100
Arg Thr Val Asp Glu Val Asp Ala Asp Ile Ala Val Leu Ala Glu
2105 2110 2115
Ser Arg Arg Ser Ala Gln Asn Arg Leu Glu Lys Tyr Gln Gln Leu
2120 2125 2130
Tyr Asp Glu Asp Ile Asn His Gly Glu Gln Arg Ala Met Ser Leu
2135 2140 2145
Leu Asp Ala Ala Ala Gly Gln Ser Leu Ala Gly Gln Val Leu Ser
2150 2155 2160
Ile Ala Glu Gly Val Ala Asp Leu Val Pro Asn Val Phe Gly Leu
2165 2170 2175
Ala Cys Gly Gly Ser Arg Trp Gly Ala Ala Leu Arg Ala Ser Ala
2180 2185 2190
Ser Val Met Ser Leu Ser Ala Thr Ala Ser Gln Tyr Ser Ala Asp
2195 2200 2205
Lys Ile Ser Arg Ser Glu Ala Tyr Arg Arg Arg Arg Gln Glu Trp
2210 2215 2220
Glu Ile Gln Arg Asp Asn Ala Asp Gly Glu Val Lys Gln Met Asp
2225 2230 2235
Ala Gln Leu Glu Ser Leu Lys Ile Arg Arg Glu Ala Ala Gln Met
2240 2245 2250
Gln Val Glu Tyr Gln Glu Thr Gln Gln Ala His Thr Gln Ala Gln
2255 2260 2265
Leu Glu Leu Leu Gln Arg Lys Phe Thr Asn Lys Ala Leu Tyr Ser
2270 2275 2280
Trp Met Arg Gly Lys Leu Ser Ala Ile Tyr Tyr Gln Phe Phe Asp
2285 2290 2295
Leu Thr Gln Ser Phe Cys Leu Met Ala Gln Glu Ala Leu Arg Arg
2300 2305 2310
Glu Leu Thr Asp Asn Gly Val Thr Phe Ile Arg Gly Gly Ala Trp
2315 2320 2325
Asn Gly Thr Thr Ala Gly Leu Met Ala Gly Glu Thr Leu Leu Leu
2330 2335 2340
Asn Leu Ala Glu Met Glu Lys Val Trp Leu Glu Arg Asp Glu Arg
2345 2350 2355
Ala Leu Glu Val Thr Arg Thr Val Ser Leu Ala Gln Phe Tyr Gln
2360 2365 2370
Ala Leu Ser Ser Asp Asn Phe Asn Leu Thr Glu Lys Leu Thr Gln
2375 2380 2385
Phe Leu Arg Glu Gly Lys Gly Asn Val Gly Ala Ser Gly Asn Glu
2390 2395 2400
Leu Lys Leu Ser Asn Arg Gln Ile Glu Ala Ser Val Arg Leu Ser
2405 2410 2415
Asp Leu Lys Ile Phe Ser Asp Tyr Pro Glu Ser Leu Gly Asn Thr
2420 2425 2430
Arg Gln Leu Lys Gln Val Ser Val Thr Leu Pro Ala Leu Val Gly
2435 2440 2445
Pro Tyr Glu Asp Ile Arg Ala Val Leu Asn Tyr Gly Gly Ser Ile
2450 2455 2460
Val Met Pro Arg Gly Cys Ser Ala Ile Ala Leu Ser His Gly Val
2465 2470 2475
Asn Asp Ser Gly Gln Phe Met Leu Asp Phe Asn Asp Ser Arg Tyr
2480 2485 2490
Leu Pro Phe Glu Gly Ile Ser Val Asn Asp Ser Gly Ser Leu Thr
2495 2500 2505
Leu Ser Phe Pro Asp Ala Thr Asp Arg Gln Lys Ala Leu Leu Glu
2510 2515 2520
Ser Leu Ser Asp Ile Ile Leu His Ile Arg Tyr Thr Ile Arg Ser
2525 2530 2535
Pro Arg Asp Arg Thr Arg Pro Thr Ser Met Gln Asn Ser Gln Asp
2540 2545 2550
Phe Ser Ile Thr Glu Leu Ser Leu Pro Lys Gly Gly Gly Ala Ile
2555 2560 2565
Thr Gly Met Gly Glu Ala Leu Thr Pro Thr Gly Pro Asp Gly Met
2570 2575 2580
Ala Ala Leu Ser Leu Pro Leu Pro Ile Ser Ala Gly Arg Gly Tyr
2585 2590 2595
Ala Pro Ala Phe Thr Leu Asn Tyr Asn Ser Gly Ala Gly Asn Ser
2600 2605 2610
Pro Phe Gly Leu Gly Trp Asp Cys Asn Val Met Thr Ile Arg Arg
2615 2620 2625
Arg Thr His Phe Gly Val Pro His Tyr Asp Glu Thr Asp Thr Phe
2630 2635 2640
Leu Gly Pro Glu Gly Glu Val Leu Val Val Ala Asp Gln Pro Arg
2645 2650 2655
Asp Glu Ser Thr Leu Gln Gly Ile Asn Leu Gly Ala Thr Phe Thr
2660 2665 2670
Val Thr Gly Tyr Arg Ser Arg Leu Glu Ser His Phe Ser Arg Leu
2675 2680 2685
Glu Tyr Trp Gln Pro Lys Thr Thr Gly Lys Thr Asp Phe Trp Leu
2690 2695 2700
Ile Tyr Ser Pro Asp Gly Gln Val His Leu Leu Gly Lys Ser Pro
2705 2710 2715
Gln Ala Arg Ile Ser Asn Pro Ser Gln Thr Thr Gln Thr Ala Gln
2720 2725 2730
Trp Leu Leu Glu Ala Ser Val Ser Ser Arg Gly Glu Gln Ile Tyr
2735 2740 2745
Tyr Gln Tyr Arg Ala Glu Asp Asp Thr Gly Cys Glu Ala Asp Glu
2750 2755 2760
Ile Thr His His Leu Gln Ala Thr Ala Gln Arg Tyr Leu His Ile
2765 2770 2775
Val Tyr Tyr Gly Asn Arg Thr Ala Ser Glu Thr Leu Pro Gly Leu
2780 2785 2790
Asp Gly Ser Ala Pro Ser Gln Ala Asp Trp Leu Phe Tyr Leu Val
2795 2800 2805
Phe Asp Tyr Gly Glu Arg Ser Asn Asn Leu Lys Thr Pro Pro Ala
2810 2815 2820
Phe Ser Thr Thr Gly Ser Trp Leu Cys Arg Gln Asp Arg Phe Ser
2825 2830 2835
Arg Tyr Glu Tyr Gly Phe Glu Ile Arg Thr Arg Arg Leu Cys Arg
2840 2845 2850
Gln Val Leu Met Tyr His His Leu Gln Ala Leu Asp Ser Lys Ile
2855 2860 2865
Thr Glu His Asn Gly Pro Thr Leu Val Ser Arg Leu Ile Leu Asn
2870 2875 2880
Tyr Asp Glu Ser Ala Ile Ala Ser Thr Leu Val Phe Val Arg Arg
2885 2890 2895
Val Gly His Glu Gln Asp Gly Asn Val Val Thr Leu Pro Pro Leu
2900 2905 2910
Glu Leu Ala Tyr Gln Asp Phe Ser Pro Arg His His Ala His Trp
2915 2920 2925
Gln Pro Met Asp Val Leu Ala Asn Phe Asn Ala Ile Gln Arg Trp
2930 2935 2940
Gln Leu Val Asp Leu Lys Gly Glu Gly Leu Pro Gly Leu Leu Tyr
2945 2950 2955
Gln Asp Lys Gly Ala Trp Trp Tyr Arg Ser Ala Gln Arg Leu Gly
2960 2965 2970
Glu Ile Gly Ser Asp Ala Val Thr Trp Glu Lys Met Gln Pro Leu
2975 2980 2985
Ser Val Ile Pro Ser Leu Gln Ser Asn Ala Ser Leu Val Asp Ile
2990 2995 3000
Asn Gly Asp Gly Gln Leu Asp Trp Val Ile Thr Gly Pro Gly Leu
3005 3010 3015
Arg Gly Tyr His Ser Gln Arg Pro Asp Gly Ser Trp Thr Arg Phe
3020 3025 3030
Thr Pro Leu Asn Ala Leu Pro Val Glu Tyr Thr His Pro Arg Ala
3035 3040 3045
Gln Leu Ala Asp Leu Met Gly Ala Gly Leu Ser Asp Leu Val Leu
3050 3055 3060
Ile Gly Pro Lys Ser Val Arg Leu Tyr Ala Asn Thr Arg Asp Gly
3065 3070 3075
Phe Ala Lys Gly Lys Asp Val Val Gln Ser Gly Asp Ile Thr Leu
3080 3085 3090
Pro Val Pro Gly Ala Asp Pro Arg Lys Leu Val Ala Phe Ser Asp
3095 3100 3105
Val Leu Gly Ser Gly Gln Ala His Leu Val Glu Val Ser Ala Thr
3110 3115 3120
Lys Val Thr Cys Trp Pro Asn Leu Gly Arg Gly Arg Phe Gly Gln
3125 3130 3135
Pro Ile Thr Leu Pro Gly Phe Ser Gln Pro Ala Thr Glu Phe Asn
3140 3145 3150
Pro Ala Gln Val Tyr Leu Ala Asp Leu Asp Gly Ser Gly Pro Thr
3155 3160 3165
Asp Leu Ile Tyr Val His Thr Asn Arg Leu Asp Ile Phe Leu Asn
3170 3175 3180
Lys Ser Gly Asn Gly Phe Ala Glu Pro Val Thr Leu Arg Phe Pro
3185 3190 3195
Glu Gly Leu Arg Phe Asp His Thr Cys Gln Leu Gln Met Ala Asp
3200 3205 3210
Val Gln Gly Leu Gly Val Ala Ser Leu Ile Leu Ser Val Pro His
3215 3220 3225
Met Ser Pro His His Trp Arg Cys Asp Leu Thr Asn Met Lys Pro
3230 3235 3240
Trp Leu Leu Asn Glu Met Asn Asn Asn Met Gly Val His His Thr
3245 3250 3255
Leu Arg Tyr Arg Ser Ser Ser Gln Phe Trp Leu Asp Glu Lys Ala
3260 3265 3270
Ala Ala Leu Thr Thr Gly Gln Thr Pro Val Cys Tyr Leu Pro Phe
3275 3280 3285
Pro Ile His Thr Leu Trp Gln Thr Glu Thr Glu Asp Glu Ile Ser
3290 3295 3300
Gly Asn Lys Leu Val Thr Thr Leu Arg Tyr Ala Arg Gly Ala Trp
3305 3310 3315
Asp Gly Arg Glu Arg Glu Phe Arg Gly Phe Gly Tyr Val Glu Gln
3320 3325 3330
Thr Asp Ser His Gln Leu Ala Gln Gly Asn Ala Pro Glu Arg Thr
3335 3340 3345
Pro Pro Ala Leu Thr Lys Asn Trp Tyr Ala Thr Gly Leu Pro Val
3350 3355 3360
Ile Asp Asn Ala Leu Ser Thr Glu Tyr Trp Arg Asp Asp Gln Ala
3365 3370 3375
Phe Ala Gly Phe Ser Pro Arg Phe Thr Thr Trp Gln Asp Asn Lys
3380 3385 3390
Asp Val Pro Leu Thr Pro Glu Asp Asp Asn Ser Arg Tyr Trp Phe
3395 3400 3405
Asn Arg Ala Leu Lys Gly Gln Leu Leu Arg Ser Glu Leu Tyr Gly
3410 3415 3420
Leu Asp Asp Ser Thr Asn Lys His Val Pro Tyr Thr Val Thr Glu
3425 3430 3435
Phe Arg Ser Gln Val Arg Arg Leu Gln His Thr Asp Ser Arg Tyr
3440 3445 3450
Pro Val Leu Trp Ser Ser Val Val Glu Ser Arg Asn Tyr His Tyr
3455 3460 3465
Glu Arg Ile Ala Ser Asp Pro Gln Cys Ser Gln Asn Ile Thr Leu
3470 3475 3480
Ser Ser Asp Arg Phe Gly Gln Pro Leu Lys Gln Leu Ser Val Gln
3485 3490 3495
Tyr Pro Arg Arg Gln Gln Pro Ala Ile Asn Leu Tyr Pro Asp Thr
3500 3505 3510
Leu Pro Asp Lys Leu Leu Ala Asn Ser Tyr Asp Asp Gln Gln Arg
3515 3520 3525
Gln Leu Arg Leu Thr Tyr Gln Gln Ser Ser Trp His His Leu Thr
3530 3535 3540
Asn Asn Thr Val Arg Val Leu Gly Leu Pro Asp Ser Thr Arg Ser
3545 3550 3555
Asp Ile Phe Thr Tyr Gly Ala Glu Asn Val Pro Ala Gly Gly Leu
3560 3565 3570
Asn Leu Glu Leu Leu Ser Asp Lys Asn Ser Leu Ile Ala Asp Asp
3575 3580 3585
Lys Pro Arg Glu Tyr Leu Gly Gln Gln Lys Thr Ala Tyr Thr Asp
3590 3595 3600
Gly Gln Asn Thr Thr Pro Leu Gln Thr Pro Thr Arg Gln Ala Leu
3605 3610 3615
Ile Ala Phe Thr Glu Thr Thr Val Phe Asn Gln Ser Thr Leu Ser
3620 3625 3630
Ala Phe Asn Gly Ser Ile Pro Ser Asp Lys Leu Ser Thr Thr Leu
3635 3640 3645
Glu Gln Ala Gly Tyr Gln Gln Thr Asn Tyr Leu Phe Pro Arg Thr
3650 3655 3660
Gly Glu Asp Lys Val Trp Val Ala His His Gly Tyr Thr Asp Tyr
3665 3670 3675
Gly Thr Ala Ala Gln Phe Trp Arg Pro Gln Lys Gln Ser Asn Thr
3680 3685 3690
Gln Leu Thr Gly Lys Ile Thr Leu Ile Trp Asp Ala Asn Tyr Cys
3695 3700 3705
Val Val Val Gln Thr Arg Asp Ala Ala Gly Leu Thr Thr Ser Ala
3710 3715 3720
Lys Tyr Asp Trp Arg Phe Leu Thr Pro Val Gln Leu Thr Asp Ile
3725 3730 3735
Asn Asp Asn Gln His Leu Ile Thr Leu Asp Ala Leu Gly Arg Pro
3740 3745 3750
Ile Thr Leu Arg Phe Trp Gly Thr Glu Asn Gly Lys Met Thr Gly
3755 3760 3765
Tyr Ser Ser Pro Glu Lys Ala Ser Phe Ser Pro Pro Ser Asp Val
3770 3775 3780
Asn Ala Ala Ile Glu Leu Lys Lys Pro Leu Pro Val Ala Gln Cys
3785 3790 3795
Gln Val Tyr Ala Pro Glu Ser Trp Met Pro Val Leu Ser Gln Lys
3800 3805 3810
Thr Phe Asn Arg Leu Ala Glu Gln Asp Trp Gln Lys Leu Tyr Asn
3815 3820 3825
Ala Arg Ile Ile Thr Glu Asp Gly Arg Ile Cys Thr Leu Ala Tyr
3830 3835 3840
Arg Arg Trp Val Gln Ser Gln Lys Ala Ile Pro Gln Leu Ile Ser
3845 3850 3855
Leu Leu Asn Asn Gly Pro Arg Leu Pro Pro His Ser Leu Thr Leu
3860 3865 3870
Thr Thr Asp Arg Tyr Asp His Asp Pro Glu Gln Gln Ile Arg Gln
3875 3880 3885
Gln Val Val Phe Ser Asp Gly Phe Gly Arg Leu Leu Gln Ala Ala
3890 3895 3900
Ala Arg His Glu Ala Gly Met Ala Arg Gln Arg Asn Glu Asp Gly
3905 3910 3915
Ser Leu Ile Ile Asn Val Gln His Thr Glu Asn Arg Trp Ala Val
3920 3925 3930
Thr Gly Arg Thr Glu Tyr Asp Asn Lys Gly Gln Pro Ile Arg Thr
3935 3940 3945
Tyr Gln Pro Tyr Phe Leu Asn Asp Trp Arg Tyr Val Ser Asn Asp
3950 3955 3960
Ser Ala Arg Gln Glu Lys Glu Ala Tyr Ala Asp Thr His Val Tyr
3965 3970 3975
Asp Pro Ile Gly Arg Glu Ile Lys Val Ile Thr Ala Lys Gly Trp
3980 3985 3990
Phe Arg Arg Thr Leu Phe Thr Pro Trp Phe Thr Val Asn Glu Asp
3995 4000 4005
Glu Asn Asp Thr Ala Ala Glu Val Lys Lys Val Lys Met Pro Gly
4010 4015 4020
Ser Asp Asn Lys Gly Gln Thr Ile Arg Thr Arg Pro Met Lys Asn
4025 4030 4035
Ile Asp Pro Lys Leu Tyr Gln Lys Thr Pro Thr Val Ser Val Tyr
4040 4045 4050
Asp Asn Arg Gly Leu Ile Ile Arg Asn Ile Asp Phe His Arg Thr
4055 4060 4065
Thr Ala Asn Gly Asp Pro Asp Thr Arg Ile Thr Arg His Gln Tyr
4070 4075 4080
Asp Ile His Gly His Leu Asn Gln Ser Ile Asp Pro Arg Leu Tyr
4085 4090 4095
Glu Ala Lys Gln Thr Asn Asn Thr Ile Lys Pro Asn Phe Leu Trp
4100 4105 4110
Gln Tyr Asp Leu Thr Gly Asn Pro Leu Cys Thr Glu Ser Ile Asp
4115 4120 4125
Ala Gly Arg Thr Val Thr Leu Asn Asp Ile Glu Gly Arg Pro Leu
4130 4135 4140
Leu Thr Val Thr Ala Thr Gly Val Ile Gln Thr Arg Gln Tyr Glu
4145 4150 4155
Thr Ser Ser Leu Pro Gly Arg Leu Leu Ser Val Ala Glu Gln Thr
4160 4165 4170
Pro Glu Glu Lys Thr Ser Arg Ile Thr Glu Arg Leu Ile Trp Ala
4175 4180 4185
Gly Asn Thr Glu Ala Glu Lys Asp His Asn Leu Ala Gly Gln Cys
4190 4195 4200
Val Arg His Tyr Asp Thr Ala Gly Val Thr Arg Leu Glu Ser Leu
4205 4210 4215
Ser Leu Thr Gly Thr Val Leu Ser Gln Ser Ser Gln Leu Leu Ile
4220 4225 4230
Asp Thr Gln Glu Ala Asn Trp Thr Gly Asp Asn Glu Thr Val Trp
4235 4240 4245
Gln Asn Met Leu Ala Asp Asp Ile Tyr Thr Thr Leu Ser Thr Phe
4250 4255 4260
Asp Ala Thr Gly Ala Leu Leu Thr Gln Thr Asp Ala Lys Gly Asn
4265 4270 4275
Ile Gln Arg Leu Ala Tyr Asp Val Ala Gly Gln Leu Asn Gly Ser
4280 4285 4290
Trp Leu Thr Leu Lys Gly Gln Thr Glu Gln Val Ile Ile Lys Ser
4295 4300 4305
Leu Thr Tyr Ser Ala Ala Gly Gln Lys Leu Arg Glu Glu His Gly
4310 4315 4320
Asn Asp Val Ile Thr Glu Tyr Ser Tyr Glu Pro Glu Thr Gln Arg
4325 4330 4335
Leu Ile Gly Ile Lys Thr Arg Arg Pro Ser Asp Thr Lys Val Leu
4340 4345 4350
Gln Asp Leu Arg Tyr Glu Tyr Asp Pro Val Gly Asn Val Ile Ser
4355 4360 4365
Ile Arg Asn Asp Ala Glu Ala Thr Arg Phe Trp His Asn Gln Lys
4370 4375 4380
Val Met Pro Glu Asn Thr Tyr Thr Tyr Asp Ser Leu Tyr Gln Leu
4385 4390 4395
Ile Ser Ala Thr Gly Arg Glu Met Ala Asn Ile Gly Gln Gln Ser
4400 4405 4410
His Gln Phe Pro Ser Pro Ala Leu Pro Ser Asp Asn Asn Thr Tyr
4415 4420 4425
Thr Asn Tyr Thr Arg Thr Tyr Thr Tyr Asp Arg Gly Gly Asn Leu
4430 4435 4440
Thr Lys Ile Gln His Ser Ser Pro Ala Thr Gln Asn Asn Tyr Thr
4445 4450 4455
Thr Asn Ile Thr Val Ser Asn Arg Ser Asn Arg Ala Val Leu Ser
4460 4465 4470
Thr Leu Thr Glu Asp Pro Ala Gln Val Asp Ala Leu Phe Asp Ala
4475 4480 4485
Gly Gly His Gln Asn Thr Leu Ile Ser Gly Gln Asn Leu Asn Trp
4490 4495 4500
Asn Thr Arg Gly Glu Leu Gln Gln Val Thr Leu Val Lys Arg Asp
4505 4510 4515
Lys Gly Ala Asn Asp Asp Arg Glu Trp Tyr Arg Tyr Ser Gly Asp
4520 4525 4530
Gly Arg Arg Met Leu Lys Ile Asn Glu Gln Gln Ala Ser Asn Asn
4535 4540 4545
Ala Gln Thr Gln Arg Val Thr Tyr Leu Pro Asn Leu Glu Leu Arg
4550 4555 4560
Leu Thr Gln Asn Ser Thr Ala Thr Thr Glu Asp Leu Gln Val Ile
4565 4570 4575
Thr Val Gly Glu Ala Gly Arg Ala Gln Val Arg Val Leu His Trp
4580 4585 4590
Glu Ser Gly Lys Pro Glu Asp Ile Asp Asn Asn Gln Leu Arg Tyr
4595 4600 4605
Ser Tyr Asp Asn Leu Ile Gly Ser Ser Gln Leu Glu Leu Asp Ser
4610 4615 4620
Glu Gly Gln Ile Ile Ser Glu Glu Glu Tyr Tyr Pro Tyr Gly Gly
4625 4630 4635
Thr Ala Leu Trp Ala Ala Arg Asn Gln Thr Glu Ala Ser Tyr Lys
4640 4645 4650
Thr Ile Arg Tyr Ser Gly Lys Glu Arg Asp Ala Thr Gly Leu Tyr
4655 4660 4665
Tyr Tyr Gly Tyr Arg Tyr Tyr Gln Pro Trp Ile Gly Arg Trp Leu
4670 4675 4680
Ser Ser Asp Pro Ala Gly Thr Ile Asp Gly Leu Asn Leu Tyr Arg
4685 4690 4695
Met Val Arg Asn Asn Pro Val Thr Leu Leu Asp Pro Asp Gly Leu
4700 4705 4710
Met Pro Thr Ile Ala Glu Arg Ile Ala Ala Leu Lys Lys Asn Lys
4715 4720 4725
Val Thr Asp Ser Ala Pro Ser Pro Ala Asn Ala Thr Asn Val Ala
4730 4735 4740
Ile Asn Ile Arg Pro Pro Val Ala Pro Lys Pro Ser Leu Pro Lys
4745 4750 4755
Ala Ser Thr Ser Ser Gln Pro Thr Thr His Pro Ile Gly Ala Ala
4760 4765 4770
Asn Ile Lys Pro Thr Thr Ser Gly Ser Ser Ile Val Ala Pro Leu
4775 4780 4785
Ser Pro Val Gly Asn Lys Ser Thr Ser Glu Ile Ser Leu Pro Glu
4790 4795 4800
Ser Ala Gln Ser Ser Ser Ser Ser Thr Thr Ser Thr Asn Leu Gln
4805 4810 4815
Lys Lys Ser Phe Thr Leu Tyr Arg Ala Asp Asn Arg Ser Phe Glu
4820 4825 4830
Glu Met Gln Ser Lys Phe Pro Glu Gly Phe Lys Ala Trp Thr Pro
4835 4840 4845
Leu Asp Thr Lys Met Ala Arg Gln Phe Ala Ser Ile Phe Ile Gly
4850 4855 4860
Gln Lys Asp Thr Ser Asn Leu Pro Lys Glu Thr Val Lys Asn Ile
4865 4870 4875
Ser Thr Trp Gly Ala Lys Pro Lys Leu Lys Asp Leu Ser Asn Tyr
4880 4885 4890
Ile Lys Tyr Thr Lys Asp Lys Ser Thr Val Trp Val Ser Thr Ala
4895 4900 4905
Ile Asn Thr Glu Ala Gly Gly Gln Ser Ser Gly Ala Pro Leu His
4910 4915 4920
Lys Ile Asp Met Asp Leu Tyr Glu Phe Ala Ile Asp Gly Gln Lys
4925 4930 4935
Leu Asn Pro Leu Pro Glu Gly Arg Thr Lys Asn Met Val Pro Ser
4940 4945 4950
Leu Leu Leu Asp Thr Pro Gln Ile Glu Thr Ser Ser Ile Ile Ala
4955 4960 4965
Leu Asn His Gly Pro Val Asn Asp Ala Glu Ile Ser Phe Leu Thr
4970 4975 4980
Thr Ile Pro Leu Lys Asn Val Lys Pro His Lys Arg
4985 4990 4995
<210> 61
<211> 7569
<212> DNA
<213> 嗜线虫致病杆菌(Xenorhabdus nematophilus)
<400> 61
atgataaaag ttaatgaact gttagataag ataaatagaa aaaggtctgg tgatacttta 60
ttattgacaa acatttcgtt tatgtctttc agcgaatttc gtcataggac aagtggaact 120
ctgacgtggc gagaaacaga ctttttatat caacaggctc atcaggaatc aaaacagaat 180
aaacttgaag aactgcgcat tttgtcccgt gctaatccac aactggctaa taccactaac 240
cttaatatta caccgtcaac cctaaacaat agttacaaca gttggtttta tggccgtgcc 300
caccgttttg taaaaccggg atcaattgct tccatatttt caccagcggc ttatttaaca 360
gaattatatc gggaagcgaa agattttcat cctgacaatt ctcaatatca cctgaataaa 420
cgacgccccg acattgcttc actggcactg acacagaata atatggatga agaaatttcc 480
acattatcct tatctaatga attactgctg cataatattc agacgttaga gaaaactgac 540
tataacggtg taatgaaaat gttgtccact taccggcaaa ccggcatgac accctatcat 600
ctgccgtatg agtcagcccg tcaggcaatt ttattgcaag ataaaaacct caccgcattt 660
agccgtaata cagacgtagc ggaattaatg gacccaacat cgctactggc tattaagact 720
gatatatcgc ctgaattgta tcaaatcctt gtagaagaaa ttacaccgga aaattcaaca 780
gaactgatga agaaaaattt cggtacagat gatgtactga tttttaagag ttatgcttct 840
ttggctcgct actacgattt gtcttatgat gaactcagtt tatttgtcaa tctctccttc 900
ggtaagaaaa atacaaatca acagtataag aatgagcaac tgataacatt ggtcaatgac 960
gggaatgata cggcaacggc aagattgatt aagcgaaccc gcaaagattt ctacgattca 1020
catttaaact atgcagaact aattccaatc aaagaaaatg aatacaaata taatttcagt 1080
gtaaaaaaaa cagaacctga ccacttggat tttcgtctcc agaatggaga taaagaatat 1140
atataccaag ataaaaattt cgtccccatt gctaataccc attacagtat tcccattaaa 1200
ttgacgacag agcaaatcac caacggtata acactccgct tatggcgagt taaaccaaat 1260
ccgtcggatg ctatcaatgc caatgcatac tttaaaatga tggagttccc cggtgatata 1320
ttcctgttaa agctgaataa agcgattcgt ttgtataaag ccacaggcat atctccagaa 1380
gatatctggc aagtaataga aagtatttat gatgacttaa ccattgacag caatgtgttg 1440
ggtaagctgt tttatgttca atattatatg cagcactata atattagcgt cagcgatgcg 1500
ctggtattgt gtcattcaga tatcagccaa tattccacta aacaacaacc cagtcatttt 1560
acaatactgt tcaatacacc gctattaaat ggccaagagt tttctgctga taataccaaa 1620
ctggatttaa cccccggtga atcaaaaaac catttttatt tgggaataat gaaacgtgct 1680
ttcagagtga atgatactga actgtataca ttatggaagc tggctaatgg cggaacaaat 1740
ccagaattta tgtgttccat cgagaacctg tctctgcttt atcgcgttcg tctgctggca 1800
gacattcatc atctgacagt gaatgaatta tccatgttgt tgtcggtttc tccctatgtg 1860
aacacgaaaa ttgccctttt ttctgataca gcattaacgc aattaatcag ctttctgttc 1920
caatgcaccc agtggctgac aacacagaaa tggtctgtca gtgatgtgtt tctgatgacc 1980
acggataatt acagcactgt ccttacgccg gatattgaaa accttatcac gacactaagt 2040
aatggattat caacactttc actcggtgat gacgaactga tccgtgcagc tgccccgctg 2100
attgctgcca gcattcaaat ggattcagcc aagacagcag aaactatttt gctgtggatt 2160
aatcagataa aaccacaagg actgacattc gatgatttca tgattattgc ggctaaccgt 2220
gatcgctcag agaatgaaac cagcaacatg gtggcttttt gtcaggtact ggggcaactt 2280
tctctgattg tgcgcaatat tggactcagc gaaaacgaac tgaccctgtt ggtgacaaaa 2340
ccggagaaat tccaatcaga aaccacagca ctgcaacatg atctccccac tttgcaagcg 2400
ctgacccgct tccatgctgt gatcatgcgt tgtggaagct acgcgacaga aatcttaaca 2460
gcattggaac taggagcgct gactgccgaa caattggcgg tggcgttaaa atttgatgct 2520
caggttgtga cacaagcatt gcaacagacc ggtttgggag tgaatacctt taccaactgg 2580
agaactatag atgtcactct gcaatggctg gatgtcgctg ctacattggg tattaccccg 2640
gatggtgttg ctgcactcat aaaattaaaa tatatcggtg aaccagaaac cccgatgcca 2700
acatttgatg attggcaagc cgccagtact ttgttgcagg cgggactgaa cagtcaacaa 2760
tccgaccagc ttcaggcatg gctggatgaa gccacgacga cagcggccag tgcttactac 2820
atcaaaaata gtgcacctca acagattaag agccgggatg agttgtacag ctatctgctg 2880
attgataacc aagtttctgc ccaagtgaaa accacccgtg tggcagaagc cattgccagc 2940
attcagttat atgtcaaccg ggcgttgaat aatgttgaag gaaaagtatc aaagccagtg 3000
aaaacccgtc agttcttctg cgactgggaa acctacaatc gacggtatag cacctgggcc 3060
ggcgtatctg aactggccta ttatccggaa aactatatcg accccacgat tcgtattggt 3120
cagacaggta tgatgaacaa cctgttacag caactttccc aaagtcagtt aaatatcgat 3180
accgttgaag atagctttaa aaattatctg accgcatttg aagatgtcgc taacttgcag 3240
gtgattagcg gatatcatga cagtatcaat gtcaatgagg gactcactta tttaattggt 3300
tatagccaga cagaacccag aatatattat tggcgcaatg tcgatcacca aaagtgccag 3360
cacggtcaat ttgctgccaa tgcctgggga gaatggaaaa aaattgaaat acccatcaat 3420
gtatggcagg aaaatatcag acctgttatt tacaagtctc gtttgtattt actgtggctg 3480
gaacaaaaag agctgaaaaa tgaaagtgaa gatggcaaga tagatatcac tgattatata 3540
ttaaaactgt cacatattcg ttatgatggc agctggagct caccgtttaa ttttaatgtg 3600
actgataaaa tagaaaacct gatcaataaa aaagccagca ttggtatgta ttgttcttct 3660
gattatgaaa aagacgtcat tattgtttat ttccatgaga aaaaagacaa ttattctttt 3720
aatagtcttc ctgcaagaga agggatgacc attaaccctg atatgacatt atccattctc 3780
acagaaaatg atttagacgc cattgttaag agcacattat cagaacttga taccaggaca 3840
gaatacaaag tcaacaatca atttgctaca gattatttgg ccgaatataa ggaatctata 3900
accacaaaaa ataaattagc cagttttacc ggaaatattt ttgatctctc gtatatatca 3960
ccaggaaatg gtcatattaa tttaacgttc aatccttcaa tggaaattaa tttttcaaaa 4020
ggcaatatat ataatgatga ggttaaatac ctgttatcga tggtagaaga tgaaacggtt 4080
attttatttg attatgatag acatgatgaa atgcttggaa aagaagaaga agtttttcat 4140
tatggaactt tggattttat tatttccatc gatcttaaaa atgccgaata ttttagagtg 4200
ttaatgcatc taagaaccaa ggaaaaaatt cctagaaaat cagaaattgg agttggtata 4260
aattatgatt atgaatcaaa tgatgctgaa ttcaaacttg atactaacat agtattagat 4320
tggaaagata acacaggagt atggcatact atatgtgaat catttactaa tgatgtttca 4380
atcattaata acatgggaaa tattgcggca ctgttccttc gcgaggatcc atgtgtgtat 4440
ttatgttcaa tagccacaga tataaaaatt gcttcatcta tgatcgaaca gatccaagat 4500
aaaaacatta gttttttatt aaaaaatggc tctgatattc tagtggagtt aaatgctgaa 4560
gaccatgtgg catctaaacc ttcacacgaa tctgacccta tggtatatga ttttaatcaa 4620
gtaaaagttg atattgaagg ctatgatatt cctctggtga gcgagtttat tattaagcaa 4680
cccgacggcg gttataacga tattgttatt gaatcgccaa ttcatataaa actaaaatcc 4740
aaagatacaa gtaacgttat atcactgcat aaaatgccat caggcacaca atatatgcag 4800
attggccctt acagaacccg gttaaatact ttattttcca gaaaattagc tgaaagagcc 4860
aatattggta ttgataatgt tttaagtatg gaaacgcaaa atttaccaga gccgcaatta 4920
ggtgaagggt tttatgcgac atttaagttg cccccctaca ataaagagga gcatggtgat 4980
gaacgttggt ttaagatcca tattgggaat attgatggca attctgccag acaaccttat 5040
tacgaaggaa tgttatctga tattgaaacc acagtaacgc tctttgttcc ctatgctaaa 5100
ggatattaca tacgtgaagg tgtcagatta ggggttgggt acaaaaaaat tatctatgac 5160
aaatcctggg aatctgcttt cttttatttt gatgagacga aaaatcaatt tatattcatt 5220
aatgatgccg atcatgattc gggaatgaca caacagggga tagtaaaaaa tatcaaaaaa 5280
tataaagggt ttattcatgt cgttgtcatg aaaaataaca ctgaacccat ggatttcaac 5340
ggcgccaatg caatctattt ctgggaattg ttctattaca cgcccatgat ggtattccag 5400
cgcttattgc aagagcagaa ttttaccgaa tcgacacgct ggctgcgcta tatctggaac 5460
ccggccggat attcggttca gggtgaaatg caggattatt actggaacgt ccgcccattg 5520
gaggaagata cgtcctggaa tgccaatccg ctggattcgg tcgatcctga cgccgttgcc 5580
cagcatgatc cgatgcacta taaagtggct acctttatga aaatgctgga tttgttgatt 5640
acccgcggag atagcgccta tcgccagctt gaacgtgata ccttaaacga agctaaaatg 5700
tggtatgtac aggcgctcac tttattgggt gatgagcctt atttttcatt ggataacgat 5760
tggtcagagc cacggctgga agaagctgcc agccaaacaa tgcggcatca ttatcaacat 5820
aaaatgctgc aactgcgtca gcgcgctgca ttacccacga aacgtacggc aaattcgtta 5880
accgcattgt tcctccctca aattaataaa aaactgcaag gttactggca gacattgacg 5940
caacgcctct ataacttacg ccataacctg acaatcgacg gtcagccact gtcattatct 6000
ctctatgcca cgcccgcaga tccgtccatg ttactcagtg ctgccatcac tgcttcacaa 6060
ggcggcggcg atttacctca tgcagtgatg ccgatgtacc gttttccggt gattctggaa 6120
aatgccaagt ggggggtaag ccagttgata caatttggca ataccctgct cagcattact 6180
gaacggcagg atgcagaagc cttggctgaa atactgcaaa ctcaaggcag tgagttagcc 6240
ctgcaaagta ttaaaatgca ggataaggtc atggctgaaa ttgatgctga taaattggcg 6300
cttcaagaaa gccgtcatgg tgcacagtct cgttttgaca gtttcaatac gctgtacgac 6360
gaagatgtta acgctggtga aaaacaagcg atggatcttt acctctcttc atcggtcttg 6420
agcaccagcg gcacagccct gcatatggcc gccgccgcgg cagatctcgt ccccaatatt 6480
tacggttttg ctgtgggagg ttcccgtttt ggggcgcttt tcaatgccag tgcgattggt 6540
atcgaaattt ctgcgtcagc aacacgtatt gccgcagaca aaatcagcca atcagaaata 6600
taccgtcgcc gtcggcaaga gtgggaaatt cagcgcaata atgcggaagc tgagataaaa 6660
caaattgatg ctcaattagc gacgctggct gtacgtcgtg aagcggcagt attacaaaaa 6720
aactatctgg aaactcagca ggcacaaact caggcgcagt tagcctttct gcaaagtaaa 6780
ttcagtaatg cagcgctata caactggctc cgtggaaggt tgtccgctat ttattatcag 6840
ttttatgatt tggcggtctc actctgttta atggcagagc aaacttatca gtatgaattg 6900
aataatgcgg cagcacactt tattaaacca ggtgcctggc atgggactta tgcgggttta 6960
ttagcgggtg aaaccctgat gctgaattta gcacagatgg aaaaaagcta tttggaaaaa 7020
gatgaacggg cactggaggt caccagaacc gtttctctgg ctgaagtgta tgctggtctg 7080
acagaaaata gtttcatttt aaaagataaa gtgactgagt tagtcaatgc aggtgaaggc 7140
agtgcaggca caacgcttaa cggtttgaac gtcgaaggga cacaactgca agccagcctc 7200
aaattatcgg atctgaatat tgctaccgat tatcctgacg gtttaggtaa tacacgccgt 7260
atcaaacaaa tcagtgtgac attacctgcc cttttagggc cttatcagga tgttcgggca 7320
atactaagtt atggcggcag cacaatgatg ccacgtggct gcaaagcgat tgcgatctca 7380
catggcatga atgacagtgg tcaattccag atggatttca atgatgccaa gtacctgcca 7440
tttgaagggc ttcctgtggc cgatacaggc acattaaccc tcagttttcc cggtatcagt 7500
ggtaaacaga aaagcttatt gctcagcctg agcgatatca ttctgcatat ccgttacacc 7560
attcgttct 7569
<210> 62
<211> 7614
<212> DNA
<213> 嗜线虫致病杆菌
<400> 62
atgtatagca cggctgtatt actcaataaa atcagtccca ctcgcgacgg tcagacgatg 60
actcttgcgg atctgcaata tttatccttc agtgaactga gaaaaatctt tgatgaccag 120
ctcagttggg gagaggctcg ccatctctat catgaaacta tagagcagaa aaaaaataat 180
cgcttgctgg aagcgcgtat ttttacccgt gccaacccac aattatccgg tgctatccga 240
ctcggtattg aacgagacag cgtttcacgc agttatgatg aaatgtttgg tgcccgttct 300
tcttcctttg tgaaaccggg ttcagtggct tccatgtttt caccggctgg ctatctcacc 360
gaattgtatc gtgaagcgaa ggacttacat ttttcaagct ctgcttatca tcttgataat 420
cgccgtccgg atctggctga tctgactctg agccagagta atatggatac agaaatttcc 480
accctgacac tgtctaacga actgttgctg gagcatatta cccgcaagac cggaggtgat 540
tcggacgcat tgatggagag cctgtcaact taccgtcagg ccattgatac cccttaccat 600
cagccttacg agactatccg tcaggtcatt atgacccatg acagtacact gtcagcgctg 660
tcccgtaatc ctgaggtgat ggggcaggcg gaaggggctt cattactggc gattctggcc 720
aatatttctc cggagcttta taacattttg accgaagaga ttacggaaaa gaacgctgat 780
gctttatttg cgcaaaactt cagtgaaaat atcacgcccg aaaatttcgc gtcacaatca 840
tggatagcca agtattatgg tcttgaactt tctgaggtgc aaaaatacct cgggatgttg 900
cagaatggct attctgacag cacctctgct tatgtggata atatctcaac gggtttagtg 960
gtcaataatg aaagtaaact cgaagcttac aaaataacac gtgtaaaaac agatgattat 1020
gataaaaata taaattactt tgatttgatg tatgaaggaa ataatcagtt ctttatacgt 1080
gctaatttta aggtatcaag agaatttggg gctactctta gaaaaaacgc agggccaagt 1140
ggcattgtcg gcagcctttc cggtcctcta atagccaata cgaattttaa aagtaattat 1200
ctaagtaaca tatctgattc tgaatacaaa aacggtgtaa agatatacgc ctatcgctat 1260
acgtcttcca ccagcgccac aaatcagggc ggcggaatat tcacttttga gtcttatccc 1320
ctgactatat ttgcgctcaa actgaataaa gccattcgct tgtgcctgac tagcgggctt 1380
tcaccgaatg aactgcaaac tatcgtacgc agtgacaatg cacaaggcat catcaacgac 1440
tccgttctga ccaaagtttt ctatactctg ttctacagtc accgttatgc actgagcttt 1500
gatgatgcac aggtactgaa cggatcggtc attaatcaat atgccgacga tgacagtgtc 1560
agtcatttta accgtctctt taatacaccg ccgctgaaag ggaaaatctt tgaagccgac 1620
ggcaacacgg tcagcattga tccggatgaa gagcaatcta cctttgcccg ttcagccctg 1680
atgcgtggtc tgggggtcaa cagtggtgaa ctgtatcagt taggcaaact ggcgggtgtg 1740
ctggacgccc aaaataccat cacactttct gtcttcgtta tctcttcact gtatcgcctc 1800
acgttactgg cccgtgtcca tcagctgacg gtcaatgaac tgtgtatgct ttatggtctt 1860
tcgccgttca atggcaaaac aacggcttct ttgtcttccg gggagttgcc acggctggtt 1920
atctggctgt atcaggtgac gcagtggctg actgaggcgg aaatcaccac tgaagcgatc 1980
tggttattat gtacgccaga gtttagcggg aatatttcac cggaaatcag taatctgctc 2040
aataacctcc gaccgagtat tagtgaagat atggcacaga gtcacaatcg ggagctgcag 2100
gctgaaattc tcgcgccgtt tattgctgca acgctgcatc tggcgtcacc ggatatggca 2160
cggtatatcc tgttgtggac cgataacctg cggccgggtg gcttagatat tgccgggttt 2220
atgacactgg tattgaaaga gtcgttaaat gccaatgaaa ccacccaatt ggtacaattc 2280
tgccatgtga tggcacagtt atcgctttcc gtacagacac tgcgcctcag tgaagcggag 2340
ctatccgtgc tggtcatctc cggattcgcc gtgctggggg caaaaaatca acctgccgga 2400
cagcacaata ttgatacgct attctcactc taccgattcc accagtggat taatgggctg 2460
ggcaatcccg gctctgacac gctggatatg ctgcgccagc agacactcac ggccgacaga 2520
ctggcctccg tgatggggct ggacatcagt atggtaacgc aggccatggt ttccgccggc 2580
gtgaaccagc ttcagtgttg gcaggatatc aacaccgtgt tgcagtggat agatgtggca 2640
tcagcactgc acacgatgcc gtcggttatc cgtacgctgg tgaatatccg ttacgtgact 2700
gcattaaaca aagccgagtc gaatctgcct tcctgggatg agtggcagac actggcagaa 2760
aatatggaag ccggactcag tacacaacag gctcagacgc tggcggatta taccgcggag 2820
cgcctgagta gcgtgctgtg caattggttt ctggcgaata tccagccaga aggggtgtcc 2880
ctgcacagcc gggatgacct gtacagctat ttcctgattg ataatcaggt ctcttctgcc 2940
ataaaaacca cccgactggc agaggccatt gccggtattc agctctacat caaccgggcg 3000
ctgaatcgga tagagcctaa tgcccgtgcc gatgtgtcaa cccgccagtt ttttaccgac 3060
tggacggtga ataaccgtta cagcacctgg ggcggggtgt cgcggctggt ttattatccg 3120
gaaaattaca ttgacccaac ccagcgtatc gggcagaccc ggatgatgga tgaactgctg 3180
gaaaatatca gccagagtaa acttagccgg gacacagtgg aggatgcctt taaaacttac 3240
ctgacccgct ttgaaaccgt ggcggatctg aaagttgtca gcgcctatca cgacaacgtc 3300
aacagcaaca ccggactgac ctggtttgtc ggccaaacgc gggagaacct gccggaatac 3360
tactggcgta acgtggatat atcacggatg caggcgggtg aactggccgc caatgcctgg 3420
aaagagtgga cgaagattga tacagcggtc aacccctaca aggatgcaat acgtccggtc 3480
atattcaggg aacgtttgca ccttatctgg gtagaaaaag aggaagtggc gaaaaatggt 3540
actgatccgg tggaaaccta tgaccgtttt actctgaaac tggcgtttct gcgtcatgat 3600
ggcagttgga gtgccccctg gtcttacgat atcacaacgc aggtggaggc ggtcactgac 3660
aaaaaacctg acactgaacg gctggcgctg gccgcatcag gctttcaggg cgaggacact 3720
ctgctggtgt ttgtctacaa aaccgggaag agttactcgg attttggcgg cagcaataaa 3780
aatgtggcag gcatgaccat ttacggcgat ggctccttca aaaagatgga gaacacagca 3840
ctcagccgtt acagccaact gaaaaatacc tttgatatca ttcatactca aggcaacgac 3900
ttggtaagaa aggccagcta tcgtttcgcg caggattttg aagtgcctgc ctcgttgaat 3960
atgggttctg ccatcggtga tgatagtctg acggtgatgg agaacgggaa tattccgcag 4020
ataaccagta aatactccag cgataacctt gctattacgc tacataacgc cgctttcact 4080
gtcagatatg atggcagtgg caatgtcatc agaaacaaac aaatcagcgc catgaaactg 4140
acgggggtgg atggaaagtc ccagtacggc aatgcattta tcatcgcaaa taccgttaaa 4200
cattatggcg gttactctga tctggggggg ccgatcaccg tttataataa aacgaaaaac 4260
tatattgcat cagttcaagg ccacttgatg aacgcagatt acactaggcg tttgattcta 4320
acaccagttg aaaataatta ttatgccaga ttgttcgagt ttccattttc tccaaacaca 4380
attttaaaca ccgttttcac ggttggtagc aataaaacca gtgattttaa aaagtgcagt 4440
tatgctgttg atggtaataa ttctcagggc ttccagatat ttagttccta tcaatcatcc 4500
ggctggctgg atattgatac aggcattaac aataccgata tcaaaattac ggtgatggct 4560
ggcagtaaaa cccacacctt tacggccagt gaccatattg cttccttgcc ggcaaacagt 4620
tttgatgcta tgccgtacac ctttaagcca ctggaaatcg atgcttcatc gttggccttt 4680
accaataata ttgctcctct ggatatcgtt tttgagacca aagccaaaga cgggcgagtg 4740
ctgggtaaga tcaagcaaac attatcggtg aaacgggtaa attataatcc ggaagatatt 4800
ctgtttctgc gtgaaactca ttcgggtgcc caatatatgc agctcggggt gtatcgtatt 4860
cgtcttaata ccctgctggc ttctcaactg gtatccagag caaacacggg cattgatact 4920
atcctgacaa tggaaaccca gcggttaccg gaacctccgt tgggagaagg cttctttgcc 4980
aactttgttc tgcctaaata tgaccctgct gaacatggcg atgagcggtg gtttaaaatc 5040
catattggga atgttggcgg taacacggga aggcagcctt attacagcgg aatgttatcc 5100
gatacgtcgg aaaccagtat gacactgttt gtcccttatg ccgaagggta ttacatgcat 5160
gaaggtgtca gattgggggt tggataccag aaaattacct atgacaacac ttgggaatct 5220
gctttctttt attttgatga gacaaaacag caatttgtat taattaacga tgctgatcat 5280
gattcaggaa tgacgcaaca ggggatcgtg aaaaatatca agaaatacaa aggatttttg 5340
aatgtttcta tcgcaacggg ctattccgcc ccgatggatt tcaatagtgc cagcgccctc 5400
tattactggg aattgttcta ttacaccccg atgatgtgct tccagcgttt gctacaggaa 5460
aaacaattcg acgaagccac acaatggata aactacgtct acaatcccgc cggctatatc 5520
gttaacggag aaatcgcccc ctggatctgg aactgccggc cgctggaaga gaccacctcc 5580
tggaatgcca atccgctgga tgccatcgat ccggatgccg tcgcccaaaa tgacccaatg 5640
cactacaaga ttgccacctt tatgcgcctg ttggatcaac ttattctgcg cggcgatatg 5700
gcctatcgag aactgacccg cgatgcgttg aatgaagcca aaatgtggta tgtgcgtact 5760
ttagaattgc tcggtgatga gccggaggat tacggtagcc aacagtgggc agcaccgtcc 5820
ctttccgggg cggcgagtca aaccgtgcag gcggcttatc agcaggatct tacgatgctg 5880
ggccgtggtg gggtttccaa gaatctccgt accgctaact cgttggtggg tttgttcctg 5940
ccggaatata acccggcgct caccgattac tggcaaaccc tgcgtttgcg cctgtttaac 6000
ctgcgccata atctttccat tgacggacag ccgttatcgc tggcgattta cgccgagcct 6060
accgatccga aagcgctgct caccagtatg gtacaggcct ctcagggcgg tagtgcagtg 6120
ctgcccggca cattgtcgtt ataccgcttc ccggtgatgc tggagcggac ccgcaatctg 6180
gtagcgcaat taacccagtt cggcacctct ctgctcagta tggcagagca tgatgatgcc 6240
gatgaactca ccacgctgct actacagcag ggtatggaac tggcgacaca gagcatccgt 6300
attcagcaac gaactgtcga tgaagtggat gctgatattg ctgtattggc agagagccgc 6360
cgcagtgcac aaaatcgtct ggaaaaatac cagcagctgt atgacgagga tatcaaccac 6420
ggagaacagc gggcaatgtc actgcttgat gcagcggcag gtcagtctct ggccgggcag 6480
gtgctttcaa tagcggaagg ggtggccgat ttagtgccaa acgtgttcgg tttagcttgt 6540
ggcggcagtc gttggggggc agcactgcgt gcttccgcct ccgtgatgtc gctttctgcc 6600
acagcttccc aatattccgc agacaaaatc agccgttcgg aagcctaccg ccgccgccgt 6660
caggagtggg aaattcagcg tgataatgct gacggtgaag tcaaacaaat ggatgcccag 6720
ttggaaagcc tgaaaatccg ccgcgaagca gcacagatgc aggtggaata tcaggagacc 6780
cagcaggccc atactcaggc tcagttagag ctgttacagc gtaaattcac aaacaaagcg 6840
ctttacagtt ggatgcgcgg caagctgagt gctatctatt accagttctt tgacctgacc 6900
cagtccttct gcctgatggc acaggaagcg ctgcgccgcg agctgaccga caacggtgtt 6960
acctttatcc ggggtggggc ctggaacggt acgactgcgg gtttgatggc gggtgaaacg 7020
ttgctgctga atctggcaga aatggaaaaa gtctggctgg agcgtgatga gcgggcactg 7080
gaagtgaccc gtaccgtctc gttggcacag ttctatcagg ccttatcatc agacaacttt 7140
aatctgaccg aaaaactcac gcaattcctg cgtgaaggga aaggcaacgt aggagcttcc 7200
ggcaatgaat taaaactcag taaccgtcag atagaagcct cagtgcgatt gtctgatttg 7260
aaaattttca gcgactaccc cgaaagcctt ggcaataccc gtcagttgaa acaggtgagt 7320
gtcaccttgc cggcgctggt tgggccgtat gaagatattc gggcggtgct gaattacggg 7380
ggcagcatcg tcatgccacg cggttgcagt gctattgctc tctcccacgg cgtgaatgac 7440
agtggtcaat ttatgctgga tttcaacgat tcccgttatc tgccgtttga aggtatttcc 7500
gtgaatgaca gcggcagcct gacgttgagt ttcccggatg cgactgatcg gcagaaagcg 7560
ctgctggaga gcctgagcga tatcattctg catatccgct ataccattcg ttct 7614
<210> 63
<211> 7515
<212> DNA
<213> 发光光杆状菌
<400> 63
atgcaaaact cattatcaag cactatcgat actatttgtc agaaactgca attaacttgt 60
ccggcggaaa ttgctttgta tccctttgat actttccggg aaaaaactcg gggaatggtt 120
aattgggggg aagcaaaacg gatttatgaa attgcacaag cggaacagga tagaaaccta 180
cttcatgaaa aacgtatttt tgcctatgct aatccgctgc tgaaaaacgc tgttcggttg 240
ggtacccggc aaatgttggg ttttatacaa ggttatagtg atctgtttgg taatcgtgct 300
gataactatg ccgcgccggg ctcggttgca tcgatgttct caccggcggc ttatttgacg 360
gaattgtacc gtgaagccaa aaacttgcat gacagcagct caatttatta cctagataaa 420
cgtcgcccgg atttagcaag cttaatgctc agccagaaaa atatggatga ggaaatttca 480
acgctggctc tctctaatga attgtgcctt gccgggatcg aaacaaaaac aggaaaatca 540
caagatgaag tgatggatat gttgtcaact tatcgtttaa gtggagagac accttatcat 600
cacgcttatg aaactgttcg tgaaatcgtt catgaacgtg atccaggatt tcgtcatttg 660
tcacaggcac ccattgttgc tgctaagctc gatcctgtga ctttgttggg tattagctcc 720
catatttcgc cagaactgta taacttgctg attgaggaga tcccggaaaa agatgaagcc 780
gcgcttgata cgctttataa aacaaacttt ggcgatatta ctactgctca gttaatgtcc 840
ccaagttatc tggcccggta ttatggcgtc tcaccggaag atattgccta cgtgacgact 900
tcattatcac atgttggata tagcagtgat attctggtta ttccgttggt cgatggtgtg 960
ggtaagatgg aagtagttcg tgttacccga acaccatcgg ataattatac cagtcagacg 1020
aattatattg agctgtatcc acagggtggc gacaattatt tgatcaaata caatctaagc 1080
aatagttttg gtttggatga tttttatctg caatataaag atggttccgc tgattggact 1140
gagattgccc ataatcccta tcctgatatg gtcataaatc aaaagtatga atcacaggcg 1200
acaatcaaac gtagtgactc tgacaatata ctcagtatag ggttacaaag atggcatagc 1260
ggtagttata attttgccgc cgccaatttt aaaattgacc aatactcccc gaaagctttc 1320
ctgcttaaaa tgaataaggc tattcggttg ctcaaagcta ccggcctctc ttttgctacg 1380
ttggagcgta ttgttgatag tgttaatagc accaaatcca tcacggttga ggtattaaac 1440
aaggtttatc gggtaaaatt ctatattgat cgttatggca tcagtgaaga gacagccgct 1500
attttggcta atattaatat ctctcagcaa gctgttggca atcagcttag ccagtttgag 1560
caactattta atcacccgcc gctcaatggt attcgctatg aaatcagtga ggacaactcc 1620
aaacatcttc ctaatcctga tctgaacctt aaaccagaca gtaccggtga tgatcaacgc 1680
aaggcggttt taaaacgcgc gtttcaggtt aacgccagtg agttgtatca gatgttattg 1740
atcactgatc gtaaagaaga cggtgttatc aaaaataact tagagaattt gtctgatctg 1800
tatttggtta gtttgctggc ccagattcat aacctgacta ttgctgaatt gaacattttg 1860
ttggtgattt gtggctatgg cgacaccaac atttatcaga ttaccgacga taatttagcc 1920
aaaatagtgg aaacattgtt gtggatcact caatggttga agacccaaaa atggacagtt 1980
accgacctgt ttctgatgac cacggccact tacagcacca ctttaacgcc agaaattagc 2040
aatctgacgg ctacgttgtc ttcaactttg catggcaaag agagtctgat tggggaagat 2100
ctgaaaagag caatggcgcc ttgcttcact tcggctttgc atttgacttc tcaagaagtt 2160
gcgtatgacc tgctgttgtg gatagaccag attcaaccgg cacaaataac tgttgatggg 2220
ttttgggaag aagtgcaaac aacaccaacc agcttgaagg tgattacctt tgctcaggtg 2280
ctggcacaat tgagcctgat ctatcgtcgt attgggttaa gtgaaacgga actgtcactg 2340
atcgtgactc aatcttctct gctagtggca ggcaaaagca tactggatca cggtctgtta 2400
accctgatgg ccttggaagg ttttcatacc tgggttaatg gcttggggca acatgcctcc 2460
ttgatattgg cggcgttgaa agacggagcc ttgacagtta ccgatgtagc acaagctatg 2520
aataaggagg aatctctcct acaaatggca gctaatcagg tggagaagga tctaacaaaa 2580
ctgaccagtt ggacacagat tgacgctatt ctgcaatggt tacagatgtc ttcggccttg 2640
gcggtttctc cactggatct ggcagggatg atggccctga aatatgggat agatcataac 2700
tatgctgcct ggcaagctgc ggcggctgcg ctgatggctg atcatgctaa tcaggcacag 2760
aaaaaactgg atgagacgtt cagtaaggca ttatgtaact attatattaa tgctgttgtc 2820
gatagtgctg ctggagtacg tgatcgtaac ggtttatata cctatttgct gattgataat 2880
caggtttctg ccgatgtgat cacttcacgt attgcagaag ctatcgccgg tattcaactg 2940
tacgttaacc gggctttaaa ccgagatgaa ggtcagcttg catcggacgt tagtacccgt 3000
cagttcttca ctgactggga acgttacaat aaacgttaca gtacttgggc tggtgtctct 3060
gaactggtct attatccaga aaactatgtt gatcccactc agcgcattgg gcaaaccaaa 3120
atgatggatg cgctgttgca atccatcaac cagagccagc taaatgcgga tacggtggaa 3180
gatgctttca aaacttattt gaccagcttt gagcaggtag caaatctgaa agtaattagt 3240
gcttaccacg ataatgtgaa tgtggatcaa ggattaactt attttatcgg tatcgaccaa 3300
gcagctccgg gtacgtatta ctggcgtagt gttgatcaca gcaaatgtga aaatggcaag 3360
tttgccgcta atgcttgggg tgagtggaat aaaattacct gtgctgtcaa tccttggaaa 3420
aatatcatcc gtccggttgt ttatatgtcc cgcttatatc tgctatggct ggagcagcaa 3480
tcaaagaaaa gtgatgatgg taaaaccacg atttatcaat ataacttaaa actggctcat 3540
attcgttacg acggtagttg gaatacacca tttacttttg atgtgacaga aaaggtaaaa 3600
aattacacgt cgagtactga tgctgctgaa tctttagggt tgtattgtac tggttatcaa 3660
ggggaagaca ctctattagt tatgttctat tcgatgcaga gtagttatag ctcctatacc 3720
gataataatg cgccggtcac tgggctatat attttcgctg atatgtcatc agacaatatg 3780
acgaatgcac aagcaactaa ctattggaat aacagttatc cgcaatttga tactgtgatg 3840
gcagatccgg atagcgacaa taaaaaagtc ataaccagaa gagttaataa ccgttatgcg 3900
gaggattatg aaattccttc ctctgtgaca agtaacagta attattcttg gggtgatcac 3960
agtttaacca tgctttatgg tggtagtgtt cctaatatta cttttgaatc ggcggcagaa 4020
gatttaaggc tatctaccaa tatggcattg agtattattc ataatggata tgcgggaacc 4080
cgccgtatac aatgtaatct tatgaaacaa tacgcttcat taggtgataa atttataatt 4140
tatgattcat catttgatga tgcaaaccgt tttaatctgg tgccattgtt taaattcgga 4200
aaagacgaga actcagatga tagtatttgt atatataatg aaaacccttc ctctgaagat 4260
aagaagtggt atttttcttc gaaagatgac aataaaacag cggattataa tggtggaact 4320
caatgtatag atgctggaac cagtaacaaa gatttttatt ataatctcca ggagattgaa 4380
gtaattagtg ttactggtgg gtattggtcg agttataaaa tatccaaccc gattaatatc 4440
aatacgggca ttgatagtgc taaagtaaaa gtcaccgtaa aagcgggtgg tgacgatcaa 4500
atctttactg ctgataatag tacctatgtt cctcagcaac cggcacccag ttttgaggag 4560
atgatttatc agttcaataa cctgacaata gattgtaaga atttaaattt catcgacaat 4620
caggcacata ttgagattga tttcaccgct acggcacaag atggccgatt cttgggtgca 4680
gaaactttta ttatcccggt aactaaaaaa gttctcggta ctgagaacgt gattgcgtta 4740
tatagcgaaa ataacggtgt tcaatatatg caaattggcg catatcgtac ccgtttgaat 4800
acgttattcg ctcaacagtt ggttagccgt gctaatcgtg gcattgatgc agtgctcagt 4860
atggaaactc agaatattca ggaaccgcaa ttaggagcgg gcacatatgt gcagcttgtg 4920
ttggataaat atgatgagtc tattcatggc actaataaaa gctttgctat tgaatatgtt 4980
gatatattta aagagaacga tagttttgtg atttatcaag gagaacttag cgaaacaagt 5040
caaactgttg tgaaagtttt cttatcctat tttatagagg cgactggaaa taagaaccac 5100
ttatgggtac gtgctaaata ccaaaaggaa acgactgata agatcttgtt cgaccgtact 5160
gatgagaaag atccgcacgg ttggtttctc agcgacgatc acaagacctt tagtggtctc 5220
tcttccgcac aggcattaaa gaacgacagt gaaccgatgg atttctctgg cgccaatgct 5280
ctctatttct gggaactgtt ctattacacg ccgatgatga tggctcatcg tttgttgcag 5340
gaacagaatt ttgatgcggc gaaccattgg ttccgttatg tctggagtcc atccggttat 5400
atcgttgatg gtaaaattgc tatctaccac tggaacgtgc gaccgctgga agaagacacc 5460
agttggaatg cacaacaact ggactccacc gatccagatg ctgtagccca agatgatccg 5520
atgcactaca aggtggctac ctttatggcg acgttggatc tgctaatggc ccgtggtgat 5580
gctgcttacc gccagttaga gcgtgatacg ttggctgaag ctaaaatgtg gtatacacag 5640
gcgcttaatc tgttgggtga tgagccacaa gtgatgctga gtacgacttg ggctaatcca 5700
acattgggta atgctgcttc aaaaaccaca cagcaggttc gtcagcaagt gcttacccag 5760
ttgcgtctca atagcagggt aaaaaccccg ttgctaggaa cagccaattc cctgaccgct 5820
ttattcctgc cgcaggaaaa tagcaagctc aaaggctact ggcggacact ggcgcagcgt 5880
atgtttaatt tacgtcataa tctgtcgatt gacggccagc cgctctcctt gccgctgtat 5940
gctaaaccgg ctgatccaaa agctttactg agtgcggcgg tttcagcttc tcaaggggga 6000
gccgacttgc cgaaggcgcc gctgactatt caccgcttcc ctcaaatgct agaaggggca 6060
cggggcttgg ttaaccagct tatacagttc ggtagttcac tattggggta cagtgagcgt 6120
caggatgcgg aagctatgag tcaactactg caaacccaag ccagcgagtt aatactgacc 6180
agtattcgta tgcaggataa ccaattggca gagctggatt cggaaaaaac cgccttgcaa 6240
gtctctttag ctggagtgca acaacggttt gacagctata gccaactgta tgaggagaac 6300
atcaacgcag gtgagcagcg agcgctggcg ttacgctcag aatctgctat tgagtctcag 6360
ggagcgcaga tttcccgtat ggcaggcgcg ggtgttgata tggcaccaaa tatcttcggc 6420
ctggctgatg gcggcatgca ttatggtgct attgcctatg ccatcgctga cggtattgag 6480
ttgagtgctt ctgccaagat ggttgatgcg gagaaagttg ctcagtcgga aatatatcgc 6540
cgtcgccgtc aagaatggaa aattcagcgt gacaacgcac aagcggagat taaccagtta 6600
aacgcgcaac tggaatcact gtctattcgc cgtgaagccg ctgaaatgca aaaagagtac 6660
ctgaaaaccc agcaagctca ggcgcaggca caacttactt tcttaagaag caaattcagt 6720
aatcaagcgt tatatagttg gttacgaggg cgtttgtcag gtatttattt ccagttctat 6780
gacttggccg tatcacgttg cctgatggca gagcaatcct atcaatggga agctaatgat 6840
aattccatta gctttgtcaa accgggtgca tggcaaggaa cttacgccgg cttattgtgt 6900
ggagaagctt tgatacaaaa tctggcacaa atggaagagg catatctgaa atgggaatct 6960
cgcgctttgg aagtagaacg cacggtttca ttggcagtgg tttatgattc actggaaggt 7020
aatgatcgtt ttaatttagc ggaacaaata cctgcattat tggataaggg ggagggaaca 7080
gcaggaacta aagaaaatgg gttatcattg gctaatgcta tcctgtcagc ttcggtcaaa 7140
ttgtccgact tgaaactggg aacggattat ccagacagta tcgttggtag caacaaggtt 7200
cgtcgtatta agcaaatcag tgtttcgcta cctgcattgg ttgggcctta tcaggatgtt 7260
caggctatgc tcagctatgg tggcagtact caattgccga aaggttgttc agcgttggct 7320
gtgtctcatg gtaccaatga tagtggtcag ttccagttgg atttcaatga cggcaaatac 7380
ctgccatttg aaggtattgc tcttgatgat cagggtacac tgaatcttca atttccgaat 7440
gctaccgaca agcagaaagc aatattgcaa actatgagcg atattatttt gcatattcgt 7500
tataccatcc gttaa 7515
<210> 64
<211> 7551
<212> DNA
<213> 发光光杆状菌
<400> 64
atgaacgagt ctgtaaaaga gatacctgat gtattaaaaa gccagtgtgg ttttaattgt 60
ctgacagata ttagccacag ctcttttaat gaatttcgcc agcaagtatc tgagcacctc 120
tcctggtccg aaacacacga cttatatcat gatgcacaac aggcacaaaa ggataatcgc 180
ctgtatgaag cgcgtattct caaacgcgcc aatccccaat tacaaaatgc ggtgcatctt 240
gccattctcg ctcccaatgc tgaactgata ggctataaca atcaatttag cggtagagcc 300
agtcaatatg ttgcgccggg taccgtttct tccatgttct cccccgccgc ttatttgact 360
gaactttatc gtgaagcacg caatttacac gcaagtgact ccgtttatta tctggatacc 420
cgccgcccag atctcaaatc aatggcgctc agtcagcaaa atatggatat agaattatcc 480
acactctctt tgtccaatga gctgttattg gaaagcatta aaactgaatc taaactggaa 540
aactatacta aagtgatgga aatgctctcc actttccgtc cttccggcgc aacgccttat 600
catgatgctt atgaaaatgt gcgtgaagtt atccagctac aagatcctgg acttgagcaa 660
ctcaatgcat caccggcaat tgccgggttg atgcatcaag cctccctatt gggtattaac 720
gcttcaatct cgcctgagct atttaatatt ctgacggagg agattaccga aggtaatgct 780
gaggaacttt ataagaaaaa ttttggtaat atcgaaccgg cctcattggc tatgccggaa 840
taccttaaac gttattataa tttaagcgat gaagaactta gtcagtttat tggtaaagcc 900
agcaattttg gtcaacagga atatagtaat aaccaactta ttactccggt agtcaacagc 960
agtgatggca cggttaaggt atatcggatc acccgcgaat atacaaccaa tgcttatcaa 1020
atggatgtgg agctatttcc cttcggtggt gagaattatc ggttagatta taaattcaaa 1080
aatttttata atgcctctta tttatccatc aagttaaatg ataaaagaga acttgttcga 1140
actgaaggcg ctcctcaagt caatatagaa tactccgcaa atatcacatt aaataccgct 1200
gatatcagtc aaccttttga aattggcctg acacgagtac ttccttccgg ttcttgggca 1260
tatgccgccg caaaatttac cgttgaagag tataaccaat actcttttct gctaaaactt 1320
aacaaggcta ttcgtctatc acgtgcgaca gaattgtcac ccacgattct ggaaggcatt 1380
gtgcgcagtg ttaatctaca actggatatc aacacagacg tattaggtaa agtttttctg 1440
actaaatatt atatgcagcg ttatgctatt catgctgaaa ctgccctgat actatgcaac 1500
gcgcctattt cacaacgttc atatgataat caacctagcc aatttgatcg cctgtttaat 1560
acgccattac tgaacggaca atatttttct accggcgatg aggagattga tttaaattca 1620
ggtagcaccg gcgattggcg aaaaaccata cttaagcgtg catttaatat tgatgatgtc 1680
tcgctcttcc gcctgcttaa aattaccgac catgataata aagatggaaa aattaaaaat 1740
aacctaaaga atctttccaa tttatatatt ggaaaattac tggcagatat tcatcaatta 1800
accattgatg aactggattt attactgatt gccgtaggtg aaggaaaaac taatttatcc 1860
gctatcagtg ataagcaatt ggctaccctg atcagaaaac tcaatactat taccagctgg 1920
ctacatacac agaagtggag tgtattccag ctatttatca tgacctccac cagctataac 1980
aaaacgctaa cgcctgaaat taagaatttg ctggataccg tctaccacgg tttacaaggt 2040
tttgataaag acaaagcaga tttgctacat gtcatggcgc cctatattgc ggccaccttg 2100
caattatcat cggaaaatgt cgcccactcg gtactccttt gggcagataa gttacagccc 2160
ggcgacggcg caatgacagc agaaaaattc tgggactggt tgaatactaa gtatacgccg 2220
ggttcatcgg aagccgtaga aacgcaggaa catatcgttc agtattgtca ggctctggca 2280
caattggaaa tggtttacca ttccaccggc atcaacgaaa acgccttccg tctatttgtg 2340
acaaaaccag agatgtttgg cgctgcaact ggagcagcgc ccgcgcatga tgccctttca 2400
ctgattatgc tgacacgttt tgcggattgg gtgaacgcac taggcgaaaa agcgtcctcg 2460
gtgctagcgg catttgaagc taactcgtta acggcagaac aactggctga tgccatgaat 2520
cttgatgcta atttgctgtt gcaagccagt attcaagcac aaaatcatca acatcttccc 2580
ccagtaactc cagaaaatgc gttctcctgt tggacatcta tcaatactat cctgcaatgg 2640
gttaatgtcg cacaacaatt gaatgtcgcc ccacagggcg tttccgcttt ggtcgggctg 2700
gattatattc aatcaatgaa agagacaccg acctatgccc agtgggaaaa cgcggcaggc 2760
gtattaaccg ccgggttgaa ttcacaacag gctaatacat tacacgcttt tctggatgaa 2820
tctcgcagtg ccgcattaag cacctactat atccgtcaag tcgccaaggc agcggcggct 2880
attaaaagcc gtgatgactt gtatcaatac ttactgattg ataatcaggt ttctgcggca 2940
ataaaaacca cccggatcgc cgaagccatt gccagtattc aactgtacgt caaccgggca 3000
ttggaaaatg tggaagaaaa tgccaattcg ggggttatca gccgccaatt ctttatcgac 3060
tgggacaaat acaataaacg ctacagcact tgggcgggtg tttctcaatt agtttactac 3120
ccggaaaact atattgatcc gaccatgcgt atcggacaaa ccaaaatgat ggacgcatta 3180
ctgcaatccg tcagccaaag ccaattaaac gccgataccg tcgaagatgc ctttatgtct 3240
tatctgacat cgtttgaaca agtggctaat cttaaagtta ttagcgcata tcacgataat 3300
attaataacg atcaagggct gacctatttt atcggactca gtgaaactga tgccggtgaa 3360
tattattggc gcagtgtcga tcacagtaaa ttcaacgacg gtaaattcgc ggctaatgcc 3420
tggagtgaat ggcataaaat tgattgtcca attaaccctt ataaaagcac tatccgtcca 3480
gtgatatata aatcccgcct gtatctgctc tggttggaac aaaaggagat caccaaacag 3540
acaggaaata gtaaagatgg ctatcaaact gaaacggatt atcgttatga actaaaattg 3600
gcgcatatcc gctatgatgg cacttggaat acgccaatca cctttgatgt caataaaaaa 3660
atatccgagc taaaactgga aaaaaataga gcgcccggac tctattgtgc cggttatcaa 3720
ggtgaagata cgttgctggt gatgttttat aaccaacaag acacactaga tagttataaa 3780
aacgcttcaa tgcaaggact atatatcttt gctgatatgg catccaaaga tatgacccca 3840
gaacagagca atgtttatcg ggataatagc tatcaacaat ttgataccaa taatgtcaga 3900
agagtgaata accgctatgc agaggattat gagattcctt cctcggtaag tagccgtaaa 3960
gactatggtt ggggagatta ttacctcagc atggtatata acggagatat tccaactatc 4020
aattacaaag ccgcatcaag tgatttaaaa atctatatct caccaaaatt aagaattatt 4080
cataatggat atgaaggaca gaagcgcaat caatgcaatc tgatgaataa atatggcaaa 4140
ctaggtgata aatttattgt ttatactagc ttgggggtca atccaaataa ctcgtcaaat 4200
aagctcatgt tttaccccgt ctatcaatat agcggaaaca ccagtggact caatcaaggg 4260
agactactat tccaccgtga caccacttat ccatctaaag tagaagcttg gattcctgga 4320
gcaaaacgtt ctctaaccaa ccaaaatgcc gccattggtg atgattatgc tacagactct 4380
ctgaataaac cggatgatct taagcaatat atctttatga ctgacagtaa agggactgct 4440
actgatgtct caggcccagt agagattaat actgcaattt ctccagcaaa agttcagata 4500
atagtcaaag cgggtggcaa ggagcaaact tttaccgcag ataaagatgt ctccattcag 4560
ccatcaccta gctttgatga aatgaattat caatttaatg cccttgaaat agacggttct 4620
ggtctgaatt ttattaacaa ctcagccagt attgatgtta cttttaccgc atttgcggag 4680
gatggccgca aactgggtta tgaaagtttc agtattcctg ttaccctcaa ggtaagtacc 4740
gataatgccc tgaccctgca ccataatgaa aatggtgcgc aatatatgca atggcaatcc 4800
tatcgtaccc gcctgaatac tctatttgcc cgccagttgg ttgcacgcgc caccaccgga 4860
atcgatacaa ttctgagtat ggaaactcag aatattcagg aaccgcagtt aggcaaaggt 4920
ttctatgcta cgttcgtgat acctccctat aacctatcaa ctcatggtga tgaacgttgg 4980
tttaagcttt atatcaaaca tgttgttgat aataattcac atattatcta ttcaggccag 5040
ctaacagata caaatataaa catcacatta tttattcctc ttgatgatgt cccattgaat 5100
caagattatc acgccaaggt ttatatgacc ttcaagaaat caccatcaga tggtacctgg 5160
tggggccctc actttgttag agatgataaa ggaatagtaa caataaaccc taaatccatt 5220
ttgacccatt ttgagagcgt caatgtcctg aataatatta gtagcgaacc aatggatttc 5280
agcggcgcta acagcctcta tttctgggaa ctgttctact ataccccgat gctggttgct 5340
caacgtttgc tgcatgaaca gaacttcgat gaagccaacc gttggctgaa atatgtctgg 5400
agtccatccg gttatattgt ccacggccag attcagaact accagtggaa cgtccgcccg 5460
ttactggaag acaccagttg gaacagtgat cctttggatt ccgtcgatcc tgacgcggta 5520
gcacagcacg atccaatgca ctacaaagtt tcaactttta tgcgtacctt ggatctattg 5580
atagcacgcg gcgaccatgc ttatcgccaa ctggaacgag atacactcaa cgaagcgaag 5640
atgtggtata tgcaagcgct gcatctatta ggtgacaaac cttatctacc gctgagtacg 5700
acatggagtg atccacgact agacagagcc gcggatatca ctacccaaaa tgctcacgac 5760
agcgcaatag tcgctctgcg gcagaatata cctacaccgg cacctttatc attgcgcagc 5820
gctaataccc tgactgatct cttcctgccg caaatcaatg aagtgatgat gaattactgg 5880
cagacattag ctcagagagt atacaatctg cgtcataacc tctctatcga cggccagccg 5940
ttatatctgc caatctatgc cacaccggcc gatccgaaag cgttactcag cgccgccgtt 6000
gccacttctc aaggtggagg caagctaccg gaatcattta tgtccctgtg gcgtttcccg 6060
cacatgctgg aaaatgcgcg cggcatggtt agccagctca cccagttcgg ctccacgtta 6120
caaaatatta tcgaacgtca ggacgcggaa gcgctcaatg cgttattaca aaatcaggcc 6180
gccgagctga tattgactaa cctgagcatt caggacaaaa ccattgaaga attggatgcc 6240
gagaaaacgg tgttggaaaa atccaaagcg ggagcacaat cgcgctttga tagctacggc 6300
aaactgtacg atgagaatat caacgccggt gaaaaccaag ccatgacgct acgagcgtcc 6360
gccgccgggc ttaccacggc agttcaggca tcccgtctgg ccggtgcggc ggctgatctg 6420
gtgcctaaca tcttcggctt tgccggtggc ggcagccgtt ggggggctat cgctgaggcg 6480
acaggttatg tgatggaatt ctccgcgaat gttatgaaca ccgaagcgga taaaattagc 6540
caatctgaaa cctaccgtcg tcgccgtcag gagtgggaga tccagcggaa taatgccgaa 6600
gcggaattga agcaaatcga tgctcagctc aaatcactcg ctgtacgccg cgaagccgcc 6660
gtattgcaga aaaccagtct gaaaacccaa caagaacaga cccaatctca attggccttc 6720
ctgcaacgta agttcagcaa tcaggcgtta tacaactggc tgcgtggtcg actggcggcg 6780
atttacttcc agttctacga tttggccgtc gcgcgttgcc tgatggcaga acaagcttac 6840
cgttgggaac tcaatgatga ctctgcccgc ttcattaaac cgggcgcctg gcagggaacc 6900
tatgccggtc tgcttgcagg tgaaaccttg atgctgagtc tggcacaaat ggaagacgct 6960
catctgaaac gcgataaacg cgcattagag gttgaacgca cagtatcgct ggccgaagtt 7020
tatgcaggat taccaaaaga taacggtcca ttttccctgg ctcaggaaat tgacaagctg 7080
gtgagtcaag gttcaggcag tgccggcagt ggtaataata atttggcgtt cggcgccggc 7140
acggacacta aaacctcttt gcaggcatca gtttcattcg ctgatttgaa aattcgtgaa 7200
gattacccgg catcgcttgg caaaattcga cgtatcaaac agatcagcgt cactttgccc 7260
gcgctactgg gaccgtatca ggatgtacag gcaatattgt cttacggcga taaagccgga 7320
ttagctaacg gctgtgaagc gctggcagtt tctcacggta tgaatgacag cggccaattc 7380
cagctcgatt tcaacgatgg caaattcctg ccattcgaag gcatcgccat tgatcaaggc 7440
acgctgacac tgagcttccc aaatgcatct atgccggaga aaggtaaaca agccactatg 7500
ttaaaaaccc tgaacgatat cattttgcat attcgctaca ccattaaata a 7551
<210> 65
<211> 7500
<212> DNA
<213> 发光光杆状菌
<400> 65
atgaacacac tcaaatccga atatcaacaa gcgttaggag caggttttaa taatctaacc 60
gatatctgcc atctctcttt tgacgaactg cgcaaaaaag tgaaggataa actctcatgg 120
tcacagaccc agagcttata tcttgaagca cagcaggtgc aaaaggataa ccttctgcat 180
gaagcccgta ttctgaaacg cgcaaaccct catttacaaa gtgcggtcca tcttgccctg 240
acagcacctc atgcagacca gcaaggttat aatagccgat ttggcaatcg cgccagcaaa 300
tatgcagccc ctggcgcaat ttcctccatg ttttctcttg cggcttatct gactgaactt 360
tatcgtcagg cacgaaattt acatgcagaa ggttccattt atcatctgga tacgcgtcgc 420
ccagatctaa aatcattggt gctcagccag aaaaatatga atacggagat ttccacgctt 480
tctctgtcta ataacatgtt gctaaacagt attaagactc agcctaatct gaacagccac 540
gctaaagtga tggaaaagtt atcaactttc cgcacttctg gctcaatgcc atatcacgat 600
gcttatgaaa gtgtacgtaa gattattcaa ttacaagctc ctgtgtttga acaatccagt 660
acattaacag atacgccaat caccaaactg atgtatcaaa tctccttgct ggggattaat 720
gcctctgtct caccggagct gtttactatt ctgacgcaaa agataaaacc agcaaccaat 780
gctgataaca ctaatgaact aaaaaaactt tataagaaga attttggtga aattaaatct 840
attcaaatgg caagggcaga atacctgaaa agttattata atctgacaga caaagaactt 900
aaccagttta gtaaaaagat taaacaaata gatagcctgt ggaatatagg agacgagatt 960
acccaatacc atctattgaa attcaataaa gctattaatc tatctcgatc aaccgagcta 1020
tcaccaataa tccttaacag cattgccatc gatatcctta aaaaaacacc tccagaggat 1080
gactctgaca acccttttag ggacgaccct gattaccttg aaagctttca agaccttgac 1140
cttagtgacg aaccagatat agacgaagat gtattaagag aagctttacg tgttaaagac 1200
tatatgcaac gttatggtat tgatgctgag actgcattaa tactgtgcaa agcacccatt 1260
tcagaaaatc cttctcatcc cgatctatcc aaattactag cagacatcca tcaattaact 1320
attgatgaat taggggtact actggttgcc atagatgaag gaaaaaccga tttatctcag 1380
attactcatg acaatttagc ggttctaatt agcaaactct attccgttac caattggctg 1440
cgtacacgga aatggagtgt atatcagtta tttgtaatga cgaccgataa atataacaaa 1500
accttaaccc cggaaataaa caaccttctg gataccgtct acaatggctt gcagaacttt 1560
tacaaggata atttgctaaa aataaaagat aatctattga aagccaaaga aagtttacca 1620
gaagacaaag ataatdtgcc gaaagcmgag caakatstgt tggaagccga gaaatatctg 1680
ctagcagccg agaaatatct gctagcagcc gagaaatatc tattggaagc caataaaaat 1740
ccgctagaag ccaaaaaggc tctgaaagaa tacgagaaaa atcaggaggc atacgagaaa 1800
aatctgaaag aacacgagaa atatctgttg aaagccggag aaaatctgcc agcaatcaaa 1860
gagaatttgc taaaaatcaa ggaaaatctg ccaaaagcca tatctcctta tatcgccgcc 1920
gctctgcaat tgccatctga gaatgttgct ctctccgtgc tggcttgggc agataaacta 1980
aactctggca aagaaaacaa aatgacggca gattcattct ggaactggtt acggaaaaaa 2040
cccattgaaa ctcaatcgaa aacaactgaa gcaactgaag caactgaagc aactgaagca 2100
actgaagcaa ctgaagcaac tgaaaaaact acactaattc aacaagctgt ccaatattgc 2160
cagtgcctag cacaactggc gctgatttat cgctctaccg gtcttagcga aagcacttta 2220
cgtctgtttg tgacaaatcc acaaatcttt ggtcttaccg cgaaaacaac gtcaacacac 2280
aatgtattat cactgattat gctgacgcgt tttactgact gggttaactc actaggtgaa 2340
aacgcctctt ctgtactgac cgagtttgaa aaaggaacat taacggcaga actattggct 2400
aacgccatga atcttgataa aaatctacta gagcaagcca gtactcaagc acaagctgat 2460
ttctccaatt ggccatctat cgacaaccta ttgcagtgga ttaacatctc gcgtcaattg 2520
aacatctcgc cacaaggcgt ttctgaactg gcgaaaatat tagacataga atcttctact 2580
aattatgccc aatgggaaaa tgtcgcttca atattaaccg ccggactaga tacccaaaaa 2640
gccaataccc tacatgcatt tctgggtgag tctcgcagta ctgcgttaag tacatactat 2700
atttattctc ataaccaaaa agatcgagaa gaaagaaaac atacggtaat taaagaccgt 2760
gatgatctat atcaatacct gttgatcgat aaccaagtct ccgccgccat caaaaccacg 2820
gagattgctg aagctatcgc tagtatccaa ctgtatatta accgcgcatt gaaaaatatg 2880
gagggagata ccgacacaag tgtcactagc cgtttattct tcactaactg ggataaatac 2940
aacaaacgct acagcacctg ggctggtatt actaagctcc tttactaccc tgaaaactat 3000
atcgatccga cactgcggat cggccagaca aaaatgatgg atacgctact gcaatccatc 3060
agccaaagtc aattgaatac cgataccgta gaagatgcct ttaaatctta tctaacgtca 3120
ttcgaacaag tggctaatct ggaagtcatc agcgcctatc atgacaatat taataatgac 3180
caaggattga cctattttat cggacgcagt aaaacagaag tgaatcaata ttattggcgc 3240
agtgtagatc acaataaatt cagcgaaggt aaattccccg ctaatgcctg gagcgagtgg 3300
cacaaaattg actgcccaat taatccctac gaagatacta tccgcccggt agtctaccaa 3360
tcccgcctgt atattatctg gctggaacag aagaaggtaa ctaatcgagc agaaggagaa 3420
gctatcaaac aaggaagcaa aacgaccaca agctatcatt atgaactgaa attggcacat 3480
attcgttatg acggcacctg gaatacacca attacctttg atgtagatga aaaaatatct 3540
ggtctaaatt tagaactgaa taaagcgtta gggctctatt gtgcaagtta tcaaggcaaa 3600
gataaattgc tggttatgtt ttataaaaaa caggagcaat taaataatta cacagaaaaa 3660
acaggaaaca catacacagc accaataaaa gggctatata tcacttccaa tatgtctcct 3720
gaggaaatga cacccgaaag ttacagactt aatgctcata aacagtttga taccaacaat 3780
gtcgtaagag tcaataaccg ctatgcagaa agctacgaaa tcccttcatc agtaaacagt 3840
aataatggtt atgattgggg agagggctat ctgagtatgg tatacggcgg gagcattctg 3900
attacccgtg acccaagcga taactcaaaa atccaaatct caccaaagtt aagaattatt 3960
cataatggat atgaaggtcg acaacgtaat caatgcaatt tgatgaagaa atacggcaag 4020
ctcggtgata aattcattat ttatactacg ctaggtatta accccaataa tttatcaaat 4080
aaaaaactta tctaccctgt ttatcaatat gaaggaaatg aaagtaagct tagtcaagga 4140
agacttctgt tttatcggga tagcaccact aactttacaa gagcctggtt ccctaacctt 4200
tcttctgact caaaagaaat gtccataacc actggcggta acattagtgg taattatggt 4260
tatattgata acaaacatag tgacaacaaa ccattcgaag aatatttcta tatggacgac 4320
cacggcggta ttgacactga cgtttcggag ccaatattta ttaatacaaa aattcagcct 4380
tcaaatgtta aaatcatagt gaaaacagtg aaggatgatg gaaaattaga cagtaaacca 4440
tatatagcag aagacaaagt ttcagttaaa ccgacaccaa actttgaaga aatgtgttat 4500
cagtttaata atctcgatca aatagatgtc tccactctag tatttaaaaa taatgaagca 4560
agtattgata tcacctttac agcatctgct gacgcatttg aaagtggtaa agaacaacgt 4620
aatctaggtg aagaacattt cagtattcgt attatcaaaa aagcgaatgt taatgatgtc 4680
ctgacccttc accacgatcc aagtggggca caatatatgc aatggggagc ctatcgtact 4740
cgccttaata ccctgtttgc ccgtaaatta attagccgcg ccaatgcggg gatcgacact 4800
attttgagta tggaaactca gaatattcaa gagccacaat taggcaaagg cttttatgtt 4860
aatttcactc ttcctaaata tgatcaaaac acacatggta atgaacgcca gtttaaaatt 4920
catataggga atattgctgg tgataataca atgcggccat attaccaagg aatattggct 4980
gacaccgaaa ccagtgtcgt tctttttgtc ccttatgaga aacaatctta taccaatgaa 5040
ggtgttagat taggagttga atacaaaaaa gtatcttacc taggcgtctg ggaacccgct 5100
ttcttctatt tcaatgaaat tcaacagaag tttattctga ttaatgatgc cgatcataac 5160
tcagcaatga ctcaatctgg tgaaaaaaca ggaattaaaa aatacaaagg ctttcttgac 5220
gtttctattc ttatcgatca tcagcacaca gaaccaatgg acttcaacgg cgccaacagc 5280
ctctacttct gggaactgtt ctactatacc ccgatgctga tcgctcaacg tttgctacac 5340
gagcaaaatt tcgatgaagc taaccgttgg ctgaaatatg tctggaatcc atctggtcat 5400
attgccaatg gtcaaaaaca gcacccccac aactggaatg tccgcccatt acaagaggac 5460
accagttgga acgatgatcc attggataca tttgatcccg atgccatcgc tcaacatgat 5520
ccgatgcact acaaagtcgc cacctttatg tgcgcccttg atctattgat cgaacaggga 5580
gattacgcct atcgccagtt ggaacgggac acactcgccg aagccaaaat gtggtatatg 5640
caggcactgc atctattagg cgataaacct catttattac tcagttcaac atggagtgat 5700
ccagagctaa aagaagccgc agatcttgaa aaacaacagg cacatgccaa agcaatagca 5760
gatttacgac aaggccagcc taaagatgga agcaacacag atcttttcct gccacaggtc 5820
aacgaagtga tgttgagcta ttggcagaaa ctggaacaac ggttatataa cctgcgccat 5880
aacctctcta ttgatggtca acctttacat ttgcctattt tcgcgacacc ggcagatcca 5940
aaagcgctgc tcagcgccgc tgtcgccagt tcacaaggtg gaagcaatct tccgtcagag 6000
tttatatcag tttggcgttt cccacatatg ctggaaaacg cccgcagtat ggtcagtcaa 6060
ctcacccaat tcggctccac attacaaaat attatcgaac gtcaggatgc ggaagcatta 6120
aacacgctgt tacagaatca agcggcggaa ctgatattga ccaatctcag catacaggac 6180
aaaaccattg aagagctgga tgttgaaaaa actgtgctag aaaaaacccg cgccggagct 6240
aaatcgcgtt ttgatagcta cagcaaattc tacgatgaag atatcaacgc aggtgaaaaa 6300
caggcgatgg cgttgcgagc ctccgtcgca ggcatctcta ctgcacttca agcatcacat 6360
ctggcaggcg cagcgcttga tttggctccc aacatctttg gcttcgctga tggcggtagc 6420
cattggggag caattgccca agccacaagt aatgtcatgg aattctccgc cagtgtcatg 6480
agcaccgaag cggataaaat cagccagtct gaagcctacc gtcggcgtcg acaggagtgg 6540
aaaatccagc gtaacaacgc tgatgcagag ttgaaacaaa tcgatgctca acttcaatca 6600
ttagtcgtac gccgtgaagc cgccgtgttg cagaaaacca gcctgaaaac ccaacaagag 6660
cagacgcacg cacaactgac cttcctgcaa cataagttca gcaatcaggc attatacaac 6720
tggctgcgtg gtcggctgtc cgccatttac ttccagttct atgatttagc ggtagcccgt 6780
tgcctgatgg ctgaaatggc ctatcgttgg gagactaacg atgccgcagc acgctttatc 6840
aagcccggcg cctggcaggg aacccatgcc ggtctgctgg cgggtgaaac cttaatgctg 6900
aatctagcac agatggaaga tgcccacctg aaacaggagc aacgcgtact ggaggtagaa 6960
cgtaccgttt cactagcaga agtttataaa gagaaaggtc aattttctct gaccaagaaa 7020
attgcagaac tggtgaataa gaaaccagac actaccagta gcagaaataa cacactgaat 7080
tttggtgaag gaaatgccaa aacttctcta caagcgtcta tttcgttagc tgacttacaa 7140
attcgtcacg attacccaga aaacagtgga gccggtaacg tccgccggat taaacagatc 7200
agtgtcaccc tgccggcact gttaggacct tatcaggatg tgcaagcgat tctgtcttat 7260
ggcggagatg ccaccgggtt agccaaaggt tgtaaagcgc tggcagtttc tcacggaatg 7320
aatgacagcg gtcagttcca attggatttc aacgatggca aattcctgcc atttgaagga 7380
atcgaaatcg ataaaggtac gctgacatta agcttcccga atgcaaccga aaaacaaaaa 7440
accatgctgg agagtataag cgacatcatt ctgcatattc gctacaccat tcgccaataa 7500
<210> 66
<211> 7146
<212> DNA
<213> 发光光杆状菌
<400> 66
atgaactcat acgtgaaaga gatacctgat gtattacaaa gccaatatgg tattaattgt 60
ctgacagata tttgccacta ttcttttaat gaatttcgtc agcaagtctc tgatcatctc 120
tcctggtcag agaccaaccg cttatatcgt gatgcacaac aggaacaaaa agagaatcaa 180
ttatatgaag ctcgtattct taaacgcgct aacccgcagt tgcaaaatgc agtgcacctc 240
ggtattaccc tccctcatgc tgaattacga ggctataata gtgaattcgg tggccgagcc 300
agccaatatg tggcgccggg ttcggtttcc tctatgttct cccccgcagc ttatttaact 360
gaactctatc gtgaagcacg taatttacat gccagcgact ccgtttatca tctggatgaa 420
cgccgcccag acctccaatc aatgacgctc agccagcaaa atatggatac cgaactttcc 480
actctttctc tgtctaatga aattttgttg aaaggaatta aagctaatca gtctaatctg 540
gacagcgata ctaaggtgat ggaaatgtta tccactttcc gtccttccgg cacgatacct 600
tatcatgatg cttacgaaaa tgtacgtaaa gctatccaat tacaagatcc gaaacttgaa 660
caatttcaga aatcaccggc ggtcgccgga ttaatgcatc aagcttcatt attaggaatt 720
aataactcta tctcaccaga actgtttaat attctgacag aagagattac cgaagctaac 780
gcagaggcaa tttataaaca gaattttggc gatattgacc ctgcctgcct ggcaatgccg 840
gaatatctga aaagttatta taattttagt gatgaagaac tcagtcaatt tattcgcaaa 900
tatccagata atgaactaaa tactcagaaa atacatttac taaaaatcaa taaaattatt 960
ttattatcgc aagccgtgaa tctgccgttt ttaaagttag atgaaattat tccagaacag 1020
aacattaccc cgacagtatt agggaaaatc tttctagtta aatattatat gcagaaatac 1080
aatattggta cggaaactgc cttaatatta tgtaatgatt ccatttcaca atactcctat 1140
agtaatcaac ctagccaatt tgatcgccta tttaatacct cgccactcaa tggacaatat 1200
ttcgttatcg aagacactaa tattgaccta agtctgaaca gtaccgataa ctggcacaaa 1260
gcagtactta aacgtgcttt taatgtcgat gatatttccc tctatcgttt actccatatt 1320
gccaatcata acaataccga tggaaaaatt gctaataata taaaaaatct ttccaatctt 1380
tatatgacta aactactggc agatattcat caattaacga ttgatgaact gtatttacta 1440
ctgataacta ttggtgaaga taaaataaat ttatatgata ttgatgataa agagctggag 1500
aaactcataa acagactcga taccctaagc aattggctgc atacacaaaa gtggagtatc 1560
tatcagttat ttttgatgac caccaccaac tatgacaaaa cactaacgcc tgaaattcaa 1620
aacttactcg atacggtcta caatggctta cagaacttcg ataaaaataa aaccaaactt 1680
ctggcagcca tcgcgcctta tattgctgca acactacaat taccatctga aaatgtcgca 1740
cattctattc ttctctgggc tgataagata aaaccaagcg aaaataaaat aacggcagaa 1800
aaattttgga tctggttaca aaatagagat actacagaat tgtcaaaacc gccagaaatg 1860
caggaacaaa ttattcagta ctgccactgt ctggcacaat tgacaatgat ttatcgttct 1920
tccggcatta atgaaaacgc tttccgtcta tttatcgaaa agccaactat ttttggcatc 1980
cctgatgaac cgaataaagc gacaccagcc cataatgcac caacattaat catcctaacc 2040
cgctttgcca attgggttaa ttctctaggt gaaaaagcct cccctattct aacggctttt 2100
gaaaataaaa ccttaactgc ggaaaaatta gctaacgcca tgaatcttga tgctaattta 2160
ctggaacaag ccagtattca agcacaaaat tataagcagg ttactaaaga aaatacattc 2220
tccaattggc aatccatcga cattattctg caatggacta atatagccag taatttaaat 2280
atctccccac aaggtatttc ccctctaata gcattggatt atataaaacc ggctcaaaaa 2340
acaccgactt atgcccaatg ggaaaatgca gctatagcat taactgccgg gttagacact 2400
caacaaactc atactctaca cgtatttctg gacgaatctc gcagtaccgc attaagcaac 2460
tattatattg gcaaggttgc taatcgggca gcatcaatta aaagccgtga cgatttatac 2520
caatacttac tgattgataa tcaagtttcc gctgaaataa aaactacacg tattgccgaa 2580
gccattgcca gtatccaatt gtatgtcaac cgagcgctgg aaaatataga aatccatgcc 2640
gtttctgatg ttattacccg tcaatttttt atcgattggg ataaatataa taaacgttac 2700
agtacttggg ctggcgtttc acaattagtt tactatcccg aaaattatat cgacccgacg 2760
atgcgtatcg gacaaacgaa aatgatggat acgttattgc aatccgtcag ccagagccaa 2820
ttaaatgccg atacggtaga agatgcattt aaatcttacc tgacctcgtt tgaacaagtc 2880
gctaatttgg aagtcattag tgcttatcat gataacgtta ataatgacca aggactgacc 2940
tattttatcg ggaacagcaa aacagaagtt aatcaatatt actggcgcag cgtcgatcac 3000
agtaaattca acgacggtaa attcgctgct aatgcctgga gtgaatggca caaaattgac 3060
tgcgcaatta atccctacca aagcaccatt cgcccagtta tctataaatc ccgattatat 3120
ctgatttggc tggaacaaaa agaaacagct aaacaaaagg aggataataa agtcactaca 3180
gactatcact atgaattaaa attggctcat attcgttatg atggtacctg gaatgtgcca 3240
atcacctttg atgtagatga aaaaatacta gctttagaac tgacaaaatc tcaagcacct 3300
ggactctatt gcgcaggtta tcaaggggaa gatacactat taatcatgtt ttatagaaaa 3360
aaagagaaat tggatgatta taaaactgca ccaatgcaag gattttatat tttctccgat 3420
atgtcttcca aagatatgac caatgaacaa tgcaattctt atcgagataa cggttataca 3480
catttcgata ctaattctga tactaatagc gtcataagaa taaataatcg ctatgcagag 3540
gattatgaaa ttccttcatt gatcaatcat agcaatagcc atgattgggg ggaatataat 3600
cttagccagg tatatggcgg aaatatagtt atcaattaca aagttacatc aaatgatttg 3660
aaaatctata tttcaccaaa attaagaata atccatgatg gaaaagaagg tcgagagcgc 3720
attcagtcta atctaataaa gaaatacggc aaattgggtg ataaattcat tatttatact 3780
agtttgggaa tcaatccgaa taattcatca aatagattca tgttttaccc agtctatcaa 3840
tataatggaa acactagcgg ccttgctcaa gggagactat tattccatcg agacacgagt 3900
tattcatcta aagtagcggc ttggattcct ggggcaggac gttctttaat caatgaaaat 3960
gctaacatcg gtgatgattg tgctgaagat tctgtgaata aaccggatga tcttaagcaa 4020
tacatctata tgactgacag taaagggact gctactgatg tttccgggcc agtagatatc 4080
aacacagcaa tttcttctga aaaggttcaa atcacaatta aagctggcaa agaatactct 4140
cttacagcga ataaagatgt ctccgttcag ccatcaccta gctttgaaga aatgtgttac 4200
caatttaatg ctctcgaaat agatggctct aatctgaatt ttactaacaa ttcagccagt 4260
attgatgtca cttttaccgc actggcagat gatggacgca aattgggtta tgaaattttc 4320
aatatccctg ttattcaaaa ggttaaaacc gataatgctc taactctttt tcatgacgag 4380
aatggcgctc aatatatgca atggggagcc tatcgcattc gccttaatac gctatttgct 4440
cgccaattag ttgaacgagc taatactggt attgatacaa ttctaagtat ggaaactcag 4500
aatattcagg aaccgatgat gggaataggc gcttatatag aactcatttt ggataaatat 4560
aatcctgata tccacggcac taataaatca tttaagatta tatatggtga tatttttaaa 4620
gcaggtgatc attttcctat ttatcaggga gcattaagcg atattacaca aacaacagta 4680
aaattattct tacctcgcgt tgataacgct tatggaaata aaaacaatct ctatgtttac 4740
gcggcctatc aaaaagtgga aacaaatttc attcgattcg ttaaagagga taataataaa 4800
cccgctacat tcgacactac ctataagaat gggaccttcc cagggcttgc atcagccaga 4860
gtaatacaaa ctgtctcgga accaatggat ttcagcggcg ctaatagtct ctacttctgg 4920
gaactgttct actatacccc gatgatggtt gctcaacgtt tgctacatga acaaaacttt 4980
gatgaagcca accgttggct aaaatatgtc tggagcccat ccggttatat tgttcgaggt 5040
caaattaaaa actaccactg gaatgtgcgc ccattactgg aaaacaccag ttggaacagt 5100
gatcctttgg attccgtcga tcctgacgca gtggcacagc acgatccaat gcactataaa 5160
gtagccacct ttatgcgtac tctcgatcta ctgatggcac gcggcgatca cgcctatcgc 5220
caacttgagc gggatacgct gaacgaagcc aaaatgtggt atatgcaagc actgcacctg 5280
ttgggcaata aaccctatct gcctctgagt tctgtatgga atgatccacg tctggacaat 5340
gccgcagcca ctaccacaca aaaagcacac gcctacgcaa taacctctct acggcaaggc 5400
acgcaaacac cagcattatt attgcgctcc gctaataccc tgaccgatct tttcctgcca 5460
caaatcaacg acgttatgtt gagctactgg aacaaactgg aactgcgtct gtataactta 5520
cgtcataatc tctctatcga tggtcagcct ctccacctac cgatttacgc cacaccggcc 5580
gatccgaaag cgttactcag cgccgccgtt gctacttctc aaggcggcgg caaactacca 5640
gagtcattta tatcactgtg gcgcttcccg catatgttgg aaaatgcccg tagtatggtc 5700
actcagctaa tacagttcgg ctccacgttg caaaatatta ttgaacgcca agatgctgaa 5760
tccttaaatg ctctgctgca aaatcaagcc aaagagttga ttttgacaac gctcagcatt 5820
caagacaaaa ccatcgaaga aatagatgct gaaaaaactg tgctggaaaa atccaaagcc 5880
ggagcaaaat cgcgctttga caactacagc aaattatatg acgaagatgt caacgccggt 5940
gagcgtcaag ctctggatat gcgaatagct tcccaaagta ttacctcagg attgaaaggc 6000
ttgcacatgg ctgccgccgc actggagatg gtgcccaata tctacggctt tgcagtcggg 6060
gggacgcgct atggagcaat tgccaatgcc attgcgattg gtggcggtat cgccgcagaa 6120
ggtttgttaa ttgaagcaga gaaagtctcg caatctgaaa tatggcgccg tcgccgtcaa 6180
gagtgggaaa tccagcgtaa taatgccgaa gcagagatga aacaaatcga tgctcaactt 6240
aaatcactaa cggtacgccg tgaagcggcg gtattacaga aaaccggcct aaaaacccaa 6300
caggaacaaa ctcaagcgca actagctttc ctgcaacgaa aattcagcaa tcaagcgctg 6360
tataattggt tacgtggtcg gttagcagcc atttatttcc aattttacga tttagtcgtc 6420
gcccgttgtt tgatggcaga acaagcttac cgttgggaaa ctaatgatag ctctgcacgc 6480
tttattaaac cgggagcctg gcagggaacc tatgccggcc tgctcgccgg agaaacccta 6540
atgttgaacc tggcgcaaat ggaagacgcg cacctgaaac aagagcaacg cgcactggaa 6600
gtggaacgca cggtttctct ggcgcaggtg taccaatcct taggggagaa aagctttgca 6660
ttaaaagata aaattgaagc gttgctacaa ggagataaag agacttccgc cggtaacgac 6720
ggcaatcaat tgaaattaac caacaatacg ctatccgcga cgctaaccct gcaagatctg 6780
aaactcaaag atgactaccc ggaagagatg cagttaggta aaacacgccg cattaaacaa 6840
attagcgtct ccttaccggc attattggga ccgtatcaag atgttcaggc tgtcctgtct 6900
tatggtggcg atgccaccgg gctagctaaa ggttgtaaag ccttggcggt ctcccacggc 6960
ctgaatgaca acggtcagtt tcagctcgat tttaacgatg gcaaattcct gccgtttgaa 7020
gggatcgata ttaatgacaa agggacattc acgctaagtt tccccaatgc cgccagtaaa 7080
caaaaaaata tattacagat gctgaccgat attattctgc acattcgtta cactattctc 7140
gaataa 7146
<210> 67
<211> 15067
<212> DNA
<213> 人工序列
<220>
<223> 8836″BCA″三联融合多核苷酸序列
<400> 67
tctagactga gtcgacgcac tactagtaac aaagaaggag atataccatg caaaattcac 60
aagattttag tattacggaa ctgtcactgc ccaaaggggg gggcgctatc acgggaatgg 120
gtgaagcatt aacccccact ggaccggatg gtatggccgc gctatctcta ccattgccta 180
tttctgccgg gcgcggttat gctcccgcat tcactctgaa ttacaacagc ggcgccggta 240
acagtccatt tggtctgggt tgggattgca acgttatgac tatccgccgc cgcacccatt 300
ttggcgtccc ccattatgac gaaaccgata cctttttggg gccagaaggc gaagtgctgg 360
tggtagcgga tcaacctcgc gacgaatcca cattacaggg tatcaattta ggcgccacct 420
ttaccgttac cggctaccgt tcccgtctgg aaagccattt cagccgattg gaatattggc 480
aacccaaaac aacaggtaaa acagattttt ggttgatata tagcccagat gggcaggtgc 540
atctactggg taaatcaccg caagcgcgga tcagcaaccc atcccaaacg acacaaacag 600
cacaatggct gctggaagcc tctgtatcat cacgtggcga acaaatttat tatcaatatc 660
gcgccgaaga tgacacaggt tgcgaagcag atgaaattac gcaccattta caggctacag 720
cgcaacgtta tttacacatc gtgtattacg gcaaccgtac agccagcgaa acattacccg 780
gtctggatgg cagcgcccca tcacaagcag actggttgtt ctatctggta tttgattacg 840
gcgaacgcag taacaacctg aaaacgccac cagcattttc gactacaggt agctggcttt 900
gccgtcagga ccgtttttcc cgttatgaat atggctttga gattcgtacc cgccgcttat 960
gccgtcaggt attgatgtac catcacctgc aagcactgga tagtaagata acagaacaca 1020
acggaccaac gctggtttca cgcctgatac tcaattacga cgaaagcgcg atagccagca 1080
cgctagtatt cgttcgccga gtgggacacg agcaagatgg taatgtcgtc accctgccgc 1140
cattagaatt ggcatatcag gatttttcac cgcgacatca cgctcactgg caaccaatgg 1200
atgtactggc aaacttcaat gccattcagc gctggcagct agtcgatcta aaaggcgaag 1260
gattacccgg cctgttatat caggataaag gcgcttggtg gtaccgctcc gcacagcgtc 1320
tgggcgaaat tggctcagat gccgtcactt gggaaaagat gcaaccttta tcggttattc 1380
cttctttgca aagtaatgcc tcgttggtgg atatcaatgg agacggccaa cttgactggg 1440
ttatcaccgg accgggatta cggggatatc atagtcaacg cccggatggc agttggacac 1500
gttttacccc actcaacgct ctgccggtgg aatacaccca tccacgcgcg caactcgcag 1560
atttaatggg agccgggcta tccgatttgg tgctgatcgg ccctaagagc gtgcgtttat 1620
atgccaatac ccgcgacggc tttgccaaag gaaaagatgt ggtgcaatcc ggtgatatca 1680
cactgccggt gccgggcgcc gatccacgta agttggtggc gtttagtgat gtattgggtt 1740
caggtcaagc ccatctggtt gaagtaagcg cgactaaagt cacctgctgg cctaatctgg 1800
ggcgcggacg ttttggtcaa cccattacct taccgggatt cagccagcca gcaaccgagt 1860
ttaacccggc tcaagtttat ctggccgatc tggatggcag cggtccaacg gatctgattt 1920
atgttcatac aaaccgtctg gatatcttcc tgaacaaaag tggcaatggc tttgctgaac 1980
cagtgacatt acgcttcccg gaaggtctgc gttttgatca tacctgtcag ttacaaatgg 2040
ccgatgtaca aggattaggc gtcgccagcc tgatactgag cgtgccgcat atgtctcccc 2100
atcactggcg ctgcgatctg accaacatga agccgtggtt actcaatgaa atgaacaaca 2160
atatgggggt ccatcacacc ttgcgttacc gcagttcctc ccaattctgg ctggatgaaa 2220
aagccgcggc gctgactacc ggacaaacac cggtttgcta tctccccttc ccgatccaca 2280
ccctatggca aacggaaaca gaagatgaaa tcagcggcaa caaattagtc acaacacttc 2340
gttatgctcg tggcgcatgg gacggacgcg agcgggaatt tcgcggattt ggttatgtag 2400
agcagacaga cagccatcaa ctggctcaag gcaacgcgcc agaacgtacg ccaccggcgc 2460
tgaccaaaaa ctggtatgcc accggactgc cggtgataga taacgcatta tcaaccgagt 2520
attggcgtga tgatcaggct tttgccggtt tctcaccgcg ctttacgact tggcaagata 2580
acaaagatgt cccgttaaca ccggaagatg ataacagtcg ttactggttc aaccgcgcgt 2640
tgaaaggtca actgctacgt agtgaactgt acggattgga cgatagtaca aataaacacg 2700
ttccctatac tgtcactgaa tttcgttcac aggtacgtcg attacagcat accgacagcc 2760
gataccctgt actttggtca tctgtagttg aaagccgcaa ctatcactac gaacgtatcg 2820
ccagcgaccc gcaatgcagt caaaatatta cgctatccag tgatcgattt ggtcagccgc 2880
taaaacagct ttcggtacag tacccgcgcc gccagcagcc agcaatcaat ctgtatcctg 2940
atacattgcc tgataagttg ttagccaaca gctatgatga ccaacaacgc caattacggc 3000
tcacctatca acaatccagt tggcatcacc tgaccaacaa taccgttcga gtattgggat 3060
taccggatag tacccgcagt gatatcttta cttatggcgc tgaaaatgtg cctgctggtg 3120
gtttaaatct ggaacttctg agtgataaaa atagcctgat cgcggacgat aaaccacgtg 3180
aatacctcgg tcagcaaaaa accgcttata ccgatggaca aaatacaacg ccgttgcaaa 3240
caccaacacg gcaagccctg attgccttta ccgaaacaac ggtattcaac cagtccacat 3300
tatcagcgtt taacggaagc atcccgtccg ataaattatc aacgacgctg gagcaagctg 3360
gatatcagca aacaaattat ctattccctc gcactggaga agataaagtt tgggtagccc 3420
atcacggcta taccgattat ggtacagcgg cacagttctg gcgcccgcaa aaacagagca 3480
acacccaact caccggtaaa atcaccctca tctgggatgc aaactattgc gttgtggtac 3540
aaacccggga tgctgctgga ctgacaacct cagccaaata tgactggcgt tttctgaccc 3600
cggtgcaact caccgatatc aatgacaatc agcaccttat cacactggat gcattgggcc 3660
gaccaatcac attgcgcttt tggggaactg aaaacggcaa gatgacaggt tattcctcac 3720
cggaaaaagc atcattttct ccaccatccg atgttaatgc cgctattgag ttaaaaaaac 3780
cgctccctgt agcacagtgt caggtctacg caccagaaag ctggatgcca gtattaagtc 3840
agaaaacctt caatcgactg gcagaacaag attggcaaaa gttatataac gcccgaatca 3900
tcaccgaaga tggacgtatc tgcacactgg cttatcgccg ctgggtacaa agccaaaagg 3960
caatccctca actcattagc ctgttaaaca acggaccccg tttacctcct cacagcctga 4020
cattgacgac ggatcgttat gatcacgatc ctgagcaaca gatccgtcaa caggtggtat 4080
tcagtgatgg ctttggccgc ttgctgcaag ccgctgcccg acatgaggca ggcatggccc 4140
ggcaacgcaa tgaagacggc tctttgatta taaatgtcca gcatactgag aaccgttggg 4200
cagtgactgg acgaacggaa tatgacaata aggggcaacc gatacgtacc tatcagccct 4260
atttcctcaa tgactggcga tacgtcagca atgatagtgc ccggcaggaa aaagaagctt 4320
atgcagatac ccatgtctat gatcccatag gtcgagaaat caaggttatc accgcaaaag 4380
gttggttccg tcgaaccttg ttcactccct ggtttactgt caatgaagat gaaaatgaca 4440
cagccgctga ggtgaagaag gtaaagatgc cgggatccga caacaagggt cagactatcc 4500
gcactaggcc tatgaaaaac atcgatccca aactttatca aaaaacccct actgtcagcg 4560
tttacgataa ccgtggtctg ataatccgta acatcgattt tcatcgtact accgcaaatg 4620
gtgatcccga tacccgtatt acccgccatc aatacgatat tcacggacac ctaaatcaaa 4680
gcatcgatcc gcgcctatat gaagccaagc aaaccaacaa tacgatcaaa cccaattttc 4740
tttggcagta tgatttgacc ggtaatcccc tatgtacaga gagcattgat gcaggtcgca 4800
ctgtcacctt gaatgatatt gaaggccgtc cgctactaac ggtgactgca acaggggtta 4860
tacaaactcg acaatatgaa acttcttccc tgcccggtcg tctgttatct gttgccgaac 4920
aaacacccga ggaaaaaaca tcccgtatca ccgaacgcct gatttgggct ggcaataccg 4980
aagcagagaa agaccataac cttgccggcc agtgcgtgcg tcactatgac acggcgggag 5040
ttacccggtt agagagttta tcactgaccg gtactgtttt atctcaatcc agccaactat 5100
tgatcgacac tcaagaggca aactggacag gtgataacga aaccgtctgg caaaacatgc 5160
tggctgatga catctacaca accctgagca ccttcgatgc caccggtgct ttactgactc 5220
agaccgatgc gaaagggaac attcagagac tggcttatga tgtggccggg cagctaaacg 5280
ggagctggct aacactcaaa ggccagacgg aacaagtgat tatcaaatcc ctgacctact 5340
ccgccgccgg acaaaaatta cgtgaggaac acggcaatga tgttatcacc gaatacagtt 5400
atgaaccgga aacccaacgg ctgatcggta tcaaaacccg ccgtccgtca gacactaaag 5460
tgctacaaga cctgcgctat gaatatgacc cggtaggcaa tgtcatcagc atccgtaatg 5520
acgcggaagc cacccgcttt tggcacaatc agaaagtgat gccggaaaac acttatacct 5580
acgattccct gtatcagctt atcagcgcca ccgggcgcga aatggcgaat ataggtcaac 5640
aaagtcacca atttccctca cccgctctac cttctgataa caacacctat accaactata 5700
cccgtactta tacttatgac cgtggcggca atctgaccaa aatccagcac agttcaccgg 5760
cgacgcaaaa caactacacc accaatatca cggtttcaaa tcgcagcaac cgcgcagtac 5820
tcagcacatt gaccgaagat ccggcgcaag tagatgcttt gtttgatgca ggcggacatc 5880
agaacacctt gatatcagga caaaacctga actggaatac tcgtggtgaa ctgcaacaag 5940
taacactggt taaacgggac aagggcgcca atgatgatcg ggaatggtat cgttatagcg 6000
gtgacggaag aaggatgtta aaaatcaatg aacagcaggc cagcaacaac gctcaaacac 6060
aacgtgtgac ttatttgccg aacttagaac ttcgtctaac acaaaacagc acggccacaa 6120
ccgaagattt gcaagttatc accgtaggcg aagcgggccg ggcacaggta cgagtattac 6180
attgggagag cggtaaaccg gaagatatcg acaataatca gttgcgttat agttacgata 6240
atcttatcgg ttccagtcaa cttgaattag atagcgaagg acaaattatc agtgaagaag 6300
aatattatcc ctatggtgga acagcattat gggccgccag gaatcagaca gaagccagtt 6360
ataaaactat ccgttattca ggcaaagagc gggatgccac cgggctatat tactacggct 6420
atcggtatta ccaaccgtgg ataggacggt ggttaagctc cgatccggca ggaacaatcg 6480
atgggctgaa tttatatcgg atggtgagga ataatccagt taccctcctt gatcctgatg 6540
gattaatgcc aacaattgca gaacgcatag cagcactaaa aaaaaataaa gtaacagact 6600
cagcgccttc gccagcaaat gccacaaacg tagcgataaa catccgcccg cctgtagcac 6660
caaaacctag cttaccgaaa gcatcaacga gtagccaacc aaccacacac cctatcggag 6720
ctgcaaacat aaaaccaacg acgtctgggt catctattgt tgctccattg agtccagtag 6780
gaaataaatc tacttctgaa atctctctgc cagaaagcgc tcaaagcagt tcttcaagca 6840
ctacctcgac aaatctacag aaaaaatcat ttactttata tagagcagat aacagatcct 6900
ttgaagaaat gcaaagtaaa ttccctgaag gatttaaagc ctggactcct ctagtcacta 6960
agatggcaag gcaatttgct agtatcttta ttggtcagaa agatacatct aatttaccta 7020
aagaaacagt caagaacata agcacatggg gagcaaagcc aaaactaaaa gatctctcaa 7080
attacataaa atataccaag gacaaatcta cagtatgggt ttctactgca attaatactg 7140
aagcaggtgg acaaagctca ggggctccac tccataaaat tgatatggat ctctacgagt 7200
ttgccattga tggacaaaaa ctaaatccac taccggaggg tagaactaaa aacatggtac 7260
cttccctttt actcgacacc ccacaaatag agacatcatc catcattgca cttaatcatg 7320
gaccggtaaa tgatgcagaa atttcatttc tgacaacaat tccgcttaaa aatgtaaaac 7380
ctcataagag acctagggat aataaaggcc aaaccattcg tacccgtcca gaattcatgt 7440
atagcacggc tgtattactc aataaaatca gtcccactcg cgacggtcag acgatgactc 7500
ttgcggatct gcaatattta tccttcagtg aactgagaaa aatctttgat gaccagctca 7560
gttggggaga ggctcgccat ctctatcatg aaactataga gcagaaaaaa aataatcgct 7620
tgctggaagc gcgtattttt acccgtgcca acccacaatt atccggtgct atccgactcg 7680
gtattgaacg agacagcgtt tcacgcagtt atgatgaaat gtttggtgcc cgttcttctt 7740
cctttgtgaa accgggttca gtggcttcca tgttttcacc ggctggctat ctcaccgaat 7800
tgtatcgtga agcgaaggac ttacattttt caagctctgc ttatcatctt gataatcgcc 7860
gtccggatct ggctgatctg actctgagcc agagtaatat ggatacagaa atttccaccc 7920
tgacactgtc taacgaactg ttgctggagc atattacccg caagaccgga ggtgattcgg 7980
acgcattgat ggagagcctg tcaacttacc gtcaggccat tgatacccct taccatcagc 8040
cttacgagac tatccgtcag gtcattatga cccatgacag tacactgtca gcgctgtccc 8100
gtaatcctga ggtgatgggg caggcggaag gggcttcatt actggcgatt ctggccaata 8160
tttctccgga gctttataac attttgaccg aagagattac ggaaaagaac gctgatgctt 8220
tatttgcgca aaacttcagt gaaaatatca cgcccgaaaa tttcgcgtca caatcatgga 8280
tagccaagta ttatggtctt gaactttctg aggtgcaaaa atacctcggg atgttgcaga 8340
atggctattc tgacagcacc tctgcttatg tggataatat ctcaacgggt ttagtggtca 8400
ataatgaaag taaactcgaa gcttacaaaa taacacgtgt aaaaacagat gattatgata 8460
aaaatataaa ttactttgat ttgatgtatg aaggaaataa tcagttcttt atacgtgcta 8520
attttaaggt atcaagagaa tttggggcta ctcttagaaa aaacgcaggg ccaagtggca 8580
ttgtcggcag cctttccggt cctctaatag ccaatacgaa ttttaaaagt aattatctaa 8640
gtaacatatc tgattctgaa tacaaaaacg gtgtaaagat atacgcctat cgctatacgt 8700
cttccaccag cgccacaaat cagggcggcg gaatattcac ttttgagtct tatcccctga 8760
ctatatttgc gctcaaactg aataaagcca ttcgcttgtg cctgactagc gggctttcac 8820
cgaatgaact gcaaactatc gtacgcagtg acaatgcaca aggcatcatc aacgactccg 8880
ttctgaccaa agttttctat actctgttct acagtcaccg ttatgcactg agctttgatg 8940
atgcacaggt actgaacgga tcggtcatta atcaatatgc cgacgatgac agtgtcagtc 9000
attttaaccg tctctttaat acaccgccgc tgaaagggaa aatctttgaa gccgacggca 9060
acacggtcag cattgatccg gatgaagagc aatctacctt tgcccgttca gccctgatgc 9120
gtggtctggg ggtcaacagt ggtgaactgt atcagttagg caaactggcg ggtgtgctgg 9180
acgcccaaaa taccatcaca ctttctgtct tcgttatctc ttcactgtat cgcctcacgt 9240
tactggcccg tgtccatcag ctgacggtca atgaactgtg tatgctttat ggtctttcgc 9300
cgttcaatgg caaaacaacg gcttctttgt cttccgggga gttgccacgg ctggttatct 9360
ggctgtatca ggtgacgcag tggctgactg aggcggaaat caccactgaa gcgatctggt 9420
tattatgtac gccagagttt agcgggaata tttcaccgga aatcagtaat ctgctcaata 9480
acctccgacc gagtattagt gaagatatgg cacagagtca caatcgggag ctgcaggctg 9540
aaattctcgc gccgtttatt gctgcaacgc tgcatctggc gtcaccggat atggcacggt 9600
atatcctgtt gtggaccgat aacctgcggc cgggtggctt agatattgcc gggtttatga 9660
cactggtatt gaaagagtcg ttaaatgcca atgaaaccac ccaattggta caattctgcc 9720
atgtgatggc acagttatcg ctttccgtac agacactgcg cctcagtgaa gcggagctat 9780
ccgtgctggt catctccgga ttcgccgtgc tgggggcaaa aaatcaacct gccggacagc 9840
acaatattga tacgctattc tcactctacc gattccacca gtggattaat gggctgggca 9900
atcccggctc tgacacgctg gatatgctgc gccagcagac actcacggcc gacagactgg 9960
cctccgtgat ggggctggac atcagtatgg taacgcaggc catggtttcc gccggcgtga 10020
accagcttca gtgttggcag gatatcaaca ccgtgttgca gtggatagat gtggcatcag 10080
cactgcacac gatgccgtcg gttatccgta cgctggtgaa tatccgttac gtgactgcat 10140
taaacaaagc cgagtcgaat ctgccttcct gggatgagtg gcagacactg gcagaaaata 10200
tggaagccgg actcagtaca caacaggctc agacgctggc ggattatacc gcggagcgct 10260
tgagtagcgt gctgtgcaat tggtttctgg cgaatatcca gccagaaggg gtgtccctgc 10320
acagccggga tgacctgtac agctatttcc tgattgataa tcaggtctct tctgccataa 10380
aaaccacccg actggcagag gccattgccg gtattcagct ctacatcaac cgggcgctga 10440
atcggataga gcctaatgcc cgtgccgatg tgtcaacccg ccagtttttt accgactgga 10500
cggtgaataa ccgttacagc acctggggcg gggtgtcgcg gctggtttat tatccggaaa 10560
attacattga cccaacccag cgtatcgggc agacccggat gatggatgaa ctgctggaaa 10620
atatcagcca gagtaaactt agccgggaca cagtggagga tgcctttaaa acttacctga 10680
cccgctttga aaccgtggcg gatctgaaag ttgtcagcgc ctatcacgac aacgtcaaca 10740
gcaacaccgg actgacctgg tttgtcggcc aaacgcggga gaacctgccg gaatactact 10800
ggtgtaacgt ggatatatca cggatgcagg cgggtgaact ggccgccaat gcctggaaag 10860
agtggacgaa gattgataca gcggtcaacc cctacaagga tgcaatacgt ccggtcatac 10920
tcagggaacg tttgcacctt atctgggtag aaaaagagga agtggcgaaa aatggtactg 10980
atccggtgga aacctgtgac cgttttactc tgaaactggc gtttctgcgt catgatggca 11040
gttggagtgc cccctggtct tacgatatca caacgcaggt ggaggcggtc actgacaaaa 11100
aacctgacac tgaacggctg gcgctggccg catcaggctt tcagggcgag gacactctgc 11160
tggtgtttgt ctacaaaacc gggaagagtt actcggattt tggcggcagc aataaaaatg 11220
tggcaggcat gaccatttac ggcgatggct ccttcaaaaa gatggagaac acagcactca 11280
gccgttacag ccaactgaaa aatacctttg atatcattca tactcaaggc aacgacttgg 11340
taagaaaggc cagctatcgt ttcgcgcagg attttgaagt gcctgcctcg ttgaatatgg 11400
gttctgccat cggtgatgat agtctgacgg tgatggagaa cgggaatatt ccgcagataa 11460
ccagtaaata ctccagcgat aaccttgcta ttacgctaca taacgccgct ttcactgtca 11520
gatatgatgg cagtggcaat gtcatcagaa acaaacaaat cagcgccatg aaactgacgg 11580
gggtggatgg aaagtcccag tacggcaatg catttatcat cgcaaatacc gttaaacatt 11640
atggcggtta ctctgatctg ggggggccga tcaccgttta taataaaacg aaaaactata 11700
ttgcatcagt tcaaggccac ttgatgaacg cagattacac taggcgtttg attctaacac 11760
cagttgaaaa taattattat gccagattgt tcgagtttcc attttctcca aacacaattt 11820
taaacaccgt tttcacggtt ggtagcaata aaaccagtga ttttaaaaag tgcagttatg 11880
ctgttgatgg taataattct cagggcttcc agatatttag ttcctatcaa tcatccggct 11940
ggctggatat tgatacaggc attaacaata ccgatatcaa aattacggtg atggctggca 12000
gtaaaaccca cacctttacg gccagtgacc atattgcttc cttgccggca aacagttttg 12060
atgctatgcc gtacaccttt aagccactgg aaatcgatgc ttcatcgttg gcctttacca 12120
ataatattgc tcctctggat atcgtttttg agaccaaagc caaagacggg cgagtgctgg 12180
gtaagatcaa gcaaacatta tcggtgaaac gggtaaatta taatccggaa gatattctgt 12240
ttctgcgtga aactcattcg ggtgcccaat atatgcagct cggggtgtat cgtattcgtc 12300
ttaataccct gctggcttct caactggtat ccagagcaaa cacgggcatt gatactatcc 12360
tgacaatgga aacccagcgg ttaccggaac ctccgttggg agaaggcttc tttgccaact 12420
ttgttctgcc taaatatgac cctgctgaac atggcgatga gcggtggttt aaaatccata 12480
ttgggaatgt tggcggtaac acgggaaggc agccttatta cagcggaatg ttatccgata 12540
cgtcggaaac cagtatgaca ctgtttgtcc cttatgccga agggtattac atgcatgaag 12600
gtgtcagatt gggggttgga taccagaaaa ttacctatga caacacttgg gaatctgctt 12660
tcttttattt tgatgagaca aaacagcaat ttgtattaat taacgatgct gatcatgatt 12720
caggaatgac gcaacagggg atcgtgaaaa atatcaagaa atacaaagga tttttgaatg 12780
tttctatcgc aacgggctat tccgccccga tggatttcaa tagtgccagc gccctctatt 12840
actgggaatt gttctattac accccgatga tgtgcttcca gcgtttgcta caggaaaaac 12900
aattcgacga agccacacaa tggataaact acgtctacaa tcccgccggc tatatcgtta 12960
acggagaaat cgccccctgg atctggaact gccggccgct ggaagagacc acctcctgga 13020
atgccaatcc gctggatgcc atcgatccgg atgccgtcgc ccaaaatgac ccaatgcact 13080
acaagattgc cacctttatg cgcctgttgg atcaacttat tctgcgcggc gatatggcct 13140
atcgagaact gacccgcgat gcgttgaatg aagccaaaat gtggtatgtg cgtactttag 13200
aattgctcgg tgatgagccg gaggattacg gtagccaaca gtgggcagca ccgtcccttt 13260
ccggggcggc gagtcaaacc gtgcaggcgg cttatcagca ggatcttacg atgctgggcc 13320
gtggtggggt ttccaagaat ctccgtaccg ctaactcgtt ggtgggtttg ttcctgccgg 13380
aatataaccc ggcgctcacc gattactggc aaaccctgcg tttgcgcctg tttaacctgc 13440
gccataatct ttccattgac ggacagccgt tatcgctggc gatttacgcc gagcctaccg 13500
atccgaaagc gctgctcacc agtatggtac aggcctctca gggcggtagt gcagtgctgc 13560
ccggcacatt gtcgttatac cgcttcccgg tgatgctgga gcggacccgc aatctggtag 13620
cgcaattaac ccagttcggc acctctctgc tcagtatggc agagcatgat gatgccgatg 13680
aactcaccac gctgctacta cagcagggta tggaactggc gacacagagc atccgtattc 13740
agcaacgaac tgtcgatgaa gtggatgctg atattgctgt attggcagag agccgccgca 13800
gtgcacaaaa tcgtctggaa aaataccagc agctgtatga cgaggatatc aaccacggag 13860
aacagcgggc aatgtcactg cttgatgcag cggcaggtca gtctctggcc gggcaggtgc 13920
tttcaatagc ggaaggggtg gccgatttag tgccaaacgt gttcggttta gcttgtggcg 13980
gcagtcgttg gggggcagca ctgcgtgctt ccgcctccgt gatgtcgctt tctgccacag 14040
cttcccaata ttccgcagac aaaatcagcc gttcggaagc ctaccgccgc cgccgtcagg 14100
agtgggaaat tcagcgtgat aatgctgacg gtgaagtcaa acaaatggat gcccagttgg 14160
aaagcctgaa aatccgccgc gaagcagcac agatgcaggt ggaatatcag gagacccagc 14220
aggcccatac tcaggctcag ttagagctgt tacagcgtaa attcacaaac aaagcgcttt 14280
acagttggat gcgcggcaag ctgagtgcta tctattacca gttctttgac ctgacccagt 14340
ccttctgcct gatggcacag gaagcgctgc gccgcgagct gaccgacaac ggtgttacct 14400
ttatccgggg tggggcctgg aacggtacga ctgcgggttt gatggcgggt gaaacgttgc 14460
tgctgaatct ggcagaaatg gaaaaagtct ggctggagcg tgatgagcgg gcactggaag 14520
tgacccgtac cgtctcgttg gcacagttct atcaggcctt atcatcagac aactttaatc 14580
tgaccgaaaa actcacgcaa ttcctgcgtg aagggaaagg caacgtagga gcttccggca 14640
atgaattaaa actcagtaac cgtcagatag aagcctcagt gcgattgtct gatttgaaaa 14700
ttttcagcga ctaccccgaa agccttggca atacccgtca gttgaaacag gtgagtgtca 14760
ccttgccggc gctggttggg ccgtatgaag atattcgggc ggtgctgaat tacgggggca 14820
gcatcgtcat gccacgcggt tgcagtgcta ttgctctctc ccacggcgtg aatgacagtg 14880
gtcaatttat gctggatttc aacgattccc gttatctgcc gtttgaaggt atttccgtga 14940
atgacagcgg cagcctgacg ttgagtttcc cggatgcgac tgatcggcag aaagcgctgc 15000
tggagagcct gagcgatatc attctgcata tccgctatac cattcgttct taattaatgc 15060
tctcgag 15067
<210> 68
<211> 5001
<212> PRT
<213> 人工序列
<220>
<223> SEQ ID N0:67编码的8836″BCA″三联融合蛋白的氨基酸序列
<400> 68
Met Gln Asn Ser Gln Asp Phe Ser Ile Thr Glu Leu Ser Leu Pro Lys
1 5 10 15
Gly Gly Gly Ala Ile Thr Gly Met Gly Glu Ala Leu Thr Pro Thr Gly
20 25 30
Pro Asp Gly Met Ala Ala Leu Ser Leu Pro Leu Pro Ile Ser Ala Gly
35 40 45
Arg Gly Tyr Ala Pro Ala Phe Thr Leu Asn Tyr Asn Ser Gly Ala Gly
50 55 60
Asn Ser Pro Phe Gly Leu Gly Trp Asp Cys Asn Val Met Thr Ile Arg
65 70 75 80
Arg Arg Thr His Phe Gly Val Pro His Tyr Asp Glu Thr Asp Thr Phe
85 90 95
Leu Gly Pro Glu Gly Glu Val Leu Val Val Ala Asp Gln Pro Arg Asp
100 105 110
Glu Ser Thr Leu Gln Gly Ile Asn Leu Gly Ala Thr Phe Thr Val Thr
115 120 125
Gly Tyr Arg Ser Arg Leu Glu Ser His Phe Ser Arg Leu Glu Tyr Trp
130 135 140
Gln Pro Lys Thr Thr Gly Lys Thr Asp Phe Trp Leu Ile Tyr Ser Pro
145 150 155 160
Asp Gly Gln Val His Leu Leu Gly Lys Ser Pro Gln Ala Arg Ile Ser
165 170 175
Asn Pro Ser Gln Thr Thr Gln Thr Ala Gln Trp Leu Leu Glu Ala Ser
180 185 190
Val Ser Ser Arg Gly Glu Gln Ile Tyr Tyr Gln Tyr Arg Ala Glu Asp
195 200 205
Asp Thr Gly Cys Glu Ala Asp Glu Ile Thr His His Leu Gln Ala Thr
210 215 220
Ala Gln Arg Tyr Leu His Ile Val Tyr Tyr Gly Asn Arg Thr Ala Ser
225 230 235 240
Glu Thr Leu Pro Gly Leu Asp Gly Ser Ala Pro Ser Gln Ala Asp Trp
245 250 255
Leu Phe Tyr Leu Val Phe Asp Tyr Gly Glu Arg Ser Asn Asn Leu Lys
260 265 270
Thr Pro Pro Ala Phe Ser Thr Thr Gly Ser Trp Leu Cys Arg Gln Asp
275 280 285
Arg Phe Ser Arg Tyr Glu Tyr Gly Phe Glu Ile Arg Thr Arg Arg Leu
290 295 300
Cys Arg Gln Val Leu Met Tyr His His Leu Gln Ala Leu Asp Ser Lys
305 310 315 320
Ile Thr Glu His Asn Gly Pro Thr Leu Val Ser Arg Leu Ile Leu Asn
325 330 335
Tyr Asp Glu Ser Ala Ile Ala Ser Thr Leu Val Phe Val Arg Arg Val
340 345 350
Gly His Glu Gln Asp Gly Asn Val Val Thr Leu Pro Pro Leu Glu Leu
355 360 365
Ala Tyr Gln Asp Phe Ser Pro Arg His His Ala His Trp Gln Pro Met
370 375 380
Asp Val Leu Ala Asn Phe Asn Ala Ile Gln Arg Trp Gln Leu Val Asp
385 390 395 400
Leu Lys Gly Glu Gly Leu Pro Gly Leu Leu Tyr Gln Asp Lys Gly Ala
405 410 415
Trp Trp Tyr Arg Ser Ala Gln Arg Leu Gly Glu Ile Gly Ser Asp Ala
420 425 430
Val Thr Trp Glu Lys Met Gln Pro Leu Ser Val Ile Pro Ser Leu Gln
435 440 445
Ser Asn Ala Ser Leu Val Asp Ile Asn Gly Asp Gly Gln Leu Asp Trp
450 455 460
Val Ile Thr Gly Pro Gly Leu Arg Gly Tyr His Ser Gln Arg Pro Asp
465 470 475 480
Gly Ser Trp Thr Arg Phe Thr Pro Leu Asn Ala Leu Pro Val Glu Tyr
485 490 495
Thr His Pro Arg Ala Gln Leu Ala Asp Leu Met Gly Ala Gly Leu Ser
500 505 510
Asp Leu Val Leu Ile Gly Pro Lys Ser Val Arg Leu Tyr Ala Asn Thr
515 520 525
Arg Asp Gly Phe Ala Lys Gly Lys Asp Val Val Gln Ser Gly Asp Ile
530 535 540
Thr Leu Pro Val Pro Gly Ala Asp Pro Arg Lys Leu Val Ala Phe Ser
545 550 555 560
Asp Val Leu Gly Ser Gly Gln Ala His Leu Val Glu Val Ser Ala Thr
565 570 575
Lys Val Thr Cys Trp Pro Asn Leu Gly Arg Gly Arg Phe Gly Gln Pro
580 585 590
Ile Thr Leu Pro Gly Phe Ser Gln Pro Ala Thr Glu Phe Asn Pro Ala
595 600 605
Gln Val Tyr Leu Ala Asp Leu Asp Gly Ser Gly Pro Thr Asp Leu Ile
610 615 620
Tyr Val His Thr Asn Arg Leu Asp Ile Phe Leu Asn Lys Ser Gly Asn
625 630 635 640
Gly Phe Ala Glu Pro Val Thr Leu Arg Phe Pro Glu Gly Leu Arg Phe
645 650 655
Asp His Thr Cys Gln Leu Gln Met Ala Asp Val Gln Gly Leu Gly Val
660 665 670
Ala Ser Leu Ile Leu Ser Val Pro His Met Ser Pro His His Trp Arg
675 680 685
Cys Asp Leu Thr Asn Met Lys Pro Trp Leu Leu Asn Glu Met Asn Asn
690 695 700
Asn Met Gly Val His His Thr Leu Arg Tyr Arg Ser Ser Ser Gln Phe
705 710 715 720
Trp Leu Asp Glu Lys Ala Ala Ala Leu Thr Thr Gly Gln Thr Pro Val
725 730 735
Cys Tyr Leu Pro Phe Pro Ile His Thr Leu Trp Gln Thr Glu Thr Glu
740 745 750
Asp Glu Ile Ser Gly Asn Lys Leu Val Thr Thr Leu Arg Tyr Ala Arg
755 760 765
Gly Ala Trp Asp Gly Arg Glu Arg Glu Phe Arg Gly Phe Gly Tyr Val
770 775 780
Glu Gln Thr Asp Ser His Gln Leu Ala Gln Gly Asn Ala Pro Glu Arg
785 790 795 800
Thr Pro Pro Ala Leu Thr Lys Asn Trp Tyr Ala Thr Gly Leu Pro Val
805 810 815
Ile Asp Asn Ala Leu Ser Thr Glu Tyr Trp Arg Asp Asp Gln Ala Phe
820 825 830
Ala Gly Phe Ser Pro Arg Phe Thr Thr Trp Gln Asp Asn Lys Asp Val
835 840 845
Pro Leu Thr Pro Glu Asp Asp Asn Ser Arg Tyr Trp Phe Asn Arg Ala
850 855 860
Leu Lys Gly Gln Leu Leu Arg Ser Glu Leu Tyr Gly Leu Asp Asp Ser
865 870 875 880
Thr Asn Lys His Val Pro Tyr Thr Val Thr Glu Phe Arg Ser Gln Val
885 890 895
Arg Arg Leu Gln His Thr Asp Ser Arg Tyr Pro Val Leu Trp Ser Ser
900 905 910
Val Val Glu Ser Arg Asn Tyr His Tyr Glu Arg Ile Ala Ser Asp Pro
915 920 925
Gln Cys Ser Gln Asn Ile Thr Leu Ser Ser Asp Arg Phe Gly Gln Pro
930 935 940
Leu Lys Gln Leu Ser Val Gln Tyr Pro Arg Arg Gln Gln Pro Ala Ile
945 950 955 960
Asn Leu Tyr Pro Asp Thr Leu Pro Asp Lys Leu Leu Ala Asn Ser Tyr
965 970 975
Asp Asp Gln Gln Arg Gln Leu Arg Leu Thr Tyr Gln Gln Ser Ser Trp
980 985 990
His His Leu Thr Asn Asn Thr Val Arg Val Leu Gly Leu Pro Asp Ser
995 1000 1005
Thr Arg Ser Asp Ile Phe Thr Tyr Gly Ala Glu Asn Val Pro Ala
1010 1015 1020
Gly Gly Leu Asn Leu Glu Leu Leu Ser Asp Lys Asn Ser Leu Ile
1025 1030 1035
Ala Asp Asp Lys Pro Arg Glu Tyr Leu Gly Gln Gln Lys Thr Ala
1040 1045 1050
Tyr Thr Asp Gly Gln Asn Thr Thr Pro Leu Gln Thr Pro Thr Arg
1055 1060 1065
Gln Ala Leu Ile Ala Phe Thr Glu Thr Thr Val Phe Asn Gln Ser
1070 1075 1080
Thr Leu Ser Ala Phe Asn Gly Ser Ile Pro Ser Asp Lys Leu Ser
1085 1090 1095
Thr Thr Leu Glu Gln Ala Gly Tyr Gln Gln Thr Asn Tyr Leu Phe
1100 1105 1110
Pro Arg Thr Gly Glu Asp Lys Val Trp Val Ala His His Gly Tyr
1115 1120 1125
Thr Asp Tyr Gly Thr Ala Ala Gln Phe Trp Arg Pro Gln Lys Gln
1130 1135 1140
Ser Asn Thr Gln Leu Thr Gly Lys Ile Thr Leu Ile Trp Asp Ala
1145 1150 1155
Asn Tyr Cys Val Val Val Gln Thr Arg Asp Ala Ala Gly Leu Thr
1160 1165 1170
Thr Ser Ala Lys Tyr Asp Trp Arg Phe Leu Thr Pro Val Gln Leu
1175 1180 1185
Thr Asp Ile Asn Asp Asn Gln His Leu Ile Thr Leu Asp Ala Leu
1190 1195 1200
Gly Arg Pro Ile Thr Leu Arg Phe Trp Gly Thr Glu Asn Gly Lys
1205 1210 1215
Met Thr Gly Tyr Ser Ser Pro Glu Lys Ala Ser Phe Ser Pro Pro
1220 1225 1230
Ser Asp Val Asn Ala Ala Ile Glu Leu Lys Lys Pro Leu Pro Val
1235 1240 1245
Ala Gln Cys Gln Val Tyr Ala Pro Glu Ser Trp Met Pro Val Leu
1250 1255 1260
Ser Gln Lys Thr Phe Asn Arg Leu Ala Glu Gln Asp Trp Gln Lys
1265 1270 1275
Leu Tyr Asn Ala Arg Ile Ile Thr Glu Asp Gly Arg Ile Cys Thr
1280 1285 1290
Leu Ala Tyr Arg Arg Trp Val Gln Ser Gln Lys Ala Ile Pro Gln
1295 1300 1305
Leu Ile Ser Leu Leu Asn Asn Gly Pro Arg Leu Pro Pro His Ser
1310 1315 1320
Leu Thr Leu Thr Thr Asp Arg Tyr Asp His Asp Pro Glu Gln Gln
1325 1330 1335
Ile Arg Gln Gln Val Val Phe Ser Asp Gly Phe Gly Arg Leu Leu
1340 1345 1350
Gln Ala Ala Ala Arg His Glu Ala Gly Met Ala Arg Gln Arg Asn
1355 1360 1365
Glu Asp Gly Ser Leu Ile Ile Asn Val Gln His Thr Glu Asn Arg
1370 1375 1380
Trp Ala Val Thr Gly Arg Thr Glu Tyr Asp Asn Lys Gly Gln Pro
1385 1390 1395
Ile Arg Thr Tyr Gln Pro Tyr Phe Leu Asn Asp Trp Arg Tyr Val
1400 1405 1410
Ser Asn Asp Ser Ala Arg Gln Glu Lys Glu Ala Tyr Ala Asp Thr
1415 1420 1425
His Val Tyr Asp Pro Ile Gly Arg Glu Ile Lys Val Ile Thr Ala
1430 1435 1440
Lys Gly Trp Phe Arg Arg Thr Leu Phe Thr Pro Trp Phe Thr Val
1445 1450 1455
Asn Glu Asp Glu Asn Asp Thr Ala Ala Glu Val Lys Lys Val Lys
1460 1465 1470
Met Pro Gly Ser Asp Asn Lys Gly Gln Thr Ile Arg Thr Arg Pro
1475 1480 1485
Met Lys Asn Ile Asp Pro Lys Leu Tyr Gln Lys Thr Pro Thr Val
1490 1495 1500
Ser Val Tyr Asp Asn Arg Gly Leu Ile Ile Arg Asn Ile Asp Phe
1505 1510 1515
His Arg Thr Thr Ala Asn Gly Asp Pro Asp Thr Arg Ile Thr Arg
1520 1525 1530
His Gln Tyr Asp Ile His Gly His Leu Asn Gln Ser Ile Asp Pro
1535 1540 1545
Arg Leu Tyr Glu Ala Lys Gln Thr Asn Asn Thr Ile Lys Pro Asn
1550 1555 1560
Phe Leu Trp Gln Tyr Asp Leu Thr Gly Asn Pro Leu Cys Thr Glu
1565 1570 1575
Ser Ile Asp Ala Gly Arg Thr Val Thr Leu Asn Asp Ile Glu Gly
1580 1585 1590
Arg Pro Leu Leu Thr Val Thr Ala Thr Gly Val Ile Gln Thr Arg
1595 1600 1605
Gln Tyr Glu Thr Ser Ser Leu Pro Gly Arg Leu Leu Ser Val Ala
1610 1615 1620
Glu Gln Thr Pro Glu Glu Lys Thr Ser Arg Ile Thr Glu Arg Leu
1625 1630 1635
Ile Trp Ala Gly Asn Thr Glu Ala Glu Lys Asp His Asn Leu Ala
1640 1645 1650
Gly Gln Cys Val Arg His Tyr Asp Thr Ala Gly Val Thr Arg Leu
1655 1660 1665
Glu Ser Leu Ser Leu Thr Gly Thr Val Leu Ser Gln Ser Ser Gln
1670 1675 1680
Leu Leu Ile Asp Thr Gln Glu Ala Asn Trp Thr Gly Asp Asn Glu
1685 1690 1695
Thr Val Trp Gln Asn Met Leu Ala Asp Asp Ile Tyr Thr Thr Leu
1700 1705 1710
Ser Thr Phe Asp Ala Thr Gly Ala Leu Leu Thr Gln Thr Asp Ala
1715 1720 1725
Lys Gly Asn Ile Gln Arg Leu Ala Tyr Asp Val Ala Gly Gln Leu
1730 1735 1740
Asn Gly Ser Trp Leu Thr Leu Lys Gly Gln Thr Glu Gln Val Ile
1745 1750 1755
Ile Lys Ser Leu Thr Tyr Ser Ala Ala Gly Gln Lys Leu Arg Glu
1760 1765 1770
Glu His Gly Asn Asp Val Ile Thr Glu Tyr Ser Tyr Glu Pro Glu
1775 1780 1785
Thr Gln Arg Leu Ile Gly Ile Lys Thr Arg Arg Pro Ser Asp Thr
1790 1795 1800
Lys Val Leu Gln Asp Leu Arg Tyr Glu Tyr Asp Pro Val Gly Asn
1805 1810 1815
Val Ile Ser Ile Arg Asn Asp Ala Glu Ala Thr Arg Phe Trp His
1820 1825 1830
Asn Gln Lys Val Met Pro Glu Asn Thr Tyr Thr Tyr Asp Ser Leu
1835 1840 1845
Tyr Gln Leu Ile Ser Ala Thr Gly Arg Glu Met Ala Asn Ile Gly
1850 1855 1860
Gln Gln Ser His Gln Phe Pro Ser Pro Ala Leu Pro Ser Asp Asn
1865 1870 1875
Asn Thr Tyr Thr Asn Tyr Thr Arg Thr Tyr Thr Tyr Asp Arg Gly
1880 1885 1890
Gly Asn Leu Thr Lys Ile Gln His Ser Ser Pro Ala Thr Gln Asn
1895 1900 1905
Asn Tyr Thr Thr Asn Ile Thr Val Ser Asn Arg Ser Asn Arg Ala
1910 1915 1920
Val Leu Ser Thr Leu Thr Glu Asp Pro Ala Gln Val Asp Ala Leu
1925 1930 1935
Phe Asp Ala Gly Gly His Gln Asn Thr Leu Ile Ser Gly Gln Asn
1940 1945 1950
Leu Asn Trp Asn Thr Arg Gly Glu Leu Gln Gln Val Thr Leu Val
1955 1960 1965
Lys Arg Asp Lys Gly Ala Asn Asp Asp Arg Glu Trp Tyr Arg Tyr
1970 1975 1980
Ser Gly Asp Gly Arg Arg Met Leu Lys Ile Asn Glu Gln Gln Ala
1985 1990 1995
Ser Asn Asn Ala Gln Thr Gln Arg Val Thr Tyr Leu Pro Asn Leu
2000 2005 2010
Glu Leu Arg Leu Thr Gln Asn Ser Thr Ala Thr Thr Glu Asp Leu
2015 2020 2025
Gln Val Ile Thr Val Gly Glu Ala Gly Arg Ala Gln Val Arg Val
2030 2035 2040
Leu His Trp Glu Ser Gly Lys Pro Glu Asp Ile Asp Asn Asn Gln
2045 2050 2055
Leu Arg Tyr Ser Tyr Asp Asn Leu Ile Gly Ser Ser Gln Leu Glu
2060 2065 2070
Leu Asp Ser Glu Gly Gln Ile Ile Ser Glu Glu Glu Tyr Tyr Pro
2075 2080 2085
Tyr Gly Gly Thr Ala Leu Trp Ala Ala Arg Asn Gln Thr Glu Ala
2090 2095 2100
Ser Tyr Lys Thr Ile Arg Tyr Ser Gly Lys Glu Arg Asp Ala Thr
2105 2110 2115
Gly Leu Tyr Tyr Tyr Gly Tyr Arg Tyr Tyr Gln Pro Trp Ile Gly
2120 2125 2130
Arg Trp Leu Ser Ser Asp Pro Ala Gly Thr Ile Asp Gly Leu Asn
2135 2140 2145
Leu Tyr Arg Met Val Arg Asn Asn Pro Val Thr Leu Leu Asp Pro
2150 2155 2160
Asp Gly Leu Met Pro Thr Ile Ala Glu Arg Ile Ala Ala Leu Lys
2165 2170 2175
Lys Asn Lys Val Thr Asp Ser Ala Pro Ser Pro Ala Asn Ala Thr
2180 2185 2190
Asn Val Ala Ile Asn Ile Arg Pro Pro Val Ala Pro Lys Pro Ser
2195 2200 2205
Leu Pro Lys Ala Ser Thr Ser Ser Gln Pro Thr Thr His Pro Ile
2210 2215 2220
Gly Ala Ala Asn Ile Lys Pro Thr Thr Ser Gly Ser Ser Ile Val
2225 2230 2235
Ala Pro Leu Ser Pro Val Gly Asn Lys Ser Thr Ser Glu Ile Ser
2240 2245 2250
Leu Pro Glu Ser Ala Gln Ser Ser Ser Ser Ser Thr Thr Ser Thr
2255 2260 2265
Asn Leu Gln Lys Lys Ser Phe Thr Leu Tyr Arg Ala Asp Asn Arg
2270 2275 2280
Ser Phe Glu Glu Met Gln Ser Lys Phe Pro Glu Gly Phe Lys Ala
2285 2290 2295
Trp Thr Pro Leu Val Thr Lys Met Ala Arg Gln Phe Ala Ser Ile
2300 2305 2310
Phe Ile Gly Gln Lys Asp Thr Ser Asn Leu Pro Lys Glu Thr Val
2315 2320 2325
Lys Asn Ile Ser Thr Trp Gly Ala Lys Pro Lys Leu Lys Asp Leu
2330 2335 2340
Ser Asn Tyr Ile Lys Tyr Thr Lys Asp Lys Ser Thr Val Trp Val
2345 2350 2355
Ser Thr Ala Ile Asn Thr Glu Ala Gly Gly Gln Ser Ser Gly Ala
2360 2365 2370
Pro Leu His Lys Ile Asp Met Asp Leu Tyr Glu Phe Ala Ile Asp
2375 2380 2385
Gly Gln Lys Leu Asn Pro Leu Pro Glu Gly Arg Thr Lys Asn Met
2390 2395 2400
Val Pro Ser Leu Leu Leu Asp Thr Pro Gln Ile Glu Thr Ser Ser
2405 2410 2415
Ile Ile Ala Leu Asn His Gly Pro Val Asn Asp Ala Glu Ile Ser
2420 2425 2430
Phe Leu Thr Thr Ile Pro Leu Lys Asn Val Lys Pro His Lys Arg
2435 2440 2445
Pro Arg Asp Asn Lys Gly Gln Thr Ile Arg Thr Arg Pro Glu Phe
2450 2455 2460
Met Tyr Ser Thr Ala Val Leu Leu Asn Lys Ile Ser Pro Thr Arg
2465 2470 2475
Asp Gly Gln Thr Met Thr Leu Ala Asp Leu Gln Tyr Leu Ser Phe
2480 2485 2490
Ser Glu Leu Arg Lys Ile Phe Asp Asp Gln Leu Ser Trp Gly Glu
2495 2500 2505
Ala Arg His Leu Tyr His Glu Thr Ile Glu Gln Lys Lys Asn Asn
2510 2515 2520
Arg Leu Leu Glu Ala Arg Ile Phe Thr Arg Ala Asn Pro Gln Leu
2525 2530 2535
Ser Gly Ala Ile Arg Leu Gly Ile Glu Arg Asp Ser Val Ser Arg
2540 2545 2550
Ser Tyr Asp Glu Met Phe Gly Ala Arg Ser Ser Ser Phe Val Lys
2555 2560 2565
Pro Gly Ser Val Ala Ser Met Phe Ser Pro Ala Gly Tyr Leu Thr
2570 2575 2580
Glu Leu Tyr Arg Glu Ala Lys Asp Leu His Phe Ser Ser Ser Ala
2585 2590 2595
Tyr His Leu Asp Asn Arg Arg Pro Asp Leu Ala Asp Leu Thr Leu
2600 2605 2610
Ser Gln Ser Asn Met Asp Thr Glu Ile Ser Thr Leu Thr Leu Ser
2615 2620 2625
Asn Glu Leu Leu Leu Glu His Ile Thr Arg Lys Thr Gly Gly Asp
2630 2635 2640
Ser Asp Ala Leu Met Glu Ser Leu Ser Thr Tyr Arg Gln Ala Ile
2645 2650 2655
Asp Thr Pro Tyr His Gln Pro Tyr Glu Thr Ile Arg Gln Val Ile
2660 2665 2670
Met Thr His Asp Ser Thr Leu Ser Ala Leu Ser Arg Asn Pro Glu
2675 2680 2685
Val Met Gly Gln Ala Glu Gly Ala Ser Leu Leu Ala Ile Leu Ala
2690 2695 2700
Asn Ile Ser Pro Glu Leu Tyr Asn Ile Leu Thr Glu Glu Ile Thr
2705 2710 2715
Glu Lys Asn Ala Asp Ala Leu Phe Ala Gln Asn Phe Ser Glu Asn
2720 2725 2730
Ile Thr Pro Glu Asn Phe Ala Ser Gln Ser Trp Ile Ala Lys Tyr
2735 2740 2745
Tyr Gly Leu Glu Leu Ser Glu Val Gln Lys Tyr Leu Gly Met Leu
2750 2755 2760
Gln Asn Gly Tyr Ser Asp Ser Thr Ser Ala Tyr Val Asp Asn Ile
2765 2770 2775
Ser Thr Gly Leu Val Val Asn Asn Glu Ser Lys Leu Glu Ala Tyr
2780 2785 2790
Lys Ile Thr Arg Val Lys Thr Asp Asp Tyr Asp Lys Asn Ile Asn
2795 2800 2805
Tyr Phe Asp Leu Met Tyr Glu Gly Asn Asn Gln Phe Phe Ile Arg
2810 2815 2820
Ala Asn Phe Lys Val Ser Arg Glu Phe Gly Ala Thr Leu Arg Lys
2825 2830 2835
Asn Ala Gly Pro Ser Gly Ile Val Gly Ser Leu Ser Gly Pro Leu
2840 2845 2850
Ile Ala Asn Thr Asn Phe Lys Ser Asn Tyr Leu Ser Asn Ile Ser
2855 2860 2865
Asp Ser Glu Tyr Lys Asn Gly Val Lys Ile Tyr Ala Tyr Arg Tyr
2870 2875 2880
Thr Ser Ser Thr Ser Ala Thr Asn Gln Gly Gly Gly Ile Phe Thr
2885 2890 2895
Phe Glu Ser Tyr Pro Leu Thr Ile Phe Ala Leu Lys Leu Asn Lys
2900 2905 29l0
Ala Ile Arg Leu Cys Leu Thr Ser Gly Leu Ser Pro Asn Glu Leu
2915 2920 2925
Gln Thr Ile Val Arg Ser Asp Asn Ala Gln Gly Ile Ile Asn Asp
2930 2935 2940
Ser Val Leu Thr Lys Val Phe Tyr Thr Leu Phe Tyr Ser His Arg
2945 2950 2955
Tyr Ala Leu Ser Phe Asp Asp Ala Gln Val Leu Asn Gly Ser Val
2960 2965 2970
Ile Asn Gln Tyr Ala Asp Asp Asp Ser Val Ser His Phe Asn Arg
2975 2980 2985
Leu Phe Asn Thr Pro Pro Leu Lys Gly Lys Ile Phe Glu Ala Asp
2990 2995 3000
Gly Asn Thr Val Ser Ile Asp Pro Asp Glu Glu Gln Ser Thr Phe
3005 3010 3015
Ala Arg Ser Ala Leu Met Arg Gly Leu Gly Val Asn Ser Gly Glu
3020 3025 3030
Leu Tyr Gln Leu Gly Lys Leu Ala Gly Val Leu Asp Ala Gln Asn
3035 3040 3045
Thr Ile Thr Leu Ser Val Phe Val Ile Ser Ser Leu Tyr Arg Leu
3050 3055 3060
Thr Leu Leu Ala Arg Val His Gln Leu Thr Val Asn Glu Leu Cys
3065 3070 3075
Met Leu Tyr Gly Leu Ser Pro Phe Asn Gly Lys Thr Thr Ala Ser
3080 3085 3090
Leu Ser Ser Gly Glu Leu Pro Arg Leu Val Ile Trp Leu Tyr Gln
3095 3100 3105
Val Thr Gln Trp Leu Thr Glu Ala Glu Ile Thr Thr Glu Ala Ile
3110 3115 3120
Trp Leu Leu Cys Thr Pro Glu Phe Ser Gly Asn Ile Ser Pro Glu
3125 3130 3135
Ile Ser Asn Leu Leu Asn Asn Leu Arg Pro Ser Ile Ser Glu Asp
3140 3145 3150
Met Ala Gln Ser His Asn Arg Glu Leu Gln Ala Glu Ile Leu Ala
3155 3160 3165
Pro Phe Ile Ala Ala Thr Leu His Leu Ala Ser Pro Asp Met Ala
3170 3175 3180
Arg Tyr Ile Leu Leu Trp Thr Asp Asn Leu Arg Pro Gly Gly Leu
3185 3190 3195
Asp Ile Ala Gly Phe Met Thr Leu Val Leu Lys Glu Ser Leu Asn
3200 3205 3210
Ala Asn Glu Thr Thr Gln Leu Val Gln Phe Cys His Val Met Ala
3215 3220 3225
Gln Leu Ser Leu Ser Val Gln Thr Leu Arg Leu Ser Glu Ala Glu
3230 3235 3240
Leu Ser Val Leu Val Ile Ser Gly Phe Ala Val Leu Gly Ala Lys
3245 3250 3255
Asn Gln Pro Ala Gly Gln His Asn Ile Asp Thr Leu Phe Ser Leu
3260 3265 3270
Tyr Arg Phe His Gln Trp Ile Asn Gly Leu Gly Asn Pro Gly Ser
3275 3280 3285
Asp Thr Leu Asp Met Leu Arg Gln Gln Thr Leu Thr Ala Asp Arg
3290 3295 3300
Leu Ala Ser Val Met Gly Leu Asp Ile Ser Met Val Thr Gln Ala
3305 3310 3315
Met Val Ser Ala Gly Val Asn Gln Leu Gln Cys Trp Gln Asp Ile
3320 3325 3330
Asn Thr Val Leu Gln Trp Ile Asp Val Ala Ser Ala Leu His Thr
333 5 3340 3345
Met Pro Ser Val Ile Arg Thr Leu Val Asn Ile Arg Tyr Val Thr
3350 3355 3360
Ala Leu Asn Lys Ala Glu Ser Asn Leu Pro Ser Trp Asp Glu Trp
3365 3370 3375
Gln Thr Leu Ala Glu Asn Met Glu Ala Gly Leu Ser Thr Gln Gln
3380 3385 3390
Ala Gln Thr Leu Ala Asp Tyr Thr Ala Glu Arg Leu Ser Ser Val
3395 3400 3405
Leu Cys Asn Trp Phe Leu Ala Asn Ile Gln Pro Glu Gly Val Ser
3410 3415 3420
Leu His Ser Arg Asp Asp Leu Tyr Ser Tyr Phe Leu Ile Asp Asn
3425 3430 3435
Gln Val Ser Ser Ala Ile Lys Thr Thr Arg Leu Ala Glu Ala Ile
3440 3445 3450
Ala Gly Ile Gln Leu Tyr Ile Asn Arg Ala Leu Asn Arg Ile Glu
3455 3460 3465
Pro Asn Ala Arg Ala Asp Val Ser Thr Arg Gln Phe Phe Thr Asp
3470 3475 3480
Trp Thr Val Asn Asn Arg Tyr Ser Thr Trp Gly Gly Val Ser Arg
3485 3490 3495
Leu Val Tyr Tyr Pro Glu Asn Tyr Ile Asp Pro Thr Gln Arg Ile
3500 3505 3510
Gly Gln Thr Arg Met Met Asp Glu Leu Leu Glu Asn Ile Ser Gln
3515 3520 3525
Ser Lys Leu Ser Arg Asp Thr Val Glu Asp Ala Phe Lys Thr Tyr
3530 3535 3540
Leu Thr Arg Phe Glu Thr Val Ala Asp Leu Lys Val Val Ser Ala
3545 3550 3555
Tyr His Asp Asn Val Asn Ser Asn Thr Gly Leu Thr Trp Phe Val
3560 3565 3570
Gly Gln Thr Arg Glu Asn Leu Pro Glu Tyr Tyr Trp Cys Asn Val
3575 3580 3585
Asp Ile Ser Arg Met Gln Ala Gly Glu Leu Ala Ala Asn Ala Trp
3590 3595 3600
Lys Glu Trp Thr Lys Ile Asp Thr Ala Val Asn Pro Tyr Lys Asp
3605 3610 3615
Ala Ile Arg Pro Val Ile Leu Arg Glu Arg Leu His Leu Ile Trp
3620 3625 3630
Val Glu Lys Glu Glu Val Ala Lys Asn Gly Thr Asp Pro Val Glu
3635 3640 3645
Thr Cys Asp Arg Phe Thr Leu Lys Leu Ala Phe Leu Arg His Asp
3650 3655 3660
Gly Ser Trp Ser Ala Pro Trp Ser Tyr Asp Ile Thr Thr Gln Val
3665 3670 3675
Glu Ala Val Thr Asp Lys Lys Pro Asp Thr Glu Arg Leu Ala Leu
3680 3685 3690
Ala Ala Ser Gly Phe Gln Gly Glu Asp Thr Leu Leu Val Phe Val
3695 3700 3705
Tyr Lys Thr Gly Lys Ser Tyr Ser Asp Phe Gly Gly Ser Asn Lys
3710 3715 3720
Asn Val Ala Gly Met Thr Ile Tyr Gly Asp Gly Ser Phe Lys Lys
3725 3730 3735
Met Glu Asn Thr Ala Leu Ser Arg Tyr Ser Gln Leu Lys Asn Thr
3740 3745 3750
Phe Asp Ile Ile His Thr Gln Gly Asn Asp Leu Val Arg Lys Ala
3755 3760 3765
Ser Tyr Arg Phe Ala Gln Asp Phe Glu Val Pro Ala Ser Leu Asn
3770 3775 3780
Met Gly Ser Ala Ile Gly Asp Asp Ser Leu Thr Val Met Glu Asn
3785 3790 3795
Gly Asn Ile Pro Gln Ile Thr Ser Lys Tyr Ser Ser Asp Asn Leu
3800 3805 3810
Ala Ile Thr Leu His Asn Ala Ala Phe Thr Val Arg Tyr Asp Gly
3815 3820 3825
Ser Gly Asn Val Ile Arg Asn Lys Gln Ile Ser Ala Met Lys Leu
3830 3835 3840
Thr Gly Val Asp Gly Lys Ser Gln Tyr Gly Asn Ala Phe Ile Ile
3845 3850 3855
Ala Asn Thr Val Lys His Tyr Gly Gly Tyr Ser Asp Leu Gly Gly
3860 3865 3870
Pro Ile Thr Val Tyr Asn Lys Thr Lys Asn Tyr Ile Ala Ser Val
3875 3880 3885
Gln Gly His Leu Met Asn Ala Asp Tyr Thr Arg Arg Leu Ile Leu
3890 3895 3900
Thr Pro Val Glu Asn Asn Tyr Tyr Ala Arg Leu Phe Glu Phe Pro
3905 3910 3915
Phe Ser Pro Asn Thr Ile Leu Asn Thr Val Phe Thr Val Gly Ser
3920 3925 3930
Asn Lys Thr Ser Asp Phe Lys Lys Cys Ser Tyr Ala Val Asp Gly
3935 3940 3945
Asn Asn Ser Gln Gly Phe Gln Ile Phe Ser Ser Tyr Gln Ser Ser
3950 3955 3960
Gly Trp Leu Asp Ile Asp Thr Gly Ile Asn Asn Thr Asp Ile Lys
3965 3970 3975
Ile Thr Val Met Ala Gly Ser Lys Thr His Thr Phe Thr Ala Ser
3980 3985 3990
Asp His Ile Ala Ser Leu Pro Ala Asn Ser Phe Asp Ala Met Pro
3995 4000 4005
Tyr Thr Phe Lys Pro Leu Glu Ile Asp Ala Ser Ser Leu Ala Phe
4010 4015 4020
Thr Asn Asn Ile Ala Pro Leu Asp Ile Val Phe Glu Thr Lys Ala
4025 4030 4035
Lys Asp Gly Arg Val Leu Gly Lys Ile Lys Gln Thr Leu Ser Val
4040 4045 4050
Lys Arg Val Asn Tyr Asn Pro Glu Asp Ile Leu Phe Leu Arg Glu
4055 4060 4065
Thr His Ser Gly Ala Gln Tyr Met Gln Leu Gly Val Tyr Arg Ile
4070 4075 4080
Arg Leu Asn Thr Leu Leu Ala Ser Gln Leu Val Ser Arg Ala Asn
4085 4090 4095
Thr Gly Ile Asp Thr Ile Leu Thr Met Glu Thr Gln Arg Leu Pro
4100 4105 4110
Glu Pro Pro Leu Gly Glu Gly Phe Phe Ala Asn Phe Val Leu Pro
4115 4120 4125
Lys Tyr Asp Pro Ala Glu His Gly Asp Glu Arg Trp Phe Lys Ile
4130 4135 4140
His Ile Gly Asn Val Gly Gly Asn Thr Gly Arg Gln Pro Tyr Tyr
4145 4150 4155
Ser Gly Met Leu Ser Asp Thr Ser Glu Thr Ser Met Thr Leu Phe
4160 4165 4170
Val Pro Tyr Ala Glu Gly Tyr Tyr Met His Glu Gly Val Arg Leu
4175 4180 4185
Gly Val Gly Tyr Gln Lys Ile Thr Tyr Asp Asn Thr Trp Glu Ser
4190 4195 4200
Ala Phe Phe Tyr Phe Asp Glu Thr Lys Gln Gln Phe Val Leu Ile
4205 4210 4215
Asn Asp Ala Asp His Asp Ser Gly Met Thr Gln Gln Gly Ile Val
4220 4225 4230
Lys Asn Ile Lys Lys Tyr Lys Gly Phe Leu Asn Val Ser Ile Ala
4235 4240 4245
Thr Gly Tyr Ser Ala Pro Met Asp Phe Asn Ser Ala Ser Ala Leu
4250 4255 4260
Tyr Tyr Trp Glu Leu Phe Tyr Tyr Thr Pro Met Met Cys Phe Gln
4265 4270 4275
Arg Leu Leu Gln Glu Lys Gln Phe Asp Glu Ala Thr Gln Trp Ile
4280 4285 4290
Asn Tyr Val Tyr Asn Pro Ala Gly Tyr Ile Val Asn Gly Glu Ile
4295 4300 4305
Ala Pro Trp Ile Trp Asn Cys Arg Pro Leu Glu Glu Thr Thr Ser
4310 4315 4320
Trp Asn Ala Asn Pro Leu Asp Ala Ile Asp Pro Asp Ala Val Ala
4325 4330 4335
Gln Asn Asp Pro Met His Tyr Lys Ile Ala Thr Phe Met Arg Leu
4340 4345 4350
Leu Asp Gln Leu Ile Leu Arg Gly Asp Met Ala Tyr Arg Glu Leu
4355 4360 4365
Thr Arg Asp Ala Leu Asn Glu Ala Lys Met Trp Tyr Val Arg Thr
4370 4375 4380
Leu Glu Leu Leu Gly Asp Glu Pro Glu Asp Tyr Gly Ser Gln Gln
4385 4390 4395
Trp Ala Ala Pro Ser Leu Ser Gly Ala Ala Ser Gln Thr Val Gln
4400 4405 4410
Ala Ala Tyr Gln Gln Asp Leu Thr Met Leu Gly Arg Gly Gly Val
4415 4420 4425
Ser Lys Asn Leu Arg Thr Ala Asn Ser Leu Val Gly Leu Phe Leu
4430 4435 4440
Pro Glu Tyr Asn Pro Ala Leu Thr Asp Tyr Trp Gln Thr Leu Arg
4445 4450 4455
Leu Arg Leu Phe Asn Leu Arg His Asn Leu Ser Ile Asp Gly Gln
4460 4465 4470
Pro Leu Ser Leu Ala Ile Tyr Ala Glu Pro Thr Asp Pro Lys Ala
4475 4480 4485
Leu Leu Thr Ser Met Val Gln Ala Ser Gln Gly Gly Ser Ala Val
4490 4495 4500
Leu Pro Gly Thr Leu Ser Leu Tyr Arg Phe Pro Val Met Leu Glu
4505 4510 4515
Arg Thr Arg Asn Leu Val Ala Gln Leu Thr Gln Phe Gly Thr Ser
4520 4525 4530
Leu Leu Ser Met Ala Glu His Asp Asp Ala Asp Glu Leu Thr Thr
4535 4540 4545
Leu Leu Leu Gln Gln Gly Met Glu Leu Ala Thr Gln Ser Ile Arg
4550 4555 4560
Ile Gln Gln Arg Thr Val Asp Glu Val Asp Ala Asp Ile Ala Val
4565 4570 4575
Leu Ala Glu Ser Arg Arg Ser Ala Gln Asn Arg Leu Glu Lys Tyr
4580 4585 4590
Gln Gln Leu Tyr Asp Glu Asp Ile Asn His Gly Glu Gln Arg Ala
4595 4600 4605
Met Ser Leu Leu Asp Ala Ala Ala Gly Gln Ser Leu Ala Gly Gln
4610 4615 4620
Val Leu Ser Ile Ala Glu Gly Val Ala Asp Leu Val Pro Asn Val
4625 4630 4635
Phe Gly Leu Ala Cys Gly Gly Ser Arg Trp Gly Ala Ala Leu Arg
4640 4645 4650
Ala Ser Ala Ser Val Met Ser Leu Ser Ala Thr Ala Ser Gln Tyr
4655 4660 4665
Ser Ala Asp Lys Ile Ser Arg Ser Glu Ala Tyr Arg Arg Arg Arg
4670 4675 4680
Gln Glu Trp Glu Ile Gln Arg Asp Asn Ala Asp Gly Glu Val Lys
4685 4690 4695
Gln Met Asp Ala Gln Leu Glu Ser Leu Lys Ile Arg Arg Glu Ala
4700 4705 4710
Ala Gln Met Gln Val Glu Tyr Gln Glu Thr Gln Gln Ala His Thr
4715 4720 4725
Gln Ala Gln Leu Glu Leu Leu Gln Arg Lys Phe Thr Asn Lys Ala
4730 4735 4740
Leu Tyr Ser Trp Met Arg Gly Lys Leu Ser Ala Ile Tyr Tyr Gln
4745 4750 4755
Phe Phe Asp Leu Thr Gln Ser Phe Cys Leu Met Ala Gln Glu Ala
4760 4765 4770
Leu Arg Arg Glu Leu Thr Asp Asn Gly Val Thr Phe Ile Arg Gly
4775 4780 4785
Gly Ala Trp Asn Gly Thr Thr Ala Gly Leu Met Ala Gly Glu Thr
4790 4795 4800
Leu Leu Leu Asn Leu Ala Glu Met Glu Lys Val Trp Leu Glu Arg
4805 4810 4815
Asp Glu Arg Ala Leu Glu Val Thr Arg Thr Val Ser Leu Ala Gln
4820 4825 4830
Phe Tyr Gln Ala Leu Ser Ser Asp Asn Phe Asn Leu Thr Glu Lys
4835 4840 4845
Leu Thr Gln Phe Leu Arg Glu Gly Lys Gly Asn Val Gly Ala Ser
4850 4855 4860
Gly Asn Glu Leu Lys Leu Ser Asn Arg Gln Ile Glu Ala Ser Val
4865 4870 4875
Arg Leu Ser Asp Leu Lys Ile Phe Ser Asp Tyr Pro Glu Ser Leu
4880 4885 4890
Gly Asn Thr Arg Gln Leu Lys Gln Val Ser Val Thr Leu Pro Ala
4895 4900 4905
Leu Val Gly Pro Tyr Glu Asp Ile Arg Ala Val Leu Asn Tyr Gly
4910 4915 4920
Gly Ser Ile Val Met Pro Arg Gly Cys Ser Ala Ile Ala Leu Ser
4925 4930 4935
His Gly Val Asn Asp Ser Gly Gln Phe Met Leu Asp Phe Asn Asp
4940 4945 4950
Ser Arg Tyr Leu Pro Phe Glu Gly Ile Ser Val Asn Asp Ser Gly
4955 4960 4965
Ser Leu Thr Leu Ser Phe Pro Asp Ala Thr Asp Arg Gln Lys Ala
4970 4975 4980
Leu Leu Glu Ser Leu Ser Asp Ile Ile Leu His Ile Arg Tyr Thr
4985 4990 4995
Ile Arg Ser
5000
Claims (31)
1.分离的融合蛋白,其包含毒素复合体B类多肽和毒素复合体C类多肽,其中:
所述B类多肽是130-180kDa的增效剂,其含有与选自以下的序列具有至少40%同一性的氨基酸序列:
TcdB1(SEQ ID NO:5)、
TcdB2(SEQ ID NO:6)、
TcaC(SEQ ID NO:7)、
XptC1wi(SEQ ID NO:8)、
XptB1xb(SEQ ID NO:9)、
PptB1529(SEQ ID NO:10)和
SepB(SEQ ID NO:11);及;
所述的C类多肽是90-120kDa的增效剂,其含有与选自以下的序列具有至少35%同一性的氨基酸序列:
TccC1(SEQ ID NO:12)、
TccC2(SEQ ID NO:13)、
TccC3(SEQ ID NO:14)、
TccC4(SEQ ID NO:15)、
TccC5(SEQ ID NO:16)、
XptB1wi(SEQ ID NO:17)、
XptC1xb(SEQ ID NO:18)、
PptC1(长)(SEQ ID NO:19)、
PptC1(短)(SEQ ID NO:20)和
SepC(SEQ ID NO:21);及
所述B类和所述C类多肽能够增强230-290kDa的A类毒素复合体多肽的杀虫活性。
2.权利要求1的融合蛋白,其中所述融合蛋白进一步包含所述A类毒素复合体多肽,且所述融合蛋白具有毒素活性。
3.权利要求2的融合蛋白,其中所述A类多肽含有与选自以下的序列具有至少40%同一性的氨基酸序列:
XptA1wi(SEQ ID NO:22)、
XptA2wi(SEQ ID NO:23)、
TcbA(SEQ ID NO:24)、
TcdA(SEQ ID NO:25)、
TcdA2(SEQ ID NO:26),及
TcbA4(SEQ ID NO:27)。
4.权利要求1的融合蛋白,其中所述B类TC多肽与所述C类TC多肽通过接头序列融合。
5.权利要求1的融合蛋白,其中所述B类TC多肽与所述C类TC多肽直接融合,没有接头序列。
6.权利要求1的融合蛋白,其中所述B类TC多肽位于所述C类TC多肽的氨基末端。
7.权利要求1的融合蛋白,其中所述C类TC多肽位于所述B类TC多肽的氨基末端。
8.权利要求1的融合蛋白,其中所述融合蛋白包含选自SEQ IDNO:2、46、48、50、54、56和58的氨基酸序列。
9.权利要求2的融合蛋白,其中所述融合蛋白包含选自SEQ IDNO:60和SEQ ID NO:68的氨基酸序列。
10.权利要求1的融合蛋白,其中所述B类TC多肽的编码多核苷酸在严格条件下与选自SEQ ID NO:28-33的核酸分子的互补分子杂交,并且其中所述C类TC多肽的编码多核苷酸在严格条件下与选自SEQ IDNO:34-42的核酸分子的互补分子杂交。
11.权利要求2的融合蛋白,其中所述A类多肽的编码多核苷酸在严格条件下与选自SEQ ID NO:61-66的核酸分子的互补分子杂交。
12.权利要求2的融合蛋白,其中所述A类多肽与选自SEQ IDNO:22-27的氨基酸序列具有至少75%的序列同一性,所述C类多肽与选自SEQ ID NO:12-21的氨基酸序列具有至少75%的序列同一性,并且所述B类多肽与选自SEQ ID NO:5-11的氨基酸序列具有至少75%的序列同一性。
13.权利要求2的融合蛋白,其中所述B类多肽是TcdB2,且所述C类多肽是TccC3。
14.权利要求2的融合蛋白,其中所述B类TC多肽位于所述C类TC多肽的氨基末端侧,并且所述C类TC多肽位于所述A类TC多肽的氨基末端侧。
15.权利要求2的融合蛋白,其中所述融合蛋白包含接头序列。
16.权利要求2的融合蛋白,其中所述A类多肽与所述B类多肽直接融合,没有接头序列。
17.权利要求2的融合蛋白,其中所述A类TC多肽与所述B类多肽通过接头序列融合。
18.权利要求2的融合蛋白,其中所述B类TC多肽与所述C类TC多肽通过接头序列融合。
19.编码权利要求1的融合蛋白的分离的多核苷酸。
20.权利要求19的多核苷酸,其中所述多核苷酸与异源启动子有效连接。
21.编码权利要求2的融合蛋白的分离的多核苷酸。
22.包含权利要求19的多核苷酸的转基因细菌或者植物细胞。
23.包含权利要求22的植物细胞的种子。
24.产生权利要求1的融合蛋白的植物。
25.产生权利要求2的融合蛋白的植物。
26.阻止昆虫摄食植物的方法,其中所述方法包括提供有效量的A类TC多肽和权利要求1的融合蛋白给所述昆虫摄取。
27.阻止昆虫食取植物的方法,其中所述方法包括提供有效量的权利要求2的融合蛋白给所述昆虫摄取。
28.权利要求26的方法,其中所述植物产生所述蛋白质。
29.权利要求26的方法,其中所述方法包括将所述蛋白质应用于所述植物。
30.保护植物免受昆虫侵害的方法,其中所述方法包括在所述植物中产生有效量的A类、B类和C类毒素复合体蛋白,其中所述B类和C类TC蛋白翻译自单个开放阅读框。
31.产生权利要求1的融合蛋白的方法,其中所述方法包括将所述B类多肽和所述C类多肽连接。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US54950204P | 2004-03-02 | 2004-03-02 | |
US60/549,502 | 2004-03-02 | ||
US60/549,516 | 2004-03-02 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1950395A true CN1950395A (zh) | 2007-04-18 |
Family
ID=38019339
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 200580014048 Pending CN1950395A (zh) | 2004-03-02 | 2005-03-02 | 杀虫毒素复合体融合蛋白 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1950395A (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102066408B (zh) * | 2008-06-25 | 2015-07-01 | 阿森尼克斯公司 | 毒素基因及其使用方法 |
CN113648427A (zh) * | 2021-08-20 | 2021-11-16 | 山东大学 | 透明质酸-es2-af肽偶联物及其制备方法和应用 |
-
2005
- 2005-03-02 CN CN 200580014048 patent/CN1950395A/zh active Pending
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102066408B (zh) * | 2008-06-25 | 2015-07-01 | 阿森尼克斯公司 | 毒素基因及其使用方法 |
CN113648427A (zh) * | 2021-08-20 | 2021-11-16 | 山东大学 | 透明质酸-es2-af肽偶联物及其制备方法和应用 |
CN113648427B (zh) * | 2021-08-20 | 2023-07-28 | 山东大学 | 透明质酸-es2-af肽偶联物及其制备方法和应用 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1751125A (zh) | 混合并匹配tc蛋白质用于病虫害防治 | |
CN1849397A (zh) | 苏云金芽孢杆菌分泌的杀虫蛋白及其应用 | |
CN1708588A (zh) | Cot102杀虫棉花 | |
CN1761753A (zh) | δ-内毒素基因及其使用方法 | |
CN1073717A (zh) | 编码杀虫剂蛋白的基因,用该基因转化的禾本科植物及其制备方法 | |
CN1933723A (zh) | 玉米植株mon88017和组合物以及检测它们的方法 | |
CN1314911A (zh) | 用dna改组优化害虫抗性基因 | |
CN101068929A (zh) | 参与植物纤维发育的多核苷酸和多肽和使用它们的方法 | |
CN1296482C (zh) | 杀虫毒素和编码这些毒素的核苷酸序列 | |
CN1745173A (zh) | 能赋予除草剂抗性的基因 | |
CN1886513A (zh) | 昆虫抗性棉花植株及对其进行探测的方法 | |
CN1414973A (zh) | 苏云金芽孢杆菌(bacillus thur_ingiensis)的杀虫蛋白 | |
CN1332800A (zh) | 转化植物以表达苏云金芽孢杆菌δ-内毒素的方法 | |
CN1527663A (zh) | 新的杀虫毒素 | |
CN1555414A (zh) | 来源于植物的抗性基因 | |
CN1642977A (zh) | 新的苏云金芽孢杆菌杀昆虫蛋白质 | |
CN1408023A (zh) | 杀虫蛋白 | |
CN1341151A (zh) | 除草剂的靶基因和方法 | |
CN1468311A (zh) | 异麦芽寡糖合酶 | |
CN1649483A (zh) | Ice1,一种植物冷诱导转录组和耐寒调节剂 | |
CN1625562A (zh) | CRY1EA和CRY1CA的嵌合型δ-内毒素蛋白 | |
CN1684974A (zh) | 可获自类芽孢杆菌种的杀虫活性蛋白质及多核苷酸 | |
CN1950395A (zh) | 杀虫毒素复合体融合蛋白 | |
CN1195063C (zh) | 蛋白酶抑制剂融合蛋白 | |
CN1650018A (zh) | 昆虫抗性植物及产生该植物的方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Open date: 20070418 |