控制昆虫虫害的方法
本发明涉及通过使用毒素蛋白,在农作物植株中控制鳞翅目秆野螟属种类,优选地Ostrinia furnacalis(亚洲玉米螟)的方法,毒素蛋白可从苏云金芽孢杆菌(Bt)和/或其它芽孢杆菌属种类得到。
苏云金芽孢杆菌属于革兰氏阳性大组,为需氧,形成内生孢子的细菌。与亲缘关系很近的其它芽孢杆菌属种类,如蜡状芽孢杆菌或炭疽芽孢杆菌不同,到目前为止已知的苏云金芽孢杆菌菌种大部分都在形成孢子时产生类芽孢内含体,由于它的晶体结构,一般称为晶体。该晶体含有具杀昆虫活性的晶体毒素原蛋白,即δ-内毒素。
苏云金芽孢杆菌对昆虫的毒性是由于该蛋白晶体。在口器摄入晶体,晶体溶于靶昆虫的肠液之前,δ-内毒素不表现其杀昆虫活性。在大多数情况下,由昆虫消化道的蛋白水解酶作用引起蛋白酶剪切,导致从毒素原释放有效毒性成分。
不同苏云金芽孢杆菌菌株的δ-内毒素,其特征在于对某些靶昆虫,尤其对应于不同的鳞翅目,鞘翅目和双翅目的幼虫有高度特异性以及针对这些易感幼虫有高度活性。苏云金芽孢杆菌δ-内毒素的进一步优点在于该毒素对人类,其它哺乳动物,鸟和鱼是无害的。
根据它们的活性范围和序列相似性,已将来自苏云金芽孢杆菌的不同杀昆虫晶体蛋白分类。按Hofte和Whiteley,微生物学综述53:242-255(1989)提出的分类将已知的杀昆虫晶体蛋白分成四大类。一般地,大类按它们的活性范围来分,CryI蛋白对鳞翅目有效,CryII蛋白对翅鳞目和双翅目都有效,CryIII蛋白对鞘翅目有效,CryIV蛋白对双翅目有效。在每一大类中,根据序列相似性将δ-内毒素分组。
CryI蛋白典型地以130-140KDa毒素原蛋白合成,并经蛋白酶剪切产生大约60-70KDa大小的具杀昆虫活性的毒素蛋白。δ-内毒素的活性部分位于全长分子的氨基端部分。Hofte和Whiteley,同上将已知的CryI蛋白分成六组,IA(a),IA(b),IA(c),IB,IC和ID。从那以后,分类命名为CryIE,CryIF,CryIG,CryIH和CryIX的蛋白也得到鉴定。
来自苏云金芽孢杆菌的每个δ-内毒素的杀昆虫活性范围趋于十分狭窄,所给的δ-内毒素只对一些昆虫有效。特异性是由于产生活性毒素蛋白涉及的各个步骤的效率和其随后与昆虫消化道上皮细胞相互作用的能力。
本发明的目的之一是提供在植物中控制Ostrina furnacalis(亚洲王米螟)种类的方法,所述植物优选农作物,包括,但不限于玉米,小麦,黑麦,燕麦,水稻,高梁,黍和相关作物,饲草,竹和甘蔗等种类。此目的可以在本发明的范围内,通过向要保护的作物植株施用苏云金芽孢杆菌的毒素蛋白如CryI型毒素蛋白令人惊奇地得到实现。在本发明的另一个实施方案中,毒素蛋白可从芽孢杆菌属种类的无性培养物得到,所谓的无性杀昆虫蛋白(VIPs),如VIP3[EP-A 0 690 916;国际申请号EP95/03826,此处全文引用此公开文件作为参考],也可以用于控制Ostrinia furnacalis(亚洲玉米螟)虫害。
因此,本发明涉及保护植物及其后代免受由Ostrinia furnacalis(亚洲玉米螟)种类引起的损害的方法,包括向所要保护的植物或植物种子或植物生长区域直接或间接施用纯的或者以至少含一种所述蛋白或微生物的杀昆虫组合物形式施用芽孢杆菌种类的毒素蛋白,优选地上述CryI型或VIP型蛋白,上述微生物优选含至少一个编码毒素蛋白的毒素基因的苏云金芽孢杆菌和/或蜡状芽孢杆菌菌株。在根据本发明的方法中使用的上述微生物可以是或者天然存在的菌株,或者可选择地,含编码毒素的重组基因的重组菌株。
在优选的实施方案中,使用转基因植物给要保护的植物施用毒素以抵抗由Ostrinia furnacalis(亚洲玉米螟)引起的损害。这样的植物可通过转化编码芽孢杆菌属种类的杀昆虫毒素蛋白的毒素基因得到,上述杀昆虫毒素蛋白是比如Cry型,优选地CryI型毒素蛋白或VIP型蛋白,并且在上述昆虫虫害出现的地区种植这样转化的植物时,转基因植物以足够提供针对Ostrinia furnacalis(亚洲玉米螟)种类的控制的量表达上述毒素蛋白。
在根据本发明的方法中使用的保护农作物植株以抵抗Ostriniafurnacalis(亚洲玉米螟)虫害的杀昆虫组合物,举例来说包含作为活性成分的至少一种Cry型毒素蛋白,更优选地至少一种CryI型毒素蛋白,甚至更优选的至少一种CryIA型毒素蛋白,尤其优选地至少一种CryIA(b)型毒素蛋白,尤其最优选地至少一种根据SEQ ID NOs53-55的CryIA(b)型毒素蛋白,甚至更优选地含至少一个编码上述毒素蛋白的基因的苏云金芽孢杆菌或微生物,优选地含至少一个编码上述毒素蛋白的基因的苏云金芽孢杆菌菌株,或其衍生物或突变株,结合农学上的佐剂如载体,稀释剂,表面活性剂或施用促进佐剂。在杀昆虫组合物中包含的活性成分也可以是在EP-A-0 690 916和PCT国际申请号EP95/03826中公开的VIP型毒素蛋白或CryI型和VIP型蛋白结合。在保护的范围内优选的是VIPI型蛋白,比如VIP1A(a)蛋白或VIP1A(b)蛋白,或VIP2型蛋白,比如VIP2A(a)蛋白或VIP2A(b)蛋白,或VIP1型蛋白和VIP2型蛋白的结合,或VIP3型蛋白,比如VIP3A(a)蛋白或VIP3A(b)蛋白。
在保护范围内更优选的是在SEQ ID NOs:1,2,4-7,17-24,26-32,35,36,39,40,42,43,45,46,49,50,51或52中表示的VIP型毒素蛋白。
组合物也可以包含进一步的生物活性化合物。上述化合物可以是肥料或微量营养物供体或其他影响植物生长的制品。其也可以是选择的除草剂、杀昆虫剂,杀真菌剂,杀细菌剂,杀线虫剂、杀软体动物剂或几种这样制品的混合物,如果需要,可进一步与在配制技术中常规使用的农学上可接受的载体,表面活性剂或施用促进佐剂结合。适当的载体和佐剂可以是固体或液体,并对应配制技术中一般使用的物质,例如天然的或再生的矿物质,溶剂,分散剂,润湿剂,粘着剂,粘合剂或肥料。
按重量比,组合物可以包括0.1-99%的活性成分,1-99.9%的固体或液体佐剂,和0-25%表面活性剂。活性成分或含上述活性成分的组合物可以与某一其它杀昆虫剂或化学品(1993农作物保护化学品参考手册,化学品和药品出版社,加拿大)结合施用给要保护植物或农作物而不失去效力。其可以与大多数其它通用的农业喷洒物质相容,但如果涉及CryI型毒素,则不应该在强碱性喷洒溶液中使用。其可以以粉末,悬浮液,可湿性粉末或任何对农业应用适当的其它物质形式施用。
活性成分,即优选地苏云金芽孢杆菌的CryI型毒素蛋白和/或前述的任一VIP型蛋白,或含上述活性成分的组合物可以施用给(a)可能有昆虫虫害的环境,(b)植物或植物的一部分,为了保护该植物或植物的一部分免受昆虫虫害引起损害,或(c)种子,为了保护从该种子长出的植株免受虫害的损害。
在植物保护区域内施用的优选方法是向植物的叶施用(叶敷),施用的数量和施用的频率根据要保护的植物种类和受该虫害侵袭的危险度。
在根据本发明的方法中使用的组合物也适于保护植物繁殖材料,例如种子,比如果实,块茎或籽粒,或植物插条,免受昆虫虫害。可以在种植前用配制品处理繁殖材料,举例来说,种子可以在播种前拌药。本发明的活性成分也可以通过用液体配制品浸泡籽粒或者通过用固体配制品覆盖籽粒施用给籽粒(涂盖层)。配制物也可以在种植繁殖材料时,施用在种植位置,例如播种时的种子梨沟。本发明也涉及处理植物繁殖材料的这些方法和这样处理的植物繁殖材料。
在本发明的范围内,组合物可以按任何已知的用细菌菌株处理种子或土壤的方法施用。例如,见美国专利号4,863,866。菌株即使在微生物不存活的情况也对生物控制有效。然而优选地,施用存活的微生物。
在本发明的范围内所要保护的靶农作物是那些Ostrinia furnacalis(亚洲玉米螟)的宿主植物,包括但不限于玉米、小麦、大麦、黑麦、燕麦、水稻、高梁、黍和相关农作物,饲草、竹和甘蔗。
根据本发明的活性成分可以以未修饰的形式或与任何适当的农业上可接受的载体结合使用。这样的载体是在农业配方技术中常规使用的佐剂,并且由此按已知方法配制或可乳化的浓缩物,可覆盖膏剂,直接可喷洒的或可溶的溶液,稀释乳剂,可湿性粉末,可溶粉末,药粉,颗粒,以及胶囊,例如以聚合物形式。正如组合物的性质,施用方法,如喷洒、喷雾、撒粉、分散或浇灌,也可根据预期目的和普遍情况选择。有效施用的比率从每公顷(“ha”大约2.471英亩)大约50g到大约5kg活性成分(a.i.)之间变动,优选地从大约100g到大约2kga.i./ha。施用的优选比率是200g到大约1kga.i./ha)或200g-500ga.i./ha。
对于拌种,有效施用的比率从每100kg种子0.5g到1000ga.i.之间变动,优选地从每100kg种子3g到100ga.i.。最优选的是施用比率从每100kg种子10g到50ga.i.。
适当载体和佐剂可以是固体或液体,并对应在配制技术中一般使用的物质,例如天然或再生的矿物质,溶剂,分散剂,润湿剂,粘着剂,粘合剂或肥料。配制物,即杀昆虫组合物,制品或其与其它活性成分和适当的固体或液体佐剂的混合物,可按已知的方法制备,例如通过将活性成分与填料,例如溶剂,固体载体和某些情况下的表面活性化合物(表面活性剂)均匀混合和/或研磨。
合适的溶剂是:芳香烃,优选地含8-12个碳原子的组分,例如二甲苯混合物或取代的萘,邻苯二甲酸酯如邻苯二甲酸二丁酯或邻苯二甲酸二辛酯,脂族烃如环己烷或链烷烃,醇和二元醇和它们的醚和酯,如乙醇,1,2-亚乙基二醇单甲基或单乙基醚,酮如环己酮,强极性溶剂如N-甲基-2-吡咯烷酮,二甲基亚枫或二甲基甲酰胺,和植物油或环氧化植物油如环氧化椰子油或豆油;或水。
所用的固体载体,例如对于药粉和可分散粉末,一般是天然矿物填料如方解虫,滑石粉,高岭土,蒙脱土或硅镁土。为了提高物理特性,也可能加高度分散的硅酸或高度分散的吸附剂聚合物。适当的颗粒吸附载体是多孔型,例如浮石,碎砖土,海泡石或皂土;适当的非吸附载体是诸如方解石或砂的材料。另外大量的具无机或有机性质的预先形成颗粒的材料也可使用,例如,尤其是白云石或粉状植物残渣。
根据要配制的活性成分的性质,适当的表面活性化合物是有很好乳化,分散和湿润性质的非离子,阳离子和/或阴离子表面活性剂。术语“表面活性剂”也将理解为包括表面活性剂混合物。适当的阴离子表面活性剂可以是水溶的肥皂和水溶的合成的表面活性化合物。适当的肥皂是高级脂肪酸的碱金属盐,碱土金属盐或未取代的或取代的铵盐(C10-C22),例如,油酸或硬脂酸的钠盐或钾盐,或例如,可从椰子油或牛羊油得到的天然脂肪酸混合物。更适当的表面活性剂也可是脂肪酸甲基氨基乙磺酸盐及修饰的和未修饰的磷脂。
然而,更经常地,使用所谓的合成的表面活性剂,尤其是脂肪族磺酸盐,脂肪族硫酸盐,磺化苯并咪唑衍生物或烷芳基磺酸盐。脂肪族磺酸盐或硫酸盐常常以碱金属盐,碱土金属盐或未取代的或取代的铵盐形式,并且一般含有C8-C22烷基,也包括酰基的烷基组分,例如木素磺化酸和十二烷基硫酸的钠盐或钙盐;或可从天然脂肪酸得到的脂肪族醇硫酸盐的混合物。这些化合物包括脂肪族醇/环氧乙烷加合物的硫酸盐和磺酸盐。磺化苯并咪唑衍生物优选地含2个磺酸基团和一个含大约8-22个碳原子的脂肪酸基团。烷芳基磺酸盐的例子是十二烷基苯磺酸,二丁基萘磺酸或萘磺酸/甲醛缩合产物的钠盐,钙盐或三乙醇胺盐。相应的磷酸盐也是适合的,例如对-壬基酚与4-14摩尔环氧乙烷的加合物的磷酸酯盐。非离子表面活性剂优选地是脂族醇或环脂族醇的聚乙二醇醚衍生物,或饱和的或非饱和的脂肪酸和烷基酚,上述衍生物在脂肪烃组分中含3-30个乙二醇醚基团和8-20个碳原子中并在烷基酚的烷基组分中含6-18个碳原子。
更适当的非离子表面活性剂是聚环氧乙烷与在烷基链中含1-10个碳原子的聚丙二醇,乙二胺聚丙二醇和烷基聚丙二醇的水溶加合物,该加合物含20-250个乙二醇醚基团和10-100丙二醇醚基团。这些化合物每个丙二醇单位通常含1-5个乙二醇单位。非离子表面活性剂的代表性的例子是壬基苯基聚乙氧基乙醇,蓖麻油聚乙二醇醚,聚环氧丙烷/环氧乙烷加合物,三丁基苯氧基聚乙氧基乙醇,聚乙二醇和癸基苯氧基聚乙氧基乙醇。聚氧化乙烯山梨聚糖的脂肪酸酯,如聚氧化乙烯山梨聚糖三油酸酯,也是合适的非离子去污剂。
阳离子去污剂优选地是季胺盐,包括如N-取代基,至少一个C8-C22烷基,及进一步取代基,低级的非取代的或卤代的烷基,苄基或羟基-低级烷基。该盐优选地以卤化物,硫酸二甲酯或硫酸二乙酯的形式,例如氯化硬脂酰三甲基胺或溴化苄基二-(2-氯乙基)乙基铵。
在配制技术中习惯使用的表面活性剂在例如以下文献中描述:“McCutcheon的去污剂和乳化剂年鉴”MC出版公司。Ridgewood,N.J.,1979;Dr.Helmut Stache,“Tensid Taschenbuch”(表面活性剂手册),Carl Hanser Verlag,Munich/vienna。
本发明的杀昆虫组合物另一特别优选的特征是当施用于植物和土壤时活性成分的持久性。引起活性丧失的可能原因包括由于紫外光,热,叶分泌物和PH的失活。例如,在高PH,尤其存在还原剂时,δ-内毒素晶体溶解,从而变得更容易蛋白水解失活。叶的高PH值也可能是重要的,特别是在叶表面PH可以在8-10的范围内的地方。在根据本发明的方法中使用的杀昆虫组合物的配方可以通过加入添加剂帮助防止活性成分的丧失或把物质包在胶囊内,以此方法保护活性成分免于失活来解决这些问题。胶囊化可以用化学方法McGuire和Shasha,J Econ Entomol85:1425-1433,1992)或生物方法(Barnes和Cunmings,1986;EP-A 0 192 319)进行。化学胶囊化涉及用聚合物包裹活性成分的步骤,而生物胶囊化涉及在在微生物中表达δ-内毒素基因,对于生物胶囊化,在配方中使用包含毒素蛋白的完整的微生物作为活性成分。加紫外线防护剂可以有效地减少辐射损伤。由于热引起的失活也可以通过加入合适的添加剂控制。
如果可能,在本发明中优选的是包括存活的微生物作为活性成分的配方,上述微生物或者以无性细胞的形式,或者更优选地以孢子的形式。适当的配方可以包含,例如与多价阴离子交联的聚合物胶,并包含这些微生物。例如由D.R.Frarel等在植物病理学,Vol.75,No.7,774-777,1985中描述藻酸盐作为聚合物材料。从此公开文本中也可知道载体材料可以一起被使用。这些配方可以按以下方式制备:通过将天然存在或合成的成胶聚合物,例如藻酸盐和多价金属离子盐水溶液混合,以形成单个液滴,从而对于微生物,有可能悬浮在两个反应溶液之一或在两个反应溶液中。胶的形成从以液滴形式混合开始。随后可能是这些胶颗粒的干燥。这个过程称为离子胶凝作用。根据干燥程度,形成了紧密和坚硬的聚合物颗粒,其在结构上通过多价阳离子交联,并包含普遍均匀分布的微生物和载体。颗粒大小可以长达5mm。
在EP-A1-0 097 571中描述了以部分交联的多聚糖及微生物为基础的组合物,其例如也可以包含精细分开的硅酸作为载体材料;例如通过Ca++离子发生交联。该组合物具有不超过0.3的水活度。W.J.Cornick等在综述文章[生物控制中的新动向:抑制农业上虫害和疾病的改进方法,345-372页,Alan R.Liss,Inc.(1990)]中描述了不同的配方系统,用蛭石作为载体的颗粒和通过所述的离子胶凝作用方法制备的紧密的藻酸盐珠粒。在杀昆虫配方和应用系统中这样的组合物也由D.R.Fravel公开:11卷,ASTM STP1112美国测试和材料学会,Philadelphia,1992,173-179页,并可以用来配制根据本发明的重组微生物。为配制存活微生物的进一步的方法在WO96/02638中描述。
根据本发明的组合物即使在低的施用比率时,也对虫害控制领域的预防性和/或治疗性处理有益,同时它对于温血种类,鱼和植物是耐受的和无毒性的,并具有很适当的杀生物剂范围。根据本发明的组合物可有效地抵抗Ostrinia furnacalis(亚洲玉米螟)虫害的所有或单独发育阶段。根据本发明化合物的杀昆虫作用可以直接显现出来,即立即消灭虫害,或者仅在过了一段时间之后显现。
上述组合物可以以包含基本上纯的毒素蛋白的化学混合物的形式或以包含至少一种作为微生物或转基因植物的一部分的毒素蛋白的混合物形式提供。
在本发明特定的实施方案中,活性成分之一可以直接地通过,例如如本文前述的叶面喷洒施用给植物,而另一活性原理可以由植物本身通过表达事先转化的编码上述另一原理的基因提供。
在根据本发明的方法中使用的杀昆虫组合物通常含从大约0.1%到大约99%,优选地从大约0.1至大约95%,最优选从大约3至90%的活性成分;从大约1至大约99.9%,优选地从大约1至大约99%,最优选地从大约5到大约95%的固体或液体佐剂;和从大约0至大约25%,优选地大约0.1到大约25%,最优选地从大约0.1到大约20%的表面活性剂。
同时商业产品优选地配制成浓缩物,而最终使用者一般将使用实际上更低浓度的稀释配制液,杀昆虫组合物也可能包含进一步成分,比如稳定剂,消泡剂,粘度调节剂,粘合剂,粘着剂,肥料或其它活性成分以得到特殊的效果。
本发明也涉及包含存活微生物为活性成分的配方,如果可能,上述微生物以无性细胞或更多地以孢子形式存在。
本发明的进一步目的涉及在控制农作物植株免受由Ostriniafurnacalis(亚洲玉米螟)种类引起的损害的方法中重组微生物的应用。该重组微生物含编码苏云金芽孢杆菌毒素蛋白如CryI型蛋白的毒素基因,该重组微生物或者直接施用给要保护的植物,或者重组合成的毒素蛋白在施用给要保护的农作物植株前先从重组微生物分离并如上述所配制。重组微生物也可以包含编码在EP-A-690 916和国际申请号NOEP95/03826中公开的VIP型毒素蛋白的毒素基因或者分别编码至少一种Cry型毒素和VIP型毒素的基因的组合。
为了在宿主生物中重组合成毒素蛋白,编码序列可以插入到为所选宿主设计的表达元件中并导入宿主,在宿主中重组合成。选择适合于所选宿主的特异调节序列如启动子,信号序列,5’和3’非翻译序列,和增强子在本领域从业者的技术水平之内。所得的分子,包含了适当的阅读框架中连接的特定元件,插入到能够转化到宿主细胞中的载体中。对于各种宿主生物适合的表达载体和重组合成蛋白的方法已众所周知。上述宿主生物包括如大肠杆菌[见,例如Studier and Moffatt,分子生物学杂志189:113(1986);Brosius,DNA 8:759(1989)],酵母[见,例如Schneiderand Guarente,酶学方法194:373(1991)]和昆虫细胞[见,例如,Luckow and Summers,生物/生物技术6:47(1988)]。特异的例子包括质粒如pBluescript(Stratagene,La Jolla,CA),pFLAG(International Biotechnologies,Inc.,New Haven,CT),pTrcHis(Invitrogen,La Jolla,CA)和杆状病毒表达载体,例如那些从苜蓿丫纹夜蛾核型多角体病毒基因组派生的载体。优选的杆状病毒/昆虫系统是PVI11392/SF21细胞(Invitrogen,La Jolla,CA)。
重组合成的毒素蛋白可以使用一系列标准技术分离并纯化。可以使用的有效技术将根据所用的宿主生物,毒素蛋白是否设计为分泌,和其它本领域技术人员意识到的这样的因素变动[见,例如Ausubel,F.等“分子生物学常规技术”,由John Wiley & Sons公司出版,(1994)第16章]。
本发明优选的目的涉及在保护农作物植株免受由Ostrinia furnacalis(亚洲玉米螟)虫害引起的损害的方法中转基因植物的应用,该转基因植物包含并以足以提供针对Ostrinia furnacalis(亚洲玉米螟)种类的控制的量表达编码苏云金芽孢杆菌的毒素蛋白,特别地CryI型毒素蛋白的毒素基因。植物可以是核转化或质粒转化的结果(见WO95/24492)。特别优选的是表达苏云金芽孢杆菌的CryIA(6)毒素蛋白的转基因植物。本发明也涉及包含编码如在EP-A-690 916和国际申请号EP95/03826(此处全文引用作为参考)中描述的VIP型蛋白的毒素基因的转基因植物的应用。本发明也涉及包含和表达编码苏云金芽孢杆菌的毒素蛋白,但尤其是Cry型毒素蛋白的毒素基因的,和也包含并足够提供针对Ostriniafurnacalis(亚洲玉米螟)种类的控制的量表达编码VIP型蛋白的毒性基因的转基因植物的应用。表达上述毒素基因的宿主植物将具有增强的对Ostrinia furnacalis(亚洲玉米螟)种类昆虫侵袭的抗性并将更好地减少与这种侵袭相关的农作物损失。
在一个优选的实施方案中,在转基因植物中表达一种或多种Bt δ-内毒素伴随表达一种或多种VIP型蛋白。在同一转基因植物中共表达多于一种杀昆虫原理可以通过遗传操作植物以包含并表达所有必须基因实现。可选择地,一种植物,亲本1,可以经遗传工程操作以表达VIP型蛋白。另一植物,亲本2,可以经遗传工程操作以表达Bt δ-内毒素。通过亲本1和亲本2的杂交,得到的子代表达所有导入亲本1和亲本2的基因。特别优选的Bt δ-内毒素是在EP-A 0618976中公开的那些,此处引用作为参考。
本发明包括含编码能在中度严格条件下与编码芽孢杆菌属种类毒素蛋白的DNA分子,但优选地与可从上述DNA分子得到的寡核苷酸探针杂交的DNA分子的基因的重组微生物或转基因植物的应用,其中上述探针长至少10个核苷酸包含上述毒素蛋白的编码序列的连续部分。本发明优选地包括编码与编码苏云金芽孢杆菌或蜡状芽孢杆菌的毒性蛋白的DNA分子,特别地与编码Cry型蛋白的DNA分子或编码VIP型毒素蛋白,优选地CryIA(b)蛋白的毒素基因杂交的DNA分子的基因的重组微生物或转基因植物的应用。
影响杂交稳定性的因素决定杂交的严格度。这样的因素之一是解链温度Tm,Tm可以容易根据公式得到计算,公式在DNA探针,GeorgeH.Keller和Mark M.Manak,Macmillan Publisher Ltd,1993,第一部分:分子杂交技术;第8页中提供。
优选的杂交温度在低于计算解链温度Tm大约25℃的范围内,并且优选地在低于计算解链温度Tm大约12-15℃的范围内,并且在寡核苷酸的情况下在低于解链温度Tm大约5-10℃的范围内。
本发明进一步涉及用于在农作物植株中控制Ostrinia furnacalis(亚洲玉米螟)虫害,包含转基因植物种子及可靠的使用说明的商品包,上述转基因植物包含至少一个毒素基因,并以足以提供针对Ostriniafurnacalis(亚洲玉米螟)种类的控制的量表达该毒素基因,该毒素基因编码苏云金芽孢杆菌的毒素蛋白,优选地Cry型毒素蛋白,更优选地CryI型毒素蛋白,但最优选地CryIA型毒素蛋白。在本发明内优选的是包括包含编码至少一个Cry型毒素蛋白或VIP型毒素蛋白的基因作为活性成分的转基因植物种子的商品包。特别优选地是CryIA(b)毒素蛋白与VIP型蛋白的组合。
本发明进一步的目的是用于在农作物植株中控制Ostrinia furnacalis(亚洲玉米螟)虫害,包括根据本发明的杀昆虫组合物及其可靠的使用说明的商品包。
对于植物,是指任何可以通过本领域已知的方法进行基因转化的植物种类,但尤其是Ostrinia furnacalis(亚洲玉米螟)种类的宿主植物的那些植物,包括,但不限于以下植物种类:玉米、小麦、大麦、黑麦、燕麦、水稻、高粱,黍和相关作物,饲草、竹(果园草,羊茅,等等),和甘蔗。
在本领域已知的植物转化的方法在下面讨论。作为目的作物的宿主植物包括,但不限于以上列出的那些种类。
本发明进一步涉及包含编码苏云金芽孢杆菌的毒素蛋白的基因并以足够提供针对Ostrinia furnacalis(亚洲玉米螟)种类的控制的量表达该毒素蛋白的转基因植物的种子,以及包括该种子的商品包。
对于植物,是指是Ostrinia furnacalis(亚洲玉米螟)宿主的任何植物种类,包括,但不限于玉米、小麦、大麦、黑麦、燕麦、水稻、高粱、黍和相关作物,饲草,竹和甘蔗。
已发现天然苏云金芽孢杆菌毒素基因的密码用法与植物基因的典型用法明显不同。特别地,天然苏云金芽孢杆菌基因的密码子用法与玉米基因的非常不同。因此,这个基因mRNA可能不能有效的利用。密码子用法可能在翻译或转录或mRNA加工水平上影响基因的表达。为使毒素蛋白在植物中,例如在玉米中最佳表达,通过在合成编码与天然毒素基因序列编码的相同的蛋白的合成基因中使用在玉米中最优选的密码子使密码子用法最佳化。最佳化的优选的玉米密码子用法对高水平表达Bt杀昆虫蛋白有效。构建玉米最佳化的合成的毒素基因的进一步细节可在WO/3/07278中得到,此处全文引用作为参考。
来自微生物的毒素基因也可能与植物基因不同。植物基因与微生物中发现的基因不同之处在于,它们转录的RNA不产生邻近起始甲硫氨酸的特定的核糖体结合位点序列。所以,微生物基因可以通过在ATG处插入真核生物保守的翻译起始区加强表达。Clontech(1993/1994目录,210页)建议序列GTCGACC
ATGGTC作为在植物中表达大肠杆菌uidA基因的保守的翻译起始区。另外,Joshi[核酸研究15:6643-6653(1987)]比较了一些邻近ATG的植物序列并证明了保守区TAAACA
ATGGCT。在植物中表达微生物ORFs遇到困难的情况下,在起始ATG处插入一个这样的序列可能改进翻译。由于第二个氨基酸残基的修饰,在这种情况下不适合将保守区的后三个核苷酸包含在修饰序列中。优选的邻近起始甲硫氨酸的序列可能在不同的植物种类之间不同。按在GenBank/EMBL数据库中提供的玉米基因的序列,可以发现哪些邻近ATG的核苷酸应该被修饰以增强转化入玉米中的毒素基因的翻译。
另外,已表明去除不合理剪切位点可以增强导入基因的表达和稳定性。从非植物来源克隆的,在植物中不适于表达的基因可能包括一些模式,这些模式能在植物中被识别为5′或3′剪切位点。因而,转录过程可能未成熟中止,产生截短的或缺失的mRNA。毒素基因可以被加工以去除这些不合理剪切位点。
一些苏云金芽孢杆菌的δ-内毒素蛋白表达为毒素原。这些毒素原在昆虫胃的碱性环境中溶解并由蛋白水解酶剪切成有毒核心片段[Hofte和whiteley,微生物综述,53:242-245(1989)]。对于CryI类的δ-内毒素蛋白,有毒核心片段位于毒素原的N端部分。在带给宿主植物杀昆虫特性的植物转化载体中使用的编码或者全长毒素原形式或完整毒素蛋白截短的有毒核心片段的基因都在本发明的范围之内。
重组DNA分子可以用很多本领域公认的方法导入植物细胞。本领域技术人员将同意,方法的选择可能依赖于要转化的靶植物类型,即单子叶或双子叶。转化植物细胞的适当方法包括微注射[Crossway等,生物技术4:320-334(1986)],电穿孔[Riggs等,美国国家科学院院报83:5602-5606(1986)],土壤农杆菌介导的转化[Hinchee等,生物技术6:915-921(1988)],直接基因转化[Paszkowski等,EMBO J.3:2717-2722(1984)],和弹粒加速,使用从Agracetus,Inc.,Madison,Wisconsin和Dupont,Inc.,Wilmington,Delaware可得到的仪器[见,例如,Sanford等,美国专利4,945,050;和McCabe等,生物技术6:923-926(1988)]。也见,Weissinger等,遗传学年鉴22:421-477(1988);Sanford等,微粒科学和技术5:27-379(1987)(离子);Christou等,植物生理学87:671-674(1988)(大豆);McCabe等,生物/生物技术6:923-926(1988)(大豆);Datta等,生物/生物技术8:736-740(1990)(水稻);Klein等,美国国家科学院院报,85:4305-4309(1988)(玉米);Klein等,生物/生物技术6:559-563(1988)(玉米);Klein等,植物生理学91:440-444(1988)(玉米);Fromm等,生物/生物技术8:833-839(1990);和Gordon-Kamm等,植物细胞2:603-618(1990)(玉米);Svab等,美国国家科学院院报87:8526-8530(1990)(烟草原生质体);Koziel等,生物技术11:194-200(1993)(maize);Shimamoto等,自然338:274-277(1989)(水稻);Christou等,生物技术9:957-962(1991)(水稻);欧洲专利申请EP 0 332 581(果园草和其它Pooideae);Vasil等,生物技术11:1553-1558(1993)(小麦);Weeks等,植物生理学102:1077-1084(1993)(小麦);Wan等,植物生理学104:37-48(1994)(大麦);Umbeck等,生物/生物技术5:263-266(1987)(棉花)。
特别优选的通过微粒轰击将重组DNA分子导入玉米的一系列实施方案可在WO93/07278中找到,此处全文引用作为参考。另一优选的实施方案是在申请EP-A-292 435中公开的玉米的原生质体转化方法,此处全文引用作为参考。
上述导入转基因种子和植物的基因特性通过性繁殖或无性生长传代,因此也可以在子代植物中维持和繁殖。一般地上述维持和繁殖使用已知用于适应特定目的如耕地,播种,或收获的农业方法。也可以使用特殊的方法如水载培或温室技术。因为正在生长的农作物易于受到昆虫或侵染的侵袭和损害及杂草植物的竞争,采取控制杂草,植物疾病,昆虫,线虫,和其它不良条件的办法以提高产量。这些包括机械方法如耕耘土壤或除去杂草和侵染的植物,及使用农业化学品如除草剂,杀真菌剂,杀配子剂,杀线虫剂,生长调节剂,催热剂和杀昆虫剂。
根据本发明的转基因植物和种子的有益遗传特性可进一步在植物育种中使用,遗传育种目的在于发展具改良特性的植物,如对虫害,除草剂或逆境的耐受性,改进营养价值,增加产量,或改进结构减少倒伏。各种育种步骤以良好管理的人类干预为特征,如选择要杂交的株系,介导亲本株系的传粉,或选择适当的子代植株。根据所需特征,采用不同的育种方法。相关技术在本领域内已知,包括但不限于杂交,近亲交配,回交育种,多系育种,多种混合,种间杂交,非整倍体技术等。杂交技术也包括通过机械,化学或生化方法,使植物不育以产生雄性或雌性不育植株。雄性不育植株与不同株系的花粉交叉授粉确保了雄性不育植株的基因组,而雌性可育植物将均等得到两个亲本株系的特征。这样,根据本发明的转基因种子和植物可以用于培育改良的植物株系,该株系例如提高常规方法如除草剂或杀虫剂处理的效率,或由于它们的改良的遗传特征用上述方法推广。可选择地,可以得到具改进的逆境抗性的新作物,由于它们最佳化的遗传“元件”,产出比不能耐受相应不良发育条件的产品更高品质的收获产品。
种子生产中,种子的发芽质量和均一性是必须的产品特性,而被农民收获和售出的种子的发芽质量和均一性并不重要。因为很难控制作物不受其他作物和杂草种子的影响,为了控制种生疾病和生产良好发芽的种子,种子生产者已发展了相当广泛和很好管理的种子生产方法,而种子生产者在生长,调控和市场销售纯种的领域内有经验。常用方法是农民购买满足特定品质标准的已检测的种子,而不是使用从自己的作物收获的种子;使用的繁殖材料如种子,常规地用保护性涂层处理,保护性涂层包括除草剂,杀昆虫剂,杀真菌剂,杀细菌剂,杀线虫剂,杀配子剂或它们的混合物。常规使用的保护性涂层包括诸如克菌丹,莠锈灵,福美双(TMTD_),methalaxyl(Apron_)和;虫螨磷(Actellic_)的化合物。如果需要,这些化合物进一步与其它在配制技术中常规使用的载体,表面活性剂或施用促进佐剂一起配制以提供保护抵抗细菌、真菌或动物虫害引起的损害。保护性涂层可以通过用液体配制物浸渍繁殖材料或通过用组合的湿或干的配制物涂敷施用。其它施用方法也是可能的,如直接在花芽或果实上处理。
本发明的另一方面是提供新的农业方法,如以上列举的方法,此方法的特征在于使用根据本发明的转基因植物,转基因植物材料,或转基因种子以提供针对Ostrinia furnacalis(亚洲玉米螟)的控制。
为从根据本发明的方法转化的植株繁育子代,可以使用如以下的方法:如本领域已知的,如在下述实施例中描述的得到的玉米植物在温室的盆中或在土壤中生长,并使之开花。从成熟雄花穗得到花粉,并用于给同一植株,同胞植株或任何所选的玉米植株的雌花穗传粉。按此方法得到的转化子代可以通过导入基因和/或伴随DNA(基因型)的存在,或带来的表现型与未转化的子代区别开。转化子代可以相似地自交或与其它植物杂交,一般与带有所需品质的任意植物杂交。相似地,烟草或其它按此方法得到转化植物可以如在本领域已知的自交或杂交,以产生具有所需特性的子代。相似地,通过结合本领域已知的方法和本发明产生的其它转基因植物可以如在本领域已知的培育以产生具所需特征的子代。
实施例
以下实施例进一步描述了为完成本发明的特定实施方案使用的材料和方法。它们通过说明的方法提供,并不应理解为限制本说明书的公开。
实施例1:一般方法
DMA操作使用在本领域常规应用的方法进行。这些方法可以在基本上不改变结果的情况下修饰和/或改变。除了其它已说明的文献,一些方法在常用的书本如Sambrook等,分子克隆:实验手册,冷泉港实验室出版社,第二版,1989中描述。
实施例2:植物转化载体
植物转化使用在WO 93/07278和Koziel等(1993)[生物技术Vol11 194-200]中描述的转化载体PCIB4431和PCIB3064进行,两个公开文献在此处引用作为参考。
PCIB4431是设计以转化玉米的载体。它包含两个嵌合的合成的在玉米能表达的Bt CryIA(b)内毒素基因,其中之一构建了PEP羧化酶启动子/合成的CryIA(b)基因,另一为花粉启动子/合成的CryIA(b)基因。
PCIB4431包含了在SEQ ID NO:1中提供的合成的CryIA(b)基因,并于1992年9月21日保存于农业研究机构保藏中心(NRRL),北部区域研究中心,1815 North Uni versity Street,Peoria,Illinois61604,U.S.A,保藏号为NRRL B-18998。
PCIB3064包含植物可表达的bar基因(615bP),原始地从吸水链霉菌克隆[Thompson等(1987)EMBO J 6,2519-2523]。它编码了磷丝菌素酰基转移酶(PAT),带来对磷丝菌素的抗性,bar基因在CaMV 35S启动子和终止子的控制下[OW等(1987)美国国家科学院院报84,4870-4874]以提供对磷丝菌素的抗性。
实施例3 含合成的玉米CryIA(b)基因的转基因玉米植株的产生。
以下实施例利用Biolistic仪器将DNA包被的颗粒导入玉米细胞,从中产生转化的植株。
3.1
组织
未成熟玉米胚,大约长1.5-2.5mm,在传粉后14-15天,从基因型6N6 15的雌穗切出,母株在温室中生长。在切除前,雌穗用20%Cloxox表面消毒20分钟,并用无菌水冲洗三次。单个胚盾片朝上放在2平方厘米区域内,每盘36个胚,并置于愈伤诱导培养基,2DG4+5草灭平培养[N6大量盐,B5微量盐,MS铁离子,2%蔗糖和5mg/l草灭平,20mg/l葡萄糖,以及在高压灭菌后加10mlG4添加剂(表1)]上。
表1:G4添加剂
成分 每升培养基
酪蛋白水解物 0.5g
脯氨酸 1.38g
尼克酸 0.2mg
吡哆素-HCl 0.2mg
硫胺素-HCl 0.5mg
胆碱-HCl 0.1mg
核黄素 0.05mg
生物素 0.1mg
叶酸 0.05mg
泛酸钙 0.1mg
对氨基苯甲酸 0.05mg
B12 0.136μg
3.2用于转化的DNA的制备
微载体根据与Biolistic仪器一起提供的说明书制备,在震荡50μl1.0μm金微载体的同时,加入5μl pCIB4431(1.23μg/μl)[#898]+2μl PCIB3064(0.895mg/μl)[#456],再加入50μl 2.5MCaCl2,然后加入20μl 0.1M亚精胺(自由基,TC级)。得到的混合物震荡混匀3分钟并离心10秒。去除上清,并用250μl 100%乙醇(HPLC级)通过短暂震荡,离心并去除上清冲洗微载体2次,微载体重悬于65μl 100%乙醇中。
3.3
轰击
使用PDS-1000He Biolistic仪器轰击组织。组织放置于制动筛选挡板(stopping screen shelf)以下8cm的格子上,用10μl DNA/金微载体溶液射击组织一次,并在大载体(macrocarrier)上干燥。所用的终止筛板使用10×10不锈钢筛目手动冲压,使用1550psi值的自裂片。轰击后,胚在黑暗中,25℃培养。
3.4
愈伤形成
轰击1天后,胚转移到含3mg/l PPT的愈伤诱导培养基上。轰击后2和3周评估胚的愈伤形成。所有有反应的胚转移到愈伤维持培养基,含3mg/L PPT的2DG4+0.5 2,4-D培养基上。愈伤维持培养基是N6大量盐,B5微量盐,MS铁,2%蔗糖,加0.5mg/L 2,4-D,20mg/l葡萄糖,并在高压灭菌后加入10ml G4添加剂。胚生愈伤每隔2周用含3mg/L PPT的新鲜维持培养基传代,所有愈伤组织在黑暗中于25℃温育。
I型愈伤形成反应是15%。每一个产生愈伤的胚作为单独情况培养,以形成单独株系。
3.5
再生
筛选12周后,将组织从含PPT的愈伤维持培养基上移到再生培养基上。再生培养基是0.25MS3S5BA(0.25mg/L 2,4-D,5mg/L BAP,MS盐类,3%蔗糖),2周后,在MS3S培养基上再培养以再生植株。5到10周后,取出植物,并放入GA7′S中。
实施例4:转基因玉米植株的分析
4.1
ELISA分析
在转基因玉米中CryIA(b)基因的表达检测使用亚洲玉米螟昆虫生物分析和ELISA分析监控,以从数量上确定得到的CryIA(b)蛋白的水平。
使用酶联免疫吸附分析(ELISA)对转基因植物叶中CryIA(b)杀昆虫蛋白进行数量上的测定,如在Clark MF,Lister R M,Bar-Joseph M:ELISA技术。于:Weissbach A,Weissbach H(编辑)酶学方法118:742-766,学术出版社,Florida(1986)公开。使用免疫亲合纯化的对苏云金芽孢杆菌库尔斯塔克亚种的杀昆虫蛋白特异的多克隆免和羊抗体测定每mg来自样品叶粗提物的可溶蛋白中杀昆虫蛋白的量。每个ELISA微量滴定板的加样孔使用50μg总蛋白时,双级三明治ELISA的灵敏度是每mg可溶蛋白1-5ng杀昆虫蛋白。
玉米提取物通过使用手持球珠匀浆器(AGDIA,ELkart IN.)在网状划线的塑料袋中研磨叶组织得到,匀浆器中有抽提缓冲液(50mMNa2CO3 PH9.5,100mM NaCl,0.05%Triton,0.05%Tween,1mM PMSF和1μ M Leupeptim)。蛋白测定使用Bio-Rad(Richmond,CA)蛋白试验进行。
4.2
亚洲玉米螟分析
从玉米植株展开的叶子上切下一个到4个4cm小块。每个叶块放在50×9mm培养皿中湿润的圆滤纸上。在每个叶块上放5只新生亚洲玉米螟幼虫(使每个植株上总数5-20只幼虫)。培养皿放在29.5℃温育。在24,48和72小时后评估叶块饲养损伤和死亡率。
实施例5:Ostrinia furnacalis(亚洲玉米螟)田间测试实验
将内含事先测试存在并表达了转化基因的转基因幼苗的小泥盆,移载到田野中。非转基因的近交系株系在同一田间载种超过六个星期,以作为对照并供传粉。
当田间植物达到展开叶片长度40cm时,开始用实验室饲养的Ostrinia furnacalis(亚洲玉米螟)幼虫侵袭转基因和非转基因对照植物。使用Davis培养箱,将大约300只与玉米穗轴石细胞团混合的新生幼虫导入每株植物的轮生叶中。以星期为单位,侵袭持续四周以刺激第一代亚洲玉米螟。起始侵袭后两周开始,每周使用分数1-9评估每株植物,持续四周(1=无可见的叶损伤;9=大多数叶有长的伤口,几片叶有折断的中脉,可能由于喂养亚洲玉米螟导致的矮化植株)。对于每株转基因植株和非转基因对照植株,亚洲玉米螟损伤的平均评估分数得到计算。当各植株达到开花期,每周用300幼虫/植株,持续四周,进行第二次侵袭。将各100只在玉米穗轴石细胞团中的新生幼虫导入初生雌穗的叶腋和初生雌穗结的以上和以下一节的叶腋。由此,总数大约2400只幼虫加在每一株植物上。最初二次侵袭后大约50天,收集所有移载的和一些非转基因植株的茎。鉴定第二次侵袭在整个植物中内部穿孔损伤的程度。
实施例6:测试来自转化原生质体的抽提物抵抗Ostrinia furnacalis(亚洲玉米螟)的活性
使用从已瞬间转化DNA以表达玉米最佳化基因的玉米细胞得到的抽提物,进行Western印迹分析。
使用收获的原生质进行数量上的昆虫毒性测试。为在生物分析中所测的每一重复实验制备悬浮液。如果一个重复试验引起明显比对照更高的死亡率,则认为它是阳性的。例如,使用亚洲玉米螟,Ostrinia furnacalis重复测试抽提物针对鳞翅目昆虫的活性。100μl在0.1%Triton X-100中的原生质体悬浮液滴到在50mm×10mm有咬合盖的培养皿中的人工黑色切根虫营养上(Bioserv,Inc.,frenchtown,NJ;F9240)。风干后,每培养皿加入10只新生幼虫。大约4天后记录死亡率。
实施例7:Ostrinia furnacalis(亚洲玉米螟)的植物浸泡试验
7.1
苏云金芽孢杆菌(Bt)晶体
苏云金芽孢杆菌(Bt)晶体用22ml双蒸水制备成贮备悬浮液。悬浮液保存于冰箱中。
7.2
记录规范
使用3天大的Ostrinia furnacalis(亚洲玉米螟)幼虫在玉米叶上饲养。幼虫已事先用未处理的叶喂养。120小时后,记录死亡幼虫的数量。观察每个试验中叶上饲养损伤的情况。
7.3
测试方法
使用对亚洲玉米螟易感的玉米(Zea mays)的两个纯合近交系植株(株系A和B)。9-10天大的籽苗在不同浓度的Bt蛋白悬浮液中浸泡,并在喂养实验中使用,其中幼虫放在籽苗的干了的叶子上,每植株5-10只幼虫。籽苗用尼龙网袋盖上,并放在尼龙网笼子中。测试4-5个浓度的4个重复实验,鉴定死亡率。温度保持在21-30℃。
7.4
结果
表2:株系A的结果
剂量(ppm) |
昆虫总数 |
死亡昆虫数 |
死亡率(%) |
80 |
40 |
31 |
77.5 |
40 |
40 |
29 |
72.5 |
20 |
40 |
15 |
37.5 |
10 |
40 |
13 |
32.5 |
5 |
40 |
5 |
12.5 |
0 |
40 |
0 |
0.00 |
表3:株系B的结果
剂量(ppm) |
昆虫总数 |
死亡昆虫数 |
死亡率(%) |
80 |
40 |
38 |
94.74 |
40 |
40 |
33 |
81.58 |
20 |
40 |
28 |
68.42 |
10 |
40 |
19 |
44.74 |
5 |
40 |
10 |
21.05 |
0 |
40 |
2 |
5.00 |
玉米叶损伤的两种情况清楚地区别出来:Bt浸泡的籽苗的叶轻微损伤,同时对照籽苗的叶子严重损伤。
得到下列LC50值:
株系A:LC50=23.412ppm(从17.834到30.734之间变动)
株系B:LC50=12.234ppm(从9.547到15.676之间变动)
实施例8:Ostrinia furnacalis(亚洲玉米螟)的植物浸泡实验(VIP3A)
8.1
VIP3A蛋白
5mg VIP3A蛋白用50ml双蒸水制备以准备VIP3蛋白的不同浓度:100ppm,50ppm,25ppm,12.5ppm,6.25ppm和0ppm(对照)。
8.2
Ostrinia furnacalis(亚洲玉米螟)
由昆虫学和动物科学部DOA从位于Racha Buri的农场收集蛹以便准备测试用幼虫(L2)
8.3
记录规范
培养5天后,通过计算死亡幼虫的数量收集数据,然后通过概率分析程序分析幼虫死亡的百分率。
8.4
测试方法
使用对亚洲玉米螟易感的玉米(Zea mays)的两个纯合近交系植株(株系B和C)。株系B用来作盆载植物测试(5个不同梯度浓度和对照的4组重复实验),株系C用于叶浸泡测试(100,50,25ppm和对照的4组重复实验)。10-14天大的籽苗在不同浓度的VIP3A蛋白悬浮液中浸泡,并在喂养实验中使用,其中幼虫放在籽苗的干了的叶子上,每植株5-10幼虫。籽苗用尼龙网袋盖上,并放于尼龙网笼子中。切下的叶子在密封塑料袋中,并置于控制室内,本试验使用4组重复实验。
8.4
结果
表4:株系B的结果
表4b
剂量(ppm) |
昆虫总数 |
死亡昆虫数 |
死亡率(%) |
100 |
40 |
31 |
76.92 |
50 |
40 |
26 |
64.10 |
25 |
40 |
16 |
38.46 |
12.5 |
40 |
14 |
33.33 |
6.25 |
40 |
9 |
20.51 |
0 |
40 |
1 |
2.50 |
表4b
重复 |
5天后死亡幼虫数目 |
浓度(ppm) |
|
对照 |
6.25 |
12.5 |
25 |
50 |
100 |
I(10) |
0 |
3 |
3 |
5 |
8 |
6 |
II(10) |
0 |
2 |
2 |
3 |
6 |
9 |
III(10) |
1 |
3 |
5 |
4 |
5 |
9 |
IV(10) |
0 |
1 |
4 |
4 |
7 |
7 |
总数 |
1 |
9 |
14 |
16 |
26 |
31 |
表5:株系C的结果
表5a
重复 |
24小时后死亡幼虫数目 |
浓度(ppm) |
|
对照 |
25 |
50 |
100 |
I(10) |
0 |
0 |
0 |
1 |
II(10) |
0 |
0 |
0 |
1 |
III(10) |
0 |
0 |
0 |
0 |
IV(10) |
0 |
0 |
0 |
1 |
总数 |
0 |
0 |
0 |
3 |
表5b
重复 |
48小时后死亡幼虫数目 |
浓度(ppm) |
|
对照 |
25 |
50 |
100 |
I(10) |
0 |
0 |
0 |
1 |
II(10) |
0 |
0 |
0 |
1 |
III(10) |
0 |
0 |
1 |
0 |
IV(10) |
0 |
0 |
0 |
1 |
总数 |
0 |
0 |
1 |
3 |
表5c
重复 |
72小时后死亡幼虫数目 |
浓度(ppm) |
|
对照 |
25 |
50 |
100 |
I(10) |
0 |
1 |
0 |
2 |
II(10) |
0 |
1 |
0 |
1 |
III(10) |
0 |
0 |
2 |
3 |
IV(10) |
0 |
0 |
2 |
1 |
总数 | 0 | 2 | 4 | 7 |
表5d
重复 |
96小时后死亡幼虫数目浓度(ppm) |
|
对照 |
25 |
50 |
100 |
I(10) |
0 |
1 |
1 |
4 |
II(10) |
0 |
2 |
1 |
3 |
III(10) |
0 |
1 |
3 |
5 |
IV(10) |
0 |
1 |
4 |
3 |
总数 |
0 |
5 |
1 |
15 |
表5e
重复 |
120小时后死亡幼虫数目浓度(ppm) |
|
对照 |
25 |
50 |
100 |
I(10) |
1 |
3 |
2 |
7 |
II(10) |
0 |
2 |
1 |
4 |
III(10) |
0 |
3 |
5 |
5 |
IV(10) |
0 |
2 |
4 |
9 |
总数 |
1 |
10 |
12 |
25 |
120小时后得到以下LC50值:
株系B:LC50=29.558ppm(从21.298到41.022之间变动)
株系C:LC50=74.498ppm(从53.644到114.866之间变动)
(1)基本信息
(i)申请人
(A)名称:Novartis AG
(B)街名:Schwarzwaldallee 215
(C)城市:巴塞尔
(D)国家:瑞士
(E)邮政编码(ZIP):4002
(F)电话:+41 61 69 11 11
(G)传真:+41 61 696 79 76
(H)传真:962 991
(ii)发明名称:控制昆虫虫害的方法
(iii)序列数目:55
(iv)计算机可读形式
(A)介质类型:软盘
(B)计算机:IBM PC兼容机
(C)操作系统:PC-DOS/MS-DOS
(D)软件:PatentIn Release#1.0,版本#1.30B
(2)关于SEQ ID NO:1的信息
(i)序列特征
(A)长度:6049个核苷酸
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:DNA(基因组)
(vi)原始来源
(A)生物:蜡状芽孢杆菌
(B)菌株:AB78
(C)特定分离物:NRRL B-21058
(ix)特征:
(A)名称/键:CDS
(B)位置:1082..2467
(D)其它信息:/产品=“VIP2A(a)”
(ix)特征:
(A)名称/键:misc_feature
(B)位置:2475..5126
(D)其它信息:/备注=100kd VIP1A(a)蛋白的编码序列。该编码序列在SEQ ID NO:4中重复并分别翻译。
(xi)序列描述:SEQ ID NO:1:
ATCGATACAA TGTTGTTTTA CTTAGACCGG TAGTCTCTGT AATTTGTTTA ATGCTATATT 60
CTTTACTTTG ATACATTTTA ATAGCCATTT CAACCTTATC AGTATGTTTT TGTGGTCTTC 120
CTCCTTTTTT TCCACGAGCT CTAGCTGCGT TTAATCCTGT TTTGGTACGT TCGCTAATAA 180
TATCTCTTTC TAATTCTGCA ATACTTGCCA TCATTCGAAA GAAGAATTTC CCCATAGCAT 240
TAGAGGTATC AATGTTGTCA TGAATAGAAA TAAAATCTAC ACCTAGCTCT TTGAATTTTT 300
CACTTAACTC AATTAGGTGT TTTGTAGAGC GAGAAATTCG ATCAAGTTTG TAAACAACTA 360
TCTTATCGCC TTTACGTAAT ACTTTTAGCA ACTCTTCGAG TTGAGGGCGC TCTTTTTTTA 420
TTCCTGTTAT TTTCTCCTGA TATAGCCTTT CTACACCATA TTGTTGCAAA GCATCTATTT 480
GCATATCGAG ATTTTGTTCT TCTGTGCTGA CACGAGCATA ACCAAAAATC AAATTGGTTT 540
CACTTCCTAT CTAAATATAT CTATTAAAAT AGCACCAAAA ACCTTATTAA ATTAAAATAA 600
GGAACTTTGT TTTTGGATAT GGATTTTGGT ACTCAATATG GATGAGTTTT TAACGCTTTT 660
GTTAAAAAAC AAACAAGTGC CATAAACGGT CGTTTTTGGG ATGACATAAT AAATAATCTG 720
TTTGATTAAC CTAACCTTGT ATCCTTACAG CCCAGTTTTA TTTGTACTTC AACTGACTGA 780
ATATGAAAAC AACATGAAGG TTTCATAAAA TTTATATATT TTCCATAACG GATGCTCTAT 840
CTTTAGGTTA TAGTTAAATT ATAAGAAAAA AACAAACGGA GGGAGTGAAA AAAAGCATCT 900
TCTCTATAAT TTTACAGGCT CTTTAATAAG AAGGGGGGAG ATTAGATAAT AAATATGAAT 960
ATCTATCTAT AATTGTTTGC TTCTACAATA ACTTATCTAA CTTTCATATA CAACAACAAA 1020
ACAGACTAAA TCCAGATTGT ATATTCATTT TCAGTTGTTC CTTTATAAAA TAATTTCATA 1080
A ATG AAA AGA ATG GAG GGA AAG TTG TTT ATG GTG TCA AAA AAA TTA 1126
Met Lys Arg Met Glu Gly Lys Leu Phe Met Val Ser Lys Lys Leu
1 5 10 15
CAA GTA GTT ACT AAA ACT GTA TTG CTT AGT ACA GTT TTC TCT ATA TCT 1174
Gln Val Val Thr Lys Thr Val Leu Leu Ser Thr Val Phe Ser Ile Ser
20 25 30
TTA TTA AAT AAT GAA GTG ATA AAA GCT GAA CAA TTA AAT ATA AAT TCT 1222
Leu Leu Asn Asn Glu Val Ile Lys Ala Glu Gln Leu Asn Ile Asn Ser
35 40 45
CAA AGT AAA TAT ACT AAC TTG CAA AAT CTA AAA ATC ACT GAC AAG GTA 1270
Gln Ser Lys Tyr Thr Asn Leu Gln Asn Leu Lys Ile Thr Asp Lys Val
50 55 60
GAG GAT TTT AAA GAA GAT AAG GAA AAA GCG AAA GAA TGG GGG AAA GAA 1318
Glu Asp Phe Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys Glu
65 70 75
AAA GAA AAA GAG TGG AAA CTA ACT GCT ACT GAA AAA GGA AAA ATG AAT 1366
Lys Glu Lys Glu Trp Lys Leu Thr Ala Thr Glu Lys Gly Lys Met Asn
80 85 90 95
AAT TTT TTA GAT AAT AAA AAT GAT ATA AAG ACA AAT TAT AAA GAA ATT 1414
Asn Phe Leu Asp Asn Lys Asn Asp Ile Lys Thr Asn Tyr Lys Glu Ile
100 105 110
ACT TTT TCT ATG GCA GGC TCA TTT GAA GAT GAA ATA AAA GAT TTA AAA 1462
Thr Phe Ser Met Ala Gly Ser Phe Glu Asp Glu Ile Lys Asp Leu Lys
115 120 125
GAA ATT GAT AAG ATG TTT GAT AAA ACC AAT CTA TCA AAT TCT ATT ATC 1510
Glu Ile Asp Lys Met Phe Asp Lys Thr Asn Leu Ser Asn Ser Ile Ile
130 135 140
ACC TAT AAA AAT GTG GAA CCG ACA ACA ATT GGA TTT AAT AAA TCT TTA 1558
Thr Tyr Lys Asn Val Glu Pro Thr Thr Ile Gly Phe Asn Lys Ser Leu
145 150 155
ACA GAA GGT AAT ACG ATT A AT TCT GAT GCA ATG GCA CAG TTT AAA GAA 1606
Thr Glu Gly Asn Thr Ile Asn Ser Asp Ala Met Ala Gln Phe Lys Glu
160 165 170 175
CAA TTT TTA GAT AGG GAT ATT AAG TTT GAT AGT TAT CTA GAT ACG CAT 1654
Gln Phe Leu Asp Arg Asp Ile Lys Phe Asp Ser Tyr Leu Asp Thr His
180 185 190
TTA ACT GCT CAA CAA GTT TCC AGT AAA GAA AGA GTT ATT TTG AAG GTT 1702
Leu Thr Ala Gln Gln Val Ser Ser Lys Glu Arg Val Ile Leu Lys Val
195 200 205
ACG GTT CCG AGT GGG AAA GGT TCT ACT ACT CCA ACA AAA GCA GGT GTC 1750
Thr Val Pro Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val
210 215 220
ATT TTA AAT AAT AGT GAA TAC AAA ATG CTC ATT GAT AAT GGG TAT ATG 1798
Ile Leu Asn Asn Ser Glu Tyr Lys Met Leu Ile Asp Asn Gly Tyr Met
225 230 235
GTC CAT GTA GAT AAG GTA TCA AAA GTG GTG AAA AAA GGG GTG GAG TGC 1846
Val His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Val Glu Cys
240 245 250 255
TTA CAA ATT GAA GGG ACT TTA AAA AAG AGT CTT GAC TTT AAA AAT GAT 1894
Leu Gln Ile Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp
260 265 270
ATA AAT GCT GAA GCG CAT AGC TGG GGT ATG AAG AAT TAT GAA GAG TGG 1942
Ile Asn Ala Glu Ala His Ser Trp Gly Met Lys Asn Tyr Glu Glu Trp
275 280 285
GCT AAA GAT TTA ACC GAT TCG CAA AGG GAA GCT TTA GAT GGG TAT GCT 1990
Ala Lys Asp Leu Thr Asp Ser Gln Arg Glu Ala Leu Asp Gly Tyr Ala
290 295 300
AGG CAA GAT TAT AAA GAA ATC AAT AAT TAT TTA AGA AAT CAA GGC GGA 2038
Arg Gln Asp Tyr Lys Glu Ile Asn Asn Tyr Leu Arg Asn Gln Gly Gly
305 310 315
AGT GGA AAT GAA AAA CTA GAT GCT CAA ATA AAA AAT ATT TCT GAT GCT 2086
Ser Gly Asn Glu Lys Leu Asp Ala Gln Ile Lys Asn Ile Ser Asp Ala
320 325 330 335
TTA GGG AAG AAA CCA ATA CCG GAA AAT ATT ACT GTG TAT AGA TGG TGT 2134
Leu Gly Lys Lys Pro Ile Pro Glu Asn Ile Thr Val Tyr Arg Trp Cys
340 345 350
GGC ATG CCG GAA TTT GGT TAT CAA ATT AGT GAT CCG TTA CCT TCT TTA 2182
Gly Met Pro Glu Phe Gly Tyr Gln Ile Ser Asp Pro Leu Pro Ser Leu
355 360 365
AAA GAT TTT GAA GAA CAA TTT TTA AAT ACA ATC AAA GAA GAC AAA GGA 2230
Lys Asp Phe Glu Glu Gln Phe Leu Asn Thr Ile Lys Glu Asp Lys Gly
370 375 380
TAT ATG AGT ACA AGC TTA TCG AGT GAA CGT CTT GCA GCT TTT GGA TCT 2278
Tyr Met Ser Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser
385 390 395
AGA AAA ATT ATA TTA CGA TTA CAA GTT CCG AAA GGA AGT ACG GGT GCG 2326
Arg Lys Ile Ile Leu Arg Leu Gln Val Pro Lys Gly Ser Thr Gly Ala
400 405 410 415
TAT TTA AGT GCC ATT GGT GGA TTT GCA AGT GAA AAA GAG ATC CTA CTT 2374
Tyr Leu Ser Ala Ile Gly Gly Phe Ala Ser Glu Lys Glu Ile Leu Leu
420 425 430
GAT AAA GAT AGT AAA TAT CAT ATT GAT AAA GTA ACA GAG GTA ATT ATT 2422
Asp Lys Asp Ser Lys Tyr His Ile Asp Lys Val Thr Glu Val Ile Ile
435 440 445
AAA GGT GTT AAG CGA TAT GTA GTG GAT GCA ACA TTA TTA ACA AAT 2467
Lys Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn
450 455 460
TAAGGAGATG AAAAATATGA AGAAAAAGTT AGCAAGTGTT GTAACGTGTA CGTTATTAGC 2527
TCCTATGTTT TTGAATGGAA ATGTGAATGC TGTTTACGCA GACAGCAAAA CAAATCAAAT 2587
TTCTACAACA GAGAAAAATC AACAGAAAGA GATGGACCGA AAAGGATTAC TTGGGTATTA 2647
TTTCAAAGGA AAAGATTTTA GTAATCTTAC TATGTTTGCA CCGACACGTG ATAGTACTCT 2707
TATTTATGAT CAACAAACAG CAAATAAACT ATTAGATAAA AAACAACAAG AATATCAGTC 2767
TATTCGTTGG ATTGGTTTGA TTCAGAGTAA AGAAACGGGA GATTTCACAT TTAACTTATC 2827
TGAGGATGAA CAGGCAATTA TAGAAATCAA TGGGAAAATT ATTTCTAATA AAGGGAAAGA 2887
AAAGCAAGTT GTCCATTTAG AAAAAGGAAA ATTAGTTCCA ATCAAAATAG AGTATCAATC 2947
AGATACAAAA TTTAATATTG ACAGTAAAAC ATTTAAAGAA CTTAAATTAT TTAAAATAGA 3007
TAGTCAAAAC CAACCCCAGC AAGTCCAGCA AGATGAACTG AGAAATCCTG AATTTAACAA 3067
GAAAGAATCA CAGGAATTCT TAGCGAAACC ATCGAAAATA AATCTTTTCA CTCAAAAAAT 3127
GAAAAGGGAA ATTGATGAAG ACACGGATAC GGATGGGGAC TCTATTCCTG ACCTTTGGGA 3187
AGAAAATGGG TATACGATTC ACAATAGAAT CGCTGTAAAG TGGGACGATT CTCTAGCAAG 3247
TAAAGGGTAT ACGAAATTTG TTTCAAATCC ACTAGAAAGT CACACAGTTG GTGATCCTTA 3307
TACAGATTAT GAAAAGGCAG CAAGAGATCT AGATTTGTCA AATGCAAAGG AAACGTTTAA 3367
CCCATTGGTA GCTGCTTTTC CAAGTGTGAA TGTTAGTATG GAAAAGGTGA TATTATCACC 3427
AAATGAAAAT TTATCCAATA GTGTAGAGTC TCATTCATCC ACGAATTGGT CTTATACAAA 3487
TACAGAAGGT GCTTCTGTTG AAGCGGGGAT TGGACCAAAA GGTATTTCGT TCGGAGTTAG 3547
CGTAAACTAT CAACACTCTG AAACAGTTGC ACAAGAATGG GGAACATCTA CAGGAAATAC 3607
TTCGCAATTC AATACGGCTT CAGCGGGATA TTTAAATGCA AATGTTCGAT ATAACAATGT 3667
AGGAACTGGT GCCATCTACG ATGTAAAACC TACAACAAGT TTTGTATTAA ATAACGATAC 3727
TATGGCAACT ATTACGGCGA AATCTAATTC TACAGCCTTA AATATATCTC CTGGAGAAAG 3787
TTACCCGAAA AAAGGACAAA ATGGAATCGC AATAACATCA ATGGATGATT TTAATTCCCA 3847
TCCGATTACA TTAAATAAAA AACAAGTAGA TAATCTGGCA AATAATAAAC CTATGATGTT 3907
GGAAACAAAC CAAACAGATG GTGTTTATAA GATAAAAGAT ACACATGGAA ATATAGTAAC 3967
TGGCGGAGAA TGGAATGGTG TCATACAACA AATCAAGGCT AAAACAGCGT CTATTATTGT 4027
GGATGATGGG GAACGTGTAG CAGAAAAACG TGTAGCGGCA AAAGATTATG AAAATCCAGA 4087
AGATAAAACA CCGTCTTTAA CTTTAAAAGA TGCCCTGAAG CTTTCATATC CAGATGAAAT 4147
AAAAGAAATA GAGGGATTAT TATATTATAA AAACAAACCG ATATACGAAT CGAGCGTTAT 4207
GACTTACTTA GATGAAAATA CAGCAAAAGA AGTGACCAAA CAATTAAATG ATACCACTGG 4267
GAAATTTAAA GATGTAAGTC ATTTATATGA TGTAAAACTG ACTCCAAAAA TGAATGTTAC 4327
AATCAAATTG TCTATACTTT ATGATAATGC TGAGTCTAAT GATAACTCAA TTGGTAAATG 4387
GACAAACACA AATATTGTTT CAGGTGGAAA TAACGGAAAA AAACAATATT CTTCTAATAA 4447
TCCGGATGCT AATTTGACAT TAAATACAGA TGCTCAAGAA AAATTAAATA AAAATCGTGA 4507
CTATTATATA AGTTTATATA TGAAGTCAGA AAAAAACACA CAATGTGAGA TTACTATAGA 4567
TGGGGAGATT TATCCGATCA CTACAAAAAC AGTGAATGTG AATAAAGACA ATTACAAAAG 4627
ATTAGATATT ATAGCTCATA ATATAAAAAG TAATCCAATT TCTTCACTTC ATATTAAAAC 4687
GAATGATGAA ATAACTTTAT TTTGGGATGA TATTTCTATA ACAGATGTAG CATCAATAAA 4747
ACCGGAAAAT TTAACAGATT CAGAAATTAA ACAGATTTAT AGTAGGTATG GTATTAAGTT 4807
AGAAGATGGA ATCCTTATTG ATAAAAAAGG TGGGATTCAT TATGGTGAAT TTATTAATGA 4867
AGCTAGTTTT AATATTGAAC CATTGCAAAA TTATGTGACC AAATATGAAG TTACTTATAG 4927
TAGTGAGTTA GGACCAAACG TGAGTGACAC ACTTGAAAGT GATAAAATTT ACAAGGATGG 4987
GACAATTAAA TTTGATTTTA CCAAATATAG TAAAAATGAA CAAGGATTAT TTTATGACAG 5047
TGGATTAAAT TGGGACTTTA AAATTAATGC TATTACTTAT GATGGTAAAG AGATGAATGT 5107
TTTTCATAGA TATAATAAAT AGTTATTATA TCTATGAAGC TGGTGCTAAA GATAGTGTAA 5167
AAGTTAATAT ACTGTAGGAT TGTAATAAAA GTAATGGAAT TGATATCGTA CTTTGGAGTG 5227
GGGGATACTT TGTAAATAGT TCTATCAGAA ACATTAGACT AAGAAAAGTT ACTACCCCCA 5287
CTTGAAAATG AAGATTCAAC TGATTACAAA CAACCTGTTA AATATTATAA GGTTTTAACA 5347
AAATATTAAA CTCTTTATGT TAATACTGTA ATATAAAGAG TTTAATTGTA TTCAAATGAA 5407
GCTTTCCCAC AAAATTAGAC TGATTATCTA ATGAAATAAT CAGTCTAATT TTGTAGAACA 5467
GGTCTGGTAT TATTGTACGT GGTCACTAAA AGATATCTAA TATTATTGGG CAAGGCGTTC 5527
CATGATTGAA TCCTCGAATG TCTTGCCCTT TTCATTTATT TAAGAAGGAT TGTGGAGAAA 5587
TTATGGTTTA GATAATGAAG AAAGACTTCA CTTCTAATTT TTGATGTTAA ATAAATCAAA 5647
ATTTGGCGAT TCACATTGTT TAATCCACTG ATAAAACATA CTGGAGTGTT CTTAAAAAAT 5707
CAGCTTTTTT CTTTATAAAA TTTTGCTTAG CGTACGAAAT TCGTGTTTTG TTGGTGGGAC 5767
CCCATGCCCA TCAACTTAAG AGTAAATTAG TAATGAACTT TCGTTCATCT GGATTAAAAT 5827
AACCTCAAAT TAGGACATGT TTTTAAAAAT AAGCAGACCA AATAAGCCTA GAATAGGTAT 5887
CATTTTTAAA AATTATGCTG CTTTCTTTTG TTTTCCAAAT CCATTATACT CATAAGCAAC 5947
ACCCATAATG TCAAAGACTG TTTTTGTCTC ATATCGATAA GCTTGATATC GATTCCTGGC 6007
AGCCCGGGGG ATCCACTAGT TCTAGAGCGG CCGCCACCGC GG 6049
(2)关于SEQ ID NO:2的信息
(i)序列特征
(A)长度:462个氨基酸
(B)类型:氨基酸
(D)拓扑结构:线型
(ii)分子类型:蛋白质
(xi)序列描述:SEQ ID NO:2:
Met Lys Arg Met Glu Gly Lys Leu Phe Met Val Ser Lys Lys Leu Gln
1 5 10 15
Val Val Thr Lys Thr Val Leu Leu Ser Thr Val Phe Ser Ile Ser Leu
20 25 30
Leu Asn Asn Glu Val Ile Lys Ala Glu Gln Leu Asn Ile Asn Ser Gln
35 40 45
Ser Lys Tyr Thr Asn Leu Gln Asn Leu Lys Ile Thr Asp Lys Val Glu
50 55 60
Asp Phe Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys Glu Lys
65 70 75 80
Glu Lys Glu Trp Lys Leu Thr Ala Thr Glu Lys Gly Lys Met Asn Asn
85 90 95
Phe Leu Asp Asn Lys Asn Asp Ile Lys Thr Asn Tyr Lys Glu Ile Thr
100 105 110
Phe Ser Met Ala Gly Ser Phe Glu Asp Glu Ile Lys Asp Leu Lys Glu
115 120 125
Ile Asp Lys Met Phe Asp Lys Thr Asn Leu Ser Asn Ser Ile Ile Thr
130 135 140
Tyr Lys Asn Val Glu Pro Thr Thr Ile Gly Phe Asn Lys Ser Leu Thr
145 150 155 160
Glu Gly Asn Thr Ile Asn Ser Asp Ala Met Ala Gln Phe Lys Glu Gln
165 170 175
Phe Leu Asp Arg Asp Ile Lys Phe Asp Ser Tyr Leu Asp Thr His Leu
180 185 190
Thr Ala Gln Gln Val Ser Ser Lys Glu Arg Val Ile Leu Lys Val Thr
195 200 205
Val Pro Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val Ile
210 215 220
Leu Asn Asn Ser Glu Tyr Lys Met Leu Ile Asp Asn Gly Tyr Met Val
225 230 235 240
His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Val Glu Cys Leu
245 250 255
Gln Ile Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp Ile
260 265 270
Asn Ala Glu Ala His Ser Trp Gly Met Lys Asn Tyr Glu Glu Trp Ala
275 280 285
Lys Asp Leu Thr Asp Ser Gln Arg Glu Ala Leu Asp Gly Tyr Ala Arg
290 295 300
Gln Asp Tyr Lys Glu Ile Asn Asn Tyr Leu Arg Asn Gln Gly Gly Ser
305 310 315 320
Gly Asn Glu Lys Leu Asp Ala Gln Ile Lys Asn Ile Ser Asp Ala Leu
325 330 335
Gly Lys Lys Pro Ile Pro Glu Asn Ile Thr Val Tyr Arg Trp Cys Gly
340 345 350
Met Pro Glu Phe Gly Tyr Gln Ile Ser Asp Pro Leu Pro Ser Leu Lys
355 360 365
Asp Phe Glu Glu Gln Phe Leu Asn Thr Ile Lys Glu Asp Lys Gly Tyr
370 375 380
Met Ser Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Arg
385 390 395 400
Lys Ile Ile Leu Arg Leu Gln Val Pro Lys Gly Ser Thr Gly Ala Tyr
405 410 415
Leu Ser Ala Ile Gly Gly Phe Ala Ser Glu Lys Glu Ile Leu Leu Asp
420 425 430
Lys Asp Ser Lys Tyr His Ile Asp Lys Val Thr Glu Val Ile Ile Lys
435 440 445
Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn
450 455 460
(2)关于SEQ ID NO:3的信息
(i)序列特征
(A)长度:20个氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:肽
(ix)特征:
(A)名称/键:肽
(B)位置:1..20
(D)其它信息:/备注=用于导向液泡的信号肽
(xi)序列描述:SEQ ID NO:3:
Ser Ser Ser Ser Phe Ala Asp Ser Asn Pro Ile Arg Val Thr Asp Arg
1 5 10 15
Ala Ala Ser Thr
20
(2)关于SEQ ID NO:4的信息
(i)序列特征
(A)长度:2655个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:DNA(基因组)
(iii)假拟:否
(iv)反义:否
(vi)原始来源
(A)生物:蜡状芽孢杆菌
(B)菌株:AB78
(C)特定分离物:NRRL B-21058
(ix)特征:
(A)名称/键:CDS
(B)位置:1..2652
(D)其它信息:/产品=“100kDa VIPlA(a)蛋白”,/备注=“该序列与SEQ ID NO:1的从2475到5126位的部分核苷酸相同。”
(xi)序列描述:SEQ ID NO:4:
ATG AAA AAT ATG AAG AAA AAG TTA GCA AGT GTT GTA ACG TGT ACG TTA 48
Met Lys Asn Met Lys Lys Lys Leu Ala Ser Val Val Thr Cys Thr Leu
465 470 475
TTA GCT CCT ATG TTT TTG AAT GGA AAT GTG AAT GCT GTT TAC GCA GAC 96
Leu Ala Pro Met Phe Leu Asn Gly Asn Val Asn Ala Val Tyr Ala Asp
480 485 490
AGC AAA ACA AAT CAA ATT TCT ACA ACA CAG AAA AAT CAA CAG AAA GAG 144
Ser Lys Thr Asn Gln Ile Ser Thr Thr Gln Lys Asn Gln Gln Lys Glu
495 500 505 510
ATG GAC CGA AAA GGA TTA CTT GGG TAT TAT TTC AAA GGA AAA GAT TTT 192
Met Asp Arg Lys Gly Leu Leu Gly Tyr Tyr Phe Lys Gly Lys Asp Phe
515 520 525
AGT AAT CTT ACT ATG TTT GCA CCG ACA CGT GAT AGT ACT CTT ATT TAT 240
Ser Asn Leu Thr Met Phe Ala Pro Thr Arg Asp Ser Thr Leu Ile Tyr
530 535 540
GAT CAA CAA ACA GCA AAT AAA CTA TTA GAT AAA AAA CAA CAA GAA TAT 288
Asp Gln Gln Thr Ala Asn Lys Leu Leu Asp Lys Lys Gln Gln Glu Tyr
545 550 555
CAG TCT ATT CGT TGG ATT GGT TTG ATT CAG AGT AAA GAA ACG GGA GAT 336
Gln Ser Ile Arg Trp Ile Gly Leu Ile Gln Ser Lys Glu Thr Gly Asp
560 565 570
TTC ACA TTT AAC TTA TCT GAG GAT GAA CAG GCA ATT ATA GAA ATC AAT 384
Phe Thr Phe Asn Leu Ser Glu Asp Glu Gln Ala Ile Ile Glu Ile Asn
575 580 585 590
GGG AAA ATT ATT TCT AAT AAA GGG AAA GAA AAG CAA GTT GTC CAT TTA 432
Gly Lys Ile Ile Ser Asn Lys Gly Lys Glu Lys Gln Val Val His Leu
595 600 605
GAA AAA GGA AAA TTA GTT CCA ATC AAA ATA GAG TAT CAA TCA GAT ACA 480
Glu Lys Gly Lys Leu Val Pro Ile Lys Ile Glu Tyr Gln Ser Asp Thr
610 615 620
AAA TTT AAT ATT GAC AGT AAA ACA TTT AAA GAA CTT AAA TTA TTT AAA 528
Lys Phe Asn Ile Asp Ser Lys Thr Phe Lys Glu Leu Lys Leu Phe Lys
625 630 635
ATA GAT AGT CAA AAC CAA CCC CAG CAA GTC CAG CAA GAT GAA CTG AGA 576
Ile Asp Ser Gln Asn Gln Pro Gln Gln Val Gln Gln Asp Glu Leu Arg
640 645 650
AAT CCT GAA TTT AAC AAG AAA GAA TCA CAG GAA TTC TTA GCG AAA CCA 624
Asn Pro Glu Phe Asn Lys Lys Glu Ser Gln Glu Phe Leu Ala Lys Pro
655 660 665 670
TCG AAA ATA AAT CTT TTC ACT CAA AAA ATG AAA AGG GAA ATT GAT GAA 672
Ser Lys Ile Asn Leu Phe Thr Gln Lys Met Lys Arg Glu Ile Asp Glu
675 680 685
GAC ACG GAT ACG GAT GGG GAC TCT ATT CCT GAC CTT TGG GAA GAA AAT 720
Asp Thr Asp Thr Asp Gly Asp Ser Ile Pro Asp Leu Trp Glu Glu Asn
690 695 700
GGG TAT ACG ATT CAA AAT AGA ATC GCT GTA AAG TGG GAC GAT TCT CTA 768
Gly Tyr Thr Ile Gln Asn Arg Ile Ala Val Lys Trp Asp Asp Ser Leu
705 710 715
GCA AGT AAA GGG TAT ACG AAA TTT GTT TCA AAT CCA CTA GAA AGT CAC 816
Ala Ser Lys Gly Tyr Thr Lys Phe Val Ser Asn Pro Leu Glu Ser His
720 725 730
ACA GTT GGT GAT CCT TAT ACA GAT TAT GAA AAG GCA GCA AGA GAT CTA 864
Thr Val Gly Asp Pro Tyr Thr Asp Tyr Glu Lys Ala Ala Arg Asp Leu
735 740 745 750
GAT TTG TCA AAT GCA AAG GAA ACG TTT AAC CCA TTG GTA GCT GCT TTT 912
Asp Leu Ser Asn Ala Lys Glu Thr Phe Asn Pro Leu Val Ala Ala Phe
755 760 765
CCA AGT GTG AAT GTT AGT ATG GAA AAG GTG ATA TTA TCA CCA AAT GAA 960
Pro Ser Val Asn Val Ser Met Glu Lys Val Ile Leu Ser Pro Asn Glu
770 775 780
AAT TTA TCC AAT AGT GTA GAG TCT CAT TCA TCC ACG AAT TGG TCT TAT 1008
Asn Leu Ser Asn Ser Val Glu Ser His Ser Ser Thr Asn Trp Ser Tyr
785 790 795
ACA AAT ACA GAA GGT GCT TCT GTT GAA GCG GGG ATT GGA CCA AAA GGT 1056
Thr Asn Thr Glu Gly Ala Ser Val Glu Ala Gly Ile Gly Pro Lys Gly
800 805 810
ATT TCG TTC GGA GTT AGC GTA AAC TAT CAA CAC TCT GAA ACA GTT GCA 1104
Ile Ser Phe Gly Val Ser Val Asn Tyr Gln His Ser Glu Thr Val Ala
815 820 825 830
CAA GAA TGG GGA ACA TCT ACA GGA AAT ACT TCG CAA TTC AAT ACG GCT 1152
Gln Glu Trp Gly Thr Ser Thr Gly Asn Thr Ser Gln Phe Asn Thr Ala
835 840 845
TCA GCG GGA TAT TTA AAT GCA AAT GTT CGA TAT AAC AAT GTA GGA ACT 1200
Ser Ala Gly Tyr Leu Asn Ala Asn Val Arg Tyr Asn Asn Val Gly Thr
850 855 860
GGT GCC ATC TAC GAT GTA AAA CCT ACA ACA AGT TTT GTA TTA AAT AAC 1248
Gly Ala Ile Tyr Asp Val Lys Pro Thr Thr Ser Phe Val Leu Asn Asn
865 870 875
GAT ACT ATC GCA ACT ATT ACG GCG AAA TCT AAT TCT ACA GCC TTA AAT 1296
Asp Thr Ile Ala Thr Ile Thr Ala Lys Ser Asn Ser Thr Ala Leu Asn
880 885 890
ATA TCT CCT GGA GAA AGT TAC CCG AAA AAA GGA CAA AAT GGA ATC GCA 1344
Ile Ser Pro Gly Glu Ser Tyr Pro Lys Lys Gly Gln Asn Gly Ile Ala
895 900 905 910
ATA ACA TCA ATG GAT GAT TTT AAT TCC CAT CCG ATT ACA TTA AAT AAA 1392
Ile Thr Ser Met Asp Asp Phe Asn Ser His Pro Ile Thr Leu Asn Lys
915 920 925
AAA CAA GTA GAT AAT CTG CTA AAT AAT AAA CCT ATG ATG TTG GAA ACA 1440
Lys Gln Val Asp Asn Leu Leu Asn Asn Lys Pro Met Met Leu Glu Thr
930 935 940
AAC CAA ACA GAT GGT GTT TAT AAG ATA AAA GAT ACA CAT GGA AAT ATA 1488
Asn Gln Thr Asp Gly Val Tyr Lys Ile Lys Asp Thr His Gly Asn Ile
945 950 955
GTA ACT GGC GGA GAA TGG AAT GGT GTC ATA CAA CAA ATC AAG GCT AAA 1536
Val Thr Gly Gly Glu Trp Asn Gly Val Ile Gln Gln Ile Lys Ala Lys
960 965 970
ACA GCG TCT ATT ATT GTG GAT GAT GGG GAA CGT GTA GCA GAA AAA CGT 1584
Thr Ala Ser Ile Ile Val Asp Asp Gly Glu Arg Val Ala Glu Lys Arg
975 980 985 990
GTA GCG GCA AAA GAT TAT GAA AAT CCA GAA GAT AAA ACA CCG TCT TTA 1632
Val Ala Ala Lys Asp Tyr Glu Asn Pro Glu Asp Lys Thr Pro Ser Leu
995 1000 1005
ACT TTA AAA GAT GCC CTG AAG CTT TCA TAT CCA GAT GAA ATA AAA GAA 1680
Thr Leu Lys Asp Ala Leu Lys Leu Ser Tyr Pro Asp Glu Ile Lys Glu
1010 1015 1020
ATA GAG GGA TTA TTA TAT TAT AAA AAC AAA CCG ATA TAC GAA TCG AGC 1728
Ile Glu Gly Leu Leu Tyr Tyr Lys Asn Lys Pro Ile Tyr Glu Ser Ser
1025 1030 1035
GTT ATG ACT TAC TTA GAT GAA AAT ACA GCA AAA GAA GTG ACC AAA CAA 1776
Val Met Thr Tyr Leu Asp Glu Asn Thr Ala Lys Glu Val Thr Lys Gln
1040 1045 1050
TTA AAT GAT ACC ACT GGG AAA TTT AAA GAT GTA AGT CAT TTA TAT GAT 1824
Leu Asn Asp Thr Thr Gly Lys Phe Lys Asp Val Ser His Leu Tyr Asp
1055 1060 1065 1070
GTA AAA CTG ACT CCA AAA ATG AAT GTT ACA ATC AAA TTG TCT ATA CTT 1872
Val Lys Leu Thr Pro Lys Met Asn Val Thr Ile Lys Leu Ser Ile Leu
1075 1080 1085
TAT GAT AAT GCT GAG TCT AAT GAT AAC TCA ATT GGT AAA TGG ACA AAC 1920
Tyr Asp Asn Ala Glu Ser Asn Asp Asn Ser Ile Gly Lys Trp Thr Asn
1090 1095 1100
ACA AAT ATT GTT TCA GGT GGA AAT AAC GGA AAA AAA CAA TAT TCT TCT 1968
Thr Asn Ile Val Ser Gly Gly Asn Asn Gly Lys Lys Gln Tyr Ser Ser
1105 1110 1115
AAT AAT CCG GAT GCT AAT TTG ACA TTA AAT ACA GAT GCT CAA GAA AAA 2016
Asn Asn Pro Asp Ala Asn Leu Thr Leu Asn Thr Asp Ala Gln Glu Lys
1120 1125 1130
TTA AAT AAA AAT CGT GAC TAT TAT ATA AGT TTA TAT ATG AAG TCA GAA 2064
Leu Asn Lys Asn Arg Asp Tyr Tyr Ile Ser Leu Tyr Met Lys Ser Glu
1135 1140 1145 1150
AAA AAC ACA CAA TGT GAG ATT ACT ATA GAT GGG GAG ATT TAT CCG ATC 2112
Lys Asn Thr Gln Cys Glu Ile Thr Ile Asp Gly Glu Ile Tyr Pro Ile
1155 1160 1165
ACT ACA AAA ACA GTG AAT GTG AAT AAA GAC AAT TAC AAA AGA TTA GAT 2160
Thr Thr Lys Thr Val Asn Val Asn Lys Asp Asn Tyr Lys Arg Leu Asp
1170 1175 1180
ATT ATA GCT CAT AAT ATA AAA AGT AAT CCA ATT TCT TCA CTT CAT ATT 2208
Ile Ile Ala His Asn Ile Lys Ser Asn Pro Ile Ser Ser Leu His Ile
1185 1190 1195
AAA ACG AAT GAT GAA ATA ACT TTA TTT TGG GAT GAT ATT TCT ATA ACA 2256
Lys Thr Asn Asp Glu Ile Thr Leu Phe Trp Asp Asp Ile Ser Ile Thr
1200 1205 1210
GAT GTA GCA TCA ATA AAA CCG GAA AAT TTA ACA GAT TCA GAA ATT AAA 2304
Asp Val Ala Ser Ile Lys Pro Glu Asn Leu Thr Asp Ser Glu Ile Lys
1215 1220 1225 1230
CAG ATT TAT AGT AGG TAT GGT ATT AAG TTA GAA GAT GGA ATC CTT ATT 2352
Gln Ile Tyr Ser Arg Tyr Gly Ile Lys Leu Glu Asp Gly Ile Leu Ile
1235 1240 1245
GAT AAA AAA GGT GGG ATT CAT TAT GGT GAA TTT ATT AAT GAA GCT AGT 2400
Asp Lys Lys Gly Gly Ile His Tyr Gly Glu Phe Ile Asn Glu Ala Ser
1250 1255 1260
TTT AAT ATT GAA CCA TTG CAA AAT TAT GTG ACC AAA TAT GAA GTT ACT 2448
Phe Asn Ile Glu Pro Leu Gln Asn Tyr Val Thr Lys Tyr Glu Val Thr
1265 1270 1275
TAT AGT AGT GAG TTA GGA CCA AAC GTG AGT GAC ACA CTT GAA AGT GAT 2496
Tyr Ser Ser Glu Leu Gly Pro Asn Val Ser Asp Thr Leu Glu Ser Asp
1280 1285 1290
AAA ATT TAC AAG GAT GGG ACA ATT AAA TTT GAT TTT ACC AAA TAT AGT 2544
Lys Ile Tyr Lys Asp Gly Thr Ile Lys Phe Asp Phe Thr Lys Tyr Ser
1295 1300 1305 1310
AAA AAT GAA CAA GGA TTA TTT TAT GAC AGT GGA TTA AAT TGG GAC TTT 2592
Lys Asn Glu Gln Gly Leu Phe Tyr Asp Ser Gly Leu Asn Trp Asp Phe
1315 1320 1325
AAA ATT AAT GCT ATT ACT TAT GAT GGT AAA GAG ATG AAT GTT TTT CAT 2640
Lys Ile Asn Ala Ile Thr Tyr Asp Gly Lys Glu Met Asn Val Phe His
1330 1335 1340
AGA TAT AAT AAA TAG 2655
Arg Tyr Asn Lys
1345
(2)关于SEQ ID NO:5的信息
(i)序列特征
(A)长度:884个氨基酸
(B)类型:氨基酸
(D)拓扑结构:线型
(ii)分子类型:蛋白质
(xi)序列描述:SEQ ID NO:5:
Met Lys Asn Met Lys Lys Lys Leu Ala Ser Val Val Thr Cys Thr Leu
1 5 10 15
Leu Ala Pro Met Phe Leu Asn Gly Asn Val Asn Ala Val Tyr Ala Asp
20 25 30
Ser Lys Thr Asn Gln Ile Ser Thr Thr Gln Lys Asn Gln Gln Lys Glu
35 40 45
Met Asp Arg Lys Gly Leu Leu Gly Tyr Tyr Phe Lys Gly Lys Asp Phe
50 55 60
Ser Asn Leu Thr Met Phe Ala Pro Thr Arg Asp Ser Thr Leu Ile Tyr
65 70 75 80
Asp Gln Gln Thr Ala Asn Lys Leu Leu Asp Lys Lys Gln Gln Glu Tyr
85 90 95
Gln Ser Ile Arg Trp Ile Gly Leu Ile Gln Ser Lys Glu Thr Gly Asp
100 105 110
Phe Thr Phe Asn Leu Ser Glu Asp Glu Gln Ala Ile Ile Glu Ile Asn
115 120 125
Gly Lys Ile Ile Ser Asn Lys Gly Lys Glu Lys Gln Val Val His Leu
130 135 140
Glu Lys Gly Lys Leu Val Pro Ile Lys Ile Glu Tyr Gln Ser Asp Thr
145 150 155 160
Lys Phe Asn Ile Asp Ser Lys Thr Phe Lys Glu Leu Lys Leu Phe Lys
165 170 175
Ile Asp Ser Gln Asn Gln Pro Gln Gln Val Gln Gln Asp Glu Leu Arg
180 185 190
Asn Pro Glu Phe Asn Lys Lys Glu Ser Gln Glu Phe Leu Ala Lys Pro
195 200 205
Ser Lys Ile Asn Leu Phe Thr Gln Lys Met Lys Arg Glu Ile Asp Glu
210 215 220
Asp Thr Asp Thr Asp Gly Asp Ser Ile Pro Asp Leu Trp Glu Glu Asn
225 230 235 240
Gly Tyr Thr Ile Gln Asn Arg Ile Ala Val Lys Trp Asp Asp Ser Leu
245 250 255
Ala Ser Lys Gly Tyr Thr Lys Phe Val Ser Asn Pro Leu Glu Ser His
260 265 270
Thr Val Gly Asp Pro Tyr Thr Asp Tyr Glu Lys Ala Ala Arg Asp Leu
275 280 285
Asp Leu Ser Asn Ala Lys Glu Thr Phe Asn Pro Leu Val Ala Ala Phe
290 295 300
Pro Ser Val Asn Val Ser Met Glu Lys Val Ile Leu Ser Pro Asn Glu
305 310 315 320
Asn Leu Ser Asn Ser Val Glu Ser His Ser Ser Thr Asn Trp Ser Tyr
325 330 335
Thr Asn Thr Glu Gly Ala Ser Val Glu Ala Gly Ile Gly Pro Lys Gly
340 345 350
Ile Ser Phe Gly Val Ser Val Asn Tyr Gln His Ser Glu Thr Val Ala
355 360 365
Gln Glu Trp Gly Thr Ser Thr Gly Asn Thr Ser Gln Phe Asn Thr Ala
370 375 380
Ser Ala Gly Tyr Leu Asn Ala Asn Val Arg Tyr Asn Asn Val Gly Thr
385 390 395 400
Gly Ala Ile Tyr Asp Val Lys Pro Thr Thr Ser Phe Val Leu Asn Asn
405 410 415
Asp Thr Ile Ala Thr Ile Thr Ala Lys Ser Asn Ser Thr Ala Leu Asn
420 425 430
Ile Ser Pro Gly Glu Ser Tyr Pro Lys Lys Gly Gln Asn Gly Ile Ala
435 440 445
Ile Thr Ser Met Asp Asp Phe Asn Ser His Pro Ile Thr Leu Asn Lys
450 455 460
Lys Gln Val Asp Asn Leu Leu Asn Asn Lys Pro Met Met Leu Glu Thr
465 470 475 480
Asn Gln Thr Asp Gly Val Tyr Lys Ile Lys Asp Thr His Gly Asn Ile
485 490 495
Val Thr Gly Gly Glu Trp Asn Gly Val Ile Gln Gln Ile Lys Ala Lys
500 505 510
Thr Ala Ser Ile Ile Val Asp Asp Gly Glu Arg Val Ala Glu Lys Arg
515 520 525
Val Ala Ala Lys Asp Tyr Glu Asn Pro Glu Asp Lys Thr Pro Ser Leu
530 535 540
Thr Leu Lys Asp Ala Leu Lys Leu Ser Tyr Pro Asp Glu Ile Lys Glu
545 550 555 560
Ile Glu Gly Leu Leu Tyr Tyr Lys Asn Lys Pro Ile Tyr Glu Ser Ser
565 570 575
Val Met Thr Tyr Leu Asp Glu Asn Thr Ala Lys Glu Val Thr Lys Gln
580 585 590
Leu Asn Asp Thr Thr Gly Lys Phe Lys Asp Val Ser His Leu Tyr Asp
595 600 605
Val Lys Leu Thr Pro Lys Met Asn Val Thr Ile Lys Leu Ser Ile Leu
610 615 620
Tyr Asp Asn Ala Glu Ser Asn Asp Asn Ser Ile Gly Lys Trp Thr Asn
625 630 635 640
Thr Asn Ile Val Ser Gly Gly Asn Asn Gly Lys Lys Gln Tyr Ser Ser
645 650 655
Asn Asn Pro Asp Ala Asn Leu Thr Leu Asn Thr Asp Ala Gln Glu Lys
660 665 670
Leu Asn Lys Asn Arg Asp Tyr Tyr Ile Ser Leu Tyr Met Lys Ser Glu
675 680 685
Lys Asn Thr Gln Cys Glu Ile Thr Ile Asp Gly Glu Ile Tyr Pro Ile
690 695 700
Thr Thr Lys Thr Val Asn Val Asn Lys Asp Asn Tyr Lys Arg Leu Asp
705 710 715 720
Ile Ile Ala His Asn Ile Lys Ser Asn Pro Ile Ser Ser Leu His Ile
725 730 735
Lys Thr Asn Asp Glu Ile Thr Leu Phe Trp Asp Asp Ile Ser Ile Thr
740 745 750
Asp Val Ala Ser Ile Lys Pro Glu Asn Leu Thr Asp Ser Glu Ile Lys
755 760 765
Gln Ile Tyr Ser Arg Tyr Gly Ile Lys Leu Glu Asp Gly Ile Leu Ile
770 775 780
Asp Lys Lys Gly Gly Ile His Tyr Gly Glu Phe Ile Asn Glu Ala Ser
785 790 795 800
Phe Asn Ile Glu Pro Leu Gln Asn Tyr Val Thr Lys Tyr Glu Val Thr
805 810 815
Tyr Ser Ser Glu Leu Gly Pro Asn Val Ser Asp Thr Leu Glu Ser Asp
820 825 830
Lys Ile Tyr Lys Asp Gly Thr Ile Lys Phe Asp Phe Thr Lys Tyr Ser
835 840 845
Lys Asn Glu Gln Gly Leu Phe Tyr Asp Ser Gly Leu Asn Trp Asp Phe
850 855 860
Lys Ile Asn Ala Ile Thr Tyr Asp Gly Lys Glu Met Asn Val Phe His
865 870 875 880
Arg Tyr Asn Lys
(2)关于SEQ ID NO:6的信息
(i)序列特征
(A)长度:2004个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:DNA(基因组)
(iii)假拟:否
(iv)反义:否
(vi)原始来源
(A)生物:蜡状芽孢杆菌
(B)菌株:AB78
(C)特定分离物:NRRL B-21058
(ix)特征:
(A)名称/键:CDS
(B)位置:1..2001
(D)其它信息:/产品=“80kDa VIP1A(a)蛋白”,/备注=“该序列与SEQ ID NO:1的从3126到5126位的核苷酸相同。”
(xi)序列描述:SEQ ID NO:6:
ATG AAA AGG GAA ATT GAT GAA GAC ACG GAT ACG GAT GGG GAC TCT ATT 48
Met Lys Arg Glu Ile Asp Glu Asp Thr Asp Thr Asp Gly Asp Ser Ile
885 890 895 900
CCT GAC CTT TGG GAA GAA AAT GGG TAT ACG ATT CAA AAT AGA ATC GCT 96
Pro Asp Leu Trp Glu Glu Asn Gly Tyr Thr Ile Gln Asn Arg Ile Ala
905 910 915
GTA AAG TGG GAC GAT TCT CTA GCA AGT AAA GGG TAT ACG AAA TTT GTT 144
Val Lys Trp Asp Asp Ser Leu Ala Ser Lys Gly Tyr Thr Lys Phe Val
920 925 930
TCA AAT CCA CTA GAA AGT CAC ACA GTT GGT GAT CCT TAT ACA GAT TAT 192
Ser Asn Pro Leu Glu Ser His Thr Val Gly Asp Pro Tyr Thr Asp Tyr
935 940 945
GAA AAG GCA GCA AGA GAT CTA GAT TTG TCA AAT GCA AAG GAA ACG TTT 240
Glu Lys Ala Ala Arg Asp Leu Asp Leu Ser Asn Ala Lys Glu Thr Phe
950 955 960
AAC CCA TTG GTA GCT GCT TTT CCA AGT GTG AAT GTT AGT ATG GAA AAG 288
Asn Pro Leu Val Ala Ala Phe Pro Ser Val Asn Val Ser Met Glu Lys
965 970 975 980
GTG ATA TTA TCA CCA AAT GAA AAT TTA TCC AAT AGT GTA GAG TCT CAT 336
Val Ile Leu Ser Pro Asn Glu Asn Leu Ser Asn Ser Val Glu Ser His
985 990 995
TCA TCC ACG AAT TGG TCT TAT ACA AAT ACA GAA GGT GCT TCT GTT GAA 384
Ser Ser Thr Asn Trp Ser Tyr Thr Asn Thr Glu Gly Ala Ser Val Glu
1000 1005 1010
GCG GGG ATT GGA CCA AAA GGT ATT TCG TTC GGA GTT AGC GTA AAC TAT 432
Ala Gly Ile Gly Pro Lys Gly Ile Ser Phe Gly Val Ser Val Asn Tyr
1015 1020 1025
CAA CAC TCT GAA ACA GTT GCA CAA GAA TGG GGA ACA TCT ACA GGA AAT 480
Gln His Ser Glu Thr Val Ala Gln Glu Trp Gly Thr Ser Thr Gly Asn
1030 1035 1040
ACT TCG CAA TTC AAT ACG GCT TCA GCG GGA TAT TTA AAT GCA AAT GTT 528
Thr Ser Gln Phe Asn Thr Ala Ser Ala Gly Tyr Leu Asn Ala Asn Val
1045 1050 1055 1060
CGA TAT AAC AAT GTA GGA ACT GGT GCC ATC TAC GAT GTA AAA CCT ACA 576
Arg Tyr Asn Asn Val Gly Thr Gly Ala Ile Tyr Asp Val Lys Pro Thr
1065 1070 1075
ACA AGT TTT GTA TTA AAT AAC GAT ACT ATC GCA ACT ATT ACG GCG AAA 624
Thr Ser Phe Val Leu Asn Asn Asp Thr Ile Ala Thr Ile Thr Ala Lys
1080 1085 1090
TCT AAT TCT ACA GCC TTA AAT ATA TCT CCT GGA GAA AGT TAC CCG AAA 672
Ser Asn Ser Thr Ala Leu Asn Ile Ser Pro Gly Glu Ser Tyr Pro Lys
1095 1100 1105
AAA GGA CAA AAT GGA ATC GCA ATA ACA TCA ATG GAT GAT TTT AAT TCC 720
Lys Gly Gln Asn Gly Ile Ala Ile Thr Ser Met Asp Asp Phe Asn Ser
1110 1115 1120
CAT CCG ATT ACA TTA AAT AAA AAA CAA GTA GAT AAT CTG CTA AAT AAT 768
His Pro Ile Thr Leu Asn Lys Lys Gln Val Asp Asn Leu Leu Asn Asn
1125 1130 1135 1140
AAA CCT ATG ATG TTG GAA ACA AAC CAA ACA GAT GGT GTT TAT AAG ATA 816
Lys Pro Met Met Leu Glu Thr Asn Gln Thr Asp Gly Val Tyr Lys Ile
1145 1150 1155
AAA GAT ACA CAT GGA AAT ATA GTA ACT GGC GGA GAA TGG AAT GGT GTC 864
Lys Asp Thr His Gly Asn Ile Val Thr Gly Gly Glu Trp Asn Gly Val
GAA AAG GCA GCA AGA GAT CTA GAT TTG TCA AAT GCA AAG GAA ACG TTT 240
Glu Lys Ala Ala Arg Asp Leu Asp Leu Ser Asn Ala Lys Glu Thr Phe
950 955 960
AAC CCA TTG GTA GCT GCT TTT CCA AGT GTG AAT GTT AGT ATG GAA AAG 288
Asn Pro Leu Val Ala Ala Phe Pro Ser Val Asn Val Ser Met Glu Lys
965 970 975 980
GTG ATA TTA TCA CCA AAT GAA AAT TTA TCC AAT AGT GTA GAG TCT CAT 336
Val Ile Leu Ser Pro Asn Glu Asn Leu Ser Asn Ser Val Glu Ser His
985 990 995
TCA TCC ACG AAT TGG TCT TAT ACA AAT ACA GAA GGT GCT TCT GTT GAA 384
Ser Ser Thr Asn Trp Ser Tyr Thr Asn Thr Glu Gly Ala Ser Val Glu
1000 1005 1010
GCG GGG ATT GGA CCA AAA GGT ATT TCG TTC GGA GTT AGC GTA AAC TAT 432
Ala Gly Ile Gly Pro Lys Gly Ile Ser Phe Gly Val Ser Val Asn Tyr
1015 1020 1025
CAA CAC TCT GAA ACA GTT GCA CAA GAA TGG GGA ACA TCT ACA GGA AAT 480
Gln His Ser Glu Thr Val Ala Gln Glu Trp Gly Thr Ser Thr Gly Asn
1030 1035 1040
ACT TCG CAA TTC AAT ACG GCT TCA GCG GGA TAT TTA AAT GCA AAT GTT 528
Thr Ser Gln Phe Asn Thr Ala Ser Ala Gly Tyr Leu Asn Ala Asn Val
1045 1050 1055 1060
CGA TAT AAC AAT GTA GGA ACT GGT GCC ATC TAC GAT GTA AAA CCT ACA 576
Arg Tyr Asn Asn Val Gly Thr Gly Ala Ile Tyr Asp Val Lys Pro Thr
1065 1070 1075
ACA AGT TTT GTA TTA AAT AAC GAT ACT ATC GCA ACT ATT ACG GCG AAA 624
Thr Ser Phe Val Leu Asn Asn Asp Thr Ile Ala Thr Ile Thr Ala Lys
1080 1085 1090
TCT AAT TCT ACA GCC TTA AAT ATA TCT CCT GGA GAA AGT TAC CCG AAA 672
Ser Asn Ser Thr Ala Leu Asn Ile Ser Pro Gly Glu Ser Tyr Pro Lys
1095 1100 1105
AAA GGA CAA AAT GGA ATC GCA ATA ACA TCA ATG GAT GAT TTT AAT TCC 720
Lys Gly Gln Asn Gly Ile Ala Ile Thr Ser Met Asp Asp Phe Asn Ser
1110 1115 1120
CAT CCG ATT ACA TTA AAT AAA AAA CAA GTA GAT AAT CTG CTA AAT AAT 768
His Pro Ile Thr Leu Asn Lys Lys Gln Val Asp Asn Leu Leu Asn Asn
1125 1130 1135 1140
AAA CCT ATG ATG TTG GAA ACA AAC CAA ACA GAT GGT GTT TAT AAG ATA 816
Lys Pro Met Met Leu Glu Thr Asn Gln Thr Asp Gly Val Tyr Lys Ile
1145 1150 1155
AAA GAT ACA CAT GGA AAT ATA GTA ACT GGC GGA GAA TGG AAT GGT GTC 864
Lys Asp Thr His Gly Asn Ile Val Thr Gly Gly Glu Trp Asn Gly Val
1160 1165 1170
ATA CAA CAA ATC AAG GCT AAA ACA GCG TCT ATT ATT GTG GAT GAT GGG 912
Ile Gln Gln Ile Lys Ala Lys Thr Ala Ser Ile Ile Val Asp Asp Gly
1175 1180 1185
GAA CGT GTA GCA GAA AAA CGT GTA GCG GCA AAA GAT TAT GAA AAT CCA 960
Glu Arg Val Ala Glu Lys Arg Val Ala Ala Lys Asp Tyr Glu Asn Pro
1190 1195 1200
GAA GAT AAA ACA CCG TCT TTA ACT TTA AAA GAT GCC CTG AAG CTT TCA 1008
Glu Asp Lys Thr Pro Ser Leu Thr Leu Lys Asp Ala Leu Lys Leu Ser
1205 1210 1215 1220
TAT CCA GAT GAA ATA AAA GAA ATA GAG GGA TTA TTA TAT TAT AAA AAC 1056
Tyr Pro Asp Glu Ile Lys Glu Ile Glu Gly Leu Leu Tyr Tyr Lys Asn
1225 1230 1235
AAA CCG ATA TAC GAA TCG AGC GTT ATG ACT TAC TTA GAT GAA AAT ACA 1104
Lys Pro Ile Tyr Glu Ser Ser Val Met Thr Tyr Leu Asp Glu Asn Thr
1240 1245 1250
GCA AAA GAA GTG ACC AAA CAA TTA AAT GAT ACC ACT GGG AAA TTT AAA 1152
Ala Lys Glu Val Thr Lys Gln Leu Asn Asp Thr Thr Gly Lys Phe Lys
1255 1260 1265
GAT GTA AGT CAT TTA TAT GAT GTA AAA CTG ACT CCA AAA ATG AAT GTT 1200
Asp Val Ser His Leu Tyr Asp Val Lys Leu Thr Pro Lys Met Asn Val
1270 1275 1280
ACA ATC AAA TTG TCT ATA CTT TAT GAT AAT GCT GAG TCT AAT GAT AAC 1248
Thr Ile Lys Leu Ser Ile Leu Tyr Asp Asn Ala Glu Ser Asn Asp Asn
1285 1290 1295 1300
TCA ATT GGT AAA TGG ACA AAC ACA AAT ATT GTT TCA GGT GGA AAT AAC 1296
Ser Ile Gly Lys Trp Thr Asn Thr Asn Ile Val Ser Gly Gly Asn Asn
1305 1310 1315
GGA AAA AAA CAA TAT TCT TCT AAT AAT CCG GAT GCT AAT TTG ACA TTA 1344
Gly Lys Lys Gln Tyr Ser Ser Asn Asn Pro Asp Ala Asn Leu Thr Leu
1320 1325 1330
AAT ACA GAT GCT CAA GAA AAA TTA AAT AAA AAT CGT GAC TAT TAT ATA 1392
Asn Thr Asp Ala Gln Glu Lys Leu Asn Lys Asn Arg Asp Tyr Tyr Ile
1335 1340 1345
AGT TTA TAT ATG AAG TCA GAA AAA AAC ACA CAA TGT GAG ATT ACT ATA 1440
Ser Leu Tyr Met Lys Ser Glu Lys Asn Thr Gln Cys Glu Ile Thr Ile
1350 1355 1360
GAT GGG GAG ATT TAT CCG ATC ACT ACA AAA ACA GTG AAT GTG AAT AAA 1488
Asp Gly Glu Ile Tyr Pro Ile Thr Thr Lys Thr Val Asn Val Asn Lys
1365 1370 1375 1380
GAC AAT TAC AAA AGA TTA GAT ATT ATA GCT CAT AAT ATA AAA AGT AAT 1536
Asp Asn Tyr Lys Arg Leu Asp Ile Ile Ala His Asn Ile Lys Ser Asn
1385 1390 1395
CCA ATT TCT TCA CTT CAT ATT AAA ACG AAT GAT GAA ATA ACT TTA TTT 1584
Pro Ile Ser Ser Leu His Ile Lys Thr Asn Asp Glu Ile Thr Leu Phe
1400 1405 1410
TGG GAT GAT ATT TCT ATA ACA GAT GTA GCA TCA ATA AAA CCG GAA AAT 1632
Trp Asp Asp Ile Ser Ile Thr Asp Val Ala Ser Ile Lys Pro Glu Asn
1415 1420 1425
TTA ACA GAT TCA GAA ATT AAA CAG ATT TAT AGT AGG TAT GGT ATT AAG 1680
Leu Thr Asp Ser Glu Ile Lys Gln Ile Tyr Ser Arg Tyr Gly Ile Lys
1430 1435 1440
TTA GAA GAT GGA ATC CTT ATT GAT AAA AAA GGT GGG ATT CAT TAT GGT 1728
Leu Glu Asp Gly Ile Leu Ile Asp Lys Lys Gly Gly Ile His Tyr Gly
1445 1450 1455 1460
GAA TTT ATT AAT GAA GCT AGT TTT AAT ATT GAA CCA TTG CCA AAT TAT 1776
Glu Phe Ile Asn Glu Ala Ser Phe Asn Ile Glu Pro Leu Pro Asn Tyr
1465 1470 1475
GTG ACC AAA TAT GAA GTT ACT TAT AGT AGT GAG TTA GGA CCA AAC GTG 1824
Val Thr Lys Tyr Glu Val Thr Tyr Ser Ser Glu Leu Gly Pro Asn Val
1480 1485 1490
AGT GAC ACA CTT GAA AGT GAT AAA ATT TAC AAG GAT GGG ACA ATT AAA 1872
Ser Asp Thr Leu Glu Ser Asp Lys Ile Tyr Lys Asp Gly Thr Ile Lys
1495 1500 1505
TTT GAT TTT ACC AAA TAT AGT AAA AAT GAA CAA GGA TTA TTT TAT GAC 1920
Phe Asp Phe Thr Lys Tyr Ser Lys Asn Glu Gln Gly Leu Phe Tyr Asp
1510 1515 1520
AGT GGA TTA AAT TGG GAC TTT AAA ATT AAT GCT ATT ACT TAT GAT GGT 1968
Ser Gly Leu Asn Trp Asp Phe Lys Ile Asn Ala Ile Thr Tyr Asp Gly
1525 1530 1535 1540
AAA GAG ATG AAT GTT TTT CAT AGA TAT AAT AAA TAG 2004
Lys Glu Met Asn Val Phe His Arg Tyr Asn Lys
1545 1550
(2)关于SEQ ID NO:7的信息
(i)序列特征
(A)长度:667个氨基酸
(B)类型:氨基酸
(D)拓扑结构:线型
(ii)分子类型:蛋白质
(xi)序列描述:SEQ ID NO:7:
Met Lys Arg Glu Ile Asp Glu Asp Thr Asp Thr Asp Gly Asp Ser Ile
1 5 10 15
Pro Asp Leu Trp Glu Glu Asn Gly Tyr Thr Ile Gln Asn Arg Ile Ala
20 25 30
Val Lys Trp Asp Asp Ser Leu Ala Ser Lys Gly Tyr Thr Lys Phe Val
35 40 45
Ser Asn Pro Leu Glu Ser His Thr Val Gly Asp Pro Tyr Thr Asp Tyr
50 55 60
Glu Lys Ala Ala Arg Asp Leu Asp Leu Ser Asn Ala Lys Glu Thr Phe
65 70 75 80
Asn Pro Leu Val Ala Ala Phe Pro Ser Val Asn Val Ser Met Glu Lys
85 90 95
Val Ile Leu Ser Pro Asn Glu Asn Leu Ser Asn Ser Val Glu Ser His
100 105 110
Ser Ser Thr Asn Trp Ser Tyr Thr Asn Thr Glu Gly Ala Ser Val Glu
115 120 125
Ala Gly Ile Gly Pro Lys Gly Ile Ser Phe Gly Val Ser Val Asn Tyr
130 135 140
Gln His Ser Glu Thr Val Ala Gln Glu Trp Gly Thr Ser Thr Gly Asn
145 150 155 160
Thr Ser Gln Phe Asn Thr Ala Ser Ala Gly Tyr Leu Asn Ala Asn Val
165 170 175
Arg Tyr Asn Asn Val Gly Thr Gly Ala Ile Tyr Asp Val Lys Pro Thr
180 185 190
Thr Ser Phe Val Leu Asn Asn Asp Thr Ile Ala Thr Ile Thr Ala Lys
195 200 205
Ser Asn Ser Thr Ala Leu Asn Ile Ser Pro Gly Glu Ser Tyr Pro Lys
210 215 220
Lys Gly Gln Asn Gly Ile Ala Ile Thr Ser Met Asp Asp Phe Asn Ser
225 230 235 240
His Pro Ile Thr Leu Asn Lys Lys Gln Val Asp Asn Leu Leu Asn Asn
245 250 255
Lys Pro Met Met Leu Glu Thr Asn Gln Thr Asp Gly Val Tyr Lys Ile
260 265 270
Lys Asp Thr His Gly Asn Ile Val Thr Gly Gly Glu Trp Asn Gly Val
275 280 285
Ile Gln Gln Ile Lys Ala Lys Thr Ala Ser Ile Ile Val Asp Asp Gly
290 295 300
Glu Arg Val Ala Glu Lys Arg Val Ala Ala Lys Asp Tyr Glu Asn Pro
305 310 315 320
Glu Asp Lys Thr Pro Ser Leu Thr Leu Lys Asp Ala Leu Lys Leu Ser
325 330 335
Tyr Pro Asp Glu Ile Lys Glu Ile Glu Gly Leu Leu Tyr Tyr Lys Asn
340 345 350
Lys Pro Ile Tyr Glu Ser Ser Val Met Thr Tyr Leu Asp Glu Asn Thr
355 360 365
Ala Lys Glu Val Thr Lys Gln Leu Asn Asp Thr Thr Gly Lys Phe Lys
370 375 380
Asp Val Ser His Leu Tyr Asp Val Lys Leu Thr Pro Lys Met Asn Val
385 390 395 400
Thr Ile Lys Leu Ser Ile Leu Tyr Asp Asn Ala Glu Ser Asn Asp Asn
405 410 415
Ser Ile Gly Lys Trp Thr Asn Thr Asn Ile Val Ser Gly Gly Asn Asn
420 425 430
Gly Lys Lys Gln Tyr Ser Ser Asn Asn Pro Asp Ala Asn Leu Thr Leu
435 440 445
Asn Thr Asp Ala Gln Glu Lys Leu Asn Lys Asn Arg Asp Tyr Tyr Ile
450 455 460
Ser Leu Tyr Met Lys Ser Glu Lys Asn Thr Gln Cys Glu Ile Thr Ile
465 470 475 480
Asp Gly Glu Ile Tyr Pro Ile Thr Thr Lys Thr Val Asn Val Asn Lys
485 490 495
Asp Asn Tyr Lys Arg Leu Asp Ile Ile Ala His Asn Ile Lys Ser Asn
500 505 510
Pro Ile Ser Ser Leu His Ile Lys Thr Asn Asp Glu Ile Thr Leu Phe
515 520 525
Trp Asp Asp Ile Ser Ile Thr Asp Val Ala Ser Ile Lys Pro Glu Asn
530 535 540
Leu Thr Asp Ser Glu Ile Lys Gln Ile Tyr Ser Arg Tyr Gly Ile Lys
545 550 555 560
Leu Glu Asp Gly Ile Leu Ile Asp Lys Lys Gly Gly Ile His Tyr Gly
565 570 575
Glu Phe Ile Asn Glu Ala Ser Phe Asn Ile Glu Pro Leu Pro Asn Tyr
580 585 590
Val Thr Lys Tyr Glu Val Thr Tyr Ser Ser Glu Leu Gly Pro Asn Val
595 600 605
Ser Asp Thr Leu Glu Ser Asp Lys Ile Tyr Lys Asp Gly Thr Ile Lys
610 c615 620
Phe Asp Phe Thr Lys Tyr Ser Lys Asn Glu Gln Gly Leu Phe Tyr Asp
625 630 635 640
Ser Gly Leu Asn Trp Asp Phe Lys Ile Asn Ala Ile Thr Tyr Asp Gly
645 650 655
Lys Glu Met Asn Val Phe His Arg Tyr Asn Lys
660 665
(2)关于SEQ ID NO:8的信息
(i)序列特征
(A)长度:16个氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:肽
(iii)假拟:否
(v)片段类型:N-端
(vi)原始来源
(A)生物:蜡状芽孢杆菌
(B)菌株:AB78
(C)特定分离物:NRRL B-21058
(ix)特征:
(A)名称/键:肽
(B)位置:1..16
(D)其它信息:/备注=从AB18菌株纯化的蛋白的N端序列
(xi)序列描述:SEQ ID NO:8:
Lys Arg Glu Ile Asp Glu Asp Thr Asp Thr Asx Gly Asp Ser Ile Pro
1 5 10 15
(2)关于SEQ ID NO:9的信息
(i)序列特征
(A)长度:21个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:DNA(基因组)
(iii)假拟:否
(iv)反义:否
(ix)特征:
(A)名称/键:misc_feature
(B)位置: 1..21
(D)其它信息:/备注=“基于SEQ ID NO:8的3到9位氨基酸的寡核苷酸探针,使用苏云金芽孢杆菌的密码子用法。”
(xi)序列描述:SEQ ID NO:9:
GAAATTGATC AAGATACNGA T 21
(2)关于SEQ ID NO:10的信息
(i)序列特征
(A)长度:14个氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:蛋白质
(iii)假拟:否
(v)片段类型:N-端
(vi)原始来源
(A)生物:苏云金芽孢杆菌
(B)菌株:AB88
(ix)特征:
(A)名称/键:肽
(B)位置:1..14
(D)其它信息:/备注=“称为阴离子交换组分23的蛋白的N端氨基酸序列(更小的)。”
(xi)序列描述:SEQ ID NO:10:
Xaa Glu Pro Phe Val Ser Ala Xaa Xaa Xaa Gln Xaa Xaa Xaa
1 5 10
(2)关于SEQ ID NO:11的信息
(i)序列特征
(A)长度:13个氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:N-端
(vi)原始来源
(A)生物:苏云金芽孢杆菌
(xi)序列描述:SEQ ID NO:11:
Xaa Glu Tyr Glu Asn Val Glu Pro Phe Val Ser Ala Xaa
1 5 10
(2)关于SEQ ID NO:12的信息
(i)序列特征
(A)长度:14个氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:N-端
(vi)原始来源
(A)生物:苏云金芽孢杆菌
(xi)序列描述:SEQ ID NO:12:
Met Asn Lys Asn Asn Thr Lys Leu Pro Thr Arg Ala Leu Pro
1 5 10
(2)关于SEQ ID NO:13的信息
(i)序列特征
(A)长度:15个氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:蛋白质
(iii)假拟:否
(v)片段类型:N-端
(vi)原始来源
(A)生物:苏云金芽孢杆菌
(B)菌株:AB88
(ix)特征:
(A)名称/键:肽
(B)位置:1..15
(D)其它信息:/备注=“对Agrotis ipsilon有活性的35kDa VIP的N端氨基酸序列。”
(xi)序列描述:SEQ ID NO:13:
Ala Leu Ser Glu Asn Thr Gly Lys Asp Gly Gly Ile Val Pro
1 5 10 15
(2)关于SEQ ID NO:14的信息
(i)序列特征
(A)长度:9个氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:N-端
(vi)原始来源
(A)生物:苏云金芽孢杆菌
(xi)序列描述:SEQ ID NO:14:
Met Asp Asn Asn Pro Asn Ile Asn Glu
1 5
(2)关于SEQ ID NO:15的信息
(i)序列特征
(A)长度:9个氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:蛋白质
(iii)假拟:否
(v)片段类型:N-端
(ix)特征:
(A)名称/键:肽
(B)位置:1..9
(D)其它信息:/备注=“80kDa δ-内毒素的N端序列。”
(xi)序列描述:SEQ ID NO:15:
Met Asp Asn Asn Pro Asn Ile Asn Glu
1 5
(2)关于SEQ ID NO:16的信息
(i)序列特征
(A)长度:11个氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:肽
(iii)假拟:否
(v)片段类型:N-端
(vi)原始来源
(A)生物:苏云金芽孢杆菌
(ix)特征:
(A)名称/键:肽
(B)位置:1..11
(D)其它信息:/备注=“60kDa δ-内毒素的N端序列。”
(xi)序列描述:SEQ ID NO:16:
Met Asn Val Leu Asn Ser Gly Arg Thr Thr Ile
1 5 10
(2)关于SEQ ID NO:17的信息
(i)序列特征
(A)长度:2655个核苷酸
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:DNA(基因组)
(iii)假拟:否
(iv)反义:否
(ix)特征:
(A)名称/键:misc_feathure
(B)位置:1..2652
(D)其它信息:/备注=“来自AB78的编码100kd VIP1A(a)蛋白的玉米最佳化DNA序列。”
(xi)序列描述:SEQ ID NO:17:
ATGAAGAACA TGAAGAAGAA GCTGGCCAGC GTGGTGACCT GCACCCTGCT GGCCCCCATG 60
TTCCTGAACG GCAACGTGAA CGCCGTGTAC GCCGACAGCA AGACCAACCA GATCAGCACC 120
ACCCAGAAGA ACCAGCAGAA GGAGATGGAC CGCAAGGGCC TGCTGGGCTA CTACTTCAAG 180
GGCAAGGACT TCAGCAACCT GACCATGTTC GCCCCCACGC GTGACAGCAC CCTGATCTAC 240
GACCAGCAGA CCGCCAACAA GCTGCTGGAC AAGAAGCAGC AGGAGTACCA GAGCATCCGC 300
TGGATCGGCC TGATCCAGAG CAAGGAGACC GGCGACTTCA CCTTCAACCT GAGCGAGGAC 360
GAGCAGGCCA TCATCGAGAT CAACGGCAAG ATCATCAGCA ACAAGGGCAA GGAGAAGCAG 420
GTGGTGCACC TGGAGAAGGG CAAGCTGGTG CCCATCAAGA TCGAGTACCA GAGCGACACC 480
AAGTTCAACA TCGACAGCAA GACCTTCAAG GAGCTGAAGC TTTTCAAGAT CGACAGCCAG 540
AACCAGCCCC AGCAGGTGCA GCAGGACGAG CTGCGCAACC CCGAGTTCAA CAAGAAGGAG 600
AGCCAGGAGT TCCTGGCCAA GCCCAGCAAG ATCAACCTGT TCACCCAGCA GATGAAGCGC 660
GAGATCGACG AGGACACCGA CACCGACGGC GACAGCATCC CCGACCTGTG GGAGGAGAAC 720
GGCTACACCA TCCAGAACCG CATCGCCGTG AAGTGGGACG ACAGCCTGGC TAGCAAGGGC 780
TACACCAAGT TCGTGAGCAA CCCCCTGGAG AGCCACACCG TGGGCGACCC CTACACCGAC 840
TACGAGAAGG CCGCCCGCGA CCTGGACCTG AGCAACGCCA AGGAGACCTT CAACCCCCTG 900
GTGGCCGCCT TCCCCAGCGT GAACGTGAGC ATGGAGAAGG TGATCCTGAG CCCCAACGAG 960
AACCTGAGCA ACAGCGTGGA GAGCCACTCG AGCACCAACT GGAGCTACAC CAACACCGAG 1020
GGCGCCAGCG TGGAGGCCGG CATCGGTCCC AAGGGCATCA GCTTCGGCGT GAGCGTGAAC 1080
TACCAGCACA GCGAGACCGT GGCCCAGGAG TGGGGCACCA GCACCGGCAA CACCAGCCAG 1140
TTCAACACCG CCAGCGCCGG CTACCTGAAC GCCAACGTGC GCTACAACAA CGTGGGCACC 1200
GGCGCCATCT ACGACGTGAA GCCCACCACC AGCTTCGTGC TGAACAACGA CACCATCGCC 1260
ACCATCACCG CCAAGTCGAA TTCCACCGCC CTGAACATCA GCCCCGGCGA GAGCTACCCC 1320
AAGAAGGGCC AGAACGGCAT CGCCATCACC AGCATGGACG ACTTCAACAG CCACCCCATC 1380
ACCCTGAACA AGAAGCAGGT GGACAACCTG CTGAACAACA AGCCCATGAT GCTGGAGACC 1440
AACCAGACCG ACGGCGTCTA CAAGATCAAG GACACCCACG GCAACATCGT GACCGGCGGC 1500
GAGTGGAACG GCGTGATCCA GCAGATCAAG GCCAAGACCG CCAGCATCAT CGTCGACGAC 1560
GGCGAGCGCG TGGCCGAGAA GCGCGTGGCC GCCAAGGACT ACGAGAACCC CGAGGACAAG 1620
ACCCCCAGCC TGACCCTGAA GGACGCCCTG AAGCTGAGCT ACCCCGACGA GATCAAGGAG 1680
ATCGAGGGCC TGCTGTACTA CAAGAACAAG CCCATCTACG AGAGCAGCGT GATGACCTAT 1740
CTAGACGAGA ACACCGCCAA GGAGGTGACC AAGCAGCTGA ACGACACCAC CGGCAAGTTC 1800
AAGGACGTGA GCCACCTGTA CGACGTGAAG CTGACCCCCA AGATGAACGT GACCATCAAG 1860
CTGAGCATCC TGTACGACAA CGCCGAGAGC AACGACAACA GCATCGGCAA GTGGACCAAC 1920
ACCAACATCG TGAGCGGCGG CAACAACGGC AAGAAGCAGT ACAGCAGCAA CAACCCCGAC 1980
GCCAACCTGA CCCTGAACAC CGACGCCCAG GAGAAGCTGA ACAAGAACCG CGACTACTAC 2040
ATCAGCCTGT ACATGAAGAG CGAGAAGAAC ACCCAGTGCG AGATCACCAT CGACGGCGAG 2100
ATATACCCCA TCACCACCAA GACCGTGAAC GTGAACAAGG ACAACTACAA GCGCCTGGAC 2160
ATCATCGCCC ACAACATCAA GAGCAACCCC ATCAGCAGCC TGCACATCAA GACCAACGAC 2220
GAGATCACCC TGTTCTGGGA CGACATATCG ATTACCGACG TCGCCAGCAT CAAGCCCGAG 2280
AACCTGACCG ACAGCGAGAT CAAGCAGATA TACAGTCGCT ACGGCATCAA GCTGGAGGAC 2340
GGCATCCTGA TCGACAAGAA GGGCGGCATC CACTACGGCG AGTTCATCAA CGAGGCCAGC 2400
TTCAACATCG AGCCCCTGCA GAACTACGTG ACCAAGTACG AGGTGACCTA CAGCAGCGAG 2460
CTGGGCCCCA ACGTGAGCGA CACCCTGGAG AGCGACAAGA TTTACAAGGA CGGCACCATC 2520
AAGTTCGACT TCACCAAGTA CAGCAAGAAC GAGCAGGGCC TGTTCTACGA CAGCGGGCCTG 2580
AACTGGGACT TCAAGATCAA CGCCATCACC TACGACGGCA AGGAGATGAA CGTGTTCCAC 2640
CGCTACAACA AGTAG 2655
(2)关于SEQ ID NO:18的信息
(i)序列特征
(A)长度:2004个核苷酸
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:DNA(基因组)
(iii)假拟:否
(iv)反义:否
(ix)特征:
(A)名称/键:misc_feathure
(B)位置:1..2004
(D)其它信息:/备注=“来自AB78的编码80kd VIP1A(a)蛋白的玉米最佳化DNA序列。”
(xi)序列描述:SEQ ID NO:18:
ATGAAGCGCG AGATCGACGA GGACACCGAC ACCGACGGCG ACAGCATCCC CGACCTGTGG 60
GAGGAGAACG GCTACACCAT CCAGAACCGC ATCGCCGTGA AGTGGGACGA CAGCCTGGCT 120
AGCAAGGGCT ACACCAAGTT CGTGAGCAAC CCCCTGGAGA GCCACACCGT GGGCGACCCC 180
TACACCGACT ACGAGAAGGC CGCCCGCGAC CTGGACCTGA GCAACGCCAA GGAGACCTTC 240
AACCCCCTGG TGGCCGCCTT CCCCAGCGTG AACGTGAGCA TGGAGAAGGT GATCCTGAGC 300
CCCAACGAGA ACCTGAGCAA CAGCGTGGAG AGCCACTCGA GCACCAACTG GAGCTACACC 360
AACACCGAGG GCGCCAGCGT GGAGGCCGGC ATCGGTCCCA AGGGCATCAG CTTCGGCGTG 420
AGCGTGAACT ACCAGCACAG CGAGACCGTG GCCCAGGAGT GGGGCACCAG CACCGGCAAC 480
ACCAGCCAGT TCAACACCGC CAGCGCCGGC TACCTGAACG CCAACGTGCG CTACAACAAC 540
GTGGGCACCG GCGCCATCTA CGACGTGAAG CCCACCACCA GCTTCGTGCT GAACAACGAC 600
ACCATCGCCA CCATCACCGC CAAGTCGAAT TCCACCGCCC TGAACATCAG CCCCGGCGAG 660
AGCTACCCCA AGAAGGGCCA GAACGGCATC GCCATCACCA GCATGGACGA CTTCAACAGC 720
CACCCCATCA CCCTGAACAA GAAGCAGGTG GACAACCTGC TGAACAACAA GCCCATGATG 780
CTGGAGACCA ACCAGACCGA CGGCGTCTAC AAGATCAAGG ACACCCACGG CAAGATCGTG 840
ACCGGCGGCG AGTGGAACGG CGTGATCCAG CAGATCAAGG CCAAGACCGC CAGGATCATC 900
GTCGACGACG GCGAGCGCGT GGCCGAGAAG CGCGTGGCCG CCAAGGACTA CGAGAACCCC 960
GAGGACAAGA CCCCCAGCCT GACCCTGAAG GACGCCCTGA AGCTGAGCTA CCCCGACGAG 1020
ATCAAGGAGA TCGAGGGCCT GCTGTACTAC AAGAACAAGC CCATCTACGA GAGCAGCGTG 1080
ATGACCTATC TAGACGAGAA CACCGCCAAG GAGGTGACCA AGCAGCTGAA CGACACCACC 1140
GGCAAGTTCA AGGACGTGAG CCACCTGTAC GACGTGAAGC TGACCCCCAA GATGAACGTG 1200
ACCATCAAGC TGAGCATCCT GTACGACAAC GCCGAGAGCA ACGACAACAG CATCGGCAAG 1260
TGGACCAACA CCAACATCGT GAGCGGCGGC AACAACGGCA AGAAGCAGTA CAGCAGCAAC 1320
AACCCCGACG CCAACCTGAC CCTGAACACC GACGCCCAGG AGAAGCTGAA CAAGAACCGC 1380
GACTACTACA TCAGCCTGTA CATGAAGAGC GAGAAGAACA CCCAGTGCGA GATCACCATC 1440
GACGGCGAGA TATACCCCAT CACCACCAAG ACCGTGAACG TGAACAAGGA CAACTACAAG 1500
CGCCTGGACA TCATCGCCCA CAACATCAAG AGCAACCCCA TCAGCAGCCT GCACATCAAG 1560
ACCAACGACG AGATCACCCT GTTCTGGGAC GACATATCGA TTACCGACGT CGCCAGCATC 1620
AAGCCCGAGA ACCTGACCGA CAGCGAGATC AAGCAGATAT ACAGTCGCTA CGGCATCAAG 1680
CTGGAGGACG GCATCCTGAT CGACAAGAAG GGCGGCATCC ACTACGGCGA GTTCATCAAC 1740
GAGGCCAGCT TCAACATCGA GCCCCTGCAG AACTACGTGA CCAAGTACGA GGTGACCTAC 1800
AGCAGCGAGC TGGGCCCCAA CGTGAGCGAC ACCCTGGAGA GCGACAAGAT TTACAAGGAC 1860
GGCACCATCA AGTTCGACTT CACCAAGTAC AGCAAGAACG AGCAGGGCCT GTTCTACGAC 1920
AGCGGCCTGA ACTGGGACTT CAAGATCAAC GCCATCACCT ACGACGGCAA GGAGATGAAC 1980
GTGTTCCACC GCTACAACAA GTAG 2004
(2)关于SEQ ID NO:19的信息
(i)序列特征
(A)长度:4074个核苷酸
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:DNA(基因组)
(ix)特征:
(A)名称/键:CDS
(B)位置:1..1386
(D)其它信息:/产品=“来自Btt的VIP2A(b)”
(ix)特征:
(A)名称/键:CDS
(B)位置:1394..3895
(D)其它信息:/产品=“来自Btt的VIP1A(b)”
(ix)特征:
(A)名称/键:misc_feature
(B)位置:1..4074
(D)其它信息:/备注=“来自Btt的含编码VIP1A(b)和VIP2A(b)的基因的克隆DNA序列。”
(xi)序列描述:SEQ ID NO:19:
ATG CAA AGA ATG GAG GGA AAG TTG TTT GTG GTG TCA AAA ACA TTA CAA 48
Met Gln Arg Met Glu Gly Lys Leu Phe Val Val Ser Lys Thr Leu Gln
670 675 680
GTA GTT ACT AGA ACT GTA TTG CTT AGT ACA GTT TAC TCT ATA ACT TTA 96
Val Val Thr Arg Thr Val Leu Leu Ser Thr Val Tyr Ser Ile Thr Leu
685 690 695
TTA AAT AAT GTA GTG ATA AAA GCT GAC CAA TTA AAT ATA AAT TCT CAA 144
Leu Asn Asn Val Val Ile Lys Ala Asp Gln Leu Asn Ile Asn Ser Gln
700 705 710 715
AGT AAA TAT ACT AAC TTG CAA AAT CTA AAA ATC CCT GAT AAT GCA GAG 192
Ser Lys Tyr Thr Asn Leu Gln Asn Leu Lys Ile Pro Asp Asn Ala Glu
720 725 730
GAT TTT AAA GAA GAT AAG GGG AAA GCG AAA GAA TGG GGG AAA GAG AAA 240
Asp Phe Lys Glu Asp Lys Gly Lys Ala Lys Glu Trp Gly Lys Glu Lys
735 740 745
GGG GAA GAG TGG AGG CCT CCT GCT ACT GAG AAA GGA GAA ATG AAT AAT 288
Gly Glu Glu Trp Arg Pro Pro Ala Thr Glu Lys Gly Glu Met Asn Asn
750 755 760
TTT TTA GAT AAT AAA AAT GAT ATA AAG ACC AAT TAT AAA GAA ATT ACT 336
Phe Leu Asp Asn Lys Asn Asp Ile Lys Thr Asn Tyr Lys Glu Ile Thr
765 770 775
TTT TCT ATG GCA GGT TCA TGT GAA GAT GAA ATA AAA GAT TTA GAA GAA 384
Phe Ser Met Ala Gly Ser Cys Glu Asp Glu Ile Lys Asp Leu Glu Glu
780 785 790 795
ATT GAT AAG ATC TTT GAT AAA GCC AAT CTC TCG AGT TCT ATT ATC ACC 432
Ile Asp Lys Ile Phe Asp Lys Ala Asn Leu Ser Ser Ser Ile Ile Thr
800 805 810
TAT AAA AAT GTG GAA CCA GCA ACA ATT GGA TTT AAT AAA TCT TTA ACA 480
Tyr Lys Asn Val Glu Pro Ala Thr Ile Gly Phe Asn Lys Ser Leu Thr
815 820 825
GAA GGT AAT ACG ATT AAT TCT GAT GCA ATG GCA CAG TTT AAA GAA CAA 528
Glu Gly Asn Thr Ile Asn Ser Asp Ala Met Ala Gln Phe Lys Glu Gln
830 835 840
TTT TTA GGT AAG GAT ATG AAG TTT GAT AGT TAT CTA GAT ACT CAT TTA 576
Phe Leu Gly Lys Asp Met Lys Phe Asp Ser Tyr Leu Asp Thr His Leu
845 850 855
ACT GCT CAA CAA GTT TCC AGT AAA AAA AGA GTT ATT TTG AAG GTT ACG 624
Thr Ala Gln Gln Val Ser Ser Lys Lys Arg Val Ile Leu Lys Val Thr
860 865 870 875
GTT CCG AGT GGG AAA GGT TCT ACT ACT CCA ACA AAA GCA GGT GTC ATT 672
Val Pro Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val Ile
880 885 890
TTA AAC AAT AAT GAA TAC AAA ATG CTC ATT GAT AAT GGG TAT GTG CTC 720
Leu Asn Asn Asn Glu Tyr Lys Met Leu Ile Asp Asn Gly Tyr Val Leu
895 900 905
CAT GTA GAT AAG GTA TCA AAA GTA GTA AAA AAA GGG ATG GAG TGC TTA 768
His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Met Glu Cys Leu
910 915 920
CAA GTT GAA GGG ACT TTA AAA AAG AGT CTC GAC TTT AAA AAT GAT ATA 816
Gln Val Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp Ile
925 930 935
AAT GCT GAA GCG CAT AGC TGG GGG ATG AAA ATT TAT GAA GAC TGG GCT 864
Asn Ala Glu Ala His Ser Trp Gly Met Lys Ile Tyr Glu Asp Trp Ala
940 945 950 955
AAA AAT TTA ACC GCT TCG CAA AGG GAA GCT TTA GAT GGG TAT GCT AGG 912
Lys Asn Leu Thr Ala Ser Gln Arg Glu Ala Leu Asp Gly Tyr Ala Arg
960 965 970
CAA GAT TAT AAA GAA ATC AAT AAT TAT TTG CGC AAT CAA GGC GGG AGT 960
Gln Asp Tyr Lys Glu Ile Asn Asn Tyr Leu Arg Asn Gln Gly Gly Ser
975 980 985
GGA AAT GAA AAG CTG GAT GCC CAA TTA AAA AAT ATT TCT GAT GCT TTA 1008
Gly Asn Glu Lys Leu Asp Ala Gln Leu Lys Asn Ile Ser Asp Ala Leu
990 995 1000
GGG AAG AAA CCC ATA CCA GAA AAT ATT ACC GTG TAT AGA TGG TGT GGC 1056
Gly Lys Lys Pro Ile Pro Glu Asn Ile Thr Val Tyr Arg Trp Cys Gly
1005 1010 1015
ATG CCG GAA TTT GGT TAT CAA ATT AGT GAT CCG TTA CCT TCT TTA AAA 1104
Met Pro Glu Phe Gly Tyr Gln Ile Ser Asp Pro Leu Pro Ser Leu Lys
1020 1025 1030 1035
GAT TTT GAA GAA CAA TTT TTA AAT ACA ATT AAA GAA GAC AAA GGG TAT 1152
Asp Phe Glu Glu Gln Phe Leu Asn Thr Ile Lys Glu Asp Lys Gly Tyr
1040 1045 1050
ATG AGT ACA AGC TTA TCG AGT GAA CGT CTT GCA GCT TTT GGA TCT AGA 1200
Met Ser Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Arg
1055 1060 1065
AAA ATT ATA TTA CGC TTA CAA GTT CCG AAA GGA AGT ACG GGG GCG TAT 1248
Lys Ile Ile Leu Arg Leu Gln Val Pro Lys Gly Ser Thr Gly Ala Tyr
1070 1075 1080
TTA AGT GCC ATT GGT GGA TTT GCA AGT GAA AAA GAG ATC CTA CTT GAT 1296
Leu Ser Ala Ile Gly Gly Phe Ala Ser Glu Lys Glu Ile Leu Leu Asp
1085 1090 1095
AAA GAT AGT AAA TAT CAT ATT GAT AAA GCA ACA GAG GTA ATC ATT AAA 1344
Lys Asp Ser Lys Tyr His Ile Asp Lys Ala Thr Glu Val Ile Ile Lys
1100 1105 1110 1115
GGT GTT AAG CGA TAT GTA GTG GAT GCA ACA TTA TTA ACA AAT 1386
Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn
1120 1125
TAAGGAG ATG AAA AAT ATG AAG AAA AAG TTA GCA AGT GTT GTA ACC TGT 1435
Met Lys Asn Met Lys Lys Lys Leu Ala Ser Val Val Thr Cys
1 5 10
ATG TTA TTA GCT CCT ATG TTT TTG AAT GGA AAT GTG AAT GCT GTT AAC 1483
Met Leu Leu Ala Pro Met Phe Leu Asn Gly Asn Val Asn Ala Val Asn
15 20 25 30
GCG GAT AGT AAA ATA AAT CAG ATT TCT ACA ACG CAG GAA AAC CAA CAG 1531
Ala Asp Ser Lys Ile Asn Gln Ile Ser Thr Thr Gln Glu Asn Gln Gln
35 40 45
AAA GAG ATG GAC CGA AAG GGA TTA TTG GGA TAT TAT TTC AAA GGA AAA 1579
Lys Glu Met Asp Arg Lys Gly Leu Leu Gly Tyr Tyr Phe Lys Gly Lys
50 55 60
GAT TTT AAT AAT CTT ACT ATG TTT GCA CCG ACA CGT GAT AAT ACC CTT 1627
Asp Phe Asn Asn Leu Thr Met Phe Ala Pro Thr Arg Asp Asn Thr Leu
65 70 75
ATG TAT GAC CAA CAA ACA GCG AAT GCA TTA TTA GAT AAA AAA CAA CAA 1675
Met Tyr Asp Gln Gln Thr Ala Asn Ala Leu Leu Asp Lys Lys Gln Gln
80 85 90
GAA TAT CAG TCC ATT CGT TGG ATT GGT TTG ATT CAG CGT AAA GAA ACG 1723
Glu Tyr Gln Ser Ile Arg Trp Ile Gly Leu Ile Gln Arg Lys Glu Thr
95 100 105 110
GGC GAT TTC ACA TTT AAC TTA TCA AAG GAT GAA CAG GCA ATT ATA GAA 1771
Gly Asp Phe Thr Phe Asn Leu Ser Lys Asp Glu Gln Ala Ile Ile Glu
115 120 125
ATC GAT GGG AAA ATC ATT TCT AAT AAA GGG AAA GAA AAG CAA GTT GTC 1819
Ile Asp Gly Lys Ile Ile Ser Asn Lys Gly Lys Glu Lys Gln Val Val
130 135 140
CAT TTA GAA AAA GAA AAA TTA GTT CCA ATC AAA ATA GAG TAT CAA TCA 1867
His Leu Glu Lys Glu Lys Leu Val Pro Ile Lys Ile Glu Tyr Gln Ser
145 150 155
GAT ACG AAA TTT AAT ATT GAT AGT AAA ACA TTT AAA GAA CTT AAA TTA 1915
Asp Thr Lys Phe Asn Ile Asp Ser Lys Thr Phe Lys Glu Leu Lys Leu
160 165 170
TTT AAA ATA GAT AGT CAA AAC CAA TCT CAA CAA GTT CAA CTG AGA AAC 1963
Phe Lys Ile Asp Ser Gln Asn Gln Ser Gln Gln Val Gln Leu Arg Asn
175 180 185 190
CCT GAA TTT AAC AAA AAA GAA TCA CAG GAA TTT TTA GCA AAA GCA TCA 2011
Pro Glu Phe Asn Lys Lys Glu Ser Gln Glu Phe Leu Ala Lys Ala Ser
195 200 205
AAA ACA AAC CTT TTT AAG CAA AAA ATG AAA AGA GAT ATT GAT GAA GAT 2059
Lys Thr Asn Leu Phe Lys Gln Lys Met Lys Arg Asp Ile Asp Glu Asp
210 215 220
ACG GAT ACA GAT GGA GAC TCC ATT CCT GAT CTT TGG GAA GAA AAT GGG 2107
Thr Asp Thr Asp Gly Asp Ser Ile Pro Asp Leu Trp Glu Glu Asn Gly
225 230 235
TAC ACG ATT CAA AAT AAA GTT GCT GTC AAA TGG GAT GAT TCG CTA GCA 2155
Tyr Thr Ile Gln Asn Lys Val Ala Val Lys Trp Asp Asp Ser Leu Ala
240 245 250
AGT AAG GGA TAT ACA AAA TTT GTT TCG AAT CCA TTA GAC AGC CAC ACA 2203
Ser Lys Gly Tyr Thr Lys Phe Val Ser Asn Pro Leu Asp Ser His Thr
255 260 265 270
GTT GGC GAT CCC TAT ACT GAT TAT GAA AAG GCC GCA AGG GAT TTA GAT 2251
Val Gly Asp Pro Tyr Thr Asp Tyr Glu Lys Ala Ala Arg Asp Leu Asp
275 280 285
TTA TCA AAT GCA AAG GAA ACG TTC AAC CCA TTG GTA GCT GCT TTT CCA 2299
Leu Ser Asn Ala Lys Glu Thr Phe Asn Pro Leu Val Ala Ala Phe Pro
290 295 300
AGT GTG AAT GTT AGT ATG GAA AAG GTG ATA TTA TCA CCA AAT GAA AAT 2347
Ser Val Asn Val Ser Met Glu Lys Val Ile Leu Ser Pro Asn Glu Asn
305 310 315
TTA TCC AAT AGT GTA GAG TCT CAT TCA TCC ACG AAT TGG TCT TAT ACG 2395
Leu Ser Asn Ser Val Glu Ser His Ser Ser Thr Asn Trp Ser Tyr Thr
320 325 330
AAT ACA GAA GGA GCT TCC ATT GAA GCT GGT GGC GGT CCA TTA GGC CTT 2443
Asn Thr Glu Gly Ala Ser Ile Glu Ala Gly Gly Gly Pro Leu Gly Leu
335 340 345 350
TCT TTT GGC GTG AGT GTT ACT TAT CAA CAC TCT GAA ACA GTT GCA CAA 2491
Ser Phe Gly Val Ser Val Thr Tyr Gln His Ser Glu Thr Val Ala Gln
355 360 365
GAA TGG GGA ACA TCT ACA GGA AAT ACT TCA CAA TTC AAT ACG GCT TCA 2539
Glu Trp Gly Thr Ser Thr Gly Asn Thr Ser Gln Phe Asn Thr Ala Ser
370 375 380
GCG GGA TAT TTA AAT GCA AAT GTT CGG TAT AAC AAT GTA GGG ACT GGT 2587
Ala Gly Tyr Leu Asn Ala Asn Val Arg Tyr Asn Asn Val Gly Thr Gly
385 390 395
GCC ATC TAT GAT GTA AAA CCT ACA ACA AGT TTT GTA TTA AAT AAC AAT 2635
Ala Ile Tyr Asp Val Lys Pro Thr Thr Ser Phe Val Leu Asn Asn Asn
400 405 410
ACC ATC GCA ACG ATT ACA GCA AAA TCA AAT TCA ACA GCT TTA CGT ATA 2683
Thr Ile Ala Thr Ile Thr Ala Lys Ser Asn Ser Thr Ala Leu Arg Ile
415 420 425 430
TCT CCG GGG GAT AGT TAT CCA GAA ATA GGA GAA AAC GCT ATT GCG ATT 2731
Ser Pro Gly Asp Ser Tyr Pro Glu Ile Gly Glu Asn Ala Ile Ala Ile
435 440 445
ACA TCT ATG GAT GAT TTT AAT TCT CAT CCA ATT ACA TTA AAT AAA CAA 2779
Thr Ser Met Asp Asp Phe Asn Ser His Pro Ile Thr Leu Asn Lys Gln
450 455 460
CAG GTA AAT CAA TTG ATA AAT AAT AAG CCA ATT ATG CTA GAG ACA GAC 2827
Gln Val Asn Gln Leu Ile Asn Asn Lys Pro Ile Met Leu Glu Thr Asp
465 470 475
CAA ACA GAT GGT GTT TAT AAA ATA AGA GAT ACA CAT GGA AAT ATT GTA 2875
Gln Thr Asp Gly Val Tyr Lys Ile Arg Asp Thr His Gly Asn Ile Val
480 485 490
ACT GGT GGA GAA TGG AAT GGT GTA ACA CAA CAA ATT AAA GCA AAA ACA 2923
Thr Gly Gly Glu Trp Asn Gly Val Thr Gln Gln Ile Lys Ala Lys Thr
495 500 505 510
GCG TCT ATT ATT GTG GAT GAC GGG AAA CAG GTA GCA GAA AAA CGT GTG 2971
Ala Ser Ile Ile Val Asp Asp Gly Lys Gln Val Ala Glu Lys Arg Val
515 520 525
GCG GCA AAA GAT TAT GGT CAT CCA GAA GAT AAA ACA CCA CCT TTA ACT 3019
Ala Ala Lys Asp Tyr Gly His Pro Glu Asp Lys Thr Pro Pro Leu Thr
530 535 540
TTA AAA GAT ACC CTG AAG CTT TCA TAC CCA GAT GAA ATA AAA GAA ACT 3067
Leu Lys Asp Thr Leu Lys Leu Ser Tyr Pro Asp Glu Ile Lys Glu Thr
545 550 555
AAT GGA TTG TTG TAC TAT GAT GAC AAA CCA ATC TAT GAA TCG AGT GTC 3115
Asn Gly Leu Leu Tyr Tyr Asp Asp Lys Pro Ile Tyr Glu Ser Ser Val
560 565 570
ATG ACT TAT CTG GAT GAA AAT ACG GCA AAA GAA GTC AAA AAA CAA ATA 3163
Met Thr Tyr Leu Asp Glu Asn Thr Ala Lys Glu Val Lys Lys Gln Ile
575 580 585 590
AAT GAT ACA ACC GGA AAA TTT AAG GAT GTA AAT CAC TTA TAT GAT GTA 3211
Asn Asp Thr Thr Gly Lys Phe Lys Asp Val Asn His Leu Tyr Asp Val
595 600 605
AAA CTG ACT CCA AAA ATG AAT TTT ACG ATT AAA ATG GCT TCC TTG TAT 3259
Lys Leu Thr Pro Lys Met Asn Phe Thr Ile Lys Met Ala Ser Leu Tyr
10 615 620
GAT GGG GCT GAA AAT AAT CAT AAC TCT TTA GGA ACC TGG TAT TTA ACA 3307
Asp Gly Ala Glu Asn Asn His Asn Ser Leu Gly Thr Trp Tyr Leu Thr
625 630 635
TAT AAT GTT GCT GGT GGA AAT ACT GGG AAG AGA CAA TAT CGT TCA GCT 3355
Tyr Asn Val Ala Gly Gly Asn Thr Gly Lys Arg Gln Tyr Arg Ser Ala
640 645 650
CAT TCT TGT GCA CAT GTA GCT CTA TCT TCA GAA GCG AAA AAG AAA CTA 3403
His Ser Cys Ala His Val Ala Leu Ser Ser Glu Ala Lys Lys Lys Leu
655 660 665 670
AAT CAA AAT GCG AAT TAC TAT CTT AGC ATG TAT ATG AAG GCT GAT TCT 3451
Asn Gln Asn Ala Asn Tyr Tyr Leu Ser Met Tyr Met Lys Ala Asp Ser
675 680 685
ACT ACG GAA CCT ACA ATA GAA GTA GCT GGG GAA AAA TCT GCA ATA ACA 3499
Thr Thr Glu Pro Thr Ile Glu Val Ala Gly Glu Lys Ser Ala Ile Thr
690 695 700
AGT AAA AAA GTA AAA TTA AAT AAT CAA AAT TAT CAA AGA GTT GAT ATT 3547
Ser Lys Lys Val Lys Leu Asn Asn Gln Asn Tyr Gln Arg Val Asp Ile
705 710 715
TTA GTG AAA AAT TCT GAA AGA AAT CCA ATG GAT AAA ATA TAT ATA AGA 3595
Leu Val Lys Asn Ser Glu Arg Asn Pro Met Asp Lys Ile Tyr Ile Arg
720 725 730
GGA AAT GGC ACG ACA AAT GTT TAT GGG GAT GAT GTT ACT ATC CCA GAG 3643
Gly Asn Gly Thr Thr Asn Val Tyr Gly Asp Asp Val Thr Ile Pro Glu
735 740 745 750
GTA TCA GCT ATA AAT CCG GCT AGT CTA TCA GAT GAA GAA ATT CAA GAA 3691
Val Ser Ala Ile Asn Pro Ala Ser Leu Ser Asp Glu Glu Ile Gln Glu
755 760 765
ATA TTT AAA GAC TCA ACT ATT GAA TAT GGA AAT CCT AGT TTC GTT GCT 3739
Ile Phe Lys Asp Ser Thr Ile Glu Tyr Gly Asn Pro Ser Phe Val Ala
770 775 780
GAT GCC GTA ACA TTT AAA AAT ATA AAA CCT TTA CAA AAT TAT GTA AAG 3787
Asp Ala Val Thr Phe Lys Asn Ile Lys Pro Leu Gln Asn Tyr Val Lys
785 790 795
GAA TAT GAA ATA TAT CAT AAA TCT CAT CGA TAT GAA AAG AAA ACG GTC 3835
Glu Tyr Glu Ile Tyr His Lys Ser His Arg Tyr Glu Lys Lys Thr Val
800 805 810
TTT GAT ATC ATG GGT GTT CAT TAT GAG TAT AGT ATA GCT AGG GAA CAA 3883
Phe Asp Ile Met Gly Val His Tyr Glu Tyr Ser Ile Ala Arg Glu Gln
815 820 825 830
AAG AAA GCC GCA TAATTTTAAA AATAAAACTC GTTAGAGTTT ATTTAGCATG 3935
Lys Lys Ala Ala
GTATTTTTAA GAATAATCAA TATCTTGAAC CGTTTGTAGC TGTTTTGGAA GGGAATTTCA 3995
TTTTATTTGG TCTCTTAAGT TGATGGGCAT GGGATATGTT CAGCATCCAA GCGTTTNGGG 4055
GGTTANAAAA TCCAATTTT 4074
(2)关于SEQ ID NO:20的信息
(i)序列特征
(A)长度:462个氨基酸
(B)类型:氨基酸
(D)拓扑结构:线型
(ii)分子类型:蛋白质
(xi)序列描述:SEQ ID NO:20:
Met Gln Arg Met Glu Gly Lys Leu Phe Val Val Ser Lys Thr Leu Gln
1 5 10 15
Val Val Thr Arg Thr Val Leu Leu Ser Thr Val Tyr Ser Ile Thr Leu
20 25 30
Leu Asn Asn Val Val Ile Lys Ala Asp Gln Leu Asn Ile Asn Ser Gln
35 40 45
Ser Lys Tyr Thr Asn Leu Gln Asn Leu Lys Ile Pro Asp Asn Ala Glu
50 55 60
Asp Phe Lys Glu Asp Lys Gly Lys Ala Lys Glu Trp Gly Lys Glu Lys
65 70 75 80
Gly Glu Glu Trp Arg Pro Pro Ala Thr Glu Lys Gly Glu Met Asn Asn
85 90 95
Phe Leu Asp Asn Lys Asn Asp Ile Lys Thr Asn Tyr Lys Glu Ile Thr
100 105 110
Phe Ser Met Ala Gly Ser Cys Glu Asp Glu Ile Lys Asp Leu Glu Glu
115 120 125
Ile Asp Lys Ile Phe Asp Lys Ala Asn Leu Ser Ser Ser Ile Ile Thr
130 135 140
Tyr Lys Asn Val Glu Pro Ala Thr Ile Gly Phe Asn Lys Ser Leu Thr
145 150 155 160
Glu Gly Asn Thr Ile Asn Ser Asp Ala Met Ala Gln Phe Lys Glu Gln
165 170 175
Phe Leu Gly Lys Asp Met Lys Phe Asp Ser Tyr Leu Asp Thr His Leu
180 185 190
Thr Ala Gln Gln Val Ser Ser Lys Lys Arg Val Ile Leu Lys Val Thr
195 200 205
Val Pro Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val Ile
210 215 220
Leu Asn Asn Asn Glu Tyr Lys Met Leu Ile Asp Asn Gly Tyr Val Leu
225 230 235 240
His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Met Glu Cys Leu
245 250 255
Gln Val Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp Ile
260 265 270
Asn Ala Glu Ala His Ser Trp Gly Met Lys Ile Tyr Glu Asp Trp Ala
275 280 285
Lys Asn Leu Thr Ala Ser Gln Arg Glu Ala Leu Asp Gly Tyr Ala Arg
290 295 300
Gln Asp Tyr Lys Glu Ile Asn Asn Tyr Leu Arg Asn Gln Gly Gly Ser
305 310 315 320
Gly Asn Glu Lys Leu Asp Ala Gln Leu Lys Asn Ile Ser Asp Ala Leu
325 330 335
Gly Lys Lys Pro Ile Pro Glu Asn Ile Thr Val Tyr Arg Trp Cys Gly
340 345 350
Met Pro Glu Phe Gly Tyr Gln Ile Ser Asp Pro Leu Pro Ser Leu Lys
355 360 365
Asp Phe Glu Glu Gln Phe Leu Asn Thr Ile Lys Glu Asp Lys Gly Tyr
370 375 380
Met Ser Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Arg
385 390 395 400
Lys Ile Ile Leu Arg Leu Gln Val Pro Lys Gly Ser Thr Gly Ala Tyr
405 410 415
Leu Ser Ala Ile Gly Gly Phe Ala Ser Glu Lys Glu Ile Leu Leu Asp
420 425 430
Lys Asp Ser Lys Tyr His Ile Asp Lys Ala Thr Glu Val Ile Ile Lys
435 440 445
Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn
450 455 460
(2)关于SEQ ID NO:21的信息
(i)序列特征
(A)长度:834个氨基酸
(B)类型:氨基酸
(D)拓扑结构:线型
(ii)分子类型:蛋白质
(xi)序列描述:SEQ ID NO:21:
Met Lys Asn Met Lys Lys Lys Leu Ala Ser Val Val Thr Cys Met Leu
1 5 10 15
Leu Ala Pro Met Phe Leu Asn Gly Asn Val Asn Ala Val Asn Ala Asp
20 25 30
Ser Lys Ile Asn Gln Ile Ser Thr Thr Gln Glu Asn Gln Gln Lys Glu
35 40 45
Met Asp Arg Lys Gly Leu Leu Gly Tyr Tyr Phe Lys Gly Lys Asp Phe
50 55 60
Asn Asn Leu Thr Met Phe Ala Pro Thr Arg Asp Asn Thr Leu Met Tyr
65 70 75 80
Asp Gln Gln Thr Ala Asn Ala Leu Leu Asp Lys Lys Gln Gln Glu Tyr
85 90 95
Gln Ser Ile Arg Trp Ile Gly Leu Ile Gln Arg Lys Glu Thr Gly Asp
100 105 110
Phe Thr Phe Asn Leu Ser Lys Asp Glu Gln Ala Ile Ile Glu Ile Asp
115 120 125
Gly Lys Ile Ile Ser Asn Lys Gly Lys Glu Lys Gln Val Val His Leu
130 135 140
Glu Lys Glu Lys Leu Val Pro Ile Lys Ile Glu Tyr Gln Ser Asp Thr
145 150 155 160
Lys Phe Asn Ile Asp Ser Lys Thr Phe Lys Glu Leu Lys Leu Phe Lys
165 170 175
Ile Asp Ser Gln Asn Gln Ser Gln Gln Val Gln Leu Arg Asn Pro Glu
180 185 190
Phe Asn Lys Lys Glu Ser Gln Glu Phe Leu Ala Lys Ala Ser Lys Thr
195 200 205
Asn Leu Phe Lys Gln Lys Met Lys Arg Asp Ile Asp Glu Asp Thr Asp
210 215 220
Thr Asp Gly Asp Ser Ile Pro Asp Leu Trp Glu Glu Asn Gly Tyr Thr
225 230 235 240
Ile Gln Asn Lys Val Ala Val Lys Trp Asp Asp Ser Leu Ala Ser Lys
245 250 255
Gly Tyr Thr Lys Phe Val Ser Asn Pro Leu Asp Ser His Thr Val Gly
260 265 270
Asp Pro Tyr Thr Asp Tyr Glu Lys Ala Ala Arg Asp Leu Asp Leu Ser
275 280 285
Asn Ala Lys Glu Thr Phe Asn Pro Leu Val Ala Ala Phe Pro Ser Val
290 295 300
Asn Val Ser Met Glu Lys Val Ile Leu Ser Pro Asn Glu Asn Leu Ser
305 310 315 320
Asn Ser Val Glu Ser His Ser Ser Thr Asn Trp Ser Tyr Thr Asn Thr
325 330 335
Glu Gly Ala Ser Ile Glu Ala Gly Gly Gly Pro Leu Gly Leu Ser Phe
340 345 350
Gly Val Ser Val Thr Tyr Gln His Ser Glu Thr Val Ala Gln Glu Trp
355 360 365
Gly Thr Ser Thr Gly Asn Thr Ser Gln Phe Asn Thr Ala Ser Ala Gly
370 375 380
Tyr Leu Asn Ala Asn Val Arg Tyr Asn Asn Val Gly Thr Gly Ala Ile
385 390 395 400
Tyr Asp Val Lys Pro Thr Thr Ser Phe Val Leu Asn Asn Asn Thr Ile
405 410 415
Ala Thr Ile Thr Ala Lys Ser Asn Ser Thr Ala Leu Arg Ile Ser Pro
420 425 430
Gly Asp Ser Tyr Pro Glu Ile Gly Glu Asn Ala Ile Ala Ile Thr Ser
435 440 445
Met Asp Asp Phe Asn Ser His Pro Ile Thr Leu Asn Lys Gln Gln Val
450 455 460
Asn Gln Leu Ile Asn Asn Lys Pro Ile Met Leu Glu Thr Asp Gln Thr
465 470 475 480
Asp Gly Val Tyr Lys Ile Arg Asp Thr His Gly Asn Ile Val Thr Gly
485 490 495
Gly Glu Trp Asn Gly Val Thr Gln Gln Ile Lys Ala Lys Thr Ala Ser
500 505 510
Ile Ile Val Asp Asp Gly Lys Gln Val Ala Glu Lys Arg Val Ala Ala
515 520 525
Lys Asp Tyr Gly His Pro Glu Asp Lys Thr Pro Pro Leu Thr Leu Lys
530 535 540
Asp Thr Leu Lys Leu Ser Tyr Pro Asp Glu Ile Lys Glu Thr Asn Gly
545 550 555 560
Leu Leu Tyr Tyr Asp Asp Lys Pro Ile Tyr Glu Ser Ser Val Met Thr
565 570 575
Tyr Leu Asp Glu Asn Thr Ala Lys Glu Val Lys Lys Gln Ile Asn Asp
580 585 590
Thr Thr Gly Lys Phe Lys Asp Val Asn His Leu Tyr Asp Val Lys Leu
595 600 605
Thr Pro Lys Met Asn Phe Thr Ile Lys Met Ala Ser Leu Tyr Asp Gly
610 615 620
Ala Glu Asn Asn His Asn Ser Leu Gly Thr Trp Tyr Leu Thr Tyr Asn
625 630 635 640
Val Ala Gly Gly Asn Thr Gly Lys Arg Gln Tyr Arg Ser Ala His Ser
645 650 655
Cys Ala His Val Ala Leu Ser Ser Glu Ala Lys Lys Lys Leu Asn Gln
660 665 670
Asn Ala Asn Tyr Tyr Leu Ser Met Tyr Met Lys Ala Asp Ser Thr Thr
675 680 685
Glu Pro Thr Ile Glu Val Ala Gly Glu Lys Ser Ala Ile Thr Ser Lys
690 695 700
Lys Val Lys Leu Asn Asn Gln Asn Tyr Gln Arg Val Asp Ile Leu Val
705 710 715 720
Lys Asn Ser Glu Arg Asn Pro Met Asp Lys Ile Tyr Ile Arg Gly Asn
725 730 735
Gly Thr Thr Asn Val Tyr Gly Asp Asp Val Thr Ile Pro Glu Val Ser
740 745 750
Ala Ile Asn Pro Ala Ser Leu Ser Asp Glu Glu Ile Gln Glu Ile Phe
755 760 765
Lys Asp Ser Thr Ile Glu Tyr Gly Asn Pro Ser Phe Val Ala Asp Ala
770 775 780
Val Thr Phe Lys Asn Ile Lys Pro Leu Gln Asn Tyr Val Lys Glu Tyr
785 790 795 800
Glu Ile Tyr His Lys Ser His Arg Tyr Glu Lys Lys Thr Val Phe Asp
805 810 815
Ile Met Gly Val His Tyr Glu Tyr Ser Ile Ala Arg Glu Gln Lys Lys
820 825 830
Ala Ala
(2)关于SEQ ID NO:22的信息
(i)序列特征
(A)长度:4041个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:DNA(基因组)
(ix)特征:
(A)名称/键:CDS
(B)位置:1..4038
(D)其它信息:/产品=“VIP1A(a)和VIP2A(a)融合产物”
(xi)序列描述:SEQ ID NO:22:
ATG AAA AGA ATG GAG GGA AAG TTG TTT ATG GTG TCA AAA AAA TTA CAA 48
Met Lys Arg Met Glu Gly Lys Leu Phe Met Val Ser Lys Lys Leu Gln
835 840 845 850
GTA GTT ACT AAA ACT GTA TTG CTT AGT ACA GTT TTC TCT ATA TCT TTA 96
Val Val Thr Lys Thr Val Leu Leu Ser Thr Val Phe Ser Ile Ser Leu
855 860 865
TTA AAT AAT GAA GTG ATA AAA GCT GAA CAA TTA AAT ATA AAT TCT CAA 144
Leu Asn Asn Glu Val Ile Lys Ala Glu Gln Leu Asn Ile Asn Ser Gln
870 875 880
AGT AAA TAT ACT AAC TTG CAA AAT CTA AAA ATC ACT GAC AAG GTA GAG 192
Ser Lys Tyr Thr Asn Leu Gln Asn Leu Lys Ile Thr Asp Lys Val Glu
885 890 895
GAT TTT AAA GAA GAT AAG GAA AAA GCG AAA GAA TGG GGG AAA GAA AAA 240
Asp Phe Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys Glu Lys
900 905 910
GAA AAA GAG TGG AAA CTA ACT GCT ACT GAA AAA GGA AAA ATG AAT AAT 288
Glu Lys Glu Trp Lys Leu Thr Ala Thr Glu Lys Gly Lys Met Asn Asn
915 920 925 930
TTT TTA GAT AAT AAA AAT GAT ATA AAG ACA AAT TAT AAA GAA ATT ACT 336
Phe Leu Asp Asn Lys Asn Asp Ile Lys Thr Asn Tyr Lys Glu Ile Thr
935 940 945
TTT TCT ATG GCA GGC TCA TTT GAA GAT GAA ATA AAA GAT TTA AAA GAA 384
Phe Ser Met Ala Gly Ser Phe Glu Asp Glu Ile Lys Asp Leu Lys Glu
950 955 960
ATT GAT AAG ATG TTT GAT AAA ACC AAT CTA TCA AAT TCT ATT ATC ACC 432
Ile Asp Lys Met Phe Asp Lys Thr Asn Leu Ser Asn Ser Ile Ile Thr
965 970 975
TAT AAA AAT GTG GAA CCG ACA ACA ATT GGA TTT AAT AAA TCT TTA ACA 480
Tyr Lys Asn Val Glu Pro Thr Thr Ile Gly Phe Asn Lys Ser Leu Thr
980 985 990
GAA GGT AAT ACG ATT AAT TCT GAT GCA ATG GCA CAG TTT AAA GAA CAA 528
Glu Gly Asn Thr Ile Asn Ser Asp Ala Met Ala Gln Phe Lys Glu Gln
995 1000 1005 1010
TTT TTA GAT AGG GAT ATT AAG TTT GAT AGT TAT CTA GAT ACG CAT TTA 576
Phe Leu Asp Arg Asp Ile Lys Phe Asp Ser Tyr Leu Asp Thr His Leu
1015 1020 1025
ACT GCT CAA CAA GTT TCC AGT AAA GAA AGA GTT ATT TTG AAG GTT ACG 624
Thr Ala Gln Gln Val Ser Ser Lys Glu Arg Val Ile Leu Lys Val Thr
1030 1035 1040
GTT CCG AGT GGG AAA GGT TCT ACT ACT CCA ACA AAA GCA GGT GTC ATT 672
Val Pro Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val Ile
1045 1050 1055
TTA AAT AAT AGT GAA TAC AAA ATG CTC ATT GAT AAT GGG TAT ATG GTC 720
Leu Asn Asn Ser Glu Tyr Lys Met Leu Ile Asp Asn Gly Tyr Met Val
1060 1065 1070
CAT GTA GAT AAG GTA TCA AAA GTG GTG AAA AAA GGG GTG GAG TGC TTA 768
His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Val Glu Cys Leu
1075 1080 1085 1090
CAA ATT GAA GGG ACT TTA AAA AAG AGT CTT GAC TTT AAA AAT GAT ATA 816
Gln Ile Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp Ile
1095 1100 1105
AAT GCT GAA GCG CAT AGC TGG GGT ATG AAG AAT TAT GAA GAG TGG GCT 864
Asn Ala Glu Ala His Ser Trp Gly Met Lys Asn Tyr Glu Glu Trp Ala
1110 1115 1120
AAA GAT TTA ACC GAT TCG CAA AGG GAA GCT TTA GAT GGG TAT GCT AGG 912
Lys Asp Leu Thr Asp Ser Gln Arg Glu Ala Leu Asp Gly Tyr Ala Arg
1125 1130 1135
CAA GAT TAT AAA GAA ATC AAT AAT TAT TTA AGA AAT CAA GGC GGA AGT 960
Gln Asp Tyr Lys Glu Ile Asn Asn Tyr Leu Arg Asn Gln Gly Gly Ser
1140 1145 1150
GGA AAT GAA AAA CTA GAT GCT CAA ATA AAA AAT ATT TCT GAT GCT TTA 1008
Gly Asn Glu Lys Leu Asp Ala Gln Ile Lys Asn Ile Ser Asp Ala Leu
1155 1160 1165 1170
GGG AAG AAA CCA ATA CCG GAA AAT ATT ACT GTG TAT AGA TGG TGT GGC 1056
Gly Lys Lys Pro Ile Pro Glu Asn Ile Thr Val Tyr Arg Trp Cys Gly
1175 1180 1185
ATG CCG GAA TTT GGT TAT CAA ATT AGT GAT CCG TTA CCT TCT TTA AAA 1104
Met Pro Glu Phe Gly Tyr Gln Ile Ser Asp Pro Leu Pro Ser Leu Lys
1190 1195 1200
GAT TTT GAA GAA CAA TTT TTA AAT ACA ATC AAA GAA GAC AAA GGA TAT 1152
Asp Phe Glu Glu Gln Phe Leu Asn Thr Ile Lys Glu Asp Lys Gly Tyr
1205 1210 1215
ATG AGT ACA AGC TTA TCG AGT GAA CGT CTT GCA GCT TTT GGA TCT AGA 1200
Met Ser Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Arg
1220 1225 1230
AAA ATT ATA TTA CGA TTA CAA GTT CCG AAA GGA AGT ACG GGT GCG TAT 1248
Lys Ile Ile Leu Arg Leu Gln Val Pro Lys Gly Ser Thr Gly Ala Tyr
1235 1240 1245 1250
TTA AGT GCC ATT GGT GGA TTT GCA AGT GAA AAA GAG ATC CTA CTT GAT 1296
Leu Ser Ala Ile Gly Gly Phe Ala Ser Glu Lys Glu Ile Leu Leu Asp
1255 1260 1265
AAA GAT AGT AAA TAT CAT ATT GAT AAA GTA ACA GAG GTA ATT ATT AAA 1344
Lys Asp Ser Lys Tyr His Ile Asp Lys Val Thr Glu Val Ile Ile Lys
1270 1275 1280
GGT GTT AAG CGA TAT GTA GTG GAT GCA ACA TTA TTA ACA AAT ATG AAA 1392
Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn Met Lys
1285 1290 1295
AAT ATG AAG AAA AAG TTA GCA AGT GTT GTA ACG TGT ACG TTA TTA GCT 1440
Asn Met Lys Lys Lys Leu Ala Ser Val Val Thr Cys Thr Leu Leu Ala
1300 1305 1310
CCT ATG TTT TTG AAT GGA AAT GTG AAT GCT GTT TAC GCA GAC AGC AAA 1488
Pro Met Phe Leu Asn Gly Asn Val Asn Ala Val Tyr Ala Asp Ser Lys
1315 1320 1325 1330
ACA AAT CAA ATT TCT ACA ACA CAG AAA AAT CAA CAG AAA GAG ATG GAC 1536
Thr Asn Gln Ile Ser Thr Thr Gln Lys Asn Gln Gln Lys Glu Met Asp
1335 1340 1345
CGA AAA GGA TTA CTT GGG TAT TAT TTC AAA GGA AAA GAT TTT AGT AAT 1584
Arg Lys Gly Leu Leu Gly Tyr Tyr Phe Lys Gly Lys Asp Phe Ser Asn
1350 1355 1360
CTT ACT ATG TTT GCA CCG ACA CGT GAT AGT ACT CTT ATT TAT GAT CAA 1632
Leu Thr Met Phe Ala Pro Thr Arg Asp Ser Thr Leu Ile Tyr Asp Gln
1365 1370 1375
CAA ACA GCA AAT AAA CTA TTA GAT AAA AAA CAA CAA GAA TAT CAG TCT 1680
Gln Thr Ala Asn Lys Leu Leu Asp Lys Lys Gln Gln Glu Tyr Gln Ser
1380 1385 1390
ATT CGT TGG ATT GGT TTG ATT CAG AGT AAA GAA ACG GGA GAT TTC ACA 1728
Ile Arg Trp Ile Gly Leu Ile Gln Ser Lys Glu Thr Gly Asp Phe Thr
1395 1400 1405 1410
TTT AAC TTA TCT GAG GAT GAA CAG GCA ATT ATA GAA ATC AAT GGG AAA 1776
Phe Asn Leu Ser Glu Asp Glu Gln Ala Ile Ile Glu Ile Asn Gly Lys
1415 1420 1425
ATT ATT TCT AAT AAA GGG AAA GAA AAG CAA GTT GTC CAT TTA GAA AAA 1824
Ile Ile Ser Asn Lys Gly Lys Glu Lys Gln Val Val His Leu Glu Lys
1430 1435 1440
GGA AAA TTA GTT CCA ATC AAA ATA GAG TAT CAA TCA GAT ACA AAA TTT 1872
Gly Lys Leu Val Pro Ile Lys Ile Glu Tyr Gln Ser Asp Thr Lys Phe
1445 1450 1455
AAT ATT GAC AGT AAA ACA TTT AAA GAA CTT AAA TTA TTT AAA ATA GAT 1920
Asn Ile Asp Ser Lys Thr Phe Lys Glu Leu Lys Leu Phe Lys Ile Asp
1460 1465 1470
AGT CAA AAC CAA CCC CAG CAA GTC CAG CAA GAT GAA CTG AGA AAT CCT 1968
Ser Gln Asn Gln Pro Gln Gln Val Gln Gln Asp Glu Leu Arg Asn Pro
1475 1480 1485 1490
GAA TTT AAC AAG AAA GAA TCA CAG GAA TTC TTA GCG AAA CCA TCG AAA 2016
Glu Phe Asn Lys Lys Glu Ser Gln Glu Phe Leu Ala Lys Pro Ser Lys
1495 1500 1505
ATA AAT CTT TTC ACT CAA AAA ATG AAA AGG GAA ATT GAT GAA GAC ACG 2064
Ile Asn Leu Phe Thr Gln Lys Met Lys Arg Glu Ile Asp Glu Asp Thr
1510 1515 1520
GAT ACG GAT GGG GAC TCT ATT CCT GAC CTT TGG GAA GAA AAT GGG TAT 2112
Asp Thr Asp Gly Asp Ser Ile Pro Asp Leu Trp Glu Glu Asn Gly Tyr
1525 1530 1535
ACG ATT CAA AAT AGA ATC GCT GTA AAG TGG GAC GAT TCT CTA GCA AGT 2160
Thr Ile Gln Asn Arg Ile Ala Val Lys Trp Asp Asp Ser Leu Ala Ser
1540 1545 1550
AAA GGG TAT ACG AAA TTT GTT TCA AAT CCA CTA GAA AGT CAC ACA GTT 2208
Lys Gly Tyr Thr Lys Phe Val Ser Asn Pro Leu Glu Ser His Thr Val
1555 1560 1565 1570
GGT GAT CCT TAT ACA GAT TAT GAA AAG GCA GCA AGA GAT CTA GAT TTG 2256
Gly Asp Pro Tyr Thr Asp Tyr Glu Lys Ala Ala Arg Asp Leu Asp Leu
1575 1580 1585
TCA AAT GCA AAG GAA ACG TTT AAC CCA TTG GTA GCT GCT TTT CCA AGT 2304
Ser Asn Ala Lys Glu Thr Phe Asn Pro Leu Val Ala Ala Phe Pro Ser
1590 1595 1600
GTG AAT GTT AGT ATG GAA AAG GTG ATA TTA TCA CCA AAT GAA AAT TTA 2352
Val Asn Val Ser Met Glu Lys Val Ile Leu Ser Pro Asn Glu Asn Leu
1605 1610 1615
TCC AAT AGT GTA GAG TCT CAT TCA TCC ACG AAT TGG TCT TAT ACA AAT 2400
Ser Asn Ser Val Glu Ser His Ser Ser Thr Asn Trp Ser Tyr Thr Asn
1620 1625 1630
ACA GAA GGT GCT TCT GTT GAA GCG GGG ATT GGA CCA AAA GGT ATT TCG 2448
Thr Glu Gly Ala Ser Val Glu Ala Gly Ile Gly Pro Lys Gly Ile Ser
1635 1640 1645 1650
TTC GGA GTT AGC GTA AAC TAT CAA CAC TCT GAA ACA GTT GCA CaA GAA 2496
Phe Gly Val Ser Val Asn Tyr Gln His Ser Glu Thr Val Ala Gln Glu
1655 1660 1665
TGG GGA ACA TCT ACA GGA AAT ACT TCG CAA TTC AAT ACG GCT TCA GCG 2544
Trp Gly Thr Ser Thr Gly Asn Thr Ser Gln Phe Asn Thr Ala Ser Ala
1670 1675 1680
GGA TAT TTA AAT GCA AAT GTT CGA TAT AAC AAT GTA GGA ACT GGT GCC 2592
Gly Tyr Leu Asn Ala Asn Val Arg Tyr Asn Asn Val Gly Thr Gly Ala
1685 1690 1695
ATC TAC GAT GTA AAA CCT ACA ACA AGT TTT GTA TTA AAT AAC GAT ACT 2640
Ile Tyr Asp Val Lys Pro Thr Thr Ser Phe Val Leu Asn Asn Asp Thr
1700 1705 1710
ATC GCA ACT ATT ACG GCG AAA TCT AAT TCT ACA GCC TTA AAT ATA TCT 2688
Ile Ala Thr Ile Thr Ala Lys Ser Asn Ser Thr Ala Leu Asn Ile Ser
1715 1720 1725 1730
CCT GGA GAA AGT TAC CCG AAA AAA GGA CAA AAT GGA ATC GCA ATA ACA 2736
Pro Gly Glu Ser Tyr Pro Lys Lys Gly Gln Asn Gly Ile Ala Ile Thr
1735 1740 1745
TCA ATG GAT GAT TTT AAT TCC CAT CCG ATT ACA TTA AAT AAA AAA CAA 2784
Ser Met Asp Asp Phe Asn Ser His Pro Ile Thr Leu Asn Lys Lys Gln
1750 1755 1760
GTA GAT AAT CTG CTA AAT AAT AAA CCT ATG ATG TTG GAA ACA AAC CAA 2832
Val Asp Asn Leu Leu Asn Asn Lys Pro Met Met Leu Glu Thr Asn Gln
1765 1770 1775
ACA GAT GGT GTT TAT AAG ATA AAA GAT ACA CAT GGA AAT ATA GTA ACT 2880
Thr Asp Gly Val Tyr Lys Ile Lys Asp Thr His Gly Asn Ile Val Thr
1780 1785 1790
GGC GGA GAA TGG AAT GGT GTC ATA CAA CAA ATC AAG GCT AAA ACA GCG 2928
Gly Gly Glu Trp Asn Gly Val Ile Gln Gln Ile Lys Ala Lys Thr Ala
1795 1800 1805 1810
TCT ATT ATT GTG GAT GAT GGG GAA CGT GTA GCA GAA AAA CGT GTA GCG 2976
Ser Ile Ile Val Asp Asp Gly Glu Arg Val Ala Glu Lys Arg Val Ala
1815 1820 1825
GCA AAA GAT TAT GAA AAT CCA GAA GAT AAA ACA CCG TCT TTA ACT TTA 3024
Ala Lys Asp Tyr Glu Asn Pro Glu Asp Lys Thr Pro Ser Leu Thr Leu
1830 1835 1840
AAA GAT GCC CTG AAG CTT TCA TAT CCA GAT GAA ATA AAA GAA ATA GAG 3072
Lys Asp Ala Leu Lys Leu Ser Tyr Pro Asp Glu Ile Lys Glu Ile Glu
1845 1850 1855
GGA TTA TTA TAT TAT AAA AAC AAA CCG ATA TAC GAA TCG AGC GTT ATG 3120
Gly Leu Leu Tyr Tyr Lys Asn Lys Pro Ile Tyr Glu Ser Ser Val Met
1860 1865 1870
ACT TAC TTA GAT GAA AAT ACA GCA AAA GAA GTG ACC AAA CAA TTA AAT 3168
Thr Tyr Leu Asp Glu Asn Thr Ala Lys Glu Val Thr Lys Gln Leu Asn
1875 1880 1885 1890
GAT ACC ACT GGG AAA TTT AAA GAT GTA AGT CAT TTA TAT GAT GTA AAA 3216
Asp Thr Thr Gly Lys Phe Lys Asp Val Ser His Leu Tyr Asp Val Lys
1895 1900 1905
CTG ACT CCA AAA ATG AAT GTT ACA ATC AAA TTG TCT ATA CTT TAT GAT 3264
Leu Thr Pro Lys Met Asn Val Thr Ile Lys Leu Ser Ile Leu Tyr Asp
1910 1915 1920
AAT GCT GAG TCT AAT GAT AAC TCA ATT GGT AAA TGG ACA AAC ACA AAT 3312
Asn Ala Glu Ser Asn Asp Asn Ser Ile Gly Lys Trp Thr Asn Thr Asn
1925 1930 1935
ATT GTT TCA GGT GGA AAT AAC GGA AAA AAA CAA TAT TCT TCT AAT AAT 3360
Ile Val Ser Gly Gly Asn Asn Gly Lys Lys Gln Tyr Ser Ser Asn Asn
1940 1945 1950
CCG GAT GCT AAT TTG ACA TTA AAT ACA GAT GCT CAA GAA AAA TTA AAT 3408
Pro Asp Ala Asn Leu Thr Leu Asn Thr Asp Ala Gln Glu Lys Leu Asn
1955 1960 1965 1970
AAA AAT CGT GAC TAT TAT ATA AGT TTA TAT ATG AAG TCA GAA AAA AAC 3456
Lys Asn Arg Asp Tyr Tyr Ile Ser Leu Tyr Met Lys Ser Glu Lys Asn
1975 1980 1985
ACA CAA TGT GAG ATT ACT ATA GAT GGG GAG ATT TAT CCG ATC ACT ACA 3504
Thr Gln Cys Glu Ile Thr Ile Asp Gly Glu Ile Tyr Pro Ile Thr Thr
1990 1995 2000
AAA ACA GTG AAT GTG AAT AAA GAC AAT TAC AAA AGA TTA GAT ATT ATA 3552
Lys Thr Val Asn Val Asn Lys Asp Asn Tyr Lys Arg Leu Asp Ile Ile
2005 2010 2015
GCT CAT AAT ATA AAA AGT AAT CCA ATT TCT TCA CTT CAT ATT AAA ACG 3600
Ala His Asn Ile Lys Ser Asn Pro Ile Ser Ser Leu His Ile Lys Thr
2020 2025 2030
AAT GAT GAA ATA ACT TTA TTT TGG GAT GAT ATT TCT ATA ACA GAT GTA 3648
Asn Asp Glu Ile Thr Leu Phe Trp Asp Asp Ile Ser Ile Thr Asp Val
2035 2040 2045 2050
GCA TCA ATA AAA CCG GAA AAT TTA ACA GAT TCA GAA ATT AAA CAG ATT 3696
Ala Ser Ile Lys Pro Glu Asn Leu Thr Asp Ser Glu Ile Lys Gln Ile
2055 2060 2065
TAT AGT AGG TAT GGT ATT AAG TTA GAA GAT GGA ATC CTT ATT GAT AAA 3744
Tyr Ser Arg Tyr Gly Ile Lys Leu Glu Asp Gly Ile Leu Ile Asp Lys
2070 2075 2080
AAA GGT GGG ATT CAT TAT GGT GAA TTT ATT AAT GAA GCT AGT TTT AAT 3792
Lys Gly Gly Ile His Tyr Gly Glu Phe Ile Asn Glu Ala Ser Phe Asn
2085 2090 2095
ATT GAA CCA TTG CAA AAT TAT GTG ACC AAA TAT GAA GTT ACT TAT AGT 3840
Ile Glu Pro Leu Gln Asn Tyr Val Thr Lys Tyr Glu Val Thr Tyr Ser
2100 2105 2110
AGT GAG TTA GGA CCA AAC GTG AGT GAC ACA CTT GAA AGT GAT AAA ATT 3888
Ser Glu Leu Gly Pro Asn Val Ser Asp Thr Leu Glu Ser Asp Lys Ile
2115 2120 2125 2130
TAC AAG GAT GGG ACA ATT AAA TTT GAT TTT ACC AAA TAT AGT AAA AAT 3936
Tyr Lys Asp Gly Thr Ile Lys Phe Asp Phe Thr Lys Tyr Ser Lys Asn
2135 2140 2145
GAA CAA GGA TTA TTT TAT GAC AGT GGA TTA AAT TGG GAC TTT AAA ATT 3984
Glu Gln Gly Leu Phe Tyr Asp Ser Gly Leu Asn Trp Asp Phe Lys Ile
2150 2155 2160
AAT GCT ATT ACT TAT GAT GGT AAA GAG ATG AAT GTT TTT CAT AGA TAT 4032
Asn Ala Ile Thr Tyr Asp Gly Lys Glu Met Asn Val Phe His Arg Tyr
2165 2170 2175
AAT AAA TAG 4041
Asn Lys
2180
(2)关于SEQ ID NO:23的信息
(i)序列特征
(A)长度:1346个氨基酸
(B)类型:氨基酸
(D)拓扑结构:线型
(ii)分子类型:蛋白质
(xi)序列描述:SEQ ID NO:23:
Met Lys Arg Met Glu Gly Lys Leu Phe Met Val Ser Lys Lys Leu Gln
1 5 10 15
Val Val Thr Lys Thr Val Leu Leu Ser Thr Val Phe Ser Ile Ser Leu
20 25 30
Leu Asn Asn Glu Val Ile Lys Ala Glu Gln Leu Asn Ile Asn Ser Gln
35 40 45
Ser Lys Tyr Thr Asn Leu Gln Asn Leu Lys Ile Thr Asp Lys Val Glu
50 55 60
Asp Phe Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys Glu Lys
65 70 75 80
Glu Lys Glu Trp Lys Leu Thr Ala Thr Glu Lys Gly Lys Met Asn Asn
85 90 95
Phe Leu Asp Asn Lys Asn Asp Ile Lys Thr Asn Tyr Lys Glu Ile Thr
100 105 110
Phe Ser Met Ala Gly Ser Phe Glu Asp Glu Ile Lys Asp Leu Lys Glu
115 120 125
Ile Asp Lys Met Phe Asp Lys Thr Asn Leu Ser Asn Ser Ile Ile Thr
130 135 140
Tyr Lys Asn Val Glu Pro Thr Thr Ile Gly Phe Asn Lys Ser Leu Thr
145 150 155 160
Glu Gly Asn Thr Ile Asn Ser Asp Ala Met Ala Gln Phe Lys Glu Gln
165 170 175
Phe Leu Asp Arg Asp Ile Lys Phe Asp Ser Tyr Leu Asp Thr His Leu
180 185 190
Thr Ala Gln Gln Val Ser Ser Lys Glu Arg Val Ile Leu Lys Val Thr
195 200 205
Val Pro Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val Ile
210 215 220
Leu Asn Asn Ser Glu Tyr Lys Met Leu Ile Asp Asn Gly Tyr Met Val
225 230 235 240
His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Val Glu Cys Leu
245 250 255
Gln Ile Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp Ile
260 265 270
Asn Ala Glu Ala His Ser Trp Gly Met Lys Asn Tyr Glu Glu Trp Ala
275 280 285
Lys Asp Leu Thr Asp Ser Gln Arg Glu Ala Leu Asp Gly Tyr Ala Arg
290 295 300
Gln Asp Tyr Lys Glu Ile Asn Asn Tyr Leu Arg Asn Gln Gly Gly Ser
305 310 315 320
Gly Asn Glu Lys Leu Asp Ala Gln Ile Lys Asn Ile Ser Asp Ala Leu
325 330 335
Gly Lys Lys Pro Ile Pro Glu Asn Ile Thr Val Tyr Arg Trp Cys Gly
340 345 350
Met Pro Glu Phe Gly Tyr Gln Ile Ser Asp Pro Leu Pro Ser Leu Lys
355 360 365
Asp Phe Glu Glu Gln Phe Leu Asn Thr Ile Lys Glu Asp Lys Gly Tyr
370 375 380
Met Ser Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Arg
385 390 395 400
Lys Ile Ile Leu Arg Leu Gln Val Pro Lys Gly Ser Thr Gly Ala Tyr
405 410 415
Leu Ser Ala Ile Gly Gly Phe Ala Ser Glu Lys Glu Ile Leu Leu Asp
420 425 430
Lys Asp Ser Lys Tyr His Ile Asp Lys Val Thr Glu Val Ile Ile Lys
435 440 445
Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn Met Lys
450 455 460
Asn Met Lys Lys Lys Leu Ala Ser Val Val Thr Cys Thr Leu Leu Ala
465 470 475 480
Pro Met Phe Leu Asn Gly Asn Val Asn Ala Val Tyr Ala Asp Ser Lys
485 490 495
Thr Asn Gln Ile Ser Thr Thr Gln Lys Asn Gln Gln Lys Glu Met Asp
500 505 510
Arg Lys Gly Leu Leu Gly Tyr Tyr Phe Lys Gly Lys Asp Phe Ser Asn
515 520 525
Leu Thr Met Phe Ala Pro Thr Arg Asp Ser Thr Leu Ile Tyr Asp Gln
530 535 540
Gln Thr Ala Asn Lys Leu Leu Asp Lys Lys Gln Gln Glu Tyr Gln Ser
545 550 555 560
Ile Arg Trp Ile Gly Leu Ile Gln Ser Lys Glu Thr Gly Asp Phe Thr
565 570 575
Phe Asn Leu Ser Glu Asp Glu Gln Ala Ile Ile Glu Ile Asn Gly Lys
580 585 590
Ile Ile Ser Asn Lys Gly Lys Glu Lys Gln Val Val His Leu Glu Lys
595 600 605
Gly Lys Leu Val Pro Ile Lys Ile Glu Tyr Gln Ser Asp Thr Lys Phe
610 615 620
Asn Ile Asp Ser Lys Thr Phe Lys Glu Leu Lys Leu Phe Lys Ile Asp
625 630 635 640
Ser Gln Asn Gln Pro Gln Gln Val Gln Gln Asp Glu Leu Arg Asn Pro
645 650 655
Glu Phe Asn Lys Lys Glu Ser Gln Glu Phe Leu Ala Lys Pro Ser Lys
660 665 670
Ile Asn Leu Phe Thr Gln Lys Met Lys Arg Glu Ile Asp Glu Asp Thr
675 680 685
Asp Thr Asp Gly Asp Ser Ile Pro Asp Leu Trp Glu Glu Asn Gly Tyr
690 695 700
Thr Ile Gln Asn Arg Ile Ala Val Lys Trp Asp Asp Ser Leu Ala Ser
705 710 715 720
Lys Gly Tyr Thr Lys Phe Val Ser Asn Pro Leu Glu Ser His Thr Val
725 730 735
Gly Asp Pro Tyr Thr Asp Tyr Glu Lys Ala Ala Arg Asp Leu Asp Leu
740 745 750
Ser Asn Ala Lys Glu Thr Phe Asn Pro Leu Val Ala Ala Phe Pro Ser
755 760 765
Val Asn Val Ser Met Glu Lys Val Ile Leu Ser Pro Asn Glu Asn Leu
770 775 780
Ser Asn Ser Val Glu Ser His Ser Ser Thr Asn Trp Ser Tyr Thr Asn
785 790 795 800
Thr Glu Gly Ala Ser Val Glu Ala Gly Ile Gly Pro Lys Gly Ile Ser
805 810 815
Phe Gly Val Ser Val Asn Tyr Gln His Ser Glu Thr Val Ala Gln Glu
820 825 830
Trp Gly Thr Ser Thr Gly Asn Thr Ser Gln Phe Asn Thr Ala Ser Ala
835 840 845
Gly Tyr Leu Asn Ala Asn Val Arg Tyr Asn Asn Val Gly Thr Gly Ala
850 855 860
Ile Tyr Asp Val Lys Pro Thr Thr Ser Phe Val Leu Asn Asn Asp Thr
865 870 875 880
Ile Ala Thr Ile Thr Ala Lys Ser Asn Ser Thr Ala Leu Asn Ile Ser
885 890 895
Pro Gly Glu Ser Tyr Pro Lys Lys Gly Gln Asn Gly Ile Ala Ile Thr
900 905 910
Ser Met Asp Asp Phe Asn Ser His Pro Ile Thr Leu Asn Lys Lys Gln
915 920 925
Val Asp Asn Leu Leu Asn Asn Lys Pro Met Met Leu Glu Thr Asn Gln
930 935 940
Thr Asp Gly Val Tyr Lys Ile Lys Asp Thr His Gly Asn Ile Val Thr
945 950 955 960
Gly Gly Glu Trp Asn Gly Val Ile Gln Gln Ile Lys Ala Lys Thr Ala
965 970 975
Ser Ile Ile Val Asp Asp Gly Glu Arg Val Ala Glu Lys Arg Val Ala
980 985 990
Ala Lys Asp Tyr Glu Asn Pro Glu Asp Lys Thr Pro Ser Leu Thr Leu
995 1000 1005
Lys Asp Ala Leu Lys Leu Ser Tyr Pro Asp Glu Ile Lys Glu Ile Glu
1010 1015 1020
Gly Leu Leu Tyr Tyr Lys Asn Lys Pro Ile Tyr Glu Ser Ser Val Met
1025 1030 1035 1040
Thr Tyr Leu Asp Glu Asn Thr Ala Lys Glu Val Thr Lys Gln Leu Asn
1045 1050 1055
Asp Thr Thr Gly Lys Phe Lys Asp Val Ser His Leu Tyr Asp Val Lys
1060 1065 1070
Leu Thr Pro Lys Met Asn Val Thr Ile Lys Leu Ser Ile Leu Tyr Asp
1075 1080 1085
Asn Ala Glu Ser Asn Asp Asn Ser Ile Gly Lys Trp Thr Asn Thr Asn
1090 1095 1100
Ile Val Ser Gly Gly Asn Asn Gly Lys Lys Gln Tyr Ser Ser Asn Asn
1105 1110 1115 1120
Pro Asp Ala Asn Leu Thr Leu Asn Thr Asp Ala Gln Glu Lys Leu Asn
1125 1130 1135
Lys Asn Arg Asp Tyr Tyr Ile Ser Leu Tyr Met Lys Ser Glu Lys Asn
1140 1145 1150
Thr Gln Cys Glu Ile Thr Ile Asp Gly Glu Ile Tyr Pro Ile Thr Thr
1155 1160 1165
Lys Thr Val Asn Val Asn Lys Asp Asn Tyr Lys Arg Leu Asp Ile Ile
1170 1175 1180
Ala His Asn Ile Lys Ser Asn Pro Ile Ser Ser Leu His Ile Lys Thr
1185 1190 1195 1200
Asn Asp Glu Ile Thr Leu Phe Trp Asp Asp Ile Ser Ile Thr Asp Val
1205 1210 1215
Ala Ser Ile Lys Pro Glu Asn Leu Thr Asp Ser Glu Ile Lys Gln Ile
1220 1225 1230
Tyr Ser Arg Tyr Gly Ile Lys Leu Glu Asp Gly Ile Leu Ile Asp Lys
1235 1240 1245
Lys Gly Gly Ile His Tyr Gly Glu Phe Ile Asn Glu Ala Ser Phe Asn
1250 1255 1260
Ile Glu Pro Leu Gln Asn Tyr Val Thr Lys Tyr Glu Val Thr Tyr Ser
1265 1270 1275 1280
Ser Glu Leu Gly Pro Asn Val Ser Asp Thr Leu Glu Ser Asp Lys Ile
1285 1290 1295
Tyr Lys Asp Gly Thr Ile Lys Phe Asp Phe Thr Lys Tyr Ser Lys Asn
1300 1305 1310
Glu Gln Gly Leu Phe Tyr Asp Ser Gly Leu Asn Trp Asp Phe Lys Ile
1315 1320 1325
Asn Ala Ile Thr Tyr Asp Gly Lys Glu Met Asn Val Phe His Arg Tyr
1330 1335 1340
Asn Lys
1345
(2)关于SEQ ID NO:24的信息
(i)序列特征
(A)长度:1399个核苷酸
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:DNA(基因组)
(ix)特征:
(A)名称/键:misc_feature
(B)位置:1..1386
(D)其它信息:/备注=“来自AB78的编码VIP2A(a)的玉米最佳化DNA序列。”
(xi)序列描述:SEQ ID NO:24:
ATGAAGCGCA TGGAGGGCAA GCTGTTCATG GTGAGCAAGA AGCTCCAGGT GGTGACCAAG 60
ACCGTGCTGC TGAGCACCGT GTTCAGCATC AGCCTGCTGA ACAACGAGGT GATCAAGGCC 120
GAGCAGCTGA ACATCAACAG CCAGAGCAAG TACACCAACC TCCAGAACCT GAAGATCACC 180
GACAAGGTGG AGGACTTCAA GGAGGACAAG GAGAAGGCCA ACCAGTGGGG CAAGGAGAAG 240
GAGAAGGAGT GGAAGCTTAC CGCCACCGAG AAGGGCAAGA TGAACAACTT CCTGGACAAC 300
AAGAACGACA TCAAGACCAA CTACAAGGAG ATCACCTTCA GCATGGCCGG CAGCTTCGAG 360
GACGAGATCA AGHSCCTGAA GGAGATCGAC AAGATGTTCG ACAAGACCAA CCTGAGCAAC 420
AGCATCATCA CCTACAAGAA CGTGGAGCCC ACCACCATCG GCTTCAACAA GAGCCTGACC 480
GAGGGCAACA CCATCAACAG CGACGCCATG GCCCAGTTCA AGGAGCAGTT CCTGGACCGC 540
GACATCAAGT TCGACAGCTA CCTGGACACC CACCTGACCG CCCAGCAGGT GAGCAGCAAG 600
GAGCGCGTGA TCCTGAAGGT GACCGTCCCC AGCGGCAAGG GCAGCACCAC CCCCACCAAG 660
GCCGGCGTGA TCCTGAACAA CAGCGAGTAC AAGATGCTGA TCGACAACGG CTACATGGTG 720
CACGTGGACA AGGTGAGCAA GGTGGTGAAG AAGGGCGTGG AGTGCCTCCA GATCGAGGGC 780
ACCCTGAAGA AGAGTCTAGA CTTCAAGAAC GACATCAACG CCGAGGCCCA CAGCTGGGGC 840
ATGAAGAACT ACGAGGAGTG GGCCAAGGAC CTGACCGACA GCCAGCGCGA GGCCCTGGAC 900
GGCTACGCCC GCCAGGACTA CAAGGAGATC AACAACTACC TGCGCAACCA GGGCGGCAGC 960
GGCAACGAGA AGCTGGACGC CCAGATCAAG AACATCAGGG ACGCCCTGGG CAAGAAGCCC 1020
ATCCCCGAGA ACATCACCGT GTACCGCTGG TGCGGCATGC CCGAGTTCGG CTACCAGATC 1080
AGCGACCCCC TGCCCAGCCT GAAGGACTTC GAGGAGCAGT TCCTGAACAC CATCAAGGAG 1140
GACAAGGGCT ACATGAGCAC CAGCCTGAGC AGCGAGCGCC TGGCCGCCTT CGGCAGCCGC 1200
AAGATCATCC TGCGCCTGCA GGTGCCCAAG GGCAGCACCG GCGCCTACCT GAGCGCCATC 1260
GGCGGCTTCG CCAGCGAGAA GGAGATCCTG CTGGACAAGG ACAGCAAGTA CCACATCGAC 1320
AAGGTGACCG AGGTGATCAT CAAGGGCGTG AAGCGCTACG TGGTGGACGC CACCCTGCTG 1380
ACCAACTAGA TCTGAGCTC 1399
(2)关于SEQ ID NO:25的信息
(i)序列特征
(A)长度:19个氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:肽
(ix)特征:
(A)名称/键:肽
(B)位置:1..19
(D)其它信息:/备注=“从细胞向外分泌VIP2的分泌信号肽”
(xi)序列描述:SEQ ID NO:25:
Gly Trp Ser Trp Ile Phe Leu Phe Leu Leu Ser Gly Ala Ala Gly Val
1 5 10 15
His Cys Leu
(2)关于SEQ ID NO:26的信息
(i)序列特征
(A)长度:2655个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:其它核酸
(A)描述:/desc=“合成的DNA”
(iii)假拟:否
(ix)特征:
(A)名称/键:misc_feathure
(B)位置:1..2655
(D)其它信息:/备注=“编码VIP1A(a)的玉米最佳化DNA序列。”
(xi)序列描述:SEQ ID NO:26:
ATGAAGAACA TGAAGAAGAA GCTGGCCAGC GTGGTGACCT GCACCCTGCT GGCCCCCATG 60
TTCCTGAACG GCAACGTGAA CGCCGTGTAC GCCGACAGCA AGACCAACCA GATCAGCACC 120
ACCCAGAAGA ACCAGCAGAA GGAGATGGAC CGCAAGGGCC TGCTGGGCTA CTACTTCAAG 180
GGCAAGGACT TCAGCAACCT GACCATGTTC GCCCCCACGC GTGACAGCAC CCTGATCTAC 240
GACCAGCAGA CCGCCAACAA GCTGCTGGAC AAGAAGCAGC AGGAGTACCA GAGCATCCGC 300
TGGATCGGCC TGATCCAGAG CAAGGAGACC GGCGACTTCA CCTTCAACCT GAGCGAGGAC 360
GAGCAGGCCA TCATCGAGAT CAACGGCAAG ATCATCAGCA ACAAGGGCAA GGAGAAGCAG 420
GTGGTGCACC TGGAGAAGGG CAAGCTGGTG CCCATCAAGA TCGAGTACCA GAGCGACACC 480
AAGTTCAACA TCGACAGCAA GACCTTCAAG GAGCTGAAGC TTTTCAAGAT CGACAGCCAG 540
AACCAGCCCC AGCAGGTGCA GCAGGACGAG CTGCGCAACC CCGAGTTCAA CAAGAAGGAG 600
AGCCAGGAGT TCCTGGCCAA GCCCAGCAAG ATCAACCTGT TCACCCAGCA GATGAAGCGC 660
GAGATCGACG AGGACACCGA CACCGACGGC GACAGCATCC CCGACCTGTG GGAGGAGAAC 720
GGCTACACCA TCCAGAACCG CATCGCCGTG AAGTGGGACG ACAGCCTGGC TAGCAAGGGC 780
TACACCAAGT TCGTGAGCAA CCCCCTGGAG AGCCACACCG TGGGCGACCC CTACACCGAC 840
TACGAGAAGG CCGCCCGCGA CCTGGACCTG AGCAACGCCA AGGAGACCTT CAACCCCCTG 900
GTGGCCGCCT TCCCCAGCGT GAACGTGAGC ATGGAGAAGG TGATCCTGAG CCCCAACGAG 960
AACCTGAGCA ACAGCGTGGA GAGCCACTCG AGCACCAACT GGAGCTACAC CAACACCGAG 1020
GGCGCCAGCG TGGAGGCCGG CATCGGTCCC AAGGGCATCA GCTTCGGCGT GAGCGTGAAC 1080
TACCAGCACA GCGAGACCGT GGCCCAGGAG TGGGGCACCA GCACCGGCAA CACCAGCCAG 1140
TTCAACACCG CCAGCGCCGG CTACCTGAAC GCCAACGTGC GCTACAACAA CGTGGGCACC 1200
GGCGCCATCT ACGACGTGAA GCCCACCACC AGCTTCGTGC TGAACAACGA CACCATCGCC 1260
ACCATCACCG CCAAGTCGAA TTCCACCGCC CTGAACATCA GCCCCGGCGA GAGCTACCCC 1320
AAGAAGGGCC AGAACGGCAT CGCCATCACC AGCATGGACG ACTTCAACAG CCACCCCATC 1380
ACCCTGAACA AGAAGCAGGT GGACAACCTG CTGAACAACA AGCCCATGAT GCTGGAGACC 1440
AACCAGACCG ACGGCGTCTA CAAGATCAAG GACACCCACG GCAACATCGT GACGGGCGGC 1500
GAGTGGAACG GCGTGATCCA GCAGATCAAG GCCAAGACCG CCAGCATCAT CGTCGACGAC 1560
GGCGAGCGCG TGGCCGAGAA GCGCGTGGCC GCCAAGGACT ACGAGAACCC CGAGGACAAG 1620
ACCCCCAGCC TGACCCTGAA GGACGCCCTG AAGCTGAGCT ACCCCGACGA GATCAAGGAG 1680
ATCGAGGGCT TGCTGTACTA CAAGAACAAG CCCATCTACG AGAGCAGCGT GATGACCTAT 1740
CTAGACGAGA ACACCGCCAA GGAGGTGACC AAGCAGCTGA ACGACACCAC CGGCAAGTTC 1800
AAGGACGTGA GCCACCTGTA CGACGTGAAG CTGACCCCCA AGATGAACGT GACCATCAAG 1860
CTGAGCATCC TGTACGACAA CGCCGAGAGC AACGACAACA GCATCGGCAA GTGGACCAAC 1920
ACCAACATCG TGAGCGGCGG CAACAACGGC AAGAAGCAGT ACAGCAGCAA CAACCCCGAC 1980
GCCAACCTGA CCCTGAACAC CGACGCCCAG GAGAAGCTGA ACAAGAACCG CGACTACTAC 2040
ATCAGCCTGT ACATGAAGAG CGAGAAGAAC ACCCAGTGCG AGATCACCAT CGACGGCGAG 2100
ATATACCCCA TCACCACCAA GACCGTGAAC GTGAACAAGG ACAACTACAA GCGCCTGGAC 2160
ATCATCGCCC ACAACATCAA GAGCAACCCC ATCAGCAGCC TGCACATCAA GACCAACGAC 2220
GAGATCACCC TGTTCTGGGA CGACATATCG ATTACCGACG TCGCCAGCAT CAAGCCCGAG 2280
AACCTGACCG ACAGCGAGAT CAAGCAGATA TACAGTCGCT ACGGCATCAA GCTGGAGGAC 2340
GGCATCCTGA TCGACAAGAA AGGCGGCATC CACTACGGCG AGTTCATCAA CGAGGCCAGC 2400
TTCAACATCG AGCCCCTGCA GAACTACGTG ACCAAGTACG AGGTGACCTA CAGCAGCGAG 2460
CTGGGCCCCA ACGTGAGCGA CACCCTGGAG AGCGACAAGA TTTACAAGGA CGGCACCATC 2520
AAGTTCGACT TCACCAAGTA CAGCAAGAAC GAGCAGGGCC TGTTCTACGA CAGCGGCCTG 2580
AACTGGGACT TCAAGATCAA CGCCATCACC TACGACGGCA AGGAGATGAA CGTGTTCACC 2640
CGCTACAACA AGTAG 2655
(2)关于SEQ ID NO:27的信息
(i)序列特征
(A)长度:1389个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:其它核酸
(A)描述:/desc=“合成的DNA”
(iii)假拟:否
(ix)特征:
(A)名称/键:misc_feathure
(B)位置: 1..1389
(D)其它信息:/备注=“编码VIP2A(a)的玉米最佳化DNA序列。”
(xi)序列描述:SEQ ID NO:27:
ATGAAGCGCA TGGAGGGCAA GCTGTTCATG GTGAGCAAGA AGCTCCAGGT GGTGACCAAG 60
ACCGTGCTGC TGAGCACCGT GTTCAGCATC AGCCTGCTGA ACAACGAGGT GATCAAGGCC 120
GAGCAGCTGA ACATCAACAG CCAGAGCAAG TACACCAACC TCCAGAACCT GAAGATCACC 180
GACAAGGTGG AGGACTTCAA GGAGGACAAG GAGAAGGCCA AGGAGTGGGG CAAGGAGAAG 240
GAGAAGGAGT GGAAGCTTAC CGCCACCGAG AAGGGCAAGA TGAACAACTT CCTGGACAAC 300
AAGAACGACA TCAAGACCAA CTACAAGGAG ATCACCTTCA GCATAGCCGG CAGCTTCGAG 360
GACGAGATCA AGGACCTGAA GGAGATCGAC AAGATGTTCG ACAAGACCAA CCTGAGCAAC 420
AGCATCATCA CCTACAAGAA CGTGGAGCCC ACCACCATCG GCTTCAACAA GAGCCTGACC 480
GAGGGCAACA CCATCAACAG CGACGCCATG GCCCAGTTCA AGGAGCAGTT CCTGGACCGC 540
GACATCAAGT TCGACAGCTA CCTGGACACC CACCTGACCG CCCAGCAGGT GAGCAGCAAG 600
GAGCGCGTGA TCCTGAAGGT GACCGTCCCC AGCGGCAAGG GCAGCACCAC CCCCACCAAG 660
GCCGGCGTGA TCCTGAACAA CAGCGAGTAC AAGATGCTGA TCGACAACGG CTACATGGTG 720
CACGTGGACA AGGTGAGCAA GGTGGTGAAG AAGGGCGTGG AGTGCCTCCA GATCGAGGGC 780
ACCCTGAAGA AGAGTCTAGA CTTCAAGAAC GACATCAACG CCGAGGCCCA CAGCTGGGGC 840
ATGAAGAACT ACGAGGAGTG GGCCAAGGAC CTGACCGACA GCCAGCGCGA GGCCCTGGAC 900
GGCTACGCCC GCCAGGACTA CAAGGAGATC AACAACTACC TGCGCAACCA GGGCGGCAGC 960
GGCAACGAGA AGCTGGACGC CCAGATCAAG AACATCAGCG ACGCCCTGGG CAAGAAGCCC 1020
ATCCCCGAGA ACATCACCGT GTACCGCTGG TGCGGCATGC CCGAGTTCGG CTACCAGATC 1080
AGCGACCCCC TGCCCAGCCT GAAGGACTTC GAGGAGCAGT TCCTGAACAC CATCAAGGAG 1140
GACAAGGGCT ACATGAGCAC CAGCCTGAGC AGCGAGCGCC TGGCCGCCTT CGGCAGCCGC 1200
AAGATCATCC TGCGCCTGCA GGTGCCCAAG GGCAGCACTG GTGCCTACCT GAGCGCCATC 1260
GGCGGCTTCG CCAGCGAGAA GGAGATCCTG CTGGATAAGG ACAGCAAGTA CCACATCGAC 1320
AAGGTGACCG AGGTGATCAT CAAGGGCGTG AAGCGCTACG TGGTGGACGC CACCCTGCTG 1380
ACCAACTAG 1389
(2)关于SEQ ID NO:28的信息
(i)序列特征
(A)长度:2378个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:DNA(基因组)
(iii)假拟:否
(ix)特征:
(A)名称/键:CDS
(B)位置:9..2375
(D)其它信息:/备注=“来自AB88的编码VIP3A(a)蛋白的天然DNA序列,包含于pCIB7104中。”
(xi)序列描述:SEQ ID NO:28:
AGATGAAC ATG AAC AAG AAT AAT ACT AAA TTA AGC ACA AGA GCC TTA CCA 50
Met Asn Lys Asn Asn Thr Lys Leu Ser Thr Arg Ala Leu Pro
1 5 10
AGT TTT ATT GAT TAT TTT AAT GGC ATT TAT GGA TTT GCC ACT GGT ATC 98
Ser Phe Ile Asp Tyr Phe Asn Gly Ile Tyr Gly Phe Ala Thr Gly Ile
15 20 25 30
AAA GAC ATT ATG AAC ATG ATT TTT AAA ACG GAT ACA GGT GGT GAT CTA 146
Lys Asp Ile Met Asn Met Ile Phe Lys Thr Asp Thr Gly Gly Asp Leu
35 40 45
ACC CTA GAC GAA ATT TTA AAG AAT CAG CAG TTA CTA AAT GAT ATT TCT 194
Thr Leu Asp Glu Ile Leu Lys Asn Gln Gln Leu Leu Asn Asp Ile Ser
50 55 60
GGT AAA TTG GAT GGG GTG AAT GGA AGC TTA AAT GAT CTT ATC GCA CAG 242
Gly Lys Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu Ile Ala Gln
65 70 75
GGA AAC TTA AAT ACA GAA TTA TCT AAG GAA ATA TTA AAA ATT GCA AAT 290
Gly Asn Leu Asn Thr Glu Leu Ser Lys Glu Ile Leu Lys Ile Ala Asn
80 85 90
GAA CAA AAT CAA GTT TTA AAT GAT GTT AAT AAC AAA CTC GAT GCG ATA 338
Glu Gln Asn Gln Val Leu Asn Asp Val Asn Asn Lys Leu Asp Ala Ile
95 100 105 110
AAT ACG ATG CTT CGG GTA TAT CTA CCT AAA ATT ACC TCT ATG TTG AGT 386
Asn Thr Met Leu Arg Val Tyr Leu Pro Lys Ile Thr Ser Met Leu Ser
115 120 125
GAT GTA ATG AAA CAA AAT TAT GCG CTA AGT CTG CAA ATA GAA TAC TTA 434
Asp Val Met Lys Gln Asn Tyr Ala Leu Ser Leu Gln Ile Glu Tyr Leu
130 135 140
AGT AAA CAA TTG CAA GAG ATT TCT GAT AAG TTG GAT ATT ATT AAT GTA 482
Ser Lys Gln Leu Gln Glu Ile Ser Asp Lys Leu Asp Ile Ile Asn Val
145 150 155
AAT GTA CTT ATT AAC TCT ACA CTT ACT GAA ATT ACA CCT GCG TAT CAA 530
Asn Val Leu Ile Asn Ser Thr Leu Thr Glu Ile Thr Pro Ala Tyr Gln
160 165 170
AGG ATT AAA TAT GTG AAC GAA AAA TTT GAG GAA TTA ACT TTT GCT ACA 578
Arg Ile Lys Tyr Val Asn Glu Lys Phe Glu Glu Leu Thr Phe Ala Thr
175 180 185 190
GAA ACT AGT TCA AAA GTA AAA AAG GAT GGC TCT CCT GCA GAT ATT CTT 626
Glu Thr Ser Ser Lys Val Lys Lys Asp Gly Ser Pro Ala Asp Ile Leu
195 200 205
GAT GAG TTA ACT GAG TTA ACT GAA CTA GCG AAA AGT GTA ACA AAA AAT 674
Asp Glu Leu Thr Glu Leu Thr Glu Leu Ala Lys Ser Val Thr Lys Asn
210 215 220
GAT GTG GAT GGT TTT GAA TTT TAC CTT AAT ACA TTC CAC GAT GTA ATG 722
Asp Val Asp Gly Phe Glu Phe Tyr Leu Asn Thr Phe His Asp Val Met
225 230 235
GTA GGA AAT AAT TTA TTC GGG CGT TCA GCT TTA AAA ACT GCA TCG GAA 770
Val Gly Asn Asn Leu Phe Gly Arg Ser Ala Leu Lys Thr Ala Ser Glu
240 245 250
TTA ATT ACT AAA GAA AAT GTG AAA ACA AGT GGC AGT GAG GTC GGA AAT 818
Leu Ile Thr Lys Glu Asn Val Lys Thr Ser Gly Ser Glu Val Gly Asn
255 260 265 270
GTT TAT AAC TTC TTA ATT GTA TTA ACA GCT CTG CAA GCC CAA GCT TTT 866
Val Tyr Asn Phe Leu Ile Val Leu Thr Ala Leu Gln Ala Gln Ala Phe
275 280 285
CTT ACT TTA ACA ACA TGC CGA AAA TTA TTA GGC TTA GCA GAT ATT GAT 914
Leu Thr Leu Thr Thr Cys Arg Lys Leu Leu Gly Leu Ala Asp Ile Asp
290 295 300
TAT ACT TCT ATT ATG AAT GAA CAT TTA AAT AAG GAA AAA GAG GAA TTT 962
Tyr Thr Ser Ile Met Asn Glu His Leu Asn Lys Glu Lys Glu Glu Phe
305 310 315
AGA GTA AAC ATC CTC CCT ACA CTT TCT AAT ACT TTT TCT AAT CCT AAT 1010
Arg Val Asn Ile Leu Pro Thr Leu Ser Asn Thr Phe Ser Asn Pro Asn
320 325 330
TAT GCA AAA GTT AAA GGA AGT GAT GAA GAT GCA AAG ATG ATT GTG GAA 1058
Tyr Ala Lys Val Lys Gly Ser Asp Glu Asp Ala Lys Met Ile Val Glu
335 340 345 350
GCT AAA CCA GGA CAT GCA TTG ATT GGG TTT GAA ATT AGT AAT GAT TCA 1106
Ala Lys Pro Gly His Ala Leu Ile Gly Phe Glu Ile Ser Asn Asp Ser
355 360 365
ATT ACA GTA TTA AAA GTA TAT GAG GCT AAG CTA AAA CAA AAT TAT CAA 1154
Ile Thr Val Leu Lys Val Tyr Glu Ala Lys Leu Lys Gln Asn Tyr Gln
370 375 380
GTC GAT AAG GAT TCC TTA TCG GAA GTT ATT TAT GGT GAT ATG GAT AAA 1202
Val Asp Lys Asp Ser Leu Ser Glu Val Ile Tyr Gly Asp Met Asp Lys
385 390 395
TTA TTG TGC CCA GAT CAA TCT GAA CAA ATC TAT TAT ACA AAT AAC ATA 1250
Leu Leu Cys Pro Asp Gln Ser Glu Gln Ile Tyr Tyr Thr Asn Asn Ile
400 405 410
GTA TTT CCA AAT GAA TAT GTA ATT ACT AAA ATT GAT TTC ACT AAA AAA 1298
Val Phe Pro Asn Glu Tyr Val Ile Thr Lys Ile Asp Phe Thr Lys Lys
415 420 425 430
ATG AAA ACT TTA AGA TAT GAG GTA ACA GCG AAT TTT TAT GAT TCT TCT 1346
Met Lys Thr Leu Arg Tyr Glu Val Thr Ala Asn Phe Tyr Asp Ser Ser
435 440 445
ACA GGA GAA ATT GAC TTA AAT AAG AAA AAA GTA GAA TCA AGT GAA GCG 1394
Thr Gly Glu Ile Asp Leu Asn Lys Lys Lys Val Glu Ser Ser Glu Ala
450 455 460
GAG TAT AGA ACG TTA AGT GCT AAT GAT GAT GGG GTG TAT ATG CCG TTA 1442
Glu Tyr Arg Thr Leu Ser Ala Asn Asp Asp Gly Val Tyr Met Pro Leu
465 470 475
GGT GTC ATC AGT GAA ACA TTT TTG ACT CCG ATT AAT GGG TTT GGC CTC 1490
Gly Val Ile Ser Glu Thr Phe Leu Thr Pro Ile Asn Gly Phe Gly Leu
480 485 490
CAA GCT GAT GAA AAT TCA AGA TTA ATT ACT TTA ACA TGT AAA TCA TAT 1538
Gln Ala Asp Glu Asn Ser Arg Leu Ile Thr Leu Thr Cys Lys Ser Tyr
495 500 505 510
TTA AGA GAA CTA CTG CTA GCA ACA GAC TTA AGC AAT AAA GAA ACT AAA 1586
Leu Arg Glu Leu Leu Leu Ala Thr Asp Leu Ser Asn Lys Glu Thr Lys
515 520 525
TTG ATC GTC CCG CCA AGT GGT TTT ATT AGC AAT ATT GTA GAG AAC GGG 1634
Leu Ile Val Pro Pro Ser Gly Phe Ile Ser Asn Ile Val Glu Asn Gly
530 535 540
TCC ATA GAA GAG GAC AAT TTA GAG CCG TGG AAA GCA AAT AAT AAG AAT 1682
Ser Ile Glu Glu Asp Asn Leu Glu Pro Trp Lys Ala Asn Asn Lys Asn
545 550 555
GCG TAT GTA GAT CAT ACA GGC GGA GTG AAT GGA ACT AAA GCT TTA TAT 1730
Ala Tyr Val Asp His Thr Gly Gly Val Asn Gly Thr Lys Ala Leu Tyr
560 565 570
GTT CAT AAG GAC GGA GGA ATT TCA CAA TTT ATT GGA GAT AAG TTA AAA 1778
Val His Lys Asp Gly Gly Ile Ser Gln Phe Ile Gly Asp Lys Leu Lys
575 580 585 590
CCG AAA ACT GAG TAT GTA ATC CAA TAT ACT GTT AAA GGA AAA CCT TCT 1826
Pro Lys Thr Glu Tyr Val Ile Gln Tyr Thr Val Lys Gly Lys Pro Ser
595 600 605
ATT CAT TTA AAA GAT GAA AAT ACT GGA TAT ATT CAT TAT GAA GAT ACA 1874
Ile His Leu Lys Asp Glu Asn Thr Gly Tyr Ile His Tyr Glu Asp Thr
610 615 620
AAT AAT AAT TTA GAA GAT TAT CAA ACT ATT AAT AAA CGT TTT ACT ACA 1922
Asn Asn Asn Leu Glu Asp Tyr Gln Thr Ile Asn Lys Arg Phe Thr Thr
625 630 635
GGA ACT GAT TTA AAG GGA GTG TAT TTA ATT TTA AAA AGT CAA AAT GGA 1970
Gly Thr Asp Leu Lys Gly Val Tyr Leu Ile Leu Lys Ser Gln Asn Gly
640 645 650
GAT GAA GCT TGG GGA GAT AAC TTT ATT ATT TTG GAA ATT AGT CCT TCT 2018
Asp Glu Ala Trp Gly Asp Asn Phe Ile Ile Leu Glu Ile Ser Pro Ser
655 660 665 670
GAA AAG TTA TTA AGT CCA GAA TTA ATT AAT ACA AAT AAT TGG ACG AGT 2066
Glu Lys Leu Leu Ser Pro Glu Leu Ile Asn Thr Asn Asn Trp Thr Ser
675 680 685
ACG GGA TCA ACT AAT ATT AGC GGT AAT ACA CTC ACT CTT TAT CAG GGA 2114
Thr Gly Ser Thr Asn Ile Ser Gly Asn Thr Leu Thr Leu Tyr Gln Gly
690 695 700
GGA CGA GGG ATT CTA AAA CAA AAC CTT CAA TTA GAT AGT TTT TCA ACT 2162
Gly Arg Gly Ile Leu Lys Gln Asn Leu Gln Leu Asp Ser Phe Ser Thr
705 710 715
TAT AGA GTG TAT TTT TCT GTG TCC GGA GAT GCT AAT GTA AGG ATT AGA 2210
Tyr Arg Val Tyr Phe Ser Val Ser Gly Asp Ala Asn Val Arg Ile Arg
720 725 730
AAT TCT AGG GAA GTG TTA TTT GAA AAA AGA TAT ATG AGC GGT GCT AAA 2258
Asn Ser Arg Glu Val Leu Phe Glu Lys Arg Tyr Met Ser Gly Ala Lys
735 740 745 750
GAT GTT TCT GAA ATG TTC ACT ACA AAA TTT GAG AAA GAT AAC TTT TAT 2306
Asp Val Ser Glu Met Phe Thr Thr Lys Phe Glu Lys Asp Asn Phe Tyr
755 760 765
ATA GAG CTT TCT CAA GGG AAT AAT TTA TAT GGT GGT CCT ATT GTA CAT 2354
Ile Glu Leu Ser Gln Gly Asn Asn Leu Tyr Gly Gly Pro Ile Val His
770 775 780
TTT TAC GAT GTC TCT ATT AAG TAA 2378
Phe Tyr Asp Val Ser Ile Lys
785
(2)关于SEQ ID NO:29的信息
(i)序列特征
(A)长度:789个氨基酸
(B)类型:氨基酸
(D)拓扑结构:线型
(i)分子类型:蛋白质
(xi)序列描述:SEQ ID NO:29:
Met Asn Lys Asn Asn Thr Lys Leu Ser Thr Arg Ala Leu Pro Ser Phe
1 5 10 15
Ile Asp Tyr Phe Asn Gly Ile Tyr Gly Phe Ala Thr Gly Ile Lys Asp
20 25 30
Ile Met Asn Met Ile Phe Lys Thr Asp Thr Gly Gly Asp Leu Thr Leu
35 40 45
Asp Glu Ile Leu Lys Asn Gln Gln Leu Leu Asn Asp Ile Ser Gly Lys
50 55 60
Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu Ile Ala Gln Gly Asn
65 70 75 80
Leu Asn Thr Glu Leu Ser Lys Glu Ile Leu Lys Ile Ala Asn Glu Gln
85 90 95
Asn Gln Val Leu Asn Asp Val Asn Asn Lys Leu Asp Ala Ile Asn Thr
100 105 110
Met Leu Arg Val Tyr Leu Pro Lys Ile Thr Ser Met Leu Ser Asp Val
115 120 125
Met Lys Gln Asn Tyr Ala Leu Ser Leu Gln Ile Glu Tyr Leu Ser Lys
130 135 140
Gln Leu Gln Glu Ile Ser Asp Lys Leu Asp Ile Ile Asn Val Asn Val
145 150 155 160
Leu Ile Asn Ser Thr Leu Thr Glu Ile Thr Pro Ala Tyr Gln Arg Ile
165 170 175
Lys Tyr Val Asn Glu Lys Phe Glu Glu Leu Thr Phe Ala Thr Glu Thr
180 185 190
Ser Ser Lys Val Lys Lys Asp Gly Ser Pro Ala Asp Ile Leu Asp Glu
195 200 205
Leu Thr Glu Leu Thr Glu Leu Ala Lys Ser Val Thr Lys Asn Asp Val
210 215 220
Asp Gly Phe Glu Phe Tyr Leu Asn Thr Phe His Asp Val Met Val Gly
225 230 235 240
Asn Asn Leu Phe Gly Arg Ser Ala Leu Lys Thr Ala Ser Glu Leu Ile
245 250 255
Thr Lys Glu Asn Val Lys Thr Ser Gly Ser Glu Val Gly Asn Val Tyr
260 265 270
Asn Phe Leu Ile Val Leu Thr Ala Leu Gln Ala Gln Ala Phe Leu Thr
275 280 285
Leu Thr Thr Cys Arg Lys Leu Leu Gly Leu Ala Asp Ile Asp Tyr Thr
290 295 300
Ser Ile Met Asn Glu His Leu Asn Lys Glu Lys Glu Glu Phe Arg Val
305 310 315 320
Asn Ile Leu Pro Thr Leu Ser Asn Thr Phe Ser Asn Pro Asn Tyr Ala
325 330 335
Lys Val Lys Gly Ser Asp Glu Asp Ala Lys Met Ile Val Glu Ala Lys
340 345 350
Pro Gly His Ala Leu Ile Gly Phe Glu Ile Ser Asn Asp Ser Ile Thr
355 360 365
Val Leu Lys Val Tyr Glu Ala Lys Leu Lys Gln Asn Tyr Gln Val Asp
370 375 380
Lys Asp Ser Leu Ser Glu Val Ile Tyr Gly Asp Met Asp Lys Leu Leu
385 390 395 400
Cys Pro Asp Gln Ser Glu Gln Ile Tyr Tyr Thr Asn Asn Ile Val Phe
405 410 415
Pro Asn Glu Tyr Val Ile Thr Lys Ile Asp Phe Thr Lys Lys Met Lys
420 425 430
Thr Leu Arg Tyr Glu Val Thr Ala Asn Phe Tyr Asp Ser Ser Thr Gly
435 440 445
Glu Ile Asp Leu Asn Lys Lys Lys Val Glu Ser Ser Glu Ala Glu Tyr
450 455 460
Arg Thr Leu Ser Ala Asn Asp Asp Gly Val Tyr Met Pro Leu Gly Val
465 470 475 480
Ile Ser Glu Thr Phe Leu Thr Pro Ile Asn Gly Phe Gly Leu Gln Ala
485 490 495
Asp Glu Asn Ser Arg Leu Ile Thr Leu Thr Cys Lys Ser Tyr Leu Arg
500 505 510
Glu Leu Leu Leu Ala Thr Asp Leu Ser Asn Lys Glu Thr Lys Leu Ile
515 520 525
Val Pro Pro Ser Gly Phe Ile Ser Asn Ile Val Glu Asn Gly Ser Ile
530 535 540
Glu Glu Asp Asn Leu Glu Pro Trp Lys Ala Asn Asn Lys Asn Ala Tyr
545 550 555 560
Val Asp His Thr Gly Gly Val Asn Gly Thr Lys Ala Leu Tyr Val His
565 570 575
Lys Asp Gly Gly Ile Ser Gln Phe Ile Gly Asp Lys Leu Lys Pro Lys
580 585 590
Thr Glu Tyr Val Ile Gln Tyr Thr Val Lys Gly Lys Pro Ser Ile His
595 600 605
Leu Lys Asp Glu Asn Thr Gly Tyr Ile His Tyr Glu Asp Thr Asn Asn
610 615 620
Asn Leu Glu Asp Tyr Gln Thr Ile Asn Lys Arg Phe Thr Thr Gly Thr
625 630 635 640
Asp Leu Lys Gly Val Tyr Leu Ile Leu Lys Ser Gln Asn Gly Asp Glu
645 650 655
Ala Trp Gly Asp Asn Phe Ile Ile Leu Glu Ile Ser Pro Ser Glu Lys
660 665 670
Leu Leu Ser Pro Glu Leu Ile Asn Thr Asn Asn Trp Thr Ser Thr Gly
675 680 685
Ser Thr Asn Ile Ser Gly Asn Thr Leu Thr Leu Tyr Gln Gly Gly Arg
690 695 700
Gly Ile Leu Lys Gln Asn Leu Gln Leu Asp Ser Phe Ser Thr Tyr Arg
705 710 715 720
Val Tyr Phe Ser Val Ser Gly Asp Ala Asn Val Arg Ile Arg Asn Ser
725 730 735
Arg Glu Val Leu Phe Glu Lys Arg Tyr Met Ser Gly Ala Lys Asp Val
740 745 750
Ser Glu Met Phe Thr Thr Lys Phe Glu Lys Asp Asn Phe Tyr Ile Glu
755 760 765
Leu Ser Gln Gly Asn Asn Leu Tyr Gly Gly Pro Ile Val His Phe Tyr
770 775 780
Asp Val Ser Ile Lys
785
(2)关于SEQ ID NO:30的信息
(i)序列特征
(A)长度:2403个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型: 其它核酸
(A)描述:/desc=“合成的DNA”
(iii)假拟:否
(ix)特征:
(A)名称/键:misc_feature
(B)位置:11..2389
(D)其它信息:/备注=“编码VIP3A(a)的玉米最佳化DNA序列。”
(xi)序列描述:SEQ ID NO:30:
GGATCCACCA ATGAACATGA ACAAGAACAA CACCAAGCTG AGCACCCGCG CCCTGCCGAG 60
CTTCATCGAC TACTTCAACG GCATCTACGG CTTCGCCACC GGCATCAAGG ACATCATGAA 120
CATGATCTTC AAGACCGACA CCGGCGGCGA CCTGACCCTG GACGAGATCC TGAAGAACCA 180
GCAGCTGCTG AACGACATCA GCGGCAAGCT GGACGGCGTG AACGGCAGCC TGAACGACCT 240
GATCGCCCAG GGCAACCTGA ACACCGAGCT GAGCAAGGAG ATCCTTAAGA TCGCCAACGA 300
GCAGAACCAG GTGCTGAACG ACGTGAACAA CAAGCTGGAC GCCATCAACA CCATGCTGCG 360
CGTGTACCTG CCGAAGATCA CCAGCATGCT GAGCGACGTG ATGAAGCAGA ACTACGCCCT 420
GAGCCTGCAG ATCGAGTACC TGAGCAAGCA GCTGCAGGAG ATCAGCGACA AGCTGGACAT 480
CATCAACGTG AACGTCCTGA TCAACAGCAC CCTGACCGAG ATCACCCCGG CCTACCAGCG 540
CATCAAGTAC GTGAACGAGA AGTTCGAAGA GCTGACCTTC GCCACCGAGA CCAGCAGCAA 600
GGTGAAGAAG GACGGCAGCC CGGCCGACAT CCTGGACGAG CTGACCGAGC TGACCGAGCT 660
GGCCAAGAGC GTGACCAAGA ACGACGTGGA CGGCTTCGAG TTCTACCTGA ACACCTTCCA 720
CGACGTGATG GTGGGCAACA ACCTGTTCGG CCGCAGCGCC CTGAAGACCG CCAGCGAGCT 780
GATCACCAAG GAGAACGTGA AGACCAGCGG CAGCGAGGTG GGCAACGTGT ACAACTTCCT 840
GATCGTGCTG ACCGCCCTGC AGGCCCAGGC CTTCCTGACC CTGACCACCT GTCGCAAGCT 900
GCTGGGCCTG GCCGACATCG ACTACACCAG CATCATGAAC GAGCACTTGA ACAAGGAGAA 960
GGAGGAGTTC CGCGTGAACA TCCTGCCGAC CCTGAGCAAC ACCTTCAGCA ACCCGAACTA 1020
CGCCAAGGTG AAGGGCAGCG ACGAGGACGC CAAGATGATC GTGGAGGCTA AGCCGGGCCA 1080
CGCGTTGATC GGCTTCGAGA TCAGCAACGA CAGCATCACC GTGCTGAAGG TGTACGAGGC 1140
CAAGCTGAAG CAGAACTACC AGGTGGACAA GGACAGCTTG AGCGAGGTGA TCTACGGCGA 1200
CATGGACAAG CTGCTGTGTC CGGACCAGAG CGAGCAAATC TACTACACCA ACAACATCGT 1260
GTTCCCGAAC GAGTACGTGA TCACCAAGAT CGACTTCACC AAGAAGATGA AGACCCTGCG 1320
CTACGAGGTG ACCGCCAACT TCTACGACAG CAGCACCGGC GAGATCGACC TGAACAAGAA 1380
GAAGGTGGAG AGCAGCGAGG CCGAGTACCG CACCCTGAGC GCGAACGACG ACGGCGTCTA 1440
CATGCCACTG GGCGTGATCA GCGAGACCTT CCTGACCCCG ATCAACGGCT TTGGCCTGCA 1500
GGCCGACGAG AACAGCCGCC TGATCACCCT GACCTGTAAG AGCTACCTGC GCGAGCTGCT 1560
GCTAGCCACC GACCTGAGCA ACAAGGAGAC CAAGCTGATC GTGCCACCGA GCGGCTTCAT 1620
CAGCAACATC GTGGAGAACG GCAGCATCGA GGAGGACAAC CTGGAGCCGT GGAAGGCCAA 1680
CAACAAGAAC GCCTACGTGG ACCACACCGG CGGCGTGAAC GGCACCAAGG CCCTGTACGT 1740
GCACAAGGAC GGCGGCATCA GCCAGTTCAT CGGCGACAAG CTGAAGCCGA AGACCGAGTA 1800
CGTGATCCAG TACACCGTGA AGGGCAAGCC ATCGATTCAC CTGAAGGACG AGAACACCGG 1860
CTACATCCAC TACGAGGACA CCAACAACAA CCTGGAGGAC TACCAGACCA TCAACAAGCG 1920
CTTCACCACC GGCACCGACC TGAAGGGCGT GTACCTGATC CTGAAGAGCC AGAACGGCGA 1980
CGAGGCCTGG GGCGACAACT TCATCATCCT GGAGATCAGC CCGAGCGAGA AGCTGCTGAG 2040
CCCGGAGCTG ATCAACACCA ACAACTGGAC CAGCACCGGC AGCACCAACA TCAGCGGCAA 2100
CACCCTGACC CTGTACCAGG GCGGCCGCGG CATCCTGAAG CAGAACCTGC AGCTGGACAG 2160
CTTCAGCACC TACCGCGTGT ACTTCAGCGT GAGCGGCGAC GCCAACGTGC GCATCCGCAA 2220
CAGCCGCGAG GTGCTGTTCG AGAAGAGGTA CATGAGCGGC GCCAAGGACG TGAGCGAGAT 2280
GTTCACCACC AAGTTCGAGA AGGACAACTT CTACATCGAG CTGAGCCAGG GCAACAACCT 2340
GTACGGCGGC CCGATCGTGC ACTTCTACGA CGTGAGCATC AAGTTAACGT AGAGCTCAGA 2400
TCT 2403
(2)关于SEQ ID NO:31的信息
(i)序列特征
(A)长度:2612个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:DNA(基因组)
(iii)假拟:否
(ix)特征:
(A)名称/键:CDS
(B)位置:118..2484
(D)其它信息:/备注=“来自AB424的编码VIP3A(b)的天然DNA序列。”
(xi)序列描述:SEQ ID NO:31:
ATTGAAATTG ATAAAAAGTT ATGAGTGTTT AATAATCAGT AATTACCAAT AAAGAATTAA 60
GAATACAAGT TTACAAGAAA TAAGTGTTAC AAAAAATAGC TGAAAAGGAA GATGAAC 117
ATG AAC AAG AAT AAT ACT AAA TTA AGC ACA AGA GCC TTA CCA AGT TTT 165
Met Asn Lys Asn Asn Thr Lys Leu Ser Thr Arg Ala Leu Pro Ser Phe
790 795 800 805
ATT GAT TAT TTC AAT GGC ATT TAT GGA TTT GCC ACT GGT ATC AAA GAC 213
Ile Asp Tyr Phe Asn Gly Ile Tyr Gly Phe Ala Thr Gly Ile Lys Asp
810 815 820
ATT ATG AAC ATG ATT TTT AAA ACG GAT ACA GGT GGT GAT CTA ACC CTA 261
Ile Met Asn Met Ile Phe Lys Thr Asp Thr Gly Gly Asp Leu Thr Leu
825 830 835
GAC GAA ATT TTA AAG AAT CAG CAG CTA CTA AAT GAT ATT TCT GGT AAA 309
Asp Glu Ile Leu Lys Asn Gln Gln Leu Leu Asn Asp Ile Ser Gly Lys
840 845 850
TTG GAT GGG GTG AAT GGA AGC TTA AAT GAT CTT ATC GCA CAG GGA AAC 357
Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu Ile Ala Gln Gly Asn
855 860 865
TTA AAT ACA GAA TTA TCT AAG GAA ATA TTA AAA ATT GCA AAT GAA CAA 405
Leu Asn Thr Glu Leu Ser Lys Glu Ile Leu Lys Ile Ala Asn Glu Gln
870 875 880 885
AAT CAA GTT TTA AAT GAT GTT AAT AAC AAA CTC GAT GCG ATA AAT ACG 453
Asn Gln Val Leu Asn Asp Val Asn Asn Lys Leu Asp Ala Ile Asn Thr
890 895 900
ATG CTT CGG GTA TAT CTA CCT AAA ATT ACC TCT ATG TTG AGT GAT GTA 501
Met Leu Arg Val Tyr Leu Pro Lys Ile Thr Ser Met Leu Ser Asp Val
905 910 915
ATG AAA CAA AAT TAT GCG CTA AGT CTG CAA ATA GAA TAC TTA AGT AAA 549
Met Lys Gln Asn Tyr Ala Leu Ser Leu Gln Ile Glu Tyr Leu Ser Lys
920 925 930
CAA TTG CAA GAG ATT TCT GAT AAG TTG GAT ATT ATT AAT GTA AAT GTA 597
Gln Leu Gln Glu Ile Ser Asp Lys Leu Asp Ile Ile Asn Val Asn Val
935 940 945
CTT ATT AAC TCT ACA CTT ACT GAA ATT ACA CCT GCG TAT CAA AGG ATT 645
Leu Ile Asn Ser Thr Leu Thr Glu Ile Thr Pro Ala Tyr Gln Arg Ile
950 955 960 965
AAA TAT GTG AAC GAA AAA TTT GAG GAA TTA ACT TTT GCT ACA GAA ACT 693
Lys Tyr Val Asn Glu Lys Phe Glu Glu Leu Thr Phe Ala Thr Glu Thr
970 975 980
AGT TCA AAA GTA AAA AAG GAT GGC TCT CCT GCA GAT ATT CGT GAT GAG 741
Ser Ser Lys Val Lys Lys Asp Gly Ser Pro Ala Asp Ile Arg Asp Glu
985 990 995
TTA ACT GAG TTA ACT GAA CTA GCG AAA AGT GTA ACA AAA AAT GAT GTG 789
Leu Thr Glu Leu Thr Glu Leu Ala Lys Ser Val Thr Lys Asn Asp Val
1000 1005 1010
GAT GGT TTT GAA TTT TAC CTT AAT ACA TTC CAC GAT GTA ATG GTA GGA 837
Asp Gly Phe Glu Phe Tyr Leu Asn Thr Phe His Asp Val Met Val Gly
1015 1020 1025
AAT AAT TTA TTC GGG CGT TCA GCT TTA AAA ACT GCA TCG GAA TTA ATT 885
Asn Asn Leu Phe Gly Arg Ser Ala Leu Lys Thr Ala Ser Glu Leu Ile
1030 1035 1040 1045
ACT AAA GAA AAT GTG AAA ACA AGT GGC AGT GAG GTC AAT GTT GTT TAT 933
Thr Lys Glu Asn Val Lys Thr Ser Gly Ser Glu Val Gly Asn Val Tyr
1050 1055 1060
AAC TTC CTA ATT GTA TTA ACA GCT CTG CAA GCA AAA GCT TTT CTT ACT 981
Asn Phe Leu Ile Val Leu Thr Ala Leu Gln Ala Lys Ala Phe Leu Thr
1065 1070 1075
TTA ACA CCA TGC CGA AAA TTA TTA GGC TTA GCA GAT ATT GAT TAT ACT 1029
Leu Thr Pro Cys Arg Lys Leu Leu Gly Leu Ala Asp Ile Asp Tyr Thr
1080 1085 1090
TCT ATT ATG AAT GAA CAT TTA AAT AAG GAA AAA GAG GAA TTT AGA GTA 1077
Ser Ile Met Asn Glu His Leu Asn Lys Glu Lys Glu Glu Phe Arg Val
1095 1100 1105
AAC ATC CTC CCT ACA CTT TCT AAT ACT TTT TCT AAT CCT AAT TAT GCA 1125
Asn Ile Leu Pro Thr Leu Ser Asn Thr Phe Ser Asn Pro Asn Tyr Ala
1110 1115 1120 1125
AAA GTT AAA GGA AGT GAT GAA GAT GCA AAG ATG ATT GTG GAA GCT AAA 1173
Lys Val Lys Gly Ser Asp Glu Asp Ala Lys Met Ile Val Glu Ala Lys
1130 1135 1140
CCA GGA CAT GCA TTG ATT GGG TTT GAA ATT AGT AAT GAT TCA ATT ACA 1221
Pro Gly His Ala Leu Ile Gly Phe Glu Ile Ser Asn Asp Ser Ile Thr
1145 1150 1155
GTA TTA AAA GTA TAT GAG GCT AAG CTA AAA CAA AAT TAT CAA GTC GAT 1269
Val Leu Lys Val Tyr Glu Ala Lys Leu Lys Gln Asn Tyr Gln Val Asp
1160 1165 1170
AAG GAT TCC TTA TCG GAA GTT ATT TAT GGC GAT ATG GAT AAA TTA TTG 1317
Lys Asp Ser Leu Ser Glu Val Ile Tyr Gly Asp Met Asp Lys Leu Leu
1175 1180 1185
TGC CCA GAT CAA TCT GGA CAA ATC TAT TAT ACA AAT AAC ATA GTA TTT 1365
Cys Pro Asp Gln Ser Gly Gln Ile Tyr Tyr Thr Asn Asn Ile Val Phe
1190 1195 1200 1205
CCA AAT GAA TAT GTA ATT ACT AAA ATT GAT TTC ACT AAA AAA ATG AAA 1413
Pro Asn Glu Tyr Val Ile Thr Lys Ile Asp Phe Thr Lys Lys Met Lys
1210 1215 1220
ACT TTA AGA TAT GAG GTA ACA GCG AAT TTT TAT GAT TCT TCT ACA GGA 1461
Thr Leu Arg Tyr Glu Val Thr Ala Asn Phe Tyr Asp Ser Ser Thr Gly
1225 1230 1235
GAA ATT GAC TTA AAT AAG AAA AAA GTA GAA TCA AGT GAA GCG GAG TAT 1509
Glu Ile Asp Leu Asn Lys Lys Lys Val Glu Ser Ser Glu Ala Glu Tyr
1240 1245 1250
AGA ACG TTA AGT GCT AAT GAT GAT GGG GTG TAT ATG CCG TTA GGT GTC 1557
Arg Thr Leu Ser Ala Asn Asp Asp Gly Val Tyr Met Pro Leu Gly Val
1255 1260 1265
ATC AGT GAA ACA TTT TTG ACT CCG ATT AAT GGG TTT GGC CTC CAA GCT 1605
Ile Ser Glu Thr Phe Leu Thr Pro Ile Asn Gly Phe Gly Leu Gln Ala
1270 1275 1280 1285
GAT GAA AAT TCA AGA TTA ATT ACT TTA ACA TGT AAA TCA TAT TTA AGA 1653
Asp Glu Asn Ser Arg Leu Ile Thr Leu Thr Cys Lys Ser Tyr Leu Arg
1290 1295 1300
GAA CTA CTG CTA GCA ACA GAC TTA AGC AAT AAA GAA ACT AAA TTG ATC 1701
Glu Leu Leu Leu Ala Thr Asp Leu Ser Asn Lys Glu Thr Lys Leu Ile
1305 1310 1315
GTC CCG CCA AGT GGT TTT ATT AGC AAT ATT GTA GAG AAC GGG TCC ATA 1749
Val Pro Pro Ser Gly Phe Ile Ser Asn Ile Val Glu Asn Gly Ser Ile
1320 1325 1330
GAA GAG GAC AAT TTA GAG CCG TGG AAA GCA AAT AAT AAG AAT GCG TAT 1797
Glu Glu Asp Asn Leu Glu Pro Trp Lys Ala Asn Asn Lys Asn Ala Tyr
1335 1340 1345
GTA GAT CAT ACA GGC GGA GTG AAT GGA ACT AAA GCT TTA TAT GTT CAT 1845
Val Asp His Thr Gly Gly Val Asn Gly Thr Lys Ala Leu Tyr Val His
1350 1355 1360 1365
AAG GAC GGA GGA ATT TCA CAA TTT ATT GGA GAT AAG TTA AAA CCG AAA 1893
Lys Asp Gly Gly Ile Ser Gln Phe Ile Gly Asp Lys Leu Lys Pro Lys
1370 1375 1380
ACT GAG TAT GTA ATC CAA TAT ACT GTT AAA GGA AAA CCT TCT ATT CAT 1941
Thr Glu Tyr Val Ile Gln Tyr Thr Val Lys Gly Lys Pro Ser Ile His
1385 1390 1395
TTA AAA GAT GAA AAT ACT GGA TAT ATT CAT TAT GAA GAT ACA AAT AAT 1989
Leu Lys Asp Glu Asn Thr Gly Tyr Ile His Tyr Glu Asp Thr Asn Asn
1400 1405 1410
AAT TTA GAA GAT TAT CAA ACT ATT AAT AAA CGT TTT ACT ACA GGA ACT 2037
Asn Leu Glu Asp Tyr Gln Thr Ile Asn Lys Arg Phe Thr Thr Gly Thr
1415 1420 1425
GAT TTA AAG GGA GTG TAT TTA ATT TTA AAA AGT CAA AAT GGA GAT GAA 2085
Asp Leu Lys Gly Val Tyr Leu Ile Leu Lys Ser Gln Asn Gly Asp Glu
1430 1435 1440 1445
GCT TGG GGA GAT AAC TTT ATT ATT TTG GAA ATT AGT CCT TCT GAA AAG 2133
Ala Trp Gly Asp Asn Phe Ile Ile Leu Glu Ile Ser Pro Ser Glu Lys
1450 1455 1460
TTA TTA AGT CCA GAA TTA ATT AAT ACA AAT AAT TGG ACG AGT ACG GGA 2181
Leu Leu Ser Pro Glu Leu Ile Asn Thr Asn Asn Trp Thr Ser Tnr Gly
1465 1470 1475
TCA ACT AAT ATT AGC GGT AAT ACA CTC ACT CTT TAT CAG GGA GGA CGA 2229
Ser Thr Asn Ile Ser Gly Asn Thr Leu Thr Leu Tyr Gln Gly Gly Arg
1480 1485 1490
GGG ATT CTA AAA CAA AAC CTT CAA TTA GAT AGT TTT TCA ACT TAT AGA 2277
Gly Ile Leu Lys Gln Asn Leu Gln Leu Asp Ser Phe Ser Thr Tyr Arg
1495 1500 1505
GTG TAT TTC TCT GTG TCC GGA GAT GCT AAT GTA AGG ATT AGA AAT TCT 2325
Val Tyr Phe Ser Val Ser Gly Asp Ala Asn Val Arg Ile Arg Asn Ser
1510 1515 1520 1525
AGG GAA GTG TTA TTT GAA AAA AGA TAT ATG AGC GGT GCT AAA GAT GTT 2373
Arg Glu Val Leu Phe Glu Lys Arg Tyr Met Ser Gly Ala Lys Asp Val
1530 1535 1540
TCT GAA ATG TTC ACT ACA AAA TTT GAG AAA GAT AAC TTC TAT ATA GAG 2421
Ser Glu Met Phe Thr Thr Lys Phe Glu Lys Asp Asn Phe Tyr Ile Glu
1545 1550 1555
CTT TCT CAA GGG AAT AAT TTA TAT GGT GGT CCT ATT GTA CAT TTT TAC 2469
Leu Ser Gln Gly Asn Asn Leu Tyr Gly Gly Pro Ile Val His Phe Tyr
1560 1565 1570
GAT GTC TCT ATT AAG TAAGATCGGG ATCTAATATT AACAGTTTTT AGAAGCTAAT 2524
Asp Val Ser Ile Lys
1575
TCTTGTATAA TGTCCTTGAT TATGGAAAAA CACAATTTTG TTTGCTAAGA TGTATATATA 2584
GCTCACTCAT TAAAAGGCAA TCAAGCTT 2612
(2)关于SEQ ID NO:32的信息
(i)序列特征
(A)长度:789个氨基酸
(B)类型:氨基酸
(D)拓扑结构:线型
(ii)分子类型:蛋白质
(xi)序列描述:SEQ ID NO:32:
Met Asn Lys Asn Asn Thr Lys Leu Ser Thr Arg Ala Leu Pro Ser Phe
1 5 10 15
Ile Asp Tyr Phe Asn Gly Ile Tyr Gly Phe Ala Thr Gly Ile Lys Asp
20 25 30
Ile Met Asn Met Ile Phe Lys Thr Asp Thr Gly Gly Asp Leu Thr Leu
35 40 45
Asp Glu Ile Leu Lys Asn Gln Gln Leu Leu Asn Asp Ile Ser Gly Lys
50 55 60
Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu Ile Ala Gln Gly Asn
65 70 75 80
Leu Asn Thr Glu Leu Ser Lys Glu Ile Leu Lys Ile Ala Asn Glu Gln
85 90 95
Asn Gln Val Leu Asn Asp Val Asn Asn Lys Leu Asp Ala Ile Asn Thr
100 105 110
Met Leu Arg Val Tyr Leu Pro Lys Ile Thr Ser Met Leu Ser Asp Val
115 120 125
Met Lys Gln Asn Tyr Ala Leu Ser Leu Gln Ile Glu Tyr Leu Ser Lys
130 135 140
Gln Leu Gln Glu Ile Ser Asp Lys Leu Asp Ile Ile Asn Val Asn Val
145 150 155 160
Leu Ile Asn Ser Thr Leu Thr Glu Ile Thr Pro Ala Tyr Gln Arg Ile
165 170 175
Lys Tyr Val Asn Glu Lys Phe Glu Glu Leu Thr Phe Ala Thr Glu Thr
180 185 190
Ser Ser Lys Val Lys Lys Asp Gly Ser Pro Ala Asp Ile Arg Asp Glu
195 200 205
Leu Thr Glu Leu Thr Glu Leu Ala Lys Ser Val Thr Lys Asn Asp Val
210 215 220
Asp Gly Phe Glu Phe Tyr Leu Asn Thr Phe His Asp Val Met Val Gly
225 230 235 240
Asn Asn Leu Phe Gly Arg Ser Ala Leu Lys Thr Ala Ser Glu Leu Ile
245 250 255
Thr Lys Glu Asn Val Lys Thr Ser Gly Ser Glu Val Gly Asn Val Tyr
260 265 270
Asn Phe Leu Ile Val Leu Thr Ala Leu Gln Ala Lys Ala Phe Leu Thr
275 280 285
Leu Thr Pro Cys Arg Lys Leu Leu Gly Leu Ala Asp Ile Asp Tyr Thr
290 295 300
Ser Ile Met Asn Glu His Leu Asn Lys Glu Lys Glu Glu Phe Arg Val
305 310 315 320
Asn Ile Leu Pro Thr Leu Ser Asn Thr Phe Ser Asn Pro Asn Tyr Ala
325 330 335
Lys Val Lys Gly Ser Asp Glu Asp Ala Lys Met Ile Val Glu Ala Lys
340 345 350
Pro Gly His Ala Leu Ile Gly Phe Glu Ile Ser Asn Asp Ser Ile Thr
355 360 365
Val Leu Lys Val Tyr Glu Ala Lys Leu Lys Gln Asn Tyr Gln Val Asp
370 375 380
Lys Asp Ser Leu Ser Glu Val Ile Tyr Gly Asp Met Asp Lys Leu Leu
385 390 395 400
Cys Pro Asp Gln Ser Gly Gln Ile Tyr Tyr Thr Asn Asn Ile Val Phe
405 410 415
Pro Asn Glu Tyr Val Ile Thr Lys Ile Asp Phe Thr Lys Lys Met Lys
420 425 430
Thr Leu Arg Tyr Glu Val Thr Ala Asn Phe Tyr Asp Ser Ser Thr Gly
435 440 445
Glu Ile Asp Leu Asn Lys Lys Lys Val Glu Ser Ser Glu Ala Glu Tyr
450 455 460
Arg Thr Leu Ser Ala Asn Asp Asp Gly Val Tyr Met Pro Leu Gly Val
465 470 475 480
Ile Ser Glu Thr Phe Leu Thr Pro Ile Asn Gly Phe Gly Leu Gln Ala
485 490 495
Asp Glu Asn Ser Arg Leu Ile Thr Leu Thr Cys Lys Ser Tyr Leu Arg
500 505 510
Glu Leu Leu Leu Ala Thr Asp Leu Ser Asn Lys Glu Thr Lys Leu Ile
515 520 525
Val Pro Pro Ser Gly Phe Ile Ser Asn Ile Val Glu Asn Gly Ser Ile
530 535 540
Glu Glu Asp Asn Leu Glu Pro Trp Lys Ala Asn Asn Lys Asn Ala Tyr
545 550 555 560
Val Asp His Thr Gly Gly Val Asn Gly Thr Lys Ala Leu Tyr Val His
565 570 575
Lys Asp Gly Gly Ile Ser Gln Phe Ile Gly Asp Lys Leu Lys Pro Lys
580 585 590
Thr Glu Tyr Val Ile Gln Tyr Thr Val Lys Gly Lys Pro Ser Ile His
595 600 605
Leu Lys Asp Glu Asn Thr Gly Tyr Ile His Tyr Glu Asp Thr Asn Asn
610 615 620
Asn Leu Glu Asp Tyr Gln Thr Ile Asn Lys Arg Phe Thr Thr Gly Thr
625 630 635 640
Asp Leu Lys Gly Val Tyr Leu Ile Leu Lys Ser Gln Asn Gly Asp Glu
645 650 655
Ala Trp Gly Asp Asn Phe Ile Ile Leu Glu Ile Ser Pro Ser Glu Lys
660 665 670
Leu Leu Ser Pro Glu Leu Ile Asn Thr Asn Asn Trp Thr Ser Thr Gly
675 680 685
Ser Thr Asn Ile Ser Gly Asn Thr Leu Thr Leu Tyr Gln Gly Gly Arg
690 695 700
Gly Ile Leu Lys Gln Asn Leu Gln Leu Asp Ser Phe Ser Thr Tyr Arg
705 710 715 720
Val Tyr Phe Ser Val Ser Gly Asp Ala Asn Val Arg Ile Arg Asn Ser
725 730 735
Arg Glu Val Leu Phe Glu Lys Arg Tyr Met Ser Gly Ala Lys Asp Val
740 745 750
Ser Glu Met Phe Thr Thr Lys Phe Glu Lys Asp Asn Phe Tyr Ile Glu
755 760 765
Leu Ser Gln Gly Asn Asn Leu Tyr Gly Gly Pro Ile Val His Phe Tyr
770 775 780
Asp Val Ser Ile Lys
785
(2)关于SEQ ID NO:33的信息
(i)序列特征
(A)长度:30个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:其它核酸
(A)描述:/desc=“用于制备pCIB5526的正向引物”
(iii)假拟:否
(xi)序列描述:SEQ ID NO:33:
GGATCCACCA TGAAGACCAA CCAGATCAGC 30
(2)关于SEQ ID NO:34的信息
(i)序列特征
(A)长度:15个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:其它核酸
(A)描述:/desc=“用于制备pCIB5526的反向引物”
(iii)假拟:否
(xi)序列描述:SEQ ID NO:34:
AAGCTTCAGC TCCTT 15
(2)关于SEQ ID NO:35的信息
(i)序列特征
(A)长度:2576个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:其它核酸
(A)描述:/desc=“合成的DNA”
(iii)假拟:否
(ix)特征:
(A)名称/键:CDS
(B)位置:9..2564
(D)其它信息:/备注=“编码VIP1A(a)和切出的芽孢杆菌属分泌信号的玉米最佳化DNA序列,包含在pCIB5526中。”
(xi)序列描述:SEQ ID NO:35:
GATCCACC ATG AAG ACC AAC CAG ATC AGC ACC ACC CAG AAG AAC CAG CAG 50
Met Lys Thr Asn Gln Ile Ser Thr Thr Gln Lys Asn Gln Gln
825 830 835
AAG GAG ATG GAC CGC AAG GGC CTG CTG GGC TAC TAC TTC AAG GGC AAG 98
Lys Glu Met Asp Arg Lys Gly Leu Leu Gly Tyr Tyr Phe Lys Gly Lys
840 845 850
GAC TTC AGC AAC CTG ACC ATG TTC GCC CCC ACG CGT GAC AGC ACC CTG 146
Asp Phe Ser Asn Leu Thr Met Phe Ala Pro Thr Arg Asp Ser Thr Leu
855 860 865
ATC TAC GAC CAG CAG ACC GCC AAC AAG CTG CTG GAC AAG AAG CAG CAG 194
Ile Tyr Asp Gln Gln Thr Ala Asn Lys Leu Leu Asp Lys Lys Gln Gln
870 875 880
GAG TAC CAG AGC ATC CGC TGG ATC GGC CTG ATC CAG AGC AAG GAG ACC 242
Glu Tyr Gln Ser Ile Arg Trp Ile Gly Leu Ile Gln Ser Lys Glu Thr
885 890 895
GGC GAC TTC ACC TTC AAC CTG AGC GAG GAC GAG CAG GCC ATC ATC GAG 290
Gly Asp Phe Thr Phe Asn Leu Ser Glu Asp Glu Gln Ala Ile Ile Glu
900 905 910 915
ATC AAC GGC AAG ATC ATC AGC AAC AAG ~C AAG GAG AAG CAG GTG GTG 338
Ile Asn Gly Lys Ile Ile Ser Asn Lys Gly Lys Glu Lys Gln Val Val
920 925 930
CAC CTG GAG AAG GGC AAG CTG GTG CCC ATC AAG ATC GAG TAC CAG AGC 386
His Leu Glu Lys Gly Lys Leu Val Pro Ile Lys Ile Glu Tyr Gln Ser
935 940 945
GAC ACC AAG TTC AAC ATC GAC AGC AAG ACC TTC AAG GAG CTG AAG CTT 434
Asp Thr Lys Phe Asn Ile Asp Ser Lys Thr Phe Lys Glu Leu Lys Leu
950 955 960
TTC AAG ATC GAC AGC CAG AAC CAG CCC CAG CAG GTG CAG CAG GAC GAG 482
Phe Lys Ile Asp Ser Gln Asn Gln Pro Gln Gln Val Gln Gln Asp Glu
965 970 975
CTG CGC AAC CCC GAG TTC AAC AAG AAG GAG AGC CAG GAG TTC CTG GCC 530
Leu Arg Asn Pro Glu Phe Asn Lys Lys Glu Ser Gln Glu Phe Leu Ala
980 985 990 995
AAG CCC AGC AAG ATC AAC CTG TTC ACC CAG CAG ATG A AG CGC GAG ATC 578
Lys Pro Ser Lys Ile Asn Leu Phe Thr Gln Gln Met Lys Arg Glu Ile
1000 1005 1010
GAC GAG GAC ACC GAC ACC GAC GGC GAC AGC ATC CCC GAC CTG TGG GAG 626
Asp Glu Asp Thr Asp Thr Asp Gly Asp Ser Ile Pro Asp Leu Trp Glu
1015 1020 1025
GAG AAC GGC TAC ACC ATC CAG AAC CGC ATC GCC GTG AAG TGG GAC GAC 674
Glu Asn Gly Tyr Thr Ile Gln Asn Arg Ile Ala Val Lys Trp Asp Asp
1030 1035 1040
AGC CTG GCT AGC AAG GGC TAC ACC AAG TTC GTG AGC AAC CCC CTG GAG 722
Ser Leu Ala Ser Lys Gly Tyr Thr Lys Phe Val Ser Asn Pro Leu Glu
1045 1050 1055
AGC CAC ACC GTG GGC GAC CCC TAC ACC GAC TAC GAG AAG GCC GCC CGC 770
Ser His Thr Val Gly Asp Pro Tyr Thr Asp Tyr Glu Lys Ala Ala Arg
1060 1065 1070 1075
GAC CTG GAC CTG AGC AAC GCC AAG GAG ACC TTC AAC CCC CTG GTG GCC 818
Asp Leu Asp Leu Ser Asn Ala Lys Glu Thr Phe Asn Pro Leu Val Ala
1080 1085 1090
GCC TTC CCC AGC GTG AAC GTG AGC ATG GAG AAG GTG ATC CTG AGC CCC 866
Ala Phe Pro Ser Val Asn Val Ser Met Glu Lys Val Ile Leu Ser Pro
1095 1100 1105
AAC GAG AAC CTG AGC AAC AGC GTG GAG AGC CAC TCG AGC ACC AAC TGG 914
Asn Glu Asn Leu Ser Asn Ser Val Glu Ser His Ser Ser Thr Asn Trp
1110 1115 1120
AGC TAC ACC AAC ACC GAG GGC GCC AGC GTG GAG GCC GGC ATC GGT CCC 962
Ser Tyr Thr Asn Thr Glu Gly Ala Ser Val Glu Ala Gly Ile Gly Pro
1125 1130 1135
AAG GGC ATC AGC TTC GGC GTG AGC GTG AAC TAC CAG CAC AGC GAG ACC 1010
Lys Gly Ile Ser Phe Gly Val Ser Val Asn Tyr Gln His Ser Glu Thr
1140 1145 1150 1155
GTG GCC CAG GAG TGG GGC ACC AGC ACC GGC AAC ACC AGC CAG TTC AAC 1058
Val Ala Gln Glu Trp Gly Thr Ser Thr Gly Asn Thr Ser Gln Phe Asn
1160 1165 1170
ACC GCC AGC GCC GGC TAC CTG AAC GCC AAC GTG CGC TAC AAC AAC GTG 1106
Thr Ala Ser Ala Gly Tyr Leu Asn Ala Asn Val Arg Tyr Asn Asn Val
1175 1180 1185
GGC ACC GGC GCC ATC TAC GAC GTG AAG CCC ACC ACC AGC TTC GTG CTG 1154
Gly Thr Gly Ala Ile Tyr Asp Val Lys Pro Thr Thr Ser Phe Val Leu
1190 1195 1200
AAC AAC GAC ACC ATC GCC ACC ATC ACC GCC AAG TCG AAT TCC ACC GCC 1202
Asn Asn Asp Thr Ile Ala Thr Ile Thr Ala Lys Ser Asn Ser Thr Ala
1205 1210 1215
CTG AAC ATC AGC CCC GGC GAG AGC TAC CCC AAG AAG GGC CAG AAC GGC 1250
Leu Asn Ile Ser Pro Gly Glu Ser Tyr Pro Lys Lys Gly Gln Asn Gly
1220 1225 1230 1235
ATC GCC ATC ACC AGC ATG GAC GAC TTC AAC AGC CAC CCC ATC ACC CTG 1298
Ile Ala Ile Thr Ser Met Asp Asp Phe Asn Ser His Pro Ile Thr Leu
1240 1245 1250
AAC AAG AAG CAG GTG GAC AAC CTG CTG AAC AAC AAG CCC ATG ATG CTG 1346
Asn Lys Lys Gln Val Asp Asn Leu Leu Asn Asn Lys Pro Met Met Leu
1255 1260 1265
GAG ACC AAC CAG ACC GAC GGC GTC TAC AAG ATC AAG GAC ACC CAC GGC 1394
Glu Thr Asn Gln Thr Asp Gly Val Tyr Lys Ile Lys Asp Thr His Gly
1270 1275 1280
AAC ATC GTG ACG GGC GGC GAG TGG AAC GGC GTG ATC CAG CAG ATC AAG 1442
Asn Ile Val Thr Gly Gly Glu Trp Asn Gly Val Ile Gln Gln Ile Lys
1285 1290 1295
GCC AAG ACC GCC AGC ATC ATC GTC GAC GAC GGC GAG CGC GTG GCC GAG 1490
Ala Lys Thr Ala Ser Ile Ile Val Asp Asp Gly Glu Arg Val Ala Glu
1300 1305 1310 1315
AAG CGC GTG GCC GCC AAG GAC TAC GAG AAC CCC GAG GAC AAG ACC CCC 1538
Lys Arg Val Ala Ala Lys Asp Tyr Glu Asn Pro Glu Asp Lys Thr Pro
1320 1325 1330
AGC CTG ACC CTG AAG GAC GCC CTG AAG CTG AGC TAC CCC GAC GAG ATC 1586
Ser Leu Thr Leu Lys Asp Ala Leu Lys Leu Ser Tyr Pro Asp Glu Ile
1335 1340 1345
AAG GAG ATC GAG GGC TTG CTG TAC TAC AAG AAC AAG CCC ATC TAC GAG 1634
Lys Glu Ile Glu Gly Leu Leu Tyr Tyr Lys Asn Lys Pro Ile Tyr Glu
1350 1355 1360
AGC AGC GTG ATG ACC TAT CTA GAC GAG AAC ACC GCC AAG GAG GTG ACC 1682
Ser Ser Val Met Thr Tyr Leu Asp Glu Asn Thr Ala Lys Glu Val Thr
1365 1370 1375
AAG CAG CTG AAC GAC ACC ACC GGC AAG TTC AAG GAC GTG AGC CAC CTG 1730
Lys Gln Leu Asn Asp Thr Thr Gly Lys Phe Lys Asp Val Ser His Leu
1380 1385 1390 1395
TAC GAC GTG AAG CTG ACC CCC AAG ATG AAC GTG ACC ATC AAG CTG AGC 1778
Tyr Asp Val Lys Leu Thr Pro Lys Met Asn Val Thr Ile Lys Leu Ser
1400 1405 1410
ATC CTG TAC GAC AAC GCC GAG AGC AAC GAC AAC AGC ATC GGC AAG TGG 1826
Ile Leu Tyr Asp Asn Ala Glu Ser Asn Asp Asn Ser Ile Gly Lys Trp
1415 1420 1425
ACC AAC ACC AAC ATC GTG AGC GGC GGC AAC AAC GGC AAG AAG CAG TAC 1874
Thr Asn Thr Asn Ile Val Ser Gly Gly Asn Asn Gly Lys Lys Gln Tyr
1430 1435 1440
AGC AGC AAC AAC CCC GAC GCC AAC CTG ACC CTG AAC ACC GAC GCC CAG 1922
Ser Ser Asn Asn Pro Asp Ala Asn Leu Thr Leu Asn Thr Asp Ala Gln
1445 1450 1455
GAG AAG CTG AAC AAG AAC CGC GAC TAC TAC ATC AGC CTG TAC ATG AAG 1970
Glu Lys Leu Asn Lys Asn Arg Asp Tyr Tyr Ile Ser Leu Tyr Met Lys
1460 1465 1470 1475
AGC GAG AAG AAC ACC CAG TGC GAG ATC ACC ATC GAC GGC GAG ATA TAC 2018
Ser Glu Lys Asn Thr Gln Cys Glu Ile Thr Ile Asp Gly Glu Ile Tyr
1480 1485 1490
CCC ATC ACC ACC AAG ACC GTG AAC GTG AAC AAG GAC AAC TAC AAG CGC 2066
Pro Ile Thr Thr Lys Thr Val Asn Val Asn Lys Asp Asn Tyr Lys Arg
1495 1500 1505
CTG GAC ATC ATC GCC CAC AAC ATC AAG AGC AAC CCC ATC AGC AGC CTG 2114
Leu Asp Ile Ile Ala His Asn Ile Lys Ser Asn Pro Ile Ser Ser Leu
1510 1515 1520
CAC ATC AAG ACC AAC GAC GAG ATC ACC CTG TTC TGG GAC GAC ATA TCG 2162
His Ile Lys Thr Asn Asp Glu Ile Thr Leu Phe Trp Asp Asp Ile Ser
1525 1530 1535
ATT ACC GAC GTC GCC AGC ATC AAG CCC GAG AAC CTG ACC GAC AGC GAG 2210
Ile Thr Asp Val Ala Ser Ile Lys Pro Glu Asn Leu Thr Asp Ser Glu
1540 1545 1550 1555
ATC AAG CAG ATA TAC AGT CGC TAC GGC ATC AAG CTG GAG GAC GGC ATC 2258
Ile Lys Gln Ile Tyr Ser Arg Tyr Gly Ile Lys Leu Glu Asp Gly Ile
1560 1565 1570
CTG ATC GAC AAG AAA GGC GGC ATC CAC TAC GGC GAG TTC ATC AAC GAG 2306
Leu Ile Asp Lys Lys Gly Gly Ile His Tyr Gly Glu Phe Ile Asn Glu
1575 1580 1585
GCC AGC TTC AAC ATC GAG CCC CTG CAG AAC TAC GTG ACC AAG TAC GAG 2354
Ala Ser Phe Asn Ile Glu Pro Leu Gln Asn Tyr Val Thr Lys Tyr Glu
1590 1595 1600
GTG ACC TAC AGC AGC GAG CTG GGC CCC AAC GTG AGC GAC ACC CTG GAG 2402
Val Thr Tyr Ser Ser Glu Leu Gly Pro Asn Val Ser Asp Thr Leu Glu
1605 1610 1615
AGC GAC AAG ATT TAC AAG GAC GGC ACC ATC AAG TTC GAC TTC ACC AAG 2450
Ser Asp Lys Ile Tyr Lys Asp Gly Thr Ile Lys Phe Asp Phe Thr Lys
1620 1625 1630 1635
TAC AGC AAG AAC GAG CAG GGC CTG TTC TAC GAC AGC GGC CIG AAC TGG 2498
Tyr Ser Lys Asn Glu Gln Gly Leu Phe Tyr Asp Ser Gly Leu Asn Trp
1640 1645 1650
GAC TTC AAG ATC AAC GCC ATC ACC TAC GAC GGC AAG GAG ATG AAC GTG 2546
Asp Phe Lys Ile Asn Ala Ile Thr Tyr Asp Gly Lys Glu Met Asn Val
1655 1660 1665
TTC CAC CGC TAC AAC AAG TAGATCTGAG CT 2576
Phe His Arg Tyr Asn Lys
1670
(2)关于SEQ ID NO:36的信息
(i)序列特征
(A)长度:852个氨基酸
(B)类型:氨基酸
(D)拓扑结构:线型
(ii)分子类型:蛋白质
(xi)序列描述:SEQ ID NO:36:
Met Lys Thr Asn Gln Ile Ser Thr Thr Gln Lys Asn Gln Gln Lys Glu
1 5 10 15
Met Asp Arg Lys Gly Leu Leu Gly Tyr Tyr Phe Lys Gly Lys Asp Phe
20 25 30
Ser Asn Leu Thr Met Phe Ala Pro Thr Arg Asp Ser Thr Leu Ile Tyr
35 40 45
Asp Gln Gln Thr Ala Asn Lys Leu Leu Asp Lys Lys Gln Gln Glu Tyr
50 55 60
Gln Ser Ile Arg Trp Ile Gly Leu Ile Gln Ser Lys Glu Thr Gly Asp
65 70 75 80
Phe Thr Phe Asn Leu Ser Glu Asp Glu Gln Ala Ile Ile Glu Ile Asn
85 90 95
Gly Lys Ile Ile Ser Asn Lys Gly Lys Glu Lys Gln Val Val His Leu
100 105 110
Glu Lys Gly Lys Leu Val Pro Ile Lys Ile Glu Tyr Gln Ser Asp Thr
115 120 125
Lys Phe Asn Ile Asp Ser Lys Thr Phe Lys Glu Leu Lys Leu Phe Lys
130 135 140
Ile Asp Ser Gln Asn Gln Pro Gln Gln Val Gln Gln Asp Glu Leu Arg
145 150 155 160
Asn Pro Glu Phe Asn Lys Lys Glu Ser Gln Glu Phe Leu Ala Lys Pro
165 170 175
Ser Lys Ile Asn Leu Phe Thr Gln Gln Met Lys Arg Glu Ile Asp Glu
180 185 190
Asp Thr Asp Thr Asp Gly Asp Ser Ile Pro Asp Leu Trp Glu Glu Asn
195 200 205
Gly Tyr Thr Ile Gln Asn Arg Ile Ala Val Lys Trp Asp Asp Ser Leu
210 215 220
Ala Ser Lys Gly Tyr Thr Lys Phe Val Ser Asn Pro Leu Glu Ser His
225 230 235 240
Thr Val Gly Asp Pro Tyr Thr Asp Tyr Glu Lys Ala Ala Arg Asp Leu
245 250 255
Asp Leu Ser Asn Ala Lys Glu Thr Phe Asn Pro Leu Val Ala Ala Phe
260 265 270
Pro Ser Val Asn Val Ser Met Glu Lys Val Ile Leu Ser Pro Asn Glu
275 280 285
Asn Leu Ser Asn Ser Val Glu Ser His Ser Ser Thr Asn Trp Ser Tyr
290 295 300
Thr Asn Thr Glu Gly Ala Ser Val Glu Ala Gly Ile Gly Pro Lys Gly
305 310 315 320
Ile Ser Phe Gly Val Ser Val Asn Tyr Gln His Ser Glu Thr Val Ala
325 330 335
Gln Glu Trp Gly Thr Ser Thr Gly Asn Thr Ser Gln Phe Asn Thr Ala
340 345 350
Ser Ala Gly Tyr Leu Asn Ala Asn Val Arg Tyr Asn Asn Val Gly Thr
355 360 365
Gly Ala Ile Tyr Asp Val Lys Pro Thr Thr Ser Phe Val Leu Asn Asn
370 375 380
Asp Thr Ile Ala Thr Ile Thr Ala Lys Ser Asn Ser Thr Ala Leu Asn
385 390 395 400
Ile Ser Pro Gly Glu Ser Tyr Pro Lys Lys Gly Gln Asn Gly Ile Ala
405 410 415
Ile Thr Ser Met Asp Asp Phe Asn Ser His Pro Ile Thr Leu Asn Lys
420 425 430
Lys Gln Val Asp Asn Leu Leu Asn Asn Lys Pro Met Met Leu Glu Thr
435 440 445
Asn Gln Thr Asp Gly Val Tyr Lys Ile Lys Asp Thr His Gly Asn Ile
450 455 460
Val Thr Gly Gly Glu Trp Asn Gly Val Ile Gln Gln Ile Lys Ala Lys
465 470 475 480
Thr Ala Ser Ile Ile Val Asp Asp Gly Glu Arg Val Ala Glu Lys Arg
485 490 495
Val Ala Ala Lys Asp Tyr Glu Asn Pro Glu Asp Lys Thr Pro Ser Leu
500 505 510
Thr Leu Lys Asp Ala Leu Lys Leu Ser Tyr Pro Asp Glu Ile Lys Glu
515 520 525
Ile Glu Gly Leu Leu Tyr Tyr Lys Asn Lys Pro Ile Tyr Glu Ser Ser
530 535 540
Val Met Thr Tyr Leu Asp Glu Asn Thr Ala Lys Glu Val Thr Lys Gln
545 550 555 560
Leu Asn Asp Thr Thr Gly Lys Phe Lys Asp Val Ser His Leu Tyr Asp
565 570 575
Val Lys Leu Thr Pro Lys Met Asn Val Thr Ile Lys Leu Ser Ile Leu
580 585 590
Tyr Asp Asn Ala Glu Ser Asn Asp Asn Ser Ile Gly Lys Trp Thr Asn
595 600 605
Thr Asn Ile Val Ser Gly Gly Asn Asn Gly Lys Lys Gln Tyr Ser Ser
610 615 620
Asn Asn Pro Asp Ala Asn Leu Thr Leu Asn Thr Asp Ala Gln Glu Lys
625 630 635 640
Leu Asn Lys Asn Arg Asp Tyr Tyr Ile Ser Leu Tyr Met Lys Ser Glu
645 650 655
Lys Asn Thr Gln Cys Glu Ile Thr Ile Asp Gly Glu Ile Tyr Pro Ile
660 665 670
Thr Thr Lys Thr Val Asn Val Asn Lys Asp Asn Tyr Lys Arg Leu Asp
675 680 685
Ile Ile Ala His Asn Ile Lys Ser Asn Pro Ile Ser Ser Leu His Ile
690 695 700
Lys Thr Asn Asp Glu Ile Thr Leu Phe Trp Asp Asp Ile Ser Ile Thr
705 710 715 720
Asp Val Ala Ser Ile Lys Pro Glu Asn Leu Thr Asp Ser Glu Ile Lys
725 730 735
Gln Ile Tyr Ser Arg Tyr Gly Ile Lys Leu Glu Asp Gly Ile Leu Ile
740 745 750
Asp Lys Lys Gly Gly Ile His Tyr Gly Glu Phe Ile Asn Glu Ala Ser
755 760 765
Phe Asn Ile Glu Pro Leu Gln Asn Tyr Val Thr Lys Tyr Glu Val Thr
770 775 780
Tyr Ser Ser Glu Leu Gly Pro Asn Val Ser Asp Thr Leu Glu Ser Asp
785 790 795 800
Lys Ile Tyr Lys Asp Gly Thr Ile Lys Phe Asp Phe Thr Lys Tyr Ser
805 810 815
Lys Asn Glu Gln Gly Leu Phe Tyr Asp Ser Gly Leu Asn Trp Asp Phe
820 825 830
Lys Ile Asn Ala Ile Thr Tyr Asp Gly Lys Glu Met Asn Val Phe His
835 840 845
Arg Tyr Asn Lys
850
(2)关于SEQ ID NO:37的信息
(i)序列特征
(A)长度:32个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:其它核酸
(A)描述:/desc=“用于制备pCIB5527的正向引物”
(iii)假拟:否
(xi)序列描述:SEQ ID NO:37:
GGATCCACCA TGCTGCAGAA CCTGAAGATC AC 32
(2)关于SEQ ID NO:38的信息
(i)序列特征
(A)长度:18个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:其它核酸
(A)描述:/desc=“用于制备pCIB5527的反向引物”
(iii)假拟:否
(xi)序列描述:SEQ ID NO:38:
AAGCTTCCAC TCCTTCTC 18
(2)关于SEQ ID NO:39的信息
(i)序列特征
(A)长度:1241个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:其它核酸
(A)描述:/desc=“合成的DNA”
(iii)假拟:否
(ix)特征:
(A)名称/键:CDS
(B)位置:9..1238
(D)其它信息:/备注=“编码VIP2A(a)和切出的芽孢杆菌属分泌信号的玉米最佳化DNA序列,包含在pCIB5527中。”
(xi)序列描述:SEQ ID NO:39:
GATCCACC ATG CTG CAG AAC CTG AAG ATC ACC GAC AAG GTG GAG GAC TTC 50
Met Leu Gln Asn Leu Lys Ile Thr Asp Lys Val Glu Asp Phe
855 860 865
AAG GAG GAC AAG GAG AAG GCC AAG GAG TGG GGC AAG GAG AAG GAG AAG 98
Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys Glu Lys Glu Lys
870 875 880
GAG TGG AAG CTT ACC GCC ACC GAG AAG GGC AAG ATG AAC AAC TTC CTG 146
Glu Trp Lys Leu Thr Ala Thr Glu Lys Gly Lys Met Asn Asn Phe Leu
885 890 895
GAC AAC AAG AAC GAC ATC AAG ACC AAC TAC AAG GAG ATC ACC TTC AGC 194
Asp Asn Lys Asn Asp Ile Lys Thr Asn Tyr Lys Glu Ile Thr Phe Ser
900 905 910
ATA GCC GGC AGC TTC GAG GAC GAG ATC AAG GAC CTG AAG GAG ATC GAC 242
Ile Ala Gly Ser Phe Glu Asp Glu Ile Lys Asp Leu Lys Glu Ile Asp
915 920 925 930
AAG ATG TTC GAC AAG ACC AAC CTG AGC AAC AGC ATC ATC ACC TAC AAG 290
Lys Met Phe Asp Lys Thr Asn Leu Ser Asn Ser Ile Ile Thr Tyr Lys
935 940 945
AAC GTG GAG CCC ACC ACC ATC GGC TTC AAC AAG AGC CTG ACC GAG GGC 338
Asn Val Glu Pro Thr Thr Ile Gly Phe Asn Lys Ser Leu Thr Glu Gly
950 955 960
AAC ACC ATC AAC AGC GAC GCC ATG GCC CAG TTC AAG GAG CAG TTC CTG 386
Asn Thr Ile Asn Ser Asp Ala Met Ala Gln Phe Lys Glu Gln Phe Leu
965 970 975
GAC CGC GAC ATC AAG TTC GAC AGC TAC CTG GAC ACC CAC CTG ACC GCC 434
Asp Arg Asp Ile Lys Phe Asp Ser Tyr Leu Asp Thr His Leu Thr Ala
980 985 990
CAG CAG GTG AGC AGC AAG GAG CGC GTG ATC CTG AAG GTG ACC GTC CCC 482
Gln Gln Val Ser Ser Lys Glu ArG Val Ile Leu Lys Val Thr Val Pro
995 1000 1005 1010
AGC GGC AAG GGC AGC ACC ACC CCC ACC AAG GCC GGC GTG ATC CTG AAC 530
Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val Ile Leu Asn
1015 1020 1025
AAC AGC GAG TAC AAG ATG CTG ATC GAC AAC GGC TAC ATG GTG CAC GTG 578
Asn Ser Glu Tyr Lys Met Leu Ile Asp Asn Gly Tyr Met Val His Val
1030 1035 1040
GAC AAG GTG AGC AAG GTG GTG AAG AAG GGC GTG GAG TGC CTC CAG ATC 626
Asp Lys Val Ser Lys Val Val Lys Lys Gly Val Glu Cys Leu Gln Ile
1045 1050 1055
GAG GGC ACC CTG AAG AAG AGT CTA GAC TTC AAG AAC GAC ATC AAC GCC 674
Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp Ile Asn Ala
1060 1065 1070
GAG GCC CAC AGC TGG GGC ATG AAG AAC TAC GAG GAG TGG GCC AAG GAC 722
Glu Ala His Ser Trp Gly Met Lys Asn Tyr Glu Glu Trp Ala Lys Asp
1075 1080 1085 1090
CTG ACC GAC AGC CAG CGC GAG GCC CTG GAC GGC TAC GCC CGC CAG GAC 770
Leu Thr Asp Ser Gln Arg Glu Ala Leu Asp Gly Tyr Ala Arg Gln Asp
1095 1100 1105
TAC AAG GAG ATC AAC AAC TAC CTG CGC AAC CAG GGC GGC AGC GGC AAC 818
Tyr Lys Glu Ile Asn Asn Tyr Leu Arg Asn Gln Gly Gly Ser Gly Asn
1110 1115 1120
GAG AAG CTG GAC GCC CAG ATC AAG AAC ATC AGC GAC GCC CTG GGC AAG 866
Glu Lys Leu Asp Ala Gln Ile Lys Asn Ile Ser Asp Ala Leu Gly Lys
1125 1130 1135
AAG CCC ATC CCC GAG AAC ATC ACC GTG TAC CGC TGG TGC GGC ATG CCC 914
Lys Pro Ile Pro Glu Asn Ile Thr Val Tyr Arg Trp Cys Gly Met Pro
1140 1145 1150
GAG TTC GGC TAC CAG ATC AGC GAC CCC CTG CCC AGC CTG AAG GAC TTC 962
Glu Phe Gly Tyr Gln Ile Ser Asp Pro Leu Pro Ser Leu Lys Asp Phe
1155 1160 1165 1170
GAG GAG CAG TTC CTG AAC ACC ATC AAG GAG GAC AAG GGC TAC ATG AGC 1010
Glu Glu Gln Phe Leu Asn Thr Ile Lys Glu Asp Lys Gly Tyr Met Ser
1175 1180 1185
ACC AGC CTG AGC AGC GAG CGC CTG GCC GCC TTC GGC AGC CGC AAG ATC 1058
Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Arg Lys Ile
1190 1195 1200
ATC CTG CGC CTG CAG GTG CCC AAG GGC AGC ACT GGT GCC TAC CTG AGC 1106
Ile Leu Arg Leu Gln Val Pro Lys Gly Ser Thr Gly Ala Tyr Leu Ser
1205 1210 1215
GCC ATC GGC GGC TTC GCC AGC GAG AAG GAG ATC CTG CTG GAT AAG GAC 1154
Ala Ile Gly Gly Phe Ala Ser Glu Lys Glu Ile Leu Leu Asp Lys Asp
1220 1225 1230
AGC AAG TAC CAC ATC GAC AAG GTG ACC GAG GTG ATC ATC AAG GGC GTG 1202
Ser Lys Tyr His Ile Asp Lys Val Thr Glu Val Ile Ile Lys Gly Val
1235 1240 1245 1250
AAG CGC TAC GTG GTG GAC GCC ACC CTG CTG ACC AAC TAG 1241
Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn
1255 1260
(2)关于SEQ ID NO:40的信息
(i)序列特征
(A)长度:410个氨基酸
(B)类型:氨基酸
(D)拓扑结构:线型
(ii)分子类型:蛋白质
(xi)序列描述:SEQ ID NO:40:
Met Leu Gln Asn Leu Lys Ile Thr Asp Lys Val Glu Asp Phe Lys Glu
1 5 10 15
Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys Glu Lys Glu Lys Glu Trp
20 25 30
Lys Leu Thr Ala Thr Glu Lys Gly Lys Met Asn Asn Phe Leu Asp Asn
35 40 45
Lys Asn Asp Ile Lys Thr Asn Tyr Lys Glu Ile Thr Phe Ser Ile Ala
50 55 60
Gly Ser Phe Glu Asp Glu Ile Lys Asp Leu Lys Glu Ile Asp Lys Met
65 70 75 80
Phe Asp Lys Thr Asn Leu Ser Asn Ser Ile Ile Thr Tyr Lys Asn Val
85 90 95
Glu Pro Thr Thr Ile Gly Phe Asn Lys Ser Leu Thr Glu Gly Asn Thr
100 105 110
Ile Asn Ser Asp Ala Met Ala Gln Phe Lys Glu Gln Phe Leu Asp Arg
115 120 125
Asp Ile Lys Phe Asp Ser Tyr Leu Asp Thr His Leu Thr Ala Gln Gln
130 135 140
Val Ser Ser Lys Glu Arg Val Ile Leu Lys Val Thr Val Pro Ser Gly
145 150 155 160
Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val Ile Leu Asn Asn Ser
165 170 175
Glu Tyr Lys Met Leu Ile Asp Asn Gly Tyr Met Val His Val Asp Lys
180 185 190
Val Ser Lys Val Val Lys Lys Gly Val Glu Cys Leu Gln Ile Glu Gly
195 200 205
Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp Ile Asn Ala Glu Ala
210 215 220
His Ser Trp Gly Met Lys Asn Tyr Glu Glu Trp Ala Lys Asp Leu Thr
225 230 235 240
Asp Ser Gln Arg Glu Ala Leu Asp Gly Tyr Ala Arg Gln Asp Tyr Lys
245 250 255
Glu Ile Asn Asn Tyr Leu Arg Asn Gln Gly Gly Ser Gly Asn Glu Lys
260 265 270
Leu Asp Ala Gln Ile Lys Asn Ile Ser Asp Ala Leu Gly Lys Lys Pro
275 280 285
Ile Pro Glu Asn Ile Thr Val Tyr Arg Trp Cys Gly Met Pro Glu Phe
290 295 300
Gly Tyr Gln Ile Ser Asp Pro Leu Pro Ser Leu Lys Asp Phe Glu Glu
305 310 315 320
Gln Phe Leu Asn Thr Ile Lys Glu Asp Lys Gly Tyr Met Ser Thr Ser
325 330 335
Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Arg Lys Ile Ile Leu
340 345 350
Arg Leu Gln Val Pro Lys Gly Ser Thr Gly Ala Tyr Leu Ser Ala Ile
355 360 365
Gly Gly Phe Ala Ser Glu Lys Glu Ile Leu Leu Asp Lys Asp Ser Lys
370 375 380
Tyr His Ile Asp Lys Val Thr Glu Val Ile Ile Lys Gly Val Lys Arg
385 390 395 400
Tyr Val Val Asp Ala Thr Leu Leu Thr Asn
405 410
(2)关于SEQ ID NO:41的信息
(i)序列特征
(A)长度:72个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:其它核酸
(A)描述:/desc=“用于构建pCIB5527的编码真核生物分泌信号的寡核苷酸”
(iii)假拟:否
(xi)序列描述:SEQ ID NO:41:
GGATCCACCA TGGGCTGGAG CTGGATCTTC CTGTTCCTGC TGAGCGGCGC CGCGGGCGTG 60
CACTGCCTGC AG 72
(2)关于SEQ ID NO:42的信息
(i)序列特征
(A)长度:1241个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:其它核酸
(A)描述:/desc=“合成的DNA”
(iii)假拟:否
(ix)特征:
(A)名称/键:CDS
(B)位置:9..1238
(D)其它信息:/备注=“编码VIP2A(a)和切出的芽孢杆菌属分泌信号及插入的真核生物分泌信号的玉米最佳化DNA序列,包含于pCIB5528中。”
(xi)序列描述:SEQ ID NO 42:
GATCCACC ATG CTG CAG AAC CTG AAG ATC ACC GAC AAG GTG GAG GAC TTC 50
Met Leu Gln Asn Leu Lys Ile Thr Asp Lys Val Glu Asp Phe
415 420
AAG GAG GAC AAG GAG AAG GCC AAG GAG TGG GGC AAG GAG AAG GAG AAG 98
Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys Glu Lys Glu Lys
425 430 435 440
GAG TGG AAG CTT ACC GCC ACC GAG AAG GGC AAG ATG AAC AAC TTC CTG 146
Glu Trp Lys Leu Thr Ala Thr Glu Lys Gly Lys Met Asn Asn Phe Leu
445 450 455
GAC AAC AAG AAC GAC ATC AAG ACC AAC TAC AAG GAG ATC ACC TTC AGC 194
Asp Asn Lys Asn Asp Ile Lys Thr Asn Tyr Lys Glu Ile Thr Phe Ser
460 465 470
ATA GCC GGC AGC TTC GAG GAC GAG ATC AAG GAC CTG AAG GAG ATC GAC 242
Ile Ala Gly Ser Phe Glu Asp Glu Ile Lys Asp Leu Lys Glu Ile Asp
475 480 485
AAG ATG TTC GAC AAG ACC AAC CTG AGC AAC AGC ATC ATC ACC TAC AAG 290
Lys Met Phe Asp Lys Thr Asn Leu Ser Asn Ser Ile Ile Thr Tyr Lys
490 495 500
AAC GTG GAG CCC ACC ACC ATC GGC TTC AAC AAG AGC CTG ACC GAG GGC 338
Asn Val Glu Pro Thr Thr Ile Gly Phe Asn Lys Ser Leu Thr Glu Gly
505 510 515 520
AAC ACC ATC AAC AGC GAC GCC ATG GCC CAG TTC AAG GAG CAG TTC CTG 386
Asn Thr Ile Asn Ser Asp Ala Met Ala Gln Phe Lys Glu Gln Phe Leu
525 530 535
GAC CGC GAC ATC AAG TTC GAC AGC TAC CTG GAC ACC CAC CTG ACC GCC 434
Asp Arg Asp Ile Lys Phe Asp Ser Tyr Leu Asp Thr His Leu Thr Ala
540 545 550
CAG CAG GTG AGC AGC AAG GAG CGC GTG ATC CTG AAG GTG ACC GTC CCC 482
Gln Gln Val Ser Ser Lys Glu Arg Val Ile Leu Lys Val Thr Val Pro
555 560 565
AGC GGC AAG GGC AGC ACC ACC CCC ACC AAG GCC GGC GTG ATC CTG AAC 530
Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val Ile Leu Asn
570 575 580
AAC AGC GAG TAC AAG ATG CTG ATC GAC AAC GGC TAC ATG GTG CAC GTG 578
Asn Ser Glu Tyr Lys Met Leu Ile Asp Asn Gly Tyr Met Val His Val
585 590 595 600
GAC AAG GTG AGC AAG GTG GTG AAG AAG GGC GTG GAG TGC CTC CAG ATC 626
Asp Lys Val Ser Lys Val Val Lys Lys Gly Val Glu Cys Leu Gln Ile
605 610 615
GAG GGC ACC CTG AAG AAG AGT CTA GAC TTC AAG AAC GAC ATC AAC GCC 674
Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp Ile Asn Ala
620 625 630
GAG GCC CAC AGC TGG GGC ATG AAG AAC TAC GAG GAG TGG GCC AAG GAC 722
Glu Ala His Ser Trp Gly Met Lys Asn Tyr Glu Glu Trp Ala Lys Asp
635 640 645
CTG ACC GAC AGC CAG CGC GAG GCC CTG GAC GGC TAC GCC CGC CAG GAC 770
Leu Thr Asp Ser Gln Arg Glu Ala Leu Asp Gly Tyr Ala Arg Gln Asp
650 655 660
TAC AAG GAG ATC AAC AAC TAC CTG CGC AAC CAG GGC GGC AGC GGC AAC 818
Tyr Lys Glu Ile Asn Asn Tyr Leu Arg Asn Gln Gly Gly Ser Gly Asn
665 670 675 680
GAG AAG CTG GAC GCC CAG ATC AAG AAC ATC AGC GAC GCC CTG GGC AAG 866
Glu Lys Leu Asp Ala Gln Ile Lys Asn Ile Ser Asp Ala Leu Gly Lys
685 690 695
AAG CCC ATC CCC GAG AAC ATC ACC GTG TAC CGC TGG TGC GGC ATG CCC 914
Lys Pro Ile Pro Glu Asn Ile Thr Val Tyr Arg Trp Cys Gly Met Pro
700 705 710
GAG TTC GGC TAC CAG ATC AGC GAC CCC CTG CCC AGC CTG AAG GAC TTC 962
Glu Phe Gly Tyr Gln Ile Ser Asp Pro Leu Pro Ser Leu Lys Asp Phe
715 720 725
GAG GAG CAG TTC CTG AAC ACC ATC AAG GAG GAC AAG GGC TAC ATG AGC 1010
Glu Glu Gln Phe Leu Asn Thr Ile Lys Glu Asp Lys Gly Tyr Met Ser
730 735 740
ACC AGC CTG AGC AGC GAG CGC CTG GCC GCC TTC GGC AGC CGC AAG ATC 1058
Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Arg Lys Ile
745 750 755 760
ATC CTG CGC CTG CAG GTG CCC AAG GGC AGC ACT GGT GCC TAC CTG AGC 1106
Ile Leu Arg Leu Gln Val Pro Lys Gly Ser Thr Gly Ala Tyr Leu eer
765 770 775
GCC ATC GGC GGC TTC GCC AGC GAG AAG GAG ATC CTG CTG GAT AAG GAC 1154
Ala Ile Gly Gly Phe Ala Ser Glu Lys Glu Ile Leu Leu Asp Lys Asp
780 785 790
AGC AAG TAC CAC ATC GAC AAG GTG ACC GAG GTG ATC ATC AAG GGC GTG 1202
Ser Lys Tyr His Ile Asp Lys Val Thr Glu Val Ile Ile Lys Gly Val
795 800 805
AAG CGC TAC GTG GTG GAC GCC ACC CTG CTG ACC AAC TAG 1241
Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn
810 815 820
(2)关于SEQ ID NO:43的信息
(i)序列特征
(A)长度:410个氨基酸
(B)类型:氨基酸
(D)拓扑结构:线型
(ii)分子类型:蛋白质
(xi)序列描述:SEQ ID NO:43:
Met Leu Gln Asn Leu Lys Ile Thr Asp Lys Val Glu Asp Phe Lys Glu
1 5 10 15
Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys Glu Lys Glu Lys Glu Trp
20 25 30
Lys Leu Thr Ala Thr Glu Lys Gly Lys Met Asn Asn Phe Leu Asp Asn
35 40 45
Lys Asn Asp Ile Lys Thr Asn Tyr Lys Glu Ile Thr Phe Ser Ile Ala
50 55 60
Gly Ser Phe Glu Asp Glu Ile Lys Asp Leu Lys Glu Ile Asp Lys Met
65 70 75 80
Phe Asp Lys Thr Asn Leu Ser Asn Ser Ile Ile Thr Tyr Lys Asn Val
85 90 95
Glu Pro Thr Thr Ile Gly Phe Asn Lys Ser Leu Thr Glu Gly Asn Thr
100 105 110
Ile Asn Ser Asp Ala Met Ala Gln Phe Lys Glu Gln Phe Leu Asp Arg
115 120 125
Asp Ile Lys Phe Asp Ser Tyr Leu Asp Thr His Leu Thr Ala Gln Gln
130 135 140
Val Ser Ser Lys Glu Arg Val Ile Leu Lys Val Thr Val Pro Ser Gly
145 150 155 160
Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val Ile Leu Asn Asn Ser
165 170 175
Glu Tyr Lys Met Leu Ile Asp Asn Gly Tyr Met Val His Val Asp Lys
180 185 190
Val Ser Lys Val Val Lys Lys Gly Val Glu Cys Leu Gln Ile Glu Gly
195 200 205
Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp Ile Asn Ala Glu Ala
210 215 220
His Ser Trp Gly Met Lys Asn Tyr Glu Glu Trp Ala Lys Asp Leu Thr
225 230 235 240
Asp Ser Gln Arg Glu Ala Leu Asp Gly Tyr Ala Arg Gln Asp Tyr Lys
245 250 255
Glu Ile Asn Asn Tyr Leu Arg Asn Gln Gly Gly Ser Gly Asn Glu Lys
260 265 270
Leu Asp Ala Gln Ile Lys Asn Ile Ser Asp Ala Leu Gly Lys Lys Pro
275 280 285
Ile Pro Glu Asn Ile Thr Val Tyr Arg Trp Cys Gly Met Pro Glu Phe
290 295 300
Gly Tyr Gln Ile Ser Asp Pro Leu Pro Ser Leu Lys Asp Phe Glu Glu
305 310 315 320
Gln Phe Leu Asn Thr Ile Lys Glu Asp Lys Gly Tyr Met Ser Thr Ser
325 330 335
Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Arg Lys Ile Ile Leu
340 345 350
Arg Leu Gln Val Pro Lys Gly Ser Thr Gly Ala Tyr Leu Ser Ala Ile
355 360 365
Gly Gly Phe Ala Ser Glu Lys Glu Ile Leu Leu Asp Lys Asp Ser Lys
370 375 380
Tyr His Ile Asp Lys Val Thr Glu Val Ile Ile Lys Gly Val Lys Arg
385 390 395 400
Tyr Val Val Asp Ala Thr Leu Leu Thr Asn
405 410
(2)关于SEQ ID NO:44的信息
(i)序列特征
(A)长度:86个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:其它核酸
(A)描述:/desc=“用于构建pCIB5533的编码液泡导向肽的寡核苷酸”
(iii)假拟:否
(xi)序列描述:SEQ ID NO:44:
CCGCGGGCGT GCACTGCCTC AGCAGCAGCA GCTTTCGCCGA CAGCACCCC ATCCGCGTGA 60
CCGACCGCGC CGCCAGCACC CTGCAG 86
(2)关于SEQ ID NO:45的信息
(i)序列特征
(A)长度:1358个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:其它核酸
(A)描述:/desc=“合成的DNA”
(iii)假拟:否
(ix)特征:
(A)名称/键:CDS
(B)位置:9..1335
(D)其它信息:/备注=“编码VIP2A(a)和切出的芽孢杆菌分泌信号及插入的液泡导向肽的玉米最佳化DNA序列,包含于pCIB5533中。”
(xi)序列描述:SEQ ID NO:45:
GATCCACC ATG GGC TGG AGC TGG ATC TTC CTG TTC CTG CTG AGC GGC GCC 50
Met Gly Trp Ser Trp Ile Phe Leu Phe Leu Leu Ser Gly Ala
415 420
GCG GGC GTG CAC TGC CTC AGC AGC AGC AGC TTC GCC GAC AGC AAC CCC 98
Ala Gly Val His Cys Leu Ser Ser Ser Ser Phe Ala Asp Ser Asn Pro
425 430 435 440
ATC CGC GTG ACC GAC CGC GCC GCC AGC ACC CTG CAG AAC CTG AAG ATC 146
Ile Arg Val Thr Asp Arg Ala Ala Ser Thr Leu Gln Asn Leu Lys Ile
445 450 455
ACC GAC AAG GTG GAG GAC TTC AAG GAG GAC AAG GAG AAG GCC AAG GAG 194
Thr Asp Lys Val Glu Asp Phe Lys Glu Asp Lys Glu Lys Ala Lys Glu
460 465 470
TGG GGC AAG GAG AAG GAG AAG GAG TGG AAG CTT ACC GCC ACC GAG AAG 242
Trp Gly Lys Glu Lys Glu Lys Glu Trp Lys Leu Thr Ala Thr Glu Lys
475 480 485
GGC AAG ATG AAC AAC TTC CTG GAC AAC AAG AAC GAC ATC AAG ACC AAC 290
Gly Lys Met Asn Asn Phe Leu Asp Asn Lys Asn Asp Ile Lys Thr Asn
490 495 500
TAC AAG GAG ATC ACC TTC AGC ATA GCC GGC AGC TTC GAG GAC GAG ATC 338
Tyr Lys Glu Ile Thr Phe Ser Ile Ala Gly Ser Phe Glu Asp Glu Ile
505 510 515 520
AAG GAC CTG AAG GAG ATC GAC AAG ATG TTC GAC AAG ACC AAC CTG AGC 386
Lys Asp Leu Lys Glu Ile Asp Lys Met Phe Asp Lys Thr Asn Leu Ser
525 530 535
AAC AGC ATC ATC ACC TAC AAG AAC GTG GAG CCC ACC ACC ATC GGC TTC 434
Asn Ser Ile Ile Thr Tyr Lys Asn Val Glu Pro Thr Thr Ile Gly Phe
540 545 550
AAC AAG AGC CTG ACC GAG GGC AAC ACC ATC AAC AGC GAC GCC ATG GCC 482
Asn Lys Ser Leu Thr Glu Gly Asn Thr Ile Asn Ser Asp Ala Met Ala
555 560 565
CAG TTC AAG GAG CAG TTC CTG GAC CGC GAC ATC AAG TTC GAC AGC TAC 530
Gln Phe Lys Glu Gln Phe Leu Asp Arg Asp Ile Lys Phe Asp Ser Tyr
570 575 580
CTG GAC ACC CAC CTG ACC GCC CAG CAG GTG AGC AGC AAG GAG CGC GTG 578
Leu Asp Thr His Leu Thr Ala Gln Gln Val Ser Ser Lys Glu Arg Val
585 590 595 600
ATC CTG AAG GTG ACC GTC CCC AGC GGC AAG GGC AGC ACC ACC CCC ACC 626
Ile Leu Lys Val Thr Val Pro Ser Gly Lys Gly Ser Thr Thr Pro Thr
605 610 615
AAG GCC GGC GTG ATC CTG AAC AAC AGC GAG TAC AAG ATG CTG ATC GAC 674
Lys Ala Gly Val Ile Leu Asn Asn Ser Glu Tyr Lys Met Leu Ile Asp
620 625 630
AAC GGC TAC ATG GTG CAC GTG GAC AAG GTG AGC AAG GTG GTG AAG AAG 722
Asn Gly Tyr Met Val His Val Asp Lys Val Ser Lys Val Val Lys Lys
635 640 645
GGC GTG GAG TGC CTC CAG ATC GAG GGC ACC CTG AAG AAG AGT CTA CAC 770
Gly Val Glu Cys Leu Gln Ile Glu Gly Thr Leu Lys Lys Ser Leu Asp
650 655 660
TTC AAG AAC GAC ATC AAC GCC GAG GCC CAC AGC TGG GGC ATG AAG AAC 818
Phe Lys Asn Asp Ile Asn Ala Glu Ala His Ser Trp Gly Met Lys Asn
665 670 675 680
TAC GAG GAG TGG GCC AAG GAC CTG ACC GAC AGC CAG CGC GAG GCC CTG 866
Tyr Glu Glu Trp Ala Lys Asp Leu Thr Asp Ser Gln Arg Glu Ala Leu
685 690 695
GAC GGC TAC GCC CGC CAG GAC TAC AAG GAG ATC AAC AAC TAC CTG CGC 914
Asp Gly Tyr Ala Arg Gln Asp Tyr Lys Glu Ile Asn Asn Tyr Leu Arg
700 705 710
AAC CAG GGC GGC AGC GGC AAC GAG AAG CTG GAC GCC CAG ATC AAG AAC 962
Asn Gln Gly Gly Ser Gly Asn Glu Lys Leu Asp Ala Gln Ile Lys Asn
715 720 725
ATC AGC GAC GCC CTG GGC AAG AAG CCC ATC CCC GAG AAC ATC ACC GTG 1010
Ile Ser Asp Ala Leu Gly Lys Lys Pro Ile Pro Glu Asn Ile Thr Val
730 735 740
TAC CGC TGG TGC GGC ATG CCC GAG TTC GGC TAC CAG ATC AGC GAC CCC 1058
Tyr Arg Trp Cys Gly Met Pro Glu Phe Gly Tyr Gln Ile Ser Asp Pro
745 750 755 760
CTG CCC AGC CTG AAG GAC TTC GAG GAG CAG TTC CTG AAC ACC ATC AAG 1106
Leu Pro Ser Leu Lys Asp Phe Glu Glu Gln Phe Leu Asn Thr Ile Lys
765 770 775
GAG GAC AAG GGC TAC ATG AGC ACC AGC CTG AGC AGC GAG CGC CTG GCC 1154
Glu Asp Lys Gly Tyr Met Ser Thr Ser Leu Ser Ser Glu Arg Leu Ala
780 785 790
GCC TTC GGC AGC CGC AAG ATC ATC CTG CGC CTG CAG GTG CCC AAG GGC 1202
Ala Phe Gly Ser Arg Lys Ile Ile Leu Arg Leu Gln Val Pro Lys Gly
795 800 805
AGC ACT GGT GCC TAC CTG AGC GCC ATC GGC GGC TTC GCC AGC GAG AAG 1250
Ser Thr Gly Ala Tyr Leu Ser Ala Ile Gly Gly Phe Ala Ser Glu Lys
810 815 820
GAG ATC CTG CTG GAT AAG GAC AGC AAG TAC CAC ATC GAC AAG GTG ACC 1298
Glu Ile Leu Leu Asp Lys Asp Ser Lys Tyr His Ile Asp Lys Val Thr
825 830 835 840
GAG GTG ATC ATC AAG GGC GTG AAG CGC TAC GTG GTG GAC GCC ACC CTG 1346
Glu Val Ile Ile Lys Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu
845 850 855
CTG ACC AAC TAG 1358
Leu Thr Asn
(2)关于SEQ ID NO:46的信息
(i)序列特征
(A)长度:449个氨基酸
(B)类型:氨基酸
(D)拓扑结构:线型
(ii)分子类型:蛋白质
(xi)序列描述:SEQ ID NO:46:
Met Gly Trp Ser Trp Ile Phe Leu Phe Leu Leu Ser Gly Ala Ala Gly
1 5 10 15
Val His Cys Leu Ser Ser Ser Ser Phe Ala Asp Ser Asn Pro Ile Arg
20 25 30
Val Thr Asp Arg Ala Ala Ser Thr Leu Gln Asn Leu Lys Ile Thr Asp
35 40 45
Lys Val Glu Asp Phe Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly
50 55 60
Lys Glu Lys Glu Lys Glu Trp Lys Leu Thr Ala Thr Glu Lys Gly Lys
65 70 75 80
Met Asn Asn Phe Leu Asp Asn Lys Asn Asp Ile Lys Thr Asn Tyr Lys
85 90 95
Glu Ile Thr Phe Ser Ile Ala Gly Ser Phe Glu Asp Glu Ile Lys Asp
100 105 110
Leu Lys Glu Ile Asp Lys Met Phe Asp Lys Thr Asn Leu Ser Asn Ser
115 120 125
Ile Ile Thr Tyr Lys Asn Val Glu Pro Thr Thr Ile Gly Phe Asn Lys
130 135 140
Ser Leu Thr Glu Gly Asn Thr Ile Asn Ser Asp Ala Met Ala Gln Phe
145 150 155 160
Lys Glu Gln Phe Leu Asp Arg Asp Ile Lys Phe Asp Ser Tyr Leu Asp
165 170 175
Thr His Leu Thr Ala Gln Gln Val Ser Ser Lys Glu Arg Val Ile Leu
180 185 190
Lys Val Thr Val Pro Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala
195 200 205
Gly Val Ile Leu Asn Asn Ser Glu Tyr Lys Met Leu Ile Asp Asn Gly
210 215 220
Tyr Met Val His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Val
225 230 235 240
Glu Cys Leu Gln Ile Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys
245 250 255
Asn Asp Ile Asn Ala Glu Ala His Ser Trp Gly Met Lys Asn Tyr Glu
260 265 270
Glu Trp Ala Lys Asp Leu Thr Asp Ser Gln Arg Glu Ala Leu Asp Gly
275 280 285
Tyr Ala Arg Gln Asp Tyr Lys Glu Ile Asn Asn Tyr Leu Arg Asn Gln
290 295 300
Gly Gly Ser Gly Asn Glu Lys Leu Asp Ala Gln Ile Lys Asn Ile Ser
305 310 315 320
Asp Ala Leu Gly Lys Lys Pro Ile Pro Glu Asn Ile Thr Val Tyr Arg
325 330 335
Trp Cys Gly Met Pro Glu Phe Gly Tyr Gln Ile Ser Asp Pro Leu Pro
340 345 350
Ser Leu Lys Asp Phe Glu Glu Gln Phe Leu Asn Thr Ile Lys Glu Asp
355 360 365
Lys Gly Tyr Met Ser Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe
370 375 380
Gly Ser Arg Lys Ile Ile Leu Arg Leu Gln Val Pro Lys Gly Ser Thr
385 390 395 400
Gly Ala Tyr Leu Ser Ala Ile Gly Gly Phe Ala Ser Glu Lys Glu Ile
405 410 415
Leu Leu Asp Lys Asp Ser Lys Tyr His Ile Asp Lys Val Thr Glu Val
420 425 430
Ile Ile Lys Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr
435 440 445
Asn
(2)关于SEQ ID NO:47的信息
(i)序列特征
(A)长度:16个氨基酸
(B)类型:氨基酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:肽
(iii)假拟:否
(ix)特征:
(A)名称/键:肽
(B)位置:1..16
(D)其它信息:/备注=“用于构建pCIB5533,融合VIP1A(a)和VIP2A(a)”的连接肽
(xi)序列描述:SEQ ID NO:47:
Pro Ser Thr Pro Pro Thr Pro Ser Pro Ser Thr Pro Pro Thr Pro Ser
1 5 10 15
(2)关于SEQ ID NO:48的信息
(i)序列特征
(A)长度:66个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:其它核酸
(A)描述:/desc=“用于构建pCIB553的连接肽DNA”
(iii)假拟:否
(xi)序列描述:SEQ ID NO:48:
CCCGGGCCTT CTACTCCCCC AACTCCCTCT CCTAGCACGC CTCCGACACC TAGCGATATC 60
GGATCC 66
(2)关于SEQ ID NO:49的信息
(i)序列特征
(A)长度:4031个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:其它核酸
(A)描述:/dese=“合成的DNA”
(iii)假拟:否
(ix)特征:
(A)名称/键:CDS
(B)位置:6..4019
(D)其它信息:/备注=“编码VIP2A(a)-VIP1A(a)融合蛋白的玉米最佳化DNA序列,包含于pCIB5531中。”
(xi)序列描述:SEQ ID NO:49:
GATCC ATG AAG CGC ATG GAG GGC AAG CTG TTC ATG GTG AGC AAG AAG 47
Met Lys Arg Met Glu Gly Lys Leu Phe Met Val Ser Lys Lys
450 455 460
CTC CAG GTG GTG ACC AAG ACC GTG CTG CTG AGC ACC GTG TTC AGC ATC 95
Leu Gln Val Val Thr Lys Thr Val Leu Leu Ser Thr Val Phe Ser Ile
465 470 475
AGC CTG CTG AAC AAC GAG GTG ATC AAG GCC GAG CAG CTG AAC ATC AAC 143
Ser Leu Leu Asn Asn Glu Val Ile Lys Ala Glu Gln Leu Asn Ile Asn
480 485 490 495
AGC CAG AGC AAG TAC ACC AAC CTC CAG AAC CTG AAG ATC ACC GAC AAG 191
Ser Gln Ser Lys Tyr Thr Asn Leu Gln Asn Leu Lys Ile Thr Asp Lys
500 505 510
GTG GAG GAC TTC AAG GAG GAC AAG GAG AAG GCC AAG GAG TGG GGC AAG 239
Val Glu Asp Phe Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys
515 520 525
GAG AAG GAG AAG GAG TGG AAG CTT ACC GCC ACC GAG AAG GGC AAG ATG 287
Glu Lys Glu Lys Glu Trp Lys Leu Thr Ala Thr Glu Lys Gly Lys Met
530 535 540
AAC AAC TTC CTG GAC AAC AAG AAC GAC ATC AAG ACC AAC TAC AAG GAG 335
Asn Asn Phe Leu Asp Asn Lys Asn Asp Ile Lys Thr Asn Tyr Lys Glu
545 550 555
ATC ACC TTC AGC ATA GCC GGC AGC TTC GAG GAC GAG ATC AAG GAC CTG 383
Ile Thr Phe Ser Ile Ala Gly Ser Phe Glu Asp Glu Ile Lys Asp Leu
560 565 570 575
AAG GAG ATC GAC AAG ATG TTC GAC AAG ACC AAC CTG AGC AAC AGC ATC 431
Lys Glu Ile Asp Lys Met Phe Asp Lys Thr Asn Leu Ser Asn Ser Ile
580 585 590
ATC ACC TAC AAG AAC GTG GAG CCC ACC ACC ATC GGC TTC AAC AAG AGC 479
Ile Thr Tyr Lys Asn Val Glu Pro Thr Thr Ile Gly Phe Asn Lys Ser
595 600 605
CTG ACC GAG GGC AAC ACC ATC AAC AGC GAC GCC ATG GCC CAG TTC AAG 527
Leu Thr Glu Gly Asn Thr Ile Asn Ser Asp Ala Met Ala Gln Phe Lys
610 615 620
GAG CAG TTC CTG GAC CGC GAC ATC AAG TTC GAC AGC TAC CTG GAC ACC 575
Glu Gln Phe Leu Asp Arg Asp Ile Lys Phe Asp Ser Tyr Leu Asp Thr
625 630 635
CAC CTG ACC GCC CAG CAG GTG AGC AGC AAG GAG CGC GTG ATC CTG AAG 623
His Leu Thr Ala Gln Gln Val Ser Ser Lys Glu Arg Val Ile Leu Lys
640 645 650 655
GTG ACC GTC CCC AGC GGC AAG GGC AGC ACC ACC CCC ACC AAG GCC GGC 671
Val Thr Val Pro Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly
660 665 670
GTG ATC CTG AAC AAC AGC GAG TAC AAG ATG CTG ATC GAC AAC GGC TAC 719
Val Ile Leu Asn Asn Ser Glu Tyr Lys Met Leu Ile Asp Asn Gly Tyr
675 680 685
ATG GTG CAC GTG GAC AAG GTG AGC AAG GTG GTG AAG AAG GGC GTG GAG 767
Met Val His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Val Glu
690 695 700
TGC CTC CAG ATC GAG GGC ACC CTG AAG AAG AGT CTA GAC TTC AAG AAC 815
Cys Leu Gln Ile Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn
705 710 715
GAC ATC AAC GCC GAG GCC CAC AGC TGG GGC ATG AAG AAC TAC GAG GAG 863
Asp Ile Asn Ala Glu Ala His Ser Trp Gly Met Lys Asn Tyr Glu Glu
720 725 730 735
TGG GCC AAG GAC CTG ACC GAC AGC CAG CGC GAG GCC CTG GAC GGC TAC 911
Trp Ala Lys Asp Leu Thr Asp Ser Gln Arg Glu Ala Leu Asp Gly Tyr
740 745 750
GCC CGC CAG GAC TAC AAG GAG ATC AAC AAC TAC CTG CGC AAC CAG GGC 959
Ala Arg Gln Asp Tyr Lys Glu Ile Asn Asn Tyr Leu Arg Asn Gln Gly
755 760 765
GGC AGC GGC AAC GAG AAG CTG GAC GCC CAG ATC AAG AAC ATC AGC GAC 1007
Gly Ser Gly Asn Glu Lys Leu Asp Ala Gln Ile Lys Asn Ile Ser Asp
770 775 780
GCC CTG GGC AAG AAG CCC ATC CCC GAG AAC ATC ACC GTG TAC CGC TGG 1055
Ala Leu Gly Lys Lys Pro Ile Pro Glu As Ile Thr Val Tyr Arg Trp
785 790 795
TGC GGC ATG CCC GAG TTC GGC TAC CAG ATC AGC GAC CCC CTG CCC AGC 1103
Cys Gly Met Pro Glu Phe Gly Tyr Gln Ile Ser Asp Pro Leu Pro Ser
800 805 810 815
CTG AAG GAC TTC GAG GAG CAG TTC CTG AAC ACC ATC AAG GAG GAC AAG 1151
Leu Lys Asp Phe Glu Glu Gln Phe Leu Asn Thr Ile Lys Glu Asp Lys
820 825 830
GGC TAC ATG AGC ACC AGC CTG AGC AGC GAG CGC CTG GCC GCC TTC GGC 1199
Gly Tyr Met Ser Thr Ser Leu Ser Ser Glu Arg Leu ALa Ala Phe Gly
835 840 845
AGC CGC AAG ATC ATC CTG CGC CTG CAG GTG CCC AAG GGC AGC ACT GGT 1247
Ser Arg Lys Ile Ile Leu Arg Leu Gln Vai Pro Lys Gly Ser Thr Gly
850 855 860
GCC TAC CTG AGC GCC ATC GGC GGC TTC GCC AGC GAG AAG GAG ATC CTG 1295
Ala Tyr Leu Ser Ala Ile Gly Gly Phe Ala Ser Glu Lys Glu Ile Leu
865 870 875
CTG GAT AAG GAC AGC AAG TAC CAC ATC GAC AAG GTG ACC GAG GTG ATC 1343
Leu Asp Lys Asp Ser Lys Tyr His Ile Asp Lys Val Thr Glu Val Ile
880 885 890 895
ATC AAG GGC GTG AAG CGC TAC GTG GTG GAC GCC ACC CTG CTG ACC AAC 1391
Ile Lys Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn
900 905 910
TCC CGG GGG CCT TCT ACT CCC CCA ACT CCC TCT CCT AGC ACG CCT CCG 1439
Ser Arg Gly Pro Ser Thr Pro Pro Thr Pro Ser Pro Ser Thr Pro Pro
915 920 925
ACA CCT AGC GAT ATC GGA TCC ACC ATG AAG ACC AAC CAG ATC AGC ACC 1487
Thr Pro Ser Asp Ile Gly Ser Thr Met Lys Thr Asn Gln Ile Ser Thr
930 935 940
ACC CAG AAG AAC CAG CAG AAG GAG ATG GAC CGC AAG GGC CTG CTG GGC 1535
Thr Gln Lys Asn Gln Gln Lys Glu Met Asp Arg Lys Gly Leu Leu Gly
945 950 955
TAC TAC TTC AAG GGC AAG GAC TTC AGC AAC CTG ACC ATG TTC GCC CCC 1583
Tyr Tyr Phe Lys Gly Lys Asp Phe Ser Asn Leu Thr Met Phe Ala Pro
960 965 970 975
ACG CGT GAC AGC ACC CTG ATC TAC GAC CAG CAG ACC GCC AAC AAG CTG 1631
Thr Arg Asp Ser Thr Leu Ile Tyr Asp Gln Gln Thr Ala Asn Lys Leu
980 985 990
CTG GAC AAG AAG CAG CAG GAG TAC CAG AGC ATC CGC TGG ATC GGC CTG 1679
Leu Asp Lys Lys Gln Gln Glu Tyr Gln Ser Ile Arg Trp Ile Gly Leu
995 1000 1005
ATC CAG AGC AAG GAG ACC GGC GAC TTC ACC TTC AAC CTG AGC GAG GAC 1727
Ile Gln Ser Lys Glu Thr Gly Asp Phe Thr Phe Asn Leu Ser Glu Asp
1010 1015 1020
GAG CAG GCC ATC ATC GAG ATC AAC GGC AAG ATC ATC AGC AAC AAG GGC 1775
Glu Gln Ala Ile Ile Glu Ile Asn Gly Lys Ile Ile Ser Asn Lys Gly
1025 1030 1035
AAG GAG AAG CAG GTG GTG CAC CTG GAG AAG GGC AAG CTG GTG CCC ATC 1823
Lys Glu Lys Gln Val Val His Leu Glu Lys Gly Lys Leu Val Pro Ile
1040 1045 1050 1055
AAG ATC GAG TAC CAG AGC GAC ACC AAG TTC AAC ATC GAC AGC AAG ACC 1871
Lys Ile Glu Tyr Gln Ser Asp Thr Lys Phe Asn Ile Asp Ser Lys Thr
1060 1065 1070
TTC AAG GAG CTG AAG CTT TTC AAG ATC GAC AGC CAG AAC CAG CCC CAG 1919
Phe Lys Glu Leu Lys Leu Phe Lys Ile Asp Ser Gln Asn Gln Pro Gln
1075 1080 1085
CAG GTG CAG CAG GAC GAG CTG CGC AAC CCC GAG TTC AAC AAG AAG GAG 1967
Gln Val Gln Gln Asp Glu Leu Arg Asn Pro Glu Phe Asn Lys Lys Glu
1090 1095 1100
AGC CAG GAG TTC CTG GCC AAG CCC AGC AAG ATC AAC CTG TTC ACC CAG 2015
Ser Gln Glu Phe Leu Ala Lys Pro Ser Lys Ile Asn Leu Phe Thr Gln
1105 1110 1115
CAG ATG AAG CGC GAG ATC GAC GAG GAC ACC GAC ACC GAC GGC GAC AGC 2063
Gln Met Lys Arg Glu Ile Asp Glu Asp Thr Asp Thr Asp Gly Asp Ser
1120 1125 1130 1135
ATC CCC GAC CTG TGG GAG GAG AAC GGC TAC ACC ATC CAG AAC CGC ATC 2111
Ile Pro Asp Leu Trp Glu Glu Asn Gly Tyr Thr Ile Gln Asn Arg Ile
1140 1145 1150
GCC GTG AAG TGG GAC GAC AGC CTG GCT AGC AAG GGC TAC ACC AAG TTC 2159
Ala Val Lys Trp Asp Asp Ser Leu Ala Ser Lys Gly Tyr Thr Lys Phe
1155 1160 1165
GTG AGC AAC CCC CTG GAG AGC CAC ACC GTG GGC GAC CCC TAC ACC GAC 2207
Val Ser Asn Pro Leu Glu Ser His Thr Val Gly Asp Pro Tyr Thr Asp
1170 1175 1180
TAC GAG AAG GCC GCC CGC GAC CTG GAC CTG AGC AAC GCC AAG GAG ACC 2255
Tyr Glu Lys Ala Ala Arg Asp Leu Asp Leu Ser Asn Ala Lys Glu Thr
1185 1190 1195
TTC AAC CCC CTG GTG GCC GCC TTC CCC AGC GTG AAC GTG AGC ATG GAG 2303
Phe Asn Pro Leu Val Ala Ala Phe Pro Ser Val Asn Val Ser Met Glu
1200 1205 1210 1215
AAG GTG ATC CTG AGC CCC AAC GAG AAC CTG AGC AAC AGC GTG GAG AGC 2351
Lys Val Ile Leu Ser Pro Asn Glu Asn Leu Ser Asn Ser Val Glu Ser
1220 1225 1230
CAC TCG AGC ACC AAC TGG AGC TAC ACC AAC ACC GAG GGC GCC AGC GTG 2399
His Ser Ser Thr Asn Trp Ser Tyr Thr Asn Thr Glu Gly Ala Ser Val
1235 1240 1245
GAG GCC GGC ATC GGT CCC AAG GGC ATC AGC TTC GGC GTG AGC GTG AAC 2447
Glu Ala Gly Ile Gly Pro Lys Gly Ile Ser Phe Gly Val Ser Val Asn
1250 1255 1260
TAC CAG CAC AGC GAG ACC GTG GCC CAG GAG TGG GGC ACC AGC ACC GGC 2495
Tyr Gln His Ser Glu Thr Val Ala Gln Glu Trp Gly Thr Ser Thr Gly
1265 1270 1275
AAC ACC AGC CAG TTC AAC ACC GCC AGC GCC GGC TAC CTG AAC GCC AAC 2543
Asn Thr Ser Gln Phe Asn Thr Ala Ser Ala Gly Tyr Leu Asn Ala Asn
1280 1285 1290 1295
GTG CGC TAC AAC AAC GTG GGC ACC GGC GCC ATC TAC GAC GTG AAG CCC 2591
Val Arg Tyr Asn Asn Val Gly Thr Gly Ala Ile Tyr Asp Val Lys Pro
1300 1305 1310
ACC ACC AGC TTC GTG CTG AAC AAC GAC ACC ATC GCC ACC ATC ACC GCC 2639
Thr Thr Ser Phe Val Leu Asn Asn Asp Thr Ile Ala Thr Ile Thr Ala
1315 1320 1325
AAG TCG AAT TCC ACC GCC CTG AAC ATC AGC CCC GGC GAG AGC TAC CCC 2687
Lys Ser Asn Ser Thr Ala Leu Asn Ile Ser Pro Gly Glu Ser Tyr Pro
1330 1335 1340
AAG AAG GGC CAG AAC GGC ATC GCC ATC ACC AGC ATG GAC GAC TTC AAC 2735
Lys Lys Gly Gln Asn Gly Ile Ala Ile Thr Ser Met Asp Asp Phe Asn
1345 1350 1355
AGC CAC CCC ATC ACC CTG AAC AAG AAG CAG GTG GAC AAC CTG CTG AAC 2783
Ser His Pro Ile Thr Leu Asn Lys Lys Gln Val Asp Asn Leu Leu Asn
1360 1365 1370 1375
AAC AAG CCC ATG ATG CTG GAG ACC AAC CAG ACC GAC GGC GTC TAC AAG 2831
Asn Lys Pro Met Met Leu Glu Thr Asn Gln Thr Asp Gly Val Tyr Lys
1380 1385 1390
ATC AAG GAC ACC CAC GGC AAC ATC GTG ACG GGC GGC GAG TGG AAC GGC 2879
Ile Lys Asp Thr His Gly Asn Ile Val Thr Gly Gly Glu Trp Asn Gly
1395 1400 1405
GTG ATC CAG CAG ATC AAG GCC AAG ACC GCC AGC ATC ATC GTC GAC GAC 2927
Val Ile Gln Gln Ile Lys Ala Lys Thr Ala Ser Ile Ile Val Asp Asp
1410 1415 1420
GGC GAG CGC GTG GCC GAG AAG CGC GTG GCC GCC AAG GAC TAC GAG AAC 2975
Gly Glu Arg Val Ala Glu Lys Arg Val Ala Ala Lys Asp Tyr Glu Asn
1425 1430 1435
CCC GAG GAC AAG ACC CCC AGC CTG ACC CTG AAG GAC GCC CTG AAG CTG 3023
Pro Glu Asp Lys Thr Pro Ser Leu Thr Leu Lys Asp Ala Leu Lys Leu
1440 1445 1450 1455
AGC TAC CCC GAC GAG ATC AAG GAG ATC GAG GGC TTG CTG TAC TAC AAG 3071
Ser Tyr Pro Asp Glu Ile Lys Glu Ile Glu Gly Leu Leu Tyr Tyr Lys
1460 1465 1470
AAC AAG CCC ATC TAC GAG AGC AGC GTG ATG ACC TAT CTA GAC GAG AAC 3119
Asn Lys Pro Ile Tyr Glu Ser Ser Val Met Thr Tyr Leu Asp Glu Asn
1475 1480 1485
ACC GCC AAG GAG GTG ACC AAG CAG CTG AAC GAC ACC ACC GGC AAG TTC 3167
Thr Ala Lys Glu Val Thr Lys Gln Leu Asn Asp Thr Thr Gly Lys Phe
1490 1495 1500
AAG GAC GTG AGC CAC CTG TAC GAC GTG AAG CTG ACC CCC AAG ATG AAC 3215
Lys Asp Val Ser His Leu Tyr Asp Val Lys Leu Thr Pro Lys Met Asn
1505 1510 1515
GTG ACC ATC AAG CTG AGC ATC CTG TAC GAC AAC GCC GAG AGC AAC GAC 3263
Val Thr Ile Lys Leu Ser Ile Leu Tyr Asp Asn Ala Glu Ser Asn Asp
1520 1525 1530 1535
AAC AGC ATC GGC AAG TGG ACC AAC ACC AAC ATC GTG AGC GGC GGC AAC 3311
Asn Ser Ile Gly Lys Trp Thr Asn Thr Asn Ile Val Ser Gly Gly Asn
1540 1545 1550
AAC GGC AAG AAG CAG TAC AGC AGC AAC AAC CCC GAC GCC AAC CTG ACC 3359
Asn Gly Lys Lys Gln Tyr Ser Ser Asn Asn Pro Asp Ala Asn Leu Thr
1555 1560 1565
CTG AAC ACC GAC GCC CAG GAG AAG CTG AAC AAG AAC CGC GAC TAC TAC 3407
Leu Asn Thr Asp Ala Gln Glu Lys Leu Asn Lys Asn Arg Asp Tyr Tyr
1570 1575 1580
ATC AGC CTG TAC ATG AAG AGC GAG AAG AAC ACC CAG TGC GAG ATC ACC 3455
Ile Ser Leu Tyr Met Lys Ser Glu Lys Asn Thr Gln Cys Glu Ile Thr
1585 1590 1595
ATC GAC GGC GAG ATA TAC CCC ATC ACC ACC AAG ACC GTG AAC GTG AAC 3503
Ile Asp Gly Glu Ile Tyr Pro Ile Thr Thr Lys Thr Val Asn Val Asn
1600 1605 1610 1615
AAG GAC AAC TAC AAG CGC CTG GAC ATC ATC GCC CAC AAC ATC AAG AGC 3551
Lys Asp Asn Tyr Lys Arg Leu Asp Ile Ile Ala His Asn Ile Lys Ser
1620 1625 1630
AAC CCC ATC AGC AGC CTG CAC ATC AAG ACC AAC GAC GAG ATC ACC CTG 3599
Asn Pro Ile Ser Ser Leu His Ile Lys Thr Asn Asp Glu Ile Thr Leu
1635 1640 1645
TTC TGG GAC GAC ATA TCG ATT ACC GAC GTC GCC AGC ATC AAG CCC GAG 3647
Phe Trp Asp Asp Ile Ser Ile Thr Asp Val Ala Ser Ile Lys Pro Glu
1650 1655 1660
AAC CTG ACC GAC AGC GAG ATC AAG CAG ATA TAC AGT CGC TAC GGC ATC 3695
Asn Leu Thr Asp Ser Glu Ile Lys Gln Ile Tyr Ser Arg Tyr Gly Ile
1665 1670 1675
AAG CTG GAG GAC GGC ATC CTG ATC GAC AAG AAA GGC GGC ATC CAC TAC 3743
Lys Leu Glu Asp Gly Ile Leu Ile Asp Lys Lys Gly Gly Ile His Tyr
1680 1685 1690 1695
GGC GAG TTC ATC AAC GAG GCC AGC TTC AAC ATC GAG CCC CTG CAG AAC 3791
Gly Glu Phe Ile Asn Glu Ala Ser Phe Asn Ile Glu Pro Leu Gln Asn
1700 1705 1710
TAC GTG ACC AAG TAC GAG GTG ACC TAC AGC AGC GAG CTG GGC CCC AAC 3839
Tyr Val Thr Lys Tyr Glu Val Thr Tyr Ser Ser Glu Leu Gly pro Asn
1715 1720 1725
GTG AGC GAC ACC CTG GAG AGC GAC AAG ATT TAC AAG GAC GGC ACC ATC 3887
Val Ser Asp Thr Leu Glu Ser Asp Lys Ile Tyr Lys Asp Gly Thr Ile
1730 1735 1740
AAG TTC GAC TTC ACC AAG TAC AGC AAG AAC GAG CAG GGC CTG TTC TAC 3935
Lys Phe Asp Phe Thr Lys Tyr Ser Lys Asn Glu Gln Gly Leu Phe Tyr
1745 1750 1755
GAC AGC GGC CTG AAC TGG GAC TTC AAG ATC AAC GCC ATC ACC TAC GAC 3983
Asp Ser Gly Leu Asn Trp Asp Phe Lys Ile Asn Ala Ile Thr Tyr Asp
1760 1765 1770 1775
GGC AAG GAG ATG AAC GTG TTC CAC CGC TAC AAC AAG TAGATCTGAG 4029
Gly Lys Glu Met Asn Val Phe His Arg Tyr Asn Lys
1780 1785
CT 4031
(2)关于SEQ ID NO:50的信息
(i)序列特征
(A)长度:1338个氨基酸
(B)类型:氨基酸
(D)拓扑结构:线型
(ii)分子类型:蛋白质
(xi)序列描述:SEQ ID NO:50:
Met Lys Arg Met Glu Gly Lys Leu Phe Met Val Ser Lys Lys Leu Gln
1 5 10 15
Val Val Thr Lys Thr Val Leu Leu Ser Thr Val Phe Ser Ile Ser Leu
20 25 30
Leu Asn Asn Glu Val Ile Lys Ala Glu Gln Leu Asn Ile Asn Ser Gln
35 40 45
Ser Lys Tyr Thr Asn Leu Gln Asn Leu Lys Ile Thr Asp Lys Val Glu
50 55 60
Asp Phe Lys Glu Asp Lys Glu Lys Ala Lys Glu Trp Gly Lys Glu Lys
65 70 75 80
Glu Lys Glu Trp Lys Leu Thr Ala Thr Glu Lys Gly Lys Met Asn Asn
85 90 95
Phe Leu Asp Asn Lys Asn Asp Ile Lys Thr Asn Tyr Lys Glu Ile Thr
100 105 110
Phe Ser Ile Ala Gly Ser Phe Glu Asp Glu Ile Lys Asp Leu Lys Glu
115 120 125
Ile Asp Lys Met Phe Asp Lys Thr Asn Leu Ser Asn Ser Ile Ile Thr
130 135 140
Tyr Lys Asn Val Glu Pro Thr Thr Ile Gly Phe Asn Lys Ser Leu Thr
145 150 155 160
Glu Gly Asn Thr Ile Asn Ser Asp Ala Met Ala Gln Phe Lys Glu Gln
165 170 175
Phe Leu Asp Arg Asp Ile Lys Phe Asp Ser Tyr Leu Asp Thr His Leu
180 185 190
Thr Ala Gln Gln Val Ser Ser Lys Glu Arg Val Ile Leu Lys Val Thr
195 200 205
Val Pro Ser Gly Lys Gly Ser Thr Thr Pro Thr Lys Ala Gly Val Ile
210 215 220
Leu Asn Asn Ser Glu Tyr Lys Met Leu Ile Asp Asn Gly Tyr Met Val
225 230 235 240
His Val Asp Lys Val Ser Lys Val Val Lys Lys Gly Val Glu Cys Leu
245 250 255
Gln Ile Glu Gly Thr Leu Lys Lys Ser Leu Asp Phe Lys Asn Asp Ile
260 265 270
Asn Ala Glu Ala His Ser Trp Gly Met Lys Asn Tyr Glu Glu Trp Ala
275 280 285
Lys Asp Leu Thr Asp Ser Gln Arg Glu Ala Leu Asp Gly Tyr Ala Arg
290 295 300
Gln Asp Tyr Lys Glu Ile Asn Asn Tyr Leu Arg Asn Gln Gly Gly Ser
305 310 315 320
Gly Asn Glu Lys Leu Asp Ala Gln Ile Lys Asn Ile Ser Asp Ala Leu
325 330 335
Gly Lys Lys Pro Ile Pro Glu Asn Ile Thr Val Tyr Arg Trp Cys Gly
340 345 350
Met Pro Glu Phe Gly Tyr Gln Ile Ser Asp Pro Leu Pro Ser Leu Lys
355 360 365
Asp Phe Glu Glu Gln Phe Leu Asn Thr Ile Lys Glu Asp Lys Gly Tyr
370 375 380
Met Ser Thr Ser Leu Ser Ser Glu Arg Leu Ala Ala Phe Gly Ser Arg
385 390 395 400
Lys Ile Ile Leu Arg Leu Gln Val Pro Lys Gly Ser Thr Gly Ala Tyr
405 410 415
Leu Ser Ala Ile Gly Gly Phe Ala Ser Glu Lys Glu Ile Leu Leu Asp
420 425 430
Lys Asp Ser Lys Tyr His Ile Asp Lys Val Thr Glu Val Ile Ile Lys
435 440 445
Gly Val Lys Arg Tyr Val Val Asp Ala Thr Leu Leu Thr Asn Ser Arg
450 455 460
Gly Pro Ser Thr Pro Pro Thr Pro Ser Pro Ser Thr Pro Pro Thr Pro
465 470 475 480
Ser Asp Ile Gly Ser Thr Met Lys Thr Asn Gln Ile Ser Thr Thr Gln
485 490 495
Lys Asn Gln Gln Lys Glu Met Asp Arg Lys Gly Leu Leu Gly Tyr Tyr
500 505 510
Phe Lys Gly Lys Asp Phe Ser Asn Leu Thr Met Phe Ala Pro Thr Arg
515 520 525
Asp Ser Thr Leu Ile Tyr Asp Gln Gln Thr Ala Asn Lys Leu Leu Asp
530 535 540
Lys Lys Gln Gln Glu Tyr Gln Ser Ile Arg Trp Ile Gly Leu Ile Gln
545 550 555 560
Ser Lys Glu Thr Gly Asp Phe Thr Phe Asn Leu Ser Glu Asp Glu Gln
565 570 575
Ala Ile Ile Glu Ile Asn Gly Lys Ile Ile Ser Asn Lys Gly Lys Glu
580 585 590
Lys Gln Val Val His Leu Glu Lys Gly Lys Leu Val Pro Ile Lys Ile
595 600 605
Glu Tyr Gln Ser Asp Thr Lys Phe Asn Ile Asp Ser Lys Thr Phe Lys
610 615 620
Glu Leu Lys Leu Phe Lys Ile Asp Ser Gln Asn Gln Pro Gln Gln Val
625 630 635 640
Gln Gln Asp Glu Leu Arg Asn Pro Glu Phe Asn Lys Lys Glu Ser Gln
645 650 655
Glu Phe Leu Ala Lys Pro Ser Lys Ile Asn Leu Phe Thr Gln Gln Met
660 665 670
Lys Arg Glu Ile Asp Glu Asp Thr Asp Thr Asp Gly Asp Ser Ile Pro
675 680 685
Asp Leu Trp Glu Glu Asn Gly Tyr Thr Ile Gln Asn Arg Ile Ala Val
690 695 700
Lys Trp Asp Asp Ser Leu Ala Ser Lys Gly Tyr Thr Lys Phe Val Ser
705 710 715 720
Asn Pro Leu Glu Ser His Thr Val Gly Asp Pro Tyr Thr Asp Tyr Glu
725 730 735
Lys Ala Ala Arg Asp Leu Asp Leu Ser Asn Ala Lys Glu Thr Phe Asn
740 745 750
Pro Leu Val Ala Ala Phe Pro Ser Val Asn Val Ser Met Glu Lys Val
755 760 765
Ile Leu Ser Pro Asn Glu Asn Leu Ser Asn Ser Val Glu Ser His Ser
770 775 780
Ser Thr Asn Trp Ser Tyr Thr Asn Thr Glu Gly Ala Ser Val Glu Ala
785 790 795 800
Gly Ile Gly Pro Lys Gly Ile Ser Phe Gly Val Ser Val Asn Tyr Gln
805 810 815
His Ser Glu Thr Val Ala Gln Glu Trp Gly Thr Ser Thr Gly Asn Thr
820 825 830
Ser Gln Phe Asn Thr Ala Ser Ala Gly Tyr Leu Asn Ala Asn Val Arg
835 840 845
Tyr Asn Asn Val Gly Thr Gly Ala Ile Tyr Asp Val Lys Pro Thr Thr
850 855 860
Ser Phe Val Leu Asn Asn Asp Thr Ile Ala Thr Ile Thr Ala Lys Ser
865 870 875 880
Asn Ser Thr Ala Leu Asn Ile Ser Pro Gly Glu Ser Tyr Pro Lys Lys
885 890 895
Gly Gln Asn Gly Ile Ala Ile Thr Ser Met Asp Asp Phe Asn Ser His
900 905 910
Pro Ile Thr Leu Asn Lys Lys Gln Val Asp Asn Leu Leu Asn Asn Lys
915 920 925
Pro Met Met Leu Glu Thr Asn Gln Thr Asp Gly Val Tyr Lys Ile Lys
930 935 940
Asp Thr His Gly Asn Ile Val Thr Gly Gly Glu Trp Asn Gly Val Ile
945 950 955 960
Gln Gln Ile Lys Ala Lys Thr Ala Ser Ile Ile Val Asp Asp Gly Glu
965 970 975
Arg Val Ala Glu Lys Arg Val Ala Ala Lys Asp Tyr Glu Asn Pro Glu
980 985 990
Asp Lys Thr Pro Ser Leu Thr Leu Lys Asp Ala Leu Lys Leu Ser Tyr
995 1000 1005
Pro Asp Glu Ile Lys Glu Ile Glu Gly Leu Leu Tyr Tyr Lys Asn Lys
1010 1015 1020
Pro Ile Tyr Glu Ser Ser Val Met Thr Tyr Leu Asp Glu Asn Thr Ala
1025 1030 1035 1040
Lys Glu Val Thr Lys Gln Leu Asn Asp Thr Thr Gly Lys Phe Lys Asp
1045 1050 1055
Val Ser His Leu Tyr Asp Val Lys Leu Thr Pro Lys Met Asn Val Thr
1060 1065 1070
Ile Lys Leu Ser Ile Leu Tyr Asp Asn Ala Glu Ser Asn Asp Asn Ser
1075 1080 1085
Ile Gly Lys Trp Thr Asn Thr Asn Ile Val Ser Gly Gly Asn Asn Gly
1090 1095 1100
Lys Lys Gln Tyr Ser Ser Asn Asn Pro Asp Ala Asn Leu Thr Leu Asn
1105 1110 1115 1120
Thr Asp Ala Gln Glu Lys Leu Asn Lys Asn Arg Asp Tyr Tyr Ile Ser
1125 1130 1135
Leu Tyr Met Lys Ser Glu Lys Asn Thr Gln Cys Glu Ile Thr Ile Asp
1140 1145 1150
Gly Glu Ile Tyr Pro Ile Thr Thr Lys Thr Val Asn Val Asn Lys Asp
1155 1160 1165
Asn Tyr Lys Arg Leu Asp Ile Ile Ala His Asn Ile Lys Ser Asn Pro
1170 1175 1180
Ile Ser Ser Leu His Ile Lys Thr Asn Asp Glu Ile Thr Leu Phe Trp
1185 1190 1195 1200
Asp Asp Ile Ser Ile Thr Asp Val Ala Ser Ile Lys Pro Glu Asn Leu
1205 1210 1215
Thr Asp Ser Glu Ile Lys Gln Ile Tyr Ser Arg Tyr Gly Ile Lys Leu
1220 1225 1230
Glu Asp Gly Ile Leu Ile Asp Lys Lys Gly Gly Ile His Tyr Gly Glu
1235 1240 1245
Phe Ile Asn Glu Ala Ser Phe Asn Ile Glu Pro Leu Gln Asn Tyr Val
1250 1255 1260
Thr Lys Tyr Glu Val Thr Tyr Ser Ser Glu Leu Gly Pro Asn Val Ser
1265 1270 1275 1280
Asp Thr Leu Glu Ser Asp Lys Ile Tyr Lys Asp Gly Thr Ile Lys Phe
1285 1290 1295
Asp Phe Thr Lys Tyr Ser Lys Asn Glu Gln Gly Leu Phe Tyr Asp Ser
1300 1305 1310
Gly Leu Asn Trp Asp Phe Lys Ile Asn Ala Ile Thr Tyr Asp Gly Lys
1315 1320 1325
Glu Met Asn Val Phe His Arg Tyr Asn Lys
1330 1335
(2)关于SEQ ID NO:51的信息
(i)序列特征
(A)长度:2444个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:DNA(基因组)
(iii)假拟:否
(ix)特征:
(A)名称/键:CDS
(B)位置:17..2444
(D)其它信息:/产品=“3A(a)合成的:天然的融合”
(xi)序列描述:SEQ ID NO:51:
GGATCCACCA ATGAAC ATG AAC AAG AAC AAC ACC AAG CTG AGC ACC CGC 49
Met Asn Lys Asn Asn Thr Lys Leu Ser Thr Arg
1 5 10
GCC CTG CCG AGC TTC ATC GAC TAC TTC AAC GGC ATC TAC GGC TTC GCC 97
Ala Leu Pro Ser Phe Ile Asp Tyr Phe Asn Gly Ile Tyr Gly Phe Ala
15 20 25
ACC GGC ATC AAG GAC ATC ATG AAC ATG ATC TTC AAG ACC GAC ACC GGC 145
Thr Gly Ile Lys Ile Ile Met Asn Met Ile Phe Lys Thr Asp Thr Gly
30 35 40
GGC GAC CTG ACC CTG GAC GAG ATC CTG AAG AAC CAG CAG CTG CTG AAC 193
Gly Asp Leu Thr Leu Asp Glu Ile Leu Lys Asn Gln Gln Leu Leu Asn
45 50 55
GAC ATC AGC GGC AAG CTG GAC GGC GTG AAC GGC AGC CTG AAC GAC CTG 241
Asp Ile Ser Gly Lys Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu
60 65 70 75
ATC GCC CAG GGC AAC CTG AAC ACC GAG CTG AGC AAG GAG ATC CTT AAG 289
Ile Ala Gln Gly Asn Leu Asn Thr Glu Leu Ser Lys Glu Ile Leu Lys
80 85 90
ATC GCC AAC GAG CAG AAC CAG GTG CTG AAC GAC GTG AAC AAC AAG CTG 337
Ile Ala Asn Glu Gln Asn Gln Val Leu Asn Asp Val Asn Asn Lys Leu
95 100 105
GAC GCC ATC AAC ACC ATG CTG CGC GTG TAC CTG CCG AAG ATC ACC AGC 385
Asp Ala Ile Asn Thr Met Leu Arg Val Tyr Leu Pro Lys Ile Thr Ser
110 115 120
ATG CTG AGC GAC GTG ATG AAG CAG AAC TAC GCC CTG AGC CTG CAG ATC 433
Met Leu Ser Asp Val Met Lys Gln Asn Tyr Ala Leu Ser Leu Gln Ile
125 130 135
GAG TAC CTG AGC AAG CAG CTG CAG GAG ATC AGC GAC AAG CTG GAC ATC 481
Glu Tyr Leu Ser Lys Gln Leu Gln Glu Ile Ser Asp Lys Leu Asp Ile
140 145 150 155
ATC AAC GTG AAC GTC CTG ATC AAC AGC ACC CTG ACC GAG ATC ACC CCG 529
Ile Asn Val Asn Val Leu Ile Asn Ser Thr Leu Thr Glu Ile Thr Pro
160 165 170
GCC TAC CAG CGC ATC AAG TAC GTG AAC GAG AAG TTC GAA GAG CTG ACC 577
Ala Tyr Gln Arg Ile Lys Tyr Val Asn Glu Lys Phe Glu Glu Leu Thr
175 180 185
TTC GCC ACC GAG ACC AGC AGC AAG GTG AAG AAG GAC GGC AGC CCG GCC 625
Phe Ala Thr Glu Thr Ser Ser Lys Val Lys Lys Asp Gly Ser Pro Ala
190 195 200
GAC ATC CTG GAC GAG CTG ACC GAG CTG ACC GAG CTG GCC AAG AGC GTG 673
Asp Ile Leu Asp Glu Leu Thr Glu Leu Thr Glu Leu Ala Lys Ser Val
205 210 215
ACC AAG AAC GAC GTG GAC GGC TTC GAG TTC TAC CTG AAC ACC TTC CAC 721
Thr Lys Asn Asp Val Asp Gly Phe Glu Phe Tyr Leu Asn Thr Phe His
220 225 230 235
GAC GTG ATG GTG GGC AAC AAC CTG TTC GGC CGC AGC GCC CTG AAG ACC 769
Asp Val Met Val Gly Asn Asn Leu Phe Gly Arg Ser Ala Leu Lys Thr
240 245 250
GCC AGC GAG CTG ATC ACC AAG GAG AAC GTG AAG ACC AGC GGC AGC GAG 817
Ala Ser Glu Leu Ile Thr Lys Glu Asn Val Lys Thr Ser Gly Ser Glu
255 260 265
GTG GGC AAC GTG TAC AAC TTC CTG ATC GTG CTG ACC GCC CTG CAG GCC 865
Val Gly Asn Val Tyr Asn Phe Leu Ile Val Leu Thr Ala Leu Gln Ala
270 275 280
CAG GCC TTC CTG ACC CTG ACC ACC TGT CGC AAG CTG CTG GGC CTG GCC 913
Gln Ala Phe Leu Thr Leu Thr Thr Cys Arg Lys Leu Leu Gly Leu Ala
285 290 295
GAC ATC GAC TAC ACC AGC ATC ATG AAC GAG CAC TTG AAC AAG GAG AAG 961
Asp Ile Asp Tyr Thr Ser Ile Met Asn Glu His Leu Asn Lys Glu Lys
300 305 310 315
GAG GAG TTC CGC GTG AAC ATC CTG CCG ACC CTG AGC AAC ACC TTC AGC 1009
Glu Glu Phe Arg Val Asn Ile Leu Pro Thr Leu Ser Asn Thr Phe Ser
320 325 330
AAC CCG AAC TAC GCC AAG GTG AAG GGC AGC GAC GAG GAC GCC AAG ATG 1057
Asn Pro Asn Tyr AAa Lys Val Lys Gly Ser Asp Glu Asp Ala Lys Met
335 340 345
ATC GTG GAG GCT AAG CCG GGC CAC GCG TTG ATC GGC TTC GAG ATC AGC 1105
Ile Val Glu Ala Lys Pro Gly His Ala Leu Ile Gly Phe Glu Ile Ser
350 355 360
AAC GAC AGC ATC ACC GTG CTG AAG GTG TAC GAG GCC AAG CTG AAG CAG 1153
Asn Asp Ser Ile Thr Val Leu Lys Val Tyr Glu Ala Lys Leu Lys Gln
365 370 375
AAC TAC CAG GTG GAC AAG GAC AGC TTG AGC GAG GTG ATC TAC GGC GAC 1201
Asn Tyr Gln Val Asp Lys Asp Ser Leu Ser Glu Val Ile Tyr Gly Asp
380 385 390 395
ATG GAC AAG CTG CTG TGT CCG GAC CAG AGC GAG CAA ATC TAC TAC ACC 1249
Met Asp Lys Leu Leu Cys Pro Asp Gln Ser Glu Gln Ile Tyr Tyr Thr
400 405 410
AAC AAC ATC GTG TTC CCG AAC GAG TAC GTG ATC ACC AAG ATC GAC TTC 1297
Asn Asn Ile Val Phe Pro Asn Glu Tyr Val Ile Thr Lys Ile Asp Phe
415 420 425
ACC AAG AAG ATG AAG ACC CTG CGC TAC GAG GTG ACC GCC AAC TTC TAC 1345
Thr Lys Lys Met Lys Thr Leu Arg Tyr Glu Val Thr Ala Asn Phe Tyr
430 435 440
GAC AGC AGC ACC GGC GAG ATC GAC CTG AAC AAG AAG AAG GTG GAG AGC 1393
Asp Ser Ser Thr Gly Glu Ile Asp Leu Asn Lys Lys Lys Val Glu Ser
445 450 455
AGC GAG GCC GAG TAC CGC ACC CTG AGC GCG AAC GAC GAC GGC GTC TAC 1441
Ser Glu Ala Glu Tyr Arg Thr Leu Ser Ala Asn Asp Asp Gly Val Tyr
460 465 470 475
ATG CCA CTG GGC GTG ATC AGC GAG ACC TTC CTG ACC CCG ATC AAC GGC 1489
Met Pro Leu Gly Val Ile Ser Glu Thr Phe Leu Thr Pro Ile Asn Gly
480 485 490
TTT GGC CTG CAG GCC GAC GAG AAC AGC CGC CTG ATC ACC CTG ACC TGT 1537
Phe Gly Leu Gln Ala Asp Glu Asn Ser Arg Leu Ile Thr Leu Thr Cys
495 500 505
AAG AGC TAC CTG CGC GAG CTG CTG CTA GCC ACC GAC CTG AGC AAC AAG 1585
Lys Ser Tyr Leu Arg Glu Leu Leu Leu Ala Thr Asp Leu Ser Asn Lys
510 515 520
GAG ACC AAG CTG ATC GTG CCA CCG AGC GGC TTC ATC AGC AAC ATC GTG 1633
Glu Thr Lys Leu Ile Val Pro Pro Ser Gly Phe Ile Ser Asn Ile Val
525 530 535
GAG AAC GGC AGC ATC GAG GAG GAC AAC CTG GAG CCG TGG AAG GCC AAC 1681
Glu Asn Gly Ser Ile Glu Glu Asp Asn Leu Glu Pro Trp Lys Ala Asn
540 545 550 555
AAC AAG AAC GCC TAC GTG GAC CAC ACC GGC GGC GTG AAC GGC ACC AAG 1729
Asn Lys Asn Ala Tyr Val Asp His Thr Gly Gly Val Asn Gly Thr Lys
560 565 570
GCC CTG TAC GTG CAC AAG GAC GGC GGC ATC AGC CAG TTC ATC GGC GAC 1777
Ala Leu Tyr Val His Lys Asp Gly Gly Ile Ser Gln Phe Ile Gly Asp
575 580 585
AAG CTG AAG CCG AAG ACC GAG TAC GTG ATC CAG TAC ACC GTG AAG GGC 1825
Lys Leu Lys Pro Lys Thr Glu Tyr Val Ile Gln Tyr Thr Val Lys Gly
590 595 600
AAG CCA TCG ATT CAC CTG AAG GAC GAG AAC ACC GGC TAC ATC CAC TAC 1873
Lys Pro Ser Ile His Leu Lys Asp Glu Asn Thr Gly Tyr Ile His Tyr
605 610 615
GAG GAC ACC AAC AAC AAC CTG GAG GAC TAC CAG ACC ATC AAC AAG CGC 1921
Glu Asp Thr Asn Asn Asn Leu Glu Asp Tyr Gln Thr Ile Asn Lys Arg
620 625 630 635
TTC ACC ACC GGC ACC GAC CTG AAG GGC GTG TAC CTG ATC CTG AAG AGC 1969
Phe Thr Thr Gly Thr Asp Leu Lys Gly Val Tyr Leu Ile Leu Lys Ser
640 645 650
CAG AAC GGC GAC GAG GCC TGG GGC GAC AAC TTC ATC ATC CTG GAG ATC 2017
Gln Asn Gly Asp Glu Ala Trp Gly Asp Asn Phe Ile Ile Leu Glu Ile
655 660 665
AGC CCG AGC GAG AAG CTG CTG AGC CCG GAG CTG ATC AAC ACC AAC AAC 2065
Ser Pro Ser Glu Lys Leu Leu Ser Pro Glu Leu Ile Asn Thr Asn Asn
670 675 680
TGG ACC AGC ACC GGC AGC ACC AAC ATC AGC GGC AAC ACC CTG ACC CTG 2113
Trp Thr Ser Thr Gly Ser Thr Asn Ile Ser Gly Asn Thr Leu Thr Leu
685 690 695
TAC CAG GGC GGC CGG GGG ATT CTA AAA CAA AAC CTT CAA TTA GAT AGT 2161
Tyr Gln Gly Gly Arg Gly Ile Leu Lys Gln Asn Leu Gln Leu Asp Ser
700 705 710 715
TTT TCA ACT TAT AGA GTG TAT TTT TCT GTG TCC GGA GAT GCT AAT GTA 2209
Phe Ser Thr Tyr Arg Val Tyr Phe Ser Val Ser Gly Asp Ala Asn Val
720 725 730
AGG ATT AGA AAT TCT AGG GAA GTG TTA TTT GAA AAA AGA TAT ATG AGC 2257
Arg Ile Arg Asn Ser Arg Glu Val Leu Phe Glu Lys Arg Tyr Met Ser
735 740 745
GGT GCT AAA GAT GTT TCT GAA ATG TTC ACT ACA AAA TTT GAG AAA GAT 2305
Gly Ala Lys Asp Val Ser Glu Met Phe Thr Thr Lys Phe Glu Lys Asp
750 755 760
AAC TTT TAT ATA GAG CTT TCT CAA GGG AAT AAT TTA TAT GGT GGT CCT 2353
Asn Phe Tyr Ile Glu Leu Ser Gln Gly Asn Asn Leu Tyr Gly Gly Pro
765 770 775
ATT GTA CAT TTT TAC GAT GTC TCT ATT AAG NAA GAT CGG GAT CTA ATA 2401
Ile Val His Phe Tyr Asp Val Ser Ile Lys Xaa Asp Arg Asp Leu Ile
780 785 790 795
TTA ACA GTT TTT AAA AGC NAA TTC TTG TAT AAT GTC CTT GAT T 2444
Leu Thr Val Phe Lys Ser Xaa Phe Leu Tyr Asn Val Leu Asp
800 805
(2)关于SEQ ID NO:52的信息
(i)序列特征
(A)长度:809个氨基酸
(B)类型:氨基酸
(D)拓扑结构:线型
(ii)分子类型:蛋白质
(xi)序列描述:SEQ ID NO:52:
Met Asn Lys Asn Asn Thr Lys Leu Ser Thr Arg Ala Leu Pro Ser Phe
1 5 10 15
Ile Asp Tyr Phe Asn Gly Ile Tyr Gly Phe Ala Thr Gly Ile Lys Asp
20 25 30
Ile Met Asn Met Ile Phe Lys Thr Asp Thr Gly Gly Asp Leu Thr Leu
35 40 45
Asp Glu Ile Leu Lys Asn Gln Gln Leu Leu Asn Asp Ile Ser Gly Lys
50 55 60
Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu Ile Ala Gln Gly Asn
65 70 75 80
Leu Asn Thr Glu Leu Ser Lys Glu Ile Leu Lys Ile Ala Asn Glu Gln
85 90 95
Asn Gln Val Leu Asn Asp Val Asn Asn Lys Leu Asp Ala Ile Asn Thr
100 105 110
Met Leu Arg Val Tyr Leu Pro Lys Ile Thr Ser Met Leu Ser Asp Val
115 120 125
Met Lys Gln Asn Tyr Ala Leu Ser Leu Gln Ile Glu Tyr Leu Ser Lys
130 135 140
Gln Leu Gln Glu Ile Ser Asp Lys Leu Asp Ile Ile Asn Val Asn Val
145 150 155 160
Leu Ile Asn Ser Thr Leu Thr Glu Ile Thr Pro Ala Tyr Gln Arg Ile
165 170 175
Lys Tyr Val Asn Glu Lys Phe Glu Glu Leu Thr Phe Ala Thr Glu Thr
180 185 190
Ser Ser Lys Val Lys Lys Asp Gly Ser Pro Ala Asp Ile Leu Asp Glu
195 200 205
Leu Thr Glu Leu Thr Glu Leu Ala Lys Ser Val Thr Lys Asn Asp Val
210 215 220
Asp Gly Phe Glu Phe Tyr Leu Asn Thr Phe His Asp Val Met Val Gly
225 230 235 240
Asn Asn Leu Phe Gly Arg Ser Ala Leu Lys Thr Ala Ser Glu Leu Ile
245 250 255
Thr Lys Glu Asn Val Lys Thr Ser Gly Ser Glu Val Gly Asn Val Tyr
260 265 270
Asn Phe Leu Ile Val Leu Thr Ala Leu Gln Ala Gln Ala Phe Leu Thr
275 280 285
Leu Thr Thr Cys Arg Lys Leu Leu Gly Leu Ala Asp Ile Asp Tyr Thr
290 295 300
Ser Ile Met Asn Glu His Leu Asn Lys Glu Lys Glu Glu Phe Arg Val
305 310 315 320
Asn Ile Leu Pro Thr Leu Ser Asn Thr Phe Ser Asn Pro Asn Tyr Ala
325 330 335
Lys Val Lys Gly Ser Asp Glu Asp Ala Lys Met Ile Val Glu Ala Lys
340 345 350
Pro Gly His Ala Leu Ile Gly Phe Glu Ile Ser Asn Asp Ser Ile Thr
355 360 365
Val Leu Lys Val Tyr Glu Ala Lys Leu Lys Gln Asn Tyr Gln Val Asp
370 375 380
Lys Asp Ser Leu Ser Glu Val Ile Tyr Gly Asp Met Asp Lys Leu Leu
385 390 395 400
Cys Pro Asp Gln Ser Glu Gln Ile Tyr Tyr Thr Asn Asn Ile Val Phe
405 410 415
Pro Asn Glu Tyr Val Ile Thr Lys Ile Asp Phe Thr Lys Lys Met Lys
420 425 430
Thr Leu Arg Tyr Glu Val Thr Ala Asn Phe Tyr Asp Ser Ser Thr Gly
435 440 445
Glu Ile Asp Leu Asn Lys Lys Lys Val Glu Ser Ser Glu Ala Glu Tyr
450 455 460
Arg Thr Leu Ser Ala Asn Asp Asp Gly Val Tyr Met Pro Leu Gly Val
465 470 475 480
Ile Ser Glu Thr Phe Leu Thr Pro Ile Asn Gly Phe Gly Leu Gln Ala
485 490 495
Asp Glu Asn Ser Arg Leu Ile Thr Leu Thr Cys Lys Ser Tyr Leu Arg
500 505 510
Glu Leu Leu Leu Ala Thr Asp Leu Ser Asn Lys Glu Thr Lys Leu Ile
515 520 525
Val Pro Pro Ser Gly Phe Ile Ser Asn Ile Val Glu Asn Gly Ser Ile
530 535 540
Glu Glu Asp Asn Leu Glu Pro Trp Lys Ala Asn Asn Lys Asn Ala Tyr
545 550 555 560
Val Asp His Thr Gly Gly Val Asn Gly Thr Lys Ala Leu Tyr Val His
565 570 575
Lys Asp Gly Gly Ile Ser Gln Phe Ile Gly Asp Lys Leu Lys Pro Lys
580 585 590
Thr Glu Tyr Val Ile Gln Tyr Thr Val Lys Gly Lys Pro Ser Ile His
595 600 605
Leu Lys Asp Glu Asn Thr Gly Tyr Ile His Tyr Glu Asp Thr Asn Asn
610 615 620
Asn Leu Glu Asp Tyr Gln Thr Ile Asn Lys Arg Phe Thr Thr Gly Thr
625 630 635 640
Asp Leu Lys Gly Val Tyr Leu Ile Leu Lys Ser Gln Asn Gly Asp Glu
645 650 655
Ala Trp Gly Asp Asn Phe Ile Ile Leu Glu Ile Ser Pro Ser Glu Lys
660 665 670
Leu Leu Ser Pro Glu Leu Ile Asn Thr Asn Asn Trp Thr Ser Thr Gly
675 680 685
Ser Thr Asn Ile Ser Gly Asn Thr Leu Thr Leu Tyr Gln Gly Gly Arg
690 695 700
Gly Ile Leu Lys Gln Asn Leu Gln Leu Asp Ser Phe Ser Thr Tyr Arg
705 710 715 720
Val Tyr Phe Ser Val Ser Gly Asp Ala Asn Val Arg Ile Arg Asn Ser
725 730 735
Arg Glu Val Leu Phe Glu Lys Arg Tyr Met Ser Gly Ala Lys Asp Val
740 745 750
Ser Glu Met Phe Thr Thr Lys Phe Glu Lys Asp Asn Phe Tyr Ile Glu
755 760 765
Leu Ser Gln Gly Asn Asn Leu Tyr Gly Gly Pro Ile Val His Phe Tyr
770 775 780
Asp Val Ser Ile Lys Xaa Asp Arg Asp Leu Ile Leu Thr Val Phe Lys
785 790 795 800
Ser Xaa Phe Leu Tyr Asn Val Leu Asp
805
(2)关于SEQ ID NO:53的信息
(i)序列特征
(A)长度:3474个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:DNA
(iii)假拟:否
(iv)反义:否
(vi)原始来源
(A)生物:纯化的玉米最佳化的合成BT CryIA(b)基因
(xi)序列描述:SEQ ID NO:53:
ATGGACAACA ACCCCAACAT CAACGAGTGC ATCCCCTACA ACTGCCTGAG CAACCCCGAG
60
GTGGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGCTACA CCCCCATCGA CATCAGCCTG
120
AGCCTGACCC AGTTCCTGCT GAGCGAGTTC GTGCCCGGCG CCGGCTTCGT GCTGGGCCTG
180
GTGGACATCA TCTGGGGCAT CTTCGGCCCC AGCCAGTGGG ACGCCTTCCT GGTGCAGATC
240
GAGCAGCTGA TCAACCAGCG CATCGAGGAG TTCGCCCGCA ACCAGGCCAT CAGCCGCCTG
300
GAGGGCCTGA GCAACCTGTA CCAGATCTAC GCCGAGAGCT TCCGCGAGTG GGAGGCCGAC
360
CCCACCAACC CCGCCCTGCG CGAGGAGATG CGCATCCAGT TCAACGACAT GAACAGCGCC
420
CTGACCACCG CCATCCCCCT GTTCGCCGTG CAGAACTACC AGGTGCCCCT GCTGAGCGTG
480
TACGTGCAGG CCGCCAACCT GCACCTGAGC GTGCTGCGCG ACGTGAGCGT GTTCGGCCAG
540
CGCTGGGGCT TCGACGCCGC CACCATCAAC AGCCGCTACA ACGACCTGAC CCGCCTGATC
600
GGCAACTACA CCGACCACGC CGTGCGCTGG TACAACACCG GCCTGGAGCG CGTGTGGGGC
660
CCCGACAGCC GCGACTGGAT CCGCTACAAC CAGTTCCGCC GCGAGCTGAC CCTGACCGTG
720
CTGGACATCG TGAGCCTGTT CCCCAACTAC GACAGCCGCA CCTACCCCAT CCGCACCGTG
780
AGCCAGCTGA CCCGCGAGAT CTACACCAAC CCCGTGCTGG AGAACTTCGA CGGCAGCTTC
840
CGCGGCAGCG CCCAGGGCAT CGAGGGCAGC ATCCGCAGCC CCCACCTGAT GGACATCCTG
900
AACAGCATCA CCATCTACAC CGACGCCCAC CGCGGCGAGT ACTACTGGAG CGGCCACCAG
960
ATCATGGCCA GCCCCGTGGG CTTCAGCGGC CCCGAGTTCA CCTTCCCCCT GTACGGCACC
1020
ATGGGCAACG CCGCCCCCCA GCAGCGCATC GTGGCCCAGC TGGGCCAGGG CGTGTACCGC
1080
ACCCTGAGCA GCACCCTGTA CCGCCGCCCC TTCAACATCG GCATCAACAA CCAGCAGCTG
1140
AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA GCAACCTGCC CAGCGCCGTG
1200
TACCGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCCCCAGAA CAACAACGTG
1260
CCCCCCCGCC AGGGCTTCAG CCACCGCCTG AGCCACGTGA GCATGTTCCG CAGCGGCTTC
1320
AGCAACAGCA GCGTGAGCAT CATCCGCGCC CCCATGTTCA GCTGGATCCA CCGCAGCGCC
1380
GAGTTCAACA ACATCATCCC CAGCAGCCAG ATCACCCAGA TCCCCCTGAC CAAGAGCACC
1440
AACCTGGGCA GCGGCACCAG CGTGGTGAAG GGCCCCGGCT TCACCGGCGG CGACATCCTG
1500
CGCCGCACCA GCCCCGGCCA GATCAGCACC CTGCGCGTGA ACATCACCGC CCCCCTGAGC
1560
CAGCGCTACC GCGTGCGCAT CCGCTACGCC AGCACCACCA ACCTGCAGTT CCACACCAGC
1620
ATCGACGGCC GCCCCATCAA CCAGGGCAAC TTCAGCGCCA CCATGAGCAG CGGCAGCAAC
1680
CTGCAGAGCG GCAGCTTCCG CACCGTGGGC TTCACCACCC CCTTCAACTT CAGCAACGGC
1740
AGCAGCGTGT TCACCCTGAG CGCCCACGTG TTCAACAGCG GCAACGAGGT GTACATCGAC
1800
CGCATCGAGT TCGTGCCCGC CGAGGTGACC TTCGAGGCCG AGTACGACCT GGAGCGCGCC
1860
CAGAAGGCCG TGAACGAGCT GTTCACCAGC AGCAACCAGA TCGGCCTGAA GACCGACGTG
1920
ACCGACTACC ACATCGACCA GGTGAGCAAC CTGGTGGAGT GCCTGAGCGA CGAGTTCTGC
1980
CTGGACGAGA AGAAGGAGCT GAGCGAGAAG GTGAAGCACG CCAAGCGCCT GAGCGACGAG
2040
CGCAACCTGC TGCAGGACCC CAACTTCCGC GGCATCAACC GCCAGCTGGA CCGCGGCTGG
2100
CGCGGCAGCA CCGACATCAC CATCCAGGGC GGCGACGACG TGTTCAAGGA GAACTACGTG
2160
ACCCTGCTGG GCACCTTCGA CGAGTGCTAC CCCACCTACC TGTACCAGAA GATCGACGAG
2220
AGCAAGCTGA AGGCCTACAC CCGCTACCAG CTGCGCGGCT ACATCGAGGA CAGCCAGGAC
2280
CTGGAGATCT ACCTGATCCG CTACAACGCC AAGCACGAGA CCGTGAACGT GCCCGGCACC
2340
GGCAGCCTGT GGCCCCTGAG CGCCCCCAGC CCCATCGGCA AGTGCGCCCA CCACAGCCAC
2400
CACTTCAGCC TGGACATCGA CGTGGGCTGC ACCGACCTGA ACGAGGACCT GGGCGTGTGG
2460
GTGATCTTCA AGATCAAGAC CCAGGACGGC CACGCCCGCC TGGGCAACCT GGAGTTCCTG
2520
GAGGAGAAGC CCCTGGTGGG CGAGGCCCTG GCCCGCGTGA AGCGCGCCGA GAAGAAGTGG
2580
CGCGACAAGC GCGAGAAGCT GGAGTGGGAG ACCAACATCG TGTACAAGGA GGCCAAGGAG
2640
AGCGTGGACG CCCTGTTCGT GAACAGCCAG TACGACCGCC TGCAGGCCGA CACCAACATC
2700
GCCATGATCC ACGCCGCCGA CAAGCGCGTG CACAGCATCC GCGAGGCCTA CCTGCCCGAG
2760
CTGAGCGTGA TCCCCGGCGT GAACGCCGCC ATCTTCGAGG AGCTGGAGGG CCGCATCTTC
2820
ACCGCCTTCA GCCTGTACGA CGCCCGCAAC GTGATCAAGA ACGGCGACTT CAACAACGGC
2880
CTGAGCTGCT GGAACGTGAA GGGCCACGTG GACGTGGAGG AGCAGAACAA CCACCGCAGC
2940
GTGCTGGTGG TGCCCGAGTG GGAGGCCGAG GTGAGCCAGG AGGTGCGCGT GTGCCCCGGC
3000
CGCGGCTACA TCCTGCGCGT GACCGCCTAC AAGGAGGGCT ACGGCGAGGG CTGCGTGACC
3060
ATCCACGAGA TCGAGAACAA CACCGACGAG CTGAAGTTCA GCAACTGCGT GGAGGAGGAG
3120
GTGTACCCCA ACAACACCGT GACCTGCAAC GACTACACCG CCACCCAGGA GGAGTACGAG
3180
GGCACCTACA CCAGCCGCAA CCGCGGCTAC GACGGCGCCT ACGAGAGCAA CAGCAGCGTG
3240
CCCGCCGACT ACGCCAGCGC CTACGAGGAG AAGGCCTACA CCGACGGCCG CCGCGACAAC
3300
CCCTGCGAGA GCAACCGCGG CTACGGCGAC TACACCCCCC TGCCCGCCGG CTACGTGACC
3360
AAGGAGCTGG AGTACTTCCC CGAGACCGAC AAGGTGTGGA TCGAGATCGG CGAGACCGAG
3420
GGCACCTTCA TCGTGGACAG CGTGGAGCTG CTGCTGATGG AGGAGTAGTA CATG
3474
(2)关于SEQ ID NO:54的信息
(i)序列特征
(A)长度:3508个核苷酸
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:DNA
(iv)反义:否
(vi)原始来源
(A)生物:合成的玉米最佳化的全长BT CryIA(b)基因
(xi)序列描述:SEQ ID NO:54:
GATCCAACAA TGGACAACAA CCCCAACATC AACGAGTGCA TCCCCTACAA CTGCCTGAGC
60
AACCCCGAGG TGGAGGTGCT GGGCGGCGAG CGCATCGAGA CCGGCTACAC CCCCATCGAC
120
ATCAGCCTGA GCCTGACCCA GTTCCTGCTG AGCGAGTTCG TGCCCGGCGC CGGCTTCGTG
180
CTGGGCCTGG TGGACATCAT CTGGGGCATC TTCGGCCCCA GCCAGTGGGA CGCCTTCCTG
240
GTGCAGATCG AGCAGCTGAT CAACCAGCGC ATCGAGGAGT TCGCCCGCAA CCAGGCCATC
300
AGCCGCCTGG AGGGCCTGAG CAACCTGTAC CAAATCTACG CCGAGAGCTT CCGCGAGTGG
360
GAGGCCGACC CCACCAACCC CGCCCTGCGC GAGGAGATGC GCATCCAGTT CAACGACATG
420
AACAGCGCCC TGACCACCGC CATCCCCCTG TTCGCCGTGC AGAACTACCA GGTGCCCCTG
480
CTGAGCGTGT ACGTGCAGGC CGCCAACCTG CACCTGAGCG TGCTGCGCGA CGTCAGCGTG
540
TTCGGCCAGC GCTGGGGCTT CGACGCCGCC ACCATCAACA GCCGCTACAA CGACCTGACC
600
CGCCTGATCG GCAACTACAC CGACCACGCC GTGCGCTGGT ACAACACCGG CCTGGAGCGC
660
GTGTGGGGTC CCGACAGCCG CGACTGGATC AGGTACAACC AGTTCCGCCG CGAGCTGACC
720
CTGACCGTGC TGGACATCGT GAGCCTGTTC CCCAACTACG ACAGCCGCAC CTACCCCATC
780
CGCACCGTGA GCCAGCTGAC CCGCGAGATT TACACCAACC CCGTGCTGGA GAACTTCGAC
840
GGCAGCTTCC GCGGCAGCGC CCAGGGCATC GAGGGCAGCA TCCGCAGCCC CCACCTGATG
900
GACATCCTGA ACAGCATCAC CATCTACACC GACGCCCACC GCGGCGAGTA CTACTGGAGC
960
GGCCACCAGA TCATGGCCAG CCCCGTCGGC TTCAGCGGCC CCGAGTTCAC CTTCCCCCTG
1020
TACGGCACCA TGGGCAACGC TGCACCTCAG CAGCGCATCG TGGCACAGCT GGGCCAGGGA
1080
GTGTACCGCA CCCTGAGCAG CACCCTGTAC CGTCGACCTT TCAACATCGG CATCAACAAC
1140
CAGCAGCTGA GCGTGCTGGA CGGCACCGAG TTCGCCTACG GCACCAGCAG CAACCTGCCC
1200
AGCGCCGTGT ACCGCAAGAG CGGCACCGTG GACAGCCTGG ACGAGATCCC CCCTCAGAAC
1260
AACAACGTGC CACCTCGACA GGGCTTCAGC CACCGTCTGA GCCACGTGAG CATGTTCCGC
1320
AGTGGCTTCA GCAACAGCAG CGTGAGCATC ATCCGTGCAC CTATGTTCAG CTGGATTCAC
1380
CGCAGTGCCG AGTTCAACAA CATCATCCCC AGCAGCCAGA TCACCCAGAT CCCCCTGACC
1440
AAGAGCACCA ACCTGGGCAG CGGCACCAGC GTGGTGAAGG GCCCCGGCTT CACCGGCGGC
1500
GACATCCTGC GCCGCACCAG CCCCGGCCAG ATCAGCACCC TGCGCGTGAA CATCACCGCC
1560
CCCCTGAGCC AGCGCTACCG CGTCCGCATC CGCTACGCCA GCACCACCAA CCTGCAGTTC
1620
CACACCAGCA TCGACGGCCG CCCCATCAAC CAGGGCAACT TCAGCGCCAC CATGAGCAGC
1680
GGCAGCAACC TGCAGAGCGG CAGCTTCCGC ACCGTGGGCT TCACCACCCC CTTCAACTTC
1740
AGCAACGGCA GCAGCGTGTT CACCCTGAGC GCCCACGTGT TCAACAGCGG CAACGAGGTG
1800
TACATCGACC GCATCGAGTT CGTGCCCGCC GAGGTGACCT TCGAGGCCGA GTACGACCTG
1860
GAGAGGGCTC AGAAGGCCGT GAACGAGCTG TTCACCAGCA GCAACCAGAT CGGCCTGAAG
1920
ACCGACGTGA CCGACTACCA CATCGATCAG GTGAGCAACC TGGTGGAGTG CCTGAGCGAC
1980
GAGTTCTGCC TGGACGAGAA GAAGGAGCTG AGCGAGAAGG TGAAGCACGC CAAGCGCCTG
2040
AGCGACGAGC GCAACCTGCT GCAGGACCCC AACTTCCGCG GCATCAACCG CCAGCTGGAC
2100
CGCGGCTGGC GCGGCAGCAC CGACATCACC ATCCAGGGCG GCGACGACGT GTTCAAGGAG
2160
AACTACGTGA CCCTGCTGGG CACCTTCGAC GAGTGCTACC CCACCTACCT GTACCAGAAG
2220
ATCGACGAGA GCAAGCTGAA GGCCTACACC CGCTACCAGC TGCGCGGCTA CATCGAGGAC
2280
AGCCAGGACC TGGAGATCTA CCTGATCCGC TACAACGCCA AGCACGAGAC CGTGAACGTG
2340
CCCGGCACCG GCAGCCTGTG GCCCCTGAGC GCCCCCAGCC CCATCGGCAA GTGCGCCCAC
2400
CACAGCCACC ACTTCAGCCT GGACATCGAC GTGGGCTGCA CCGACCTGAA CGAGGACCTG
2460
GGCGTGTGGG TGATCTTCAA GATCAAGACC CAGGACGGCC ACGCCCGCCT GGGCAACCTG
2520
GAGTTCCTGG AGGAGAAGCC CCTGGTGGGC GAGGCCCTGG CCCGCGTGAA GCGCGCCGAG
2580
AAGAAGTGGC GCGACAAGCG CGAGAAGCTG GAGTGGGAGA CCAACATCGT GTACAAGGAG
2640
GCCAAGGAGA GCGTGGACGC CCTGTTCGTG AACAGCCAGT ACGACCGCCT GCAGGCCGAC
2700
ACCAACATCG CCATGATCCA CGCCGCCGAC AAGCGCGTGC ACAGCATTCG CGAGGCCTAC
2760
CTGCCCGAGC TGAGCGTGAT CCCCGGCGTG AACGCCGCCA TCTTCGAGGA GCTGGAGGGC
2820
CGCATCTTCA CCGCCTTCAG CCTGTACGAC GCCCGCAACG TGATCAAGAA CGGCGACTTC
2880
AACAACGGCC TGAGCTGCTG GAACGTGAAG GGCCACGTGG ACGTGGAGGA GCAGAACAAC
2940
CACCGCAGCG TGCTGGTGGT GCCCGAGTGG GAGGCCGAGG TGAGCCAGGA GGTGCGCGTG
3000
TGCCCCGGCC GCGGCTACAT CCTGCGCGTG ACCGCCTACA AGGAGGGCTA CGGCGAGGGC
3060
TGCGTGACCA TCCACGAGAT CGAGAACAAC ACCGACGAGC TCAAGTTCAG CAACTGCGTG
3120
GAGGAGGAGG TGTACCCCAA CAACACCGTG ACCTGCAACG ACTACACCGC CACCCAGGAG
3180
GAGTACGAGG GCACCTACAC CAGCCGCAAC CGCGGCTACG ACGGCGCCTA CGAGAGCAAC
3240
AGCAGCGTGC CCGCCGACTA CGCCAGCGCC TACGAGGAGA AGGCCTACAC CGACGGCCGC
3300
CGCGACAACC CCTGCGAGAG CAACCGCGGC TACGGCGACT ACACCCCCCT GCCCGCCGGC
3360
TACGTGACCA AGGAGCTGGA GTACTTCCCC GAGACCGACA AGGTGTGGAT CGAGATCGGC
3420
GAGACCGAGG GCACCTTCAT CGTGGACAGC GTGGAGCTGC TGCTGATGGA GGAGTAGTAC
3480
ATGTGATAGT ACGTAAGCTC GAGGATCT 3508
(2)关于SEQ ID NO:55的信息
(i)序列特征
(A)长度:1961个碱基对
(B)类型:核酸
(C)链型:单链
(D)拓扑结构:线型
(ii)分子类型:DNA
(iii)假拟:否
(iv)反义:否
(vi)原始来源
(A)生物:截短的合成的玉米最佳化BT CryIA(b)基因
(xi)序列描述:SEQ ID NO:55:
GATCCAACAA TGGACAACAA CCCCAACATC AACGAGTGCA TCCCCTACAA CTGCCTGAGC 60
AACCCCGAGG TGGAGGTGCT GGGCGGCGAG CGCATCGAGA CCGGCTACAC CCCCATCGAC 120
ATCAGCCTGA GCCTGACCCA GTTCCTGCTG AGCGAGTTCG TGCCCGGCGC CGGCTTCGTG 180
CTGGGCCTGG TGGACATCAT CTGGGGCATC TTCGGCCCCA GCCAGTGGGA CGCCTTCCTG 240
GTGCAGATCG AGCAGCTGAT CAACCAGCGC ATCGAGGAGT TCGCCCGCAA CCAGGCCATC 300
AGCCGCCTGG AGGGCCTGAG CAACCTGTAC CAAATCTACG CCGAGAGCTT CCGCGAGTGG 360
GAGGCCGACC CCACCAACCC CGCCCTGCGC GAGGAGATGC GCATCCAGTT CAACGACATG 420
AACAGCGCCC TGACCACCGC CATCCCCCTG TTCGCCGTGC AGAACTACCA GGTGCCCCTG 480
CTGAGCGTGT ACGTGCAGGC CGCCAACCTG CACCTGAGCG TGCTGCGCGA CGTCAGCGTG 540
TTCGGCCAGC GCTGGGGCTT CGACGCCGCC ACCATCAACA GCCGCTACAA CGACCTGACC 600
CGCCTGATCG GCAACTACAC CGACCACGCC GTGCGCTGGT ACAACACCGG CCTGGAGCGC 660
GTGTGGGGTC CCGACAGCCG CGACTGGATC AGGTACAACC AGTTCCGCCG CGAGCTGACC 720
CTGACCGTGC TGGACATCGT GAGCCTGTTC CCCAACTACG ACAGCCGCAC CTACCCCATC 780
CGCACCGTGA GCCAGCTGAC CCGCGAGATT TACACCAACC CCGTGCTGGA GAACTTCGAC 840
GGCAGCTTCC GCGGCAGCGC CCAGGGCATC GAGGGCAGCA TCCGCAGCCC CCACCTGATG 900
GACATCCTGA ACAGCATCAC CATCTACACC GACGCCCACC GCGGCGAGTA CTACTGGAGC 960
GGCCACCAGA TCATGGCCAG CCCCGTCGGC TTCAGCGGCC CCGAGTTCAC CTTCCCCCTG 1020
TACGGCACCA TGGGCAACGC TGCACCTCAG CAGCGCATCG TGGCACAGCT GGGCCAGGGA 1080
GTGTACCGCA CCCTGAGCAG CACCCTGTAC CGTCGACCTT TCAACATCGG CATCAACAAC 1140
CAGCAGCTGA GCGTGCTGGA CGGCACCGAG TTCGCCTACG GCACCAGCAG CAACCTGCCC 1200
AGCGCCGTGT ACCGCAAGAG CGGCACCGTG GACAGCCTGG ACGAGATCCC CCCTCAGAAC 1260
AACAACGTGC CACCTCGACA GGGCTTCAGC CACCGTCTGA GCCACGTGAG CATGTTCCGC 1320
AGTGGCTTCA GCAACAGCAG CGTGAGCATC ATCCGTGCAC CTATGTTCAG CTGGATTCAC 1380
CGCAGTGCCG AGTTCAACAA CATCATCCCC AGCAGCCAGA TCACCCAGAT CCCCCTGACC 1440
AAGAGCACCA ACCTGGGCAG CGGCACCAGC GTGGTGAAGG GCCCCGGCTT CACCGGCGGC 1500
GACATCCTGC GCCGCACCAG CCCCGGCCAG ATCAGCACCC TGCGCGTGAA CATCACCGCC 1560
CCCCTGAGCC AGCGCTACCG CGTCCGCATC CGCTACGCCA GCACCACCAA CCTGCAGTTC 1620
CACACCAGCA TCGACGGCCG CCCCATCAAC CAGGGCAACT TCAGCGCCAC CATGAGCAGC 1680
GGCAGCAACC TGCAGAGCGG CAGCTTCCGC ACCGTGGGCT TCACCACCCC CTTCAACTTC 1740
AGCAACGGCA GCAGCGTGTT CACCCTGAGC GCCCACGTGT TCAACAGCGG CAACGAGGTG 1800
TACATCGACC GCATCGAGTT CGTGCCCGCC GAGGTGACCT TCGAGGCCGA GTACGACCTG 1860
GAGAGGGCTC AGAAGGCCGT GAACGAGCTG TTCACCAGCA GCAACCAGAT CGGCCTGAAG 1920
ACCGACGTGA CCGACTACCA CATCGATCAG GTGTAGGAGC T 1961
保藏
于农业研究机构保藏中心(NRRL),Northern Regional ResearchCenter,1815 North University Street,Peoria,lllinois 61604,U.S.A。
菌株命名 保藏号 保藏日期
蜡状芽孢杆菌AB78 NRRL B-21058 1993年3月18日
大肠杆菌pCIB4431 NRRLB-18998 1992年9月21日