包括通过在分子氧和NAD(P)H和全细胞催化剂存在的情况下通过与CYP153家族的细胞色素P450单加氧酶接触来氧化脂肪酸或其酯的步骤,所述全细胞催化剂表达CYP153家族的重组细胞色素P450单加氧酶、重组醇脱氢酶、重组氨基转移酶和任选一种或多于一种选自丙氨酸脱氢酶、铁氧还蛋白和铁氧还蛋白还原酶的重组酶,并涉及该全细胞催化剂用于氧化脂肪酸或其酯的用途,
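Polyamides are a class of polymers characterised by repeating amide groups. Unlike the chemically related proteins, the term "polyamide" usually refers to synthetic, commercially available thermoplastic polymers. Polyamides are derived from primary or secondary amines, which are typically obtained during the cracking of hydrocarbons. However, derivatives, more precisely aminocarboxylic acids, lactams and diamines, can also be used to produce the polymers. Also of interest as starting materials are short-chain gaseous alkanes, which can be obtained from renewable raw materials using biotechnological methods.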
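Many polyamides in high commercial demand are produced from lactams. For example, "polyamide 6" can be obtained by polymerising ε-caprolactam, and "polyamide 12" by polymerising laurolactam. Other commercially interesting products include copolymers of lactams, for example copolymers of ε-caprolactam and laurolactam.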
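Conventional chemical production of amines depends on the supply of fossil raw materials, is inefficient, and generates large amounts of undesired by-products, up to 80% in some steps of the synthesis. One example of such a process is the production of laurolactam, which is conventionally obtained by trimerisation of butadiene. The trimerisation product, cyclododecatriene, is hydrogenated; the resulting cyclododecane is oxidised to cyclododecanone, which is then reacted with hydroxylamine to give cyclododecanone oxime, which is finally converted to laurolactam by a Beckmann rearrangement.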
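In view of these disadvantages, methods have been developed to obtain amines from renewable raw materials using biocatalysts. Suitable renewable raw materials are in particular sources of fatty acids, which are available in the form of rapeseed oil, globe thistle oil, palm kernel oil, coconut oil, sunflower kernel oil and similar natural products from a large number of biological sources, in particular plants.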
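PCT/EP 2008/067447 describes a biological system for producing chemically related products, more precisely ω-aminocarboxylic acids, using cells that have a range of suitable enzymatic activities and are able to convert carboxylic acids into the corresponding ω-aminocarboxylic acids.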
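A further object of the invention is to improve the method such that the yield, based on the amount of fatty acid substrate or other substrates or on the amount of carbon substrate for the cells used for the biotechnological synthesis, is increased, and/or the concentration of by-products or the ratio of by-products to desired product is reduced.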
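A further object of the invention is to improve the method such that the selectivity of the biocatalysts used, in particular of the fatty acid oxidases, is increased and/or extended, either at the start of the reaction (that is, before the product concentration reaches a plateau over time) or after equilibrium has been established (that is, after the plateau has been reached).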
在第二个实施方案中,其也是第一个实施方案的一个实施方案,所述目的通过这样的方法来实现,其中CYP153家族的细胞色素P450单加氧酶具有肽序列LL(I/L)(V/I)GGNDTTRN,并优选为来自泊库岛食烷菌(Alcanivorax borkumensis)SK2的CYP153家族的细胞色素P450单加氧酶(数据库编号YP_691921)或其变体。
在第四个实施方案中,其也是第一个至第三个实施方案的一个实施方案,所述目的通过这样的方法实现,其中,在步骤a)中,存在另外的铁氧还蛋白还原酶,优选来自泊库岛食烷菌(Alcanivorax borkumensis)SK2的铁氧还蛋白还原酶(数据库编号YP_691923)或其变体,和/或铁氧还蛋白,优选来自泊库岛食烷菌(Alcanivorax borkumensis)SK2的铁氧还蛋白(数据库编号YP_691920)或其变体。
在第五个实施方案中,其也是第一个至第四个实施方案的一个实施方案,所述目的通过这样的方法实现,其中步骤c)在丙氨酸脱氢酶、铵和NADH存在的情况下进行,并且丙氨酸脱氢酶是来自枯草芽孢杆菌枯草亚种168株(Bacillus subtilis subsp. subtilis str. 168)的丙氨酸脱氢酶(数据库编号NP_391071)或其变体。
在第六个实施方案中,其也是第一个至第五个实施方案的一个实施方案,所述目的通过这样的方法实现,其中以全细胞催化剂的形式重组提供至少一种选自以下的酶:CYP153家族的细胞色素P450单加氧酶、醇脱氢酶、氨基转移酶、丙氨酸脱氢酶、铁氧还蛋白和铁氧还蛋白还原酶。
在第七个实施方案中,其也是第六个实施方案的一个实施方案,所述目的通过这样的方法实现,其中以一种或多于一种的全细胞催化剂的形式重组提供在步骤a)、b)或c)的至少一个中存在的或与脂肪酸或其酯、来自步骤b)的进一步氧化的脂肪酸或其酯或者来自步骤c)的胺化的进一步氧化的脂肪酸或其酯接触的选自CYP153家族的细胞色素P450单加氧酶、醇脱氢酶、氨基转移酶、丙氨酸脱氢酶、铁氧还蛋白和铁氧还蛋白还原酶的所有酶。
在第八个实施方案中,其也是第六个至第七个实施方案的一个实施方案,所述目的通过这样的方法实现,其中全细胞催化剂另外表达AlkL家族的多肽,优选选自以下的AlkL:来自恶臭假单胞菌的AlkL(数据库编号CAB69081)、来自水油海杆菌(Marinobacter aquaeolei) VT8的AlkL (数据库编号YP_957722)、来自亚历山大海洋柄菌(Oceanicaulis alexandrii) HTCC2633的AlkL (数据库编号ZP_00953584)、来自Marinobacter manganoxydans MnI7-9的AlkL (数据库编号ZP_09158756)、来自柄杆菌属种(Caulobacter sp.) K31的AlkL (数据库编号YP_001672217)、来自食油假单胞菌(Pseudomonas oleovorans)的AlkL(数据库编号Q00595)或其变体。
在第九个实施方案中,其也是第一个至第八个实施方案的一个实施方案,所述目的通过这样的方法实现,其中全细胞催化剂是具有相比于其野生型至少一种催化脂肪酸的β-氧化反应之一的酶降低的活性的细胞,其中所述酶优选选自脂肪酸输入蛋白、脂肪酸-CoA连接酶、酰基-CoA脱氢酶、2,4-二烯酰基-CoA还原酶、烯酰基-CoA水合酶和3-酮酰基-CoA硫解酶。
在第十个实施方案中,其也是第一个至第九个实施方案的一个实施方案,所述目的通过这样的方法实现,其中选择步骤c)中的丙氨酸脱氢酶,使得其还原步骤b)中由醇脱氢酶氧化的氧化还原辅因子,优选NAD+或NADP+。
在第二方面,所述目的通过全细胞催化剂实现,所述全细胞催化剂表达重组的CYP153家族的细胞色素P450单加氧酶、重组的醇脱氢酶、重组的氨基转移酶和任选一种或多于一种选自以下的重组酶:丙氨酸脱氢酶、铁氧还蛋白和铁氧还蛋白还原酶。
在第二方面的第一个实施方案中,所述目的通过全细胞催化剂实现,其中全细胞催化剂另外表达AlkL家族的多肽,优选选自以下的AlkL:来自恶臭假单胞菌的AlkL(数据库编号CAB69081)、来自水油海杆菌(Marinobacter aquaeolei) VT8的AlkL (数据库编号YP_957722)、来自亚历山大海洋柄菌(Oceanicaulis alexandrii) HTCC2633的AlkL (数据库编号ZP_00953584)、来自Marinobacter manganoxydans MnI7-9的AlkL (数据库编号ZP_09158756)、来自柄杆菌属种(Caulobacter sp.) K31的AlkL (数据库编号YP_001672217)、来自食油假单胞菌(Pseudomonas oleovorans)的AlkL(数据库编号Q00595)或其变体。
在第二个实施方案中,其也是第一个实施方案的一个实施方案,所述目的通过全细胞催化剂实现,其中所述全细胞催化剂是具有相比于其野生型至少一种催化脂肪酸的β-氧化反应之一的酶降低的活性的细胞,其中所述酶优选选自脂肪酸输入蛋白、脂肪酸-CoA连接酶、酰基-CoA脱氢酶、2,4-二烯酰基-CoA还原酶、烯酰基-CoA水合酶和3-酮酰基-CoA硫解酶。
在第四个实施方案中,其也是第一个至第三个实施方案的一个实施方案,所述目的通过全细胞催化剂实现,其中所述全细胞催化剂表达丙氨酸脱氢酶,并且其中所述丙氨酸脱氢酶是来自枯草芽孢杆菌枯草亚种168株(Bacillus subtilis subsp. subtilis str. 168)(数据库编号NP_391071)或其变体。
在第五个实施方案中,其也是第一个至第四个实施方案的一个实施方案,所述目的通过全细胞催化剂实现,其中CYP153家族的细胞色素P450单加氧酶具有肽序列LL(I/L)(V/I)GGNDTTRN和/或它是来自泊库岛食烷菌SK2的CYP153家族的细胞色素P450单加氧酶(数据库编号YP_691921)或其变体,并且所述铁氧还蛋白还原酶是来自泊库岛食烷菌SK2的铁氧还蛋白还原酶(数据库编号YP_691923)或其变体,并且所述铁氧还蛋白是来自泊库岛食烷菌SK2的铁氧还蛋白(数据库编号YP_691920)或其变体。
在第七个实施方案中,其也是第一或第二方面及其实施方案的进一步的实施方案,所述目的通过根据第二方面或第二方面的实施方案的全细胞催化剂或根据第一方面或第一方面的实施方案的方法实现,其中所述氨基转移酶是来自恶臭假单胞菌GB-1的氨基转移酶(数据库编号YP_001668026.1)或其变体。
在第九个实施方案中,其也是第一或第二方面的进一步的实施方案,所述目的通过这样的方法实现,其中以全细胞催化剂的形式重组提供选自CYP153家族的细胞色素P450单加氧酶、醇脱氢酶、氨基转移酶、丙氨酸脱氢酶、铁氧还蛋白和铁氧还蛋白还原酶的至少一种酶,或者所述目的通过全细胞催化剂实现,其中所述全细胞催化剂是相对于细胞的野生型具有至少一种内源醛脱氢酶降低的活性的细胞。
在第三方面,本发明的目的通过根据本发明的第二方面或其实施方案之一的全细胞催化剂用于氧化和/或胺化脂肪酸或其酯的用途来实现,其中所述脂肪酸或其酯优选具有式(I)
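In a first embodiment of the third aspect, the problem is solved by a use in which the oxidation yields a mixture of oxidation products that comprises, based on the amount of substance of fatty acid or ester thereof converted, at least 90% of the corresponding alcohol, less than 1% of the corresponding aldehyde and less than 10% of the corresponding acid.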
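The product-mixture criterion of this embodiment (at least 90% alcohol, less than 1% aldehyde, less than 10% acid, each based on the amount of substance of converted fatty acid or ester) can be checked with a few lines of arithmetic. The following Python sketch is purely illustrative; the function names and the example amounts are assumptions and do not come from the application.

```python
def oxidation_product_shares(alcohol_mol, aldehyde_mol, acid_mol, reacted_mol):
    """Return each product as a fraction of the converted substrate (mol/mol)."""
    if reacted_mol <= 0:
        raise ValueError("reacted_mol must be positive")
    return {
        "alcohol": alcohol_mol / reacted_mol,
        "aldehyde": aldehyde_mol / reacted_mol,
        "acid": acid_mol / reacted_mol,
    }


def meets_selectivity_criteria(shares):
    """Thresholds as stated in the embodiment: >=90% alcohol, <1% aldehyde, <10% acid."""
    return (
        shares["alcohol"] >= 0.90
        and shares["aldehyde"] < 0.01
        and shares["acid"] < 0.10
    )


# Hypothetical amounts in mmol, for illustration only.
good = oxidation_product_shares(9.5, 0.05, 0.45, 10.0)
poor = oxidation_product_shares(8.5, 0.05, 1.45, 10.0)
print(meets_selectivity_criteria(good), meets_selectivity_criteria(poor))
```

The fractions are taken relative to converted substrate, not to substrate charged, which is what "based on the amount of substance of fatty acid or ester thereof converted" implies.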
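The invention is based on the inventors' finding that the use of specific monooxygenases, or of whole-cell catalysts expressing such monooxygenases (more precisely, cytochrome P450 monooxygenases of the CYP153 family), surprisingly leads to the oxidation and/or amination of fatty acids with the desired products being formed with higher selectivity and better relative yield.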
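Without wishing to be bound by any theory, the inventors believe that the active catalytic centre of these monooxygenases is shaped in such a way that already hydroxylated products are bound with lower affinity, and are therefore oxidised further to the aldehyde or all the way to the acid to a lesser extent, than is the case with comparable monooxygenases known from the prior art.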
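As with all compounds mentioned in this application, the term "fatty acid" covers not only the protonated form of the fatty acid but also all forms, preparations or salts that are dissociated in aqueous solution. For example, the term lauric acid also covers laurate and sodium laurate. As a further example, the amino acid alanine covers the forms that are protonated or deprotonated at the carboxyl group and the forms that are protonated or deprotonated at the amino group in water.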
本发明的方法不仅提供氢化脂肪酸的氧化,还允许脂肪酸有效反应为对应的ω-氨基羧酸,其借助于包含CYP153家族的细胞色素P450单加氧酶、醇脱氢酶、氨基转移酶和任选氨基酸脱氢酶的酶系统。此类酶的使用在与它们的酶促活性相容的条件下进行。这些包括首选选择合适的含水缓冲系统,其包含至少一种pH稳定缓冲剂,例如磷酸钠,任选额外至少一种盐,例如氯化钠,并在合适的pH下。在最优选的实施方案中,pH是5、5.5、6、6.5、7、7.5、8、8.5、9或10,优选2.5-7.5,尤其优选5.5-7.5。温度必须也符合待使用的一种或多种酶的活性。在优选的实施方案中,温度为1-45℃,更优选20-45℃,最优选28-42℃。选择合适的缓冲系统和稳定活性可以由本领域技术人员使用标准方法来进行,参见例如A Cornish-Bowden (1995), Fundamentals of Enzyme Kinetics, Portland Press Limited, 1995。CYP153家族的细胞色素P450单加氧酶(不具有或组合有铁氧还蛋白和铁氧还蛋白还原酶)的活性可以通过这样的测定法来测定,所述测定描述于Scheps, D., Malca, H., Hoffmann, B., Nestl, B. M和Hauer, B. (2011) Org. Biomol. Chem., 9, 6727。铁氧还蛋白和/或铁氧还蛋白还原酶是否有活性对于本领域技术人员而言,从在两种活性酶存在的情况下相比于它们不存在的情况下所述单加氧酶活性的显著增加中是显而易见。氨基转移酶活性的酶测试由Cayman Chemical Company, Ann Arbor, MI, (“Alanine Transaminase Activity Assay Kit, 产品号700260”)销售。氨基酸脱氢酶的活性可以通过Germano, H. J.和Anderson, K. E. (1968), J. Bact. 96 (1), pages 55-60来测定。
酶的使用进一步需要所有必需底物的存在。因此,除了根据本发明待反应的脂肪酸或其酯,氧和电子供体的存在对于CYP153家族的细胞色素P450单加氧酶活性是必需的。优选地,氧可以通过将包含一种或多种酶或细胞和底物的反应混合物与大气空气或者与纯氧或与富含氧的大气空气接触而使其可得,尤其优选通过当其与大气空气或者与纯氧或与富含氧的大气空气接触时搅拌反应混合物。
为了向CYP153家族的细胞色素P450单加氧酶最佳提供来自还原剂(优选NADH)的电子,优选所述单加氧酶和与其功能性相互作用的铁氧还蛋白还原酶和与其功能性相互作用的铁氧还蛋白一同使用。这些可以是分离的多肽,或在使用全细胞催化剂的情况下,是共表达多肽或是在N或C末端与CYP153家族的细胞色素P450单加氧酶融合的多肽。铁氧还蛋白还原酶或铁氧还蛋白与给定的CYP153家族的细胞色素P450单加氧酶彼此是否功能性相互作用可以由本领域技术人员通过在烷底物和所述三种多肽存在的情况下还原剂是否被氧化来容易地确定。或者,可以使用由Scheps, D., Malca, H., Hoffmann, B., Nestl, B. M和Hauer, B. (2011) Org. Biomol. Chem., 9, 6727描述的酶测试,其在功能性相互作用的多肽的情况下,展示出反应速率中的显著增加。在尤其优选的实施方案中,CYP153家族的细胞色素P450单加氧酶、铁氧还蛋白和铁氧还蛋白还原酶来源于同一生物。在尤其优选的实施方案中,它是来自泊库岛食烷菌SK2的铁氧还蛋白还原酶(数据库编号YP_691923)或其变体、来自泊库岛食烷菌SK2的铁氧还蛋白(数据库编号YP_691920)或其变体、和来自泊库岛食烷菌SK2的CYP153家族的细胞色素P450单加氧酶(数据库编号YP_691921)或其变体。
对于根据本发明使用的所有酶,它们可以是包含对应的酶促活性的多肽的细胞或其溶胞产物或以所有纯化状态的多肽制备物(从粗溶胞产物直至纯的多肽)或全细胞催化剂。多种方法对于本领域技术人员是已知的,使用所述方法可以在合适细胞中过表达酶促活性多肽并可以纯化或分离。为了表达多肽,可以使用本领域技术人员可用的所有表达系统。层析方法适合于纯化,例如在组氨酸标签的情况下使用固定的配体(例如镍离子),在与目的蛋白融合的谷胱甘肽S转移酶的情况下使用固定的谷胱甘肽,或在含麦芽糖结合蛋白的标签的情况下使用固定的麦芽糖,来亲和性层析纯化提供有标签的重组蛋白。对于大量生物技术上重要的细胞类型,例如大肠杆菌,可用于表达或过表达核酸分子的合适方法和载体是已知的,例如pET或pGEX型的载体和适合于其表达的细胞(B A Moffatt和F W Studier (1986) J. Mol. Biol. 189, 113–130, A H Rosenberg, B N Lade, D Chui, S Lin, J J Dunn和F W Studier (1987) Gene 56, 125–135以及F W Studier, A H Rosenberg, J J Dunn和J W Dubendorff (1990) Meth. Enzymol.185, 60–89)。
纯化的酶可以以可溶形式或固定化形式使用。合适的方法对于本领域技术人员是已知的,通过所述方法可以将多肽共价地或非共价地固定在有机或无机固相上,例如通过巯基偶联化学(例如来自Pierce的试剂盒)。细胞膜缔合的酶或细胞膜嵌入的酶可以以膜制备物或溶解的形式来使用。
在使用至少一种全细胞催化剂的情况下,在延长的反应时间的情况下,必须确保条件与作为全细胞催化剂使用的至少一种细胞的活力相容。本领域技术人员可以采取条件和溶液的标准操作(例如Fuchs/Schlegel (2007) Allgemeine Mikrobiologie, 2008, Georg Thieme Verlag),其允许此类细胞保持活力的状态。
在优选的实施方案中,如本文所用的术语“全细胞催化剂”理解为表示完整的、有活力的且代谢上有活性的细胞,其提供了期望的酶促活性。全细胞催化剂可以转运待代谢的底物(在本发明的情况下为醇或从其产生的氧化产物)进入细胞内部,在那它被细胞溶胶的酶所代谢,或它可以呈递目标酶在它的表面上,其中它直接暴露于培养基中的底物。本领域技术人员知晓用于产生全细胞催化剂的许多系统,例如来自DE 60216245。
当使用全细胞催化剂时,可能产生必须使底物与细胞内定位的酶接触从而使它导致期望的反应这一问题。在长链烷及其衍生物的情况下,优选全细胞催化剂具有AlkL家族的多肽。在优选的实施方案中,如本文所用的“AlkL家族的多肽”是这样的多肽,其在长度为230个连续氨基酸上与来自恶臭假单胞菌的AlkL (数据库编号CAB69081)具有至少80,优选90,更优选90%的序列同一性,并且优选具有辅助长链烷进入细胞内部的能力。在进一步的实施方案中,如本文所用的“AlkL家族的多肽”是定位于革兰氏阴性细菌的外膜内的多肽,其具有序列基序DXWAPAXQ(V/A)GXR,其中X是产生蛋白质的氨基酸,并且优选是另外来自恶臭假单胞菌的AlkL (数据库编号CAB69081)或其变体。AlkL家族的成员的实例包括来自恶臭假单胞菌的AlkL(数据库编号CAB69081)、来自水油海杆菌(Marinobacter aquaeolei) VT8的AlkL (数据库编号YP_957722)、来自亚历山大海洋柄菌(Oceanicaulis alexandrii) HTCC2633的AlkL (数据库编号ZP_00953584)、来自Marinobacter manganoxydans MnI7-9的AlkL (数据库编号ZP_09158756)、来自柄杆菌属种(Caulobacter sp.) K31的AlkL (数据库编号YP_001672217)、来自食油假单胞菌(Pseudomonas oleovorans)的AlkL(数据库编号Q00595)及其变体。
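Whether a candidate polypeptide carries the AlkL sequence motif DXWAPAXQ(V/A)GXR can be checked with a regular expression. The Python sketch below is illustrative only: it converts a three-letter-code fragment copied from SEQ ID NO: 1 of the sequence listing (the AlkL from Pseudomonas oleovorans) into one-letter code and searches it for the motif, with X read as any of the 20 proteinogenic amino acids. The helper names are assumptions, not part of the application.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# X in the motif stands for any proteinogenic amino acid.
ANY = "[ACDEFGHIKLMNPQRSTVWY]"
ALKL_MOTIF = re.compile(f"D{ANY}WAPA{ANY}Q[VA]G{ANY}R")


def to_one_letter(three_letter_sequence):
    """Convert a whitespace-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[res] for res in three_letter_sequence.split())


def find_alkl_motif(one_letter_sequence):
    """Return the first motif occurrence, or None if the motif is absent."""
    match = ALKL_MOTIF.search(one_letter_sequence)
    return match.group(0) if match else None


# Fragment copied from SEQ ID NO: 1 in the sequence listing.
SEQ1_FRAGMENT = (
    "Ala Leu Ser Ser Phe Asp Ile Lys Asp Lys Trp Ala Pro Ala Phe Gln "
    "Val Gly Leu Arg Tyr Asp Leu Gly Asn Ser Trp Met Leu Asn Ser Asp"
)

fragment = to_one_letter(SEQ1_FRAGMENT)
print(fragment)
print(find_alkl_motif(fragment))
```

A motif hit is only one of the criteria named in the text; outer-membrane localisation and sequence identity to the reference AlkL would have to be established separately.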
推荐使用分离的酶用于多种应用。在优选的实施方案中,如本文所用的术语“分离的”表示酶相比于其天然来源以更纯的和/或更浓缩的形式存在。在优选的实施方案中,如果酶是多肽酶并且构成多于60、70、80、90或优选95%的对应制备物的大量蛋白级分,则将它归类为分离的。本领域技术人员知晓用于测量溶液中蛋白的质量的许多方法,例如参考SDS-聚丙烯酰胺凝胶上对应的蛋白条带的厚度的视觉评估、基于NMR光谱或质谱的方法。
根据本发明使用的酶优选是重组的酶。在优选的实施方案中,如本文所用的术语“重组的”理解为表示对应的核酸分子天然不存在和/或它是使用遗传工程方法来产生。在优选的实施方案中,如果对应的多肽由重组核酸所编码,则使用术语重组蛋白。在优选的实施方案中,如本文所用,将重组细胞理解为表示具有至少一种重组核酸或重组多肽的细胞。适合于产生重组分子或细胞的方法是本领域技术人员已知的,例如在Sambrook/Fritsch/Maniatis (1989):Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 第2版中描述的那些。
In a preferred embodiment, the cell used as whole-cell catalyst or as expression system is a prokaryotic cell, preferably a bacterial cell. According to the invention, microorganisms are preferably used on account of their good genetic accessibility; they are selected from bacteria, in particular preferably from the following: Magnetococcus, Mariprofundus, Acetobacter, Acidiphilium, Afipia, Ahrensia, Asticcacaulis, Aurantimonas, Azorhizobium, Azospirillum, Bacillus, Bartonella tribocorum, Beijerinckia, Bradyrhizobium, Brevundimonas subvibrioides, Brucella, Caulobacter, Chelativorans, Citreicella, Citromicrobium, Corynebacterium, Dinoroseobacter, Erythrobacter, Fulvimarina, Gluconacetobacter, Granulibacter, Hirschia, Hoeflea, Hyphomicrobium, Hyphomonas, Ketogulonicigenium, Labrenzia, Loktanella, Magnetospirillum, Maricaulis, Maritimibacter, Mesorhizobium, Methylobacterium, Methylocystis, Methylosinus, Nitrobacter, Novosphingobium, Oceanibulbus, Oceanicaulis, Oceanicola, Ochrobactrum, Octadecabacter, Oligotropha, Paracoccus, Parvibaculum, Parvularcula, Pelagibaca, Phaeobacter, Phenylobacterium, Polymorphum, Pseudovibrio, Rhodobacter, Rhodomicrobium, Rhodopseudomonas, Rhodospirillum, Roseibium, Roseobacter, Roseomonas, Roseovarius, Ruegeria, Sagittula, Silicibacter, Sphingobium, Sphingomonas, Sphingopyxis, Starkeya, Sulfitobacter, Thalassiobium, Xanthobacter, Zymomonas, Agrobacterium, Rhizobium, Sinorhizobium, Anaplasma, Ehrlichia, Neorickettsia, Orientia, Rickettsia, Wolbachia, Bordetella, Burkholderia, Cupriavidus, Cupriavidus taiwanensis, Lautropia, Limnobacter, Polynucleobacter, Ralstonia, Chromobacterium, Eikenella corrodens, Basfia, Kingella, Laribacter, Lutiella, Neisseria, Simonsiella, Achromobacter, Acidovorax, Alicycliphilus, Aromatoleum, Azoarcus, Comamonas, Dechloromonas, Delftia, Gallionella, Herbaspirillum, Herminiimonas, Hylemonella, Janthinobacterium, Leptothrix, Methylibium, Methylobacillus, Methylophilales, Methyloversatilis, Methylovorus, Nitrosomonas, Nitrosospira, Oxalobacter, Parasutterella, Polaromonas, Pusillimonas, Rhodoferax, Rubrivivax, Sideroxydans, Sutterella wadsworthensis, Taylorella, Thauera, Thiobacillus, Thiomonas, Variovorax, Verminephrobacter, Anaeromyxobacter, Bdellovibrio, Bdellovibrio bacteriovorus, Bilophila, Desulfarculus, Desulfatibacillum, Desulfobacca, Desulfobacterium, Desulfobulbus, Desulfococcus, Desulfohalobium, Desulfitobacterium, Desulfomicrobium, Desulfonatronospira, Desulfotalea, Desulfovibrio, Desulfuromonas, Geobacter, Haliangium, Hippea, Lawsonia, Myxococcus, Pelobacter, Plesiocystis, Sorangium, Stigmatella, Syntrophobacter, Syntrophus, Arcobacter, Caminibacter, Campylobacter, Helicobacter, Nitratifractor, Nitratiruptor, Sulfuricurvum, Sulfurimonas, Sulfurospirillum, Sulfurovum, Wolinella, Buchnera, Blochmannia, Hamiltonella, Regiella, Riesia, Citrobacter, Cronobacter, Dickeya, Edwardsiella, Enterobacter, Erwinia, Escherichia, Klebsiella, Pantoea, Pectobacterium, Proteus, Providencia, Rahnella, Salmonella, Serratia, Shigella, Sodalis, Wigglesworthia, Glossina, Xenorhabdus, Yersinia, Acidithiobacillus, Acinetobacter, Aeromonas, Alcanivorax, Alkalilimnicola, Allochromatium, Alteromonadales, Alteromonas, Baumannia, Beggiatoa, Bermanella, Carsonella, Ruthia, Vesicomyosocius, Cardiobacterium, Chromohalobacter, Colwellia, Congregibacter, Coxiella, Dichelobacter, Endoriftia, Enhydrobacter, Ferrimonas, Francisella, Glaciecola, Hahella, Halomonas, Halorhodospira, Halothiobacillus, Idiomarina, Kangiella, Legionella, Marinobacter, Marinomonas, Methylobacter, Methylococcus, Methylomicrobium, Methylophaga, Moraxella, Moritella, Neptuniibacter, Nitrococcus, Pseudoalteromonas, Psychrobacter, Psychromonas, Reinekea, Rickettsiella, Saccharophagus, Shewanella, Succinatimonas, Teredinibacter, Thioalkalimicrobium, Thioalkalivibrio, Thiomicrospira, Tolumonas, Vibrionales, Actinobacillus, Aggregatibacter, Gallibacterium, Haemophilus, Histophilus, Mannheimia, Pasteurella, Azotobacter, Cellvibrio, Pseudomonas, Aliivibrio, Grimontia, Photobacterium, Vibrio, Pseudoxanthomonas, Stenotrophomonas, Xanthomonas, Xylella, Borrelia, Brachyspira, Leptospira, Spirochaeta, Treponema, Hodgkinia, Puniceispirillum, Liberibacter, Pelagibacter, Odyssella, Accumulibacter, in particular B. subtilis, B. megaterium, C. glutamicum, E. coli, Pseudomonas sp., Pseudomonas fluorescens, Pseudomonas putida, Pseudomonas stutzeri, Acinetobacter sp., Burkholderia sp., Burkholderia thailandensis, cyanobacteria, Klebsiella sp., Klebsiella oxytoca, Salmonella sp., Rhizobium sp. and Rhizobium meliloti, with E. coli being particularly preferred.
在优选的实施方案中,术语“CYP153家族的细胞色素P450单加氧酶”理解为表示作为3组分系统的部分的具有烷结合位点和羟基化烷的能力的细胞溶胶氧化酶,所述3组分系统还包含铁氧还蛋白和铁氧还蛋白还原酶。在尤其优选的实施方案中,它是与来自泊库岛食烷菌SK2的CYP153家族的细胞色素P450单加氧酶(数据库编号YP_691921)具有至少80,优选90,最优选95或99%的序列同一性的酶,或是包含与来自泊库岛食烷菌SK2的CYP153家族的细胞色素P450单加氧酶(数据库编号YP_691921)具有至少80,优选90,最优选95或99%的序列同一性的多肽序列并此外具有烷羟化酶活性的酶。在优选的实施方案中,如本文所用的术语“烷羟化酶活性”理解为表示催化烷或包含至少5个、优选12个碳原子的未取代的直链烷基的羟基化的能力。在进一步优选的实施方案中,术语“CYP153家族的细胞色素P450单加氧酶”理解为表示非膜结合的氧化酶,其包括对于烷、包含至少5个、优选12个碳原子的未取代的直链烷基或单羟基化的烷的结合位点,并且其多肽链包括基序LL(I/L)(V/I)GGNDTTRN。在优选的实施方案中,如本文所用的“CYP153家族的细胞色素P450单加氧酶”是来自泊库岛食烷菌SK2的CYP153家族的细胞色素P450单加氧酶(数据库编号YP_691921)或变体,其优选具有烷羟化酶活性。
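The motif notation used in this definition, LL(I/L)(V/I)GGNDTTRN, lists alternative residues in parentheses. A small helper can turn that notation into a regular expression so that a polypeptide sequence can be screened for the motif. The Python sketch below is an illustration under assumptions: the function name is hypothetical, and the two test sequences are made up for demonstration and are not real CYP153 sequences.

```python
import re

ANY_RESIDUE = "[ACDEFGHIKLMNPQRSTVWY]"


def motif_to_regex(motif):
    """Translate patent-style motif notation into a compiled regex.

    "(I/L)" becomes the character class "[IL]"; a bare "X" becomes
    "any proteinogenic amino acid".
    """
    pattern = re.sub(
        r"\(([A-Z](?:/[A-Z])+)\)",
        lambda m: "[" + m.group(1).replace("/", "") + "]",
        motif,
    )
    pattern = pattern.replace("X", ANY_RESIDUE)
    return re.compile(pattern)


cyp153_motif = motif_to_regex("LL(I/L)(V/I)GGNDTTRN")

# Made-up sequences, for illustration only.
with_motif = "MKTAALLIVGGNDTTRNKQ"
without_motif = "MKTAALLAVGGNDTTRNKQ"

print(cyp153_motif.pattern)
print(bool(cyp153_motif.search(with_motif)))
print(bool(cyp153_motif.search(without_motif)))
```

Presence of the motif alone does not establish alkane hydroxylase activity; the text treats activity and sequence identity as separate criteria.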
CYP153家族的细胞色素P450单加氧酶用于烷的羟基化的用途描述于现有技术中,如测定酶活性的酶测试和用于表达和纯化的方法(Scheps, D., Malca, H., Hoffmann, B., Nestl, B. M和Hauer, B. (2011) Org. Biomol. Chem., 9, 6727)。除了待氧化的烷或包含至少5个、优选12个碳原子的未取代的直链烷基外,参与酶反应的底物包含氧和电子,所述电子以NADH的形式优选经其他两个组分铁氧还蛋白和铁氧还蛋白还原酶转移至氧化酶。Scheps等(2011)以及Roome, P. W., Jr., Philley, J. C.和Peterson (1983) J. Biol. Chem. 258, 2593;Roome, P.W.和Peterson, J. A. (1988), Arch.Biochem.Biophys., 266, 41以及Peterson, J. A., Lorence, M. C.和Amarneh, B. (1990) J. Biol. Chem, 265, 6066还公开了获得以功能形式的铁氧还蛋白和铁氧还蛋白还原酶的方法。
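According to the invention, an alcohol dehydrogenase is used in step b) to oxidise the fatty alcohol produced in step a). Alcohol dehydrogenases have for decades been an intensively studied class of enzymes of great biotechnological interest in biochemistry, in connection with brewing and fermentation processes; the class comprises several groups of isoforms. Thus there are membrane-bound, flavin-dependent alcohol dehydrogenases of the Pseudomonas putida GPO1 AlkJ type, which use a flavin cofactor instead of NAD+. Another group comprises iron-containing alcohol dehydrogenases, which are sensitive to oxygen and are found in bacteria and, in inactive form, in yeast. Another group comprises NAD+-dependent alcohol dehydrogenases, among them the zinc-containing alcohol dehydrogenases, whose active centre has a cysteine-coordinated zinc atom that fixes the alcohol substrate. In a preferred embodiment, the term "alcohol dehydrogenase" as used herein is understood to mean an enzyme that oxidises a primary or secondary alcohol to the corresponding aldehyde or ketone, or catalyses the reverse reduction. Preferably, the alcohol dehydrogenase in the method of the invention is an NAD+-dependent alcohol dehydrogenase, that is, an alcohol dehydrogenase that uses NAD+ as cofactor to oxidise the alcohol, or NADH to reduce the corresponding aldehyde or ketone. In the most preferred embodiment, the alcohol dehydrogenase is an NAD+-dependent, zinc-containing alcohol dehydrogenase.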
在进一步优选的实施方案中,醇脱氢酶是葡萄糖-甲醇-胆碱-氧化还原酶家族的醇脱氢酶氧化还原酶。在进一步优选的实施方案中,如本文所用的术语“葡萄糖-甲醇-胆碱-氧化还原酶家族的氧化还原酶”理解为表示包含FAD作为辅因子的醇脱氢酶,并且优选为来自恶臭假单胞菌的酶(数据库编号CAB69081)或其变体。实例包括SEQ ID No. 26、27、28和29,并且也包括来自恶臭假单胞菌的酶(数据库编号CAB69081)。
在进一步优选的实施方案中,醇脱氢酶是含黄素的醇脱氢酶家族的醇脱氢酶。在优选的实施方案中,如本文所用的术语“含黄素的醇脱氢酶的家族”理解为表示这样的醇氧化酶的组,其构成了c型的血红素蛋白,并且包含FAD作为辅因子,且优选另外属于具有数据库编号AAS46878.1的酶及其变体的组。含黄素的醇脱氢酶的实例包括来自热带念珠菌具有数据库编号AAS46878.1、AAS46880.1的酶和来自Candida cloacae具有数据库编号CAB75351.1的酶。
根据本发明,在步骤c)中使用氨基转移酶。在优选的实施方案中,如本文所用的术语“氨基转移酶”理解为表示这样的酶,其催化α-氨基从供体(优选氨基酸)转移至受体分子,优选α-酮羧酸。在优选的实施方案中,如本文所用的术语“胺供体”理解为表示L-氨基酸,其氨基可以从氨基转移酶转移至α-酮羧酸,并形成氨基酸。在尤其优选的实施方案中,胺供体是L-丙氨酸。在优选的实施方案中,氨基转移酶选自氨基转移酶及其变体的组,其特征在于它在对应于恶臭假单胞菌GB-1的氨基转移酶(数据库编号YP_001668026.1)的Val224的氨基酸序列的位置处具有选自异亮氨酸、缬氨酸、苯丙氨酸、甲硫氨酸和亮氨酸的氨基酸,并且在对应于恶臭假单胞菌GB-1的氨基转移酶(数据库编号YP_001668026.1)的Gly230的氨基酸序列的位置处具有除苏氨酸之外的氨基酸,并且优选是选自丝氨酸、半胱氨酸、赖氨酸和丙氨酸的氨基酸。在尤其优选的实施方案中,氨基转移酶选自来自青紫色素杆菌(Chromobacterium violaceum)DSM30191的ω-氨基转移酶,来自恶臭假单胞菌GB-1、恶臭假单胞菌W619、来自铜绿假单胞菌(Pseudomonas aeruginosa)PA01、天蓝色链霉菌(Streptomyces coelicolor)A3(2)、恶臭假单胞菌(数据库编号YP_001668026)、恶臭假单胞菌(数据库编号YP_001668026.1或YP_001671460)、类球红细菌(Rhodobacter sphaeroides)(菌株ATCC 17023;数据库编号YP_353455)和阿维链霉菌(Streptomyces avermitilis)MA 4680的氨基转移酶及其变体。
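Whereas the alcohol dehydrogenase in step b), if it is an NAD(P)-dependent alcohol dehydrogenase, consumes one molecule of the oxidised redox cofactor NAD(P)+ per substrate molecule converted and releases NAD(P)H, the amino acid dehydrogenase oxidises NAD(P)H. It is therefore particularly advantageous to use a system in which the alcohol dehydrogenase and the amino acid dehydrogenase convert the same redox cofactor. NADP-dependent alcohol dehydrogenases include the enzyme from E. coli (YjgB, database code ZP_07117674) and a further enzyme from E. coli (YahK, database code BAE76108.1). NAD-dependent alcohol dehydrogenases include an enzyme from E. coli (AdhP, database code ZP_07145023), the enzyme from Bacillus subtilis (database code NP_391071), the enzyme from Bacillus stearothermophilus (database code P42328.1) and the enzyme from Rhizobium leguminosarum (database code YP_002975437). NADP-dependent alanine dehydrogenases include the enzyme from Rhodobacter capsulatus (database code ADE84249.1). NAD-dependent alanine dehydrogenases include the alanine dehydrogenase from Bacillus subtilis subsp. subtilis str. 168 (database code NP_391071).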
本发明的教导不仅仅可以使用本文描述的生物大分子的准确的氨基酸或核酸序列来进行或应用于其上,还可以使用那些大分子的变体(其可以通过缺失、添加或取代一个或多于一个的氨基酸或核酸来获得)来进行或应用于其上。在优选的实施方案中,如本文所用的术语核酸序列或氨基酸序列的“变体”(其在下文中用术语“同源物(Homologon)”同义或可交换使用)意指另一条核酸或氨基酸序列,其相对于对应的原始野生型核酸或氨基酸序列,具有70、75、80、85、90、92、94、96、98、99%或更高百分比的同源性(本文用同一性同义使用),其中优选除形成催化活性中心的氨基酸或对于结构或折叠必需的氨基酸之外的氨基酸被缺失或取代或者此类仅保守取代,例如谷氨酸替代天冬氨酸或亮氨酸替代缬氨酸。现有技术描述了可用于计算两条序列同源性程度的算法,例如Arthur Lesk (2008), Introduction to Bioinformatics, 第3版。在本发明进一步更优选的实施方案中,氨基酸或核酸序列的变体,优选除上述序列同源性之外,具有与野生型分子或原始分子基本上相同的酶促活性。例如,有蛋白酶酶促活性的多肽的变体具有与多肽酶相同或基本上相同的蛋白水解活性,即催化肽键水解的能力。在一个具体的实施方案中,术语“基本上相同的酶促活性”表示这样的活性,其对于野生型多肽的底物而言,明确地在本底活性之上或/和在KM和/或k催化值上与野生型多肽对于相同底物展示出的不同,低3个、优选2个、更优选1个数量级。在进一步优选的实施方案中,术语核酸或氨基酸序列的“变体”包括核酸或氨基酸序列的至少一个活性部分/或片段。在进一步优选的实施方案中,如本文所用的术语“活性部分”表示氨基酸序列或核酸序列,其具有少于氨基酸序列全长的序列或编码少于氨基酸序列全长的序列,其中比野生型氨基酸序列长度短的氨基酸序列或编码的氨基酸序列基本上具有与野生型多肽或其变体相同的酶促活性,例如蛋白酶。在一个具体的实施方案中,术语核酸的“变体”包括其互补链的优选在严格条件下与野生型核酸结合的核酸。杂交反应的严格性可以由本领域技术人员容易地确定,并通常依赖于探针的长度、洗涤温度和盐浓度。通常,更长的探针需要更高的杂交温度,而更短的探针在更低的温度下操作。杂交是否发生通常依赖于变性DNA与其环境中存在的互补链退火的能力以及低于解链温度。杂交反应的严格性和相应的条件详细描述于F M Ausubel (1995), Current Protocols in Molecular Biology. 
John Wiley & Sons, Inc。用于通过杂交鉴定DNA序列的说明可以由本领域技术人员尤其在来自Boehringer Mannheim GmbH的手册“The DIG System Users Guide for Filter Hybridization” (Mannheim, Germany, 1993)和在Liebl等(International Journal of Systematic Bacteriology 41: 255-260 (1991))中发现。在优选的实施方案中杂交在严格条件下进行,即仅形成其中探针和目标序列(即,用探针处理的多核苷酸)为至少70%相同的杂交物。已知杂交的严格性(包括洗涤步骤)受缓冲液组成、温度及盐浓度的变化所影响和/或由其确定。相比于洗涤步骤,杂交反应通常在相对低的严格性下进行(Hybaid Hybridisation Guide, Hybaid Limited, Teddington, UK, 1996)。对于杂交反应,例如,对应于5x SSC缓冲液的缓冲液可以在约50℃ - 68℃的温度下使用。其中,探针还可以与多核苷酸(其与探针序列具有少于70%的同一性)杂交。此类杂交物是更不稳定的,并且在严格条件下通过洗涤而去除。这可以通过例如降低盐浓度至2x SSC且任选随后0.5x SSC来实现(The DIG System User′s Guide for Filter Hybridisation, Boehringer Mannheim, Mannheim, Germany, 1995),在这种情况下建立约50℃–68℃、约52℃–68℃、约54℃–68℃、约 56℃–68℃、约58℃–68℃、约60℃–68℃、约62℃–68℃、约64℃–68℃、约66℃–68℃的温度,这是渐增的优选次序。优选约64℃–68℃或约66℃–68℃的温度范围。任选可以降低盐浓度至对应于0.2 x SSC或0.1 x SSC的浓度。借助于在步骤中杂交温度从50℃至68℃约1–2℃的逐步提高,可以分离多核苷酸片段,例如其与使用的核酸分子序列具有至少70%或至少80%或至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%或至少99%的同一性,这是渐增的优选次序。关于杂交的进一步说明可以以所谓试剂盒的形式商购可得(例如来自Roche Diagnostics GmbH, Mannheim, Germany,的DIG Easy Hyb,目录号1603558)。在优选的实施方案中,如本文所用,术语核酸的“变体”包含编码与原始核酸相同的氨基酸序列的或在根据遗传密码简并性的情况下该氨基酸序列的变体的任何核酸序列。
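If a whole-cell catalyst is used according to the invention, preference is given to a cell that has, compared with its wild type, a reduced activity of at least one enzyme that catalyses one of the reactions of the β-oxidation of fatty acids, the enzyme preferably being selected from fatty acid importer, fatty acid-CoA ligase, acyl-CoA dehydrogenase, 2,4-dienoyl-CoA reductase, enoyl-CoA hydratase and 3-ketoacyl-CoA thiolase. The β-oxidation of fatty acids is a well-known metabolic pathway that allows prokaryotic and eukaryotic organisms alike to oxidise fatty acids and to make the chemical energy contained in them available to metabolism (Y Fujita, H Matsuoka and K Hirooka (2007) Mol. Microbiology 66(4), 829-839). In the broader sense, it begins with the uptake of a fatty acid into the cell, in the case of E. coli by means of the transporter FadL (P N Black (1991) J. Bacteriol. 173, 435-442), which channels it through the outer or inner membrane of the Gram-negative bacterial cell, and of the FadD gene product (P N Black, C C DiRusso, A K Metzger and T L Heimert (1992) J. Biol. Chem. 267, 25513-25520), which releases the fatty acid into the cytosol in the form of the CoA ester. There, if conditions require, the fatty acid is first oxidised at the β position of the CoA fatty acid ester by an acyl-CoA dehydrogenase, in the case of E. coli FadE (J. W. Campbell & J. E. Cronan (2002) J. Bacteriol. 184, 3759-3764). A similar molecule can alternatively be formed from a doubly unsaturated fatty acid through reduction by a 2,4-dienoyl-CoA reductase, in the case of E. coli FadH. A multifunctional enzyme, enoyl-CoA hydratase/3-hydroxyacyl-CoA dehydrogenase (FadB in E. coli), then catalyses the hydration to form the secondary alcohol and its subsequent oxidation to the ketone. In the final step, a 3-ketoacyl-CoA thiolase (FadA in E. coli) catalyses the cleavage of the ketoacyl-CoA, releasing acetyl-CoA and a CoA ester of the fatty acid that is two carbon atoms shorter than the starting molecule. Unless it is itself acetyl-CoA, the latter re-enters the β-oxidation cycle and is shortened by oxidation. Also involved in regulating the β-oxidation of fatty acids is FadR, a regulator of the Fad operon, which comprises the genes required for degrading fatty acids; FadR does not itself catalyse a reaction of β-oxidation. In a preferred embodiment, the term "enzyme that catalyses one of the reactions of the β-oxidation of fatty acids" is understood to mean any enzyme that interacts directly with the fatty acid substrate or with a molecule formed from it on the path to acetyl-CoA, preferably recognising it as a substrate, and that catalyses its conversion to a metabolite lying closer to acetyl-CoA on this degradation pathway, preferably including the fatty acid importer, which effects uptake of the fatty acid into the cell. For example, according to the above definition these enzymes include acyl-CoA dehydrogenase, since it interacts with the fatty acid-CoA ester and catalyses its conversion to enoyl-CoA, which lies closer to acetyl-CoA on the β-oxidation pathway than the fatty acid-CoA ester does. In a preferred embodiment, the term "enzyme that catalyses one of the reactions of the β-oxidation of fatty acids" as used herein is understood to mean any enzyme from the group comprising the gene products FadA, FadB, FadD, FadL and FadE from E. coli and/or variants thereof or homologues from other organisms. The gene products FadA, FadB, FadD, FadL and FadE from E. coli, as well as variants and homologues from a large number of other biotechnologically useful organisms and their nucleic acid and polypeptide sequences, are described in the prior art, for example FadA under accession number AP009048.1, FadB under accession number BAE77457.1, FadD under accession number BAA15609.1, FadE under accession number BAA77891.2 and FadL under accession number BAA16205.1.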
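With the development of modern methods of genetics, microbiology and molecular biology, numerous tools are available to the person skilled in the art with which the enzymatic activities present in living cells can be routinely measured and influenced. To determine the activity of an enzyme that is present as a suspension of a pellet, or that can be obtained in processed form from a cell culture, standard enzymatic tests can be used and evaluated as described in textbooks, for example A Cornish-Bowden (1995), Fundamentals of Enzyme Kinetics, Portland Press Limited. The prior art discloses numerous tests that are particularly suitable for measuring the activity of enzymes catalysing one of the reactions of the β-oxidation of fatty acids, for example in K Kameda & W D Nunn (1981) J. Biol. Chem. 256, 5702-5707; H Marrakchi, W E DeWolf, C Quinn, J West, B J Polizzi, C Y So et al. (2003) Biochem. J. 370, 1055-1062; S Lobo, G Florova and K A Reynolds (2001) Biochemistry 40 (39), 11955-64; and X Yu, T Liu, F Zhu and C Khosla (2011) PNAS, electronic publication ahead of print. Routinely applicable methods for reducing the activity of an enzyme in a cell, for example undirected mutagenesis of cells by exposure to radioactive radiation followed by enrichment or screening of the mutants, site-directed introduction of point mutations, or knockout of a gene that is chromosomally integrated in the cell and codes for an active enzyme, are likewise described in the prior art, for example in Sambrook/Fritsch/Maniatis (1989): Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 2nd edition, or in Fuchs/Schlegel (2007) Allgemeine Mikrobiologie, 2008, Georg Thieme Verlag. In the specific case of the Fad gene products, overexpression of a transcriptional repressor, for example FadR (Y Fujita, H Matsuoka and K Hirooka (2007) Mol. Microbiology 66(4), 829-839), is also suitable for reducing the activity. A reduction in activity based on RNA interference (T Tuschl (2001) ChemBioChem 2: 239-245) or the use of specific inhibitors is also possible. In a preferred embodiment, the wording "wherein the cell has a reduced activity compared with its wild type" of an enzyme, as used herein, means that the activity of the enzyme in the modified cell is reduced compared with the activity of the same enzyme in a wild-type cell. In a preferred embodiment, the relative reduction is, in increasing order of preference, 5, 10, 20, 40, 50, 75, 90, 95, 99 or more percent of the activity. In a particularly preferred embodiment, activity of the enzyme can no longer be detected above the background.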
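If a whole-cell catalyst is used according to the invention, it is further advantageous if the whole-cell catalyst is a cell in which the activity of at least one endogenous aldehyde dehydrogenase is reduced relative to the wild type of the cell. In a preferred embodiment, the term "endogenous aldehyde dehydrogenase" as used herein is understood to mean an enzyme that is able to catalyse the oxidation of an aldehyde to the corresponding carboxylic acid and that is naturally present in the genome of the wild type of the cell used. One example of an aldehyde dehydrogenase endogenous to E. coli is the enzyme with database code BAA15032.1 (AldA) and variants thereof.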
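Example 5 (prophetic)
Production of methyl hydroxylaurate by E. coli strains carrying an expression vector with the genes CYP153, Fd and FdOR from Alcanivorax borkumensis SK2 and alkL from Pseudomonas oleovorans, or with the genes CYP52A12 and OR from Candida tropicalis and alkL from Pseudomonas oleovorans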
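To generate E. coli strains carrying the expression vector pCOM[Ab_Fd / CYP153-2 / FdOR / alkL] or pCOM10-Ct CYP52A12_co plus OR_co, electrocompetent cells of E. coli W3110 are prepared in a manner known to the person skilled in the art. E. coli W3110 is transformed in each case with one of the two plasmids listed and plated onto LB agar plates containing kanamycin (50 μg/ml). Transformants are checked for the presence of the correct plasmid by plasmid preparation and analytical restriction analysis. The following strains are constructed in this way:
· E. coli W3110 pCOM[Ab_Fd / CYP153-2 / FdOR / alkL]
· E. coli W3110 pCOM10-Ct CYP52A12_co plus OR_co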
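The strains are subjected to fed-batch fermentation to investigate their ability to produce HLSME (methyl hydroxylaurate). This is carried out in an 8-fold parallel fermentation system from DASGIP.
1 L reactors equipped with overhead stirrers and impeller turbines are used for the fermentation. pH and pO2 are measured online to monitor the process. OTR/CTR measurements serve, among other things, to estimate the metabolic activity and fitness of the cells.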
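The pH probes are calibrated by two-point calibration with measurement solutions of pH 4.0 and pH 7.0 according to the DASGIP technical reference. The reactors are fitted with the required sensors and connections according to the technical reference, and the stirrer shaft is installed. The reactors are then filled with 300 ml of water and autoclaved at 121 °C for 20 min to ensure sterility. After connection to the measuring amplifier, the pO2 probes are polarised overnight (at least 6 h). The water is then removed on the clean bench and replaced by high-cell-density medium consisting of (NH4)2SO4 1.76 g/l, K2HPO4 19.08 g/l, KH2PO4 12.5 g/l, yeast extract 6.66 g/l, trisodium citrate dihydrate 11.2 g/l, 17 ml/l of a filter-sterilised 1% strength ferric ammonium citrate solution and 5 ml/l of a filter-sterilised trace element stock solution (consisting of HCl (37%) 36.50 g/l, MnCl2*4H2O 1.91 g/l, ZnSO4*7H2O 1.87 g/l, ethylenediaminetetraacetic acid dihydrate 0.84 g/l, H3BO3 0.30 g/l, Na2MoO4*2H2O 0.25 g/l, CaCl2*2H2O 4.70 g/l, FeSO4*7H2O 17.80 g/l, CuCl2*2H2O 0.15 g/l), with 15 g/l glucose as carbon source (added by metering in 30 ml/l of a sterile feed solution consisting of 500 g/l glucose, 1% (w/v) MgSO4*7H2O and 2.2% (w/v) NH4Cl) and 50 mg/l kanamycin.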
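The medium recipe can be cross-checked with simple stock-dilution arithmetic: metering in 30 ml/l of a 500 g/l glucose feed solution should give the stated 15 g/l glucose. The Python sketch below works this out, together with the contributions of the other stock components. It is illustrative only; the function names are hypothetical, the 1% ferric ammonium citrate solution is assumed to be w/v, and the ml/l doses are taken as relative to the final medium volume.

```python
def percent_wv_to_g_per_l(percent_wv):
    """1% (w/v) corresponds to 1 g per 100 ml, i.e. 10 g/l."""
    return percent_wv * 10.0


def contribution_g_per_l(stock_g_per_l, dose_ml_per_l):
    """Final concentration contributed by dosing a stock at dose_ml_per_l."""
    return stock_g_per_l * dose_ml_per_l / 1000.0


FEED_DOSE_ML_PER_L = 30.0   # sterile feed solution, from the recipe
IRON_DOSE_ML_PER_L = 17.0   # ferric ammonium citrate solution, from the recipe

contributions = {
    "glucose": contribution_g_per_l(500.0, FEED_DOSE_ML_PER_L),
    "MgSO4*7H2O": contribution_g_per_l(percent_wv_to_g_per_l(1.0), FEED_DOSE_ML_PER_L),
    "NH4Cl": contribution_g_per_l(percent_wv_to_g_per_l(2.2), FEED_DOSE_ML_PER_L),
    # Assumption: "1% strength" is read as 1% (w/v).
    "ferric ammonium citrate": contribution_g_per_l(percent_wv_to_g_per_l(1.0), IRON_DOSE_ML_PER_L),
}

for name, value in contributions.items():
    print(f"{name}: {value:.2f} g/l")
```

The glucose figure reproduces the 15 g/l given in the recipe, which suggests the dose and stock concentration are internally consistent.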
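Subsequently, the pO2 probes are calibrated to 100% by one-point calibration (stirrer: 600 rpm / gassing: 10 sL/h air), and the feed, correction agent and inducer lines are cleaned by clean-in-place according to the technical reference. For this, the tubes are first flushed with 70% ethanol, then with 1 M NaOH, then with sterile demineralised water, and finally filled with the respective media.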
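All of the E. coli strains mentioned above are first grown from a frozen culture overnight for about 18 h at 37 °C and 200 rpm in LB medium (25 ml in a 100 ml ground-glass flask) containing 50 mg/l kanamycin. Subsequently, 2 ml of this culture are transferred, in a second preculture stage, into 25 ml of high-cell-density medium in a 100 ml shake flask and incubated for a further 6 h at 37 °C / 200 rpm. The medium consists of (NH4)2SO4 1.76 g/l, K2HPO4 19.08 g/l, KH2PO4 12.5 g/l, yeast extract 6.66 g/l, trisodium citrate dihydrate 11.2 g/l, 17 ml/l of a filter-sterilised 1% strength ferric ammonium citrate solution and 5 ml/l of a filter-sterilised trace element stock solution (consisting of HCl (37%) 36.50 g/l, MnCl2*4H2O 1.91 g/l, ZnSO4*7H2O 1.87 g/l, ethylenediaminetetraacetic acid dihydrate 0.84 g/l, H3BO3 0.30 g/l, Na2MoO4*2H2O 0.25 g/l, CaCl2*2H2O 4.70 g/l, FeSO4*7H2O 17.80 g/l, CuCl2*2H2O 0.15 g/l), with 15 g/l glucose as carbon source (added by metering in 30 ml/l of a sterile feed solution consisting of 500 g/l glucose, 1% (w/v) MgSO4*7H2O and 2.2% (w/v) NH4Cl) and the antibiotic already described.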
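To inoculate the reactors to an optical density of 0.1, the OD600 of the second preculture stage is measured and the amount of culture required for inoculation is calculated. The required amount of culture is added through a septum into the temperature-controlled and aerated reactor using a 5 ml syringe.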
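The inoculum calculation mentioned here is a simple mixing balance. The Python sketch below shows one way to do it, assuming that OD600 scales linearly with cell concentration, that the sterile medium has an OD600 of zero, and that the added inoculum increases the total volume. The function name, the medium volume of 300 ml and the preculture OD600 of 8.0 are illustrative assumptions, not values stated for this step.

```python
def inoculum_volume_ml(preculture_od600, medium_volume_ml, target_od600=0.1):
    """Volume of preculture needed so that the mixed culture starts at target_od600.

    Mixing balance: v * OD_pre = (V + v) * OD_target
    =>  v = OD_target * V / (OD_pre - OD_target)
    """
    if preculture_od600 <= target_od600:
        raise ValueError("preculture OD600 must exceed the target OD600")
    return target_od600 * medium_volume_ml / (preculture_od600 - target_od600)


# Hypothetical measurement: preculture OD600 of 8.0, 300 ml of medium in the reactor.
volume = inoculum_volume_ml(preculture_od600=8.0, medium_volume_ml=300.0)
print(f"inoculum: {volume:.2f} ml")
```

For dense precultures the simpler approximation v ≈ OD_target * V / OD_pre gives nearly the same result; the exact form is used here so that the check on the resulting OD holds.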
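The following standard program is used:
The pH is controlled on one side only, at pH 6.8, using 12.5% strength ammonia solution. During cultivation and biotransformation, the dissolved oxygen (pO2 or DO) in the culture is regulated to at least 30% via the stirrer speed and the gassing rate. After inoculation, the DO falls from 100% to this 30%, where it is kept stable for the remainder of the fermentation.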
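The fermentation is carried out as a fed batch. The feed, 5 g/l*h of glucose supplied as a solution of 500 g/l glucose, 1% (w/v) MgSO4*7H2O and 2.2% (w/v) NH4Cl, is triggered by the DO peak that indicates the end of the batch phase and marks entry into the feed phase. After the start of the feed, the temperature is lowered from 37 °C to 30 °C. 10 h after the start of the feed, expression of the oxidation genes is induced with 0.025% (v/v) DCPK. Production of methyl hydroxylaurate (= start of the biotransformation) begins 14 h after the start of the feed. For this purpose, 150 ml of a mixture of methyl laurate and oleic acid (technical grade, 90%) are added to the fermentation broth as a batch.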
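The feed and induction settings translate into pump rates and dosing volumes once a culture volume is fixed. The Python sketch below illustrates the arithmetic for the stated glucose feed of 5 g/l*h from a 500 g/l feed solution and for the 0.025% (v/v) DCPK induction. The culture volume of 0.3 l and the function names are assumptions made for the example; the calculation also ignores the volume change caused by feeding.

```python
def feed_pump_rate_ml_per_h(glucose_rate_g_per_l_h, culture_volume_l, feed_glucose_g_per_l):
    """Pump rate of feed solution needed to deliver the specific glucose feed rate."""
    grams_per_hour = glucose_rate_g_per_l_h * culture_volume_l
    return grams_per_hour / feed_glucose_g_per_l * 1000.0


def inducer_volume_ul(percent_vv, culture_volume_l):
    """Volume of neat inducer for a given % (v/v) in the culture."""
    return percent_vv / 100.0 * culture_volume_l * 1_000_000.0


CULTURE_VOLUME_L = 0.3  # assumption for illustration

rate = feed_pump_rate_ml_per_h(5.0, CULTURE_VOLUME_L, 500.0)
dcpk = inducer_volume_ul(0.025, CULTURE_VOLUME_L)
print(f"feed pump rate: {rate:.1f} ml/h, DCPK: {dcpk:.0f} µl")
```

In a real run the culture volume grows with feeding and base addition, so a controller would normally recompute the pump rate over time; the sketch shows only the starting point.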
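To quantify LSME (methyl laurate) and HLSME in fermentation samples, samples are taken 1, 2, 4, 20 and 22 h after the start of the biotransformation and prepared for analysis (see "LC-ESI/MS2-based quantification of products").
LC-ESI/MS2-based quantification of products
LSME and HLSME in fermentation samples are quantified by LC-ESI/MS2 against an external calibration for all analytes (0.1–50 mg/l), using the internal standards aminoundecanoic acid (AUD, for HLSME) and d3-LSME (for LSME).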
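Quantification against an external calibration with an internal standard usually means fitting the analyte-to-internal-standard response ratio against the calibrant concentration and then inverting the fit for unknown samples. The Python sketch below illustrates this with an unweighted least-squares line. All peak areas and calibration points are invented for the example, the function names are hypothetical, and the actual evaluation used for this method (for example weighting or a different curve model) is not specified in the text.

```python
def fit_line(x_values, y_values):
    """Unweighted least-squares fit y = slope * x + intercept."""
    n = len(x_values)
    mean_x = sum(x_values) / n
    mean_y = sum(y_values) / n
    sxx = sum((x - mean_x) ** 2 for x in x_values)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(x_values, y_values))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def quantify(area_analyte, area_istd, slope, intercept):
    """Invert the calibration line for the measured response ratio."""
    ratio = area_analyte / area_istd
    return (ratio - intercept) / slope


# Invented calibration data spanning the stated range of 0.1-50 mg/l.
calib_conc_mg_per_l = [0.1, 1.0, 10.0, 50.0]
calib_ratio = [0.02 * c + 0.001 for c in calib_conc_mg_per_l]

slope, intercept = fit_line(calib_conc_mg_per_l, calib_ratio)
result = quantify(area_analyte=4010.0, area_istd=10000.0, slope=slope, intercept=intercept)
print(f"{result:.2f} mg/l")
```

Because the calibration range covers more than two orders of magnitude, a weighted fit (for example 1/x) is often preferred in practice so that the low standards are not dominated by the high ones.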
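The following equipment is used:
· HPLC system 1260 (Agilent; Böblingen) with autosampler (G1367E), binary pump (G1312B) and column oven (G1316A)
· Mass spectrometer TripleQuad 6410 (Agilent; Böblingen) with ESI source
· HPLC column: Kinetex C18, 100 x 2.1 mm, particle size 2.6 μm, pore size 100 Å (Phenomenex; Aschaffenburg)
· Precolumn: KrudKatcher Ultra HPLC in-line filter; 0.5 μm filter depth and 0.004 mm internal diameter (Phenomenex; Aschaffenburg).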
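Samples are prepared by pipetting 1900 μL of solvent (80% (v/v) acetonitrile, 20% (v/v) double-distilled water and 0.1% formic acid) and 100 μL of sample into a 2 mL reaction vessel. The mixture is vortexed for about 10 seconds and then centrifuged at about 13 000 rpm for 5 min. The clear supernatant is removed with a pipette and analysed after suitable dilution with diluent (80% (v/v) ACN, 20% (v/v) double-distilled water and 0.1% formic acid). 100 μL of ISTD are pipetted into each 900 μL of sample (10 μL for a sample volume of 90 μL).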
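The sample preparation implies fixed dilution factors that have to be applied when a measured concentration is converted back to the concentration in the fermentation broth. The Python sketch below works through them: 100 μL sample plus 1900 μL solvent is a 20-fold dilution, and 100 μL ISTD added to 900 μL sample is a further factor of 10/9. The function names are hypothetical, and whether the ISTD step has to be included depends on how the calibration standards are prepared, which the text does not state; it is included here as an assumption.

```python
def dilution_factor(sample_volume_ul, added_volume_ul):
    """Factor by which the sample is diluted when added_volume_ul is mixed in."""
    return (sample_volume_ul + added_volume_ul) / sample_volume_ul


def broth_concentration_mg_per_l(measured_mg_per_l, extra_dilution=1.0):
    """Back-calculate the broth concentration from the measured value.

    extra_dilution is the optional "suitable dilution" of the supernatant.
    """
    extraction = dilution_factor(100.0, 1900.0)  # 100 µl sample + 1900 µl solvent
    istd_step = dilution_factor(900.0, 100.0)    # 900 µl sample + 100 µl ISTD
    return measured_mg_per_l * extraction * extra_dilution * istd_step


print(dilution_factor(100.0, 1900.0))
print(round(broth_concentration_mg_per_l(1.0), 2))
```

The alternative volumes given in the text (10 μL ISTD for 90 μL sample) produce the same 10/9 factor, so the back-calculation does not change with the smaller sample size.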
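The HPLC separation is carried out with the column and precolumn mentioned above. The injection volume is 0.7 μl, the column temperature 50 °C and the flow rate 0.6 ml/min. The mobile phase consists of eluent A (0.1% strength (v/v) aqueous formic acid) and eluent B (acetonitrile with 0.1% (v/v) formic acid). The following gradient profile is used:
The ESI-MS2 analysis is carried out in positive mode with the following ESI source parameters:
· Gas temperature 280 °C
· Gas flow rate 11 L/min
· Nebuliser pressure 50 psi
· Capillary voltage 4000 V.
The compound HLSME is detected and quantified with the following MRM parameters, with one product ion used as qualifier and one as quantifier in each case:
The analytes are detected in SIM mode (m/z 201 and 215).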
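The E. coli strain W3110 pCOM[Ab_Fd / CYP153-2 / FdOR / alkL] is found to be able to form methyl ω-hydroxylaurate from methyl laurate. The strain E. coli W3110 pCOM10-Ct CYP52A12_co plus OR_co is able to convert methyl laurate to methyl ω-hydroxylaurate or other oxidation products only to a lesser extent.
The features of the invention disclosed in the foregoing description, the claims and the examples may be essential, both individually and in any desired combination, for realising the invention in its various embodiments.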
Sequence listing
<110> Evonik Industries AG
<120> Enzymatic omega-oxidation and omega-amination of fatty acids
<130> 2012E00056 DE
<160> 66
<170> PatentIn version 3.5
<210> 1
<211> 230
<212> PRT
<213> Pseudomonas oleovorans
<400> 1
Met Ser Phe Ser Asn Tyr Lys Val Ile Ala Met Pro Val Leu Val Ala
1 5 10 15
Asn Phe Val Leu Gly Ala Ala Thr Ala Trp Ala Asn Glu Asn Tyr Pro
20 25 30
Ala Lys Ser Ala Gly Tyr Asn Gln Gly Asp Trp Val Ala Ser Phe Asn
35 40 45
Phe Ser Lys Val Tyr Val Gly Glu Glu Leu Gly Asp Leu Asn Val Gly
50 55 60
Gly Gly Ala Leu Pro Asn Ala Asp Val Ser Ile Gly Asn Asp Thr Thr
65 70 75 80
Leu Thr Phe Asp Ile Ala Tyr Phe Val Ser Ser Asn Ile Ala Val Asp
85 90 95
Phe Phe Val Gly Val Pro Ala Arg Ala Lys Phe Gln Gly Glu Lys Ser
100 105 110
Ile Ser Ser Leu Gly Arg Val Ser Glu Val Asp Tyr Gly Pro Ala Ile
115 120 125
Leu Ser Leu Gln Tyr His Tyr Asp Ser Phe Glu Arg Leu Tyr Pro Tyr
130 135 140
Val Gly Val Gly Val Gly Arg Val Leu Phe Phe Asp Lys Thr Asp Gly
145 150 155 160
Ala Leu Ser Ser Phe Asp Ile Lys Asp Lys Trp Ala Pro Ala Phe Gln
165 170 175
Val Gly Leu Arg Tyr Asp Leu Gly Asn Ser Trp Met Leu Asn Ser Asp
180 185 190
Val Arg Tyr Ile Pro Phe Lys Thr Asp Val Thr Gly Thr Leu Gly Pro
195 200 205
Val Pro Val Ser Thr Lys Ile Glu Val Asp Pro Phe Ile Leu Ser Leu
210 215 220
Gly Ala Ser Tyr Val Phe
225 230
<210> 2
<211> 693
<212> DNA
<213> 食油假单胞菌
<400> 2
atgagttttt ctaattataa agtaatcgcg atgccggtgt tggttgctaa ttttgttttg 60
ggggcggcca ctgcatgggc gaatgaaaat tatccggcga aatctgctgg ctataatcag 120
ggtgactggg tcgctagctt caatttttct aaggtctatg tgggtgagga gcttggcgat 180
ctaaatgttg gagggggggc tttgccaaat gctgatgtaa gtattggtaa tgatacaaca 240
cttacgtttg atatcgccta ttttgttagc tcaaatatag cggtggattt ttttgttggg 300
gtgccagcta gggctaaatt tcaaggtgag aaatcaatct cctcgctggg aagagtcagt 360
gaagttgatt acggccctgc aattctttcg cttcaatatc attacgatag ctttgagcga 420
ctttatccat atgttggggt tggtgttggt cgggtgctat tttttgataa aaccgacggt 480
gctttgagtt cgtttgatat taaggataaa tgggcgcctg cttttcaggt tggccttaga 540
tatgaccttg gtaactcatg gatgctaaat tcagatgtgc gttatattcc tttcaaaacg 600
gacgtcacag gtactcttgg cccggttcct gtttctacta aaattgaggt tgatcctttc 660
attctcagtc ttggtgcgtc atatgttttc taa 693
<210> 3
<211> 230
<212> PRT
<213> Pseudomonas putida
<400> 3
Met Asn Pro Pro Ile Leu Lys Lys Leu Ala Met Ser Ile Leu Ala Thr
1 5 10 15
Ser Phe Val Leu Gly Gly Ala Ser Ala Trp Ser Gly Glu Ile Tyr Ser
20 25 30
Thr Glu Thr Ala Gly Tyr Asn Gln Gly Asp Trp Val Ala Ser Phe Asn
35 40 45
Met Ser Lys Val Tyr Val Asp Glu Thr Leu Gly Ser Leu Asn Val Gly
50 55 60
Gly Ala Thr Val Pro Asn Ala Ala Val Ser Ile Gly Asn Asp Thr Thr
65 70 75 80
Val Ser Phe Asp Ile Ser Tyr Phe Ile Ser Asn Asn Val Ala Leu Asp
85 90 95
Phe Phe Val Gly Ile Pro Ala Lys Ala Lys Phe Gln Gly Glu Lys Ser
100 105 110
Ile Ser Ala Leu Gly Arg Val Ser Glu Val Asp Tyr Gly Pro Ala Ile
115 120 125
Leu Ser Leu Gln Tyr His Phe Asp Asn Phe Glu Arg Leu Tyr Pro Tyr
130 135 140
Val Gly Leu Gly Val Gly Arg Val Phe Phe Phe Asp Lys Thr Asp Gly
145 150 155 160
Ala Leu Thr Ser Phe Asp Ile Lys Asp Lys Trp Ala Pro Ala Val Gln
165 170 175
Val Gly Leu Arg Tyr Asp Phe Gly Asn Ser Trp Met Leu Asn Ser Asp
180 185 190
Val Arg Tyr Ile Pro Phe Lys Thr Asp Val Ser Gly Thr Leu Gly Ala
195 200 205
Ala Pro Val Ser Thr Lys Ile Glu Ile Asp Pro Phe Ile Leu Ser Leu
210 215 220
Gly Ala Ser Tyr Lys Phe
225 230
<210> 4
<211> 693
<212> DNA
<213> 恶臭假单胞菌
<400> 4
atgaatccgc ctattttaaa aaaactcgct atgtcgatat tagcaactag ttttgtgttg 60
ggtggggcca gtgcgtggtc aggtgaaatc tattcgactg aaactgctgg ctacaatcag 120
ggcgactggg ttgctagctt taatatgtct aaagtttatg tagacgagac gctaggctcc 180
ctaaatgtag gtggggctac tgtacccaat gctgctgtaa gcatcggtaa tgatacaaca 240
gtttcttttg atatttccta ttttattagt aacaatgtag ctttggattt tttcgtcggg 300
attccagcta aagctaagtt tcaaggtgaa aaatccatct ctgcgctggg aagagtcagt 360
gaagttgatt atggccctgc aattttgtca cttcagtatc attttgataa ttttgagcga 420
ctttatccat atgtcggact aggtgtcggt cgagtgtttt tcttcgacaa aactgatggt 480
gccttgactt catttgatat caaagataaa tgggcgcctg ctgttcaggt cggccttaga 540
tatgattttg gtaactcatg gatgttaaat tcagatgtgc gctatattcc tttcaaaaca 600
gatgtttctg gtacacttgg ggctgcacct gtttctacca agattgagat tgatcctttc 660
attctgagtc ttggagcatc atataagttc tga 693
<210> 5
<211> 246
<212> PRT
<213> Marinobacter aquaeolei VT8
<400> 5
Met Cys Tyr Glu Lys Leu Gln Phe Tyr Leu Lys Pro Ile Cys Phe Asn
1 5 10 15
Thr Lys Lys Asp Asn Lys Lys Met Lys Pro Lys Ile Ile Ser Lys Val
20 25 30
Ser Leu Val Ala Phe Leu Leu Leu Ser Leu Ala Ala Ser Leu Ala Asn
35 40 45
Ala Gln Ser Glu Pro Val Tyr Ser Arg Gly Asp Trp Val Val Gly Leu
50 55 60
Asn Ala Thr Arg Val Leu Thr Asp Glu Asp Leu Arg Ser Ala Ser Ala
65 70 75 80
Gly Gly Ala Pro Val Pro Asn Ser Asn Leu Ser Ile Asn Asn Asp Thr
85 90 95
Thr Val Ser Phe Asp Val Ser Tyr Phe Leu Ser Asn Gln Leu Ala Phe
100 105 110
Asn Ile Phe Gly Gly Ile Pro Ala Ser Ala Asp Leu Gln Gly Glu Glu
115 120 125
Ser Leu Ser Gly Leu Phe Leu Gly Gln Thr Asp Tyr Gly Pro Val Ile
130 135 140
Leu Ser Leu Gln Tyr His Val Leu Thr Gly Ser Asn Phe Ser Pro Tyr
145 150 155 160
Phe Gly Ala Gly Val Gly Arg Ile Leu Phe Leu Asp Glu Lys Asp Arg
165 170 175
Ala Leu Thr Asp Phe Asp Val Glu Asp Thr Trp Ala Pro Ala Ile Gln
180 185 190
Ala Gly Phe Arg Trp Arg Ile His Asn Asn Trp Ser Ala Asn Phe Asp
195 200 205
Val Arg Tyr Ala Pro Phe Lys Ala Asp Ile Thr Gly Asn Leu Gly Pro
210 215 220
Ala Pro Val Gln Ala Glu Val Glu Val Asp Pro Thr Ile Val Ser Ile
225 230 235 240
Gly Val Ala Tyr Arg Phe
245
<210> 6
<211> 741
<212> DNA
<213> Marinobacter aquaeolei
<400> 6
ttgtgttacg aaaaattgca gttttaccta aagccaattt gttttaatac caaaaaggat 60
aataaaaaaa tgaaacctaa aataattagt aaagtctcgt tagtggcgtt ccttttactt 120
tcacttgctg cgagcctggc caacgctcaa tctgagccgg tttacagtag aggcgactgg 180
gtggttggac tgaatgccac tagagtttta accgatgaag atttgcgatc agcctctgcg 240
gggggtgccc ccgttccaaa ttccaacctg tctattaaca acgatacgac cgtttcattc 300
gacgtgtcgt attttctgag taatcagctg gcatttaaca tttttggcgg cattcccgct 360
agtgcggacc tccagggcga agagtctctc tccggtcttt ttcttggtca aacagattat 420
ggtccggtaa ttctttcgct tcagtatcat gtcttaacgg gtagcaactt ctctccgtac 480
tttggagcgg gtgtaggacg gattctcttt ttagatgaga aggatcgcgc actaaccgac 540
ttcgacgtcg aagatacatg ggcccctgcg attcaggctg gttttcgctg gaggatacac 600
aataactggt cggcaaattt tgacgttaga tatgcaccct tcaaagcgga tatcaccggt 660
aacctaggcc cggcccctgt tcaggcagaa gtggaagtgg accccactat cgtgagcatc 720
ggtgtcgcat atcgctttta a 741
<210> 7
<211> 228
<212> PRT
<213> Oceanicaulis alexandrii HTCC2633
<400> 7
Met Lys Gln Ser Asn Val Lys Ser Lys Gly Pro Phe Ala Ser Lys Phe
1 5 10 15
Val Leu Ile Thr Met Ile Gly Gly Phe Ser Ser Phe Ser Val Ala Asn
20 25 30
Ala Glu Pro Leu Tyr Ser Lys Gly Asp Trp Leu Phe Gly Leu Asn Ala
35 40 45
Ala Lys Val Phe Thr Asn Glu Thr Leu Asp Ser Ile Ser Ala Gly Gly
50 55 60
Ala Pro Ile Pro Gly Ala Gly Val Asn Ile Thr Asp Asp Thr Thr Leu
65 70 75 80
Ser Phe Asp Val Ser Tyr Phe Leu Asn Ser Ser Val Ala Leu Asn Phe
85 90 95
Phe Gly Gly Leu Pro Ala Ser Ala Asn Leu Ala Gly Ser Gly Ser Leu
100 105 110
Ala Gly Leu Pro Val Gly Glu Thr Glu Tyr Gly Pro Ala Val Leu Ser
115 120 125
Leu Gln Tyr His Phe Ser Thr Asn Ser Ser Val Ser Pro Tyr Val Gly
130 135 140
Ala Gly Ile Ala Arg Ile Leu Phe Leu Glu Glu Gln Gly Asp Ala Leu
145 150 155 160
Ala Asp Phe Asp Leu Lys Asp Ala Trp Ala Pro Ala Val Gln Val Gly
165 170 175
Met Arg Tyr Gln Met Ser Asp Asn Trp Phe Ala Asn Ala Asp Ile Arg
180 185 190
Tyr Thr Pro Phe Glu Thr Asp Ile Ser Gly Thr Leu Gly Gly Ala Pro
195 200 205
Val Arg Gly Lys Ile Ser Val Asp Pro Thr Ile Leu Asn Ile Gly Ile
210 215 220
Ala Tyr Arg Phe
225
<210> 8
<211> 741
<212> DNA
<213> Oceanicaulis alexandrii HTCC2633
<400> 8
ttaaaagcga tatgcgacac cgatgctcac gatagtgggg tccacttcca cttctgcctg 60
aacaggggcc gggcctaggt taccggtgat atccgctttg aagggtgcat atctaacgtc 120
aaaatttgcc gaccagttat tgtgtatcct ccagcgaaaa ccagcctgaa tcgcaggggc 180
ccatgtatct tcgacgtcga agtcggttag tgcgcgatcc ttctcatcta aaaagagaat 240
ccgtcctaca cccgctccaa agtacggaga gaagttgcta cccgttaaga catgatactg 300
aagcgaaaga attaccggac cataatctgt ttgaccaaga aaaagaccgg agagagactc 360
ttcgccctgg aggtccgcac tagcgggaat gccgccaaaa atgttaaatg ccagctgatt 420
actcagaaaa tacgacacgt cgaatgaaac ggtcgtatcg ttgttaatag acaggttgga 480
atttggaacg ggggcacccc ccgcagaggc tgatcgcaaa tcttcatcgg ttaaaactct 540
agtggcattc agtccaacca cccagtcgcc tctactgtaa accggctcag attgagcgtt 600
ggccaggctc gcagcaagtg aaagtaaaag gaacgccact aacgagactt tactaattat 660
tttaggtttc atttttttat tatccttttt ggtattaaaa caaattggct ttaggtaaaa 720
ctgcaatttt tcgtaacaca a 741
<210> 9
<211> 223
<212> PRT
<213> Marinobacter manganoxydans MnI7-9
<400> 9
Met Lys Pro Lys Ile Ile Ser Lys Val Ser Leu Val Ala Phe Leu Leu
1 5 10 15
Leu Ser Leu Ala Ala Ser Leu Ala Asn Ala Gln Ser Glu Pro Val Tyr
20 25 30
Ser Arg Gly Asp Trp Val Val Gly Leu Asn Ala Thr Arg Val Leu Thr
35 40 45
Asp Glu Asp Leu Arg Ser Ala Ser Ala Gly Ser Ala Pro Val Pro Asn
50 55 60
Ser Asn Leu Ser Ile Asn Asn Asp Thr Thr Val Ser Phe Asp Val Ser
65 70 75 80
Tyr Phe Leu Ser Asn Gln Leu Ala Phe Asn Ile Phe Gly Gly Ile Pro
85 90 95
Ala Ser Ala Asp Leu Gln Gly Glu Glu Ser Leu Ser Gly Leu Phe Leu
100 105 110
Gly Gln Thr Asp Tyr Gly Pro Val Ile Leu Ser Leu Gln Tyr His Val
115 120 125
Leu Thr Gly Ser Asn Phe Ser Pro Tyr Phe Gly Ala Gly Val Gly Arg
130 135 140
Ile Leu Phe Leu Asp Glu Lys Asp Arg Ala Leu Thr Asp Phe Asp Val
145 150 155 160
Glu Asp Thr Trp Ala Pro Ala Val Gln Ala Gly Phe Arg Trp Arg Ile
165 170 175
His Asn Asn Trp Ser Ala Asn Phe Asp Val Arg Tyr Ala Pro Phe Glu
180 185 190
Ala Asp Ile Thr Gly Asn Leu Gly Pro Ala Pro Val Gln Ala Lys Val
195 200 205
Glu Val Asp Pro Thr Ile Val Ser Ile Gly Val Ala Tyr Arg Phe
210 215 220
<210> 10
<211> 687
<212> DNA
<213> Marinobacter manganoxydans MnI7-9
<400> 10
atgaaacaat ctaatgtaaa atccaagggg ccttttgcct cgaagtttgt gttaatcacg 60
atgatcggtg gcttttcgtc gttcagcgtc gcaaatgcgg aaccgcttta ctccaagggc 120
gactggcttt ttggcctgaa tgcggcgaaa gtattcacga acgaaacgtt ggattcaatc 180
agcgcgggcg gtgcgccaat acccggggct ggtgtcaata tcaccgatga caccacgctg 240
agtttcgacg tttcctattt tttgaattca tctgtagcgt tgaacttctt tggtggtttg 300
cctgccagcg ctaatcttgc agggagcggc agtttggcag gactgcctgt tggagagacg 360
gaatatggcc ctgctgtttt gtcgcttcaa tatcactttt cgactaattc atccgttagc 420
ccttatgttg gtgccggtat cgctcgaatc ttgtttctgg aagaacaagg ggatgcactc 480
gcggacttcg acttgaaaga cgcctgggcg cccgctgtcc aagttggaat gcgctatcaa 540
atgagcgata attggtttgc caatgccgat atacgttaca cgccattcga aacagatatc 600
tctgggacac tcggcggggc gccagtcaga ggcaagattt cggtggaccc aacaattctc 660
aatatcggta ttgcctaccg gttttaa 687
<210> 11
<211> 227
<212> PRT
<213> Caulobacter sp. K31
<400> 11
Met Arg Lys Asn Gly Ala Tyr Met Tyr Ala Arg Leu Ala Gln Tyr Ala
1 5 10 15
Thr Ala Leu Ala Val Leu Ser Val Phe Pro Gln Ala Ala Leu Ala Gln
20 25 30
Asn Ser Glu Gly Phe Lys Leu Trp Ala Val Ser Leu Asn Ala Thr Arg
35 40 45
Val Phe Val Asp Glu Asp Ala Pro Asp Ile Thr Leu Ala Gly Gly Pro
50 55 60
Val Pro Gly Ser Asn Val Arg Ile Gly Asp Ala Thr Ser Ala Thr Ile
65 70 75 80
Asp Ile Gly Tyr Phe Phe Thr Pro Asn Val Ala Gly Asn Leu Phe Leu
85 90 95
Gly Val Pro Ala Thr Ala Gln Ile Asp Gly Ala Gly Ser Leu Glu Pro
100 105 110
Leu Gly Thr Leu Ala Lys Val Asn Tyr Gly Pro Ile Ile Leu Ser Ala
115 120 125
Gln Tyr His Phe Asn Asn Leu Gly Lys Val His Pro Tyr Leu Gly Val
130 135 140
Gly Val Gly Arg Ile Val Phe Leu Asn Glu Arg Asp Arg Ala Leu Leu
145 150 155 160
Asn Phe Ser Ile Asp Asp Ser Trp Ala Pro Ala Ala Gln Val Gly Val
165 170 175
Arg Tyr Glu Leu Gly Ala Glu Trp Met Leu Asn Ala Asp Val Arg Tyr
180 185 190
Val Pro Phe Ser Thr His Ala Thr Gly Ser Leu Gly Gly Ala Pro Val
195 200 205
Arg Thr Arg Leu Asp Ile Asp Pro Ile Leu Thr Ser Ala Gly Val Thr
210 215 220
Tyr Arg Phe
225
<210> 12
<211> 684
<212> DNA
<213> Caulobacter sp. K31
<400> 12
atgagaaaaa atggagcata tatgtatgca aggcttgcgc aatacgccac cgcgctcgcc 60
gtcttgagcg tcttccccca ggctgcattg gcccagaatt cggaaggatt caagctgtgg 120
gccgtaagtc tgaatgcaac tagggtcttc gtggatgagg atgcgcccga catcaccttg 180
gctggagggc ctgttccggg ctcgaatgtg aggatcgggg atgccacgtc ggcgaccatt 240
gatatcggat atttcttcac gcccaatgtt gctggtaatt tgtttctcgg cgtgccggcg 300
accgcgcaaa ttgacggcgc tgggtcactt gagccgctcg gaactctagc caaggtcaac 360
tatggaccca tcatcttgtc ggcccagtac catttcaaca atcttggcaa ggttcatccc 420
tatctgggag tgggcgtcgg gcggatcgtc tttctgaatg agcgtgacag agctttgctt 480
aatttcagta tcgacgacag ttgggcgcct gcggctcagg tgggtgttcg gtacgagctt 540
ggcgcagaat ggatgctgaa cgctgacgtt cgatacgttc ccttctccac gcacgctacc 600
ggttcactgg gtggagcgcc tgtccggaca cgtttggaca tcgacccgat cctgacgagc 660
gccggagtga cttaccggtt ttag 684
<210> 13
<211> 410
<212> PRT
<213> Alcanivorax borkumensis SK2
<400> 13
Met Glu Asn Glu Lys Gln Asp Ala Thr Val Ile Val Gly Gly Gly His
1 5 10 15
Ala Ala Gly Ala Leu Met Thr Ala Leu Ile Gln Lys Lys Tyr Pro His
20 25 30
Glu Val Val Leu Val Gly Glu Glu Pro Tyr Pro Pro Tyr Gln Arg Pro
35 40 45
Pro Leu Ser Lys Thr Tyr Leu Ser Gly Glu Val Asn Glu Glu Ser Leu
50 55 60
Tyr Leu Lys Pro Arg Ser Val Tyr Glu Gly Ala Gly His Gln Leu Arg
65 70 75 80
Leu Gly Val Arg Val Glu Asn Ile Asp Arg Asp Asn Lys Thr Leu Thr
85 90 95
Leu Ser Asp Gln Ser Thr Leu Lys Tyr Gly Arg Leu Ile Leu Ala Thr
100 105 110
Gly Ser His Val Arg Arg Leu Asn Ala Pro Gly Ser Glu Leu Lys Gly
115 120 125
Ile His Tyr Leu His Asp Ile Ala Asp Thr Asp Thr Leu Arg Asp Gln
130 135 140
Leu Ser Pro Gly Ala Arg Leu Val Ile Val Gly Gly Gly Tyr Ile Gly
145 150 155 160
Leu Glu Val Ala Ala Ser Ala Ser Lys Lys Gly Val Asn Val Thr Val
165 170 175
Leu Glu Gly Ala Glu Arg Leu Met Gln Arg Val Thr Gly Val Glu Met
180 185 190
Ser Ser Phe Leu Tyr Ala Lys His Ser Gly Ser Gly Val Asp Val Arg
195 200 205
Leu Asn Thr Ala Val Thr Gly Phe Lys Ala Gly Asp Gln Gly Arg Val
210 215 220
Ala Gly Val Thr Leu Ala Asn Gly Glu Thr Val Asp Ala Asp Val Val
225 230 235 240
Leu Val Ser Ile Gly Val Ile Pro Glu Thr Ala Leu Ala Glu Ala Ala
245 250 255
Gly Leu Ser Cys Glu Asp Gly Ile Leu Val Asp Glu Tyr Val Arg Thr
260 265 270
Ser Asp Pro Ser Ile Leu Ala Ile Gly Asp Cys Thr Arg His Arg Asn
275 280 285
Leu Phe Phe Glu Lys Met Gln Arg Leu Glu Ser Val Ala Asn Ala Val
290 295 300
Asp Gln Ala Arg Thr Ala Ala Ala Thr Leu Met Gly Glu Asp Lys Pro
305 310 315 320
Tyr Asp Ser Ala Pro Trp Phe Trp Ser Asn Gln Tyr Asp Val Arg Leu
325 330 335
Gln Met Val Gly Leu Ser Gln Asp His Asp Glu Arg Val Met Arg Gly
340 345 350
Ser Thr Glu Asp Lys Ala Phe Ala Val Phe Tyr Leu Arg Glu Gly Cys
355 360 365
Val Ile Ala Val Asp Ala Val Asn Met Pro Ile Ala Phe Met Val Gly
370 375 380
Lys Gln Leu Val Gln His Arg Lys Ser Ile Ser Ala Asp Val Leu Ser
385 390 395 400
Asp Leu Asp Val Glu Leu Lys Ser Leu Ile
405 410
<210> 14
<211> 1233
<212> DNA
<213> Alcanivorax borkumensis SK2
<400> 14
atggaaaacg aaaaacaaga tgccactgtc atcgttggag gcgggcacgc agcaggtgcg 60
ttgatgacag ccttgataca aaagaaatat ccacacgagg tggttctggt gggcgaagag 120
ccttatccgc cctaccagcg cccgccttta tccaaaacgt atctgtcagg agaggttaac 180
gaggaatctc tctatttgaa accgcgctcg gtgtatgaag gtgcggggca tcagttgcga 240
cttggtgtgc gcgttgagaa cattgatcga gacaacaaaa cccttacatt gtcagatcag 300
agcacactga aatatggccg actgattctt gccacaggtt cacacgttag gcgtcttaat 360
gcgcctggat ctgaattaaa aggcatccat tatctgcatg acattgctga tacggataca 420
ttgcgcgatc aactgtcacc aggtgcccgt ttggttattg tcggtggcgg ctacattggc 480
cttgaggttg cagccagtgc gagcaagaaa ggcgttaatg ttacggtgct ggaaggcgct 540
gagcgtctaa tgcagcgagt tacgggcgtt gagatgtctt cgttcctgta tgctaagcac 600
agtggttctg gcgtggacgt gcgtcttaat actgctgtca ccggcttcaa agctggagat 660
caggggcgag tggctggcgt aacgttagca aatggcgaaa cggttgacgc agatgttgtg 720
cttgtctcga ttggcgttat acccgaaacg gctttggctg aggctgccgg cctatcctgt 780
gaagacggta tcctggtgga cgaatatgtc cgcacttctg acccaagcat cctggcgata 840
ggtgattgca ctcgtcaccg aaaccttttc ttcgagaaaa tgcagaggct cgagtccgtt 900
gctaacgctg tcgatcaagc acgtactgcg gcagcgacct tgatgggaga ggataagccc 960
tacgatagcg ctccatggtt ttggtcgaat caatatgatg ttcgtttgca aatggtgggg 1020
ctctcgcagg accatgatga acgagtcatg cgtggcagca cggaagacaa agcgtttgcg 1080
gtgttctatc tccgtgaggg ctgtgtgatt gccgttgatg cggtgaatat gcccattgcg 1140
tttatggttg gaaagcagtt ggttcagcac cgtaagagta ttagcgctga cgtgttgagt 1200
gatctggatg ttgaattaaa gtctttgatc tga 1233
<210> 15
<211> 106
<212> PRT
<213> Alcanivorax borkumensis SK2
<400> 15
Met Gly Lys Ile Thr Phe Ile Glu Asn Asp Lys Thr Glu His Val Thr
1 5 10 15
Glu Phe Glu Ala Gly Ile Thr Leu Met Gln Val Ala Leu Asp Asn Ala
20 25 30
Val Pro Gly Ile Asp Gly Asp Cys Gly Gly Glu Cys Ala Cys Gly Thr
35 40 45
Cys His Leu Ile Val Pro Glu Glu Trp Phe Asp Lys Thr Gly Pro Ile
50 55 60
Asn Asp Ala Glu Glu Gln Met Leu Ser Met Thr Pro Glu Arg Ala Lys
65 70 75 80
Thr Ser Arg Leu Gly Cys Gln Val Lys Ala Thr Glu Ala Met Asp Gly
85 90 95
Met Thr Val Gln Leu Pro Glu Phe Gln Met
100 105
<210> 16
<211> 321
<212> DNA
<213> Alcanivorax borkumensis SK2
<400> 16
gtgggaaaaa tcacctttat tgagaatgat aaaactgaac atgtaacaga atttgaggca 60
ggtattactt tgatgcaagt tgccttagac aacgccgttc ccggtattga tggggattgc 120
ggcggggagt gtgcctgtgg tacctgtcac ctgattgttc cagaagaatg gttcgataaa 180
accgggccga ttaatgatgc tgaagaacaa atgttgtcca tgacacctga gcgtgcaaaa 240
acctctcggt tggggtgtca ggttaaggcc actgaggcaa tggacggaat gactgttcaa 300
ttgccagaat ttcaaatgta a 321
<210> 17
<211> 336
<212> PRT
<213> Escherichia coli
<400> 17
Met Lys Ala Ala Val Val Thr Lys Asp His His Val Asp Val Thr Tyr
1 5 10 15
Lys Thr Leu Arg Ser Leu Lys His Gly Glu Ala Leu Leu Lys Met Glu
20 25 30
Cys Cys Gly Val Cys His Thr Asp Leu His Val Lys Asn Gly Asp Phe
35 40 45
Gly Asp Lys Thr Gly Val Ile Leu Gly His Glu Gly Ile Gly Val Val
50 55 60
Ala Glu Val Gly Pro Gly Val Thr Ser Leu Lys Pro Gly Asp Arg Ala
65 70 75 80
Ser Val Ala Trp Phe Tyr Glu Gly Cys Gly His Cys Glu Tyr Cys Asn
85 90 95
Ser Gly Asn Glu Thr Leu Cys Arg Ser Val Lys Asn Ala Gly Tyr Ser
100 105 110
Val Asp Gly Gly Met Ala Glu Glu Cys Ile Val Val Ala Asp Tyr Ala
115 120 125
Val Lys Val Pro Asp Gly Leu Asp Ser Ala Ala Ala Ser Ser Ile Thr
130 135 140
Cys Ala Gly Val Thr Thr Tyr Lys Ala Val Lys Leu Ser Lys Ile Arg
145 150 155 160
Pro Gly Gln Trp Ile Ala Ile Tyr Gly Leu Gly Gly Leu Gly Asn Leu
165 170 175
Ala Leu Gln Tyr Ala Lys Asn Val Phe Asn Ala Lys Val Ile Ala Ile
180 185 190
Asp Val Asn Asp Glu Gln Leu Lys Leu Ala Thr Glu Met Gly Ala Asp
195 200 205
Leu Ala Ile Asn Ser His Thr Glu Asp Ala Ala Lys Ile Val Gln Glu
210 215 220
Lys Thr Gly Gly Ala His Ala Ala Val Val Thr Ala Val Ala Lys Ala
225 230 235 240
Ala Phe Asn Ser Ala Val Asp Ala Val Arg Ala Gly Gly Arg Val Val
245 250 255
Ala Val Gly Leu Pro Pro Glu Ser Met Ser Leu Asp Ile Pro Arg Leu
260 265 270
Val Leu Asp Gly Ile Glu Val Val Gly Ser Leu Val Gly Thr Arg Gln
275 280 285
Asp Leu Thr Glu Ala Phe Gln Phe Ala Ala Glu Gly Lys Val Val Pro
290 295 300
Lys Val Ala Leu Arg Pro Leu Ala Asp Ile Asn Thr Ile Phe Thr Glu
305 310 315 320
Met Glu Glu Gly Lys Ile Arg Gly Arg Met Val Ile Asp Phe Arg His
325 330 335
<210> 18
<211> 1011
<212> DNA
<213> Escherichia coli
<400> 18
atgaaggctg cagttgttac gaaggatcat catgttgacg ttacgtataa aacactgcgc 60
tcactgaaac atggcgaagc cctgctgaaa atggagtgtt gtggtgtatg tcataccgat 120
cttcatgtta agaatggcga ttttggtgac aaaaccggcg taattctggg ccatgaaggc 180
atcggtgtgg tggcagaagt gggtccaggt gtcacctcat taaaaccagg cgatcgtgcc 240
agcgtggcgt ggttctacga aggatgcggt cattgcgaat actgtaacag tggtaacgaa 300
acgctctgcc gttcagttaa aaatgccgga tacagcgttg atggcgggat ggcggaagag 360
tgcatcgtgg tcgccgatta cgcggtaaaa gtgccagatg gtctggactc ggcggcggcc 420
agcagcatta cctgtgcggg agtcaccacc tacaaagccg ttaagctgtc aaaaattcgt 480
ccagggcagt ggattgctat ctacggtctt ggcggtctgg gtaacctcgc cctgcaatac 540
gcgaagaatg tctttaacgc caaagtgatc gccattgatg tcaatgatga gcagttaaaa 600
ctggcaaccg aaatgggcgc agatttagcg attaactcac acaccgaaga cgccgccaaa 660
attgtgcagg agaaaactgg tggcgctcac gctgcggtgg taacagcggt agctaaagct 720
gcgtttaact cggcagttga tgctgtccgt gcaggcggtc gtgttgtggc tgtcggtcta 780
ccgccggagt ctatgagcct ggatatccca cgtcttgtgc tggatggtat tgaagtggtc 840
ggttcgctgg tcggcacgcg ccaggattta actgaagcct tccagtttgc cgccgaaggt 900
aaagtggtgc cgaaagtcgc cctgcgtccg ttagcggaca tcaacaccat ctttactgag 960
atggaagaag gcaaaatccg tggccgcatg gtgattgatt tccgtcacta a 1011
<210> 19
<211> 470
<212> PRT
<213> Alcanivorax borkumensis SK2
<400> 19
Met Ser Thr Ser Ser Ser Thr Ser Asn Asp Ile Gln Ala Lys Ile Ile
1 5 10 15
Asn Ala Thr Ser Lys Val Val Pro Met His Leu Gln Ile Lys Ala Leu
20 25 30
Lys Asn Leu Met Lys Val Lys Arg Lys Thr Ile Gly Thr Ser Arg Pro
35 40 45
Gln Val His Phe Val Glu Thr Asp Leu Pro Asp Val Asn Asp Leu Ala
50 55 60
Ile Glu Asp Ile Asp Thr Ser Asn Pro Phe Leu Tyr Arg Gln Gly Lys
65 70 75 80
Ala Asn Ala Tyr Phe Lys Arg Leu Arg Asp Glu Ala Pro Val His Tyr
85 90 95
Gln Lys Asn Ser Ala Phe Gly Pro Phe Trp Ser Val Thr Arg Tyr Glu
100 105 110
Asp Ile Val Phe Val Asp Lys Ser His Asp Leu Phe Ser Ala Glu Pro
115 120 125
Gln Ile Ile Leu Gly Asp Pro Pro Glu Gly Leu Ser Val Glu Met Phe
130 135 140
Ile Ala Met Asp Pro Pro Lys His Asp Val Gln Arg Arg Ala Val Gln
145 150 155 160
Gly Val Val Ala Pro Lys Asn Leu Lys Glu Met Glu Gly Leu Ile Arg
165 170 175
Lys Arg Thr Gly Asp Val Leu Asp Ser Leu Pro Leu Asp Thr Pro Phe
180 185 190
Asn Trp Val Pro Val Val Ser Lys Glu Leu Thr Gly Arg Met Leu Ala
195 200 205
Ser Leu Leu Asp Phe Pro Tyr Asp Glu Arg Glu Lys Leu Val Gly Trp
210 215 220
Ser Asp Arg Leu Ser Gly Ala Ser Ser Ala Thr Gly Gly Glu Phe Thr
225 230 235 240
Asn Glu Asp Val Phe Phe Asp Asp Ala Ala Asp Met Ala Trp Ala Phe
245 250 255
Ser Lys Leu Trp Arg Asp Lys Glu Ala Arg Gln Lys Ala Gly Glu Glu
260 265 270
Pro Gly Phe Asp Leu Ile Ser Met Leu Gln Ser Asn Glu Asp Thr Lys
275 280 285
Asp Leu Ile Asn Arg Pro Leu Glu Phe Ile Gly Asn Leu Ala Leu Leu
290 295 300
Ile Val Gly Gly Asn Asp Thr Thr Arg Asn Ser Met Ser Gly Gly Val
305 310 315 320
Leu Ala Leu Asn Gln Phe Pro Glu Gln Phe Glu Lys Leu Lys Ala Asn
325 330 335
Pro Lys Leu Ile Pro Asn Met Val Ser Glu Ile Ile Arg Trp Gln Thr
340 345 350
Pro Leu Ala Tyr Met Arg Arg Val Ala Lys Gln Asp Val Glu Leu Asn
355 360 365
Gly Gln Thr Ile Lys Lys Gly Asp Arg Val Leu Met Trp Tyr Ala Ser
370 375 380
Gly Asn Gln Asp Glu Arg Lys Phe Glu Asn Pro Glu Gln Phe Ile Ile
385 390 395 400
Asp Arg Lys Asp Thr Arg Asn His Val Ser Phe Gly Tyr Gly Val His
405 410 415
Arg Cys Met Gly Asn Arg Leu Ala Glu Leu Gln Leu Arg Ile Leu Trp
420 425 430
Glu Glu Leu Leu Pro Arg Phe Glu Asn Ile Glu Val Ile Gly Glu Pro
435 440 445
Glu Arg Val Gln Ser Asn Phe Val Arg Gly Tyr Ser Lys Met Met Val
450 455 460
Lys Leu Thr Ala Lys Lys
465 470
<210> 20
<211> 1413
<212> DNA
<213> Alcanivorax borkumensis SK2
<400> 20
atgtcaacga gttcaagtac aagtaatgac atccaggcaa aaataattaa cgccacatcc 60
aaagtcgtgc caatgcatct acagatcaag gcactaaaaa acttgatgaa ggtgaagcgg 120
aagaccattg gcacttcccg ccctcaggtg cactttgttg aaaccgattt gcctgacgtc 180
aatgatttgg cgatagaaga tatcgatacg agtaaccctt ttttataccg acaaggtaag 240
gcgaatgcgt actttaagcg gttgcgtgat gaagcgccgg tgcactatca gaagaacagt 300
gctttcgggc cgttctggtc ggtaacacgc tacgaagata tcgtcttcgt ggacaagagc 360
catgatttgt tttccgccga accccaaatt atcttgggtg atcctccgga aggcctgtcg 420
gttgaaatgt tcatcgctat ggatcctccc aagcacgacg tacagcgtcg ggcagtccag 480
ggtgttgttg cgcccaagaa cctgaaagaa atggaaggac tgatccgcaa gcgcaccggg 540
gacgtactgg atagcctgcc gttggacact ccgttcaact gggtgccggt ggtgtcgaaa 600
gagctgaccg ggcgcatgct agcctcactg ttagatttcc cgtatgacga acgcgaaaaa 660
ctggttggct ggtcggatcg attgtccggc gcgtcctcgg caaccggcgg cgagtttacg 720
aatgaagatg tgttttttga tgacgcggca gatatggcgt gggctttctc caagctttgg 780
cgtgataaag aagcccgtca aaaagcaggt gaagagccgg gtttcgattt gatcagcatg 840
cttcagtcca atgaagacac aaaagatctg atcaatcgtc ctttggaatt cattggtaat 900
ctcgcgttgt tgattgttgg cggtaatgac accacgcgta actcaatgag cgggggggtg 960
ctggctttaa atcagttccc agagcaattc gagaagctaa aggcgaaccc aaagcttatc 1020
cccaatatgg tctctgaaat cattcgctgg caaacgccgc ttgcgtatat gcgccgggtt 1080
gccaagcagg atgtggagct gaacggacag accatcaaga agggtgatcg cgtgctgatg 1140
tggtatgcgt cgggcaacca ggatgagaga aaatttgaga atcctgagca attcatcatc 1200
gaccgcaaag atacgcgtaa ccatgtgtcg tttggttatg gggttcaccg ttgtatgggc 1260
aaccgccttg ccgaactgca gctgcgtatt ctgtgggaag agcttctccc tcgctttgaa 1320
aacatcgaag tgatcggtga gccggagcgc gtgcaatcga actttgtgcg gggctattcc 1380
aagatgatgg ttaagttgac ggctaaaaaa taa 1413
<210> 21
<211> 12
<212> PRT
<213> Artificial
<220>
<223> Sequence motif from CYP153
<220>
<221> VARIANT
<222> (3)..(4)
<223> Ile3 may be leucine; Val4 may be isoleucine
<400> 21
Leu Leu Ile Ile Gly Gly Asn Asp Thr Thr Arg Asn
1 5 10
<210> 22
<211> 378
<212> PRT
<213> Bacillus subtilis subsp. subtilis str. 168
<400> 22
Met Ile Ile Gly Val Pro Lys Glu Ile Lys Asn Asn Glu Asn Arg Val
1 5 10 15
Ala Leu Thr Pro Gly Gly Val Ser Gln Leu Ile Ser Asn Gly His Arg
20 25 30
Val Leu Val Glu Thr Gly Ala Gly Leu Gly Ser Gly Phe Glu Asn Glu
35 40 45
Ala Tyr Glu Ser Ala Gly Ala Glu Ile Ile Ala Asp Pro Lys Gln Val
50 55 60
Trp Asp Ala Glu Met Val Met Lys Val Lys Glu Pro Leu Pro Glu Glu
65 70 75 80
Tyr Val Tyr Phe Arg Lys Gly Leu Val Leu Phe Thr Tyr Leu His Leu
85 90 95
Ala Ala Glu Pro Glu Leu Ala Gln Ala Leu Lys Asp Lys Gly Val Thr
100 105 110
Ala Ile Ala Tyr Glu Thr Val Ser Glu Gly Arg Thr Leu Pro Leu Leu
115 120 125
Thr Pro Met Ser Glu Val Ala Gly Arg Met Ala Ala Gln Ile Gly Ala
130 135 140
Gln Phe Leu Glu Lys Pro Lys Gly Gly Lys Gly Ile Leu Leu Ala Gly
145 150 155 160
Val Pro Gly Val Ser Arg Gly Lys Val Thr Ile Ile Gly Gly Gly Val
165 170 175
Val Gly Thr Asn Ala Ala Lys Met Ala Val Gly Leu Gly Ala Asp Val
180 185 190
Thr Ile Ile Asp Leu Asn Ala Asp Arg Leu Arg Gln Leu Asp Asp Ile
195 200 205
Phe Gly His Gln Ile Lys Thr Leu Ile Ser Asn Pro Val Asn Ile Ala
210 215 220
Asp Ala Val Ala Glu Ala Asp Leu Leu Ile Cys Ala Val Leu Ile Pro
225 230 235 240
Gly Ala Lys Ala Pro Thr Leu Val Thr Glu Glu Met Val Lys Gln Met
245 250 255
Lys Pro Gly Ser Val Ile Val Asp Val Ala Ile Asp Gln Gly Gly Ile
260 265 270
Val Glu Thr Val Asp His Ile Thr Thr His Asp Gln Pro Thr Tyr Glu
275 280 285
Lys His Gly Val Val His Tyr Ala Val Ala Asn Met Pro Gly Ala Val
290 295 300
Pro Arg Thr Ser Thr Ile Ala Leu Thr Asn Val Thr Val Pro Tyr Ala
305 310 315 320
Leu Gln Ile Ala Asn Lys Gly Ala Val Lys Ala Leu Ala Asp Asn Thr
325 330 335
Ala Leu Arg Ala Gly Leu Asn Thr Ala Asn Gly His Val Thr Tyr Glu
340 345 350
Ala Val Ala Arg Asp Leu Gly Tyr Glu Tyr Val Pro Ala Glu Lys Ala
355 360 365
Leu Gln Asp Glu Ser Ser Val Ala Gly Ala
370 375
<210> 23
<211> 1137
<212> DNA
<213> Bacillus subtilis subsp. subtilis str. 168
<400> 23
atgatcatcg gcgtccctaa agaaatcaaa aacaacgaaa accgcgtggc actgaccccg 60
ggtggtgtgt cccaattgat cagcaacggc caccgtgtct tggtcgaaac cggcgctggt 120
ctgggtagcg gctttgagaa cgaggcatac gagtcggcag gtgcggagat tattgccgat 180
cctaagcagg tgtgggacgc cgagatggtt atgaaagtga aagaaccgct gccggaagaa 240
tacgtttact ttcgcaaagg tctggttctg ttcacctatc tgcacttggc cgctgagccg 300
gagctggcac aagcgctgaa ggataagggc gttacggcga tcgcgtatga aacggtgtct 360
gagggccgta ccctgccgct gctgaccccg atgagcgagg ttgccggtcg tatggcagcc 420
cagatcggtg cgcagttcct ggagaaaccg aaaggtggca agggcattct gctggcgggt 480
gtcccgggtg tttctcgtgg taaggtcact atcattggcg gtggcgtggt cggtaccaac 540
gcggcgaaga tggcggttgg cctgggtgct gacgttacga ttatcgactt gaacgctgat 600
cgcctgcgtc aattggacga catctttggc caccagatca agaccttgat ctccaatccg 660
gtgaatatcg cggacgcggt ggcggaggcg gatctgctga tttgcgcagt tctgattcct 720
ggcgcgaagg cgccgaccct ggtcacggaa gaaatggtga aacaaatgaa accgggtagc 780
gtgattgttg acgtagcgat tgatcagggt ggtatcgtgg aaactgttga ccacatcacg 840
actcatgatc agccgacgta cgagaaacat ggtgttgttc actatgcagt tgcaaatatg 900
ccgggtgcgg tcccgcgtac tagcacgatt gccctgacca atgtgaccgt tccgtatgca 960
ctgcaaattg caaataaggg tgcggtgaag gctttggcgg acaacaccgc gctgcgtgct 1020
ggtctgaata ccgcgaacgg tcatgtgacc tatgaggcgg tcgcacgtga cctgggttat 1080
gagtacgtgc ctgcagagaa ggcactgcag gacgagagct ccgtggcagg tgcgtaa 1137
<210> 24
<211> 459
<212> PRT
<213> Chromobacterium violaceum ATCC 12472
<400> 24
Met Gln Lys Gln Arg Thr Thr Ser Gln Trp Arg Glu Leu Asp Ala Ala
1 5 10 15
His His Leu His Pro Phe Thr Asp Thr Ala Ser Leu Asn Gln Ala Gly
20 25 30
Ala Arg Val Met Thr Arg Gly Glu Gly Val Tyr Leu Trp Asp Ser Glu
35 40 45
Gly Asn Lys Ile Ile Asp Gly Met Ala Gly Leu Trp Cys Val Asn Val
50 55 60
Gly Tyr Gly Arg Lys Asp Phe Ala Glu Ala Ala Arg Arg Gln Met Glu
65 70 75 80
Glu Leu Pro Phe Tyr Asn Thr Phe Phe Lys Thr Thr His Pro Ala Val
85 90 95
Val Glu Leu Ser Ser Leu Leu Ala Glu Val Thr Pro Ala Gly Phe Asp
100 105 110
Arg Val Phe Tyr Thr Asn Ser Gly Ser Glu Ser Val Asp Thr Met Ile
115 120 125
Arg Met Val Arg Arg Tyr Trp Asp Val Gln Gly Lys Pro Glu Lys Lys
130 135 140
Thr Leu Ile Gly Arg Trp Asn Gly Tyr His Gly Ser Thr Ile Gly Gly
145 150 155 160
Ala Ser Leu Gly Gly Met Lys Tyr Met His Glu Gln Gly Asp Leu Pro
165 170 175
Ile Pro Gly Met Ala His Ile Glu Gln Pro Trp Trp Tyr Lys His Gly
180 185 190
Lys Asp Met Thr Pro Asp Glu Phe Gly Val Val Ala Ala Arg Trp Leu
195 200 205
Glu Glu Lys Ile Leu Glu Ile Gly Ala Asp Lys Val Ala Ala Phe Val
210 215 220
Gly Glu Pro Ile Gln Gly Ala Gly Gly Val Ile Val Pro Pro Ala Thr
225 230 235 240
Tyr Trp Pro Glu Ile Glu Arg Ile Cys Arg Lys Tyr Asp Val Leu Leu
245 250 255
Val Ala Asp Glu Val Ile Cys Gly Phe Gly Arg Thr Gly Glu Trp Phe
260 265 270
Gly His Gln His Phe Gly Phe Gln Pro Asp Leu Phe Thr Ala Ala Lys
275 280 285
Gly Leu Ser Ser Gly Tyr Leu Pro Ile Gly Ala Val Phe Val Gly Lys
290 295 300
Arg Val Ala Glu Gly Leu Ile Ala Gly Gly Asp Phe Asn His Gly Phe
305 310 315 320
Thr Tyr Ser Gly His Pro Val Cys Ala Ala Val Ala His Ala Asn Val
325 330 335
Ala Ala Leu Arg Asp Glu Gly Ile Val Gln Arg Val Lys Asp Asp Ile
340 345 350
Gly Pro Tyr Met Gln Lys Arg Trp Arg Glu Thr Phe Ser Arg Phe Glu
355 360 365
His Val Asp Asp Val Arg Gly Val Gly Met Val Gln Ala Phe Thr Leu
370 375 380
Val Lys Asn Lys Ala Lys Arg Glu Leu Phe Pro Asp Phe Gly Glu Ile
385 390 395 400
Gly Thr Leu Cys Arg Asp Ile Phe Phe Arg Asn Asn Leu Ile Met Arg
405 410 415
Ala Cys Gly Asp His Ile Val Ser Ala Pro Pro Leu Val Met Thr Arg
420 425 430
Ala Glu Val Asp Glu Met Leu Ala Val Ala Glu Arg Cys Leu Glu Glu
435 440 445
Phe Glu Gln Thr Leu Lys Ala Arg Gly Leu Ala
450 455
<210> 25
<211> 1380
<212> DNA
<213> Chromobacterium violaceum ATCC 12472
<400> 25
atgcagaaac agcgtaccac ctctcagtgg cgtgaactgg atgcggcgca tcatctgcat 60
ccgtttaccg ataccgcgag cctgaatcag gcgggtgcgc gtgtgatgac ccgtggcgaa 120
ggcgtgtatc tgtgggatag cgaaggcaac aaaattattg atggcatggc gggcctgtgg 180
tgcgtgaacg tgggctatgg ccgtaaagat tttgcggaag cggcgcgtcg tcagatggaa 240
gaactgccgt tttataacac cttctttaaa accacccatc cggcggtggt ggaactgagc 300
agcctgctgg ccgaagttac cccggcaggt tttgatcgtg tgttttatac caacagcggc 360
agcgaaagcg tggataccat gattcgtatg gtgcgtcgtt attgggatgt gcagggcaaa 420
ccggaaaaaa aaaccctgat tggccgttgg aacggctatc acggcagcac cattggcggt 480
gcgagcctgg gcggcatgaa atatatgcat gaacagggcg atctgccgat tccgggcatg 540
gcgcatattg aacagccgtg gtggtataaa catggcaaag atatgacccc ggatgaattt 600
ggcgtggttg cggcgcgttg gctggaagaa aaaattctgg aaatcggcgc ggataaagtg 660
gcggcgtttg tgggcgaacc gattcagggt gcgggcggtg tgattgttcc gccggcaacc 720
tattggccgg aaattgaacg tatttgccgc aaatatgatg tgctgctggt tgcggatgaa 780
gtgatttgcg gctttggccg taccggcgaa tggtttggcc atcagcattt tggctttcag 840
ccggacctgt ttaccgcggc gaaaggcctg agcagcggct atctgccgat tggcgcggtg 900
tttgtgggca aacgtgttgc ggaaggtctg attgcgggcg gtgattttaa ccatggcttt 960
acctatagcg gccatccggt gtgtgcggcg gtggcgcatg cgaatgttgc ggcgctgcgt 1020
gatgaaggca ttgtgcagcg tgtgaaagat gatattggcc cgtatatgca gaaacgttgg 1080
cgtgaaacct ttagccgttt tgaacatgtg gatgatgtgc gtggcgtggg catggtgcag 1140
gcgtttaccc tggtgaaaaa caaagcgaaa cgtgaactgt ttccggattt tggcgaaatt 1200
ggcaccctgt gccgcgatat tttttttcgc aacaacctga ttatgcgtgc gtgcggcgat 1260
cacattgtgt ctgcaccgcc gctggttatg acccgtgcgg aagtggatga aatgctggcc 1320
gtggcggaac gttgcctgga agaatttgaa cagaccctga aagcgcgtgg cctggcctaa 1380
<210> 26
<211> 552
<212> PRT
<213> Pseudomonas oleovorans
<400> 26
Met Tyr Asp Tyr Ile Ile Val Gly Ala Gly Ser Ala Gly Cys Val Leu
1 5 10 15
Ala Asn Arg Leu Ser Ala Asp Pro Ser Lys Arg Val Cys Leu Leu Glu
20 25 30
Ala Gly Pro Arg Asp Thr Asn Pro Leu Ile His Met Pro Leu Gly Ile
35 40 45
Ala Leu Leu Ser Asn Ser Lys Lys Leu Asn Trp Ala Phe Gln Thr Ala
50 55 60
Pro Gln Gln Asn Leu Asn Gly Arg Ser Leu Phe Trp Pro Arg Gly Lys
65 70 75 80
Thr Leu Gly Gly Ser Ser Ser Ile Asn Ala Met Val Tyr Ile Arg Gly
85 90 95
His Glu Asp Asp Tyr His Ala Trp Glu Gln Ala Ala Gly Arg Tyr Trp
100 105 110
Gly Trp Tyr Arg Ala Leu Glu Leu Phe Lys Arg Leu Glu Cys Asn Gln
115 120 125
Arg Phe Asp Lys Ser Glu His His Gly Val Asp Gly Glu Leu Ala Val
130 135 140
Ser Asp Leu Lys Tyr Ile Asn Pro Leu Ser Lys Ala Phe Val Gln Ala
145 150 155 160
Gly Met Glu Ala Asn Ile Asn Phe Asn Gly Asp Phe Asn Gly Glu Tyr
165 170 175
Gln Asp Gly Val Gly Phe Tyr Gln Val Thr Gln Lys Asn Gly Gln Arg
180 185 190
Trp Ser Ser Ala Arg Ala Phe Leu His Gly Val Leu Ser Arg Pro Asn
195 200 205
Leu Asp Ile Ile Thr Asp Ala His Ala Ser Lys Ile Leu Phe Glu Asp
210 215 220
Arg Lys Ala Val Gly Val Ser Tyr Ile Lys Lys Asn Met His His Gln
225 230 235 240
Val Lys Thr Thr Ser Gly Gly Glu Val Leu Leu Ser Leu Gly Ala Val
245 250 255
Gly Thr Pro His Leu Leu Met Leu Ser Gly Val Gly Ala Ala Ala Glu
260 265 270
Leu Lys Glu His Gly Val Ser Leu Val His Asp Leu Pro Glu Val Gly
275 280 285
Lys Asn Leu Gln Asp His Leu Asp Ile Thr Leu Met Cys Ala Ala Asn
290 295 300
Ser Arg Glu Pro Ile Gly Val Ala Leu Ser Phe Ile Pro Arg Gly Val
305 310 315 320
Ser Gly Leu Phe Ser Tyr Val Phe Lys Arg Glu Gly Phe Leu Thr Ser
325 330 335
Asn Val Ala Glu Ser Gly Gly Phe Val Lys Ser Ser Pro Asp Arg Asp
340 345 350
Arg Pro Asn Leu Gln Phe His Phe Leu Pro Thr Tyr Leu Lys Asp His
355 360 365
Gly Arg Lys Ile Ala Gly Gly Tyr Gly Tyr Thr Leu His Ile Cys Asp
370 375 380
Leu Leu Pro Lys Ser Arg Gly Arg Ile Gly Leu Lys Ser Ala Asn Pro
385 390 395 400
Leu Gln Pro Pro Leu Ile Asp Pro Asn Tyr Leu Ser Asp His Glu Asp
405 410 415
Ile Lys Thr Met Ile Ala Gly Ile Lys Ile Gly Arg Ala Ile Leu Gln
420 425 430
Ala Pro Ser Met Ala Lys His Phe Lys His Glu Val Val Pro Gly Gln
435 440 445
Ala Val Lys Thr Asp Asp Glu Ile Ile Glu Asp Ile Arg Arg Arg Ala
450 455 460
Glu Thr Ile Tyr His Pro Val Gly Thr Cys Arg Met Gly Lys Asp Pro
465 470 475 480
Ala Ser Val Val Asp Pro Cys Leu Lys Ile Arg Gly Leu Ala Asn Ile
485 490 495
Arg Val Val Asp Ala Ser Ile Met Pro His Leu Val Ala Gly Asn Thr
500 505 510
Asn Ala Pro Thr Ile Met Ile Ala Glu Asn Ala Ala Glu Ile Ile Met
515 520 525
Arg Asn Leu Asp Val Glu Ala Leu Glu Ala Ser Ala Glu Phe Ala Arg
530 535 540
Glu Gly Ala Glu Leu Glu Leu Ala
545 550
<210> 27
<211> 1656
<212> DNA
<213> Pseudomonas oleovorans
<400> 27
atgtacgact atataatcgt tggtgctgga tctgcaggat gtgtgcttgc taatcgtctt 60
tcggccgacc cctctaaaag agtttgttta cttgaagctg ggccgcgaga tacgaatccg 120
ctaattcata tgccgttagg tattgctttg ctttcaaata gtaaaaagtt gaattgggct 180
tttcaaactg cgccacagca aaatctcaac ggccggagcc ttttctggcc acgaggaaaa 240
acgttaggtg gttcaagctc aatcaacgca atggtctata tccgagggca tgaagacgat 300
taccacgcat gggagcaggc ggccggccgc tactggggtt ggtaccgggc tcttgagttg 360
ttcaaaaggc ttgaatgcaa ccagcgattc gataagtccg agcaccatgg ggttgacgga 420
gaattagctg ttagtgattt aaaatatatc aatccgctta gcaaagcatt cgtgcaagcc 480
ggcatggagg ccaatattaa tttcaacgga gatttcaacg gcgagtacca ggacggcgta 540
gggttctatc aagtaaccca aaaaaatgga caacgctgga gctcggcgcg tgcattcttg 600
cacggtgtac tttccagacc aaatctagac atcattactg atgcgcatgc atcaaaaatt 660
ctttttgaag accgtaaggc ggttggtgtt tcttatataa agaaaaatat gcaccatcaa 720
gtcaagacaa cgagtggtgg tgaagtactt cttagtcttg gcgcagtcgg cacgcctcac 780
cttctaatgc tttctggtgt tggggctgca gccgagctta aggaacatgg tgtttctcta 840
gtccatgatc ttcctgaggt ggggaaaaat cttcaagatc atttggacat cacattgatg 900
tgcgcagcaa attcgagaga gccgataggt gttgctcttt ctttcatccc tcgtggtgtc 960
tcgggtttgt tttcatatgt gtttaagcgc gaggggtttc tcactagtaa cgtggcagag 1020
tcgggtggtt ttgtaaaaag ttctcctgat cgtgatcggc ccaatttgca gtttcatttc 1080
cttccaactt atcttaaaga tcacggtcga aaaatagcgg gtggttatgg ttatacgcta 1140
catatatgtg atcttttgcc taagagccga ggcagaattg gcctaaaaag cgccaatcca 1200
ttacagccgc ctttaattga cccgaactat cttagcgatc atgaagatat taaaaccatg 1260
attgcgggta ttaagatagg gcgcgctatt ttgcaggccc catcgatggc gaagcatttt 1320
aagcatgaag tagtaccggg ccaggctgtt aaaactgatg atgaaataat cgaagatatt 1380
cgtaggcgag ctgagactat ataccatccg gtaggtactt gtaggatggg taaagatcca 1440
gcgtcagttg ttgatccgtg cctgaagatc cgtgggttgg caaatattag agtcgttgat 1500
gcgtcaatta tgccgcactt ggtcgcgggt aacacaaacg ctccaactat tatgattgca 1560
gaaaatgcgg cagaaataat tatgcggaat cttgatgtgg aagcattaga ggctagcgct 1620
gagtttgctc gcgagggtgc agagctagag ttggcc 1656
<210> 28
<211> 535
<212> PRT
<213> Alcanivorax borkumensis strain SK2
<400> 28
Met Tyr Asp Tyr Ile Ile Val Gly Ala Gly Ser Ala Gly Cys Val Leu
1 5 10 15
Ala Asn Arg Leu Ser Ala Asp Thr Ser Lys Arg Val Ala Leu Ile Glu
20 25 30
Ala Gly Pro Arg Asp Lys Asn Pro Leu Ile His Met Pro Ile Gly Ile
35 40 45
Ala Leu Leu Ala Asn Asn Arg Lys Leu Asn Trp Ala Leu Glu Thr Glu
50 55 60
Pro Gln Glu His Leu Lys Gly Arg Gln Leu Phe Trp Pro Arg Gly Lys
65 70 75 80
Thr Leu Gly Gly Ser Ser Ser Ile Asn Ala Met Val Tyr Ile Arg Gly
85 90 95
His Lys Ala Asp Tyr Asp His Trp Gly Gln Val Ala Gly Asn Asn Asn
100 105 110
Leu Trp Gly Trp Asp Arg Ala Leu Thr Leu Phe Arg Arg Val Glu Asp
115 120 125
Asn Gln Arg Leu Gly Ala Asp Pro Tyr His Gly Lys Asp Gly Glu Leu
130 135 140
Thr Val Ser Glu Leu Lys Ser Ile Asn Pro Leu Ser Arg Asp Phe Val
145 150 155 160
Arg Ala Ala Pro His Val Asp Leu Pro Val Asn Thr Asp Phe Asn Gly
165 170 175
Lys Ser Gln Asp Gly Leu Gly Leu Tyr Gln Val Thr Gln Lys Asn Gly
180 185 190
Gln Arg Trp Ser Ser Ala Gln Ala Phe Leu Arg Ala Ala Glu Ser Arg
195 200 205
Ser Asn Leu Asp Val Leu Thr Asp Ala Arg Val Thr Arg Val Ala Met
210 215 220
Glu Gly Lys Arg Ala Val Gly Val Thr Leu Lys Gln Gly Ser Glu Tyr
225 230 235 240
Arg Gln Leu Arg Leu Asn Ala Gly Gly Glu Val Ile Leu Ser Gly Gly
245 250 255
Ala Val Asn Ser Pro Gln Leu Leu Leu Leu Ser Gly Ile Gly Asp Ser
260 265 270
Lys Glu Leu Ala Lys His Gly Ile Pro Leu Val His His Leu Pro Glu
275 280 285
Val Gly Gln Asn Leu Ala Asp His Leu Asp Ile Thr Ile Met His Thr
290 295 300
Ala Asn Ser Arg Leu Pro Ile Gly Val Ala Pro Ser Phe Leu Phe Arg
305 310 315 320
Gly Val Ser Ala Leu Phe Ser Tyr Ile Phe Ala Arg Arg Gly Phe Leu
325 330 335
Thr Ser Asn Val Ala Glu Ser Gly Gly Phe Val Lys Ser Asp Pro Ser
340 345 350
Ser Glu Arg Pro Asn Val Gln Phe His Phe Leu Pro Thr Tyr Leu Lys
355 360 365
Asp His Gly Arg Lys Val Met Ala Gly Tyr Gly Tyr Thr Leu His Ile
370 375 380
Cys Asp Leu Leu Pro Lys Ser Arg Gly Phe Ile Gly Leu Gln Ser Pro
385 390 395 400
Asp Pro Leu Ala Asn Pro Leu Ile Gln Pro Asn Tyr Leu Ser Asp Pro
405 410 415
Glu Asp Ile Lys Thr Met Ile Ser Ala Ile Lys Phe Gly Arg Arg Ile
420 425 430
Leu Gly Ala Pro Thr Met Ala Leu His Ser Lys Arg Glu Val Met Pro
435 440 445
Gly Glu Ser Val Ser Thr Asp Ala Gln Leu Ala Asp Phe Ile Arg Glu
450 455 460
Asn Ala Glu Thr Ile Tyr His Pro Val Gly Thr Cys Arg Met Gly Ala
465 470 475 480
Asp Pro Asp Ser Val Val Asp Pro Glu Leu Lys Val Arg Gly Val Glu
485 490 495
Gly Leu Arg Val Val Asp Ala Ser Ile Met Pro Ser Leu Val Ala Gly
500 505 510
Asn Thr Asn Ala Pro Thr Met Met Ile Ala Glu Asn Ala Ala Asp Ile
515 520 525
Leu Leu Gly Lys Val Gln Val
530 535
<210> 29
<211> 1608
<212> DNA
<213> Oxidoreductase of the glucose-methanol-choline oxidoreductase family YP_694430 (Ab_AlkJ)
<400> 29
atgtacgatt acattattgt aggagccggg tcggctggct gtgtgttggc taaccgcctt 60
agtgcggata cttcaaaacg cgtagcgctg atcgaagccg gcccacggga taaaaatccc 120
ctaattcaca tgcctattgg catcgctctc ctcgccaaca atagaaaact caactgggcg 180
ctagaaaccg agcctcaaga acacttgaaa gggcgccaac tgttttggcc ccgcggaaag 240
acgcttggtg gctcttcttc tattaatgcc atggtgtata tcaggggcca taaagccgac 300
tacgatcatt ggggccaagt tgccggtaac aacaacctct ggggttggga tcgtgcatta 360
acattatttc gtcgggtgga agacaaccaa cgccttggcg cagacccata ccacggcaaa 420
gatggtgagc tcactgtaag cgaattaaaa tcgatcaacc cgctgagccg tgattttgtt 480
cgagcagcgc ctcacgtaga cctgcccgtg aacacagact ttaacggcaa atcacaagac 540
ggattggggc tttaccaagt aacgcagaaa aatggccagc gctggagttc agcgcaagca 600
tttttgcgtg ccgctgagag ccgctctaat cttgacgtgc taaccgatgc tcgagtaacc 660
cgcgtggcta tggagggtaa gcgagcggtt ggcgtgaccc tgaaacaggg aagtgaatat 720
cgccagctga gactcaatgc cggcggcgaa gtcatcctgt ctggtggtgc agttaattca 780
ccacagctcc tcctgctgtc tggcatcggg gatagtaaag agcttgcgaa acacggcatt 840
ccgctagttc atcaccttcc cgaagtcggc cagaatttgg ccgatcatct ggacatcacg 900
atcatgcaca cggcgaactc tcgtttgccc attggcgttg cacccagctt cttattccgt 960
ggggtgagcg cacttttctc ctatatcttt gcgcgacgtg gttttcttac cagtaatgtt 1020
gccgagtctg gcggctttgt gaaatcggac ccttcgtctg agcgacccaa tgtgcaattt 1080
cactttttgc ccacttacct gaaggatcat ggccgaaaag taatggcagg ctatggctac 1140
actttgcaca tttgcgattt gttgccgaaa agccgaggct tcattggatt gcaaagccct 1200
gacccattgg ccaatccgct gattcagcct aactatctga gcgaccccga agatatcaaa 1260
acaatgatat ccgccattaa gtttgggcga cgcattctcg gagcaccaac aatggcgctt 1320
catagtaaac gggaagttat gccgggagag tccgtatcta cggacgctca actagcagac 1380
tttatccgtg aaaatgctga aaccatctac caccctgttg gcacttgtcg tatgggggcc 1440
gaccctgatt ccgttgtcga tccggaactg aaagtcagag gcgttgaagg gctaagagtt 1500
gtcgatgcct cgataatgcc cagcttggtg gcgggtaaca cgaacgcacc cacaatgatg 1560
attgccgaaa atgcggccga catcctgctg ggaaaggtcc aagtgtaa 1608
<210> 30
<211> 416
<212> PRT
<213> Marinobacter aquaeolei VT8
<400> 30
Met Val Ser Lys Arg Lys Glu Arg Thr Val Ile Val Gly Gly Gly His
1 5 10 15
Ala Ala Gly Ala Leu Leu Thr Ala Leu Leu Gln Lys Lys Tyr Gln His
20 25 30
Glu Val Val Leu Val Gly Asn Glu Pro His Pro Pro Tyr His Arg Pro
35 40 45
Pro Leu Ser Lys Asn Tyr Leu Thr Gly Asp Val Asp Gln Glu Ser Leu
50 55 60
Tyr Leu Lys Pro Arg Ser Val Tyr Glu Asn Ala Gly His Gln Leu Arg
65 70 75 80
Leu Gly Val Arg Val Glu Gln Ile Asp Arg Asp Ser Ser Thr Ile Ser
85 90 95
Leu Ser Asp Gln Ser Arg Leu Gln Tyr Asp Arg Leu Val Leu Ala Thr
100 105 110
Gly Ser His Leu Arg His Leu Asn Ala Pro Gly Ala Asp Leu Asn Gly
115 120 125
Ile His Tyr Leu His Asp Ile Ala Asp Ser Glu Val Leu Arg Glu Gln
130 135 140
Leu Val Ala Gly Lys Arg Leu Val Val Val Gly Gly Gly Tyr Ile Gly
145 150 155 160
Leu Glu Val Ala Ala Ser Ala Asn Lys Lys Gly Val Asn Val Thr Val
165 170 175
Leu Glu Ala Ala Glu Arg Leu Met Gln Arg Val Thr Gly Pro Glu Ile
180 185 190
Ser Ala Phe Leu Tyr Asp Lys His Arg Gly Ala Gly Val Asp Val Arg
195 200 205
Leu Asn Thr Ala Val Thr Gly Phe Glu Ala Gly Asp Gln Gly His Val
210 215 220
Ala Gly Val Thr Leu Ala Asp Gly Ser Thr Val Pro Ala Asp Ile Val
225 230 235 240
Leu Val Ser Ile Gly Ile Ile Pro Glu Thr Ala Leu Ala Lys Asp Ala
245 250 255
Gly Leu Pro Cys Asp Asn Gly Ile Ile Val Asp Glu Phe Thr Arg Thr
260 265 270
Glu Asp Pro Ala Ile Leu Ala Ile Gly Asp Cys Thr Arg His Arg Asn
275 280 285
Leu Phe Phe Glu Lys Met Gln Arg Leu Glu Ser Val Ala Asn Ala Val
290 295 300
Asp Gln Ala Arg Thr Ala Ala Ala Thr Leu Met Gly Glu Glu Lys Pro
305 310 315 320
Tyr Asp Ser Val Pro Trp Phe Trp Ser Asn Gln Tyr Asp Val Arg Leu
325 330 335
Gln Met Val Gly Leu Ser Gln Asn His Asp Gln Arg Val Val Arg Gly
340 345 350
Thr Pro Glu Asp Lys Gly Phe Ala Val Phe Tyr Leu Arg Glu Gly Cys
355 360 365
Val Ile Ala Val Asp Ala Val Asn Leu Pro Leu Ala Phe Leu Val Gly
370 375 380
Lys Thr Leu Val Gln Gln Arg Arg Thr Ile Asn Pro Glu Leu Ile Glu
385 390 395 400
Asp Pro Asp Thr Glu Leu Lys Ser Leu Val Asn Gly Arg Leu Gln Ser
405 410 415
<210> 31
<211> 1251
<212> DNA
<213> Marinobacter aquaeolei VT8
<400> 31
atggtaagca aacgtaaaga gaggacggtc attgttggcg gtgggcacgc agcaggtgcc 60
ctcctgacag ccttactcca aaaaaaatat caacatgagg tcgttctggt ggggaatgaa 120
cctcatccgc cctaccatcg accgccgctg tccaagaatt acctgacagg agacgttgat 180
caggagtcgc tgtacctgaa accgcgctcg gtatacgaga acgcaggcca tcagttgcgg 240
ctcggtgtgc gcgtcgaaca aattgatcgg gacagtagca ccatcagctt gtcggatcag 300
agcaggctgc aatacgatcg actggtcctg gccaccgggt cacaccttcg acacctgaac 360
gcgcccgggg ctgacttaaa tggcattcat tacctgcacg acatagctga ttcagaggta 420
ctgcgtgaac agttagttgc tggaaagcgc ctggtcgtcg tgggtggtgg ttacatcggc 480
cttgaggtgg cggccagtgc caacaaaaaa ggtgttaatg tcacggtgct agaagccgcc 540
gaacgtctta tgcagcgcgt tacgggcccg gaaatatcag cgttccttta cgacaaacac 600
cgtggcgccg gcgtggacgt acgtctgaac acagcggtaa ccggcttcga agcgggcgat 660
caggggcatg tggctggcgt gacgttggcg gacggaagca ccgtaccggc cgacatcgtc 720
cttgtgtcga tcggcattat cccggaaacc gctctggcta aggacgccgg cctgccctgt 780
gataacggta ttattgttga cgaatttacc cgtaccgagg accccgccat cttggcgatc 840
ggtgactgca cccggcaccg gaatcttttc ttcgagaaga tgcaacgact cgagtctgtc 900
gccaatgctg tcgatcaggc tcgtacagcc gcggcaaccc tgatgggtga ggagaaaccc 960
tatgatagcg ttccatggtt ctggtcaaac cagtacgatg ttcgtctgca gatggtagga 1020
ttgtcgcaaa atcatgatca gcgagtggtt cgaggcaccc ccgaggataa aggatttgcc 1080
gtgttctatc tccgcgaagg ctgtgttatt gctgttgacg cggtcaacct gccccttgct 1140
tttttggtag gcaagacact cgttcaacaa cgcagaacga tcaacccgga actaatagag 1200
gatccggata ctgaactgaa atctttggtg aacggaaggc tccagagttg a 1251
<210> 32
<211> 404
<212> PRT
<213> Acinetobacter sp. OC4
<400> 32
Met Gln Thr Ile Val Ile Ile Gly Ala Ser His Ala Ala Ala Gln Leu
1 5 10 15
Ala Ala Ser Leu Arg Pro Asp Gly Trp Gln Gly Glu Ile Val Val Ile
20 25 30
Gly Asp Glu Pro Tyr Leu Pro Tyr His Arg Pro Pro Leu Ser Lys Thr
35 40 45
Phe Leu Arg Gly Ala Gln Leu Val Asp Glu Leu Leu Ile Arg Pro Ala
50 55 60
Ala Phe Tyr Gln Lys Asn Gln Ile Glu Phe Arg His Gly Arg Val Val
65 70 75 80
Ala Ile Asp Arg Ala Ala Arg Ser Val Thr Leu Gln Asp Gly Ser Thr
85 90 95
Leu Ala Tyr Asp Gln Leu Ala Leu Cys Thr Gly Ala Arg Val Arg Thr
100 105 110
Val Ser Leu Ala Gly Ser Asp Leu Ala Gly Val His Tyr Leu Arg Asn
115 120 125
Ile Ser Asp Val Gln Ala Ile Gln Pro Phe Val Gln Pro Asn Gly Lys
130 135 140
Ala Val Val Ile Gly Gly Gly Tyr Ile Gly Leu Glu Thr Ala Ala Ala
145 150 155 160
Leu Thr Glu Gln Gly Met Gln Val Val Val Leu Glu Ala Ala Glu Arg
165 170 175
Ile Leu Gln Arg Val Thr Ala Pro Glu Val Ser Asp Phe Tyr Thr Arg
180 185 190
Ile His Arg Glu Gln Gly Val Thr Ile His Thr Gly Val Ser Val Thr
195 200 205
Ala Ile Thr Gly Glu Gly Arg Ala Gln Ala Val Leu Cys Ala Asp Gly
210 215 220
Ser Met Phe Asp Ala Asp Leu Val Ile Ile Gly Val Gly Val Val Pro
225 230 235 240
Asn Ile Glu Leu Ala Leu Asp Ala Gly Leu Gln Val Asp Asn Gly Ile
245 250 255
Val Ile Asp Glu Tyr Cys Arg Thr Ser Ala Pro Glu Ile Val Ala Ile
260 265 270
Gly Asp Cys Ala Asn Ala Phe Asn Pro Ile Tyr Gln Arg Arg Met Arg
275 280 285
Leu Glu Ser Val Pro Asn Ala Asn Glu Gln Ala Lys Ile Ala Ser Ala
290 295 300
Thr Leu Cys Gly Leu Gln Arg Thr Ser Lys Ser Leu Pro Trp Phe Trp
305 310 315 320
Ser Asp Gln Tyr Asp Leu Lys Leu Gln Ile Ala Gly Leu Ser Gln Gly
325 330 335
Tyr Asp Gln Ile Val Ile Arg Gly Asp Val Gln Gln Arg Arg Ser Phe
340 345 350
Ala Ala Phe Tyr Leu Gln Ala Gly Arg Leu Ile Ala Ala Asp Cys Val
355 360 365
Asn Arg Pro Gln Glu Phe Met Leu Ser Lys Lys Leu Ile Thr Ala Gly
370 375 380
Thr Ala Val Asp Pro Leu Arg Leu Ala Asp Glu Ser Ile Ala Val Gln
385 390 395 400
Ala Leu Met Gly
<210> 33
<211> 1215
<212> DNA
<213> Acinetobacter sp. OC4
<400> 33
atgcagacca tcgtaatcat cggcgccagc cacgcagcag cccaactggc cgcgagcttg 60
cgtccggatg gttggcaggg tgagatcgtt gtcatcggtg atgaaccgta tctgccatac 120
caccgtccgc cgctgagcaa aaccttcctg cgtggtgcac aactggttga tgaactgctg 180
attcgtcctg ccgcttttta ccaaaagaac cagattgagt tccgtcacgg tcgtgtggtc 240
gcaattgatc gtgcagcgcg tagcgttacc ctgcaagacg gtagcaccct ggcttatgat 300
cagctggcac tgtgcactgg tgcacgtgtg cgtaccgtta gcttggctgg cagcgacctg 360
gctggtgttc actacctgcg caacattagc gatgtccaag caatccagcc gttcgtgcaa 420
ccgaatggta aagcggttgt gattggtggc ggctacatcg gtctggaaac ggctgcggcc 480
ctgacggaac aaggcatgca ggtggttgtt ttggaggcgg ctgagcgcat cctgcaacgc 540
gttacggcgc ctgaagttag cgacttttac acccgtattc accgtgaaca gggcgttacc 600
attcacacgg gtgtctcggt gacggccatt accggcgaag gtcgtgcgca ggcggtgctg 660
tgcgcggatg gtagcatgtt tgatgctgat ttggttatca ttggtgtcgg cgtcgtcccg 720
aatatcgagc tggctctgga tgcgggtctg caggtggata atggcatcgt catcgacgag 780
tactgtcgca cctctgcgcc ggagatcgtc gccattggtg actgcgcgaa tgctttcaac 840
ccgatctacc agcgtcgtat gcgcttggaa tcggtgccga acgcgaacga acaagcaaag 900
attgcaagcg caaccctgtg cggtctgcag cgtaccagca aatccctgcc gtggttttgg 960
agcgatcaat atgatctgaa actgcaaatt gctggcctga gccaaggtta tgaccaaatt 1020
gtgattcgtg gcgacgttca acaacgccgc agcttcgcgg cgttctacct gcaggcgggt 1080
cgtctgatcg cggcagattg tgtaaatcgt ccgcaggagt ttatgctgag caagaagctg 1140
attaccgctg gtacggcggt ggacccgctg cgtctggctg acgagagcat cgctgtacag 1200
gcgctgatgg gctga 1215
<210> 34
<211> 470
<212> PRT
<213> Marinobacter aquaeolei VT8
<400> 34
Met Pro Thr Leu Pro Arg Thr Phe Asp Asp Ile Gln Ser Arg Leu Ile
1 5 10 15
Asn Ala Thr Ser Arg Val Val Pro Met Gln Arg Gln Ile Gln Gly Leu
20 25 30
Lys Phe Leu Met Ser Ala Lys Arg Lys Thr Phe Gly Pro Arg Arg Pro
35 40 45
Met Pro Glu Phe Val Glu Thr Pro Ile Pro Asp Val Asn Thr Leu Ala
50 55 60
Leu Glu Asp Ile Asp Val Ser Asn Pro Phe Leu Tyr Arg Gln Gly Gln
65 70 75 80
Trp Arg Ala Tyr Phe Lys Arg Leu Arg Asp Glu Ala Pro Val His Tyr
85 90 95
Gln Lys Asn Ser Pro Phe Gly Pro Phe Trp Ser Val Thr Arg Phe Glu
100 105 110
Asp Ile Leu Phe Val Asp Lys Ser His Asp Leu Phe Ser Ala Glu Pro
115 120 125
Gln Ile Ile Leu Gly Asp Pro Pro Glu Gly Leu Ser Val Glu Met Phe
130 135 140
Ile Ala Met Asp Pro Pro Lys His Asp Val Gln Arg Ser Ser Val Gln
145 150 155 160
Gly Val Val Ala Pro Lys Asn Leu Lys Glu Met Glu Gly Leu Ile Arg
165 170 175
Ser Arg Thr Gly Asp Val Leu Asp Ser Leu Pro Thr Asp Lys Pro Phe
180 185 190
Asn Trp Val Pro Ala Val Ser Lys Glu Leu Thr Gly Arg Met Leu Ala
195 200 205
Thr Leu Leu Asp Phe Pro Tyr Glu Glu Arg His Lys Leu Val Glu Trp
210 215 220
Ser Asp Arg Met Ala Gly Ala Ala Ser Ala Thr Gly Gly Glu Phe Ala
225 230 235 240
Asp Glu Asn Ala Met Phe Asp Asp Ala Ala Asp Met Ala Arg Ser Phe
245 250 255
Ser Arg Leu Trp Arg Asp Lys Glu Ala Arg Arg Ala Ala Gly Glu Glu
260 265 270
Pro Gly Phe Asp Leu Ile Ser Leu Leu Gln Ser Asn Lys Glu Thr Lys
275 280 285
Asp Leu Ile Asn Arg Pro Met Glu Phe Ile Gly Asn Leu Thr Leu Leu
290 295 300
Ile Val Gly Gly Asn Asp Thr Thr Arg Asn Ser Met Ser Gly Gly Leu
305 310 315 320
Val Ala Met Asn Glu Phe Pro Arg Glu Phe Glu Lys Leu Lys Ala Lys
325 330 335
Pro Glu Leu Ile Pro Asn Met Val Ser Glu Ile Ile Arg Trp Gln Thr
340 345 350
Pro Leu Ala Tyr Met Arg Arg Ile Ala Lys Gln Asp Val Glu Leu Gly
355 360 365
Gly Gln Thr Ile Lys Lys Gly Asp Arg Val Val Met Trp Tyr Ala Ser
370 375 380
Gly Asn Arg Asp Glu Arg Lys Phe Asp Asn Pro Asp Gln Phe Ile Ile
385 390 395 400
Asp Arg Lys Asp Ala Arg Asn His Met Ser Phe Gly Tyr Gly Val His
405 410 415
Arg Cys Met Gly Asn Arg Leu Ala Glu Leu Gln Leu Arg Ile Leu Trp
420 425 430
Glu Glu Ile Leu Lys Arg Phe Asp Asn Ile Glu Val Val Glu Glu Pro
435 440 445
Glu Arg Val Gln Ser Asn Phe Val Arg Gly Tyr Ser Arg Leu Met Val
450 455 460
Lys Leu Thr Pro Asn Ser
465 470
<210> 35
<211> 1413
<212> DNA
<213> Marinobacter aquaeolei VT8
<400> 35
atgccaacac tgcccagaac atttgacgac attcagtccc gactgattaa cgccacctcc 60
agggtggtgc cgatgcagag gcaaattcag ggactgaaat tcttaatgag cgccaagagg 120
aagaccttcg gcccacgccg accgatgccc gaattcgttg aaacacccat cccggacgtt 180
aacacgctgg cccttgagga catcgatgtc agcaatccgt ttttataccg gcagggtcag 240
tggcgcgcct atttcaaacg gttgcgtgat gaggcgccgg tccattacca gaagaacagc 300
cctttcggcc ccttctggtc ggtaactcgg tttgaagaca tcctgttcgt ggataagagt 360
cacgacctgt tttccgccga gccgcaaatc attctcggtg accctccgga ggggctgtcg 420
gtggaaatgt tcatagcgat ggatccgccg aaacacgatg tgcagcgcag ctcggtgcag 480
ggagtagtgg caccgaaaaa cctgaaggag atggaggggc tgatccgatc acgcaccggc 540
gatgtgcttg acagcctgcc tacagacaaa ccctttaact gggtacctgc tgtttccaag 600
gaactcacag gccgcatgct ggcgacgctt ctggattttc cttacgagga acgccacaag 660
ctggttgagt ggtcggacag aatggcaggt gcagcatcgg ccaccggcgg ggagtttgcc 720
gatgaaaatg ccatgtttga cgacgcggca gacatggccc ggtctttctc caggctttgg 780
cgggacaagg aggcgcgccg cgcagcaggc gaggagcccg gtttcgattt gatcagcctg 840
ttgcagagca acaaagaaac gaaagacctg atcaatcggc cgatggagtt tatcggtaat 900
ttgacgctgc tcatagtcgg cggcaacgat acgacgcgca actcgatgag tggtggcctg 960
gtggccatga acgaattccc cagggaattt gaaaaattga aggcaaaacc ggagttgatt 1020
ccgaacatgg tgtcggaaat catccgctgg caaacgccgc tggcctatat gcgccgaatc 1080
gccaagcagg atgtcgaact gggcggccag accatcaaga agggtgatcg agttgtcatg 1140
tggtacgcgt cgggtaaccg ggacgagcgc aaatttgaca accccgatca gttcatcatt 1200
gatcgcaagg acgcacgaaa ccacatgtcg ttcggctatg gggttcaccg ttgcatgggc 1260
aaccgtctgg ctgaactgca actgcgcatc ctctgggaag aaatactcaa gcgttttgac 1320
aacatcgaag tcgtcgaaga gcccgagcgg gtgcagtcca acttcgtgcg gggctattcc 1380
aggttgatgg tcaaactgac accgaacagt taa 1413
<210> 36
<211> 106
<212> PRT
<213> Acinetobacter sp. OC4
<400> 36
Met Gly Gln Ile Thr Phe Ile Ala His Asp Gly Ala Gln Thr Ser Val
1 5 10 15
Ala Ile Glu Ala Gly Lys Ser Leu Met Gln Leu Ala Val Glu Asn Gly
20 25 30
Val Ala Gly Ile Asp Gly Asp Cys Gly Gly Glu Cys Ala Cys Gly Thr
35 40 45
Cys His Val Ile Val Ser Ala Glu Trp Ser Asp Val Ala Gly Thr Ala
50 55 60
Gln Ala Asn Glu Gln Gln Met Leu Glu Met Thr Pro Glu Arg Ala Ala
65 70 75 80
Thr Ser Arg Leu Ala Cys Cys Ile Gln Val Thr Asp Ala Met Asp Gly
85 90 95
Met Thr Val His Leu Pro Glu Phe Gln Met
100 105
<210> 37
<211> 321
<212> DNA
<213> Acinetobacter sp. OC4
<400> 37
atgggccaaa ttactttcat cgcacacgac ggcgcacaga cgagcgttgc gatcgaggcg 60
ggtaagtctt tgatgcagct ggcggtagaa aacggtgtcg ccggcatcga tggcgactgc 120
ggtggcgagt gtgcgtgtgg cacctgtcat gtcattgtct ctgcggagtg gagcgatgtg 180
gccggtaccg cgcaagccaa tgagcagcag atgctggaga tgactccgga gcgtgcggcc 240
accagccgtc tggcatgttg catccaagtg acggacgcga tggatggtat gaccgtccat 300
ctgccggaat ttcagatgta a 321
<210> 38
<211> 542
<212> PRT
<213> Caulobacter sp. K31
<400> 38
Met Phe Asp Tyr Ile Ile Val Gly Ala Gly Ser Ala Gly Cys Leu Leu
1 5 10 15
Ala Glu Arg Leu Ser Ala Asn Pro Arg Thr Arg Val Cys Leu Leu Glu
20 25 30
Ala Gly Pro Pro Asp Arg Ser Pro Leu Ile His Met Pro Ile Gly Ile
35 40 45
Ala Leu Leu Ser Lys Ser Lys Ile Leu Asn Trp Ala Phe Glu Thr Gln
50 55 60
Pro Gln Ala Asn Leu Asp Gly Arg Arg Leu Phe Trp Pro Arg Gly Lys
65 70 75 80
Thr Leu Gly Gly Ser Ser Ser Ile Asn Ala Met Val Tyr Ile Arg Gly
85 90 95
His Arg Asp Asp Tyr Asp Ser Trp Gly Glu Ala Ala Asp Pro Ile Trp
100 105 110
Ser Tyr Asp Asn Val Leu Pro Leu Phe Lys Ala Met Glu Ser Asn Glu
115 120 125
Arg Phe Gly Thr Asp Ala Phe His Gly Gly Asp Gly Glu Leu His Val
130 135 140
Ser Asp Leu Arg Thr Arg Asn Pro Leu Ser Asp Ala Phe Val Glu Ala
145 150 155 160
Gly Gln Gln Ala Gln Phe Pro His Ala Val Asp Phe Asn Gly Lys Met
165 170 175
Gln Asp Gly Val Gly Leu Tyr Gln Val Thr Gln His Lys Gly Arg Arg
180 185 190
Trp Ser Ser Ala Arg Ala Phe Leu Ser Lys Ala Lys Gly Arg Pro Asn
195 200 205
Leu Arg Ile Val Thr Gly Ala Arg Ala Thr Arg Ile Ile Leu Glu Gly
210 215 220
Arg Lys Ala Val Gly Val Thr Tyr Ala Ala Gly Gly Lys Leu Val Asp
225 230 235 240
Val Arg Thr Arg Gly Gly Glu Val Ile Leu Ser Gly Gly Ala Val Asn
245 250 255
Ser Pro Gln Leu Leu Leu Leu Ser Gly Ile Gly Gly Ala Ala Glu Leu
260 265 270
Asn Ala Leu Gly Ile Pro Val Val Val Asp Leu Pro Ala Val Gly Lys
275 280 285
Asn Leu Gln Asp His Leu Asp Ile Thr Ile Met His Glu Ala Asn Asp
290 295 300
Arg Thr Pro Ile Gly Ile Ala Pro Ser Phe Ile Pro Arg Ala Leu Ser
305 310 315 320
Gly Ala Leu Ser Tyr Ala Phe Leu Arg Lys Gly Phe Leu Thr Ser Asn
325 330 335
Val Ala Glu Ala Gly Gly Phe Val Lys Ser Thr Pro Ser Arg Ser Arg
340 345 350
Pro Asn Leu Gln Phe His Phe Leu Pro Thr Leu Leu Lys Asp His Gly
355 360 365
Arg Glu Met Ala Phe Gly Tyr Gly Tyr Thr Leu His Val Cys Asp Leu
370 375 380
Leu Pro Lys Ser Arg Gly Arg Ile Gly Leu Thr Ser Pro Asp Pro Leu
385 390 395 400
Asp Asp Pro Leu Ile Asp Pro Asn Tyr Leu Ser Ala Pro Glu Asp Ile
405 410 415
Glu Thr Met Val Ala Ala Val Lys Ile Gly Arg Gln Ile Leu Ser Ala
420 425 430
Pro Ser Met Ala Ala Phe Ser Lys Thr Glu Leu Val Pro Gly Pro Ser
435 440 445
Val Gln Ser Lys Ala Asp Ile Met Ala Asp Ile Arg Arg Arg Ala Glu
450 455 460
Thr Ile Tyr His Pro Val Gly Thr Cys Arg Met Gly Arg Asp Pro Gln
465 470 475 480
Ser Val Val Asp Pro Ser Leu Arg Val Arg Gly Val Gln Gly Leu Arg
485 490 495
Val Val Asp Ala Ser Val Met Pro Thr Leu Val Ala Gly Asn Thr Asn
500 505 510
Ala Pro Thr Met Met Ile Ala Glu Arg Ala Ala Glu Leu Ile Leu Gly
515 520 525
Lys Thr Lys Leu Ala Leu Ser Ala Asn Ile Glu Ala Phe Arg
530 535 540
<210> 39
<211> 1629
<212> DNA
<213> Caulobacter sp. K31
<400> 39
atgtttgact atattattgt tggagcgggg tctgccggat gcttgttggc ggagcgcttg 60
tcagccaatc ccaggacgcg ggtctgtctg cttgaggcgg gcccgcccga ccgcagcccg 120
ctgatccaca tgcccattgg gatagcgctt ctgtcaaaga gcaaaattct caattgggca 180
ttcgagacgc agccacaggc caatctcgat ggtcgacggc tgttttggcc gcgcggcaaa 240
acccttggcg gatcgagttc gatcaatgcg atggtctata tccgcgggca ccgggatgac 300
tatgactcct ggggcgaggc agccgatccg atctggtcct atgacaatgt gctcccgctg 360
ttcaaggcga tggagtccaa cgagagattt ggaaccgacg cgtttcatgg cggcgatggt 420
gagcttcacg tcagcgacct gcgaacccgc aaccccttga gcgatgcctt cgtcgaggcc 480
ggacaacagg cccagtttcc gcatgccgtc gatttcaatg ggaagatgca ggacggcgtc 540
ggcctgtacc aggtcaccca gcacaaaggc cggcgctgga gttccgcgcg cgcctttctt 600
tccaaggcca agggccggcc caatctacgg atagtcacgg gcgcgcgggc tacccggatc 660
attctggagg gccgcaaagc ggtcggcgtg acctatgccg caggcggcaa gctggtcgat 720
gtgcgaacca ggggcggcga ggtcattctt tcgggcggcg ccgtcaattc cccgcaactg 780
ctgctgcttt ccggcatcgg cggcgcggcc gagctgaacg cactcggcat tccggtggtc 840
gtcgaccttc cggcagttgg aaaaaatctg caggatcacc tcgatatcac aatcatgcat 900
gaggcgaacg atcgtacacc gatcggcatc gcaccgtcat tcatcccgcg ggcgctgtcc 960
ggagcgctat cctacgcctt ccttcgaaag ggtttcttga cgagcaacgt cgccgaggcg 1020
ggcggcttcg tcaaaagcac accttcgcgg agtcggccga atctacagtt tcatttcctc 1080
cccacgcttt tgaaggacca tgggcgcgaa atggcgttcg ggtatggcta tacattgcat 1140
gtctgcgatc ttctgcccaa gagccgaggc cgcatcgggc tcacaagccc cgacccgctc 1200
gacgatccgc tgatcgatcc aaactatctc tcggcccccg aagacattga gaccatggtc 1260
gcggcggtga agatcggccg gcaaattctg tcggcgccgt caatggcggc cttctcgaaa 1320
accgaactgg tccctgggcc atcggtccag agcaaggcgg atatcatggc ggatatccgt 1380
cggcgagcgg agacgatcta tcatccggtg ggaacatgcc ggatgggacg agaccctcag 1440
tcggttgtcg atccgtcact ccgagtgcgt ggcgtgcaag gccttcgcgt cgtcgacgcc 1500
tcggtcatgc cgacgctggt cgccggaaac accaacgccc cgacgatgat gattgcggaa 1560
agagctgccg agctcattct tgggaagacg aaactcgcac tcagcgccaa cattgaggca 1620
ttccgctaa 1629
<210> 40
<211> 704
<212> PRT
<213> Candida tropicalis
<400> 40
Met Ala Pro Phe Leu Pro Asp Gln Val Asp Tyr Lys His Val Asp Thr
1 5 10 15
Leu Met Leu Leu Cys Asp Gly Ile Ile His Glu Thr Thr Val Asp Glu
20 25 30
Ile Lys Asp Val Ile Ala Pro Asp Phe Pro Ala Asp Lys Tyr Glu Glu
35 40 45
Tyr Val Arg Thr Phe Thr Lys Pro Ser Glu Thr Pro Gly Phe Arg Glu
50 55 60
Thr Val Tyr Asn Thr Val Asn Ala Asn Thr Met Asp Ala Ile His Gln
65 70 75 80
Phe Ile Ile Leu Thr Asn Val Leu Gly Ser Arg Val Leu Ala Pro Ala
85 90 95
Leu Thr Asn Ser Leu Thr Pro Ile Lys Asp Met Ser Leu Glu Asp Arg
100 105 110
Glu Lys Leu Leu Ala Ser Trp Arg Asp Ser Pro Ile Ala Ala Lys Arg
115 120 125
Lys Leu Phe Arg Leu Val Ser Thr Leu Thr Leu Val Thr Phe Thr Arg
130 135 140
Leu Ala Asn Glu Leu His Leu Lys Ala Ile His Tyr Pro Gly Arg Glu
145 150 155 160
Asp Arg Glu Lys Ala Tyr Glu Thr Gln Glu Ile Asp Pro Phe Lys Tyr
165 170 175
Gln Phe Leu Glu Lys Pro Lys Phe Tyr Gly Ala Glu Leu Tyr Leu Pro
180 185 190
Asp Ile Asp Val Ile Ile Ile Gly Ser Gly Ala Gly Ala Gly Val Val
195 200 205
Ala His Thr Leu Thr Asn Asp Gly Phe Lys Ser Leu Val Leu Glu Lys
210 215 220
Gly Arg Tyr Phe Ser Asn Ser Glu Leu Asn Phe Asp Asp Lys Asp Gly
225 230 235 240
Val Gln Glu Leu Tyr Gln Ser Gly Gly Thr Leu Thr Thr Val Asn Gln
245 250 255
Gln Leu Phe Val Leu Ala Gly Ser Thr Phe Gly Gly Gly Thr Thr Val
260 265 270
Asn Trp Ser Ala Cys Leu Lys Thr Pro Phe Lys Val Arg Lys Glu Trp
275 280 285
Tyr Asp Glu Phe Gly Val Asp Phe Ala Ala Asp Glu Ala Tyr Asp Lys
290 295 300
Ala Gln Asp Tyr Val Trp Gln Gln Met Gly Ala Ser Thr Glu Gly Ile
305 310 315 320
Thr His Ser Leu Ala Asn Glu Ile Ile Ile Glu Gly Gly Lys Lys Leu
325 330 335
Gly Tyr Lys Ala Lys Val Leu Asp Gln Asn Ser Gly Gly His Pro His
340 345 350
His Arg Cys Gly Phe Cys Tyr Leu Gly Cys Lys His Gly Ile Lys Gln
355 360 365
Gly Ser Val Asn Asn Trp Phe Arg Asp Ala Ala Ala His Gly Ser Gln
370 375 380
Phe Met Gln Gln Val Arg Val Leu Gln Ile Leu Asn Lys Lys Gly Ile
385 390 395 400
Ala Tyr Gly Ile Leu Cys Glu Asp Val Val Thr Gly Ala Lys Phe Thr
405 410 415
Ile Thr Gly Pro Lys Lys Phe Val Val Ala Ala Gly Ala Leu Asn Thr
420 425 430
Pro Ser Val Leu Val Asn Ser Gly Phe Lys Asn Lys Asn Ile Gly Lys
435 440 445
Asn Leu Thr Leu His Pro Val Ser Val Val Phe Gly Asp Phe Gly Lys
450 455 460
Asp Val Gln Ala Asp His Phe His Asn Ser Ile Met Thr Ala Leu Cys
465 470 475 480
Ser Glu Ala Ala Asp Leu Asp Gly Lys Gly His Gly Cys Arg Ile Glu
485 490 495
Thr Ile Leu Asn Ala Pro Phe Ile Gln Ala Ser Phe Leu Pro Trp Arg
500 505 510
Gly Ser Asn Glu Ala Arg Arg Asp Leu Leu Arg Tyr Asn Asn Met Val
515 520 525
Ala Met Leu Leu Leu Ser Arg Asp Thr Thr Ser Gly Ser Val Ser Ser
530 535 540
His Pro Thr Lys Pro Glu Ala Leu Val Val Glu Tyr Asp Val Asn Lys
545 550 555 560
Phe Asp Arg Asn Ser Ile Leu Gln Ala Leu Leu Val Thr Ala Asp Leu
565 570 575
Leu Tyr Ile Gln Gly Ala Lys Arg Ile Leu Ser Pro Gln Pro Trp Val
580 585 590
Pro Ile Phe Glu Ser Asp Lys Pro Lys Asp Lys Arg Ser Ile Lys Asp
595 600 605
Glu Asp Tyr Val Glu Trp Arg Ala Lys Val Ala Lys Ile Pro Phe Asp
610 615 620
Thr Tyr Gly Ser Pro Tyr Gly Ser Ala His Gln Met Ser Ser Cys Arg
625 630 635 640
Met Ser Gly Lys Gly Pro Lys Tyr Gly Ala Val Asp Thr Asp Gly Arg
645 650 655
Leu Phe Glu Cys Ser Asn Val Tyr Val Ala Asp Ala Ser Leu Leu Pro
660 665 670
Thr Ala Ser Gly Ala Asn Pro Met Val Thr Thr Met Thr Leu Ala Arg
675 680 685
His Val Ala Leu Gly Leu Ala Asp Ser Leu Lys Thr Lys Ala Lys Leu
690 695 700
<210> 41
<211> 2115
<212> DNA
<213> Candida tropicalis
<400> 41
atggctccat ttttgcccga ccaggtcgac tacaaacacg tcgacaccct tatgttatta 60
tgtgacggga tcatccacga aaccaccgtg gacgaaatca aagacgtcat tgcccctgac 120
ttccccgccg acaaatacga ggagtacgtc aggacattca ccaaaccctc cgaaacccca 180
gggttcaggg aaaccgtcta caacaccgtc aacgcaaaca ccatggatgc aatccaccag 240
ttcattatct tgaccaatgt tttgggatca agggtcttgg caccagcttt gaccaactcg 300
ttgactccta tcaaggacat gagcttggaa gaccgtgaaa agttgttagc ctcgtggcgt 360
gactccccta ttgctgctaa aaggaagttg ttcaggttgg tttctacgct taccttggtc 420
acgttcacga gattggccaa tgagttgcat ttgaaagcca ttcattatcc aggaagagaa 480
gaccgtgaaa aggcttatga aacccaggag attgaccctt ttaagtacca gtttttggaa 540
aaaccgaagt tttacggcgc tgagttgtac ttgccagata ttgatgtgat cattattgga 600
tctggggccg gtgctggtgt cgtggcccac actttgacca acgacggctt caagagtttg 660
gttttggaaa agggcagata ctttagcaac tccgagttga actttgatga caaggacggg 720
gttcaagaat tataccaaag tggaggtact ttgaccaccg tcaaccagca gttgtttgtt 780
cttgctggtt ccacttttgg tggtggtacc actgtcaatt ggtcggcctg tcttaaaacg 840
ccattcaagg tgcgtaagga atggtatgat gagtttggcg ttgactttgc tgccgatgaa 900
gcctacgaca aagcacagga ttatgtttgg cagcaaatgg gagcttctac cgaaggcatc 960
acccactctt tggctaacga gattattatt gaaggtggca agaaattagg ttacaaggcc 1020
aaggtattag accaaaacag cggtggtcat cctcatcaca gatgcggttt ctgttatttg 1080
ggttgtaagc acggtatcaa gcagggctct gttaataact ggtttagaga cgcagctgcc 1140
cacggttctc agttcatgca acaggttaga gttttgcaaa tccttaacaa gaagggcatc 1200
gcttatggta tcttgtgtga ggatgttgta accggtgcca agttcaccat tactggcccc 1260
aaaaagtttg ttgttgccgc cggcgcctta aacactccat ctgtgttggt caactccgga 1320
ttcaagaaca agaacatcgg taagaactta actttgcatc cagtttctgt cgtgtttggt 1380
gattttggca aagacgttca agcagatcac ttccacaact ccatcatgac tgctctttgt 1440
tcagaagccg ctgatttaga cggcaagggt catggatgca gaattgaaac catcttgaac 1500
gctccattca tccaggcttc attcttacca tggagaggta gtaacgaggc tagacgagac 1560
ttgttgcgtt acaacaacat ggtggccatg ttacttctta gtcgtgatac caccagtggt 1620
tccgtttcgt cccatccaac taaacctgaa gcattagttg tcgagtacga cgtgaacaag 1680
tttgacagaa actccatctt gcaggcattg ttggtcactg ctgacttgtt gtacattcaa 1740
ggtgccaaga gaatccttag tccccaacca tgggtgccaa tttttgaatc cgacaagcca 1800
aaggataaga gatcaatcaa ggacgaggac tatgtcgaat ggagagccaa ggttgccaag 1860
attccttttg acacctacgg ctcgccttat ggttcggcgc atcaaatgtc ttcttgtcgt 1920
atgtcaggta agggtcctaa atacggtgct gttgataccg atggtagatt gtttgaatgt 1980
tcgaatgttt atgttgctga cgctagtctt ttgccaactg ctagcggtgc taatcctatg 2040
gtcaccacca tgactcttgc aagacatgtt gcgttaggtt tggcagactc cttgaagacc 2100
aaggccaagt tgtag 2115
<210> 42
<211> 470
<212> PRT
<213> Marinobacter aquaeolei VT8
<400> 42
Met Pro Thr Leu Pro Arg Thr Phe Asp Asp Ile Gln Ser Arg Leu Ile
1 5 10 15
Asn Ala Thr Ser Arg Val Val Pro Met Gln Arg Gln Ile Gln Gly Leu
20 25 30
Lys Phe Leu Met Ser Ala Lys Arg Lys Thr Phe Gly Pro Arg Arg Pro
35 40 45
Met Pro Glu Phe Val Glu Thr Pro Ile Pro Asp Val Asn Thr Leu Ala
50 55 60
Leu Glu Asp Ile Asp Val Ser Asn Pro Phe Leu Tyr Arg Gln Gly Gln
65 70 75 80
Trp Arg Ala Tyr Phe Lys Arg Leu Arg Asp Glu Ala Pro Val His Tyr
85 90 95
Gln Lys Asn Ser Pro Phe Gly Pro Phe Trp Ser Val Thr Arg Phe Glu
100 105 110
Asp Ile Leu Phe Val Asp Lys Ser His Asp Leu Phe Ser Ala Glu Pro
115 120 125
Gln Ile Ile Leu Gly Asp Pro Pro Glu Gly Leu Ser Val Glu Met Phe
130 135 140
Ile Ala Met Asp Pro Pro Lys His Asp Val Gln Arg Ser Ser Val Gln
145 150 155 160
Gly Val Val Ala Pro Lys Asn Leu Lys Glu Met Glu Gly Leu Ile Arg
165 170 175
Ser Arg Thr Gly Asp Val Leu Asp Ser Leu Pro Thr Asp Lys Pro Phe
180 185 190
Asn Trp Val Pro Ala Val Ser Lys Glu Leu Thr Gly Arg Met Leu Ala
195 200 205
Thr Leu Leu Asp Phe Pro Tyr Glu Glu Arg His Lys Leu Val Glu Trp
210 215 220
Ser Asp Arg Met Ala Gly Ala Ala Ser Ala Thr Gly Gly Glu Phe Ala
225 230 235 240
Asp Glu Asn Ala Met Phe Asp Asp Ala Ala Asp Met Ala Arg Ser Phe
245 250 255
Ser Arg Leu Trp Arg Asp Lys Glu Ala Arg Arg Ala Ala Gly Glu Glu
260 265 270
Pro Gly Phe Asp Leu Ile Ser Leu Leu Gln Ser Asn Lys Glu Thr Lys
275 280 285
Asp Leu Ile Asn Arg Pro Met Glu Phe Ile Gly Asn Leu Thr Leu Leu
290 295 300
Ile Val Gly Gly Asn Asp Thr Thr Arg Asn Ser Met Ser Gly Gly Leu
305 310 315 320
Val Ala Met Asn Glu Phe Pro Arg Glu Phe Glu Lys Leu Lys Ala Lys
325 330 335
Pro Glu Leu Ile Pro Asn Met Val Ser Glu Ile Ile Arg Trp Gln Thr
340 345 350
Pro Leu Ala Tyr Met Arg Arg Ile Ala Lys Gln Asp Val Glu Leu Gly
355 360 365
Gly Gln Thr Ile Lys Lys Gly Asp Arg Val Val Met Trp Tyr Ala Ser
370 375 380
Gly Asn Arg Asp Glu Arg Lys Phe Asp Asn Pro Asp Gln Phe Ile Ile
385 390 395 400
Asp Arg Lys Asp Ala Arg Asn His Met Ser Phe Gly Tyr Gly Val His
405 410 415
Arg Cys Met Gly Asn Arg Leu Ala Glu Leu Gln Leu Arg Ile Leu Trp
420 425 430
Glu Glu Ile Leu Lys Arg Phe Asp Asn Ile Glu Val Val Glu Glu Pro
435 440 445
Glu Arg Val Gln Ser Asn Phe Val Arg Gly Tyr Ser Arg Leu Met Val
450 455 460
Lys Leu Thr Pro Asn Ser
465 470
<210> 43
<211> 1413
<212> DNA
<213> Marinobacter aquaeolei VT8
<400> 43
atgccaacac tgcccagaac atttgacgac attcagtccc gactgattaa cgccacctcc 60
agggtggtgc cgatgcagag gcaaattcag ggactgaaat tcttaatgag cgccaagagg 120
aagaccttcg gcccacgccg accgatgccc gaattcgttg aaacacccat cccggacgtt 180
aacacgctgg cccttgagga catcgatgtc agcaatccgt ttttataccg gcagggtcag 240
tggcgcgcct atttcaaacg gttgcgtgat gaggcgccgg tccattacca gaagaacagc 300
cctttcggcc ccttctggtc ggtaactcgg tttgaagaca tcctgttcgt ggataagagt 360
cacgacctgt tttccgccga gccgcaaatc attctcggtg accctccgga ggggctgtcg 420
gtggaaatgt tcatagcgat ggatccgccg aaacacgatg tgcagcgcag ctcggtgcag 480
ggagtagtgg caccgaaaaa cctgaaggag atggaggggc tgatccgatc acgcaccggc 540
gatgtgcttg acagcctgcc tacagacaaa ccctttaact gggtacctgc tgtttccaag 600
gaactcacag gccgcatgct ggcgacgctt ctggattttc cttacgagga acgccacaag 660
ctggttgagt ggtcggacag aatggcaggt gcagcatcgg ccaccggcgg ggagtttgcc 720
gatgaaaatg ccatgtttga cgacgcggca gacatggccc ggtctttctc caggctttgg 780
cgggacaagg aggcgcgccg cgcagcaggc gaggagcccg gtttcgattt gatcagcctg 840
ttgcagagca acaaagaaac gaaagacctg atcaatcggc cgatggagtt tatcggtaat 900
ttgacgctgc tcatagtcgg cggcaacgat acgacgcgca actcgatgag tggtggcctg 960
gtggccatga acgaattccc cagggaattt gaaaaattga aggcaaaacc ggagttgatt 1020
ccgaacatgg tgtcggaaat catccgctgg caaacgccgc tggcctatat gcgccgaatc 1080
gccaagcagg atgtcgaact gggcggccag accatcaaga agggtgatcg agttgtcatg 1140
tggtacgcgt cgggtaaccg ggacgagcgc aaatttgaca accccgatca gttcatcatt 1200
gatcgcaagg acgcacgaaa ccacatgtcg ttcggctatg gggttcaccg ttgcatgggc 1260
aaccgtctgg ctgaactgca actgcgcatc ctctgggaag aaatactcaa gcgttttgac 1320
aacatcgaag tcgtcgaaga gcccgagcgg gtgcagtcca acttcgtgcg gggctattcc 1380
aggttgatgg tcaaactgac accgaacagt taa 1413
<210> 44
<211> 497
<212> PRT
<213> Acinetobacter sp. OC4
<400> 44
Met Asn Ser Val Ala Glu Ile Phe Glu Lys Ile Thr Gln Thr Val Thr
1 5 10 15
Ser Thr Ala Ala Asp Val Ala Thr Thr Val Thr Asp Lys Val Lys Ser
20 25 30
Asn Glu Gln Phe Gln Thr Gly Lys Gln Phe Leu His Gly Gln Val Thr
35 40 45
Arg Phe Val Pro Leu His Thr Gln Val Arg Gly Ile Gln Trp Met Gln
50 55 60
Lys Ala Lys Phe Arg Val Phe Asn Val Gln Glu Phe Pro Ala Phe Ile
65 70 75 80
Glu Gln Pro Ile Pro Glu Val Ala Thr Leu Ala Leu Ala Glu Ile Asp
85 90 95
Val Ser Asn Pro Phe Leu Tyr Lys Gln Lys Lys Trp Gln Ser Tyr Phe
100 105 110
Lys Arg Leu Arg Asp Glu Ala Pro Val His Tyr Gln Ala Asn Ser Pro
115 120 125
Phe Gly Ala Phe Trp Ser Val Thr Arg Tyr Asp Asp Ile Val Tyr Val
130 135 140
Asp Lys Asn His Glu Ile Phe Ser Ala Glu Pro Val Ile Ala Ile Gly
145 150 155 160
Asn Thr Pro Pro Gly Leu Gly Ala Glu Met Phe Ile Ala Met Asp Pro
165 170 175
Pro Lys His Asp Val Gln Arg Gln Ala Val Gln Asp Val Val Ala Pro
180 185 190
Lys Asn Leu Lys Glu Leu Glu Gly Leu Ile Arg Leu Arg Val Gln Glu
195 200 205
Val Leu Asp Gln Leu Pro Thr Asp Gln Pro Phe Asp Trp Val Gln Asn
210 215 220
Val Ser Ile Glu Leu Thr Ala Arg Met Leu Ala Thr Leu Phe Asp Phe
225 230 235 240
Pro Tyr Glu Lys Arg His Lys Leu Val Glu Trp Ser Asp Leu Met Ala
245 250 255
Gly Thr Ala Glu Ala Thr Gly Gly Thr Val Thr Asn Leu Asp Glu Ile
260 265 270
Phe Asp Ala Ala Val Asp Ala Ala Lys His Phe Ala Glu Leu Trp His
275 280 285
Arg Lys Ala Ala Gln Lys Ser Ala Gly Ala Glu Met Gly Tyr Asp Leu
290 295 300
Ile Ser Leu Met Gln Ser Asn Glu Ala Thr Lys Asp Leu Ile Tyr Arg
305 310 315 320
Pro Met Glu Phe Met Gly Asn Leu Val Leu Leu Ile Val Gly Gly Asn
325 330 335
Asp Thr Thr Arg Asn Ser Met Thr Gly Gly Val Tyr Ala Leu Asn Leu
340 345 350
Phe Pro Asn Glu Phe Val Lys Leu Lys Asn Asn Pro Ser Leu Ile Pro
355 360 365
Asn Met Val Ser Glu Ile Ile Arg Trp Gln Thr Pro Leu Ala Tyr Met
370 375 380
Arg Arg Ile Ala Lys Gln Asp Val Glu Leu Asn Gly Gln Thr Ile Lys
385 390 395 400
Lys Gly Asp Lys Val Val Met Trp Tyr Val Ser Gly Asn Arg Asp Glu
405 410 415
Arg Val Ile Glu Arg Pro Asp Glu Leu Ile Ile Asp Arg Lys Gly Ala
420 425 430
Arg Asn His Leu Ser Phe Gly Phe Gly Val His Arg Cys Met Gly Asn
435 440 445
Arg Leu Ala Glu Met Gln Leu Arg Ile Leu Trp Glu Glu Leu Leu Gln
450 455 460
Arg Phe Glu Asn Ile Glu Val Leu Gly Glu Pro Glu Ile Val Gln Ser
465 470 475 480
Asn Phe Val Arg Gly Tyr Ala Lys Met Met Val Lys Leu Thr Ala Lys
485 490 495
Ala
<210> 45
<211> 1494
<212> DNA
<213> Acinetobacter sp. OC4
<400> 45
atgaacagcg tagcagaaat cttcgagaaa atcacccaga ccgtcacgag caccgcagcc 60
gacgttgcaa ccactgtcac cgacaaggtc aaatccaacg agcaattcca aacgggtaag 120
cagtttttgc acggccaggt tacgcgtttt gtgccgttgc atacgcaggt ccgtggtatt 180
cagtggatgc agaaagccaa gtttcgcgtg ttcaatgtcc aggagttccc ggcctttatt 240
gaacaaccga ttccggaagt ggcaaccctg gcgctggcgg aaattgatgt cagcaacccg 300
ttcctgtaca aacagaaaaa gtggcaaagc tacttcaaac gtctgcgtga cgaggcgcca 360
gtgcattatc aagctaacag cccgttcggt gccttttggt ctgtgacccg ttacgacgac 420
atcgtttatg ttgacaaaaa tcatgaaatc ttctccgcag aaccggtgat tgctattggt 480
aataccccgc caggcctggg tgcggaaatg ttcatcgcaa tggaccctcc gaagcacgac 540
gtccagcgcc aagcggtgca ggacgtagtt gcgccgaaga acttgaagga actggagggt 600
ctgatccgcc tgcgtgtgca ggaagtcctg gatcagctgc cgacggatca gccgtttgac 660
tgggtgcaga acgtgtcgat tgaactgacc gcacgcatgc tggcgaccct gttcgacttt 720
ccgtatgaaa aacgtcataa actggtggaa tggtccgatc tgatggcagg tacggctgag 780
gccaccggcg gtactgtgac caatctggat gagatttttg atgcggcggt tgacgcagcc 840
aagcacttcg ctgagctgtg gcaccgcaaa gcggctcaaa agagcgccgg tgcggagatg 900
ggctacgatc tgatttccct gatgcaaagc aacgaggcga cgaaagacct gatctatcgc 960
ccgatggagt tcatgggtaa cctggtcctg ctgattgttg gtggcaatga caccacccgt 1020
aatagcatga cgggcggtgt ttatgcactg aacttgtttc cgaatgagtt tgtcaaactg 1080
aagaataacc cgagcctgat ccctaacatg gttagcgaga ttatccgctg gcagacgccg 1140
ctggcttaca tgcgccgcat tgccaagcaa gacgtcgagc tgaacggtca aaccatcaaa 1200
aagggcgata aggtcgtgat gtggtacgtc agcggtaacc gcgacgaacg tgtgattgaa 1260
cgcccggacg aactgatcat tgatcgcaag ggtgcgcgta accacctgtc cttcggcttt 1320
ggcgtgcatc gctgcatggg taatcgtttg gccgagatgc aactgcgtat cctgtgggag 1380
gaactgttgc aacgcttcga gaatatcgaa gttctgggcg aaccggagat tgttcagtcc 1440
aatttcgtgc gtggctatgc gaagatgatg gttaaactga cggcgaaagc ctaa 1494
<210> 46
<211> 558
<212> PRT
<213> Pseudomonas putida
<400> 46
Met Tyr Asp Tyr Ile Ile Val Gly Ala Gly Ser Ala Gly Cys Val Leu
1 5 10 15
Ala Asn Arg Leu Ser Ala Asp Pro Ser Lys Arg Val Cys Leu Leu Glu
20 25 30
Ala Gly Pro Arg Asp Thr Asn Pro Leu Ile His Met Pro Leu Gly Ile
35 40 45
Ala Leu Leu Ser Asn Ser Lys Lys Leu Asn Trp Ala Phe Gln Thr Ala
50 55 60
Pro Gln Gln Asn Leu Asn Gly Arg Ser Leu Phe Trp Pro Arg Gly Lys
65 70 75 80
Thr Leu Gly Gly Ser Ser Ser Ile Asn Ala Met Val Tyr Ile Arg Gly
85 90 95
His Glu Asp Asp Tyr His Ala Trp Glu Gln Ala Ala Gly Arg Tyr Trp
100 105 110
Gly Trp Tyr Arg Ala Leu Glu Leu Phe Lys Arg Leu Glu Cys Asn Gln
115 120 125
Arg Phe Asp Lys Ser Glu His His Gly Val Asp Gly Glu Leu Ala Val
130 135 140
Ser Asp Leu Lys Tyr Ile Asn Pro Leu Ser Lys Ala Phe Val Gln Ala
145 150 155 160
Gly Met Glu Ala Asn Ile Asn Phe Asn Gly Asp Phe Asn Gly Glu Tyr
165 170 175
Gln Asp Gly Val Gly Phe Tyr Gln Val Thr Gln Lys Asn Gly Gln Arg
180 185 190
Trp Ser Ser Ala Arg Ala Phe Leu His Gly Val Leu Ser Arg Pro Asn
195 200 205
Leu Asp Ile Ile Thr Asp Ala His Ala Ser Lys Ile Leu Phe Glu Asp
210 215 220
Arg Lys Ala Val Gly Val Ser Tyr Ile Lys Lys Asn Met His His Gln
225 230 235 240
Val Lys Thr Thr Ser Gly Gly Glu Val Leu Leu Ser Leu Gly Ala Val
245 250 255
Gly Thr Pro His Leu Leu Met Leu Ser Gly Val Gly Ala Ala Ala Glu
260 265 270
Leu Lys Glu His Gly Val Ser Leu Val His Asp Leu Pro Glu Val Gly
275 280 285
Lys Asn Leu Gln Asp His Leu Asp Ile Thr Leu Met Cys Ala Ala Asn
290 295 300
Ser Arg Glu Pro Ile Gly Val Ala Leu Ser Phe Ile Pro Arg Gly Val
305 310 315 320
Ser Gly Leu Phe Ser Tyr Val Phe Lys Arg Glu Gly Phe Leu Thr Ser
325 330 335
Asn Val Ala Glu Ser Gly Gly Phe Val Lys Ser Ser Pro Asp Arg Asp
340 345 350
Arg Pro Asn Leu Gln Phe His Phe Leu Pro Thr Tyr Leu Lys Asp His
355 360 365
Gly Arg Lys Ile Ala Gly Gly Tyr Gly Tyr Thr Leu His Ile Cys Asp
370 375 380
Leu Leu Pro Lys Ser Arg Gly Arg Ile Gly Leu Lys Ser Ala Asn Pro
385 390 395 400
Leu Gln Pro Pro Leu Ile Asp Pro Asn Tyr Leu Ser Asp His Glu Asp
405 410 415
Ile Lys Thr Met Ile Ala Gly Ile Lys Ile Gly Arg Ala Ile Leu Gln
420 425 430
Ala Pro Ser Met Ala Lys His Phe Lys His Glu Val Val Pro Gly Gln
435 440 445
Ala Val Lys Thr Asp Asp Glu Ile Ile Glu Asp Ile Arg Arg Arg Ala
450 455 460
Glu Thr Ile Tyr His Pro Val Gly Thr Cys Arg Met Gly Lys Asp Pro
465 470 475 480
Ala Ser Val Val Asp Pro Cys Leu Lys Ile Arg Gly Leu Ala Asn Ile
485 490 495
Arg Val Val Asp Ala Ser Ile Met Pro His Leu Val Ala Gly Asn Thr
500 505 510
Asn Ala Pro Thr Ile Met Ile Ala Glu Asn Ala Ala Glu Ile Ile Met
515 520 525
Arg Asn Leu Asp Val Glu Ala Leu Glu Ala Ser Ala Glu Phe Ala Arg
530 535 540
Glu Gly Ala Glu Leu Glu Leu Ala Met Ile Ala Val Cys Met
545 550 555
<210> 47
<211> 1677
<212> DNA
<213> Pseudomonas putida
<400> 47
atgtacgact atataatcgt tggtgctgga tctgcaggat gtgtgcttgc taatcgtctt 60
tcggccgacc cctctaaaag agtttgttta cttgaagctg ggccgcgaga tacgaatccg 120
ctaattcaca tgccgttagg tattgctttg ctttcaaata gtaaaaagtt gaattgggct 180
tttcaaactg cgccacagca aaatctcaac ggccggagcc ttttctggcc acgaggaaaa 240
acgttaggtg gttcaagctc aatcaacgca atggtctata tccgagggca tgaagacgat 300
taccacgcat gggagcaggc ggccggccgc tactggggtt ggtaccgggc tcttgagttg 360
ttcaaaaggc ttgaatgcaa ccagcgattc gataagtccg agcaccatgg ggttgacgga 420
gaattagctg ttagtgattt aaaatatatc aatccgctta gcaaagcatt cgtgcaagcc 480
ggcatggagg ccaatattaa tttcaacgga gatttcaacg gcgagtacca ggacggcgta 540
gggttctatc aagtaaccca aaaaaatgga caacgctgga gctcggcgcg tgcattcttg 600
cacggtgtac tttccagacc aaatctagac atcattactg atgcgcatgc atcaaaaatt 660
ctttttgaag accgtaaggc ggttggtgtt tcttatataa agaaaaatat gcaccatcaa 720
gtcaagacaa cgagtggtgg tgaagtactt cttagtcttg gcgcagtcgg cacgcctcac 780
cttctaatgc tttctggtgt tggggctgca gccgagctta aggaacatgg tgtttctcta 840
gtccatgatc ttcctgaggt ggggaaaaat cttcaagatc atttggacat cacattgatg 900
tgcgcagcaa attcgagaga gccgataggt gttgctcttt ctttcatccc tcgtggtgtc 960
tcgggtttgt tttcgtatgt gtttaagcgc gaggggtttc tcactagtaa cgtggcagag 1020
tcgggtggtt ttgtaaaaag ttctcctgat cgtgatcggc ccaatttgca gtttcatttc 1080
cttccaactt atcttaaaga tcacggtcga aaaatagcgg gtggttatgg ttatacgcta 1140
catatatgtg atcttttgcc taagagccga ggcagaattg gcctaaaaag cgccaatcca 1200
ttacagccgc ctttaattga cccgaactat cttagcgatc atgaagatat taaaaccatg 1260
attgcgggta ttaagatagg gcgcgctatt ttgcaggccc catcgatggc gaagcatttt 1320
aagcatgaag tagtaccggg ccaggctgtt aaaactgatg atgaaataat cgaagatatt 1380
cgtaggcgag ctgagactat ataccatccg gtaggtactt gtaggatggg taaagatcca 1440
gcttcagttg ttgatccgtg cctgaagatc cgtgggttgg caaatattag agtcgttgat 1500
gcgtcaatta tgccgcactt ggtcgcgggt aacacaaacg ctccaactat tatgattgca 1560
gaaaatgcgg cagaaataat tatgcggaat cttgatgtgg aagcattaga ggctagcgct 1620
gagtttgctc gcgagggtgc agagctagag ttggccatga tagctgtctg catgtaa 1677
<210> 48
<211> 452
<212> PRT
<213> Pseudomonas putida
<400> 48
Met Ser Glu Gln Asn Ser Gln Thr Leu Ala Trp Gln Ser Met Ser Arg
1 5 10 15
Asp His His Leu Ala Pro Phe Ser Asp Val Lys Gln Leu Ala Glu Lys
20 25 30
Gly Pro Arg Ile Ile Thr Ser Ala Lys Gly Val Tyr Leu Trp Asp Ser
35 40 45
Glu Gly Asn Lys Ile Leu Asp Gly Met Ala Gly Leu Trp Cys Val Ala
50 55 60
Val Gly Tyr Gly Arg Asp Glu Leu Ala Glu Val Ala Ser Gln Gln Met
65 70 75 80
Lys Gln Leu Pro Tyr Tyr Asn Leu Phe Phe Gln Thr Ala His Pro Pro
85 90 95
Ala Leu Glu Leu Ala Lys Ala Ile Ala Asp Val Ala Pro Gln Gly Met
100 105 110
Asn His Val Phe Phe Thr Gly Ser Gly Ser Glu Gly Asn Asp Thr Val
115 120 125
Leu Arg Met Val Arg His Tyr Trp Ala Leu Lys Gly Lys Lys Asn Lys
130 135 140
Asn Val Ile Ile Gly Arg Ile Asn Gly Tyr His Gly Ser Thr Val Ala
145 150 155 160
Gly Ala Ala Leu Gly Gly Met Ser Gly Met His Gln Gln Gly Gly Val
165 170 175
Ile Pro Asp Ile Val His Ile Pro Gln Pro Tyr Trp Phe Gly Glu Gly
180 185 190
Gly Asp Met Thr Glu Ala Asp Phe Gly Val Trp Ala Ala Glu Gln Leu
195 200 205
Glu Lys Lys Ile Leu Glu Val Gly Val Asp Asn Val Ala Ala Phe Ile
210 215 220
Ala Glu Pro Ile Gln Gly Ala Gly Gly Val Ile Ile Pro Pro Gln Thr
225 230 235 240
Tyr Trp Pro Lys Val Lys Glu Ile Leu Ala Arg Tyr Asp Ile Leu Phe
245 250 255
Val Ala Asp Glu Val Ile Cys Gly Phe Gly Arg Thr Gly Glu Trp Phe
260 265 270
Gly Thr Asp Tyr Tyr Asp Leu Lys Pro Asp Leu Met Thr Ile Ala Lys
275 280 285
Gly Leu Thr Ser Gly Tyr Ile Pro Met Gly Gly Val Ile Val Arg Asp
290 295 300
Glu Val Ala Lys Val Ile Ser Glu Gly Gly Asp Phe Asn His Gly Phe
305 310 315 320
Thr Tyr Ser Gly His Pro Val Ala Ala Ala Val Gly Leu Glu Asn Leu
325 330 335
Arg Ile Leu Arg Asp Glu Gln Ile Ile Gln Gln Val His Asp Lys Thr
340 345 350
Ala Pro Tyr Leu Gln Gln Arg Leu Arg Glu Leu Ala Asp His Pro Leu
355 360 365
Val Gly Glu Val Arg Gly Leu Gly Met Leu Gly Ala Ile Glu Leu Val
370 375 380
Lys Asp Lys Ala Thr Arg Ala Arg Tyr Glu Gly Lys Gly Val Gly Met
385 390 395 400
Ile Cys Arg Gln His Cys Phe Asp Asn Gly Leu Ile Met Arg Ala Val
405 410 415
Gly Asp Thr Met Ile Ile Ala Pro Pro Leu Val Ile Ser Ile Glu Glu
420 425 430
Ile Asp Glu Leu Val Glu Lys Ala Arg Lys Cys Leu Asp Leu Thr Tyr
435 440 445
Glu Ala Val Arg
450
<210> 49
<211> 1359
<212> DNA
<213> Pseudomonas putida
<400> 49
atgagcgaac aaaactccca aaccctggcc tggcagagca tgagccgcga tcaccatctg 60
gcgccgttca gcgatgtcaa gcaattggct gaaaaaggtc cgcgcattat cacgtccgcg 120
aaaggcgtgt acctgtggga ttccgaaggt aacaaaatcc tggatggtat ggcgggtctg 180
tggtgtgtgg cggtcggtta cggtcgtgat gaactggctg aggtcgccag ccaacaaatg 240
aagcaactgc cgtactacaa cctgttcttc caaacggccc atccgccagc actggagctg 300
gcaaaagcca ttgccgacgt agctccgcaa ggtatgaacc atgttttctt caccggttct 360
ggcagcgaag gcaatgatac cgtgctgcgt atggtacgcc actattgggc cctgaaaggt 420
aagaagaaca agaatgtcat cattggtcgt atcaacggtt accacggtag caccgtcgct 480
ggtgcggcgc tgggtggcat gagcggcatg caccagcagg gtggtgtcat tccggacatt 540
gtccacattc cgcagccgta ttggtttggt gaaggcggtg acatgacgga agcggacttc 600
ggtgtgtggg ctgccgagca attggagaag aagatcctgg aggttggcgt tgataatgtt 660
gcggcgttca ttgcggaacc gatccaaggc gctggcggtg ttatcatccc gccacagacg 720
tattggccaa aggtgaaaga gatcctggcg cgctacgata tcttgttcgt tgctgacgag 780
gtgatctgcg gttttggtcg caccggcgaa tggtttggca ccgattacta cgacctgaaa 840
ccggatctga tgaccatcgc caaaggtctg accagcggct acattccgat gggtggtgtc 900
attgtccgtg atgaggttgc caaggttatc agcgagggcg gtgactttaa ccacggcttt 960
acctacagcg gccatccggt ggcggcagcg gtcggtctgg aaaatctgcg tattctgcgt 1020
gacgagcaga tcatccaaca agttcacgac aagaccgctc cgtatctgca acagcgcctg 1080
cgcgaactgg cggaccatcc gctggttggc gaagtccgtg gcttgggcat gctgggtgcg 1140
attgaactgg ttaaagacaa ggcaacccgt gcacgttatg agggtaaggg cgtgggcatg 1200
atctgccgtc agcactgttt cgacaacggt ctgattatgc gtgcggttgg cgataccatg 1260
attatcgcac cgccgttggt catcagcatc gaagaaatcg acgaattggt agagaaagcg 1320
cgtaagtgtc tggatctgac gtatgaggct gtgcgctaa 1359
<210> 50
<211> 143
<212> DNA
<213> Artificial
<220>
<223> Isolated AlkB promoter
<400> 50
aactacccgt aggtgtagtt ggcgcaagcg tccgattagc tcaggtttaa gatgtcgaga 60
gtgagagtgg gcggcttaac tttctcagtt aggcataaaa ttacgtctta aatctcgtag 120
cgactaattt aataaaaatt gga 143
<210> 51
<211> 40
<212> DNA
<213> Artificial
<220>
<223> Primer
<400> 51
ttaataaaaa ttggagtaca gacttttggt aggagaatgc 40
<210> 52
<211> 32
<212> DNA
<213> Artificial
<220>
<223> Primer
<400> 52
ccttgggctt attttttagc cgtcaactta ac 32
<210> 53
<211> 32
<212> DNA
<213> Artificial
<220>
<223> Primer
<400> 53
aaaaataagc ccaaggcaca gataaagaga ga 32
<210> 54
<211> 32
<212> DNA
<213> Artificial
<220>
<223> Primer
<400> 54
tagatccttc agatcaaaga ctttaattca ac 32
<210> 55
<211> 32
<212> DNA
<213> Artificial
<220>
<223> Primer
<400> 55
tgatctgaag gatctaggaa ccaaggagag tg 32
<210> 56
<211> 40
<212> DNA
<213> Artificial
<220>
<223> Primer
<400> 56
cttggctgca ggtcgattag aaaacatatg acgcaccaag 40
<210> 57
<211> 7675
<212> DNA
<213> Artificial
<220>
<223> Vector
<400> 57
atcgattgga tgcccgaggc atagactgta caaaaaaaca gtcataacaa gccatgaaaa 60
ccgccactgc gccgttacca ccgctgcgtt cggtcaaggt tctggaccag ttgcgtgagc 120
gcatacgcta cttgcattac agtttacgaa ccgaacaggc ttatgtcaat tcgcctctca 180
ggcgccgctg gtgccgctgg ttggacgcca agggtgaatc cgcctcgata ccctgattac 240
tcgcttcctg cgccctctca ggcggcgata ggggactggt aaaacgggga ttgcccagac 300
gcctcccccg ccccttcagg ggcacaaatg cggccccaac ggggccacgt agtggtgcgt 360
tttttgcgtt tccacccttt tcttcctttt cccttttaaa ccttttagga cgtctacagg 420
ccacgtaatc cgtggcctgt agagtttaaa aagggacgga tttgttgcca ttaagggacg 480
gatttgttgt taagaaggga cggatttgtt gttgtaaagg gacggatttg ttgtattgtg 540
ggacgcagat acagtgtccc cttatacaca aggaatgtcg aacgtggcct cacccccaat 600
ggtttacaaa agcaatgccc tggtcgaggc cgcgtatcgc ctcagtgttc aggaacagcg 660
gatcgttctg gcctgtatta gccaggtgaa gaggagcgag cctgtcaccg atgaagtgat 720
gtattcagtg acggcggagg acatagcgac gatggcgggt gtccctatcg aatcttccta 780
caaccagctc aaagaagcgg ccctgcgcct gaaacggcgg gaagtccggt taacccaaga 840
gcccaatggc aaggggaaaa gaccgagtgt gatgattacc ggctgggtgc aaacaatcat 900
ctaccgggag ggtgagggcc gtgtagaact caggttcacc aaagacatgc tgccgtacct 960
gacggaactc accaaacagt tcaccaaata cgccttggct gacgtggcca agatggacag 1020
cacccacgcg atcaggcttt acgagctgct catgcaatgg gacagcatcg gccagcgcga 1080
aatagaaatt gaccagctgc gaaagtggtt tcaactggaa ggccggtatc cctcgatcaa 1140
ggacttcaag ttgcgagtgc ttgatccagc cgtgacgcag atcaacgagc acagcccgct 1200
acaggtggag tgggcgcagc gaaagaccgg gcgcaaggtc acacatctgt tgttcagttt 1260
tggaccgaag aagcccgcca aggcggtggg taaggcccca gcgaagcgca aggccgggaa 1320
gatttcagat gctgagatcg cgaaacaggc tcgccctggt gagacatggg aagcggcccg 1380
cgctcgacta acccagatgc cgctggatct ggcctagagg ccgtggccac cacggcccgg 1440
cctgcctttc aggctgcatt attgaagcat ttatcagggt tattgtctca tgagcggata 1500
catatttgaa tgtatttaga aaaataaaca aaagagtttg tagaaacgca aaaaggccat 1560
ccgtcaggat ggccttctgc ttaatttgat gcctggcagt ttatggcggg cgtcctgccc 1620
gccaccctcc gggccgttgc ttcgcaacgt tcaaatccgc tcccggcgga tttgtcctac 1680
tcaggagagc gttcaccgac aaacaacaga taaaacgaaa ggcccagtct ttcgactgag 1740
cctttcgttt tatttgatgc ctggcagttc cctactctcg catggggaga ccccacacta 1800
ccatcggcgc tacggcgttt cacttctgag ttcggcatgg ggtcaggtgg gaccaccgcg 1860
ctactgccgc caggcaaatt ctgttttatc agaccgcttc tgcgttctga tttaatctgt 1920
atcaggctga aaatcttctc tcatccgcca aaacagaagc ttggctgcag gtcgacggat 1980
cccgggcgcg ccaagcatat ggaattctcc aatttttatt aaattagtcg ctacgagatt 2040
taagacgtaa ttttatgcct aactgagaaa gttaagccgc ccactctcac tctcgacatc 2100
ttaaacctga gctaatcgga cgcttgcgcc aactacacct acgggtagtt tttgctccgt 2160
cgtctgctgg aaaaacacga gctggccgca agcatgccag gtaccgcgag ctactcgcga 2220
cggctgaaag caccgaaatg agcgagctat ctggtcgatt ttgacccggt gcccgtcttc 2280
aaaatcggcg aaggccgaag tcggccagaa atagcggcct acttcagacc ttccctagta 2340
aatattttgc accaccgatc atgccgacta cacttaagtg tagttttaat atttaacacc 2400
gtaacctatg gtgaaaattt ccagtcagct ggcgcgagaa tagcataatg aaaataataa 2460
taaataatga tttcccggtc gctaaggtcg gagcggatca aattacgact ctagtaagtg 2520
ccaaagttca tagttgcata tatcggccaa gattgagtat cgcggatgga gccgctccca 2580
gagtatgcct ttacagagcc ccacctggat atgggaaaac cgttgctctt gcgttcgagt 2640
ggctacgcca cagaacagcc ggacgtcctg cagtgtggct ttctttaaga gccagttctt 2700
acagtgaatt tgatatctgc gcagagatta ttgagcagct tgaaactttc gaaatggtaa 2760
aattcagccg tgtgagagag ggtgtgagca agcctgcgct cttgcgagac cttgcatcta 2820
gtctttggca gagcacctcg aataacgaga tagaaacgct agtttgtttg gataatatta 2880
atcatgactt agacttgccg ttgttgcacg cacttatgga gtttatgtta aatacaccaa 2940
aaaatatcag gtttgcagtt gcaggcaata caataaaagg gttctcgcag cttaaacttg 3000
caggcgctat gcgggagtac accgagaaag acttggcctt tagcgcagaa gaggcggtgg 3060
cgttagcgga ggcagagtct gttcttggag ttcctgaaga acagatagag accttggtgc 3120
aagaagttga ggggtggcct gctcttgtag tttttttgtt aaagcgtgag ttgccggcca 3180
agcatatttc agcagtagtt gaagtagaca attactttag ggatgaaata tttgaggcga 3240
ttcccgagcg ctatcgtgtt tttcttgcaa attcttcatt gctcgatttc gtgacgcctg 3300
atcaatacaa ttatgtattc aaatgcgtca atggggtctc atgtattaag tatttaagca 3360
ctaattacat gttgcttcgc catgtgagcg gtgagccagc gcagtttaca ctgcatccag 3420
tactgcgtaa ttttctacga gaaattactt ggactgaaaa tcctgctaaa agatcctacc 3480
tgcttaagcg tgcagctttc tggcattggc gtagaggtga ataccagtat gcaatacgaa 3540
tatccctacg ggcgaatgac tgtcgctggg cagtcagcat gtctgagaga ataattttag 3600
atttgtcatt tcgtcagggc gaaatagatg cgctgagaca gtggctgtta gagctgccga 3660
agcaggcctg gcaccaaaaa cccatagtgc ttattagtta cgcgtgggta ttgtatttca 3720
gtcagcaagg cgcgcgagca gagaagttaa ttaaagacct atcttcacaa tccgataaaa 3780
aaaataaatg gcaagaaaag gaatggctgc agcttgtgct tgcaataggt aaagcaacca 3840
aagatgaaat gctttcgagt gaggagctct gtaataagtg gattagttta tttggggatt 3900
caaacgcagt tggaaaaggg gccgcgctaa cctgtttggc ttttattttt gccagtgagt 3960
atagatttgc agagttggag aaggtgctgg ctcaggccca agccgtgaat aaatttgcaa 4020
aacaaaattt tgcttttggt tggctgtatg tcgcgaggtt tcaacaagcc ctagcaagcg 4080
gaaaaatggg ctgggcgagg cagattataa ctcaagcacg cacagacagt cgcgcgcaga 4140
tgatggaatc cgagtttact tcgaaaatgt ttgacgctct agagcttgag ttacattatg 4200
aattgcgctg cttggacacc tcagaagaaa agctctccaa aattttagag ttcatttcca 4260
atcacggggt gacagacgtg tttttttccg tatgccgtgc tgtgtcagct tggcggcttg 4320
gaaggagtga cctaaatggc tccattgaga tattggagtg ggcgaaggcg catgcggttg 4380
aaaaaaatct accaagattg gaagttatga gccaaattga gatctatcag cgcttagtct 4440
gtcaaggcat aacgggcata aataatttaa aaactcttga agatcataag attttctccg 4500
gacagcactc agccccccta aaagcacgcc tgctgcttgt tcaatcacta gtgctttccc 4560
gagatcggaa ctttcatagt gccgcgcaca gagcgttatt ggctattcag caagcccgta 4620
aaattaacgc gggccagctg gaagtccgtg gattattgtg tttggccgga gcgcaggcag 4680
gtgccggtga tttaaaaaag gctcagctta acattgttta tgcagtggag atagcaaaac 4740
agcttcaatg ctttcaaaca gttcttgatg aagtatgttt aattgagcga ataataccgg 4800
cttcatgtga agccttcaca gcagttaatt tagatcaagc gattggggct tttagtcttc 4860
cgcgaatagt tgagattgga aagtccgcag agaataaagc tgacgcttta ttgacacgga 4920
agcagattgc tgtcttgagg cttgtaaaag aggggtgctc aaacaaacaa atagcaacaa 4980
atatgcatgt caccgaagat gctataaagt ggcacatgag gaaaatattt gccaccttga 5040
atgtagtgaa tcgcacgcaa gcaacaattg aagctgagcg tcaaggaatt atctaaaata 5100
atcggcatta agtgatatag tgaaaagtat actcgagctc atagtccacg acgcccgtga 5160
ttttgtagcc ctggccgacg gccagcaggt aggccgacag gctcatgccg gccgccgccg 5220
ccttttcctc aatcgctctt cgttcgtctg gaaggcagta caccttgata ggtgggctgc 5280
ccttcctggt tggcttggtt tcatcagcca tccgcttgcc ctcatctgtt acgccggcgg 5340
tagccggcca gcctcgcaga gcaggattcc cgttgagcac cgccaggtgc gaataaggga 5400
cagtgaagaa ggaacacccg ctcgcgggtg ggcctacttc acctatcctg cccggctgac 5460
gccgttggat acaccaagga aagtctacac gaaccctttg gcaaaatcct gtatatcgtg 5520
cgaaaaagga tggatatacc gaaaaaatcg ctataatgac cccgaagcag ggttatgcag 5580
cggaaaagcg ctgcttccct gctgttttgt ggaatatcta ccgactggaa acaggcaaat 5640
gcaggaaatt actgaactga ggggacaggc gagagaggat caatggctat ctgggggacc 5700
gagggctgtc gctgcgccaa ggcacgattg gagatcccct atgcggtgtg aaataccgca 5760
cagatgcgta aggagaaaat accgcatcag gcgctcttcc gcttcctcgc tcactgactc 5820
gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg 5880
gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa 5940
ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga 6000
cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag 6060
ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct 6120
taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc atagctcacg 6180
ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc 6240
ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt 6300
aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta 6360
tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaaggac 6420
agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc 6480
ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat 6540
tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc 6600
tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt 6660
cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta 6720
aacttggtct gacagttacc aatcgattgg tcggtcattt cgaaccccag agtcccgctc 6780
agaagaactc gtcaagaagg cgatagaagg cgatgcgctg cgaatcggga gcggcgatac 6840
cgtaaagcac gaggaagcgg tcagcccatt cgccgccaag ctcttcagca atatcacggg 6900
tagccaacgc tatgtcctga tagcggtccg ccacacccag ccggccacag tcgatgaatc 6960
cagaaaagcg gccattttcc accatgatat tcggcaagca ggcatcgcca tgggtcacga 7020
cgagatcctc gccgtcgggc atgcgcgcct tgagcctggc gaacagttcg gctggcgcga 7080
gcccctgatg ctcttcgtcc agatcatcct gatcgacaag accggcttcc atccgagtac 7140
gtgctcgctc gatgcgatgt ttcgcttggt ggtcgaatgg gcaggtagcc ggatcaagcg 7200
tatgcagccg ccgcattgca tcagccatga tggatacttt ctcggcagga gcaaggtgag 7260
atgacaggag atcctgcccc ggcacttcgc ccaatagcag ccagtccctt cccgcttcag 7320
tgacaacgtc gagcacagct gcgcaaggaa cgcccgtcgt ggccagccac gatagccgcg 7380
ctgcctcgtc ctgcagttca ttcagggcac cggacaggtc ggtcttgaca aaaagaaccg 7440
ggcgcccctg cgctgacagc cggaacacgg cggcatcaga gcagccgatt gtctgttgtg 7500
cccagtcata gccgaatagc ctctccaccc aagcggccgg agaacctgcg tgcaatccat 7560
cttgttcaat catgcgaaac gatcctcatc ctgtctcttg atcagatctt gatcccctgc 7620
gccatcagat ccttggcggc aagaaagcca tccagtttac tttgcagggc ttccc 7675
<210> 58
<211> 11401
<212> DNA
<213> Artificial
<220>
<223> Expression vector
<400> 58
tgggaagccc tgcaaagtaa actggatggc tttcttgccg ccaaggatct gatggcgcag 60
gggatcaaga tctgatcaag agacaggatg aggatcgttt cgcatgattg aacaagatgg 120
attgcacgca ggttctccgg ccgcttgggt ggagaggcta ttcggctatg actgggcaca 180
acagacaatc ggctgctctg atgccgccgt gttccggctg tcagcgcagg ggcgcccggt 240
tctttttgtc aagaccgacc tgtccggtgc cctgaatgaa ctgcaggacg aggcagcgcg 300
gctatcgtgg ctggccacga cgggcgttcc ttgcgcagct gtgctcgacg ttgtcactga 360
agcgggaagg gactggctgc tattgggcga agtgccgggg caggatctcc tgtcatctca 420
ccttgctcct gccgagaaag tatccatcat ggctgatgca atgcggcggc tgcatacgct 480
tgatccggct acctgcccat tcgaccacca agcgaaacat cgcatcgagc gagcacgtac 540
tcggatggaa gccggtcttg tcgatcagga tgatctggac gaagagcatc aggggctcgc 600
gccagccgaa ctgttcgcca ggctcaaggc gcgcatgccc gacggcgagg atctcgtcgt 660
gacccatggc gatgcctgct tgccgaatat catggtggaa aatggccgct tttctggatt 720
catcgactgt ggccggctgg gtgtggcgga ccgctatcag gacatagcgt tggctacccg 780
tgatattgct gaagagcttg gcggcgaatg ggctgaccgc ttcctcgtgc tttacggtat 840
cgccgctccc gattcgcagc gcatcgcctt ctatcgcctt cttgacgagt tcttctgagc 900
gggactctgg ggttcgaaat gaccgaccaa tcgattggta actgtcagac caagtttact 960
catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1020
tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1080
cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1140
gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1200
taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc 1260
ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1320
tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1380
ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1440
cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1500
agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1560
gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1620
atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1680
gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1740
gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1800
ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1860
cagtgagcga ggaagcggaa gagcgcctga tgcggtattt tctccttacg catctgtgcg 1920
gtatttcaca ccgcataggg gatctccaat cgtgccttgg cgcagcgaca gccctcggtc 1980
ccccagatag ccattgatcc tctctcgcct gtcccctcag ttcagtaatt tcctgcattt 2040
gcctgtttcc agtcggtaga tattccacaa aacagcaggg aagcagcgct tttccgctgc 2100
ataaccctgc ttcggggtca ttatagcgat tttttcggta tatccatcct ttttcgcacg 2160
atatacagga ttttgccaaa gggttcgtgt agactttcct tggtgtatcc aacggcgtca 2220
gccgggcagg ataggtgaag taggcccacc cgcgagcggg tgttccttct tcactgtccc 2280
ttattcgcac ctggcggtgc tcaacgggaa tcctgctctg cgaggctggc cggctaccgc 2340
cggcgtaaca gatgagggca agcggatggc tgatgaaacc aagccaacca ggaagggcag 2400
cccacctatc aaggtgtact gccttccaga cgaacgaaga gcgattgagg aaaaggcggc 2460
ggcggccggc atgagcctgt cggcctacct gctggccgtc ggccagggct acaaaatcac 2520
gggcgtcgtg gactatgagc tcgagtatac ttttcactat atcacttaat gccgattatt 2580
ttagataatt ccttgacgct cagcttcaat tgttgcttgc gtgcgattca ctacattcaa 2640
ggtggcaaat attttcctca tgtgccactt tatagcatct tcggtgacat gcatatttgt 2700
tgctatttgt ttgtttgagc acccctcttt tacaagcctc aagacagcaa tctgcttccg 2760
tgtcaataaa gcgtcagctt tattctctgc ggactttcca atctcaacta ttcgcggaag 2820
actaaaagcc ccaatcgctt gatctaaatt aactgctgtg aaggcttcac atgaagccgg 2880
tattattcgc tcaattaaac atacttcatc aagaactgtt tgaaagcatt gaagctgttt 2940
tgctatctcc actgcataaa caatgttaag ctgagccttt tttaaatcac cggcacctgc 3000
ctgcgctccg gccaaacaca ataatccacg gacttccagc tggcccgcgt taattttacg 3060
ggcttgctga atagccaata acgctctgtg cgcggcacta tgaaagttcc gatctcggga 3120
aagcactagt gattgaacaa gcagcaggcg tgcttttagg ggggctgagt gctgtccgga 3180
gaaaatctta tgatcttcaa gagtttttaa attatttatg cccgttatgc cttgacagac 3240
taagcgctga tagatctcaa tttggctcat aacttccaat cttggtagat ttttttcaac 3300
cgcatgcgcc ttcgcccact ccaatatctc aatggagcca tttaggtcac tccttccaag 3360
ccgccaagct gacacagcac ggcatacgga aaaaaacacg tctgtcaccc cgtgattgga 3420
aatgaactct aaaattttgg agagcttttc ttctgaggtg tccaagcagc gcaattcata 3480
atgtaactca agctctagag cgtcaaacat tttcgaagta aactcggatt ccatcatctg 3540
cgcgcgactg tctgtgcgtg cttgagttat aatctgcctc gcccagccca tttttccgct 3600
tgctagggct tgttgaaacc tcgcgacata cagccaacca aaagcaaaat tttgttttgc 3660
aaatttattc acggcttggg cctgagccag caccttctcc aactctgcaa atctatactc 3720
actggcaaaa ataaaagcca aacaggttag cgcggcccct tttccaactg cgtttgaatc 3780
cccaaataaa ctaatccact tattacagag ctcctcactc gaaagcattt catctttggt 3840
tgctttacct attgcaagca caagctgcag ccattccttt tcttgccatt tatttttttt 3900
atcggattgt gaagataggt ctttaattaa cttctctgct cgcgcgcctt gctgactgaa 3960
atacaatacc cacgcgtaac taataagcac tatgggtttt tggtgccagg cctgcttcgg 4020
cagctctaac agccactgtc tcagcgcatc tatttcgccc tgacgaaatg acaaatctaa 4080
aattattctc tcagacatgc tgactgccca gcgacagtca ttcgcccgta gggatattcg 4140
tattgcatac tggtattcac ctctacgcca atgccagaaa gctgcacgct taagcaggta 4200
ggatctttta gcaggatttt cagtccaagt aatttctcgt agaaaattac gcagtactgg 4260
atgcagtgta aactgcgctg gctcaccgct cacatggcga agcaacatgt aattagtgct 4320
taaatactta atacatgaga ccccattgac gcatttgaat acataattgt attgatcagg 4380
cgtcacgaaa tcgagcaatg aagaatttgc aagaaaaaca cgatagcgct cgggaatcgc 4440
ctcaaatatt tcatccctaa agtaattgtc tacttcaact actgctgaaa tatgcttggc 4500
cggcaactca cgctttaaca aaaaaactac aagagcaggc cacccctcaa cttcttgcac 4560
caaggtctct atctgttctt caggaactcc aagaacagac tctgcctccg ctaacgccac 4620
cgcctcttct gcgctaaagg ccaagtcttt ctcggtgtac tcccgcatag cgcctgcaag 4680
tttaagctgc gagaaccctt ttattgtatt gcctgcaact gcaaacctga tattttttgg 4740
tgtatttaac ataaactcca taagtgcgtg caacaacggc aagtctaagt catgattaat 4800
attatccaaa caaactagcg tttctatctc gttattcgag gtgctctgcc aaagactaga 4860
tgcaaggtct cgcaagagcg caggcttgct cacaccctct ctcacacggc tgaattttac 4920
catttcgaaa gtttcaagct gctcaataat ctctgcgcag atatcaaatt cactgtaaga 4980
actggctctt aaagaaagcc acactgcagg acgtccggct gttctgtggc gtagccactc 5040
gaacgcaaga gcaacggttt tcccatatcc aggtggggct ctgtaaaggc atactctggg 5100
agcggctcca tccgcgatac tcaatcttgg ccgatatatg caactatgaa ctttggcact 5160
tactagagtc gtaatttgat ccgctccgac cttagcgacc gggaaatcat tatttattat 5220
tattttcatt atgctattct cgcgccagct gactggaaat tttcaccata ggttacggtg 5280
ttaaatatta aaactacact taagtgtagt cggcatgatc ggtggtgcaa aatatttact 5340
agggaaggtc tgaagtaggc cgctatttct ggccgacttc ggccttcgcc gattttgaag 5400
acgggcaccg ggtcaaaatc gaccagatag ctcgctcatt tcggtgcttt cagccgtcgc 5460
gagtagctcg cggtacctgg catgcttgcg gccagctcgt gtttttccag cagacgacgg 5520
agcaaaaact acccgtaggt gtagttggcg caagcgtccg attagctcag gtttaagatg 5580
tcgagagtga gagtgggcgg cttaactttc tcagttaggc ataaaattac gtcttaaatc 5640
tcgtagcgac taatttaata aaaattggag tacagacttt tggtaggaga atgcagctgt 5700
gggaaaaatc acctttattg agaatgataa aactgaacat gtaacagaat ttgaggcagg 5760
tattactttg atgcaagttg ccttagacaa cgccgttccc ggtattgatg gggattgcgg 5820
cggggagtgt gcctgtggta cctgtcacct gattgttcca gaagaatggt tcgataaaac 5880
cgggccgatt aatgatgctg aagaacaaat gttgtccatg acacctgagc gtgcaaaaac 5940
ctctcggttg gggtgtcagg ttaaggccac tgaggcaatg gacggaatga ctgttcaatt 6000
gccagaattt caaatgtaag tgcggagagc gacatgtcaa cgagttcaag tacaagtaat 6060
gacatccagg caaaaataat taacgccaca tccaaagtcg tgccaatgca tctacagatc 6120
aaggcactaa aaaacttgat gaaggtgaag cggaagacca ttggcacttc ccgccctcag 6180
gtgcactttg ttgaaaccga tttgcctgac gtcaatgatt tggcgataga agatatcgat 6240
acgagtaacc cttttttata ccgacaaggt aaggcgaatg cgtactttaa gcggttgcgt 6300
gatgaagcgc cggtgcacta tcagaagaac agtgctttcg ggccgttctg gtcggtaaca 6360
cgctacgaag atatcgtctt cgtggacaag agccatgatt tgttttccgc cgaaccccaa 6420
attatcttgg gtgatcctcc ggaaggcctg tcggttgaaa tgttcatcgc tatggatcct 6480
cccaagcacg acgtacagcg tcgggcagtc cagggtgttg ttgcgcccaa gaacctgaaa 6540
gaaatggaag gactgatccg caagcgcacc ggggacgtac tggatagcct gccgttggac 6600
actccgttca actgggtgcc ggtggtgtcg aaagagctga ccgggcgcat gctagcctca 6660
ctgttagatt tcccgtatga cgaacgcgaa aaactggttg gctggtcgga tcgattgtcc 6720
ggcgcgtcct cggcaaccgg cggcgagttt acgaatgaag atgtgttttt tgatgacgcg 6780
gcagatatgg cgtgggcttt ctccaagctt tggcgtgata aagaagcccg tcaaaaagca 6840
ggtgaagagc cgggtttcga tttgatcagc atgcttcagt ccaatgaaga cacaaaagat 6900
ctgatcaatc gtcctttgga attcattggt aatctcgcgt tgttgattgt tggcggtaat 6960
gacaccacgc gtaactcaat gagcgggggg gtgctggctt taaatcagtt cccagagcaa 7020
ttcgagaagc taaaggcgaa cccaaagctt atccccaata tggtctctga aatcattcgc 7080
tggcaaacgc cgcttgcgta tatgcgccgg gttgccaagc aggatgtgga gctgaacgga 7140
cagaccatca agaagggtga tcgcgtgctg atgtggtatg cgtcgggcaa ccaggatgag 7200
agaaaatttg agaatcctga gcaattcatc atcgaccgca aagatacgcg taaccatgtg 7260
tcgtttggtt atggggttca ccgttgtatg ggcaaccgcc ttgccgaact gcagctgcgt 7320
attctgtggg aagagcttct ccctcgcttt gaaaacatcg aagtgatcgg tgagccggag 7380
cgcgtgcaat cgaactttgt gcggggctat tccaagatga tggttaagtt gacggctaaa 7440
aaataagccc aaggcacaga taaagagaga agcatggaaa acgaaaaaca agatgccact 7500
gtcatcgttg gaggcgggca cgcagcaggt gcgttgatga cagccttgat acaaaagaaa 7560
tatccacacg aggtggttct ggtgggcgaa gagccttatc cgccctacca gcgcccgcct 7620
ttatccaaaa cgtatctgtc aggagaggtt aacgaggaat ctctctattt gaaaccgcgc 7680
tcggtgtatg aaggtgcggg gcatcagttg cgacttggtg tgcgcgttga gaacattgat 7740
cgagacaaca aaacccttac attgtcagat cagagcacac tgaaatatgg ccgactgatt 7800
cttgccacag gttcacacgt taggcgtctt aatgcgcctg gatctgaatt aaaaggcatc 7860
cattatctgc atgacattgc tgatacggat acattgcgcg atcaactgtc accaggtgcc 7920
cgtttggtta ttgtcggtgg cggctacatt ggccttgagg ttgcagccag tgcgagcaag 7980
aaaggcgtta atgttacggt gctggaaggc gctgagcgtc taatgcagcg agttacgggc 8040
gttgagatgt cttcgttcct gtatgctaag cacagtggtt ctggcgtgga cgtgcgtctt 8100
aatactgctg tcaccggctt caaagctgga gatcaggggc gagtggctgg cgtaacgtta 8160
gcaaatggcg aaacggttga cgcagatgtt gtgcttgtct cgattggcgt tatacccgaa 8220
acggctttgg ctgaggctgc cggcctatcc tgtgaagacg gtatcctggt ggacgaatat 8280
gtccgcactt ctgacccaag catcctggcg ataggtgatt gcactcgtca ccgaaacctt 8340
ttcttcgaga aaatgcagag gctcgagtcc gttgctaacg ctgtcgatca agcacgtact 8400
gcggcagcga ccttgatggg agaggataag ccctacgata gcgctccatg gttttggtcg 8460
aatcaatatg atgttcgttt gcaaatggtg gggctctcgc aggaccatga tgaacgagtc 8520
atgcgtggca gcacggaaga caaagcgttt gcggtgttct atctccgtga gggctgtgtg 8580
attgccgttg atgcggtgaa tatgcccatt gcgtttatgg ttggaaagca gttggttcag 8640
caccgtaaga gtattagcgc tgacgtgttg agtgatctgg atgttgaatt aaagtctttg 8700
atctgaagga tctaggaacc aaggagagtg gcatatgagt ttttctaatt ataaagtaat 8760
cgcgatgccg gtgttggttg ctaattttgt tttgggggcg gccactgcat gggcgaatga 8820
aaattatccg gcgaaatctg ctggctataa tcagggtgac tgggtcgcta gcttcaattt 8880
ttctaaggtc tatgtgggtg aggagcttgg cgatctaaat gttggagggg gggctttgcc 8940
aaatgctgat gtaagtattg gtaatgatac aacacttacg tttgatatcg cctattttgt 9000
tagctcaaat atagcggtgg atttttttgt tggggtgcca gctagggcta aatttcaagg 9060
tgagaaatca atctcctcgc tgggaagagt cagtgaagtt gattacggcc ctgcaattct 9120
ttcgcttcaa tatcattacg atagctttga gcgactttat ccatatgttg gggttggtgt 9180
tggtcgggtg ctattttttg ataaaaccga cggtgctttg agttcgtttg atattaagga 9240
taaatgggcg cctgcttttc aggttggcct tagatatgac cttggtaact catggatgct 9300
aaattcagat gtgcgttata ttcctttcaa aacggacgtc acaggtactc ttggcccggt 9360
tcctgtttct actaaaattg aggttgatcc tttcattctc agtcttggtg cgtcatatgt 9420
tttctaatcg acctgcagcc aagcttctgt tttggcggat gagagaagat tttcagcctg 9480
atacagatta aatcagaacg cagaagcggt ctgataaaac agaatttgcc tggcggcagt 9540
agcgcggtgg tcccacctga ccccatgccg aactcagaag tgaaacgccg tagcgccgat 9600
ggtagtgtgg ggtctcccca tgcgagagta gggaactgcc aggcatcaaa taaaacgaaa 9660
ggctcagtcg aaagactggg cctttcgttt tatctgttgt ttgtcggtga acgctctcct 9720
gagtaggaca aatccgccgg gagcggattt gaacgttgcg aagcaacggc ccggagggtg 9780
gcgggcagga cgcccgccat aaactgccag gcatcaaatt aagcagaagg ccatcctgac 9840
ggatggcctt tttgcgtttc tacaaactct tttgtttatt tttctaaata cattcaaata 9900
tgtatccgct catgagacaa taaccctgat aaatgcttca ataatgcagc ctgaaaggca 9960
ggccgggccg tggtggccac ggcctctagg ccagatccag cggcatctgg gttagtcgag 10020
cgcgggccgc ttcccatgtc tcaccagggc gagcctgttt cgcgatctca gcatctgaaa 10080
tcttcccggc cttgcgcttc gctggggcct tacccaccgc cttggcgggc ttcttcggtc 10140
caaaactgaa caacagatgt gtgaccttgc gcccggtctt tcgctgcgcc cactccacct 10200
gtagcgggct gtgctcgttg atctgcgtca cggctggatc aagcactcgc aacttgaagt 10260
ccttgatcga gggataccgg ccttccagtt gaaaccactt tcgcagctgg tcaatttcta 10320
tttcgcgctg gccgatgctg tcccattgca tgagcagctc gtaaagcctg atcgcgtggg 10380
tgctgtccat cttggccacg tcagccaagg cgtatttggt gaactgtttg gtgagttccg 10440
tcaggtacgg cagcatgtct ttggtgaacc tgagttctac acggccctca ccctcccggt 10500
agatgattgt ttgcacccag ccggtaatca tcacactcgg tcttttcccc ttgccattgg 10560
gctcttgggt taaccggact tcccgccgtt tcaggcgcag ggccgcttct ttgagctggt 10620
tgtaggaaga ttcgataggg acacccgcca tcgtcgctat gtcctccgcc gtcactgaat 10680
acatcacttc atcggtgaca ggctcgctcc tcttcacctg gctaatacag gccagaacga 10740
tccgctgttc ctgaacactg aggcgatacg cggcctcgac cagggcattg cttttgtaaa 10800
ccattggggg tgaggccacg ttcgacattc cttgtgtata aggggacact gtatctgcgt 10860
cccacaatac aacaaatccg tccctttaca acaacaaatc cgtcccttct taacaacaaa 10920
tccgtccctt aatggcaaca aatccgtccc tttttaaact ctacaggcca cggattacgt 10980
ggcctgtaga cgtcctaaaa ggtttaaaag ggaaaaggaa gaaaagggtg gaaacgcaaa 11040
aaacgcacca ctacgtggcc ccgttggggc cgcatttgtg cccctgaagg ggcgggggag 11100
gcgtctgggc aatccccgtt ttaccagtcc cctatcgccg cctgagaggg cgcaggaagc 11160
gagtaatcag ggtatcgagg cggattcacc cttggcgtcc aaccagcggc accagcggcg 11220
cctgagaggc gaattgacat aagcctgttc ggttcgtaaa ctgtaatgca agtagcgtat 11280
gcgctcacgc aactggtcca gaaccttgac cgaacgcagc ggtggtaacg gcgcagtggc 11340
ggttttcatg gcttgttatg actgtttttt tgtacagtct atgcctcggg catccaatcg 11400
a 11401
<210> 59
<211> 523
<212> PRT
<213> Artificial
<220>
<223> Polypeptide
<400> 59
Met Ala Thr Gln Glu Ile Ile Asp Ser Val Leu Pro Tyr Leu Thr Lys
1 5 10 15
Trp Tyr Thr Val Ile Thr Ala Ala Val Leu Val Phe Leu Ile Ser Thr
20 25 30
Asn Ile Lys Asn Tyr Val Lys Ala Lys Lys Leu Lys Cys Val Asp Pro
35 40 45
Pro Tyr Leu Lys Asp Ala Gly Leu Thr Gly Ile Ser Ser Leu Ile Ala
50 55 60
Ala Ile Lys Ala Lys Asn Asp Gly Arg Leu Ala Asn Phe Ala Asp Glu
65 70 75 80
Val Phe Asp Glu Tyr Pro Asn His Thr Phe Tyr Leu Ser Val Ala Gly
85 90 95
Ala Leu Lys Ile Val Met Thr Val Asp Pro Glu Asn Ile Lys Ala Val
100 105 110
Leu Ala Thr Gln Phe Thr Asp Phe Ser Leu Gly Thr Arg His Ala His
115 120 125
Phe Ala Pro Leu Leu Gly Asp Gly Ile Phe Thr Leu Asp Gly Glu Gly
130 135 140
Trp Lys His Ser Arg Ala Met Leu Arg Pro Gln Phe Ala Arg Asp Gln
145 150 155 160
Ile Gly His Val Lys Ala Leu Glu Pro His Ile Gln Ile Met Ala Lys
165 170 175
Gln Ile Lys Leu Asn Gln Gly Lys Thr Phe Asp Ile Gln Glu Leu Phe
180 185 190
Phe Arg Phe Thr Val Asp Thr Ala Thr Glu Phe Leu Phe Gly Glu Ser
195 200 205
Val His Ser Leu Tyr Asp Glu Lys Leu Gly Ile Pro Thr Pro Asn Glu
210 215 220
Ile Pro Gly Arg Glu Asn Phe Ala Ala Ala Phe Asn Val Ser Gln His
225 230 235 240
Tyr Leu Ala Thr Arg Ser Tyr Ser Gln Thr Phe Tyr Phe Leu Thr Asn
245 250 255
Pro Lys Glu Phe Arg Asp Cys Asn Ala Lys Val His His Leu Ala Lys
260 265 270
Tyr Phe Val Asn Lys Ala Leu Asn Phe Thr Pro Glu Glu Leu Glu Glu
275 280 285
Lys Ser Lys Ser Gly Tyr Val Phe Leu Tyr Glu Leu Val Lys Gln Thr
290 295 300
Arg Asp Pro Lys Val Leu Gln Asp Gln Leu Leu Asn Ile Met Val Ala
305 310 315 320
Gly Arg Asp Thr Thr Ala Gly Leu Leu Ser Phe Ala Leu Phe Glu Leu
325 330 335
Ala Arg His Pro Glu Met Trp Ser Lys Leu Arg Glu Glu Ile Glu Val
340 345 350
Asn Phe Gly Val Gly Glu Asp Ser Arg Val Glu Glu Ile Thr Phe Glu
355 360 365
Ala Leu Lys Arg Cys Glu Tyr Leu Lys Ala Ile Leu Asn Glu Thr Leu
370 375 380
Arg Met Tyr Pro Ser Val Pro Val Asn Phe Arg Thr Ala Thr Arg Asp
385 390 395 400
Thr Thr Leu Pro Arg Gly Gly Gly Ala Asn Gly Thr Asp Pro Ile Tyr
405 410 415
Ile Pro Lys Gly Ser Thr Val Ala Tyr Val Val Tyr Lys Thr His Arg
420 425 430
Leu Glu Glu Tyr Tyr Gly Lys Asp Ala Asn Asp Phe Arg Pro Glu Arg
435 440 445
Trp Phe Glu Pro Ser Thr Lys Lys Leu Gly Trp Ala Tyr Val Pro Phe
450 455 460
Asn Gly Gly Pro Arg Val Cys Leu Gly Gln Gln Phe Ala Leu Thr Glu
465 470 475 480
Ala Ser Tyr Val Ile Thr Arg Leu Ala Gln Met Phe Glu Thr Val Ser
485 490 495
Ser Asp Pro Gly Leu Glu Tyr Pro Pro Pro Lys Cys Ile His Leu Thr
500 505 510
Met Ser His Asn Asp Gly Val Phe Val Lys Met
515 520
<210> 60
<211> 1572
<212> DNA
<213> Artificial
<220>
<223> Artificial polypeptide for expression in E. coli
<400> 60
atggcgaccc aagaaatcat tgactctgtg ctgccatacc tgacgaagtg gtatactgtc 60
attaccgcgg ctgttctggt cttcctgatc tccactaaca tcaaaaacta cgttaaggct 120
aaaaagctga agtgcgttga tccgccttac ctgaaggacg ctggcctgac cggtatttct 180
tctctgattg cggcaatcaa agccaaaaat gacggccgcc tggccaactt cgccgatgag 240
gtctttgacg agtatccgaa tcacaccttc tacctgtctg ttgccggtgc actgaagatc 300
gttatgactg ttgaccctga aaatatcaag gccgttctgg caactcaatt cacggacttc 360
tctctgggca cgcgtcacgc ccatttcgct ccgctgctgg gtgacggtat ctttacgctg 420
gacggtgagg gctggaaaca ctctcgcgca atgctgcgcc cacaatttgc tcgtgaccag 480
attggtcatg taaaggcgct ggagccgcat atccagatta tggcaaaaca aatcaagctg 540
aatcagggta agaccttcga tattcaagaa ctgttttttc gctttacggt tgataccgcc 600
accgagtttc tgtttggtga atctgttcat tctctgtacg acgagaaact gggcattccg 660
actcctaatg aaatcccagg ccgtgaaaac tttgctgcgg cattcaatgt ctctcaacac 720
tacctggcaa cccgttctta ctctcagacg ttctattttc tgaccaaccc gaaggagttc 780
cgtgattgca acgctaaggt tcatcatctg gcgaaatact ttgtgaacaa agctctgaat 840
ttcaccccgg aggaactgga ggaaaaatct aaaagcggct acgttttcct gtatgagctg 900
gtgaaacaaa cccgtgatcc aaaagttctg caggaccagc tgctgaatat catggtggct 960
ggtcgtgaca ctactgcggg tctgctgtct tttgcgctgt ttgaactggc gcgtcatccg 1020
gaaatgtggt ctaaactgcg tgaggaaatc gaggttaact tcggtgttgg cgaggattct 1080
cgtgtggaag agatcacctt tgaagcgctg aaacgctgtg agtatctgaa agcgatcctg 1140
aacgaaacgc tgcgtatgta cccaagcgtt cctgttaatt tccgtaccgc gacgcgcgac 1200
accaccctgc cgcgtggtgg tggcgcgaat ggcacggacc caatctacat tcctaaaggc 1260
tctaccgttg cttacgttgt ttacaagacg caccgtctgg aagaatatta cggtaaagac 1320
gccaacgact ttcgtccgga acgttggttc gagccttcta ccaagaagct gggttgggcg 1380
tacgtgccgt tcaacggtgg tccgcgtgtt tgcctgggcc agcagttcgc tctgacggaa 1440
gcctcctacg tcatcactcg tctggcgcag atgttcgaaa ccgttagctc cgaccctggt 1500
ctggagtacc cgcctccgaa atgcatccac ctgactatgt ctcacaacga tggtgttttc 1560
gtgaagatgt aa 1572
<210> 61
<211> 517
<212> PRT
<213> Artificial
<220>
<223> Isolated polypeptide
<400> 61
Met Ile Glu Gln Leu Leu Glu Tyr Trp Tyr Val Val Val Pro Val Leu
1 5 10 15
Tyr Ile Ile Lys Gln Leu Leu Ala Tyr Thr Lys Thr Arg Val Leu Met
20 25 30
Lys Lys Leu Gly Ala Ala Pro Val Thr Asn Lys Leu Tyr Asp Asn Ala
35 40 45
Phe Gly Ile Val Asn Gly Trp Lys Ala Leu Gln Phe Lys Lys Glu Gly
50 55 60
Arg Ala Gln Glu Tyr Asn Asp Tyr Lys Phe Asp His Ser Lys Asn Pro
65 70 75 80
Ser Val Gly Thr Tyr Val Ser Ile Leu Phe Gly Thr Arg Ile Val Val
85 90 95
Thr Lys Asp Pro Glu Asn Ile Lys Ala Ile Leu Ala Thr Gln Phe Gly
100 105 110
Asp Phe Ser Leu Gly Lys Arg His Thr Leu Phe Lys Pro Leu Leu Gly
115 120 125
Asp Gly Ile Phe Thr Leu Asp Gly Glu Gly Trp Lys His Ser Arg Ala
130 135 140
Met Leu Arg Pro Gln Phe Ala Arg Glu Gln Val Ala His Val Thr Ser
145 150 155 160
Leu Glu Pro His Phe Gln Leu Leu Lys Lys His Ile Leu Lys His Lys
165 170 175
Gly Glu Tyr Phe Asp Ile Gln Glu Leu Phe Phe Arg Phe Thr Val Asp
180 185 190
Ser Ala Thr Glu Phe Leu Phe Gly Glu Ser Val His Ser Leu Lys Asp
195 200 205
Glu Ser Ile Gly Ile Asn Gln Asp Asp Ile Asp Phe Ala Gly Arg Lys
210 215 220
Asp Phe Ala Glu Ser Phe Asn Lys Ala Gln Glu Tyr Leu Ala Ile Arg
225 230 235 240
Thr Leu Val Gln Thr Phe Tyr Trp Leu Val Asn Asn Lys Glu Phe Arg
245 250 255
Asp Cys Thr Lys Ser Val His Lys Phe Thr Asn Tyr Tyr Val Gln Lys
260 265 270
Ala Leu Asp Ala Ser Pro Glu Glu Leu Glu Lys Gln Ser Gly Tyr Val
275 280 285
Phe Leu Tyr Glu Leu Val Lys Gln Thr Arg Asp Pro Asn Val Leu Arg
290 295 300
Asp Gln Ser Leu Asn Ile Leu Leu Ala Gly Arg Asp Thr Thr Ala Gly
305 310 315 320
Leu Leu Ser Phe Ala Val Phe Glu Leu Ala Arg His Pro Glu Ile Trp
325 330 335
Ala Lys Leu Arg Glu Glu Ile Glu Gln Gln Phe Gly Leu Gly Glu Asp
340 345 350
Ser Arg Val Glu Glu Ile Thr Phe Glu Ser Leu Lys Arg Cys Glu Tyr
355 360 365
Leu Lys Ala Phe Leu Asn Glu Thr Leu Arg Ile Tyr Pro Ser Val Pro
370 375 380
Arg Asn Phe Arg Ile Ala Thr Lys Asn Thr Thr Leu Pro Arg Gly Gly
385 390 395 400
Gly Ser Asp Gly Thr Ser Pro Ile Leu Ile Gln Lys Gly Glu Ala Val
405 410 415
Ser Tyr Gly Ile Asn Ser Thr His Leu Asp Pro Val Tyr Tyr Gly Pro
420 425 430
Asp Ala Ala Glu Phe Arg Pro Glu Arg Trp Phe Glu Pro Ser Thr Lys
435 440 445
Lys Leu Gly Trp Ala Tyr Leu Pro Phe Asn Gly Gly Pro Arg Ile Cys
450 455 460
Leu Gly Gln Gln Phe Ala Leu Thr Glu Ala Gly Tyr Val Leu Val Arg
465 470 475 480
Leu Val Gln Glu Phe Ser His Val Arg Ser Asp Pro Asp Glu Val Tyr
485 490 495
Pro Pro Lys Arg Leu Thr Asn Leu Thr Met Cys Leu Gln Asp Gly Ala
500 505 510
Ile Val Lys Phe Asp
515
<210> 62
<211> 1554
<212> DNA
<213> Artificial
<220>
<223> Isolated nucleic acid
<400> 62
atgatcgaac aactgctgga atactggtac gttgttgttc cggttctgta catcatcaaa 60
cagctgctgg cgtacaccaa aacccgcgtt ctgatgaaaa aactgggtgc ggcgccggtt 120
accaacaaac tgtacgacaa cgcgttcggt atcgttaacg gttggaaagc gctgcagttc 180
aaaaaagaag gtcgtgcgca ggaatacaac gactacaaat tcgaccactc taaaaacccg 240
tctgttggta cctacgtttc tatcctgttc ggtacccgta tcgttgttac caaagacccg 300
gaaaacatca aagcgatcct ggcgacccag ttcggtgact tctctctggg taaacgtcac 360
accctgttca agccgctgct gggtgacggt atcttcaccc tggacggtga aggttggaaa 420
cactctcgcg cgatgctgcg tccgcagttc gcgcgtgaac aggttgcgca cgttacctct 480
ctggaaccgc acttccagct gctgaaaaag cacatcctga aacacaaagg tgaatacttc 540
gacatccagg aactgttctt ccgtttcacc gttgactctg cgaccgaatt cctgttcggt 600
gaatctgttc actctctgaa agacgaatct atcggtatca accaggacga catcgacttc 660
gcgggtcgta aagacttcgc ggaatctttc aataaagcgc aggaatacct ggcgatccgt 720
accctggttc agaccttcta ctggctggtt aacaacaaag aattccgtga ctgcaccaaa 780
tctgttcaca aattcaccaa ctactacgtt cagaaagcgc tggacgcgtc tccggaagaa 840
ctggaaaaac agtctggtta cgtattcctg tacgaactgg ttaaacagac ccgtgacccg 900
aacgttctgc gtgaccagtc tctgaacatc ctgctggcgg gtcgtgacac caccgcgggt 960
ctgctgtctt tcgcggtttt cgaactggcg cgtcacccgg aaatctgggc gaaactgcgt 1020
gaagaaatcg aacagcagtt cggtctgggt gaagactctc gtgttgaaga aatcaccttc 1080
gaatctctga aacgttgcga atacctgaaa gcgttcctga acgaaaccct gcgtatctac 1140
ccgtctgttc cgcgtaactt ccgtatcgcg accaaaaaca ccaccctgcc gcgtggtggt 1200
ggttctgacg gtacctctcc gatcctgatc cagaaaggtg aagcggtttc ttacggtatc 1260
aactctaccc acctggaccc ggtttactac ggtccggacg cggcggaatt ccgtccggaa 1320
cgttggttcg aaccgtctac caagaaactg ggttgggcgt acctgccgtt caacggtggt 1380
ccgcgtatct gcctgggtca gcagttcgcg ctgaccgaag cgggttacgt tctggttcgt 1440
ctggttcagg aattctctca cgttcgctct gacccggacg aagtttaccc gccgaaacgt 1500
ctgaccaacc tgaccatgtg cctgcaggac ggtgcgatcg ttaagttcga ctag 1554
<210> 63
<211> 680
<212> PRT
<213> Artificial
<220>
<223> Artificial polypeptide
<400> 63
Met Ala Leu Asp Lys Leu Asp Leu Tyr Val Ile Ile Thr Leu Val Val
1 5 10 15
Ala Ile Ala Ala Tyr Phe Ala Lys Asn Gln Phe Leu Asp Gln Gln Gln
20 25 30
Asp Thr Gly Phe Leu Asn Thr Asp Ser Gly Asp Gly Asn Ser Arg Asp
35 40 45
Ile Leu Gln Ala Leu Lys Lys Asn Asn Lys Asn Thr Leu Leu Leu Phe
50 55 60
Gly Ser Gln Thr Gly Thr Ala Glu Asp Tyr Ala Asn Lys Leu Ser Arg
65 70 75 80
Glu Leu His Ser Arg Phe Gly Leu Lys Thr Met Val Ala Asp Phe Ala
85 90 95
Asp Tyr Asp Phe Glu Asn Phe Gly Asp Ile Thr Glu Asp Ile Leu Val
100 105 110
Phe Phe Ile Val Ala Thr Tyr Gly Glu Gly Glu Pro Thr Asp Asn Ala
115 120 125
Asp Glu Phe His Thr Trp Leu Thr Glu Glu Ala Asp Thr Leu Ser Thr
130 135 140
Leu Lys Tyr Thr Val Phe Gly Leu Gly Asn Ser Thr Tyr Glu Phe Phe
145 150 155 160
Asn Ala Ile Gly Arg Lys Phe Asp Arg Leu Leu Gly Glu Lys Gly Gly
165 170 175
Asp Arg Phe Ala Glu Tyr Gly Glu Gly Asp Asp Gly Thr Gly Thr Leu
180 185 190
Asp Glu Asp Phe Leu Ala Trp Lys Asp Asn Val Phe Asp Ser Leu Lys
195 200 205
Asn Asp Leu Asn Phe Glu Glu Lys Glu Leu Lys Tyr Glu Pro Asn Val
210 215 220
Lys Leu Thr Glu Arg Asp Asp Leu Ser Gly Asn Asp Pro Asp Val Ser
225 230 235 240
Leu Gly Glu Pro Asn Val Lys Tyr Ile Lys Ser Glu Gly Val Asp Leu
245 250 255
Thr Lys Gly Pro Phe Asp His Thr His Pro Phe Leu Ala Arg Ile Val
260 265 270
Lys Thr Lys Glu Leu Phe Thr Ser Glu Asp Arg His Cys Val His Val
275 280 285
Glu Phe Asp Ile Ser Glu Ser Asn Leu Lys Tyr Thr Thr Gly Asp His
290 295 300
Leu Ala Ile Trp Pro Ser Asn Ser Asp Glu Asn Ile Lys Gln Phe Ala
305 310 315 320
Lys Cys Phe Gly Leu Glu Asp Lys Leu Asp Thr Val Ile Glu Leu Lys
325 330 335
Ala Leu Asp Ser Thr Tyr Ser Ile Pro Phe Pro Asn Pro Ile Thr Tyr
340 345 350
Gly Ala Val Ile Arg His His Leu Glu Ile Ser Gly Pro Val Ser Arg
355 360 365
Gln Phe Phe Leu Ser Ile Ala Gly Phe Ala Pro Asp Glu Glu Thr Lys
370 375 380
Lys Ser Phe Thr Arg Ile Gly Gly Asp Lys Gln Glu Phe Ala Ser Lys
385 390 395 400
Val Thr Arg Arg Lys Phe Asn Ile Ala Asp Ala Leu Leu Phe Ala Ser
405 410 415
Asn Asn Arg Pro Trp Ser Asp Val Pro Phe Glu Phe Leu Ile Glu Asn
420 425 430
Val Gln His Leu Thr Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Ser Leu
435 440 445
Ser Glu Lys Gln Thr Ile Asn Val Thr Ala Val Val Glu Ala Glu Glu
450 455 460
Glu Ala Asp Gly Arg Pro Val Thr Gly Val Val Thr Asn Leu Leu Lys
465 470 475 480
Asn Ile Glu Ile Glu Gln Asn Lys Thr Gly Glu Thr Pro Met Val His
485 490 495
Tyr Asp Leu Asn Gly Pro Arg Gly Lys Phe Ser Lys Phe Arg Leu Pro
500 505 510
Val His Val Arg Arg Ser Asn Phe Lys Leu Pro Lys Asn Ser Thr Thr
515 520 525
Pro Val Ile Leu Ile Gly Pro Gly Thr Gly Val Ala Pro Leu Arg Gly
530 535 540
Phe Val Arg Glu Arg Val Gln Gln Val Lys Asn Gly Val Asn Val Gly
545 550 555 560
Lys Thr Val Leu Phe Tyr Gly Cys Arg Asn Ser Glu Gln Asp Phe Leu
565 570 575
Tyr Lys Gln Glu Trp Ser Glu Tyr Ala Ser Val Leu Gly Glu Asn Phe
580 585 590
Glu Met Phe Asn Ala Phe Ser Arg Gln Asp Pro Thr Lys Lys Val Tyr
595 600 605
Val Gln Asp Lys Ile Leu Glu Asn Ser Ala Leu Val Asp Glu Leu Leu
610 615 620
Ser Ser Gly Ala Ile Ile Tyr Val Cys Gly Asp Ala Ser Arg Met Ala
625 630 635 640
Arg Asp Val Gln Ala Ala Ile Ala Lys Ile Val Ala Lys Ser Arg Asp
645 650 655
Ile His Glu Asp Lys Ala Ala Glu Leu Val Lys Ser Trp Lys Val Gln
660 665 670
Asn Arg Tyr Gln Glu Asp Val Trp
675 680
<210> 64
<211> 2043
<212> DNA
<213> Artificial
<220>
<223> Artificial nucleic acid
<400> 64
atggcgctgg acaagctgga cctgtacgtt atcatcaccc tggttgttgc catcgcggcg 60
tacttcgcaa agaatcagtt cctggatcag cagcaggaca ccggcttcct gaacaccgac 120
tctggcgacg gcaattcccg tgatattctg caggcgctga agaaaaataa caaaaatacc 180
ctgctgctgt tcggttctca gaccggtacc gcggaagact acgcgaataa actgtctcgt 240
gaactgcact ctcgttttgg cctgaagacc atggttgcgg acttcgccga ctacgacttc 300
gaaaatttcg gtgacattac ggaagatatc ctggttttct tcatcgttgc gacctacggc 360
gaaggtgaac cgaccgacaa cgcggacgag ttccacacct ggctgaccga agaggcggat 420
accctgtcta ccctgaagta caccgtcttc ggcctgggta attctactta tgaattcttc 480
aacgccatcg gtcgtaaatt cgaccgtctg ctgggtgaaa aaggtggcga tcgtttcgcg 540
gagtacggtg agggtgacga cggtacgggt accctggacg aggactttct ggcgtggaaa 600
gacaatgttt tcgactccct gaaaaacgat ctgaactttg aggaaaaaga actgaaatac 660
gagccgaacg ttaaactgac cgaacgtgac gacctgtctg gtaatgaccc ggacgtttct 720
ctgggcgagc cgaatgttaa gtacatcaaa tctgaaggtg ttgacctgac gaaaggtccg 780
ttcgaccaca cccacccgtt cctggcacgt atcgttaaaa ccaaagagct gtttacctct 840
gaagaccgtc actgcgttca tgtcgaattc gatatctctg aatctaacct gaaatacacc 900
acgggtgacc acctggcgat ctggccgtct aattctgacg aaaatatcaa acagttcgcg 960
aaatgcttcg gtctggaaga caaactggat accgttatcg aactgaaagc gctggactct 1020
acgtactcca tcccgttccc gaacccgatc acctatggtg cggttatccg ccaccacctg 1080
gaaatctctg gtccggtttc tcgccagttc ttcctgtcta tcgcgggttt cgcgcctgac 1140
gaagagacga aaaaatcctt cacccgtatc ggtggtgaca aacaggaatt tgcgtctaag 1200
gttacccgtc gtaaattcaa catcgctgac gcgctgctgt ttgcgagcaa caatcgtccg 1260
tggtccgacg tgccattcga gtttctgatc gagaatgttc agcacctgac cccgcgttac 1320
tattctatct cctctagctc tctgtctgaa aaacagacga tcaacgttac cgctgttgta 1380
gaagcagaag aggaggcgga tggtcgtccg gttaccggtg ttgttaccaa tctgctgaaa 1440
aacatcgaaa tcgaacaaaa caaaaccggt gaaaccccga tggttcatta cgatctgaat 1500
ggtccgcgtg gtaagttctc taaattccgc ctgccggttc acgttcgtcg ttctaacttc 1560
aaactgccga aaaactccac taccccggtt attctgattg gtccgggtac tggtgttgcg 1620
ccgctgcgtg gtttcgttcg tgagcgtgtg caacaggtta aaaacggcgt taacgtcggt 1680
aagaccgtac tgttctacgg ttgccgtaac agcgaacagg actttctgta caaacaggaa 1740
tggtctgaat acgcatctgt actgggtgag aatttcgaga tgttcaacgc gttctctcgt 1800
caggacccga ccaaaaaagt ttacgttcaa gataaaatcc tggaaaactc tgcgctggtt 1860
gacgaactgc tgtcttctgg tgcaatcatt tacgtttgcg gtgatgcgtc ccgtatggcg 1920
cgtgacgttc aggcggcaat cgctaaaatc gttgctaaaa gccgtgatat ccatgaagat 1980
aaagcggcgg aactggttaa atcttggaaa gttcagaacc gctaccagga agacgtttgg 2040
taa 2043
<210> 65
<211> 12064
<212> DNA
<213> Artificial
<220>
<223> Expression vector
<400> 65
atcgattgga tgcccgaggc atagactgta caaaaaaaca gtcataacaa gccatgaaaa 60
ccgccactgc gccgttacca ccgctgcgtt cggtcaaggt tctggaccag ttgcgtgagc 120
gcatacgcta cttgcattac agtttacgaa ccgaacaggc ttatgtcaat tcgcctctca 180
ggcgccgctg gtgccgctgg ttggacgcca agggtgaatc cgcctcgata ccctgattac 240
tcgcttcctg cgccctctca ggcggcgata ggggactggt aaaacgggga ttgcccagac 300
gcctcccccg ccccttcagg ggcacaaatg cggccccaac ggggccacgt agtggtgcgt 360
tttttgcgtt tccacccttt tcttcctttt cccttttaaa ccttttagga cgtctacagg 420
ccacgtaatc cgtggcctgt agagtttaaa aagggacgga tttgttgcca ttaagggacg 480
gatttgttgt taagaaggga cggatttgtt gttgtaaagg gacggatttg ttgtattgtg 540
ggacgcagat acagtgtccc cttatacaca aggaatgtcg aacgtggcct cacccccaat 600
ggtttacaaa agcaatgccc tggtcgaggc cgcgtatcgc ctcagtgttc aggaacagcg 660
gatcgttctg gcctgtatta gccaggtgaa gaggagcgag cctgtcaccg atgaagtgat 720
gtattcagtg acggcggagg acatagcgac gatggcgggt gtccctatcg aatcttccta 780
caaccagctc aaagaagcgg ccctgcgcct gaaacggcgg gaagtccggt taacccaaga 840
gcccaatggc aaggggaaaa gaccgagtgt gatgattacc ggctgggtgc aaacaatcat 900
ctaccgggag ggtgagggcc gtgtagaact caggttcacc aaagacatgc tgccgtacct 960
gacggaactc accaaacagt tcaccaaata cgccttggct gacgtggcca agatggacag 1020
cacccacgcg atcaggcttt acgagctgct catgcaatgg gacagcatcg gccagcgcga 1080
aatagaaatt gaccagctgc gaaagtggtt tcaactggaa ggccggtatc cctcgatcaa 1140
ggacttcaag ttgcgagtgc ttgatccagc cgtgacgcag atcaacgagc acagcccgct 1200
acaggtggag tgggcgcagc gaaagaccgg gcgcaaggtc acacatctgt tgttcagttt 1260
tggaccgaag aagcccgcca aggcggtggg taaggcccca gcgaagcgca aggccgggaa 1320
gatttcagat gctgagatcg cgaaacaggc tcgccctggt gagacatggg aagcggcccg 1380
cgctcgacta acccagatgc cgctggatct ggcctagagg ccgtggccac cacggcccgg 1440
cctgcctttc aggctgcatt attgaagcat ttatcagggt tattgtctca tgagcggata 1500
catatttgaa tgtatttaga aaaataaaca aaagagtttg tagaaacgca aaaaggccat 1560
ccgtcaggat ggccttctgc ttaatttgat gcctggcagt ttatggcggg cgtcctgccc 1620
gccaccctcc gggccgttgc ttcgcaacgt tcaaatccgc tcccggcgga tttgtcctac 1680
tcaggagagc gttcaccgac aaacaacaga taaaacgaaa ggcccagtct ttcgactgag 1740
cctttcgttt tatttgatgc ctggcagttc cctactctcg catggggaga ccccacacta 1800
ccatcggcgc tacggcgttt cacttctgag ttcggcatgg ggtcaggtgg gaccaccgcg 1860
ctactgccgc caggcaaatt ctgttttatc agaccgcttc tgcgttctga tttaatctgt 1920
atcaggctga aaatcttctc tcatccgcca aaacagaagc ttggctgcag gtcgacagaa 1980
gcttggctgc aggtcgatta gaaaacatat gacgcaccaa gactgagaat gaaaggatca 2040
acctcaattt tagtagaaac aggaaccggg ccaagagtac ctgtgacgtc cgttttgaaa 2100
ggaatataac gcacatctga atttagcatc catgagttac caaggtcata tctaaggcca 2160
acctgaaaag caggcgccca tttatcctta atatcaaacg aactcaaagc accgtcggtt 2220
ttatcaaaaa atagcacccg accaacacca accccaacat atggataaag tcgctcaaag 2280
ctatcgtaat gatattgaag cgaaagaatt gcagggccgt aatcaacttc actgactctt 2340
cccagcgagg agattgattt ctcaccttga aatttagccc tagctggcac cccaacaaaa 2400
aaatccaccg ctatatttga gctaacaaaa taggcgatat caaacgtaag tgttgtatca 2460
ttaccaatac ttacatcagc atttggcaaa gccccccctc caacatttag atcgccaagc 2520
tcctcaccca catagacctt agaaaaattg aagctagcga cccagtcacc ctgattatag 2580
ccagcagatt tcgccggata attttcattc gcccatgcag tggccgcccc caaaacaaaa 2640
ttagcaacca acaccggcat cgcgattact ttataattag aaaaactcat atgccactct 2700
ccttggttcc tagatccttt accaaacgtc ttcctggtag cggttctgaa ctttccaaga 2760
tttaaccagt tccgccgctt tatcttcatg gatatcacgg cttttagcaa cgattttagc 2820
gattgccgcc tgaacgtcac gcgccatacg ggacgcatca ccgcaaacgt aaatgattgc 2880
accagaagac agcagttcgt caaccagcgc agagttttcc aggattttat cttgaacgta 2940
aacttttttg gtcgggtcct gacgagagaa cgcgttgaac atctcgaaat tctcacccag 3000
tacagatgcg tattcagacc attcctgttt gtacagaaag tcctgttcgc tgttacggca 3060
accgtagaac agtacggtct taccgacgtt aacgccgttt ttaacctgtt gcacacgctc 3120
acgaacgaaa ccacgcagcg gcgcaacacc agtacccgga ccaatcagaa taaccggggt 3180
agtggagttt ttcggcagtt tgaagttaga acgacgaacg tgaaccggca ggcggaattt 3240
agagaactta ccacgcggac cattcagatc gtaatgaacc atcggggttt caccggtttt 3300
gttttgttcg atttcgatgt ttttcagcag attggtaaca acaccggtaa ccggacgacc 3360
atccgcctcc tcttctgctt ctacaacagc ggtaacgttg atcgtctgtt tttcagacag 3420
agagctagag gagatagaat agtaacgcgg ggtcaggtgc tgaacattct cgatcagaaa 3480
ctcgaatggc acgtcggacc acggacgatt gttgctcgca aacagcagcg cgtcagcgat 3540
gttgaattta cgacgggtaa ccttagacgc aaattcctgt ttgtcaccac cgatacgggt 3600
gaaggatttt ttcgtctctt cgtcaggcgc gaaacccgcg atagacagga agaactggcg 3660
agaaaccgga ccagagattt ccaggtggtg gcggataacc gcaccatagg tgatcgggtt 3720
cgggaacggg atggagtacg tagagtccag cgctttcagt tcgataacgg tatccagttt 3780
gtcttccaga ccgaagcatt tcgcgaactg tttgatattt tcgtcagaat tagacggcca 3840
gatcgccagg tggtcacccg tggtgtattt caggttagat tcagagatat cgaattcgac 3900
atgaacgcag tgacggtctt cagaggtaaa cagctctttg gttttaacga tacgtgccag 3960
gaacgggtgg gtgtggtcga acggaccttt cgtcaggtca acaccttcag atttgatgta 4020
cttaacattc ggctcgccca gagaaacgtc cgggtcatta ccagacaggt cgtcacgttc 4080
ggtcagttta acgttcggct cgtatttcag ttctttttcc tcaaagttca gatcgttttt 4140
cagggagtcg aaaacattgt ctttccacgc cagaaagtcc tcgtccaggg tacccgtacc 4200
gtcgtcaccc tcaccgtact ccgcgaaacg atcgccacct ttttcaccca gcagacggtc 4260
gaatttacga ccgatggcgt tgaagaattc ataagtagaa ttacccaggc cgaagacggt 4320
gtacttcagg gtagacaggg tatccgcctc ttcggtcagc caggtgtgga actcgtccgc 4380
gttgtcggtc ggttcacctt cgccgtaggt cgcaacgatg aagaaaacca ggatatcttc 4440
cgtaatgtca ccgaaatttt cgaagtcgta gtcggcgaag tccgcaacca tggtcttcag 4500
gccaaaacga gagtgcagtt cacgagacag tttattcgcg tagtcttccg cggtaccggt 4560
ctgagaaccg aacagcagca gggtattttt gttatttttc ttcagcgcct gcagaatatc 4620
acgggaattg ccgtcgccag agtcggtgtt caggaagccg gtgtcctgct gctgatccag 4680
gaactgattc tttgcgaagt acgccgcgat ggcaacaacc agggtgatga taacgtacag 4740
gtccagcttg tccagcgcca tgcttctctc tttatctgtg ccttgggctt acatcttcac 4800
gaaaacacca tcgttgtgag acatagtcag gtggatgcat ttcggaggcg ggtactccag 4860
accagggtcg gagctaacgg tttcgaacat ctgcgccaga cgagtgatga cgtaggaggc 4920
ttccgtcaga gcgaactgct ggcccaggca aacacgcgga ccaccgttga acggcacgta 4980
cgcccaaccc agcttcttgg tagaaggctc gaaccaacgt tccggacgaa agtcgttggc 5040
gtctttaccg taatattctt ccagacggtg cgtcttgtaa acaacgtaag caacggtaga 5100
gcctttagga atgtagattg ggtccgtgcc attcgcgcca ccaccacgcg gcagggtggt 5160
gtcgcgcgtc gcggtacgga aattaacagg aacgcttggg tacatacgca gcgtttcgtt 5220
caggatcgct ttcagatact cacagcgttt cagcgcttca aaggtgatct cttccacacg 5280
agaatcctcg ccaacaccga agttaacctc gatttcctca cgcagtttag accacatttc 5340
cggatgacgc gccagttcaa acagcgcaaa agacagcaga cccgcagtag tgtcacgacc 5400
agccaccatg atattcagca gctggtcctg cagaactttt ggatcacggg tttgtttcac 5460
cagctcatac aggaaaacgt agccgctttt agatttttcc tccagttcct ccggggtgaa 5520
attcagagct ttgttcacaa agtatttcgc cagatgatga accttagcgt tgcaatcacg 5580
gaactccttc gggttggtca gaaaatagaa cgtctgagag taagaacggg ttgccaggta 5640
gtgttgagag acattgaatg ccgcagcaaa gttttcacgg cctgggattt cattaggagt 5700
cggaatgccc agtttctcgt cgtacagaga atgaacagat tcaccaaaca gaaactcggt 5760
ggcggtatca accgtaaagc gaaaaaacag ttcttgaata tcgaaggtct taccctgatt 5820
cagcttgatt tgttttgcca taatctggat atgcggctcc agcgccttta catgaccaat 5880
ctggtcacga gcaaattgtg ggcgcagcat tgcgcgagag tgtttccagc cctcaccgtc 5940
cagcgtaaag ataccgtcac ccagcagcgg agcgaaatgg gcgtgacgcg tgcccagaga 6000
gaagtccgtg aattgagttg ccagaacggc cttgatattt tcagggtcaa cagtcataac 6060
gatcttcagt gcaccggcaa cagacaggta gaaggtgtga ttcggatact cgtcaaagac 6120
ctcatcggcg aagttggcca ggcggccgtc atttttggct ttgattgccg caatcagaga 6180
agaaataccg gtcaggccag cgtccttcag gtaaggcgga tcaacgcact tcagcttttt 6240
agccttaacg tagtttttga tgttagtgga gatcaggaag accagaacag ccgcggtaat 6300
gacagtatac cacttcgtca ggtatggcag cacagagtca atgatttctt gggtcgccat 6360
gtcgctctcc gcaggcgcgc caagcatatg gaattctcca atttttatta aattagtcgc 6420
tacgagattt aagacgtaat tttatgccta actgagaaag ttaagccgcc cactctcact 6480
ctcgacatct taaacctgag ctaatcggac gcttgcgcca actacaccta cgggtagttt 6540
ttgctccgtc gtctgctgga aaaacacgag ctggccgcaa gcatgccagg taccgcgagc 6600
tactcgcgac ggctgaaagc accgaaatga gcgagctatc tggtcgattt tgacccggtg 6660
cccgtcttca aaatcggcga aggccgaagt cggccagaaa tagcggccta cttcagacct 6720
tccctagtaa atattttgca ccaccgatca tgccgactac acttaagtgt agttttaata 6780
tttaacaccg taacctatgg tgaaaatttc cagtcagctg gcgcgagaat agcataatga 6840
aaataataat aaataatgat ttcccggtcg ctaaggtcgg agcggatcaa attacgactc 6900
tagtaagtgc caaagttcat agttgcatat atcggccaag attgagtatc gcggatggag 6960
ccgctcccag agtatgcctt tacagagccc cacctggata tgggaaaacc gttgctcttg 7020
cgttcgagtg gctacgccac agaacagccg gacgtcctgc agtgtggctt tctttaagag 7080
ccagttctta cagtgaattt gatatctgcg cagagattat tgagcagctt gaaactttcg 7140
aaatggtaaa attcagccgt gtgagagagg gtgtgagcaa gcctgcgctc ttgcgagacc 7200
ttgcatctag tctttggcag agcacctcga ataacgagat agaaacgcta gtttgtttgg 7260
ataatattaa tcatgactta gacttgccgt tgttgcacgc acttatggag tttatgttaa 7320
atacaccaaa aaatatcagg tttgcagttg caggcaatac aataaaaggg ttctcgcagc 7380
ttaaacttgc aggcgctatg cgggagtaca ccgagaaaga cttggccttt agcgcagaag 7440
aggcggtggc gttagcggag gcagagtctg ttcttggagt tcctgaagaa cagatagaga 7500
ccttggtgca agaagttgag gggtggcctg ctcttgtagt ttttttgtta aagcgtgagt 7560
tgccggccaa gcatatttca gcagtagttg aagtagacaa ttactttagg gatgaaatat 7620
ttgaggcgat tcccgagcgc tatcgtgttt ttcttgcaaa ttcttcattg ctcgatttcg 7680
tgacgcctga tcaatacaat tatgtattca aatgcgtcaa tggggtctca tgtattaagt 7740
atttaagcac taattacatg ttgcttcgcc atgtgagcgg tgagccagcg cagtttacac 7800
tgcatccagt actgcgtaat tttctacgag aaattacttg gactgaaaat cctgctaaaa 7860
gatcctacct gcttaagcgt gcagctttct ggcattggcg tagaggtgaa taccagtatg 7920
caatacgaat atccctacgg gcgaatgact gtcgctgggc agtcagcatg tctgagagaa 7980
taattttaga tttgtcattt cgtcagggcg aaatagatgc gctgagacag tggctgttag 8040
agctgccgaa gcaggcctgg caccaaaaac ccatagtgct tattagttac gcgtgggtat 8100
tgtatttcag tcagcaaggc gcgcgagcag agaagttaat taaagaccta tcttcacaat 8160
ccgataaaaa aaataaatgg caagaaaagg aatggctgca gcttgtgctt gcaataggta 8220
aagcaaccaa agatgaaatg ctttcgagtg aggagctctg taataagtgg attagtttat 8280
ttggggattc aaacgcagtt ggaaaagggg ccgcgctaac ctgtttggct tttatttttg 8340
ccagtgagta tagatttgca gagttggaga aggtgctggc tcaggcccaa gccgtgaata 8400
aatttgcaaa acaaaatttt gcttttggtt ggctgtatgt cgcgaggttt caacaagccc 8460
tagcaagcgg aaaaatgggc tgggcgaggc agattataac tcaagcacgc acagacagtc 8520
gcgcgcagat gatggaatcc gagtttactt cgaaaatgtt tgacgctcta gagcttgagt 8580
tacattatga attgcgctgc ttggacacct cagaagaaaa gctctccaaa attttagagt 8640
tcatttccaa tcacggggtg acagacgtgt ttttttccgt atgccgtgct gtgtcagctt 8700
ggcggcttgg aaggagtgac ctaaatggct ccattgagat attggagtgg gcgaaggcgc 8760
atgcggttga aaaaaatcta ccaagattgg aagttatgag ccaaattgag atctatcagc 8820
gcttagtctg tcaaggcata acgggcataa ataatttaaa aactcttgaa gatcataaga 8880
ttttctccgg acagcactca gcccccctaa aagcacgcct gctgcttgtt caatcactag 8940
tgctttcccg agatcggaac tttcatagtg ccgcgcacag agcgttattg gctattcagc 9000
aagcccgtaa aattaacgcg ggccagctgg aagtccgtgg attattgtgt ttggccggag 9060
cgcaggcagg tgccggtgat ttaaaaaagg ctcagcttaa cattgtttat gcagtggaga 9120
tagcaaaaca gcttcaatgc tttcaaacag ttcttgatga agtatgttta attgagcgaa 9180
taataccggc ttcatgtgaa gccttcacag cagttaattt agatcaagcg attggggctt 9240
ttagtcttcc gcgaatagtt gagattggaa agtccgcaga gaataaagct gacgctttat 9300
tgacacggaa gcagattgct gtcttgaggc ttgtaaaaga ggggtgctca aacaaacaaa 9360
tagcaacaaa tatgcatgtc accgaagatg ctataaagtg gcacatgagg aaaatatttg 9420
ccaccttgaa tgtagtgaat cgcacgcaag caacaattga agctgagcgt caaggaatta 9480
tctaaaataa tcggcattaa gtgatatagt gaaaagtata ctcgagctca tagtccacga 9540
cgcccgtgat tttgtagccc tggccgacgg ccagcaggta ggccgacagg ctcatgccgg 9600
ccgccgccgc cttttcctca atcgctcttc gttcgtctgg aaggcagtac accttgatag 9660
gtgggctgcc cttcctggtt ggcttggttt catcagccat ccgcttgccc tcatctgtta 9720
cgccggcggt agccggccag cctcgcagag caggattccc gttgagcacc gccaggtgcg 9780
aataagggac agtgaagaag gaacacccgc tcgcgggtgg gcctacttca cctatcctgc 9840
ccggctgacg ccgttggata caccaaggaa agtctacacg aaccctttgg caaaatcctg 9900
tatatcgtgc gaaaaaggat ggatataccg aaaaaatcgc tataatgacc ccgaagcagg 9960
gttatgcagc ggaaaagcgc tgcttccctg ctgttttgtg gaatatctac cgactggaaa 10020
caggcaaatg caggaaatta ctgaactgag gggacaggcg agagaggatc aatggctatc 10080
tgggggaccg agggctgtcg ctgcgccaag gcacgattgg agatccccta tgcggtgtga 10140
aataccgcac agatgcgtaa ggagaaaata ccgcatcagg cgctcttccg cttcctcgct 10200
cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 10260
ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 10320
ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg 10380
cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg 10440
actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 10500
cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca 10560
tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt 10620
gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc 10680
caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag 10740
agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 10800
tagaaggaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt 10860
tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 10920
gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg 10980
gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa 11040
aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat 11100
atatgagtaa acttggtctg acagttacca atcgattggt cggtcatttc gaaccccaga 11160
gtcccgctca gaagaactcg tcaagaaggc gatagaaggc gatgcgctgc gaatcgggag 11220
cggcgatacc gtaaagcacg aggaagcggt cagcccattc gccgccaagc tcttcagcaa 11280
tatcacgggt agccaacgct atgtcctgat agcggtccgc cacacccagc cggccacagt 11340
cgatgaatcc agaaaagcgg ccattttcca ccatgatatt cggcaagcag gcatcgccat 11400
gggtcacgac gagatcctcg ccgtcgggca tgcgcgcctt gagcctggcg aacagttcgg 11460
ctggcgcgag cccctgatgc tcttcgtcca gatcatcctg atcgacaaga ccggcttcca 11520
tccgagtacg tgctcgctcg atgcgatgtt tcgcttggtg gtcgaatggg caggtagccg 11580
gatcaagcgt atgcagccgc cgcattgcat cagccatgat ggatactttc tcggcaggag 11640
caaggtgaga tgacaggaga tcctgccccg gcacttcgcc caatagcagc cagtcccttc 11700
ccgcttcagt gacaacgtcg agcacagctg cgcaaggaac gcccgtcgtg gccagccacg 11760
atagccgcgc tgcctcgtcc tgcagttcat tcagggcacc ggacaggtcg gtcttgacaa 11820
aaagaaccgg gcgcccctgc gctgacagcc ggaacacggc ggcatcagag cagccgattg 11880
tctgttgtgc ccagtcatag ccgaatagcc tctccaccca agcggccgga gaacctgcgt 11940
gcaatccatc ttgttcaatc atgcgaaacg atcctcatcc tgtctcttga tcagatcttg 12000
atcccctgcg ccatcagatc cttggcggca agaaagccat ccagtttact ttgcagggct 12060
tccc 12064
<210> 66
<211> 12046
<212> DNA
<213> Artificial
<220>
<223> Expression vector
<400> 66
atcgattgga tgcccgaggc atagactgta caaaaaaaca gtcataacaa gccatgaaaa 60
ccgccactgc gccgttacca ccgctgcgtt cggtcaaggt tctggaccag ttgcgtgagc 120
gcatacgcta cttgcattac agtttacgaa ccgaacaggc ttatgtcaat tcgcctctca 180
ggcgccgctg gtgccgctgg ttggacgcca agggtgaatc cgcctcgata ccctgattac 240
tcgcttcctg cgccctctca ggcggcgata ggggactggt aaaacgggga ttgcccagac 300
gcctcccccg ccccttcagg ggcacaaatg cggccccaac ggggccacgt agtggtgcgt 360
tttttgcgtt tccacccttt tcttcctttt cccttttaaa ccttttagga cgtctacagg 420
ccacgtaatc cgtggcctgt agagtttaaa aagggacgga tttgttgcca ttaagggacg 480
gatttgttgt taagaaggga cggatttgtt gttgtaaagg gacggatttg ttgtattgtg 540
ggacgcagat acagtgtccc cttatacaca aggaatgtcg aacgtggcct cacccccaat 600
ggtttacaaa agcaatgccc tggtcgaggc cgcgtatcgc ctcagtgttc aggaacagcg 660
gatcgttctg gcctgtatta gccaggtgaa gaggagcgag cctgtcaccg atgaagtgat 720
gtattcagtg acggcggagg acatagcgac gatggcgggt gtccctatcg aatcttccta 780
caaccagctc aaagaagcgg ccctgcgcct gaaacggcgg gaagtccggt taacccaaga 840
gcccaatggc aaggggaaaa gaccgagtgt gatgattacc ggctgggtgc aaacaatcat 900
ctaccgggag ggtgagggcc gtgtagaact caggttcacc aaagacatgc tgccgtacct 960
gacggaactc accaaacagt tcaccaaata cgccttggct gacgtggcca agatggacag 1020
cacccacgcg atcaggcttt acgagctgct catgcaatgg gacagcatcg gccagcgcga 1080
aatagaaatt gaccagctgc gaaagtggtt tcaactggaa ggccggtatc cctcgatcaa 1140
ggacttcaag ttgcgagtgc ttgatccagc cgtgacgcag atcaacgagc acagcccgct 1200
acaggtggag tgggcgcagc gaaagaccgg gcgcaaggtc acacatctgt tgttcagttt 1260
tggaccgaag aagcccgcca aggcggtggg taaggcccca gcgaagcgca aggccgggaa 1320
gatttcagat gctgagatcg cgaaacaggc tcgccctggt gagacatggg aagcggcccg 1380
cgctcgacta acccagatgc cgctggatct ggcctagagg ccgtggccac cacggcccgg 1440
cctgcctttc aggctgcatt attgaagcat ttatcagggt tattgtctca tgagcggata 1500
catatttgaa tgtatttaga aaaataaaca aaagagtttg tagaaacgca aaaaggccat 1560
ccgtcaggat ggccttctgc ttaatttgat gcctggcagt ttatggcggg cgtcctgccc 1620
gccaccctcc gggccgttgc ttcgcaacgt tcaaatccgc tcccggcgga tttgtcctac 1680
tcaggagagc gttcaccgac aaacaacaga taaaacgaaa ggcccagtct ttcgactgag 1740
cctttcgttt tatttgatgc ctggcagttc cctactctcg catggggaga ccccacacta 1800
ccatcggcgc tacggcgttt cacttctgag ttcggcatgg ggtcaggtgg gaccaccgcg 1860
ctactgccgc caggcaaatt ctgttttatc agaccgcttc tgcgttctga tttaatctgt 1920
atcaggctga aaatcttctc tcatccgcca aaacagaagc ttggctgcag gtcgacagaa 1980
gcttggctgc aggtcgatta gaaaacatat gacgcaccaa gactgagaat gaaaggatca 2040
acctcaattt tagtagaaac aggaaccggg ccaagagtac ctgtgacgtc cgttttgaaa 2100
ggaatataac gcacatctga atttagcatc catgagttac caaggtcata tctaaggcca 2160
acctgaaaag caggcgccca tttatcctta atatcaaacg aactcaaagc accgtcggtt 2220
ttatcaaaaa atagcacccg accaacacca accccaacat atggataaag tcgctcaaag 2280
ctatcgtaat gatattgaag cgaaagaatt gcagggccgt aatcaacttc actgactctt 2340
cccagcgagg agattgattt ctcaccttga aatttagccc tagctggcac cccaacaaaa 2400
aaatccaccg ctatatttga gctaacaaaa taggcgatat caaacgtaag tgttgtatca 2460
ttaccaatac ttacatcagc atttggcaaa gccccccctc caacatttag atcgccaagc 2520
tcctcaccca catagacctt agaaaaattg aagctagcga cccagtcacc ctgattatag 2580
ccagcagatt tcgccggata attttcattc gcccatgcag tggccgcccc caaaacaaaa 2640
ttagcaacca acaccggcat cgcgattact ttataattag aaaaactcat atgccactct 2700
ccttggttcc tagatccttt accaaacgtc ttcctggtag cggttctgaa ctttccaaga 2760
tttaaccagt tccgccgctt tatcttcatg gatatcacgg cttttagcaa cgattttagc 2820
gattgccgcc tgaacgtcac gcgccatacg ggacgcatca ccgcaaacgt aaatgattgc 2880
accagaagac agcagttcgt caaccagcgc agagttttcc aggattttat cttgaacgta 2940
aacttttttg gtcgggtcct gacgagagaa cgcgttgaac atctcgaaat tctcacccag 3000
tacagatgcg tattcagacc attcctgttt gtacagaaag tcctgttcgc tgttacggca 3060
accgtagaac agtacggtct taccgacgtt aacgccgttt ttaacctgtt gcacacgctc 3120
acgaacgaaa ccacgcagcg gcgcaacacc agtacccgga ccaatcagaa taaccggggt 3180
agtggagttt ttcggcagtt tgaagttaga acgacgaacg tgaaccggca ggcggaattt 3240
agagaactta ccacgcggac cattcagatc gtaatgaacc atcggggttt caccggtttt 3300
gttttgttcg atttcgatgt ttttcagcag attggtaaca acaccggtaa ccggacgacc 3360
atccgcctcc tcttctgctt ctacaacagc ggtaacgttg atcgtctgtt tttcagacag 3420
agagctagag gagatagaat agtaacgcgg ggtcaggtgc tgaacattct cgatcagaaa 3480
ctcgaatggc acgtcggacc acggacgatt gttgctcgca aacagcagcg cgtcagcgat 3540
gttgaattta cgacgggtaa ccttagacgc aaattcctgt ttgtcaccac cgatacgggt 3600
gaaggatttt ttcgtctctt cgtcaggcgc gaaacccgcg atagacagga agaactggcg 3660
agaaaccgga ccagagattt ccaggtggtg gcggataacc gcaccatagg tgatcgggtt 3720
cgggaacggg atggagtacg tagagtccag cgctttcagt tcgataacgg tatccagttt 3780
gtcttccaga ccgaagcatt tcgcgaactg tttgatattt tcgtcagaat tagacggcca 3840
gatcgccagg tggtcacccg tggtgtattt caggttagat tcagagatat cgaattcgac 3900
atgaacgcag tgacggtctt cagaggtaaa cagctctttg gttttaacga tacgtgccag 3960
gaacgggtgg gtgtggtcga acggaccttt cgtcaggtca acaccttcag atttgatgta 4020
cttaacattc ggctcgccca gagaaacgtc cgggtcatta ccagacaggt cgtcacgttc 4080
ggtcagttta acgttcggct cgtatttcag ttctttttcc tcaaagttca gatcgttttt 4140
cagggagtcg aaaacattgt ctttccacgc cagaaagtcc tcgtccaggg tacccgtacc 4200
gtcgtcaccc tcaccgtact ccgcgaaacg atcgccacct ttttcaccca gcagacggtc 4260
gaatttacga ccgatggcgt tgaagaattc ataagtagaa ttacccaggc cgaagacggt 4320
gtacttcagg gtagacaggg tatccgcctc ttcggtcagc caggtgtgga actcgtccgc 4380
gttgtcggtc ggttcacctt cgccgtaggt cgcaacgatg aagaaaacca ggatatcttc 4440
cgtaatgtca ccgaaatttt cgaagtcgta gtcggcgaag tccgcaacca tggtcttcag 4500
gccaaaacga gagtgcagtt cacgagacag tttattcgcg tagtcttccg cggtaccggt 4560
ctgagaaccg aacagcagca gggtattttt gttatttttc ttcagcgcct gcagaatatc 4620
acgggaattg ccgtcgccag agtcggtgtt caggaagccg gtgtcctgct gctgatccag 4680
gaactgattc tttgcgaagt acgccgcgat ggcaacaacc agggtgatga taacgtacag 4740
gtccagcttg tccagcgcca tgcttctctc tttatctgtg ccttgggcct agtcgaactt 4800
aacgatcgca ccgtcctgca ggcacatggt caggttggtc agacgtttcg gcgggtaaac 4860
ttcgtccggg tcagagcgaa cgtgagagaa ttcctgaacc agacgaacca gaacgtaacc 4920
cgcttcggtc agcgcgaact gctgacccag gcagatacgc ggaccaccgt tgaacggcag 4980
gtacgcccaa cccagtttct tggtagacgg ttcgaaccaa cgttccggac ggaattccgc 5040
cgcgtccgga ccgtagtaaa ccgggtccag gtgggtagag ttgataccgt aagaaaccgc 5100
ttcacctttc tggatcagga tcggagaggt accgtcagaa ccaccaccac gcggcagggt 5160
ggtgtttttg gtcgcgatac ggaagttacg cggaacagac gggtagatac gcagggtttc 5220
gttcaggaac gctttcaggt attcgcaacg tttcagagat tcgaaggtga tttcttcaac 5280
acgagagtct tcacccagac cgaactgctg ttcgatttct tcacgcagtt tcgcccagat 5340
ttccgggtga cgcgccagtt cgaaaaccgc gaaagacagc agacccgcgg tggtgtcacg 5400
acccgccagc aggatgttca gagactggtc acgcagaacg ttcgggtcac gggtctgttt 5460
aaccagttcg tacaggaata cgtaaccaga ctgtttttcc agttcttccg gagacgcgtc 5520
cagcgctttc tgaacgtagt agttggtgaa tttgtgaaca gatttggtgc agtcacggaa 5580
ttctttgttg ttaaccagcc agtagaaggt ctgaaccagg gtacggatcg ccaggtattc 5640
ctgcgcttta ttgaaagatt ccgcgaagtc tttacgaccc gcgaagtcga tgtcgtcctg 5700
gttgataccg atagattcgt ctttcagaga gtgaacagat tcaccgaaca ggaattcggt 5760
cgcagagtca acggtgaaac ggaagaacag ttcctggatg tcgaagtatt cacctttgtg 5820
tttcaggatg tgctttttca gcagctggaa gtgcggttcc agagaggtaa cgtgcgcaac 5880
ctgttcacgc gcgaactgcg gacgcagcat cgcgcgagag tgtttccaac cttcaccgtc 5940
cagggtgaag ataccgtcac ccagcagcgg cttgaacagg gtgtgacgtt tacccagaga 6000
gaagtcaccg aactgggtcg ccaggatcgc tttgatgttt tccgggtctt tggtaacaac 6060
gatacgggta ccgaacagga tagaaacgta ggtaccaaca gacgggtttt tagagtggtc 6120
gaatttgtag tcgttgtatt cctgcgcacg accttctttt ttgaactgca gcgctttcca 6180
accgttaacg ataccgaacg cgttgtcgta cagtttgttg gtaaccggcg ccgcacccag 6240
ttttttcatc agaacgcggg ttttggtgta cgccagcagc tgtttgatga tgtacagaac 6300
cggaacaaca acgtaccagt attccagcag ttgttcgatc atgtcgctct ccgcaggcgc 6360
gccaagcata tggaattctc caatttttat taaattagtc gctacgagat ttaagacgta 6420
attttatgcc taactgagaa agttaagccg cccactctca ctctcgacat cttaaacctg 6480
agctaatcgg acgcttgcgc caactacacc tacgggtagt ttttgctccg tcgtctgctg 6540
gaaaaacacg agctggccgc aagcatgcca ggtaccgcga gctactcgcg acggctgaaa 6600
gcaccgaaat gagcgagcta tctggtcgat tttgacccgg tgcccgtctt caaaatcggc 6660
gaaggccgaa gtcggccaga aatagcggcc tacttcagac cttccctagt aaatattttg 6720
caccaccgat catgccgact acacttaagt gtagttttaa tatttaacac cgtaacctat 6780
ggtgaaaatt tccagtcagc tggcgcgaga atagcataat gaaaataata ataaataatg 6840
atttcccggt cgctaaggtc ggagcggatc aaattacgac tctagtaagt gccaaagttc 6900
atagttgcat atatcggcca agattgagta tcgcggatgg agccgctccc agagtatgcc 6960
tttacagagc cccacctgga tatgggaaaa ccgttgctct tgcgttcgag tggctacgcc 7020
acagaacagc cggacgtcct gcagtgtggc tttctttaag agccagttct tacagtgaat 7080
ttgatatctg cgcagagatt attgagcagc ttgaaacttt cgaaatggta aaattcagcc 7140
gtgtgagaga gggtgtgagc aagcctgcgc tcttgcgaga ccttgcatct agtctttggc 7200
agagcacctc gaataacgag atagaaacgc tagtttgttt ggataatatt aatcatgact 7260
tagacttgcc gttgttgcac gcacttatgg agtttatgtt aaatacacca aaaaatatca 7320
ggtttgcagt tgcaggcaat acaataaaag ggttctcgca gcttaaactt gcaggcgcta 7380
tgcgggagta caccgagaaa gacttggcct ttagcgcaga agaggcggtg gcgttagcgg 7440
aggcagagtc tgttcttgga gttcctgaag aacagataga gaccttggtg caagaagttg 7500
aggggtggcc tgctcttgta gtttttttgt taaagcgtga gttgccggcc aagcatattt 7560
cagcagtagt tgaagtagac aattacttta gggatgaaat atttgaggcg attcccgagc 7620
gctatcgtgt ttttcttgca aattcttcat tgctcgattt cgtgacgcct gatcaataca 7680
attatgtatt caaatgcgtc aatggggtct catgtattaa gtatttaagc actaattaca 7740
tgttgcttcg ccatgtgagc ggtgagccag cgcagtttac actgcatcca gtactgcgta 7800
attttctacg agaaattact tggactgaaa atcctgctaa aagatcctac ctgcttaagc 7860
gtgcagcttt ctggcattgg cgtagaggtg aataccagta tgcaatacga atatccctac 7920
gggcgaatga ctgtcgctgg gcagtcagca tgtctgagag aataatttta gatttgtcat 7980
ttcgtcaggg cgaaatagat gcgctgagac agtggctgtt agagctgccg aagcaggcct 8040
ggcaccaaaa acccatagtg cttattagtt acgcgtgggt attgtatttc agtcagcaag 8100
gcgcgcgagc agagaagtta attaaagacc tatcttcaca atccgataaa aaaaataaat 8160
ggcaagaaaa ggaatggctg cagcttgtgc ttgcaatagg taaagcaacc aaagatgaaa 8220
tgctttcgag tgaggagctc tgtaataagt ggattagttt atttggggat tcaaacgcag 8280
ttggaaaagg ggccgcgcta acctgtttgg cttttatttt tgccagtgag tatagatttg 8340
cagagttgga gaaggtgctg gctcaggccc aagccgtgaa taaatttgca aaacaaaatt 8400
ttgcttttgg ttggctgtat gtcgcgaggt ttcaacaagc cctagcaagc ggaaaaatgg 8460
gctgggcgag gcagattata actcaagcac gcacagacag tcgcgcgcag atgatggaat 8520
ccgagtttac ttcgaaaatg tttgacgctc tagagcttga gttacattat gaattgcgct 8580
gcttggacac ctcagaagaa aagctctcca aaattttaga gttcatttcc aatcacgggg 8640
tgacagacgt gtttttttcc gtatgccgtg ctgtgtcagc ttggcggctt ggaaggagtg 8700
acctaaatgg ctccattgag atattggagt gggcgaaggc gcatgcggtt gaaaaaaatc 8760
taccaagatt ggaagttatg agccaaattg agatctatca gcgcttagtc tgtcaaggca 8820
taacgggcat aaataattta aaaactcttg aagatcataa gattttctcc ggacagcact 8880
cagcccccct aaaagcacgc ctgctgcttg ttcaatcact agtgctttcc cgagatcgga 8940
actttcatag tgccgcgcac agagcgttat tggctattca gcaagcccgt aaaattaacg 9000
cgggccagct ggaagtccgt ggattattgt gtttggccgg agcgcaggca ggtgccggtg 9060
atttaaaaaa ggctcagctt aacattgttt atgcagtgga gatagcaaaa cagcttcaat 9120
gctttcaaac agttcttgat gaagtatgtt taattgagcg aataataccg gcttcatgtg 9180
aagccttcac agcagttaat ttagatcaag cgattggggc ttttagtctt ccgcgaatag 9240
ttgagattgg aaagtccgca gagaataaag ctgacgcttt attgacacgg aagcagattg 9300
ctgtcttgag gcttgtaaaa gaggggtgct caaacaaaca aatagcaaca aatatgcatg 9360
tcaccgaaga tgctataaag tggcacatga ggaaaatatt tgccaccttg aatgtagtga 9420
atcgcacgca agcaacaatt gaagctgagc gtcaaggaat tatctaaaat aatcggcatt 9480
aagtgatata gtgaaaagta tactcgagct catagtccac gacgcccgtg attttgtagc 9540
cctggccgac ggccagcagg taggccgaca ggctcatgcc ggccgccgcc gccttttcct 9600
caatcgctct tcgttcgtct ggaaggcagt acaccttgat aggtgggctg cccttcctgg 9660
ttggcttggt ttcatcagcc atccgcttgc cctcatctgt tacgccggcg gtagccggcc 9720
agcctcgcag agcaggattc ccgttgagca ccgccaggtg cgaataaggg acagtgaaga 9780
aggaacaccc gctcgcgggt gggcctactt cacctatcct gcccggctga cgccgttgga 9840
tacaccaagg aaagtctaca cgaacccttt ggcaaaatcc tgtatatcgt gcgaaaaagg 9900
atggatatac cgaaaaaatc gctataatga ccccgaagca gggttatgca gcggaaaagc 9960
gctgcttccc tgctgttttg tggaatatct accgactgga aacaggcaaa tgcaggaaat 10020
tactgaactg aggggacagg cgagagagga tcaatggcta tctgggggac cgagggctgt 10080
cgctgcgcca aggcacgatt ggagatcccc tatgcggtgt gaaataccgc acagatgcgt 10140
aaggagaaaa taccgcatca ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 10200
ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 10260
agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 10320
ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 10380
caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 10440
gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 10500
cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta 10560
tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 10620
gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 10680
cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 10740
tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 10800
tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 10860
caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 10920
aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 10980
cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 11040
ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 11100
tgacagttac caatcgattg gtcggtcatt tcgaacccca gagtcccgct cagaagaact 11160
cgtcaagaag gcgatagaag gcgatgcgct gcgaatcggg agcggcgata ccgtaaagca 11220
cgaggaagcg gtcagcccat tcgccgccaa gctcttcagc aatatcacgg gtagccaacg 11280
ctatgtcctg atagcggtcc gccacaccca gccggccaca gtcgatgaat ccagaaaagc 11340
ggccattttc caccatgata ttcggcaagc aggcatcgcc atgggtcacg acgagatcct 11400
cgccgtcggg catgcgcgcc ttgagcctgg cgaacagttc ggctggcgcg agcccctgat 11460
gctcttcgtc cagatcatcc tgatcgacaa gaccggcttc catccgagta cgtgctcgct 11520
cgatgcgatg tttcgcttgg tggtcgaatg ggcaggtagc cggatcaagc gtatgcagcc 11580
gccgcattgc atcagccatg atggatactt tctcggcagg agcaaggtga gatgacagga 11640
gatcctgccc cggcacttcg cccaatagca gccagtccct tcccgcttca gtgacaacgt 11700
cgagcacagc tgcgcaagga acgcccgtcg tggccagcca cgatagccgc gctgcctcgt 11760
cctgcagttc attcagggca ccggacaggt cggtcttgac aaaaagaacc gggcgcccct 11820
gcgctgacag ccggaacacg gcggcatcag agcagccgat tgtctgttgt gcccagtcat 11880
agccgaatag cctctccacc caagcggccg gagaacctgc gtgcaatcca tcttgttcaa 11940
tcatgcgaaa cgatcctcat cctgtctctt gatcagatct tgatcccctg cgccatcaga 12000
tccttggcgg caagaaagcc atccagttta ctttgcaggg cttccc 12046