CN113528550A - 草欧菌素的生物合成基因簇及其应用 - Google Patents

草欧菌素的生物合成基因簇及其应用 Download PDF

Info

Publication number
CN113528550A
CN113528550A CN202110637064.2A CN202110637064A CN113528550A CN 113528550 A CN113528550 A CN 113528550A CN 202110637064 A CN202110637064 A CN 202110637064A CN 113528550 A CN113528550 A CN 113528550A
Authority
CN
China
Prior art keywords
leu
ala
gly
val
arg
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110637064.2A
Other languages
English (en)
Other versions
CN113528550B (zh
Inventor
徐孙德
陈云
马忠华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University ZJU
Original Assignee
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University ZJU filed Critical Zhejiang University ZJU
Priority to CN202110637064.2A priority Critical patent/CN113528550B/zh
Publication of CN113528550A publication Critical patent/CN113528550A/zh
Application granted granted Critical
Publication of CN113528550B publication Critical patent/CN113528550B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/195Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K7/00Peptides having 5 to 20 amino acids in a fully defined sequence; Derivatives thereof
    • C07K7/04Linear peptides containing only normal peptide links
    • C07K7/06Linear peptides containing only normal peptide links having 5 to 11 amino acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P21/00Preparation of peptides or proteins
    • C12P21/005Glycopeptides, glycoproteins
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P21/00Preparation of peptides or proteins
    • C12P21/02Preparation of peptides or proteins having a known sequence of two or more amino acids, e.g. glutathione

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Biotechnology (AREA)
  • Medicinal Chemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • General Chemical & Material Sciences (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Saccharide Compounds (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

本发明公开了草欧菌素的生物合成基因簇,其至少包括7个基因:非核糖体肽合成酶基因ACBA基因,ACBB基因,ACBC基因,ACBD基因和ACBH基因;与糖基转移相关基因ACBI基因;以及与氧化还原相关基因ACBE基因。本发明通过获取非核糖体基因,预测生物合成基因簇和验证预测结果,最终获得准确的草欧菌素的生物合成基因簇,可用于制备草欧菌素。

Description

草欧菌素的生物合成基因簇及其应用
技术领域
本发明涉及微生物基因工程技术领域,尤其涉及草欧菌素的生物合成基因簇及其应用。
背景技术
草欧菌素对植物病原真菌有很好的防治效果,其包括两种类型,分别为A型和B型,结构式如下所示:
Figure BDA0003106185870000011
A型草欧菌素带有糖基,而B型草欧菌素不具有糖基。虽然,这类抗真菌是一类已知的化合物,并且这类脂肽类化合物具有良好的活性效果。但是这类化合物的基因簇却是一直处于未知状态。探索这类化合物对应的基因簇,对这位这类化合物的改造起到了关键的数据支持。
非核糖体肽合成酶NRPs由多个模块(module)按特定的空间顺序排列而成,每个模块负责将一个氨基酸整合到产物的骨架中,肽链延伸的一个反应循环过程:1.A结构域从底物池中选择结合特定的氨基酸,在ATP的作用下合成相应的氨酰-AMP而使氨基酸底物得到活化;2.氨酰-AMP与T结构域上的辅因子磷酸泛酰巯基乙胺的巯基相结合,形成复合体之后,在缩合结构域也就是C结构域下形成新的肽键,产生延伸了一个氨基酸的新的肽酰-S-载体复合体和游离的载体,在沿着顺序添加氨基酸之后,在最后一个模块上存在TE结构域,在这个结构域的作用下,产物被释放出来,形成线性肽链或者环状肽链。
发明内容
本发明的目的在于提供了一种草欧菌素的生物合成基因簇以及该生物合成基因簇在制备草欧菌素中的应用。
具体技术方案如下:
本发明提供了草欧菌素的生物合成基因簇,所述草欧菌素的生物合成基因簇至少至少包括7个基因,分别为:
非核糖体肽合成酶基因,即:ACBA基因,ACBB基因,ACBC基因,ACBD基因和ACBH基因;
所述ACBA基因编码1890个氨基酸,氨基酸序列如SEQ ID NO.2所示;
所述ACBB基因编码291个氨基酸,氨基酸序列如SEQ ID NO.3所示;
所述ACBC基因编码2499个氨基酸,氨基酸序列如SEQ ID NO.4所示;
所述ACBD基因编码3861个氨基酸,氨基酸序列如SEQ ID NO.5所示;
所述ACBH基因编码2799个氨基酸,氨基酸序列如SEQ ID NO.9所示;
与糖基转移相关基因,即:ACBI基因,其编码246个氨基酸,氨基酸序列如SEQ IDNO.10所示;
与氧化还原相关基因,即:ACBE基因,其编码254个氨基酸,氨基酸序列如SEQ IDNO.6所示。
草欧菌素分为A型和B型,其差别是A型草欧菌素带有糖基,而B型草欧菌素不具有糖基,本发明提供的生物合成基因簇中缺失了ACBI之后,就可以全部合成B型草欧菌素。
ACBA基因,ACBB基因,ACBC基因,ACBD基因和ACBH基因编码的5个非核糖体肽合成酶包含若干模块或结构域,即腺苷腺化结构域(A结构域)、肽酰基载体蛋白结构域(PCP/T结构域)、缩合结构域(C结构域),差向异构结构域(E),N-甲基化结构域(M)。
SEQ ID NO.1中有5个非核糖体肽合成酶基因(ACBA-D,ACBH),核苷酸互补序列及其氨基酸序列,是草欧菌素A合成所必须的;其中,包含8个模块,28个结构域如图2所示。ACBA包含2个模块:加载模块域中,A1、T1和C1负责草欧菌素A的起始合成,催化一个苏氨酸和一个长链脂肪酸作为起始单位,模块2含有A2、C2、T2结构域,负责引入一个苏氨酸。ACBB包括模块3,含有C3结构域。ACBC包括模块4和5,模块4含有A4-T4-C4-E4结构域,负责引入苏氨酸。模块5含有A5-T5-C5-E5结构域,负责引入亮氨酸。ACBD包含模块6,7,8。模块6含有A6-T6-C6-E6结构域,负责引入甘氨酸;模块7,含有A7-T7-C7-E7结构域,负责引入谷氨酰胺。ACBH包括模块9,含有A8-T8-C8结构域,负责引入甘氨酸单位。ACBH包含模块9,10。模块9,含有A9-T9-C9结构域,负责引入苏氨酸。模块10,含有A10-T10-C10-TE10,负责引入精氨酸,并在硫酯酶(TE)参与下完成碳链的环化及释放。最后在ACBI糖基转移酶的作用下,在长链脂肪酸部分加入葡萄糖分子。
进一步地,所述ACBA基因的核苷酸序列如SEQ ID NO.1中第3010-9258位所示;所述ACBB基因的核苷酸序列如SEQ ID NO.1中第9317-10588位所示;所述ACBC基因的核苷酸序列如SEQ ID NO.1中第10713-18212位所示;所述ACBD基因的核苷酸序列如SEQ ID NO.1中第18209-29794位所示;所述ACBH基因的核苷酸序列如SEQ ID NO.1中第32535-40934位所示;所述ACBI基因的核苷酸序列如SEQ ID NO.1中第40934-41674位所示;所述ACBE基因的其核苷酸序列如SEQ ID NO.1中第29791-30555位所示。
进一步地,所述草欧菌素的生物合成基因簇还包括ACBF基因,其编码70个氨基酸,氨基酸序列如SEQ ID NO.7所示,其核苷酸序列如SEQ ID NO.1中第30602-30814位所示;该基因具有辅助氨基酸腺苷化的作用,参与草欧菌素的生物合成能够有效提高草欧菌素的合成效率。
进一步地,所述草欧菌素的生物合成基因簇还包括ACBJ基因,其编码的氨基酸序列如SEQ ID NO.11所示,其核苷酸序列如SEQ ID NO.1中第41794-42483位所示;该基因具有对错误合成具有修正的作用,参与草欧菌素的生物合成可有效提高草欧菌素的合成效率。
进一步地,所述草欧菌素的生物合成基因簇还包括与转运相关的基因,即ACBG基因,其编码的氨基酸序列如SEQ ID NO.8所示,其核苷酸序列如SEQ ID NO.1中第30842-32506位所示。
进一步地,所述草欧菌素的生物合成基因簇还包括与抗性相关的基因,即ACBK基因,其编码的氨基酸序列如SEQ ID NO.12所示,其核苷酸序列如SEQ ID NO.1中42549-45662位所示。
进一步地,所述草欧菌素的生物合成基因簇的核苷酸序列如SEQ ID NO.1所示。
具体的,所述草欧菌素的生物合成基因簇来源于成团泛菌(Pantoeaagglomerans);更具体的,来源于成团泛菌(Pantoea agglomerans)ZJU23,保藏编号为CGMCC No.16174,保藏日期为2018年7月30日。
本发明还提供了一种包含所述草欧菌素的生物合成基因簇的表达载体或基因工程菌。
本发明还提供了所述生物合成基因簇在合成草欧菌素的应用。
与现有技术相比,本发明具有以下有益效果:
(1)本发明通过获取非核糖体基因,预测生物合成基因簇和验证预测结果,最终获得准确的草欧菌素的生物合成基因簇,可用于制备草欧菌素。
(2)本发明通过基因簇序列信息和结构分析,可以进一步对其生产菌进行遗传操作,获得新型的、更有效的抗生素,如通过基因操作来改变其PKS合成模块式结构,进行内酯环后修饰的改变,糖基因的置换或修饰,创造新的大环内酯类抗生素。
(3)本发明也可以通过对抗性基因或调节基因的遗传操作,提高抗生素的产量;所提供的基因及其蛋白质、抗体也可用以筛选和发展可用于医药、工业、农业的化合物或蛋白。
附图说明
图1为草欧菌素A和B的生物合成基因簇结构示意图;
其中,NRPS表示非核糖体肽合成酶;Reductase表示还原酶;Mbth family protein表示辅助氨基酸腺苷化蛋白;Glycosyltransferase表示糖基转移酶;Type II TE表示第二类硫酯酶;Resistance表示抗生素抗性;Transporter表示草欧菌素的转运子。
图2为草欧菌素A的基因合成顺序示意图;
其中,condenstaion表示缩合结构域;AMP-binding表示腺苷化结构域;Thiolation表示硫醇化结构域;Epimerization表示差向异构化结构域;Thioesterase表示硫酯酶;N-methyltransferase表示甲基转移酶。
图3为草欧菌素B的基因合成顺序示意图;
其中,condenstaion表示缩合结构域;AMP-binding表示腺苷化结构域;Thiolation表示硫醇化结构域;Epimerization表示差向异构化结构域;Thioesterase表示硫酯酶;N-methyltransferase表示甲基转移酶。
图4为ACBA、ACBB、ACBC、ACBD、ACBE、ACBF、ACBG、ACBH、ACBI、ACBJ、ACBK突变体对于禾谷镰刀菌的抑制活性照片。
图5为ACBA、ACBB、ACBC、ACBD、ACBE、ACBF、ACBG、ACBH、ACBI、ACBJ、ACBK突变体中A型草欧菌素含量的检测结果。
图6为ACBA、ACBC、ACBD、ACBH、ACBI突变体发酵后获得的不同中间产物的LC-MS总离子流图。
图7为ACBA、ACBC、ACBD、ACBH、ACBI突变体发酵后获得的不同中间产物的LC-MS数据图谱;
其中,A图为质谱数据图,B图为不同化合物的结构示意图;compound 1是A型草欧菌素,compound 2-5分别对应着ACBA、ACBC、ACBD、ACBH、ACBI突变体中间产物,compound 6是B型草欧菌素。
图8为不同突变体ACBA、ACBB、ACBC、ACBD、ACBE、ACBH的PCR验证图;
其中,ID-1对应的是突变体的验证条带,ID-2对应的是野生型条带,IN-1对应的是突变体内部引物验证,IN-2对应的是野生型内部引物验证。
具体实施方式
下面结合具体实施例对本发明作进一步描述,以下列举的仅是本发明的具体实施例,但本发明的保护范围不仅限于此。下列实施案例中未详细涉及的实验方法均为常规技术手段。
实施例1
1、非核糖体基因的获取
从小麦赤霉病菌的子囊壳结构中分离获取到成团泛菌菌株,我们将其命名为成团泛菌ZJU23,并保藏在中国普通微生物菌种保藏管理中心,保藏编号为CGMCC No.16174,保藏日期为2018年7月30日(本发明不涉及该菌株的保藏);对该菌进行全基因组测序后,根据后续的转录组对测序得到的序列进行注释。
2、草欧菌素A生物合成基因簇在非核糖体基因中的初定位
通过转座子随机插入,我们发现了大量失去抑菌活性的突变体,对这些突变体进行全基因测序,发现了大部分突变体在某个基因簇上大量聚集。因此初步推断草欧菌素A的生物合成基因簇在该核糖体基因簇上。
3、草欧菌素A生物合成基因簇的功能性预测
将初筛后的草欧菌素A生物合成基因簇进行功能性注释,将因簇的注释文件上传上到Antismash网址(https://antismash.secondarymetabolites.org/),得到对应的基因的预测功能和预测的底物;
而通过Antismash的预测,ACBA基因的底物为Thr(苏氨酸),Thr(苏氨酸);ACBC基因的底物氨基酸为Thr(苏氨酸),苯丙氨酸(亮氨酸);ACBD基因的底物为Gly(甘氨酸),Gln(谷氨酰胺),Gly(甘氨酸);ACBH基因的底物为Thr(苏氨酸),Asn(天冬酰胺)。ABAF基因的功能就是Mbth类蛋白。ABAG基因的功能就是转运子相关蛋白。ABAJ基因的功能就是第二类硫酯酶。ABAK基因的功能是多药抗性基因。
因为草欧菌素中不存在天冬酰胺和苯丙氨酸,因此我们对预测进行了矫正。HCBA基因的底物为Thr(苏氨酸),Thr(苏氨酸);HCBC基因的底物氨基酸为Thr(苏氨酸),Leu(亮氨酸);HCBD基因的底物为Gly(甘氨酸),Gln(谷氨酰胺),Gly(甘氨酸);HCBH基因的底物为Thr(苏氨酸),Arg(精氨酸)。
4、草欧菌素A生物合成基因簇的验证
在预测的基础上,分别对ACBA,ACBB,ACBC,ACBD,ACBE,ACBH,ACBI,ACBF,ACBG,ACBJ,ACBK 11个基因进行了单独的敲除,得到了11个突变体,分别为ACBA突变体,ACBB突变体,ACBC突变体,ACBD突变体,ACBE突变体,ACBH突变体,ACBI突变体,ACBF突变体,ACBG突变体,ACBJ突变体和ACBK突变体。
本研究使用pKD46质粒进行基因敲除,卡那霉素抗性片段作为筛选标记。在设计引物时,在卡那霉素两端直接通过PCR连上目标基因两侧50bp同源臂,作为同源重组替换片段。通过电击转化的方法,将同源重组片段导入到细菌中。
我们将11个突变体进行平板活性检测。结果如图3所示,ACBA,ACBB,ACBC,ACBD,ACBE,ACBH,ACBF,ACBJ的抑制真菌活性完全散失,而ACBI,ACBK的抑制真菌活性没有散失,而ACBG的抑菌活性严重减弱。将11个突变体和野生型(即:成团泛菌(Pantoeaagglomerans)ZJU23)在相同条件下(WA培养基)进行发酵,并得到对应的发酵产物。将种子液在LB培养基中过夜培养之后,按照1:1000的比例接种到WA培养基中,之后在25℃条件下培养3天后,检测发酵液中A型草欧菌素的含量。如图4所示,ACBA,ACBB,ACBC,ACBD,ACBE,ACBH,ACBF,ACBJ不能产生草欧菌素;ACBK突变体中A型草欧菌素含量不受影响;ACBG中A型草欧菌素含量显著下降。ACBI突变体含量中检测不到A型草欧菌素。
将图3和图4相结合,Antismash的预测结果显示ACBF,ACBG,ACBJ,ACBK基因对应的功能分别是Mbth类蛋白,ABC转运子,第二类硫酯酶,抗生素抗性基因。在文献报道中Mbth类蛋白辅助NRPS中氨基酸的腺苷化过程,帮助氨基酸与NRPS中的A型结构域相结合,从而帮助NRPS类非核糖体多肽的合成;ACBG是基因簇上的ABC转运子,是帮助A型草欧菌素向外转运。ACBG转运子的缺失,会导致其发酵液中A型草欧菌素的含量显著下降。ACBJ是第二类硫酯酶,能够修复在多肽合成过程中的错误合成,从而回收底物。根据文献报道,第二类硫酯酶的缺失会导致其终产量的下降,而ACBJ突变体的表型与之对应。而ACBK的突变体预测的功能是一类抗生素抗性基因,ACBK的缺失不会影响其A型草欧菌素的含量。这与预测的结果一致。
在酶活反应过程中,如果对应的酶作用消失,或者没有对应酶的表达,会导致生化反应的停止,从而导致了反应前体的增加和产物的消失。我们利用这一现象对各个突变体进行发酵,通过(LC-MS)质谱的方法检测突变体的发酵产物是否和预测结果的一致。若一致表明对应基因合成的酶的底物与预测的一样;若不一致表明底物与猜想的存在差异。在此基础上,将计算得到不同突变体中间产物的精准分子量和不同突变体的发酵产物的高分辨液相色谱数据进行比对。
如图5显示,对比与野生型发酵液中,在ACBA突变体发酵液中检测到累积的Compound 2;在ACBC突变体发酵液中检测到累积的Compound 3;在ACBD突变体发酵液中检测到累积的Compound 4;在ACBH突变体发酵液中检测到累积的Compound 5;在ACBI突变体发酵液中检测到累积的Compound 6。
如图6显示,在ACBA突变体发酵液中检测到累积的Compound 2,预测是[M+H]为328.2488,检测[M+H]为328.2664;在ACBC突变体发酵液中检测到累积的Compound 3,预测是[M+H]为429.2965,检测到为429.2971;在ACBD突变体发酵液中检测到累积的Compound4,预测是[M+H]为643.4282,检测到为643.4393;在ACBH突变体发酵液中检测到累积的Compound 5,预测是[M+H]为885.5297,检测到为885.5340;在ACBI突变体发酵液中检测到累积的Compound 6,预测是1138.6836[M+H]为检测到为1138.6839,与B型草欧菌素的分子量对应。这里也进一步说明ACBI突变体为什么依然具有抑制真菌活性,因为ACBI突变体中具有B型草欧菌素,B型草欧菌素依然具有抑制真菌活性;并且在ACBH中能找到ACBD的中间产物,在ACBD中能找ACBC的中间产物,在ACBC中能找到ACBC的中间产物,以此类推,我们认为这个基因簇的顺序是从ACBA-ACBC-ACBD-ACBH-ACBI,如图1所示。
结果如图7显示,我们分别对ACBA,ACBB,ACBC,ACBD,ACBE,ACBH,ACBI的突变体进行敲除,并进行了PCR验证。ID-1对应的是突变体的验证条带,ID-2对应的是野生型条带,IN-1对应的是突变体内部引物验证,IN-2对应的是野生型内部引物验证。图7表明了获得的转化子都为正确的突变体。
表1.1非核糖体肽合成酶ACBA基因各结构域及其氨基酸的位置
Figure BDA0003106185870000071
表1.2非核糖体肽合成酶ACBB基因各结构域及其氨基酸的位置
模块 结构域 氨基酸的位置
模块3 C 2-292
表1.3非核糖体肽合成酶ACBC基因各结构域及其氨基酸的位置
Figure BDA0003106185870000072
表1.4非核糖体肽合成酶ACBD基因各结构域及其氨基酸的位置
Figure BDA0003106185870000073
Figure BDA0003106185870000081
表1.4非核糖体肽合成酶ACBH基因各结构域及其氨基酸的位置
Figure BDA0003106185870000082
SEQ ID NO.1中有5个非核糖体肽合成酶基因(ACBA-D,ACBH),核苷酸互补序列及其氨基酸序列,是草欧菌素A合成所必须的;其中,包含8个模块,28个结构域如图2所示。ACBA包含2个模块:加载模块域中,A1、T1和C1负责草欧菌素A的起始合成,催化一个苏氨酸和一个长链脂肪酸作为起始单位,模块2含有A2、C2、T2结构域,负责引入一个苏氨酸。ACBB包括模块3,含有C3结构域。ACBC包括模块4和5,模块4含有A4-T4-C4-E4结构域,负责引入苏氨酸。模块5含有A5-T5-C5-E5结构域,负责引入亮氨酸。ACBD包含模块6,7,8。模块6含有A6-T6-C6-E6结构域,负责引入甘氨酸;模块7,含有A7-T7-C7-E7结构域,负责引入谷氨酰胺。ACBH包括模块9,含有A8-T8-C8结构域,负责引入甘氨酸单位。ACBH包含模块9,10。模块9,含有A9-T9-C9结构域,负责引入苏氨酸。模块10,含有A10-T10-C10-TE10,负责引入精氨酸,并在硫酯酶(TE)参与下完成碳链的环化及释放。最后在ACBI糖基转移酶的作用下,在长链脂肪酸部分加入葡萄糖分子。
序列表
<110> 浙江大学
<120> 草欧菌素的生物合成基因簇及其应用
<160> 12
<170> SIPOSequenceListing 1.0
<210> 1
<211> 45662
<212> DNA
<213> 成团泛菌(Pantoea agglomerans)
<400> 1
tgcttgcctc tggaattagc atcgaacaac tggtcattgt ttaggatcca gtaaccctga 60
tgaactagtt ttcagaaata caagtaaagg gctagctgct cacagcttgc aactatcctc 120
ttgttagtaa tgcactgatt tttgaacttg cccgtgtgat acaggataaa tcatggtcgc 180
gaatactaat caaatgcatg atgatatata cagtaaaacc actacccaaa aaggttatcc 240
cagtaaagca aacaaagagt aatttatgca acaacctcct atttgtgatg ttctttactt 300
tgctctcctc aatggtgatt tagatgcatt ggataatatc agtgcggaga ggcccgagga 360
aattatctcc gtacggacta tcatatgtac taatgcacaa ggggctattg acattaaatt 420
ggggctcggg ccaggagtgg ctgaaaacga actaattatt tcattttttg aaggcccacg 480
agcctatttt atgtcagcag gcacaagtct tcaggcatat ctgacgggcg gtaaaagtaa 540
cgtattgttc gattactcac taagcttcga ctccaatttt gccgagaaaa tgcgtgcggt 600
cgtgtcagga gaaagcattc aggaagttga acgtaatcgc gttattgaaa tattgttgct 660
gaaggcgcac aactccagag tgcagtttga tttgcttcca tttcttatcg aaaatgctcg 720
actttcaaga tctaatccgc aaaacaaacg ccctctcaat accctgattg cttttcgcat 780
gctggatcac ttggactgga agcgtttccg ggaatcacca ggcacgtttg tatacagtaa 840
atctgtagat accttaaggg cggaacttct tgaagatgcg gaagctttta tgcacaaaat 900
gcacagcagt gagtgcatag tacagcagga agcaaagagt acgttcaatc aagcattgtt 960
gttatgcttt gcaagactct ggcacagaga tagtaatcgc gatcacaaac tgattttgcg 1020
ccagctccta atatattccg ttacgaaact tggggctata ccacttacag agttacagct 1080
aatttggagg ggcatgacct catcgccagt atcgcccttt ttcggcccta taacagggag 1140
gtcacgcgaa atgcttaaag caatacgcgg catggcatgg gatatgacat tgctacgcct 1200
gcttgagcag gttgccaccg cgactcagaa tgggtcattt tttattccct acttcgtaac 1260
caatgatagt cgctggcgtg aactattacg gttaagcccg gtaaaaatga tggtcataga 1320
tgacaaggac catcgagtac taattgcgcg caggaatgag agagagttca gagctatttt 1380
aaaggaatgc ttgcaggatg aattacatcc ttatatgact cttgaattga ttgagcagcg 1440
tcgtcaggcc gcaaaaaacc tgcagcctga tgcattgaag gccctcgttg agcaagaaga 1500
ggcattctgg gcagcccatt aagaagaatg acttacgccc cgtttattca ttttcttcaa 1560
tgcaagttag acttatgact atttaattac aagcggacat aactaacctc ataccgcttg 1620
cataagtaga gagctacctg attctctttc tgccttgacg cgactgaaaa atgcgcgata 1680
tcccaatcag gaatgcgtcc ggccctcgtt gtctcttgcc tgtcgacttt ttactgagtc 1740
tctacggaat tattgcctag cagtcgttgc agtttcttta acagaaacag gtgtggcctg 1800
agtgcccacg tataaaacgc aacgtcgcgc tgctgttgca gaaatcccta acccgaccac 1860
ggcgcatccg gcgagagcag ggcggtcaaa cgacattgcc tgacccgcac attcgtcgcc 1920
aaccaccgct gtctgactaa tcttcttcgc tacgcgggga gcgtacgggt tacggatttg 1980
gcggagtcag ttctgaatta cggggtaacc taccagttaa gggccggtgc gtactggccc 2040
tttaagatgc gcacccgctt gcctgcagcc agaacaagaa gtcatgaaga tagtcaataa 2100
catggaaaag agtgcggccg tgttcattcg aagagccggt gcacatccgc atcacgcaga 2160
gctgcttgtc caggagctgt cagaaaaatc ctccccgatt ctcaacaaac gcccaggctg 2220
ggccaggacg tcaccatgtg ccggaaggag aaagaaggaa acgggccatt atgattcatt 2280
atcgtccacg tttattcgtg tcaggcgctg cactgcatga aatctgtgag aaaagcagga 2340
tagatcaacc catgcaaact cttagcaggt ttggcttttt ttgtacgcca gaggcatagg 2400
caggcacgat tgaatcgtca aattactatg ttacgcctaa caagatagag tgcaaaacct 2460
catgtaatgg catgaggata ttcttaaaac aataaataat ttttcattat ttggcatcta 2520
gctaaatata gcttacaaat catttcatat cattttcagt tagttacatg gaatcacata 2580
acgaagcggc tataaaagtt ttataattta acctaaaaag ttagtttttt taatgttttg 2640
ttcatattta aggggtagga ttcataagat atgtaaatgg ttatggcgaa caccttgctg 2700
ttagcggata aaatcaagtg ttcagaggga tgttgctaaa cgttttaaac gtcagttgcc 2760
gtctcagatt tagcgggtga tttctttaaa cataatgatg atatgaagga tagattcaaa 2820
atgttagttc ctttaatcct aagttcttcg cttcttctcc tcatatctcc attagcacgt 2880
cccataagtc tgcatacatt caaaaaaaca gttacgccat tgaaaaataa aatttaaaaa 2940
caagaggtta aaccttgttt tgaattgtgc gttagcgtct gaacacttaa ttgtgatagg 3000
tgaacgtgta tgttgctaga cagacagtat gaggtaattg ctaagcaggc attgtatcca 3060
ttcgctccta ttttcaatat cggtgcggta atagatatca gaggacctct tgacgagcag 3120
cgcatgtttg atgctgacca ggcagttaag cgagatcctg cattaagatc ggctttgtct 3180
atgaggggct atgaaccgga aatagttaca cttgctgaag atacctttcc cctgaaaatt 3240
ctggatcttt cctgcaatga tgaccctttc acaaacgctt ttctctgcat agagcagtct 3300
cttcagcaaa tgtttgcttt cgaggggaaa acgcctctga tgcaacatac gttaatacgc 3360
ctggcaaatg atcatcacct gatggtgggg atttaccatc atctggctta cgatgggtgg 3420
gcaacctcac ttatatatca acacctggcg gcttattaca acgatttcac ccgatttaac 3480
tctgtacgga atttatcgcc gctaagttat caggaacaga ttagtgctga aatgaattat 3540
aagcattcag cctcttatat ggccgatcat tcctactggc aagcgaggct atctggttat 3600
gatacaatgc tttttgcaca aggatgtagg gatgtagcag caaagcggta ttcattcact 3660
cttgatattc attatcgcga aaagttgcaa gagcttgcgc ttgattccgg gggaacactt 3720
tttcaggttc tgactggaat tactgctatt ttccttttcc aactttttgg tactgatgat 3780
gttgtcatag ggctgccagt gctcaatcgc cgcacggccc gcgctaaaca gacatttggt 3840
tttttcgcta atgtattgcc attccggctt caaagaaaaa ttaacgatac attcaagacc 3900
cttctcaaaa atattatcat tcttctgaaa gaggattacc gtcatcagcg ttttccggct 3960
aatcagattt taaagggagg gacatcatat gaggccaccc tttcctacga gaagcatgat 4020
tacagcgcaa tttttgaagg tacagatacg caactgaatg tattatccag ctcttgtcag 4080
gattaccctc ttaaactttt cattcgcgat tacgatccag aaaaaccctt aaaaatcgat 4140
attgattata acatttcggc tttcagcgaa atggatgttg agcatgtatt ccaggaattt 4200
aaaacaattc ttgataactg tattaatcat ccagagcgac agttggtcat aaatcatcaa 4260
atccgacctg acgaagcgag cttcataccg ccagatgtaa caactgaact ttgcacccag 4320
tttgaggcag ccgcatcgcg ccatgccgat cgggtggcca tcacctgcga aggcgaaagc 4380
ctgacctatg ccgcactcga cagcgccgcc agcgcgctcg cctggcgcct gcgcggcctg 4440
ggcgtcggca ccggcccgca cgaaagcctg gtcggcctca gcgccggccg cggacccggc 4500
ctgctggtcg ggatcctcgg catcctcaag gccggcggcg cgtacgtgcc gctggacccg 4560
gtttaccccg ccgagcgcct cgccttcctg gccgccgaca gcggtatccg cctggcggtg 4620
gctgacgaca cgggcctggc ggcgctggcc gggctcggcg tacagacggt gagcctttca 4680
gctgaccatc cgcgccgggc cggcaatcag gccccgccgc gctcgctgca cccgcagcag 4740
gcggcctacg tcatctacac ctccggctcc accggccagc ccaagggctg cgtcgtcagc 4800
cacgccagcg tggtgcgcct gtttaccgcc actgaacact acggcttcgg cgagtcggac 4860
gtctggacgc tgttccactc ctacgccttt gacttctcgg tctgggaaat ctggggcgcg 4920
ctgctgcacg gcgggcgcct ggtggtggtg ccctacctga gcagccgcga cccggagcgc 4980
tttgcccacc tgctggaagc ggagtcggtc accgtgctca gccagacccc ggcggccttc 5040
cgacagctga ccgcggcctc ggccggacgg gactttgcgg cgctgcggct ggtgctgttc 5100
ggcggcgaag ccctggagcc gggcagcctg gcgccgtggt tcgcgcagca cggcgggcgg 5160
gtgaggctgg tcaacatgta cggcatcacc gagaccacgg tacacgtgac cgagtacacg 5220
ctgacgccag agagcatgac gcagggcagc gtgatcggca cggcgctggc ggatttgcac 5280
gtgcaggtgc tggaccgcta cggcgagccg gtgccggcgg gggtaacggg cgagatgtac 5340
gtgggcggcg cgggcgtgac gcggggctac ctgggccggg cggcgctgac ggcgcagcgc 5400
ttcgtgccgg atccgttcgg cgcgccgggg gcgaggctct accgctccgg cgacctggcg 5460
cgccgccggg cggacggcgg cctggtgtac cagggccggg cggaccagca gctgaagctg 5520
cgcggctacc gcatcgagcc gggcgaaatc gaggcggcgc tgcgggcgca ggcgggggtg 5580
cgcgacgcgg cggtggtgct ggacgcgccg gcgcagggcc agccgcggct ggtggcttac 5640
gtggtggggg gcggaggggc gcaggcgctg cgcgaggcgc tgtcggcggc gctgccggag 5700
cacatggtgc ccgcggtcat catgccgctg gcgcggctgc cgctgaccgc gcacggcaaa 5760
ctggaccgga aggcgctgcc ggagccggaa gtcacggtgt ctgccggagg cgaggcgcga 5820
acggaggtgg aaaaaacgct ggccggcatc tggagcgagg tgctgtcgat cccggcgccg 5880
ggcattgacg acaacttctt cacgctgggc ggtgacagca tcagttcgct gcaggtggtt 5940
tcccgggcca gagctgcggg tattaacatt acgatagaag gctttttagc cggacagcat 6000
attcgaaaaa ttgccgccgg agtacaaagc gggcctgttg cagccgatga tgaaagcctg 6060
accgtacctt tcagcctctt gtcagcagcc gatcgtgcac ggctgccgga caacgttgac 6120
gacgcttttc ctttgtccag actacaggcc ggaatgctgt ttcactccac gctggctgaa 6180
gaaggcgcta ttttccatga cgtctttact ttccggctac gcatgccgtg gaacgaacac 6240
gcatggcgta gtgcctttga gctgctaccc gcttcccata cgcctttaag aacctctttt 6300
cactggacag gctatagcga accgctacag gttgtacatt cgaccgccga cattgattat 6360
caaatcgtcg atctgcgtta tctggaaacg gaacagcggc ggcaggcggt taatgatttt 6420
atcgcacaca gcaaatccta cggtttcgat ccggctaaag ggcgcatgtt tcgtgtgagc 6480
ctgcatcgcc atagcgatga agagctacag ctgacactgg attttcacca cgcaattttc 6540
gatggctgga gcgtggctac actgctcagc actttaatcc accgggtgac cggcacggag 6600
gcaacaaacg cacgctctga taccactgtc aacacggctt tcgtggcgct ggaacgtaaa 6660
gccgaggccg acgagcagct ggtggctaag tggcgggagc gggtagcaga tgttgtacct 6720
acgttgctgg gtgaccatag cgcggctgaa ctctccggca cacgacaggt tcagcgccgg 6780
gctttccgcc tgcctgacca tttaaccagc aagctcaaac aacgcgccac ggatctggct 6840
atcccgttaa aaatcgttct gctgacggcc catctcagcg cgctggctaa agtcaccggc 6900
ggaacagtta ctactacagg ctacgtcacc cacggtcgcc cggcaggtgc agataaggca 6960
gtcggtttat tccttaatac ccttccgttc agcatggcac taccaccagt gagctggaac 7020
tcgctgataa aaagcattgc cgctgaagag caggcgattc aggcaatacg ccgcctgccg 7080
gcctcggtaa ttaaaccgct gaacagcagc ggacagctct ataacgtcag ctttaattat 7140
attcacttcc atatctacaa cagcctgcct gacctggcag attttcaggt cgtcgatttt 7200
gagattttcg aagaaaccga ttttccgcta ctggctcaat attcgcagga tccgtttgat 7260
gcctcgcttg agctcacgct ggttgccgat cctgcggttg tccccgaatg gcaggtagag 7320
cagtttggcg attttgtact tcgcgctgca gaggcgatag tcagcggctc agaggccccg 7380
tggtatagca gtcttcgctc agaagcatta ccgctcgtgc ctgacgcatc atcggaactc 7440
accttggacc tctgtacgca atttgaggca gccgcatcgc gccatgccga tcgggtggcc 7500
atcacctgcg aaggcgaaag cctgacctat gccgcactcg acagcgccgc cagcgcgctc 7560
gcctggcgcc tgcgcggcct gggcgtcggc accggcccgc acgaaagcct ggtcggcctc 7620
agcgccggcc gcggacccgg cctgctggtc gggatcctcg gcatcctcaa ggccggcggc 7680
gcgtacgtgc cgctggaccc ggtttacccc gccgagcgcc tcgccttcct ggccgccgac 7740
agcggtatcc gcctggcggt ggctgacgac acgggcctgg cggcgctggc cgggctcggc 7800
gtgcagacgg tgagcctttc agctgaccat ccgcgccggg ccggcaatca ggccccgccg 7860
cgctcgctgc acccgcagca ggcggcctac gtcatctaca cctccggctc caccggccag 7920
cccaagggct gcgtcgtcag ccacgccagc gtggtgcgcc tgtttaccgc cactgaacac 7980
tacggcttcg gcgagtcgga cgtctggacg ctgttccact cctacgcctt tgacttctcg 8040
gtctgggaaa tctggggcgc gctgctgcac ggccgggcgc ctggtggtgg tgccctacct 8100
gagcagccgc gacccggagc gctttgccca cctgctggaa gcggagtcgg tcaccgtgct 8160
cagccagacc ccggcggcct tccgacagct gaccgcggcc tcggccggac gggactttgc 8220
ggcgctgcgg ctggtgctgt tcggcggcga agccctggag ccgggcagcc tggcgccgtg 8280
gttcgcgcag cacggcgggc gggtgaggct ggtcaacatg tacggcatca ccgagaccac 8340
ggtacacgtg accgagtaca cgctgacgcc agagagcatg acgcagggca gcgtgatcgg 8400
cacggcgctg gcggatttgc acgtgcaggt gctggaccgc tacggcgagc cggtgccggc 8460
gggggtgacg ggcgagatgt acgtgggcgg cgcgggcgtg acgcggggct acctgggccg 8520
ggcggcgctg acggcgcagc gcttcgtgcc ggatccgttc ggcgcgccgg gggcgaggct 8580
ctaccgctcc ggcgacctgg cgcgccgccg ggcggacggc ggcctggtgt accagggccg 8640
gcggaccagc agctgaagct gcgcggctac cgcatcgagc cgggcgaaat cgaggcggcg 8700
ctgcgggcgc aggcgggggt gcgcgacgcg gcggtggtgc tggacgcgcc ggcgcagggc 8760
cagccgcggc tggtggctta cgtggtgggg ggcggagggg cgcaggcgct gcgcgaggcg 8820
ctgtcggcgg cgctgccgga gcacatggtg cccgcggtca tcatgccgct ggcgcggctg 8880
ccgctgaccg cgcacggcaa gctggaccgg aaggccctgc cggagccgga aattgccgtg 8940
gcgcagaacg aagaaggcta tcaatccagc cttgaacagg aaattgcaga gctgttaagc 9000
agcgtgctgg ggctgtccgg catcggacgg catcaaagct ttttagagac gggcggcgat 9060
tccattctgg cgacacaggc cctcttccgc ctgcgcgagc tgtatggggt cgagcttccc 9120
ctgcgaacga ttttcgaagc aggcaccgtt gcaggcgtag ccgcgaaaat aaaggctctg 9180
cggcaggaag aacgtcacgg ggagcgacag atcagcgatt ccactccgct tctgccatca 9240
cgacgtcgtc aaaagtgaac gacccatctt acaaggagcg agcgcagtga ataattttta 9300
cgatgattct gaagccctgt tatttccttt atcctcggcc cagcggcgac tctggactct 9360
ggccgaaatc aatgaagcgg acgtgagtta taacattcct tttgccttac gctgccgcgg 9420
taaatttcat tatcaggcgc tgcgccaggc gctaactgat ttacaacagc gccatgaaat 9480
tctgcgcacc agttatggcc ttattgacga cagcccgatg cagcgtatac atcccgccga 9540
agacgatctg gcgctgccac ttattcgtat taacgaagca cagcttgaaa aaaaactggc 9600
agaggatgca gccgagcctt ttaatttgca gctggcccct gtctttcatg ctcgcgttta 9660
tcagttaaac gacgaccatc atattctttc catggtcgtt caccatattg cctgcgacgg 9720
ctggtcagtc actattctgc tgcgtgagct aagtcatttc tacaatgccc gcgttgccaa 9780
tatgtcccca acgcttgccg aacttccttt gcagtatgct gactatgccg aatgggaaga 9840
agcggaagct aaacggacgg caaatcccgc aggcgagacc ggaaccaggc ttcattttca 9900
gcctgcagtt gcattgcccg gctgtgagtc tgatgaggca gataaagaaa atgcctgcgg 9960
tatcgttcag cagcgttttg acgctgattt tctgcaaaaa cttaacggct acgcccggga 10020
acaccatacc acgctgtttg ttaccttact ggcgggcttt atggcgctgc tgcgccggtt 10080
aactcaggct gacgatgtct gtattggttt tccggtcgcc aacagaaagc gcagcgagct 10140
ggaaaatatt gttggctatt ttgtgaatac gctggttatc cgcgatgaaa taagccgtga 10200
cgatactttt gatagcctgg ttgcacgttg tgccagcagt gtgcttgacg cgctggagca 10260
cgaagaggcc agctatgaga aattgctgaa gcaaaccccg cgtgaaaata ctaacagcgt 10320
gccgtttacg gctatgtttg ccttcgagaa tatttccgca accgaatttg cgtttaacga 10380
cctgcaaatt gagctggtcg atgtttatcc ggctcaggct aaattcgatc tgacgctgct 10440
gctgaagcag gacggcgagg tgctgaccgc cactttcgaa ttccgcgcga gcgtatttta 10500
cccgagatcg cccgttcctg gatggcatgc tatcagcgct tactggaagc cgaagttctg 10560
gccccagccc aggcaattga tcgggtgaag ctggcggatg ctcttataaa accgtttaca 10620
tcaccgacga ttgctactga cctttgtacc cagtttgagg cagccgcatc gcgccatgcc 10680
gatcgggtgc catcacctgc gaaggcgaaa gcctgaccta tgccgcactc gacagcgccg 10740
ccagcgcgct cgcctggcac ctgcgcggcc tgggcgtcgg caccggcccg cacgaaagcc 10800
tggtcggcct cagcgccggc cgcggacccg gcctgctggt cgggatcctc ggcatcctca 10860
aggccggcgg cgcgtacgtg ccgctggacc cggtttaccc cgccgagcgc ctcgccttcc 10920
tggccgccga cagcggtatc cgcctggcgg tggctgacga cacgggcctg gcggcgctgg 10980
ccgggctcgg cgtgcagacg gtgagccttt cagctgacca tccgcgccgg gccggcaatc 11040
aggccccgcc gcgctcgctg catccgcagc aggcggccta cgtcatctac acctccggct 11100
ccaccggcca gcccaagggc tgcgtcgtca gccacgccag cgtggtgcgc ctgtttaccg 11160
ccactgaaca ctacggcttc ggcgagtcgg acgtctggac gctgttccac tcctacgcct 11220
ttgacttctc ggtctgggaa atctggggcg cgctgctgca cggcgggcgc ctggtggtgg 11280
tgccctacct gagcagccgc gacccggagc gctttgccca cctgctggaa gcggagtcgg 11340
tcaccgtgct cagccagacc ccggcggcct tccgacagct gaccgcggcc tcggccggac 11400
gggactttgc ggcgctgcgg ctggtgctgt tcggcggcga ggccctggag ccgggcagcc 11460
tggcgccgtg gttcgcgcag cacagcgggc gggtgaggct ggtcaacatg tacggcatca 11520
ccgagaccac ggtgcacgtg accgagtaca cgctgacgcc agagagcatg acgcagggca 11580
gcgtgatcgg cacggcgctg gcggatttgc acgtgcaggt gctggaccgc tacggcgagc 11640
cggtgccggc gggcgtgacg ggcgagatgt acgtgggcgg cgcgggcgtg acgcggggct 11700
acctgggccg ggcagcgctg acggcgcagc gcttcgtgcc ggatccgttc ggcgcgccgg 11760
gggcgaggct ctaccgctcc ggcgacctgg cgcgccgccg ggcggacggc ggcctggtgt 11820
accagggccg ggcggaccag cagctgaagc tgcgcggcta ccgcatcgag ccgggcgaaa 11880
tcgaggcggc gctgcgggcg caggcggggg tacgcgacgc ggcggtggtg ctggacgcgc 11940
cggcgcaggg ccagccgcgg ctggtggctt acgtggtggg gggcaaaggg gcgcaggcgc 12000
tgcgtgaggc gctgtcggcg gcgctgccgg agcacatggt gcccgcggtc atcatgccgc 12060
tggcgcggct gccgctgacc gcacacggca agctggaccg gaaggcgctg ccggagccgg 12120
aagtcacggt gtctgccggt ggcgaggcgc gtacggaggt ggaaaaaacg ctggccggca 12180
tctggagcga ggtgctgtcg atcccggtgc cgggcattga cgacaacttc ttcacgctgg 12240
gcggtgacag catcagttcg ctgcaggtgg tttccaaagc cagagccgca gggatcgcta 12300
tcactcctaa gcaggctctg cttttcacca ctctccgcaa gcttgcggcg gtagccgaga 12360
cgagcaaagg caacgcggcg ctacatcaaa acgcacgctg cccgtcaggg cccttactcc 12420
ctacgccgat tatcgcctgg ttccaggctt tgaaattatc tgcgcccgct cactggaatc 12480
agtcgctggc gcttgaaata gcgcatcccg ttgcaccaga tctgcttgcc caggcgctga 12540
aggctattgg acagcatcac gatgccttca ggctgcgcct ggactatggc aacgcagaga 12600
gcctctctct ggctgaagtt atgcaggaac ctttcccgct tgagatccgc accgtaaact 12660
cccaggttga gagggatgcg gccatcctcc atgcgcagaa ggggctgtcc ctggatgacg 12720
gtcctgtggg acgcgccatg ctgatccagc atgcgggcga gaccgatatc ttagtactgg 12780
ttatccacca tattgccgtt gacgcggttt cctggcacat cctgctggac gatctcaatg 12840
tagcgataaa acggctgcag aacgcgcaaa agattgtgct ggacccggtt gttaccaacc 12900
tgactgactg gagccgttct ctgcaaaccg cagcggaacg tgccgatcct cagcgctggc 12960
tgaggatggc ggcacagggg aatccttcgc ctttccatga ttttgtgaca gtgcaggggt 13020
tgaacagaga acagggactg accgtctgta gtcgcacgct ctcatcggaa aacagtgcat 13080
tgtttttaca actactctcc agaggcagtg aagcacgtgc ctccgccctg ctgtgtgccg 13140
cgctgtggcg gctttttaat gagcaaccgc tggcggtaac gctggaacac aacggccgcg 13200
atgtggataa ggatgccgat ctctcccgca ctctgggctg gttcaccagc ctctatcctt 13260
tcttttatag cgggcagccc gcgctggcgt cggctgaact gctggccgag atggaatcct 13320
cgctgctgga gctggcaccg cataaagccg aatatggttt agtccgctgg ttaagcgaag 13380
atgaagaagt ccgtgccaaa ctcgacgaag cagatcttcc agccctgagc ctgaactacc 13440
tgggccagat tcccgaccag caggaaggcg agtttgtgct gcgtcacgat atcagcagtg 13500
tcgaccgcgc tgtcggcaac gtcagggcat ttaccctgga cctggtcgcc gtagtgatca 13560
atggcgagct gcggttctac tggaactatt gccgcaatgt gcttaaaccc gagatcgtcg 13620
agggctgggc cgatgcgcta cagcagcatt tacagcagct tcttaccgaa ttgaccgctc 13680
ggcccctgct ggttgccgat ttcccgcttg ccaggatccg gcagattcag tttgaggcgc 13740
tggtcggcaa gcaagccgtg gcggacgcct atcccttatc acctctgcag gaaggaatgc 13800
tgttccacag cgtcgccgaa cccgaaaatc atgcttatca cgagcaggca gtggctctgt 13860
ttgaacgtct tgatgcagat ctgtttatca aagcgtggaa aacactgctc tcccgacacg 13920
atattttgcg taccagcttc cactggcaag atctgccgcg cccgctgcaa atcgttcatg 13980
ccacggctga cctgccggtc acggtattcg actggcgcgg tgaagaccct gccgaacgac 14040
ttgcggaatt tttacagcag gatgccgaca aagcgttcga tttaagtgtg gcgcctctgc 14100
tgcgtgtgat gctggccaga attgaccata acagctggcg ttgggtgtgc agctatcacc 14160
acattttaat ggacgggtgg tcgctgcctt tactgatggg cgagctggta catatttatg 14220
aaagcctggt agccgccaca cagccgacgc tgccgccgcc ggtacagtat gggcgtcata 14280
tcgccaggct tgtgcagcac gccagtgaac aaaccggtaa ggttttctgg ctgaacgcgc 14340
tggctggctt ggaacgccct acgttactga gccctcagca gcagccgtcc gctgattacc 14400
atgatctgct ggtaaccctg tcgccggaac aggagcaggc catacgcaca gccgcgcgcg 14460
aggccggcgt gtcactgggc aatgtcttta atgcagcctg ggggatcttg ctggcgctct 14520
ccggccatgg caacgacgtg gtatttggca gcacgctttc aggccgcgaa acaggcgtag 14580
aggacgttga taaaatgatt gggcttttta tcaatacttt acctctgcgt ctgcgcttac 14640
ggcccgaaat gagtgtgcgc gacctgctgc ataaagcccg tcagtttcag gctgatttac 14700
aagagcatag tcacgatcgc ctggttgacg tccagcgctg gagcggccta gaaggcgagg 14760
ggacgctgtt tgacagcgtg ctggtgattg aaaattatcc cggtggcgca ccggaagata 14820
acggtaaagg tttccggctg gtggagtttg cttataagga gcattcaaac tatccggtca 14880
cactggcggt gcttcctgat aacggcctga aaattaagct tgattataat tgcgccactt 14940
ttgatgacac agcggcggcc ctgctgctta agcggttaac agacttgatc agtaagatga 15000
tcgaggatcc ggatcgcagg ctgagtacac tggatctgct ggcggaagag gagcagataa 15060
tcgcccgcga ggtgtggaac gccggcgcat tcaacgccgc ttcgccggtg ctggcgcatc 15120
agatgtttga gaagagcgtc agccgtcagc cgcaggctcc ggcgctgctg caaggcgaaa 15180
caaaatacga ctacagccag ctcaatcaca aggctgacgc gctggctgca accttacaac 15240
agcagggcgt ggggccagaa tccgtagtgg cggttatgct gtcacgcggg ccggaagccg 15300
tgatctcgtt cctggcgatt ctgaaagcgg gcggggttta tctgccgctc gatgcccagt 15360
atccggttga ccgtctggat tacatgctac gggatagcca ggcagtgatg ctgctcagcg 15420
ataaggcaca gtccgtagag aaactgaccg cgatgccgaa ggcgttgctg ctgctggaca 15480
gcttcgattt tatgtcggac gcccgtcccg ccgcctgtac aaacctgact gctaataacc 15540
tggcctatct gatttatacc tctggctcta ccggcaaacc caagccggta ggcgtatccc 15600
atgccggtat cgccaatttg caggcggaaa cggagcggat gctggggaca gatgctcacg 15660
ccagagtgta catgcaggct cccctaagct tcgacgcttc ggtatgggaa atgatgatgg 15720
cgctgtttgg cggcggtgcg ctggtgctgc cggatggtga tgccgaaggc gacgtgctgg 15780
ccgcgctgaa tcaggccgcc gaacgccacc actttaccca cgtgctggtc acacccgcct 15840
tgctgggcct actgaaggat tatgctttac cttctcttca taccctgatc gttggtggcg 15900
atgcctcagc accaggcatg atggcgcact gggcgaaaag ccgccgggtt ttcaatgcct 15960
atggcccgtc ggaatgtacg gtatgcgtgg cgattgaacc ctgcggcgta aacaccgtta 16020
cccctccact ggggctgccg ctttatggca ttccaatgta tttgcttgat tcatggggta 16080
atcctgtacc accgggcgtt atcggtgaaa tcttccttgg cggtgacagc ctggctcgtg 16140
gctatatcgg caggccagcc ctgacggctg gcgtatttat tccggatcat ctgagcggcc 16200
tacccggcgc gcggctctac cgtaccggag acaccgccat ccgtctgcag gatggtcgca 16260
ttaaatatgc gggccgcacc ggcggttacg ccaagctcag aggcaaccgc atcgatctta 16320
atggcgtaga gctgctgttg caggggcatc ctgccgtgcg ggaagcgctg gcgatgattc 16380
gaacagtgga aaacggccag tcgctgatcg cctgggtcgt ggccgaaaaa gggacggaag 16440
cgaatgaact gcgcgactac atggtaaagc atgcggccgc attcgaagtg ccgggtgcca 16500
ttgtaccgct gacccgatgg ccgttaacgc cagccggtaa gatcgaccgc aacgctttac 16560
cgcttcctgc aaccgcgcct cgcgcttccg ttgacggtaa ggcgctcagg ccggcagaag 16620
cagcgttgct gcaaatctgg tcgcaggcgc tcggccgcga cgatatcgat ctgcacgatg 16680
actatttttc gctgggtggc gattcgatta tcgcgctgca aattacctca ctggcccgtc 16740
aggaaggatg gtcggtgacg ccaagaatgg tcctacagta tcgcaccgtg gcagccctcg 16800
ccgctatggc cagcgtgcta gacacccacg agcccgaacc ggataacgca aaggtcgaac 16860
ttgcgccgat ccagcactgg tattttgcgc aaaatctgcc tgccgtcgcg cactggaatc 16920
tgagcatccg cctggagctt cagtcgcgga tggtgccaca gctgcttcag caggcgctga 16980
atgagctggt gaaactacac ccggcgctgc gcctgcgttt tgaacacgtc gacggcgtct 17040
ggcagcagca ttacagcgac gccgcaacta tcccgcttga actgctgccc gaatcgcatc 17100
agaaagcggc agacagagaa gccgggttac agtcgctgct caatctttcc acagggccgc 17160
tattacgcgc tgcgtatcgc gacgccggag aaacaaacca gcccgaactg gtgctgattg 17220
cgcatcatct catcatggat acctggtcgt tacgcatcct ggtagaggat ctggcttctc 17280
tttatagctc gctgcagagc ggcacgccgc tcagggtttt gcaggaaggc accagctatc 17340
gccagtggtc acagtggctc acacagcacg ccgcagactt tactgcacag acatcttact 17400
ggcgaaacat gctggacgcc gggacgccgc cagtggcaat gccccggaaa ggctgcgtgg 17460
gcgatcgtca ggttatcttt gccgagctgg acagagagac aagcgatctc ctgaccggtg 17520
atgcgcatca ggcttaccat agccgaggac aagagctgct gctgaccgcg ctggcgcagg 17580
catggcaccg ctggtgtggc aatacgcatc ttgccattga gctggaaacg cacggacgtg 17640
aagcctttca ggatgccgcg atggatctct ctcgcagcgt tggctggttt accgcgctgt 17700
ttccactgtg tattgcggcc ggcagcgact gggccaatac cgttgataac gtcaagcaaa 17760
ccctgcgtca tatcccttcc ggcggccacg gctacggcat tttacgctat ctgctaaaga 17820
cgccggacat ctgcaaactt acaccacctt ccatcagctt taactatctg ggcgatacgg 17880
ccatgtcagc ctcgtccggg atggctatcc agctctcccg gcgcgaagcc ggtcctggcc 17940
aggccgctta tcagctcctg cctcatgcct tgaacgtcac ggtaatgctt gttgcgggtc 18000
ggcttcggtt gtcattggct tatgccgaca cctccgccga taccgcgatg caaacattgc 18060
ttaaccacta tcagcacgcc ctgcacgatt tggccgaaca ttgtcgcctc gccgagccgg 18120
ttgaccttca gagcagcgat gtctccggag ttcagctcag tgactcagag ttgtctgcca 18180
ttttgagcga tttaaccgag gacgatcaat gaacaccagt gtgaaagcga aaatccggat 18240
tgaaagcgcc cataagctga cgcctttaca gatgggcgtg ctgtttcacg ccatgtatgc 18300
gcccgatagc gccgcctatt ttgaacagct gttttgccgg ctagatggcg atatcgatcc 18360
gcaacagttt gagcaggcgc tggcattact ggcccagcgg cacgccatta tgcgcaccgg 18420
catcgtgact aaaggccagc gtgatccgct tcaggtcgtg ctggaaaaag tcaccgtgcc 18480
tttaacggtc tacgactggc gcgatcgcag cggcgaagtg caggaagccg ccttccaaag 18540
gctgctggat gatgaccgac aggagggctt taaccttaat cggccgccac tcatgcgttt 18600
tattttagtg cagtttagcg aacgtgagtg gcggctggtc tggagccatc accacctgct 18660
gctggatggc tggtcggtac agctgctgct gaaagatttc ttccagctga tggcgggcaa 18720
cagaacagaa gctgcctcca ggccattctc tgattatctc gcctggctgg aggggcagtc 18780
gcaggaagct gcccgcgatt tctggcaaag cattctgggc gatttgcagg atccaacgcc 18840
gctgggtgtt gataagccga gcggcgcaaa agagaaagat tttgcggagc gccgtcatag 18900
cctgaaggta ccggcactgg ctaacgccgc ttccgcctgc aaggtcagcg ttggcacact 18960
gctgatggca ggctgggcgg tgctgctcgg ccattatgcg cgcagagatg atgtgacctt 19020
cggcgtaacc ttatcaggac gcgctatcga gctgcccggc gtcgataaca tcgtagggct 19080
gttaatcaat acgctgccgc tgcgtctgcg acctgagccg cagcgaaaac tggctgactg 19140
gctggcggaa gtacaggaag cgcagtttgc tctgcaacgc tactcctaca gcgcgctgtc 19200
agatattcag acctgtagcg gcgtaccaca ggggacatcc ctgtttgaaa gcctgctgat 19260
tatcgataac tttcccgttg gcgatctgcg cttaagtgag caactgcctt tcgacatgag 19320
cgggattgat atgtatgagc gcacccacta tccgctggcg ctgacgatgg ttccgaaaga 19380
gggtgaagtc tcattaaagc tcggttacga ccgcaataga attgacgacg taaccgctga 19440
aaaaattatc aaagattttg agttgttgct gaatgaaatc agcgacgggt cagagaatac 19500
tcttggcgca tgggccgggt gcctgggttc tgcgccatta actgaagttc atacgctcgg 19560
gcagcatgcc tggcacgaca ggcagactga gcgcttctgg cgcgactacc tgcacggtgt 19620
ggagaccagc ccggttggcg aagagcgttc atctgacggc gagcatcagc ggcagatcac 19680
cgaactctct tcagagctga cgcaacgcct gttcccgctg gcaacaagtc agcaggtgac 19740
tgtaaacgcg ctcgtgcaga gcgcctatgc ggtagcgctg gcccgcctga gcggacggcc 19800
ggaagcgctg tttggcgtga ccctgtctgc agcagaagac aggatggttt cacaaatctt 19860
tccgatgcgc gttgactgtg cacctggtgc taaagtcatc atgctttccg atcaggtgca 19920
ggttttacag gaagagattg agcggcacgc ccacgttcag cctgccgata tcctgggctg 19980
ggcaggcttt gccgccggtc agcctctgtt cgacagcgtg ctgatctgtg cagatctgca 20040
gacagatgaa gccagtctga gcgccgacgt aaccgaggtg ctcaactatc ctcactacgc 20100
cttcacgctg tacgtcaagc ggcgcggcac cgggctgacg ctggaggcgg tgttcgatcc 20160
ggcgcgggtc gacgcggcgc gcgccggcct gctgcttgag ggcacgtgcg gcatgctggc 20220
tcagctggcc gaaggcgcaa cccacgtcgg cgcgctgcgg ctgacccgcg gacggcagaa 20280
cgaaacagag gctcaggcca gcgagacggg ccttacggac gcgcgcctgc aggaggcgga 20340
cgcaggtctg ccggaactgt tccgccgcgc ggccgcgcac gccccggcgc agcgggcggt 20400
gtccggtgcc ggccgcgagc tgagctacgg ccagctgctg gcggagagcc gcaactttgc 20460
ccgccggctg gctgaaaacg gcgtgcggcc gggcatggcg gtggcggtgt gccttgaccg 20520
cggcgccgac atgctctgcg cgctgctggg cgtcatgtgg gccggggccg aatacgtccc 20580
ggttgacccg acgcacccgg ccgcgcggcg ggcgatgatt ctggaggacg ccgcgccgca 20640
gctggtggtg gtggatgccg ctaacgagca cgctttcacc ggccagccga cgctgcgcta 20700
cgtcagcgac tggcgaaagt ccgagggcga actgccgggc gacgcactgt ccccgctcgc 20760
gcctgcctat accatcttta cctccggcag caccggacgg cccaaaggtg tgcgggtcac 20820
ccacggcgcg ctcgccaata tcctgctgca cttccgcacc cgtccggggc tggacgcggc 20880
cgaccgcctg ctggcggtca ccacgctgag ctttgacatc gcggcgctgg agctgttttt 20940
accgctgagc tgcggcgcgg aagtggtgat agcgaccgcc gcgcaggcca ccggcggcgg 21000
gccgctggcg gagctgattg cgcatcacgg catcacggtc atgcaggcca cccccgccag 21060
ctggcgcatg ctgctggcgg cgggctggcg gccgccggag ggcttccgcg cctggtgcgg 21120
cggcgaggcg ctgccggccg agctggcgcg cgatctgctg gccagcggcg tgcagctgtg 21180
gaacctgtac ggcccgacgg aaaccaccat ctggtcggcg gaaaccgaag tcaccgagcc 21240
gctggcggtg cccctgccgg tgggcaggcc gatacgccgc accgcgctgt acgtgctgga 21300
cggagccgga cagcgcctgc ccgcgggcgt cagcggcgag ctggcgatag gcggcgcggg 21360
gctgagcacg ggctacctgc gtgacccggc gcgtaccgcg cgggccttcc ggcccgaccc 21420
ggccggcgcg gagccgggca gccgtcttta cctcaccggc gacctggcgc gcgagcgcgc 21480
cgacggccgc atcgaggtgc tggggcggct ggaccaccag attaagctca acggcttccg 21540
catagagctg ggcgaaatcg acgcggcgct gcgcgcgctg ccgggcgtgc gtgacgcggc 21600
ggcggccatt caccgcacgc cttcgggcgg ccagctggcg ggctatctgg tggcggcaga 21660
ggacgcgccg gcggacgcgg cctggctgga ggcgctgacc ggagcgctgc cgcgctacat 21720
gctgcccacg gcgctggtgc ggatgcccgc gctgccgctg acggccaacg gcaagatcga 21780
tcgtaaagca ttaccgcagc cgcagatccg taatacatct tatgtgtcgc cacgcacacc 21840
ggaacagaaa acgttagccg caatctggca ggaagtgctg ggcgtggagc aggtaggcat 21900
caccgacaac tatttctctc tgggcggcaa ctcgatcctc agtatacgcg tcgtgacgca 21960
agccgccgcg cagggcatcc gcctgaacat tgaagatctg ttccagaagc tcaccatcga 22020
acgcctaact gaaagtaaca gcactcccgt tcaggcagcc gaagcgcccc acattgacgc 22080
ctttgcgttg ctgacggaag aagatcgtcg cgctgttcct gaaggggcgg tggacggcta 22140
tccgctcagc gagctacagg ctggtatgct gttccataac ggggctgatg aaaccaaccg 22200
gctctatcac aacgtggtca gctacctgct ggataacccg gcaatggaca cgggcctggt 22260
gcgccagcgg ttaaacaaac tgatagccct gcatccggtg ctgcgtacag gtttttcact 22320
ggcgggctat agtcggccac tgcaatgggt ttatgcgcag gcagaacctt taatcgaaga 22380
agaggatttg cggaaagcca gtgaagtggc tcagcatacc ctgattggcc gtgcgcagca 22440
gcgcctgcgc gaagagagat tcgatctggc taagccaccg ttgctgcgga tgctgttcca 22500
gcgccttgac gacagccgct ggcaggtcac cgttgccctg catcacgtca ttctggatgg 22560
ctggagcctg gcttcgctgc tgactggctt gctgcaggat gaaacagcag aatcgacggc 22620
agaaccgcag cacatctttc gcgactttat tcatctagag cagcaggcgc tgcacagcac 22680
gcgtgaccat acgttctggc agaagcaact aaaagatctg ccggttacga ccttaccacg 22740
ctggccattt accgataaga acgcagagtc cgcgcaggcc agctacgaaa ccgccttgcc 22800
gccagcttta tatcagggcc tggcagcgct ggcgaaagaa aaaggcatgc cgctaaaaag 22860
cgtactgctg gctatccata tgcgcgtact ggcccactgg agtggcgaat gtgaagtcgt 22920
gacgggcctg gtgaccaatg gccggcctga aagcgcaggc agcgccgatg cgctgggatt 22980
attccttaac acgttgccta tgcgcatcaa taccggcgga ttgacaggca acgaactgct 23040
ggaagctgtt cgccaggccg aaagtgctca gttgccgcac cgtcgttttg ccatgaacga 23100
gctgcgccgc atgctgcaaa acagaacgct gtttgaaacc accttcaatt ttgtcgattt 23160
tcacgtctac aacgatgcag ccaccagcgg gggcgacaga tttgatccag taaaaatcct 23220
taacgccgca ggtagtcagg cgcttgatat tccactggca accagcttca gcgtcgatcg 23280
tcaacagggc accctgcagc tgattttaac ctgtgacggt acgcggttcc cggctgcgca 23340
ggtggaggct atgagtgcca gctacctgcg tgcggctgaa accctgctca atgtcacaga 23400
agaggtgtgt gacagcatgt cgcttatttc ggcagaagag cgggacgaaa tggcacagcg 23460
atctttcggt gccagctctg tatcgcagcc ggtgctacag gcatttcaga ctatggtaga 23520
gcgtcatccg caggcgccag cagtagtgag cgccgatggt gaaatggact atgcaacgct 23580
ggatcgacgc gccagtgagc tggccgcgca gatgcagcgt gccgggctac gtccggatgt 23640
gccggtagcc ttgctatttg aacgctcgcc cgatctggtt gtcgccatgc tggccgccat 23700
gaagaccgcc tgtccttacg tgccactggc accttatctg ccgcagggaa ggctggcgga 23760
aattctggcg gacgtcagac cacaggcaac gttaaccgtg caggcgctgc aacatattct 23820
gccggaagca agcgatgccg gatatatttt tgccctggac gcattgccgg aaacccttta 23880
cccgctgccg gaactgccgc aggcccatcc ggctaccctg gcctacatat tgttcacatc 23940
aggatcgacc gggaaaccaa aagggatcgg tattccgaca ggtgcgctgg cgaaccacat 24000
ggcgtggatg cagcgccgtt tcccgctcac atcggccgac cgcgtactgc aaaaaacgcc 24060
ggttggtttt gatgcttccg tctgggagtt ctgggcacca ttaatggcgg gcgcaaccct 24120
ggttttacct gccgacggtg tggaaaacga cgctatcgcg atgcttgaag tggttcagcg 24180
acatgcgata accgtgctgc aactggtgcc gggcgtgctg gatatgttga ccagattgcc 24240
ggaactgact gcatgtacct cattgcgacg cgtgtttgtc ggcggtgagg cgctccaggc 24300
ctcgaccatt gaacgcttta acagcgtgct cggcgtaccg ctgatcaatc tttatggtcc 24360
gaccgagacc accatcgata ccacttttgc ctgttactgt ggcgatgtgg gtgaagtggt 24420
gagcattggc gagccgatcg atggcgtctc cgtttatgtg cttgaccagc gcatgcagcc 24480
cgctggcgtc ggcatttatg gcgagttgtg gattggcggg gctggcttag cccgagggta 24540
ctggaatcgc gccacggaaa cagcagcggg tttccgcccg gatccttttt cagtgcagcc 24600
aggtgaacgc atgttccgca cccgcgacgt agtgcgctgg ctaccagggg gcggtctgca 24660
gtacgccgga cgcagcgaca gccagatcaa actgcgcggc aaccgcattg agctggccga 24720
tattgaagcc gtgctgtcgc gccagcctgg cgtgacacgc agtgccgtgc gcgtttgcgc 24780
agaaaagccg ggccagctgg tggcctgggt aatgggcccg gctgcactgg aagccgcgcc 24840
gcttattgcc gcattacgta accatctgcc ggactatatg ctgccgcagc gcatcattgc 24900
cgttaactcg tggccgctga caccaaacgg caaaactgac cacgcagcgc tagcgaagtt 24960
tgccgccatc acggagcccg cgtctgcggt tgtgccacca gaaagtgaga tcgaaagcga 25020
actggtggcg atatggcaga aactattgcc gcaattgacc ttaggcatca cagataactt 25080
ttttgaagtg ggcggcgact cgattcttgc tatgcaaatt gctgccgaga tgcgccgtaa 25140
aggctggtcg ataacgccgc gccacctgtt cgagcatcca actattcgcg agctggcggc 25200
ggtcattatt ccttcgcaca acgagaaaca gccggactat gtcgcgcctg ttggcccgct 25260
gccactgtca ccggtgcagc gctggttctt tgagctggag ctgtctgacc ggaaccactg 25320
gaaccaggcg gtgatgcttc gcgtgccgca gcatattcag ccgcaccgcc tgcataaaac 25380
gctcgaacgc ctggtcagtc tgcatgaggc gttccgcctt cgcttcctgc aaaaagaggc 25440
gagctggttt gcccggctgg aggagaacgc cggcgactgg tacagcagcc tgaacgtttc 25500
tgacctctca gccgttgaat accgggaggt tacagatacc cttgtcgaaa ccacccagcg 25560
cagcctcaac ctggagcagg gaccgctgtt taaagcggtg caccttgata aggggctgga 25620
agtagaaggc cggctgttgc tggttatcca ccatctgatt gtcgatggcg tctcgtggcg 25680
catcctgctg aacgaaatca acctgctcct taacggcgtt gaactggctg tgcccgcgcc 25740
gggctttggc ggctggctgg cattgcagga caagtatgag atgccgaaag cggtcctcga 25800
ctactggctg gggcaggcga caaagtcatc ggaatcattc cgtgcgccat cctttattca 25860
gccacagcac agcggccact actcgcaggt cagaacaatt gaaaaatcct tcggaaaccc 25920
gatcgcgcag cggctgatag atcattcaca gctacacctt aaggcgaggc ctctggagct 25980
gctgcttacg tcggtacttt gcgccatggg tcgctgggcg cacgaagatc gtatcgcgct 26040
gacgctcgaa gggcatggcc gcgatagcac tggtgactgg acgctggaac gcacaccggg 26100
ctggtttacc gtgctttatc cggtgatgtt cgatttgaaa gatacggaca gcgaaatgac 26160
cgtactacag acggtcaaga agacgctgcg ggaaattcca gacggcggct acggctacgg 26220
gcagctgcgc gatggcgaac cactgccgcc agtctcattt aactatctgg gccagtttga 26280
agagagtaat gagcgcggac tgaccgtggt tgacgaggcc gttggcgata atgaagaccc 26340
ccatggtaaa cgtccgttcc cgctggagat cgtggctttt atccgggccg gaaaattaac 26400
cctgcgctgt gtgtttgacg accggattcc ggaagccgca aatatcacgg caatgctcga 26460
ttccgccgct gactggctgc aaaaaatgct cgcctgtgaa gatgtctctg ctgcctggac 26520
actgcatgat ttcccgctgg ctgacgtgga agaacgtggc ctggcgatag cgctgggcga 26580
tgcgggcgat aatcttgctg acctgtggaa gactacgcca acccagcagg gcatgctgtt 26640
tcacagccgc ttagaaaacg acgcgagcga agtctacctt gagcaaatcg tgatgcggct 26700
gcatgaggag atggataccg atctgctggc gcaggcgtgg aacatggtga tcaatcgcca 26760
tgatgccttg cgcgtttctt ttgtctggga agatctcgac catccacagc agcgggtgtg 26820
gcgcagcgtt caggttcctt ttgagacggt tgatcttaag ggcgatgccg ctgagcttga 26880
ggcattcatg acggcagatc ggcagcgcgg tatcgattta agcgtcgcgc cgatgatgcg 26940
ggtcagcctg ctcaggaaac agggcaagcc gtggcgtctg gtctggctgc atcaccacgc 27000
gctgctggac ggctggtcga tggcgctgat ctttaacgat ctggccgagt gttatcacgc 27060
gctgaagctg aatcagaact ggccaacaaa tactgcgcct tcttacgcta cctatcttcg 27120
ctggctgaaa cagcagagtg cgacgcagga gtcagcagag cggttctggc gcgattactt 27180
ccagggcctg gagctggcca gcccggttgg cgaagagagt acgcgtaccg gtattcacca 27240
gcgccttacc aataagcttt cgccagcgct gacgcagcgt ctgtcgcagc tggcgtccag 27300
ccagcaggtg acggtcaaca cgctggttca gagcgcctat gcggtagcgc tggcccgcct 27360
gagcggacgg ccggaggcgc tgtttggcgt gaccctgtcc ggtcgtccgg cagagctggc 27420
acagtcggaa aatatagtgg gcctgtttat ccagactctg ccgatgcgcg ttaactgcgc 27480
accgggtacc gacattgcta cgctggccgg tagagtgcag actttgcagg gagaaattga 27540
gcgacacgcc cacgttcagc ctgccgatat ccagcgctgg tcgggctttg ccgccggtca 27600
gcctctgttc gacagcgtgc tgatttatga aaactacccg ctgggacagg ggctggtgga 27660
tgcatctgac agcctgaacg cggatgtaac cgaggtgctc gaccatcctc actatgcgtt 27720
ttcgttgtac gtcaagccac gcggcgccgg gctgacgctg gaggcggtgt tcgatccggc 27780
gcgggttgac gcggcgcgcg ccggcctgct gcttgagggc acgtgcggca tgctggctca 27840
gctggccgaa ggcgcaaccc acgtcggcgc gctgcggctg acccgcggac ggcagaacga 27900
aacagaggct caggccagcg agacgggcct tacggacgcg cgcctgcagg aggcggacgc 27960
aggtctgccg gaactgttcc gccgcgcggc cgcgcacgcc ccggcgcagc gggcggtgtc 28020
cggtgccggc cgcgagctga gctacggcca gctgctggcg gagagccgca actttgcccg 28080
ccggctggct gaaaacggcg tgcggccggg catggcggtg gcggtgtgcc ttgaccgcgg 28140
cgccgacatg ctctgcgcgc tgctgggcgt catgtgggcc ggggccgaat acgtcccggt 28200
tgacccgacg cacccggccg cgcggcgggc gatgattctg gaggacgccg cgccgcagct 28260
ggtggtggtg gatgccgcta acgagcacgc tttcaccggc cagccgacgc tgcgctacgt 28320
cagcgactgg cgaaagtccg agggcgaact gccgggcgac gcactgtccc cgctcgcgcc 28380
tgcctatacc atctttacct ccggcagcac cggacggccc aaaggtgtgc gggtcaccca 28440
cggcgcgctc gccaatatcc tgctgcactt ccgcacccgt ccggggctgg acgcggccga 28500
ccgcctgctg gcggtcacca cgctgagctt tgacatcgcg gcgctggagc tgtttttacc 28560
gctgagctgc ggcgcggaag tggtgatagc gaccgccgcg caggccaccg gcggcgggcc 28620
gctggcggag ctgattgcgc atcacggcat cacggtcatg caggccaccc ccgccagctg 28680
gcgcatgctg ctggcggcgg gctggcggcc gccggagggc ttccgcgcct ggtgcggcgg 28740
cgaggcgctg ccggccgagc tggcgcgcga tctgctggcc agcggcgtgc agctgtggaa 28800
cctgtacggc ccgacggaaa ccaccatctg gtcggcggaa accgaagtca ccgagccgct 28860
ggcggtgccc ctgccggtgg gcgggccgat acgccgcacg gcgctgtacg tgctggacgg 28920
agccggacag cgcctgcccg cgggcgtcag cggcgagctg gcgataggcg gcgcggggct 28980
gagcacgggc tacctgcgtg acccggcgcg taccgcgcgg gccttccggc ccgacccggc 29040
cggcgcggag ccgggcagcc gtctttacct caccggcgac ctggcgcgcg agcgcgccga 29100
cggccgcatc gaggtgctgg ggcggctgga ccaccagatt aagctcaacg gcttccgcat 29160
agagctgggc gaaatcgacg cggcgctgcg cgcgctgccg ggcgtgcgtg acgcggcggc 29220
ggccatccac cgcacgcctt cgggcggcca gctgacgggc tatctggtgg cggcagagga 29280
cgcgccggcg gacgcggcct ggctggaggc gctggccgga gcgctgccgc gctacatgct 29340
gcccacggcg ctggtgcgga tgcccgcgct gccgctgacg gccaacggca agatcgatcg 29400
taaagcgtta gccgagatcg aagtcactga aagaaacgcc agtttcctgc cgcccaatgg 29460
tccggttgaa actgcggtat gcgctatctg gcaaaccgtc ttctcccttg agcaggttgg 29520
tgttgaggac gatttctacg cgctgggcgg ccattcatta atggctaccc agatccatac 29580
acgcctcgtg cgcatcttcc gcatctcccc accgctgggc gaggtattca gagcgaccac 29640
cccacgagag ctgaccgccg tcatttacgc ccattccgat aaagggcggg ctacccaaat 29700
ggcggaagct tatttgcgtc tgcgcgcgat gacgcccgaa cagcgccagg cattacgtaa 29760
cgaaggatca cttatcacag gaggttcggc atgaactggg aaaaatctgt tgccatcgtc 29820
accggcgcag gagggggcat cggcgggact tttgttcgtc agttgctgag cggcggctgt 29880
cgggttgtag ctattgacaa gcagagcgac aggctggagg aactcgcggt agcgtgccag 29940
gcatggcgcg acgcgctggc catccggccc gtcgatatca ccaacgaagc ggagattcgt 30000
gcaactttta ccgacctgag tatgcacttt ggtgtaccgg aaattctggt caataacgct 30060
ggagtactga gggatggcct gctgataaag aaggaagcgg acagctacgt acgcaagctg 30120
ccaacggcgc agtggcgagg cgtactggaa gctaacctga ccggaacgta cctgatgagc 30180
cgggaatttg cagctatacg gtcgcagcaa gcgggcgagg gggtcatcgt caatatctct 30240
tccgtgacca gtgccggtaa tcccggtcag tccgcctacg cggcttccaa agctgggatg 30300
gatgcgctga cccgcacctg ggcgctggag ctggccgaca gccatatccg cgtggtcggc 30360
atcgcgcccg gtttgactga cacaccgatg gcccgtgcgc tgccggagac cgagctgaac 30420
gatatgctca aaaatattcc gctggaacgc atggctacgc cgctggagat ctggcagggc 30480
ctgcgtttcg cgctggaatg cgattacttc aatggtcgta ttctgaccat tgatggcggt 30540
gcaggattct gctgattatc agcaactgac ttaacacaac cctaacttac ccgaggtcgt 30600
tatgagtgcg attgaaaaca atgctgtgac ttactttgtt gtgatgaacc atgaagagca 30660
gtactccatc tggccgacct accgcgacat cccggcaggc tggcagcagg tcggcgaacc 30720
cgccagcgag caggagtgcc tggcccacat tgagaaagtc tggacagaca tgcgaccact 30780
gagcttacgc aaagccatgg aagataatcg ttaaattcag gatgcaggcg ggagagtgaa 30840
catgaacatg tacaggctac tttctgaaag accgcagggt gcaccccggc tgtcaatgtt 30900
actggcagcg caatcgctgg ccgggctggc cggtgcggga ctggttgcaa ttcttactca 30960
ggcagcgcat gccgttgaac agcaggggaa ggcgctttct ctggtcgcgc tgactgcctt 31020
aacgttgttc atgttccttt tcagccagcg ttacgctatg cgctgcacgg ccctgcgcgt 31080
ggagcgttct atccacaacg tgcgcgtgcg cgttgtggat aaactgacca ggattgattt 31140
gcagacttat gagcaaattg gcgaaaagaa cctgatggca tgtgtggaaa aagacatcaa 31200
aaccatgtcc aacgcctgca cggcaattat cgcttccggc cagtcggtga tgctgtttgt 31260
ctgcgcggca gcttatctgg cctggctttc gttaccggca tttcttctca ccgcaggggt 31320
tatcgtgctt ggcgttgcgc tgaattttat ccgcatgcgg gctattttca acgccacgga 31380
acaggcgcta cagtcggaaa acagcctgtc tgggttaacc agccacatta tcagaggatt 31440
caaggaactt aagctccatc agaaacgccg tcgggaagta tatgaggaac tggtggaggc 31500
gtccgatcaa accgccagcc tcaatcaaaa agcgtttggt ctggcgaccg accatatgat 31560
tatgctgcag tcgattctct atattctgat tggcctggtg atctttgtgc tgcccatggc 31620
tggccagatg cagacgcttt tacaggttca ggtcattgcg gtgatcctct ttctcaatgg 31680
tccgctgagc cagtttatcg gcattttacc gatgtacgct caggccaatg cagcagcaaa 31740
aagcatcggc gagctggagc agcagctgga tgccgccgcc aaccgcgatc ccgatttgcc 31800
ggatgctgtc atcgagccaa tgcgcagtat tgaactgaaa gatgtccgct ttgcctacga 31860
ggcgaacgag ggcccggcct ttgaaatcgc tccccttaac ttgctgattt ttcagggtga 31920
ggtcattttt gttactggcg gaaatggctc aggaaaatcg actttcctga aattgctgac 31980
cggtctgcgc tttgccagcc acggcgatgt gctgctgaac ggcgagcgag tcaacaaacc 32040
ggaaaaagta gcgggttatc gcgggttatt ctcggcgata ttcgctgact atcatctttt 32100
taccaaactc tacggcacgg aagtcccgca ccgttcactg ataagcgagc agcttagcag 32160
gctcgccctt gacggaaagg tgcgcctgga cggccgcatt ttcaccccgc tgaatctctc 32220
taccggccag cgtaagcgac tcgctcacct ggtcacgttg ctcgaagatc ggcagatcta 32280
catctttgat gagtgggccg ccgatcagga tccccatttt cgcagctggt tttaccgcga 32340
agagctaccc aggctgaaag ccctgggcaa gaccatcatc gccgttaccc acgatgagca 32400
gtacttcgaa catgccgacc gctggttcca ttttgaggaa ggccgttgcg aagagcgctt 32460
ttttaaatca gccgtacctg tgcgcctttt ccccgaacat gcctgattaa tgttggcccc 32520
caaggagtag cccgatgagt gttagtggta atctggacca gaacgtcggt tttgacgaag 32580
accttgatct ccttgatgca ttgttagctg aggatctgct ggagcagcaa ccggctatcg 32640
cggcccaggc ggtgaataaa gggccgctat ccttccagca ggagcgcctg tggtttctca 32700
gcgaattaga tcctgacgcc gccgcctata cgatttttaa cgccttccgc ctgcacggcc 32760
agcttaatga acaggcgctg tgcgcggcgc tggaaaccct ggttgagcgc cacgaagctt 32820
tacgcactgc cattgacaac cagaacggcc aggcagaaca gcgcattatg ccgggttata 32880
tgcccgtaca aaaaagcgtt gatttaaccc attccgcaga gaaggacgtg gacgaagcgc 32940
tgcataagct gctgcgcagc gaagcggcgc gtccctttgt gctcaccgac ggtaaacctt 33000
tccgtgctgt gctggcgaag ctggggtcgg atgaacacgc gctgatgctc agccttcacc 33060
acatcatcag cgatgcgtgg tcaatgaccg tgctgatgag cgagctggcc gttctgtatc 33120
acgcttatgc tcgcaatgaa cggccaattc tgcctcacca gccggtgcgc tatctcgatt 33180
atgctctctg gcaacgagga aatggctctg ctcaggagcg tgaaaacaag gagatgaatt 33240
actggctaag tgagctgcaa gacctgccgt tgctggagct gccttgcgat ctccctcgac 33300
cgcacaaaca gacatttaac ggcgcaacaa tcagctttca ggtgcctgat gccacgaccc 33360
gtgctttgca gatgctggcc cacggagagc gctctacgct gttcagttta atgatggctg 33420
cgctgcacgt actgatggga cgtcatgcgc ggcagaccga tattgctatc ggcacctcaa 33480
ttgcgggacg cgataaccct gaactggaag ggctgattgg cttctttgcc aatatggtgg 33540
ttatcagggc caggctggaa agcgacccaa gcttccgcga actgttgcgc accaccaccg 33600
gaaaagttca cgccgcaatg gaacacggta cgctggctta tgatcggctt gtcgagggga 33660
tgaaaatcgc ccgcgacccc agccgcaacc cgcttttcca gattgctatg accatgctga 33720
acctgcccgc cacgcgcatg tcgttaggca cccttgaagc cgaaaggttg ctcagccagg 33780
aagctgcccg tttcgatctt gagctgtttt taagcgagtc ggacggtacg ttgtcaggca 33840
cgtttgtcta taacactgat ctgtttctgc cagcgtctgt caaccgcctg actgagcagt 33900
ggctgatact gttagccgat attgccgttt cgccagacaa gccagtatcg cggctggccc 33960
tggtaaaaga gcaagcgcct ttattgccgc ttcctcttct cgctgagcca ctaccgttca 34020
gaccgctaca tgagaagatt ttactgcacg ccgagatgta tcctgatcgg cgtgcgctgc 34080
ggcttggcga agagagtctt tcttacggcg agctagccgc acaggcacgt cgtatcgccc 34140
acgctttgct ggctgccggg ataaaagcgg aagtgccggt agggctatgg tttgagccgg 34200
gttttgacat gattgccgca atgctcggga cgtggatggc cggtggcgcc tatctgcccg 34260
tcgatcttca ttctccggcc gagcgtatca ccaccatcct ggaagacagc caggtaaaat 34320
tcattctgtc agatacggcc agtgtggcct cgctgcccgt ctttgtcggt acagtgcttt 34380
gcattgatga aactgacgag ccgccagcgg gtgaacttcc gcaggtcagc gcccaccagc 34440
ttgcctacat tatctataca tctggttcta cgggacggcc caagggcgtg gagatcaccc 34500
acgctaatgt ggcgaggctg ttcaccgtct gcgacagcct tttcgagttt gaccgaaatg 34560
acgtctggac ctttttccac tcttacgcct ttgacttctc ggtatgggaa atctggggtg 34620
cgctggtcca cggcgcctct ttgctaatcg taccgcccat tgtggcgcgg acgactgaca 34680
gtttctacga cctgctgtgc gaaaagaaag tcaccgtact aagccagaca ccctctgctt 34740
ttcgccagct gatggctgcc gaagaggcta atccgcgtga gggcgatctg gctttgcgct 34800
acgtcgtctt tggcggtgag gcgctggata ttgccagtct cgcttcctgg atggacaggc 34860
acggcgacga ggagccccgg ctggttaata tgtatggcat caccgaaatc accgtacatg 34920
ccacgttccg cctgattacc tggcgggatt tatcgcgcgc gtccagcagt gtaatcggca 34980
cacctctgcc cgatttatgt cttcgtcttc tcgatcctca tggcgaaccc gtaccgcaag 35040
gcatggtcgg agaaattttt gtcggtggcg caggtgtagc gcgcggctat cgttatcagc 35100
cagaactgac cgctgcgcgc tttcagcatg atgcaagcgg catgcctttt tatcgtagcg 35160
gcgacctggc gcgcattaac gtttggggtg aaatggaata ccgggggcga gccgattccc 35220
agatcaaact gcgcggctat cgtatcgaga ccggcgagat cgagaatacc ttacgacgtc 35280
atcccgcgat tgacgatgct gtggtggtgg tcagagggca gcaggaagca gcacgactgg 35340
tggcctacgt gcgtaagcgc cagacatatc tgccggaaag cggcgcgtct gccgaagact 35400
ggcgaccaag cttcgatatg atttatgccg cagaggttga ggatgatgag ctggatgtgg 35460
tcggctggaa cgattcttat gataacaagc cgctgccgct ggaagagatg cgcctgtggc 35520
gggatgaaat tctgcaacgc ctgcgtgcgc tcgcgccgac ccgcattctg gaaataggca 35580
ccggttcagg catgctgctg ttgccgctag ctcaggaagt cggacgctat cagggactgg 35640
acttttctgc cgaggccgtt gcccgactct ccagaaaggt ggcgcagcgt ggcttaacac 35700
atgtgcagct ggagcagcgt gaggcccgcg acctttcagg attgggggaa aattttgacc 35760
tggtgatcct taactccgtc gcgcagtact ttcccgatgc ccgctacttt attgatgtta 35820
tggaacaggc gatggatcgg ctgcacaccg atggccgtct gtttattggc gacctgcgcc 35880
atctgggtct gctgcggcat tttcatgcca gccgcctggt gcatcgtcgg ccagccggtg 35940
ctgaccggac cagcctgctg tcgcagctgg aaaaaatggt ggaggaggag aaggagttgc 36000
tggtcgatcc cgactttttc ttccactggg cctcgcagcg caacgatatc gcaaatatcg 36060
acgttttacc gaaggtcagc ggcggccaga acgaactgac cacctatcgt tacgacgtcg 36120
tgattgtcaa aggcgacccg caaacctttg cgcctgttgc acgtatggaa gctgcaagcg 36180
tcgaaacgtc ctggaaagga cagcctgcgt taatctgtaa tattcctaac tcacggctgg 36240
cctgtgttga agcgtttctg aactggctgg ctgacgacgc cacaacagtg ccaaccgcgc 36300
aggagtggga agcctggtca ggcacccaat ccggctccga tcccgctatg ctggttgata 36360
tctggcaggc acgtgcgggt gcggcgaagc tatgctgggc cagtcagggc cagcctgggc 36420
agtttgatct ggccgtggct acccataccg aagcgctgcc ttcttttaca cctgtcatca 36480
gtcgtaacgc ggagcttacc cgcttcttta acattcccgt acagctgagg gagggagacg 36540
cgctggcaac gacgctgcgt agctaccttt ccgcttacct tcccgactat atgctgccgg 36600
cggtttacgt gccgctggac gtttttccac tgaccataaa cggcaagctc gactttgctg 36660
cgctgccgga aacgggtcag gaaataaaag aagccgccgc agaccagaat cagcagttaa 36720
gcgagaccga atggaaggta gcagatatct gggccgaagt attacagctg gcccggccat 36780
cactgcatgc aaacttcttc gaaacaggcg gccattcgct gctggcaacg caggtgattt 36840
cgcgccttaa tgcggcgttt tcagtgaagt tgccgttaag aagcctgttc gatcggccaa 36900
ctatcgccgg gctggcgtcc ctgctggatg atttgcagaa gaaaactgaa tctgctccgc 36960
cgcagcctgc ggccatcagg gcggttccgc gcgaaggcct gctgccatta gcttataccc 37020
agcagcgttt ctggtttatg gagcagatag atcaggggcc tgtcggctca tataatattt 37080
cgctggcact gaggctacgc gggcagttag tgcctgtggc cctgcatacg gctatccaga 37140
ctatcgtgag gcgtcacgaa gcattgcgca ccgtctttat tcagcacgat ggtcagcctg 37200
cacagctgat taaattagag tgggcaccag caattgagga aacggatttc agccatcttt 37260
cccgcgccga ggccgaaacg gcgctgaggg atttgctctc cgtgcaggcc aatacgcgct 37320
tcagccttga tgtcgcaccg ccattacgcc tgaacctggt acggataggg gagcaggaac 37380
acgtgctgca gcttaccctg catcatgcta tttgcgatgg ctggtcgctg ggcgtgatgg 37440
tgcgtgaatt cagcgaatgc tactccgcct gcgtggccgg acgagcccca cagctggcgg 37500
cgctgcccgt acagctggct gactttgccg tctggcagcg gtcagagatg gcaggcacgc 37560
gtttgcagtc tattttgcag cagtggaagc agcgtttgca gggcgtgcct tatgacctgg 37620
cactgccttt tgaaagagca ccgcatgccg acaccccaca gatgggcaaa atcatctact 37680
tcaattttga cgcggtgcag ctcggtcagc taaaacgctt tgctgaaaca aacggcgcga 37740
ccctgtttat ggtactgacg acaggttatg cagccctttt agggcgttac agcggcgtgg 37800
atgatgtggt catcggcacg ccaatcgctc agcgtcagca gaaggagctg gaaggaattg 37860
tcggctgctt cctgaatacg ctcgcattgc gtattcaggg cgaggccgga ttaagcggac 37920
aggcgctgct ggcgcacgta cgtgagcgcg tgctggaagc ttatgaatgg caggatgcgc 37980
catttgacgc ggtagtcagc gagcttagcc cggaacgctc acgcgatcgc catgcgctgt 38040
tccagactat gctgaccctg caaaacatgc cgctgggcaa tttcacacta ccggggctgg 38100
aggctgaacc tcttcagggc caggaaggaa ttgcaggctt cgatcttagt ctgaccttca 38160
ttgaaatggc agacgctagc ggccaggatg gcttgcaggg gatgctcgaa tacgatgcga 38220
ataaatattt atatgcatct gtagagcatt tcgccagcca gctgaaaacg ctgctgctgg 38280
cgatggcggc aaggccagag atgccggtca acaggctgga tcttctggca gcggacgagc 38340
gcaaacggct actggaaacg cttaacgata ccagtcatca gatcccgcag ctgtgcctgc 38400
atgaacttat cgccgggcag gcgtcccgta cgccggacag catagccatt cgggatgcgt 38460
ctggcgaaat cagctatgcg gagctggagg cgcgagctaa tgcagtagcc tgcgcactgc 38520
atgagcaggg cgtcggacct gacactatcg tgggcctctg caccgagcgc gatcgcggaa 38580
tggttatcgg gctgctgggg attatgaagg ccggcgccgc ctatctgccg ctggatccgg 38640
cttatccgat cgagcgtctg gatttaattc tggccgatgc ccaaccgccg gtgctggtaa 38700
cccagaccgc gctgacgtcg actaccaatt ttagcgggcc gaaaattttg ctggaagaac 38760
tgagtcagag cagtagctgt ccggccagcg acgcgacgct cgcgaacctt gcttatatca 38820
tctacacctc tggctctaca ggcgtgccga aaggcgtcat gattacccat ggtgctatcg 38880
ttaactatct gagctgggcg cagggtaact atatttcagg cagtcagggc agcgtattaa 38940
tgacgccttc ttacgccttt gatggcagta tgacgacgct gtttacgccg ctgattagcg 39000
ggcgctgtat gcaactgatg ctgcgtgatg acgtgctttc ccgtatcaga aattcgctgc 39060
tggagagcag ggagccgctg gcgctgattg actgcggccc ggcacagctg gaagtattgc 39120
agcacgtgct ggaacctgaa cagctggccg ccagccaggt cggggctatc gtgatcggcg 39180
gcgaggctct gcatgctgct accgttgaac agtggcgtcg gcatgcgcct gccacccgcc 39240
tgtacaatga atatggccca accgaagcaa cagtaggctg ctgtaattat cacatcaccg 39300
ccgatacccc ctggttcggg ccggtgccga tcggtcgcgg catctggaac gtcagggttt 39360
atgttttgga taaatacctc cagccgttgc ccgtgggaat gcctggcgat ctctatgtgg 39420
cgggtgaggg gctggcgcgt ggttatgcag ggaaaccggc gttaaccgcg caaagtttta 39480
ttccggatcc attctcggaa ggcgggcgtc tctatcgcac cggcgaccgc gcctgttggg 39540
gaacagggag cgttattcac tatctgggcc gcagcgataa tcaggttaaa ttccgtggct 39600
tccgcattga gcctggcgag attgaggaaa aaatccgtct ctatccgggc gtgtctgaag 39660
cggcagtgaa agtgcatacg gatgaacagg gtataagccg tctggtcgcc tggctggcgg 39720
gcgaaattca cgacggcctt gacgcctggc tgcgcgaaag cctgcctggc ttcatggtgc 39780
cttcgcacta tgtactacta ccgatgctgc ctatctccgt tagtgggaaa gtggatcgta 39840
acgccttgtc attgcctgaa attacccacc agcctctgac acacagcgaa agcagagcgc 39900
tgaacgcaac cgagcagcga ctggctgcga tctggcagga agtgattggc catccggtca 39960
gcgagccgca ggccaacttc tttgaagccg ggggcgattc actgcgggcg gtgaagctga 40020
ttttcctgat tgagcgtgag ttcaaacgcg tattgccgct ggcgagcctt ttcgggctac 40080
acacgctcga ggcgcaggcg gcggcgctaa ccgctgaaag taatgcggcc accgacgcgc 40140
tggtgccgat ccacgtacgt gaaaatgcgc ccagcgtggt gctggtgcat gacatctcgg 40200
gccagatact ttcctaccgt tcactggccg aagagcttac agcatttggc gtctacgcta 40260
tccaggcgct ggccggacag catacccgcg caccttcggt tgccgatatg gcagagctgt 40320
atgctcgcgc cattatggaa gccagaatac ctggccccct gatactggtt ggccactcct 40380
ttggcgcaca ggtggctacc gagctgtcgc ggaagctgac tactctgggt aaaaagccgt 40440
tactgctggc cattcttgac ggcatagcgg agccggacag ggagtctttg caacagcttc 40500
cgcgcgacga tctggatctg atggattaca tgatccgcac cattgagctt tccatggata 40560
aacgcattga tgtggatgcc gctcgcctgc gcgcgctgcc tgaaagcgag cgtgcaagct 40620
ggattaccgc cagcatcacc agggcgggcg tcgtacctga acacacctca ccggagcatg 40680
ttatgcagct gtttactatc tacaaaaaca atcttgaaag cctgcatggc tatcagccgg 40740
gtcgggtgac gtgcccggtg acgttatggg ccaccgaagc gcttggtcag caggaagacg 40800
ccggctgggg gaaatatgcc gaccgggtta cggtttatca ggccagcggc gatcacgtca 40860
gcatgcttaa acccccacac gtacaggaac tggctgcgag cctgactaaa gccattaatg 40920
acgagatgcg ataatgagta ttccacgtat agttcatcag atttggtatc agggcgaaaa 40980
ccaggttccg gataaatacc ggcgttaccg cgaaacctgg caacagtatc atccggactg 41040
gcagtgcatg ctgtgggatg cgcatactct gcgtgaacac gtagccagcc actggccaca 41100
gtttttgccc atttacgatg cttatccgca ggacgtacag cgcatggata gcgctcgcta 41160
ctgcctgctg gcaacgcagg gcggccttta tgccgatctg gatatcgaat gtttacggcc 41220
tgttgacgag ctgctgaccg gccatgaact tattctttcc caaactgtgg gttacaacat 41280
tgccttcatc gccagcgccg ccgctcaccc cctgtgggaa acagtattga atcatttaac 41340
caataaaata agcgccgatt taagcgacgt gccttctttt atgcgggaaa acgtggcgat 41400
gcaaatcgcg gtggtgtcgg ggccgcgttt cttcacgcta tgcgttgaag aaagcggtgt 41460
actggcttta ccgggaacac tggcctgccc gggggaatat tttgagagca cggccacgcc 41520
cggttatgtc catgacaagc aaaaagactg gatcccttac gggcggcacg atatggattt 41580
gaactggatg tcgccttctg cgcgactgct ttccaggctg gcgcgcggct tctcaacggt 41640
cgttagcggt gtgcgcgcgt tcgtcagaca gtaagtacgg ttaataaaac agggcagggt 41700
cagccactgc ccataaagcc gacattatat tcaggccagc ggcgtgacgg tgaagcagcc 41760
atgccgcttt gagaaagaaa aggagtgttt ctgatgcgac tgatttgttt cccttatgca 41820
ggcggcagta cggcaatttt tcgtgggctg gcgcaattac tgcctgatat tgaggtacat 41880
actcccgaac tgcccgggca cggttcgcga atgaacgaag cagcgttcac gtcgatagaa 41940
gagcttgccg aacgcatgat catggaacta cgtcctcatt tttcccgccc atttgcgtta 42000
tttggccaca gtatgggggc cgcgctgtct tttgaaatcg tcagccagtt atcttttccc 42060
gagcgtgcaa acctgcgtca tctttttgtt agcgcctgtc ctgcacctgg ttttgctacc 42120
attcgacgcc gaccattgca ggatcttaac gatgctgact ttattgaaga gctgcgtctt 42180
cttggcggca caccgtcaga gatcctggat aatgcagagc tgatggcgct gctgcttcct 42240
atgctccggg ccgattttac cgccgtagaa aaccatcggg caaaatctga catcgttctt 42300
gacgccagcg tgacggcttt agctggcgac agggatgaaa gggtaactgc cgaggccgtc 42360
tttgcgtggc ggcatgcaac gcgcggcaac tttgtctcac atctgctgca gggcgaccat 42420
ttttttctta agccacagtt tcttacgatc gctaatatca ttaatttgcg actggcagct 42480
tagctatagc agggcgattc aggatagtcc tggtttacaa gccctgatcg actatatgga 42540
actccccgat gtcgcagttt tttattaatc gacccatctt tgcatgggtt attgccttat 42600
ttattgttct ggcaggattg attgccattc ctcagcttcc cgttgcacag tatccgtcag 42660
ttgctcctcc cagcgtcagc gtcagcgtga cctatccggg cgctacgccg gagacgatga 42720
acgaatccgt gatctcattg ctggagcgcg aaatatccgg cgtggataac atgctctatt 42780
tcgaatcctc cagcgacacc tcgggcacgg ccagtattac tatcaccttc catcccggaa 42840
ccgatgtcaa actggcgcag gtagatgtcc agaataaact caaggttgta gaagcccgtc 42900
tgccgcaaac ggttcggcaa aacggcatac aggtagaggc ggctaactca ggatttctga 42960
tgattgtcgg ccttcgatcg ccttcaggca cctataccga ccaggatctg agcgactatt 43020
tcgggcgtaa cgtttcggat gaactgcagc gcgtacccgg cgtcgggaaa gtgcagttct 43080
ttggtgctga aaaagcgatg cggatctggt tagatcccaa taagctgtat acctataatc 43140
tgtcggcttc cgacgttatc accgccttaa cgcagcaaaa cgcccaggtt tctccgggac 43200
gcgttggcga tgaacccgcc cggtcaggtc aaaaggttac ctatcagctt accgttcagg 43260
gacagctttc ttccatcgag gcgttccgta acatcaccct caaagcgcag cccgacggct 43320
cccgcgtacg gctgggcgac gtggcccgaa ttgagcacgg cctgcaaaac tactcttttg 43380
ctattcgtga aaacggcaag cccgctacgg ctgcagccat ccagctgacg cccggcgcta 43440
acgcggtcag cacggcagaa ggggtgcgcg cgcgactgag tgaactctcc acagcgctgc 43500
cggaaggcat ggcgttttcc gtaccttttg ataccgcgcc ttttgtgaaa ctgtcgattg 43560
aaaaagtcat tcataccttt attgaagcga tggtgctggt ttttctggtg atgctgttgt 43620
tcttacaaaa gctgcgttac acctttattc cggccattgt ggcacccgtc gcgctgcttg 43680
gcaccttcac tatcatgctg ttgagcggct tctctattaa cgtgctgacc atgtttggta 43740
tggtgctggc gattggtatt atcgtggatg atgccatcgt agtggtagaa aatgtggagc 43800
gcctgatggc ggaaaaggga atgtcgccca gggaagctac gcaagaggcg atgcgcgaga 43860
tcacgccagc catcattggc atcacgctgg tactgacggc ggtatttatc ccgatggggt 43920
ttgcgagcgg ttcaataggc gtgatttatc gccagtttac gctttctatg gccgtctcta 43980
ttctcttttc ggcttttctg gccttaacgc tgacgccagc tttatgtgct tctttgctgc 44040
atcctgttac cacgcacagt acaaataaaa aaggattttt cggctggttt aacaggcgtt 44100
ttaatcggct ggcaaacggc tatcgttcgg ggctgcggtt taccttgaag cgcagcggaa 44160
gaatgatgat tctttacgta ttgttatgct gtgtcgtttt tatggcttac cgcacgttgc 44220
cctcttcctt tctgcctgat gaagatcagg gatattttat gacggctatt cagttaccat 44280
cagatgccac acaggaacga acccgcaagg tggccgatca tctggaatcc atcgtggata 44340
aacgggacgg gataaacggc aatattaccg tctttggata tggtttttcc ggctcaggtc 44400
cgaatactgc actggcattt accactctga aagactggga ccagcgcaac ggagtaatgg 44460
ccgaaggcga agcggcgttt gtacagcagg agatggatac gcagcctgat gctatagcaa 44520
tgagcctgct gccgccggcg atagccgata tgggtacatc ttccggcttt actctttatc 44580
tggaagaccg gggtggaaaa ggctatgccg cgttaatgca ggcggccaca aagttaaccg 44640
ggctggccgc tggcagtagt atagtcagcg gcgtttacac cgacggattg ccagaaggcg 44700
tcagcgccag acttaatgtc gatcgggaaa aagcgcaggc gatgggagtg tcttttgacg 44760
agattaacca gactttatcg gtggcgaccg gctcttatta cgtcaacgat tatgttgacg 44820
ccggtcgtgt tcagcaggtg attgtgcagg cggatgcgcc ataccgcatg cagcttcagg 44880
atctgctcaa gctctacgtg cgcaacagta agggggagat ggtgccgcta tctgccttta 44940
taaccaccag ctggacgcag ctaccacaac aactaaatcg ctatcagggc tatccggcga 45000
taaaaatcag tggtagcaca gcacctggct actccagcgg ggcggcaatg gcagaaatgg 45060
aacggcttgc cggcacctta cccaaaggtt ttatggcaga gtggagcggt acttcacttc 45120
aggagaagaa ttcggcgtca cagatgccaa tgctgctggc attgtcagtg ctggtggtct 45180
ttatggttct ggccgcgctg tatgaaagct ggtcggtgcc gttttcggtg ttgatggtgg 45240
tcccgctggg cctggctggc gcgcttgcgg cggtttacct ggcacgtatg ccaaacgacg 45300
tattttttaa ggtgggcatg attatgctga tcgggctttc agctaaaaat gccattctta 45360
tcgttgagtt tgcccgccag ctgcatgcgc agggggcgac ggttttagaa gcaactattg 45420
aggcggcaat cctgcgcctt cggccgatta ttatgacctc gctggccttt accctgggtg 45480
ttgtgccgct gatgctggca acaggagcca gcgagcgcac ccagcacgcc atcggtaccg 45540
gcgtttttgg cgggatgatc agcggtacat taatggcgat ttactttgtc cccgttttct 45600
tcatttgcgt ctcatatctg gctacaaagt tatcatcagg cgataaaaaa gatcgtcatt 45660
aa 45662
<210> 2
<211> 2082
<212> PRT
<213> 成团泛菌(Pantoea agglomerans)
<400> 2
Met Leu Leu Asp Arg Gln Tyr Glu Val Ile Ala Lys Gln Ala Leu Tyr
1 5 10 15
Pro Phe Ala Pro Ile Phe Asn Ile Gly Ala Val Ile Asp Ile Arg Gly
20 25 30
Pro Leu Asp Glu Gln Arg Met Phe Asp Ala Asp Gln Ala Val Lys Arg
35 40 45
Asp Pro Ala Leu Arg Ser Ala Leu Ser Met Arg Gly Tyr Glu Pro Glu
50 55 60
Ile Val Thr Leu Ala Glu Asp Thr Phe Pro Leu Lys Ile Leu Asp Leu
65 70 75 80
Ser Cys Asn Asp Asp Pro Phe Thr Asn Ala Phe Leu Cys Ile Glu Gln
85 90 95
Ser Leu Gln Gln Met Phe Ala Phe Glu Gly Lys Thr Pro Leu Met Gln
100 105 110
His Thr Leu Ile Arg Leu Ala Asn Asp His His Leu Met Val Gly Ile
115 120 125
Tyr His His Leu Ala Tyr Asp Gly Trp Ala Thr Ser Leu Ile Tyr Gln
130 135 140
His Leu Ala Ala Tyr Tyr Asn Asp Phe Thr Arg Phe Asn Ser Val Arg
145 150 155 160
Asn Leu Ser Pro Leu Ser Tyr Gln Glu Gln Ile Ser Ala Glu Met Asn
165 170 175
Tyr Lys His Ser Ala Ser Tyr Met Ala Asp His Ser Tyr Trp Gln Ala
180 185 190
Arg Leu Ser Gly Tyr Asp Thr Met Leu Phe Ala Gln Gly Cys Arg Asp
195 200 205
Val Ala Ala Lys Arg Tyr Ser Phe Thr Leu Asp Ile His Tyr Arg Glu
210 215 220
Lys Leu Gln Glu Leu Ala Leu Asp Ser Gly Gly Thr Leu Phe Gln Val
225 230 235 240
Leu Thr Gly Ile Thr Ala Ile Phe Leu Phe Gln Leu Phe Gly Thr Asp
245 250 255
Asp Val Val Ile Gly Leu Pro Val Leu Asn Arg Arg Thr Ala Arg Ala
260 265 270
Lys Gln Thr Phe Gly Phe Phe Ala Asn Val Leu Pro Phe Arg Leu Gln
275 280 285
Arg Lys Ile Asn Asp Thr Phe Lys Thr Leu Leu Lys Asn Ile Ile Ile
290 295 300
Leu Leu Lys Glu Asp Tyr Arg His Gln Arg Phe Pro Ala Asn Gln Ile
305 310 315 320
Leu Lys Gly Gly Thr Ser Tyr Glu Ala Thr Leu Ser Tyr Glu Lys His
325 330 335
Asp Tyr Ser Ala Ile Phe Glu Gly Thr Asp Thr Gln Leu Asn Val Leu
340 345 350
Ser Ser Ser Cys Gln Asp Tyr Pro Leu Lys Leu Phe Ile Arg Asp Tyr
355 360 365
Asp Pro Glu Lys Pro Leu Lys Ile Asp Ile Asp Tyr Asn Ile Ser Ala
370 375 380
Phe Ser Glu Met Asp Val Glu His Val Phe Gln Glu Phe Lys Thr Ile
385 390 395 400
Leu Asp Asn Cys Ile Asn His Pro Glu Arg Gln Leu Val Ile Asn His
405 410 415
Gln Ile Arg Pro Asp Glu Ala Ser Phe Ile Pro Pro Asp Val Thr Thr
420 425 430
Glu Leu Cys Thr Gln Phe Glu Ala Ala Ala Ser Arg His Ala Asp Arg
435 440 445
Val Ala Ile Thr Cys Glu Gly Glu Ser Leu Thr Tyr Ala Ala Leu Asp
450 455 460
Ser Ala Ala Ser Ala Leu Ala Trp Arg Leu Arg Gly Leu Gly Val Gly
465 470 475 480
Thr Gly Pro His Glu Ser Leu Val Gly Leu Ser Ala Gly Arg Gly Pro
485 490 495
Gly Leu Leu Val Gly Ile Leu Gly Ile Leu Lys Ala Gly Gly Ala Tyr
500 505 510
Val Pro Leu Asp Pro Val Tyr Pro Ala Glu Arg Leu Ala Phe Leu Ala
515 520 525
Ala Asp Ser Gly Ile Arg Leu Ala Val Ala Asp Asp Thr Gly Leu Ala
530 535 540
Ala Leu Ala Gly Leu Gly Val Gln Thr Val Ser Leu Ser Ala Asp His
545 550 555 560
Pro Arg Arg Ala Gly Asn Gln Ala Pro Pro Arg Ser Leu His Pro Gln
565 570 575
Gln Ala Ala Tyr Val Ile Tyr Thr Ser Gly Ser Thr Gly Gln Pro Lys
580 585 590
Gly Cys Val Val Ser His Ala Ser Val Val Arg Leu Phe Thr Ala Thr
595 600 605
Glu His Tyr Gly Phe Gly Glu Ser Asp Val Trp Thr Leu Phe His Ser
610 615 620
Tyr Ala Phe Asp Phe Ser Val Trp Glu Ile Trp Gly Ala Leu Leu His
625 630 635 640
Gly Gly Arg Leu Val Val Val Pro Tyr Leu Ser Ser Arg Asp Pro Glu
645 650 655
Arg Phe Ala His Leu Leu Glu Ala Glu Ser Val Thr Val Leu Ser Gln
660 665 670
Thr Pro Ala Ala Phe Arg Gln Leu Thr Ala Ala Ser Ala Gly Arg Asp
675 680 685
Phe Ala Ala Leu Arg Leu Val Leu Phe Gly Gly Glu Ala Leu Glu Pro
690 695 700
Gly Ser Leu Ala Pro Trp Phe Ala Gln His Gly Gly Arg Val Arg Leu
705 710 715 720
Val Asn Met Tyr Gly Ile Thr Glu Thr Thr Val His Val Thr Glu Tyr
725 730 735
Thr Leu Thr Pro Glu Ser Met Thr Gln Gly Ser Val Ile Gly Thr Ala
740 745 750
Leu Ala Asp Leu His Val Gln Val Leu Asp Arg Tyr Gly Glu Pro Val
755 760 765
Pro Ala Gly Val Thr Gly Glu Met Tyr Val Gly Gly Ala Gly Val Thr
770 775 780
Arg Gly Tyr Leu Gly Arg Ala Ala Leu Thr Ala Gln Arg Phe Val Pro
785 790 795 800
Asp Pro Phe Gly Ala Pro Gly Ala Arg Leu Tyr Arg Ser Gly Asp Leu
805 810 815
Ala Arg Arg Arg Ala Asp Gly Gly Leu Val Tyr Gln Gly Arg Ala Asp
820 825 830
Gln Gln Leu Lys Leu Arg Gly Tyr Arg Ile Glu Pro Gly Glu Ile Glu
835 840 845
Ala Ala Leu Arg Ala Gln Ala Gly Val Arg Asp Ala Ala Val Val Leu
850 855 860
Asp Ala Pro Ala Gln Gly Gln Pro Arg Leu Val Ala Tyr Val Val Gly
865 870 875 880
Gly Gly Gly Ala Gln Ala Leu Arg Glu Ala Leu Ser Ala Ala Leu Pro
885 890 895
Glu His Met Val Pro Ala Val Ile Met Pro Leu Ala Arg Leu Pro Leu
900 905 910
Thr Ala His Gly Lys Leu Asp Arg Lys Ala Leu Pro Glu Pro Glu Val
915 920 925
Thr Val Ser Ala Gly Gly Glu Ala Arg Thr Glu Val Glu Lys Thr Leu
930 935 940
Ala Gly Ile Trp Ser Glu Val Leu Ser Ile Pro Ala Pro Gly Ile Asp
945 950 955 960
Asp Asn Phe Phe Thr Leu Gly Gly Asp Ser Ile Ser Ser Leu Gln Val
965 970 975
Val Ser Arg Ala Arg Ala Ala Gly Ile Asn Ile Thr Ile Glu Gly Phe
980 985 990
Leu Ala Gly Gln His Ile Arg Lys Ile Ala Ala Gly Val Gln Ser Gly
995 1000 1005
Pro Val Ala Ala Asp Asp Glu Ser Leu Thr Val Pro Phe Ser Leu Leu
1010 1015 1020
Ser Ala Ala Asp Arg Ala Arg Leu Pro Asp Asn Val Asp Asp Ala Phe
1025 1030 1035 1040
Pro Leu Ser Arg Leu Gln Ala Gly Met Leu Phe His Ser Thr Leu Ala
1045 1050 1055
Glu Glu Gly Ala Ile Phe His Asp Val Phe Thr Phe Arg Leu Arg Met
1060 1065 1070
Pro Trp Asn Glu His Ala Trp Arg Ser Ala Phe Glu Leu Leu Pro Ala
1075 1080 1085
Ser His Thr Pro Leu Arg Thr Ser Phe His Trp Thr Gly Tyr Ser Glu
1090 1095 1100
Pro Leu Gln Val Val His Ser Thr Ala Asp Ile Asp Tyr Gln Ile Val
1105 1110 1115 1120
Asp Leu Arg Tyr Leu Glu Thr Glu Gln Arg Arg Gln Ala Val Asn Asp
1125 1130 1135
Phe Ile Ala His Ser Lys Ser Tyr Gly Phe Asp Pro Ala Lys Gly Arg
1140 1145 1150
Met Phe Arg Val Ser Leu His Arg His Ser Asp Glu Glu Leu Gln Leu
1155 1160 1165
Thr Leu Asp Phe His His Ala Ile Phe Asp Gly Trp Ser Val Ala Thr
1170 1175 1180
Leu Leu Ser Thr Leu Ile His Arg Val Thr Gly Thr Glu Ala Thr Asn
1185 1190 1195 1200
Ala Arg Ser Asp Thr Thr Val Asn Thr Ala Phe Val Ala Leu Glu Arg
1205 1210 1215
Lys Ala Glu Ala Asp Glu Gln Leu Val Ala Lys Trp Arg Glu Arg Val
1220 1225 1230
Ala Asp Val Val Pro Thr Leu Leu Gly Asp His Ser Ala Ala Glu Leu
1235 1240 1245
Ser Gly Thr Arg Gln Val Gln Arg Arg Ala Phe Arg Leu Pro Asp His
1250 1255 1260
Leu Thr Ser Lys Leu Lys Gln Arg Ala Thr Asp Leu Ala Ile Pro Leu
1265 1270 1275 1280
Lys Ile Val Leu Leu Thr Ala His Leu Ser Ala Leu Ala Lys Val Thr
1285 1290 1295
Gly Gly Thr Val Thr Thr Thr Gly Tyr Val Thr His Gly Arg Pro Ala
1300 1305 1310
Gly Ala Asp Lys Ala Val Gly Leu Phe Leu Asn Thr Leu Pro Phe Ser
1315 1320 1325
Met Ala Leu Pro Pro Val Ser Trp Asn Ser Leu Ile Lys Ser Ile Ala
1330 1335 1340
Ala Glu Glu Gln Ala Ile Gln Ala Ile Arg Arg Leu Pro Ala Ser Val
1345 1350 1355 1360
Ile Lys Pro Leu Asn Ser Ser Gly Gln Leu Tyr Asn Val Ser Phe Asn
1365 1370 1375
Tyr Ile His Phe His Ile Tyr Asn Ser Leu Pro Asp Leu Ala Asp Phe
1380 1385 1390
Gln Val Val Asp Phe Glu Ile Phe Glu Glu Thr Asp Phe Pro Leu Leu
1395 1400 1405
Ala Gln Tyr Ser Gln Asp Pro Phe Asp Ala Ser Leu Glu Leu Thr Leu
1410 1415 1420
Val Ala Asp Pro Ala Val Val Pro Glu Trp Gln Val Glu Gln Phe Gly
1425 1430 1435 1440
Asp Phe Val Leu Arg Ala Ala Glu Ala Ile Val Ser Gly Ser Glu Ala
1445 1450 1455
Pro Trp Tyr Ser Ser Leu Arg Ser Glu Ala Leu Pro Leu Val Pro Asp
1460 1465 1470
Ala Ser Ser Glu Leu Thr Leu Asp Leu Cys Thr Gln Phe Glu Ala Ala
1475 1480 1485
Ala Ser Arg His Ala Asp Arg Val Ala Ile Thr Cys Glu Gly Glu Ser
1490 1495 1500
Leu Thr Tyr Ala Ala Leu Asp Ser Ala Ala Ser Ala Leu Ala Trp Arg
1505 1510 1515 1520
Leu Arg Gly Leu Gly Val Gly Thr Gly Pro His Glu Ser Leu Val Gly
1525 1530 1535
Leu Ser Ala Gly Arg Gly Pro Gly Leu Leu Val Gly Ile Leu Gly Ile
1540 1545 1550
Leu Lys Ala Gly Gly Ala Tyr Val Pro Leu Asp Pro Val Tyr Pro Ala
1555 1560 1565
Glu Arg Leu Ala Phe Leu Ala Ala Asp Ser Gly Ile Arg Leu Ala Val
1570 1575 1580
Ala Asp Asp Thr Gly Leu Ala Ala Leu Ala Gly Leu Gly Val Gln Thr
1585 1590 1595 1600
Val Ser Leu Ser Ala Asp His Pro Arg Arg Ala Gly Asn Gln Ala Pro
1605 1610 1615
Pro Arg Ser Leu His Pro Gln Gln Ala Ala Tyr Val Ile Tyr Thr Ser
1620 1625 1630
Gly Ser Thr Gly Gln Pro Lys Gly Cys Val Val Ser His Ala Ser Val
1635 1640 1645
Val Arg Leu Phe Thr Ala Thr Glu His Tyr Gly Phe Gly Glu Ser Asp
1650 1655 1660
Val Trp Thr Leu Phe His Ser Tyr Ala Phe Asp Phe Ser Val Trp Glu
1665 1670 1675 1680
Ile Trp Gly Ala Leu Leu His Gly Arg Ala Pro Gly Gly Gly Ala Leu
1685 1690 1695
Pro Glu Gln Pro Arg Pro Gly Ala Leu Cys Pro Pro Ala Gly Ser Gly
1700 1705 1710
Val Gly His Arg Ala Gln Pro Asp Pro Gly Gly Leu Pro Thr Ala Asp
1715 1720 1725
Arg Gly Leu Gly Arg Thr Gly Leu Cys Gly Ala Ala Ala Gly Ala Val
1730 1735 1740
Arg Arg Arg Ser Pro Gly Ala Gly Gln Pro Gly Ala Val Val Arg Ala
1745 1750 1755 1760
Ala Arg Arg Ala Gly Glu Ala Gly Gln His Val Arg His His Arg Asp
1765 1770 1775
His Gly Thr Arg Asp Arg Val His Ala Asp Ala Arg Glu His Asp Ala
1780 1785 1790
Gly Gln Arg Asp Arg His Gly Ala Gly Gly Phe Ala Arg Ala Gly Ala
1795 1800 1805
Gly Pro Leu Arg Arg Ala Gly Ala Gly Gly Gly Asp Gly Arg Asp Val
1810 1815 1820
Arg Gly Arg Arg Gly Arg Asp Ala Gly Leu Pro Gly Pro Gly Gly Ala
1825 1830 1835 1840
Asp Gly Ala Ala Leu Arg Ala Gly Ser Val Arg Arg Ala Gly Gly Glu
1845 1850 1855
Ala Leu Pro Leu Arg Arg Pro Gly Ala Pro Pro Gly Gly Arg Arg Pro
1860 1865 1870
Gly Val Pro Gly Pro Ala Asp Gln Gln Leu Lys Leu Arg Gly Tyr Arg
1875 1880 1885
Ile Glu Pro Gly Glu Ile Glu Ala Ala Leu Arg Ala Gln Ala Gly Val
1890 1895 1900
Arg Asp Ala Ala Val Val Leu Asp Ala Pro Ala Gln Gly Gln Pro Arg
1905 1910 1915 1920
Leu Val Ala Tyr Val Val Gly Gly Gly Gly Ala Gln Ala Leu Arg Glu
1925 1930 1935
Ala Leu Ser Ala Ala Leu Pro Glu His Met Val Pro Ala Val Ile Met
1940 1945 1950
Pro Leu Ala Arg Leu Pro Leu Thr Ala His Gly Lys Leu Asp Arg Lys
1955 1960 1965
Ala Leu Pro Glu Pro Glu Ile Ala Val Ala Gln Asn Glu Glu Gly Tyr
1970 1975 1980
Gln Ser Ser Leu Glu Gln Glu Ile Ala Glu Leu Leu Ser Ser Val Leu
1985 1990 1995 2000
Gly Leu Ser Gly Ile Gly Arg His Gln Ser Phe Leu Glu Thr Gly Gly
2005 2010 2015
Asp Ser Ile Leu Ala Thr Gln Ala Leu Phe Arg Leu Arg Glu Leu Tyr
2020 2025 2030
Gly Val Glu Leu Pro Leu Arg Thr Ile Phe Glu Ala Gly Thr Val Ala
2035 2040 2045
Gly Val Ala Ala Lys Ile Lys Ala Leu Arg Gln Glu Glu Arg His Gly
2050 2055 2060
Glu Arg Gln Ile Ser Asp Ser Thr Pro Leu Leu Pro Ser Arg Arg Arg
2065 2070 2075 2080
Gln Lys
<210> 3
<211> 423
<212> PRT
<213> 成团泛菌(Pantoea agglomerans)
<400> 3
Leu Leu Phe Pro Leu Ser Ser Ala Gln Arg Arg Leu Trp Thr Leu Ala
1 5 10 15
Glu Ile Asn Glu Ala Asp Val Ser Tyr Asn Ile Pro Phe Ala Leu Arg
20 25 30
Cys Arg Gly Lys Phe His Tyr Gln Ala Leu Arg Gln Ala Leu Thr Asp
35 40 45
Leu Gln Gln Arg His Glu Ile Leu Arg Thr Ser Tyr Gly Leu Ile Asp
50 55 60
Asp Ser Pro Met Gln Arg Ile His Pro Ala Glu Asp Asp Leu Ala Leu
65 70 75 80
Pro Leu Ile Arg Ile Asn Glu Ala Gln Leu Glu Lys Lys Leu Ala Glu
85 90 95
Asp Ala Ala Glu Pro Phe Asn Leu Gln Leu Ala Pro Val Phe His Ala
100 105 110
Arg Val Tyr Gln Leu Asn Asp Asp His His Ile Leu Ser Met Val Val
115 120 125
His His Ile Ala Cys Asp Gly Trp Ser Val Thr Ile Leu Leu Arg Glu
130 135 140
Leu Ser His Phe Tyr Asn Ala Arg Val Ala Asn Met Ser Pro Thr Leu
145 150 155 160
Ala Glu Leu Pro Leu Gln Tyr Ala Asp Tyr Ala Glu Trp Glu Glu Ala
165 170 175
Glu Ala Lys Arg Thr Ala Asn Pro Ala Gly Glu Thr Gly Thr Arg Leu
180 185 190
His Phe Gln Pro Ala Val Ala Leu Pro Gly Cys Glu Ser Asp Glu Ala
195 200 205
Asp Lys Glu Asn Ala Cys Gly Ile Val Gln Gln Arg Phe Asp Ala Asp
210 215 220
Phe Leu Gln Lys Leu Asn Gly Tyr Ala Arg Glu His His Thr Thr Leu
225 230 235 240
Phe Val Thr Leu Leu Ala Gly Phe Met Ala Leu Leu Arg Arg Leu Thr
245 250 255
Gln Ala Asp Asp Val Cys Ile Gly Phe Pro Val Ala Asn Arg Lys Arg
260 265 270
Ser Glu Leu Glu Asn Ile Val Gly Tyr Phe Val Asn Thr Leu Val Ile
275 280 285
Arg Asp Glu Ile Ser Arg Asp Asp Thr Phe Asp Ser Leu Val Ala Arg
290 295 300
Cys Ala Ser Ser Val Leu Asp Ala Leu Glu His Glu Glu Ala Ser Tyr
305 310 315 320
Glu Lys Leu Leu Lys Gln Thr Pro Arg Glu Asn Thr Asn Ser Val Pro
325 330 335
Phe Thr Ala Met Phe Ala Phe Glu Asn Ile Ser Ala Thr Glu Phe Ala
340 345 350
Phe Asn Asp Leu Gln Ile Glu Leu Val Asp Val Tyr Pro Ala Gln Ala
355 360 365
Lys Phe Asp Leu Thr Leu Leu Leu Lys Gln Asp Gly Glu Val Leu Thr
370 375 380
Ala Thr Phe Glu Phe Arg Ala Ser Val Phe Tyr Pro Arg Ser Pro Val
385 390 395 400
Pro Gly Trp His Ala Ile Ser Ala Tyr Trp Lys Pro Lys Phe Trp Pro
405 410 415
Gln Pro Arg Gln Leu Ile Gly
420
<210> 4
<211> 2499
<212> PRT
<213> 成团泛菌(Pantoea agglomerans)
<400> 4
Leu Thr Tyr Ala Ala Leu Asp Ser Ala Ala Ser Ala Leu Ala Trp His
1 5 10 15
Leu Arg Gly Leu Gly Val Gly Thr Gly Pro His Glu Ser Leu Val Gly
20 25 30
Leu Ser Ala Gly Arg Gly Pro Gly Leu Leu Val Gly Ile Leu Gly Ile
35 40 45
Leu Lys Ala Gly Gly Ala Tyr Val Pro Leu Asp Pro Val Tyr Pro Ala
50 55 60
Glu Arg Leu Ala Phe Leu Ala Ala Asp Ser Gly Ile Arg Leu Ala Val
65 70 75 80
Ala Asp Asp Thr Gly Leu Ala Ala Leu Ala Gly Leu Gly Val Gln Thr
85 90 95
Val Ser Leu Ser Ala Asp His Pro Arg Arg Ala Gly Asn Gln Ala Pro
100 105 110
Pro Arg Ser Leu His Pro Gln Gln Ala Ala Tyr Val Ile Tyr Thr Ser
115 120 125
Gly Ser Thr Gly Gln Pro Lys Gly Cys Val Val Ser His Ala Ser Val
130 135 140
Val Arg Leu Phe Thr Ala Thr Glu His Tyr Gly Phe Gly Glu Ser Asp
145 150 155 160
Val Trp Thr Leu Phe His Ser Tyr Ala Phe Asp Phe Ser Val Trp Glu
165 170 175
Ile Trp Gly Ala Leu Leu His Gly Gly Arg Leu Val Val Val Pro Tyr
180 185 190
Leu Ser Ser Arg Asp Pro Glu Arg Phe Ala His Leu Leu Glu Ala Glu
195 200 205
Ser Val Thr Val Leu Ser Gln Thr Pro Ala Ala Phe Arg Gln Leu Thr
210 215 220
Ala Ala Ser Ala Gly Arg Asp Phe Ala Ala Leu Arg Leu Val Leu Phe
225 230 235 240
Gly Gly Glu Ala Leu Glu Pro Gly Ser Leu Ala Pro Trp Phe Ala Gln
245 250 255
His Ser Gly Arg Val Arg Leu Val Asn Met Tyr Gly Ile Thr Glu Thr
260 265 270
Thr Val His Val Thr Glu Tyr Thr Leu Thr Pro Glu Ser Met Thr Gln
275 280 285
Gly Ser Val Ile Gly Thr Ala Leu Ala Asp Leu His Val Gln Val Leu
290 295 300
Asp Arg Tyr Gly Glu Pro Val Pro Ala Gly Val Thr Gly Glu Met Tyr
305 310 315 320
Val Gly Gly Ala Gly Val Thr Arg Gly Tyr Leu Gly Arg Ala Ala Leu
325 330 335
Thr Ala Gln Arg Phe Val Pro Asp Pro Phe Gly Ala Pro Gly Ala Arg
340 345 350
Leu Tyr Arg Ser Gly Asp Leu Ala Arg Arg Arg Ala Asp Gly Gly Leu
355 360 365
Val Tyr Gln Gly Arg Ala Asp Gln Gln Leu Lys Leu Arg Gly Tyr Arg
370 375 380
Ile Glu Pro Gly Glu Ile Glu Ala Ala Leu Arg Ala Gln Ala Gly Val
385 390 395 400
Arg Asp Ala Ala Val Val Leu Asp Ala Pro Ala Gln Gly Gln Pro Arg
405 410 415
Leu Val Ala Tyr Val Val Gly Gly Lys Gly Ala Gln Ala Leu Arg Glu
420 425 430
Ala Leu Ser Ala Ala Leu Pro Glu His Met Val Pro Ala Val Ile Met
435 440 445
Pro Leu Ala Arg Leu Pro Leu Thr Ala His Gly Lys Leu Asp Arg Lys
450 455 460
Ala Leu Pro Glu Pro Glu Val Thr Val Ser Ala Gly Gly Glu Ala Arg
465 470 475 480
Thr Glu Val Glu Lys Thr Leu Ala Gly Ile Trp Ser Glu Val Leu Ser
485 490 495
Ile Pro Val Pro Gly Ile Asp Asp Asn Phe Phe Thr Leu Gly Gly Asp
500 505 510
Ser Ile Ser Ser Leu Gln Val Val Ser Lys Ala Arg Ala Ala Gly Ile
515 520 525
Ala Ile Thr Pro Lys Gln Ala Leu Leu Phe Thr Thr Leu Arg Lys Leu
530 535 540
Ala Ala Val Ala Glu Thr Ser Lys Gly Asn Ala Ala Leu His Gln Asn
545 550 555 560
Ala Arg Cys Pro Ser Gly Pro Leu Leu Pro Thr Pro Ile Ile Ala Trp
565 570 575
Phe Gln Ala Leu Lys Leu Ser Ala Pro Ala His Trp Asn Gln Ser Leu
580 585 590
Ala Leu Glu Ile Ala His Pro Val Ala Pro Asp Leu Leu Ala Gln Ala
595 600 605
Leu Lys Ala Ile Gly Gln His His Asp Ala Phe Arg Leu Arg Leu Asp
610 615 620
Tyr Gly Asn Ala Glu Ser Leu Ser Leu Ala Glu Val Met Gln Glu Pro
625 630 635 640
Phe Pro Leu Glu Ile Arg Thr Val Asn Ser Gln Val Glu Arg Asp Ala
645 650 655
Ala Ile Leu His Ala Gln Lys Gly Leu Ser Leu Asp Asp Gly Pro Val
660 665 670
Gly Arg Ala Met Leu Ile Gln His Ala Gly Glu Thr Asp Ile Leu Val
675 680 685
Leu Val Ile His His Ile Ala Val Asp Ala Val Ser Trp His Ile Leu
690 695 700
Leu Asp Asp Leu Asn Val Ala Ile Lys Arg Leu Gln Asn Ala Gln Lys
705 710 715 720
Ile Val Leu Asp Pro Val Val Thr Asn Leu Thr Asp Trp Ser Arg Ser
725 730 735
Leu Gln Thr Ala Ala Glu Arg Ala Asp Pro Gln Arg Trp Leu Arg Met
740 745 750
Ala Ala Gln Gly Asn Pro Ser Pro Phe His Asp Phe Val Thr Val Gln
755 760 765
Gly Leu Asn Arg Glu Gln Gly Leu Thr Val Cys Ser Arg Thr Leu Ser
770 775 780
Ser Glu Asn Ser Ala Leu Phe Leu Gln Leu Leu Ser Arg Gly Ser Glu
785 790 795 800
Ala Arg Ala Ser Ala Leu Leu Cys Ala Ala Leu Trp Arg Leu Phe Asn
805 810 815
Glu Gln Pro Leu Ala Val Thr Leu Glu His Asn Gly Arg Asp Val Asp
820 825 830
Lys Asp Ala Asp Leu Ser Arg Thr Leu Gly Trp Phe Thr Ser Leu Tyr
835 840 845
Pro Phe Phe Tyr Ser Gly Gln Pro Ala Leu Ala Ser Ala Glu Leu Leu
850 855 860
Ala Glu Met Glu Ser Ser Leu Leu Glu Leu Ala Pro His Lys Ala Glu
865 870 875 880
Tyr Gly Leu Val Arg Trp Leu Ser Glu Asp Glu Glu Val Arg Ala Lys
885 890 895
Leu Asp Glu Ala Asp Leu Pro Ala Leu Ser Leu Asn Tyr Leu Gly Gln
900 905 910
Ile Pro Asp Gln Gln Glu Gly Glu Phe Val Leu Arg His Asp Ile Ser
915 920 925
Ser Val Asp Arg Ala Val Gly Asn Val Arg Ala Phe Thr Leu Asp Leu
930 935 940
Val Ala Val Val Ile Asn Gly Glu Leu Arg Phe Tyr Trp Asn Tyr Cys
945 950 955 960
Arg Asn Val Leu Lys Pro Glu Ile Val Glu Gly Trp Ala Asp Ala Leu
965 970 975
Gln Gln His Leu Gln Gln Leu Leu Thr Glu Leu Thr Ala Arg Pro Leu
980 985 990
Leu Val Ala Asp Phe Pro Leu Ala Arg Ile Arg Gln Ile Gln Phe Glu
995 1000 1005
Ala Leu Val Gly Lys Gln Ala Val Ala Asp Ala Tyr Pro Leu Ser Pro
1010 1015 1020
Leu Gln Glu Gly Met Leu Phe His Ser Val Ala Glu Pro Glu Asn His
1025 1030 1035 1040
Ala Tyr His Glu Gln Ala Val Ala Leu Phe Glu Arg Leu Asp Ala Asp
1045 1050 1055
Leu Phe Ile Lys Ala Trp Lys Thr Leu Leu Ser Arg His Asp Ile Leu
1060 1065 1070
Arg Thr Ser Phe His Trp Gln Asp Leu Pro Arg Pro Leu Gln Ile Val
1075 1080 1085
His Ala Thr Ala Asp Leu Pro Val Thr Val Phe Asp Trp Arg Gly Glu
1090 1095 1100
Asp Pro Ala Glu Arg Leu Ala Glu Phe Leu Gln Gln Asp Ala Asp Lys
1105 1110 1115 1120
Ala Phe Asp Leu Ser Val Ala Pro Leu Leu Arg Val Met Leu Ala Arg
1125 1130 1135
Ile Asp His Asn Ser Trp Arg Trp Val Cys Ser Tyr His His Ile Leu
1140 1145 1150
Met Asp Gly Trp Ser Leu Pro Leu Leu Met Gly Glu Leu Val His Ile
1155 1160 1165
Tyr Glu Ser Leu Val Ala Ala Thr Gln Pro Thr Leu Pro Pro Pro Val
1170 1175 1180
Gln Tyr Gly Arg His Ile Ala Arg Leu Val Gln His Ala Ser Glu Gln
1185 1190 1195 1200
Thr Gly Lys Val Phe Trp Leu Asn Ala Leu Ala Gly Leu Glu Arg Pro
1205 1210 1215
Thr Leu Leu Ser Pro Gln Gln Gln Pro Ser Ala Asp Tyr His Asp Leu
1220 1225 1230
Leu Val Thr Leu Ser Pro Glu Gln Glu Gln Ala Ile Arg Thr Ala Ala
1235 1240 1245
Arg Glu Ala Gly Val Ser Leu Gly Asn Val Phe Asn Ala Ala Trp Gly
1250 1255 1260
Ile Leu Leu Ala Leu Ser Gly His Gly Asn Asp Val Val Phe Gly Ser
1265 1270 1275 1280
Thr Leu Ser Gly Arg Glu Thr Gly Val Glu Asp Val Asp Lys Met Ile
1285 1290 1295
Gly Leu Phe Ile Asn Thr Leu Pro Leu Arg Leu Arg Leu Arg Pro Glu
1300 1305 1310
Met Ser Val Arg Asp Leu Leu His Lys Ala Arg Gln Phe Gln Ala Asp
1315 1320 1325
Leu Gln Glu His Ser His Asp Arg Leu Val Asp Val Gln Arg Trp Ser
1330 1335 1340
Gly Leu Glu Gly Glu Gly Thr Leu Phe Asp Ser Val Leu Val Ile Glu
1345 1350 1355 1360
Asn Tyr Pro Gly Gly Ala Pro Glu Asp Asn Gly Lys Gly Phe Arg Leu
1365 1370 1375
Val Glu Phe Ala Tyr Lys Glu His Ser Asn Tyr Pro Val Thr Leu Ala
1380 1385 1390
Val Leu Pro Asp Asn Gly Leu Lys Ile Lys Leu Asp Tyr Asn Cys Ala
1395 1400 1405
Thr Phe Asp Asp Thr Ala Ala Ala Leu Leu Leu Lys Arg Leu Thr Asp
1410 1415 1420
Leu Ile Ser Lys Met Ile Glu Asp Pro Asp Arg Arg Leu Ser Thr Leu
1425 1430 1435 1440
Asp Leu Leu Ala Glu Glu Glu Gln Ile Ile Ala Arg Glu Val Trp Asn
1445 1450 1455
Ala Gly Ala Phe Asn Ala Ala Ser Pro Val Leu Ala His Gln Met Phe
1460 1465 1470
Glu Lys Ser Val Ser Arg Gln Pro Gln Ala Pro Ala Leu Leu Gln Gly
1475 1480 1485
Glu Thr Lys Tyr Asp Tyr Ser Gln Leu Asn His Lys Ala Asp Ala Leu
1490 1495 1500
Ala Ala Thr Leu Gln Gln Gln Gly Val Gly Pro Glu Ser Val Val Ala
1505 1510 1515 1520
Val Met Leu Ser Arg Gly Pro Glu Ala Val Ile Ser Phe Leu Ala Ile
1525 1530 1535
Leu Lys Ala Gly Gly Val Tyr Leu Pro Leu Asp Ala Gln Tyr Pro Val
1540 1545 1550
Asp Arg Leu Asp Tyr Met Leu Arg Asp Ser Gln Ala Val Met Leu Leu
1555 1560 1565
Ser Asp Lys Ala Gln Ser Val Glu Lys Leu Thr Ala Met Pro Lys Ala
1570 1575 1580
Leu Leu Leu Leu Asp Ser Phe Asp Phe Met Ser Asp Ala Arg Pro Ala
1585 1590 1595 1600
Ala Cys Thr Asn Leu Thr Ala Asn Asn Leu Ala Tyr Leu Ile Tyr Thr
1605 1610 1615
Ser Gly Ser Thr Gly Lys Pro Lys Pro Val Gly Val Ser His Ala Gly
1620 1625 1630
Ile Ala Asn Leu Gln Ala Glu Thr Glu Arg Met Leu Gly Thr Asp Ala
1635 1640 1645
His Ala Arg Val Tyr Met Gln Ala Pro Leu Ser Phe Asp Ala Ser Val
1650 1655 1660
Trp Glu Met Met Met Ala Leu Phe Gly Gly Gly Ala Leu Val Leu Pro
1665 1670 1675 1680
Asp Gly Asp Ala Glu Gly Asp Val Leu Ala Ala Leu Asn Gln Ala Ala
1685 1690 1695
Glu Arg His His Phe Thr His Val Leu Val Thr Pro Ala Leu Leu Gly
1700 1705 1710
Leu Leu Lys Asp Tyr Ala Leu Pro Ser Leu His Thr Leu Ile Val Gly
1715 1720 1725
Gly Asp Ala Ser Ala Pro Gly Met Met Ala His Trp Ala Lys Ser Arg
1730 1735 1740
Arg Val Phe Asn Ala Tyr Gly Pro Ser Glu Cys Thr Val Cys Val Ala
1745 1750 1755 1760
Ile Glu Pro Cys Gly Val Asn Thr Val Thr Pro Pro Leu Gly Leu Pro
1765 1770 1775
Leu Tyr Gly Ile Pro Met Tyr Leu Leu Asp Ser Trp Gly Asn Pro Val
1780 1785 1790
Pro Pro Gly Val Ile Gly Glu Ile Phe Leu Gly Gly Asp Ser Leu Ala
1795 1800 1805
Arg Gly Tyr Ile Gly Arg Pro Ala Leu Thr Ala Gly Val Phe Ile Pro
1810 1815 1820
Asp His Leu Ser Gly Leu Pro Gly Ala Arg Leu Tyr Arg Thr Gly Asp
1825 1830 1835 1840
Thr Ala Ile Arg Leu Gln Asp Gly Arg Ile Lys Tyr Ala Gly Arg Thr
1845 1850 1855
Gly Gly Tyr Ala Lys Leu Arg Gly Asn Arg Ile Asp Leu Asn Gly Val
1860 1865 1870
Glu Leu Leu Leu Gln Gly His Pro Ala Val Arg Glu Ala Leu Ala Met
1875 1880 1885
Ile Arg Thr Val Glu Asn Gly Gln Ser Leu Ile Ala Trp Val Val Ala
1890 1895 1900
Glu Lys Gly Thr Glu Ala Asn Glu Leu Arg Asp Tyr Met Val Lys His
1905 1910 1915 1920
Ala Ala Ala Phe Glu Val Pro Gly Ala Ile Val Pro Leu Thr Arg Trp
1925 1930 1935
Pro Leu Thr Pro Ala Gly Lys Ile Asp Arg Asn Ala Leu Pro Leu Pro
1940 1945 1950
Ala Thr Ala Pro Arg Ala Ser Val Asp Gly Lys Ala Leu Arg Pro Ala
1955 1960 1965
Glu Ala Ala Leu Leu Gln Ile Trp Ser Gln Ala Leu Gly Arg Asp Asp
1970 1975 1980
Ile Asp Leu His Asp Asp Tyr Phe Ser Leu Gly Gly Asp Ser Ile Ile
1985 1990 1995 2000
Ala Leu Gln Ile Thr Ser Leu Ala Arg Gln Glu Gly Trp Ser Val Thr
2005 2010 2015
Pro Arg Met Val Leu Gln Tyr Arg Thr Val Ala Ala Leu Ala Ala Met
2020 2025 2030
Ala Ser Val Leu Asp Thr His Glu Pro Glu Pro Asp Asn Ala Lys Val
2035 2040 2045
Glu Leu Ala Pro Ile Gln His Trp Tyr Phe Ala Gln Asn Leu Pro Ala
2050 2055 2060
Val Ala His Trp Asn Leu Ser Ile Arg Leu Glu Leu Gln Ser Arg Met
2065 2070 2075 2080
Val Pro Gln Leu Leu Gln Gln Ala Leu Asn Glu Leu Val Lys Leu His
2085 2090 2095
Pro Ala Leu Arg Leu Arg Phe Glu His Val Asp Gly Val Trp Gln Gln
2100 2105 2110
His Tyr Ser Asp Ala Ala Thr Ile Pro Leu Glu Leu Leu Pro Glu Ser
2115 2120 2125
His Gln Lys Ala Ala Asp Arg Glu Ala Gly Leu Gln Ser Leu Leu Asn
2130 2135 2140
Leu Ser Thr Gly Pro Leu Leu Arg Ala Ala Tyr Arg Asp Ala Gly Glu
2145 2150 2155 2160
Thr Asn Gln Pro Glu Leu Val Leu Ile Ala His His Leu Ile Met Asp
2165 2170 2175
Thr Trp Ser Leu Arg Ile Leu Val Glu Asp Leu Ala Ser Leu Tyr Ser
2180 2185 2190
Ser Leu Gln Ser Gly Thr Pro Leu Arg Val Leu Gln Glu Gly Thr Ser
2195 2200 2205
Tyr Arg Gln Trp Ser Gln Trp Leu Thr Gln His Ala Ala Asp Phe Thr
2210 2215 2220
Ala Gln Thr Ser Tyr Trp Arg Asn Met Leu Asp Ala Gly Thr Pro Pro
2225 2230 2235 2240
Val Ala Met Pro Arg Lys Gly Cys Val Gly Asp Arg Gln Val Ile Phe
2245 2250 2255
Ala Glu Leu Asp Arg Glu Thr Ser Asp Leu Leu Thr Gly Asp Ala His
2260 2265 2270
Gln Ala Tyr His Ser Arg Gly Gln Glu Leu Leu Leu Thr Ala Leu Ala
2275 2280 2285
Gln Ala Trp His Arg Trp Cys Gly Asn Thr His Leu Ala Ile Glu Leu
2290 2295 2300
Glu Thr His Gly Arg Glu Ala Phe Gln Asp Ala Ala Met Asp Leu Ser
2305 2310 2315 2320
Arg Ser Val Gly Trp Phe Thr Ala Leu Phe Pro Leu Cys Ile Ala Ala
2325 2330 2335
Gly Ser Asp Trp Ala Asn Thr Val Asp Asn Val Lys Gln Thr Leu Arg
2340 2345 2350
His Ile Pro Ser Gly Gly His Gly Tyr Gly Ile Leu Arg Tyr Leu Leu
2355 2360 2365
Lys Thr Pro Asp Ile Cys Lys Leu Thr Pro Pro Ser Ile Ser Phe Asn
2370 2375 2380
Tyr Leu Gly Asp Thr Ala Met Ser Ala Ser Ser Gly Met Ala Ile Gln
2385 2390 2395 2400
Leu Ser Arg Arg Glu Ala Gly Pro Gly Gln Ala Ala Tyr Gln Leu Leu
2405 2410 2415
Pro His Ala Leu Asn Val Thr Val Met Leu Val Ala Gly Arg Leu Arg
2420 2425 2430
Leu Ser Leu Ala Tyr Ala Asp Thr Ser Ala Asp Thr Ala Met Gln Thr
2435 2440 2445
Leu Leu Asn His Tyr Gln His Ala Leu His Asp Leu Ala Glu His Cys
2450 2455 2460
Arg Leu Ala Glu Pro Val Asp Leu Gln Ser Ser Asp Val Ser Gly Val
2465 2470 2475 2480
Gln Leu Ser Asp Ser Glu Leu Ser Ala Ile Leu Ser Asp Leu Thr Glu
2485 2490 2495
Asp Asp Gln
<210> 5
<211> 3861
<212> PRT
<213> 成团泛菌(Pantoea agglomerans)
<400> 5
Met Asn Thr Ser Val Lys Ala Lys Ile Arg Ile Glu Ser Ala His Lys
1 5 10 15
Leu Thr Pro Leu Gln Met Gly Val Leu Phe His Ala Met Tyr Ala Pro
20 25 30
Asp Ser Ala Ala Tyr Phe Glu Gln Leu Phe Cys Arg Leu Asp Gly Asp
35 40 45
Ile Asp Pro Gln Gln Phe Glu Gln Ala Leu Ala Leu Leu Ala Gln Arg
50 55 60
His Ala Ile Met Arg Thr Gly Ile Val Thr Lys Gly Gln Arg Asp Pro
65 70 75 80
Leu Gln Val Val Leu Glu Lys Val Thr Val Pro Leu Thr Val Tyr Asp
85 90 95
Trp Arg Asp Arg Ser Gly Glu Val Gln Glu Ala Ala Phe Gln Arg Leu
100 105 110
Leu Asp Asp Asp Arg Gln Glu Gly Phe Asn Leu Asn Arg Pro Pro Leu
115 120 125
Met Arg Phe Ile Leu Val Gln Phe Ser Glu Arg Glu Trp Arg Leu Val
130 135 140
Trp Ser His His His Leu Leu Leu Asp Gly Trp Ser Val Gln Leu Leu
145 150 155 160
Leu Lys Asp Phe Phe Gln Leu Met Ala Gly Asn Arg Thr Glu Ala Ala
165 170 175
Ser Arg Pro Phe Ser Asp Tyr Leu Ala Trp Leu Glu Gly Gln Ser Gln
180 185 190
Glu Ala Ala Arg Asp Phe Trp Gln Ser Ile Leu Gly Asp Leu Gln Asp
195 200 205
Pro Thr Pro Leu Gly Val Asp Lys Pro Ser Gly Ala Lys Glu Lys Asp
210 215 220
Phe Ala Glu Arg Arg His Ser Leu Lys Val Pro Ala Leu Ala Asn Ala
225 230 235 240
Ala Ser Ala Cys Lys Val Ser Val Gly Thr Leu Leu Met Ala Gly Trp
245 250 255
Ala Val Leu Leu Gly His Tyr Ala Arg Arg Asp Asp Val Thr Phe Gly
260 265 270
Val Thr Leu Ser Gly Arg Ala Ile Glu Leu Pro Gly Val Asp Asn Ile
275 280 285
Val Gly Leu Leu Ile Asn Thr Leu Pro Leu Arg Leu Arg Pro Glu Pro
290 295 300
Gln Arg Lys Leu Ala Asp Trp Leu Ala Glu Val Gln Glu Ala Gln Phe
305 310 315 320
Ala Leu Gln Arg Tyr Ser Tyr Ser Ala Leu Ser Asp Ile Gln Thr Cys
325 330 335
Ser Gly Val Pro Gln Gly Thr Ser Leu Phe Glu Ser Leu Leu Ile Ile
340 345 350
Asp Asn Phe Pro Val Gly Asp Leu Arg Leu Ser Glu Gln Leu Pro Phe
355 360 365
Asp Met Ser Gly Ile Asp Met Tyr Glu Arg Thr His Tyr Pro Leu Ala
370 375 380
Leu Thr Met Val Pro Lys Glu Gly Glu Val Ser Leu Lys Leu Gly Tyr
385 390 395 400
Asp Arg Asn Arg Ile Asp Asp Val Thr Ala Glu Lys Ile Ile Lys Asp
405 410 415
Phe Glu Leu Leu Leu Asn Glu Ile Ser Asp Gly Ser Glu Asn Thr Leu
420 425 430
Gly Ala Trp Ala Gly Cys Leu Gly Ser Ala Pro Leu Thr Glu Val His
435 440 445
Thr Leu Gly Gln His Ala Trp His Asp Arg Gln Thr Glu Arg Phe Trp
450 455 460
Arg Asp Tyr Leu His Gly Val Glu Thr Ser Pro Val Gly Glu Glu Arg
465 470 475 480
Ser Ser Asp Gly Glu His Gln Arg Gln Ile Thr Glu Leu Ser Ser Glu
485 490 495
Leu Thr Gln Arg Leu Phe Pro Leu Ala Thr Ser Gln Gln Val Thr Val
500 505 510
Asn Ala Leu Val Gln Ser Ala Tyr Ala Val Ala Leu Ala Arg Leu Ser
515 520 525
Gly Arg Pro Glu Ala Leu Phe Gly Val Thr Leu Ser Ala Ala Glu Asp
530 535 540
Arg Met Val Ser Gln Ile Phe Pro Met Arg Val Asp Cys Ala Pro Gly
545 550 555 560
Ala Lys Val Ile Met Leu Ser Asp Gln Val Gln Val Leu Gln Glu Glu
565 570 575
Ile Glu Arg His Ala His Val Gln Pro Ala Asp Ile Leu Gly Trp Ala
580 585 590
Gly Phe Ala Ala Gly Gln Pro Leu Phe Asp Ser Val Leu Ile Cys Ala
595 600 605
Asp Leu Gln Thr Asp Glu Ala Ser Leu Ser Ala Asp Val Thr Glu Val
610 615 620
Leu Asn Tyr Pro His Tyr Ala Phe Thr Leu Tyr Val Lys Arg Arg Gly
625 630 635 640
Thr Gly Leu Thr Leu Glu Ala Val Phe Asp Pro Ala Arg Val Asp Ala
645 650 655
Ala Arg Ala Gly Leu Leu Leu Glu Gly Thr Cys Gly Met Leu Ala Gln
660 665 670
Leu Ala Glu Gly Ala Thr His Val Gly Ala Leu Arg Leu Thr Arg Gly
675 680 685
Arg Gln Asn Glu Thr Glu Ala Gln Ala Ser Glu Thr Gly Leu Thr Asp
690 695 700
Ala Arg Leu Gln Glu Ala Asp Ala Gly Leu Pro Glu Leu Phe Arg Arg
705 710 715 720
Ala Ala Ala His Ala Pro Ala Gln Arg Ala Val Ser Gly Ala Gly Arg
725 730 735
Glu Leu Ser Tyr Gly Gln Leu Leu Ala Glu Ser Arg Asn Phe Ala Arg
740 745 750
Arg Leu Ala Glu Asn Gly Val Arg Pro Gly Met Ala Val Ala Val Cys
755 760 765
Leu Asp Arg Gly Ala Asp Met Leu Cys Ala Leu Leu Gly Val Met Trp
770 775 780
Ala Gly Ala Glu Tyr Val Pro Val Asp Pro Thr His Pro Ala Ala Arg
785 790 795 800
Arg Ala Met Ile Leu Glu Asp Ala Ala Pro Gln Leu Val Val Val Asp
805 810 815
Ala Ala Asn Glu His Ala Phe Thr Gly Gln Pro Thr Leu Arg Tyr Val
820 825 830
Ser Asp Trp Arg Lys Ser Glu Gly Glu Leu Pro Gly Asp Ala Leu Ser
835 840 845
Pro Leu Ala Pro Ala Tyr Thr Ile Phe Thr Ser Gly Ser Thr Gly Arg
850 855 860
Pro Lys Gly Val Arg Val Thr His Gly Ala Leu Ala Asn Ile Leu Leu
865 870 875 880
His Phe Arg Thr Arg Pro Gly Leu Asp Ala Ala Asp Arg Leu Leu Ala
885 890 895
Val Thr Thr Leu Ser Phe Asp Ile Ala Ala Leu Glu Leu Phe Leu Pro
900 905 910
Leu Ser Cys Gly Ala Glu Val Val Ile Ala Thr Ala Ala Gln Ala Thr
915 920 925
Gly Gly Gly Pro Leu Ala Glu Leu Ile Ala His His Gly Ile Thr Val
930 935 940
Met Gln Ala Thr Pro Ala Ser Trp Arg Met Leu Leu Ala Ala Gly Trp
945 950 955 960
Arg Pro Pro Glu Gly Phe Arg Ala Trp Cys Gly Gly Glu Ala Leu Pro
965 970 975
Ala Glu Leu Ala Arg Asp Leu Leu Ala Ser Gly Val Gln Leu Trp Asn
980 985 990
Leu Tyr Gly Pro Thr Glu Thr Thr Ile Trp Ser Ala Glu Thr Glu Val
995 1000 1005
Thr Glu Pro Leu Ala Val Pro Leu Pro Val Gly Arg Pro Ile Arg Arg
1010 1015 1020
Thr Ala Leu Tyr Val Leu Asp Gly Ala Gly Gln Arg Leu Pro Ala Gly
1025 1030 1035 1040
Val Ser Gly Glu Leu Ala Ile Gly Gly Ala Gly Leu Ser Thr Gly Tyr
1045 1050 1055
Leu Arg Asp Pro Ala Arg Thr Ala Arg Ala Phe Arg Pro Asp Pro Ala
1060 1065 1070
Gly Ala Glu Pro Gly Ser Arg Leu Tyr Leu Thr Gly Asp Leu Ala Arg
1075 1080 1085
Glu Arg Ala Asp Gly Arg Ile Glu Val Leu Gly Arg Leu Asp His Gln
1090 1095 1100
Ile Lys Leu Asn Gly Phe Arg Ile Glu Leu Gly Glu Ile Asp Ala Ala
1105 1110 1115 1120
Leu Arg Ala Leu Pro Gly Val Arg Asp Ala Ala Ala Ala Ile His Arg
1125 1130 1135
Thr Pro Ser Gly Gly Gln Leu Ala Gly Tyr Leu Val Ala Ala Glu Asp
1140 1145 1150
Ala Pro Ala Asp Ala Ala Trp Leu Glu Ala Leu Thr Gly Ala Leu Pro
1155 1160 1165
Arg Tyr Met Leu Pro Thr Ala Leu Val Arg Met Pro Ala Leu Pro Leu
1170 1175 1180
Thr Ala Asn Gly Lys Ile Asp Arg Lys Ala Leu Pro Gln Pro Gln Ile
1185 1190 1195 1200
Arg Asn Thr Ser Tyr Val Ser Pro Arg Thr Pro Glu Gln Lys Thr Leu
1205 1210 1215
Ala Ala Ile Trp Gln Glu Val Leu Gly Val Glu Gln Val Gly Ile Thr
1220 1225 1230
Asp Asn Tyr Phe Ser Leu Gly Gly Asn Ser Ile Leu Ser Ile Arg Val
1235 1240 1245
Val Thr Gln Ala Ala Ala Gln Gly Ile Arg Leu Asn Ile Glu Asp Leu
1250 1255 1260
Phe Gln Lys Leu Thr Ile Glu Arg Leu Thr Glu Ser Asn Ser Thr Pro
1265 1270 1275 1280
Val Gln Ala Ala Glu Ala Pro His Ile Asp Ala Phe Ala Leu Leu Thr
1285 1290 1295
Glu Glu Asp Arg Arg Ala Val Pro Glu Gly Ala Val Asp Gly Tyr Pro
1300 1305 1310
Leu Ser Glu Leu Gln Ala Gly Met Leu Phe His Asn Gly Ala Asp Glu
1315 1320 1325
Thr Asn Arg Leu Tyr His Asn Val Val Ser Tyr Leu Leu Asp Asn Pro
1330 1335 1340
Ala Met Asp Thr Gly Leu Val Arg Gln Arg Leu Asn Lys Leu Ile Ala
1345 1350 1355 1360
Leu His Pro Val Leu Arg Thr Gly Phe Ser Leu Ala Gly Tyr Ser Arg
1365 1370 1375
Pro Leu Gln Trp Val Tyr Ala Gln Ala Glu Pro Leu Ile Glu Glu Glu
1380 1385 1390
Asp Leu Arg Lys Ala Ser Glu Val Ala Gln His Thr Leu Ile Gly Arg
1395 1400 1405
Ala Gln Gln Arg Leu Arg Glu Glu Arg Phe Asp Leu Ala Lys Pro Pro
1410 1415 1420
Leu Leu Arg Met Leu Phe Gln Arg Leu Asp Asp Ser Arg Trp Gln Val
1425 1430 1435 1440
Thr Val Ala Leu His His Val Ile Leu Asp Gly Trp Ser Leu Ala Ser
1445 1450 1455
Leu Leu Thr Gly Leu Leu Gln Asp Glu Thr Ala Glu Ser Thr Ala Glu
1460 1465 1470
Pro Gln His Ile Phe Arg Asp Phe Ile His Leu Glu Gln Gln Ala Leu
1475 1480 1485
His Ser Thr Arg Asp His Thr Phe Trp Gln Lys Gln Leu Lys Asp Leu
1490 1495 1500
Pro Val Thr Thr Leu Pro Arg Trp Pro Phe Thr Asp Lys Asn Ala Glu
1505 1510 1515 1520
Ser Ala Gln Ala Ser Tyr Glu Thr Ala Leu Pro Pro Ala Leu Tyr Gln
1525 1530 1535
Gly Leu Ala Ala Leu Ala Lys Glu Lys Gly Met Pro Leu Lys Ser Val
1540 1545 1550
Leu Leu Ala Ile His Met Arg Val Leu Ala His Trp Ser Gly Glu Cys
1555 1560 1565
Glu Val Val Thr Gly Leu Val Thr Asn Gly Arg Pro Glu Ser Ala Gly
1570 1575 1580
Ser Ala Asp Ala Leu Gly Leu Phe Leu Asn Thr Leu Pro Met Arg Ile
1585 1590 1595 1600
Asn Thr Gly Gly Leu Thr Gly Asn Glu Leu Leu Glu Ala Val Arg Gln
1605 1610 1615
Ala Glu Ser Ala Gln Leu Pro His Arg Arg Phe Ala Met Asn Glu Leu
1620 1625 1630
Arg Arg Met Leu Gln Asn Arg Thr Leu Phe Glu Thr Thr Phe Asn Phe
1635 1640 1645
Val Asp Phe His Val Tyr Asn Asp Ala Ala Thr Ser Gly Gly Asp Arg
1650 1655 1660
Phe Asp Pro Val Lys Ile Leu Asn Ala Ala Gly Ser Gln Ala Leu Asp
1665 1670 1675 1680
Ile Pro Leu Ala Thr Ser Phe Ser Val Asp Arg Gln Gln Gly Thr Leu
1685 1690 1695
Gln Leu Ile Leu Thr Cys Asp Gly Thr Arg Phe Pro Ala Ala Gln Val
1700 1705 1710
Glu Ala Met Ser Ala Ser Tyr Leu Arg Ala Ala Glu Thr Leu Leu Asn
1715 1720 1725
Val Thr Glu Glu Val Cys Asp Ser Met Ser Leu Ile Ser Ala Glu Glu
1730 1735 1740
Arg Asp Glu Met Ala Gln Arg Ser Phe Gly Ala Ser Ser Val Ser Gln
1745 1750 1755 1760
Pro Val Leu Gln Ala Phe Gln Thr Met Val Glu Arg His Pro Gln Ala
1765 1770 1775
Pro Ala Val Val Ser Ala Asp Gly Glu Met Asp Tyr Ala Thr Leu Asp
1780 1785 1790
Arg Arg Ala Ser Glu Leu Ala Ala Gln Met Gln Arg Ala Gly Leu Arg
1795 1800 1805
Pro Asp Val Pro Val Ala Leu Leu Phe Glu Arg Ser Pro Asp Leu Val
1810 1815 1820
Val Ala Met Leu Ala Ala Met Lys Thr Ala Cys Pro Tyr Val Pro Leu
1825 1830 1835 1840
Ala Pro Tyr Leu Pro Gln Gly Arg Leu Ala Glu Ile Leu Ala Asp Val
1845 1850 1855
Arg Pro Gln Ala Thr Leu Thr Val Gln Ala Leu Gln His Ile Leu Pro
1860 1865 1870
Glu Ala Ser Asp Ala Gly Tyr Ile Phe Ala Leu Asp Ala Leu Pro Glu
1875 1880 1885
Thr Leu Tyr Pro Leu Pro Glu Leu Pro Gln Ala His Pro Ala Thr Leu
1890 1895 1900
Ala Tyr Ile Leu Phe Thr Ser Gly Ser Thr Gly Lys Pro Lys Gly Ile
1905 1910 1915 1920
Gly Ile Pro Thr Gly Ala Leu Ala Asn His Met Ala Trp Met Gln Arg
1925 1930 1935
Arg Phe Pro Leu Thr Ser Ala Asp Arg Val Leu Gln Lys Thr Pro Val
1940 1945 1950
Gly Phe Asp Ala Ser Val Trp Glu Phe Trp Ala Pro Leu Met Ala Gly
1955 1960 1965
Ala Thr Leu Val Leu Pro Ala Asp Gly Val Glu Asn Asp Ala Ile Ala
1970 1975 1980
Met Leu Glu Val Val Gln Arg His Ala Ile Thr Val Leu Gln Leu Val
1985 1990 1995 2000
Pro Gly Val Leu Asp Met Leu Thr Arg Leu Pro Glu Leu Thr Ala Cys
2005 2010 2015
Thr Ser Leu Arg Arg Val Phe Val Gly Gly Glu Ala Leu Gln Ala Ser
2020 2025 2030
Thr Ile Glu Arg Phe Asn Ser Val Leu Gly Val Pro Leu Ile Asn Leu
2035 2040 2045
Tyr Gly Pro Thr Glu Thr Thr Ile Asp Thr Thr Phe Ala Cys Tyr Cys
2050 2055 2060
Gly Asp Val Gly Glu Val Val Ser Ile Gly Glu Pro Ile Asp Gly Val
2065 2070 2075 2080
Ser Val Tyr Val Leu Asp Gln Arg Met Gln Pro Ala Gly Val Gly Ile
2085 2090 2095
Tyr Gly Glu Leu Trp Ile Gly Gly Ala Gly Leu Ala Arg Gly Tyr Trp
2100 2105 2110
Asn Arg Ala Thr Glu Thr Ala Ala Gly Phe Arg Pro Asp Pro Phe Ser
2115 2120 2125
Val Gln Pro Gly Glu Arg Met Phe Arg Thr Arg Asp Val Val Arg Trp
2130 2135 2140
Leu Pro Gly Gly Gly Leu Gln Tyr Ala Gly Arg Ser Asp Ser Gln Ile
2145 2150 2155 2160
Lys Leu Arg Gly Asn Arg Ile Glu Leu Ala Asp Ile Glu Ala Val Leu
2165 2170 2175
Ser Arg Gln Pro Gly Val Thr Arg Ser Ala Val Arg Val Cys Ala Glu
2180 2185 2190
Lys Pro Gly Gln Leu Val Ala Trp Val Met Gly Pro Ala Ala Leu Glu
2195 2200 2205
Ala Ala Pro Leu Ile Ala Ala Leu Arg Asn His Leu Pro Asp Tyr Met
2210 2215 2220
Leu Pro Gln Arg Ile Ile Ala Val Asn Ser Trp Pro Leu Thr Pro Asn
2225 2230 2235 2240
Gly Lys Thr Asp His Ala Ala Leu Ala Lys Phe Ala Ala Ile Thr Glu
2245 2250 2255
Pro Ala Ser Ala Val Val Pro Pro Glu Ser Glu Ile Glu Ser Glu Leu
2260 2265 2270
Val Ala Ile Trp Gln Lys Leu Leu Pro Gln Leu Thr Leu Gly Ile Thr
2275 2280 2285
Asp Asn Phe Phe Glu Val Gly Gly Asp Ser Ile Leu Ala Met Gln Ile
2290 2295 2300
Ala Ala Glu Met Arg Arg Lys Gly Trp Ser Ile Thr Pro Arg His Leu
2305 2310 2315 2320
Phe Glu His Pro Thr Ile Arg Glu Leu Ala Ala Val Ile Ile Pro Ser
2325 2330 2335
His Asn Glu Lys Gln Pro Asp Tyr Val Ala Pro Val Gly Pro Leu Pro
2340 2345 2350
Leu Ser Pro Val Gln Arg Trp Phe Phe Glu Leu Glu Leu Ser Asp Arg
2355 2360 2365
Asn His Trp Asn Gln Ala Val Met Leu Arg Val Pro Gln His Ile Gln
2370 2375 2380
Pro His Arg Leu His Lys Thr Leu Glu Arg Leu Val Ser Leu His Glu
2385 2390 2395 2400
Ala Phe Arg Leu Arg Phe Leu Gln Lys Glu Ala Ser Trp Phe Ala Arg
2405 2410 2415
Leu Glu Glu Asn Ala Gly Asp Trp Tyr Ser Ser Leu Asn Val Ser Asp
2420 2425 2430
Leu Ser Ala Val Glu Tyr Arg Glu Val Thr Asp Thr Leu Val Glu Thr
2435 2440 2445
Thr Gln Arg Ser Leu Asn Leu Glu Gln Gly Pro Leu Phe Lys Ala Val
2450 2455 2460
His Leu Asp Lys Gly Leu Glu Val Glu Gly Arg Leu Leu Leu Val Ile
2465 2470 2475 2480
His His Leu Ile Val Asp Gly Val Ser Trp Arg Ile Leu Leu Asn Glu
2485 2490 2495
Ile Asn Leu Leu Leu Asn Gly Val Glu Leu Ala Val Pro Ala Pro Gly
2500 2505 2510
Phe Gly Gly Trp Leu Ala Leu Gln Asp Lys Tyr Glu Met Pro Lys Ala
2515 2520 2525
Val Leu Asp Tyr Trp Leu Gly Gln Ala Thr Lys Ser Ser Glu Ser Phe
2530 2535 2540
Arg Ala Pro Ser Phe Ile Gln Pro Gln His Ser Gly His Tyr Ser Gln
2545 2550 2555 2560
Val Arg Thr Ile Glu Lys Ser Phe Gly Asn Pro Ile Ala Gln Arg Leu
2565 2570 2575
Ile Asp His Ser Gln Leu His Leu Lys Ala Arg Pro Leu Glu Leu Leu
2580 2585 2590
Leu Thr Ser Val Leu Cys Ala Met Gly Arg Trp Ala His Glu Asp Arg
2595 2600 2605
Ile Ala Leu Thr Leu Glu Gly His Gly Arg Asp Ser Thr Gly Asp Trp
2610 2615 2620
Thr Leu Glu Arg Thr Pro Gly Trp Phe Thr Val Leu Tyr Pro Val Met
2625 2630 2635 2640
Phe Asp Leu Lys Asp Thr Asp Ser Glu Met Thr Val Leu Gln Thr Val
2645 2650 2655
Lys Lys Thr Leu Arg Glu Ile Pro Asp Gly Gly Tyr Gly Tyr Gly Gln
2660 2665 2670
Leu Arg Asp Gly Glu Pro Leu Pro Pro Val Ser Phe Asn Tyr Leu Gly
2675 2680 2685
Gln Phe Glu Glu Ser Asn Glu Arg Gly Leu Thr Val Val Asp Glu Ala
2690 2695 2700
Val Gly Asp Asn Glu Asp Pro His Gly Lys Arg Pro Phe Pro Leu Glu
2705 2710 2715 2720
Ile Val Ala Phe Ile Arg Ala Gly Lys Leu Thr Leu Arg Cys Val Phe
2725 2730 2735
Asp Asp Arg Ile Pro Glu Ala Ala Asn Ile Thr Ala Met Leu Asp Ser
2740 2745 2750
Ala Ala Asp Trp Leu Gln Lys Met Leu Ala Cys Glu Asp Val Ser Ala
2755 2760 2765
Ala Trp Thr Leu His Asp Phe Pro Leu Ala Asp Val Glu Glu Arg Gly
2770 2775 2780
Leu Ala Ile Ala Leu Gly Asp Ala Gly Asp Asn Leu Ala Asp Leu Trp
2785 2790 2795 2800
Lys Thr Thr Pro Thr Gln Gln Gly Met Leu Phe His Ser Arg Leu Glu
2805 2810 2815
Asn Asp Ala Ser Glu Val Tyr Leu Glu Gln Ile Val Met Arg Leu His
2820 2825 2830
Glu Glu Met Asp Thr Asp Leu Leu Ala Gln Ala Trp Asn Met Val Ile
2835 2840 2845
Asn Arg His Asp Ala Leu Arg Val Ser Phe Val Trp Glu Asp Leu Asp
2850 2855 2860
His Pro Gln Gln Arg Val Trp Arg Ser Val Gln Val Pro Phe Glu Thr
2865 2870 2875 2880
Val Asp Leu Lys Gly Asp Ala Ala Glu Leu Glu Ala Phe Met Thr Ala
2885 2890 2895
Asp Arg Gln Arg Gly Ile Asp Leu Ser Val Ala Pro Met Met Arg Val
2900 2905 2910
Ser Leu Leu Arg Lys Gln Gly Lys Pro Trp Arg Leu Val Trp Leu His
2915 2920 2925
His His Ala Leu Leu Asp Gly Trp Ser Met Ala Leu Ile Phe Asn Asp
2930 2935 2940
Leu Ala Glu Cys Tyr His Ala Leu Lys Leu Asn Gln Asn Trp Pro Thr
2945 2950 2955 2960
Asn Thr Ala Pro Ser Tyr Ala Thr Tyr Leu Arg Trp Leu Lys Gln Gln
2965 2970 2975
Ser Ala Thr Gln Glu Ser Ala Glu Arg Phe Trp Arg Asp Tyr Phe Gln
2980 2985 2990
Gly Leu Glu Leu Ala Ser Pro Val Gly Glu Glu Ser Thr Arg Thr Gly
2995 3000 3005
Ile His Gln Arg Leu Thr Asn Lys Leu Ser Pro Ala Leu Thr Gln Arg
3010 3015 3020
Leu Ser Gln Leu Ala Ser Ser Gln Gln Val Thr Val Asn Thr Leu Val
3025 3030 3035 3040
Gln Ser Ala Tyr Ala Val Ala Leu Ala Arg Leu Ser Gly Arg Pro Glu
3045 3050 3055
Ala Leu Phe Gly Val Thr Leu Ser Gly Arg Pro Ala Glu Leu Ala Gln
3060 3065 3070
Ser Glu Asn Ile Val Gly Leu Phe Ile Gln Thr Leu Pro Met Arg Val
3075 3080 3085
Asn Cys Ala Pro Gly Thr Asp Ile Ala Thr Leu Ala Gly Arg Val Gln
3090 3095 3100
Thr Leu Gln Gly Glu Ile Glu Arg His Ala His Val Gln Pro Ala Asp
3105 3110 3115 3120
Ile Gln Arg Trp Ser Gly Phe Ala Ala Gly Gln Pro Leu Phe Asp Ser
3125 3130 3135
Val Leu Ile Tyr Glu Asn Tyr Pro Leu Gly Gln Gly Leu Val Asp Ala
3140 3145 3150
Ser Asp Ser Leu Asn Ala Asp Val Thr Glu Val Leu Asp His Pro His
3155 3160 3165
Tyr Ala Phe Ser Leu Tyr Val Lys Pro Arg Gly Ala Gly Leu Thr Leu
3170 3175 3180
Glu Ala Val Phe Asp Pro Ala Arg Val Asp Ala Ala Arg Ala Gly Leu
3185 3190 3195 3200
Leu Leu Glu Gly Thr Cys Gly Met Leu Ala Gln Leu Ala Glu Gly Ala
3205 3210 3215
Thr His Val Gly Ala Leu Arg Leu Thr Arg Gly Arg Gln Asn Glu Thr
3220 3225 3230
Glu Ala Gln Ala Ser Glu Thr Gly Leu Thr Asp Ala Arg Leu Gln Glu
3235 3240 3245
Ala Asp Ala Gly Leu Pro Glu Leu Phe Arg Arg Ala Ala Ala His Ala
3250 3255 3260
Pro Ala Gln Arg Ala Val Ser Gly Ala Gly Arg Glu Leu Ser Tyr Gly
3265 3270 3275 3280
Gln Leu Leu Ala Glu Ser Arg Asn Phe Ala Arg Arg Leu Ala Glu Asn
3285 3290 3295
Gly Val Arg Pro Gly Met Ala Val Ala Val Cys Leu Asp Arg Gly Ala
3300 3305 3310
Asp Met Leu Cys Ala Leu Leu Gly Val Met Trp Ala Gly Ala Glu Tyr
3315 3320 3325
Val Pro Val Asp Pro Thr His Pro Ala Ala Arg Arg Ala Met Ile Leu
3330 3335 3340
Glu Asp Ala Ala Pro Gln Leu Val Val Val Asp Ala Ala Asn Glu His
3345 3350 3355 3360
Ala Phe Thr Gly Gln Pro Thr Leu Arg Tyr Val Ser Asp Trp Arg Lys
3365 3370 3375
Ser Glu Gly Glu Leu Pro Gly Asp Ala Leu Ser Pro Leu Ala Pro Ala
3380 3385 3390
Tyr Thr Ile Phe Thr Ser Gly Ser Thr Gly Arg Pro Lys Gly Val Arg
3395 3400 3405
Val Thr His Gly Ala Leu Ala Asn Ile Leu Leu His Phe Arg Thr Arg
3410 3415 3420
Pro Gly Leu Asp Ala Ala Asp Arg Leu Leu Ala Val Thr Thr Leu Ser
3425 3430 3435 3440
Phe Asp Ile Ala Ala Leu Glu Leu Phe Leu Pro Leu Ser Cys Gly Ala
3445 3450 3455
Glu Val Val Ile Ala Thr Ala Ala Gln Ala Thr Gly Gly Gly Pro Leu
3460 3465 3470
Ala Glu Leu Ile Ala His His Gly Ile Thr Val Met Gln Ala Thr Pro
3475 3480 3485
Ala Ser Trp Arg Met Leu Leu Ala Ala Gly Trp Arg Pro Pro Glu Gly
3490 3495 3500
Phe Arg Ala Trp Cys Gly Gly Glu Ala Leu Pro Ala Glu Leu Ala Arg
3505 3510 3515 3520
Asp Leu Leu Ala Ser Gly Val Gln Leu Trp Asn Leu Tyr Gly Pro Thr
3525 3530 3535
Glu Thr Thr Ile Trp Ser Ala Glu Thr Glu Val Thr Glu Pro Leu Ala
3540 3545 3550
Val Pro Leu Pro Val Gly Gly Pro Ile Arg Arg Thr Ala Leu Tyr Val
3555 3560 3565
Leu Asp Gly Ala Gly Gln Arg Leu Pro Ala Gly Val Ser Gly Glu Leu
3570 3575 3580
Ala Ile Gly Gly Ala Gly Leu Ser Thr Gly Tyr Leu Arg Asp Pro Ala
3585 3590 3595 3600
Arg Thr Ala Arg Ala Phe Arg Pro Asp Pro Ala Gly Ala Glu Pro Gly
3605 3610 3615
Ser Arg Leu Tyr Leu Thr Gly Asp Leu Ala Arg Glu Arg Ala Asp Gly
3620 3625 3630
Arg Ile Glu Val Leu Gly Arg Leu Asp His Gln Ile Lys Leu Asn Gly
3635 3640 3645
Phe Arg Ile Glu Leu Gly Glu Ile Asp Ala Ala Leu Arg Ala Leu Pro
3650 3655 3660
Gly Val Arg Asp Ala Ala Ala Ala Ile His Arg Thr Pro Ser Gly Gly
3665 3670 3675 3680
Gln Leu Thr Gly Tyr Leu Val Ala Ala Glu Asp Ala Pro Ala Asp Ala
3685 3690 3695
Ala Trp Leu Glu Ala Leu Ala Gly Ala Leu Pro Arg Tyr Met Leu Pro
3700 3705 3710
Thr Ala Leu Val Arg Met Pro Ala Leu Pro Leu Thr Ala Asn Gly Lys
3715 3720 3725
Ile Asp Arg Lys Ala Leu Ala Glu Ile Glu Val Thr Glu Arg Asn Ala
3730 3735 3740
Ser Phe Leu Pro Pro Asn Gly Pro Val Glu Thr Ala Val Cys Ala Ile
3745 3750 3755 3760
Trp Gln Thr Val Phe Ser Leu Glu Gln Val Gly Val Glu Asp Asp Phe
3765 3770 3775
Tyr Ala Leu Gly Gly His Ser Leu Met Ala Thr Gln Ile His Thr Arg
3780 3785 3790
Leu Val Arg Ile Phe Arg Ile Ser Pro Pro Leu Gly Glu Val Phe Arg
3795 3800 3805
Ala Thr Thr Pro Arg Glu Leu Thr Ala Val Ile Tyr Ala His Ser Asp
3810 3815 3820
Lys Gly Arg Ala Thr Gln Met Ala Glu Ala Tyr Leu Arg Leu Arg Ala
3825 3830 3835 3840
Met Thr Pro Glu Gln Arg Gln Ala Leu Arg Asn Glu Gly Ser Leu Ile
3845 3850 3855
Thr Gly Gly Ser Ala
3860
<210> 6
<211> 254
<212> PRT
<213> 成团泛菌(Pantoea agglomerans)
<400> 6
Met Asn Trp Glu Lys Ser Val Ala Ile Val Thr Gly Ala Gly Gly Gly
1 5 10 15
Ile Gly Gly Thr Phe Val Arg Gln Leu Leu Ser Gly Gly Cys Arg Val
20 25 30
Val Ala Ile Asp Lys Gln Ser Asp Arg Leu Glu Glu Leu Ala Val Ala
35 40 45
Cys Gln Ala Trp Arg Asp Ala Leu Ala Ile Arg Pro Val Asp Ile Thr
50 55 60
Asn Glu Ala Glu Ile Arg Ala Thr Phe Thr Asp Leu Ser Met His Phe
65 70 75 80
Gly Val Pro Glu Ile Leu Val Asn Asn Ala Gly Val Leu Arg Asp Gly
85 90 95
Leu Leu Ile Lys Lys Glu Ala Asp Ser Tyr Val Arg Lys Leu Pro Thr
100 105 110
Ala Gln Trp Arg Gly Val Leu Glu Ala Asn Leu Thr Gly Thr Tyr Leu
115 120 125
Met Ser Arg Glu Phe Ala Ala Ile Arg Ser Gln Gln Ala Gly Glu Gly
130 135 140
Val Ile Val Asn Ile Ser Ser Val Thr Ser Ala Gly Asn Pro Gly Gln
145 150 155 160
Ser Ala Tyr Ala Ala Ser Lys Ala Gly Met Asp Ala Leu Thr Arg Thr
165 170 175
Trp Ala Leu Glu Leu Ala Asp Ser His Ile Arg Val Val Gly Ile Ala
180 185 190
Pro Gly Leu Thr Asp Thr Pro Met Ala Arg Ala Leu Pro Glu Thr Glu
195 200 205
Leu Asn Asp Met Leu Lys Asn Ile Pro Leu Glu Arg Met Ala Thr Pro
210 215 220
Leu Glu Ile Trp Gln Gly Leu Arg Phe Ala Leu Glu Cys Asp Tyr Phe
225 230 235 240
Asn Gly Arg Ile Leu Thr Ile Asp Gly Gly Ala Gly Phe Cys
245 250
<210> 7
<211> 70
<212> PRT
<213> 成团泛菌(Pantoea agglomerans)
<400> 7
Met Ser Ala Ile Glu Asn Asn Ala Val Thr Tyr Phe Val Val Met Asn
1 5 10 15
His Glu Glu Gln Tyr Ser Ile Trp Pro Thr Tyr Arg Asp Ile Pro Ala
20 25 30
Gly Trp Gln Gln Val Gly Glu Pro Ala Ser Glu Gln Glu Cys Leu Ala
35 40 45
His Ile Glu Lys Val Trp Thr Asp Met Arg Pro Leu Ser Leu Arg Lys
50 55 60
Ala Met Glu Asp Asn Arg
65 70
<210> 8
<211> 553
<212> PRT
<213> 成团泛菌(Pantoea agglomerans)
<400> 8
Met Asn Met Tyr Arg Leu Leu Ser Glu Arg Pro Gln Gly Ala Pro Arg
1 5 10 15
Leu Ser Met Leu Leu Ala Ala Gln Ser Leu Ala Gly Leu Ala Gly Ala
20 25 30
Gly Leu Val Ala Ile Leu Thr Gln Ala Ala His Ala Val Glu Gln Gln
35 40 45
Gly Lys Ala Leu Ser Leu Val Ala Leu Thr Ala Leu Thr Leu Phe Met
50 55 60
Phe Leu Phe Ser Gln Arg Tyr Ala Met Arg Cys Thr Ala Leu Arg Val
65 70 75 80
Glu Arg Ser Ile His Asn Val Arg Val Arg Val Val Asp Lys Leu Thr
85 90 95
Arg Ile Asp Leu Gln Thr Tyr Glu Gln Ile Gly Glu Lys Asn Leu Met
100 105 110
Ala Cys Val Glu Lys Asp Ile Lys Thr Met Ser Asn Ala Cys Thr Ala
115 120 125
Ile Ile Ala Ser Gly Gln Ser Val Met Leu Phe Val Cys Ala Ala Ala
130 135 140
Tyr Leu Ala Trp Leu Ser Leu Pro Ala Phe Leu Leu Thr Ala Gly Val
145 150 155 160
Ile Val Leu Gly Val Ala Leu Asn Phe Ile Arg Met Arg Ala Ile Phe
165 170 175
Asn Ala Thr Glu Gln Ala Leu Gln Ser Glu Asn Ser Leu Ser Gly Leu
180 185 190
Thr Ser His Ile Ile Arg Gly Phe Lys Glu Leu Lys Leu His Gln Lys
195 200 205
Arg Arg Arg Glu Val Tyr Glu Glu Leu Val Glu Ala Ser Asp Gln Thr
210 215 220
Ala Ser Leu Asn Gln Lys Ala Phe Gly Leu Ala Thr Asp His Met Ile
225 230 235 240
Met Leu Gln Ser Ile Leu Tyr Ile Leu Ile Gly Leu Val Ile Phe Val
245 250 255
Leu Pro Met Ala Gly Gln Met Gln Thr Leu Leu Gln Val Gln Val Ile
260 265 270
Ala Val Ile Leu Phe Leu Asn Gly Pro Leu Ser Gln Phe Ile Gly Ile
275 280 285
Leu Pro Met Tyr Ala Gln Ala Asn Ala Ala Ala Lys Ser Ile Gly Glu
290 295 300
Leu Glu Gln Gln Leu Asp Ala Ala Ala Asn Arg Asp Pro Asp Leu Pro
305 310 315 320
Asp Ala Val Ile Glu Pro Met Arg Ser Ile Glu Leu Lys Asp Val Arg
325 330 335
Phe Ala Tyr Glu Ala Asn Glu Gly Pro Ala Phe Glu Ile Ala Pro Leu
340 345 350
Asn Leu Leu Ile Phe Gln Gly Glu Val Ile Phe Val Thr Gly Gly Asn
355 360 365
Gly Ser Gly Lys Ser Thr Phe Leu Lys Leu Leu Thr Gly Leu Arg Phe
370 375 380
Ala Ser His Gly Asp Val Leu Leu Asn Gly Glu Arg Val Asn Lys Pro
385 390 395 400
Glu Lys Val Ala Gly Tyr Arg Gly Leu Phe Ser Ala Ile Phe Ala Asp
405 410 415
Tyr His Leu Phe Thr Lys Leu Tyr Gly Thr Glu Val Pro His Arg Ser
420 425 430
Leu Ile Ser Glu Gln Leu Ser Arg Leu Ala Leu Asp Gly Lys Val Arg
435 440 445
Leu Asp Gly Arg Ile Phe Thr Pro Leu Asn Leu Ser Thr Gly Gln Arg
450 455 460
Lys Arg Leu Ala His Leu Val Thr Leu Leu Glu Asp Arg Gln Ile Tyr
465 470 475 480
Ile Phe Asp Glu Trp Ala Ala Asp Gln Asp Pro His Phe Arg Ser Trp
485 490 495
Phe Tyr Arg Glu Glu Leu Pro Arg Leu Lys Ala Leu Gly Lys Thr Ile
500 505 510
Ile Ala Val Thr His Asp Glu Gln Tyr Phe Glu His Ala Asp Arg Trp
515 520 525
Phe His Phe Glu Glu Gly Arg Cys Glu Glu Arg Phe Phe Lys Ser Ala
530 535 540
Val Pro Val Arg Leu Phe Pro Glu His
545 550
<210> 9
<211> 2799
<212> PRT
<213> 成团泛菌(Pantoea agglomerans)
<400> 9
Met Ser Val Ser Gly Asn Leu Asp Gln Asn Val Gly Phe Asp Glu Asp
1 5 10 15
Leu Asp Leu Leu Asp Ala Leu Leu Ala Glu Asp Leu Leu Glu Gln Gln
20 25 30
Pro Ala Ile Ala Ala Gln Ala Val Asn Lys Gly Pro Leu Ser Phe Gln
35 40 45
Gln Glu Arg Leu Trp Phe Leu Ser Glu Leu Asp Pro Asp Ala Ala Ala
50 55 60
Tyr Thr Ile Phe Asn Ala Phe Arg Leu His Gly Gln Leu Asn Glu Gln
65 70 75 80
Ala Leu Cys Ala Ala Leu Glu Thr Leu Val Glu Arg His Glu Ala Leu
85 90 95
Arg Thr Ala Ile Asp Asn Gln Asn Gly Gln Ala Glu Gln Arg Ile Met
100 105 110
Pro Gly Tyr Met Pro Val Gln Lys Ser Val Asp Leu Thr His Ser Ala
115 120 125
Glu Lys Asp Val Asp Glu Ala Leu His Lys Leu Leu Arg Ser Glu Ala
130 135 140
Ala Arg Pro Phe Val Leu Thr Asp Gly Lys Pro Phe Arg Ala Val Leu
145 150 155 160
Ala Lys Leu Gly Ser Asp Glu His Ala Leu Met Leu Ser Leu His His
165 170 175
Ile Ile Ser Asp Ala Trp Ser Met Thr Val Leu Met Ser Glu Leu Ala
180 185 190
Val Leu Tyr His Ala Tyr Ala Arg Asn Glu Arg Pro Ile Leu Pro His
195 200 205
Gln Pro Val Arg Tyr Leu Asp Tyr Ala Leu Trp Gln Arg Gly Asn Gly
210 215 220
Ser Ala Gln Glu Arg Glu Asn Lys Glu Met Asn Tyr Trp Leu Ser Glu
225 230 235 240
Leu Gln Asp Leu Pro Leu Leu Glu Leu Pro Cys Asp Leu Pro Arg Pro
245 250 255
His Lys Gln Thr Phe Asn Gly Ala Thr Ile Ser Phe Gln Val Pro Asp
260 265 270
Ala Thr Thr Arg Ala Leu Gln Met Leu Ala His Gly Glu Arg Ser Thr
275 280 285
Leu Phe Ser Leu Met Met Ala Ala Leu His Val Leu Met Gly Arg His
290 295 300
Ala Arg Gln Thr Asp Ile Ala Ile Gly Thr Ser Ile Ala Gly Arg Asp
305 310 315 320
Asn Pro Glu Leu Glu Gly Leu Ile Gly Phe Phe Ala Asn Met Val Val
325 330 335
Ile Arg Ala Arg Leu Glu Ser Asp Pro Ser Phe Arg Glu Leu Leu Arg
340 345 350
Thr Thr Thr Gly Lys Val His Ala Ala Met Glu His Gly Thr Leu Ala
355 360 365
Tyr Asp Arg Leu Val Glu Gly Met Lys Ile Ala Arg Asp Pro Ser Arg
370 375 380
Asn Pro Leu Phe Gln Ile Ala Met Thr Met Leu Asn Leu Pro Ala Thr
385 390 395 400
Arg Met Ser Leu Gly Thr Leu Glu Ala Glu Arg Leu Leu Ser Gln Glu
405 410 415
Ala Ala Arg Phe Asp Leu Glu Leu Phe Leu Ser Glu Ser Asp Gly Thr
420 425 430
Leu Ser Gly Thr Phe Val Tyr Asn Thr Asp Leu Phe Leu Pro Ala Ser
435 440 445
Val Asn Arg Leu Thr Glu Gln Trp Leu Ile Leu Leu Ala Asp Ile Ala
450 455 460
Val Ser Pro Asp Lys Pro Val Ser Arg Leu Ala Leu Val Lys Glu Gln
465 470 475 480
Ala Pro Leu Leu Pro Leu Pro Leu Leu Ala Glu Pro Leu Pro Phe Arg
485 490 495
Pro Leu His Glu Lys Ile Leu Leu His Ala Glu Met Tyr Pro Asp Arg
500 505 510
Arg Ala Leu Arg Leu Gly Glu Glu Ser Leu Ser Tyr Gly Glu Leu Ala
515 520 525
Ala Gln Ala Arg Arg Ile Ala His Ala Leu Leu Ala Ala Gly Ile Lys
530 535 540
Ala Glu Val Pro Val Gly Leu Trp Phe Glu Pro Gly Phe Asp Met Ile
545 550 555 560
Ala Ala Met Leu Gly Thr Trp Met Ala Gly Gly Ala Tyr Leu Pro Val
565 570 575
Asp Leu His Ser Pro Ala Glu Arg Ile Thr Thr Ile Leu Glu Asp Ser
580 585 590
Gln Val Lys Phe Ile Leu Ser Asp Thr Ala Ser Val Ala Ser Leu Pro
595 600 605
Val Phe Val Gly Thr Val Leu Cys Ile Asp Glu Thr Asp Glu Pro Pro
610 615 620
Ala Gly Glu Leu Pro Gln Val Ser Ala His Gln Leu Ala Tyr Ile Ile
625 630 635 640
Tyr Thr Ser Gly Ser Thr Gly Arg Pro Lys Gly Val Glu Ile Thr His
645 650 655
Ala Asn Val Ala Arg Leu Phe Thr Val Cys Asp Ser Leu Phe Glu Phe
660 665 670
Asp Arg Asn Asp Val Trp Thr Phe Phe His Ser Tyr Ala Phe Asp Phe
675 680 685
Ser Val Trp Glu Ile Trp Gly Ala Leu Val His Gly Ala Ser Leu Leu
690 695 700
Ile Val Pro Pro Ile Val Ala Arg Thr Thr Asp Ser Phe Tyr Asp Leu
705 710 715 720
Leu Cys Glu Lys Lys Val Thr Val Leu Ser Gln Thr Pro Ser Ala Phe
725 730 735
Arg Gln Leu Met Ala Ala Glu Glu Ala Asn Pro Arg Glu Gly Asp Leu
740 745 750
Ala Leu Arg Tyr Val Val Phe Gly Gly Glu Ala Leu Asp Ile Ala Ser
755 760 765
Leu Ala Ser Trp Met Asp Arg His Gly Asp Glu Glu Pro Arg Leu Val
770 775 780
Asn Met Tyr Gly Ile Thr Glu Ile Thr Val His Ala Thr Phe Arg Leu
785 790 795 800
Ile Thr Trp Arg Asp Leu Ser Arg Ala Ser Ser Ser Val Ile Gly Thr
805 810 815
Pro Leu Pro Asp Leu Cys Leu Arg Leu Leu Asp Pro His Gly Glu Pro
820 825 830
Val Pro Gln Gly Met Val Gly Glu Ile Phe Val Gly Gly Ala Gly Val
835 840 845
Ala Arg Gly Tyr Arg Tyr Gln Pro Glu Leu Thr Ala Ala Arg Phe Gln
850 855 860
His Asp Ala Ser Gly Met Pro Phe Tyr Arg Ser Gly Asp Leu Ala Arg
865 870 875 880
Ile Asn Val Trp Gly Glu Met Glu Tyr Arg Gly Arg Ala Asp Ser Gln
885 890 895
Ile Lys Leu Arg Gly Tyr Arg Ile Glu Thr Gly Glu Ile Glu Asn Thr
900 905 910
Leu Arg Arg His Pro Ala Ile Asp Asp Ala Val Val Val Val Arg Gly
915 920 925
Gln Gln Glu Ala Ala Arg Leu Val Ala Tyr Val Arg Lys Arg Gln Thr
930 935 940
Tyr Leu Pro Glu Ser Gly Ala Ser Ala Glu Asp Trp Arg Pro Ser Phe
945 950 955 960
Asp Met Ile Tyr Ala Ala Glu Val Glu Asp Asp Glu Leu Asp Val Val
965 970 975
Gly Trp Asn Asp Ser Tyr Asp Asn Lys Pro Leu Pro Leu Glu Glu Met
980 985 990
Arg Leu Trp Arg Asp Glu Ile Leu Gln Arg Leu Arg Ala Leu Ala Pro
995 1000 1005
Thr Arg Ile Leu Glu Ile Gly Thr Gly Ser Gly Met Leu Leu Leu Pro
1010 1015 1020
Leu Ala Gln Glu Val Gly Arg Tyr Gln Gly Leu Asp Phe Ser Ala Glu
1025 1030 1035 1040
Ala Val Ala Arg Leu Ser Arg Lys Val Ala Gln Arg Gly Leu Thr His
1045 1050 1055
Val Gln Leu Glu Gln Arg Glu Ala Arg Asp Leu Ser Gly Leu Gly Glu
1060 1065 1070
Asn Phe Asp Leu Val Ile Leu Asn Ser Val Ala Gln Tyr Phe Pro Asp
1075 1080 1085
Ala Arg Tyr Phe Ile Asp Val Met Glu Gln Ala Met Asp Arg Leu His
1090 1095 1100
Thr Asp Gly Arg Leu Phe Ile Gly Asp Leu Arg His Leu Gly Leu Leu
1105 1110 1115 1120
Arg His Phe His Ala Ser Arg Leu Val His Arg Arg Pro Ala Gly Ala
1125 1130 1135
Asp Arg Thr Ser Leu Leu Ser Gln Leu Glu Lys Met Val Glu Glu Glu
1140 1145 1150
Lys Glu Leu Leu Val Asp Pro Asp Phe Phe Phe His Trp Ala Ser Gln
1155 1160 1165
Arg Asn Asp Ile Ala Asn Ile Asp Val Leu Pro Lys Val Ser Gly Gly
1170 1175 1180
Gln Asn Glu Leu Thr Thr Tyr Arg Tyr Asp Val Val Ile Val Lys Gly
1185 1190 1195 1200
Asp Pro Gln Thr Phe Ala Pro Val Ala Arg Met Glu Ala Ala Ser Val
1205 1210 1215
Glu Thr Ser Trp Lys Gly Gln Pro Ala Leu Ile Cys Asn Ile Pro Asn
1220 1225 1230
Ser Arg Leu Ala Cys Val Glu Ala Phe Leu Asn Trp Leu Ala Asp Asp
1235 1240 1245
Ala Thr Thr Val Pro Thr Ala Gln Glu Trp Glu Ala Trp Ser Gly Thr
1250 1255 1260
Gln Ser Gly Ser Asp Pro Ala Met Leu Val Asp Ile Trp Gln Ala Arg
1265 1270 1275 1280
Ala Gly Ala Ala Lys Leu Cys Trp Ala Ser Gln Gly Gln Pro Gly Gln
1285 1290 1295
Phe Asp Leu Ala Val Ala Thr His Thr Glu Ala Leu Pro Ser Phe Thr
1300 1305 1310
Pro Val Ile Ser Arg Asn Ala Glu Leu Thr Arg Phe Phe Asn Ile Pro
1315 1320 1325
Val Gln Leu Arg Glu Gly Asp Ala Leu Ala Thr Thr Leu Arg Ser Tyr
1330 1335 1340
Leu Ser Ala Tyr Leu Pro Asp Tyr Met Leu Pro Ala Val Tyr Val Pro
1345 1350 1355 1360
Leu Asp Val Phe Pro Leu Thr Ile Asn Gly Lys Leu Asp Phe Ala Ala
1365 1370 1375
Leu Pro Glu Thr Gly Gln Glu Ile Lys Glu Ala Ala Ala Asp Gln Asn
1380 1385 1390
Gln Gln Leu Ser Glu Thr Glu Trp Lys Val Ala Asp Ile Trp Ala Glu
1395 1400 1405
Val Leu Gln Leu Ala Arg Pro Ser Leu His Ala Asn Phe Phe Glu Thr
1410 1415 1420
Gly Gly His Ser Leu Leu Ala Thr Gln Val Ile Ser Arg Leu Asn Ala
1425 1430 1435 1440
Ala Phe Ser Val Lys Leu Pro Leu Arg Ser Leu Phe Asp Arg Pro Thr
1445 1450 1455
Ile Ala Gly Leu Ala Ser Leu Leu Asp Asp Leu Gln Lys Lys Thr Glu
1460 1465 1470
Ser Ala Pro Pro Gln Pro Ala Ala Ile Arg Ala Val Pro Arg Glu Gly
1475 1480 1485
Leu Leu Pro Leu Ala Tyr Thr Gln Gln Arg Phe Trp Phe Met Glu Gln
1490 1495 1500
Ile Asp Gln Gly Pro Val Gly Ser Tyr Asn Ile Ser Leu Ala Leu Arg
1505 1510 1515 1520
Leu Arg Gly Gln Leu Val Pro Val Ala Leu His Thr Ala Ile Gln Thr
1525 1530 1535
Ile Val Arg Arg His Glu Ala Leu Arg Thr Val Phe Ile Gln His Asp
1540 1545 1550
Gly Gln Pro Ala Gln Leu Ile Lys Leu Glu Trp Ala Pro Ala Ile Glu
1555 1560 1565
Glu Thr Asp Phe Ser His Leu Ser Arg Ala Glu Ala Glu Thr Ala Leu
1570 1575 1580
Arg Asp Leu Leu Ser Val Gln Ala Asn Thr Arg Phe Ser Leu Asp Val
1585 1590 1595 1600
Ala Pro Pro Leu Arg Leu Asn Leu Val Arg Ile Gly Glu Gln Glu His
1605 1610 1615
Val Leu Gln Leu Thr Leu His His Ala Ile Cys Asp Gly Trp Ser Leu
1620 1625 1630
Gly Val Met Val Arg Glu Phe Ser Glu Cys Tyr Ser Ala Cys Val Ala
1635 1640 1645
Gly Arg Ala Pro Gln Leu Ala Ala Leu Pro Val Gln Leu Ala Asp Phe
1650 1655 1660
Ala Val Trp Gln Arg Ser Glu Met Ala Gly Thr Arg Leu Gln Ser Ile
1665 1670 1675 1680
Leu Gln Gln Trp Lys Gln Arg Leu Gln Gly Val Pro Tyr Asp Leu Ala
1685 1690 1695
Leu Pro Phe Glu Arg Ala Pro His Ala Asp Thr Pro Gln Met Gly Lys
1700 1705 1710
Ile Ile Tyr Phe Asn Phe Asp Ala Val Gln Leu Gly Gln Leu Lys Arg
1715 1720 1725
Phe Ala Glu Thr Asn Gly Ala Thr Leu Phe Met Val Leu Thr Thr Gly
1730 1735 1740
Tyr Ala Ala Leu Leu Gly Arg Tyr Ser Gly Val Asp Asp Val Val Ile
1745 1750 1755 1760
Gly Thr Pro Ile Ala Gln Arg Gln Gln Lys Glu Leu Glu Gly Ile Val
1765 1770 1775
Gly Cys Phe Leu Asn Thr Leu Ala Leu Arg Ile Gln Gly Glu Ala Gly
1780 1785 1790
Leu Ser Gly Gln Ala Leu Leu Ala His Val Arg Glu Arg Val Leu Glu
1795 1800 1805
Ala Tyr Glu Trp Gln Asp Ala Pro Phe Asp Ala Val Val Ser Glu Leu
1810 1815 1820
Ser Pro Glu Arg Ser Arg Asp Arg His Ala Leu Phe Gln Thr Met Leu
1825 1830 1835 1840
Thr Leu Gln Asn Met Pro Leu Gly Asn Phe Thr Leu Pro Gly Leu Glu
1845 1850 1855
Ala Glu Pro Leu Gln Gly Gln Glu Gly Ile Ala Gly Phe Asp Leu Ser
1860 1865 1870
Leu Thr Phe Ile Glu Met Ala Asp Ala Ser Gly Gln Asp Gly Leu Gln
1875 1880 1885
Gly Met Leu Glu Tyr Asp Ala Asn Lys Tyr Leu Tyr Ala Ser Val Glu
1890 1895 1900
His Phe Ala Ser Gln Leu Lys Thr Leu Leu Leu Ala Met Ala Ala Arg
1905 1910 1915 1920
Pro Glu Met Pro Val Asn Arg Leu Asp Leu Leu Ala Ala Asp Glu Arg
1925 1930 1935
Lys Arg Leu Leu Glu Thr Leu Asn Asp Thr Ser His Gln Ile Pro Gln
1940 1945 1950
Leu Cys Leu His Glu Leu Ile Ala Gly Gln Ala Ser Arg Thr Pro Asp
1955 1960 1965
Ser Ile Ala Ile Arg Asp Ala Ser Gly Glu Ile Ser Tyr Ala Glu Leu
1970 1975 1980
Glu Ala Arg Ala Asn Ala Val Ala Cys Ala Leu His Glu Gln Gly Val
1985 1990 1995 2000
Gly Pro Asp Thr Ile Val Gly Leu Cys Thr Glu Arg Asp Arg Gly Met
2005 2010 2015
Val Ile Gly Leu Leu Gly Ile Met Lys Ala Gly Ala Ala Tyr Leu Pro
2020 2025 2030
Leu Asp Pro Ala Tyr Pro Ile Glu Arg Leu Asp Leu Ile Leu Ala Asp
2035 2040 2045
Ala Gln Pro Pro Val Leu Val Thr Gln Thr Ala Leu Thr Ser Thr Thr
2050 2055 2060
Asn Phe Ser Gly Pro Lys Ile Leu Leu Glu Glu Leu Ser Gln Ser Ser
2065 2070 2075 2080
Ser Cys Pro Ala Ser Asp Ala Thr Leu Ala Asn Leu Ala Tyr Ile Ile
2085 2090 2095
Tyr Thr Ser Gly Ser Thr Gly Val Pro Lys Gly Val Met Ile Thr His
2100 2105 2110
Gly Ala Ile Val Asn Tyr Leu Ser Trp Ala Gln Gly Asn Tyr Ile Ser
2115 2120 2125
Gly Ser Gln Gly Ser Val Leu Met Thr Pro Ser Tyr Ala Phe Asp Gly
2130 2135 2140
Ser Met Thr Thr Leu Phe Thr Pro Leu Ile Ser Gly Arg Cys Met Gln
2145 2150 2155 2160
Leu Met Leu Arg Asp Asp Val Leu Ser Arg Ile Arg Asn Ser Leu Leu
2165 2170 2175
Glu Ser Arg Glu Pro Leu Ala Leu Ile Asp Cys Gly Pro Ala Gln Leu
2180 2185 2190
Glu Val Leu Gln His Val Leu Glu Pro Glu Gln Leu Ala Ala Ser Gln
2195 2200 2205
Val Gly Ala Ile Val Ile Gly Gly Glu Ala Leu His Ala Ala Thr Val
2210 2215 2220
Glu Gln Trp Arg Arg His Ala Pro Ala Thr Arg Leu Tyr Asn Glu Tyr
2225 2230 2235 2240
Gly Pro Thr Glu Ala Thr Val Gly Cys Cys Asn Tyr His Ile Thr Ala
2245 2250 2255
Asp Thr Pro Trp Phe Gly Pro Val Pro Ile Gly Arg Gly Ile Trp Asn
2260 2265 2270
Val Arg Val Tyr Val Leu Asp Lys Tyr Leu Gln Pro Leu Pro Val Gly
2275 2280 2285
Met Pro Gly Asp Leu Tyr Val Ala Gly Glu Gly Leu Ala Arg Gly Tyr
2290 2295 2300
Ala Gly Lys Pro Ala Leu Thr Ala Gln Ser Phe Ile Pro Asp Pro Phe
2305 2310 2315 2320
Ser Glu Gly Gly Arg Leu Tyr Arg Thr Gly Asp Arg Ala Cys Trp Gly
2325 2330 2335
Thr Gly Ser Val Ile His Tyr Leu Gly Arg Ser Asp Asn Gln Val Lys
2340 2345 2350
Phe Arg Gly Phe Arg Ile Glu Pro Gly Glu Ile Glu Glu Lys Ile Arg
2355 2360 2365
Leu Tyr Pro Gly Val Ser Glu Ala Ala Val Lys Val His Thr Asp Glu
2370 2375 2380
Gln Gly Ile Ser Arg Leu Val Ala Trp Leu Ala Gly Glu Ile His Asp
2385 2390 2395 2400
Gly Leu Asp Ala Trp Leu Arg Glu Ser Leu Pro Gly Phe Met Val Pro
2405 2410 2415
Ser His Tyr Val Leu Leu Pro Met Leu Pro Ile Ser Val Ser Gly Lys
2420 2425 2430
Val Asp Arg Asn Ala Leu Ser Leu Pro Glu Ile Thr His Gln Pro Leu
2435 2440 2445
Thr His Ser Glu Ser Arg Ala Leu Asn Ala Thr Glu Gln Arg Leu Ala
2450 2455 2460
Ala Ile Trp Gln Glu Val Ile Gly His Pro Val Ser Glu Pro Gln Ala
2465 2470 2475 2480
Asn Phe Phe Glu Ala Gly Gly Asp Ser Leu Arg Ala Val Lys Leu Ile
2485 2490 2495
Phe Leu Ile Glu Arg Glu Phe Lys Arg Val Leu Pro Leu Ala Ser Leu
2500 2505 2510
Phe Gly Leu His Thr Leu Glu Ala Gln Ala Ala Ala Leu Thr Ala Glu
2515 2520 2525
Ser Asn Ala Ala Thr Asp Ala Leu Val Pro Ile His Val Arg Glu Asn
2530 2535 2540
Ala Pro Ser Val Val Leu Val His Asp Ile Ser Gly Gln Ile Leu Ser
2545 2550 2555 2560
Tyr Arg Ser Leu Ala Glu Glu Leu Thr Ala Phe Gly Val Tyr Ala Ile
2565 2570 2575
Gln Ala Leu Ala Gly Gln His Thr Arg Ala Pro Ser Val Ala Asp Met
2580 2585 2590
Ala Glu Leu Tyr Ala Arg Ala Ile Met Glu Ala Arg Ile Pro Gly Pro
2595 2600 2605
Leu Ile Leu Val Gly His Ser Phe Gly Ala Gln Val Ala Thr Glu Leu
2610 2615 2620
Ser Arg Lys Leu Thr Thr Leu Gly Lys Lys Pro Leu Leu Leu Ala Ile
2625 2630 2635 2640
Leu Asp Gly Ile Ala Glu Pro Asp Arg Glu Ser Leu Gln Gln Leu Pro
2645 2650 2655
Arg Asp Asp Leu Asp Leu Met Asp Tyr Met Ile Arg Thr Ile Glu Leu
2660 2665 2670
Ser Met Asp Lys Arg Ile Asp Val Asp Ala Ala Arg Leu Arg Ala Leu
2675 2680 2685
Pro Glu Ser Glu Arg Ala Ser Trp Ile Thr Ala Ser Ile Thr Arg Ala
2690 2695 2700
Gly Val Val Pro Glu His Thr Ser Pro Glu His Val Met Gln Leu Phe
2705 2710 2715 2720
Thr Ile Tyr Lys Asn Asn Leu Glu Ser Leu His Gly Tyr Gln Pro Gly
2725 2730 2735
Arg Val Thr Cys Pro Val Thr Leu Trp Ala Thr Glu Ala Leu Gly Gln
2740 2745 2750
Gln Glu Asp Ala Gly Trp Gly Lys Tyr Ala Asp Arg Val Thr Val Tyr
2755 2760 2765
Gln Ala Ser Gly Asp His Val Ser Met Leu Lys Pro Pro His Val Gln
2770 2775 2780
Glu Leu Ala Ala Ser Leu Thr Lys Ala Ile Asn Asp Glu Met Arg
2785 2790 2795
<210> 10
<211> 246
<212> PRT
<213> 成团泛菌(Pantoea agglomerans)
<400> 10
Met Ser Ile Pro Arg Ile Val His Gln Ile Trp Tyr Gln Gly Glu Asn
1 5 10 15
Gln Val Pro Asp Lys Tyr Arg Arg Tyr Arg Glu Thr Trp Gln Gln Tyr
20 25 30
His Pro Asp Trp Gln Cys Met Leu Trp Asp Ala His Thr Leu Arg Glu
35 40 45
His Val Ala Ser His Trp Pro Gln Phe Leu Pro Ile Tyr Asp Ala Tyr
50 55 60
Pro Gln Asp Val Gln Arg Met Asp Ser Ala Arg Tyr Cys Leu Leu Ala
65 70 75 80
Thr Gln Gly Gly Leu Tyr Ala Asp Leu Asp Ile Glu Cys Leu Arg Pro
85 90 95
Val Asp Glu Leu Leu Thr Gly His Glu Leu Ile Leu Ser Gln Thr Val
100 105 110
Gly Tyr Asn Ile Ala Phe Ile Ala Ser Ala Ala Ala His Pro Leu Trp
115 120 125
Glu Thr Val Leu Asn His Leu Thr Asn Lys Ile Ser Ala Asp Leu Ser
130 135 140
Asp Val Pro Ser Phe Met Arg Glu Asn Val Ala Met Gln Ile Ala Val
145 150 155 160
Val Ser Gly Pro Arg Phe Phe Thr Leu Cys Val Glu Glu Ser Gly Val
165 170 175
Leu Ala Leu Pro Gly Thr Leu Ala Cys Pro Gly Glu Tyr Phe Glu Ser
180 185 190
Thr Ala Thr Pro Gly Tyr Val His Asp Lys Gln Lys Asp Trp Ile Pro
195 200 205
Tyr Gly Arg His Asp Met Asp Leu Asn Trp Met Ser Pro Ser Ala Arg
210 215 220
Leu Leu Ser Arg Leu Ala Arg Gly Phe Ser Thr Val Val Ser Gly Val
225 230 235 240
Arg Ala Phe Val Arg Gln
245
<210> 11
<211> 229
<212> PRT
<213> 成团泛菌(Pantoea agglomerans)
<400> 11
Met Arg Leu Ile Cys Phe Pro Tyr Ala Gly Gly Ser Thr Ala Ile Phe
1 5 10 15
Arg Gly Leu Ala Gln Leu Leu Pro Asp Ile Glu Val His Thr Pro Glu
20 25 30
Leu Pro Gly His Gly Ser Arg Met Asn Glu Ala Ala Phe Thr Ser Ile
35 40 45
Glu Glu Leu Ala Glu Arg Met Ile Met Glu Leu Arg Pro His Phe Ser
50 55 60
Arg Pro Phe Ala Leu Phe Gly His Ser Met Gly Ala Ala Leu Ser Phe
65 70 75 80
Glu Ile Val Ser Gln Leu Ser Phe Pro Glu Arg Ala Asn Leu Arg His
85 90 95
Leu Phe Val Ser Ala Cys Pro Ala Pro Gly Phe Ala Thr Ile Arg Arg
100 105 110
Arg Pro Leu Gln Asp Leu Asn Asp Ala Asp Phe Ile Glu Glu Leu Arg
115 120 125
Leu Leu Gly Gly Thr Pro Ser Glu Ile Leu Asp Asn Ala Glu Leu Met
130 135 140
Ala Leu Leu Leu Pro Met Leu Arg Ala Asp Phe Thr Ala Val Glu Asn
145 150 155 160
His Arg Ala Lys Ser Asp Ile Val Leu Asp Ala Ser Val Thr Ala Leu
165 170 175
Ala Gly Asp Arg Asp Glu Arg Val Thr Ala Glu Ala Val Phe Ala Trp
180 185 190
Arg His Ala Thr Arg Gly Asn Phe Val Ser His Leu Leu Gln Gly Asp
195 200 205
His Phe Phe Leu Lys Pro Gln Phe Leu Thr Ile Ala Asn Ile Ile Asn
210 215 220
Leu Arg Leu Ala Ala
225
<210> 12
<211> 1037
<212> PRT
<213> 成团泛菌(Pantoea agglomerans)
<400> 12
Met Ser Gln Phe Phe Ile Asn Arg Pro Ile Phe Ala Trp Val Ile Ala
1 5 10 15
Leu Phe Ile Val Leu Ala Gly Leu Ile Ala Ile Pro Gln Leu Pro Val
20 25 30
Ala Gln Tyr Pro Ser Val Ala Pro Pro Ser Val Ser Val Ser Val Thr
35 40 45
Tyr Pro Gly Ala Thr Pro Glu Thr Met Asn Glu Ser Val Ile Ser Leu
50 55 60
Leu Glu Arg Glu Ile Ser Gly Val Asp Asn Met Leu Tyr Phe Glu Ser
65 70 75 80
Ser Ser Asp Thr Ser Gly Thr Ala Ser Ile Thr Ile Thr Phe His Pro
85 90 95
Gly Thr Asp Val Lys Leu Ala Gln Val Asp Val Gln Asn Lys Leu Lys
100 105 110
Val Val Glu Ala Arg Leu Pro Gln Thr Val Arg Gln Asn Gly Ile Gln
115 120 125
Val Glu Ala Ala Asn Ser Gly Phe Leu Met Ile Val Gly Leu Arg Ser
130 135 140
Pro Ser Gly Thr Tyr Thr Asp Gln Asp Leu Ser Asp Tyr Phe Gly Arg
145 150 155 160
Asn Val Ser Asp Glu Leu Gln Arg Val Pro Gly Val Gly Lys Val Gln
165 170 175
Phe Phe Gly Ala Glu Lys Ala Met Arg Ile Trp Leu Asp Pro Asn Lys
180 185 190
Leu Tyr Thr Tyr Asn Leu Ser Ala Ser Asp Val Ile Thr Ala Leu Thr
195 200 205
Gln Gln Asn Ala Gln Val Ser Pro Gly Arg Val Gly Asp Glu Pro Ala
210 215 220
Arg Ser Gly Gln Lys Val Thr Tyr Gln Leu Thr Val Gln Gly Gln Leu
225 230 235 240
Ser Ser Ile Glu Ala Phe Arg Asn Ile Thr Leu Lys Ala Gln Pro Asp
245 250 255
Gly Ser Arg Val Arg Leu Gly Asp Val Ala Arg Ile Glu His Gly Leu
260 265 270
Gln Asn Tyr Ser Phe Ala Ile Arg Glu Asn Gly Lys Pro Ala Thr Ala
275 280 285
Ala Ala Ile Gln Leu Thr Pro Gly Ala Asn Ala Val Ser Thr Ala Glu
290 295 300
Gly Val Arg Ala Arg Leu Ser Glu Leu Ser Thr Ala Leu Pro Glu Gly
305 310 315 320
Met Ala Phe Ser Val Pro Phe Asp Thr Ala Pro Phe Val Lys Leu Ser
325 330 335
Ile Glu Lys Val Ile His Thr Phe Ile Glu Ala Met Val Leu Val Phe
340 345 350
Leu Val Met Leu Leu Phe Leu Gln Lys Leu Arg Tyr Thr Phe Ile Pro
355 360 365
Ala Ile Val Ala Pro Val Ala Leu Leu Gly Thr Phe Thr Ile Met Leu
370 375 380
Leu Ser Gly Phe Ser Ile Asn Val Leu Thr Met Phe Gly Met Val Leu
385 390 395 400
Ala Ile Gly Ile Ile Val Asp Asp Ala Ile Val Val Val Glu Asn Val
405 410 415
Glu Arg Leu Met Ala Glu Lys Gly Met Ser Pro Arg Glu Ala Thr Gln
420 425 430
Glu Ala Met Arg Glu Ile Thr Pro Ala Ile Ile Gly Ile Thr Leu Val
435 440 445
Leu Thr Ala Val Phe Ile Pro Met Gly Phe Ala Ser Gly Ser Ile Gly
450 455 460
Val Ile Tyr Arg Gln Phe Thr Leu Ser Met Ala Val Ser Ile Leu Phe
465 470 475 480
Ser Ala Phe Leu Ala Leu Thr Leu Thr Pro Ala Leu Cys Ala Ser Leu
485 490 495
Leu His Pro Val Thr Thr His Ser Thr Asn Lys Lys Gly Phe Phe Gly
500 505 510
Trp Phe Asn Arg Arg Phe Asn Arg Leu Ala Asn Gly Tyr Arg Ser Gly
515 520 525
Leu Arg Phe Thr Leu Lys Arg Ser Gly Arg Met Met Ile Leu Tyr Val
530 535 540
Leu Leu Cys Cys Val Val Phe Met Ala Tyr Arg Thr Leu Pro Ser Ser
545 550 555 560
Phe Leu Pro Asp Glu Asp Gln Gly Tyr Phe Met Thr Ala Ile Gln Leu
565 570 575
Pro Ser Asp Ala Thr Gln Glu Arg Thr Arg Lys Val Ala Asp His Leu
580 585 590
Glu Ser Ile Val Asp Lys Arg Asp Gly Ile Asn Gly Asn Ile Thr Val
595 600 605
Phe Gly Tyr Gly Phe Ser Gly Ser Gly Pro Asn Thr Ala Leu Ala Phe
610 615 620
Thr Thr Leu Lys Asp Trp Asp Gln Arg Asn Gly Val Met Ala Glu Gly
625 630 635 640
Glu Ala Ala Phe Val Gln Gln Glu Met Asp Thr Gln Pro Asp Ala Ile
645 650 655
Ala Met Ser Leu Leu Pro Pro Ala Ile Ala Asp Met Gly Thr Ser Ser
660 665 670
Gly Phe Thr Leu Tyr Leu Glu Asp Arg Gly Gly Lys Gly Tyr Ala Ala
675 680 685
Leu Met Gln Ala Ala Thr Lys Leu Thr Gly Leu Ala Ala Gly Ser Ser
690 695 700
Ile Val Ser Gly Val Tyr Thr Asp Gly Leu Pro Glu Gly Val Ser Ala
705 710 715 720
Arg Leu Asn Val Asp Arg Glu Lys Ala Gln Ala Met Gly Val Ser Phe
725 730 735
Asp Glu Ile Asn Gln Thr Leu Ser Val Ala Thr Gly Ser Tyr Tyr Val
740 745 750
Asn Asp Tyr Val Asp Ala Gly Arg Val Gln Gln Val Ile Val Gln Ala
755 760 765
Asp Ala Pro Tyr Arg Met Gln Leu Gln Asp Leu Leu Lys Leu Tyr Val
770 775 780
Arg Asn Ser Lys Gly Glu Met Val Pro Leu Ser Ala Phe Ile Thr Thr
785 790 795 800
Ser Trp Thr Gln Leu Pro Gln Gln Leu Asn Arg Tyr Gln Gly Tyr Pro
805 810 815
Ala Ile Lys Ile Ser Gly Ser Thr Ala Pro Gly Tyr Ser Ser Gly Ala
820 825 830
Ala Met Ala Glu Met Glu Arg Leu Ala Gly Thr Leu Pro Lys Gly Phe
835 840 845
Met Ala Glu Trp Ser Gly Thr Ser Leu Gln Glu Lys Asn Ser Ala Ser
850 855 860
Gln Met Pro Met Leu Leu Ala Leu Ser Val Leu Val Val Phe Met Val
865 870 875 880
Leu Ala Ala Leu Tyr Glu Ser Trp Ser Val Pro Phe Ser Val Leu Met
885 890 895
Val Val Pro Leu Gly Leu Ala Gly Ala Leu Ala Ala Val Tyr Leu Ala
900 905 910
Arg Met Pro Asn Asp Val Phe Phe Lys Val Gly Met Ile Met Leu Ile
915 920 925
Gly Leu Ser Ala Lys Asn Ala Ile Leu Ile Val Glu Phe Ala Arg Gln
930 935 940
Leu His Ala Gln Gly Ala Thr Val Leu Glu Ala Thr Ile Glu Ala Ala
945 950 955 960
Ile Leu Arg Leu Arg Pro Ile Ile Met Thr Ser Leu Ala Phe Thr Leu
965 970 975
Gly Val Val Pro Leu Met Leu Ala Thr Gly Ala Ser Glu Arg Thr Gln
980 985 990
His Ala Ile Gly Thr Gly Val Phe Gly Gly Met Ile Ser Gly Thr Leu
995 1000 1005
Met Ala Ile Tyr Phe Val Pro Val Phe Phe Ile Cys Val Ser Tyr Leu
1010 1015 1020
Ala Thr Lys Leu Ser Ser Gly Asp Lys Lys Asp Arg His
1025 1030 1035

Claims (10)

1.草欧菌素的生物合成基因簇,其特征在于,所述草欧菌素的生物合成基因簇至少包括7个基因,分别为:
非核糖体肽合成酶基因,即:ACBA基因,ACBB基因,ACBC基因,ACBD基因和ACBH基因;
所述ACBA基因编码1890个氨基酸,氨基酸序列如SEQ ID NO.2所示;
所述ACBB基因编码291个氨基酸,氨基酸序列如SEQ ID NO.3所示;
所述ACBC基因编码2499个氨基酸,氨基酸序列如SEQ ID NO.4所示;
所述ACBD基因编码3861个氨基酸,氨基酸序列如SEQ ID NO.5所示;
所述ACBH基因编码2799个氨基酸,氨基酸序列如SEQ ID NO.9所示;
与糖基转移相关基因,即:ACBI基因,其编码246个氨基酸,氨基酸序列如SEQ ID NO.10所示;
与氧化还原相关基因,即:ACBE基因,其编码254个氨基酸,氨基酸序列如SEQ ID NO.6所示。
2.如权利要求1所述的草欧菌素的生物合成基因簇,其特征在于,
所述ACBA基因的核苷酸序列如SEQ ID NO.1中第3010-9258位所示;
所述ACBB基因的核苷酸序列如SEQ ID NO.1中第9317-10588位所示;
所述ACBC基因的核苷酸序列如SEQ ID NO.1中第10713-18212位所示;
所述ACBD基因的核苷酸序列如SEQ ID NO.1中第18209-29794位所示;
所述ACBH基因的核苷酸序列如SEQ ID NO.1中第32535-40934位所示;
所述ACBI基因的核苷酸序列如SEQ ID NO.1中第40934-41674位所示;
所述ACBE基因的其核苷酸序列如SEQ ID NO.1中第29791-30555位所示。
3.如权利要求1所述的草欧菌素的生物合成基因簇,其特征在于,所述草欧菌素的生物合成基因簇还包括ACBF基因,其编码70个氨基酸,氨基酸序列如SEQ ID NO.7所示,其核苷酸序列如SEQ ID NO.1中第30602-30814位所示。
4.如权利要求1所述的草欧菌素的生物合成基因簇,其特征在于,所述草欧菌素的生物合成基因簇还包括ACBJ基因,其编码的氨基酸序列如SEQ ID NO.11所示,其核苷酸序列如SEQ ID NO.1中第41794-42483位所示。
5.如权利要求1所述的草欧菌素的生物合成基因簇,其特征在于,所述草欧菌素的生物合成基因簇还包括与转运相关的基因,即ACBG基因,其编码的氨基酸序列如SEQ ID NO.8所示,其核苷酸序列如SEQ ID NO.1中第30842-32506位所示。
6.如权利要求1所述的草欧菌素的生物合成基因簇,其特征在于,所述草欧菌素的生物合成基因簇还包括与抗性相关的基因,即ACBK基因,其编码的氨基酸序列如SEQ ID NO.12所示,其核苷酸序列如SEQ ID NO.1中42549-45662位所示。
7.如权利要求1所述的草欧菌素的生物合成基因簇,其特征在于,所述草欧菌素的生物合成基因簇的核苷酸序列如SEQ ID NO.1所示。
8.如权利要求1所述的草欧菌素的生物合成基因簇,其特征在于,所述草欧菌素的生物合成基因簇来源于成团泛菌(Pantoea agglomerans)ZJU23,保藏编号为CGMCC No.16174,保藏日期为2018年7月30日。
9.一种包含权利要求1~8任一项所述草欧菌素的生物合成基因簇的表达载体或基因工程菌。
10.如权利要求1~8任一项所述生物合成基因簇在合成草欧菌素的应用。
CN202110637064.2A 2021-06-08 2021-06-08 草欧菌素的生物合成基因簇及其应用 Active CN113528550B (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110637064.2A CN113528550B (zh) 2021-06-08 2021-06-08 草欧菌素的生物合成基因簇及其应用

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110637064.2A CN113528550B (zh) 2021-06-08 2021-06-08 草欧菌素的生物合成基因簇及其应用

Publications (2)

Publication Number Publication Date
CN113528550A true CN113528550A (zh) 2021-10-22
CN113528550B CN113528550B (zh) 2023-03-24

Family

ID=78124665

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110637064.2A Active CN113528550B (zh) 2021-06-08 2021-06-08 草欧菌素的生物合成基因簇及其应用

Country Status (1)

Country Link
CN (1) CN113528550B (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115820676A (zh) * 2022-09-14 2023-03-21 浙江大学 Pa2643基因在调控成团泛菌合成草欧菌素A产量中的应用

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20160057920A (ko) * 2014-11-14 2016-05-24 대한민국(농촌진흥청장) 판토에아 아글로메란스 sh1 및 이의 용도

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20160057920A (ko) * 2014-11-14 2016-05-24 대한민국(농촌진흥청장) 판토에아 아글로메란스 sh1 및 이의 용도

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
S. XU等: "Fusarium fruiting body microbiome member Pantoea agglomerans inhibits fungal pathogenesis by targeting lipid rafts", 《 NAT MICROBIOL》 *
T. KAMBER等,: "Characterization of the Biosynthetic Operon for the Antibacterial Peptide Herbicolin in Pantoea vagans Biocontrol Strain C9-1 and Incidence in Pantoea Species", 《APPL ENVIRON MICROBIOL》 *
郭翼奋 等: "利用工程重组草生欧文氏菌C...组接合转移及抗真菌活性测定", 《全国生物防治学术讨论会论文集中国植物保护学会生物入侵分会会议论文集》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115820676A (zh) * 2022-09-14 2023-03-21 浙江大学 Pa2643基因在调控成团泛菌合成草欧菌素A产量中的应用
CN115820676B (zh) * 2022-09-14 2024-04-09 浙江大学 Pa2643基因在调控成团泛菌合成草欧菌素A产量中的应用

Also Published As

Publication number Publication date
CN113528550B (zh) 2023-03-24

Similar Documents

Publication Publication Date Title
Schneider et al. Targeted alteration of the substrate specificity of peptide synthetases by rational module swapping
Nguyen et al. Genetically engineered lipopeptide antibiotics related to A54145 and daptomycin with improved properties
Cacho et al. Identification and characterization of the echinocandin B biosynthetic gene cluster from Emericella rugulosa NRRL 11440
KR101261870B1 (ko) 폴리믹신 b 또는 e 생합성 효소 및 이를 코딩하는 유전자 군
Yang et al. A novel two-component system PdeK/PdeR regulates c-di-GMP turnover and virulence of Xanthomonas oryzae pv. oryzae
DK2279265T3 (da) Cyanobakterie-saxitoxin-gencluster og detektering af cyanotoksiske organismer
Wells et al. CUS1, a suppressor of cold-sensitive U2 snRNA mutations, is a novel yeast splicing factor homologous to human SAP 145.
KR20190099396A (ko) 화합물의 생산을 위한 조성물 및 방법
CN104024272B (zh) 用于浅灰霉素和甲基浅灰霉素的生物合成的基因簇
KR20200111172A (ko) 네페탈락톨 산화 환원 효소, 네페탈락톨 합성 효소, 및 네페탈락톤을 생산할 수 있는 미생물
KR20100039443A (ko) 답토마이신 생합성 유전자 클러스터에 관련된 조성물 및 방법
KR20070060821A (ko) 폴리믹신 생합성 효소 및 이를 코딩하는 유전자 군
CN113528550B (zh) 草欧菌素的生物合成基因簇及其应用
CN101275141A (zh) 阿嗪霉素的生物合成基因簇
KR20110118811A (ko) 비-리보좀 펩티드 합성효소를 코딩하는 생합성 클러스터의 핵산 분자 및 그의 용도
KR102359972B1 (ko) 화합물의 제조를 위한 조성물 및 방법
CN110997700A (zh) 用于在杀真菌素链霉菌的基因工程菌株中增强恩拉霉素的生产的组合物和方法
JP2005508140A (ja) エンジイン環構造の生合成に関与する遺伝子及びタンパク質
JPWO2019216248A1 (ja) ペプチド類の大環状化酵素
CN110305881A (zh) 一种聚酮类化合物neoenterocins的生物合成基因簇及其应用
KR20110092510A (ko) 트리데캅틴 생합성 효소 및 이를 코딩하는 유전자
CN115466732B (zh) 用于制备Hispidin的重组蛋白及其应用
CA2354030A1 (en) Micromonospora echinospora genes encoding for biosynthesis of calicheamicin and self-resistance thereto
WO2002024736A9 (en) Polynucleotides and polypeptides associated with antibiotic biosynthesis and uses therefor
EP2857416A1 (en) Non-ribosomal protein synthesis pigment fusion peptides

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB03 Change of inventor or designer information
CB03 Change of inventor or designer information

Inventor after: Chen Yun

Inventor after: Ma Zhonghua

Inventor after: Xu Sunde

Inventor before: Xu Sunde

Inventor before: Chen Yun

Inventor before: Ma Zhonghua

GR01 Patent grant
GR01 Patent grant