CN108012523A - Plants with increased seed size - Google Patents

Publication number
CN108012523A
Authority
CN
China
Legal status
Pending
Application number
CN201680020523.XA
Other languages
English (en)
Inventor
李云海
Current Assignee
Institute of Genetics and Developmental Biology of CAS
Original Assignee
Institute of Genetics and Developmental Biology of CAS

Classifications

    • C: CHEMISTRY; METALLURGY
    • C12: BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N: MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00: Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09: Recombinant DNA-technology
    • C12N15/63: Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79: Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82: Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8216: Methods for controlling, regulating or enhancing expression of transgenes in plant cells
    • C12N15/8218: Antisense, co-suppression, viral induced gene silencing [VIGS], post-transcriptional induced gene silencing [PTGS]
    • A: HUMAN NECESSITIES
    • A01: AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
    • A01H: NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
    • A01H5/00: Angiosperms, i.e. flowering plants, characterised by their plant parts; Angiosperms characterised otherwise than by their botanic taxonomy
    • A01H5/10: Seeds
    • C: CHEMISTRY; METALLURGY
    • C07: ORGANIC CHEMISTRY
    • C07K: PEPTIDES
    • C07K14/00: Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/415: Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
    • C: CHEMISTRY; METALLURGY
    • C12: BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N: MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00: Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09: Recombinant DNA-technology
    • C12N15/63: Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79: Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82: Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241: Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8261: Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
    • Y: GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02: TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02A: TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A40/00: Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
    • Y02A40/10: Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
    • Y02A40/146: Genetically Modified [GMO] plants, e.g. transgenic plants

Abstract

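The present invention relates to transgenic plants with an altered seed phenotype, in particular increased seed size. The invention relates to plants that do not produce a functional NGAL2 polypeptide, or that produce neither functional NGAL2 nor functional NGAL3 polypeptides. NGAL2 and NGAL3 are members of the RAV family and contain a B3 DNA-binding domain and a transcriptional repression motif.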

Description

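Plants with increased seed size
Field of the invention
The present invention relates to transgenic plants with improved growth and yield-related traits, in particular increased seed size. The invention also relates to related methods, uses, isolated nucleic acids and vector constructs.
Background of the invention
A growing world population and a shrinking supply of arable land available for agriculture call for research into increasing agricultural efficiency and providing food security. The conventional approach to crop and horticultural improvement uses selective breeding techniques to identify plants with desirable characteristics. Such selective breeding techniques have drawbacks, however: they are typically labour-intensive, and they yield plants that often contain heterogeneous genetic components, which may not always result in the desired trait being passed on from the parent plants. Advances in molecular biology have allowed humans to modify the germplasm of animals and plants. Genetic engineering of plants entails isolating and manipulating genetic material (typically in the form of DNA or RNA) and subsequently introducing that genetic material into a plant. This technology is capable of delivering crops or plants with various improved economic, agronomic or horticultural traits, including increased yield. Many methods are available, for example genome editing (using CRISPR or TALEN) or mutagenesis.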
特别具有经济学意义的性状是增加的种子大小。种子大小是增加作物产量的重要农业学性状,也是影响物种再生策略的许多方面的关键的生态学性状,如幼苗存活率和种子传播综合征(Harper et al.,1970;Westoby et al.,2002;Moles et al.,2005;Fan etal.,2006;Orsi and Tanksley,2009;Gegas et al.,2010)。尽管种子的大小是植物中最重要的农业学性状之一,但是几乎未知决定种子最终大小的遗传学和分子学机制。在高等植物中,种子发育是以双受精过程开始,其中两个单倍体花粉核之一与单倍体卵细胞融合产生二倍体胚,同时另一精核与二倍体中央细胞融合形成三倍体胚乳(Lopes and Larkins,1993)。围绕胚珠的珠被是母体组织,在受精后形成种皮。因此,种子的大小是胚芽、胚乳和母体组织生长的结果。然而,几乎未知在植物中限制种子生长的遗传和分子机制。
母体地调节种子大小的一些因素已经在拟南芥中鉴别。例如,TRANSPARENTTESTAGLABRA2(TTG2)通过在母体珠被中增加细胞延长而影响种子生长(Garcia et al.,2005;Ohto et al.,2009),而APETALA2(AP2)可以通过在母体珠被中限制细胞延长而控制种子生长(Jofuku et al.,2005;Ohto et al.,2005;Ohto et al.,2009)。相反,AUXINRESPONSE FACTOR 2(ARF2)通过限制细胞增殖母体地作用于控制种子生长(Schruff etal.,2006)。相似地,遍在蛋白受体DA1与E3遍在蛋白连接酶DA2和EOD1/BB协同作用,通过在母体珠被中限制细胞增殖而控制种子大小(Li et al.,2008;Xia et al.,2013)。编码遍在蛋白特异性蛋白酶(UBP15)的da1-1的阻抑物(SOD2)中的突变抑制da1-1的较大种子表型(Du et al.,2014)。DA1与UBP15/SOD2生理性结合并调节UBP15稳定性。这些研究表明遍在蛋白途径在种子大小的母体控制中起重要作用。KLU/CYTOCHROME P450 78A5(CYP78A5)通过在胚珠的母体珠被中增加细胞增殖而调节种子大小(Adamski et al.,2009)。KLU也已经提示产生可动的植物生长物质,其促进细胞增殖(Anastasiou et al.,2007;Adamski etal.,2009)。相反,CYP78A6/EOD3的过表达在珠被中增加了细胞增殖和细胞延长,导致较大种子(Fang et al.,2012)。种子大小也由合子组织确定。已经描述了一些因素在拟南芥中通过合子组织影响种子大小,包括HAIKU1(IKU1)、IKU2、MINISEED3(MINI3)和SHORTHYPOCOTYL UNDER BLUE1(SHB1)(Garcia et al.,2003;Luo et al.,2005;Zhou et al.,2009;Wang et al.,2010;Kang et al.,2013)。iku和mini3突变体由于胚乳的早熟细胞化而形成小种子(Garcia et al.,2003;Luo et al.,2005;Wang et al.,2010)。SHB1与MINI3和IKU2启动子结合并调节MINI3和IKU2的表达(Zhou et al.,2009;Kang et al.,2013)。ABAINSENSITIVE5(ABI5)近年来描述了抑制SHB1的表达(Cheng et al.,2014),及MINI3报道了激活细胞因子氧化酶的表达(CKX2)(Li et al.,2013),提示植物激素在调节胚乳生长中的作用。此外,胚乳生长受亲本来源效应的影响(Scott et al.,1998;Xiao et al.,2006)。
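The object of the present invention is to provide plants with improved yield traits that are beneficial to agriculture.
Summary of the invention
In a first aspect, the invention relates to a plant that has been generated so that it does not produce a functional NGAL2 polypeptide, or does not produce functional NGAL2 and NGAL3 polypeptides.
In another aspect, the invention relates to a method for altering the phenotype of a plant, comprising reducing or abolishing, relative to a control plant, the expression of a nucleic acid sequence encoding a NGAL2 polypeptide, or reducing or abolishing the activity of NGAL2, or reducing or abolishing the expression of nucleic acid sequences encoding NGAL2 and NGAL3 polypeptides, or reducing or abolishing the activity of NGAL2 and NGAL3 polypeptides.
In another aspect, the invention relates to a method for producing a plant with an altered phenotype, comprising reducing or abolishing, compared with a control plant, the expression of a nucleic acid sequence encoding a NGAL2 polypeptide, or reducing or abolishing the activity of NGAL2, or reducing or abolishing the expression of nucleic acid sequences encoding NGAL2 and NGAL3 polypeptides, or reducing or abolishing the activity of NGAL2 and NGAL3 polypeptides.
In another aspect, the invention relates to a plant obtained or obtainable by any of the methods above.
In another aspect, the invention relates to an isolated nucleic acid comprising or consisting of a sequence of SEQ ID NO: 1 or 2, or a functional variant or homologue thereof.
In another aspect, the invention relates to a vector comprising the isolated nucleic acid above.
In another aspect, the invention relates to a silencing nucleic acid construct that targets a sequence comprising or consisting of SEQ ID NO: 1, 2 or 3, or a functional variant, part or homologue thereof.
Brief description of the figures
The invention is further described in the following non-limiting figures.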
图1:da1-1(sod7-1D)阻抑物的分离.
(A)来自野生型、da1-1和sod7-1D da1-1植物的种子(从左至右)。(B)野生型、da1-1和sod7-1D da1-1的成熟胚(从左至右)。(C)野生型、da1-1和sod7-1D da1-1植物的花(从左至右)。(D)野生型、da1-1和sod7-1D da1-1的30天龄植物(从左至右)。(E)野生型、da1-1和sod7-1D da1-1种子的投影面积。(F)野生型、da1-1和sod7-1D da1-1种子的重量。(G)10天龄的野生型、da1-1和sod7-1D da1-1幼苗的子叶面积。数值(E-G)以相对于各个野生型数值的平均值±SD给出,设置为100%。**,与da1-1相比P<0.01(Student’s t-检验)。标尺=0.5mm(A)、0.2mm(B)、1mm(C)和5cm(D)。
图2:在sod7-1D突变体中的种子和器官大小
(A和B)Col-0(A)和sod7-1D(B)的种子。(C和D)Col-0(C)和sod7-1D(D)的成熟胚。(E和F)Col-0(E)和sod7-1D(F)的10天龄幼苗。(G)Col-0和sod7-1D种子的投影面积。(H)Col-0和sod7-1D种子的重量。(I)10天龄Col-0和sod7-1D幼苗的子叶面积。(G-I)数值以各个野生型数值的平均值±SD给出,设置为100%。**,与野生型相比的P<0.01(Student’s t-检验)。标尺=0.5mm(A)和(B),0.2mm(C)和(D),及1mm(E)和(F)。
图3:SOD7基因的克隆
(A)sod7-1D突变体中T-DNA插入体的结构。(B)da1-1和sod7-1D da1幼苗中At3g11580(SOD7)和At3g11590的表达水平。
(C)SOD7蛋白质含有B3DNA结合结构域(浅色阴影中第二个结构域)和转录抑制基序(深色阴影中小的浅色方框,用箭头标示)。(D)Col-0、35S:GFP-SOD7#3和35S:GFP-SOD7#5种子的投影面积。(E)10天龄的Col-0,35S:GFP-SOD7#3和35S:GFP-SOD7#5幼苗的子叶面积。(F)Col-0、35S:GFP-SOD7#3和35S:GFP-SOD7#5幼苗中SOD7表达水平。数值(D-F)是以相对于各自野生型数值的平均值±SD给出,设置为100%。**,与野生型相比的P<0.01(Student’st-检验)。
图4:SOD7的表达模式和亚细胞定位。
(A-K)SOD7表达活性通过pSOD7:GUS转基因表达监测。在发育中的叶(A、B和C)、发育中的萼片(D、E)、发育中的花瓣(F、G)、发育中的雄蕊(H、I)和发育中的心皮(J、K)中GUS活性的组织化学分析。(L)在pSOD7:SOD7-GFP转基因植物的幼胚珠中SOD7-GFP的GFP荧光性。(M-O)示出SOD7-GFP(M)、DAPI染色(N)和融合的(O)图像的GFP荧光性。pSOD7:SOD7-GFP叶中的表皮细胞用于观测GFP信号。(P-R)示出GFP-SOD7(P)、DAPI染色(Q)和融合的(R)图像的GFP荧光性。35S:GFP-SOD7叶中的表皮细胞用于观测GFP信号。标尺=100μm(A-K),10μm(L),及2μm(M-R)。
图5:SOD7与NGAL3冗余作用以控制种子大小。
(A)SOD7基因结构。图中示出起始密码子(ATG)和终止密码子(TGA)。实心方框表示编码序列,框之间的线表示内含子。图中示出SOD7基因中T-DNA插入位点(sod7-ko1)。(B)NGAL3基因结构。图中示出起始密码子(ATG)和终止密码子(TGA)。实心方框表示编码序列,框之间的线表示内含子。图中示出NGAL3基因中的T-DNA插入位点(ngal3-ko1)。(C)Col-0、sod7-ko1、ngal3-ko1和sod7-ko1ngal3-ko1植物的种子(从左至右)。(D)Col-0、sod7-ko1、ngal3-ko1和sod7-ko1ngal3-ko1的成熟胚(从左至右)。(E)Col-0、sod7-ko1、ngal3-ko1和sod7-ko1ngal3-ko1的25天龄植物(从左至右)。(F)Col-0、sod7-ko1、ngal3-ko1和sod7-ko1ngal3-ko1的花(从左至右)。(G)Col-0、sod7-ko1、ngal3-ko1和sod7-ko1ngal3-ko1种子的投影面积。(H)Col-0、sod7-ko1、ngal3-ko1和sod7-ko1ngal3-ko1种子的重量。(I)Col-0、sod7-ko1、ngal3-ko1和sod7-ko1ngal3-ko1幼苗的子叶面积。数值(G-I)以相对于各个野生型数值的平均值±SD给出,设置为100%。**,与野生型(Col-0)相比的P<0.01(Student’st-检验)。标尺=0.5mm(C)、0.2mm(D)、5cm(E)及1mm(F)。
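Figure 6: SOD7 acts maternally to determine seed size.
(A) Projected area of Col-0×Col-0 (C/C) F1, Col-0×sod7-ko1 ngal3-ko1 (C/d) F1, sod7-ko1 ngal3-ko1×Col-0 (d/C) F1 and sod7-ko1 ngal3-ko1×sod7-ko1 ngal3-ko1 (d/d) F1 seeds. Values are given as mean ± SD relative to the respective wild-type values, set at 100%. (B) Projected area of Col-0×Col-0 (C/C) F2, Col-0×sod7-ko1 ngal3-ko1 (C/d) F2, sod7-ko1 ngal3-ko1×Col-0 (d/C) F2 and sod7-ko1 ngal3-ko1×sod7-ko1 ngal3-ko1 (d/d) F2 seeds. Values are given as mean ± SD relative to the respective wild-type values, set at 100%. (C and D) Mature ovules of Col-0 (C) and sod7-ko1 ngal3-ko1 (D). (E) Length of the outer integument of mature Col-0 (light bar, left) and sod7-ko1 ngal3-ko1 (dark bar, right) ovules. Values are given as mean ± SD. (F) Number of cells in the outer integument of Col-0 and sod7-ko1 ngal3-ko1 at 0, 6 and 8 DAP. Values are given as mean ± SD. (G) Length of cells in the outer integument of Col-0 and sod7-ko1 ngal3-ko1 at 0, 6 and 8 DAP. Values are given as mean ± SD. **, P<0.01 compared with wild-type (Col-0) values (Student's t-test). Scale bars = 50 μm (C) and (D).
Figure 7: klu-4 is epistatic to sod7-ko1 ngal3-ko1 with respect to seed size.
(A) Seed area of Col-0, klu-4, sod7-ko1 ngal3-ko1 and klu-4 sod7-ko1 ngal3-ko1 (from left to right). Values are given as mean ± SD relative to the respective wild-type values, set at 100%. (B) Seed weight of Col-0, klu-4, sod7-ko1 ngal3-ko1 and klu-4 sod7-ko1 ngal3-ko1 (from left to right). Values are given as mean ± SD relative to the respective wild-type values, set at 100%. (C) Outer integument length of Col-0, klu-4, sod7-ko1 ngal3-ko1 and klu-4 sod7-ko1 ngal3-ko1 (from left to right) at 0 and 8 DAP. Values are given as mean ± SD. (D) Number of cells in the outer integument of Col-0, klu-4, sod7-ko1 ngal3-ko1 and klu-4 sod7-ko1 ngal3-ko1 (from left to right) at 0 and 8 DAP. Values are given as mean ± SD. **, P<0.01 compared with their respective controls (Student's t-test).
Figure 8: SOD7 binds directly to the KLU promoter and represses KLU expression.
(A) Expression kinetics of SOD7 and KLU in pER8-SOD7 transgenic plants treated with β-estradiol for 0, 4 and 8 hours. Means were calculated from three biological samples. Values are given as mean ± SD. **, P<0.01 compared with the expression levels of KLU and SOD7, respectively, at 0 hours (Student's t-test). (B) The 2-kb promoter region of KLU upstream of its ATG codon contains a CACTTG sequence. PF1 and PF2 denote the PCR fragments used for ChIP quantitative PCR analysis. A and A-m denote the wild-type probe and the mutated probe, respectively, used in the EMSA assay. (C) ChIP-qPCR analysis showing that SOD7 binds the KLU promoter fragment PF1. Chromatin from 35S:GFP and 35S:GFP-SOD7 transgenic plants was immunoprecipitated with anti-GFP, and enrichment of the fragments was determined by quantitative real-time PCR. The ACTIN7 promoter was used as a negative control. Fold enrichment was normalised to the ACTIN7 amplicon, set at 1. Means were calculated from three biological samples. Values are given as mean ± SD. **, P<0.01 compared with 35S:GFP transgenic plants (Student's t-test). (D) Direct interaction between SOD7 and the KLU promoter determined by EMSA. The biotin-labelled probe A formed a DNA-protein complex with MBP-SOD7, whereas the mutated probe A-m did not. The retarded DNA-protein complex was reduced by competition with unlabelled probe A.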
图9:35S:GFP-SOD7转基因植物的器官大小表型。SOD7的过表达导致与野生型相比较小的植物。标尺=5cm。
图10:拟南芥中RAV家族成员的系统树。
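Figure 11: SOD7 acts redundantly with NGAL3 to influence organ size.
(A) Petal area of Col-0, sod7-ko1, ngal3-ko1 and sod7-ko1 ngal3-ko1. (B) Area of the seventh leaf of Col-0, sod7-ko1, ngal3-ko1 and sod7-ko1 ngal3-ko1. Values (A and B) are given as mean ± SD relative to the respective wild-type values, set at 100%. **, P<0.01 compared with the wild type (Col-0); *, P<0.05 compared with the wild type (Col-0).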
图12:NGAL2、NGAL3和同系物中的保守结构域。a)B box基序;b)阻抑物基序。
图13:序列排列。图中示出如下序列(从上至下):RMZM2G053008、HvMLOC_57250、Os12g0157000、GmLoc100778733、Bra004501、Bra000434、Bra040478、Bra014415、Bra003482、Bra007646、GmLoc100781489、GRMZM2G024948_T01、os02g0683500、HvMLOC_66387、os04g0581400、GRMZM2G102059_T01、Os10g0537100、GRMZM2G142999_T01、GRMZM2G125095_T01、os03g0120900、GRMZM2G098443_T01、GRMZM2G082227_T01、Os11g0156000、GRMZM2G328742_T01、GmLoc100802734、GmLoc100795470、GmLoc100818164、Bra017262、At2g36080/NGAL1、Bra005301、At3g11580/SOD7、BraLOC103849927、Bra034828、At5g06250/NGAL3、Bra005886、GmLoc102660503、HvMLOC_38822、os01g0693400、HvMLOC44012、HvMLOC_7940、HvMLOC_75135、TRAECDM81004、HvMLOC_56567、TRAES3BF098300010CFD_t1、HvMLOC_63261、TRAES3BF062700040CFD_t1、TRAES3BF062600010CFD_t1、Bra038346、GmLoc732601、GmLoc100789009、GmLoc100776987、GmLoc100801107。保守的B3结构域和阻抑物基序在方框中示出。
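Figure 14: Genome editing experiment in rice to knock out the rice genes Os11g0156000 and Os12g0157000. gRNA denotes the guide RNA; the target site linked to the gRNA scaffold recruits the Cas9 enzyme to the target site in the genome and triggers gene editing.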
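Detailed description of the invention
The invention will now be further described. In the following passages, different aspects of the invention are defined in more detail. Each aspect so defined may be combined with any other aspect or aspects unless clearly indicated to the contrary. In particular, any feature indicated as being preferred or advantageous may be combined with any other feature or features indicated as being preferred or advantageous.
Unless otherwise indicated, the practice of the invention will employ conventional techniques of botany, microbiology, tissue culture, molecular biology, chemistry, biochemistry, recombinant DNA technology and bioinformatics, which are within the skill of the art. Such techniques are explained fully in the literature.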
如本文所用,单词“核酸”、“核酸序列”、“核苷酸”、“核酸分子”或者“多核苷酸”是指包括DNA分子(例如cDNA或者基因组DNA)、RNA分子(例如mRNA)、天然发生的、突变的、合成的DNA或RNA分子,及使用核苷酸类似物产生的DNA或RNA的类似物。其可以是单链或双链的。这种核酸或多核苷酸包括但不限于结构基因的编码序列、翻译序列及不编码mRNA或蛋白质产物的非调节序列。这些术语还涵盖基因。术语“基因”或者“基因序列”广泛用于是指与生物功能相关的DNA核酸。因此,基因可包括内含子和外显子,如在基因组序列中的,或者可以仅包含编码序列,如在cDNA中,和/或可包括cDNA组合调节序列。
术语“肽”、“多肽”和“蛋白质”在本文可互换使用,是指任何长度的通过肽键连接在一起的聚合物形式的氨基酸。
对于本发明,“转基因的”、“转基因”或者“重组”是指关于例如核酸序列、表达盒、基因构建体或者包含核酸序列的载体,或者用本发明的核酸序列、表达盒或载体转化的生物体,所有这些构建均通过重组方法引起,其中
(a)编码可用于本发明方法中的蛋白质的核酸序列,或者
(b)与本发明的核酸序列可操纵地连接的遗传控制序列,例如启动子,或者
(c)上述(a)和(b)二者
不是位于其天然遗传环境中或者已经通过遗传干预技术修饰,其可能采取例如取代、添加、缺失、倒位或者插入一或多个核苷酸残基的形式修饰。天然遗传环境应理解为在原始植物中的天然的基因组或染色体位点或者在基因组文库中存在。在基因组文库的情况中,核酸序列的天然遗传环境优选至少部分保留。所述环境在核酸序列两侧的至少一侧,且具有至少50bp、优选至少500bp、尤其优选至少1000bp、最优选至少5000bp的序列长度。天然发生的表达盒-例如核酸序列的天然启动子与如上述编码可用于本发明方法中的多肽的相应核酸序列的天然发生的组合-当这种表达盒通过非天然合成的(人工的)方法如诱变处理而修饰时成为转基因表达盒。合适的方法在例如US 5,565,350或WO 00/15815中描述,所述文献均通过引用并入本文。
在某些实施方案中,本发明的转基因植物因此应理解为是指用于本发明方法中的核酸在所述植物基因组中不是在其天然位点,其对于被同源或异源表达的核酸是可能的。因此,植物可以表达沉默构建体转基因。然而,在某些实施方案中,转基因也意味着虽然根据本发明的不同实施方案的核酸在植物基因组中位于其天然位置,但是该序列关于其天然序列已经被修饰,和/或天然序列的调节序列已经通过例如诱变被修饰。
转基因优选应理解为是指发生本发明核酸在基因组中非天然位点的表达,即核酸的同源表达或者优选异源表达。根据本发明,转基因被稳定整合进植物中,所述植物优选对于转基因是纯合的。
本发明的各个方面使用遗传工程方法。因此植物已经通过使用遗传工程方法例如转基因表达、诱变、基因靶向、基因沉默或者基因组编辑等方法产生,如下文详细描述。因此,本发明的各个方面可包括重组DNA技术。本发明的植物因此是已经遗传工程化的突变体植物,即通过人为干预操纵。本发明各个方面的植物与未通过遗传工程方法操纵的天然变体不相关。在一些实施方案中,植物可以是转基因植物,例如包含表达沉默构建体的核酸构建体的植物。
在优选的实施方案中,不包括仅基于通过传统育种方法产生植物的实施方案。
本发明人已经鉴别了由Atda1-1的抑制基因(AtSOD7)编码的称作AtNGAL2的B3结构域转录阻抑物,其起母体作用通过在胚珠和发育中的种子的珠被中限制细胞增殖而控制种子大小。
本发明人先前鉴别了遍在蛋白受体DA1在拟南芥中作为种子大小的阴性调节物(Li et al.,2008)。da1-1突变体由于在母体珠被中增加细胞增殖而形成较大种子(Li etal.,2008;Xia et al.,2013)。为了鉴别在DA1途径中新的成分或者其它种子大小调节物,本发明人开始针对da1-1修饰物的T-DNA活化标签筛选(Fang et al.,2012)。da1-1的显性抑制基因(sod7-1D)从大约16,000株T1植物产生的种子中分离(图1A)。sod7-1D da1-1双突变体的种子显然比da1-1种子小且轻(图1A、E和F)。结果示出sod7-1D突变抑制da1-1的种子和器官大小表型。分离SOD7基因,且发现其编码含有结合结构域和转录阻抑基序的NGATHA样蛋白(NGAL2)(图3C)(Alvarez et al.,2009;Ikeda and Ohme-Takagi,2009;Trigueroset al.,2009)。SOD7属于RAV基因家族,其在拟南芥中由13个成员组成(图10)(Swaminathanet al.,2008)。RAV家族的一些成员含有推定的转录阻抑基序,包括NGA1、NGA2、NGA3、NGA4、NGAL1、NGAL2/SOD7和NGAL3(图10)(Ikeda and Ohme-Takagi,2009)。已知NGA1、NGAL1和NGAL2/SOD7中的转录阻抑基序具有抑制活性(Ikeda and Ohme-Takagi,2009),表明其是转录阻抑物。SOD7呈现出与NGAL3/DEVELOPMENT-RELATED PcG TARGET IN THE APEX 4(DPA4)拟南芥的最高相似性(图10),已知其在调节叶锯齿中起作用(Engelhorn et al.,2012),但是先前未鉴别在种子大小控制中的功能。
本发明人示出AtSOD7的过表达显著降低野生型植物的种子大小,而AtSOD7的破坏增加种子大小。本发明人示出AtNGAL3(AtSOD7的密切同系物)的破坏也增加种子大小。此外,同时破坏AtSOD7和AtNGAL3以协同方式进一步增加种子大小。由本发明人进行的遗传分析表明AtSOD7以与种子大小调节物AtKLU共同的途径起作用控制种子生长,但是与AtDA1无关。进一步的结果表明AtSOD7在体外和体内直接结合AtKLU的启动子并抑制AtKLU表达。因此,本发明人的发现示出AtSOD7(aka AtNGAL2)在农作物中是种子大小改良的靶位。本发明的植物特征在于增加的器官大小,例如增加的种子大小,及例如也增加的花瓣大小、增加的胚芽大小。增加的种子大小导致增加的种子产量,本发明的植物因此特征在于增加的种子产量。
因此,本发明涉及这样的植物,其中所述植物不产生功能性NGAL2和/或NGAL3多肽。例如,植物不产生编码NGAL2和/或NGAL3蛋白的核酸序列的全长转录体。在另一实施方案中,植物产生编码NGAL2和/或NGAL3的核酸序列的全长转录体,但是所得蛋白质不是功能性的。在优选的实施方案中,所述植物不产生功能性NGAL2多肽,也不产生功能性NGAL3多肽。这种植物是双重敲除或敲低突变体(丧失功能突变体),如下述的本发明方法涉及产生这种双突变体。
本发明的植物是已经遗传修饰的且不是天然发生的变异的突变体植物。因此,使用遗传工程方法例如下文详细描述的诱变、基因靶向、基因沉默或者基因组编辑方法产生植物。因此,本发明的各个方面可包括重组DNA技术。在一些实施方案中,所述植物可以是转基因植物,例如包含转基因以沉默SOD7和/或NGAL3的基因表达的植物。在其它实施方案中,植物不携带转基因,但是是突变体植物,其中编码NGAL2和/或NGAL3多肽的内源核酸序列或者内源SOD7和/或NGAL3启动子序列已经被操纵为降低或消除编码NGAL2和/或NGAL3多肽的核酸序列的表达,或者降低或消除NGAL2和/或NGAL3多肽的活性。本发明各个方面的植物与未经遗传工程方法操纵的天然变体不相关。
一方面,本发明涉及通过遗传工程方法产生的植物,其中编码NGAL2和/或NGAL3多肽的核酸序列的表达和/或NGAL2和/或NGAL3多肽的活性相对于对照植物被降低或消除。在一个实施方案中,编码NGAL2多肽的核酸序列的表达或者NGAL2多肽的活性被降低或消除。在另一实施方案中,编码NGAL3多肽的核酸序列的表达或者NGAL3多肽的活性被降低或消除。在优选的实施方案中,这两种蛋白质的功能被影响,换句话说,所述植物特征在于在所述植物中编码NGAL2多肽的核酸序列的表达或者NGAL2多肽的活性被降低或消除,及编码NGAL3多肽的核酸序列的表达或者NGAL3多肽的活性被降低或消除。
例如,所述植物可具有降低或消除的编码NGAL2多肽的核酸序列表达及降低或消除的编码NGAL3多肽的核酸序列的表达。在另一实施方案中,所述植物可具有降低或消除的NGAL2多肽的活性及降低或消除的NGAL3多肽的活性。在另一实施方案中,所述植物可具有降低或消除的编码NGAL2多肽的核酸序列的表达及降低或消除的NGAL3多肽的活性。在另一实施方案中,所述植物可具有降低或消除的编码NGAL3多肽的核酸序列的表达及降低或消除的NGAL2多肽的活性。
如在本发明各个方面中描述的NGAL2或NGAL3多肽具有如下文阐述的特征性结构域结构。
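The NGAL2 or NGAL3 polypeptide as described in the various aspects of the invention comprises a B3 DNA-binding domain, the structure of which is shown in Figure 12.
In one embodiment, the domain is SNNNNNNGGSGDDVACHFQRFDLHRLFIGWRGE (SEQ ID NO: 6) or a domain having at least 80% or at least 95% sequence identity thereto.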
如在本发明各个方面中描述的NGAL2或NGAL3多肽还包含如图12所示转录阻抑基序。
在一个实施方案中,所述结构域是VRLFGVNLE(SEQ ID NO:7)或者与其具有至少95%序列相同性的结构域。
在一个实施方案中,NGAL2蛋白是AtNGAL2、其功能变体、一部分或者同系物。AtNGAL2由AtSOD7编码。术语AtSOD7是指野生型AtSOD7核酸序列,其包含或者由SEQ IDNO.1(cDNA)或者SEQ ID NO 2(基因组DNA)组成。由AtSOD7编码的蛋白质称作AtNGAL2(SEQID NO.3)。在一个实施方案中,所述功能性同系物不是AtNGAL3。
在一个实施方案中,NGAL3蛋白是AtNGAL3、其功能变体、一部分或者同系物。术语AtNGAL3是指野生型AtNGAL3核酸序列,其包含或者由SEQ ID NO.4组成。由AtNGAL3编码的蛋白质称作AtNGAL3(SEQ ID NO.5)。
术语“功能性”是指NGAL2或NGAL3的生物功能,即其在控制器官大小、特别是种子大小中的功能。如本文所用,关于SEQ ID NO:1、2或3或者SEQ ID NO:4或5的术语“功能性变体”或者“功能部分”是指变体基因或者多肽序列或者基因或多肽序列的一部分,其保留全长非变体SOD7/NGAL2或NGAL2/NGAL3序列的生物学功能,即调节种子大小的功能。这样的序列分别互补Atsod7-1D突变体或者Atngal3突变体。
因此,应理解本领域技术人员将意识到本发明的各个方面不仅涵盖靶向AtSOD7和/或AtNGAL3核酸,例如分别包含或者由SEQ ID NO:1或SEQ ID NO:2或SEQ ID NO:4组成的核酸序列,或者包含或者由SEQ ID NO:3或SEQ ID NO:5组成的多肽,或者AtSOD7和/或AtNGAL3核酸的启动子。本发明还涵盖AtNGAL2或AtNGAL3的功能变体,其不影响所得蛋白质的生物活性和功能。核酸序列中导致在指定位点产生不同氨基酸、然而不影响编码多肽的功能性质的改变为本领域熟知。例如,氨基酸丙氨酸(一种疏水性氨基酸)的密码子可以由编码另一弱疏水性残基如甘氨酸或者更强疏水性残基如缬氨酸、亮氨酸或者异亮氨酸的密码子取代。相似地,导致一个负电荷残基由另一残基取代的改变,如用天冬氨酸取代谷氨酸,或者一个正电荷残基由另一残基取代,如赖氨酸取代精氨酸,这样也可以产生功能性等价产物。每个提议的修饰均在本领域技术人员常规技术范围内,决定因素是保留编码产物的生物学活性。本发明还涵盖了与如本发明所示野生型序列基本相同的变体,即仅具有在例如非保守残基中的一些序列改变,且是生物学活性的。
通常地,如本文所述特定的SOD7/NGAL3核苷酸序列或者NGAL2/NGAL3多肽的变体与特定的非变体核苷酸序列具有至少大约60%、优选至少大约75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、92%、94%、95%、96%、97%、98%或99%或更高的序列相同性,如通过本文别处描述的序列对比程序确定。
进一步地,本发明的各个方面不仅涵盖AtSOD7和/或AtNGAL3核酸,例如分别包含或者由SEQ ID NO:1或SEQ ID NO:2或SEQ ID NO:4组成的核酸序列,或者包含或者由SEQID NO:3或SEQ ID NO:5组成的多肽,或者其功能变体,也涵盖在拟南芥或前体植物中AtSOD7和/或AtNGAL3的同系物。也在本发明范围内的是这种同系物的功能变体。
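The term homologue as used herein also denotes AtSOD7 and/or AtNGAL3 orthologues from other plant species. A homologue of an AtNGAL2 or AtNGAL3 polypeptide has, in increasing order of preference, at least 25%, 26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 39%, 40%, 41%, 42%, 43%, 44%, 45%, 46%, 47%, 48%, 49%, 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99% overall sequence identity to the amino acid sequence shown in SEQ ID NO: 3 or 5, respectively. Preferably, the overall sequence identity is at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, most preferably 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%.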
在另一实施方案中,AtSOD7或AtNGAL3核酸序列的同系物分别与SEQ ID NO:1或2或4所示序列具有(以增加的优选性顺序)至少25%、26%、27%、28%、29%、30%、31%、32%、33%、34%、35%、36%、37%、38%、39%、40%、41%、42%、43%、44%、45%、46%、47%、48%、49%、50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、61%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或者至少99%整体序列相同性。优选地,整体序列相同性是至少70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%,最优选90%、91%、92%、93%、94%、95%、96%、97%、98%或者至少99%。所述整体序列相同性是使用本领域已知的整体序列排列对比算法确定的,如GAP程序中的Needleman Wunsch算法(GCG Wisconsin Package,Accelrys)。
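To make "overall sequence identity" concrete, the following is a minimal sketch of a Needleman-Wunsch global alignment followed by a percent-identity calculation. It is not the GAP program itself: the scoring values (match +1, mismatch -1, linear gap -1) and the choice of the alignment length (gaps included) as the denominator are assumptions made for illustration, and production tools use substitution matrices and affine gap penalties.

```python
def needleman_wunsch(a: str, b: str, match: int = 1, mismatch: int = -1, gap: int = -1):
    """Return one optimal global alignment of a and b as two gapped strings."""
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover the alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if (i > 0 and j > 0 and a[i - 1] == b[j - 1]) else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a: str, b: str) -> float:
    """Identical aligned positions as a percentage of the alignment length."""
    aligned_a, aligned_b = needleman_wunsch(a, b)
    if not aligned_a:
        return 0.0
    same = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return 100.0 * same / len(aligned_a)
```

For example, `percent_identity("ACGT", "AGT")` aligns `ACGT` against `A-GT` and reports three identical positions out of four aligned columns. A candidate homologue would be compared against SEQ ID NO: 3 or 5 in the same way, using a protein-aware scoring scheme.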
在优选的实施方案中,NGAL2或NGAL3同系物来自不是拟南芥的植物。
在一个实施方案中,AtNGAL2或其同系物或者AtNGAL3或其同系物包含具有如上定义的序列的B3结构域。
在一个实施方案中,AtNGAL2或其同系物或者AtNGAL3或其同系物包含具有如上定义的序列的转录抑制基序。
同系物的实例在图13和SEQ ID NO:49-145中示出。在某些实施方案中,如果植物具有一个以上的AtNGAL2和/或AtNGAL3同系物,则所有同系物均是敲除或敲低的。合适的同系物可以通过序列对比和保守结构域鉴别而鉴别。本领域具有可用于鉴别这种序列的预测物。同系物的功能可以如本文所述鉴别,例如当在植物中过表达或者在植物中敲除时、或者当在植物中表达或者通过在拟南芥表达同系物核酸序列增加功能突变体时,技术人员因此能证实该功能。
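Accordingly, the nucleotide sequences of the invention and described herein can also be used to isolate corresponding sequences from other organisms, in particular other plants, for example crop plants. In this manner, methods such as PCR and hybridisation can be used to identify such sequences on the basis of their sequence homology to the sequences described herein. The topology of the sequences and of the characteristic domains may also be taken into account when identifying and isolating homologues. Sequences may be isolated on the basis of their sequence identity to the entire sequence or to fragments thereof. In hybridisation techniques, all or part of a known nucleotide sequence is used as a probe that selectively hybridises to other corresponding nucleotide sequences present in a population of cloned genomic DNA fragments or cDNA fragments (i.e. genomic or cDNA libraries) from a chosen plant. The hybridisation probes may be genomic DNA fragments, cDNA fragments, RNA fragments or other oligonucleotides, and may be labelled with a detectable group or any other detectable marker. Thus, for example, probes for hybridisation can be made by labelling synthetic oligonucleotides based on the ABA-associated sequences of the invention. Methods for preparing probes for hybridisation and for constructing cDNA and genomic libraries are known in the art and are disclosed in Sambrook, et al., (1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, Plainview, New York).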
这种序列的杂交可以在严格条件下进行。“严格条件”或者“严格杂交条件”是指在此条件下探针以超过与其它序列杂交的程度杂交其靶序列(例如超过背景至少2倍)。严格条件是序列依赖性的,且在不同环境中是不同的。通过控制杂交和/或洗涤条件的严格性,可以鉴别与探针100%互补的靶序列(同源探查)。或者,可以调节严格条件以允许在序列中的一些错配,由此检测到较低程度的相似性(异源探查)。通常地,探针的长度小于大约1000个核苷酸,优选小于500个核苷酸。
典型地,严格条件是其中盐浓度低于大约1.5M钠离子,典型为大约0.01-1.0M钠离子浓度(或者其它盐),pH 7.0-8.3,对于短探针(例如10-50个核苷酸)的温度为至少大约30℃,对于长探针(例如大于50个核苷酸)的温度为至少大约60℃。杂交的持续时间通常低于大约24小时,通常是大约4-12小时。严格条件也可以通过加入去稳定剂如甲酰胺而实现。
根据本发明,优选的AtSOD7和AtNGAL3肽的同系物选自农作物,例如粮食作物。优选的AtNGAL2和AtNGAL3的同系物及其多肽序列在图13中示出。
根据本发明的各个方面的植物、包括本文所述的转基因植物、方法和使用可以是单子叶或双子叶植物。
双子叶植物可以选自包括但不限于如下科植物:菊科(Asteraceae)、十字花科(Brassicaceae(例如油菜(Brassica napus))、藜科(Chenopodiaceae)、葫芦科(Cucurbitaceae)、豆科(Leguminosae(Caesalpiniaceae、Aesalpiniaceae Mimosaceae、Papilionaceae或者Fabaceae))、锦葵科(Malvaceae)、蔷薇科(Rosaceae)或者茄科(Solanaceae)。例如,植物可以选自莴苣、向日葵、拟南芥、花椰菜、菠菜、西瓜、南瓜、卷心菜、番茄、马铃薯、山药、辣椒、烟草、棉花、秋葵、苹果、玫瑰、草莓、苜蓿、菜豆、大豆、蚕豆、豌豆、兵豆、花生、鹰嘴豆、杏、梨、桃、葡萄、甜椒、红辣椒或者柑橘。
单子叶植物可例如选自棕榈科(Arecaceae)、石蒜科(Amaryllidaceae)或者禾本科(Poaceae)植物。例如,植物可以是禾谷类作物如玉米、小麦、水稻、大麦、燕麦、高粱、黑麦、小米、荞麦,或者饲料作物如黑麦草属或者羊茅属,或者作物如甘蔗、洋葱、韭葱、山药或者香蕉。
所述植物还包括生物燃料和生物能源作物如油菜、甘蔗、甜高粱、Panicumvirgatum(柳枝稷)、亚麻籽、羽扇豆和柳树、白杨木、白杨木杂种、芒草属或裸子植物,如火炬松。所述植物还包括用于青贮饲料(玉米)、牧草或草料(草、三叶草、红豆草、苜蓿)、纤维(例如棉花、亚麻)、建筑材料(例如松树、橡树)、制浆(例如白杨木)、用于化学工业的给料(feeder stock)(例如高芥酸油菜、亚麻籽)及用于舒适目的的植物(例如高尔夫球场的草皮草)、用于公众和私人花园的观赏植物(例如金鱼草、矮牵牛花、玫瑰、老鹳草、烟草属(Nicotiana sp.))及用于家庭的植物和切花(非洲紫罗兰、秋海棠、菊花、天竺葵、彩叶吊兰(Coleus spider plant)、龙血树、橡胶植物)。
优选地,所述植物是作物植物。作物植物是指以商业规模生长用于人或动物消耗或使用的任何植物。在优选的实施方案中,所述植物是谷物。
最优选的植物是玉米、水稻、小麦、油菜、高粱、大豆、向日葵、苜蓿、马铃薯、番茄、烟草、葡萄、大麦、豌豆、菜豆、蚕豆、莴苣、棉花、甘蔗、甜菜、花椰菜或者其它蔬菜芸苔类植物或者白杨木。
如本文所用术语“植物”涵盖了全植物及植物和后代和植物一部分,包括种子、果实、地上部分、茎、叶、根(包括块茎)、花及组织和器官,其中每个前述部分均包含感兴趣的基因/核酸。术语“植物”也涵盖植物细胞、悬浮培养物、愈伤组织、胚芽、分生组织区、配子体、孢子体、花粉和小孢子,再次其中每个上述部分均包含感兴趣的基因/核酸。
根据本发明的各个方面,包括本发明的植物和方法,消除、失活、抑制、降低或者负调节NGAL2和/或NGAL3多肽的活性可以通过不同方式实现。这意味着在本发明各个方面的范围内的是消除或降低SOD7和/或NGAL3基因翻译或转录、SOD7和/或NGAL3转录体稳定性的去稳定、NGAL2和/或NGAL3多肽稳定性的去稳定、或者消除或降低NGAL2和/或NGAL3多肽的激活或活性的方法。因此,在一个实施方案中,内源性SOD7和/或NGAL3基因或其启动子携带功能性突变,由此产生非全长转录体。在另一实施方案中,在所述植物中使用基因沉默技术沉默SOD7和/或NGAL3基因。在另一实施方案中,SOD7和/或NGAL3核酸序列已经被改变以导入突变,获得具有降低的或消除活性的NGAL2/NGAL3蛋白质。这些实施方案和使用的技术在下文详细描述。
另一方面,本发明涉及一种改变植物表型的方法,包括相对于对照植物降低或消除编码NGAL2和/或NGAL3多肽的核酸序列的表达,和/或降低或消除NGAL2和/或NGAL3多肽的活性。
另一方面,本发明涉及一种产生具有改变的表型的植物的方法,包括相对于对照植物降低或消除编码NGAL2和/或NGAL3多肽的核酸序列的表达,和/或降低或消除NGAL2和/或NGAL3多肽的活性。
如前文所述,上述这些方法使用遗传工程方法。
在这个方面,可以靶向野生型植物同时敲除或敲低SOD7和NGAL3的功能。或者,所述方法可包括如下步骤:
a)在第一植物中敲除或敲低SOD7功能;
b)在第二植物中敲除或敲低NGAL3功能;及
c)使从所述第一植物再生的植物与从所述第二植物再生的植物杂交。
在这些方法的一个实施方案中,编码NGAL2多肽的核酸序列的表达或者NGAL2多肽的活性被降低或消除。在另一实施方案中,编码NGAL3多肽的核酸序列的表达或者NGAL3多肽的活性被降低或消除。在优选的实施方案中,所述方法包括降低或消除编码NGAL2多肽的核酸序列的表达或者NGAL2多肽的活性,及降低或消除编码NGAL3多肽的核酸序列的表达或者NGAL3多肽的活性,以产生双重丧失功能突变体。
例如,所述方法包括降低或消除编码NGAL2多肽的核酸序列的表达以及降低或消除编码NGAL3多肽的核酸序列的表达。在另一实施方案中,所述方法包括降低或消除NGAL2多肽的活性及降低或消除NGAL3多肽的活性。在另一实施方案中,所述方法包括降低或消除编码NGAL2多肽的核酸序列的表达及降低或消除NGAL3多肽的活性。在另一实施方案中,所述方法包括降低或消除编码NGAL3多肽的核酸序列的表达或者降低或消除NGAL2多肽的活性。
根据这些方法,所述表型优选选自增加的器官大小,例如增加的种子大小或增加的种子重量。增加的种子大小导致产量增加,本发明的方法也增加产量。
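The term "yield" generally refers to a measurable product of economic value, typically related to a specified crop, an area and a period of time. Individual plant parts contribute directly to yield on the basis of their number, size and/or weight; alternatively, the actual yield is the yield per square metre of a crop per year, determined by dividing total production (including both harvested and appraised production) by the square metres planted. As described herein, the term "yield" relates to yield-related traits and may relate to plant biomass (root and/or shoot biomass), to reproductive organs and/or to propagules (such as seeds) of the plant. Accordingly, according to the invention, the term yield refers to organ size, in particular seed size, which can be measured by assessing seed size, seed weight or cotyledon size.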
术语“增加”、“改良”或者“增强”可互换使用。产量或种子大小例如与对照植物相比增加至少2%、3%、4%、5%、6%、7%、8%、9%或10%,优选至少15%或20%,更优选25%、30%、35%、40%或者50%或更高。
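The percentage thresholds above, and the "mean ± SD relative to wild type, set at 100%" convention used in the figure legends, can be sketched as a small calculation. The function names and the sample numbers in the usage note are hypothetical and are not data from this document.

```python
from statistics import mean, stdev


def relative_to_control(treated, control):
    """Scale treated measurements so that the control mean equals 100%.

    Returns (mean, SD) of the scaled treated values, both in percent.
    """
    control_mean = mean(control)
    scaled = [100.0 * value / control_mean for value in treated]
    spread = stdev(scaled) if len(scaled) > 1 else 0.0
    return mean(scaled), spread


def percent_increase(treated, control):
    """Increase of the treated mean over the control mean, in percent."""
    relative_mean, _ = relative_to_control(treated, control)
    return relative_mean - 100.0
```

With made-up seed areas of `[1.0, 1.5]` for a mutant and `[1.0, 1.0]` for the control, `percent_increase` reports a 25% increase, which would satisfy the "at least 15% or 20%" thresholds but not the 30% one.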
如本文所用,根据本发明所有方面使用的对照植物是未根据本发明的方法修饰的植物。因此,对照植物未经如本文所述遗传修饰以改变编码NGAL2或NGAL3多肽的核酸的表达或者改变NGAL2或NGAL3多肽的活性。在一个实施方案中,对照植物是未经遗传改变的野生型植物。在另一实施方案中,对照植物是未改变编码NGAL2或NGAL3多肽的核酸的表达或者未改变NGAL2或NGAL3多肽活性、但是已经以其它方式进行遗传改变如表达希望的转基因以赋予某些性状的转基因植物。
与对照植物相比,根据本发明的NGAL2和/或NGAL3多肽或者相应SOD7和/或NGAL3核酸序列的活性降低、减少、负调节或者抑制至少10%、20%、30%、40%或50%。
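For example, the plant is a reduced-function (knock-down) or loss-of-function (knock-out) mutant in which the function of the SOD7 and/or NGAL3 nucleic acid sequence is reduced or lost compared with a wild-type control plant. To this end, a mutation is introduced into the SOD7 and/or NGAL3 nucleic acid sequence or the corresponding promoter sequence, which disrupts transcription of the gene and yields a gene product that is non-functional or has reduced function. The mutation may be a deletion, an insertion or a substitution. Expression of the active protein may thus be abolished by mutating the nucleic acid sequence encoding the NGAL2 or NGAL3 polypeptide in a plant cell and regenerating a plant from the mutated cell. The nucleic acid may be mutated by insertion or deletion of one or more nucleotides. Techniques for inactivating or knocking out target genes are well known in the art. These techniques include gene targeting using vectors that target the gene of interest and allow integration of a transgene at a specific site. The targeting construct is engineered to recombine with the target gene, which is accomplished by incorporating sequences from the gene itself into the construct. Recombination then occurs in the region of that sequence within the gene, resulting in the insertion of a foreign sequence that disrupts the gene. With its sequence interrupted, the altered gene is translated into a non-functional protein, if it is translated at all. Other techniques include genome editing (targeted genome engineering) as described below. In a preferred embodiment, one of these techniques is used to modify the conserved domains that confer NGAL2 or NGAL3 function, respectively.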
本领域技术人员已知可用于产生这种突变体的进一步的方法。在一个实施方案中,使用插入诱变技术,例如使用T-DNA诱变(其将来自根癌农杆菌(Agrobacteriumtumefaciens)T-质粒的T-DNA插入DNA中,导致基因功能丧失或者基因功能增加的突变)、定向核酸酶(SDN)或者转座子作为诱变剂。插入诱变是另一种破坏基因功能的方式,其基于将外来DNA插入感兴趣的基因中(见Krysan et al,The Plant Cell,Vol.11,2283-2290,December1999)。
在一个实施方案中,如在实施例中论述,T-DNA可用作插入诱变剂,其破坏SOD7和/或NGAL3基因表达。T-DNA不仅破坏其插入之中的基因的表达,而且还作为随后鉴别突变的标记。由于已知插入元件的序列,因此可以使用各种基于克隆或PCR策略重新获得已经发生插入的基因。插入长度大约为5-25kb的T-DNA通常产生基因功能的破坏。如果产生足够大的T-DNA转化系群体,则有相当好的机会发现在任何感兴趣的基因内携带T-DNA插入体的转基因植物。用T-DNA转化孢子通过农杆菌介导的方法实现,包括将植物细胞和组织暴露于农杆菌细胞悬浮液。
这种方法的详细内容为本领域技术人员熟知。简而言之,通过农杆菌的植物转化导致称作T-DNA的序列整合进核基因组中,所述序列在细菌质粒上携带。使用T-DNA转化导致稳定的单一插入。对所得转化系的进一步突变分析是直接进行的,每一个插入系可以通过对插入体两侧的DNA直接测序和分析快速鉴定。将突变体中的基因表达与野生型植物中SOD7和/或NGAL3核酸序列的表达进行对比,及进行表型分析。其它插入诱变技术包括使用转座子。
在另一实施方案中,诱变是物理诱变,如应用紫外线照射、X射线、γ射线、快或热中子或质子。然后可以筛选靶向的群体以鉴别SOD7或NGAL3功能丧失突变体。
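In another embodiment of the various aspects of the invention, the plant is a mutant plant derived from a plant population mutagenised with a mutagen. The mutagen may be fast neutron irradiation or a chemical mutagen, for example selected from the following non-limiting list: ethyl methanesulfonate (EMS), methyl methanesulfonate (MMS), N-ethyl-N-nitrosourea (ENU), triethylmelamine (TEM), N-methyl-N-nitrosourea (MNU), procarbazine, chlorambucil, cyclophosphamide, diethyl sulfate, acrylamide monomer, melphalan, nitrogen mustard, vincristine, dimethylnitrosamine, N-methyl-N'-nitro-nitrosoguanidine (MNNG), nitrosoguanidine, 2-aminopurine, 7,12-dimethylbenz(a)anthracene (DMBA), ethylene oxide, hexamethylphosphoramide, busulfan, diepoxyalkanes (diepoxyoctane (DEO), diepoxybutane (BEB) and the like), 2-methoxy-6-chloro-9[3-(ethyl-2-chloroethyl)aminopropylamino]acridine dihydrochloride (ICR-170) or formaldehyde.
In one embodiment, the method used to create and analyse mutations is Targeting Induced Local Lesions IN Genomes (TILLING), as described by Henikoff et al., 2004. In this method, seeds are mutagenised with a chemical mutagen such as EMS. The resulting M1 plants are self-fertilised, and the M2 generation of individuals is used to prepare DNA samples for mutation screening. DNA samples are pooled, arrayed on microtitre plates and subjected to gene-specific PCR. The PCR amplification products may be screened for mutations in the SOD7 and/or NGAL3 target gene using any method that identifies heteroduplexes between wild-type and mutant genes, for example, but not limited to, denaturing high-pressure liquid chromatography (dHPLC), constant denaturant capillary electrophoresis (CDCE), temperature gradient capillary electrophoresis (TGCE), or fragmentation by chemical cleavage. Preferably, the PCR amplification products are incubated with an endonuclease that preferentially cleaves mismatches in heteroduplexes between wild-type and mutant sequences. Cleavage products are electrophoresed using an automated sequencing gel apparatus, and gel images are analysed with the aid of a standard commercial image-processing program. Any primer specific to the SOD7 or NGAL3 nucleic acid sequence may be used to amplify that sequence within the pooled DNA samples. Preferably, the primers are designed to amplify those regions of the SOD7 and/or NGAL3 gene where useful mutations are most likely to arise, in particular highly conserved regions and/or regions that confer activity as explained elsewhere herein. To facilitate detection of PCR products on a gel, the PCR primers may be labelled using any conventional labelling method.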
快速高通量筛选程序因此允许分析扩增产物以鉴别赋予与相应未诱变的野生型植物相比SOD7和/或NGAL3基因的表达降低或失活的突变。一旦在感兴趣的基因中鉴别了突变,将携带突变的M2植物的种子生长为成熟M3植物,并筛选与靶基因SOD7或NGAL3相关的表型特征。因此可以鉴别与对照植物相比具有增加的种子大小的功能丧失或者功能降低的突变体。
通过这种方法获得或者可以获得的在内源性SOD7和/或NGAL3基因座携带功能突变的植物也在本发明范围内。
在另一实施方案中,RNA介导的基因抑制或者RNA沉默可用于实现SOD7和/或NGAL3核酸序列的沉默。“基因沉默”是通常用于描述通过由RNA分子介导的序列特异性相互作用抑制基因表达的术语。降低程度可以是全部消除编码的基因产物的产生,但是更通常的是部分消除表达,保留一定程度的表达。该术语因此不应是要求表达的完全“沉默”。
转基因可用于抑制内源性植物基因。这个结果最初是当在矮牵牛花中查尔酮合成酶转基因导致内源性查尔酮合成酶基因抑制而发现的,且易于通过可见的色素改变而表明。随后,描述了有多少(如果不是全部)植物基因可以被转基因“沉默”。基因沉默要求转基因与被沉默的基因之间的序列相似性。这种序列同源性可包括沉默的靶基因的启动子区或者编码区。当包含编码区时,能导致基因沉默的转基因可以构建为具有转录有义或反义方向编码序列RNA的启动子。可能的是基因沉默的各个实例包括尚未完全了解的不同机制。在不同的实施例中,可以是转录或转录后基因沉默,根据本发明的方法这两种基因沉默均可使用。
最初在九十年代在植物中发现、随后在新杆状线虫(Caenorhabditis elegans)中揭示的基因沉默机制及其在遗传工程中的应用,在文献中广泛描述。
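RNA-mediated gene suppression or RNA silencing according to the methods of the invention includes co-suppression, in which over-expression of the target sense RNA or mRNA (SOD7 and/or NGAL3 sense RNA or mRNA) leads to a reduction in the expression level of the gene concerned. The RNAs of the transgene and of the homologous endogenous gene are co-ordinately suppressed. Other techniques for use in the methods of the invention include antisense RNA to reduce the transcript level of the endogenous target gene in the plant. In this method, RNA silencing does not affect transcription of the gene locus but only causes sequence-specific degradation of the target mRNA. An "antisense" nucleic acid sequence comprises a nucleotide sequence that is complementary to a "sense" nucleic acid sequence encoding a NGAL2 and/or NGAL3 protein or part of the protein, that is, complementary to the coding strand of a double-stranded cDNA molecule or complementary to an mRNA transcript sequence. The antisense nucleic acid sequence is preferably complementary to the endogenous SOD7 and/or NGAL3 gene to be silenced. The complementarity may be located in the "coding region" and/or the "non-coding region" of the gene. The term "coding region" refers to a region of the nucleotide sequence comprising codons that are translated into amino acid residues. The term "non-coding region" refers to 5' and 3' sequences that flank the coding region and that are transcribed but not translated into amino acids (also referred to as 5' and 3' untranslated regions).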
反义核酸序列可以根据Watson和Crick碱基配对原则设计。反义核酸序列可以互补于整个SOD7和/或NGAL3核酸序列,但是也可以是仅对于一部分核酸序列(包括mRNA5’和3’UTR)反义的寡核苷酸。例如,反义寡核苷酸序列可以互补于编码多肽的mRNA转录体的翻译起始位点周围的区域。合适的反义寡核苷酸序列的长度为本领域已知,可以起自长度为大约50、45、40、35、30、25、20、15或10个核苷酸或更少。根据本发明的反义核酸序列可以使用本领域已知的方法通过化学合成和酶连接反应构建。例如,反义核酸序列(例如反义寡核苷酸序列)可以使用天然发生的核苷酸或者设计为增加分子的生物稳定性或者增加在反义与有义核酸序列之间形成的双链体的物理稳定性的不同修饰的核苷酸经化学合成,例如可以使用硫代磷酸酯衍生物和吖啶取代的核苷酸。可用于产生反义核酸序列的修饰的核苷酸的实例为本领域熟知。反义核酸序列可以使用表达载体经生物学产生,所述表达载体中核酸序列已经以反义方向亚克隆(即从插入的核酸中转录的RNA对于感兴趣的靶核酸是反义方向)。优选地,在植物中反义核酸序列的产生借助于包含启动子、可操纵连接的反义寡核苷酸及终止子的稳定整合的核酸构建体而发生。
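Designing an antisense sequence by Watson-Crick base pairing amounts to taking the reverse complement of the chosen stretch of the sense strand. The sketch below illustrates this for a DNA oligonucleotide; the function names and the toy sequence in the usage note are hypothetical and are not taken from SEQ ID NO: 1, 2 or 4.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def antisense_oligo(sense: str, start: int, length: int) -> str:
    """Antisense DNA oligo against sense[start:start+length] (0-based start)."""
    window = sense[start:start + length]
    if start < 0 or len(window) < length:
        raise ValueError("requested window falls outside the sense sequence")
    return reverse_complement(window)
```

For instance, `antisense_oligo("GGATGGCCTT", 2, 6)` targets the window `ATGGCC` around a toy start codon and returns `GGCCAT`. The lengths listed above (about 10 to 50 nucleotides) would be passed as `length`.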
将在本发明方法中用于沉默的核酸分子与mRNA转录体杂交或结合和/或插入编码多肽的基因组DNA中,从而例如通过抑制转录和/或翻译而抑制蛋白质的表达。杂交可以通过常规核苷酸互补形成稳定双链体,或者例如在结合DNA双链体的反义核酸序列的情况中,通过在双链体螺旋的大沟中的特异性相互作用而形成。反义核酸序列可以通过转化或者在特定组织位点直接注射而导入植物中。或者,反义核酸序列可以修饰为靶向选择的细胞,然后系统性给予。例如,对于系统性给予,可以修饰反义核酸序列,由此其例如通过将所述反义核酸序列与结合细胞表面受体或抗原的肽或抗体连接而特异性结合在选择的细胞表面上表达的受体或抗原。也可以通过使用载体将反义核酸序列输送至细胞。
RNA干扰(RNAi)是另一种转录后基因沉默现象,根据本发明的方法可以使用这种现象。这种现象是通过双链RNA诱导的,其中与dsRNA同源的mRNA被特异性降解。这是指通过短干扰RNAs(siRNA)介导的序列特异性转录后基因沉默的过程。在当酶DICER遭遇dsRNA并将其裂解成称作小干扰RNA(siRNA)的片段时开始RNAi的过程。这种酶属于RNase III核酸酶家族。收集这些RNA的蛋白质复合物保留并使用其密码作为引导以搜索和破坏细胞中具有匹配序列的任何RNA,如靶mRNA。
人工和/或天然微RNA(miRNA)可用于敲除基因表达和/或mRNA翻译。微RNA(miRNA)miRNA是典型的单链小RNA,长度典型为19-24个核苷酸。大多数植物miRNA与其靶序列具有完美的或者接近完美的互补性。然而,存在具有至多5个错配的天然靶序列。其是通过Dicer家族的双链特异性RNases从具有特征性折返结构的较长的非编码RNA中加工的。在加工的基础上,通过结合其主要成分-Argonaute蛋白-而被掺入RNA诱导的沉默复合物(RISC)中。miRNA作为RISC的特异性成分,因为其与细胞质中靶核酸、主要是mRNA进行碱基配对。随后的调节事项包括靶mRNA裂解及破坏和/或翻译抑制。miRNA过表达的作用因此通常体现在靶基因的mRNA水平降低。人工微RNA(amiRNA)技术已经用于拟南芥(Arabidopsis thaliana)及其它植物中,以有效地沉默感兴趣的靶基因。amiRNA的设计原理已经广泛化并并入基于网页的工具中(http://wmd.weigelworld.org)。
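Thus, according to the various aspects of the invention, a plant may be transformed to introduce an RNAi, shRNA, snRNA, dsRNA, siRNA, miRNA, ta-siRNA, amiRNA or co-suppression molecule that has been designed to target expression of the SOD7 and/or NGAL3 nucleic acid sequence and to selectively decrease or inhibit expression of the gene or the stability of its transcript. Preferably, the RNAi, snRNA, dsRNA, shRNA, siRNA, miRNA, amiRNA, ta-siRNA or co-suppression molecule used according to the various aspects of the invention comprises a fragment of at least 17 nt, preferably 22-26 nt, and can be designed on the basis of the information shown in SEQ ID NO: 1. Guidelines for designing effective siRNAs are well known to the skilled person. Briefly, a short fragment of the target gene sequence (for example 19-40 nucleotides in length) is chosen as the target sequence of the siRNA of the invention. The short fragment of the target gene sequence is a fragment of the target gene mRNA. In preferred embodiments, the criteria for choosing a sequence fragment from the target gene mRNA as a candidate siRNA molecule include: 1) a sequence from the target gene mRNA that is at least 50-100 nucleotides from the 5' or 3' end of the native mRNA molecule; 2) a sequence from the target gene mRNA with a G/C content of 30%-70%, most preferably around 50%; 3) a sequence from the target gene mRNA that does not contain repetitive sequences (for example AAA, CCC, GGG, TTT, AAAA, CCCC, GGGG, TTTT); 4) a sequence from the target gene mRNA that is accessible in the mRNA; 5) a sequence from the target gene mRNA that is unique to the target gene; 6) avoidance of regions within 75 bases of a start codon. The sequence fragment from the target gene mRNA may meet one or more of the criteria above. The selected gene is entered as a nucleotide sequence into a prediction program that takes into account all the variables described above for the design of optimal oligonucleotides. This program scans any mRNA nucleotide sequence for regions susceptible to siRNA targeting. The output of this analysis is a score for possible siRNA oligonucleotides. The highest scores are used to design double-stranded RNA oligonucleotides, which are typically produced by chemical synthesis. In addition to siRNAs complementary to the mRNA target region, degenerate siRNA sequences may be used to target homologous regions. The siRNAs according to the invention can be synthesised by any method known in the art. The RNA is preferably synthesised chemically using appropriately protected ribonucleoside phosphoramidites and a conventional DNA/RNA synthesiser. In addition, siRNAs can be obtained from commercial RNA oligonucleotide synthesis suppliers.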
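The sequence-based selection criteria for siRNA candidates (distance from the mRNA ends, G/C content, absence of homopolymer runs, distance from the start codon) lend themselves to a simple window scan. The following is a minimal sketch under stated assumptions: it is not the prediction program referred to in the text, the function name and default parameters are hypothetical, and criteria 4 and 5 (accessibility and uniqueness) are not modelled because they need structure prediction and a genome-wide search. The start-codon rule is interpreted here as excluding any window that overlaps the 75 bases on either side of the start-codon position.

```python
HOMOPOLYMER_RUNS = ("AAA", "CCC", "GGG", "TTT")


def sirna_candidates(mrna, start_codon_pos, length=21, end_margin=50,
                     start_margin=75, gc_min=0.30, gc_max=0.70):
    """List (position, window, gc_fraction) for windows passing the filters.

    Positions are 0-based offsets into the mRNA; U is treated as T.
    """
    mrna = mrna.upper().replace("U", "T")
    blocked_lo = start_codon_pos - start_margin
    blocked_hi = start_codon_pos + start_margin
    hits = []
    # Criterion 1: stay at least end_margin nucleotides from both ends.
    for i in range(end_margin, len(mrna) - end_margin - length + 1):
        # Criterion 6: skip windows overlapping the start-codon neighbourhood.
        if i < blocked_hi and i + length > blocked_lo:
            continue
        window = mrna[i:i + length]
        # Criterion 3: reject runs of three or more identical bases.
        if any(run in window for run in HOMOPOLYMER_RUNS):
            continue
        # Criterion 2: keep G/C content within the allowed range.
        gc = (window.count("G") + window.count("C")) / length
        if gc_min <= gc <= gc_max:
            hits.append((i, window, gc))
    return hits
```

Candidates that pass could then be ranked, for example by how close their G/C fraction is to 0.5, before checking uniqueness against the target genome.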
根据本发明的siRNA分子可以是双链的。在一个实施方案中,双链siRNA分子包含钝端。在另一实施方案中,双链siRNA分子包含突出端核苷酸(例如1-5个核苷酸突出端,优选2个核苷酸突出端)。在一些实施方案中,siRNA是短发夹RNA(shRNA);siRNA分子的两个链可以通过接头区连接(例如核苷酸接头或者非核苷酸接头)。本发明的siRNA可含有一或多个修饰的核苷酸和/或非磷酸二酯键。本领域熟知的化学修饰能增加siRNA的稳定性、利用度和/或细胞摄取。技术人员知道可掺入RNA分子中的其它类型化学修饰。
在一个实施方案中,可以使用如US 6635805所述的重组DNA构建体,所述专利并入本文作参考。
使用常规方法如载体及农杆菌介导的转化方法将沉默RNA分子导入植物中。产生稳定转化的植物,并分析与野生型对照植物相比SOD7和/或NGAL3基因的表达情况。
SOD7和/或NGAL3核酸序列的沉默也可以使用病毒诱导的基因沉默完成。
因此,在本发明的一个实施方案中,植物表达包含RNAi、shRNA snRNA、dsRNA、siRNA、miRNA、ta-siRNA、amiRNA或共抑制分子的核酸构建体,其靶向如本文所述SOD7或NGAL3核酸序列并降低内源SOD7或NGAL3核酸序列的表达。当例如与对照植物相比RNAi、snRNA、dsRNA、siRNA、shRNA、miRNA、ta-siRNA、amiRNA或共抑制分子选择性降低或抑制基因的表达时,该基因被靶向。或者当RNAi、shRNAsnRNA、dsRNA、siRNA、miRNA、ta-siRNA、amiRNA或共抑制分子在严格条件下与基因转录体杂交时,所述RNAi、snRNA、dsRNA、siRNA、miRNA、ta-siRNA、amiRNA或共抑制分子靶向SOD7或NGAL3核酸序列。
如果在内源基因上存在突变和/或在随后导入植物中的分离的基因/核酸上存在突变,也可以发生基因沉默。所述降低或基本上消除可以由非功能性多肽引起。例如,多肽可以结合各种相互作用蛋白质;一或多个突变和/或截短因此可以提供给多肽,其仍能结合相互作用蛋白质(如受体蛋白),但是不能呈现出其正常功能(如信号配体)。
基因沉默的另一种方法是通过靶向互补于基因调节区(例如启动子和/或增强子)的核酸序列,形成阻止基因在靶细胞中转录的三螺旋结构。本领域技术人员熟知其它方法,如使用内源多肽的抗体抑制其在原位的功能或者干扰多肽参与的信号途径。特别地,可以设想人工分子可用于抑制靶多肽的生物功能,或者干扰靶多肽参与的信号途径。
在一个实施方案中,阻抑物核酸可以是NGAL2或NGAL3多肽表达的反义阻抑物。在使用反义序列负调节基因表达中,将核苷酸序列以“相反方向”置于启动子的控制下,由此转录产生与从靶基因的“有义”链转录的正常mRNA互补的RNA。
反义阻抑物核酸可包含来自靶核苷酸序列的至少10个核苷酸的反义序列。可优选在用于负调节靶序列表达的序列与靶序列中存在完全序列相同性,但是序列的全部互补性或相似性不是必需的。在使用的来自靶基因的序列中有一或多个核苷酸可以不同。因此,根据本发明用于负调节基因表达的序列可以是选自那些可利用的野生型序列(例如基因)或者这种序列的变体。
序列不需要包括开放读框或者指定可翻译的RNA。优选杂交的各自的反义和有义RNA分子是足够同源的。即使在使用的序列与靶基因之间存在大约5%、10%、15%或20%或者更多的错配的情况中,也可以负调节基因表达。有效地,所述同源应足以发生负调节基因表达。
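The repressor nucleic acid may be operably linked to a tissue-specific or inducible promoter. For example, integument- and seed-specific promoters can be used to specifically down-regulate the SOD7 or NGAL3 nucleic acid in developing ovules and seeds to increase final seed size.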
如本文所述抑制NGAL2或NGAL3多肽表达的核酸可以与异源调节序列如与启动子可操纵地连接,例如组成型启动子、可诱导启动子、组织特异性启动子或者发育特异性启动子。所述构建体或载体可以如本文所述转化进植物细胞中并表达。包含这种载体的植物细胞也在本发明范围内。
另一方面,本发明涉及沉默构建体以沉默通过如本文所述方法可获得或者获得的NGAL2或NGAL3的表达,及涉及包含这种构建体的植物细胞。因此,本发明还涉及包含或者由SEQ ID NO:1、2或3或者其一部分内或者SEQ ID NO:1、2或3的同系物或者其一部分组成的核酸序列在沉默NGAL2或NGAL3的表达中的应用。用这种构建体转化的宿主细胞也在本发明范围内。
近年来,基因组编辑技术呈现出是常规诱变方法(如物理和化学诱变)或者使用转基因在植物中表达以产生具有农业学重要的改良的表型的突变体植物的方法的替代方法。这些技术应用序列特异性核酸酶(SSN),包括锌指核酸酶(ZFN)、转录激活物样效应子核酸酶(TALEN)及RNA引导的核酸酶Cas9(CRISPR/Cas9),其产生靶向的DNA双链断裂(DSB),然后主要通过易错非同源末端结合(NHEJ)或者高保真同源重组(HR)而修复。SSN已经用于在从模式植物拟南芥和烟草至重要的农作物如大麦、大豆、水稻和玉米的各个物种中产生靶向敲除植物。使用CRISPR/Cas9系统和TALEN在拟南芥和水稻中已经证实可遗传的基因修饰。
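Targeted genome modification or targeted genome editing is a genome engineering technique that uses targeted DNA double-strand breaks (DSBs) to stimulate genome editing through homologous recombination (HR)-mediated recombination events. To achieve efficient genome editing by introducing site-specific DNA DSBs, four main classes of customisable DNA-binding proteins can be used: meganucleases derived from microbial mobile genetic elements, ZF nucleases based on eukaryotic transcription factors, transcription activator-like effectors (TALEs) from Xanthomonas bacteria, and the RNA-guided DNA endonuclease Cas9 from the type II bacterial adaptive immune system CRISPR (clustered regularly interspaced short palindromic repeats). Meganuclease, ZF and TALE proteins all recognise specific DNA sequences through protein-DNA interactions. Whereas meganucleases integrate their nuclease and DNA-binding domains, ZF and TALE proteins consist of individual modules that target 3 nucleotides (nt) or 1 nt of DNA, respectively. ZFs and TALEs can be assembled in the desired combination and attached to the nuclease domain of FokI to direct nucleolytic activity towards specific genomic loci.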
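Upon delivery into host cells via the bacterial type III secretion system, TAL effectors enter the nucleus, bind effector-specific sequences in host gene promoters and activate transcription. Their targeting specificity is determined by a central domain of tandem 33-35 amino acid repeats, followed by a single truncated repeat of 20 amino acids. Most naturally occurring TAL effectors examined have 12-27 full repeats.
These repeats differ from one another only in two adjacent amino acids, the repeat-variable di-residue (RVD). The RVD determines which single nucleotide the TAL effector will recognise: one RVD corresponds to one nucleotide, and the four most common RVDs each preferentially bind one of the four bases. Naturally occurring recognition sites are uniformly preceded by a T, which is required for TAL effector activity. TAL effectors can be fused to the catalytic domain of the FokI nuclease to create a TAL effector nuclease (TALEN), which makes targeted DNA double-strand breaks (DSBs) in vivo for genome editing. The use of this technology in genome editing is well described in the art, for example in US 8,440,431, US 8,440,432 and US 8,450,471. Reference 30 describes a set of customised plasmids that can be used with the Golden Gate cloning method to assemble multiple DNA fragments. As described therein, the Golden Gate method uses type IIS restriction endonucleases, which cleave outside their recognition sites to create unique 4 bp overhangs. Cloning is expedited by digesting and ligating in the same reaction mixture, because correct assembly eliminates the enzyme recognition site. Assembly of a custom TALEN or TAL effector construct involves two steps: (i) assembly of repeat modules into intermediary arrays of 1-10 repeats, and (ii) joining of the intermediary arrays into a backbone to make the final construct.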
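The one-RVD-per-nucleotide rule can be sketched in Python as follows. The text does not name the four common RVDs; the mapping below (NI for A, HD for C, NN for G, NG for T) is the commonly cited code and is stated here as an assumption, as is the hypothetical function name. NN is also known to bind A, so a real design would weigh that ambiguity.

```python
from typing import List

# Assumed mapping from target base to its most common RVD.
# NN can also recognise A; it is used for G here for simplicity.
RVD_FOR_BASE = {"A": "NI", "C": "HD", "G": "NN", "T": "NG"}


def design_rvd_array(site: str) -> List[str]:
    """Return one RVD per nucleotide of a target site that starts with the required 5' T.

    The leading T is not assigned an RVD: it is the base that precedes
    the sequence recognised by the repeat array.
    """
    site = site.upper()
    if not site.startswith("T"):
        raise ValueError("target site must be preceded by a T")
    return [RVD_FOR_BASE[base] for base in site[1:]]


# Hypothetical 5-nt example: the T is required, then one RVD each for A, C, G, T.
print(design_rvd_array("TACGT"))
```

A practical design would typically also keep the array length within the 12-27 repeat range that the text reports for natural TAL effectors.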
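Another genome editing method that can be used according to the various aspects of the invention is CRISPR. The use of this technology in genome editing is well described in the art, for example in US 8,697,359 and the references cited therein. In short, CRISPR is a microbial nuclease system involved in defence against invading phages and plasmids. CRISPR loci in microbial hosts contain a combination of CRISPR-associated (Cas) genes and non-coding RNA elements capable of programming the specificity of CRISPR-mediated nucleic acid cleavage (sgRNA). Three types (I-III) of CRISPR system have been identified across a wide range of bacterial hosts. A key feature of each CRISPR locus is an array of repetitive sequences (direct repeats) interspaced by short stretches of non-repetitive sequence (spacers). The non-coding CRISPR array is transcribed and cleaved within the direct repeats into short crRNAs containing individual spacer sequences, which direct Cas nucleases to the target site (protospacer). The type II CRISPR system is the best characterised and carries out targeted DNA double-strand breaks in four sequential steps. First, two non-coding RNAs, the pre-crRNA array and tracrRNA, are transcribed from the CRISPR locus. Second, tracrRNA hybridises to the repeat regions of the pre-crRNA and mediates processing of the pre-crRNA into mature crRNAs containing individual spacer sequences. Third, the mature crRNA:tracrRNA complex directs Cas9 to the target DNA through Watson-Crick base pairing between the spacer on the crRNA and the protospacer on the target DNA next to the protospacer adjacent motif (PAM), an additional requirement for target recognition. Finally, Cas9 mediates cleavage of the target DNA to create a double-strand break within the protospacer.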
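Cas9 is thus the hallmark protein of the type II CRISPR-Cas system. It is a large monomeric DNA nuclease that is guided to a DNA target sequence adjacent to the PAM (protospacer adjacent motif) sequence by a complex of two non-coding RNAs: CRISPR RNA (crRNA) and trans-activating crRNA (tracrRNA). The Cas9 protein contains two nuclease domains homologous to the RuvC and HNH nucleases. The HNH nuclease domain cleaves the complementary DNA strand, whereas the RuvC-like domain cleaves the non-complementary strand, so that a blunt cut is introduced in the target DNA. Heterologous expression of Cas9 together with an sgRNA can introduce site-specific double-strand breaks (DSBs) into the genomic DNA of living cells from various organisms. For applications in eukaryotic organisms, codon-optimised versions of Cas9, which originates from the bacterium Streptococcus pyogenes, are used.
The single guide RNA (sgRNA) is the other component of the CRISPR/Cas system and forms a complex with the Cas9 nuclease. The sgRNA is a synthetic RNA chimera created by fusing crRNA with tracrRNA. The guide sequence located at the 5' end of the sgRNA confers DNA target specificity. Therefore, by modifying the guide sequence, sgRNAs with different target specificities can be created. The canonical length of the guide sequence is 20 nucleotides. In plants, sgRNAs have been expressed using plant RNA polymerase III promoters such as U6 and U3.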
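Using these techniques, conserved domains can be specifically targeted to abolish the function of the NGAL2 and/or NGAL3 polypeptide.
For example, the conserved B3 domain or the repression motif may be targeted.
Thus, another embodiment of the invention relates to a mutant plant, plant cell or part thereof characterised in that the activity of the NGAL2 polypeptide is altered, wherein the plant expresses a nucleic acid that comprises a mutant SEQ ID NO: 1 or 2 and encodes a mutant NGAL2 polypeptide, or a functional homologue or variant thereof, for example a nucleic acid carrying a mutation in the B3 or repression domain.
Thus, another embodiment of the invention relates to a mutant plant, plant cell or part thereof characterised in that the activity of the NGAL3 polypeptide is altered, wherein the plant expresses a nucleic acid that comprises a mutant SEQ ID NO: 4 and encodes a mutant NGAL3 polypeptide, or a functional homologue or variant thereof, carrying a mutation in the B3 or repression domain.
In a preferred embodiment, the invention relates to a mutant plant, plant cell or part thereof characterised in that the activities of the NGAL2 and NGAL3 polypeptides are altered, wherein the plant expresses a nucleic acid that comprises a mutant SEQ ID NO: 1 or 2 and encodes a mutant NGAL2 polypeptide, or a functional homologue or variant thereof, for example a nucleic acid carrying a mutation in the B3 or repression domain, and wherein the plant also expresses a nucleic acid that comprises a mutant SEQ ID NO: 4 and encodes a mutant NGAL3 polypeptide carrying a mutation in the B3 or repression domain.
Mutations in the SOD7 and/or NGAL3 promoter region that result in loss of function are also within the scope of the invention.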
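Constructs designed, as shown herein, using genome editing techniques that knock out or knock down NGAL2 or NGAL3, and host cells comprising these constructs, are also within the scope of the invention. In one embodiment, the construct comprises or consists of a sequence selected from SEQ ID NO: 155, 156, 157 or 158. Thus, in another aspect of the invention, there is provided a nucleic acid construct comprising a sequence selected from SEQ ID NO: 155, 156, 157 or 158. In yet another aspect of the invention, there is provided a nucleic acid construct comprising at least one CRISPR target sequence, wherein the target sequence is selected from SEQ ID NO: 159, 160, 161, 162 and 163. Preferably, the construct comprises at least two CRISPR target sequences, preferably SEQ ID NO: 159 and 160, or SEQ ID NO: 161 and 162, or SEQ ID NO: 161 and 163, or SEQ ID NO: 159 and 163.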
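In another embodiment of the methods of the invention, NGAL2 and/or NGAL3 activity may be inactivated, repressed or down-regulated by manipulating the expression of an inhibitor of SOD7 and/or NGAL3 in a plant, for example a transgenic plant. For example, a gene expressing a protein that inhibits expression of the SOD7 and/or NGAL3 gene, or the activity of the SOD7 and/or NGAL3 protein, may be introduced into the plant and over-expressed. The inhibitor may interact with the regulatory sequences that direct SOD7 and/or NGAL3 gene expression so as to down-regulate or repress that expression. For example, the inhibitor may be a transcriptional repressor. Alternatively, it may interact with and inhibit a transcriptional regulator, such as a transcription factor that positively regulates SOD7 and/or NGAL3 gene expression. Alternatively, the inhibitor may interact directly with the NGAL2 and/or NGAL3 protein to inhibit its activity, or with a regulator of the NGAL2 and/or NGAL3 protein. For example, the activity of the NGAL2 and/or NGAL3 protein may be inactivated, repressed or down-regulated by manipulating a post-transcriptional modification of the protein that results in reduced or lost activity.
In one embodiment, the methods of the invention comprise comparing the activity of the NGAL2 and/or NGAL3 polypeptide and/or the expression of the SOD7 and/or NGAL3 gene with the activity of the NGAL2 and/or NGAL3 polypeptide and/or the expression of the SOD7 and/or NGAL3 gene in a control plant.
In another aspect, the invention relates to a plant obtainable or obtained by a method described herein.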
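In another aspect, the invention relates to an expression cassette comprising an isolated nucleic acid sequence having or consisting of the sequence shown in SEQ ID NO: 1 or 2, or a functional part, variant, homologue or orthologue thereof, operably linked to a regulatory element. In another aspect, the invention relates to an expression cassette comprising an isolated nucleic acid sequence having or consisting of the sequence shown in SEQ ID NO: 4, or a functional part, variant, homologue or orthologue thereof, operably linked to a regulatory element. The regulatory element may be a promoter. The invention also relates to a vector comprising such an expression cassette, and to a composition comprising both of the expression cassettes described above.
In the methods described herein, plants may be regenerated from plants transformed or genetically altered as described above, and their phenotype, in particular their seed phenotype, analysed by known methods.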
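Transformation methods are known in the art. A nucleic acid sequence is introduced into the plant by a process called transformation. The term "introduction" or "transformation" as used herein encompasses the transfer of an exogenous polynucleotide into a host cell, irrespective of the method used for transfer. Plant tissue capable of subsequent clonal propagation, whether by organogenesis or embryogenesis, may be transformed with a genetic construct of the invention and a whole plant regenerated from it. The particular tissue chosen will vary depending on the clonal propagation systems available for, and best suited to, the particular species being transformed. Exemplary tissue targets include leaf discs, pollen, embryos, cotyledons, hypocotyls, megagametophytes, callus tissue, existing meristematic tissue (for example apical meristem, axillary buds and root meristems) and induced meristem tissue (for example cotyledon meristem and hypocotyl meristem). The polynucleotide may be transiently or stably introduced into a host cell and may be maintained non-integrated, for example as a plasmid. Alternatively, it may be integrated into the host genome. The resulting transformed plant cell may then be used to regenerate a transformed plant in a manner known to persons skilled in the art.
The transfer of foreign genes into the genome of a plant is called transformation. Transformation of plants is now a routine technique in many species. Advantageously, any of several transformation methods may be used to introduce the gene of interest into a suitable ancestor cell. The methods described for the transformation and regeneration of plants from plant tissues or plant cells may be used for transient or stable transformation. Transformation methods include the use of liposomes, electroporation, chemicals that increase free DNA uptake, injection of the DNA directly into the plant, particle gun bombardment, transformation using viruses or pollen, and microinjection. Methods may be selected from the calcium/polyethylene glycol method for protoplasts, electroporation of protoplasts, microinjection into plant material, DNA- or RNA-coated particle bombardment, infection with (non-integrative) viruses and the like. Transgenic plants, preferably transgenic crop plants, are preferably produced by Agrobacterium tumefaciens-mediated transformation.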
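To select transformed plants, the plant material obtained in the transformation is, as a rule, subjected to selective conditions so that transformed plants can be distinguished from untransformed plants. For example, the seeds obtained in the manner described above can be planted and, after an initial growing period, subjected to a suitable selection by spraying. A further possibility is to grow the seeds, if appropriate after sterilisation, on agar plates using a suitable selection agent, so that only the transformed seeds can grow into plants. Alternatively, the transformed plants are screened for the presence of a selectable marker such as those described above. Following DNA transfer and regeneration, putatively transformed plants may also be evaluated, for instance using Southern analysis, for the presence, copy number and/or genomic organisation of the gene of interest. Alternatively or additionally, expression levels of the newly introduced DNA may be monitored using Northern and/or Western analysis, both techniques being well known to the skilled person.
The transformed plants generated may be propagated by a variety of means, such as clonal propagation or classical breeding techniques. For example, a first-generation (T1) transformed plant may be selfed and homozygous second-generation (T2) transformants selected, and the T2 plants may then be propagated further by classical breeding techniques. The transformed organisms generated may take a variety of forms. For example, they may be chimeras of transformed and non-transformed cells; clonal transformants (for example, all cells transformed to contain the expression cassette); or grafts of transformed and untransformed tissues (for example, in plants, a transformed rootstock grafted to an untransformed scion).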
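Unless otherwise specified, the various aspects of the invention described herein clearly extend to any plant cell or any plant produced, obtained or obtainable by any of the methods described herein, and to all plant parts and propagules thereof. The invention extends further to encompass the progeny of a primary transformed or transfected cell, tissue, organ or whole plant produced by any of the aforementioned methods, the only requirement being that the progeny exhibit the same genotypic and/or phenotypic characteristic(s) as those produced by the parent in the methods of the invention.
The invention also extends to harvestable parts of a plant of the invention as described above, such as, but not limited to, seeds, leaves, fruits, flowers, stems, roots, rhizomes, tubers and bulbs. The invention furthermore relates to products derived, preferably directly derived, from a harvestable part of such a plant, such as dry pellets or powders, oil, fat and fatty acids, starch or proteins. The invention also relates to food products and food supplements comprising the plant of the invention or parts thereof.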
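While the foregoing disclosure provides a general description of the subject matter encompassed within the scope of the invention, including methods of making and using the invention and the best mode thereof, the following examples are provided to further enable those skilled in the art to practise the invention and to provide a complete written description thereof. However, those skilled in the art will appreciate that the specifics of these examples should not be read as limiting the invention, the scope of which should be apprehended from the appended claims and their equivalents. Various further aspects and embodiments of the invention will be apparent to those skilled in the art in view of the present disclosure.
All documents mentioned in this specification are incorporated herein by reference in their entirety, including references to gene and protein accession numbers.
As used herein, and unless otherwise specified, "and/or" is to be taken as specific disclosure of each of the multiple specified features or components, with or without the others. For example, "A, B and/or C" is to be taken as specific disclosure of each of (i) A, (ii) B, (iii) C, (iv) A and B, (v) B and C or (vi) A and B and C, just as if each were set out individually.
Unless otherwise specified, the descriptions and definitions of the features set out above are not limited to any particular aspect or embodiment of the invention and apply equally to all aspects and embodiments described.
The invention is further described in the following non-limiting examples.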
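Examples
Methods
Plant materials and growth conditions
Arabidopsis Columbia (Col-0) was used as the wild-type line. da1-1, sod7-1D, sod7-ko1 and ngal3-ko1 were in the Col-0 background. sod7-1D was identified as a suppressor of da1-1 by T-DNA activation tagging. sod7-ko1 (SM_3_34191) and ngal3-ko1 (SM_3_36641) were identified in AtIDB (www.atidb.org) and obtained from the Arabidopsis Stock Centre NASC collection. T-DNA insertions were confirmed by PCR and sequencing using the primers shown in Table 1. Arabidopsis plants were grown under long-day conditions (16 hours light/8 hours dark) at 22°C.
Activation tagging screen
The activation tagging plasmid pJFAT260 was introduced into da1-1 mutant plants using Agrobacterium tumefaciens strain GV3101 (Fan et al., 2009; Fang et al., 2012), and T1 plants were selected using the herbicide Basta. Seeds produced from the T1 plants were used to isolate modifiers of da1-1.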
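Morphological and cellular analysis
To measure seed size, dry seeds of the wild type and mutants were photographed under a Leica microscope (LEICA S8APO) using a Leica CCD (DFC420). The projected area of wild-type and mutant seeds was measured using Image J software. Average seed weight was determined by weighing mature dry seeds in batches of 100 using an electronic analytical balance (METTLER TOLEDO AL104, China). The weights of five sample batches were measured for each seed lot. Fully expanded cotyledons, petals (stage 14) and leaves were scanned to produce digital images for area measurement. To measure cell number and cell size, petals, leaves, ovules and seeds were placed in a drop of clearing solution [30 ml H2O, 80 g chloral hydrate (Sigma, C8383), 10 ml 100% glycerol (Sigma, G6279)]. Cleared samples were imaged under a Leica microscope (LEICA DM2500) with differential interference contrast (DIC) optics and photographed with a SPOT FLEX Cooled CCD digital imaging system. Area measurements were made using Image J software.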
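Cloning of the SOD7 gene
The flanking sequences of the T-DNA insertion in the sod7-1D mutant were identified by thermal asymmetric interlaced PCR (TAIL-PCR) according to a previously reported method (Liu et al., 1995). In brief, TAIL-PCR uses three nested specific primers (OJF22, OJF23 and OJF24) located within the T-DNA region of the pJFAT260 vector together with a shorter arbitrary degenerate primer (AD1), so that the relative amplification efficiencies of specific and non-specific products can be thermally controlled. The TAIL-PCR products were sequenced using primer OJF24. The specific primers OJF22, OJF23 and OJF24 and the arbitrary degenerate primer (AD1) are described in Table 1.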
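Constructs and plant transformation
The 35S:GFP-SOD7, pSOD7:SOD7-GFP and pSOD7:GUS constructs were generated using a PCR-based Gateway system. The coding sequence (CDS) of SOD7 was amplified using primers SOD7CDS-F and SOD7CDS-R (Table 1). The PCR product was cloned into the pCR8/TOPO TA cloning vector. The SOD7 CDS was then subcloned into the binary vector pMDC43 containing the GFP gene to generate the transformation plasmid 35S:GFP-SOD7. A SOD7 genomic sequence containing a 2040-bp promoter sequence and the 2104-bp SOD7 gene was amplified using primers SOD7G-F and SOD7G-R (Table 1). The PCR product was cloned into the pCR8/TOPO TA cloning vector. The SOD7 genomic sequence was then subcloned into the binary vector pMDC107 containing the GFP gene to generate the transformation plasmid pSOD7:SOD7-GFP. A 2262-bp SOD7 promoter sequence was amplified using primers SOD7P-F and SOD7P-R (Table 1). The PCR product was cloned into the pCR8/TOPO TA cloning vector. The SOD7 promoter was then subcloned into the binary vector pGWB3 containing the GUS gene to generate the transformation plasmid pSOD7:GUS. The plasmids 35S:GFP-SOD7, pSOD7:SOD7-GFP and pSOD7:GUS were each introduced into Col-0 or sod7-ko1 ngal3-ko1 plants using Agrobacterium tumefaciens GV3101, and transformants were selected on medium containing hygromycin (30 μg/ml). The SOD7 cDNA was cloned into the ApaI and SpeI sites of the binary vector pER8 to generate the chemically inducible construct pER8-SOD7. The specific primers for the pER8-SOD7 construct were SOD7-ER-F and SOD7-ER-R. The plasmid pER8-SOD7 was introduced into Col-0 plants using Agrobacterium tumefaciens GV3101, and transformants were selected on medium containing hygromycin (30 μg/ml).
GUS staining
Samples (pSOD7:GUS) were stained in GUS staining solution (1 mM X-gluc, 50 mM NaPO4 buffer, 0.4 mM K3Fe(CN)6/K4Fe(CN)6 and 0.1% (v/v) Triton X-100) and incubated at 37°C for 3 hours. After GUS staining, chlorophyll was removed with 70% ethanol.
RT-PCR and quantitative real-time RT-PCR
Total RNA was extracted from Arabidopsis seedlings using the RNAprep pure Plant kit (TIANGEN). mRNA was reverse transcribed into cDNA using SuperScript III reverse transcriptase (Invitrogen). cDNA samples were standardised on the amount of ACTIN2 transcript using primers ACTIN2-F and ACTIN2-R (Table 1). Quantitative real-time RT-PCR analysis was performed on a Lightcycler 480 machine (Roche) using Lightcycler 480 SYBR Green I Master (Roche). ACTIN2 mRNA was used as an internal control, and the relative amount of mRNA was calculated using the comparative cycle threshold method. The primers used for RT-PCR and quantitative real-time RT-PCR are described in Table 1.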
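The comparative cycle threshold calculation mentioned above can be sketched as follows. This is a minimal illustration that assumes 100% amplification efficiency (a doubling per cycle); the function name and all Ct values are hypothetical and are not data from this study.

```python
def relative_expression(ct_target_sample: float, ct_ref_sample: float,
                        ct_target_control: float, ct_ref_control: float) -> float:
    """Relative mRNA level of a target gene by the comparative Ct (2^-ddCt) method.

    Each target Ct is first normalised to the internal reference gene
    (ACTIN2 in the text); the treated sample is then compared with the control.
    """
    delta_sample = ct_target_sample - ct_ref_sample      # dCt, treated sample
    delta_control = ct_target_control - ct_ref_control   # dCt, control sample
    return 2.0 ** -(delta_sample - delta_control)        # 2^-ddCt


# Hypothetical Ct values: target Ct rises by 2 cycles while ACTIN2 is unchanged.
fold = relative_expression(ct_target_sample=26.0, ct_ref_sample=20.0,
                           ct_target_control=24.0, ct_ref_control=20.0)
print(fold)
```

With these invented numbers the target transcript is at one quarter of its control level, because a shift of two cycles corresponds to a fourfold difference in template.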
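Chromatin immunoprecipitation (ChIP) assay
The chromatin immunoprecipitation (ChIP) assay was performed as previously described (Gendrel et al., 2005) with minor modifications. In brief, 35S:GFP and 35S:GFP-SOD7 transgenic seeds were grown on 1/2 MS plates for 10 days. Seedlings were cross-linked with 1% formaldehyde under vacuum for 15 minutes, and cross-linking was stopped with 0.125 M glycine. Samples were ground in liquid nitrogen and nuclei were isolated. Chromatin was immunoprecipitated with anti-GFP (Roche, 11814460001) and protein A+G beads (Millipore Magna ChIP Protein A+G Magnetic Beads, 16-663). DNA was precipitated with glycogen, NaOAc and ethanol, washed with 70% ethanol and dissolved in 60 μl of water. Gene-specific primers (PF1-F, PF1-R, PF2-F, PF2-R, ACTIN7-ChIP-F and ACTIN7-ChIP-R) were used to quantify the enrichment of each fragment (Table 1).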
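DNA electrophoretic mobility shift assay (EMSA)
The coding sequence of SOD7 was cloned into the NdeI and BamHI sites of the pMAL-C2 vector to generate the construct MBP-SOD7. The MBP-SOD7 fusion protein was expressed in Escherichia coli BL21 (DE3) (Biomed) and purified on amylose resin (New England Biolabs). Biotin-labelled and unlabelled probes were synthesised as forward and reverse strands. The forward and reverse strands were then incubated in solution (50 mM Tris-HCl, 5 mM EDTA and 250 mM NaCl) at 95°C for 10 minutes and annealed at room temperature to form double-stranded probes. The gel shift assay was performed as previously described (Smaczniak et al., 2012).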
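Results
sod7-1D suppresses the seed size phenotype of da1-1
We previously identified the ubiquitin receptor DA1 as a negative regulator of seed size in Arabidopsis (Li et al., 2008). The da1-1 mutant forms larger seeds as a result of increased cell proliferation in the maternal integuments (Li et al., 2008; Xia et al., 2013). To identify new components of the DA1 pathway or other regulators of seed size, we initiated a T-DNA activation tagging screen for modifiers of da1-1 (Fang et al., 2012). A dominant suppressor of da1-1 (sod7-1D) was isolated from seeds produced by approximately 16,000 T1 plants (Figure 1A). Seeds of the sod7-1D da1-1 double mutant were significantly smaller and lighter than da1-1 seeds (Figure 1A, E and F). The embryo constitutes the major volume of a mature seed in Arabidopsis. sod7-1D da1-1 embryos were smaller than da1-1 embryos (Figure 1B). The size of sod7-1D da1-1 cotyledons was significantly reduced compared with da1-1 cotyledons (Figure 1G). In addition, the sod7-1D da1-1 double mutant formed smaller leaves and flowers than da1-1 (Figure 1C and 1D). These results therefore show that the sod7-1D mutation suppresses the seed and organ size phenotypes of da1-1.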
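sod7-1D produces small seeds
We isolated the sod7-1D single mutant from the F2 progeny of a cross between the wild type (Col-0) and sod7-1D da1-1. sod7-1D seeds were significantly smaller and lighter than wild-type seeds (Figure 2A, B, G and H). We further isolated and examined embryos from mature wild-type and sod7-1D seeds. sod7-1D embryos were clearly smaller than wild-type embryos (Figure 2C and D). The change in seed size was also reflected in seedling size (Figure 2E and F). Cotyledons of 10-day-old sod7-1D seedlings were clearly smaller than wild-type cotyledons (Figure 2E, F and I). In addition, the sod7-1D mutant had smaller leaves and flowers than the wild type. The reduced size of sod7-1D leaves and petals was not caused by smaller cells, indicating that the sod7-1D mutation reduces cell number. In fact, the average area of epidermal cells in sod7-1D petals was larger than in wild-type petals, suggesting a possible compensation mechanism between cell number and cell size.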
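SOD7 encodes the B3 domain transcriptional repressor NGAL2
To determine whether the seed and organ size phenotypes of sod7-1D were caused by the T-DNA insertion, we first analysed the genetic linkage between the mutant phenotype and Basta resistance, which is conferred by the selectable marker of the activation tagging vector (Fan et al., 2009). In the T2 population, 181 plants with the sod7-1D da1-1 phenotype were resistant, whereas 55 plants with the da1-1 phenotype were sensitive, indicating that the insertion co-segregates with the sod7-1D phenotype. To clone the SOD7 gene, we isolated the T-DNA flanking sequence using thermal asymmetric interlaced PCR (Liu et al., 1995). DNA sequencing showed that the T-DNA was inserted approximately 5.6 kb upstream of At3g11580 and approximately 3.7 kb upstream of At3g11590 (Figure 3A). To determine which gene was associated with the sod7-1D phenotype, we examined the mRNA levels of the two genes. At3g11590 mRNA accumulated to similar levels in sod7-1D da1-1 and da1-1, suggesting that At3g11590 is not the SOD7 gene (Figure 3B). By contrast, the expression level of At3g11580 was markedly higher in sod7-1D da1-1 plants than in da1-1 plants, suggesting that At3g11580 is the SOD7 gene (Figure 3B). To confirm further that the sod7-1D phenotype is caused by ectopic At3g11580 expression, the At3g11580 gene was overexpressed (35S:GFP-SOD7) in wild-type plants (Col-0) and 37 transgenic plants were isolated. Most transgenic lines showed small seeds and organs (Figure 3D-F), similar to those observed in the sod7-1D single mutant, indicating that At3g11580 is the SOD7 gene. The SOD7 gene encodes a NGATHA-like protein (NGAL2) containing a B3 DNA-binding domain and a transcriptional repression motif (Figure 3C) (Alvarez et al., 2009; Ikeda and Ohme-Takagi, 2009; Trigueros et al., 2009). SOD7 belongs to the RAV gene family, which consists of 13 members in Arabidopsis (Figure 10) (Swaminathan et al., 2008). Several members of the RAV family contain a putative transcriptional repression motif, including NGA1, NGA2, NGA3, NGA4, NGAL1, NGAL2/SOD7 and NGAL3 (Figure 10) (Ikeda and Ohme-Takagi, 2009). The transcriptional repression motifs in NGA1, NGAL1 and NGAL2/SOD7 are known to have repressive activity (Ikeda and Ohme-Takagi, 2009), indicating that these proteins are transcriptional repressors. SOD7 shows the highest similarity to Arabidopsis NGAL3/DEVELOPMENT-RELATED PcG TARGET IN THE APEX 4 (DPA4) (Figure 10), which has a known role in regulating leaf serration (Engelhorn et al., 2012) but whose function in seed size control had not previously been identified.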
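The segregation counts reported above (181 resistant : 55 sensitive) can be checked against a simple Mendelian expectation. The text reports only the counts; the 3:1 ratio tested below is an assumption added here for illustration (it is what a single dominant insertion would give in a selfed T2 population), and the function name is hypothetical.

```python
def chi_square(observed, ratio):
    """Pearson chi-square statistic for observed counts against an expected ratio."""
    total = sum(observed)
    expected = [total * part / sum(ratio) for part in ratio]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Counts from the T2 population described in the text: resistant, sensitive.
observed = [181, 55]

# Assumed expectation: 3 resistant : 1 sensitive.
stat = chi_square(observed, [3, 1])

# Standard critical value of the chi-square distribution, 1 degree of freedom, p = 0.05.
CRITICAL_DF1_P05 = 3.841
consistent = stat < CRITICAL_DF1_P05

print(round(stat, 4), consistent)
```

Under this assumption the expected counts are 177 and 59, the statistic is about 0.36, and the observed segregation does not deviate significantly from 3:1.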
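Expression pattern and subcellular localisation of SOD7
To monitor the expression pattern of SOD7 during development, the pSOD7:GUS and pSOD7:SOD7-GFP vectors were constructed and each was transformed into wild-type plants. The tissue-specific expression pattern of SOD7 was examined using a histochemical assay for GUS activity. In seedlings, relatively higher GUS activity was detected in younger leaves than in older leaves (Figure 4A-C). In flowers, GUS activity was observed in sepals, petals, stamens and carpels (Figure 4D-K), and was stronger in younger floral organs than in older ones (Figure 4D-K). SOD7 expression was also detected in ovules (Figure 4L). These analyses therefore indicate that SOD7 is a temporally and spatially expressed gene. Because SOD7 encodes a B3 domain transcriptional repressor, we reasoned that SOD7 would be located in the nucleus. To determine the subcellular localisation of SOD7, we observed GFP fluorescence in pSOD7:SOD7-GFP transgenic plants. As shown in Figure 4M-O, the GFP signal was detected only in nuclei. We also expressed a GFP-SOD7 fusion protein in wild-type plants under the control of the 35S promoter. Transgenic lines overexpressing GFP-SOD7 formed smaller seeds than the wild type (Figure 3D), indicating that the GFP-SOD7 fusion protein is functional. As shown in Figure 4P-R, GFP fluorescence in 35S:GFP-SOD7 transgenic plants was observed only in nuclei. These results therefore show that SOD7 is a nuclear-localised protein.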
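SOD7/NGAL2 acts redundantly with NGAL3 to control seed size
To investigate further the function of SOD7 in seed size control, we isolated T-DNA insertion loss-of-function mutants of SOD7 and of its most closely related family member, NGAL3. sod7-ko1 (SM_3_34191) was identified as carrying a T-DNA insertion in the first exon of the SOD7 gene (Figure 5A). ngal3-ko1 (SM_3_36641) carries a T-DNA insertion in the first exon of the NGAL3 gene (Figure 5B). The T-DNA insertions were confirmed by PCR using T-DNA-specific and flanking primers and by sequencing the PCR products. The sod7-ko1 and ngal3-ko1 mutants had no detectable full-length SOD7 and NGAL3 transcripts, respectively. Seeds of the sod7-ko1 and ngal3-ko1 mutants were slightly larger and heavier than wild-type seeds (Figure 5C, G and H). The cotyledon area of the sod7-ko1 and ngal3-ko1 mutants was increased compared with the wild type (Figure 5I). Given that SOD7 shows the highest similarity to NGAL3, we hypothesised that SOD7 acts redundantly with NGAL3 to influence seed size. To test this, we generated the sod7-ko1 ngal3-ko1 double mutant. As shown in Figure 5C, D, G and H, the seed size and weight phenotypes of the sod7-ko1 mutant were synergistically enhanced by disruption of NGAL3, indicating that SOD7 acts redundantly with NGAL3 to control seed size. We further measured the cotyledon area of 10-day-old seedlings. The ngal3-ko1 mutation was again observed to enhance the cotyledon size of sod7-ko1 synergistically (Figure 5I). In addition, the sod7-ko1 ngal3-ko1 double mutant formed larger leaves and flowers than its parental lines (Figure 5E and F; Figure 11). These results therefore indicate that SOD7 and NGAL3 act redundantly to control seed and organ growth.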
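SOD7 acts maternally to control seed size
Because seed size is determined by zygotic and/or maternal tissues (Garcia et al., 2005; Xia et al., 2013; Du et al., 2014), we investigated whether SOD7 acts maternally or zygotically. Reciprocal crosses were therefore performed between the wild type and sod7-ko1 ngal3-ko1. The effect of sod7-ko1 ngal3-ko1 on seed size was observed only when sod7-ko1 ngal3-ko1 was used as the maternal plant (Figure 6A). Seeds from sod7-ko1 ngal3-ko1 plants pollinated with wild-type pollen were similar in size to seeds from self-pollinated sod7-ko1 ngal3-ko1 plants (Figure 6A). Conversely, seeds from wild-type plants pollinated with sod7-ko1 ngal3-ko1 pollen were similar in size to seeds from self-pollinated wild-type plants (Figure 6A). These results indicate that sod7-ko1 ngal3-ko1 acts maternally to influence seed size. We further examined the size of Col-0/Col-0 F2, Col-0/sod7-ko1 ngal3-ko1 F2, sod7-ko1 ngal3-ko1/Col-0 F2 and sod7-ko1 ngal3-ko1/sod7-ko1 ngal3-ko1 F2 seeds. As shown in Figure 6B, sod7-ko1 ngal3-ko1/sod7-ko1 ngal3-ko1 F2 seeds were larger than wild-type seeds, whereas Col-0/sod7-ko1 ngal3-ko1 F2 seeds and sod7-ko1 ngal3-ko1/Col-0 F2 seeds were similar to wild-type seeds. These results therefore indicate that the embryo and endosperm phenotypes for SOD7 do not determine seed size, and that SOD7 is required in the sporophytic tissue of the maternal plant to control seed growth.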
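SOD7 regulates cell proliferation in the maternal integuments
The reciprocal crosses show that SOD7 acts maternally to influence seed size. The integuments surrounding the ovule are maternal tissues that may set the growth potential of the seed coat after fertilisation. Consistent with this view, several studies have shown that integument size influences final seed size in Arabidopsis (Garcia et al., 2005; Schruff et al., 2006; Adamski et al., 2009; Xia et al., 2013; Du et al., 2014). We therefore asked whether SOD7 acts through the maternal integuments to determine seed size. To this end, we characterised mature ovules of the wild type and sod7-ko1 ngal3-ko1. As shown in Figure 6C and D, sod7-ko1 ngal3-ko1 ovules were clearly larger than wild-type ovules. The outer integument length of sod7-ko1 ngal3-ko1 ovules was significantly increased compared with that of wild-type ovules (Figure 6E). Because integument size is determined by cell proliferation and cell expansion, we examined the number and size of outer integument cells in wild-type and sod7-ko1 ngal3-ko1 ovules. As shown in Figure 6F, the number of outer integument cells was increased in sod7-ko1 ngal3-ko1 ovules compared with wild-type ovules. By contrast, the length of outer integument cells in sod7-ko1 ngal3-ko1 ovules was similar to that in wild-type ovules (Figure 6G). These results show that SOD7 is required for cell proliferation in the maternal integuments of ovules. After fertilisation, cells in the integuments mainly undergo expansion but still divide. We further examined the number and size of outer integument cells in wild-type and sod7-ko1 ngal3-ko1 seeds at 6 and 8 days after pollination (DAP). In wild-type seeds, the number of outer integument cells at 6 DAP was similar to that at 8 DAP (Figure 6F), indicating that cells in the outer integuments of wild-type seeds have completely stopped dividing by 6 DAP. Similarly, cells in the outer integuments of sod7-ko1 ngal3-ko1 seeds had also stopped dividing by 6 DAP. The number of outer integument cells was significantly increased in sod7-ko1 ngal3-ko1 seeds compared with wild-type seeds (Figure 6F). By contrast, the length of outer integument cells in sod7-ko1 ngal3-ko1 seeds was not reduced compared with wild-type seeds (Figure 6G). These analyses therefore indicate that SOD7 is required for cell proliferation in the maternal integuments of ovules and developing seeds.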
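SOD7 acts in a common pathway with KLU to control seed size, but independently of DA1
Arabidopsis klu mutants form smaller seeds because of reduced cell proliferation in the integuments, whereas plants overexpressing KLU/CYP78A5 produce larger seeds as a result of increased cell proliferation in the integuments (Adamski et al., 2009), suggesting that SOD7 and KLU may act antagonistically in a common pathway to control seed growth. To test for a genetic interaction between SOD7 and KLU, we generated the klu-4 sod7-ko1 ngal3-ko1 triple mutant and measured the seed size of the wild type, klu-4, sod7-ko1 ngal3-ko1 and klu-4 sod7-ko1 ngal3-ko1. As shown in Figure 7A and B, the average size and weight of klu-4 sod7-ko1 ngal3-ko1 seeds were similar to those of the klu-4 single mutant, indicating that klu-4 is epistatic to sod7-ko1 ngal3-ko1 with respect to seed size and weight. We further examined mature ovules of wild-type, klu-4, sod7-ko1 ngal3-ko1 and klu-4 sod7-ko1 ngal3-ko1 plants. The outer integument length of klu-4 sod7-ko1 ngal3-ko1 ovules was similar to that of klu-4 ovules (Figure 7C). Similarly, at 8 DAP, the outer integument length of klu-4 sod7-ko1 ngal3-ko1 seeds was indistinguishable from that of klu-4 seeds (Figure 7C). In addition, the size of klu-4 sod7-ko1 ngal3-ko1 petals was similar to that of klu-4 petals.
These genetic analyses therefore show that klu-4 is epistatic to sod7-ko1 ngal3-ko1 with respect to seed and organ size, indicating that SOD7 and KLU act antagonistically in a common pathway to control seed and organ growth. To understand the cellular basis of the epistatic interaction between SOD7 and KLU, we examined the number of outer integument cells in ovules and developing seeds of wild-type, klu-4, sod7-ko1 ngal3-ko1 and klu-4 sod7-ko1 ngal3-ko1 plants. The number of outer integument cells in klu-4 sod7-ko1 ngal3-ko1 ovules was similar to that in klu-4 ovules (Figure 7D). Similarly, the number of outer integument cells in klu-4 sod7-ko1 ngal3-ko1 seeds was similar to that in klu-4 seeds (Figure 7D). These results indicate that klu-4 is epistatic to sod7-ko1 ngal3-ko1 with respect to outer integument cell number. We also observed that cells in the outer integuments of klu-4 and klu-4 sod7-ko1 ngal3-ko1 seeds were slightly longer than those in wild-type seeds, suggesting a possible compensation mechanism between cell proliferation and cell expansion. Taken together, these findings show that SOD7 and KLU act antagonistically in a common pathway to control cell proliferation in the maternal integuments.
Given that sod7-1D was identified as a suppressor of da1-1 with respect to seed size, we further investigated whether SOD7 and DA1 act in the same genetic pathway. To this end, we measured the seed size of the wild type, da1-1, sod7-1D and sod7-1D da1-1. Compared with the sod7-1D and da1-1 single mutants, the genetic interaction between sod7-1D and da1-1 was essentially additive for seed size, indicating that SOD7 and DA1 may act independently to control seed size. We further crossed sod7-ko1 ngal3-ko1 with da1-1 to generate the sod7-ko1 ngal3-ko1 da1-1 triple mutant and measured its seed size. Compared with the parental lines, the genetic interaction between sod7-ko1 ngal3-ko1 and da1-1 was also additive for seed size, further supporting the conclusion that SOD7 and DA1 act independently to control seed growth.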
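SOD7 binds directly to the KLU promoter and represses KLU expression
Given that SOD7 and KLU act antagonistically in a common pathway to control seed size, we asked whether the transcriptional repressor SOD7 represses KLU expression. We therefore examined KLU expression in chemically inducible SOD7 (pER8-SOD7) transgenic plants. After treatment of pER8-SOD7 transgenic plants with the inducer (β-estradiol), SOD7 expression was strongly induced at 4 and 8 hours (Figure 8A). As expected, KLU expression was significantly repressed at 4 and 8 hours (Figure 8A). These results therefore indicate that SOD7 represses KLU expression and suggest that KLU may be a direct target of SOD7.
To determine whether SOD7 binds directly to the promoter of the KLU gene, we performed chromatin immunoprecipitation (ChIP) assays with 35S:GFP and 35S:GFP-SOD7 transgenic plants. The sequence CACCTG has been reported to be recognised by the B3 domain of RAV1, a member of the RAV family (Kagaya et al., 1999; Yamasaki et al., 2004). We therefore analysed the promoter sequence of KLU and found no perfect CACCTG sequence within the 2 kb promoter region. However, we found a similar sequence (CACTTG) in the KLU promoter region (Figure 8B), which might be a potential SOD7 binding site. We therefore examined by ChIP the enrichment of a KLU promoter fragment (PF1) containing the CACTTG sequence, and found that fragment PF1 was strongly enriched in DNA from chromatin immunoprecipitated with anti-GFP antibody (Figure 8B and C). By contrast, we detected no significant enrichment of an ACTIN7 promoter sequence or of KLU promoter fragment PF2, which does not contain the CACTTG sequence (Figure 8B and C). This result shows that SOD7 associates with the KLU promoter in vivo. We further expressed SOD7 as an MBP fusion protein (MBP-SOD7) and performed a DNA electrophoretic mobility shift assay (EMSA). As shown in Figure 8B and D, MBP-SOD7 bound the biotin-labelled probe A, which contains the CACTTG sequence, and binding was reduced by adding unlabelled probe A. By contrast, MBP-SOD7 did not bind probe A-m, which carries mutations in the CACTTG sequence (Figure 8B and D). Taken together, these results show that SOD7 binds directly to the KLU promoter and represses KLU expression.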
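The motif search behind this analysis can be sketched as a scan of both DNA strands for a hexamer. The sketch below uses the EMSA probe sequences listed in Table 1 (A-Biotin-FP and A-Biotin-m-FP) as inputs; the function names are hypothetical, and the KLU promoter itself is not reproduced here.

```python
# Complement table for the four DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_motif(seq: str, motif: str = "CACTTG"):
    """Return (strand, 0-based start on the given sequence) for every motif hit.

    '+' hits match the motif directly; '-' hits match its reverse complement,
    meaning the motif lies on the opposite strand at that position.
    """
    seq = seq.upper()
    hits = []
    for strand, query in (("+", motif.upper()), ("-", reverse_complement(motif))):
        start = seq.find(query)
        while start != -1:
            hits.append((strand, start))
            start = seq.find(query, start + 1)
    return hits


probe_a = "TTCTACTACACTTGCTCTCTGTA"    # A-Biotin-FP (Table 1)
probe_a_m = "TTCTACTAACACCTCTCTCTGTA"  # A-Biotin-m-FP (Table 1)

print(find_motif(probe_a))    # probe A carries the CACTTG motif
print(find_motif(probe_a_m))  # mutated probe A-m does not
```

Passing `motif="CACCTG"` gives the corresponding scan for the RAV1 site; applied to a promoter sequence, the same function would list candidate fragments such as PF1.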
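Discussion
Seed size is an important factor for plant fitness and for agronomic purposes, but little is known about the genetic and molecular mechanisms that set final seed size in plants. In this study, we show that SOD7 acts maternally to control seed size by restricting cell proliferation in the integuments of ovules and developing seeds. SOD7 encodes the B3 domain transcriptional repressor NGAL2 and acts redundantly with its close homologue NGAL3 to control seed size. Genetic analyses indicate that SOD7 acts in a common pathway with the maternal factor KLU to control seed growth, but independently of DA1. Further results show that SOD7 binds directly to the KLU promoter region and represses KLU expression. We have therefore identified SOD7 as a negative factor for seed size and defined the genetic and molecular mechanisms by which SOD7 and KLU control seed size.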
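SOD7 acts maternally to regulate seed size
The sod7-1D gain-of-function mutant was identified as a suppressor of the large-seed phenotype of da1-1. However, genetic analyses show that SOD7 acts independently of DA1 to control seed growth. The sod7-1D single mutant produces small seeds and organs (Figure 2), whereas simultaneous disruption of SOD7 and its closely related family member NGAL3 results in large seeds and organs (Figure 5), indicating that SOD7 is a negative regulator of seed and organ size. Several previous studies have suggested a possible link between seed size and organ growth. For example, the arf2, da1-1, da2-1 and eod3-1D mutants produce large seeds and organs (Schruff et al., 2006; Li et al., 2008; Fang et al., 2012; Xia et al., 2013), whereas klu and sod2/ubp15 mutants form small seeds and organs (Anastasiou et al., 2007; Adamski et al., 2009; Du et al., 2014). However, seed size and organ size are not always correlated. For example, the eod8/med25 mutant, which has larger organs, forms seeds of normal size (Xu and Li, 2011), whereas the ap2 mutant, which has organs of normal size, produces large seeds (Jofuku et al., 2005; Ohto et al., 2005). These findings therefore suggest that seeds and organs not only share common mechanisms but also have distinct pathways for controlling their respective sizes.
The reciprocal crosses show that SOD7 acts maternally to restrict seed growth, and that the endosperm and embryo phenotypes for SOD7 do not determine seed size (Figure 6). The integuments surrounding the ovule are maternal tissues and form the seed coat after fertilisation. The Arabidopsis arf2, ap2, da1-1, da2-1 and eod3-1D mutants, which have larger integuments, form larger seeds (Jofuku et al., 2005; Ohto et al., 2005; Schruff et al., 2006; Li et al., 2008; Fang et al., 2012; Xia et al., 2013), whereas the klu-4 and ubp15/sod2 mutants, which have smaller integuments, produce small seeds (Adamski et al., 2009; Du et al., 2014), indicating that the maternal integuments are a key factor in determining seed size in Arabidopsis. Consistent with this view, mature sod7-ko1 ngal3-ko1 ovules are larger than wild-type ovules (Figure 6C and D). The integument length of sod7-ko1 ngal3-ko1 ovules and developing seeds is significantly increased compared with that of wild-type ovules and seeds (Figure 6E and 7C). Given that the maternal integument or seed coat acts not only as a protective structure but also restricts seed growth, regulating the size of the maternal integuments is an important mechanism for controlling seed size. Integument size is determined by cell proliferation and cell expansion, two processes that are thought to be coordinated. The number of outer integument cells in sod7-ko1 ngal3-ko1 ovules and seeds is significantly increased compared with wild-type ovules and seeds (Figure 6F), indicating that SOD7 controls seed growth by restricting cell proliferation in the maternal integuments. Similarly, several mutants with an increased number of cells in the maternal integuments produce large seeds in Arabidopsis (Schruff et al., 2006; Li et al., 2008; Xia et al., 2013). Conversely, several other mutants with a reduced number of cells in the maternal integuments form small seeds in Arabidopsis (Adamski et al., 2009; Du et al., 2014). Given that cells in the integuments mainly undergo expansion after fertilisation (Garcia et al., 2005), the number of cells in the integuments may determine the growth potential of the seed coat after fertilisation.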
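Genetic and molecular mechanisms of SOD7 and KLU in seed size control
Like klu mutants, the sod7-1D mutant has smaller seeds and organs (Figure 2) (Anastasiou et al., 2007; Adamski et al., 2009). KLU encodes the cytochrome P450 CYP78A5, which has been proposed to generate a mobile plant growth substance (Anastasiou et al., 2007; Adamski et al., 2009). KLU regulates seed size by promoting cell proliferation in the integuments of maternal ovules (Anastasiou et al., 2007; Adamski et al., 2009). By contrast, SOD7 acts maternally to control seed size by restricting cell proliferation in the integuments of ovules and developing seeds (Figure 6). These results suggest that SOD7 acts in opposition to KLU in a common pathway to control seed size. Under our growth conditions, klu-4 formed slightly smaller seeds than the wild type because of a reduced cell number and a slightly increased cell length in the integuments of developing seeds (Figure 7A and D), suggesting a possible compensation mechanism between cell proliferation and cell elongation in klu-4 integuments. Importantly, genetic analyses show that klu-4 is epistatic to sod7-ko1 ngal3-ko1 with respect to seed and organ size (Figure 7A and B). klu-4 is also epistatic to sod7-ko1 ngal3-ko1 with respect to outer integument length (Figure 7C). Further results show that the number of cells in the outer integuments of klu-4 sod7-ko1 ngal3-ko1 ovules and developing seeds is similar to that of klu-4 ovules and developing seeds (Figure 7D). These genetic results therefore indicate that SOD7 acts in a common pathway with KLU to control seed size by regulating cell proliferation in the maternal integuments.
SOD7 encodes the B3 domain transcriptional repressor NGAL2, which is located in the nucleus of Arabidopsis cells (Figure 4M-R). It is therefore possible that SOD7 binds directly to the KLU promoter and represses KLU expression. This idea is supported by the finding that inducible expression of SOD7 leads to a significant reduction in KLU expression (Figure 8A). Our ChIP-qPCR data show that SOD7 associates with the KLU promoter region in vivo (Figure 8B and C). The EMSA experiments show that SOD7 binds directly to the CACTTG sequence in the promoter of the KLU gene (Figure 8B and D). These results therefore demonstrate that SOD7 directly targets the KLU promoter region and represses KLU expression, thereby determining seed size. Taken together, these findings reveal the genetic and molecular mechanisms by which SOD7 and KLU regulate seed size in Arabidopsis.
For many plants, seeds are the main harvested product, and an increase in seed size is beneficial to growers. In this study, we identified SOD7 as a negative regulator of seed size and showed that SOD7 controls seed size in a common genetic pathway with KLU. Current knowledge of SOD7 function suggests that the SOD7 gene (and its homologues in other plant species) could be used to engineer larger seeds in crops. Given that crop plants have undergone selection for large seed size during domestication (Fan et al., 2006; Song et al., 2007; Gegas et al., 2010), it will be a worthwhile challenge to learn whether beneficial alleles of the SOD7 gene have already been used by plant breeders.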
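Knockout experiments in rice using genome editing
Genome editing experiments to knock out Os11g0156000 and/or Os12g0157000 in rice were performed using the CRISPR-Cas9 system. Four vectors were constructed as shown in Figure 14 to achieve these knockouts, each vector having two recognition (CRISPR target) sites. In brief, the vectors were obtained as follows:
1. Identify the target sites. A target site should be 20 nucleotides (or thereabouts) immediately preceding an NGG sequence, where N is any nucleotide. The target sequence is then evaluated using the website http://cbi.hzau.edu.cn/crispr/help.php (incorporated herein by reference). Note that the target site should be unique in the genome.
2. As shown in Figure 14, join the target sequence to the U6 sequence by overlapping PCR. U6 provides transcriptional activity.
3. Use In-Fusion technology to join the U6-guide-gRNA scaffold fragment to the vector pMDC99-cas9, giving the pMDC99-cas9-U6-guide-gRNA scaffold constructs. These constructs are named zyy1, zyy2, zyy3 and zyy4. The full sequences of these constructs are shown in SEQ ID NO: 155, 156, 157 and 158, respectively. Each construct contains two recognition sites, which are highlighted in the sequence information and shown in SEQ ID NO: 159, 160, 161, 162 and 163.
4. Transform the constructs into Agrobacterium and transform rice by the Agrobacterium-mediated method to obtain gene-edited rice. Transformation of plants is a routine technique well known to the skilled person. Nonetheless, a brief overview of transformation techniques is provided above.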
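Step 1 of the procedure above (finding 20-nucleotide sites that immediately precede an NGG motif) can be sketched as a simple sequence scan. This is only an illustration: the function name and the toy input sequence are hypothetical, only the strand supplied is scanned, and the genome-wide uniqueness check that the text requires is not attempted here.

```python
import re


def find_cas9_targets(seq: str, guide_len: int = 20):
    """List candidate Cas9 target sites on the given strand.

    Returns (0-based start, protospacer, PAM) for every guide_len-nt
    stretch that is immediately followed by an NGG PAM. A lookahead is
    used so that overlapping candidates are all reported.
    """
    seq = seq.upper()
    pattern = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % guide_len)
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(seq)]


# Toy sequence (not from rice): 20 nt, then the PAM "AGG", then three trailing bases.
toy = "ACGT" * 5 + "AGG" + "TTT"
targets = find_cas9_targets(toy)
print(targets)
```

To cover both strands, the same function can be run on the reverse complement of the input; each candidate would then still need to be checked for uniqueness against the whole genome, for example with the web tool named in step 1.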
The knockout lines are analysed to assess their phenotype.
Table 1: Primers used in this study
Primer name    Primer sequence
T-DNA identification primers
SM_3_34191-LP ACCATGACATTCGAGGTTCAC(SEQ ID NO.8)
SM_3_34191-RP ATCACCACCAAAACGACGTAG(SEQ ID NO.9)
SM_3_36641-LP TACGTCATGCTTCAAATCGTG(SEQ ID NO.10)
SM_3_36641-RP AGGACACGAACAATTCATTCG(SEQ ID NO.11)
Spm32 TACGAATAAGAGCGTCCATTTTAGAGTGA(SEQ ID NO.12)
SM_3_39145-LP ACCCAAAGAACAGCAATCATG(SEQ ID NO.13)
SM_3_39145-RP AAAACACTCCGCCATTAAACC(SEQ ID NO.14)
TAIL-PCR primers
OJF22 CGAGTATCAATGGAAACTTAACCG(SEQ ID NO.15)
OJF23 AACGGAGAGTGGCTTGAGAT(SEQ ID NO.16)
OJF24 TGGCCCTTATGGTTTCTGCA(SEQ ID NO.17)
AD1 NTCGA(G/C)T(A/T)T(G/C)G(A/T)GTT(SEQ ID NO.18)
Construct primers
SOD7CDS-F ATGTCAGTCAACCATTACCAC(SEQ ID NO.19)
SOD7CDS-R CAGGTAGGAGATGGACGAGGTTGA(SEQ ID NO.20)
SOD7G-F TGAGAGGAACCATTTCTTAGAGG(SEQ ID NO.21)
SOD7G-R ACCTCGTCCATCTCCTACCTGC(SEQ ID NO.22)
SOD7P-F AAACACGTCAAATATAACGAAT(SEQ ID NO.23)
SOD7P-R CTTTTTTTTGGTTTCTTGGAGTGAGAGAGAGAG(SEQ ID NO.24)
SOD7-ER-F AGTCTGGGCCCATGTCAGTCAACCATTAC(SEQ ID NO.25)
SOD7-ER-R GCGACTAGTTTATAAAAGAGTTAAAATTA(SEQ ID NO.26)
MBP-SOD7-FP CGGGATCCTCAGTCAACCATTACC(SEQ ID NO.27)
MBP-SOD7-RP ACTAGTCGACTCAACCTCGTCCATCTCC(SEQ ID NO.28)
RT-PCR and qRT-PCR primers
ACTIN2-F GAAATCACAGCACTTGCACC(SEQ ID NO.29)
ACTIN2-R AAGCCTTTGATCTTGAGAGC(SEQ ID NO.30)
SOD7-EX-F GCGACGACGGAGAAAGGG(SEQ ID NO.31)
SOD7-EX-R ACGACGGCGCCATAGTGT(SEQ ID NO.32)
NGAL3-EX-F TTTGAAGACGAGTCAGGCAAGT(SEQ ID NO.33)
NGAL3-EX-R TACGGCGGCTCCATAGTGGG(SEQ ID NO.34)
SOD7-q-FP GTATTGGAGCGGCTTGACTACACC(SEQ ID NO.35)
SOD7-q-RP GACGGCATCACCATGACATTCG(SEQ ID NO.36)
KLU-q-FP TGATTCTGACATGATTGCTGTTCT(SEQ ID NO.37)
KLU-q-RP TCGCAACTGTATCTGTCCCTCTA(SEQ ID NO.38)
ChIP assay primers
ACTIN7-ChIP-FP CGTTTCGCTTTCCTTAGTGTTAGCT(SEQ ID NO.39)
ACTIN7-ChIP-RP AGCGAACGGATCTAGAGACTCACCTTG(SEQ ID NO.40)
PF1-F CAGGCCTAAGCCTAACAGTAGAC(SEQ ID NO.41)
PF1-R TGTACTAGGATTTATTTACGTAG(SEQ ID NO.42)
PF2-F TATTGTTCATAGAAACCCTGCAAA(SEQ ID NO.43)
PF2-R AGTCAATGGTTTAATGGCGGAGTG(SEQ ID NO.44)
EMSA probes
A-Biotin-FP TTCTACTACACTTGCTCTCTGTA(SEQ ID NO.45)
A-Biotin-RP TACAGAGAGCAAGTGTAGTAGAA(SEQ ID NO.46)
A-Biotin-m-FP TTCTACTAACACCTCTCTCTGTA(SEQ ID NO.47)
A-Biotin-m-RP TACAGAGAGAGGTGTTAGTAGAA(SEQ ID NO.48)
References
Adamski, N.M., Anastasiou, E., Eriksson, S., O'Neill, C.M., and Lenhard, M. (2009). Local maternal control of seed size by KLUH/CYP78A5-dependent growth signaling. Proceedings of the National Academy of Sciences of the United States of America 106, 20115-20120.
Alvarez, J.P., Goldshmidt, A., Efroni, I., Bowman, J.L., and Eshed, Y. (2009). The NGATHA distal organ development genes are essential for style specification in Arabidopsis. Plant Cell 21, 1373-1393.
Anastasiou, E., Kenz, S., Gerstung, M., MacLean, D., Timmer, J., Fleck, C., and Lenhard, M. (2007). Control of plant organ size by KLUH/CYP78A5-dependent intercellular signaling. Developmental Cell 13, 843-856.
Cheng, Z.J., Zhao, X.Y., Shao, X.X., Wang, F., Zhou, C., Liu, Y.G., Zhang, Y., and Zhang, X.S. (2014). Abscisic Acid Regulates Early Seed Development in Arabidopsis by ABI5-Mediated Transcription of SHORT HYPOCOTYL UNDER BLUE1. Plant Cell 26, 1053-1068.
Du, L., Li, N., Chen, L., Xu, Y., Li, Y., Zhang, Y., and Li, C. (2014). The Ubiquitin Receptor DA1 Regulates Seed and Organ Size by Modulating the Stability of the Ubiquitin-Specific Protease UBP15/SOD2 in Arabidopsis. Plant Cell 26, 665-677.
Engelhorn, J., Reimer, J.J., Leuz, I., Gobel, U., Huettel, B., Farrona, S., and Turck, F. (2012). Development-related PcG target in the apex 4 controls leaf margin architecture in Arabidopsis thaliana. Development 139, 2566-2575.
Fan, C., Xing, Y., Mao, H., Lu, T., Han, B., Xu, C., Li, X., and Zhang, Q. (2006). GS3, a major QTL for grain length and weight and minor QTL for grain width and thickness in rice, encodes a putative transmembrane protein. Theor Appl Genet 112, 1164-1171.
Fan, J., Hill, L., Crooks, C., Doerner, P., and Lamb, C. (2009). Abscisic acid has a key role in modulating diverse plant-pathogen interactions. Plant Physiology 150, 1750-1761.
Fang, W., Wang, Z., Cui, R., Li, J., and Li, Y. (2012). Maternal control of seed size by EOD3/CYP78A6 in Arabidopsis thaliana. Plant J 70, 929-939.
Garcia, D., Fitz Gerald, J.N., and Berger, F. (2005). Maternal control of integument cell elongation and zygotic control of endosperm growth are coordinated to determine seed size in Arabidopsis. Plant Cell 17, 52-60.
Garcia, D., Saingery, V., Chambrier, P., Mayer, U., Jurgens, G., and Berger, F. (2003). Arabidopsis haiku mutants reveal new controls of seed size by endosperm. Plant Physiology 131, 1661-1670.
Gegas, V.C., Nazari, A., Griffiths, S., Simmonds, J., Fish, L., Orford, S., Sayers, L., Doonan, J.H., and Snape, J.W. (2010). A genetic framework for grain size and shape variation in wheat. Plant Cell 22, 1046-1056.
Gendrel, A.V., Lippman, Z., Martienssen, R., and Colot, V. (2005). Profiling histone modification patterns in plants using genomic tiling microarrays. Nat Methods 2, 213-218.
Harper, J.L., Lovell, P.H., and Moore, K.G. (1970). The Shapes and Sizes of Seeds. Annual Review of Ecology and Systematics 1, 327-356.
Ikeda, M., and Ohme-Takagi, M. (2009). A novel group of transcriptional repressors in Arabidopsis. Plant & Cell Physiology 50, 970-975.
Jofuku, K.D., Omidyar, P.K., Gee, Z., and Okamuro, J.K. (2005). Control of seed mass and seed yield by the floral homeotic gene APETALA2. Proceedings of the National Academy of Sciences of the United States of America 102, 3117-3122.
Kagaya, Y., Ohmiya, K., and Hattori, T. (1999). RAV1, a novel DNA-binding protein, binds to bipartite recognition sequence through two distinct DNA-binding domains uniquely found in higher plants. Nucleic Acids Res 27, 470-478.
Kang, X., Li, W., Zhou, Y., and Ni, M. (2013). A WRKY transcription factor recruits the SYG1-like protein SHB1 to activate gene expression and seed cavity enlargement. PLoS Genet 9, e1003347.
Li, J., Nie, X., Tan, J.L., and Berger, F. (2013). Integration of epigenetic and genetic controls of seed size by cytokinin in Arabidopsis. Proceedings of the National Academy of Sciences of the United States of America 110, 15479-15484.
Li, Y., Zheng, L., Corke, F., Smith, C., and Bevan, M.W. (2008). Control of final seed and organ size by the DA1 gene family in Arabidopsis thaliana. Genes Dev 22, 1331-1336.
Liu, Y.G., Mitsukawa, N., Oosumi, T., and Whittier, R.F. (1995). Efficient isolation and mapping of Arabidopsis thaliana T-DNA insert junctions by thermal asymmetric interlaced PCR. Plant J 8, 457-463.
Lopes, M.A., and Larkins, B.A. (1993). Endosperm origin, development, and function. Plant Cell 5, 1383-1399.
Luo, M., Dennis, E.S., Berger, F., Peacock, W.J., and Chaudhury, A. (2005). MINISEED3 (MINI3), a WRKY family gene, and HAIKU2 (IKU2), a leucine-rich repeat (LRR) KINASE gene, are regulators of seed size in Arabidopsis. Proceedings of the National Academy of Sciences of the United States of America 102, 17531-17536.
Moles, A.T., Ackerly, D.D., Webb, C.O., Tweddle, J.C., Dickie, J.B., and Westoby, M. (2005). A brief history of seed size. Science 307, 576-580.
Ohto, M.A., Fischer, R.L., Goldberg, R.B., Nakamura, K., and Harada, J.J. (2005). Control of seed mass by APETALA2. Proceedings of the National Academy of Sciences of the United States of America 102, 3123-3128.
Ohto, M.A., Floyd, S.K., Fischer, R.L., Goldberg, R.B., and Harada, J.J. (2009). Effects of APETALA2 on embryo, endosperm, and seed coat development determine seed size in Arabidopsis. Sex Plant Reprod 22, 277-289.
Orsi, C.H., and Tanksley, S.D. (2009). Natural variation in an ABC transporter gene associated with seed size evolution in tomato species. PLoS Genet 5, e1000347.
Schruff, M.C., Spielman, M., Tiwari, S., Adams, S., Fenby, N., and Scott, R.J. (2006). The AUXIN RESPONSE FACTOR 2 gene of Arabidopsis links auxin signalling, cell division, and the size of seeds and other organs. Development 133, 251-261.
Scott, R.J., Spielman, M., Bailey, J., and Dickinson, H.G. (1998). Parent-of-origin effects on seed development in Arabidopsis thaliana. Development 125, 3329-3341.
Smaczniak, C., Immink, R.G., Muino, J.M., Blanvillain, R., Busscher, M., Busscher-Lange, J., Dinh, Q.D., Liu, S., Westphal, A.H., Boeren, S., Parcy, F., Xu, L., Carles, C.C., Angenent, G.C., and Kaufmann, K. (2012). Characterization of MADS-domain transcription factor complexes in Arabidopsis flower development. Proceedings of the National Academy of Sciences of the United States of America 109, 1560-1565.
Song, X.J., Huang, W., Shi, M., Zhu, M.Z., and Lin, H.X. (2007). A QTL for rice grain width and weight encodes a previously unknown RING-type E3 ubiquitin ligase. Nat Genet 39, 623-630.
Swaminathan, K., Peterson, K., and Jack, T. (2008). The plant B3 superfamily. Trends Plant Sci 13, 647-655.
Trigueros, M., Navarrete-Gomez, M., Sato, S., Christensen, S.K., Pelaz, S., Weigel, D., Yanofsky, M.F., and Ferrandiz, C. (2009). The NGATHA genes direct style development in the Arabidopsis gynoecium. Plant Cell 21, 1394-1409.
Wang, A., Garcia, D., Zhang, H., Feng, K., Chaudhury, A., Berger, F., Peacock, W.J., Dennis, E.S., and Luo, M. (2010). The VQ motif protein IKU1 regulates endosperm growth and seed size in Arabidopsis. Plant J 64, 670-679.
Westoby, M., Falster, D.S., Moles, A.T., Vesk, P.A., and Wright, I.J. (2002). PLANT ECOLOGICAL STRATEGIES: Some Leading Dimensions of Variation Between Species. Annual Review of Ecology and Systematics 33, 125-159.
Xia, T., Li, N., Dumenil, J., Li, J., Kamenski, A., Bevan, M.W., Gao, F., and Li, Y. (2013). The Ubiquitin Receptor DA1 Interacts with the E3 Ubiquitin Ligase DA2 to Regulate Seed and Organ Size in Arabidopsis. Plant Cell 25, 3347-3359.
Xiao, W., Brown, R.C., Lemmon, B.E., Harada, J.J., Goldberg, R.B., and Fischer, R.L. (2006). Regulation of seed size by hypomethylation of maternal and paternal genomes. Plant Physiology 142, 1160-1168.
Xu, R., and Li, Y. (2011). Control of final organ size by Mediator complex subunit 25 in Arabidopsis thaliana. Development 138, 4545-4554.
Yamasaki, K., Kigawa, T., Inoue, M., Tateno, M., Yamasaki, T., Yabuki, T., Aoki, M., Seki, E., Matsuda, T., Tomo, Y., Hayami, N., Terada, T., Shirouzu, M., Osanai, T., Tanaka, A., Seki, M., Shinozaki, K., and Yokoyama, S. (2004). Solution structure of the B3 DNA binding domain of the Arabidopsis cold-responsive transcription factor RAV1. Plant Cell 16, 3448-3459.
Zhou, Y., Zhang, X., Kang, X., Zhao, X., and Ni, M. (2009). SHORT HYPOCOTYL UNDER BLUE1 associates with MINISEED3 and HAIKU2 promoters in vivo to regulate Arabidopsis seed development. Plant Cell 21, 106-117.
Sequence information
The identity of each homologue to NGAL2 is shown.
AtSOD7 nucleic acid sequence SEQ ID NO.1 (cDNA) At3g11580
ATGTCAGTCAACCATTACCACAACACTCTCTCGTTGCATCATCACCACCAAAACGACGTAGCTATAGCACAACGAGAGTCTTTGTTCGAGAAATCACTCACACCAAGCGACGTCGGAAAGCTAAACCGCTTAGTCATACCAAAACAACACGCCGAGAAATACTTCCCTCTCAATAATAATAATAATAATGGCGGCAGCGGAGATGACGTGGCGACGACGGAGAAAGGGATGCTTCTTAGCTTCGAGGATGAGTCAGGCAAGTGTTGGAAATTCAGATACTCTTATTGGAACAGTAGCCAAAGCTACGTGTTGACCAAAGGATGGAGCAGGTACGTCAAAGACAAACACCTCGACGCAGGCGACGTTGTTTTCTTTCAACGTCACCGTTTTGATCTCCATAGACTCTTCATTGGCTGGCGGAGACGCGGTGAAGCTTCTTCCTCTCCCGCTGTCTCCGTTGTGTCTCAAGAAGCTCTAGTTAATACGACGGCGTATTGGAGCGGCTTGACTACACCTTATCGTCAAGTACACGCGTCAACTACTTACCCTAATATTCACCAAGAGTATTCACACTATGGCGCCGTCGTTGATCATGCTCAGTCGATACCACCGGTGGTCGCAGGTAGCTCGAGGACGGTGAGGCTTTTTGGCGTGAACCTCGAATGTCATGGTGATGCCGTCGAGCCACCACCGCGTCCTGATGTCTATAATGACCAACACATTTACTATTACTCAACTCCTCATCCCATGAATATATCATTTGCTGGGGAAGCATTGGAGCAGGTAGGAGATGGACGAGGTTGA
AtSOD7 nucleic acid SEQ ID NO.2 (genomic DNA)
ttgtttcggctatttgttatactattgttataacagtcacaagacttgacctcaacgaaaacttttacaaaacgtgaattggaaatttttacaaaatatgctcttaatcgttaatgcttcccaattaggtgagttaaattgtgagaggaaccatttcttagaggaaatggttcatgaaaacaaatatgaaatagtatcactagtcttagttttgcgagaaaattaggaaaaatagaaacgtgtaagcaccaatgatattcctgaaagcacgtgacagatatttcatgatcctataattaacaagtgataaagatattaaataaaattaacgatacttgagaaattcgtcaaataaaatagaagaggaccactcacgtaaccatttgcacgtcccattgatttttgtggtagacttggtatgttatattacttatattcacagaattatatacgaaactcacgacttaagatgcacggtaataactacagatggaaatttacccatcaaacaagaaaacaacatttactcaagcatctagctagaccaaaatgtttgtttacttgttgacttgcgatccatagatatattagttagaactttttcttctacaattgatcaaatgtttcacactgttctcaatttctcatctagattcatgacttatatgtttggtcaaatatcacagcttgatgagcattaaatagcgtcgaagtataggatggttacgttgttcaatattgtaaaggaaaaaaagagaaagagtgccaaaaggtcaagtcgatttcacaaataaatcttgaagtctttatccctctcgattataaaatgattaggaaaagaaaaagagagaataaaatgtagataaagagaaagagaaagagagagaggaacataagggatggtatgaagtagaagtgaagatgcatgcgatggtgtgtcggaaaggcaaagcacatgctacacaacttgagcttctcacttgcgtcagggataagtatcctctgtaccttcttacttttgcgtaatatgtaccacctcacttctcaaccgtttgatctttaatccttcattatttcttcattaccttctctttttgtttttgttttcgttttcaatttctcatagattcatttacaaactaaatatcataggaaggtgttatctctagttaatttcttatcctactttaacaaaatttaattgtcaaaagattatttttacgtttatagacaaaagatactgacacatcaattccacgaaccaaatggttgagaaaaacaaaacgactatctttgtcttgcaaataaattaatggcagttagtaagattctcagctgaaaattcatacaagagtaaatgatcaaataaccatttatgagagaaatttaatccttcagaaaccaatgaggatctgatcaagtaattgcaaaccacatgagtccatgataaaggattgtttgacttacgcaatccacatatttatggctgcttgatatgtaaggtttatctgctttgacagtctatagaatcttgctaatcaatacgtcatatccggtgaatactgaaacttttttaattaagaaaacacaaatcatcttttctccggaggatttcgaatttagttccggcaatgctgaaataacatatgttgaacttataacattccaagacatcaaattttactaatatataaataattacatattcttcttctacatgatcaaaaccttttcaactttaattaaagggttacgtcgcggcgttttgtgtggcttactctttttttacactataactatagaacactcgtggatccaatgccgtttaggacaagattttatcagacgagaaaaaaaaaaacaataccacatttttaaatatatatggattatggactgcaacaacaatatagaaaagaagagaaaaaaataaaaataatgattgaaaggaaatatcatcacgcaaaaccttaaaagtactatcggtatcgtgtcgtcct
ctcctcatcaaatagttcccacagttttcacatcaatttaaccattttcaatttttttcactctctgtctctctcctttgtataatactatattagtaccattacccatctctctttcaccaccaaaccaacacctgcaaatcctctctctctctctcactccaagaaaccaaaaaaaaagATGTCAGTCAACCATTACCACAACACTCTCTCGTTGCATCATCACCACCAAAACGACGTAGCTATAGCACAACGAGAGTCTTTGTTCGAGAAATCACTCACACCAAGCGACGTCGGAAAGCTAAACCGCTTAGTCATACCAAAACAACACGCCGAGAAATACTTCCCTCTCAATAATAATAATAATAATGGCGGCAGCGGAGATGACGTGGCGACGACGGAGAAAGGGATGCTTCTTAGCTTCGAGGATGAGTCAGGCAAGTGTTGGAAATTCAGATACTCTTATTGGAACAGTAGCCAAAGCTACGTGTTGACCAAAGGATGGAGCAGGTACGTCAAAGACAAACACCTCGACGCAGGCGACGTTGTTTTCTTTCAACGTCACCGTTTTGATCTCCATAGACTCTTCATTGGCTGGCGGAGACGCGGTGAAGCTTCTTCCTCTCCCGCTGTCTCCGTTGTGTCTCAAGAAGCTCTAGTTAATACGACGGCGTATTGGAGCGGCTTGACTACACCTTATCGTCAAGTACACGCGTCAACTACTTACCCTAATATTCACCAAGAGTATTCACACTATGgtaaattcaaaccctttatttcctcttttgttttttctttctctcttatctatatgtcagatttatactcctctctgttctcttttaagatttgtctttttcataaaaatagatgattcgtaatttgtattgcatatttacatgttctcttaaaaaaagtaatagagattaatattttatgcatggtattttagattatctgcctactttatatggtagtaaacaagaacattcatctttatttggttttataaacaaaatatgagaatttttaaaggttagggcaagcacttggaaagctcaaccattttagttagctggtggaatatctttcttataaaaagcaaatgagttatctaaaactatatgacaattattttagttgcgtgtgtaatgtatataaaataacaacatgaaataacattttgtcttttatttttgtcattcttattatttaattttggacccgacaatttcaaataatcttctccaagttgtaactaatccgttacatgcgcgtgaggagaaccgtccaatccacttagactaacgtgccctttatttcttccttttaattctatgttaaaaaaacaatttaactaaaagatgcgcacgtgtcttgacggtggaaaaaaattgtagGCGCCGTCGTTGATCATGCTCAGTCGATACCACCGGTGGTCGCAGGTAGCTCGAGGACGGTGAGGCTTTTTGGCGTGAACCTCGAATGTCATGGTGATGCCGTCGAGCCACCACCGCGTCCTGATGTCTATAATGACCAACACATTTACTATTACTCAACTCCTCATCCCATGgtaaatattttttttttttacatttttgtcagattcaaatttttgcttacgtatgatataattattaaacagatgtcgtggctgtttctcgagacgagacagatgaaaattagtaattttaaaatagacctgaaagagatttttatgtttaataaattatataaaggaggaatcagagagaataatactatacacttgactgtaaaaccacatggccaatttggtttttatttgattactttgatttgttttgtttactcttttgtctctgtagcctccttttgttcattaattaatatcagccgtaagtatatagtttcctgtgaaaacagtctctattttggttttactattctaatttgttaggcaccgtcagttttttttgtgaaaccaaattattgactaataagctggaaagcaaaactga
ctaaaagcattacaaacttatcaatgacataagttttgaatttattaccatgttttgtaatgttcagatataatttgaaatgcttagaattatatatttgtatacttaaattaatgaaataaagtgaatactaaagatagttttatttttcatattattctatacaattcggtgtacaatttgtttttgatgataataaaaataataaaattgcgtgttggaattgtgaaacagAATATATCATTTGCTGGGGAAGCATTGGAGCAGGTAGGAGATGGACGAGGT
AtNGAL2 SEQ ID NO.3 (protein encoded by AtSOD7)
MSVNHYHNTLSLHHHHQNDVAIAQRESLFEKSLTPSDVGKLNRLVIPKQHAEKYFPLNNNNNNGGSGDDVATTEKGMLLSFEDESGKCWKFRYSYWNSSQSYVLTKGWSRYVKDKHLDAGDVVFFQRHRFDLHRLFIGWRRRGEASSSPAVSVVSQEALVNTTAYWSGLTTPYRQVHASTTYPNIHQEYSHYGAVVDHAQSIPPVVAGSSRTVRLFGVNLECHGDAVEPPPRPDVYNDQHIYYYSTPHPMNISFAGEALEQVGDGRG
AtNGAL3 nucleic acid sequence SEQ ID NO.4 (cDNA) At5g06250
ATGTCAGTCAACCATTACTCCACAGACCACCACCACACTCTCTTGTGGCAGCAACAGCAACACCGCCACACCACCGACACATCGGAGACAACCACCACCGCCACATGGCTCCACGACGACCTAAAAGAGTCACTCTTCGAGAAGTCTCTCACACCAAGCGACGTCGGGAAACTCAACCGCCTCGTCATACCAAAACAACACGCAGAGAAATACTTCCCTCTCAATGCCGTCCTAGTCTCCTCTGCTGCTGCTGACACGTCATCTTCGGAGAAAGGGATGCTTCTAAGCTTTGAAGACGAGTCAGGCAAGTCATGGAGGTTCAGATACTCTTACTGGAACAGCAGTCAAAGCTATGTCTTGACTAAAGGATGGAGCAGATTTGTCAAAGACAAACAGCTCGATCCAGGCGACGTTGTTTTCTTCCAACGACACCGTTCTGATTCTAGGAGACTCTTCATTGGCTGGCGCAGACGTGGACAAGGCTCCTCATCCTCCGTCGCGGCCACTAACTCCGCCGTGAATACGAGTTCTATGGGAGCTCTTTCTTATCATCAAATCCACGCCACTAGTAATTACTCTAATCCTCCCTCTCACTCAGAGTATTCCCACTATGGAGCCGCCGTAGCAACAGCGGCTGAGACTCACAGCACACCGTCGTCTTCCGTCGTCGGGAGCTCAAGGACGGTGAGGCTTTTCGGTGTGAATCTGGAGTGTCAAATGGATGAAAACGACGGAGATGATTCTGTTGCAGTTGCCACCACCGTTGAATCTCCCGACGGTTACTACGGCCAAAACATGTACTATTATTACTCTCATCCTCATAACATGGTAATTTTAACTCTTTTATAA
AtNGAL3 amino acid sequence SEQ ID NO.5
MSVNHYSTDHHHTLLWQQQQHRHTTDTSETTTTATWLHDDLKESLFEKSLTPSDVGKLNRLVIPKQHAEKYFPLNAVLVSSAAADTSSSEKGMLLSFEDESGKSWRFRYSYWNSSQSYVLTKGWSRFVKDKQLDPGDVVFFQRHRSDSRRLFIGWRRRGQGSSSSVAATNSAVNTSSMGALSYHQIHATSNYSNPPSHSEYSHYGAAVATAAETHSTPSSSVVGSSRTVRLFGVNLECQMDENDGDDSVAVATTVESPDGYYGQNMYYYYSHPHNMVILTLL
Oryza sativa
Os12g0157000 LOC_Os12g06080.1
Coverage 73%, identity 53%
SEQ ID NO:49
MAMHAGHAWWGVAMYTNHYHHHYRHKTSDVGKNRVKHARYGGGDSGKGSDSGKWRRYSYWTSSSYVTKGWSRYVKKRDAGDVVHRVRGGAADRGCRRRGSAAAVRVTANGGWSMCYSTSGSSYDTSANSYAYHRSVDDHSDHAGSRADAKSSSAASASRRRGVNDCGADATAMYGYMHHSYAAVSTVNYWSV
CDS SEQ ID NO:50
ATGGCCATGCACCCTCTCGCCCAGGGGCACCCCCAGGCGTGGCCATGGGGTGTAGCCATGTACACCAACCTGCACTACCACCACCACTACGAGAGGGAGCACCTGTTCGAGAAGCCGCTGACGCCGAGCGACGTCGGCAAGCTCAACAGGCTGGTGATCCCCAAGCAGCACGCCGAGAGGTACTTCCCGCTCGGCGGCGGCGACTCCGGTGAGAAGGGCCTCCTCCTCTCCTTCGAGGACGAGTCCGGCAAGCCATGGCGGTTCCGCTACTCCTACTGGACCAGCAGCCAGAGCTACGTGCTCACCAAGGGCTGGAGCCGCTACGTCAAGGAGAAGCGCCTCGACGCCGGCGACGTCGTCCACTTCGAGCGCGTCCGCGGCCTCGGCGCCGCCGACCGCCTCTTCATCGGCTGCAGGCGCCGCGGCGAGAGCGCGCCCGCGCCGCCGCCCGCCGTTCGCGTCACGCCGCAGCCGCCTGCCCTCAACGGCGGCGAGCAGCAGCCGTGGAGCCCAATGTGTTACAGCACGTCGGGCTCGTCCTACGACCCTACCAGCCCTGCCAATTCATATGCCTACCATCGCTCCGTAGACCAAGATCACAGCGACATACTACACGCAGGAGAGTCGCAGAGAGAAGCAGACGCCAAGAGCAGCAGCGCGGCGTCGGCGCCGCCGCCGTCGAGGCGGCTCAGGCTGTTCGGCGTTAACCTCGACTGCGGCCCGGAGCCGGAGGCGGATCAGGCGACGGCAATGTACGGCTACATGCACCACCAGAGCCCCTACGCCGCAGTGTCTACAGTGCCAAATTACTGGTCAGTATTTTTTCAGTTTTAA
Os11g0156000
LOC_Os11g05740.1
Coverage 81%, identity 47%
SEQ ID NO:51
MAMNHPLFSQEQPQSWPWGVAMYANFHYHHHYEKEHMFEKPLTPSDVGKLNRLVIPKQHAERYFPLGAGDAADKGLILSFEDEAGAPWRFRYSYWTSSQSYVLTKGWSRYVKEKRLDAGDVVHFERVRGSFGVGDRLFIGCRRRGDAAAAQTPAPPPAVRVAPAAQNAGEQQPWSPMCYSTSGGGSYPTSPANSYAYRRAADHDHGDMHHADESPRDTDSPSFSAGSAPSRRLRLFGVNLDCGPEPEADTTAAATMYGYMHQQSSYAAMSAVPSYWGNS
CDS SEQ ID NO:52
ATGGCCATGAACCACCCTCTCTTCTCCCAGGAGCAACCCCAGTCCTGGCCATGGGGTGTGGCCATGTACGCCAACTTCCACTACCACCACCACTACGAGAAGGAGCACATGTTTGAGAAGCCCCTGACGCCCAGTGACGTGGGGAAGCTGAACCGGCTGGTGATCCCCAAGCAGCACGCCGAGAGGTACTTCCCCCTCGGCGCCGGCGACGCCGCCGACAAGGGCCTGATCCTGTCGTTCGAGGACGAGGCCGGCGCGCCGTGGCGGTTCAGGTACTCCTACTGGACGAGCAGCCAGAGCTACGTGCTCACCAAGGGCTGGAGCCGCTACGTCAAGGAGAAGCGCCTCGACGCCGGCGACGTCGTGCACTTCGAGAGGGTGCGCGGCTCCTTCGGCGTCGGCGACCGTCTCTTCATCGGCTGCAGGCGCCGCGGCGACGCCGCCGCCGCGCAAACACCCGCACCGCCGCCCGCCGTGCGCGTCGCCCCGGCTGCACAGAACGCCGGCGAGCAGCAGCCGTGGAGCCCAATGTGTTACAGCACGTCGGGCGGCGGCTCATACCCTACCAGCCCAGCCAACTCCTACGCCTACCGCCGCGCAGCAGATCATGATCACGGGGACATGCACCATGCAGACGAGTCTCCGCGCGACACGGACAGCCCAAGCTTCAGTGCAGGCTCGGCGCCATCGAGGCGGCTCAGGCTGTTCGGCGTCAACCTCGACTGCGGGCCAGAGCCGGAGGCAGACACCACGGCAGCGGCAACAATGTACGGCTACATGCACCAGCAGAGCTCCTATGCTGCCATGTCTGCAGTACCCAGTTACTGGGGCAATTCATAA
Os02g0683500 LOC_Os02g45850
Coverage 47%, identity 62%
SEQ ID NO:53
MEFTTSSRFSKEEEDEEQDEAGRREIPFMTATAEAAPAPTSSSSSPAHHAASASASASASGSSTPFRSDDGAGASGSGGGGGGGGEAEVVEKEHMFDKVVTPSDVGKLNRLVIPKQYAEKYFPLDAAANEKGLLLNFEDRAGKPWRFRYSYWNSSQSYVMTKGWSRFVKEKRLDAGDTVSFSRGIGDEAARHRLFIDWKRRADTRDPLRLPRGLPLPMPLTSHYAPWGIGGGGGFFVQPSPPATLYEHRLRQGLDFRAFNPAAAMGRQVLLFGSARIPPQAPLLARAPSPLHHHYTLQPSGDGVRAAGSPVVLDSVPVIESPTTAAKRVRLFGVNLDNPHAGGGGGAAAGESSNHGNALSLQTPAWMRRDPTLRLLELPPHHHHGAESSAASSPSSSSSSKRDAHSALDLDL
CDS SEQ ID NO:54
ATGGAGTTCACTACAAGCAGTAGGTTTTCTAAAGAAGAGGAGGACGAGGAGCAGGATGAGGCGGGAAGGCGAGAGATCCCCTTCATGACGGCCACGGCCGAAGCCGCGCCTGCGCCCACGTCGTCGTCGTCGTCTCCTGCTCATCACGCGGCTTCCGCGTCGGCGTCGGCGTCTGCGTCAGGGAGCAGCACTCCCTTTCGCTCCGACGATGGCGCCGGGGCGTCTGGGAGCGGCGGCGGCGGCGGCGGCGGCGGAGAAGCGGAGGTGGTGGAGAAGGAGCACATGTTCGACAAGGTGGTGACGCCGAGCGACGTTGGGAAGCTGAACCGGCTGGTGATCCCGAAGCAGTACGCCGAGAAGTACTTCCCGCTGGACGCGGCGGCGAACGAGAAGGGCCTCCTGCTCAACTTCGAGGACCGCGCGGGGAAGCCATGGCGGTTCCGCTACTCCTACTGGAACAGCAGCCAGAGCTACGTGATGACCAAGGGGTGGAGCCGCTTCGTCAAGGAGAAGCGCCTCGACGCCGGGGACACCGTCTCCTTCTCCCGCGGCATCGGCGACGAGGCGGCGCGGCACCGCCTCTTCATCGACTGGAAGCGCCGCGCCGACACCCGCGACCCGCTCCGGCTGCCCCGCGGGCTGCCGCTCCCGATGCCGCTCACGTCGCACTACGCCCCGTGGGGGATCGGCGGCGGAGGGGGATTCTTCGTGCAGCCCTCGCCGCCGGCCACGCTCTACGAGCACCGCCTCAGGCAAGGCCTCGACTTCCGCGCCTTCAACCCCGCCGCCGCGATGGGGAGGCAGGTCCTCCTGTTCGGCTCGGCGAGGATTCCTCCGCAAGCACCACTGCTGGCGCGCGCGCCGTCGCCGCTGCACCACCACTACACGCTGCAGCCGAGCGGCGATGGTGTAAGGGCGGCGGGCTCACCGGTGGTGCTCGACTCGGTTCCGGTCATCGAGAGCCCCACGACGGCCGCGAAGCGCGTGCGGCTGTTCGGCGTGAACCTCGACAACCCGCATGCCGGCGGCGGCGGCGGCGCCGCCGCCGGCGAGTCGAGCAATCATGGCAATGCACTGTCATTGCAGACGCCCGCGTGGATGAGGAGGGATCCAACACTGCGGCTGCTGGAATTGCCTCCTCACCACCACCATGGCGCCGAGTCGTCCGCTGCATCGTCTCCGTCGTCGTCGTCTTCCTCC
AAGAGGGACGCGCATTCGGCCTTGGATCTCGATCTGTAG
Os04g0581400 LOC_Os04g49230
Coverage 46%, identity 64%
CDS SEQ ID NO:55
ATGGAGTTTGCTACAACGAGTAGTAGGTTTTCCAAGGAAGAGGAGGAGGAGGAGGAAGGGGAACAGGAGATGGAGCAGGAGCAGGATGAAGAGGAGGAGGAGGCGGAGGCCTCGCCCCGCGAGATCCCCTTCATGACGTCGGCGGCGGCGGCGGCCACCGCCTCATCGTCCTCCCCGACATCGGTCTCCCCTTCCGCCACCGCTTCCGCGGCGGCGTCCACGTCGGCGTCGGGCTCTCCCTTCCGGTCGAGCGACGGTGCGGGAGCGTCGGGGAGTGGCGGCGGCGGTGGCGGCGAGGACGTGGAGGTGATCGAGAAGGAGCACATGTTCGACAAGGTGGTGACGCCGAGCGACGTGGGGAAGCTGAACCGGCTGGTGATCCCGAAGCAGCACGCCGAGAAGTACTTCCCGCTGGACTCGGCGGCGAACGAGAAGGGCCTTCTCCTCAGCTTCGAGGACCGAACCGGCAAGCTATGGCGCTTCCGCTACTCCTACTGGAACAGCAGCCAGAGCTACGTCATGACCAAGGGTTGGAGCCGCTTCGTCAAGGAGAAGCGCCTCGACGCCGGGGACACCGTCTCCTTCTGCCGCGGCGCCGCCGAGGCCACCCGCGACCGCCTCTTCATCGACTGGAAGCGCCGCGCCGACGTCCGCGACCCGCACCGCTTCCAGCGCCTACCGCTCCCCATGACCTCGCCCTACGGCCCGTGGGGCGGCGGCGCGGGCGCTTCTTCATGCCGCCCGCGCCGCCCGCCACGCTCTACGAGCATCACCGCTTTCGCCAGGGCTTCGACTTCCGCAACATCAACCCCGCTGTGCCGGCGAGGCAGCTCGTCTTCTTCGGCTCCCCAGGGACGGGGATTCATCAGCACCCGCCCTTGCCACCGCCGCCGTCGCCACCTCCGCCTCCTCACCAACTCCACATTACGGTGCACCACCCGAGCCCCGTAG
SEQ ID NO:56
MEFATTSSRFSKEEEEEEEGEQEMEQEQDEEEEEAEASPREIPFMTSAAAAATASSSSPTSVSPSATASAAASTSASGSPFRSSDGAGASGSGGGGGGEDVEVIEKEHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSAANEKGLLLSFEDRTGKLWRFRYSYWNSSQSYVMTKGWSRFVKEKRLDAGDTVSFCRGAAEATRDRLFIDWKRRADVRDPHRFQRLPLPMTSPYGPWGGGAGASSCRPRRPPRSTSITAFARASTSATSTPLCRRGSSSSSAPQGRGFISTRPCHRRRRHLRLLTNSTLRCTTRAP
Os03g0120900 LOC_Os03g02900
Coverage 47%, identity 63%
CDS SEQ ID NO:57
ATGGAGTTCATCACGCCAATCGTGAGGCCGGCATCGGCGGCGGCGGGCGGCGGCGAGGTGCAGGAGAGTGGTGGGAGGAGCTTGGCGGCGGTGGAGAAGGAGCACATGTTCGACAAGGTGGTGACGCCGAGCGACGTGGGGAAGCTGAACCGGCTGGTGATCCCGAAGCAGCACGCGGAGAAGTACTTCCCGCTGGACGCGGCGTCCAACGAGAAGGGGCTCCTGCTCAGCTTCGAGGACCGCACGGGGAAGCCATGGCGGTTCCGCTACTCCTACTGGAACAGCAGCCAGAGCTACGTGATGACCAAGGGGTGGAGCCGCTTCGTCAAGGAGAAGCGACTCGACGCCGGGGACACCGTCTCCTTCGGCCGCGGCGTCGGCGAGGCCGCGCGCGGGAGGCTCTTCATCGACTGGCGCCGCCGCCCCGACGTCGTCGCCGCGCTCCAGCCGCCCACGCACCGCTTCGCCCACCACCTCCCTTCCTCCATCCCCTTCGCTCCCTGGGCGCACCACCACGGACACGGAGCCGCCGCCGCCGCCGCCGCCGCCGCCGGCGCCAGGTTTCTCCTGCCTCCCTCCTCGACTCCCATCTACGACCACCACCGCCGACACGCCCACGCCGTCGGGTACGACGCGTACGCCGCGGCCACCAGCAGGCAGGTGCTGTTCTACCGGCCGTTGCCGCCGCAGCAGCAGCATCATCCCGCGGTGGTGCTGGAGTCGGTGCCGGTGCGCATGACGGCGGGGCACGCGGAGCCGCCGTCGGCTCCGTCGAAGCGAGTTCGGCTGTTCGGGGTGAACCTCGACTGCGCGAATTCCGAACAAGACCACGCCGGCGTGGTCGGGAAGACGGCGCCGCCGCCGCTGCCATCGCCGCCGTCATCATCGTCATCTTCCTCCGGGAAAGCGAGGTGCTCCTTGAACCTTGACTTGTGA
SEQ ID NO:58
MEFITPIVRPASAAAGGGEVQESGGRSLAAVEKEHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDAASNEKGLLLSFEDRTGKPWRFRYSYWNSSQSYVMTKGWSRFVKEKRLDAGDTVSFGRGVGEAARGRLFIDWRRRPDVVAALQPPTHRFAHHLPSSIPFAPWAHHHGHGAAAAAAAAAGARFLLPPSSTPIYDHHRRHAHAVGYDAYAAATSRQVLFYRPLPPQQQHHPAVVLESVPVRMTAGHAEPPSAPSKRVRLFGVNLDCANSEQDHAGVVGKTAPPPLPSPPSSSSSSSGKARCSLNLDL
Os01g0693400
Coverage 47%, identity 63%
CDS SEQ ID NO:59
ATGGACAGCTCCAGCTGCCTGGTGGATGATACCAACAGCGGCGGCTCGTCCACGGACAAGCTGAGGGCGTTGGCCGCCGCGGCGGCGGAGACGGCGCCGCTGGAGCGCATGGGGAGCGGGGCGAGCGCGGTGGTGGACGCGGCCGAGCCTGGCGCGGAGGCGGACTCCGGGTCCGGGGGACGTGTGTGCGGCGGCGGCGGCGGCGGTGCCGGCGGTGCGGGAGGGAAGCTGCCGTCGTCCAAGTTCAAGGGCGTCGTGCCGCAGCCCAACGGGAGGTGGGGCGCGCAGATCTACGAGCGGCACCAGCGGGTGTGGCTCGGCACGTTCGCCGGGGAGGACGACGCCGCGCGCGCCTACGACGTCGCCGCGCAGCGCTTCCGCGGCCGCGACGCCGTCACCAACTTCCGCCCGCTCGCCGAGGCCGACCCGGACGCCGCCGCCGAGCTTCGCTTCCTCGCCACGCGCTCCAAGGCCGAGGTCGTCGACATGCTCCGCAAGCACACCTACTTCGACGAGCTCGCGCAGAGCAAGCGCACCTTCGCCGCCTCCACGCCGTCGGCCGCGACCACCACCGCCTCCCTCTCCAACGGCCACCTCTCGTCGCCCCGCTCCCCCTTCGCGCCCGCCGCGGCGCGCGACCACCTGTTCGACAAGACGGTCACCCCGAGCGACGTGGGCAAGCTGAACAGGCTCGTCATACCGAAGCAGCACGCCGAGAAGCACTTCCCGCTACAGCTCCCGTCCGCCGGCGGCGAGAGCAAGGGTGTCCTCCTCAACTTCGAGGACGCCGCCGGCAAGGTGTGGCGGTTCCGGTACTCGTACTGGAACAGCAGCCAGAGCTACGTGCTAACCAAGGGCTGGAGCCGCTTCGTCAAGGAGAAGGGTCTCCACGCCGGCGACGTCGTCGGCTTCTACCGCTCCGCCGCCAGTGCCGGCGACGACGGCAAGCTCTTCATCGACTGCAAGTTAGTACGGTCGACCGGCGCCGCCCTCGCGTCGCCCGCTGATCAGCCAGCGCCGTCGCCGGTGAAGGCCGTCAGGCTCTTCGGCGTGGACCTGCTCACGGCGCCGGCGCCGGTCGAACAGATGGCCGGGTGCAAGAGAGCCAGGGACTTGGCGGCGACGACGCCTCCACAAGCGGCGGCGTTCAAGAAGCAATGCATAGAGCTGGCACTAGTATAG
SEQ ID NO:60
MDSSSCLVDDTNSGGSSTDKLRALAAAAAETAPLERMGSGASAVVDAAEPGAEADSGSGGRVCGGGGGGAGGAGGKLPSSKFKGVVPQPNGRWGAQIYERHQRVWLGTFAGEDDAARAYDVAAQRFRGRDAVTNFRPLAEADPDAAAELRFLATRSKAEVVDMLRKHTYFDELAQSKRTFAASTPSAATTTASLSNGHLSSPRSPFAPAAARDHLFDKTVTPSDVGKLNRLVIPKQHAEKHFPLQLPSAGGESKGVLLNFEDAAGKVWRFRYSYWNSSQSYVLTKGWSRFVKEKGLHAGDVVGFYRSAASAGDDGKLFIDCKLVRSTGAALASPADQPAPSPVKAVRLFGVDLLTAPAPVEQMAGCKRARDLAATTPPQAAAFKKQCIELALV
Os10g0537100 LOC_Os10g39190
Coverage 47%, identity 60%
CDS SEQ ID NO:61
ATGGAGTTCACCCCAATTTCGCCGCCGACGAGGGTCGCCGGCGGTGAGGAGGATTCCGAGAGGGGGGCGGCGGCGTGGGCGGTGGTGGAGAAGGAGCACATGTTTGAGAAGGTCGTGACGCCGAGCGACGTGGGGAAGCTGAACCGATTGGTCATCCCCAAGCAGCACGCCGAGAGGTACTTCCCGCTCGACGCCGCGGCGGGCGCCGGCGGCGGCGGTGGTGGCGGCGGTGGCGGCGGCGGGGGGAAGGGGCTGGTGCTGAGCTTCGAGGACAGGACGGGGAAGGCGTGGAGGTTCCGGTACTCGTACTGGAACAGCAGCCAGAGCTACGTGATGACCAAAGGGTGGAGCCGCTTCGTCAAGGAGAAGCGCCTCGGCGCCGGCGACACCGTGTCGTTCGGCCGCGGCCTCGGCGACGCCGCCCGCGGCCGCCTCTTCATCGACTTCCGCCGCCGCCGCCAGGACGCCGGCAGCTTCATGTTCCCGCCGACGGCGGCGCCGCCGTCGCACTCGCACCACCATCATCAGCGACACCACCCGCCGCTCCCGTCCGTGCCCCTTTGCCCGTGGCGAGACTACACCACCGCCTATGGCGGCGGCTACGGCTACGGCTACGGCGGCGGCTCCACCCCGGCGTCCAGCCGCCACGTGCTGTTCCTCCGGCCGCAGGTGCCGGCCGCTGTGGTGCTCAAGTCGGTGCCGGTGCACGTCGCGGCCACCTCGGCGGTGCAGGAGGCGGCGACGACGACAAGGCCGAAGCGTGTCCGGCTGTTCGGGGTGAACCTCGACTGCCCGGCGGCCATGGACGACGACGACGACATCGCCGGAGCGGCGAGCCGGACGGCAGCGTCGTCTCTCCTGCAGCTCCCCTCGCCGTCGTCCTCGACGTCGTCGTCGACGGCGGGGAAGAAGATGTGCTCCTTGGATCTTGGGTTGTGA
SEQ ID NO:62
MEFTPISPPTRVAGGEEDSERGAAAWAVVEKEHMFEKVVTPSDVGKLNRLVIPKQHAERYFPLDAAAGAGGGGGGGGGGGGGKGLVLSFEDRTGKAWRFRYSYWNSSQSYVMTKGWSRFVKEKRLGAGDTVSFGRGLGDAARGRLFIDFRRRRQDAGSFMFPPTAAPPSHSHHHHQRHHPPLPSVPLCPWRDYTTAYGGGYGYGYGGGSTPASSRHVLFLRPQVPAAVVLKSVPVHVAATSAVQEAATTTRPKRVRLFGVNLDCPAAMDDDDDIAGAASRTAASSLLQLPSPSSSTSSSTAGKKMCSLDLGL
Soybean (Glycine max)
Loc100795470
Coverage 75%, identity 53%
SEQ ID NO:63
Msinhysmdlpeptlwwphphhqqqqltlmdpdplrlnlnsddgngndndndenqttttggeqeilddkepmfekpltpsdvgklnrlvipkqhaekyfplsgdsggseckglllsfedesgkcwrfrysywnssqsyvltkgwsryvkdkrldagdvvlferhrvdaqrlfigwrrrrqsdaalppahvssrksgggdgnsnknegwtrgfysahhpypthhlhhhqpspyqqqhdclhagrgsqgqnqrmrpvgnnssssssssrvlrlfgvdmecqpehddsgpstpqcsynsnnmlpstqgtdhshhnfyqqqpsnsnpsphhmmvhhqpyyy
CDS SEQ ID NO:64
ATGTCCATAAACCACTACTCCATGGACCTTCCCGAACCGACACTCTGGTGGCCACACCCACACCACCAACAACAACAACTAACCTTAATGGATCCTGACCCTCTCCGTCTCAACCTCAATAGCGACGATGGCAATGGCAATGACAACGACAACGACGAAAATCAAACAACCACAACAGGAGGAGAACAAGAAATATTAGACGATAAAGAACCGATGTTCGAGAAGCCCTTAACCCCGAGCGACGTGGGGAAGCTGAACCGTCTCGTAATCCCGAAGCAGCACGCGGAGAAGTACTTCCCACTGAGTGGTGACTCGGGCGGGAGCGAGTGCAAGGGGCTGTTACTGAGTTTCGAGGACGAGTCGGGGAAGTGTTGGCGCTTCCGCTACTCGTACTGGAACAGCAGCCAGAGCTACGTGCTCACCAAAGGGTGGAGCCGCTACGTCAAGGACAAGCGCCTTGACGCGGGCGACGTCGTTTTGTTCGAGCGTCACCGCGTCGACGCGCAGCGCCTCTTCATCGGGTGGAGGCGCAGGCGGCAGAGCGATGCCGCCTTGCCGCCTGCGCACGTTAGCAGTAGGAAGAGTGGTGGTGGTGATGGGAATAGTAATAAGAATGAGGGGTGGACCAGAGGGTTCTATTCTGCGCATCATCCTTATCCTACGCATCATCTTCATCATCATCAGCCCTCGCCATACCAACAACAACATGACTGTCTTCATGCAGGTAGAGGGTCCCAAGGTCAGAACCAAAGGATGAGACCAGTGGGAAACAACAGTTCTAGCTCTAGTTCGAGTTCAAGGGTACTTAGGCTGTTCGGGGTCGACATGGAATGCCAACCCGAACATGATGATTCTGGTCCCTCCACACCCCAATGCTCCTACAATAGTAACAACATGTTGCCATCAACACAGGGCACAGATCATTCCCATCACAATTTCTACCAACAGCAACCTTCTAATTCCAATCCTTCCCCTCATCACATGATGGTACATCACCAACCATACTACTACTAG
Loc100818164
Coverage 50%, identity 73%
SEQ ID NO:65
MSTNHYTMDLPEPTLWWPHPHQQQLTLIDPDPLPLNLNNDDNDNGDDNDNDENQTVTTTTTGGEEEIINNKEPMFEKPLTPSDVGKLNRLVIPKQHAEKYFPLSGGDSGSSECKGLLLSFEDESGKCWRFRYSYWNSSQSYVLTKGWSRYVKDKRLDAGDVVLFQRHRADAQRLFIGWRRRRQSDALPPPAHVSSRKSGGDGNSSKNEGDVGVGWTRGFYPAHHPYPTHHHHPSPYHHQQDDSLHAVRGSQGQNQRTRPVGNSSSSSSSSSRVLRLFGVNMECQPEHDDSGPSTPQCSYNTNNILPSTQGTDIHSHLNFYQQQQTSNSKPPPHHMMIRHQPYYY
CDS SEQ ID NO:66
ATGTCGACAAACCACTACACCATGGACCTTCCCGAACCAACACTCTGGTGGCCACACCCACACCAACAACAACTAACCTTAATAGATCCAGACCCTCTCCCTCTGAACCTCAACAACGACGACAACGACAATGGCGACGACAACGACAACGACGAAAACCAAACAGTTACAACAACCACAACAGGAGGAGAAGAAGAAATAATAAACAATAAAGAACCGATGTTCGAGAAGCCGCTAACCCCGAGCGACGTGGGGAAGCTGAACCGCCTCGTAATCCCGAAGCAGCACGCTGAGAAGTACTTTCCACTGAGTGGTGGTGACTCGGGCAGTAGCGAGTGCAAGGGGCTGTTACTGAGTTTCGAGGACGAGTCGGGGAAGTGCTGGCGCTTCCGCTACTCGTACTGGAACAGCAGCCAGAGCTACGTGCTCACCAAAGGGTGGAGCCGTTACGTGAAGGACAAGCGCCTCGATGCGGGAGATGTCGTTTTATTCCAGCGCCACCGCGCCGACGCGCAGCGCCTCTTCATCGGCTGGAGGCGCAGGCGGCAGAGCGACGCCCTGCCGCCGCCTGCGCACGTTAGCAGCAGGAAGAGTGGTGGTGATGGGAATAGTAGTAAGAATGAGGGTGATGTGGGCGTGGGCTGGACCAGAGGGTTCTATCCTGCGCATCATCCTTATCCTACGCATCATCATCATCCCTCGCCATACCATCACCAACAAGATGACTCTCTTCATGCAGTTAGAGGGTCCCAAGGTCAGAACCAAAGGACGAGACCAGTGGGAAACAGCAGTTCTAGTTCGAGTTCGAGTTCAAGGGTACTTAGGCTATTCGGGGTCAACATGGAATGCCAACCCGAACATGATGATTCTGGACCCTCCACACCCCAATGCTCCTACAATACTAACAACATATTGCCATCCACACAGGGCACAGATATTCATTCCCATCTCAATTTCTACCAACAACAACAAACTTCTAATTCCAAGCCTCCCCCTCATCACATGATGATACGTCACCAACCATACTACTACTAG
Loc100802734
Coverage 77%, identity 53%
SEQ ID NO:67
MSSINHYSPETTLYWTNDQQQQAAMWLSNSHTPRFNLNDEEEEEEDDVIVSDKATNNLTQEEEKVAMFEKPLTPSDVGKLNRLVIPKQHAEKHFPLDSSAAKGLLLSFEDESGKCWRFRYSYWNSSQSYVLTKGWSRYVKDKRLHAGDVVLFHRHRSLPQRFFISCSRRQPNPVPAHVSTTRSSASFYSAHPPYPAHHFPFPYQPHSLHAPGGGSQGQNETTPGGNSSSSGSGRVLRLFGVNMECQPDNHNDSQNSTPECSYTHLYHHQTSSYSSSSNPHHHMVPQQP
CDS SEQ ID NO:68
ATGTCATCGATAAACCACTATTCACCGGAAACAACACTATACTGGACCAACGACCAACAGCAACAAGCCGCCATGTGGCTGAGTAATTCCCACACCCCGCGTTTCAATCTGAACGACGAGGAGGAGGAGGAGGAAGACGACGTTATCGTTTCGGACAAGGCTACTAATAACTTGACGCAAGAGGAGGAGAAGGTAGCCATGTTCGAGAAGCCGTTGACGCCGAGCGACGTCGGGAAGCTGAACCGGCTCGTGATTCCGAAACAGCACGCGGAGAAGCACTTCCCTCTCGACTCGTCGGCGGCGAAGGGGCTGTTGCTGAGTTTCGAGGACGAGTCCGGGAAGTGTTGGCGCTTCCGTTACTCTTATTGGAACAGTAGCCAGAGTTACGTTTTGACCAAAGGATGGAGCCGTTACGTCAAAGACAAACGCCTCCACGCTGGCGACGTCGTTTTGTTCCACAGACACCGCTCCCTCCCTCAACGCTTCTTCATCTCCTGCAGCCGCCGCCAACCCAACCCGGTCCCCGCTCACGTTAGCACCACCAGATCCTCCGCTTCCTTCTACTCTGCGCACCCACCTTATCCTGCGCACCACTTCCCCTTCCCATACCAACCTCACTCTCTTCATGCACCAGGTGGAGGGTCCCAAGGACAGAACGAAACGACACCGGGAGGGAACAGTAGTTCAAGTGGCAGTGGCAGGGTGCTGAGGCTCTTTGGTGTGAACATGGAATGCCAACCTGATAATCATAATGATTCCCAGAACTCCACACCAGAATGCTCCTACACCCACTTATACCACCATCAAACCTCTTCTTATTCTTCTTCTTCAAACCCTCACCATCACATGGTACCTCAACAACCATAA
Loc100781489
Coverage 49%, identity 64%
SEQ ID NO:69
MELMQQVKGNYSDSREEEEEEEAAAITRESESSRLHQQDTASNFGKKLDLMDLSLGSSKEEEEEGNLQQGGGGVVHHAHQVVEKEHMFEKVATPSDVGKLNRLVIPKQHAEKYFPLDSSTNEKGLLLNFEDRNGKVWRFRYSYWNSSQSYVMTKGWSRFVKEKKLDAGDIVSFQRGLGDLYRHRLYIDWKRRPDHAHAHPPHHHDPLFLPSIRLYSLPPTMPPRYHHDHHFHHHLNYNNLFTFQQHQYQQLGAATTTHHNNYGYQNSGSGSLYYLRSSMSMGGGDQNLQGRGSNIVPMIIDSVPVNVAHHNNNRHGNGGITSGGTNCSGKRLRLFGVNMECASSAEDSKELSSGSAAHVTTAASSSSLHHQRLRVPVPVPLEDPLSSSAAAAARFGDHKGASTGTSLLFDLDPSLQYHRH
CDS SEQ ID NO:70
ATGGAGTTGATGCAACAAGTTAAAGGTAATTATTCTGATAGCAGGGAGGAAGAGGAGGAAGAGGAAGCTGCAGCAATCACAAGGGAATCAGAAAGCAGCAGGTTACACCAACAAGATACAGCATCCAATTTTGGAAAGAAGCTAGACTTGATGGACTTGTCACTAGGGAGCAGCAAGGAAGAGGAAGAGGAAGGGAATTTGCAACAAGGAGGAGGAGGAGTGGTTCATCATGCTCACCAAGTAGTGGAGAAAGAACACATGTTTGAGAAAGTGGCGACACCGAGCGACGTAGGGAAGCTGAACAGGCTGGTGATACCGAAGCAGCACGCGGAGAAGTACTTCCCCCTTGACTCCTCAACCAACGAGAAGGGTCTGCTCCTGAATTTCGAGGACAGGAATGGGAAGGTGTGGCGATTCAGGTATTCCTATTGGAACAGCAGCCAGAGCTATGTGATGACAAAAGGGTGGAGCCGCTTTGTTAAGGAGAAGAAGCTGGATGCCGGTGACATTGTCTCCTTCCAGCGTGGCCTTGGGGATTTGTATAGACATCGGTTGTATATAGATTGGAAGAGAAGGCCCGATCATGCTCATGCTCATCCACCTCATCATCACGATCCTTTGTTTCTTCCCTCTATCAGATTGTACTCTCTCCCTCCCACCATGCCACCTCGCTACCACCACGATCATCACTTTCACCACCATCTCAATTACAACAACCTCTTCACTTTTCAGCAACACCAGTACCAGCAGCTTGGTGCTGCCACTACCACTCATCACAACAACTATGGTTACCAGAATTCGGGATCTGGTTCACTCTATTACCTAAGGTCCTCTATGTCAATGGGTGGTGGTGATCAAAACTTGCAAGGGAGAGGGAGCAACATTGTCCCCATGATCATTGATTCTGTGCCGGTTAACGTTGCTCATCACAACAACAATCGCCATGGGAATGGGGGCATCACGAGTGGTGGTACTAATTGTAGTGGAAAACGACTAAGGCTATTTGGGGTGAACATGGAATGCGCTTCTTCGGCAGAAGATTCCAAAGAATTGTCCTCGGGTTCGGCAGCACACGTGACGACAGCTGCTTCTTCTTCTTCTCTTCATCATCAGCGCTTGAGGGTGCCAGTGCCAGTGCCACTTGAAGATCCACTTTCGTCGTCAGCAGCAGCAGCAGCAAGGTTTGGGGATCACAAAGGGGCCAGTACTGGGACTTCGCTGCTGTTTGATTTGGATCCCTCTTTGCAGTATCATCGCCACTGA
Loc100776987
Coverage 46%, identity 62%
SEQ ID NO:71
MDAISCLDESTTTESLSISQAKPSSTIMSSEKASPSPPPPNRLCRVGSGASAVVDSDGGGGGGSTEVESRKLPSSKYKGVVPQPNGRWGSQIYEKHQRVWLGTFNEEDEAARAYDVAVQRFRGKDAVTNFKPLSGTDDDDGESEFLNSHSKSEIVDMLRKHTYNDELEQSKRSRGFVRRRGSAAGAGNGNSISGACVMKAREQLFQKAVTPSDVGKLNRLVIPKQHAEKHFPLQSAANGVSATATAAKGVLLNFEDVGGKVWRFRYSYWNSSQSYVLTKGWSRFVKEKNLKAGDTVCFQRSTGPDRQLYIDWKTRNVVNEVALFGPVVEPIQMVRLFGVNILKLPGSDSIANNNNASGCCNGKRREMELFSLECSKKPKIIGAL
CDS SEQ ID NO:72
ATGGATGCAATTAGTTGCCTGGATGAGAGCACCACCACCGAGTCACTCTCCATAAGTCAGGCGAAGCCTTCTTCGACGATTATGTCGTCCGAGAAGGCTTCTCCTTCCCCGCCGCCGCCGAACAGGCTGTGCCGCGTCGGTAGCGGTGCTAGCGCAGTCGTGGATTCCGACGGCGGCGGCGGGGGTGGCAGCACCGAGGTGGAGTCGCGGAAGCTCCCCTCGTCCAAGTATAAGGGCGTCGTGCCCCAGCCCAACGGCCGCTGGGGCTCGCAGATTTACGAGAAGCACCAGCGCGTGTGGCTGGGAACGTTCAACGAGGAAGACGAGGCGGCGCGTGCGTACGACGTCGCCGTGCAGCGATTCCGCGGCAAGGACGCCGTCACAAACTTCAAGCCGCTCTCCGGCACCGACGACGACGACGGGGAATCGGAGTTTCTCAACTCGCATTCGAAATCCGAGATCGTCGACATGCTGCGTAAGCATACGTACAATGACGAGCTGGAACAAAGCAAGCGCAGCCGCGGCTTCGTACGTCGGCGCGGCTCCGCCGCCGGCGCCGGAAACGGAAACTCAATCTCCGGCGCGTGTGTTATGAAGGCGCGTGAGCAGCTATTCCAGAAGGCCGTTACGCCGAGCGACGTTGGGAAACTGAACCGTTTGGTGATACCGAAGCAGCACGCGGAGAAGCACTTTCCTTTACAGAGCGCTGCTAACGGCGTTAGCGCGACGGCGACGGCGGCGAAGGGCGTTTTGTTGAACTTCGAAGACGTTGGAGGGAAAGTGTGGCGGTTTCGTTACTCGTATTGGAACAGTAGCCAGAGTTACGTCTTGACCAAAGGTTGGAGCCGGTTCGTTAAGGAGAAGAATCTGAAAGCCGGTGACACGGTTTGTTTTCAACGGTCCACTGGACCGGACAGGCAGCTTTACATCGATTGGAAGACGAGGAATGTTGTTAACGAGGTCGCGTTGTTCGGACCGGTTGTCGAACCGATCCAGATGGTTCGGCTCTTTGGTGTTAACATTTTGAAACTACCCGGTTCAGATTCTATCGCCAATAACAATAATGCAAGTGGGTGCTGCAATGGCAAGAGAAGAGAAATGGAACTCTTTTCATTAGAGTGTAGCAAGAAACCTAAGATTATTGGTGCTTTGTAG
Loc100778733
Coverage 44%, identity 64%
SEQ ID NO:73
MELMQEVKGYSDGREEEEEEEEAAEEIITREESSRLLHQHQEAAGSNFIINNNHHHHQHHHHHTTKQLDFMDLSLGSSKDEGNLQGSSSSVYAHHHHAASASSSANGNNNNSSSSNLQQQQQQPAEKEHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDSSANEKGLLLNFEDRNGKLWRFRYSYWNSSQSYVMTKGWSRFVKEKKLDAGDMVSFQRGVGELYRHRLYIDWWRRPDHHHHHHHGPDHSTTLFTPFLIPNQPHHLMSIRWGATGRLYSLPSPTPPRHHEHLNYNNNAMYHPFHHHGAGSGINATTHHYNNYHEMSSTTTSGSAGSVFYHRSTPPISMPLADHQTLNTRQQQQQQQQQEGAGNVSLSPMIIDSVPVAHHLHHQQHHGGKSSGPSSTSTSPSTAGKRLRLFGVNMECASSTSEDPKCFSLLSSSSMANSNSQPPLQLLREDTLSSSSARFGDQRGVGEPSMLFDLDPSLQYRQ
CDS SEQ ID NO:74
ATGGAGTTGATGCAAGAAGTGAAAGGGTATTCTGATGGCAGAGAGGAGGAGGAGGAGGAAGAGGAAGCAGCAGAAGAAATCATCACAAGAGAAGAAAGCAGCAGGTTGTTACACCAGCACCAGGAGGCAGCAGGTTCCAATTTCATCATCAACAATAATCATCATCATCATCAACATCACCACCACCACACAACAAAGCAGCTAGACTTCATGGACTTGTCACTTGGTAGCAGCAAGGATGAAGGGAATTTGCAAGGATCATCTTCTTCTGTCTATGCTCATCATCATCATGCAGCAAGTGCTAGTTCTTCTGCCAATGGTAACAACAACAACAGCAGCAGCAGCAACTTGCAGCAACAGCAGCAGCAGCCTGCTGAGAAGGAGCACATGTTTGATAAAGTAGTGACACCAAGTGATGTGGGGAAGCTGAACCGGTTGGTGATACCAAAGCAGCATGCTGAGAAGTATTTCCCTCTTGATTCCTCAGCCAATGAGAAGGGTCTGTTGCTGAATTTTGAGGACAGGAATGGTAAGTTGTGGAGGTTCAGGTACTCCTATTGGAACAGCAGCCAGAGCTATGTGATGACCAAAGGTTGGAGCCGTTTTGTTAAGGAGAAGAAGCTTGATGCTGGTGACATGGTGTCCTTCCAGCGTGGTGTTGGGGAGTTGTATAGGCATAGGTTGTACATAGATTGGTGGAGAAGGCCTGATCATCATCACCATCACCATCATGGCCCTGACCATTCAACCACACTCTTCACACCTTTCTTAATTCCCAATCAGCCTCATCACTTAATGTCCATCAGATGGGGTGCCACTGGCAGATTGTACTCCCTCCCTTCCCCAACCCCACCACGCCACCATGAACACCTCAATTACAACAATAACGCCATGTATCATCCCTTTCATCACCATGGTGCTGGAAGTGGAATTAATGCTACTACTCATCACTACAACAACTATCATGAGATGAGTAGTACTACTACTTCAGGATCTGCAGGCTCAGTCTTTTACCACAGGTCAACACCCCCAATATCAATGCCATTGGCTGACCACCAAACCTTGAACACAAGGCAGCAGCAACAACAACAACAACAACAAGAGGGAGCTGGCAATGTTTCTCTTTCCCCTATGATCATTGATTCTGTTCCAGTTGCTCACCACCTCCATCATCAACAACACCATGGTGGCAAGAGTAGTGGTCCTAGTAGTACTAGTACTAGTCCTAGCACTGCAGGGAAAAGACTAAGGCTATTTGGGGTCAACATGGAATGTGCTTCTTCAACATCAGAAGACCCCAAATGCTTCAGCTTGTTGTCCTCATCTTCAATGGCTAATTCCAATTCACAACCACCACTTCAGCTTTTGAGGGAAGATACACTTTCGTCATCATCGGCAAGGTTTGGGGATCAGAGAGGAGTAGGGGAACCTTCAATGCTTTTTGATCTGGACCCTTCTTTGCAATACCGGCAGTGA
Loc732601
Coverage 44%, identity 62%
SEQ ID NO:75
MDGGCVTDETTTSSDSLSVPPPSRVGSVASAVVDPDGCCVSGEAESRKLPSSKYKGVVPQPNGRWGAQIYEKHQRVWLGTFNEEDEAARAYDIAALRFRGPDAVTNFKPPAASDDAESEFLNSHSKFEIVDMLRKHTYDDELQQSTRGGRRRLDADTASSGVFDAKAREQLFEKTVTPSDVGKLNRLVIPKQHAEKHFPLSGSGDESSPCVAGASAAKGMLLNFEDVGGKVWRFRYSYWNSSQSYVLTKGWSRFVKEKNLRAGDAVQFFKSTGPDRQLYIDCKARSGEVNNNAGGLFVPIGPVVEPVQMVRLFGVNLLKLPVPGSDGVGKRKEMELFAFECCKKLKVIGAL
CDS SEQ ID NO:76
ATGGATGGAGGCTGTGTCACAGACGAAACCACCACATCCAGCGACTCTCTTTCCGTTCCGCCGCCCAGCCGCGTCGGCAGCGTTGCAAGCGCCGTCGTCGACCCCGACGGTTGTTGCGTTTCCGGCGAGGCCGAATCCCGGAAACTCCCTTCGTCGAAATACAAAGGCGTGGTGCCGCAACCGAACGGTCGCTGGGGAGCTCAGATTTACGAGAAGCACCAGCGCGTGTGGCTCGGCACTTTCAACGAGGAAGACGAAGCCGCCAGAGCCTACGACATCGCCGCGCTGCGCTTCCGCGGCCCCGACGCCGTCACCAACTTCAAGCCTCCCGCCGCCTCCGACGACGCCGAGTCCGAGTTCCTCAACTCGCATTCCAAGTTCGAGATCGTCGACATGCTCCGCAAGCACACCTACGACGACGAGCTCCAGCAGAGCACGCGCGGTGGTAGGCGCCGCCTCGACGCTGACACCGCGTCGAGCGGTGTGTTCGACGCGAAAGCGCGTGAGCAGCTGTTCGAGAAAACGGTTACGCCGAGCGACGTCGGGAAGCTGAATCGATTAGTGATACCGAAGCAGCACGCGGAGAAGCACTTTCCGTTAAGCGGATCCGGCGACGAAAGCTCGCCGTGCGTGGCGGGGGCTTCGGCGGCGAAGGGAATGTTGTTGAACTTTGAGGACGTTGGAGGGAAAGTGTGGCGGTTTCGTTACTCTTATTGGAACAGTAGCCAGAGCTACGTGCTTACCAAAGGATGGAGCCGGTTCGTTAAGGAGAAGAATCTTCGAGCCGGTGACGCGGTTCAGTTCTTCAAGTCGACCGGACCGGACCGGCAGCTATATATAGACTGCAAGGCGAGGAGTGGTGAGGTTAACAATAATGCTGGCGGTTTGTTTGTTCCGATTGGACCGGTCGTTGAGCCGGTTCAGATGGTTCGGCTTTTCGGGGTCAACCTTTTGAAACTACCCGTACCCGGTTCGGATGGTGTAGGGAAGAGAAAAGAGATGGAACTGTTTGCATTTGAATGTTGCAAGAAGTTAAAAGTAATTGGAGCTTTGTAA
Loc100801107
Coverage 44%, identity 61%
SEQ ID NO:77
MDAISCMDESTTTESLSISLSPTSSSEKAKPSSMITSSEKVSLSPPPSNRLCRVGSGASAVVDPDGGGSGAEVESRKLPSSKYKGVVPQPNGRWGAQIYEKHQRVWLGTFNEEDEAARAYDIAAQRFRGKDAVTNFKPLAGADDDDGESEFLNSHSKPEIVDMLRKHTYNDELEQSKRSRGVVRRRGSAAAGTANSISGACFTKAREQLFEKAVTPSDVGKLNRLVIPKQHAEKHFPLQSSNGVSATTIAAVTATPTAAKGVLLNFEDVGGKVWRFRYSYWNSSQSYVLTKGWSRFVKEKNLKAGDTVCFHRSTGPDKQLYIDWKTRNVVNNEVALFGPVGPVVEPIQMVRLFGVNILKLPGSDTIVGNNNNASGCCNGKRREMELFSLECSKKPKIIGAL
CDS SEQ ID NO:78
ATGGATGCAATTAGTTGCATGGATGAGAGCACCACCACTGAGTCACTCTCTATAAGTCTTTCTCCGACGTCATCGTCGGAGAAAGCGAAGCCTTCTTCGATGATTACATCGTCGGAGAAGGTTTCTCTGTCCCCGCCGCCGTCAAACAGACTATGCCGTGTTGGAAGCGGCGCGAGCGCAGTCGTGGATCCTGATGGCGGCGGCAGCGGCGCTGAGGTAGAGTCGCGGAAACTCCCCTCGTCGAAGTACAAGGGCGTGGTGCCCCAGCCCAACGGCCGCTGGGGTGCGCAGATTTACGAGAAGCACCAGCGCGTGTGGCTTGGAACGTTCAACGAGGAAGACGAGGCGGCGCGTGCGTACGACATCGCCGCGCAGCGGTTCCGCGGCAAGGACGCCGTCACGAACTTCAAGCCGCTCGCCGGCGCCGACGACGACGACGGAGAATCGGAGTTTCTCAACTCGCATTCCAAACCCGAGATCGTCGACATGCTGCGAAAGCACACGTACAATGACGAGCTGGAGCAGAGCAAGCGCAGCCGCGGCGTCGTCCGGCGGCGAGGCTCCGCCGCCGCCGGCACCGCAAACTCAATTTCCGGCGCGTGCTTTACTAAGGCACGTGAGCAGCTATTCGAGAAGGCTGTTACGCCGAGCGACGTTGGGAAATTGAACCGTTTGGTGATACCGAAGCAGCACGCGGAGAAGCACTTTCCGTTACAGAGCTCTAACGGCGTTAGCGCGACGACGATAGCGGCGGTGACGGCGACGCCGACGGCGGCGAAGGGCGTTTTGTTGAACTTCGAAGACGTTGGAGGGAAAGTGTGGCGGTTTCGTTACTCGTATTGGAACAGTAGCCAGAGTTACGTCTTAACCAAAGGTTGGAGCCGGTTCGTTAAGGAGAAGAATCTGAAAGCTGGTGACACGGTTTGTTTTCACCGGTCCACTGGACCGGACAAGCAGCTTTACATCGATTGGAAGACGAGGAATGTTGTTAACAACGAGGTCGCGTTGTTCGGACCGGTCGGACCGGTTGTCGAACCGATCCAGATGGTTCGGCTCTTTGGGGTTAACATTTTGAAACTACCCGGTTCAGATACTATTGTTGGCAATAACAATAATGCAAGTGGGTGCTGCAATGGCAAGAGAAGAGAAATGGAACTGTTCTCGTTAGAGTGTAGCAAGAAACCTAAGATTATTGGTGCTTTGTAA
Loc100789009
Coverage 44%, identity 62%
SEQ ID NO:79
MDGGSVTDETTTTSNSLSVPANLSPPPLSLVGSGATAVVYPDGCCVSGEAESRKLPSSKYKGVVPQPNGRWGAQIYEKHQRVWLGTFNEEDEAARAYDIAAHRFRGRDAVTNFKPLAGADDAEAEFLSTHSKSEIVDMLRKHTYDNELQQSTRGGRRRRDAETASSGAFDAKAREQLFEKTVTQSDVGKLNRLVIPKQHAEKHFPLSGSGGGALPCMAAAAGAKGMLLNFEDVGGKVWRFRYSYWNSSQSYVLTKGWSRFVKEKNLRAGDAVQFFKSTGLDRQLYIDCKARSGKVNNNAAGLFIPVGPVVEPVQMVRLFGVDLLKLPVPGSDGIGVGCDGKRKEMELFAFECSKKLKVIGAL
CDS SEQ ID NO:80
ATGGATGGAGGCAGTGTCACAGACGAAACCACCACAACCAGCAACTCTCTTTCGGTTCCGGCGAATCTATCTCCGCCGCCTCTCAGCCTTGTCGGCAGCGGCGCAACCGCCGTCGTCTACCCCGACGGTTGTTGCGTCTCCGGCGAAGCCGAATCCCGGAAACTCCCGTCCTCGAAATACAAAGGCGTGGTGCCGCAACCGAACGGTCGTTGGGGAGCTCAGATTTACGAGAAGCACCAGCGCGTGTGGCTCGGCACCTTCAACGAGGAAGACGAAGCCGCCAGAGCCTACGACATCGCCGCGCATCGCTTCCGCGGCCGCGACGCCGTCACTAACTTCAAGCCTCTCGCCGGCGCCGACGACGCCGAAGCCGAGTTCCTCAGCACGCATTCCAAGTCCGAGATCGTCGACATGCTCCGCAAGCACACCTACGACAACGAGCTCCAGCAGAGCACCCGCGGCGGCAGGCGCCGCCGGGACGCCGAAACCGCGTCGAGCGGCGCGTTCGACGCGAAGGCGCGTGAGCAGCTGTTCGAGAAAACCGTTACGCAGAGCGACGTCGGGAAGCTGAACCGATTAGTGATACCAAAGCAGCACGCGGAGAAGCACTTTCCGTTAAGCGGATCCGGCGGCGGAGCCTTGCCGTGCATGGCGGCGGCTGCGGGGGCGAAGGGAATGTTGCTGAACTTTGAGGACGTTGGAGGGAAAGTGTGGCGGTTCCGTTACTCGTATTGGAACAGTAGCCAGAGCTACGTGCTTACCAAAGGATGGAGCCGGTTCGTTAAGGAGAAGAATCTTCGAGCTGGTGACGCGGTTCAGTTCTTCAAGTCGACCGGACTGGACCGGCAACTATATATAGACTGCAAGGCGAGGAGTGGTAAGGTTAACAATAATGCTGCCGGTTTGTTTATTCCGGTTGGACCGGTTGTTGAGCCGGTTCAGATGGTACGGCTTTTCGGGGTCGACCTTTTGAAACTACCCGTACCCGGTTCGGATGGTATTGGGGTTGGCTGTGACGGGAAGAGAAAAGAGATGGAGCTGTTTGCATTTGAATGTAGCAAGAAGTTAAAAGTAATTGGAGCTTTGTAA
Loc102660503
Coverage 36%, identity 57%
SEQ ID NO:81
migvekvticmrievntekgrralmdcwqisgvhessdcseikfafdavvkrarheennaaaqkfkgvvsqqngnwgaqiyahqqriwlgtfksereaamaydsasiklrsgechrnfpwndqtvqepqfqshysaetvlnmirdgtypskfatflktrqtqkgvakhiglkgddeeqfcctqlfqkeltpsdvgklnrlvipkkhavsyfpyvggsadesgsvdveavfydklmrlwkfrycywkssqsyvftrgwnrfvkdkklkakdviafftwgksggegeafalidviynnnaeedskgdtkqvlgnqlqlagseegededanigkdfnaqkglrlfgvcit
CDS SEQ ID NO:82
atgattggagttgagaaagtgacaatttgtatgagaatagaggtgaatactgaaaagggaagaagggctttaatggactgttggcaaatatcaggagttcatgaaagttcagattgtagcgaaatcaaatttgcattcgacgcagtagtaaaacgcgcgaggcatgaagagaataatgcagcagcacagaagttcaaaggcgttgtgtctcaacaaaatgggaactggggtgcacagatatatgcacaccagcagagaatctggttggggaccttcaaatctgaaagagaggctgcaatggcttatgacagcgccagcataaaacttagaagcggagagtgccacagaaactttccatggaacgaccaaacagttcaagagcctcagttccaaagccattacagcgcagaaacagtgctaaacatgattagagatggcacctatccatcaaaatttgctacatttctcaaaactcgtcaaacccaaaaaggcgttgcgaaacacataggtctgaagggtgatgacgaggaacagttttgttgcacccaactttttcagaaggaattaacaccaagtgatgtgggcaagctcaacaggcttgtcatcccaaagaagcatgcagttagctattttccttacgttggtggcagtgctgatgagagtggtagtgttgacgtggaggctgtgttttatgacaaactcatgcgattgtggaagttccgatactgctattggaagagcagccaaagttacgtgttcaccagaggctggaatcggtttgtgaaggataagaagttgaaggctaaagatgtcattgcgttttttacgtggggaaaaagtggaggagagggagaagcttttgcattgatcgatgtaatttataataataatgcagaagaagacagcaagggagacaccaaacaagttttgggaaaccaattacaattagctggcagtgaagaaggtgaagatgaagatgcaaacattggaaaggatttcaatgcacaaaagggtctgaggctctttggtgtgtgtatcacctaa
Barley (Hordeum vulgare)
MLOC_66387
Coverage 47%, identity 64%
SEQ ID NO:83
MEFTATSSRFSKGEEEVEEEQEEASMREIPFMTPAAATCAAAPPSASASASTPASASGSSPPFRSGDDAGASGSGAGDGSRSNVAEAVEKEHMFDKVVTPSDVGKLNRLVIPKQYAEKYFPLDSAANEKGLLLNFEDSAGKPWRFRYSYWNSSQSYVMTKGWSRFVKEKRLDAGDTVSFSRGAGEAARHRLFIDWKRRADTRDPLRLPRLPLPMPLTSHYSPWGLGAGARGFFMPPSPPATLYEHRLRQGFDFRGMNPSYPTMGRQVILFGSAARMPPHGPAPLLVPRPPPPLHFTVQQQGSDAGGSVTAGSPVVLDSVPVIESPTTATKKRVRLFGVNLDNPQHPGDGGGESSNYGSALPLQMPASAWRPRDHTLRLLEFPSHGAEASSPSSSSSSKREAHSGLDLDL
CDS SEQ ID NO:84
ATGGAGTTTACTGCGACAAGCAGTAGGTTTTCTAAAGGAGAGGAGGAGGTGGAGGAGGAGCAGGAGGAGGCGTCGATGCGCGAGATCCCTTTCATGACGCCCGCGGCCGCCACCTGCGCCGCGGCGCCGCCTTCTGCTTCTGCGTCGGCCTCGACACCCGCGTCAGCGTCTGGAAGTAGCCCTCCCTTTCGATCTGGGGATGACGCCGGAGCGTCGGGGAGCGGGGCCGGCGACGGCAGCCGCAGCAACGTGGCGGAGGCCGTGGAGAAGGAGCACATGTTCGACAAAGTGGTGACGCCGAGCGACGTGGGGAAGCTTAACCGGCTGGTCATCCCCAAGCAGTACGCCGAGAAGTACTTCCCGCTGGACTCGGCGGCCAACGAGAAGGGCCTTCTGCTCAACTTCGAGGACAGCGCCGGGAAGCCATGGCGCTTCCGCTATTCCTACTGGAACAGCAGCCAGAGCTACGTCATGACCAAAGGCTGGAGCCGCTTCGTCAAGGAGAAGCGCCTCGACGCTGGGGACACCGTCTCCTTCTCCCGCGGCGCCGGTGAGGCCGCGCGCCACCGCCTCTTCATCGACTGGAAGCGCCGAGCCGACACCAGAGACCCGCTCCGCTTGCCCCGCCTCCCGCTCCCGATGCCGCTGACGTCGCACTACAGCCCGTGGGGCCTCGGCGCCGGCGCCAGAGGATTCTTCATGCCTCCCTCGCCGCCAGCCACGCTCTACGAGCACCGTCTCCGTCAAGGCTTCGACTTCCGCGGCATGAACCCCAGTTACCCCACAATGGGGAGACAGGTCATCCTTTTCGGCTCGGCCGCCAGGATGCCTCCGCACGGACCAGCACCACTCCTCGTGCCGCGCCCGCCGCCGCCGCTGCACTTCACGGTGCAGCAACAAGGCAGCGACGCCGGCGGAAGTGTAACCGCAGGATCCCCAGTGGTGCTCGACTCAGTGCCGGTAATCGAAAGCCCCACGACGGCAACGAAGAAGCGCGTGCGCTTGTTCGGCGTGAACTTGGACAACCCGCAGCATCCCGGTGATGGCGGGGGCGAATCGAGCAATTATGGCAGTGCACTGCCATTGCAGATGCCCGCATCAGCATGGCGGCCAAGGGACCATACGCTGAGGCTGCTCGAATTCCCCTCGCACGGTGCCGAGGCGTCGTCTCCATCGTCGTCGTCGTCTTCCAAGAGGGAGGCGCATTCGGGCTTGGATCTCGATCTGTGA
MLOC_44012
Coverage 55%, identity 63%
SEQ ID NO:85
MLRKHTYFDELAQSKRAFAASAALSAPTTSGDAGGSASPPSPAAVREHLFDKTVTPSDVGKLNRLVIPKQNAEKHFPLQLPAGGGESKGLLLNFEDDAGKVWRFRYSYWNSSQSYVLTKGWSRFVKEKGLGAGDVVGFYRSAAGRTGEDSKFFIDCRLRPNTNTAAEADPVDQSSAPVQKAVRLFGVDLLAAPEQGMPGGCKRARDLVKPPPPKVAFKKQCIELALA
CDS SEQ ID NO:86
ATGCTCCGCAAGCACACCTACTTCGACGAGCTCGCCCAGAGCAAGCGCGCCTTCGCCGCGTCGGCCGCGCTCTCCGCGCCCACCACCTCGGGCGACGCCGGCGGCAGCGCCTCGCCGCCCTCCCCGGCCGCCGTGCGCGAGCACCTCTTCGACAAGACCGTCACGCCCAGCGACGTCGGCAAGCTGAACAGGCTGGTGATACCGAAGCAGAACGCCGAGAAGCACTTCCCGCTGCAGCTCCCGGCCGGCGGCGGCGAGAGCAAGGGCCTGCTCCTCAACTTCGAGGACGATGCGGGCAAGGTGTGGCGGTTCCGCTACTCGTACTGGAACAGCAGCCAGAGCTACGTCCTCACCAAGGGCTGGAGCCGCTTCGTGAAGGAGAAGGGCCTCGGCGCCGGAGACGTCGTCGGGTTCTACCGCTCCGCCGCCGGGAGGACCGGCGAAGACAGCAAGTTCTTCATTGACTGCAGGCTGCGGCCGAACACCAACACCGCCGCCGAAGCAGACCCCGTGGACCAGTCGTCGGCGCCCGTGCAGAAGGCCGTGAGACTCTTCGGCGTCGATCTTCTCGCGGCGCCGGAGCAGGGCATGCCGGGCGGGTGCAAGAGGGCCAGAGACTTGGTGAAGCCGCCGCCTCCGAAAGTGGCGTTCAAGAAGCAATGCATAGAGCTGGCGCTAGCGTAG
MLOC_57250
Coverage 50%, identity 57%
SEQ ID NO:87
MYCSRGRIDPAEEGQVMGGLGVRDASWALFKVLEQSDVQVGQNRLLLTKEAVWGGPIPKLFPELEELRGDGLNAENRVAVKILDADGCEGDANFRYLNSSKAYRVMGPQWSRLVKETGMCKGDRLDLYAATATAASSCSGARAAVAPAIPPGAIVKAAGF
CDS SEQ ID NO:88
ATGTATTGTTCCCGCGGCCGCATCGATCCCGCGGAAGAAGGGCAGGTGATGGGCGGCCTCGGCGTGCGCGACGCCAGCTGGGCGCTGTTCAAGGTGTTGGAGCAGTCCGACGTCCAGGTGGGGCAGAACCGGCTGCTCCTCACCAAGGAGGCGGTGTGGGGCGGCCCTATCCCCAAGCTTTTCCCGGAGCTGGAGGAGCTCCGCGGCGACGGCCTCAACGCCGAGAACAGGGTCGCGGTCAAGATCCTCGACGCCGACGGCTGCGAGGGGGACGCCAACTTCCGCTACCTCAACTCCAGCAAGGCGTACCGGGTCATGGGGCCTCAGTGGAGCCGGCTCGTGAAGGAGACCGGCATGTGCAAGGGAGACCGCCTCGATCTGTACGCGGCAACGGCGACCGCTGCCTCTTCGTGTTCTGGAGCCAGGGCGGCTGTGGCGCCGGCGATACCTCCCGGAGCAATCGTGAAGGCAGCCGGGTTCTAA
MLOC_38822
Coverage 47%, identity 56%
SEQ ID NO:89
MLRKHIYPDELAQHKRAFFFAAASSPTSSSSPLASPAPSAAAARREHLFDKTVTPSDVGKLNRLVIPKQHAEKHFPLQLPSASAAVPGECKGVLLNFDDATGKVWRFRYSYWNSSQSYVLTKGWSRFVKEKGLHAGDAVEFYRAASGNNQLFIDCKLRSKSTTTTTSVNSEAAPSPAPVTRTVRLFGVDLLIAPAARHAHEHEDYGMAKTNKRTMEASVAAPTPAHAVWKKRCVDFALTYRLATTPQCPRSRDQLEGVQAAGSTFAL
CDS SEQ ID NO:90
ATGCTGCGCAAGCACATCTATCCCGACGAGCTCGCGCAGCACAAGCGCGCCTTCTTCTTCGCCGCGGCGTCGTCCCCTACGTCGTCGTCGTCACCTCTCGCCTCGCCGGCTCCTTCAGCCGCGGCGGCGCGGCGCGAGCACCTGTTCGACAAGACGGTCACGCCCAGCGACGTGGGGAAGCTGAACCGGCTGGTGATCCCCAAGCAGCACGCCGAGAAGCACTTCCCGCTGCAGCTCCCTTCTGCCAGCGCCGCCGTGCCAGGCGAGTGCAAGGGCGTGCTGCTCAACTTCGATGACGCGACCGGCAAGGTGTGGAGGTTCCGGTACTCCTACTGGAACAGCAGCCAGAGCTACGTGCTCACCAAGGGGTGGAGCCGCTTCGTGAAGGAGAAGGGCCTTCACGCCGGCGACGCCGTCGAGTTCTACCGCGCCGCCTCCGGCAACAACCAGCTCTTCATCGACTGCAAGCTCCGGTCCAAGAGCACCACGACGACGACCTCCGTCAACTCGGAGGCCGCCCCATCGCCGGCACCCGTGACGAGGACAGTGCGACTCTTCGGGGTCGACCTTCTCATCGCGCCGGCGGCGAGGCACGCGCATGAGCACGAGGACTACGGCATGGCCAAGACAAACAAGAGAACCATGGAGGCCAGCGTAGCGGCGCCTACTCCGGCGCACGCGGTGTGGAAGAAGCGGTGCGTAGACTTCGCGCTGACCTACCGACTTGCCACCACCCCACAGTGCCCGAGGTCAAGAGATCAACTAGAAGGAGTACAAGCAGCTGGGAGTACATTTGCTCTATAG
MLOC_7940
Coverage 49%, identity 52%
SEQ ID NO:91
MGVEILSSTGEHSSQYSSGAASTATTESGVGGRPPTAPSLPVSIADESATSRSASAQSTSSRFKGVVPQPNGRWGAQIYERHARVWLGTFPDEDSAARAYDVAALRYRGREAATNFPCAAAEAELAFLAAHSKAEIVDMLRKHTYTDELRQGLRRGRGMGARAQPTPSWAREPLFEKAVTPSDVGKLNRLVVPKQHAEKHFPLKRTPETTTTTGKGVLLNFEDGEGKVWRFRYSYWNSSQSYVLTKGWSRFVREKGLGAGDSIVFSCSAYGQEKQFFIDCKKNKTMTSCPADDRGAATASPPVSEPTKGEQVRVVRLFGVDIAGEKRGRAAPVEQELFKRQCVAHSQHSPALGAFVL
CDS SEQ ID NO:92
ATGGGGGTGGAGATCCTGAGCTCAACGGGGGAACACTCCTCCCAGTACTCTTCCGGAGCCGCGTCCACGGCGACGACGGAGTCAGGCGTGGGCGGACGGCCGCCGACTGCGCCGAGCCTACCTGTTTCCATCGCCGACGAGTCGGCGACCTCGCGGTCGGCATCGGCGCAGTCGACGTCGTCGCGGTTCAAGGGCGTGGTGCCGCAGCCCAACGGGCGGTGGGGCGCCCAGATCTACGAGCGCCACGCCCGCGTCTGGCTCGGCACGTTCCCGGACGAAGACTCTGCGGCGCGCGCCTACGACGTGGCCGCGCTCCGGTACCGGGGCCGCGAGGCCGCCACCAACTTCCCGTGCGCGGCCGCCGAGGCGGAGCTCGCCTTCCTGGCGGCACACTCCAAGGCCGAGATCGTCGACATGCTCCGGAAGCACACCTACACCGACGAGCTCCGCCAGGGCCTGCGGCGCGGCCGCGGCATGGGGGCGCGCGCGCAGCCGACGCCGTCGTGGGCGCGGGAGCCCCTTTTCGAGAAGGCCGTGACCCCGAGCGACGTGGGCAAGCTCAACCGCCTCGTTGTGCCGAAGCAGCACGCCGAGAAGCACTTCCCCCTGAAACGCACGCCGGAGACGACAACGACCACCGGCAAGGGGGTGCTTCTCAACTTCGAGGATGGCGAGGGGAAAGTGTGGAGGTTCCGGTACTCGTATTGGAACAGCAGCCAGAGCTACGTGCTCACCAAGGGATGGAGCCGCTTCGTTCGGGAGAAGGGCCTCGGTGCCGGCGACTCCATCGTGTTCTCCTGCTCGGCGTACGGTCAGGAGAAGCAGTTCTTCATCGACTGCAAGAAGAACAAGACGATGACGAGCTGCCCCGCCGATGACCGCGGCGCCGCAACAGCGTCGCCGCCAGTGTCAGAGCCAACAAAAGGAGAACAAGTCCGTGTTGTGAGGCTGTTCGGCGTCGACATCGCCGGAGAGAAGAGGGGGCGAGCGGCGCCGGTGGAGCAGGAGTTGTTCAAGAGGCAATGCGTGGCACACAGCCAGCACTCTCCAGCCCTAGGTGCCTTCGTCTTATAG
MLOC_56567
Coverage 42%, identity 59%
SEQ ID NO:93
MGVEILSSMVEHSFQYSSGASSATAESGAVGTPPRHLSLPVAIADESLTSRSASSRFKGVVPQPNGRWGAQIYERHARVWLGTFPDQDSAARAYDVASLRYRGGDAAFNFPCVVVEAELAFLAAHSKAEIVDMLRKQTYADELRQGLRRGRGMGVRAQPMPSWARVPLFEKAVTPSDVGKLNRLVVPKQHAEKHFPLKRSPETTTTTGNGVLLNFEDGQGKVWRFRYSYWNSSQSYVLTKGWSRFVREKGLGAGDSIMFSCSAYGQEKQFFIDCKKNTTVNGGKSASPLQVMEIAKAEQVRVVRLFGVDIAGVKRERAATAEQGPQGWFKRQCMAHGQHSPALGDFAL
CDS SEQ ID NO:94
ATGGGGGTGGAGATCCTGAGCTCCATGGTGGAGCACTCCTTCCAGTACTCTTCGGGCGCGTCCTCGGCCACCGCGGAGTCAGGCGCCGTCGGAACACCGCCGAGGCATCTGAGCCTACCTGTCGCCATCGCCGACGAGTCCCTGACCTCACGGTCGGCGTCGTCTCGGTTCAAGGGCGTGGTGCCGCAGCCCAACGGGCGGTGGGGCGCCCAGATCTACGAGCGCCACGCTCGCGTCTGGCTCGGCACGTTCCCAGACCAGGACTCGGCGGCGCGCGCCTACGACGTTGCCTCGCTCAGGTACCGCGGCGGCGACGCCGCCTTCAACTTCCCGTGCGTGGTGGTGGAGGCGGAGCTCGCCTTCCTGGCGGCGCACTCCAAGGCTGAGATCGTTGACATGCTCCGGAAGCAGACCTACGCCGATGAACTCCGCCAGGGACTACGGCGCGGCCGTGGCATGGGGGTGCGCGCGCAGCCGATGCCGTCGTGGGCGCGGGTTCCCCTTTTCGAGAAGGCCGTGACCCCTAGCGATGTCGGCAAGCTCAATCGCCTGGTGGTGCCGAAGCAGCACGCCGAGAAGCACTTCCCCCTGAAGCGCAGCCCGGAGACGACGACCACCACCGGCAACGGCGTACTGCTCAACTTTGAGGACGGCCAGGGAAAAGTGTGGAGGTTCCGGTACTCATATTGGAACAGCAGCCAGAGCTACGTGCTCACCAAAGGCTGGAGCCGCTTCGTCCGGGAGAAGGGCCTCGGCGCCGGTGACTCCATCATGTTCTCCTGCTCGGCGTACGGGCAGGAGAAGCAGTTCTTCATCGACTGCAAGAAGAACACGACCGTGAACGGAGGCAAATCGGCGTCGCCGCTGCAGGTGATGGAGATTGCCAAAGCAGAACAAGTCCGCGTCGTTAGACTGTTCGGTGTCGACATCGCCGGGGTGAAGAGGGAGCGAGCGGCGACGGCGGAGCAAGGCCCGCAGGGGTGGTTCAAGAGGCAATGCATGGCACACGGCCAGCACTCTCCTGCCCTAGGTGACTTCGCCTTATAG
MLOC_75135
Coverage 43%, identity 57%
SEQ ID NO:95
MGMEILSSTVEHCSQYSSSASTATTESGAAGRSTTALSLPVAITDESVTSRSASAQPASSRFKGVVPQPNGRWGSQIYERHARVWLGTFPDQDSAARAYDVASLRYRGRDAATNFPCAAAEAELAFLTAHSKAEIVDMLRKHTYADELRQGLRRGRGMGARAQPTPSWARVPLFEKAVTPSDVGKLNRLVVPKQHAEKHFPLKCTAETTTTTGNGVLLNFEDGEGKVWRFRYSYWNSSQSYVLTKGWSSFVREKGLGAGDSIVFSSSAYGQEKQLFINCKKNTTMNGGKTALPLPVVETAKGEQDHVVKLFGVDIAGVKRVRAATGELGPPELFKRQSVAHGCGRMNYICYSIGTIGPLMLN
CDS SEQ ID NO:96
ATGGGGATGGAAATCCTGAGCTCCACGGTGGAGCACTGCTCCCAGTACTCTTCCAGCGCGTCCACGGCCACAACGGAGTCAGGCGCCGCCGGAAGATCGACGACGGCTCTGAGCCTACCAGTTGCCATCACCGACGAGTCCGTTACCTCGCGGTCGGCATCGGCGCAGCCGGCGTCATCACGGTTCAAGGGCGTGGTGCCGCAGCCCAACGGGCGGTGGGGCTCCCAGATCTACGAGCGCCACGCTCGCGTCTGGCTCGGCACCTTCCCGGATCAGGACTCGGCGGCGCGTGCCTACGACGTTGCCTCGCTCAGGTACCGGGGCCGCGATGCCGCCACCAACTTCCCGTGCGCCGCTGCGGAAGCGGAGCTCGCCTTCCTGACCGCGCACTCCAAGGCCGAGATCGTCGACATGCTCCGGAAGCACACCTACGCCGACGAACTCCGCCAGGGCCTGCGGCGCGGCCGCGGCATGGGTGCGCGCGCGCAGCCGACGCCGTCGTGGGCGCGGGTTCCCCTTTTCGAGAAGGCTGTGACCCCTAGCGATGTCGGCAAGCTCAATCGCCTGGTGGTGCCGAAGCAGCACGCCGAGAAGCACTTCCCCCTGAAGTGCACCGCAGAGACGACGACCACCACCGGCAACGGCGTGCTGCTAAACTTCGAGGATGGTGAGGGGAAGGTGTGGAGGTTCCGGTACTCGTATTGGAACAGTAGCCAGAGCTACGTGCTCACCAAAGGCTGGAGCAGCTTCGTCCGGGAGAAGGGCCTCGGCGCAGGCGACTCCATCGTCTTCTCCTCCTCGGCGTACGGGCAGGAGAAGCAGTTATTCATCAACTGCAAAAAGAACACGACTATGAACGGCGGCAAAACAGCGTTGCCGCTGCCAGTGGTGGAGACTGCCAAAGGAGAACAAGACCACGTCGTTAAGTTGTTCGGTGTTGACATCGCCGGTGTGAAGAGGGTGCGAGCGGCGACGGGGGAGCTAGGCCCGCCGGAGTTGTTCAAGAGACAATCCGTGGCACACGGATGCGGAAGGATGAACTACATTTGCTACTCCATAGGGACAATAGGACCTCTTATGCTCAACTGA
MLOC_63261
Coverage 49%, identity 51%
SEQ ID NO:97
MASSKPTNPEVDNDMECSSPESGAEDAVESSSPVAAPSSRFKGVVPQPNGRWGAQIYEKHSRVWLGTFGDEEAAACAYDVAALRFRGRDAVTNHQRLPAAEGAGWSSTSELAFLADHSKAEIVDMLRKHTYDDELRQGLRRGHGRAQPTPAWAREFLFEKALTPSDVGKLNRLVVPKQHAEKHFPPTTAAAAGSDGKGLLLNFEDGQGKVWRFRYSYWNSSQSYVLTKGWSRFVQEKGLCAGDTVTFSRSAYVMNDTDEQLFIDYKQSSKNDEAADVATADENEAGHVAVKLFGVDIGWAGMAGSSGG
CDS SEQ ID NO:98
ATGGCGTCTAGCAAGCCGACAAACCCCGAGGTAGACAATGACATGGAGTGCTCCTCCCCGGAATCGGGTGCCGAGGACGCCGTGGAGTCGTCGTCGCCGGTGGCAGCGCCATCTTCGCGGTTCAAGGGCGTCGTGCCGCAGCCTAACGGGCGCTGGGGCGCGCAGATCTACGAGAAGCACTCGCGGGTGTGGCTTGGCACGTTCGGGGACGAGGAAGCCGCCGCGTGCGCCTACGACGTGGCCGCGCTCCGCTTCCGCGGCCGCGACGCCGTCACCAACCACCAGCGCCTGCCGGCGGCGGAGGGGGCCGGCTGGTCGTCCACGAGCGAGCTCGCCTTCCTCGCCGACCACTCCAAGGCCGAGATCGTCGACATGCTCCGGAAGCACACCTACGACGACGAGCTCCGGCAGGGCCTGCGCCGCGGCCACGGGCGCGCGCAGCCCACGCCGGCGTGGGCGCGAGAGTTCCTCTTCGAGAAGGCCCTGACCCCGAGCGACGTCGGCAAGCTCAACCGCCTGGTCGTTCCGAAGCAGCACGCCGAGAAGCACTTCCCCCCGACGACGGCGGCGGCCGCCGGAAGCGACGGCAAGGGCTTGCTGCTCAACTTCGAGGACGGCCAAGGGAAGGTGTGGAGGTTCCGGTACTCATACTGGAACAGCAGCCAGAGCTACGTGCTCACCAAGGGCTGGAGCCGCTTCGTCCAAGAAAAGGGCCTCTGCGCCGGCGACACCGTGACGTTCTCCCGGTCGGCGTACGTGATGAATGACACGGATGAGCAGCTCTTCATCGACTACAAGCAGAGTAGCAAGAACGACGAAGCGGCCGACGTAGCCACTGCCGATGAGAATGAGGCCGGCCATGTCGCCGTGAAGCTCTTCGGGGTCGACATTGGCTGGGCTGGGATGGCGGGATCATCAGGTGGGTGA
MLOC_64708
Coverage 49%, identity 51%
SEQ ID NO:99
MLFDSSVSASLGTMRPLVKKLDMLLAPARGYSTLCKRIKEVMHLLKHDVEEISSYLDELTEVEDPPPMAKCWMNEARDLSYDMEDYIDSLLFVPPGHFIKKKKKKKKKGKKKMVIKKRLKWCKQIVFTKQVSDHGIKTSKIIHVNVPRLPNKPKVAKIILQFRIYVQEAIERYDKYRLHHCSTLRRRLLSTGSMLSVPIPYEEAAQIVTDGRMNEFISSLAANNAADQQQLKVVSVLGSGCLGKTTLANVLYDRIGMQFECRAFIRVSKKPDMKRLFRDLLSQFHQKQPLPTSCNELGISDNIIKHLQDKRYLIVIDDLWDLSVWDIIKYAFPKGNHGSRIIITTQIEDVALTCCCDHSEHVFEMKPLNIGHSRELFFNRLFGSESDCLEEFKRVSNEIVDICGGLPLATINIASHLANQETEVSLDLLTDTRDLLRSCLWSNSTSERTKQVLNLSYSNLPDYLKTCLLYLHMYPVGSIIWKDDLVKQLVAEGFIATREGKDQDQEMIEKAAGLCFDALIDRRFIQPIYTKYNNKVLSCTVHEVVHDLIAQKSAEENFIVVADHNRKNIALSHKVRRLSLIFGDTIYAKTPANITKSQIRSFRFFGLFECMPCITEFKVLRVLNLQLSGHRGDNDPIDLTGISELFQLRYLKITSDVCIKLPNQMQKLQYLETLDIMDAPRVTAVPWDIINLPHLLHLTLPVDTYLLDWISSMTDSVISLWTLGKLNYLQHLHLTSSSTRPSYHLERSVEALGYLIGGHGKLKTIVVAHVSSAQNTVVRGAPEVTISWDRMSPPPLLQRFECPHSCFIFYRIPKWVTELGNLCILKIAVKELHMICLGTLRGLHALTDLSLYVETAPIDKIIFDKAGFSVLKYCKLRFAAGIAWLKFEADAMPSLWKLMLVFNAIPRMDQNLVFFHHSRPAMHQRGGAVIIVEHMPGLRVISAKFGGAASDLEYASRTVVSNHPSNPTINMQLVCYSSNGKRSRKRKQQPYDVVKGQPDEYAKRLERPAEKRISTPTKSSLRLHVPEITPKPMQITDNNVQRREHMFDTVLTRGDVGMLNRLVVPKKHAEKYFPLDSSSTRTSKAIVLSFEDPAGKSWFFHYSYRSSSQNYVMFKGWTGFVKEKFLEAGDTVSFSRGVGEATRGRLFIDCQNEQRYMFERVLTASDMESDGCSLMVPVNLVWPHPGLRKTIKGRHAVLQFEDGSGNGKVWPFQFEASGQYYLMKGLNYFVNDRDLAAGYTVSFYRAGTRLFVDSGRKDDKVALGTRSRERIYPKIVRSQ
Oilseed rape (Brassica rapa)
LOC103849927
Coverage 99%, identity 80%
CDS SEQ ID NO:100
ATGTTGTTTGATAGTTCAGTGAGTGCTTCGTTGGGCACCATGAGACCACTTGTCAAGAAGCTCGACATGCTGCTAGCTCCTGCTCGGGGATACAGTACCTTGTGCAAGAGGATCAAGGAAGTGATGCACCTTCTCAAACATGATGTTGAAGAGATAAGCTCCTACCTTGATGAACTTACAGAGGTGGAGGACCCTCCACCAATGGCCAAGTGCTGGATGAACGAGGCACGCGACCTGTCTTATGATATGGAGGATTACATTGATAGCTTGTTATTTGTGCCACCTGGCCATTTCATCAAGAAGAAGAAGAAGAAGAAGAAGAAGGGAAAGAAGAAGATGGTGATAAAGAAGAGGCTCAAGTGGTGCAAACAGATCGTATTCACAAAGCAAGTGTCAGACCATGGTATCAAGACCAGTAAAATCATTCATGTTAATGTCCCTCGTCTTCCCAATAAGCCCAAGGTTGCAAAAATAATATTACAGTTCAGGATCTATGTCCAGGAGGCTATTGAACGGTATGACAAGTATAGGCTTCACCATTGCAGCACCTTGAGGCGTAGATTGTTGTCCACTGGTAGTATGCTTTCAGTGCCAATACCCTATGAAGAAGCTGCCCAAATTGTAACTGATGGCCGGATGAATGAGTTTATCAGCTCACTGGCTGCTAATAATGCAGCAGATCAGCAGCAGCTCAAGGTGGTATCTGTTCTTGGATCTGGGTGTCTAGGTAAAACTACGCTTGCGAATGTGTTGTACGACAGAATTGGGATGCAATTCGAATGCAGAGCTTTCATTCGAGTGTCCAAAAAGCCTGATATGAAGAGACTTTTCCGTGACTTGCTCTCGCAATTCCACCAGAAGCAGCCACTGCCTACCAGTTGTAATGAGCTTGGCATAAGTGACAATATCATCAAACATCTGCAAGATAAAAGGTATCTAATTGTTATTGATGATTTGTGGGATTTATCAGTATGGGATATTATTAAATATGCTTTTCCAAAGGGAAACCATGGAAGCAGAATAATAATAACTACACAGATTGAAGATGTTGCATTAACTTGTTGCTGTGATCACTCGGAGCATGTTTTCGAGATGAAACCTCTCAACATTGGTCACTCAAGAGAGCTATTTTTTAATAGACTTTTTGGTTCTGAAAGTGACTGTCTTGAAGAATTCAAACGAGTTTCAAACGAAATTGTTGATATATGTGGTGGTTTACCGCTAGCAACAATCAACATAGCTAGTCATTTGGCAAACCAGGAGACAGAAGTATCATTGGATTTGCTAACAGACACACGTGATTTGTTGAGGTCCTGTTTGTGGTCAAATTCTACTTCAGAAAGAACAAAACAAGTACTGAACCTCAGCTACAGTAATCTTCCTGATTATCTGAAGACATGTTTGCTGTATCTTCATATGTATCCAGTGGGCTCCATAATCTGGAAGGATGATCTGGTGAAGCAATTGGTGGCTGAAGGGTTTATTGCTACAAGAGAAGGGAAAGACCAAGACCAAGAAATGATAGAGAAAGCTGCAGGACTCTGTTTCGATGCACTTATTGATAGAAGATTCATCCAGCCTATATATACCAAGTACAACAATAAGGTGTTGTCCTGCACGGTTCATGAGGTGGTACATGATCTTATTGCCCAAAAGTCTGCTGAAGAGAATTTCATTGTGGTAGCAGACCACAATCGAAAGAATATAGCACTTTCTCATAAGGTTCGTCGACTATCTCTCATCTTTGGCGACACAATATATGCCAAGACACCAGCAAACATCACAAAGTCACAAATTCGGTCATTCAGATTTTTTGGATTATTCGAGTGTATGCCTTGTATTACAGAGTTCAAGGTTCTCCGTGTTCTAAACCTTCAACTATCTGGTCATCGTGGGGACAATGACCCTATAGACCTCACTGGGATTTCAGAACTGTTTCAGCTGAGATATTTAAAGATTACAAGTGATGTGTGCATAAAACTACCAAATCAAATGCAAAA
ACTGCAATATTTGGAAACGTTGGACATTATGGATGCACCAAGAGTCACTGCTGTTCCATGGGATATTATAAATCTCCCACACCTGTTGCACCTGACTCTTCCTGTTGATACATATCTGCTGGATTGGATTAGCAGCATGACTGACTCCGTCATCAGTCTGTGGACCCTTGGCAAGCTGAACTACCTGCAGCATCTTCATCTTACTAGTTCTTCTACACGTCCTTCATACCATCTGGAGAGAAGTGTGGAGGCTCTGGGTTATTTGATCGGAGGACATGGCAAGCTGAAAACTATAGTAGTCGCTCATGTCTCCTCTGCTCAAAATACTGTGGTTCGTGGCGCCCCAGAAGTAACCATTTCATGGGATCGTATGTCACCTCCCCCCCTTCTCCAGAGATTCGAATGCCCACACAGCTGCTTCATATTTTACCGAATTCCTAAGTGGGTTACAGAACTTGGCAACCTGTGCATTTTGAAGATTGCAGTGAAGGAGCTTCATATGATTTGTCTTGGTACTCTCAGAGGATTGCATGCCCTCACTGATCTGTCGCTGTATGTGGAGACAGCGCCCATTGACAAGATCATCTTTGACAAGGCCGGGTTCTCAGTTCTCAAGTACTGCAAATTGCGCTTCGCGGCTGGTATAGCTTGGCTGAAATTTGAGGCTGATGCAATGCCTAGTCTATGGAAACTGATGCTAGTTTTCAACGCCATCCCACGAATGGACCAAAATCTTGTTTTCTTTCACCACAGCCGACCGGCGATGCATCAACGTGGTGGTGCAGTAATCATTGTCGAGCATATGCCAGGGCTTAGAGTGATCTCCGCAAAATTTGGGGGCGCAGCTTCTGATCTAGAGTATGCTTCGAGGACCGTCGTTAGTAACCATCCAAGCAATCCTACAATCAACATGCAATTGGTGTGTTATAGTTCCAATGGTAAGAGAAGCAGAAAAAGGAAACAACAACCTTACGACGTTGTGAAGGGACAACCAGATGAATACGCCAAGAGATTGGAGAGACCAGCTGAGAAAAGGATTTCAACGCCGACAAAGTCTTCTTTGCGTCTGCATGTTCCAGAAATTACACCAAAACCTATGCAGATTACAGACAACAATGTTCAGAGGAGGGAGCACATGTTCGATACGGTTCTGACTCGGGGGGACGTGGGGATGCTGAACCGGCTGGTGGTACCGAAGAAGCACGCGGAGAAGTACTTCCCGCTGGACAGTTCCTCCACCCGCACCAGCAAGGCCATCGTACTCAGCTTTGAGGACCCTGCTGGGAAGTCATGGTTCTTCCACTACTCCTACCGGAGCAGCAGCCAGAACTACGTCATGTTCAAGGGGTGGACTGGCTTCGTCAAGGAGAAGTTTCTCGAAGCCGGCGACACCGTCTCCTTCAGCCGCGGCGTCGGGGAGGCCACGAGGGGGAGGCTCTTCATCGACTGTCAAAATGAGCAGAGGTACATGTTCGAGCGAGTGCTGACGGCGAGTGATATGGAGTCGGATGGCTGCTCGCTGATGGTCCCAGTGAACTTGGTGTGGCCGCACCCCGGCCTCCGCAAGACGATCAAGGGGAGGCACGCCGTGCTGCAGTTTGAGGACGGCAGCGGCAACGGGAAGGTGTGGCCATTTCAGTTTGAGGCCTCCGGCCAATACTATCTCATGAAGGGCTTGAACTACTTTGTTAACGACCGCGACCTTGCGGCTGGCTATACCGTCTCCTTCTACCGCGCCGGCACGCGGTTGTTCGTCGACTCCGGGCGTAAAGATGACAAAGTAGCCTTGGGAACCAGAAGCCGCGAAAGGATCTATCCTAAGATCGTGCGGTCGCAGTAG
LOC103849927
SEQ ID NO:101
msgnhysrdihhntpsvhhhqnyavvdreylfeksltpsdvgklnrlvipkqhaekhfplnnagddvaaaettekgmlltfedesgkcwkfrysywnssqsyvltkgwsryvkdkhlhagdvvffqrhrfdlhrvfigwrkrgevssptavsvvsqearvnttaywsglttpyrqvhastssypnihqeyshygavaeiptvvtgssrtvrlfgvnlechgdvvetppcpdgyngqhfyyystpdpmnisfageameqvgdgrr
Bra034828
Coverage 100%, identity 79%
SEQ ID NO:102
MSVNHYSNTLSSHNHHNEHKESLFEKSLTPSDVGKLNRLVIPKQHAERYLPLNNCGGGGDVTAESTEKGVLLSFEDESGKSWKFRYSYWNSSQSYVLTKGWSRYVKDKHLNAGDVVLFQRHRFDIHRLFIGWRRRGEASSSSAVSAVTQDPRANTTAYWNGLTTPYRQVHASTSSYPNNIHQEYSHYGPVAETPTVAAGSSKTVRLFGVNLECHSDVVEPPPCPDAYNGQHIYYYSTPHPMNISFAGEAMEQVGDGRG
CDS SEQ ID NO:103
ATGTCAGTCAACCATTACTCAAACACTCTCTCGTCGCACAATCACCACAACGAACATAAAGAGTCTTTGTTCGAGAAGTCACTCACGCCAAGCGATGTTGGAAAGCTAAACCGTTTAGTCATACCAAAACAACACGCCGAGAGATACCTCCCTCTCAATAATTGCGGCGGCGGCGGCGACGTGACGGCGGAGTCGACGGAGAAAGGGGTGCTTCTCAGCTTCGAGGACGAGTCGGGAAAATCTTGGAAATTCAGATACTCATATTGGAACAGTAGTCAAAGCTACGTGTTGACCAAAGGATGGAGCAGGTACGTCAAAGACAAGCACCTCAACGCAGGGGACGTCGTTTTATTTCAACGGCACCGTTTTGATATTCATAGACTCTTCATTGGCTGGAGGAGACGCGGAGAGGCTTCTTCCTCTTCCGCCGTTTCCGCCGTGACTCAAGATCCTCGAGCTAACACGACGGCGTACTGGAACGGTTTGACTACACCTTATCGTCAAGTACACGCGTCAACTAGTTCTTACCCTAACAACATCCACCAAGAGTATTCACATTATGGCCCTGTTGCTGAGACACCGACGGTAGCTGCAGGGAGCTCGAAGACGGTGAGGCTATTTGGAGTTAACCTCGAATGTCACAGTGACGTTGTGGAGCCACCACCGTGTCCTGACGCCTACAACGGCCAACACATTTACTATTACTCAACTCCACATCCCATGAATATCTCATTTGCTGGAGAAGCAATGGAGCAGGTAGGAGATGGACGAGGTTGA
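As a quick consistency check on a protein/CDS pair in this listing, the following minimal Python sketch translates the first seven codons of the CDS of SEQ ID NO:103 and compares the result with the start of the protein of SEQ ID NO:102. The function name `translate` and the use of the standard nuclear genetic code are assumptions made for illustration; they are not part of the original listing.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nucleotides of the CDS of SEQ ID NO:103 and first 7 residues of SEQ ID NO:102.
cds_start = "ATGTCAGTCAACCATTACTCA"
protein_start = "MSVNHYS"
print(translate(cds_start) == protein_start)
```

The same function can be applied to a full CDS from the listing; because it stops at the first stop codon, the returned string can be compared directly with the corresponding protein entry.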
Bra005886
Coverage 100%, identity 79%
SEQ ID NO:104
MSVNHYSTDHHQVHHHHTLFLQNLHTTDTSEPTTTAATSLREDQKEYLFEKSLTPSDVGKLNRLVIPKQHAEKYFPLNTIISNNAEEKGMLLSFEDESGKCWRFRYSYWNSSQSYVLTKGWSRYVKDKQLDPADVVFFQRQRSDSRRLFIGWRRRGQGSSSAANTTSYSSSMTAPPYSNYSNRPAHSEYSHYGAAVATATETHFIPSSSAVGSSRTVRLFGVNLECQMDEDEGDDSVATAAAAECPRQDSYYDQNMYNYYTPHSSAS
CDS SEQ ID NO:105
ATGTCAGTCAACCATTACTCCACGGACCACCACCAGGTCCACCACCACCACACTCTCTTCTTGCAGAACCTCCACACCACCGACACATCGGAGCCAACCACAACCGCCGCCACATCACTCCGCGAAGACCAGAAAGAGTATCTCTTCGAGAAATCTCTCACACCAAGCGACGTTGGCAAACTCAACCGTCTCGTTATACCAAAACAGCACGCGGAGAAGTACTTCCCTCTCAACACCATCATCTCCAATAATGCTGAGGAGAAAGGGATGCTTCTAAGCTTCGAAGACGAGTCAGGCAAGTGCTGGAGGTTCAGATACTCTTACTGGAACAGCAGTCAAAGCTACGTGTTGACTAAAGGATGGAGCAGATACGTCAAAGACAAACAGCTCGACCCAGCCGATGTTGTTTTCTTCCAACGTCAACGTTCTGATTCCCGGAGACTCTTTATTGGCTGGCGTAGACGCGGTCAAGGCTCCTCCTCCGCCGCGAATACGACGTCGTATTCTAGTTCCATGACTGCTCCACCGTATAGTAATTACTCTAATCGTCCTGCTCACTCAGAGTATTCCCACTATGGCGCCGCCGTAGCAACAGCGACGGAGACGCACTTCATACCATCGTCTTCCGCCGTCGGGAGCTCGAGGACGGTGAGGCTTTTTGGTGTGAATTTGGAGTGTCAAATGGATGAAGACGAAGGAGATGATTCGGTTGCCACGGCAGCCGCCGCTGAGTGTCCTCGTCAGGACAGCTACTACGACCAAAACATGTACAATTATTACACTCCTCACTCCTCAGCCTCATAA
Bra005301
Coverage 100%, identity 58%
SEQ ID NO:106
MSINQYSSDFNYHSLMWQQQQHRHHHHQNDVAEEKEALFEKPLTPSDVGKLNRLVIPKQHAERYFPLAAAAADAMEKGLLLCFEDEEGKPWRFRYSYWNSSQSYVLTKGWSRYVKEKQLDAGDVILFHRHRVDGGRFFIGWRRRGNSSSSSDSYRHLQSNASLQYYPHAGVQAVESQRGNSKTLRLFGVNMECQLDSDLPDPSTPDGSTICPTSHDQFHLYPQQHYPPPYYMDISFTGDVHQTRSPQG
CDS SEQ ID NO:107
ATGTCAATAAACCAATACTCAAGCGATTTCAACTACCACTCTCTCATGTGGCAACAACAGCAGCACCGCCACCACCACCATCAAAACGACGTCGCGGAGGAAAAAGAAGCTCTTTTCGAGAAACCCTTAACCCCAAGTGACGTCGGAAAACTCAACCGCCTCGTCATCCCAAAACAGCACGCCGAGAGATACTTCCCTCTCGCAGCAGCCGCCGCAGACGCGATGGAGAAGGGATTACTTCTCTGCTTCGAGGACGAGGAAGGTAAGCCATGGAGATTCAGATACTCGTATTGGAACAGTAGCCAGAGTTATGTCTTGACCAAAGGATGGAGCAGATACGTCAAGGAGAAGCAGCTCGACGCCGGTGACGTCATTCTCTTCCACCGCCACCGTGTTGACGGAGGAAGATTCTTCATTGGCTGGAGAAGACGCGGCAACTCTTCCTCCTCTTCCGACTCTTATCGCCATCTTCAGTCCAATGCCTCGCTCCAATATTATCCTCATGCAGGAGTTCAAGCGGTGGAGAGCCAGAGAGGGAATTCGAAGACATTAAGACTGTTCGGAGTGAACATGGAGTGTCAGCTAGACTCCGACTTGCCCGATCCATCTACACCAGACGGTTCCACCATATGTCCGACCAGTCACGACCAGTTTCATCTCTACCCTCAACAACACTATCCTCCTCCGTACTACATGGACATAAGTTTCACAGGAGATGTGCACCAGACGAGAAGCCCACAAGGATAA
Bra017262
Coverage 92%, identity 56%
SEQ ID NO:108
MSINQYSSEFYYHSLMWQQQQQHHHQNEVVEEKEALFEKPLTPSDVGKLNRLVIPKQHAERYFPLAAAAVDAVEKGLLLCFEDEEGKPWRFRYSYWNSSQSYVLTKGWSRYVKEKQLDAGDVVLFHRHRADGGRFFIGWRRRGDSSSSSDSYRNLQSNSSLQYYPHAGAQAVENQRGNSKTLRLFGVNMECQIDSDWSEPSTPDGFTTCPTNHDQFPIYPEHFPPPYYMDVSFTGDVHQTSSQQG
CDS SEQ ID NO:109
ATGTCAATAAATCAATATTCAAGCGAGTTCTACTACCATTCTCTCATGTGGCAACAACAGCAGCAACACCACCATCAAAACGAAGTCGTGGAGGAAAAAGAAGCTCTTTTCGAGAAACCCTTAACCCCAAGTGACGTCGGAAAACTAAACCGCCTAGTCATCCCTAAACAGCACGCCGAGAGATACTTCCCTCTCGCCGCCGCCGCGGTAGACGCCGTGGAGAAGGGATTACTCCTCTGCTTCGAGGACGAGGAAGGTAAGCCATGGAGATTCAGATACTCTTATTGGAATAGTAGCCAGAGTTACGTCTTGACCAAAGGATGGAGCAGATATGTTAAAGAGAAGCAACTTGACGCCGGCGACGTTGTTCTCTTTCATCGCCACCGTGCTGACGGTGGAAGATTCTTCATTGGCTGGAGAAGACGCGGCGACTCTTCCTCCTCCTCCGACTCTTATCGCAATCTTCAATCTAATTCCTCGCTCCAATATTATCCTCATGCAGGGGCTCAAGCGGTGGAGAACCAGAGAGGTAACTCCAAGACATTGAGACTTTTTGGAGTGAACATGGAGTGCCAGATAGACTCAGACTGGTCCGAGCCATCCACACCTGACGGTTTTACCACATGTCCAACCAATCACGACCAGTTTCCTATCTACCCTGAACACTTTCCTCCTCCGTACTACATGGACGTAAGTTTCACAGGAGATGTGCACCAGACGAGTAGCCAACAAGGATAG
Bra000434
Coverage 96%, identity 47%
SEQ ID NO:110
MMTNLSLAREGEEEEEEAGAKKPTEEVEREHMFDKVVTPSDVGKLNRLVIPKQHAERYFPLDSSTNEKGLILNFEDLTGKSWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDIVSFLRCVGDTGRDSRLFIDWRRRPKVPDYTTSTSHFPAGAMFPRFYSFQTATTSTSYNPYNHQQPRHHHSGYCYPQIPREFGYGYVVRSVDQRAVVADPLVIESVPVMMHGGARVNQAAVGTAGKRLRLFGVDMECGESGGTNSTEEESSSSGGSLPRGGASPSSSMFQLRLGNSSEDDHLFKKGKSSLPFNLDQ
CDS SEQ ID NO:111
ATGATGACAAATTTGTCTCTTGCAAGAGAAGGAGAAGAAGAAGAAGAAGAGGCAGGAGCAAAGAAGCCCACAGAAGAAGTGGAGAGAGAGCACATGTTCGACAAAGTGGTGACTCCAAGTGACGTCGGGAAACTAAACCGACTCGTGATCCCAAAGCAACACGCGGAGAGATACTTCCCTTTAGATTCATCCACAAACGAGAAGGGTTTGATTCTAAACTTCGAAGATCTCACGGGAAAGTCATGGAGGTTCCGTTACTCTTACTGGAACAGCAGTCAGAGCTATGTCATGACTAAAGGTTGGAGCCGTTTCGTTAAAGACAAGAAGCTAGACGCTGGAGATATTGTCTCTTTCCTGAGATGTGTCGGAGACACAGGAAGGGACAGCCGCTTGTTTATCGATTGGAGGAGACGACCTAAAGTCCCTGACTACACGACATCGACTTCTCACTTTCCTGCCGGAGCTATGTTCCCTAGGTTTTACAGTTTTCAGACAGCAACTACTTCCACAAGTTACAATCCCTATAATCATCAGCAGCCACGTCATCATCACAGTGGTTACTGTTATCCTCAAATCCCGAGAGAATTTGGATATGGGTATGTCGTTAGGTCAGTAGATCAGAGGGCGGTGGTGGCTGATCCGTTAGTGATCGAATCTGTGCCGGTGATGATGCACGGAGGAGCTCGAGTGAACCAGGCGGCTGTTGGAACGGCCGGGAAAAGGCTGAGGCTTTTTGGAGTCGATATGGAATGTGGCGAGAGTGGAGGAACAAACAGTACGGAGGAAGAATCTTCATCTTCCGGTGGGAGTTTGCCACGTGGCGGTGCTTCTCCGTCTTCCTCTATGTTTCAGCTGAGGCTTGGAAACAGCAGTGAAGATGATCACTTATTTAAGAAAGGAAAGTCTTCATTGCCTTTTAATTTGGATCAATAA
Bra040478
Coverage 96%, identity 48%
SEQ ID NO:112
MMTNLSLAREGEAQVKKPIEEVEREHMFDKVVTPSDVGKLNRLVIPKQHAERYFPLDSSSNEKGLLLNFEDLTGKSWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDIVSFQRCVGDSRLFIDWRRRPKVPDYPTSTAHFAAGAMFPRFYSFPTATTSTCYDLYNHQPPRHHHIGYGYPQIPREFGYGYFVRSVDQRAVVADPLVIESVPVMMRGGARVSQEVVGTAGKRLRLFGVDMEEESSSSGGSLPRAGGGGASSSSSLFQLRLGSSCEDDHFSKKGKSSLPFDLDQ
CDS SEQ ID NO:113
ATGATGACCAACTTGTCTCTTGCAAGGGAAGGAGAAGCACAAGTAAAGAAGCCCATAGAAGAAGTTGAGAGAGAGCACATGTTCGACAAAGTGGTGACTCCAAGCGACGTAGGGAAACTAAACAGACTCGTGATCCCAAAGCAACACGCAGAGAGATACTTCCCTCTAGATTCATCCTCAAACGAGAAAGGTTTGCTTCTAAACTTTGAAGATCTAACAGGAAAGTCATGGAGGTTCCGTTACTCTTACTGGAACAGTAGCCAGAGCTATGTCATGACTAAAGGTTGGAGTCGTTTCGTTAAAGACAAGAAGCTTGACGCCGGAGATATTGTCTCTTTCCAGAGATGTGTCGGAGACAGCCGCTTGTTTATCGATTGGAGGAGACGACCTAAAGTCCCTGACTATCCGACATCGACTGCTCACTTTGCTGCAGGAGCTATGTTCCCTAGGTTTTACAGTTTTCCGACAGCAACTACTTCGACATGTTACGATCTGTACAATCATCAGCCGCCACGTCATCATCACATTGGTTACGGTTATCCACAGATTCCGAGAGAATTTGGATACGGGTATTTCGTTAGGTCAGTGGACCAGAGAGCGGTGGTGGCTGATCCGTTGGTGATCGAATCTGTGCCGGTGATGATGCGCGGAGGAGCTCGAGTTAGTCAGGAGGTTGTTGGAACGGCCGGGAAGAGGCTGAGGCTTTTTGGAGTCGATATGGAGGAAGAATCTTCATCTTCCGGTGGGAGTTTGCCGCGTGCCGGAGGTGGCGGTGCTTCTTCATCTTCCTCTTTGTTTCAGCTGAGACTTGGGAGCAGCTGTGAAGATGATCACTTCTCTAAGAAAGGAAAGTCTTCATTGCCTTTTGATTTGGATCAATAA
Bra004501
Coverage 74%, identity 45%
SEQ ID NO:114
MMMTNLSLSREGEEEEEEEQEEAKKPMEEVEREHMFDKVVTPSDVGKLNRLVIPKQYAERYFPLDSSTNEKGLLLNFEDLAGKSWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDIVSFQRCVGDSGRDSRLFIDWRRRPKVPDHPTSIAHFAAGSMFPRFYSFPTATSYNLYNYQQPRHHHHSGYNYPQIPREFGYGYLVDQRAVVADPLVIESVPVMMHGGAQVSQAVVGTAGKRLRLFGVDMEEESSSSGGSLPRGDASPSSSLFQLRLGSSSEDDHFSKKGKSSLPFDLDQ
CDS SEQ ID NO:133
ATGATGATGACAAACTTGTCTCTTTCAAGAGAAGGAGAAGAGGAGGAAGAAGAAGAACAAGAAGAGGCCAAGAAGCCCATGGAAGAAGTAGAGAGAGAGCACATGTTCGACAAAGTGGTGACTCCAAGCGATGTTGGTAAACTAAACCGGCTCGTGATCCCAAAGCAATACGCAGAGAGATACTTCCCTTTAGATTCATCCACAAACGAGAAAGGTTTGCTTCTAAACTTCGAAGATCTCGCAGGAAAGTCATGGAGGTTCCGTTACTCTTACTGGAACAGTAGTCAGAGCTATGTCATGACTAAAGGTTGGAGCCGTTTCGTTAAAGACAAAAAGCTAGACGCCGGAGATATTGTCTCTTTCCAGAGATGTGTCGGAGATTCAGGAAGAGACAGCCGCTTGTTTATTGATTGGAGGAGAAGACCTAAAGTTCCTGACCATCCGACATCGATTGCTCACTTTGCTGCCGGATCTATGTTTCCTAGGTTTTACAGTTTTCCGACAGCAACTAGTTACAATCTTTACAACTATCAGCAGCCACGTCATCATCATCACAGTGGTTATAATTATCCTCAAATTCCGAGAGAATTTGGATACGGGTACTTGGTGGATCAAAGAGCCGTGGTGGCTGATCCGTTGGTGATTGAATCTGTGCCGGTGATGATGCACGGAGGAGCTCAAGTTAGTCAGGCGGTTGTTGGAACGGCCGGGAAGAGGCTGAGGCTTTTTGGAGTCGATATGGAGGAAGAATCTTCATCTTCCGGTGGGAGTTTGCCACGTGGTGACGCTTCTCCGTCTTCCTCTTTGTTTCAGCTGAGACTTGGAAGCAGCAGTGAAGATGATCACTTCTCTAAGAAAGGAAAGTCCTCATTGCCTTTTGATTTGGATCAATAA
Bra003482
Coverage 79%, identity 44%
SEQ ID NO:115
MNQEEENPVEKASSMEREHMFEKVVTPSDVGKLNRLVIPKQHAERYFPLDNNSDSSKGLLLNFEDRTGNSWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDIVSFQRDPGNKDKLFIDWRRRPKIPDHHHQFAGAMFPRFYSFSHPQNLYHRYQQDLGIGYYVSSMERNDPTAVIESVPLIMQRRAAHVAAIPSSRGEKRLRLFGVDMECGGGGGSVNSTEEESSSSGGGGGVSMASVGSLLQLRLVSSDDESLVAMEAASVDEDHHLFTKKGKSSLSFDLDRK
CDS SEQ ID NO:116
ATGAATCAAGAAGAAGAGAATCCTGTGGAAAAAGCCTCTTCAATGGAGAGAGAGCACATGTTTGAAAAAGTAGTAACACCAAGCGACGTAGGCAAACTAAACCGACTCGTGATCCCAAAGCAACACGCGGAGAGATACTTCCCTTTAGACAACAATTCTGACAGCAGCAAAGGTTTGCTTCTAAACTTCGAAGACCGAACAGGAAACTCATGGAGATTCCGTTACTCTTACTGGAACAGTAGCCAGAGTTATGTCATGACAAAAGGTTGGAGCCGCTTCGTCAAAGACAAGAAGCTTGATGCTGGCGACATCGTTTCTTTTCAGAGAGATCCTGGTAATAAAGACAAGCTTTTCATTGATTGGAGGAGACGACCAAAGATTCCAGATCATCATCATCAATTCGCTGGAGCTATGTTCCCTAGGTTTTACTCTTTCTCTCATCCTCAGAACCTTTATCATCGATATCAACAAGATCTTGGAATTGGGTATTATGTGAGTTCAATGGAGAGAAATGATCCAACGGCTGTAATTGAATCTGTGCCGTTGATAATGCAAAGGAGAGCAGCACACGTGGCTGCTATACCTTCATCAAGAGGAGAGAAGAGGTTAAGGCTGTTTGGAGTGGACATGGAGTGCGGCGGCGGCGGAGGAAGTGTGAATAGCACGGAGGAAGAGTCGTCGTCTTCCGGTGGTGGCGGCGGCGTTTCTATGGCTAGTGTTGGTTCTCTTCTCCAATTGAGGCTAGTGAGCAGTGATGATGAGTCTTTGGTAGCAATGGAAGCTGCAAGTGTCGATGAGGATCATCACTTGTTTACAAAGAAAGGAAAGTCTTCTTTGTCTTTCGATTTGGATAGAAAATGA
Bra007646
Coverage 74%, identity 45%
SEQ ID NO:117
MNQENKKPLEEASTSMERENMFDKVVTPSDVGKLNRLVIPKQHAERYFPLDNSSTNNKGLLLDFEDRTGSSWRFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDIVSFQRDPCNKDKLYIDWRRRPKIPDHHQFAGAMFPRFYSFPHPQMPTSFESSHNLYHHRFQRDLGIGYYPTAVIESVPVIMQRREAQVANMASSRGEKRLRLFGVDVECGGGGGGSVNSTEEESSSSGGSMSRGGVSMAGVGSLLQLRLVSSDDESLVAMEGATVDEDHHLFTTKKGKSSLSFDLDI
CDS SEQ ID NO:118
ATGAATCAAGAAAACAAGAAGCCTTTGGAAGAAGCTTCGACTTCAATGGAGAGAGAGAACATGTTCGACAAAGTAGTAACACCAAGCGACGTAGGGAAACTAAACCGACTCGTGATCCCAAAGCAACACGCAGAGAGATACTTCCCTTTAGACAACTCCTCAACAAACAACAAAGGGTTGCTTCTAGACTTCGAAGACCGTACAGGAAGCTCATGGAGATTCCGTTACTCTTACTGGAACAGTAGCCAAAGTTATGTCATGACAAAAGGTTGGAGCCGTTTTGTCAAAGACAAGAAGCTTGATGCTGGTGACATCGTGTCTTTTCAAAGAGATCCCTGTAATAAAGACAAGCTTTACATAGATTGGAGGAGACGACCAAAGATTCCAGATCATCATCAGTTCGCCGGAGCTATGTTCCCTAGGTTTTACTCTTTCCCTCACCCTCAGATGCCGACAAGTTTTGAAAGTAGTCACAACCTTTATCATCATCGGTTTCAACGAGATCTTGGAATTGGGTATTATCCAACGGCTGTGATTGAATCTGTGCCGGTGATAATGCAAAGGAGAGAAGCACAAGTGGCTAATATGGCTTCATCAAGAGGAGAGAAGAGGTTAAGGCTGTTTGGAGTGGACGTGGAGTGCGGCGGCGGAGGAGGAGGAAGTGTGAATAGCACGGAGGAAGAGTCGTCGTCTTCCGGTGGTAGTATGTCACGTGGCGGCGTTTCTATGGCTGGTGTTGGTTCTCTCCTTCAGTTGAGGTTAGTGAGCAGTGATGATGAGTCTTTAGTAGCGATGGAAGGTGCTACTGTCGATGAGGATCATCACTTGTTTACAACTAAGAAAGGAAAGTCTTCTTTGTCTTTCGATTTGGATATATGA
Bra014415
Coverage 48%, identity 60%
SEQ ID NO:119
MERKSNDLERSENIDSQNKKMNLEEERPVQEASSMEREHMFDKVVTPSDVGKLNRLVIPKQHAERYFPLDNNSSDNNKGLLLNFEDRIGILWSFRYSYWNSSQSYVMTKGWSRFVKDKKLDAGDIVSFHRGSCNKDKLFIDWKRRPKIPDHQVVGAMFPRFYSYPYPQIQASYERHNLYHRYQRDIGIGYYVRSMERYDPTAVIESVPVIMQRRAHVATMASSRGEKRLRLFGVDMECVRGGRGGGGSVNSTEEESSTSGGSISRGGVSMAGVGSPLQLRLVSSDGDDQSLVARGAARVDEDHHLFTKKGKSSLSFDLDK
CDS SEQ ID NO:120
ATGGAGAGGAAGTCCAATGATCTTGAGAGATCTGAGAATATTGATTCTCAAAACAAGAAGATGAATCTAGAAGAAGAGAGGCCTGTACAAGAAGCTTCTTCGATGGAGAGAGAGCACATGTTCGACAAAGTAGTAACACCAAGCGACGTTGGGAAACTAAACCGGCTGGTGATCCCAAAGCAACACGCAGAGCGATACTTCCCTTTAGACAATAATTCCTCAGACAACAACAAAGGTTTGCTTCTAAACTTCGAAGATCGAATAGGAATCTTATGGAGTTTCCGTTACTCCTACTGGAACAGTAGCCAAAGTTATGTAATGACTAAAGGCTGGAGCCGTTTCGTCAAAGACAAGAAGCTTGATGCTGGCGACATAGTTTCTTTTCATAGAGGTTCTTGTAATAAAGACAAGCTTTTCATTGATTGGAAGAGACGACCAAAGATTCCTGATCACCAAGTCGTCGGAGCTATGTTCCCTAGGTTTTACTCTTACCCTTATCCTCAGATACAGGCTAGTTATGAACGTCACAACCTTTATCATCGATATCAACGAGATATAGGAATTGGGTATTATGTGAGGTCAATGGAGAGATATGATCCAACGGCTGTAATTGAATCTGTGCCGGTGATAATGCAAAGGAGAGCACATGTGGCTACTATGGCTTCATCAAGAGGAGAGAAGAGGTTAAGGCTTTTTGGAGTGGATATGGAGTGCGTCAGAGGCGGCCGAGGAGGAGGAGGAAGTGTGAATAGCACGGAGGAAGAGTCTTCGACTTCCGGTGGTAGTATCTCACGTGGCGGCGTTTCTATGGCTGGTGTTGGCTCTCCACTCCAGTTGAGGTTAGTGAGCAGTGACGGTGATGATCAGTCTCTAGTAGCTAGGGGAGCTGCTAGGGTTGATGAGGATCATCACTTGTTTACAAAGAAAGGAAAGTCTTCTTTGTCTTTCGATTTGGATAAATGA
Bra038346
Coverage 51%, identity 57%
SEQ ID NO:121
MVFSCIDESSSTSESFSPATATATATATKFSAPPLPPLRLNRMRSGGSNVVLDSKNGVDIDSRKLSSSKYKGVVPQPNGRWGAQIYVKHQRVWLGTFCDEEEAAHSYDIAARKFRGRDAVVNFKTFLASEDDNGELCFLEAHSKAEIVDMLRKHTYADELAQSNKRSGANTNTNTTQSHTVSRTREVLFEKVVTPSDVGKLNRLVIPKQHAEKYFPLPSLSVTKGVLINFEDVTGKVWRFRYSYWNSSQSYVLTKGWSRFVKEKNLRAGDVVTFERSTGSDRQLYIDWKIRSGPSKNPVQVVVRLFGVDIFNVTSAKPSNVVDACGGKRSRDVDMFALRCSKKHAIINAL
CDS SEQ ID NO:122
ATGGTATTCAGTTGCATAGACGAGAGCTCTTCCACTTCAGAATCTTTTTCACCCGCAACCGCAACCGCAACCGCAACCGCCACAAAGTTCTCTGCTCCTCCGCTTCCACCGTTACGCCTCAACCGGATGAGAAGCGGTGGAAGCAACGTCGTGTTGGATTCAAAGAATGGCGTAGATATTGATTCACGGAAGCTATCGTCGTCAAAGTACAAAGGCGTGGTTCCTCAGCCCAACGGAAGATGGGGAGCTCAGATTTACGTGAAGCACCAGCGAGTTTGGCTGGGCACTTTCTGCGATGAAGAGGAAGCTGCTCACTCCTACGACATAGCCGCCCGTAAATTCCGTGGCCGTGACGCCGTTGTCAACTTCAAAACCTTCCTCGCCTCAGAGGACGACAACGGCGAGTTATGTTTCCTTGAAGCTCACTCCAAGGCCGAGATCGTCGACATGTTGAGGAAACACACTTACGCTGACGAGCTTGCGCAGAGCAATAAACGCAGCGGAGCGAATACGAATACGAATACGACTCAAAGCCACACCGTTTCGAGAACACGTGAAGTGCTTTTCGAGAAGGTTGTCACGCCTAGCGACGTTGGTAAGCTAAACCGCCTCGTGATACCTAAACAGCACGCGGAGAAATATTTTCCGTTACCGTCACTGTCGGTGACTAAAGGCGTTCTGATCAACTTCGAAGACGTGACGGGTAAGGTGTGGCGGTTCCGTTACTCATACTGGAACAGTAGTCAAAGTTACGTGTTGACCAAGGGATGGAGTCGGTTCGTTAAGGAGAAGAATCTCCGAGCCGGTGATGTCGTTACTTTCGAGAGATCGACCGGTTCAGACCGGCAGCTTTATATTGATTGGAAAATCCGGTCTGGTCCGAGCAAAAACCCTGTTCAGGTTGTGGTTAGGCTTTTCGGAGTTGACATCTTCAACGTGACAAGCGCGAAGCCGAGCAACGTTGTAGACGCGTGCGGTGGAAAGAGATCTCGGGATGTTGATATGTTTGCGCTACGGTGTTCCAAAAAACACGCTATAATCAATGCTTTGTGA
Maize (Zea mays)
GRMZM2G053008
Coverage 74%, identity 47%
SEQ ID NO:123
MAASPSSPLTAPPEPVTPPSPWTITDGAISGTLPAAEAFAVHYPGYPSSPARAARTLGGLPGLAKVRSSDPGARLELRFRPEDPYCHPAFGQSRASTGLLLRLSKRKGAAAPCAHVVARVRTAYYFEGMADFQHVVPVHAAQTRKRKHSDSQNDNENFGSDKTGHDEADGDVMMLVPPLFSVKDRPTKIALVPSSNAISKTMHRGVVQERWEMNVGPTLALPFNTQVVPEKINWEDHIRKNSVEWGWQMAVCKLFDERPVWPRQSLYERFLDDNVHVSQNQFKRLLFRAGYYFSTGPFGKFWIRRGYDPRKDSESQIYQRIDFRMPPELRYLLRLKNSESRKWADMCKLETMPSQSFIYLQLYELKDDFIQAEIRKPSYQSVCSRSTGWFSKPMIKTLRLQVSIRLLSLLHNEEAKNLLRNAHELIERSKKQEALSRSELSIEYNDADQVSAAHTGTEDQVGPNNSDSEDVDDEEEEEELEGYDSPPMADDIHEFTLGDSYAFGEGFSNGYLEEVLRSLPLQEDGQKKLCDAPINADASD
CDS SEQ ID NO:124
ATGGCCGCCTCGCCCTCTTCACCCTTGACAGCGCCGCCAGAGCCGGTGACCCCGCCGTCCCCATGGACCATCACAGACGGAGCCATCTCTGGCACGCTCCCAGCAGCCGAGGCCTTCGCAGTGCACTACCCGGGCTACCCCTCCTCTCCCGCCCGCGCCGCCCGCACCCTCGGCGGTCTCCCCGGCCTCGCCAAGGTCCGGAGTTCCGATCCCGGCGCCCGCCTCGAGCTCCGCTTCCGCCCCGAGGACCCCTACTGCCATCCAGCCTTTGGCCAGTCCCGCGCCTCCACTGGCCTTCTGCTGCGCCTCTCCAAGCGCAAAGGAGCTGCGGCACCTTGTGCCCATGTGGTCGCTCGTGTCCGGACTGCTTACTACTTCGAAGGTATGGCAGATTTTCAACATGTTGTTCCAGTGCATGCTGCACAAACAAGAAAAAGAAAACACTCAGATTCTCAAAATGATAATGAGAATTTTGGTAGTGATAAGACAGGACATGATGAAGCAGATGGAGATGTCATGATGTTGGTACCCCCTCTCTTTTCAGTGAAGGATAGGCCAACAAAGATAGCGCTTGTACCATCGTCCAATGCCATATCTAAAACCATGCACAGGGGAGTTGTACAAGAACGGTGGGAGATGAATGTTGGACCAACTCTGGCGCTTCCGTTCAACACTCAAGTTGTCCCGGAGAAGATTAATTGGGAAGACCACATTAGAAAGAATTCTGTAGAATGGGGTTGGCAAATGGCTGTTTGCAAATTGTTTGATGAGCGCCCTGTGTGGCCAAGGCAATCACTTTATGAGCGGTTCCTTGATGATAATGTGCATGTCTCTCAAAACCAATTCAAAAGGCTTCTGTTTAGAGCTGGATACTACTTCTCTACTGGACCCTTTGGAAAATTTTGGATCAGAAGAGGATATGACCCTCGTAAAGACTCTGAGTCACAAATATATCAGAGAATTGATTTTCGCATGCCTCCCGAGCTACGATATCTTCTAAGGCTGAAGAATTCTGAGTCTCGAAAGTGGGCAGATATGTGCAAGCTTGAAACAATGCCATCACAGAGTTTCATCTACCTGCAATTATATGAACTGAAGGATGATTTTATTCAAGCAGAAATTCGAAAACCTTCTTATCAATCAGTTTGTTCACGTTCTACAGGATGGTTTTCTAAGCCAATGATCAAAACCCTGAGGTTGCAAGTGAGCATAAGGCTCCTCTCTTTATTGCATAATGAAGAGGCTAAAAACTTGTTGAGGAATGCCCATGAGCTTATTGAAAGGTCCAAGAAGCAGGAAGCCCTTTCGAGATCTGAGCTGTCAATAGAATATAATGATGCTGATCAAGTTTCTGCCGCACATACTGGAACTGAGGATCAAGTCGGCCCTAACAACTCTGATAGTGAAGATGTGGATGATGAAGAAGAGGAAGAGGAATTGGAGGGTTATGATTCTCCACCTATGGCAGATGATATTCATGAGTTCACCTTAGGTGATTCCTATGCATTTGGTGAAGGCTTCTCGAATGGATACCTCGAAGAAGTACTGCGCAGCTTGCCATTGCAGGAAGACGGCCAAAAGAAATTATGTGATGCTCCTATCAACGCTGATGCAAGTGATGGAGAGTTTGAAATTTACGAACAGCCCAGTGATGATGAAGATTCTGATGGCTAG
GRMZM2G102059_T01
Coverage 47%, identity 62%
SEQ ID NO:125
MEFASSSSRFSREEDEEEEQEEEEEEEEASPREIPFMTAAATADTGAAASSSSPSAAASSGPAAAPRSSDGAGASGSGGGGSDDVQVIEKEHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDAAANEKGQLLSFEDRAGKLWRFRYSYWNSSQSYVMTKGWSRFVKEKRLDAGDTVSFCRGAGDTARDRLFIDWKRRADSRDPHRMPRLPLPMAPVASPYGPWGGGGGGGAGGFFMPPAPPATLYEHHRFRQALDFRNINAAAAPARQLLFFGSAGMPPRASMPQQQQPPPPPHPPLHSIMLVQPSPAPPTASVPMLLDSVPLVNSPTAASKRVRLFGVNLDNPQPGTSAESSQDANALSLRTPGWQRPGPLRFFESPQRGAESSAASSPSSSSSSKREAHSSLDLDL
CDS SEQ ID NO:126
ATGGAGTTCGCGAGCTCTTCGAGTAGGTTTTCCAGGGAGGAGGACGAGGAGGAAGAGCAGGAGGAAGAGGAGGAGGAGGAGGAGGCGTCTCCGCGCGAGATCCCCTTCATGACAGCGGCAGCGACGGCCGACACCGGAGCCGCCGCCTCCTCGTCCTCGCCTTCCGCGGCGGCCTCATCGGGTCCTGCTGCTGCCCCCCGCTCGAGCGACGGCGCCGGGGCGTCCGGGAGCGGCGGCGGCGGGAGCGACGACGTGCAGGTGATCGAGAAGGAGCACATGTTCGACAAGGTGGTGACGCCCAGCGACGTGGGGAAGCTCAACCGGCTGGTGATCCCGAAGCAGCACGCGGAGAAGTACTTCCCGCTGGACGCGGCGGCCAACGAGAAGGGCCAGCTGCTCAGCTTCGAGGACCGCGCCGGTAAGCTCTGGCGCTTCCGCTACTCCTACTGGAACAGCAGCCAGAGCTACGTCATGACCAAGGGCTGGAGCCGCTTCGTCAAGGAGAAGCGCCTCGACGCCGGCGACACCGTCTCCTTCTGCCGCGGCGCCGGCGACACCGCGCGGGACCGCCTCTTCATCGACTGGAAGCGCCGCGCCGACTCCCGCGACCCGCACCGCATGCCGCGCCTCCCGCTCCCCATGGCGCCCGTCGCGTCGCCCTACGGCCCCTGGGGCGGCGGCGGCGGCGGCGGCGCGGGCGGTTTCTTCATGCCGCCCGCGCCGCCCGCCACACTCTACGAGCACCACCGCTTCCGCCAGGCCCTCGACTTCCGCAACATCAACGCCGCGGCCGCGCCGGCCAGGCAGCTCCTCTTCTTCGGCTCAGCCGGCATGCCCCCGCGCGCGTCCATGCCGCAGCAGCAGCAGCCGCCTCCGCCCCCGCACCCGCCTCTGCACAGCATTATGTTGGTGCAACCCAGCCCCGCGCCGCCCACGGCCAGCGTGCCCATGCTTCTCGACTCGGTACCGCTCGTCAACAGCCCAACGGCAGCGTCGAAGCGCGTCCGCCTGTTTGGGGTCAACCTCGACAACCCGCAACCAGGCACAAGTGCGGAGTCAAGCCAAGATGCCAACGCATTGTCGCTGAGGACACCGGGATGGCAAAGGCCGGGGCCGTTGAGGTTCTTCGAATCGCCTCAACGCGGCGCCGAGTCATCTGCAGCCTCCTCGCCGTCGTCATCGTCGTCCTCCAAGAGAGAAGCGCACTCGTCCTTGGATCTCGATCTGTGA
GRMZM2G098443_T01
Coverage 47%, identity 63%
SEQ ID NO:127
MEFTTPPPATRSGGGEERAAAEHNQHHQQQHATVEKEHMFDKVVTPSDVGKLNRLVIPKQHAEKYFPLDAAANEKGLLLSFEDRTGKPWRFRYSYWNSSQSYVMTKGWSRFVKEKRLDAGDTVSFGRGISEAARDRLFIDWRCRPDPPVVHHQYHHRLPLPSAVVPYAPWAAHAHHHHYPADGHTEPVTPCLCATLVATEMRASSSQLSLTRSNLSRPPQPRIARVDGAQPRPSSSPRQPQSLWCRSCQPQPRRTADVP
CDS SEQ ID NO:128
ATGGAGTTCACCACTCCCCCGCCCGCGACCCGGTCGGGCGGCGGAGAGGAGAGGGCGGCTGCTGAGCACAACCAGCACCACCAGCAGCAGCATGCGACGGTGGAGAAGGAGCACATGTTCGACAAGGTGGTGACGCCGAGCGACGTCGGGAAGCTGAACCGGCTGGTGATCCCGAAGCAGCACGCGGAGAAGTACTTCCCGCTGGACGCGGCGGCGAACGAGAAGGGCCTCCTGCTCAGCTTCGAGGACCGCACGGGGAAGCCCTGGCGCTTCCGCTACTCCTACTGGAACAGTAGCCAGAGCTACGTGATGACCAAGGGCTGGAGCCGCTTCGTCAAGGAGAAGCGCCTCGACGCCGGGGACACAGTCTCCTTCGGCCGCGGCATCAGCGAGGCGGCGCGCGACAGGCTTTTCATCGACTGGCGGTGCCGACCCGACCCGCCCGTCGTGCACCACCAGTACCACCACCGCCTCCCTCTCCCCTCCGCCGTCGTCCCCTACGCGCCGTGGGCGGCGCACGCGCACCACCACCACTACCCAGCAGATGGGCACACGGAACCAGTAACACCTTGCCTGTGCGCCACACTCGTTGCCACTGAAATGAGAGCATCATCTTCGCAACTGTCACTCACACGCTCCAACCTCTCCAGGCCGCCACAACCTAGAATAGCCAGAGTCGATGGCGCCCAGCCACGGCCGTCGTCGTCACCACGCCAGCCACAGTCGTTGTGGTGCCGGTCGTGCCAACCGCAACCACGGCGAACGGCCGACGTTCCTTGA
GRMZM2G082227_T01
Coverage 45%, identity 64%
SEQ ID NO:129
MEFTAPPPATRSGGGEERAAAEHHQQQQQATVEKEHMFDKVVTPSDVGKLNRLVIPKQHAERYFPLDAAANDKGLLLSFEDRAGKPWRFRYSYWNSSQSYVMTKGWSRFVKEKRLDAGDTVSFGRGVGEAARGRLFIDWRRRPDPPVVHHQYHHHRLPLPSAVVPYAPWAAAAHAHHHHYPAAGVGAARTTTTTTTTVLHHLPPSPSPLYLDTRRRHVGYDAYGAGTRQLLFYRPHQQPSTTVMLDSVPVRLPPTPGQHAEPPPPAVASSASKRVRLFGVNLDCAAAAGSEEENVGGWRTSAPPTQQASSSSSYSSGKARCSLNLDL
CDS SEQ ID NO:130
ATGGAGTTCACCGCTCCCCCGCCCGCGACCCGGTCGGGCGGCGGCGAGGAGAGGGCGGCTGCTGAGCACCACCAGCAGCAGCAGCAGGCGACGGTGGAGAAGGAGCACATGTTCGACAAGGTGGTGACGCCGAGCGACGTCGGGAAGCTGAACCGGCTGGTGATCCCGAAGCAGCACGCGGAGAGGTACTTCCCGCTGGACGCGGCGGCGAACGACAAGGGCCTGCTGCTCAGCTTCGAGGACCGCGCGGGGAAGCCCTGGCGCTTCCGCTACTCCTACTGGAACAGCAGCCAGAGCTACGTGATGACCAAGGGCTGGAGCCGCTTCGTCAAGGAGAAGCGCCTCGACGCCGGGGACACCGTCTCCTTCGGCCGCGGCGTCGGCGAGGCGGCGCGCGGCAGGCTCTTCATCGACTGGCGGCGCCGACCCGACCCGCCCGTCGTGCACCACCAGTACCACCACCACCGCCTCCCTCTCCCCTCCGCCGTCGTCCCCTACGCGCCGTGGGCGGCGGCGGCGCACGCGCACCACCACCACTACCCAGCAGCTGGGGTCGGTGCCGCCAGGACGACGACGACGACGACGACGACGGTGCTCCACCACCTGCCGCCCTCGCCCTCCCCGCTCTACCTTGACACCCGCCGCCGCCACGTCGGCTACGACGCCTACGGGGCCGGCACCAGGCAACTTCTCTTCTACAGGCCGCACCAGCAGCCCTCCACGACGGTGATGCTGGACTCCGTGCCGGTACGGTTACCGCCAACGCCAGGGCAGCACGCCGAGCCGCCGCCCCCCGCCGTGGCGTCGTCAGCCTCGAAGCGGGTGCGCCTGTTCGGGGTGAACCTCGACTGCGCCGCCGCCGCCGGCTCAGAGGAGGAGAACGTCGGCGGGTGGAGGACTAGTGCGCCGCCGACGCAGCAGGCGTCCTCCTCCTCATCCTACTCTTCCGGGAAAGCGAGGTGCTCCTTGAACCTTGACTTGTGA
GRMZM2G024948_T01
Coverage 46%, identity 63%
SEQ ID NO:131
MDQFAASGRFSREEEADEEQEDASNSMREISFMPPAAASSSSAAASASASASTSASACASGSSSAPFRSASASGDAAGASGSGGPADADAEAEAVEKEHMFDKVVTPSDVGKLNRLVIPKQYAEKYFPLDAAANEKGLLLSFEDSAGKHWRFRYSYWNSSQSYVMTKGWSRFVKEKRLVAGDTVSFSRAAAEDARHRLFIDWKRRVDTRGPLRFSGLALPMPLPSSHYGGPHHYSPWGFGGGGGGGGGFFMPPSPPATLYEHRLRQGLDFRSMTTTYPAPTVGRQLLFFGSARMPPHHAPPPQPRPFSLPLHHYTVQPSAAGVTAASRPVLLDSVPVIESPTTAAKRVRLFGVNLDNNPDGGGEASHQGDALSLQMPGWQQRTPTLRLLELPRHGGESSAASSPSSSSSSKREARSALDLDL
CDS SEQ ID NO:132
ATGGACCAGTTCGCCGCGAGCGGGAGGTTCTCTAGAGAGGAGGAGGCGGACGAGGAGCAGGAGGATGCGTCCAATTCCATGCGCGAGATCTCCTTCATGCCGCCGGCTGCGGCCTCGTCATCTTCGGCGGCTGCTTCCGCGTCCGCGTCCGCCTCCACCAGCGCATCCGCGTGTGCATCGGGAAGCAGCAGCGCCCCCTTCCGCTCCGCCTCCGCGTCGGGGGATGCCGCCGGAGCGTCGGGGAGCGGCGGCCCAGCGGACGCGGACGCGGAGGCGGAGGCGGTGGAGAAGGAGCACATGTTCGACAAGGTGGTCACGCCGAGCGACGTGGGGAAGCTCAACCGGCTGGTGATCCCGAAGCAGTACGCGGAGAAGTACTTCCCGCTGGACGCGGCGGCCAACGAGAAGGGCCTCCTCCTCAGCTTCGAGGACAGCGCCGGCAAGCACTGGCGCTTCCGCTACTCCTACTGGAACAGCAGCCAGAGCTACGTCATGACCAAGGGCTGGAGCCGCTTCGTCAAGGAGAAGCGCCTCGTCGCCGGGGACACCGTCTCCTTCTCCCGCGCCGCCGCCGAGGACGCGCGCCACCGCCTCTTCATCGACTGGAAGCGCCGGGTCGACACCCGCGGCCCGCTTCGTTTCTCCGGCCTCGCGCTGCCGATGCCGCTGCCGTCGTCGCACTACGGCGGGCCCCACCACTACAGCCCGTGGGGCTTCGGCGGCGGCGGCGGCGGCGGCGGCGGATTCTTCATGCCGCCCTCGCCGCCCGCCACGCTCTACGAGCACCGCCTCAGACAGGGCCTCGACTTCCGCAGCATGACGACGACCTACCCCGCGCCGACCGTGGGGAGGCAGCTCCTGTTTTTCGGCTCGGCCAGGATGCCTCCTCATCACGCGCCGCCGCCCCAGCCGCGCCCGTTCTCGCTGCCGCTGCATCACTACACGGTGCAACCGAGCGCCGCCGGCGTCACCGCCGCGTCACGGCCGGTCCTTCTTGACTCGGTGCCGGTCATCGAGAGCCCGACGACCGCCGCGAAGCGCGTGCGGCTGTTCGGCGTCAACCTGGACAACAACCCAGATGGCGGCGGCGAGGCTAGCCATCAGGGCGATGCATTGTCATTGCAGATGCCCGGGTGGCAGCAAAGGACTCCAACTCTAAGGCTACTAGAATTGCCTCGCCATGGCGGGGAGTCCTCCGCGGCGTCGTCTCCGTCGTCGTCGTCTTCCTCCAAGAGGGAGGCGCGTTCAGCTTTGGATCTCGATCTGTGA
GRMZM2G328742_T01
Coverage 55%, identity 64%
SEQ ID NO:134
MATNHLSQGQHQHPQAWPWGVAMYTNLHYHHQQHHHYEKEHLFEKPLTPSDVGKLNRLVIPKQHAERYFPLSSSGAGDKGLILCFEDDDDDEAAAANKPWRFRYSYWTSSQSYVLTKGWSRYVKEKQLDAGDVVRFQRMRGFGMPDRLFISHSRRGETTATAATTVPPAAAAVRVVVAPAQSAGADHQQQQQPSPWSPMCYSTSGSYSYPTSSPANSQHAYHRHSADHDHSNNMQHAGESQSDRDNRSCSAASAPPPPSRRLRLFGVNLDCGPGPEPETPTAMYGYMHQSPYAYNNWGSPYQHDEEI
CDS SEQ ID NO:135
ATGGCCACGAACCATCTCTCCCAAGGGCAGCACCAGCACCCGCAGGCCTGGCCCTGGGGCGTGGCCATGTACACCAACCTACACTACCACCACCAGCAGCACCACCACTACGAGAAGGAGCACCTGTTCGAGAAGCCGCTGACGCCGAGCGACGTGGGCAAGCTCAACAGGCTGGTGATCCCCAAGCAGCACGCCGAGAGGTACTTCCCTCTCAGCAGCAGCGGCGCCGGCGACAAAGGCCTCATCCTGTGCTTCGAGGACGACGACGACGACGAGGCTGCCGCCGCCAACAAGCCGTGGCGGTTCCGCTACTCGTACTGGACCAGCAGCCAGAGCTACGTGCTCACCAAGGGCTGGAGCCGCTACGTCAAGGAGAAGCAGCTTGACGCCGGCGACGTCGTGCGCTTCCAGAGGATGCGTGGTTTCGGCATGCCCGACCGCCTGTTCATCAGCCACAGCCGCCGCGGCGAGACTACTGCTACTGCTGCAACAACAGTGCCCCCCGCTGCTGCTGCCGTGCGCGTAGTAGTGGCACCTGCACAGAGCGCTGGCGCAGACCACCAGCAGCAGCAGCAGCCGTCGCCTTGGAGCCCAATGTGCTACAGCACATCAGGCTCGTACTCGTACCCCACCAGCAGCCCAGCCAATTCCCAGCATGCCTACCACCGCCACTCAGCTGACCATGACCACAGCAACAACATGCAACATGCAGGAGAATCTCAGTCCGACAGAGACAACAGGAGCTGCAGTGCAGCTTCGGCACCGCCGCCACCGTCGCGGCGGCTCCGGCTGTTCGGCGTAAACCTCGACTGCGGCCCGGGGCCGGAGCCGGAGACACCAACGGCGATGTACGGCTACATGCACCAAAGCCCCTACGCTTACAACAACTGGGGCAGTCCATACCAGCATGACGAGGAGATTTAA
GRMZM2G142999_T01
Coverage 44%, identity 64%
SEQ ID NO:136
MEFTPAHAHARVVEDSERPRGGVAWVEKEHMFEKVVTPSDVGKLNRLVIPKQHAERYFPALDASSAAAAAAAAAAGGGKGLVLSFEDRAGKAWRFRYSYWNSSQSYVMTKGWSRFVKEKRLGAGDTVLFARGAGGARGRFFIDFRRRRQDLAFLQPTLASAQRLLPLPSVPICPWQDYGASAPAPNRHVLFLRPQVPAAVVLKSVPVHVAASAVEATMSKRVRLFGVNLDCPPDAEDSATVPRGRAASTTLLQLPSPSSSTSSSTAGKDVCCLDLGL
CDS SEQ ID NO:137
ATGGAGTTCACGCCCGCGCATGCGCATGCCCGTGTCGTTGAGGATTCCGAGAGGCCTCGCGGCGGCGTGGCCTGGGTGGAGAAGGAGCACATGTTCGAGAAGGTGGTCACCCCGAGCGACGTGGGGAAGCTCAATCGCCTGGTCATCCCAAAGCAGCACGCGGAGCGCTACTTCCCCGCGCTGGACGCCTCGTCCGCCGCGGCGGCGGCGGCGGCAGCAGCCGCGGGAGGCGGGAAGGGGCTGGTGCTCAGCTTCGAGGACCGGGCGGGGAAGGCGTGGCGCTTCCGCTACTCGTACTGGAACAGCAGCCAGAGCTACGTGATGACCAAAGGTTGGAGCCGCTTCGTGAAGGAGAAGCGCCTCGGTGCCGGGGACACAGTCTTGTTCGCGCGCGGCGCGGGCGGCGCGCGCGGCCGCTTCTTCATCGATTTCCGCCGCCGTCGCCAGGATCTCGCGTTCCTGCAGCCGACGCTGGCGTCTGCGCAGCGACTCCTGCCGCTGCCGTCGGTGCCCATCTGCCCGTGGCAGGACTACGGCGCCTCGGCTCCGGCGCCCAACCGGCACGTGCTGTTCCTGCGGCCGCAGGTGCCGGCCGCCGTAGTGCTCAAGTCGGTCCCCGTGCACGTTGCTGCATCCGCGGTGGAGGCGACCATGTCGAAGCGCGTCCGCCTGTTCGGGGTGAACCTCGACTGCCCGCCGGACGCCGAAGACAGCGCCACAGTCCCCCGGGGCCGGGCGGCGTCGACGACGCTTCTGCAACTGCCCTCGCCATCGTCGTCAACATCCTCCTCGACGGCAGGGAAGGACGTGTGCTGTTTGGATCTTGGACTGTGA
GRMZM2G125095_T01
Coverage 85%, identity 40%
SEQ ID NO:138
MEFRPAHARVFEDSERPRGGVAWLEKEHMFEKVVTPSDVGKLNRLVIPKQHAERYFPALDASAAAASASASAGGGKAGLVLSFEDRAGKAWRFRYSYWNSSQSYVMTKGWSRFVKEKRLGAGDTVLFARGAGATRGRFFIDFRRRRHELAFLQPPLASAQRLLPLPSVPICPWQGYGASAPAPSRHVLFLRPQVPAAVVLTSVPVRVAASAVEEATRSKRVRLFGVNLDCPPDAEDGATATRTPSTLLQLPSPSSSTSSSTGGKDVRSLDLGL
CDS SEQ ID NO:139
ATGGAGTTCAGGCCCGCGCATGCCCGTGTCTTCGAGGATTCCGAGAGGCCTCGCGGCGGCGTGGCGTGGCTGGAGAAGGAGCACATGTTCGAGAAAGTGGTCACCCCGAGCGACGTGGGGAAGCTCAATCGCCTGGTCATCCCGAAGCAGCACGCCGAGCGCTACTTCCCCGCGCTGGACGCCTCGGCCGCCGCGGCGTCGGCATCGGCGTCGGCGGGCGGCGGGAAGGCGGGGCTGGTGCTCAGCTTCGAGGACCGGGCGGGGAAGGCGTGGCGCTTCCGCTACTCGTACTGGAACAGCAGCCAGAGCTACGTGATGACCAAGGGATGGAGCCGCTTCGTGAAAGAGAAGCGCCTCGGTGCCGGGGACACGGTATTGTTCGCGCGCGGCGCGGGCGCCACGCGCGGCCGCTTCTTCATCGATTTCCGCCGCCGCCGCCACGAGCTCGCGTTCCTGCAGCCGCCGCTGGCGTCTGCGCAGCGCCTCCTGCCGCTCCCGTCGGTGCCCATCTGCCCGTGGCAGGGCTACGGCGCCTCCGCTCCGGCGCCAAGCCGGCACGTGCTGTTCCTGCGGCCGCAGGTGCCGGCCGCCGTAGTGCTCACGTCGGTGCCCGTGCGCGTCGCCGCATCCGCGGTGGAGGAGGCGACGAGGTCGAAGCGCGTCCGCCTGTTCGGGGTGAACCTCGACTGCCCGCCGGACGCCGAAGACGGTGCCACAGCCACCCGGACGCCGTCGACGCTTCTGCAGCTGCCCTCGCCATCGTCGTCAACATCCTCCTCCACGGGAGGCAAGGATGTGCGTTCTTTGGATCTTGGACTTTGA
Wheat (Triticum aestivum)
TRAES3BF098300010CFD_t1
Coverage 42%, identity 60%
SEQ ID NO:140
MGVEILSSMVEHSFQYSSGVSTATTESGTAGTPPRPLSLPVAIADESVTSRSASSRFKGVVPQPNGRWGAQIYERHARVWLGTFPDQDSAARAYDVASLRYRGRDVAFNFPCAAVEGELAFLAAHSKAEIVDMLRKQTYADELRQGLRRGRGMGARAQPTPSWAREPLFEKAVTPSDVGKLNRLVVPKQHAEKHFPLKRTPETPTTTGKGVLLNFEDGEGKVWRFRYSYWNSSQSYVLTKGWSRFVREKGLGAGDSILFSCSLYEQEKQFFIDCKKNTSMNGGKSASPLPVGVTTKGEQVRVVRLFGVDISGVKRGRAATATAEQGLQELFKRQCVAPGQHSPALGAFAL
CDS SEQ ID NO:141
ATGGGGGTGGAAATCCTGAGCTCCATGGTGGAGCACTCCTTCCAGTACTCTTCCGGCGTGTCCACGGCCACGACGGAGTCAGGCACCGCCGGAACACCGCCGAGGCCTTTGAGCCTACCTGTCGCCATCGCCGACGAGTCCGTGACCTCGCGGTCGGCGTCGTCTCGGTTCAAGGGCGTGGTGCCGCAGCCAAACGGGCGATGGGGCGCCCAGATCTACGAGCGCCACGCTCGCGTCTGGCTCGGCACGTTCCCAGACCAGGACTCGGCGGCGCGCGCCTACGACGTAGCCTCGCTCAGGTACCGCGGCCGCGACGTCGCCTTCAACTTCCCGTGCGCGGCCGTGGAGGGGGAGCTCGCCTTCCTGGCGGCGCACTCCAAGGCTGAGATAGTGGACATGCTCCGGAAGCAGACCTACGCCGATGAACTCCGCCAGGGCCTGCGGCGCGGCCGTGGCATGGGGGCGCGCGCGCAGCCGACGCCGTCGTGGGCGCGGGAGCCCCTTTTCGAGAAGGCCGTGACCCCTAGCGATGTCGGCAAGCTCAATCGCCTCGTAGTGCCGAAGCAGCACGCCGAGAAGCACTTCCCCCTGAAGCGCACGCCGGAGACGCCGACCACCACCGGCAAGGGCGTGCTGCTCAACTTCGAGGACGGCGAGGGGAAGGTGTGGAGGTTCCGGTACTCGTACTGGAACAGCAGCCAGAGCTACGTGCTCACCAAAGGCTGGAGCCGCTTCGTCCGGGAGAAGGGCCTAGGTGCCGGCGACTCCATCCTATTCTCGTGCTCGCTGTACGAACAGGAGAAGCAGTTCTTCATCGACTGCAAGAAGAACACTAGCATGAACGGAGGCAAATCGGCGTCGCCGCTGCCAGTGGGGGTGACTACCAAAGGAGAACAAGTTCGCGTCGTTAGGCTATTCGGTGTCGACATCTCGGGAGTGAAGAGGGGGCGAGCGGCGACGGCAACGGCGGAGCAAGGCCTGCAGGAGTTGTTCAAGAGGCAATGCGTGGCACCCGGCCAGCACTCTCCTGCCCTAGGTGCCTTCGCCTTATAG
TRAES3BF062700040CFD_t1
Coverage 47%, identity 55%
SEQ ID NO:142
MASGKPTNHGMEDDNDMEYSSAESGAEDAAEPSSSPVLAPPRAAPSSRFKGVVPQPNGRWGAQIYEKHSRVWLGTFPDEDAAVRAYDVAALRFRGPDAVINHQRPTAAEEAGSSSSRSELDPELGFLADHSKAEIVDMLRKHTYDDELRQGLRRGRGRAQPTPAWARELLFEKAVTPSDVGKLNRLVVPKQQAEKHFPPTTAAATGSNGKGVLLNFEDGEGKVWRFRYSYWNSSQSYVLTKGWSRFVKETGLRAGDTVAFYRSAYGNDTEDQLFIDYKKMNKNDDAADAAISDENETGHVAVKLFGVDIAGGGMAGSSGG
CDS SEQ ID NO:143
ATGGCATCTGGCAAGCCGACAAACCACGGGATGGAGGACGACAACGACATGGAGTACTCCTCCGCGGAATCGGGGGCCGAGGACGCGGCGGAGCCGTCGTCGTCGCCGGTGCTGGCGCCGCCCCGGGCGGCTCCATCGTCGCGGTTCAAGGGCGTCGTGCCGCAGCCCAACGGGCGGTGGGGAGCGCAGATCTACGAGAAGCACTCGCGGGTGTGGCTCGGAACGTTCCCCGACGAGGACGCCGCCGTGCGCGCCTACGACGTGGCCGCGCTCCGCTTCCGCGGCCCGGACGCCGTCATCAACCACCAGCGACCGACGGCCGCGGAGGAGGCCGGCTCGTCGTCGTCCAGGAGCGAGCTGGATCCAGAGCTCGGCTTCCTTGCCGACCACTCCAAGGCCGAGATCGTCGACATGCTCCGGAAGCACACCTACGACGACGAGCTCCGTCAGGGCCTGCGCCGCGGCCGCGGGCGCGCGCAGCCGACGCCGGCGTGGGCACGAGAGCTCCTCTTCGAGAAGGCCGTGACCCCGAGCGACGTCGGCAAGCTCAACCGCCTCGTGGTGCCGAAGCAGCAGGCCGAGAAGCACTTCCCTCCGACCACTGCGGCGGCCACCGGCAGCAACGGCAAGGGCGTGCTGCTCAACTTCGAGGACGGCGAAGGGAAGGTGTGGCGCTTCCGGTACTCGTACTGGAACAGCAGCCAGAGCTACGTGCTCACCAAGGGCTGGAGCCGCTTCGTCAAGGAGACGGGCCTCCGCGCCGGCGACACCGTGGCGTTCTACCGGTCGGCGTACGGGAATGACACGGAGGATCAGCTCTTCATCGACTACAAGAAGATGAACAAGAATGACGATGCTGCGGACGCGGCGATTTCCGATGAGAATGAGACAGGCCATGTCGCCGTCAAGCTCTTCGGCGTTGACATTGCCGGTGGAGGGATGGCGGGATCATCAGGTGGCTGA
TRAES3BF062600010CFD_t1
Coverage 43%, identity 58%
SEQ ID NO:144
MASGKPTNHGMEDDNDMEYSSAESGAEDAAEPSSSPVLAPPRAAPSSRFKGVVPQPNGRWGAQIYEKHSRVWLGTFPDEDAAARAYDVAALRFRGPDAVINHQRPTAAEEAGSSSSRSELDPELGFLADHSKAEIVDMLRKHTYDDELRQGLRRGRGRAQPTPAWARELLFEKAVTPSDVGKLNRLVVPKQQAEKHFPPTTAAATGSNGKGVLLNFEDGEGKVWRFRYSYWNSSQSYVLTKGWSRFVKETGLRAGDTVAFYRSAYGNDTEDQLFIDYKKMNKNDDAADAAISDENETGHVAVKLFGVDIAGGGMAGSSGG
CDS SEQ ID NO:145
ATGGCATCTGGCAAGCCGACAAACCACGGGATGGAGGACGACAACGACATGGAGTACTCCTCCGCGGAATCGGGGGCCGAGGACGCGGCGGAGCCGTCGTCGTCGCCGGTGCTGGCGCCGCCCCGGGCGGCTCCATCGTCGCGGTTCAAGGGCGTCGTGCCGCAGCCCAACGGGCGGTGGGGAGCGCAGATCTACGAGAAGCACTCGCGGGTGTGGCTCGGAACGTTCCCCGACGAGGACGCCGCCGCGCGCGCCTACGACGTGGCCGCGCTCCGCTTCCGCGGCCCGGACGCCGTCATCAACCACCAGCGACCGACGGCCGCGGAGGAGGCCGGCTCGTCGTCGTCCAGGAGCGAGCTGGATCCAGAGCTCGGCTTCCTCGCCGACCACTCCAAGGCCGAGATCGTCGACATGCTCCGGAAGCACACCTACGACGACGAGCTCCGTCAGGGCCTGCGCCGCGGCCGCGGGCGCGCGCAGCCGACGCCGGCGTGGGCACGAGAGCTCCTCTTCGAGAAGGCCGTGACCCCGAGCGACGTCGGCAAGCTCAACCGCCTCGTGGTGCCGAAGCAGCAGGCCGAGAAGCACTTCCCTCCGACCACTGCGGCGGCCACCGGCAGCAACGGCAAGGGCGTGCTGCTCAACTTCGAGGACGGCGAAGGGAAGGTGTGGCGCTTCCGGTACTCGTACTGGAACAGCAGCCAGAGCTACGTGCTCACCAAGGGCTGGAGCCGCTTCGTCAAGGAGACGGGCCTCCGCGCCGGCGACACCGTGGCGTTCTACCGGTCGGCGTACGGGAATGACACGGAGGATCAGCTCTTCATCGACTACAAGAAGATGAACAAGAATGACGATGCTGCGGACGCGGCGATTTCCGATGAGAATGAGACAGGCCATGTCGCCGTCAAGCTCTTCGGCGTTGACATTGCCGGTGGAGGGATGGCGGGATCATCAGGTGGCTGA
Sequence Listing
<110> Institute of Genetics and Developmental Biology
<120> Transgenic plants
<130> PC927564WOA
<150> PCT/CN2015/072156
<151> 2015-02-03
<160> 272
<170> BiSSAP 1.3
<210> 1
<211> 804
<212> DNA
<213> Artificial Sequence
<220>
<223> cDNA
<400> 1
atgtcagtca accattacca caacactctc tcgttgcatc atcaccacca aaacgacgta 60
gctatagcac aacgagagtc tttgttcgag aaatcactca caccaagcga cgtcggaaag 120
ctaaaccgct tagtcatacc aaaacaacac gccgagaaat acttccctct caataataat 180
aataataatg gcggcagcgg agatgacgtg gcgacgacgg agaaagggat gcttcttagc 240
ttcgaggatg agtcaggcaa gtgttggaaa ttcagatact cttattggaa cagtagccaa 300
agctacgtgt tgaccaaagg atggagcagg tacgtcaaag acaaacacct cgacgcaggc 360
gacgttgttt tctttcaacg tcaccgtttt gatctccata gactcttcat tggctggcgg 420
agacgcggtg aagcttcttc ctctcccgct gtctccgttg tgtctcaaga agctctagtt 480
aatacgacgg cgtattggag cggcttgact acaccttatc gtcaagtaca cgcgtcaact 540
acttacccta atattcacca agagtattca cactatggcg ccgtcgttga tcatgctcag 600
tcgataccac cggtggtcgc aggtagctcg aggacggtga ggctttttgg cgtgaacctc 660
gaatgtcatg gtgatgccgt cgagccacca ccgcgtcctg atgtctataa tgaccaacac 720
atttactatt actcaactcc tcatcccatg aatatatcat ttgctgggga agcattggag 780
caggtaggag atggacgagg ttga 804
<210> 2
<211> 4285
<212> DNA
<213> Arabidopsis thaliana
<400> 2
ttgtttcggc tatttgttat actattgtta taacagtcac aagacttgac ctcaacgaaa 60
acttttacaa aacgtgaatt ggaaattttt acaaaatatg ctcttaatcg ttaatgcttc 120
ccaattaggt gagttaaatt gtgagaggaa ccatttctta gaggaaatgg ttcatgaaaa 180
caaatatgaa atagtatcac tagtcttagt tttgcgagaa aattaggaaa aatagaaacg 240
tgtaagcacc aatgatattc ctgaaagcac gtgacagata tttcatgatc ctataattaa 300
caagtgataa agatattaaa taaaattaac gatacttgag aaattcgtca aataaaatag 360
aagaggacca ctcacgtaac catttgcacg tcccattgat ttttgtggta gacttggtat 420
gttatattac ttatattcac agaattatat acgaaactca cgacttaaga tgcacggtaa 480
taactacaga tggaaattta cccatcaaac aagaaaacaa catttactca agcatctagc 540
tagaccaaaa tgtttgttta cttgttgact tgcgatccat agatatatta gttagaactt 600
tttcttctac aattgatcaa atgtttcaca ctgttctcaa tttctcatct agattcatga 660
cttatatgtt tggtcaaata tcacagcttg atgagcatta aatagcgtcg aagtatagga 720
tggttacgtt gttcaatatt gtaaaggaaa aaaagagaaa gagtgccaaa aggtcaagtc 780
gatttcacaa ataaatcttg aagtctttat ccctctcgat tataaaatga ttaggaaaag 840
aaaaagagag aataaaatgt agataaagag aaagagaaag agagagagga acataaggga 900
tggtatgaag tagaagtgaa gatgcatgcg atggtgtgtc ggaaaggcaa agcacatgct 960
acacaacttg agcttctcac ttgcgtcagg gataagtatc ctctgtacct tcttactttt 1020
gcgtaatatg taccacctca cttctcaacc gtttgatctt taatccttca ttatttcttc 1080
attaccttct ctttttgttt ttgttttcgt tttcaatttc tcatagattc atttacaaac 1140
taaatatcat aggaaggtgt tatctctagt taatttctta tcctacttta acaaaattta 1200
attgtcaaaa gattattttt acgtttatag acaaaagata ctgacacatc aattccacga 1260
accaaatggt tgagaaaaac aaaacgacta tctttgtctt gcaaataaat taatggcagt 1320
tagtaagatt ctcagctgaa aattcataca agagtaaatg atcaaataac catttatgag 1380
agaaatttaa tccttcagaa accaatgagg atctgatcaa gtaattgcaa accacatgag 1440
tccatgataa aggattgttt gacttacgca atccacatat ttatggctgc ttgatatgta 1500
aggtttatct gctttgacag tctatagaat cttgctaatc aatacgtcat atccggtgaa 1560
tactgaaact tttttaatta agaaaacaca aatcatcttt tctccggagg atttcgaatt 1620
tagttccggc aatgctgaaa taacatatgt tgaacttata acattccaag acatcaaatt 1680
ttactaatat ataaataatt acatattctt cttctacatg atcaaaacct tttcaacttt 1740
aattaaaggg ttacgtcgcg gcgttttgtg tggcttactc tttttttaca ctataactat 1800
agaacactcg tggatccaat gccgtttagg acaagatttt atcagacgag aaaaaaaaaa 1860
acaataccac atttttaaat atatatggat tatggactgc aacaacaata tagaaaagaa 1920
gagaaaaaaa taaaaataat gattgaaagg aaatatcatc acgcaaaacc ttaaaagtac 1980
tatcggtatc gtgtcgtcct ctcctcatca aatagttccc acagttttca catcaattta 2040
accattttca atttttttca ctctctgtct ctctcctttg tataatacta tattagtacc 2100
attacccatc tctctttcac caccaaacca acacctgcaa atcctctctc tctctctcac 2160
tccaagaaac caaaaaaaaa gatgtcagtc aaccattacc acaacactct ctcgttgcat 2220
catcaccacc aaaacgacgt agctatagca caacgagagt ctttgttcga gaaatcactc 2280
acaccaagcg acgtcggaaa gctaaaccgc ttagtcatac caaaacaaca cgccgagaaa 2340
tacttccctc tcaataataa taataataat ggcggcagcg gagatgacgt ggcgacgacg 2400
gagaaaggga tgcttcttag cttcgaggat gagtcaggca agtgttggaa attcagatac 2460
tcttattgga acagtagcca aagctacgtg ttgaccaaag gatggagcag gtacgtcaaa 2520
gacaaacacc tcgacgcagg cgacgttgtt ttctttcaac gtcaccgttt tgatctccat 2580
agactcttca ttggctggcg gagacgcggt gaagcttctt cctctcccgc tgtctccgtt 2640
gtgtctcaag aagctctagt taatacgacg gcgtattgga gcggcttgac tacaccttat 2700
cgtcaagtac acgcgtcaac tacttaccct aatattcacc aagagtattc acactatggt 2760
aaattcaaac cctttatttc ctcttttgtt ttttctttct ctcttatcta tatgtcagat 2820
ttatactcct ctctgttctc ttttaagatt tgtctttttc ataaaaatag atgattcgta 2880
atttgtattg catatttaca tgttctctta aaaaaagtaa tagagattaa tattttatgc 2940
atggtatttt agattatctg cctactttat atggtagtaa acaagaacat tcatctttat 3000
ttggttttat aaacaaaata tgagaatttt taaaggttag ggcaagcact tggaaagctc 3060
aaccatttta gttagctggt ggaatatctt tcttataaaa agcaaatgag ttatctaaaa 3120
ctatatgaca attattttag ttgcgtgtgt aatgtatata aaataacaac atgaaataac 3180
attttgtctt ttatttttgt cattcttatt atttaatttt ggacccgaca atttcaaata 3240
atcttctcca agttgtaact aatccgttac atgcgcgtga ggagaaccgt ccaatccact 3300
tagactaacg tgccctttat ttcttccttt taattctatg ttaaaaaaac aatttaacta 3360
aaagatgcgc acgtgtcttg acggtggaaa aaaattgtag gcgccgtcgt tgatcatgct 3420
cagtcgatac caccggtggt cgcaggtagc tcgaggacgg tgaggctttt tggcgtgaac 3480
ctcgaatgtc atggtgatgc cgtcgagcca ccaccgcgtc ctgatgtcta taatgaccaa 3540
cacatttact attactcaac tcctcatccc atggtaaata tttttttttt ttacattttt 3600
gtcagattca aatttttgct tacgtatgat ataattatta aacagatgtc gtggctgttt 3660
ctcgagacga gacagatgaa aattagtaat tttaaaatag acctgaaaga gatttttatg 3720
tttaataaat tatataaagg aggaatcaga gagaataata ctatacactt gactgtaaaa 3780
ccacatggcc aatttggttt ttatttgatt actttgattt gttttgttta ctcttttgtc 3840
tctgtagcct ccttttgttc attaattaat atcagccgta agtatatagt ttcctgtgaa 3900
aacagtctct attttggttt tactattcta atttgttagg caccgtcagt tttttttgtg 3960
aaaccaaatt attgactaat aagctggaaa gcaaaactga ctaaaagcat tacaaactta 4020
tcaatgacat aagttttgaa tttattacca tgttttgtaa tgttcagata taatttgaaa 4080
tgcttagaat tatatatttg tatacttaaa ttaatgaaat aaagtgaata ctaaagatag 4140
ttttattttt catattattc tatacaattc ggtgtacaat ttgtttttga tgataataaa 4200
aataataaaa ttgcgtgttg gaattgtgaa acagaatata tcatttgctg gggaagcatt 4260
ggagcaggta ggagatggac gaggt 4285
<210> 3
<211> 267
<212> PRT
<213> Arabidopsis thaliana
<400> 3
Met Ser Val Asn His Tyr His Asn Thr Leu Ser Leu His His His His
1 5 10 15
Gln Asn Asp Val Ala Ile Ala Gln Arg Glu Ser Leu Phe Glu Lys Ser
20 25 30
Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys
35 40 45
Gln His Ala Glu Lys Tyr Phe Pro Leu Asn Asn Asn Asn Asn Asn Gly
50 55 60
Gly Ser Gly Asp Asp Val Ala Thr Thr Glu Lys Gly Met Leu Leu Ser
65 70 75 80
Phe Glu Asp Glu Ser Gly Lys Cys Trp Lys Phe Arg Tyr Ser Tyr Trp
85 90 95
Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val
100 105 110
Lys Asp Lys His Leu Asp Ala Gly Asp Val Val Phe Phe Gln Arg His
115 120 125
Arg Phe Asp Leu His Arg Leu Phe Ile Gly Trp Arg Arg Arg Gly Glu
130 135 140
Ala Ser Ser Ser Pro Ala Val Ser Val Val Ser Gln Glu Ala Leu Val
145 150 155 160
Asn Thr Thr Ala Tyr Trp Ser Gly Leu Thr Thr Pro Tyr Arg Gln Val
165 170 175
His Ala Ser Thr Thr Tyr Pro Asn Ile His Gln Glu Tyr Ser His Tyr
180 185 190
Gly Ala Val Val Asp His Ala Gln Ser Ile Pro Pro Val Val Ala Gly
195 200 205
Ser Ser Arg Thr Val Arg Leu Phe Gly Val Asn Leu Glu Cys His Gly
210 215 220
Asp Ala Val Glu Pro Pro Pro Arg Pro Asp Val Tyr Asn Asp Gln His
225 230 235 240
Ile Tyr Tyr Tyr Ser Thr Pro His Pro Met Asn Ile Ser Phe Ala Gly
245 250 255
Glu Ala Leu Glu Gln Val Gly Asp Gly Arg Gly
260 265
<210> 4
<211> 849
<212> DNA
<213> Arabidopsis thaliana
<220>
<223> cDNA
<400> 4
atgtcagtca accattactc cacagaccac caccacactc tcttgtggca gcaacagcaa 60
caccgccaca ccaccgacac atcggagaca accaccaccg ccacatggct ccacgacgac 120
ctaaaagagt cactcttcga gaagtctctc acaccaagcg acgtcgggaa actcaaccgc 180
ctcgtcatac caaaacaaca cgcagagaaa tacttccctc tcaatgccgt cctagtctcc 240
tctgctgctg ctgacacgtc atcttcggag aaagggatgc ttctaagctt tgaagacgag 300
tcaggcaagt catggaggtt cagatactct tactggaaca gcagtcaaag ctatgtcttg 360
actaaaggat ggagcagatt tgtcaaagac aaacagctcg atccaggcga cgttgttttc 420
ttccaacgac accgttctga ttctaggaga ctcttcattg gctggcgcag acgtggacaa 480
ggctcctcat cctccgtcgc ggccactaac tccgccgtga atacgagttc tatgggagct 540
ctttcttatc atcaaatcca cgccactagt aattactcta atcctccctc tcactcagag 600
tattcccact atggagccgc cgtagcaaca gcggctgaga ctcacagcac accgtcgtct 660
tccgtcgtcg ggagctcaag gacggtgagg cttttcggtg tgaatctgga gtgtcaaatg 720
gatgaaaacg acggagatga ttctgttgca gttgccacca ccgttgaatc tcccgacggt 780
tactacggcc aaaacatgta ctattattac tctcatcctc ataacatggt aattttaact 840
cttttataa 849
<210> 5
<211> 282
<212> PRT
<213> Arabidopsis thaliana
<400> 5
Met Ser Val Asn His Tyr Ser Thr Asp His His His Thr Leu Leu Trp
1 5 10 15
Gln Gln Gln Gln His Arg His Thr Thr Asp Thr Ser Glu Thr Thr Thr
20 25 30
Thr Ala Thr Trp Leu His Asp Asp Leu Lys Glu Ser Leu Phe Glu Lys
35 40 45
Ser Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro
50 55 60
Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asn Ala Val Leu Val Ser
65 70 75 80
Ser Ala Ala Ala Asp Thr Ser Ser Ser Glu Lys Gly Met Leu Leu Ser
85 90 95
Phe Glu Asp Glu Ser Gly Lys Ser Trp Arg Phe Arg Tyr Ser Tyr Trp
100 105 110
Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Phe Val
115 120 125
Lys Asp Lys Gln Leu Asp Pro Gly Asp Val Val Phe Phe Gln Arg His
130 135 140
Arg Ser Asp Ser Arg Arg Leu Phe Ile Gly Trp Arg Arg Arg Gly Gln
145 150 155 160
Gly Ser Ser Ser Ser Val Ala Ala Thr Asn Ser Ala Val Asn Thr Ser
165 170 175
Ser Met Gly Ala Leu Ser Tyr His Gln Ile His Ala Thr Ser Asn Tyr
180 185 190
Ser Asn Pro Pro Ser His Ser Glu Tyr Ser His Tyr Gly Ala Ala Val
195 200 205
Ala Thr Ala Ala Glu Thr His Ser Thr Pro Ser Ser Ser Val Val Gly
210 215 220
Ser Ser Arg Thr Val Arg Leu Phe Gly Val Asn Leu Glu Cys Gln Met
225 230 235 240
Asp Glu Asn Asp Gly Asp Asp Ser Val Ala Val Ala Thr Thr Val Glu
245 250 255
Ser Pro Asp Gly Tyr Tyr Gly Gln Asn Met Tyr Tyr Tyr Tyr Ser His
260 265 270
Pro His Asn Met Val Ile Leu Thr Leu Leu
275 280
<210> 6
<211> 33
<212> PRT
<213> Artificial Sequence
<220>
<223> domain
<400> 6
Ser Asn Asn Asn Asn Asn Asn Gly Gly Ser Gly Asp Asp Val Ala Cys
1 5 10 15
His Phe Gln Arg Phe Asp Leu His Arg Leu Phe Ile Gly Trp Arg Gly
20 25 30
Glu
<210> 7
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> domain
<400> 7
Val Arg Leu Phe Gly Val Asn Leu Glu
1 5
<210> 8
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 8
accatgacat tcgaggttca c 21
<210> 9
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 9
atcaccacca aaacgacgta g 21
<210> 10
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 10
tacgtcatgc ttcaaatcgt g 21
<210> 11
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 11
aggacacgaa caattcattc g 21
<210> 12
<211> 29
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 12
tacgaataag agcgtccatt ttagagtga 29
<210> 13
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 13
acccaaagaa cagcaatcat g 21
<210> 14
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 14
aaaacactcc gccattaaac c 21
<210> 15
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 15
cgagtatcaa tggaaactta accg 24
<210> 16
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 16
aacggagagt ggcttgagat 20
<210> 17
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 17
tggcccttat ggtttctgca 20
<210> 18
<211> 15
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 18
ntcgantntn gngtt 15
<210> 19
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 19
atgtcagtca accattacca c 21
<210> 20
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 20
caggtaggag atggacgagg ttga 24
<210> 21
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 21
tgagaggaac catttcttag agg 23
<210> 22
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 22
acctcgtcca tctcctacct gc 22
<210> 23
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 23
aaacacgtca aatataacga at 22
<210> 24
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 24
cttttttttg gtttcttgga gtgagagaga gag 33
<210> 25
<211> 29
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 25
agtctgggcc catgtcagtc aaccattac 29
<210> 26
<211> 29
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 26
gcgactagtt tataaaagag ttaaaatta 29
<210> 27
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 27
cgggatcctc agtcaaccat tacc 24
<210> 28
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 28
actagtcgac tcaacctcgt ccatctcc 28
<210> 29
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 29
gaaatcacag cacttgcacc 20
<210> 30
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 30
aagcctttga tcttgagagc 20
<210> 31
<211> 18
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 31
gcgacgacgg agaaaggg 18
<210> 32
<211> 18
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 32
acgacggcgc catagtgt 18
<210> 33
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 33
tttgaagacg agtcaggcaa gt 22
<210> 34
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 34
tacggcggct ccatagtggg 20
<210> 35
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 35
gtattggagc ggcttgacta cacc 24
<210> 36
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 36
gacggcatca ccatgacatt cg 22
<210> 37
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 37
tgattctgac atgattgctg ttct 24
<210> 38
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 38
tcgcaactgt atctgtccct cta 23
<210> 39
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 39
cgtttcgctt tccttagtgt tagct 25
<210> 40
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 40
agcgaacgga tctagagact caccttg 27
<210> 41
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 41
caggcctaag cctaacagta gac 23
<210> 42
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 42
tgtactagga tttatttacg tag 23
<210> 43
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 43
tattgttcat agaaaccctg caaa 24
<210> 44
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 44
agtcaatggt ttaatggcgg agtg 24
<210> 45
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 45
ttctactaca cttgctctct gta 23
<210> 46
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 46
tacagagagc aagtgtagta gaa 23
<210> 47
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 47
ttctactaac acctctctct gta 23
<210> 48
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 48
tacagagaga ggtgttagta gaa 23
<210> 49
<211> 192
<212> PRT
<213> Oryza sativa
<400> 49
Met Ala Met His Ala Gly His Ala Trp Trp Gly Val Ala Met Tyr Thr
1 5 10 15
Asn His Tyr His His His Tyr Arg His Lys Thr Ser Asp Val Gly Lys
20 25 30
Asn Arg Val Lys His Ala Arg Tyr Gly Gly Gly Asp Ser Gly Lys Gly
35 40 45
Ser Asp Ser Gly Lys Trp Arg Arg Tyr Ser Tyr Trp Thr Ser Ser Ser
50 55 60
Tyr Val Thr Lys Gly Trp Ser Arg Tyr Val Lys Lys Arg Asp Ala Gly
65 70 75 80
Asp Val Val His Arg Val Arg Gly Gly Ala Ala Asp Arg Gly Cys Arg
85 90 95
Arg Arg Gly Ser Ala Ala Ala Val Arg Val Thr Ala Asn Gly Gly Trp
100 105 110
Ser Met Cys Tyr Ser Thr Ser Gly Ser Ser Tyr Asp Thr Ser Ala Asn
115 120 125
Ser Tyr Ala Tyr His Arg Ser Val Asp Asp His Ser Asp His Ala Gly
130 135 140
Ser Arg Ala Asp Ala Lys Ser Ser Ser Ala Ala Ser Ala Ser Arg Arg
145 150 155 160
Arg Gly Val Asn Asp Cys Gly Ala Asp Ala Thr Ala Met Tyr Gly Tyr
165 170 175
Met His His Ser Tyr Ala Ala Val Ser Thr Val Asn Tyr Trp Ser Val
180 185 190
<210> 50
<211> 834
<212> DNA
<213> Oryza sativa
<400> 50
atggccatgc accctctcgc ccaggggcac ccccaggcgt ggccatgggg tgtagccatg 60
tacaccaacc tgcactacca ccaccactac gagagggagc acctgttcga gaagccgctg 120
acgccgagcg acgtcggcaa gctcaacagg ctggtgatcc ccaagcagca cgccgagagg 180
tacttcccgc tcggcggcgg cgactccggt gagaagggcc tcctcctctc cttcgaggac 240
gagtccggca agccatggcg gttccgctac tcctactgga ccagcagcca gagctacgtg 300
ctcaccaagg gctggagccg ctacgtcaag gagaagcgcc tcgacgccgg cgacgtcgtc 360
cacttcgagc gcgtccgcgg cctcggcgcc gccgaccgcc tcttcatcgg ctgcaggcgc 420
cgcggcgaga gcgcgcccgc gccgccgccc gccgttcgcg tcacgccgca gccgcctgcc 480
ctcaacggcg gcgagcagca gccgtggagc ccaatgtgtt acagcacgtc gggctcgtcc 540
tacgacccta ccagccctgc caattcatat gcctaccatc gctccgtaga ccaagatcac 600
agcgacatac tacacgcagg agagtcgcag agagaagcag acgccaagag cagcagcgcg 660
gcgtcggcgc cgccgccgtc gaggcggctc aggctgttcg gcgttaacct cgactgcggc 720
ccggagccgg aggcggatca ggcgacggca atgtacggct acatgcacca ccagagcccc 780
tacgccgcag tgtctacagt gccaaattac tggtcagtat tttttcagtt ttaa 834
<210> 51
<211> 279
<212> PRT
<213> Oryza sativa
<400> 51
Met Ala Met Asn His Pro Leu Phe Ser Gln Glu Gln Pro Gln Ser Trp
1 5 10 15
Pro Trp Gly Val Ala Met Tyr Ala Asn Phe His Tyr His His His Tyr
20 25 30
Glu Lys Glu His Met Phe Glu Lys Pro Leu Thr Pro Ser Asp Val Gly
35 40 45
Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg Tyr Phe
50 55 60
Pro Leu Gly Ala Gly Asp Ala Ala Asp Lys Gly Leu Ile Leu Ser Phe
65 70 75 80
Glu Asp Glu Ala Gly Ala Pro Trp Arg Phe Arg Tyr Ser Tyr Trp Thr
85 90 95
Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val Lys
100 105 110
Glu Lys Arg Leu Asp Ala Gly Asp Val Val His Phe Glu Arg Val Arg
115 120 125
Gly Ser Phe Gly Val Gly Asp Arg Leu Phe Ile Gly Cys Arg Arg Arg
130 135 140
Gly Asp Ala Ala Ala Ala Gln Thr Pro Ala Pro Pro Pro Ala Val Arg
145 150 155 160
Val Ala Pro Ala Ala Gln Asn Ala Gly Glu Gln Gln Pro Trp Ser Pro
165 170 175
Met Cys Tyr Ser Thr Ser Gly Gly Gly Ser Tyr Pro Thr Ser Pro Ala
180 185 190
Asn Ser Tyr Ala Tyr Arg Arg Ala Ala Asp His Asp His Gly Asp Met
195 200 205
His His Ala Asp Glu Ser Pro Arg Asp Thr Asp Ser Pro Ser Phe Ser
210 215 220
Ala Gly Ser Ala Pro Ser Arg Arg Leu Arg Leu Phe Gly Val Asn Leu
225 230 235 240
Asp Cys Gly Pro Glu Pro Glu Ala Asp Thr Thr Ala Ala Ala Thr Met
245 250 255
Tyr Gly Tyr Met His Gln Gln Ser Ser Tyr Ala Ala Met Ser Ala Val
260 265 270
Pro Ser Tyr Trp Gly Asn Ser
275
<210> 52
<211> 840
<212> DNA
<213> Oryza sativa
<400> 52
atggccatga accaccctct cttctcccag gagcaacccc agtcctggcc atggggtgtg 60
gccatgtacg ccaacttcca ctaccaccac cactacgaga aggagcacat gtttgagaag 120
cccctgacgc ccagtgacgt ggggaagctg aaccggctgg tgatccccaa gcagcacgcc 180
gagaggtact tccccctcgg cgccggcgac gccgccgaca agggcctgat cctgtcgttc 240
gaggacgagg ccggcgcgcc gtggcggttc aggtactcct actggacgag cagccagagc 300
tacgtgctca ccaagggctg gagccgctac gtcaaggaga agcgcctcga cgccggcgac 360
gtcgtgcact tcgagagggt gcgcggctcc ttcggcgtcg gcgaccgtct cttcatcggc 420
tgcaggcgcc gcggcgacgc cgccgccgcg caaacacccg caccgccgcc cgccgtgcgc 480
gtcgccccgg ctgcacagaa cgccggcgag cagcagccgt ggagcccaat gtgttacagc 540
acgtcgggcg gcggctcata ccctaccagc ccagccaact cctacgccta ccgccgcgca 600
gcagatcatg atcacgggga catgcaccat gcagacgagt ctccgcgcga cacggacagc 660
ccaagcttca gtgcaggctc ggcgccatcg aggcggctca ggctgttcgg cgtcaacctc 720
gactgcgggc cagagccgga ggcagacacc acggcagcgg caacaatgta cggctacatg 780
caccagcaga gctcctatgc tgccatgtct gcagtaccca gttactgggg caattcataa 840
<210> 53
<211> 412
<212> PRT
<213> Oryza sativa
<400> 53
Met Glu Phe Thr Thr Ser Ser Arg Phe Ser Lys Glu Glu Glu Asp Glu
1 5 10 15
Glu Gln Asp Glu Ala Gly Arg Arg Glu Ile Pro Phe Met Thr Ala Thr
20 25 30
Ala Glu Ala Ala Pro Ala Pro Thr Ser Ser Ser Ser Ser Pro Ala His
35 40 45
His Ala Ala Ser Ala Ser Ala Ser Ala Ser Ala Ser Gly Ser Ser Thr
50 55 60
Pro Phe Arg Ser Asp Asp Gly Ala Gly Ala Ser Gly Ser Gly Gly Gly
65 70 75 80
Gly Gly Gly Gly Gly Glu Ala Glu Val Val Glu Lys Glu His Met Phe
85 90 95
Asp Lys Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val
100 105 110
Ile Pro Lys Gln Tyr Ala Glu Lys Tyr Phe Pro Leu Asp Ala Ala Ala
115 120 125
Asn Glu Lys Gly Leu Leu Leu Asn Phe Glu Asp Arg Ala Gly Lys Pro
130 135 140
Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met
145 150 155 160
Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu Asp Ala Gly
165 170 175
Asp Thr Val Ser Phe Ser Arg Gly Ile Gly Asp Glu Ala Ala Arg His
180 185 190
Arg Leu Phe Ile Asp Trp Lys Arg Arg Ala Asp Thr Arg Asp Pro Leu
195 200 205
Arg Leu Pro Arg Gly Leu Pro Leu Pro Met Pro Leu Thr Ser His Tyr
210 215 220
Ala Pro Trp Gly Ile Gly Gly Gly Gly Gly Phe Phe Val Gln Pro Ser
225 230 235 240
Pro Pro Ala Thr Leu Tyr Glu His Arg Leu Arg Gln Gly Leu Asp Phe
245 250 255
Arg Ala Phe Asn Pro Ala Ala Ala Met Gly Arg Gln Val Leu Leu Phe
260 265 270
Gly Ser Ala Arg Ile Pro Pro Gln Ala Pro Leu Leu Ala Arg Ala Pro
275 280 285
Ser Pro Leu His His His Tyr Thr Leu Gln Pro Ser Gly Asp Gly Val
290 295 300
Arg Ala Ala Gly Ser Pro Val Val Leu Asp Ser Val Pro Val Ile Glu
305 310 315 320
Ser Pro Thr Thr Ala Ala Lys Arg Val Arg Leu Phe Gly Val Asn Leu
325 330 335
Asp Asn Pro His Ala Gly Gly Gly Gly Gly Ala Ala Ala Gly Glu Ser
340 345 350
Ser Asn His Gly Asn Ala Leu Ser Leu Gln Thr Pro Ala Trp Met Arg
355 360 365
Arg Asp Pro Thr Leu Arg Leu Leu Glu Leu Pro Pro His His His His
370 375 380
Gly Ala Glu Ser Ser Ala Ala Ser Ser Pro Ser Ser Ser Ser Ser Ser
385 390 395 400
Lys Arg Asp Ala His Ser Ala Leu Asp Leu Asp Leu
405 410
<210> 54
<211> 1239
<212> DNA
<213> Oryza sativa
<400> 54
atggagttca ctacaagcag taggttttct aaagaagagg aggacgagga gcaggatgag 60
gcgggaaggc gagagatccc cttcatgacg gccacggccg aagccgcgcc tgcgcccacg 120
tcgtcgtcgt cgtctcctgc tcatcacgcg gcttccgcgt cggcgtcggc gtctgcgtca 180
gggagcagca ctccctttcg ctccgacgat ggcgccgggg cgtctgggag cggcggcggc 240
ggcggcggcg gcggagaagc ggaggtggtg gagaaggagc acatgttcga caaggtggtg 300
acgccgagcg acgttgggaa gctgaaccgg ctggtgatcc cgaagcagta cgccgagaag 360
tacttcccgc tggacgcggc ggcgaacgag aagggcctcc tgctcaactt cgaggaccgc 420
gcggggaagc catggcggtt ccgctactcc tactggaaca gcagccagag ctacgtgatg 480
accaaggggt ggagccgctt cgtcaaggag aagcgcctcg acgccgggga caccgtctcc 540
ttctcccgcg gcatcggcga cgaggcggcg cggcaccgcc tcttcatcga ctggaagcgc 600
cgcgccgaca cccgcgaccc gctccggctg ccccgcgggc tgccgctccc gatgccgctc 660
acgtcgcact acgccccgtg ggggatcggc ggcggagggg gattcttcgt gcagccctcg 720
ccgccggcca cgctctacga gcaccgcctc aggcaaggcc tcgacttccg cgccttcaac 780
cccgccgccg cgatggggag gcaggtcctc ctgttcggct cggcgaggat tcctccgcaa 840
gcaccactgc tggcgcgcgc gccgtcgccg ctgcaccacc actacacgct gcagccgagc 900
ggcgatggtg taagggcggc gggctcaccg gtggtgctcg actcggttcc ggtcatcgag 960
agccccacga cggccgcgaa gcgcgtgcgg ctgttcggcg tgaacctcga caacccgcat 1020
gccggcggcg gcggcggcgc cgccgccggc gagtcgagca atcatggcaa tgcactgtca 1080
ttgcagacgc ccgcgtggat gaggagggat ccaacactgc ggctgctgga attgcctcct 1140
caccaccacc atggcgccga gtcgtccgct gcatcgtctc cgtcgtcgtc gtcttcctcc 1200
aagagggacg cgcattcggc cttggatctc gatctgtag 1239
<210> 55
<211> 951
<212> DNA
<213> Oryza sativa
<400> 55
atggagtttg ctacaacgag tagtaggttt tccaaggaag aggaggagga ggaggaaggg 60
gaacaggaga tggagcagga gcaggatgaa gaggaggagg aggcggaggc ctcgccccgc 120
gagatcccct tcatgacgtc ggcggcggcg gcggccaccg cctcatcgtc ctccccgaca 180
tcggtctccc cttccgccac cgcttccgcg gcggcgtcca cgtcggcgtc gggctctccc 240
ttccggtcga gcgacggtgc gggagcgtcg gggagtggcg gcggcggtgg cggcgaggac 300
gtggaggtga tcgagaagga gcacatgttc gacaaggtgg tgacgccgag cgacgtgggg 360
aagctgaacc ggctggtgat cccgaagcag cacgccgaga agtacttccc gctggactcg 420
gcggcgaacg agaagggcct tctcctcagc ttcgaggacc gaaccggcaa gctatggcgc 480
ttccgctact cctactggaa cagcagccag agctacgtca tgaccaaggg ttggagccgc 540
ttcgtcaagg agaagcgcct cgacgccggg gacaccgtct ccttctgccg cggcgccgcc 600
gaggccaccc gcgaccgcct cttcatcgac tggaagcgcc gcgccgacgt ccgcgacccg 660
caccgcttcc agcgcctacc gctccccatg acctcgccct acggcccgtg gggcggcggc 720
gcgggcgctt cttcatgccg cccgcgccgc ccgccacgct ctacgagcat caccgctttc 780
gccagggctt cgacttccgc aacatcaacc ccgctgtgcc ggcgaggcag ctcgtcttct 840
tcggctcccc agggacgggg attcatcagc acccgccctt gccaccgccg ccgtcgccac 900
ctccgcctcc tcaccaactc cacattacgg tgcaccaccc gagccccgta g 951
<210> 56
<211> 316
<212> PRT
<213> Oryza sativa
<400> 56
Met Glu Phe Ala Thr Thr Ser Ser Arg Phe Ser Lys Glu Glu Glu Glu
1 5 10 15
Glu Glu Glu Gly Glu Gln Glu Met Glu Gln Glu Gln Asp Glu Glu Glu
20 25 30
Glu Glu Ala Glu Ala Ser Pro Arg Glu Ile Pro Phe Met Thr Ser Ala
35 40 45
Ala Ala Ala Ala Thr Ala Ser Ser Ser Ser Pro Thr Ser Val Ser Pro
50 55 60
Ser Ala Thr Ala Ser Ala Ala Ala Ser Thr Ser Ala Ser Gly Ser Pro
65 70 75 80
Phe Arg Ser Ser Asp Gly Ala Gly Ala Ser Gly Ser Gly Gly Gly Gly
85 90 95
Gly Gly Glu Asp Val Glu Val Ile Glu Lys Glu His Met Phe Asp Lys
100 105 110
Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro
115 120 125
Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asp Ser Ala Ala Asn Glu
130 135 140
Lys Gly Leu Leu Leu Ser Phe Glu Asp Arg Thr Gly Lys Leu Trp Arg
145 150 155 160
Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met Thr Lys
165 170 175
Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu Asp Ala Gly Asp Thr
180 185 190
Val Ser Phe Cys Arg Gly Ala Ala Glu Ala Thr Arg Asp Arg Leu Phe
195 200 205
Ile Asp Trp Lys Arg Arg Ala Asp Val Arg Asp Pro His Arg Phe Gln
210 215 220
Arg Leu Pro Leu Pro Met Thr Ser Pro Tyr Gly Pro Trp Gly Gly Gly
225 230 235 240
Ala Gly Ala Ser Ser Cys Arg Pro Arg Arg Pro Pro Arg Ser Thr Ser
245 250 255
Ile Thr Ala Phe Ala Arg Ala Ser Thr Ser Ala Thr Ser Thr Pro Leu
260 265 270
Cys Arg Arg Gly Ser Ser Ser Ser Ser Ala Pro Gln Gly Arg Gly Phe
275 280 285
Ile Ser Thr Arg Pro Cys His Arg Arg Arg Arg His Leu Arg Leu Leu
290 295 300
Thr Asn Ser Thr Leu Arg Cys Thr Thr Arg Ala Pro
305 310 315
<210> 57
<211> 936
<212> DNA
<213> Oryza sativa
<400> 57
atggagttca tcacgccaat cgtgaggccg gcatcggcgg cggcgggcgg cggcgaggtg 60
caggagagtg gtgggaggag cttggcggcg gtggagaagg agcacatgtt cgacaaggtg 120
gtgacgccga gcgacgtggg gaagctgaac cggctggtga tcccgaagca gcacgcggag 180
aagtacttcc cgctggacgc ggcgtccaac gagaaggggc tcctgctcag cttcgaggac 240
cgcacgggga agccatggcg gttccgctac tcctactgga acagcagcca gagctacgtg 300
atgaccaagg ggtggagccg cttcgtcaag gagaagcgac tcgacgccgg ggacaccgtc 360
tccttcggcc gcggcgtcgg cgaggccgcg cgcgggaggc tcttcatcga ctggcgccgc 420
cgccccgacg tcgtcgccgc gctccagccg cccacgcacc gcttcgccca ccacctccct 480
tcctccatcc ccttcgctcc ctgggcgcac caccacggac acggagccgc cgccgccgcc 540
gccgccgccg ccggcgccag gtttctcctg cctccctcct cgactcccat ctacgaccac 600
caccgccgac acgcccacgc cgtcgggtac gacgcgtacg ccgcggccac cagcaggcag 660
gtgctgttct accggccgtt gccgccgcag cagcagcatc atcccgcggt ggtgctggag 720
tcggtgccgg tgcgcatgac ggcggggcac gcggagccgc cgtcggctcc gtcgaagcga 780
gttcggctgt tcggggtgaa cctcgactgc gcgaattccg aacaagacca cgccggcgtg 840
gtcgggaaga cggcgccgcc gccgctgcca tcgccgccgt catcatcgtc atcttcctcc 900
gggaaagcga ggtgctcctt gaaccttgac ttgtga 936
<210> 58
<211> 311
<212> PRT
<213> Oryza sativa
<400> 58
Met Glu Phe Ile Thr Pro Ile Val Arg Pro Ala Ser Ala Ala Ala Gly
1 5 10 15
Gly Gly Glu Val Gln Glu Ser Gly Gly Arg Ser Leu Ala Ala Val Glu
20 25 30
Lys Glu His Met Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly Lys
35 40 45
Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Lys Tyr Phe Pro
50 55 60
Leu Asp Ala Ala Ser Asn Glu Lys Gly Leu Leu Leu Ser Phe Glu Asp
65 70 75 80
Arg Thr Gly Lys Pro Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser
85 90 95
Gln Ser Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys
100 105 110
Arg Leu Asp Ala Gly Asp Thr Val Ser Phe Gly Arg Gly Val Gly Glu
115 120 125
Ala Ala Arg Gly Arg Leu Phe Ile Asp Trp Arg Arg Arg Pro Asp Val
130 135 140
Val Ala Ala Leu Gln Pro Pro Thr His Arg Phe Ala His His Leu Pro
145 150 155 160
Ser Ser Ile Pro Phe Ala Pro Trp Ala His His His Gly His Gly Ala
165 170 175
Ala Ala Ala Ala Ala Ala Ala Ala Gly Ala Arg Phe Leu Leu Pro Pro
180 185 190
Ser Ser Thr Pro Ile Tyr Asp His His Arg Arg His Ala His Ala Val
195 200 205
Gly Tyr Asp Ala Tyr Ala Ala Ala Thr Ser Arg Gln Val Leu Phe Tyr
210 215 220
Arg Pro Leu Pro Pro Gln Gln Gln His His Pro Ala Val Val Leu Glu
225 230 235 240
Ser Val Pro Val Arg Met Thr Ala Gly His Ala Glu Pro Pro Ser Ala
245 250 255
Pro Ser Lys Arg Val Arg Leu Phe Gly Val Asn Leu Asp Cys Ala Asn
260 265 270
Ser Glu Gln Asp His Ala Gly Val Val Gly Lys Thr Ala Pro Pro Pro
275 280 285
Leu Pro Ser Pro Pro Ser Ser Ser Ser Ser Ser Ser Gly Lys Ala Arg
290 295 300
Cys Ser Leu Asn Leu Asp Leu
305 310
<210> 59
<211> 1182
<212> DNA
<213> Oryza sativa
<400> 59
atggacagct ccagctgcct ggtggatgat accaacagcg gcggctcgtc cacggacaag 60
ctgagggcgt tggccgccgc ggcggcggag acggcgccgc tggagcgcat ggggagcggg 120
gcgagcgcgg tggtggacgc ggccgagcct ggcgcggagg cggactccgg gtccggggga 180
cgtgtgtgcg gcggcggcgg cggcggtgcc ggcggtgcgg gagggaagct gccgtcgtcc 240
aagttcaagg gcgtcgtgcc gcagcccaac gggaggtggg gcgcgcagat ctacgagcgg 300
caccagcggg tgtggctcgg cacgttcgcc ggggaggacg acgccgcgcg cgcctacgac 360
gtcgccgcgc agcgcttccg cggccgcgac gccgtcacca acttccgccc gctcgccgag 420
gccgacccgg acgccgccgc cgagcttcgc ttcctcgcca cgcgctccaa ggccgaggtc 480
gtcgacatgc tccgcaagca cacctacttc gacgagctcg cgcagagcaa gcgcaccttc 540
gccgcctcca cgccgtcggc cgcgaccacc accgcctccc tctccaacgg ccacctctcg 600
tcgccccgct cccccttcgc gcccgccgcg gcgcgcgacc acctgttcga caagacggtc 660
accccgagcg acgtgggcaa gctgaacagg ctcgtcatac cgaagcagca cgccgagaag 720
cacttcccgc tacagctccc gtccgccggc ggcgagagca agggtgtcct cctcaacttc 780
gaggacgccg ccggcaaggt gtggcggttc cggtactcgt actggaacag cagccagagc 840
tacgtgctaa ccaagggctg gagccgcttc gtcaaggaga agggtctcca cgccggcgac 900
gtcgtcggct tctaccgctc cgccgccagt gccggcgacg acggcaagct cttcatcgac 960
tgcaagttag tacggtcgac cggcgccgcc ctcgcgtcgc ccgctgatca gccagcgccg 1020
tcgccggtga aggccgtcag gctcttcggc gtggacctgc tcacggcgcc ggcgccggtc 1080
gaacagatgg ccgggtgcaa gagagccagg gacttggcgg cgacgacgcc tccacaagcg 1140
gcggcgttca agaagcaatg catagagctg gcactagtat ag 1182
<210> 60
<211> 393
<212> PRT
<213> Oryza sativa
<400> 60
Met Asp Ser Ser Ser Cys Leu Val Asp Asp Thr Asn Ser Gly Gly Ser
1 5 10 15
Ser Thr Asp Lys Leu Arg Ala Leu Ala Ala Ala Ala Ala Glu Thr Ala
20 25 30
Pro Leu Glu Arg Met Gly Ser Gly Ala Ser Ala Val Val Asp Ala Ala
35 40 45
Glu Pro Gly Ala Glu Ala Asp Ser Gly Ser Gly Gly Arg Val Cys Gly
50 55 60
Gly Gly Gly Gly Gly Ala Gly Gly Ala Gly Gly Lys Leu Pro Ser Ser
65 70 75 80
Lys Phe Lys Gly Val Val Pro Gln Pro Asn Gly Arg Trp Gly Ala Gln
85 90 95
Ile Tyr Glu Arg His Gln Arg Val Trp Leu Gly Thr Phe Ala Gly Glu
100 105 110
Asp Asp Ala Ala Arg Ala Tyr Asp Val Ala Ala Gln Arg Phe Arg Gly
115 120 125
Arg Asp Ala Val Thr Asn Phe Arg Pro Leu Ala Glu Ala Asp Pro Asp
130 135 140
Ala Ala Ala Glu Leu Arg Phe Leu Ala Thr Arg Ser Lys Ala Glu Val
145 150 155 160
Val Asp Met Leu Arg Lys His Thr Tyr Phe Asp Glu Leu Ala Gln Ser
165 170 175
Lys Arg Thr Phe Ala Ala Ser Thr Pro Ser Ala Ala Thr Thr Thr Ala
180 185 190
Ser Leu Ser Asn Gly His Leu Ser Ser Pro Arg Ser Pro Phe Ala Pro
195 200 205
Ala Ala Ala Arg Asp His Leu Phe Asp Lys Thr Val Thr Pro Ser Asp
210 215 220
Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Lys
225 230 235 240
His Phe Pro Leu Gln Leu Pro Ser Ala Gly Gly Glu Ser Lys Gly Val
245 250 255
Leu Leu Asn Phe Glu Asp Ala Ala Gly Lys Val Trp Arg Phe Arg Tyr
260 265 270
Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser
275 280 285
Arg Phe Val Lys Glu Lys Gly Leu His Ala Gly Asp Val Val Gly Phe
290 295 300
Tyr Arg Ser Ala Ala Ser Ala Gly Asp Asp Gly Lys Leu Phe Ile Asp
305 310 315 320
Cys Lys Leu Val Arg Ser Thr Gly Ala Ala Leu Ala Ser Pro Ala Asp
325 330 335
Gln Pro Ala Pro Ser Pro Val Lys Ala Val Arg Leu Phe Gly Val Asp
340 345 350
Leu Leu Thr Ala Pro Ala Pro Val Glu Gln Met Ala Gly Cys Lys Arg
355 360 365
Ala Arg Asp Leu Ala Ala Thr Thr Pro Pro Gln Ala Ala Ala Phe Lys
370 375 380
Lys Gln Cys Ile Glu Leu Ala Leu Val
385 390
<210> 61
<211> 939
<212> DNA
<213> Oryza sativa
<400> 61
atggagttca ccccaatttc gccgccgacg agggtcgccg gcggtgagga ggattccgag 60
aggggggcgg cggcgtgggc ggtggtggag aaggagcaca tgtttgagaa ggtcgtgacg 120
ccgagcgacg tggggaagct gaaccgattg gtcatcccca agcagcacgc cgagaggtac 180
ttcccgctcg acgccgcggc gggcgccggc ggcggcggtg gtggcggcgg tggcggcggc 240
ggggggaagg ggctggtgct gagcttcgag gacaggacgg ggaaggcgtg gaggttccgg 300
tactcgtact ggaacagcag ccagagctac gtgatgacca aagggtggag ccgcttcgtc 360
aaggagaagc gcctcggcgc cggcgacacc gtgtcgttcg gccgcggcct cggcgacgcc 420
gcccgcggcc gcctcttcat cgacttccgc cgccgccgcc aggacgccgg cagcttcatg 480
ttcccgccga cggcggcgcc gccgtcgcac tcgcaccacc atcatcagcg acaccacccg 540
ccgctcccgt ccgtgcccct ttgcccgtgg cgagactaca ccaccgccta tggcggcggc 600
tacggctacg gctacggcgg cggctccacc ccggcgtcca gccgccacgt gctgttcctc 660
cggccgcagg tgccggccgc tgtggtgctc aagtcggtgc cggtgcacgt cgcggccacc 720
tcggcggtgc aggaggcggc gacgacgaca aggccgaagc gtgtccggct gttcggggtg 780
aacctcgact gcccggcggc catggacgac gacgacgaca tcgccggagc ggcgagccgg 840
acggcagcgt cgtctctcct gcagctcccc tcgccgtcgt cctcgacgtc gtcgtcgacg 900
gcggggaaga agatgtgctc cttggatctt gggttgtga 939
<210> 62
<211> 312
<212> PRT
<213> Oryza sativa
<400> 62
Met Glu Phe Thr Pro Ile Ser Pro Pro Thr Arg Val Ala Gly Gly Glu
1 5 10 15
Glu Asp Ser Glu Arg Gly Ala Ala Ala Trp Ala Val Val Glu Lys Glu
20 25 30
His Met Phe Glu Lys Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn
35 40 45
Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg Tyr Phe Pro Leu Asp
50 55 60
Ala Ala Ala Gly Ala Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly
65 70 75 80
Gly Gly Lys Gly Leu Val Leu Ser Phe Glu Asp Arg Thr Gly Lys Ala
85 90 95
Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met
100 105 110
Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu Gly Ala Gly
115 120 125
Asp Thr Val Ser Phe Gly Arg Gly Leu Gly Asp Ala Ala Arg Gly Arg
130 135 140
Leu Phe Ile Asp Phe Arg Arg Arg Arg Gln Asp Ala Gly Ser Phe Met
145 150 155 160
Phe Pro Pro Thr Ala Ala Pro Pro Ser His Ser His His His His Gln
165 170 175
Arg His His Pro Pro Leu Pro Ser Val Pro Leu Cys Pro Trp Arg Asp
180 185 190
Tyr Thr Thr Ala Tyr Gly Gly Gly Tyr Gly Tyr Gly Tyr Gly Gly Gly
195 200 205
Ser Thr Pro Ala Ser Ser Arg His Val Leu Phe Leu Arg Pro Gln Val
210 215 220
Pro Ala Ala Val Val Leu Lys Ser Val Pro Val His Val Ala Ala Thr
225 230 235 240
Ser Ala Val Gln Glu Ala Ala Thr Thr Thr Arg Pro Lys Arg Val Arg
245 250 255
Leu Phe Gly Val Asn Leu Asp Cys Pro Ala Ala Met Asp Asp Asp Asp
260 265 270
Asp Ile Ala Gly Ala Ala Ser Arg Thr Ala Ala Ser Ser Leu Leu Gln
275 280 285
Leu Pro Ser Pro Ser Ser Ser Thr Ser Ser Ser Thr Ala Gly Lys Lys
290 295 300
Met Cys Ser Leu Asp Leu Gly Leu
305 310
<210> 63
<211> 337
<212> PRT
<213> Glycine max
<400> 63
Met Ser Ile Asn His Tyr Ser Met Asp Leu Pro Glu Pro Thr Leu Trp
1 5 10 15
Trp Pro His Pro His His Gln Gln Gln Gln Leu Thr Leu Met Asp Pro
20 25 30
Asp Pro Leu Arg Leu Asn Leu Asn Ser Asp Asp Gly Asn Gly Asn Asp
35 40 45
Asn Asp Asn Asp Glu Asn Gln Thr Thr Thr Thr Gly Gly Glu Gln Glu
50 55 60
Ile Leu Asp Asp Lys Glu Pro Met Phe Glu Lys Pro Leu Thr Pro Ser
65 70 75 80
Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu
85 90 95
Lys Tyr Phe Pro Leu Ser Gly Asp Ser Gly Gly Ser Glu Cys Lys Gly
100 105 110
Leu Leu Leu Ser Phe Glu Asp Glu Ser Gly Lys Cys Trp Arg Phe Arg
115 120 125
Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp
130 135 140
Ser Arg Tyr Val Lys Asp Lys Arg Leu Asp Ala Gly Asp Val Val Leu
145 150 155 160
Phe Glu Arg His Arg Val Asp Ala Gln Arg Leu Phe Ile Gly Trp Arg
165 170 175
Arg Arg Arg Gln Ser Asp Ala Ala Leu Pro Pro Ala His Val Ser Ser
180 185 190
Arg Lys Ser Gly Gly Gly Asp Gly Asn Ser Asn Lys Asn Glu Gly Trp
195 200 205
Thr Arg Gly Phe Tyr Ser Ala His His Pro Tyr Pro Thr His His Leu
210 215 220
His His His Gln Pro Ser Pro Tyr Gln Gln Gln His Asp Cys Leu His
225 230 235 240
Ala Gly Arg Gly Ser Gln Gly Gln Asn Gln Arg Met Arg Pro Val Gly
245 250 255
Asn Asn Ser Ser Ser Ser Ser Ser Ser Ser Arg Val Leu Arg Leu Phe
260 265 270
Gly Val Asp Met Glu Cys Gln Pro Glu His Asp Asp Ser Gly Pro Ser
275 280 285
Thr Pro Gln Cys Ser Tyr Asn Ser Asn Asn Met Leu Pro Ser Thr Gln
290 295 300
Gly Thr Asp His Ser His His Asn Phe Tyr Gln Gln Gln Pro Ser Asn
305 310 315 320
Ser Asn Pro Ser Pro His His Met Met Val His His Gln Pro Tyr Tyr
325 330 335
Tyr
<210> 64
<211> 1014
<212> DNA
<213> Glycine max
<400> 64
atgtccataa accactactc catggacctt cccgaaccga cactctggtg gccacaccca 60
caccaccaac aacaacaact aaccttaatg gatcctgacc ctctccgtct caacctcaat 120
agcgacgatg gcaatggcaa tgacaacgac aacgacgaaa atcaaacaac cacaacagga 180
ggagaacaag aaatattaga cgataaagaa ccgatgttcg agaagccctt aaccccgagc 240
gacgtgggga agctgaaccg tctcgtaatc ccgaagcagc acgcggagaa gtacttccca 300
ctgagtggtg actcgggcgg gagcgagtgc aaggggctgt tactgagttt cgaggacgag 360
tcggggaagt gttggcgctt ccgctactcg tactggaaca gcagccagag ctacgtgctc 420
accaaagggt ggagccgcta cgtcaaggac aagcgccttg acgcgggcga cgtcgttttg 480
ttcgagcgtc accgcgtcga cgcgcagcgc ctcttcatcg ggtggaggcg caggcggcag 540
agcgatgccg ccttgccgcc tgcgcacgtt agcagtagga agagtggtgg tggtgatggg 600
aatagtaata agaatgaggg gtggaccaga gggttctatt ctgcgcatca tccttatcct 660
acgcatcatc ttcatcatca tcagccctcg ccataccaac aacaacatga ctgtcttcat 720
gcaggtagag ggtcccaagg tcagaaccaa aggatgagac cagtgggaaa caacagttct 780
agctctagtt cgagttcaag ggtacttagg ctgttcgggg tcgacatgga atgccaaccc 840
gaacatgatg attctggtcc ctccacaccc caatgctcct acaatagtaa caacatgttg 900
ccatcaacac agggcacaga tcattcccat cacaatttct accaacagca accttctaat 960
tccaatcctt cccctcatca catgatggta catcaccaac catactacta ctag 1014
<210> 65
<211> 344
<212> PRT
<213> Glycine max
<400> 65
Met Ser Thr Asn His Tyr Thr Met Asp Leu Pro Glu Pro Thr Leu Trp
1 5 10 15
Trp Pro His Pro His Gln Gln Gln Leu Thr Leu Ile Asp Pro Asp Pro
20 25 30
Leu Pro Leu Asn Leu Asn Asn Asp Asp Asn Asp Asn Gly Asp Asp Asn
35 40 45
Asp Asn Asp Glu Asn Gln Thr Val Thr Thr Thr Thr Thr Gly Gly Glu
50 55 60
Glu Glu Ile Ile Asn Asn Lys Glu Pro Met Phe Glu Lys Pro Leu Thr
65 70 75 80
Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His
85 90 95
Ala Glu Lys Tyr Phe Pro Leu Ser Gly Gly Asp Ser Gly Ser Ser Glu
100 105 110
Cys Lys Gly Leu Leu Leu Ser Phe Glu Asp Glu Ser Gly Lys Cys Trp
115 120 125
Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr
130 135 140
Lys Gly Trp Ser Arg Tyr Val Lys Asp Lys Arg Leu Asp Ala Gly Asp
145 150 155 160
Val Val Leu Phe Gln Arg His Arg Ala Asp Ala Gln Arg Leu Phe Ile
165 170 175
Gly Trp Arg Arg Arg Arg Gln Ser Asp Ala Leu Pro Pro Pro Ala His
180 185 190
Val Ser Ser Arg Lys Ser Gly Gly Asp Gly Asn Ser Ser Lys Asn Glu
195 200 205
Gly Asp Val Gly Val Gly Trp Thr Arg Gly Phe Tyr Pro Ala His His
210 215 220
Pro Tyr Pro Thr His His His His Pro Ser Pro Tyr His His Gln Gln
225 230 235 240
Asp Asp Ser Leu His Ala Val Arg Gly Ser Gln Gly Gln Asn Gln Arg
245 250 255
Thr Arg Pro Val Gly Asn Ser Ser Ser Ser Ser Ser Ser Ser Ser Arg
260 265 270
Val Leu Arg Leu Phe Gly Val Asn Met Glu Cys Gln Pro Glu His Asp
275 280 285
Asp Ser Gly Pro Ser Thr Pro Gln Cys Ser Tyr Asn Thr Asn Asn Ile
290 295 300
Leu Pro Ser Thr Gln Gly Thr Asp Ile His Ser His Leu Asn Phe Tyr
305 310 315 320
Gln Gln Gln Gln Thr Ser Asn Ser Lys Pro Pro Pro His His Met Met
325 330 335
Ile Arg His Gln Pro Tyr Tyr Tyr
340
<210> 66
<211> 1035
<212> DNA
<213> Glycine max
<400> 66
atgtcgacaa accactacac catggacctt cccgaaccaa cactctggtg gccacaccca 60
caccaacaac aactaacctt aatagatcca gaccctctcc ctctgaacct caacaacgac 120
gacaacgaca atggcgacga caacgacaac gacgaaaacc aaacagttac aacaaccaca 180
acaggaggag aagaagaaat aataaacaat aaagaaccga tgttcgagaa gccgctaacc 240
ccgagcgacg tggggaagct gaaccgcctc gtaatcccga agcagcacgc tgagaagtac 300
tttccactga gtggtggtga ctcgggcagt agcgagtgca aggggctgtt actgagtttc 360
gaggacgagt cggggaagtg ctggcgcttc cgctactcgt actggaacag cagccagagc 420
tacgtgctca ccaaagggtg gagccgttac gtgaaggaca agcgcctcga tgcgggagat 480
gtcgttttat tccagcgcca ccgcgccgac gcgcagcgcc tcttcatcgg ctggaggcgc 540
aggcggcaga gcgacgccct gccgccgcct gcgcacgtta gcagcaggaa gagtggtggt 600
gatgggaata gtagtaagaa tgagggtgat gtgggcgtgg gctggaccag agggttctat 660
cctgcgcatc atccttatcc tacgcatcat catcatccct cgccatacca tcaccaacaa 720
gatgactctc ttcatgcagt tagagggtcc caaggtcaga accaaaggac gagaccagtg 780
ggaaacagca gttctagttc gagttcgagt tcaagggtac ttaggctatt cggggtcaac 840
atggaatgcc aacccgaaca tgatgattct ggaccctcca caccccaatg ctcctacaat 900
actaacaaca tattgccatc cacacagggc acagatattc attcccatct caatttctac 960
caacaacaac aaacttctaa ttccaagcct ccccctcatc acatgatgat acgtcaccaa 1020
ccatactact actag 1035
<210> 67
<211> 288
<212> PRT
<213> Glycine max
<400> 67
Met Ser Ser Ile Asn His Tyr Ser Pro Glu Thr Thr Leu Tyr Trp Thr
1 5 10 15
Asn Asp Gln Gln Gln Gln Ala Ala Met Trp Leu Ser Asn Ser His Thr
20 25 30
Pro Arg Phe Asn Leu Asn Asp Glu Glu Glu Glu Glu Glu Asp Asp Val
35 40 45
Ile Val Ser Asp Lys Ala Thr Asn Asn Leu Thr Gln Glu Glu Glu Lys
50 55 60
Val Ala Met Phe Glu Lys Pro Leu Thr Pro Ser Asp Val Gly Lys Leu
65 70 75 80
Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Lys His Phe Pro Leu
85 90 95
Asp Ser Ser Ala Ala Lys Gly Leu Leu Leu Ser Phe Glu Asp Glu Ser
100 105 110
Gly Lys Cys Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser
115 120 125
Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val Lys Asp Lys Arg Leu
130 135 140
His Ala Gly Asp Val Val Leu Phe His Arg His Arg Ser Leu Pro Gln
145 150 155 160
Arg Phe Phe Ile Ser Cys Ser Arg Arg Gln Pro Asn Pro Val Pro Ala
165 170 175
His Val Ser Thr Thr Arg Ser Ser Ala Ser Phe Tyr Ser Ala His Pro
180 185 190
Pro Tyr Pro Ala His His Phe Pro Phe Pro Tyr Gln Pro His Ser Leu
195 200 205
His Ala Pro Gly Gly Gly Ser Gln Gly Gln Asn Glu Thr Thr Pro Gly
210 215 220
Gly Asn Ser Ser Ser Ser Gly Ser Gly Arg Val Leu Arg Leu Phe Gly
225 230 235 240
Val Asn Met Glu Cys Gln Pro Asp Asn His Asn Asp Ser Gln Asn Ser
245 250 255
Thr Pro Glu Cys Ser Tyr Thr His Leu Tyr His His Gln Thr Ser Ser
260 265 270
Tyr Ser Ser Ser Ser Asn Pro His His His Met Val Pro Gln Gln Pro
275 280 285
<210> 68
<211> 867
<212> DNA
<213> Glycine max
<400> 68
atgtcatcga taaaccacta ttcaccggaa acaacactat actggaccaa cgaccaacag 60
caacaagccg ccatgtggct gagtaattcc cacaccccgc gtttcaatct gaacgacgag 120
gaggaggagg aggaagacga cgttatcgtt tcggacaagg ctactaataa cttgacgcaa 180
gaggaggaga aggtagccat gttcgagaag ccgttgacgc cgagcgacgt cgggaagctg 240
aaccggctcg tgattccgaa acagcacgcg gagaagcact tccctctcga ctcgtcggcg 300
gcgaaggggc tgttgctgag tttcgaggac gagtccggga agtgttggcg cttccgttac 360
tcttattgga acagtagcca gagttacgtt ttgaccaaag gatggagccg ttacgtcaaa 420
gacaaacgcc tccacgctgg cgacgtcgtt ttgttccaca gacaccgctc cctccctcaa 480
cgcttcttca tctcctgcag ccgccgccaa cccaacccgg tccccgctca cgttagcacc 540
accagatcct ccgcttcctt ctactctgcg cacccacctt atcctgcgca ccacttcccc 600
ttcccatacc aacctcactc tcttcatgca ccaggtggag ggtcccaagg acagaacgaa 660
acgacaccgg gagggaacag tagttcaagt ggcagtggca gggtgctgag gctctttggt 720
gtgaacatgg aatgccaacc tgataatcat aatgattccc agaactccac accagaatgc 780
tcctacaccc acttatacca ccatcaaacc tcttcttatt cttcttcttc aaaccctcac 840
catcacatgg tacctcaaca accataa 867
<210> 69
<211> 420
<212> PRT
<213> Glycine max
<400> 69
Met Glu Leu Met Gln Gln Val Lys Gly Asn Tyr Ser Asp Ser Arg Glu
1 5 10 15
Glu Glu Glu Glu Glu Glu Ala Ala Ala Ile Thr Arg Glu Ser Glu Ser
20 25 30
Ser Arg Leu His Gln Gln Asp Thr Ala Ser Asn Phe Gly Lys Lys Leu
35 40 45
Asp Leu Met Asp Leu Ser Leu Gly Ser Ser Lys Glu Glu Glu Glu Glu
50 55 60
Gly Asn Leu Gln Gln Gly Gly Gly Gly Val Val His His Ala His Gln
65 70 75 80
Val Val Glu Lys Glu His Met Phe Glu Lys Val Ala Thr Pro Ser Asp
85 90 95
Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Lys
100 105 110
Tyr Phe Pro Leu Asp Ser Ser Thr Asn Glu Lys Gly Leu Leu Leu Asn
115 120 125
Phe Glu Asp Arg Asn Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp
130 135 140
Asn Ser Ser Gln Ser Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val
145 150 155 160
Lys Glu Lys Lys Leu Asp Ala Gly Asp Ile Val Ser Phe Gln Arg Gly
165 170 175
Leu Gly Asp Leu Tyr Arg His Arg Leu Tyr Ile Asp Trp Lys Arg Arg
180 185 190
Pro Asp His Ala His Ala His Pro Pro His His His Asp Pro Leu Phe
195 200 205
Leu Pro Ser Ile Arg Leu Tyr Ser Leu Pro Pro Thr Met Pro Pro Arg
210 215 220
Tyr His His Asp His His Phe His His His Leu Asn Tyr Asn Asn Leu
225 230 235 240
Phe Thr Phe Gln Gln His Gln Tyr Gln Gln Leu Gly Ala Ala Thr Thr
245 250 255
Thr His His Asn Asn Tyr Gly Tyr Gln Asn Ser Gly Ser Gly Ser Leu
260 265 270
Tyr Tyr Leu Arg Ser Ser Met Ser Met Gly Gly Gly Asp Gln Asn Leu
275 280 285
Gln Gly Arg Gly Ser Asn Ile Val Pro Met Ile Ile Asp Ser Val Pro
290 295 300
Val Asn Val Ala His His Asn Asn Asn Arg His Gly Asn Gly Gly Ile
305 310 315 320
Thr Ser Gly Gly Thr Asn Cys Ser Gly Lys Arg Leu Arg Leu Phe Gly
325 330 335
Val Asn Met Glu Cys Ala Ser Ser Ala Glu Asp Ser Lys Glu Leu Ser
340 345 350
Ser Gly Ser Ala Ala His Val Thr Thr Ala Ala Ser Ser Ser Ser Leu
355 360 365
His His Gln Arg Leu Arg Val Pro Val Pro Val Pro Leu Glu Asp Pro
370 375 380
Leu Ser Ser Ser Ala Ala Ala Ala Ala Arg Phe Gly Asp His Lys Gly
385 390 395 400
Ala Ser Thr Gly Thr Ser Leu Leu Phe Asp Leu Asp Pro Ser Leu Gln
405 410 415
Tyr His Arg His
420
<210> 70
<211> 1263
<212> DNA
<213> Glycine max
<400> 70
atggagttga tgcaacaagt taaaggtaat tattctgata gcagggagga agaggaggaa 60
gaggaagctg cagcaatcac aagggaatca gaaagcagca ggttacacca acaagataca 120
gcatccaatt ttggaaagaa gctagacttg atggacttgt cactagggag cagcaaggaa 180
gaggaagagg aagggaattt gcaacaagga ggaggaggag tggttcatca tgctcaccaa 240
gtagtggaga aagaacacat gtttgagaaa gtggcgacac cgagcgacgt agggaagctg 300
aacaggctgg tgataccgaa gcagcacgcg gagaagtact tcccccttga ctcctcaacc 360
aacgagaagg gtctgctcct gaatttcgag gacaggaatg ggaaggtgtg gcgattcagg 420
tattcctatt ggaacagcag ccagagctat gtgatgacaa aagggtggag ccgctttgtt 480
aaggagaaga agctggatgc cggtgacatt gtctccttcc agcgtggcct tggggatttg 540
tatagacatc ggttgtatat agattggaag agaaggcccg atcatgctca tgctcatcca 600
cctcatcatc acgatccttt gtttcttccc tctatcagat tgtactctct ccctcccacc 660
atgccacctc gctaccacca cgatcatcac tttcaccacc atctcaatta caacaacctc 720
ttcacttttc agcaacacca gtaccagcag cttggtgctg ccactaccac tcatcacaac 780
aactatggtt accagaattc gggatctggt tcactctatt acctaaggtc ctctatgtca 840
atgggtggtg gtgatcaaaa cttgcaaggg agagggagca acattgtccc catgatcatt 900
gattctgtgc cggttaacgt tgctcatcac aacaacaatc gccatgggaa tgggggcatc 960
acgagtggtg gtactaattg tagtggaaaa cgactaaggc tatttggggt gaacatggaa 1020
tgcgcttctt cggcagaaga ttccaaagaa ttgtcctcgg gttcggcagc acacgtgacg 1080
acagctgctt cttcttcttc tcttcatcat cagcgcttga gggtgccagt gccagtgcca 1140
cttgaagatc cactttcgtc gtcagcagca gcagcagcaa ggtttgggga tcacaaaggg 1200
gccagtactg ggacttcgct gctgtttgat ttggatccct ctttgcagta tcatcgccac 1260
tga 1263
<210> 71
<211> 384
<212> PRT
<213> Glycine max
<400> 71
Met Asp Ala Ile Ser Cys Leu Asp Glu Ser Thr Thr Thr Glu Ser Leu
1 5 10 15
Ser Ile Ser Gln Ala Lys Pro Ser Ser Thr Ile Met Ser Ser Glu Lys
20 25 30
Ala Ser Pro Ser Pro Pro Pro Pro Asn Arg Leu Cys Arg Val Gly Ser
35 40 45
Gly Ala Ser Ala Val Val Asp Ser Asp Gly Gly Gly Gly Gly Gly Ser
50 55 60
Thr Glu Val Glu Ser Arg Lys Leu Pro Ser Ser Lys Tyr Lys Gly Val
65 70 75 80
Val Pro Gln Pro Asn Gly Arg Trp Gly Ser Gln Ile Tyr Glu Lys His
85 90 95
Gln Arg Val Trp Leu Gly Thr Phe Asn Glu Glu Asp Glu Ala Ala Arg
100 105 110
Ala Tyr Asp Val Ala Val Gln Arg Phe Arg Gly Lys Asp Ala Val Thr
115 120 125
Asn Phe Lys Pro Leu Ser Gly Thr Asp Asp Asp Asp Gly Glu Ser Glu
130 135 140
Phe Leu Asn Ser His Ser Lys Ser Glu Ile Val Asp Met Leu Arg Lys
145 150 155 160
His Thr Tyr Asn Asp Glu Leu Glu Gln Ser Lys Arg Ser Arg Gly Phe
165 170 175
Val Arg Arg Arg Gly Ser Ala Ala Gly Ala Gly Asn Gly Asn Ser Ile
180 185 190
Ser Gly Ala Cys Val Met Lys Ala Arg Glu Gln Leu Phe Gln Lys Ala
195 200 205
Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys
210 215 220
Gln His Ala Glu Lys His Phe Pro Leu Gln Ser Ala Ala Asn Gly Val
225 230 235 240
Ser Ala Thr Ala Thr Ala Ala Lys Gly Val Leu Leu Asn Phe Glu Asp
245 250 255
Val Gly Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser
260 265 270
Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys
275 280 285
Asn Leu Lys Ala Gly Asp Thr Val Cys Phe Gln Arg Ser Thr Gly Pro
290 295 300
Asp Arg Gln Leu Tyr Ile Asp Trp Lys Thr Arg Asn Val Val Asn Glu
305 310 315 320
Val Ala Leu Phe Gly Pro Val Val Glu Pro Ile Gln Met Val Arg Leu
325 330 335
Phe Gly Val Asn Ile Leu Lys Leu Pro Gly Ser Asp Ser Ile Ala Asn
340 345 350
Asn Asn Asn Ala Ser Gly Cys Cys Asn Gly Lys Arg Arg Glu Met Glu
355 360 365
Leu Phe Ser Leu Glu Cys Ser Lys Lys Pro Lys Ile Ile Gly Ala Leu
370 375 380
<210> 72
<211> 1155
<212> DNA
<213> Glycine max
<400> 72
atggatgcaa ttagttgcct ggatgagagc accaccaccg agtcactctc cataagtcag 60
gcgaagcctt cttcgacgat tatgtcgtcc gagaaggctt ctccttcccc gccgccgccg 120
aacaggctgt gccgcgtcgg tagcggtgct agcgcagtcg tggattccga cggcggcggc 180
gggggtggca gcaccgaggt ggagtcgcgg aagctcccct cgtccaagta taagggcgtc 240
gtgccccagc ccaacggccg ctggggctcg cagatttacg agaagcacca gcgcgtgtgg 300
ctgggaacgt tcaacgagga agacgaggcg gcgcgtgcgt acgacgtcgc cgtgcagcga 360
ttccgcggca aggacgccgt cacaaacttc aagccgctct ccggcaccga cgacgacgac 420
ggggaatcgg agtttctcaa ctcgcattcg aaatccgaga tcgtcgacat gctgcgtaag 480
catacgtaca atgacgagct ggaacaaagc aagcgcagcc gcggcttcgt acgtcggcgc 540
ggctccgccg ccggcgccgg aaacggaaac tcaatctccg gcgcgtgtgt tatgaaggcg 600
cgtgagcagc tattccagaa ggccgttacg ccgagcgacg ttgggaaact gaaccgtttg 660
gtgataccga agcagcacgc ggagaagcac tttcctttac agagcgctgc taacggcgtt 720
agcgcgacgg cgacggcggc gaagggcgtt ttgttgaact tcgaagacgt tggagggaaa 780
gtgtggcggt ttcgttactc gtattggaac agtagccaga gttacgtctt gaccaaaggt 840
tggagccggt tcgttaagga gaagaatctg aaagccggtg acacggtttg ttttcaacgg 900
tccactggac cggacaggca gctttacatc gattggaaga cgaggaatgt tgttaacgag 960
gtcgcgttgt tcggaccggt tgtcgaaccg atccagatgg ttcggctctt tggtgttaac 1020
attttgaaac tacccggttc agattctatc gccaataaca ataatgcaag tgggtgctgc 1080
aatggcaaga gaagagaaat ggaactcttt tcattagagt gtagcaagaa acctaagatt 1140
attggtgctt tgtag 1155
<210> 73
<211> 491
<212> PRT
<213> Glycine max
<400> 73
Met Glu Leu Met Gln Glu Val Lys Gly Tyr Ser Asp Gly Arg Glu Glu
1 5 10 15
Glu Glu Glu Glu Glu Glu Ala Ala Glu Glu Ile Ile Thr Arg Glu Glu
20 25 30
Ser Ser Arg Leu Leu His Gln His Gln Glu Ala Ala Gly Ser Asn Phe
35 40 45
Ile Ile Asn Asn Asn His His His His Gln His His His His His Thr
50 55 60
Thr Lys Gln Leu Asp Phe Met Asp Leu Ser Leu Gly Ser Ser Lys Asp
65 70 75 80
Glu Gly Asn Leu Gln Gly Ser Ser Ser Ser Val Tyr Ala His His His
85 90 95
His Ala Ala Ser Ala Ser Ser Ser Ala Asn Gly Asn Asn Asn Asn Ser
100 105 110
Ser Ser Ser Asn Leu Gln Gln Gln Gln Gln Gln Pro Ala Glu Lys Glu
115 120 125
His Met Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn
130 135 140
Arg Leu Val Ile Pro Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asp
145 150 155 160
Ser Ser Ala Asn Glu Lys Gly Leu Leu Leu Asn Phe Glu Asp Arg Asn
165 170 175
Gly Lys Leu Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser
180 185 190
Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Lys Leu
195 200 205
Asp Ala Gly Asp Met Val Ser Phe Gln Arg Gly Val Gly Glu Leu Tyr
210 215 220
Arg His Arg Leu Tyr Ile Asp Trp Trp Arg Arg Pro Asp His His His
225 230 235 240
His His His His Gly Pro Asp His Ser Thr Thr Leu Phe Thr Pro Phe
245 250 255
Leu Ile Pro Asn Gln Pro His His Leu Met Ser Ile Arg Trp Gly Ala
260 265 270
Thr Gly Arg Leu Tyr Ser Leu Pro Ser Pro Thr Pro Pro Arg His His
275 280 285
Glu His Leu Asn Tyr Asn Asn Asn Ala Met Tyr His Pro Phe His His
290 295 300
His Gly Ala Gly Ser Gly Ile Asn Ala Thr Thr His His Tyr Asn Asn
305 310 315 320
Tyr His Glu Met Ser Ser Thr Thr Thr Ser Gly Ser Ala Gly Ser Val
325 330 335
Phe Tyr His Arg Ser Thr Pro Pro Ile Ser Met Pro Leu Ala Asp His
340 345 350
Gln Thr Leu Asn Thr Arg Gln Gln Gln Gln Gln Gln Gln Gln Gln Glu
355 360 365
Gly Ala Gly Asn Val Ser Leu Ser Pro Met Ile Ile Asp Ser Val Pro
370 375 380
Val Ala His His Leu His His Gln Gln His His Gly Gly Lys Ser Ser
385 390 395 400
Gly Pro Ser Ser Thr Ser Thr Ser Pro Ser Thr Ala Gly Lys Arg Leu
405 410 415
Arg Leu Phe Gly Val Asn Met Glu Cys Ala Ser Ser Thr Ser Glu Asp
420 425 430
Pro Lys Cys Phe Ser Leu Leu Ser Ser Ser Ser Met Ala Asn Ser Asn
435 440 445
Ser Gln Pro Pro Leu Gln Leu Leu Arg Glu Asp Thr Leu Ser Ser Ser
450 455 460
Ser Ala Arg Phe Gly Asp Gln Arg Gly Val Gly Glu Pro Ser Met Leu
465 470 475 480
Phe Asp Leu Asp Pro Ser Leu Gln Tyr Arg Gln
485 490
<210> 74
<211> 1476
<212> DNA
<213> Glycine max
<400> 74
atggagttga tgcaagaagt gaaagggtat tctgatggca gagaggagga ggaggaggaa 60
gaggaagcag cagaagaaat catcacaaga gaagaaagca gcaggttgtt acaccagcac 120
caggaggcag caggttccaa tttcatcatc aacaataatc atcatcatca tcaacatcac 180
caccaccaca caacaaagca gctagacttc atggacttgt cacttggtag cagcaaggat 240
gaagggaatt tgcaaggatc atcttcttct gtctatgctc atcatcatca tgcagcaagt 300
gctagttctt ctgccaatgg taacaacaac aacagcagca gcagcaactt gcagcaacag 360
cagcagcagc ctgctgagaa ggagcacatg tttgataaag tagtgacacc aagtgatgtg 420
gggaagctga accggttggt gataccaaag cagcatgctg agaagtattt ccctcttgat 480
tcctcagcca atgagaaggg tctgttgctg aattttgagg acaggaatgg taagttgtgg 540
aggttcaggt actcctattg gaacagcagc cagagctatg tgatgaccaa aggttggagc 600
cgttttgtta aggagaagaa gcttgatgct ggtgacatgg tgtccttcca gcgtggtgtt 660
ggggagttgt ataggcatag gttgtacata gattggtgga gaaggcctga tcatcatcac 720
catcaccatc atggccctga ccattcaacc acactcttca cacctttctt aattcccaat 780
cagcctcatc acttaatgtc catcagatgg ggtgccactg gcagattgta ctccctccct 840
tccccaaccc caccacgcca ccatgaacac ctcaattaca acaataacgc catgtatcat 900
ccctttcatc accatggtgc tggaagtgga attaatgcta ctactcatca ctacaacaac 960
tatcatgaga tgagtagtac tactacttca ggatctgcag gctcagtctt ttaccacagg 1020
tcaacacccc caatatcaat gccattggct gaccaccaaa ccttgaacac aaggcagcag 1080
caacaacaac aacaacaaca agagggagct ggcaatgttt ctctttcccc tatgatcatt 1140
gattctgttc cagttgctca ccacctccat catcaacaac accatggtgg caagagtagt 1200
ggtcctagta gtactagtac tagtcctagc actgcaggga aaagactaag gctatttggg 1260
gtcaacatgg aatgtgcttc ttcaacatca gaagacccca aatgcttcag cttgttgtcc 1320
tcatcttcaa tggctaattc caattcacaa ccaccacttc agcttttgag ggaagataca 1380
ctttcgtcat catcggcaag gtttggggat cagagaggag taggggaacc ttcaatgctt 1440
tttgatctgg acccttcttt gcaataccgg cagtga 1476
<210> 75
<211> 351
<212> PRT
<213> Glycine max
<400> 75
Met Asp Gly Gly Cys Val Thr Asp Glu Thr Thr Thr Ser Ser Asp Ser
1 5 10 15
Leu Ser Val Pro Pro Pro Ser Arg Val Gly Ser Val Ala Ser Ala Val
20 25 30
Val Asp Pro Asp Gly Cys Cys Val Ser Gly Glu Ala Glu Ser Arg Lys
35 40 45
Leu Pro Ser Ser Lys Tyr Lys Gly Val Val Pro Gln Pro Asn Gly Arg
50 55 60
Trp Gly Ala Gln Ile Tyr Glu Lys His Gln Arg Val Trp Leu Gly Thr
65 70 75 80
Phe Asn Glu Glu Asp Glu Ala Ala Arg Ala Tyr Asp Ile Ala Ala Leu
85 90 95
Arg Phe Arg Gly Pro Asp Ala Val Thr Asn Phe Lys Pro Pro Ala Ala
100 105 110
Ser Asp Asp Ala Glu Ser Glu Phe Leu Asn Ser His Ser Lys Phe Glu
115 120 125
Ile Val Asp Met Leu Arg Lys His Thr Tyr Asp Asp Glu Leu Gln Gln
130 135 140
Ser Thr Arg Gly Gly Arg Arg Arg Leu Asp Ala Asp Thr Ala Ser Ser
145 150 155 160
Gly Val Phe Asp Ala Lys Ala Arg Glu Gln Leu Phe Glu Lys Thr Val
165 170 175
Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln
180 185 190
His Ala Glu Lys His Phe Pro Leu Ser Gly Ser Gly Asp Glu Ser Ser
195 200 205
Pro Cys Val Ala Gly Ala Ser Ala Ala Lys Gly Met Leu Leu Asn Phe
210 215 220
Glu Asp Val Gly Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn
225 230 235 240
Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Phe Val Lys
245 250 255
Glu Lys Asn Leu Arg Ala Gly Asp Ala Val Gln Phe Phe Lys Ser Thr
260 265 270
Gly Pro Asp Arg Gln Leu Tyr Ile Asp Cys Lys Ala Arg Ser Gly Glu
275 280 285
Val Asn Asn Asn Ala Gly Gly Leu Phe Val Pro Ile Gly Pro Val Val
290 295 300
Glu Pro Val Gln Met Val Arg Leu Phe Gly Val Asn Leu Leu Lys Leu
305 310 315 320
Pro Val Pro Gly Ser Asp Gly Val Gly Lys Arg Lys Glu Met Glu Leu
325 330 335
Phe Ala Phe Glu Cys Cys Lys Lys Leu Lys Val Ile Gly Ala Leu
340 345 350
<210> 76
<211> 1056
<212> DNA
<213> Glycine max
<400> 76
atggatggag gctgtgtcac agacgaaacc accacatcca gcgactctct ttccgttccg 60
ccgcccagcc gcgtcggcag cgttgcaagc gccgtcgtcg accccgacgg ttgttgcgtt 120
tccggcgagg ccgaatcccg gaaactccct tcgtcgaaat acaaaggcgt ggtgccgcaa 180
ccgaacggtc gctggggagc tcagatttac gagaagcacc agcgcgtgtg gctcggcact 240
ttcaacgagg aagacgaagc cgccagagcc tacgacatcg ccgcgctgcg cttccgcggc 300
cccgacgccg tcaccaactt caagcctccc gccgcctccg acgacgccga gtccgagttc 360
ctcaactcgc attccaagtt cgagatcgtc gacatgctcc gcaagcacac ctacgacgac 420
gagctccagc agagcacgcg cggtggtagg cgccgcctcg acgctgacac cgcgtcgagc 480
ggtgtgttcg acgcgaaagc gcgtgagcag ctgttcgaga aaacggttac gccgagcgac 540
gtcgggaagc tgaatcgatt agtgataccg aagcagcacg cggagaagca ctttccgtta 600
agcggatccg gcgacgaaag ctcgccgtgc gtggcggggg cttcggcggc gaagggaatg 660
ttgttgaact ttgaggacgt tggagggaaa gtgtggcggt ttcgttactc ttattggaac 720
agtagccaga gctacgtgct taccaaagga tggagccggt tcgttaagga gaagaatctt 780
cgagccggtg acgcggttca gttcttcaag tcgaccggac cggaccggca gctatatata 840
gactgcaagg cgaggagtgg tgaggttaac aataatgctg gcggtttgtt tgttccgatt 900
ggaccggtcg ttgagccggt tcagatggtt cggcttttcg gggtcaacct tttgaaacta 960
cccgtacccg gttcggatgg tgtagggaag agaaaagaga tggaactgtt tgcatttgaa 1020
tgttgcaaga agttaaaagt aattggagct ttgtaa 1056
<210> 77
<211> 401
<212> PRT
<213> Glycine max
<400> 77
Met Asp Ala Ile Ser Cys Met Asp Glu Ser Thr Thr Thr Glu Ser Leu
1 5 10 15
Ser Ile Ser Leu Ser Pro Thr Ser Ser Ser Glu Lys Ala Lys Pro Ser
20 25 30
Ser Met Ile Thr Ser Ser Glu Lys Val Ser Leu Ser Pro Pro Pro Ser
35 40 45
Asn Arg Leu Cys Arg Val Gly Ser Gly Ala Ser Ala Val Val Asp Pro
50 55 60
Asp Gly Gly Gly Ser Gly Ala Glu Val Glu Ser Arg Lys Leu Pro Ser
65 70 75 80
Ser Lys Tyr Lys Gly Val Val Pro Gln Pro Asn Gly Arg Trp Gly Ala
85 90 95
Gln Ile Tyr Glu Lys His Gln Arg Val Trp Leu Gly Thr Phe Asn Glu
100 105 110
Glu Asp Glu Ala Ala Arg Ala Tyr Asp Ile Ala Ala Gln Arg Phe Arg
115 120 125
Gly Lys Asp Ala Val Thr Asn Phe Lys Pro Leu Ala Gly Ala Asp Asp
130 135 140
Asp Asp Gly Glu Ser Glu Phe Leu Asn Ser His Ser Lys Pro Glu Ile
145 150 155 160
Val Asp Met Leu Arg Lys His Thr Tyr Asn Asp Glu Leu Glu Gln Ser
165 170 175
Lys Arg Ser Arg Gly Val Val Arg Arg Arg Gly Ser Ala Ala Ala Gly
180 185 190
Thr Ala Asn Ser Ile Ser Gly Ala Cys Phe Thr Lys Ala Arg Glu Gln
195 200 205
Leu Phe Glu Lys Ala Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg
210 215 220
Leu Val Ile Pro Lys Gln His Ala Glu Lys His Phe Pro Leu Gln Ser
225 230 235 240
Ser Asn Gly Val Ser Ala Thr Thr Ile Ala Ala Val Thr Ala Thr Pro
245 250 255
Thr Ala Ala Lys Gly Val Leu Leu Asn Phe Glu Asp Val Gly Gly Lys
260 265 270
Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val
275 280 285
Leu Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Asn Leu Lys Ala
290 295 300
Gly Asp Thr Val Cys Phe His Arg Ser Thr Gly Pro Asp Lys Gln Leu
305 310 315 320
Tyr Ile Asp Trp Lys Thr Arg Asn Val Val Asn Asn Glu Val Ala Leu
325 330 335
Phe Gly Pro Val Gly Pro Val Val Glu Pro Ile Gln Met Val Arg Leu
340 345 350
Phe Gly Val Asn Ile Leu Lys Leu Pro Gly Ser Asp Thr Ile Val Gly
355 360 365
Asn Asn Asn Asn Ala Ser Gly Cys Cys Asn Gly Lys Arg Arg Glu Met
370 375 380
Glu Leu Phe Ser Leu Glu Cys Ser Lys Lys Pro Lys Ile Ile Gly Ala
385 390 395 400
Leu
<210> 78
<211> 1206
<212> DNA
<213> Glycine max
<400> 78
atggatgcaa ttagttgcat ggatgagagc accaccactg agtcactctc tataagtctt 60
tctccgacgt catcgtcgga gaaagcgaag ccttcttcga tgattacatc gtcggagaag 120
gtttctctgt ccccgccgcc gtcaaacaga ctatgccgtg ttggaagcgg cgcgagcgca 180
gtcgtggatc ctgatggcgg cggcagcggc gctgaggtag agtcgcggaa actcccctcg 240
tcgaagtaca agggcgtggt gccccagccc aacggccgct ggggtgcgca gatttacgag 300
aagcaccagc gcgtgtggct tggaacgttc aacgaggaag acgaggcggc gcgtgcgtac 360
gacatcgccg cgcagcggtt ccgcggcaag gacgccgtca cgaacttcaa gccgctcgcc 420
ggcgccgacg acgacgacgg agaatcggag tttctcaact cgcattccaa acccgagatc 480
gtcgacatgc tgcgaaagca cacgtacaat gacgagctgg agcagagcaa gcgcagccgc 540
ggcgtcgtcc ggcggcgagg ctccgccgcc gccggcaccg caaactcaat ttccggcgcg 600
tgctttacta aggcacgtga gcagctattc gagaaggctg ttacgccgag cgacgttggg 660
aaattgaacc gtttggtgat accgaagcag cacgcggaga agcactttcc gttacagagc 720
tctaacggcg ttagcgcgac gacgatagcg gcggtgacgg cgacgccgac ggcggcgaag 780
ggcgttttgt tgaacttcga agacgttgga gggaaagtgt ggcggtttcg ttactcgtat 840
tggaacagta gccagagtta cgtcttaacc aaaggttgga gccggttcgt taaggagaag 900
aatctgaaag ctggtgacac ggtttgtttt caccggtcca ctggaccgga caagcagctt 960
tacatcgatt ggaagacgag gaatgttgtt aacaacgagg tcgcgttgtt cggaccggtc 1020
ggaccggttg tcgaaccgat ccagatggtt cggctctttg gggttaacat tttgaaacta 1080
cccggttcag atactattgt tggcaataac aataatgcaa gtgggtgctg caatggcaag 1140
agaagagaaa tggaactgtt ctcgttagag tgtagcaaga aacctaagat tattggtgct 1200
ttgtaa 1206
<210> 79
<211> 362
<212> PRT
<213> Glycine max
<400> 79
Met Asp Gly Gly Ser Val Thr Asp Glu Thr Thr Thr Thr Ser Asn Ser
1 5 10 15
Leu Ser Val Pro Ala Asn Leu Ser Pro Pro Pro Leu Ser Leu Val Gly
20 25 30
Ser Gly Ala Thr Ala Val Val Tyr Pro Asp Gly Cys Cys Val Ser Gly
35 40 45
Glu Ala Glu Ser Arg Lys Leu Pro Ser Ser Lys Tyr Lys Gly Val Val
50 55 60
Pro Gln Pro Asn Gly Arg Trp Gly Ala Gln Ile Tyr Glu Lys His Gln
65 70 75 80
Arg Val Trp Leu Gly Thr Phe Asn Glu Glu Asp Glu Ala Ala Arg Ala
85 90 95
Tyr Asp Ile Ala Ala His Arg Phe Arg Gly Arg Asp Ala Val Thr Asn
100 105 110
Phe Lys Pro Leu Ala Gly Ala Asp Asp Ala Glu Ala Glu Phe Leu Ser
115 120 125
Thr His Ser Lys Ser Glu Ile Val Asp Met Leu Arg Lys His Thr Tyr
130 135 140
Asp Asn Glu Leu Gln Gln Ser Thr Arg Gly Gly Arg Arg Arg Arg Asp
145 150 155 160
Ala Glu Thr Ala Ser Ser Gly Ala Phe Asp Ala Lys Ala Arg Glu Gln
165 170 175
Leu Phe Glu Lys Thr Val Thr Gln Ser Asp Val Gly Lys Leu Asn Arg
180 185 190
Leu Val Ile Pro Lys Gln His Ala Glu Lys His Phe Pro Leu Ser Gly
195 200 205
Ser Gly Gly Gly Ala Leu Pro Cys Met Ala Ala Ala Ala Gly Ala Lys
210 215 220
Gly Met Leu Leu Asn Phe Glu Asp Val Gly Gly Lys Val Trp Arg Phe
225 230 235 240
Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly
245 250 255
Trp Ser Arg Phe Val Lys Glu Lys Asn Leu Arg Ala Gly Asp Ala Val
260 265 270
Gln Phe Phe Lys Ser Thr Gly Leu Asp Arg Gln Leu Tyr Ile Asp Cys
275 280 285
Lys Ala Arg Ser Gly Lys Val Asn Asn Asn Ala Ala Gly Leu Phe Ile
290 295 300
Pro Val Gly Pro Val Val Glu Pro Val Gln Met Val Arg Leu Phe Gly
305 310 315 320
Val Asp Leu Leu Lys Leu Pro Val Pro Gly Ser Asp Gly Ile Gly Val
325 330 335
Gly Cys Asp Gly Lys Arg Lys Glu Met Glu Leu Phe Ala Phe Glu Cys
340 345 350
Ser Lys Lys Leu Lys Val Ile Gly Ala Leu
355 360
<210> 80
<211> 1089
<212> DNA
<213> Glycine max
<400> 80
atggatggag gcagtgtcac agacgaaacc accacaacca gcaactctct ttcggttccg 60
gcgaatctat ctccgccgcc tctcagcctt gtcggcagcg gcgcaaccgc cgtcgtctac 120
cccgacggtt gttgcgtctc cggcgaagcc gaatcccgga aactcccgtc ctcgaaatac 180
aaaggcgtgg tgccgcaacc gaacggtcgt tggggagctc agatttacga gaagcaccag 240
cgcgtgtggc tcggcacctt caacgaggaa gacgaagccg ccagagccta cgacatcgcc 300
gcgcatcgct tccgcggccg cgacgccgtc actaacttca agcctctcgc cggcgccgac 360
gacgccgaag ccgagttcct cagcacgcat tccaagtccg agatcgtcga catgctccgc 420
aagcacacct acgacaacga gctccagcag agcacccgcg gcggcaggcg ccgccgggac 480
gccgaaaccg cgtcgagcgg cgcgttcgac gcgaaggcgc gtgagcagct gttcgagaaa 540
accgttacgc agagcgacgt cgggaagctg aaccgattag tgataccaaa gcagcacgcg 600
gagaagcact ttccgttaag cggatccggc ggcggagcct tgccgtgcat ggcggcggct 660
gcgggggcga agggaatgtt gctgaacttt gaggacgttg gagggaaagt gtggcggttc 720
cgttactcgt attggaacag tagccagagc tacgtgctta ccaaaggatg gagccggttc 780
gttaaggaga agaatcttcg agctggtgac gcggttcagt tcttcaagtc gaccggactg 840
gaccggcaac tatatataga ctgcaaggcg aggagtggta aggttaacaa taatgctgcc 900
ggtttgttta ttccggttgg accggttgtt gagccggttc agatggtacg gcttttcggg 960
gtcgaccttt tgaaactacc cgtacccggt tcggatggta ttggggttgg ctgtgacggg 1020
aagagaaaag agatggagct gtttgcattt gaatgtagca agaagttaaa agtaattgga 1080
gctttgtaa 1089
<210> 81
<211> 347
<212> PRT
<213> Glycine max
<400> 81
Met Ile Gly Val Glu Lys Val Thr Ile Cys Met Arg Ile Glu Val Asn
1 5 10 15
Thr Glu Lys Gly Arg Arg Ala Leu Met Asp Cys Trp Gln Ile Ser Gly
20 25 30
Val His Glu Ser Ser Asp Cys Ser Glu Ile Lys Phe Ala Phe Asp Ala
35 40 45
Val Val Lys Arg Ala Arg His Glu Glu Asn Asn Ala Ala Ala Gln Lys
50 55 60
Phe Lys Gly Val Val Ser Gln Gln Asn Gly Asn Trp Gly Ala Gln Ile
65 70 75 80
Tyr Ala His Gln Gln Arg Ile Trp Leu Gly Thr Phe Lys Ser Glu Arg
85 90 95
Glu Ala Ala Met Ala Tyr Asp Ser Ala Ser Ile Lys Leu Arg Ser Gly
100 105 110
Glu Cys His Arg Asn Phe Pro Trp Asn Asp Gln Thr Val Gln Glu Pro
115 120 125
Gln Phe Gln Ser His Tyr Ser Ala Glu Thr Val Leu Asn Met Ile Arg
130 135 140
Asp Gly Thr Tyr Pro Ser Lys Phe Ala Thr Phe Leu Lys Thr Arg Gln
145 150 155 160
Thr Gln Lys Gly Val Ala Lys His Ile Gly Leu Lys Gly Asp Asp Glu
165 170 175
Glu Gln Phe Cys Cys Thr Gln Leu Phe Gln Lys Glu Leu Thr Pro Ser
180 185 190
Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Lys His Ala Val
195 200 205
Ser Tyr Phe Pro Tyr Val Gly Gly Ser Ala Asp Glu Ser Gly Ser Val
210 215 220
Asp Val Glu Ala Val Phe Tyr Asp Lys Leu Met Arg Leu Trp Lys Phe
225 230 235 240
Arg Tyr Cys Tyr Trp Lys Ser Ser Gln Ser Tyr Val Phe Thr Arg Gly
245 250 255
Trp Asn Arg Phe Val Lys Asp Lys Lys Leu Lys Ala Lys Asp Val Ile
260 265 270
Ala Phe Phe Thr Trp Gly Lys Ser Gly Gly Glu Gly Glu Ala Phe Ala
275 280 285
Leu Ile Asp Val Ile Tyr Asn Asn Asn Ala Glu Glu Asp Ser Lys Gly
290 295 300
Asp Thr Lys Gln Val Leu Gly Asn Gln Leu Gln Leu Ala Gly Ser Glu
305 310 315 320
Glu Gly Glu Asp Glu Asp Ala Asn Ile Gly Lys Asp Phe Asn Ala Gln
325 330 335
Lys Gly Leu Arg Leu Phe Gly Val Cys Ile Thr
340 345
<210> 82
<211> 1044
<212> DNA
<213> Glycine max
<400> 82
atgattggag ttgagaaagt gacaatttgt atgagaatag aggtgaatac tgaaaaggga 60
agaagggctt taatggactg ttggcaaata tcaggagttc atgaaagttc agattgtagc 120
gaaatcaaat ttgcattcga cgcagtagta aaacgcgcga ggcatgaaga gaataatgca 180
gcagcacaga agttcaaagg cgttgtgtct caacaaaatg ggaactgggg tgcacagata 240
tatgcacacc agcagagaat ctggttgggg accttcaaat ctgaaagaga ggctgcaatg 300
gcttatgaca gcgccagcat aaaacttaga agcggagagt gccacagaaa ctttccatgg 360
aacgaccaaa cagttcaaga gcctcagttc caaagccatt acagcgcaga aacagtgcta 420
aacatgatta gagatggcac ctatccatca aaatttgcta catttctcaa aactcgtcaa 480
acccaaaaag gcgttgcgaa acacataggt ctgaagggtg atgacgagga acagttttgt 540
tgcacccaac tttttcagaa ggaattaaca ccaagtgatg tgggcaagct caacaggctt 600
gtcatcccaa agaagcatgc agttagctat tttccttacg ttggtggcag tgctgatgag 660
agtggtagtg ttgacgtgga ggctgtgttt tatgacaaac tcatgcgatt gtggaagttc 720
cgatactgct attggaagag cagccaaagt tacgtgttca ccagaggctg gaatcggttt 780
gtgaaggata agaagttgaa ggctaaagat gtcattgcgt tttttacgtg gggaaaaagt 840
ggaggagagg gagaagcttt tgcattgatc gatgtaattt ataataataa tgcagaagaa 900
gacagcaagg gagacaccaa acaagttttg ggaaaccaat tacaattagc tggcagtgaa 960
gaaggtgaag atgaagatgc aaacattgga aaggatttca atgcacaaaa gggtctgagg 1020
ctctttggtg tgtgtatcac ctaa 1044
<210> 83
<211> 409
<212> PRT
<213> Hordeum vulgare
<400> 83
Met Glu Phe Thr Ala Thr Ser Ser Arg Phe Ser Lys Gly Glu Glu Glu
1 5 10 15
Val Glu Glu Glu Gln Glu Glu Ala Ser Met Arg Glu Ile Pro Phe Met
20 25 30
Thr Pro Ala Ala Ala Thr Cys Ala Ala Ala Pro Pro Ser Ala Ser Ala
35 40 45
Ser Ala Ser Thr Pro Ala Ser Ala Ser Gly Ser Ser Pro Pro Phe Arg
50 55 60
Ser Gly Asp Asp Ala Gly Ala Ser Gly Ser Gly Ala Gly Asp Gly Ser
65 70 75 80
Arg Ser Asn Val Ala Glu Ala Val Glu Lys Glu His Met Phe Asp Lys
85 90 95
Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro
100 105 110
Lys Gln Tyr Ala Glu Lys Tyr Phe Pro Leu Asp Ser Ala Ala Asn Glu
115 120 125
Lys Gly Leu Leu Leu Asn Phe Glu Asp Ser Ala Gly Lys Pro Trp Arg
130 135 140
Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met Thr Lys
145 150 155 160
Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu Asp Ala Gly Asp Thr
165 170 175
Val Ser Phe Ser Arg Gly Ala Gly Glu Ala Ala Arg His Arg Leu Phe
180 185 190
Ile Asp Trp Lys Arg Arg Ala Asp Thr Arg Asp Pro Leu Arg Leu Pro
195 200 205
Arg Leu Pro Leu Pro Met Pro Leu Thr Ser His Tyr Ser Pro Trp Gly
210 215 220
Leu Gly Ala Gly Ala Arg Gly Phe Phe Met Pro Pro Ser Pro Pro Ala
225 230 235 240
Thr Leu Tyr Glu His Arg Leu Arg Gln Gly Phe Asp Phe Arg Gly Met
245 250 255
Asn Pro Ser Tyr Pro Thr Met Gly Arg Gln Val Ile Leu Phe Gly Ser
260 265 270
Ala Ala Arg Met Pro Pro His Gly Pro Ala Pro Leu Leu Val Pro Arg
275 280 285
Pro Pro Pro Pro Leu His Phe Thr Val Gln Gln Gln Gly Ser Asp Ala
290 295 300
Gly Gly Ser Val Thr Ala Gly Ser Pro Val Val Leu Asp Ser Val Pro
305 310 315 320
Val Ile Glu Ser Pro Thr Thr Ala Thr Lys Lys Arg Val Arg Leu Phe
325 330 335
Gly Val Asn Leu Asp Asn Pro Gln His Pro Gly Asp Gly Gly Gly Glu
340 345 350
Ser Ser Asn Tyr Gly Ser Ala Leu Pro Leu Gln Met Pro Ala Ser Ala
355 360 365
Trp Arg Pro Arg Asp His Thr Leu Arg Leu Leu Glu Phe Pro Ser His
370 375 380
Gly Ala Glu Ala Ser Ser Pro Ser Ser Ser Ser Ser Ser Lys Arg Glu
385 390 395 400
Ala His Ser Gly Leu Asp Leu Asp Leu
405
<210> 84
<211> 1230
<212> DNA
<213> Hordeum vulgare
<400> 84
atggagttta ctgcgacaag cagtaggttt tctaaaggag aggaggaggt ggaggaggag 60
caggaggagg cgtcgatgcg cgagatccct ttcatgacgc ccgcggccgc cacctgcgcc 120
gcggcgccgc cttctgcttc tgcgtcggcc tcgacacccg cgtcagcgtc tggaagtagc 180
cctccctttc gatctgggga tgacgccgga gcgtcgggga gcggggccgg cgacggcagc 240
cgcagcaacg tggcggaggc cgtggagaag gagcacatgt tcgacaaagt ggtgacgccg 300
agcgacgtgg ggaagcttaa ccggctggtc atccccaagc agtacgccga gaagtacttc 360
ccgctggact cggcggccaa cgagaagggc cttctgctca acttcgagga cagcgccggg 420
aagccatggc gcttccgcta ttcctactgg aacagcagcc agagctacgt catgaccaaa 480
ggctggagcc gcttcgtcaa ggagaagcgc ctcgacgctg gggacaccgt ctccttctcc 540
cgcggcgccg gtgaggccgc gcgccaccgc ctcttcatcg actggaagcg ccgagccgac 600
accagagacc cgctccgctt gccccgcctc ccgctcccga tgccgctgac gtcgcactac 660
agcccgtggg gcctcggcgc cggcgccaga ggattcttca tgcctccctc gccgccagcc 720
acgctctacg agcaccgtct ccgtcaaggc ttcgacttcc gcggcatgaa ccccagttac 780
cccacaatgg ggagacaggt catccttttc ggctcggccg ccaggatgcc tccgcacgga 840
ccagcaccac tcctcgtgcc gcgcccgccg ccgccgctgc acttcacggt gcagcaacaa 900
ggcagcgacg ccggcggaag tgtaaccgca ggatccccag tggtgctcga ctcagtgccg 960
gtaatcgaaa gccccacgac ggcaacgaag aagcgcgtgc gcttgttcgg cgtgaacttg 1020
gacaacccgc agcatcccgg tgatggcggg ggcgaatcga gcaattatgg cagtgcactg 1080
ccattgcaga tgcccgcatc agcatggcgg ccaagggacc atacgctgag gctgctcgaa 1140
ttcccctcgc acggtgccga ggcgtcgtct ccatcgtcgt cgtcgtcttc caagagggag 1200
gcgcattcgg gcttggatct cgatctgtga 1230
<210> 85
<211> 227
<212> PRT
<213> Hordeum vulgare
<400> 85
Met Leu Arg Lys His Thr Tyr Phe Asp Glu Leu Ala Gln Ser Lys Arg
1 5 10 15
Ala Phe Ala Ala Ser Ala Ala Leu Ser Ala Pro Thr Thr Ser Gly Asp
20 25 30
Ala Gly Gly Ser Ala Ser Pro Pro Ser Pro Ala Ala Val Arg Glu His
35 40 45
Leu Phe Asp Lys Thr Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg
50 55 60
Leu Val Ile Pro Lys Gln Asn Ala Glu Lys His Phe Pro Leu Gln Leu
65 70 75 80
Pro Ala Gly Gly Gly Glu Ser Lys Gly Leu Leu Leu Asn Phe Glu Asp
85 90 95
Asp Ala Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser
100 105 110
Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys
115 120 125
Gly Leu Gly Ala Gly Asp Val Val Gly Phe Tyr Arg Ser Ala Ala Gly
130 135 140
Arg Thr Gly Glu Asp Ser Lys Phe Phe Ile Asp Cys Arg Leu Arg Pro
145 150 155 160
Asn Thr Asn Thr Ala Ala Glu Ala Asp Pro Val Asp Gln Ser Ser Ala
165 170 175
Pro Val Gln Lys Ala Val Arg Leu Phe Gly Val Asp Leu Leu Ala Ala
180 185 190
Pro Glu Gln Gly Met Pro Gly Gly Cys Lys Arg Ala Arg Asp Leu Val
195 200 205
Lys Pro Pro Pro Pro Lys Val Ala Phe Lys Lys Gln Cys Ile Glu Leu
210 215 220
Ala Leu Ala
225
<210> 86
<211> 684
<212> DNA
<213> Hordeum vulgare
<400> 86
atgctccgca agcacaccta cttcgacgag ctcgcccaga gcaagcgcgc cttcgccgcg 60
tcggccgcgc tctccgcgcc caccacctcg ggcgacgccg gcggcagcgc ctcgccgccc 120
tccccggccg ccgtgcgcga gcacctcttc gacaagaccg tcacgcccag cgacgtcggc 180
aagctgaaca ggctggtgat accgaagcag aacgccgaga agcacttccc gctgcagctc 240
ccggccggcg gcggcgagag caagggcctg ctcctcaact tcgaggacga tgcgggcaag 300
gtgtggcggt tccgctactc gtactggaac agcagccaga gctacgtcct caccaagggc 360
tggagccgct tcgtgaagga gaagggcctc ggcgccggag acgtcgtcgg gttctaccgc 420
tccgccgccg ggaggaccgg cgaagacagc aagttcttca ttgactgcag gctgcggccg 480
aacaccaaca ccgccgccga agcagacccc gtggaccagt cgtcggcgcc cgtgcagaag 540
gccgtgagac tcttcggcgt cgatcttctc gcggcgccgg agcagggcat gccgggcggg 600
tgcaagaggg ccagagactt ggtgaagccg ccgcctccga aagtggcgtt caagaagcaa 660
tgcatagagc tggcgctagc gtag 684
<210> 87
<211> 160
<212> PRT
<213> Hordeum vulgare
<400> 87
Met Tyr Cys Ser Arg Gly Arg Ile Asp Pro Ala Glu Glu Gly Gln Val
1 5 10 15
Met Gly Gly Leu Gly Val Arg Asp Ala Ser Trp Ala Leu Phe Lys Val
20 25 30
Leu Glu Gln Ser Asp Val Gln Val Gly Gln Asn Arg Leu Leu Leu Thr
35 40 45
Lys Glu Ala Val Trp Gly Gly Pro Ile Pro Lys Leu Phe Pro Glu Leu
50 55 60
Glu Glu Leu Arg Gly Asp Gly Leu Asn Ala Glu Asn Arg Val Ala Val
65 70 75 80
Lys Ile Leu Asp Ala Asp Gly Cys Glu Gly Asp Ala Asn Phe Arg Tyr
85 90 95
Leu Asn Ser Ser Lys Ala Tyr Arg Val Met Gly Pro Gln Trp Ser Arg
100 105 110
Leu Val Lys Glu Thr Gly Met Cys Lys Gly Asp Arg Leu Asp Leu Tyr
115 120 125
Ala Ala Thr Ala Thr Ala Ala Ser Ser Cys Ser Gly Ala Arg Ala Ala
130 135 140
Val Ala Pro Ala Ile Pro Pro Gly Ala Ile Val Lys Ala Ala Gly Phe
145 150 155 160
<210> 88
<211> 483
<212> DNA
<213> Hordeum vulgare
<400> 88
atgtattgtt cccgcggccg catcgatccc gcggaagaag ggcaggtgat gggcggcctc 60
ggcgtgcgcg acgccagctg ggcgctgttc aaggtgttgg agcagtccga cgtccaggtg 120
gggcagaacc ggctgctcct caccaaggag gcggtgtggg gcggccctat ccccaagctt 180
ttcccggagc tggaggagct ccgcggcgac ggcctcaacg ccgagaacag ggtcgcggtc 240
aagatcctcg acgccgacgg ctgcgagggg gacgccaact tccgctacct caactccagc 300
aaggcgtacc gggtcatggg gcctcagtgg agccggctcg tgaaggagac cggcatgtgc 360
aagggagacc gcctcgatct gtacgcggca acggcgaccg ctgcctcttc gtgttctgga 420
gccagggcgg ctgtggcgcc ggcgatacct cccggagcaa tcgtgaaggc agccgggttc 480
taa 483
<210> 89
<211> 267
<212> PRT
<213> Hordeum vulgare
<400> 89
Met Leu Arg Lys His Ile Tyr Pro Asp Glu Leu Ala Gln His Lys Arg
1 5 10 15
Ala Phe Phe Phe Ala Ala Ala Ser Ser Pro Thr Ser Ser Ser Ser Pro
20 25 30
Leu Ala Ser Pro Ala Pro Ser Ala Ala Ala Ala Arg Arg Glu His Leu
35 40 45
Phe Asp Lys Thr Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
50 55 60
Val Ile Pro Lys Gln His Ala Glu Lys His Phe Pro Leu Gln Leu Pro
65 70 75 80
Ser Ala Ser Ala Ala Val Pro Gly Glu Cys Lys Gly Val Leu Leu Asn
85 90 95
Phe Asp Asp Ala Thr Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp
100 105 110
Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Phe Val
115 120 125
Lys Glu Lys Gly Leu His Ala Gly Asp Ala Val Glu Phe Tyr Arg Ala
130 135 140
Ala Ser Gly Asn Asn Gln Leu Phe Ile Asp Cys Lys Leu Arg Ser Lys
145 150 155 160
Ser Thr Thr Thr Thr Thr Ser Val Asn Ser Glu Ala Ala Pro Ser Pro
165 170 175
Ala Pro Val Thr Arg Thr Val Arg Leu Phe Gly Val Asp Leu Leu Ile
180 185 190
Ala Pro Ala Ala Arg His Ala His Glu His Glu Asp Tyr Gly Met Ala
195 200 205
Lys Thr Asn Lys Arg Thr Met Glu Ala Ser Val Ala Ala Pro Thr Pro
210 215 220
Ala His Ala Val Trp Lys Lys Arg Cys Val Asp Phe Ala Leu Thr Tyr
225 230 235 240
Arg Leu Ala Thr Thr Pro Gln Cys Pro Arg Ser Arg Asp Gln Leu Glu
245 250 255
Gly Val Gln Ala Ala Gly Ser Thr Phe Ala Leu
260 265
<210> 90
<211> 804
<212> DNA
<213> Hordeum vulgare
<400> 90
atgctgcgca agcacatcta tcccgacgag ctcgcgcagc acaagcgcgc cttcttcttc 60
gccgcggcgt cgtcccctac gtcgtcgtcg tcacctctcg cctcgccggc tccttcagcc 120
gcggcggcgc ggcgcgagca cctgttcgac aagacggtca cgcccagcga cgtggggaag 180
ctgaaccggc tggtgatccc caagcagcac gccgagaagc acttcccgct gcagctccct 240
tctgccagcg ccgccgtgcc aggcgagtgc aagggcgtgc tgctcaactt cgatgacgcg 300
accggcaagg tgtggaggtt ccggtactcc tactggaaca gcagccagag ctacgtgctc 360
accaaggggt ggagccgctt cgtgaaggag aagggccttc acgccggcga cgccgtcgag 420
ttctaccgcg ccgcctccgg caacaaccag ctcttcatcg actgcaagct ccggtccaag 480
agcaccacga cgacgacctc cgtcaactcg gaggccgccc catcgccggc acccgtgacg 540
aggacagtgc gactcttcgg ggtcgacctt ctcatcgcgc cggcggcgag gcacgcgcat 600
gagcacgagg actacggcat ggccaagaca aacaagagaa ccatggaggc cagcgtagcg 660
gcgcctactc cggcgcacgc ggtgtggaag aagcggtgcg tagacttcgc gctgacctac 720
cgacttgcca ccaccccaca gtgcccgagg tcaagagatc aactagaagg agtacaagca 780
gctgggagta catttgctct atag 804
<210> 91
<211> 357
<212> PRT
<213> Hordeum vulgare
<400> 91
Met Gly Val Glu Ile Leu Ser Ser Thr Gly Glu His Ser Ser Gln Tyr
1 5 10 15
Ser Ser Gly Ala Ala Ser Thr Ala Thr Thr Glu Ser Gly Val Gly Gly
20 25 30
Arg Pro Pro Thr Ala Pro Ser Leu Pro Val Ser Ile Ala Asp Glu Ser
35 40 45
Ala Thr Ser Arg Ser Ala Ser Ala Gln Ser Thr Ser Ser Arg Phe Lys
50 55 60
Gly Val Val Pro Gln Pro Asn Gly Arg Trp Gly Ala Gln Ile Tyr Glu
65 70 75 80
Arg His Ala Arg Val Trp Leu Gly Thr Phe Pro Asp Glu Asp Ser Ala
85 90 95
Ala Arg Ala Tyr Asp Val Ala Ala Leu Arg Tyr Arg Gly Arg Glu Ala
100 105 110
Ala Thr Asn Phe Pro Cys Ala Ala Ala Glu Ala Glu Leu Ala Phe Leu
115 120 125
Ala Ala His Ser Lys Ala Glu Ile Val Asp Met Leu Arg Lys His Thr
130 135 140
Tyr Thr Asp Glu Leu Arg Gln Gly Leu Arg Arg Gly Arg Gly Met Gly
145 150 155 160
Ala Arg Ala Gln Pro Thr Pro Ser Trp Ala Arg Glu Pro Leu Phe Glu
165 170 175
Lys Ala Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Val
180 185 190
Pro Lys Gln His Ala Glu Lys His Phe Pro Leu Lys Arg Thr Pro Glu
195 200 205
Thr Thr Thr Thr Thr Gly Lys Gly Val Leu Leu Asn Phe Glu Asp Gly
210 215 220
Glu Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln
225 230 235 240
Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Phe Val Arg Glu Lys Gly
245 250 255
Leu Gly Ala Gly Asp Ser Ile Val Phe Ser Cys Ser Ala Tyr Gly Gln
260 265 270
Glu Lys Gln Phe Phe Ile Asp Cys Lys Lys Asn Lys Thr Met Thr Ser
275 280 285
Cys Pro Ala Asp Asp Arg Gly Ala Ala Thr Ala Ser Pro Pro Val Ser
290 295 300
Glu Pro Thr Lys Gly Glu Gln Val Arg Val Val Arg Leu Phe Gly Val
305 310 315 320
Asp Ile Ala Gly Glu Lys Arg Gly Arg Ala Ala Pro Val Glu Gln Glu
325 330 335
Leu Phe Lys Arg Gln Cys Val Ala His Ser Gln His Ser Pro Ala Leu
340 345 350
Gly Ala Phe Val Leu
355
<210> 92
<211> 1074
<212> DNA
<213> Hordeum vulgare
<400> 92
atgggggtgg agatcctgag ctcaacgggg gaacactcct cccagtactc ttccggagcc 60
gcgtccacgg cgacgacgga gtcaggcgtg ggcggacggc cgccgactgc gccgagccta 120
cctgtttcca tcgccgacga gtcggcgacc tcgcggtcgg catcggcgca gtcgacgtcg 180
tcgcggttca agggcgtggt gccgcagccc aacgggcggt ggggcgccca gatctacgag 240
cgccacgccc gcgtctggct cggcacgttc ccggacgaag actctgcggc gcgcgcctac 300
gacgtggccg cgctccggta ccggggccgc gaggccgcca ccaacttccc gtgcgcggcc 360
gccgaggcgg agctcgcctt cctggcggca cactccaagg ccgagatcgt cgacatgctc 420
cggaagcaca cctacaccga cgagctccgc cagggcctgc ggcgcggccg cggcatgggg 480
gcgcgcgcgc agccgacgcc gtcgtgggcg cgggagcccc ttttcgagaa ggccgtgacc 540
ccgagcgacg tgggcaagct caaccgcctc gttgtgccga agcagcacgc cgagaagcac 600
ttccccctga aacgcacgcc ggagacgaca acgaccaccg gcaagggggt gcttctcaac 660
ttcgaggatg gcgaggggaa agtgtggagg ttccggtact cgtattggaa cagcagccag 720
agctacgtgc tcaccaaggg atggagccgc ttcgttcggg agaagggcct cggtgccggc 780
gactccatcg tgttctcctg ctcggcgtac ggtcaggaga agcagttctt catcgactgc 840
aagaagaaca agacgatgac gagctgcccc gccgatgacc gcggcgccgc aacagcgtcg 900
ccgccagtgt cagagccaac aaaaggagaa caagtccgtg ttgtgaggct gttcggcgtc 960
gacatcgccg gagagaagag ggggcgagcg gcgccggtgg agcaggagtt gttcaagagg 1020
caatgcgtgg cacacagcca gcactctcca gccctaggtg ccttcgtctt atag 1074
<210> 93
<211> 348
<212> PRT
<213> Hordeum vulgare
<400> 93
Met Gly Val Glu Ile Leu Ser Ser Met Val Glu His Ser Phe Gln Tyr
1 5 10 15
Ser Ser Gly Ala Ser Ser Ala Thr Ala Glu Ser Gly Ala Val Gly Thr
20 25 30
Pro Pro Arg His Leu Ser Leu Pro Val Ala Ile Ala Asp Glu Ser Leu
35 40 45
Thr Ser Arg Ser Ala Ser Ser Arg Phe Lys Gly Val Val Pro Gln Pro
50 55 60
Asn Gly Arg Trp Gly Ala Gln Ile Tyr Glu Arg His Ala Arg Val Trp
65 70 75 80
Leu Gly Thr Phe Pro Asp Gln Asp Ser Ala Ala Arg Ala Tyr Asp Val
85 90 95
Ala Ser Leu Arg Tyr Arg Gly Gly Asp Ala Ala Phe Asn Phe Pro Cys
100 105 110
Val Val Val Glu Ala Glu Leu Ala Phe Leu Ala Ala His Ser Lys Ala
115 120 125
Glu Ile Val Asp Met Leu Arg Lys Gln Thr Tyr Ala Asp Glu Leu Arg
130 135 140
Gln Gly Leu Arg Arg Gly Arg Gly Met Gly Val Arg Ala Gln Pro Met
145 150 155 160
Pro Ser Trp Ala Arg Val Pro Leu Phe Glu Lys Ala Val Thr Pro Ser
165 170 175
Asp Val Gly Lys Leu Asn Arg Leu Val Val Pro Lys Gln His Ala Glu
180 185 190
Lys His Phe Pro Leu Lys Arg Ser Pro Glu Thr Thr Thr Thr Thr Gly
195 200 205
Asn Gly Val Leu Leu Asn Phe Glu Asp Gly Gln Gly Lys Val Trp Arg
210 215 220
Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys
225 230 235 240
Gly Trp Ser Arg Phe Val Arg Glu Lys Gly Leu Gly Ala Gly Asp Ser
245 250 255
Ile Met Phe Ser Cys Ser Ala Tyr Gly Gln Glu Lys Gln Phe Phe Ile
260 265 270
Asp Cys Lys Lys Asn Thr Thr Val Asn Gly Gly Lys Ser Ala Ser Pro
275 280 285
Leu Gln Val Met Glu Ile Ala Lys Ala Glu Gln Val Arg Val Val Arg
290 295 300
Leu Phe Gly Val Asp Ile Ala Gly Val Lys Arg Glu Arg Ala Ala Thr
305 310 315 320
Ala Glu Gln Gly Pro Gln Gly Trp Phe Lys Arg Gln Cys Met Ala His
325 330 335
Gly Gln His Ser Pro Ala Leu Gly Asp Phe Ala Leu
340 345
<210> 94
<211> 1047
<212> DNA
<213> Hordeum vulgare
<400> 94
atgggggtgg agatcctgag ctccatggtg gagcactcct tccagtactc ttcgggcgcg 60
tcctcggcca ccgcggagtc aggcgccgtc ggaacaccgc cgaggcatct gagcctacct 120
gtcgccatcg ccgacgagtc cctgacctca cggtcggcgt cgtctcggtt caagggcgtg 180
gtgccgcagc ccaacgggcg gtggggcgcc cagatctacg agcgccacgc tcgcgtctgg 240
ctcggcacgt tcccagacca ggactcggcg gcgcgcgcct acgacgttgc ctcgctcagg 300
taccgcggcg gcgacgccgc cttcaacttc ccgtgcgtgg tggtggaggc ggagctcgcc 360
ttcctggcgg cgcactccaa ggctgagatc gttgacatgc tccggaagca gacctacgcc 420
gatgaactcc gccagggact acggcgcggc cgtggcatgg gggtgcgcgc gcagccgatg 480
ccgtcgtggg cgcgggttcc ccttttcgag aaggccgtga cccctagcga tgtcggcaag 540
ctcaatcgcc tggtggtgcc gaagcagcac gccgagaagc acttccccct gaagcgcagc 600
ccggagacga cgaccaccac cggcaacggc gtactgctca actttgagga cggccaggga 660
aaagtgtgga ggttccggta ctcatattgg aacagcagcc agagctacgt gctcaccaaa 720
ggctggagcc gcttcgtccg ggagaagggc ctcggcgccg gtgactccat catgttctcc 780
tgctcggcgt acgggcagga gaagcagttc ttcatcgact gcaagaagaa cacgaccgtg 840
aacggaggca aatcggcgtc gccgctgcag gtgatggaga ttgccaaagc agaacaagtc 900
cgcgtcgtta gactgttcgg tgtcgacatc gccggggtga agagggagcg agcggcgacg 960
gcggagcaag gcccgcaggg gtggttcaag aggcaatgca tggcacacgg ccagcactct 1020
cctgccctag gtgacttcgc cttatag 1047
<210> 95
<211> 362
<212> PRT
<213> Hordeum vulgare
<400> 95
Met Gly Met Glu Ile Leu Ser Ser Thr Val Glu His Cys Ser Gln Tyr
1 5 10 15
Ser Ser Ser Ala Ser Thr Ala Thr Thr Glu Ser Gly Ala Ala Gly Arg
20 25 30
Ser Thr Thr Ala Leu Ser Leu Pro Val Ala Ile Thr Asp Glu Ser Val
35 40 45
Thr Ser Arg Ser Ala Ser Ala Gln Pro Ala Ser Ser Arg Phe Lys Gly
50 55 60
Val Val Pro Gln Pro Asn Gly Arg Trp Gly Ser Gln Ile Tyr Glu Arg
65 70 75 80
His Ala Arg Val Trp Leu Gly Thr Phe Pro Asp Gln Asp Ser Ala Ala
85 90 95
Arg Ala Tyr Asp Val Ala Ser Leu Arg Tyr Arg Gly Arg Asp Ala Ala
100 105 110
Thr Asn Phe Pro Cys Ala Ala Ala Glu Ala Glu Leu Ala Phe Leu Thr
115 120 125
Ala His Ser Lys Ala Glu Ile Val Asp Met Leu Arg Lys His Thr Tyr
130 135 140
Ala Asp Glu Leu Arg Gln Gly Leu Arg Arg Gly Arg Gly Met Gly Ala
145 150 155 160
Arg Ala Gln Pro Thr Pro Ser Trp Ala Arg Val Pro Leu Phe Glu Lys
165 170 175
Ala Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Val Pro
180 185 190
Lys Gln His Ala Glu Lys His Phe Pro Leu Lys Cys Thr Ala Glu Thr
195 200 205
Thr Thr Thr Thr Gly Asn Gly Val Leu Leu Asn Phe Glu Asp Gly Glu
210 215 220
Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser
225 230 235 240
Tyr Val Leu Thr Lys Gly Trp Ser Ser Phe Val Arg Glu Lys Gly Leu
245 250 255
Gly Ala Gly Asp Ser Ile Val Phe Ser Ser Ser Ala Tyr Gly Gln Glu
260 265 270
Lys Gln Leu Phe Ile Asn Cys Lys Lys Asn Thr Thr Met Asn Gly Gly
275 280 285
Lys Thr Ala Leu Pro Leu Pro Val Val Glu Thr Ala Lys Gly Glu Gln
290 295 300
Asp His Val Val Lys Leu Phe Gly Val Asp Ile Ala Gly Val Lys Arg
305 310 315 320
Val Arg Ala Ala Thr Gly Glu Leu Gly Pro Pro Glu Leu Phe Lys Arg
325 330 335
Gln Ser Val Ala His Gly Cys Gly Arg Met Asn Tyr Ile Cys Tyr Ser
340 345 350
Ile Gly Thr Ile Gly Pro Leu Met Leu Asn
355 360
<210> 96
<211> 1089
<212> DNA
<213> Hordeum vulgare
<400> 96
atggggatgg aaatcctgag ctccacggtg gagcactgct cccagtactc ttccagcgcg 60
tccacggcca caacggagtc aggcgccgcc ggaagatcga cgacggctct gagcctacca 120
gttgccatca ccgacgagtc cgttacctcg cggtcggcat cggcgcagcc ggcgtcatca 180
cggttcaagg gcgtggtgcc gcagcccaac gggcggtggg gctcccagat ctacgagcgc 240
cacgctcgcg tctggctcgg caccttcccg gatcaggact cggcggcgcg tgcctacgac 300
gttgcctcgc tcaggtaccg gggccgcgat gccgccacca acttcccgtg cgccgctgcg 360
gaagcggagc tcgccttcct gaccgcgcac tccaaggccg agatcgtcga catgctccgg 420
aagcacacct acgccgacga actccgccag ggcctgcggc gcggccgcgg catgggtgcg 480
cgcgcgcagc cgacgccgtc gtgggcgcgg gttccccttt tcgagaaggc tgtgacccct 540
agcgatgtcg gcaagctcaa tcgcctggtg gtgccgaagc agcacgccga gaagcacttc 600
cccctgaagt gcaccgcaga gacgacgacc accaccggca acggcgtgct gctaaacttc 660
gaggatggtg aggggaaggt gtggaggttc cggtactcgt attggaacag tagccagagc 720
tacgtgctca ccaaaggctg gagcagcttc gtccgggaga agggcctcgg cgcaggcgac 780
tccatcgtct tctcctcctc ggcgtacggg caggagaagc agttattcat caactgcaaa 840
aagaacacga ctatgaacgg cggcaaaaca gcgttgccgc tgccagtggt ggagactgcc 900
aaaggagaac aagaccacgt cgttaagttg ttcggtgttg acatcgccgg tgtgaagagg 960
gtgcgagcgg cgacggggga gctaggcccg ccggagttgt tcaagagaca atccgtggca 1020
cacggatgcg gaaggatgaa ctacatttgc tactccatag ggacaatagg acctcttatg 1080
ctcaactga 1089
<210> 97
<211> 308
<212> PRT
<213> Hordeum vulgare
<400> 97
Met Ala Ser Ser Lys Pro Thr Asn Pro Glu Val Asp Asn Asp Met Glu
1 5 10 15
Cys Ser Ser Pro Glu Ser Gly Ala Glu Asp Ala Val Glu Ser Ser Ser
20 25 30
Pro Val Ala Ala Pro Ser Ser Arg Phe Lys Gly Val Val Pro Gln Pro
35 40 45
Asn Gly Arg Trp Gly Ala Gln Ile Tyr Glu Lys His Ser Arg Val Trp
50 55 60
Leu Gly Thr Phe Gly Asp Glu Glu Ala Ala Ala Cys Ala Tyr Asp Val
65 70 75 80
Ala Ala Leu Arg Phe Arg Gly Arg Asp Ala Val Thr Asn His Gln Arg
85 90 95
Leu Pro Ala Ala Glu Gly Ala Gly Trp Ser Ser Thr Ser Glu Leu Ala
100 105 110
Phe Leu Ala Asp His Ser Lys Ala Glu Ile Val Asp Met Leu Arg Lys
115 120 125
His Thr Tyr Asp Asp Glu Leu Arg Gln Gly Leu Arg Arg Gly His Gly
130 135 140
Arg Ala Gln Pro Thr Pro Ala Trp Ala Arg Glu Phe Leu Phe Glu Lys
145 150 155 160
Ala Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Val Pro
165 170 175
Lys Gln His Ala Glu Lys His Phe Pro Pro Thr Thr Ala Ala Ala Ala
180 185 190
Gly Ser Asp Gly Lys Gly Leu Leu Leu Asn Phe Glu Asp Gly Gln Gly
195 200 205
Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr
210 215 220
Val Leu Thr Lys Gly Trp Ser Arg Phe Val Gln Glu Lys Gly Leu Cys
225 230 235 240
Ala Gly Asp Thr Val Thr Phe Ser Arg Ser Ala Tyr Val Met Asn Asp
245 250 255
Thr Asp Glu Gln Leu Phe Ile Asp Tyr Lys Gln Ser Ser Lys Asn Asp
260 265 270
Glu Ala Ala Asp Val Ala Thr Ala Asp Glu Asn Glu Ala Gly His Val
275 280 285
Ala Val Lys Leu Phe Gly Val Asp Ile Gly Trp Ala Gly Met Ala Gly
290 295 300
Ser Ser Gly Gly
305
<210> 98
<211> 927
<212> DNA
<213> Hordeum vulgare
<400> 98
atggcgtcta gcaagccgac aaaccccgag gtagacaatg acatggagtg ctcctccccg 60
gaatcgggtg ccgaggacgc cgtggagtcg tcgtcgccgg tggcagcgcc atcttcgcgg 120
ttcaagggcg tcgtgccgca gcctaacggg cgctggggcg cgcagatcta cgagaagcac 180
tcgcgggtgt ggcttggcac gttcggggac gaggaagccg ccgcgtgcgc ctacgacgtg 240
gccgcgctcc gcttccgcgg ccgcgacgcc gtcaccaacc accagcgcct gccggcggcg 300
gagggggccg gctggtcgtc cacgagcgag ctcgccttcc tcgccgacca ctccaaggcc 360
gagatcgtcg acatgctccg gaagcacacc tacgacgacg agctccggca gggcctgcgc 420
cgcggccacg ggcgcgcgca gcccacgccg gcgtgggcgc gagagttcct cttcgagaag 480
gccctgaccc cgagcgacgt cggcaagctc aaccgcctgg tcgttccgaa gcagcacgcc 540
gagaagcact tccccccgac gacggcggcg gccgccggaa gcgacggcaa gggcttgctg 600
ctcaacttcg aggacggcca agggaaggtg tggaggttcc ggtactcata ctggaacagc 660
agccagagct acgtgctcac caagggctgg agccgcttcg tccaagaaaa gggcctctgc 720
gccggcgaca ccgtgacgtt ctcccggtcg gcgtacgtga tgaatgacac ggatgagcag 780
ctcttcatcg actacaagca gagtagcaag aacgacgaag cggccgacgt agccactgcc 840
gatgagaatg aggccggcca tgtcgccgtg aagctcttcg gggtcgacat tggctgggct 900
gggatggcgg gatcatcagg tgggtga 927
<210> 99
<211> 1279
<212> PRT
<213> Hordeum vulgare
<400> 99
Met Leu Phe Asp Ser Ser Val Ser Ala Ser Leu Gly Thr Met Arg Pro
1 5 10 15
Leu Val Lys Lys Leu Asp Met Leu Leu Ala Pro Ala Arg Gly Tyr Ser
20 25 30
Thr Leu Cys Lys Arg Ile Lys Glu Val Met His Leu Leu Lys His Asp
35 40 45
Val Glu Glu Ile Ser Ser Tyr Leu Asp Glu Leu Thr Glu Val Glu Asp
50 55 60
Pro Pro Pro Met Ala Lys Cys Trp Met Asn Glu Ala Arg Asp Leu Ser
65 70 75 80
Tyr Asp Met Glu Asp Tyr Ile Asp Ser Leu Leu Phe Val Pro Pro Gly
85 90 95
His Phe Ile Lys Lys Lys Lys Lys Lys Lys Lys Lys Gly Lys Lys Lys
100 105 110
Met Val Ile Lys Lys Arg Leu Lys Trp Cys Lys Gln Ile Val Phe Thr
115 120 125
Lys Gln Val Ser Asp His Gly Ile Lys Thr Ser Lys Ile Ile His Val
130 135 140
Asn Val Pro Arg Leu Pro Asn Lys Pro Lys Val Ala Lys Ile Ile Leu
145 150 155 160
Gln Phe Arg Ile Tyr Val Gln Glu Ala Ile Glu Arg Tyr Asp Lys Tyr
165 170 175
Arg Leu His His Cys Ser Thr Leu Arg Arg Arg Leu Leu Ser Thr Gly
180 185 190
Ser Met Leu Ser Val Pro Ile Pro Tyr Glu Glu Ala Ala Gln Ile Val
195 200 205
Thr Asp Gly Arg Met Asn Glu Phe Ile Ser Ser Leu Ala Ala Asn Asn
210 215 220
Ala Ala Asp Gln Gln Gln Leu Lys Val Val Ser Val Leu Gly Ser Gly
225 230 235 240
Cys Leu Gly Lys Thr Thr Leu Ala Asn Val Leu Tyr Asp Arg Ile Gly
245 250 255
Met Gln Phe Glu Cys Arg Ala Phe Ile Arg Val Ser Lys Lys Pro Asp
260 265 270
Met Lys Arg Leu Phe Arg Asp Leu Leu Ser Gln Phe His Gln Lys Gln
275 280 285
Pro Leu Pro Thr Ser Cys Asn Glu Leu Gly Ile Ser Asp Asn Ile Ile
290 295 300
Lys His Leu Gln Asp Lys Arg Tyr Leu Ile Val Ile Asp Asp Leu Trp
305 310 315 320
Asp Leu Ser Val Trp Asp Ile Ile Lys Tyr Ala Phe Pro Lys Gly Asn
325 330 335
His Gly Ser Arg Ile Ile Ile Thr Thr Gln Ile Glu Asp Val Ala Leu
340 345 350
Thr Cys Cys Cys Asp His Ser Glu His Val Phe Glu Met Lys Pro Leu
355 360 365
Asn Ile Gly His Ser Arg Glu Leu Phe Phe Asn Arg Leu Phe Gly Ser
370 375 380
Glu Ser Asp Cys Leu Glu Glu Phe Lys Arg Val Ser Asn Glu Ile Val
385 390 395 400
Asp Ile Cys Gly Gly Leu Pro Leu Ala Thr Ile Asn Ile Ala Ser His
405 410 415
Leu Ala Asn Gln Glu Thr Glu Val Ser Leu Asp Leu Leu Thr Asp Thr
420 425 430
Arg Asp Leu Leu Arg Ser Cys Leu Trp Ser Asn Ser Thr Ser Glu Arg
435 440 445
Thr Lys Gln Val Leu Asn Leu Ser Tyr Ser Asn Leu Pro Asp Tyr Leu
450 455 460
Lys Thr Cys Leu Leu Tyr Leu His Met Tyr Pro Val Gly Ser Ile Ile
465 470 475 480
Trp Lys Asp Asp Leu Val Lys Gln Leu Val Ala Glu Gly Phe Ile Ala
485 490 495
Thr Arg Glu Gly Lys Asp Gln Asp Gln Glu Met Ile Glu Lys Ala Ala
500 505 510
Gly Leu Cys Phe Asp Ala Leu Ile Asp Arg Arg Phe Ile Gln Pro Ile
515 520 525
Tyr Thr Lys Tyr Asn Asn Lys Val Leu Ser Cys Thr Val His Glu Val
530 535 540
Val His Asp Leu Ile Ala Gln Lys Ser Ala Glu Glu Asn Phe Ile Val
545 550 555 560
Val Ala Asp His Asn Arg Lys Asn Ile Ala Leu Ser His Lys Val Arg
565 570 575
Arg Leu Ser Leu Ile Phe Gly Asp Thr Ile Tyr Ala Lys Thr Pro Ala
580 585 590
Asn Ile Thr Lys Ser Gln Ile Arg Ser Phe Arg Phe Phe Gly Leu Phe
595 600 605
Glu Cys Met Pro Cys Ile Thr Glu Phe Lys Val Leu Arg Val Leu Asn
610 615 620
Leu Gln Leu Ser Gly His Arg Gly Asp Asn Asp Pro Ile Asp Leu Thr
625 630 635 640
Gly Ile Ser Glu Leu Phe Gln Leu Arg Tyr Leu Lys Ile Thr Ser Asp
645 650 655
Val Cys Ile Lys Leu Pro Asn Gln Met Gln Lys Leu Gln Tyr Leu Glu
660 665 670
Thr Leu Asp Ile Met Asp Ala Pro Arg Val Thr Ala Val Pro Trp Asp
675 680 685
Ile Ile Asn Leu Pro His Leu Leu His Leu Thr Leu Pro Val Asp Thr
690 695 700
Tyr Leu Leu Asp Trp Ile Ser Ser Met Thr Asp Ser Val Ile Ser Leu
705 710 715 720
Trp Thr Leu Gly Lys Leu Asn Tyr Leu Gln His Leu His Leu Thr Ser
725 730 735
Ser Ser Thr Arg Pro Ser Tyr His Leu Glu Arg Ser Val Glu Ala Leu
740 745 750
Gly Tyr Leu Ile Gly Gly His Gly Lys Leu Lys Thr Ile Val Val Ala
755 760 765
His Val Ser Ser Ala Gln Asn Thr Val Val Arg Gly Ala Pro Glu Val
770 775 780
Thr Ile Ser Trp Asp Arg Met Ser Pro Pro Pro Leu Leu Gln Arg Phe
785 790 795 800
Glu Cys Pro His Ser Cys Phe Ile Phe Tyr Arg Ile Pro Lys Trp Val
805 810 815
Thr Glu Leu Gly Asn Leu Cys Ile Leu Lys Ile Ala Val Lys Glu Leu
820 825 830
His Met Ile Cys Leu Gly Thr Leu Arg Gly Leu His Ala Leu Thr Asp
835 840 845
Leu Ser Leu Tyr Val Glu Thr Ala Pro Ile Asp Lys Ile Ile Phe Asp
850 855 860
Lys Ala Gly Phe Ser Val Leu Lys Tyr Cys Lys Leu Arg Phe Ala Ala
865 870 875 880
Gly Ile Ala Trp Leu Lys Phe Glu Ala Asp Ala Met Pro Ser Leu Trp
885 890 895
Lys Leu Met Leu Val Phe Asn Ala Ile Pro Arg Met Asp Gln Asn Leu
900 905 910
Val Phe Phe His His Ser Arg Pro Ala Met His Gln Arg Gly Gly Ala
915 920 925
Val Ile Ile Val Glu His Met Pro Gly Leu Arg Val Ile Ser Ala Lys
930 935 940
Phe Gly Gly Ala Ala Ser Asp Leu Glu Tyr Ala Ser Arg Thr Val Val
945 950 955 960
Ser Asn His Pro Ser Asn Pro Thr Ile Asn Met Gln Leu Val Cys Tyr
965 970 975
Ser Ser Asn Gly Lys Arg Ser Arg Lys Arg Lys Gln Gln Pro Tyr Asp
980 985 990
Val Val Lys Gly Gln Pro Asp Glu Tyr Ala Lys Arg Leu Glu Arg Pro
995 1000 1005
Ala Glu Lys Arg Ile Ser Thr Pro Thr Lys Ser Ser Leu Arg Leu His
1010 1015 1020
Val Pro Glu Ile Thr Pro Lys Pro Met Gln Ile Thr Asp Asn Asn Val
1025 1030 1035 1040
Gln Arg Arg Glu His Met Phe Asp Thr Val Leu Thr Arg Gly Asp Val
1045 1050 1055
Gly Met Leu Asn Arg Leu Val Val Pro Lys Lys His Ala Glu Lys Tyr
1060 1065 1070
Phe Pro Leu Asp Ser Ser Ser Thr Arg Thr Ser Lys Ala Ile Val Leu
1075 1080 1085
Ser Phe Glu Asp Pro Ala Gly Lys Ser Trp Phe Phe His Tyr Ser Tyr
1090 1095 1100
Arg Ser Ser Ser Gln Asn Tyr Val Met Phe Lys Gly Trp Thr Gly Phe
1105 1110 1115 1120
Val Lys Glu Lys Phe Leu Glu Ala Gly Asp Thr Val Ser Phe Ser Arg
1125 1130 1135
Gly Val Gly Glu Ala Thr Arg Gly Arg Leu Phe Ile Asp Cys Gln Asn
1140 1145 1150
Glu Gln Arg Tyr Met Phe Glu Arg Val Leu Thr Ala Ser Asp Met Glu
1155 1160 1165
Ser Asp Gly Cys Ser Leu Met Val Pro Val Asn Leu Val Trp Pro His
1170 1175 1180
Pro Gly Leu Arg Lys Thr Ile Lys Gly Arg His Ala Val Leu Gln Phe
1185 1190 1195 1200
Glu Asp Gly Ser Gly Asn Gly Lys Val Trp Pro Phe Gln Phe Glu Ala
1205 1210 1215
Ser Gly Gln Tyr Tyr Leu Met Lys Gly Leu Asn Tyr Phe Val Asn Asp
1220 1225 1230
Arg Asp Leu Ala Ala Gly Tyr Thr Val Ser Phe Tyr Arg Ala Gly Thr
1235 1240 1245
Arg Leu Phe Val Asp Ser Gly Arg Lys Asp Asp Lys Val Ala Leu Gly
1250 1255 1260
Thr Arg Ser Arg Glu Arg Ile Tyr Pro Lys Ile Val Arg Ser Gln
1265 1270 1275
<210> 100
<211> 3840
<212> DNA
<213> Brassica rapa
<400> 100
atgttgtttg atagttcagt gagtgcttcg ttgggcacca tgagaccact tgtcaagaag 60
ctcgacatgc tgctagctcc tgctcgggga tacagtacct tgtgcaagag gatcaaggaa 120
gtgatgcacc ttctcaaaca tgatgttgaa gagataagct cctaccttga tgaacttaca 180
gaggtggagg accctccacc aatggccaag tgctggatga acgaggcacg cgacctgtct 240
tatgatatgg aggattacat tgatagcttg ttatttgtgc cacctggcca tttcatcaag 300
aagaagaaga agaagaagaa gaagggaaag aagaagatgg tgataaagaa gaggctcaag 360
tggtgcaaac agatcgtatt cacaaagcaa gtgtcagacc atggtatcaa gaccagtaaa 420
atcattcatg ttaatgtccc tcgtcttccc aataagccca aggttgcaaa aataatatta 480
cagttcagga tctatgtcca ggaggctatt gaacggtatg acaagtatag gcttcaccat 540
tgcagcacct tgaggcgtag attgttgtcc actggtagta tgctttcagt gccaataccc 600
tatgaagaag ctgcccaaat tgtaactgat ggccggatga atgagtttat cagctcactg 660
gctgctaata atgcagcaga tcagcagcag ctcaaggtgg tatctgttct tggatctggg 720
tgtctaggta aaactacgct tgcgaatgtg ttgtacgaca gaattgggat gcaattcgaa 780
tgcagagctt tcattcgagt gtccaaaaag cctgatatga agagactttt ccgtgacttg 840
ctctcgcaat tccaccagaa gcagccactg cctaccagtt gtaatgagct tggcataagt 900
gacaatatca tcaaacatct gcaagataaa aggtatctaa ttgttattga tgatttgtgg 960
gatttatcag tatgggatat tattaaatat gcttttccaa agggaaacca tggaagcaga 1020
ataataataa ctacacagat tgaagatgtt gcattaactt gttgctgtga tcactcggag 1080
catgttttcg agatgaaacc tctcaacatt ggtcactcaa gagagctatt ttttaataga 1140
ctttttggtt ctgaaagtga ctgtcttgaa gaattcaaac gagtttcaaa cgaaattgtt 1200
gatatatgtg gtggtttacc gctagcaaca atcaacatag ctagtcattt ggcaaaccag 1260
gagacagaag tatcattgga tttgctaaca gacacacgtg atttgttgag gtcctgtttg 1320
tggtcaaatt ctacttcaga aagaacaaaa caagtactga acctcagcta cagtaatctt 1380
cctgattatc tgaagacatg tttgctgtat cttcatatgt atccagtggg ctccataatc 1440
tggaaggatg atctggtgaa gcaattggtg gctgaagggt ttattgctac aagagaaggg 1500
aaagaccaag accaagaaat gatagagaaa gctgcaggac tctgtttcga tgcacttatt 1560
gatagaagat tcatccagcc tatatatacc aagtacaaca ataaggtgtt gtcctgcacg 1620
gttcatgagg tggtacatga tcttattgcc caaaagtctg ctgaagagaa tttcattgtg 1680
gtagcagacc acaatcgaaa gaatatagca ctttctcata aggttcgtcg actatctctc 1740
atctttggcg acacaatata tgccaagaca ccagcaaaca tcacaaagtc acaaattcgg 1800
tcattcagat tttttggatt attcgagtgt atgccttgta ttacagagtt caaggttctc 1860
cgtgttctaa accttcaact atctggtcat cgtggggaca atgaccctat agacctcact 1920
gggatttcag aactgtttca gctgagatat ttaaagatta caagtgatgt gtgcataaaa 1980
ctaccaaatc aaatgcaaaa actgcaatat ttggaaacgt tggacattat ggatgcacca 2040
agagtcactg ctgttccatg ggatattata aatctcccac acctgttgca cctgactctt 2100
cctgttgata catatctgct ggattggatt agcagcatga ctgactccgt catcagtctg 2160
tggacccttg gcaagctgaa ctacctgcag catcttcatc ttactagttc ttctacacgt 2220
ccttcatacc atctggagag aagtgtggag gctctgggtt atttgatcgg aggacatggc 2280
aagctgaaaa ctatagtagt cgctcatgtc tcctctgctc aaaatactgt ggttcgtggc 2340
gccccagaag taaccatttc atgggatcgt atgtcacctc ccccccttct ccagagattc 2400
gaatgcccac acagctgctt catattttac cgaattccta agtgggttac agaacttggc 2460
aacctgtgca ttttgaagat tgcagtgaag gagcttcata tgatttgtct tggtactctc 2520
agaggattgc atgccctcac tgatctgtcg ctgtatgtgg agacagcgcc cattgacaag 2580
atcatctttg acaaggccgg gttctcagtt ctcaagtact gcaaattgcg cttcgcggct 2640
ggtatagctt ggctgaaatt tgaggctgat gcaatgccta gtctatggaa actgatgcta 2700
gttttcaacg ccatcccacg aatggaccaa aatcttgttt tctttcacca cagccgaccg 2760
gcgatgcatc aacgtggtgg tgcagtaatc attgtcgagc atatgccagg gcttagagtg 2820
atctccgcaa aatttggggg cgcagcttct gatctagagt atgcttcgag gaccgtcgtt 2880
agtaaccatc caagcaatcc tacaatcaac atgcaattgg tgtgttatag ttccaatggt 2940
aagagaagca gaaaaaggaa acaacaacct tacgacgttg tgaagggaca accagatgaa 3000
tacgccaaga gattggagag accagctgag aaaaggattt caacgccgac aaagtcttct 3060
ttgcgtctgc atgttccaga aattacacca aaacctatgc agattacaga caacaatgtt 3120
cagaggaggg agcacatgtt cgatacggtt ctgactcggg gggacgtggg gatgctgaac 3180
cggctggtgg taccgaagaa gcacgcggag aagtacttcc cgctggacag ttcctccacc 3240
cgcaccagca aggccatcgt actcagcttt gaggaccctg ctgggaagtc atggttcttc 3300
cactactcct accggagcag cagccagaac tacgtcatgt tcaaggggtg gactggcttc 3360
gtcaaggaga agtttctcga agccggcgac accgtctcct tcagccgcgg cgtcggggag 3420
gccacgaggg ggaggctctt catcgactgt caaaatgagc agaggtacat gttcgagcga 3480
gtgctgacgg cgagtgatat ggagtcggat ggctgctcgc tgatggtccc agtgaacttg 3540
gtgtggccgc accccggcct ccgcaagacg atcaagggga ggcacgccgt gctgcagttt 3600
gaggacggca gcggcaacgg gaaggtgtgg ccatttcagt ttgaggcctc cggccaatac 3660
tatctcatga agggcttgaa ctactttgtt aacgaccgcg accttgcggc tggctatacc 3720
gtctccttct accgcgccgg cacgcggttg ttcgtcgact ccgggcgtaa agatgacaaa 3780
gtagccttgg gaaccagaag ccgcgaaagg atctatccta agatcgtgcg gtcgcagtag 3840
<210> 101
<211> 264
<212> PRT
<213> Brassica rapa
<400> 101
Met Ser Gly Asn His Tyr Ser Arg Asp Ile His His Asn Thr Pro Ser
1 5 10 15
Val His His His Gln Asn Tyr Ala Val Val Asp Arg Glu Tyr Leu Phe
20 25 30
Glu Lys Ser Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val
35 40 45
Ile Pro Lys Gln His Ala Glu Lys His Phe Pro Leu Asn Asn Ala Gly
50 55 60
Asp Asp Val Ala Ala Ala Glu Thr Thr Glu Lys Gly Met Leu Leu Thr
65 70 75 80
Phe Glu Asp Glu Ser Gly Lys Cys Trp Lys Phe Arg Tyr Ser Tyr Trp
85 90 95
Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val
100 105 110
Lys Asp Lys His Leu His Ala Gly Asp Val Val Phe Phe Gln Arg His
115 120 125
Arg Phe Asp Leu His Arg Val Phe Ile Gly Trp Arg Lys Arg Gly Glu
130 135 140
Val Ser Ser Pro Thr Ala Val Ser Val Val Ser Gln Glu Ala Arg Val
145 150 155 160
Asn Thr Thr Ala Tyr Trp Ser Gly Leu Thr Thr Pro Tyr Arg Gln Val
165 170 175
His Ala Ser Thr Ser Ser Tyr Pro Asn Ile His Gln Glu Tyr Ser His
180 185 190
Tyr Gly Ala Val Ala Glu Ile Pro Thr Val Val Thr Gly Ser Ser Arg
195 200 205
Thr Val Arg Leu Phe Gly Val Asn Leu Glu Cys His Gly Asp Val Val
210 215 220
Glu Thr Pro Pro Cys Pro Asp Gly Tyr Asn Gly Gln His Phe Tyr Tyr
225 230 235 240
Tyr Ser Thr Pro Asp Pro Met Asn Ile Ser Phe Ala Gly Glu Ala Met
245 250 255
Glu Gln Val Gly Asp Gly Arg Arg
260
<210> 102
<211> 258
<212> PRT
<213> Brassica rapa
<400> 102
Met Ser Val Asn His Tyr Ser Asn Thr Leu Ser Ser His Asn His His
1 5 10 15
Asn Glu His Lys Glu Ser Leu Phe Glu Lys Ser Leu Thr Pro Ser Asp
20 25 30
Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg
35 40 45
Tyr Leu Pro Leu Asn Asn Cys Gly Gly Gly Gly Asp Val Thr Ala Glu
50 55 60
Ser Thr Glu Lys Gly Val Leu Leu Ser Phe Glu Asp Glu Ser Gly Lys
65 70 75 80
Ser Trp Lys Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val
85 90 95
Leu Thr Lys Gly Trp Ser Arg Tyr Val Lys Asp Lys His Leu Asn Ala
100 105 110
Gly Asp Val Val Leu Phe Gln Arg His Arg Phe Asp Ile His Arg Leu
115 120 125
Phe Ile Gly Trp Arg Arg Arg Gly Glu Ala Ser Ser Ser Ser Ala Val
130 135 140
Ser Ala Val Thr Gln Asp Pro Arg Ala Asn Thr Thr Ala Tyr Trp Asn
145 150 155 160
Gly Leu Thr Thr Pro Tyr Arg Gln Val His Ala Ser Thr Ser Ser Tyr
165 170 175
Pro Asn Asn Ile His Gln Glu Tyr Ser His Tyr Gly Pro Val Ala Glu
180 185 190
Thr Pro Thr Val Ala Ala Gly Ser Ser Lys Thr Val Arg Leu Phe Gly
195 200 205
Val Asn Leu Glu Cys His Ser Asp Val Val Glu Pro Pro Pro Cys Pro
210 215 220
Asp Ala Tyr Asn Gly Gln His Ile Tyr Tyr Tyr Ser Thr Pro His Pro
225 230 235 240
Met Asn Ile Ser Phe Ala Gly Glu Ala Met Glu Gln Val Gly Asp Gly
245 250 255
Arg Gly
<210> 103
<211> 777
<212> DNA
<213> Brassica rapa
<400> 103
atgtcagtca accattactc aaacactctc tcgtcgcaca atcaccacaa cgaacataaa 60
gagtctttgt tcgagaagtc actcacgcca agcgatgttg gaaagctaaa ccgtttagtc 120
ataccaaaac aacacgccga gagatacctc cctctcaata attgcggcgg cggcggcgac 180
gtgacggcgg agtcgacgga gaaaggggtg cttctcagct tcgaggacga gtcgggaaaa 240
tcttggaaat tcagatactc atattggaac agtagtcaaa gctacgtgtt gaccaaagga 300
tggagcaggt acgtcaaaga caagcacctc aacgcagggg acgtcgtttt atttcaacgg 360
caccgttttg atattcatag actcttcatt ggctggagga gacgcggaga ggcttcttcc 420
tcttccgccg tttccgccgt gactcaagat cctcgagcta acacgacggc gtactggaac 480
ggtttgacta caccttatcg tcaagtacac gcgtcaacta gttcttaccc taacaacatc 540
caccaagagt attcacatta tggccctgtt gctgagacac cgacggtagc tgcagggagc 600
tcgaagacgg tgaggctatt tggagttaac ctcgaatgtc acagtgacgt tgtggagcca 660
ccaccgtgtc ctgacgccta caacggccaa cacatttact attactcaac tccacatccc 720
atgaatatct catttgctgg agaagcaatg gagcaggtag gagatggacg aggttga 777
<210> 104
<211> 267
<212> PRT
<213> Brassica rapa
<400> 104
Met Ser Val Asn His Tyr Ser Thr Asp His His Gln Val His His His
1 5 10 15
His Thr Leu Phe Leu Gln Asn Leu His Thr Thr Asp Thr Ser Glu Pro
20 25 30
Thr Thr Thr Ala Ala Thr Ser Leu Arg Glu Asp Gln Lys Glu Tyr Leu
35 40 45
Phe Glu Lys Ser Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
50 55 60
Val Ile Pro Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asn Thr Ile
65 70 75 80
Ile Ser Asn Asn Ala Glu Glu Lys Gly Met Leu Leu Ser Phe Glu Asp
85 90 95
Glu Ser Gly Lys Cys Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser
100 105 110
Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val Lys Asp Lys
115 120 125
Gln Leu Asp Pro Ala Asp Val Val Phe Phe Gln Arg Gln Arg Ser Asp
130 135 140
Ser Arg Arg Leu Phe Ile Gly Trp Arg Arg Arg Gly Gln Gly Ser Ser
145 150 155 160
Ser Ala Ala Asn Thr Thr Ser Tyr Ser Ser Ser Met Thr Ala Pro Pro
165 170 175
Tyr Ser Asn Tyr Ser Asn Arg Pro Ala His Ser Glu Tyr Ser His Tyr
180 185 190
Gly Ala Ala Val Ala Thr Ala Thr Glu Thr His Phe Ile Pro Ser Ser
195 200 205
Ser Ala Val Gly Ser Ser Arg Thr Val Arg Leu Phe Gly Val Asn Leu
210 215 220
Glu Cys Gln Met Asp Glu Asp Glu Gly Asp Asp Ser Val Ala Thr Ala
225 230 235 240
Ala Ala Ala Glu Cys Pro Arg Gln Asp Ser Tyr Tyr Asp Gln Asn Met
245 250 255
Tyr Asn Tyr Tyr Thr Pro His Ser Ser Ala Ser
260 265
<210> 105
<211> 804
<212> DNA
<213> Brassica rapa
<400> 105
atgtcagtca accattactc cacggaccac caccaggtcc accaccacca cactctcttc 60
ttgcagaacc tccacaccac cgacacatcg gagccaacca caaccgccgc cacatcactc 120
cgcgaagacc agaaagagta tctcttcgag aaatctctca caccaagcga cgttggcaaa 180
ctcaaccgtc tcgttatacc aaaacagcac gcggagaagt acttccctct caacaccatc 240
atctccaata atgctgagga gaaagggatg cttctaagct tcgaagacga gtcaggcaag 300
tgctggaggt tcagatactc ttactggaac agcagtcaaa gctacgtgtt gactaaagga 360
tggagcagat acgtcaaaga caaacagctc gacccagccg atgttgtttt cttccaacgt 420
caacgttctg attcccggag actctttatt ggctggcgta gacgcggtca aggctcctcc 480
tccgccgcga atacgacgtc gtattctagt tccatgactg ctccaccgta tagtaattac 540
tctaatcgtc ctgctcactc agagtattcc cactatggcg ccgccgtagc aacagcgacg 600
gagacgcact tcataccatc gtcttccgcc gtcgggagct cgaggacggt gaggcttttt 660
ggtgtgaatt tggagtgtca aatggatgaa gacgaaggag atgattcggt tgccacggca 720
gccgccgctg agtgtcctcg tcaggacagc tactacgacc aaaacatgta caattattac 780
actcctcact cctcagcctc ataa 804
<210> 106
<211> 248
<212> PRT
<213> Brassica rapa
<400> 106
Met Ser Ile Asn Gln Tyr Ser Ser Asp Phe Asn Tyr His Ser Leu Met
1 5 10 15
Trp Gln Gln Gln Gln His Arg His His His His Gln Asn Asp Val Ala
20 25 30
Glu Glu Lys Glu Ala Leu Phe Glu Lys Pro Leu Thr Pro Ser Asp Val
35 40 45
Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg Tyr
50 55 60
Phe Pro Leu Ala Ala Ala Ala Ala Asp Ala Met Glu Lys Gly Leu Leu
65 70 75 80
Leu Cys Phe Glu Asp Glu Glu Gly Lys Pro Trp Arg Phe Arg Tyr Ser
85 90 95
Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg
100 105 110
Tyr Val Lys Glu Lys Gln Leu Asp Ala Gly Asp Val Ile Leu Phe His
115 120 125
Arg His Arg Val Asp Gly Gly Arg Phe Phe Ile Gly Trp Arg Arg Arg
130 135 140
Gly Asn Ser Ser Ser Ser Ser Asp Ser Tyr Arg His Leu Gln Ser Asn
145 150 155 160
Ala Ser Leu Gln Tyr Tyr Pro His Ala Gly Val Gln Ala Val Glu Ser
165 170 175
Gln Arg Gly Asn Ser Lys Thr Leu Arg Leu Phe Gly Val Asn Met Glu
180 185 190
Cys Gln Leu Asp Ser Asp Leu Pro Asp Pro Ser Thr Pro Asp Gly Ser
195 200 205
Thr Ile Cys Pro Thr Ser His Asp Gln Phe His Leu Tyr Pro Gln Gln
210 215 220
His Tyr Pro Pro Pro Tyr Tyr Met Asp Ile Ser Phe Thr Gly Asp Val
225 230 235 240
His Gln Thr Arg Ser Pro Gln Gly
245
<210> 107
<211> 747
<212> DNA
<213> Brassica rapa
<400> 107
atgtcaataa accaatactc aagcgatttc aactaccact ctctcatgtg gcaacaacag 60
cagcaccgcc accaccacca tcaaaacgac gtcgcggagg aaaaagaagc tcttttcgag 120
aaacccttaa ccccaagtga cgtcggaaaa ctcaaccgcc tcgtcatccc aaaacagcac 180
gccgagagat acttccctct cgcagcagcc gccgcagacg cgatggagaa gggattactt 240
ctctgcttcg aggacgagga aggtaagcca tggagattca gatactcgta ttggaacagt 300
agccagagtt atgtcttgac caaaggatgg agcagatacg tcaaggagaa gcagctcgac 360
gccggtgacg tcattctctt ccaccgccac cgtgttgacg gaggaagatt cttcattggc 420
tggagaagac gcggcaactc ttcctcctct tccgactctt atcgccatct tcagtccaat 480
gcctcgctcc aatattatcc tcatgcagga gttcaagcgg tggagagcca gagagggaat 540
tcgaagacat taagactgtt cggagtgaac atggagtgtc agctagactc cgacttgccc 600
gatccatcta caccagacgg ttccaccata tgtccgacca gtcacgacca gtttcatctc 660
taccctcaac aacactatcc tcctccgtac tacatggaca taagtttcac aggagatgtg 720
caccagacga gaagcccaca aggataa 747
<210> 108
<211> 245
<212> PRT
<213> Brassica rapa
<400> 108
Met Ser Ile Asn Gln Tyr Ser Ser Glu Phe Tyr Tyr His Ser Leu Met
1 5 10 15
Trp Gln Gln Gln Gln Gln His His His Gln Asn Glu Val Val Glu Glu
20 25 30
Lys Glu Ala Leu Phe Glu Lys Pro Leu Thr Pro Ser Asp Val Gly Lys
35 40 45
Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg Tyr Phe Pro
50 55 60
Leu Ala Ala Ala Ala Val Asp Ala Val Glu Lys Gly Leu Leu Leu Cys
65 70 75 80
Phe Glu Asp Glu Glu Gly Lys Pro Trp Arg Phe Arg Tyr Ser Tyr Trp
85 90 95
Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val
100 105 110
Lys Glu Lys Gln Leu Asp Ala Gly Asp Val Val Leu Phe His Arg His
115 120 125
Arg Ala Asp Gly Gly Arg Phe Phe Ile Gly Trp Arg Arg Arg Gly Asp
130 135 140
Ser Ser Ser Ser Ser Asp Ser Tyr Arg Asn Leu Gln Ser Asn Ser Ser
145 150 155 160
Leu Gln Tyr Tyr Pro His Ala Gly Ala Gln Ala Val Glu Asn Gln Arg
165 170 175
Gly Asn Ser Lys Thr Leu Arg Leu Phe Gly Val Asn Met Glu Cys Gln
180 185 190
Ile Asp Ser Asp Trp Ser Glu Pro Ser Thr Pro Asp Gly Phe Thr Thr
195 200 205
Cys Pro Thr Asn His Asp Gln Phe Pro Ile Tyr Pro Glu His Phe Pro
210 215 220
Pro Pro Tyr Tyr Met Asp Val Ser Phe Thr Gly Asp Val His Gln Thr
225 230 235 240
Ser Ser Gln Gln Gly
245
<210> 109
<211> 738
<212> DNA
<213> Brassica rapa
<400> 109
atgtcaataa atcaatattc aagcgagttc tactaccatt ctctcatgtg gcaacaacag 60
cagcaacacc accatcaaaa cgaagtcgtg gaggaaaaag aagctctttt cgagaaaccc 120
ttaaccccaa gtgacgtcgg aaaactaaac cgcctagtca tccctaaaca gcacgccgag 180
agatacttcc ctctcgccgc cgccgcggta gacgccgtgg agaagggatt actcctctgc 240
ttcgaggacg aggaaggtaa gccatggaga ttcagatact cttattggaa tagtagccag 300
agttacgtct tgaccaaagg atggagcaga tatgttaaag agaagcaact tgacgccggc 360
gacgttgttc tctttcatcg ccaccgtgct gacggtggaa gattcttcat tggctggaga 420
agacgcggcg actcttcctc ctcctccgac tcttatcgca atcttcaatc taattcctcg 480
ctccaatatt atcctcatgc aggggctcaa gcggtggaga accagagagg taactccaag 540
acattgagac tttttggagt gaacatggag tgccagatag actcagactg gtccgagcca 600
tccacacctg acggttttac cacatgtcca accaatcacg accagtttcc tatctaccct 660
gaacactttc ctcctccgta ctacatggac gtaagtttca caggagatgt gcaccagacg 720
agtagccaac aaggatag 738
<210> 110
<211> 310
<212> PRT
<213> Brassica rapa
<400> 110
Met Met Thr Asn Leu Ser Leu Ala Arg Glu Gly Glu Glu Glu Glu Glu
1 5 10 15
Glu Ala Gly Ala Lys Lys Pro Thr Glu Glu Val Glu Arg Glu His Met
20 25 30
Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
35 40 45
Val Ile Pro Lys Gln His Ala Glu Arg Tyr Phe Pro Leu Asp Ser Ser
50 55 60
Thr Asn Glu Lys Gly Leu Ile Leu Asn Phe Glu Asp Leu Thr Gly Lys
65 70 75 80
Ser Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val
85 90 95
Met Thr Lys Gly Trp Ser Arg Phe Val Lys Asp Lys Lys Leu Asp Ala
100 105 110
Gly Asp Ile Val Ser Phe Leu Arg Cys Val Gly Asp Thr Gly Arg Asp
115 120 125
Ser Arg Leu Phe Ile Asp Trp Arg Arg Arg Pro Lys Val Pro Asp Tyr
130 135 140
Thr Thr Ser Thr Ser His Phe Pro Ala Gly Ala Met Phe Pro Arg Phe
145 150 155 160
Tyr Ser Phe Gln Thr Ala Thr Thr Ser Thr Ser Tyr Asn Pro Tyr Asn
165 170 175
His Gln Gln Pro Arg His His His Ser Gly Tyr Cys Tyr Pro Gln Ile
180 185 190
Pro Arg Glu Phe Gly Tyr Gly Tyr Val Val Arg Ser Val Asp Gln Arg
195 200 205
Ala Val Val Ala Asp Pro Leu Val Ile Glu Ser Val Pro Val Met Met
210 215 220
His Gly Gly Ala Arg Val Asn Gln Ala Ala Val Gly Thr Ala Gly Lys
225 230 235 240
Arg Leu Arg Leu Phe Gly Val Asp Met Glu Cys Gly Glu Ser Gly Gly
245 250 255
Thr Asn Ser Thr Glu Glu Glu Ser Ser Ser Ser Gly Gly Ser Leu Pro
260 265 270
Arg Gly Gly Ala Ser Pro Ser Ser Ser Met Phe Gln Leu Arg Leu Gly
275 280 285
Asn Ser Ser Glu Asp Asp His Leu Phe Lys Lys Gly Lys Ser Ser Leu
290 295 300
Pro Phe Asn Leu Asp Gln
305 310
<210> 111
<211> 933
<212> DNA
<213> Brassica rapa
<400> 111
atgatgacaa atttgtctct tgcaagagaa ggagaagaag aagaagaaga ggcaggagca 60
aagaagccca cagaagaagt ggagagagag cacatgttcg acaaagtggt gactccaagt 120
gacgtcggga aactaaaccg actcgtgatc ccaaagcaac acgcggagag atacttccct 180
ttagattcat ccacaaacga gaagggtttg attctaaact tcgaagatct cacgggaaag 240
tcatggaggt tccgttactc ttactggaac agcagtcaga gctatgtcat gactaaaggt 300
tggagccgtt tcgttaaaga caagaagcta gacgctggag atattgtctc tttcctgaga 360
tgtgtcggag acacaggaag ggacagccgc ttgtttatcg attggaggag acgacctaaa 420
gtccctgact acacgacatc gacttctcac tttcctgccg gagctatgtt ccctaggttt 480
tacagttttc agacagcaac tacttccaca agttacaatc cctataatca tcagcagcca 540
cgtcatcatc acagtggtta ctgttatcct caaatcccga gagaatttgg atatgggtat 600
gtcgttaggt cagtagatca gagggcggtg gtggctgatc cgttagtgat cgaatctgtg 660
ccggtgatga tgcacggagg agctcgagtg aaccaggcgg ctgttggaac ggccgggaaa 720
aggctgaggc tttttggagt cgatatggaa tgtggcgaga gtggaggaac aaacagtacg 780
gaggaagaat cttcatcttc cggtgggagt ttgccacgtg gcggtgcttc tccgtcttcc 840
tctatgtttc agctgaggct tggaaacagc agtgaagatg atcacttatt taagaaagga 900
aagtcttcat tgccttttaa tttggatcaa taa 933
<210> 112
<211> 293
<212> PRT
<213> Brassica rapa
<400> 112
Met Met Thr Asn Leu Ser Leu Ala Arg Glu Gly Glu Ala Gln Val Lys
1 5 10 15
Lys Pro Ile Glu Glu Val Glu Arg Glu His Met Phe Asp Lys Val Val
20 25 30
Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln
35 40 45
His Ala Glu Arg Tyr Phe Pro Leu Asp Ser Ser Ser Asn Glu Lys Gly
50 55 60
Leu Leu Leu Asn Phe Glu Asp Leu Thr Gly Lys Ser Trp Arg Phe Arg
65 70 75 80
Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met Thr Lys Gly Trp
85 90 95
Ser Arg Phe Val Lys Asp Lys Lys Leu Asp Ala Gly Asp Ile Val Ser
100 105 110
Phe Gln Arg Cys Val Gly Asp Ser Arg Leu Phe Ile Asp Trp Arg Arg
115 120 125
Arg Pro Lys Val Pro Asp Tyr Pro Thr Ser Thr Ala His Phe Ala Ala
130 135 140
Gly Ala Met Phe Pro Arg Phe Tyr Ser Phe Pro Thr Ala Thr Thr Ser
145 150 155 160
Thr Cys Tyr Asp Leu Tyr Asn His Gln Pro Pro Arg His His His Ile
165 170 175
Gly Tyr Gly Tyr Pro Gln Ile Pro Arg Glu Phe Gly Tyr Gly Tyr Phe
180 185 190
Val Arg Ser Val Asp Gln Arg Ala Val Val Ala Asp Pro Leu Val Ile
195 200 205
Glu Ser Val Pro Val Met Met Arg Gly Gly Ala Arg Val Ser Gln Glu
210 215 220
Val Val Gly Thr Ala Gly Lys Arg Leu Arg Leu Phe Gly Val Asp Met
225 230 235 240
Glu Glu Glu Ser Ser Ser Ser Gly Gly Ser Leu Pro Arg Ala Gly Gly
245 250 255
Gly Gly Ala Ser Ser Ser Ser Ser Leu Phe Gln Leu Arg Leu Gly Ser
260 265 270
Ser Cys Glu Asp Asp His Phe Ser Lys Lys Gly Lys Ser Ser Leu Pro
275 280 285
Phe Asp Leu Asp Gln
290
<210> 113
<211> 882
<212> DNA
<213> Brassica rapa
<400> 113
atgatgacca acttgtctct tgcaagggaa ggagaagcac aagtaaagaa gcccatagaa 60
gaagttgaga gagagcacat gttcgacaaa gtggtgactc caagcgacgt agggaaacta 120
aacagactcg tgatcccaaa gcaacacgca gagagatact tccctctaga ttcatcctca 180
aacgagaaag gtttgcttct aaactttgaa gatctaacag gaaagtcatg gaggttccgt 240
tactcttact ggaacagtag ccagagctat gtcatgacta aaggttggag tcgtttcgtt 300
aaagacaaga agcttgacgc cggagatatt gtctctttcc agagatgtgt cggagacagc 360
cgcttgttta tcgattggag gagacgacct aaagtccctg actatccgac atcgactgct 420
cactttgctg caggagctat gttccctagg ttttacagtt ttccgacagc aactacttcg 480
acatgttacg atctgtacaa tcatcagccg ccacgtcatc atcacattgg ttacggttat 540
ccacagattc cgagagaatt tggatacggg tatttcgtta ggtcagtgga ccagagagcg 600
gtggtggctg atccgttggt gatcgaatct gtgccggtga tgatgcgcgg aggagctcga 660
gttagtcagg aggttgttgg aacggccggg aagaggctga ggctttttgg agtcgatatg 720
gaggaagaat cttcatcttc cggtgggagt ttgccgcgtg ccggaggtgg cggtgcttct 780
tcatcttcct ctttgtttca gctgagactt gggagcagct gtgaagatga tcacttctct 840
aagaaaggaa agtcttcatt gccttttgat ttggatcaat aa 882
<210> 114
<211> 297
<212> PRT
<213> Brassica rapa
<400> 114
Met Met Met Thr Asn Leu Ser Leu Ser Arg Glu Gly Glu Glu Glu Glu
1 5 10 15
Glu Glu Glu Gln Glu Glu Ala Lys Lys Pro Met Glu Glu Val Glu Arg
20 25 30
Glu His Met Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly Lys Leu
35 40 45
Asn Arg Leu Val Ile Pro Lys Gln Tyr Ala Glu Arg Tyr Phe Pro Leu
50 55 60
Asp Ser Ser Thr Asn Glu Lys Gly Leu Leu Leu Asn Phe Glu Asp Leu
65 70 75 80
Ala Gly Lys Ser Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln
85 90 95
Ser Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val Lys Asp Lys Lys
100 105 110
Leu Asp Ala Gly Asp Ile Val Ser Phe Gln Arg Cys Val Gly Asp Ser
115 120 125
Gly Arg Asp Ser Arg Leu Phe Ile Asp Trp Arg Arg Arg Pro Lys Val
130 135 140
Pro Asp His Pro Thr Ser Ile Ala His Phe Ala Ala Gly Ser Met Phe
145 150 155 160
Pro Arg Phe Tyr Ser Phe Pro Thr Ala Thr Ser Tyr Asn Leu Tyr Asn
165 170 175
Tyr Gln Gln Pro Arg His His His His Ser Gly Tyr Asn Tyr Pro Gln
180 185 190
Ile Pro Arg Glu Phe Gly Tyr Gly Tyr Leu Val Asp Gln Arg Ala Val
195 200 205
Val Ala Asp Pro Leu Val Ile Glu Ser Val Pro Val Met Met His Gly
210 215 220
Gly Ala Gln Val Ser Gln Ala Val Val Gly Thr Ala Gly Lys Arg Leu
225 230 235 240
Arg Leu Phe Gly Val Asp Met Glu Glu Glu Ser Ser Ser Ser Gly Gly
245 250 255
Ser Leu Pro Arg Gly Asp Ala Ser Pro Ser Ser Ser Leu Phe Gln Leu
260 265 270
Arg Leu Gly Ser Ser Ser Glu Asp Asp His Phe Ser Lys Lys Gly Lys
275 280 285
Ser Ser Leu Pro Phe Asp Leu Asp Gln
290 295
<210> 115
<211> 894
<212> DNA
<213> Brassica rapa
<400> 115
atgatgatga caaacttgtc tctttcaaga gaaggagaag aggaggaaga agaagaacaa 60
gaagaggcca agaagcccat ggaagaagta gagagagagc acatgttcga caaagtggtg 120
actccaagcg atgttggtaa actaaaccgg ctcgtgatcc caaagcaata cgcagagaga 180
tacttccctt tagattcatc cacaaacgag aaaggtttgc ttctaaactt cgaagatctc 240
gcaggaaagt catggaggtt ccgttactct tactggaaca gtagtcagag ctatgtcatg 300
actaaaggtt ggagccgttt cgttaaagac aaaaagctag acgccggaga tattgtctct 360
ttccagagat gtgtcggaga ttcaggaaga gacagccgct tgtttattga ttggaggaga 420
agacctaaag ttcctgacca tccgacatcg attgctcact ttgctgccgg atctatgttt 480
cctaggtttt acagttttcc gacagcaact agttacaatc tttacaacta tcagcagcca 540
cgtcatcatc atcacagtgg ttataattat cctcaaattc cgagagaatt tggatacggg 600
tacttggtgg atcaaagagc cgtggtggct gatccgttgg tgattgaatc tgtgccggtg 660
atgatgcacg gaggagctca agttagtcag gcggttgttg gaacggccgg gaagaggctg 720
aggctttttg gagtcgatat ggaggaagaa tcttcatctt ccggtgggag tttgccacgt 780
ggtgacgctt ctccgtcttc ctctttgttt cagctgagac ttggaagcag cagtgaagat 840
gatcacttct ctaagaaagg aaagtcctca ttgccttttg atttggatca ataa 894
<210> 116
<211> 286
<212> PRT
<213> Brassica rapa
<400> 116
Met Asn Gln Glu Glu Glu Asn Pro Val Glu Lys Ala Ser Ser Met Glu
1 5 10 15
Arg Glu His Met Phe Glu Lys Val Val Thr Pro Ser Asp Val Gly Lys
20 25 30
Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg Tyr Phe Pro
35 40 45
Leu Asp Asn Asn Ser Asp Ser Ser Lys Gly Leu Leu Leu Asn Phe Glu
50 55 60
Asp Arg Thr Gly Asn Ser Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser
65 70 75 80
Ser Gln Ser Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val Lys Asp
85 90 95
Lys Lys Leu Asp Ala Gly Asp Ile Val Ser Phe Gln Arg Asp Pro Gly
100 105 110
Asn Lys Asp Lys Leu Phe Ile Asp Trp Arg Arg Arg Pro Lys Ile Pro
115 120 125
Asp His His His Gln Phe Ala Gly Ala Met Phe Pro Arg Phe Tyr Ser
130 135 140
Phe Ser His Pro Gln Asn Leu Tyr His Arg Tyr Gln Gln Asp Leu Gly
145 150 155 160
Ile Gly Tyr Tyr Val Ser Ser Met Glu Arg Asn Asp Pro Thr Ala Val
165 170 175
Ile Glu Ser Val Pro Leu Ile Met Gln Arg Arg Ala Ala His Val Ala
180 185 190
Ala Ile Pro Ser Ser Arg Gly Glu Lys Arg Leu Arg Leu Phe Gly Val
195 200 205
Asp Met Glu Cys Gly Gly Gly Gly Gly Ser Val Asn Ser Thr Glu Glu
210 215 220
Glu Ser Ser Ser Ser Gly Gly Gly Gly Gly Val Ser Met Ala Ser Val
225 230 235 240
Gly Ser Leu Leu Gln Leu Arg Leu Val Ser Ser Asp Asp Glu Ser Leu
245 250 255
Val Ala Met Glu Ala Ala Ser Val Asp Glu Asp His His Leu Phe Thr
260 265 270
Lys Lys Gly Lys Ser Ser Leu Ser Phe Asp Leu Asp Arg Lys
275 280 285
<210> 117
<211> 861
<212> DNA
<213> Brassica rapa
<400> 117
atgaatcaag aagaagagaa tcctgtggaa aaagcctctt caatggagag agagcacatg 60
tttgaaaaag tagtaacacc aagcgacgta ggcaaactaa accgactcgt gatcccaaag 120
caacacgcgg agagatactt ccctttagac aacaattctg acagcagcaa aggtttgctt 180
ctaaacttcg aagaccgaac aggaaactca tggagattcc gttactctta ctggaacagt 240
agccagagtt atgtcatgac aaaaggttgg agccgcttcg tcaaagacaa gaagcttgat 300
gctggcgaca tcgtttcttt tcagagagat cctggtaata aagacaagct tttcattgat 360
tggaggagac gaccaaagat tccagatcat catcatcaat tcgctggagc tatgttccct 420
aggttttact ctttctctca tcctcagaac ctttatcatc gatatcaaca agatcttgga 480
attgggtatt atgtgagttc aatggagaga aatgatccaa cggctgtaat tgaatctgtg 540
ccgttgataa tgcaaaggag agcagcacac gtggctgcta taccttcatc aagaggagag 600
aagaggttaa ggctgtttgg agtggacatg gagtgcggcg gcggcggagg aagtgtgaat 660
agcacggagg aagagtcgtc gtcttccggt ggtggcggcg gcgtttctat ggctagtgtt 720
ggttctcttc tccaattgag gctagtgagc agtgatgatg agtctttggt agcaatggaa 780
gctgcaagtg tcgatgagga tcatcacttg tttacaaaga aaggaaagtc ttctttgtct 840
ttcgatttgg atagaaaatg a 861
<210> 118
<211> 292
<212> PRT
<213> Brassica rapa
<400> 118
Met Asn Gln Glu Asn Lys Lys Pro Leu Glu Glu Ala Ser Thr Ser Met
1 5 10 15
Glu Arg Glu Asn Met Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly
20 25 30
Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg Tyr Phe
35 40 45
Pro Leu Asp Asn Ser Ser Thr Asn Asn Lys Gly Leu Leu Leu Asp Phe
50 55 60
Glu Asp Arg Thr Gly Ser Ser Trp Arg Phe Arg Tyr Ser Tyr Trp Asn
65 70 75 80
Ser Ser Gln Ser Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val Lys
85 90 95
Asp Lys Lys Leu Asp Ala Gly Asp Ile Val Ser Phe Gln Arg Asp Pro
100 105 110
Cys Asn Lys Asp Lys Leu Tyr Ile Asp Trp Arg Arg Arg Pro Lys Ile
115 120 125
Pro Asp His His Gln Phe Ala Gly Ala Met Phe Pro Arg Phe Tyr Ser
130 135 140
Phe Pro His Pro Gln Met Pro Thr Ser Phe Glu Ser Ser His Asn Leu
145 150 155 160
Tyr His His Arg Phe Gln Arg Asp Leu Gly Ile Gly Tyr Tyr Pro Thr
165 170 175
Ala Val Ile Glu Ser Val Pro Val Ile Met Gln Arg Arg Glu Ala Gln
180 185 190
Val Ala Asn Met Ala Ser Ser Arg Gly Glu Lys Arg Leu Arg Leu Phe
195 200 205
Gly Val Asp Val Glu Cys Gly Gly Gly Gly Gly Gly Ser Val Asn Ser
210 215 220
Thr Glu Glu Glu Ser Ser Ser Ser Gly Gly Ser Met Ser Arg Gly Gly
225 230 235 240
Val Ser Met Ala Gly Val Gly Ser Leu Leu Gln Leu Arg Leu Val Ser
245 250 255
Ser Asp Asp Glu Ser Leu Val Ala Met Glu Gly Ala Thr Val Asp Glu
260 265 270
Asp His His Leu Phe Thr Thr Lys Lys Gly Lys Ser Ser Leu Ser Phe
275 280 285
Asp Leu Asp Ile
290
<210> 119
<211> 879
<212> DNA
<213> Brassica rapa
<400> 119
atgaatcaag aaaacaagaa gcctttggaa gaagcttcga cttcaatgga gagagagaac 60
atgttcgaca aagtagtaac accaagcgac gtagggaaac taaaccgact cgtgatccca 120
aagcaacacg cagagagata cttcccttta gacaactcct caacaaacaa caaagggttg 180
cttctagact tcgaagaccg tacaggaagc tcatggagat tccgttactc ttactggaac 240
agtagccaaa gttatgtcat gacaaaaggt tggagccgtt ttgtcaaaga caagaagctt 300
gatgctggtg acatcgtgtc ttttcaaaga gatccctgta ataaagacaa gctttacata 360
gattggagga gacgaccaaa gattccagat catcatcagt tcgccggagc tatgttccct 420
aggttttact ctttccctca ccctcagatg ccgacaagtt ttgaaagtag tcacaacctt 480
tatcatcatc ggtttcaacg agatcttgga attgggtatt atccaacggc tgtgattgaa 540
tctgtgccgg tgataatgca aaggagagaa gcacaagtgg ctaatatggc ttcatcaaga 600
ggagagaaga ggttaaggct gtttggagtg gacgtggagt gcggcggcgg aggaggagga 660
agtgtgaata gcacggagga agagtcgtcg tcttccggtg gtagtatgtc acgtggcggc 720
gtttctatgg ctggtgttgg ttctctcctt cagttgaggt tagtgagcag tgatgatgag 780
tctttagtag cgatggaagg tgctactgtc gatgaggatc atcacttgtt tacaactaag 840
aaaggaaagt cttctttgtc tttcgatttg gatatatga 879
<210> 120
<211> 320
<212> PRT
<213> Brassica rapa
<400> 120
Met Glu Arg Lys Ser Asn Asp Leu Glu Arg Ser Glu Asn Ile Asp Ser
1 5 10 15
Gln Asn Lys Lys Met Asn Leu Glu Glu Glu Arg Pro Val Gln Glu Ala
20 25 30
Ser Ser Met Glu Arg Glu His Met Phe Asp Lys Val Val Thr Pro Ser
35 40 45
Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu
50 55 60
Arg Tyr Phe Pro Leu Asp Asn Asn Ser Ser Asp Asn Asn Lys Gly Leu
65 70 75 80
Leu Leu Asn Phe Glu Asp Arg Ile Gly Ile Leu Trp Ser Phe Arg Tyr
85 90 95
Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met Thr Lys Gly Trp Ser
100 105 110
Arg Phe Val Lys Asp Lys Lys Leu Asp Ala Gly Asp Ile Val Ser Phe
115 120 125
His Arg Gly Ser Cys Asn Lys Asp Lys Leu Phe Ile Asp Trp Lys Arg
130 135 140
Arg Pro Lys Ile Pro Asp His Gln Val Val Gly Ala Met Phe Pro Arg
145 150 155 160
Phe Tyr Ser Tyr Pro Tyr Pro Gln Ile Gln Ala Ser Tyr Glu Arg His
165 170 175
Asn Leu Tyr His Arg Tyr Gln Arg Asp Ile Gly Ile Gly Tyr Tyr Val
180 185 190
Arg Ser Met Glu Arg Tyr Asp Pro Thr Ala Val Ile Glu Ser Val Pro
195 200 205
Val Ile Met Gln Arg Arg Ala His Val Ala Thr Met Ala Ser Ser Arg
210 215 220
Gly Glu Lys Arg Leu Arg Leu Phe Gly Val Asp Met Glu Cys Val Arg
225 230 235 240
Gly Gly Arg Gly Gly Gly Gly Ser Val Asn Ser Thr Glu Glu Glu Ser
245 250 255
Ser Thr Ser Gly Gly Ser Ile Ser Arg Gly Gly Val Ser Met Ala Gly
260 265 270
Val Gly Ser Pro Leu Gln Leu Arg Leu Val Ser Ser Asp Gly Asp Asp
275 280 285
Gln Ser Leu Val Ala Arg Gly Ala Ala Arg Val Asp Glu Asp His His
290 295 300
Leu Phe Thr Lys Lys Gly Lys Ser Ser Leu Ser Phe Asp Leu Asp Lys
305 310 315 320
<210> 121
<211> 963
<212> DNA
<213> Brassica rapa
<400> 121
atggagagga agtccaatga tcttgagaga tctgagaata ttgattctca aaacaagaag 60
atgaatctag aagaagagag gcctgtacaa gaagcttctt cgatggagag agagcacatg 120
ttcgacaaag tagtaacacc aagcgacgtt gggaaactaa accggctggt gatcccaaag 180
caacacgcag agcgatactt ccctttagac aataattcct cagacaacaa caaaggtttg 240
cttctaaact tcgaagatcg aataggaatc ttatggagtt tccgttactc ctactggaac 300
agtagccaaa gttatgtaat gactaaaggc tggagccgtt tcgtcaaaga caagaagctt 360
gatgctggcg acatagtttc ttttcataga ggttcttgta ataaagacaa gcttttcatt 420
gattggaaga gacgaccaaa gattcctgat caccaagtcg tcggagctat gttccctagg 480
ttttactctt acccttatcc tcagatacag gctagttatg aacgtcacaa cctttatcat 540
cgatatcaac gagatatagg aattgggtat tatgtgaggt caatggagag atatgatcca 600
acggctgtaa ttgaatctgt gccggtgata atgcaaagga gagcacatgt ggctactatg 660
gcttcatcaa gaggagagaa gaggttaagg ctttttggag tggatatgga gtgcgtcaga 720
ggcggccgag gaggaggagg aagtgtgaat agcacggagg aagagtcttc gacttccggt 780
ggtagtatct cacgtggcgg cgtttctatg gctggtgttg gctctccact ccagttgagg 840
ttagtgagca gtgacggtga tgatcagtct ctagtagcta ggggagctgc tagggttgat 900
gaggatcatc acttgtttac aaagaaagga aagtcttctt tgtctttcga tttggataaa 960
tga 963
<210> 122
<211> 350
<212> PRT
<213> Brassica rapa
<400> 122
Met Val Phe Ser Cys Ile Asp Glu Ser Ser Ser Thr Ser Glu Ser Phe
1 5 10 15
Ser Pro Ala Thr Ala Thr Ala Thr Ala Thr Ala Thr Lys Phe Ser Ala
20 25 30
Pro Pro Leu Pro Pro Leu Arg Leu Asn Arg Met Arg Ser Gly Gly Ser
35 40 45
Asn Val Val Leu Asp Ser Lys Asn Gly Val Asp Ile Asp Ser Arg Lys
50 55 60
Leu Ser Ser Ser Lys Tyr Lys Gly Val Val Pro Gln Pro Asn Gly Arg
65 70 75 80
Trp Gly Ala Gln Ile Tyr Val Lys His Gln Arg Val Trp Leu Gly Thr
85 90 95
Phe Cys Asp Glu Glu Glu Ala Ala His Ser Tyr Asp Ile Ala Ala Arg
100 105 110
Lys Phe Arg Gly Arg Asp Ala Val Val Asn Phe Lys Thr Phe Leu Ala
115 120 125
Ser Glu Asp Asp Asn Gly Glu Leu Cys Phe Leu Glu Ala His Ser Lys
130 135 140
Ala Glu Ile Val Asp Met Leu Arg Lys His Thr Tyr Ala Asp Glu Leu
145 150 155 160
Ala Gln Ser Asn Lys Arg Ser Gly Ala Asn Thr Asn Thr Asn Thr Thr
165 170 175
Gln Ser His Thr Val Ser Arg Thr Arg Glu Val Leu Phe Glu Lys Val
180 185 190
Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys
195 200 205
Gln His Ala Glu Lys Tyr Phe Pro Leu Pro Ser Leu Ser Val Thr Lys
210 215 220
Gly Val Leu Ile Asn Phe Glu Asp Val Thr Gly Lys Val Trp Arg Phe
225 230 235 240
Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly
245 250 255
Trp Ser Arg Phe Val Lys Glu Lys Asn Leu Arg Ala Gly Asp Val Val
260 265 270
Thr Phe Glu Arg Ser Thr Gly Ser Asp Arg Gln Leu Tyr Ile Asp Trp
275 280 285
Lys Ile Arg Ser Gly Pro Ser Lys Asn Pro Val Gln Val Val Val Arg
290 295 300
Leu Phe Gly Val Asp Ile Phe Asn Val Thr Ser Ala Lys Pro Ser Asn
305 310 315 320
Val Val Asp Ala Cys Gly Gly Lys Arg Ser Arg Asp Val Asp Met Phe
325 330 335
Ala Leu Arg Cys Ser Lys Lys His Ala Ile Ile Asn Ala Leu
340 345 350
<210> 123
<211> 1053
<212> DNA
<213> Brassica rapa
<400> 123
atggtattca gttgcataga cgagagctct tccacttcag aatctttttc acccgcaacc 60
gcaaccgcaa ccgcaaccgc cacaaagttc tctgctcctc cgcttccacc gttacgcctc 120
aaccggatga gaagcggtgg aagcaacgtc gtgttggatt caaagaatgg cgtagatatt 180
gattcacgga agctatcgtc gtcaaagtac aaaggcgtgg ttcctcagcc caacggaaga 240
tggggagctc agatttacgt gaagcaccag cgagtttggc tgggcacttt ctgcgatgaa 300
gaggaagctg ctcactccta cgacatagcc gcccgtaaat tccgtggccg tgacgccgtt 360
gtcaacttca aaaccttcct cgcctcagag gacgacaacg gcgagttatg tttccttgaa 420
gctcactcca aggccgagat cgtcgacatg ttgaggaaac acacttacgc tgacgagctt 480
gcgcagagca ataaacgcag cggagcgaat acgaatacga atacgactca aagccacacc 540
gtttcgagaa cacgtgaagt gcttttcgag aaggttgtca cgcctagcga cgttggtaag 600
ctaaaccgcc tcgtgatacc taaacagcac gcggagaaat attttccgtt accgtcactg 660
tcggtgacta aaggcgttct gatcaacttc gaagacgtga cgggtaaggt gtggcggttc 720
cgttactcat actggaacag tagtcaaagt tacgtgttga ccaagggatg gagtcggttc 780
gttaaggaga agaatctccg agccggtgat gtcgttactt tcgagagatc gaccggttca 840
gaccggcagc tttatattga ttggaaaatc cggtctggtc cgagcaaaaa ccctgttcag 900
gttgtggtta ggcttttcgg agttgacatc ttcaacgtga caagcgcgaa gccgagcaac 960
gttgtagacg cgtgcggtgg aaagagatct cgggatgttg atatgtttgc gctacggtgt 1020
tccaaaaaac acgctataat caatgctttg tga 1053
<210> 124
<211> 540
<212> PRT
<213> Zea mays
<400> 124
Met Ala Ala Ser Pro Ser Ser Pro Leu Thr Ala Pro Pro Glu Pro Val
1 5 10 15
Thr Pro Pro Ser Pro Trp Thr Ile Thr Asp Gly Ala Ile Ser Gly Thr
20 25 30
Leu Pro Ala Ala Glu Ala Phe Ala Val His Tyr Pro Gly Tyr Pro Ser
35 40 45
Ser Pro Ala Arg Ala Ala Arg Thr Leu Gly Gly Leu Pro Gly Leu Ala
50 55 60
Lys Val Arg Ser Ser Asp Pro Gly Ala Arg Leu Glu Leu Arg Phe Arg
65 70 75 80
Pro Glu Asp Pro Tyr Cys His Pro Ala Phe Gly Gln Ser Arg Ala Ser
85 90 95
Thr Gly Leu Leu Leu Arg Leu Ser Lys Arg Lys Gly Ala Ala Ala Pro
100 105 110
Cys Ala His Val Val Ala Arg Val Arg Thr Ala Tyr Tyr Phe Glu Gly
115 120 125
Met Ala Asp Phe Gln His Val Val Pro Val His Ala Ala Gln Thr Arg
130 135 140
Lys Arg Lys His Ser Asp Ser Gln Asn Asp Asn Glu Asn Phe Gly Ser
145 150 155 160
Asp Lys Thr Gly His Asp Glu Ala Asp Gly Asp Val Met Met Leu Val
165 170 175
Pro Pro Leu Phe Ser Val Lys Asp Arg Pro Thr Lys Ile Ala Leu Val
180 185 190
Pro Ser Ser Asn Ala Ile Ser Lys Thr Met His Arg Gly Val Val Gln
195 200 205
Glu Arg Trp Glu Met Asn Val Gly Pro Thr Leu Ala Leu Pro Phe Asn
210 215 220
Thr Gln Val Val Pro Glu Lys Ile Asn Trp Glu Asp His Ile Arg Lys
225 230 235 240
Asn Ser Val Glu Trp Gly Trp Gln Met Ala Val Cys Lys Leu Phe Asp
245 250 255
Glu Arg Pro Val Trp Pro Arg Gln Ser Leu Tyr Glu Arg Phe Leu Asp
260 265 270
Asp Asn Val His Val Ser Gln Asn Gln Phe Lys Arg Leu Leu Phe Arg
275 280 285
Ala Gly Tyr Tyr Phe Ser Thr Gly Pro Phe Gly Lys Phe Trp Ile Arg
290 295 300
Arg Gly Tyr Asp Pro Arg Lys Asp Ser Glu Ser Gln Ile Tyr Gln Arg
305 310 315 320
Ile Asp Phe Arg Met Pro Pro Glu Leu Arg Tyr Leu Leu Arg Leu Lys
325 330 335
Asn Ser Glu Ser Arg Lys Trp Ala Asp Met Cys Lys Leu Glu Thr Met
340 345 350
Pro Ser Gln Ser Phe Ile Tyr Leu Gln Leu Tyr Glu Leu Lys Asp Asp
355 360 365
Phe Ile Gln Ala Glu Ile Arg Lys Pro Ser Tyr Gln Ser Val Cys Ser
370 375 380
Arg Ser Thr Gly Trp Phe Ser Lys Pro Met Ile Lys Thr Leu Arg Leu
385 390 395 400
Gln Val Ser Ile Arg Leu Leu Ser Leu Leu His Asn Glu Glu Ala Lys
405 410 415
Asn Leu Leu Arg Asn Ala His Glu Leu Ile Glu Arg Ser Lys Lys Gln
420 425 430
Glu Ala Leu Ser Arg Ser Glu Leu Ser Ile Glu Tyr Asn Asp Ala Asp
435 440 445
Gln Val Ser Ala Ala His Thr Gly Thr Glu Asp Gln Val Gly Pro Asn
450 455 460
Asn Ser Asp Ser Glu Asp Val Asp Asp Glu Glu Glu Glu Glu Glu Leu
465 470 475 480
Glu Gly Tyr Asp Ser Pro Pro Met Ala Asp Asp Ile His Glu Phe Thr
485 490 495
Leu Gly Asp Ser Tyr Ala Phe Gly Glu Gly Phe Ser Asn Gly Tyr Leu
500 505 510
Glu Glu Val Leu Arg Ser Leu Pro Leu Gln Glu Asp Gly Gln Lys Lys
515 520 525
Leu Cys Asp Ala Pro Ile Asn Ala Asp Ala Ser Asp
530 535 540
<210> 125
<211> 1674
<212> DNA
<213> Zea mays
<400> 125
atggccgcct cgccctcttc acccttgaca gcgccgccag agccggtgac cccgccgtcc 60
ccatggacca tcacagacgg agccatctct ggcacgctcc cagcagccga ggccttcgca 120
gtgcactacc cgggctaccc ctcctctccc gcccgcgccg cccgcaccct cggcggtctc 180
cccggcctcg ccaaggtccg gagttccgat cccggcgccc gcctcgagct ccgcttccgc 240
cccgaggacc cctactgcca tccagccttt ggccagtccc gcgcctccac tggccttctg 300
ctgcgcctct ccaagcgcaa aggagctgcg gcaccttgtg cccatgtggt cgctcgtgtc 360
cggactgctt actacttcga aggtatggca gattttcaac atgttgttcc agtgcatgct 420
gcacaaacaa gaaaaagaaa acactcagat tctcaaaatg ataatgagaa ttttggtagt 480
gataagacag gacatgatga agcagatgga gatgtcatga tgttggtacc ccctctcttt 540
tcagtgaagg ataggccaac aaagatagcg cttgtaccat cgtccaatgc catatctaaa 600
accatgcaca ggggagttgt acaagaacgg tgggagatga atgttggacc aactctggcg 660
cttccgttca acactcaagt tgtcccggag aagattaatt gggaagacca cattagaaag 720
aattctgtag aatggggttg gcaaatggct gtttgcaaat tgtttgatga gcgccctgtg 780
tggccaaggc aatcacttta tgagcggttc cttgatgata atgtgcatgt ctctcaaaac 840
caattcaaaa ggcttctgtt tagagctgga tactacttct ctactggacc ctttggaaaa 900
ttttggatca gaagaggata tgaccctcgt aaagactctg agtcacaaat atatcagaga 960
attgattttc gcatgcctcc cgagctacga tatcttctaa ggctgaagaa ttctgagtct 1020
cgaaagtggg cagatatgtg caagcttgaa acaatgccat cacagagttt catctacctg 1080
caattatatg aactgaagga tgattttatt caagcagaaa ttcgaaaacc ttcttatcaa 1140
tcagtttgtt cacgttctac aggatggttt tctaagccaa tgatcaaaac cctgaggttg 1200
caagtgagca taaggctcct ctctttattg cataatgaag aggctaaaaa cttgttgagg 1260
aatgcccatg agcttattga aaggtccaag aagcaggaag ccctttcgag atctgagctg 1320
tcaatagaat ataatgatgc tgatcaagtt tctgccgcac atactggaac tgaggatcaa 1380
gtcggcccta acaactctga tagtgaagat gtggatgatg aagaagagga agaggaattg 1440
gagggttatg attctccacc tatggcagat gatattcatg agttcacctt aggtgattcc 1500
tatgcatttg gtgaaggctt ctcgaatgga tacctcgaag aagtactgcg cagcttgcca 1560
ttgcaggaag acggccaaaa gaaattatgt gatgctccta tcaacgctga tgcaagtgat 1620
ggagagtttg aaatttacga acagcccagt gatgatgaag attctgatgg ctag 1674
<210> 126
<211> 409
<212> PRT
<213> Zea mays
<400> 126
Met Glu Phe Ala Ser Ser Ser Ser Arg Phe Ser Arg Glu Glu Asp Glu
1 5 10 15
Glu Glu Glu Gln Glu Glu Glu Glu Glu Glu Glu Glu Ala Ser Pro Arg
20 25 30
Glu Ile Pro Phe Met Thr Ala Ala Ala Thr Ala Asp Thr Gly Ala Ala
35 40 45
Ala Ser Ser Ser Ser Pro Ser Ala Ala Ala Ser Ser Gly Pro Ala Ala
50 55 60
Ala Pro Arg Ser Ser Asp Gly Ala Gly Ala Ser Gly Ser Gly Gly Gly
65 70 75 80
Gly Ser Asp Asp Val Gln Val Ile Glu Lys Glu His Met Phe Asp Lys
85 90 95
Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro
100 105 110
Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asp Ala Ala Ala Asn Glu
115 120 125
Lys Gly Gln Leu Leu Ser Phe Glu Asp Arg Ala Gly Lys Leu Trp Arg
130 135 140
Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met Thr Lys
145 150 155 160
Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu Asp Ala Gly Asp Thr
165 170 175
Val Ser Phe Cys Arg Gly Ala Gly Asp Thr Ala Arg Asp Arg Leu Phe
180 185 190
Ile Asp Trp Lys Arg Arg Ala Asp Ser Arg Asp Pro His Arg Met Pro
195 200 205
Arg Leu Pro Leu Pro Met Ala Pro Val Ala Ser Pro Tyr Gly Pro Trp
210 215 220
Gly Gly Gly Gly Gly Gly Gly Ala Gly Gly Phe Phe Met Pro Pro Ala
225 230 235 240
Pro Pro Ala Thr Leu Tyr Glu His His Arg Phe Arg Gln Ala Leu Asp
245 250 255
Phe Arg Asn Ile Asn Ala Ala Ala Ala Pro Ala Arg Gln Leu Leu Phe
260 265 270
Phe Gly Ser Ala Gly Met Pro Pro Arg Ala Ser Met Pro Gln Gln Gln
275 280 285
Gln Pro Pro Pro Pro Pro His Pro Pro Leu His Ser Ile Met Leu Val
290 295 300
Gln Pro Ser Pro Ala Pro Pro Thr Ala Ser Val Pro Met Leu Leu Asp
305 310 315 320
Ser Val Pro Leu Val Asn Ser Pro Thr Ala Ala Ser Lys Arg Val Arg
325 330 335
Leu Phe Gly Val Asn Leu Asp Asn Pro Gln Pro Gly Thr Ser Ala Glu
340 345 350
Ser Ser Gln Asp Ala Asn Ala Leu Ser Leu Arg Thr Pro Gly Trp Gln
355 360 365
Arg Pro Gly Pro Leu Arg Phe Phe Glu Ser Pro Gln Arg Gly Ala Glu
370 375 380
Ser Ser Ala Ala Ser Ser Pro Ser Ser Ser Ser Ser Ser Lys Arg Glu
385 390 395 400
Ala His Ser Ser Leu Asp Leu Asp Leu
405
<210> 127
<211> 259
<212> PRT
<213> Zea mays
<400> 127
Met Glu Phe Thr Thr Pro Pro Pro Ala Thr Arg Ser Gly Gly Gly Glu
1 5 10 15
Glu Arg Ala Ala Ala Glu His Asn Gln His His Gln Gln Gln His Ala
20 25 30
Thr Val Glu Lys Glu His Met Phe Asp Lys Val Val Thr Pro Ser Asp
35 40 45
Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Lys
50 55 60
Tyr Phe Pro Leu Asp Ala Ala Ala Asn Glu Lys Gly Leu Leu Leu Ser
65 70 75 80
Phe Glu Asp Arg Thr Gly Lys Pro Trp Arg Phe Arg Tyr Ser Tyr Trp
85 90 95
Asn Ser Ser Gln Ser Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val
100 105 110
Lys Glu Lys Arg Leu Asp Ala Gly Asp Thr Val Ser Phe Gly Arg Gly
115 120 125
Ile Ser Glu Ala Ala Arg Asp Arg Leu Phe Ile Asp Trp Arg Cys Arg
130 135 140
Pro Asp Pro Pro Val Val His His Gln Tyr His His Arg Leu Pro Leu
145 150 155 160
Pro Ser Ala Val Val Pro Tyr Ala Pro Trp Ala Ala His Ala His His
165 170 175
His His Tyr Pro Ala Asp Gly His Thr Glu Pro Val Thr Pro Cys Leu
180 185 190
Cys Ala Thr Leu Val Ala Thr Glu Met Arg Ala Ser Ser Ser Gln Leu
195 200 205
Ser Leu Thr Arg Ser Asn Leu Ser Arg Pro Pro Gln Pro Arg Ile Ala
210 215 220
Arg Val Asp Gly Ala Gln Pro Arg Pro Ser Ser Ser Pro Arg Gln Pro
225 230 235 240
Gln Ser Leu Trp Cys Arg Ser Cys Gln Pro Gln Pro Arg Arg Thr Ala
245 250 255
Asp Val Pro
<210> 128
<211> 780
<212> DNA
<213> Zea mays
<400> 128
atggagttca ccactccccc gcccgcgacc cggtcgggcg gcggagagga gagggcggct 60
gctgagcaca accagcacca ccagcagcag catgcgacgg tggagaagga gcacatgttc 120
gacaaggtgg tgacgccgag cgacgtcggg aagctgaacc ggctggtgat cccgaagcag 180
cacgcggaga agtacttccc gctggacgcg gcggcgaacg agaagggcct cctgctcagc 240
ttcgaggacc gcacggggaa gccctggcgc ttccgctact cctactggaa cagtagccag 300
agctacgtga tgaccaaggg ctggagccgc ttcgtcaagg agaagcgcct cgacgccggg 360
gacacagtct ccttcggccg cggcatcagc gaggcggcgc gcgacaggct tttcatcgac 420
tggcggtgcc gacccgaccc gcccgtcgtg caccaccagt accaccaccg cctccctctc 480
ccctccgccg tcgtccccta cgcgccgtgg gcggcgcacg cgcaccacca ccactaccca 540
gcagatgggc acacggaacc agtaacacct tgcctgtgcg ccacactcgt tgccactgaa 600
atgagagcat catcttcgca actgtcactc acacgctcca acctctccag gccgccacaa 660
cctagaatag ccagagtcga tggcgcccag ccacggccgt cgtcgtcacc acgccagcca 720
cagtcgttgt ggtgccggtc gtgccaaccg caaccacggc gaacggccga cgttccttga 780
<210> 129
<211> 327
<212> PRT
<213> Zea mays
<400> 129
Met Glu Phe Thr Ala Pro Pro Pro Ala Thr Arg Ser Gly Gly Gly Glu
1 5 10 15
Glu Arg Ala Ala Ala Glu His His Gln Gln Gln Gln Gln Ala Thr Val
20 25 30
Glu Lys Glu His Met Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly
35 40 45
Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg Tyr Phe
50 55 60
Pro Leu Asp Ala Ala Ala Asn Asp Lys Gly Leu Leu Leu Ser Phe Glu
65 70 75 80
Asp Arg Ala Gly Lys Pro Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser
85 90 95
Ser Gln Ser Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val Lys Glu
100 105 110
Lys Arg Leu Asp Ala Gly Asp Thr Val Ser Phe Gly Arg Gly Val Gly
115 120 125
Glu Ala Ala Arg Gly Arg Leu Phe Ile Asp Trp Arg Arg Arg Pro Asp
130 135 140
Pro Pro Val Val His His Gln Tyr His His His Arg Leu Pro Leu Pro
145 150 155 160
Ser Ala Val Val Pro Tyr Ala Pro Trp Ala Ala Ala Ala His Ala His
165 170 175
His His His Tyr Pro Ala Ala Gly Val Gly Ala Ala Arg Thr Thr Thr
180 185 190
Thr Thr Thr Thr Thr Val Leu His His Leu Pro Pro Ser Pro Ser Pro
195 200 205
Leu Tyr Leu Asp Thr Arg Arg Arg His Val Gly Tyr Asp Ala Tyr Gly
210 215 220
Ala Gly Thr Arg Gln Leu Leu Phe Tyr Arg Pro His Gln Gln Pro Ser
225 230 235 240
Thr Thr Val Met Leu Asp Ser Val Pro Val Arg Leu Pro Pro Thr Pro
245 250 255
Gly Gln His Ala Glu Pro Pro Pro Pro Ala Val Ala Ser Ser Ala Ser
260 265 270
Lys Arg Val Arg Leu Phe Gly Val Asn Leu Asp Cys Ala Ala Ala Ala
275 280 285
Gly Ser Glu Glu Glu Asn Val Gly Gly Trp Arg Thr Ser Ala Pro Pro
290 295 300
Thr Gln Gln Ala Ser Ser Ser Ser Ser Tyr Ser Ser Gly Lys Ala Arg
305 310 315 320
Cys Ser Leu Asn Leu Asp Leu
325
<210> 130
<211> 984
<212> DNA
<213> Zea mays
<400> 130
atggagttca ccgctccccc gcccgcgacc cggtcgggcg gcggcgagga gagggcggct 60
gctgagcacc accagcagca gcagcaggcg acggtggaga aggagcacat gttcgacaag 120
gtggtgacgc cgagcgacgt cgggaagctg aaccggctgg tgatcccgaa gcagcacgcg 180
gagaggtact tcccgctgga cgcggcggcg aacgacaagg gcctgctgct cagcttcgag 240
gaccgcgcgg ggaagccctg gcgcttccgc tactcctact ggaacagcag ccagagctac 300
gtgatgacca agggctggag ccgcttcgtc aaggagaagc gcctcgacgc cggggacacc 360
gtctccttcg gccgcggcgt cggcgaggcg gcgcgcggca ggctcttcat cgactggcgg 420
cgccgacccg acccgcccgt cgtgcaccac cagtaccacc accaccgcct ccctctcccc 480
tccgccgtcg tcccctacgc gccgtgggcg gcggcggcgc acgcgcacca ccaccactac 540
ccagcagctg gggtcggtgc cgccaggacg acgacgacga cgacgacgac ggtgctccac 600
cacctgccgc cctcgccctc cccgctctac cttgacaccc gccgccgcca cgtcggctac 660
gacgcctacg gggccggcac caggcaactt ctcttctaca ggccgcacca gcagccctcc 720
acgacggtga tgctggactc cgtgccggta cggttaccgc caacgccagg gcagcacgcc 780
gagccgccgc cccccgccgt ggcgtcgtca gcctcgaagc gggtgcgcct gttcggggtg 840
aacctcgact gcgccgccgc cgccggctca gaggaggaga acgtcggcgg gtggaggact 900
agtgcgccgc cgacgcagca ggcgtcctcc tcctcatcct actcttccgg gaaagcgagg 960
tgctccttga accttgactt gtga 984
<210> 131
<211> 422
<212> PRT
<213> Zea mays
<400> 131
Met Asp Gln Phe Ala Ala Ser Gly Arg Phe Ser Arg Glu Glu Glu Ala
1 5 10 15
Asp Glu Glu Gln Glu Asp Ala Ser Asn Ser Met Arg Glu Ile Ser Phe
20 25 30
Met Pro Pro Ala Ala Ala Ser Ser Ser Ser Ala Ala Ala Ser Ala Ser
35 40 45
Ala Ser Ala Ser Thr Ser Ala Ser Ala Cys Ala Ser Gly Ser Ser Ser
50 55 60
Ala Pro Phe Arg Ser Ala Ser Ala Ser Gly Asp Ala Ala Gly Ala Ser
65 70 75 80
Gly Ser Gly Gly Pro Ala Asp Ala Asp Ala Glu Ala Glu Ala Val Glu
85 90 95
Lys Glu His Met Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly Lys
100 105 110
Leu Asn Arg Leu Val Ile Pro Lys Gln Tyr Ala Glu Lys Tyr Phe Pro
115 120 125
Leu Asp Ala Ala Ala Asn Glu Lys Gly Leu Leu Leu Ser Phe Glu Asp
130 135 140
Ser Ala Gly Lys His Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser
145 150 155 160
Gln Ser Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys
165 170 175
Arg Leu Val Ala Gly Asp Thr Val Ser Phe Ser Arg Ala Ala Ala Glu
180 185 190
Asp Ala Arg His Arg Leu Phe Ile Asp Trp Lys Arg Arg Val Asp Thr
195 200 205
Arg Gly Pro Leu Arg Phe Ser Gly Leu Ala Leu Pro Met Pro Leu Pro
210 215 220
Ser Ser His Tyr Gly Gly Pro His His Tyr Ser Pro Trp Gly Phe Gly
225 230 235 240
Gly Gly Gly Gly Gly Gly Gly Gly Phe Phe Met Pro Pro Ser Pro Pro
245 250 255
Ala Thr Leu Tyr Glu His Arg Leu Arg Gln Gly Leu Asp Phe Arg Ser
260 265 270
Met Thr Thr Thr Tyr Pro Ala Pro Thr Val Gly Arg Gln Leu Leu Phe
275 280 285
Phe Gly Ser Ala Arg Met Pro Pro His His Ala Pro Pro Pro Gln Pro
290 295 300
Arg Pro Phe Ser Leu Pro Leu His His Tyr Thr Val Gln Pro Ser Ala
305 310 315 320
Ala Gly Val Thr Ala Ala Ser Arg Pro Val Leu Leu Asp Ser Val Pro
325 330 335
Val Ile Glu Ser Pro Thr Thr Ala Ala Lys Arg Val Arg Leu Phe Gly
340 345 350
Val Asn Leu Asp Asn Asn Pro Asp Gly Gly Gly Glu Ala Ser His Gln
355 360 365
Gly Asp Ala Leu Ser Leu Gln Met Pro Gly Trp Gln Gln Arg Thr Pro
370 375 380
Thr Leu Arg Leu Leu Glu Leu Pro Arg His Gly Gly Glu Ser Ser Ala
385 390 395 400
Ala Ser Ser Pro Ser Ser Ser Ser Ser Ser Lys Arg Glu Ala Arg Ser
405 410 415
Ala Leu Asp Leu Asp Leu
420
<210> 132
<211> 1269
<212> DNA
<213> Zea mays
<400> 132
atggaccagt tcgccgcgag cgggaggttc tctagagagg aggaggcgga cgaggagcag 60
gaggatgcgt ccaattccat gcgcgagatc tccttcatgc cgccggctgc ggcctcgtca 120
tcttcggcgg ctgcttccgc gtccgcgtcc gcctccacca gcgcatccgc gtgtgcatcg 180
ggaagcagca gcgccccctt ccgctccgcc tccgcgtcgg gggatgccgc cggagcgtcg 240
gggagcggcg gcccagcgga cgcggacgcg gaggcggagg cggtggagaa ggagcacatg 300
ttcgacaagg tggtcacgcc gagcgacgtg gggaagctca accggctggt gatcccgaag 360
cagtacgcgg agaagtactt cccgctggac gcggcggcca acgagaaggg cctcctcctc 420
agcttcgagg acagcgccgg caagcactgg cgcttccgct actcctactg gaacagcagc 480
cagagctacg tcatgaccaa gggctggagc cgcttcgtca aggagaagcg cctcgtcgcc 540
ggggacaccg tctccttctc ccgcgccgcc gccgaggacg cgcgccaccg cctcttcatc 600
gactggaagc gccgggtcga cacccgcggc ccgcttcgtt tctccggcct cgcgctgccg 660
atgccgctgc cgtcgtcgca ctacggcggg ccccaccact acagcccgtg gggcttcggc 720
ggcggcggcg gcggcggcgg cggattcttc atgccgccct cgccgcccgc cacgctctac 780
gagcaccgcc tcagacaggg cctcgacttc cgcagcatga cgacgaccta ccccgcgccg 840
accgtgggga ggcagctcct gtttttcggc tcggccagga tgcctcctca tcacgcgccg 900
ccgccccagc cgcgcccgtt ctcgctgccg ctgcatcact acacggtgca accgagcgcc 960
gccggcgtca ccgccgcgtc acggccggtc cttcttgact cggtgccggt catcgagagc 1020
ccgacgaccg ccgcgaagcg cgtgcggctg ttcggcgtca acctggacaa caacccagat 1080
ggcggcggcg aggctagcca tcagggcgat gcattgtcat tgcagatgcc cgggtggcag 1140
caaaggactc caactctaag gctactagaa ttgcctcgcc atggcgggga gtcctccgcg 1200
gcgtcgtctc cgtcgtcgtc gtcttcctcc aagagggagg cgcgttcagc tttggatctc 1260
gatctgtga 1269
<210> 133
<211> 894
<212> DNA
<213> Brassica rapa
<400> 133
atgatgatga caaacttgtc tctttcaaga gaaggagaag aggaggaaga agaagaacaa 60
gaagaggcca agaagcccat ggaagaagta gagagagagc acatgttcga caaagtggtg 120
actccaagcg atgttggtaa actaaaccgg ctcgtgatcc caaagcaata cgcagagaga 180
tacttccctt tagattcatc cacaaacgag aaaggtttgc ttctaaactt cgaagatctc 240
gcaggaaagt catggaggtt ccgttactct tactggaaca gtagtcagag ctatgtcatg 300
actaaaggtt ggagccgttt cgttaaagac aaaaagctag acgccggaga tattgtctct 360
ttccagagat gtgtcggaga ttcaggaaga gacagccgct tgtttattga ttggaggaga 420
agacctaaag ttcctgacca tccgacatcg attgctcact ttgctgccgg atctatgttt 480
cctaggtttt acagttttcc gacagcaact agttacaatc tttacaacta tcagcagcca 540
cgtcatcatc atcacagtgg ttataattat cctcaaattc cgagagaatt tggatacggg 600
tacttggtgg atcaaagagc cgtggtggct gatccgttgg tgattgaatc tgtgccggtg 660
atgatgcacg gaggagctca agttagtcag gcggttgttg gaacggccgg gaagaggctg 720
aggctttttg gagtcgatat ggaggaagaa tcttcatctt ccggtgggag tttgccacgt 780
ggtgacgctt ctccgtcttc ctctttgttt cagctgagac ttggaagcag cagtgaagat 840
gatcacttct ctaagaaagg aaagtcctca ttgccttttg atttggatca ataa 894
<210> 134
<211> 307
<212> PRT
<213> Zea mays
<400> 134
Met Ala Thr Asn His Leu Ser Gln Gly Gln His Gln His Pro Gln Ala
1 5 10 15
Trp Pro Trp Gly Val Ala Met Tyr Thr Asn Leu His Tyr His His Gln
20 25 30
Gln His His His Tyr Glu Lys Glu His Leu Phe Glu Lys Pro Leu Thr
35 40 45
Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His
50 55 60
Ala Glu Arg Tyr Phe Pro Leu Ser Ser Ser Gly Ala Gly Asp Lys Gly
65 70 75 80
Leu Ile Leu Cys Phe Glu Asp Asp Asp Asp Asp Glu Ala Ala Ala Ala
85 90 95
Asn Lys Pro Trp Arg Phe Arg Tyr Ser Tyr Trp Thr Ser Ser Gln Ser
100 105 110
Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val Lys Glu Lys Gln Leu
115 120 125
Asp Ala Gly Asp Val Val Arg Phe Gln Arg Met Arg Gly Phe Gly Met
130 135 140
Pro Asp Arg Leu Phe Ile Ser His Ser Arg Arg Gly Glu Thr Thr Ala
145 150 155 160
Thr Ala Ala Thr Thr Val Pro Pro Ala Ala Ala Ala Val Arg Val Val
165 170 175
Val Ala Pro Ala Gln Ser Ala Gly Ala Asp His Gln Gln Gln Gln Gln
180 185 190
Pro Ser Pro Trp Ser Pro Met Cys Tyr Ser Thr Ser Gly Ser Tyr Ser
195 200 205
Tyr Pro Thr Ser Ser Pro Ala Asn Ser Gln His Ala Tyr His Arg His
210 215 220
Ser Ala Asp His Asp His Ser Asn Asn Met Gln His Ala Gly Glu Ser
225 230 235 240
Gln Ser Asp Arg Asp Asn Arg Ser Cys Ser Ala Ala Ser Ala Pro Pro
245 250 255
Pro Pro Ser Arg Arg Leu Arg Leu Phe Gly Val Asn Leu Asp Cys Gly
260 265 270
Pro Gly Pro Glu Pro Glu Thr Pro Thr Ala Met Tyr Gly Tyr Met His
275 280 285
Gln Ser Pro Tyr Ala Tyr Asn Asn Trp Gly Ser Pro Tyr Gln His Asp
290 295 300
Glu Glu Ile
305
<210> 135
<211> 924
<212> DNA
<213> Zea mays
<400> 135
atggccacga accatctctc ccaagggcag caccagcacc cgcaggcctg gccctggggc 60
gtggccatgt acaccaacct acactaccac caccagcagc accaccacta cgagaaggag 120
cacctgttcg agaagccgct gacgccgagc gacgtgggca agctcaacag gctggtgatc 180
cccaagcagc acgccgagag gtacttccct ctcagcagca gcggcgccgg cgacaaaggc 240
ctcatcctgt gcttcgagga cgacgacgac gacgaggctg ccgccgccaa caagccgtgg 300
cggttccgct actcgtactg gaccagcagc cagagctacg tgctcaccaa gggctggagc 360
cgctacgtca aggagaagca gcttgacgcc ggcgacgtcg tgcgcttcca gaggatgcgt 420
ggtttcggca tgcccgaccg cctgttcatc agccacagcc gccgcggcga gactactgct 480
actgctgcaa caacagtgcc ccccgctgct gctgccgtgc gcgtagtagt ggcacctgca 540
cagagcgctg gcgcagacca ccagcagcag cagcagccgt cgccttggag cccaatgtgc 600
tacagcacat caggctcgta ctcgtacccc accagcagcc cagccaattc ccagcatgcc 660
taccaccgcc actcagctga ccatgaccac agcaacaaca tgcaacatgc aggagaatct 720
cagtccgaca gagacaacag gagctgcagt gcagcttcgg caccgccgcc accgtcgcgg 780
cggctccggc tgttcggcgt aaacctcgac tgcggcccgg ggccggagcc ggagacacca 840
acggcgatgt acggctacat gcaccaaagc ccctacgctt acaacaactg gggcagtcca 900
taccagcatg acgaggagat ttaa 924
<210> 136
<211> 277
<212> PRT
<213> Zea mays
<400> 136
Met Glu Phe Thr Pro Ala His Ala His Ala Arg Val Val Glu Asp Ser
1 5 10 15
Glu Arg Pro Arg Gly Gly Val Ala Trp Val Glu Lys Glu His Met Phe
20 25 30
Glu Lys Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val
35 40 45
Ile Pro Lys Gln His Ala Glu Arg Tyr Phe Pro Ala Leu Asp Ala Ser
50 55 60
Ser Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Gly Gly Gly Lys Gly
65 70 75 80
Leu Val Leu Ser Phe Glu Asp Arg Ala Gly Lys Ala Trp Arg Phe Arg
85 90 95
Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met Thr Lys Gly Trp
100 105 110
Ser Arg Phe Val Lys Glu Lys Arg Leu Gly Ala Gly Asp Thr Val Leu
115 120 125
Phe Ala Arg Gly Ala Gly Gly Ala Arg Gly Arg Phe Phe Ile Asp Phe
130 135 140
Arg Arg Arg Arg Gln Asp Leu Ala Phe Leu Gln Pro Thr Leu Ala Ser
145 150 155 160
Ala Gln Arg Leu Leu Pro Leu Pro Ser Val Pro Ile Cys Pro Trp Gln
165 170 175
Asp Tyr Gly Ala Ser Ala Pro Ala Pro Asn Arg His Val Leu Phe Leu
180 185 190
Arg Pro Gln Val Pro Ala Ala Val Val Leu Lys Ser Val Pro Val His
195 200 205
Val Ala Ala Ser Ala Val Glu Ala Thr Met Ser Lys Arg Val Arg Leu
210 215 220
Phe Gly Val Asn Leu Asp Cys Pro Pro Asp Ala Glu Asp Ser Ala Thr
225 230 235 240
Val Pro Arg Gly Arg Ala Ala Ser Thr Thr Leu Leu Gln Leu Pro Ser
245 250 255
Pro Ser Ser Ser Thr Ser Ser Ser Thr Ala Gly Lys Asp Val Cys Cys
260 265 270
Leu Asp Leu Gly Leu
275
<210> 137
<211> 834
<212> DNA
<213> Zea mays
<400> 137
atggagttca cgcccgcgca tgcgcatgcc cgtgtcgttg aggattccga gaggcctcgc 60
ggcggcgtgg cctgggtgga gaaggagcac atgttcgaga aggtggtcac cccgagcgac 120
gtggggaagc tcaatcgcct ggtcatccca aagcagcacg cggagcgcta cttccccgcg 180
ctggacgcct cgtccgccgc ggcggcggcg gcggcagcag ccgcgggagg cgggaagggg 240
ctggtgctca gcttcgagga ccgggcgggg aaggcgtggc gcttccgcta ctcgtactgg 300
aacagcagcc agagctacgt gatgaccaaa ggttggagcc gcttcgtgaa ggagaagcgc 360
ctcggtgccg gggacacagt cttgttcgcg cgcggcgcgg gcggcgcgcg cggccgcttc 420
ttcatcgatt tccgccgccg tcgccaggat ctcgcgttcc tgcagccgac gctggcgtct 480
gcgcagcgac tcctgccgct gccgtcggtg cccatctgcc cgtggcagga ctacggcgcc 540
tcggctccgg cgcccaaccg gcacgtgctg ttcctgcggc cgcaggtgcc ggccgccgta 600
gtgctcaagt cggtccccgt gcacgttgct gcatccgcgg tggaggcgac catgtcgaag 660
cgcgtccgcc tgttcggggt gaacctcgac tgcccgccgg acgccgaaga cagcgccaca 720
gtcccccggg gccgggcggc gtcgacgacg cttctgcaac tgccctcgcc atcgtcgtca 780
acatcctcct cgacggcagg gaaggacgtg tgctgtttgg atcttggact gtga 834
<210> 138
<211> 273
<212> PRT
<213> Zea mays
<400> 138
Met Glu Phe Arg Pro Ala His Ala Arg Val Phe Glu Asp Ser Glu Arg
1 5 10 15
Pro Arg Gly Gly Val Ala Trp Leu Glu Lys Glu His Met Phe Glu Lys
20 25 30
Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro
35 40 45
Lys Gln His Ala Glu Arg Tyr Phe Pro Ala Leu Asp Ala Ser Ala Ala
50 55 60
Ala Ala Ser Ala Ser Ala Ser Ala Gly Gly Gly Lys Ala Gly Leu Val
65 70 75 80
Leu Ser Phe Glu Asp Arg Ala Gly Lys Ala Trp Arg Phe Arg Tyr Ser
85 90 95
Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met Thr Lys Gly Trp Ser Arg
100 105 110
Phe Val Lys Glu Lys Arg Leu Gly Ala Gly Asp Thr Val Leu Phe Ala
115 120 125
Arg Gly Ala Gly Ala Thr Arg Gly Arg Phe Phe Ile Asp Phe Arg Arg
130 135 140
Arg Arg His Glu Leu Ala Phe Leu Gln Pro Pro Leu Ala Ser Ala Gln
145 150 155 160
Arg Leu Leu Pro Leu Pro Ser Val Pro Ile Cys Pro Trp Gln Gly Tyr
165 170 175
Gly Ala Ser Ala Pro Ala Pro Ser Arg His Val Leu Phe Leu Arg Pro
180 185 190
Gln Val Pro Ala Ala Val Val Leu Thr Ser Val Pro Val Arg Val Ala
195 200 205
Ala Ser Ala Val Glu Glu Ala Thr Arg Ser Lys Arg Val Arg Leu Phe
210 215 220
Gly Val Asn Leu Asp Cys Pro Pro Asp Ala Glu Asp Gly Ala Thr Ala
225 230 235 240
Thr Arg Thr Pro Ser Thr Leu Leu Gln Leu Pro Ser Pro Ser Ser Ser
245 250 255
Thr Ser Ser Ser Thr Gly Gly Lys Asp Val Arg Ser Leu Asp Leu Gly
260 265 270
Leu
<210> 139
<211> 822
<212> DNA
<213> Zea mays
<400> 139
atggagttca ggcccgcgca tgcccgtgtc ttcgaggatt ccgagaggcc tcgcggcggc 60
gtggcgtggc tggagaagga gcacatgttc gagaaagtgg tcaccccgag cgacgtgggg 120
aagctcaatc gcctggtcat cccgaagcag cacgccgagc gctacttccc cgcgctggac 180
gcctcggccg ccgcggcgtc ggcatcggcg tcggcgggcg gcgggaaggc ggggctggtg 240
ctcagcttcg aggaccgggc ggggaaggcg tggcgcttcc gctactcgta ctggaacagc 300
agccagagct acgtgatgac caagggatgg agccgcttcg tgaaagagaa gcgcctcggt 360
gccggggaca cggtattgtt cgcgcgcggc gcgggcgcca cgcgcggccg cttcttcatc 420
gatttccgcc gccgccgcca cgagctcgcg ttcctgcagc cgccgctggc gtctgcgcag 480
cgcctcctgc cgctcccgtc ggtgcccatc tgcccgtggc agggctacgg cgcctccgct 540
ccggcgccaa gccggcacgt gctgttcctg cggccgcagg tgccggccgc cgtagtgctc 600
acgtcggtgc ccgtgcgcgt cgccgcatcc gcggtggagg aggcgacgag gtcgaagcgc 660
gtccgcctgt tcggggtgaa cctcgactgc ccgccggacg ccgaagacgg tgccacagcc 720
acccggacgc cgtcgacgct tctgcagctg ccctcgccat cgtcgtcaac atcctcctcc 780
acgggaggca aggatgtgcg ttctttggat cttggacttt ga 822
<210> 140
<211> 350
<212> PRT
<213> Triticum aestivum
<400> 140
Met Gly Val Glu Ile Leu Ser Ser Met Val Glu His Ser Phe Gln Tyr
1 5 10 15
Ser Ser Gly Val Ser Thr Ala Thr Thr Glu Ser Gly Thr Ala Gly Thr
20 25 30
Pro Pro Arg Pro Leu Ser Leu Pro Val Ala Ile Ala Asp Glu Ser Val
35 40 45
Thr Ser Arg Ser Ala Ser Ser Arg Phe Lys Gly Val Val Pro Gln Pro
50 55 60
Asn Gly Arg Trp Gly Ala Gln Ile Tyr Glu Arg His Ala Arg Val Trp
65 70 75 80
Leu Gly Thr Phe Pro Asp Gln Asp Ser Ala Ala Arg Ala Tyr Asp Val
85 90 95
Ala Ser Leu Arg Tyr Arg Gly Arg Asp Val Ala Phe Asn Phe Pro Cys
100 105 110
Ala Ala Val Glu Gly Glu Leu Ala Phe Leu Ala Ala His Ser Lys Ala
115 120 125
Glu Ile Val Asp Met Leu Arg Lys Gln Thr Tyr Ala Asp Glu Leu Arg
130 135 140
Gln Gly Leu Arg Arg Gly Arg Gly Met Gly Ala Arg Ala Gln Pro Thr
145 150 155 160
Pro Ser Trp Ala Arg Glu Pro Leu Phe Glu Lys Ala Val Thr Pro Ser
165 170 175
Asp Val Gly Lys Leu Asn Arg Leu Val Val Pro Lys Gln His Ala Glu
180 185 190
Lys His Phe Pro Leu Lys Arg Thr Pro Glu Thr Pro Thr Thr Thr Gly
195 200 205
Lys Gly Val Leu Leu Asn Phe Glu Asp Gly Glu Gly Lys Val Trp Arg
210 215 220
Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys
225 230 235 240
Gly Trp Ser Arg Phe Val Arg Glu Lys Gly Leu Gly Ala Gly Asp Ser
245 250 255
Ile Leu Phe Ser Cys Ser Leu Tyr Glu Gln Glu Lys Gln Phe Phe Ile
260 265 270
Asp Cys Lys Lys Asn Thr Ser Met Asn Gly Gly Lys Ser Ala Ser Pro
275 280 285
Leu Pro Val Gly Val Thr Thr Lys Gly Glu Gln Val Arg Val Val Arg
290 295 300
Leu Phe Gly Val Asp Ile Ser Gly Val Lys Arg Gly Arg Ala Ala Thr
305 310 315 320
Ala Thr Ala Glu Gln Gly Leu Gln Glu Leu Phe Lys Arg Gln Cys Val
325 330 335
Ala Pro Gly Gln His Ser Pro Ala Leu Gly Ala Phe Ala Leu
340 345 350
<210> 141
<211> 1053
<212> DNA
<213> Triticum aestivum
<400> 141
atgggggtgg aaatcctgag ctccatggtg gagcactcct tccagtactc ttccggcgtg 60
tccacggcca cgacggagtc aggcaccgcc ggaacaccgc cgaggccttt gagcctacct 120
gtcgccatcg ccgacgagtc cgtgacctcg cggtcggcgt cgtctcggtt caagggcgtg 180
gtgccgcagc caaacgggcg atggggcgcc cagatctacg agcgccacgc tcgcgtctgg 240
ctcggcacgt tcccagacca ggactcggcg gcgcgcgcct acgacgtagc ctcgctcagg 300
taccgcggcc gcgacgtcgc cttcaacttc ccgtgcgcgg ccgtggaggg ggagctcgcc 360
ttcctggcgg cgcactccaa ggctgagata gtggacatgc tccggaagca gacctacgcc 420
gatgaactcc gccagggcct gcggcgcggc cgtggcatgg gggcgcgcgc gcagccgacg 480
ccgtcgtggg cgcgggagcc ccttttcgag aaggccgtga cccctagcga tgtcggcaag 540
ctcaatcgcc tcgtagtgcc gaagcagcac gccgagaagc acttccccct gaagcgcacg 600
ccggagacgc cgaccaccac cggcaagggc gtgctgctca acttcgagga cggcgagggg 660
aaggtgtgga ggttccggta ctcgtactgg aacagcagcc agagctacgt gctcaccaaa 720
ggctggagcc gcttcgtccg ggagaagggc ctaggtgccg gcgactccat cctattctcg 780
tgctcgctgt acgaacagga gaagcagttc ttcatcgact gcaagaagaa cactagcatg 840
aacggaggca aatcggcgtc gccgctgcca gtgggggtga ctaccaaagg agaacaagtt 900
cgcgtcgtta ggctattcgg tgtcgacatc tcgggagtga agagggggcg agcggcgacg 960
gcaacggcgg agcaaggcct gcaggagttg ttcaagaggc aatgcgtggc acccggccag 1020
cactctcctg ccctaggtgc cttcgcctta tag 1053
<210> 142
<211> 320
<212> PRT
<213> Triticum aestivum
<400> 142
Met Ala Ser Gly Lys Pro Thr Asn His Gly Met Glu Asp Asp Asn Asp
1 5 10 15
Met Glu Tyr Ser Ser Ala Glu Ser Gly Ala Glu Asp Ala Ala Glu Pro
20 25 30
Ser Ser Ser Pro Val Leu Ala Pro Pro Arg Ala Ala Pro Ser Ser Arg
35 40 45
Phe Lys Gly Val Val Pro Gln Pro Asn Gly Arg Trp Gly Ala Gln Ile
50 55 60
Tyr Glu Lys His Ser Arg Val Trp Leu Gly Thr Phe Pro Asp Glu Asp
65 70 75 80
Ala Ala Val Arg Ala Tyr Asp Val Ala Ala Leu Arg Phe Arg Gly Pro
85 90 95
Asp Ala Val Ile Asn His Gln Arg Pro Thr Ala Ala Glu Glu Ala Gly
100 105 110
Ser Ser Ser Ser Arg Ser Glu Leu Asp Pro Glu Leu Gly Phe Leu Ala
115 120 125
Asp His Ser Lys Ala Glu Ile Val Asp Met Leu Arg Lys His Thr Tyr
130 135 140
Asp Asp Glu Leu Arg Gln Gly Leu Arg Arg Gly Arg Gly Arg Ala Gln
145 150 155 160
Pro Thr Pro Ala Trp Ala Arg Glu Leu Leu Phe Glu Lys Ala Val Thr
165 170 175
Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Val Pro Lys Gln Gln
180 185 190
Ala Glu Lys His Phe Pro Pro Thr Thr Ala Ala Ala Thr Gly Ser Asn
195 200 205
Gly Lys Gly Val Leu Leu Asn Phe Glu Asp Gly Glu Gly Lys Val Trp
210 215 220
Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr
225 230 235 240
Lys Gly Trp Ser Arg Phe Val Lys Glu Thr Gly Leu Arg Ala Gly Asp
245 250 255
Thr Val Ala Phe Tyr Arg Ser Ala Tyr Gly Asn Asp Thr Glu Asp Gln
260 265 270
Leu Phe Ile Asp Tyr Lys Lys Met Asn Lys Asn Asp Asp Ala Ala Asp
275 280 285
Ala Ala Ile Ser Asp Glu Asn Glu Thr Gly His Val Ala Val Lys Leu
290 295 300
Phe Gly Val Asp Ile Ala Gly Gly Gly Met Ala Gly Ser Ser Gly Gly
305 310 315 320
<210> 143
<211> 963
<212> DNA
<213> Triticum aestivum
<400> 143
atggcatctg gcaagccgac aaaccacggg atggaggacg acaacgacat ggagtactcc 60
tccgcggaat cgggggccga ggacgcggcg gagccgtcgt cgtcgccggt gctggcgccg 120
ccccgggcgg ctccatcgtc gcggttcaag ggcgtcgtgc cgcagcccaa cgggcggtgg 180
ggagcgcaga tctacgagaa gcactcgcgg gtgtggctcg gaacgttccc cgacgaggac 240
gccgccgtgc gcgcctacga cgtggccgcg ctccgcttcc gcggcccgga cgccgtcatc 300
aaccaccagc gaccgacggc cgcggaggag gccggctcgt cgtcgtccag gagcgagctg 360
gatccagagc tcggcttcct tgccgaccac tccaaggccg agatcgtcga catgctccgg 420
aagcacacct acgacgacga gctccgtcag ggcctgcgcc gcggccgcgg gcgcgcgcag 480
ccgacgccgg cgtgggcacg agagctcctc ttcgagaagg ccgtgacccc gagcgacgtc 540
ggcaagctca accgcctcgt ggtgccgaag cagcaggccg agaagcactt ccctccgacc 600
actgcggcgg ccaccggcag caacggcaag ggcgtgctgc tcaacttcga ggacggcgaa 660
gggaaggtgt ggcgcttccg gtactcgtac tggaacagca gccagagcta cgtgctcacc 720
aagggctgga gccgcttcgt caaggagacg ggcctccgcg ccggcgacac cgtggcgttc 780
taccggtcgg cgtacgggaa tgacacggag gatcagctct tcatcgacta caagaagatg 840
aacaagaatg acgatgctgc ggacgcggcg atttccgatg agaatgagac aggccatgtc 900
gccgtcaagc tcttcggcgt tgacattgcc ggtggaggga tggcgggatc atcaggtggc 960
tga 963
<210> 144
<211> 320
<212> PRT
<213> Triticum aestivum
<400> 144
Met Ala Ser Gly Lys Pro Thr Asn His Gly Met Glu Asp Asp Asn Asp
1 5 10 15
Met Glu Tyr Ser Ser Ala Glu Ser Gly Ala Glu Asp Ala Ala Glu Pro
20 25 30
Ser Ser Ser Pro Val Leu Ala Pro Pro Arg Ala Ala Pro Ser Ser Arg
35 40 45
Phe Lys Gly Val Val Pro Gln Pro Asn Gly Arg Trp Gly Ala Gln Ile
50 55 60
Tyr Glu Lys His Ser Arg Val Trp Leu Gly Thr Phe Pro Asp Glu Asp
65 70 75 80
Ala Ala Ala Arg Ala Tyr Asp Val Ala Ala Leu Arg Phe Arg Gly Pro
85 90 95
Asp Ala Val Ile Asn His Gln Arg Pro Thr Ala Ala Glu Glu Ala Gly
100 105 110
Ser Ser Ser Ser Arg Ser Glu Leu Asp Pro Glu Leu Gly Phe Leu Ala
115 120 125
Asp His Ser Lys Ala Glu Ile Val Asp Met Leu Arg Lys His Thr Tyr
130 135 140
Asp Asp Glu Leu Arg Gln Gly Leu Arg Arg Gly Arg Gly Arg Ala Gln
145 150 155 160
Pro Thr Pro Ala Trp Ala Arg Glu Leu Leu Phe Glu Lys Ala Val Thr
165 170 175
Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Val Pro Lys Gln Gln
180 185 190
Ala Glu Lys His Phe Pro Pro Thr Thr Ala Ala Ala Thr Gly Ser Asn
195 200 205
Gly Lys Gly Val Leu Leu Asn Phe Glu Asp Gly Glu Gly Lys Val Trp
210 215 220
Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr
225 230 235 240
Lys Gly Trp Ser Arg Phe Val Lys Glu Thr Gly Leu Arg Ala Gly Asp
245 250 255
Thr Val Ala Phe Tyr Arg Ser Ala Tyr Gly Asn Asp Thr Glu Asp Gln
260 265 270
Leu Phe Ile Asp Tyr Lys Lys Met Asn Lys Asn Asp Asp Ala Ala Asp
275 280 285
Ala Ala Ile Ser Asp Glu Asn Glu Thr Gly His Val Ala Val Lys Leu
290 295 300
Phe Gly Val Asp Ile Ala Gly Gly Gly Met Ala Gly Ser Ser Gly Gly
305 310 315 320
<210> 145
<211> 963
<212> DNA
<213> Triticum aestivum
<400> 145
atggcatctg gcaagccgac aaaccacggg atggaggacg acaacgacat ggagtactcc 60
tccgcggaat cgggggccga ggacgcggcg gagccgtcgt cgtcgccggt gctggcgccg 120
ccccgggcgg ctccatcgtc gcggttcaag ggcgtcgtgc cgcagcccaa cgggcggtgg 180
ggagcgcaga tctacgagaa gcactcgcgg gtgtggctcg gaacgttccc cgacgaggac 240
gccgccgcgc gcgcctacga cgtggccgcg ctccgcttcc gcggcccgga cgccgtcatc 300
aaccaccagc gaccgacggc cgcggaggag gccggctcgt cgtcgtccag gagcgagctg 360
gatccagagc tcggcttcct cgccgaccac tccaaggccg agatcgtcga catgctccgg 420
aagcacacct acgacgacga gctccgtcag ggcctgcgcc gcggccgcgg gcgcgcgcag 480
ccgacgccgg cgtgggcacg agagctcctc ttcgagaagg ccgtgacccc gagcgacgtc 540
ggcaagctca accgcctcgt ggtgccgaag cagcaggccg agaagcactt ccctccgacc 600
actgcggcgg ccaccggcag caacggcaag ggcgtgctgc tcaacttcga ggacggcgaa 660
gggaaggtgt ggcgcttccg gtactcgtac tggaacagca gccagagcta cgtgctcacc 720
aagggctgga gccgcttcgt caaggagacg ggcctccgcg ccggcgacac cgtggcgttc 780
taccggtcgg cgtacgggaa tgacacggag gatcagctct tcatcgacta caagaagatg 840
aacaagaatg acgatgctgc ggacgcggcg atttccgatg agaatgagac aggccatgtc 900
gccgtcaagc tcttcggcgt tgacattgcc ggtggaggga tggcgggatc atcaggtggc 960
tga 963
<210> 146
<211> 488
<212> RNA
<213> Oryza sativa
<400> 146
gacggccagt gccaagcttc tcggatccac tagtaacggc cgccagtgtg ctggaattgc 60
ccttggatca tgaaccaacg gcctggctgt atttggtggt tgtgtaggga gatggggaga 120
agaaaagccc gattctcttc gctgtgatgg gctggatgca tgcgggggag cgggaggccc 180
aagtacgtgc acggtgagcg gcccacaggg cgagtgtgag cgcgagaggc gggaggaaca 240
gtttagtacc acattgccca gctaactcga acgcgaccaa cttataaacc cgcgcgctgt 300
cgcttgtgtg ggaaggaaga gacagattgg ttttagagct agaaatagca agttaaaata 360
aggctagtcc gttatcaact tgaaaaagtg gcaccgagtc ggtgcttttt ttgtcccttc 420
gaagggcaat tctgcagata tccatcacac tggcggccgc tcgaggtcga agcttgcatg 480
cctgcagg 488
<210> 147
<211> 41
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 147
ggactggggt tgctcctggg acacaagcga cagcgcgcgg g 41
<210> 148
<211> 41
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 148
cccaggagca accccagtcc gttttagagc tagaaatagc a 41
<210> 149
<211> 42
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 149
tgctatttct agctctaaaa cacacaagcg acagcgcgcg gg 42
<210> 150
<211> 41
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 150
gcccctgacg cccagtgacg gttttagagc tagaaatagc a 41
<210> 151
<211> 41
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 151
gggggtgccc ctgggcgaga acacaagcga cagcgcgcgg g 41
<210> 152
<211> 41
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 152
tctcgcccag gggcaccccc gttttagagc tagaaatagc a 41
<210> 153
<211> 41
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 153
ctcgtagtgg tggtggtagt acacaagcga cagcgcgcgg g 41
<210> 154
<211> 41
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 154
actaccacca ccactacgag gttttagagc tagaaatagc a 41
<210> 155
<211> 15681
<212> DNA
<213> Artificial Sequence
<220>
<223> vector
<220>
<221> misc_feature
<222> 5964..5984
<223> /note="target sequence"
<220>
<221> misc_feature
<222> 6617..6637
<223> /note="target sequence"
<400> 155
aattcccgat ctagtaacat agatgacacc gcgcgcgata atttatccta gtttgcgcgc 60
tatattttgt tttctatcgc gtattaaatg tataattgcg ggactctaat cataaaaacc 120
catctcataa ataacgtcat gcattacatg ttaattatta catgcttaac gtaattcaac 180
agaaattata tgataatcat cgcaagaccg gcaacaggat tcaatcttaa gaaactttat 240
tgccaaatgt ttgaacgatc ggggaaattc gagctctatc gatcaatcag gatccttact 300
ttttcttttt tgcctggccg gcctttttcg tggccgccgg ccttttgtcg cctcccagct 360
gagacaggtc gatccgtgtc tcgtacaggc cggtgatgct ctggtggatc agggtggcgt 420
ccagcacctc tttggtgctg gtgtacctct tccggtcgat ggtggtgtca aagtacttga 480
aggcggcagg ggctcccaga ttggtcaggg taaacaggtg gatgatattc tcggcctgct 540
ctctgatggg cttatcccgg tgcttgttgt aggcggacag cactttgtcc agattagcgt 600
cggccaggat cactctcttg gagaactcgc tgatctgctc gatgatctcg tccaggtagt 660
gcttgtgctg ttccacaaac agctgtttct gctcattatc ctcgggggag cccttcagct 720
tctcatagtg gctggccagg tacaggaagt tcacatattt ggagggcagg gccagttcgt 780
ttcccttctg cagttcgccg gcagaggcca gcattctctt ccggccgttt tccagctcga 840
acagggagta cttaggcagc ttgatgatca ggtccttttt cacttctttg tagcccttgg 900
cttccagaaa gtcgatggga ttcttctcga agctgcttct ttccatgatg gtgatcccca 960
gcagctcttt cacactcttc agtttcttgg acttgccctt ttccactttg gccaccacca 1020
gcacagaata ggccacggtg gggctgtcga agccgccgta cttcttaggg tcccagtcct 1080
tctttctggc gatcagctta tcgctgttcc tcttgggcag gatagactct ttgctgaagc 1140
cgcctgtctg cacctcggtc tttttcacga tattcacttg gggcatgctc agcactttcc 1200
gcacggtggc aaaatcccgg cccttatccc acacgatctc cccggtttcg ccgtttgtct 1260
cgatcagagg ccgcttccgg atctcgccgt tggccagggt aatctcggtc ttgaaaaagt 1320
tcatgatgtt gctgtagaag aagtacttgg cggtagcctt gccgatttcc tgctcgctct 1380
tggcgatcat cttccgcacg tcgtacacct tgtagtcgcc gtacacgaac tcgctttcca 1440
gcttagggta ctttttgatc agggcggttc ccacgacggc gttcaggtag gcgtcgtggg 1500
cgtggtggta gttgttgatc tcgcgcactt tgtaaaactg gaaatccttc cggaaatcgg 1560
acaccagctt ggacttcagg gtgatcactt tcacttcccg gatcagcttg tcattctcgt 1620
cgtacttagt gttcatccgg gagtccagga tctgtgccac gtgctttgtg atctgccggg 1680
tttccaccag ctgtctcttg atgaagccgg ccttatccag ttcgctcagg ccgcctctct 1740
cggccttggt cagattgtcg aactttctct gggtaatcag cttggcgttc agcagctgcc 1800
gccagtagtt cttcatcttc ttcacgacct cttcggaggg cacgttgtcg ctcttgcccc 1860
ggttcttgtc gcttctggtc agcaccttgt tgtcgatgga gtcgtccttc agaaagctct 1920
gaggcacgat atggtccaca tcgtagtcgg acagccggtt gatgtccagt tcctggtcca 1980
cgtacatatc ccgcccattc tgcaggtagt acaggtacag cttctcgttc tgcagctggg 2040
tgttttccac ggggtgttct ttcaggatct ggctgcccag ctctttgatg ccctcttcga 2100
tccgcttcat tctctcgcgg ctgttcttct gtcccttctg ggtggtctgg ttctctctgg 2160
ccatttcgat cacgatgttc tcgggcttgt gccggcccat cactttcacg agctcgtcca 2220
ccaccttcac tgtctgcagg atgcccttct taatggcggg gctgccggcc agattggcaa 2280
tgtgctcgtg caggctatcg ccctggccgg acacctgggc tttctggatg tcctctttaa 2340
aggtcaggct gtcgtcgtgg atcagctgca tgaagtttct gttggcgaag ccgtcggact 2400
tcaggaaatc caggattgtc ttgccggact gcttgtcccg gatgccgttg atcagcttcc 2460
ggctcagcct gccccagccg gtgtatctcc gccgcttcag ctgcttcatc actttgtcgt 2520
cgaacaggtg ggcataggtt ttcagccgtt cctcgatcat ctctctgtcc tcaaacagtg 2580
tcagggtcag cacgatatct tccagaatgt cctcgttttc ctcattgtcc aggaagtcct 2640
tgtccttgat aattttcagc agatcgtggt atgtgcccag ggaggcgttg aaccgatctt 2700
ccacgccgga gatttccacg gagtcgaagc actcgatttt cttgaagtag tcctctttca 2760
gctgcttcac ggtcactttc cggttggtct tgaacagcag gtccacgatg gcctttttct 2820
gctcgccgct caggaaggcg ggctttctca ttccctcggt cacgtatttc actttggtca 2880
gctcgttata cacggtgaag tactcgtaca gcaggctgtg cttgggcagc accttctcgt 2940
tgggcaggtt cttatcgaag ttggtcatcc gctcgatgaa gctctgggcg gaagcgccct 3000
tgtccaccac ttcctcgaag ttccaggggg tgatggtttc ctcgctcttt ctggtcatcc 3060
aggcgaatct gctgtttccc ctggccagag ggcccacgta gtaggggatg cggaaggtca 3120
ggatcttctc gatcttttcc cggttgtcct tcaggaatgg gtaaaaatct tcctgccgcc 3180
gcagaatggc gtgcagctct cccaggtgga tctggtgggg gatgctgccg ttgtcgaagg 3240
tccgctgctt ccgcagcagg tcctctctgt tcagcttcac gagcagttcc tcggtgccgt 3300
ccatcttttc caggatgggc ttgatgaact tgtagaactc ttcctggctg gctccgccgt 3360
caatgtagcc ggcgtagccg ttcttgctct ggtcgaagaa aatctctttg tacttctcag 3420
gcagctgctg ccgcacgaga gctttcagca gggtcaggtc ctggtggtgc tcgtcgtatc 3480
tcttgatcat agaggcgctc aggggggcct tggtgatctc ggtgttcact ctcaggatgt 3540
cgctcagcag gatggcgtcg gacaggttct tggcggccag aaacaggtcg gcgtactggt 3600
cgccgatctg ggccagcagg ttgtccaggt cgtcgtcgta ggtgtccttg ctcagctgca 3660
gtttggcatc ctcggccagg tcgaagttgc tcttgaagtt gggggtcagg cccaggctca 3720
gggcaatcag gtttccgaac aggccattct tcttctcgcc gggcagctgg gcgatcagat 3780
tttccagccg tctgctcttg ctcagtctgg cagacaggat ggccttggcg tccacgccgc 3840
tggcgttgat ggggttttcc tcgaacagct ggttgtaggt ctgcaccagc tggatgaaca 3900
gcttgtccac gtcgctgttg tcggggttca ggtcgccctc gatcaggaag tggccccgga 3960
acttgatcat gtgggccagg gccagataga tcagccgcag gtcggccttg tcggtgctgt 4020
ccaccagttt ctttctcagg tggtagatgg tggggtactt ctcgtggtag gccacctcgt 4080
ccacgatgtt gccgaagatg gggtgccgct cgtgcttctt atcctcttcc accaggaagg 4140
actcttccag tctgtggaag aagctgtcgt ccaccttggc catctcgttg ctgaagatct 4200
cttgcagata gcagatccgg ttcttccgtc tggtgtatct tcttctggcg gttctcttca 4260
gccgggtggc ctcggctgtt tcgccgctgt cgaacagcag ggctccgatc aggttcttct 4320
tgatgctgtg ccggtcggtg ttgcccagca ccttgaattt cttgctgggc accttgtact 4380
cgtcggtgat cacggcccag cccacagagt tggtgccgat gtccaggccg atgctgtact 4440
tcttgtcggc tgctgggact ccgtggatac cgaccttccg cttcttcttt ggggccatct 4500
tatcgtcatc gtctttgtaa tcaatatcat gatccttgta gtctccgtcg tggtccttat 4560
agtccatctc gagtatcgtt cgtaaatggt gaaaattttc agaaaattgc ttttgcttta 4620
aaagaaatga tttaaattgc tgcaatagaa gtagaatgct tgattgcttg agattcgttt 4680
gttttgtata tgttgtgttg aggtcgaggt cctctccaaa tgaaatgaac ttccttatat 4740
agaggaaggg tcttgcgaag gatagtggga ttgtgcgtca tcccttacgt cagtggagat 4800
atcacatcaa tccacttgct ttgaagacgt ggttggaacg tcttcttttt ccacgatgct 4860
cctcgtgggt gggggtccat ctttgggacc actgtcggca gaggcatctt caacgatggc 4920
ctttccttta tcgcaatgat ggcatttgta ggagccacct tccttttcca ctatcttcac 4980
aataaagtga cagatagctg ggcaatggaa tccgaggagg tttccggata tcaccctttg 5040
ttgaaaagtc tcaattgccc tttggtcttc tgagactgta tctttgatat ttttggagta 5100
gacaagtgtg tcgtgctcca ccatgttatc acatcaatcc acttgctttg aagacgtggt 5160
tggaacgtct tctttttcca cgatgctcct cgtgggtggg ggtccatctt tgggaccact 5220
gtcggcagag gcatcttcaa cgatggcctt tcctttatcg caatgatggc atttgtagga 5280
gccaccttcc ttttccacta tcttcacaat aaagtgacag atagctgggc aatggaatcc 5340
gaggaggttt ccggatatta ccctttgttg aaaagtctca attgcccttt ggtcttctga 5400
gactgtatct ttgatatttt tggagtagac aagtgtgtcg tgctccacca tgttgacctg 5460
caggcatgcc tcggatccac tagtaacggc cgccagtgtg ctggaattgc ccttaagctt 5520
cgttgaacaa cggaaactcg acttgccttc cgcacaatac atcatttctt cttagctttt 5580
tttcttcttc ttcgttcata cagttttttt ttgtttatca gcttacattt tcttgaaccg 5640
tagctttcgt tttcttcttt ttaactttcc attcggagtt tttgtatctt gtttcatagt 5700
ttgtcccagg attagaatga ttaggcatcg aaccttcaag aatttgattg aataaaacat 5760
cttcattctt aagatatgaa gataatcttc aaaaggcccc tgggaatctg aaagaagaga 5820
agcaggccca tttatatggg aaagaacaat agtatttctt atataggccc atttaagttg 5880
aaaacaatct tcaaaagtcc cacatcgctt agataagaaa acgaagctga gtttatatac 5940
agctagagtc gaagtagtga tttccccacg tcactgggcg tcgttttaga gctagaaata 6000
gcaagttaaa ataaggctag tccgttatca acttgaaaaa gtggcaccga gtcggtgctt 6060
tttttgtccc ttcgaagggc ctttctcaga tatccatcac actggcggcc gctcgaggtc 6120
gctcggatcc actagtaacg gccgccagtg tgctggaatt gcccttaagc ttcgttgaac 6180
aacggaaact cgacttgcct tccgcacaat acatcatttc ttcttagctt tttttcttct 6240
tcttcgttca tacagttttt ttttgtttat cagcttacat tttcttgaac cgtagctttc 6300
gttttcttct ttttaacttt ccattcggag tttttgtatc ttgtttcata gtttgtccca 6360
ggattagaat gattaggcat cgaaccttca agaatttgat tgaataaaac atcttcattc 6420
ttaagatatg aagataatct tcaaaaggcc cctgggaatc tgaaagaaga gaagcaggcc 6480
catttatatg ggaaagaaca atagtatttc ttatataggc ccatttaagt tgaaaacaat 6540
cttcaaaagt cccacatcgc ttagataaga aaacgaagct gagtttatat acagctagag 6600
tcgaagtagt gattcacacc ccatggccag gactgtttta gagctagaaa tagcaagtta 6660
aaataaggct agtccgttat caacttgaaa aagtggcacc gagtcggtgc tttttttgtc 6720
ccttcgaagg gcctttctca gatatccatc acactggcgg ccgctcgagg tcgaagcttg 6780
gcactggccg tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat 6840
cgccttgcag cacatccccc tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat 6900
cgcccttccc aacagttgcg cagcctgaat ggcgaatgct agagcagctt gagcttggat 6960
cagattgtcg tttcccgcct tcagtttaaa ctatcagtgt ttgacaggat atattggcgg 7020
gtaaacctaa gagaaaagag cgtttattag aataacggat atttaaaagg gcgtgaaaag 7080
gtttatccgt tcgtccattt gtatgtgcat gccaaccaca gggttcccct cgggatcaaa 7140
gtactttgat ccaacccctc cgctgctata gtgcagtcgg cttctgacgt tcagtgcagc 7200
cgtcttctga aaacgacatg tcgcacaagt cctaagttac gcgacaggct gccgccctgc 7260
ccttttcctg gcgttttctt gtcgcgtgtt ttagtcgcat aaagtagaat acttgcgact 7320
agaaccggag acattacgcc atgaacaaga gcgccgccgc tggcctgctg ggctatgccc 7380
gcgtcagcac cgacgaccag gacttgacca accaacgggc cgaactgcac gcggccggct 7440
gcaccaagct gttttccgag aagatcaccg gcaccaggcg cgaccgcccg gagctggcca 7500
ggatgcttga ccacctacgc cctggcgacg ttgtgacagt gaccaggcta gaccgcctgg 7560
cccgcagcac ccgcgaccta ctggacattg ccgagcgcat ccaggaggcc ggcgcgggcc 7620
tgcgtagcct ggcagagccg tgggccgaca ccaccacgcc ggccggccgc atggtgttga 7680
ccgtgttcgc cggcattgcc gagttcgagc gttccctaat catcgaccgc acccggagcg 7740
ggcgcgaggc cgccaaggcc cgaggcgtga agtttggccc ccgccctacc ctcaccccgg 7800
cacagatcgc gcacgcccgc gagctgatcg accaggaagg ccgcaccgtg aaagaggcgg 7860
ctgcactgct tggcgtgcat cgctcgaccc tgtaccgcgc acttgagcgc agcgaggaag 7920
tgacgcccac cgaggccagg cggcgcggtg ccttccgtga ggacgcattg accgaggccg 7980
acgccctggc ggccgccgag aatgaacgcc aagaggaaca agcatgaaac cgcaccagga 8040
cggccaggac gaaccgtttt tcattaccga agagatcgag gcggagatga tcgcggccgg 8100
gtacgtgttc gagccgcccg cgcacgtctc aaccgtgcgg ctgcatgaaa tcctggccgg 8160
tttgtctgat gccaagctgg cggcctggcc ggccagcttg gccgctgaag aaaccgagcg 8220
ccgccgtcta aaaaggtgat gtgtatttga gtaaaacagc ttgcgtcatg cggtcgctgc 8280
gtatatgatg cgatgagtaa ataaacaaat acgcaagggg aacgcatgaa ggttatcgct 8340
gtacttaacc agaaaggcgg gtcaggcaag acgaccatcg caacccatct agcccgcgcc 8400
ctgcaactcg ccggggccga tgttctgtta gtcgattccg atccccaggg cagtgcccgc 8460
gattgggcgg ccgtgcggga agatcaaccg ctaaccgttg tcggcatcga ccgcccgacg 8520
attgaccgcg acgtgaaggc catcggccgg cgcgacttcg tagtgatcga cggagcgccc 8580
caggcggcgg acttggctgt gtccgcgatc aaggcagccg acttcgtgct gattccggtg 8640
cagccaagcc cttacgacat atgggccacc gccgacctgg tggagctggt taagcagcgc 8700
attgaggtca cggatggaag gctacaagcg gcctttgtcg tgtcgcgggc gatcaaaggc 8760
acgcgcatcg gcggtgaggt tgccgaggcg ctggccgggt acgagctgcc cattcttgag 8820
tcccgtatca cgcagcgcgt gagctaccca ggcactgccg ccgccggcac aaccgttctt 8880
gaatcagaac ccgagggcga cgctgcccgc gaggtccagg cgctggccgc tgaaattaaa 8940
tcaaaactca tttgagttaa tgaggtaaag agaaaatgag caaaagcaca aacacgctaa 9000
gtgccggccg tccgagcgca cgcagcagca aggctgcaac gttggccagc ctggcagaca 9060
cgccagccat gaagcgggtc aactttcagt tgccggcgga ggatcacacc aagctgaaga 9120
tgtacgcggt acgccaaggc aagaccatta ccgagctgct atctgaatac atcgcgcagc 9180
taccagagta aatgagcaaa tgaataaatg agtagatgaa ttttagcggc taaaggaggc 9240
ggcatggaaa atcaagaaca accaggcacc gacgccgtgg aatgccccat gtgtggagga 9300
acgggcggtt ggccaggcgt aagcggctgg gttgtctgcc ggccctgcaa tggcactgga 9360
acccccaagc ccgaggaatc ggcgtgacgg tcgcaaacca tccggcccgg tacaaatcgg 9420
cgcggcgctg ggtgatgacc tggtggagaa gttgaaggcc gcgcaggccg cccagcggca 9480
acgcatcgag gcagaagcac gccccggtga atcgtggcaa gcggccgctg atcgaatccg 9540
caaagaatcc cggcaaccgc cggcagccgg tgcgccgtcg attaggaagc cgcccaaggg 9600
cgacgagcaa ccagattttt tcgttccgat gctctatgac gtgggcaccc gcgatagtcg 9660
cagcatcatg gacgtggccg ttttccgtct gtcgaagcgt gaccgacgag ctggcgaggt 9720
gatccgctac gagcttccag acgggcacgt agaggtttcc gcagggccgg ccggcatggc 9780
cagtgtgtgg gattacgacc tggtactgat ggcggtttcc catctaaccg aatccatgaa 9840
ccgataccgg gaagggaagg gagacaagcc cggccgcgtg ttccgtccac acgttgcgga 9900
cgtactcaag ttctgccggc gagccgatgg cggaaagcag aaagacgacc tggtagaaac 9960
ctgcattcgg ttaaacacca cgcacgttgc catgcagcgt acgaagaagg ccaagaacgg 10020
ccgcctggtg acggtatccg agggtgaagc cttgattagc cgctacaaga tcgtaaagag 10080
cgaaaccggg cggccggagt acatcgagat cgagctagct gattggatgt accgcgagat 10140
cacagaaggc aagaacccgg acgtgctgac ggttcacccc gattactttt tgatcgatcc 10200
cggcatcggc cgttttctct accgcctggc acgccgcgcc gcaggcaagg cagaagccag 10260
atggttgttc aagacgatct acgaacgcag tggcagcgcc ggagagttca agaagttctg 10320
tttcaccgtg cgcaagctga tcgggtcaaa tgacctgccg gagtacgatt tgaaggagga 10380
ggcggggcag gctggcccga tcctagtcat gcgctaccgc aacctgatcg agggcgaagc 10440
atccgccggt tcctaatgta cggagcagat gctagggcaa attgccctag caggggaaaa 10500
aggtcgaaaa ggtctctttc ctgtggatag cacgtacatt gggaacccaa agccgtacat 10560
tgggaaccgg aacccgtaca ttgggaaccc aaagccgtac attgggaacc ggtcacacat 10620
gtaagtgact gatataaaag agaaaaaagg cgatttttcc gcctaaaact ctttaaaact 10680
tattaaaact cttaaaaccc gcctggcctg tgcataactg tctggccagc gcacagccga 10740
agagctgcaa aaagcgccta cccttcggtc gctgcgctcc ctacgccccg ccgcttcgcg 10800
tcggcctatc gcggccgctg gccgctcaaa aatggctggc ctacggccag gcaatctacc 10860
agggcgcgga caagccgcgc cgtcgccact cgaccgccgg cgcccacatc aaggcaccct 10920
gcctcgcgcg tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg 10980
tcacagcttg tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg 11040
gtgttggcgg gtgtcggggc gcagccatga cccagtcacg tagcgatagc ggagtgtata 11100
ctggcttaac tatgcggcat cagagcagat tgtactgaga gtgcaccata tgcggtgtga 11160
aataccgcac agatgcgtaa ggagaaaata ccgcatcagg cgctcttccg cttcctcgct 11220
cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 11280
ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 11340
ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg 11400
cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg 11460
actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 11520
cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca 11580
tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt 11640
gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc 11700
caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag 11760
agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 11820
tagaaggaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt 11880
tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 11940
gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg 12000
gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatgc attctaggta 12060
ctaaaacaat tcatccagta aaatataata ttttattttc tcccaatcag gcttgatccc 12120
cagtaagtca aaaaatagct cgacatactg ttcttccccg atatcctccc tgatcgaccg 12180
gacgcagaag gcaatgtcat accacttgtc cgccctgccg cttctcccaa gatcaataaa 12240
gccacttact ttgccatctt tcacaaagat gttgctgtct cccaggtcgc cgtgggaaaa 12300
gacaagttcc tcttcgggct tttccgtctt taaaaaatca tacagctcgc gcggatcttt 12360
aaatggagtg tcttcttccc agttttcgca atccacatcg gccagatcgt tattcagtaa 12420
gtaatccaat tcggctaagc ggctgtctaa gctattcgta tagggacaat ccgatatgtc 12480
gatggagtga aagagcctga tgcactccgc atacagctcg ataatctttt cagggctttg 12540
ttcatcttca tactcttccg agcaaaggac gccatcggcc tcactcatga gcagattgct 12600
ccagccatca tgccgttcaa agtgcaggac ctttggaaca ggcagctttc cttccagcca 12660
tagcatcatg tccttttccc gttccacatc ataggtggtc cctttatacc ggctgtccgt 12720
catttttaaa tataggtttt cattttctcc caccagctta tataccttag caggagacat 12780
tccttccgta tcttttacgc agcggtattt ttcgatcagt tttttcaatt ccggtgatat 12840
tctcatttta gccatttatt atttccttcc tcttttctac agtatttaaa gataccccaa 12900
gaagctaatt ataacaagac gaactccaat tcactgttcc ttgcattcta aaaccttaaa 12960
taccagaaaa cagctttttc aaagttgttt tcaaagttgg cgtataacat agtatcgacg 13020
gagccgattt tgaaaccgcg gtgatcacag gcagcaacgc tctgtcatcg ttacaatcaa 13080
catgctaccc tccgcgagat catccgtgtt tcaaacccgg cagcttagtt gccgttcttc 13140
cgaatagcat cggtaacatg agcaaagtct gccgccttac aacggctctc ccgctgacgc 13200
cgtcccggac tgatgggctg cctgtatcga gtggtgattt tgtgccgagc tgccggtcgg 13260
ggagctgttg gctggctggt ggcaggatat attgtggtgt aaacaaattg acgcttagac 13320
aacttaataa cacattgcgg acgtttttaa tgtactgaat taacgccgaa ttaattcggg 13380
ggatctggat tttagtactg gattttggtt ttaggaatta gaaattttat tgatagaagt 13440
attttacaaa tacaaataca tactaagggt ttcttatatg ctcaacacat gagcgaaacc 13500
ctataggaac cctaattccc ttatctggga actactcaca cattattatg gagaaactcg 13560
agcttgtcga tcgacagatc cggtcggcat ctactctatt tctttgccct cggacgagtg 13620
ctggggcgtc ggtttccact atcggcgagt acttctacac agccatcggt ccagacggcc 13680
gcgcttctgc gggcgatttg tgtacgcccg acagtcccgg ctccggatcg gacgattgcg 13740
tcgcatcgac cctgcgccca agctgcatca tcgaaattgc cgtcaaccaa gctctgatag 13800
agttggtcaa gaccaatgcg gagcatatac gcccggagtc gtggcgatcc tgcaagctcc 13860
ggatgcctcc gctcgaagta gcgcgtctgc tgctccatac aagccaacca cggcctccag 13920
aagaagatgt tggcgacctc gtattgggaa tccccgaaca tcgcctcgct ccagtcaatg 13980
accgctgtta tgcggccatt gtccgtcagg acattgttgg agccgaaatc cgcgtgcacg 14040
aggtgccgga cttcggggca gtcctcggcc caaagcatca gctcatcgag agcctgcgcg 14100
acggacgcac tgacggtgtc gtccatcaca gtttgccagt gatacacatg gggatcagca 14160
atcgcgcata tgaaatcacg ccatgtagtg tattgaccga ttccttgcgg tccgaatggg 14220
ccgaacccgc tcgtctggct aagatcggcc gcagcgatcg catccatagc ctccgcgacc 14280
ggttgtagaa cagcgggcag ttcggtttca ggcaggtctt gcaacgtgac accctgtgca 14340
cggcgggaga tgcaataggt caggctctcg ctaaactccc caatgtcaag cacttccgga 14400
atcgggagcg cggccgatgc aaagtgccga taaacataac gatctttgta gaaaccatcg 14460
gcgcagctat ttacccgcag gacatatcca cgccctccta catcgaagct gaaagcacga 14520
gattcttcgc cctccgagag ctgcatcagg tcggagacgc tgtcgaactt ttcgatcaga 14580
aacttctcga cagacgtcgc ggtgagttca ggctttttca tatctcattg ccccccggga 14640
tctgcgaaag ctcgagagag atagatttgt agagagagac tggtgatttc agcgtgtcct 14700
ctccaaatga aatgaacttc cttatataga ggaaggtctt gcgaaggata gtgggattgt 14760
gcgtcatccc ttacgtcagt ggagatatca catcaatcca cttgctttga agacgtggtt 14820
ggaacgtctt ctttttccac gatgctcctc gtgggtgggg gtccatcttt gggaccactg 14880
tcggcagagg catcttgaac gatagccttt cctttatcgc aatgatggca tttgtaggtg 14940
ccaccttcct tttctactgt ccttttgatg aagtgacaga tagctgggca atggaatccg 15000
aggaggtttc ccgatattac cctttgttga aaagtctcaa tagccctttg gtcttctgag 15060
actgtatctt tgatattctt ggagtagacg agagtgtcgt gctccaccat gttatcacat 15120
caatccactt gctttgaaga cgtggttgga acgtcttctt tttccacgat gctcctcgtg 15180
ggtgggggtc catctttggg accactgtcg gcagaggcat cttgaacgat agcctttcct 15240
ttatcgcaat gatggcattt gtaggtgcca ccttcctttt ctactgtcct tttgatgaag 15300
tgacagatag ctgggcaatg gaatccgagg aggtttcccg atattaccct ttgttgaaaa 15360
gtctcaatag ccctttggtc ttctgagact gtatctttga tattcttgga gtagacgaga 15420
gtgtcgtgct ccaccatgtt ggcaagctgc tctagccaat acgcaaaccg cctctccccg 15480
cgcgttggcc gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca 15540
gtgagcgcaa cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact 15600
ttatgcttcc ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa 15660
acagctatga ccatgattac g 15681
<210> 156
<211> 15681
<212> DNA
<213> Artificial Sequence
<220>
<223> vector
<220>
<221> misc_feature
<222> 5964..5984
<223> /note="target sequence"
<220>
<221> misc_feature
<222> 6617..6637
<223> /note="target sequence"
<400> 156
aattcccgat ctagtaacat agatgacacc gcgcgcgata atttatccta gtttgcgcgc 60
tatattttgt tttctatcgc gtattaaatg tataattgcg ggactctaat cataaaaacc 120
catctcataa ataacgtcat gcattacatg ttaattatta catgcttaac gtaattcaac 180
agaaattata tgataatcat cgcaagaccg gcaacaggat tcaatcttaa gaaactttat 240
tgccaaatgt ttgaacgatc ggggaaattc gagctctatc gatcaatcag gatccttact 300
ttttcttttt tgcctggccg gcctttttcg tggccgccgg ccttttgtcg cctcccagct 360
gagacaggtc gatccgtgtc tcgtacaggc cggtgatgct ctggtggatc agggtggcgt 420
ccagcacctc tttggtgctg gtgtacctct tccggtcgat ggtggtgtca aagtacttga 480
aggcggcagg ggctcccaga ttggtcaggg taaacaggtg gatgatattc tcggcctgct 540
ctctgatggg cttatcccgg tgcttgttgt aggcggacag cactttgtcc agattagcgt 600
cggccaggat cactctcttg gagaactcgc tgatctgctc gatgatctcg tccaggtagt 660
gcttgtgctg ttccacaaac agctgtttct gctcattatc ctcgggggag cccttcagct 720
tctcatagtg gctggccagg tacaggaagt tcacatattt ggagggcagg gccagttcgt 780
ttcccttctg cagttcgccg gcagaggcca gcattctctt ccggccgttt tccagctcga 840
acagggagta cttaggcagc ttgatgatca ggtccttttt cacttctttg tagcccttgg 900
cttccagaaa gtcgatggga ttcttctcga agctgcttct ttccatgatg gtgatcccca 960
gcagctcttt cacactcttc agtttcttgg acttgccctt ttccactttg gccaccacca 1020
gcacagaata ggccacggtg gggctgtcga agccgccgta cttcttaggg tcccagtcct 1080
tctttctggc gatcagctta tcgctgttcc tcttgggcag gatagactct ttgctgaagc 1140
cgcctgtctg cacctcggtc tttttcacga tattcacttg gggcatgctc agcactttcc 1200
gcacggtggc aaaatcccgg cccttatccc acacgatctc cccggtttcg ccgtttgtct 1260
cgatcagagg ccgcttccgg atctcgccgt tggccagggt aatctcggtc ttgaaaaagt 1320
tcatgatgtt gctgtagaag aagtacttgg cggtagcctt gccgatttcc tgctcgctct 1380
tggcgatcat cttccgcacg tcgtacacct tgtagtcgcc gtacacgaac tcgctttcca 1440
gcttagggta ctttttgatc agggcggttc ccacgacggc gttcaggtag gcgtcgtggg 1500
cgtggtggta gttgttgatc tcgcgcactt tgtaaaactg gaaatccttc cggaaatcgg 1560
acaccagctt ggacttcagg gtgatcactt tcacttcccg gatcagcttg tcattctcgt 1620
cgtacttagt gttcatccgg gagtccagga tctgtgccac gtgctttgtg atctgccggg 1680
tttccaccag ctgtctcttg atgaagccgg ccttatccag ttcgctcagg ccgcctctct 1740
cggccttggt cagattgtcg aactttctct gggtaatcag cttggcgttc agcagctgcc 1800
gccagtagtt cttcatcttc ttcacgacct cttcggaggg cacgttgtcg ctcttgcccc 1860
ggttcttgtc gcttctggtc agcaccttgt tgtcgatgga gtcgtccttc agaaagctct 1920
gaggcacgat atggtccaca tcgtagtcgg acagccggtt gatgtccagt tcctggtcca 1980
cgtacatatc ccgcccattc tgcaggtagt acaggtacag cttctcgttc tgcagctggg 2040
tgttttccac ggggtgttct ttcaggatct ggctgcccag ctctttgatg ccctcttcga 2100
tccgcttcat tctctcgcgg ctgttcttct gtcccttctg ggtggtctgg ttctctctgg 2160
ccatttcgat cacgatgttc tcgggcttgt gccggcccat cactttcacg agctcgtcca 2220
ccaccttcac tgtctgcagg atgcccttct taatggcggg gctgccggcc agattggcaa 2280
tgtgctcgtg caggctatcg ccctggccgg acacctgggc tttctggatg tcctctttaa 2340
aggtcaggct gtcgtcgtgg atcagctgca tgaagtttct gttggcgaag ccgtcggact 2400
tcaggaaatc caggattgtc ttgccggact gcttgtcccg gatgccgttg atcagcttcc 2460
ggctcagcct gccccagccg gtgtatctcc gccgcttcag ctgcttcatc actttgtcgt 2520
cgaacaggtg ggcataggtt ttcagccgtt cctcgatcat ctctctgtcc tcaaacagtg 2580
tcagggtcag cacgatatct tccagaatgt cctcgttttc ctcattgtcc aggaagtcct 2640
tgtccttgat aattttcagc agatcgtggt atgtgcccag ggaggcgttg aaccgatctt 2700
ccacgccgga gatttccacg gagtcgaagc actcgatttt cttgaagtag tcctctttca 2760
gctgcttcac ggtcactttc cggttggtct tgaacagcag gtccacgatg gcctttttct 2820
gctcgccgct caggaaggcg ggctttctca ttccctcggt cacgtatttc actttggtca 2880
gctcgttata cacggtgaag tactcgtaca gcaggctgtg cttgggcagc accttctcgt 2940
tgggcaggtt cttatcgaag ttggtcatcc gctcgatgaa gctctgggcg gaagcgccct 3000
tgtccaccac ttcctcgaag ttccaggggg tgatggtttc ctcgctcttt ctggtcatcc 3060
aggcgaatct gctgtttccc ctggccagag ggcccacgta gtaggggatg cggaaggtca 3120
ggatcttctc gatcttttcc cggttgtcct tcaggaatgg gtaaaaatct tcctgccgcc 3180
gcagaatggc gtgcagctct cccaggtgga tctggtgggg gatgctgccg ttgtcgaagg 3240
tccgctgctt ccgcagcagg tcctctctgt tcagcttcac gagcagttcc tcggtgccgt 3300
ccatcttttc caggatgggc ttgatgaact tgtagaactc ttcctggctg gctccgccgt 3360
caatgtagcc ggcgtagccg ttcttgctct ggtcgaagaa aatctctttg tacttctcag 3420
gcagctgctg ccgcacgaga gctttcagca gggtcaggtc ctggtggtgc tcgtcgtatc 3480
tcttgatcat agaggcgctc aggggggcct tggtgatctc ggtgttcact ctcaggatgt 3540
cgctcagcag gatggcgtcg gacaggttct tggcggccag aaacaggtcg gcgtactggt 3600
cgccgatctg ggccagcagg ttgtccaggt cgtcgtcgta ggtgtccttg ctcagctgca 3660
gtttggcatc ctcggccagg tcgaagttgc tcttgaagtt gggggtcagg cccaggctca 3720
gggcaatcag gtttccgaac aggccattct tcttctcgcc gggcagctgg gcgatcagat 3780
tttccagccg tctgctcttg ctcagtctgg cagacaggat ggccttggcg tccacgccgc 3840
tggcgttgat ggggttttcc tcgaacagct ggttgtaggt ctgcaccagc tggatgaaca 3900
gcttgtccac gtcgctgttg tcggggttca ggtcgccctc gatcaggaag tggccccgga 3960
acttgatcat gtgggccagg gccagataga tcagccgcag gtcggccttg tcggtgctgt 4020
ccaccagttt ctttctcagg tggtagatgg tggggtactt ctcgtggtag gccacctcgt 4080
ccacgatgtt gccgaagatg gggtgccgct cgtgcttctt atcctcttcc accaggaagg 4140
actcttccag tctgtggaag aagctgtcgt ccaccttggc catctcgttg ctgaagatct 4200
cttgcagata gcagatccgg ttcttccgtc tggtgtatct tcttctggcg gttctcttca 4260
gccgggtggc ctcggctgtt tcgccgctgt cgaacagcag ggctccgatc aggttcttct 4320
tgatgctgtg ccggtcggtg ttgcccagca ccttgaattt cttgctgggc accttgtact 4380
cgtcggtgat cacggcccag cccacagagt tggtgccgat gtccaggccg atgctgtact 4440
tcttgtcggc tgctgggact ccgtggatac cgaccttccg cttcttcttt ggggccatct 4500
tatcgtcatc gtctttgtaa tcaatatcat gatccttgta gtctccgtcg tggtccttat 4560
agtccatctc gagtatcgtt cgtaaatggt gaaaattttc agaaaattgc ttttgcttta 4620
aaagaaatga tttaaattgc tgcaatagaa gtagaatgct tgattgcttg agattcgttt 4680
gttttgtata tgttgtgttg aggtcgaggt cctctccaaa tgaaatgaac ttccttatat 4740
agaggaaggg tcttgcgaag gatagtggga ttgtgcgtca tcccttacgt cagtggagat 4800
atcacatcaa tccacttgct ttgaagacgt ggttggaacg tcttcttttt ccacgatgct 4860
cctcgtgggt gggggtccat ctttgggacc actgtcggca gaggcatctt caacgatggc 4920
ctttccttta tcgcaatgat ggcatttgta ggagccacct tccttttcca ctatcttcac 4980
aataaagtga cagatagctg ggcaatggaa tccgaggagg tttccggata tcaccctttg 5040
ttgaaaagtc tcaattgccc tttggtcttc tgagactgta tctttgatat ttttggagta 5100
gacaagtgtg tcgtgctcca ccatgttatc acatcaatcc acttgctttg aagacgtggt 5160
tggaacgtct tctttttcca cgatgctcct cgtgggtggg ggtccatctt tgggaccact 5220
gtcggcagag gcatcttcaa cgatggcctt tcctttatcg caatgatggc atttgtagga 5280
gccaccttcc ttttccacta tcttcacaat aaagtgacag atagctgggc aatggaatcc 5340
gaggaggttt ccggatatta ccctttgttg aaaagtctca attgcccttt ggtcttctga 5400
gactgtatct ttgatatttt tggagtagac aagtgtgtcg tgctccacca tgttgacctg 5460
caggcatgcc tcggatccac tagtaacggc cgccagtgtg ctggaattgc ccttaagctt 5520
cgttgaacaa cggaaactcg acttgccttc cgcacaatac atcatttctt cttagctttt 5580
tttcttcttc ttcgttcata cagttttttt ttgtttatca gcttacattt tcttgaaccg 5640
tagctttcgt tttcttcttt ttaactttcc attcggagtt tttgtatctt gtttcatagt 5700
ttgtcccagg attagaatga ttaggcatcg aaccttcaag aatttgattg aataaaacat 5760
cttcattctt aagatatgaa gataatcttc aaaaggcccc tgggaatctg aaagaagaga 5820
agcaggccca tttatatggg aaagaacaat agtatttctt atataggccc atttaagttg 5880
aaaacaatct tcaaaagtcc cacatcgctt agataagaaa acgaagctga gtttatatac 5940
agctagagtc gaagtagtga ttgcggagac tcgtctacag ttgttttaga gctagaaata 6000
gcaagttaaa ataaggctag tccgttatca acttgaaaaa gtggcaccga gtcggtgctt 6060
tttttgtccc ttcgaagggc ctttctcaga tatccatcac actggcggcc gctcgaggtc 6120
gctcggatcc actagtaacg gccgccagtg tgctggaatt gcccttaagc ttcgttgaac 6180
aacggaaact cgacttgcct tccgcacaat acatcatttc ttcttagctt tttttcttct 6240
tcttcgttca tacagttttt ttttgtttat cagcttacat tttcttgaac cgtagctttc 6300
gttttcttct ttttaacttt ccattcggag tttttgtatc ttgtttcata gtttgtccca 6360
ggattagaat gattaggcat cgaaccttca agaatttgat tgaataaaac atcttcattc 6420
ttaagatatg aagataatct tcaaaaggcc cctgggaatc tgaaagaaga gaagcaggcc 6480
catttatatg ggaaagaaca atagtatttc ttatataggc ccatttaagt tgaaaacaat 6540
cttcaaaagt cccacatcgc ttagataaga aaacgaagct gagtttatat acagctagag 6600
tcgaagtagt gattatgtgt tacagcacgt cggggtttta gagctagaaa tagcaagtta 6660
aaataaggct agtccgttat caacttgaaa aagtggcacc gagtcggtgc tttttttgtc 6720
ccttcgaagg gcctttctca gatatccatc acactggcgg ccgctcgagg tcgaagcttg 6780
gcactggccg tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat 6840
cgccttgcag cacatccccc tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat 6900
cgcccttccc aacagttgcg cagcctgaat ggcgaatgct agagcagctt gagcttggat 6960
cagattgtcg tttcccgcct tcagtttaaa ctatcagtgt ttgacaggat atattggcgg 7020
gtaaacctaa gagaaaagag cgtttattag aataacggat atttaaaagg gcgtgaaaag 7080
gtttatccgt tcgtccattt gtatgtgcat gccaaccaca gggttcccct cgggatcaaa 7140
gtactttgat ccaacccctc cgctgctata gtgcagtcgg cttctgacgt tcagtgcagc 7200
cgtcttctga aaacgacatg tcgcacaagt cctaagttac gcgacaggct gccgccctgc 7260
ccttttcctg gcgttttctt gtcgcgtgtt ttagtcgcat aaagtagaat acttgcgact 7320
agaaccggag acattacgcc atgaacaaga gcgccgccgc tggcctgctg ggctatgccc 7380
gcgtcagcac cgacgaccag gacttgacca accaacgggc cgaactgcac gcggccggct 7440
gcaccaagct gttttccgag aagatcaccg gcaccaggcg cgaccgcccg gagctggcca 7500
ggatgcttga ccacctacgc cctggcgacg ttgtgacagt gaccaggcta gaccgcctgg 7560
cccgcagcac ccgcgaccta ctggacattg ccgagcgcat ccaggaggcc ggcgcgggcc 7620
tgcgtagcct ggcagagccg tgggccgaca ccaccacgcc ggccggccgc atggtgttga 7680
ccgtgttcgc cggcattgcc gagttcgagc gttccctaat catcgaccgc acccggagcg 7740
ggcgcgaggc cgccaaggcc cgaggcgtga agtttggccc ccgccctacc ctcaccccgg 7800
cacagatcgc gcacgcccgc gagctgatcg accaggaagg ccgcaccgtg aaagaggcgg 7860
ctgcactgct tggcgtgcat cgctcgaccc tgtaccgcgc acttgagcgc agcgaggaag 7920
tgacgcccac cgaggccagg cggcgcggtg ccttccgtga ggacgcattg accgaggccg 7980
acgccctggc ggccgccgag aatgaacgcc aagaggaaca agcatgaaac cgcaccagga 8040
cggccaggac gaaccgtttt tcattaccga agagatcgag gcggagatga tcgcggccgg 8100
gtacgtgttc gagccgcccg cgcacgtctc aaccgtgcgg ctgcatgaaa tcctggccgg 8160
tttgtctgat gccaagctgg cggcctggcc ggccagcttg gccgctgaag aaaccgagcg 8220
ccgccgtcta aaaaggtgat gtgtatttga gtaaaacagc ttgcgtcatg cggtcgctgc 8280
gtatatgatg cgatgagtaa ataaacaaat acgcaagggg aacgcatgaa ggttatcgct 8340
gtacttaacc agaaaggcgg gtcaggcaag acgaccatcg caacccatct agcccgcgcc 8400
ctgcaactcg ccggggccga tgttctgtta gtcgattccg atccccaggg cagtgcccgc 8460
gattgggcgg ccgtgcggga agatcaaccg ctaaccgttg tcggcatcga ccgcccgacg 8520
attgaccgcg acgtgaaggc catcggccgg cgcgacttcg tagtgatcga cggagcgccc 8580
caggcggcgg acttggctgt gtccgcgatc aaggcagccg acttcgtgct gattccggtg 8640
cagccaagcc cttacgacat atgggccacc gccgacctgg tggagctggt taagcagcgc 8700
attgaggtca cggatggaag gctacaagcg gcctttgtcg tgtcgcgggc gatcaaaggc 8760
acgcgcatcg gcggtgaggt tgccgaggcg ctggccgggt acgagctgcc cattcttgag 8820
tcccgtatca cgcagcgcgt gagctaccca ggcactgccg ccgccggcac aaccgttctt 8880
gaatcagaac ccgagggcga cgctgcccgc gaggtccagg cgctggccgc tgaaattaaa 8940
tcaaaactca tttgagttaa tgaggtaaag agaaaatgag caaaagcaca aacacgctaa 9000
gtgccggccg tccgagcgca cgcagcagca aggctgcaac gttggccagc ctggcagaca 9060
cgccagccat gaagcgggtc aactttcagt tgccggcgga ggatcacacc aagctgaaga 9120
tgtacgcggt acgccaaggc aagaccatta ccgagctgct atctgaatac atcgcgcagc 9180
taccagagta aatgagcaaa tgaataaatg agtagatgaa ttttagcggc taaaggaggc 9240
ggcatggaaa atcaagaaca accaggcacc gacgccgtgg aatgccccat gtgtggagga 9300
acgggcggtt ggccaggcgt aagcggctgg gttgtctgcc ggccctgcaa tggcactgga 9360
acccccaagc ccgaggaatc ggcgtgacgg tcgcaaacca tccggcccgg tacaaatcgg 9420
cgcggcgctg ggtgatgacc tggtggagaa gttgaaggcc gcgcaggccg cccagcggca 9480
acgcatcgag gcagaagcac gccccggtga atcgtggcaa gcggccgctg atcgaatccg 9540
caaagaatcc cggcaaccgc cggcagccgg tgcgccgtcg attaggaagc cgcccaaggg 9600
cgacgagcaa ccagattttt tcgttccgat gctctatgac gtgggcaccc gcgatagtcg 9660
cagcatcatg gacgtggccg ttttccgtct gtcgaagcgt gaccgacgag ctggcgaggt 9720
gatccgctac gagcttccag acgggcacgt agaggtttcc gcagggccgg ccggcatggc 9780
cagtgtgtgg gattacgacc tggtactgat ggcggtttcc catctaaccg aatccatgaa 9840
ccgataccgg gaagggaagg gagacaagcc cggccgcgtg ttccgtccac acgttgcgga 9900
cgtactcaag ttctgccggc gagccgatgg cggaaagcag aaagacgacc tggtagaaac 9960
ctgcattcgg ttaaacacca cgcacgttgc catgcagcgt acgaagaagg ccaagaacgg 10020
ccgcctggtg acggtatccg agggtgaagc cttgattagc cgctacaaga tcgtaaagag 10080
cgaaaccggg cggccggagt acatcgagat cgagctagct gattggatgt accgcgagat 10140
cacagaaggc aagaacccgg acgtgctgac ggttcacccc gattactttt tgatcgatcc 10200
cggcatcggc cgttttctct accgcctggc acgccgcgcc gcaggcaagg cagaagccag 10260
atggttgttc aagacgatct acgaacgcag tggcagcgcc ggagagttca agaagttctg 10320
tttcaccgtg cgcaagctga tcgggtcaaa tgacctgccg gagtacgatt tgaaggagga 10380
ggcggggcag gctggcccga tcctagtcat gcgctaccgc aacctgatcg agggcgaagc 10440
atccgccggt tcctaatgta cggagcagat gctagggcaa attgccctag caggggaaaa 10500
aggtcgaaaa ggtctctttc ctgtggatag cacgtacatt gggaacccaa agccgtacat 10560
tgggaaccgg aacccgtaca ttgggaaccc aaagccgtac attgggaacc ggtcacacat 10620
gtaagtgact gatataaaag agaaaaaagg cgatttttcc gcctaaaact ctttaaaact 10680
tattaaaact cttaaaaccc gcctggcctg tgcataactg tctggccagc gcacagccga 10740
agagctgcaa aaagcgccta cccttcggtc gctgcgctcc ctacgccccg ccgcttcgcg 10800
tcggcctatc gcggccgctg gccgctcaaa aatggctggc ctacggccag gcaatctacc 10860
agggcgcgga caagccgcgc cgtcgccact cgaccgccgg cgcccacatc aaggcaccct 10920
gcctcgcgcg tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg 10980
tcacagcttg tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg 11040
gtgttggcgg gtgtcggggc gcagccatga cccagtcacg tagcgatagc ggagtgtata 11100
ctggcttaac tatgcggcat cagagcagat tgtactgaga gtgcaccata tgcggtgtga 11160
aataccgcac agatgcgtaa ggagaaaata ccgcatcagg cgctcttccg cttcctcgct 11220
cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 11280
ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 11340
ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg 11400
cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg 11460
actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 11520
cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca 11580
tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt 11640
gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc 11700
caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag 11760
agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 11820
tagaaggaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt 11880
tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 11940
gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg 12000
gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatgc attctaggta 12060
ctaaaacaat tcatccagta aaatataata ttttattttc tcccaatcag gcttgatccc 12120
cagtaagtca aaaaatagct cgacatactg ttcttccccg atatcctccc tgatcgaccg 12180
gacgcagaag gcaatgtcat accacttgtc cgccctgccg cttctcccaa gatcaataaa 12240
gccacttact ttgccatctt tcacaaagat gttgctgtct cccaggtcgc cgtgggaaaa 12300
gacaagttcc tcttcgggct tttccgtctt taaaaaatca tacagctcgc gcggatcttt 12360
aaatggagtg tcttcttccc agttttcgca atccacatcg gccagatcgt tattcagtaa 12420
gtaatccaat tcggctaagc ggctgtctaa gctattcgta tagggacaat ccgatatgtc 12480
gatggagtga aagagcctga tgcactccgc atacagctcg ataatctttt cagggctttg 12540
ttcatcttca tactcttccg agcaaaggac gccatcggcc tcactcatga gcagattgct 12600
ccagccatca tgccgttcaa agtgcaggac ctttggaaca ggcagctttc cttccagcca 12660
tagcatcatg tccttttccc gttccacatc ataggtggtc cctttatacc ggctgtccgt 12720
catttttaaa tataggtttt cattttctcc caccagctta tataccttag caggagacat 12780
tccttccgta tcttttacgc agcggtattt ttcgatcagt tttttcaatt ccggtgatat 12840
tctcatttta gccatttatt atttccttcc tcttttctac agtatttaaa gataccccaa 12900
gaagctaatt ataacaagac gaactccaat tcactgttcc ttgcattcta aaaccttaaa 12960
taccagaaaa cagctttttc aaagttgttt tcaaagttgg cgtataacat agtatcgacg 13020
gagccgattt tgaaaccgcg gtgatcacag gcagcaacgc tctgtcatcg ttacaatcaa 13080
catgctaccc tccgcgagat catccgtgtt tcaaacccgg cagcttagtt gccgttcttc 13140
cgaatagcat cggtaacatg agcaaagtct gccgccttac aacggctctc ccgctgacgc 13200
cgtcccggac tgatgggctg cctgtatcga gtggtgattt tgtgccgagc tgccggtcgg 13260
ggagctgttg gctggctggt ggcaggatat attgtggtgt aaacaaattg acgcttagac 13320
aacttaataa cacattgcgg acgtttttaa tgtactgaat taacgccgaa ttaattcggg 13380
ggatctggat tttagtactg gattttggtt ttaggaatta gaaattttat tgatagaagt 13440
attttacaaa tacaaataca tactaagggt ttcttatatg ctcaacacat gagcgaaacc 13500
ctataggaac cctaattccc ttatctggga actactcaca cattattatg gagaaactcg 13560
agcttgtcga tcgacagatc cggtcggcat ctactctatt tctttgccct cggacgagtg 13620
ctggggcgtc ggtttccact atcggcgagt acttctacac agccatcggt ccagacggcc 13680
gcgcttctgc gggcgatttg tgtacgcccg acagtcccgg ctccggatcg gacgattgcg 13740
tcgcatcgac cctgcgccca agctgcatca tcgaaattgc cgtcaaccaa gctctgatag 13800
agttggtcaa gaccaatgcg gagcatatac gcccggagtc gtggcgatcc tgcaagctcc 13860
ggatgcctcc gctcgaagta gcgcgtctgc tgctccatac aagccaacca cggcctccag 13920
aagaagatgt tggcgacctc gtattgggaa tccccgaaca tcgcctcgct ccagtcaatg 13980
accgctgtta tgcggccatt gtccgtcagg acattgttgg agccgaaatc cgcgtgcacg 14040
aggtgccgga cttcggggca gtcctcggcc caaagcatca gctcatcgag agcctgcgcg 14100
acggacgcac tgacggtgtc gtccatcaca gtttgccagt gatacacatg gggatcagca 14160
atcgcgcata tgaaatcacg ccatgtagtg tattgaccga ttccttgcgg tccgaatggg 14220
ccgaacccgc tcgtctggct aagatcggcc gcagcgatcg catccatagc ctccgcgacc 14280
ggttgtagaa cagcgggcag ttcggtttca ggcaggtctt gcaacgtgac accctgtgca 14340
cggcgggaga tgcaataggt caggctctcg ctaaactccc caatgtcaag cacttccgga 14400
atcgggagcg cggccgatgc aaagtgccga taaacataac gatctttgta gaaaccatcg 14460
gcgcagctat ttacccgcag gacatatcca cgccctccta catcgaagct gaaagcacga 14520
gattcttcgc cctccgagag ctgcatcagg tcggagacgc tgtcgaactt ttcgatcaga 14580
aacttctcga cagacgtcgc ggtgagttca ggctttttca tatctcattg ccccccggga 14640
tctgcgaaag ctcgagagag atagatttgt agagagagac tggtgatttc agcgtgtcct 14700
ctccaaatga aatgaacttc cttatataga ggaaggtctt gcgaaggata gtgggattgt 14760
gcgtcatccc ttacgtcagt ggagatatca catcaatcca cttgctttga agacgtggtt 14820
ggaacgtctt ctttttccac gatgctcctc gtgggtgggg gtccatcttt gggaccactg 14880
tcggcagagg catcttgaac gatagccttt cctttatcgc aatgatggca tttgtaggtg 14940
ccaccttcct tttctactgt ccttttgatg aagtgacaga tagctgggca atggaatccg 15000
aggaggtttc ccgatattac cctttgttga aaagtctcaa tagccctttg gtcttctgag 15060
actgtatctt tgatattctt ggagtagacg agagtgtcgt gctccaccat gttatcacat 15120
caatccactt gctttgaaga cgtggttgga acgtcttctt tttccacgat gctcctcgtg 15180
ggtgggggtc catctttggg accactgtcg gcagaggcat cttgaacgat agcctttcct 15240
ttatcgcaat gatggcattt gtaggtgcca ccttcctttt ctactgtcct tttgatgaag 15300
tgacagatag ctgggcaatg gaatccgagg aggtttcccg atattaccct ttgttgaaaa 15360
gtctcaatag ccctttggtc ttctgagact gtatctttga tattcttgga gtagacgaga 15420
gtgtcgtgct ccaccatgtt ggcaagctgc tctagccaat acgcaaaccg cctctccccg 15480
cgcgttggcc gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca 15540
gtgagcgcaa cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact 15600
ttatgcttcc ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa 15660
acagctatga ccatgattac g 15681
<210> 157
<211> 15681
<212> DNA
<213> Artificial Sequence
<220>
<223> vector
<220>
<221> misc_feature
<222> 5964..5984
<223> /note="target sequence"
<220>
<221> misc_feature
<222> 6617..6637
<223> /note="target sequence"
<400> 157
aattcccgat ctagtaacat agatgacacc gcgcgcgata atttatccta gtttgcgcgc 60
tatattttgt tttctatcgc gtattaaatg tataattgcg ggactctaat cataaaaacc 120
catctcataa ataacgtcat gcattacatg ttaattatta catgcttaac gtaattcaac 180
agaaattata tgataatcat cgcaagaccg gcaacaggat tcaatcttaa gaaactttat 240
tgccaaatgt ttgaacgatc ggggaaattc gagctctatc gatcaatcag gatccttact 300
ttttcttttt tgcctggccg gcctttttcg tggccgccgg ccttttgtcg cctcccagct 360
gagacaggtc gatccgtgtc tcgtacaggc cggtgatgct ctggtggatc agggtggcgt 420
ccagcacctc tttggtgctg gtgtacctct tccggtcgat ggtggtgtca aagtacttga 480
aggcggcagg ggctcccaga ttggtcaggg taaacaggtg gatgatattc tcggcctgct 540
ctctgatggg cttatcccgg tgcttgttgt aggcggacag cactttgtcc agattagcgt 600
cggccaggat cactctcttg gagaactcgc tgatctgctc gatgatctcg tccaggtagt 660
gcttgtgctg ttccacaaac agctgtttct gctcattatc ctcgggggag cccttcagct 720
tctcatagtg gctggccagg tacaggaagt tcacatattt ggagggcagg gccagttcgt 780
ttcccttctg cagttcgccg gcagaggcca gcattctctt ccggccgttt tccagctcga 840
acagggagta cttaggcagc ttgatgatca ggtccttttt cacttctttg tagcccttgg 900
cttccagaaa gtcgatggga ttcttctcga agctgcttct ttccatgatg gtgatcccca 960
gcagctcttt cacactcttc agtttcttgg acttgccctt ttccactttg gccaccacca 1020
gcacagaata ggccacggtg gggctgtcga agccgccgta cttcttaggg tcccagtcct 1080
tctttctggc gatcagctta tcgctgttcc tcttgggcag gatagactct ttgctgaagc 1140
cgcctgtctg cacctcggtc tttttcacga tattcacttg gggcatgctc agcactttcc 1200
gcacggtggc aaaatcccgg cccttatccc acacgatctc cccggtttcg ccgtttgtct 1260
cgatcagagg ccgcttccgg atctcgccgt tggccagggt aatctcggtc ttgaaaaagt 1320
tcatgatgtt gctgtagaag aagtacttgg cggtagcctt gccgatttcc tgctcgctct 1380
tggcgatcat cttccgcacg tcgtacacct tgtagtcgcc gtacacgaac tcgctttcca 1440
gcttagggta ctttttgatc agggcggttc ccacgacggc gttcaggtag gcgtcgtggg 1500
cgtggtggta gttgttgatc tcgcgcactt tgtaaaactg gaaatccttc cggaaatcgg 1560
acaccagctt ggacttcagg gtgatcactt tcacttcccg gatcagcttg tcattctcgt 1620
cgtacttagt gttcatccgg gagtccagga tctgtgccac gtgctttgtg atctgccggg 1680
tttccaccag ctgtctcttg atgaagccgg ccttatccag ttcgctcagg ccgcctctct 1740
cggccttggt cagattgtcg aactttctct gggtaatcag cttggcgttc agcagctgcc 1800
gccagtagtt cttcatcttc ttcacgacct cttcggaggg cacgttgtcg ctcttgcccc 1860
ggttcttgtc gcttctggtc agcaccttgt tgtcgatgga gtcgtccttc agaaagctct 1920
gaggcacgat atggtccaca tcgtagtcgg acagccggtt gatgtccagt tcctggtcca 1980
cgtacatatc ccgcccattc tgcaggtagt acaggtacag cttctcgttc tgcagctggg 2040
tgttttccac ggggtgttct ttcaggatct ggctgcccag ctctttgatg ccctcttcga 2100
tccgcttcat tctctcgcgg ctgttcttct gtcccttctg ggtggtctgg ttctctctgg 2160
ccatttcgat cacgatgttc tcgggcttgt gccggcccat cactttcacg agctcgtcca 2220
ccaccttcac tgtctgcagg atgcccttct taatggcggg gctgccggcc agattggcaa 2280
tgtgctcgtg caggctatcg ccctggccgg acacctgggc tttctggatg tcctctttaa 2340
aggtcaggct gtcgtcgtgg atcagctgca tgaagtttct gttggcgaag ccgtcggact 2400
tcaggaaatc caggattgtc ttgccggact gcttgtcccg gatgccgttg atcagcttcc 2460
ggctcagcct gccccagccg gtgtatctcc gccgcttcag ctgcttcatc actttgtcgt 2520
cgaacaggtg ggcataggtt ttcagccgtt cctcgatcat ctctctgtcc tcaaacagtg 2580
tcagggtcag cacgatatct tccagaatgt cctcgttttc ctcattgtcc aggaagtcct 2640
tgtccttgat aattttcagc agatcgtggt atgtgcccag ggaggcgttg aaccgatctt 2700
ccacgccgga gatttccacg gagtcgaagc actcgatttt cttgaagtag tcctctttca 2760
gctgcttcac ggtcactttc cggttggtct tgaacagcag gtccacgatg gcctttttct 2820
gctcgccgct caggaaggcg ggctttctca ttccctcggt cacgtatttc actttggtca 2880
gctcgttata cacggtgaag tactcgtaca gcaggctgtg cttgggcagc accttctcgt 2940
tgggcaggtt cttatcgaag ttggtcatcc gctcgatgaa gctctgggcg gaagcgccct 3000
tgtccaccac ttcctcgaag ttccaggggg tgatggtttc ctcgctcttt ctggtcatcc 3060
aggcgaatct gctgtttccc ctggccagag ggcccacgta gtaggggatg cggaaggtca 3120
ggatcttctc gatcttttcc cggttgtcct tcaggaatgg gtaaaaatct tcctgccgcc 3180
gcagaatggc gtgcagctct cccaggtgga tctggtgggg gatgctgccg ttgtcgaagg 3240
tccgctgctt ccgcagcagg tcctctctgt tcagcttcac gagcagttcc tcggtgccgt 3300
ccatcttttc caggatgggc ttgatgaact tgtagaactc ttcctggctg gctccgccgt 3360
caatgtagcc ggcgtagccg ttcttgctct ggtcgaagaa aatctctttg tacttctcag 3420
gcagctgctg ccgcacgaga gctttcagca gggtcaggtc ctggtggtgc tcgtcgtatc 3480
tcttgatcat agaggcgctc aggggggcct tggtgatctc ggtgttcact ctcaggatgt 3540
cgctcagcag gatggcgtcg gacaggttct tggcggccag aaacaggtcg gcgtactggt 3600
cgccgatctg ggccagcagg ttgtccaggt cgtcgtcgta ggtgtccttg ctcagctgca 3660
gtttggcatc ctcggccagg tcgaagttgc tcttgaagtt gggggtcagg cccaggctca 3720
gggcaatcag gtttccgaac aggccattct tcttctcgcc gggcagctgg gcgatcagat 3780
tttccagccg tctgctcttg ctcagtctgg cagacaggat ggccttggcg tccacgccgc 3840
tggcgttgat ggggttttcc tcgaacagct ggttgtaggt ctgcaccagc tggatgaaca 3900
gcttgtccac gtcgctgttg tcggggttca ggtcgccctc gatcaggaag tggccccgga 3960
acttgatcat gtgggccagg gccagataga tcagccgcag gtcggccttg tcggtgctgt 4020
ccaccagttt ctttctcagg tggtagatgg tggggtactt ctcgtggtag gccacctcgt 4080
ccacgatgtt gccgaagatg gggtgccgct cgtgcttctt atcctcttcc accaggaagg 4140
actcttccag tctgtggaag aagctgtcgt ccaccttggc catctcgttg ctgaagatct 4200
cttgcagata gcagatccgg ttcttccgtc tggtgtatct tcttctggcg gttctcttca 4260
gccgggtggc ctcggctgtt tcgccgctgt cgaacagcag ggctccgatc aggttcttct 4320
tgatgctgtg ccggtcggtg ttgcccagca ccttgaattt cttgctgggc accttgtact 4380
cgtcggtgat cacggcccag cccacagagt tggtgccgat gtccaggccg atgctgtact 4440
tcttgtcggc tgctgggact ccgtggatac cgaccttccg cttcttcttt ggggccatct 4500
tatcgtcatc gtctttgtaa tcaatatcat gatccttgta gtctccgtcg tggtccttat 4560
agtccatctc gagtatcgtt cgtaaatggt gaaaattttc agaaaattgc ttttgcttta 4620
aaagaaatga tttaaattgc tgcaatagaa gtagaatgct tgattgcttg agattcgttt 4680
gttttgtata tgttgtgttg aggtcgaggt cctctccaaa tgaaatgaac ttccttatat 4740
agaggaaggg tcttgcgaag gatagtggga ttgtgcgtca tcccttacgt cagtggagat 4800
atcacatcaa tccacttgct ttgaagacgt ggttggaacg tcttcttttt ccacgatgct 4860
cctcgtgggt gggggtccat ctttgggacc actgtcggca gaggcatctt caacgatggc 4920
ctttccttta tcgcaatgat ggcatttgta ggagccacct tccttttcca ctatcttcac 4980
aataaagtga cagatagctg ggcaatggaa tccgaggagg tttccggata tcaccctttg 5040
ttgaaaagtc tcaattgccc tttggtcttc tgagactgta tctttgatat ttttggagta 5100
gacaagtgtg tcgtgctcca ccatgttatc acatcaatcc acttgctttg aagacgtggt 5160
tggaacgtct tctttttcca cgatgctcct cgtgggtggg ggtccatctt tgggaccact 5220
gtcggcagag gcatcttcaa cgatggcctt tcctttatcg caatgatggc atttgtagga 5280
gccaccttcc ttttccacta tcttcacaat aaagtgacag atagctgggc aatggaatcc 5340
gaggaggttt ccggatatta ccctttgttg aaaagtctca attgcccttt ggtcttctga 5400
gactgtatct ttgatatttt tggagtagac aagtgtgtcg tgctccacca tgttgacctg 5460
caggcatgcc tcggatccac tagtaacggc cgccagtgtg ctggaattgc ccttaagctt 5520
cgttgaacaa cggaaactcg acttgccttc cgcacaatac atcatttctt cttagctttt 5580
tttcttcttc ttcgttcata cagttttttt ttgtttatca gcttacattt tcttgaaccg 5640
tagctttcgt tttcttcttt ttaactttcc attcggagtt tttgtatctt gtttcatagt 5700
ttgtcccagg attagaatga ttaggcatcg aaccttcaag aatttgattg aataaaacat 5760
cttcattctt aagatatgaa gataatcttc aaaaggcccc tgggaatctg aaagaagaga 5820
agcaggccca tttatatggg aaagaacaat agtatttctt atataggccc atttaagttg 5880
aaaacaatct tcaaaagtcc cacatcgctt agataagaaa acgaagctga gtttatatac 5940
agctagagtc gaagtagtga ttgcggagac tcgtctacag ttgttttaga gctagaaata 6000
gcaagttaaa ataaggctag tccgttatca acttgaaaaa gtggcaccga gtcggtgctt 6060
tttttgtccc ttcgaagggc ctttctcaga tatccatcac actggcggcc gctcgaggtc 6120
gctcggatcc actagtaacg gccgccagtg tgctggaatt gcccttaagc ttcgttgaac 6180
aacggaaact cgacttgcct tccgcacaat acatcatttc ttcttagctt tttttcttct 6240
tcttcgttca tacagttttt ttttgtttat cagcttacat tttcttgaac cgtagctttc 6300
gttttcttct ttttaacttt ccattcggag tttttgtatc ttgtttcata gtttgtccca 6360
ggattagaat gattaggcat cgaaccttca agaatttgat tgaataaaac atcttcattc 6420
ttaagatatg aagataatct tcaaaaggcc cctgggaatc tgaaagaaga gaagcaggcc 6480
catttatatg ggaaagaaca atagtatttc ttatataggc ccatttaagt tgaaaacaat 6540
cttcaaaagt cccacatcgc ttagataaga aaacgaagct gagtttatat acagctagag 6600
tcgaagtagt gattttggtc tacggagcga tggtgtttta gagctagaaa tagcaagtta 6660
aaataaggct agtccgttat caacttgaaa aagtggcacc gagtcggtgc tttttttgtc 6720
ccttcgaagg gcctttctca gatatccatc acactggcgg ccgctcgagg tcgaagcttg 6780
gcactggccg tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat 6840
cgccttgcag cacatccccc tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat 6900
cgcccttccc aacagttgcg cagcctgaat ggcgaatgct agagcagctt gagcttggat 6960
cagattgtcg tttcccgcct tcagtttaaa ctatcagtgt ttgacaggat atattggcgg 7020
gtaaacctaa gagaaaagag cgtttattag aataacggat atttaaaagg gcgtgaaaag 7080
gtttatccgt tcgtccattt gtatgtgcat gccaaccaca gggttcccct cgggatcaaa 7140
gtactttgat ccaacccctc cgctgctata gtgcagtcgg cttctgacgt tcagtgcagc 7200
cgtcttctga aaacgacatg tcgcacaagt cctaagttac gcgacaggct gccgccctgc 7260
ccttttcctg gcgttttctt gtcgcgtgtt ttagtcgcat aaagtagaat acttgcgact 7320
agaaccggag acattacgcc atgaacaaga gcgccgccgc tggcctgctg ggctatgccc 7380
gcgtcagcac cgacgaccag gacttgacca accaacgggc cgaactgcac gcggccggct 7440
gcaccaagct gttttccgag aagatcaccg gcaccaggcg cgaccgcccg gagctggcca 7500
ggatgcttga ccacctacgc cctggcgacg ttgtgacagt gaccaggcta gaccgcctgg 7560
cccgcagcac ccgcgaccta ctggacattg ccgagcgcat ccaggaggcc ggcgcgggcc 7620
tgcgtagcct ggcagagccg tgggccgaca ccaccacgcc ggccggccgc atggtgttga 7680
ccgtgttcgc cggcattgcc gagttcgagc gttccctaat catcgaccgc acccggagcg 7740
ggcgcgaggc cgccaaggcc cgaggcgtga agtttggccc ccgccctacc ctcaccccgg 7800
cacagatcgc gcacgcccgc gagctgatcg accaggaagg ccgcaccgtg aaagaggcgg 7860
ctgcactgct tggcgtgcat cgctcgaccc tgtaccgcgc acttgagcgc agcgaggaag 7920
tgacgcccac cgaggccagg cggcgcggtg ccttccgtga ggacgcattg accgaggccg 7980
acgccctggc ggccgccgag aatgaacgcc aagaggaaca agcatgaaac cgcaccagga 8040
cggccaggac gaaccgtttt tcattaccga agagatcgag gcggagatga tcgcggccgg 8100
gtacgtgttc gagccgcccg cgcacgtctc aaccgtgcgg ctgcatgaaa tcctggccgg 8160
tttgtctgat gccaagctgg cggcctggcc ggccagcttg gccgctgaag aaaccgagcg 8220
ccgccgtcta aaaaggtgat gtgtatttga gtaaaacagc ttgcgtcatg cggtcgctgc 8280
gtatatgatg cgatgagtaa ataaacaaat acgcaagggg aacgcatgaa ggttatcgct 8340
gtacttaacc agaaaggcgg gtcaggcaag acgaccatcg caacccatct agcccgcgcc 8400
ctgcaactcg ccggggccga tgttctgtta gtcgattccg atccccaggg cagtgcccgc 8460
gattgggcgg ccgtgcggga agatcaaccg ctaaccgttg tcggcatcga ccgcccgacg 8520
attgaccgcg acgtgaaggc catcggccgg cgcgacttcg tagtgatcga cggagcgccc 8580
caggcggcgg acttggctgt gtccgcgatc aaggcagccg acttcgtgct gattccggtg 8640
cagccaagcc cttacgacat atgggccacc gccgacctgg tggagctggt taagcagcgc 8700
attgaggtca cggatggaag gctacaagcg gcctttgtcg tgtcgcgggc gatcaaaggc 8760
acgcgcatcg gcggtgaggt tgccgaggcg ctggccgggt acgagctgcc cattcttgag 8820
tcccgtatca cgcagcgcgt gagctaccca ggcactgccg ccgccggcac aaccgttctt 8880
gaatcagaac ccgagggcga cgctgcccgc gaggtccagg cgctggccgc tgaaattaaa 8940
tcaaaactca tttgagttaa tgaggtaaag agaaaatgag caaaagcaca aacacgctaa 9000
gtgccggccg tccgagcgca cgcagcagca aggctgcaac gttggccagc ctggcagaca 9060
cgccagccat gaagcgggtc aactttcagt tgccggcgga ggatcacacc aagctgaaga 9120
tgtacgcggt acgccaaggc aagaccatta ccgagctgct atctgaatac atcgcgcagc 9180
taccagagta aatgagcaaa tgaataaatg agtagatgaa ttttagcggc taaaggaggc 9240
ggcatggaaa atcaagaaca accaggcacc gacgccgtgg aatgccccat gtgtggagga 9300
acgggcggtt ggccaggcgt aagcggctgg gttgtctgcc ggccctgcaa tggcactgga 9360
acccccaagc ccgaggaatc ggcgtgacgg tcgcaaacca tccggcccgg tacaaatcgg 9420
cgcggcgctg ggtgatgacc tggtggagaa gttgaaggcc gcgcaggccg cccagcggca 9480
acgcatcgag gcagaagcac gccccggtga atcgtggcaa gcggccgctg atcgaatccg 9540
caaagaatcc cggcaaccgc cggcagccgg tgcgccgtcg attaggaagc cgcccaaggg 9600
cgacgagcaa ccagattttt tcgttccgat gctctatgac gtgggcaccc gcgatagtcg 9660
cagcatcatg gacgtggccg ttttccgtct gtcgaagcgt gaccgacgag ctggcgaggt 9720
gatccgctac gagcttccag acgggcacgt agaggtttcc gcagggccgg ccggcatggc 9780
cagtgtgtgg gattacgacc tggtactgat ggcggtttcc catctaaccg aatccatgaa 9840
ccgataccgg gaagggaagg gagacaagcc cggccgcgtg ttccgtccac acgttgcgga 9900
cgtactcaag ttctgccggc gagccgatgg cggaaagcag aaagacgacc tggtagaaac 9960
ctgcattcgg ttaaacacca cgcacgttgc catgcagcgt acgaagaagg ccaagaacgg 10020
ccgcctggtg acggtatccg agggtgaagc cttgattagc cgctacaaga tcgtaaagag 10080
cgaaaccggg cggccggagt acatcgagat cgagctagct gattggatgt accgcgagat 10140
cacagaaggc aagaacccgg acgtgctgac ggttcacccc gattactttt tgatcgatcc 10200
cggcatcggc cgttttctct accgcctggc acgccgcgcc gcaggcaagg cagaagccag 10260
atggttgttc aagacgatct acgaacgcag tggcagcgcc ggagagttca agaagttctg 10320
tttcaccgtg cgcaagctga tcgggtcaaa tgacctgccg gagtacgatt tgaaggagga 10380
ggcggggcag gctggcccga tcctagtcat gcgctaccgc aacctgatcg agggcgaagc 10440
atccgccggt tcctaatgta cggagcagat gctagggcaa attgccctag caggggaaaa 10500
aggtcgaaaa ggtctctttc ctgtggatag cacgtacatt gggaacccaa agccgtacat 10560
tgggaaccgg aacccgtaca ttgggaaccc aaagccgtac attgggaacc ggtcacacat 10620
gtaagtgact gatataaaag agaaaaaagg cgatttttcc gcctaaaact ctttaaaact 10680
tattaaaact cttaaaaccc gcctggcctg tgcataactg tctggccagc gcacagccga 10740
agagctgcaa aaagcgccta cccttcggtc gctgcgctcc ctacgccccg ccgcttcgcg 10800
tcggcctatc gcggccgctg gccgctcaaa aatggctggc ctacggccag gcaatctacc 10860
agggcgcgga caagccgcgc cgtcgccact cgaccgccgg cgcccacatc aaggcaccct 10920
gcctcgcgcg tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg 10980
tcacagcttg tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg 11040
gtgttggcgg gtgtcggggc gcagccatga cccagtcacg tagcgatagc ggagtgtata 11100
ctggcttaac tatgcggcat cagagcagat tgtactgaga gtgcaccata tgcggtgtga 11160
aataccgcac agatgcgtaa ggagaaaata ccgcatcagg cgctcttccg cttcctcgct 11220
cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 11280
ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 11340
ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg 11400
cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg 11460
actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 11520
cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca 11580
tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt 11640
gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc 11700
caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag 11760
agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 11820
tagaaggaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt 11880
tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 11940
gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg 12000
gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatgc attctaggta 12060
ctaaaacaat tcatccagta aaatataata ttttattttc tcccaatcag gcttgatccc 12120
cagtaagtca aaaaatagct cgacatactg ttcttccccg atatcctccc tgatcgaccg 12180
gacgcagaag gcaatgtcat accacttgtc cgccctgccg cttctcccaa gatcaataaa 12240
gccacttact ttgccatctt tcacaaagat gttgctgtct cccaggtcgc cgtgggaaaa 12300
gacaagttcc tcttcgggct tttccgtctt taaaaaatca tacagctcgc gcggatcttt 12360
aaatggagtg tcttcttccc agttttcgca atccacatcg gccagatcgt tattcagtaa 12420
gtaatccaat tcggctaagc ggctgtctaa gctattcgta tagggacaat ccgatatgtc 12480
gatggagtga aagagcctga tgcactccgc atacagctcg ataatctttt cagggctttg 12540
ttcatcttca tactcttccg agcaaaggac gccatcggcc tcactcatga gcagattgct 12600
ccagccatca tgccgttcaa agtgcaggac ctttggaaca ggcagctttc cttccagcca 12660
tagcatcatg tccttttccc gttccacatc ataggtggtc cctttatacc ggctgtccgt 12720
catttttaaa tataggtttt cattttctcc caccagctta tataccttag caggagacat 12780
tccttccgta tcttttacgc agcggtattt ttcgatcagt tttttcaatt ccggtgatat 12840
tctcatttta gccatttatt atttccttcc tcttttctac agtatttaaa gataccccaa 12900
gaagctaatt ataacaagac gaactccaat tcactgttcc ttgcattcta aaaccttaaa 12960
taccagaaaa cagctttttc aaagttgttt tcaaagttgg cgtataacat agtatcgacg 13020
gagccgattt tgaaaccgcg gtgatcacag gcagcaacgc tctgtcatcg ttacaatcaa 13080
catgctaccc tccgcgagat catccgtgtt tcaaacccgg cagcttagtt gccgttcttc 13140
cgaatagcat cggtaacatg agcaaagtct gccgccttac aacggctctc ccgctgacgc 13200
cgtcccggac tgatgggctg cctgtatcga gtggtgattt tgtgccgagc tgccggtcgg 13260
ggagctgttg gctggctggt ggcaggatat attgtggtgt aaacaaattg acgcttagac 13320
aacttaataa cacattgcgg acgtttttaa tgtactgaat taacgccgaa ttaattcggg 13380
ggatctggat tttagtactg gattttggtt ttaggaatta gaaattttat tgatagaagt 13440
attttacaaa tacaaataca tactaagggt ttcttatatg ctcaacacat gagcgaaacc 13500
ctataggaac cctaattccc ttatctggga actactcaca cattattatg gagaaactcg 13560
agcttgtcga tcgacagatc cggtcggcat ctactctatt tctttgccct cggacgagtg 13620
ctggggcgtc ggtttccact atcggcgagt acttctacac agccatcggt ccagacggcc 13680
gcgcttctgc gggcgatttg tgtacgcccg acagtcccgg ctccggatcg gacgattgcg 13740
tcgcatcgac cctgcgccca agctgcatca tcgaaattgc cgtcaaccaa gctctgatag 13800
agttggtcaa gaccaatgcg gagcatatac gcccggagtc gtggcgatcc tgcaagctcc 13860
ggatgcctcc gctcgaagta gcgcgtctgc tgctccatac aagccaacca cggcctccag 13920
aagaagatgt tggcgacctc gtattgggaa tccccgaaca tcgcctcgct ccagtcaatg 13980
accgctgtta tgcggccatt gtccgtcagg acattgttgg agccgaaatc cgcgtgcacg 14040
aggtgccgga cttcggggca gtcctcggcc caaagcatca gctcatcgag agcctgcgcg 14100
acggacgcac tgacggtgtc gtccatcaca gtttgccagt gatacacatg gggatcagca 14160
atcgcgcata tgaaatcacg ccatgtagtg tattgaccga ttccttgcgg tccgaatggg 14220
ccgaacccgc tcgtctggct aagatcggcc gcagcgatcg catccatagc ctccgcgacc 14280
ggttgtagaa cagcgggcag ttcggtttca ggcaggtctt gcaacgtgac accctgtgca 14340
cggcgggaga tgcaataggt caggctctcg ctaaactccc caatgtcaag cacttccgga 14400
atcgggagcg cggccgatgc aaagtgccga taaacataac gatctttgta gaaaccatcg 14460
gcgcagctat ttacccgcag gacatatcca cgccctccta catcgaagct gaaagcacga 14520
gattcttcgc cctccgagag ctgcatcagg tcggagacgc tgtcgaactt ttcgatcaga 14580
aacttctcga cagacgtcgc ggtgagttca ggctttttca tatctcattg ccccccggga 14640
tctgcgaaag ctcgagagag atagatttgt agagagagac tggtgatttc agcgtgtcct 14700
ctccaaatga aatgaacttc cttatataga ggaaggtctt gcgaaggata gtgggattgt 14760
gcgtcatccc ttacgtcagt ggagatatca catcaatcca cttgctttga agacgtggtt 14820
ggaacgtctt ctttttccac gatgctcctc gtgggtgggg gtccatcttt gggaccactg 14880
tcggcagagg catcttgaac gatagccttt cctttatcgc aatgatggca tttgtaggtg 14940
ccaccttcct tttctactgt ccttttgatg aagtgacaga tagctgggca atggaatccg 15000
aggaggtttc ccgatattac cctttgttga aaagtctcaa tagccctttg gtcttctgag 15060
actgtatctt tgatattctt ggagtagacg agagtgtcgt gctccaccat gttatcacat 15120
caatccactt gctttgaaga cgtggttgga acgtcttctt tttccacgat gctcctcgtg 15180
ggtgggggtc catctttggg accactgtcg gcagaggcat cttgaacgat agcctttcct 15240
ttatcgcaat gatggcattt gtaggtgcca ccttcctttt ctactgtcct tttgatgaag 15300
tgacagatag ctgggcaatg gaatccgagg aggtttcccg atattaccct ttgttgaaaa 15360
gtctcaatag ccctttggtc ttctgagact gtatctttga tattcttgga gtagacgaga 15420
gtgtcgtgct ccaccatgtt ggcaagctgc tctagccaat acgcaaaccg cctctccccg 15480
cgcgttggcc gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca 15540
gtgagcgcaa cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact 15600
ttatgcttcc ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa 15660
acagctatga ccatgattac g 15681
<210> 158
<211> 15681
<212> DNA
<213> Artificial Sequence
<220>
<223> vector
<220>
<221> misc_feature
<222> 5964..5984
<223> /note="target sequence"
<220>
<221> misc_feature
<222> 6617..6637
<223> /note="target sequence"
<400> 158
aattcccgat ctagtaacat agatgacacc gcgcgcgata atttatccta gtttgcgcgc 60
tatattttgt tttctatcgc gtattaaatg tataattgcg ggactctaat cataaaaacc 120
catctcataa ataacgtcat gcattacatg ttaattatta catgcttaac gtaattcaac 180
agaaattata tgataatcat cgcaagaccg gcaacaggat tcaatcttaa gaaactttat 240
tgccaaatgt ttgaacgatc ggggaaattc gagctctatc gatcaatcag gatccttact 300
ttttcttttt tgcctggccg gcctttttcg tggccgccgg ccttttgtcg cctcccagct 360
gagacaggtc gatccgtgtc tcgtacaggc cggtgatgct ctggtggatc agggtggcgt 420
ccagcacctc tttggtgctg gtgtacctct tccggtcgat ggtggtgtca aagtacttga 480
aggcggcagg ggctcccaga ttggtcaggg taaacaggtg gatgatattc tcggcctgct 540
ctctgatggg cttatcccgg tgcttgttgt aggcggacag cactttgtcc agattagcgt 600
cggccaggat cactctcttg gagaactcgc tgatctgctc gatgatctcg tccaggtagt 660
gcttgtgctg ttccacaaac agctgtttct gctcattatc ctcgggggag cccttcagct 720
tctcatagtg gctggccagg tacaggaagt tcacatattt ggagggcagg gccagttcgt 780
ttcccttctg cagttcgccg gcagaggcca gcattctctt ccggccgttt tccagctcga 840
acagggagta cttaggcagc ttgatgatca ggtccttttt cacttctttg tagcccttgg 900
cttccagaaa gtcgatggga ttcttctcga agctgcttct ttccatgatg gtgatcccca 960
gcagctcttt cacactcttc agtttcttgg acttgccctt ttccactttg gccaccacca 1020
gcacagaata ggccacggtg gggctgtcga agccgccgta cttcttaggg tcccagtcct 1080
tctttctggc gatcagctta tcgctgttcc tcttgggcag gatagactct ttgctgaagc 1140
cgcctgtctg cacctcggtc tttttcacga tattcacttg gggcatgctc agcactttcc 1200
gcacggtggc aaaatcccgg cccttatccc acacgatctc cccggtttcg ccgtttgtct 1260
cgatcagagg ccgcttccgg atctcgccgt tggccagggt aatctcggtc ttgaaaaagt 1320
tcatgatgtt gctgtagaag aagtacttgg cggtagcctt gccgatttcc tgctcgctct 1380
tggcgatcat cttccgcacg tcgtacacct tgtagtcgcc gtacacgaac tcgctttcca 1440
gcttagggta ctttttgatc agggcggttc ccacgacggc gttcaggtag gcgtcgtggg 1500
cgtggtggta gttgttgatc tcgcgcactt tgtaaaactg gaaatccttc cggaaatcgg 1560
acaccagctt ggacttcagg gtgatcactt tcacttcccg gatcagcttg tcattctcgt 1620
cgtacttagt gttcatccgg gagtccagga tctgtgccac gtgctttgtg atctgccggg 1680
tttccaccag ctgtctcttg atgaagccgg ccttatccag ttcgctcagg ccgcctctct 1740
cggccttggt cagattgtcg aactttctct gggtaatcag cttggcgttc agcagctgcc 1800
gccagtagtt cttcatcttc ttcacgacct cttcggaggg cacgttgtcg ctcttgcccc 1860
ggttcttgtc gcttctggtc agcaccttgt tgtcgatgga gtcgtccttc agaaagctct 1920
gaggcacgat atggtccaca tcgtagtcgg acagccggtt gatgtccagt tcctggtcca 1980
cgtacatatc ccgcccattc tgcaggtagt acaggtacag cttctcgttc tgcagctggg 2040
tgttttccac ggggtgttct ttcaggatct ggctgcccag ctctttgatg ccctcttcga 2100
tccgcttcat tctctcgcgg ctgttcttct gtcccttctg ggtggtctgg ttctctctgg 2160
ccatttcgat cacgatgttc tcgggcttgt gccggcccat cactttcacg agctcgtcca 2220
ccaccttcac tgtctgcagg atgcccttct taatggcggg gctgccggcc agattggcaa 2280
tgtgctcgtg caggctatcg ccctggccgg acacctgggc tttctggatg tcctctttaa 2340
aggtcaggct gtcgtcgtgg atcagctgca tgaagtttct gttggcgaag ccgtcggact 2400
tcaggaaatc caggattgtc ttgccggact gcttgtcccg gatgccgttg atcagcttcc 2460
ggctcagcct gccccagccg gtgtatctcc gccgcttcag ctgcttcatc actttgtcgt 2520
cgaacaggtg ggcataggtt ttcagccgtt cctcgatcat ctctctgtcc tcaaacagtg 2580
tcagggtcag cacgatatct tccagaatgt cctcgttttc ctcattgtcc aggaagtcct 2640
tgtccttgat aattttcagc agatcgtggt atgtgcccag ggaggcgttg aaccgatctt 2700
ccacgccgga gatttccacg gagtcgaagc actcgatttt cttgaagtag tcctctttca 2760
gctgcttcac ggtcactttc cggttggtct tgaacagcag gtccacgatg gcctttttct 2820
gctcgccgct caggaaggcg ggctttctca ttccctcggt cacgtatttc actttggtca 2880
gctcgttata cacggtgaag tactcgtaca gcaggctgtg cttgggcagc accttctcgt 2940
tgggcaggtt cttatcgaag ttggtcatcc gctcgatgaa gctctgggcg gaagcgccct 3000
tgtccaccac ttcctcgaag ttccaggggg tgatggtttc ctcgctcttt ctggtcatcc 3060
aggcgaatct gctgtttccc ctggccagag ggcccacgta gtaggggatg cggaaggtca 3120
ggatcttctc gatcttttcc cggttgtcct tcaggaatgg gtaaaaatct tcctgccgcc 3180
gcagaatggc gtgcagctct cccaggtgga tctggtgggg gatgctgccg ttgtcgaagg 3240
tccgctgctt ccgcagcagg tcctctctgt tcagcttcac gagcagttcc tcggtgccgt 3300
ccatcttttc caggatgggc ttgatgaact tgtagaactc ttcctggctg gctccgccgt 3360
caatgtagcc ggcgtagccg ttcttgctct ggtcgaagaa aatctctttg tacttctcag 3420
gcagctgctg ccgcacgaga gctttcagca gggtcaggtc ctggtggtgc tcgtcgtatc 3480
tcttgatcat agaggcgctc aggggggcct tggtgatctc ggtgttcact ctcaggatgt 3540
cgctcagcag gatggcgtcg gacaggttct tggcggccag aaacaggtcg gcgtactggt 3600
cgccgatctg ggccagcagg ttgtccaggt cgtcgtcgta ggtgtccttg ctcagctgca 3660
gtttggcatc ctcggccagg tcgaagttgc tcttgaagtt gggggtcagg cccaggctca 3720
gggcaatcag gtttccgaac aggccattct tcttctcgcc gggcagctgg gcgatcagat 3780
tttccagccg tctgctcttg ctcagtctgg cagacaggat ggccttggcg tccacgccgc 3840
tggcgttgat ggggttttcc tcgaacagct ggttgtaggt ctgcaccagc tggatgaaca 3900
gcttgtccac gtcgctgttg tcggggttca ggtcgccctc gatcaggaag tggccccgga 3960
acttgatcat gtgggccagg gccagataga tcagccgcag gtcggccttg tcggtgctgt 4020
ccaccagttt ctttctcagg tggtagatgg tggggtactt ctcgtggtag gccacctcgt 4080
ccacgatgtt gccgaagatg gggtgccgct cgtgcttctt atcctcttcc accaggaagg 4140
actcttccag tctgtggaag aagctgtcgt ccaccttggc catctcgttg ctgaagatct 4200
cttgcagata gcagatccgg ttcttccgtc tggtgtatct tcttctggcg gttctcttca 4260
gccgggtggc ctcggctgtt tcgccgctgt cgaacagcag ggctccgatc aggttcttct 4320
tgatgctgtg ccggtcggtg ttgcccagca ccttgaattt cttgctgggc accttgtact 4380
cgtcggtgat cacggcccag cccacagagt tggtgccgat gtccaggccg atgctgtact 4440
tcttgtcggc tgctgggact ccgtggatac cgaccttccg cttcttcttt ggggccatct 4500
tatcgtcatc gtctttgtaa tcaatatcat gatccttgta gtctccgtcg tggtccttat 4560
agtccatctc gagtatcgtt cgtaaatggt gaaaattttc agaaaattgc ttttgcttta 4620
aaagaaatga tttaaattgc tgcaatagaa gtagaatgct tgattgcttg agattcgttt 4680
gttttgtata tgttgtgttg aggtcgaggt cctctccaaa tgaaatgaac ttccttatat 4740
agaggaaggg tcttgcgaag gatagtggga ttgtgcgtca tcccttacgt cagtggagat 4800
atcacatcaa tccacttgct ttgaagacgt ggttggaacg tcttcttttt ccacgatgct 4860
cctcgtgggt gggggtccat ctttgggacc actgtcggca gaggcatctt caacgatggc 4920
ctttccttta tcgcaatgat ggcatttgta ggagccacct tccttttcca ctatcttcac 4980
aataaagtga cagatagctg ggcaatggaa tccgaggagg tttccggata tcaccctttg 5040
ttgaaaagtc tcaattgccc tttggtcttc tgagactgta tctttgatat ttttggagta 5100
gacaagtgtg tcgtgctcca ccatgttatc acatcaatcc acttgctttg aagacgtggt 5160
tggaacgtct tctttttcca cgatgctcct cgtgggtggg ggtccatctt tgggaccact 5220
gtcggcagag gcatcttcaa cgatggcctt tcctttatcg caatgatggc atttgtagga 5280
gccaccttcc ttttccacta tcttcacaat aaagtgacag atagctgggc aatggaatcc 5340
gaggaggttt ccggatatta ccctttgttg aaaagtctca attgcccttt ggtcttctga 5400
gactgtatct ttgatatttt tggagtagac aagtgtgtcg tgctccacca tgttgacctg 5460
caggcatgcc tcggatccac tagtaacggc cgccagtgtg ctggaattgc ccttaagctt 5520
cgttgaacaa cggaaactcg acttgccttc cgcacaatac atcatttctt cttagctttt 5580
tttcttcttc ttcgttcata cagttttttt ttgtttatca gcttacattt tcttgaaccg 5640
tagctttcgt tttcttcttt ttaactttcc attcggagtt tttgtatctt gtttcatagt 5700
ttgtcccagg attagaatga ttaggcatcg aaccttcaag aatttgattg aataaaacat 5760
cttcattctt aagatatgaa gataatcttc aaaaggcccc tgggaatctg aaagaagaga 5820
agcaggccca tttatatggg aaagaacaat agtatttctt atataggccc atttaagttg 5880
aaaacaatct tcaaaagtcc cacatcgctt agataagaaa acgaagctga gtttatatac 5940
agctagagtc gaagtagtga tttccccacg tcactgggcg tcgttttaga gctagaaata 6000
gcaagttaaa ataaggctag tccgttatca acttgaaaaa gtggcaccga gtcggtgctt 6060
tttttgtccc ttcgaagggc ctttctcaga tatccatcac actggcggcc gctcgaggtc 6120
gctcggatcc actagtaacg gccgccagtg tgctggaatt gcccttaagc ttcgttgaac 6180
aacggaaact cgacttgcct tccgcacaat acatcatttc ttcttagctt tttttcttct 6240
tcttcgttca tacagttttt ttttgtttat cagcttacat tttcttgaac cgtagctttc 6300
gttttcttct ttttaacttt ccattcggag tttttgtatc ttgtttcata gtttgtccca 6360
ggattagaat gattaggcat cgaaccttca agaatttgat tgaataaaac atcttcattc 6420
ttaagatatg aagataatct tcaaaaggcc cctgggaatc tgaaagaaga gaagcaggcc 6480
catttatatg ggaaagaaca atagtatttc ttatataggc ccatttaagt tgaaaacaat 6540
cttcaaaagt cccacatcgc ttagataaga aaacgaagct gagtttatat acagctagag 6600
tcgaagtagt gattttggtc tacggagcga tggtgtttta gagctagaaa tagcaagtta 6660
aaataaggct agtccgttat caacttgaaa aagtggcacc gagtcggtgc tttttttgtc 6720
ccttcgaagg gcctttctca gatatccatc acactggcgg ccgctcgagg tcgaagcttg 6780
gcactggccg tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat 6840
cgccttgcag cacatccccc tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat 6900
cgcccttccc aacagttgcg cagcctgaat ggcgaatgct agagcagctt gagcttggat 6960
cagattgtcg tttcccgcct tcagtttaaa ctatcagtgt ttgacaggat atattggcgg 7020
gtaaacctaa gagaaaagag cgtttattag aataacggat atttaaaagg gcgtgaaaag 7080
gtttatccgt tcgtccattt gtatgtgcat gccaaccaca gggttcccct cgggatcaaa 7140
gtactttgat ccaacccctc cgctgctata gtgcagtcgg cttctgacgt tcagtgcagc 7200
cgtcttctga aaacgacatg tcgcacaagt cctaagttac gcgacaggct gccgccctgc 7260
ccttttcctg gcgttttctt gtcgcgtgtt ttagtcgcat aaagtagaat acttgcgact 7320
agaaccggag acattacgcc atgaacaaga gcgccgccgc tggcctgctg ggctatgccc 7380
gcgtcagcac cgacgaccag gacttgacca accaacgggc cgaactgcac gcggccggct 7440
gcaccaagct gttttccgag aagatcaccg gcaccaggcg cgaccgcccg gagctggcca 7500
ggatgcttga ccacctacgc cctggcgacg ttgtgacagt gaccaggcta gaccgcctgg 7560
cccgcagcac ccgcgaccta ctggacattg ccgagcgcat ccaggaggcc ggcgcgggcc 7620
tgcgtagcct ggcagagccg tgggccgaca ccaccacgcc ggccggccgc atggtgttga 7680
ccgtgttcgc cggcattgcc gagttcgagc gttccctaat catcgaccgc acccggagcg 7740
ggcgcgaggc cgccaaggcc cgaggcgtga agtttggccc ccgccctacc ctcaccccgg 7800
cacagatcgc gcacgcccgc gagctgatcg accaggaagg ccgcaccgtg aaagaggcgg 7860
ctgcactgct tggcgtgcat cgctcgaccc tgtaccgcgc acttgagcgc agcgaggaag 7920
tgacgcccac cgaggccagg cggcgcggtg ccttccgtga ggacgcattg accgaggccg 7980
acgccctggc ggccgccgag aatgaacgcc aagaggaaca agcatgaaac cgcaccagga 8040
cggccaggac gaaccgtttt tcattaccga agagatcgag gcggagatga tcgcggccgg 8100
gtacgtgttc gagccgcccg cgcacgtctc aaccgtgcgg ctgcatgaaa tcctggccgg 8160
tttgtctgat gccaagctgg cggcctggcc ggccagcttg gccgctgaag aaaccgagcg 8220
ccgccgtcta aaaaggtgat gtgtatttga gtaaaacagc ttgcgtcatg cggtcgctgc 8280
gtatatgatg cgatgagtaa ataaacaaat acgcaagggg aacgcatgaa ggttatcgct 8340
gtacttaacc agaaaggcgg gtcaggcaag acgaccatcg caacccatct agcccgcgcc 8400
ctgcaactcg ccggggccga tgttctgtta gtcgattccg atccccaggg cagtgcccgc 8460
gattgggcgg ccgtgcggga agatcaaccg ctaaccgttg tcggcatcga ccgcccgacg 8520
attgaccgcg acgtgaaggc catcggccgg cgcgacttcg tagtgatcga cggagcgccc 8580
caggcggcgg acttggctgt gtccgcgatc aaggcagccg acttcgtgct gattccggtg 8640
cagccaagcc cttacgacat atgggccacc gccgacctgg tggagctggt taagcagcgc 8700
attgaggtca cggatggaag gctacaagcg gcctttgtcg tgtcgcgggc gatcaaaggc 8760
acgcgcatcg gcggtgaggt tgccgaggcg ctggccgggt acgagctgcc cattcttgag 8820
tcccgtatca cgcagcgcgt gagctaccca ggcactgccg ccgccggcac aaccgttctt 8880
gaatcagaac ccgagggcga cgctgcccgc gaggtccagg cgctggccgc tgaaattaaa 8940
tcaaaactca tttgagttaa tgaggtaaag agaaaatgag caaaagcaca aacacgctaa 9000
gtgccggccg tccgagcgca cgcagcagca aggctgcaac gttggccagc ctggcagaca 9060
cgccagccat gaagcgggtc aactttcagt tgccggcgga ggatcacacc aagctgaaga 9120
tgtacgcggt acgccaaggc aagaccatta ccgagctgct atctgaatac atcgcgcagc 9180
taccagagta aatgagcaaa tgaataaatg agtagatgaa ttttagcggc taaaggaggc 9240
ggcatggaaa atcaagaaca accaggcacc gacgccgtgg aatgccccat gtgtggagga 9300
acgggcggtt ggccaggcgt aagcggctgg gttgtctgcc ggccctgcaa tggcactgga 9360
acccccaagc ccgaggaatc ggcgtgacgg tcgcaaacca tccggcccgg tacaaatcgg 9420
cgcggcgctg ggtgatgacc tggtggagaa gttgaaggcc gcgcaggccg cccagcggca 9480
acgcatcgag gcagaagcac gccccggtga atcgtggcaa gcggccgctg atcgaatccg 9540
caaagaatcc cggcaaccgc cggcagccgg tgcgccgtcg attaggaagc cgcccaaggg 9600
cgacgagcaa ccagattttt tcgttccgat gctctatgac gtgggcaccc gcgatagtcg 9660
cagcatcatg gacgtggccg ttttccgtct gtcgaagcgt gaccgacgag ctggcgaggt 9720
gatccgctac gagcttccag acgggcacgt agaggtttcc gcagggccgg ccggcatggc 9780
cagtgtgtgg gattacgacc tggtactgat ggcggtttcc catctaaccg aatccatgaa 9840
ccgataccgg gaagggaagg gagacaagcc cggccgcgtg ttccgtccac acgttgcgga 9900
cgtactcaag ttctgccggc gagccgatgg cggaaagcag aaagacgacc tggtagaaac 9960
ctgcattcgg ttaaacacca cgcacgttgc catgcagcgt acgaagaagg ccaagaacgg 10020
ccgcctggtg acggtatccg agggtgaagc cttgattagc cgctacaaga tcgtaaagag 10080
cgaaaccggg cggccggagt acatcgagat cgagctagct gattggatgt accgcgagat 10140
cacagaaggc aagaacccgg acgtgctgac ggttcacccc gattactttt tgatcgatcc 10200
cggcatcggc cgttttctct accgcctggc acgccgcgcc gcaggcaagg cagaagccag 10260
atggttgttc aagacgatct acgaacgcag tggcagcgcc ggagagttca agaagttctg 10320
tttcaccgtg cgcaagctga tcgggtcaaa tgacctgccg gagtacgatt tgaaggagga 10380
ggcggggcag gctggcccga tcctagtcat gcgctaccgc aacctgatcg agggcgaagc 10440
atccgccggt tcctaatgta cggagcagat gctagggcaa attgccctag caggggaaaa 10500
aggtcgaaaa ggtctctttc ctgtggatag cacgtacatt gggaacccaa agccgtacat 10560
tgggaaccgg aacccgtaca ttgggaaccc aaagccgtac attgggaacc ggtcacacat 10620
gtaagtgact gatataaaag agaaaaaagg cgatttttcc gcctaaaact ctttaaaact 10680
tattaaaact cttaaaaccc gcctggcctg tgcataactg tctggccagc gcacagccga 10740
agagctgcaa aaagcgccta cccttcggtc gctgcgctcc ctacgccccg ccgcttcgcg 10800
tcggcctatc gcggccgctg gccgctcaaa aatggctggc ctacggccag gcaatctacc 10860
agggcgcgga caagccgcgc cgtcgccact cgaccgccgg cgcccacatc aaggcaccct 10920
gcctcgcgcg tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg 10980
tcacagcttg tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg 11040
gtgttggcgg gtgtcggggc gcagccatga cccagtcacg tagcgatagc ggagtgtata 11100
ctggcttaac tatgcggcat cagagcagat tgtactgaga gtgcaccata tgcggtgtga 11160
aataccgcac agatgcgtaa ggagaaaata ccgcatcagg cgctcttccg cttcctcgct 11220
cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 11280
ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 11340
ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg 11400
cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg 11460
actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 11520
cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca 11580
tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt 11640
gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc 11700
caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag 11760
agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 11820
tagaaggaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt 11880
tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 11940
gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg 12000
gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatgc attctaggta 12060
ctaaaacaat tcatccagta aaatataata ttttattttc tcccaatcag gcttgatccc 12120
cagtaagtca aaaaatagct cgacatactg ttcttccccg atatcctccc tgatcgaccg 12180
gacgcagaag gcaatgtcat accacttgtc cgccctgccg cttctcccaa gatcaataaa 12240
gccacttact ttgccatctt tcacaaagat gttgctgtct cccaggtcgc cgtgggaaaa 12300
gacaagttcc tcttcgggct tttccgtctt taaaaaatca tacagctcgc gcggatcttt 12360
aaatggagtg tcttcttccc agttttcgca atccacatcg gccagatcgt tattcagtaa 12420
gtaatccaat tcggctaagc ggctgtctaa gctattcgta tagggacaat ccgatatgtc 12480
gatggagtga aagagcctga tgcactccgc atacagctcg ataatctttt cagggctttg 12540
ttcatcttca tactcttccg agcaaaggac gccatcggcc tcactcatga gcagattgct 12600
ccagccatca tgccgttcaa agtgcaggac ctttggaaca ggcagctttc cttccagcca 12660
tagcatcatg tccttttccc gttccacatc ataggtggtc cctttatacc ggctgtccgt 12720
catttttaaa tataggtttt cattttctcc caccagctta tataccttag caggagacat 12780
tccttccgta tcttttacgc agcggtattt ttcgatcagt tttttcaatt ccggtgatat 12840
tctcatttta gccatttatt atttccttcc tcttttctac agtatttaaa gataccccaa 12900
gaagctaatt ataacaagac gaactccaat tcactgttcc ttgcattcta aaaccttaaa 12960
taccagaaaa cagctttttc aaagttgttt tcaaagttgg cgtataacat agtatcgacg 13020
gagccgattt tgaaaccgcg gtgatcacag gcagcaacgc tctgtcatcg ttacaatcaa 13080
catgctaccc tccgcgagat catccgtgtt tcaaacccgg cagcttagtt gccgttcttc 13140
cgaatagcat cggtaacatg agcaaagtct gccgccttac aacggctctc ccgctgacgc 13200
cgtcccggac tgatgggctg cctgtatcga gtggtgattt tgtgccgagc tgccggtcgg 13260
ggagctgttg gctggctggt ggcaggatat attgtggtgt aaacaaattg acgcttagac 13320
aacttaataa cacattgcgg acgtttttaa tgtactgaat taacgccgaa ttaattcggg 13380
ggatctggat tttagtactg gattttggtt ttaggaatta gaaattttat tgatagaagt 13440
attttacaaa tacaaataca tactaagggt ttcttatatg ctcaacacat gagcgaaacc 13500
ctataggaac cctaattccc ttatctggga actactcaca cattattatg gagaaactcg 13560
agcttgtcga tcgacagatc cggtcggcat ctactctatt tctttgccct cggacgagtg 13620
ctggggcgtc ggtttccact atcggcgagt acttctacac agccatcggt ccagacggcc 13680
gcgcttctgc gggcgatttg tgtacgcccg acagtcccgg ctccggatcg gacgattgcg 13740
tcgcatcgac cctgcgccca agctgcatca tcgaaattgc cgtcaaccaa gctctgatag 13800
agttggtcaa gaccaatgcg gagcatatac gcccggagtc gtggcgatcc tgcaagctcc 13860
ggatgcctcc gctcgaagta gcgcgtctgc tgctccatac aagccaacca cggcctccag 13920
aagaagatgt tggcgacctc gtattgggaa tccccgaaca tcgcctcgct ccagtcaatg 13980
accgctgtta tgcggccatt gtccgtcagg acattgttgg agccgaaatc cgcgtgcacg 14040
aggtgccgga cttcggggca gtcctcggcc caaagcatca gctcatcgag agcctgcgcg 14100
acggacgcac tgacggtgtc gtccatcaca gtttgccagt gatacacatg gggatcagca 14160
atcgcgcata tgaaatcacg ccatgtagtg tattgaccga ttccttgcgg tccgaatggg 14220
ccgaacccgc tcgtctggct aagatcggcc gcagcgatcg catccatagc ctccgcgacc 14280
ggttgtagaa cagcgggcag ttcggtttca ggcaggtctt gcaacgtgac accctgtgca 14340
cggcgggaga tgcaataggt caggctctcg ctaaactccc caatgtcaag cacttccgga 14400
atcgggagcg cggccgatgc aaagtgccga taaacataac gatctttgta gaaaccatcg 14460
gcgcagctat ttacccgcag gacatatcca cgccctccta catcgaagct gaaagcacga 14520
gattcttcgc cctccgagag ctgcatcagg tcggagacgc tgtcgaactt ttcgatcaga 14580
aacttctcga cagacgtcgc ggtgagttca ggctttttca tatctcattg ccccccggga 14640
tctgcgaaag ctcgagagag atagatttgt agagagagac tggtgatttc agcgtgtcct 14700
ctccaaatga aatgaacttc cttatataga ggaaggtctt gcgaaggata gtgggattgt 14760
gcgtcatccc ttacgtcagt ggagatatca catcaatcca cttgctttga agacgtggtt 14820
ggaacgtctt ctttttccac gatgctcctc gtgggtgggg gtccatcttt gggaccactg 14880
tcggcagagg catcttgaac gatagccttt cctttatcgc aatgatggca tttgtaggtg 14940
ccaccttcct tttctactgt ccttttgatg aagtgacaga tagctgggca atggaatccg 15000
aggaggtttc ccgatattac cctttgttga aaagtctcaa tagccctttg gtcttctgag 15060
actgtatctt tgatattctt ggagtagacg agagtgtcgt gctccaccat gttatcacat 15120
caatccactt gctttgaaga cgtggttgga acgtcttctt tttccacgat gctcctcgtg 15180
ggtgggggtc catctttggg accactgtcg gcagaggcat cttgaacgat agcctttcct 15240
ttatcgcaat gatggcattt gtaggtgcca ccttcctttt ctactgtcct tttgatgaag 15300
tgacagatag ctgggcaatg gaatccgagg aggtttcccg atattaccct ttgttgaaaa 15360
gtctcaatag ccctttggtc ttctgagact gtatctttga tattcttgga gtagacgaga 15420
gtgtcgtgct ccaccatgtt ggcaagctgc tctagccaat acgcaaaccg cctctccccg 15480
cgcgttggcc gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca 15540
gtgagcgcaa cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact 15600
ttatgcttcc ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa 15660
acagctatga ccatgattac g 15681
<210> 159
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> target site
<400> 159
tccccacgtc actgggcgtc 20
<210> 160
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> target site
<400> 160
cacaccccat ggccaggact 20
<210> 161
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> target site
<400> 161
gcggagactc gtctacagtt 20
<210> 162
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> target site
<400> 162
atgtgttaca gcacgtcggg 20
<210> 163
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> vector
<400> 163
ttggtctacg gagcgatggt 20
<210> 164
<211> 10
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 164
Asp Arg Leu Phe Ile Asp Trp Lys Arg Arg
1 5 10
<210> 165
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 165
Leu Arg Leu Phe Gly Val Asp Val Glu
1 5
<210> 166
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 166
Leu Arg Leu Phe Gly Val Asp Met Glu
1 5
<210> 167
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 167
Leu Arg Leu Phe Gly Val Asp Met Glu
1 5
<210> 168
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 168
Leu Arg Leu Phe Gly Val Asp Met Glu
1 5
<210> 169
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 169
Leu Arg Leu Phe Gly Val Asp Met Glu
1 5
<210> 170
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 170
Leu Arg Leu Phe Gly Val Asp Met Glu
1 5
<210> 171
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 171
Leu Arg Leu Phe Gly Val Asp Met Glu
1 5
<210> 172
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 172
Leu Arg Leu Phe Gly Val Asn Met Glu
1 5
<210> 173
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 173
Leu Arg Leu Phe Gly Val Asn Met Glu
1 5
<210> 174
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 174
Leu Arg Leu Phe Gly Val Asn Met Glu
1 5
<210> 175
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 175
Leu Arg Leu Phe Gly Val Asn Met Glu
1 5
<210> 176
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 176
Leu Arg Leu Phe Gly Val Asn Met Glu
1 5
<210> 177
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 177
Leu Arg Leu Phe Gly Val Asn Met Glu
1 5
<210> 178
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 178
Leu Arg Leu Phe Gly Val Cys Ile Thr
1 5
<210> 179
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 179
Val Arg Leu Phe Gly Val Asp Ile Ala
1 5
<210> 180
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 180
Val Arg Leu Phe Gly Val Asp Ile Ala
1 5
<210> 181
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 181
Val Arg Leu Phe Gly Val Asp Ile Phe
1 5
<210> 182
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 182
Val Arg Leu Phe Gly Val Asp Ile Ser
1 5
<210> 183
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 183
Val Arg Leu Phe Gly Val Asn Ile Leu
1 5
<210> 184
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 184
Val Arg Leu Phe Gly Val Asn Ile Leu
1 5
<210> 185
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 185
Val Arg Leu Phe Gly Val Asp Leu Leu
1 5
<210> 186
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 186
Val Arg Leu Phe Gly Val Asp Leu Leu
1 5
<210> 187
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 187
Val Arg Leu Phe Gly Val Asp Leu Leu
1 5
<210> 188
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 188
Val Arg Leu Phe Gly Val Asp Leu Leu
1 5
<210> 189
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 189
Val Arg Leu Phe Gly Val Asn Leu Leu
1 5
<210> 190
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 190
Val Arg Leu Phe Gly Val Asn Leu Glu
1 5
<210> 191
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 191
Val Arg Leu Phe Gly Val Asn Leu Glu
1 5
<210> 192
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 192
Val Arg Leu Phe Gly Val Asn Leu Glu
1 5
<210> 193
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 193
Val Arg Leu Phe Gly Val Asn Leu Glu
1 5
<210> 194
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 194
Val Arg Leu Phe Gly Val Asn Leu Glu
1 5
<210> 195
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 195
Leu Arg Leu Phe Gly Val Asn Leu Asp
1 5
<210> 196
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 196
Leu Arg Leu Phe Gly Val Asn Leu Asp
1 5
<210> 197
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 197
Val Arg Leu Phe Gly Val Asn Leu Asp
1 5
<210> 198
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 198
Val Arg Leu Phe Gly Val Asn Leu Asp
1 5
<210> 199
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 199
Val Arg Leu Phe Gly Val Asn Leu Asp
1 5
<210> 200
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 200
Val Arg Leu Phe Gly Val Asn Leu Asp
1 5
<210> 201
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 201
Val Arg Leu Phe Gly Val Asn Leu Asp
1 5
<210> 202
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 202
Val Arg Leu Phe Gly Val Asn Leu Asp
1 5
<210> 203
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 203
Val Arg Leu Phe Gly Val Asn Leu Asp
1 5
<210> 204
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 204
Val Arg Leu Phe Gly Val Asn Leu Asp
1 5
<210> 205
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 205
Val Arg Leu Phe Gly Val Asn Leu Asp
1 5
<210> 206
<211> 894
<212> DNA
<213> Brassica rapa
<400> 206
atgatgatga caaacttgtc tctttcaaga gaaggagaag aggaggaaga agaagaacaa 60
gaagaggcca agaagcccat ggaagaagta gagagagagc acatgttcga caaagtggtg 120
actccaagcg atgttggtaa actaaaccgg ctcgtgatcc caaagcaata cgcagagaga 180
tacttccctt tagattcatc cacaaacgag aaaggtttgc ttctaaactt cgaagatctc 240
gcaggaaagt catggaggtt ccgttactct tactggaaca gtagtcagag ctatgtcatg 300
actaaaggtt ggagccgttt cgttaaagac aaaaagctag acgccggaga tattgtctct 360
ttccagagat gtgtcggaga ttcaggaaga gacagccgct tgtttattga ttggaggaga 420
agacctaaag ttcctgacca tccgacatcg attgctcact ttgctgccgg atctatgttt 480
cctaggtttt acagttttcc gacagcaact agttacaatc tttacaacta tcagcagcca 540
cgtcatcatc atcacagtgg ttataattat cctcaaattc cgagagaatt tggatacggg 600
tacttggtgg atcaaagagc cgtggtggct gatccgttgg tgattgaatc tgtgccggtg 660
atgatgcacg gaggagctca agttagtcag gcggttgttg gaacggccgg gaagaggctg 720
aggctttttg gagtcgatat ggaggaagaa tcttcatctt ccggtgggag tttgccacgt 780
ggtgacgctt ctccgtcttc ctctttgttt cagctgagac ttggaagcag cagtgaagat 840
gatcacttct ctaagaaagg aaagtcctca ttgccttttg atttggatca ataa 894
<210> 207
<211> 540
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 207
Met Ala Ala Ser Pro Ser Ser Pro Leu Thr Ala Pro Pro Glu Pro Val
1 5 10 15
Thr Pro Pro Ser Pro Trp Thr Ile Thr Asp Gly Ala Ile Ser Gly Thr
20 25 30
Leu Pro Ala Ala Glu Ala Phe Ala Val His Tyr Pro Gly Tyr Pro Ser
35 40 45
Ser Pro Ala Arg Ala Ala Arg Thr Leu Gly Gly Leu Pro Gly Leu Ala
50 55 60
Lys Val Arg Ser Ser Asp Pro Gly Ala Arg Leu Glu Leu Arg Phe Arg
65 70 75 80
Pro Glu Asp Pro Tyr Cys His Pro Ala Phe Gly Gln Ser Arg Ala Ser
85 90 95
Thr Gly Leu Leu Leu Arg Leu Ser Lys Arg Lys Gly Ala Ala Ala Pro
100 105 110
Cys Ala His Val Val Ala Arg Val Arg Thr Ala Tyr Tyr Phe Glu Gly
115 120 125
Met Ala Asp Phe Gln His Val Val Pro Val His Ala Ala Gln Thr Arg
130 135 140
Lys Arg Lys His Ser Asp Ser Gln Asn Asp Asn Glu Asn Phe Gly Ser
145 150 155 160
Asp Lys Thr Gly His Asp Glu Ala Asp Gly Asp Val Met Met Leu Val
165 170 175
Pro Pro Leu Phe Ser Val Lys Asp Arg Pro Thr Lys Ile Ala Leu Val
180 185 190
Pro Ser Ser Asn Ala Ile Ser Lys Thr Met His Arg Gly Val Val Gln
195 200 205
Glu Arg Trp Glu Met Asn Val Gly Pro Thr Leu Ala Leu Pro Phe Asn
210 215 220
Thr Gln Val Val Pro Glu Lys Ile Asn Trp Glu Asp His Ile Arg Lys
225 230 235 240
Asn Ser Val Glu Trp Gly Trp Gln Met Ala Val Cys Lys Leu Phe Asp
245 250 255
Glu Arg Pro Val Trp Pro Arg Gln Ser Leu Tyr Glu Arg Phe Leu Asp
260 265 270
Asp Asn Val His Val Ser Gln Asn Gln Phe Lys Arg Leu Leu Phe Arg
275 280 285
Ala Gly Tyr Tyr Phe Ser Thr Gly Pro Phe Gly Lys Phe Trp Ile Arg
290 295 300
Arg Gly Tyr Asp Pro Arg Lys Asp Ser Glu Ser Gln Ile Tyr Gln Arg
305 310 315 320
Ile Asp Phe Arg Met Pro Pro Glu Leu Arg Tyr Leu Leu Arg Leu Lys
325 330 335
Asn Ser Glu Ser Arg Lys Trp Ala Asp Met Cys Lys Leu Glu Thr Met
340 345 350
Pro Ser Gln Ser Phe Ile Tyr Leu Gln Leu Tyr Glu Leu Lys Asp Asp
355 360 365
Phe Ile Gln Ala Glu Ile Arg Lys Pro Ser Tyr Gln Ser Val Cys Ser
370 375 380
Arg Ser Thr Gly Trp Phe Ser Lys Pro Met Ile Lys Thr Leu Arg Leu
385 390 395 400
Gln Val Ser Ile Arg Leu Leu Ser Leu Leu His Asn Glu Glu Ala Lys
405 410 415
Asn Leu Leu Arg Asn Ala His Glu Leu Ile Glu Arg Ser Lys Lys Gln
420 425 430
Glu Ala Leu Ser Arg Ser Glu Leu Ser Ile Glu Tyr Asn Asp Ala Asp
435 440 445
Gln Val Ser Ala Ala His Thr Gly Thr Glu Asp Gln Val Gly Pro Asn
450 455 460
Asn Ser Asp Ser Glu Asp Val Asp Asp Glu Glu Glu Glu Glu Glu Leu
465 470 475 480
Glu Gly Tyr Asp Ser Pro Pro Met Ala Asp Asp Ile His Glu Phe Thr
485 490 495
Leu Gly Asp Ser Tyr Ala Phe Gly Glu Gly Phe Ser Asn Gly Tyr Leu
500 505 510
Glu Glu Val Leu Arg Ser Leu Pro Leu Gln Glu Asp Gly Gln Lys Lys
515 520 525
Leu Cys Asp Ala Pro Ile Asn Ala Asp Ala Ser Asp
530 535 540
<210> 208
<211> 160
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 208
Met Tyr Cys Ser Arg Gly Arg Ile Asp Pro Ala Glu Glu Gly Gln Val
1 5 10 15
Met Gly Gly Leu Gly Val Arg Asp Ala Ser Trp Ala Leu Phe Lys Val
20 25 30
Leu Glu Gln Ser Asp Val Gln Val Gly Gln Asn Arg Leu Leu Leu Thr
35 40 45
Lys Glu Ala Val Trp Gly Gly Pro Ile Pro Lys Leu Phe Pro Glu Leu
50 55 60
Glu Glu Leu Arg Gly Asp Gly Leu Asn Ala Glu Asn Arg Val Ala Val
65 70 75 80
Lys Ile Leu Asp Ala Asp Gly Cys Glu Gly Asp Ala Asn Phe Arg Tyr
85 90 95
Leu Asn Ser Ser Lys Ala Tyr Arg Val Met Gly Pro Gln Trp Ser Arg
100 105 110
Leu Val Lys Glu Thr Gly Met Cys Lys Gly Asp Arg Leu Asp Leu Tyr
115 120 125
Ala Ala Thr Ala Thr Ala Ala Ser Ser Cys Ser Gly Ala Arg Ala Ala
130 135 140
Val Ala Pro Ala Ile Pro Pro Gly Ala Ile Val Lys Ala Ala Gly Phe
145 150 155 160
<210> 209
<211> 192
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 209
Met Ala Met His Ala Gly His Ala Trp Trp Gly Val Ala Met Tyr Thr
1 5 10 15
Asn His Tyr His His His Tyr Arg His Lys Thr Ser Asp Val Gly Lys
20 25 30
Asn Arg Val Lys His Ala Arg Tyr Gly Gly Gly Asp Ser Gly Lys Gly
35 40 45
Ser Asp Ser Gly Lys Trp Arg Arg Tyr Ser Tyr Trp Thr Ser Ser Ser
50 55 60
Tyr Val Thr Lys Gly Trp Ser Arg Tyr Val Lys Lys Arg Asp Ala Gly
65 70 75 80
Asp Val Val His Arg Val Arg Gly Gly Ala Ala Asp Arg Gly Cys Arg
85 90 95
Arg Arg Gly Ser Ala Ala Ala Val Arg Val Thr Ala Asn Gly Gly Trp
100 105 110
Ser Met Cys Tyr Ser Thr Ser Gly Ser Ser Tyr Asp Thr Ser Ala Asn
115 120 125
Ser Tyr Ala Tyr His Arg Ser Val Asp Asp His Ser Asp His Ala Gly
130 135 140
Ser Arg Ala Asp Ala Lys Ser Ser Ser Ala Ala Ser Ala Ser Arg Arg
145 150 155 160
Arg Gly Val Asn Asp Cys Gly Ala Asp Ala Thr Ala Met Tyr Gly Tyr
165 170 175
Met His His Ser Tyr Ala Ala Val Ser Thr Val Asn Tyr Trp Ser Val
180 185 190
<210> 210
<211> 491
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 210
Met Glu Leu Met Gln Glu Val Lys Gly Tyr Ser Asp Gly Arg Glu Glu
1 5 10 15
Glu Glu Glu Glu Glu Glu Ala Ala Glu Glu Ile Ile Thr Arg Glu Glu
20 25 30
Ser Ser Arg Leu Leu His Gln His Gln Glu Ala Ala Gly Ser Asn Phe
35 40 45
Ile Ile Asn Asn Asn His His His His Gln His His His His His Thr
50 55 60
Thr Lys Gln Leu Asp Phe Met Asp Leu Ser Leu Gly Ser Ser Lys Asp
65 70 75 80
Glu Gly Asn Leu Gln Gly Ser Ser Ser Ser Val Tyr Ala His His His
85 90 95
His Ala Ala Ser Ala Ser Ser Ser Ala Asn Gly Asn Asn Asn Asn Ser
100 105 110
Ser Ser Ser Asn Leu Gln Gln Gln Gln Gln Gln Pro Ala Glu Lys Glu
115 120 125
His Met Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn
130 135 140
Arg Leu Val Ile Pro Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asp
145 150 155 160
Ser Ser Ala Asn Glu Lys Gly Leu Leu Leu Asn Phe Glu Asp Arg Asn
165 170 175
Gly Lys Leu Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser
180 185 190
Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Lys Leu
195 200 205
Asp Ala Gly Asp Met Val Ser Phe Gln Arg Gly Val Gly Glu Leu Tyr
210 215 220
Arg His Arg Leu Tyr Ile Asp Trp Trp Arg Arg Pro Asp His His His
225 230 235 240
His His His His Gly Pro Asp His Ser Thr Thr Leu Phe Thr Pro Phe
245 250 255
Leu Ile Pro Asn Gln Pro His His Leu Met Ser Ile Arg Trp Gly Ala
260 265 270
Thr Gly Arg Leu Tyr Ser Leu Pro Ser Pro Thr Pro Pro Arg His His
275 280 285
Glu His Leu Asn Tyr Asn Asn Asn Ala Met Tyr His Pro Phe His His
290 295 300
His Gly Ala Gly Ser Gly Ile Asn Ala Thr Thr His His Tyr Asn Asn
305 310 315 320
Tyr His Glu Met Ser Ser Thr Thr Thr Ser Gly Ser Ala Gly Ser Val
325 330 335
Phe Tyr His Arg Ser Thr Pro Pro Ile Ser Met Pro Leu Ala Asp His
340 345 350
Gln Thr Leu Asn Thr Arg Gln Gln Gln Gln Gln Gln Gln Gln Gln Glu
355 360 365
Gly Ala Gly Asn Val Ser Leu Ser Pro Met Ile Ile Asp Ser Val Pro
370 375 380
Val Ala His His Leu His His Gln Gln His His Gly Gly Lys Ser Ser
385 390 395 400
Gly Pro Ser Ser Thr Ser Thr Ser Pro Ser Thr Ala Gly Lys Arg Leu
405 410 415
Arg Leu Phe Gly Val Asn Met Glu Cys Ala Ser Ser Thr Ser Glu Asp
420 425 430
Pro Lys Cys Phe Ser Leu Leu Ser Ser Ser Ser Met Ala Asn Ser Asn
435 440 445
Ser Gln Pro Pro Leu Gln Leu Leu Arg Glu Asp Thr Leu Ser Ser Ser
450 455 460
Ser Ala Arg Phe Gly Asp Gln Arg Gly Val Gly Glu Pro Ser Met Leu
465 470 475 480
Phe Asp Leu Asp Pro Ser Leu Gln Tyr Arg Gln
485 490
<210> 211
<211> 297
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 211
Met Met Met Thr Asn Leu Ser Leu Ser Arg Glu Gly Glu Glu Glu Glu
1 5 10 15
Glu Glu Glu Gln Glu Glu Ala Lys Lys Pro Met Glu Glu Val Glu Arg
20 25 30
Glu His Met Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly Lys Leu
35 40 45
Asn Arg Leu Val Ile Pro Lys Gln Tyr Ala Glu Arg Tyr Phe Pro Leu
50 55 60
Asp Ser Ser Thr Asn Glu Lys Gly Leu Leu Leu Asn Phe Glu Asp Leu
65 70 75 80
Ala Gly Lys Ser Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln
85 90 95
Ser Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val Lys Asp Lys Lys
100 105 110
Leu Asp Ala Gly Asp Ile Val Ser Phe Gln Arg Cys Val Gly Asp Ser
115 120 125
Gly Arg Asp Ser Arg Leu Phe Ile Asp Trp Arg Arg Arg Pro Lys Val
130 135 140
Pro Asp His Pro Thr Ser Ile Ala His Phe Ala Ala Gly Ser Met Phe
145 150 155 160
Pro Arg Phe Tyr Ser Phe Pro Thr Ala Thr Ser Tyr Asn Leu Tyr Asn
165 170 175
Tyr Gln Gln Pro Arg His His His His Ser Gly Tyr Asn Tyr Pro Gln
180 185 190
Ile Pro Arg Glu Phe Gly Tyr Gly Tyr Leu Val Asp Gln Arg Ala Val
195 200 205
Val Ala Asp Pro Leu Val Ile Glu Ser Val Pro Val Met Met His Gly
210 215 220
Gly Ala Gln Val Ser Gln Ala Val Val Gly Thr Ala Gly Lys Arg Leu
225 230 235 240
Arg Leu Phe Gly Val Asp Met Glu Glu Glu Ser Ser Ser Ser Gly Gly
245 250 255
Ser Leu Pro Arg Gly Asp Ala Ser Pro Ser Ser Ser Leu Phe Gln Leu
260 265 270
Arg Leu Gly Ser Ser Ser Glu Asp Asp His Phe Ser Lys Lys Gly Lys
275 280 285
Ser Ser Leu Pro Phe Asp Leu Asp Gln
290 295
<210> 212
<211> 310
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 212
Met Met Thr Asn Leu Ser Leu Ala Arg Glu Gly Glu Glu Glu Glu Glu
1 5 10 15
Glu Ala Gly Ala Lys Lys Pro Thr Glu Glu Val Glu Arg Glu His Met
20 25 30
Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
35 40 45
Val Ile Pro Lys Gln His Ala Glu Arg Tyr Phe Pro Leu Asp Ser Ser
50 55 60
Thr Asn Glu Lys Gly Leu Ile Leu Asn Phe Glu Asp Leu Thr Gly Lys
65 70 75 80
Ser Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val
85 90 95
Met Thr Lys Gly Trp Ser Arg Phe Val Lys Asp Lys Lys Leu Asp Ala
100 105 110
Gly Asp Ile Val Ser Phe Leu Arg Cys Val Gly Asp Thr Gly Arg Asp
115 120 125
Ser Arg Leu Phe Ile Asp Trp Arg Arg Arg Pro Lys Val Pro Asp Tyr
130 135 140
Thr Thr Ser Thr Ser His Phe Pro Ala Gly Ala Met Phe Pro Arg Phe
145 150 155 160
Tyr Ser Phe Gln Thr Ala Thr Thr Ser Thr Ser Tyr Asn Pro Tyr Asn
165 170 175
His Gln Gln Pro Arg His His His Ser Gly Tyr Cys Tyr Pro Gln Ile
180 185 190
Pro Arg Glu Phe Gly Tyr Gly Tyr Val Val Arg Ser Val Asp Gln Arg
195 200 205
Ala Val Val Ala Asp Pro Leu Val Ile Glu Ser Val Pro Val Met Met
210 215 220
His Gly Gly Ala Arg Val Asn Gln Ala Ala Val Gly Thr Ala Gly Lys
225 230 235 240
Arg Leu Arg Leu Phe Gly Val Asp Met Glu Cys Gly Glu Ser Gly Gly
245 250 255
Thr Asn Ser Thr Glu Glu Glu Ser Ser Ser Ser Gly Gly Ser Leu Pro
260 265 270
Arg Gly Gly Ala Ser Pro Ser Ser Ser Met Phe Gln Leu Arg Leu Gly
275 280 285
Asn Ser Ser Glu Asp Asp His Leu Phe Lys Lys Gly Lys Ser Ser Leu
290 295 300
Pro Phe Asn Leu Asp Gln
305 310
<210> 213
<211> 293
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 213
Met Met Thr Asn Leu Ser Leu Ala Arg Glu Gly Glu Ala Gln Val Lys
1 5 10 15
Lys Pro Ile Glu Glu Val Glu Arg Glu His Met Phe Asp Lys Val Val
20 25 30
Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln
35 40 45
His Ala Glu Arg Tyr Phe Pro Leu Asp Ser Ser Ser Asn Glu Lys Gly
50 55 60
Leu Leu Leu Asn Phe Glu Asp Leu Thr Gly Lys Ser Trp Arg Phe Arg
65 70 75 80
Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met Thr Lys Gly Trp
85 90 95
Ser Arg Phe Val Lys Asp Lys Lys Leu Asp Ala Gly Asp Ile Val Ser
100 105 110
Phe Gln Arg Cys Val Gly Asp Ser Arg Leu Phe Ile Asp Trp Arg Arg
115 120 125
Arg Pro Lys Val Pro Asp Tyr Pro Thr Ser Thr Ala His Phe Ala Ala
130 135 140
Gly Ala Met Phe Pro Arg Phe Tyr Ser Phe Pro Thr Ala Thr Thr Ser
145 150 155 160
Thr Cys Tyr Asp Leu Tyr Asn His Gln Pro Pro Arg His His His Ile
165 170 175
Gly Tyr Gly Tyr Pro Gln Ile Pro Arg Glu Phe Gly Tyr Gly Tyr Phe
180 185 190
Val Arg Ser Val Asp Gln Arg Ala Val Val Ala Asp Pro Leu Val Ile
195 200 205
Glu Ser Val Pro Val Met Met Arg Gly Gly Ala Arg Val Ser Gln Glu
210 215 220
Val Val Gly Thr Ala Gly Lys Arg Leu Arg Leu Phe Gly Val Asp Met
225 230 235 240
Glu Glu Glu Ser Ser Ser Ser Gly Gly Ser Leu Pro Arg Ala Gly Gly
245 250 255
Gly Gly Ala Ser Ser Ser Ser Ser Leu Phe Gln Leu Arg Leu Gly Ser
260 265 270
Ser Cys Glu Asp Asp His Phe Ser Lys Lys Gly Lys Ser Ser Leu Pro
275 280 285
Phe Asp Leu Asp Gln
290
<210> 214
<211> 320
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 214
Met Glu Arg Lys Ser Asn Asp Leu Glu Arg Ser Glu Asn Ile Asp Ser
1 5 10 15
Gln Asn Lys Lys Met Asn Leu Glu Glu Glu Arg Pro Val Gln Glu Ala
20 25 30
Ser Ser Met Glu Arg Glu His Met Phe Asp Lys Val Val Thr Pro Ser
35 40 45
Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu
50 55 60
Arg Tyr Phe Pro Leu Asp Asn Asn Ser Ser Asp Asn Asn Lys Gly Leu
65 70 75 80
Leu Leu Asn Phe Glu Asp Arg Ile Gly Ile Leu Trp Ser Phe Arg Tyr
85 90 95
Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met Thr Lys Gly Trp Ser
100 105 110
Arg Phe Val Lys Asp Lys Lys Leu Asp Ala Gly Asp Ile Val Ser Phe
115 120 125
His Arg Gly Ser Cys Asn Lys Asp Lys Leu Phe Ile Asp Trp Lys Arg
130 135 140
Arg Pro Lys Ile Pro Asp His Gln Val Val Gly Ala Met Phe Pro Arg
145 150 155 160
Phe Tyr Ser Tyr Pro Tyr Pro Gln Ile Gln Ala Ser Tyr Glu Arg His
165 170 175
Asn Leu Tyr His Arg Tyr Gln Arg Asp Ile Gly Ile Gly Tyr Tyr Val
180 185 190
Arg Ser Met Glu Arg Tyr Asp Pro Thr Ala Val Ile Glu Ser Val Pro
195 200 205
Val Ile Met Gln Arg Arg Ala His Val Ala Thr Met Ala Ser Ser Arg
210 215 220
Gly Glu Lys Arg Leu Arg Leu Phe Gly Val Asp Met Glu Cys Val Arg
225 230 235 240
Gly Gly Arg Gly Gly Gly Gly Ser Val Asn Ser Thr Glu Glu Glu Ser
245 250 255
Ser Thr Ser Gly Gly Ser Ile Ser Arg Gly Gly Val Ser Met Ala Gly
260 265 270
Val Gly Ser Pro Leu Gln Leu Arg Leu Val Ser Ser Asp Gly Asp Asp
275 280 285
Gln Ser Leu Val Ala Arg Gly Ala Ala Arg Val Asp Glu Asp His His
290 295 300
Leu Phe Thr Lys Lys Gly Lys Ser Ser Leu Ser Phe Asp Leu Asp Lys
305 310 315 320
<210> 215
<211> 286
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 215
Met Asn Gln Glu Glu Glu Asn Pro Val Glu Lys Ala Ser Ser Met Glu
1 5 10 15
Arg Glu His Met Phe Glu Lys Val Val Thr Pro Ser Asp Val Gly Lys
20 25 30
Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg Tyr Phe Pro
35 40 45
Leu Asp Asn Asn Ser Asp Ser Ser Lys Gly Leu Leu Leu Asn Phe Glu
50 55 60
Asp Arg Thr Gly Asn Ser Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser
65 70 75 80
Ser Gln Ser Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val Lys Asp
85 90 95
Lys Lys Leu Asp Ala Gly Asp Ile Val Ser Phe Gln Arg Asp Pro Gly
100 105 110
Asn Lys Asp Lys Leu Phe Ile Asp Trp Arg Arg Arg Pro Lys Ile Pro
115 120 125
Asp His His His Gln Phe Ala Gly Ala Met Phe Pro Arg Phe Tyr Ser
130 135 140
Phe Ser His Pro Gln Asn Leu Tyr His Arg Tyr Gln Gln Asp Leu Gly
145 150 155 160
Ile Gly Tyr Tyr Val Ser Ser Met Glu Arg Asn Asp Pro Thr Ala Val
165 170 175
Ile Glu Ser Val Pro Leu Ile Met Gln Arg Arg Ala Ala His Val Ala
180 185 190
Ala Ile Pro Ser Ser Arg Gly Glu Lys Arg Leu Arg Leu Phe Gly Val
195 200 205
Asp Met Glu Cys Gly Gly Gly Gly Gly Ser Val Asn Ser Thr Glu Glu
210 215 220
Glu Ser Ser Ser Ser Gly Gly Gly Gly Gly Val Ser Met Ala Ser Val
225 230 235 240
Gly Ser Leu Leu Gln Leu Arg Leu Val Ser Ser Asp Asp Glu Ser Leu
245 250 255
Val Ala Met Glu Ala Ala Ser Val Asp Glu Asp His His Leu Phe Thr
260 265 270
Lys Lys Gly Lys Ser Ser Leu Ser Phe Asp Leu Asp Arg Lys
275 280 285
<210> 216
<211> 292
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 216
Met Asn Gln Glu Asn Lys Lys Pro Leu Glu Glu Ala Ser Thr Ser Met
1 5 10 15
Glu Arg Glu Asn Met Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly
20 25 30
Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg Tyr Phe
35 40 45
Pro Leu Asp Asn Ser Ser Thr Asn Asn Lys Gly Leu Leu Leu Asp Phe
50 55 60
Glu Asp Arg Thr Gly Ser Ser Trp Arg Phe Arg Tyr Ser Tyr Trp Asn
65 70 75 80
Ser Ser Gln Ser Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val Lys
85 90 95
Asp Lys Lys Leu Asp Ala Gly Asp Ile Val Ser Phe Gln Arg Asp Pro
100 105 110
Cys Asn Lys Asp Lys Leu Tyr Ile Asp Trp Arg Arg Arg Pro Lys Ile
115 120 125
Pro Asp His His Gln Phe Ala Gly Ala Met Phe Pro Arg Phe Tyr Ser
130 135 140
Phe Pro His Pro Gln Met Pro Thr Ser Phe Glu Ser Ser His Asn Leu
145 150 155 160
Tyr His His Arg Phe Gln Arg Asp Leu Gly Ile Gly Tyr Tyr Pro Thr
165 170 175
Ala Val Ile Glu Ser Val Pro Val Ile Met Gln Arg Arg Glu Ala Gln
180 185 190
Val Ala Asn Met Ala Ser Ser Arg Gly Glu Lys Arg Leu Arg Leu Phe
195 200 205
Gly Val Asp Val Glu Cys Gly Gly Gly Gly Gly Gly Ser Val Asn Ser
210 215 220
Thr Glu Glu Glu Ser Ser Ser Ser Gly Gly Ser Met Ser Arg Gly Gly
225 230 235 240
Val Ser Met Ala Gly Val Gly Ser Leu Leu Gln Leu Arg Leu Val Ser
245 250 255
Ser Asp Asp Glu Ser Leu Val Ala Met Glu Gly Ala Thr Val Asp Glu
260 265 270
Asp His His Leu Phe Thr Thr Lys Lys Gly Lys Ser Ser Leu Ser Phe
275 280 285
Asp Leu Asp Ile
290
<210> 217
<211> 420
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 217
Met Glu Leu Met Gln Gln Val Lys Gly Asn Tyr Ser Asp Ser Arg Glu
1 5 10 15
Glu Glu Glu Glu Glu Glu Ala Ala Ala Ile Thr Arg Glu Ser Glu Ser
20 25 30
Ser Arg Leu His Gln Gln Asp Thr Ala Ser Asn Phe Gly Lys Lys Leu
35 40 45
Asp Leu Met Asp Leu Ser Leu Gly Ser Ser Lys Glu Glu Glu Glu Glu
50 55 60
Gly Asn Leu Gln Gln Gly Gly Gly Gly Val Val His His Ala His Gln
65 70 75 80
Val Val Glu Lys Glu His Met Phe Glu Lys Val Ala Thr Pro Ser Asp
85 90 95
Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Lys
100 105 110
Tyr Phe Pro Leu Asp Ser Ser Thr Asn Glu Lys Gly Leu Leu Leu Asn
115 120 125
Phe Glu Asp Arg Asn Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp
130 135 140
Asn Ser Ser Gln Ser Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val
145 150 155 160
Lys Glu Lys Lys Leu Asp Ala Gly Asp Ile Val Ser Phe Gln Arg Gly
165 170 175
Leu Gly Asp Leu Tyr Arg His Arg Leu Tyr Ile Asp Trp Lys Arg Arg
180 185 190
Pro Asp His Ala His Ala His Pro Pro His His His Asp Pro Leu Phe
195 200 205
Leu Pro Ser Ile Arg Leu Tyr Ser Leu Pro Pro Thr Met Pro Pro Arg
210 215 220
Tyr His His Asp His His Phe His His His Leu Asn Tyr Asn Asn Leu
225 230 235 240
Phe Thr Phe Gln Gln His Gln Tyr Gln Gln Leu Gly Ala Ala Thr Thr
245 250 255
Thr His His Asn Asn Tyr Gly Tyr Gln Asn Ser Gly Ser Gly Ser Leu
260 265 270
Tyr Tyr Leu Arg Ser Ser Met Ser Met Gly Gly Gly Asp Gln Asn Leu
275 280 285
Gln Gly Arg Gly Ser Asn Ile Val Pro Met Ile Ile Asp Ser Val Pro
290 295 300
Val Asn Val Ala His His Asn Asn Asn Arg His Gly Asn Gly Gly Ile
305 310 315 320
Thr Ser Gly Gly Thr Asn Cys Ser Gly Lys Arg Leu Arg Leu Phe Gly
325 330 335
Val Asn Met Glu Cys Ala Ser Ser Ala Glu Asp Ser Lys Glu Leu Ser
340 345 350
Ser Gly Ser Ala Ala His Val Thr Thr Ala Ala Ser Ser Ser Ser Leu
355 360 365
His His Gln Arg Leu Arg Val Pro Val Pro Val Pro Leu Glu Asp Pro
370 375 380
Leu Ser Ser Ser Ala Ala Ala Ala Ala Arg Phe Gly Asp His Lys Gly
385 390 395 400
Ala Ser Thr Gly Thr Ser Leu Leu Phe Asp Leu Asp Pro Ser Leu Gln
405 410 415
Tyr His Arg His
420
<210> 218
<211> 422
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 218
Met Asp Gln Phe Ala Ala Ser Gly Arg Phe Ser Arg Glu Glu Glu Ala
1 5 10 15
Asp Glu Glu Gln Glu Asp Ala Ser Asn Ser Met Arg Glu Ile Ser Phe
20 25 30
Met Pro Pro Ala Ala Ala Ser Ser Ser Ser Ala Ala Ala Ser Ala Ser
35 40 45
Ala Ser Ala Ser Thr Ser Ala Ser Ala Cys Ala Ser Gly Ser Ser Ser
50 55 60
Ala Pro Phe Arg Ser Ala Ser Ala Ser Gly Asp Ala Ala Gly Ala Ser
65 70 75 80
Gly Ser Gly Gly Pro Ala Asp Ala Asp Ala Glu Ala Glu Ala Val Glu
85 90 95
Lys Glu His Met Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly Lys
100 105 110
Leu Asn Arg Leu Val Ile Pro Lys Gln Tyr Ala Glu Lys Tyr Phe Pro
115 120 125
Leu Asp Ala Ala Ala Asn Glu Lys Gly Leu Leu Leu Ser Phe Glu Asp
130 135 140
Ser Ala Gly Lys His Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser
145 150 155 160
Gln Ser Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys
165 170 175
Arg Leu Val Ala Gly Asp Thr Val Ser Phe Ser Arg Ala Ala Ala Glu
180 185 190
Asp Ala Arg His Arg Leu Phe Ile Asp Trp Lys Arg Arg Val Asp Thr
195 200 205
Arg Gly Pro Leu Arg Phe Ser Gly Leu Ala Leu Pro Met Pro Leu Pro
210 215 220
Ser Ser His Tyr Gly Gly Pro His His Tyr Ser Pro Trp Gly Phe Gly
225 230 235 240
Gly Gly Gly Gly Gly Gly Gly Gly Phe Phe Met Pro Pro Ser Pro Pro
245 250 255
Ala Thr Leu Tyr Glu His Arg Leu Arg Gln Gly Leu Asp Phe Arg Ser
260 265 270
Met Thr Thr Thr Tyr Pro Ala Pro Thr Val Gly Arg Gln Leu Leu Phe
275 280 285
Phe Gly Ser Ala Arg Met Pro Pro His His Ala Pro Pro Pro Gln Pro
290 295 300
Arg Pro Phe Ser Leu Pro Leu His His Tyr Thr Val Gln Pro Ser Ala
305 310 315 320
Ala Gly Val Thr Ala Ala Ser Arg Pro Val Leu Leu Asp Ser Val Pro
325 330 335
Val Ile Glu Ser Pro Thr Thr Ala Ala Lys Arg Val Arg Leu Phe Gly
340 345 350
Val Asn Leu Asp Asn Asn Pro Asp Gly Gly Gly Glu Ala Ser His Gln
355 360 365
Gly Asp Ala Leu Ser Leu Gln Met Pro Gly Trp Gln Gln Arg Thr Pro
370 375 380
Thr Leu Arg Leu Leu Glu Leu Pro Arg His Gly Gly Glu Ser Ser Ala
385 390 395 400
Ala Ser Ser Pro Ser Ser Ser Ser Ser Ser Lys Arg Glu Ala Arg Ser
405 410 415
Ala Leu Asp Leu Asp Leu
420
<210> 219
<211> 412
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 219
Met Glu Phe Thr Thr Ser Ser Arg Phe Ser Lys Glu Glu Glu Asp Glu
1 5 10 15
Glu Gln Asp Glu Ala Gly Arg Arg Glu Ile Pro Phe Met Thr Ala Thr
20 25 30
Ala Glu Ala Ala Pro Ala Pro Thr Ser Ser Ser Ser Ser Pro Ala His
35 40 45
His Ala Ala Ser Ala Ser Ala Ser Ala Ser Ala Ser Gly Ser Ser Thr
50 55 60
Pro Phe Arg Ser Asp Asp Gly Ala Gly Ala Ser Gly Ser Gly Gly Gly
65 70 75 80
Gly Gly Gly Gly Gly Glu Ala Glu Val Val Glu Lys Glu His Met Phe
85 90 95
Asp Lys Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val
100 105 110
Ile Pro Lys Gln Tyr Ala Glu Lys Tyr Phe Pro Leu Asp Ala Ala Ala
115 120 125
Asn Glu Lys Gly Leu Leu Leu Asn Phe Glu Asp Arg Ala Gly Lys Pro
130 135 140
Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met
145 150 155 160
Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu Asp Ala Gly
165 170 175
Asp Thr Val Ser Phe Ser Arg Gly Ile Gly Asp Glu Ala Ala Arg His
180 185 190
Arg Leu Phe Ile Asp Trp Lys Arg Arg Ala Asp Thr Arg Asp Pro Leu
195 200 205
Arg Leu Pro Arg Gly Leu Pro Leu Pro Met Pro Leu Thr Ser His Tyr
210 215 220
Ala Pro Trp Gly Ile Gly Gly Gly Gly Gly Phe Phe Val Gln Pro Ser
225 230 235 240
Pro Pro Ala Thr Leu Tyr Glu His Arg Leu Arg Gln Gly Leu Asp Phe
245 250 255
Arg Ala Phe Asn Pro Ala Ala Ala Met Gly Arg Gln Val Leu Leu Phe
260 265 270
Gly Ser Ala Arg Ile Pro Pro Gln Ala Pro Leu Leu Ala Arg Ala Pro
275 280 285
Ser Pro Leu His His His Tyr Thr Leu Gln Pro Ser Gly Asp Gly Val
290 295 300
Arg Ala Ala Gly Ser Pro Val Val Leu Asp Ser Val Pro Val Ile Glu
305 310 315 320
Ser Pro Thr Thr Ala Ala Lys Arg Val Arg Leu Phe Gly Val Asn Leu
325 330 335
Asp Asn Pro His Ala Gly Gly Gly Gly Gly Ala Ala Ala Gly Glu Ser
340 345 350
Ser Asn His Gly Asn Ala Leu Ser Leu Gln Thr Pro Ala Trp Met Arg
355 360 365
Arg Asp Pro Thr Leu Arg Leu Leu Glu Leu Pro Pro His His His His
370 375 380
Gly Ala Glu Ser Ser Ala Ala Ser Ser Pro Ser Ser Ser Ser Ser Ser
385 390 395 400
Lys Arg Asp Ala His Ser Ala Leu Asp Leu Asp Leu
405 410
<210> 220
<211> 409
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 220
Met Glu Phe Thr Ala Thr Ser Ser Arg Phe Ser Lys Gly Glu Glu Glu
1 5 10 15
Val Glu Glu Glu Gln Glu Glu Ala Ser Met Arg Glu Ile Pro Phe Met
20 25 30
Thr Pro Ala Ala Ala Thr Cys Ala Ala Ala Pro Pro Ser Ala Ser Ala
35 40 45
Ser Ala Ser Thr Pro Ala Ser Ala Ser Gly Ser Ser Pro Pro Phe Arg
50 55 60
Ser Gly Asp Asp Ala Gly Ala Ser Gly Ser Gly Ala Gly Asp Gly Ser
65 70 75 80
Arg Ser Asn Val Ala Glu Ala Val Glu Lys Glu His Met Phe Asp Lys
85 90 95
Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro
100 105 110
Lys Gln Tyr Ala Glu Lys Tyr Phe Pro Leu Asp Ser Ala Ala Asn Glu
115 120 125
Lys Gly Leu Leu Leu Asn Phe Glu Asp Ser Ala Gly Lys Pro Trp Arg
130 135 140
Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met Thr Lys
145 150 155 160
Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu Asp Ala Gly Asp Thr
165 170 175
Val Ser Phe Ser Arg Gly Ala Gly Glu Ala Ala Arg His Arg Leu Phe
180 185 190
Ile Asp Trp Lys Arg Arg Ala Asp Thr Arg Asp Pro Leu Arg Leu Pro
195 200 205
Arg Leu Pro Leu Pro Met Pro Leu Thr Ser His Tyr Ser Pro Trp Gly
210 215 220
Leu Gly Ala Gly Ala Arg Gly Phe Phe Met Pro Pro Ser Pro Pro Ala
225 230 235 240
Thr Leu Tyr Glu His Arg Leu Arg Gln Gly Phe Asp Phe Arg Gly Met
245 250 255
Asn Pro Ser Tyr Pro Thr Met Gly Arg Gln Val Ile Leu Phe Gly Ser
260 265 270
Ala Ala Arg Met Pro Pro His Gly Pro Ala Pro Leu Leu Val Pro Arg
275 280 285
Pro Pro Pro Pro Leu His Phe Thr Val Gln Gln Gln Gly Ser Asp Ala
290 295 300
Gly Gly Ser Val Thr Ala Gly Ser Pro Val Val Leu Asp Ser Val Pro
305 310 315 320
Val Ile Glu Ser Pro Thr Thr Ala Thr Lys Lys Arg Val Arg Leu Phe
325 330 335
Gly Val Asn Leu Asp Asn Pro Gln His Pro Gly Asp Gly Gly Gly Glu
340 345 350
Ser Ser Asn Tyr Gly Ser Ala Leu Pro Leu Gln Met Pro Ala Ser Ala
355 360 365
Trp Arg Pro Arg Asp His Thr Leu Arg Leu Leu Glu Phe Pro Ser His
370 375 380
Gly Ala Glu Ala Ser Ser Pro Ser Ser Ser Ser Ser Ser Lys Arg Glu
385 390 395 400
Ala His Ser Gly Leu Asp Leu Asp Leu
405
<210> 221
<211> 316
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 221
Met Glu Phe Ala Thr Thr Ser Ser Arg Phe Ser Lys Glu Glu Glu Glu
1 5 10 15
Glu Glu Glu Gly Glu Gln Glu Met Glu Gln Glu Gln Asp Glu Glu Glu
20 25 30
Glu Glu Ala Glu Ala Ser Pro Arg Glu Ile Pro Phe Met Thr Ser Ala
35 40 45
Ala Ala Ala Ala Thr Ala Ser Ser Ser Ser Pro Thr Ser Val Ser Pro
50 55 60
Ser Ala Thr Ala Ser Ala Ala Ala Ser Thr Ser Ala Ser Gly Ser Pro
65 70 75 80
Phe Arg Ser Ser Asp Gly Ala Gly Ala Ser Gly Ser Gly Gly Gly Gly
85 90 95
Gly Gly Glu Asp Val Glu Val Ile Glu Lys Glu His Met Phe Asp Lys
100 105 110
Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro
115 120 125
Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asp Ser Ala Ala Asn Glu
130 135 140
Lys Gly Leu Leu Leu Ser Phe Glu Asp Arg Thr Gly Lys Leu Trp Arg
145 150 155 160
Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met Thr Lys
165 170 175
Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu Asp Ala Gly Asp Thr
180 185 190
Val Ser Phe Cys Arg Gly Ala Ala Glu Ala Thr Arg Asp Arg Leu Phe
195 200 205
Ile Asp Trp Lys Arg Arg Ala Asp Val Arg Asp Pro His Arg Phe Gln
210 215 220
Arg Leu Pro Leu Pro Met Thr Ser Pro Tyr Gly Pro Trp Gly Gly Gly
225 230 235 240
Ala Gly Ala Ser Ser Cys Arg Pro Arg Arg Pro Pro Arg Ser Thr Ser
245 250 255
Ile Thr Ala Phe Ala Arg Ala Ser Thr Ser Ala Thr Ser Thr Pro Leu
260 265 270
Cys Arg Arg Gly Ser Ser Ser Ser Ser Ala Pro Gln Gly Arg Gly Phe
275 280 285
Ile Ser Thr Arg Pro Cys His Arg Arg Arg Arg His Leu Arg Leu Leu
290 295 300
Thr Asn Ser Thr Leu Arg Cys Thr Thr Arg Ala Pro
305 310 315
<210> 222
<211> 409
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 222
Met Glu Phe Ala Ser Ser Ser Ser Arg Phe Ser Arg Glu Glu Asp Glu
1 5 10 15
Glu Glu Glu Gln Glu Glu Glu Glu Glu Glu Glu Glu Ala Ser Pro Arg
20 25 30
Glu Ile Pro Phe Met Thr Ala Ala Ala Thr Ala Asp Thr Gly Ala Ala
35 40 45
Ala Ser Ser Ser Ser Pro Ser Ala Ala Ala Ser Ser Gly Pro Ala Ala
50 55 60
Ala Pro Arg Ser Ser Asp Gly Ala Gly Ala Ser Gly Ser Gly Gly Gly
65 70 75 80
Gly Ser Asp Asp Val Gln Val Ile Glu Lys Glu His Met Phe Asp Lys
85 90 95
Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro
100 105 110
Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asp Ala Ala Ala Asn Glu
115 120 125
Lys Gly Gln Leu Leu Ser Phe Glu Asp Arg Ala Gly Lys Leu Trp Arg
130 135 140
Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met Thr Lys
145 150 155 160
Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu Asp Ala Gly Asp Thr
165 170 175
Val Ser Phe Cys Arg Gly Ala Gly Asp Thr Ala Arg Asp Arg Leu Phe
180 185 190
Ile Asp Trp Lys Arg Arg Ala Asp Ser Arg Asp Pro His Arg Met Pro
195 200 205
Arg Leu Pro Leu Pro Met Ala Pro Val Ala Ser Pro Tyr Gly Pro Trp
210 215 220
Gly Gly Gly Gly Gly Gly Gly Ala Gly Gly Phe Phe Met Pro Pro Ala
225 230 235 240
Pro Pro Ala Thr Leu Tyr Glu His His Arg Phe Arg Gln Ala Leu Asp
245 250 255
Phe Arg Asn Ile Asn Ala Ala Ala Ala Pro Ala Arg Gln Leu Leu Phe
260 265 270
Phe Gly Ser Ala Gly Met Pro Pro Arg Ala Ser Met Pro Gln Gln Gln
275 280 285
Gln Pro Pro Pro Pro Pro His Pro Pro Leu His Ser Ile Met Leu Val
290 295 300
Gln Pro Ser Pro Ala Pro Pro Thr Ala Ser Val Pro Met Leu Leu Asp
305 310 315 320
Ser Val Pro Leu Val Asn Ser Pro Thr Ala Ala Ser Lys Arg Val Arg
325 330 335
Leu Phe Gly Val Asn Leu Asp Asn Pro Gln Pro Gly Thr Ser Ala Glu
340 345 350
Ser Ser Gln Asp Ala Asn Ala Leu Ser Leu Arg Thr Pro Gly Trp Gln
355 360 365
Arg Pro Gly Pro Leu Arg Phe Phe Glu Ser Pro Gln Arg Gly Ala Glu
370 375 380
Ser Ser Ala Ala Ser Ser Pro Ser Ser Ser Ser Ser Ser Lys Arg Glu
385 390 395 400
Ala His Ser Ser Leu Asp Leu Asp Leu
405
<210> 223
<211> 312
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 223
Met Glu Phe Thr Pro Ile Ser Pro Pro Thr Arg Val Ala Gly Gly Glu
1 5 10 15
Glu Asp Ser Glu Arg Gly Ala Ala Ala Trp Ala Val Val Glu Lys Glu
20 25 30
His Met Phe Glu Lys Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn
35 40 45
Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg Tyr Phe Pro Leu Asp
50 55 60
Ala Ala Ala Gly Ala Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly
65 70 75 80
Gly Gly Lys Gly Leu Val Leu Ser Phe Glu Asp Arg Thr Gly Lys Ala
85 90 95
Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met
100 105 110
Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu Gly Ala Gly
115 120 125
Asp Thr Val Ser Phe Gly Arg Gly Leu Gly Asp Ala Ala Arg Gly Arg
130 135 140
Leu Phe Ile Asp Phe Arg Arg Arg Arg Gln Asp Ala Gly Ser Phe Met
145 150 155 160
Phe Pro Pro Thr Ala Ala Pro Pro Ser His Ser His His His His Gln
165 170 175
Arg His His Pro Pro Leu Pro Ser Val Pro Leu Cys Pro Trp Arg Asp
180 185 190
Tyr Thr Thr Ala Tyr Gly Gly Gly Tyr Gly Tyr Gly Tyr Gly Gly Gly
195 200 205
Ser Thr Pro Ala Ser Ser Arg His Val Leu Phe Leu Arg Pro Gln Val
210 215 220
Pro Ala Ala Val Val Leu Lys Ser Val Pro Val His Val Ala Ala Thr
225 230 235 240
Ser Ala Val Gln Glu Ala Ala Thr Thr Thr Arg Pro Lys Arg Val Arg
245 250 255
Leu Phe Gly Val Asn Leu Asp Cys Pro Ala Ala Met Asp Asp Asp Asp
260 265 270
Asp Ile Ala Gly Ala Ala Ser Arg Thr Ala Ala Ser Ser Leu Leu Gln
275 280 285
Leu Pro Ser Pro Ser Ser Ser Thr Ser Ser Ser Thr Ala Gly Lys Lys
290 295 300
Met Cys Ser Leu Asp Leu Gly Leu
305 310
<210> 224
<211> 277
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 224
Met Glu Phe Thr Pro Ala His Ala His Ala Arg Val Val Glu Asp Ser
1 5 10 15
Glu Arg Pro Arg Gly Gly Val Ala Trp Val Glu Lys Glu His Met Phe
20 25 30
Glu Lys Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val
35 40 45
Ile Pro Lys Gln His Ala Glu Arg Tyr Phe Pro Ala Leu Asp Ala Ser
50 55 60
Ser Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Gly Gly Gly Lys Gly
65 70 75 80
Leu Val Leu Ser Phe Glu Asp Arg Ala Gly Lys Ala Trp Arg Phe Arg
85 90 95
Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met Thr Lys Gly Trp
100 105 110
Ser Arg Phe Val Lys Glu Lys Arg Leu Gly Ala Gly Asp Thr Val Leu
115 120 125
Phe Ala Arg Gly Ala Gly Gly Ala Arg Gly Arg Phe Phe Ile Asp Phe
130 135 140
Arg Arg Arg Arg Gln Asp Leu Ala Phe Leu Gln Pro Thr Leu Ala Ser
145 150 155 160
Ala Gln Arg Leu Leu Pro Leu Pro Ser Val Pro Ile Cys Pro Trp Gln
165 170 175
Asp Tyr Gly Ala Ser Ala Pro Ala Pro Asn Arg His Val Leu Phe Leu
180 185 190
Arg Pro Gln Val Pro Ala Ala Val Val Leu Lys Ser Val Pro Val His
195 200 205
Val Ala Ala Ser Ala Val Glu Ala Thr Met Ser Lys Arg Val Arg Leu
210 215 220
Phe Gly Val Asn Leu Asp Cys Pro Pro Asp Ala Glu Asp Ser Ala Thr
225 230 235 240
Val Pro Arg Gly Arg Ala Ala Ser Thr Thr Leu Leu Gln Leu Pro Ser
245 250 255
Pro Ser Ser Ser Thr Ser Ser Ser Thr Ala Gly Lys Asp Val Cys Cys
260 265 270
Leu Asp Leu Gly Leu
275
<210> 225
<211> 273
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 225
Met Glu Phe Arg Pro Ala His Ala Arg Val Phe Glu Asp Ser Glu Arg
1 5 10 15
Pro Arg Gly Gly Val Ala Trp Leu Glu Lys Glu His Met Phe Glu Lys
20 25 30
Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro
35 40 45
Lys Gln His Ala Glu Arg Tyr Phe Pro Ala Leu Asp Ala Ser Ala Ala
50 55 60
Ala Ala Ser Ala Ser Ala Ser Ala Gly Gly Gly Lys Ala Gly Leu Val
65 70 75 80
Leu Ser Phe Glu Asp Arg Ala Gly Lys Ala Trp Arg Phe Arg Tyr Ser
85 90 95
Tyr Trp Asn Ser Ser Gln Ser Tyr Val Met Thr Lys Gly Trp Ser Arg
100 105 110
Phe Val Lys Glu Lys Arg Leu Gly Ala Gly Asp Thr Val Leu Phe Ala
115 120 125
Arg Gly Ala Gly Ala Thr Arg Gly Arg Phe Phe Ile Asp Phe Arg Arg
130 135 140
Arg Arg His Glu Leu Ala Phe Leu Gln Pro Pro Leu Ala Ser Ala Gln
145 150 155 160
Arg Leu Leu Pro Leu Pro Ser Val Pro Ile Cys Pro Trp Gln Gly Tyr
165 170 175
Gly Ala Ser Ala Pro Ala Pro Ser Arg His Val Leu Phe Leu Arg Pro
180 185 190
Gln Val Pro Ala Ala Val Val Leu Thr Ser Val Pro Val Arg Val Ala
195 200 205
Ala Ser Ala Val Glu Glu Ala Thr Arg Ser Lys Arg Val Arg Leu Phe
210 215 220
Gly Val Asn Leu Asp Cys Pro Pro Asp Ala Glu Asp Gly Ala Thr Ala
225 230 235 240
Thr Arg Thr Pro Ser Thr Leu Leu Gln Leu Pro Ser Pro Ser Ser Ser
245 250 255
Thr Ser Ser Ser Thr Gly Gly Lys Asp Val Arg Ser Leu Asp Leu Gly
260 265 270
Leu
<210> 226
<211> 282
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 226
Met Glu Phe Ile Thr Pro Ile Val Arg Pro Ala Ser Ala Ala Ala Gly
1 5 10 15
Gly Gly Glu Val Gln Glu Ser Glu Arg Pro Arg Gly Gly Val Ala Trp
20 25 30
Leu Glu Lys Glu His Met Phe Glu Lys Val Val Thr Pro Ser Asp Val
35 40 45
Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg Tyr
50 55 60
Phe Pro Ala Leu Asp Ala Ser Ala Ala Ala Ala Ser Ala Ser Ala Ser
65 70 75 80
Ala Gly Gly Gly Lys Ala Gly Leu Val Leu Ser Phe Glu Asp Arg Ala
85 90 95
Gly Lys Ala Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser
100 105 110
Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu
115 120 125
Gly Ala Gly Asp Thr Val Leu Phe Ala Arg Gly Ala Gly Ala Thr Arg
130 135 140
Gly Arg Phe Phe Ile Asp Phe Arg Arg Arg Arg His Glu Leu Ala Phe
145 150 155 160
Leu Gln Pro Pro Leu Ala Ser Ala Gln Arg Leu Leu Pro Leu Pro Ser
165 170 175
Val Pro Ile Cys Pro Trp Gln Gly Tyr Gly Ala Ser Ala Pro Ala Pro
180 185 190
Ser Arg His Val Leu Phe Leu Arg Pro Gln Val Pro Ala Ala Val Val
195 200 205
Leu Thr Ser Val Pro Val Arg Val Ala Ala Ser Ala Val Glu Glu Ala
210 215 220
Thr Arg Ser Lys Arg Val Arg Leu Phe Gly Val Asn Leu Asp Cys Pro
225 230 235 240
Pro Asp Ala Glu Asp Gly Ala Thr Ala Thr Arg Thr Pro Ser Thr Leu
245 250 255
Leu Gln Leu Pro Ser Pro Ser Ser Ser Thr Ser Ser Ser Thr Gly Gly
260 265 270
Lys Asp Val Arg Ser Leu Asp Leu Gly Leu
275 280
<210> 227
<211> 259
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 227
Met Glu Phe Thr Thr Pro Pro Pro Ala Thr Arg Ser Gly Gly Gly Glu
1 5 10 15
Glu Arg Ala Ala Ala Glu His Asn Gln His His Gln Gln Gln His Ala
20 25 30
Thr Val Glu Lys Glu His Met Phe Asp Lys Val Val Thr Pro Ser Asp
35 40 45
Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Lys
50 55 60
Tyr Phe Pro Leu Asp Ala Ala Ala Asn Glu Lys Gly Leu Leu Leu Ser
65 70 75 80
Phe Glu Asp Arg Thr Gly Lys Pro Trp Arg Phe Arg Tyr Ser Tyr Trp
85 90 95
Asn Ser Ser Gln Ser Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val
100 105 110
Lys Glu Lys Arg Leu Asp Ala Gly Asp Thr Val Ser Phe Gly Arg Gly
115 120 125
Ile Ser Glu Ala Ala Arg Asp Arg Leu Phe Ile Asp Trp Arg Cys Arg
130 135 140
Pro Asp Pro Pro Val Val His His Gln Tyr His His Arg Leu Pro Leu
145 150 155 160
Pro Ser Ala Val Val Pro Tyr Ala Pro Trp Ala Ala His Ala His His
165 170 175
His His Tyr Pro Ala Asp Gly His Thr Glu Pro Val Thr Pro Cys Leu
180 185 190
Cys Ala Thr Leu Val Ala Thr Glu Met Arg Ala Ser Ser Ser Gln Leu
195 200 205
Ser Leu Thr Arg Ser Asn Leu Ser Arg Pro Pro Gln Pro Arg Ile Ala
210 215 220
Arg Val Asp Gly Ala Gln Pro Arg Pro Ser Ser Ser Pro Arg Gln Pro
225 230 235 240
Gln Ser Leu Trp Cys Arg Ser Cys Gln Pro Gln Pro Arg Arg Thr Ala
245 250 255
Asp Val Pro
<210> 228
<211> 327
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 228
Met Glu Phe Thr Ala Pro Pro Pro Ala Thr Arg Ser Gly Gly Gly Glu
1 5 10 15
Glu Arg Ala Ala Ala Glu His His Gln Gln Gln Gln Gln Ala Thr Val
20 25 30
Glu Lys Glu His Met Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly
35 40 45
Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg Tyr Phe
50 55 60
Pro Leu Asp Ala Ala Ala Asn Asp Lys Gly Leu Leu Leu Ser Phe Glu
65 70 75 80
Asp Arg Ala Gly Lys Pro Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser
85 90 95
Ser Gln Ser Tyr Val Met Thr Lys Gly Trp Ser Arg Phe Val Lys Glu
100 105 110
Lys Arg Leu Asp Ala Gly Asp Thr Val Ser Phe Gly Arg Gly Val Gly
115 120 125
Glu Ala Ala Arg Gly Arg Leu Phe Ile Asp Trp Arg Arg Arg Pro Asp
130 135 140
Pro Pro Val Val His His Gln Tyr His His His Arg Leu Pro Leu Pro
145 150 155 160
Ser Ala Val Val Pro Tyr Ala Pro Trp Ala Ala Ala Ala His Ala His
165 170 175
His His His Tyr Pro Ala Ala Gly Val Gly Ala Ala Arg Thr Thr Thr
180 185 190
Thr Thr Thr Thr Thr Val Leu His His Leu Pro Pro Ser Pro Ser Pro
195 200 205
Leu Tyr Leu Asp Thr Arg Arg Arg His Val Gly Tyr Asp Ala Tyr Gly
210 215 220
Ala Gly Thr Arg Gln Leu Leu Phe Tyr Arg Pro His Gln Gln Pro Ser
225 230 235 240
Thr Thr Val Met Leu Asp Ser Val Pro Val Arg Leu Pro Pro Thr Pro
245 250 255
Gly Gln His Ala Glu Pro Pro Pro Pro Ala Val Ala Ser Ser Ala Ser
260 265 270
Lys Arg Val Arg Leu Phe Gly Val Asn Leu Asp Cys Ala Ala Ala Ala
275 280 285
Gly Ser Glu Glu Glu Asn Val Gly Gly Trp Arg Thr Ser Ala Pro Pro
290 295 300
Thr Gln Gln Ala Ser Ser Ser Ser Ser Tyr Ser Ser Gly Lys Ala Arg
305 310 315 320
Cys Ser Leu Asn Leu Asp Leu
325
<210> 229
<211> 279
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 229
Met Ala Met Asn His Pro Leu Phe Ser Gln Glu Gln Pro Gln Ser Trp
1 5 10 15
Pro Trp Gly Val Ala Met Tyr Ala Asn Phe His Tyr His His His Tyr
20 25 30
Glu Lys Glu His Met Phe Glu Lys Pro Leu Thr Pro Ser Asp Val Gly
35 40 45
Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg Tyr Phe
50 55 60
Pro Leu Gly Ala Gly Asp Ala Ala Asp Lys Gly Leu Ile Leu Ser Phe
65 70 75 80
Glu Asp Glu Ala Gly Ala Pro Trp Arg Phe Arg Tyr Ser Tyr Trp Thr
85 90 95
Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val Lys
100 105 110
Glu Lys Arg Leu Asp Ala Gly Asp Val Val His Phe Glu Arg Val Arg
115 120 125
Gly Ser Phe Gly Val Gly Asp Arg Leu Phe Ile Gly Cys Arg Arg Arg
130 135 140
Gly Asp Ala Ala Ala Ala Gln Thr Pro Ala Pro Pro Pro Ala Val Arg
145 150 155 160
Val Ala Pro Ala Ala Gln Asn Ala Gly Glu Gln Gln Pro Trp Ser Pro
165 170 175
Met Cys Tyr Ser Thr Ser Gly Gly Gly Ser Tyr Pro Thr Ser Pro Ala
180 185 190
Asn Ser Tyr Ala Tyr Arg Arg Ala Ala Asp His Asp His Gly Asp Met
195 200 205
His His Ala Asp Glu Ser Pro Arg Asp Thr Asp Ser Pro Ser Phe Ser
210 215 220
Ala Gly Ser Ala Pro Ser Arg Arg Leu Arg Leu Phe Gly Val Asn Leu
225 230 235 240
Asp Cys Gly Pro Glu Pro Glu Ala Asp Thr Thr Ala Ala Ala Thr Met
245 250 255
Tyr Gly Tyr Met His Gln Gln Ser Ser Tyr Ala Ala Met Ser Ala Val
260 265 270
Pro Ser Tyr Trp Gly Asn Ser
275
<210> 230
<211> 307
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 230
Met Ala Thr Asn His Leu Ser Gln Gly Gln His Gln His Pro Gln Ala
1 5 10 15
Trp Pro Trp Gly Val Ala Met Tyr Thr Asn Leu His Tyr His His Gln
20 25 30
Gln His His His Tyr Glu Lys Glu His Leu Phe Glu Lys Pro Leu Thr
35 40 45
Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His
50 55 60
Ala Glu Arg Tyr Phe Pro Leu Ser Ser Ser Gly Ala Gly Asp Lys Gly
65 70 75 80
Leu Ile Leu Cys Phe Glu Asp Asp Asp Asp Asp Glu Ala Ala Ala Ala
85 90 95
Asn Lys Pro Trp Arg Phe Arg Tyr Ser Tyr Trp Thr Ser Ser Gln Ser
100 105 110
Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val Lys Glu Lys Gln Leu
115 120 125
Asp Ala Gly Asp Val Val Arg Phe Gln Arg Met Arg Gly Phe Gly Met
130 135 140
Pro Asp Arg Leu Phe Ile Ser His Ser Arg Arg Gly Glu Thr Thr Ala
145 150 155 160
Thr Ala Ala Thr Thr Val Pro Pro Ala Ala Ala Ala Val Arg Val Val
165 170 175
Val Ala Pro Ala Gln Ser Ala Gly Ala Asp His Gln Gln Gln Gln Gln
180 185 190
Pro Ser Pro Trp Ser Pro Met Cys Tyr Ser Thr Ser Gly Ser Tyr Ser
195 200 205
Tyr Pro Thr Ser Ser Pro Ala Asn Ser Gln His Ala Tyr His Arg His
210 215 220
Ser Ala Asp His Asp His Ser Asn Asn Met Gln His Ala Gly Glu Ser
225 230 235 240
Gln Ser Asp Arg Asp Asn Arg Ser Cys Ser Ala Ala Ser Ala Pro Pro
245 250 255
Pro Pro Ser Arg Arg Leu Arg Leu Phe Gly Val Asn Leu Asp Cys Gly
260 265 270
Pro Gly Pro Glu Pro Glu Thr Pro Thr Ala Met Tyr Gly Tyr Met His
275 280 285
Gln Ser Pro Tyr Ala Tyr Asn Asn Trp Gly Ser Pro Tyr Gln His Asp
290 295 300
Glu Glu Ile
305
<210> 231
<211> 288
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 231
Met Ser Ser Ile Asn His Tyr Ser Pro Glu Thr Thr Leu Tyr Trp Thr
1 5 10 15
Asn Asp Gln Gln Gln Gln Ala Ala Met Trp Leu Ser Asn Ser His Thr
20 25 30
Pro Arg Phe Asn Leu Asn Asp Glu Glu Glu Glu Glu Glu Asp Asp Val
35 40 45
Ile Val Ser Asp Lys Ala Thr Asn Asn Leu Thr Gln Glu Glu Glu Lys
50 55 60
Val Ala Met Phe Glu Lys Pro Leu Thr Pro Ser Asp Val Gly Lys Leu
65 70 75 80
Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Lys His Phe Pro Leu
85 90 95
Asp Ser Ser Ala Ala Lys Gly Leu Leu Leu Ser Phe Glu Asp Glu Ser
100 105 110
Gly Lys Cys Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser
115 120 125
Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val Lys Asp Lys Arg Leu
130 135 140
His Ala Gly Asp Val Val Leu Phe His Arg His Arg Ser Leu Pro Gln
145 150 155 160
Arg Phe Phe Ile Ser Cys Ser Arg Arg Gln Pro Asn Pro Val Pro Ala
165 170 175
His Val Ser Thr Thr Arg Ser Ser Ala Ser Phe Tyr Ser Ala His Pro
180 185 190
Pro Tyr Pro Ala His His Phe Pro Phe Pro Tyr Gln Pro His Ser Leu
195 200 205
His Ala Pro Gly Gly Gly Ser Gln Gly Gln Asn Glu Thr Thr Pro Gly
210 215 220
Gly Asn Ser Ser Ser Ser Gly Ser Gly Arg Val Leu Arg Leu Phe Gly
225 230 235 240
Val Asn Met Glu Cys Gln Pro Asp Asn His Asn Asp Ser Gln Asn Ser
245 250 255
Thr Pro Glu Cys Ser Tyr Thr His Leu Tyr His His Gln Thr Ser Ser
260 265 270
Tyr Ser Ser Ser Ser Asn Pro His His His Met Val Pro Gln Gln Pro
275 280 285
<210> 232
<211> 337
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 232
Met Ser Ile Asn His Tyr Ser Met Asp Leu Pro Glu Pro Thr Leu Trp
1 5 10 15
Trp Pro His Pro His His Gln Gln Gln Gln Leu Thr Leu Met Asp Pro
20 25 30
Asp Pro Leu Arg Leu Asn Leu Asn Ser Asp Asp Gly Asn Gly Asn Asp
35 40 45
Asn Asp Asn Asp Glu Asn Gln Thr Thr Thr Thr Gly Gly Glu Gln Glu
50 55 60
Ile Leu Asp Asp Lys Glu Pro Met Phe Glu Lys Pro Leu Thr Pro Ser
65 70 75 80
Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu
85 90 95
Lys Tyr Phe Pro Leu Ser Gly Asp Ser Gly Gly Ser Glu Cys Lys Gly
100 105 110
Leu Leu Leu Ser Phe Glu Asp Glu Ser Gly Lys Cys Trp Arg Phe Arg
115 120 125
Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp
130 135 140
Ser Arg Tyr Val Lys Asp Lys Arg Leu Asp Ala Gly Asp Val Val Leu
145 150 155 160
Phe Glu Arg His Arg Val Asp Ala Gln Arg Leu Phe Ile Gly Trp Arg
165 170 175
Arg Arg Arg Gln Ser Asp Ala Ala Leu Pro Pro Ala His Val Ser Ser
180 185 190
Arg Lys Ser Gly Gly Gly Asp Gly Asn Ser Asn Lys Asn Glu Gly Trp
195 200 205
Thr Arg Gly Phe Tyr Ser Ala His His Pro Tyr Pro Thr His His Leu
210 215 220
His His His Gln Pro Ser Pro Tyr Gln Gln Gln His Asp Cys Leu His
225 230 235 240
Ala Gly Arg Gly Ser Gln Gly Gln Asn Gln Arg Met Arg Pro Val Gly
245 250 255
Asn Asn Ser Ser Ser Ser Ser Ser Ser Ser Arg Val Leu Arg Leu Phe
260 265 270
Gly Val Asp Met Glu Cys Gln Pro Glu His Asp Asp Ser Gly Pro Ser
275 280 285
Thr Pro Gln Cys Ser Tyr Asn Ser Asn Asn Met Leu Pro Ser Thr Gln
290 295 300
Gly Thr Asp His Ser His His Asn Phe Tyr Gln Gln Gln Pro Ser Asn
305 310 315 320
Ser Asn Pro Ser Pro His His Met Met Val His His Gln Pro Tyr Tyr
325 330 335
Tyr
<210> 233
<211> 344
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 233
Met Ser Thr Asn His Tyr Thr Met Asp Leu Pro Glu Pro Thr Leu Trp
1 5 10 15
Trp Pro His Pro His Gln Gln Gln Leu Thr Leu Ile Asp Pro Asp Pro
20 25 30
Leu Pro Leu Asn Leu Asn Asn Asp Asp Asn Asp Asn Gly Asp Asp Asn
35 40 45
Asp Asn Asp Glu Asn Gln Thr Val Thr Thr Thr Thr Thr Gly Gly Glu
50 55 60
Glu Glu Ile Ile Asn Asn Lys Glu Pro Met Phe Glu Lys Pro Leu Thr
65 70 75 80
Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His
85 90 95
Ala Glu Lys Tyr Phe Pro Leu Ser Gly Gly Asp Ser Gly Ser Ser Glu
100 105 110
Cys Lys Gly Leu Leu Leu Ser Phe Glu Asp Glu Ser Gly Lys Cys Trp
115 120 125
Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr
130 135 140
Lys Gly Trp Ser Arg Tyr Val Lys Asp Lys Arg Leu Asp Ala Gly Asp
145 150 155 160
Val Val Leu Phe Gln Arg His Arg Ala Asp Ala Gln Arg Leu Phe Ile
165 170 175
Gly Trp Arg Arg Arg Arg Gln Ser Asp Ala Leu Pro Pro Pro Ala His
180 185 190
Val Ser Ser Arg Lys Ser Gly Gly Asp Gly Asn Ser Ser Lys Asn Glu
195 200 205
Gly Asp Val Gly Val Gly Trp Thr Arg Gly Phe Tyr Pro Ala His His
210 215 220
Pro Tyr Pro Thr His His His His Pro Ser Pro Tyr His His Gln Gln
225 230 235 240
Asp Asp Ser Leu His Ala Val Arg Gly Ser Gln Gly Gln Asn Gln Arg
245 250 255
Thr Arg Pro Val Gly Asn Ser Ser Ser Ser Ser Ser Ser Ser Ser Arg
260 265 270
Val Leu Arg Leu Phe Gly Val Asn Met Glu Cys Gln Pro Glu His Asp
275 280 285
Asp Ser Gly Pro Ser Thr Pro Gln Cys Ser Tyr Asn Thr Asn Asn Ile
290 295 300
Leu Pro Ser Thr Gln Gly Thr Asp Ile His Ser His Leu Asn Phe Tyr
305 310 315 320
Gln Gln Gln Gln Thr Ser Asn Ser Lys Pro Pro Pro His His Met Met
325 330 335
Ile Arg His Gln Pro Tyr Tyr Tyr
340
<210> 234
<211> 245
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 234
Met Ser Ile Asn Gln Tyr Ser Ser Glu Phe Tyr Tyr His Ser Leu Met
1 5 10 15
Trp Gln Gln Gln Gln Gln His His His Gln Asn Glu Val Val Glu Glu
20 25 30
Lys Glu Ala Leu Phe Glu Lys Pro Leu Thr Pro Ser Asp Val Gly Lys
35 40 45
Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg Tyr Phe Pro
50 55 60
Leu Ala Ala Ala Ala Val Asp Ala Val Glu Lys Gly Leu Leu Leu Cys
65 70 75 80
Phe Glu Asp Glu Glu Gly Lys Pro Trp Arg Phe Arg Tyr Ser Tyr Trp
85 90 95
Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val
100 105 110
Lys Glu Lys Gln Leu Asp Ala Gly Asp Val Val Leu Phe His Arg His
115 120 125
Arg Ala Asp Gly Gly Arg Phe Phe Ile Gly Trp Arg Arg Arg Gly Asp
130 135 140
Ser Ser Ser Ser Ser Asp Ser Tyr Arg Asn Leu Gln Ser Asn Ser Ser
145 150 155 160
Leu Gln Tyr Tyr Pro His Ala Gly Ala Gln Ala Val Glu Asn Gln Arg
165 170 175
Gly Asn Ser Lys Thr Leu Arg Leu Phe Gly Val Asn Met Glu Cys Gln
180 185 190
Ile Asp Ser Asp Trp Ser Glu Pro Ser Thr Pro Asp Gly Phe Thr Thr
195 200 205
Cys Pro Thr Asn His Asp Gln Phe Pro Ile Tyr Pro Glu His Phe Pro
210 215 220
Pro Pro Tyr Tyr Met Asp Val Ser Phe Thr Gly Asp Val His Gln Thr
225 230 235 240
Ser Ser Gln Gln Gly
245
<210> 235
<211> 244
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 235
Met Ser Ile Asn Gln Tyr Ser Ser Asp Phe His Tyr His Ser Leu Met
1 5 10 15
Trp Gln Gln Gln Gln Gln Gln Gln Gln His Gln Asn Asp Val Val Glu
20 25 30
Glu Lys Glu Ala Leu Phe Glu Lys Pro Leu Thr Pro Ser Asp Val Gly
35 40 45
Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg Tyr Phe
50 55 60
Pro Leu Ala Ala Ala Ala Ala Asp Ala Val Glu Lys Gly Leu Leu Leu
65 70 75 80
Cys Phe Glu Asp Glu Glu Gly Lys Pro Trp Arg Phe Arg Tyr Ser Tyr
85 90 95
Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr
100 105 110
Val Lys Glu Lys His Leu Asp Ala Gly Asp Val Val Leu Phe His Arg
115 120 125
His Arg Ser Asp Gly Gly Arg Phe Phe Ile Gly Trp Arg Arg Arg Gly
130 135 140
Asp Ser Ser Ser Ser Ser Asp Ser Tyr Arg His Val Gln Ser Asn Ala
145 150 155 160
Ser Leu Gln Tyr Tyr Pro His Ala Gly Ala Gln Ala Val Glu Ser Gln
165 170 175
Arg Gly Asn Ser Lys Thr Leu Arg Leu Phe Gly Val Asn Met Glu Cys
180 185 190
Gln Leu Asp Ser Asp Trp Ser Glu Pro Ser Thr Pro Asp Gly Ser Asn
195 200 205
Thr Tyr Thr Thr Asn His Asp Gln Phe His Phe Tyr Pro Gln Gln Gln
210 215 220
His Tyr Pro Pro Pro Tyr Tyr Met Asp Ile Ser Phe Thr Gly Asp Met
225 230 235 240
Asn Arg Thr Ser
<210> 236
<211> 248
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 236
Met Ser Ile Asn Gln Tyr Ser Ser Asp Phe Asn Tyr His Ser Leu Met
1 5 10 15
Trp Gln Gln Gln Gln His Arg His His His His Gln Asn Asp Val Ala
20 25 30
Glu Glu Lys Glu Ala Leu Phe Glu Lys Pro Leu Thr Pro Ser Asp Val
35 40 45
Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg Tyr
50 55 60
Phe Pro Leu Ala Ala Ala Ala Ala Asp Ala Met Glu Lys Gly Leu Leu
65 70 75 80
Leu Cys Phe Glu Asp Glu Glu Gly Lys Pro Trp Arg Phe Arg Tyr Ser
85 90 95
Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg
100 105 110
Tyr Val Lys Glu Lys Gln Leu Asp Ala Gly Asp Val Ile Leu Phe His
115 120 125
Arg His Arg Val Asp Gly Gly Arg Phe Phe Ile Gly Trp Arg Arg Arg
130 135 140
Gly Asn Ser Ser Ser Ser Ser Asp Ser Tyr Arg His Leu Gln Ser Asn
145 150 155 160
Ala Ser Leu Gln Tyr Tyr Pro His Ala Gly Val Gln Ala Val Glu Ser
165 170 175
Gln Arg Gly Asn Ser Lys Thr Leu Arg Leu Phe Gly Val Asn Met Glu
180 185 190
Cys Gln Leu Asp Ser Asp Leu Pro Asp Pro Ser Thr Pro Asp Gly Ser
195 200 205
Thr Ile Cys Pro Thr Ser His Asp Gln Phe His Leu Tyr Pro Gln Gln
210 215 220
His Tyr Pro Pro Pro Tyr Tyr Met Asp Ile Ser Phe Thr Gly Asp Val
225 230 235 240
His Gln Thr Arg Ser Pro Gln Gly
245
<210> 237
<211> 267
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 237
Met Ser Val Asn His Tyr His Asn Thr Leu Ser Leu His His His His
1 5 10 15
Gln Asn Asp Val Ala Ile Ala Gln Arg Glu Ser Leu Phe Glu Lys Ser
20 25 30
Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys
35 40 45
Gln His Ala Glu Lys Tyr Phe Pro Leu Asn Asn Asn Asn Asn Asn Gly
50 55 60
Gly Ser Gly Asp Asp Val Ala Thr Thr Glu Lys Gly Met Leu Leu Ser
65 70 75 80
Phe Glu Asp Glu Ser Gly Lys Cys Trp Lys Phe Arg Tyr Ser Tyr Trp
85 90 95
Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val
100 105 110
Lys Asp Lys His Leu Asp Ala Gly Asp Val Val Phe Phe Gln Arg His
115 120 125
Arg Phe Asp Leu His Arg Leu Phe Ile Gly Trp Arg Arg Arg Gly Glu
130 135 140
Ala Ser Ser Ser Pro Ala Val Ser Val Val Ser Gln Glu Ala Leu Val
145 150 155 160
Asn Thr Thr Ala Tyr Trp Ser Gly Leu Thr Thr Pro Tyr Arg Gln Val
165 170 175
His Ala Ser Thr Thr Tyr Pro Asn Ile His Gln Glu Tyr Ser His Tyr
180 185 190
Gly Ala Val Val Asp His Ala Gln Ser Ile Pro Pro Val Val Ala Gly
195 200 205
Ser Ser Arg Thr Val Arg Leu Phe Gly Val Asn Leu Glu Cys His Gly
210 215 220
Asp Ala Val Glu Pro Pro Pro Arg Pro Asp Val Tyr Asn Asp Gln His
225 230 235 240
Ile Tyr Tyr Tyr Ser Thr Pro His Pro Met Asn Ile Ser Phe Ala Gly
245 250 255
Glu Ala Leu Glu Gln Val Gly Asp Gly Arg Gly
260 265
<210> 238
<211> 264
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 238
Met Ser Gly Asn His Tyr Ser Arg Asp Ile His His Asn Thr Pro Ser
1 5 10 15
Val His His His Gln Asn Tyr Ala Val Val Asp Arg Glu Tyr Leu Phe
20 25 30
Glu Lys Ser Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val
35 40 45
Ile Pro Lys Gln His Ala Glu Lys His Phe Pro Leu Asn Asn Ala Gly
50 55 60
Asp Asp Val Ala Ala Ala Glu Thr Thr Glu Lys Gly Met Leu Leu Thr
65 70 75 80
Phe Glu Asp Glu Ser Gly Lys Cys Trp Lys Phe Arg Tyr Ser Tyr Trp
85 90 95
Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val
100 105 110
Lys Asp Lys His Leu His Ala Gly Asp Val Val Phe Phe Gln Arg His
115 120 125
Arg Phe Asp Leu His Arg Val Phe Ile Gly Trp Arg Lys Arg Gly Glu
130 135 140
Val Ser Ser Pro Thr Ala Val Ser Val Val Ser Gln Glu Ala Arg Val
145 150 155 160
Asn Thr Thr Ala Tyr Trp Ser Gly Leu Thr Thr Pro Tyr Arg Gln Val
165 170 175
His Ala Ser Thr Ser Ser Tyr Pro Asn Ile His Gln Glu Tyr Ser His
180 185 190
Tyr Gly Ala Val Ala Glu Ile Pro Thr Val Val Thr Gly Ser Ser Arg
195 200 205
Thr Val Arg Leu Phe Gly Val Asn Leu Glu Cys His Gly Asp Val Val
210 215 220
Glu Thr Pro Pro Cys Pro Asp Gly Tyr Asn Gly Gln His Phe Tyr Tyr
225 230 235 240
Tyr Ser Thr Pro Asp Pro Met Asn Ile Ser Phe Ala Gly Glu Ala Met
245 250 255
Glu Gln Val Gly Asp Gly Arg Arg
260
<210> 239
<211> 258
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 239
Met Ser Val Asn His Tyr Ser Asn Thr Leu Ser Ser His Asn His His
1 5 10 15
Asn Glu His Lys Glu Ser Leu Phe Glu Lys Ser Leu Thr Pro Ser Asp
20 25 30
Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Arg
35 40 45
Tyr Leu Pro Leu Asn Asn Cys Gly Gly Gly Gly Asp Val Thr Ala Glu
50 55 60
Ser Thr Glu Lys Gly Val Leu Leu Ser Phe Glu Asp Glu Ser Gly Lys
65 70 75 80
Ser Trp Lys Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val
85 90 95
Leu Thr Lys Gly Trp Ser Arg Tyr Val Lys Asp Lys His Leu Asn Ala
100 105 110
Gly Asp Val Val Leu Phe Gln Arg His Arg Phe Asp Ile His Arg Leu
115 120 125
Phe Ile Gly Trp Arg Arg Arg Gly Glu Ala Ser Ser Ser Ser Ala Val
130 135 140
Ser Ala Val Thr Gln Asp Pro Arg Ala Asn Thr Thr Ala Tyr Trp Asn
145 150 155 160
Gly Leu Thr Thr Pro Tyr Arg Gln Val His Ala Ser Thr Ser Ser Tyr
165 170 175
Pro Asn Asn Ile His Gln Glu Tyr Ser His Tyr Gly Pro Val Ala Glu
180 185 190
Thr Pro Thr Val Ala Ala Gly Ser Ser Lys Thr Val Arg Leu Phe Gly
195 200 205
Val Asn Leu Glu Cys His Ser Asp Val Val Glu Pro Pro Pro Cys Pro
210 215 220
Asp Ala Tyr Asn Gly Gln His Ile Tyr Tyr Tyr Ser Thr Pro His Pro
225 230 235 240
Met Asn Ile Ser Phe Ala Gly Glu Ala Met Glu Gln Val Gly Asp Gly
245 250 255
Arg Gly
<210> 240
<211> 278
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 240
Met Ser Val Asn His Tyr Ser Thr Asp His His His Thr Leu Leu Trp
1 5 10 15
Gln Gln Gln Gln His Arg His Thr Thr Asp Thr Ser Glu Thr Thr Thr
20 25 30
Thr Ala Thr Trp Leu His Asp Asp Leu Lys Glu Ser Leu Phe Glu Lys
35 40 45
Ser Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro
50 55 60
Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asn Ala Val Leu Val Ser
65 70 75 80
Ser Ala Ala Ala Asp Thr Ser Ser Ser Leu Leu Ser Phe Glu Asp Glu
85 90 95
Ser Gly Lys Ser Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln
100 105 110
Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Phe Val Lys Asp Lys Gln
115 120 125
Leu Asp Pro Gly Asp Val Val Phe Phe Gln Arg His Arg Ser Asp Ser
130 135 140
Arg Arg Leu Phe Ile Gly Trp Arg Arg Arg Gly Gln Gly Ser Ser Ser
145 150 155 160
Ser Val Ala Ala Thr Asn Ser Ala Val Asn Thr Ser Ser Met Gly Ala
165 170 175
Leu Ser Tyr His Gln Ile His Ala Thr Ser Asn Tyr Ser Asn Pro Pro
180 185 190
Ser His Ser Glu Tyr Ser His Tyr Gly Ala Ala Val Ala Thr Ala Ala
195 200 205
Glu Thr His Ser Thr Pro Ser Ser Ser Val Val Gly Ser Ser Arg Thr
210 215 220
Val Arg Leu Phe Gly Val Asn Leu Glu Cys Gln Met Asp Glu Asn Asp
225 230 235 240
Gly Asp Asp Ser Val Ala Val Ala Thr Thr Val Glu Ser Pro Asp Gly
245 250 255
Tyr Tyr Gly Gln Asn Met Tyr Tyr Tyr Tyr Ser His Pro His Asn Met
260 265 270
Val Ile Leu Thr Leu Leu
275
<210> 241
<211> 267
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 241
Met Ser Val Asn His Tyr Ser Thr Asp His His Gln Val His His His
1 5 10 15
His Thr Leu Phe Leu Gln Asn Leu His Thr Thr Asp Thr Ser Glu Pro
20 25 30
Thr Thr Thr Ala Ala Thr Ser Leu Arg Glu Asp Gln Lys Glu Tyr Leu
35 40 45
Phe Glu Lys Ser Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
50 55 60
Val Ile Pro Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asn Thr Ile
65 70 75 80
Ile Ser Asn Asn Ala Glu Glu Lys Gly Met Leu Leu Ser Phe Glu Asp
85 90 95
Glu Ser Gly Lys Cys Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser
100 105 110
Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val Lys Asp Lys
115 120 125
Gln Leu Asp Pro Ala Asp Val Val Phe Phe Gln Arg Gln Arg Ser Asp
130 135 140
Ser Arg Arg Leu Phe Ile Gly Trp Arg Arg Arg Gly Gln Gly Ser Ser
145 150 155 160
Ser Ala Ala Asn Thr Thr Ser Tyr Ser Ser Ser Met Thr Ala Pro Pro
165 170 175
Tyr Ser Asn Tyr Ser Asn Arg Pro Ala His Ser Glu Tyr Ser His Tyr
180 185 190
Gly Ala Ala Val Ala Thr Ala Thr Glu Thr His Phe Ile Pro Ser Ser
195 200 205
Ser Ala Val Gly Ser Ser Arg Thr Val Arg Leu Phe Gly Val Asn Leu
210 215 220
Glu Cys Gln Met Asp Glu Asp Glu Gly Asp Asp Ser Val Ala Thr Ala
225 230 235 240
Ala Ala Ala Glu Cys Pro Arg Gln Asp Ser Tyr Tyr Asp Gln Asn Met
245 250 255
Tyr Asn Tyr Tyr Thr Pro His Ser Ser Ala Ser
260 265
<210> 242
<211> 347
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 242
Met Ile Gly Val Glu Lys Val Thr Ile Cys Met Arg Ile Glu Val Asn
1 5 10 15
Thr Glu Lys Gly Arg Arg Ala Leu Met Asp Cys Trp Gln Ile Ser Gly
20 25 30
Val His Glu Ser Ser Asp Cys Ser Glu Ile Lys Phe Ala Phe Asp Ala
35 40 45
Val Val Lys Arg Ala Arg His Glu Glu Asn Asn Ala Ala Ala Gln Lys
50 55 60
Phe Lys Gly Val Val Ser Gln Gln Asn Gly Asn Trp Gly Ala Gln Ile
65 70 75 80
Tyr Ala His Gln Gln Arg Ile Trp Leu Gly Thr Phe Lys Ser Glu Arg
85 90 95
Glu Ala Ala Met Ala Tyr Asp Ser Ala Ser Ile Lys Leu Arg Ser Gly
100 105 110
Glu Cys His Arg Asn Phe Pro Trp Asn Asp Gln Thr Val Gln Glu Pro
115 120 125
Gln Phe Gln Ser His Tyr Ser Ala Glu Thr Val Leu Asn Met Ile Arg
130 135 140
Asp Gly Thr Tyr Pro Ser Lys Phe Ala Thr Phe Leu Lys Thr Arg Gln
145 150 155 160
Thr Gln Lys Gly Val Ala Lys His Ile Gly Leu Lys Gly Asp Asp Glu
165 170 175
Glu Gln Phe Cys Cys Thr Gln Leu Phe Gln Lys Glu Leu Thr Pro Ser
180 185 190
Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Lys His Ala Val
195 200 205
Ser Tyr Phe Pro Tyr Val Gly Gly Ser Ala Asp Glu Ser Gly Ser Val
210 215 220
Asp Val Glu Ala Val Phe Tyr Asp Lys Leu Met Arg Leu Trp Lys Phe
225 230 235 240
Arg Tyr Cys Tyr Trp Lys Ser Ser Gln Ser Tyr Val Phe Thr Arg Gly
245 250 255
Trp Asn Arg Phe Val Lys Asp Lys Lys Leu Lys Ala Lys Asp Val Ile
260 265 270
Ala Phe Phe Thr Trp Gly Lys Ser Gly Gly Glu Gly Glu Ala Phe Ala
275 280 285
Leu Ile Asp Val Ile Tyr Asn Asn Asn Ala Glu Glu Asp Ser Lys Gly
290 295 300
Asp Thr Lys Gln Val Leu Gly Asn Gln Leu Gln Leu Ala Gly Ser Glu
305 310 315 320
Glu Gly Glu Asp Glu Asp Ala Asn Ile Gly Lys Asp Phe Asn Ala Gln
325 330 335
Lys Gly Leu Arg Leu Phe Gly Val Cys Ile Thr
340 345
<210> 243
<211> 267
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 243
Met Leu Arg Lys His Ile Tyr Pro Asp Glu Leu Ala Gln His Lys Arg
1 5 10 15
Ala Phe Phe Phe Ala Ala Ala Ser Ser Pro Thr Ser Ser Ser Ser Pro
20 25 30
Leu Ala Ser Pro Ala Pro Ser Ala Ala Ala Ala Arg Arg Glu His Leu
35 40 45
Phe Asp Lys Thr Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
50 55 60
Val Ile Pro Lys Gln His Ala Glu Lys His Phe Pro Leu Gln Leu Pro
65 70 75 80
Ser Ala Ser Ala Ala Val Pro Gly Glu Cys Lys Gly Val Leu Leu Asn
85 90 95
Phe Asp Asp Ala Thr Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp
100 105 110
Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Phe Val
115 120 125
Lys Glu Lys Gly Leu His Ala Gly Asp Ala Val Glu Phe Tyr Arg Ala
130 135 140
Ala Ser Gly Asn Asn Gln Leu Phe Ile Asp Cys Lys Leu Arg Ser Lys
145 150 155 160
Ser Thr Thr Thr Thr Thr Ser Val Asn Ser Glu Ala Ala Pro Ser Pro
165 170 175
Ala Pro Val Thr Arg Thr Val Arg Leu Phe Gly Val Asp Leu Leu Ile
180 185 190
Ala Pro Ala Ala Arg His Ala His Glu His Glu Asp Tyr Gly Met Ala
195 200 205
Lys Thr Asn Lys Arg Thr Met Glu Ala Ser Val Ala Ala Pro Thr Pro
210 215 220
Ala His Ala Val Trp Lys Lys Arg Cys Val Asp Phe Ala Leu Thr Tyr
225 230 235 240
Arg Leu Ala Thr Thr Pro Gln Cys Pro Arg Ser Arg Asp Gln Leu Glu
245 250 255
Gly Val Gln Ala Ala Gly Ser Thr Phe Ala Leu
260 265
<210> 244
<211> 393
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 244
Met Asp Ser Ser Ser Cys Leu Val Asp Asp Thr Asn Ser Gly Gly Ser
1 5 10 15
Ser Thr Asp Lys Leu Arg Ala Leu Ala Ala Ala Ala Ala Glu Thr Ala
20 25 30
Pro Leu Glu Arg Met Gly Ser Gly Ala Ser Ala Val Val Asp Ala Ala
35 40 45
Glu Pro Gly Ala Glu Ala Asp Ser Gly Ser Gly Gly Arg Val Cys Gly
50 55 60
Gly Gly Gly Gly Gly Ala Gly Gly Ala Gly Gly Lys Leu Pro Ser Ser
65 70 75 80
Lys Phe Lys Gly Val Val Pro Gln Pro Asn Gly Arg Trp Gly Ala Gln
85 90 95
Ile Tyr Glu Arg His Gln Arg Val Trp Leu Gly Thr Phe Ala Gly Glu
100 105 110
Asp Asp Ala Ala Arg Ala Tyr Asp Val Ala Ala Gln Arg Phe Arg Gly
115 120 125
Arg Asp Ala Val Thr Asn Phe Arg Pro Leu Ala Glu Ala Asp Pro Asp
130 135 140
Ala Ala Ala Glu Leu Arg Phe Leu Ala Thr Arg Ser Lys Ala Glu Val
145 150 155 160
Val Asp Met Leu Arg Lys His Thr Tyr Phe Asp Glu Leu Ala Gln Ser
165 170 175
Lys Arg Thr Phe Ala Ala Ser Thr Pro Ser Ala Ala Thr Thr Thr Ala
180 185 190
Ser Leu Ser Asn Gly His Leu Ser Ser Pro Arg Ser Pro Phe Ala Pro
195 200 205
Ala Ala Ala Arg Asp His Leu Phe Asp Lys Thr Val Thr Pro Ser Asp
210 215 220
Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln His Ala Glu Lys
225 230 235 240
His Phe Pro Leu Gln Leu Pro Ser Ala Gly Gly Glu Ser Lys Gly Val
245 250 255
Leu Leu Asn Phe Glu Asp Ala Ala Gly Lys Val Trp Arg Phe Arg Tyr
260 265 270
Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser
275 280 285
Arg Phe Val Lys Glu Lys Gly Leu His Ala Gly Asp Val Val Gly Phe
290 295 300
Tyr Arg Ser Ala Ala Ser Ala Gly Asp Asp Gly Lys Leu Phe Ile Asp
305 310 315 320
Cys Lys Leu Val Arg Ser Thr Gly Ala Ala Leu Ala Ser Pro Ala Asp
325 330 335
Gln Pro Ala Pro Ser Pro Val Lys Ala Val Arg Leu Phe Gly Val Asp
340 345 350
Leu Leu Thr Ala Pro Ala Pro Val Glu Gln Met Ala Gly Cys Lys Arg
355 360 365
Ala Arg Asp Leu Ala Ala Thr Thr Pro Pro Gln Ala Ala Ala Phe Lys
370 375 380
Lys Gln Cys Ile Glu Leu Ala Leu Val
385 390
<210> 245
<211> 254
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 245
Met Leu Arg Lys His Thr Tyr Phe Asp Glu Leu Ala Gln Ser Lys Arg
1 5 10 15
Ala Phe Ala Ala Ser Ala Ala Leu Ser Ala Pro Thr Thr Ser Gly Asp
20 25 30
Ala Gly Gly Ser Ala Ser Pro Pro Ser Pro Ala Ala Val Arg Glu His
35 40 45
Leu Phe Asp Lys Thr Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg
50 55 60
Leu Val Ile Pro Lys Gln Asn Ala Glu Lys His Phe Pro Leu Gln Leu
65 70 75 80
Pro Ala Gly Gly Gly Glu Ser Lys Gly Leu Leu Leu Asn Phe Glu Asp
85 90 95
Asp Ala Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser
100 105 110
Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys
115 120 125
Gly Leu Gly Ala Gly Asp Val Val Gly Phe Tyr Arg Ser Ala Ala Gly
130 135 140
Arg Thr Gly Glu Asp Ser Lys Phe Phe Ile Asp Cys Arg Leu Arg Pro
145 150 155 160
Asn Thr Asn Thr Ala Ala Glu Ala Asp Pro Val Tyr Gly Asn Asp Thr
165 170 175
Glu Asp Gln Leu Phe Ile Asp Tyr Lys Lys Met Asn Lys Asn Asp Asp
180 185 190
Ala Ala Asp Ala Ala Ile Asp Gln Ser Ser Ala Pro Val Gln Lys Ala
195 200 205
Val Arg Leu Phe Gly Val Asp Leu Leu Ala Ala Pro Glu Gln Gly Met
210 215 220
Pro Gly Gly Cys Lys Arg Ala Arg Asp Leu Val Lys Pro Pro Pro Pro
225 230 235 240
Lys Val Ala Phe Lys Lys Gln Cys Ile Glu Leu Ala Leu Ala
245 250
<210> 246
<211> 357
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 246
Met Gly Val Glu Ile Leu Ser Ser Thr Gly Glu His Ser Ser Gln Tyr
1 5 10 15
Ser Ser Gly Ala Ala Ser Thr Ala Thr Thr Glu Ser Gly Val Gly Gly
20 25 30
Arg Pro Pro Thr Ala Pro Ser Leu Pro Val Ser Ile Ala Asp Glu Ser
35 40 45
Ala Thr Ser Arg Ser Ala Ser Ala Gln Ser Thr Ser Ser Arg Phe Lys
50 55 60
Gly Val Val Pro Gln Pro Asn Gly Arg Trp Gly Ala Gln Ile Tyr Glu
65 70 75 80
Arg His Ala Arg Val Trp Leu Gly Thr Phe Pro Asp Glu Asp Ser Ala
85 90 95
Ala Arg Ala Tyr Asp Val Ala Ala Leu Arg Tyr Arg Gly Arg Glu Ala
100 105 110
Ala Thr Asn Phe Pro Cys Ala Ala Ala Glu Ala Glu Leu Ala Phe Leu
115 120 125
Ala Ala His Ser Lys Ala Glu Ile Val Asp Met Leu Arg Lys His Thr
130 135 140
Tyr Thr Asp Glu Leu Arg Gln Gly Leu Arg Arg Gly Arg Gly Met Gly
145 150 155 160
Ala Arg Ala Gln Pro Thr Pro Ser Trp Ala Arg Glu Pro Leu Phe Glu
165 170 175
Lys Ala Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Val
180 185 190
Pro Lys Gln His Ala Glu Lys His Phe Pro Leu Lys Arg Thr Pro Glu
195 200 205
Thr Thr Thr Thr Thr Gly Lys Gly Val Leu Leu Asn Phe Glu Asp Gly
210 215 220
Glu Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln
225 230 235 240
Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Phe Val Arg Glu Lys Gly
245 250 255
Leu Gly Ala Gly Asp Ser Ile Val Phe Ser Cys Ser Ala Tyr Gly Gln
260 265 270
Glu Lys Gln Phe Phe Ile Asp Cys Lys Lys Asn Lys Thr Met Thr Ser
275 280 285
Cys Pro Ala Asp Asp Arg Gly Ala Ala Thr Ala Ser Pro Pro Val Ser
290 295 300
Glu Pro Thr Lys Gly Glu Gln Val Arg Val Val Arg Leu Phe Gly Val
305 310 315 320
Asp Ile Ala Gly Glu Lys Arg Gly Arg Ala Ala Pro Val Glu Gln Glu
325 330 335
Leu Phe Lys Arg Gln Cys Val Ala His Ser Gln His Ser Pro Ala Leu
340 345 350
Gly Ala Phe Val Leu
355
<210> 247
<211> 362
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 247
Met Gly Met Glu Ile Leu Ser Ser Thr Val Glu His Cys Ser Gln Tyr
1 5 10 15
Ser Ser Ser Ala Ser Thr Ala Thr Thr Glu Ser Gly Ala Ala Gly Arg
20 25 30
Ser Thr Thr Ala Leu Ser Leu Pro Val Ala Ile Thr Asp Glu Ser Val
35 40 45
Thr Ser Arg Ser Ala Ser Ala Gln Pro Ala Ser Ser Arg Phe Lys Gly
50 55 60
Val Val Pro Gln Pro Asn Gly Arg Trp Gly Ser Gln Ile Tyr Glu Arg
65 70 75 80
His Ala Arg Val Trp Leu Gly Thr Phe Pro Asp Gln Asp Ser Ala Ala
85 90 95
Arg Ala Tyr Asp Val Ala Ser Leu Arg Tyr Arg Gly Arg Asp Ala Ala
100 105 110
Thr Asn Phe Pro Cys Ala Ala Ala Glu Ala Glu Leu Ala Phe Leu Thr
115 120 125
Ala His Ser Lys Ala Glu Ile Val Asp Met Leu Arg Lys His Thr Tyr
130 135 140
Ala Asp Glu Leu Arg Gln Gly Leu Arg Arg Gly Arg Gly Met Gly Ala
145 150 155 160
Arg Ala Gln Pro Thr Pro Ser Trp Ala Arg Val Pro Leu Phe Glu Lys
165 170 175
Ala Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Val Pro
180 185 190
Lys Gln His Ala Glu Lys His Phe Pro Leu Lys Cys Thr Ala Glu Thr
195 200 205
Thr Thr Thr Thr Gly Asn Gly Val Leu Leu Asn Phe Glu Asp Gly Glu
210 215 220
Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser
225 230 235 240
Tyr Val Leu Thr Lys Gly Trp Ser Ser Phe Val Arg Glu Lys Gly Leu
245 250 255
Gly Ala Gly Asp Ser Ile Val Phe Ser Ser Ser Ala Tyr Gly Gln Glu
260 265 270
Lys Gln Leu Phe Ile Asn Cys Lys Lys Asn Thr Thr Met Asn Gly Gly
275 280 285
Lys Thr Ala Leu Pro Leu Pro Val Val Glu Thr Ala Lys Gly Glu Gln
290 295 300
Asp His Val Val Lys Leu Phe Gly Val Asp Ile Ala Gly Val Lys Arg
305 310 315 320
Val Arg Ala Ala Thr Gly Glu Leu Gly Pro Pro Glu Leu Phe Lys Arg
325 330 335
Gln Ser Val Ala His Gly Cys Gly Arg Met Asn Tyr Ile Cys Tyr Ser
340 345 350
Ile Gly Thr Ile Gly Pro Leu Met Leu Asn
355 360
<210> 248
<211> 351
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 248
Met Gly Val Glu Ile Leu Ser Ser Met Val Glu Asp Ser Ser Gln Tyr
1 5 10 15
Ser Ser Gly Ala Ser Thr Ala Thr Thr Glu Ser Gly Thr Thr Gly Arg
20 25 30
Ala Leu Thr Ala Leu Ser Leu Pro Val Ala Ile Ala Asp Glu Ser Val
35 40 45
Thr Ser Ala Gln Ser Ala Pro Ser Arg Phe Lys Gly Val Val Pro Gln
50 55 60
Pro Asn Gly Arg Trp Gly Ser Gln Ile Tyr Glu Arg His Ala Arg Val
65 70 75 80
Trp Leu Gly Thr Phe Pro Asp Gln Asp Leu Ala Ala Arg Ala Tyr Asp
85 90 95
Val Ala Ala Leu Arg Tyr Arg Gly Arg Asp Ala Ala Thr Asn Phe Pro
100 105 110
Cys Ala Ala Ala Glu Ala Glu Leu Ala Phe Leu Gly Ala His Ser Lys
115 120 125
Ala Glu Ile Val Asp Met Leu Arg Lys His Thr Tyr Ala Asp Glu Leu
130 135 140
Arg Gln Gly Leu Arg Arg Gly Arg Gly Met Gly Ala Arg Ala Gln Pro
145 150 155 160
Thr Pro Ser Trp Ala Arg Glu Pro Leu Phe Glu Lys Ala Val Thr Pro
165 170 175
Ser Asp Val Gly Lys Leu Asn Arg Leu Val Val Pro Lys Gln His Ala
180 185 190
Glu Lys His Phe Pro Leu Lys Arg Thr Pro Glu Arg Thr Thr Thr Thr
195 200 205
Gly Asn Gly Val Leu Leu Asn Phe Glu Asp Gly Glu Gly Lys Val Trp
210 215 220
Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr
225 230 235 240
Lys Gly Trp Ser Arg Phe Val Arg Glu Lys Gly Leu Ala Ala Gly Asp
245 250 255
Ser Ile Ile Phe Ser Cys Ser Ala Tyr Gly Gln Glu Lys Gln Leu Phe
260 265 270
Ile Asp Cys Lys Lys Asn Thr Thr Val Asn Ser Gly Lys Ser Ala Ser
275 280 285
Pro Leu Pro Val Val Glu Thr Ala Lys Gly Glu Gln Val Arg Val Val
290 295 300
Arg Leu Phe Gly Val Asp Ile Ala Gly Val Lys Arg Gly Arg Ala Ala
305 310 315 320
Thr Ala Glu Gln Gly Pro Pro Glu Leu Leu Lys Arg Gln Cys Val Pro
325 330 335
Leu Pro His Gly Gln Arg Ser Pro Ala Leu Gly Ala Phe Val Leu
340 345 350
<210> 249
<211> 348
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 249
Met Gly Val Glu Ile Leu Ser Ser Met Val Glu His Ser Phe Gln Tyr
1 5 10 15
Ser Ser Gly Ala Ser Ser Ala Thr Ala Glu Ser Gly Ala Val Gly Thr
20 25 30
Pro Pro Arg His Leu Ser Leu Pro Val Ala Ile Ala Asp Glu Ser Leu
35 40 45
Thr Ser Arg Ser Ala Ser Ser Arg Phe Lys Gly Val Val Pro Gln Pro
50 55 60
Asn Gly Arg Trp Gly Ala Gln Ile Tyr Glu Arg His Ala Arg Val Trp
65 70 75 80
Leu Gly Thr Phe Pro Asp Gln Asp Ser Ala Ala Arg Ala Tyr Asp Val
85 90 95
Ala Ser Leu Arg Tyr Arg Gly Gly Asp Ala Ala Phe Asn Phe Pro Cys
100 105 110
Val Val Val Glu Ala Glu Leu Ala Phe Leu Ala Ala His Ser Lys Ala
115 120 125
Glu Ile Val Asp Met Leu Arg Lys Gln Thr Tyr Ala Asp Glu Leu Arg
130 135 140
Gln Gly Leu Arg Arg Gly Arg Gly Met Gly Val Arg Ala Gln Pro Met
145 150 155 160
Pro Ser Trp Ala Arg Val Pro Leu Phe Glu Lys Ala Val Thr Pro Ser
165 170 175
Asp Val Gly Lys Leu Asn Arg Leu Val Val Pro Lys Gln His Ala Glu
180 185 190
Lys His Phe Pro Leu Lys Arg Ser Pro Glu Thr Thr Thr Thr Thr Gly
195 200 205
Asn Gly Val Leu Leu Asn Phe Glu Asp Gly Gln Gly Lys Val Trp Arg
210 215 220
Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys
225 230 235 240
Gly Trp Ser Arg Phe Val Arg Glu Lys Gly Leu Gly Ala Gly Asp Ser
245 250 255
Ile Met Phe Ser Cys Ser Ala Tyr Gly Gln Glu Lys Gln Phe Phe Ile
260 265 270
Asp Cys Lys Lys Asn Thr Thr Val Asn Gly Gly Lys Ser Ala Ser Pro
275 280 285
Leu Gln Val Met Glu Ile Ala Lys Ala Glu Gln Val Arg Val Val Arg
290 295 300
Leu Phe Gly Val Asp Ile Ala Gly Val Lys Arg Glu Arg Ala Ala Thr
305 310 315 320
Ala Glu Gln Gly Pro Gln Gly Trp Phe Lys Arg Gln Cys Met Ala His
325 330 335
Gly Gln His Ser Pro Ala Leu Gly Asp Phe Ala Leu
340 345
<210> 250
<211> 350
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 250
Met Gly Val Glu Ile Leu Ser Ser Met Val Glu His Ser Phe Gln Tyr
1 5 10 15
Ser Ser Gly Val Ser Thr Ala Thr Thr Glu Ser Gly Thr Ala Gly Thr
20 25 30
Pro Pro Arg Pro Leu Ser Leu Pro Val Ala Ile Ala Asp Glu Ser Val
35 40 45
Thr Ser Arg Ser Ala Ser Ser Arg Phe Lys Gly Val Val Pro Gln Pro
50 55 60
Asn Gly Arg Trp Gly Ala Gln Ile Tyr Glu Arg His Ala Arg Val Trp
65 70 75 80
Leu Gly Thr Phe Pro Asp Gln Asp Ser Ala Ala Arg Ala Tyr Asp Val
85 90 95
Ala Ser Leu Arg Tyr Arg Gly Arg Asp Val Ala Phe Asn Phe Pro Cys
100 105 110
Ala Ala Val Glu Gly Glu Leu Ala Phe Leu Ala Ala His Ser Lys Ala
115 120 125
Glu Ile Val Asp Met Leu Arg Lys Gln Thr Tyr Ala Asp Glu Leu Arg
130 135 140
Gln Gly Leu Arg Arg Gly Arg Gly Met Gly Ala Arg Ala Gln Pro Thr
145 150 155 160
Pro Ser Trp Ala Arg Glu Pro Leu Phe Glu Lys Ala Val Thr Pro Ser
165 170 175
Asp Val Gly Lys Leu Asn Arg Leu Val Val Pro Lys Gln His Ala Glu
180 185 190
Lys His Phe Pro Leu Lys Arg Thr Pro Glu Thr Pro Thr Thr Thr Gly
195 200 205
Lys Gly Val Leu Leu Asn Phe Glu Asp Gly Glu Gly Lys Val Trp Arg
210 215 220
Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys
225 230 235 240
Gly Trp Ser Arg Phe Val Arg Glu Lys Gly Leu Gly Ala Gly Asp Ser
245 250 255
Ile Leu Phe Ser Cys Ser Leu Tyr Glu Gln Glu Lys Gln Phe Phe Ile
260 265 270
Asp Cys Lys Lys Asn Thr Ser Met Asn Gly Gly Lys Ser Ala Ser Pro
275 280 285
Leu Pro Val Gly Val Thr Thr Lys Gly Glu Gln Val Arg Val Val Arg
290 295 300
Leu Phe Gly Val Asp Ile Ser Gly Val Lys Arg Gly Arg Ala Ala Thr
305 310 315 320
Ala Thr Ala Glu Gln Gly Leu Gln Glu Leu Phe Lys Arg Gln Cys Val
325 330 335
Ala Pro Gly Gln His Ser Pro Ala Leu Gly Ala Phe Ala Leu
340 345 350
<210> 251
<211> 308
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 251
Met Ala Ser Ser Lys Pro Thr Asn Pro Glu Val Asp Asn Asp Met Glu
1 5 10 15
Cys Ser Ser Pro Glu Ser Gly Ala Glu Asp Ala Val Glu Ser Ser Ser
20 25 30
Pro Val Ala Ala Pro Ser Ser Arg Phe Lys Gly Val Val Pro Gln Pro
35 40 45
Asn Gly Arg Trp Gly Ala Gln Ile Tyr Glu Lys His Ser Arg Val Trp
50 55 60
Leu Gly Thr Phe Gly Asp Glu Glu Ala Ala Ala Cys Ala Tyr Asp Val
65 70 75 80
Ala Ala Leu Arg Phe Arg Gly Arg Asp Ala Val Thr Asn His Gln Arg
85 90 95
Leu Pro Ala Ala Glu Gly Ala Gly Trp Ser Ser Thr Ser Glu Leu Ala
100 105 110
Phe Leu Ala Asp His Ser Lys Ala Glu Ile Val Asp Met Leu Arg Lys
115 120 125
His Thr Tyr Asp Asp Glu Leu Arg Gln Gly Leu Arg Arg Gly His Gly
130 135 140
Arg Ala Gln Pro Thr Pro Ala Trp Ala Arg Glu Phe Leu Phe Glu Lys
145 150 155 160
Ala Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Val Pro
165 170 175
Lys Gln His Ala Glu Lys His Phe Pro Pro Thr Thr Ala Ala Ala Ala
180 185 190
Gly Ser Asp Gly Lys Gly Leu Leu Leu Asn Phe Glu Asp Gly Gln Gly
195 200 205
Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr
210 215 220
Val Leu Thr Lys Gly Trp Ser Arg Phe Val Gln Glu Lys Gly Leu Cys
225 230 235 240
Ala Gly Asp Thr Val Thr Phe Ser Arg Ser Ala Tyr Val Met Asn Asp
245 250 255
Thr Asp Glu Gln Leu Phe Ile Asp Tyr Lys Gln Ser Ser Lys Asn Asp
260 265 270
Glu Ala Ala Asp Val Ala Thr Ala Asp Glu Asn Glu Ala Gly His Val
275 280 285
Ala Val Lys Leu Phe Gly Val Asp Ile Gly Trp Ala Gly Met Ala Gly
290 295 300
Ser Ser Gly Gly
305
<210> 252
<211> 293
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 252
Met Ala Ser Gly Lys Pro Thr Asn His Gly Met Glu Asp Asp Asn Asp
1 5 10 15
Met Glu Tyr Ser Ser Ala Glu Ser Gly Ala Glu Asp Ala Ala Glu Pro
20 25 30
Ser Ser Ser Pro Val Leu Ala Pro Pro Arg Ala Ala Pro Ser Ser Arg
35 40 45
Phe Lys Gly Val Val Pro Gln Pro Asn Gly Arg Trp Gly Ala Gln Ile
50 55 60
Tyr Glu Lys His Ser Arg Val Trp Leu Gly Thr Phe Pro Asp Glu Asp
65 70 75 80
Ala Ala Val Arg Ala Tyr Asp Val Ala Ala Leu Arg Phe Arg Gly Pro
85 90 95
Asp Ala Val Ile Asn His Gln Arg Pro Thr Ala Ala Glu Glu Ala Gly
100 105 110
Ser Ser Ser Ser Arg Ser Glu Leu Asp Pro Glu Leu Gly Phe Leu Ala
115 120 125
Asp His Ser Lys Ala Glu Ile Val Asp Met Leu Arg Lys His Thr Tyr
130 135 140
Asp Asp Glu Leu Arg Gln Gly Leu Arg Arg Gly Arg Gly Arg Ala Gln
145 150 155 160
Pro Thr Pro Ala Trp Ala Arg Glu Leu Leu Phe Glu Lys Ala Val Thr
165 170 175
Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Val Pro Lys Gln Gln
180 185 190
Ala Glu Lys His Phe Pro Pro Thr Thr Ala Ala Ala Thr Gly Ser Asn
195 200 205
Gly Lys Gly Val Leu Leu Asn Phe Glu Asp Gly Glu Gly Lys Val Trp
210 215 220
Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr
225 230 235 240
Lys Gly Trp Ser Arg Phe Val Lys Glu Thr Gly Leu Arg Ala Gly Asp
245 250 255
Thr Val Ala Phe Tyr Arg Ser Ala Ser Asp Glu Asn Glu Thr Gly His
260 265 270
Val Ala Val Lys Leu Phe Gly Val Asp Ile Ala Gly Gly Gly Met Ala
275 280 285
Gly Ser Ser Gly Gly
290
<210> 253
<211> 320
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 253
Met Ala Ser Gly Lys Pro Thr Asn His Gly Met Glu Asp Asp Asn Asp
1 5 10 15
Met Glu Tyr Ser Ser Ala Glu Ser Gly Ala Glu Asp Ala Ala Glu Pro
20 25 30
Ser Ser Ser Pro Val Leu Ala Pro Pro Arg Ala Ala Pro Ser Ser Arg
35 40 45
Phe Lys Gly Val Val Pro Gln Pro Asn Gly Arg Trp Gly Ala Gln Ile
50 55 60
Tyr Glu Lys His Ser Arg Val Trp Leu Gly Thr Phe Pro Asp Glu Asp
65 70 75 80
Ala Ala Ala Arg Ala Tyr Asp Val Ala Ala Leu Arg Phe Arg Gly Pro
85 90 95
Asp Ala Val Ile Asn His Gln Arg Pro Thr Ala Ala Glu Glu Ala Gly
100 105 110
Ser Ser Ser Ser Arg Ser Glu Leu Asp Pro Glu Leu Gly Phe Leu Ala
115 120 125
Asp His Ser Lys Ala Glu Ile Val Asp Met Leu Arg Lys His Thr Tyr
130 135 140
Asp Asp Glu Leu Arg Gln Gly Leu Arg Arg Gly Arg Gly Arg Ala Gln
145 150 155 160
Pro Thr Pro Ala Trp Ala Arg Glu Leu Leu Phe Glu Lys Ala Val Thr
165 170 175
Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Val Pro Lys Gln Gln
180 185 190
Ala Glu Lys His Phe Pro Pro Thr Thr Ala Ala Ala Thr Gly Ser Asn
195 200 205
Gly Lys Gly Val Leu Leu Asn Phe Glu Asp Gly Glu Gly Lys Val Trp
210 215 220
Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr
225 230 235 240
Lys Gly Trp Ser Arg Phe Val Lys Glu Thr Gly Leu Arg Ala Gly Asp
245 250 255
Thr Val Ala Phe Tyr Arg Ser Ala Tyr Gly Asn Asp Thr Glu Asp Gln
260 265 270
Leu Phe Ile Asp Tyr Lys Lys Met Asn Lys Asn Asp Asp Ala Ala Asp
275 280 285
Ala Ala Ile Ser Asp Glu Asn Glu Thr Gly His Val Ala Val Lys Leu
290 295 300
Phe Gly Val Asp Ile Ala Gly Gly Gly Met Ala Gly Ser Ser Gly Gly
305 310 315 320
<210> 254
<211> 350
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 254
Met Val Phe Ser Cys Ile Asp Glu Ser Ser Ser Thr Ser Glu Ser Phe
1 5 10 15
Ser Pro Ala Thr Ala Thr Ala Thr Ala Thr Ala Thr Lys Phe Ser Ala
20 25 30
Pro Pro Leu Pro Pro Leu Arg Leu Asn Arg Met Arg Ser Gly Gly Ser
35 40 45
Asn Val Val Leu Asp Ser Lys Asn Gly Val Asp Ile Asp Ser Arg Lys
50 55 60
Leu Ser Ser Ser Lys Tyr Lys Gly Val Val Pro Gln Pro Asn Gly Arg
65 70 75 80
Trp Gly Ala Gln Ile Tyr Val Lys His Gln Arg Val Trp Leu Gly Thr
85 90 95
Phe Cys Asp Glu Glu Glu Ala Ala His Ser Tyr Asp Ile Ala Ala Arg
100 105 110
Lys Phe Arg Gly Arg Asp Ala Val Val Asn Phe Lys Thr Phe Leu Ala
115 120 125
Ser Glu Asp Asp Asn Gly Glu Leu Cys Phe Leu Glu Ala His Ser Lys
130 135 140
Ala Glu Ile Val Asp Met Leu Arg Lys His Thr Tyr Ala Asp Glu Leu
145 150 155 160
Ala Gln Ser Asn Lys Arg Ser Gly Ala Asn Thr Asn Thr Asn Thr Thr
165 170 175
Gln Ser His Thr Val Ser Arg Thr Arg Glu Val Leu Phe Glu Lys Val
180 185 190
Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys
195 200 205
Gln His Ala Glu Lys Tyr Phe Pro Leu Pro Ser Leu Ser Val Thr Lys
210 215 220
Gly Val Leu Ile Asn Phe Glu Asp Val Thr Gly Lys Val Trp Arg Phe
225 230 235 240
Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly
245 250 255
Trp Ser Arg Phe Val Lys Glu Lys Asn Leu Arg Ala Gly Asp Val Val
260 265 270
Thr Phe Glu Arg Ser Thr Gly Ser Asp Arg Gln Leu Tyr Ile Asp Trp
275 280 285
Lys Ile Arg Ser Gly Pro Ser Lys Asn Pro Val Gln Val Val Val Arg
290 295 300
Leu Phe Gly Val Asp Ile Phe Asn Val Thr Ser Ala Lys Pro Ser Asn
305 310 315 320
Val Val Asp Ala Cys Gly Gly Lys Arg Ser Arg Asp Val Asp Met Phe
325 330 335
Ala Leu Arg Cys Ser Lys Lys His Ala Ile Ile Asn Ala Leu
340 345 350
<210> 255
<211> 351
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 255
Met Asp Gly Gly Cys Val Thr Asp Glu Thr Thr Thr Ser Ser Asp Ser
1 5 10 15
Leu Ser Val Pro Pro Pro Ser Arg Val Gly Ser Val Ala Ser Ala Val
20 25 30
Val Asp Pro Asp Gly Cys Cys Val Ser Gly Glu Ala Glu Ser Arg Lys
35 40 45
Leu Pro Ser Ser Lys Tyr Lys Gly Val Val Pro Gln Pro Asn Gly Arg
50 55 60
Trp Gly Ala Gln Ile Tyr Glu Lys His Gln Arg Val Trp Leu Gly Thr
65 70 75 80
Phe Asn Glu Glu Asp Glu Ala Ala Arg Ala Tyr Asp Ile Ala Ala Leu
85 90 95
Arg Phe Arg Gly Pro Asp Ala Val Thr Asn Phe Lys Pro Pro Ala Ala
100 105 110
Ser Asp Asp Ala Glu Ser Glu Phe Leu Asn Ser His Ser Lys Phe Glu
115 120 125
Ile Val Asp Met Leu Arg Lys His Thr Tyr Asp Asp Glu Leu Gln Gln
130 135 140
Ser Thr Arg Gly Gly Arg Arg Arg Leu Asp Ala Asp Thr Ala Ser Ser
145 150 155 160
Gly Val Phe Asp Ala Lys Ala Arg Glu Gln Leu Phe Glu Lys Thr Val
165 170 175
Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys Gln
180 185 190
His Ala Glu Lys His Phe Pro Leu Ser Gly Ser Gly Asp Glu Ser Ser
195 200 205
Pro Cys Val Ala Gly Ala Ser Ala Ala Lys Gly Met Leu Leu Asn Phe
210 215 220
Glu Asp Val Gly Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn
225 230 235 240
Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Phe Val Lys
245 250 255
Glu Lys Asn Leu Arg Ala Gly Asp Ala Val Gln Phe Phe Lys Ser Thr
260 265 270
Gly Pro Asp Arg Gln Leu Tyr Ile Asp Cys Lys Ala Arg Ser Gly Glu
275 280 285
Val Asn Asn Asn Ala Gly Gly Leu Phe Val Pro Ile Gly Pro Val Val
290 295 300
Glu Pro Val Gln Met Val Arg Leu Phe Gly Val Asn Leu Leu Lys Leu
305 310 315 320
Pro Val Pro Gly Ser Asp Gly Val Gly Lys Arg Lys Glu Met Glu Leu
325 330 335
Phe Ala Phe Glu Cys Cys Lys Lys Leu Lys Val Ile Gly Ala Leu
340 345 350
<210> 256
<211> 362
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 256
Met Asp Gly Gly Ser Val Thr Asp Glu Thr Thr Thr Thr Ser Asn Ser
1 5 10 15
Leu Ser Val Pro Ala Asn Leu Ser Pro Pro Pro Leu Ser Leu Val Gly
20 25 30
Ser Gly Ala Thr Ala Val Val Tyr Pro Asp Gly Cys Cys Val Ser Gly
35 40 45
Glu Ala Glu Ser Arg Lys Leu Pro Ser Ser Lys Tyr Lys Gly Val Val
50 55 60
Pro Gln Pro Asn Gly Arg Trp Gly Ala Gln Ile Tyr Glu Lys His Gln
65 70 75 80
Arg Val Trp Leu Gly Thr Phe Asn Glu Glu Asp Glu Ala Ala Arg Ala
85 90 95
Tyr Asp Ile Ala Ala His Arg Phe Arg Gly Arg Asp Ala Val Thr Asn
100 105 110
Phe Lys Pro Leu Ala Gly Ala Asp Asp Ala Glu Ala Glu Phe Leu Ser
115 120 125
Thr His Ser Lys Ser Glu Ile Val Asp Met Leu Arg Lys His Thr Tyr
130 135 140
Asp Asn Glu Leu Gln Gln Ser Thr Arg Gly Gly Arg Arg Arg Arg Asp
145 150 155 160
Ala Glu Thr Ala Ser Ser Gly Ala Phe Asp Ala Lys Ala Arg Glu Gln
165 170 175
Leu Phe Glu Lys Thr Val Thr Gln Ser Asp Val Gly Lys Leu Asn Arg
180 185 190
Leu Val Ile Pro Lys Gln His Ala Glu Lys His Phe Pro Leu Ser Gly
195 200 205
Ser Gly Gly Gly Ala Leu Pro Cys Met Ala Ala Ala Ala Gly Ala Lys
210 215 220
Gly Met Leu Leu Asn Phe Glu Asp Val Gly Gly Lys Val Trp Arg Phe
225 230 235 240
Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly
245 250 255
Trp Ser Arg Phe Val Lys Glu Lys Asn Leu Arg Ala Gly Asp Ala Val
260 265 270
Gln Phe Phe Lys Ser Thr Gly Leu Asp Arg Gln Leu Tyr Ile Asp Cys
275 280 285
Lys Ala Arg Ser Gly Lys Val Asn Asn Asn Ala Ala Gly Leu Phe Ile
290 295 300
Pro Val Gly Pro Val Val Glu Pro Val Gln Met Val Arg Leu Phe Gly
305 310 315 320
Val Asp Leu Leu Lys Leu Pro Val Pro Gly Ser Asp Gly Ile Gly Val
325 330 335
Gly Cys Asp Gly Lys Arg Lys Glu Met Glu Leu Phe Ala Phe Glu Cys
340 345 350
Ser Lys Lys Leu Lys Val Ile Gly Ala Leu
355 360
<210> 257
<211> 384
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 257
Met Asp Ala Ile Ser Cys Leu Asp Glu Ser Thr Thr Thr Glu Ser Leu
1 5 10 15
Ser Ile Ser Gln Ala Lys Pro Ser Ser Thr Ile Met Ser Ser Glu Lys
20 25 30
Ala Ser Pro Ser Pro Pro Pro Pro Asn Arg Leu Cys Arg Val Gly Ser
35 40 45
Gly Ala Ser Ala Val Val Asp Ser Asp Gly Gly Gly Gly Gly Gly Ser
50 55 60
Thr Glu Val Glu Ser Arg Lys Leu Pro Ser Ser Lys Tyr Lys Gly Val
65 70 75 80
Val Pro Gln Pro Asn Gly Arg Trp Gly Ser Gln Ile Tyr Glu Lys His
85 90 95
Gln Arg Val Trp Leu Gly Thr Phe Asn Glu Glu Asp Glu Ala Ala Arg
100 105 110
Ala Tyr Asp Val Ala Val Gln Arg Phe Arg Gly Lys Asp Ala Val Thr
115 120 125
Asn Phe Lys Pro Leu Ser Gly Thr Asp Asp Asp Asp Gly Glu Ser Glu
130 135 140
Phe Leu Asn Ser His Ser Lys Ser Glu Ile Val Asp Met Leu Arg Lys
145 150 155 160
His Thr Tyr Asn Asp Glu Leu Glu Gln Ser Lys Arg Ser Arg Gly Phe
165 170 175
Val Arg Arg Arg Gly Ser Ala Ala Gly Ala Gly Asn Gly Asn Ser Ile
180 185 190
Ser Gly Ala Cys Val Met Lys Ala Arg Glu Gln Leu Phe Gln Lys Ala
195 200 205
Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val Ile Pro Lys
210 215 220
Gln His Ala Glu Lys His Phe Pro Leu Gln Ser Ala Ala Asn Gly Val
225 230 235 240
Ser Ala Thr Ala Thr Ala Ala Lys Gly Val Leu Leu Asn Phe Glu Asp
245 250 255
Val Gly Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser
260 265 270
Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys
275 280 285
Asn Leu Lys Ala Gly Asp Thr Val Cys Phe Gln Arg Ser Thr Gly Pro
290 295 300
Asp Arg Gln Leu Tyr Ile Asp Trp Lys Thr Arg Asn Val Val Asn Glu
305 310 315 320
Val Ala Leu Phe Gly Pro Val Val Glu Pro Ile Gln Met Val Arg Leu
325 330 335
Phe Gly Val Asn Ile Leu Lys Leu Pro Gly Ser Asp Ser Ile Ala Asn
340 345 350
Asn Asn Asn Ala Ser Gly Cys Cys Asn Gly Lys Arg Arg Glu Met Glu
355 360 365
Leu Phe Ser Leu Glu Cys Ser Lys Lys Pro Lys Ile Ile Gly Ala Leu
370 375 380
<210> 258
<211> 401
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 258
Met Asp Ala Ile Ser Cys Met Asp Glu Ser Thr Thr Thr Glu Ser Leu
1 5 10 15
Ser Ile Ser Leu Ser Pro Thr Ser Ser Ser Glu Lys Ala Lys Pro Ser
20 25 30
Ser Met Ile Thr Ser Ser Glu Lys Val Ser Leu Ser Pro Pro Pro Ser
35 40 45
Asn Arg Leu Cys Arg Val Gly Ser Gly Ala Ser Ala Val Val Asp Pro
50 55 60
Asp Gly Gly Gly Ser Gly Ala Glu Val Glu Ser Arg Lys Leu Pro Ser
65 70 75 80
Ser Lys Tyr Lys Gly Val Val Pro Gln Pro Asn Gly Arg Trp Gly Ala
85 90 95
Gln Ile Tyr Glu Lys His Gln Arg Val Trp Leu Gly Thr Phe Asn Glu
100 105 110
Glu Asp Glu Ala Ala Arg Ala Tyr Asp Ile Ala Ala Gln Arg Phe Arg
115 120 125
Gly Lys Asp Ala Val Thr Asn Phe Lys Pro Leu Ala Gly Ala Asp Asp
130 135 140
Asp Asp Gly Glu Ser Glu Phe Leu Asn Ser His Ser Lys Pro Glu Ile
145 150 155 160
Val Asp Met Leu Arg Lys His Thr Tyr Asn Asp Glu Leu Glu Gln Ser
165 170 175
Lys Arg Ser Arg Gly Val Val Arg Arg Arg Gly Ser Ala Ala Ala Gly
180 185 190
Thr Ala Asn Ser Ile Ser Gly Ala Cys Phe Thr Lys Ala Arg Glu Gln
195 200 205
Leu Phe Glu Lys Ala Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg
210 215 220
Leu Val Ile Pro Lys Gln His Ala Glu Lys His Phe Pro Leu Gln Ser
225 230 235 240
Ser Asn Gly Val Ser Ala Thr Thr Ile Ala Ala Val Thr Ala Thr Pro
245 250 255
Thr Ala Ala Lys Gly Val Leu Leu Asn Phe Glu Asp Val Gly Gly Lys
260 265 270
Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val
275 280 285
Leu Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Asn Leu Lys Ala
290 295 300
Gly Asp Thr Val Cys Phe His Arg Ser Thr Gly Pro Asp Lys Gln Leu
305 310 315 320
Tyr Ile Asp Trp Lys Thr Arg Asn Val Val Asn Asn Glu Val Ala Leu
325 330 335
Phe Gly Pro Val Gly Pro Val Val Glu Pro Ile Gln Met Val Arg Leu
340 345 350
Phe Gly Val Asn Ile Leu Lys Leu Pro Gly Ser Asp Thr Ile Val Gly
355 360 365
Asn Asn Asn Asn Ala Ser Gly Cys Cys Asn Gly Lys Arg Arg Glu Met
370 375 380
Glu Leu Phe Ser Leu Glu Cys Ser Lys Lys Pro Lys Ile Ile Gly Ala
385 390 395 400
Leu
<210> 259
<211> 192
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 259
Met Ala Met His Ala Gly His Ala Trp Trp Gly Val Ala Met Tyr Thr
1 5 10 15
Asn His Tyr His His His Tyr Arg His Lys Thr Ser Asp Val Gly Lys
20 25 30
Asn Arg Val Lys His Ala Arg Tyr Gly Gly Gly Asp Ser Gly Lys Gly
35 40 45
Ser Asp Ser Gly Lys Trp Arg Arg Tyr Ser Tyr Trp Thr Ser Ser Ser
50 55 60
Tyr Val Thr Lys Gly Trp Ser Arg Tyr Val Lys Lys Arg Asp Ala Gly
65 70 75 80
Asp Val Val His Arg Val Arg Gly Gly Ala Ala Asp Arg Gly Cys Arg
85 90 95
Arg Arg Gly Ser Ala Ala Ala Val Arg Val Thr Ala Asn Gly Gly Trp
100 105 110
Ser Met Cys Tyr Ser Thr Ser Gly Ser Ser Tyr Asp Thr Ser Ala Asn
115 120 125
Ser Tyr Ala Tyr His Arg Ser Val Asp Asp His Ser Asp His Ala Gly
130 135 140
Ser Arg Ala Asp Ala Lys Ser Ser Ser Ala Ala Ser Ala Ser Arg Arg
145 150 155 160
Arg Gly Val Asn Asp Cys Gly Ala Asp Ala Thr Ala Met Tyr Gly Tyr
165 170 175
Met His His Ser Tyr Ala Ala Val Ser Thr Val Asn Tyr Trp Ser Val
180 185 190
<210> 260
<211> 116
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 260
Phe Glu Lys Ser Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
1 5 10 15
Val Ile Pro Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asn Asn Asn
20 25 30
Asn Asn Asn Gly Gly Ser Gly Asp Asp Val Ala Thr Thr Glu Lys Gly
35 40 45
Met Leu Leu Ser Phe Glu Asp Glu Ser Gly Lys Cys Trp Lys Phe Arg
50 55 60
Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp
65 70 75 80
Ser Arg Tyr Val Lys Asp Lys His Leu Asp Ala Gly Asp Val Val Phe
85 90 95
Phe Gln Arg His Arg Phe Asp Leu His Arg Leu Phe Ile Gly Trp Arg
100 105 110
Arg Arg Gly Glu
115
<210> 261
<211> 114
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 261
Phe Glu Lys Ser Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
1 5 10 15
Val Ile Pro Lys Gln His Ala Glu Arg Tyr Leu Pro Leu Asn Asn Cys
20 25 30
Gly Gly Gly Gly Asp Val Thr Ala Glu Ser Thr Glu Lys Gly Val Leu
35 40 45
Leu Ser Phe Glu Asp Glu Ser Gly Lys Ser Trp Lys Phe Arg Tyr Ser
50 55 60
Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg
65 70 75 80
Tyr Val Lys Asp Lys His Leu Asn Ala Gly Asp Val Val Leu Phe Gln
85 90 95
Arg His Arg Phe Asp Ile His Arg Leu Phe Ile Gly Trp Arg Arg Arg
100 105 110
Gly Glu
<210> 262
<211> 106
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 262
Phe Glu Lys Pro Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
1 5 10 15
Val Ile Pro Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Ser Gly Asp
20 25 30
Ser Gly Gly Ser Glu Cys Lys Gly Leu Leu Leu Ser Phe Glu Asp Glu
35 40 45
Ser Gly Lys Cys Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln
50 55 60
Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val Lys Asp Lys Arg
65 70 75 80
Leu Asp Ala Gly Asp Val Val Leu Phe Glu Arg His Arg Val Asp Ala
85 90 95
Gln Arg Leu Phe Ile Gly Trp Arg Arg Arg
100 105
<210> 263
<211> 107
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 263
Phe Glu Lys Pro Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
1 5 10 15
Val Ile Pro Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Ser Gly Gly
20 25 30
Asp Ser Gly Ser Ser Glu Cys Lys Gly Leu Leu Leu Ser Phe Glu Asp
35 40 45
Glu Ser Gly Lys Cys Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser
50 55 60
Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val Lys Asp Lys
65 70 75 80
Arg Leu Asp Ala Gly Asp Val Val Leu Phe Gln Arg His Arg Ala Asp
85 90 95
Ala Gln Arg Leu Phe Ile Gly Trp Arg Arg Arg
100 105
<210> 264
<211> 107
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 264
Phe Glu Lys Pro Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
1 5 10 15
Val Ile Pro Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asp Ser Ser
20 25 30
Gly Gly Asp Ser Ala Ala Ala Lys Gly Leu Leu Leu Ser Phe Glu Asp
35 40 45
Glu Ser Gly Lys Cys Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser
50 55 60
Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Tyr Val Lys Asp Lys
65 70 75 80
Arg Leu His Ala Gly Asp Val Val Leu Phe His Arg His Arg Ala His
85 90 95
Pro Gln Arg Phe Phe Ile Ser Cys Thr Arg His
100 105
<210> 265
<211> 108
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 265
Phe Glu Lys Pro Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
1 5 10 15
Val Ile Pro Lys Gln His Ala Glu Arg Tyr Phe Pro Leu Gly Gly Gly
20 25 30
Asp Ser Gly Glu Lys Gly Leu Leu Leu Ser Phe Glu Asp Glu Ser Gly
35 40 45
Lys Pro Trp Arg Phe Arg Tyr Ser Tyr Trp Thr Ser Ser Gln Ser Tyr
50 55 60
Val Leu Thr Lys Gly Trp Ser Arg Tyr Val Lys Glu Lys Arg Leu Asp
65 70 75 80
Ala Gly Asp Val Val His Phe Glu Arg Val Arg Gly Leu Gly Ala Ala
85 90 95
Asp Arg Leu Phe Ile Gly Cys Arg Arg Arg Gly Glu
100 105
<210> 266
<211> 115
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 266
Phe Glu Lys Ser Leu Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
1 5 10 15
Val Ile Pro Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asn Ala Val
20 25 30
Leu Val Ser Ser Ala Ala Ala Asp Thr Ser Ser Ser Glu Lys Gly Met
35 40 45
Leu Leu Ser Phe Glu Asp Glu Ser Gly Lys Ser Trp Arg Phe Arg Tyr
50 55 60
Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser
65 70 75 80
Arg Phe Val Lys Asp Lys Gln Leu Asp Pro Gly Asp Val Val Phe Phe
85 90 95
Gln Arg His Arg Ser Asp Ser Arg Arg Leu Phe Ile Gly Trp Arg Arg
100 105 110
Arg Gly Gln
115
<210> 267
<211> 107
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 267
Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
1 5 10 15
Val Ile Pro Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asp Ala Ala
20 25 30
Ala Asn Glu Lys Gly Leu Leu Leu Ser Phe Glu Asp Arg Gly Gly Lys
35 40 45
Leu Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val
50 55 60
Met Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu Asp Ala
65 70 75 80
Gly Asp Thr Val Ser Phe Cys Arg Gly Ala Ala Asp Ala Thr Arg Asp
85 90 95
Arg Leu Phe Ile Asp Trp Lys Arg Arg Val Glu
100 105
<210> 268
<211> 105
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 268
Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
1 5 10 15
Val Ile Pro Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asp Ala Ala
20 25 30
Ala Asn Glu Lys Gly Leu Leu Leu Ser Phe Glu Asp Arg Ala Gly Lys
35 40 45
Leu Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val
50 55 60
Met Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu Asp Ala
65 70 75 80
Gly Asp Thr Val Ser Phe Cys Arg Gly Ala Ala Asp Ala Ala Arg Asp
85 90 95
Arg Leu Phe Ile Asp Trp Arg Lys Arg
100 105
<210> 269
<211> 107
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 269
Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
1 5 10 15
Val Ile Pro Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asp Ala Ala
20 25 30
Ala Asn Glu Lys Gly Gln Leu Leu Ser Phe Glu Asp Arg Ala Gly Lys
35 40 45
Leu Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val
50 55 60
Met Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu Asp Ala
65 70 75 80
Gly Asp Thr Val Ser Phe Cys Arg Gly Ala Gly Asp Thr Ala Arg Asp
85 90 95
Arg Leu Phe Ile Asp Trp Lys Arg Arg Ala Asp
100 105
<210> 270
<211> 107
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 270
Phe Asp Lys Val Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
1 5 10 15
Val Ile Pro Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Asp Ala Ser
20 25 30
Ser Thr Asp Lys Gly Leu Leu Leu Ser Phe Glu Asp Arg Ala Gly Lys
35 40 45
Pro Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr Val
50 55 60
Met Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Arg Leu Asp Ala
65 70 75 80
Gly Asp Thr Val Ser Phe Gly Arg Gly Val Gly Glu Ala Ala Arg Gly
85 90 95
Arg Leu Phe Ile Asp Trp Arg Arg Arg Pro Asp
100 105
<210> 271
<211> 104
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 271
Phe Glu Lys Ala Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
1 5 10 15
Val Ile Pro Lys Gln His Ala Glu Lys Tyr Phe Pro Leu Gln Ser Gly
20 25 30
Ser Ala Ser Ser Lys Gly Val Leu Leu Asn Phe Glu Asp Val Thr Gly
35 40 45
Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gln Ser Tyr
50 55 60
Val Leu Ile Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Asn Leu Lys
65 70 75 80
Ala Gly Asp Ile Val Ser Phe Gln Arg Ser Thr Gly Thr Glu Lys Gln
85 90 95
Leu Tyr Ile Asp Trp Lys Ala Arg
100
<210> 272
<211> 102
<212> PRT
<213> Artificial Sequence
<220>
<223> alignment
<400> 272
Phe Glu Lys Ala Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu
1 5 10 15
Val Val Pro Lys Gln His Ala Glu Lys His Phe Pro Leu Lys Arg Thr
20 25 30
Pro Glu Thr Pro Thr Thr Thr Gly Lys Gly Val Leu Leu Asn Phe Glu
35 40 45
Asp Gly Glu Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser
50 55 60
Ser Gln Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg Phe Val Arg Glu
65 70 75 80
Lys Gly Leu Gly Ala Gly Asp Ser Ile Leu Phe Ser Cys Ser Leu Tyr
85 90 95
Glu Gln Glu Lys Gln Phe
100

Claims (38)

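1. A plant, wherein the plant does not produce a functional NGAL2 polypeptide, or does not produce functional NGAL2 and NGAL3 polypeptides.
2. The plant of claim 1, wherein the expression of a nucleic acid sequence encoding the NGAL2 polypeptide, or the activity of the NGAL2 polypeptide, is reduced or abolished.
3. The plant of claim 1 or 2, wherein the expression of a nucleic acid sequence encoding the NGAL3 polypeptide, or the activity of the NGAL3 polypeptide, is reduced or abolished.
4. The plant of any one of the preceding claims, wherein the NGAL2 polypeptide comprises or consists of the sequence of SEQ ID NO:3, or a functional variant or homologue thereof.
5. The plant of any one of the preceding claims, wherein the SOD7 nucleic acid sequence encoding the NGAL2 polypeptide comprises or consists of the nucleic acid sequence of SEQ ID NO:1 or 2, or a functional variant or homologue thereof.
6. The plant of claim 5, wherein the functional variant or homologue comprises or consists of the sequence of SEQ ID NO:6 or 7.
7. The plant of any one of the preceding claims, wherein the NGAL3 polypeptide comprises or consists of the sequence of SEQ ID NO:5, or a functional variant or homologue thereof.
8. The plant of any one of the preceding claims, wherein the NGAL3 nucleic acid sequence encoding the NGAL3 polypeptide comprises or consists of the nucleic acid sequence of SEQ ID NO:4, or a functional variant or homologue thereof.
9. The plant of claim 8, wherein the functional variant or homologue comprises or consists of the sequence of SEQ ID NO:6 or 7.
10. The plant of any one of the preceding claims, wherein the endogenous SOD7 nucleic acid sequence or its promoter carries a functional mutation.
11. The plant of any one of claims 1-9, wherein the plant comprises an RNA interference construct that reduces the expression of SOD7, or a functional variant or homologue thereof.
12. The plant of any one of the preceding claims, wherein the endogenous NGAL3 nucleic acid sequence or its promoter carries a functional mutation.
13. The plant of any one of claims 2-11, wherein the plant comprises an RNA interference construct that reduces the expression of NGAL3, or a functional variant or homologue thereof.
14. The plant of any one of the preceding claims, wherein the plant is a crop plant.
15. The plant of any one of the preceding claims, wherein the plant is a monocotyledonous or dicotyledonous plant.
16. The plant of any one of the preceding claims, wherein the plant is selected from maize, rice, wheat, oilseed rape, sorghum, soybean, sunflower, alfalfa, potato, tomato, tobacco, grape, barley, pea, bean, broad bean, lettuce, cotton, sugar cane, sugar beet, cauliflower or other vegetable brassicas, or poplar.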
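17. A method of altering the phenotype of a plant, comprising, relative to a control plant, reducing or abolishing the expression of a nucleic acid sequence encoding an NGAL2 polypeptide, or reducing or abolishing the activity of NGAL2, or reducing or abolishing the expression of nucleic acid sequences encoding NGAL2 and NGAL3 polypeptides, or reducing or abolishing the activity of NGAL2 and NGAL3 polypeptides.
18. A method of producing a plant with an altered phenotype, comprising, relative to a control plant, reducing or abolishing the expression of a nucleic acid sequence encoding an NGAL2 polypeptide, or reducing or abolishing the activity of NGAL2, or reducing or abolishing the expression of nucleic acid sequences encoding NGAL2 and NGAL3 polypeptides, or reducing or abolishing the activity of NGAL2 and NGAL3 polypeptides.
19. The method of claim 17 or 18, comprising reducing or abolishing the expression of a nucleic acid sequence encoding an NGAL2 polypeptide and reducing or abolishing the expression of a nucleic acid sequence encoding an NGAL3 polypeptide.
20. The method of claim 17 or 18, comprising reducing or abolishing the activity of an NGAL2 polypeptide and reducing or abolishing the activity of an NGAL3 polypeptide.
21. The method of claim 17 or 18, comprising reducing or abolishing the expression of a nucleic acid sequence encoding an NGAL2 polypeptide and reducing or abolishing the activity of an NGAL3 polypeptide.
22. The method of claim 17 or 18, comprising reducing or abolishing the expression of a nucleic acid sequence encoding an NGAL3 polypeptide and reducing or abolishing the activity of an NGAL2 polypeptide.
23. The method of any one of claims 17-22, wherein the NGAL2 polypeptide comprises or consists of the sequence of SEQ ID NO:1 or 2, or a functional variant or homologue thereof.
24. The method of any one of claims 17-22, wherein the SOD7 nucleic acid sequence encoding the NGAL2 polypeptide comprises or consists of the nucleic acid sequence of SEQ ID NO:1 or 2, or a functional variant or homologue thereof.
25. The method of claim 24, wherein the functional variant or homologue comprises a nucleic acid sequence shown in SEQ ID NO:49-145.
26. The method of any one of claims 17-25, wherein the NGAL3 polypeptide comprises or consists of the sequence of SEQ ID NO:5, or a functional variant or homologue thereof.
27. The method of any one of claims 17-26, wherein the NGAL3 nucleic acid sequence encoding the NGAL3 polypeptide comprises or consists of the nucleic acid sequence of SEQ ID NO:4, or a functional variant or homologue thereof.
28. The method of claim 27, wherein the functional variant or homologue comprises or consists of a nucleic acid sequence of SEQ ID NO:49-145.
29. The method of any one of claims 17-28, wherein the method comprises introducing a functional mutation into a nucleic acid sequence encoding an NGAL2 and/or NGAL3 protein or peptide in the plant, or into its corresponding promoter.
30. The method of claim 29, wherein the mutation is introduced using T-DNA insertion, chemical mutagenesis or genome editing.
31. The method of claim 30, comprising using TILLING.
32. The method of any one of claims 17-28, comprising silencing the SOD7 and/or NGAL3 nucleic acid sequence.
33. The method of claim 32, comprising introducing into the plant an RNAi, shRNA, snRNA, dsRNA, siRNA, miRNA, ta-siRNA or co-suppression molecule that targets the SOD7 or NGAL3 nucleic acid sequence.
34. The method of any one of claims 17-33, wherein the phenotype is characterised by increased seed size relative to a control plant.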
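35. A plant obtained or obtainable by the method of any one of claims 17-34.
36. An isolated nucleic acid comprising or consisting of the sequence of SEQ ID NO:1 or 2, or a functional variant or homologue thereof.
37. A vector comprising the isolated nucleic acid of claim 36.
38. A silencing nucleic acid construct that targets a sequence comprising or consisting of SEQ ID NO:1, 2 or 3, or a functional variant, part or homologue thereof.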
CN201680020523.XA 2015-02-03 2016-02-03 Plants with increased seed size Pending CN108012523A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN2015072143 2015-02-03
CNPCT/CN2015/072143 2015-02-03
PCT/GB2016/050245 WO2016124918A1 (en) 2015-02-03 2016-02-03 Plants with increased seed size

Publications (1)

Publication Number Publication Date
CN108012523A (zh) 2018-05-08

Family

ID=55353239

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201680020523.XA Pending CN108012523A (zh) 2015-02-03 2016-02-03 Plants with increased seed size

Country Status (3)

Country Link
US (2) US10793868B2 (zh)
CN (1) CN108012523A (zh)
WO (1) WO2016124918A1 (zh)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
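CN110607308A (zh) * 2019-09-01 2019-12-24 Tianjin University Sedum lineare drought-resistance gene SlERF and application thereof
CN111172170A (zh) * 2019-09-01 2020-05-19 Tianjin University Sedum lineare drought-resistance gene SlAP2 and application thereof
CN112063626A (zh) * 2019-06-10 2020-12-11 China Agricultural University Maize gene ZmRAVL1, functional site and use thereof
CN115232823A (zh) * 2022-05-19 2022-10-25 South China Agricultural University Gene related to mushroom-leaf development in Chinese kale and application thereof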

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016124918A1 (en) 2015-02-03 2016-08-11 The Institute Of Genetics And Developmental Biology Plants with increased seed size
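CN106520782A (zh) * 2016-11-20 2017-03-22 Northeast Agricultural University Application of GmRAV1, a gene related to soybean photoperiod regulation
CN109136218B (zh) * 2018-08-28 2021-05-11 Dalian Minzu University Method for preparing the iku2 gene of Paeonia rockii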

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090144849A1 (en) * 2002-02-11 2009-06-04 Lutfiyya Linda L Nucleic acid molecules and other molecules associated with transcription in plants
CN102245769A (zh) * 2008-10-16 2011-11-16 RIKEN Transgenic plants with increased seed size

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8440A (en) 1851-10-21 Improvement in the tops of cans or canisters
US432A (en) 1837-10-20 Improvement in gun-carriages
DE69425903T2 (de) 1993-12-09 2001-02-15 Thomas Jefferson University Ph Verbindungen und verfahren zur ortsspezifischen mutation in eukaryotischen zellen
GB9703146D0 (en) 1997-02-14 1997-04-02 Innes John Centre Innov Ltd Methods and means for gene silencing in transgenic plants
US6555732B1 (en) 1998-09-14 2003-04-29 Pioneer Hi-Bred International, Inc. Rac-like genes and methods of use
US7446241B2 (en) * 2002-07-30 2008-11-04 Texas Tech University Transcription factors, DNA and methods for introduction of value-added seed traits and stress tolerance
EP3249046B1 (en) 2002-09-18 2020-07-08 Mendel Biotechnology, Inc. Polynucleotides and polypeptides in plants
TR201815882T4 (tr) 2009-12-10 2018-11-21 Univ Iowa State Res Found Inc Tal efektörü aracılı dna modifikasyonu.
US8697359B1 (en) 2012-12-12 2014-04-15 The Broad Institute, Inc. CRISPR-Cas systems and methods for altering expression of gene products
WO2016124918A1 (en) 2015-02-03 2016-08-11 The Institute Of Genetics And Developmental Biology Plants with increased seed size

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
CARABELLI, M. et al.: "Arabidopsis thaliana mRNA for putative auxin response factor 32 (arf32 gene) GenBank: AJ441076.1", 《GenBank》 *
JULIA ENGELHORN et al.: "Development-related PcG target in the apex 4 controls leaf margin architecture in Arabidopsis thaliana", 《DEVELOPMENT》 *
LIN, X. et al.: "Arabidopsis thaliana chromosome III BAC F24K9 genomic sequence, complete sequence GenBank: AC008153.3", 《GenBank》 *
LUZ IRINA A. CALDERON-VILLALOBOS et al.: "LucTrap Vectors Are Tools to Generate Luciferase Fusions for the Quantification of Transcript and Protein Abundance in Vivo", 《PLANT PHYSIOLOGY》 *
TOWN, C.D. et al.: "Arabidopsis thaliana DNA-binding protein, putative (At3g11580) mRNA, complete cds NCBI Reference Sequence: NM_111991.2", 《GenBank》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
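CN112063626A (zh) * 2019-06-10 2020-12-11 China Agricultural University Maize gene ZmRAVL1, functional site and use thereof
CN110607308A (zh) * 2019-09-01 2019-12-24 Tianjin University Sedum lineare drought-resistance gene SlERF and application thereof
CN111172170A (zh) * 2019-09-01 2020-05-19 Tianjin University Sedum lineare drought-resistance gene SlAP2 and application thereof
CN115232823A (zh) * 2022-05-19 2022-10-25 South China Agricultural University Gene related to mushroom-leaf development in Chinese kale and application thereof
CN115232823B (zh) * 2022-05-19 2023-09-08 South China Agricultural University Gene related to mushroom-leaf development in Chinese kale and application thereof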

Also Published As

Publication number Publication date
US10793868B2 (en) 2020-10-06
US20180265882A1 (en) 2018-09-20
US20200354735A1 (en) 2020-11-12
WO2016124918A1 (en) 2016-08-11

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20180508
