CN112239756B - 一组来源于植物的胞嘧啶脱氨酶和其在碱基编辑系统中的应用 - Google Patents

一组来源于植物的胞嘧啶脱氨酶和其在碱基编辑系统中的应用 Download PDF

Info

Publication number
CN112239756B
CN112239756B CN201910584598.6A CN201910584598A CN112239756B CN 112239756 B CN112239756 B CN 112239756B CN 201910584598 A CN201910584598 A CN 201910584598A CN 112239756 B CN112239756 B CN 112239756B
Authority
CN
China
Prior art keywords
ala
leu
ser
gly
val
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910584598.6A
Other languages
English (en)
Other versions
CN112239756A (zh
Inventor
张蕾
李相敢
侯丽敏
祁幼林
王身昌
黎跃进
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Keji Dalong Beijing Biotechnology Co ltd
Original Assignee
Keji Dalong Beijing Biotechnology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Keji Dalong Beijing Biotechnology Co ltd filed Critical Keji Dalong Beijing Biotechnology Co ltd
Priority to CN201910584598.6A priority Critical patent/CN112239756B/zh
Publication of CN112239756A publication Critical patent/CN112239756A/zh
Application granted granted Critical
Publication of CN112239756B publication Critical patent/CN112239756B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/78Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8201Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
    • C12N15/8213Targeted insertion of genes into the plant genome by homologous recombination
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y305/00Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5)
    • C12Y305/04Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5) in cyclic amidines (3.5.4)
    • C12Y305/04001Cytosine deaminase (3.5.4.1)

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Zoology (AREA)
  • Biomedical Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Medicinal Chemistry (AREA)
  • Cell Biology (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Breeding Of Plants And Reproduction By Means Of Culturing (AREA)

Abstract

本发明公开了一组来源于植物的胞嘧啶脱氨酶和其在碱基编辑系统的方法及应用。实验证明,植物来源的胞嘧啶脱氨酶可以成功的对受体基因进行碱基编辑,对于基因编辑在植物中的应用具有重要的价值。

Description

一组来源于植物的胞嘧啶脱氨酶和其在碱基编辑系统中的 应用
技术领域
本发明属于基因的应用技术领域,具体涉及在生物技术领域中,在碱基编辑系统中用来源于植物的胞嘧啶脱氨酶替代非植物源的胞嘧啶脱氨酶,对植物进行的基因编辑的应用。
背景技术
基因编辑技术作为近年来飞速发展的生物技术之一,被广泛用于动物,植物和微生物中,针对该技术原理的探索或者应用的扩展也在积极的开展中。基因编辑技术就是通过对细胞基因组中目的基因的一段核苷酸序列甚至是单个核苷酸进行替换,切除,增加或者是插入外源的DNA序列,使之产生可遗传的改变。[1]- [3] 虽然各种基因编辑技术的原理及作用方式并不相同,但它们的共同之处是基因编辑都建立在使目标基因DNA产生双链断裂(Double Strand Breaking,DSB)的基础上。因为DNA分子单链断裂或缺失后容易被细胞内的各种修复机制所修复而不产生任何改变 [4] ,但DNA双链断裂的结果则有很大的不同。细胞内DNA双链断裂的修复有两种方式,即同源重组修复(Homologous Recombination,HR)和非同源末端链接修复(Non-Homologous End Join, NHEJ) 。[5] -[7]
对于这两种修复方式,同源重组的效率很低,而非同源末端链接修复会造成基因的功能的缺失或者影响基因的正常功能。与此同时,人类的遗传疾病或者植物的一些缺陷是由碱基突变引起的,所以开发出精确的编辑工具对碱基突变的修复有着重要的应用价值。从2016年到2017年,哈佛大学的David R Liu研究团队主要开发了两种碱基基因编辑工具(BE系统)。第一种工具可以将C-G碱基对转变成T-A碱基对,第一代的单碱基编辑工具在发表的文章中称为BE1(Cytidine Base Editing),之后在提高编辑效率,扩大BE可以编辑的基因组位点方面又有持续的研究发展,BE的后续版本分别称为BE1,BE2和BE3;第二种工具可以将A-T碱基对转变成为G-C碱基对,称为ABE(Adenine Base Editing)。
BE系统实现C->T碱基替换的原理是通过胞嘧啶脱氨酶可以直接实现对单个胞嘧啶(Cytosine,C)碱基的编辑,突变为胸腺嘧啶(Thymine,T),其中尿嘧啶糖苷酶抑制剂(Uracil DNA glycosylase inhibitor, UGI)可以提高突变效率。具体的替换过程为是gRNA引导Cas9与胞嘧啶脱氨酶的融合蛋白结合在目标DNA上,产生单链断裂,胞嘧啶脱氨酶会将靶点上的C脱氨变成U,尿嘧啶糖苷酶抑制剂(uracil DNA glycosylase inhibitor,UGI)能够抑制尿嘧啶DNA糖苷酶(uracil DNA glycosylase, UDG)清除DNA中的U,之后U:G配对将其更多的修复为U:A,通过DNA的复制,U:A实现T:A配对,从而实现C到T的精确替换。在此过程中胞嘧啶脱氨酶是实现碱基替换的关键,在David Liu开发的各个BE版本中,胞嘧啶脱氨酶始终用的是来源于小鼠的胞嘧啶脱氨酶(rAPOBEC1),其他的研究人员也在持续优化改良碱基基因编辑工具,而后期陆续发现的也可用在碱基编辑工具中的胞嘧啶脱氨酶有PmCDA1/APOBEC3A等,但是它们都来源于动物或者微生物。[8] [9] 至今尚未发现一个来源于植物的胞嘧啶脱氨酶可以成功的用于碱基基因编辑。
在对植物进行基因编辑的研发过程中,寻找一个从植物来源并可以用于碱基基因编辑的胞嘧啶脱氨酶基因就尤为重要。关于植物中的胞嘧啶脱氨酶的研究有很多,但是从未有研究植物来源的胞嘧啶脱氨酶在基因编辑技术中的应用,国内外也未有任何关于植物来源的胞嘧啶脱氨酶在基因编辑技术中应用的报导。而在对胞嘧啶脱氨酶(APOBEC)家族中的研究发现,APOBEC家族中的每个蛋白的功能十分的多样化,虽然可能在蛋白质结构上可以将这些蛋白归于一类,但是对于每种APOBEC蛋白来说,其功能和组织特异性表达都有着很大的差异。 [10] [11] 本发明的意义在于找到了来自于植物源的胞嘧啶脱氨酶,并且在碱基基因编辑的系统中对水稻成功的实现了碱基C到T的替换。
发明内容
本发明的目的在于提供一个或者多个可用于碱基基因编辑的来源于植物的胞嘧啶脱氨酶的基因序列。在碱基编辑过程中,使用上述序列之一,可以对目标核苷酸的碱基进行编辑,使其发生C到T的替换。
本发明的一个目的在于提供与基因序列表序列1到序列46中的任一的基因的不低于90%的基因相似性的基因,其来源于植物,包括但不限于拟南芥,水稻,大豆,玉米,木薯,亚麻,大叶杨,苜蓿,菜豆,苹果,棉花,高粱,花生,马铃薯,黄瓜,番茄等等。在碱基编辑过程中,使用上述序列之一,可以对目标核苷酸的碱基进行编辑,使其发生C到T的替换。
本发明的另一个目的在于提供一个或者多个的基因序列,上述基因序列选自基因序列表中序列1到序列46, 上述基因都来自于植物,其来源分别为拟南芥,水稻,大豆,玉米,木薯,亚麻,大叶杨,苜蓿,菜豆,苹果,棉花,高粱,花生,马铃薯,黄瓜,番茄。在碱基编辑过程中,使用上述序列之一,可以对目标核苷酸的碱基进行编辑,使其发生C到T的替换。
本发明的另一个目的在于提供一种可用于植物的碱基基因编辑的方法,具体可为方法G1,采用该方法,在不引入非植物源的胞嘧啶脱氨酶的情况下,可以对目标核苷酸的碱基进行编辑,使其发生C到T的转变。具体的步骤包括:在受体中表达sgRNA,Cas9蛋白和植物来源的胞嘧啶脱氨酶,从而所述受体基因中的目标核苷酸的碱基进行编辑,发生从C到T的替换。
本发明的另一个目的在于提供一种用于植物的碱基基因编辑的方法,具体可为方法G2,采用该方法,在不引入非植物源的胞嘧啶脱氨酶的情况下,可以对目标核苷酸的碱基进行编辑,使其发生C到T的转变。具体的步骤包括:在受体中表达sgRNA,Cas9蛋白,植物来源的胞嘧啶脱氨酶和尿嘧啶糖苷酶抑制剂,从而所述受体基因中的目标核苷酸的碱基进行编辑,发生从C到T的替换。
所述方法G1中,“在受体中表达sgRNA,Cas9蛋白和植物来源的胞嘧啶脱氨酶”可通过将所述sgRNA的编码基因,Cas9蛋白的编码基因和所述的植物来源的胞嘧啶脱氨酶的编码基因导入受体实现。具体的,可以放在一个载体中导入受体,也可以放在不同载体中分别导入受体。
所述方法G2中,“在受体中表达sgRNA,Cas9蛋白,植物来源的胞嘧啶脱氨酶和尿嘧啶糖苷酶抑制剂”可通过将所述sgRNA的编码基因,Cas9蛋白的编码基因,所述的植物来源的胞嘧啶脱氨酶的编码基因和尿嘧啶糖苷酶抑制剂的编码基因导入受体实现。具体的,可以放在一个载体中导入受体,也可以放在不同载体中分别导入受体。
上述任一所述的方法中,还可在受体中表达抗潮霉素蛋白(用于筛选)。所述“在受体中表达抗潮霉素蛋白”可通过将所述抗潮霉素蛋白的编码基因导入受体实现。具体的,可以跟上述元件放在一个载体中导入受体,也可以放在不同的载体中分别导入受体。
上述任一所述的方法中,所述受体可为n1)或n2)或n3)或n4):n1)植物;n2)单子叶植物或双子叶植物;n3)禾本科植物;n4)水稻。
上述任一所述的方法中,所述受体可为植物愈伤组织。
上述任一所述的方法中,“导入受体”可经过侵染,共培养,筛选等步骤,得到编辑后的植物愈伤组织。
上述任一所述的方法中,所述sgRNA可根据受体基因组中预期进行碱基编辑的靶基因设计。
上述任一所述的方法中,所述Cas9蛋白可为SpCas9n蛋白。SpCas9n蛋白的氨基酸序列可如序列表中基因序列47 自5’末端起第3061至7161位所示的核苷酸序列翻译而成。编码所述的SpCas9n蛋白的核苷酸序列可如序列表中基因序列47自5’末端起第3061至7161位所示。
上述任一所述的方法中,所述尿嘧啶糖苷酶抑制剂的氨基酸序列可如序列表中基因序列47自5’末端起第2743至2991位所示的核苷酸序列翻译而成。编码所述的尿嘧啶糖苷酶抑制剂的核苷酸序列可如序列表中基因序列47自5’末端起第2743至2991位所示。
上述任一所述的方法中,所述抗潮霉素蛋白可为潮霉素磷酸转移酶。潮霉素磷酸转移酶的氨基酸序列可由序列表中基因序列47自5’末端起第318至1343位所示的核苷酸序列翻译而成。编码所述的潮霉素磷酸转移酶的核苷酸序列可如序列表中基因序列47自5’末端起第318至1343位所示。
上述任一所述的方法中,所述植物来源的胞嘧啶脱氨酶可为基因序列表中序列1到序列46的任一基因或者与基因序列表中序列1 到序列46的任一基因有至少90%的相似性的基因。
上述任一所述的方法中,所述植物来源的胞嘧啶脱氨酶可来源于拟南芥,其氨基酸序列可为基因序列表中序列1的所示。编码所述的来源于拟南芥的胞嘧啶脱氨酶的核苷酸序列可如序列表中序列47第7216至8115位所示。
上述任一所述的方法中,所述植物来源的胞嘧啶脱氨酶可来源于水稻,其氨基酸序列可为基因序列表中序列2的所示。编码所述的来源于水稻的胞嘧啶脱氨酶的核苷酸序列可如序列表中序列 48 第7216至8148 位所示。
本发明的另一个目的在于提供一个可用于植物的碱基编辑载体,其特征在于:包括基因序列表中序列1 到序列46的任一基因或者与基因序列表中序列1 到序列46的任一基因有至少90%的相似性的基因序列的编码基因的重组载体。
本发明的另一个目的在于提供一个用于植物的碱基编辑载体,其特征在于:包括sgRNA的编码基因, Cas9蛋白的编码基因,和基因序列表中序列1 到序列46的任一基因或者与基因序列表中序列1 到序列46的任一基因有至少90%的相似性的基因序列的编码基因的重组载体。
本发明的另一个目的在于提供一个可用于植物的碱基编辑载体,其特征在于:包括sgRNA的编码基因, Cas9蛋白的编码基因,基因序列表中序列1 到序列46的任一或者与基因序列表中序列1 到序列46的任一基因有至少90%的相似性的基因序列的编码基因和尿嘧啶糖苷酶抑制剂的编码基因的重组载体。
本发明的另一个目的在于提供任一上述胞嘧啶脱氨酶基因在x1)或x2)或x3)或x4)中的碱基基因编辑的应用: x1)植物;x2)单子叶植物或双子叶植物;x3)禾本科植物;x4)水稻。
在本发明的一个实施例中,本发明的发明人构建了载体nCas9&AtCDA1&UGI。该载体中的靶基因名称,对应的靶点名称和靶点序列如表1所示。该载体的核苷酸序列如序列表中序列47所示。上述序列中,第93至277位为PolyA终止子的核苷酸序列,第318至1343位为潮霉素磷酸转移酶的核苷酸序列,第1378至2157位为35S启动子的核苷酸序列,第2452至2706位为NOS终止子的核苷酸序列,第2743至2991位为UGI的核苷酸序列,第3061至7161 位为SpCas9n的核苷酸序列(不含有终止密码子),第7216至8115 位为AtCDA1的核苷酸序列(不含有终止密码子),第8140至10111位为玉米Ubiquitin启动子的核苷酸序列,第10138至10220位为sgRNA骨架的核苷酸序列,第10221至10240位为靶点ALS4的核苷酸序列,第10241至10623位为OsU3启动子的核苷酸序列。将该载体导入农杆菌EHA105,得到重组农杆菌,之后制备农杆菌侵染液,将制备好的水稻愈伤浸泡置于农杆菌侵染液中,经过暗培养,恢复培养,再生培养,最后得到水稻E0苗,经过筛选,成功得到编辑后的水稻阳性E0苗。
在本发明的一个实施例中,本发明的发明人构建了载体nCas9&OsAPOBEC1&UGI。该载体中的靶基因名称,对应的靶点名称和靶点序列如表1所示。该载体的核苷酸序列如序列表中序列48所示。上述序列中,第93至277位为PolyA终止子的核苷酸序列,第318至1343位为潮霉素的核苷酸序列,第1378至2157位为35S启动子的核苷酸序列,第2452至2706位为NOS终止子的核苷酸序列,第2743至2991位为UGI的核苷酸序列,第3061至7161 位为SpCas9n的核苷酸序列(不含有终止密码子),第7216至8148 位为OsAPOBEC的核苷酸序列(不含有终止密码子),第8173至10144位为玉米Ubiquitin启动子的核苷酸序列,第10171至10253位为sgRNA骨架的核苷酸序列,第10254至10273位为靶点ALS4的核苷酸序列,第10274至10656位为OsU3启动子的核苷酸序列。将该载体导入农杆菌EHA105,得到重组农杆菌,之后制备农杆菌侵染液,将制备好的水稻愈伤浸泡置于农杆菌侵染液中,经过暗培养,恢复培养,再生培养,最后得到水稻E0苗,经过筛选,成功得到编辑后的水稻阳性E0苗。
上述实施例中,使用植物来源的胞嘧啶脱氨酶成功的完成了碱基编辑,证明植物来源的胞嘧啶脱氨酶确实可以代替非植物源的胞嘧啶脱氨酶在碱基编辑中的应用,本发明具有重要的应用价值。
表一为具体实验中所涉及的靶基因名称,对应的靶点名称和靶点序列。
表二为实验1&2的基因编辑结果的总结表。
具体实施方式
以下的实施例只是便于更好地理解本发明,但并不会限定本发明。
下面结合具体实施方式对本发明进行进一步的详细描述。下述实施例中的实验方法,如无特殊说明,均为常规方法。下述实施例中所用的材料、试剂、仪器等,如无特殊说明,均为自常规生化试剂商店购买得到的。下述实施例中,如无特殊说明,序列表中各核苷酸序列第1位均为5’端核苷酸,末位均为3’端核苷酸。
引物对由引物F026:5’- GGTCATCACCAACCACCTCTTC -3’和引物R026:5’CCACCACCGACATAGAGAATCG -3’组成,用于扩增靶点ALS4。
以下实施例中,C到T的碱基替换是指靶点序列中第6位到第10位的C突变为T。
C->T碱基替换效率=发生C->T碱基替换的阳性抗性E0苗数/分析的总阳性抗性E0苗数×100%。
水稻南梗46:公众可以从中国农科院水稻所获得。
共培培养基:含有2mg/L 2,4-二氯苯氧乙酸的NB固体培养基。
恢复培养基:含有200mg/L特美汀的NB固体培养基。
潮霉素筛选培养基:含有30-50mg/L潮霉素的NB固体培养基。
氯磺隆筛选培养基:含有0.5-20mg/L氯磺隆的NB固体培养基。
分化培养基:含有0.5g/L谷氨酰胺、0.5g/L脯氨酸、2mg/L卡那霉素和0.2mg/Lα-萘乙酸的NB固体培养基。
生根培养基:含有0.5g/L谷氨酰胺、0.5g/L脯氨酸、0.2mg/Lα-萘乙酸的MS固体培养基。
下列实施例选择表1中所示的靶点名称进行实验,靶点序列见表1中第3列,对应的靶基因名称见表1中第1列。
表1:
靶基因名称 靶点名称 靶点序列
ALS ALS4 CAGGTCCCCCGCCGCATGAT
实施例1:用来源于拟南芥的胞嘧啶脱氨酶在水稻中进行碱基基因编辑。
1.1.基因编辑载体的构建
本发明的发明人构建了载体nCas9&AtCDA1&UGI。其中AtCDA1是来源于拟南芥的胞嘧啶脱氨酶,其氨基酸序列为序列表中的序列1 所示。
本例子使用的载体nCas9&AtCDA1&UGI为环状质粒,其核苷酸序列如序列表中序列47所示。上述序列中,第93至277位为PolyA终止子的核苷酸序列,第318至1343位为潮霉素磷酸转移酶的核苷酸序列,第1378至2157位为35S启动子的核苷酸序列,第2452至2706位为NOS终止子的核苷酸序列,第2743至2991位为UGI的核苷酸序列,第3061至7161 位为SpCas9n的核苷酸序列(不含有终止密码子),第7216至8115 位为AtCDA1的核苷酸序列(不含有终止密码子),第8140至10111位为玉米Ubiquitin启动子的核苷酸序列,第10138至10220位为sgRNA骨架的核苷酸序列,第10221至10240位为靶点ALS4的核苷酸序列,第10241至10623位为OsU3启动子的核苷酸序列。
1.2 在水稻愈伤中进行基因编辑
1、将载体nCas9&AtCDA1&UGI导入农杆菌EHA105,得到重组农杆菌。
2、在转化实验之前1-5天,在含50mg/L 卡那霉素+15mg/L 利富平的YEB固体培养基上接种重组农杆菌,用3M 胶带封皿;28 ℃,倒置,暗培养 1-5天。
3、用接菌环刮取“Z”字形尾部的菌体,温和悬于100μM 含2mg/L 2,4-二氯苯氧乙酸的NB液体培养基中,使菌液浓度达到OD600=0.1,制成农杆菌侵染液备用。
4、将稻谷去掉种皮后置于三角瓶中,无菌水洗3次,加入75 %酒精浸没种子,轻轻晃动1 min,倒掉酒精。加入2.5 % 氯酸钠水溶液,150-170 rpm振荡25 min。加入无菌水轻摇振荡冲洗5-8次,倒净水。将种子接种于愈伤诱导培养基上,于30℃暗培养4-6周左右,得到水稻愈伤。
5、将步骤4得到的水稻愈伤浸泡置于农杆菌侵染液中浸泡10min,然后,放在铺有两层灭菌滤纸的共培培养基上,22℃暗培养3天。
6、取步骤5得到的水稻愈伤放入恢复培养基上,30℃培养4-7天。
7、取步骤6得到的水稻愈伤,置于潮霉素筛选培养基,30℃培养2周。
8、取步骤7得到的水稻愈伤,置于氯磺隆筛选培养基,30℃培养2周。
9、将旺盛生长的愈伤组织转移到再生培养基上,在30°C培养20-30天。植株将从淡黄色或绿色的愈伤组织上产生。
10、挑选健壮的大于3cm高的再生绿色幼苗,转移至生根培养基上,培养7-14天后,得到水稻E0苗。
1.3 水稻E0苗的检测及分析
11、提取上述步骤10得到的水稻E0苗的基因组DNA并以其作为模板,采用引物F022(GTGAGCGTGGTGGTAGTTGTTG)和R022(ATGAAGAGGATTGAGGAGGGTATC)组成的引物对进行 PCR扩增,得到PCR扩增产物;将该PCR扩增产物进行琼脂糖凝胶电泳,然后进行如下判断:如果PCR扩增产物中含有约615bp的DNA片段,则相应的抗性愈伤为水稻阳性E0苗;如果PCR扩增产物中不含有约615bp的DNA片段,则相应的抗性愈伤不为水稻阳性E0苗。
12、取步骤11所获得的水稻阳性E0苗的基因组DNA作为模板,采用引物F026(GGTCATCACCAACCACCTCTTC)和R026(CCACCACCGACATAGAGAATCG)组成的引物对进行 PCR扩增,得到PCR扩增产物。
13、将步骤12得到的PCR扩增产物进行Sanger测序和分析。测序结果只针对靶点区进行分析。
1.4 统计结果
统计ALS4发生C->T碱基替换的水稻阳性E0苗数, 计算得出C->T碱基替换效率且同时统计突变基因型及对应水稻E0苗数,结果见表2。所产生的突变类型主要为靶点内单个C和三个C的突变。结果表明该碱基编辑系统的活性窗口主要为靶点序列内4-9bp,编辑效率最高的C主要发生在C7或C7附近位置。 nCas9&AtCDA1&UGI碱基编辑系统在水稻植株中实现了对ALS基因的编辑,实现了约36.4%的碱基编辑,证实了植物来源的胞嘧啶脱氨酶确实可以实现碱基编辑的功能。
实施例2:用来源于水稻的胞嘧啶脱氨酶APOBEC1在水稻中进行碱基基因编辑。
2.1 基因编辑载体的构建
本发明的发明人构建了载体nCas9&APOBEC1&UGI。其中APOBEC1是来源于水稻的胞嘧啶脱氨酶,其氨基酸序列为序列表中的序列2所示。
本例子使用的载体nCas9&APOBEC1&UGI为环状质粒,其核苷酸序列如序列表中序列48所示。上述序列中,第93至277位为PolyA终止子的核苷酸序列,第318至1343位为潮霉素的核苷酸序列,第1378至2157位为35S启动子的核苷酸序列,第2452至2706位为NOS终止子的核苷酸序列,第2743至2991位为UGI的核苷酸序列,第3061至7161 位为SpCas9n的核苷酸序列(不含有终止密码子),第7216至8148 位为OsAPOBEC的核苷酸序列(不含有终止密码子),第8173至10144位为玉米Ubiquitin启动子的核苷酸序列,第10171至10253位为sgRNA骨架的核苷酸序列,第10254至10273位为靶点ALS4的核苷酸序列,第10274至10656位为OsU3启动子的核苷酸序列。
2.2 在水稻愈伤中进行基因编辑
1、将载体nCas9&APOBEC1&UGI导入农杆菌EHA105,得到重组农杆菌。
2、在转化实验之前1-5天,在含50mg/L 卡那霉素+15mg/L 利富平的YEB固体培养基上接种重组农杆菌,用3M 胶带封皿;28 ℃,倒置,暗培养 1-5天。
3、用接菌环刮取“Z”字形尾部的菌体,温和悬于100μM 含2mg/L 2,4-二氯苯氧乙酸的NB液体培养基中,使菌液浓度达到OD600=0.1,制成农杆菌侵染液备用。
4、将稻谷去掉种皮后置于三角瓶中,无菌水洗3次,加入75 %酒精浸没种子,轻轻晃动1 min,倒掉酒精。加入2.5 % 氯酸钠水溶液,150-170 rpm振荡25 min。加入无菌水轻摇振荡冲洗5-8次,倒净水。将种子接种于愈伤诱导培养基上,于30℃暗培养4-6周左右,得到水稻愈伤。
5、将步骤4得到的水稻愈伤浸泡置于农杆菌侵染液中浸泡10min,然后,放在铺有两层灭菌滤纸的共培培养基上,22℃暗培养3天。
6、取步骤5得到的水稻愈伤放入恢复培养基上,30℃培养4-7天。
7、取步骤6得到的水稻愈伤,置于潮霉素筛选培养基,30℃培养2周。
8、取步骤7得到的水稻愈伤,置于氯磺隆筛选培养基,30℃培养2周。
9、将旺盛生长的愈伤组织转移到再生培养基上,在30°C培养20-30天。植株将从淡黄色或绿色的愈伤组织上产生。
10、挑选健壮的大于3cm高的再生绿色幼苗,转移至生根培养基上,培养7-14天后,得到水稻E0苗。
2.3水稻E0苗的检测及分析
11、提取步骤10得到的水稻E0苗的基因组DNA并以其作为模板,采用引物F022(GTGAGCGTGGTGGTAGTTGTTG)和R022(ATGAAGAGGATTGAGGAGGGTATC)组成的引物对进行 PCR扩增,得到PCR扩增产物;将该PCR扩增产物进行琼脂糖凝胶电泳,然后进行如下判断:如果PCR扩增产物中含有约615bp的DNA片段,则相应的抗性愈伤为水稻阳性E0苗;如果PCR扩增产物中不含有约615bp的DNA片段,则相应的抗性愈伤不为水稻阳性E0苗。
12、取步骤11所获得的水稻阳性E0苗的基因组DNA作为模板,采用引物F026(GGTCATCACCAACCACCTCTTC)和R026(CCACCACCGACATAGAGAATCG)组成的引物对进行 PCR扩增,得到PCR扩增产物。
13、将步骤12得到的PCR扩增产物进行Sanger测序和分析。测序结果只针对靶点区进行分析。
2.4统计结果
统计ALS4发生C->T碱基替换的水稻阳性E0苗数, 计算得出C->T碱基替换效率且同时统计突变基因型及对应水稻E0苗数,结果见表2。所产生的突变类型主要为靶点内单个C和三个C的突变。结果表明该碱基编辑系统的活性窗口主要为靶点序列内4-9bp,编辑效率最高的C主要发生在C7或C7附近位置。nCas9&OsAPOBEC1&UGI碱基编辑系统在水稻植株中实现了对ALS基因的编辑,实现了约10%的碱基编辑,证实了植物来源的胞嘧啶脱氨酶确实可以实现碱基编辑的功能。
表2 碱基编辑的统计结果
重组表达载体 靶点名称 发生C->T碱基替换的阳性E0苗数 总阳性E0苗数 C->T碱基替换效率(%) 突变基因型及对应E0苗数
nCas9&AtCDA1&UGI ALS4 4 11 36.4 C6C7C8->T6T7T8(2); C7->T7(1);C9->T9(1)
nCas9&OsAPOBEC1&UGI ALS4 1 10 10 C7->T7(1)
注:表2中,“突变基因型及对应E0苗数”列中,括号内数字表示E0苗数,C和T后面的数字表示发生C->T碱基替换的C所处的核苷酸位数(自靶点中的非PAM序列一端(即5′端)的第一位核苷酸开始计数),“->”表示替换为。
参考文献
1. Esvelt, K.M. and Wang, H.H. (2013) Genome-scale engineering forsystems and synthetic biology. Molecular Systems Biology, 9, 641. http://dx.doi.org/10.1038/msb.2012.66
2. Puchta, H. and Fauser, F. (2013) Gene targeting in plants: 25years later. The International Journal of Developmental Biology, 57, 629-637.http://dx.doi.org/10.1387/ijdb.130194hp
3. Tan, W.S., Carlson, D.F., Walton, M.W., Fahrenkrug, S.C. andHackett, P.B. (2012) Precision editing of large animal genomes. Advances inGenetics, 80, 37-97. http://dx.doi.org/10.1016/B978-0-12-404742-6.00002-8
4. Dianov, G.L. and Hubscher, U. (2013) Mammalian base excisionrepair: The forgotten archangel. Nucleic Acids Research, 41, 3483-3490.http://dx.doi.org/10.1093/nar/gkt076
5. Händel, E.M. and Cathomen, T. (2011) Zinc-finger nuclease basedge-nome surgery: It’s all about specificity. Current Gene Therapy, 11, 28-37.http://dx.doi.org/10.2174/156652311794520120
6. Jinek, M., Chylinski, K., Fonfara, I., Hauer, M., Doudna, J.A. andCharpentier, E. (2012) A programmable dual- RNA-guided DNA endonuclease inadaptive bacterial immunity. Science, 337, 816-821. http://dx.doi.org/10.1126/science.1225829
7. Yuchang Zhu, Xiaojiang Zhang and Yibing Hu. (2015) Methods,Principles and Application of Gene Editing. 05(03):32-41
8. Satomi Banno, Keiji Nishida, Takayuki Arazoe, Hitoshi Mitsunobuand Akihiko Kondo. (2018) Deaminase-mediated multiplex genome editing inEscherichia coli. Nature microbiology. https://doi.org/10.1038/s41564-017-0102-6
9. XiaoWang,Jianan Li, Ying Wang, Bei Yang, Jia Wei, Jing Wu, RuixuanWang, Xingxu Huang, Jia Chen and Li Yang. (2018) Efficient base editing inmethylated regions with a human APOBEC3A-Cas9 fusion. Nature Biotechnology;http://dx.doi.org/10.1038/nbt.4198
10. Jason D. Salter, Ryan P. Bennett and Harold C. Smith. The APOBECProtein Family: United by Struture, Divergent in Function. Trends BiochemSci. 2016 Jul; 41(7): 578-594.
11. Harold C. Smith, Ryan P. Bennett, Ayse Kizilyer, William M.McDougall and Kimberly M. Prohaska. Functions and Regulation of the APOBECFamily of Proteins (2012) Semin Cell Dev Biol. 2012 May; 23(3):258-268.
序列表
<110> 科稷达隆生物技术有限公司
<120> 一组来源于植物的胞嘧啶脱氨酶和其在碱基编辑系统中的应用
<130> reference
<160> 48
<170> SIPOSequenceListing 1.0
<210> 1
<211> 301
<212> PRT
<213> Arabidopsis thaliana
<400> 1
Met Asp Lys Pro Ser Phe Val Ile Gln Ser Lys Glu Ala Glu Ser Ala
1 5 10 15
Ala Lys Gln Leu Gly Val Ser Val Ile Gln Leu Leu Pro Ser Leu Val
20 25 30
Lys Pro Ala Gln Ser Tyr Ala Arg Thr Pro Ile Ser Lys Phe Asn Val
35 40 45
Ala Val Val Gly Leu Gly Ser Ser Gly Arg Ile Phe Leu Gly Val Asn
50 55 60
Val Glu Phe Pro Asn Leu Pro Leu His His Ser Ile His Ala Glu Gln
65 70 75 80
Phe Leu Val Thr Asn Leu Thr Leu Asn Gly Glu Arg His Leu Asn Phe
85 90 95
Phe Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Leu Gln
100 105 110
Glu Ile Arg Asp Ala Pro Glu Ile Lys Ile Leu Ile Thr Asp Pro Asn
115 120 125
Asn Ser Ala Asp Ser Asp Ser Ala Ala Asp Ser Asp Gly Phe Leu Arg
130 135 140
Leu Gly Ser Phe Leu Pro His Arg Phe Gly Pro Asp Asp Leu Leu Gly
145 150 155 160
Lys Asp His Pro Leu Leu Leu Glu Ser His Asp Asn His Leu Lys Ile
165 170 175
Ser Asp Leu Asp Ser Ile Cys Asn Gly Asn Thr Asp Ser Ser Ala Asp
180 185 190
Leu Lys Gln Thr Ala Leu Ala Ala Ala Asn Arg Ser Tyr Ala Pro Tyr
195 200 205
Ser Leu Cys Pro Ser Gly Val Ser Leu Val Asp Cys Asp Gly Lys Val
210 215 220
Tyr Arg Gly Trp Tyr Met Glu Ser Ala Ala Tyr Asn Pro Ser Met Gly
225 230 235 240
Pro Val Gln Ala Ala Leu Val Asp Tyr Val Ala Asn Gly Gly Gly Gly
245 250 255
Gly Tyr Glu Arg Ile Val Gly Ala Val Leu Val Glu Lys Glu Asp Ala
260 265 270
Val Val Arg Gln Glu His Thr Ala Arg Leu Leu Leu Glu Thr Ile Ser
275 280 285
Pro Lys Cys Glu Phe Lys Val Phe His Cys Tyr Glu Ala
290 295 300
<210> 2
<211> 312
<212> PRT
<213> Oryza sativa
<400> 2
Met Gly Glu Glu Lys Val Ala Pro Lys Ser Asp Ala Ala Pro Ala Val
1 5 10 15
Glu Leu Leu Gly Phe Val Met Ser Ala Glu Glu Ala Glu Arg Ala Ala
20 25 30
Ala Ala Ala Gly Val Glu Thr Val Glu Asp Leu Leu Pro Leu Leu Val
35 40 45
Pro Ser Ala Met Arg Arg Ala Arg Ala Pro Ile Ser Arg Phe Pro Val
50 55 60
Gly Ala Val Gly Leu Gly Ala Ser Gly Arg Val Tyr Ala Gly Val Asn
65 70 75 80
Leu Glu Phe Arg Gly Leu Pro Leu Ser His Ser Val His Ala Glu Gln
85 90 95
Phe Leu Val Val Asn Ala Ala Ala Ala Gly Glu Ser Glu Leu Arg Ala
100 105 110
Val Ala Val Ser His Met Pro Cys Gly His Cys Arg Gln Phe Leu Gln
115 120 125
Glu Ile Arg Gly Ala Gly Gly Ile Arg Ile Ile Val Thr Ser Asp Ala
130 135 140
Glu Asp Gly Cys Ala Pro Glu Trp Arg Thr Val Ala Ser Leu Leu Pro
145 150 155 160
Arg Pro Phe Gly Pro His Asp Leu Leu Pro Lys His Val Pro Leu Val
165 170 175
Leu Glu Pro His Asp Asn Pro Leu Gly Glu Pro Ala Ala Val Ala Asn
180 185 190
Gly Phe Ala His Gly Asp Leu Glu Ala Arg Leu Arg Glu Ala Ala Glu
195 200 205
Ala Ala Ala Arg Ala Ala His Ala Pro Tyr Ser Glu Cys Pro Ser Gly
210 215 220
Phe Ala Val Ala Asp Gly Glu Gly Lys Val Tyr Ala Gly Gly Cys Leu
225 230 235 240
Glu Ser Ala Ala Tyr Asn Pro Thr Leu Gly Pro Val Gln Ala Ala Ile
245 250 255
Ile Gly Met Val Ala Ala Gly Gly Gly Ala Ala Gly Asp Val Val Ala
260 265 270
Ala Ala Leu Val Glu Lys Glu Ala Gly Leu Val Ser Gln Glu Ala Thr
275 280 285
Ala Arg Ile Phe Leu Ala Ala Val Ala Pro Gln Ala Thr Phe His Val
290 295 300
Tyr Asn Tyr Thr Pro Ser Asp Ala
305 310
<210> 3
<211> 186
<212> PRT
<213> Oryza sativa
<400> 3
Met Glu Glu Ala Gln Phe Val Glu Ser Lys Asp Gly Thr Ile Ser Val
1 5 10 15
Ala Ser Ala Phe Ala Gly His His Glu Ala Val Gln Asp Arg Asp His
20 25 30
Lys Phe Leu Thr Lys Ala Val Glu Glu Ala Tyr Arg Gly Val Asp Cys
35 40 45
Gly Asp Gly Gly Pro Phe Gly Ala Val Val Val Arg Asn Asp Glu Val
50 55 60
Val Val Ser Cys His Asn Met Val Leu Lys His Thr Asp Pro Thr Ala
65 70 75 80
His Ala Glu Val Thr Ala Ile Arg Glu Ala Cys Lys Lys Leu Gly Lys
85 90 95
Ile Glu Leu Ser Asp Cys Glu Ile Tyr Ala Ser Cys Glu Pro Cys Pro
100 105 110
Met Cys Phe Gly Ala Val His Leu Ser Arg Ile Lys Arg Leu Val Tyr
115 120 125
Gly Ala Lys Ala Glu Ala Ala Ile Ala Ile Gly Phe Asp Asp Phe Ile
130 135 140
Ala Asp Ala Leu Arg Gly Thr Gly Phe Tyr Gln Lys Ala Asn Met Glu
145 150 155 160
Ile Lys Lys Ala Asp Gly Asn Gly Ala Leu Ile Ala Glu Gln Val Phe
165 170 175
Glu Lys Thr Lys Glu Lys Phe Gln Met Tyr
180 185
<210> 4
<211> 248
<212> PRT
<213> Oryza sativa
<400> 4
Met Ala Ser Ala Arg Asp Leu Ala Val Ala Ser Ile Ser Ala Ala Val
1 5 10 15
Gly Ala Ala Ala Ala Ala Ala Ala Leu Arg Phe Leu Ser Ser Tyr Gly
20 25 30
Ala Ser Ser Ala Lys Gln Arg Ser Pro Pro Thr Pro Cys Ala Glu His
35 40 45
Leu Ala Val Asn Gly Cys Ala Ala Glu Arg Pro Pro Val Gln Ser Pro
50 55 60
Phe Asp Pro Ala Lys Arg Glu Gly Tyr Ile Ser Trp Asp Asp Tyr Phe
65 70 75 80
Met Ala Ile Ala Phe Leu Ser Ala Lys Arg Ser Lys Asp Pro Asn Arg
85 90 95
Gln Val Gly Ala Cys Leu Val Ser Gln Glu Gly Ile Ile Leu Gly Ile
100 105 110
Gly Tyr Asn Gly Phe Pro Arg Gly Cys Ser Asp Asn Lys Leu Pro Trp
115 120 125
Ala Lys Lys Ser Ala Lys Gly Asp Pro Leu Glu Thr Lys Tyr Pro Tyr
130 135 140
Val Val His Ala Glu Val Asn Ala Ile Leu Asn Thr Asn His Ala Ser
145 150 155 160
Ala Ala Gly Gln Lys Leu Tyr Val Thr Met Phe Pro Cys Asn Glu Cys
165 170 175
Ala Lys Ile Ile Ile Gln Ser Gly Val Ser Glu Val Ile Tyr Phe Val
180 185 190
Glu Lys Arg Ile Asp Asn Ser Asp Tyr Val Tyr Val Ala Ser His Lys
195 200 205
Leu Leu Ser Met Ala Gly Val Lys Val Arg Lys His Gln Pro Gln Met
210 215 220
Ser Gln Ile Pro Ile Lys Phe Gln Glu Pro Arg Asn Gly Glu Pro Ser
225 230 235 240
Met Asn Ala Ala Ser Ile Phe Ser
245
<210> 5
<211> 183
<212> PRT
<213> Oryza sativa
<400> 5
Met Ala Ala Ala Ala Ala Glu Phe Met Glu Leu Ala Leu Glu Gln Ala
1 5 10 15
Lys Phe Ala Leu Asp Asn Leu Glu Val Pro Val Gly Cys Val Ile Val
20 25 30
Glu Asp Gly Lys Val Ile Ser Ser Gly Ser Asn Lys Thr Asn Ala Thr
35 40 45
Arg Asn Ala Thr Arg His Ala Glu Met Glu Ala Ile Asp Ile Leu Leu
50 55 60
Arg Glu Trp Gln Gly Met Gly Leu Asp Gln Pro Gln Val Ala Glu Lys
65 70 75 80
Phe Ala Arg Cys Asp Leu Tyr Val Thr Cys Glu Pro Cys Ile Met Cys
85 90 95
Ala Met Ala Leu Ser Ile Leu Gly Ile Arg Glu Val Tyr Phe Gly Cys
100 105 110
Ala Asn Asp Lys Phe Gly Gly Cys Gly Ser Ile Met Ser Leu His Gln
115 120 125
Ser Ser Ser Ala Glu Leu Ser Gly Glu Glu Ile Pro Gly Pro Lys Gly
130 135 140
Tyr Lys Cys Thr Gly Gly Ile Met Ala Glu Glu Ala Val Ala Leu Phe
145 150 155 160
Arg Asn Phe Tyr Glu Gln Gly Asn Pro Asn Ala Pro Lys Pro His Arg
165 170 175
Pro Val Arg Ile Ala Pro Gln
180
<210> 6
<211> 186
<212> PRT
<213> Oryza sativa
<400> 6
Met Glu Glu Ala Gln Val Val Glu Ser Lys Asp Gly Thr Ile Ser Val
1 5 10 15
Ala Ser Ala Phe Ala Gly His Gln Glu Ala Val Gln Asp Arg Asp His
20 25 30
Lys Phe Leu Ser Lys Ala Val Glu Glu Ala Tyr Gln Gly Val Asp Cys
35 40 45
Gly His Gly Gly Pro Phe Gly Ala Val Val Val Arg Asn Asp Glu Ile
50 55 60
Val Val Ser Cys His Asn Met Val Leu Asp Tyr Thr Asp Pro Thr Ala
65 70 75 80
His Ala Glu Val Thr Ala Ile Arg Glu Ala Cys Lys Lys Leu Gly Lys
85 90 95
Ile Glu Leu Ser Asp Cys Glu Met Tyr Ala Ser Cys Glu Pro Cys Pro
100 105 110
Met Cys Phe Gly Ala Val His Leu Ser Arg Ile Lys Arg Leu Val Tyr
115 120 125
Gly Ala Lys Ala Glu Ala Ala Ile Ala Ile Gly Phe Asp Asp Phe Ile
130 135 140
Ala Asp Ala Leu Arg Gly Thr Ala Tyr Tyr Gln Lys Ala Asn Leu Glu
145 150 155 160
Ile Arg Arg Ala Asp Gly Asn Gly Ala Leu Ile Ala Glu Gln Val Phe
165 170 175
Glu Asn Thr Lys Glu Lys Phe Arg Met Tyr
180 185
<210> 7
<211> 411
<212> PRT
<213> Oryza sativa
<400> 7
Met Ala Trp Glu Leu Thr Glu Val Pro Gly Asn Pro Thr Pro Ser Leu
1 5 10 15
Arg Gly Ser Thr Val Glu Val Ile Ala Ala Lys Ile Glu Pro Lys Leu
20 25 30
Ala Asn Ser Leu Ile Arg Gln Leu Ser Gln Ala Cys Pro Leu Glu Asn
35 40 45
Leu Arg His Val Lys Arg Val Arg Arg Cys Cys Glu Tyr Gly Glu Lys
50 55 60
Ser Glu Leu Ser Ile Ile Leu Cys Leu Ala Thr Gly Pro Glu His Cys
65 70 75 80
Ser Glu Met Phe Pro Gln Asp Val Lys Lys Ile Val Gly Thr Tyr Glu
85 90 95
Leu Asn Thr Phe Ile Ala Lys Val Ala Arg Phe Pro Ala Thr Ser Lys
100 105 110
Glu Glu Trp Glu Glu Gln Cys Lys Leu Trp Pro Thr Ser Tyr His Pro
115 120 125
Pro His Asp Leu Asp Gly Val Ser Gly Phe Lys Glu Cys Glu Leu Pro
130 135 140
Ser Ile Phe Asn Cys Met Arg Thr Ala Leu Arg Leu Ser Glu Val Gly
145 150 155 160
Asn Ala Ala Val Val Val Asn Pro Ser Thr Met Gln Ile Ile Ala Lys
165 170 175
Ala Thr Asp Gln Thr Leu Gln His Asp Ser Leu Lys Ser Asn Lys Cys
180 185 190
Ala Glu Leu Asn Ser Asp Ser Pro Phe Ser Ser Leu Glu Val Thr Glu
195 200 205
Lys Lys Gly Ser Arg Leu Phe Leu Ser Asn Ser Asn Val Ser Lys Cys
210 215 220
Asn Ser Leu Asn Met Glu Ala Ser Cys Leu Asn Pro Trp Gly Trp Met
225 230 235 240
Lys Pro Arg Pro Ser Glu Gln Lys Ser Leu Pro Cys Glu Gly Gly Phe
245 250 255
Pro Trp His Pro Leu Arg His Ala Ala Ile Val Ala Ile Glu Asn Ala
260 265 270
Ala Glu Arg Asp Lys Val Met Phe Pro Ser Ile Ile Ser Ser Thr Lys
275 280 285
Pro Asn Ser Asp Gly Asn Met Glu Tyr Tyr Ser Val Asn Glu Ser Ala
290 295 300
Lys Arg Leu Lys Val Asp Arg Asn Asp Asp Lys Lys Ile Ala His Glu
305 310 315 320
Ala Ile Cys Asp Asp Leu Ser Glu Thr Arg Pro Tyr Leu Cys Thr Gly
325 330 335
Phe Asp Ile Tyr Leu Val Trp Glu Pro Cys Ser Met Cys Ala Met Ala
340 345 350
Leu Val His Gln Arg Phe Lys Arg Ile Phe Tyr Ala Phe Pro Asn Pro
355 360 365
Ile Thr Gly Ala Leu Gly Ser Val Tyr Arg Leu His Gly Glu Lys Ser
370 375 380
Leu Asn His Arg Tyr Ser Val Phe Arg Val Lys Val Pro Glu Ser Tyr
385 390 395 400
Ser Asn Ser Ser Gly Asp Cys Ser Asp Lys Cys
405 410
<210> 8
<211> 337
<212> PRT
<213> Arabidopsis thaliana
<400> 8
Met Ala Gln Arg Pro Asn Leu Leu Ser His Leu Gln Asp Leu Val Thr
1 5 10 15
Lys Phe Lys Asn Met Thr Met Ala Gln Asp Arg Phe Lys Phe Val Phe
20 25 30
Thr Ala Asn Glu Ala Ala Leu Glu Gly Val Thr Asp Pro Ile Arg Leu
35 40 45
Pro Asn Leu Ile Arg Lys Ala Met Cys Leu Ala Arg Ala Pro Ile Ser
50 55 60
Lys Tyr Lys Val Gly Ala Val Gly Arg Ala Ser Ser Gly Arg Val Tyr
65 70 75 80
Leu Gly Val Asn Val Asp Phe Pro Gly Leu Pro Leu His His Ser Ile
85 90 95
His Ala Glu Gln Phe Leu Val Thr Asn Leu Ala Leu Asn Tyr Glu Lys
100 105 110
Asp Leu Cys Lys Leu Ala Val Ala Ile Ser Thr Asp Gly Leu Glu Phe
115 120 125
Gly Thr Pro Cys Gly Asn Cys Leu Gln Phe Leu Met Glu Met Ser Asn
130 135 140
Ala Leu Asp Met Lys Ile Leu Ser Lys Pro Lys His Glu Ala Gly Ser
145 150 155 160
Phe Ser Ser Leu Arg Leu Leu Leu Pro Asn Val Leu Pro Lys Gly Ser
165 170 175
Pro Phe Leu Leu Glu Lys Arg Tyr Asn Cys Leu Thr Leu Ser Gly Ser
180 185 190
Ala Gly Glu Ile Cys Ser Leu Asp Cys Ser His Leu Lys Arg Arg Ala
195 200 205
Leu Ala Ala Ala Asn Asn Ser Phe Ser Pro Tyr Thr Glu Ser Pro Ser
210 215 220
Gly Val Ala Leu Leu Asp Asn Asp Gly Asn Trp Tyr Arg Gly Trp Tyr
225 230 235 240
Ile Glu Ser Val Ala Ser Asn Pro Ser Leu Gly Pro Val Gln Ala Ala
245 250 255
Leu Val Asp Phe Val Ala Arg Ser Arg Gly Lys Met Phe Asn Lys Ile
260 265 270
Val Gln Ala Val Leu Val Glu Lys Asn Asn Ala Ser Val Ser Gln Glu
275 280 285
Arg Thr Ala Lys Ile Ile Leu Asp Thr Ile Ala Pro Asn Cys Asp Phe
290 295 300
Lys Val Phe His Cys Ser Val Asp Cys Ala Lys Arg Leu Lys Tyr Leu
305 310 315 320
Arg Glu Thr Leu Val Ile Asp Thr Leu Gly Asp Tyr Thr Gly Leu His
325 330 335
Tyr
<210> 9
<211> 223
<212> PRT
<213> Arabidopsis thaliana
<400> 9
Met Ala Gln Asp Gln Tyr Lys Phe Val Phe Thr Ala Lys Glu Ala Glu
1 5 10 15
Ser Glu Gly Val Thr Glu Pro Met Arg Leu Pro Asn Leu Ile Gly Lys
20 25 30
Ala Met Ser Leu Ala Leu Ala Pro Ile Ser Lys Tyr Lys Val Gly Ala
35 40 45
Val Gly Arg Ala Arg Ser Gly Arg Ile Tyr Leu Gly Val Asn Val Glu
50 55 60
Leu Pro Gly Leu Pro Leu His His Ser Ile His Ala Glu Gln Phe Leu
65 70 75 80
Val Thr Asn Leu Ala Leu Asn Ser Glu Lys Gly Leu His Leu Leu Ala
85 90 95
Val Thr Ile Ser Thr Asp Gly Asn Asp Phe Gly Ala Pro Cys Gly Asn
100 105 110
Cys Arg Gln Phe Leu Met Glu Ile Ser Lys Ala Leu Asn Ile Lys Ile
115 120 125
Leu Leu Lys Ser Lys Tyr Glu Ala Glu Gly Ser Phe Lys Ser Leu Arg
130 135 140
Leu Leu Leu Pro Asp Arg Phe Ser Pro Asp Asp Val Leu Pro Lys Gly
145 150 155 160
Ser Pro Leu Leu Leu Glu Lys Arg His Asn Cys Leu Ser Leu Ser Gly
165 170 175
Ser Ala Glu Glu Ile Cys Ser Ser Asp Cys Ser His Leu Lys Cys Lys
180 185 190
Ala Leu Ala Ala Ala Asn Asn Ser Phe Ser Pro Tyr Thr Asn Ser Pro
195 200 205
Ser Gly Val Ala Leu Gln Asp Asp Asp Gly Asn Trp Tyr Arg Gly
210 215 220
<210> 10
<211> 251
<212> PRT
<213> Arabidopsis thaliana
<400> 10
Met Thr Gln Gln Leu Lys Phe Ile Leu Thr Arg Glu Glu Ala Ala Ser
1 5 10 15
Lys Gly Val Ser Arg Pro Ser Asp Leu Val Lys Leu Glu Glu Glu Ala
20 25 30
Met Ile Leu Ala Arg Ala Pro Ile Ser Gly Val Gln Asp Ala Val Leu
35 40 45
Gly Leu Ala Ser Ser Asp Arg Ile Phe Leu Gly Val Asn Val Glu Phe
50 55 60
Glu Gly Leu Pro Leu His His Ser Ile Ser Ala Glu Gln Phe Leu Val
65 70 75 80
Ala Asn Leu Ala Leu Asn Phe Glu Gln Glu Leu His Ala Cys Leu Ile
85 90 95
Pro Ser Arg Phe Tyr Leu Glu Ser Phe Glu Glu Asp Val Pro Leu Leu
100 105 110
Leu Val Pro Gln Asn Asn Arg Leu Ala His Ser Asp Pro Phe Ser Ala
115 120 125
Ala Glu Ile Cys Ser Asn Pro Glu His Cys Ser His Leu Lys Cys Arg
130 135 140
Ala Leu Thr Ala Ala Asn Lys Ser Asn Ala Gln Tyr Ser Lys Cys Pro
145 150 155 160
Ser Gly Val Ala Leu Ile Cys Glu Gly Glu Val Tyr Gly Gly Trp Cys
165 170 175
Ile Glu Ser Ala Ala Tyr Asn Leu Ser Leu Gly Pro Val Gln Ala Ala
180 185 190
Leu Val Asp Phe Met Ala Arg Gly Glu Gly Lys Gly Phe Glu Met Ile
195 200 205
Thr Gly Ala Val Leu Val Glu Met Asn Asp Ala Lys Val Ser Gln Glu
210 215 220
Ala Thr Ala Arg Ile Leu Leu Lys Thr Ile Ala Pro Gly Cys Asn Phe
225 230 235 240
Ser Val Phe Arg Cys His Lys Thr Ala Glu Asn
245 250
<210> 11
<211> 346
<212> PRT
<213> Arabidopsis thaliana
<400> 11
Met Ala Gln Gln Tyr Lys Phe Val Phe Thr Ala Glu Gln Ala Ala Ser
1 5 10 15
Glu Gly Val Thr Asp His Lys Lys Leu Pro Lys Leu Ile Arg Lys Ala
20 25 30
Arg Asn Leu Val Lys Ala Pro Ser Lys Val Gly Ala Val Gly Arg Ala
35 40 45
Ser Ser Gly Arg Phe Tyr Leu Gly Val Asn Val Glu Phe Lys Gly Leu
50 55 60
Leu Pro His Phe Ser Ile His Ala Glu Gln Phe Leu Ile Ala Asn Leu
65 70 75 80
Ala Leu Asn Ser Glu Pro Lys Leu Thr His Leu Ala Val Ser Asp Asn
85 90 95
Gly Thr Val Phe Gln Asp Pro Cys Tyr Asp Cys Thr Arg Phe Leu Lys
100 105 110
Glu Ile Asn Asn Ala His Gln Ile Glu Ile Leu Ile Lys Asn Ala His
115 120 125
Gly Arg Asp Gly Ser Phe Lys Ser Leu Glu Ser His Met Pro Asp Glu
130 135 140
Phe Gly Ser Glu Ser Ile Leu Ser Ala Glu Pro Ser Leu Leu Leu Met
145 150 155 160
Glu Arg Asp Asn Cys Leu Ala Leu Ile Asp Glu Asp Ser Ala Ala Gly
165 170 175
Gly Ile Ser Ser Asn Ala Asp Leu Cys Ser Phe Leu Lys Leu Glu Ala
180 185 190
Leu Lys Ala Ala Asn Lys Ser Tyr Ala Pro Tyr Arg Lys Cys Pro Ser
195 200 205
Gly Val Ala Leu Phe Cys Glu Gly Glu Val Tyr Ala Gly Trp Tyr Ile
210 215 220
Glu Thr Val Asp Arg Thr Ile Ser Leu Gly Pro Val Gln Ala Ala Leu
225 230 235 240
Val Asp Phe Ile Ala Arg Gly Glu Gly Lys Gly Phe Asp Lys Ile Thr
245 250 255
Gly Ala Val Leu Val Glu Lys Lys Asp Ala Lys Val Gly Gln Glu Asp
260 265 270
Thr Ala Arg Lys Leu Leu Glu Lys Ile Ala Ala Pro Asn Cys Asp Phe
275 280 285
Lys Val Phe His Cys Gln Glu Glu Arg Lys Asp Trp Ile Thr Gly Ala
290 295 300
Val Leu Val Glu Lys Lys Asp Ala Lys Glu Gly Gln Glu Gly Lys Leu
305 310 315 320
Leu Glu Lys Ile Ala Ala Pro Asn Cys Asp Phe Lys Val Ser His Cys
325 330 335
Asp Glu Glu Leu Lys Asp Trp Ile Lys Leu
340 345
<210> 12
<211> 293
<212> PRT
<213> Arabidopsis thaliana
<400> 12
Met Lys Phe Val Tyr Thr Pro Ser Glu Ala Ala Glu Glu Gly Val Arg
1 5 10 15
Gly Pro Ser Asp Leu Pro Lys Leu Ile Asp Lys Ala Met Ser Leu Ala
20 25 30
Arg Ala Pro Val Ser Thr Phe Lys Val Gly Ala Val Gly Leu Thr Ser
35 40 45
Ser Gly Glu Val Phe Leu Gly Val Asn Val Glu Phe Pro Asn Leu Pro
50 55 60
Leu His His Thr Ile His Ala Glu Gln Phe Leu Val Thr Asn Leu Ala
65 70 75 80
Leu Asn Ser Met Lys Lys Leu Thr His Ile Ala Val Ser Val Thr Gly
85 90 95
Thr Ile Phe Gly Ala Pro Cys Gly His Cys Arg Gln Phe Tyr Gln Glu
100 105 110
Met Arg Asn Ala Pro Glu Ile Glu Ile Leu Ile Lys Arg Pro Lys Asp
115 120 125
Gly Ile Asp Glu Phe Met Ser Leu Lys Ser Leu Met Pro Glu Arg Phe
130 135 140
Gly Pro Asp Ser Ile Leu Pro Glu Asp Ala Ser Leu Leu Leu Glu Gln
145 150 155 160
Arg Asp Asn Ser Leu Val Leu Ser Asp Pro Glu Glu Ile Cys Ser Asp
165 170 175
Pro Glu Asp Cys Ser His Thr Lys Cys Arg Ala Leu Ala Ala Ala Asn
180 185 190
Lys Ser Tyr Ala Pro Tyr Ser Lys Cys Pro Ser Gly Val Ala Leu Ile
195 200 205
Cys Gly Gly Glu Val Tyr Lys Gly Trp Tyr Ile Glu Ser Val Ala Tyr
210 215 220
Asn Pro Ser Leu Gly Pro Val Glu Ala Ala Leu Val Asp Phe Val Ala
225 230 235 240
Arg Gly Gly Gly Lys Glu Phe Asn Glu Ile Thr Glu Val Val Leu Val
245 250 255
Glu Met Lys Asp Val Lys Val Ser Gln Glu Ala Thr Ala Arg Thr Phe
260 265 270
Leu Asp Lys Ile Ala Pro Lys Cys Asp Phe Lys Val Leu His Cys Tyr
275 280 285
Lys Thr Asn Lys Asn
290
<210> 13
<211> 307
<212> PRT
<213> Arabidopsis thaliana
<400> 13
Met Ala Ala Gln Asp Lys Tyr Lys Phe Val Phe Thr Ala Lys Glu Ala
1 5 10 15
Ala Ser Glu Gly Val Thr Glu Pro Ile Arg Leu Pro Lys Leu Ile Arg
20 25 30
Lys Ala Met Ser Leu Ala Arg Gly Pro Ile Ser Lys Tyr Lys Val Gly
35 40 45
Ala Val Gly Arg Ala Ser Ser Gly Arg Val Tyr Leu Gly Val Asn Val
50 55 60
Glu Phe Pro Gly Leu Pro Leu His His Ser Ile His Pro Glu Gln Phe
65 70 75 80
Leu Val Thr Asn Leu Ala Leu Asn Ser Glu Lys Gly Leu Arg Gln Leu
85 90 95
Ala Val Ala Ile Ser Ser Asp Cys Ile Glu Phe Gly Ala Pro Cys Gly
100 105 110
Asn Cys Arg Gln Phe Leu Met Glu Thr Ser Asn Glu Leu Asp Ile Lys
115 120 125
Ile Leu Leu Lys Ser Lys His Glu Ala Glu Gly Ser Phe Ser Ser Leu
130 135 140
Lys Leu Leu Leu Pro Tyr Arg Phe Thr Pro Asp Asp Val Leu Pro Lys
145 150 155 160
Gly Ser Pro Leu Leu Leu Glu Lys Arg Asp Asn Cys Leu Thr Leu Ser
165 170 175
Gly Ser Thr Glu Glu Ile Cys Ser Ser Asp Cys Ser His Leu Lys Cys
180 185 190
Lys Ala Leu Ala Ala Ala Asn Asn Ser Phe Ser Pro Tyr Thr Glu Ser
195 200 205
Pro Ser Gly Val Ala Leu Gln Asp Asp Glu Gly Lys Trp Tyr Arg Gly
210 215 220
Trp Tyr Ile Glu Ser Val Ala Tyr Ser Pro Ser Leu Gly Pro Val Gln
225 230 235 240
Ala Ala Leu Val Asp Phe Val Ala Arg Ser Arg Gly Lys Gly Phe Asn
245 250 255
Lys Ile Val Glu Ala Val Leu Val Glu Lys Asn Asn Ala Arg Val Ser
260 265 270
Gln Glu Arg Thr Ala Lys Met Ile Leu Asp Thr Ile Ala Ala Pro Asn
275 280 285
Cys Asp Phe Lys Val Phe His Cys Tyr Val Asp Leu Gln Lys Lys Phe
290 295 300
Ile Thr Glu
305
<210> 14
<211> 293
<212> PRT
<213> Arabidopsis thaliana
<400> 14
Met Ala Gln Pro Met Arg Phe Met Leu Asn His Ile Glu Thr Glu Ser
1 5 10 15
Tyr Gly Ala Phe Thr Pro Gln Asn Leu Ser Pro Leu Ile Asn Arg Ala
20 25 30
Ile Pro His Thr Arg Ala Gln Ile Ser Gly Ser Pro Val Val Ala Val
35 40 45
Gly Arg Gly Ser Ser Gly Arg Thr Phe Phe Gly Val Asn Val Glu Leu
50 55 60
Pro Gly Leu Pro Leu Asp His Ser Ile His Ala Glu Gln Phe Leu Leu
65 70 75 80
Ala Asn Leu Ala Leu His Phe Glu Gln Lys Leu Glu Cys Ile Ala Ile
85 90 95
Ser Thr Asn Gly Tyr Tyr Phe Gln Glu Pro Cys Gly His Cys Cys Gln
100 105 110
Leu Leu His Lys Ile Arg Asp Met Ser Asp Thr Lys Ile Leu Leu Thr
115 120 125
Asn Pro Thr Gly Gln Lys Gly Thr Tyr Met Asn Leu Ser Thr Phe Leu
130 135 140
Pro Gln Gly Leu Ile Ser Gln Ala Asn Val Pro Arg Leu Leu Glu Arg
145 150 155 160
Asn Phe Asn Cys Ile Glu Leu Ile Asn His Ser Leu Tyr Met Asp Ile
165 170 175
Cys Ser Tyr Ser Glu His Cys Asn His Leu Asn Cys Arg Ala Leu Lys
180 185 190
Ala Ala Thr Ile Ser Tyr Ala Pro Asp Ser Lys Cys Pro Ser Gly Val
195 200 205
Ala Leu Ile Asp His Arg Gly Lys Val Tyr Ser Gly Gly Tyr Met Glu
210 215 220
Ser Val Ala His Asn Thr Ser Leu Gly Pro Val Gln Ala Ala Leu Val
225 230 235 240
Asp Phe Val Ala Asn Gly Asp Gly Gln Glu Phe Lys Asn Ile Val Glu
245 250 255
Ala Val Leu Val Glu Lys Lys Cys Gly Val Leu Ser Gln Glu Ala Thr
260 265 270
Ala Arg Met Ile Leu Glu Lys Ile Ala Asp Pro Asp Cys Ile Phe Arg
275 280 285
Val Leu His Cys Lys
290
<210> 15
<211> 298
<212> PRT
<213> Arabidopsis thaliana
<400> 15
Met Ala Gln Pro Pro Asn Pro Tyr Ala Ala Leu Thr Pro Thr Glu Ala
1 5 10 15
Glu Ser Ser Gly Pro Phe Glu Pro Glu Thr Leu Leu Pro Leu Ile Asn
20 25 30
Arg Ala Leu Pro Leu Ala Gln Ala Leu Pro Ser Gln Ser Pro Leu Val
35 40 45
Ala Val Gly Arg Gly Ser Ser Gly Arg Thr Phe Leu Gly Val Asn Val
50 55 60
Glu Leu Pro Gly Leu Ser Pro Leu His Ser Ile His Ala Gly Gln Phe
65 70 75 80
Leu Val Val His Leu Ala Leu Asn Asn Glu Arg Thr Leu Asn Cys Leu
85 90 95
Ala Phe Ser Ser Asn Gly Ser Tyr Phe Asp Pro Pro Cys Pro His Cys
100 105 110
Cys Gln Leu Leu Gln Glu Ile Arg Asn Ala Ser Ser Thr Lys Leu Leu
115 120 125
Ile Thr Asp Pro Ser Arg Gln Arg Asp Met Ser Leu Ser Thr Tyr Leu
130 135 140
Pro Gln Lys Tyr Leu Ser Leu Tyr Asn Glu Val Pro Lys Tyr Phe Phe
145 150 155 160
Ala Arg Leu Leu Asp Glu Asn Arg Asn Asn Gly Leu Thr Leu Ile Asn
165 170 175
Pro Asn Pro Ile Arg Asp Cys Leu Asp Ser Glu Ile Cys Asn His Leu
180 185 190
Ser Cys Arg Ala Leu Lys Ala Ala Asn Arg Ser Tyr Ala Pro Tyr Ser
195 200 205
Lys Ser Pro Ser Gly Val Ala Leu Met Asp Phe Gln Gly Arg Val Tyr
210 215 220
Ser Gly Trp Ser Ile Glu Ser Val Ala Asn Pro Ile Leu Gly Ala Ala
225 230 235 240
Gln Ala Ala Leu Val Asp Phe Met Thr Asn Gly Gly Gly His Glu Phe
245 250 255
Asn Asn Ile Val Arg Gly Phe Leu Val Glu Lys Arg Asp Ala Lys Leu
260 265 270
Ser His Leu Ala Thr Ala Arg Glu Ile Leu Asn Lys Val Ala His Phe
275 280 285
Ser Phe Ile Leu Arg Val Leu His Cys Gln
290 295
<210> 16
<211> 293
<212> PRT
<213> Glycine max
<400> 16
Met Asp Pro Pro Pro Ser Lys Phe Val Ile Glu Ala Ser Glu Ala Leu
1 5 10 15
Ala Leu Ala Glu Ser Ala Ala Val Thr Leu Pro Glu Leu Leu Pro Thr
20 25 30
Leu Val Pro Ala Ala Gln Pro Leu Ala Arg Pro Pro Ile Ser Lys Phe
35 40 45
Ser Val Ala Ala Val Gly Leu Ala Pro Ser Gly Arg Ile Phe Val Gly
50 55 60
Val Asn Leu Glu Phe Pro Gly Leu Pro Leu His His Ser Val His Ala
65 70 75 80
Glu Gln Phe Leu Ile Thr Asn Leu Ser Leu Asn Ala Glu Pro His Leu
85 90 95
Val Ser Leu Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe
100 105 110
Leu Gln Glu Leu Arg Ala Ala Ala Asp Val Lys Ile Leu Val Thr Ser
115 120 125
Glu Ala Thr Ala Glu Phe Arg Ala Leu Ser Asp Phe Leu Pro His Arg
130 135 140
Phe Gly Pro His Asp Leu Leu Pro Leu Glu Ala Pro Leu Leu Leu Glu
145 150 155 160
Pro His His Tyr Thr Leu Thr Leu Gln His Tyr Leu Asn Ala His Val
165 170 175
Pro Asn His Lys Leu Lys Ile Ala Ala Leu Glu Ala Ala Asn Lys Ser
180 185 190
His Ala Pro Tyr Ser Gly Ser Pro Ser Gly Val Ala Leu Leu Asp Cys
195 200 205
His Gly Asn Val Phe Lys Gly Ser Tyr Met Glu Ser Ala Ala Phe Asn
210 215 220
Pro Ser Leu Gly Pro Val Gln Ala Ala Leu Val Ala Phe Val Ala Gly
225 230 235 240
Gly Gly Gly Asp Tyr Asp Arg Ile Val Gly Ala Ala Leu Val Glu Met
245 250 255
Asp Gly Ala Val Val Lys Gln Glu His Thr Ala Arg Leu Leu Ile His
260 265 270
Ser Ile Ser Pro Asn Cys Gln Phe Asp Thr Phe Leu Cys His Asn Asn
275 280 285
Asn Asn Asn Glu Tyr
290
<210> 17
<211> 278
<212> PRT
<213> Glycine max
<400> 17
Met Glu Pro Arg Phe Val Ile Ser Ala Ser Glu Ala Glu Ser Gln Ser
1 5 10 15
Ile Ser Lys Leu Leu Pro Ser Leu Val Ser Ser Ser Gln Ser Leu Ala
20 25 30
Arg Pro Ser Ile Ser Asn Phe Pro Val Ala Ala Val Gly Leu Ala Ala
35 40 45
Ser Gly Arg Ile Phe Val Gly Val Asn Val Glu Phe Pro Gly Leu Pro
50 55 60
Phe His His Thr Ile His Ala Glu Gln Phe Leu Leu Thr Asn Met Ala
65 70 75 80
Asn Asn Ala Glu Thr Arg Leu Asp Ser Phe Ala Val Ser Ala Ala Pro
85 90 95
Cys Gly His Cys Arg Gln Phe Leu Gln Glu Leu Arg Asp Ala Pro Asp
100 105 110
Ile Gln Ile Leu Ile Thr Ser His Lys Asn Pro His Phe Ser Pro Leu
115 120 125
Ser His Phe Leu Ser His His Phe Gly Pro His Asp Leu Leu Pro Lys
130 135 140
Thr Val Pro Leu Leu Leu Glu Pro Arg His Asn Ala Leu Ser Leu Pro
145 150 155 160
Gln Asn Asp His Phe Asn Ala Leu Ala Ile Ala Ala Leu Glu Ala Ala
165 170 175
Asn Asn Ser His Ala Pro Tyr Ser Ala Ser Pro Ser Gly Val Ala Leu
180 185 190
Leu Asp Ser Lys Gly Asn Val Phe Lys Gly Ser Tyr Ile Glu Ser Ala
195 200 205
Ala Tyr Asn Pro Ser Leu Gly Pro Leu Gln Ala Ala Ile Val Ala Phe
210 215 220
Ile Ala Gly Gly Gly Gly Asp Tyr Glu Glu Ile Val Ala Ala Val Leu
225 230 235 240
Val Glu Lys Glu Gly Ala Val Ile Lys Gln Asp His Thr Ala Arg Leu
245 250 255
Leu Leu His Ser Ile Ala Pro Arg Cys His Phe Asn Asn Phe Leu Ala
260 265 270
Ser Gln Ser Pro Ser Thr
275
<210> 18
<211> 277
<212> PRT
<213> Glycine max
<400> 18
Met Asp Gln Pro Ser Arg Lys Phe Leu Ile Ser Ala Ser Glu Ala Leu
1 5 10 15
Ala Met Ala Glu Ser Ala Ala Val Thr Leu Pro Glu Leu Leu Pro Thr
20 25 30
Leu Val Pro Thr Ala Gln Pro Leu Ala Pro Ser Gly Arg Ile Leu Val
35 40 45
Gly Val Asn Leu Glu Phe Pro Gly Leu Pro Leu His His Ser Val His
50 55 60
Ala Glu Gln Phe Leu Ile Thr Asn Leu Ser Leu Asn Ala Glu Pro His
65 70 75 80
Leu Val Ser Leu Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln
85 90 95
Phe Leu Gln Glu Leu Arg Ala Ala Ala Asp Val Gln Ile Leu Val Thr
100 105 110
Ser Glu Ala Thr Ala Glu Phe Gly Pro Leu Ser Asp Leu Leu Pro Gln
115 120 125
Arg Phe Cys Pro His Asp Leu Leu Pro Leu Glu Ala Pro Leu Leu Leu
130 135 140
Glu Pro His His Asn Thr Leu Thr Leu Thr Leu His His Gln His Leu
145 150 155 160
Pro Asn Tyr Lys Leu Lys Thr Ala Ala Leu Glu Ala Ala Asn Lys Ser
165 170 175
His Ala Pro Tyr Ser Gly Ser Pro Ser Gly Val Ala Leu Leu Asp Cys
180 185 190
His Gly Asn Val Phe Lys Gly Ser Tyr Met Glu Ser Ala Ala Phe Asn
195 200 205
Pro Ser Leu Gly Pro Val Gln Ala Ala Leu Val Ala Phe Val Ser Gly
210 215 220
Gly Gly Gly Asp Tyr Asp Gln Ile Val Gly Ala Val Leu Val Glu Lys
225 230 235 240
Glu Asp Ala Val Val Lys Gln Glu Ser Thr Ala Arg Leu Leu Ile Asn
245 250 255
Ser Ile Ser Pro Asn Cys Gln Phe Asp Thr Phe Leu Cys His Cys Asn
260 265 270
Pro Asn His Ile Val
275
<210> 19
<211> 288
<212> PRT
<213> Zea mays
<400> 19
Met Gly Ala Glu Glu Ala Glu Arg Ala Ala Ala Ala Ala Gly Val Ala
1 5 10 15
Thr Val Gln Asp Leu Leu Pro Leu Leu Val Pro Ser Ala Met Glu Arg
20 25 30
Ala Arg Ala Pro Ile Ser Arg Phe Arg Val Gly Ala Val Gly Leu Gly
35 40 45
Ala Ser Gly Arg Val Tyr Ala Gly Val Asn Leu Glu Phe Arg Gly Leu
50 55 60
Pro Leu Cys His Ser Val His Ala Glu Gln Phe Leu Val Ala Asn Ala
65 70 75 80
Ala Ala Ala Gly Glu Pro Ala Leu Arg Ala Val Ala Val Ser His Met
85 90 95
Pro Cys Gly His Cys Arg Gln Phe Leu Gln Glu Met Arg Gly Ala Ala
100 105 110
Ala Val Gln Ile Leu Val Thr Ser Asp Ala Ala Glu Gly Arg Ala Pro
115 120 125
Glu Trp Arg Ala Leu Ala Ser Leu Leu Pro Arg Pro Phe Gly Pro Arg
130 135 140
Asp Leu Leu Pro Glu Asp Val Ala Leu Val Leu Glu Pro His Gly Asn
145 150 155 160
Asp Leu Gly Gly Pro Val Ala Asn Gly Phe Ala Arg Gly Asp Leu Asp
165 170 175
Ala Arg Leu Arg Glu Ala Ala Glu Ala Ala Ala Arg Ala Ala His Ala
180 185 190
Pro Tyr Ser Lys Cys Pro Ser Gly Phe Ala Val Ala Asp Gly Asp Gly
195 200 205
Arg Ile Tyr Ala Gly Gly Cys Ile Glu Ser Ala Ala Tyr Asn Pro Thr
210 215 220
Leu Gly Pro Val Gln Ala Ala Ile Ile Gly Met Val Ala Ala Gly Gly
225 230 235 240
Gly Pro Ala Gly Asp Val Val Ala Ala Ala Leu Val Glu Lys Glu Arg
245 250 255
Ala Glu Val Ala Gln Glu Ala Thr Ala Arg Ile Phe Leu Glu Ala Val
260 265 270
Ala Pro Arg Ala Thr Phe His Val Tyr Asn Tyr Arg Pro Ser Asp Ala
275 280 285
<210> 20
<211> 288
<212> PRT
<213> Zea mays
<400> 20
Met Gly Ala Glu Asp Ala Glu Arg Ala Ala Ala Ala Ala Gly Val Ala
1 5 10 15
Thr Val Gln Asp Leu Leu Pro Leu Leu Val Pro Ser Ala Arg Lys Arg
20 25 30
Ala Gln Val Pro Ile Ser Gly Phe Pro Val Gly Ala Val Gly Leu Gly
35 40 45
Ala Ser Gly Arg Val Tyr Val Gly Val Asn Leu Glu Phe Arg Gly Leu
50 55 60
Pro Leu Cys His Ser Val His Ala Glu Gln Phe Leu Val Ala Asn Ala
65 70 75 80
Ala Ala Ala Gly Glu Pro Glu Leu Arg Ala Val Ala Val Ser His Met
85 90 95
Pro Cys Gly His Cys Arg Gln Phe Leu Gln Glu Ile Arg Gly Ala Ala
100 105 110
Ser Ile Arg Ile Leu Val Thr Ser Asp Ala Ala Glu Gly Cys Ala Pro
115 120 125
Glu Trp Arg Thr Val Ala Ser Leu Leu Leu Arg Pro Phe Gly Pro His
130 135 140
Asp Leu Leu Pro Gly Asp Val Pro Leu Val Leu Glu Ala His Asp Asn
145 150 155 160
Ala Leu Gly Asp Pro Val Ala Asn Gly Leu Ala Arg Ser Asp Leu Asp
165 170 175
Ala Arg Leu Arg Gln Ala Ala Glu Ala Ala Ala Arg Ala Ala His Ala
180 185 190
Pro Tyr Ser Lys Cys Pro Ser Gly Phe Ala Val Ala Asp Gly Asp Gly
195 200 205
Arg Ile Tyr Ala Gly Gly Cys Leu Glu Ser Ala Ala Tyr Asn Pro Thr
210 215 220
Leu Gly Pro Val Gln Ala Ala Ile Ile Ala Met Val Ala Ala Gly Gly
225 230 235 240
Cys Ser Ala Gly Asp Val Val Ala Ala Ala Leu Val Glu Lys Glu Gln
245 250 255
Ala Ser Val Ala Gln Glu Ala Thr Ala Arg Ile Phe Leu Asp Ala Val
260 265 270
Ala Pro His Ala Ser Phe His Val Tyr Asn Tyr Arg Pro Ser Asp Ala
275 280 285
<210> 21
<211> 310
<212> PRT
<213> Manihot esculenta
<400> 21
Met Glu Gln Pro Arg Phe Val Ile Glu Ala Cys Glu Ala Glu Ser Met
1 5 10 15
Ala Lys Lys Ser Gly Leu Thr Val Pro Gln Leu Leu Pro Thr Leu Val
20 25 30
Gln Glu Ala Arg Ser Leu Ala Arg Thr Pro Ile Ser Asn Tyr Tyr Val
35 40 45
Gly Ala Val Gly Leu Gly Ser Ser Gly Arg Ile Phe Phe Gly Ala Asn
50 55 60
Leu Glu Phe Pro Gly Leu Pro Leu His His Ser Val His Ala Glu Gln
65 70 75 80
Phe Leu Ile Thr Asn Leu Thr Leu Asn Ala Glu Pro Gly Leu Asn Tyr
85 90 95
Val Ser Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Phe Gln
100 105 110
Glu Ile Arg Asn Ala Pro Asp Ile Gln Ile Leu Ile Thr Asp Asp Ser
115 120 125
Asn Asn Asn Asn Cys Ser Gly Val Ser Gln Asn Asn Gly Asp Thr Arg
130 135 140
Glu Phe Glu Ser Phe Ser Arg Phe Leu Pro His Arg Phe Gly Pro Asp
145 150 155 160
Asp Leu Leu Asp Lys Asp Val Pro Leu Ala Leu Glu Pro His Asn Asn
165 170 175
His Leu Ser Phe Leu Tyr Asp Ser Asn Ser Asn Ile Pro Asn Gly Ile
180 185 190
Asn Thr Pro Val Cys Asp Asp Leu Lys Tyr Glu Ala Leu Lys Ala Ala
195 200 205
Asn Lys Ser His Ala Pro Tyr Ser Asn Cys Pro Ser Gly Val Ala Leu
210 215 220
Met Asp Cys Glu Gly Lys Val Tyr Arg Gly Ser Tyr Met Glu Ser Ala
225 230 235 240
Ala Tyr Asn Pro Ser Leu Gly Pro Val Gln Ala Ala Ile Val Ala Tyr
245 250 255
Ile Val Gly Gly Gly Gly Gly Gly Tyr Glu Lys Ile Val Ala Ala Val
260 265 270
Leu Val Glu Lys Glu Gly Ala Val Val Arg Gln Glu Tyr Thr Ala Arg
275 280 285
Leu Leu Leu Gln Val Ile Ser Pro Lys Cys Glu Phe Lys Val Leu His
290 295 300
Cys Gly Met Lys Ser Cys
305 310
<210> 22
<211> 309
<212> PRT
<213> Manihot esculenta
<400> 22
Met Glu His Pro Arg Phe Val Ile Asp Ala Ser Glu Ala Glu Ser Met
1 5 10 15
Ala Lys Gln Leu Gly Leu Thr Val Leu Gln Leu Leu Pro Thr Leu Val
20 25 30
Lys Glu Ala Arg Ser Leu Ala Arg Pro Pro Ile Ser Ser Tyr Tyr Val
35 40 45
Gly Ala Val Gly Leu Gly Ser Ser Gly Arg Ile Phe Phe Gly Ser Asn
50 55 60
Leu Glu Phe Pro Gly Leu Pro Leu His His Ser Val His Ala Glu Gln
65 70 75 80
Phe Leu Ile Thr Asn Leu Thr Leu Asn Ala Glu Ser Gly Leu Asn Tyr
85 90 95
Val Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Phe Gln
100 105 110
Glu Ile Arg Asn Ala Pro Asp Ile Lys Ile Leu Ile Thr Asp Asp Ser
115 120 125
Asn Arg Asn Asn Cys Gly Gly Val Ala Glu Asn Ser Gly Asp Ala Lys
130 135 140
Phe Glu Ser Leu Ser His Phe Leu Pro His Arg Phe Gly Pro Asp Asp
145 150 155 160
Leu Leu Asp Lys Asn Val Pro Leu Ala Leu Glu Pro His His Asn His
165 170 175
Leu Ser Leu Leu Ser Asp Phe Asn Ser Asn Ile Pro Asn Gly Ile Thr
180 185 190
Ser Arg Val Cys Asp Asp Leu Lys Tyr Glu Ala Leu Glu Ala Ala Asn
195 200 205
Lys Ser His Ala Pro Tyr Ser Asp Cys Pro Ser Gly Val Ala Leu Met
210 215 220
Asp Cys Glu Gly Lys Val Tyr Lys Gly Ser Tyr Met Glu Ser Ala Ala
225 230 235 240
His Asn Pro Ser Leu Gly Pro Val Gln Ala Ala Leu Val Ala Tyr Val
245 250 255
Val Gly Gly Gly Gly Ser Gly Tyr Glu Lys Ile Val Ala Ala Val Leu
260 265 270
Val Glu Lys Glu Gly Ala Met Val Arg Gln Gly Tyr Thr Ala Arg Leu
275 280 285
Leu Leu Gln Met Ile Ser Pro Lys Cys Glu Phe Lys Val Ile His Cys
290 295 300
Gly Leu Lys Ser Arg
305
<210> 23
<211> 284
<212> PRT
<213> Linum usitatissimum
<400> 23
Met Glu Glu Gln Val Arg Phe Val Ile Glu Ala Ala Glu Ala Glu Ser
1 5 10 15
Met Thr Lys Gln Phe Gly Leu Thr Lys Val Glu Gly Leu Leu Pro Thr
20 25 30
Leu Val Arg Ser Ala Gln Ser Leu Ala Arg Pro Pro Ile Ser Asn Tyr
35 40 45
Gln Val Gly Ala Val Gly Leu Gly Ser Ser Gly Arg Ile Phe Phe Gly
50 55 60
Ala Asn Ile Glu Phe Pro Gly Leu Pro Leu Asn Asn Ser Ile His Ala
65 70 75 80
Glu Gln Phe Leu Val Thr Asn Leu Tyr Leu Asn Ala Glu Pro Ser Met
85 90 95
Ser Ser Met Ala Val Ser Ser Ala Pro Cys Gly His Cys Arg Gln Phe
100 105 110
Phe Gln Glu Leu Arg Ser Ser Gly Ser Ile Arg Ile Leu Val Thr Asp
115 120 125
Asp Asp His Arg Arg Glu Phe Glu Asp Phe Ser Lys Phe Leu Pro His
130 135 140
Arg Phe Gly Pro Asp Asp Leu Leu His Lys Asp Ser Pro Leu Ile Leu
145 150 155 160
Glu Arg His Asp Asn Lys Leu Ser Leu Ile Ser Asp Ser Asp Ser Glu
165 170 175
Gly Glu Leu Lys Met Glu Ala Leu Glu Ala Ala Asn Arg Ser His Ala
180 185 190
Pro Tyr Ser Lys Cys Pro Ser Gly Ile Ala Leu Met Asp Ser Glu Gly
195 200 205
Lys Val Tyr Arg Gly Ser Tyr Met Glu Ser Ala Ala Phe Asn Pro Ser
210 215 220
Leu Gly Pro Val Gln Ala Ala Met Val Ala Cys Ile Val Gly Gly Ala
225 230 235 240
Tyr Glu Arg Ile Val Ala Ala Val Leu Val Glu Lys Glu Gly Ala Leu
245 250 255
Val Lys Gln Glu Ala Thr Ala Arg Leu Ile Leu Ala Thr Val Ala Pro
260 265 270
Lys Cys Glu Leu Gln Val Phe His Cys Arg Ser Gly
275 280
<210> 24
<211> 316
<212> PRT
<213> Linum usitatissimum
<400> 24
Met Glu Glu Gln Val Arg Phe Val Ile Glu Ala Ala Glu Ala Glu Ser
1 5 10 15
Met Ala Lys Gln Leu Gly Leu Thr Lys Val Glu Glu Leu Leu Pro Thr
20 25 30
Leu Val Arg Ser Ala Gln Ser Leu Ala Arg Pro Pro Ile Ser Asn Tyr
35 40 45
Gln Val Gly Ala Gly Gly Leu Gly Ala Pro Pro Leu Thr Arg Ser Ala
50 55 60
Leu Ser Trp Pro Arg Pro Pro Ile Ser Asn Tyr Gln Val Gly Ala Val
65 70 75 80
Gly Leu Gly Ser Ser Gly Arg Ile Phe Phe Gly Ala Asn Ile Glu Phe
85 90 95
Pro Gly Leu Pro Leu Asn Asn Ser Ile His Ala Glu Gln Phe Leu Val
100 105 110
Ile Asn Leu Tyr Leu Asn Ala Glu Pro Ala Met Ser Ser Met Ala Val
115 120 125
Ser Ser Ala Pro Cys Gly His Cys Arg Gln Phe Phe Gln Glu Leu Arg
130 135 140
Ser Ser Gly Ser Ile Arg Ile Leu Val Thr Asp Asp Asp His Arg Ser
145 150 155 160
Arg Glu Phe Glu Asp Phe Ser Thr Phe Leu Pro His Arg Phe Gly Pro
165 170 175
Asp Asp Leu Leu His Lys Asp Ser Pro Leu Ile Leu Glu Arg His Asp
180 185 190
Asn Lys Leu Ser Leu Ile Ser Asp Ser Asp Ser Glu Ser Asp Ser Glu
195 200 205
Lys Glu Leu Lys Met Glu Ala Leu Glu Ala Ala Asn Arg Ser His Ala
210 215 220
Pro Tyr Ser Lys Cys Pro Ser Gly Ile Ala Met Met Asp Arg Glu Gly
225 230 235 240
Lys Val Tyr Arg Gly Ser Tyr Met Glu Ser Ala Ala Phe Asn Pro Ser
245 250 255
Leu Gly Pro Val Gln Ala Ala Met Val Ala Cys Ile Val Gly Gly Ala
260 265 270
Tyr Glu Lys Ile Val Ala Ala Ile Leu Val Glu Lys Glu Gly Ala Leu
275 280 285
Val Arg Gln Glu Ala Thr Ala Arg Leu Ile Leu Ala Thr Val Ala Pro
290 295 300
Lys Cys Glu Leu Lys Val Phe His Cys Arg Ser Gly
305 310 315
<210> 25
<211> 314
<212> PRT
<213> Populus trichocarpa
<400> 25
Met Asp Gly Pro Ile Phe Val Ile Glu Ala Ser Glu Ala Glu Ser Met
1 5 10 15
Ala Lys Gln Ser Gly Leu Thr Val Leu Gln Leu Leu Pro Thr Leu Val
20 25 30
Lys Ser Ala Gln Ala Leu Ala Arg Pro Pro Ile Ser Lys Tyr His Val
35 40 45
Gly Ala Val Gly Leu Gly Ser Ser Gly Arg Ile Phe Leu Gly Gly Asn
50 55 60
Ile Glu Phe Pro Gly Leu Pro Leu His His Ser Val His Ala Glu Gln
65 70 75 80
Phe Leu Ile Thr Asn Leu Thr Leu Asn Ala Glu Pro Ser Leu Lys Tyr
85 90 95
Val Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Phe Gln
100 105 110
Glu Ile Arg His Ala Pro Asp Ile His Ile Leu Ile Thr Gly Asp Ser
115 120 125
Asn Ser Asn His Asn Tyr Lys Asn Asp Leu Ala Asn Glu Glu Gln Phe
130 135 140
Glu Pro Met Ser Cys Leu Leu Pro His Arg Phe Gly Pro Asp Asp Leu
145 150 155 160
Leu Gly Lys Asp Val Pro Leu Leu Leu Glu Pro His His Asn Asn Leu
165 170 175
Ser Phe Leu Gly Asp Asp Lys Leu Pro Tyr Gly Val Ser Ala Ala Leu
180 185 190
Glu Asp Leu Lys Asn Glu Ala Leu Glu Ala Ala Lys Lys Ser His Ala
195 200 205
Pro Tyr Thr Asn Cys Pro Ser Gly Leu Ala Leu Met Asp Cys Glu Gly
210 215 220
Lys Val Phe Arg Gly Ser Tyr Met Glu Ser Ala Ala Tyr Asn Pro Ser
225 230 235 240
Met Gly Pro Val Gln Ala Ala Leu Val Ala Tyr Val Met Gly Gly Arg
245 250 255
Gly Gly Gly Phe Asp Arg Ile Val Ala Ala Ala Leu Val Glu Lys Gln
260 265 270
Gly Ala Gln Ala Arg Gln Glu Gln Thr Ala Arg Leu Leu Leu Lys Glu
275 280 285
Ile Ser Pro Lys Cys Glu Phe Met Val Phe His Cys Val Ser Asn Ser
290 295 300
Gly Ser Ser Pro Ser Phe Asn Asn Leu Asn
305 310
<210> 26
<211> 315
<212> PRT
<213> Populus trichocarpa
<400> 26
Met Asp Gly Pro Ile Phe Val Ile Glu Ala Ser Glu Ala Glu Ser Met
1 5 10 15
Ala Lys Gln Ser Gly Leu Thr Val Leu Gln Leu Leu Pro Ala Leu Val
20 25 30
Lys Ser Ala Gln Ala Leu Ala Arg Pro Pro Ile Ser Asp Tyr His Val
35 40 45
Gly Ala Val Gly Leu Gly Ser Ser Gly Arg Ile Phe Leu Gly Gly Asn
50 55 60
Leu Glu Phe Pro Gly Leu Pro Leu His His Ser Val His Ala Glu Gln
65 70 75 80
Phe Leu Ile Thr Asn Leu Thr Leu Asn Ala Glu Pro Ser Leu Lys Tyr
85 90 95
Ile Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Leu Gln
100 105 110
Glu Ile Arg His Ala Pro Asp Val Gln Ile Leu Ile Thr Gly Asp Ser
115 120 125
Thr Asn Asn Gln Ser Tyr Lys Asn Asp Leu Ala Asn Lys Gln Gln Phe
130 135 140
Glu Pro Leu Ser Cys Leu Leu Pro His Arg Phe Gly Pro Asp Asp Leu
145 150 155 160
Leu Asp Lys Asp Ile Pro Leu Leu Leu Glu Thr Arg His Asn Asn Leu
165 170 175
Ser Phe Val Gly Asp Ala Leu Leu Pro Asn Gly Ile Cys Ala Ser Phe
180 185 190
Asp Asp Leu Glu Asn Glu Ala Leu Glu Ala Ala Asn Lys Ser His Ala
195 200 205
Pro Phe Thr Asn Cys Pro Ser Gly Val Ala Leu Met Asp Cys Glu Gly
210 215 220
Lys Val Tyr Arg Gly Ser Tyr Met Glu Ser Ala Ala Tyr Asn Pro Ser
225 230 235 240
Ile Gly Pro Val Gln Ala Ala Leu Val Ala Tyr Val Met Gly Gly Arg
245 250 255
Gly Gly Gly Tyr Asp Arg Ile Val Ala Ala Val Leu Val Glu Lys Gln
260 265 270
Gly Ala Lys Ala Arg Gln Glu Gln Thr Ala Arg Leu Leu Leu Lys Glu
275 280 285
Ile Ser Pro Lys Cys Glu Leu Lys Val Phe His Cys Gly Ser Ser Ser
290 295 300
Ser Phe Asn Gly Cys Asn Asn Gln Asn Ser Cys
305 310 315
<210> 27
<211> 287
<212> PRT
<213> Medicago truncatula
<400> 27
Met Asp Arg Pro Val Phe Val Ile Glu Ala Thr Glu Ala Lys Ser Met
1 5 10 15
Ala Lys Ser Ser Ser Leu Thr Leu Thr Gln Leu Leu Pro Ser Leu Val
20 25 30
Thr Ser Ala Gln Ser Leu Ala Arg Pro Pro Ile Ser Asn Phe His Val
35 40 45
Gly Ala Val Gly Leu Ser Pro Ser Gly Arg Ile Leu Ile Gly Val Asn
50 55 60
Val Glu Phe Pro Gly Leu Pro Leu His His Ser Ile His Ala Glu Gln
65 70 75 80
Phe Leu Leu Thr Asn Leu Ser Leu His Asp Glu Pro Asn Leu His Ser
85 90 95
Phe Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Phe Gln
100 105 110
Glu Ile Arg Gly Ala Pro Asp Ile Gln Ile Ile Ile Thr Ser Glu Ser
115 120 125
Asp Pro Asn Phe Thr Ser Leu Ser His Phe Leu Pro Tyr Arg Phe Gly
130 135 140
Pro His Asp Leu Leu Pro Gln His Ala Pro Leu Phe Leu Glu Pro Arg
145 150 155 160
Asn Asn Gly Leu Thr Gln Lys Leu Pro Asn Gly Val Cys Lys Gly Asp
165 170 175
Ala Val Asp Glu Lys Leu Lys Ile Ala Ala Met Glu Gly Ala Asn Lys
180 185 190
Ser His Ala Pro Tyr Ser Asn Ser Pro Ser Gly Met Ala Ile Val Asp
195 200 205
Cys Asn Gly Lys Ile Tyr Lys Gly Ser Tyr Val Glu Ser Ala Ala Phe
210 215 220
Asn Pro Ser Leu Gly Pro Leu Gln Ala Ala Val Val Ala Phe Met Val
225 230 235 240
Gly Gly Gly Gly Lys Tyr Asp Glu Ile Val Gly Ala Val Leu Val Glu
245 250 255
Lys Asp Gly Ala Met Val Lys Gln Glu Gly Thr Val Arg Leu Leu Leu
260 265 270
Glu Ala Ile Ser Pro Lys Cys Gln Leu Gln Thr Phe Leu Cys Glu
275 280 285
<210> 28
<211> 298
<212> PRT
<213> Medicago truncatula
<400> 28
Met Glu Glu Pro Lys Phe Ile Ile Glu Val Ser Glu Ala Lys Ser Leu
1 5 10 15
Thr Gln Ser Thr Asn Leu Thr Leu Thr Gln Leu Leu Pro Thr Ile Val
20 25 30
Lys Ser Ser Gln Pro Leu Ala Arg Val Pro Ile Ser Lys Phe His Val
35 40 45
Ala Ala Val Ala Val Gly Ile Ser Gly Arg Ile Phe Ile Gly Val Asn
50 55 60
Val Glu Phe Pro Asn Leu Pro Phe His His Thr Ile His Ala Glu Gln
65 70 75 80
Phe Leu Leu Thr Asn Leu Ser His Asn Lys Glu Thr Gln Ile Gln Tyr
85 90 95
Leu Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Leu Gln
100 105 110
Glu Ile Arg Gly Ala Gly Asp Ile Pro Leu Leu Ile Thr Ser Asp Leu
115 120 125
Glu Lys Thr Ala Lys Thr Glu Phe Thr Ser Leu Ser Glu Phe Leu Ser
130 135 140
His Pro Phe Gly Pro His Asp Leu Leu Pro Lys His Val Pro Leu Leu
145 150 155 160
Leu Glu Pro Arg His Asn Asp Leu Cys Phe Ile Glu Ile Glu Gly Pro
165 170 175
Ile Thr Asn Ser Ile Lys Pro His Leu Lys Asp Ala Ala Leu Lys Ala
180 185 190
Ala Asn Glu Ser His Ala Pro Tyr Ser Asp Ser Pro Ser Gly Val Ala
195 200 205
Leu Leu Asp Ser Arg Gly Asn Val Tyr Lys Gly Ser Tyr Met Glu Ser
210 215 220
Ala Ala Tyr Asn Pro Ser Met Gly Pro Val Gln Ala Ala Leu Val Ala
225 230 235 240
Phe Ile Val Gly Ala Arg Asp Gly Ala Ala Glu Tyr Gly Glu Leu Val
245 250 255
Glu Ala Val Leu Val Glu Lys Glu Asp Ala Val Val Lys Gln Glu Asp
260 265 270
Thr Ala Arg Leu Leu Leu Arg Ser Ile Ala Pro Gln Cys Ser Phe Asn
275 280 285
Val Phe Leu Cys Ser Ser Asn Asn Lys Ile
290 295
<210> 29
<211> 286
<212> PRT
<213> Phaseolus vulgaris
<400> 29
Met Glu Gln Ala Lys Pro Lys Phe Val Ile Ser Ala Ser Glu Ala Gln
1 5 10 15
Ser His Ser Ser Pro Ile Ala Lys Leu Leu Pro Ser Leu Val Pro Ser
20 25 30
Ala His Ser Leu Ala Arg Pro Pro Ile Ser Asn Phe Pro Val Ala Ala
35 40 45
Val Gly Leu Gly Ser Ser Gly Arg Ile Phe Val Gly Val Asn Val Glu
50 55 60
Phe Pro Gly Leu Pro Phe His His Thr Ile His Ala Glu Gln Phe Leu
65 70 75 80
Leu Thr Asn Leu Ser Leu His Gly Glu Thr Arg Leu Asp Ser Phe Ala
85 90 95
Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Leu Gln Glu Leu
100 105 110
Arg Asp Ala Pro Asp Ile Gln Ile Leu Ile Thr Ser His Ala Asn Pro
115 120 125
Asn Phe Thr Pro Leu Ser His Phe Leu Ser His Arg Phe Gly Pro His
130 135 140
Asp Leu Leu Pro Lys Thr Ala Pro Leu Leu Leu Glu Pro Arg His Asn
145 150 155 160
Ala Leu Ser Leu Pro Thr Pro Ile Pro His Asn Thr Asn Pro Asn Leu
165 170 175
Thr Leu Ser Ala Leu Glu Ala Ala Asn Ser Ser His Ala Pro Tyr Ser
180 185 190
Ala Ala Pro Ser Gly Val Ala Leu Leu Asp Ser Lys Gly Thr Val Tyr
195 200 205
Lys Gly Ser Tyr Ile Glu Ser Ala Ala Tyr Asn Pro Ser Leu Gly Pro
210 215 220
Leu Gln Ala Ala Leu Val Ala Phe Ile Val Gly Gly Gly Gly Ala Tyr
225 230 235 240
Asp Glu Ile Val Gly Ala Val Leu Val Glu Lys Glu Gly Ala Val Ile
245 250 255
Lys Gln Glu Pro Thr Ala Arg Leu Leu Leu His Ser Ile Ser Pro His
260 265 270
Cys His Phe Arg Thr Phe Leu Ala Thr Ser Ser His Ser Pro
275 280 285
<210> 30
<211> 299
<212> PRT
<213> Phaseolus vulgaris
<400> 30
Met Asp His Ser Pro Lys Phe Leu Ile Asp Pro Ser Glu Ala Leu Ala
1 5 10 15
Leu Ala Asp Ser Ala Ala Ile Thr Leu Pro Glu Leu Leu Pro Arg Leu
20 25 30
Val Pro Ala Ala Gln Ser Leu Ala Arg Pro Pro Ile Ser Asn Phe Pro
35 40 45
Val Ala Ala Val Gly Leu Gly Pro Ser Gly Arg Ile Phe Val Gly Val
50 55 60
Asn Leu Glu Phe Pro Gly Leu Pro Leu His His Ser Val His Ala Glu
65 70 75 80
Gln Phe Leu Leu Cys Asn Leu Ser Leu Asn Ala Glu Ala Asn Leu Ala
85 90 95
Ser Phe Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Leu
100 105 110
Gln Glu Leu Arg Ala Ala Ser Asp Val Asn Ile Leu Val Thr Ser His
115 120 125
Ala Thr Pro Gln Phe Thr Pro Leu Ser Asp Phe Leu Pro His Gln Phe
130 135 140
Gly Pro His Asp Leu Leu Ser Leu Arg Thr Pro Leu Leu Leu Glu Pro
145 150 155 160
His His Asn Ala Leu Thr Leu Leu Pro Ser His Ala Ala Asn Asp Ala
165 170 175
Ala Leu Ser Asn Gly His Leu His Asn His Lys Leu Lys Asn Ala Ala
180 185 190
Leu Asp Ala Ala Asn Lys Ser His Ala Pro Tyr Thr Ala Ser Pro Ser
195 200 205
Gly Val Ala Leu Leu Asp Arg Gln Gly Asn Leu Tyr Lys Gly Ser Tyr
210 215 220
Leu Glu Ser Ala Ala Phe Asn Pro Ser Leu Gly Pro Val Gln Ala Ala
225 230 235 240
Leu Val Ala Phe Val Ala Ala Gly Gly Gly Asp Tyr His Gln Ile Val
245 250 255
Asp Ala Val Leu Val Glu Lys Glu Asp Ala Ala Val Lys Gln Glu His
260 265 270
Thr Ala Arg Leu Leu Leu His Ser Ile Ser Pro Asp Cys Asn Phe Ser
275 280 285
Thr Phe Leu Cys His Ser Glu Pro Ser Pro Pro
290 295
<210> 31
<211> 306
<212> PRT
<213> Malus domestica
<220>
<221> UNSURE
<222> (284)..(284)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> UNSURE
<222> (284)..(284)
<223> The 'Xaa' at location 284 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (284)..(284)
<223> The 'Xaa' at location 284 stands for Gln, Arg, Pro, or Leu.
<400> 31
Met Asp Arg Pro Arg Phe Val Ile Glu Ala Ser Glu Ala Glu Ser Met
1 5 10 15
Ala Lys Gln Ser Asn Leu Thr Val Leu Gln Leu Leu Pro Ser Leu Val
20 25 30
Lys Pro Ala Gln Ala Leu Ala Arg Pro Pro Ile Ser Lys Phe His Val
35 40 45
Gly Ala Val Gly Tyr Gly Ser Ser Gly Arg Ile Phe Leu Gly Val Asn
50 55 60
Leu Glu Phe Pro Gly Leu Pro Leu His Tyr Ser Val His Ala Glu Gln
65 70 75 80
Phe Leu Val Thr Asn Leu Ser Ile Gln Ser Glu Ser Lys Leu Gln Tyr
85 90 95
Ile Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Phe Gln
100 105 110
Glu Ile Arg Gly Ala Gln Asp Ile Gln Ile Leu Ile Thr Ser Val Glu
115 120 125
Ser Gly Asp Asp Asn Ser Gly Leu Asn Gly Phe Asp Pro Leu Leu His
130 135 140
Leu Leu Pro His Arg Phe Gly Pro Glu Asp Leu Leu Gly Gln Asp Val
145 150 155 160
Pro Leu Leu Leu Glu Gln His Glu Asn Gly Leu Ser Phe Phe Ser Glu
165 170 175
Thr Gln Asn Leu Thr Gly Asp Phe Lys Leu Asn Ala Glu Leu Lys Ala
180 185 190
Ala Ala Leu Glu Ala Ala Asn Lys Ser His Ala Pro Tyr Ser Gly Cys
195 200 205
Pro Ser Gly Val Ala Ile Leu Asp Ser Asp Gly Lys Met Tyr Lys Gly
210 215 220
Ser Tyr Met Glu Ser Ala Ala Tyr Asn Pro Ser Met Gly Pro Ala Gln
225 230 235 240
Ala Ala Leu Val Ala Tyr Ile Val Gly Gly Gly Gly Gly Tyr Glu Lys
245 250 255
Ile Val Val Ala Val Leu Val Glu Lys Ala Asp Val Leu Val Lys Gln
260 265 270
Glu His Thr Ala Arg Leu Leu Phe Gln Ala Ile Xaa Pro Lys Ile Asp
275 280 285
Phe Arg Val Phe His Cys Asp Ser Gly Ser Asn Cys Val Lys Ser Leu
290 295 300
Asp Leu
305
<210> 32
<211> 301
<212> PRT
<213> Malus domestica
<220>
<221> UNSURE
<222> (136)..(136)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> UNSURE
<222> (136)..(136)
<223> The 'Xaa' at location 136 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (136)..(136)
<223> The 'Xaa' at location 136 stands for Gln, Arg, Pro, or Leu.
<400> 32
Met Asp Gln Pro Arg Phe Val Ile Glu Ala Ser Glu Ala Glu Ser Met
1 5 10 15
Ala Lys Gln Ser Asn Leu Thr Val Leu Gln Leu Leu Pro Ser Leu Val
20 25 30
Lys Pro Ala Gln Asp Leu Ala Arg Pro Pro Ile Ser Lys Phe His Val
35 40 45
Gly Ala Val Gly Tyr Gly Ser Ser Gly Arg Ile Phe Ile Gly Val Asn
50 55 60
Leu Glu Phe Pro Gly Leu Pro Leu His Tyr Ser Val His Ala Glu Gln
65 70 75 80
Phe Leu Val Thr Asn Leu Ser Ile Gln Ser Glu Ser Lys Leu Gln Tyr
85 90 95
Ile Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Ile Gln
100 105 110
Glu Ile Arg Gly Ala Gln Asp Ile Gln Ile Leu Ile Thr Ser Val Glu
115 120 125
Ser Gly Asp Asp Asn Ser Gly Xaa Asn Ser Phe Asp Pro Leu Leu His
130 135 140
Leu Leu Pro His Arg Phe Gly Pro Glu Asp Leu Leu Gly Gln Asp Val
145 150 155 160
Pro Leu Leu Leu Glu Gln Leu Glu Asn Gly Leu Ser Phe Leu Ser Glu
165 170 175
Thr Glu Asn Leu Thr Asp Asp Phe Lys Leu Asn Ala Glu Leu Lys Ala
180 185 190
Ala Ala Leu Glu Ala Ala Asn Lys Ser His Ala Pro Tyr Ser Gly Cys
195 200 205
Pro Ser Gly Val Ala Ile Leu Asp Ser Asp Gly Lys Met Tyr Lys Gly
210 215 220
Ser Tyr Met Glu Ser Ala Ala Tyr Asn Pro Ser Met Gly Pro Ala Gln
225 230 235 240
Ala Ala Val Val Ala Tyr Ile Val Gly Gly Gly Gly Gly Tyr Glu Lys
245 250 255
Ile Val Gly Ala Val Leu Val Glu Lys Asp Gly Ala Val Val Lys Gln
260 265 270
Glu His Thr Ala Arg Leu Leu Phe Gln Ala Ile Ser Pro Lys Val Asp
275 280 285
Phe Arg Val Phe His Cys Gly Ser Gly Ser Asn Gly Val
290 295 300
<210> 33
<211> 295
<212> PRT
<213> Gossypium raimondii
<400> 33
Met Asp Arg Pro Arg Phe Val Ile Asp Ala Val Glu Ala Glu Gln Met
1 5 10 15
Ala Lys Gln Ser Gly Lys Thr Val Leu Gln Leu Leu Pro Ser Leu Val
20 25 30
Lys Ser Ala Gln Ser Val Ala Arg Pro Pro Ile Ser Lys Tyr His Val
35 40 45
Gly Ala Val Gly Ile Gly Ser Ser Gly Arg Ile Phe Phe Gly Ala Asn
50 55 60
Leu Glu Phe Pro Gly Leu Pro Leu Asn His Ser Val His Ala Glu Gln
65 70 75 80
Phe Leu Ile Thr Asn Leu Ser Leu Asn Ala Glu Leu Arg Leu Lys Tyr
85 90 95
Ile Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Phe Gln
100 105 110
Glu Leu Arg Gly Ala Pro Asp Val Lys Met Leu Ile Thr Ser Ser Asp
115 120 125
Asp Glu Lys Glu Asn Lys Ile Ser Asn Thr Cys Asn Asp Lys Asp Gln
130 135 140
Glu Phe Thr Pro Leu Ser His Phe Leu Pro His Arg Phe Gly Pro Asp
145 150 155 160
Asp Leu Leu Gly Lys Asp Ala Pro Leu Leu Leu Glu Pro Arg Arg Asn
165 170 175
Gly Leu Ser Phe Thr Ser Asp Gly Cys Glu Asn Asp Glu Leu Lys His
180 185 190
Ala Ala Leu Asp Ala Ala Asn Met Ser Tyr Ala Pro Tyr Ser Gly Cys
195 200 205
Pro Ser Gly Val Ala Leu Ile Asp Val Glu Gly Lys Ile Tyr Lys Gly
210 215 220
Ser Tyr Met Glu Ser Ala Ala Tyr Asn Pro Ser Leu Pro Pro Val Gln
225 230 235 240
Ala Ala Ile Val Ala Tyr Val Ala Ser Gly Gly Gly Gly Gly Tyr Glu
245 250 255
Arg Ile Val Arg Ala Val Leu Val Glu Lys Ser Asp Ala Val Ile Lys
260 265 270
Gln Glu His Thr Ala Arg Leu Leu Leu Gln Cys Ile Ser Pro Lys Cys
275 280 285
Glu Val Lys Val Phe Ser Val
290 295
<210> 34
<211> 312
<212> PRT
<213> Gossypium raimondii
<400> 34
Met Asp Arg Pro Arg Phe Val Ile Asp Ser Ala Glu Val Glu Gln Met
1 5 10 15
Ala Lys Gln Ser Arg Gln Thr Val Leu His Leu Leu Pro Ser Leu Val
20 25 30
Lys Ser Ala Gln Ala Leu Ala Arg Pro Pro Ile Ser Asn Tyr His Val
35 40 45
Gly Ala Val Gly Val Gly Ser Ser Gly Arg Ile Phe Phe Gly Val Asn
50 55 60
Leu Glu Phe Pro Gly Leu Pro Leu Asn His Ser Val His Ala Glu Gln
65 70 75 80
Phe Leu Ile Thr Asn Leu Ser Leu Asn Ala Glu Ser Cys Leu Lys Phe
85 90 95
Leu Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Leu Gln
100 105 110
Glu Leu Arg Ala Ala Pro Asp Val Asn Ile Leu Ile Thr Ala Thr Glu
115 120 125
Lys Glu Lys Glu Asn Lys Thr Ile Asn Asn Arg Asn Asp Lys Asp Met
130 135 140
Gln Phe Ala Pro Leu Ser Tyr Phe Leu Pro His Arg Phe Gly Pro Asp
145 150 155 160
Asp Leu Leu Glu Lys Asp Val Pro Leu Leu Leu Glu Pro His Arg Asn
165 170 175
Gly Leu Ser Phe Cys Ser Asp Thr Arg Asn Gly Glu Ile Asn Gly Asp
180 185 190
Gly Asp Asp Ser Lys Tyr Ala Ala Leu Glu Ala Ala Asn Ala Ser His
195 200 205
Ala Pro Tyr Ser Gly Cys Pro Ser Gly Ala Ala Leu Val Asp Val Glu
210 215 220
Gly Lys Ile Tyr Lys Gly Ser Tyr Met Glu Ser Ala Ala Tyr Asn Pro
225 230 235 240
Ser Leu Pro Pro Val Gln Ala Ala Leu Val Ala Tyr Val Ala Ser Gly
245 250 255
Gly Gly Gly Gly Tyr Glu Arg Ile Val Lys Ala Val Leu Val Glu Thr
260 265 270
Ala Asp Ala Val Ile Lys Gln Glu His Thr Ala Lys Leu Leu Leu Gln
275 280 285
Cys Ile Ser Pro Lys Cys Glu Phe Lys Val Phe His Cys Asn Lys Asn
290 295 300
Met Leu Lys Cys Glu Tyr Gln Val
305 310
<210> 35
<211> 304
<212> PRT
<213> Gossypium raimondii
<400> 35
Met Asp Arg Pro Arg Phe Val Ile Glu Ser Ala Glu Ala Glu Gln Met
1 5 10 15
Ala Lys Gln Ser Gly Gln Thr Val Leu Gln Leu Leu Pro Ser Leu Val
20 25 30
Lys Ser Ala Gln Thr Leu Ala Arg Pro Pro Ile Ser Asn Tyr His Val
35 40 45
Gly Ala Val Gly Met Gly Pro Ser Gly Arg Ile Phe Phe Gly Val Asn
50 55 60
Leu Glu Phe Pro Gly Leu Pro Leu Asn Gln Ser Val His Ala Glu Gln
65 70 75 80
Phe Leu Ile Thr Asn Leu Ser Leu Asn Ala Glu Pro Arg Leu Arg Tyr
85 90 95
Leu Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Leu Gln
100 105 110
Glu Leu Arg Gly Ala Ser Asp Val Lys Ile Leu Ile Thr Ser Ser Glu
115 120 125
Asp Glu Lys Glu Asn Lys Ile Asn Asn Asn Cys Asn Asp Lys Asp Gln
130 135 140
Glu Phe Thr Pro Leu Ser His Phe Leu Pro His Arg Phe Gly Pro Asp
145 150 155 160
Asp Leu Leu Glu Lys Asp Val Pro Leu Leu Leu Glu Pro His Arg Asn
165 170 175
Gly Leu Ser Phe Cys Asn Asp Leu Cys Asn Gly Lys Ile Asn Gly Val
180 185 190
Asp Asp Leu Lys His Ala Ala Val Asp Ala Ala Asn Met Ser His Ala
195 200 205
Pro Tyr Ser Gly Cys Pro Ser Gly Met Ala Leu Leu Asp Val Glu Gly
210 215 220
Lys Ile Tyr Lys Gly Ser Tyr Met Glu Ser Ala Ala Tyr Asn Pro Ser
225 230 235 240
Leu Pro Pro Ala Gln Ala Ala Leu Val Ala Tyr Val Ala Gly Gly Gly
245 250 255
Gly Gly Gly Tyr Glu Arg Ile Val Gly Ala Val Leu Val Glu Lys Ala
260 265 270
Asp Ala Val Ile Lys Gln Glu His Thr Ser Arg Leu Leu Leu Gln Cys
275 280 285
Ile Ser Pro Lys Cys Glu Phe Lys Val Phe His Cys Thr Lys Thr Cys
290 295 300
<210> 36
<211> 316
<212> PRT
<213> Sorghum bicolor
<400> 36
Met Gly Glu Glu Gln Ala Ala Pro Lys Pro Ala Glu Ala Ala Ala Ala
1 5 10 15
Ala Leu Glu Leu Thr Gly Phe Val Met Ser Ala Glu Asp Ala Glu Arg
20 25 30
Arg Ala Ala Ala Ala Gly Val Ala Thr Val Gln Asp Leu Leu Pro Leu
35 40 45
Leu Ile Pro Ser Ala Arg Lys Arg Ala Gln Val Pro Ile Ser Asp Phe
50 55 60
Gln Val Gly Ala Val Gly Leu Gly Ala Ser Gly Arg Val Tyr Val Gly
65 70 75 80
Val Asn Leu Glu Phe Arg Gly Val Pro Leu Cys His Ser Val His Ala
85 90 95
Glu Gln Phe Leu Val Ala Asn Ala Ala Ala Ala Gly Glu Ser Ala Leu
100 105 110
Arg Ala Val Ala Val Ser His Met Pro Cys Gly His Cys Arg Gln Phe
115 120 125
Leu Gln Glu Ile Arg Gly Ala Ala Gly Ile Gln Ile Leu Val Thr Ser
130 135 140
Asp Ala Glu Gln Gly Arg Ala Pro Glu Trp Arg Thr Val Ala Ser Leu
145 150 155 160
Leu Leu Arg Pro Phe Gly Pro His Asp Leu Leu Asp Glu Asn Val Pro
165 170 175
Leu Val Leu Glu Ala His Asp Asn Ala Leu Gly Asp Pro Val Val Ala
180 185 190
Ala Ala Ala Asn Gly Phe Ala Pro Gly Asp Leu Asp Ala Arg Leu Arg
195 200 205
Asp Ala Ala Glu Ala Ala Ala Arg Ala Ala His Ala Pro Tyr Ser Gln
210 215 220
Cys Pro Ser Gly Phe Ala Val Ala Asp Gly Asp Gly Arg Ile Tyr Ala
225 230 235 240
Gly Gly Cys Leu Glu Ser Ala Ala Tyr Asn Pro Thr Leu Gly Pro Val
245 250 255
Gln Ala Ala Ile Ile Ala Met Val Ala Ala Gly Gly Gly Pro Ala Gly
260 265 270
Asp Val Val Ala Ala Ala Leu Val Glu Lys Glu Gln Ala Ala Val Ala
275 280 285
Gln Glu Ala Thr Ala Arg Ile Phe Leu Asp Ala Met Ala Pro His Ala
290 295 300
Ser Phe His Val Tyr Lys Tyr Arg Pro Ser Asp Ala
305 310 315
<210> 37
<211> 351
<212> PRT
<213> Helianthus annuus
<400> 37
Met Asn Leu Pro Pro Lys Thr Tyr Ile Ile Glu Ala Glu Glu Ala Lys
1 5 10 15
Ser Met Ala Lys Ser Lys Asn Leu Thr Leu Pro Gln Leu Leu Pro Ser
20 25 30
Leu Ile Lys Ser Ala Gln Asn Leu Ala Asn Pro Pro Ile Ser Asn Phe
35 40 45
Ala Val Gly Val Val Gly Leu Thr Ser Asp Gly Arg Ile Phe Phe Gly
50 55 60
Gly Asn Ile Glu Phe Pro Gly Leu Pro Leu His His Thr Ile His Gly
65 70 75 80
Glu Gln Phe Phe Ile Thr Asn Leu Ala Ala His Ala Ala Gly Ala Lys
85 90 95
Leu Leu Tyr Leu Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln
100 105 110
Phe Phe Gln Glu Leu Arg Gly Ile Ser Asp Thr Gln Ile Val Ile Thr
115 120 125
Asp Gln Pro Gln Glu Asn Pro Asp Tyr Lys Pro Phe Ser Ser Ile Leu
130 135 140
Pro His Pro Phe Gly Pro Phe Asp Leu Leu Asp Glu Asn Thr Pro Leu
145 150 155 160
Ile Leu Glu Lys His Asn Asn Gln Leu Thr Leu Lys Asp Gly Asn Phe
165 170 175
Ile Ile Gln Asp Glu Lys Met Cys Asp Leu Pro Asn Gly Phe Ser Asp
180 185 190
Leu Asn Ala Lys Asn Glu Glu Phe Phe Lys Thr Glu Ala Ser Ala Ala
195 200 205
Ala Arg Gly Ser His Ala Pro Pro Lys Asp Gly Asn Phe Ile Asn Gln
210 215 220
Asp Glu Asn Thr Phe Asp Leu Ser Asn Gly Phe Cys Lys Leu Ile Ala
225 230 235 240
Lys Asn Lys Glu Phe Leu Lys Ser Glu Ala Leu Ala Ala Ala Arg Gly
245 250 255
Ser His Ala Pro Tyr Ser Gly Cys Pro Ser Gly Val Ala Leu Met Asp
260 265 270
Cys Glu Gly Lys Val Tyr Lys Gly Ser Tyr Met Glu Ser Ala Ala Tyr
275 280 285
Asn Pro Ser Met Met Pro Val Gln Ala Ala Leu Val Ala Tyr Met Val
290 295 300
Ala Gly Gly Gly Gly Tyr Glu Lys Ile Val Ala Ala Val Val Val Glu
305 310 315 320
Lys Glu Gly Val Val Val Arg Gln Glu Asp Thr Ala Arg Leu Val Leu
325 330 335
Lys His Leu Ser Pro Lys Cys Glu Phe Arg Val Val Tyr Cys Asn
340 345 350
<210> 38
<211> 303
<212> PRT
<213> Helianthus annuus
<400> 38
Met Ala Asn Tyr Ile Ile Ser Ala Ser Asp Ala Glu Ser Ile Ala Lys
1 5 10 15
Ser Lys Asn Leu Thr Val Pro Gln Leu Leu Pro Ser Leu Val Gln Ser
20 25 30
Ala Gln Ser Leu Ala Arg Pro Pro Ile Ser Asn Phe His Val Gly Ala
35 40 45
Val Gly Leu Ser Ser Asp Gly Arg Ile Phe Phe Gly Gly Asn Val Glu
50 55 60
Phe Pro Gly Leu Pro Leu His His Ser Ile His Ala Glu Gln Phe Leu
65 70 75 80
Ile Thr Asn Leu Ala Ala His Gly Gly Gly Pro Lys Leu Leu Tyr Val
85 90 95
Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Leu Gln Glu
100 105 110
Leu Arg Gly Val Cys Asp Thr Gln Ile Val Ile Thr Asp Gln Pro Gln
115 120 125
Glu Asn Pro Val Tyr Lys Pro Ile Ser Ser Ile Leu Pro Asp Pro Phe
130 135 140
Gly Pro Phe Asp Leu Leu Asp Gln Asn Thr Pro Leu Ile Leu Glu Lys
145 150 155 160
His Asp Asn Lys Leu Ser Phe Lys Asp Asp Asp Phe Leu Thr Glu Lys
165 170 175
Phe Ser Asn Leu Ser Asn Leu Ser Asn Gly Tyr Ser Glu Leu Val Met
180 185 190
Lys Asn Glu Val Val Leu Lys Asn Lys Ala Leu Glu Ala Ala Arg Glu
195 200 205
Ser His Ala Pro Tyr Ser Gly Cys Pro Ser Gly Val Ala Leu Met Asp
210 215 220
Cys Glu Gly Lys Val Tyr Lys Gly Ser Tyr Met Glu Ser Ala Ala Tyr
225 230 235 240
Asn Pro Ser Met Met Pro Val Gln Ala Ala Leu Val Ala Tyr Met Ala
245 250 255
Ala Gly Gly Gly Gly Tyr Glu Arg Ile Val Ala Ala Val Leu Val Glu
260 265 270
Lys Glu Gly Val Val Val Arg Gln Glu Glu Thr Ala Arg Leu Met Leu
275 280 285
Asn His Val Ser Pro Lys Cys Glu Leu Arg Val Ala His Cys Val
290 295 300
<210> 39
<211> 304
<212> PRT
<213> Arachis hypogaea
<400> 39
Met Asp Arg Pro Gly Phe Val Ile Glu Pro Ile Glu Ala Gln Pro Thr
1 5 10 15
Ala Gln Ser Leu Ala Ser Leu Val Asp Ser Thr Arg Thr Leu Ala Arg
20 25 30
Pro Pro Ile Ser Asn Phe Arg Val Ser Ala Val Gly Leu Gly Ser Ser
35 40 45
Gly Arg Ile Tyr Val Gly Val Asn Leu Glu Phe Pro Gly Leu Pro Leu
50 55 60
His His Ser Val His Ala Glu Gln Phe Leu Val Thr Asn Leu Leu Leu
65 70 75 80
Asn Gly Glu Thr Ser Leu Thr Ser Phe Ala Val Ser Ala Ala Pro Cys
85 90 95
Gly His Cys Arg Gln Phe Leu Gln Glu Leu Arg Asp Ser Glu Asn Ile
100 105 110
Gln Ile Leu Ile Ile Ser Asp Asn Asn Asp Gln Phe Thr Pro Leu Ser
115 120 125
Asp Phe Leu Pro His Arg Phe Gly Pro Leu Asp Leu Phe Pro Glu Gly
130 135 140
Ser Pro Phe Leu Leu Glu Pro Arg Asn Asn Gly Leu Lys Phe Thr Thr
145 150 155 160
Arg Thr Arg Met Asp Arg Asp Asp Glu Asp Glu Asn Leu Arg Ser Asn
165 170 175
Asn Gly Val Val Cys Asn Gly His Gly Thr Val Val Asp Asp Glu Lys
180 185 190
Leu Lys Ile Ala Ala Leu Glu Ala Ala Asn Ala Ser His Ala Pro Tyr
195 200 205
Ser Gly Ser Pro Ser Gly Val Ala Leu Val Asp Cys Gly Gly Lys Val
210 215 220
Tyr Lys Gly Ser Tyr Met Glu Ser Ala Ala Phe Asn Pro Ser Leu Gly
225 230 235 240
Pro Val Gln Ala Ala Val Val Ala Phe Val Ala Gly Gly Gly Gly Gly
245 250 255
Tyr Asp Glu Ile Val Gly Ala Val Leu Val Glu Lys Glu Gly Ala Pro
260 265 270
Val Arg Gln Glu Glu Thr Ala Arg Leu Leu Met Arg Ser Ile Ser Ala
275 280 285
Thr Cys Ser Phe Gln Ala Leu Leu Cys Cys Ser Asp Ser Asn Tyr Val
290 295 300
<210> 40
<211> 142
<212> PRT
<213> Arachis hypogaea
<400> 40
Met Thr Asn Phe Tyr Phe Leu Pro His His Tyr Asp Pro Ile Asp Leu
1 5 10 15
Phe Leu Glu Thr Ser Pro Phe Leu Leu Glu Pro Arg Asn Asn Glu Leu
20 25 30
Arg Leu Thr Thr Arg Thr Arg Met Asn His Asp Asp Glu Asp Glu Asn
35 40 45
Leu Arg Ser Lys Asn Gly Asp Ile Cys Asn Lys His Gly Thr Val Val
50 55 60
Asp Asp Glu Lys Leu Lys Ile Val Ala Leu Glu Val Ala Asn Ala Ser
65 70 75 80
His Ala Ser Tyr Ser Gly Trp Ser Pro Ser Gly Val Ala Leu Val Asp
85 90 95
Cys Asp Arg Lys Leu Tyr Lys Gly Ser Asn Ile Glu Ser Val Ala Phe
100 105 110
Asn Pro Ser Leu Gly Pro Val Gln Val Ala Met Val Val Gly Lys Met
115 120 125
Arg Ser Gln Val Gln Cys Trp Trp Arg Lys Lys Val Leu Gln
130 135 140
<210> 41
<211> 290
<212> PRT
<213> Arachis hypogaea
<400> 41
Met Asp Gln Pro Arg Phe Val Ile Pro Ala Thr Glu Ala Gln Leu Leu
1 5 10 15
Ala Gln Ser Ser Gly Leu Pro Ser Val Asn Glu Val Leu Pro Thr Leu
20 25 30
Val Pro Thr Ala Gln Ser Leu Ala Arg Pro Pro Ile Ser Lys Phe Pro
35 40 45
Val Ala Ala Val Gly Leu Gly Lys Ser Gly Arg Ile Phe Ile Gly Val
50 55 60
Asn Leu Glu Phe Pro Gly Leu Pro Phe His His Thr Ile His Ala Glu
65 70 75 80
Gln Phe Leu Val Ala Asn Leu Leu Leu Asn Ser Glu Pro Ser Leu Val
85 90 95
Ser Phe Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Leu
100 105 110
Gln Glu Leu Pro Asn Ala Thr Asp Leu Lys Ile Val Ile Thr Asn Gln
115 120 125
Arg Asn Pro Asn Phe Ser Pro Leu Ser Gln Phe Leu Asn His Arg Phe
130 135 140
Gly Pro Arg Asp Leu Leu Pro Asp Ser Val Pro Leu Leu Leu Glu Pro
145 150 155 160
His Asn His Gly Leu Ser Leu Pro Asn Lys Phe Asn Ser Asn Asp Phe
165 170 175
Asp Leu Lys Leu Ala Phe Ala Ala Leu Glu Ala Ala Asn Ala Ser His
180 185 190
Ala Pro Tyr Ser Thr Ser Pro Ser Gly Ala Ala Val Met Asp Cys Glu
195 200 205
Gly Lys Val Tyr Arg Gly Ser Tyr Ile Glu Ser Ala Ala Tyr Asn Pro
210 215 220
Ser Leu Gly Pro Leu Gln Ala Ala Ile Val Ala Tyr Val Ala Gly Gly
225 230 235 240
Gly Gly Asp Tyr Glu Arg Ile Val Ala Ala Val Leu Val Glu Lys Asp
245 250 255
Gly Ala Val Val Lys Gln Glu His Thr Ala Arg Leu Leu Leu Arg Ala
260 265 270
Ile Ser Pro Lys Cys Asp Phe Lys Ala Leu Ile Cys Ser Pro Lys Asn
275 280 285
Glu Asp
290
<210> 42
<211> 304
<212> PRT
<213> Arachis hypogaea
<400> 42
Met Asp Arg Pro Gly Phe Val Ile Lys Pro Ile Gln Val Gln Pro Thr
1 5 10 15
Ala Gln Ser Leu Ala Ser Leu Val Asp Ser Thr Arg Thr Leu Ala Arg
20 25 30
Pro Pro Ile Ser Asn Phe Arg Val Ala Ala Val Gly Leu Gly Ser Ser
35 40 45
Gly Arg Ile Tyr Val Gly Val Asn Leu Glu Phe Pro Gly Leu Pro Leu
50 55 60
His His Ser Val His Ala Glu Gln Phe Leu Val Thr Asn Leu Leu Leu
65 70 75 80
Asn Gly Glu Thr Ser Leu Thr Ser Phe Ala Val Ser Ala Ala Pro Cys
85 90 95
Gly His Cys Arg Gln Phe Leu Gln Glu Leu Arg Asp Ser Glu Asn Ile
100 105 110
Gln Ile Leu Ile Ile Ser Asp Asn Asn Asp Gln Phe Thr Pro Leu Ser
115 120 125
Asp Phe Leu Pro His Arg Phe Gly Pro Leu Asp Leu Phe Pro Glu Gly
130 135 140
Ser Pro Phe Leu Leu Glu Pro Arg Asn Asn Gly Leu Lys Leu Thr Ala
145 150 155 160
Arg Thr Arg Met Asn Arg Asp Asp Glu Asp Glu Asn Leu Arg Ser Asn
165 170 175
Asn Gly Val Val Cys Asn Gly His Gly Thr Val Val Asp Asp Glu Lys
180 185 190
Leu Lys Ile Ala Ala Leu Glu Ala Ala Asn Ala Ser His Ala Pro Tyr
195 200 205
Ser Gly Ser Pro Ser Gly Val Ala Leu Val Asp Cys Gly Gly Lys Val
210 215 220
Tyr Lys Gly Ser Tyr Met Glu Ser Ala Ala Phe Asn Pro Ser Leu Gly
225 230 235 240
Pro Val Gln Ala Ala Val Val Ala Phe Val Ala Gly Gly Gly Gly Gly
245 250 255
Tyr Asp Glu Ile Val Gly Ala Val Leu Val Glu Lys Glu Gly Ala Pro
260 265 270
Val Arg Gln Glu Glu Thr Ala Arg Leu Leu Met Arg Ser Ile Ser Ala
275 280 285
Thr Cys Ser Phe Gln Ala Leu Leu Cys Cys Ser Asp Ser Asn Asp Val
290 295 300
<210> 43
<211> 290
<212> PRT
<213> Arachis hypogaea
<400> 43
Met Asp Gln Pro Arg Phe Val Ile Pro Ala Thr Glu Ala Gln Leu Leu
1 5 10 15
Ala Gln Ser Ser Gly Leu Pro Ser Val Asn Glu Val Leu Pro Thr Ile
20 25 30
Val Pro Thr Ala Gln Ser Leu Ala Arg Pro Pro Ile Ser Lys Phe His
35 40 45
Val Ala Ala Val Gly Leu Gly Lys Ser Gly Arg Ile Phe Ile Gly Val
50 55 60
Asn Leu Glu Phe Pro Gly Leu Pro Phe His His Thr Ile His Ala Glu
65 70 75 80
Gln Phe Leu Val Ala Asn Leu Leu Leu Asn Ser Glu Pro Ser Leu Val
85 90 95
Ser Phe Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Leu
100 105 110
Gln Glu Leu Pro Asn Ala Thr Asp Leu Lys Ile Val Ile Thr Asn Gln
115 120 125
Arg Asn Pro Asn Phe Ser Pro Leu Ser Gln Phe Leu Asn His Arg Phe
130 135 140
Gly Pro Arg Asp Leu Leu Pro Asp Ser Val Pro Leu Leu Leu Glu Pro
145 150 155 160
His Asn His Gly Leu Ser Leu Pro Asn Lys Phe Asn Ser Asn Asp Phe
165 170 175
Asp Leu Lys Leu Ala Phe Ala Ala Leu Glu Ala Ala Asn Ala Ser His
180 185 190
Ala Pro Tyr Ser Ala Ser Pro Ser Gly Ala Ala Val Met Asp Cys Glu
195 200 205
Gly Lys Val Tyr Arg Gly Ser Tyr Ile Glu Ser Ala Ala Tyr Asn Pro
210 215 220
Ser Leu Gly Pro Leu Gln Ala Ala Ile Val Ala Tyr Val Ala Gly Gly
225 230 235 240
Gly Gly Asp Tyr Glu Arg Ile Val Ala Ala Val Leu Val Glu Lys Asp
245 250 255
Gly Ala Val Val Lys Gln Glu His Thr Ala Arg Leu Leu Leu Arg Ala
260 265 270
Ile Ser Pro Lys Cys Asp Phe Lys Ala Leu Ile Cys Ser Pro Lys Asn
275 280 285
Glu Asp
290
<210> 44
<211> 324
<212> PRT
<213> Solanum tuberosum
<400> 44
Met Asp Gln Leu Lys Phe Val Val Glu Ala Ser Glu Ala Glu Ser Ile
1 5 10 15
Thr His Lys Leu Gly Leu Pro Ser Val His His Leu Leu Pro Val Leu
20 25 30
Val Gln Pro Ala Gln Thr Leu Ala Arg Pro Pro Ile Ser Asn Tyr His
35 40 45
Val Ala Ala Val Gly Leu Gly Ser Asp Gly Arg Val Phe Leu Gly Val
50 55 60
Asn Leu Glu Phe Pro Gly Leu Pro Leu His His Ser Val His Ala Glu
65 70 75 80
Gln Phe Leu Ile Thr Asn Leu Ala Val His Arg Cys Pro Arg Leu Val
85 90 95
Ala Phe Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Leu
100 105 110
Gln Glu Leu Arg Asn Pro Ser Asp Leu Gln Ile His Ile Thr Ser Gln
115 120 125
His Gln Asn Asn Pro Asn Val Thr Phe Glu Pro Leu Arg Glu Ile Leu
130 135 140
Pro Asn Pro Phe Gly Pro Phe Asp Leu Leu Asp Asp Glu Thr Pro Leu
145 150 155 160
Leu Leu Glu Arg His Asn Asn Gly Leu Ile Leu Ser Tyr Glu Ile Asn
165 170 175
His Asp Gly Asp Leu Cys Asn Gly Phe Ser Asp Asp Asp Leu Lys Ser
180 185 190
Gly Asn Leu Ser Asn Gly Phe Tyr Lys Leu Thr Glu Thr Glu Ser Thr
195 200 205
Leu Leu Arg Ile Ala Ala Leu Glu Gly Ala Asn Asp Ser His Ala Pro
210 215 220
Tyr Ser Gly Cys Pro Ser Gly Val Ala Ile Met Asp Tyr Glu Gly Lys
225 230 235 240
Ile Tyr Arg Gly Ser Tyr Val Glu Ser Ala Ala Tyr Asn Pro Ser Leu
245 250 255
Gly Pro Val Gln Ala Ala Leu Val Ala Phe Val Ala Glu Gly Gly Gly
260 265 270
Gly Tyr Glu Arg Ile Val Ala Ala Ala Leu Val Glu Lys Glu Gly Ala
275 280 285
Lys Val Arg Gln Glu Asp Thr Ala Arg Ile Phe Leu Lys Leu Val Ser
290 295 300
Pro Lys Cys Asp Leu Lys Val Phe His Cys Cys Val Ala Glu Asn Gly
305 310 315 320
Cys Lys Lys Asp
<210> 45
<211> 304
<212> PRT
<213> Cucumis sativus
<400> 45
Met Asp Gly Thr Arg Phe Val Ile Glu Ala Pro Glu Ala Glu Ser Leu
1 5 10 15
Ala Lys Gln Ser Ala Leu Thr Ile Pro Leu Leu Leu Pro Thr Leu Val
20 25 30
Lys Ser Ala Gln Ser Leu Ala Arg Pro Pro Ile Ser Lys Tyr His Val
35 40 45
Gly Ala Val Gly Leu Gly Ser Ser Gly Arg Val Phe Phe Gly Val Asn
50 55 60
Leu Glu Phe Pro Gly Leu Pro Leu His Gln Ser Val His Ala Glu Gln
65 70 75 80
Phe Leu Val Thr Asn Leu Ala Leu Asn Ala Glu Ser His Leu Asn Tyr
85 90 95
Leu Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Leu Gln
100 105 110
Glu Val Arg Ser Ser Ala Asp Ile Lys Ile Leu Val Ser Asp Ser Gly
115 120 125
Ser Asp Ser Gly Ser Asp Ser Lys Pro Asp Val Tyr Val Pro Leu Pro
130 135 140
Gln Phe Leu Pro His Arg Phe Gly Pro Tyr Asp Leu Leu Ala Lys Asp
145 150 155 160
Val Pro Leu Leu Leu Glu Pro Arg Phe Asn Gly Leu Ser Leu Pro Asn
165 170 175
Glu Thr Ala Glu Asn Asn Lys Leu Cys Asn Gly Asn His Gly Glu Asn
180 185 190
Leu Glu Lys Leu Lys Arg Ala Ala Leu Asp Ala Ala Asn Met Ser His
195 200 205
Ala Pro Tyr Ser Lys Cys Pro Ser Gly Val Ala Leu Met Asp Asp Asn
210 215 220
Gly Arg Ile Tyr Asn Gly Pro Tyr Met Glu Ser Ala Ala Tyr Asn Pro
225 230 235 240
Ser Met Gly Pro Val Gln Ala Ala Ile Val Ala Tyr Ile Ala Gly Gly
245 250 255
Gly Ala Gly Tyr Glu Arg Ile Val Ala Ala Val Leu Val Glu Lys Asp
260 265 270
Gly Val Glu Val Lys Gln Glu Arg Ala Ala Arg Leu Leu Leu Glu Thr
275 280 285
Ile Ser Pro Glu Cys Glu Phe Thr Val Val His Cys Ile Ala Ala Val
290 295 300
<210> 46
<211> 326
<212> PRT
<213> Solanum lycopersicum
<400> 46
Met Asp Gln Leu Gln Phe Val Val Glu Ala Ser Glu Ala Asp Ser Ile
1 5 10 15
Thr Gln Lys Leu Gly Leu Pro Ser Val His His Leu Leu Pro Leu Leu
20 25 30
Val Gln Pro Ala Gln Thr Leu Ala Arg Pro Pro Ile Ser Asn Tyr His
35 40 45
Val Ala Ala Val Gly Leu Gly Ser Asp Gly Arg Val Phe Leu Gly Val
50 55 60
Asn Leu Glu Phe Pro Gly Leu Pro Leu His His Ser Val His Ala Glu
65 70 75 80
Gln Phe Leu Ile Thr Asn Leu Ala Val His Leu Cys Pro Arg Leu Val
85 90 95
Ala Phe Ala Val Ser Ala Ala Pro Cys Gly His Cys Arg Gln Phe Leu
100 105 110
Gln Glu Leu Arg Asn Ser Ser Asp Leu Gln Ile His Ile Thr Ser Gln
115 120 125
His Gln Asn Asn Pro Asp Val Ile Phe Glu Pro Leu Arg Glu Ile Leu
130 135 140
Pro Asn Pro Phe Gly Pro Phe Asp Leu Leu Asp Asp Glu Thr Pro Leu
145 150 155 160
Leu Leu Glu Arg His Asn Asn Asn Leu Ile Leu Ser Tyr Glu Ile Asn
165 170 175
His Val Gly Asp Leu Cys Asn Gly Phe Ser Asp Asp Asp Leu Lys Ser
180 185 190
Gly Lys Asn Leu Ser Asn Gly Phe Tyr Lys Leu Thr Glu Thr Glu Ser
195 200 205
Thr Leu Leu Arg Ile Ala Ala Leu Gly Gly Ala Asn Asn Ser His Ala
210 215 220
Pro Tyr Ser Glu Cys Pro Ser Gly Val Ala Ile Met Asp Cys Asp Gly
225 230 235 240
Lys Ile Tyr Lys Gly Ser Tyr Val Glu Ser Ala Ala Tyr Asn Pro Ser
245 250 255
Leu Gly Pro Met Gln Ala Ala Leu Val Ala Phe Val Ala Glu Gly Gly
260 265 270
Gly Gly Tyr Glu Arg Ile Val Ala Ala Ala Leu Val Glu Lys Glu Gly
275 280 285
Ala Lys Val Arg Gln Glu Asp Thr Ala Arg Ile Phe Leu Lys Leu Val
290 295 300
Ser Pro Lys Cys Asp Leu Lys Val Phe His Cys Cys Val Ala Ala Glu
305 310 315 320
Asn Gly Cys Lys Lys Asp
325
<210> 47
<211> 19030
<212> DNA
<213> Artificial Sequence
<400> 47
tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 60
gacgttttta atgtactgaa ttaacgccga attaattcgg gggatctgga ttttagtact 120
ggattttggt tttaggaatt agaaatttta ttgatagaag tattttacaa atacaaatac 180
atactaaggg tttcttatat gctcaacaca tgagcgaaac cctataggaa ccctaattcc 240
cttatctggg aactactcac acattattat ggagaaactc gagcttgtcg atcgacagat 300
cccggtcggc atctactcta tttctttgcc ctcggacgag tgctggggcg tcggtttcca 360
ctatcggcga gtacttctac acagccatcg gtccagacgg ccgcgcttct gcgggcgatt 420
tgtgtacgcc cgacagtccc ggctccggat cggacgattg cgtcgcatcg accctgcgcc 480
caagctgcat catcgaaatt gccgtcaacc aagctctgat agagttggtc aagaccaatg 540
cggagcatat acgcccggag tcgtggcgat cctgcaagct ccggatgcct ccgctcgaag 600
tagcgcgtct gctgctccat acaagccaac cacggcctcc agaagaagat gttggcgacc 660
tcgtattggg aatccccgaa catcgcctcg ctccagtcaa tgaccgctgt tatgcggcca 720
ttgtccgtca ggacattgtt ggagccgaaa tccgcgtgca cgaggtgccg gacttcgggg 780
cagtcctcgg cccaaagcat cagctcatcg agagcctgcg cgacggacgc actgacggtg 840
tcgtccatca cagtttgcca gtgatacaca tggggatcag caatcgcgca tatgaaatca 900
cgccatgtag tgtattgacc gattccttgc ggtccgaatg ggccgaaccc gctcgtctgg 960
ctaagatcgg ccgcagcgat cgcatccata gcctccgcga ccggttgtag aacagcgggc 1020
agttcggttt caggcaggtc ttgcaacgtg acaccctgtg aacggcggga gatgcaatag 1080
gtcaggctct cgctaaactc cccaatgtca agcacttccg gaatcgggag cgcggccgat 1140
gcaaagtgcc gataaacata acgatctttg tagaaaccat cggcgcagct atttacccgc 1200
aggacatatc cacgccctcc tacatcgaag ctgaaagcac gagattcttc gccctccgag 1260
agctgcatca ggtcggagac gctgtcgaac ttttcgatca gaaacttctc gacagacgtc 1320
gcggtgagtt caggcttttt catatctcat tgccccccgg atctgcgaaa gctcgagaga 1380
gatagatttg tagagagaga ctggtgattt cagcgtgtcc tctccaaatg aaatgaactt 1440
ccttatatag aggaagggtc ttgcgaagga tagtgggatt gtgcgtcatc ccttacgtca 1500
gtggagatat cacatcaatc cacttgcttt gaagacgtgg ttggaacgtc ttctttttcc 1560
acgatgctct tcgtgggtgg gggtccatct ttgggaccac tgtcggcaga ggcatcttga 1620
acgatagcct ttcctttatc gcaatgatgg catttgtagg tgccaccttc cttttctact 1680
gtccttttga tgaagtgaca gatagctggg caatggaatc cgaggaggtt tcccgatatt 1740
accctttgtt gaaaagtctc aatagccctt tggtcttctg agactgtatc tttgatattc 1800
ttggagtaga cgagagtgtc gtgctccacc atgttcacat caatccactt gctttgaaga 1860
cgtggttgga acgtcttctt tttccacgat gctcctcgtg ggtgggggtc catctttggg 1920
accactgtcg gcagaggcat cttgaacgat agcctttcct ttatcgcaat gatggcattt 1980
gtaggtgcca ccttcctttt ctactgtcct tttgatgaag tgacagatag ctgggcaatg 2040
gaatccgagg aggtttcccg atattaccct ttgttgaaaa gtctcaatag ccctttggtc 2100
ttctgagact gtatctttga tattcttgga gtagacgaga gtgtcgtgct ccaccatgtt 2160
ggcaagctgc tctagccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 2220
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 2280
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2340
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga catgattacg 2400
aattcgagct cggtaccgta actataacgg tcctaaggta gcgaaggatc ctctagtaac 2460
atagatgaca ccgcgcgcga taatttatcc tagtttgcgc gctatatttt gttttctatc 2520
gcgtattaaa tgtataattg cgggactcta atcataaaaa cccacctcat aaataacgtc 2580
atgcattaca tgttaattat tacatgctta acgtaattca acagaaatta tatgataatc 2640
atcgcaagac cggcaacagg attcaatctt aagaaacttt attgccaaat gtttgaacga 2700
tcgggactac accttccgct tcttctttgg gctccccccc gacagcatct tgatcttgtt 2760
ctccccgttg ctgtcctgga tgacgagggc ccatggcttg tactctgggg cgtcgctggt 2820
caggagcatg acgttctcgt cggtggactc gtcgtaggcg gtgtgcacca ggatgtcgga 2880
ctctggcttg ttgccgatga cctcctccac ctcctctggc agcatgagga tgctctcctg 2940
gatcacgagt tgcttgcccg tctccttctc gatgatgtcg gacaggttgg tgctgccgcc 3000
ggagtcacgc gtcttcttct tctttgcttg ccctgccttc ttggttgctg cgggtctctt 3060
atcaccacca agctgggaaa ggtcgatacg agtctcgtaa agaccggtga tggactggtg 3120
gatgagagta gcgtcgagaa cctccttggt ggacgtgtaa cgcttcctgt cgatggtggt 3180
gtcgaagtac ttgaaagcag caggggcgcc gaggttcgtg agcgtgaaga ggtggatgat 3240
gttctcggcc tgctcgcgga tgggcttgtc gcggtgcttg ttgtaggcgg agaggacctt 3300
gtcgaggtta gcgtcagcga ggatgacgcg cttggagaac tcggagatct gctcgatgat 3360
ctcgtcgagg tagtgcttgt gctgctccac gaagagctgc ttctgctcgt tatcctcagg 3420
ggaacccttg agcttctcgt agtgggaggc gaggtagagg aagttcacgt acttggaagg 3480
aagagcaagc tcgttaccct tctggagctc accagcggaa gccagcatcc tcttacgacc 3540
gttctcgagc tcgaaaagag agtacttggg gagcttgatg atgaggtcct tcttgacctc 3600
cttgtagccc ttggcctcga ggaagtcgat cgggttcttc tcgaaggagg agcgctccat 3660
gatggtgatg ccgaggagct ccttgacgga cttgagcttc ttcgacttac ccttctccac 3720
cttggcgacc acgaggacgg agtaggcgac agtaggggag tcgaaaccac cgtacttctt 3780
agggtcccaa tccttcttcc tagcgatgag cttgtccgag tttctctttg gaaggataga 3840
ctccttggag aagccaccgg tctggacctc ggtcttcttg acgatgttaa cctgaggcat 3900
agaaaggacc ttgcgaacag tagcgaagtc gcgacccttg tcccaaacga tctcaccagt 3960
ctcaccgttc gtctcgataa gagggcgctt gcggatctcg ccgttggcga gggtgatctc 4020
ggtcttgaag aagttcatga tgttggagta gaagaagtac ttggcggtgg ccttgccgat 4080
ctcctgctcg gacttggcga tcatcttacg aacgtcgtag accttgtagt caccgtagac 4140
gaactcggac tcaagcttag ggtacttctt gataagagcg gtaccaacga cagcgttaag 4200
gtaagcatcg tgagcgtggt ggtagttgtt gatctcgcgg accttgtaga actggaagtc 4260
cttgcggaag tcggagacga gcttggactt gagggtgatc accttgacct cgcggatgag 4320
cttgtcgttc tcgtcgtact tggtgttcat cctagaatcg aggatctgag caacgtgctt 4380
ggtaatctgc ctcgtctcaa caagctgcct cttgatgaaa ccagccttgt caagctcgga 4440
aaggccaccc ctctcagcct tcgtgaggtt gtcgaacttc ctctgggtaa tgagcttagc 4500
gttgagaagc tgcctccagt agttcttcat cttcttgaca acctcctcgg aagggacgtt 4560
gtccgactta cccctgttct tgtcggacct cgtgaggacc ttgttgtcga tggagtcatc 4620
cttaaggaaa gactgaggaa caatgtggtc gacgtcgtag tcagaaagcc tgttgatgtc 4680
gagctcctgg tcaacgtaca tatccctacc gttctggagg tagtagaggt agagcttctc 4740
gttctggagc tgggtgttct cgacagggtg ctccttaagg atctgagaac caagctcctt 4800
gataccctcc tcaatcctct tcatgcgctc cctcgagttc ttctgaccct tctgggtagt 4860
ctggttctca cgagccatct cgatgacgat gttctcaggc ttgtgacgac ccataacctt 4920
gacaagctca tcgacaacct taacagtctg aaggatgccc ttcttgatag caggggaacc 4980
agcaaggtta gcaatgtgct cgtggagaga gtcgccctga ccggacacct gagccttctg 5040
aatatcctcc ttgaaggtaa gagagtcatc gtggatgagc tgcatgaagt tcctgttagc 5100
gaaaccatca gacttgagga agtcgaggat agtcttgccg ctctgcttat ccctgatacc 5160
gttgatgagc ttgcgggaga gcctacccca accggtgtaa cggcgacgct tgagctgctt 5220
cataaccttg tcatcgaaga gatgagcgta agtcttgagc ctctcctcga tcatctccct 5280
atcctcgaag agagtaagag tgaggacgat gtcctcgagg atgtcctcgt tctcctcgtt 5340
gtcgaggaag tccttgtcct tgatgatctt gaggagatcg tggtaggtac cgagagaagc 5400
gttgaaacgg tcctcaacgc cgctgatctc gacggagtcg aagcactcga tcttcttgaa 5460
gtagtcctcc ttgagctgct tgacggtgac cttgcggttg gtcttgaaga ggaggtcaac 5520
gatagccttc ttctgctcgc cggagaggaa ggcaggcttg cgcataccct cggtgacgta 5580
cttgaccttg gtgagctcgt tgtagaccgt gaagtactcg tagaggaggg agtgcttggg 5640
gaggaccttc tcgttgggga ggttcttgtc gaagttggtc atgcgctcga tgaaggactg 5700
ggcggaagca cccttgtcaa caacctcctc gaagttccaa ggggtgatag tctcctcgga 5760
cttcctagtc atccaagcga aacgggagtt accacgagca agaggaccaa cgtagtaagg 5820
aatacggaaa gtaaggatct tctcgatctt ctcacggttg tccttgagga aagggtagaa 5880
gtcctcctgc ctacgaagga tggcgtgaag ctcaccaagg tggatctggt gagggataga 5940
gccgttatcg aaagtcctct gcttcctaag gaggtcctca cggttaagct taacgagaag 6000
ctcctcagta ccatccatct tctcaaggat tggcttgatg aacttgtaga actcctcctg 6060
gctagctcca ccgtcaatgt aaccggcgta gccgttcttg gactggtcga agaagatctc 6120
cttgtacttc tcggggagct gctgacgaac aagagccttg aggagggtga ggtcctggtg 6180
gtgctcgtcg tagcgcttga tcatggaagc agaaagagga gccttagtga tctcagtgtt 6240
aaccctaagg atatccgaaa ggaggatagc atcggagagg ttcttagcag caaggaagag 6300
atcagcgtac tgatctccaa tctgagcgag gaggttgtcg agatcatcgt cgtaggtatc 6360
cttggagagc tgaagcttag cgtcctcagc gaggtcgaag ttggacttga agttaggggt 6420
cagaccgagg gagagagcga tgaggttacc gaaaagaccg ttcttcttct caccagggag 6480
ctgggcgatg aggttctcga gacgcctgga cttggagagc ctagcggaaa ggatcgcctt 6540
agcgtcgaca ccggaagcgt tgatagggtt ctcctcgaag agctggttgt aggtctggac 6600
gagctggatg aagagcttgt ccacgtcgga gttatcaggg ttaaggtcac cctcgataag 6660
gaagtgacca cggaacttga tcatgtgagc gagagcaagg tagatgagac gaagatcagc 6720
cttatcagta gagtcaacaa gcttcttacg aaggtggtag atagtggggt acttctcgtg 6780
gtaggcgacc tcgtcgacga tgttgccgaa gatggggtgg cgctcgtgct tcttgtcctc 6840
ctccacgagg aaggactcct cgaggcggtg gaagaaggag tcgtcgacct tcgccatctc 6900
gttggagaag atctcctgga ggtagcagat gcggttcttg cggcgggtgt agcggcggcg 6960
ggcggtgcgc ttgaggcggg tcgcctccgc cgtctcgccg gagtcgaaga ggagggcgcc 7020
gatgaggttc ttcttgatgg agtggcggtc ggtgttgccc aggaccttga acttcttgga 7080
cgggaccttg tactcgtcgg tgatcaccgc ccagccgacg ctgttggtgc cgatggcgag 7140
gccgatggag tacttcttgt ccttaagttc aggcgtagcg ctctcggagg tgcctggcgt 7200
ctcgctgccg gacttagctt catagcaatg aaacactttg aattcgcatt tcggcgatat 7260
agtctctaat aacaacctcg ccgtgtgctc ttgcctcacc accgcatctt ctttctccac 7320
cagaaccgct ccgacgatcc tctcgtatcc tcctccacca ccattagcca cataatcaac 7380
caacgccgcc tgtactggtc ccatactagg attatacgcc gccgattcca tataccaacc 7440
tctgtacact ttcccgtcac aatccaccag cgaaactccc gatggacata aactatacgg 7500
cgcgtacgat ctattcgccg ccgctaaagc cgtttgtttc aaatcggcgg atgaatcggt 7560
gtttccgtta caaatcgaat ccagatctga gattttgaga tggttatcgt gagattcgag 7620
aagaagagga tgatctttcc cgagaagatc gtcgggaccg aatctgtgtg gcaagaagct 7680
tccgagacgt aagaatccgt ctgaatcggc ggcggaatcg gaatcggcgg agttgtttgg 7740
atcggtgata aggattttga tttcaggtgc gtcgcgaatt tcttggagga attgacggca 7800
atggccacat ggtgcggcgg agacggcgaa gaaattgaga tgacgttcac cgttgagtgt 7860
gagattggtg acgaggaact gttcggcgtg gattgagtgg tggagaggga gatttgggaa 7920
ttcgacattg acgcctaaga agatccgacc tgatgatccg agtccgacga ctgcgacgtt 7980
gaatttcgaa atcggagttc gagcgtagga ttgtgctggt ttgactagcg acgggaggag 8040
ctgaatgacg gaaacgccga gttgtttcgc ggcggattct gcttctttgg attggattac 8100
gaagcttggc ttatcaacct tcctcttctt ctttggcatc tgcagaagta acaccaaaca 8160
acagggtgag catcgacaaa agaaacagta ccaagcaaat aaatagcgta tgaaggcagg 8220
gctaaaaaaa tccacatata gctgctgcat atgccatcat ccaagtatat caagatcaaa 8280
ataattataa aacatacttg tttattataa tagataggta ctcaaggtta gagcatatga 8340
atagatgctg catatgccat catgtatatg catcagtaaa acccacatca acatgtatac 8400
ctatcctaga tcgatatttc catccatctt aaactcgtaa ctatgaagat gtatgacaca 8460
cacatacagt tccaaaatta ataaatacac caggtagttt gaaacagtat tctactccga 8520
tctagaacga atgaacgacc gcccaaccac accacatcat cacaaccaag cgaacaaaaa 8580
gcatctctgt atatgcatca gtaaaacccg catcaacatg tatacctatc ctagatcgat 8640
atttccatcc atcatcttca attcgtaact atgaatatgt atggcacaca catacagatc 8700
caaaattaat aaatccacca ggtagtttga aacagaattc tactccgatc tagaacgacc 8760
gcccaaccag accacatcat cacaaccaag acaaaaaaaa gcatgaaaag atgacccgac 8820
aaacaagtgc acggcatata ttgaaataaa ggaaaagggc aaaccaaacc ctatgcaacg 8880
aaacaaaaaa aatcatgaaa tcgatcccgt ctgcggaacg gctagagcca tcccaggatt 8940
ccccaaagag aaacactggc aagttagcaa tcagaacgtg tctgacgtac aggtcgcatc 9000
cgtgtacgaa cgctagcagc acggatctaa cacaaacacg gatctaacac aaacatgaac 9060
agaagtagaa ctaccgggcc ctaaccatgg accggaacgc cgatctagag aaggtagaga 9120
gggggggggg ggaggacgag cggcgtacct tgaagcggag gtgccgacgg gtggatttgg 9180
gggagatctg gttgtgtgtg tgtgcgctcc gaacaacacg aggttgggga aagagggtgt 9240
ggagggggtg tctatttatt acggcgggcg aggaagggaa agcgaaggag cggtgggaaa 9300
ggaatccccc gtagctgccg gtgccgtgag aggaggagga ggccgcctgc cgtgccggct 9360
cacgtctgcc gctccgccac gcaatttctg gatgccgaca gcggagcaag tccaacggtg 9420
gagcggaact ctcgagaggg gtccagaggc agcgacagag atgccgtgcc gtctgcttcg 9480
cttggcccga cgcgacgctg ctggttcgct ggttggtgtc cgttagactc gtcgacggcg 9540
tttaacaggc tggcattatc tactcgaaac aagaaaaatg tttccttagt ttttttaatt 9600
tcttaaaggg tatttgttta atttttagtc actttatttt attctatttt atatctaaat 9660
tattaaataa aaaaactaaa atagagtttt agttttctta atttagaggc taaaatagaa 9720
taaaatagat gtactaaaaa aattagtcta taaaaaccat taaccctaaa ccctaaatgg 9780
atgtactaat aaaatggatg aagtattata taggtgaagc tatttgcaaa aaaaaaggag 9840
aacacatgca cactaaaaag ataaaactgt agagtcctgt tgtcaaaata ctcaattgtc 9900
ctttagacca tgtctaactg ttcatttata tgattctcta aaacactgat attattgtag 9960
tactatagat tatattattc gtagagtaaa gtttaaatat atgtataaag atagataaac 10020
tgcacttcaa acaagtgtga caaaaaaaat atgtggtaat tttttataac ttagacatgc 10080
aatgctcatt atctctagag aggggcacga ccgggtcacg ctgcactgca gcctaggaaa 10140
aaaagcaccg actcggtgcc actttttcaa gttgataacg gactagcctt attttaactt 10200
gctatttcta gctctaaaac atcatgcggc gggggacctg tgccacggat catctgcaca 10260
actcttttaa accagctttg atctatgtgg atagccgagg tggtactaat actagtcttt 10320
gttgtcgtcc aattgcgtaa tgggccggcc catactgcaa tacatgtcct gaaaggcttc 10380
atggcccact acgaaatgct tttctcctac agtttatctt actccttcac atcacacgtg 10440
gtttccaacg tacccagtgt tcccggcttc cagcatttgc tggtagcacc agtagaagac 10500
gcctgtcttg tgctatggtc cctgactgca catctgattc ctccaagatc catgcatgcc 10560
tgataacttt aagttgcttc agaagaactt taagtgatct gttcgtatgt ttaaagattc 10620
cttcccgggg atctcgatgt gtagtctacg agaagggtta accgtctctt cgtgagaata 10680
accgtggcct aaaaataagc cgatgaggat aaataaaatg tggtggtaca gtacttcaag 10740
aggtttactc atcaagagga tgcttttccg atgagctcta gtagtacatc ggacctcaca 10800
tacctccatt gtggtgaaat attttgtgct catttagtga tgggtaaatt ttgtttatgt 10860
cactctaggt tttgacattt cagttttgcc actcttaggt tttgacaaat aatttccatt 10920
ccgcggcaaa agcaaaacaa ttttatttta cttttaccac tcttagcttt cacaatgtat 10980
cacaaatgcc actctagaaa ttctgtttat gccacagaat gtgaaaaaaa acactcactt 11040
atttgaagcc aaggtgttca tggcatggaa atgtgacata aagtaacgtt cgtgtataag 11100
aaaaaattgt actcctcgta acaagagacg gaaacatcat gagacaatcg cgtttggaag 11160
gctttgcatc acctttggat gatgcgcatg aatggagtcg tctgcttgct agccttcgcc 11220
taccgcccac tgagtccggg cggcaactac catcggcgaa cgacccagct gacctctacc 11280
gaccggactt gaatgcgcta ccttcgtcag cgacgatggc cgcgtacgct ggcgacgtgc 11340
ccccgcatgc atggcggcac atggcgagct cagaccgtgc gtggctggct acaaatacgt 11400
accccgtgag tgccctagct agaaacttac acctgcaact gcgagagcga gcgtgtgagt 11460
gtagccgagt agatccgcca ccatggcctc ctccgagaac gtcatcaccg agttcatgcg 11520
cttcaaggtg cgcatggagg gcaccgtgaa cggccacgag ttcgagatcg agggcgaggg 11580
cgagggccgc ccctacgagg gccacaacac cgtgaagctg aaggtgacga agggcggccc 11640
cctgcccttc gcctgggaca tcctgtcccc ccagttccag tacggctcca aggtgtacgt 11700
gaagcacccc gccgacatcc ccgactacaa gaagctgtcc ttccccgagg gcttcaagtg 11760
ggagcgcgtg atgaacttcg aggacggcgg cgtggcgacc gtgacccagg actcctccct 11820
gcaggacggc tgcttcatct acaaggtgaa gttcatcggc gtgaacttcc cctccgacgg 11880
ccccgtgatg cagaagaaga ccatgggctg ggaggcctcc accgagcgcc tgtacccccg 11940
cgacggcgtg ctgaagggcg agacgcacaa ggccctgaag ctgaaggacg gcggccacta 12000
cctggtggag ttcaagtcca tctacatggc caagaagccc gtgcagctgc ccggctacta 12060
ctacgtggac gccaagctgg acatcacctc ccacaacgag gactacacca tcgtggagca 12120
gtacgagcgc accgagggcc gccaccacct gttcctgtag cggcccatgg atattcgaac 12180
gcgtaggtac cacatggtta acctagactt gtccatcttc tggattggcc aacttaatta 12240
atgtatgaaa taaaaggatg cacacatagt gacatgctaa tcactataat gtgggcatca 12300
aagttgtgtg ttatgtgtaa ttactagtta tctgaataaa agagaaagag atcatccata 12360
tttcttatcc taaatgaatg tcacgtgtct ttataattct ttgatgaacc agatgcattt 12420
cattaaccaa atccatatac atataaatat taatcatata taattaatat caattgggtt 12480
agcaaaacaa atctagtcta ggtgtgtttt gcgaatgcgg ccaagcttgg cactggccgt 12540
cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc caacttaatc gccttgcagc 12600
acatccccct ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca 12660
acagttgcgc agcctgaatg gcgaatgcta gagcagcttg agcttggatc agattgtcgt 12720
ttcccgcctt cagtttaaac tatcagtgtt tgacaggata tattggcggg taaacctaag 12780
agaaaagagc gtttattaga ataatcggat atttaaaagg gcgtgaaaag gtttatccgt 12840
tcgtccattt gtatgtgcat gccaaccaca gggttcccct cgggatcaaa gtactttgat 12900
ccaacccctc cgctgctata gtgcagtcgg cttctgacgt tcagtgcagc cgtcttctga 12960
aaacgacatg tcgcacaagt cctaagttac gcgacaggct gccgccctgc ccttttcctg 13020
gcgttttctt gtcgcgtgtt ttagtcgcat aaagtagaat acttgcgact agaaccggag 13080
acattacgcc atgaacaaga gcgccgccgc tggcctgctg ggctatgccc gcgtcagcac 13140
cgacgaccag gacttgacca accaacgggc cgaactgcac gcggccggct gcaccaagct 13200
gttttccgag aagatcaccg gcaccaggcg cgaccgcccg gagctggcca ggatgcttga 13260
ccacctacgc cctggcgacg ttgtgacagt gaccaggcta gaccgcctgg cccgcagcac 13320
ccgcgaccta ctggacattg ccgagcgcat ccaggaggcc ggcgcgggcc tgcgtagcct 13380
ggcagagccg tgggccgaca ccaccacgcc ggccggccgc atggtgttga ccgtgttcgc 13440
cggcattgcc gagttcgagc gttccctaat catcgaccgc acccggagcg ggcgcgaggc 13500
cgccaaggcc cgaggcgtga agtttggccc ccgccctacc ctcaccccgg cacagatcgc 13560
gcacgcccgc gagctgatcg accaggaagg ccgcaccgtg aaagaggcgg ctgcactgct 13620
tggcgtgcat cgctcgaccc tgtaccgcgc acttgagcgc agcgaggaag tgacgcccac 13680
cgaggccagg cggcgcggtg ccttccgtga ggacgcattg accgaggccg acgccctggc 13740
ggccgccgag aatgaacgcc aagaggaaca agcatgaaac cgcaccagga cggccaggac 13800
gaaccgtttt tcattaccga agagatcgag gcggagatga tcgcggccgg gtacgtgttc 13860
gagccgcccg cgcacgtctc aaccgtgcgg ctgcatgaaa tcctggccgg tttgtctgat 13920
gccaagctgg cggcctggcc ggccagcttg gccgctgaag aaaccgagcg ccgccgtcta 13980
aaaaggtgat gtgtatttga gtaaaacagc ttgcgtcatg cggtcgctgc gtatatgatg 14040
cgatgagtaa ataaacaaat acgcaagggg aacgcatgaa ggttatcgct gtacttaacc 14100
agaaaggcgg gtcaggcaag acgaccatcg caacccatct agcccgcgcc ctgcaactcg 14160
ccggggccga tgttctgtta gtcgattccg atccccaggg cagtgcccgc gattgggcgg 14220
ccgtgcggga agatcaaccg ctaaccgttg tcggcatcga ccgcccgacg attgaccgcg 14280
acgtgaaggc catcggccgg cgcgacttcg tagtgatcga cggagcgccc caggcggcgg 14340
acttggctgt gtccgcgatc aaggcagccg acttcgtgct gattccggtg cagccaagcc 14400
cttacgacat atgggccacc gccgacctgg tggagctggt taagcagcgc attgaggtca 14460
cggatggaag gctacaagcg gcctttgtcg tgtcgcgggc gatcaaaggc acgcgcatcg 14520
gcggtgaggt tgccgaggcg ctggccgggt acgagctgcc cattcttgag tcccgtatca 14580
cgcagcgcgt gagctaccca ggcactgccg ccgccggcac aaccgttctt gaatcagaac 14640
ccgagggcga cgctgcccgc gaggtccagg cgctggccgc tgaaattaaa tcaaaactca 14700
tttgagttaa tgaggtaaag agaaaatgag caaaagcaca aacacgctaa gtgccggccg 14760
tccgagcgca cgcagcagca aggctgcaac gttggccagc ctggcagaca cgccagccat 14820
gaagcgggtc aactttcagt tgccggcgga ggatcacacc aagctgaaga tgtacgcggt 14880
acgccaaggc aagaccatta ccgagctgct atctgaatac atcgcgcagc taccagagta 14940
aatgagcaaa tgaataaatg agtagatgaa ttttagcggc taaaggaggc ggcatggaaa 15000
atcaagaaca accaggcacc gacgccgtgg aatgccccat gtgtggagga acgggcggtt 15060
ggccaggcgt aagcggctgg gttgcctgcc ggccctgcaa tggcactgga acccccaagc 15120
ccgaggaatc ggcgtgagcg gtcgcaaacc atccggcccg gtacaaatcg gcgcggcgct 15180
gggtgatgac ctggtggaga agttgaaggc cgcgcaggcc gcccagcggc aacgcatcga 15240
ggcagaagca cgccccggtg aatcgtggca agcggccgct gatcgaatcc gcaaagaatc 15300
ccggcaaccg ccggcagccg gtgcgccgtc gattaggaag ccgcccaagg gcgacgagca 15360
accagatttt ttcgttccga tgctctatga cgtgggcacc cgcgatagtc gcagcatcat 15420
ggacgtggcc gttttccgtc tgtcgaagcg tgaccgacga gctggcgagg tgatccgcta 15480
cgagcttcca gacgggcacg tagaggtttc cgcagggccg gccggcatgg ccagtgtgtg 15540
ggattacgac ctggtactga tggcggtttc ccatctaacc gaatccatga accgataccg 15600
ggaagggaag ggagacaagc ccggccgcgt gttccgtcca cacgttgcgg acgtactcaa 15660
gttctgccgg cgagccgatg gcggaaagca gaaagacgac ctggtagaaa cctgcattcg 15720
gttaaacacc acgcacgttg ccatgcagcg tacgaagaag gccaagaacg gccgcctggt 15780
gacggtatcc gagggtgaag ccttgattag ccgctacaag atcgtaaaga gcgaaaccgg 15840
gcggccggag tacatcgaga tcgagctagc tgattggatg taccgcgaga tcacagaagg 15900
caagaacccg gacgtgctga cggttcaccc cgattacttt ttgatcgatc ccggcatcgg 15960
ccgttttctc taccgcctgg cacgccgcgc cgcaggcaag gcagaagcca gatggttgtt 16020
caagacgatc tacgaacgca gtggcagcgc cggagagttc aagaagttct gtttcaccgt 16080
gcgcaagctg atcgggtcaa atgacctgcc ggagtacgat ttgaaggagg aggcggggca 16140
ggctggcccg atcctagtca tgcgctaccg caacctgatc gagggcgaag catccgccgg 16200
ttcctaatgt acggagcaga tgctagggca aattgcccta gcaggggaaa aaggtcgaaa 16260
agttctcttt cctgtggata gcacgtacat tgggaaccca aagccgtaca ttgggaaccg 16320
gaacccgtac attgggaacc caaagccgta cattgggaac cggtcacaca tgtaagtgac 16380
tgatataaaa gagaaaaaag gcgatttttc cgcctaaaac tctttaaaac ttattaaaac 16440
tcttaaaacc cgcctggcct gtgcataact gtctggccag cgcacagccg aagagctgca 16500
aaaagcgcct acccttcggt cgctgcgctc cctacgcccc gccgcttcgc gtcggcctat 16560
cgcggccgct ggccgctcaa aaatggctgg cctacggcca ggcaatctac cagggcgcgg 16620
acaagccgcg ccgtcgccac tcgaccgccg gcgcccacat caaggcaccc tgcctcgcgc 16680
gtttcggtga tgacggtgaa aacctctgac acatgcagct cccggagacg gtcacagctt 16740
gtctgtaagc ggatgccggg agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg 16800
ggtgtcgggg cgcagccatg acccagtcac gtagcgatag cggagtgtat actggcttaa 16860
ctatgcggca tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca 16920
cagatgcgta aggagaaaat accgcatcag gcgctcttcc gcttcctcgc tcactgactc 16980
gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg 17040
gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa 17100
ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga 17160
cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag 17220
ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct 17280
taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc atagctcacg 17340
ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc 17400
ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt 17460
aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta 17520
tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaaggac 17580
agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc 17640
ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat 17700
tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc 17760
tcagtggaac gaaaactcac gttaagggat tttggtcatg cattctaggt actaaaacaa 17820
ttcatccagt aaaatataat attttatttt ctcccaatca ggcttgatcc ccagtaagtc 17880
aaaaaatagc tcgacatact gttcttcccc gatatcctcc ctgatcgacc ggacgcagaa 17940
ggcaatgtca taccacttgt ccgccctgcc gcttctccca agatcaataa agccacttac 18000
tttgccatct ttcacaaaga tgttgctgtc tcccaggtcg ccgtgggaaa agacaagttc 18060
ctcttcgggc ttttccgtct ttaaaaaatc atacagctcg cgcggatctt taaatggagt 18120
gtcttcttcc cagttttcgc aatccacatc ggccagatcg ttattcagta agtaatccaa 18180
ttcggctaag cggctgtcta agctattcgt atagggacaa tccgatatgt cgatggagtg 18240
aaagagcctg atgcactccg catacagctc gataatcttt tcagggcttt gttcatcttc 18300
atactcttcc gagcaaagga cgccatcggc ctcactcatg agcagattgc tccagccatc 18360
atgccgttca aagtgcagga cctttggaac aggcagcttt ccttccagcc atagcatcat 18420
gtccttttcc cgttccacat cataggtggt ccctttatac cggctgtccg tcatttttaa 18480
atataggttt tcattttctc ccaccagctt atatacctta gcaggagaca ttccttccgt 18540
atcttttacg cagcggtatt tttcgatcag ttttttcaat tccggtgata ttctcatttt 18600
agccatttat tatttccttc ctcttttcta cagtatttaa agatacccca agaagctaat 18660
tataacaaga cgaactccaa ttcactgttc cttgcattct aaaaccttaa ataccagaaa 18720
acagcttttt caaagttgtt ttcaaagttg gcgtataaca tagtatcgac ggagccgatt 18780
ttgaaaccgc ggtgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc 18840
ctccgcgaga tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca 18900
tcggtaacat gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga 18960
ctgatgggct gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt 19020
ggctggctgg 19030
<210> 48
<211> 19063
<212> DNA
<213> Artificial Sequence
<400> 48
tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 60
gacgttttta atgtactgaa ttaacgccga attaattcgg gggatctgga ttttagtact 120
ggattttggt tttaggaatt agaaatttta ttgatagaag tattttacaa atacaaatac 180
atactaaggg tttcttatat gctcaacaca tgagcgaaac cctataggaa ccctaattcc 240
cttatctggg aactactcac acattattat ggagaaactc gagcttgtcg atcgacagat 300
cccggtcggc atctactcta tttctttgcc ctcggacgag tgctggggcg tcggtttcca 360
ctatcggcga gtacttctac acagccatcg gtccagacgg ccgcgcttct gcgggcgatt 420
tgtgtacgcc cgacagtccc ggctccggat cggacgattg cgtcgcatcg accctgcgcc 480
caagctgcat catcgaaatt gccgtcaacc aagctctgat agagttggtc aagaccaatg 540
cggagcatat acgcccggag tcgtggcgat cctgcaagct ccggatgcct ccgctcgaag 600
tagcgcgtct gctgctccat acaagccaac cacggcctcc agaagaagat gttggcgacc 660
tcgtattggg aatccccgaa catcgcctcg ctccagtcaa tgaccgctgt tatgcggcca 720
ttgtccgtca ggacattgtt ggagccgaaa tccgcgtgca cgaggtgccg gacttcgggg 780
cagtcctcgg cccaaagcat cagctcatcg agagcctgcg cgacggacgc actgacggtg 840
tcgtccatca cagtttgcca gtgatacaca tggggatcag caatcgcgca tatgaaatca 900
cgccatgtag tgtattgacc gattccttgc ggtccgaatg ggccgaaccc gctcgtctgg 960
ctaagatcgg ccgcagcgat cgcatccata gcctccgcga ccggttgtag aacagcgggc 1020
agttcggttt caggcaggtc ttgcaacgtg acaccctgtg aacggcggga gatgcaatag 1080
gtcaggctct cgctaaactc cccaatgtca agcacttccg gaatcgggag cgcggccgat 1140
gcaaagtgcc gataaacata acgatctttg tagaaaccat cggcgcagct atttacccgc 1200
aggacatatc cacgccctcc tacatcgaag ctgaaagcac gagattcttc gccctccgag 1260
agctgcatca ggtcggagac gctgtcgaac ttttcgatca gaaacttctc gacagacgtc 1320
gcggtgagtt caggcttttt catatctcat tgccccccgg atctgcgaaa gctcgagaga 1380
gatagatttg tagagagaga ctggtgattt cagcgtgtcc tctccaaatg aaatgaactt 1440
ccttatatag aggaagggtc ttgcgaagga tagtgggatt gtgcgtcatc ccttacgtca 1500
gtggagatat cacatcaatc cacttgcttt gaagacgtgg ttggaacgtc ttctttttcc 1560
acgatgctct tcgtgggtgg gggtccatct ttgggaccac tgtcggcaga ggcatcttga 1620
acgatagcct ttcctttatc gcaatgatgg catttgtagg tgccaccttc cttttctact 1680
gtccttttga tgaagtgaca gatagctggg caatggaatc cgaggaggtt tcccgatatt 1740
accctttgtt gaaaagtctc aatagccctt tggtcttctg agactgtatc tttgatattc 1800
ttggagtaga cgagagtgtc gtgctccacc atgttcacat caatccactt gctttgaaga 1860
cgtggttgga acgtcttctt tttccacgat gctcctcgtg ggtgggggtc catctttggg 1920
accactgtcg gcagaggcat cttgaacgat agcctttcct ttatcgcaat gatggcattt 1980
gtaggtgcca ccttcctttt ctactgtcct tttgatgaag tgacagatag ctgggcaatg 2040
gaatccgagg aggtttcccg atattaccct ttgttgaaaa gtctcaatag ccctttggtc 2100
ttctgagact gtatctttga tattcttgga gtagacgaga gtgtcgtgct ccaccatgtt 2160
ggcaagctgc tctagccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 2220
tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 2280
gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 2340
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga catgattacg 2400
aattcgagct cggtaccgta actataacgg tcctaaggta gcgaaggatc ctctagtaac 2460
atagatgaca ccgcgcgcga taatttatcc tagtttgcgc gctatatttt gttttctatc 2520
gcgtattaaa tgtataattg cgggactcta atcataaaaa cccacctcat aaataacgtc 2580
atgcattaca tgttaattat tacatgctta acgtaattca acagaaatta tatgataatc 2640
atcgcaagac cggcaacagg attcaatctt aagaaacttt attgccaaat gtttgaacga 2700
tcgggactac accttccgct tcttctttgg gctccccccc gacagcatct tgatcttgtt 2760
ctccccgttg ctgtcctgga tgacgagggc ccatggcttg tactctgggg cgtcgctggt 2820
caggagcatg acgttctcgt cggtggactc gtcgtaggcg gtgtgcacca ggatgtcgga 2880
ctctggcttg ttgccgatga cctcctccac ctcctctggc agcatgagga tgctctcctg 2940
gatcacgagt tgcttgcccg tctccttctc gatgatgtcg gacaggttgg tgctgccgcc 3000
ggagtcacgc gtcttcttct tctttgcttg ccctgccttc ttggttgctg cgggtctctt 3060
atcaccacca agctgggaaa ggtcgatacg agtctcgtaa agaccggtga tggactggtg 3120
gatgagagta gcgtcgagaa cctccttggt ggacgtgtaa cgcttcctgt cgatggtggt 3180
gtcgaagtac ttgaaagcag caggggcgcc gaggttcgtg agcgtgaaga ggtggatgat 3240
gttctcggcc tgctcgcgga tgggcttgtc gcggtgcttg ttgtaggcgg agaggacctt 3300
gtcgaggtta gcgtcagcga ggatgacgcg cttggagaac tcggagatct gctcgatgat 3360
ctcgtcgagg tagtgcttgt gctgctccac gaagagctgc ttctgctcgt tatcctcagg 3420
ggaacccttg agcttctcgt agtgggaggc gaggtagagg aagttcacgt acttggaagg 3480
aagagcaagc tcgttaccct tctggagctc accagcggaa gccagcatcc tcttacgacc 3540
gttctcgagc tcgaaaagag agtacttggg gagcttgatg atgaggtcct tcttgacctc 3600
cttgtagccc ttggcctcga ggaagtcgat cgggttcttc tcgaaggagg agcgctccat 3660
gatggtgatg ccgaggagct ccttgacgga cttgagcttc ttcgacttac ccttctccac 3720
cttggcgacc acgaggacgg agtaggcgac agtaggggag tcgaaaccac cgtacttctt 3780
agggtcccaa tccttcttcc tagcgatgag cttgtccgag tttctctttg gaaggataga 3840
ctccttggag aagccaccgg tctggacctc ggtcttcttg acgatgttaa cctgaggcat 3900
agaaaggacc ttgcgaacag tagcgaagtc gcgacccttg tcccaaacga tctcaccagt 3960
ctcaccgttc gtctcgataa gagggcgctt gcggatctcg ccgttggcga gggtgatctc 4020
ggtcttgaag aagttcatga tgttggagta gaagaagtac ttggcggtgg ccttgccgat 4080
ctcctgctcg gacttggcga tcatcttacg aacgtcgtag accttgtagt caccgtagac 4140
gaactcggac tcaagcttag ggtacttctt gataagagcg gtaccaacga cagcgttaag 4200
gtaagcatcg tgagcgtggt ggtagttgtt gatctcgcgg accttgtaga actggaagtc 4260
cttgcggaag tcggagacga gcttggactt gagggtgatc accttgacct cgcggatgag 4320
cttgtcgttc tcgtcgtact tggtgttcat cctagaatcg aggatctgag caacgtgctt 4380
ggtaatctgc ctcgtctcaa caagctgcct cttgatgaaa ccagccttgt caagctcgga 4440
aaggccaccc ctctcagcct tcgtgaggtt gtcgaacttc ctctgggtaa tgagcttagc 4500
gttgagaagc tgcctccagt agttcttcat cttcttgaca acctcctcgg aagggacgtt 4560
gtccgactta cccctgttct tgtcggacct cgtgaggacc ttgttgtcga tggagtcatc 4620
cttaaggaaa gactgaggaa caatgtggtc gacgtcgtag tcagaaagcc tgttgatgtc 4680
gagctcctgg tcaacgtaca tatccctacc gttctggagg tagtagaggt agagcttctc 4740
gttctggagc tgggtgttct cgacagggtg ctccttaagg atctgagaac caagctcctt 4800
gataccctcc tcaatcctct tcatgcgctc cctcgagttc ttctgaccct tctgggtagt 4860
ctggttctca cgagccatct cgatgacgat gttctcaggc ttgtgacgac ccataacctt 4920
gacaagctca tcgacaacct taacagtctg aaggatgccc ttcttgatag caggggaacc 4980
agcaaggtta gcaatgtgct cgtggagaga gtcgccctga ccggacacct gagccttctg 5040
aatatcctcc ttgaaggtaa gagagtcatc gtggatgagc tgcatgaagt tcctgttagc 5100
gaaaccatca gacttgagga agtcgaggat agtcttgccg ctctgcttat ccctgatacc 5160
gttgatgagc ttgcgggaga gcctacccca accggtgtaa cggcgacgct tgagctgctt 5220
cataaccttg tcatcgaaga gatgagcgta agtcttgagc ctctcctcga tcatctccct 5280
atcctcgaag agagtaagag tgaggacgat gtcctcgagg atgtcctcgt tctcctcgtt 5340
gtcgaggaag tccttgtcct tgatgatctt gaggagatcg tggtaggtac cgagagaagc 5400
gttgaaacgg tcctcaacgc cgctgatctc gacggagtcg aagcactcga tcttcttgaa 5460
gtagtcctcc ttgagctgct tgacggtgac cttgcggttg gtcttgaaga ggaggtcaac 5520
gatagccttc ttctgctcgc cggagaggaa ggcaggcttg cgcataccct cggtgacgta 5580
cttgaccttg gtgagctcgt tgtagaccgt gaagtactcg tagaggaggg agtgcttggg 5640
gaggaccttc tcgttgggga ggttcttgtc gaagttggtc atgcgctcga tgaaggactg 5700
ggcggaagca cccttgtcaa caacctcctc gaagttccaa ggggtgatag tctcctcgga 5760
cttcctagtc atccaagcga aacgggagtt accacgagca agaggaccaa cgtagtaagg 5820
aatacggaaa gtaaggatct tctcgatctt ctcacggttg tccttgagga aagggtagaa 5880
gtcctcctgc ctacgaagga tggcgtgaag ctcaccaagg tggatctggt gagggataga 5940
gccgttatcg aaagtcctct gcttcctaag gaggtcctca cggttaagct taacgagaag 6000
ctcctcagta ccatccatct tctcaaggat tggcttgatg aacttgtaga actcctcctg 6060
gctagctcca ccgtcaatgt aaccggcgta gccgttcttg gactggtcga agaagatctc 6120
cttgtacttc tcggggagct gctgacgaac aagagccttg aggagggtga ggtcctggtg 6180
gtgctcgtcg tagcgcttga tcatggaagc agaaagagga gccttagtga tctcagtgtt 6240
aaccctaagg atatccgaaa ggaggatagc atcggagagg ttcttagcag caaggaagag 6300
atcagcgtac tgatctccaa tctgagcgag gaggttgtcg agatcatcgt cgtaggtatc 6360
cttggagagc tgaagcttag cgtcctcagc gaggtcgaag ttggacttga agttaggggt 6420
cagaccgagg gagagagcga tgaggttacc gaaaagaccg ttcttcttct caccagggag 6480
ctgggcgatg aggttctcga gacgcctgga cttggagagc ctagcggaaa ggatcgcctt 6540
agcgtcgaca ccggaagcgt tgatagggtt ctcctcgaag agctggttgt aggtctggac 6600
gagctggatg aagagcttgt ccacgtcgga gttatcaggg ttaaggtcac cctcgataag 6660
gaagtgacca cggaacttga tcatgtgagc gagagcaagg tagatgagac gaagatcagc 6720
cttatcagta gagtcaacaa gcttcttacg aaggtggtag atagtggggt acttctcgtg 6780
gtaggcgacc tcgtcgacga tgttgccgaa gatggggtgg cgctcgtgct tcttgtcctc 6840
ctccacgagg aaggactcct cgaggcggtg gaagaaggag tcgtcgacct tcgccatctc 6900
gttggagaag atctcctgga ggtagcagat gcggttcttg cggcgggtgt agcggcggcg 6960
ggcggtgcgc ttgaggcggg tcgcctccgc cgtctcgccg gagtcgaaga ggagggcgcc 7020
gatgaggttc ttcttgatgg agtggcggtc ggtgttgccc aggaccttga acttcttgga 7080
cgggaccttg tactcgtcgg tgatcaccgc ccagccgacg ctgttggtgc cgatggcgag 7140
gccgatggag tacttcttgt ccttaagttc aggcgtagcg ctctcggagg tgcctggcgt 7200
ctcgctgccg gactttgcat cggacggtgt ataattgtac acgtggaagg tggcctgtgg 7260
ggccaccgcg gccaggaaga tcctggccgt cgcttcctgc gacaccaacc ccgcttcctt 7320
ctccacgagc gccgccgcca ccacgtctcc ggcggcgccg cccccggcgg ccaccatccc 7380
tatgatggcc gcctgcacgg ggcctagtgt cggattatac gccgcggact ccaggcatcc 7440
tccggcgtac accttccctt cgccgtccgc cacggcgaat cccgacgggc actcgctata 7500
cggcgcatgc gccgcccgcg cggccgcctc cgctgcctcc ctcaggcgcg cctctaggtc 7560
accgtgggcg aagccattgg cgacggcggc gggttcgccg agagggttgt cgtgcggttc 7620
gaggacgagg gggacgtgct tggggaggag atcatggggc ccgaaggggc gcgggaggag 7680
ggacgccacc gtgcgccact cgggggcgca gccgtcctcg gcgtcggagg tgacaatgat 7740
tcggatccca ccggcgccgc ggatctcctg gaggaactgg cggcagtggc cgcagggcat 7800
gtgggagacg gcgacggcgc gcagctccga ctcccccgcg gcggccgcgt tgacgacaag 7860
gaactgctcg gcgtggacgg aatgggacag cgggaggccc cggaactcga ggttgacgcc 7920
ggcgtagacg cggccgctcg cgccgaggcc cacggcgccc acggggaacc gcgagatcgg 7980
ggcacgcgcg cgccgcatcg cggaagggac gagcagcggc agcaggtcct ccaccgtctc 8040
caccccggcc gccgccgcgg cgcgctcggc ctcctccgcg ctcatcacga accccagcaa 8100
ctccaccgcc ggggcagcgt cagacttcgg cgccactttc tcctccccaa ccttcctctt 8160
cttctttggc atctgcagaa gtaacaccaa acaacagggt gagcatcgac aaaagaaaca 8220
gtaccaagca aataaatagc gtatgaaggc agggctaaaa aaatccacat atagctgctg 8280
catatgccat catccaagta tatcaagatc aaaataatta taaaacatac ttgtttatta 8340
taatagatag gtactcaagg ttagagcata tgaatagatg ctgcatatgc catcatgtat 8400
atgcatcagt aaaacccaca tcaacatgta tacctatcct agatcgatat ttccatccat 8460
cttaaactcg taactatgaa gatgtatgac acacacatac agttccaaaa ttaataaata 8520
caccaggtag tttgaaacag tattctactc cgatctagaa cgaatgaacg accgcccaac 8580
cacaccacat catcacaacc aagcgaacaa aaagcatctc tgtatatgca tcagtaaaac 8640
ccgcatcaac atgtatacct atcctagatc gatatttcca tccatcatct tcaattcgta 8700
actatgaata tgtatggcac acacatacag atccaaaatt aataaatcca ccaggtagtt 8760
tgaaacagaa ttctactccg atctagaacg accgcccaac cagaccacat catcacaacc 8820
aagacaaaaa aaagcatgaa aagatgaccc gacaaacaag tgcacggcat atattgaaat 8880
aaaggaaaag ggcaaaccaa accctatgca acgaaacaaa aaaaatcatg aaatcgatcc 8940
cgtctgcgga acggctagag ccatcccagg attccccaaa gagaaacact ggcaagttag 9000
caatcagaac gtgtctgacg tacaggtcgc atccgtgtac gaacgctagc agcacggatc 9060
taacacaaac acggatctaa cacaaacatg aacagaagta gaactaccgg gccctaacca 9120
tggaccggaa cgccgatcta gagaaggtag agaggggggg gggggaggac gagcggcgta 9180
ccttgaagcg gaggtgccga cgggtggatt tgggggagat ctggttgtgt gtgtgtgcgc 9240
tccgaacaac acgaggttgg ggaaagaggg tgtggagggg gtgtctattt attacggcgg 9300
gcgaggaagg gaaagcgaag gagcggtggg aaaggaatcc cccgtagctg ccggtgccgt 9360
gagaggagga ggaggccgcc tgccgtgccg gctcacgtct gccgctccgc cacgcaattt 9420
ctggatgccg acagcggagc aagtccaacg gtggagcgga actctcgaga ggggtccaga 9480
ggcagcgaca gagatgccgt gccgtctgct tcgcttggcc cgacgcgacg ctgctggttc 9540
gctggttggt gtccgttaga ctcgtcgacg gcgtttaaca ggctggcatt atctactcga 9600
aacaagaaaa atgtttcctt agttttttta atttcttaaa gggtatttgt ttaattttta 9660
gtcactttat tttattctat tttatatcta aattattaaa taaaaaaact aaaatagagt 9720
tttagttttc ttaatttaga ggctaaaata gaataaaata gatgtactaa aaaaattagt 9780
ctataaaaac cattaaccct aaaccctaaa tggatgtact aataaaatgg atgaagtatt 9840
atataggtga agctatttgc aaaaaaaaag gagaacacat gcacactaaa aagataaaac 9900
tgtagagtcc tgttgtcaaa atactcaatt gtcctttaga ccatgtctaa ctgttcattt 9960
atatgattct ctaaaacact gatattattg tagtactata gattatatta ttcgtagagt 10020
aaagtttaaa tatatgtata aagatagata aactgcactt caaacaagtg tgacaaaaaa 10080
aatatgtggt aattttttat aacttagaca tgcaatgctc attatctcta gagaggggca 10140
cgaccgggtc acgctgcact gcagcctagg aaaaaaagca ccgactcggt gccacttttt 10200
caagttgata acggactagc cttattttaa cttgctattt ctagctctaa aacatcatgc 10260
ggcgggggac ctgtgccacg gatcatctgc acaactcttt taaaccagct ttgatctatg 10320
tggatagccg aggtggtact aatactagtc tttgttgtcg tccaattgcg taatgggccg 10380
gcccatactg caatacatgt cctgaaaggc ttcatggccc actacgaaat gcttttctcc 10440
tacagtttat cttactcctt cacatcacac gtggtttcca acgtacccag tgttcccggc 10500
ttccagcatt tgctggtagc accagtagaa gacgcctgtc ttgtgctatg gtccctgact 10560
gcacatctga ttcctccaag atccatgcat gcctgataac tttaagttgc ttcagaagaa 10620
ctttaagtga tctgttcgta tgtttaaaga ttccttcccg gggatctcga tgtgtagtct 10680
acgagaaggg ttaaccgtct cttcgtgaga ataaccgtgg cctaaaaata agccgatgag 10740
gataaataaa atgtggtggt acagtacttc aagaggttta ctcatcaaga ggatgctttt 10800
ccgatgagct ctagtagtac atcggacctc acatacctcc attgtggtga aatattttgt 10860
gctcatttag tgatgggtaa attttgttta tgtcactcta ggttttgaca tttcagtttt 10920
gccactctta ggttttgaca aataatttcc attccgcggc aaaagcaaaa caattttatt 10980
ttacttttac cactcttagc tttcacaatg tatcacaaat gccactctag aaattctgtt 11040
tatgccacag aatgtgaaaa aaaacactca cttatttgaa gccaaggtgt tcatggcatg 11100
gaaatgtgac ataaagtaac gttcgtgtat aagaaaaaat tgtactcctc gtaacaagag 11160
acggaaacat catgagacaa tcgcgtttgg aaggctttgc atcacctttg gatgatgcgc 11220
atgaatggag tcgtctgctt gctagccttc gcctaccgcc cactgagtcc gggcggcaac 11280
taccatcggc gaacgaccca gctgacctct accgaccgga cttgaatgcg ctaccttcgt 11340
cagcgacgat ggccgcgtac gctggcgacg tgcccccgca tgcatggcgg cacatggcga 11400
gctcagaccg tgcgtggctg gctacaaata cgtaccccgt gagtgcccta gctagaaact 11460
tacacctgca actgcgagag cgagcgtgtg agtgtagccg agtagatccg ccaccatggc 11520
ctcctccgag aacgtcatca ccgagttcat gcgcttcaag gtgcgcatgg agggcaccgt 11580
gaacggccac gagttcgaga tcgagggcga gggcgagggc cgcccctacg agggccacaa 11640
caccgtgaag ctgaaggtga cgaagggcgg ccccctgccc ttcgcctggg acatcctgtc 11700
cccccagttc cagtacggct ccaaggtgta cgtgaagcac cccgccgaca tccccgacta 11760
caagaagctg tccttccccg agggcttcaa gtgggagcgc gtgatgaact tcgaggacgg 11820
cggcgtggcg accgtgaccc aggactcctc cctgcaggac ggctgcttca tctacaaggt 11880
gaagttcatc ggcgtgaact tcccctccga cggccccgtg atgcagaaga agaccatggg 11940
ctgggaggcc tccaccgagc gcctgtaccc ccgcgacggc gtgctgaagg gcgagacgca 12000
caaggccctg aagctgaagg acggcggcca ctacctggtg gagttcaagt ccatctacat 12060
ggccaagaag cccgtgcagc tgcccggcta ctactacgtg gacgccaagc tggacatcac 12120
ctcccacaac gaggactaca ccatcgtgga gcagtacgag cgcaccgagg gccgccacca 12180
cctgttcctg tagcggccca tggatattcg aacgcgtagg taccacatgg ttaacctaga 12240
cttgtccatc ttctggattg gccaacttaa ttaatgtatg aaataaaagg atgcacacat 12300
agtgacatgc taatcactat aatgtgggca tcaaagttgt gtgttatgtg taattactag 12360
ttatctgaat aaaagagaaa gagatcatcc atatttctta tcctaaatga atgtcacgtg 12420
tctttataat tctttgatga accagatgca tttcattaac caaatccata tacatataaa 12480
tattaatcat atataattaa tatcaattgg gttagcaaaa caaatctagt ctaggtgtgt 12540
tttgcgaatg cggccaagct tggcactggc cgtcgtttta caacgtcgtg actgggaaaa 12600
ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca gctggcgtaa 12660
tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga atggcgaatg 12720
ctagagcagc ttgagcttgg atcagattgt cgtttcccgc cttcagttta aactatcagt 12780
gtttgacagg atatattggc gggtaaacct aagagaaaag agcgtttatt agaataatcg 12840
gatatttaaa agggcgtgaa aaggtttatc cgttcgtcca tttgtatgtg catgccaacc 12900
acagggttcc cctcgggatc aaagtacttt gatccaaccc ctccgctgct atagtgcagt 12960
cggcttctga cgttcagtgc agccgtcttc tgaaaacgac atgtcgcaca agtcctaagt 13020
tacgcgacag gctgccgccc tgcccttttc ctggcgtttt cttgtcgcgt gttttagtcg 13080
cataaagtag aatacttgcg actagaaccg gagacattac gccatgaaca agagcgccgc 13140
cgctggcctg ctgggctatg cccgcgtcag caccgacgac caggacttga ccaaccaacg 13200
ggccgaactg cacgcggccg gctgcaccaa gctgttttcc gagaagatca ccggcaccag 13260
gcgcgaccgc ccggagctgg ccaggatgct tgaccaccta cgccctggcg acgttgtgac 13320
agtgaccagg ctagaccgcc tggcccgcag cacccgcgac ctactggaca ttgccgagcg 13380
catccaggag gccggcgcgg gcctgcgtag cctggcagag ccgtgggccg acaccaccac 13440
gccggccggc cgcatggtgt tgaccgtgtt cgccggcatt gccgagttcg agcgttccct 13500
aatcatcgac cgcacccgga gcgggcgcga ggccgccaag gcccgaggcg tgaagtttgg 13560
cccccgccct accctcaccc cggcacagat cgcgcacgcc cgcgagctga tcgaccagga 13620
aggccgcacc gtgaaagagg cggctgcact gcttggcgtg catcgctcga ccctgtaccg 13680
cgcacttgag cgcagcgagg aagtgacgcc caccgaggcc aggcggcgcg gtgccttccg 13740
tgaggacgca ttgaccgagg ccgacgccct ggcggccgcc gagaatgaac gccaagagga 13800
acaagcatga aaccgcacca ggacggccag gacgaaccgt ttttcattac cgaagagatc 13860
gaggcggaga tgatcgcggc cgggtacgtg ttcgagccgc ccgcgcacgt ctcaaccgtg 13920
cggctgcatg aaatcctggc cggtttgtct gatgccaagc tggcggcctg gccggccagc 13980
ttggccgctg aagaaaccga gcgccgccgt ctaaaaaggt gatgtgtatt tgagtaaaac 14040
agcttgcgtc atgcggtcgc tgcgtatatg atgcgatgag taaataaaca aatacgcaag 14100
gggaacgcat gaaggttatc gctgtactta accagaaagg cgggtcaggc aagacgacca 14160
tcgcaaccca tctagcccgc gccctgcaac tcgccggggc cgatgttctg ttagtcgatt 14220
ccgatcccca gggcagtgcc cgcgattggg cggccgtgcg ggaagatcaa ccgctaaccg 14280
ttgtcggcat cgaccgcccg acgattgacc gcgacgtgaa ggccatcggc cggcgcgact 14340
tcgtagtgat cgacggagcg ccccaggcgg cggacttggc tgtgtccgcg atcaaggcag 14400
ccgacttcgt gctgattccg gtgcagccaa gcccttacga catatgggcc accgccgacc 14460
tggtggagct ggttaagcag cgcattgagg tcacggatgg aaggctacaa gcggcctttg 14520
tcgtgtcgcg ggcgatcaaa ggcacgcgca tcggcggtga ggttgccgag gcgctggccg 14580
ggtacgagct gcccattctt gagtcccgta tcacgcagcg cgtgagctac ccaggcactg 14640
ccgccgccgg cacaaccgtt cttgaatcag aacccgaggg cgacgctgcc cgcgaggtcc 14700
aggcgctggc cgctgaaatt aaatcaaaac tcatttgagt taatgaggta aagagaaaat 14760
gagcaaaagc acaaacacgc taagtgccgg ccgtccgagc gcacgcagca gcaaggctgc 14820
aacgttggcc agcctggcag acacgccagc catgaagcgg gtcaactttc agttgccggc 14880
ggaggatcac accaagctga agatgtacgc ggtacgccaa ggcaagacca ttaccgagct 14940
gctatctgaa tacatcgcgc agctaccaga gtaaatgagc aaatgaataa atgagtagat 15000
gaattttagc ggctaaagga ggcggcatgg aaaatcaaga acaaccaggc accgacgccg 15060
tggaatgccc catgtgtgga ggaacgggcg gttggccagg cgtaagcggc tgggttgcct 15120
gccggccctg caatggcact ggaaccccca agcccgagga atcggcgtga gcggtcgcaa 15180
accatccggc ccggtacaaa tcggcgcggc gctgggtgat gacctggtgg agaagttgaa 15240
ggccgcgcag gccgcccagc ggcaacgcat cgaggcagaa gcacgccccg gtgaatcgtg 15300
gcaagcggcc gctgatcgaa tccgcaaaga atcccggcaa ccgccggcag ccggtgcgcc 15360
gtcgattagg aagccgccca agggcgacga gcaaccagat tttttcgttc cgatgctcta 15420
tgacgtgggc acccgcgata gtcgcagcat catggacgtg gccgttttcc gtctgtcgaa 15480
gcgtgaccga cgagctggcg aggtgatccg ctacgagctt ccagacgggc acgtagaggt 15540
ttccgcaggg ccggccggca tggccagtgt gtgggattac gacctggtac tgatggcggt 15600
ttcccatcta accgaatcca tgaaccgata ccgggaaggg aagggagaca agcccggccg 15660
cgtgttccgt ccacacgttg cggacgtact caagttctgc cggcgagccg atggcggaaa 15720
gcagaaagac gacctggtag aaacctgcat tcggttaaac accacgcacg ttgccatgca 15780
gcgtacgaag aaggccaaga acggccgcct ggtgacggta tccgagggtg aagccttgat 15840
tagccgctac aagatcgtaa agagcgaaac cgggcggccg gagtacatcg agatcgagct 15900
agctgattgg atgtaccgcg agatcacaga aggcaagaac ccggacgtgc tgacggttca 15960
ccccgattac tttttgatcg atcccggcat cggccgtttt ctctaccgcc tggcacgccg 16020
cgccgcaggc aaggcagaag ccagatggtt gttcaagacg atctacgaac gcagtggcag 16080
cgccggagag ttcaagaagt tctgtttcac cgtgcgcaag ctgatcgggt caaatgacct 16140
gccggagtac gatttgaagg aggaggcggg gcaggctggc ccgatcctag tcatgcgcta 16200
ccgcaacctg atcgagggcg aagcatccgc cggttcctaa tgtacggagc agatgctagg 16260
gcaaattgcc ctagcagggg aaaaaggtcg aaaagttctc tttcctgtgg atagcacgta 16320
cattgggaac ccaaagccgt acattgggaa ccggaacccg tacattggga acccaaagcc 16380
gtacattggg aaccggtcac acatgtaagt gactgatata aaagagaaaa aaggcgattt 16440
ttccgcctaa aactctttaa aacttattaa aactcttaaa acccgcctgg cctgtgcata 16500
actgtctggc cagcgcacag ccgaagagct gcaaaaagcg cctacccttc ggtcgctgcg 16560
ctccctacgc cccgccgctt cgcgtcggcc tatcgcggcc gctggccgct caaaaatggc 16620
tggcctacgg ccaggcaatc taccagggcg cggacaagcc gcgccgtcgc cactcgaccg 16680
ccggcgccca catcaaggca ccctgcctcg cgcgtttcgg tgatgacggt gaaaacctct 16740
gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc gggagcagac 16800
aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg gggcgcagcc atgacccagt 16860
cacgtagcga tagcggagtg tatactggct taactatgcg gcatcagagc agattgtact 16920
gagagtgcac catatgcggt gtgaaatacc gcacagatgc gtaaggagaa aataccgcat 16980
caggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg 17040
agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc 17100
aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt 17160
gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag 17220
tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc 17280
cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc 17340
ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt 17400
cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt 17460
atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc 17520
agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa 17580
gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg ctctgctgaa 17640
gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg 17700
tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga 17760
agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg 17820
gattttggtc atgcattcta ggtactaaaa caattcatcc agtaaaatat aatattttat 17880
tttctcccaa tcaggcttga tccccagtaa gtcaaaaaat agctcgacat actgttcttc 17940
cccgatatcc tccctgatcg accggacgca gaaggcaatg tcataccact tgtccgccct 18000
gccgcttctc ccaagatcaa taaagccact tactttgcca tctttcacaa agatgttgct 18060
gtctcccagg tcgccgtggg aaaagacaag ttcctcttcg ggcttttccg tctttaaaaa 18120
atcatacagc tcgcgcggat ctttaaatgg agtgtcttct tcccagtttt cgcaatccac 18180
atcggccaga tcgttattca gtaagtaatc caattcggct aagcggctgt ctaagctatt 18240
cgtataggga caatccgata tgtcgatgga gtgaaagagc ctgatgcact ccgcatacag 18300
ctcgataatc ttttcagggc tttgttcatc ttcatactct tccgagcaaa ggacgccatc 18360
ggcctcactc atgagcagat tgctccagcc atcatgccgt tcaaagtgca ggacctttgg 18420
aacaggcagc tttccttcca gccatagcat catgtccttt tcccgttcca catcataggt 18480
ggtcccttta taccggctgt ccgtcatttt taaatatagg ttttcatttt ctcccaccag 18540
cttatatacc ttagcaggag acattccttc cgtatctttt acgcagcggt atttttcgat 18600
cagttttttc aattccggtg atattctcat tttagccatt tattatttcc ttcctctttt 18660
ctacagtatt taaagatacc ccaagaagct aattataaca agacgaactc caattcactg 18720
ttccttgcat tctaaaacct taaataccag aaaacagctt tttcaaagtt gttttcaaag 18780
ttggcgtata acatagtatc gacggagccg attttgaaac cgcggtgatc acaggcagca 18840
acgctctgtc atcgttacaa tcaacatgct accctccgcg agatcatccg tgtttcaaac 18900
ccggcagctt agttgccgtt cttccgaata gcatcggtaa catgagcaaa gtctgccgcc 18960
ttacaacggc tctcccgctg acgccgtccc ggactgatgg gctgcctgta tcgagtggtg 19020
attttgtgcc gagctgccgg tcggggagct gttggctggc tgg 19063

Claims (6)

1.一种用于植物的碱基编辑方法,为方法G1或方法G2:
所述方法G1包括如下步骤:在受体中表达sgRNA,Cas9蛋白,氨基酸序列为SEQ ID NO:1–2的任一序列的来源于植物的胞嘧啶脱氨酶,从而对所述受体基因组中的靶基因进行碱基编辑;
所述方法G2包括如下步骤:在受体中表达sgRNA,Cas9蛋白,氨基酸序列为SEQ ID NO:1–2的任一序列的来源于植物的胞嘧啶脱氨酶和尿嘧啶糖苷酶抑制剂,从而对所述受体基因组中的靶基因进行碱基编辑;
所述方法可以对目标核苷酸的碱基进行编辑,使其发生C到T的替换;
所述受体为水稻。
2.如权利要求1所述的方法,其特征在于:
所述方法G1中,“在受体中表达sgRNA,Cas9蛋白和氨基酸序列为SEQ ID NO:1–2的任一序列的来源于植物的胞嘧啶脱氨酶”是通过将所述sgRNA的编码基因,所述Cas9蛋白的编码基因和所述胞嘧啶脱氨酶的编码基因导入受体中实现。
3.如权利要求1所述的方法,其特征在于:
所述方法G2中,“在受体中表达sgRNA,Cas9蛋白和氨基酸序列为SEQ ID NO:1–2的任一序列的来源于植物的胞嘧啶脱氨酶和尿嘧啶糖苷酶抑制剂”是通过将所述sgRNA的编码基因,所述Cas9蛋白的编码基因,所述胞嘧啶脱氨酶的编码基因和所述尿嘧啶糖苷酶抑制剂的编码基因导入受体中实现。
4.如权利要求1-3任一所述的方法,其特征在于:所述受体为植物愈伤组织。
5.一个用于植物的碱基编辑载体,其特征在于:包括sgRNA的编码基因,Cas9蛋白的编码基因,氨基酸序列为SEQ ID NO:1–2的任一序列的来源于植物的胞嘧啶脱氨酶的编码基因的重组载体,所述植物为水稻。
6.一个用于植物的碱基编辑载体,其特征在于:包括sgRNA的编码基因,Cas9蛋白的编码基因,氨基酸序列为SEQ ID NO:1–2的任一序列的来源于植物的胞嘧啶脱氨酶的编码基因和尿嘧啶糖苷酶抑制剂的编码基因的重组载体,所述植物为水稻。
CN201910584598.6A 2019-07-01 2019-07-01 一组来源于植物的胞嘧啶脱氨酶和其在碱基编辑系统中的应用 Active CN112239756B (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910584598.6A CN112239756B (zh) 2019-07-01 2019-07-01 一组来源于植物的胞嘧啶脱氨酶和其在碱基编辑系统中的应用

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910584598.6A CN112239756B (zh) 2019-07-01 2019-07-01 一组来源于植物的胞嘧啶脱氨酶和其在碱基编辑系统中的应用

Publications (2)

Publication Number Publication Date
CN112239756A CN112239756A (zh) 2021-01-19
CN112239756B true CN112239756B (zh) 2022-04-19

Family

ID=74175408

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910584598.6A Active CN112239756B (zh) 2019-07-01 2019-07-01 一组来源于植物的胞嘧啶脱氨酶和其在碱基编辑系统中的应用

Country Status (1)

Country Link
CN (1) CN112239756B (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114686456B (zh) * 2022-05-10 2023-02-17 中山大学 基于双分子脱氨酶互补的碱基编辑系统及其应用

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108070611A (zh) * 2016-11-14 2018-05-25 中国科学院遗传与发育生物学研究所 植物碱基编辑方法
WO2018099256A1 (zh) * 2016-12-01 2018-06-07 中国农业科学院作物科学研究所 一种CRISPR/nCas9介导的定点碱基替换在植物中的应用

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11268082B2 (en) * 2017-03-23 2022-03-08 President And Fellows Of Harvard College Nucleobase editors comprising nucleic acid programmable DNA binding proteins

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108070611A (zh) * 2016-11-14 2018-05-25 中国科学院遗传与发育生物学研究所 植物碱基编辑方法
WO2018099256A1 (zh) * 2016-12-01 2018-06-07 中国农业科学院作物科学研究所 一种CRISPR/nCas9介导的定点碱基替换在植物中的应用

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
cytidine deaminase 1 [Arabidopsis thaliana],登录号:NP_179547.1;LIN X.,et al.;《Genbank》;20190214;全文 *
Cytidine deaminase 1[Oryza sativa Japonica group],登录号:XP_015621794.1;佚名;《Genbank》;20180807;全文 *
Improved base editor for efficiently inducing genetic variations in rice with crispr/CAS9-guided hyperactive hAID mutant;BIN BEN等;《MOL PLANT.》;20180402;第11卷(第4期);623-626 *

Also Published As

Publication number Publication date
CN112239756A (zh) 2021-01-19

Similar Documents

Publication Publication Date Title
CN108486146B (zh) LbCpf1-RR突变体用于CRISPR/Cpf1系统在植物基因编辑中的应用
CN108546712B (zh) 一种利用CRISPR/LbCpf1系统实现目的基因在植物中同源重组的方法
AU2016315655A1 (en) Ochrobactrum-mediated transformation of plants
CA2734798C (en) Engineering of bolting resistance in sugar beet by means of the transgenic expression of the beet homologue of flowering time control gene ft
CN108064297B (zh) 小麦育性相关基因TaMS7及其应用方法
MX2013001191A (es) Cepas de agrobacterium modificadas para incrementar la frecuencia de transformacion de plantas.
JP2005517447A (ja) Tn5結合Cre/loxP切除システムによる最小化ゲノムを含む新規菌株の構築
AU2016350610A1 (en) Methods and compositions of improved plant transformation
KR20100085930A (ko) 거대 게놈 dna의 포획 및 변형 및 합성 엽록체를 갖는 생물의 구성을 위한 시스템
CN110607320A (zh) 一种植物基因组定向碱基编辑骨架载体及其应用
CN112359057B (zh) CRISPR/Cas12a基因编辑系统在84K杨树基因编辑中的应用
CN113621642A (zh) 一种用于农作物杂交育种制种的遗传智能化育制种系统及其应用
CN112239756B (zh) 一组来源于植物的胞嘧啶脱氨酶和其在碱基编辑系统中的应用
CN109022285B (zh) 一种提高集胞藻pcc6803铵盐耐受能力的方法与应用
CN110669775B (zh) 差异代理技术在a·g碱基替换细胞富集中的应用
CN111518735A (zh) 一种谷氨酸棒状杆菌重组菌、制备方法和应用
Hansen et al. The genes for the eight subunits of the membrane bound ATP synthase of Escherichia coli
CN110951702B (zh) 水稻DMNT和TMTT合成相关蛋白OsCYP92C21及其编码基因与应用
CN109694841A (zh) 一种谷氨酸棒状杆菌重组菌、制备方法和应用
CN110818784B (zh) 水稻基因OsATL15在调节农药的吸收转运中的应用
CN112553246A (zh) 一种基于CRISPR-SaCas9系统的高效基因组编辑载体及其应用
CN113631715A (zh) 新的小麦cenh3等位基因
CN113943748B (zh) 一种丁香假单胞菌中的重组系统及应用
CN112566492A (zh) 人工标志物等位基因
GB2475435A (en) Producing a product using shuttle vectors containing essential chloroplast genes

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 102206 No.27, shengshengyuan Road, Changping District, Beijing

Applicant after: Keji Dalong (Beijing) Biotechnology Co.,Ltd.

Address before: 102206 No.27, shengshengyuan Road, Changping District, Beijing

Applicant before: Keji Dalong Biotechnology Co.,Ltd.

CB02 Change of applicant information
GR01 Patent grant
GR01 Patent grant