CN114667292A - 用生长调节因子(grf)、grf相互作用因子(gif)或嵌合grf-gif改进植物再生的方法 - Google Patents

用生长调节因子(grf)、grf相互作用因子(gif)或嵌合grf-gif改进植物再生的方法 Download PDF

Info

Publication number
CN114667292A
CN114667292A CN202080062752.4A CN202080062752A CN114667292A CN 114667292 A CN114667292 A CN 114667292A CN 202080062752 A CN202080062752 A CN 202080062752A CN 114667292 A CN114667292 A CN 114667292A
Authority
CN
China
Prior art keywords
grf
plant
gif
polypeptide
seq
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202080062752.4A
Other languages
English (en)
Inventor
J·杜卜科夫斯基
J·M·德贝纳尔迪
D·特里科利
J·帕拉特尼克
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National Science And Technology Research Commission
University of California
Original Assignee
National Science And Technology Research Commission
University of California
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National Science And Technology Research Commission, University of California filed Critical National Science And Technology Research Commission
Publication of CN114667292A publication Critical patent/CN114667292A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/415Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/475Growth factors; Growth regulators
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8201Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
    • C12N15/8209Selection, visualisation of transformants, reporter constructs, e.g. antibiotic resistance markers
    • C12N15/821Non-antibiotic resistance markers, e.g. morphogenetic, metabolic markers
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8261Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
    • C12N15/8262Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield involving plant development
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8261Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
    • C12N15/8291Hormone-influenced development
    • C12N15/8295Cytokinins
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A40/00Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
    • Y02A40/10Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
    • Y02A40/146Genetically Modified [GMO] plants, e.g. transgenic plants

Landscapes

  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Molecular Biology (AREA)
  • Zoology (AREA)
  • Biomedical Technology (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Wood Science & Technology (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Microbiology (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Cell Biology (AREA)
  • Endocrinology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Medicinal Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Toxicology (AREA)
  • Botany (AREA)
  • Peptides Or Proteins (AREA)
  • Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

公开了具有改进的再生效率的产生植物的方法,所述方法使用生长调节因子(GRF)、GRF相互作用因子(GIF)或嵌合GRF‑GIF基因和蛋白。本公开还以改进的再生效率提供通过所公开的方法产生的植物,在植物再生中减少外源细胞分裂素使用的方法,以及提高植物再生效率的方法。

Description

用生长调节因子(GRF)、GRF相互作用因子(GIF)或嵌合GRF- GIF改进植物再生的方法
相关申请的引用
本申请要求先前在2019年7月11日提交的共同未决申请USSN 62/873,123的优先权,其内容通过引用全文纳入本文。
关于联邦资助研究的声明
本发明在美国农业部、国家粮食和农业研究所授予的资助号2017-67007-25939下由政府支持完成。政府对本发明拥有一定权力。
序列表
本发明包含以ASCII格式经EFS-Web提交的序列,并通过引用全文纳入本文。所述ASCII拷贝创建于2020年5月27日,名为P1322WO00_SEQLISTING_06-17-2020_ST25且大小为385,059字节。
背景技术
现在,植物育种和基因工程已经使用了好几十年以改善植物性能和提供植物尤其是农作物的农业改良。植物转化技术的一个缺点是某些植物的再生效率低,包括选择植物属和种且甚至植物种内特定基因型的低效率。因此,需要改进植物的再生效率。
发明概述
本方法要提高植物再生效率。所述方法采用向一个或多个植物细胞的引入生长调节因子(GRF)和/或GRF相互作用因子蛋白(GIF)(其引入植物细胞),或引入植物细胞的GRF-GIF嵌合体。引入编码多肽的核酸分子或引入所述多肽提高再生效率。在某些实施方案中,所述方法提高低再生效率植物的再生效率。其他实施方案加速产生转基因植物的时间。提供实施方案用于突变GRF序列的miR396靶区域,以降低植物细胞中GRF蛋白受miR396的阻遏。还提供其他实施方案,其中植物细胞内存在GRF和GIF或GRF-GIF嵌合多肽,允许细胞在培养基上再生,该培养基的细胞分裂素浓度对于不包含所引入的多肽的植物细胞再生而言过低。可在这类培养基上选择植物,而不需额外标记序列。在某些实施方案中,用GRF和/或GIF或GRF-GIF转化的植物具有高再生效率,用于后续转化实验。GRF和/或GRF或GRF-GIF嵌合体与诱导型启动子或结构域可操作连接,还允许选择多肽表达或活性的时间。在某些实施方案中,用GRF和/或GIF或GRF-GIF转化的植物具有高再生效率,用于后续转化实验。还提供其他实施方案,其中GRF、GIF或GRF-GIF嵌合体与基因编辑技术组合,且GRF、GIF或GRF-GIF嵌合体以及基因编辑构建体可通过经编辑的植物后代中的隔离(segregation)来移除。
技术领域
一般而言,本公开涉及植物再生方法。更具体地,本公开涉及通过用生长调节因子(GRF)和/或GRF相互作用因子(GIF)或GRF-GIF嵌合体转化改进植物再生效率的方法。
附图简要说明
图1.A-C)用邻接法的分子系统发育分析。A)拟南芥(Arabidopsis)、小鼠和小麦GRF蛋白。B)拟南芥、小鼠和小麦GIF蛋白。C)通过小麦UBIQUITIN启动子表达的小麦GRF4-GIF1嵌合体方案。
图2.细胞分裂素再生培养基中的愈伤组织。在Ubi::GRF4-GIF1嵌合体(SEQ IDNO:5,中间)存在下于Kronos转化期间再生的芽(shoot)数量高于用同一载体但没有GRF-GIF嵌合体的Kronos转化(如两侧所示)。
图3.A)通过PCR验证26个独立转化事件。用引物Fw-GRF4b和Rev-GIF1b获得PCR产物,使用分离自26株独立T0植物的基因组DNA作为模板。各数字对应于独立转化事件。L=DNA梯状长度标记(ladder)。B)GRF4-GIF1嵌合体和用于基因分型的引物的示意图。扩增片段是1.087Kb。
图4.细胞分裂素再生培养基中的愈伤组织。Ubi::GIF1(SEQ ID NO:2)存在下小麦转化期间的芽再生,和使用同一载体但没有GIF1基因的小麦转化中的芽再生。
图5.细胞分裂素再生培养基中的愈伤组织。用Ubi::GRF4(SEQ ID NO:1),Ubi::GIF1(SEQ ID NO:2),Ubi::GRF4-GIF1嵌合体(SEQ ID NO:5)转化的植物细胞的再生效率与对照作比较。
图6.GRF4-GIF1嵌合体对从叶片外植体的再生的影响。
图7.A)小麦转化不同步骤的示意图。B)GRF4-GIF1嵌合体对没有细胞分裂素情况下诱导胚发生的影响。
图8.由小麦UBIQUITIN启动子(SEQ ID NO:59)表达的小麦GRF4-GIF1嵌合体方案。以下是野生型miR396靶位点(顶部(SEQ ID NO:53)、带有沉默突变的miR396-抗性位点(底部(SEQ ID NO:54)和其与miR396相互作用(中间(SEQ ID NO:55))的序列。
图9.A)用构建体转化的植物细胞的芽再生,所述构建体包含GRF4-GIF1嵌合体、Cas9和靶向基因Q的引导RNA(gRNA)。B)用引导RNA靶向的基因Q区域(SEQ ID NO:68)。C)确认靶基因的编辑。
图10.用GRF4-GIF1嵌合体转化的植物细胞的芽再生,其在十分之一的正常细胞分裂素浓度生长。
图11.小麦嵌合克隆的编码序列。A)小麦GRF4-GIF1的核苷酸序列。序列来自四倍体小麦Kronos。B)小麦GRF4-GIF1编码蛋白的序列。蓝色的序列来自GRF4,黑色为间隔子,绿色为GIF1。
图12.pLC41-Ubi::GRF4-GIF1的序列(SEQ ID NO:5)。GRF4序列采用蓝色字母,间隔子是黑色的,GIF序列采用绿色字母。Ubi启动子以灰色突出显示,HA标签为红色,NOS终止子为粉色,35S启动子为绿色,HPT为黄色。LB(GTTTACACCACAATATATCCTGCCA)(SEQ ID NO:57)和RB(GTTTACCCGCCAATATATCCTGTCA)(SEQ ID NO:58)序列带有下划线。
图13.酿酒葡萄(Vitis vinifera)GRF4-GIF1嵌合体的序列。A)葡萄(Vitis)GRF4-GIF1的核苷酸序列(SEQ ID NO:6)。B)葡萄GRF4-GIF1编码的蛋白的序列(SEQ ID NO:7)。蓝色的序列来自GRF4,黑色为间隔子,绿色为GIF1。
图14.A)串联的QLQ-WRC结构域的示范性序列,用于BLASTP检索和系统发育分析。B)5个选定小麦GRF的预测的蛋白序列(SEQ ID NO:9-13)。保守QLQ和WRC结构域分别以黄色和绿色突出显示。
图15.5个水稻GRF直系同源物的预测的蛋白序列(SEQ ID NO:14-18)。保守QLQ和WRC结构域分别以黄色和绿色突出显示。
图16显示A)最接近的拟南芥GRF的预测的序列(SEQ ID NO:19-22)和B)最接近的葡萄的预测的蛋白序列(SEQ ID NO:23)。保守QLQ和WRC结构域分别以黄色和绿色突出显示。
图17显示A)最接近的拟南芥GRF的预测的序列(SEQ ID NO:24、SEQ ID NO:25和SEQ ID NO:26)以及B)显示预测的小麦GIF序列(SEQ ID NO:27、SEQ ID NO:28和SEQ IDNO:29)。保守SNH结构域以黄色突出显示。
图18显示用单一小麦GRF4或GIF1的小麦转化效率。A)不同盒的方案,其用于表达单独的小麦GIF1和GRF4、同一T-DNA中的单独的GRF4和GIF1以及GRF4-GIF1嵌合体。B-D)转化有小麦GRF4-GIF1嵌合体的转基因Kronos植株相对于转化有以下的Kronos植株的再生频率:B)空载体(n=14,****P<0.0001)。C)对照载体和仅含GIF1或仅含GRF4的载体(n=5,不同字母指示在P<0.05的显著差异性,图基检验)。D)包括在不同Ubi启动子下的GRF4和GIF1的载体(未融合,n=5,**P<0.0144)。
图19显示小麦(黄色突出显示)、水稻、拟南芥、柑橘和葡萄GRF和GIF家族的系统树。小麦GRF4和GIF1的最接近的直系同源物,柑橘以橙色突出且葡萄以紫色突出显示。我们把这些基因和其编码的蛋白称为柑橘GRF4(Ciclev10032065m)、柑橘GIF1(Ciclev10022144m)以及葡萄GRF4(GSVIVT01024326001)和葡萄GIF1(GSVIVT01036262001)。A)我们使用QLQ和WRC结构域分析GRF蛋白和B)使用SNH结构域分析GIF蛋白。进化史用最大似然法推断。我们用最高对数似然值显示树。在分支旁显示关联的分类群聚集在一起的树百分比。我们用程序MEGA X进行进化分析。黄色突出显示:小麦。橙色突出:选定的柑橘直系同源物。紫色突出显示:选定的葡萄直系同源物。
图20显示小麦转化,采用包括其他GRF和/或GIF序列的嵌合体。A)检测的GRF-GIF1嵌合体方案。B)嵌合体诱导的再生效率,所述嵌合体将不同小麦GRF基因与GIF1组合。平均值基于3次实验,除了GRF5(2次实验可用)。柱上的不同字母指示图基检验中与对照的显著差异(P<0.05)。仅GRF4-GIF1和GRF5-GIF1显著不同于对照。星号指示组合的GRF4-GIF1和GRF5-GIF1嵌合体(进化相关)相较于组合的GRFGRF1-GIF1和GRF9-GIF1嵌合体(相关性更远,图19A),再生效率的显著差异(P=0.0368)。C)检测的GRF4-GIF嵌合体方案。D)转基因Kronos植株的再生效率,所述植株用与GIF1、GIF2或GIF3融合的GRF4嵌合体转化(3次实验,对比嵌合体带有GIF1vs.GIF2和GIF3P=0.0046)。柱上的不同字母指示显著差异(P<0.05,图基检验)。误差棒是s.e.m。接种的胚的数目在构建体下标示。残差的常态通过Shapiro-Wilk检验确认且同方差性通过Levene检验确认(原始数据可获自表7)。
图21显示GRF4-GIF1嵌合体对不同基因型中再生效率的影响。A)代表性转化显示在不同小麦和黑小麦基因型中,GRF4-GIF1嵌合体存在下的再生芽频率高于对照(空载体)。B)在四倍体和六倍体小麦及黑小麦育种系UC3190的不同栽培品种中,GRF4-GIF1 vs对照的再生效率。接种的胚的数目在名称下标示且频率在柱顶部标示。原始数据在表9中。
图22显示A)诱导型小麦GRF4-GR-GIF1嵌合体的方案,其带有在GRF4-GIF1中间的大鼠糖皮质激素受体(GR)(SEQ ID NO:32、33)。C)含有再生培养基中的经转化的Kronos胚的平板图片,没有DEX(-dex)或存在10uM DEX(+dex)。DEX的存在诱导GRF4-GR-GIF1活性并显著增加再生效率。
图23显示GRF4-GIF1嵌合体在没有细胞分裂素情况下诱导胚发生。A)不同小麦转化步骤的示意图。B).在没有潮霉素的生长素培养基中的代表性愈伤组织。注意在没有细胞分裂素情况下用小麦GRF4-GIF1嵌合体转化的愈伤组织中生长的绿芽(红色箭头)。对照:pLC41。C)转基因特异性PCR产物(箭头)显示在4个再生自对照的植株中没有转基因植株,而9个再生自GRF4-GIF1嵌合体的植株中有5个转基因植株。一对特异于T-DNA的引物用于PCR。
图24显示在没有外源细胞分裂素情况下,GRF4-GIF1嵌合体对再生效率的影响。小麦幼胚来自GRF4-GIF1转基因Kronos T1植株和隔离的非转基因T1姊妹系,其在标准转化操作后进行处理,排除农杆菌接种和向平板加入潮霉素。最后一个步骤中,愈伤组织转移至再生培养基,没有细胞分裂素。GRF4-GIF转基因植株中再生绿芽的愈伤组织数目(27个中的21个)显著高于非转基因姊妹对照(26个中的3个)。代表性平板的图片显示无细胞分裂素的再生培养基中的愈伤组织。
图25显示相对于在UC Davis转化设施的正常小麦转化方案,用GRF4-GIF1嵌合体的小麦转化操作加速。用GRF4-GIF1嵌合体的方案更快,使全过程减少5周。
图26显示用GRF4-GIF1嵌合体转化柑橘。A)用空载体和柑橘GRF4-GIF1嵌合体转化柑橘上胚轴(农杆菌接种后60d)。B)用空载体和柑橘GRF4-GIF1及miR396抗性GRF4-GIF1(rGRF4-GIF1)嵌合体转化柑橘上胚轴(农杆菌接种后120d)。C)葡萄GRF4-GIF1嵌合体的方案,显示miR396靶位点和其与下面miR396的相互作用。在miR396抗性rGRF4-GIF1形式中,引入沉默突变(红色)以减少与miR396的相互作用。D)3次柑橘实验的统计学比较。不同字母指示图基检验中的显著差异(P<0.05)。顶部的水平线指示对照与GRF-GIF构建体之间的显著差异(P=0.0153)。
图27显示用rGRF4-GIF1嵌合体转化葡萄。照片是用空载体和葡萄rGRF4-GIF1嵌合体转化的葡萄正在再生的愈伤组织。
图28显示用GRF4.1-GIF1.1嵌合体转化甜椒(番椒(Capsicum annuum))栽培品种R&C Cayenne,所述嵌合体包括分别与小麦GRF4=甜椒LOC107869915(SEQ ID NO:138)和小麦GIF1=甜椒LOC107870303(SEQ ID NO:139)最接近的番椒直系同源物。图片对应于实验#201027/28,显示用GRF4.1-GIF1.1嵌合体转化的甜椒子叶块中的再生效率(42个中的10个=23.8%)相较于用空对照载体转化的那些(40个中的2个=5.0%)增加>4倍(表11,方法6)。
发明详述
本公开提供使用生长调节因子蛋白(GRF)和GRF相互作用因子蛋白(GIF)(单独或组合或作为融合嵌合体)提高植物再生效率的方法。GRF基因属于的保守植物特异性转录因子(TF)家族(van der Knaap等,2000;Kim等2003)。GRF家族定义为存在结构域QLQ和WRC,其分别介导蛋白-蛋白和蛋白-DNA相互作用(Kim等,2003,Kim和Kende 2004,Horiguchi等,2005)。GRF TF在陆生植物中高度保守,并在双子叶植物、单子叶植物、裸子植物和藓类植物中鉴定了GRF基因(Omidbakhshfard等,2015)。这些基因还是微小RNA miR396的保守靶标(Debernardi等,2012)。GRF能控制许多植物器官的大小,并通过增加发育器官的细胞增殖而用作生长促进物(Rodriguez等,2010)。GRF中的功能缺失突变或通过miR396过表达而下调,能显著降低诸如拟南芥和水稻等种类中的植物大小(Horiguchi等,2005;Kim等,2003;Kim和Kende,2004;Wang等,2011;Liu等,2009;Rodriguez等,2010;Li S等,2016)。另一方面,GRF活性提高可在拟南芥水稻、小麦和芸苔属植物等中生成更大的器官,包括更大的叶、谷粒和根(Horiguchi等,2005;Rodriguez等,2010;Debernardi等,2014;Beltramino等,2018;Li,S.等.2018)。
GRF蛋白能形成复合体并与GIF基因家族成员编码的蛋白共同起作用。GIF蛋白不具有DNA结合域(Kim和Kende,2004),但已证明GIF蛋白与GRF和染色质重塑复合体在体内相互作用(Debernardi等,2014;Vercruyssen et al.,2014)。基于此观察结果,推测GIF能用作辅助活化剂将染色质重塑复合体带到GRF识别的DNA序列(Debernardi等,2014,Vercruyssen等,2014)。GIF基因的突变模拟GRF功能缺失中观察到的大部分表型,而GIF过表达能促进器官生长且增强GRF活性(Kim和Kende 2004,Horiguchi等,2005;He等,2017;Shimano等,2018;Zhang等,2018;Debernardi等,2014)。先前观察到当拟南芥GRF3和GIF1一起表达为嵌合体时,其能促进叶片尺寸相对于个体单独基因增加较大(专利WO 2013/102762 A1)。
在一个实施方案中,所述引入GRF和/或GIF序列的植物是转基因的,即其具有引入的异源核酸分子或多肽,或其具有通过任何可用技术编辑的基因组,包括本文提供的示例。这种异源核酸分子或多肽是未天然发现紧挨相邻核酸分子的那些,或者其中多肽水平高于或低于不含异源核酸分子或多肽的植物。另一示例中,所述核酸分子或多肽可来自另一生物。当涉及异源核酸分子时,其包括连接启动子、未天然与启动子序列一起出现和/或经人工干预在基因组基因座中修饰的这类分子。示例中的核苷酸序列与启动子序列异源,但其可来自任何来源,可与植物宿主同源或天然且发现在植物细胞中天然出现,或与植物宿主异源或是外来的。使用本文所述方法,转化有GRF-GIF嵌合体或GRF和/或GIF转基因以及异源核酸分子或多肽的植物的再生效率提高。本公开提供可通过显著增加植物再生效率来大幅扩增植物物种数目的方法,该植物物种适合有效转化技术。本文所提供方法还可用于扩增基因型,其能在某一作物内转化。例如,在小麦中,仅一些栽培品种能够有效转化,包括Bobwhite、Fielder和Kronos,且这些仍显示低再生效率。转化不同基因型的能力对于育种应用而言是重要的,因为直接转化高生产基因型的能力可以不再需要昂贵且冗长的回交过程。本文所提供的方法可用于加速从叶外植体而不是胚产生转基因植物。
在某些实施方案中,本公开提供再生效率改进的产生植物的方法。此方法包括步骤:(1)用一种或多种核酸分子转化一个或多个植物细胞,其中一种或多种核酸分子编码GRF蛋白、GIF蛋白或GRF-GIF嵌合体;和(2)在再生培养基中培养所述一个或多个植物细胞。实施方案提供GRF和GIF核酸分子或多肽,所述核酸分子或多肽能分开引入或作为嵌合体引入植物。构建体可包括编码嵌合体的核酸分子或实施方案中可引入多个构建体。
所述一个或多个植物细胞可瞬时或稳定转化。细胞可衍生自任何植物组织,例如来自叶外植体、小孢子、胚珠等。细胞可以是原生质体。
实施方案使得从转化时间到生成转基因植物的时间大幅减少。生成这类植物的时间可加速,从而生成转基因植物的时间相较于GRF/GIF蛋白未引入植物情况下生成转基因植物的时间减少5天、10天、15天、20天、25天、30天或更多,或是介于两者之间的时间量。例如,在小麦中用所述蛋白和其编码核酸分子使转基因植物的产生从90天加速到60天。
术语植物或植物材料或植物部分在本文中广泛用于包括任意发育阶段的任何植物,或植物部分,包括植物扦插、植物细胞、植物细胞培养物、植物器官、植物种子和小植物。植物细胞是植物的结构和生理单元,包括原生质体和细胞壁。植物细胞可处于分离的单一细胞或细胞聚集物形式,如疏松型愈伤组织或培养细胞,或可以是更有序单元,例如植物组织、植物器官或植物。因此,植物细胞可以是原生质体、配子生成细胞或能再生成完整植物的细胞或细胞集合。如此,包括多个植物细胞且能够再生成全植物的种子,出于本公开目的被视作植物细胞。植物组织或植物器官可以是种子、原生质体、愈伤组织或任意其他植物细胞群,所述细胞组成结构或功能单元。特别有用的植物部分包括可收获部分和用于后代植物繁殖的部分。植物的可收获部分可以是植物的任意有用部分,例如花、花粉、幼苗、块茎、叶、茎、果实、种子、根等。用于繁殖的植物部分包括例如种子、果实、扦插、幼苗、块茎、砧木(rootstock)等。组织培养物优选能够再生植物。优选地,这类组织培养物中的可再生细胞是胚、原生质体、分生细胞、愈伤组织、花粉、叶、花药、根、根尖、须、花、核仁、穗、穗轴、外壳或秆。还提供从组织培养物中再生的植物。
再生效率改进或提高指从一个或多个细胞再生的植物细胞、组织或植物的数目增加,所述一个或多个细胞引入了GRF、GIF或GRF/GIF嵌合体。在一个实施方案中,所述芽再生和胚发生的数目增加。其他实施方案提供能在培养基上出现的再生,所述培养基的细胞分裂素浓度对于植物细胞再生而言过低。使用此类方法,能够选择含有GRF、GIF或GRF-GIF嵌合体的植物,而不需单独标记物以鉴定转化细胞。能在这类培养基上生长的植物包含所述分子或蛋白。在某些实施方案中,所述GRF蛋白可以是小麦GRFGRF1、GRF2GRF2、GRF3、GRF4、GRF5、GRF6或GRF9多肽或者其他植物物种中的相关蛋白。实施方案的一个示例中,所述GRF蛋白是小麦、水稻、拟南芥、葡萄、柑橘或辣椒GRF多肽。其他实施方案中,所述GRF蛋白是小麦GRF4或与小麦GRF4有显著相同性的蛋白,GIF是小麦GIF1或与小麦GIF1有显著相同性的蛋白。示范性实施方案中,所述GRF多肽可如SEQ ID NO:9-23和37-39中任一者所示。GIF序列示例是SEQ ID NO:24-30,43-44和52所示那些中的任一者。在一些实施方案中,所述GRF与上面所列序列至少约50%、至少约60%、至少约70%、至少约80%、至少约85%、至少约90%、至少约95%或至少约99%相同。在一些实施方案中,所述GIF与上面所列序列至少约50%、至少约60%、至少约70%、至少约80%、至少约90%、至少约95%或至少约99%相同。在一些实施方案中,所述一种或多种核酸分子编码嵌合GRF-GIF构建体,其中GRF部分与小麦GRF4对应部分或其他上面所列GRF序列至少约50%、至少约60%、至少约70%、至少约80%、至少约85%、至少约90%、至少约95%或至少约99%相同,GIF部分与上面GIF序列对应部分且在一个实施方案中与小麦GIF1序列至少约50%、至少约60%、至少约70%、至少约80%、至少约85%、至少约90%、至少约95%或至少约99%相同。在一些实施方案中,所述一种或多种核酸分子编码嵌合GRF4-GIF1构建体。
在一些实施方案中,所述GRF蛋白包含QLQ和WRC结构域,其与可在SEQ ID NO:9、SEQ ID NO:10、SEQ ID NO:11、SEQ ID NO:12、SEQ ID NO:13,SEQ ID NO:14、SEQ ID NO:15、SEQ ID NO:16、SEQ ID NO:17、SEQ ID NO:18、SEQ ID NO:19、SEQ ID NO:20、SEQ IDNO:21或SEQ ID NO:22中找到的至少一个所述结构域至少70%、至少80%、至少85%、至少90%、至少95%、至少99%相同。在其他实施方案中,所述GRF包含QLQ和WRC结构域,其与SEQID NO:69–83的至少之一的至少一种QLQ结构域和SEQ ID NO:84–95的至少之一的WRC结构域至少70%、至少85%、至少90%、至少95%、至少99%相同,所述结构域可以在SEQ ID NO:9、SEQ ID NO:10、SEQ ID NO:11、SEQ ID NO:12、SEQ ID NO:13、SEQ ID NO:14、SEQ ID NO:15、SEQ ID NO:16、SEQ ID NO:17、SEQ ID NO:18、SEQ ID NO:19、SEQ ID NO:20、SEQ IDNO:21或SEQ ID NO:22的全长GRF序列中找到。在示例中,QLQ结构域是如SEQ ID NO:69–83所示的那些,WRC结构域是如SEQ ID NO:84–95所示的和与其具有这种相同性的那些。
在一些实施方案中,所述GIF包含SNH结构域,其与SEQ ID NO:97–103中任一者至少70%、至少80%、至少85%、至少90%、至少95%、至少99%相同,所述SEQ ID NO:97–103中任一者在SEQ ID NO:24、SEQ ID NO:25、SEQ ID NO:26、SEQ ID NO:27、SEQ ID NO:28、orSEQ ID NO:29or SEQ ID NO:133(Os11g40100)、SEQ ID NO:134(Os12g31350)or SEQ IDNO:135(Os03g52320)的全长GIF序列中找到。这类结构域示例是在SEQ ID NO:97-103、136和137中找到的和与其具有这种相同性的那些。
在一些实施方案中,所述一种或多种编码GRF的核酸分子包含miR396识别位点中的一个或多个突变。其他实施方案提供的突变是沉默突变。其他实施方案提供减少GRF蛋白阻遏的突变。还有其他实施方案提供对miR396靶位点进行的突变,如SEQ ID NO:53野生型序列所示,且在一个示例中能生成SEQ ID NO:54的经修饰miR396。
术语核酸分子是指一个核酸分子,其可以是RNA分子以及DNA分子,且可以是编码所需多肽或蛋白的分子,而且还指不构成完整基因且不必定编码多肽或蛋白的核酸分子。例如,当用于同源重组过程时,启动子可置于构建体内,该构建体具有与植物染色体区域类似的序列,其可能不编码蛋白。若需要,感兴趣的核苷酸序列能对植物翻译优化,这是通过优化用于植物的密码子和就植物而言翻译起始位点周围的序列进行的。还能避免导致潜在mRNA不稳定的序列。
除非另有说明,特定核酸序列还暗示涵盖其保守修饰变体(如简并密码子置换)和互补序列以及明确所示序列。术语保守修饰变体适用于氨基酸和核酸序列。关于特定核酸序列,保守修饰变体指编码相同氨基酸序列或其保守修饰变体的那些核酸。由于遗传密码的简并性,大量功能相同的核酸编码任意给定蛋白。例如,密码子GCA、GCC、GCG和GCU都编码氨基酸丙氨酸。因此,在丙氨酸由密码子指定的每一个位置,该密码子可改变成所述任意对应密码子,而不需改变所编码多肽。这种核酸变化是沉默变化且代表一种保守修饰变化。本文的各核酸序列编码多肽,还通过参照遗传密码,描述核酸的每一种可能沉默变化。普通技术人员会认识到,核酸的各密码子(除了AUG,其通常是甲硫氨酸的唯一密码子;和UGG,其通常是色氨酸的唯一密码子)能修饰成产生功能相同分子。因此,多肽编码核酸的各沉默变化示于各所述多肽序列且在所述产物和过程范围内。
对于氨基酸序列,技术人员会认识到单独取代、缺失或加入核酸、肽、多肽或蛋白序列,会改变、添加或删除编码序列中的单一氨基酸或小部分氨基酸,这是“保守修饰变体”,在本文中称为“变体”,其中改变导致某一氨基酸被化学相似氨基酸取代。本领域熟知提供功能相似氨基酸的保守取代表。参见例如Davis等,"分子生物学基本方法(BasicMethods in Molecular Biology)"Appleton&Lange,康涅狄克州诺瓦克.(1994)。
下列8组各包含彼此保守取代的氨基酸:1)丙氨酸(A)、甘氨酸(G);2)天冬氨酸(D)、谷氨酸(E);3)天冬酰胺(N)、谷氨酰胺(Q);4)精氨酸(R)、赖氨酸(K);5)异亮氨酸(I)、亮氨酸(L)、甲硫氨酸(M)、缬氨酸(V);6)苯丙氨酸(F)、酪氨酸(Y)、色氨酸(W);7)丝氨酸(S)、苏氨酸(T);和8)半胱氨酸(C)、甲硫氨酸(M)(参见例如Creighton,《蛋白质:结构和分子性质》(Proteins:Structures and Molecular Properties)(WH Freeman&Co.;第2版(1993年12月))。
在一个实施方案中,所述编码同一物质的GRF多肽或其编码核酸分子选自小麦GRFGRF1、GRF2GRF2、GRF3、GRF4、GRF5、GRF6、GRF9多肽或其编码核酸分子或者其同源物。实施方案提供编码GIF多肽或其编码核酸分子,选自小麦GIF1、GIF2或GIF3多肽或其编码核酸分子或者其同源物。实施方案提供所述多肽或其编码核酸分子,其是小麦GRFGRF1、GRF2GRF2、GRF3、GRF4、GRF5、GRF6、GRF9多肽或其编码核酸分子或者这类小麦多肽或其编码核酸分子的同源物。其他实施方案提供所述多肽或其编码核酸分子,其是小麦GIF1、GIF2或GIF3多肽或其编码核酸分子或者其同源物。同源物是保留相同生物功能的不同种类中所遗传基因或多肽。GRF家族如本文所定义,是具有介导蛋白-蛋白和蛋白-DNA相互作用的保守QLQ和WRC结构域的转录因子,如本文所讨论,在陆生植物中高度保守并已在双子叶植物、单子叶植物、裸子植物和藓类植物中鉴定。GIF蛋白与GRF相互作用且具有保守SNH结构域。这类GRF和GIF多肽具有改进再生效率的特性,尤其是以嵌合蛋白组合时。本文提供这种同源物示例,如水稻、柑橘、葡萄、辣椒和拟南芥中发现的那些。本领域技术人员能容易地鉴定这类同源物。例如,NCBI提供在其他生物中搜索基因或蛋白的同源物。参见ncbi.nlm.nih.gov/homologene。此外,这种序列包括与本文所示GRF和GIF核酸分子或多肽序列在严格杂交条件下杂交的那些。
用于比较的序列比对方法为本领域熟知。因此,确定任意2种序列之间的相同性百分比,能用数学算法完成。
用于比较的最佳序列比对能采用本领域已知任何方式以分析序列相同性(同源性),如通过以下方法进行:称为“PILEUP”的渐进式比对方法(Morrison,(1997)Mol.Biol.Evol.14:428-441,作为应用PILEUP的示例);Smith&Waterman的局部同源性算法(Adv.Appl.Math.2:482(1981));Needleman&Wunsch的同源比对算法(J.Mol.Biol.48:443-453(1970));Pearson的相似性检索法(Proc.Natl.Acad.Sci.USA 85:2444(1988));这些算法的电脑化实施(如Wisconsin Genetics软件包中的GAP,BEST FIT,FASTA和TFASTA,GCG(Genetics Computer Group),575Science Dr.,威斯康星州麦迪逊);ClustalW(PC/Gene程序中的CLUSTAL,Intelligenetics,加利福尼亚州山景城,描述于例如Higgins(1988),Gene73:237-244;Corpet(1988),Nucleic Acids Res.16:10881-10890;Huang,ComputerApplications in the Biosciences 8:155-165(1992);和Pearson(1994),Methods inMol.Biol.24:307-331);Pfam(Sonnhammer(1998),Nucleic Acids Res.26:322-325);TreeAlign(Hein(1994),Methods Mol.Biol.25:349-364);MEG-ALIGN和SAM序列比对计算机程序;或人工目检。
适合确定序列相似性的另一示例是BLAST算法,其描述于Altschul等,(1990)J.Mol.Biol.215:403-410。Altschul,S.F.等的BLAST程序(基本局部比对搜索工具)在默认参数下搜索与BLAST“GENEMBL”数据库所含序列的相同性。能分析序列与GENEMBL所含全部公开可用DNA序列的相同性,使用默认参数下的BLASTN算法。
进行BLAST分析的软件通过National Center for Biotechnology Information,www.ncbi.nlm.nih.gov/可公开获得;就"PowerBLAST"变化而言还参见Zhang(1997),Genome Res.7:649-656。此算法涉及首先鉴定高得分序列对(HSP),这是通过在查询序列中鉴定长度W的短字,其与数据库序列中相同长度的字比对时,匹配或满足一些正值阈值得分T。T被称为邻域字得分阈值(Altschul等(1990),J.Mol.Biol.215:403-410)。这些初始邻域字命中用作起始搜索的种子以发现含有其的较长HSP。字命中在沿着各序列的2个方向延伸,只要累积比对得分能够增加。各方向的字命中延伸在以下情况时停止:累积比对得分比其最高实现值减少量X;累积得分达到零或以下,这归因于一个或多个负分残基比对的积累;或达到任一序列末端。BLAST算法参数W,T和X决定比对的灵敏度和速度。BLAST程序采用如下默认值:字长(W)为11,BLOSUM62打分矩阵(参见Henikoff(1992),Proc.Natl.Acad.Sci.USA 89:10915-10919)比对(B)为50,预期(E)为10,M=5,N=-4,和2条链的比较。术语BLAST指对2种序列之间相似性进行统计分析的BLAST算法;参见例如Karlin(1993),Proc.Natl.Acad.Sci.USA90:5873-5787。BLAST算法提供的一个相似性量度是最小概率和(P(N)),其指示2种核苷酸或氨基酸序列之间的匹配会偶然发生的可能性。例如,若测试核酸与参照核酸对比中的最小概率和小于约0.1,更优选小于约0.01,最优选小于约0.001,则核酸被视作与参照序列相似。
在一个实施方案中,能使用GAP(全局比对程序)。GAP使用Needleman和Wunsch算法(J.Mol.Biol.48:443-453,1970)以发现2种使匹配数最大且缺口数最小的完整序列。常用Wisconsin
Figure BDA0003535299770000081
(Accelrys公司(Accelrys,Inc.),加利福尼亚州圣地亚哥)版本10就蛋白序列而言默认缺口产生罚分值和缺口延伸罚分值,分别为8和2。对于核苷酸序列,默认缺口产生罚分是50,而默认缺口延伸罚分是3。相似性百分比是类似符号的百分比。忽视跨缺口的符号。当一对符号的打分矩阵值高于或等于0.50时,对相似性进行打分。通用打分系统是BLOSUM62矩阵(Henikoff和Henikoff(1993),Proteins 17:49-61),其目前是BLAST程序的默认选择。BLOSUM62使用3个矩阵的组合以覆盖所有意外事件。Altschul,J.Mol.Biol.36:290-300(1993)通过引用全文纳入本文,其是用于Wisconsin
Figure BDA0003535299770000082
(Accelrys公司,加利福尼亚州圣地亚哥)版本10的打分矩阵(参见Henikoff&Henikoff(1989)Proc.Natl.Acad.Sci.USA 89:10915)。
(c)如本文所用,2种核酸序列背景下的“序列相同性”或“相同性”指在指定比较窗口中就最大对应性比对时,2种序列内相同的残基。
(d)如本文所用,“序列相同性百分比”指通过在比较窗口对比2种最优比对序列而确定的值,其中比较窗口内的多核苷酸序列部分可包括相较参照序列(不包括添加或缺失)的添加或缺失(即缺口)以最优比对2种序列。百分比如下计算:确定相同核酸碱基在2种序列中出现的位置数以产生匹配位置数,将匹配位置除以比较窗口中的位置总数,该结果乘以100,产生序列相同性百分比。
涉及杂交技术时,所有或部分的已知核苷酸序列能用作探针,其选择性杂交来自选定生物的克隆基因组DNA片段或cDNA片段(即基因组或cDNA库)群中存在的其他对应核苷酸序列。杂交探针可以是基因组DNA片段、cDNA片段、RNA片段或其他寡核苷酸,且能用可检测基团如32P或任意其他可检测标志物标记。因此,例如,通过基于DNA序列标记合成寡核苷酸,能制备杂交探针。用于制备杂交探针和构建cDNA及基因组库的方法为本领域众所周知并公开(Sambrook等,2001)。
例如,本文公开的序列或其一个或多个部分,可用作能够与对应序列特异性杂交的探针。为在多种条件下实现特异性杂交,这类探针包括在待筛选序列中独特且优选长度为至少约10个核苷酸、最优选至少约20个核苷酸的序列。或者,这种序列可用于从选定植物中通过PCR扩增对应序列。此技术可用于从所需植物分离序列或作为诊断试验以确定植物中序列的存在。杂交技术包括DNA文库铺板成为噬斑或菌落的杂交筛选(Sambrook等,2001)。
这种序列的杂交可在严格条件下完成。“严格条件”或“严格杂交条件”是预定条件,在该条件下探针与其靶序列的杂交程度明显高于其他序列(如相比背景至少2倍)。严谨条件是序列依赖性且在不同环境下有差异。通过控制杂交严谨性和/或洗涤条件,能鉴定与探针100%互补的靶序列(同源探测)。或者,严谨条件能调整成允许序列的某些错配,从而检测到较低程度的相似性(异源探测)。一般,探针长度小于约1000个核苷酸,优选长度小于500个核苷酸。
通常,严格条件是盐浓度低于约1.5M Na离子的那些,一般在pH 7.0-8.3下约0.01-1.0M Na浓度(或其他盐),且温度就短探针(如10-50个核苷酸)而言是至少约30℃,就长探针(如大于50个核苷酸)而言是至少约60℃。严格条件还可通过加入去稳定剂如甲酰胺来实现。示范性低严谨条件包括用30-35%甲酰胺,1M NaC1,1%SDS(十二烷基磺酸钠)的缓冲溶液在37℃杂交,在1X-2X SSC(20X SSC=3.0M NaC1/0.3M柠檬酸三钠)中50-55℃洗涤。示范性中等严谨条件包括用40-45%甲酰胺,1M NaC1,1%SDS在37℃杂交,在0.5X-1X SSC中55-50℃洗涤。示范性高严谨条件包括用50%甲酰胺,1M NaC1,0.1%SDS在37℃杂交,在0.1X SSC中60-65℃洗涤。
特异性通常是杂交后洗涤的函数,关键因子是终洗涤溶液的离子强度和温度。对于DNA-DNA杂交体,Tm能从Meinkoth和Wahl等式中近似推出,Anal.Biochem.,138:267-284(1984):Tm=81.5℃+16.6(log M)+0.41(%GC)–0.61(%form)–500/L;其中M是一价阳离子的摩尔浓度,%GC是DNA中鸟苷和胞嘧啶核苷酸的百分比,%形式是杂交溶液中甲酰胺的百分比,L是碱基对中杂交体的长度。Tm是温度(在定义离子强度和pH下),在该处50%互补靶序列与完全匹配的探针杂交。就各1%错配而言,Tm降低约1℃;因此,Tm,杂交和/或洗涤条件能调整成与具有所需相同性的序列杂交。例如,若搜索~90%相同性的序列,Tm能减少10℃。一般,就确定离子强度和pH下的特异序列和其补体而言,严谨条件选择为比热熔点(Tm)低5℃。然而,非常严谨条件能采用在低于热熔点(Tm)1、2、3或4℃下的杂交和/或洗涤;中等严谨条件能采用在低于热熔点(Tm)6、7、8、9或10℃下的杂交和/或洗涤;低严谨条件能采用在低于热熔点(Tm)11、12、13、14、15或20℃下的杂交和/或洗涤。使用等式、杂交和洗涤组合物以及所需Tm,普通技术人员应理解杂交和/或洗涤溶液的严谨性变化如固有所描述。如果所需错配程度导致Tm小于45℃(水溶液)或32℃(甲酰胺溶液),优选增加SSC浓度,从而能使用更高温度。关于核酸杂交的广泛指南参见Tijssen(1993)《生物化学与分子生物学的实验室技术—用核酸探针的杂交》(Laboratory Techniques in Biochemistry and MolecularBiology--Hybridization with Nucleic Acid Probes),第I部分,第2章(纽约爱思唯尔(Elsevier));和Ausubel等编.(1995)《精编分子生物学实验指南》(Current Protocols inMolecular Biology),第2章(格林出版社(Greene Publishing)和Wiley-Interscience出版公司(Wiley-Interscience),纽约)。参见Sambrook等.(2001)《分子克隆:实验室手册》(Molecular Cloning:A Laboratory Manual)(第3版,冷泉港实验室出版社(Cold SpringHarbor Laboratory Press),纽约普莱恩维尤)和Haymes等.(1985)收录于:《核酸杂交,实用方法》(Nucleic Acid Hybridization,a Practical Approach),华盛顿特区的IRL出版社(IRL Press)。
在核酸或多肽插入细胞上下文中引入的术语包括转染或转化或转导,且在一个实施方案中包括将核酸并入真核或原核细胞,其中核酸可并入细胞基因组(如染色体、质粒、质体或线粒体DNA),以及转变成自主复制子或瞬时表达(如转染mRNA)。如下所讨论,向植物引入蛋白能采用诸如实施例8所述那些方法。
多种转化/转染方法可用。由于有更新的方法可用于转化农作物或其他宿主细胞,其可直接应用。因此,开发了多种方法向宿主细胞基因组插入DNA序列,以获得转录或转录本或序列翻译,从而实现生物体的表型改变。由此,可采用提供有效转化/转染的任意方法。使用本文方法,再生效率提高。
例如但不限于,本领域技术人员可获得的向植物组织引入表达载体的方法可变且取决于所选植物。熟知转化多种植物物种的程序并在文献中描述(参见例如Miki和McHugh(2004)Biotechnol.107,193-232;Klein等.(1992)Biotechnology(N Y)10,286-291;和Weising等.(1988)Annu.Rev.Genet.22,421-477)。例如,DNA构建体可引入植物细胞基因组DNA,使用技术如微弹介导的递送(Klein等.1992,同上)、电穿孔(Fromm等,1985Proc.Natl.Acad.Sci.USA 82,5824-5828)、聚乙二醇(PEG)沉淀(Mathur和Koncz,1998Methods Mol.Biol.82,267-276)、直接基因转移(WO 85/01856和EP-A-275 069)、体外原生质体转化(美国专利号4,684,611)和微注射植物细胞原生质体或胚性愈伤组织(Crossway,A.(1985)Mol.Gen.Genet.202,179-185)。Ishida等.(1996)的且还描述于美国专利号5,591,616的农杆菌转化方法是另一选择。植物组织与根癌农杆菌(Agrobacteriumtumefaciens)共培养是一种变化形式,其中DNA构建体置于双元载体系统内(Ishida等,1996Nat.Biotechnol.14,745-750)。根癌农杆菌的毒性作用会在细胞受细菌感染时指导构建体插入植物细胞DNA。参见例如Fraley等.(1983)Proc.Natl.Acad.Sci.USA,80,4803-4807。农杆菌主要用于双子叶植物,但包括玉米在内的单子叶植物能由农杆菌转化。参见例如美国专利号5,550,318。在此方法的许多变化之一中,农杆菌感染玉米能与热激幼胚(Wilson等.美国专利号6,420,630)或抗生素选择II型愈伤组织(Wilson等,美国专利号6,919,494)一起使用。
水稻转化描述于Hiei等.(1994)Plant J.6,271-282和Lee等.(1991)Proc.Nat.Acad.Sci.USA 88,6389-6393。油菜籽转化的标准方法描述于Moloney等.(1989)Plant Cell Reports 8,238-242。玉米转化描述于Fromm等.(1990)Biotechnology(N Y)8,833-839和Gordon-Kamm等.(1990)同上。小麦能通过与转化玉米或水稻所用类似的技术来转化。高粱转化描述于Casas et al.(Casas等.(1993)《经微粒轰击的转基因高粱植物》(Transgenic sorghum plants via microprojectile bombardment).Proc.Natl.Acad.Sci.USA 90,11212-11216)且大麦转化描述于Wan和Lemaux(Wan和Lemaux(1994)《产生大量独立转化的可育大麦植株》(Generation of large numbers ofindependently transformed fertile barley plants).Plant Physiol.104,37-48)。一些出版物描述了大豆转化,包括美国专利号5,015,580。在某些实施方案中,所述调节序列包括诱导型启动子。在其他实施方案中,所述GRF、GIF或GRF-GIF嵌合体的活性受诱导型系统调控。诱导型系统可以是或可包括与GRF、GIF或GRF-GIF蛋白融合的糖皮质激素受体。还可使用其他诱导型系统。
在本文所述某些实施方案中,所述一个或多个转化植物细胞可在含次优浓度外源细胞分裂素的再生培养基中培养。术语“未达最优的浓度”定义为任意浓度,其低到无法允许未转化有GRF-GIF嵌合体或GRF和/或GIF转基因的植物细胞适当再生。例如,未达最优的浓度可以是小于约50%,小于约10%,小于约5%,小于约1%或小于0.01%的植物再生常用细胞分裂素浓度。可测试细胞分裂素浓度以确定未达最优的浓度,其允许用GRF-GIF嵌合体转化的植物细胞再生,但不足以诱导非转基因植物再生。这能用作正相选择法以鉴定转基因芽,而不需使用抗生素标志物。
在某些实施方案中,本文所公开的方法提高再生效率。再生效率指能再生的植物细胞数目。该方法使得效率相较于未引入GRF/GIF多肽或核苷酸的植物的再生增加。再生效率可增加1%、2%、3%、4%、5%、10%、15%、20%、25%、30%、40%、45%、50%、55%、60%、65%、70%、80%、90%或更多或两者之间。例如但不限于,未用GRF/GIF情况下再生效率为10%的植物能增加至50–70%。效率为1%的植物能增加至10–20%且再生效率为零的植物能增加至1-5%。
在一些实施方案中,所述用于本文公开方法的一种或多种核酸分子除了编码GRF和GIF,还编码至少一种额外感兴趣多核苷酸。
本文所公开方法可用于多种植物。在某些实施方案中,所述方法可用于再生效率低的植物。在某些实施方案中,所述植物是单子叶植物物种。在某些其他实施方案中,所述植物是双子叶植物物种。在某些其他实施方案中,所述植物既不是单子叶,也不是双子叶植物物种。示范性物种包括但不限于玉米(玉蜀黍(Zea mays))、油菜(欧洲油菜(Brassicanapus)、芜菁(Brassica rapa ssp.))、苜蓿(紫花苜蓿(Medicago sativa))、水稻(水稻(Oryza sativa))、黑麦(黑麦(Secale cereale))、高粱(高粱(Sorghum bicolor)、Sorghumvulgare)、向日葵(向日葵(Helianthus annuus))、小麦(小麦(Triticum aestivum))、黑小麦((×小黑麦(Triticosecale),小麦和黑麦的杂交)、黑小麦(Triticale)(×小黑麦(Triticosecale)、小麦和黑麦的杂交种)、大豆(大豆(Glycine max))、烟草(烟草(Nicotiana tabacum))、土豆(马铃薯(Solanum tuberosum))、花生(花生(Arachishypogaea))、棉花(陆地棉(Gossypium hirsutum))、甘薯(金叶甘薯(Ipomoea batatus))、木薯(木薯(Manihot esculenta))、咖啡(Cofea spp.)、椰子(椰子(Cocos nucifera))、菠萝(菠萝(Ananas comosus))、柑橘树(柑橘(Citrus spp.))、可可(可可(Theobromacacao))、茶(野茶树(Camellia sinensis))、香蕉(芭蕉类(Musa spp.))、鳄梨(牛油果(Persea americana))、无花果(无花果(Ficus casica))、番石榴(番石榴(Psidiumguajava))、芒果(芒果(Mangifera indica))、橄榄(油橄榄(Olea europaea))、番木瓜(木瓜(Carica papaya))、腰果(腰果(Anacardium occidentale))、澳洲坚果(澳洲坚果(Macadamia integrifolia))、杏仁(巴旦杏(Prunus amygdalus))、甜菜(甜菜(Betavulgaris))、燕麦(燕麦(Avena))、大麦(大麦(Hordeum))、蔬菜、观赏植物和松柏类。蔬菜包括番茄(番茄(Lycopersicon esculentum))、甜椒(番椒(Capsicum annuum))、生菜(如莴苣(Lactuca sativa))、青豆(菜豆(Phaseolus vulgaris))、利马豆(莱豆(Phaseoluslimensis))、豌豆(山黧豆属(Lathyrus spp.))和黄瓜属成员如黄瓜(黄瓜(C.sativus))、哈密瓜(罗马甜瓜(C.cantalupensis))及甜瓜(甜瓜(C.melo))。观赏植物包括杜鹃花(杜鹃花(Rhododendron spp.))、绣球花(绣球花(Macrophylla hydrangea))、木槿(芙蓉(Hibiscus rosasanensis))、玫瑰(玫瑰(Rosa spp.))、郁金香(郁金香(Tulipa spp.))、水仙花(水仙(Narcissus spp.))、矮牵牛花(矮牵牛(Petunia hybrida))、康乃馨(康乃馨(Dianthus caryophyllus))、一品红(一品红(Euphorbia pulcherrima))和菊花。可用于本发明的松柏类包括例如松树如火炬松(火炬松(Pinus taeda))、湿地松(湿地松(Pinuselliotii))、杰克松(西黄松(Pinus ponderosa))、美国黑松(扭叶松(Pinus contotta))和辐射松(辐射松(Pinus radiata));花旗松(黄杉(Pseudotsuga menziesii));西部铁杉(加拿大铁杉(Tsuga canadensis));西加云杉(白云杉(Picea glauca));红木树(加州红木(Sequoia sempervirens));冷杉如银杉(胶冷杉(Abies amabilis))和胶枞(香脂冷杉(Abies balsamea));和香柏如西红杉(北美乔柏(Thuja plicata))和阿拉斯加柏木(黄扁柏(Chamaecyparis nootkatensis))。
优选实施方案中的植物可选自稻属(Oryza)、棉属(Gossypium)、甘氨酸、葡萄属(Vitis)、苜蓿属(Medicago)、胡桃属(Juglans)、柑橘属(Citrus)、辣椒属(Capsicum)、高粱属(Sorghum)、玉蜀黍属(Zea)、大麦属(Hordeum)或小麦属(Triticum)或其他合适植物物种。在某些实施方案中,所述植物是圆锥小麦(Triticum turgidum)或小麦(Triticumaestivum)。
本公开还提供通过本文所述任意方法产生的植物。
本公开提供调整植物中GRF和GIF基因表达的方法,以改进再生效率和允许在外源细胞分裂素缺乏情况下的再生。这些方法可如下实施:提高植物、植物细胞或原生质体中GRF和/或GIF或GRF-GIF嵌合体的蛋白水平和活性。
在一些实施方案中,所述植物或植物细胞中的GRF和GIF基因可通过多种方式增加,包括使用各自仅有GRF基因或GIF基因的载体,或有GRF-GIF嵌合体的载体。GRF基因可突变使得其对miR396遏制不太敏感。编码生长调节因子蛋白(GRF)和/或GRF相互作用因子蛋白(GIF)的一种或多种核酸分子可操作连接植物细胞内可操纵的调节序列。载体中的基因可由多种启动子控制,包括诱导型、组织特异性和组成型启动子。植物或植物细胞可通过多种方式转化,包括使用携带有GRF和/或GIF基因的载体的农杆菌或轰击。
基因编辑可与本发明联用。例如,有GRF和/或GIF或GRF-GIF嵌合体及Cas9的载体可用于植物或植物细胞的基因编辑。植物或植物细胞能转化以瞬时或稳定表达核酸。GRF和/或GIF或GRF-GIF嵌合体蛋白可递送到植物或植物细胞以增加再生效率,采用其他研究者所述递送CRISPR/Cas9预组装核糖核蛋白复合体(RNP)的方法(Woo等,2015;Subburaj等,2016;Malnoy等,2016;Kim等,2017;Liang等,2017;Svitashev等,2016;Wolter等,2017)。
使感兴趣的分子(MOI)靶向靶基因靶位点的方法可用于本方法。以下通过举例方式提供,而不是限制。引导核酸分子可指导核酸酶到基因组中特定切割位点,无论是否通过使用结合域、识别域、引导RNA或其他机制。将引导核酸分子引入细胞,所处条件适合操作引导核酸分子将切割导向靶基因座。本领域技术人员可获得一些能使用的方法,最常用核酸酶切割基因靶区域,以及在靶基因座识别序列和指导切割到基因座的序列。能切割多核苷酸链中磷酸二酯键的任何核酸酶可用于本文所述方法。例如但不限于,可用的系统包括采用特异核酸酶(SSN)的那些,如ZFN(锌指核酸酶)Whyte,等.《细胞生物学研讨会:产生定制设计修饰的锌指核酸酶》(Cell Biology Symposium:Zinc finger nucleases to createcustom-designed modifications).J Anim Sci 90,1111-1117(2012));TALEN(转录激活因子样效应物核酸酶)(参见Carlson,D.F.等.《动物中的有效TALEN介导基因敲除》(Efficient TALEN-mediated gene knockout in animals).Proc Natl Acad Sci US A109,17382-17387(2012);Tan,W.等.《家畜中用定制核酸内切酶的有效非减数分裂等位基因基因渗入》(Efficient nonmeiotic allele introgression in livestock usingcustom endonucleases).Proc Natl Acad Sci U S A 110,16526-16531(2013);和CRISPR(规律成簇的间隔短回文重复)–相关(Cas)核酸酶系统(Hai,T.,Teng,F.,Guo,R.,Li,W.&Zhou,Q.《通过受精卵注射CRISPR/Cas系统一步式产生基因敲除猪》(One-step generationof knockout pigs by zygote injection of CRISPR/Cas system)Cell Res 24,372-375(2014))允许动物基因组编辑。重组酶如美国专利号6,720,475所述FLP/FRT或美国专利号5,658,772所述CRE/LOX,能用于整合多核苷酸序列到特定染色体位点。大范围核酸酶用于将供体多核苷酸靶入特定染色体位置,如描述于Puchta等,PNAS USA 93(1996)第5055-5060页。ZFN与结合结合域的蛋白或蛋白结构域一起运作,所述结合域由于使用锌离子而具有稳定结构。TALEN利用有重复氨基酸的结构域,其能特异性识别DNA序列中的碱基对。关于2种系统的讨论,参见Voytas等.美国专利号8,697,853,其通过引用全文纳入本文。这些系统采用就各靶序列制备的酶。
提到靶基因或分子,意在指基因组内任何核酸分子,其需要如所述修饰或在该处需要删除或插入核酸分子或以某些方式修饰分子。靶分子是核酸序列时,靶分子可以是例如编码基因产物(如蛋白)的序列或非编码序列(如调节多核苷酸或垃圾DNA)。
本文所用术语“再生”指从另一植物或植物细胞产生一种植物或植物细胞。这包括从产生自植物胚或初生叶的愈伤组织生成芽。在一些实施方案中,所述植物、植物细胞或原生质体可再生自经转化从而GRF和/或GIF或GRF-GIF嵌合体表达增加的植物、植物细胞或原生质体。在一些实施方案中,所述植物、植物细胞或原生质体可用至少一种额外多肽转化,该植物、植物细胞或原生质体再生自经转化从而GRF和/或GIF或GRF-GIF嵌合体表达增加的植物、植物细胞或原生质体。
在多个实施方案中,所述植物或植物细胞源自多个植物物种,包括抗转化、再生效率低的那些,和小麦属、葡萄属、稻属、棉属、柑橘属、辣椒属、高粱属和胡桃属中的那些。
本公开还提供在没有外源细胞分裂素情况下再生植物或植物细胞的方法。在某些实施方案中,所述方法包括向植物细胞引入GRF基因和/或GIF基因或GRF-GIF嵌合体并在基本没有外源细胞分裂素的培养基中培养细胞以再生植物的方法。GRF和GIF基因可以是普通小麦、圆锥小麦硬粒小麦(Triticum turgidum ssp.durum)或另一植物物种,包括再生效率低或抗转化的那些。
在多个实施方案中,所述植物可转化,使用各自仅有GRF基因或GIF基因的载体,有作为嵌合体的GRF-GIF的载体,或有非嵌合形式GRF-GIF的载体。
在某些实施方案中,所述植物难以或被视作难以再生。
核酸分子或多肽可与其他组分一起引入植物。如本文所用,核苷酸区段在以功能性方式与另一DNA区段放置时被称作可操作连接。例如,用于信号序列的DNA可操作连接多肽编码DNA,此时其表达为参与多肽分泌的前蛋白;启动子或增强子可操作连接编码序列,此时其刺激序列转录。可操作连接的元件可以是连续或非连续。当指连接2个蛋白编码区时,可操作连接指编码区处于同一阅读框。或者,额外的基因能在多个表达盒提供。提供这类表达盒,带有多个限制性位点和/或重组位点用于插入多个核苷酸以在调控区转录调节下。除了启动子,表达盒还能包括一个或多个增强子。增强子指增加启动子使用的顺式作用序列。这类增强子可以对基因而言是天然的或来自异源基因。此外,应认识到一些启动子能包含一个或多个增强子或增强子样元件。一个这种增强子示例是35S增强子,其可以是单一增强子或重复的。参见例如McPherson等,美国专利5,322,938。
术语启动子、启动子区或启动子序列一般指基因的转录调节区,其可在编码区5'或3'侧发现,或在编码区内,或在内含子内。通常,启动子是DNA调节区,其能够结合细胞中的RNA聚合酶并起始下游(3'方向)编码序列转录。典型5'启动子序列通过转录起始位点在其3'末端结合并延伸上游(5'方向)以包含在高于背景的可检测水平上起始转录所需的最小碱基或元件数目。启动子序列内包括转录起始位点(通过核酸酶S1作图来方便地定义)以及负责RNA聚合酶结合的蛋白结合域(共有序列)。
时空调控通常也对驱动植物中基因表达重要。例如,可选择和可打分的标志物必须在合适时间及合适组织中表达以允许筛选和控制酶,调节因子必须分别在代谢活性和生理响应性组织中生成。类似地,赋予宿主保护的基因必须在用于病原体或害虫的靶组织中表达,植物生成的蛋白产物应在适合蛋白积聚和存储的组织中表达。此外,由于某些蛋白产物在对生存及生长必需的代谢活性组织中表达时可能对植物健康和收率产生有害影响,在选定的植物存储组织中有活性,但在其他、非存储组织中显示低活性或没有活性的启动子可能是有利的。
在所述方法中,能采用很多指导植物中核酸分子表达的启动子。这类启动子可以选自组成型、化学调节、诱导型、组织特异性和种子优选启动子。例如,组成型启动子包括核心CaMV 35S启动子(Odell等.(1985)Nature 313:810-812);水稻肌动蛋白(McElroy等.(1990)Plant Cell 2:163-171);泛素(欧洲专利申请号0 342 926;Christensen等.(1989)Plant Mol.Biol.12:619-632和Christensen等.(1992)Plant Mol.Biol.18:675-689);pEMU(Last等.(1991)Theor.Appl.Genet.81:581-588);MAS(Velten等.(1984)EMBO J.3:2723-2730),Rsyn7启动子的核心启动子和公开于WO 99/43838和美国专利号6,072,050的其他组成型启动子等。
本领域技术人员应理解启动子序列能修饰成提供一定范围的表达水平和可操作连接的异源核酸分子。可使用不到整个的启动子区且驱动表达的能力保留。然而,认识到mRNA表达水平能在缺失部分启动子序列下减少。因此,启动子能修饰成弱或强启动子。一般,“弱启动子”意在指以低水平驱动编码序列表达的启动子。“低水平”意在指约1/10,000转录本到约1/100,000转录物到约1/500,000转录物的水平。相反,强启动子以高水平或约1/10转录物到约1/100转录物到约1/1,000转录物的水平,驱动编码序列表达。
组织偏爱启动子能用于靶向特定植物组织内的增强转录和/或表达。当涉及优选表达时,意味着特定植物组织内的表达水平高于其他植物组织。这些启动子类型示例包括种子优选表达,如菜豆素启动子(Bustos等.(1989)The Plant Cell卷1,839-853)和玉米球蛋白-1基因Belanger等.(1991)Genetics 129:863-972提供的那些。
可用植物相容性启动子的范围包括诱导型启动子。任何诱导型启动子能用于本发明的方法。参见Ward等.Plant Mol.Biol.22:361-366(1993)。示范性诱导型启动子包括蜕皮激素受体启动子,美国专利号No.6,504,082;来自ACE1系统的启动子,其响应铜(Mett等.PNAS 90:4567-4571(1993));来自玉米的In2-1和In2-2基因,其响应苯磺酰胺除草剂安全剂(美国专利号5,364,780;Hershey等,Mol.Gen.Genetics 227:229-237(1991)和Gatz等,Mol.Gen.Genetics 243:32-38(1994))来自Tn10的Tet阻遏物(Gatz等,Mol.Gen.Genet.227:229-237(1991);或来自甾类激素基因,其转录活性由糖皮质类固醇激素诱导。Schena等,Proc.Natl.Acad.Sci.U.S.A.88:10421(1991);玉米GST启动子,其由用作萌前除草剂的疏水亲电化合物激活;和烟草PR-1a启动子,其水杨酸激活。其他感兴趣的化学调节启动子包括类固醇响应启动子(参见例如糖皮质激素诱导型启动子,Schena等.(1991)Proc.Natl.Acad.Sci.USA88:10421-10425和McNellis等.(1998)Plant J.14(2):247-257)以及四环素诱导型和四环素抑制型启动子(参见例如Gatz等.(1991)Mol.Gen.Genet.227:229-237和美国专利号5,814,618及5,789,156)。
冷响应调节元件或热激调节元件,其转录能分别响应暴露于冷或热而实现(Takahashi等,Plant Physiol.99:383-390,1992);可由厌氧条件诱导的乙醇脱氢酶基因的启动子(Gerlach等,PNAS USA 79:2981-2985(1982);Walker等,PNAS 84(19):6624-6628(1987));和光诱导型启动子,衍生自pea rbcS基因或豌豆psaDb基因(Yamamoto等.(1997)Plant J.12(2):255-265);光诱导型调节元件(Feinbaum等,Mol.Gen.Genet.226:449,1991;Lam和Chua,Science 248:471,1990;Matsuoka等.(1993)Proc.Natl.Acad.Sci.USA90(20):9586-9590;Orozco等.(1993)Plant Mol.Bio.23(6):1129-1138),植物激素诱导型调节元件(Yamaguchi-Shinozaki等,Plant Mol.Biol.15:905,1990;Kares等,PlantMol.Biol.15:225,1990)等。诱导型调节元件还可以是玉米In2-1或In2-2基因的启动子,其响应苯磺酰胺除草剂安全剂(Hershey等,Mol.Gen.Gente.227:229-237,1991;Gatz等,Mol.Gen.Genet.243:32-38,1994)和转座子Tn10的Tet阻遏物(Gatz等,Mol.Gen.Genet.227:229-237,1991)。应激诱导型启动子包括盐/水应激诱导型启动子如P5CS(Zang等.(1997)Plant Sciences 129:81-89);冷诱导型启动子如cor15a(Hajela等.(1990)Plant Physiol.93:1246-1252),cor15b(Wilhelm等.(1993)Plant Mol Biol 23:1073-1077),wsc120(Ouellet等.(1998)FEBS Lett.423-324-328),ci7(Kirch等.(1997)Plant Mol Biol.33:897-909),ci21A(Schneider等.(1997)Plant Physiol.113:335-45);干旱诱导型启动子如Trg-31(Chaudhary等(1996)Plant Mol.Biol.30:1247-57),rd29(Kasuga等.(1999)Nature Biotechnology 18:287-291);渗透诱导型启动子如Rab17(Vilardell等.(1991)Plant Mol.Biol.17:985-93)和渗调蛋白(Raghothama等.(1993)Plant Mol Biol 23:1117-28);和热诱导型启动子如热激蛋白(Barros等.(1992)PlantMol.19:665-75;Marrs等.(1993)Dev.Genet.14:27-41),smHSP(Waters等.(1996)J.Experimental Botany 47:325-338)及来自欧芹泛素启动子的热激诱导型元件(WO 03/102198)。其他应激诱导型启动子包括rip2(美国专利号5,332,808)和rd29a(Yamaguchi-Shinozaki等.(1993)Mol.Gen.Genet.236:331-340)。某些启动子可通过创伤诱导,包括农杆菌pmas启动子(Guevara-Garcia等.(1993)Plant J.4(3):495-505)和农杆菌ORF13启动子(Hansen等,(1997)Mol.Gen.Genet.254(3):337-343)。
另外,可采用促进植物细胞鉴定的标志物,所述细胞含有编码标志物的多核苷酸。可打分或可筛选标志物有用,其中序列的存在生成了可测量产物且能生成植物细胞未破坏的产物。示例包括β-葡萄糖醛酸酶或uidA基因(GUS),其所编码酶的多个显色底物已知(例如美国专利5,268,463和5,599,670);氯霉素乙酰转移酶(Jefferson等.(1987)The EMBOJournal vol.6No.13pp.3901-3907);碱性磷酸酶。其他可筛选标志物一般包括花青素/黄酮类基因(讨论参见Taylor和Briggs,(1990)The Plant Cell 2:115-127),包括例如R-基因座基因,其编码的产物可调节植物组织中生成花色苷色素(红色)(Dellaporta等,《染色 体结构和功能》(Chromosome Structure and Function),克吕韦尔学术出版集团(KluwerAcademic Publishers),Appels和Gustafson编,第263-282页(1988));控制黄酮类色素生物合成的基因,如玉米C1基因(Kao等,(1996)Plant Cell 8:1171-1179;Scheffler等.(1994)Mol.Gen.Genet.242:40-48)和玉米C2(Wienand等,(1986)Mol.Gen.Genet.203:202-207);B基因(Chandler等,(1989)Plant Cell 1:1175-1183),p1基因(Grotewold等,(1991Proc.Natl.Acad.Sci USA)88:4587-4591;Grotewold等,(1994)Cell 76:543-553;Sidorenko等,(1999)Plant Mol.Biol.39:11-19);bronze基因座基因(Ralston等,(1988)Genetics 119:185-197;Nash等,(1990)Plant Cell 2(11):1039-1049)等。如本文所讨论,本发明使用细胞因子水平低于植物细胞再生所需的培养基时,可利用GRF或GIF或GRF-GIF嵌合体,而没有额外可选择标志物。
需要可纳入载体的多种其他信号序列和多聚腺苷酸化序列。
实施例
下列实施例仅用于说明。根据本公开,本领域技术人员会认识到这些实施例和所公开主题的其他实施方案的变化可行,而不需过度实验。提及本文实验时,参见图14,显示选定小麦GRF基因GRFGRF1-5的预测蛋白(SEQ ID NO:9–13)。小麦基因名基于ChineseSpring RefSeq v1.0。GRF数字是基于水稻直系同源物。仅提供小麦A基因组同源物的序列(B和D基因组超过90%相同)。最接近小麦、水稻和拟南芥同源物的编码蛋白序列如SEQ IDNO:9–22所示,保守QLQ和WRC结构域分别以黄色及绿色突出显示。5个水稻GRF直系同源物的预测蛋白如图15所示(SEQ ID NO:13–18)。最接近拟南芥GRF3-6的预测蛋白如图16A所示(SEQ ID NO:19–22)。嵌合体所用最接近葡萄GRF的预测蛋白如图13B所示(SEQ ID NO:23)。
图17A显示拟南芥GIF1和GIF2的预测蛋白(SEQ ID NO:24–26)且17B显示普通小麦GIF1、GIF2和GIF3的预测蛋白(SEQ ID NO:27–29)。小麦基因名基于Chinese SpringRefSeq v1.0。GIF数字基于水稻直系同源物。仅提供小麦A基因组同源物的序列(B和D基因组超过90%相同)。最接近的拟南芥和小麦同源物的编码蛋白序列如下所示,保守SNH结构域以黄色突出显示。葡萄GIF如图17C所示(SEQ ID NO:30)。
实施例1–用GRF-GIF嵌合体转化小麦
为了尝试增加植物生物量以促进农业重要作物的谷物产量增加,我们测试了GRF、GIF和GRF-GIF嵌合体对小麦生长的影响。生成在玉米UBIQUITIN启动子下表达小麦GRF-GIF嵌合体的转基因植物。通过分子系统发育分析用邻接法鉴定了小麦基因组内10个GRF和3个GIF(图1)。为生成小麦GRF-GIF嵌合体,我们选择了水稻OsGRF4的小麦GRF同源物(图1A),其在水稻中促进谷粒和植物生长(Duan P等,2015;Hu J等,2015;Che R等,2015;Sun P等2016;Li,S.等.2018)。对于GIF伴侣,我们选择了拟南芥GIF1的最接近同源物(图1B),因为此进化枝成员显示在拟南芥、水稻和玉米中控制生长(Kim和Kende 2004,Horiguchi等,2005;Shimano S等,2018;Zhang D等,2018)。SNH保守结构域的氨基酸序列用于进行图1B的系统发育分析。拟南芥GIF1的小麦最接近同源物以粗体突出显示。图1C中是嵌合构建体的示意图。来自拟南芥、水稻和短柄草(Brachypodium)的GRF及GIF基因的氨基酸序列获自phytozome(https://phytozome.jgi.doe.gov/pz/portal.html)。GRF及GIF基因的氨基酸序列获自小麦基因组RefSeq v1.0。系统发育分析用MEGA6进行。小麦嵌合GRF4-GIF1的编码序列如图11A所提供且蛋白序列如图11B所提供。用于转化的构建体pLC41-Ubi::GRF4-GIF1序列如图12所提供。
随后我们通过农杆菌介导的转化将小麦GRF4-GIF1嵌合体转化入四倍体小麦Kronos(圆锥小麦硬粒小麦亚种(Triticum turgidum ssp.durum))。用携带GRF4-GIF1双元载体(构建体的示意图如图1C所示)的农杆菌感染的植物细胞再生出数量惊人的芽,显著高于用具有相同pLC41骨架的载体的对照平行转化所得芽数目(图2)。GRF和GIF基因先前未与转基因植物中再生效率增加相关联。
接着我们在2个用GRF4-GIF1嵌合体的独立小麦转化实验和2个没有嵌合体的实验中分析愈伤组织块再生芽的频率。统计分析证明转化有GRF4-GIF1嵌合体的愈伤组织(92.8%±3.2%)中的愈伤组织块再生芽平均数比用无GRF4-GIF1嵌合体的构建体转化的愈伤组织(6.0%±2%)高15倍(P=0.0019)。最后的数字代表Kronos小麦转化,其中通常从含有25个愈伤组织的平板仅回收1或2个芽(图2,两侧)。对于小麦GRF4-GIF1嵌合体(图2,中间),我们每个含有25个愈伤组织平板回收了约50个芽,转基因小麦植物的再生效率显著增加。
我们将26株独立T0植物转移至土壤以验证GRF4-GIF1嵌合体的存在。PCR扩增来自26株独立T0植物的基因组DNA,使用引物Fw-GRF4b(5’-CCTCCGACTCCAAGTATTGC-3’)(SEQ IDNO:104)和Rev-GIF1c(5’-ATCATCAGGTTGGACGGGTA-3’)(SEQ ID NO:105),证实了26株T0植物中的23株具有插入基因组DNA的GRF4-GIF1转基因(图3)。转基因植物可育且显示正常表型。不受特定理论约束,这可能是因为内源miR396控制过表达GRF4-GIF1嵌合体在植物发育后期的水平。
实施例2–仅用GRF4或GIF1转化小麦
小麦转化实验仅用GRF4(SEQ ID NO:1)或仅用GIF1(SEQ ID NO:2)在同一pLC41载体中进行,如就GRF4-GIF1嵌合体所述。也观察到相对于对照的再生效率显著增加(图4和5)。再生效率的增加低于用嵌合GRF4-GIF1转化实验中观察到的。所有再生植物通过PCR确认为转基因。我们还通过T1植物的qRT-PCR证实了所有再生植物过表达相应的转基因。
实施例3–仅用GRF4或GIF1转化其他植物物种或组织
在小麦中,GRF4-GIF1嵌合体使得再生效率显著高于任一单独的2个基因。然而,在其他植物物种或组织转化中,可能这2个基因中的仅一个足以提高再生效率到与用嵌合体转化植物类似的水平。例如,若在特定植物中这些基因中的一个以足够高水平表达,用另一个基因单独转化可能足以在该植物中达到显著更高的再生效率。
实施例4–从叶外植体的再生
生成5个独立GFR4-GIF1 T0小麦品系与1个DsRed T0对照品系的克隆并培养维持。非常幼嫩的叶片在立体显微镜下无菌解剖,移除外部的1-2轮叶片。移出内部叶片的基部(base of the inner lamina)并置于含有生长素的愈伤组织诱导培养基的平板以诱导愈伤组织形成。14天后,组织传代培养到新鲜愈伤组织诱导培养基。额外21天后,愈伤组织转移至含有细胞分裂素的芽再生培养基。28天后,对再生芽的愈伤组织数目进行打分。
用GRF4-GIF1嵌合体转化的叶外植体显示再生效率为55%,比用无GRF4-GIF1嵌合体的构建体转化的叶外植体再生效率高2.8倍(图6)。
实施例5–在未达最优水平的外源细胞分裂素中的植物再生
在常规转化中,首先用农杆菌接种幼胚,随后将其放置于含有不同激素的不同培养基以再生转基因植物。最初,含有生长素的培养基诱导愈伤组织形成。接着,愈伤组织转至含有细胞分裂素的培养基,其促进芽发育(图7A)。有趣的是,观察到用带有小麦GRF4-GIF1嵌合体(SEQ ID NO:5)的农杆菌接种的胚能够在生长素培养基中再生绿芽而不需暴露于细胞分裂素(图7B)。不受任何特定理论约束,此结果的一种可能解释是通过GRF4-GIF1嵌合体促进内源细胞分裂素生成。此观察结果表明GRF4-GIF1嵌合体能够促进胚发生和芽再生,而不需加入外源细胞分裂素,这能用作鉴定转基因芽的正选择法,而不使用基于抗生素的标志物。
实施例6–构建miR396抗性GRF
图6的结果显示GRF4-GIF1嵌合体表达增加可改善从小麦叶外植体再生的效率,这指示此方法用于不同植物物种和组织的潜能。然而,在一些植物物种或一些物种的特定组织中,高水平miR396能切割GRF RNA且可能限制GRF蛋白或GRF-GIF嵌合体高水平表达。
为避免此潜在问题,我们生成了修饰的GRF基因(单独和在GRF-GIF嵌合体中,SEQID NO:34),其在miR396靶位点中具有沉默突变,使得其对miR396阻遏较不敏感(图8)。拟南芥GRF3(SEQ ID NO:19)或水稻GRF4(SEQ ID NO:17)中的miR396结合位点中的突变显示引起GRF活性更高且器官尺寸增加(Debernardi等,2014;Beltramino等,2018;Duan等,2015;Hu等,2015;Che等,2015;Sun等.2016;Li等.2018)。
实施例7–用诱导型系统调节表达
在小麦中,我们未观察到GRF4-GIF1嵌合体对植物表型的有害作用,可能是因为内源miR396的存在足以切割转基因表达产生的任意过量GRF4。然而,可能在其他种类中,GRF-GIF嵌合体的存在能够以不需要的方式影响表型。这些影响在用GRF变体时可能更显著,所述变体带有miR396靶位点突变,使得其对miR396阻遏不太敏感。
用诱导型系统控制此嵌合体在植物中表达和/或活性的能力能够解决此问题。Debernardi和Palatnik(专利申请WO 2016/098027 Al)证明了类似GRF-GIF嵌合体在拟南芥中的活性可通过在GRF-GIF中部克隆大鼠糖皮质激素受体(GR)而外源控制。新的GRF-GR-GIF嵌合体仅在合成激素地塞米松存在下激活。使用此策略,生成GRF-GR-GIF嵌合体用于葡萄(SEQ ID NO:31)和小麦(SEQ ID NO:32及SEQ ID NO:33),其能仅在转化培养基中选择性诱导,且不会后续在经转化的T0植物或后代中有活性,防止对植物发育的潜在影响。
另外,诱导型系统允许使用过度活跃形式的GRF4-GIF1嵌合体,而限制对植物发育的潜在影响。例如,在GRF的miR396靶位点纳入同义突变会消除miR396介导的嵌合体转录后阻遏,增强其表达且可能甚至其在miR396所表达组织(如叶)中的活性。通过诱导型系统,GRF4-GIF1嵌合体能仅在再生期间过表达。另一种选择是应用组织特异性启动子。
实施例8–采用蛋白递送的方法
能递送蛋白到植物细胞。靶向诱变在许多植物物种中通过递送CRISPR/Cas9预组装核糖核蛋白复合物(RNP)获得(Woo等,2015;Subburaj等,2016;Malnoy等,2016;Kim等,2017;Liang等,2017;Svitashev等,2016;Wolter等,2017)。前述方法能通过向植物细胞递送GRF、GIF或GRF-GIF嵌合蛋白来进行。例如,GRF、GIF或GRF-GIF嵌合体能与CRISPR/Cas9RNP一起递送,以增强经编辑植物细胞的再生。蛋白递送后,植物组织可随后在再生培养基中培养。
实施例9–用额外载体的转化
植物用GRF、GIF或GRF-GIF嵌合体转化并获得高再生能力后,来自此植物或其后代的细胞能进一步用于瞬时或稳定转化,采用GRF、GIF或GRF-GIF嵌合体以外的载体。植物仍具有高再生能力。植物细胞随后能在细胞分裂素水平正常或降低的再生培养基中培养。
实施例10–用其他GRF和/或GIF序列的转化
前述方法能用其他GRF基因或相关核苷酸序列进行,例如其中所编码的GRF蛋白的QLQ和WRC结构域的氨基酸序列与小麦蛋白TaGRFGRF1(SEQ ID NO:9)、TaGRF2GRF2(SEQ IDNO:10)、TaGRF3(SEQ ID NO:11)、TaGRF4(SEQ ID NO:12)或TaGRF5(SEQ ID NO:13)或者拟南芥蛋白AtGRF3(SEQ ID NO:19)、AtGRF4(SEQ ID NO:20)、AtGRF5(SEQ ID NO:21)或AtGRF6(SEQ ID NO:22)的QLQ和WRC结构域的序列至少约70%、约80%、约90%或约95%相同。
所述方法还可用GRF基因实施,其中QLQ和WRC结构域的编码氨基酸序列与前面段落所提供的序列至少约80%相似、约90%相似或约95%相似。
相同性和相似性百分比用本公开所示蛋白序列中的连接的(concatenated)QLQ(下面带下划线)和WRC结构域(下面加粗显示)通过BLASTP计算。连接的QLQ-WRC结构域(SEQID NO:96)示例如下所示是小麦TaGRF4的(完整蛋白序列是SEQ ID NO:12)。
>TaGRF4 TraesCS6A01G269600
Figure BDA0003535299770000181
5个小麦蛋白中连接的QLQ-WRC结构域的成对比较如表1所示。所有成对比较大于70%相同和80%相似,确认其相似性和潜在重叠功能。
我们将对来自小麦GRF蛋白的QLQ-WRC结构域与直系同源水稻(水稻)蛋白的对应结构域之间的比较作为这些蛋白保守的另一示例。所有成对比较显示>70%相同和>80%相似(表2)。这些标准排除亲缘关系远(distantly related)的GRF,如OsGRFGRF10(LOC_Os02g45570.1)、OsGRFGRF11(LOC_Os07g28430.1)和OsGRFGRF12(LOC_Os04g48510.1)。
作为双子叶植物物种的参考,比较来自最接近的拟南芥GRF蛋白序列AtGRF3(SEQID NO:19)、AtGRF4(SEQ ID NO:20)、AtGRF5(SEQ ID NO:21)和AtGRF6(SEQ ID NO:22)的连接的QLQ-WRC结构域(表3)。表3显示这些拟南芥蛋白与至少一种小麦GRF蛋白至少70%相同或80%相似。在高度分化的双子叶植物与单子叶植物中发现此组蛋白关键结构域之间存在相对高水平的相似性,证明这些蛋白的保守和其功能的潜在重叠。该可能性也由以下观察支持:拟南芥GRF3(SEQ ID NO:19)或水稻GRF4(SEQ ID NO:17)的活性更高,这归因于miR396识别位点的突变增加了这2个亲缘关系远的物种的种子尺寸(Beltramino等,2018;Duan等,2015;Hu等,2015;Che等,2015;Sun等.2016;Li等.2018)。
表1.本专利包括的5个小麦(小麦)旁系同源蛋白(TaGRF)的比较。连接的QLQ-WRC保守结构域的相同性百分比(第一数字)和相似性百分比(第二数字)通过BLASTP获得。
Figure BDA0003535299770000182
Figure BDA0003535299770000191
表2.本专利包括的5个小麦TaGRF蛋白(序列可获自文件结尾处)与最接近的水稻直系同源物(水稻)的比较。连接的QLQ-WRC保守结构域的相同性百分比和相似性百分比通过BLAST P获得。
Figure BDA0003535299770000192
表3.本专利包括的5个小麦TaGRF蛋白与最接近的拟南芥(Arabidopsisthaliana)直系同源物(AtGRF)的比较。连接的QLQ-WRC保守结构域的相同性百分比和相似性百分比通过BLAST P获得。序列可获自文件结尾处。
Figure BDA0003535299770000193
同样,本公开的方法能用其他GIF基因或相关核苷酸序序列进行,如其中所编码的GIF蛋白SNH结构域的氨基酸序列与小麦蛋白TaGIF1(SEQ ID NO:27)、TaGIF2(SEQ ID NO:28)或TaGIF3(SEQ ID NO:29)的SNH结构域的序列或者拟南芥蛋白AtGIF1(SEQ ID NO:24)、AtGIF2(SEQ ID NO:25)或AtGIF3(SEQ ID NO:26)至少约70%相同、约80%、约90%或约95%相同。
所述方法还可用GIF基因进行,其中所编码的SNH结构域的氨基酸序列与前面段落所提供的序列至少约80%相似、约90%相似或约95%相似。
拟南芥GIF1中的突变,也称为ANGUSTIFOLIA3(AN3),导致叶片和花瓣更小、更窄,细胞数目下降,与多个grf突变体中所见表型相似(Kim和Kende 2004;Horiguchi等.2005)。单一gif2和gif3突变体与野生型植物相似,表明GIF1对叶发育的影响超过其他GIF家族成员(Lee等.2009)。然而,双突变体gif1,2or gif1,3和三突变体gif1,2,3的表型更剧烈,指示GIF基因具有冗余功能(Lee等.2009)。另外,所有3个GIF的异位过量表达能回复GIF1突变(Lee等.2009)。
基于上述重叠GIF功能,我们预测小麦中的不同GIF旁系同源物可对转基因植物再生效率提高有类似效果,其单独或作为与不同GRF的蛋白嵌合体。因此,我们将拟南芥蛋白AtGIF1(SEQ ID NO:24)、AtGIF2(SEQ ID NO:25)、AtGIF3(SEQ ID NO:26)以及3个最接近小麦同源蛋白TaGIF1(SEQ ID NO:27)、TaGIF2(SEQ ID NO:28)和TaGIF3(SEQ ID NO:29)纳入此专利,以及显示与保守SNH结构域至少70%的相同性或80%的相似性的任何其他植物蛋白,如在此文件结尾处所述的蛋白序列中突出显示的。3个小麦GIF蛋白的SNH结构域之间的比较显示至少70%的相同性或89%的相似性(表4)。
表4.本专利包括的3个小麦(小麦)旁系同源蛋白(TaGIF)的比较。保守SNH结构域的相同性百分比和相似性百分比通过BLAST P获得。
Figure BDA0003535299770000201
当小麦SNH结构域与来自拟南芥的对应SNH结构域作比较时,相同性在66%-91%之间变化且相似性在89-96%之间变化(表5)。各拟南芥SNH结构域与3个小麦SNH结构域中至少一个为至少70%相同或80%相似(表5)。
表5.本专利包括的3个小麦TaGIF蛋白(上述序列)和最接近拟南芥同源物(AtGIF)的比较。保守SNH结构域的相同性百分比和相似性百分比通过BLAST P获得。
Figure BDA0003535299770000202
实施例11–与基因组编辑联用
本公开所提供方法可能对与基因组编辑联用是理想的,因为转基因能在编辑事件完成后隔离出来。这消除了GRF-GIF嵌合体会负面影响植物表型的风险。所公开方法的高转化效率可扩展植物品种列表,该品种受益于CRISPR筛选以在重要农艺基因的调控和编码区中测试数组突变体。我们确认了用载体JD635(GRF4-GIF1-gRNA-GeneQ)转化的胚的高再生效率(图9A),所述载体包括GRF4-GIF1嵌合体、Cas9和靶向基因Q的gRNA(图9B)。我们测试了32个来自此转化的愈伤组织并确认其中30个的靶基因编辑(图9C)。
实施例12–在缺乏选择性标志物的情况下选择转基因植物
本文公开的方法可用于避免使用选择性标志物,所述标志物受到不同意应用转基因植物的人的反对。GRF4-GIF1嵌合体在缺乏细胞分裂素情况下再生的能力证明了仅通过消除或大幅降低细胞分裂素浓度,能从非转基因植物选择转基因植物。图10显示来自图9所示相同JD635构建体(GRF4-GIF1-Cas9-gRNA-GeneQ)的愈伤组织,其在没有潮霉素且具有十分之一正常细胞分裂素浓度的培养基中生长。JD635存在下观察到多个再生芽(图10A,绿色箭头),在用DsRed(无GRF4-GIF1嵌合体)转化的对照中没有观察到再生(图10B)。
用于实施例1-12的材料与方法。
载体。所有克隆PCR用Phusion高保真DNA聚合酶(NEB)进行。克隆反应用Gateway克隆技术(英杰(Invitrogen))完成。用Spectrum植物总RNA试剂盒(西格玛奥德里奇(Sigma-Aldrich))从穗提取RNA,经RQ1无RNase的DNA酶(普洛麦格(Promega))处理,随后用SuperScript II逆转录酶(英杰)完成cDNA合成。为克隆编码(SEQ ID NO:12)所描述的蛋白的小麦GRF4和编码(SEQ ID NO:27)所描述的蛋白的GIF1的编码区,对产生自Kronos穗的cDNA实施PCR。特异于GRF4(Fw-GRF4a/Rev-GRF4a)和GIF1(Fw-GIF1a/Rev-GIF1a)的引物序列如表6所示。
PCR片段首先通过B/P gateway反应克隆于pDONR,转化入化学感受态大肠杆菌(E.coli)DH5α。克隆序列通过桑格测序确认。GRF4-GIF1嵌合体通过重叠PCR产生。第一步中,用引物FW-GRF4a/Rev-GRF4b和Fw-GIF1b/Rev-GIF1b,从作为模板的pDONR-GRF4和pDONR-GIF1克隆扩增GRF4和GIF1编码序列。引物Rev-GRF4b产生的3’末端与Fw-GIF1b 5’末端重叠12个核苷酸。这12个核苷酸在GRF4与GIF1之间产生4个丙氨酸的桥。PCR片段进行凝胶纯化,在第二次PCR中用作模板,采用引物Fw-GRF4/Rev-GIF1b。引物序列如表6所示。所得产物通过B/P gateway反应克隆于pDONR并转化入化学感受态大肠杆菌DH5α。载体序列通过桑格测序验证。接着,GIF1和嵌合GRF4-GIF1基因通过L/R gateway反应克隆于双元载体pLC41,其在玉米UBIQUITIN启动子下,并转化入化学感受态大肠杆菌DH5α。单个基因pLC41:GRF4(SEQ ID NO:1)和pLC41:GIF1(SEQ ID NO:2)以及嵌合pLC41:GRF4-GIF1(SEQ ID NO:5)的所得的载体通过限制性酶消化证实并电穿孔转化于农杆菌EHA105中。
葡萄VviGRF4-GIF1嵌合体(DNA,SEQ ID NO:6,蛋白,SEQ ID NO:7)通过基因合成产生。DNA片段通过B/P gateway反应克隆于pDONR并转化入化学感受态大肠杆菌DH5α。载体序列通过桑格测序验证。接着,葡萄嵌合GRF4-GIF1通过B/P gateway反应克隆于双元载体pGWB14,其在病毒35S启动子下,并转化入化学感受态大肠杆菌DH5α。所得载体pGWB14-VviGRF4-GIF1(DNA,SEQ ID NO:8)通过限制性酶消化证实并电穿孔转化于农杆菌EHA105中。地塞米松诱导型形式VviGRF4-GR-GIF1(SEQ ID NO:31)通过重叠PCR产生。
编码蛋白(SEQ ID NO:32)的小麦地塞米松诱导型GRF4-GR-GIF1嵌合体通过重叠PCR产生。第一步中,GRF4、GIF1和大鼠糖皮质激素受体(GR)用引物FW-GRF4a/Rev-GRF4b、Fw-GIF1-GR/Rev-GIF1b、Fw-GR/Rev-GR扩增。引物Rev-GRF4b产生的3’末端与Fw-GR 5’末端重叠12个核苷酸,Rev-GR产生的3’末端与Fw-GIF1-GR5’末端重叠12个核苷酸。这12个核苷酸在GRF4与GR以及GR与GIF1之间产生4个丙氨酸的桥。这3个PCR片段进行凝胶纯化,在第二次PCR中用作模板,采用引物Fw-GRF4/Rev-GIF1b。引物序列如表6所示。所得产物通过B/Pgateway反应克隆于pDONR并转化入化学感受态大肠杆菌DH5α。载体序列通过桑格测序验证。接着,嵌合GRF4-GR-GIF1通过L/R gateway反应克隆于双元载体pLC41,其在玉米UBIQUITIN启动子下,并转化入化学感受态大肠杆菌DH5α。所得载体pLC41:GRF4-GR-GIF1(SEQ ID NO:33)通过限制性酶消化证实并电穿孔转化于农杆菌EHA105中。
miR396抗性形式的小麦GRF4-GIF1(rGRF4-GIF1)(SEQ ID NO:34)通过重叠PCR产生。第一步中,2次PCR用引物Fw-GRF4a/rGRF-Rev和rGRF-Fw/Rev-GIF1b实施,pLC41-GRF4-GIF1克隆用作模板。引物rGRF-Fw和rGRF-Rev重叠17nt且在miR396靶位点内引入沉默突变。2种PCR片段都进行凝胶纯化,在第二次PCR中用作模板,采用引物Fw-GRF4/Rev-GIF1b。引物序列如表6所示。所得产物通过B/P gateway反应克隆于pDONR并转化入化学感受态大肠杆菌DH5α。载体序列通过桑格测序验证。其次,嵌合rGRF4-GIF1通过L/R gateway反应克隆于双元载体pLC41,其在玉米UBIQUITIN启动子下,并转化入化学感受态大肠杆菌DH5α。所得载体pLC41:rGRF4-GIF1通过限制性酶消化证实并电穿孔转化于农杆菌EHA105中。塞米松诱导型形式rGRF4-GR-GIF1通过重叠PCR产生。
为产生JD635-GRF4-GIF1-Cas9-gRNA-GeneQ载体,包括玉米UBIQUITIN启动子、GRF4-GIF1嵌合体和Nos终止子的盒通过PCR扩增。PCR产物进行凝胶纯化,通过In-fusion(宝生物工程株式会社美国公司(Takara Bio USA,Inc.))克隆入含有小麦密码子优化的Cas9(TaCas9)的pYP25F双元载体AscI位点,并转化入化学感受态大肠杆菌DH5α。载体序列通过桑格测序验证。其次,靶向基因Q编码区的引导RNA构建体通过Golden Gate反应克隆到载体的2个AarI位点,转化入化学感受态大肠杆菌DH5α。所得JD635-GRF4-GIF1-Cas9-gRNA-GeneQ载体(SEQ ID NO:35)通过桑格测序验证并电穿孔转化于农杆菌EHA105中。
对照载体:pLC41骨架中的JD518载体包含驱动LhG4表达的GLOSSY启动子,其是在小麦中没有靶标的人工转录因子。pLC41骨架中的pOp:FT1载体包含pOp下游克隆的FT1编码序列,其是人工启动子。中的pLC41-DsRed在pLC41骨架含有克隆在玉米UBIQUITIN启动子下游的DsRed编码序列。
小麦转化。转基因小麦植物最初用来自日本烟草公司(JT)的技术产生。然而,细胞分裂素浓度在一些后续实验中下降。就各载体进行2次独立实验。各实验中,来自Kronos的至少25个幼胚用农杆菌EHA105转化,采用前述pLC41构建体。转基因植物的选择用潮霉素实施,除非实验中细胞分裂素浓度降低(图10),其中没有使用潮霉素或任何其他抗生素。再生植物转移至土壤且转基因插入通过DNA提取和PCR验证。用于对T0植物基因分型的引物如表6所示。
表6.用于克隆和基因分型的引物序列
Figure BDA0003535299770000221
植物生长条件。转基因植物生长于PGR15生长箱(Conviron),该生长箱调至16小时光照(22℃)和8小时黑暗(18℃)。在植物头部高度测量的卤化钠光强度为(~260μM m-2s-1)。
系统发育分析。来自拟南芥及稻的GRF和GIF基因的氨基酸序列获自phytozome(phytozome.jgi.doe.gov/pz/portal.html)。小麦GRF和GIF基因的氨基酸序列获自小麦基因组RefSeq v1.0。系统发育分析用MEGA6进行。
实施例13–用单一GRF4或GIF1基因的小麦转化
小麦转化实验用GRF4-GIF1嵌合体(SEQ ID NO:5)、GRF4(SEQ ID NO:1)或GIF1(SEQ ID NO:2),单独或在同一构建体内(图18A)实施。14次实验中(表7),GRF4-GIF1嵌合体的再生效率(65.8±5.3%)比空载体对照(8.7±2.0%,P<0.0001,图18B)高>7倍。在5次单独转化实验中(表7),观察到转化有单独GRF4基因(20.4±11.4%)或单独GIF1基因(17.2±6.6%)的胚再生效率显著低于GRF4-GIF1嵌合体(54.6±9.8%,图基P<0.05,图18C)。转化有单个基因的愈伤组织再生效率比对照(6.0±3.0%)高约3倍,但图基检验中的差异不显著(图18C)。
然后,我们比较了在嵌合体中融合GRF4和GIF1或两者由单独的Ubi启动子于同一构建体分开表达(未融合,SEQ ID NO:36)对再生效率的影响(图18A,表7)。在5次不同实验中,分开的GRF4和GIF1基因的平均再生效率(38.6±12.9%)显著低于(P<0.0144)GRF4-GIF1嵌合体的再生效率(62.6±10.3%,图18D)。此结果证明嵌合体中2种蛋白的受迫接近提高了其诱导再生的能力。
实施例14–用包括其他GRF和/或GIF序列组合的嵌合体的小麦转化
GRF和GIF基因家族包含数个成员(图19A,B),其展示出重叠功能和冗余性。因此,我们预期除了GRF4和GIF1外,其他GRF和/或GIF也能促进再生。
测试来自不同进化枝的GRF基因(图19A),包括与GRF4同一进化枝中的GRF5(SEQID NO:13,37)、最接近的进化枝(其包括拟南芥GRF5,SEQ ID NO:21)中的GRFGRF1(SEQ IDNO:9,38)、更远进化枝中的GRF9(SEQ ID NO:39)。8种小麦蛋白中连接的QLQ-WRC结构域的成对比较如表8所示,所述蛋白包括此专利中测试的4种。用于GRFGRF1-GRF6和GRF9的所有成对比较为≥70%相同或>80相似,证实其相似性和潜在重叠功能。
我们生成了所有选定的GRF和GIF1的嵌合体(图20A)(SEQ ID NO:40-42),并在Kronos中测试转化频率。由包括与GIF1融合的密切相关的GRF4和GRF5基因的嵌合体诱导的再生效率显著高于包括与GIF1融合的关系更远的GRFGRF1和GRF9基因的嵌合体所观察到的再生效率(对比P=0.0368,图20B)。有趣的是,包括GRFGRF1和GRF9的嵌合体显示再生频率高于对照,但差异不显著(表7)。因此,来自不同进化枝的GRF能促进再生,但来自GRF4-进化枝的GRF似乎再生活性更高。
接着,通过产生与GRF4的嵌合体(图20C),我们比较了系统树中包括的3个小麦GIF基因的活性(图19B)。如同GIF1,含有GIF2(SEQ ID NO:28、43、45)和GIF3(SEQ ID NO:29、44、46)的嵌合体相较于用空载体转化的愈伤组织,再生频率也增加(图20D、表7)。然而,GRF4-GIF1组合产生的再生效率高于GRF4-GIF2和GRF4-GIF3组合(对比P=0.0046),所有3个嵌合体显示再生效率高于对照(图基检验P<0.05)。
实施例15.GRF4-GIF1改进再生效率低或难培养的小麦和黑小麦品种的转化
我们随后测试GRF4-GIF1嵌合体从商业硬质小麦、面包小麦和黑小麦品系产生转基因植物的潜能,所述品系先前不适应农杆菌介导或在UCD植物转化设施中的再生效率低(图21)。采用GRF4-GIF1嵌合体观察到四倍体小麦良种系Desert King(63.0±17.0%vs.2.5±2.5%)和六倍体小麦Fielder(54.0±4.0%vs.2.5±2.5%)的再生频率相对于对照增加。对于之前都无法生成转基因植物的六倍体小麦品种Hahn和Cadenza以及黑小麦繁育系UC3190,观察到GRF4-GIF1嵌合体的再生频率为9-19%,相比之下对照为0%(图21和表9A、B)。
实施例16.小麦GRF4-GIF1改进水稻品种Kitaake的转化
我们还在水稻品种Kitaake中测试了小麦GRF4-GIF1嵌合体。我们在2次独立转化实验中观察到,用小麦GRF4-GIF1嵌合体转化的愈伤组织中水稻再生效率(平均40.1±5.4%)相较于用对照载体转化的那些(17.6±5.9%,表9C)增加2-3倍。这些结果表明小麦GRF4-GIF1嵌合体对在另一农艺上重要的单子叶植物物种中提高再生有效。
实施例17-用诱导型系统调节表达
通过在GRF4-GIF1中间克隆大鼠糖皮质激素受体(GR),我们生成了诱导型形式的小麦嵌合体GRF4-GIF1(SEQ ID NO:32、33)(图22A)。此嵌合体仅在合成激素地塞米松(Dex)存在下激活。来自Kronos的幼胚用此载体转化,然后接种于常规转化培养基或补充有10μMDEX的培养基。没有DEX时,我们从23个胚回收了1个芽,而存在DEX时,从24个胚回收了4个芽(图22B)。此结果表明GRF-GR-GIF嵌合体具有功能且可通过加入DEX而外源控制。
实施例18-无外源细胞分裂素的转基因植物再生
在许多植物转化系统中,再生芽需要细胞分裂素(图23A)。有趣的是,观察到接种了用小麦GRF4-GIF1嵌合体转化农杆菌的胚能够在没有细胞分裂素的生长素培养基中快速再生出绿芽(图23B)。随后,在没有细胞分裂素和潮霉素情况下,测试来自稳定GRF4-GIF1转基因(n=27)和非转基因(n=26)T1姊妹系的幼胚再生效率。这些条件下,GRF4-GIF1转基因植物的再生效率(77.8%)显著高于非转基因姊妹系(11.5%,图24)。这些结果表明GRF4-GIF1嵌合体在未加入外源细胞分裂素的情况下能促进胚发生和芽再生。
基于先前结果,我们开发了在生长素培养基中选择转基因芽的操作,不需要使用基于抗生素的标志物。我们在3次实验中用GRF4-GIF1无标志物载体(SEQ ID NO:47)回收了40个芽,用空载体回收了15个。基因分型显示40个GRF4-GIF1芽中的10个(25%)是转基因,而没有对照是阳性的(图23C)。这些过表达GRF4-GIF1嵌合体而不用选择标志物的高再生转基因植物能用于未来的转化实验,以使用选择性标志物纳入其他基因。此策略产生分开的插入位点用于GRF4-GIF1和第二转基因,这有利于在下一代中分离GRF4-GIF1插入。
实施例19.GRF4-GIF1加速小麦转化
小麦GRF4-GIF1嵌合体加速再生过程,这允许我们开发更快的小麦转化方案,就此手稿所示全部小麦实验所需的时间而言为56天,而不需91天(图25)。
实施例20-GRF-GIF提高双子叶植物物种的转化
我们进行一系列柑橘转化实验以测试GRF-GIF技术在再生效率有限的双子叶作物中的作用。在2个物种中生成异源柑橘(SEQ ID NO:48)和葡萄(SEQ ID NO:6-8)GRF4-GIF1嵌合体,使用与小麦GRF4(柑橘Ciclev10032065m和葡萄GSVIVT01024326001)及GIF1(柑橘Ciclev10022144m和葡萄GSVIVT01036262001)最接近的同源物(图19A和B)。在香椽砧木Carrizo的3次独立转化实验中,上胚轴用柑橘和葡萄GRF4-GIF1嵌合体转化。用柑橘GRF4-GIF1嵌合体转化的上胚轴显示再生效率相对于转化有空载体对照的那些增加4.8倍(图26A和表10)。异源的葡萄GRF4-GIF1嵌合体产生的柑橘再生效率提高类似柑橘嵌合体(图26B和表10)。
我们还测试了抗miR396的葡萄GRF4-GIF1形式的效果(此后,rGRF4-GIF1,SEQ IDNO:50),其中在用于miR396的GRF结合位点引入沉默突变以避免切割(图26B-C)。在3次独立实验中,观察到葡萄rGRF4-GIF1嵌合体产生最高频率的转基因事件(相较于对照增加8.4倍,P<0.05)。比较对照vs.3个组合GRF-GIF构建体的统计分析也是显著的(P=0.0153,图26D和表10)。尽管其再生频率较高,rGRF-GIF构建体需要额外优化(如诱导型系统),因为一些转基因事件生成了无法产生芽的巨大愈伤组织。
我们在葡萄转化中测试葡萄rGRF41-GIF1嵌合体。葡萄转化方法缓慢且很费力,需要8-12个月来产生转基因植物品系。产生自未成熟花中花药花丝的前胚发生(pro-embryogenic)愈伤组织用携带空载体和rGRF4-GIF1载体的农杆菌接种。培养6个月后,观察到87%用rGRF4-GIF1转化的愈伤组织再生出芽,而仅12%对照愈伤组织生成芽(图27)。有趣的是,与柑橘rGRF4-GIF1转基因不同,所有葡萄rGRF4-GIF1发育出生成叶的芽。
然后,我们在甜椒(番椒)转化中测试GRF-GIF技术,使用栽培品种“R&C cayenne”。在番椒中,有2个接近的GRF4旁系同源物,命名为GRF4.1和GRF4.2,以及2个接近的GIF1旁系同源物,在此命名为GIF1.1和GIF1.2。我们在构建GRF4.1-GIF1.1嵌合体(SEQ ID NO:138-139)中使用了GRF4.1(LOC107869915)和GIF1.1(LOC107870303)旁系同源物,该嵌合体克隆入携带BAR选择的双元载体pEarleyGate100。对于转化实验,使用靠近茎尖的子叶基部,叶柄仍附着。用甜椒GRF4.1-GIF1.1嵌合体转化的40个子叶块显示再生效率为23.8%,比用空载体pEarleyGate100转化的子叶块再生效率高4.76倍(5.0%再生效率,表11,图28)。
用于实施例13-20的方法。
方法1.用于转化实验的载体。
小麦载体。为产生表达GRF4和GRFGRF1但未融合的载体(Ubi::GRF4-term Ubi::GIF1-term,SEQ ID NO:36),完整Ubi::GRF4-term盒通过PCR扩增,pLC41:GRF4(SEQ ID NO:1)用作模板,采用引物
Fw_HindIII gccactcagcaagctttgcagcgt(SEQ ID NO:119)和
Rev-HindIII TCACGCTGCAAAGCTCTAATTCCCGATCTAGTAAC(SEQ ID NO:120)。我们在pGEMT-easy中克隆PCR片段并将Ubi::GRF4-term片段次克隆到pLC41:GIF1(SEQ ID NO:2)的HindIII位点。
为产生不同小麦GRF-GIF嵌合体,GRFGRF1(SEQ ID NO:38)、GRF5(SEQ ID NO:37)、GRF9(SEQ ID NO:39)、GIF2(SEQ ID NO:43)和GIF3(SEQ ID NO:44)的编码序列通过基因合成获得。然后,不同嵌合体(GRFGRF1-GIF1、GRF5-GIF1、GRF9-GIF1、GRF4-GIF2、GRF4-GIF3)通过重叠PCR产生,遵循的策略与就产生GRF4-GIF1所述相同。在pLC41载体中通过L/R反应克隆所有嵌合体并通过限制性酶消化确认载体,电穿孔转化于农杆菌菌株EHA105中。
我们通过基因合成和pDONR中克隆来产生柑橘GRF4-GIF1嵌合体(SEQ ID NO:48)。经桑格测序检查序列后,我们将其通过L/R反应克隆入pGWB14双元载体,其在病毒35S启动子下,并转化入化学感受态大肠杆菌DH5α。我们通过限制性酶消化证实所得载体pGWB14-柑橘GRF4-GIF1(DNA,SEQ ID NO:49)并电穿孔转化于农杆菌EHA105中。
我们通过重叠PCR产生抗miR396形式的葡萄GRF4-GIF1(rGRF4-GIF1)。实施2次PCR反应,采用引物
Fw-GRF GGGGacaagtttgtacaaaaaagcTGCCACCATGAAGCAAAGCTTTGTGG(SEQ ID NO:121)
/rGRF-Rev TCGACCGGTTTTCTAGAACGGTTGCGG(SEQ ID NO:122)和
rGRF-Fw TCTAGAAAACCGGTCGAATCACAAACTA(SEQ ID NO:123
/Rev-GIF
GGGGACCACTTTGTACAAGAAAGCTGAACGTCAATTCCCATCTTCAGCA(SEQ ID NO:124)使用pGBW14-葡萄GRF4-GIF1克隆作为模板(SEQ ID NO:8)。引物rGRF-Fw和rGRF-Rev重叠17个核苷酸且在miR396靶位点引入沉默突变(图26C)。PCR片段进行凝胶纯化并在第二次PCR中用作模板,引物为Fw-GRF/Rev-GIF。在pDONR中通过B/P gateway反应克隆所得产物。接着,我们通过L/R gateway反应将嵌合rGRF4-GIF1克隆于双元载体pGWB14,其在病毒35S启动子下。所得载体在农杆菌EHA105中通过电穿孔转化。
我们通过基因合成产生甜椒GRF4.1-GIF1.1嵌合体(SEQ ID:NO:140),并将其克隆入双元载体pEarleyGate100,其在35S启动子下。pEarleyGate100具有BAR选择标志物。我们将该构建体,命名为pTH1903,通过电穿孔转化于农杆菌EHA105中。作为对照,我们使用了孔载体pEarleyGate100。
植物转化
方法2.小麦转化方案。小麦转化遵循先前发表的方案。简言之,在生长箱中以长日照光周期(16小时的380μM m-2s-1光照,26℃白天和18℃晚上)培养不同小麦和黑小麦栽培品种。开花后约2周从穗收获未成熟谷粒,70%乙醇中1分钟,然后在1.2%(v/v)次氯酸钠溶液加5μl吐温(tween)中10分钟进行表面消毒。表面消毒后,无菌水洗涤种子3次,立体显微镜下分离幼胚。
我们离心液体培养基中的分离的幼胚,随后用农杆菌接种。胚以盾片朝上转移至共培养培养基并在23℃避光温育。2-3天后,切除胚轴并移至无选择的愈伤组织诱导培养基,我们将其在所述培养基上25℃避光温育。5天后,胚转移至有30mg/l潮霉素的选择培养基,25℃避光温育。
3周后,我们将愈伤组织转移至含有100mg/l潮霉素的选择培养基。另外3周后,我们将正在增殖的组织转移至含有50mg/l潮霉素的再生培养基,连续光照(30μM m-2s-1)下25℃温育2周。将再生芽转移至含有50mg/l潮霉素的生根培养基。生根的植物如下适应土壤:将其转移至含有36个片插入物(sheet insert)的1020托盘,填充有Sunshine盆栽混合土并覆盖有11x 21x 2英寸透明塑料圆顶,在16小时100μM光照和26℃下持续10天。近期,我们开发了一个更短的转化操作以产生GRF4-GIF1转基因小麦植物,如图25所概括。
方法3.水稻转化方案。水稻转化遵循先前发表的方案(Ishida等.2015,第12届国际小麦遗传会议论文集(Proc.12th Int.Wheat Genet.Symp.)167-173)。简言之,我们选择新鲜稻种,将其脱壳,在含有20%(v/v)漂白剂的旋转瓶中表面消毒30分钟。随后,种子用无菌水冲洗3次。将约25-50粒种子/平板置于愈伤组织诱导培养基(MSD,有维生素的1xMurashige and Skoog培养基,含有30g/l蔗糖、2mg/l 2,4-二氯苯氧基乙酸、1.2%(w/v)琼脂,pH 5.6-5.8),而不让胚碰到培养基,用医用胶带包裹平板,在16小时光照/8小时黑暗,28℃下。10-14天后,我们将愈伤组织与萌发种子的其他部分分离,转移至新鲜MSD琼脂板,共培养前再持续5天。
农杆菌培养:我们从分离自平板的单一菌落中制备甘油冷冻储液。然后,接种含有适当抗生素的1ml LB以维持农杆菌和质粒,28℃,250rpm温育过夜。第二天,将300μl农杆菌培养物接入含适当抗生素和200μM乙酰丁香酮的20ml TY(pH 5.5)。培养物在振荡培养箱中28℃,250rpm温育,直至培养物的OD600达到0.1-0.2(约2-4小时)。
转化和共培养:我们将愈伤组织置于农杆菌悬液30分钟,摇晃悬液以确保对愈伤组织可及性一致。振荡温育后,在无菌沃特曼纸上干燥愈伤组织以去除过量细菌悬液。我们将愈伤组织转移到共培养培养基(MSD+S+AS,有维生素的1x Murashige and Skoog培养基,含有30g/l蔗糖、5%山梨醇、2mg/l 2,4-二氯苯氧基乙酸、200μM乙酰丁香酮、1.6%(w/v)琼脂,pH 5.6-5.8)上并在22℃避光温育3天。
选择:将共培养的愈伤组织转移至选择培养基(MSD+CH+PPM,有维生素的1xMurashige and Skoog培养基,含有30g/l蔗糖、2mg/l 2,4-二氯苯氧基乙酸、400mg/L羧苄青霉素、200mg/l特美汀、1ml/l植物防腐混合物、80mg/L潮霉素、1.2%琼脂,pH5.6-5.8),在连续光照下28℃温育平板。我们将这些愈伤组织在新鲜选择培养基上每8-9天继代培养。
再生和生根:在选择培养基上4-5周后,约2-5mm宽的耐受性微愈伤组织开始出现。我们将这些从初始愈伤组织中挑去并转移到皮氏培养皿,采用再生培养基(BN+S+CH,有维生素的1x Murashige and Skoog培养基,含有30g/l蔗糖、5%山梨醇、3mg/lBAP、0.5mg/lNAA、400mg/l羧苄青霉素、200mg/l特美汀、1ml/l植物防腐混合物、50mg/l潮霉素、1.6%(w/v)琼脂,pH 5.6-5.8),并在连续光照下28℃温育。这些愈伤组织在新鲜选择培养基上每8-9天继代培养。4-5周后,当愈伤组织开始变绿时,再生植物转移到生根培养基(MS+H,有维生素的1x Murashige and Skoog培养基,含有50mg/l潮霉素、1.2%(w/v)琼脂,pH 5.6-5.),并在16小时光照/8小时黑暗下28℃温育。当根发育良好时,我们将植物转移至土壤。
方法4.柑橘转化操作。我们将Carrizo枳橙砧木的种子置于水中以吸水且随后剥去种皮,确保不移除珠被。种子在0.6%(v/v)次氯酸钠溶液加5μl吐温20中表面消毒,这是通过将其置于50ml离心管并在100rpms振荡20分钟进行的。用150-200ml无菌蒸馏水冲洗种子3x。我们将种子置于琼脂固化的1/2x Murashige and Skoog基本有机培养基(1/2xMSO),所述培养基含有15g/l蔗糖、7gm TC琼脂(pH 5.6-5.8),将种子轻轻推入培养基以更均一地发芽。26℃避光温育。
农杆菌培养:我们从分离自平板的单一菌落制备甘油冷冻储液。然后,我们使用40μl储液接种含有适当抗生素的20ml MGL培养基(pH 7.0)从而维持农杆菌和质粒,28℃,250rpm温育过夜。第二天,移出5ml过夜生长物并转移至含有适当抗生素和200μM乙酰丁香酮的15ml TY培养基(pH 5.5)。我们将过夜培养物在28℃,250rpm温育过夜,接着将生长于TY培养基的过夜培养物稀释到O.D600nm0.1-0.2。
共培养:我们收集2-5周龄的黄化上胚轴并置于皮氏培养皿,该培养皿含有10ml如上所制备的农杆菌溶液(0.1-0.2OD600)。将上胚轴浸入0.5cm切片且浸泡10分钟。我们将上胚轴切片转移到由以下组成的共培养培养基:Murashige and Skoog基本有机培养基(MSO),用30g/l蔗糖、3.0mg/l BAP、0.1mg/l NAA和200-μM乙酰丁香酮改良(modified),pH5.6-5.8。23℃避光温育。
诱导:2-3天后,将上胚轴块转移至诱导培养基,其由用30g/l蔗糖、3.0mg/l BAP、0.1mg/l NAA、400mg/l羧苄青霉素、150mg/l特美汀和100mg/l硫酸卡那霉素改良的MSO组成,避光温育。10天后,我们将上胚轴切片在配制相同的新鲜培养基中继代培养,接着每21天继代培养。避光下第二个21天周期后,培养物转移至30μM光照和16小时光照/8小时黑暗的光周期。每21天继续转移到相同培养基的新鲜培养基,直至器官性的分化芽在切割末端发育。
伸长:芽开始形成后,我们将正在发育的芽转移至由用30g/l蔗糖、0.1mg/l BA、400mg/l羧苄青霉素、150mg/l特美汀和100mg/l硫酸卡那霉素改良的MSO组成的伸长培养基。我们将培养物如上温育且每21天按需继代培养,直至芽伸长。
生根:芽达到2-4cm高度后,收获芽并将其转移至由用30g/l蔗糖、5mg/l NAA、250mg/l头孢噻肟和100mg/l卡那霉素改良的MSO组成的生根培养基。3-5天后,我们将芽转移至用30g/l蔗糖、0.0mg/l NAA,400mg/l羧苄青霉素和100mg/l卡那霉素改良MSO。芽在14天内开始生根。
方法5.葡萄转化方案。为产生转基因葡萄(汤姆逊无核),我们将体细胞胚转移至5ml液体Lloyd and McCown WPM,其补充有20g/l蔗糖、1g/l酪蛋白、1mM MES、500mg/l活性炭、0.5mg/l BAP、0.1mg/l NAA、200μM乙酰丁香酮和12.5μl pluronic F68。我们通过把管置于45℃浴10分钟,对胚进行热激。
我们向溶液加入农杆菌至OD 0.1-0.2,随后将胚转移到含有7mm沃特曼滤纸片的空100x20mm皮氏培养皿。用石蜡膜包裹皮氏培养皿,23℃避光温育。2-3天后,收获胚并将其转移至WPM选择培养基。我们将所述培养基补充了20g/l蔗糖、1g/l酪蛋白、1mM MES、500mg/l活性炭、0.5mg/l BAP、0.1mg/l NAA、400mg/l羧苄青霉素、150mg/l特美汀、200mg/l卡那霉素(或25mg/l潮霉素)、50g/山梨醇、14g/l琼脂和4ml植物防腐混合物(PPM)。
我们将胚在26℃避光温育,7天后,我们将胚在配制相同的新鲜培养基中继代培养。14和28天后,我们将胚在新鲜培养基中继代培养。额外14天后,我们将胚转移到相同的WPM补充培养基,但没有山梨醇而有8g/l琼脂。我们在平板上将胚分成独立簇,16小时光照,26℃下温育,每2-3周继代培养。我们每14-21天持续继代培养,直至植物形成。
为收获正在形成的(germinating)植物,切割根并将芽置于补充了30g/l蔗糖、0.01mg/l IBA、150mg/l特美汀、400mg/l羧苄青霉素和100mg/l卡那霉素(或25mg/l潮霉素)的WPM上。芽生根后,我们将其转移至含有湿润土壤(1份Sunshine混合物2份蛭石)的2英寸盆,并将盆置于密封的密保诺保鲜袋7天。在16小时100μM m-2s-1光照,26℃下温育,用塑料圆顶盖住苗床(flat)。1周后,打开密保诺保鲜袋以降低湿度。如果新的生长明显,从密保诺保鲜袋中移出植物且圆顶下再放置7天以完成环境适应。
方法6.甜椒转化操作。
我们将番椒栽培品种“R&C cayenne”种子在含5μl吐温20的1.2%(v/v)次氯酸钠溶液中表面消毒。种子在定轨摇床上100rpms搅拌20m,然后在150-200ml无菌蒸馏水中冲洗一次。将其转移至琼脂固化的1/2强度Murashige and Skoog基本有机培养基(1/2xMSO),所述培养基补充了16g/l葡萄糖、600mg/l MES、8gm PhytoAgar(Plantmedia产品目录号40100072-4)(pH 5.6-5.8)。种子在16小时光周期和30μM m2 s-1光照,26℃下温育。
10天后,我们将幼苗置于调整到O.D.600 0.1-0.2的根癌农杆菌(菌株EHA 105)溶液。幼苗浸没在农杆菌溶液中时,我们将子叶顶端的2/3去除,随后从幼苗仔细去除子叶保留子叶的叶柄而不是顶端分生组织。5分钟后,将外植体转移入含有共培养培养基的100x20mm皮氏培养皿,所述培养基由用16g/l葡萄糖、600mg/l MES、10mg/l苄氨基嘌呤(BAP)、1.0mg/l吲哚-3-乙酸(IAA)和200μM乙酰丁香酮改良的MSO培养基组成,pH 5.6-5.8。我们将组织在23℃避光温育。
2-3天后,我们将子叶块转移至由补充了16g/l葡萄糖、600mg/l MES、10mg/l BAP、1.0mg/l IAA、300mg/l特美汀、400mg/l羧苄青霉素、8mg/l草铵膦(西格玛奥德里奇(Sigma-Aldrich)产品目录号45520)MSO培养基组成的诱导培养基。10天后,我们将子叶在新鲜培养基中继代培养,接着每21天继代培养。
幼芽(bud)开始形成后,我们将正在发育的幼芽转移到伸长培养基。此培养基由Driver和Kuniyaki Walnut(DKW)培养基组成,补充了30g/l葡萄糖、1.3g/l ca葡糖酸盐、2.0mg/l meta-topolin(MT)、10mg/l赤霉酸(GA3)、2ml/l硫代硫酸银储液(STS)(1.0ml100mg/ml AgNO3储液,加入1.0ml 12mM(95mg/50ml水储液)、300mg/l特美汀、400mg/l羧苄青霉素、4ml/l PPM和8mg/l草铵膦。
随着芽开始从幼芽发育,我们将其转移到由补充了30g/l葡萄糖、1.3g/l ca葡糖酸盐、0.5mg/l MT、10mg/l GA3、2.0ml/l STS、300mg/l特美汀、400mg/l羧苄青霉素、4ml/lPPM和8mg/l草铵膦的DKW组成的伸长培养基。如上所述温育,每14天转移至新鲜培养基。我们在每次继代培养去除芽基部的任何正在发育的愈伤组织(图28)。
连续2次14天继代培养后,将伸长的芽转移到补充了30g/l葡萄糖、1.3g/l ca葡糖酸盐、0.1mg/l MT、10mg/l GA3、1.0ml/l STS、300mg/l特美汀、400mg/l羧苄青霉素、4ml/lPPM和8mg/l草铵膦的DKW培养基,所述培养基。当芽达到2-3cm尺寸时,将其转移到由补充了30g/l蔗糖、0.1mg/l NAA、2ml/l STS、300mg/l特美汀400mg/l羧苄青霉素4ml/L PPM和4g/l草铵膦的DKW培养基组成的生根培养基。
表7.在四倍体小麦Kronos中,不同GRF-GIF组合相较于空载体的再生频率(17次实验)。接种了各特定构建体的胚数(n)/基因型。再生频率估计为显示至少一个再生芽的愈伤组织数目/接种的胚的总数。这是一个保守估计,因为从用GRF4-GIF1构建体转化的胚中生成的愈伤组织通常产生多个独立转化芽。蓝色“x”指示不同图所示纳入统计分析的实验。此表中的所有实验用常规91d方案完成。
Figure BDA0003535299770000291
表8.比较8个小麦(小麦)旁系同源蛋白(TaGRF)。连接的QLQ-WRC保守结构域的相同性百分比(第一个数)和相似性百分比(第二个数)通过BLAST P获得。
Figure BDA0003535299770000292
Figure BDA0003535299770000301
用于比较的序列(QLQ带下划线,WRC以粗体显示)
>TraesCS6A01G335900TaGRFGRF1
Figure BDA0003535299770000302
>TraesCS7A01G165600TaGRF2GRF2
Figure BDA0003535299770000303
>TraesCS6A01G269600TaGRF4
Figure BDA0003535299770000304
>TraesCS2A01G435100TaGRF3
Figure BDA0003535299770000305
>TraesCS7A01G049100TaGRF5
Figure BDA0003535299770000306
>TraesCS4A01G291500TaGRF9
Figure BDA0003535299770000307
>TraesCS4A01G255000TaGRF6
Figure BDA0003535299770000308
>TraesCS6A01G257600TaGRFGRF12
Figure BDA0003535299770000309
表9.用小麦GRF4-GIF1嵌合体或空载体转化的植物的再生频率。A)四倍体和六倍体小麦商业品种。B)黑小麦育种品系和C)水稻栽培品种Kitaake。在所有实验中,使用pLC41载体(对于小麦优化),除了第二次水稻实验,其中采用更常用于水稻的pCAMBIA1300载体。EHA105和AGL1是用于渗入的2个不同农杆菌菌株(这2个菌株之间没有观察到差异)。在接种胚数(n)中,第一个数字指示用空载体接种的那些,第二个数字指示用GRF4-GIF1嵌合体接种的胚。
9A.小麦
Figure BDA00035352997700003010
Figure BDA00035352997700003011
Figure BDA0003535299770000311
Figure BDA0003535299770000312
Figure BDA0003535299770000313
9B.黑小麦
Figure BDA0003535299770000314
9C.水稻。水稻(水稻)栽培品种Kitaake的再生频率。2个实验都用农杆菌菌株EHA105进行。第一次实验使用小麦-优化的载体pLC41,实验2-4使用水稻-优化的载体pCAMBIA1300载体。接种愈伤组织中的第一个数字代表用空载体接种的愈伤组织,第二个数字代表用GRF4-GIF1嵌合体接种的愈伤组织。实验2-4是用同一种子储液(各100个胚)实施的3次实验。实验2包括Ubi::GRF4-GIF1嵌合体和靶向编码酪氨酸蛋白磺基转移酶(TPST)的基因OsKitaake06g041700的sgRNA,实验3和4包括pCAMBIA1300-gus,而没有嵌合体。
Figure BDA0003535299770000315
表10.柑橘的再生频率。实验1-3使用基于柑橘序列的GRF4-GIF1嵌合体,而实验4-6使用基于葡萄序列的GRF4-GIF1嵌合体。最后3次实验包括第二葡萄构建体,其有miR396结合位点(rGRF4-GIF1)中的突变,阻碍其切割。接种愈伤组织中的第一个数字代表用空载体接种的愈伤组织,其他数字代表用不同GRF4-GIF1嵌合体接种的愈伤组织。
Figure BDA0003535299770000321
表11.番椒的再生频率
栽培品种R&C cayenne(实验号201027/201028),用嵌合体GRF4.1-GIF1.1(SEQ IDNO:140)转化40个子叶块,用空载体pEarleyGate100转化40个子叶块。在接种外植体数(n)中,第一个数字指示用空载体接种的那些,第二个数字指示用GRF4.1-GIF1.1嵌合体接种的胚。
Figure BDA0003535299770000322
序列标识列表
SEQ ID NO:
1:全载体pLC41,具有TaGRF4核苷酸序列
2:全载体pLC41,具有TaGIF1核苷酸序列
3:TaGRF4-GIF1嵌合体核苷酸序列
4:TaGRF4-GIF1氨基酸序列
5:全构建体pLC41小麦GRF4-GIF1嵌合体
6:葡萄GRF4-GIF1嵌合体核苷酸序列
7:葡萄GRF4-GIF1嵌合体蛋白序列
8:全构建体pGWB14,具有葡萄GRF4-GIF1嵌合体
9:TaGRFGRF1氨基酸序列
10:TaGRF2GRF2氨基酸序列
11:TaGRF3氨基酸序列
12:TaGRF4氨基酸序列
13:TaGRF5氨基酸序列
14:OsGRFGRF1氨基酸序列
15:OsGRF2GRF2氨基酸序列
16:OsGRF3氨基酸序列
17:OsGRF4氨基酸序列
18:OsGRF5氨基酸序列
19:AtGRF3氨基酸序列
20:AtGRF4氨基酸序列
21:AtGRF5氨基酸序列
22:AtGRF6氨基酸序列
23:用于嵌合体的葡萄GRF4(SVIVT01024326001)
24:AtGIF1氨基酸序列
25:AtGIF2氨基酸序列
26:AtGIF3氨基酸序列
27:TaGIF1氨基酸序列
28:TaGIF2氨基酸序列
29:TaGIF3氨基酸序列
30:葡萄GIF1(GSVIVT01036262001)
31:地塞米松诱导型嵌合体葡萄GRF4-GR-GIF1蛋白
32:地塞米松诱导型TaGRF4-GR-GIF1
33:pLC41全载体,带有小麦GRF4-GR-GIF1嵌合体
34:小麦rGRF4-GIF1,带有miR396靶位点中的4个突变
35:全构建体JD635,带有小麦GRF4-GIF1嵌合体
36:全构建体pLC41,带有分开的Ubi::GRF4-Ubi::GIF1
37:TaGRF5
38:TaGRFGRF1
39:TaGRF9
40:TaGRFGRF1-GIF1
41:TaGRF5-GIF1
42:TaGRF9-GIF1
43:TaGIF2
44:TaGIF3
45:TaGRF4-GIF2
46:TaGRF4-GIF3
47:全载体pLC41,无标志物,带有小麦GRF4-GIF1嵌合体48:柑橘GRF4-GIF1
49:全pGWB14构建体,带有柑橘GRF4-GIF1
50:葡萄rGRF4-GIF1 miR396抗性嵌合体
51TaGRF4核苷酸序列–来自SEQ ID NO:1载体
52TaGIF1核苷酸序列–来自SEQ ID NO:2载体
53野生型miR396靶位点
54:带有沉默突变的miR396靶位点
55:成熟miR396序列
56:图8的蛋白序列
57:pLC41的LB
58:pLC41的RB
59:玉米UBIQUITIN启动子
60:HA标签
61:nos终止子
62:35s启动子
63:HPT
64:来自SEQ ID NO:35的TaCas9
65:来自SEQ ID NO:35的TaU6启动子
66:来自SEQ ID NO:35的引导RNA基因Q
67:来自SEQ ID NO:35的引导RNA支架
68:图9B中的序列
69:来自SEQ ID NO:9的QLQ结构域
70:来自SEQ ID NO:10的QLQ结构域
71:来自SEQ ID NO:11的QLQ结构域
72:来自SEQ ID NO:12的QLQ结构域
73:来自SEQ ID NO:13的QLQ结构域
74:来自SEQ ID NO:14的QLQ结构域
75:来自SEQ ID NO:15的QLQ结构域
76:来自SEQ ID NO:16的QLQ结构域
77:来自SEQ ID NO:17的QLQ结构域
78:来自SEQ ID NO:18的QLQ结构域
79:来自SEQ ID NO:19的QLQ结构域
80:来自SEQ ID NO:20的QLQ结构域
81:来自SEQ ID NO:21的QLQ结构域
82:来自SEQ ID NO:22的QLQ结构域
83:来自SEQ ID NO:23的QLQ结构域
84:来自SEQ ID NO:9的WRC结构域
85:来自SEQ ID NO:10的WRC结构域
86:来自SEQ ID NO:11的WRC结构域
87:来自SEQ ID NO:12的WRC结构域
88:来自SEQ ID NO:13的WRC结构域
89:来自SEQ ID NO:15的WRC结构域
90:来自SEQ ID NO:16的WRC结构域
91:来自SEQ ID NO:19的WRC结构域
92:来自SEQ ID NO:20的WRC结构域
93:来自SEQ ID NO:21的WRC结构域
94:来自SEQ ID NO:22的WRC结构域
95:来自SEQ ID NO:23的WRC结构域
96:小麦TaGRF4的连接的QLQ-WRC结构域(完整蛋白序列是SEQ ID NO:12):
97:来自SEQ ID NO:24的SNH结构域
98:来自SEQ ID NO:25的SNH结构域
99:来自SEQ ID NO:26的SNH结构域
100:来自SEQ ID NO:27的SNH结构域
101:来自SEQ ID NO:28的SNH结构域
102:来自SEQ ID NO:29的SNH结构域
103:来自SEQ ID NO:30的SNH结构域
104:引物Fw-GRF4b
105:引物Rev-GIF1c
106–118:表6引物
119:引物Fw_HindIII
120:引物Rev-HindIII
121:引物Fw-GRF
122:引物rGRF-Rev
123:引物rGRF-Fw
124:引物Rev-GIF
125:来自TraesCS6A01G335900TaGRFGRF1的QLQ-WRC
126:来自TraesCS7A01G165600TaGRF2GRF2的QLQ-WRC
127:来自TraesCS6A01G269600TaGRF4的QLQ-WRC
128:来自TraesCS2A01G435100TaGRF3的QLQ-WRC
129:来自TraesCS7A01G049100TaGRF5的QLQ-WRC
130:来自TraesCS4A01G291500TaGRF9的QLQ-WRC
131:来自TraesCS4A01G255000TaGRF6的QLQ-WRC
132:来自TraesCS6A01G257600TaGRFGRF12的QLQ-WRC
133:Os11g40100(GIF2)
134:Os12g31350(GIF3)
135:Os03g52320(GIF1)
136:OsGIF3 SNH结构域
137:OsGIF1 SNH结构域
138:辣椒GRF4(LOC107869915)
139辣椒GIF1(LOC107870303)
140辣椒GRF4-GIF1嵌合体.
参考文献
本公开引用的所有参考文献通过引用全文纳入。
Beltramino,M.,Ercoli,M.F.等.(2018).“Robust increase of leaf size byArabidopsis thaliana GRF3-like transcription factors under different growthconditions”.Scientific Reports 8:13447.
Che,R.,Tong,H.等.(2015).“Control of grain size and rice yield by GL2-mediated brassinosteroid responses.”Nat Plants 2:15195.
Debernardi,J.M.,Mecchia,M.A.等.(2014)."Post-transcriptional controlof GRF transcription factors by microRNA miR396 and GIF co-activator affectsleaf size and longevity”.Plant J.74:413-426.
Debernardi,J.M.,Rodriguez,R.E.等.(2012)."Functional specialization ofthe plant miR396 regulatory network through distinct microRNA-targetinteractions".PLoS Genet 8:e1002419.
Duan,P.,Ni,S.等.(2015).“Regulation of OsGRF4 by OsmiR396 controlsgrain size and yield in rice”.Nat Plants 2:15203.
He,Z.,Zeng,J.等,(2017).“OsGIF1 positively regulates the sizes ofstems,leaves,and grains in rice”.Front Plant Sci 8:1730.
Horiguchi,G.,Kim,G.T.等.(2005)."The transcription factor AtGRF5 andthe transcription coactivator AN3 regulate cell proliferation in leafprimordia of Arabidopsis thaliana".Plant J 43:68-78.
Hu,J.,Wang,Y.等.(2015).“A rare allele of GS2 enhances grain size andgrain yield in rice”.Mol Plant 8:1455-1465.
Kim,H.,Kim,S.T.,Ryu,J.,Kang,B.C.,Kim,J.S.,Kim,S.G.(2017).“CRISPR/Cpf1-mediated DNA-free plant genome editing”.Nat Commun 8:14406.
Kim,J.H.和B.H.Lee(2006)."GROWTH-REGULATING FACTOR4 of Arabidopsisthaliana is required for development of leaves,cotyledons,and shoot apicalmeristem."Journal of Plant Biology 49:463-468.
Kim,J.H.,Choi,D.等.(2003)."The AtGRF family of putative transcriptionfactors is involved in leaf and cotyledon growth in Arabidopsis".Plant J 36:94-104.
Kim,J.H.和Kende,H.(2004)."A transcriptional coactivator,AtGIF1,isinvolved in regulating leaf growth and morphology in Arabidopsis".Proc NatlAcad Sci U S A 101:13374-13379.
Lee,B.H.,J.H.Ko等.(2009)."The Arabidopsis GRF-INTERACTING FACTOR genefamily performs an overlapping function in determining organ size as well asmultiple developmental properties."Plant Physiol 151:655-668.
Li,S.,Gao,F.等.(2016).“The OsmiR396c-OsGRF4-OsGIF1 regulatory moduledetermines grain size and yield in rice”.Plant Biotechnol J 14:2134-2146.
Li,S.,Tian,Y.等.(2018).“Modulating plant growth-metabolismcoordination for sustainable agriculture”.Nature 560:595-600.
Liang,Z.,Chen,K.,Li,T.,Zhang,Y.,Wang,Y.,Zhao,Q.,Liu,J.,Zhang,H.,Liu,C.,Ran,Y.,Gao,C.(2017).“Efficient DNA-free genome editing of bread wheatusing CRISPR/Cas9 ribonucleoprotein complexes”.Nat Commun 8:14261.
Liu,D.,Song,Y.等.(2009)."Ectopic expression of miR396 suppresses GRFtarget gene expression and alters leaf growth in Arabidopsis".Physiol Plant136:223-236.
Malnoy,M.,Viola,R.,等.(2016).“DNA-free genetically edited grapevineand apple protoplast using CRISPR/Cas9 ribonucleoproteins”.Front Plant Sci 7:1904
Omidbakhshfard,M.A.,Proost,S.等.(2015).“Growth-Regulating Factors(GRFs):A Small Transcription Factor Family with Important Functions in PlantBiology”.Mol Plant 8:998-1010.
Rodriguez,R.E.,Mecchia,M.A.等.(2010)."Control of cell proliferationin Arabidopsis thaliana by microRNA miR396".Development 137:103-112.
Shimano,S.,Hibara,K.I.等.(2018).“Conserved functional control,butdistinct regulation,of cell proliferation in rice and Arabidopsis leavesrevealed by comparative analysis of GRF-INTERACTING FACTOR 1orthologs”.Development 145:dev159624.
Subburaj,S.,Chung,S.J.等.(2016).“Site-directed mutagenesis in Petuniax hybrid protoplast system using direct delivery of purified recombinant Cas9ribonucleoproteins”.Plant Cell Rep 35:1535–1544.
Sun,P.,Zhang,W.等.(2016).“OsGRF4 controls grain shape,panicle lengthand seed shattering in rice”.J Integr Plant Biol 58:836-847.
Svitashev,S.,Schwartz,C.,Lenderts,B.,Young,J.K.,Mark Cigan,A.(2016).“Genome editing in maize directed by CRISPR-Cas9 ribonucleoproteincomplexes”.Nat Commun 7:13274.
van der Knaap,E.,Kim,J.H.等.(2000)."Anovel gibberellin-induced genefrom rice and its potential regulatory role in stem growth".Plant Physiol122:695-704.
Vercruyssen,L.,Verkest,A.等.(2014)."ANGUSTIFOLIA3 binds to SWI/SNFchromatin remodeling complexes to regulate transcription during Arabidopsisleaf development".Plant Cell 26:210-229.
Wang,L.,Gu,X.等.(2011)."miR396-targeted AtGRF transcription factorsare required for coordination of cell division and differentiation duringleaf development in Arabidopsis".J Exp Bot 62:761-773.
Wolter,F.,Puchta,H.(2017).“Knocking out consumer concerns andregulator’s rules:efficient use of CRISPR/Cas ribonucleoprotein complexes forgenome editing in cereals”.Genome Biol 18:43
Woo,J.W.,Kim,J.等.(2015).“DNA-free genome editing in plants withpreassembled CRISPR-Cas9 ribonucleoproteins”.Nat Biotechnol 33:1162–1164
Zhang,D.,Sun,W.等(2018).“GRF-INTERACTING FACTOR 1regulates shootarchitecture and meristem determinacy in maize”.Plant Cell.30:360-374.
序列表
<110> 加利福尼亚大学董事会
国家科学技术研究委员会
<120> 用生长调节因子(GRF)、GRF相互作用因子(GIF)或嵌合GRF-GIF改进植物再生的方法
<130> P1322WO00
<160> 140
<170> PatentIn version 3.5
<210> 1
<211> 16633
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 1
aagctttgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60
agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120
tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180
tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240
gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300
ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360
gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt 420
agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480
taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540
aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600
cgagtctaac ggacaccaac cagcgaacca gcagcgtcgc gtcgggccaa gcgaagcaga 660
cggcacggca tctctgtcgc tgcctctgga cccctctcga gagttccgct ccaccgttgg 720
acttgctccg ctgtcggcat ccagaaattg cgtggcggag cggcagacgt gagccggcac 780
ggcaggcggc ctcctcctcc tctcacggca cggcagctac gggggattcc tttcccaccg 840
ctccttcgct ttcccttcct cgcccgccgt aataaataga caccccctcc acaccctctt 900
tccccaacct cgtgttgttc ggagcgcaca cacacacaac cagatctccc ccaaatccac 960
ccgtcggcac ctccgcttca aggtacgccg ctcgtcctcc cccccccccc ctctctacct 1020
tctctagatc ggcgttccgg tccatggtta gggcccggta gttctacttc tgttcatgtt 1080
tgtgttagat ccgtgtttgt gttagatccg tgctgctagc gttcgtacac ggatgcgacc 1140
tgtacgtcag acacgttctg attgctaact tgccagtgtt tctctttggg gaatcctggg 1200
atggctctag ccgttccgca gacgggatcg atttcatgat tttttttgtt tcgttgcata 1260
gggtttggtt tgcccttttc ctttatttca atatatgccg tgcacttgtt tgtcgggtca 1320
tcttttcatg cttttttttg tcttggttgt gatgatgtgg tctggttggg cggtcgttct 1380
agatcggagt agaattctgt ttcaaactac ctggtggatt tattaatttt ggatctgtat 1440
gtgtgtgcca tacatattca tagttacgaa ttgaagatga tggatggaaa tatcgatcta 1500
ggataggtat acatgttgat gcgggtttta ctgatgcata tacagagatg ctttttgttc 1560
gcttggttgt gatgatgtgg tgtggttggg cggtcgttca ttcgttctag atcggagtag 1620
aatactgttt caaactacct ggtgtattta ttaattttgg aactgtatgt gtgtgtcata 1680
catcttcata gttacgagtt taagatggat ggaaatatcg atctaggata ggtatacatg 1740
ttgatgtggg ttttactgat gcatatacat gatggcatat gcagcatcta ttcatatgct 1800
ctaaccttga gtacctatct attataataa acaagtatgt tttataatta ttttgatctt 1860
gatatacttg gatgatggca tatgcagcag ctatatgtgg atttttttag ccctgccttc 1920
atacgctatt tatttgcttg gtactgtttc ttttgtcgat gctcaccctg ttgtttggtg 1980
ttacttctgc agactagagt tatcaacaag tttgtacaaa aaagctgcca ccatggcgat 2040
gccgtatgcc tctctttccc cggcaggcga ccgccgctcc tccccggccg ccaccgccac 2100
cgcctccctc ctccccttct gccgctcctc ccccttctcc gccggcggca atggcggcat 2160
gggggaggag gcgcggatgg acgggaggtg gatggcgagg ccggtgccct tcacggcggc 2220
gcagtacgag gagctggagc accaggcgct catatacaag tacctggtgg ccggcgtgtc 2280
cgtcccgccg gatctcgtgc tccccatccg ccgcggcatc gagtccctcg ccgcccgctt 2340
ctaccacaac cccctcgcca tcgggtacgg atcgtacctg ggcaagaagg tggatccgga 2400
gccgggccgg tgccggcgca cggacggcaa gaagtggcgg tgcgccaagg aggccgcctc 2460
cgactccaag tattgcgagc gccacatgca ccgcggccgc aaccgttcaa gaaagcctgt 2520
ggaaacgcag ctcgtctcgc actcccagcc gccggccgcc tccgtcgtgc cgcccctcgc 2580
caccggcttc cacaaccact ccctctaccc cgccatcggc ggcaccaacg gtggtggagg 2640
cggggggaac aacggcatgt ccatgcccgg cacgttctcc tccgcgctgg ggccgcctca 2700
gcagcacatg ggcaacaatg ccgcctctcc ctacgcggct ctcggcggcg ccggaacatg 2760
caaagatttc aggtataccg catatggaat aagatctttg gcagacgagc agagtcagct 2820
catgacagaa gccatgaaca cctccgtgga gaacccatgg cgcctgccgc catcttctca 2880
aacgactaca ttcccgctct caagctactc tcctcagctt ggagcaacga gtgacctggg 2940
tcagaacaac agcagcaaca acaacagcgg cgtcaaggcc gagggacagc agcagcagca 3000
gccgctctcc ttcccggggt gcggcgactt cggcagcggc gactccgcga agcaggagaa 3060
ccagacgctg cggccgttct tcgacgagtg gccgaagacg agggactcgt ggtcggacct 3120
gaccgacgac aactcgaacg tcgcctcctt ctcggccacc cagctgtcga tctcgatacc 3180
catgacgtcc tccgacttct ccgccgccag ctcccagtcg cccaacggca tgctgttcgc 3240
cggcgagatg taccgttcag ctttcttgta caaagtggtt gataacagcg ggttaattaa 3300
catcttttac ccatacgatg ttcctgacta tgcgggctat ccctatgacg tcccggacta 3360
tgcaggatcc tatccatatg acgttccaga ttacgctgct cagtgcagcg cttagagctc 3420
gaatttcccc gatcgttcaa acatttggca ataaagtttc ttaagattga atcctgttgc 3480
cggtcttgcg atgattatca tataatttct gttgaattac gttaagcatg taataattaa 3540
catgtaatgc atgacgttat ttatgagatg ggtttttatg attagagtcc cgcaattata 3600
catttaatac gcgatagaaa acaaaatata gcgcgcaaac taggataaat tatcgcgcgc 3660
ggtgtcatct atgttactag atcgggaatt agcactagta acggccgcca gtgtgctgga 3720
attcgccctt taaataagcg atcgccccga gcaataatct ccaggagatc aaataccttc 3780
ccaagaaggt taaagatgca gtcaaaagat tcaggactaa ctgcatcaag aacacagaga 3840
aagatatatt tctcaagatc agaagtacta ttccagtatg gacgattcaa ggcttgcttc 3900
acaaaccaag gcaagtaata gagattggag tctctaaaaa ggtagttccc actgaatcaa 3960
aggccatgga gtcaaagatt caaatagagg acctaacaga actcgccgta aagactggcg 4020
aacagttcat acagagtctc ttacgactca atgacaagaa gaaaatcttc gtcaacatgg 4080
tggagcacga cacgcttgtc tactccaaaa atatcaaaga tacagtctca gaagaccaaa 4140
gggcaattga gacttttcaa caaagggtaa tatccggaaa cctcctcgga ttccattgcc 4200
cagctatctg tcactttatt gtgaagatag tggaaaagga aggtggctcc tacaaatgcc 4260
atcattgcga taaaggaaag gccatcgttg aagatgcctc tgccgacagt ggtcccaaag 4320
atggaccccc acccacgagg agcatcgtgg aaaaagaaga cgttccaacc acgtcttcaa 4380
agcaagtgga ttgatgtgat atctccactg acgtaaggga tgacgcacaa tcccactatc 4440
cttcgcaaga cccttcctct atataaggaa gttcatttca tttggagagg acacggggga 4500
ctctagagga tccccgatcc ctacagggta aatttctagt ttttctcctt cattttcttg 4560
gttaggaccc ttttctcttt ttattttttt gagctttgat ctttctttaa actgatctat 4620
tttttaattg attggttatg gtgtaaatat tacatagctt taactgataa tctgattact 4680
ttatttcgtg tgtctatgat gatgatgata gttacagaac cgtcgagggg gatccaatga 4740
gatatgaaaa agcctgaact caccgcgacg tctgtcgaga agtttctgat cgaaaagttc 4800
gacagcgtct ccgacctgat gcagctctcg gagggcgaag aatctcgtgc tttcagcttc 4860
gatgtaggag ggcgtggata tgtcctgcgg gtaaatagct gcgccgatgg tttctacaaa 4920
gatcgttatg tttatcggca ctttgcatcg gccgcgctcc cgattccgga agtgcttgac 4980
attggggaat tcagcgagag cctgacctat tgcatctccc gccgtgcaca gggtgtcacg 5040
ttgcaagacc tgcctgaaac cgaactgccc gctgttctgc agccggtcgc ggaggccatg 5100
gatgcgatcg ctgcggccga tcttagccag acgagcgggt tcggcccatt cggaccgcaa 5160
ggaatcggtc aatacactac atggcgtgat ttcatatgcg cgattgctga tccccatgtg 5220
tatcactggc aaactgtgat ggacgacacc gtcagtgcgt ccgtcgcgca ggctctcgat 5280
gagctgatgc tttgggccga ggactgcccc gaagtccggc acctcgtgca cgcggatttc 5340
ggctccaaca atgtcctgac ggacaatggc cgcataacag cggtcattga ctggagcgag 5400
gcgatgttcg gggattccca atacgaggtc gccaacatct tcttctggag gccgtggttg 5460
gcttgtatgg agcagcagac gcgctacttc gagcggaggc atccggagct tgcaggatcg 5520
ccgcggctcc gggcgtatat gctccgcatt ggtcttgacc aactctatca gagcttggtt 5580
gacggcaatt tcgatgatgc agcttgggcg cagggtcgat gcgacgcaat cgtccgatcc 5640
ggagccggga ctgtcgggcg tacacaaatc gcccgcagaa gcgcggccgt ctggaccgat 5700
ggctgtgtag aagtactcgc cgatagtgga aaccgacgcc ccagcactcg tccgagggca 5760
aaggaataga gtagatttaa attgaaatca ccagtctctc tctacaaatc tatctctctc 5820
tataataatg tgtgagtagt tcccagataa gggaattagg gttcttatag ggtttcgctc 5880
atgtgttgag catataagaa acccttagta tgtatttgta tttgtaaaat acttctatca 5940
ataaaatttc taattcctaa aaccaaaatc cagtgggcgc ggggtaccga gctcgaattc 6000
agtacattaa aaacgtccgc aatgtgttat taagttgtct aagcgtcaat ttgtttacac 6060
cacaatatat cctgccacca gccagccaac agctccccga ccggcagctc ggcacaaaat 6120
caccactcga tacaggcagc ccatcagtcc gggacggcgt cagcgggaga gccgttgtaa 6180
ggcggcagac tttgctcatg ttaccgatgc tattcggaag aacggcaact aagctgccgg 6240
gtttgaaaca cggatgatct cgcggagggt agcatgttga ttgtaacgat gacagagcgt 6300
tgctgcctgt gatcaaatat catctccctc gcagagatcc gaattatcag ccttcttatt 6360
catttctcgc ttaaccgtga caggctgtcg atcttgagaa ctatgccgac ataataggaa 6420
atcgctggat aaagccgctg aggaagctga gtggcgctat ttctttagaa gtgaacgttg 6480
acgatcgtcg accgtacccc gatgaattaa ttcggacgta cgttctgaac acagctggat 6540
acttacttgg gcgattgtca tacatgacat caacaatgta cccgtttgtg taaccgtctc 6600
ttggaggttc gtatgacact aggtcgctac cttaggaccg ttatagttac tagcgaattg 6660
acatgaggtt gccccgtatt cagtgtcgct gatttgtatt gtctgaagtt gtttttacgt 6720
taagttgatg cagatcaatt aatacgatac ctgcgtcata attgattatt tgacgtggtt 6780
tgatggcctc cacgcacgtt gtgatatgta gatgataatc attatcactt tacgggtcct 6840
ttccggtgat ccgacaggtt acggggcggc gacctcgcgg gttttcgcta tttatgaaaa 6900
ttttccggtt taaggcgttt ccgttcttct tcgtcataac ttaatgtttt tatttaaaat 6960
accctctgaa aagaaaggaa acgacaggtg ctgaaagcga gctttttggc ctctgtcgtt 7020
tcctttctct gtttttgtcc gtggaatgaa caatggaagg atcttctcgg cggcgatcac 7080
gacgccggcc ctgcggagcc ttcgccgcgt gcgcgattca tggcggccgt ggaggccaag 7140
gatttcgcgc gagtgcaaga gctgatcgag gcgcgtggag ccaagtcggc ggctgattat 7200
gtccttgcgc agctcgccgt ggccgaaggt ctggaccgca agcctggtgc gcgcgtcgtg 7260
gtcgggaaag cggcgggcag catggcaatg ccgcctgcgg cgctgggttt tacgccaagg 7320
ggagaagcgg catacgccat cgagcggtca gcctatggtg agccgaggtc cagcattgcg 7380
aagcagtacc agcaggaatg gaaccggaag gcggcgacct ggtgggcgat ggccggtgtg 7440
gccggcatca tcggcgcgat cctggcggcg gcggcaaccg gctttgttgg gctggcagtg 7500
tcgatccgca accgagtgaa gcgcgtgcgc gacctgttgg tgatggagcc gggtgcagag 7560
ccataagcgg caagagacga aagcccggtt tccgggcttt tgttttgtta cgccaaggac 7620
gagttttagc ggctaaaggt gttgacgtgc gagaaatgtt tagctaaact tctctcatgt 7680
gctggcggct gtcaccgcta tgttcaacca aggcgcggag caaattatgg gtgttatcca 7740
tgaagaaacg gcttaccgaa agccagttcc aggaggcgat ccaggggctg gaagtggggc 7800
agcagaccat cgagatagcg cggggcgtct tagtcgatgg gaagccacag gcgacgttcg 7860
caacgtcgct gggactgacc aggggcgcag tgtcgcaagc ggtgcatcgc gtgtgggccg 7920
cgttcgagga caagaacttg cccgaggggt acgcgcgggt aacggcggtt ctgccggaac 7980
atcaggcgta catcgtccgg aagtgggaag cggacgccaa gaaaaaacag gaaaccaaac 8040
gatgaaaact ttggtcacgg ccaaccagaa aggcggcgtc ggcaagactt cgacccttgt 8100
gcatcttgcc ttcgactttt tcgagcgcgg cttgcgggtt gccgtgatcg acctggaccc 8160
ccagggcaat gcgtcctaca cgctcaagga ctttgctacc ggcctgcatg caagcaagct 8220
gttcggcgct gtccctgccg gcggctggac cgaaaccgca cccgcagccg gcgacggcca 8280
ggccgcgcgc ctcgccctca tcgagtccaa cccggtactg gcgaacgccg aacggctgtc 8340
gctggacgac gcccgcgagc tgttcggggc gaacatcaag gccctggcga accaaggctt 8400
cgacgtgtgc ctgatcgaca cggccccgac ccttggcgtc ggcctggcgg ccgccctctt 8460
cgcggccgac tatgtgctgt cccccatcga gcttgaggcg tacagcatcc agggcatcaa 8520
gaagatggtc acgaccattg cgaacgtgcg ccagaagaac gccaagctgc aattccttgg 8580
catggtgccc agcaaggtcg atgcgcggaa tccgcgccac gcgcgccacc aagccgagct 8640
gctggccgcg taccccaaga tgatgattcc ggccaccgtt ggcctgcgca gcagcatcgc 8700
cgatgccctc gcatccggtg tgccggtctg gaagatcaag aaaacggccg cgcgcaaggc 8760
atcgaaagag gttcgcgccc tggctgatta cgtgttcacg aagatggaga tttcccaatg 8820
actgcggctc aagccaagac caccaagaaa aacaccgctg cggccgctca ggaagccgca 8880
ggcgcggcgc agccgtccgg cctggggttg gatagcatcg gcgacctgtc gagcctcctg 8940
gacgctcctg cggcgtctca gggcggttcc ggccctatcg agctggacct ggacctgatc 9000
gacgaagatc cgcatcagcc gcggacggcc gacaaccccg gcttttcccc ggagagcatc 9060
gcggaaatcg gtgccacgat caaagagcgc ggggtgaagt cacccatttc ggtgcgcgag 9120
aaccaggagc agccgggccg ctatatcatc aatcacggcg cccgccgcta ccgtggctcg 9180
aagtgggccg gcaagaagtc catcccggcg ttcatcgaca acgactacaa cgaagccgac 9240
caggttatcg agaacctgca acgcaacgag ctgaccccgc gcgaaattgc cgacttcatt 9300
ggccgcgagc tggcgaaggg caagaagaaa ggcgatatcg ccaaggaaat cggcaagtcg 9360
ccggcgttca tcacccagca cgtcacgctg ctggacctgc cggagaagat cgccgatgcg 9420
ttcaacaccg gccgcgtgcg cgacgtgacc gtggtgaacg agctggtgac ggccttcaag 9480
aagcgcccgg aggaagtcga ggcgtggctt gacgacgaca cccaggaaat cacgcgcggc 9540
acggtcaagc tgctgcgcga gttcctggac gagaagggcc gcgatcccaa caccgtcgat 9600
gccttcaacg gccagactga tgccgagcgt gacgcggagg ccggcgacgg ccaggacggc 9660
gaggacggcg accaggacgg taaggacgcc aaggaaaagg gcgcgaagga gccggacccg 9720
gacaagctga aaaaggccat cgtccaggtc gagcacgacg agcgccctgc ccgccttatc 9780
ctcaaccgtc ggccgccggc ggaaggctat gcctggttga agtacgagga cgacggccag 9840
gagttcgagg cgaaccttgc cgacgtgaaa ctggtcgcgc tcatcgaggg ctgatcccca 9900
aagacagcgg cgcgggccac ccgcgccgca cagacaacgg ttccgctaca aggaggaccg 9960
aagaatgaat ccgatgctgt tctacatcgc gggaggcgta ggcgcggcgt tgctgctggt 10020
ttccgcgatc atgctgttca agctgcgcga gccgaagaag gaacaccgac cgcagcgcaa 10080
ggcggcggcc ccgacgccgc agccggtcga taacgagctg ctgcgcactc tagtgatatt 10140
ccacaaaaca gcagggaagc agcgcttttc cgctgcataa ccctgcttcg gggtcattat 10200
agcgattttt tcggtatatc catccttttt cgcacgatat acaggatttt gccaaagggt 10260
tcgtgtagac tttccttggt gtatccaacg gcgtcagccg ggcaggatag gtgaagtagg 10320
cccacccgcg agcgggtgtt ccttcttcac tgtcccttat tcgcacctgg cggtgctcaa 10380
cgggaatcct gctctgcgag gctggccggc taccgccggc gtaacagatg agggcaagcg 10440
gatggctgat gaaaccaagc caaccaggaa gggcagccca cctatcaagg tgtactgcct 10500
tccagacgaa cgaagagcga ttgaggaaaa ggcggcggcg gccggcatga gcctgtcggc 10560
ctacctgctg gccgtcggcc agggctacaa aatcacgggc gtcgtggact atgagcacgt 10620
ccgcgagctg gcccgcatca atggcgacct gggccgcctg ggcggcctgc tgaaactctg 10680
gctcaccgac gacccgcgca cggcgcggtt cggtgatgcc acgatcctcg ccctgctggc 10740
gaagatcgaa gagaagcagg acgagcttgg caaggtcatg atgggcgtgg tccgcccgag 10800
ggcagagcca tgactttttt agccgctaaa acggccgggg ggtgcgcgtg attgccaagc 10860
acgtccccat gcgctccatc aagaagagcg acttcgcgga gctggtgaag tacatcaccg 10920
acgagcaagg caagaccgag cgccagatcc aaaacaactg tcaaagcgca cccgcccgat 10980
gccattcgcg gcacggcttc cgttgaggat gtcgatatga tgcgcgagcc gacggcccgc 11040
agagaagggg ccgttttagc ggctaaagaa ggaagtgcaa gccctaaccc ttggcgtcag 11100
agccttccac gcagcttttt tcgggtgtcg tcgccccatt tctttacgat aaacgcctta 11160
tgtgacggca aaaccacact gatgcgttcg tatccgggcg gcacgctgct cttgaaagga 11220
tgacccgcaa tctccgcgag tgcctcgcgg tcaaggtcgg tggactccag gagaagaggt 11280
aggggagttt ccagggcgtc ggcaatggcc tccatcacct tcaacgaggg gttggcctta 11340
ccgttggtta agtctgataa aaacgaaatt gaaacccctg ccctctccga cagctcatgt 11400
ttcgtcatgc cccgctcatc gagcagacga aggatgttgg tgaaaaatat ctggttgtac 11460
acagcggaag ccgcccctcg cacctttggt cgcggcccgc aaaattttag ccgctaaagt 11520
tcttgacagc ggaaccaatg tttagctaaa ctagagtctc ctttctcaag gagactttcg 11580
atatgagcca taatcagttc cagtttatcg gtaatcttac ccgtgacacc gaggtacgtc 11640
atggcaattc taacaagccg caagcaattt tcgatatagc ggttaatgaa gagtggcgca 11700
acgatgccgg cgacaagcag gagcgcaccg acttcttccg catcaagtgt tttggctctc 11760
aggccgaggc ccacggcaag tatttgggca aggggtcgct ggtattcgtg cagggcaaga 11820
ttcggaatac caagtacgag aaggacggcc agacggtcta cgggaccgac ttcattgccg 11880
ataaggtgga ttatctggac accaaggcac caggcgggtc aaatcaggaa taagggcaca 11940
ttgccccggc gtgagtcggg gcaatcccgc aaggagggtg aatgaatcgg acgtttgacc 12000
ggaaggcata caggcaagaa ctgatcgacg cggggttttc cgccgaggat gccgaaacca 12060
tcgcaagccg caccgtcatg cgtgcgcccc gcgaaacctt ccagtccgtc ggctcgatgg 12120
tccagcaagc tacggccaag atcgagcgcg acagcgtgca actggctccc cctgccctgc 12180
ccgcgccatc ggccgccgtg gagcgttcgc gtcgtctcga acaggaggcg gcaggtttgg 12240
cgaagtcgat gaccatcgac acgcgaggaa ctatgacgac caagaagcga aaaaccgccg 12300
gcgaggacct ggcaaaacag gtcagcgagg ccaagcaggc cgcgttgctg aaacacacga 12360
agcagcagat caaggaaatg cagctttcct tgttcgatat tgcgccgtgg ccggacacga 12420
tgcgagcgat gccaaacgac acggcccgct ctgccctgtt caccacgcgc aacaagaaaa 12480
tcccgcgcga ggcgctgcaa aacaaggtca ttttccacgt caacaaggac gtgaagatca 12540
cctacaccgg cgtcgagctg cgggccgacg atgacgaact ggtgtggcag caggtgttgg 12600
agtacgcgaa gcgcacccct atcggcgagc cgatcacctt cacgttctac gagctttgcc 12660
aggacctggg ctggtcgatc aatggccggt attacacgaa ggccgaggaa tgcctgtcgc 12720
gcctacaggc gacggcgatg ggcttcacgt ccgaccgcgt tgggcacctg gaatcggtgt 12780
cgctgctgca ccgcttccgc gtcctggacc gtggcaagaa aacgtcccgt tgccaggtcc 12840
tgatcgacga ggaaatcgtc gtgctgtttg ctggcgacca ctacacgaaa ttcatatggg 12900
agaagtaccg caagctgtcg ccgacggccc gacggatgtt cgactatttc agctcgcacc 12960
gggagccgta cccgctcaag ctggaaacct tccgcctcat gtgcggatcg gattccaccc 13020
gcgtgaagaa gtggcgcgag caggtcggcg aagcctgcga agagttgcga ggcagcggcc 13080
tggtggaaca cgcctgggtc aatgatgacc tggtgcattg caaacgctag ggccttgtgg 13140
ggtcagttcc ggctgggggt tcagcagcca gcgctttact ggcatttcag gaacaagcgg 13200
gcactgctcg acgcacttgc ttcgctcagt atcgctcggg acgcacggcg cgctctacga 13260
actgccgata aacagaggat taaaattgac aattgtgatt aaggctcaga ttcgacggct 13320
tggagcggcc gacgtgcagg atttccgcga gatccgattg tcggccctga agaaagctcc 13380
agagatgttc gggtccgttt acgagcacga ggagaaaaag cccatggagg cgttcgctga 13440
acggttgcga gatgccgtgg cattcggcgc ctacatcgac ggcgagatca ttgggctgtc 13500
ggtcttcaaa caggaggacg gccccaagga cgctcacaag gcgcatctgt ccggcgtttt 13560
cgtggagccc gaacagcgag gccgaggggt cgccggtatg ctgctgcggg cgttgccggc 13620
gggtttattg ctcgtgatga tcgtccgaca gattccaacg ggaatctggt ggatgcgcat 13680
cttcatcctc ggcgcactta atatttcgct attctggagc ttgttgttta tttcggtcta 13740
ccgcctgccg ggcggggtcg cggcgacggt aggcgctgtg cagccgctga tggtcgtgtt 13800
catctctgcc gctctgctag gtagcccgat acgattgatg gcggtcctgg gggctatttg 13860
cggaactgcg ggcgtggcgc tgttggtgtt gacaccaaac gcagcgctag atcctgtcgg 13920
cgtcgcagcg ggcctggcgg gggcggtttc catggcgttc ggaaccgtgc tgacccgcaa 13980
gtggcaacct cccgtgcctc tgctcacctt taccgcctgg caactggcgg ccggaggact 14040
tctgctcgtt ccagtagctt tagtgtttga tccgccaatc ccgatgccta caggaaccaa 14100
tgttctcggc tgctcgactg cacgaatacc agcgacccct tgcccaaata cttgccgtgg 14160
gcctcggcct gagagccaaa acacttgatg cggaagaagt cggtgcgctc ctgcttgtcg 14220
ccggcatcgt tgcgccacat ctaggtacta aaacaattca tccagtaaaa tataatattt 14280
tattttctcc caatcaggct tgatccccag taagtcaaaa aatagctcga catactgttc 14340
ttccccgata tcctccctga tcgaccggac gcagaaggca atgtcatacc acttgtccgc 14400
cctgccgctt ctcccaagat caataaagcc acttactttg ccatctttca caaagatgtt 14460
gctgtctccc aggtcgccgt gggaaaagac aagttcctct tcgggctttt ccgtctttaa 14520
aaaatcatac agctcgcgcg gatctttaaa tggagtgtct tcttcccagt tttcgcaatc 14580
cacatcggcc agatcgttat tcagtaagta atccaattcg gctaagcggc tgtctaagct 14640
attcgtatag ggacaatccg atatgtcgat ggagtgaaag agcctgatgc actccgcata 14700
cagctcgata atcttttcag ggctttgttc atcttcatac tcttccgagc aaaggacgcc 14760
atcggcctca ctcatgagca gattgctcca gccatcatgc cgttcaaagt gcaggacctt 14820
tggaacaggc agctttcctt ccagccatag catcatgtcc ttttcccgtt ccacatcata 14880
ggtggtccct ttataccggc tgtccgtcat ttttaaatat aggttttcat tttctcccac 14940
cagcttatat accttagcag gagacattcc ttccgtatct tttacgcagc ggtatttttc 15000
gatcagtttt ttcaattccg gtgatattct cattttagcc atttattatt tccttcctct 15060
tttctacagt atttaaagat accccaagaa gctaattata acaagacgaa ctccaattca 15120
ctgttccttg cattctaaaa ccttaaatac cagaaaacag ctttttcaaa gttgttttca 15180
aagttggcgt ataacatagt atcgattcga tagcgtggac tcaaggctct cgcgaatggc 15240
tcgcgttgga aactttcatt gacacttgag gggcaccgca gggaaattct cgtccttgcg 15300
agaaccggct atgtcgtgct gcgcatcgag cctgcgccct tggcttgtct cgcccctctc 15360
cgcgtcgcta cggggcttcc agcgcctttc cgacgctcac cgggctggtt gccctcgccg 15420
ctgggctggc ggccgtctat ggccctgcaa acgcgccaga aacgccgtcg aagccgtgtg 15480
cgagacaccg cggccgccgg cgttgtggat acctcgcgga aaacttggcc ctcactgaca 15540
gatgaggggc ggacgttgac acttgagggg ccgactcacc cggcgcggcg ttgacagatg 15600
aggggcaggc tcgatttcgg ccggcgacgt ggagctggcc agcctcgcaa atcggcgaaa 15660
acgcctgatt ttacgcgagt ttcccacaga tgatgtggac aagcctgggg ataagtgccc 15720
tgcggtattg acacttgagg ggcgcgacta ctgacagatg aggggcgcga tccttgacac 15780
ttgaggggca gagtgctgac agatgagggg cgcacctatt gacatttgag gggctgtcca 15840
caggcagaaa atccagcatt tgcaagggtt tccgcccgtt tttcggccac cgctaacctg 15900
tcttttaacc tgcttttaaa ccaatattta taaaccttgt ttttaaccag ggctgcgccc 15960
tgtgcgcgtg accgcgcacg ccgaaggggg gtgccccccc ttctcgaacc ctcccggccc 16020
gctaacgcgg gcctcccatc cccccagggg ctgcgcccct cggccgcgaa cggcctcacc 16080
ccaaaaatgg cagcgccagc caggacgtcg gccgaaagag cgacaagcag atcacgcttt 16140
tcgacagcgt cggatttgcg atcgaggatt tttcggcgct gcgctacgtc cgcgaccgcg 16200
ttgagggatc aagccacagc agcccactcg accttctagc cgacccagac gagccaaggg 16260
atctttttgg aatgctgctc cgtcgtcagg ctttccgacg tttgggtggt tgaacagaag 16320
tcattatcgc acggaatgcc aagcactccc gaggggaacc ctgtggttgg catgcacata 16380
caaatggacg aacggataaa ccttttcacg cccttttaaa tatccgatta ttctaataaa 16440
cgctcttttc tcttaggttt acccgccaat atatcctgtc aaacactgat agtttaaact 16500
gaaggcggga aacgacaatc tgatcatgag cggagaatta agggagtcac gttatgaccc 16560
ccgccgatga cgcgggacaa gccgttttac gtttggaact gacagaaccg caacgttgaa 16620
ggagccactc agc 16633
<210> 2
<211> 16087
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 2
aagctttgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60
agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120
tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180
tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240
gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300
ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360
gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt 420
agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480
taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540
aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600
cgagtctaac ggacaccaac cagcgaacca gcagcgtcgc gtcgggccaa gcgaagcaga 660
cggcacggca tctctgtcgc tgcctctgga cccctctcga gagttccgct ccaccgttgg 720
acttgctccg ctgtcggcat ccagaaattg cgtggcggag cggcagacgt gagccggcac 780
ggcaggcggc ctcctcctcc tctcacggca cggcagctac gggggattcc tttcccaccg 840
ctccttcgct ttcccttcct cgcccgccgt aataaataga caccccctcc acaccctctt 900
tccccaacct cgtgttgttc ggagcgcaca cacacacaac cagatctccc ccaaatccac 960
ccgtcggcac ctccgcttca aggtacgccg ctcgtcctcc cccccccccc ctctctacct 1020
tctctagatc ggcgttccgg tccatggtta gggcccggta gttctacttc tgttcatgtt 1080
tgtgttagat ccgtgtttgt gttagatccg tgctgctagc gttcgtacac ggatgcgacc 1140
tgtacgtcag acacgttctg attgctaact tgccagtgtt tctctttggg gaatcctggg 1200
atggctctag ccgttccgca gacgggatcg atttcatgat tttttttgtt tcgttgcata 1260
gggtttggtt tgcccttttc ctttatttca atatatgccg tgcacttgtt tgtcgggtca 1320
tcttttcatg cttttttttg tcttggttgt gatgatgtgg tctggttggg cggtcgttct 1380
agatcggagt agaattctgt ttcaaactac ctggtggatt tattaatttt ggatctgtat 1440
gtgtgtgcca tacatattca tagttacgaa ttgaagatga tggatggaaa tatcgatcta 1500
ggataggtat acatgttgat gcgggtttta ctgatgcata tacagagatg ctttttgttc 1560
gcttggttgt gatgatgtgg tgtggttggg cggtcgttca ttcgttctag atcggagtag 1620
aatactgttt caaactacct ggtgtattta ttaattttgg aactgtatgt gtgtgtcata 1680
catcttcata gttacgagtt taagatggat ggaaatatcg atctaggata ggtatacatg 1740
ttgatgtggg ttttactgat gcatatacat gatggcatat gcagcatcta ttcatatgct 1800
ctaaccttga gtacctatct attataataa acaagtatgt tttataatta ttttgatctt 1860
gatatacttg gatgatggca tatgcagcag ctatatgtgg atttttttag ccctgccttc 1920
atacgctatt tatttgcttg gtactgtttc ttttgtcgat gctcaccctg ttgtttggtg 1980
ttacttctgc agactagagt tatcaacaag tttgtacaaa aaagctgcca ccatgcagca 2040
gcaacacctg atgcagatga accagagcat gatggggggc tacgcttcct ctaccactgc 2100
caccactgat ctcattcagc agtacctgga tgagaacaag cagctgatcc tggccatcct 2160
cgacaaccag aacaacggca aggtggagga gtgcgcacgg aaccaagcta agctccagca 2220
gaacctcatg tacctcgccg ccatcgccga cagccagcct ccgcagacgg catcgctgtc 2280
tcagtacccg tccaacctga tgatgcagtc cgggccgcgg tacatgcagc agcagtcggc 2340
gcagatgatg tcgccgcagt cgctgatggc ggcgcggtcg tcgatgatgt acgcgcagca 2400
ggccatgtcg ccgctccagc agcagcagca gcagcagcag caccaggcgg ccgcgcacgg 2460
ccagctgggg atgtcctccg gcgcgaccac cgggttcaac ctcctgcacg gtgaggccag 2520
catgggcggc ggcggcggcg ccactggcaa cagcatgatg aacgccagcg tcttctcgga 2580
ctatggccgc ggcggcagcg gcgccaagga ggggtcgacc tcgctgtcgg ccgacgctcg 2640
cggcgccaac tctggcgcgc acagcggcga cggggagtac ctcaagggca ccgaggagga 2700
aggaagccgt tcagctttct tgtacaaagt ggttgataac agcgggttaa ttaacatctt 2760
ttacccatac gatgttcctg actatgcggg ctatccctat gacgtcccgg actatgcagg 2820
atcctatcca tatgacgttc cagattacgc tgctcagtgc agcgcttaga gctcgaattt 2880
ccccgatcgt tcaaacattt ggcaataaag tttcttaaga ttgaatcctg ttgccggtct 2940
tgcgatgatt atcatataat ttctgttgaa ttacgttaag catgtaataa ttaacatgta 3000
atgcatgacg ttatttatga gatgggtttt tatgattaga gtcccgcaat tatacattta 3060
atacgcgata gaaaacaaaa tatagcgcgc aaactaggat aaattatcgc gcgcggtgtc 3120
atctatgtta ctagatcggg aattagcact agtaacggcc gccagtgtgc tggaattcgc 3180
cctttaaata agcgatcgcc ccgagcaata atctccagga gatcaaatac cttcccaaga 3240
aggttaaaga tgcagtcaaa agattcagga ctaactgcat caagaacaca gagaaagata 3300
tatttctcaa gatcagaagt actattccag tatggacgat tcaaggcttg cttcacaaac 3360
caaggcaagt aatagagatt ggagtctcta aaaaggtagt tcccactgaa tcaaaggcca 3420
tggagtcaaa gattcaaata gaggacctaa cagaactcgc cgtaaagact ggcgaacagt 3480
tcatacagag tctcttacga ctcaatgaca agaagaaaat cttcgtcaac atggtggagc 3540
acgacacgct tgtctactcc aaaaatatca aagatacagt ctcagaagac caaagggcaa 3600
ttgagacttt tcaacaaagg gtaatatccg gaaacctcct cggattccat tgcccagcta 3660
tctgtcactt tattgtgaag atagtggaaa aggaaggtgg ctcctacaaa tgccatcatt 3720
gcgataaagg aaaggccatc gttgaagatg cctctgccga cagtggtccc aaagatggac 3780
ccccacccac gaggagcatc gtggaaaaag aagacgttcc aaccacgtct tcaaagcaag 3840
tggattgatg tgatatctcc actgacgtaa gggatgacgc acaatcccac tatccttcgc 3900
aagacccttc ctctatataa ggaagttcat ttcatttgga gaggacacgg gggactctag 3960
aggatccccg atccctacag ggtaaatttc tagtttttct ccttcatttt cttggttagg 4020
acccttttct ctttttattt ttttgagctt tgatctttct ttaaactgat ctatttttta 4080
attgattggt tatggtgtaa atattacata gctttaactg ataatctgat tactttattt 4140
cgtgtgtcta tgatgatgat gatagttaca gaaccgtcga gggggatcca atgagatatg 4200
aaaaagcctg aactcaccgc gacgtctgtc gagaagtttc tgatcgaaaa gttcgacagc 4260
gtctccgacc tgatgcagct ctcggagggc gaagaatctc gtgctttcag cttcgatgta 4320
ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg atggtttcta caaagatcgt 4380
tatgtttatc ggcactttgc atcggccgcg ctcccgattc cggaagtgct tgacattggg 4440
gaattcagcg agagcctgac ctattgcatc tcccgccgtg cacagggtgt cacgttgcaa 4500
gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg tcgcggaggc catggatgcg 4560
atcgctgcgg ccgatcttag ccagacgagc gggttcggcc cattcggacc gcaaggaatc 4620
ggtcaataca ctacatggcg tgatttcata tgcgcgattg ctgatcccca tgtgtatcac 4680
tggcaaactg tgatggacga caccgtcagt gcgtccgtcg cgcaggctct cgatgagctg 4740
atgctttggg ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga tttcggctcc 4800
aacaatgtcc tgacggacaa tggccgcata acagcggtca ttgactggag cgaggcgatg 4860
ttcggggatt cccaatacga ggtcgccaac atcttcttct ggaggccgtg gttggcttgt 4920
atggagcagc agacgcgcta cttcgagcgg aggcatccgg agcttgcagg atcgccgcgg 4980
ctccgggcgt atatgctccg cattggtctt gaccaactct atcagagctt ggttgacggc 5040
aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg caatcgtccg atccggagcc 5100
gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg ccgtctggac cgatggctgt 5160
gtagaagtac tcgccgatag tggaaaccga cgccccagca ctcgtccgag ggcaaaggaa 5220
tagagtagat ttaaattgaa atcaccagtc tctctctaca aatctatctc tctctataat 5280
aatgtgtgag tagttcccag ataagggaat tagggttctt atagggtttc gctcatgtgt 5340
tgagcatata agaaaccctt agtatgtatt tgtatttgta aaatacttct atcaataaaa 5400
tttctaattc ctaaaaccaa aatccagtgg gcgcggggta ccgagctcga attcagtaca 5460
ttaaaaacgt ccgcaatgtg ttattaagtt gtctaagcgt caatttgttt acaccacaat 5520
atatcctgcc accagccagc caacagctcc ccgaccggca gctcggcaca aaatcaccac 5580
tcgatacagg cagcccatca gtccgggacg gcgtcagcgg gagagccgtt gtaaggcggc 5640
agactttgct catgttaccg atgctattcg gaagaacggc aactaagctg ccgggtttga 5700
aacacggatg atctcgcgga gggtagcatg ttgattgtaa cgatgacaga gcgttgctgc 5760
ctgtgatcaa atatcatctc cctcgcagag atccgaatta tcagccttct tattcatttc 5820
tcgcttaacc gtgacaggct gtcgatcttg agaactatgc cgacataata ggaaatcgct 5880
ggataaagcc gctgaggaag ctgagtggcg ctatttcttt agaagtgaac gttgacgatc 5940
gtcgaccgta ccccgatgaa ttaattcgga cgtacgttct gaacacagct ggatacttac 6000
ttgggcgatt gtcatacatg acatcaacaa tgtacccgtt tgtgtaaccg tctcttggag 6060
gttcgtatga cactaggtcg ctaccttagg accgttatag ttactagcga attgacatga 6120
ggttgccccg tattcagtgt cgctgatttg tattgtctga agttgttttt acgttaagtt 6180
gatgcagatc aattaatacg atacctgcgt cataattgat tatttgacgt ggtttgatgg 6240
cctccacgca cgttgtgata tgtagatgat aatcattatc actttacggg tcctttccgg 6300
tgatccgaca ggttacgggg cggcgacctc gcgggttttc gctatttatg aaaattttcc 6360
ggtttaaggc gtttccgttc ttcttcgtca taacttaatg tttttattta aaataccctc 6420
tgaaaagaaa ggaaacgaca ggtgctgaaa gcgagctttt tggcctctgt cgtttccttt 6480
ctctgttttt gtccgtggaa tgaacaatgg aaggatcttc tcggcggcga tcacgacgcc 6540
ggccctgcgg agccttcgcc gcgtgcgcga ttcatggcgg ccgtggaggc caaggatttc 6600
gcgcgagtgc aagagctgat cgaggcgcgt ggagccaagt cggcggctga ttatgtcctt 6660
gcgcagctcg ccgtggccga aggtctggac cgcaagcctg gtgcgcgcgt cgtggtcggg 6720
aaagcggcgg gcagcatggc aatgccgcct gcggcgctgg gttttacgcc aaggggagaa 6780
gcggcatacg ccatcgagcg gtcagcctat ggtgagccga ggtccagcat tgcgaagcag 6840
taccagcagg aatggaaccg gaaggcggcg acctggtggg cgatggccgg tgtggccggc 6900
atcatcggcg cgatcctggc ggcggcggca accggctttg ttgggctggc agtgtcgatc 6960
cgcaaccgag tgaagcgcgt gcgcgacctg ttggtgatgg agccgggtgc agagccataa 7020
gcggcaagag acgaaagccc ggtttccggg cttttgtttt gttacgccaa ggacgagttt 7080
tagcggctaa aggtgttgac gtgcgagaaa tgtttagcta aacttctctc atgtgctggc 7140
ggctgtcacc gctatgttca accaaggcgc ggagcaaatt atgggtgtta tccatgaaga 7200
aacggcttac cgaaagccag ttccaggagg cgatccaggg gctggaagtg gggcagcaga 7260
ccatcgagat agcgcggggc gtcttagtcg atgggaagcc acaggcgacg ttcgcaacgt 7320
cgctgggact gaccaggggc gcagtgtcgc aagcggtgca tcgcgtgtgg gccgcgttcg 7380
aggacaagaa cttgcccgag gggtacgcgc gggtaacggc ggttctgccg gaacatcagg 7440
cgtacatcgt ccggaagtgg gaagcggacg ccaagaaaaa acaggaaacc aaacgatgaa 7500
aactttggtc acggccaacc agaaaggcgg cgtcggcaag acttcgaccc ttgtgcatct 7560
tgccttcgac tttttcgagc gcggcttgcg ggttgccgtg atcgacctgg acccccaggg 7620
caatgcgtcc tacacgctca aggactttgc taccggcctg catgcaagca agctgttcgg 7680
cgctgtccct gccggcggct ggaccgaaac cgcacccgca gccggcgacg gccaggccgc 7740
gcgcctcgcc ctcatcgagt ccaacccggt actggcgaac gccgaacggc tgtcgctgga 7800
cgacgcccgc gagctgttcg gggcgaacat caaggccctg gcgaaccaag gcttcgacgt 7860
gtgcctgatc gacacggccc cgacccttgg cgtcggcctg gcggccgccc tcttcgcggc 7920
cgactatgtg ctgtccccca tcgagcttga ggcgtacagc atccagggca tcaagaagat 7980
ggtcacgacc attgcgaacg tgcgccagaa gaacgccaag ctgcaattcc ttggcatggt 8040
gcccagcaag gtcgatgcgc ggaatccgcg ccacgcgcgc caccaagccg agctgctggc 8100
cgcgtacccc aagatgatga ttccggccac cgttggcctg cgcagcagca tcgccgatgc 8160
cctcgcatcc ggtgtgccgg tctggaagat caagaaaacg gccgcgcgca aggcatcgaa 8220
agaggttcgc gccctggctg attacgtgtt cacgaagatg gagatttccc aatgactgcg 8280
gctcaagcca agaccaccaa gaaaaacacc gctgcggccg ctcaggaagc cgcaggcgcg 8340
gcgcagccgt ccggcctggg gttggatagc atcggcgacc tgtcgagcct cctggacgct 8400
cctgcggcgt ctcagggcgg ttccggccct atcgagctgg acctggacct gatcgacgaa 8460
gatccgcatc agccgcggac ggccgacaac cccggctttt ccccggagag catcgcggaa 8520
atcggtgcca cgatcaaaga gcgcggggtg aagtcaccca tttcggtgcg cgagaaccag 8580
gagcagccgg gccgctatat catcaatcac ggcgcccgcc gctaccgtgg ctcgaagtgg 8640
gccggcaaga agtccatccc ggcgttcatc gacaacgact acaacgaagc cgaccaggtt 8700
atcgagaacc tgcaacgcaa cgagctgacc ccgcgcgaaa ttgccgactt cattggccgc 8760
gagctggcga agggcaagaa gaaaggcgat atcgccaagg aaatcggcaa gtcgccggcg 8820
ttcatcaccc agcacgtcac gctgctggac ctgccggaga agatcgccga tgcgttcaac 8880
accggccgcg tgcgcgacgt gaccgtggtg aacgagctgg tgacggcctt caagaagcgc 8940
ccggaggaag tcgaggcgtg gcttgacgac gacacccagg aaatcacgcg cggcacggtc 9000
aagctgctgc gcgagttcct ggacgagaag ggccgcgatc ccaacaccgt cgatgccttc 9060
aacggccaga ctgatgccga gcgtgacgcg gaggccggcg acggccagga cggcgaggac 9120
ggcgaccagg acggtaagga cgccaaggaa aagggcgcga aggagccgga cccggacaag 9180
ctgaaaaagg ccatcgtcca ggtcgagcac gacgagcgcc ctgcccgcct tatcctcaac 9240
cgtcggccgc cggcggaagg ctatgcctgg ttgaagtacg aggacgacgg ccaggagttc 9300
gaggcgaacc ttgccgacgt gaaactggtc gcgctcatcg agggctgatc cccaaagaca 9360
gcggcgcggg ccacccgcgc cgcacagaca acggttccgc tacaaggagg accgaagaat 9420
gaatccgatg ctgttctaca tcgcgggagg cgtaggcgcg gcgttgctgc tggtttccgc 9480
gatcatgctg ttcaagctgc gcgagccgaa gaaggaacac cgaccgcagc gcaaggcggc 9540
ggccccgacg ccgcagccgg tcgataacga gctgctgcgc actctagtga tattccacaa 9600
aacagcaggg aagcagcgct tttccgctgc ataaccctgc ttcggggtca ttatagcgat 9660
tttttcggta tatccatcct ttttcgcacg atatacagga ttttgccaaa gggttcgtgt 9720
agactttcct tggtgtatcc aacggcgtca gccgggcagg ataggtgaag taggcccacc 9780
cgcgagcggg tgttccttct tcactgtccc ttattcgcac ctggcggtgc tcaacgggaa 9840
tcctgctctg cgaggctggc cggctaccgc cggcgtaaca gatgagggca agcggatggc 9900
tgatgaaacc aagccaacca ggaagggcag cccacctatc aaggtgtact gccttccaga 9960
cgaacgaaga gcgattgagg aaaaggcggc ggcggccggc atgagcctgt cggcctacct 10020
gctggccgtc ggccagggct acaaaatcac gggcgtcgtg gactatgagc acgtccgcga 10080
gctggcccgc atcaatggcg acctgggccg cctgggcggc ctgctgaaac tctggctcac 10140
cgacgacccg cgcacggcgc ggttcggtga tgccacgatc ctcgccctgc tggcgaagat 10200
cgaagagaag caggacgagc ttggcaaggt catgatgggc gtggtccgcc cgagggcaga 10260
gccatgactt ttttagccgc taaaacggcc ggggggtgcg cgtgattgcc aagcacgtcc 10320
ccatgcgctc catcaagaag agcgacttcg cggagctggt gaagtacatc accgacgagc 10380
aaggcaagac cgagcgccag atccaaaaca actgtcaaag cgcacccgcc cgatgccatt 10440
cgcggcacgg cttccgttga ggatgtcgat atgatgcgcg agccgacggc ccgcagagaa 10500
ggggccgttt tagcggctaa agaaggaagt gcaagcccta acccttggcg tcagagcctt 10560
ccacgcagct tttttcgggt gtcgtcgccc catttcttta cgataaacgc cttatgtgac 10620
ggcaaaacca cactgatgcg ttcgtatccg ggcggcacgc tgctcttgaa aggatgaccc 10680
gcaatctccg cgagtgcctc gcggtcaagg tcggtggact ccaggagaag aggtagggga 10740
gtttccaggg cgtcggcaat ggcctccatc accttcaacg aggggttggc cttaccgttg 10800
gttaagtctg ataaaaacga aattgaaacc cctgccctct ccgacagctc atgtttcgtc 10860
atgccccgct catcgagcag acgaaggatg ttggtgaaaa atatctggtt gtacacagcg 10920
gaagccgccc ctcgcacctt tggtcgcggc ccgcaaaatt ttagccgcta aagttcttga 10980
cagcggaacc aatgtttagc taaactagag tctcctttct caaggagact ttcgatatga 11040
gccataatca gttccagttt atcggtaatc ttacccgtga caccgaggta cgtcatggca 11100
attctaacaa gccgcaagca attttcgata tagcggttaa tgaagagtgg cgcaacgatg 11160
ccggcgacaa gcaggagcgc accgacttct tccgcatcaa gtgttttggc tctcaggccg 11220
aggcccacgg caagtatttg ggcaaggggt cgctggtatt cgtgcagggc aagattcgga 11280
ataccaagta cgagaaggac ggccagacgg tctacgggac cgacttcatt gccgataagg 11340
tggattatct ggacaccaag gcaccaggcg ggtcaaatca ggaataaggg cacattgccc 11400
cggcgtgagt cggggcaatc ccgcaaggag ggtgaatgaa tcggacgttt gaccggaagg 11460
catacaggca agaactgatc gacgcggggt tttccgccga ggatgccgaa accatcgcaa 11520
gccgcaccgt catgcgtgcg ccccgcgaaa ccttccagtc cgtcggctcg atggtccagc 11580
aagctacggc caagatcgag cgcgacagcg tgcaactggc tccccctgcc ctgcccgcgc 11640
catcggccgc cgtggagcgt tcgcgtcgtc tcgaacagga ggcggcaggt ttggcgaagt 11700
cgatgaccat cgacacgcga ggaactatga cgaccaagaa gcgaaaaacc gccggcgagg 11760
acctggcaaa acaggtcagc gaggccaagc aggccgcgtt gctgaaacac acgaagcagc 11820
agatcaagga aatgcagctt tccttgttcg atattgcgcc gtggccggac acgatgcgag 11880
cgatgccaaa cgacacggcc cgctctgccc tgttcaccac gcgcaacaag aaaatcccgc 11940
gcgaggcgct gcaaaacaag gtcattttcc acgtcaacaa ggacgtgaag atcacctaca 12000
ccggcgtcga gctgcgggcc gacgatgacg aactggtgtg gcagcaggtg ttggagtacg 12060
cgaagcgcac ccctatcggc gagccgatca ccttcacgtt ctacgagctt tgccaggacc 12120
tgggctggtc gatcaatggc cggtattaca cgaaggccga ggaatgcctg tcgcgcctac 12180
aggcgacggc gatgggcttc acgtccgacc gcgttgggca cctggaatcg gtgtcgctgc 12240
tgcaccgctt ccgcgtcctg gaccgtggca agaaaacgtc ccgttgccag gtcctgatcg 12300
acgaggaaat cgtcgtgctg tttgctggcg accactacac gaaattcata tgggagaagt 12360
accgcaagct gtcgccgacg gcccgacgga tgttcgacta tttcagctcg caccgggagc 12420
cgtacccgct caagctggaa accttccgcc tcatgtgcgg atcggattcc acccgcgtga 12480
agaagtggcg cgagcaggtc ggcgaagcct gcgaagagtt gcgaggcagc ggcctggtgg 12540
aacacgcctg ggtcaatgat gacctggtgc attgcaaacg ctagggcctt gtggggtcag 12600
ttccggctgg gggttcagca gccagcgctt tactggcatt tcaggaacaa gcgggcactg 12660
ctcgacgcac ttgcttcgct cagtatcgct cgggacgcac ggcgcgctct acgaactgcc 12720
gataaacaga ggattaaaat tgacaattgt gattaaggct cagattcgac ggcttggagc 12780
ggccgacgtg caggatttcc gcgagatccg attgtcggcc ctgaagaaag ctccagagat 12840
gttcgggtcc gtttacgagc acgaggagaa aaagcccatg gaggcgttcg ctgaacggtt 12900
gcgagatgcc gtggcattcg gcgcctacat cgacggcgag atcattgggc tgtcggtctt 12960
caaacaggag gacggcccca aggacgctca caaggcgcat ctgtccggcg ttttcgtgga 13020
gcccgaacag cgaggccgag gggtcgccgg tatgctgctg cgggcgttgc cggcgggttt 13080
attgctcgtg atgatcgtcc gacagattcc aacgggaatc tggtggatgc gcatcttcat 13140
cctcggcgca cttaatattt cgctattctg gagcttgttg tttatttcgg tctaccgcct 13200
gccgggcggg gtcgcggcga cggtaggcgc tgtgcagccg ctgatggtcg tgttcatctc 13260
tgccgctctg ctaggtagcc cgatacgatt gatggcggtc ctgggggcta tttgcggaac 13320
tgcgggcgtg gcgctgttgg tgttgacacc aaacgcagcg ctagatcctg tcggcgtcgc 13380
agcgggcctg gcgggggcgg tttccatggc gttcggaacc gtgctgaccc gcaagtggca 13440
acctcccgtg cctctgctca cctttaccgc ctggcaactg gcggccggag gacttctgct 13500
cgttccagta gctttagtgt ttgatccgcc aatcccgatg cctacaggaa ccaatgttct 13560
cggctgctcg actgcacgaa taccagcgac cccttgccca aatacttgcc gtgggcctcg 13620
gcctgagagc caaaacactt gatgcggaag aagtcggtgc gctcctgctt gtcgccggca 13680
tcgttgcgcc acatctaggt actaaaacaa ttcatccagt aaaatataat attttatttt 13740
ctcccaatca ggcttgatcc ccagtaagtc aaaaaatagc tcgacatact gttcttcccc 13800
gatatcctcc ctgatcgacc ggacgcagaa ggcaatgtca taccacttgt ccgccctgcc 13860
gcttctccca agatcaataa agccacttac tttgccatct ttcacaaaga tgttgctgtc 13920
tcccaggtcg ccgtgggaaa agacaagttc ctcttcgggc ttttccgtct ttaaaaaatc 13980
atacagctcg cgcggatctt taaatggagt gtcttcttcc cagttttcgc aatccacatc 14040
ggccagatcg ttattcagta agtaatccaa ttcggctaag cggctgtcta agctattcgt 14100
atagggacaa tccgatatgt cgatggagtg aaagagcctg atgcactccg catacagctc 14160
gataatcttt tcagggcttt gttcatcttc atactcttcc gagcaaagga cgccatcggc 14220
ctcactcatg agcagattgc tccagccatc atgccgttca aagtgcagga cctttggaac 14280
aggcagcttt ccttccagcc atagcatcat gtccttttcc cgttccacat cataggtggt 14340
ccctttatac cggctgtccg tcatttttaa atataggttt tcattttctc ccaccagctt 14400
atatacctta gcaggagaca ttccttccgt atcttttacg cagcggtatt tttcgatcag 14460
ttttttcaat tccggtgata ttctcatttt agccatttat tatttccttc ctcttttcta 14520
cagtatttaa agatacccca agaagctaat tataacaaga cgaactccaa ttcactgttc 14580
cttgcattct aaaaccttaa ataccagaaa acagcttttt caaagttgtt ttcaaagttg 14640
gcgtataaca tagtatcgat tcgatagcgt ggactcaagg ctctcgcgaa tggctcgcgt 14700
tggaaacttt cattgacact tgaggggcac cgcagggaaa ttctcgtcct tgcgagaacc 14760
ggctatgtcg tgctgcgcat cgagcctgcg cccttggctt gtctcgcccc tctccgcgtc 14820
gctacggggc ttccagcgcc tttccgacgc tcaccgggct ggttgccctc gccgctgggc 14880
tggcggccgt ctatggccct gcaaacgcgc cagaaacgcc gtcgaagccg tgtgcgagac 14940
accgcggccg ccggcgttgt ggatacctcg cggaaaactt ggccctcact gacagatgag 15000
gggcggacgt tgacacttga ggggccgact cacccggcgc ggcgttgaca gatgaggggc 15060
aggctcgatt tcggccggcg acgtggagct ggccagcctc gcaaatcggc gaaaacgcct 15120
gattttacgc gagtttccca cagatgatgt ggacaagcct ggggataagt gccctgcggt 15180
attgacactt gaggggcgcg actactgaca gatgaggggc gcgatccttg acacttgagg 15240
ggcagagtgc tgacagatga ggggcgcacc tattgacatt tgaggggctg tccacaggca 15300
gaaaatccag catttgcaag ggtttccgcc cgtttttcgg ccaccgctaa cctgtctttt 15360
aacctgcttt taaaccaata tttataaacc ttgtttttaa ccagggctgc gccctgtgcg 15420
cgtgaccgcg cacgccgaag gggggtgccc ccccttctcg aaccctcccg gcccgctaac 15480
gcgggcctcc catcccccca ggggctgcgc ccctcggccg cgaacggcct caccccaaaa 15540
atggcagcgc cagccaggac gtcggccgaa agagcgacaa gcagatcacg cttttcgaca 15600
gcgtcggatt tgcgatcgag gatttttcgg cgctgcgcta cgtccgcgac cgcgttgagg 15660
gatcaagcca cagcagccca ctcgaccttc tagccgaccc agacgagcca agggatcttt 15720
ttggaatgct gctccgtcgt caggctttcc gacgtttggg tggttgaaca gaagtcatta 15780
tcgcacggaa tgccaagcac tcccgagggg aaccctgtgg ttggcatgca catacaaatg 15840
gacgaacgga taaacctttt cacgcccttt taaatatccg attattctaa taaacgctct 15900
tttctcttag gtttacccgc caatatatcc tgtcaaacac tgatagttta aactgaaggc 15960
gggaaacgac aatctgatca tgagcggaga attaagggag tcacgttatg acccccgccg 16020
atgacgcggg acaagccgtt ttacgtttgg aactgacaga accgcaacgt tgaaggagcc 16080
actcagc 16087
<210> 3
<211> 1911
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 3
atggcgatgc cgtatgcctc tctttccccg gcaggcgacc gccgctcctc cccggccgcc 60
accgccaccg cctccctcct ccccttctgc cgctcctccc ccttctccgc cggcggcaat 120
ggcggcatgg gggaggaggc gcggatggac gggaggtgga tggcgaggcc ggtgcccttc 180
acggcggcgc agtacgagga gctggagcac caggcgctca tatacaagta cctggtggcc 240
ggcgtgtccg tcccgccgga tctcgtgctc cccatccgcc gcggcatcga gtccctcgcc 300
gcccgcttct accacaaccc cctcgccatc gggtacggat cgtacctggg caagaaggtg 360
gatccggagc cgggccggtg ccggcgcacg gacggcaaga agtggcggtg cgccaaggag 420
gccgcctccg actccaagta ttgcgagcgc cacatgcacc gcggccgcaa ccgttcaaga 480
aagcctgtgg aaacgcagct cgtctcgcac tcccagccgc cggccgcctc cgtcgtgccg 540
cccctcgcca ccggcttcca caaccactcc ctctaccccg ccatcggcgg caccaacggt 600
ggtggaggcg gggggaacaa cggcatgtcc atgcccggca cgttctcctc cgcgctgggg 660
ccgcctcagc agcacatggg caacaatgcc gcctctccct acgcggctct cggcggcgcc 720
ggaacatgca aagatttcag gtataccgca tatggaataa gatctttggc agacgagcag 780
agtcagctca tgacagaagc catgaacacc tccgtggaga acccatggcg cctgccgcca 840
tcttctcaaa cgactacatt cccgctctca agctactctc ctcagcttgg agcaacgagt 900
gacctgggtc agaacaacag cagcaacaac aacagcggcg tcaaggccga gggacagcag 960
cagcagcagc cgctctcctt cccggggtgc ggcgacttcg gcagcggcga ctccgcgaag 1020
caggagaacc agacgctgcg gccgttcttc gacgagtggc cgaagacgag ggactcgtgg 1080
tcggacctga ccgacgacaa ctcgaacgtc gcctccttct cggccaccca gctgtcgatc 1140
tcgataccca tgacgtcctc cgacttctcc gccgccagct cccagtcgcc caacggcatg 1200
ctgttcgccg gcgaaatgta cgcggccgct gccatgcagc agcaacacct gatgcagatg 1260
aaccagagca tgatgggggg ctacgcttcc tctaccactg ccaccactga tctcattcag 1320
cagtacctgg atgagaacaa gcagctgatc ctggccatcc tcgacaacca gaacaacggc 1380
aaggtggagg agtgcgcacg gaaccaagct aagctccagc agaacctcat gtacctcgcc 1440
gccatcgccg acagccagcc tccgcagacg gcatcgctgt ctcagtaccc gtccaacctg 1500
atgatgcagt ccgggccgcg gtacatgcag cagcagtcgg cgcagatgat gtcgccgcag 1560
tcgctgatgg cggcgcggtc gtcgatgatg tacgcgcagc aggccatgtc gccgctccag 1620
cagcagcagc agcagcagca gcaccaggcg gccgcgcacg gccagctggg gatgtcctcc 1680
ggcgcgacca ccgggttcaa cctcctgcac ggtgaggcca gcatgggcgg cggcggcggc 1740
gccactggca acagcatgat gaacgccagc gtcttctcgg actatggccg cggcggcagc 1800
ggcgccaagg aggggtcgac ctcgctgtcg gccgacgctc gcggcgccaa ctctggcgcg 1860
cacagcggcg acggggagta cctcaagggc accgaggagg aaggaagcta g 1911
<210> 4
<211> 636
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 4
Met Ala Met Pro Tyr Ala Ser Leu Ser Pro Ala Gly Asp Arg Arg Ser
1 5 10 15
Ser Pro Ala Ala Thr Ala Thr Ala Ser Leu Leu Pro Phe Cys Arg Ser
20 25 30
Ser Pro Phe Ser Ala Gly Gly Asn Gly Gly Met Gly Glu Glu Ala Arg
35 40 45
Met Asp Gly Arg Trp Met Ala Arg Pro Val Pro Phe Thr Ala Ala Gln
50 55 60
Tyr Glu Glu Leu Glu His Gln Ala Leu Ile Tyr Lys Tyr Leu Val Ala
65 70 75 80
Gly Val Ser Val Pro Pro Asp Leu Val Leu Pro Ile Arg Arg Gly Ile
85 90 95
Glu Ser Leu Ala Ala Arg Phe Tyr His Asn Pro Leu Ala Ile Gly Tyr
100 105 110
Gly Ser Tyr Leu Gly Lys Lys Val Asp Pro Glu Pro Gly Arg Cys Arg
115 120 125
Arg Thr Asp Gly Lys Lys Trp Arg Cys Ala Lys Glu Ala Ala Ser Asp
130 135 140
Ser Lys Tyr Cys Glu Arg His Met His Arg Gly Arg Asn Arg Ser Arg
145 150 155 160
Lys Pro Val Glu Thr Gln Leu Val Ser His Ser Gln Pro Pro Ala Ala
165 170 175
Ser Val Val Pro Pro Leu Ala Thr Gly Phe His Asn His Ser Leu Tyr
180 185 190
Pro Ala Ile Gly Gly Thr Asn Gly Gly Gly Gly Gly Gly Asn Asn Gly
195 200 205
Met Ser Met Pro Gly Thr Phe Ser Ser Ala Leu Gly Pro Pro Gln Gln
210 215 220
His Met Gly Asn Asn Ala Ala Ser Pro Tyr Ala Ala Leu Gly Gly Ala
225 230 235 240
Gly Thr Cys Lys Asp Phe Arg Tyr Thr Ala Tyr Gly Ile Arg Ser Leu
245 250 255
Ala Asp Glu Gln Ser Gln Leu Met Thr Glu Ala Met Asn Thr Ser Val
260 265 270
Glu Asn Pro Trp Arg Leu Pro Pro Ser Ser Gln Thr Thr Thr Phe Pro
275 280 285
Leu Ser Ser Tyr Ser Pro Gln Leu Gly Ala Thr Ser Asp Leu Gly Gln
290 295 300
Asn Asn Ser Ser Asn Asn Asn Ser Gly Val Lys Ala Glu Gly Gln Gln
305 310 315 320
Gln Gln Gln Pro Leu Ser Phe Pro Gly Cys Gly Asp Phe Gly Ser Gly
325 330 335
Asp Ser Ala Lys Gln Glu Asn Gln Thr Leu Arg Pro Phe Phe Asp Glu
340 345 350
Trp Pro Lys Thr Arg Asp Ser Trp Ser Asp Leu Thr Asp Asp Asn Ser
355 360 365
Asn Val Ala Ser Phe Ser Ala Thr Gln Leu Ser Ile Ser Ile Pro Met
370 375 380
Thr Ser Ser Asp Phe Ser Ala Ala Ser Ser Gln Ser Pro Asn Gly Met
385 390 395 400
Leu Phe Ala Gly Glu Met Tyr Ala Ala Ala Ala Met Gln Gln Gln His
405 410 415
Leu Met Gln Met Asn Gln Ser Met Met Gly Gly Tyr Ala Ser Ser Thr
420 425 430
Thr Ala Thr Thr Asp Leu Ile Gln Gln Tyr Leu Asp Glu Asn Lys Gln
435 440 445
Leu Ile Leu Ala Ile Leu Asp Asn Gln Asn Asn Gly Lys Val Glu Glu
450 455 460
Cys Ala Arg Asn Gln Ala Lys Leu Gln Gln Asn Leu Met Tyr Leu Ala
465 470 475 480
Ala Ile Ala Asp Ser Gln Pro Pro Gln Thr Ala Ser Leu Ser Gln Tyr
485 490 495
Pro Ser Asn Leu Met Met Gln Ser Gly Pro Arg Tyr Met Gln Gln Gln
500 505 510
Ser Ala Gln Met Met Ser Pro Gln Ser Leu Met Ala Ala Arg Ser Ser
515 520 525
Met Met Tyr Ala Gln Gln Ala Met Ser Pro Leu Gln Gln Gln Gln Gln
530 535 540
Gln Gln Gln His Gln Ala Ala Ala His Gly Gln Leu Gly Met Ser Ser
545 550 555 560
Gly Ala Thr Thr Gly Phe Asn Leu Leu His Gly Glu Ala Ser Met Gly
565 570 575
Gly Gly Gly Gly Ala Thr Gly Asn Ser Met Met Asn Ala Ser Val Phe
580 585 590
Ser Asp Tyr Gly Arg Gly Gly Ser Gly Ala Lys Glu Gly Ser Thr Ser
595 600 605
Leu Ser Ala Asp Ala Arg Gly Ala Asn Ser Gly Ala His Ser Gly Asp
610 615 620
Gly Glu Tyr Leu Lys Gly Thr Glu Glu Glu Gly Ser
625 630 635
<210> 5
<211> 17323
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 5
aagctttgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60
agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120
tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180
tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240
gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300
ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360
gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt 420
agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480
taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540
aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600
cgagtctaac ggacaccaac cagcgaacca gcagcgtcgc gtcgggccaa gcgaagcaga 660
cggcacggca tctctgtcgc tgcctctgga cccctctcga gagttccgct ccaccgttgg 720
acttgctccg ctgtcggcat ccagaaattg cgtggcggag cggcagacgt gagccggcac 780
ggcaggcggc ctcctcctcc tctcacggca cggcagctac gggggattcc tttcccaccg 840
ctccttcgct ttcccttcct cgcccgccgt aataaataga caccccctcc acaccctctt 900
tccccaacct cgtgttgttc ggagcgcaca cacacacaac cagatctccc ccaaatccac 960
ccgtcggcac ctccgcttca aggtacgccg ctcgtcctcc cccccccccc ctctctacct 1020
tctctagatc ggcgttccgg tccatggtta gggcccggta gttctacttc tgttcatgtt 1080
tgtgttagat ccgtgtttgt gttagatccg tgctgctagc gttcgtacac ggatgcgacc 1140
tgtacgtcag acacgttctg attgctaact tgccagtgtt tctctttggg gaatcctggg 1200
atggctctag ccgttccgca gacgggatcg atttcatgat tttttttgtt tcgttgcata 1260
gggtttggtt tgcccttttc ctttatttca atatatgccg tgcacttgtt tgtcgggtca 1320
tcttttcatg cttttttttg tcttggttgt gatgatgtgg tctggttggg cggtcgttct 1380
agatcggagt agaattctgt ttcaaactac ctggtggatt tattaatttt ggatctgtat 1440
gtgtgtgcca tacatattca tagttacgaa ttgaagatga tggatggaaa tatcgatcta 1500
ggataggtat acatgttgat gcgggtttta ctgatgcata tacagagatg ctttttgttc 1560
gcttggttgt gatgatgtgg tgtggttggg cggtcgttca ttcgttctag atcggagtag 1620
aatactgttt caaactacct ggtgtattta ttaattttgg aactgtatgt gtgtgtcata 1680
catcttcata gttacgagtt taagatggat ggaaatatcg atctaggata ggtatacatg 1740
ttgatgtggg ttttactgat gcatatacat gatggcatat gcagcatcta ttcatatgct 1800
ctaaccttga gtacctatct attataataa acaagtatgt tttataatta ttttgatctt 1860
gatatacttg gatgatggca tatgcagcag ctatatgtgg atttttttag ccctgccttc 1920
atacgctatt tatttgcttg gtactgtttc ttttgtcgat gctcaccctg ttgtttggtg 1980
ttacttctgc agactagagt tatcaacaag tttgtacaaa aaagctgcca ccatggcgat 2040
gccgtatgcc tctctttccc cggcaggcga ccgccgctcc tccccggccg ccaccgccac 2100
cgcctccctc ctccccttct gccgctcctc ccccttctcc gccggcggca atggcggcat 2160
gggggaggag gcgcggatgg acgggaggtg gatggcgagg ccggtgccct tcacggcggc 2220
gcagtacgag gagctggagc accaggcgct catatacaag tacctggtgg ccggcgtgtc 2280
cgtcccgccg gatctcgtgc tccccatccg ccgcggcatc gagtccctcg ccgcccgctt 2340
ctaccacaac cccctcgcca tcgggtacgg atcgtacctg ggcaagaagg tggatccgga 2400
gccgggccgg tgccggcgca cggacggcaa gaagtggcgg tgcgccaagg aggccgcctc 2460
cgactccaag tattgcgagc gccacatgca ccgcggccgc aaccgttcaa gaaagcctgt 2520
ggaaacgcag ctcgtctcgc actcccagcc gccggccgcc tccgtcgtgc cgcccctcgc 2580
caccggcttc cacaaccact ccctctaccc cgccatcggc ggcaccaacg gtggtggagg 2640
cggggggaac aacggcatgt ccatgcccgg cacgttctcc tccgcgctgg ggccgcctca 2700
gcagcacatg ggcaacaatg ccgcctctcc ctacgcggct ctcggcggcg ccggaacatg 2760
caaagatttc aggtataccg catatggaat aagatctttg gcagacgagc agagtcagct 2820
catgacagaa gccatgaaca cctccgtgga gaacccatgg cgcctgccgc catcttctca 2880
aacgactaca ttcccgctct caagctactc tcctcagctt ggagcaacga gtgacctggg 2940
tcagaacaac agcagcaaca acaacagcgg cgtcaaggcc gagggacagc agcagcagca 3000
gccgctctcc ttcccggggt gcggcgactt cggcagcggc gactccgcga agcaggagaa 3060
ccagacgctg cggccgttct tcgacgagtg gccgaagacg agggactcgt ggtcggacct 3120
gaccgacgac aactcgaacg tcgcctcctt ctcggccacc cagctgtcga tctcgatacc 3180
catgacgtcc tccgacttct ccgccgccag ctcccagtcg cccaacggca tgctgttcgc 3240
cggcgaaatg tacgcggccg ctgccatgca gcagcaacac ctgatgcaga tgaaccagag 3300
catgatgggg ggctacgctt cctctaccac tgccaccact gatctcattc agcagtacct 3360
ggatgagaac aagcagctga tcctggccat cctcgacaac cagaacaacg gcaaggtgga 3420
ggagtgcgca cggaaccaag ctaagctcca gcagaacctc atgtacctcg ccgccatcgc 3480
cgacagccag cctccgcaga cggcatcgct gtctcagtac ccgtccaacc tgatgatgca 3540
gtccgggccg cggtacatgc agcagcagtc ggcgcagatg atgtcgccgc agtcgctgat 3600
ggcggcgcgg tcgtcgatga tgtacgcgca gcaggccatg tcgccgctcc agcagcagca 3660
gcagcagcag cagcaccagg cggccgcgca cggccagctg gggatgtcct ccggcgcgac 3720
caccgggttc aacctcctgc acggtgaggc cagcatgggc ggcggcggcg gcgccactgg 3780
caacagcatg atgaacgcca gcgtcttctc ggactatggc cgcggcggca gcggcgccaa 3840
ggaggggtcg acctcgctgt cggccgacgc tcgcggcgcc aactctggcg cgcacagcgg 3900
cgacggggag tacctcaagg gcaccgagga ggaaggaagc tagcgttcag ctttcttgta 3960
caaagtggtt gataacagcg ggttaattaa catcttttac ccatacgatg ttcctgacta 4020
tgcgggctat ccctatgacg tcccggacta tgcaggatcc tatccatatg acgttccaga 4080
ttacgctgct cagtgcagcg cttagagctc gaatttcccc gatcgttcaa acatttggca 4140
ataaagtttc ttaagattga atcctgttgc cggtcttgcg atgattatca tataatttct 4200
gttgaattac gttaagcatg taataattaa catgtaatgc atgacgttat ttatgagatg 4260
ggtttttatg attagagtcc cgcaattata catttaatac gcgatagaaa acaaaatata 4320
gcgcgcaaac taggataaat tatcgcgcgc ggtgtcatct atgttactag atcgggaatt 4380
agcactagta acggccgcca gtgtgctgga attcgccctt taaataagcg atcgccccga 4440
gcaataatct ccaggagatc aaataccttc ccaagaaggt taaagatgca gtcaaaagat 4500
tcaggactaa ctgcatcaag aacacagaga aagatatatt tctcaagatc agaagtacta 4560
ttccagtatg gacgattcaa ggcttgcttc acaaaccaag gcaagtaata gagattggag 4620
tctctaaaaa ggtagttccc actgaatcaa aggccatgga gtcaaagatt caaatagagg 4680
acctaacaga actcgccgta aagactggcg aacagttcat acagagtctc ttacgactca 4740
atgacaagaa gaaaatcttc gtcaacatgg tggagcacga cacgcttgtc tactccaaaa 4800
atatcaaaga tacagtctca gaagaccaaa gggcaattga gacttttcaa caaagggtaa 4860
tatccggaaa cctcctcgga ttccattgcc cagctatctg tcactttatt gtgaagatag 4920
tggaaaagga aggtggctcc tacaaatgcc atcattgcga taaaggaaag gccatcgttg 4980
aagatgcctc tgccgacagt ggtcccaaag atggaccccc acccacgagg agcatcgtgg 5040
aaaaagaaga cgttccaacc acgtcttcaa agcaagtgga ttgatgtgat atctccactg 5100
acgtaaggga tgacgcacaa tcccactatc cttcgcaaga cccttcctct atataaggaa 5160
gttcatttca tttggagagg acacggggga ctctagagga tccccgatcc ctacagggta 5220
aatttctagt ttttctcctt cattttcttg gttaggaccc ttttctcttt ttattttttt 5280
gagctttgat ctttctttaa actgatctat tttttaattg attggttatg gtgtaaatat 5340
tacatagctt taactgataa tctgattact ttatttcgtg tgtctatgat gatgatgata 5400
gttacagaac cgtcgagggg gatccaatga gatatgaaaa agcctgaact caccgcgacg 5460
tctgtcgaga agtttctgat cgaaaagttc gacagcgtct ccgacctgat gcagctctcg 5520
gagggcgaag aatctcgtgc tttcagcttc gatgtaggag ggcgtggata tgtcctgcgg 5580
gtaaatagct gcgccgatgg tttctacaaa gatcgttatg tttatcggca ctttgcatcg 5640
gccgcgctcc cgattccgga agtgcttgac attggggaat tcagcgagag cctgacctat 5700
tgcatctccc gccgtgcaca gggtgtcacg ttgcaagacc tgcctgaaac cgaactgccc 5760
gctgttctgc agccggtcgc ggaggccatg gatgcgatcg ctgcggccga tcttagccag 5820
acgagcgggt tcggcccatt cggaccgcaa ggaatcggtc aatacactac atggcgtgat 5880
ttcatatgcg cgattgctga tccccatgtg tatcactggc aaactgtgat ggacgacacc 5940
gtcagtgcgt ccgtcgcgca ggctctcgat gagctgatgc tttgggccga ggactgcccc 6000
gaagtccggc acctcgtgca cgcggatttc ggctccaaca atgtcctgac ggacaatggc 6060
cgcataacag cggtcattga ctggagcgag gcgatgttcg gggattccca atacgaggtc 6120
gccaacatct tcttctggag gccgtggttg gcttgtatgg agcagcagac gcgctacttc 6180
gagcggaggc atccggagct tgcaggatcg ccgcggctcc gggcgtatat gctccgcatt 6240
ggtcttgacc aactctatca gagcttggtt gacggcaatt tcgatgatgc agcttgggcg 6300
cagggtcgat gcgacgcaat cgtccgatcc ggagccggga ctgtcgggcg tacacaaatc 6360
gcccgcagaa gcgcggccgt ctggaccgat ggctgtgtag aagtactcgc cgatagtgga 6420
aaccgacgcc ccagcactcg tccgagggca aaggaataga gtagatttaa attgaaatca 6480
ccagtctctc tctacaaatc tatctctctc tataataatg tgtgagtagt tcccagataa 6540
gggaattagg gttcttatag ggtttcgctc atgtgttgag catataagaa acccttagta 6600
tgtatttgta tttgtaaaat acttctatca ataaaatttc taattcctaa aaccaaaatc 6660
cagtgggcgc ggggtaccga gctcgaattc agtacattaa aaacgtccgc aatgtgttat 6720
taagttgtct aagcgtcaat ttgtttacac cacaatatat cctgccacca gccagccaac 6780
agctccccga ccggcagctc ggcacaaaat caccactcga tacaggcagc ccatcagtcc 6840
gggacggcgt cagcgggaga gccgttgtaa ggcggcagac tttgctcatg ttaccgatgc 6900
tattcggaag aacggcaact aagctgccgg gtttgaaaca cggatgatct cgcggagggt 6960
agcatgttga ttgtaacgat gacagagcgt tgctgcctgt gatcaaatat catctccctc 7020
gcagagatcc gaattatcag ccttcttatt catttctcgc ttaaccgtga caggctgtcg 7080
atcttgagaa ctatgccgac ataataggaa atcgctggat aaagccgctg aggaagctga 7140
gtggcgctat ttctttagaa gtgaacgttg acgatcgtcg accgtacccc gatgaattaa 7200
ttcggacgta cgttctgaac acagctggat acttacttgg gcgattgtca tacatgacat 7260
caacaatgta cccgtttgtg taaccgtctc ttggaggttc gtatgacact aggtcgctac 7320
cttaggaccg ttatagttac tagcgaattg acatgaggtt gccccgtatt cagtgtcgct 7380
gatttgtatt gtctgaagtt gtttttacgt taagttgatg cagatcaatt aatacgatac 7440
ctgcgtcata attgattatt tgacgtggtt tgatggcctc cacgcacgtt gtgatatgta 7500
gatgataatc attatcactt tacgggtcct ttccggtgat ccgacaggtt acggggcggc 7560
gacctcgcgg gttttcgcta tttatgaaaa ttttccggtt taaggcgttt ccgttcttct 7620
tcgtcataac ttaatgtttt tatttaaaat accctctgaa aagaaaggaa acgacaggtg 7680
ctgaaagcga gctttttggc ctctgtcgtt tcctttctct gtttttgtcc gtggaatgaa 7740
caatggaagg atcttctcgg cggcgatcac gacgccggcc ctgcggagcc ttcgccgcgt 7800
gcgcgattca tggcggccgt ggaggccaag gatttcgcgc gagtgcaaga gctgatcgag 7860
gcgcgtggag ccaagtcggc ggctgattat gtccttgcgc agctcgccgt ggccgaaggt 7920
ctggaccgca agcctggtgc gcgcgtcgtg gtcgggaaag cggcgggcag catggcaatg 7980
ccgcctgcgg cgctgggttt tacgccaagg ggagaagcgg catacgccat cgagcggtca 8040
gcctatggtg agccgaggtc cagcattgcg aagcagtacc agcaggaatg gaaccggaag 8100
gcggcgacct ggtgggcgat ggccggtgtg gccggcatca tcggcgcgat cctggcggcg 8160
gcggcaaccg gctttgttgg gctggcagtg tcgatccgca accgagtgaa gcgcgtgcgc 8220
gacctgttgg tgatggagcc gggtgcagag ccataagcgg caagagacga aagcccggtt 8280
tccgggcttt tgttttgtta cgccaaggac gagttttagc ggctaaaggt gttgacgtgc 8340
gagaaatgtt tagctaaact tctctcatgt gctggcggct gtcaccgcta tgttcaacca 8400
aggcgcggag caaattatgg gtgttatcca tgaagaaacg gcttaccgaa agccagttcc 8460
aggaggcgat ccaggggctg gaagtggggc agcagaccat cgagatagcg cggggcgtct 8520
tagtcgatgg gaagccacag gcgacgttcg caacgtcgct gggactgacc aggggcgcag 8580
tgtcgcaagc ggtgcatcgc gtgtgggccg cgttcgagga caagaacttg cccgaggggt 8640
acgcgcgggt aacggcggtt ctgccggaac atcaggcgta catcgtccgg aagtgggaag 8700
cggacgccaa gaaaaaacag gaaaccaaac gatgaaaact ttggtcacgg ccaaccagaa 8760
aggcggcgtc ggcaagactt cgacccttgt gcatcttgcc ttcgactttt tcgagcgcgg 8820
cttgcgggtt gccgtgatcg acctggaccc ccagggcaat gcgtcctaca cgctcaagga 8880
ctttgctacc ggcctgcatg caagcaagct gttcggcgct gtccctgccg gcggctggac 8940
cgaaaccgca cccgcagccg gcgacggcca ggccgcgcgc ctcgccctca tcgagtccaa 9000
cccggtactg gcgaacgccg aacggctgtc gctggacgac gcccgcgagc tgttcggggc 9060
gaacatcaag gccctggcga accaaggctt cgacgtgtgc ctgatcgaca cggccccgac 9120
ccttggcgtc ggcctggcgg ccgccctctt cgcggccgac tatgtgctgt cccccatcga 9180
gcttgaggcg tacagcatcc agggcatcaa gaagatggtc acgaccattg cgaacgtgcg 9240
ccagaagaac gccaagctgc aattccttgg catggtgccc agcaaggtcg atgcgcggaa 9300
tccgcgccac gcgcgccacc aagccgagct gctggccgcg taccccaaga tgatgattcc 9360
ggccaccgtt ggcctgcgca gcagcatcgc cgatgccctc gcatccggtg tgccggtctg 9420
gaagatcaag aaaacggccg cgcgcaaggc atcgaaagag gttcgcgccc tggctgatta 9480
cgtgttcacg aagatggaga tttcccaatg actgcggctc aagccaagac caccaagaaa 9540
aacaccgctg cggccgctca ggaagccgca ggcgcggcgc agccgtccgg cctggggttg 9600
gatagcatcg gcgacctgtc gagcctcctg gacgctcctg cggcgtctca gggcggttcc 9660
ggccctatcg agctggacct ggacctgatc gacgaagatc cgcatcagcc gcggacggcc 9720
gacaaccccg gcttttcccc ggagagcatc gcggaaatcg gtgccacgat caaagagcgc 9780
ggggtgaagt cacccatttc ggtgcgcgag aaccaggagc agccgggccg ctatatcatc 9840
aatcacggcg cccgccgcta ccgtggctcg aagtgggccg gcaagaagtc catcccggcg 9900
ttcatcgaca acgactacaa cgaagccgac caggttatcg agaacctgca acgcaacgag 9960
ctgaccccgc gcgaaattgc cgacttcatt ggccgcgagc tggcgaaggg caagaagaaa 10020
ggcgatatcg ccaaggaaat cggcaagtcg ccggcgttca tcacccagca cgtcacgctg 10080
ctggacctgc cggagaagat cgccgatgcg ttcaacaccg gccgcgtgcg cgacgtgacc 10140
gtggtgaacg agctggtgac ggccttcaag aagcgcccgg aggaagtcga ggcgtggctt 10200
gacgacgaca cccaggaaat cacgcgcggc acggtcaagc tgctgcgcga gttcctggac 10260
gagaagggcc gcgatcccaa caccgtcgat gccttcaacg gccagactga tgccgagcgt 10320
gacgcggagg ccggcgacgg ccaggacggc gaggacggcg accaggacgg taaggacgcc 10380
aaggaaaagg gcgcgaagga gccggacccg gacaagctga aaaaggccat cgtccaggtc 10440
gagcacgacg agcgccctgc ccgccttatc ctcaaccgtc ggccgccggc ggaaggctat 10500
gcctggttga agtacgagga cgacggccag gagttcgagg cgaaccttgc cgacgtgaaa 10560
ctggtcgcgc tcatcgaggg ctgatcccca aagacagcgg cgcgggccac ccgcgccgca 10620
cagacaacgg ttccgctaca aggaggaccg aagaatgaat ccgatgctgt tctacatcgc 10680
gggaggcgta ggcgcggcgt tgctgctggt ttccgcgatc atgctgttca agctgcgcga 10740
gccgaagaag gaacaccgac cgcagcgcaa ggcggcggcc ccgacgccgc agccggtcga 10800
taacgagctg ctgcgcactc tagtgatatt ccacaaaaca gcagggaagc agcgcttttc 10860
cgctgcataa ccctgcttcg gggtcattat agcgattttt tcggtatatc catccttttt 10920
cgcacgatat acaggatttt gccaaagggt tcgtgtagac tttccttggt gtatccaacg 10980
gcgtcagccg ggcaggatag gtgaagtagg cccacccgcg agcgggtgtt ccttcttcac 11040
tgtcccttat tcgcacctgg cggtgctcaa cgggaatcct gctctgcgag gctggccggc 11100
taccgccggc gtaacagatg agggcaagcg gatggctgat gaaaccaagc caaccaggaa 11160
gggcagccca cctatcaagg tgtactgcct tccagacgaa cgaagagcga ttgaggaaaa 11220
ggcggcggcg gccggcatga gcctgtcggc ctacctgctg gccgtcggcc agggctacaa 11280
aatcacgggc gtcgtggact atgagcacgt ccgcgagctg gcccgcatca atggcgacct 11340
gggccgcctg ggcggcctgc tgaaactctg gctcaccgac gacccgcgca cggcgcggtt 11400
cggtgatgcc acgatcctcg ccctgctggc gaagatcgaa gagaagcagg acgagcttgg 11460
caaggtcatg atgggcgtgg tccgcccgag ggcagagcca tgactttttt agccgctaaa 11520
acggccgggg ggtgcgcgtg attgccaagc acgtccccat gcgctccatc aagaagagcg 11580
acttcgcgga gctggtgaag tacatcaccg acgagcaagg caagaccgag cgccagatcc 11640
aaaacaactg tcaaagcgca cccgcccgat gccattcgcg gcacggcttc cgttgaggat 11700
gtcgatatga tgcgcgagcc gacggcccgc agagaagggg ccgttttagc ggctaaagaa 11760
ggaagtgcaa gccctaaccc ttggcgtcag agccttccac gcagcttttt tcgggtgtcg 11820
tcgccccatt tctttacgat aaacgcctta tgtgacggca aaaccacact gatgcgttcg 11880
tatccgggcg gcacgctgct cttgaaagga tgacccgcaa tctccgcgag tgcctcgcgg 11940
tcaaggtcgg tggactccag gagaagaggt aggggagttt ccagggcgtc ggcaatggcc 12000
tccatcacct tcaacgaggg gttggcctta ccgttggtta agtctgataa aaacgaaatt 12060
gaaacccctg ccctctccga cagctcatgt ttcgtcatgc cccgctcatc gagcagacga 12120
aggatgttgg tgaaaaatat ctggttgtac acagcggaag ccgcccctcg cacctttggt 12180
cgcggcccgc aaaattttag ccgctaaagt tcttgacagc ggaaccaatg tttagctaaa 12240
ctagagtctc ctttctcaag gagactttcg atatgagcca taatcagttc cagtttatcg 12300
gtaatcttac ccgtgacacc gaggtacgtc atggcaattc taacaagccg caagcaattt 12360
tcgatatagc ggttaatgaa gagtggcgca acgatgccgg cgacaagcag gagcgcaccg 12420
acttcttccg catcaagtgt tttggctctc aggccgaggc ccacggcaag tatttgggca 12480
aggggtcgct ggtattcgtg cagggcaaga ttcggaatac caagtacgag aaggacggcc 12540
agacggtcta cgggaccgac ttcattgccg ataaggtgga ttatctggac accaaggcac 12600
caggcgggtc aaatcaggaa taagggcaca ttgccccggc gtgagtcggg gcaatcccgc 12660
aaggagggtg aatgaatcgg acgtttgacc ggaaggcata caggcaagaa ctgatcgacg 12720
cggggttttc cgccgaggat gccgaaacca tcgcaagccg caccgtcatg cgtgcgcccc 12780
gcgaaacctt ccagtccgtc ggctcgatgg tccagcaagc tacggccaag atcgagcgcg 12840
acagcgtgca actggctccc cctgccctgc ccgcgccatc ggccgccgtg gagcgttcgc 12900
gtcgtctcga acaggaggcg gcaggtttgg cgaagtcgat gaccatcgac acgcgaggaa 12960
ctatgacgac caagaagcga aaaaccgccg gcgaggacct ggcaaaacag gtcagcgagg 13020
ccaagcaggc cgcgttgctg aaacacacga agcagcagat caaggaaatg cagctttcct 13080
tgttcgatat tgcgccgtgg ccggacacga tgcgagcgat gccaaacgac acggcccgct 13140
ctgccctgtt caccacgcgc aacaagaaaa tcccgcgcga ggcgctgcaa aacaaggtca 13200
ttttccacgt caacaaggac gtgaagatca cctacaccgg cgtcgagctg cgggccgacg 13260
atgacgaact ggtgtggcag caggtgttgg agtacgcgaa gcgcacccct atcggcgagc 13320
cgatcacctt cacgttctac gagctttgcc aggacctggg ctggtcgatc aatggccggt 13380
attacacgaa ggccgaggaa tgcctgtcgc gcctacaggc gacggcgatg ggcttcacgt 13440
ccgaccgcgt tgggcacctg gaatcggtgt cgctgctgca ccgcttccgc gtcctggacc 13500
gtggcaagaa aacgtcccgt tgccaggtcc tgatcgacga ggaaatcgtc gtgctgtttg 13560
ctggcgacca ctacacgaaa ttcatatggg agaagtaccg caagctgtcg ccgacggccc 13620
gacggatgtt cgactatttc agctcgcacc gggagccgta cccgctcaag ctggaaacct 13680
tccgcctcat gtgcggatcg gattccaccc gcgtgaagaa gtggcgcgag caggtcggcg 13740
aagcctgcga agagttgcga ggcagcggcc tggtggaaca cgcctgggtc aatgatgacc 13800
tggtgcattg caaacgctag ggccttgtgg ggtcagttcc ggctgggggt tcagcagcca 13860
gcgctttact ggcatttcag gaacaagcgg gcactgctcg acgcacttgc ttcgctcagt 13920
atcgctcggg acgcacggcg cgctctacga actgccgata aacagaggat taaaattgac 13980
aattgtgatt aaggctcaga ttcgacggct tggagcggcc gacgtgcagg atttccgcga 14040
gatccgattg tcggccctga agaaagctcc agagatgttc gggtccgttt acgagcacga 14100
ggagaaaaag cccatggagg cgttcgctga acggttgcga gatgccgtgg cattcggcgc 14160
ctacatcgac ggcgagatca ttgggctgtc ggtcttcaaa caggaggacg gccccaagga 14220
cgctcacaag gcgcatctgt ccggcgtttt cgtggagccc gaacagcgag gccgaggggt 14280
cgccggtatg ctgctgcggg cgttgccggc gggtttattg ctcgtgatga tcgtccgaca 14340
gattccaacg ggaatctggt ggatgcgcat cttcatcctc ggcgcactta atatttcgct 14400
attctggagc ttgttgttta tttcggtcta ccgcctgccg ggcggggtcg cggcgacggt 14460
aggcgctgtg cagccgctga tggtcgtgtt catctctgcc gctctgctag gtagcccgat 14520
acgattgatg gcggtcctgg gggctatttg cggaactgcg ggcgtggcgc tgttggtgtt 14580
gacaccaaac gcagcgctag atcctgtcgg cgtcgcagcg ggcctggcgg gggcggtttc 14640
catggcgttc ggaaccgtgc tgacccgcaa gtggcaacct cccgtgcctc tgctcacctt 14700
taccgcctgg caactggcgg ccggaggact tctgctcgtt ccagtagctt tagtgtttga 14760
tccgccaatc ccgatgccta caggaaccaa tgttctcggc tgctcgactg cacgaatacc 14820
agcgacccct tgcccaaata cttgccgtgg gcctcggcct gagagccaaa acacttgatg 14880
cggaagaagt cggtgcgctc ctgcttgtcg ccggcatcgt tgcgccacat ctaggtacta 14940
aaacaattca tccagtaaaa tataatattt tattttctcc caatcaggct tgatccccag 15000
taagtcaaaa aatagctcga catactgttc ttccccgata tcctccctga tcgaccggac 15060
gcagaaggca atgtcatacc acttgtccgc cctgccgctt ctcccaagat caataaagcc 15120
acttactttg ccatctttca caaagatgtt gctgtctccc aggtcgccgt gggaaaagac 15180
aagttcctct tcgggctttt ccgtctttaa aaaatcatac agctcgcgcg gatctttaaa 15240
tggagtgtct tcttcccagt tttcgcaatc cacatcggcc agatcgttat tcagtaagta 15300
atccaattcg gctaagcggc tgtctaagct attcgtatag ggacaatccg atatgtcgat 15360
ggagtgaaag agcctgatgc actccgcata cagctcgata atcttttcag ggctttgttc 15420
atcttcatac tcttccgagc aaaggacgcc atcggcctca ctcatgagca gattgctcca 15480
gccatcatgc cgttcaaagt gcaggacctt tggaacaggc agctttcctt ccagccatag 15540
catcatgtcc ttttcccgtt ccacatcata ggtggtccct ttataccggc tgtccgtcat 15600
ttttaaatat aggttttcat tttctcccac cagcttatat accttagcag gagacattcc 15660
ttccgtatct tttacgcagc ggtatttttc gatcagtttt ttcaattccg gtgatattct 15720
cattttagcc atttattatt tccttcctct tttctacagt atttaaagat accccaagaa 15780
gctaattata acaagacgaa ctccaattca ctgttccttg cattctaaaa ccttaaatac 15840
cagaaaacag ctttttcaaa gttgttttca aagttggcgt ataacatagt atcgattcga 15900
tagcgtggac tcaaggctct cgcgaatggc tcgcgttgga aactttcatt gacacttgag 15960
gggcaccgca gggaaattct cgtccttgcg agaaccggct atgtcgtgct gcgcatcgag 16020
cctgcgccct tggcttgtct cgcccctctc cgcgtcgcta cggggcttcc agcgcctttc 16080
cgacgctcac cgggctggtt gccctcgccg ctgggctggc ggccgtctat ggccctgcaa 16140
acgcgccaga aacgccgtcg aagccgtgtg cgagacaccg cggccgccgg cgttgtggat 16200
acctcgcgga aaacttggcc ctcactgaca gatgaggggc ggacgttgac acttgagggg 16260
ccgactcacc cggcgcggcg ttgacagatg aggggcaggc tcgatttcgg ccggcgacgt 16320
ggagctggcc agcctcgcaa atcggcgaaa acgcctgatt ttacgcgagt ttcccacaga 16380
tgatgtggac aagcctgggg ataagtgccc tgcggtattg acacttgagg ggcgcgacta 16440
ctgacagatg aggggcgcga tccttgacac ttgaggggca gagtgctgac agatgagggg 16500
cgcacctatt gacatttgag gggctgtcca caggcagaaa atccagcatt tgcaagggtt 16560
tccgcccgtt tttcggccac cgctaacctg tcttttaacc tgcttttaaa ccaatattta 16620
taaaccttgt ttttaaccag ggctgcgccc tgtgcgcgtg accgcgcacg ccgaaggggg 16680
gtgccccccc ttctcgaacc ctcccggccc gctaacgcgg gcctcccatc cccccagggg 16740
ctgcgcccct cggccgcgaa cggcctcacc ccaaaaatgg cagcgccagc caggacgtcg 16800
gccgaaagag cgacaagcag atcacgcttt tcgacagcgt cggatttgcg atcgaggatt 16860
tttcggcgct gcgctacgtc cgcgaccgcg ttgagggatc aagccacagc agcccactcg 16920
accttctagc cgacccagac gagccaaggg atctttttgg aatgctgctc cgtcgtcagg 16980
ctttccgacg tttgggtggt tgaacagaag tcattatcgc acggaatgcc aagcactccc 17040
gaggggaacc ctgtggttgg catgcacata caaatggacg aacggataaa ccttttcacg 17100
cccttttaaa tatccgatta ttctaataaa cgctcttttc tcttaggttt acccgccaat 17160
atatcctgtc aaacactgat agtttaaact gaaggcggga aacgacaatc tgatcatgag 17220
cggagaatta agggagtcac gttatgaccc ccgccgatga cgcgggacaa gccgttttac 17280
gtttggaact gacagaaccg caacgttgaa ggagccactc agc 17323
<210> 6
<211> 1698
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 6
atgaagcaaa gctttgtggg ggtttcttca caaacagagg ctgagacaag agccagagag 60
agtgacacac agacacaccc acaaaggaac ctcagtcact gtccccaaaa acatcctcaa 120
aacttgcaaa gaacccaaaa tggtgggtct gttggggggg catcaatggc aacagggatg 180
atggtgaggt catcaccatt cacagtgtct cagtggcagg agctggagca tcaagctttg 240
atctttaagt atctaatggc aggactaccc gtcccacctg atcttgtgct tcctattcag 300
aagagctttg aatctatgtc ttctaggttc ttccaccatc ctaccatggg ttactgttcc 360
ttctatggga agaaggtgga cccggagcca ggtagatgcc ggaggactga tggcaagaaa 420
tggaggtgct ccaaagacgc ctacccggac tccaagtact gtgagcgcca catgcaccgt 480
ggccgcaacc gttcaagaaa gcctgtggaa tcacaaacta ctacacagtc ttcttcgact 540
gtgacttcac taactgtcac tggaggcagt ggcagtggag ccgggacctt ccagaacctt 600
cccctgcaaa acttcaataa cccccaaggc ggcccggggt ccggaaccaa ccaatcccaa 660
ttccatatgg agcccattcc ctatggcatt cctaataaag attacaggta tcttcatgga 720
cttaaacccg aggtaggcga acatagtttc ttctctgaag cttcaggaag caacaggggt 780
cttcagatgg actcgccttt ggacagcaca tggcctctga tgccatcacg ggtctcatca 840
ttccccccat caaaatcaag tgaaaactcc atcttgcaga gtgactaccc ccagcattcg 900
ttcttcagca gcgaatttgc gtcgggagag gttgtgaaac aggagggtca gtctctccgt 960
cctttctttg acgagtggcc taaaaccaga gactcctggt ccggccttga ggaagagagg 1020
tccaaccaga cctcattctc cacaacccag ctctcaattt cgatcccaat ggcctcctct 1080
gacttctcaa caaccagctc tcggtctccc cacgataacg cggccgctgc catgcagccc 1140
atgatggcag cctattaccc cagcaacgtc accactgatc acattcagca gtatcttgat 1200
gaaaacaagt cattgattct gaagattgtt gagagccaga attcaggaaa attgactgaa 1260
tgtgcagaga accaggcaag actacagaga aacctcatgt acctggctgc aatcgctgat 1320
tctcaacccc aaccacccac catgcatgct cagttccctc ctagtggcat tgttcagcca 1380
ggagctcact acatgcagca ccaacaagct caacaaatga caccacagtc gctcctggct 1440
gcacgctcct ccatgctgta ctcccaacaa ccattttcgg ccctgcaaca acaacaagcc 1500
atccatagcc agcttggcat gggctctggt ggaagtgcag gacttcacat gctgcaaagc 1560
gaggggagta atccaggagg caatggaaca ctggggactg atatgggaaa tgcagaaggg 1620
cgaggaggga actcaggagg tcagggtggg gatggaggtg agactcttta cttgaaagct 1680
gctgaagatg ggaattga 1698
<210> 7
<211> 565
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 7
Met Lys Gln Ser Phe Val Gly Val Ser Ser Gln Thr Glu Ala Glu Thr
1 5 10 15
Arg Ala Arg Glu Ser Asp Thr Gln Thr His Pro Gln Arg Asn Leu Ser
20 25 30
His Cys Pro Gln Lys His Pro Gln Asn Leu Gln Arg Thr Gln Asn Gly
35 40 45
Gly Ser Val Gly Gly Ala Ser Met Ala Thr Gly Met Met Val Arg Ser
50 55 60
Ser Pro Phe Thr Val Ser Gln Trp Gln Glu Leu Glu His Gln Ala Leu
65 70 75 80
Ile Phe Lys Tyr Leu Met Ala Gly Leu Pro Val Pro Pro Asp Leu Val
85 90 95
Leu Pro Ile Gln Lys Ser Phe Glu Ser Met Ser Ser Arg Phe Phe His
100 105 110
His Pro Thr Met Gly Tyr Cys Ser Phe Tyr Gly Lys Lys Val Asp Pro
115 120 125
Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg Cys Ser
130 135 140
Lys Asp Ala Tyr Pro Asp Ser Lys Tyr Cys Glu Arg His Met His Arg
145 150 155 160
Gly Arg Asn Arg Ser Arg Lys Pro Val Glu Ser Gln Thr Thr Thr Gln
165 170 175
Ser Ser Ser Thr Val Thr Ser Leu Thr Val Thr Gly Gly Ser Gly Ser
180 185 190
Gly Ala Gly Thr Phe Gln Asn Leu Pro Leu Gln Asn Phe Asn Asn Pro
195 200 205
Gln Gly Gly Pro Gly Ser Gly Thr Asn Gln Ser Gln Phe His Met Glu
210 215 220
Pro Ile Pro Tyr Gly Ile Pro Asn Lys Asp Tyr Arg Tyr Leu His Gly
225 230 235 240
Leu Lys Pro Glu Val Gly Glu His Ser Phe Phe Ser Glu Ala Ser Gly
245 250 255
Ser Asn Arg Gly Leu Gln Met Asp Ser Pro Leu Asp Ser Thr Trp Pro
260 265 270
Leu Met Pro Ser Arg Val Ser Ser Phe Pro Pro Ser Lys Ser Ser Glu
275 280 285
Asn Ser Ile Leu Gln Ser Asp Tyr Pro Gln His Ser Phe Phe Ser Ser
290 295 300
Glu Phe Ala Ser Gly Glu Val Val Lys Gln Glu Gly Gln Ser Leu Arg
305 310 315 320
Pro Phe Phe Asp Glu Trp Pro Lys Thr Arg Asp Ser Trp Ser Gly Leu
325 330 335
Glu Glu Glu Arg Ser Asn Gln Thr Ser Phe Ser Thr Thr Gln Leu Ser
340 345 350
Ile Ser Ile Pro Met Ala Ser Ser Asp Phe Ser Thr Thr Ser Ser Arg
355 360 365
Ser Pro His Asp Asn Ala Ala Ala Ala Met Gln Pro Met Met Ala Ala
370 375 380
Tyr Tyr Pro Ser Asn Val Thr Thr Asp His Ile Gln Gln Tyr Leu Asp
385 390 395 400
Glu Asn Lys Ser Leu Ile Leu Lys Ile Val Glu Ser Gln Asn Ser Gly
405 410 415
Lys Leu Thr Glu Cys Ala Glu Asn Gln Ala Arg Leu Gln Arg Asn Leu
420 425 430
Met Tyr Leu Ala Ala Ile Ala Asp Ser Gln Pro Gln Pro Pro Thr Met
435 440 445
His Ala Gln Phe Pro Pro Ser Gly Ile Val Gln Pro Gly Ala His Tyr
450 455 460
Met Gln His Gln Gln Ala Gln Gln Met Thr Pro Gln Ser Leu Leu Ala
465 470 475 480
Ala Arg Ser Ser Met Leu Tyr Ser Gln Gln Pro Phe Ser Ala Leu Gln
485 490 495
Gln Gln Gln Ala Ile His Ser Gln Leu Gly Met Gly Ser Gly Gly Ser
500 505 510
Ala Gly Leu His Met Leu Gln Ser Glu Gly Ser Asn Pro Gly Gly Asn
515 520 525
Gly Thr Leu Gly Thr Asp Met Gly Asn Ala Glu Gly Arg Gly Gly Asn
530 535 540
Ser Gly Gly Gln Gly Gly Asp Gly Gly Glu Thr Leu Tyr Leu Lys Ala
545 550 555 560
Ala Glu Asp Gly Asn
565
<210> 8
<211> 17390
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 8
tgagcgtcgc aaaggcgctc ggtcttgcct tgctcgtcgg tgatgtactt caccagctcc 60
gcgaagtcgc tcttcttgat ggagcgcatg gggacgtgct tggcaatcac gcgcaccccc 120
cggccgtttt agcggctaaa aaagtcatgg ctctgccctc gggcggacca cgcccatcat 180
gaccttgcca agctcgtcct gcttctcttc gatcttcgcc agcagggcga ggatcgtggc 240
atcaccgaac cgcgccgtgc gcgggtcgtc ggtgagccag agtttcagca ggccgcccag 300
gcggcccagg tcgccattga tgcgggccag ctcgcggacg tgctcatagt ccacgacgcc 360
cgtgattttg tagccctggc cgacggccag caggtaggcc gacaggctca tgccggccgc 420
cgccgccttt tcctcaatcg ctcttcgttc gtctggaagg cagtacacct tgataggtgg 480
gctgcccttc ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc 540
ggcggtagcc ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata 600
agggacagtg aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg 660
ctgacgccgt tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata 720
tcgtgcgaaa aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta 780
tgcagcggaa aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 840
gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 900
atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 960
gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1020
gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1080
ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1140
cagtgagcga ggaagcggaa gagcgccaga aggccgccag agaggccgag cgcggccgtg 1200
aggcttggac gctagggcag ggcatgaaaa agcccgtagc gggctgctac gggcgtctga 1260
cgcggtggaa agggggaggg gatgttgtct acatggctct gctgtagtga gtgggttgcg 1320
ctccggcagc ggtcctgatc aatcgtcacc ctttctcggt ccttcaacgt tcctgacaac 1380
gagcctcctt ttcgccaatc catcgacaat caccgcgagt ccctgctcga acgctgcgtc 1440
cggaccggct tcgtcgaagg cgtctatcgc ggcccgcaac agcggcgaga gcggagcctg 1500
ttcaacggtg ccgccgcgct cgccggcatc gctgtcgccg gcctgctcct caagcacggc 1560
cccaacagtg aagtagctga ttgtcatcag cgcattgacg gcgtccccgg ccgaaaaacc 1620
cgcctcgcag aggaagcgaa gctgcgcgtc ggccgtttcc atctgcggtg cgcccggtcg 1680
cgtgccggca tggatgcgcg cgccatcgcg gtaggcgagc agcgcctgcc tgaagctgcg 1740
ggcattcccg atcagaaatg agcgccagtc gtcgtcggct ctcggcaccg aatgcgtatg 1800
attctccgcc agcatggctt cggccagtgc gtcgagcagc gcccgcttgt tcctgaagtg 1860
ccagtaaagc gccggctgct gaacccccaa ccgttccgcc agtttgcgtg tcgtcagacc 1920
gtctacgccg acctcgttca acaggtccag ggcggcacgg atcactgtat tcggctgcaa 1980
ctttgtcatg cttgacactt tatcactgat aaacataata tgtccaccaa cttatcagtg 2040
ataaagaatc cgcgcgttca atcggaccag cggaggctgg tccggaggcc agacgtgaaa 2100
cccaacatac ccctgatcgt aattctgagc actgtcgcgc tcgacgctgt cggcatcggc 2160
ctgattatgc cggtgctgcc gggcctcctg cgcgatctgg ttcactcgaa cgacgtcacc 2220
gcccactatg gcattctgct ggcgctgtat gcgttggtgc aatttgcctg cgcacctgtg 2280
ctgggcgcgc tgtcggatcg tttcgggcgg cggccaatct tgctcgtctc gctggccggc 2340
gccagatctg gggaaccctg tggttggcat gcacatacaa atggacgaac ggataaacct 2400
tttcacgccc ttttaaatat ccgattattc taataaacgc tcttttctct taggtttacc 2460
cgccaatata tcctgtcaaa cactgatagt ttaaactgaa ggcgggaaac gacaatctga 2520
tcatgagcgg agaattaagg gagtcacgtt atgacccccg ccgatgacgc gggacaagcc 2580
gttttacgtt tggaactgac agaaccgcaa cgttgaagga gccactcagc cgcgggtttc 2640
tggagtttaa tgagctaagc acatacgtca gaaaccatta ttgcgcgttc aaaagtcgcc 2700
taaggtcact atcagctagc aaatatttct tgtcaaaaat gctccactga cgttccataa 2760
attcccctcg gtatccaatt agagtctcat attcactctc aatccaaata atctgcaccg 2820
gatctggatc gtttcgcatg attgaacaag atggattgca cgcaggttct ccggccgctt 2880
gggtggagag gctattcggc tatgactggg cacaacagac aatcggctgc tctgatgccg 2940
ccgtgttccg gctgtcagcg caggggcgcc cggttctttt tgtcaagacc gacctgtccg 3000
gtgccctgaa tgaactgcag gacgaggcag cgcggctatc gtggctggcc acgacgggcg 3060
ttccttgcgc agctgtgctc gacgttgtca ctgaagcggg aagggactgg ctgctattgg 3120
gcgaagtgcc ggggcaggat ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca 3180
tcatggctga tgcaatgcgg cggctgcata cgcttgatcc ggctacctgc ccattcgacc 3240
accaagcgaa acatcgcatc gagcgagcac gtactcggat ggaagccggt cttgtcgatc 3300
aggatgatct ggacgaagag catcaggggc tcgcgccagc cgaactgttc gccaggctca 3360
aggcgcgcat gcccgacggc gatgatctcg tcgtgaccca tggcgatgcc tgcttgccga 3420
atatcatggt ggaaaatggc cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg 3480
cggaccgcta tcaggacata gcgttggcta cccgtgatat tgctgaagag cttggcggcg 3540
aatgggctga ccgcttcctc gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg 3600
ccttctatcg ccttcttgac gagttcttct gagcgggact ctggggttcg aaatgaccga 3660
ccaagcgacg cccaacctgc catcacgaga tttcgattcc accgccgcct tctatgaaag 3720
gttgggcttc ggaatcgttt tccgggacgc cggctggatg atcctccagc gcggggatct 3780
catgctggag ttcttcgccc acgggatctc tgcggaacag gcggtcgaag gtgccgatat 3840
cattacgaca gcaacggccg acaagcacaa cgccacgatc ctgagcgaca atatgatcgg 3900
gcccggcgtc cacatcaacg gcgtcggcgg cgactgccca ggcaagaccg agatgcaccg 3960
cgatatcttg ctgcgttcgg atattttcgt ggagttcccg ccacagaccc ggatgatccc 4020
cgatcgttca aacatttggc aataaagttt cttaagattg aatcctgttg ccggtcttgc 4080
gatgattatc atataatttc tgttgaatta cgttaagcat gtaataatta acatgtaatg 4140
catgacgtta tttatgagat gggtttttat gattagagtc ccgcaattat acatttaata 4200
cgcgatagaa aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc 4260
tatgttacta gatcgggcct cctgtcaatg ctggcggcgg ctctggtggt ggttctggtg 4320
gcggctctga gggtggtggc tctgagggtg gcggttctga gggtggcggc tctgagggag 4380
gcggttccgg tggtggctct ggttccggtg attttgatta tgaaaagatg gcaaacgcta 4440
ataagggggc tatgaccgaa aatgccgatg aaaacgcgct acagtctgac gctaaaggca 4500
aacttgattc tgtcgctact gattacggtg ctgctatcga tggtttcatt ggtgacgttt 4560
ccggccttgc taatggtaat ggtgctactg gtgattttgc tggctctaat tcccaaatgg 4620
ctcaagtcgg tgacggtgat aattcacctt taatgaataa tttccgtcaa tatttacctt 4680
ccctccctca atcggttgaa tgtcgccctt ttgtctttgg cccaatacgc aaaccgcctc 4740
tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag 4800
cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt 4860
tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca 4920
caggaaacag ctatgaccat gattacgcca agcttgcatg cctgcaggtc cccagattag 4980
ccttttcaat ttcagaaaga atgctaaccc acagatggtt agagaggctt acgcagcagg 5040
tctcatcaag acgatctacc cgagcaataa tctccaggaa atcaaatacc ttcccaagaa 5100
ggttaaagat gcagtcaaaa gattcaggac taactgcatc aagaacacag agaaagatat 5160
atttctcaag atcagaagta ctattccagt atggacgatt caaggcttgc ttcacaaacc 5220
aaggcaagta atagagattg gagtctctaa aaaggtagtt cccactgaat caaaggccat 5280
ggagtcaaag attcaaatag aggacctaac agaactcgcc gtaaagactg gcgaacagtt 5340
catacagagt ctcttacgac tcaatgacaa gaagaaaatc ttcgtcaaca tggtggagca 5400
cgacacactt gtctactcca aaaatatcaa agatacagtc tcagaagacc aaagggcaat 5460
tgagactttt caacaaaggg taatatccgg aaacctcctc ggattccatt gcccagctat 5520
ctgtcacttt attgtgaaga tagtggaaaa ggaaggtggc tcctacaaat gccatcattg 5580
cgataaagga aaggccatcg ttgaagatgc ctctgccgac agtggtccca aagatggacc 5640
cccacccacg aggagcatcg tggaaaaaga agacgttcca accacgtctt caaagcaagt 5700
ggattgatgt gatatctcca ctgacgtaag ggatgacgca caatcccact atccttcgca 5760
agacccttcc tctatataag gaagttcatt tcatttggag agaacacggg ggactctaga 5820
gttatcaaca agtttgtaca aaaaagctgc caccatgaag caaagctttg tgggggtttc 5880
ttcacaaaca gaggctgaga caagagccag agagagtgac acacagacac acccacaaag 5940
gaacctcagt cactgtcccc aaaaacatcc tcaaaacttg caaagaaccc aaaatggtgg 6000
gtctgttggg ggggcatcaa tggcaacagg gatgatggtg aggtcatcac cattcacagt 6060
gtctcagtgg caggagctgg agcatcaagc tttgatcttt aagtatctaa tggcaggact 6120
acccgtccca cctgatcttg tgcttcctat tcagaagagc tttgaatcta tgtcttctag 6180
gttcttccac catcctacca tgggttactg ttccttctat gggaagaagg tggacccgga 6240
gccaggtaga tgccggagga ctgatggcaa gaaatggagg tgctccaaag acgcctaccc 6300
ggactccaag tactgtgagc gccacatgca ccgtggccgc aaccgttcaa gaaagcctgt 6360
ggaatcacaa actactacac agtcttcttc gactgtgact tcactaactg tcactggagg 6420
cagtggcagt ggagccggga ccttccagaa ccttcccctg caaaacttca ataaccccca 6480
aggcggcccg gggtccggaa ccaaccaatc ccaattccat atggagccca ttccctatgg 6540
cattcctaat aaagattaca ggtatcttca tggacttaaa cccgaggtag gcgaacatag 6600
tttcttctct gaagcttcag gaagcaacag gggtcttcag atggactcgc ctttggacag 6660
cacatggcct ctgatgccat cacgggtctc atcattcccc ccatcaaaat caagtgaaaa 6720
ctccatcttg cagagtgact acccccagca ttcgttcttc agcagcgaat ttgcgtcggg 6780
agaggttgtg aaacaggagg gtcagtctct ccgtcctttc tttgacgagt ggcctaaaac 6840
cagagactcc tggtccggcc ttgaggaaga gaggtccaac cagacctcat tctccacaac 6900
ccagctctca atttcgatcc caatggcctc ctctgacttc tcaacaacca gctctcggtc 6960
tccccacgat aacgcggccg ctgccatgca gcccatgatg gcagcctatt accccagcaa 7020
cgtcaccact gatcacattc agcagtatct tgatgaaaac aagtcattga ttctgaagat 7080
tgttgagagc cagaattcag gaaaattgac tgaatgtgca gagaaccagg caagactaca 7140
gagaaacctc atgtacctgg ctgcaatcgc tgattctcaa ccccaaccac ccaccatgca 7200
tgctcagttc cctcctagtg gcattgttca gccaggagct cactacatgc agcaccaaca 7260
agctcaacaa atgacaccac agtcgctcct ggctgcacgc tcctccatgc tgtactccca 7320
acaaccattt tcggccctgc aacaacaaca agccatccat agccagcttg gcatgggctc 7380
tggtggaagt gcaggacttc acatgctgca aagcgagggg agtaatccag gaggcaatgg 7440
aacactgggg actgatatgg gaaatgcaga agggcgagga gggaactcag gaggtcaggg 7500
tggggatgga ggtgagactc tttacttgaa agctgctgaa gatgggaatt gacgttcagc 7560
tttcttgtac aaagtggttg ataacagcgg gttaattaac atcttttacc catacgatgt 7620
tcctgactat gcgggctatc cctatgacgt cccggactat gcaggatcct atccatatga 7680
cgttccagat tacgctgctc agtgcagcgc ttagagctcg aatttccccg atcgttcaaa 7740
catttggcaa taaagtttct taagattgaa tcctgttgcc ggtcttgcga tgattatcat 7800
ataatttctg ttgaattacg ttaagcatgt aataattaac atgtaatgca tgacgttatt 7860
tatgagatgg gtttttatga ttagagtccc gcaattatac atttaatacg cgatagaaaa 7920
caaaatatag cgcgcaaact aggataaatt atcgcgcgcg gtgtcatcta tgttactaga 7980
tcgggaatta gcttcatcaa cgcaagacat gcgcacgacc gtctgacagg agaggaattt 8040
ccgacgagca cagaaaggac ttgctcttgg acgtaggcct atttctcagg cacatgtatc 8100
aagtgttcgg acgtgggttt tcgatggtgt atcagccgcc gccaactggg agatgaggag 8160
gctttcttgg ggggcagtca gcagttcatt tcacaagaca gaggaacttg taaggagatg 8220
cactgattta tcttggcgca aaccagcagg acgaattagt gggaatagcc cgcgaatatc 8280
taagttatgc ctgtcggcat gagcagaaac ttccaattcg aaacagtttg gagaggttgt 8340
ttttgggcat accttttgtt agtcagcctc tcgattgctc atcgtcatta cacagtaccg 8400
aagtttgatc gatctagtaa catagatgac accgcgcgcg ataatttatc ctagtttgcg 8460
cgctatattt tgttttctat cgcgtattaa atgtataatt gcgggactct aatcataaaa 8520
acccatctca taaataacgt catgcattac atgttaatta ttacatgctt aacgtaattc 8580
aacagaaatt atatgataat catcgcaaga ccggcaacag gattcaatct taagaaactt 8640
tattgccaaa tgtttgaacg atctgcttcg acgcactcct tctttactcc accatctcgt 8700
ccttattgaa aacgtgggta gcaccaaaac gaatcaagtc gctggaactg aagttaccaa 8760
tcacgctgga tgatttgcca gttggattaa tcttgccttt ccccgcatga ataatattga 8820
tgaatgcatg cgtgaggggt atttcgattt tggcaatagc tgcaattgcc gcgacatcct 8880
ccaacgagca taattcttca gaaaaatagc gatgttccat gttgtcaggg catgcatgat 8940
gcacgttatg aggtgacggt gctaggcagt attccctcaa agtttcatag tcagtatcat 9000
attcatcatt gcattcctgc aagagagaat tgagacgcaa tccacacgct gcggcaacct 9060
tccggcgttc gtggtctatt tgctcttgga cgttgcaaac gtaagtgttg gatcccggtc 9120
ggcatctact ctattccttt gccctcggac gagtgctggg gcgtcggttt ccactatcgg 9180
cgagtacttc tacacagcca tcggtccaga cggccgcgct tctgcgggcg atttgtgtac 9240
gcccgacagt cccggctccg gatcggacga ttgcgtcgca tcgaccctgc gcccaagctg 9300
catcatcgaa attgccgtca accaagctct gatagagttg gtcaagacca atgcggagca 9360
tatacgcccg gagccgcggc gatcctgcaa gctccggatg cctccgctcg aagtagcgcg 9420
tctgctgctc catacaagcc aaccacggcc tccagaagaa gatgttggcg acctcgtatt 9480
gggaatcccc gaacatcgcc tcgctccagt caatgaccgc tgttatgcgg ccattgtccg 9540
tcaggacatt gttggagccg aaatccgcgt gcacgaggtg ccggacttcg gggcagtcct 9600
cggcccaaag catcagctca tcgagagcct gcgcgacgga cgcactgacg gtgtcgtcca 9660
tcacagtttg ccagtgatac acatggggat cagcaatcgc gcatatgaaa tcacgccatg 9720
tagtgtattg accgattcct tgcggtccga atgggccgaa cccgctcgtc tggctaagat 9780
cggccgcagc gatcgcatcc atggcctccg cgaccggctg cagaacagcg ggcagttcgg 9840
tttcaggcag gtcttgcaac gtgacaccct gtgcacggcg ggagatgcaa taggtcaggc 9900
tctcgctgaa ttccccaatg tcaagcactt ccggaatcgg gagcgcggcc gatgcaaagt 9960
gccgataaac ataacgatct ttgtagaaac catcggcgca gctatttacc cgcaggacat 10020
atccacgccc tcctacatcg aagctgaaag cacgagattc ttcgccctcc gagagctgca 10080
tcaggtcgga gacgctgtcg aacttttcga tcagaaactt ctcgacagac gtcgcggtga 10140
gttcaggctt tttcatatcg gggtcgtcct ctccaaatga aatgaacttc cttatataga 10200
ggaagggtct tgcgaaggat agtgggattg tgcgtcatcc cttacgtcag tggagatatc 10260
acatcaatcc acttgctttg aagacgtggt tggaacgtct tctttttcca cgatgctcct 10320
cgtgggtggg ggtccatctt tgggaccact gtcggcagag gcatcttgaa cgatagcctt 10380
tcctttatcg caatgatggc atttgtaggt gccaccttcc ttttctactg tccttttgat 10440
gaagtgacag atagctgggc aatggaatcc gaggaggttt cccgatatta ccctttgttg 10500
aaaagtctca atagcccttt ggtcttctga gactgtatct ttgatattct tggagtagac 10560
gagagtgtcg tgctccacca tgttgacgga tctctaggac gcgtcctaga agctaattca 10620
ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc 10680
cttgcagcac atcccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc 10740
ccttcccaac agttgcgcag cctgaatggc gcccgctcct ttcgctttct tcccttcctt 10800
tctcgccacg ttcgccggct ttccccgtca agctctaaat cgggggctcc ctttagggtt 10860
ccgatttagt gctttacggc acctcgaccc caaaaaactt gatttgggtg atggttcacg 10920
tagtgggcca tcgccctgat agacggtttt tcgccctttg acgttggagt ccacgttctt 10980
taatagtgga ctcttgttcc aaactggaac aacactcaac cctatctcgg gctattcttt 11040
tgatttataa gggattttgc cgatttcgga accaccatca aacaggattt tcgcctgctg 11100
gggcaaacca gcgtggaccg cttgctgcaa ctctctcagg gccaggcggt gaagggcaat 11160
cagctgttgc ccgtctcact ggtgaaaaga aaaaccaccc cagtacatta aaaacgtccg 11220
caatgtgtta ttaagttgtc taagcgtcaa tttgtttaca ccacaatata tcctgccacc 11280
agccagccaa cagctccccg accggcagct cggcacaaaa tcaccactcg atacaggcag 11340
cccatcagtc cgggacggcg tcagcgggag agccgttgta aggcggcaga ctttgctcat 11400
gttaccgatg ctattcggaa gaacggcaac taagctgccg ggtttgaaac acggatgatc 11460
tcgcggaggg tagcatgttg attgtaacga tgacagagcg ttgctgcctg tgatcaaata 11520
tcatctccct cgcagagatc cgaattatca gccttcttat tcatttctcg cttaaccgtg 11580
acaggctgtc gatcttgaga actatgccga cataatagga aatcgctgga taaagccgct 11640
gaggaagctg agtggcgcta tttctttaga agtgaacgtt gacgatatca actcccctat 11700
ccattgctca ccgaatggta caggtcgggg acccgaagtt ccgactgtcg gcctgatgca 11760
tccccggctg atcgacccca gatctggggc tgagaaagcc cagtaaggaa acaactgtag 11820
gttcgagtcg cgagatcccc cggaaccaaa ggaagtaggt taaacccgct ccgatcaggc 11880
cgagccacgc caggccgaga acattggttc ctgtaggcat cgggattggc ggatcaaaca 11940
ctaaagctac tggaacgagc agaagtcctc cggccgccag ttgccaggcg gtaaaggtga 12000
gcagaggcac gggaggttgc cacttgcggg tcagcacggt tccgaacgcc atggaaaccg 12060
cccccgccag gcccgctgcg acgccgacag gatctagcgc tgcgtttggt gtcaacacca 12120
acagcgccac gcccgcagtt ccgcaaatag cccccaggac cgccatcaat cgtatcgggc 12180
tacctagcag agcggcagag atgaacacga ccatcagcgg ctgcacagcg cctaccgtcg 12240
ccgcgacccc gcccggcagg cggtagaccg aaataaacaa caagctccag aatagcgaaa 12300
tattaagtgc gccgaggatg aagatgcgca tccaccagat tcccgttgga atctgtcgga 12360
cgatcatcac gagcaataaa cccgccggca acgcccgcag cagcataccg gcgacccctc 12420
ggcctcgctg ttcgggctcc acgaaaacgc cggacagatg cgccttgtga gcgtccttgg 12480
ggccgtcctc ctgtttgaag accgacagcc caatgatctc gccgtcgatg taggcgccga 12540
atgccacggc atctcgcaac cgttcagcga acgcctccat gggctttttc tcctcgtgct 12600
cgtaaacgga cccgaacatc tctggagctt tcttcagggc cgacaatcgg atctcgcgga 12660
aatcctgcac gtcggccgct ccaagccgtc gaatctgagc cttaatcaca attgtcaatt 12720
ttaatcctct gtttatcggc agttcgtaga gcgcgccgtg cgtcccgagc gatactgagc 12780
gaagcaagtg cgtcgagcag tgcccgcttg ttcctgaaat gccagtaaag cgctggctgc 12840
tgaaccccca gccggaactg accccacaag gccctagcgt ttgcaatgca ccaggtcatc 12900
attgacccag gcgtgttcca ccaggccgct gcctcgcaac tcttcgcagg cttcgccgac 12960
ctgctcgcgc cacttcttca cgcgggtgga atccgatccg cacatgaggc ggaaggtttc 13020
cagcttgagc gggtacggct cccggtgcga gctgaaatag tcgaacatcc gtcgggccgt 13080
cggcgacagc ttgcggtact tctcccatat gaatttcgtg tagtggtcgc cagcaaacag 13140
cacgacgatt tcctcgtcga tcaggacctg gcaacgggac gttttcttgc cacggtccag 13200
gacgcggaag cggtgcagca gcgacaccga ttccaggtgc ccaacgcggt cggacgtgaa 13260
gcccatcgcc gtcgcctgta ggcgcgacag gcattcctcg gccttcgtgt aataccggcc 13320
attgatcgac cagcccaggt cctggcaaag ctcgtagaac gtgaaggtga tcggctcgcc 13380
gataggggtg cgcttcgcgt actccaacac ctgctgccac accagttcgt catcgtcggc 13440
ccgcagctcg acgccggtgt aggtgatctt cacgtccttg ttgacgtgga aaatgacctt 13500
gttttgcagc gcctcgcgcg ggattttctt gttgcgcgtg gtgaacaggg cagagcgggc 13560
cgtgtcgttt ggcatcgctc gcatcgtgtc cggccacggc gcaatatcga acaaggaaag 13620
ctgcatttcc ttgatctgct gcttcgtgtg tttcagcaac gcggcctgct tggcctcgct 13680
gacctgtttt gccaggtcct cgccggcggt ttttcgcttc ttggtcgtca tagttcctcg 13740
cgtgtcgatg gtcatcgact tcgccaaacc tgccgcctcc tgttcgagac gacgcgaacg 13800
ctccacggcg gccgatggcg cgggcagggc agggggagcc agttgcacgc tgtcgcgctc 13860
gatcttggcc gtagcttgct ggaccatcga gccgacggac tggaaggttt cgcggggcgc 13920
acgcatgacg gtgcggcttg cgatggtttc ggcatcctcg gcggaaaacc ccgcgtcgat 13980
cagttcttgc ctgtatgcct tccggtcaaa cgtccgattc attcaccctc cttgcgggat 14040
tgccccgact cacgccgggg caatgtgccc ttattcctga tttgacccgc ctggtgcctt 14100
ggtgtccaga taatccacct tatcggcaat gaagtcggtc ccgtagaccg tctggccgtc 14160
cttctcgtac ttggtattcc gaatcttgcc ctgcacgaat accagcgacc ccttgcccaa 14220
atacttgccg tgggcctcgg cctgagagcc aaaacacttg atgcggaaga agtcggtgcg 14280
ctcctgcttg tcgccggcat cgttgcgcca catctaggta ctaaaacaat tcatccagta 14340
aaatataata ttttattttc tcccaatcag gcttgatccc cagtaagtca aaaaatagct 14400
cgacatactg ttcttccccg atatcctccc tgatcgaccg gacgcagaag gcaatgtcat 14460
accacttgtc cgccctgccg cttctcccaa gatcaataaa gccacttact ttgccatctt 14520
tcacaaagat gttgctgtct cccaggtcgc cgtgggaaaa gacaagttcc tcttcgggct 14580
tttccgtctt taaaaaatca tacagctcgc gcggatcttt aaatggagtg tcttcttccc 14640
agttttcgca atccacatcg gccagatcgt tattcagtaa gtaatccaat tcggctaagc 14700
ggctgtctaa gctattcgta tagggacaat ccgatatgtc gatggagtga aagagcctga 14760
tgcactccgc atacagctcg ataatctttt cagggctttg ttcatcttca tactcttccg 14820
agcaaaggac gccatcggcc tcactcatga gcagattgct ccagccatca tgccgttcaa 14880
agtgcaggac ctttggaaca ggcagctttc cttccagcca tagcatcatg tccttttccc 14940
gttccacatc ataggtggtc cctttatacc ggctgtccgt catttttaaa tataggtttt 15000
cattttctcc caccagctta tataccttag caggagacat tccttccgta tcttttacgc 15060
agcggtattt ttcgatcagt tttttcaatt ccggtgatat tctcatttta gccatttatt 15120
atttccttcc tcttttctac agtatttaaa gataccccaa gaagctaatt ataacaagac 15180
gaactccaat tcactgttcc ttgcattcta aaaccttaaa taccagaaaa cagctttttc 15240
aaagttgttt tcaaagttgg cgtataacat agtatcgacg gagccgattt tgaaaccaca 15300
attatgggtg atgctgccaa cttactgatt tagtgtatga tggtgttttt gaggtgctcc 15360
agtggcttct gtgtctatca gctgtccctc ctgttcagct actgacgggg tggtgcgtaa 15420
cggcaaaagc accgccggac atcagcgcta tctctgctct cactgccgta aaacatggca 15480
actgcagttc acttacaccg cttctcaacc cggtacgcac cagaaaatca ttgatatggc 15540
catgaatggc gttggatgcc gggcaacagc ccgcattatg ggcgttggcc tcaacacgat 15600
tttacgtcac ttaaaaaact caggccgcag tcggtaacct cgcgcataca gccgggcagt 15660
gacgtcatcg tctgcgcgga aatggacgaa cagtggggct atgtcggggc taaatcgcgc 15720
cagcgctggc tgttttacgc gtatgacagt ctccggaaga cggttgttgc gcacgtattc 15780
ggtgaacgca ctatggcgac gctggggcgt cttatgagcc tgctgtcacc ctttgacgtg 15840
gtgatatgga tgacggatgg ctggccgctg tatgaatccc gcctgaaggg aaagctgcac 15900
gtaatcagca agcgatatac gcagcgaatt gagcggcata acctgaatct gaggcagcac 15960
ctggcacggc tgggacggaa gtcgctgtcg ttctcaaaat cggtggagct gcatgacaaa 16020
gtcatcgggc attatctgaa cataaaacac tatcaataag ttggagtcat tacccaatta 16080
tgatagaatt tacaagctat aaggttattg tcctgggttt caagcattag tccatgcaag 16140
tttttatgct ttgcccattc tatagatata ttgataagcg cgctgcctat gccttgcccc 16200
ctgaaatcct tacatacggc gatatcttct atataaaaga tatattatct tatcagtatt 16260
gtcaatatat tcaaggcaat ctgcctcctc atcctcttca tcctcttcgt cttggtagct 16320
ttttaaatat ggcgcttcat agagtaattc tgtaaaggtc caattctcgt tttcatacct 16380
cggtataatc ttacctatca cctcaaatgg ttcgctgggt ttatcgcacc cccgaacacg 16440
agcacggcac ccgcgaccac tatgccaaga atgcccaagg taaaaattgc cggccccgcc 16500
atgaagtccg tgaatgcccc gacggccgaa gtgaagggca ggccgccacc caggccgccg 16560
ccctcactgc ccggcacctg gtcgctgaat gtcgatgcca gcacctgcgg cacgtcaatg 16620
cttccgggcg tcgcgctcgg gctgatcgcc catcccgtta ctgccccgat cccggcaatg 16680
gcaaggactg ccagcgctgc catttttggg gtgaggccgt tcgcggccga ggggcgcagc 16740
ccctgggggg atgggaggcc cgcgttagcg ggccgggagg gttcgagaag ggggggcacc 16800
ccccttcggc gtgcgcggtc acgcgcacag ggcgcagccc tggttaaaaa caaggtttat 16860
aaatattggt ttaaaagcag gttaaaagac aggttagcgg tggccgaaaa acgggcggaa 16920
acccttgcaa atgctggatt ttctgcctgt ggacagcccc tcaaatgtca ataggtgcgc 16980
ccctcatctg tcagcactct gcccctcaag tgtcaaggat cgcgcccctc atctgtcagt 17040
agtcgcgccc ctcaagtgtc aataccgcag ggcacttatc cccaggcttg tccacatcat 17100
ctgtgggaaa ctcgcgtaaa atcaggcgtt ttcgccgatt tgcgaggctg gccagctcca 17160
cgtcgccggc cgaaatcgag cctgcccctc atctgtcaac gccgcgccgg gtgagtcggc 17220
ccctcaagtg tcaacgtccg cccctcatct gtcagtgagg gccaagtttt ccgcgaggta 17280
tccacaacgc cggcggccgc ggtgtctcgc acacggcttc gacggcgttt ctggcgcgtt 17340
tgcagggcca tagacggccg ccagcccagc ggcgagggca accagcccgg 17390
<210> 9
<211> 409
<212> PRT
<213> Triticum aestivum
<400> 9
Met Met Met Met Gly Gly Arg Ala Gly Ala Gly Gly Val Gly Ala Gly
1 5 10 15
Gly Gly Arg Cys Pro Phe Thr Ala Thr Gln Trp Gln Glu Leu Glu His
20 25 30
Gln Ala Leu Ile Tyr Lys Tyr Met Ala Ser Gly Val Pro Ile Pro Ser
35 40 45
Asp Leu Leu Leu Pro Leu Arg Arg Ser Phe Leu Leu Asp Ser Ala Leu
50 55 60
Ala Thr Ser Pro Ser Leu Ala Phe Pro Pro Gln Ala Ala Leu Gly Trp
65 70 75 80
Gly Cys Phe Gly Met Gly Phe Gly Arg Lys Ala Glu Asp Pro Glu Pro
85 90 95
Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg Cys Ser Lys Glu
100 105 110
Ala Tyr Pro Asp Ser Lys Tyr Cys Glu Lys His Met His Arg Gly Lys
115 120 125
Asn Arg Ser Arg Lys Pro Val Glu Met Ser Leu Ala Thr Pro Pro Pro
130 135 140
Pro Pro Ser Ser Ser Ala Ser Ser Ser Ser Ser Asn Val His Ser Ala
145 150 155 160
Val Asn Val Ala Thr Thr Thr Thr Ser Pro Ala Pro Ser Tyr His Arg
165 170 175
His Ala Ala Ala Thr His Asp Thr Thr Pro Tyr His Ala Leu Tyr Gly
180 185 190
Gly Pro Tyr Ser Ser Ala Gly Arg Gln Gln His Ala Ser Ala Tyr His
195 200 205
His Ala Ala Gln Val Ser Pro Phe His Leu His Leu Asp Thr Thr His
210 215 220
Pro His Pro Pro Pro Ser Tyr Tyr Ser Thr Met Asp His Ser Lys Asp
225 230 235 240
Ser Tyr Ala Tyr Gly His Ser Val Lys Glu Val His Gly Gly Gly Glu
245 250 255
His Ala Phe Phe Ser Ser Asp Val Ser Thr Asp Arg Asp His His His
260 265 270
His Gln His Gln His His Ala Ser Ala Gly Gly Asn Gly Gln Trp Gln
275 280 285
Phe Lys Gln Leu Gly Gly Met Glu Pro Lys Gln His Asn Pro Thr Ser
290 295 300
Leu Phe Pro Gly Tyr Gly Asn Asn Ala Ala Tyr Ala Ile Asp Leu Ser
305 310 315 320
Ser Lys Glu Glu Asp Glu Glu Lys Glu Arg Arg Gln Gln Gln Gln His
325 330 335
Cys Phe Leu Leu Gly Ala Asp Leu Arg Leu Asp Lys Pro Ser Ser Gly
340 345 350
His Gly Asp Ser Ala Asp Gln Lys Pro Leu Arg Pro Phe Phe Asp Glu
355 360 365
Trp Pro His Glu Lys Thr Gly Ser Lys Gly Ser Trp Met Gly Leu Glu
370 375 380
Gly Glu Thr Gln Leu Ser Ile Ser Ile Ala Asn Glu Leu Pro Ile Thr
385 390 395 400
Thr Thr Ser Arg Tyr His His Gly Glu
405
<210> 10
<211> 311
<212> PRT
<213> Triticum aestivum
<400> 10
Met Met Leu Gly Gly His Gly Gly Gly Gly Gly Arg Cys Leu Phe Thr
1 5 10 15
Ala Ser Gln Trp Arg Glu Leu Glu His Gln Ala Leu Ile Tyr Lys Tyr
20 25 30
Met Ala Ala Gly Ser Gln Val Pro His Glu Leu Val Leu Pro Leu Arg
35 40 45
His Arg Asp Ala Ala Ala Phe Ala Gly Ile Asp Thr Ala Pro Ser Val
50 55 60
Ala Cys Tyr Pro Pro Pro Gln Pro Ser Leu Gly Trp Gly Leu Tyr Gly
65 70 75 80
Ala Gly Ala Gln Tyr Ala Arg Lys Pro Glu Asp Pro Glu Pro Gly Arg
85 90 95
Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg Cys Ser Arg Glu Ala Tyr
100 105 110
Gly Glu Ser Lys Tyr Cys Asp Arg His Met His Arg Gly Lys Asn Arg
115 120 125
Ser Arg Lys Pro Val Glu Pro Met Ser Ser Ser Ser Val Ser Ser Pro
130 135 140
Ala Ala Ser Tyr Arg Gln Thr Thr Leu Ser Met Ser Pro Pro Thr Pro
145 150 155 160
Ala Asp Thr Pro Ser Tyr Gly His Gly His Leu Arg Ala Ala Ala Ser
165 170 175
Gln Ser Gln Ile Asn Pro Leu Gln Leu His Leu Asp Thr Pro Ser Pro
180 185 190
Pro Pro Ser Tyr His Arg Tyr Ala Pro Ala Gln Gln Tyr Gly Gly Ser
195 200 205
Phe Phe Pro Ser Arg Gln Gln Val Gln Glu Glu Ala Glu Ala Glu Ala
210 215 220
Arg Arg Arg Gln His Phe Leu Ala Leu Gly Ala Asp Leu Ser Leu Asp
225 230 235 240
Lys Pro Asp Ala Thr Thr Ala Ala Ser Ser Thr Thr Glu Glu Lys Pro
245 250 255
Leu Arg Arg Phe Phe Asp Glu Trp Pro Arg Asp Gly Asn Ala Val Glu
260 265 270
Val Arg Pro Trp Asn Met Gly His Arg Asp Glu Thr Leu Leu Ser Met
275 280 285
Ser Ile Pro Thr Thr Thr Ala Ser His Pro Asp Leu Ala Ala Tyr Arg
290 295 300
His His Asn Gly Thr Phe Ala
305 310
<210> 11
<211> 384
<212> PRT
<213> Triticum aestivum
<400> 11
Met Ala Met Pro Phe Ala Ser Leu Ser Pro Ala Ala Asp His His Arg
1 5 10 15
Ser Ser Pro Ile Phe Pro Phe Cys Arg Ser Ser Pro Leu Tyr Ser Ala
20 25 30
Gly Glu Glu Ala Ala Gln Gln Gln Gln Gln Gln Gln His Ala Met Ser
35 40 45
Gly Ala Arg Trp Ala Ala Ala Arg Pro Ala Thr Phe Thr Ala Ala Gln
50 55 60
Tyr Glu Glu Leu Glu Gln Gln Ala Leu Ile Tyr Lys Tyr Leu Val Ala
65 70 75 80
Gly Val Pro Val Pro Pro Asp Leu Leu Leu Pro Ile Arg Arg Gly Phe
85 90 95
Asp Ser Leu Ala Ser Arg Phe Tyr His His His Ala Leu Gly Tyr Gly
100 105 110
Ser Tyr Phe Gly Lys Lys Leu Asp Pro Glu Pro Gly Arg Cys Arg Arg
115 120 125
Thr Asp Gly Lys Lys Trp Arg Cys Ser Lys Glu Ala Ala Gln Asp Ser
130 135 140
Lys Tyr Cys Glu Arg His Met His Arg Gly Arg Asn Arg Ser Arg Lys
145 150 155 160
Pro Val Glu Thr Gln Leu Val Ser His Ser Gln Gln Leu Gln Gln Gln
165 170 175
Ala Pro Ala Ala Ala Phe His Gly His Ser Pro Tyr Pro Ala Ile Ala
180 185 190
Thr Gly Ala Gly Ala Pro Gly Ser Phe Ala Leu Gly Ser Thr Ala Gln
195 200 205
Leu His Met Asp Asn Ala Ala Ala Pro Tyr Ala Thr Ala Gly Ala Ala
210 215 220
Gly Asn Lys Asp Phe Arg Tyr Ser Ala Tyr Gly Phe Arg Thr Ser Ala
225 230 235 240
Met Glu Asp His Asn Gln Phe Ile Ser Ala Ala Met Asp Thr Ala Met
245 250 255
Asp Asn Tyr Ser Trp Arg Leu Leu Pro Ala Gln Asn Ser Ser Phe Ser
260 265 270
Leu Ser Ser Tyr Pro Met Leu Ser Thr Leu Ser Asp Leu Asp Gln Ser
275 280 285
Ala Ile Cys Ser Leu Ala Lys Thr Glu Arg Glu Pro Leu Ser Phe Phe
290 295 300
Gly Val Gly Gly Gly Phe Asp Asp Asp Glu Ser Ala Val Lys Gln Glu
305 310 315 320
Asn Gln Thr Leu Arg Pro Phe Phe Asp Glu Trp Pro Lys Asp Arg Asp
325 330 335
Ser Trp Pro Glu Leu Gln Asp His Asp Ser Asn His Asn Asn Glu Ala
340 345 350
Phe Ser Ala Thr Lys Leu Ser Ile Ser Ile Pro Val Thr Ser Ser Asp
355 360 365
Phe Ser Thr Thr Ala Gly Ser Arg Ser Pro His Gly Ile Tyr Ser Arg
370 375 380
<210> 12
<211> 408
<212> PRT
<213> Triticum aestivum
<400> 12
Met Ala Met Pro Tyr Ala Ser Leu Ser Pro Ala Gly Asp Arg Arg Ser
1 5 10 15
Ser Pro Ala Ala Thr Ala Thr Ala Ser Leu Leu Pro Phe Cys Arg Ser
20 25 30
Ser Pro Phe Ser Ala Gly Gly Asn Gly Gly Met Gly Glu Glu Ala Arg
35 40 45
Met Asp Gly Arg Trp Met Ala Arg Pro Val Pro Phe Thr Ala Ala Gln
50 55 60
Tyr Glu Glu Leu Glu His Gln Ala Leu Ile Tyr Lys Tyr Leu Val Ala
65 70 75 80
Gly Val Ser Val Pro Pro Asp Leu Val Leu Pro Ile Arg Arg Gly Ile
85 90 95
Glu Ser Leu Ala Ala Arg Phe Tyr His Asn Pro Leu Ala Ile Gly Tyr
100 105 110
Gly Ser Tyr Leu Gly Lys Lys Val Asp Pro Glu Pro Gly Arg Cys Arg
115 120 125
Arg Thr Asp Gly Lys Lys Trp Arg Cys Ala Lys Glu Ala Ala Ser Asp
130 135 140
Ser Lys Tyr Cys Glu Arg His Met His Arg Gly Arg Asn Arg Ser Arg
145 150 155 160
Lys Pro Val Glu Thr Gln Leu Val Pro His Ser Gln Pro Pro Ala Ala
165 170 175
Ser Ala Val Pro Pro Leu Ala Thr Gly Phe His Ser His Ser Leu Tyr
180 185 190
Pro Ala Ile Gly Gly Gly Thr Asn Gly Gly Gly Gly Gly Gly Asn Asn
195 200 205
Gly Met Ser Met Pro Gly Thr Phe Ser Ser Ala Leu Gly Pro Pro Gln
210 215 220
Gln His Met Gly Asn Asn Ala Ala Ser Pro Tyr Ala Ala Leu Gly Gly
225 230 235 240
Ala Gly Thr Cys Lys Asp Phe Arg Tyr Thr Ala Tyr Gly Ile Arg Ser
245 250 255
Leu Ala Asp Glu Gln Ser Gln Leu Met Thr Glu Ala Met Asn Thr Ser
260 265 270
Val Glu Asn Pro Trp Arg Leu Pro Pro Ser Ser Gln Thr Thr Thr Phe
275 280 285
Pro Leu Ser Ser Tyr Ser Pro Gln Leu Gly Ala Thr Ser Asp Leu Gly
290 295 300
Gln Asn Asn Ser Ser Asn Asn Asn Ser Gly Val Lys Ala Glu Gly Gln
305 310 315 320
Gln Gln Gln Gln Pro Leu Ser Phe Pro Gly Cys Gly Asp Phe Gly Ser
325 330 335
Gly Asp Ser Ala Lys Gln Glu Asn Gln Thr Leu Arg Pro Phe Phe Asp
340 345 350
Glu Trp Pro Lys Thr Arg Asp Ser Trp Ser Asp Leu Thr Asp Asp Asn
355 360 365
Ser Asn Val Ala Ser Phe Ser Ala Thr Gln Leu Ser Ile Ser Ile Pro
370 375 380
Met Thr Ser Ser Asp Phe Ser Ala Ala Ser Ser Gln Ser Pro Asn Gly
385 390 395 400
Met Leu Phe Ala Gly Glu Met Tyr
405
<210> 13
<211> 368
<212> PRT
<213> Triticum aestivum
<400> 13
Met Leu Ser Ser Ser Ala Ala Met Gly Met Gly Leu Gly Gly Tyr Gly
1 5 10 15
Gln Gln Gln Gln Gln Gln Gln Met Gln Met Gln Met Gln Arg Gly Ala
20 25 30
Gly Pro Val Phe Thr Pro Ala Gln Trp Ala Glu Leu Glu Gln Gln Ala
35 40 45
Leu Ile Tyr Lys Tyr Leu Met Ala Gly Val Pro Val Pro Pro Asp Leu
50 55 60
Leu Leu Pro Ile Arg Pro His Pro Ala Gly Ala Gly Ala Thr Phe Ser
65 70 75 80
Phe Ala Asn Pro Ala Ala Ser Pro Phe Tyr His His His His Pro Ser
85 90 95
Met Ser Tyr Tyr Ala Tyr Tyr Gly Lys Lys Leu Asp Pro Glu Pro Trp
100 105 110
Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg Cys Ser Lys Glu Ala
115 120 125
His Pro Asp Ser Lys Tyr Cys Glu Arg His Met His Arg Gly Arg Asn
130 135 140
Arg Ser Arg Lys Pro Val Glu Ser Lys Ser Ala Ser Pro Ala His Gln
145 150 155 160
Ser Gln Gln Pro Pro Leu Ser Ala Val Thr Ser Ala Thr Arg Asp Ala
165 170 175
Glu Pro Leu Pro Ser Leu Pro Ala Gly Ala Lys Thr His Gly Leu Ser
180 185 190
Leu Gly Gly Ala Gly Ser Ser Gln Met His Val Asp Ala Ser Ser Tyr
195 200 205
Gly Asn Lys Tyr Ser Leu Gly Ala Lys Ser Asp Val Gly Glu Leu Ser
210 215 220
Phe Phe Ser Gly Ala Ser Gly Asn Asn Asn Arg Gly Phe Thr Ile Asp
225 230 235 240
Ser Pro Thr Asp Ser Ser Trp His Ser Met Gly Ser Ser Leu Pro Pro
245 250 255
Tyr Gln Leu Ser Lys Pro Arg Asp Ser Gly Leu Met Gln Gly Gly Phe
260 265 270
Ser Tyr Ser His Phe Glu Pro Ser Gln Glu Leu Gly Gln Val Thr Ile
275 280 285
Ala Ser Leu Ser His Ser Gln Glu Gln Asp Arg Arg Ser Phe Gly Gly
290 295 300
Gly Gly Gly Gly Gly Gly Gly Gly Ala Gly Leu Met Gly Asn Val Lys
305 310 315 320
Gln Glu Asn Gln Pro Leu Arg Pro Phe Phe Asp Glu Trp Pro Gly Arg
325 330 335
Arg Asp Ser Trp Ser Glu Met Asp Asp Glu Arg Ser Asn Gly Thr Ser
340 345 350
Phe Ser Thr Thr Gln Leu Ser Ile Ser Ile Pro Met Pro Arg Cys Asp
355 360 365
<210> 14
<211> 397
<212> PRT
<213> Oryza sativa
<400> 14
Met Met Met Met Ser Gly Arg Pro Ser Gly Gly Ala Gly Gly Gly Arg
1 5 10 15
Tyr Pro Phe Thr Ala Ser Gln Trp Gln Glu Leu Glu His Gln Ala Leu
20 25 30
Ile Tyr Lys Tyr Met Ala Ser Gly Thr Pro Ile Pro Ser Asp Leu Ile
35 40 45
Leu Pro Leu Arg Arg Ser Phe Leu Leu Asp Ser Ala Leu Ala Thr Ser
50 55 60
Pro Ser Leu Ala Phe Pro Pro Gln Pro Ser Leu Gly Trp Gly Cys Phe
65 70 75 80
Gly Met Gly Phe Gly Arg Lys Ala Glu Asp Pro Glu Pro Gly Arg Cys
85 90 95
Arg Arg Thr Asp Gly Lys Lys Trp Arg Cys Ser Lys Glu Ala Tyr Pro
100 105 110
Asp Ser Lys Tyr Cys Glu Lys His Met His Arg Gly Lys Asn Arg Ser
115 120 125
Arg Lys Pro Val Glu Met Ser Leu Ala Thr Pro Pro Pro Pro Ser Ser
130 135 140
Ser Ala Thr Ser Ala Ala Ser Asn Thr Ser Ala Gly Val Ala Pro Thr
145 150 155 160
Thr Thr Thr Thr Ser Ser Pro Ala Pro Ser Tyr Ser Arg Pro Ala Pro
165 170 175
His Asp Ala Ala Pro Tyr Gln Ala Leu Tyr Gly Gly Pro Tyr Ala Ala
180 185 190
Ala Thr Ala Arg Thr Pro Ala Ala Ala Ala Tyr His Ala Gln Val Ser
195 200 205
Pro Phe His Leu Gln Leu Asp Thr Thr His Pro His Pro Pro Pro Ser
210 215 220
Tyr Tyr Ser Met Asp His Lys Glu Tyr Ala Tyr Gly His Ala Thr Lys
225 230 235 240
Glu Val His Gly Glu His Ala Phe Phe Ser Asp Gly Thr Glu Arg Glu
245 250 255
His His His Ala Ala Ala Gly His Gly Gln Trp Gln Phe Lys Gln Leu
260 265 270
Gly Met Glu Pro Lys Gln Ser Thr Thr Pro Leu Phe Pro Gly Ala Gly
275 280 285
Tyr Gly His Thr Ala Ala Ser Pro Tyr Ala Ile Asp Leu Ser Lys Glu
290 295 300
Asp Asp Asp Glu Lys Glu Arg Arg Gln Gln Gln Gln Gln Gln Gln Gln
305 310 315 320
Gln His Cys Phe Leu Leu Gly Ala Asp Leu Arg Leu Glu Lys Pro Ala
325 330 335
Gly His Asp His Ala Ala Ala Ala Gln Lys Pro Leu Arg His Phe Phe
340 345 350
Asp Glu Trp Pro His Glu Lys Asn Ser Lys Gly Ser Trp Met Gly Leu
355 360 365
Glu Gly Glu Thr Gln Leu Ser Met Ser Ile Pro Met Ala Ala Asn Asp
370 375 380
Leu Pro Ile Thr Thr Thr Ser Arg Tyr His Asn Asp Asp
385 390 395
<210> 15
<211> 301
<212> PRT
<213> Oryza sativa
<400> 15
Met Met Ala Gly Gly Gly Ser Gly Arg Cys Leu Phe Thr Ala Thr Gln
1 5 10 15
Trp Gln Glu Leu Glu His Gln Ala Leu Ile Tyr Lys Tyr Met Ala Ala
20 25 30
Gly Ala Pro Val Pro Pro Asp Leu Leu Leu His Leu Arg His Arg Ala
35 40 45
Ala Ala Ala Ala Ala Ala Asp Val Asp Thr Val Pro Ser Leu Ala Phe
50 55 60
Pro Pro His His Leu Gly Trp Gly Cys Tyr Gly Ala Ala Ala Ala Gln
65 70 75 80
Tyr Gly Arg Arg Val Glu Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr
85 90 95
Asp Gly Lys Lys Trp Arg Cys Ser Arg Glu Ala Tyr Gly Glu Ser Lys
100 105 110
Tyr Cys Glu Lys His Met His Arg Gly Lys Asn Arg Ser Arg Lys Pro
115 120 125
Val Glu Met Pro Pro Pro Ala Ala Ala Ala Val Tyr Arg Pro Ser Ala
130 135 140
Leu Ser Ile Ser Pro Pro Pro His Asp Ala Asp Ala Pro Ser Tyr Gly
145 150 155 160
Ala Gly Ala Gly Ala Pro Leu Gln Leu His Leu Asp Ser Phe His Ala
165 170 175
Ser Thr Ser Pro Pro Pro Ser Tyr His Arg Tyr Ala His Thr Ser Ser
180 185 190
Ala Pro Leu Phe Pro Ser Ser Ala Ala Gly Tyr Gly Gly Gly Trp Ser
195 200 205
Leu Ser Lys Glu His Cys Leu Thr Leu Gly Gly Ala Ala Ala Asp Leu
210 215 220
Ser Leu Asp Lys Pro Ala Asp His His His Asp Ala Thr Ser Ala Thr
225 230 235 240
Thr Glu Lys Pro Leu Arg Arg Phe Phe Asp Glu Trp Pro Arg Ser Asp
245 250 255
Asp Gly Arg Thr Pro Trp Asp Gly Thr Gln Leu Ser Ile Ser Ile Pro
260 265 270
Thr Ala Ala Ala Ala Ser Pro Asp Leu Ala Ile Ala Gly Ala Ala Ser
275 280 285
Arg Tyr His Ser Asn Gly Asp His Leu Arg Thr Ser Glu
290 295 300
<210> 16
<211> 387
<212> PRT
<213> Oryza sativa
<400> 16
Met Ala Met Pro Phe Ala Ser Leu Ser Pro Ala Ala Asp His Arg Pro
1 5 10 15
Ser Phe Ile Phe Pro Phe Cys Arg Ser Ser Pro Leu Ser Ala Val Gly
20 25 30
Glu Glu Ala Gln Gln His Met Met Gly Ala Arg Trp Ala Ala Ala Val
35 40 45
Ala Arg Pro Pro Pro Phe Thr Ala Ala Gln Tyr Glu Glu Leu Glu Gln
50 55 60
Gln Ala Leu Ile Tyr Lys Tyr Leu Val Ala Gly Val Pro Val Pro Ala
65 70 75 80
Asp Leu Leu Leu Pro Ile Arg Arg Gly Leu Asp Ser Leu Ala Ser Arg
85 90 95
Phe Tyr His His Pro Val Leu Gly Tyr Gly Ser Tyr Phe Gly Lys Lys
100 105 110
Leu Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp
115 120 125
Arg Cys Ser Lys Glu Ala Ala Pro Asp Ser Lys Tyr Cys Glu Arg His
130 135 140
Met His Arg Gly Arg Asn Arg Ser Arg Lys Pro Val Glu Ala Gln Leu
145 150 155 160
Val Ala Pro His Ser Gln Pro Pro Ala Thr Ala Pro Ala Ala Ala Val
165 170 175
Thr Ser Thr Ala Phe Gln Asn His Ser Leu Tyr Pro Ala Ile Ala Asn
180 185 190
Gly Gly Gly Ala Asn Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser
195 200 205
Ala Pro Gly Ser Phe Ala Leu Gly Ser Asn Thr Gln Leu His Met Asp
210 215 220
Asn Ala Ala Ser Tyr Ser Thr Val Ala Ala Gly Ala Gly Asn Lys Asp
225 230 235 240
Phe Arg Tyr Ser Ala Tyr Gly Val Arg Pro Leu Ala Asp Glu His Ser
245 250 255
Pro Leu Ile Thr Gly Ala Met Asp Thr Ser Ile Asp Asn Ser Trp Cys
260 265 270
Leu Leu Pro Ser Gln Thr Ser Thr Phe Ser Val Ser Ser Tyr Pro Met
275 280 285
Leu Gly Asn Leu Ser Glu Leu Asp Gln Asn Thr Ile Cys Ser Leu Pro
290 295 300
Lys Val Glu Arg Glu Pro Leu Ser Phe Phe Gly Ser Asp Tyr Val Thr
305 310 315 320
Val Asp Ser Gly Lys Gln Glu Asn Gln Thr Leu Arg Pro Phe Phe Asp
325 330 335
Glu Trp Pro Lys Ala Arg Asp Ser Trp Pro Asp Leu Ala Asp Asp Asn
340 345 350
Ser Leu Ala Thr Phe Ser Ala Thr Gln Leu Ser Ile Ser Ile Pro Met
355 360 365
Ala Thr Ser Asp Phe Ser Thr Thr Ser Ser Arg Ser His Asn Gly Ile
370 375 380
Tyr Ser Arg
385
<210> 17
<211> 422
<212> PRT
<213> Oryza sativa
<400> 17
Met Pro Pro Cys Leu Arg Arg Trp Pro Thr Thr Ala Arg Pro Arg Gln
1 5 10 15
Pro Arg Pro Pro Pro Ser Ser Pro Ser Ala Ala Pro Pro Arg Ser Pro
20 25 30
Arg Lys Gln Arg Glu Pro Ala Ala Thr Thr His Phe Leu Gly Ser Ser
35 40 45
Gly Ala Cys Asp Asn Thr Val Arg Arg Cys Val Trp Val Gly Gly Cys
50 55 60
Arg Gly Gly Gly Gly Val Ala Met Gly Glu Asp Ala Pro Met Thr Ala
65 70 75 80
Arg Trp Pro Pro Ala Ala Ala Ala Arg Leu Pro Pro Phe Thr Ala Ala
85 90 95
Gln Tyr Glu Glu Leu Glu Gln Gln Ala Leu Ile Tyr Lys Tyr Leu Val
100 105 110
Ala Gly Val Pro Val Pro Pro Asp Leu Val Leu Pro Ile Arg Arg Gly
115 120 125
Leu Asp Ser Leu Ala Ala Arg Phe Tyr Asn His Pro Ala Leu Gly Tyr
130 135 140
Gly Pro Tyr Phe Gly Lys Lys Leu Asp Pro Glu Pro Gly Arg Cys Arg
145 150 155 160
Arg Thr Asp Gly Lys Lys Trp Arg Cys Ser Lys Glu Ala Ala Pro Asp
165 170 175
Ser Lys Tyr Cys Glu Arg His Met His Arg Gly Arg Asn Arg Ser Arg
180 185 190
Lys Pro Val Glu Thr Gln Leu Val Ala Gln Ser Gln Pro Pro Ser Ser
195 200 205
Val Val Gly Ser Ala Ala Ala Pro Leu Ala Ala Ala Ser Asn Gly Ser
210 215 220
Ser Phe Gln Asn His Ser Leu Tyr Pro Ala Ile Ala Gly Ser Asn Gly
225 230 235 240
Gly Gly Gly Gly Arg Asn Met Pro Ser Ser Phe Gly Ser Ala Leu Gly
245 250 255
Ser Gln Leu His Met Asp Asn Ala Ala Pro Tyr Ala Ala Val Gly Gly
260 265 270
Gly Thr Gly Lys Asp Leu Arg Tyr Thr Ala Tyr Gly Thr Arg Ser Leu
275 280 285
Ala Asp Glu Gln Ser Gln Leu Ile Thr Glu Ala Ile Asn Thr Ser Ile
290 295 300
Glu Asn Pro Trp Arg Leu Leu Pro Ser Gln Asn Ser Pro Phe Pro Leu
305 310 315 320
Ser Ser Tyr Ser Gln Leu Gly Ala Leu Ser Asp Leu Gly Gln Asn Thr
325 330 335
Pro Ser Ser Leu Ser Lys Val Gln Arg Gln Pro Leu Ser Phe Phe Gly
340 345 350
Asn Asp Tyr Ala Ala Val Asp Ser Val Lys Gln Glu Asn Gln Thr Leu
355 360 365
Arg Pro Phe Phe Asp Glu Trp Pro Lys Gly Arg Asp Ser Trp Ser Asp
370 375 380
Leu Ala Asp Glu Asn Ala Asn Leu Ser Ser Phe Ser Gly Thr Gln Leu
385 390 395 400
Ser Ile Ser Ile Pro Met Ala Ser Ser Asp Phe Ser Ala Ala Ser Ser
405 410 415
Arg Ser Thr Asn Gly Asp
420
<210> 18
<211> 356
<212> PRT
<213> Oryza sativa
<400> 18
Met Leu Ser Ser Ser Pro Ser Ala Ala Ala Pro Gly Ile Gly Gly Tyr
1 5 10 15
Gln Pro Gln Arg Gly Ala Ala Val Phe Thr Ala Ala Gln Trp Ala Glu
20 25 30
Leu Glu Gln Gln Ala Leu Ile Tyr Lys Tyr Leu Val Ala Gly Val Pro
35 40 45
Val Pro Gly Asp Leu Leu Leu Pro Ile Arg Pro His Ser Ser Ala Ala
50 55 60
Ala Thr Tyr Ser Phe Ala Asn Pro Ala Ala Ala Pro Phe Tyr His His
65 70 75 80
His His His Pro Ser Leu Ser Tyr Tyr Ala Tyr Tyr Gly Lys Lys Leu
85 90 95
Asp Pro Glu Pro Trp Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg
100 105 110
Cys Ser Lys Glu Ala His Pro Asp Ser Lys Tyr Cys Glu Arg His Met
115 120 125
His Arg Gly Arg Asn Arg Ser Arg Lys Pro Val Glu Ser Lys Thr Ala
130 135 140
Ala Pro Ala Pro Gln Ser Gln Pro Gln Leu Ser Asn Val Thr Thr Ala
145 150 155 160
Thr His Asp Thr Asp Ala Pro Leu Pro Ser Leu Thr Val Gly Ala Lys
165 170 175
Thr His Gly Leu Ser Leu Gly Gly Ala Gly Ser Ser Gln Phe His Val
180 185 190
Asp Ala Pro Ser Tyr Gly Ser Lys Tyr Ser Leu Gly Ala Lys Ala Asp
195 200 205
Val Gly Glu Leu Ser Phe Phe Ser Gly Ala Ser Gly Asn Thr Arg Gly
210 215 220
Phe Thr Ile Asp Ser Pro Thr Asp Ser Ser Trp His Ser Leu Pro Ser
225 230 235 240
Ser Val Pro Pro Tyr Pro Met Ser Lys Pro Arg Asp Ser Gly Leu Leu
245 250 255
Pro Gly Ala Tyr Ser Tyr Ser His Leu Glu Pro Ser Gln Glu Leu Gly
260 265 270
Gln Val Thr Ile Ala Ser Leu Ser Gln Glu Gln Glu Arg Arg Ser Phe
275 280 285
Gly Gly Gly Ala Gly Gly Met Leu Gly Asn Val Lys His Glu Asn Gln
290 295 300
Pro Leu Arg Pro Phe Phe Asp Glu Trp Pro Gly Arg Arg Asp Ser Trp
305 310 315 320
Ser Glu Met Asp Glu Glu Arg Ser Asn Gln Thr Ser Phe Ser Thr Thr
325 330 335
Gln Leu Ser Ile Ser Ile Pro Met Pro Arg Cys Gly Ser Pro Ile Gly
340 345 350
Pro Arg Leu Pro
355
<210> 19
<211> 398
<212> PRT
<213> Arabidopsis thaliana
<400> 19
Met Asp Leu Gln Leu Lys Gln Trp Arg Ser Gln Gln Gln Gln Gln His
1 5 10 15
Gln Thr Glu Ser Glu Glu Gln Pro Ser Ala Ala Lys Ile Pro Lys His
20 25 30
Val Phe Asp Gln Ile His Ser His Thr Ala Thr Ser Thr Ala Leu Pro
35 40 45
Leu Phe Thr Pro Glu Pro Thr Ser Ser Lys Leu Ser Ser Leu Ser Pro
50 55 60
Asp Ser Ser Ser Arg Phe Pro Lys Met Gly Ser Phe Phe Ser Trp Ala
65 70 75 80
Gln Trp Gln Glu Leu Glu Leu Gln Ala Leu Ile Tyr Arg Tyr Met Leu
85 90 95
Ala Gly Ala Ala Val Pro Gln Glu Leu Leu Leu Pro Ile Lys Lys Ser
100 105 110
Leu Leu His Leu Ser Pro Ser Tyr Phe Leu His His Pro Leu Gln His
115 120 125
Leu Pro His Tyr Gln Pro Ala Trp Tyr Leu Gly Arg Ala Ala Met Asp
130 135 140
Pro Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg Cys
145 150 155 160
Ser Arg Asp Val Phe Ala Gly His Lys Tyr Cys Glu Arg His Met His
165 170 175
Arg Gly Arg Asn Arg Ser Arg Lys Pro Val Glu Thr Pro Thr Thr Val
180 185 190
Asn Ala Thr Ala Thr Ser Met Ala Ser Ser Val Ala Ala Ala Ala Thr
195 200 205
Thr Thr Thr Ala Thr Thr Thr Ser Thr Phe Ala Phe Gly Gly Gly Gly
210 215 220
Gly Ser Glu Glu Val Val Gly Gln Gly Gly Ser Phe Phe Phe Ser Gly
225 230 235 240
Ser Ser Asn Ser Ser Ser Glu Leu Leu His Leu Ser Gln Ser Cys Ser
245 250 255
Glu Met Lys Gln Glu Ser Asn Asn Met Asn Asn Lys Arg Pro Tyr Glu
260 265 270
Ser His Ile Gly Phe Ser Asn Asn Arg Ser Asp Gly Gly His Ile Leu
275 280 285
Arg Pro Phe Phe Asp Asp Trp Pro Arg Ser Ser Leu Gln Glu Ala Asp
290 295 300
Asn Ser Ser Ser Pro Met Ser Ser Ala Thr Cys Leu Ser Ile Ser Met
305 310 315 320
Pro Gly Asn Ser Ser Ser Asp Val Ser Leu Lys Leu Ser Thr Gly Asn
325 330 335
Glu Glu Gly Ala Arg Ser Asn Asn Asn Gly Arg Asp Gln Gln Asn Met
340 345 350
Ser Trp Trp Ser Gly Gly Gly Ser Asn His His His His Asn Met Gly
355 360 365
Gly Pro Leu Ala Glu Ala Leu Arg Ser Ser Ser Ser Ser Ser Pro Thr
370 375 380
Ser Val Leu His Gln Leu Gly Val Ser Thr Gln Ala Phe His
385 390 395
<210> 20
<211> 380
<212> PRT
<213> Arabidopsis thaliana
<400> 20
Met Asp Leu Gln Leu Lys Gln Trp Arg Ser Gln Gln Gln Asn Glu Ser
1 5 10 15
Glu Glu Gln Gly Ser Ala Ala Thr Lys Ile Ser Asn Phe Phe Phe Asp
20 25 30
Gln Ile Gln Ser Gln Thr Ala Thr Ser Ala Ala Ala Ala Pro Leu Pro
35 40 45
Leu Phe Val Pro Glu Pro Thr Ser Ser Ser Ser Phe Ser Cys Phe Ser
50 55 60
Pro Asp Ser Ser Asn Ser Ser Ser Ser Ser Arg Phe Leu Lys Met Gly
65 70 75 80
Asn Phe Phe Ser Trp Ala Gln Trp Gln Glu Leu Glu Leu Gln Ala Leu
85 90 95
Ile Tyr Arg Tyr Met Leu Ala Gly Ala Ser Val Pro Gln Glu Leu Leu
100 105 110
Leu Pro Ile Lys Lys Ser Leu Leu His Gln Ser Pro Met His Phe Leu
115 120 125
His His Pro Leu Gln His Ser Phe Pro His His Gln Pro Ser Trp Tyr
130 135 140
Trp Gly Arg Gly Ala Met Asp Pro Glu Pro Gly Arg Cys Lys Arg Thr
145 150 155 160
Asp Gly Lys Lys Trp Arg Cys Ser Arg Asp Val Val Ala Gly His Lys
165 170 175
Tyr Cys Asp Arg His Ile His Arg Gly Arg Asn Arg Ser Arg Lys Pro
180 185 190
Val Glu Thr Ala Thr Thr Thr Ile Thr Thr Thr Ala Thr Thr Thr Ala
195 200 205
Ser Ser Phe Val Leu Gly Glu Glu Leu Gly His Gly Pro Asn Asn Asn
210 215 220
His Phe Phe Ser Ser Gly Ser Ser Gln Pro Leu His Leu Ser His Gln
225 230 235 240
Gln Ser Cys Ser Ser Glu Met Lys Gln Glu Ser Asn Asn Asn Lys Arg
245 250 255
Pro Tyr Glu Ala Asn Ser Gly Phe Ser Asn Gly Arg Ser Asp Asp Gly
260 265 270
His Ile Leu Arg His Phe Phe Asp Asp Trp Pro Arg Ser Ser Asp Ser
275 280 285
Thr Ser Ser Pro Met Ser Ser Ser Thr Cys His Leu Ser Ile Ser Met
290 295 300
Pro Gly Asn Asn Thr Ser Ser Asp Val Ser Leu Lys Leu Ser Thr Gly
305 310 315 320
Asn Glu Glu Glu Glu Glu Asn Met Arg Asn Asn Asn Asn Glu Arg Glu
325 330 335
Gln Met Asn Trp Trp Ser Asn Gly Gly Asn His His Asn Asn Met Gly
340 345 350
Gly Pro Leu Ala Glu Ala Leu Arg Ser Ala Ser Ser Thr Ser Ser Val
355 360 365
Leu His Gln Met Gly Ile Ser Thr Gln Val Phe His
370 375 380
<210> 21
<211> 397
<212> PRT
<213> Arabidopsis thaliana
<400> 21
Met Met Ser Leu Ser Gly Ser Ser Gly Arg Thr Ile Gly Arg Pro Pro
1 5 10 15
Phe Thr Pro Thr Gln Trp Glu Glu Leu Glu His Gln Ala Leu Ile Tyr
20 25 30
Lys Tyr Met Val Ser Gly Val Pro Val Pro Pro Glu Leu Ile Phe Ser
35 40 45
Ile Arg Arg Ser Leu Asp Thr Ser Leu Val Ser Arg Leu Leu Pro His
50 55 60
Gln Ser Leu Gly Trp Gly Cys Tyr Gln Met Gly Phe Gly Arg Lys Pro
65 70 75 80
Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg
85 90 95
Cys Ser Arg Glu Ala Tyr Pro Asp Ser Lys Tyr Cys Glu Lys His Met
100 105 110
His Arg Gly Arg Asn Arg Ala Arg Lys Ser Leu Asp Gln Asn Gln Thr
115 120 125
Thr Thr Thr Pro Leu Thr Ser Pro Ser Leu Ser Phe Thr Asn Asn Asn
130 135 140
Asn Pro Ser Pro Thr Leu Ser Ser Ser Ser Ser Ser Asn Ser Ser Ser
145 150 155 160
Thr Thr Tyr Ser Ala Ser Ser Ser Ser Met Asp Ala Tyr Ser Asn Ser
165 170 175
Asn Arg Phe Gly Leu Gly Gly Ser Ser Ser Asn Thr Arg Gly Tyr Phe
180 185 190
Asn Ser His Ser Leu Asp Tyr Pro Tyr Pro Ser Thr Ser Pro Lys Gln
195 200 205
Gln Gln Gln Thr Leu His His Ala Ser Ala Leu Ser Leu His Gln Asn
210 215 220
Thr Asn Ser Thr Ser Gln Phe Asn Val Leu Ala Ser Ala Thr Asp His
225 230 235 240
Lys Asp Phe Arg Tyr Phe Gln Gly Ile Gly Glu Arg Val Gly Gly Val
245 250 255
Gly Glu Arg Thr Phe Phe Pro Glu Ala Ser Arg Ser Phe Gln Asp Ser
260 265 270
Pro Tyr His His His Gln Gln Pro Leu Ala Thr Val Met Asn Asp Pro
275 280 285
Tyr His His Cys Ser Thr Asp His Asn Lys Ile Asp His His His Thr
290 295 300
Tyr Ser Ser Ser Ser Ser Ser Gln His Leu His His Asp His Asp His
305 310 315 320
Arg Gln Gln Gln Cys Phe Val Leu Gly Ala Asp Met Phe Asn Lys Pro
325 330 335
Thr Arg Ser Val Leu Ala Asn Ser Ser Arg Gln Asp Gln Asn Gln Glu
340 345 350
Glu Asp Glu Lys Asp Ser Ser Glu Ser Ser Lys Lys Ser Leu His His
355 360 365
Phe Phe Gly Glu Asp Trp Ala Gln Asn Lys Asn Ser Ser Asp Ser Trp
370 375 380
Leu Asp Leu Ser Ser His Ser Arg Leu Asp Thr Gly Ser
385 390 395
<210> 22
<211> 244
<212> PRT
<213> Arabidopsis thaliana
<400> 22
Met Ala Thr Arg Ile Pro Phe Thr Glu Ser Gln Trp Glu Glu Leu Glu
1 5 10 15
Asn Gln Ala Leu Val Phe Lys Tyr Leu Ala Ala Asn Met Pro Val Pro
20 25 30
Pro His Leu Leu Phe Leu Ile Lys Arg Pro Phe Leu Phe Ser Ser Ser
35 40 45
Ser Ser Ser Ser Ser Ser Ser Ser Phe Phe Ser Pro Thr Leu Ser Pro
50 55 60
His Phe Gly Trp Asn Val Tyr Glu Met Gly Met Gly Arg Lys Ile Asp
65 70 75 80
Ala Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg Cys
85 90 95
Ser Lys Glu Ala Tyr Pro Asp Ser Lys Tyr Cys Glu Arg His Met His
100 105 110
Arg Gly Lys Asn Arg Ser Ser Ser Arg Lys Pro Pro Pro Thr Gln Phe
115 120 125
Thr Pro Asn Leu Phe Leu Asp Ser Ser Ser Arg Arg Arg Arg Ser Gly
130 135 140
Tyr Met Asp Asp Phe Phe Ser Ile Glu Pro Ser Gly Ser Ile Lys Ser
145 150 155 160
Cys Ser Gly Ser Ala Met Glu Asp Asn Asp Asp Gly Ser Cys Arg Gly
165 170 175
Ile Asn Asn Glu Glu Lys Gln Pro Asp Arg His Cys Phe Ile Leu Gly
180 185 190
Thr Asp Leu Arg Thr Arg Glu Arg Pro Leu Met Leu Glu Glu Lys Leu
195 200 205
Lys Gln Arg Asp His Asp Asn Glu Glu Glu Gln Gly Ser Lys Arg Phe
210 215 220
Tyr Arg Phe Leu Asp Glu Trp Pro Ser Ser Lys Ser Ser Val Ser Thr
225 230 235 240
Ser Leu Phe Ile
<210> 23
<211> 373
<212> PRT
<213> Vitis vinifera
<400> 23
Met Lys Gln Ser Phe Val Gly Val Ser Ser Gln Thr Glu Ala Glu Thr
1 5 10 15
Arg Ala Arg Glu Ser Asp Thr Gln Thr His Pro Gln Arg Asn Leu Ser
20 25 30
His Cys Pro Gln Lys His Pro Gln Asn Leu Gln Arg Thr Gln Asn Gly
35 40 45
Gly Ser Val Gly Gly Ala Ser Met Ala Thr Gly Met Met Val Arg Ser
50 55 60
Ser Pro Phe Thr Val Ser Gln Trp Gln Glu Leu Glu His Gln Ala Leu
65 70 75 80
Ile Phe Lys Tyr Leu Met Ala Gly Leu Pro Val Pro Pro Asp Leu Val
85 90 95
Leu Pro Ile Gln Lys Ser Phe Glu Ser Met Ser Ser Arg Phe Phe His
100 105 110
His Pro Thr Met Gly Tyr Cys Ser Phe Tyr Gly Lys Lys Val Asp Pro
115 120 125
Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg Cys Ser
130 135 140
Lys Asp Ala Tyr Pro Asp Ser Lys Tyr Cys Glu Arg His Met His Arg
145 150 155 160
Gly Arg Asn Arg Ser Arg Lys Pro Val Glu Ser Gln Thr Thr Thr Gln
165 170 175
Ser Ser Ser Thr Val Thr Ser Leu Thr Val Thr Gly Gly Ser Gly Ser
180 185 190
Gly Ala Gly Thr Phe Gln Asn Leu Pro Leu Gln Asn Phe Asn Asn Pro
195 200 205
Gln Gly Gly Pro Gly Ser Gly Thr Asn Gln Ser Gln Phe His Met Glu
210 215 220
Pro Ile Pro Tyr Gly Ile Pro Asn Lys Asp Tyr Arg Tyr Leu His Gly
225 230 235 240
Leu Lys Pro Glu Val Gly Glu His Ser Phe Phe Ser Glu Ala Ser Gly
245 250 255
Ser Asn Arg Gly Leu Gln Met Asp Ser Pro Leu Asp Ser Thr Trp Pro
260 265 270
Leu Met Pro Ser Arg Val Ser Ser Phe Pro Pro Ser Lys Ser Ser Glu
275 280 285
Asn Ser Ile Leu Gln Ser Asp Tyr Pro Gln His Ser Phe Phe Ser Ser
290 295 300
Glu Phe Ala Ser Gly Glu Val Val Lys Gln Glu Gly Gln Ser Leu Arg
305 310 315 320
Pro Phe Phe Asp Glu Trp Pro Lys Thr Arg Asp Ser Trp Ser Gly Leu
325 330 335
Glu Glu Glu Arg Ser Asn Gln Thr Ser Phe Ser Thr Thr Gln Leu Ser
340 345 350
Ile Ser Ile Pro Met Ala Ser Ser Asp Phe Ser Thr Thr Ser Ser Arg
355 360 365
Ser Pro His Asp Asn
370
<210> 24
<211> 210
<212> PRT
<213> Arabidopsis thaliana
<400> 24
Met Gln Gln His Leu Met Gln Met Gln Pro Met Met Ala Gly Tyr Tyr
1 5 10 15
Pro Ser Asn Val Thr Ser Asp His Ile Gln Gln Tyr Leu Asp Glu Asn
20 25 30
Lys Ser Leu Ile Leu Lys Ile Val Glu Ser Gln Asn Ser Gly Lys Leu
35 40 45
Ser Glu Cys Ala Glu Asn Gln Ala Arg Leu Gln Arg Asn Leu Met Tyr
50 55 60
Leu Ala Ala Ile Ala Asp Ser Gln Pro Gln Pro Pro Ser Val His Ser
65 70 75 80
Gln Tyr Gly Ser Ala Gly Gly Gly Met Ile Gln Gly Glu Gly Gly Ser
85 90 95
His Tyr Leu Gln Gln Gln Gln Ala Thr Gln Gln Gln Gln Met Thr Gln
100 105 110
Gln Ser Leu Met Ala Ala Arg Ser Ser Met Leu Tyr Ala Gln Gln Gln
115 120 125
Gln Gln Gln Gln Pro Tyr Ala Thr Leu Gln His Gln Gln Leu His His
130 135 140
Ser Gln Leu Gly Met Ser Ser Ser Ser Gly Gly Gly Gly Ser Ser Gly
145 150 155 160
Leu His Ile Leu Gln Gly Glu Ala Gly Gly Phe His Asp Phe Gly Arg
165 170 175
Gly Lys Pro Glu Met Gly Ser Gly Gly Gly Gly Glu Gly Arg Gly Gly
180 185 190
Ser Ser Gly Asp Gly Gly Glu Thr Leu Tyr Leu Lys Ser Ser Asp Asp
195 200 205
Gly Asn
210
<210> 25
<211> 195
<212> PRT
<213> Arabidopsis thaliana
<400> 25
Met Gln Gln Gln Gln Ser Pro Gln Met Phe Pro Met Val Pro Ser Ile
1 5 10 15
Pro Pro Ala Asn Asn Ile Thr Thr Glu Gln Ile Gln Lys Tyr Leu Asp
20 25 30
Glu Asn Lys Lys Leu Ile Met Ala Ile Met Glu Asn Gln Asn Leu Gly
35 40 45
Lys Leu Ala Glu Cys Ala Gln Tyr Gln Ala Leu Leu Gln Lys Asn Leu
50 55 60
Met Tyr Leu Ala Ala Ile Ala Asp Ala Gln Pro Pro Pro Pro Thr Pro
65 70 75 80
Gly Pro Ser Pro Ser Thr Ala Val Ala Ala Gln Met Ala Thr Pro His
85 90 95
Ser Gly Met Gln Pro Pro Ser Tyr Phe Met Gln His Pro Gln Ala Ser
100 105 110
Pro Ala Gly Ile Phe Ala Pro Arg Gly Pro Leu Gln Phe Gly Ser Pro
115 120 125
Leu Gln Phe Gln Asp Pro Gln Gln Gln Gln Gln Ile His Gln Gln Ala
130 135 140
Met Gln Gly His Met Gly Ile Arg Pro Met Gly Met Thr Asn Asn Gly
145 150 155 160
Met Gln His Ala Met Gln Gln Pro Glu Thr Gly Leu Gly Gly Asn Val
165 170 175
Gly Leu Arg Gly Gly Lys Gln Asp Gly Ala Asp Gly Gln Gly Lys Asp
180 185 190
Asp Gly Lys
195
<210> 26
<211> 223
<212> PRT
<213> Arabidopsis thaliana
<400> 26
Met Gln Gln Ser Pro Gln Met Ile Pro Met Val Leu Pro Ser Phe Pro
1 5 10 15
Pro Thr Asn Asn Ile Thr Thr Glu Gln Ile Gln Lys Tyr Leu Asp Glu
20 25 30
Asn Lys Lys Leu Ile Met Ala Ile Leu Glu Asn Gln Asn Leu Gly Lys
35 40 45
Leu Ala Glu Cys Ala Gln Tyr Gln Ala Leu Leu Gln Lys Asn Leu Met
50 55 60
Tyr Leu Ala Ala Ile Ala Asp Ala Gln Pro Gln Pro Pro Ala Ala Thr
65 70 75 80
Leu Thr Ser Gly Ala Met Thr Pro Gln Ala Met Ala Pro Asn Pro Ser
85 90 95
Ser Met Gln Pro Pro Pro Ser Tyr Phe Met Gln Gln His Gln Ala Val
100 105 110
Gly Met Ala Gln Gln Ile Pro Pro Gly Ile Phe Pro Pro Arg Gly Pro
115 120 125
Leu Gln Phe Gly Ser Pro His Gln Phe Leu Asp Pro Gln Gln Gln Leu
130 135 140
His Gln Gln Ala Met Gln Gly His Met Gly Ile Arg Pro Met Gly Leu
145 150 155 160
Asn Asn Asn Asn Gly Leu Gln His Gln Met His His His Glu Thr Ala
165 170 175
Leu Ala Ala Asn Asn Ala Gly Pro Asn Asp Ala Ser Gly Gly Gly Lys
180 185 190
Pro Asp Gly Thr Asn Met Ser Gln Ser Gly Ala Asp Gly Gln Gly Gly
195 200 205
Ser Ala Ala Arg His Gly Gly Gly Asp Ala Lys Thr Glu Gly Lys
210 215 220
<210> 27
<211> 226
<212> PRT
<213> Triticum aestivum
<400> 27
Met Gln Gln Gln His Leu Met Gln Met Asn Gln Ser Met Met Gly Gly
1 5 10 15
Tyr Ala Ser Ser Thr Thr Val Thr Thr Asp Leu Ile Gln Gln Tyr Leu
20 25 30
Asp Glu Asn Lys Gln Leu Ile Leu Ala Ile Leu Asp Asn Gln Asn Asn
35 40 45
Gly Lys Val Glu Glu Cys Ala Arg Asn Gln Ala Lys Leu Gln Gln Asn
50 55 60
Leu Met Tyr Leu Ala Ala Ile Ala Asp Ser Gln Pro Pro Gln Thr Ala
65 70 75 80
Ser Leu Ser Gln Tyr Pro Ser Asn Leu Met Met Gln Ser Gly Pro Arg
85 90 95
Tyr Met Gln Gln Gln Ser Ala Gln Met Met Ser Pro Gln Ser Leu Met
100 105 110
Ala Ala Arg Ser Ser Met Met Tyr Ala Gln Gln Ala Met Ser Pro Leu
115 120 125
Gln Gln Gln Gln Gln Gln Gln Gln Gln His Gln Ala Ala Ser His Gly
130 135 140
Gln Leu Gly Met Ser Ser Gly Ala Thr Thr Gly Phe Asn Leu Leu His
145 150 155 160
Gly Glu Ala Ser Met Gly Gly Gly Gly Gly Ala Thr Gly Asn Ser Met
165 170 175
Met Asn Ala Ser Val Phe Ser Asp Tyr Gly Arg Gly Gly Ser Gly Ala
180 185 190
Lys Glu Gly Ser Thr Ser Leu Ser Ala Asp Ala Arg Gly Ala Asn Ser
195 200 205
Gly Ala His Ser Gly Asp Gly Glu Tyr Leu Lys Gly Thr Glu Glu Glu
210 215 220
Gly Ser
225
<210> 28
<211> 201
<212> PRT
<213> Triticum aestivum
<400> 28
Met Gln Gln Ala Met Ser Leu Pro Pro Gly Ala Val Gly Ala Val Ser
1 5 10 15
Pro Pro Ala Gly Ile Thr Thr Glu Gln Ile Gln Lys Tyr Leu Asp Glu
20 25 30
Asn Lys Gln Leu Ile Leu Ala Ile Leu Glu Asn Gln Asn Leu Gly Lys
35 40 45
Leu Ala Glu Cys Ala Gln Tyr Gln Ala Gln Leu Gln Lys Asn Leu Leu
50 55 60
Tyr Leu Ala Ala Ile Ala Asp Ala Gln Pro Pro Gln Asn Pro Ala Ser
65 70 75 80
Arg Pro Gln Met Val Gln Gln Pro Gly Ser Met Gln Gly Ala Gly His
85 90 95
Tyr Met Ser Gln Val Pro Met Phe Pro Pro Arg Thr Pro Leu Thr Pro
100 105 110
Gln Gln Met Gln Glu Gln Gln His Gln Gln Leu Gln Gln Gln Gln Ala
115 120 125
Gln Ala Leu Ser Phe Pro Ala Gln Val Val Met Arg Pro Gly Thr Val
130 135 140
Asn Gly Met Gln Gln Pro Met Gln Ala Ala Gly Asp Leu Gln Pro Ala
145 150 155 160
Ala Ala Pro Gly Gly Ser Lys Gln Asp Ala Ala Val Ala Gly Ala Ser
165 170 175
Ser Glu Pro Ser Gly Thr Lys Ser His Lys Asn Ala Gly Ala Glu Glu
180 185 190
Val Gly Ala Asp Val Ala Glu Gln Ser
195 200
<210> 29
<211> 185
<212> PRT
<213> Triticum aestivum
<400> 29
Met Gln Gln Ala Met Pro Met Pro Pro Ala Ala Ala Ala Pro Gly Met
1 5 10 15
Pro Pro Ser Ala Gly Leu Ser Thr Glu Gln Ile Gln Lys Tyr Leu Asp
20 25 30
Glu Asn Lys Gln Leu Ile Leu Ala Ile Leu Glu Asn Gln Asn Leu Gly
35 40 45
Lys Leu Ala Glu Cys Ala Gln Tyr Gln Ala Gln Leu Gln Lys Asn Leu
50 55 60
Leu Tyr Leu Ala Ala Ile Ala Asp Thr Gln Pro Gln Thr Thr Val Ser
65 70 75 80
Arg Pro Gln Met Ala Pro Pro Ser Ala Ser Pro Gly Ala Gly His Tyr
85 90 95
Met Ser Gln Val Pro Met Phe Pro Pro Arg Thr Pro Leu Thr Pro Gln
100 105 110
Gln Met Gln Glu Gln Gln Leu Gln Gln Gln Gln Ala Gln Met Leu Pro
115 120 125
Phe Ala Gly Gln Met Val Ala Arg Pro Gly Ala Val Asn Gly Met Pro
130 135 140
Gln Ala Pro Gln Val Glu Pro Ala Tyr Ala Ala Gly Gly Ala Ser Ser
145 150 155 160
Glu Pro Ser Gly Thr Glu Ser His Arg Ser Thr Gly Ala Asp Asn Asp
165 170 175
Gly Gly Ser Gly Leu Ala Asp Gln Ser
180 185
<210> 30
<211> 188
<212> PRT
<213> Vitis vinifera
<400> 30
Met Gln Pro Met Met Ala Ala Tyr Tyr Pro Ser Asn Val Thr Thr Asp
1 5 10 15
His Ile Gln Gln Tyr Leu Asp Glu Asn Lys Ser Leu Ile Leu Lys Ile
20 25 30
Val Glu Ser Gln Asn Ser Gly Lys Leu Thr Glu Cys Ala Glu Asn Gln
35 40 45
Ala Arg Leu Gln Arg Asn Leu Met Tyr Leu Ala Ala Ile Ala Asp Ser
50 55 60
Gln Pro Gln Pro Pro Thr Met His Ala Gln Phe Pro Pro Ser Gly Ile
65 70 75 80
Val Gln Pro Gly Ala His Tyr Met Gln His Gln Gln Ala Gln Gln Met
85 90 95
Thr Pro Gln Ser Leu Leu Ala Ala Arg Ser Ser Met Leu Tyr Ser Gln
100 105 110
Gln Pro Phe Ser Ala Leu Gln Gln Gln Gln Ala Ile His Ser Gln Leu
115 120 125
Gly Met Gly Ser Gly Gly Ser Ala Gly Leu His Met Leu Gln Ser Glu
130 135 140
Gly Ser Asn Pro Gly Gly Asn Gly Thr Leu Gly Thr Asp Met Gly Asn
145 150 155 160
Ala Glu Gly Arg Gly Gly Asn Ser Gly Gly Gln Gly Gly Asp Gly Gly
165 170 175
Glu Thr Leu Tyr Leu Lys Ala Ala Glu Asp Gly Asn
180 185
<210> 31
<211> 843
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 31
Met Lys Gln Ser Phe Val Gly Val Ser Ser Gln Thr Glu Ala Glu Thr
1 5 10 15
Arg Ala Arg Glu Ser Asp Thr Gln Thr His Pro Gln Arg Asn Leu Ser
20 25 30
His Cys Pro Gln Lys His Pro Gln Asn Leu Gln Arg Thr Gln Asn Gly
35 40 45
Gly Ser Val Gly Gly Ala Ser Met Ala Thr Gly Met Met Val Arg Ser
50 55 60
Ser Pro Phe Thr Val Ser Gln Trp Gln Glu Leu Glu His Gln Ala Leu
65 70 75 80
Ile Phe Lys Tyr Leu Met Ala Gly Leu Pro Val Pro Pro Asp Leu Val
85 90 95
Leu Pro Ile Gln Lys Ser Phe Glu Ser Met Ser Ser Arg Phe Phe His
100 105 110
His Pro Thr Met Gly Tyr Cys Ser Phe Tyr Gly Lys Lys Val Asp Pro
115 120 125
Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg Cys Ser
130 135 140
Lys Asp Ala Tyr Pro Asp Ser Lys Tyr Cys Glu Arg His Met His Arg
145 150 155 160
Gly Arg Asn Arg Ser Arg Lys Pro Val Glu Ser Gln Thr Thr Thr Gln
165 170 175
Ser Ser Ser Thr Val Thr Ser Leu Thr Val Thr Gly Gly Ser Gly Ser
180 185 190
Gly Ala Gly Thr Phe Gln Asn Leu Pro Leu Gln Asn Phe Asn Asn Pro
195 200 205
Gln Gly Gly Pro Gly Ser Gly Thr Asn Gln Ser Gln Phe His Met Glu
210 215 220
Pro Ile Pro Tyr Gly Ile Pro Asn Lys Asp Tyr Arg Tyr Leu His Gly
225 230 235 240
Leu Lys Pro Glu Val Gly Glu His Ser Phe Phe Ser Glu Ala Ser Gly
245 250 255
Ser Asn Arg Gly Leu Gln Met Asp Ser Pro Leu Asp Ser Thr Trp Pro
260 265 270
Leu Met Pro Ser Arg Val Ser Ser Phe Pro Pro Ser Lys Ser Ser Glu
275 280 285
Asn Ser Ile Leu Gln Ser Asp Tyr Pro Gln His Ser Phe Phe Ser Ser
290 295 300
Glu Phe Ala Ser Gly Glu Val Val Lys Gln Glu Gly Gln Ser Leu Arg
305 310 315 320
Pro Phe Phe Asp Glu Trp Pro Lys Thr Arg Asp Ser Trp Ser Gly Leu
325 330 335
Glu Glu Glu Arg Ser Asn Gln Thr Ser Phe Ser Thr Thr Gln Leu Ser
340 345 350
Ile Ser Ile Pro Met Ala Ser Ser Asp Phe Ser Thr Thr Ser Ser Arg
355 360 365
Ser Pro His Asp Asn Ala Ala Ile Gln Gln Ala Thr Ala Gly Val Ser
370 375 380
Gln Asp Thr Ser Glu Asn Pro Asn Lys Thr Ile Val Pro Ala Ala Leu
385 390 395 400
Pro Gln Leu Thr Pro Thr Leu Val Ser Leu Leu Glu Val Ile Glu Pro
405 410 415
Glu Val Leu Tyr Ala Gly Tyr Asp Ser Ser Val Pro Asp Ser Ala Trp
420 425 430
Arg Ile Met Thr Thr Leu Asn Met Leu Gly Gly Arg Gln Val Ile Ala
435 440 445
Ala Val Lys Trp Ala Lys Ala Ile Pro Gly Phe Arg Asn Leu His Leu
450 455 460
Asp Asp Gln Met Thr Leu Leu Gln Tyr Ser Trp Met Phe Leu Met Ala
465 470 475 480
Phe Ala Leu Gly Trp Arg Ser Tyr Arg Gln Ser Ser Gly Asn Leu Leu
485 490 495
Cys Phe Ala Pro Asp Leu Ile Ile Asn Glu Gln Arg Met Ser Leu Pro
500 505 510
Cys Met Tyr Asp Gln Cys Lys His Met Leu Phe Val Ser Ser Glu Leu
515 520 525
Gln Arg Leu Gln Val Ser Tyr Glu Glu Tyr Leu Cys Met Lys Thr Leu
530 535 540
Leu Leu Leu Ser Ser Val Pro Lys Glu Gly Leu Lys Ser Gln Glu Leu
545 550 555 560
Phe Asp Glu Ile Arg Met Thr Tyr Ile Lys Glu Leu Gly Lys Ala Ile
565 570 575
Val Lys Arg Glu Gly Asn Ser Ser Gln Asn Trp Gln Arg Phe Tyr Gln
580 585 590
Leu Thr Lys Leu Leu Asp Ser Met His Glu Val Val Glu Asn Leu Leu
595 600 605
Thr Tyr Cys Phe Gln Thr Phe Leu Asp Lys Thr Met Ser Ile Glu Phe
610 615 620
Pro Glu Met Leu Ala Glu Ile Ile Thr Asn Gln Ile Pro Lys Tyr Ser
625 630 635 640
Asn Gly Asn Ile Lys Lys Leu Leu Phe His Gln Lys Ala Ala Ala Met
645 650 655
Gln Pro Met Met Ala Ala Tyr Tyr Pro Ser Asn Val Thr Thr Asp His
660 665 670
Ile Gln Gln Tyr Leu Asp Glu Asn Lys Ser Leu Ile Leu Lys Ile Val
675 680 685
Glu Ser Gln Asn Ser Gly Lys Leu Thr Glu Cys Ala Glu Asn Gln Ala
690 695 700
Arg Leu Gln Arg Asn Leu Met Tyr Leu Ala Ala Ile Ala Asp Ser Gln
705 710 715 720
Pro Gln Pro Pro Thr Met His Ala Gln Phe Pro Pro Ser Gly Ile Val
725 730 735
Gln Pro Gly Ala His Tyr Met Gln His Gln Gln Ala Gln Gln Met Thr
740 745 750
Pro Gln Ser Leu Leu Ala Ala Arg Ser Ser Met Leu Tyr Ser Gln Gln
755 760 765
Pro Phe Ser Ala Leu Gln Gln Gln Gln Ala Ile His Ser Gln Leu Gly
770 775 780
Met Gly Ser Gly Gly Ser Ala Gly Leu His Met Leu Gln Ser Glu Gly
785 790 795 800
Ser Asn Pro Gly Gly Asn Gly Thr Leu Gly Thr Asp Met Gly Asn Ala
805 810 815
Glu Gly Arg Gly Gly Asn Ser Gly Gly Gln Gly Gly Asp Gly Gly Glu
820 825 830
Thr Leu Tyr Leu Lys Ala Ala Glu Asp Gly Asn
835 840
<210> 32
<211> 917
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 32
Met Ala Met Pro Tyr Ala Ser Leu Ser Pro Ala Gly Asp Arg Arg Ser
1 5 10 15
Ser Pro Ala Ala Thr Ala Thr Ala Ser Leu Leu Pro Phe Cys Arg Ser
20 25 30
Ser Pro Phe Ser Ala Gly Gly Asn Gly Gly Met Gly Glu Glu Ala Arg
35 40 45
Met Asp Gly Arg Trp Met Ala Arg Pro Val Pro Phe Thr Ala Ala Gln
50 55 60
Tyr Glu Glu Leu Glu His Gln Ala Leu Ile Tyr Lys Tyr Leu Val Ala
65 70 75 80
Gly Val Ser Val Pro Pro Asp Leu Val Leu Pro Ile Arg Arg Gly Ile
85 90 95
Glu Ser Leu Ala Ala Arg Phe Tyr His Asn Pro Leu Ala Ile Gly Tyr
100 105 110
Gly Ser Tyr Leu Gly Lys Lys Val Asp Pro Glu Pro Gly Arg Cys Arg
115 120 125
Arg Thr Asp Gly Lys Lys Trp Arg Cys Ala Lys Glu Ala Ala Ser Asp
130 135 140
Ser Lys Tyr Cys Glu Arg His Met His Arg Gly Arg Asn Arg Ser Arg
145 150 155 160
Lys Pro Val Glu Thr Gln Leu Val Ser His Ser Gln Pro Pro Ala Ala
165 170 175
Ser Val Val Pro Pro Leu Ala Thr Gly Phe His Asn His Ser Leu Tyr
180 185 190
Pro Ala Ile Gly Gly Thr Asn Gly Gly Gly Gly Gly Gly Asn Asn Gly
195 200 205
Met Ser Met Pro Gly Thr Phe Ser Ser Ala Leu Gly Pro Pro Gln Gln
210 215 220
His Met Gly Asn Asn Ala Ala Ser Pro Tyr Ala Ala Leu Gly Gly Ala
225 230 235 240
Gly Thr Cys Lys Asp Phe Arg Tyr Thr Ala Tyr Gly Ile Arg Ser Leu
245 250 255
Ala Asp Glu Gln Ser Gln Leu Met Thr Glu Ala Met Asn Thr Ser Val
260 265 270
Glu Asn Pro Trp Arg Leu Pro Pro Ser Ser Gln Thr Thr Thr Phe Pro
275 280 285
Leu Ser Ser Tyr Ser Pro Gln Leu Gly Ala Thr Ser Asp Leu Gly Gln
290 295 300
Asn Asn Ser Ser Asn Asn Asn Ser Gly Val Lys Ala Glu Gly Gln Gln
305 310 315 320
Gln Gln Gln Pro Leu Ser Phe Pro Gly Cys Gly Asp Phe Gly Ser Gly
325 330 335
Asp Ser Ala Lys Gln Glu Asn Gln Thr Leu Arg Pro Phe Phe Asp Glu
340 345 350
Trp Pro Lys Thr Arg Asp Ser Trp Ser Asp Leu Thr Asp Asp Asn Ser
355 360 365
Asn Val Ala Ser Phe Ser Ala Thr Gln Leu Ser Ile Ser Ile Pro Met
370 375 380
Thr Ser Ser Asp Phe Ser Ala Ala Ser Ser Gln Ser Pro Asn Gly Met
385 390 395 400
Leu Phe Ala Gly Glu Met Tyr Ala Ala Ala Ala Ile Gln Gln Ala Thr
405 410 415
Ala Gly Val Ser Gln Asp Thr Ser Glu Asn Pro Asn Lys Thr Ile Val
420 425 430
Pro Ala Ala Leu Pro Gln Leu Thr Pro Thr Leu Val Ser Leu Leu Glu
435 440 445
Val Ile Glu Pro Glu Val Leu Tyr Ala Gly Tyr Asp Ser Ser Val Pro
450 455 460
Asp Ser Ala Trp Arg Ile Met Thr Thr Leu Asn Met Leu Gly Gly Arg
465 470 475 480
Gln Val Ile Ala Ala Val Lys Trp Ala Lys Ala Ile Pro Gly Leu Arg
485 490 495
Asn Leu His Leu Asp Asp Gln Met Thr Leu Leu Gln Tyr Ser Trp Met
500 505 510
Phe Leu Met Ala Phe Ala Leu Gly Trp Arg Ser Tyr Arg Gln Ser Ser
515 520 525
Gly Asn Leu Leu Cys Phe Ala Pro Asp Leu Ile Ile Asn Glu Gln Arg
530 535 540
Met Ser Leu Pro Cys Met Tyr Asp Gln Cys Lys His Met Leu Phe Val
545 550 555 560
Ser Ser Glu Leu Gln Arg Leu Gln Val Ser Tyr Glu Glu Tyr Leu Cys
565 570 575
Met Lys Thr Leu Leu Leu Leu Ser Ser Val Pro Lys Glu Gly Leu Lys
580 585 590
Ser Gln Glu Leu Phe Asp Glu Ile Arg Met Thr Tyr Ile Lys Glu Leu
595 600 605
Gly Lys Ala Ile Val Lys Arg Glu Gly Asn Ser Ser Gln Asn Trp Gln
610 615 620
Arg Phe Tyr Gln Leu Thr Lys Leu Leu Asp Ser Met His Glu Val Val
625 630 635 640
Glu Asn Leu Leu Thr Tyr Cys Phe Gln Thr Phe Leu Asp Lys Thr Met
645 650 655
Ser Ile Glu Phe Pro Glu Met Leu Ala Glu Ile Ile Thr Asn Gln Ile
660 665 670
Pro Lys Tyr Ser Asn Gly Asn Ile Lys Lys Leu Leu Phe His Gln Lys
675 680 685
Ala Ala Ala Ala Met Gln Gln Gln His Leu Met Gln Met Asn Gln Ser
690 695 700
Met Met Gly Gly Tyr Ala Ser Ser Thr Thr Ala Thr Thr Asp Leu Ile
705 710 715 720
Gln Gln Tyr Leu Asp Glu Asn Lys Gln Leu Ile Leu Ala Ile Leu Asp
725 730 735
Asn Gln Asn Asn Gly Lys Val Glu Glu Cys Ala Arg Asn Gln Ala Lys
740 745 750
Leu Gln Gln Asn Leu Met Tyr Leu Ala Ala Ile Ala Asp Ser Gln Pro
755 760 765
Pro Gln Thr Ala Ser Leu Ser Gln Tyr Pro Ser Asn Leu Met Met Gln
770 775 780
Ser Gly Pro Arg Tyr Met Gln Gln Gln Ser Ala Gln Met Met Ser Pro
785 790 795 800
Gln Ser Leu Met Ala Ala Arg Ser Ser Met Met Tyr Ala Gln Gln Ala
805 810 815
Met Ser Pro Leu Gln Gln Gln Gln Gln Gln Gln Gln His Gln Ala Ala
820 825 830
Ala His Gly Gln Leu Gly Met Ser Ser Gly Ala Thr Thr Gly Phe Asn
835 840 845
Leu Leu His Gly Glu Ala Ser Met Gly Gly Gly Gly Gly Ala Thr Gly
850 855 860
Asn Ser Met Met Asn Ala Ser Val Phe Ser Asp Tyr Gly Arg Gly Gly
865 870 875 880
Ser Gly Ala Lys Glu Gly Ser Thr Ser Leu Ser Ala Asp Ala Arg Gly
885 890 895
Ala Asn Ser Gly Ala His Ser Gly Asp Gly Glu Tyr Leu Lys Gly Thr
900 905 910
Glu Glu Glu Gly Ser
915
<210> 33
<211> 18166
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 33
aagctttgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60
agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120
tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180
tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240
gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300
ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360
gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt 420
agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480
taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540
aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600
cgagtctaac ggacaccaac cagcgaacca gcagcgtcgc gtcgggccaa gcgaagcaga 660
cggcacggca tctctgtcgc tgcctctgga cccctctcga gagttccgct ccaccgttgg 720
acttgctccg ctgtcggcat ccagaaattg cgtggcggag cggcagacgt gagccggcac 780
ggcaggcggc ctcctcctcc tctcacggca cggcagctac gggggattcc tttcccaccg 840
ctccttcgct ttcccttcct cgcccgccgt aataaataga caccccctcc acaccctctt 900
tccccaacct cgtgttgttc ggagcgcaca cacacacaac cagatctccc ccaaatccac 960
ccgtcggcac ctccgcttca aggtacgccg ctcgtcctcc cccccccccc ctctctacct 1020
tctctagatc ggcgttccgg tccatggtta gggcccggta gttctacttc tgttcatgtt 1080
tgtgttagat ccgtgtttgt gttagatccg tgctgctagc gttcgtacac ggatgcgacc 1140
tgtacgtcag acacgttctg attgctaact tgccagtgtt tctctttggg gaatcctggg 1200
atggctctag ccgttccgca gacgggatcg atttcatgat tttttttgtt tcgttgcata 1260
gggtttggtt tgcccttttc ctttatttca atatatgccg tgcacttgtt tgtcgggtca 1320
tcttttcatg cttttttttg tcttggttgt gatgatgtgg tctggttggg cggtcgttct 1380
agatcggagt agaattctgt ttcaaactac ctggtggatt tattaatttt ggatctgtat 1440
gtgtgtgcca tacatattca tagttacgaa ttgaagatga tggatggaaa tatcgatcta 1500
ggataggtat acatgttgat gcgggtttta ctgatgcata tacagagatg ctttttgttc 1560
gcttggttgt gatgatgtgg tgtggttggg cggtcgttca ttcgttctag atcggagtag 1620
aatactgttt caaactacct ggtgtattta ttaattttgg aactgtatgt gtgtgtcata 1680
catcttcata gttacgagtt taagatggat ggaaatatcg atctaggata ggtatacatg 1740
ttgatgtggg ttttactgat gcatatacat gatggcatat gcagcatcta ttcatatgct 1800
ctaaccttga gtacctatct attataataa acaagtatgt tttataatta ttttgatctt 1860
gatatacttg gatgatggca tatgcagcag ctatatgtgg atttttttag ccctgccttc 1920
atacgctatt tatttgcttg gtactgtttc ttttgtcgat gctcaccctg ttgtttggtg 1980
ttacttctgc agactagagt tatcaacaag tttgtacaaa aaagctgcca ccatggcgat 2040
gccgtatgcc tctctttccc cggcaggcga ccgccgctcc tccccggccg ccaccgccac 2100
cgcctccctc ctccccttct gccgctcctc ccccttctcc gccggcggca atggcggcat 2160
gggggaggag gcgcggatgg acgggaggtg gatggcgagg ccggtgccct tcacggcggc 2220
gcagtacgag gagctggagc accaggcgct catatacaag tacctggtgg ccggcgtgtc 2280
cgtcccgccg gatctcgtgc tccccatccg ccgcggcatc gagtccctcg ccgcccgctt 2340
ctaccacaac cccctcgcca tcgggtacgg atcgtacctg ggcaagaagg tggatccgga 2400
gccgggccgg tgccggcgca cggacggcaa gaagtggcgg tgcgccaagg aggccgcctc 2460
cgactccaag tattgcgagc gccacatgca ccgcggccgc aaccgttcaa gaaagcctgt 2520
ggaaacgcag ctcgtctcgc actcccagcc gccggccgcc tccgtcgtgc cgcccctcgc 2580
caccggcttc cacaaccact ccctctaccc cgccatcggc ggcaccaacg gtggtggagg 2640
cggggggaac aacggcatgt ccatgcccgg cacgttctcc tccgcgctgg ggccgcctca 2700
gcagcacatg ggcaacaatg ccgcctctcc ctacgcggct ctcggcggcg ccggaacatg 2760
caaagatttc aggtataccg catatggaat aagatctttg gcagacgagc agagtcagct 2820
catgacagaa gccatgaaca cctccgtgga gaacccatgg cgcctgccgc catcttctca 2880
aacgactaca ttcccgctct caagctactc tcctcagctt ggagcaacga gtgacctggg 2940
tcagaacaac agcagcaaca acaacagcgg cgtcaaggcc gagggacagc agcagcagca 3000
gccgctctcc ttcccggggt gcggcgactt cggcagcggc gactccgcga agcaggagaa 3060
ccagacgctg cggccgttct tcgacgagtg gccgaagacg agggactcgt ggtcggacct 3120
gaccgacgac aactcgaacg tcgcctcctt ctcggccacc cagctgtcga tctcgatacc 3180
catgacgtcc tccgacttct ccgccgccag ctcccagtcg cccaacggca tgctgttcgc 3240
cggcgaaatg tacgcggccg ctgccattca gcaagccact gcaggagtct cacaagacac 3300
ttcggaaaat cctaacaaaa caatagttcc tgctgcatta ccacagctca cccctacctt 3360
ggtgtcactg ctggaggtga ttgaacccga ggtgttgtat gcaggatatg atagctctgt 3420
tccagattca gcatggagaa ttatgaccac actcaacatg ttaggtgggc gtcaagtgat 3480
tgcagcagtg aaatgggcaa aggcgatacc aggcttcaga aacttacacc tggatgacca 3540
aatgaccctg ctacagtact catggatgtt tctcatggca tttgccctgg gttggagatc 3600
atacagacaa tcaagtggaa acctgctctg ctttgctcct gatctgatta ttaatgagca 3660
gagaatgtct ctaccctgca tgtatgacca atgtaaacac atgctgtttg tctcctctga 3720
attacaaaga ttgcaggtat cctatgaaga gtatctctgt atgaaaacct tactgcttct 3780
ctcctcagtt cctaaggaag gtctgaagag ccaagagtta tttgatgaga ttcgaatgac 3840
ttatatcaaa gagctaggaa aagccatcgt caaaagggaa gggaactcca gtcagaactg 3900
gcaacggttt taccaactga caaagcttct ggactccatg catgaggtgg ttgagaatct 3960
ccttacctac tgcttccaga catttttgga taagaccatg agtattgaat tcccagagat 4020
gttagctgaa atcatcacta atcagatacc aaaatattca aatggaaata tcaaaaagct 4080
tctgtttcat caaaaagctg ccgcggccat gcagcagcaa cacctgatgc agatgaacca 4140
gagcatgatg gggggctacg cttcctctac cactgccacc actgatctca ttcagcagta 4200
cctggatgag aacaagcagc tgatcctggc catcctcgac aaccagaaca acggcaaggt 4260
ggaggagtgc gcacggaacc aagctaagct ccagcagaac ctcatgtacc tcgccgccat 4320
cgccgacagc cagcctccgc agacggcatc gctgtctcag tacccgtcca acctgatgat 4380
gcagtccggg ccgcggtaca tgcagcagca gtcggcgcag atgatgtcgc cgcagtcgct 4440
gatggcggcg cggtcgtcga tgatgtacgc gcagcaggcc atgtcgccgc tccagcagca 4500
gcagcagcag cagcagcacc aggcggccgc gcacggccag ctggggatgt cctccggcgc 4560
gaccaccggg ttcaacctcc tgcacggtga ggccagcatg ggcggcggcg gcggcgccac 4620
tggcaacagc atgatgaacg ccagcgtctt ctcggactat ggccgcggcg gcagcggcgc 4680
caaggagggg tcgacctcgc tgtcggccga cgctcgcggc gccaactctg gcgcgcacag 4740
cggcgacggg gagtacctca agggcaccga ggaggaagga agctagcgtt cagctttctt 4800
gtacaaagtg gttgataaca gcgggttaat taacatcttt tacccatacg atgttcctga 4860
ctatgcgggc tatccctatg acgtcccgga ctatgcagga tcctatccat atgacgttcc 4920
agattacgct gctcagtgca gcgcttagag ctcgaatttc cccgatcgtt caaacatttg 4980
gcaataaagt ttcttaagat tgaatcctgt tgccggtctt gcgatgatta tcatataatt 5040
tctgttgaat tacgttaagc atgtaataat taacatgtaa tgcatgacgt tatttatgag 5100
atgggttttt atgattagag tcccgcaatt atacatttaa tacgcgatag aaaacaaaat 5160
atagcgcgca aactaggata aattatcgcg cgcggtgtca tctatgttac tagatcggga 5220
attagcacta gtaacggccg ccagtgtgct ggaattcgcc ctttaaataa gcgatcgccc 5280
cgagcaataa tctccaggag atcaaatacc ttcccaagaa ggttaaagat gcagtcaaaa 5340
gattcaggac taactgcatc aagaacacag agaaagatat atttctcaag atcagaagta 5400
ctattccagt atggacgatt caaggcttgc ttcacaaacc aaggcaagta atagagattg 5460
gagtctctaa aaaggtagtt cccactgaat caaaggccat ggagtcaaag attcaaatag 5520
aggacctaac agaactcgcc gtaaagactg gcgaacagtt catacagagt ctcttacgac 5580
tcaatgacaa gaagaaaatc ttcgtcaaca tggtggagca cgacacgctt gtctactcca 5640
aaaatatcaa agatacagtc tcagaagacc aaagggcaat tgagactttt caacaaaggg 5700
taatatccgg aaacctcctc ggattccatt gcccagctat ctgtcacttt attgtgaaga 5760
tagtggaaaa ggaaggtggc tcctacaaat gccatcattg cgataaagga aaggccatcg 5820
ttgaagatgc ctctgccgac agtggtccca aagatggacc cccacccacg aggagcatcg 5880
tggaaaaaga agacgttcca accacgtctt caaagcaagt ggattgatgt gatatctcca 5940
ctgacgtaag ggatgacgca caatcccact atccttcgca agacccttcc tctatataag 6000
gaagttcatt tcatttggag aggacacggg ggactctaga ggatccccga tccctacagg 6060
gtaaatttct agtttttctc cttcattttc ttggttagga cccttttctc tttttatttt 6120
tttgagcttt gatctttctt taaactgatc tattttttaa ttgattggtt atggtgtaaa 6180
tattacatag ctttaactga taatctgatt actttatttc gtgtgtctat gatgatgatg 6240
atagttacag aaccgtcgag ggggatccaa tgagatatga aaaagcctga actcaccgcg 6300
acgtctgtcg agaagtttct gatcgaaaag ttcgacagcg tctccgacct gatgcagctc 6360
tcggagggcg aagaatctcg tgctttcagc ttcgatgtag gagggcgtgg atatgtcctg 6420
cgggtaaata gctgcgccga tggtttctac aaagatcgtt atgtttatcg gcactttgca 6480
tcggccgcgc tcccgattcc ggaagtgctt gacattgggg aattcagcga gagcctgacc 6540
tattgcatct cccgccgtgc acagggtgtc acgttgcaag acctgcctga aaccgaactg 6600
cccgctgttc tgcagccggt cgcggaggcc atggatgcga tcgctgcggc cgatcttagc 6660
cagacgagcg ggttcggccc attcggaccg caaggaatcg gtcaatacac tacatggcgt 6720
gatttcatat gcgcgattgc tgatccccat gtgtatcact ggcaaactgt gatggacgac 6780
accgtcagtg cgtccgtcgc gcaggctctc gatgagctga tgctttgggc cgaggactgc 6840
cccgaagtcc ggcacctcgt gcacgcggat ttcggctcca acaatgtcct gacggacaat 6900
ggccgcataa cagcggtcat tgactggagc gaggcgatgt tcggggattc ccaatacgag 6960
gtcgccaaca tcttcttctg gaggccgtgg ttggcttgta tggagcagca gacgcgctac 7020
ttcgagcgga ggcatccgga gcttgcagga tcgccgcggc tccgggcgta tatgctccgc 7080
attggtcttg accaactcta tcagagcttg gttgacggca atttcgatga tgcagcttgg 7140
gcgcagggtc gatgcgacgc aatcgtccga tccggagccg ggactgtcgg gcgtacacaa 7200
atcgcccgca gaagcgcggc cgtctggacc gatggctgtg tagaagtact cgccgatagt 7260
ggaaaccgac gccccagcac tcgtccgagg gcaaaggaat agagtagatt taaattgaaa 7320
tcaccagtct ctctctacaa atctatctct ctctataata atgtgtgagt agttcccaga 7380
taagggaatt agggttctta tagggtttcg ctcatgtgtt gagcatataa gaaaccctta 7440
gtatgtattt gtatttgtaa aatacttcta tcaataaaat ttctaattcc taaaaccaaa 7500
atccagtggg cgcggggtac cgagctcgaa ttcagtacat taaaaacgtc cgcaatgtgt 7560
tattaagttg tctaagcgtc aatttgttta caccacaata tatcctgcca ccagccagcc 7620
aacagctccc cgaccggcag ctcggcacaa aatcaccact cgatacaggc agcccatcag 7680
tccgggacgg cgtcagcggg agagccgttg taaggcggca gactttgctc atgttaccga 7740
tgctattcgg aagaacggca actaagctgc cgggtttgaa acacggatga tctcgcggag 7800
ggtagcatgt tgattgtaac gatgacagag cgttgctgcc tgtgatcaaa tatcatctcc 7860
ctcgcagaga tccgaattat cagccttctt attcatttct cgcttaaccg tgacaggctg 7920
tcgatcttga gaactatgcc gacataatag gaaatcgctg gataaagccg ctgaggaagc 7980
tgagtggcgc tatttcttta gaagtgaacg ttgacgatcg tcgaccgtac cccgatgaat 8040
taattcggac gtacgttctg aacacagctg gatacttact tgggcgattg tcatacatga 8100
catcaacaat gtacccgttt gtgtaaccgt ctcttggagg ttcgtatgac actaggtcgc 8160
taccttagga ccgttatagt tactagcgaa ttgacatgag gttgccccgt attcagtgtc 8220
gctgatttgt attgtctgaa gttgttttta cgttaagttg atgcagatca attaatacga 8280
tacctgcgtc ataattgatt atttgacgtg gtttgatggc ctccacgcac gttgtgatat 8340
gtagatgata atcattatca ctttacgggt cctttccggt gatccgacag gttacggggc 8400
ggcgacctcg cgggttttcg ctatttatga aaattttccg gtttaaggcg tttccgttct 8460
tcttcgtcat aacttaatgt ttttatttaa aataccctct gaaaagaaag gaaacgacag 8520
gtgctgaaag cgagcttttt ggcctctgtc gtttcctttc tctgtttttg tccgtggaat 8580
gaacaatgga aggatcttct cggcggcgat cacgacgccg gccctgcgga gccttcgccg 8640
cgtgcgcgat tcatggcggc cgtggaggcc aaggatttcg cgcgagtgca agagctgatc 8700
gaggcgcgtg gagccaagtc ggcggctgat tatgtccttg cgcagctcgc cgtggccgaa 8760
ggtctggacc gcaagcctgg tgcgcgcgtc gtggtcggga aagcggcggg cagcatggca 8820
atgccgcctg cggcgctggg ttttacgcca aggggagaag cggcatacgc catcgagcgg 8880
tcagcctatg gtgagccgag gtccagcatt gcgaagcagt accagcagga atggaaccgg 8940
aaggcggcga cctggtgggc gatggccggt gtggccggca tcatcggcgc gatcctggcg 9000
gcggcggcaa ccggctttgt tgggctggca gtgtcgatcc gcaaccgagt gaagcgcgtg 9060
cgcgacctgt tggtgatgga gccgggtgca gagccataag cggcaagaga cgaaagcccg 9120
gtttccgggc ttttgttttg ttacgccaag gacgagtttt agcggctaaa ggtgttgacg 9180
tgcgagaaat gtttagctaa acttctctca tgtgctggcg gctgtcaccg ctatgttcaa 9240
ccaaggcgcg gagcaaatta tgggtgttat ccatgaagaa acggcttacc gaaagccagt 9300
tccaggaggc gatccagggg ctggaagtgg ggcagcagac catcgagata gcgcggggcg 9360
tcttagtcga tgggaagcca caggcgacgt tcgcaacgtc gctgggactg accaggggcg 9420
cagtgtcgca agcggtgcat cgcgtgtggg ccgcgttcga ggacaagaac ttgcccgagg 9480
ggtacgcgcg ggtaacggcg gttctgccgg aacatcaggc gtacatcgtc cggaagtggg 9540
aagcggacgc caagaaaaaa caggaaacca aacgatgaaa actttggtca cggccaacca 9600
gaaaggcggc gtcggcaaga cttcgaccct tgtgcatctt gccttcgact ttttcgagcg 9660
cggcttgcgg gttgccgtga tcgacctgga cccccagggc aatgcgtcct acacgctcaa 9720
ggactttgct accggcctgc atgcaagcaa gctgttcggc gctgtccctg ccggcggctg 9780
gaccgaaacc gcacccgcag ccggcgacgg ccaggccgcg cgcctcgccc tcatcgagtc 9840
caacccggta ctggcgaacg ccgaacggct gtcgctggac gacgcccgcg agctgttcgg 9900
ggcgaacatc aaggccctgg cgaaccaagg cttcgacgtg tgcctgatcg acacggcccc 9960
gacccttggc gtcggcctgg cggccgccct cttcgcggcc gactatgtgc tgtcccccat 10020
cgagcttgag gcgtacagca tccagggcat caagaagatg gtcacgacca ttgcgaacgt 10080
gcgccagaag aacgccaagc tgcaattcct tggcatggtg cccagcaagg tcgatgcgcg 10140
gaatccgcgc cacgcgcgcc accaagccga gctgctggcc gcgtacccca agatgatgat 10200
tccggccacc gttggcctgc gcagcagcat cgccgatgcc ctcgcatccg gtgtgccggt 10260
ctggaagatc aagaaaacgg ccgcgcgcaa ggcatcgaaa gaggttcgcg ccctggctga 10320
ttacgtgttc acgaagatgg agatttccca atgactgcgg ctcaagccaa gaccaccaag 10380
aaaaacaccg ctgcggccgc tcaggaagcc gcaggcgcgg cgcagccgtc cggcctgggg 10440
ttggatagca tcggcgacct gtcgagcctc ctggacgctc ctgcggcgtc tcagggcggt 10500
tccggcccta tcgagctgga cctggacctg atcgacgaag atccgcatca gccgcggacg 10560
gccgacaacc ccggcttttc cccggagagc atcgcggaaa tcggtgccac gatcaaagag 10620
cgcggggtga agtcacccat ttcggtgcgc gagaaccagg agcagccggg ccgctatatc 10680
atcaatcacg gcgcccgccg ctaccgtggc tcgaagtggg ccggcaagaa gtccatcccg 10740
gcgttcatcg acaacgacta caacgaagcc gaccaggtta tcgagaacct gcaacgcaac 10800
gagctgaccc cgcgcgaaat tgccgacttc attggccgcg agctggcgaa gggcaagaag 10860
aaaggcgata tcgccaagga aatcggcaag tcgccggcgt tcatcaccca gcacgtcacg 10920
ctgctggacc tgccggagaa gatcgccgat gcgttcaaca ccggccgcgt gcgcgacgtg 10980
accgtggtga acgagctggt gacggccttc aagaagcgcc cggaggaagt cgaggcgtgg 11040
cttgacgacg acacccagga aatcacgcgc ggcacggtca agctgctgcg cgagttcctg 11100
gacgagaagg gccgcgatcc caacaccgtc gatgccttca acggccagac tgatgccgag 11160
cgtgacgcgg aggccggcga cggccaggac ggcgaggacg gcgaccagga cggtaaggac 11220
gccaaggaaa agggcgcgaa ggagccggac ccggacaagc tgaaaaaggc catcgtccag 11280
gtcgagcacg acgagcgccc tgcccgcctt atcctcaacc gtcggccgcc ggcggaaggc 11340
tatgcctggt tgaagtacga ggacgacggc caggagttcg aggcgaacct tgccgacgtg 11400
aaactggtcg cgctcatcga gggctgatcc ccaaagacag cggcgcgggc cacccgcgcc 11460
gcacagacaa cggttccgct acaaggagga ccgaagaatg aatccgatgc tgttctacat 11520
cgcgggaggc gtaggcgcgg cgttgctgct ggtttccgcg atcatgctgt tcaagctgcg 11580
cgagccgaag aaggaacacc gaccgcagcg caaggcggcg gccccgacgc cgcagccggt 11640
cgataacgag ctgctgcgca ctctagtgat attccacaaa acagcaggga agcagcgctt 11700
ttccgctgca taaccctgct tcggggtcat tatagcgatt ttttcggtat atccatcctt 11760
tttcgcacga tatacaggat tttgccaaag ggttcgtgta gactttcctt ggtgtatcca 11820
acggcgtcag ccgggcagga taggtgaagt aggcccaccc gcgagcgggt gttccttctt 11880
cactgtccct tattcgcacc tggcggtgct caacgggaat cctgctctgc gaggctggcc 11940
ggctaccgcc ggcgtaacag atgagggcaa gcggatggct gatgaaacca agccaaccag 12000
gaagggcagc ccacctatca aggtgtactg ccttccagac gaacgaagag cgattgagga 12060
aaaggcggcg gcggccggca tgagcctgtc ggcctacctg ctggccgtcg gccagggcta 12120
caaaatcacg ggcgtcgtgg actatgagca cgtccgcgag ctggcccgca tcaatggcga 12180
cctgggccgc ctgggcggcc tgctgaaact ctggctcacc gacgacccgc gcacggcgcg 12240
gttcggtgat gccacgatcc tcgccctgct ggcgaagatc gaagagaagc aggacgagct 12300
tggcaaggtc atgatgggcg tggtccgccc gagggcagag ccatgacttt tttagccgct 12360
aaaacggccg gggggtgcgc gtgattgcca agcacgtccc catgcgctcc atcaagaaga 12420
gcgacttcgc ggagctggtg aagtacatca ccgacgagca aggcaagacc gagcgccaga 12480
tccaaaacaa ctgtcaaagc gcacccgccc gatgccattc gcggcacggc ttccgttgag 12540
gatgtcgata tgatgcgcga gccgacggcc cgcagagaag gggccgtttt agcggctaaa 12600
gaaggaagtg caagccctaa cccttggcgt cagagccttc cacgcagctt ttttcgggtg 12660
tcgtcgcccc atttctttac gataaacgcc ttatgtgacg gcaaaaccac actgatgcgt 12720
tcgtatccgg gcggcacgct gctcttgaaa ggatgacccg caatctccgc gagtgcctcg 12780
cggtcaaggt cggtggactc caggagaaga ggtaggggag tttccagggc gtcggcaatg 12840
gcctccatca ccttcaacga ggggttggcc ttaccgttgg ttaagtctga taaaaacgaa 12900
attgaaaccc ctgccctctc cgacagctca tgtttcgtca tgccccgctc atcgagcaga 12960
cgaaggatgt tggtgaaaaa tatctggttg tacacagcgg aagccgcccc tcgcaccttt 13020
ggtcgcggcc cgcaaaattt tagccgctaa agttcttgac agcggaacca atgtttagct 13080
aaactagagt ctcctttctc aaggagactt tcgatatgag ccataatcag ttccagttta 13140
tcggtaatct tacccgtgac accgaggtac gtcatggcaa ttctaacaag ccgcaagcaa 13200
ttttcgatat agcggttaat gaagagtggc gcaacgatgc cggcgacaag caggagcgca 13260
ccgacttctt ccgcatcaag tgttttggct ctcaggccga ggcccacggc aagtatttgg 13320
gcaaggggtc gctggtattc gtgcagggca agattcggaa taccaagtac gagaaggacg 13380
gccagacggt ctacgggacc gacttcattg ccgataaggt ggattatctg gacaccaagg 13440
caccaggcgg gtcaaatcag gaataagggc acattgcccc ggcgtgagtc ggggcaatcc 13500
cgcaaggagg gtgaatgaat cggacgtttg accggaaggc atacaggcaa gaactgatcg 13560
acgcggggtt ttccgccgag gatgccgaaa ccatcgcaag ccgcaccgtc atgcgtgcgc 13620
cccgcgaaac cttccagtcc gtcggctcga tggtccagca agctacggcc aagatcgagc 13680
gcgacagcgt gcaactggct ccccctgccc tgcccgcgcc atcggccgcc gtggagcgtt 13740
cgcgtcgtct cgaacaggag gcggcaggtt tggcgaagtc gatgaccatc gacacgcgag 13800
gaactatgac gaccaagaag cgaaaaaccg ccggcgagga cctggcaaaa caggtcagcg 13860
aggccaagca ggccgcgttg ctgaaacaca cgaagcagca gatcaaggaa atgcagcttt 13920
ccttgttcga tattgcgccg tggccggaca cgatgcgagc gatgccaaac gacacggccc 13980
gctctgccct gttcaccacg cgcaacaaga aaatcccgcg cgaggcgctg caaaacaagg 14040
tcattttcca cgtcaacaag gacgtgaaga tcacctacac cggcgtcgag ctgcgggccg 14100
acgatgacga actggtgtgg cagcaggtgt tggagtacgc gaagcgcacc cctatcggcg 14160
agccgatcac cttcacgttc tacgagcttt gccaggacct gggctggtcg atcaatggcc 14220
ggtattacac gaaggccgag gaatgcctgt cgcgcctaca ggcgacggcg atgggcttca 14280
cgtccgaccg cgttgggcac ctggaatcgg tgtcgctgct gcaccgcttc cgcgtcctgg 14340
accgtggcaa gaaaacgtcc cgttgccagg tcctgatcga cgaggaaatc gtcgtgctgt 14400
ttgctggcga ccactacacg aaattcatat gggagaagta ccgcaagctg tcgccgacgg 14460
cccgacggat gttcgactat ttcagctcgc accgggagcc gtacccgctc aagctggaaa 14520
ccttccgcct catgtgcgga tcggattcca cccgcgtgaa gaagtggcgc gagcaggtcg 14580
gcgaagcctg cgaagagttg cgaggcagcg gcctggtgga acacgcctgg gtcaatgatg 14640
acctggtgca ttgcaaacgc tagggccttg tggggtcagt tccggctggg ggttcagcag 14700
ccagcgcttt actggcattt caggaacaag cgggcactgc tcgacgcact tgcttcgctc 14760
agtatcgctc gggacgcacg gcgcgctcta cgaactgccg ataaacagag gattaaaatt 14820
gacaattgtg attaaggctc agattcgacg gcttggagcg gccgacgtgc aggatttccg 14880
cgagatccga ttgtcggccc tgaagaaagc tccagagatg ttcgggtccg tttacgagca 14940
cgaggagaaa aagcccatgg aggcgttcgc tgaacggttg cgagatgccg tggcattcgg 15000
cgcctacatc gacggcgaga tcattgggct gtcggtcttc aaacaggagg acggccccaa 15060
ggacgctcac aaggcgcatc tgtccggcgt tttcgtggag cccgaacagc gaggccgagg 15120
ggtcgccggt atgctgctgc gggcgttgcc ggcgggttta ttgctcgtga tgatcgtccg 15180
acagattcca acgggaatct ggtggatgcg catcttcatc ctcggcgcac ttaatatttc 15240
gctattctgg agcttgttgt ttatttcggt ctaccgcctg ccgggcgggg tcgcggcgac 15300
ggtaggcgct gtgcagccgc tgatggtcgt gttcatctct gccgctctgc taggtagccc 15360
gatacgattg atggcggtcc tgggggctat ttgcggaact gcgggcgtgg cgctgttggt 15420
gttgacacca aacgcagcgc tagatcctgt cggcgtcgca gcgggcctgg cgggggcggt 15480
ttccatggcg ttcggaaccg tgctgacccg caagtggcaa cctcccgtgc ctctgctcac 15540
ctttaccgcc tggcaactgg cggccggagg acttctgctc gttccagtag ctttagtgtt 15600
tgatccgcca atcccgatgc ctacaggaac caatgttctc ggctgctcga ctgcacgaat 15660
accagcgacc ccttgcccaa atacttgccg tgggcctcgg cctgagagcc aaaacacttg 15720
atgcggaaga agtcggtgcg ctcctgcttg tcgccggcat cgttgcgcca catctaggta 15780
ctaaaacaat tcatccagta aaatataata ttttattttc tcccaatcag gcttgatccc 15840
cagtaagtca aaaaatagct cgacatactg ttcttccccg atatcctccc tgatcgaccg 15900
gacgcagaag gcaatgtcat accacttgtc cgccctgccg cttctcccaa gatcaataaa 15960
gccacttact ttgccatctt tcacaaagat gttgctgtct cccaggtcgc cgtgggaaaa 16020
gacaagttcc tcttcgggct tttccgtctt taaaaaatca tacagctcgc gcggatcttt 16080
aaatggagtg tcttcttccc agttttcgca atccacatcg gccagatcgt tattcagtaa 16140
gtaatccaat tcggctaagc ggctgtctaa gctattcgta tagggacaat ccgatatgtc 16200
gatggagtga aagagcctga tgcactccgc atacagctcg ataatctttt cagggctttg 16260
ttcatcttca tactcttccg agcaaaggac gccatcggcc tcactcatga gcagattgct 16320
ccagccatca tgccgttcaa agtgcaggac ctttggaaca ggcagctttc cttccagcca 16380
tagcatcatg tccttttccc gttccacatc ataggtggtc cctttatacc ggctgtccgt 16440
catttttaaa tataggtttt cattttctcc caccagctta tataccttag caggagacat 16500
tccttccgta tcttttacgc agcggtattt ttcgatcagt tttttcaatt ccggtgatat 16560
tctcatttta gccatttatt atttccttcc tcttttctac agtatttaaa gataccccaa 16620
gaagctaatt ataacaagac gaactccaat tcactgttcc ttgcattcta aaaccttaaa 16680
taccagaaaa cagctttttc aaagttgttt tcaaagttgg cgtataacat agtatcgatt 16740
cgatagcgtg gactcaaggc tctcgcgaat ggctcgcgtt ggaaactttc attgacactt 16800
gaggggcacc gcagggaaat tctcgtcctt gcgagaaccg gctatgtcgt gctgcgcatc 16860
gagcctgcgc ccttggcttg tctcgcccct ctccgcgtcg ctacggggct tccagcgcct 16920
ttccgacgct caccgggctg gttgccctcg ccgctgggct ggcggccgtc tatggccctg 16980
caaacgcgcc agaaacgccg tcgaagccgt gtgcgagaca ccgcggccgc cggcgttgtg 17040
gatacctcgc ggaaaacttg gccctcactg acagatgagg ggcggacgtt gacacttgag 17100
gggccgactc acccggcgcg gcgttgacag atgaggggca ggctcgattt cggccggcga 17160
cgtggagctg gccagcctcg caaatcggcg aaaacgcctg attttacgcg agtttcccac 17220
agatgatgtg gacaagcctg gggataagtg ccctgcggta ttgacacttg aggggcgcga 17280
ctactgacag atgaggggcg cgatccttga cacttgaggg gcagagtgct gacagatgag 17340
gggcgcacct attgacattt gaggggctgt ccacaggcag aaaatccagc atttgcaagg 17400
gtttccgccc gtttttcggc caccgctaac ctgtctttta acctgctttt aaaccaatat 17460
ttataaacct tgtttttaac cagggctgcg ccctgtgcgc gtgaccgcgc acgccgaagg 17520
ggggtgcccc cccttctcga accctcccgg cccgctaacg cgggcctccc atccccccag 17580
gggctgcgcc cctcggccgc gaacggcctc accccaaaaa tggcagcgcc agccaggacg 17640
tcggccgaaa gagcgacaag cagatcacgc ttttcgacag cgtcggattt gcgatcgagg 17700
atttttcggc gctgcgctac gtccgcgacc gcgttgaggg atcaagccac agcagcccac 17760
tcgaccttct agccgaccca gacgagccaa gggatctttt tggaatgctg ctccgtcgtc 17820
aggctttccg acgtttgggt ggttgaacag aagtcattat cgcacggaat gccaagcact 17880
cccgagggga accctgtggt tggcatgcac atacaaatgg acgaacggat aaaccttttc 17940
acgccctttt aaatatccga ttattctaat aaacgctctt ttctcttagg tttacccgcc 18000
aatatatcct gtcaaacact gatagtttaa actgaaggcg ggaaacgaca atctgatcat 18060
gagcggagaa ttaagggagt cacgttatga cccccgccga tgacgcggga caagccgttt 18120
tacgtttgga actgacagaa ccgcaacgtt gaaggagcca ctcagc 18166
<210> 34
<211> 1911
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 34
atggcgatgc cgtatgcctc tctttccccg gcaggcgacc gccgctcctc cccggccgcc 60
accgccaccg cctccctcct ccccttctgc cgctcctccc ccttctccgc cggcggcaat 120
ggcggcatgg gggaggaggc gcggatggac gggaggtgga tggcgaggcc ggtgcccttc 180
acggcggcgc agtacgagga gctggagcac caggcgctca tatacaagta cctggtggcc 240
ggcgtgtccg tcccgccgga tctcgtgctc cccatccgcc gcggcatcga gtccctcgcc 300
gcccgcttct accacaaccc cctcgccatc gggtacggat cgtacctggg caagaaggtg 360
gatccggagc cgggccggtg ccggcgcacg gacggcaaga agtggcggtg cgccaaggag 420
gccgcctccg actccaagta ttgcgagcgc cacatgcacc gcggccgcaa ccgttctaga 480
aaaccggtcg aaacgcagct cgtctcgcac tcccagccgc cggccgcctc cgtcgtgccg 540
cccctcgcca ccggcttcca caaccactcc ctctaccccg ccatcggcgg caccaacggt 600
ggtggaggcg gggggaacaa cggcatgtcc atgcccggca cgttctcctc cgcgctgggg 660
ccgcctcagc agcacatggg caacaatgcc gcctctccct acgcggctct cggcggcgcc 720
ggaacatgca aagatttcag gtataccgca tatggaataa gatctttggc agacgagcag 780
agtcagctca tgacagaagc catgaacacc tccgtggaga acccatggcg cctgccgcca 840
tcttctcaaa cgactacatt cccgctctca agctactctc ctcagcttgg agcaacgagt 900
gacctgggtc agaacaacag cagcaacaac aacagcggcg tcaaggccga gggacagcag 960
cagcagcagc cgctctcctt cccggggtgc ggcgacttcg gcagcggcga ctccgcgaag 1020
caggagaacc agacgctgcg gccgttcttc gacgagtggc cgaagacgag ggactcgtgg 1080
tcggacctga ccgacgacaa ctcgaacgtc gcctccttct cggccaccca gctgtcgatc 1140
tcgataccca tgacgtcctc cgacttctcc gccgccagct cccagtcgcc caacggcatg 1200
ctgttcgccg gcgaaatgta cgcggccgct gccatgcagc agcaacacct gatgcagatg 1260
aaccagagca tgatgggggg ctacgcttcc tctaccactg ccaccactga tctcattcag 1320
cagtacctgg atgagaacaa gcagctgatc ctggccatcc tcgacaacca gaacaacggc 1380
aaggtggagg agtgcgcacg gaaccaagct aagctccagc agaacctcat gtacctcgcc 1440
gccatcgccg acagccagcc tccgcagacg gcatcgctgt ctcagtaccc gtccaacctg 1500
atgatgcagt ccgggccgcg gtacatgcag cagcagtcgg cgcagatgat gtcgccgcag 1560
tcgctgatgg cggcgcggtc gtcgatgatg tacgcgcagc aggccatgtc gccgctccag 1620
cagcagcagc agcagcagca gcaccaggcg gccgcgcacg gccagctggg gatgtcctcc 1680
ggcgcgacca ccgggttcaa cctcctgcac ggtgaggcca gcatgggcgg cggcggcggc 1740
gccactggca acagcatgat gaacgccagc gtcttctcgg actatggccg cggcggcagc 1800
ggcgccaagg aggggtcgac ctcgctgtcg gccgacgctc gcggcgccaa ctctggcgcg 1860
cacagcggcg acggggagta cctcaagggc accgaggagg aaggaagcta g 1911
<210> 35
<211> 21282
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 35
tccatttgta tgtgcatgcc aaccacaggg ttcccctcgg gatcaaagta ctttgatcca 60
acccctccgc tgctatagtg cagtcggctt ctgacgttca gtgcagccgt cttctgaaaa 120
cgacatgtcg cacaagtcct aagttacgcg acaggctgcc gccctgccct tttcctggcg 180
ttttcttgtc gcgtgtttta gtcgcataaa gtagaatact tgcgactaga accggagaca 240
ttacgccatg aacaagagcg ccgccgctgg cctgctgggc tatgcccgcg tcagcaccga 300
cgaccaggac ttgaccaacc aacgggccga actgcacgcg gccggctgca ccaagctgtt 360
ttccgagaag atcaccggca ccaggcgcga ccgcccggag ctggccagga tgcttgacca 420
cctacgccct ggcgacgttg tgacagtgac caggctagac cgcctggccc gcagcacccg 480
cgacctactg gacattgccg agcgcatcca ggaggccggc gcgggcctgc gtagcctggc 540
agagccgtgg gccgacacca ccacgccggc cggccgcatg gtgttgaccg tgttcgccgg 600
cattgccgag ttcgagcgtt ccctaatcat cgaccgcacc cggagcgggc gcgaggccgc 660
caaggcccga ggcgtgaagt ttggcccccg ccctaccctc accccggcac agatcgcgca 720
cgcccgcgag ctgatcgacc aggaaggccg caccgtgaaa gaggcggctg cactgcttgg 780
cgtgcatcgc tcgaccctgt accgcgcact tgagcgcagc gaggaagtga cgcccaccga 840
ggccaggcgg cgcggtgcct tccgtgagga cgcattgacc gaggccgacg ccctggcggc 900
cgccgagaat gaacgccaag aggaacaagc atgaaaccgc accaggacgg ccaggacgaa 960
ccgtttttca ttaccgaaga gatcgaggcg gagatgatcg cggccgggta cgtgttcgag 1020
ccgcccgcgc acggctcaac cgtgcggctg catgaaatcc tggccggttt gtctgatgcc 1080
aagctggcgg cctggccggc cagcttggcc gctgaagaaa ccgagcgccg ccgtctaaaa 1140
aggtgatgtg tatttgagta aaacagcttg cgtcatgcgg tcgctgcgta tatgatgcga 1200
tgagtaaata aacaaatacg caaggggaac gcatgaaggt tatcgctgta cttaaccaga 1260
aaggcgggtc aggcaagacg accatcgcaa cccatctagc ccgcgccctg caactcgccg 1320
gggccgatgt tctgttagtc gattccgatc cccagggcag tgcccgcgat tgggcggccg 1380
tgcgggaaga tcaaccgcta accgttgtcg gcatcgaccg cccgacgatt gaccgcgacg 1440
tgaaggccat cggccggcgc gacttcgtag tgatcgacgg agcgccccag gcggcggact 1500
tggctgtgtc cgcgatcaag gcagccgact tcgtgctgat tccggtgcag ccaagccctt 1560
acgacatatg ggccaccgcc gacctggtgg agctggttaa gcagcgcatt gaggtcacgg 1620
atggaaggct acaagcggcc tttgtcgtgt cgcgggcgat caaaggcacg cgcatcggcg 1680
gtgaggttgc cgaggcgctg gccgggtacg agctgcccat tcttgagtcc cgtatcacgc 1740
agcgcgtgag ctacccaggc actgccgccg ccggcacaac cgttcttgaa tcagaacccg 1800
agggcgacgc tgcccgcgag gtccaggcgc tggccgctga aattaaatca aaactcattt 1860
gagttaatga ggtaaagaga aaatgagcaa aagcacaaac acgctaagtg ccggccgtcc 1920
gagcgcacgc agcagcaagg ctgcaacgtt ggccagcctg gcagacacgc cagccatgaa 1980
gcgggtcaac tttcagttgc cggcggagga tcacaccaag ctgaagatgt acgcggtacg 2040
ccaaggcaag accattaccg agctgctatc tgaatacatc gcgcagctac cagagtaaat 2100
gagcaaatga ataaatgagt agatgaattt tagcggctaa aggaggcggc atggaaaatc 2160
aagaacaacc aggcaccgac gccgtggaat gccccatgtg tggaggaacg ggcggttggc 2220
caggcgtaag cggctgggtt gtctgccggc cctgcaatgg cactggaacc cccaagcccg 2280
aggaatcggc gtgacggtcg caaaccatcc ggcccggtac aaatcggcgc ggcgctgggt 2340
gatgacctgg tggagaagtt gaaggccgcg caggccgccc agcggcaacg catcgaggca 2400
gaagcacgcc ccggtgaatc gtggcaagcg gccgctgatc gaatccgcaa agaatcccgg 2460
caaccgccgg cagccggtgc gccgtcgatt aggaagccgc ccaagggcga cgagcaacca 2520
gattttttcg ttccgatgct ctatgacgtg ggcacccgcg atagtcgcag catcatggac 2580
gtggccgttt tccgtctgtc gaagcgtgac cgacgagctg gcgaggtgat ccgctacgag 2640
cttccagacg ggcacgtaga ggtttccgca gggccggccg gcatggccag tgtgtgggat 2700
tacgacctgg tactgatggc ggtttcccat ctaaccgaat ccatgaaccg ataccgggaa 2760
gggaagggag acaagcccgg ccgcgtgttc cgtccacacg ttgcggacgt actcaagttc 2820
tgccggcgag ccgatggcgg aaagcagaaa gacgacctgg tagaaacctg cattcggtta 2880
aacaccacgc acgttgccat gcagcgtacg aagaaggcca agaacggccg cctggtgacg 2940
gtatccgagg gtgaagcctt gattagccgc tacaagatcg taaagagcga aaccgggcgg 3000
ccggagtaca tcgagatcga gctagctgat tggatgtacc gcgagatcac agaaggcaag 3060
aacccggacg tgctgacggt tcaccccgat tactttttga tcgatcccgg catcggccgt 3120
tttctctacc gcctggcacg ccgcgccgca ggcaaggcag aagccagatg gttgttcaag 3180
acgatctacg aacgcagtgg cagcgccgga gagttcaaga agttctgttt caccgtgcgc 3240
aagctgatcg ggtcaaatga cctgccggag tacgatttga aggaggaggc ggggcaggct 3300
ggcccgatcc tagtcatgcg ctaccgcaac ctgatcgagg gcgaagcatc cgccggttcc 3360
taatgtacgg agcagatgct agggcaaatt gccctagcag gggaaaaagg tcgaaaaggc 3420
ctctttcctg tggatagcac gtacattggg aacccaaagc cgtacattgg gaaccggaac 3480
ccgtacattg ggaacccaaa gccgtacatt gggaaccggt cacacatgta agtgactgat 3540
ataaaagaga aaaaaggcga tttttccgcc taaaactctt taaaacttat taaaactctt 3600
aaaacccgcc tggcctgtgc ataactgtct ggccagcgca cagcccaaga gctgcaaaaa 3660
gcgcctaccc ttcggtcgct gcgctcccta cgccccgccg cttcgcgtcg gcctatcgcg 3720
gccgctggcc gctcaaaaat ggctggccta cggccaggca atctaccagg gcgcggacaa 3780
gccgcgccgt cgccactcga ccgccggcgc ccacatcaag gcaccctgcc tcgcgcgttt 3840
cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gaaacggtca cagcttgtct 3900
gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg ttggcgggtg 3960
tcggggcgca gccatgaccc agtcacgtag cgatagcgga gtgtatactg gcttaactat 4020
gcggcatcag agcagattgt actgagagtg caccatatgc ggtgtgaaat accgcacaga 4080
tgcgtaagga gaaaataccg catcaggccc tcttccgctt cctcgctcac tgactcgctg 4140
cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta 4200
tccacagaat caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc 4260
aggaaccgta aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag 4320
catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac 4380
caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc 4440
ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcatag ctcacgctgt 4500
aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc 4560
gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga 4620
cacgacttat cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta 4680
ggcggtgcta cagagttctt gaagtggtgg cctaactacg gctacactag aaggacagta 4740
tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga 4800
tccggcaaac aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg 4860
cgcagaaaaa aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag 4920
tggaacgaaa actcacgtta agggattttg gtcatgcatt ctaggtacta aaacaattca 4980
tccagtaaaa tataatattt tattttctcc caatcaggct tgatccccag taagtcaaaa 5040
aatagctcga catactgttc ttccccgata tcctccctga tcgaccggac gcagaaggca 5100
atgtcatacc acttgtccgc cctgccgctt ctcccaagat caataaagcc acttactttg 5160
ccatctttca caaagatgtt gctgtctccc aggtcgccgt gggaaaagac aagttcctct 5220
tcgggctttt ccgtctttaa aaaatcatac agctcgcgcg gatctttaaa tggagtgtct 5280
tcttcccagt tttcgcaatc cacatcggcc agatcgttat tcagtaagta atccaattcg 5340
gctaagcggc tgtctaagct attcgtatag ggacaatccg atatgtcgat ggagtgaaag 5400
agcctgatgc actccgcata cagctcgata atcttttcag ggctttgttc atcttcatac 5460
tcttccgagc aaaggacgcc atcggcctca ctcatgagca gattgctcca gccatcatgc 5520
cgttcaaagt gcaggacctt tggaacaggc agctttcctt ccagccatag catcatgtcc 5580
ttttcccgtt ccacatcata ggtggtccct ttataccggc tgtccgtcat ttttaaatat 5640
aggttttcat tttctcccac cagcttatat accttagcag gagacattcc ttccgtatct 5700
tttacgcagc ggtatttttc gatcagtttt ttcaattccg gtgatattct cattttagcc 5760
atttattatt tccttcctct tttctacagt atttaaagat accccaagaa gctaattata 5820
acaagacgaa ctccaattca ctgttccttg cattctaaaa ccttaaatac cagaaaacag 5880
ctttttcaaa gttgttttca aagttggcgt ataacatagt atcgacggag ccgattttga 5940
aaccgcggtg atcacaggca gcaacgctct gtcatcgtta caatcaacat gctaccctcc 6000
gcgagatcat ccgtgtttca aacccggcag cttagttgcc gttcttccga atagcatcgg 6060
taacatgagc aaagtctgcc gccttacaac ggctctcccg ctgacgccgt cccggactga 6120
tgggctgcct gtatcgagtg gtgattttgt gccgagctgc cggtcgggga gctgttggct 6180
ggctggtggc aggatatatt gtggtgtaaa caaattgacg cttagacaac ttaataacac 6240
attgcggacg tttttaatgt actgaattaa cgccgaatta attcggggga tctggatttt 6300
agtactggat tttggtttta ggaattagaa attttattga tagaagtatt ttacaaatac 6360
aaatacatac taagggtttc ttatatgctc aacacatgag cgaaacccta taggaaccct 6420
aattccctta tctgggaact actcacacat tattatggag aaactcgagc ttgtcgatcg 6480
acagatcccg gtcggcatct actctatttc tttgccctcg gacgagtgct ggggcgtcgg 6540
tttccactat cggcgagtac ttctacacag ccatcggtcc agacggccgc gcttctgcgg 6600
gcgatttgtg tacgcccgac agtcccggct ccggatcgga cgattgcgtc gcatcgaccc 6660
tgcgcccaag ctgcatcatc gaaattgccg tcaaccaagc tctgatagag ttggtcaaga 6720
ccaatgcgga gcatatacgc ccggagtcgt ggcgatcctg caagctccgg atgcctccgc 6780
tcgaagtagc gcgtctgctg ctccatacaa gccaaccacg gcctccagaa gaagatgttg 6840
gcgacctcgt attgggaatc cccgaacatc gcctcgctcc agtcaatgac cgctgttatg 6900
cggccattgt ccgtcaggac attgttggag ccgaaatccg cgtgcacgag gtgccggact 6960
tcggggcagt cctcggccca aagcatcagc tcatcgagag cctgcgcgac ggacgcactg 7020
acggtgtcgt ccatcacagt ttgccagtga tacacatggg gatcagcaat cgcgcatatg 7080
aaatcacgcc atgtagtgta ttgaccgatt ccttgcggtc cgaatgggcc gaacccgctc 7140
gtctggctaa gatcggccgc agcgatcgca tccatagcct ccgcgaccgg ttgtagaaca 7200
gcgggcagtt cggtttcagg caggtcttgc aacgtgacac cctgtgaacg gcgggagatg 7260
caataggtca ggctctcgct aaactcccca atgtcaagca cttccggaat cgggagcgcg 7320
gccgatgcaa agtgccgata aacataacga tctttgtaga aaccatcggc gcagctattt 7380
acccgcagga catatccacg ccctcctaca tcgaagctga aagcacgaga ttcttcgccc 7440
tccgagagct gcatcaggtc ggacacgctg tcgaactttt cgatcagaaa cttctcgaca 7500
gacgtcgcgg tgagttcagg ctttttcata agcttctgca aaagagaacc agacaacagg 7560
gtaagtgcct agcagtaaac aaacagaact catcacaagc aaacagcaac atcatattca 7620
taccaacagg tcatgtgtgt tcatcacatc attagtacta agcatgccat catccaagta 7680
tatcaaagta agggcaaaga gcatgcatga tcatcaggtg cacaaaagaa tcatcaaatt 7740
gtagcagtac atatcttcat ctatcatgca tatctatcca taacaggacg atgcatgttg 7800
accaggtaaa agctacagga tcctatagga acagcagcgt atatatcttc accaatctta 7860
gcatcttaat catgtggcac atgcagtttc aatttaagca catgagctag ttgattatga 7920
ggtaccagag aatcatcaaa ctgatgtagc agcatatatc ctcatctatc atgcaatcta 7980
atctaatcta actaaacagg aaaggtgtgc tattcagtta aaagctaccg catcatatac 8040
aaacggcagc ataagaaaaa gcataatcat cttaatcatg caacaaacgc agattcataa 8100
taagcccaag agctagcttg tgatgatctt attctactct gatctacagc aatcagataa 8160
cgacctaacc ttgcacatgg caacaaaaca atcgatcgga cgaatcagtt gtttgttcct 8220
agctagcacc atcgaaccag ataatagatg cacgtacaga tcccgaaaac gaacccaaaa 8280
acagggcaga cctagctgaa cctaggcagc gacccagcag atcgtgagaa cgatctcatc 8340
tacgaacagc ctagaagcaa ccccacgatt cccggacaaa cgacctaaaa tccccacaaa 8400
tcacatgagc atgacaggat aaacagcgga accgatcaga tctacacgaa aaccccacct 8460
cccagccacc cacgatcagg aaacacgcgg atctagcatg atttcgtcaa cgcctcagcc 8520
tagttcctag ccacagacca agcagaacca ccaaaccacg ccgagcgagg agatggggca 8580
agaggacggg ggagacgatc gccgtacctt gaagcggggg aaggatcgcc gagggtcgcg 8640
aggagagcaa tttggatttg gaacccgggg gttgtgcgct ccgaacgatg agacgatgtg 8700
agattgtggg aagaggcgcg gagggccctg tatttatggg ctgcgacggg gggaggagag 8760
gtggggaggg ttggggaagg aatcccccac ccgtgccgtg acggttccgg gccgtgtgag 8820
aggagcccgc tcgtctccgc cacgcaattt ccgcgatcgg agcggagctt tcgagaggcg 8880
gctggatggt tggtggccgt tagatttgta gacgccgtta acgcctcgcc tccaccggga 8940
agagttttga gcagccgctt atgacaatgg cttaacgacg ttagacggag cgttagtggc 9000
aggccatgat agaatagacg tattgcaatg ggattataat taaataaata agaatataat 9060
aagatatggc aagtcggcac tcatgacatg gtcttcgaaa tgatagtgct cactttctta 9120
gccgagaaag ttgacgcgac tgatttagaa gttaagatat tatttctctc ttcttttctt 9180
tcctcgtcat ataaggatga aataaacttt agagattgcc ggtagtgatt ttggatttcg 9240
gcgatcaggc ttggtttgcc ggtttcggac ggtgtgcctt aggccacccg cagtgtatct 9300
tgtaatgttc aaccgataag caagggtggg gctcaagcaa gtagtaaaca actatgtcaa 9360
atgtcaccat ggttatggtc ttgtttagtt ggcttcttgg ccgattcatt aatgcagctg 9420
gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta atgtgagtta 9480
gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta tgttgtgtgg 9540
aattgtgagc ggataacaat ttcacacagg aaacagctat gaccatgatt acgaattcga 9600
gctcggtacc cggggatctg caggcgcgtg cagcgtgacc cggtcgtgcc cctctctaga 9660
gataatgagc attgcatgtc taagttataa aaaattacca catatttttt ttgtcacact 9720
tgtttgaagt gcagtttatc tatctttata catatattta aactttactc tacgaataat 9780
ataatctata gtactacaat aatatcagtg ttttagagaa tcatataaat gaacagttag 9840
acatggtcta aaggacaatt gagtattttg acaacaggac tctacagttt tatcttttta 9900
gtgtgcatgt gttctccttt ttttttgcaa atagcttcac ctatataata cttcatccat 9960
tttattagta catccattta gggtttaggg ttaatggttt ttatagacta atttttttag 10020
tacatctatt ttattctatt ttagcctcta aattaagaaa actaaaactc tattttagtt 10080
tttttattta ataatttaga tataaaatag aataaaataa agtgactaaa aattaaacaa 10140
atacccttta agaaattaaa aaaactaagg aaacattttt cttgtttcga gtagataatg 10200
ccagcctgtt aaacgccgtc gacgagtcta acggacacca accagcgaac cagcagcgtc 10260
gcgtcgggcc aagcgaagca gacggcacgg catctctgtc gctgcctctg gacccctctc 10320
gagagttccg ctccaccgtt ggacttgctc cgctgtcggc atccagaaat tgcgtggcgg 10380
agcggcagac gtgagccggc acggcaggcg gcctcctcct cctctcacgg cacggcagct 10440
acgggggatt cctttcccac cgctccttcg ctttcccttc ctcgcccgcc gtaataaata 10500
gacaccccct ccacaccctc tttccccaac ctcgtgttgt tcggagcgca cacacacaca 10560
accagatctc ccccaaatcc acccgtcggc acctccgctt caaggtacgc cgctcgtcct 10620
cccccccccc ccctctctac cttctctaga tcggcgttcc ggtccatggt tagggcccgg 10680
tagttctact tctgttcatg tttgtgttag atccgtgttt gtgttagatc cgtgctgcta 10740
gcgttcgtac acggatgcga cctgtacgtc agacacgttc tgattgctaa cttgccagtg 10800
tttctctttg gggaatcctg ggatggctct agccgttccg cagacgggat cgatttcatg 10860
attttttttg tttcgttgca tagggtttgg tttgcccttt tcctttattt caatatatgc 10920
cgtgcacttg tttgtcgggt catcttttca tgcttttttt tgtcttggtt gtgatgatgt 10980
ggtctggttg ggcggtcgtt ctagatcgga gtagaattct gtttcaaact acctggtgga 11040
tttattaatt ttggatctgt atgtgtgtgc catacatatt catagttacg aattgaagat 11100
gatggatgga aatatcgatc taggataggt atacatgttg atgcgggttt tactgatgca 11160
tatacagaga tgctttttgt tcgcttggtt gtgatgatgt ggtgtggttg ggcggtcgtt 11220
cattcgttct agatcggagt agaatactgt ttcaaactac ctggtgtatt tattaatttt 11280
ggaactgtat gtgtgtgtca tacatcttca tagttacgag tttaagatgg atggaaatat 11340
cgatctagga taggtataca tgttgatgtg ggttttactg atgcatatac atgatggcat 11400
atgcagcatc tattcatatg ctctaacctt gagtacctat ctattataat aaacaagtat 11460
gttttataat tattttgatc ttgatatact tggatgatgg catatgcagc agctatatgt 11520
ggattttttt agccctgcct tcatacgcta tttatttgct tggtactgtt tcttttgtcg 11580
atgctcaccc tgttgtttgg tgttacttct gcagactaga gttatcaaca agtttgtaca 11640
aaaaagctgc caccatggcg atgccgtatg cctctctttc cccggcaggc gaccgccgct 11700
cctccccggc cgccaccgcc accgcctccc tcctcccctt ctgccgctcc tcccccttct 11760
ccgccggcgg caatggcggc atgggggagg aggcgcggat ggacgggagg tggatggcga 11820
ggccggtgcc cttcacggcg gcgcagtacg aggagctgga gcaccaggcg ctcatataca 11880
agtacctggt ggccggcgtg tccgtcccgc cggatctcgt gctccccatc cgccgcggca 11940
tcgagtccct cgccgcccgc ttctaccaca accccctcgc catcgggtac ggatcgtacc 12000
tgggcaagaa ggtggatccg gagccgggcc ggtgccggcg cacggacggc aagaagtggc 12060
ggtgcgccaa ggaggccgcc tccgactcca agtattgcga gcgccacatg caccgcggcc 12120
gcaaccgttc aagaaagcct gtggaaacgc agctcgtctc gcactcccag ccgccggccg 12180
cctccgtcgt gccgcccctc gccaccggct tccacaacca ctccctctac cccgccatcg 12240
gcggcaccaa cggtggtgga ggcgggggga acaacggcat gtccatgccc ggcacgttct 12300
cctccgcgct ggggccgcct cagcagcaca tgggcaacaa tgccgcctct ccctacgcgg 12360
ctctcggcgg cgccggaaca tgcaaagatt tcaggtatac cgcatatgga ataagatctt 12420
tggcagacga gcagagtcag ctcatgacag aagccatgaa cacctccgtg gagaacccat 12480
ggcgcctgcc gccatcttct caaacgacta cattcccgct ctcaagctac tctcctcagc 12540
ttggagcaac gagtgacctg ggtcagaaca acagcagcaa caacaacagc ggcgtcaagg 12600
ccgagggaca gcagcagcag cagccgctct ccttcccggg gtgcggcgac ttcggcagcg 12660
gcgactccgc gaagcaggag aaccagacgc tgcggccgtt cttcgacgag tggccgaaga 12720
cgagggactc gtggtcggac ctgaccgacg acaactcgaa cgtcgcctcc ttctcggcca 12780
cccagctgtc gatctcgata cccatgacgt cctccgactt ctccgccgcc agctcccagt 12840
cgcccaacgg catgctgttc gccggcgaaa tgtacgcggc cgctgccatg cagcagcaac 12900
acctgatgca gatgaaccag agcatgatgg ggggctacgc ttcctctacc actgccacca 12960
ctgatctcat tcagcagtac ctggatgaga acaagcagct gatcctggcc atcctcgaca 13020
accagaacaa cggcaaggtg gaggagtgcg cacggaacca agctaagctc cagcagaacc 13080
tcatgtacct cgccgccatc gccgacagcc agcctccgca gacggcatcg ctgtctcagt 13140
acccgtccaa cctgatgatg cagtccgggc cgcggtacat gcagcagcag tcggcgcaga 13200
tgatgtcgcc gcagtcgctg atggcggcgc ggtcgtcgat gatgtacgcg cagcaggcca 13260
tgtcgccgct ccagcagcag cagcagcagc agcagcacca ggcggccgcg cacggccagc 13320
tggggatgtc ctccggcgcg accaccgggt tcaacctcct gcacggtgag gccagcatgg 13380
gcggcggcgg cggcgccact ggcaacagca tgatgaacgc cagcgtcttc tcggactatg 13440
gccgcggcgg cagcggcgcc aaggaggggt cgacctcgct gtcggccgac gctcgcggcg 13500
ccaactctgg cgcgcacagc ggcgacgggg agtacctcaa gggcaccgag gaggaaggaa 13560
gctagcgttc agctttcttg tacaaagtgg ttgataacag cgggttaatt aacatctttt 13620
acccatacga tgttcctgac tatgcgggct atccctatga cgtcccggac tatgcaggat 13680
cctatccata tgacgttcca gattacgctg ctcagtgcag cgcttagagc tcgaatttcc 13740
ccgatcgttc aaacatttgg caataaagtt tcttaagatt gaatcctgtt gccggtcttg 13800
cgatgattat catataattt ctgttgaatt acgttaagca tgtaataatt aacatgtaat 13860
gcatgacgtt atttatgaga tgggttttta tgattagagt cccgcaatta tacatttaat 13920
acgcgataga aaacaaaata tagcgcgcaa actaggataa attatcgcgc gcggtgtcat 13980
ctatgttact agatcgggaa ttagcgcgcc tgcagtgcag cgtgacccgg tcgtgcccct 14040
ctctagagat aatgagcatt gcatgtctaa gttataaaaa attaccacat attttttttg 14100
tcacacttgt ttgaagtgca gtttatctat ctttatacat atatttaaac tttactctac 14160
gaataatata atctatagta ctacaataat atcagtgttt tagagaatca tataaatgaa 14220
cagttagaca tggtctaaag gacaattgag tattttgaca acaggactct acagttttat 14280
ctttttagtg tgcatgtgtt ctcctttttt tttgcaaata gcttcaccta tataatactt 14340
catccatttt attagtacat ccatttaggg tttagggtta atggttttta tagactaatt 14400
tttttagtac atctatttta ttctatttta gcctctaaat taagaaaact aaaactctat 14460
tttagttttt ttatttaata atttagatat aaaatagaat aaaataaagt gactaaaaat 14520
taaacaaata ccctttaaga aattaaaaaa actaaggaaa catttttctt gtttcgagta 14580
gataatgcca gcctgttaaa cgccgtcgac gagtctaacg gacaccaacc agcgaaccag 14640
cagcgtcgcg tcgggccaag cgaagcagac ggcacggcat ctctgtcgct gcctctggac 14700
ccctctcgag agttccgctc caccgttgga cttgctccgc tgtcggcatc cagaaattgc 14760
gtggcggagc ggcagacgtg agccggcacg gcaggcggcc tcctcctcct ctcacggcac 14820
cggcagctac gggggattcc tttcccaccg ctccttcgct ttcccttcct cgcccgccgt 14880
aataaataga caccccctcc acaccctctt tccccaacct cgtgttgttc ggagcgcaca 14940
cacacacaac cagatctccc ccaaatccac ccgtcggcac ctccgcttca aggtacgccg 15000
ctcgtcctcc cccccccccc tctctacctt ctctagatcg gcgttccggt ccatggttag 15060
ggcccggtag ttctacttct gttcatgttt gtgttagatc cgtgtttgtg ttagatccgt 15120
gctgctagcg ttcgtacacg gatgcgacct gtacgtcaga cacgttctga ttgctaactt 15180
gccagtgttt ctctttgggg aatcctggga tggctctagc cgttccgcag acgggatcga 15240
tttcatgatt ttttttgttt cgttgcatag ggtttggttt gcccttttcc tttatttcaa 15300
tatatgccgt gcacttgttt gtcgggtcat cttttcatgc ttttttttgt cttggttgtg 15360
atgatgtggt ctggttgggc ggtcgttcta gatcggagta gaattaattc tgtttcaaac 15420
tacctggtgg atttattaat tttggatctg tatgtgtgtg ccatacatat tcatagttac 15480
gaattgaaga tgatggatgg aaatatcgat ctaggatagg tatacatgtt gatgcgggtt 15540
ttactgatgc atatacagag atgctttttg ttcgcttggt tgtgatgatg tggtgtggtt 15600
gggcggtcgt tcattcgttc tagatcggag tagaatactg tttcaaacta cctggtgtat 15660
ttattaattt tggaactgta tgtgtgtgtc atacatcttc atagttacga gtttaagatg 15720
gatggaaata tcgatctagg ataggtatac atgttgatgt gggttttact gatgcatata 15780
catgatggca tatgcagcat ctattcatat gctctaacct tgagtaccta tctattataa 15840
taaacaagta tgttttataa ttattttgat cttgatatac ttggatgatg gcatatgcag 15900
cagctatatg tggatttttt tagccctgcc ttcatacgct atttatttgc ttggtactgt 15960
ttcttttgtc gatgctcacc ctgttgtttg gtgttacttc ctgcagggcg atctattcga 16020
atgcctaaga agaagagaaa ggtcggtatt cacggcgttc ctgcggcgag tgacaagaag 16080
tactcgatcg gcctcgacat cgggacgaac tcagttggct gggccgtgat caccgacgag 16140
tacaaggtgc cctctaagaa gttcaaggtc ctggggaaca ccgaccgcca ttccatcaag 16200
aagaacctca tcggcgctct cctgttcgac agcggggaga ccgctgaggc tacgaggctc 16260
aagagaaccg ctaggcgccg gtacacgaga aggaagaaca ggatctgcta cctccaagag 16320
attttctcca acgagatggc caaggttgac gattcattct tccaccgcct ggaggagtct 16380
ttcctcgtgg aggaggataa gaagcacgag cggcatccca tcttcggcaa catcgtggac 16440
gaggttgcct accacgagaa gtaccctacg atctaccatc tgcggaagaa gctcgtggac 16500
tccaccgata aggcggacct cagactgatc tacctcgctc tggcccacat gatcaagttc 16560
cgcggccatt tcctgatcga gggggatctc aacccagaca acagcgatgt tgacaagctg 16620
ttcatccaac tcgtgcagac ctacaaccaa ctcttcgagg agaacccgat caacgcctct 16680
ggcgtggacg cgaaggctat cctgtccgcg aggctctcga agtccaggag gctggagaac 16740
ctgatcgctc agctcccagg cgagaagaag aacggcctgt tcgggaacct catcgctctc 16800
agcctggggc tcaccccgaa cttcaagtcg aacttcgatc tcgctgagga cgccaagctg 16860
caactctcca aggacaccta cgacgatgac ctcgataacc tcctggccca gatcggcgat 16920
caatacgcgg acctgttcct cgctgccaag aacctgtcgg acgccatcct cctgtcagat 16980
atcctccgcg tgaacaccga gatcacgaag gctccactct ctgcctccat gatcaagcgc 17040
tacgacgagc accatcagga tctgaccctc ctgaaggcgc tggtccgcca acagctcccg 17100
gagaagtaca aggagatttt cttcgatcag tcgaagaacg gctacgctgg gtacatcgac 17160
ggcggggcct cacaagagga gttctacaag ttcatcaagc caatcctgga gaagatggac 17220
ggcacggagg agctcctggt gaagctcaac agggaggacc tcctgcggaa gcagagaacc 17280
ttcgataacg gcagcatccc ccaccaaatc catctcgggg agctgcacgc catcctgaga 17340
aggcaagagg acttctaccc tttcctcaag gataaccggg agaagatcga gaagatcctg 17400
accttcagaa tcccatacta cgtcggccct ctcgcgcggg ggaactcaag attcgcttgg 17460
atgacccgca agtctgagga gaccatcacg ccgtggaact tcgaggaggt ggtggacaag 17520
ggcgctagcg ctcagtcgtt catcgagagg atgaccaact tcgacaagaa cctgcccaac 17580
gagaaggtgc tccctaagca ctcgctcctg tacgagtact tcaccgtcta caacgagctc 17640
acgaaggtga agtacgtcac cgagggcatg cgcaagccag cgttcctgtc cggggagcag 17700
aagaaggcta tcgtggacct cctgttcaag accaaccgga aggtcacggt taagcaactc 17760
aaggaggact acttcaagaa gatcgagtgc ttcgattcgg tcgagatcag cggcgttgag 17820
gaccgcttca acgccagcct cgggacctac cacgatctcc tgaagatcat caaggataag 17880
gacttcctgg acaacgagga gaacgaggat atcctggagg acatcgtgct gaccctcacg 17940
ctgttcgagg acagggagat gatcgaggag cgcctgaaga cgtacgccca tctcttcgat 18000
gacaaggtca tgaagcaact caagcgccgg agatacaccg gctgggggag gctgtcccgc 18060
aagctcatca acggcatccg ggacaagcag tccgggaaga ccatcctcga cttcctgaag 18120
agcgatggct tcgccaacag gaacttcatg caactgatcc acgatgacag cctcaccttc 18180
aaggaggata tccaaaaggc tcaagtgagc ggccaggggg actcgctgca cgagcatatc 18240
gcgaacctcg ctggctcccc cgcgatcaag aagggcatcc tccagaccgt gaaggttgtg 18300
gacgagctcg tgaaggtcat gggccggcac aagcctgaga acatcgtcat cgagatggcc 18360
agagagaacc aaaccacgca gaaggggcaa aagaactcta gggagcgcat gaagcgcatc 18420
gaggagggca tcaaggagct ggggtcccaa atcctcaagg agcacccagt ggagaacacc 18480
caactgcaga acgagaagct ctacctgtac tacctccaga acggcaggga tatgtacgtg 18540
gaccaagagc tggatatcaa ccgcctcagc gattacgacg tcgatcatat cgttccccag 18600
tctttcctga aggatgactc catcgacaac aaggtcctca ccaggtcgga caagaaccgc 18660
ggcaagtcag ataacgttcc atctgaggag gtcgttaaga agatgaagaa ctactggagg 18720
cagctcctga acgccaagct gatcacgcaa aggaagttcg acaacctcac caaggctgag 18780
agaggcgggc tctcagagct ggacaaggcc ggcttcatca agcggcagct ggtcgagacc 18840
agacaaatca cgaagcacgt tgcgcaaatc ctcgactctc ggatgaacac gaagtacgat 18900
gagaacgaca agctgatcag ggaggttaag gtgatcaccc tgaagtctaa gctcgtctcc 18960
gacttcagga aggatttcca gttctacaag gttcgcgaga tcaacaacta ccaccatgcc 19020
catgacgctt acctcaacgc tgtggtcggc accgctctga tcaagaagta cccaaagctg 19080
gagtccgagt tcgtgtacgg ggactacaag gtttacgatg tgcgcaagat gatcgccaag 19140
tcggagcaag agatcggcaa ggctaccgcc aagtacttct tctactcaaa catcatgaac 19200
ttcttcaaga ccgagatcac gctggccaac ggcgagatcc ggaagagacc gctcatcgag 19260
accaacggcg agacggggga gatcgtgtgg gacaagggca gggatttcgc gaccgtccgc 19320
aaggttctct ccatgcccca ggtgaacatc gtcaagaaga ccgaggtcca aacgggcggg 19380
ttctcaaagg agtctatcct gcctaagcgg aacagcgaca agctcatcgc cagaaagaag 19440
gactgggacc caaagaagta cggcgggttc gacagcccta ccgtggccta ctcggtcctg 19500
gttgtggcga aggttgagaa gggcaagtcc aagaagctca agagcgtgaa ggagctcctg 19560
gggatcacca tcatggagag gtccagcttc gagaagaacc caatcgactt cctggaggcc 19620
aagggctaca aggaggtgaa gaaggacctg atcatcaagc tcccgaagta ctctctcttc 19680
gagctggaga acggcaggaa gagaatgctg gcttccgctg gcgagctcca gaaggggaac 19740
gagctcgcgc tgccaagcaa gtacgtgaac ttcctctacc tggcttccca ctacgagaag 19800
ctcaagggca gcccggagga caacgagcaa aagcagctgt tcgtcgagca gcacaagcat 19860
tacctcgacg agatcatcga gcaaatctcc gagttcagca agcgcgtgat cctcgccgac 19920
gcgaacctgg ataaggtcct ctccgcctac aacaagcacc gggacaagcc catcagagag 19980
caagcggaga acatcatcca tctcttcacc ctgacgaacc tcggcgctcc tgctgctttc 20040
aagtacttcg acaccacgat cgatcggaag agatacacct ccacgaagga ggtcctggac 20100
gcgaccctca tccaccagtc gatcaccggc ctgtacgaga cgaggatcga cctctcacaa 20160
ctcggcgggg ataagagacc cgcagcaacc aagaaggcag ggcaagcaaa gaagaagaag 20220
tgactcgaga tatgaagatg aagatgaaat atttggtgtg tcaaataaaa agcttgtgtg 20280
cttaagtttg tgtttttttc ttggcttgtt gtgttatgaa tttgtggctt tttctaatat 20340
taaatgaatg taagatcaca ttataatgaa taaacaaatg tttctataat ccattgtgaa 20400
tgttttgttg gatctcttct gcagcatata actactgtat gtgctatggt atggactatg 20460
gaatatgatt aaagataagg agctccggtg acggacgacc aagcccgtta ttctgacagt 20520
tctggtgctc aacacattta tatttatcaa ggagcacatt gttactcact gctaggaggg 20580
aatcgaacta ggaatattga tcagaggaac tacgagagag ctgaagataa ctgccctcta 20640
gctctcactg atctgggtcg catagtgaga tgcagcccac gtgagttcag caacggtcta 20700
gcgctgggct tttaggcccg catgatcggg cttttgtcgg gtggtcgacg tgttcacgat 20760
tggggagagc aacgcagcag ttcctcttag tttagtccca cctcgcctgt ccagcagagt 20820
tctgaccggt ttataaactc gcttgctgca tcagacttga tgaggaactg gaccaagggt 20880
tttagagcta gaaatagcaa gttaaaataa ggctagtccg ttatcaactt gaaaaagtgg 20940
caccgagtcg gtgctttttt tcccggtggc actggccgtc gttttacaac gtcgtgactg 21000
ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg 21060
gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg 21120
cgaatgctag agcagcttga gcttggatca gattgtcgtt tcccgccttc agtttaaact 21180
atcagtgttt gacaggatat attggcgggt aaacctaaga gaaaagagcg tttattagaa 21240
taatcggata tttaaaaggg cgtgaaaagg tttatccgtt cg 21282
<210> 36
<211> 19778
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 36
actgacagaa ccgcaacgtt gaaggagcca ctcagcaagc tttgcagcgt gacccggtcg 60
tgcccctctc tagagataat gagcattgca tgtctaagtt ataaaaaatt accacatatt 120
ttttttgtca cacttgtttg aagtgcagtt tatctatctt tatacatata tttaaacttt 180
actctacgaa taatataatc tatagtacta caataatatc agtgttttag agaatcatat 240
aaatgaacag ttagacatgg tctaaaggac aattgagtat tttgacaaca ggactctaca 300
gttttatctt tttagtgtgc atgtgttctc cttttttttt gcaaatagct tcacctatat 360
aatacttcat ccattttatt agtacatcca tttagggttt agggttaatg gtttttatag 420
actaattttt ttagtacatc tattttattc tattttagcc tctaaattaa gaaaactaaa 480
actctatttt agttttttta tttaataatt tagatataaa atagaataaa ataaagtgac 540
taaaaattaa acaaataccc tttaagaaat taaaaaaact aaggaaacat ttttcttgtt 600
tcgagtagat aatgccagcc tgttaaacgc cgtcgacgag tctaacggac accaaccagc 660
gaaccagcag cgtcgcgtcg ggccaagcga agcagacggc acggcatctc tgtcgctgcc 720
tctggacccc tctcgagagt tccgctccac cgttggactt gctccgctgt cggcatccag 780
aaattgcgtg gcggagcggc agacgtgagc cggcacggca ggcggcctcc tcctcctctc 840
acggcacggc agctacgggg gattcctttc ccaccgctcc ttcgctttcc cttcctcgcc 900
cgccgtaata aatagacacc ccctccacac cctctttccc caacctcgtg ttgttcggag 960
cgcacacaca cacaaccaga tctcccccaa atccacccgt cggcacctcc gcttcaaggt 1020
acgccgctcg tcctcccccc ccccccctct ctaccttctc tagatcggcg ttccggtcca 1080
tggttagggc ccggtagttc tacttctgtt catgtttgtg ttagatccgt gtttgtgtta 1140
gatccgtgct gctagcgttc gtacacggat gcgacctgta cgtcagacac gttctgattg 1200
ctaacttgcc agtgtttctc tttggggaat cctgggatgg ctctagccgt tccgcagacg 1260
ggatcgattt catgattttt tttgtttcgt tgcatagggt ttggtttgcc cttttccttt 1320
atttcaatat atgccgtgca cttgtttgtc gggtcatctt ttcatgcttt tttttgtctt 1380
ggttgtgatg atgtggtctg gttgggcggt cgttctagat cggagtagaa ttctgtttca 1440
aactacctgg tggatttatt aattttggat ctgtatgtgt gtgccataca tattcatagt 1500
tacgaattga agatgatgga tggaaatatc gatctaggat aggtatacat gttgatgcgg 1560
gttttactga tgcatataca gagatgcttt ttgttcgctt ggttgtgatg atgtggtgtg 1620
gttgggcggt cgttcattcg ttctagatcg gagtagaata ctgtttcaaa ctacctggtg 1680
tatttattaa ttttggaact gtatgtgtgt gtcatacatc ttcatagtta cgagtttaag 1740
atggatggaa atatcgatct aggataggta tacatgttga tgtgggtttt actgatgcat 1800
atacatgatg gcatatgcag catctattca tatgctctaa ccttgagtac ctatctatta 1860
taataaacaa gtatgtttta taattatttt gatcttgata tacttggatg atggcatatg 1920
cagcagctat atgtggattt ttttagccct gccttcatac gctatttatt tgcttggtac 1980
tgtttctttt gtcgatgctc accctgttgt ttggtgttac ttctgcagac tagagttatc 2040
aacaagtttg tacaaaaaag ctgccaccat ggcgatgccg tatgcctctc tttccccggc 2100
aggcgaccgc cgctcctccc cggccgccac cgccaccgcc tccctcctcc ccttctgccg 2160
ctcctccccc ttctccgccg gcggcaatgg cggcatgggg gaggaggcgc ggatggacgg 2220
gaggtggatg gcgaggccgg tgcccttcac ggcggcgcag tacgaggagc tggagcacca 2280
ggcgctcata tacaagtacc tggtggccgg cgtgtccgtc ccgccggatc tcgtgctccc 2340
catccgccgc ggcatcgagt ccctcgccgc ccgcttctac cacaaccccc tcgccatcgg 2400
gtacggatcg tacctgggca agaaggtgga tccggagccg ggccggtgcc ggcgcacgga 2460
cggcaagaag tggcggtgcg ccaaggaggc cgcctccgac tccaagtatt gcgagcgcca 2520
catgcaccgc ggccgcaacc gttcaagaaa gcctgtggaa acgcagctcg tctcgcactc 2580
ccagccgccg gccgcctccg tcgtgccgcc cctcgccacc ggcttccaca accactccct 2640
ctaccccgcc atcggcggca ccaacggtgg tggaggcggg gggaacaacg gcatgtccat 2700
gcccggcacg ttctcctccg cgctggggcc gcctcagcag cacatgggca acaatgccgc 2760
ctctccctac gcggctctcg gcggcgccgg aacatgcaaa gatttcaggt ataccgcata 2820
tggaataaga tctttggcag acgagcagag tcagctcatg acagaagcca tgaacacctc 2880
cgtggagaac ccatggcgcc tgccgccatc ttctcaaacg actacattcc cgctctcaag 2940
ctactctcct cagcttggag caacgagtga cctgggtcag aacaacagca gcaacaacaa 3000
cagcggcgtc aaggccgagg gacagcagca gcagcagccg ctctccttcc cggggtgcgg 3060
cgacttcggc agcggcgact ccgcgaagca ggagaaccag acgctgcggc cgttcttcga 3120
cgagtggccg aagacgaggg actcgtggtc ggacctgacc gacgacaact cgaacgtcgc 3180
ctccttctcg gccacccagc tgtcgatctc gatacccatg acgtcctccg acttctccgc 3240
cgccagctcc cagtcgccca acggcatgct gttcgccggc gagatgtacc gttcagcttt 3300
cttgtacaaa gtggttgata acagcgggtt aattaacatc ttttacccat acgatgttcc 3360
tgactatgcg ggctatccct atgacgtccc ggactatgca ggatcctatc catatgacgt 3420
tccagattac gctgctcagt gcagcgctta gagctcgaat ttccccgatc gttcaaacat 3480
ttggcaataa agtttcttaa gattgaatcc tgttgccggt cttgcgatga ttatcatata 3540
atttctgttg aattacgtta agcatgtaat aattaacatg taatgcatga cgttatttat 3600
gagatgggtt tttatgatta gagtcccgca attatacatt taatacgcga tagaaaacaa 3660
aatatagcgc gcaaactagg ataaattatc gcgcgcggtg tcatctatgt tactagatcg 3720
ggaattagag ctttgcagcg tgacccggtc gtgcccctct ctagagataa tgagcattgc 3780
atgtctaagt tataaaaaat taccacatat tttttttgtc acacttgttt gaagtgcagt 3840
ttatctatct ttatacatat atttaaactt tactctacga ataatataat ctatagtact 3900
acaataatat cagtgtttta gagaatcata taaatgaaca gttagacatg gtctaaagga 3960
caattgagta ttttgacaac aggactctac agttttatct ttttagtgtg catgtgttct 4020
cctttttttt tgcaaatagc ttcacctata taatacttca tccattttat tagtacatcc 4080
atttagggtt tagggttaat ggtttttata gactaatttt tttagtacat ctattttatt 4140
ctattttagc ctctaaatta agaaaactaa aactctattt tagttttttt atttaataat 4200
ttagatataa aatagaataa aataaagtga ctaaaaatta aacaaatacc ctttaagaaa 4260
ttaaaaaaac taaggaaaca tttttcttgt ttcgagtaga taatgccagc ctgttaaacg 4320
ccgtcgacga gtctaacgga caccaaccag cgaaccagca gcgtcgcgtc gggccaagcg 4380
aagcagacgg cacggcatct ctgtcgctgc ctctggaccc ctctcgagag ttccgctcca 4440
ccgttggact tgctccgctg tcggcatcca gaaattgcgt ggcggagcgg cagacgtgag 4500
ccggcacggc aggcggcctc ctcctcctct cacggcacgg cagctacggg ggattccttt 4560
cccaccgctc cttcgctttc ccttcctcgc ccgccgtaat aaatagacac cccctccaca 4620
ccctctttcc ccaacctcgt gttgttcgga gcgcacacac acacaaccag atctccccca 4680
aatccacccg tcggcacctc cgcttcaagg tacgccgctc gtcctccccc cccccccctc 4740
tctaccttct ctagatcggc gttccggtcc atggttaggg cccggtagtt ctacttctgt 4800
tcatgtttgt gttagatccg tgtttgtgtt agatccgtgc tgctagcgtt cgtacacgga 4860
tgcgacctgt acgtcagaca cgttctgatt gctaacttgc cagtgtttct ctttggggaa 4920
tcctgggatg gctctagccg ttccgcagac gggatcgatt tcatgatttt ttttgtttcg 4980
ttgcataggg tttggtttgc ccttttcctt tatttcaata tatgccgtgc acttgtttgt 5040
cgggtcatct tttcatgctt ttttttgtct tggttgtgat gatgtggtct ggttgggcgg 5100
tcgttctaga tcggagtaga attctgtttc aaactacctg gtggatttat taattttgga 5160
tctgtatgtg tgtgccatac atattcatag ttacgaattg aagatgatgg atggaaatat 5220
cgatctagga taggtataca tgttgatgcg ggttttactg atgcatatac agagatgctt 5280
tttgttcgct tggttgtgat gatgtggtgt ggttgggcgg tcgttcattc gttctagatc 5340
ggagtagaat actgtttcaa actacctggt gtatttatta attttggaac tgtatgtgtg 5400
tgtcatacat cttcatagtt acgagtttaa gatggatgga aatatcgatc taggataggt 5460
atacatgttg atgtgggttt tactgatgca tatacatgat ggcatatgca gcatctattc 5520
atatgctcta accttgagta cctatctatt ataataaaca agtatgtttt ataattattt 5580
tgatcttgat atacttggat gatggcatat gcagcagcta tatgtggatt tttttagccc 5640
tgccttcata cgctatttat ttgcttggta ctgtttcttt tgtcgatgct caccctgttg 5700
tttggtgtta cttctgcaga ctagagttat caacaagttt gtacaaaaaa gctgccacca 5760
tgcagcagca acacctgatg cagatgaacc agagcatgat ggggggctac gcttcctcta 5820
ccactgccac cactgatctc attcagcagt acctggatga gaacaagcag ctgatcctgg 5880
ccatcctcga caaccagaac aacggcaagg tggaggagtg cgcacggaac caagctaagc 5940
tccagcagaa cctcatgtac ctcgccgcca tcgccgacag ccagcctccg cagacggcat 6000
cgctgtctca gtacccgtcc aacctgatga tgcagtccgg gccgcggtac atgcagcagc 6060
agtcggcgca gatgatgtcg ccgcagtcgc tgatggcggc gcggtcgtcg atgatgtacg 6120
cgcagcaggc catgtcgccg ctccagcagc agcagcagca gcagcagcac caggcggccg 6180
cgcacggcca gctggggatg tcctccggcg cgaccaccgg gttcaacctc ctgcacggtg 6240
aggccagcat gggcggcggc ggcggcgcca ctggcaacag catgatgaac gccagcgtct 6300
tctcggacta tggccgcggc ggcagcggcg ccaaggaggg gtcgacctcg ctgtcggccg 6360
acgctcgcgg cgccaactct ggcgcgcaca gcggcgacgg ggagtacctc aagggcaccg 6420
aggaggaagg aagccgttca gctttcttgt acaaagtggt tgataacagc gggttaatta 6480
acatctttta cccatacgat gttcctgact atgcgggcta tccctatgac gtcccggact 6540
atgcaggatc ctatccatat gacgttccag attacgctgc tcagtgcagc gcttagagct 6600
cgaatttccc cgatcgttca aacatttggc aataaagttt cttaagattg aatcctgttg 6660
ccggtcttgc gatgattatc atataatttc tgttgaatta cgttaagcat gtaataatta 6720
acatgtaatg catgacgtta tttatgagat gggtttttat gattagagtc ccgcaattat 6780
acatttaata cgcgatagaa aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg 6840
cggtgtcatc tatgttacta gatcgggaat tagcactagt aacggccgcc agtgtgctgg 6900
aattcgccct ttaaataagc gatcgccccg agcaataatc tccaggagat caaatacctt 6960
cccaagaagg ttaaagatgc agtcaaaaga ttcaggacta actgcatcaa gaacacagag 7020
aaagatatat ttctcaagat cagaagtact attccagtat ggacgattca aggcttgctt 7080
cacaaaccaa ggcaagtaat agagattgga gtctctaaaa aggtagttcc cactgaatca 7140
aaggccatgg agtcaaagat tcaaatagag gacctaacag aactcgccgt aaagactggc 7200
gaacagttca tacagagtct cttacgactc aatgacaaga agaaaatctt cgtcaacatg 7260
gtggagcacg acacgcttgt ctactccaaa aatatcaaag atacagtctc agaagaccaa 7320
agggcaattg agacttttca acaaagggta atatccggaa acctcctcgg attccattgc 7380
ccagctatct gtcactttat tgtgaagata gtggaaaagg aaggtggctc ctacaaatgc 7440
catcattgcg ataaaggaaa ggccatcgtt gaagatgcct ctgccgacag tggtcccaaa 7500
gatggacccc cacccacgag gagcatcgtg gaaaaagaag acgttccaac cacgtcttca 7560
aagcaagtgg attgatgtga tatctccact gacgtaaggg atgacgcaca atcccactat 7620
ccttcgcaag acccttcctc tatataagga agttcatttc atttggagag gacacggggg 7680
actctagagg atccccgatc cctacagggt aaatttctag tttttctcct tcattttctt 7740
ggttaggacc cttttctctt tttatttttt tgagctttga tctttcttta aactgatcta 7800
ttttttaatt gattggttat ggtgtaaata ttacatagct ttaactgata atctgattac 7860
tttatttcgt gtgtctatga tgatgatgat agttacagaa ccgtcgaggg ggatccaatg 7920
agatatgaaa aagcctgaac tcaccgcgac gtctgtcgag aagtttctga tcgaaaagtt 7980
cgacagcgtc tccgacctga tgcagctctc ggagggcgaa gaatctcgtg ctttcagctt 8040
cgatgtagga gggcgtggat atgtcctgcg ggtaaatagc tgcgccgatg gtttctacaa 8100
agatcgttat gtttatcggc actttgcatc ggccgcgctc ccgattccgg aagtgcttga 8160
cattggggaa ttcagcgaga gcctgaccta ttgcatctcc cgccgtgcac agggtgtcac 8220
gttgcaagac ctgcctgaaa ccgaactgcc cgctgttctg cagccggtcg cggaggccat 8280
ggatgcgatc gctgcggccg atcttagcca gacgagcggg ttcggcccat tcggaccgca 8340
aggaatcggt caatacacta catggcgtga tttcatatgc gcgattgctg atccccatgt 8400
gtatcactgg caaactgtga tggacgacac cgtcagtgcg tccgtcgcgc aggctctcga 8460
tgagctgatg ctttgggccg aggactgccc cgaagtccgg cacctcgtgc acgcggattt 8520
cggctccaac aatgtcctga cggacaatgg ccgcataaca gcggtcattg actggagcga 8580
ggcgatgttc ggggattccc aatacgaggt cgccaacatc ttcttctgga ggccgtggtt 8640
ggcttgtatg gagcagcaga cgcgctactt cgagcggagg catccggagc ttgcaggatc 8700
gccgcggctc cgggcgtata tgctccgcat tggtcttgac caactctatc agagcttggt 8760
tgacggcaat ttcgatgatg cagcttgggc gcagggtcga tgcgacgcaa tcgtccgatc 8820
cggagccggg actgtcgggc gtacacaaat cgcccgcaga agcgcggccg tctggaccga 8880
tggctgtgta gaagtactcg ccgatagtgg aaaccgacgc cccagcactc gtccgagggc 8940
aaaggaatag agtagattta aattgaaatc accagtctct ctctacaaat ctatctctct 9000
ctataataat gtgtgagtag ttcccagata agggaattag ggttcttata gggtttcgct 9060
catgtgttga gcatataaga aacccttagt atgtatttgt atttgtaaaa tacttctatc 9120
aataaaattt ctaattccta aaaccaaaat ccagtgggcg cggggtaccg agctcgaatt 9180
cagtacatta aaaacgtccg caatgtgtta ttaagttgtc taagcgtcaa tttgtttaca 9240
ccacaatata tcctgccacc agccagccaa cagctccccg accggcagct cggcacaaaa 9300
tcaccactcg atacaggcag cccatcagtc cgggacggcg tcagcgggag agccgttgta 9360
aggcggcaga ctttgctcat gttaccgatg ctattcggaa gaacggcaac taagctgccg 9420
ggtttgaaac acggatgatc tcgcggaggg tagcatgttg attgtaacga tgacagagcg 9480
ttgctgcctg tgatcaaata tcatctccct cgcagagatc cgaattatca gccttcttat 9540
tcatttctcg cttaaccgtg acaggctgtc gatcttgaga actatgccga cataatagga 9600
aatcgctgga taaagccgct gaggaagctg agtggcgcta tttctttaga agtgaacgtt 9660
gacgatcgtc gaccgtaccc cgatgaatta attcggacgt acgttctgaa cacagctgga 9720
tacttacttg ggcgattgtc atacatgaca tcaacaatgt acccgtttgt gtaaccgtct 9780
cttggaggtt cgtatgacac taggtcgcta ccttaggacc gttatagtta ctagcgaatt 9840
gacatgaggt tgccccgtat tcagtgtcgc tgatttgtat tgtctgaagt tgtttttacg 9900
ttaagttgat gcagatcaat taatacgata cctgcgtcat aattgattat ttgacgtggt 9960
ttgatggcct ccacgcacgt tgtgatatgt agatgataat cattatcact ttacgggtcc 10020
tttccggtga tccgacaggt tacggggcgg cgacctcgcg ggttttcgct atttatgaaa 10080
attttccggt ttaaggcgtt tccgttcttc ttcgtcataa cttaatgttt ttatttaaaa 10140
taccctctga aaagaaagga aacgacaggt gctgaaagcg agctttttgg cctctgtcgt 10200
ttcctttctc tgtttttgtc cgtggaatga acaatggaag gatcttctcg gcggcgatca 10260
cgacgccggc cctgcggagc cttcgccgcg tgcgcgattc atggcggccg tggaggccaa 10320
ggatttcgcg cgagtgcaag agctgatcga ggcgcgtgga gccaagtcgg cggctgatta 10380
tgtccttgcg cagctcgccg tggccgaagg tctggaccgc aagcctggtg cgcgcgtcgt 10440
ggtcgggaaa gcggcgggca gcatggcaat gccgcctgcg gcgctgggtt ttacgccaag 10500
gggagaagcg gcatacgcca tcgagcggtc agcctatggt gagccgaggt ccagcattgc 10560
gaagcagtac cagcaggaat ggaaccggaa ggcggcgacc tggtgggcga tggccggtgt 10620
ggccggcatc atcggcgcga tcctggcggc ggcggcaacc ggctttgttg ggctggcagt 10680
gtcgatccgc aaccgagtga agcgcgtgcg cgacctgttg gtgatggagc cgggtgcaga 10740
gccataagcg gcaagagacg aaagcccggt ttccgggctt ttgttttgtt acgccaagga 10800
cgagttttag cggctaaagg tgttgacgtg cgagaaatgt ttagctaaac ttctctcatg 10860
tgctggcggc tgtcaccgct atgttcaacc aaggcgcgga gcaaattatg ggtgttatcc 10920
atgaagaaac ggcttaccga aagccagttc caggaggcga tccaggggct ggaagtgggg 10980
cagcagacca tcgagatagc gcggggcgtc ttagtcgatg ggaagccaca ggcgacgttc 11040
gcaacgtcgc tgggactgac caggggcgca gtgtcgcaag cggtgcatcg cgtgtgggcc 11100
gcgttcgagg acaagaactt gcccgagggg tacgcgcggg taacggcggt tctgccggaa 11160
catcaggcgt acatcgtccg gaagtgggaa gcggacgcca agaaaaaaca ggaaaccaaa 11220
cgatgaaaac tttggtcacg gccaaccaga aaggcggcgt cggcaagact tcgacccttg 11280
tgcatcttgc cttcgacttt ttcgagcgcg gcttgcgggt tgccgtgatc gacctggacc 11340
cccagggcaa tgcgtcctac acgctcaagg actttgctac cggcctgcat gcaagcaagc 11400
tgttcggcgc tgtccctgcc ggcggctgga ccgaaaccgc acccgcagcc ggcgacggcc 11460
aggccgcgcg cctcgccctc atcgagtcca acccggtact ggcgaacgcc gaacggctgt 11520
cgctggacga cgcccgcgag ctgttcgggg cgaacatcaa ggccctggcg aaccaaggct 11580
tcgacgtgtg cctgatcgac acggccccga cccttggcgt cggcctggcg gccgccctct 11640
tcgcggccga ctatgtgctg tcccccatcg agcttgaggc gtacagcatc cagggcatca 11700
agaagatggt cacgaccatt gcgaacgtgc gccagaagaa cgccaagctg caattccttg 11760
gcatggtgcc cagcaaggtc gatgcgcgga atccgcgcca cgcgcgccac caagccgagc 11820
tgctggccgc gtaccccaag atgatgattc cggccaccgt tggcctgcgc agcagcatcg 11880
ccgatgccct cgcatccggt gtgccggtct ggaagatcaa gaaaacggcc gcgcgcaagg 11940
catcgaaaga ggttcgcgcc ctggctgatt acgtgttcac gaagatggag atttcccaat 12000
gactgcggct caagccaaga ccaccaagaa aaacaccgct gcggccgctc aggaagccgc 12060
aggcgcggcg cagccgtccg gcctggggtt ggatagcatc ggcgacctgt cgagcctcct 12120
ggacgctcct gcggcgtctc agggcggttc cggccctatc gagctggacc tggacctgat 12180
cgacgaagat ccgcatcagc cgcggacggc cgacaacccc ggcttttccc cggagagcat 12240
cgcggaaatc ggtgccacga tcaaagagcg cggggtgaag tcacccattt cggtgcgcga 12300
gaaccaggag cagccgggcc gctatatcat caatcacggc gcccgccgct accgtggctc 12360
gaagtgggcc ggcaagaagt ccatcccggc gttcatcgac aacgactaca acgaagccga 12420
ccaggttatc gagaacctgc aacgcaacga gctgaccccg cgcgaaattg ccgacttcat 12480
tggccgcgag ctggcgaagg gcaagaagaa aggcgatatc gccaaggaaa tcggcaagtc 12540
gccggcgttc atcacccagc acgtcacgct gctggacctg ccggagaaga tcgccgatgc 12600
gttcaacacc ggccgcgtgc gcgacgtgac cgtggtgaac gagctggtga cggccttcaa 12660
gaagcgcccg gaggaagtcg aggcgtggct tgacgacgac acccaggaaa tcacgcgcgg 12720
cacggtcaag ctgctgcgcg agttcctgga cgagaagggc cgcgatccca acaccgtcga 12780
tgccttcaac ggccagactg atgccgagcg tgacgcggag gccggcgacg gccaggacgg 12840
cgaggacggc gaccaggacg gtaaggacgc caaggaaaag ggcgcgaagg agccggaccc 12900
ggacaagctg aaaaaggcca tcgtccaggt cgagcacgac gagcgccctg cccgccttat 12960
cctcaaccgt cggccgccgg cggaaggcta tgcctggttg aagtacgagg acgacggcca 13020
ggagttcgag gcgaaccttg ccgacgtgaa actggtcgcg ctcatcgagg gctgatcccc 13080
aaagacagcg gcgcgggcca cccgcgccgc acagacaacg gttccgctac aaggaggacc 13140
gaagaatgaa tccgatgctg ttctacatcg cgggaggcgt aggcgcggcg ttgctgctgg 13200
tttccgcgat catgctgttc aagctgcgcg agccgaagaa ggaacaccga ccgcagcgca 13260
aggcggcggc cccgacgccg cagccggtcg ataacgagct gctgcgcact ctagtgatat 13320
tccacaaaac agcagggaag cagcgctttt ccgctgcata accctgcttc ggggtcatta 13380
tagcgatttt ttcggtatat ccatcctttt tcgcacgata tacaggattt tgccaaaggg 13440
ttcgtgtaga ctttccttgg tgtatccaac ggcgtcagcc gggcaggata ggtgaagtag 13500
gcccacccgc gagcgggtgt tccttcttca ctgtccctta ttcgcacctg gcggtgctca 13560
acgggaatcc tgctctgcga ggctggccgg ctaccgccgg cgtaacagat gagggcaagc 13620
ggatggctga tgaaaccaag ccaaccagga agggcagccc acctatcaag gtgtactgcc 13680
ttccagacga acgaagagcg attgaggaaa aggcggcggc ggccggcatg agcctgtcgg 13740
cctacctgct ggccgtcggc cagggctaca aaatcacggg cgtcgtggac tatgagcacg 13800
tccgcgagct ggcccgcatc aatggcgacc tgggccgcct gggcggcctg ctgaaactct 13860
ggctcaccga cgacccgcgc acggcgcggt tcggtgatgc cacgatcctc gccctgctgg 13920
cgaagatcga agagaagcag gacgagcttg gcaaggtcat gatgggcgtg gtccgcccga 13980
gggcagagcc atgacttttt tagccgctaa aacggccggg gggtgcgcgt gattgccaag 14040
cacgtcccca tgcgctccat caagaagagc gacttcgcgg agctggtgaa gtacatcacc 14100
gacgagcaag gcaagaccga gcgccagatc caaaacaact gtcaaagcgc acccgcccga 14160
tgccattcgc ggcacggctt ccgttgagga tgtcgatatg atgcgcgagc cgacggcccg 14220
cagagaaggg gccgttttag cggctaaaga aggaagtgca agccctaacc cttggcgtca 14280
gagccttcca cgcagctttt ttcgggtgtc gtcgccccat ttctttacga taaacgcctt 14340
atgtgacggc aaaaccacac tgatgcgttc gtatccgggc ggcacgctgc tcttgaaagg 14400
atgacccgca atctccgcga gtgcctcgcg gtcaaggtcg gtggactcca ggagaagagg 14460
taggggagtt tccagggcgt cggcaatggc ctccatcacc ttcaacgagg ggttggcctt 14520
accgttggtt aagtctgata aaaacgaaat tgaaacccct gccctctccg acagctcatg 14580
tttcgtcatg ccccgctcat cgagcagacg aaggatgttg gtgaaaaata tctggttgta 14640
cacagcggaa gccgcccctc gcacctttgg tcgcggcccg caaaatttta gccgctaaag 14700
ttcttgacag cggaaccaat gtttagctaa actagagtct cctttctcaa ggagactttc 14760
gatatgagcc ataatcagtt ccagtttatc ggtaatctta cccgtgacac cgaggtacgt 14820
catggcaatt ctaacaagcc gcaagcaatt ttcgatatag cggttaatga agagtggcgc 14880
aacgatgccg gcgacaagca ggagcgcacc gacttcttcc gcatcaagtg ttttggctct 14940
caggccgagg cccacggcaa gtatttgggc aaggggtcgc tggtattcgt gcagggcaag 15000
attcggaata ccaagtacga gaaggacggc cagacggtct acgggaccga cttcattgcc 15060
gataaggtgg attatctgga caccaaggca ccaggcgggt caaatcagga ataagggcac 15120
attgccccgg cgtgagtcgg ggcaatcccg caaggagggt gaatgaatcg gacgtttgac 15180
cggaaggcat acaggcaaga actgatcgac gcggggtttt ccgccgagga tgccgaaacc 15240
atcgcaagcc gcaccgtcat gcgtgcgccc cgcgaaacct tccagtccgt cggctcgatg 15300
gtccagcaag ctacggccaa gatcgagcgc gacagcgtgc aactggctcc ccctgccctg 15360
cccgcgccat cggccgccgt ggagcgttcg cgtcgtctcg aacaggaggc ggcaggtttg 15420
gcgaagtcga tgaccatcga cacgcgagga actatgacga ccaagaagcg aaaaaccgcc 15480
ggcgaggacc tggcaaaaca ggtcagcgag gccaagcagg ccgcgttgct gaaacacacg 15540
aagcagcaga tcaaggaaat gcagctttcc ttgttcgata ttgcgccgtg gccggacacg 15600
atgcgagcga tgccaaacga cacggcccgc tctgccctgt tcaccacgcg caacaagaaa 15660
atcccgcgcg aggcgctgca aaacaaggtc attttccacg tcaacaagga cgtgaagatc 15720
acctacaccg gcgtcgagct gcgggccgac gatgacgaac tggtgtggca gcaggtgttg 15780
gagtacgcga agcgcacccc tatcggcgag ccgatcacct tcacgttcta cgagctttgc 15840
caggacctgg gctggtcgat caatggccgg tattacacga aggccgagga atgcctgtcg 15900
cgcctacagg cgacggcgat gggcttcacg tccgaccgcg ttgggcacct ggaatcggtg 15960
tcgctgctgc accgcttccg cgtcctggac cgtggcaaga aaacgtcccg ttgccaggtc 16020
ctgatcgacg aggaaatcgt cgtgctgttt gctggcgacc actacacgaa attcatatgg 16080
gagaagtacc gcaagctgtc gccgacggcc cgacggatgt tcgactattt cagctcgcac 16140
cgggagccgt acccgctcaa gctggaaacc ttccgcctca tgtgcggatc ggattccacc 16200
cgcgtgaaga agtggcgcga gcaggtcggc gaagcctgcg aagagttgcg aggcagcggc 16260
ctggtggaac acgcctgggt caatgatgac ctggtgcatt gcaaacgcta gggccttgtg 16320
gggtcagttc cggctggggg ttcagcagcc agcgctttac tggcatttca ggaacaagcg 16380
ggcactgctc gacgcacttg cttcgctcag tatcgctcgg gacgcacggc gcgctctacg 16440
aactgccgat aaacagagga ttaaaattga caattgtgat taaggctcag attcgacggc 16500
ttggagcggc cgacgtgcag gatttccgcg agatccgatt gtcggccctg aagaaagctc 16560
cagagatgtt cgggtccgtt tacgagcacg aggagaaaaa gcccatggag gcgttcgctg 16620
aacggttgcg agatgccgtg gcattcggcg cctacatcga cggcgagatc attgggctgt 16680
cggtcttcaa acaggaggac ggccccaagg acgctcacaa ggcgcatctg tccggcgttt 16740
tcgtggagcc cgaacagcga ggccgagggg tcgccggtat gctgctgcgg gcgttgccgg 16800
cgggtttatt gctcgtgatg atcgtccgac agattccaac gggaatctgg tggatgcgca 16860
tcttcatcct cggcgcactt aatatttcgc tattctggag cttgttgttt atttcggtct 16920
accgcctgcc gggcggggtc gcggcgacgg taggcgctgt gcagccgctg atggtcgtgt 16980
tcatctctgc cgctctgcta ggtagcccga tacgattgat ggcggtcctg ggggctattt 17040
gcggaactgc gggcgtggcg ctgttggtgt tgacaccaaa cgcagcgcta gatcctgtcg 17100
gcgtcgcagc gggcctggcg ggggcggttt ccatggcgtt cggaaccgtg ctgacccgca 17160
agtggcaacc tcccgtgcct ctgctcacct ttaccgcctg gcaactggcg gccggaggac 17220
ttctgctcgt tccagtagct ttagtgtttg atccgccaat cccgatgcct acaggaacca 17280
atgttctcgg ctgctcgact gcacgaatac cagcgacccc ttgcccaaat acttgccgtg 17340
ggcctcggcc tgagagccaa aacacttgat gcggaagaag tcggtgcgct cctgcttgtc 17400
gccggcatcg ttgcgccaca tctaggtact aaaacaattc atccagtaaa atataatatt 17460
ttattttctc ccaatcaggc ttgatcccca gtaagtcaaa aaatagctcg acatactgtt 17520
cttccccgat atcctccctg atcgaccgga cgcagaaggc aatgtcatac cacttgtccg 17580
ccctgccgct tctcccaaga tcaataaagc cacttacttt gccatctttc acaaagatgt 17640
tgctgtctcc caggtcgccg tgggaaaaga caagttcctc ttcgggcttt tccgtcttta 17700
aaaaatcata cagctcgcgc ggatctttaa atggagtgtc ttcttcccag ttttcgcaat 17760
ccacatcggc cagatcgtta ttcagtaagt aatccaattc ggctaagcgg ctgtctaagc 17820
tattcgtata gggacaatcc gatatgtcga tggagtgaaa gagcctgatg cactccgcat 17880
acagctcgat aatcttttca gggctttgtt catcttcata ctcttccgag caaaggacgc 17940
catcggcctc actcatgagc agattgctcc agccatcatg ccgttcaaag tgcaggacct 18000
ttggaacagg cagctttcct tccagccata gcatcatgtc cttttcccgt tccacatcat 18060
aggtggtccc tttataccgg ctgtccgtca tttttaaata taggttttca ttttctccca 18120
ccagcttata taccttagca ggagacattc cttccgtatc ttttacgcag cggtattttt 18180
cgatcagttt tttcaattcc ggtgatattc tcattttagc catttattat ttccttcctc 18240
ttttctacag tatttaaaga taccccaaga agctaattat aacaagacga actccaattc 18300
actgttcctt gcattctaaa accttaaata ccagaaaaca gctttttcaa agttgttttc 18360
aaagttggcg tataacatag tatcgattcg atagcgtgga ctcaaggctc tcgcgaatgg 18420
ctcgcgttgg aaactttcat tgacacttga ggggcaccgc agggaaattc tcgtccttgc 18480
gagaaccggc tatgtcgtgc tgcgcatcga gcctgcgccc ttggcttgtc tcgcccctct 18540
ccgcgtcgct acggggcttc cagcgccttt ccgacgctca ccgggctggt tgccctcgcc 18600
gctgggctgg cggccgtcta tggccctgca aacgcgccag aaacgccgtc gaagccgtgt 18660
gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg aaaacttggc cctcactgac 18720
agatgagggg cggacgttga cacttgaggg gccgactcac ccggcgcggc gttgacagat 18780
gaggggcagg ctcgatttcg gccggcgacg tggagctggc cagcctcgca aatcggcgaa 18840
aacgcctgat tttacgcgag tttcccacag atgatgtgga caagcctggg gataagtgcc 18900
ctgcggtatt gacacttgag gggcgcgact actgacagat gaggggcgcg atccttgaca 18960
cttgaggggc agagtgctga cagatgaggg gcgcacctat tgacatttga ggggctgtcc 19020
acaggcagaa aatccagcat ttgcaagggt ttccgcccgt ttttcggcca ccgctaacct 19080
gtcttttaac ctgcttttaa accaatattt ataaaccttg tttttaacca gggctgcgcc 19140
ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc cttctcgaac cctcccggcc 19200
cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc tcggccgcga acggcctcac 19260
cccaaaaatg gcagcgccag ccaggacgtc ggccgaaaga gcgacaagca gatcacgctt 19320
ttcgacagcg tcggatttgc gatcgaggat ttttcggcgc tgcgctacgt ccgcgaccgc 19380
gttgagggat caagccacag cagcccactc gaccttctag ccgacccaga cgagccaagg 19440
gatctttttg gaatgctgct ccgtcgtcag gctttccgac gtttgggtgg ttgaacagaa 19500
gtcattatcg cacggaatgc caagcactcc cgaggggaac cctgtggttg gcatgcacat 19560
acaaatggac gaacggataa accttttcac gcccttttaa atatccgatt attctaataa 19620
acgctctttt ctcttaggtt tacccgccaa tatatcctgt caaacactga tagtttaaac 19680
tgaaggcggg aaacgacaat ctgatcatga gcggagaatt aagggagtca cgttatgacc 19740
cccgccgatg acgcgggaca agccgtttta cgtttgga 19778
<210> 37
<211> 1107
<212> DNA
<213> Triticum aestivum
<400> 37
atgctgagct cgtcggcggc gatggggatg gggctgggcg ggtacggcca gcagcagcag 60
cagcagcaga tgcagatgca gatgcagcgg ggggcggggc cggtgttcac gccggcgcag 120
tgggccgagc tggagcagca ggcgctgatt tacaagtacc tcatggcggg cgtgcccgtg 180
ccgcccgatc tcctgctccc catccgcccc caccccgccg gcgccggagc caccttctcc 240
ttcgccaacc ccgccgcctc gcccttctac caccaccacc acccctccat gagttactac 300
gcctactatg gcaagaagct cgacccggag ccgtggcggt gccgccgcac cgacggcaag 360
aagtggaggt gctccaagga ggcgcacccc gactccaagt actgcgagcg ccacatgcac 420
cgtggccgca accgttcaag aaagcctgtg gaatccaagt ctgcttcccc tgcgcaccag 480
tcgcagcagc ccccgttgtc cgccgtcacg tccgccaccc gcgacgccga gcctctcccc 540
tccctcccgg cgggggctaa gacccatggc ctgtccctcg gcggggctgg ctcgtcgcag 600
atgcacgtcg acgcctcgtc atacggcaac aaatactccc ttggagctaa atctgacgtg 660
ggtgaactga gcttcttctc tggagcatca ggaaacaaca acaggggctt caccatcgat 720
tccccaacgg acagctcgtg gcactcaatg ggatccagcc tgcccccgta ccaactgtcg 780
aaacctagag attccggcct catgcaaggc ggcttctcgt attcccactt tgagccgtcg 840
caggagcttg ggcaggtaac catcgcctcg ctgtcccact cccaggagca ggaccgccgc 900
tccttcggtg gcggcggtgg tggtggaggt ggaggggcag ggctcatggg aaatgttaag 960
caggagaacc agccgctgag gcccttcttc gacgagtggc cggggaggcg ggactcgtgg 1020
tcggagatgg acgacgagcg ctccaacggc acctccttct cgacgaccca gctctcgatc 1080
tccatcccaa tgcctcgatg tgattga 1107
<210> 38
<211> 1230
<212> DNA
<213> Triticum aestivum
<400> 38
atgatgatga tgggcggtcg cgcgggggcc ggcggcgtcg gggcaggcgg cggccggtgc 60
ccgttcacgg cgacgcagtg gcaggagctg gagcaccagg cgctcatcta caagtacatg 120
gcctccggcg tgcccatccc ctccgacctc ctcctcccgc tccgccgcag cttcctcctc 180
gactccgccc tcgccacctc cccctccctc gccttccctc cccaggccgc actgggctgg 240
ggatgtttcg ggatggggtt cggccggaag gcggaggacc cggagccggg gcggtgccgg 300
cggactgacg gcaagaagtg gcgctgctcc aaggaggcgt acccggactc caagtactgc 360
gagaagcaca tgcaccgggg caagaaccgt tcaagaaagc ctgtggaaat gtccttggcc 420
acgcccccgc cgccgccttc ctcctcggcc tcctcttcct cctccaacgt ccactccgcc 480
gtcaacgtcg ccaccaccac cacctcccca gcgccgtcct accaccgcca cgccgctgcg 540
actcacgaca cgacgcccta ccacgccctc tacggcggcc cctactcctc cgccggccgc 600
cagcagcacg ccagcgccta ccaccacgcg gcgcaggtca gcccgttcca cctgcacctc 660
gacaccaccc acccgcaccc gccgccgtcc tactactcca ccatggacca cagcaaggac 720
agctacgcct acgggcacag cgtcaaggag gtgcacggcg gcggcgagca cgccttcttc 780
tcctccgacg tcagcaccga cagggaccac caccaccatc agcaccaaca ccacgctagc 840
gccggcggca acggccagtg gcagttcaag cagctcggcg gcatggagcc caagcagcac 900
aaccccacgt cgctcttccc cggctacggc aacaacgcgg cgtacgccat cgacctgtcc 960
agcaaagaag aggacgagga gaaggagagg cggcaacagc agcagcactg cttcctgctg 1020
ggcgccgacc tgaggctcga caagccgtcg tcggggcacg gcgactccgc cgaccagaag 1080
cctctccggc cgttcttcga cgagtggccg cacgagaaga ctggcagcaa ggggtcgtgg 1140
atggggctcg agggggagac gcagctctcc atctccatcg ccaatgaact ccccatcacc 1200
accacctccc gctaccacca tggtgaatga 1230
<210> 39
<211> 1227
<212> DNA
<213> Triticum aestivum
<400> 39
atggagctcg ggcaggtgct gggctacacg ccaccggcga ccaaggacgc gagatccggc 60
ggcggcttca cccaggctgc cgcttgcccc tacccctacc cctccccctt cctcgacgag 120
cagaagatgc tcagcttctc caaggccgcc gccgctcacc agccgccctc aggtatggat 180
tttgggaggt ccaatgagca gaggctgttg ctggccagga gcaagatgcc cttcacccct 240
tcacagtgga tggagctgga gcaccaggcc ctcatttaca agtatctcaa tgcaaaggcc 300
cccatacctt ccggcctgct catctccatc agcaagagct tcagaccctc ctccgataga 360
atgccctgga ggcctgtcta tcaagggttc accaatgcag attctgaccc ggaacctgga 420
agatgccgtc gaacagacgg caagaaatgg cggtgctcaa aggaggcgat ggccgagcac 480
aagtactgtg agcggcacat caataggaac cgccatcgtt caagaaagcc tgtggaaaac 540
caaacaagga agaacgccaa agagacacct gctgctggct cgatatcggc cgctgtctca 600
cagggtggct gtaagaaagc aaaagctggt gatgaactga agccagggag cgtcagttat 660
tggacagata atttaaacag ggcaatggtg agcaaagcca ggggaaacaa ccctgaagaa 720
ggcaacagtg ctccactcct gaattctact aatcaacaac acacattgtc cttgttctct 780
caactgaagc aacagagcaa accagataag ttcagcccgg cagtcgatag tgaatcgatc 840
tcctcaaata ctgtattgaa gccttgggaa agaagcaacc agcagagcag taaggacgtt 900
tcttccacga cgctccatga tcgcgggtgc cttcaatcag tccttcaaga tttcagcatg 960
cataagaatg acaagatcga gtctcagaaa aacaatgctt cagtgccatc tactttctat 1020
tcatctacag aaggtcgaca catcagctgc cttgcatcta acatgatgca agtgcaggag 1080
gattgcatct caagctcttg ggagatacct caaggtgggc ctttaggtga aatcctaaca 1140
aactccaaga atactgatga cttgaccaat aagtgtgaat caagatcata tggttggtta 1200
ctgagtcttg atgaacatga aatgtga 1227
<210> 40
<211> 1917
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 40
atgatgatga tgggcggtcg cgcgggggcc ggcggcgtcg gggcaggcgg cggccggtgc 60
ccgttcacgg cgacgcagtg gcaggagctg gagcaccagg cgctcatcta caagtacatg 120
gcctccggcg tgcccatccc ctccgacctc ctcctcccgc tccgccgcag cttcctcctc 180
gactccgccc tcgccacctc cccctccctc gccttccctc cccaggccgc actgggctgg 240
ggatgtttcg ggatggggtt cggccggaag gcggaggacc cggagccggg gcggtgccgg 300
cggactgacg gcaagaagtg gcgctgctcc aaggaggcgt acccggactc caagtactgc 360
gagaagcaca tgcaccgggg caagaaccgt tcaagaaagc ctgtggaaat gtccttggcc 420
acgcccccgc cgccgccttc ctcctcggcc tcctcttcct cctccaacgt ccactccgcc 480
gtcaacgtcg ccaccaccac cacctcccca gcgccgtcct accaccgcca cgccgctgcg 540
actcacgaca cgacgcccta ccacgccctc tacggcggcc cctactcctc cgccggccgc 600
cagcagcacg ccagcgccta ccaccacgcg gcgcaggtca gcccgttcca cctgcacctc 660
gacaccaccc acccgcaccc gccgccgtcc tactactcca ccatggacca cagcaaggac 720
agctacgcct acgggcacag cgtcaaggag gtgcacggcg gcggcgagca cgccttcttc 780
tcctccgacg tcagcaccga cagggaccac caccaccatc agcaccaaca ccacgctagc 840
gccggcggca acggccagtg gcagttcaag cagctcggcg gcatggagcc caagcagcac 900
aaccccacgt cgctcttccc cggctacggc aacaacgcgg cgtacgccat cgacctgtcc 960
agcaaagaag aggacgagga gaaggagagg cggcaacagc agcagcactg cttcctgctg 1020
ggcgccgacc tgaggctcga caagccgtcg tcggggcacg gcgactccgc cgaccagaag 1080
cctctccggc cgttcttcga cgagtggccg cacgagaaga ctggcagcaa ggggtcgtgg 1140
atggggctcg agggggagac gcagctctcc atctccatcg ccaatgaact ccccatcacc 1200
accacctccc gctaccacca tggtgaagcg gccgctgcca tgcagcagca acacctgatg 1260
cagatgaacc agagcatgat ggggggctac gcttcctcta ccactgccac cactgatctc 1320
attcagcagt acctggatga gaacaagcag ctgatcctgg ccatcctcga caaccagaac 1380
aacggcaagg tggaggagtg cgcacggaac caagctaagc tccagcagaa cctcatgtac 1440
ctcgccgcca tcgccgacag ccagcctccg cagacggcat cgctgtctca gtacccgtcc 1500
aacctgatga tgcagtccgg gccgcggtac atgcagcagc agtcggcgca gatgatgtcg 1560
ccgcagtcgc tgatggcggc gcggtcgtcg atgatgtacg cgcagcaggc catgtcgccg 1620
ctccagcagc agcagcagca gcagcagcac caggcggccg cgcacggcca gctggggatg 1680
tcctccggcg cgaccaccgg gttcaacctc ctgcacggtg aggccagcat gggcggcggc 1740
ggcggcgcca ctggcaacag catgatgaac gccagcgtct tctcggacta tggccgcggc 1800
ggcagcggcg ccaaggaggg gtcgacctcg ctgtcggccg acgctcgcgg cgccaactct 1860
ggcgcgcaca gcggcgacgg ggagtacctc aagggcaccg aggaggaagg aagctag 1917
<210> 41
<211> 1794
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 41
atgctgagct cgtcggcggc gatggggatg gggctgggcg ggtacggcca gcagcagcag 60
cagcagcaga tgcagatgca gatgcagcgg ggggcggggc cggtgttcac gccggcgcag 120
tgggccgagc tggagcagca ggcgctgatt tacaagtacc tcatggcggg cgtgcccgtg 180
ccgcccgatc tcctgctccc catccgcccc caccccgccg gcgccggagc caccttctcc 240
ttcgccaacc ccgccgcctc gcccttctac caccaccacc acccctccat gagttactac 300
gcctactatg gcaagaagct cgacccggag ccgtggcggt gccgccgcac cgacggcaag 360
aagtggaggt gctccaagga ggcgcacccc gactccaagt actgcgagcg ccacatgcac 420
cgtggccgca accgttcaag aaagcctgtg gaatccaagt ctgcttcccc tgcgcaccag 480
tcgcagcagc ccccgttgtc cgccgtcacg tccgccaccc gcgacgccga gcctctcccc 540
tccctcccgg cgggggctaa gacccatggc ctgtccctcg gcggggctgg ctcgtcgcag 600
atgcacgtcg acgcctcgtc atacggcaac aaatactccc ttggagctaa atctgacgtg 660
ggtgaactga gcttcttctc tggagcatca ggaaacaaca acaggggctt caccatcgat 720
tccccaacgg acagctcgtg gcactcaatg ggatccagcc tgcccccgta ccaactgtcg 780
aaacctagag attccggcct catgcaaggc ggcttctcgt attcccactt tgagccgtcg 840
caggagcttg ggcaggtaac catcgcctcg ctgtcccact cccaggagca ggaccgccgc 900
tccttcggtg gcggcggtgg tggtggaggt ggaggggcag ggctcatggg aaatgttaag 960
caggagaacc agccgctgag gcccttcttc gacgagtggc cggggaggcg ggactcgtgg 1020
tcggagatgg acgacgagcg ctccaacggc acctccttct cgacgaccca gctctcgatc 1080
tccatcccaa tgcctcgatg tgatgcggcc gctgccatgc agcagcaaca cctgatgcag 1140
atgaaccaga gcatgatggg gggctacgct tcctctacca ctgccaccac tgatctcatt 1200
cagcagtacc tggatgagaa caagcagctg atcctggcca tcctcgacaa ccagaacaac 1260
ggcaaggtgg aggagtgcgc acggaaccaa gctaagctcc agcagaacct catgtacctc 1320
gccgccatcg ccgacagcca gcctccgcag acggcatcgc tgtctcagta cccgtccaac 1380
ctgatgatgc agtccgggcc gcggtacatg cagcagcagt cggcgcagat gatgtcgccg 1440
cagtcgctga tggcggcgcg gtcgtcgatg atgtacgcgc agcaggccat gtcgccgctc 1500
cagcagcagc agcagcagca gcagcaccag gcggccgcgc acggccagct ggggatgtcc 1560
tccggcgcga ccaccgggtt caacctcctg cacggtgagg ccagcatggg cggcggcggc 1620
ggcgccactg gcaacagcat gatgaacgcc agcgtcttct cggactatgg ccgcggcggc 1680
agcggcgcca aggaggggtc gacctcgctg tcggccgacg ctcgcggcgc caactctggc 1740
gcgcacagcg gcgacgggga gtacctcaag ggcaccgagg aggaaggaag ctag 1794
<210> 42
<211> 1914
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 42
atggagctcg ggcaggtgct gggctacacg ccaccggcga ccaaggacgc gagatccggc 60
ggcggcttca cccaggctgc cgcttgcccc tacccctacc cctccccctt cctcgacgag 120
cagaagatgc tcagcttctc caaggccgcc gccgctcacc agccgccctc aggtatggat 180
tttgggaggt ccaatgagca gaggctgttg ctggccagga gcaagatgcc cttcacccct 240
tcacagtgga tggagctgga gcaccaggcc ctcatttaca agtatctcaa tgcaaaggcc 300
cccatacctt ccggcctgct catctccatc agcaagagct tcagaccctc ctccgataga 360
atgccctgga ggcctgtcta tcaagggttc accaatgcag attctgaccc ggaacctgga 420
agatgccgtc gaacagacgg caagaaatgg cggtgctcaa aggaggcgat ggccgagcac 480
aagtactgtg agcggcacat caataggaac cgccatcgtt caagaaagcc tgtggaaaac 540
caaacaagga agaacgccaa agagacacct gctgctggct cgatatcggc cgctgtctca 600
cagggtggct gtaagaaagc aaaagctggt gatgaactga agccagggag cgtcagttat 660
tggacagata atttaaacag ggcaatggtg agcaaagcca ggggaaacaa ccctgaagaa 720
ggcaacagtg ctccactcct gaattctact aatcaacaac acacattgtc cttgttctct 780
caactgaagc aacagagcaa accagataag ttcagcccgg cagtcgatag tgaatcgatc 840
tcctcaaata ctgtattgaa gccttgggaa agaagcaacc agcagagcag taaggacgtt 900
tcttccacga cgctccatga tcgcgggtgc cttcaatcag tccttcaaga tttcagcatg 960
cataagaatg acaagatcga gtctcagaaa aacaatgctt cagtgccatc tactttctat 1020
tcatctacag aaggtcgaca catcagctgc cttgcatcta acatgatgca agtgcaggag 1080
gattgcatct caagctcttg ggagatacct caaggtgggc ctttaggtga aatcctaaca 1140
aactccaaga atactgatga cttgaccaat aagtgtgaat caagatcata tggttggtta 1200
ctgagtcttg atgaacatga aatggcggcc gctgccatgc agcagcaaca cctgatgcag 1260
atgaaccaga gcatgatggg gggctacgct tcctctacca ctgccaccac tgatctcatt 1320
cagcagtacc tggatgagaa caagcagctg atcctggcca tcctcgacaa ccagaacaac 1380
ggcaaggtgg aggagtgcgc acggaaccaa gctaagctcc agcagaacct catgtacctc 1440
gccgccatcg ccgacagcca gcctccgcag acggcatcgc tgtctcagta cccgtccaac 1500
ctgatgatgc agtccgggcc gcggtacatg cagcagcagt cggcgcagat gatgtcgccg 1560
cagtcgctga tggcggcgcg gtcgtcgatg atgtacgcgc agcaggccat gtcgccgctc 1620
cagcagcagc agcagcagca gcagcaccag gcggccgcgc acggccagct ggggatgtcc 1680
tccggcgcga ccaccgggtt caacctcctg cacggtgagg ccagcatggg cggcggcggc 1740
ggcgccactg gcaacagcat gatgaacgcc agcgtcttct cggactatgg ccgcggcggc 1800
agcggcgcca aggaggggtc gacctcgctg tcggccgacg ctcgcggcgc caactctggc 1860
gcgcacagcg gcgacgggga gtacctcaag ggcaccgagg aggaaggaag ctag 1914
<210> 43
<211> 558
<212> DNA
<213> Triticum aestivum
<400> 43
atgcagcaag cgatgcccat gccgccggcg gcggcggcgc cggggatgcc tccatctgct 60
ggcctcagca ccgagcagat ccaaaagtac ctggatgaaa ataagcaact aattttggct 120
atcttggaaa atcagaacct gggaaagttg gcggaatgtg ctcagtatca agctcagctt 180
cagaagaatc ttttgtattt ggctgcaatt gctgatactc agccacagac ctctgtaagc 240
cgtcctcaga tggcaccacc tagtgcatcc ccaggggtag ggcattacat gtcacaggtg 300
ccaatgttcc ctccaaggac ccctctaacg cctcagcaga tgcaggagca gcaactacag 360
caacaacagg ctcagatgct tccgtttgct ggtcaaatgg ttgcgagacc tggggctgtg 420
aatggcatgg cccgggcccc tcaagttgaa ccagcctatg cagcaggtgg ggccagttcc 480
gagccttctg gcactgagag ccataggagc actggtgccg ataacaacgg tggaagcggc 540
ttggctgacc agtcctaa 558
<210> 44
<211> 606
<212> DNA
<213> Triticum aestivum
<400> 44
atgcagcagg cgatgtcctt gcccccggga gcggtcggcg cggtgtcccc gccggccggc 60
atcaccaccg agcagatcca aaagtatttg gatgaaaata agcaacttat tttggccatc 120
cttgaaaatc agaacctagg aaagttggct gaatgtgctc aatatcaagc tcagctccaa 180
aagaatctct tgtatctagc tgctatcgcg gatgcccaac caccacagaa ccctgcaagt 240
cgcccccaga tggtgcagca gcctggtagt atgcaaggtg cagggcatta catgtcacaa 300
gtaccaatgt tccctccaag aacgccttta accccacagc agatgcaaga gcagcagcac 360
cagcagcttc agcagcagca agcccaggcc ctttctttcc ccgcccaggt ggtcatgaga 420
ccaggcaccg tcaacggcat gcagcagcct atgcaagcag ccggcgacct ccagccagca 480
gcagcacctg gagggagcaa gcaggacgcc gcagtggctg gggccagctc ggaaccatct 540
ggcaccaaga gccacaagaa cgcgggagca gaggaggtgg gcgctgatgt agcagaacaa 600
tcctaa 606
<210> 45
<211> 1791
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 45
atggcgatgc cgtatgcctc tctttccccg gcaggcgacc gccgctcctc cccggccgcc 60
accgccaccg cctccctcct ccccttctgc cgctcctccc ccttctccgc cggcggcaat 120
ggcggcatgg gggaggaggc gcggatggac gggaggtgga tggcgaggcc ggtgcccttc 180
acggcggcgc agtacgagga gctggagcac caggcgctca tatacaagta cctggtggcc 240
ggcgtgtccg tcccgccgga tctcgtgctc cccatccgcc gcggcatcga gtccctcgcc 300
gcccgcttct accacaaccc cctcgccatc gggtacggat cgtacctggg caagaaggtg 360
gatccggagc cgggccggtg ccggcgcacg gacggcaaga agtggcggtg cgccaaggag 420
gccgcctccg actccaagta ttgcgagcgc cacatgcacc gcggccgcaa ccgttcaaga 480
aagcctgtgg aaacgcagct cgtctcgcac tcccagccgc cggccgcctc cgtcgtgccg 540
cccctcgcca ccggcttcca caaccactcc ctctaccccg ccatcggcgg caccaacggt 600
ggtggaggcg gggggaacaa cggcatgtcc atgcccggca cgttctcctc cgcgctgggg 660
ccgcctcagc agcacatggg caacaatgcc gcctctccct acgcggctct cggcggcgcc 720
ggaacatgca aagatttcag gtataccgca tatggaataa gatctttggc agacgagcag 780
agtcagctca tgacagaagc catgaacacc tccgtggaga acccatggcg cctgccgcca 840
tcttctcaaa cgactacatt cccgctctca agctactctc ctcagcttgg agcaacgagt 900
gacctgggtc agaacaacag cagcaacaac aacagcggcg tcaaggccga gggacagcag 960
cagcagcagc cgctctcctt cccggggtgc ggcgacttcg gcagcggcga ctccgcgaag 1020
caggagaacc agacgctgcg gccgttcttc gacgagtggc cgaagacgag ggactcgtgg 1080
tcggacctga ccgacgacaa ctcgaacgtc gcctccttct cggccaccca gctgtcgatc 1140
tcgataccca tgacgtcctc cgacttctcc gccgccagct cccagtcgcc caacggcatg 1200
ctgttcgccg gcgagatgta cgcggccgct gccatgcagc aagcgatgcc catgccgccg 1260
gcggcggcgg cgccggggat gcctccatct gctggcctca gcaccgagca gatccaaaag 1320
tacctggatg aaaataagca actaattttg gctatcttgg aaaatcagaa cctgggaaag 1380
ttggcggaat gtgctcagta tcaagctcag cttcagaaga atcttttgta tttggctgca 1440
attgctgata ctcagccaca gacctctgta agccgtcctc agatggcacc acctagtgca 1500
tccccagggg tagggcatta catgtcacag gtgccaatgt tccctccaag gacccctcta 1560
acgcctcagc agatgcagga gcagcaacta cagcaacaac aggctcagat gcttccgttt 1620
gctggtcaaa tggttgcgag acctggggct gtgaatggca tggcccgggc ccctcaagtt 1680
gaaccagcct atgcagcagg tggggccagt tccgagcctt ctggcactga gagccatagg 1740
agcactggtg ccgataacaa cggtggaagc ggcttggctg accagtccta a 1791
<210> 46
<211> 1839
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 46
atggcgatgc cgtatgcctc tctttccccg gcaggcgacc gccgctcctc cccggccgcc 60
accgccaccg cctccctcct ccccttctgc cgctcctccc ccttctccgc cggcggcaat 120
ggcggcatgg gggaggaggc gcggatggac gggaggtgga tggcgaggcc ggtgcccttc 180
acggcggcgc agtacgagga gctggagcac caggcgctca tatacaagta cctggtggcc 240
ggcgtgtccg tcccgccgga tctcgtgctc cccatccgcc gcggcatcga gtccctcgcc 300
gcccgcttct accacaaccc cctcgccatc gggtacggat cgtacctggg caagaaggtg 360
gatccggagc cgggccggtg ccggcgcacg gacggcaaga agtggcggtg cgccaaggag 420
gccgcctccg actccaagta ttgcgagcgc cacatgcacc gcggccgcaa ccgttcaaga 480
aagcctgtgg aaacgcagct cgtctcgcac tcccagccgc cggccgcctc cgtcgtgccg 540
cccctcgcca ccggcttcca caaccactcc ctctaccccg ccatcggcgg caccaacggt 600
ggtggaggcg gggggaacaa cggcatgtcc atgcccggca cgttctcctc cgcgctgggg 660
ccgcctcagc agcacatggg caacaatgcc gcctctccct acgcggctct cggcggcgcc 720
ggaacatgca aagatttcag gtataccgca tatggaataa gatctttggc agacgagcag 780
agtcagctca tgacagaagc catgaacacc tccgtggaga acccatggcg cctgccgcca 840
tcttctcaaa cgactacatt cccgctctca agctactctc ctcagcttgg agcaacgagt 900
gacctgggtc agaacaacag cagcaacaac aacagcggcg tcaaggccga gggacagcag 960
cagcagcagc cgctctcctt cccggggtgc ggcgacttcg gcagcggcga ctccgcgaag 1020
caggagaacc agacgctgcg gccgttcttc gacgagtggc cgaagacgag ggactcgtgg 1080
tcggacctga ccgacgacaa ctcgaacgtc gcctccttct cggccaccca gctgtcgatc 1140
tcgataccca tgacgtcctc cgacttctcc gccgccagct cccagtcgcc caacggcatg 1200
ctgttcgccg gcgagatgta cgcggccgct gccatgcagc aggcgatgtc cttgcccccg 1260
ggagcggtcg gcgcggtgtc cccgccggcc ggcatcacca ccgagcagat ccaaaagtat 1320
ttggatgaaa ataagcaact tattttggcc atccttgaaa atcagaacct aggaaagttg 1380
gctgaatgtg ctcaatatca agctcagctc caaaagaatc tcttgtatct agctgctatc 1440
gcggatgccc aaccaccaca gaaccctgca agtcgccccc agatggtgca gcagcctggt 1500
agtatgcaag gtgcagggca ttacatgtca caagtaccaa tgttccctcc aagaacgcct 1560
ttaaccccac agcagatgca agagcagcag caccagcagc ttcagcagca gcaagcccag 1620
gccctttctt tccccgccca ggtggtcatg agaccaggca ccgtcaacgg catgcagcag 1680
cctatgcaag cagccggcga cctccagcca gcagcagcac ctggagggag caagcaggac 1740
gccgcagtgg ctggggccag ctcggaacca tctggcacca agagccacaa gaacgcggga 1800
gcagaggagg tgggcgctga tgtagcagaa caatcctaa 1839
<210> 47
<211> 15957
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 47
aagctttgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60
agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120
tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180
tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240
gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300
ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360
gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt 420
agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480
taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540
aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600
cgagtctaac ggacaccaac cagcgaacca gcagcgtcgc gtcgggccaa gcgaagcaga 660
cggcacggca tctctgtcgc tgcctctgga cccctctcga gagttccgct ccaccgttgg 720
acttgctccg ctgtcggcat ccagaaattg cgtggcggag cggcagacgt gagccggcac 780
ggcaggcggc ctcctcctcc tctcacggca cggcagctac gggggattcc tttcccaccg 840
ctccttcgct ttcccttcct cgcccgccgt aataaataga caccccctcc acaccctctt 900
tccccaacct cgtgttgttc ggagcgcaca cacacacaac cagatctccc ccaaatccac 960
ccgtcggcac ctccgcttca aggtacgccg ctcgtcctcc cccccccccc ctctctacct 1020
tctctagatc ggcgttccgg tccatggtta gggcccggta gttctacttc tgttcatgtt 1080
tgtgttagat ccgtgtttgt gttagatccg tgctgctagc gttcgtacac ggatgcgacc 1140
tgtacgtcag acacgttctg attgctaact tgccagtgtt tctctttggg gaatcctggg 1200
atggctctag ccgttccgca gacgggatcg atttcatgat tttttttgtt tcgttgcata 1260
gggtttggtt tgcccttttc ctttatttca atatatgccg tgcacttgtt tgtcgggtca 1320
tcttttcatg cttttttttg tcttggttgt gatgatgtgg tctggttggg cggtcgttct 1380
agatcggagt agaattctgt ttcaaactac ctggtggatt tattaatttt ggatctgtat 1440
gtgtgtgcca tacatattca tagttacgaa ttgaagatga tggatggaaa tatcgatcta 1500
ggataggtat acatgttgat gcgggtttta ctgatgcata tacagagatg ctttttgttc 1560
gcttggttgt gatgatgtgg tgtggttggg cggtcgttca ttcgttctag atcggagtag 1620
aatactgttt caaactacct ggtgtattta ttaattttgg aactgtatgt gtgtgtcata 1680
catcttcata gttacgagtt taagatggat ggaaatatcg atctaggata ggtatacatg 1740
ttgatgtggg ttttactgat gcatatacat gatggcatat gcagcatcta ttcatatgct 1800
ctaaccttga gtacctatct attataataa acaagtatgt tttataatta ttttgatctt 1860
gatatacttg gatgatggca tatgcagcag ctatatgtgg atttttttag ccctgccttc 1920
atacgctatt tatttgcttg gtactgtttc ttttgtcgat gctcaccctg ttgtttggtg 1980
ttacttctgc agactagagt tatcaacaag tttgtacaaa aaagctgcca ccatggcgat 2040
gccgtatgcc tctctttccc cggcaggcga ccgccgctcc tccccggccg ccaccgccac 2100
cgcctccctc ctccccttct gccgctcctc ccccttctcc gccggcggca atggcggcat 2160
gggggaggag gcgcggatgg acgggaggtg gatggcgagg ccggtgccct tcacggcggc 2220
gcagtacgag gagctggagc accaggcgct catatacaag tacctggtgg ccggcgtgtc 2280
cgtcccgccg gatctcgtgc tccccatccg ccgcggcatc gagtccctcg ccgcccgctt 2340
ctaccacaac cccctcgcca tcgggtacgg atcgtacctg ggcaagaagg tggatccgga 2400
gccgggccgg tgccggcgca cggacggcaa gaagtggcgg tgcgccaagg aggccgcctc 2460
cgactccaag tattgcgagc gccacatgca ccgcggccgc aaccgttcaa gaaagcctgt 2520
ggaaacgcag ctcgtctcgc actcccagcc gccggccgcc tccgtcgtgc cgcccctcgc 2580
caccggcttc cacaaccact ccctctaccc cgccatcggc ggcaccaacg gtggtggagg 2640
cggggggaac aacggcatgt ccatgcccgg cacgttctcc tccgcgctgg ggccgcctca 2700
gcagcacatg ggcaacaatg ccgcctctcc ctacgcggct ctcggcggcg ccggaacatg 2760
caaagatttc aggtataccg catatggaat aagatctttg gcagacgagc agagtcagct 2820
catgacagaa gccatgaaca cctccgtgga gaacccatgg cgcctgccgc catcttctca 2880
aacgactaca ttcccgctct caagctactc tcctcagctt ggagcaacga gtgacctggg 2940
tcagaacaac agcagcaaca acaacagcgg cgtcaaggcc gagggacagc agcagcagca 3000
gccgctctcc ttcccggggt gcggcgactt cggcagcggc gactccgcga agcaggagaa 3060
ccagacgctg cggccgttct tcgacgagtg gccgaagacg agggactcgt ggtcggacct 3120
gaccgacgac aactcgaacg tcgcctcctt ctcggccacc cagctgtcga tctcgatacc 3180
catgacgtcc tccgacttct ccgccgccag ctcccagtcg cccaacggca tgctgttcgc 3240
cggcgaaatg tacgcggccg ctgccatgca gcagcaacac ctgatgcaga tgaaccagag 3300
catgatgggg ggctacgctt cctctaccac tgccaccact gatctcattc agcagtacct 3360
ggatgagaac aagcagctga tcctggccat cctcgacaac cagaacaacg gcaaggtgga 3420
ggagtgcgca cggaaccaag ctaagctcca gcagaacctc atgtacctcg ccgccatcgc 3480
cgacagccag cctccgcaga cggcatcgct gtctcagtac ccgtccaacc tgatgatgca 3540
gtccgggccg cggtacatgc agcagcagtc ggcgcagatg atgtcgccgc agtcgctgat 3600
ggcggcgcgg tcgtcgatga tgtacgcgca gcaggccatg tcgccgctcc agcagcagca 3660
gcagcagcag cagcaccagg cggccgcgca cggccagctg gggatgtcct ccggcgcgac 3720
caccgggttc aacctcctgc acggtgaggc cagcatgggc ggcggcggcg gcgccactgg 3780
caacagcatg atgaacgcca gcgtcttctc ggactatggc cgcggcggca gcggcgccaa 3840
ggaggggtcg acctcgctgt cggccgacgc tcgcggcgcc aactctggcg cgcacagcgg 3900
cgacggggag tacctcaagg gcaccgagga ggaaggaagc tagcgttcag ctttcttgta 3960
caaagtggtt gataacagcg ggttaattaa catcttttac ccatacgatg ttcctgacta 4020
tgcgggctat ccctatgacg tcccggacta tgcaggatcc tatccatatg acgttccaga 4080
ttacgctgct cagtgcagcg cttagagctc gaatttcccc gatcgttcaa acatttggca 4140
ataaagtttc ttaagattga atcctgttgc cggtcttgcg atgattatca tataatttct 4200
gttgaattac gttaagcatg taataattaa catgtaatgc atgacgttat ttatgagatg 4260
ggtttttatg attagagtcc cgcaattata catttaatac gcgatagaaa acaaaatata 4320
gcgcgcaaac taggataaat tatcgcgcgc ggtgtcatct atgttactag atcgggaatt 4380
agcactagta acggccgcca gtgtgctgga attcgccctt taaataagcg atcgctgcgg 4440
ccgatcttag ccagacgagc gggttcggcc cattcggacc gcaaggaatc ggtcaataca 4500
ctacatggcg tgatttcata tgcgcgattg ctgatcccca tgtgtatcac tggcaaactg 4560
tgatggacga caccgtcagt gcgtccgtcg cgcaggctct cgatgagctg atgctttggg 4620
ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga tttcggctcc aacaatgtcc 4680
tgacggacaa tggccgcata acagcggtca ttgactggag cgaggcgatg ttcggggatt 4740
cccaatacga ggtcgccaac atcttcttct ggaggccgtg gttggcttgt atggagcagc 4800
agacgcgcta cttcgagcgg aggcatccgg agcttgcagg atcgccgcgg ctccgggcgt 4860
atatgctccg cattggtctt gaccaactct atcagagctt ggttgacggc aatttcgatg 4920
atgcagcttg ggcgcagggt cgatgcgacg caatcgtccg atccggagcc gggactgtcg 4980
ggcgtacaca aatcgcccgc agaagcgcgg ccgtctggac cgatggctgt gtagaagtac 5040
tcgccgatag tggaaaccga cgccccagca ctcgtccgag ggcaaaggaa tagagtagat 5100
ttaaattgaa atcaccagtc tctctctaca aatctatctc tctctataat aatgtgtgag 5160
tagttcccag ataagggaat tagggttctt atagggtttc gctcatgtgt tgagcatata 5220
agaaaccctt agtatgtatt tgtatttgta aaatacttct atcaataaaa tttctaattc 5280
ctaaaaccaa aatccagtgg gcgcggggta ccgagctcga attcagtaca ttaaaaacgt 5340
ccgcaatgtg ttattaagtt gtctaagcgt caatttgttt acaccacaat atatcctgcc 5400
accagccagc caacagctcc ccgaccggca gctcggcaca aaatcaccac tcgatacagg 5460
cagcccatca gtccgggacg gcgtcagcgg gagagccgtt gtaaggcggc agactttgct 5520
catgttaccg atgctattcg gaagaacggc aactaagctg ccgggtttga aacacggatg 5580
atctcgcgga gggtagcatg ttgattgtaa cgatgacaga gcgttgctgc ctgtgatcaa 5640
atatcatctc cctcgcagag atccgaatta tcagccttct tattcatttc tcgcttaacc 5700
gtgacaggct gtcgatcttg agaactatgc cgacataata ggaaatcgct ggataaagcc 5760
gctgaggaag ctgagtggcg ctatttcttt agaagtgaac gttgacgatc gtcgaccgta 5820
ccccgatgaa ttaattcgga cgtacgttct gaacacagct ggatacttac ttgggcgatt 5880
gtcatacatg acatcaacaa tgtacccgtt tgtgtaaccg tctcttggag gttcgtatga 5940
cactaggtcg ctaccttagg accgttatag ttactagcga attgacatga ggttgccccg 6000
tattcagtgt cgctgatttg tattgtctga agttgttttt acgttaagtt gatgcagatc 6060
aattaatacg atacctgcgt cataattgat tatttgacgt ggtttgatgg cctccacgca 6120
cgttgtgata tgtagatgat aatcattatc actttacggg tcctttccgg tgatccgaca 6180
ggttacgggg cggcgacctc gcgggttttc gctatttatg aaaattttcc ggtttaaggc 6240
gtttccgttc ttcttcgtca taacttaatg tttttattta aaataccctc tgaaaagaaa 6300
ggaaacgaca ggtgctgaaa gcgagctttt tggcctctgt cgtttccttt ctctgttttt 6360
gtccgtggaa tgaacaatgg aaggatcttc tcggcggcga tcacgacgcc ggccctgcgg 6420
agccttcgcc gcgtgcgcga ttcatggcgg ccgtggaggc caaggatttc gcgcgagtgc 6480
aagagctgat cgaggcgcgt ggagccaagt cggcggctga ttatgtcctt gcgcagctcg 6540
ccgtggccga aggtctggac cgcaagcctg gtgcgcgcgt cgtggtcggg aaagcggcgg 6600
gcagcatggc aatgccgcct gcggcgctgg gttttacgcc aaggggagaa gcggcatacg 6660
ccatcgagcg gtcagcctat ggtgagccga ggtccagcat tgcgaagcag taccagcagg 6720
aatggaaccg gaaggcggcg acctggtggg cgatggccgg tgtggccggc atcatcggcg 6780
cgatcctggc ggcggcggca accggctttg ttgggctggc agtgtcgatc cgcaaccgag 6840
tgaagcgcgt gcgcgacctg ttggtgatgg agccgggtgc agagccataa gcggcaagag 6900
acgaaagccc ggtttccggg cttttgtttt gttacgccaa ggacgagttt tagcggctaa 6960
aggtgttgac gtgcgagaaa tgtttagcta aacttctctc atgtgctggc ggctgtcacc 7020
gctatgttca accaaggcgc ggagcaaatt atgggtgtta tccatgaaga aacggcttac 7080
cgaaagccag ttccaggagg cgatccaggg gctggaagtg gggcagcaga ccatcgagat 7140
agcgcggggc gtcttagtcg atgggaagcc acaggcgacg ttcgcaacgt cgctgggact 7200
gaccaggggc gcagtgtcgc aagcggtgca tcgcgtgtgg gccgcgttcg aggacaagaa 7260
cttgcccgag gggtacgcgc gggtaacggc ggttctgccg gaacatcagg cgtacatcgt 7320
ccggaagtgg gaagcggacg ccaagaaaaa acaggaaacc aaacgatgaa aactttggtc 7380
acggccaacc agaaaggcgg cgtcggcaag acttcgaccc ttgtgcatct tgccttcgac 7440
tttttcgagc gcggcttgcg ggttgccgtg atcgacctgg acccccaggg caatgcgtcc 7500
tacacgctca aggactttgc taccggcctg catgcaagca agctgttcgg cgctgtccct 7560
gccggcggct ggaccgaaac cgcacccgca gccggcgacg gccaggccgc gcgcctcgcc 7620
ctcatcgagt ccaacccggt actggcgaac gccgaacggc tgtcgctgga cgacgcccgc 7680
gagctgttcg gggcgaacat caaggccctg gcgaaccaag gcttcgacgt gtgcctgatc 7740
gacacggccc cgacccttgg cgtcggcctg gcggccgccc tcttcgcggc cgactatgtg 7800
ctgtccccca tcgagcttga ggcgtacagc atccagggca tcaagaagat ggtcacgacc 7860
attgcgaacg tgcgccagaa gaacgccaag ctgcaattcc ttggcatggt gcccagcaag 7920
gtcgatgcgc ggaatccgcg ccacgcgcgc caccaagccg agctgctggc cgcgtacccc 7980
aagatgatga ttccggccac cgttggcctg cgcagcagca tcgccgatgc cctcgcatcc 8040
ggtgtgccgg tctggaagat caagaaaacg gccgcgcgca aggcatcgaa agaggttcgc 8100
gccctggctg attacgtgtt cacgaagatg gagatttccc aatgactgcg gctcaagcca 8160
agaccaccaa gaaaaacacc gctgcggccg ctcaggaagc cgcaggcgcg gcgcagccgt 8220
ccggcctggg gttggatagc atcggcgacc tgtcgagcct cctggacgct cctgcggcgt 8280
ctcagggcgg ttccggccct atcgagctgg acctggacct gatcgacgaa gatccgcatc 8340
agccgcggac ggccgacaac cccggctttt ccccggagag catcgcggaa atcggtgcca 8400
cgatcaaaga gcgcggggtg aagtcaccca tttcggtgcg cgagaaccag gagcagccgg 8460
gccgctatat catcaatcac ggcgcccgcc gctaccgtgg ctcgaagtgg gccggcaaga 8520
agtccatccc ggcgttcatc gacaacgact acaacgaagc cgaccaggtt atcgagaacc 8580
tgcaacgcaa cgagctgacc ccgcgcgaaa ttgccgactt cattggccgc gagctggcga 8640
agggcaagaa gaaaggcgat atcgccaagg aaatcggcaa gtcgccggcg ttcatcaccc 8700
agcacgtcac gctgctggac ctgccggaga agatcgccga tgcgttcaac accggccgcg 8760
tgcgcgacgt gaccgtggtg aacgagctgg tgacggcctt caagaagcgc ccggaggaag 8820
tcgaggcgtg gcttgacgac gacacccagg aaatcacgcg cggcacggtc aagctgctgc 8880
gcgagttcct ggacgagaag ggccgcgatc ccaacaccgt cgatgccttc aacggccaga 8940
ctgatgccga gcgtgacgcg gaggccggcg acggccagga cggcgaggac ggcgaccagg 9000
acggtaagga cgccaaggaa aagggcgcga aggagccgga cccggacaag ctgaaaaagg 9060
ccatcgtcca ggtcgagcac gacgagcgcc ctgcccgcct tatcctcaac cgtcggccgc 9120
cggcggaagg ctatgcctgg ttgaagtacg aggacgacgg ccaggagttc gaggcgaacc 9180
ttgccgacgt gaaactggtc gcgctcatcg agggctgatc cccaaagaca gcggcgcggg 9240
ccacccgcgc cgcacagaca acggttccgc tacaaggagg accgaagaat gaatccgatg 9300
ctgttctaca tcgcgggagg cgtaggcgcg gcgttgctgc tggtttccgc gatcatgctg 9360
ttcaagctgc gcgagccgaa gaaggaacac cgaccgcagc gcaaggcggc ggccccgacg 9420
ccgcagccgg tcgataacga gctgctgcgc actctagtga tattccacaa aacagcaggg 9480
aagcagcgct tttccgctgc ataaccctgc ttcggggtca ttatagcgat tttttcggta 9540
tatccatcct ttttcgcacg atatacagga ttttgccaaa gggttcgtgt agactttcct 9600
tggtgtatcc aacggcgtca gccgggcagg ataggtgaag taggcccacc cgcgagcggg 9660
tgttccttct tcactgtccc ttattcgcac ctggcggtgc tcaacgggaa tcctgctctg 9720
cgaggctggc cggctaccgc cggcgtaaca gatgagggca agcggatggc tgatgaaacc 9780
aagccaacca ggaagggcag cccacctatc aaggtgtact gccttccaga cgaacgaaga 9840
gcgattgagg aaaaggcggc ggcggccggc atgagcctgt cggcctacct gctggccgtc 9900
ggccagggct acaaaatcac gggcgtcgtg gactatgagc acgtccgcga gctggcccgc 9960
atcaatggcg acctgggccg cctgggcggc ctgctgaaac tctggctcac cgacgacccg 10020
cgcacggcgc ggttcggtga tgccacgatc ctcgccctgc tggcgaagat cgaagagaag 10080
caggacgagc ttggcaaggt catgatgggc gtggtccgcc cgagggcaga gccatgactt 10140
ttttagccgc taaaacggcc ggggggtgcg cgtgattgcc aagcacgtcc ccatgcgctc 10200
catcaagaag agcgacttcg cggagctggt gaagtacatc accgacgagc aaggcaagac 10260
cgagcgccag atccaaaaca actgtcaaag cgcacccgcc cgatgccatt cgcggcacgg 10320
cttccgttga ggatgtcgat atgatgcgcg agccgacggc ccgcagagaa ggggccgttt 10380
tagcggctaa agaaggaagt gcaagcccta acccttggcg tcagagcctt ccacgcagct 10440
tttttcgggt gtcgtcgccc catttcttta cgataaacgc cttatgtgac ggcaaaacca 10500
cactgatgcg ttcgtatccg ggcggcacgc tgctcttgaa aggatgaccc gcaatctccg 10560
cgagtgcctc gcggtcaagg tcggtggact ccaggagaag aggtagggga gtttccaggg 10620
cgtcggcaat ggcctccatc accttcaacg aggggttggc cttaccgttg gttaagtctg 10680
ataaaaacga aattgaaacc cctgccctct ccgacagctc atgtttcgtc atgccccgct 10740
catcgagcag acgaaggatg ttggtgaaaa atatctggtt gtacacagcg gaagccgccc 10800
ctcgcacctt tggtcgcggc ccgcaaaatt ttagccgcta aagttcttga cagcggaacc 10860
aatgtttagc taaactagag tctcctttct caaggagact ttcgatatga gccataatca 10920
gttccagttt atcggtaatc ttacccgtga caccgaggta cgtcatggca attctaacaa 10980
gccgcaagca attttcgata tagcggttaa tgaagagtgg cgcaacgatg ccggcgacaa 11040
gcaggagcgc accgacttct tccgcatcaa gtgttttggc tctcaggccg aggcccacgg 11100
caagtatttg ggcaaggggt cgctggtatt cgtgcagggc aagattcgga ataccaagta 11160
cgagaaggac ggccagacgg tctacgggac cgacttcatt gccgataagg tggattatct 11220
ggacaccaag gcaccaggcg ggtcaaatca ggaataaggg cacattgccc cggcgtgagt 11280
cggggcaatc ccgcaaggag ggtgaatgaa tcggacgttt gaccggaagg catacaggca 11340
agaactgatc gacgcggggt tttccgccga ggatgccgaa accatcgcaa gccgcaccgt 11400
catgcgtgcg ccccgcgaaa ccttccagtc cgtcggctcg atggtccagc aagctacggc 11460
caagatcgag cgcgacagcg tgcaactggc tccccctgcc ctgcccgcgc catcggccgc 11520
cgtggagcgt tcgcgtcgtc tcgaacagga ggcggcaggt ttggcgaagt cgatgaccat 11580
cgacacgcga ggaactatga cgaccaagaa gcgaaaaacc gccggcgagg acctggcaaa 11640
acaggtcagc gaggccaagc aggccgcgtt gctgaaacac acgaagcagc agatcaagga 11700
aatgcagctt tccttgttcg atattgcgcc gtggccggac acgatgcgag cgatgccaaa 11760
cgacacggcc cgctctgccc tgttcaccac gcgcaacaag aaaatcccgc gcgaggcgct 11820
gcaaaacaag gtcattttcc acgtcaacaa ggacgtgaag atcacctaca ccggcgtcga 11880
gctgcgggcc gacgatgacg aactggtgtg gcagcaggtg ttggagtacg cgaagcgcac 11940
ccctatcggc gagccgatca ccttcacgtt ctacgagctt tgccaggacc tgggctggtc 12000
gatcaatggc cggtattaca cgaaggccga ggaatgcctg tcgcgcctac aggcgacggc 12060
gatgggcttc acgtccgacc gcgttgggca cctggaatcg gtgtcgctgc tgcaccgctt 12120
ccgcgtcctg gaccgtggca agaaaacgtc ccgttgccag gtcctgatcg acgaggaaat 12180
cgtcgtgctg tttgctggcg accactacac gaaattcata tgggagaagt accgcaagct 12240
gtcgccgacg gcccgacgga tgttcgacta tttcagctcg caccgggagc cgtacccgct 12300
caagctggaa accttccgcc tcatgtgcgg atcggattcc acccgcgtga agaagtggcg 12360
cgagcaggtc ggcgaagcct gcgaagagtt gcgaggcagc ggcctggtgg aacacgcctg 12420
ggtcaatgat gacctggtgc attgcaaacg ctagggcctt gtggggtcag ttccggctgg 12480
gggttcagca gccagcgctt tactggcatt tcaggaacaa gcgggcactg ctcgacgcac 12540
ttgcttcgct cagtatcgct cgggacgcac ggcgcgctct acgaactgcc gataaacaga 12600
ggattaaaat tgacaattgt gattaaggct cagattcgac ggcttggagc ggccgacgtg 12660
caggatttcc gcgagatccg attgtcggcc ctgaagaaag ctccagagat gttcgggtcc 12720
gtttacgagc acgaggagaa aaagcccatg gaggcgttcg ctgaacggtt gcgagatgcc 12780
gtggcattcg gcgcctacat cgacggcgag atcattgggc tgtcggtctt caaacaggag 12840
gacggcccca aggacgctca caaggcgcat ctgtccggcg ttttcgtgga gcccgaacag 12900
cgaggccgag gggtcgccgg tatgctgctg cgggcgttgc cggcgggttt attgctcgtg 12960
atgatcgtcc gacagattcc aacgggaatc tggtggatgc gcatcttcat cctcggcgca 13020
cttaatattt cgctattctg gagcttgttg tttatttcgg tctaccgcct gccgggcggg 13080
gtcgcggcga cggtaggcgc tgtgcagccg ctgatggtcg tgttcatctc tgccgctctg 13140
ctaggtagcc cgatacgatt gatggcggtc ctgggggcta tttgcggaac tgcgggcgtg 13200
gcgctgttgg tgttgacacc aaacgcagcg ctagatcctg tcggcgtcgc agcgggcctg 13260
gcgggggcgg tttccatggc gttcggaacc gtgctgaccc gcaagtggca acctcccgtg 13320
cctctgctca cctttaccgc ctggcaactg gcggccggag gacttctgct cgttccagta 13380
gctttagtgt ttgatccgcc aatcccgatg cctacaggaa ccaatgttct cggctgctcg 13440
actgcacgaa taccagcgac cccttgccca aatacttgcc gtgggcctcg gcctgagagc 13500
caaaacactt gatgcggaag aagtcggtgc gctcctgctt gtcgccggca tcgttgcgcc 13560
acatctaggt actaaaacaa ttcatccagt aaaatataat attttatttt ctcccaatca 13620
ggcttgatcc ccagtaagtc aaaaaatagc tcgacatact gttcttcccc gatatcctcc 13680
ctgatcgacc ggacgcagaa ggcaatgtca taccacttgt ccgccctgcc gcttctccca 13740
agatcaataa agccacttac tttgccatct ttcacaaaga tgttgctgtc tcccaggtcg 13800
ccgtgggaaa agacaagttc ctcttcgggc ttttccgtct ttaaaaaatc atacagctcg 13860
cgcggatctt taaatggagt gtcttcttcc cagttttcgc aatccacatc ggccagatcg 13920
ttattcagta agtaatccaa ttcggctaag cggctgtcta agctattcgt atagggacaa 13980
tccgatatgt cgatggagtg aaagagcctg atgcactccg catacagctc gataatcttt 14040
tcagggcttt gttcatcttc atactcttcc gagcaaagga cgccatcggc ctcactcatg 14100
agcagattgc tccagccatc atgccgttca aagtgcagga cctttggaac aggcagcttt 14160
ccttccagcc atagcatcat gtccttttcc cgttccacat cataggtggt ccctttatac 14220
cggctgtccg tcatttttaa atataggttt tcattttctc ccaccagctt atatacctta 14280
gcaggagaca ttccttccgt atcttttacg cagcggtatt tttcgatcag ttttttcaat 14340
tccggtgata ttctcatttt agccatttat tatttccttc ctcttttcta cagtatttaa 14400
agatacccca agaagctaat tataacaaga cgaactccaa ttcactgttc cttgcattct 14460
aaaaccttaa ataccagaaa acagcttttt caaagttgtt ttcaaagttg gcgtataaca 14520
tagtatcgat tcgatagcgt ggactcaagg ctctcgcgaa tggctcgcgt tggaaacttt 14580
cattgacact tgaggggcac cgcagggaaa ttctcgtcct tgcgagaacc ggctatgtcg 14640
tgctgcgcat cgagcctgcg cccttggctt gtctcgcccc tctccgcgtc gctacggggc 14700
ttccagcgcc tttccgacgc tcaccgggct ggttgccctc gccgctgggc tggcggccgt 14760
ctatggccct gcaaacgcgc cagaaacgcc gtcgaagccg tgtgcgagac accgcggccg 14820
ccggcgttgt ggatacctcg cggaaaactt ggccctcact gacagatgag gggcggacgt 14880
tgacacttga ggggccgact cacccggcgc ggcgttgaca gatgaggggc aggctcgatt 14940
tcggccggcg acgtggagct ggccagcctc gcaaatcggc gaaaacgcct gattttacgc 15000
gagtttccca cagatgatgt ggacaagcct ggggataagt gccctgcggt attgacactt 15060
gaggggcgcg actactgaca gatgaggggc gcgatccttg acacttgagg ggcagagtgc 15120
tgacagatga ggggcgcacc tattgacatt tgaggggctg tccacaggca gaaaatccag 15180
catttgcaag ggtttccgcc cgtttttcgg ccaccgctaa cctgtctttt aacctgcttt 15240
taaaccaata tttataaacc ttgtttttaa ccagggctgc gccctgtgcg cgtgaccgcg 15300
cacgccgaag gggggtgccc ccccttctcg aaccctcccg gcccgctaac gcgggcctcc 15360
catcccccca ggggctgcgc ccctcggccg cgaacggcct caccccaaaa atggcagcgc 15420
cagccaggac gtcggccgaa agagcgacaa gcagatcacg cttttcgaca gcgtcggatt 15480
tgcgatcgag gatttttcgg cgctgcgcta cgtccgcgac cgcgttgagg gatcaagcca 15540
cagcagccca ctcgaccttc tagccgaccc agacgagcca agggatcttt ttggaatgct 15600
gctccgtcgt caggctttcc gacgtttggg tggttgaaca gaagtcatta tcgcacggaa 15660
tgccaagcac tcccgagggg aaccctgtgg ttggcatgca catacaaatg gacgaacgga 15720
taaacctttt cacgcccttt taaatatccg attattctaa taaacgctct tttctcttag 15780
gtttacccgc caatatatcc tgtcaaacac tgatagttta aactgaaggc gggaaacgac 15840
aatctgatca tgagcggaga attaagggag tcacgttatg acccccgccg atgacgcggg 15900
acaagccgtt ttacgtttgg aactgacaga accgcaacgt tgaaggagcc actcagc 15957
<210> 48
<211> 1671
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 48
atgaatagcg gcaacggcgg tggcggtgct ggtggggctg gtgggttggg gatgatggga 60
atgagggcat caccttttac ggtatcacaa tggcaagaac tggaacatca agctttgatc 120
tttaagtaca tggtggcagg tttgcctgtg ccacctgatc ttgtactccc tatacaaaag 180
agctttgagt ctatttctca taggttcttt catcacccca ccatgggtta ttgttccttc 240
tatgggaaaa aagttgatcc cgaaccagga cggtgcagga ggactgacgg caaaaaatgg 300
cggtgttcca aagatgcata cccagactca aagtactgtg agcgccacat gcaccgcggc 360
cgcaaccgtt caagaaagcc tgtggaatca caaaccataa cacagtcatc atccactgtg 420
acatcactga ctgccactgg aagcagtggt gggactggaa gctaccagaa ccttccatta 480
catgcctttg ctaatcccca aggcactact tctggaacca accaatccca ttatcatgtg 540
gactccattt cctgtggaat tcccaacaaa gaatacaggt accttcaagt tcaaggactt 600
aaacccgagg ttggagagca tagtttcttc actgaagctt caggaagcaa taggggtctc 660
cagatggact caccaattga cagtgcatgg cctctgatgc aatctagagt ctcctcgcat 720
ccccagtcga aatccagcaa cacctccatt ttgcagaatg attatcctca gcattcactt 780
ttcagtagtg aattcacctc gggtgagcct gtaaaacagg aaggtcagtc tctccgacct 840
ttctttgatg agtggcctaa gacaagagac tcatggtctg gtcttgaaga caagagatcc 900
caccagacct cattctcaac aaccaagctc tctatatcca tccctatggc ctcatccgac 960
ttctcggcaa cgagttctcg atcgcctcat gataacgcgg ccgctgccat gcaacagcac 1020
ctgatgcaga tgcagcccat gatggcagct tattatccca acaacgtcac tactgaccac 1080
attcaacagt atctagatga gaacaaatca ttgattttga agattgttga gagccagaat 1140
tcagggaaac tgagcgagtg tgcagagaac caggcaagat tgcagcggaa tctcatgtac 1200
ctggctgcta ttgctgatgc tcaaccccaa ccacctagcg ttcatgccca gttctcttct 1260
ggtggcatta tgcagccagg agctcactat atgcaacacc agcaatctca gccaatgaca 1320
ccacagtcac ttatggctgc acgctcatcc atggtgtact ctcaacagca attttcagtg 1380
cttcagcaac agcaagcctt gcatggtcag cttggcatga gctctggtgg tagctcagga 1440
cttcacatgc tgcaaagtga gggtagtact gcaggaggta gtggttcact tgggggtggg 1500
ggattccctg attttggccg tggctcatct ggtgaaggct tgcactcaag gggaatgggg 1560
agcaagcatg atataggcag ttctggatct gctgaaggac gaggagggag ctcaggaagc 1620
caagatggag gcgaaactct ctacttgaaa ggggctgatg atggaaatta a 1671
<210> 49
<211> 17363
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 49
tgagcgtcgc aaaggcgctc ggtcttgcct tgctcgtcgg tgatgtactt caccagctcc 60
gcgaagtcgc tcttcttgat ggagcgcatg gggacgtgct tggcaatcac gcgcaccccc 120
cggccgtttt agcggctaaa aaagtcatgg ctctgccctc gggcggacca cgcccatcat 180
gaccttgcca agctcgtcct gcttctcttc gatcttcgcc agcagggcga ggatcgtggc 240
atcaccgaac cgcgccgtgc gcgggtcgtc ggtgagccag agtttcagca ggccgcccag 300
gcggcccagg tcgccattga tgcgggccag ctcgcggacg tgctcatagt ccacgacgcc 360
cgtgattttg tagccctggc cgacggccag caggtaggcc gacaggctca tgccggccgc 420
cgccgccttt tcctcaatcg ctcttcgttc gtctggaagg cagtacacct tgataggtgg 480
gctgcccttc ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc 540
ggcggtagcc ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata 600
agggacagtg aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg 660
ctgacgccgt tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata 720
tcgtgcgaaa aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta 780
tgcagcggaa aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 840
gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 900
atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 960
gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1020
gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1080
ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1140
cagtgagcga ggaagcggaa gagcgccaga aggccgccag agaggccgag cgcggccgtg 1200
aggcttggac gctagggcag ggcatgaaaa agcccgtagc gggctgctac gggcgtctga 1260
cgcggtggaa agggggaggg gatgttgtct acatggctct gctgtagtga gtgggttgcg 1320
ctccggcagc ggtcctgatc aatcgtcacc ctttctcggt ccttcaacgt tcctgacaac 1380
gagcctcctt ttcgccaatc catcgacaat caccgcgagt ccctgctcga acgctgcgtc 1440
cggaccggct tcgtcgaagg cgtctatcgc ggcccgcaac agcggcgaga gcggagcctg 1500
ttcaacggtg ccgccgcgct cgccggcatc gctgtcgccg gcctgctcct caagcacggc 1560
cccaacagtg aagtagctga ttgtcatcag cgcattgacg gcgtccccgg ccgaaaaacc 1620
cgcctcgcag aggaagcgaa gctgcgcgtc ggccgtttcc atctgcggtg cgcccggtcg 1680
cgtgccggca tggatgcgcg cgccatcgcg gtaggcgagc agcgcctgcc tgaagctgcg 1740
ggcattcccg atcagaaatg agcgccagtc gtcgtcggct ctcggcaccg aatgcgtatg 1800
attctccgcc agcatggctt cggccagtgc gtcgagcagc gcccgcttgt tcctgaagtg 1860
ccagtaaagc gccggctgct gaacccccaa ccgttccgcc agtttgcgtg tcgtcagacc 1920
gtctacgccg acctcgttca acaggtccag ggcggcacgg atcactgtat tcggctgcaa 1980
ctttgtcatg cttgacactt tatcactgat aaacataata tgtccaccaa cttatcagtg 2040
ataaagaatc cgcgcgttca atcggaccag cggaggctgg tccggaggcc agacgtgaaa 2100
cccaacatac ccctgatcgt aattctgagc actgtcgcgc tcgacgctgt cggcatcggc 2160
ctgattatgc cggtgctgcc gggcctcctg cgcgatctgg ttcactcgaa cgacgtcacc 2220
gcccactatg gcattctgct ggcgctgtat gcgttggtgc aatttgcctg cgcacctgtg 2280
ctgggcgcgc tgtcggatcg tttcgggcgg cggccaatct tgctcgtctc gctggccggc 2340
gccagatctg gggaaccctg tggttggcat gcacatacaa atggacgaac ggataaacct 2400
tttcacgccc ttttaaatat ccgattattc taataaacgc tcttttctct taggtttacc 2460
cgccaatata tcctgtcaaa cactgatagt ttaaactgaa ggcgggaaac gacaatctga 2520
tcatgagcgg agaattaagg gagtcacgtt atgacccccg ccgatgacgc gggacaagcc 2580
gttttacgtt tggaactgac agaaccgcaa cgttgaagga gccactcagc cgcgggtttc 2640
tggagtttaa tgagctaagc acatacgtca gaaaccatta ttgcgcgttc aaaagtcgcc 2700
taaggtcact atcagctagc aaatatttct tgtcaaaaat gctccactga cgttccataa 2760
attcccctcg gtatccaatt agagtctcat attcactctc aatccaaata atctgcaccg 2820
gatctggatc gtttcgcatg attgaacaag atggattgca cgcaggttct ccggccgctt 2880
gggtggagag gctattcggc tatgactggg cacaacagac aatcggctgc tctgatgccg 2940
ccgtgttccg gctgtcagcg caggggcgcc cggttctttt tgtcaagacc gacctgtccg 3000
gtgccctgaa tgaactgcag gacgaggcag cgcggctatc gtggctggcc acgacgggcg 3060
ttccttgcgc agctgtgctc gacgttgtca ctgaagcggg aagggactgg ctgctattgg 3120
gcgaagtgcc ggggcaggat ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca 3180
tcatggctga tgcaatgcgg cggctgcata cgcttgatcc ggctacctgc ccattcgacc 3240
accaagcgaa acatcgcatc gagcgagcac gtactcggat ggaagccggt cttgtcgatc 3300
aggatgatct ggacgaagag catcaggggc tcgcgccagc cgaactgttc gccaggctca 3360
aggcgcgcat gcccgacggc gatgatctcg tcgtgaccca tggcgatgcc tgcttgccga 3420
atatcatggt ggaaaatggc cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg 3480
cggaccgcta tcaggacata gcgttggcta cccgtgatat tgctgaagag cttggcggcg 3540
aatgggctga ccgcttcctc gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg 3600
ccttctatcg ccttcttgac gagttcttct gagcgggact ctggggttcg aaatgaccga 3660
ccaagcgacg cccaacctgc catcacgaga tttcgattcc accgccgcct tctatgaaag 3720
gttgggcttc ggaatcgttt tccgggacgc cggctggatg atcctccagc gcggggatct 3780
catgctggag ttcttcgccc acgggatctc tgcggaacag gcggtcgaag gtgccgatat 3840
cattacgaca gcaacggccg acaagcacaa cgccacgatc ctgagcgaca atatgatcgg 3900
gcccggcgtc cacatcaacg gcgtcggcgg cgactgccca ggcaagaccg agatgcaccg 3960
cgatatcttg ctgcgttcgg atattttcgt ggagttcccg ccacagaccc ggatgatccc 4020
cgatcgttca aacatttggc aataaagttt cttaagattg aatcctgttg ccggtcttgc 4080
gatgattatc atataatttc tgttgaatta cgttaagcat gtaataatta acatgtaatg 4140
catgacgtta tttatgagat gggtttttat gattagagtc ccgcaattat acatttaata 4200
cgcgatagaa aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc 4260
tatgttacta gatcgggcct cctgtcaatg ctggcggcgg ctctggtggt ggttctggtg 4320
gcggctctga gggtggtggc tctgagggtg gcggttctga gggtggcggc tctgagggag 4380
gcggttccgg tggtggctct ggttccggtg attttgatta tgaaaagatg gcaaacgcta 4440
ataagggggc tatgaccgaa aatgccgatg aaaacgcgct acagtctgac gctaaaggca 4500
aacttgattc tgtcgctact gattacggtg ctgctatcga tggtttcatt ggtgacgttt 4560
ccggccttgc taatggtaat ggtgctactg gtgattttgc tggctctaat tcccaaatgg 4620
ctcaagtcgg tgacggtgat aattcacctt taatgaataa tttccgtcaa tatttacctt 4680
ccctccctca atcggttgaa tgtcgccctt ttgtctttgg cccaatacgc aaaccgcctc 4740
tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag 4800
cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt 4860
tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca 4920
caggaaacag ctatgaccat gattacgcca agcttgcatg cctgcaggtc cccagattag 4980
ccttttcaat ttcagaaaga atgctaaccc acagatggtt agagaggctt acgcagcagg 5040
tctcatcaag acgatctacc cgagcaataa tctccaggaa atcaaatacc ttcccaagaa 5100
ggttaaagat gcagtcaaaa gattcaggac taactgcatc aagaacacag agaaagatat 5160
atttctcaag atcagaagta ctattccagt atggacgatt caaggcttgc ttcacaaacc 5220
aaggcaagta atagagattg gagtctctaa aaaggtagtt cccactgaat caaaggccat 5280
ggagtcaaag attcaaatag aggacctaac agaactcgcc gtaaagactg gcgaacagtt 5340
catacagagt ctcttacgac tcaatgacaa gaagaaaatc ttcgtcaaca tggtggagca 5400
cgacacactt gtctactcca aaaatatcaa agatacagtc tcagaagacc aaagggcaat 5460
tgagactttt caacaaaggg taatatccgg aaacctcctc ggattccatt gcccagctat 5520
ctgtcacttt attgtgaaga tagtggaaaa ggaaggtggc tcctacaaat gccatcattg 5580
cgataaagga aaggccatcg ttgaagatgc ctctgccgac agtggtccca aagatggacc 5640
cccacccacg aggagcatcg tggaaaaaga agacgttcca accacgtctt caaagcaagt 5700
ggattgatgt gatatctcca ctgacgtaag ggatgacgca caatcccact atccttcgca 5760
agacccttcc tctatataag gaagttcatt tcatttggag agaacacggg ggactctaga 5820
gttatcaaca agtttgtaca aaaaagctgc caccatgaat agcggcaacg gcggtggcgg 5880
tgctggtggg gctggtgggt tggggatgat gggaatgagg gcatcacctt ttacggtatc 5940
acaatggcaa gaactggaac atcaagcttt gatctttaag tacatggtgg caggtttgcc 6000
tgtgccacct gatcttgtac tccctataca aaagagcttt gagtctattt ctcataggtt 6060
ctttcatcac cccaccatgg gttattgttc cttctatggg aaaaaagttg atcccgaacc 6120
aggacggtgc aggaggactg acggcaaaaa atggcggtgt tccaaagatg catacccaga 6180
ctcaaagtac tgtgagcgcc acatgcaccg cggccgcaac cgttcaagaa agcctgtgga 6240
atcacaaacc ataacacagt catcatccac tgtgacatca ctgactgcca ctggaagcag 6300
tggtgggact ggaagctacc agaaccttcc attacatgcc tttgctaatc cccaaggcac 6360
tacttctgga accaaccaat cccattatca tgtggactcc atttcctgtg gaattcccaa 6420
caaagaatac aggtaccttc aagttcaagg acttaaaccc gaggttggag agcatagttt 6480
cttcactgaa gcttcaggaa gcaatagggg tctccagatg gactcaccaa ttgacagtgc 6540
atggcctctg atgcaatcta gagtctcctc gcatccccag tcgaaatcca gcaacacctc 6600
cattttgcag aatgattatc ctcagcattc acttttcagt agtgaattca cctcgggtga 6660
gcctgtaaaa caggaaggtc agtctctccg acctttcttt gatgagtggc ctaagacaag 6720
agactcatgg tctggtcttg aagacaagag atcccaccag acctcattct caacaaccaa 6780
gctctctata tccatcccta tggcctcatc cgacttctcg gcaacgagtt ctcgatcgcc 6840
tcatgataac gcggccgctg ccatgcaaca gcacctgatg cagatgcagc ccatgatggc 6900
agcttattat cccaacaacg tcactactga ccacattcaa cagtatctag atgagaacaa 6960
atcattgatt ttgaagattg ttgagagcca gaattcaggg aaactgagcg agtgtgcaga 7020
gaaccaggca agattgcagc ggaatctcat gtacctggct gctattgctg atgctcaacc 7080
ccaaccacct agcgttcatg cccagttctc ttctggtggc attatgcagc caggagctca 7140
ctatatgcaa caccagcaat ctcagccaat gacaccacag tcacttatgg ctgcacgctc 7200
atccatggtg tactctcaac agcaattttc agtgcttcag caacagcaag ccttgcatgg 7260
tcagcttggc atgagctctg gtggtagctc aggacttcac atgctgcaaa gtgagggtag 7320
tactgcagga ggtagtggtt cacttggggg tgggggattc cctgattttg gccgtggctc 7380
atctggtgaa ggcttgcact caaggggaat ggggagcaag catgatatag gcagttctgg 7440
atctgctgaa ggacgaggag ggagctcagg aagccaagat ggaggcgaaa ctctctactt 7500
gaaaggggct gatgatggaa attaacgttc agctttcttg tacaaagtgg ttgataacag 7560
cgggttaatt aacatctttt acccatacga tgttcctgac tatgcgggct atccctatga 7620
cgtcccggac tatgcaggat cctatccata tgacgttcca gattacgctg ctcagtgcag 7680
cgcttagagc tcgaatttcc ccgatcgttc aaacatttgg caataaagtt tcttaagatt 7740
gaatcctgtt gccggtcttg cgatgattat catataattt ctgttgaatt acgttaagca 7800
tgtaataatt aacatgtaat gcatgacgtt atttatgaga tgggttttta tgattagagt 7860
cccgcaatta tacatttaat acgcgataga aaacaaaata tagcgcgcaa actaggataa 7920
attatcgcgc gcggtgtcat ctatgttact agatcgggaa ttagcttcat caacgcaaga 7980
catgcgcacg accgtctgac aggagaggaa tttccgacga gcacagaaag gacttgctct 8040
tggacgtagg cctatttctc aggcacatgt atcaagtgtt cggacgtggg ttttcgatgg 8100
tgtatcagcc gccgccaact gggagatgag gaggctttct tggggggcag tcagcagttc 8160
atttcacaag acagaggaac ttgtaaggag atgcactgat ttatcttggc gcaaaccagc 8220
aggacgaatt agtgggaata gcccgcgaat atctaagtta tgcctgtcgg catgagcaga 8280
aacttccaat tcgaaacagt ttggagaggt tgtttttggg catacctttt gttagtcagc 8340
ctctcgattg ctcatcgtca ttacacagta ccgaagtttg atcgatctag taacatagat 8400
gacaccgcgc gcgataattt atcctagttt gcgcgctata ttttgttttc tatcgcgtat 8460
taaatgtata attgcgggac tctaatcata aaaacccatc tcataaataa cgtcatgcat 8520
tacatgttaa ttattacatg cttaacgtaa ttcaacagaa attatatgat aatcatcgca 8580
agaccggcaa caggattcaa tcttaagaaa ctttattgcc aaatgtttga acgatctgct 8640
tcgacgcact ccttctttac tccaccatct cgtccttatt gaaaacgtgg gtagcaccaa 8700
aacgaatcaa gtcgctggaa ctgaagttac caatcacgct ggatgatttg ccagttggat 8760
taatcttgcc tttccccgca tgaataatat tgatgaatgc atgcgtgagg ggtatttcga 8820
ttttggcaat agctgcaatt gccgcgacat cctccaacga gcataattct tcagaaaaat 8880
agcgatgttc catgttgtca gggcatgcat gatgcacgtt atgaggtgac ggtgctaggc 8940
agtattccct caaagtttca tagtcagtat catattcatc attgcattcc tgcaagagag 9000
aattgagacg caatccacac gctgcggcaa ccttccggcg ttcgtggtct atttgctctt 9060
ggacgttgca aacgtaagtg ttggatcccg gtcggcatct actctattcc tttgccctcg 9120
gacgagtgct ggggcgtcgg tttccactat cggcgagtac ttctacacag ccatcggtcc 9180
agacggccgc gcttctgcgg gcgatttgtg tacgcccgac agtcccggct ccggatcgga 9240
cgattgcgtc gcatcgaccc tgcgcccaag ctgcatcatc gaaattgccg tcaaccaagc 9300
tctgatagag ttggtcaaga ccaatgcgga gcatatacgc ccggagccgc ggcgatcctg 9360
caagctccgg atgcctccgc tcgaagtagc gcgtctgctg ctccatacaa gccaaccacg 9420
gcctccagaa gaagatgttg gcgacctcgt attgggaatc cccgaacatc gcctcgctcc 9480
agtcaatgac cgctgttatg cggccattgt ccgtcaggac attgttggag ccgaaatccg 9540
cgtgcacgag gtgccggact tcggggcagt cctcggccca aagcatcagc tcatcgagag 9600
cctgcgcgac ggacgcactg acggtgtcgt ccatcacagt ttgccagtga tacacatggg 9660
gatcagcaat cgcgcatatg aaatcacgcc atgtagtgta ttgaccgatt ccttgcggtc 9720
cgaatgggcc gaacccgctc gtctggctaa gatcggccgc agcgatcgca tccatggcct 9780
ccgcgaccgg ctgcagaaca gcgggcagtt cggtttcagg caggtcttgc aacgtgacac 9840
cctgtgcacg gcgggagatg caataggtca ggctctcgct gaattcccca atgtcaagca 9900
cttccggaat cgggagcgcg gccgatgcaa agtgccgata aacataacga tctttgtaga 9960
aaccatcggc gcagctattt acccgcagga catatccacg ccctcctaca tcgaagctga 10020
aagcacgaga ttcttcgccc tccgagagct gcatcaggtc ggagacgctg tcgaactttt 10080
cgatcagaaa cttctcgaca gacgtcgcgg tgagttcagg ctttttcata tcggggtcgt 10140
cctctccaaa tgaaatgaac ttccttatat agaggaaggg tcttgcgaag gatagtggga 10200
ttgtgcgtca tcccttacgt cagtggagat atcacatcaa tccacttgct ttgaagacgt 10260
ggttggaacg tcttcttttt ccacgatgct cctcgtgggt gggggtccat ctttgggacc 10320
actgtcggca gaggcatctt gaacgatagc ctttccttta tcgcaatgat ggcatttgta 10380
ggtgccacct tccttttcta ctgtcctttt gatgaagtga cagatagctg ggcaatggaa 10440
tccgaggagg tttcccgata ttaccctttg ttgaaaagtc tcaatagccc tttggtcttc 10500
tgagactgta tctttgatat tcttggagta gacgagagtg tcgtgctcca ccatgttgac 10560
ggatctctag gacgcgtcct agaagctaat tcactggccg tcgttttaca acgtcgtgac 10620
tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc 10680
tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat 10740
ggcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg 10800
tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga 10860
ccccaaaaaa cttgatttgg gtgatggttc acgtagtggg ccatcgccct gatagacggt 10920
ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg 10980
aacaacactc aaccctatct cgggctattc ttttgattta taagggattt tgccgatttc 11040
ggaaccacca tcaaacagga ttttcgcctg ctggggcaaa ccagcgtgga ccgcttgctg 11100
caactctctc agggccaggc ggtgaagggc aatcagctgt tgcccgtctc actggtgaaa 11160
agaaaaacca ccccagtaca ttaaaaacgt ccgcaatgtg ttattaagtt gtctaagcgt 11220
caatttgttt acaccacaat atatcctgcc accagccagc caacagctcc ccgaccggca 11280
gctcggcaca aaatcaccac tcgatacagg cagcccatca gtccgggacg gcgtcagcgg 11340
gagagccgtt gtaaggcggc agactttgct catgttaccg atgctattcg gaagaacggc 11400
aactaagctg ccgggtttga aacacggatg atctcgcgga gggtagcatg ttgattgtaa 11460
cgatgacaga gcgttgctgc ctgtgatcaa atatcatctc cctcgcagag atccgaatta 11520
tcagccttct tattcatttc tcgcttaacc gtgacaggct gtcgatcttg agaactatgc 11580
cgacataata ggaaatcgct ggataaagcc gctgaggaag ctgagtggcg ctatttcttt 11640
agaagtgaac gttgacgata tcaactcccc tatccattgc tcaccgaatg gtacaggtcg 11700
gggacccgaa gttccgactg tcggcctgat gcatccccgg ctgatcgacc ccagatctgg 11760
ggctgagaaa gcccagtaag gaaacaactg taggttcgag tcgcgagatc ccccggaacc 11820
aaaggaagta ggttaaaccc gctccgatca ggccgagcca cgccaggccg agaacattgg 11880
ttcctgtagg catcgggatt ggcggatcaa acactaaagc tactggaacg agcagaagtc 11940
ctccggccgc cagttgccag gcggtaaagg tgagcagagg cacgggaggt tgccacttgc 12000
gggtcagcac ggttccgaac gccatggaaa ccgcccccgc caggcccgct gcgacgccga 12060
caggatctag cgctgcgttt ggtgtcaaca ccaacagcgc cacgcccgca gttccgcaaa 12120
tagcccccag gaccgccatc aatcgtatcg ggctacctag cagagcggca gagatgaaca 12180
cgaccatcag cggctgcaca gcgcctaccg tcgccgcgac cccgcccggc aggcggtaga 12240
ccgaaataaa caacaagctc cagaatagcg aaatattaag tgcgccgagg atgaagatgc 12300
gcatccacca gattcccgtt ggaatctgtc ggacgatcat cacgagcaat aaacccgccg 12360
gcaacgcccg cagcagcata ccggcgaccc ctcggcctcg ctgttcgggc tccacgaaaa 12420
cgccggacag atgcgccttg tgagcgtcct tggggccgtc ctcctgtttg aagaccgaca 12480
gcccaatgat ctcgccgtcg atgtaggcgc cgaatgccac ggcatctcgc aaccgttcag 12540
cgaacgcctc catgggcttt ttctcctcgt gctcgtaaac ggacccgaac atctctggag 12600
ctttcttcag ggccgacaat cggatctcgc ggaaatcctg cacgtcggcc gctccaagcc 12660
gtcgaatctg agccttaatc acaattgtca attttaatcc tctgtttatc ggcagttcgt 12720
agagcgcgcc gtgcgtcccg agcgatactg agcgaagcaa gtgcgtcgag cagtgcccgc 12780
ttgttcctga aatgccagta aagcgctggc tgctgaaccc ccagccggaa ctgaccccac 12840
aaggccctag cgtttgcaat gcaccaggtc atcattgacc caggcgtgtt ccaccaggcc 12900
gctgcctcgc aactcttcgc aggcttcgcc gacctgctcg cgccacttct tcacgcgggt 12960
ggaatccgat ccgcacatga ggcggaaggt ttccagcttg agcgggtacg gctcccggtg 13020
cgagctgaaa tagtcgaaca tccgtcgggc cgtcggcgac agcttgcggt acttctccca 13080
tatgaatttc gtgtagtggt cgccagcaaa cagcacgacg atttcctcgt cgatcaggac 13140
ctggcaacgg gacgttttct tgccacggtc caggacgcgg aagcggtgca gcagcgacac 13200
cgattccagg tgcccaacgc ggtcggacgt gaagcccatc gccgtcgcct gtaggcgcga 13260
caggcattcc tcggccttcg tgtaataccg gccattgatc gaccagccca ggtcctggca 13320
aagctcgtag aacgtgaagg tgatcggctc gccgataggg gtgcgcttcg cgtactccaa 13380
cacctgctgc cacaccagtt cgtcatcgtc ggcccgcagc tcgacgccgg tgtaggtgat 13440
cttcacgtcc ttgttgacgt ggaaaatgac cttgttttgc agcgcctcgc gcgggatttt 13500
cttgttgcgc gtggtgaaca gggcagagcg ggccgtgtcg tttggcatcg ctcgcatcgt 13560
gtccggccac ggcgcaatat cgaacaagga aagctgcatt tccttgatct gctgcttcgt 13620
gtgtttcagc aacgcggcct gcttggcctc gctgacctgt tttgccaggt cctcgccggc 13680
ggtttttcgc ttcttggtcg tcatagttcc tcgcgtgtcg atggtcatcg acttcgccaa 13740
acctgccgcc tcctgttcga gacgacgcga acgctccacg gcggccgatg gcgcgggcag 13800
ggcaggggga gccagttgca cgctgtcgcg ctcgatcttg gccgtagctt gctggaccat 13860
cgagccgacg gactggaagg tttcgcgggg cgcacgcatg acggtgcggc ttgcgatggt 13920
ttcggcatcc tcggcggaaa accccgcgtc gatcagttct tgcctgtatg ccttccggtc 13980
aaacgtccga ttcattcacc ctccttgcgg gattgccccg actcacgccg gggcaatgtg 14040
cccttattcc tgatttgacc cgcctggtgc cttggtgtcc agataatcca ccttatcggc 14100
aatgaagtcg gtcccgtaga ccgtctggcc gtccttctcg tacttggtat tccgaatctt 14160
gccctgcacg aataccagcg accccttgcc caaatacttg ccgtgggcct cggcctgaga 14220
gccaaaacac ttgatgcgga agaagtcggt gcgctcctgc ttgtcgccgg catcgttgcg 14280
ccacatctag gtactaaaac aattcatcca gtaaaatata atattttatt ttctcccaat 14340
caggcttgat ccccagtaag tcaaaaaata gctcgacata ctgttcttcc ccgatatcct 14400
ccctgatcga ccggacgcag aaggcaatgt cataccactt gtccgccctg ccgcttctcc 14460
caagatcaat aaagccactt actttgccat ctttcacaaa gatgttgctg tctcccaggt 14520
cgccgtggga aaagacaagt tcctcttcgg gcttttccgt ctttaaaaaa tcatacagct 14580
cgcgcggatc tttaaatgga gtgtcttctt cccagttttc gcaatccaca tcggccagat 14640
cgttattcag taagtaatcc aattcggcta agcggctgtc taagctattc gtatagggac 14700
aatccgatat gtcgatggag tgaaagagcc tgatgcactc cgcatacagc tcgataatct 14760
tttcagggct ttgttcatct tcatactctt ccgagcaaag gacgccatcg gcctcactca 14820
tgagcagatt gctccagcca tcatgccgtt caaagtgcag gacctttgga acaggcagct 14880
ttccttccag ccatagcatc atgtcctttt cccgttccac atcataggtg gtccctttat 14940
accggctgtc cgtcattttt aaatataggt tttcattttc tcccaccagc ttatatacct 15000
tagcaggaga cattccttcc gtatctttta cgcagcggta tttttcgatc agttttttca 15060
attccggtga tattctcatt ttagccattt attatttcct tcctcttttc tacagtattt 15120
aaagataccc caagaagcta attataacaa gacgaactcc aattcactgt tccttgcatt 15180
ctaaaacctt aaataccaga aaacagcttt ttcaaagttg ttttcaaagt tggcgtataa 15240
catagtatcg acggagccga ttttgaaacc acaattatgg gtgatgctgc caacttactg 15300
atttagtgta tgatggtgtt tttgaggtgc tccagtggct tctgtgtcta tcagctgtcc 15360
ctcctgttca gctactgacg gggtggtgcg taacggcaaa agcaccgccg gacatcagcg 15420
ctatctctgc tctcactgcc gtaaaacatg gcaactgcag ttcacttaca ccgcttctca 15480
acccggtacg caccagaaaa tcattgatat ggccatgaat ggcgttggat gccgggcaac 15540
agcccgcatt atgggcgttg gcctcaacac gattttacgt cacttaaaaa actcaggccg 15600
cagtcggtaa cctcgcgcat acagccgggc agtgacgtca tcgtctgcgc ggaaatggac 15660
gaacagtggg gctatgtcgg ggctaaatcg cgccagcgct ggctgtttta cgcgtatgac 15720
agtctccgga agacggttgt tgcgcacgta ttcggtgaac gcactatggc gacgctgggg 15780
cgtcttatga gcctgctgtc accctttgac gtggtgatat ggatgacgga tggctggccg 15840
ctgtatgaat cccgcctgaa gggaaagctg cacgtaatca gcaagcgata tacgcagcga 15900
attgagcggc ataacctgaa tctgaggcag cacctggcac ggctgggacg gaagtcgctg 15960
tcgttctcaa aatcggtgga gctgcatgac aaagtcatcg ggcattatct gaacataaaa 16020
cactatcaat aagttggagt cattacccaa ttatgataga atttacaagc tataaggtta 16080
ttgtcctggg tttcaagcat tagtccatgc aagtttttat gctttgccca ttctatagat 16140
atattgataa gcgcgctgcc tatgccttgc cccctgaaat ccttacatac ggcgatatct 16200
tctatataaa agatatatta tcttatcagt attgtcaata tattcaaggc aatctgcctc 16260
ctcatcctct tcatcctctt cgtcttggta gctttttaaa tatggcgctt catagagtaa 16320
ttctgtaaag gtccaattct cgttttcata cctcggtata atcttaccta tcacctcaaa 16380
tggttcgctg ggtttatcgc acccccgaac acgagcacgg cacccgcgac cactatgcca 16440
agaatgccca aggtaaaaat tgccggcccc gccatgaagt ccgtgaatgc cccgacggcc 16500
gaagtgaagg gcaggccgcc acccaggccg ccgccctcac tgcccggcac ctggtcgctg 16560
aatgtcgatg ccagcacctg cggcacgtca atgcttccgg gcgtcgcgct cgggctgatc 16620
gcccatcccg ttactgcccc gatcccggca atggcaagga ctgccagcgc tgccattttt 16680
ggggtgaggc cgttcgcggc cgaggggcgc agcccctggg gggatgggag gcccgcgtta 16740
gcgggccggg agggttcgag aagggggggc accccccttc ggcgtgcgcg gtcacgcgca 16800
cagggcgcag ccctggttaa aaacaaggtt tataaatatt ggtttaaaag caggttaaaa 16860
gacaggttag cggtggccga aaaacgggcg gaaacccttg caaatgctgg attttctgcc 16920
tgtggacagc ccctcaaatg tcaataggtg cgcccctcat ctgtcagcac tctgcccctc 16980
aagtgtcaag gatcgcgccc ctcatctgtc agtagtcgcg cccctcaagt gtcaataccg 17040
cagggcactt atccccaggc ttgtccacat catctgtggg aaactcgcgt aaaatcaggc 17100
gttttcgccg atttgcgagg ctggccagct ccacgtcgcc ggccgaaatc gagcctgccc 17160
ctcatctgtc aacgccgcgc cgggtgagtc ggcccctcaa gtgtcaacgt ccgcccctca 17220
tctgtcagtg agggccaagt tttccgcgag gtatccacaa cgccggcggc cgcggtgtct 17280
cgcacacggc ttcgacggcg tttctggcgc gtttgcaggg ccatagacgg ccgccagccc 17340
agcggcgagg gcaaccagcc cgg 17363
<210> 50
<211> 1698
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 50
atgaagcaaa gctttgtggg ggtttcttca caaacagagg ctgagacaag agccagagag 60
agtgacacac agacacaccc acaaaggaac ctcagtcact gtccccaaaa acatcctcaa 120
aacttgcaaa gaacccaaaa tggtgggtct gttggggggg catcaatggc aacagggatg 180
atggtgaggt catcaccatt cacagtgtct cagtggcagg agctggagca tcaagctttg 240
atctttaagt atctaatggc aggactaccc gtcccacctg atcttgtgct tcctattcag 300
aagagctttg aatctatgtc ttctaggttc ttccaccatc ctaccatggg ttactgttcc 360
ttctatggga agaaggtgga cccggagcca ggtagatgcc ggaggactga tggcaagaaa 420
tggaggtgct ccaaagacgc ctacccggac tccaagtact gtgagcgcca catgcaccgt 480
ggccgcaacc gttctagaaa accggtcgaa tcacaaacta ctacacagtc ttcttcgact 540
gtgacttcac taactgtcac tggaggcagt ggcagtggag ccgggacctt ccagaacctt 600
cccctgcaaa acttcaataa cccccaaggc ggcccggggt ccggaaccaa ccaatcccaa 660
ttccatatgg agcccattcc ctatggcatt cctaataaag attacaggta tcttcatgga 720
cttaaacccg aggtaggcga acatagtttc ttctctgaag cttcaggaag caacaggggt 780
cttcagatgg actcgccttt ggacagcaca tggcctctga tgccatcacg ggtctcatca 840
ttccccccat caaaatcaag tgaaaactcc atcttgcaga gtgactaccc ccagcattcg 900
ttcttcagca gcgaatttgc gtcgggagag gttgtgaaac aggagggtca gtctctccgt 960
cctttctttg acgagtggcc taaaaccaga gactcctggt ccggccttga ggaagagagg 1020
tccaaccaga cctcattctc cacaacccag ctctcaattt cgatcccaat ggcctcctct 1080
gacttctcaa caaccagctc tcggtctccc cacgataacg cggccgctgc catgcagccc 1140
atgatggcag cctattaccc cagcaacgtc accactgatc acattcagca gtatcttgat 1200
gaaaacaagt cattgattct gaagattgtt gagagccaga attcaggaaa attgactgaa 1260
tgtgcagaga accaggcaag actacagaga aacctcatgt acctggctgc aatcgctgat 1320
tctcaacccc aaccacccac catgcatgct cagttccctc ctagtggcat tgttcagcca 1380
ggagctcact acatgcagca ccaacaagct caacaaatga caccacagtc gctcctggct 1440
gcacgctcct ccatgctgta ctcccaacaa ccattttcgg ccctgcaaca acaacaagcc 1500
atccatagcc agcttggcat gggctctggt ggaagtgcag gacttcacat gctgcaaagc 1560
gaggggagta atccaggagg caatggaaca ctggggactg atatgggaaa tgcagaaggg 1620
cgaggaggga actcaggagg tcagggtggg gatggaggtg agactcttta cttgaaagct 1680
gctgaagatg ggaattga 1698
<210> 51
<211> 1383
<212> DNA
<213> Triticum aestivum
<400> 51
atggcgatgc cgtatgcctc tctttccccg gcaggcgacc gccgctcctc cccggccgcc 60
accgccaccg cctccctcct ccccttctgc cgctcctccc ccttctccgc cggcggcaat 120
ggcggcatgg gggaggaggc gcggatggac gggaggtgga tggcgaggcc ggtgcccttc 180
acggcggcgc agtacgagga gctggagcac caggcgctca tatacaagta cctggtggcc 240
ggcgtgtccg tcccgccgga tctcgtgctc cccatccgcc gcggcatcga gtccctcgcc 300
gcccgcttct accacaaccc cctcgccatc gggtacggat cgtacctggg caagaaggtg 360
gatccggagc cgggccggtg ccggcgcacg gacggcaaga agtggcggtg cgccaaggag 420
gccgcctccg actccaagta ttgcgagcgc cacatgcacc gcggccgcaa ccgttcaaga 480
aagcctgtgg aaacgcagct cgtctcgcac tcccagccgc cggccgcctc cgtcgtgccg 540
cccctcgcca ccggcttcca caaccactcc ctctaccccg ccatcggcgg caccaacggt 600
ggtggaggcg gggggaacaa cggcatgtcc atgcccggca cgttctcctc cgcgctgggg 660
ccgcctcagc agcacatggg caacaatgcc gcctctccct acgcggctct cggcggcgcc 720
ggaacatgca aagatttcag gtataccgca tatggaataa gatctttggc agacgagcag 780
agtcagctca tgacagaagc catgaacacc tccgtggaga acccatggcg cctgccgcca 840
tcttctcaaa cgactacatt cccgctctca agctactctc ctcagcttgg agcaacgagt 900
gacctgggtc agaacaacag cagcaacaac aacagcggcg tcaaggccga gggacagcag 960
cagcagcagc cgctctcctt cccggggtgc ggcgacttcg gcagcggcga ctccgcgaag 1020
caggagaacc agacgctgcg gccgttcttc gacgagtggc cgaagacgag ggactcgtgg 1080
tcggacctga ccgacgacaa ctcgaacgtc gcctccttct cggccaccca gctgtcgatc 1140
tcgataccca tgacgtcctc cgacttctcc gccgccagct cccagtcgcc caacggcatg 1200
ctgttcgccg gcgagatgta ccgttcagct ttcttgtaca aagtggttga taacagcggg 1260
ttaattaaca tcttttaccc atacgatgtt cctgactatg cgggctatcc ctatgacgtc 1320
ccggactatg caggatccta tccatatgac gttccagatt acgctgctca gtgcagcgct 1380
tag 1383
<210> 52
<211> 837
<212> DNA
<213> Triticum aestivum
<400> 52
atgcagcagc aacacctgat gcagatgaac cagagcatga tggggggcta cgcttcctct 60
accactgcca ccactgatct cattcagcag tacctggatg agaacaagca gctgatcctg 120
gccatcctcg acaaccagaa caacggcaag gtggaggagt gcgcacggaa ccaagctaag 180
ctccagcaga acctcatgta cctcgccgcc atcgccgaca gccagcctcc gcagacggca 240
tcgctgtctc agtacccgtc caacctgatg atgcagtccg ggccgcggta catgcagcag 300
cagtcggcgc agatgatgtc gccgcagtcg ctgatggcgg cgcggtcgtc gatgatgtac 360
gcgcagcagg ccatgtcgcc gctccagcag cagcagcagc agcagcagca ccaggcggcc 420
gcgcacggcc agctggggat gtcctccggc gcgaccaccg ggttcaacct cctgcacggt 480
gaggccagca tgggcggcgg cggcggcgcc actggcaaca gcatgatgaa cgccagcgtc 540
ttctcggact atggccgcgg cggcagcggc gccaaggagg ggtcgacctc gctgtcggcc 600
gacgctcgcg gcgccaactc tggcgcgcac agcggcgacg gggagtacct caagggcacc 660
gaggaggaag gaagccgttc agctttcttg tacaaagtgg ttgataacag cgggttaatt 720
aacatctttt acccatacga tgttcctgac tatgcgggct atccctatga cgtcccggac 780
tatgcaggat cctatccata tgacgttcca gattacgctg ctcagtgcag cgcttag 837
<210> 53
<211> 42
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 53
gcggccgcaa ccgttcaaga aagcctgtgg aaacgcagct cg 42
<210> 54
<211> 42
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 54
gcggccgcaa ccgttctaga aaaccggtcg aaacgcagct cg 42
<210> 55
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 55
gtcaagttct ttcggacacc t 21
<210> 56
<211> 7
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 56
Arg Ser Arg Lys Leu Val Glu
1 5
<210> 57
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 57
gtttacacca caatatatcc tgcca 25
<210> 58
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 58
gtttacccgc caatatatcc tgtca 25
<210> 59
<211> 1992
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 59
aagctttgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60
agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120
tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180
tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240
gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300
ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360
gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt 420
agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480
taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540
aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600
cgagtctaac ggacaccaac cagcgaacca gcagcgtcgc gtcgggccaa gcgaagcaga 660
cggcacggca tctctgtcgc tgcctctgga cccctctcga gagttccgct ccaccgttgg 720
acttgctccg ctgtcggcat ccagaaattg cgtggcggag cggcagacgt gagccggcac 780
ggcaggcggc ctcctcctcc tctcacggca cggcagctac gggggattcc tttcccaccg 840
ctccttcgct ttcccttcct cgcccgccgt aataaataga caccccctcc acaccctctt 900
tccccaacct cgtgttgttc ggagcgcaca cacacacaac cagatctccc ccaaatccac 960
ccgtcggcac ctccgcttca aggtacgccg ctcgtcctcc cccccccccc ctctctacct 1020
tctctagatc ggcgttccgg tccatggtta gggcccggta gttctacttc tgttcatgtt 1080
tgtgttagat ccgtgtttgt gttagatccg tgctgctagc gttcgtacac ggatgcgacc 1140
tgtacgtcag acacgttctg attgctaact tgccagtgtt tctctttggg gaatcctggg 1200
atggctctag ccgttccgca gacgggatcg atttcatgat tttttttgtt tcgttgcata 1260
gggtttggtt tgcccttttc ctttatttca atatatgccg tgcacttgtt tgtcgggtca 1320
tcttttcatg cttttttttg tcttggttgt gatgatgtgg tctggttggg cggtcgttct 1380
agatcggagt agaattctgt ttcaaactac ctggtggatt tattaatttt ggatctgtat 1440
gtgtgtgcca tacatattca tagttacgaa ttgaagatga tggatggaaa tatcgatcta 1500
ggataggtat acatgttgat gcgggtttta ctgatgcata tacagagatg ctttttgttc 1560
gcttggttgt gatgatgtgg tgtggttggg cggtcgttca ttcgttctag atcggagtag 1620
aatactgttt caaactacct ggtgtattta ttaattttgg aactgtatgt gtgtgtcata 1680
catcttcata gttacgagtt taagatggat ggaaatatcg atctaggata ggtatacatg 1740
ttgatgtggg ttttactgat gcatatacat gatggcatat gcagcatcta ttcatatgct 1800
ctaaccttga gtacctatct attataataa acaagtatgt tttataatta ttttgatctt 1860
gatatacttg gatgatggca tatgcagcag ctatatgtgg atttttttag ccctgccttc 1920
atacgctatt tatttgcttg gtactgtttc ttttgtcgat gctcaccctg ttgtttggtg 1980
ttacttctgc ag 1992
<210> 60
<211> 120
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 60
gggttaatta acatctttta cccatacgat gttcctgact atgcgggcta tccctatgac 60
gtcccggact atgcaggatc ctatccatat gacgttccag attacgctgc tcagtgcagc 120
<210> 61
<211> 253
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 61
gatcgttcaa acatttggca ataaagtttc ttaagattga atcctgttgc cggtcttgcg 60
atgattatca tataatttct gttgaattac gttaagcatg taataattaa catgtaatgc 120
atgacgttat ttatgagatg ggtttttatg attagagtcc cgcaattata catttaatac 180
gcgatagaaa acaaaatata gcgcgcaaac taggataaat tatcgcgcgc ggtgtcatct 240
atgttactag atc 253
<210> 62
<211> 755
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 62
ccgagcaata atctccagga gatcaaatac cttcccaaga aggttaaaga tgcagtcaaa 60
agattcagga ctaactgcat caagaacaca gagaaagata tatttctcaa gatcagaagt 120
actattccag tatggacgat tcaaggcttg cttcacaaac caaggcaagt aatagagatt 180
ggagtctcta aaaaggtagt tcccactgaa tcaaaggcca tggagtcaaa gattcaaata 240
gaggacctaa cagaactcgc cgtaaagact ggcgaacagt tcatacagag tctcttacga 300
ctcaatgaca agaagaaaat cttcgtcaac atggtggagc acgacacgct tgtctactcc 360
aaaaatatca aagatacagt ctcagaagac caaagggcaa ttgagacttt tcaacaaagg 420
gtaatatccg gaaacctcct cggattccat tgcccagcta tctgtcactt tattgtgaag 480
atagtggaaa aggaaggtgg ctcctacaaa tgccatcatt gcgataaagg aaaggccatc 540
gttgaagatg cctctgccga cagtggtccc aaagatggac ccccacccac gaggagcatc 600
gtggaaaaag aagacgttcc aaccacgtct tcaaagcaag tggattgatg tgatatctcc 660
actgacgtaa gggatgacgc acaatcccac tatccttcgc aagacccttc ctctatataa 720
ggaagttcat ttcatttgga gaggacacgg gggac 755
<210> 63
<211> 1038
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 63
atgagatatg aaaaagcctg aactcaccgc gacgtctgtc gagaagtttc tgatcgaaaa 60
gttcgacagc gtctccgacc tgatgcagct ctcggagggc gaagaatctc gtgctttcag 120
cttcgatgta ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg atggtttcta 180
caaagatcgt tatgtttatc ggcactttgc atcggccgcg ctcccgattc cggaagtgct 240
tgacattggg gaattcagcg agagcctgac ctattgcatc tcccgccgtg cacagggtgt 300
cacgttgcaa gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg tcgcggaggc 360
catggatgcg atcgctgcgg ccgatcttag ccagacgagc gggttcggcc cattcggacc 420
gcaaggaatc ggtcaataca ctacatggcg tgatttcata tgcgcgattg ctgatcccca 480
tgtgtatcac tggcaaactg tgatggacga caccgtcagt gcgtccgtcg cgcaggctct 540
cgatgagctg atgctttggg ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga 600
tttcggctcc aacaatgtcc tgacggacaa tggccgcata acagcggtca ttgactggag 660
cgaggcgatg ttcggggatt cccaatacga ggtcgccaac atcttcttct ggaggccgtg 720
gttggcttgt atggagcagc agacgcgcta cttcgagcgg aggcatccgg agcttgcagg 780
atcgccgcgg ctccgggcgt atatgctccg cattggtctt gaccaactct atcagagctt 840
ggttgacggc aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg caatcgtccg 900
atccggagcc gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg ccgtctggac 960
cgatggctgt gtagaagtac tcgccgatag tggaaaccga cgccccagca ctcgtccgag 1020
ggcaaaggaa tagagtag 1038
<210> 64
<211> 4203
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 64
atgcctaaga agaagagaaa ggtcggtatt cacggcgttc ctgcggcgag tgacaagaag 60
tactcgatcg gcctcgacat cgggacgaac tcagttggct gggccgtgat caccgacgag 120
tacaaggtgc cctctaagaa gttcaaggtc ctggggaaca ccgaccgcca ttccatcaag 180
aagaacctca tcggcgctct cctgttcgac agcggggaga ccgctgaggc tacgaggctc 240
aagagaaccg ctaggcgccg gtacacgaga aggaagaaca ggatctgcta cctccaagag 300
attttctcca acgagatggc caaggttgac gattcattct tccaccgcct ggaggagtct 360
ttcctcgtgg aggaggataa gaagcacgag cggcatccca tcttcggcaa catcgtggac 420
gaggttgcct accacgagaa gtaccctacg atctaccatc tgcggaagaa gctcgtggac 480
tccaccgata aggcggacct cagactgatc tacctcgctc tggcccacat gatcaagttc 540
cgcggccatt tcctgatcga gggggatctc aacccagaca acagcgatgt tgacaagctg 600
ttcatccaac tcgtgcagac ctacaaccaa ctcttcgagg agaacccgat caacgcctct 660
ggcgtggacg cgaaggctat cctgtccgcg aggctctcga agtccaggag gctggagaac 720
ctgatcgctc agctcccagg cgagaagaag aacggcctgt tcgggaacct catcgctctc 780
agcctggggc tcaccccgaa cttcaagtcg aacttcgatc tcgctgagga cgccaagctg 840
caactctcca aggacaccta cgacgatgac ctcgataacc tcctggccca gatcggcgat 900
caatacgcgg acctgttcct cgctgccaag aacctgtcgg acgccatcct cctgtcagat 960
atcctccgcg tgaacaccga gatcacgaag gctccactct ctgcctccat gatcaagcgc 1020
tacgacgagc accatcagga tctgaccctc ctgaaggcgc tggtccgcca acagctcccg 1080
gagaagtaca aggagatttt cttcgatcag tcgaagaacg gctacgctgg gtacatcgac 1140
ggcggggcct cacaagagga gttctacaag ttcatcaagc caatcctgga gaagatggac 1200
ggcacggagg agctcctggt gaagctcaac agggaggacc tcctgcggaa gcagagaacc 1260
ttcgataacg gcagcatccc ccaccaaatc catctcgggg agctgcacgc catcctgaga 1320
aggcaagagg acttctaccc tttcctcaag gataaccggg agaagatcga gaagatcctg 1380
accttcagaa tcccatacta cgtcggccct ctcgcgcggg ggaactcaag attcgcttgg 1440
atgacccgca agtctgagga gaccatcacg ccgtggaact tcgaggaggt ggtggacaag 1500
ggcgctagcg ctcagtcgtt catcgagagg atgaccaact tcgacaagaa cctgcccaac 1560
gagaaggtgc tccctaagca ctcgctcctg tacgagtact tcaccgtcta caacgagctc 1620
acgaaggtga agtacgtcac cgagggcatg cgcaagccag cgttcctgtc cggggagcag 1680
aagaaggcta tcgtggacct cctgttcaag accaaccgga aggtcacggt taagcaactc 1740
aaggaggact acttcaagaa gatcgagtgc ttcgattcgg tcgagatcag cggcgttgag 1800
gaccgcttca acgccagcct cgggacctac cacgatctcc tgaagatcat caaggataag 1860
gacttcctgg acaacgagga gaacgaggat atcctggagg acatcgtgct gaccctcacg 1920
ctgttcgagg acagggagat gatcgaggag cgcctgaaga cgtacgccca tctcttcgat 1980
gacaaggtca tgaagcaact caagcgccgg agatacaccg gctgggggag gctgtcccgc 2040
aagctcatca acggcatccg ggacaagcag tccgggaaga ccatcctcga cttcctgaag 2100
agcgatggct tcgccaacag gaacttcatg caactgatcc acgatgacag cctcaccttc 2160
aaggaggata tccaaaaggc tcaagtgagc ggccaggggg actcgctgca cgagcatatc 2220
gcgaacctcg ctggctcccc cgcgatcaag aagggcatcc tccagaccgt gaaggttgtg 2280
gacgagctcg tgaaggtcat gggccggcac aagcctgaga acatcgtcat cgagatggcc 2340
agagagaacc aaaccacgca gaaggggcaa aagaactcta gggagcgcat gaagcgcatc 2400
gaggagggca tcaaggagct ggggtcccaa atcctcaagg agcacccagt ggagaacacc 2460
caactgcaga acgagaagct ctacctgtac tacctccaga acggcaggga tatgtacgtg 2520
gaccaagagc tggatatcaa ccgcctcagc gattacgacg tcgatcatat cgttccccag 2580
tctttcctga aggatgactc catcgacaac aaggtcctca ccaggtcgga caagaaccgc 2640
ggcaagtcag ataacgttcc atctgaggag gtcgttaaga agatgaagaa ctactggagg 2700
cagctcctga acgccaagct gatcacgcaa aggaagttcg acaacctcac caaggctgag 2760
agaggcgggc tctcagagct ggacaaggcc ggcttcatca agcggcagct ggtcgagacc 2820
agacaaatca cgaagcacgt tgcgcaaatc ctcgactctc ggatgaacac gaagtacgat 2880
gagaacgaca agctgatcag ggaggttaag gtgatcaccc tgaagtctaa gctcgtctcc 2940
gacttcagga aggatttcca gttctacaag gttcgcgaga tcaacaacta ccaccatgcc 3000
catgacgctt acctcaacgc tgtggtcggc accgctctga tcaagaagta cccaaagctg 3060
gagtccgagt tcgtgtacgg ggactacaag gtttacgatg tgcgcaagat gatcgccaag 3120
tcggagcaag agatcggcaa ggctaccgcc aagtacttct tctactcaaa catcatgaac 3180
ttcttcaaga ccgagatcac gctggccaac ggcgagatcc ggaagagacc gctcatcgag 3240
accaacggcg agacggggga gatcgtgtgg gacaagggca gggatttcgc gaccgtccgc 3300
aaggttctct ccatgcccca ggtgaacatc gtcaagaaga ccgaggtcca aacgggcggg 3360
ttctcaaagg agtctatcct gcctaagcgg aacagcgaca agctcatcgc cagaaagaag 3420
gactgggacc caaagaagta cggcgggttc gacagcccta ccgtggccta ctcggtcctg 3480
gttgtggcga aggttgagaa gggcaagtcc aagaagctca agagcgtgaa ggagctcctg 3540
gggatcacca tcatggagag gtccagcttc gagaagaacc caatcgactt cctggaggcc 3600
aagggctaca aggaggtgaa gaaggacctg atcatcaagc tcccgaagta ctctctcttc 3660
gagctggaga acggcaggaa gagaatgctg gcttccgctg gcgagctcca gaaggggaac 3720
gagctcgcgc tgccaagcaa gtacgtgaac ttcctctacc tggcttccca ctacgagaag 3780
ctcaagggca gcccggagga caacgagcaa aagcagctgt tcgtcgagca gcacaagcat 3840
tacctcgacg agatcatcga gcaaatctcc gagttcagca agcgcgtgat cctcgccgac 3900
gcgaacctgg ataaggtcct ctccgcctac aacaagcacc gggacaagcc catcagagag 3960
caagcggaga acatcatcca tctcttcacc ctgacgaacc tcggcgctcc tgctgctttc 4020
aagtacttcg acaccacgat cgatcggaag agatacacct ccacgaagga ggtcctggac 4080
gcgaccctca tccaccagtc gatcaccggc ctgtacgaga cgaggatcga cctctcacaa 4140
ctcggcgggg ataagagacc cgcagcaacc aagaaggcag ggcaagcaaa gaagaagaag 4200
tga 4203
<210> 65
<211> 362
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 65
gaccaagccc gttattctga cagttctggt gctcaacaca tttatattta tcaaggagca 60
cattgttact cactgctagg agggaatcga actaggaata ttgatcagag gaactacgag 120
agagctgaag ataactgccc tctagctctc actgatctgg gtcgcatagt gagatgcagc 180
ccacgtgagt tcagcaacgg tctagcgctg ggcttttagg cccgcatgat cgggcttttg 240
tcgggtggtc gacgtgttca cgattgggga gagcaacgca gcagttcctc ttagtttagt 300
cccacctcgc ctgtccagca gagttctgac cggtttataa actcgcttgc tgcatcagac 360
tt 362
<210> 66
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 66
gatgaggaac tggaccaagg 20
<210> 67
<211> 51
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 67
gttttagagc tagaaatagc aagttaaaat aaggctagtc cgttatcaac t 51
<210> 68
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 68
gttttttgcc agatgaggaa ctggaccaag gaggagttcg 40
<210> 69
<211> 35
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 69
Pro Phe Thr Ala Thr Gln Trp Gln Glu Leu Glu His Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Met Ala Ser Gly Val Pro Ile Pro Ser Asp Leu Leu Leu
20 25 30
Pro Leu Arg
35
<210> 70
<211> 35
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 70
Leu Phe Thr Ala Ser Gln Trp Arg Glu Leu Glu His Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Met Ala Ala Gly Ser Gln Val Pro His Glu Leu Val Leu
20 25 30
Pro Leu Arg
35
<210> 71
<211> 35
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 71
Thr Phe Thr Ala Ala Gln Tyr Glu Glu Leu Glu Gln Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Leu Val Ala Gly Val Pro Val Pro Pro Asp Leu Leu Leu
20 25 30
Pro Ile Arg
35
<210> 72
<211> 35
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 72
Pro Phe Thr Ala Ala Gln Tyr Glu Glu Leu Glu His Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Leu Val Ala Gly Val Ser Val Pro Pro Asp Leu Val Leu
20 25 30
Pro Ile Arg
35
<210> 73
<211> 35
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 73
Val Phe Thr Pro Ala Gln Trp Ala Glu Leu Glu Gln Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Leu Met Ala Gly Val Pro Val Pro Pro Asp Leu Leu Leu
20 25 30
Pro Ile Arg
35
<210> 74
<211> 35
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 74
Pro Phe Thr Ala Ser Gln Trp Gln Glu Leu Glu His Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Met Ala Ser Gly Thr Pro Ile Pro Ser Asp Leu Ile Leu
20 25 30
Pro Leu Arg
35
<210> 75
<211> 35
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 75
Leu Phe Thr Ala Thr Gln Trp Gln Glu Leu Glu His Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Met Ala Ala Gly Ala Pro Val Pro Pro Asp Leu Leu Leu
20 25 30
His Leu Arg
35
<210> 76
<211> 35
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 76
Pro Phe Thr Ala Ala Gln Tyr Glu Glu Leu Glu Gln Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Leu Val Ala Gly Val Pro Val Pro Ala Asp Leu Leu Leu
20 25 30
Pro Ile Arg
35
<210> 77
<211> 36
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 77
Pro Pro Phe Thr Ala Ala Gln Tyr Glu Glu Leu Glu Gln Gln Ala Leu
1 5 10 15
Ile Tyr Lys Tyr Leu Val Ala Gly Val Pro Val Pro Pro Asp Leu Val
20 25 30
Leu Pro Ile Arg
35
<210> 78
<211> 35
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 78
Val Phe Thr Ala Ala Gln Trp Ala Glu Leu Glu Gln Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Leu Val Ala Gly Val Pro Val Pro Gly Asp Leu Leu Leu
20 25 30
Pro Ile Arg
35
<210> 79
<211> 35
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 79
Phe Phe Ser Trp Ala Gln Trp Gln Glu Leu Glu Leu Gln Ala Leu Ile
1 5 10 15
Tyr Arg Tyr Met Leu Ala Gly Ala Ala Val Pro Gln Glu Leu Leu Leu
20 25 30
Pro Ile Lys
35
<210> 80
<211> 35
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 80
Phe Phe Ser Trp Ala Gln Trp Gln Glu Leu Glu Leu Gln Ala Leu Ile
1 5 10 15
Tyr Arg Tyr Met Leu Ala Gly Ala Ser Val Pro Gln Glu Leu Leu Leu
20 25 30
Pro Ile Lys
35
<210> 81
<211> 36
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 81
Pro Phe Thr Pro Thr Gln Trp Glu Glu Leu Glu His Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Met Val Ser Gly Val Pro Val Pro Pro Glu Leu Ile Phe
20 25 30
Ser Ile Arg Arg
35
<210> 82
<211> 35
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 82
Pro Phe Thr Glu Ser Gln Trp Glu Glu Leu Glu Asn Gln Ala Leu Val
1 5 10 15
Phe Lys Tyr Leu Ala Ala Asn Met Pro Val Pro Pro His Leu Leu Phe
20 25 30
Leu Ile Lys
35
<210> 83
<211> 36
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 83
Pro Phe Thr Val Ser Gln Trp Gln Glu Leu Glu His Gln Ala Leu Ile
1 5 10 15
Phe Lys Tyr Leu Met Ala Gly Leu Pro Val Pro Pro Asp Leu Val Leu
20 25 30
Pro Ile Gln Lys
35
<210> 84
<211> 44
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 84
Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg
1 5 10 15
Cys Ser Lys Glu Ala Tyr Pro Asp Ser Lys Tyr Cys Glu Lys His Met
20 25 30
His Arg Gly Lys Asn Arg Ser Arg Lys Pro Val Glu
35 40
<210> 85
<211> 44
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 85
Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg
1 5 10 15
Cys Ser Arg Glu Ala Tyr Gly Glu Ser Lys Tyr Cys Asp Arg His Met
20 25 30
His Arg Gly Lys Asn Arg Ser Arg Lys Pro Val Glu
35 40
<210> 86
<211> 44
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 86
Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg
1 5 10 15
Cys Ser Lys Glu Ala Ala Gln Asp Ser Lys Tyr Cys Glu Arg His Met
20 25 30
His Arg Gly Arg Asn Arg Ser Arg Lys Pro Val Glu
35 40
<210> 87
<211> 44
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 87
Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg
1 5 10 15
Cys Ala Lys Glu Ala Ala Ser Asp Ser Lys Tyr Cys Glu Arg His Met
20 25 30
His Arg Gly Arg Asn Arg Ser Arg Lys Pro Val Glu
35 40
<210> 88
<211> 44
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 88
Asp Pro Glu Pro Trp Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg
1 5 10 15
Cys Ser Lys Glu Ala His Pro Asp Ser Lys Tyr Cys Glu Arg His Met
20 25 30
His Arg Gly Arg Asn Arg Ser Arg Lys Pro Val Glu
35 40
<210> 89
<211> 44
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 89
Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg
1 5 10 15
Cys Ser Arg Glu Ala Tyr Gly Glu Ser Lys Tyr Cys Glu Lys His Met
20 25 30
His Arg Gly Lys Asn Arg Ser Arg Lys Pro Val Glu
35 40
<210> 90
<211> 44
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 90
Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg
1 5 10 15
Cys Ser Lys Glu Ala Ala Pro Asp Ser Lys Tyr Cys Glu Arg His Met
20 25 30
His Arg Gly Arg Asn Arg Ser Arg Lys Pro Val Glu
35 40
<210> 91
<211> 44
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 91
Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg
1 5 10 15
Cys Ser Arg Asp Val Phe Ala Gly His Lys Tyr Cys Glu Arg His Met
20 25 30
His Arg Gly Arg Asn Arg Ser Arg Lys Pro Val Glu
35 40
<210> 92
<211> 44
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 92
Asp Pro Glu Pro Gly Arg Cys Lys Arg Thr Asp Gly Lys Lys Trp Arg
1 5 10 15
Cys Ser Arg Asp Val Val Ala Gly His Lys Tyr Cys Asp Arg His Ile
20 25 30
His Arg Gly Arg Asn Arg Ser Arg Lys Pro Val Glu
35 40
<210> 93
<211> 44
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 93
Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg
1 5 10 15
Cys Ser Arg Glu Ala Tyr Pro Asp Ser Lys Tyr Cys Glu Lys His Met
20 25 30
His Arg Gly Arg Asn Arg Ala Arg Lys Ser Leu Asp
35 40
<210> 94
<211> 44
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 94
Asp Ala Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg
1 5 10 15
Cys Ser Lys Glu Ala Tyr Pro Asp Ser Lys Tyr Cys Glu Arg His Met
20 25 30
His Arg Gly Lys Asn Arg Ser Ser Ser Arg Lys Pro
35 40
<210> 95
<211> 44
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 95
Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys Lys Trp Arg
1 5 10 15
Cys Ser Lys Asp Ala Tyr Pro Asp Ser Lys Tyr Cys Glu Arg His Met
20 25 30
His Arg Gly Arg Asn Arg Ser Arg Lys Pro Val Glu
35 40
<210> 96
<211> 79
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 96
Pro Phe Thr Ala Ala Gln Tyr Glu Glu Leu Glu His Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Leu Val Ala Gly Val Ser Val Pro Pro Asp Leu Val Leu
20 25 30
Pro Ile Arg Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr Asp Gly Lys
35 40 45
Lys Trp Arg Cys Ala Lys Glu Ala Ala Ser Asp Ser Lys Tyr Cys Glu
50 55 60
Arg His Met His Arg Gly Arg Asn Arg Ser Arg Lys Pro Val Glu
65 70 75
<210> 97
<211> 59
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 97
Ser Asn Val Thr Ser Asp His Ile Gln Gln Tyr Leu Asp Glu Asn Lys
1 5 10 15
Ser Leu Ile Leu Lys Ile Val Glu Ser Gln Asn Ser Gly Lys Leu Ser
20 25 30
Glu Cys Ala Glu Asn Gln Ala Arg Leu Gln Arg Asn Leu Met Tyr Leu
35 40 45
Ala Ala Ile Ala Asp Ser Gln Pro Gln Pro Pro
50 55
<210> 98
<211> 59
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 98
Asn Asn Ile Thr Thr Glu Gln Ile Gln Lys Tyr Leu Asp Glu Asn Lys
1 5 10 15
Lys Leu Ile Met Ala Ile Met Glu Asn Gln Asn Leu Gly Lys Leu Ala
20 25 30
Glu Cys Ala Gln Tyr Gln Ala Leu Leu Gln Lys Asn Leu Met Tyr Leu
35 40 45
Ala Ala Ile Ala Asp Ala Gln Pro Pro Pro Pro
50 55
<210> 99
<211> 59
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 99
Asn Asn Ile Thr Thr Glu Gln Ile Gln Lys Tyr Leu Asp Glu Asn Lys
1 5 10 15
Lys Leu Ile Met Ala Ile Leu Glu Asn Gln Asn Leu Gly Lys Leu Ala
20 25 30
Glu Cys Ala Gln Tyr Gln Ala Leu Leu Gln Lys Asn Leu Met Tyr Leu
35 40 45
Ala Ala Ile Ala Asp Ala Gln Pro Gln Pro Pro
50 55
<210> 100
<211> 59
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 100
Thr Thr Val Thr Thr Asp Leu Ile Gln Gln Tyr Leu Asp Glu Asn Lys
1 5 10 15
Gln Leu Ile Leu Ala Ile Leu Asp Asn Gln Asn Asn Gly Lys Val Glu
20 25 30
Glu Cys Ala Arg Asn Gln Ala Lys Leu Gln Gln Asn Leu Met Tyr Leu
35 40 45
Ala Ala Ile Ala Asp Ser Gln Pro Pro Gln Thr
50 55
<210> 101
<211> 59
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 101
Ala Gly Ile Thr Thr Glu Gln Ile Gln Lys Tyr Leu Asp Glu Asn Lys
1 5 10 15
Gln Leu Ile Leu Ala Ile Leu Glu Asn Gln Asn Leu Gly Lys Leu Ala
20 25 30
Glu Cys Ala Gln Tyr Gln Ala Gln Leu Gln Lys Asn Leu Leu Tyr Leu
35 40 45
Ala Ala Ile Ala Asp Ala Gln Pro Pro Gln Asn
50 55
<210> 102
<211> 59
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 102
Ala Gly Leu Ser Thr Glu Gln Ile Gln Lys Tyr Leu Asp Glu Asn Lys
1 5 10 15
Gln Leu Ile Leu Ala Ile Leu Glu Asn Gln Asn Leu Gly Lys Leu Ala
20 25 30
Glu Cys Ala Gln Tyr Gln Ala Gln Leu Gln Lys Asn Leu Leu Tyr Leu
35 40 45
Ala Ala Ile Ala Asp Thr Gln Pro Gln Thr Thr
50 55
<210> 103
<211> 58
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 103
Asn Val Thr Thr Asp His Ile Gln Gln Tyr Leu Asp Glu Asn Lys Ser
1 5 10 15
Leu Ile Leu Lys Ile Val Glu Ser Gln Asn Ser Gly Lys Leu Thr Glu
20 25 30
Cys Ala Glu Asn Gln Ala Arg Leu Gln Arg Asn Leu Met Tyr Leu Ala
35 40 45
Ala Ile Ala Asp Ser Gln Pro Gln Pro Pro
50 55
<210> 104
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 104
cctccgactc caagtattgc 20
<210> 105
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 105
atcatcaggt tggacgggta 20
<210> 106
<211> 52
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 106
ggggacaagt ttgtacaaaa aagctgccac catggcgatg ccgtatgcct ct 52
<210> 107
<211> 51
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 107
ggggaccact ttgtacaaga aagctgaacg gtacatytcg ccggcgaaca g 51
<210> 108
<211> 52
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 108
ggggacaagt ttgtacaaaa aagctgccac catgcagcag caacacctga tg 52
<210> 109
<211> 48
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 109
ggggaccact ttgtacaaga aagctgaacg gcttccttcc tcctcggt 48
<210> 110
<211> 52
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 110
ggggacaagt ttgtacaaaa aagctgccac catggcgatg ccgtatgcct ct 52
<210> 111
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 111
ggcagcggcc gcgtacatyt cgccggcgaa cag 33
<210> 112
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 112
gcggccgctg ccatgcagca gcaacacctg atg 33
<210> 113
<211> 51
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 113
ggggaccact ttgtacaaga aagctgaacg ctagcttcct tcctcctcgg t 51
<210> 114
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 114
aagctgccgc ggccatgcag cagcaacacc tg 32
<210> 115
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 115
tacgcggccg ctgccgaagc tcgaaaaaca aag 33
<210> 116
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 116
catggccgcg gcagcttttt gatgaaacag aag 33
<210> 117
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 117
tctagaaaac cggtcgaaac gcagctcg 28
<210> 118
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 118
tcgaccggtt ttctagaacg gttgcgg 27
<210> 119
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 119
gccactcagc aagctttgca gcgt 24
<210> 120
<211> 35
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 120
tcacgctgca aagctctaat tcccgatcta gtaac 35
<210> 121
<211> 50
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 121
ggggacaagt ttgtacaaaa aagctgccac catgaagcaa agctttgtgg 50
<210> 122
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 122
tcgaccggtt ttctagaacg gttgcgg 27
<210> 123
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 123
tctagaaaac cggtcgaatc acaaacta 28
<210> 124
<211> 49
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 124
ggggaccact ttgtacaaga aagctgaacg tcaattccca tcttcagca 49
<210> 125
<211> 82
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 125
Pro Phe Thr Ala Thr Gln Trp Gln Glu Leu Glu His Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Met Ala Ser Gly Val Pro Ile Pro Ser Asp Leu Leu Leu
20 25 30
Pro Leu Arg Arg Ser Phe Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr
35 40 45
Asp Gly Lys Lys Trp Arg Cys Ser Lys Glu Ala Tyr Pro Asp Ser Lys
50 55 60
Tyr Cys Glu Lys His Met His Arg Gly Lys Asn Arg Ser Arg Lys Pro
65 70 75 80
Val Glu
<210> 126
<211> 82
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 126
Leu Phe Thr Ala Ser Gln Trp Arg Glu Leu Glu His Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Met Ala Ala Gly Ser Gln Val Pro His Glu Leu Val Leu
20 25 30
Pro Leu Arg His Arg Asp Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr
35 40 45
Asp Gly Lys Lys Trp Arg Cys Ser Arg Glu Ala Tyr Gly Glu Ser Lys
50 55 60
Tyr Cys Asp Arg His Met His Arg Gly Lys Asn Arg Ser Arg Lys Pro
65 70 75 80
Val Glu
<210> 127
<211> 82
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 127
Pro Phe Thr Ala Ala Gln Tyr Glu Glu Leu Glu His Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Leu Val Ala Gly Val Ser Val Pro Pro Asp Leu Val Leu
20 25 30
Pro Ile Arg Arg Gly Ile Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr
35 40 45
Asp Gly Lys Lys Trp Arg Cys Ala Lys Glu Ala Ala Ser Asp Ser Lys
50 55 60
Tyr Cys Glu Arg His Met His Arg Gly Arg Asn Arg Ser Arg Lys Pro
65 70 75 80
Val Glu
<210> 128
<211> 82
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 128
Thr Phe Thr Ala Ala Gln Tyr Glu Glu Leu Glu Gln Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Leu Val Ala Gly Val Pro Val Pro Pro Asp Leu Leu Leu
20 25 30
Pro Ile Arg Arg Gly Phe Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr
35 40 45
Asp Gly Lys Lys Trp Arg Cys Ser Lys Glu Ala Ala Gln Asp Ser Lys
50 55 60
Tyr Cys Glu Arg His Met His Arg Gly Arg Asn Arg Ser Arg Lys Pro
65 70 75 80
Val Glu
<210> 129
<211> 82
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 129
Val Phe Thr Pro Ala Gln Trp Ala Glu Leu Glu Gln Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Leu Met Ala Gly Val Pro Val Pro Pro Asp Leu Leu Leu
20 25 30
Pro Ile Arg Pro His Pro Asp Pro Glu Pro Trp Arg Cys Arg Arg Thr
35 40 45
Asp Gly Lys Lys Trp Arg Cys Ser Lys Glu Ala His Pro Asp Ser Lys
50 55 60
Tyr Cys Glu Arg His Met His Arg Gly Arg Asn Arg Ser Arg Lys Pro
65 70 75 80
Val Glu
<210> 130
<211> 82
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 130
Pro Phe Thr Pro Ser Gln Trp Met Glu Leu Glu His Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Leu Asn Ala Lys Ala Pro Ile Pro Ser Gly Leu Leu Ile
20 25 30
Ser Ile Ser Lys Ser Phe Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr
35 40 45
Asp Gly Lys Lys Trp Arg Cys Ser Lys Glu Ala Met Ala Glu His Lys
50 55 60
Tyr Cys Glu Arg His Ile Asn Arg Asn Arg His Arg Ser Arg Lys Pro
65 70 75 80
Val Glu
<210> 131
<211> 82
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 131
Pro Phe Thr Pro Ser Gln Trp Met Glu Leu Glu His Gln Ala Leu Ile
1 5 10 15
Tyr Lys Tyr Leu Ala Ala Asn Ile Ala Val Pro His Ser Leu Leu Val
20 25 30
Pro Ile Arg Arg Ser Val Asp Leu Glu Pro Gly Arg Cys Arg Arg Thr
35 40 45
Asp Gly Lys Lys Trp Arg Cys Ser Arg Asp Ala Val Ala Asp Gln Lys
50 55 60
Tyr Cys Glu Arg His Met Asn Arg Gly Arg His Arg Ser Arg Lys His
65 70 75 80
Val Glu
<210> 132
<211> 82
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 132
Ala Leu Thr Phe Met Gln Gln Gln Glu Leu Glu His Gln Val Leu Ile
1 5 10 15
Tyr Arg Tyr Phe Ala Ala Gly Ala Pro Val Pro Val His Leu Val Leu
20 25 30
Pro Ile Trp Lys Ser Val Glu Pro Glu Pro Gly Arg Cys Arg Arg Thr
35 40 45
Asp Gly Lys Lys Trp Arg Cys Ser Arg Asp Val Val Gln Gly His Lys
50 55 60
Tyr Cys Glu Arg His Val His Arg Gly Arg Gly Arg Ser Arg Lys Pro
65 70 75 80
Val Glu
<210> 133
<211> 205
<212> PRT
<213> Oryza sativa
<400> 133
Met Gln Gln Gln Met Ala Met Pro Ala Gly Ala Ala Ala Ala Ala Val
1 5 10 15
Pro Pro Ala Ala Gly Ile Thr Thr Glu Gln Ile Gln Lys Tyr Leu Asp
20 25 30
Glu Asn Lys Gln Leu Ile Leu Ala Ile Leu Glu Asn Gln Asn Leu Gly
35 40 45
Lys Leu Ala Glu Cys Ala Gln Tyr Gln Ala Gln Leu Gln Lys Asn Leu
50 55 60
Leu Tyr Leu Ala Ala Ile Ala Asp Ala Gln Pro Pro Gln Asn Pro Gly
65 70 75 80
Ser Arg Pro Gln Met Met Gln Pro Gly Ala Thr Pro Gly Ala Gly His
85 90 95
Tyr Met Ser Gln Val Pro Met Phe Pro Pro Arg Thr Pro Leu Thr Pro
100 105 110
Gln Gln Met Gln Glu Gln Gln Gln Gln Gln Leu Gln Gln Gln Gln Ala
115 120 125
Gln Ala Leu Ala Phe Pro Gly Gln Met Leu Met Arg Pro Gly Thr Val
130 135 140
Asn Gly Met Gln Ser Ile Pro Val Ala Asp Pro Ala Arg Ala Ala Asp
145 150 155 160
Leu Gln Thr Ala Ala Pro Gly Ser Val Asp Gly Arg Gly Asn Lys Gln
165 170 175
Asp Ala Thr Ser Glu Pro Ser Gly Thr Glu Ser His Lys Ser Ala Gly
180 185 190
Ala Asp Asn Asp Ala Gly Gly Asp Ile Ala Glu Lys Ser
195 200 205
<210> 134
<211> 185
<212> PRT
<213> Oryza sativa
<400> 134
Met Gln Gln Gln Pro Met Pro Met Pro Ala Gln Ala Pro Pro Thr Ala
1 5 10 15
Gly Ile Thr Thr Glu Gln Ile Gln Lys Tyr Leu Asp Glu Asn Lys Gln
20 25 30
Leu Ile Leu Ala Ile Leu Glu Asn Gln Asn Leu Gly Lys Leu Ala Glu
35 40 45
Cys Ala Gln Tyr Gln Ala Gln Leu Gln Lys Asn Leu Leu Tyr Leu Ala
50 55 60
Ala Ile Ala Asp Thr Gln Pro Gln Thr Thr Ile Ser Arg Pro Gln Met
65 70 75 80
Val Pro His Gly Ala Ser Pro Gly Leu Gly Gly Gln Tyr Met Ser Gln
85 90 95
Val Pro Met Phe Pro Pro Arg Thr Pro Leu Thr Pro Gln Gln Met Gln
100 105 110
Glu Gln Gln Leu Gln Gln Gln Gln Ala Gln Leu Leu Ser Phe Gly Gly
115 120 125
Gln Met Val Met Arg Pro Gly Val Val Asn Gly Ile Pro Gln Leu Leu
130 135 140
Gln Gly Glu Met His Arg Gly Ala Asp His Gln Asn Ala Gly Gly Ala
145 150 155 160
Thr Ser Glu Pro Ser Glu Ser His Arg Ser Thr Gly Thr Glu Asn Asp
165 170 175
Gly Gly Ser Asp Phe Gly Asp Gln Ser
180 185
<210> 135
<211> 227
<212> PRT
<213> Oryza sativa
<400> 135
Met Gln Gln Gln His Leu Met Gln Met Asn Gln Gly Met Met Gly Gly
1 5 10 15
Tyr Ala Ser Pro Thr Thr Val Thr Thr Asp Leu Ile Gln Gln Tyr Leu
20 25 30
Asp Glu Asn Lys Gln Leu Ile Leu Ala Ile Leu Asp Asn Gln Asn Asn
35 40 45
Gly Lys Val Glu Glu Cys Ala Arg Asn Gln Ala Lys Leu Gln His Asn
50 55 60
Leu Met Tyr Leu Ala Ala Ile Ala Asp Ser Gln Pro Pro Gln Thr Ala
65 70 75 80
Ala Met Ser Gln Tyr Pro Ser Asn Leu Met Met Gln Ser Gly Ala Arg
85 90 95
Tyr Met Pro Gln Gln Ser Ala Gln Met Met Ala Pro Gln Ser Leu Met
100 105 110
Ala Ala Arg Ser Ser Met Met Tyr Ala Gln Pro Ala Leu Ser Pro Leu
115 120 125
Gln Gln Gln Gln Gln Gln Gln Ala Ala Ala Ala His Gly Gln Leu Gly
130 135 140
Met Gly Ser Gly Gly Thr Thr Ser Gly Phe Ser Ile Leu His Gly Glu
145 150 155 160
Ala Ser Met Gly Gly Gly Gly Gly Gly Gly Gly Ala Gly Asn Ser Met
165 170 175
Met Asn Ala Gly Val Phe Ser Asp Phe Gly Arg Gly Gly Gly Gly Gly
180 185 190
Gly Lys Glu Gly Ser Thr Ser Leu Ser Val Asp Val Arg Gly Ala Asn
195 200 205
Ser Gly Ala Gln Ser Gly Asp Gly Glu Tyr Leu Lys Gly Thr Glu Glu
210 215 220
Glu Gly Ser
225
<210> 136
<211> 59
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 136
Ala Gly Ile Thr Thr Glu Gln Ile Gln Lys Tyr Leu Asp Glu Asn Lys
1 5 10 15
Gln Leu Ile Leu Ala Ile Leu Glu Asn Gln Asn Leu Gly Lys Leu Ala
20 25 30
Glu Cys Ala Gln Tyr Gln Ala Gln Leu Gln Lys Asn Leu Leu Tyr Leu
35 40 45
Ala Ala Ile Ala Asp Ala Gln Pro Pro Gln Asn
50 55
<210> 137
<211> 59
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 137
Thr Thr Val Thr Thr Asp Leu Ile Gln Gln Tyr Leu Asp Glu Asn Lys
1 5 10 15
Gln Leu Ile Leu Ala Ile Leu Asp Asn Gln Asn Asn Gly Lys Val Glu
20 25 30
Glu Cys Ala Arg Asn Gln Ala Lys Leu Gln His Asn Leu Met Tyr Leu
35 40 45
Ala Ala Ile Ala Asp Ser Gln Pro Pro Gln Thr
50 55
<210> 138
<211> 360
<212> PRT
<213> Capsicum annuum
<400> 138
Met Ser Gly Thr Ser Thr Ser Val Val Gly Gly Gly Gly Glu Gly Gly
1 5 10 15
Met Gly Tyr Gly Tyr Gly Tyr Arg Pro Pro Phe Thr Ala Val Gln Trp
20 25 30
Gln Glu Leu Glu His Gln Ala Met Ile Tyr Lys Tyr Leu Val Ala Gly
35 40 45
Leu Pro Val Pro Pro Asp Leu Val Val Pro Ile Arg Arg Ser Phe Glu
50 55 60
Ala Ile Ser Ala Arg Phe Phe His His Pro Ser Leu Gly Tyr Cys Ser
65 70 75 80
Tyr Tyr Gly Lys Lys Phe Asp Pro Glu Pro Gly Arg Cys Arg Arg Thr
85 90 95
Asp Gly Lys Lys Trp Arg Cys Ser Lys Asp Ala Tyr Pro Asp Ser Lys
100 105 110
Tyr Cys Glu Arg His Met His Arg Gly Arg Asn Arg Ser Arg Lys Pro
115 120 125
Val Glu Ser Gln Ser Thr Ser Gln Ser Leu Ser Thr Ser Met Ser Gln
130 135 140
Ile Thr Thr Gly Ser Ser Asn Thr Arg Gly Ser Phe Gln Thr Ser Gly
145 150 155 160
Ser Gly Ser Phe Gln Asn Met Pro Leu Tyr Ser Val Ala Asn Ser Gly
165 170 175
Ala Leu Asn Tyr Gly Ser Thr Ala Thr Lys Leu Gln Lys Glu Pro Val
180 185 190
Ser Cys Gly Ile Asp Asn Lys Asp Tyr Arg Tyr Leu His Gly Ile Ala
195 200 205
Pro Asp Ala Asp Glu His Asn Leu Ser Ser Glu Ala Ser Ala Thr Val
210 215 220
Arg Ser Phe Gly Met Gly Thr Asn Thr Asp Gly Thr Trp Val Phe Pro
225 230 235 240
Ser Gln Val Ser Ser Ser Pro Met Ala Arg Gln Lys Asn Asp Ser Gln
245 250 255
Met Leu Gly Ser Ser Thr Glu Met His Leu Pro Asn Leu Leu Glu Pro
260 265 270
Met Ile Asp Ser Thr Ile Ser Lys Arg Arg His Gln His Cys Phe Phe
275 280 285
Gly Ser Asp Ile Asp Ser Pro Gly Thr Val Lys Glu Glu Gln Gln His
290 295 300
Ser Met Arg Pro Phe Phe Asn Glu Trp Pro Thr Ala Lys Glu Ser Trp
305 310 315 320
Ser Asn Leu Asp Asp Glu Gly Ser Asn Lys Asn Asn Phe Ser Thr Thr
325 330 335
Gln Leu Ser Ile Ser Ile Pro Ile Ala Pro Ser Asn Phe Ser Ser Arg
340 345 350
Ser Ala Cys Ser Pro Asn Asp Ala
355 360
<210> 139
<211> 215
<212> PRT
<213> Capsicum annuum
<400> 139
Met Gln Gln His Leu Met Gln Met Gln Pro Met Met Ala Ala Tyr Tyr
1 5 10 15
Pro Thr Asn Val Thr Thr Asp His Ile Gln Gln Tyr Leu Asp Glu Asn
20 25 30
Lys Ser Leu Ile Leu Lys Ile Val Glu Ser Gln Asn Ser Gly Lys Leu
35 40 45
Gly Glu Cys Ala Glu Asn Gln Ala Arg Leu Gln Arg Asn Leu Met Tyr
50 55 60
Leu Ala Ala Ile Ala Asp Ser Gln Pro Gln Pro Ser Ser Met His Ser
65 70 75 80
Gln Phe Ser Ser Gly Gly Met Met Gln Pro Gly Thr His Asn Tyr Leu
85 90 95
Gln Gln Gln His Gln Gln Ala Gln Gln Ile Ala Thr Gln Ser Leu Ile
100 105 110
Ala Ala Arg Ser Ser Ser Met Leu Tyr Gly Gln Gln Gln Gln Gln Gln
115 120 125
Gln His Gln Leu Ser Pro Phe Gln Gln Gly Leu His Gly Ser Gln Leu
130 135 140
Gly Met Ser Ser Gly Ser Gly Gly Ser Thr Gly Phe His Met Leu Gln
145 150 155 160
Ser Glu Ser Ser Pro His Gly Gly Gly Gly Gly Ser Gly Gly Phe Pro
165 170 175
His Asp Phe Gly Arg Ala Asn Lys Gln Asp Ile Gly Ser Thr Met Ser
180 185 190
Ala Glu Gly Arg Gly Arg Ser Ser Gly Gly Asp Gly Gly Glu Asn Leu
195 200 205
Tyr Leu Lys Ala Ser Glu Asp
210 215
<210> 140
<211> 1740
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 140
atgagtggca cctcaacatc agtggtcggg ggtgggggtg aggggggaat ggggtatggg 60
tatggttacc ggccaccgtt tactgcagtg caatggcagg agctagaaca tcaagcaatg 120
atatataagt acttagtggc aggtcttcct gtgccacctg atcttgttgt tcctattcgt 180
cgtagctttg aagctatctc agctaggttc tttcatcatc ccagcttggg ctattgttcc 240
tattatggaa agaagtttga tcctgagcct ggaaggtgta gaaggactga tggaaagaag 300
tggagatgct cgaaagatgc atatcccgac tccaaatatt gcgagcgtca catgcatcga 360
ggccgcaacc gttcaagaaa gcctgtggaa tctcaatcta cttcccaatc cttgtcgact 420
agtatgtcac aaattacaac tgggagcagt aataccagag gaagtttcca aactagcggc 480
agcggaagct tccaaaacat gccattgtat tctgttgcta attcaggagc gctgaattat 540
ggaagcactg caacaaagtt gcagaaggag cctgtctcct gtggaataga taacaaggac 600
tataggtatc tccatggaat tgctccagat gctgatgagc ataatttatc ttcagaagct 660
tctgcaactg tcagaagttt tgggatgggg accaacacag acggtacctg ggtatttcct 720
tctcaagttt cttcaagccc catggcgaga caaaagaatg attcccagat gctgggtagc 780
tcaacagaga tgcatctacc taatctactt gagcctatga ttgactcaac aatttcaaag 840
cgacgacacc agcattgctt ctttggcagt gacattgatt cacctggaac agtaaaggag 900
gagcagcagc attcgatgcg ccctttcttt aacgaatggc ccactgctaa agaatcgtgg 960
tccaaccttg acgatgaggg atccaacaaa aacaatttct ccactacaca actatccata 1020
tccattccaa tcgctccttc caacttctct tcaaggagtg cttgctcccc aaatgatgct 1080
gcggccgctg ccatgcagca gcacctgatg cagatgcagc ccatgatggc agcttactat 1140
ccaacaaacg tcactactga ccatattcaa cagtatttgg atgagaacaa atcactcatt 1200
ctgaagattg ttgagagcca gaactccggg aaactcggcg aatgtgcaga gaaccaggct 1260
aggcttcaga ggaatctgat gtaccttgct gctatcgctg attcacaacc tcaaccttct 1320
agcatgcatt ctcagttctc ttctggtggc atgatgcagc cagggacaca caattacctg 1380
cagcagcagc accaacaagc ccaacaaata gcaacacaat cactcatagc tgcaagatcc 1440
tcttcaatgc tctatggaca acaacaacaa caacagcagc atcaattatc gccatttcaa 1500
caaggcttgc atggtagcca acttggcatg agctccggca gcggtggaag cactggtttt 1560
cacatgcttc aaagtgaatc atcacctcat ggtggtggtg gtggaagtgg tggtttccct 1620
catgacttcg gccgtgcaaa taagcaagac attgggagta ctatgtctgc tgaagggcgc 1680
ggcagaagct caggtggtga tggtggtgag aatctttatc ttaaagcttc tgaggattga 1740

Claims (25)

1.一种提高一个或多个转基因植物细胞再生效率的方法,所述方法包括:
a)向所述一个或多个植物细胞引入编码以下的核酸分子或包含以下的多肽,
i)生长调节因子(GRF)多肽和/或GRF相互作用因子(GIF)多肽;或
ii)GRF-GIF多肽嵌合体;和
b)所述一个或多个植物细胞相较于不包含所述引入的核酸分子或多肽的植物具有增加的再生效率,所述核酸分子编码GRF多肽和/或GIF多肽或GRF-GIF嵌合体或所述多肽包含GRF多肽和/或GIF多肽或GRF-GIF嵌合体:。
2.如权利要求1所述的方法,还包括从所述一个或多个植物细胞产生植物。
3.如权利要求1或2所述的方法,其中所述GRF多肽选自小麦GRFGRF1、GRF2GRF2、GRF3、GRF4、GRF5、GRF6或GRF 9多肽或者来自其他植物物种的同源物或其组合。
4.如权利要求1或2所述的方法,其中所述GIF多肽选自小麦GIF1、GIF2、GIF3多肽或者来自其他植物物种的同源物或其组合。
5.如权利要求1或2所述的方法,其中所述GIF多肽选自SEQ ID NO:24-30、43-44、52、133-135、139或者在SNH结构域中与其具有至少70%相同性的多肽序列或其组合。
6.如权利要求1或2所述的方法,其在所述GRF多肽选自SEQ ID NO:9-23、37-39、51、125-132或138或者在QLQ和WRC结构域中与其具有至少70%相同性的多肽序列或其组合。
7.如权利要求1-6中任一项所述的方法,其中所述GRF多肽包含选自SEQ ID NO:69-83的QLQ和选自SEQ ID NO:84-95的WRC结构域或与其具有至少70%相同性的序列。
8.如权利要求1-7中任一项所述的方法,其中所述GIF多肽包含选自SEQ ID NO:97-103,136,137的SNH结构域或与其具有至少70%相同性的序列。
9.如权利要求1-8中任一项所述的方法,其中所述GRF多肽在miR396靶位点中包含一个或多个突变且降低所述植物中所述GRF多肽受miR396的阻遏。
10.如权利要求9所述的方法,其中所述突变包含所述miR396靶位点的沉默突变。
11.如权利要求9所述的方法,其中所述miRNA靶位点包含SEQ ID NO:53。
12.如权利要求1-11中任一项所述的方法,还包括可操作连接所述GRF、GIF或GRF-GIF嵌合体的诱导型核酸分子。
13.如权利要求1-12中任一项所述的方法,其中在包含细胞分裂素的培养基上再生所述一个或多个植物细胞,所述细胞分裂素的浓度不足以使不具有引入的所述GRF、GIF或GRF-GIF嵌合体的植物细胞再生。
14.如权利要求13所述的方法,其中通过其在所述培养基上生长的能力选择所述一个或多个包含所述异源核酸分子或多肽的转化的植物细胞。
15.如权利要求1-14中任一项所述的方法,其中相较于不包含所述引入GRF、GIF或GRF-GIF嵌合体的植物细胞,所述植物的再生效率增加至少10%。
16.如权利要求1-15中任一项所述的方法,其中所述植物选自小麦属、稻属、葡萄属、柑橘属、牛油果、胡桃、开心果、桃子、苹果、樱桃、草莓、蓝莓、覆盆子、豆、西兰花、花椰菜、豇豆、韭葱、甜瓜、洋葱、胡椒、菠菜、南瓜或西瓜。
17.如权利要求1-16中任一项所述的方法,其中所述植物包括优良植物。
18.如权利要求1-17中任一项所述的方法,其中所述一个或多个植物细胞包括一个或多个叶外植体细胞或来自任何其他组织的细胞。
19.如权利要求1-18中任一项所述的方法,其中所述GRF、GIF或GRF与GIF嵌合体的所述核酸分子或多肽与CRISPR-CAS9或任意其他基因编辑系统组合。
20.如权利要求19所述的方法,其中将GRF、GIF或GRF和GIF嵌合体与基因编辑组合的所述构建体通过在编辑的植物后代中的隔离来去除。
21.如权利要求1-20中任一项所述的方法,其中编码GRF、GIF或GRF-GIF嵌合体多肽的所述核酸分子或者包含GRF、GIF或GRF-GIF嵌合体多肽的所述多肽在引入所述异源核酸分子或多肽前引入所述植物。
22.一种提高植物再生效率的方法,其中所述植物含有miR396,所述方法包括
a)向所述植物引入编码以下的核酸分子或包含以下的多肽,
i)生长调节因子(GRF)多肽和/或GRF相互作用因子(GIF)多肽;或
ii)GRF-GIF多肽嵌合体;和
b)所述GRF多肽包含具有一个或多个突变的miR396靶位点,且降低所述植物中所述GRF多肽受所述miRNA的阻遏。
23.如权利要求1-22中任一项所述的方法,其中所述植物是转基因植物。
24.一种通过权利要求1-23中任一项所述的方法产生的植物。
25.如前述权利要求中任一项所述的方法,其中所述GRF、GIF或GRF-GIF嵌合体用于加速再生植物所需的时间。
CN202080062752.4A 2019-07-11 2020-07-08 用生长调节因子(grf)、grf相互作用因子(gif)或嵌合grf-gif改进植物再生的方法 Pending CN114667292A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201962873123P 2019-07-11 2019-07-11
US62/873,123 2019-07-11
PCT/US2020/041135 WO2021007284A2 (en) 2019-07-11 2020-07-08 Methods for improved regeneration of transgenic plants using growth-regulating factor (grf), grf-interacting factor (gif), or chimeric grf-gif genes and proteins

Publications (1)

Publication Number Publication Date
CN114667292A true CN114667292A (zh) 2022-06-24

Family

ID=74114249

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202080062752.4A Pending CN114667292A (zh) 2019-07-11 2020-07-08 用生长调节因子(grf)、grf相互作用因子(gif)或嵌合grf-gif改进植物再生的方法

Country Status (6)

Country Link
US (1) US20230032478A1 (zh)
EP (1) EP3997111A4 (zh)
CN (1) CN114667292A (zh)
AR (1) AR122277A1 (zh)
AU (2) AU2020310877B2 (zh)
WO (1) WO2021007284A2 (zh)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116286863A (zh) * 2023-02-07 2023-06-23 深圳市兰科植物保护研究中心 多核苷酸在促进兰科植物芽体生长中的应用
CN116584386A (zh) * 2023-05-24 2023-08-15 北京林业大学 一种用于杨梅的组培培养基及杨梅种子萌发方法和杨梅组培快繁方法
CN118166001A (zh) * 2024-05-15 2024-06-11 四川省农业科学院园艺研究所 一种提升番茄再生芽率的复合基因、植物过表达载体、转基因工程菌和转化方法

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20240018535A1 (en) * 2020-03-19 2024-01-18 Institute Of Genetics And Developmental Biology, Chinese Academy Of Sciences Method for improving plant genetic transformation and gene editing efficiency
CN114317573B (zh) * 2022-01-17 2024-02-09 南京农业大学 一种提高森林草莓遗传转化效率的嵌合基因及其载体和应用
CN114672513B (zh) * 2022-04-12 2024-04-02 北京大学现代农业研究院 一种基因编辑系统及其应用
WO2024030442A1 (en) * 2022-08-01 2024-02-08 Purdue Research Foundation Efficient genotype-independent in planta transformation of cereals

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150033413A1 (en) * 2012-01-04 2015-01-29 Universidad Nacional De Rosario GRF3 Mutants, Methods and Plants
US20170362601A1 (en) * 2014-12-17 2017-12-21 Carla SCHOMMER Chimeric proteins which enhance the activity of dna binding domains (dbd) and transcription factors in plants
EP3508581A1 (en) * 2018-01-03 2019-07-10 Kws Saat Se Regeneration of genetically modified plants
CN114269933A (zh) * 2019-06-28 2022-04-01 科沃施种子欧洲股份两合公司 通过使用grf1加强基因实现的增强的植物再生及转化

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NL8200523A (nl) 1982-02-11 1983-09-01 Univ Leiden Werkwijze voor het in vitro transformeren van planteprotoplasten met plasmide-dna.
EP0160692A1 (en) 1983-11-03 1985-11-13 DE WET, Johannes Martenis Jacob Method for the transfer of exogenous genes in plants using pollen as a vector
US5268463A (en) 1986-11-11 1993-12-07 Jefferson Richard A Plant promoter α-glucuronidase gene construct
US5015580A (en) 1987-07-29 1991-05-14 Agracetus Particle-mediated transformation of soybean plants and lines
EP0275069A3 (en) 1987-01-13 1990-04-25 DNA PLANT TECHNOLOGY CORPORATION (under the laws of the state of Delaware) Pollen-mediated gene transformation in plants
US5322938A (en) 1987-01-13 1994-06-21 Monsanto Company DNA sequence for enhancing the efficiency of transcription
ATE112314T1 (de) 1988-05-17 1994-10-15 Lubrizol Genetics Inc Pflanzliches ubiquitinpromotorsystem.
HU218717B (hu) 1989-03-17 2000-11-28 E. I. Du Pont De Nemours And Co. Nukleinsav-termelést fokozó növényi eredetű génfragmentek és eljárás előállításukra
US5550318A (en) 1990-04-17 1996-08-27 Dekalb Genetics Corporation Methods and compositions for the production of stably transformed, fertile monocot plants and cells thereof
US5658772A (en) 1989-12-22 1997-08-19 E. I. Du Pont De Nemours And Company Site-specific recombination of DNA in plant cells
EP1983056A1 (en) 1992-07-07 2008-10-22 Japan Tobacco Inc. Method for transforming monocotyledons
US5332808A (en) 1992-09-08 1994-07-26 North Carolina State University DNA encoding a ribosome inactivating protein
US5789156A (en) 1993-06-14 1998-08-04 Basf Ag Tetracycline-regulated transcriptional inhibitors
US5814618A (en) 1993-06-14 1998-09-29 Basf Aktiengesellschaft Methods for regulating gene expression
US6072050A (en) 1996-06-11 2000-06-06 Pioneer Hi-Bred International, Inc. Synthetic promoters
US6720475B1 (en) 1997-11-18 2004-04-13 Pioneer Hi-Bred International, Inc. Modified nucleic acid sequence encoding FLP recombinase
CA2336207C (en) 1998-09-10 2004-03-23 Pioneer Hi-Bred International, Inc. Ecdysone receptors and methods for their use
US6420630B1 (en) 1998-12-01 2002-07-16 Stine Biotechnology Methods for tissue culturing and transforming elite inbreds of Zea mays L.
DE10224889A1 (de) 2002-06-04 2003-12-18 Metanomics Gmbh & Co Kgaa Verfahren zur stabilen Expression von Nukleinsäuren in transgenen Pflanzen
US20080148432A1 (en) * 2005-12-21 2008-06-19 Mark Scott Abad Transgenic plants with enhanced agronomic traits
UA103176C2 (ru) * 2007-09-21 2013-09-25 Басф Плант Саенс Гмбх Растение с повышенными показателями урожайности и способ его получения
PL2510096T5 (pl) 2009-12-10 2018-06-29 Regents Of The University Of Minnesota Modyfikacja DNA zależna od efektora TAL
US20240018535A1 (en) * 2020-03-19 2024-01-18 Institute Of Genetics And Developmental Biology, Chinese Academy Of Sciences Method for improving plant genetic transformation and gene editing efficiency

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150033413A1 (en) * 2012-01-04 2015-01-29 Universidad Nacional De Rosario GRF3 Mutants, Methods and Plants
US20170362601A1 (en) * 2014-12-17 2017-12-21 Carla SCHOMMER Chimeric proteins which enhance the activity of dna binding domains (dbd) and transcription factors in plants
EP3508581A1 (en) * 2018-01-03 2019-07-10 Kws Saat Se Regeneration of genetically modified plants
CN114269933A (zh) * 2019-06-28 2022-04-01 科沃施种子欧洲股份两合公司 通过使用grf1加强基因实现的增强的植物再生及转化

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
JUAN M. DEBERNARDI等: "Post-transcriptional control of GRF transcription factors by microRNA miR396 and GIF co-activator affects leaf size and longevity", THE PLANT JOURNAL, vol. 79, no. 3, 2 July 2014 (2014-07-02), pages 414 - 416 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116286863A (zh) * 2023-02-07 2023-06-23 深圳市兰科植物保护研究中心 多核苷酸在促进兰科植物芽体生长中的应用
CN116286863B (zh) * 2023-02-07 2023-11-28 深圳市兰科植物保护研究中心 多核苷酸在促进兰科植物芽体生长中的应用
CN116584386A (zh) * 2023-05-24 2023-08-15 北京林业大学 一种用于杨梅的组培培养基及杨梅种子萌发方法和杨梅组培快繁方法
CN116584386B (zh) * 2023-05-24 2024-03-19 北京林业大学 一种用于杨梅的组培培养基及杨梅种子萌发方法和杨梅组培快繁方法
CN118166001A (zh) * 2024-05-15 2024-06-11 四川省农业科学院园艺研究所 一种提升番茄再生芽率的复合基因、植物过表达载体、转基因工程菌和转化方法

Also Published As

Publication number Publication date
US20230032478A1 (en) 2023-02-02
EP3997111A2 (en) 2022-05-18
WO2021007284A2 (en) 2021-01-14
AU2020310877B2 (en) 2024-05-23
AU2024205750A1 (en) 2024-09-12
WO2021007284A3 (en) 2021-02-18
AR122277A1 (es) 2022-08-31
EP3997111A4 (en) 2023-07-26
AU2020310877A1 (en) 2022-02-24

Similar Documents

Publication Publication Date Title
CN114667292A (zh) 用生长调节因子(grf)、grf相互作用因子(gif)或嵌合grf-gif改进植物再生的方法
AU2016201566B2 (en) Methods and compositions for the introduction and regulated expression of genes in plants
CN108368517B (zh) 用于快速植物转化的方法和组合物
CA2449254C (en) Alteration of oil traits in plants
US20030131386A1 (en) Stress-induced polynucleotides
AU2007201884A1 (en) Regulatory element from a sugarcane proline rich protein and uses thereof
JP2000507446A (ja) シングルステップ切出し手段
AU2016350610A1 (en) Methods and compositions of improved plant transformation
CN108012523A (zh) 具有增加的种子大小的植物
CN112779282A (zh) 一种通过基因编辑技术从番茄背景材料中创制多种果色材料的方法
CN113832179B (zh) ZmELF3.1蛋白及其功能缺失突变体在调控作物雄穗分支数中的应用
US7638681B2 (en) Auxin transport proteins
CN110857316B (zh) 花青素合成相关蛋白及其在调控植物花青素含量中的应用
CN110818784B (zh) 水稻基因OsATL15在调节农药的吸收转运中的应用
CA2617876A1 (en) Nitrate transport components
CA2404471C (en) A construct capable of release in closed circular form from a larger nucleotide sequence permitting site specific expression and/or developmentally regulated expression of selected genetic sequences
US6713666B2 (en) Invertase inhibitors and methods of use
CN110172477B (zh) 一种优化的获得无选择标记转基因植物的化学诱导删除表达载体及其应用
CN114805508A (zh) 水稻抽穗期基因dhd3功能以及应用
WO1999054490A2 (en) Plant-derived resistance gene
US6887988B2 (en) Plant reproduction polynucleotides and methods of use
AU780662B2 (en) Orthologues of bacterial RuvB:cDNAs and uses thereof
CN115703822A (zh) BaZFP924蛋白及其相关生物材料与应用
AU2008212072B2 (en) Regulatory element from a sugarcane proline rich protein and uses thereof
CN116635529A (zh) 双单倍体诱导物

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination