CN111527101A - Photosynthetic organism gene regulation for improved growth - Google Patents

Photosynthetic organism gene regulation for improved growth

Publication number
CN111527101A
Authority
CN
China
Prior art keywords
seq
mutant
gene
photosynthetic organism
gly
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201880084364.9A
Inventor
I·阿加维
F·I·库兹米诺夫
R·R·拉达科维茨
J·H·维卢托
S·波茨
R·斯普雷菲科
W·F·兰伯特
J·N·格雷纳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Viridos Inc
Original Assignee
Synthetic Genomics Inc
Application filed by Synthetic Genomics Inc
Publication of CN111527101A

Classifications

    • C12N15/8222 Developmentally regulated expression systems, tissue, organ specific, temporal or spatial regulation
    • C12N15/62 DNA sequences coding for fusion proteins
    • A61K36/02 Algae
    • C07K14/405 Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from algae
    • C07K14/415 Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
    • C12N1/12 Unicellular algae; Culture media therefor
    • C12N15/8213 Targeted insertion of genes into the plant genome by homologous recombination
    • C12N15/825 Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving pigment biosynthesis
    • C12N15/8269 Photosynthesis
    • C12N15/102 Mutagenizing nucleic acids
    • C12N15/8201 Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
    • C12N15/8243 Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine

Abstract

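Mutant photosynthetic organisms having reduced chlorophyll and increased photosynthetic efficiency are provided. The disclosed mutant strains have mutated or attenuated: a chloroplastic SRP54 gene and an SGI1 gene; a chloroplastic SRP54 gene and an SGI2 gene; or a chloroplastic SRP54 gene, an SGI1 gene, and an SGI2 gene. The mutant photosynthetic organisms exhibit increased productivity relative to wild-type strains. Also provided are mutant photosynthetic organisms having a mutated or attenuated cytosolic SRP54 gene. Provided herein are methods of producing biomass and other products, such as lipids, using strains having mutations in an SRP54 gene, an SGI1 gene, an SGI2 gene, an SGI1/SRP54 combination, or a combination of an SGI2 gene and an SRP54 gene. Also included are constructs and methods for attenuating or disrupting SRP54, SGI1, and SGI2 genes.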

Description

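Photosynthetic organism gene regulation for improved growth
Cross-reference to related applications
This application claims the benefit of priority under 35 U.S.C. §119(e) to U.S. Serial No. 62/612,251, filed December 29, 2017, and U.S. Serial No. 62/690,205, filed June 26, 2018, the entire contents of which are incorporated herein by reference.
Incorporation of sequence listing
The material in the accompanying sequence listing is hereby incorporated by reference into this application. The accompanying sequence listing text file, named SGI2140_2WO_Sequence_Listing.txt, was created on December 18, 2018, and is 419 kb in size. The file can be accessed using Microsoft Word on a computer that uses Windows OS.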
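Background
Improving the biomass productivity of photosynthetic organisms is relevant to a variety of commercial applications, from biofuels to high-value products. Genetic manipulation to increase the total protein content of biomass is highly desirable, but strategies for doing so are not evident in the art.
Engineering photosynthetic organisms to increase photosynthetic efficiency, and thereby obtain higher productivity, has been a long-standing goal of plant and algal biologists. US 2014/0220638 and US 2016/0304896 (both incorporated herein by reference) describe mutant screens for obtaining algal mutants with reduced chlorophyll that are impaired in their ability to acclimate to low light; that is, they retain the low-chlorophyll state of high-light-acclimated cells even under low light. US 2014/0220638 describes algal mutants having mutations in the Light Acclimation Regulator genes LAR1, LAR2, and LAR3, and US 2016/0304896 discloses algal mutants having mutations in the chloroplastic SRP54 gene.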
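Summary of the invention
Disclosed herein are photosynthetic organisms comprising modulated genes that confer increased photosynthetic efficiency and productivity, their use in producing products under photoautotrophic conditions, methods of producing such photosynthetic organisms, and nucleic acid molecules and constructs for modulating such genes.
In one aspect, provided is a mutant photosynthetic organism comprising a mutated or attenuated Significant Growth Improvement 2 (SGI2) gene.
In one aspect, provided is a mutant photosynthetic organism comprising a mutated or attenuated gene encoding chloroplastic signal recognition protein 54 (cpSRP54) and a mutated or attenuated Significant Growth Improvement 2 (SGI2) gene.
In one aspect, provided is a mutant photosynthetic organism comprising a mutated or attenuated gene encoding chloroplastic signal recognition protein 54 (cpSRP54) and a mutated or attenuated Significant Growth Improvement 1 (SGI1) gene.
In one aspect, provided is a mutant photosynthetic organism comprising a mutated or attenuated gene encoding chloroplastic signal recognition protein 54 (cpSRP54), a mutated or attenuated Significant Growth Improvement 1 (SGI1) gene, and a mutated or attenuated Significant Growth Improvement 2 (SGI2) gene.
In one aspect, provided is a mutant photosynthetic organism comprising a mutated or attenuated gene encoding cytosolic signal recognition protein 54 (cytoSRP54) and a mutated or attenuated Significant Growth Improvement 2 (SGI2) gene.
In one aspect, provided is a mutant photosynthetic organism comprising a mutated or attenuated gene encoding cytosolic signal recognition protein 54 (cytoSRP54) and a mutated or attenuated Significant Growth Improvement 1 (SGI1) gene.
In one aspect, provided is a mutant photosynthetic organism comprising a mutated or attenuated gene encoding cytosolic signal recognition protein 54 (cytoSRP54), a mutated or attenuated Significant Growth Improvement 1 (SGI1) gene, and a mutated or attenuated Significant Growth Improvement 2 (SGI2) gene.
In one aspect, provided is a biomass comprising a mutant photosynthetic organism, wherein the mutant photosynthetic organism comprises a mutated or attenuated gene encoding chloroplastic signal recognition protein 54 (cpSRP54) and a mutated or attenuated Significant Growth Improvement 1 (SGI1) gene and/or a mutated or attenuated Significant Growth Improvement 2 (SGI2) gene.
In one aspect, provided is a method of producing a bioproduct. The method comprises culturing a mutant photosynthetic organism, wherein the mutant photosynthetic organism comprises a mutated or attenuated gene encoding chloroplastic signal recognition protein 54 (cpSRP54) and a mutated or attenuated Significant Growth Improvement 1 (SGI1) gene and/or a mutated or attenuated Significant Growth Improvement 2 (SGI2) gene; and isolating at least one product from the culture.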
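In one aspect, provided is a method of inserting a single copy of a CRISPR gene into a selected locus of a microorganism. In some embodiments, the CRISPR gene is codon-optimized for expression in the microorganism. In some embodiments, the inserted CRISPR gene comprises a plurality of heterologous introns. In some embodiments, the number of heterologous introns can be at least 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, or more. Non-limiting examples of the CRISPR gene include Cas9 and Cpf1. In some embodiments, the CRISPR gene can be operably linked to a native promoter of the microorganism. In some embodiments, the promoter is inducible. In some embodiments, the CRISPR gene can be operably linked to a promoter that is heterologous to the microorganism.
In some embodiments, the bioproduct is a lipid, a protein, a peptide, one or more amino acids, one or more nucleotides, a vitamin, a cofactor, a hormone, an antioxidant, or a pigment or colorant. In some embodiments, the bioproduct is biomass. In some embodiments, the mutant photosynthetic organism is an alga, and the biomass is algal biomass.
In some embodiments, the mutant photosynthetic organism is engineered to include at least one exogenous gene encoding a polypeptide that participates in the production of the lipid. In some embodiments, the mutant photosynthetic organism is cultured phototrophically. In some embodiments, the mutant photosynthetic organism is an alga, and the alga is cultured in a pond or raceway.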
In one aspect, provided is a nucleic acid molecule construct for homologous recombination, the construct comprising a nucleotide sequence from or adjacent to a naturally occurring photosynthetic organism gene encoding an SGI2 protein, wherein, prior to mutation or attenuation of the gene, the SGI2 protein comprises an amino acid sequence having at least 55% identity to an amino acid sequence selected from the group consisting of SEQ ID NO:5 and SEQ ID NOs:40-56.
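The "at least 55% identity" threshold recurs throughout these aspects. The following is a minimal sketch of how such a threshold could be checked, assuming the two sequences have already been aligned (equal length, with '-' marking gaps) and assuming identity is counted as matching positions over alignment columns. That counting convention is one common choice, not necessarily the one intended in this document, and the function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Assumption: columns where both sequences carry a gap are ignored, and the
    denominator is the number of remaining alignment columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no alignable columns")
    # A column counts as a match only if both residues are equal and not gaps.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 55.0) -> bool:
    """True if the pre-aligned pair reaches the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

In practice the alignment itself would come from a dedicated alignment tool; only the final counting step is sketched here.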
In one aspect, provided is a plurality of nucleic acid molecule constructs for homologous recombination, the constructs comprising nucleotide sequences from or adjacent to a naturally occurring photosynthetic organism gene encoding a cpSRP54 protein and a photosynthetic organism gene encoding an SGI1 protein, wherein, prior to mutation or attenuation of the gene, the cpSRP54 protein comprises an amino acid sequence having at least 55% identity to SEQ ID NO:68 or any of SEQ ID NOs:75-85, and wherein, prior to mutation or attenuation of the SGI1 gene, the SGI1 gene encodes a polypeptide whose amino acid sequence comprises an amino acid sequence having at least 55% identity to an amino acid sequence selected from the group consisting of SEQ ID NO:3, SEQ ID NOs:9-31, and SEQ ID NOs:34-39.
In one aspect, provided is a plurality of nucleic acid molecule constructs for homologous recombination, the constructs comprising nucleotide sequences from or adjacent to a naturally occurring photosynthetic organism gene encoding a cpSRP54 protein and a photosynthetic organism gene encoding an SGI2 protein, wherein, prior to mutation or attenuation of the gene, the cpSRP54 protein comprises an amino acid sequence having at least 55% identity to SEQ ID NO:68 or any of SEQ ID NOs:75-85, and wherein, prior to mutation or attenuation of the gene, the SGI2 protein comprises an amino acid sequence having at least 55% identity to SEQ ID NO:5 or any of SEQ ID NOs:40-56.
In one aspect, provided is a nucleic acid molecule construct for expressing an antisense RNA, shRNA, microRNA, or ribozyme, the construct comprising a nucleotide sequence complementary to at least a portion of a naturally occurring photosynthetic organism gene encoding an SGI2 protein, wherein, prior to mutation or attenuation of the gene, the SGI2 protein comprises an amino acid sequence having at least 55% identity to SEQ ID NO:5 or any of SEQ ID NOs:40-56.
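An antisense construct carries a sequence complementary to part of the target transcript. As a minimal illustration, assuming the input is the coding (sense) strand of a gene written 5' to 3', the antisense RNA is its reverse complement with uracil in place of thymine. The helper names and the six-base input are hypothetical and are not taken from the sequence listing.

```python
# Translation table for Watson-Crick complements (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Reverse complement of a DNA string written 5'->3'."""
    return dna.translate(_COMPLEMENT)[::-1]


def antisense_rna(coding_strand: str) -> str:
    """Antisense RNA (5'->3') that can base-pair with the mRNA of coding_strand."""
    return reverse_complement(coding_strand).upper().replace("T", "U")


print(antisense_rna("ATGGCC"))  # prints GGCCAU
```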
In one aspect, provided is a plurality of nucleic acid molecule constructs for expressing an antisense RNA, shRNA, microRNA, or ribozyme, the constructs comprising nucleotide sequences complementary to at least a portion of a naturally occurring photosynthetic organism gene encoding a cpSRP54 protein and of a photosynthetic organism gene encoding an SGI1 protein, wherein, prior to mutation or attenuation of the gene, the cpSRP54 protein comprises an amino acid sequence having at least 55% identity to SEQ ID NO:68 or any of SEQ ID NOs:75-85, and wherein, prior to mutation or attenuation of the SGI1 gene, the SGI1 protein comprises an amino acid sequence having at least 55% identity to SEQ ID NO:3 or any of SEQ ID NOs:9-31 and 34-39.
In some embodiments, the construct comprises at least a portion of the 5' UTR of cpSRP54, SGI1, SGI2, or a combination of two or more of the genes; at least a portion of the promoter region of cpSRP54, SGI1, SGI2, or a combination of two or more of the genes; and/or at least a portion of the 3' UTR of cpSRP54, SGI1, SGI2, or a combination of two or more of the genes. In some examples, the construct can be an RNAi, ribozyme, or antisense construct and can include a sequence from the transcribed region of cpSRP54, SGI1, SGI2, or a combination of two or more of the genes, in either sense or antisense orientation. In further examples, a construct can be designed for the in vitro or in vivo expression of a guide RNA designed to target cpSRP54, SGI1, SGI2, or a combination of two or more of the genes, and can include a sequence homologous to a portion of any of the genes, including, for example, an intron, the 5' UTR, the promoter region, and/or the 3' UTR of the gene. In yet further examples, a construct for attenuating expression of a gene encoding a cpSRP54, SGI1, or SGI2 polypeptide can be a guide RNA or antisense oligonucleotide, where the sequence is homologous to a transcribed region of cpSRP54, SGI1, SGI2, or a combination of two or more of the genes in antisense orientation.
In one aspect, provided is a plurality of nucleic acid molecule constructs for expressing an antisense RNA, shRNA, microRNA, or ribozyme, the constructs comprising nucleotide sequences complementary to at least a portion of a naturally occurring photosynthetic organism gene encoding a cpSRP54 protein and of a photosynthetic organism gene encoding an SGI2 protein, wherein, prior to mutation or attenuation of the gene, the cpSRP54 protein comprises an amino acid sequence having at least 55% identity to SEQ ID NO:68 or any of SEQ ID NOs:75-85, and wherein, prior to mutation or attenuation of the gene, the SGI2 protein comprises an amino acid sequence having at least 55% identity to SEQ ID NO:5 or any of SEQ ID NOs:40-56.
In one aspect, provided is a plurality of nucleic acid molecules encoding a guide RNA, wherein the guide RNA comprises at least a portion of a naturally occurring photosynthetic organism SGI2 gene, wherein, prior to mutation or attenuation of the gene, the SGI2 gene encodes a protein comprising an amino acid sequence having at least 55% identity to SEQ ID NO:5 or any of SEQ ID NOs:40-56.
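Candidate guide RNA targets are usually chosen next to a protospacer-adjacent motif (PAM). The sketch below assumes a Cas9 nuclease that recognizes an NGG PAM immediately 3' of a 20-nucleotide protospacer, which is the commonly cited requirement for Streptococcus pyogenes Cas9; this document does not specify the nuclease variant, PAM, or spacer length, so all three are assumptions. Only the forward strand is scanned, and the function name and test sequence are hypothetical.

```python
import re


def find_cas9_targets(dna: str, spacer_len: int = 20):
    """List (start, protospacer, pam) for forward-strand sites followed by NGG.

    A zero-width lookahead is used so that overlapping candidate sites are
    all reported. `start` is the 0-based index of the protospacer.
    """
    dna = dna.upper()
    pattern = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % spacer_len)
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(dna)]


# Toy sequence: 20 A's followed by the PAM "TGG" and two extra bases.
for start, spacer, pam in find_cas9_targets("A" * 20 + "TGG" + "CC"):
    print(start, spacer, pam)
```

A real design workflow would also scan the reverse strand and screen candidates for off-target matches; both steps are omitted here.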
In one aspect, provided is a plurality of nucleic acid molecules encoding at least two guide RNAs, wherein the guide RNAs comprise at least a portion of a naturally occurring photosynthetic organism gene encoding cpSRP54 and of a photosynthetic organism gene encoding SGI1, wherein, prior to mutation or attenuation of the gene, the cpSRP54 gene encodes a protein comprising an amino acid sequence having at least 55% identity to SEQ ID NO:68 or any of SEQ ID NOs:75-85, and wherein, prior to mutation or attenuation of the SGI1 gene, the SGI1 gene encodes a protein comprising an amino acid sequence having at least 55% identity to SEQ ID NO:3 or any of SEQ ID NOs:9-31 and 34-39.
In one aspect, provided is a plurality of nucleic acid molecules encoding at least two guide RNAs, wherein the guide RNAs comprise at least a portion of a naturally occurring photosynthetic organism cpSRP54 gene and of a photosynthetic organism SGI2 gene, wherein, prior to mutation or attenuation of the gene, the cpSRP54 gene encodes a protein comprising an amino acid sequence having at least 55% identity to SEQ ID NO:68 or any of SEQ ID NOs:75-85, and wherein, prior to mutation or attenuation of the gene, the SGI2 gene encodes a protein comprising an amino acid sequence having at least 55% identity to SEQ ID NO:5 or any of SEQ ID NOs:40-56.
In one aspect, provided is a method of increasing the biomass of a photosynthetic organism, the method comprising modulating an SGI2 gene.
In one aspect, provided is a method of increasing the biomass of a photosynthetic organism, the method comprising modulating chloroplastic signal recognition protein 54 (cpSRP54) and Significant Growth Improvement gene 1 (SGI1), wherein, prior to mutation or attenuation of the gene, the cpSRP54 gene encodes a protein comprising an amino acid sequence having at least 55% identity to SEQ ID NO:68 or any of SEQ ID NOs:75-85, and wherein, prior to mutation or attenuation of the SGI1 gene, the SGI1 gene encodes a protein comprising an amino acid sequence having at least 55% identity to SEQ ID NO:3 or any of SEQ ID NOs:9-31 and 34-39.
In one aspect, provided is a method of increasing the biomass of a photosynthetic organism, the method comprising modulating the chloroplastic signal recognition protein 54 gene (cpSRP54) and Significant Growth Improvement gene 2 (SGI2), wherein, prior to mutation or attenuation of the gene, the cpSRP54 gene encodes a protein comprising an amino acid sequence having at least 55% identity to SEQ ID NO:68 or any of SEQ ID NOs:75-85, and wherein, prior to mutation or attenuation of the gene, the SGI2 gene encodes a protein comprising an amino acid sequence having at least 55% identity to SEQ ID NO:5 or any of SEQ ID NOs:40-56.
In one aspect, provided is a method of increasing the biomass of a photosynthetic organism, the method comprising modulating cytosolic signal recognition protein 54 (cytoSRP54) and Significant Growth Improvement gene 2 (SGI2), wherein, prior to mutation or attenuation of the gene, the SGI2 gene encodes a protein comprising an amino acid sequence having at least 55% identity to SEQ ID NO:5 or any of SEQ ID NOs:40-56.
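In some embodiments, a culture of the mutant photosynthetic organism exhibits greater biomass productivity than a culture of a control photosynthetic organism of the same species. In some embodiments, the mutant photosynthetic organism exhibits greater biomass productivity in photoautotrophic culture. In some embodiments, the mutant photosynthetic organism exhibits greater biomass productivity than a culture of a control photosynthetic organism of the same species under continuous light conditions. In some embodiments, the mutant photosynthetic organism exhibits greater biomass productivity than a culture of a control photosynthetic organism of the same species under diel cycle conditions. In some embodiments, the mutant photosynthetic organism exhibits greater biomass productivity than a culture of a control photosynthetic organism of the same species under diel cycle conditions in which the light profile mimics a natural daylight profile.
In some embodiments, increasing the biomass of the photosynthetic organism comprises an increase in total organic carbon. In some embodiments, increasing the biomass of the photosynthetic organism comprises an increase in total lipid content. In some embodiments, increasing the biomass of the photosynthetic organism comprises an increase in total nitrogen content.
In some embodiments, relative to a control photosynthetic organism of the same species, the mutant photosynthetic organism exhibits a reduction in chlorophyll under low-light conditions and a higher maximum quantum yield of photochemistry in photosystem II (Fv/Fm) at all physiologically relevant irradiances above 100, 125, 150, 200, or 250 μE m⁻² s⁻¹. In some embodiments, the reduction in chlorophyll is a reduction of at least 20%, 30%, 40%, 50%, 60%, or 70% relative to a control photosynthetic organism of the same species. In some embodiments, relative to a control photosynthetic organism of the same species, the mutant photosynthetic organism exhibits lower non-photochemical quenching (NPQ) at all physiologically relevant irradiances above 125, 150, 200, or 250 μE m⁻² s⁻¹.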
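The fluorescence parameters named above are not defined in this section. The sketch below uses the conventional chlorophyll-fluorescence definitions as an assumption: Fv/Fm = (Fm - F0)/Fm from dark-adapted minimal (F0) and maximal (Fm) fluorescence, and the Stern-Volmer form NPQ = (Fm - Fm')/Fm', where Fm' is the maximal fluorescence in the light. The chlorophyll reduction is expressed as a percentage of the control value. All function names and numbers are hypothetical.

```python
def fv_fm(f0: float, fm: float) -> float:
    """Maximum quantum yield of PSII photochemistry, (Fm - F0) / Fm."""
    if fm <= 0 or f0 < 0 or f0 > fm:
        raise ValueError("expected 0 <= F0 <= Fm and Fm > 0")
    return (fm - f0) / fm


def npq(fm: float, fm_prime: float) -> float:
    """Non-photochemical quenching, (Fm - Fm') / Fm' (Stern-Volmer form)."""
    if fm_prime <= 0:
        raise ValueError("Fm' must be positive")
    return (fm - fm_prime) / fm_prime


def percent_reduction(control: float, mutant: float) -> float:
    """Reduction of the mutant value relative to the control, in percent."""
    if control <= 0:
        raise ValueError("control value must be positive")
    return 100.0 * (control - mutant) / control
```

For example, under these definitions a control with 10 units of chlorophyll and a mutant with 6 units corresponds to a 40% reduction, which would satisfy the "at least 20%" and "at least 30%" embodiments but not "at least 50%".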
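In some embodiments, the mutant photosynthetic organism exhibits a higher rate of carbon fixation on a per-chlorophyll basis than a control photosynthetic organism of the same species. In some embodiments, the rate of carbon fixation is at least 50%, 60%, 70%, 80%, 90%, or 100% higher than that of a control photosynthetic organism of the same species.
In some embodiments, the mutant photosynthetic organism exhibits a rate of oxygen evolution per mg of chlorophyll that is at least 100%, 150%, 200%, 300%, 400%, or more higher than that of a control photosynthetic organism of the same species. In some embodiments, the mutant photosynthetic organism exhibits a rate of oxygen evolution per gram of total organic carbon (TOC) that is at least 100%, 150%, 200%, 300%, 400%, or more higher.
In some embodiments, a culture of the mutant photosynthetic organism exhibits greater lipid productivity than a culture of a control photosynthetic organism of the same species. In some embodiments, the mutant photosynthetic organism exhibits greater lipid productivity in photoautotrophic culture. In some embodiments, the mutant photosynthetic organism is an alga.
In some embodiments, the mutant photosynthetic organism is generated by modulating the SGI2 gene of the organism. In some embodiments, the mutant photosynthetic organism is generated by modulating the cpSRP54 gene and the SGI1 or SGI2 gene of the organism. In some embodiments, modulating the gene comprises UV irradiation, gamma irradiation, or chemical mutagenesis. In some embodiments, modulating the gene comprises a base substitution mutation, insertional mutagenesis, gene replacement, RNAi, antisense RNA, meganuclease genome engineering, one or more ribozymes, and/or a CRISPR/Cas system applied to the cpSRP54 gene, the SGI1 gene, the SGI2 gene, or a combination of the genes.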
In some embodiments, prior to mutation or attenuation of the gene, the mutant photosynthetic organism comprises a cpSRP54 gene encoding a protein having an amino acid sequence with at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% identity to an amino acid sequence selected from the group consisting of SEQ ID NO:68 and SEQ ID NOs:75-85. In some embodiments, prior to mutation or attenuation of the gene, the mutant photosynthetic organism comprises a cpSRP54 gene encoding a protein having an amino acid sequence with at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% identity to at least 30, 35, 40, 45, 50, 60, 70, 80, 100, 150, 200, 250, or 300 amino acids of, or to the full length of, an amino acid sequence selected from the group consisting of SEQ ID NO:68 and SEQ ID NOs:75-85.
In some embodiments, the mutant photosynthetic organism comprises a mutation in the cpSRP54 gene that occurs outside the sequence encoding the first 169 amino acids of the cpSRP54 GTPase domain. In some embodiments, the mutation in the cpSRP54 gene encoding the SRP54 protein occurs outside the sequence encoding the cpSRP54 GTPase domain. In some embodiments, the mutation in the cpSRP54 gene does not include a gene-disrupting mutation in the cpSRP54 GTPase domain.
In some embodiments, prior to mutation or attenuation of the gene, the SGI2 gene of the mutant photosynthetic organism encodes a protein having an amino acid sequence that is at least 50%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% identical to the amino acid sequence of SEQ ID NO:5 or any of SEQ ID NOs:40-56. In some embodiments, prior to mutation or attenuation of the gene, the SGI2 gene of the mutant photosynthetic organism encodes a protein having an amino acid sequence that is at least 50%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% identical to at least 30, 35, 40, 45, 50, 60, 70, 80, 100, 150, 200, 250, or 300 amino acids of, or to the full length of, the amino acid sequence of SEQ ID NO:5 or any of SEQ ID NOs:40-56.
In some embodiments, prior to mutation or attenuation of the SGI1 gene, the SGI1 gene of the mutant photosynthetic organism encodes a protein having an amino acid sequence that is at least 50%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% identical to the amino acid sequence of SEQ ID NO:3 or any of SEQ ID NOs:9-31 and 34-39. In some embodiments, prior to mutation or attenuation of the SGI1 gene, the SGI1 gene of the mutant photosynthetic organism encodes a protein having an amino acid sequence that is at least 50%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% identical to at least 30, 35, 40, 45, 50, 60, 70, 80, 100, 150, 200, 250, or 300 amino acids of, or to the full length of, the amino acid sequence of SEQ ID NO:3 or any of SEQ ID NOs:9-31 and 34-39.
In some embodiments of the above aspects, the photosynthetic organism is polyploid, for example, diploid, triploid, or tetraploid. In some embodiments, one or more copies of the cpSRP54, SGI1, or SGI2 gene are mutated or attenuated, leaving the other copies of the gene unaltered or unattenuated, to generate the mutant photosynthetic organism. In some embodiments, the mutant photosynthetic organism thus generated exhibits, relative to a control photosynthetic organism of the same species, a reduction in chlorophyll under low-light conditions and a higher maximum quantum yield of photochemistry in photosystem II (Fv/Fm) at all physiologically relevant irradiances above 100, 125, 150, 200, or 250 μE m⁻² s⁻¹. In some embodiments, the mutant photosynthetic organism thus generated exhibits greater biomass productivity than a control photosynthetic organism of the same species. In some embodiments, the mutant photosynthetic organism thus generated exhibits greater lipid productivity than a control photosynthetic organism of the same species.
In some embodiments of the above aspects, the mutant photosynthetic organism is an alga. In some embodiments, the alga belongs to a genus selected from Achnanthes, Amphiprora, Amphora, Ankistrodesmus, Asteromonas, Boekelovia, Bolidomonas, Borodinella, Botrydium, Botryococcus, Bracteococcus, Chaetoceros, Carteria, Chlamydomonas, Chlorococcum, Chlorogonium, Chlorella, Chroomonas, Chrysosphaera, Cricosphaera, Crypthecodinium, Cryptomonas, Cyclotella, Dunaliella, Ellipsoidon, Emiliania, Eremosphaera, Ernodesmius, Euglena, Eustigmatos, Franceia, Fragilaria, Gloeothamnion, Haematococcus, Halocafeteria, Heterosigma, Hymenomonas, Isochrysis, Lepocinclis, Micractinium, Monodus, Monoraphidium, Nannochloris, Nannochloropsis, Navicula, Neochloris, Nephrochloris, Nephroselmis, Nitzschia, Ochromonas, Oedogonium, Oocystis, Ostreococcus, Pavlova, Parachlorella, Pascheria, Pelagomonas, Phaeodactylum, Phagus, Picochlorum, Platymonas, Pleurochrysis, Pleurococcus, Prototheca, Pseudochlorella, Pseudoneochloris, Pseudostaurastrum, Pyramimonas, Pyrobotrys, Scenedesmus, Skeletonema, Spyrogyra, Stichococcus, Tetraselmis, Thalassiosira, Tribonema, Vaucheria, Viridiella, Vischeria, and Volvox. In some embodiments, the mutant photosynthetic organism is a member of the Chlorophyta or Charophyta, and can be, for example, a member of any of the chlorophyte classes Chlorophyceae, Trebouxiophyceae, Chlorodendrophyceae, Ulvophyceae, Pedinophyceae, or Prasinophyceae. For example, the algal mutant can be a species belonging to the Chlorophyceae, Trebouxiophyceae, or Chlorodendrophyceae. In some embodiments, the mutant algal cell is a chlorophyte algal cell, and can be a chlorophyte algal cell of the class Trebouxiophyceae, for example, an algal cell of a species of a genus such as Botryococcus, Chlorella, Auxenochlorella, Heveochlorella, Marinichlorella, Parachlorella, Pseudochlorella, Tetrachlorella, Eremosphaera, Franceia, Micractinium, Nannochloris, Oocystis, Picochlorum, or Prototheca. In some embodiments, the mutant alga can be a species belonging to Auxenochlorella, Chlorella, Heveochlorella, Marinichlorella, Parachlorella, Pseudochlorella, or Tetrachlorella.
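In some embodiments, the mutant photosynthetic microorganism is a cyanobacterium. In some embodiments, the cyanobacterium is an Acaryochloris, Agmenellum, Anabaena, Anabaenopsis, Anacystis, Aphanizomenon, Arthrospira, Asterocapsa, Borzia, Calothrix, Chamaesiphon, Chlorogloeopsis, Chroococcidiopsis, Chroococcus, Crinalium, Cyanobacterium, Cyanobium, Cyanocystis, Cyanospira, Cyanothece, Cylindrospermopsis, Cylindrospermum, Dactylococcopsis, Dermocarpella, Fischerella, Fremyella, Geitleria, Geitlerinema, Gloeobacter, Gloeocapsa, Gloeothece, Halospirulina, Iyengariella, Leptolyngbya, Limnothrix, Lyngbya, Microcoleus, Microcystis, Myxosarcina, Nodularia, Nostoc, Nostochopsis, Oscillatoria, Phormidium, Planktothrix, Pleurocapsa, Prochlorococcus, Prochloron, Prochlorothrix, Pseudanabaena, Rivularia, Schizothrix, Scytonema, Spirulina, Stanieria, Starria, Stigonema, Symploca, Synechococcus, Synechocystis, thermosynechocystis, Tolypothrix, Trichodesmium, Tychonema, or Xenococcus species.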
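In some embodiments, the mutant photosynthetic organism is a plant. Non-limiting examples of plants include monocotyledonous and dicotyledonous plants, such as crops including grain crops (e.g., wheat, maize, rice, millet, barley), fruit crops (e.g., tomato, apple, pear, strawberry, orange), forage crops (e.g., alfalfa), root vegetable crops (e.g., carrot, potato, sugar beet, yam), and leafy vegetable crops (e.g., lettuce, spinach); flowering plants (e.g., petunia, rose, chrysanthemum); conifers and pine trees (e.g., pine, fir, spruce); plants used in phytoremediation (e.g., heavy-metal-accumulating plants); oil crops (e.g., sunflower, rapeseed); and plants used for experimental purposes (e.g., Arabidopsis).
Non-limiting examples of mutated dicotyledonous plants include plants belonging to the following orders: Magnoliales, Illiciales, Laurales, Piperales, Aristolochiales, Nymphaeales, Ranunculales, Papaverales, Sarraceniaceae, Trochodendrales, Hamamelidales, Eucommiales, Leitneriales, Myricales, Fagales, Casuarinales, Caryophyllales, Batales, Polygonales, Plumbaginales, Dilleniales, Theales, Malvales, Urticales, Lecythidales, Violales, Salicales, Capparales, Ericales, Diapensiales, Ebenales, Primulales, Rosales, Fabales, Podostemales, Haloragales, Myrtales, Cornales, Proteales, Santalales, Rafflesiales, Celastrales, Euphorbiales, Rhamnales, Sapindales, Juglandales, Geraniales, Polygalales, Umbellales, Gentianales, Polemoniales, Lamiales, Plantaginales, Scrophulariales, Campanulales, Rubiales, Dipsacales, and Asterales.
Non-limiting examples of mutated monocotyledonous plants include plants belonging to the following orders: Alismatales, Hydrocharitales, Najadales, Triuridales, Commelinales, Eriocaulales, Restionales, Poales, Juncales, Cyperaceae, Typhales, Bromeliales, Zingiberales, Arecales, Cyclanthales, Pandanales, Arales, Liliales, and Orchidales; or plants belonging to the gymnosperms, for example, those belonging to the following orders: Pinales, Ginkgoales, Cycadales, Araucariales, Cupressales, and Ephedrales.
In some embodiments, the mutated plant can be Arabidopsis arenicola, Arabidopsis arenosa, Arabidopsis cebennensis, Arabidopsis croatica, Arabidopsis halleri, Arabidopsis lyrata, Arabidopsis neglecta, Arabidopsis pedemontana, Arabidopsis suecica, Arabidopsis thaliana, Zea mays, Oryza sativa, Triticum aestivum, Solanum tuberosum, Allium cepa, Allium sativum, Glycine max, Solanum lycopersicum, Gossypium hirsutum, Gossypium herbaceum, Gossypium arboreum, Gossypium tomentosum, Brassica nigra, or Brassica sp.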
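In some embodiments, the modulation of SRP54, SGI1, SGI2, or a combination of one or more of the genes in a plant can be tissue-specific. In some embodiments, the plant tissue can be leaf, stem, or root. In some embodiments, tissue-specific modulation of a gene can be achieved by modulating a tissue-specific non-coding region of the gene, for example, a promoter, an enhancer, an intron, or a 3' or 5' untranslated region. In some embodiments, the modulation of SRP54, SGI1, SGI2, or a combination of one or more of the genes in the plant can be carried out at different developmental stages of the plant.
These and other objects and features of the invention will become more fully apparent when the following detailed description of the invention is read in conjunction with the accompanying drawings.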
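Brief description of the drawings
FIGS. 1A-1B. FIG. 1A shows a schematic of the SGI1 gene. The putative locations of gRNAs designed to disrupt the SGI1 gene (CRISPR targets) are indicated. FIG. 1B shows a schematic of the SRP54 gene. The putative locations of gRNAs designed to disrupt the SRP54 gene (CRISPR targets) are indicated.
FIGS. 2A-2C. FIG. 2A shows a schematic of the SGI1 gene. The putative locations of gRNAs designed to disrupt the SGI1 gene (CRISPR targets) are indicated. FIG. 2B shows a schematic of the SGI1 protein. FIG. 2C shows a schematic of the SRP54 gene. The putative locations of gRNAs designed to disrupt the SRP54 gene (CRISPR targets) are indicated.
FIG. 3 shows an exemplary domain architecture analysis of the Parachlorella sp. SGI2 protein.
FIG. 4 shows an exemplary domain architecture analysis of the Oocystis sp. SGI2 protein.
FIG. 5 shows an exemplary domain architecture analysis of the Tetraselmis sp. SGI2 protein.
FIG. 6 shows an exemplary domain architecture analysis of the Arabidopsis thaliana SGI2 protein.
FIG. 7 shows an exemplary domain architecture analysis of the Arabidopsis thaliana SGI2 protein.
FIG. 8 shows an exemplary domain architecture analysis of the Arabidopsis thaliana SGI2 protein.
FIG. 9 shows an exemplary domain architecture analysis of the Arabidopsis thaliana SGI2 protein.
FIGS. 10A-10B. FIG. 10A shows a schematic of a DNA cassette containing a codon-optimized Cre gene flanked by a nitrite reductase promoter and terminator. FIG. 10B shows a schematic of a DNA cassette comprising bleR and GFP sequences.
FIG. 11 shows the results of a productivity assay of a Parachlorella wild-type strain, an SRP54 knockout strain, an SGI2 knockout strain, and an SGI2 and SRP54 double-knockout strain.
FIGS. 12A-12B. FIG. 12A shows the results of a semi-continuous areal TOC productivity assay of the Parachlorella wild-type strain (STR00010), the SRP54 knockout mutant (STR00625), the SGI1 knockout mutant (STR24183), and the SGI1/SRP54 double-knockout mutants (STR24538 and STR24550). FIG. 12B shows the results of a batch TOC productivity assay of the Parachlorella wild-type strain (STR00010), the SRP54 knockout mutant (STR00625), the SGI1 knockout mutant (STR24183), and the SGI1/SRP54 double-knockout mutants (STR24538 and STR24550).
FIGS. 13A-13B. FIG. 13A shows the results of an assay of semi-continuous areal TOC productivity of the Parachlorella wild-type strain (STR00010), the SRP54 knockout mutant (STR00625), the SGI1 knockout mutant (STR00012), the SGI2/SRP54 double-knockout mutant (STR25761), and the SGI1/SGI2/SRP54 triple-knockout mutants (STR25761 and STR25762). FIG. 13B shows the results of an assay of batch TOC productivity of the Parachlorella wild-type strain (STR00010), the SRP54 knockout mutant (STR00625), the SGI1 knockout mutant (STR00012), the SGI2/SRP54 double-knockout mutant (STR25761), and the SGI1/SGI2/SRP54 triple-knockout mutants (STR25761 and STR25762).
FIG. 14 shows the results of a batch FAME productivity assay of the Parachlorella wild-type strain (STR00010), the SRP54 knockout mutant (STR00625), the SGI1 knockout mutant (STR24183), and the SGI1/SRP54 double-knockout mutants (STR24538 and STR24540).
FIG. 15 shows the results of a batch FAME productivity assay of the Parachlorella wild-type strain (STR00010), the SGI1 knockout mutant (STR00012), the SGI2/SRP54 double-knockout mutant (STR00516), and the SGI1/SGI2/SRP54 triple-knockout mutants (STR25761 and STR25762).
FIGS. 16A-16B. FIG. 16A shows a schematic of the selection cassette used to knock out Parachlorella SRP54. FIG. 16B shows a schematic of the selection cassette used to knock out Parachlorella SGI2.
FIG. 17 shows a schematic of a recombinant pCC1BAC vector comprising Cas9, GFP, BleR, and Cre genes and lox sites.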
具体实施方式
本申请的发明人令人惊讶且出乎意料地发现,调节光合生物中的SGI1和SGI2基因会导致叶绿素在低光条件下的减少,以及在所有生理相关的辐照度下,光系统II中的光化学最大量子产率(Fv/FM)更高。在一些实施例中,包括突变或弱化的SGI1或SGI2基因的突变体光合生物展现出在所有生理相关的辐照度下,非光化学淬灭(NPQ)较低。在一些实施例中,包括突变或弱化的SGI1或SGI2基因的突变体光合生物展现出比同一物种的对照光合生物增加的生物质。在一些实施例中,包括突变或弱化的SGI1或SGI2基因的突变体光合生物在每个叶绿素的基础上展现出更高的固碳率。在一些实施例中,包括突变或弱化的SGI1或SGI2基因的突变体光合生物在每个TOC的基础上展现出比同一物种的对照光合生物更高的固碳率。在一些实施例中,包括突变或弱化的SGI1或SGI2基因的突变体光合生物在每个mg叶绿素上展现出比同一物种的对照光合生物更高的析氧率。在一些实施例中,包括突变或弱化的SGI1或SGI2基因的突变体光合生物在每个TOC的基础上展现出比同一物种的对照光合生物更高的析氧率。在一些实施例中,包括突变或弱化的SGI1或SGI2基因的突变体光合生物展现出比同一物种的对照光合生物更高的脂质生产力。在一些实施例中,包括突变或弱化的SGI1或SGI2基因的突变体光合生物在光能自养培养物中展现出更大的脂质生产力。
本申请的发明人还令人惊讶地发现,在光合生物中调节SGI1或SGI2基因以及调节SRP54基因的协同效应(synergistic effect)。在一些实施例中,与仅调节SGI1或SGI2基因的突变体光合生物相比,在调节SRP54和SGI1或SGI2基因的突变体光合生物中,叶绿素进一步减少、生物质增加更多、在每个叶绿素的基础上的更高碳固定、在每个TOC的基础上的更高碳固定、更高脂质生产力。
SGI1基因
如本文所描述的,显著生长改善基因1(SGI1)多肽是包含两个结构域的多肽:响应接收或“RR”结构域(Pfam PF00072)和Myb结构域(Pfam PF00249),其中RR结构域定位于Myb结构域的N末端。RR结构域和Myb结构域被氨基酸序列分开,发现所述氨基酸序列在SGI1多肽中是不良保守的或不保守的,有时在本文中被称为两个结构域之间的接头,其中所述接头的长度范围可以为例如一到300个氨基酸,或十到200个氨基酸。接头区可以任选地包含核定位序列(NLS)。
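The architecture rule in the preceding paragraph (an RR domain N-terminal to a Myb domain, separated by a linker of roughly 1 to 300 residues) can be expressed as a simple coordinate check. The following is a minimal sketch, not part of the disclosed method: the `Domain` class, the function name, and the Myb coordinates in the example are hypothetical, and the domain coordinates are assumed to come from an external annotation tool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Domain:
    name: str   # hypothetical labels: "RR" (Pfam PF00072) or "Myb" (Pfam PF00249)
    start: int  # first residue of the domain, 1-based inclusive
    end: int    # last residue of the domain, 1-based inclusive


def has_sgi1_architecture(domains, min_linker=1, max_linker=300):
    """Return True if some RR domain lies N-terminal to some Myb domain and
    the linker between them is min_linker..max_linker residues long."""
    rr_domains = [d for d in domains if d.name == "RR"]
    myb_domains = [d for d in domains if d.name == "Myb"]
    for rr in rr_domains:
        for myb in myb_domains:
            # Residues strictly between the end of RR and the start of Myb.
            linker_length = myb.start - rr.end - 1
            if min_linker <= linker_length <= max_linker:
                return True
    return False


# RR coordinates 36-148 follow the Pfam annotation quoted in this section for
# SEQ ID NO:3; the Myb coordinates are invented for illustration only.
example = [Domain("RR", 36, 148), Domain("Myb", 200, 250)]
reversed_order = [Domain("Myb", 10, 60), Domain("RR", 100, 200)]
too_far_apart = [Domain("RR", 1, 100), Domain("Myb", 500, 550)]
```

A protein whose Myb domain precedes its RR domain yields a negative linker length and is rejected, which mirrors the requirement that the RR domain be N-terminal to the Myb domain.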
响应接收“RR”结构域(Pfam PF00072)的存在负责其作为CheY样多肽的生物信息注释。RR结构域大约从拟小球藻属SGI1多肽(SEQ ID NO:3)的氨基酸36延伸到氨基酸148,并且在保守结构域数据库(CDD)中还被表征为“信号接收结构域”,cd00156,大约从氨基酸37延伸到氨基酸154。所述RR结构域在蛋白质直系同源簇数据库中也被表征为“CheY样接收(REC)结构域”,COG0784,以及被表征为Interpro“CheY样超家族”结构域,IPR011006,其中这两个表征的结构域均从SEQ ID NO:3的拟小球藻属SGI1多肽的大约氨基酸33延伸到大约氨基酸161。RR结构域发现于细菌双组分调节系统(如包含称为CheY的多肽的细菌趋化性双组分系统)中,其中,其从传感器配偶体接收信号。此类系统的RR结构域通常发现于DNA结合结构域的N末端,并且包含可以被磷酸化的磷酸受体位点,所述位点可能负责其激活或去激活。
SGI1蛋白内的RR结构域可以表征为例如Pfam PF00072,或表征为“信号接收结构域”或简单地“接收结构域”,和/或可以分类为保守结构域数据库(CDD)中的cd00156、蛋白质直系同源簇数据库中的COG0784或Interpro“CheY样超家族”结构域的IPR011006。RR结构域发现于细菌双组分调节系统(如包含称为CheY的多肽的细菌趋化性双组分系统)中,其中,其从传感器配偶体接收信号。此类系统的RR结构域通常发现于DNA结合结构域的N末端,并且包含可以被磷酸化的磷酸受体位点,所述位点可能负责其激活或去激活。
SGI1蛋白内的myb结构域可以表征为例如pfamPF00249:“Myb样DNA结合结构域”,和/或可以鉴定为保守结构域TIGR01557“myb样DNA结合结构域,SHAQKYF类(作为SEQ IDNO:102公开的“SHAQKYF”)”,或鉴定为Interpro同源异型样结构域超家族结构域(IPR009057)和/或Interpro Myb结构域(IPR017930)。
除了在myb结构域的N末端具有RR结构域之外,当使用隐马尔可夫模型(HMM)进行扫描时,所述隐马尔可夫模型被设计成在查询蛋白质氨基酸序列与藻类中的SGI1同源物区域的保守氨基酸匹配程度基础上对蛋白质进行评分,本文所提供的SGI1蛋白的评分可以为300或更高、320或更高、340或更高、350或更高、360或更高或370或更高,其中在多肽的比较区域内高度保守的氨基酸位置比不良保守的氨基酸位置权重更大,以达到评分。当基于包含包括使用开发的RR结构域、接头和myb结构域的单个连续序列的藻类SGI1多肽的蛋白质序列用HMM模型进行扫描时,评分为350或更高,如370或更高的多肽包含但不限于:藻类和植物物种的多肽,拟小球藻属1185(SEQ ID NO:3)、胶球藻(SEQ ID NO:9)、海洋发光蚝球藻(SEQ ID NO:10)、莱茵衣藻(SEQ ID NO:11)、卡特氏团藻(SEQ ID NO:13)、四爿藻属105(SEQ ID No:14、15和16)、卵囊藻属(SEQ ID NO:17)、微胞藻属RCC299(SEQ ID NO:18)、细小微胞藻(SEQ ID NO:19)、伪泥炭藓(SEQ ID NO:20)、小立碗藓(SEQ ID NO:21)、拟南芥(SEQ ID NO:22)、圆叶拟南芥(SEQ ID NO:23)、琴叶拟南芥(SEQ ID NO:24)、向日葵(SEQID NO:25)、葡萄(SEQ ID NO:26)、无油樟(SEQ ID NO:27)、蓖麻(SEQ ID NO:28)、番茄(SEQID NO:29)、马铃薯(SEQ ID NO:30)、陆地棉(SEQ ID NO:31)、可可(SEQ ID NO:32)、菜豆(Phaeolis vulgaris)(SEQ ID NO:33)、大豆(SEQ ID NO:34)、藜麦(SEQ ID NO:35)、苹果(Domestica)(SEQ ID NO:36)、玉米(SEQ ID NO:37)、芜菁(SEQ ID NO:38)和稻(SEQ IDNO:39)以及与前述任一项具有至少50%、至少55%、至少60%、至少65%、至少70%、至少75%、至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%同一性的多肽,其中所述多肽具有RR结构域和myb结构域,并且RR结构域是myb结构域的N末端。在各个实施例中,SGI1多肽来自植物或藻类物种(algal species)。对如本文所提供的SGI1多肽进行编码的基因,例如如本文所提供的,在突变体中被破坏或其表达被弱化的基因可以是各个实施例中编码如本文所公开的多肽的植物或藻类物种的天然存在的基因。
在一些实施例中,如本文所提供的SGI1多肽是藻类SGI1多肽,例如,具有天然存在的藻类SGI1多肽的序列,其中所述藻类多肽包含RR结构域和myb结构域,并且RR结构域是myb结构域的N末端。藻类多肽可以任选地与本文所公开的任何藻类SGI1多肽具有至少50%、至少55%、至少60%、至少65%、至少70%、至少75%、至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%同一性。在一些实施例中,SGI1基因可以是对藻类SGI1多肽进行编码的基因,例如具有天然存在的藻类SGI1多肽的序列的多肽。编码具有天然存在的藻类SGI多肽的序列的多肽的SGI1基因可以是具有基因编码序列的天然存在的基因序列的基因,或可以具有与天然存在的基因序列不同的序列。在各个实施例中,如本文所公开的,在突变体光合生物中被弱化、突变或破坏的SGI1基因可以是通过BLAST进行鉴定的基因,例如使用本文所公开的序列和/或通过HMM扫描,其中HMM基于连续氨基酸序列,例如通过比较至少六个SGI多肽得到,其中所述连续氨基酸序列包含RR结构域和myb结构域,其中RR结构域是myb结构域的N末端,并且其中RR与myb结构域之间存在不属于任一个结构域的接头序列。
在一些实施例中,SGI多肽具有藻类SGI1多肽的序列或是与天然存在的藻类SGI1多肽具有至少85%、至少90%或至少95%同一性的天然存在的藻类SGI1多肽的变体,和/或与SEQ ID NO:3、SEQ ID NO:9、SEQ ID NO:10、SEQ ID NO:11、SEQ ID NO:12、SEQ ID NO:13、SEQ ID NO:14、SEQ ID NO:15、SEQ ID NO:16、SEQ ID NO:17、SEQ ID NO:18或SEQ IDNO:19中的任一项具有至少50%、至少55%、至少60%、至少65%、至少70%、至少75%、至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%同一性。
在一些实施例中,SGI多肽具有植物SGI1多肽的序列或是与天然存在的藻类SGI多肽具有至少85%、至少90%或至少95%同一性的天然存在的植物SGI1多肽的变体,和/或与SEQ ID NO:20、SEQ ID NO:21、SEQ ID NO:22、SEQ ID NO:23、SEQ ID NO:24、SEQ ID NO:25、SEQ ID NO:26、SEQ ID NO:27、SEQ ID NO:28、SEQ ID NO:29、SEQ ID NO:30、SEQ IDNO:31、SEQ ID NO:32、SEQ ID NO:33、SEQ ID NO:34、SEQ ID NO:35、SEQ ID NO:36、SEQ IDNO:37、SEQ ID NO:38或SEQ ID NO:39中的任一项具有至少50%、至少55%、至少60%、至少65%、至少70%、至少75%、至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%同一性。
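The Parachlorella sp. SGI1 gene sequence provided as SEQ ID NO:1 was found to encode a polypeptide (SEQ ID NO:3) containing two major functional domains, both of which occur in the N-terminal half of the 619-amino-acid protein. An exemplary Parachlorella sp. SGI1 cDNA sequence is provided as SEQ ID NO:2.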
在SGI1多肽的myb结构域的C末端的区域中,即在蛋白质的(大约)的C末端一半中,没有发现保守的蛋白质结构域。另一方面,RR结构域和Myb结构域(其中myb结构域定位于RR结构域的C末端)可以于在绿色植物(Viridiplantae)(绿色植物(green plant),涵盖藻类)基因组中编码的许多蛋白质中发现。生物信息学分析用于鉴定另外的植物和藻类物种中的SGI1的可能直向同源物。
为了鉴别另外的光合生物中的一类SGI1蛋白,构建隐马尔可夫模型(HMM)以-用于在拟小球藻属SGI1中发现RR结构域myb结构域架构。作为第一步,拟小球藻属SGI1多肽序列(SEQ ID NO:3)用于BLAST搜索包含植物和藻类基因组的JGI Phytozome数据库v.12。还将四种专有藻类基因组(来自拟小球藻属、拟南芥、四爿藻属和卵囊藻属)添加到所搜索的数据库中。当搜索达到大约2,000次命中时,停止搜索。然后通过InterProScan(可从EMBL-EBI[欧洲分子生物学实验室-欧洲生物信息学研究所(European Molecular BiologyLaboratories-European Bioinformatics Institute),例如,在ebi.ac.uk]获得)来分析这些结果,以确保所选结果具有Interpro CheY样超家族结构域(IPR011006)和InterproHomeobox样或Myb结构域(IPR009057或IPR017930)两者。此步骤将所选命中的数量减少到介于900与1,000之间,其中多肽在藻类和高等植物两者的多肽中清楚地鉴别出了双结构域架构(RR结构域N端-myb结构域)。所得序列用于基于序列同源性来组装系统发育树(phylogenetic tree)。系统发育树示出了来自藻类物种的相关多肽的清晰分组,包含拟小球藻属、四爿藻属、卵囊藻属、衣藻属、团藻属、蚝球藻属、微胞藻属和胶球藻属的SGI1同源物。
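Table 1: SGI1 orthologs in algal species
Organism                      Polypeptide sequence    HMM score
Parachlorella sp. 1185        SEQ ID NO:3             400.20
Coccomyxa subellipsoidea      SEQ ID NO:9             403.0
Ostreococcus lucimarinus      SEQ ID NO:10            425.8
Chlamydomonas reinhardtii     SEQ ID NO:11            413.3
Chromochloris zofingiensis    SEQ ID NO:12            292.6
Volvox carteri                SEQ ID NO:13            441.4
Tetraselmis sp. 105           SEQ ID NO:14            403.6
Tetraselmis sp. 105           SEQ ID NO:15            403.0
Tetraselmis sp. 105           SEQ ID NO:16            402.9
Oocystis sp.                  SEQ ID NO:17            426.9
Micromonas sp. RCC299         SEQ ID NO:18            418.4
Micromonas pusilla            SEQ ID NO:19            405.9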
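To establish criteria for likely SGI1 orthologs in other photosynthetic organisms, a hidden Markov model (HMM) was then developed based on the algal cluster of SGI1 polypeptide sequences. The HMM was developed on the N-terminal portion of the SGI1 polypeptides, encompassing both the RR and myb domains and including the linker region between the two conserved domains. The sequence C-terminal to the myb domain, which does not contain any recognizable conserved structure, was excluded from model building. HMMER 3.1b2 was used to build the HMM from a multiple sequence alignment (MSA) of proprietary sequences of Parachlorella sp., Oocystis sp., and Tetraselmis sp. polypeptides together with public-database sequences of the polypeptides of Chlamydomonas reinhardtii, Volvox carteri, Chromochloris zofingiensis, Coccomyxa subellipsoidea, Micromonas sp. RCC299, and Ostreococcus lucimarinus. The MSA of the N-terminal half of the proteins was generated using the ETE3 toolkit and the eggnog41 workflow. This program internally uses Muscle, MAFFT, Clustal Omega, and M-Coffee for alignment, trimAl for alignment trimming, and PhyML for phylogenetic inference. Unlike a single protein sequence used for homology comparison, an HMM captures information from multiple protein sequences and is therefore able to distinguish highly conserved residues from highly divergent residues and to take this into account when determining the relatedness of sequences. When an HMM is used to score a sequence, highly conserved residues receive more weight than highly divergent residues, thereby providing greater sensitivity and accuracy than simpler pairwise sequence alignment (PSA).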
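The statement that conserved residues receive more weight than divergent ones can be illustrated with a toy position-specific log-odds profile. This is a simplified sketch under stated assumptions, not the HMMER 3.1b2 model used above: it has no insert or delete states, assumes a uniform background frequency and a pseudocount of 1, and the four-residue alignment is invented for illustration.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def column_log_odds(msa, pseudocount=1.0):
    """Build one log2-odds table per alignment column from an ungapped MSA."""
    background = 1.0 / len(AMINO_ACIDS)  # assumption: uniform background
    profile = []
    for i in range(len(msa[0])):
        counts = Counter(seq[i] for seq in msa if seq[i] in AMINO_ACIDS)
        total = sum(counts.values()) + pseudocount * len(AMINO_ACIDS)
        profile.append({
            aa: math.log2(((counts[aa] + pseudocount) / total) / background)
            for aa in AMINO_ACIDS
        })
    return profile


def score(profile, query):
    """Sum the per-column log-odds of the query residues (ungapped)."""
    return sum(col.get(res, 0.0) for col, res in zip(profile, query))


# Toy alignment: columns 1-2 are fully conserved, columns 3-4 are variable.
toy_msa = ["MKAL", "MKGV", "MKSI", "MKTL"]
toy_profile = column_log_odds(toy_msa)
```

In this toy profile a match at the conserved first column contributes more to the score than a match at the variable third column, which is the behavior the paragraph attributes to HMM scoring.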
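The SGI1 HMM was used to assign scores to the polypeptides that were identified in the BLAST search and that had also been verified as having both conserved domains (RR and myb). In the bioinformatic search, the highest scores were found almost exclusively in algal species, together with a single plant polypeptide, which allowed identification of proteins of interest in other algal species (Table 1). These represent likely orthologs whose genes can be attenuated or knocked out to provide high-productivity mutants in other organisms.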
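Applying a score cutoff to the HMM results amounts to a simple filter. The sketch below uses the algal scores listed in Table 1 and the 350 threshold mentioned earlier in this section; the dictionary layout and function name are hypothetical conveniences, not part of the original analysis.

```python
# HMM scores copied from Table 1 (algal SGI1 orthologs), keyed by sequence ID.
ALGAL_HMM_SCORES = {
    "SEQ ID NO:3": 400.20,
    "SEQ ID NO:9": 403.0,
    "SEQ ID NO:10": 425.8,
    "SEQ ID NO:11": 413.3,
    "SEQ ID NO:12": 292.6,
    "SEQ ID NO:13": 441.4,
    "SEQ ID NO:14": 403.6,
    "SEQ ID NO:15": 403.0,
    "SEQ ID NO:16": 402.9,
    "SEQ ID NO:17": 426.9,
    "SEQ ID NO:18": 418.4,
    "SEQ ID NO:19": 405.9,
}


def passing_candidates(scores, threshold=350.0):
    """Keep only the entries whose HMM score meets or exceeds the threshold."""
    return {seq_id: s for seq_id, s in scores.items() if s >= threshold}


selected = passing_candidates(ALGAL_HMM_SCORES)
```

With the 350 cutoff, SEQ ID NO:12 (score 292.6) is the only Table 1 entry excluded, which is consistent with its absence from the list of polypeptides scoring 350 or higher.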
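Table 2: SGI1 orthologs in plant species
Organism                   Polypeptide sequence    HMM score
Sphagnum fallax            SEQ ID NO:20            397.3
Physcomitrella patens      SEQ ID NO:21            372.3
Arabidopsis thaliana       SEQ ID NO:22            371.1
Arabidopsis halleri        SEQ ID NO:23            475.9
Arabidopsis lyrata         SEQ ID NO:24            395.5
Helianthus annuus          SEQ ID NO:25            391.2
Vitis vinifera             SEQ ID NO:26            390.6
Amborella trichopoda       SEQ ID NO:27            390.1
Ricinus communis           SEQ ID NO:28            390.1
Solanum lycopersicum       SEQ ID NO:29            388.4
Solanum tuberosum          SEQ ID NO:30            387.2
Gossypium hirsutum         SEQ ID NO:31            385.8
Theobroma cacao            SEQ ID NO:32            383.0
Phaseolus vulgaris         SEQ ID NO:33            381.6
Glycine max                SEQ ID NO:34            381.4
Chenopodium quinoa         SEQ ID NO:35            373.7
Malus domestica            SEQ ID NO:36            372.6
Zea mays                   SEQ ID NO:37            371.5
Brassica rapa              SEQ ID NO:38            370.5
Oryza sativa               SEQ ID NO:39            369.6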
图1A中示出了SGI1基因的示意图。
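In some embodiments, modulation of the SGI1 gene, such as mutation, attenuation, or knockout of the SGI1 gene in an algal species, increases, for example, the maximum quantum yield of photochemistry in photosystem II (Fv/FM) (by about 10%-14%), while the mutant exhibits a reduced antenna size (i.e., functional absorption cross section) compared with the wild-type strain from which it was derived.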
在一些实施例中,调节SGI1基因也可能引起光系统II(PSII)和光系统I(PSI)的天线尺寸减小(即,功能吸收横截面)(相对于野生型下降40%-50%),在饱和光下关于PSII(1/τ’Qa)受体侧的高电子传递速率(增加约35%到约130%,并且相对于野生型在这些工程化的突变体中至少大约100%)和高固碳率(Pmax)(相对于野生型,高达至少30%-40%),同时,如通过多反应监测蛋白质测定所确定的,保持在每个TOC的基础上的光系统的数量。
SGI2基因
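The inventors of the present application have identified Significant Growth Improvement Gene 2 (SGI2) as an ortholog, present in photosynthetic organisms (e.g., algae, plants), of a class of regulatory genes known as two-component systems (TCS). These systems are known to regulate important cellular processes, including bacterial cell cycle progression and development (Skerker et al., 2005; "Two-component signal transduction pathways regulating growth and cell cycle progression in a bacterium: a system-level analysis", PLoS Biology 3(10):e334), nitrogen sensing (Sanders et al., 1992; "Phosphorylation site of NtrC, a protein phosphatase whose covalent intermediate activates transcription", Journal of Bacteriology 174(15):5117-22), and bacterial chemotaxis (Sanders et al., 1989; "Identification of the site of phosphorylation of the chemotaxis response regulator protein, CheY", The Journal of Biological Chemistry 264(36):21770-8). In bacteria, these proteins typically consist of a histidine kinase that detects a specific environmental stimulus and a cognate response regulator domain (PF00072) that mediates the cellular response, mainly through differential expression of target genes. In photosynthetic organisms, however, the SGI2 gene comprises the cognate response regulator domain (PF00072) and lacks the other domain of the two-component system.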
图2A中示出了SGI1基因的示意图,并且图2B中示出了对应的蛋白质的示意图。
发现一种示例性拟小球藻属SGI2基因序列被提供为SEQ ID NO:4,其编码包括响应调控结构域(SEQ ID NO:6)的多肽(SEQ ID NO:5)。
各种光合生物中的示例性直向同源多肽序列在下表3中示出。
表3:各种光合生物中的直向同源SGI2序列
光合生物 多肽序列
卵囊藻属 SEQ ID NO:40
四爿藻属 SEQ ID NO:41
拟南芥 SEQ ID NO:42
拟南芥 SEQ ID NO:43
拟南芥 SEQ ID NO:44
拟南芥 SEQ ID NO:45
拟南芥 SEQ ID NO:46
大豆 SEQ ID NO:47
葡萄 SEQ ID NO:48
可可 SEQ ID NO:49
SEQ ID NO:50
玉米 SEQ ID NO:51
小立碗藓 SEQ ID NO:52
卡特氏团藻 SEQ ID NO:53
莱茵衣藻 SEQ ID NO:54
佐夫小球藻 SEQ ID NO:55
胶球藻C-169 SEQ ID NO:56
示例性拟小球藻属SGI2 cDNA序列被提供为SEQ ID NO:7。其它光合生物中的SGI2基因的直向同源cDNA序列在下表4中示出。
表4:其它光合生物中的SGI2基因的直向同源cDNA序列
光合生物 cDNA序列
卵囊藻属 SEQ ID NO:57
四爿藻属 SEQ ID NO:58
大豆 SEQ ID NO:59
葡萄 SEQ ID NO:60
可可 SEQ ID NO:61
SEQ ID NO:62
玉米 SEQ ID NO:63
小立碗藓 SEQ ID NO:64
卡特氏团藻 SEQ ID NO:65
莱茵衣藻 SEQ ID NO:66
胶球藻 SEQ ID NO:67
在一些实施例中,光合生物的SGI2多肽包括与SEQ ID NO:6至少35%、40%、45%、50%、55%、60%、65%、70%、75%、80%、85%、90%、95%相同的氨基酸序列。在一些实施例中,光合生物的SGI2多肽包括与至少100个、150个、200个、250个氨基酸或SEQ ID NO:5、40、41、42、43、44、45、46、47、48、49、50、51、52、53、54、55或56的全长至少35%、40%、45%、50%、55%、60%、65%、70%、75%、80%、85%、90%、95%相同的氨基酸序列。
在一些实施例中,光合生物包括对SGI2多肽进行编码的多核苷酸,其中所述多核苷酸的核酸序列与至少100个、150个、200个、250个核苷酸或SEQ ID NO:4、7、57、58、59、60、61、62、63、64、65、66或67的全长至少35%、40%、45%、50%、55%、60%、65%、70%、75%、80%、85%、90%、95%相同。
在一些实施例中,调节SGI2基因,如光合生物(如藻类物种)中的SGI2基因的突变、弱化或敲除,增加了光系统II中的光化学最大量子产率(Fv/FM)(约10%-14%),减少的叶绿素/每总有机碳(TOC),增加的生物质。
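SRP54 Gene
Modulation of the SRP54 gene has been described in U.S. Patent Application Publication 2016/0304896, which is incorporated herein by reference in its entirety. An exemplary Parachlorella sp. chloroplast SRP54 (cpSRP54) cDNA sequence is provided as SEQ ID NO:8, which encodes a polypeptide having SEQ ID NO:68.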
其它非限制性的示例性cpSRP54直向同源多肽包含GenBank登录号:EDP00260,用于莱茵衣藻(SEQ ID NO:75);EEH59526,用于细小微胞藻(SEQ ID NO:76);EEH59526,用于微胞藻属(SEQ ID NO:77);ACB42577,用于载色宝琳虫(Paulinella chromatophora)(SEQID NO:78);ABO94038,用于海洋发光蚝球藻(SEQ ID NO:79);Q01H03,用于金牛蚝球藻(SEQID NO:80);EFJ41797,用于卡特氏团藻(SEQ ID NO:81);EEC48599,用于三角褐指藻(Phaeodactylum tricornutum)(SEQ ID NO:82);EED94755,用于假微型海链藻(Thalassiosira pseudonana)(SEQ ID NO:83);EGB12501,用于抑食金球藻(Aureococcusanophagefferens)(SEQ ID NO:84);CBN76263,用于长囊水云(Ectocarpus siliculosus)(SEQ ID NO:85)。
在一些实施例中,光合生物的cpSRP54基因编码与上文所公开的cpSRP54至少55%、至少60%、至少65%、至少70%、至少75%、至少80%或至少85%、至少90%或至少95%序列同一性的多肽。
光合生物的SGI2、SGI1和SRP54的组合、SGI2和SRP54基因的组合或SGI1、SGI2和SRP54基因的组合的调节
光合生物的SGI2、SGI1和SRP54的组合、SGI2和SRP54基因的组合或SGI1、SGI2和SRP54基因的组合的调节生产突变体光合生物。SGI1、SGI2、SRP54基因可以通过UV诱变、γ射线或基因工程技术来调节。可以改变基因序列,可以部分或完全缺失基因序列,可以改变基因的表达。
在一些实施例中,SGI1、SGI2和/或SRP54基因可以可操作地连接到如美国申请公开2017/0058303中所描述的藻类启动子和终止子序列,所述美国申请通过引用整体并入本文。
在一些实施例中,相对于对照细胞,突变体光合生物(例如,植物、藻类)的总叶绿素具有至少20%减少、至少30%减少、至少40%减少、至少50%减少、至少55%减少、至少60%减少、至少65%减少或至少70%减少,任选地进一步,其中所述突变体具有至少相对于对照细胞增加的叶绿素a与叶绿素b的比率,进一步任选地,其中叶绿素a与叶绿素b的比率为至少约2.8:1、至少约3:1、至少约3.2:1、约3.3:1、至少约3.5:1、至少约3.7:1、至少约3.9:1、至少约4:1或至少约4.3:1。
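In some embodiments, the mutant photosynthetic organism (e.g., a plant or an alga) exhibits:
(a) higher qP relative to a control photosynthetic organism of the same species at all irradiances between about 100 and about 2800 μmol photons m⁻² sec⁻¹, between about 150 and about 2800 μmol photons m⁻² sec⁻¹, between about 75 and about 2800 μmol photons m⁻² sec⁻¹, between about 40 and about 2800 μmol photons m⁻² sec⁻¹, or between about 10 and about 2800 μmol photons m⁻² sec⁻¹;
(b) lower NPQ relative to a control alga at all irradiances between about 100 and about 2800 μmol photons m⁻² sec⁻¹, between about 150 and about 2800 μmol photons m⁻² sec⁻¹, between about 75 and about 2800 μmol photons m⁻² sec⁻¹, between about 40 and about 2800 μmol photons m⁻² sec⁻¹, or between about 10 and about 2800 μmol photons m⁻² sec⁻¹;
(c) higher Y(II) relative to a control photosynthetic organism (e.g., an alga) at all irradiances between about 100 and about 2800 μmol photons m⁻² sec⁻¹, between about 150 and about 2800 μmol photons m⁻² sec⁻¹, between about 75 and about 2800 μmol photons m⁻² sec⁻¹, between about 40 and about 2800 μmol photons m⁻² sec⁻¹, or between about 10 and about 2800 μmol photons m⁻² sec⁻¹;
(d) higher Fv/FM relative to a control alga at irradiances between about 100 and about 2800 μmol photons m⁻² sec⁻¹, between about 150 and about 2800 μmol photons m⁻² sec⁻¹, between about 75 and about 2800 μmol photons m⁻² sec⁻¹, between about 40 and about 2800 μmol photons m⁻² sec⁻¹, or between about 10 and about 2800 μmol photons m⁻² sec⁻¹;
(e) higher ETR(II) relative to a control alga at irradiances between about 250 and about 2800 μmol photons m⁻² sec⁻¹, between about 150 and about 2800 μmol photons m⁻² sec⁻¹, between about 75 and about 2800 μmol photons m⁻² sec⁻¹, between about 40 and about 2800 μmol photons m⁻² sec⁻¹, or between about 10 and about 2800 μmol photons m⁻² sec⁻¹;
(f) oxygen evolution on a per-chlorophyll basis that is increased by at least 50%, at least 100%, at least 200%, at least 300%, at least 350%, or at least 400% relative to a control alga; and
(g) carbon fixation on a per-chlorophyll basis that is increased by at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, or at least 100% relative to a control photosynthetic organism of the same species.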
在一些实施例中,突变体光合生物显示出比在相同条件下培养的对照藻类高至少5%、至少6%、至少8%或至少10%、至少15%、至少25%、或至少30%的生物质生产力。
在一些实施例中,在具有模拟自然日光的可变光强度的昼夜循环培养中,相对于对照藻类,突变体光合生物(例如植物、藻类)显示出更大的生产力,任选地其中光强度的峰值处于约1900与约2000μmol光子m-2sec-1之间。
在一些实施例中,相对于不具有一个或多个改变的或弱化的基因的同一物种的对照光合生物,突变体光合生物(例如,植物或藻类)具有更大的脂质生产力,例如至少5%、至少10%、至少15%、至少20%或至少25%更大的脂质生产力。
定义
除非另外定义,否则本文中所使用的所有技术术语和科学术语的含义与本发明所属领域的普通技术人员通常所理解的含义相同。在冲突的情况下,以本申请(包含定义)为准。除非上下文另外要求,否则单数术语应该包含复数含义,并且复数术语应该包含单数含义。本文引用的所有公开、专利和其它参考文献均出于所有目的通过引用整体并入本文,如同每个单独的公开或专利申请被专门且单独地指示通过引用并入。
如在本公开和权利要求中所使用的,除非上下文另外明确指明,否则单数形式的“一个(a)”“一种(an)”和“所述(the)”也包含复数形式。
在本申请内所提供的所有范围包含所述范围的上端和下端的值。
在本文中,在短语如“A和/或B”中所使用的术语“和/或”旨在包含“A和B”、“A或B”、“A”和“B”。
术语“基因”被广泛地用于指对多肽或表达的RNA进行编码的核酸分子(典型地DNA,但任选地RNA)的任何区段。因此,基因包含对表达的RNA进行编码的序列(其可以包含多肽编码序列或例如功能性RNA,如核糖体RNA、tRNA、反义RNA、微RNA、短发夹RNA、核酶等)。基因可以进一步包括其表达所需或影响其表达的调控序列,以及与蛋白质或处于其天然状态的RNA编码序列相关的序列,例如内含子序列、5'或3'非翻译序列等。在一些实例中,“基因”可以仅指DNA或RNA分子的蛋白质编码部分,其可以包含或可以不包含内含子。基因的长度优选地大于50个核苷酸,更优选地长度大于100个核苷酸,并且长度可以例如介于50个核苷酸与500,000个核苷酸之间,如长度介于100个核苷酸与100,000个核苷酸之间或长度介于约200个核苷酸与约50,000个核苷酸之间或长度介于约200个核苷酸与约20,000个核苷酸之间。基因可以从多种来源获得,包含从兴趣来源克隆或从已知或预测的序列信息合成。
术语“核酸”或“核酸分子”是指DNA或RNA(例如,mRNA)的区段,并且还包含具有经过修饰的主链(例如,肽核酸、锁核酸)或经过修饰的或非天然存在的核碱基的核酸。核酸分子可以是双链的、部分双链的或单链的;包括基因或其部分的单链核酸可以是编码(有义)链或非编码(反义)链。
核酸分子可以“源自”指示的来源,其包含从所指示的来源分离(全部或部分)核酸区段。核酸分子也可以通过例如直接克隆、PCR扩增或从指示的多核苷酸来源人工合成或基于与指示的多核苷酸来源相关的序列而源自指示的来源。源自特定来源或物种的基因或核酸分子还包含相对于来源核酸分子具有序列修饰的基因或核酸分子。例如,源自来源(例如,特定参照基因)的基因或核酸分子可以包含相对于来源基因或核酸分子的一种或多种突变,这些突变是非预期的或有意引入的,并且如果一种或多种突变(包含取代、缺失或插入)是有意引入的,则这些序列改变可以通过细胞或核酸的随机或靶向突变、通过扩增或其它基因合成或分子生物学技术或通过化学合成或其任何组合来引入。源自编码功能性RNA或多肽的参照基因或核酸分子的基因或核酸分子可以编码功能性RNA或多肽,所述功能性RNA或多肽与参照或来源功能性RNA或多肽或与其功能性片段具有至少75%、至少80%、至少85%、至少90%或至少95%序列同一性。例如,源自编码功能性RNA或多肽的参照基因或核酸分子的基因或核酸分子可以编码功能性RNA或多肽,所述功能性RNA或多肽与参照或来源功能性RNA或多肽或与其功能性片段具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%序列同一性。
如本文所使用的,“分离的”核酸或蛋白质是从其天然环境或核酸或蛋白质在自然界中存在的环境中去除的。例如,分离的蛋白质或核酸分子是从在其天然或自然环境中与其相关的细胞或生物中去除的。在一些实例中,分离的核酸或蛋白质可以部分地或基本上纯化,但是分离不需要特定水平的纯化。因此,例如,分离的核酸分子可以是已经从其在自然界中被整合到的染色体、基因组或附加体切离的核酸序列。
“纯化的”核酸分子或核苷酸序列或蛋白质或多肽序列基本上不含细胞材料和细胞组分。纯化的核酸分子或蛋白质可以基本上不含例如除缓冲液或溶剂之外的化学品。“基本上不含”不旨在意指除了这些新核酸分子之外的其它组分是不可检测的。
术语“天然存在的”和“野生型”是指在自然界中发现的形式。例如,天然存在的或野生型核酸分子、核苷酸序列或蛋白质可以存在于从天然来源分离的物质中并且未通过人为操纵有意地修饰。
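As used herein, "attenuated" means reduced in amount, degree, intensity, or strength. Attenuated gene expression may refer to a significantly reduced amount and/or rate of transcription of the gene in question, or a significantly reduced amount and/or rate of translation, folding, or assembly of the encoded protein. As non-limiting examples, an attenuated gene may be a mutated or disrupted gene (e.g., a gene disrupted by partial or total deletion, truncation, frameshifting, or insertional mutation) that has decreased expression due to alteration or disruption of gene regulatory sequences, or may be a gene targeted by a construct that reduces expression of the gene (e.g., an antisense RNA, microRNA, RNAi molecule, or ribozyme).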
“外源核酸分子”或“外源基因”是指已经被引入(“转化”)到细胞中的核酸分子或基因。转化的细胞可以称为重组细胞,其中可以引入另外的一个或多个外源基因。如果用核酸分子转化的细胞的后代已经遗传外源核酸分子,则其也称为“转化的”。相对于被转化的细胞,外源基因可以来自不同物种(并且因此是“异源的”)或来自同一物种(并且因此是“同源的”)。“内源性”核酸分子、基因或蛋白质是天然核酸分子、基因或蛋白质,因为其存在于宿主中或由宿主天然产生。
本文所使用的术语“天然的”用于指代核酸序列或氨基酸序列,因为所述核酸序列或氨基酸序列天然存在于宿主中。本文所使用的术语“非天然的”用于指代不天然存在于宿主中的核酸序列或氨基酸序列。已经从细胞中去除、经受实验室操作、并引入或重新引入到宿主细胞的核酸序列或氨基酸序列被认为是“非天然的”。引入到宿主细胞中的合成或部分地合成的基因是“非天然的”。非天然基因进一步包含对宿主微生物内源的基因,所述基因可操作地连接到已经重组到宿主基因组中的一种或多种异源调控序列。
“重组的”或“工程化的”核酸分子是已经通过人为操纵改变的核酸分子。作为非限制性实例,重组核酸分子包含执行以下的任何核酸分子:1)在体外已经部分地或完全地合成或修饰,例如使用化学或酶促技术(例如,通过使用化学核酸合成,或通过使用用于核酸分子的复制、聚合、消化(核酸外切或核酸内切)、连接、逆转录、转录、碱基修饰(包含例如甲基化)、整合或重组(包含同源和位点特异性重组)的酶);2)包含本质上不连接的连接核苷酸序列;3)已经使用分子克隆技术工程化,使得其相对于天然存在的核酸分子序列缺少一个或多个核苷酸;和/或4)已经使用分子克隆技术进行了操作,使得其相对于天然存在的核酸序列具有一个或多个序列改变或重排。作为非限制性实例,cDNA是重组DNA分子,如已经通过体外一个或多个聚合酶反应产生的或接头已经附着到的或已经整合到载体(如克隆载体或表达载体)中的任何核酸分子。
如本文所使用的,术语“重组蛋白质”是指通过基因工程产生的蛋白质。
当应用于生物时,术语重组、工程化或基因工程化是指已经通过将异源或外源重组核酸序列引入到生物中而被操作的生物,并且包含基因敲除、靶向突变、基因置换和启动子置换、缺失或插入,以及将转基因或合成基因引入到生物中。重组或基因工程化生物也可以是已经引入到其的用于基因“敲低”的构建体的生物。此类构建体包含但不限于RNAi、微RNA、shRNA、siRNA、反义和核酶构建体。还包含其基因组已经通过大范围核酸酶、锌指核酸酶、TALEN或Cas/CRISPR系统的活性改变的生物。外源或重组核酸分子可以整合到重组/基因工程化生物的基因组中或在其它实例中,可以不整合到宿主基因组中。如本文所使用的,“重组微生物”或“重组宿主细胞”包含本发明的重组微生物的后代或衍生物。因为某些修饰可能由于突变或环境影响而在后续世代中发生,所以此类后代或衍生物实际上可能不与亲本细胞相同,但仍包含在如本文所使用的术语的范围内。
术语“启动子”是指能够结合细胞中的RNA聚合酶并启动下游(3'方向)编码序列的转录的核酸序列。启动子包含以高于背景的可检测水平启动转录所必需的最小数量的碱基或元件。启动子可以包含转录起始位点以及负责结合RNA聚合酶的蛋白质结合结构域(共有序列)。真核启动子通常但不总是含有“TATA”盒和“CAT”盒。原核启动子可以含有-10和-35原核启动子共有序列。来自多种不同来源的大量启动子(包含组成型、诱导型和阻抑型启动子)是本领域所众所周知的。代表性来源包含例如藻类、病毒、哺乳动物、昆虫、植物、酵母和细菌细胞类型,并且来自这些来源的合适的启动子是容易获得的,或可以基于公开在线获得的序列或例如,从保藏机构(如ATCC)以及其它商业或个体来源合成地制备。启动子可以是单向的(启动一个方向的转录)或双向的(启动任一方向的转录)。启动子可以是组成型启动子、阻抑型启动子或诱导型启动子。除了RNA聚合酶结合以启动转录的基因近端启动子之外,启动子区还可以包含基因上游的额外序列,所述额外序列可以处于基因的转录起始位点的1kb、2kb、3kb、4kb、5kb或更多内,其中所述额外序列可以影响下游基因的转录速率并且任选地影响启动子对发育、环境或生物化学(例如,代谢)条件的反应性。
当关于多核苷酸、基因、核酸、多肽或酶使用时,术语“异源的”是指来自来源或源自除宿主生物物种之外的来源的多核苷酸、基因、核酸、多肽或酶。相比而言,“同源的”多核苷酸、基因、核酸、多肽或酶在本文中用于表示源自宿主生物物种的多核苷酸、基因、核酸、多肽或酶。当提及基因调控序列或提及用于保持或操纵基因序列的辅助核酸序列时(例如启动子、5'非翻译区、3'非翻译区、poly A添加序列、内含子序列、剪接位点、核糖体结合位点、内部核糖体进入序列、基因组同源区、重组位点等),“异源的”意指调控序列或辅助序列不与调控序列或辅助核酸序列与之并置在构建体、基因组、染色体或附加体中的基因天然地缔合。因此,可操作地连接到其在其天然状态下(即,在非基因工程化生物的基因组中)不可操作地连接的基因的启动子在本文中被称为“异源启动子”,即使所述启动子可以源自与其连接的基因相同的物种(或在一些情况下,同一生物)。
如本文所使用的,术语“蛋白质”或“多肽”旨在涵盖单数“多肽”以及复数“多肽”,并且是指由通过酰胺键(也称为肽键)线性连接的单体(氨基酸)构成的分子。术语“多肽”是指两个或更多个氨基酸中的任何链或多条链,并且不是指产物的特定长度。因此,肽、二肽、三肽、寡肽、“蛋白质”、“氨基酸链”或用于指代两个或更多个氨基酸的一条链或多条链的任何其它术语被包含在“多肽”的定义内,并且术语“多肽”可以代替或与这些术语中的任何一个可互换地使用。
基因和蛋白质登录号(通常在基因或物种名称之后的括号中提供)是在美国国立卫生研究院(United States National Institutes of Health)维护的美国国家生物技术信息中心(NCBI)网站(ncbi.nlm.nih.gov)可公开获得的序列记录的唯一标识符。“GenInfo标识符”(GI)序列标识号是特定于核苷酸或氨基酸序列的。如果序列以任何方式改变,则分配新的GI号。序列修订历史工具可用于跟踪出现在特定GenBank记录中的序列的各种GI号、版本号和更新日期。基于登录号和GI号来搜索和获得核酸或基因序列或蛋白质序列在例如细胞生物学、生物化学、分子生物学和分子遗传学领域中是众所周知的。
如本文所使用的,相对于核酸或多肽序列的术语“同一性百分比”或“同源性”被定义为在比对序列达到最大百分比同一性并且在必要时引入空位以实现最大百分比同源性之后,候选序列中与已知多肽相同的核苷酸或氨基酸残基的百分比。N末端或C末端插入或缺失不应被解释为影响同源性,并且多肽序列中少于约30个、少于约20个或少于约10个氨基酸残基的内部缺失和/或插入不应被解释为影响同源性。核苷酸或氨基酸序列水平的同源性或同一性可以使用程序blastp、blastn、blastx、tblastn和tblastx(Altschul(1997),《核酸研究(Nucleic Acids Res.)》25,3389-3402,和Karlin(1990),《美国国家科学院院刊(Proc.Natl.Acad.Sci.USA)》87,2264-2268)所采用的算法,通过BLAST(基本局部比对搜索工具)分析来确定,其被定制用于序列相似性搜索。BLAST程序所使用的方法是首先考虑查询序列与数据库序列之间具有和不具有缺口的相似区段,然后评估所鉴定的所有匹配的统计显著性,并且最后仅概括满足预先选择的显著性阈值的那些匹配。关于序列数据库相似性搜索中的基本问题的讨论,参见Altschul(1994),《自然遗传学(Nature Genetics)》6,119-129。直方图、描述、比对、预期(即,用于报告针对数据库序列的匹配的统计显著性阈值)、截断、矩阵和过滤器(低复杂度)的搜索参数可以处于默认设置。blastp、blastx、tblastn和tblastx所使用的默认评分矩阵是BLOSUM62矩阵(Henikoff(1992),《美国国家科学院院刊》89,10915-10919),推荐长度超过85的查询序列(核苷酸碱基或氨基酸)。
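As a rough illustration of the percent identity definition above, the sketch below computes identity from two sequences that have already been aligned (for example by BLAST). It is an assumption-laden simplification rather than the BLAST algorithm: terminal gap columns are trimmed so that N- or C-terminal insertions or deletions do not affect the result, and internal gap columns are simply skipped. The function name and example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of identical residues over the aligned, non-gap columns,
    ignoring terminal overhangs."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = list(zip(aligned_a, aligned_b))
    # Trim terminal columns in which either sequence has a gap.
    while columns and gap in columns[0]:
        columns.pop(0)
    while columns and gap in columns[-1]:
        columns.pop()
    # Skip internal gap columns (simplifying assumption, see lead-in).
    compared = [(x, y) for x, y in columns if x != gap and y != gap]
    if not compared:
        return 0.0
    matches = sum(1 for x, y in compared if x == y)
    return 100.0 * matches / len(compared)


# Hypothetical aligned pair: a two-residue N-terminal overhang and one mismatch.
identity = percent_identity("--MKTAYIAK", "GSMKTAYLAK")
```

Here seven of the eight compared positions match, so `identity` is 87.5.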
对于设计用于比较核苷酸序列的blastn,评分矩阵通过M(即,一对匹配残基的奖励评分)与N(即,错配残基的罚分)的比率来设定,其中M和N的默认值可以分别为+5和-4。四个blastn参数可以调整如下:Q=10(空位产生罚分);R=10(空位延伸罚分);wink=1(在沿着查询的每个winkth位置处生成字命中);和gapw=16(设置其中生成空位对准的窗口宽度)。用于氨基酸序列比较的等效Blastp参数设置可以为:Q=9;R=2;wink=1;以及gapw=32。GCG软件包版本10.0中可获得的序列之间的Bestfit比较可以使用DNA参数GAP=50(空位产生罚分)和LEN=3(空位延伸罚分),并且蛋白质比较中的等效设置可以为GAP=8和LEN=2。
因此,当提及本发明的多肽或核酸序列时,包含的是与全长多肽或核酸序列或与其片段(其包括整个蛋白质的至少50个、至少75个、至少100个、至少125个、至少150个或更多个氨基酸残基的连续序列)具有至少40%、至少45%、至少50%、至少55%、至少70%、至少65%、至少70%、至少75%、至少80%、或至少85%,例如至少86%、至少87%、至少88%、至少89%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、至少99%或约100%序列同一性的序列同一性;此类序列的变体,例如其中至少一个氨基酸残基已经被插入所公开的含有插入和取代的一个或多个序列的N-和/或C-末端和/或内部。预期的变体可以另外地或可替代地包含通过例如同源重组或定点或PCR诱变而含有预定突变的那些变体,以及其它物种的对应多肽或核酸,包含但不限于本文所描述的那些多肽或核酸,含有插入和取代的多肽或核酸家族的等位基因或其它天然存在的变体;和/或衍生物,其中所述多肽已经通过取代、化学、酶促或其它适当方式用除了含有插入和取代的天然存在的氨基酸之外的部分(例如,可检测的部分如酶)共价修饰。
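As used herein, the phrase "conservative amino acid substitution" or "conservative mutation" refers to the replacement of one amino acid by another amino acid with a common property. A functional way to define common properties between individual amino acids is to analyze the normalized frequencies of amino acid changes between corresponding proteins of homologous organisms (Schulz (1979), Principles of Protein Structure, Springer-Verlag). According to such analyses, groups of amino acids can be defined in which the amino acids within a group exchange preferentially with each other and therefore resemble each other most in their impact on overall protein structure (Schulz (1979), Principles of Protein Structure, Springer-Verlag). Examples of amino acid groups defined in this manner can include: a "charged/polar group" including Glu, Asp, Asn, Gln, Lys, Arg, and His; an "aromatic or cyclic group" including Pro, Phe, Tyr, and Trp; and an "aliphatic group" including Gly, Ala, Val, Leu, Ile, Met, Ser, Thr, and Cys. Within each group, subgroups can also be identified. For example, the group of charged/polar amino acids can be subdivided into subgroups including: the "positively charged subgroup" comprising Lys, Arg, and His; the "negatively charged subgroup" comprising Glu and Asp; and the "polar subgroup" comprising Asn and Gln. In another example, the aromatic or cyclic group can be subdivided into subgroups including: the "nitrogen ring subgroup" comprising Pro, His, and Trp; and the "phenyl subgroup" comprising Phe and Tyr. In yet a further example, the aliphatic group can be subdivided into subgroups including: the "large aliphatic non-polar subgroup" comprising Val, Leu, and Ile; the "aliphatic slightly polar subgroup" comprising Met, Ser, Thr, and Cys; and the "small-residue subgroup" comprising Gly and Ala. Examples of conservative mutations include substitutions of amino acids within the subgroups above, such as, but not limited to: Lys for Arg or vice versa, such that a positive charge can be maintained; Glu for Asp or vice versa, such that a negative charge can be maintained; Ser for Thr or vice versa, such that a free -OH can be maintained; and Gln for Asn or vice versa, such that a free -NH2 can be maintained. A "conservative variant" is a polypeptide that includes one or more amino acids that have been substituted to replace one or more amino acids of a reference polypeptide (e.g., a polypeptide whose sequence is disclosed in a publication or sequence database, or whose sequence has been determined by nucleic acid sequencing) with an amino acid having common properties, e.g., belonging to the same amino acid group or subgroup as delineated above.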
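The subgroup definitions above lend themselves to a lookup table. The following minimal sketch encodes the eight subgroups with one-letter amino acid codes and tests whether a substitution stays within a subgroup; the dictionary keys and function name are hypothetical labels, and treating only within-subgroup changes as conservative is one reading of the definition (the broader groups could be used instead).

```python
# Subgroups as listed in the definition above, in one-letter codes.
SUBGROUPS = {
    "positively_charged": set("KRH"),         # Lys, Arg, His
    "negatively_charged": set("ED"),          # Glu, Asp
    "polar": set("NQ"),                       # Asn, Gln
    "nitrogen_ring": set("PHW"),              # Pro, His, Trp
    "phenyl": set("FY"),                      # Phe, Tyr
    "large_aliphatic_nonpolar": set("VLI"),   # Val, Leu, Ile
    "aliphatic_slightly_polar": set("MSTC"),  # Met, Ser, Thr, Cys
    "small_residue": set("GA"),               # Gly, Ala
}


def is_conservative(original, replacement):
    """True if two different residues share at least one subgroup."""
    if original == replacement:
        return False  # not a substitution at all
    return any(
        original in members and replacement in members
        for members in SUBGROUPS.values()
    )
```

Because His appears in both the positively charged and nitrogen ring subgroups, it counts as a conservative replacement for Lys or Arg as well as for Pro or Trp under this reading.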
如本文所使用的,术语基因的“调节(modulating)”或“调节(modulation)”是指改变所述基因的核酸序列、完全地或部分地缺失所述基因、引起所述基因中的断裂、改变所述基因的表达、抑制基因的表达或使所述基因的表达沉默。在一些实施例中,改变基因的序列是通过插入一个或多个核苷酸、缺失一个或多个核苷酸、取代核苷酸。改变序列可以通过UV辐照、γ辐照、基因工程来实现。
如本文所使用的,“弱化基因表达”意指以减少全功能性蛋白质的产生的任何方式减少或消除基因的表达。
如本文所使用的,“表达”包含至少在RNA生产水平上的基因表达,并且“表达产物”包含所得产物,例如多肽或功能性RNA(例如,核糖体RNA、tRNA、反义RNA、微RNA、shRNA、核酶等)。术语“增加的表达”包含基因表达的改变以促进增加的mRNA产生和/或增加的多肽表达。“增加的产生”包含与多肽的天然产生或酶活性相比,多肽表达的量、多肽的酶活性水平或两者的组合的增加。
本发明的一些方面包含特定多核苷酸序列的表达的部分、基本或完全缺失、沉默、失活或下调。这些基因可以部分地、基本上或完全地缺失、沉默、失活,或其表达可以被下调,以便影响由其编码的多肽所进行的活性,如酶的活性。通过插入破坏基因的功能和/或表达的核酸序列(例如,病毒插入、转座子诱变、大范围核酸酶工程化、同源重组或本领域已知的其它方法),基因可以部分地、基本上或完全地缺失、沉默、失活或下调。术语“消除(eliminate)”、“消除(elimination)”和“敲除(knockout)”可以与术语“缺失”、“部分缺失”、“基本缺失”或“完全缺失”互换使用。在某些实施例中,可以通过定点同源重组工程化兴趣微生物,以敲除特定的兴趣基因。在仍其它实施例中,RNAi或反义DNA(asDNA)构建体可以用于部分地、基本上或完全地沉默、失活或下调特定兴趣基因。
某些核酸分子或特定多核苷酸序列的这些插入、缺失或其它修饰可以被理解为涵盖“一种或多种基因修饰”或“一种或多种转化”,使得这些微生物或宿主细胞的所得菌株可以被理解为“经过基因修饰的”、“基因工程化的”或“转化的”。
如本文所使用的,“上调的(up-regulated)”或“上调(up-regulation)”包含兴趣基因或核酸分子的表达或酶活性的增加,例如,与未被上调的在其它方面相同的基因或酶中的表达或活性相比,基因表达或酶活性的增加。
如本文所使用的,“下调的(down-regulated)”或“下调(down-regulation)”包含兴趣基因或核酸分子的表达或酶活性的降低,例如,与未被下调的在其它方面相同的基因或酶中的表达或活性相比,基因表达或酶活性的降低。
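As used herein, a "mutant" refers to an organism that is not naturally occurring and that has a mutation in a gene arising from classical mutagenesis (e.g., using gamma irradiation, UV, or chemical mutagens). As used herein, "mutant" also refers to a recombinant cell that has altered structure or expression of a gene as a result of genetic engineering, which may include, as non-limiting examples, overexpression, including expression of a gene under different temporal, biological, or environmental regulation and/or to a different degree than occurs naturally and/or expression of a gene that is not naturally expressed in the recombinant cell; homologous recombination, including knockouts and knock-ins (e.g., gene replacement with genes encoding polypeptides having greater or lesser activity than the wild-type polypeptide, and/or dominant negative polypeptides); gene attenuation via RNAi, antisense RNA, ribozymes, or the like; and genome engineering using meganucleases, TALENs, and/or CRISPR technologies, and the like. A mutant organism of interest will typically have a phenotype different from that of the corresponding wild-type or progenitor strain that lacks the mutation, where the phenotype can be assessed by growth assays, product analysis, photosynthetic properties, biochemical assays, etc. When referring to a gene, "mutant" means that the gene has at least one base (nucleotide) change, deletion, or insertion with respect to a native or wild-type gene. The mutation (change, deletion, and/or insertion of one or more nucleotides) can be in the coding region of the gene or in an intron, 3' UTR, 5' UTR, or promoter region, e.g., within 2 kb of the transcriptional start site or within 3 kb of the translational start site. As non-limiting examples, a mutant gene can be a gene that has an insertion within the promoter region that can either increase or decrease expression of the gene; a gene that has a deletion resulting in production of a nonfunctional protein, truncated protein, dominant negative protein, or no protein; or a gene that has one or more point mutations leading to a change in an amino acid of the encoded protein or resulting in aberrant splicing of the gene transcript, etc.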
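The term "Pfam" refers to a large collection of protein domains and protein families maintained by the Pfam Consortium and available at: the Wellcome Trust Sanger Institute; pfam.sbc.su.se (Stockholm Bioinformatics Center); Janelia Farm, Howard Hughes Medical Institute; and the Institut national de la Recherche Agronomique. The latest release of Pfam is Pfam 27.0 (March 2013), based on the UniProt protein database release 2012_06. Pfam domains and families are identified using multiple sequence alignments and hidden Markov models (HMMs). Pfam-A family or domain assignments are high-quality assignments generated from a curated seed alignment using representative members of a protein family, with a profile hidden Markov model built from the seed alignment. (Unless otherwise specified, matches of a queried protein to a Pfam domain or family are Pfam-A matches.) All identified sequences belonging to the family are then used to automatically generate a full alignment for the family (Sonnhammer (1998), Nucleic Acids Research 26, 320-322; Bateman (2000), Nucleic Acids Research 28, 263-266; Bateman (2004), Nucleic Acids Research 32, Database Issue, D138-D141; Finn (2006), Nucleic Acids Research 34, Database Issue, D247-251; Finn (2010), Nucleic Acids Research 38, Database Issue, D211-222). By accessing the Pfam database, for example using any of the above websites, protein sequences can be queried against the HMMs using HMMER homology search software (e.g., HMMER2, HMMER3, or a higher version). Significant matches that identify a queried protein as being in a Pfam family (or as having a particular Pfam domain) are those in which the bit score is greater than or equal to the gathering threshold for the Pfam domain. Expectation values (e-values) can also be used as a criterion for inclusion of a queried protein in a Pfam family or for determining whether a queried protein has a particular Pfam domain, where low e-values (much less than 1.0, for example less than 0.1, or less than or equal to 0.01) represent low probabilities that a match is due to chance.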
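The two significance criteria described above (bit score at or above the gathering threshold, and a low e-value) reduce to simple comparisons. The sketch below is illustrative only: the function names are hypothetical, the numeric values in the example are invented rather than taken from a real Pfam entry, and in practice the bit score, e-value, and gathering threshold would be read from HMMER output and the Pfam model file.

```python
def meets_gathering_threshold(bit_score, gathering_threshold):
    """A match is significant when its bit score is at least the
    family-specific gathering threshold."""
    return bit_score >= gathering_threshold


def evalue_supports_match(e_value, cutoff=0.01):
    """Alternative criterion: a low e-value means the match is unlikely to
    be due to chance. The 0.01 default is one of the cutoffs named above."""
    return e_value <= cutoff


# Invented example values for a single query-versus-domain comparison.
example_bit_score = 85.3
example_threshold = 25.0
example_e_value = 3.2e-24
```

Using both checks together is a conservative choice; the definition above allows either the gathering threshold or the e-value to serve as the criterion.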
如本文所使用的,术语“光合生物”是指可以将光能转化为化学能的生物。在一些实施例中,化学能可以稍后被释放以给这些生物的活动提供燃料(能量转化)。在一些实施例中,这种化学能储存于碳水化合物分子(如糖)中,所述碳水化合物分子从二氧化碳和水中合成。
光合生物的非限制性实例包含植物、藻类和蓝藻细菌。藻类的非限制性实例属于属:曲壳藻属、茧形藻属、双眉藻属、纤维藻属、星胞藻属、黄金色藻属、波利氏藻属、包特氏菌属、气球藻属、葡萄藻属、荀球藻属、角毛藻属、四鞭藻属、衣藻属、绿球藻属、绿梭藻属、小球藻属、蓝隐藻属、金球藻属、球钙板藻属、隐甲藻属、隐藻属、小环藻属、杜氏藻属、椭球藻属、球石藻属、独球藻属、衣迪斯藻属、裸藻属、真眼点藻属、伏氏藻属、脆杆藻属、丽丝藻属、红球藻属、嗜盐古菌、异弯藻属、膜胞藻属、等鞭金藻属、鳞孔藻属、微芒藻属、蒜头藻属、单针藻属、微球藻属、微拟球藻属、舟形藻属、新绿藻属、肾鞭藻属、肾爿藻属、菱形藻属、棕鞭藻属、鞘藻属、卵囊藻属、蚝球藻属、巴夫藻属、拟小球藻属、帕氏藻属、普莱格门、褐指藻属、噬菌体属、微绿藻属、扁藻属、颗石藻属、宽球藻属、原壁菌属、假小球藻属、拟新绿藻属、假十字趾藻属、塔胞藻属、葡串藻属、栅列藻属、骨条藻属、螺旋藻属、裂丝藻属、四爿藻属、海链藻属、黄丝藻属、无隔藻属、鲜绿球藻属、魏氏藻属和团藻属。
植物的非限制性实例包含沙生芥(Arabidopsis arenicola)、沙芥(Arabidopsisarenosa)、Arabidopsis cebennensis、Arabidopsis croatica、圆叶拟南芥、琴叶拟南芥、Arabidopsis neglecta、Arabidopsis pedemontana、Arabidopsis suecica、拟南芥、玉米、稻、小麦、马铃薯、洋葱、大蒜、大豆、番茄、陆地棉、草本棉(Gossypium herbaceum)、木本棉(Gossypium arboreum)、毛棉(Gossypium tomentosum)、黑芥(Brassica nigra)和芸薹属(Brassica sp)。
如本文所使用的,术语“突变体光合生物”或“突变体藻类”是指一种光合生物或藻类,其中至少SGI1、SGI2、SGI1和SRP54的组合、SGI2和SRP54的组合或SGI1、SGI2和SRP54的组合被调节。此类调节可以包含核酸序列的改变或一个或多个基因表达的改变。
如本文所使用的,SGI1和SRP54基因的组合的调节是指在同一光合生物中的SGI1的调节和SRP54基因的调节。类似地,SGI2和SRP54基因的组合的调节是指在同一光合生物中的SGI2的调节和SRP54基因的调节。同样地,SGI1、SGI2和SRP54基因的组合的调节是指在同一光合生物中的SGI1的调节、SGI2的调节和SRP54基因的调节。
如本文所使用的,术语对照光合生物是指一种光合生物,所述光合生物在所有相关方面与突变体光合生物在基因上基本上相同,除了对照光合生物不具有突变或弱化的SRP54、SGI1、SGI2或两个或更多个基因的组合之外。例如,对照光合生物是同一物种,并且除了改变cpSRP54、胞质SRP54、SGI1或SGI2基因或用于使cpSRP54弱化的构建体之外,存在于突变体中的胞质SRP54、SGI1、SGI2基因在基因上是相同的,除了不影响通过正常繁殖在诱变期间可能发生的细胞生理学的小的基因组变化(例如,“SNP”)之外。在各个实施例中,对照光合生物是一种菌株,从所述菌株中衍生出具有胞质SRP54、cpSRP54、SGI1、SGI2或至少两个基因的组合的弱化表达的突变体光合生物。
当提及光合生物(如藻类)时,术语“适应低光”意指所述光合生物在暴露于低光强度持续一段时间之后具有增加的叶绿素和光合性质,所述时间足以使叶绿素和光合性质的改变在低光条件下稳定。低光可以是例如小于200μE·m-2·s-1,并且优选地,约100μE·m-2·s-1或更小或50μE·m-2·s-1或更小,并且用于适应的时间段可以是至少约四小时、至少约六小时、至少约八小时、或至少约十二小时、至少24小时或至少48个小时,并且可能长达2天、3天、4天或5天。
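A "cDNA" is a DNA molecule that comprises at least a portion of the nucleotide sequence of an mRNA molecule, with the exception that the DNA molecule substitutes the nucleobase thymine, or T, in place of uridine, or U, occurring in the mRNA sequence. A cDNA can be double-stranded or single-stranded and can be, for example, the complement of the mRNA sequence. In preferred examples, a cDNA does not include one or more intron sequences that occur in the naturally occurring gene to which the cDNA corresponds (i.e., the gene as it occurs in the genome of an organism). For example, a cDNA can have sequences from upstream of an intron of a naturally occurring gene juxtaposed to sequences downstream of the intron of the naturally occurring gene, where the upstream and downstream sequences are not juxtaposed in a DNA molecule in nature (i.e., the sequences are not juxtaposed in the naturally occurring gene). A cDNA can be produced by reverse transcription of mRNA molecules, or can be synthesized, for example, by chemical synthesis and/or by using one or more restriction enzymes, one or more ligases, one or more polymerases (including, but not limited to, high-temperature-tolerant polymerases that can be used in polymerase chain reactions (PCRs)), one or more recombinases, etc., based on knowledge of the cDNA sequence, where the knowledge of the cDNA sequence can optionally be based on the identification of coding regions from genome sequences or compiled from the sequences of multiple partial cDNAs.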
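The relationship between an mRNA sequence and its cDNA (T in place of U, optionally as the complementary strand) can be shown in a few lines. This is a minimal sketch with hypothetical function names and an invented six-base example; it handles only the unambiguous bases A, C, G, and U/T.

```python
def mrna_to_cdna_sense(mrna):
    """Return the sense-strand cDNA: same sequence with T in place of U."""
    return mrna.upper().replace("U", "T")


def reverse_complement(dna):
    """Return the complementary strand, written 5' to 3'."""
    complement = str.maketrans("ACGT", "TGCA")
    return dna.upper().translate(complement)[::-1]


sense = mrna_to_cdna_sense("AUGGCU")    # sense-strand cDNA
antisense = reverse_complement(sense)   # complement of the mRNA sequence
```

For the example, `sense` is "ATGGCT" and `antisense` is "AGCCAT".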
“在低光适应中失调的”藻类突变体(或“在高光适应下锁定”或LIHLA突变体)是不展现出表型和基因表达变化的突变体,所述变化是低光适应的野生型藻类细胞的特征,包含:叶绿素的显著增加和大部分光收获复合蛋白(LHCP)基因的表达的显著增加。当适应低光时,在低光适应中失调的藻类突变体相对于在野生型细胞的低光适应期间上调的多个基因(例如,至少十个、至少二十个、至少三十个、至少四十个或至少五十个基因)的低光适应的野生型细胞具有减少的表达。进一步地,在低光适应中失调的藻类突变体相对于在野生型细胞的低光适应期间下调的(例如,至少五个、至少六个、至少七个、至少八个、至少九个或至少十个基因)低光适应的野生型细胞具有增加的基因表达。进一步地,如本文所公开的,当突变体和野生型细胞两者均适应低光时,在低光适应中失调的藻类突变体可以具有与野生型细胞的光合性质显著不同的光合性质。
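"Photosynthetic properties", "photophysiological properties", or "photophysiological parameters" include, without limitation, the maximal photosynthetic rate, Pmax (calculated on a per-cell or per-mg-chlorophyll basis); the intensity at which photosynthesis saturates, Ek (as measured by oxygen evolution); and α ("alpha"), the initial slope of the photosynthesis (oxygen evolution) versus irradiance intensity (P/I) curve. Additional photosynthetic properties include various parameters that can be measured using fluorescence detection, including, for example, the maximum quantum yield of photochemistry in photosystem II, Fv/FM; the photosynthetic quantum yield of photosystem II (PSII), φPSII; photochemical quenching, or the proportion of open PSII centers, qP; non-photochemical quenching, NPQ; the PSII electron transport rate, ETRPSII; the PSI electron transport rate, ETRPSI; the functional absorption cross section of PSI (σPSI); and the functional absorption cross section of PSII (σPSII). The listing here is not exhaustive, and the terms do not exclude other parameters that measure various aspects of photosynthesis.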
提及“基本上相同”的性质旨在意指这些性质处于参考值的10%以内,并且优选地处于5%以内。
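The "substantially the same" criterion above is a relative tolerance check. A minimal sketch follows, assuming that "within 10%" means the absolute difference is at most 10% of the reference value; the function name and the example Fv/FM values are hypothetical.

```python
def substantially_same(value, reference, tolerance=0.10):
    """True if value lies within tolerance (as a fraction) of reference.
    Use tolerance=0.05 for the preferred 5% criterion."""
    if reference == 0:
        raise ValueError("reference value must be non-zero")
    return abs(value - reference) <= tolerance * abs(reference)


# Invented example: comparing two Fv/FM readings against a reference of 0.70.
close_reading = substantially_same(0.68, 0.70)
distant_reading = substantially_same(0.60, 0.70)
```

In the example, 0.68 is within 10% of 0.70 while 0.60 is not.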
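Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, suitable methods and materials are described below. The materials, methods, and examples are illustrative only and are not intended to be limiting. Other features and advantages of the invention will be apparent from the detailed description and from the claims.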
基因弱化
突变体光合生物可以是通过任何可行的方法生成的突变体,所述方法包含但不限于UV辐照、γ辐照或化学诱变,并且筛选具有本文所公开的光合性质的低叶绿素突变体。用于生成微生物菌株的突变体的方法是众所周知的。可以通过本领域已知的方法来鉴定突变体,包含例如基因组测序、PCR、cpSRP54或cytoSRP54蛋白的免疫检测以及表达分析(例如,逆转录/PCR)。
本文所提供的突变体光合生物也可以在SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54的组合中进行基因工程化,例如,其已经通过同源重组靶向敲除或基因置换(例如,用可以编码相对于野生型多肽活性降低的多肽的基因突变形式)。在另外的实例中,可以通过定点同源重组来工程化兴趣藻类菌株,以将具有或不具有表达控制序列的特定兴趣基因(如启动子)插入特定基因组基因座中,或将启动子插入宿主微生物的遗传基因座中,以影响特定基因或一组基因在基因座处的表达。
例如,通过同源重组进行的基因敲除或置换可以通过核酸(例如,DNA)片段的转化来进行,所述核酸片段包含与待改变的基因组区域同源的序列,其中所述同源序列被外源序列(通常为允许选择整合的构建体的选择性标记基因)间断。在外源序列或突变的基因序列的任一侧上的基因组同源侧翼序列的长度可以是例如至少50个、至少100个、至少200个、至少300个、至少400个、至少500个、至少600个、至少700个、至少800个、至少900个、至少1,000个、至少1,200个、至少1,500个、至少1,750个或至少2,000个核苷酸。基因敲除或基因“敲入”构建体(其中外源序列侧翼为靶基因序列)可以提供于载体中,所述载体可以任选地被线性化,例如,在经历同源重组的区域之外被线性化,或可以被提供为不在载体的背景下的线性片段,例如,敲除或敲入构建体可以是分离的或合成的片段,包含但不限于PCR产物。在一些实例中,分裂标记系统可以用于通过同源重组生成基因敲除,其中可以引入两个DNA片段,所述片段可以再生可选择标记并且通过三个交换事件破坏兴趣基因座(Jeong等人(2007),《FEMS微生物学快报(FEMS Microbiol Lett)》273:157-163)。
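The knockout construct layout described above (genomic homology arms flanking an exogenous sequence such as a selectable marker) can be sketched as string assembly. This is an illustration under stated assumptions, not a construct design tool: the function name is hypothetical, coordinates are 0-based and half-open, the target region is assumed to be replaced outright by the marker, and the 50-nucleotide minimum is the shortest flank length mentioned in the paragraph above.

```python
def build_knockout_construct(genome, target_start, target_end, marker,
                             flank_length=500):
    """Return left homology arm + marker + right homology arm.
    The region genome[target_start:target_end] is replaced by the marker."""
    if flank_length < 50:
        raise ValueError("flanks shorter than 50 nt are outside the stated range")
    if target_start - flank_length < 0 or target_end + flank_length > len(genome):
        raise ValueError("not enough genomic sequence for the requested flanks")
    left_arm = genome[target_start - flank_length:target_start]
    right_arm = genome[target_end:target_end + flank_length]
    return left_arm + marker + right_arm


# Toy sequences invented for illustration: a 4-nt target inside a 124-nt locus.
toy_genome = "A" * 60 + "CCCC" + "G" * 60
toy_construct = build_knockout_construct(toy_genome, 60, 64, "TTTT",
                                         flank_length=50)
```

The toy construct consists of 50 nucleotides of upstream homology, the 4-nucleotide placeholder marker, and 50 nucleotides of downstream homology.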
一方面,本发明提供了经过基因修饰的生物,例如具有一种或多种用于使SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的表达弱化的基因修饰的微生物。如本文所使用的,“弱化SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的表达”意指以减少全功能性蛋白质产生的任何方式减少或消除一种或多种上述基因的表达。
例如,被工程化为具有弱化的SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的表达的重组光合生物可以具有破坏的SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合,其中所述重组微生物可以具有SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合,其包含至少一个减少或消除基因表达的插入、突变或缺失,使得不产生或以比同一物种的对照光合生物所产生的量更低的量产生全功能性SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合或cytoSRP54基因。被破坏的SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合、SGI2和cpSRP54的组合或cytoSRP54基因可以通过例如,通过同源重组和/或通过大范围核酸酶、锌指核酸酶的活性介导的插入或基因置换而被破坏(Perez-Pinera等人,(2012),《化学生物学当代评论(Curr.Opin.Chem.Biol.)》16:268-277),TALEN(WO 2014/207043;WO2014/076571)或RNA引导的核酸内切酶,如CRISPR系统的cas蛋白(例如,Cas9蛋白)。
CRISPR系统,最近由Hsu等人(《细胞(Cell)》157:1262-1278,2014)综述,除了Cas核酸酶多肽或复合物之外,还包含靶向RNA(通常称为“crRNA”),其通过与靶位点序列的互补性与基因组靶位点相互作用,还包含反式激活(“tracr”)RNA,其与Cas多肽复合,并且还包含与靶向crRNA结合(通过互补性)的区域。
本发明考虑了两个RNA分子(“crRNA”和“tracrRNA”)的用途,这两个RNA分子可以被共转化到表达cas蛋白或用cas蛋白转染的宿主菌株(或在宿主菌株中表达)中以用于基因组编辑,或包含与靶序列互补的序列以及与cas蛋白相互作用的序列的单一指导RNA的用途。也就是说,在一些策略中,本文所使用的CRISPR系统可以包括两个分开的RNA分子(RNA多核苷酸:“tracr-RNA”和“targeter-RNA”或“crRNA”,参见下文),并且在本文中被称为“双分子DNA靶向RNA(double-molecule DNA-targeting RNA)”或“双分子DNA靶向RNA(two-molecule DNA-targeting RNA)”。可替代地,如实例中所示出的,DNA靶向RNA还可以包含用于与Cas蛋白相互作用的反式激活序列(除了靶向同源(“cr”)序列之外),即DNA靶向RNA可以是单个RNA分子(单个RNA多核苷酸),并且在本文中被称为“嵌合引导RNA”、“单引导RNA”或“sgRNA”。术语“DNA靶向RNA”和“gRNA”是包含性的,是指双分子DNA靶向RNA和单分子DNA靶向RNA(即,sgRNA)两者。单分子引导RNA和两种RNA系统均已经在文献中并且例如,在美国专利申请公开号US 2014/0068797中进行了详细描述,所述文献通过引用整体并入本文。
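Any Cas protein can be used in the methods herein, e.g., Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csn1 and Csx12), Cas10, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csx1, Csx15, Csf1, Csf2, Csf3, Csf4, homologs thereof, or modified versions thereof. In some embodiments, the Cas protein is a Class II Cas protein. As a non-limiting example, the Cas protein can be a Cas9 protein, such as a Cas9 protein of Streptococcus pyogenes, Streptococcus thermophilus, Streptococcus pneumoniae, Staphylococcus aureus, or Neisseria meningitidis. Other Cas proteins of interest include, without limitation, the Cpf1 RNA-guided endonuclease (Zetsche et al. (2015), Cell 163:1-13) as well as the C2c1, C2c2, and C2c3 RNA-guided nucleases (Shmakov et al. (2015), Molecular Cell 60:1-13). Also contemplated are the Cas9 proteins provided as SEQ ID NOs:1-256 and 795-1346 in U.S. Patent Application Publication No. US 2014/0068797, chimeric Cas9 proteins that may combine domains from more than one Cas9 protein, and variants and mutants of identified Cas9 proteins. (For example, a Cas9 protein encoded by a nucleic acid molecule introduced into a host cell can comprise at least one mutation with respect to a wild-type Cas9 protein; for example, the Cas9 protein can be inactivated in one of the cleavage domains of the protein, resulting in a "nickase" variant. Non-limiting examples of mutations include D10A, H840A, N854A, and N863A.) The nucleic acid sequence encoding the Cas protein can be codon-optimized for the host cell of interest.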
Cas核酸酶活性切割靶DNA以产生双链断裂。然后通过细胞以两种方式之一来修复这些断裂:非同源末端连接或同源性定向修复。在非同源末端连接(NHEJ)中,双链断裂通过将断裂末端彼此直接连接来修复。在这种情况下,没有新的核酸材料插入到位点中,但是一些核酸材料可能丢失,从而导致缺失或改变,通常导致突变。在同源性定向修复中,将可以与切割的靶DNA序列具有同源性的供体多核苷酸(有时称为“供体DNA”或“编辑DNA”)用作用于修复切割的靶DNA序列的模板,从而导致将基因信息从供体多核苷酸转移到靶DNA中。这样,新的核酸材料可以插入/复制到位点中。由于NHEJ和/或同源性定向修复(例如,使用供体DNA分子)引起的靶DNA的修饰可以导致例如基因校正、基因置换、基因标记、转基因插入、核苷酸缺失、基因破坏、基因突变等。
在一些实例中,通过定点修饰多肽(例如,Cas核酸酶、锌指核酸酶、大范围核酸酶或TALEN)切割DNA可以用于通过切割靶DNA序列并且允许细胞在不存在外源提供的供体多核苷酸的情况下修复序列来从靶DNA序列中缺失核酸材料。此类NHEJ事件可以在切割的末端的重新连接的位点处导致突变(“错误修复”),从而导致基因破坏。
可替代地,如果将DNA靶向RNA以及供体DNA共同施用于表达cas核酸酶的细胞,则主题方法可以用于向靶DNA序列添加(即,插入或置换)核酸材料(例如,通过插入诱变“敲除”或“敲入”编码蛋白质(例如,可选择标记和/或任何兴趣蛋白质)、siRNA、miRNA等核酸,以修饰核酸序列(例如,引入突变)。
在特定实施例中,供体DNA可以包含基因调控序列(例如,启动子),所述基因调控序列可以使用CRISPR靶向,插入基因的编码区上游和基因的推定的近端启动子区的上游,例如,cpSRP54基因的编码区起始ATG上游至少50bp、至少100bp、至少120bp、至少150bp、至少200bp、至少250bp、至少300bp、至少350bp、至少400bp、至少450bp或至少500bp。供体DNA可以包含可能干扰天然启动子的序列,如可选择标记或任何方便的序列。插入在SGI1、SGI2、cpSRP54、cytoSRP54、基因的组合或开放阅读框(例如,在5'UTR或cpSRP54基因的转录起始位点的上游)的组合的起始ATG上游的另外的序列可以减少或甚至消除内源性SGI1、SGI2、cpSRP54、cytoSRP54、基因的组合的表达。可替代地或另外,天然SGI1、SGI2、cpSRP54、cytoSRP54或基因的组合可以使其内源性启动子全部或部分地被较弱或不同调控的启动子或非启动子序列置换。
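In some examples, the nucleic acid molecule introduced into a host cell for generating a high-efficiency genome editing cell line encodes a Cas9 enzyme that is mutated with respect to the corresponding wild-type enzyme such that the mutated Cas9 enzyme lacks the ability to cleave one or both strands of a target polynucleotide containing a target sequence. For example, an aspartate-to-alanine substitution (D10A) in the RuvC I catalytic domain of Cas9 from Streptococcus pyogenes converts Cas9 from a nuclease that cleaves both strands to a nickase (an enzyme that cleaves a single strand). Other examples of mutations that render Cas9 a nickase include, without limitation, H840A, N854A, and N863A. In some embodiments, a Cas9 nickase can be used in combination with one or more guide sequences, e.g., two guide sequences, that target the sense and antisense strands of the DNA target, respectively. This combination allows both strands to be nicked and used to induce NHEJ. Two nickase targets (in close proximity but on different strands of the DNA) can be used to induce mutagenic NHEJ. Such targeting of a locus using enzymes that cleave opposite strands at staggered positions can also reduce non-target cleavage, because both strands must be accurately and specifically cleaved to achieve genome mutation.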
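In additional examples, a mutant Cas9 enzyme that is impaired in its ability to cleave DNA can be expressed in a cell into which one or more guide RNAs targeting a sequence upstream of the transcriptional or translational start site of the targeted gene are also introduced. In this case, the Cas enzyme can bind the target sequence and block transcription of the targeted gene (Qi et al. (2013), Cell 152:1173-1183).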
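In some cases, a Cas polypeptide such as a Cas9 polypeptide is a fusion polypeptide comprising, e.g.: i) a Cas9 polypeptide (which can optionally be a variant Cas9 polypeptide as described above); and ii) a covalently linked heterologous polypeptide (also referred to as a "fusion partner"). A heterologous nucleic acid sequence can be linked to another nucleic acid sequence (e.g., by genetic engineering) to generate a chimeric nucleotide sequence encoding a chimeric polypeptide. In some embodiments, a Cas9 fusion polypeptide is generated by fusing a Cas9 polypeptide with a heterologous sequence that provides for subcellular localization (i.e., the heterologous sequence is a subcellular localization sequence, e.g., a nuclear localization signal (NLS) for targeting to the nucleus; a mitochondrial localization signal for targeting to the mitochondria; a chloroplast localization signal for targeting to a chloroplast; an ER retention signal; and the like). In some embodiments, the heterologous sequence can provide a tag (i.e., the heterologous sequence is a detectable label) for ease of tracking and/or purification (e.g., a fluorescent protein such as green fluorescent protein (GFP), YFP, RFP, CFP, mCherry, tdTomato, and the like; a hemagglutinin (HA) tag; a FLAG tag; a Myc tag; and the like).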
宿主细胞可以用例如载体构建体进行基因工程化(例如转导、转化或转染),所述载体构建体可以是例如用于同源重组的载体,所述载体包含与SGI1、SGI2、cpSRP54、cytoSRP54的一部分、SGI1和cpSRP54基因的组合或宿主细胞的SGI2和cpSRP54基因基因座的组合或与其相邻的区域同源的核酸序列,或可以是用于表达以下任何一种或组合的表达载体:Cas蛋白(例如,II类Cas蛋白)、CRISPR嵌合引导RNA、crRNA和/或tracrRNA、RNAi构建体(例如,shRNA)、反义RNA或核酶。载体可以是例如质粒、病毒颗粒、噬菌体等形式。用于表达用于基因组编辑的多肽或RNA的载体也可以设计用于整合到宿主中,例如通过同源重组。含有本文所描述的多核苷酸序列的载体,例如,与宿主SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因序列(包含cpSRP54或cytoSRP54编码序列的上游和下游的序列)的组合具有同源性的序列,以及任选地,可选择标记或报告基因可以用于转化合适的宿主,以导致SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的弱化。
在一些实例中,重组光合生物可以减少但不消除SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的表达,并且重组光合生物可以具有叶绿素约10%到约90%的减少,例如,总叶绿素约20%减少到约80%的减少。如本文提供的经过基因修饰的微生物可以在一些实例中包含用于弱化SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的表达的核酸构建体。例如,宿主微生物可以包含用于表达减少SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的表达的RNAi分子、核酶或反义分子的构建体。在一些实例中,如本文所提供的重组微生物可以包含用于减少SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的表达的至少一种引入的(外源或非天然的)构建体。
使用本领域已知的方法,例如,RNA-Seq或逆转录PCR(RT-PCR),可以选择工程化菌株用于表达SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合,其相对于不包含用于弱化SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因表达的组合的基因修饰的对照细胞减少,但不消除。
如本文所提供的基因工程化的菌株可以被工程化以包含用于通过减少对SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合进行编码的基因的mRNA的量、稳定性或翻译性来弱化基因表达的构建体。例如,使用本领域已知的方法,可以用靶向SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合的mRNA的反义RNA、RNAi或核酶构建体转化光合生物,如植物或藻类或不等鞭毛菌株(heterokont strain)。例如,可以将包含基因的全部或部分转录区的反义RNA构建体引入到微生物中以减少基因表达(Shroda等人(1999),《植物细胞(The PlantCell)》11:1165-78;Ngiam等人(2000),《应用环境微生物学期刊(Appl.Environ.Microbiol.)》66:775-782;Ohnuma等人(2009),《原生质(Protoplasma)》236:107-112;Lavaud等人(2012),《公共科学图书馆期刊(PLoS One)》7:e36806)。可替代地或另外,可以将靶向cpSRP54或cytoSRP54基因的RNAi构建体(例如,对短发夹RNA进行编码的构建体)引入到微生物(如藻类或不等鞭毛体)中,以用于减少cpSRP54或cytoSRP54基因的表达(参见,例如,Cerruti等人(2011),《真核細胞(Eukaryotic Cell)》(2011)10:1164-1172;Shroda等人(2006),《当代遗传学(Curr.Genet.)》49:69-84))。
核酶是以位点特异性方式切割核酸的RNA-蛋白质复合物。核酶具有拥有核酸内切酶活性的特异性催化结构域。例如,美国专利第5,354,855号报告了某些核酶可以充当核酸内切酶,其序列特异性大于已知的核糖核酸酶的序列特异性,并且接近DNA限制酶的序列特异性。催化性RNA构建体(核酶)可以被设计成与对本文所提供的基因进行编码的mRNA碱基配对,以切割mRNA靶。在一些实例中,核酶序列可以整合在反义RNA构建体内以介导靶的切割。可以考虑各种类型的核酶,其设计和用途是本领域已知的,并且例如在Haseloff等人(1988),《自然(Nature)》334:585-591中描述的。
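A ribozyme is targeted to a given sequence by annealing to the site through complementary base-pair interactions. Two stretches of homology are required for this targeting. These stretches of homologous sequence flank the catalytic ribozyme structure defined above. Each stretch of homologous sequence can vary in length from 7 to 15 nucleotides. The only requirement for defining the homologous sequences is that, on the target RNA, they are separated by a specific sequence that is the cleavage site. For a hammerhead ribozyme, the cleavage site is a dinucleotide sequence on the target RNA consisting of a uracil (U) followed by an adenine, cytosine, or uracil (A, C, or U) (Thompson et al. (1995) Nucleic Acids Research 23:2250-68). The frequency of this dinucleotide occurring in any given RNA is statistically 3 out of 16. Therefore, for a given target messenger RNA of 1,000 bases, about 187 dinucleotide cleavage sites are statistically possible.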
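By way of a non-limiting illustration, the 3-in-16 estimate above can be reproduced with a short Python sketch. The function names are hypothetical, and the expectation assumes that the four bases are equally frequent and independent, which a real transcript only approximates.

```python
from fractions import Fraction

# Hammerhead cleavage dinucleotides: U followed by A, C, or U.
CLEAVABLE = ("UA", "UC", "UU")


def expected_sites(length_nt):
    """Expected number of cleavable dinucleotides in an RNA of the given
    length, using the 3-in-16 frequency quoted in the text (assumes equal,
    independent base frequencies)."""
    return float(Fraction(3, 16) * length_nt)


def find_sites(rna):
    """Return 0-based positions of every U that starts a cleavable dinucleotide.
    DNA-style input (T) is accepted and treated as U."""
    rna = rna.upper().replace("T", "U")
    return [i for i in range(len(rna) - 1) if rna[i:i + 2] in CLEAVABLE]


print(expected_sites(1000))      # 3/16 of 1,000 bases
print(find_sites("GGUACCUUGA"))  # positions of UA and UU in a toy sequence
```

For a real target, `find_sites` would be run on the transcribed region of the gene of interest; each hit is only a candidate site, since accessibility of the flanking homology stretches is not considered here.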
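The general design and optimization of ribozyme-directed RNA cleavage activity has been discussed in detail (Haseloff and Gerlach (1988) Nature 334:585-591; Symons (1992) Ann Rev Biochem 61:641-71; Chowrira et al. (1994) J Biol Chem 269:25856-64; Thompson et al. (1995), supra). Designing and testing ribozymes for efficient cleavage of a target RNA is a process well known to those skilled in the art. Examples of scientific methods for designing and testing ribozymes are described by Chowrira et al. (1994), supra, and by Lieber and Strauss (1995) Mol Cell Biol 15:540-51, each of which is incorporated by reference. Identification of operative and preferred sequences for use in down-regulating a given gene is a matter of preparing and testing a given sequence, and is a routinely practiced "screening" method known to those skilled in the art.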
RNAi构建体的用途在上文所引用的文献中以及例如US2005/0166289和WO 2013/016267中进行了描述。将与靶基因具有同源性的双链RNA递送到细胞或通过RNAi构建体(例如,RNAi短发夹(sh)构建体)的表达在细胞中产生。构建体可以包含与靶基因相同的序列,或与靶基因的序列至少70%、80%、90%、95%或95%与100%之间相同。构建体可以具有至少20个、至少30个、至少40个、至少50个、至少100个、至少200个、至少300个、至少400个、至少500个、至少600个、至少700个、至少800个、至少900个、或至少1kb的与靶基因同源的序列。可以使用选择用于RNAi构建体(如产生shRNA的构建体)的连续或诱导型表达的启动子来工程化表达载体。
用于基因弱化的核酸构建体,例如核酶、RNAi或反义构建体,可以包含至少十五个、至少二十个、至少三十个、至少四十个、至少五十个或至少六十个核苷酸,所述核苷酸与内源性SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或待工程化的微生物的SGI2和cpSRP54基因的组合的序列的至少一部分具有至少80%同一性,如至少85%、至少90%、至少95%或至少99%或互补性。用于基因弱化的核酸构建体,例如核酶、RNAi或反义构建体,可以包含至少十五个、至少二十个、至少三十个、至少四十个、至少五十个或至少六十个核苷酸,所述核苷酸与天然存在的基因(如对多肽进行编码的基因,其与内源性SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合具有至少50%、至少55%、至少60%、至少65%、至少70%、至少75%、至少80%或至少85%、至少90%或至少95%序列同一性)的序列具有至少80%,如至少95%、或约100%同一性或互补性。例如,用于基因弱化的核酸构建体,例如核酶、RNAi或反义构建体,可以包含至少十五个、至少二十个、至少三十个、至少四十个、至少五十个或至少六十个核苷酸,所述核苷酸与天然存在的cpSRP54基因(如本文所提供的任何基因)的序列具有至少80%同一性或互补性。核苷酸序列可以是例如,约30个核苷酸到约3千碱基或更大,例如,长度为30-50个核苷酸、长度为50到100个核苷酸、长度为100到500个核苷酸、长度为500个核苷酸到1kb、长度为1kb到2kb或2到5kb。例如,反义序列的长度可以为约100个核苷酸到约1kb。例如,用于基因弱化的核酸构建体,例如核酶、RNAi或反义构建体,可以包含至少十五个、至少二十个、至少三十个、至少四十个、至少五十个、至少六十个或至少100个核苷酸,所述核苷酸与内源性SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因的组合或其部分具有至少50%、至少55%、至少60%、至少65%、至少70%、至少75%、至少80%、或至少85%,例如,至少86%、至少87%、至少88%、至少89%、至少90%、至少91%、至少92%、至少93%、至少94%或至少95%同一性或互补性。
反义、RNAi或核酶构建体中所使用的启动子可以是在宿主生物中起作用并且适合于将靶基因的表达减少到期望量所需的表达水平的任何启动子。在藻类和不等鞭毛体中起作用的启动子是本领域已知的并且在本文中公开的。可以使用任何可行的方法(包含本文公开的任何方法)将构建体转化为藻类。用核酸分子(所述核酸分子用于弱化SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54基因表达的组合,如但不限于反义、RNAi或核酶构建体)转化的重组生物或微生物,可以具有本文所描述的SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54突变体的组合的性质,包含例如,相对于不包含导致弱化的基因表达的外源核酸分子的宿主生物或微生物,减少的叶绿素、增加的光合效率以及增加的培养生产力。
核酸分子和构建体
本领域的技术人员应了解,许多转化方法可以用于微生物的基因转化,并且因此可用于本发明的方法。“稳定转化”旨在意指引入到生物中的核酸构建体整合到生物的基因组中,或是稳定的附加型构建体的一部分,并且能够被其后代遗传。“瞬时转化”旨在意指将多核苷酸引入到生物中,并且不整合到基因组中,或以其它方式通过连续世代而被建立并稳定地遗传。
遗传转化可以导致转基因、来自细胞核或质体的构建体的稳定插入和/或表达,并且在一些情况下可以导致转基因的瞬时表达。这些转化方法也可以用于引入引导RNA或编辑DNA。已经报告,对于多于30种不同的微藻株(其属于绿藻、红藻和褐藻、硅藻、裸藻藻和鞭毛藻的至少约22个物种)而言,微藻的基因转化是成功的(参见,例如,Radakovits等人,《真核细胞》,2010;以及Gong等人,《工业微生物学与生物技术期刊(J.Ind.Microbiol.Biotechnol.)》,2011)。此类有用的转化方法的非限制性实例包含在玻璃珠或碳化硅晶须存在下的搅拌细胞,如由以下报告的,例如,Dunahay,《生物技术(Biotechniques)》,15(3):452–460,1993;Kindle,《美国国家科学院院刊》,1990;Michael和Miller,《植物学杂志(Plant J.)》,13,427–435,1998。电穿孔技术已经成功用于若干种微藻物种的基因转化,所述微藻物种包含微拟球藻属(参见例如,Chen等人,《藻类学杂志(J.Phycol.)》,44:768–76,2008)、小球藻属(参见,例如,Chen等人,《当代遗传学》,39:365-370,2001;Chow和Tung,《植物细胞报告(Plant Cell Rep.)》,第18卷,第9期,778-780,1999)、衣藻属(Shimogawara等人,《遗传学(Genetics)》,148:1821–1828,1998)、杜氏藻属(Sun等人,《分子生物技术(Mol.Biotechnol.)》,30(3):185-192,2005)。微弹轰击,也称为微粒轰击、基因枪转化或生物弹轰击,已经成功用于若干种藻类物种,包含例如硅藻物种(如褐指藻属)(Apt等人,《分子和普通遗传学(Mol.Gen.Genet.)》,252:572-579,1996)、小环藻属和舟形藻属(Dunahay等人,《藻类学杂志》,31:1004–1012,1995)、细柱藻属(Fischer等人,《藻类学杂志》,35:113-120,1999)以及角毛藻属(Miyagawa-Yamaguchi等人,《藻类学研究(Phycol.Res.)》59:113–119,2011),以及绿色藻类物种如小球藻属(El-Sheekh,《植物生物学(Biologia Plantarum)》,第42卷,第2期:209-216,1999)和团藻物种(Jakobiak等人,《原生生物(Protist)》,155:381-93,2004)。另外,农杆菌介导的基因转移技术也可用于微藻的基因转化,如已经由以下所报告的,例如,Kumar,《植物科学(Plant Sci.)》,166(3):731-738,2004以及Cheney等人,《藻类学杂志》,第37卷,增刊11,2001。
如本文所描述的转化载体或构建体将典型地包括标记基因,所述标记基因在靶宿主细胞(例如,藻类细胞)上赋予可选择或可评分的表型,或可以与包含标记的构建体共转化。已经成功地开发了用于有效分离藻类的基因转化体的许多可选择标记。常见的可选择标记包含抗生素抗性、荧光标记和生物化学标记。若干种不同的抗生素抗性基因已经成功地用于选择微藻转化体,包含灭瘟素(blastocydin)、博来霉素(bleomycin)(参见例如,Apt等人,1996,见上文;Fischer等人,1999,见上文;Fuhrmann等人,《植物杂志》,19,353-61,1999,Lumbreras等人,《植物学杂志》,14(4):441-447,1998;Zaslavskaia等人,《藻类学杂志》,36:379–386,2000)、壮观霉素(spectinomycin)(Cerutti等人,《遗传学》,145:97–110,1997;Doetsch等人,《当代遗传学》,39,49-60,2001;Fargo,《分子细胞生物学》,19:6980-90,1999)、链霉素(streptomycin)(Berthold等人,《原生生物》,153:401-412,2002)、巴龙霉素(paromomycin)(Jakobiak等人,《原生生物》见上文;Sizova等人,《基因(Gene)》,277:221-229,2001)、诺尔丝菌素(nourseothricin)(Zaslavskaia等人,2000,见上文)、G418(Dunahay等人,1995,见上文;Poulsen和Kroger,《FEBS快报(FEBS Lett.)》,272:3413–3423,2005;Zaslavskaia等人,2000,见上文)、潮霉素(hygromycin)(Berthold等人,2002,见上文)、氯霉素(Poulsen和Kroger,2005,见上文)以及许多其它微藻转化体。用于微藻类(如衣藻属)的另外可选择标记可以是提供对以下的抗性的标记:卡那霉素和阿米卡星抗性(Bateman,《分子和普通遗传学》263:404-10,2000)、沙阿霉素(zeomycin)和腐草霉素(phleomycin)(例如,ZEOCINTM腐草霉素D1)抗性(Stevens,《分子和普通遗传学》251:23-30,1996)以及巴龙霉素(paramomycin)和新霉素(neomycin)抗性(Sizova等人,2001,见上文)。已经使用的其它荧光或生色标记包含荧光素酶(Falciatore等人,《水产生物技术期刊(J.Mar.Biotechnol.)》,1:239–251,1999;Fuhrmann等人,《植物分子生物学(PlantMol.Biol.)》,2004;Jarvis和Brown,《当代遗传学》,19:317–322,1991)、β-葡萄糖醛酸酶(Chen等人,2001,见上文;(Cheney等人,2001,见上文;Chow和Tung,1999,见上文;El-Sheekh,1999,见上文;Falciatore等人,1999,见上文;Kubler等人,《水产生物技术期刊》,1:165–169,1994)、β-半乳糖苷酶(Gan等人,《应用藻类学期刊(J.Appl.Phycol.)》,15:345–349,2003;Jiang等人,《植物细胞报告》,21:1211–1216,2003;Qin等人,《高技术通讯(HighTechnol.Lett.)》,13:87–89,2003)和绿色荧光蛋白(GFP)(Cheney等人,2001,见上文;Ender等人,《植物细胞》,2002;Franklin等人,《植物学杂志》,2002;56,148,210)。
本领域的技术人员将容易理解,多种已知的启动子序列可以有用地部署用于根据本发明的微藻物种的转化系统。例如,通常用于驱动微藻中的转基因表达的启动子包含多种形式的花椰菜花叶病毒启动子35S(CaMV35S),其已经用于甲藻门(dinoflagellate)和绿藻门(Chlorophyta)两者(Chow等人,《植物细胞报告》,18:778-780,1999;Jarvis和Brown,《当代遗传学》,317-321,1991;Lohuis和Miller,《植物学杂志》,13:427-435,1998)。还报告了来自猿猴病毒的SV40启动子在若干种藻类中是有活性的(Gan等人,《应用植物学期刊》,151345-349,2003;Qin等人,《水生生物学(Hydrobiologia)》398-399,469-472,1999)。来自衣藻属的RBCS2(二磷酸核酮糖羧化酶,小亚基)(Fuhrmann等人,《植物学杂志》,19:353-361,1999)和PsaD(光系统I复合物的丰富蛋白质;Fischer和Rochaix,《FEBS快报》581:5555-5560,2001)的启动子也可以是有用的。HSP70A/RBCS2和HSP70A/β2TUB(微管蛋白)的融合启动子(Schroda等人,《植物学杂志》,21:121-131,2000)也可以用于改善的转基因表达,其中HSP70A启动子在置于其它启动子上游时可以充当转录激活因子。兴趣基因的高水平表达也可以在例如硅藻物种中,在对硅藻岩藻黄质-叶绿素a/b结合蛋白进行编码的fcp基因(Falciatore等人,《水产生物技术期刊》,1:239-251,1999;Zaslavskaia等人,《藻类学杂志》36:379-386,2000)或对黄绿藻门黄质-叶绿素a/b结合蛋白进行编码的vcp基因(参见美国专利第8,318,482号)的启动子的控制下实现。如果需要,诱导型启动子可以在转基因微藻类中提供快速且严格控制的基因表达。例如,对硝酸盐还原酶进行编码的NR基因的启动子区可以用作此类诱导型启动子。NR启动子活性通常被铵抑制并在铵被硝酸盐置换时被诱导(Poulsen和Kroger,《FEBS快报》272:3413-3423,2005),因此当微藻细胞在铵/硝酸盐存在下生长时,可以关闭或打开基因表达。可以在本文所提供的构建体和转化系统中发现用途的另外的藻类启动子包含以下专利申请中所公开的那些藻类启动子:美国专利第8,883,993号;美国专利申请公开号US2013/0023035;美国专利申请公开号US 2013/0323780;以及美国专利申请公开号US 2014/0363892。
宿主细胞可以是未转化的细胞或是已经用至少一种核酸分子转染的细胞。例如,被工程化以具有弱化cpSRP54基因表达的藻类宿主细胞可以进一步包含可以赋予任何所期望特性的一种或多种基因,如但不限于增加的兴趣生物分子(如一种或多种蛋白质、颜料、醇或脂质)的产量。
从光合生物产生产物的方法
本文还提供了通过培养具有增加的光合效率的光合生物来从光合生物(如藻类)产生产物的方法,如本文所公开的SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54突变体的组合。所述方法包含在合适的培养基中培养光合生物突变体SGI1、SGI2、cpSRP54、cytoSRP54、SGI1和cpSRP54基因的组合或SGI2和cpSRP54的组合以提供光合生物培养物,并且从所述培养物中回收生物质或至少一种产物。在一些实施例中,所述产物是脂质。包括光合生物的培养物优选地是光能自养培养物,并且培养基优选地不包含显著量的还原碳,即,培养物不包含藻类可以用于生长的形式或水平的还原碳。
在一些实施例中,光合生物可以在任何合适的容器中培养,包含烧瓶或生物反应器,其中所述光合生物可以暴露于人造光或自然光。可以在光/黑暗周期中培养包括突变的光合生物的培养物,所述光/黑暗周期可以是例如,自然的或程序化的光/黑暗周期,并且作为说明性实例,可以提供十二小时的光到十二小时的黑暗、十四小时的光到十小时的黑暗、十六小时的光到八小时的黑暗等。
培养是指通过使用所选和/或受控条件来有意促进一个或多个细胞的生长(例如,增加细胞大小、细胞含量和/或细胞活性)和/或繁殖(例如,通过有丝分裂增加细胞数量)。生长和繁殖两者的结合可以被称为增殖。如本文的实例所表明的,相对于在低光适应中未失调的同一菌株的培养野生型藻类细胞,本文所提供的对低光强度展现出失调的适应性的突变体可以随着时间推移(例如,在一周或更长的时间段内)实现培养物的更高细胞密度。例如,可以将cpSRP54突变体培养持续至少五天、至少六天、至少七天、至少八天、至少九天、至少十天、至少十一天、至少十二天、至少十三天、至少十四天或至少十五天,或至少一周、二周、三周、四周、五周、六周、七周、八周、九周或十周或更长时间。
可以用于培养重组微生物的所选和/或受控条件的非限制性实例可以包含使用限定的培养基(具有已知特征,如pH、离子强度和/或碳源)、指定温度、氧张力、二氧化碳水平、生物反应器中的生长等或其组合。在一些实施例中,可以使用光和还原碳源两者使微生物或宿主细胞混合营养生长。可替代地,微生物或宿主细胞可以光养培养。当光养生长时,藻类菌株可以有利地使用光作为能源。无机碳源(如CO2或碳酸氢盐)可以用于由微生物合成的生物分子。如本文所使用的,“无机碳”包含不能被生物用作可持续能源的含碳化合物或分子。通常,“无机碳”可以是CO2(二氧化碳)、碳酸、碳酸氢盐(bicarbonate salt)、碳酸盐、碳酸氢盐(bicarbonate salt)等或其组合的形式,其不能被进一步氧化以得到可持续的能量,也不能用作生物的还原力(reducing power)的来源。光能自养生长的微生物可以在培养基中生长,其中无机碳基本上是唯一碳源。例如,在其中无机碳基本上是唯一碳源的培养物中,可以在培养基中提供的任何有机(还原)碳分子或有机碳化合物不能由细胞吸收和/或代谢为能量和/或不以足以为细胞培养物的生长和增殖提供可持续能量的量存在。
根据本发明的方法可以使用的微生物和宿主细胞可以在世界各地的不同位置和环境中找到。用于脂质和/或其它产物的最佳繁殖和生成的特定生长培养基可以变化,并且可以被优化以促进生物质或产物(如脂质、蛋白质、色素、抗氧化剂)的生长、繁殖或产生。固体和液体生长培养基通常可从多种来源获得,如用于制备适用于多种微生物菌株的特定培养基的说明。例如,各种淡水和盐水培养基可以包含在Barsanti(2005),《藻类:解剖学、生物化学和生物技术(Algae:Anatomy,Biochemistry&Biotechnology)》,CRC出版社(CRCPress),用于培养藻类的培养基和方法中所描述的那些淡水和盐水培养基。藻类培养基配方也可以在各种藻类培养物保藏中心的网站上找到,包含作为非限制性实例,藻类的UTEX培养物保藏中心(www.sbs.utexas.edu/utex/media.aspx);藻类和原生动物的培养物保藏中心(www.ccap.ac.uk);和Katedra Botaniky(botany.natur.cuni.cz/algo/caup-media.html)。
培养方法可以任选地包含诱导用于产生产物(如但不限于参与脂质产生的蛋白质、一种或多种蛋白质、抗氧化剂或色素)的一种或多种基因的表达,和/或调控微生物中的代谢途径。诱导表达可以包含向培养物中添加营养物或化合物、从培养基中去除一种或多种组分、增加或减少光和/或温度和/或促进兴趣基因表达的其它操作。此类操作可以在很大程度上取决于可操作地连接到兴趣基因的(异源)启动子的性质。
在本发明的一些实施例中,在适应低光强度下失调的微生物可以在配备有人工光源和/或具有一个或多个壁的“光生物反应器”中进行培养,所述一个或多个壁对于光(包含阳光)是足够透明的以使得、促进和/或维持可接受的微生物生长和增殖。为了产生脂肪酸产物或甘油三酯,光合微生物或宿主细胞可以另外地或可替代地在摇瓶、试管、小瓶、微量滴定皿(microtiter dishes)、皮氏培养皿(Petri dishes)等或其组合中进行培养。
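Additionally or alternatively, recombinant photosynthetic microorganisms or host cells can be grown in ponds, canals, sea-based growth containers, trenches, raceways, channels, or the like, or combinations thereof. In such systems, the temperature can be unregulated, or various heating or cooling methods or devices can be employed. As with standard bioreactors, a source of inorganic carbon (such as, but not limited to, CO2, bicarbonate, or carbonate salts), including, but not limited to, air, CO2-enriched air, flue gas, or the like, or combinations thereof, can be supplied to the culture. When supplying flue gas and/or other sources of inorganic carbon that may contain CO in addition to CO2, it may be necessary to pre-treat such sources so that the CO level introduced into the (photo)bioreactor does not constitute a dangerous and/or lethal dose with respect to the growth, proliferation, and/or survival of the microorganisms.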
突变体光合生物可以包含对用于产生产物的多肽进行编码的一种或多种非天然基因,所述产物如但不限于脂质、着色剂或色素、抗氧化剂、维生素、核苷酸、核酸、氨基酸、激素、细胞因子、肽、蛋白质或聚合物。例如,经过编码的多肽可以是酶、代谢调节剂、辅因子、载剂蛋白或转运蛋白。所述方法包含培养cpSRP54突变体或cytoSRP54突变体以产生生物质或至少一种藻类产物,所述突变体包含对参与产物产生的多肽进行编码的至少一种非天然基因。产物(如脂质和蛋白质)可以通过本领域的普通技术人员已知的回收方法从培养物中进行回收,如通过全培养物提取,例如使用有机溶剂。在一些情况下,脂肪酸产物的回收可以通过细胞均质化来增强。例如,可以通过在升高的温度和/或压力下用溶剂提取藻类而从藻类中分离脂质(如脂肪酸、脂肪酸衍生物和/或甘油三酯),如在共同未决的、共同转让的美国专利申请公开号US2013/0225846中所描述的,所述美国专利申请通过引用整体并入本文。
在审阅本公开之后,其它替代性实施例和方法对本领域的技术人员而言将是显而易见的。本文给出的一般方法的讨论仅旨在用于说明性目的。下文提供了以下非限制性实例。
实例
实例1
过表达CAS9的拟小球藻属菌株的生成
在美国专利申请公开2016/0304896中描述了过表达Cas9的拟小球藻属菌株的生成,所述美国专利申请通过引用整体并入。
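Briefly, the vector pSGE-6709 was engineered for expression of the Streptococcus pyogenes Cas9 gene in Parachlorella. The vector included the following three elements: (1) a Cas9 expression cassette containing an engineered Cas9 gene codon-optimized for Parachlorella and containing introns from Parachlorella, which also included an N-terminal FLAG tag, a nuclear localization signal, and a peptide linker, operably linked to the Parachlorella RPS17 promoter and terminated by the Parachlorella RPS17 terminator; (2) a selectable marker expression cassette containing the blasticidin resistance gene from Aspergillus terreus, codon-optimized for Parachlorella and containing Parachlorella introns, operably linked to the Parachlorella RPS4 promoter and terminated by the Parachlorella RPS4 terminator; and (3) a GFP reporter expression cassette containing the TurboGFP gene (Evrogen, Moscow, Russia), driven by the Parachlorella ACP1 promoter and terminated by the Parachlorella ACP1 terminator.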
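The vector was transformed into Parachlorella by biolistics. Transformation of Parachlorella wild-type strain WT-1185 was accomplished using the BioRad Gene Gun System essentially as described in US Patent Publication No. 2014/0154806, incorporated herein by reference. DNA for transformation was precipitated onto gold particles, the gold particles were adhered to the inside of lengths of tubing, and a burst of helium gas was fired through the tubing positioned within the gene gun to propel the DNA-coated gold particles into Parachlorella strain WT-1185 cells adhered to solid non-selective medium (2% agar plates containing PM074 algal growth medium). The gene gun was fired at 600 psi from a distance of 3-6 cm from the plate, with two bullets fired per cell circle. The following day, cells were transferred onto selective medium for growth of transformed colonies.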
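Colonies were screened for full GFP penetrance by flow cytometry, identifying transformed strains that had a single fluorescence peak shifted to a higher value than the wild-type fluorescence peak. Fully penetrant Cas9 strains, which showed a clearly shifted fluorescence peak relative to non-transformed cells, were tested for Cas9 expression by anti-Cas9 western blotting. Based on these screens, isolate 6709-2 was carried forward and given the strain identifier GE-15699.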
实例2:
使用完全渗透的拟小球藻属CAS9编辑系敲除CPSRP54
在美国专利申请公开2016/0304896中描述了使用完全渗透的拟小球藻属Cas9编辑系敲除cpSRP54,所述美国专利申请通过引用整体并入。简而言之,设计了嵌合gRNA(SEQID NO:103),最后三个核苷酸代表PAM,并且在体外合成以靶向拟小球藻属编码序列中的叶绿体SRP54基因。
将GE-15699通过电穿孔用1-2μg纯化的嵌合引导RNA和1μg可选择标记DNA进行转化,所述可选择标记DNA含有针对拟小球藻属进行密码子优化的博来霉素抗性“BleR”基因,并且含有来自拟小球藻属(SEQ ID:70)的内含子。BleR基因可操作地连接到拟小球藻属RPS4启动子(SEQ ID:71),并且由拟小球藻属RPS4终止子(SEQ ID:72)终止。
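Electroporation was performed by inoculating a 100 mL seed culture at 1×10⁶ cells/mL six days before transformation; this seed culture was used to inoculate a 1 L culture at 1×10⁶ cells/mL two days before transformation. On the day of transformation, cells were pelleted by centrifugation at 5000 × g for 20 minutes, washed three times with 0.1 μm-filtered 385 mM sorbitol, and resuspended to 5×10⁹ cells/mL in 385 mM sorbitol. Electroporation of 100 μL of concentrated cells was performed in 0.2 cm cuvettes in a BioRad Gene Pulser Xcell™ under varied conditions. The DNA used for optimization of electroporation was linearized pSG6640, which included the bleR and TurboGFP expression cassettes. The TurboGFP cassette included the Parachlorella ACP1 promoter (SEQ ID NO:67) operably linked to the TurboGFP gene (SEQ ID NO:24) and the Parachlorella ACP1 terminator (SEQ ID NO:68). Immediately after electroporation of pre-chilled cells and cuvettes, 1 mL of cold sorbitol was added and used to transfer the cells into 10 mL of PM074. After overnight recovery, cells were concentrated and spread onto 13 cm-diameter plates of PM074 medium containing 250 mg/L zeocin and grown under the conditions listed in the biolistics section.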
电穿孔条件为1.0-1.2kV(5000-6000V/cm)、200-300欧姆和25-50μF。使用更大量的DNA增加了所得博莱霉素抗性菌落的数量,尽管在大于4μg的量下效果达到稳定。在电穿孔之后,将细胞铺在含有250μg/ml博莱霉素的琼脂培养基(PM130)上,以选择掺入ble盒的转化体。使用设计用于跨天然靶向基因座(oligo-AE596和oligo-AE597)扩增的引物,通过菌落PCR筛选转化体。引物被设计成在不存在整合(例如,BleR盒的“敲入”)到基因座中的情况下产生700bp条带,或如果存在单个ble盒整合到靶向基因座中的话,则产生4.3kb条带。另外,还使用设计成扩增从cpSRP54基因(oligo-AE597)延伸到可选择标记的片段的引物来进行菌落PCR。根据整合的ble盒的朝向,1.2kb条带将由从ble盒内向外跨越进入到cpSRP54基因中的引物405/597或引物406/597的扩增来产生。结果显示,在不存在同源性臂的情况下,BleR盒敲入到靶向基因座中的高频率(在此样品中介于40%与45%之间)。cpSRP54敲除导致浅绿色表型。
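As an illustrative check of the electroporation settings quoted above, the following Python sketch converts the applied voltage and the 0.2 cm cuvette gap into field strength. The function names are hypothetical. The time-constant helper is an assumption added for orientation only: it uses the set resistance and capacitance and ignores the resistance of the sample itself, so measured pulse lengths may differ.

```python
def field_strength_v_per_cm(voltage_v, gap_cm):
    """Electric field strength across the cuvette gap, in V/cm."""
    return voltage_v / gap_cm


def nominal_time_constant_ms(resistance_ohm, capacitance_uf):
    """Nominal RC time constant in milliseconds (ohm x microfarad gives
    microseconds). Assumes the set resistance dominates; sample resistance
    is ignored."""
    return resistance_ohm * capacitance_uf / 1000.0


GAP_CM = 0.2  # cuvette gap stated in the text

for volts in (1000, 1200):  # 1.0-1.2 kV
    print(volts, "V ->", round(field_strength_v_per_cm(volts, GAP_CM)), "V/cm")

for ohms, microfarads in ((200, 25), (300, 50)):  # extremes of the stated ranges
    print(ohms, "ohm,", microfarads, "uF ->",
          nominal_time_constant_ms(ohms, microfarads), "ms")
```

The first loop reproduces the 5000-6000 V/cm range given in the text for 1.0-1.2 kV in a 0.2 cm cuvette.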
实例3
使用完全渗透的拟小球藻属CAS9编辑系敲除SGI2
使用完全渗透的拟小球藻属Cas9编辑系敲除SGI2基本上如上文针对cpSRP54所述进行。简而言之,设计了嵌合gRNA(SEQ ID NO:104),最后三个核苷酸代表PAM,并且在体外合成以靶向拟小球藻属编码序列中的叶绿体SGI2基因。
将GE-15699通过电穿孔用1-2μg纯化的嵌合引导RNA和1μg可选择标记DNA进行转化,所述可选择标记DNA含有针对拟小球藻属进行密码子优化的博来霉素抗性“BleR”基因,并且含有来自拟小球藻属(SEQ ID:70)的内含子。BleR基因可操作地连接到拟小球藻属RPS4启动子(SEQ ID:71),并且由拟小球藻属RPS4终止子(SEQ ID:72)终止。
选择Ble抗性菌落,并且通过PCR确认敲除。
实例4
使用完全渗透的拟小球藻属CAS9编辑系敲除SGI1
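SGI1 knockout strain 24183 was generated starting from the Cas9-expressing parent strain GE-15699. GE-15699 cells were electroporated with a chimeric gRNA (SEQ ID NO:105; the last three nucleotides of SEQ ID NO:105 represent the PAM) together with a DNA cassette containing a codon-optimized Cre gene flanked by the nitrite reductase promoter and terminator, as shown in Figure 10A. The cassette also contained the previously used ble and GFP genes, which were flanked by lox2272 sites. When Cre is expressed, the lox sites recombine, looping out the DNA between them. Homologous sequences of the SGI1 gene surrounding the CRISPR target were also located at the ends of the cassette to enhance single-copy integration. The presence of the cassette at the SGI1 locus was confirmed by DNA sequencing, and ddPCR was used to confirm that the strain was a single-copy integrant. The strain was then cultured in non-ammonium-containing medium to express Cre and thereby loop out the DNA between the lox sites.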
实例5
使用完全渗透的拟小球藻属CAS9编辑系双敲除SGI2和CPSRP54
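Double knockout of SGI2 and SRP54 using the fully penetrant Parachlorella Cas9 editor line was performed essentially as described above for cpSRP54. Briefly, two chimeric gRNAs were designed, one for cpSRP54 (SEQ ID NO:69) and the other for SGI2 (SEQ ID NO:73), in each of which the last three nucleotides represent the PAM, and were synthesized in vitro to target the cpSRP54 and SGI2 genes, respectively, within the Parachlorella coding sequences.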
将GE-15699通过电穿孔用1-2μg纯化的嵌合引导RNA和1μg可选择标记DNA进行转化,所述可选择标记DNA含有针对拟小球藻属进行密码子优化的博来霉素抗性“BleR”基因,并且含有来自拟小球藻属(SEQ ID:70)的内含子。BleR基因可操作地连接到拟小球藻属RPS4启动子(SEQ ID:71),并且由拟小球藻属RPS4终止子(SEQ ID:72)终止。
选择Ble抗性菌落,并且通过PCR确认敲除。
实例6
使用完全渗透的拟小球藻属CAS9编辑系双敲除SGI1和CPSRP54
将如上所述的拟小球藻属SGI1敲除菌株24183用靶向cpSRP54(SEQ ID NO:69)的嵌合gRNA与包括ble和GFP序列的DNA盒进行电穿孔(图10B),以产生SGI1和cpSRP54的双敲除。选择Ble抗性菌落,并且通过PCR确认敲除。生成了三种双敲除菌株:STR24538、STR24540和STR24541,所述双敲除菌株在光生理性质和物理表型上相同。
实例7
包括单拷贝CAS9基因的拟小球藻属SGI1敲除菌株的生成
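A bleomycin resistance "BleR" gene codon-optimized for Parachlorella and containing introns from Parachlorella (SEQ ID:70), a GFP gene, a Cre gene, lox sites, and a Cas9 gene were cloned into the pCC1BAC vector. The Cas9 gene was operably linked to the Parachlorella RPS17 promoter, included 29 native PBP introns, and was located outside the lox2272 sites. The Cas9 gene was terminated by the Parachlorella RPS17 terminator. The BleR gene was operably linked to the Parachlorella RPS4 promoter (SEQ ID:71) and terminated by the Parachlorella RPS4 terminator (SEQ ID:72). The GFP gene was operably linked to the Parachlorella ACP1 promoter and terminated by the Parachlorella ACP1 terminator. The Cre gene was operably linked to the Parachlorella nitrite reductase promoter and the Parachlorella nitrite reductase terminator. These genes were flanked by portions of the SGI1 (CheY) sequence that served as homologous recombination sites. Figure 17 shows a schematic of the recombinant pCC1BAC vector.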
转化WT拟小球藻属宿主菌株:STR00010
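The wild-type Parachlorella host strain was co-transformed with a gRNA targeting the SGI1 gene (SEQ ID NO:74) and the PvuI-digested, spin-purified selection cassette carrying the Cas9 gene (NAS00460, SEQ ID NO:86).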
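The selection cassette (NAS00460) comprised a fragment that included 1.7 kb of vector backbone upstream of the SGI1 homologous recombination (HR) upper arm (corresponding to positions 1-1761 of SEQ ID NO:86) and no vector sequence downstream of the SGI1 HR lower arm; the bleomycin resistance "BleR" gene, codon-optimized for Parachlorella and containing introns from Parachlorella (SEQ ID:70); a GFP gene (corresponding to positions 8260-8961 of SEQ ID NO:86); and the Cas9 gene. The selection cassette contained ble and GFP within the lox sites. The Cre gene (corresponding to positions 10418-13326 of SEQ ID NO:86) was codon-optimized for Parachlorella, included 6 nitrite reductase introns, and was under the nitrite reductase inducible promoter (corresponding to positions 9906-10417 of SEQ ID NO:86). The Cre gene was terminated by the nitrite reductase terminator (corresponding to positions 13327-15140 of SEQ ID NO:86). The Cas9 gene, including 29 native PBP introns, corresponds to positions 15754 to 25918 of SEQ ID NO:86. The Cas9 gene was under the Parachlorella RPS17 promoter (corresponding to positions 15166-15753 of SEQ ID NO:86) and was located outside the lox sites. The Cas9 gene was terminated by the Parachlorella RPS17 terminator (corresponding to positions 25919-26373 of SEQ ID NO:86).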
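The BleR gene was operably linked to the Parachlorella RPS4 promoter (SEQ ID:71) and terminated by the Parachlorella RPS4 terminator (SEQ ID:72). The GFP gene was operably linked to the Parachlorella ACP1 promoter (corresponding to positions 7688 to 8259 of SEQ ID NO:86) and terminated by the Parachlorella ACP1 terminator (corresponding to positions 8692-9830 of SEQ ID NO:86). The SGI1 HR upper arm corresponds to positions 1762-3578 of SEQ ID NO:86. The SGI1 HR lower arm corresponds to positions 26448-28447 of SEQ ID NO:86. The 5' lox2272 site corresponds to positions 3831-3864 of SEQ ID NO:86, and the 3' lox2272 site corresponds to positions 9839-9872 of SEQ ID NO:86. All sequences were within the 2 kb regions of homology upstream and downstream of the SGI1 CRISPR target.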
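The layout of the selection cassette described above can be summarized, purely for illustration, as a small feature table in Python. The coordinates are copied as stated in the text for SEQ ID NO:86 (1-based, inclusive); the dictionary keys, function names, and the simple "between the lox sites" test are assumptions introduced for this sketch.

```python
# Feature coordinates (1-based, inclusive) copied as stated in the text for
# SEQ ID NO:86. Names and data layout are illustrative only.
FEATURES = {
    "vector backbone": (1, 1761),
    "SGI1 HR upper arm": (1762, 3578),
    "5' lox2272": (3831, 3864),
    "ACP1 promoter": (7688, 8259),
    "GFP": (8260, 8961),
    "ACP1 terminator": (8692, 9830),
    "3' lox2272": (9839, 9872),
    "NR promoter": (9906, 10417),
    "Cre": (10418, 13326),
    "NR terminator": (13327, 15140),
    "RPS17 promoter": (15166, 15753),
    "Cas9 (with introns)": (15754, 25918),
    "RPS17 terminator": (25919, 26373),
    "SGI1 HR lower arm": (26448, 28447),
}


def length(feature):
    """Length of a feature in base pairs."""
    start, end = FEATURES[feature]
    return end - start + 1


def between_lox_sites(feature):
    """True if the feature lies entirely between the two lox2272 sites,
    i.e. it would be excised when Cre recombines the sites."""
    start, end = FEATURES[feature]
    return FEATURES["5' lox2272"][1] < start and end < FEATURES["3' lox2272"][0]


for name in FEATURES:
    print(f"{name:22s} {length(name):6d} bp  floxed={between_lox_sites(name)}")
```

Run as written, the sketch reports that GFP and its ACP1 regulatory elements fall between the lox2272 sites, whereas Cre and Cas9 do not, which matches the statement that Cas9 and Cre remain at the locus after marker excision.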
在共同转化SGI1 gRNA(SEQ ID NO:105)和选择盒(SEQ ID NO:86)时,将SGI1基因敲除,并且通过同源重组将包括Cas9、BleR和GFP基因的选择盒插入到SGI1位点中。BleR和GFP基因的侧翼是lox2272位点,而选择盒的Cas9和Cre基因位于lox2272位点之外,但位于充当同源重组位点的SGI1序列的部分内。
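Once the selection cassette is inserted into the SGI1 locus, the Cre gene remains operably linked to the inducible nitrite reductase promoter. Accordingly, Cre gene expression is induced when the microorganism is grown in a growth medium that includes nitrite. Upon Cre expression, the Cre enzyme acts on the lox2272 sites and removes the BleR and GFP sequences flanked by the lox sites. This results in a system in which the selectable markers (for example, GFP or an antibiotic marker such as BleR) can be reintroduced during subsequent transformations with other sequences.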
筛选用于Cas9插入的经过转化的拟小球藻属菌株
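Transformed Parachlorella cells were plated to single colonies on selective plates containing ammonium to repress Cre expression, colonies were patched again onto selective repressing plates, and knockouts were screened by PCR and GFP shift. The PCR primers used to confirm the knockout are shown below: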
AE803:AGGCTACTCTCAGACATGACGGTGGCTCTG(SEQ ID NO:87)
ST815:GCCACAAATGAAGGTTGGCAGGGTCAGTGC(SEQ ID NO:88)
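PCR-positive reactions were sent for sequencing to confirm the knockout (insertion of the cassette) and perfect HR. The inventors of the present application surprisingly and unexpectedly found that a single copy of the Cas9 gene had been inserted into the SGI1 locus.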
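For illustration only, the two confirmation primers can be located on the portion of SEQ ID NO:1 printed in the sequence listing. The Python sketch below embeds two excerpts of that sequence (residues 1-60 and 601-670, using the listing's own running numbering); the variable and function names are hypothetical, and the sketch makes no claim about the size of the product obtained from an edited locus.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence (case is preserved)."""
    return seq.translate(COMPLEMENT)[::-1]


AE803 = "AGGCTACTCTCAGACATGACGGTGGCTCTG"  # SEQ ID NO:87
ST815 = "GCCACAAATGAAGGTTGGCAGGGTCAGTGC"  # SEQ ID NO:88

# Excerpts of SEQ ID NO:1 as printed in the sequence listing.
SEQ1_1_60 = "atgtctggttcagctggatcgggccaggctactctcagacatgacggtggctctgctggc"
SEQ1_601_670 = ("tctacgtcaacgctgcaaaatgaaccaattgttttgcactgaccctgcca"
                "accttcatttgtggctgcag")

# AE803 matches the listed strand; ST815 matches the opposite strand.
fwd_start = SEQ1_1_60.upper().find(AE803) + 1  # 1-based position
rev_start = SEQ1_601_670.upper().find(reverse_complement(ST815)) + 601
rev_end = rev_start + len(ST815) - 1

print("AE803 binds from position", fwd_start)
print("ST815 binding site spans", rev_start, "to", rev_end)
print("span on the listed sequence:", rev_end - fwd_start + 1, "bp")
```

The same `reverse_complement` helper can be applied to any other primer in the text to decide which strand of a listed sequence it anneals to.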
实例8
使用完全渗透的拟小球藻属单拷贝CAS9编辑系三重敲除SGI1、SGI2和CPSRP54
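Parachlorella SGI1 knockout strain STR24129, which has a single copy of Cas9 and Cre inserted into the SGI1 locus, was generated as described above using the SGI1 knockout guide sequence ACACCACCTTAAGGCACATGAGG (SEQ ID NO:89), with the markers (ble/GFP) floxed out.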
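As a hedged illustration of how a 23-nucleotide target such as SEQ ID NO:89 is read, the sketch below splits the sequence into a 20-nucleotide protospacer and a 3-nucleotide PAM and checks the PAM against the NGG motif recognized by Streptococcus pyogenes Cas9. It assumes the convention stated in the earlier examples, namely that the last three nucleotides represent the PAM; the function names are hypothetical.

```python
import re


def split_guide(target):
    """Split a 23-nt target into (20-nt protospacer, 3-nt PAM).
    Assumes the last three nucleotides are the PAM."""
    target = target.upper()
    if len(target) != 23:
        raise ValueError("expected a 23-nt target (20-nt protospacer + PAM)")
    return target[:20], target[20:]


def is_spcas9_pam(pam):
    """True if the trinucleotide matches the NGG motif of S. pyogenes Cas9."""
    return re.fullmatch(r"[ACGT]GG", pam.upper()) is not None


TARGETS = {
    "SEQ ID NO:89 (SGI1)": "ACACCACCTTAAGGCACATGAGG",
    "SEQ ID NO:90 (SRP54)": "GGCGTGGGACATGGTGCGCAAGG",
}

for name, seq in TARGETS.items():
    protospacer, pam = split_guide(seq)
    print(name, protospacer, pam, "NGG PAM:", is_spcas9_pam(pam))
```

A 20-nucleotide crRNA sequence listed without a PAM would be rejected by `split_guide`, which is intentional in this sketch.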
将SGI1敲除菌株STR24129用作敲除SGI2和SRP54基因的转化宿主。将宿主菌株STR24129与靶向SGI2和SRP54基因的gRNA和选择盒(pSGE06866)共转化,所述选择盒包括包含用于每个靶(例如,SRP54和SGI2)的同源重组(HR)臂的Ultramers。BleR基因可操作地连接到拟小球藻属RPS4启动子(SEQ ID:71),并且由拟小球藻属RPS4终止子(SEQ ID:72)终止。GFP基因可操作地连接到拟小球藻属ACP1启动子,并由拟小球藻属ACP1终止子终止。选择盒包括由lox位点围绕的ble和GFP标记,用于潜在的标记再循环。当表达Cre时,lox位点重组,在这些序列之间环出DNA。
将转化的宿主细胞铺在选择性平板上,挑取修补菌落和单一菌落,并使用PCR筛选敲除。将PCR阳性反应送去测序,以确认每个靶的敲除(盒的插入)。
Figures 16A and 16B show schematics of the selection cassettes used to knock out Parachlorella SRP54 and Parachlorella SGI2. The gRNA sequences and the Ultramers with HR arms are shown below.
SRP54-EMRE3EUKT592650
gRNA sequence: GGCGTGGGACATGGTGCGCAAGG (SEQ ID NO:90)
Ultramers with HR arms for amplifying pSGE06866:
ST938_HR_SRP54-UP
TGAAGCACCCCCCGGCCTCTCCCCCCGCAGGGCCGCCCCTCCCGCCTCGTCGTGC (SEQ ID NO:91)
ST939_HR_SRP54-DOWN
CGCAACGCTCTCCCTCCCCACCCCCCAGCCTCACATCCGCCTCAAGCAGCGCCCTG (SEQ ID NO:92)
Primer sequences:
ST949_CasPipe9GT_SRP54-fwd: caagctatgcgaggaagggagggtc (SEQ ID NO:93)
ST950_CasPipe9GT_SRP54-rev: ctgccgcaagtgagtgtgctgtc (SEQ ID NO:94)
Additional primers used for screening, located within the selection cassette:
JV946-linker5-for: caccagatataggtgacccgataac (SEQ ID NO:95)
AE608 ble rev: AAAACTCCACTGCACCTGCAACAT (SEQ ID NO:96)
SGI2-EMRE3EUKT590485
gRNA sequence:
ST937_crRNA_064_EMRE3EUKT590485: TGCGGTGAAGCTTGGAGCTG (SEQ ID NO:97)
Ultramer sequences for placing HR arms on pSGE06866:
ST940_HR_SGI2-UP
TTGCCGTCGACGAGACTTCGGGGCGCGCATTTATCGACTCTCTTGAAGATACACCGGTT (SEQ ID NO:98)
ST941_HR_SGI2-DOWN
TCCAATTGTAGATATCATATTGTTTCCGGACCTACCTTACGCACTGAGTGCTGCCAGATGTTCTT (SEQ ID NO:99)
Primer sequences:
ST046CasPipe9GT-064-fwd: gaggtgggtggtagtgcttcgcgaggtg (SEQ ID NO:100)
ST047CasPipe9GT-064-rev: atcacagctcacagggcagacactgcgtc (SEQ ID NO:101)
Primers JV946 and AE608 were also used as screening primers.
实例9
SGI2蛋白的结构域架构的生物信息学分析
使用在线工具InterProScan(工具版本5.27,数据库版本66.0,来自EMBL-EBI,辛克斯顿,剑桥郡,CB10 1SD,英国),分析来自拟小球藻属、卵囊藻属、四爿藻属、拟南芥的示例性SGI2蛋白的结构域架构。
如图3-9所示,在SGI2蛋白的N末端鉴定了单个保守的响应接收结构域。
实例10
各种SGI2蛋白的响应接收结构域的生物信息学分析
使用BLOSUM62矩阵、空位罚分10和延伸罚分0.5,将拟小球藻属响应接收结构域(SEQ ID NO:6)与来自其它藻类物种和各种植物的其它直向同源蛋白质对进行了局部比对。拟小球藻属响应接收结构域(SEQ ID NO:6)与各种光合生物的局部比对在下表5中示出。
Table 5: Results of local alignment of the Parachlorella response receiver domain with various orthologous proteins. [Table 5 is provided as images in the original publication and is not reproduced in this text.]
拟小球藻属的响应接收结构域示出了与其它藻类物种更高百分比的同一性以及与各种植物物种的高度相似性。
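The local alignment used in this example can be sketched, under stated assumptions, as a Smith-Waterman score with affine gap penalties. This is not the tool used to produce Table 5. The gap convention assumed here is that a gap of length k costs the opening penalty plus (k - 1) times the extension penalty, and the toy scoring function (+5 match, -4 mismatch) merely stands in for a BLOSUM62 lookup, which would be passed in as the `score` argument in practice. All names are hypothetical.

```python
def local_align_score(a, b, score, gap_open=10.0, gap_extend=0.5):
    """Best local alignment score of sequences a and b (Smith-Waterman with
    affine gaps). A gap of length k costs gap_open + (k - 1) * gap_extend."""
    neg_inf = float("-inf")
    n, m = len(a), len(b)
    # h: best score of an alignment ending at (i, j)
    # e: best score ending with a gap in a; f: ending with a gap in b
    h = [[0.0] * (m + 1) for _ in range(n + 1)]
    e = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    f = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    best = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            e[i][j] = max(h[i][j - 1] - gap_open, e[i][j - 1] - gap_extend)
            f[i][j] = max(h[i - 1][j] - gap_open, f[i - 1][j] - gap_extend)
            diagonal = h[i - 1][j - 1] + score(a[i - 1], b[j - 1])
            h[i][j] = max(0.0, diagonal, e[i][j], f[i][j])
            best = max(best, h[i][j])
    return best


def toy_score(x, y):
    """Placeholder for a substitution matrix lookup such as BLOSUM62."""
    return 5.0 if x == y else -4.0


print(local_align_score("HEAGAWGHEE", "HEAGAWGHEE", toy_score))
print(local_align_score("AAAAWWWW", "AAAACCWWWW", toy_score))
```

Percent identity, as reported for orthologs of the response receiver domain, would additionally require a traceback to recover the aligned columns; that step is omitted from this sketch.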
实例11
低叶绿素拟小球藻属菌株WT-1185突变体的筛选
在敲除拟小球藻属基因的SGI1、SGI2、双敲除SGI1和cpSRP54或双敲除SGI2和cpSRP54之后,如上所述,选择来自浅色菌落的细胞并允许其在低光(100μmol光子m-2sec-1)下生长一到五天,之后使用BD FACSAria II流式细胞仪(BD生物科学公司(BDBiosciences),圣何塞,加利福尼亚州)通过流式细胞术对其进行分选,以选择具有低叶绿素荧光的细胞。通常,选择与总细胞群体相比具有最低大约0.5%到2%叶绿素荧光的细胞部分。在所分选的细胞铺板之后,通过目视选择浅绿色或黄色菌落,进一步初步筛选通过流式细胞术分离的天线减少的细胞系。为了从其它减少的色素突变体和假阳性中筛选推定的触角减少的细胞系,在光生理测量之前,对所选菌落进行中等通量的二次培养筛选,以使分离物适应低光条件。在低光适应期间监测叶绿素荧光,以选择保留了高光适应状态的减少的叶绿素荧光特性的克隆。当从高光转移到低光时,所选克隆仅显示叶绿素的少量增加(相对于野生型细胞)。
使用在75cm2组织培养烧瓶中的165ml培养物,在恒定的高光下(大约1,700μmol光子m-2sec-1)进行半连续培养物测定,以鉴定相对于野生型祖先株WT-1185,具有增加的生产力(增加的生物质产生速率,测量为总有机碳(TOC)积聚)的菌株。用给定突变体菌株的种子培养物接种两个75cm2烧瓶。烧瓶具有塞子,所述塞子具有与注射器过滤器连接的管子,以用于递送通过培养物鼓泡的富含CO2的空气(1%CO2)。将烧瓶的宽度(最窄的尺寸)与LED灯组进行比对。培养物的深度(从最靠近光源的烧瓶壁到烧瓶后部壁的距离)大约为8.0cm。在光周期开始时,通过去除65%的培养物体积并且将其用稀释的新鲜PM119培养基置换而每天对培养物进行稀释,以适应由于培养物(212ml di H2O到1L PM119培养基)中发生蒸发而导致的盐度增加。从去除以进行稀释的培养物中取出用于TOC分析的样品。
实例12
拟小球藻属突变体的半连续生产力测定
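Parachlorella strains found to have reduced chlorophyll under low-light conditions were assayed for increased productivity. In the productivity assay, photoautotrophic cultures of the mutants were grown over several days in constant-light semicontinuous mode (CL-SCPA), with culture samples removed daily for biomass determination. The light was kept at a constant 1900-2000 μmol photons m⁻² sec⁻¹ for 24 hours per day. In this assay, PM119 medium in a 225 cm² flask was inoculated with seed culture of a given mutant strain. Three cultures were initiated per strain. The flasks included stir bars and had stoppers fitted with tubing connected to syringe filters for delivering CO2-enriched air (1% CO2) that was bubbled through the cultures. The flasks were aligned with the width (narrowest dimension) against an LED light bank. The "depth" dimension of the flasks, extending back from the light source, was 13.7 cm. Taking into account the positioning of the flasks, the farthest distance of cells in the flasks from the surface of the light source was approximately 15.5 cm. The cultures were diluted daily by removing 65% of the culture volume and replacing it with fresh PM119 medium that had been diluted to adjust for the increase in salinity due to evaporation occurring in the cultures. Samples for TOC analysis were taken from the culture removed for dilution. The semicontinuous productivity assay was run for 12 days once the cultures had reached steady state.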
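The daily dilution described above implies a minimum growth rate at steady state, which the following Python sketch works out. It assumes an idealized steady state in which the culture regrows exactly to its pre-dilution density within one day; the function names are hypothetical and the calculation is offered only as orientation.

```python
import math


def specific_growth_rate(daily_removed_fraction):
    """Specific growth rate (per day) needed to regrow to the pre-dilution
    density in one day: mu = ln(1 / (1 - D))."""
    if not 0.0 < daily_removed_fraction < 1.0:
        raise ValueError("fraction must be between 0 and 1")
    return math.log(1.0 / (1.0 - daily_removed_fraction))


def doublings_per_day(daily_removed_fraction):
    """Same growth requirement expressed as population doublings per day."""
    return specific_growth_rate(daily_removed_fraction) / math.log(2.0)


# 65% of the culture volume is removed each day in the CL-SCPA assay.
print(round(specific_growth_rate(0.65), 3), "per day")
print(round(doublings_per_day(0.65), 3), "doublings per day")
```

A strain that cannot sustain this rate under the assay light regime would wash out rather than reach steady state, which is why productivity is read only after the steady state is established.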
通过从每天去除的样品中测量总有机碳(TOC)来评估测定的生产力。总有机碳(TOC)通过用去离子水将2mL细胞培养物稀释到20mL总体积来确定。每次测量注射三次,注射到Shimadzu TOC-Vcsj分析仪中,以确定总碳(TC)和总无机碳(TIC)。将燃烧炉设定为720℃,并且通过从TC中减去TIC来确定TOC。4点校准范围为2ppm到200ppm,对应于非稀释培养物的20-2000ppm,其中相关系数r2>0.999。
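The TOC arithmetic described above can be written out as a minimal Python sketch: triplicate injections are averaged, TIC is subtracted from TC, and the result is multiplied by the tenfold dilution (2 mL of culture brought to 20 mL). The function names and the example readings are hypothetical.

```python
from statistics import mean

DILUTION_FACTOR = 20.0 / 2.0  # 2 mL of culture diluted to 20 mL total


def toc_ppm(tc_injections, tic_injections, dilution_factor=DILUTION_FACTOR):
    """TOC of the undiluted culture in ppm: (mean TC - mean TIC) x dilution."""
    return (mean(tc_injections) - mean(tic_injections)) * dilution_factor


def in_calibration_range(diluted_ppm, low=2.0, high=200.0):
    """True if a reading on the diluted sample lies within the 2-200 ppm
    calibration range stated in the text."""
    return low <= diluted_ppm <= high


# Hypothetical triplicate readings on the diluted sample, in ppm.
tc = [60.0, 60.0, 60.0]
tic = [10.0, 10.0, 10.0]
print(toc_ppm(tc, tic), "ppm TOC in the undiluted culture")
print(in_calibration_range(mean(tc)))
```

The tenfold factor is also why the 2-200 ppm calibration range corresponds to 20-2000 ppm in the undiluted culture.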
已经描述了本发明的多个实施例。然而,应当理解的是,本文所描述的实施例的元素可以被组合以形成另外的实施例,并且在不脱离本发明的精神和范围的情况下可以进行各种修改。因此,其它实施例、替代方案和等效物处于本文所描述且要求保护的本发明的范围内。
实例13
拟小球藻属突变体的半连续尿素分批测定
在SCUBA(半连续尿素分批测定)中,突变体的光能自养培养物在充满氮的昼夜光半连续模式下生长若干天,然后在氮耗尽的分批模式下生长。光被编程为模仿加利福尼亚州的因皮里尔河谷(Imperial Valley)的平均5月4日从黑暗到中午的2000μmol光子m-2sec-1。每天在“黄昏”取样。在此测定中,用给定突变体菌株的种子培养物接种500ml方形烧瓶中的基于尿素的420ml的PM153培养基。
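PM152 is a nutrient-depleted medium based on PM074 that includes urea rather than nitrate as the nitrogen source. It is made by adding 1.3 mL of F/2 Algae Feed Part A (Aquatic Eco-Systems) and 1.3 mL of 'Solution C' to a final volume of 1 liter of a solution of Instant Ocean salts (17.5 g/L) (Aquatic Eco-Systems, Apopka, FL). Solution C is 38.75 g/L NaH2PO4·H2O, 758 mg/L thiamine HCl, 3.88 mg/L vitamin B12, and 3.84 mg/L biotin.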
每个菌株启动三种培养物。烧瓶包含搅拌棒并具有塞子,所述塞子具有与注射器过滤器连接的管子,以用于递送通过培养物鼓泡的富含CO2的空气(1%CO2)。将烧瓶与朝向0.0875m2光的孔对准,并且从光源向后延伸的烧瓶的“深度”尺寸为8cm。对于半连续生物质的测定,通过去除40%的培养物体积并且将其用稀释的新鲜PM153培养基置换而每天对培养物进行稀释,以适应由于培养物中发生蒸发而导致的盐度增加。从去除以进行稀释的培养物中取出用于TOC分析的样品。将半连续生产力测定运行,直到培养物达到稳定状态。半连续后,将培养物从测定中去除,使用离心法沉淀并重悬于420ml氮耗尽PM152培养基中。使用与半连续模式相同的生长条件,将培养物分批培养4-5天。在分批模式期间,采集FAME样品以确定脂质生产力,并且采集TOC样品以确定FAME/TOC。
实例14
SGI1、SGI2基因、SGI1和SRP54以及SGI2和SRP54基因的双敲除的拟小球藻属敲除突变体的叶绿素含量、触角大小和光生理学
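The chlorophyll content of the high-productivity mutants was determined by extracting cells with methanol and analyzing the supernatant by spectrophotometry. Briefly, 500 μL aliquots of culture were pipetted into 2.0 mL twist-top tubes and pelleted using a tabletop microcentrifuge at 15,000 rpm for 10 minutes. The supernatant was aspirated from the pellets, and each pellet was resuspended in 1.5 mL of 99.8% methanol (previously neutralized with magnesium carbonate). 0.2 mL of glass beads (0.1 mm diameter) were added to each vial, and the vials were bead-beaten for 3 minutes. 1.0 mL of supernatant was transferred to new 1.7 mL flip-top tubes and centrifuged in a tabletop microcentrifuge at 15,000 rpm for 10 minutes. The resulting pellets were white, indicating that complete extraction had been performed. 0.8 mL of each supernatant was pipetted into an optical glass cuvette, and absorbance was read immediately at wavelengths of 720 nm, 665 nm, and 652 nm. Spectrophotometric measurements were carried out in dual-beam mode using a 99.8% methanol blank. The following equations were used to calculate chlorophyll concentrations: Chl a [g m⁻³] = 16.72(A665 − A720) − 9.16(A652 − A720) and Chl b [g m⁻³] = 34.09(A652 − A720) − 15.28(A665 − A720). The amounts of chlorophyll a and b were normalized on a per-cell and a per-TOC basis. Although the amount of total chlorophyll per cell varied among the SGI1-2261 mutants, it was universally reduced by about 30% to about 65% relative to wild-type cells, consistent with the observed reduction in antenna size. On a per-TOC basis, the reduction in total chlorophyll in SGI1 mutants ranged from about 30% to about 50% relative to wild-type cells.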
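For illustration, the chlorophyll calculation can be sketched in Python as follows. The sketch uses the methanol coefficients in the form published by Lichtenthaler (1987), in which the A652 term of the chlorophyll a equation is subtracted, with A720 used as the turbidity baseline. The function names and example absorbances are hypothetical, and the factor converting extract concentration to culture concentration simply reflects the 1.5 mL of methanol used per 0.5 mL of culture.

```python
import math

EXTRACT_ML = 1.5   # methanol added to each pellet
CULTURE_ML = 0.5   # culture volume pelleted


def chlorophyll_in_extract(a665, a652, a720):
    """Return (Chl a, Chl b) in the methanol extract, in g per cubic meter
    (numerically equal to micrograms per mL). A720 is subtracted as baseline."""
    c665 = a665 - a720
    c652 = a652 - a720
    chl_a = 16.72 * c665 - 9.16 * c652
    chl_b = 34.09 * c652 - 15.28 * c665
    return chl_a, chl_b


def chlorophyll_in_culture(a665, a652, a720):
    """Scale extract concentrations back to the original culture volume."""
    factor = EXTRACT_ML / CULTURE_ML
    chl_a, chl_b = chlorophyll_in_extract(a665, a652, a720)
    return chl_a * factor, chl_b * factor


# Hypothetical absorbance readings.
chl_a, chl_b = chlorophyll_in_extract(a665=0.52, a652=0.32, a720=0.02)
print(round(chl_a, 3), round(chl_b, 3), round(chl_a / chl_b, 2))
```

Per-cell or per-TOC normalization then divides the culture-level values by the cell count or TOC of the same sample.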
除叶绿素含量之外,还分析了SGI1和SGI2敲除突变体以及SGI1和SRP54以及SGI2和SRP54的双敲除的PSII功能性吸收横截面、PSI功能性吸收横截面、1/τ'Qa(在光饱和时,光系统II的受体侧上的电子传递的光饱和速率,线性光合电子传递的效率的测量)以及最大固碳率Pmax。在上述恒定的光半连续培养测定(CL-SCPA)中培养野生型和突变体菌株的细胞。
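Various photosynthetic parameters were analyzed using the Fluorescence Induction and Relaxation (FIRe) technique, which was developed to measure a comprehensive suite of photosynthetic and physiological characteristics of photosynthetic organisms (Gorbunov and Falkowski (2005) "Fluorescence Induction and Relaxation (FIRe) Technique and Instrumentation for Monitoring Photosynthetic Processes and Primary Production in Aquatic Ecosystems" in: Photosynthesis: Fundamental Aspects to Global Perspectives, Proceedings of the 13th International Congress of Photosynthesis, Montreal, August 29 to September 3, 2004 (Eds: A. van der Est and D. Bruce), Allen Press, Vol. 2, pp. 1029-1031). The FIRe technique relies on measurement and analysis of chlorophyll "variable fluorescence" profiles (reviewed by Falkowski et al. (2004) "Development and Application of Variable Chlorophyll Fluorescence Techniques in Marine Ecosystems" in: Chlorophyll a Fluorescence: A Signature of Photosynthesis (C. Papageorgiou and Govindjee, eds.), Springer, pp. 757-778), which depend on the relationship between chlorophyll fluorescence and the efficiency of photosynthetic processes. The technique provides a set of parameters that characterize photosynthetic light-harvesting processes, the photochemistry in photosystem II (PSII), and photosynthetic electron transport down to carbon fixation. The measurements performed herein used a mini-FIRe device produced by Maxim Gorbunov of Rutgers University, East Brunswick, NJ. A FIRe device is commercially available from Sea-Bird Scientific (Halifax, Canada; satlantic.com and planet-ocean.co.uk). Further information regarding use of the FIRe device is available in the company manual. All measurements were made using constant-light (2000 μmol photons m⁻² sec⁻¹) semicontinuous cultures (CL-SCPA) (see Example 3). To obtain FV/FM and σPSII, measurements of fluorescence induction and relaxation (FIRe) kinetics were made in the dark. The values of FV/FM and σPSII presented in Table 6 were calculated as the average of 6 measurements (3 measurements of each of 2 biological replicates); the errors for these parameters did not exceed 5%.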
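Measurements of the PSI cross-section were performed using a modified JTS-10 spectrometer with a filter set for measuring the electrochromic shift (ECS) at 520 nm, equipped with a custom-built single-turnover flash (STF) lamp. The peak power density in the sample chamber was high enough to ensure complete closure of reaction centers within approximately 10 μs. The resulting excitation rate was about 1-3 hits per reaction center per 10 μs (depending on the functional absorption cross-section of the photosystem). The STF generates short, ultra-bright pulses of blue light (455 nm, with a 30 nm half-bandwidth), and pulse timing is controlled by the trigger of the JTS-10 spectrometer. Pulse duration is controlled by the STF pulse control box and is adjustable in the range of 1 μs to 50 μs using a potentiometer on the front panel. To measure the PSI cross-section, cultures were diluted to an OD of about 0.2 at the chlorophyll maximum (about 440 nm), based on measurement of the absorption spectrum of the cell suspension using a Perkin Elmer Lambda 650 spectrophotometer equipped with an integrating sphere. The ECS was measured using 10 μs flashes with intensities ranging from 4000 to 120,000 μmol photons m⁻² s⁻¹ in the presence of DCMU and hydroxylamine. The experimental curves were fitted with a simple exponential function:
$$ECS = ECS_M\left(1 - e^{-I_t\,\sigma_{PSI}}\right)$$
where $ECS_M$ is the maximal ECS signal, $I_t$ is the photon density in photons/m², and $\sigma_{PSI}$ is the functional cross-section of PSI. The value obtained for the PSI functional cross-section of wild-type Parachlorella (WT-1185) was (4.0 ± 0.5) × 10⁻¹⁸ m². This value is close to that obtained for the functional cross-section of PSII measured for cells grown under the same conditions (σPSII = (4.3 ± 0.1) × 10⁻¹⁸ m²). The errors for these parameters were estimated not to exceed 20%.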
固碳率(C14 Pmax)使用在含有0.5g l-1(5.95mM)碳酸氢钠的培养基中归一化为5μg chl ml-1的培养物来测量。将20.4μCi ml-1C14标记的碳酸氢钠添加到每种培养物中,并暴露于2500μE中持续10分钟。立即用2N HCl酸化样品,并且允许其排气过夜。第二天,使用Beckman LS6500闪烁计数器测量样品并进行量化。
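τ'Qa (the time of electron transport on the acceptor side of PSII measured under saturating light conditions, effectively determined by the slowest step of linear photosynthetic electron transport) was measured from FIRe light curves and dark-induced relaxation kinetics (DIRK) profiles. The volumetric PSII concentration relative to wild type was estimated as (Fv/σ530PSII). The errors for these parameters were estimated not to exceed 15%. The optical absorption cross-section (averaged over the emission spectrum of the light source) was estimated using the following equation:
[Equation provided as an image in the original publication; not reproduced in this text.]
where [Chl/TOC] is the chlorophyll/TOC of the sample, OD(λ) is the measured optical density of the sample at wavelength λ, Δl is the measured beam path length in the cuvette (1 cm), and I(λ) is the intensity, at wavelength λ, of the light source used to grow the algae.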
Table 6. Fluorescence and photosynthetic parameters measured with the FIRe technique. [Table provided as an image in the original publication.]
概述并评估了野生型拟小球藻属菌株WT-1185的光生理数据、叶绿素含量和生产力数据,拟小球藻属中的SRP54和SGI2基因的单敲除以及SGI2和SRP54基因的双敲除。所有测量均使用CL-SCPA培养物进行。为了获得FV/FM和σPSII,在黑暗中进行了荧光诱导和弛豫(FIRe)动力学的测量。针对Fv/Fm和σPSII呈现的值计算为6次测量的平均值(2次生物复制中每一次复制的3次测量)—这些参数的误差不超过5%。τ'Qa(在饱和光条件下所测量的PSII受体侧上的电子传递时间—由线性光合电子传递的最慢步骤有效确定)是从FIRe光曲线和DIRK图谱测量的。PSI横截面的测量如上所述进行。结果在以下表7中概括。
Table 7. Photophysiology, chlorophyll, and productivity data. [Table provided as an image in the original publication.]
PSII的功能性吸收横截面显著减少(50%),并且功能性PSII复合物的数量也有所减少。细胞具有提高的碳固定能力(Pmax增加26%)。与野生型菌株相比,SGI2或SRP54的单敲除示出了TOC生产力增加了至少17%。总体而言,双SGI2/SRP54敲除菌株示出了TOC生产力提高了32%(两次在CL-SCPA测定上运行双SGI2/SRP54敲除菌株,均显示生产力>40克/平方米/天),其中如图11所示,观察到针对拟小球藻属的生产力的增加最高,并且比单敲除SRP54或SGI2的平均提高更高。结果表明,当SGI2和SRP54基因两者被敲除时,似乎对生产力有协同效应。
评估了野生型拟小球藻属菌株WT-1185的光生理数据,SRP54和SGI1基因的单敲除以及具有拟小球藻属中的SGI1和SRP54基因的双敲除的三种菌株。所有测量均使用CL-SCPA培养物进行。为了获得FV/FM和σPSII,在黑暗中进行了荧光诱导和弛豫(FIRe)动力学的测量。针对FV/FM和σPSII呈现的值计算为6次测量的平均值(2次生物复制中每一次复制的3次测量)—这些参数的误差不超过5%。τ'Qa(在饱和光条件下所测量的PSII受体侧上的电子传递时间—由线性光合电子传递的最慢步骤有效确定)是从FIRe光曲线和DIRK图谱测量的。结果在表8中概括。
Table 8. Photophysiology of Parachlorella strains. [Table provided as an image in the original publication.]
与单个SGI1或SRP54基因敲除相比,SGI1/SRP54双敲除菌株PSII的功能性横截面显著减少,并且电子传递的光饱和率降低,这表明光合作用的速率提高。功能性PSII复合物的数量也有所增加。与单个敲除SRP54或SGI1相比,双敲除菌株中光系统II中的光化学最大量子产率(FV/FM)有所提高。
Example 15
Proximate analysis of SGI1/SGI2, SGI1/SRP54, and SGI1/SGI2/SRP54 knockout mutants
为了确定SGI1/SGI2、SGI1/SRP54和SGI1/SGI2/SRP54敲除突变体的总体生物质组分,对来自以每天40%稀释的在半连续模式下生长的培养物的样品进行量化分析,以确定总有机碳(TOC)和半连续培养物中细胞的脂质含量。在培养物达到稳定状态之后,所去除的每天稀释的培养物的等分试样用于分析脂质、蛋白质和碳水化合物。藻类培养物样品的总有机碳(TOC)通过用去离子水将2mL细胞培养物稀释到20mL总体积来确定。每次测量注射三次,注射到Shimadzu TOC-Vcsj分析仪中,以确定总碳(TC)和总无机碳(TIC)。将燃烧炉设定为720℃,并且通过从TC中减去TIC来确定TOC。4点校准范围为2ppm到200ppm,对应于非稀释培养物的20-2000ppm,其中相关系数r2>0.999。
为了确定脂质含量,对使用GeneVac HT-4X干燥的2mL样品进行了FAME分析。向干燥的沉淀物中添加以下:500μL 500mM KOH甲醇溶液、200μL的含有0.05%丁基化羟基甲苯的四氢呋喃、40μL 2mg/ml C11:0游离脂肪酸/C13:0甘油三酯/C23:0脂肪酸甲酯内标混合物和500μL玻璃珠(直径425–600μm)。将小瓶用顶部开口的PTFE隔垫衬里盖加盖,并且在SPEX GenoGrinder中以1.65krpm放置持续7.5分钟。然后将样品在80℃下持续加热五分钟并使其冷却。为了衍生化,在80℃下加热持续30分钟之前,向样品中添加500μL 10%三氟化硼甲醇溶液。在添加2mL庚烷和500μL 5M NaCl之前,将试管冷却。然后将样品以2krpm涡旋持续五分钟,并且最后以1krpm离心持续三分钟。使用Gerstel MPS自动进样器,对庚烷层进行采样。定量使用80μg C23:0FAME内标。
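The internal-standard arithmetic in the FAME protocol can be illustrated with a brief Python sketch. The first function reproduces the 80 μg figure from the stated 40 μL of a 2 mg/mL standard mix. The second function is an assumption for illustration only: it quantifies a FAME peak by its area ratio to the C23:0 FAME standard and presumes an equal detector response for analyte and standard, which the text does not state. All names and example peak areas are hypothetical.

```python
def internal_standard_ug(volume_ul, conc_mg_per_ml):
    """Mass of internal standard added, in micrograms
    (microliters x mg/mL gives micrograms)."""
    return volume_ul * conc_mg_per_ml


def fame_ug(peak_area, standard_area, standard_ug):
    """Estimate FAME mass from the peak-area ratio to the internal standard.
    Assumes equal detector response for analyte and standard."""
    return peak_area / standard_area * standard_ug


standard_ug = internal_standard_ug(40, 2.0)  # 40 uL of a 2 mg/mL mix
print(standard_ug, "ug of C23:0 FAME internal standard")

# Hypothetical peak areas for one sample.
print(fame_ug(peak_area=1500, standard_area=1000, standard_ug=standard_ug), "ug FAME")
```

Dividing the summed FAME mass by the TOC of the same sample gives the FAME/TOC ratio used in the batch-mode assays.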
图12A和12B分别示出了表明拟小球藻属野生型菌株(STR00010)、SRP54敲除突变体(STR00625)、SGI1敲除突变体(STR24183)、SGI1/SRP54双敲除突变体(STR24538和STR24540)的半连续区域TOC生产力和分批TOC的测定结果。SRP54敲除突变体、SGI1敲除突变体、SGI1/SRP54双敲除突变体示出了相对于拟小球藻属野生型菌株增加的TOC生产力。
图13A和13B分别示出了表明拟小球藻属野生型菌株(STR00010)、SRP54敲除突变体(STR00625)、SGI1敲除突变体(STR00012)、SGI2/SRP54双敲除突变体(STR00516)以及SGI1/SGI2/SRP54三重敲除突变体(STR25761和STR25762)的半连续区域TOC生产力和分批TOC的测定结果。SGI1敲除突变体、SGI2/SRP54双敲除突变体以及SGI1/SGI2/SRP54三重敲除突变体示出了相对于拟小球藻属野生型菌株增加的TOC生产力。
图14示出了拟小球藻属野生型菌株(STR00010)、SRP54敲除突变体(STR00625)、SGI1敲除突变体(STR24183)、SGI1/SRP54双敲除突变体(STR24538和STR24540)的分批FAME生产力测定的结果。SGI1敲除突变体和SGI/SRP54敲除突变体示出了相对于拟小球藻属野生型菌株增加的FAME生产力。
图15示出了拟小球藻属野生型菌株(STR00010)、SGI1敲除突变体(STR00012)、SGI2/SRP54双敲除突变体(STR00516)以及SGI1/SGI2/SRP54三重敲除突变体(STR25761和STR25762)的分批FAME生产力测定的结果。
本申请中的标题仅仅是为了方便读者,而不以任何方式限制本发明或其实施例的范围。
本说明书中所提及的所有公开和专利申请通过引用并入本文,其程度如同每个单独的公开或专利申请被专门地且单独地指示通过引用并入。
序列表
<110> 合成基因组学公司
I.阿加维
F.I.库兹米诺夫
R.R.拉达科维茨
J.H.维卢托
S.波茨
R.斯普雷菲科
W.F.兰伯特
J.N.格雷纳
<120> 用于改善生长的光合生物基因调节
<130> SGI2140-2WO
<150> US 62/690,205
<151> 2018-06-26
<150> US 62/612,251
<151> 2017-12-29
<160> 105
<170> PatentIn版本3.5
<210> 1
<211> 4531
<212> DNA
<213> 拟小球藻属
<400> 1
atgtctggtt cagctggatc gggccaggct actctcagac atgacggtgg ctctgctggc 60
ggcagtgggc ctgtctcaga cggtttttca ccggccggcc tgaaggtaaa gtagaaagac 120
actcatacac atcttggttc ggcgttgaaa gtaggtcatt aacatactct ataaccaata 180
tttgtaggtt ctggtcgtgg acgacgacct catgtgcctt aaggtggtgt cagccatgtt 240
gaagaggtgc agctatcaag gtgaggtctt tactggtgtc tgttattgct gtaacatcat 300
ttcgctgttg cacaatttaa acatttgtaa tttactgttg ttattgcagt ggccacttgt 360
agcagtggca gcgaggcact gacacttcta cgtgaacgca acgaggacgg atcctccgac 420
cagttcgacc tcgtactgtc agatgtttac atgccgggta tgtcgtattc ctttgtaaac 480
tttacaatat gcgtctagtt tgacgcgtac actttgtaca ctttgcaaaa acgcaccctg 540
cgaggtctgc catttggtca ctacaacttg gccaccttgg ttgcaagttt gcaagttcgc 600
tctacgtcaa cgctgcaaaa tgaaccaatt gttttgcact gaccctgcca accttcattt 660
gtggctgcag acatggacgg tttcaagctg cttgaacaca tcggtctaga gttggagctt 720
cccgttatca gtaagttgat cgagccgagt ccagagcgaa gcctgcttct atactattag 780
cagctgtctt ttgatatttg acagcttgac ttgatatggt cacagagcat acttgcaacc 840
aggttacctg ttgaactagc aactgtgccc aagcatctct tcaagcacct ccgtcagtcc 900
atagggtact gttgatttgt actctgcaat actgcactgt aatgcgctgt gaatcactgc 960
ccttcacctc tagatggtgc ttccctggag ccctccccca cctccgcctc aagcccctca 1020
catgcctctc ccccccctgc agtgatgtca tccaacgggg acacgaatgt cgtgctgcgg 1080
ggggtcaccc acggggctgt ggactttctg atcaagcccg ttcgaattga ggagctgcgg 1140
aacgtgtggc agcacgtggt gcgtcgtcgt tccatggcgc tggccaggac gccagacgag 1200
gggggacact cggacgagga ctctcaggtg cccttggcag cttctgggcg gcttgctgtg 1260
tcggatgcca cttggactgg ggatgcacga ggggtggggg gacaatggga gatgggccat 1320
agtaggccag agttgatggc agtggtggtg ggggggagta ggcgggagag aagcagccat 1380
cctggtgttg gttttgatga ttgagtgcat ggggatgatg cacaggtgag ctgactggat 1440
gccttgtctt gctgtgctgc gctgcagcgg cacagtgtga aacgcaagga gtcggagcag 1500
agcccgctgc agctcagcac agagcagggc gggaacaaga agccaagagt ggtgtggtcg 1560
gtggagatgc accaacaggt gtgcttgcgg gcgggtgtat acgggggagg ggggccagct 1620
gctggctgac ctggcgtgcg cggtgcattg cacttggcga tgaggggcgt gcttcagtat 1680
gtagctggga cgcaattggt tgtgctgtgt gaccagtgca caaaatacat ccctgaattc 1740
cagtgggttg aacagagttg tcctggaggt gggaagcaaa cgcgcacgtg gtagagggga 1800
gcagggtgca gaacagccgc agcaggggtg ttgcgcagtg tgcaggtatc ctgcctccat 1860
gccccgggcc atgggcatac tacgctggta ccgtcaggat gggcgttgag cctggcttgg 1920
ggggcagggg gcgagcgaat gcggaatggg agcggcaggt gctgggaggg tggctgactg 1980
gcttgcagga gcgcaagtcc tgtcgggggc gtcgtcctgt tccctcctgc ccgcttcacc 2040
cacgttcact ctcatgcctc cacactcctg ctgctgacac acctgtcgcc acctccgctg 2100
cagtttgtga acgcggtcaa ctccctgggc attgacaagg cggtgcccaa gcggattctg 2160
gacctgatga acgtggaggg gctgacgcgc gagaacgtgg ccagccatct gcaggtgcct 2220
gccatgaccc ctcccaccag ggacctggtg ttttgacacc ctggaactcc tctttgacgg 2280
agcctccagt tcaattccag caatcgaatt gaatcaaaaa gcatgtgcac ccacgtgctg 2340
tttgaatgtc ccatgtggta ggaaacacaa ctgccccctt gccatttgct ggagggtgcc 2400
cgctgcgcca tgcccgagtg cgctgtgctc agcgttgtgc tgcgcccccc gctgactgaa 2460
gctgacagcg tgcggctgag gagggtactg ggggaggggg ggtgggaggc ggccgctggc 2520
ggcggaaggg agggtgtgca cgcatggaca cagggccttt ccgccctgca cggcctctac 2580
tgcaccctgc cacgtgatgt atcgacatgg tgggccatgc tgtgctgtgc cgctgcagaa 2640
gtaccgcctg tacctgaagc gggtggaggg agtgcaatcg ggtgcggcag cctccaagca 2700
gcaccagcac ccgcagtatc accagcagca gcagcagcag caagcgcaac ctcgtgcagc 2760
tgtctcccct gcagcagctt cctttggtgc cctttccttg ggagccccgc agcaggcgca 2820
gcagggcatg ccgcagctgg ggatgcctgt gcaggtgaag actgcccccc cccccctccc 2880
cctttccatc ttccctccat cagcctgctg ttccttaccc ttgtcaaccc gtctctcctt 2940
tttcgcaagc agcgcaccac cccccatgca cgccttgcct ggcactgttg tcagctgccc 3000
ccctagaaat acacaaggtg tgggtgcaac tggtgggacc ccctcccccc cccccctggg 3060
gctgcagggt ctccctccaa acttggcagc catgggatcc cagccgccgc acatcccctt 3120
ccagcaggcc ctggccatgc aggcggcggc tgcggcggct gcagccagcg gcgcgctccc 3180
cgggagtctg cccccctaca tgccaccccc ggggatgatg ccccccggca tgccgggggg 3240
ggtccccggt atgggagggg tggtggggca tcctcaggta cgggcagcac atgagtgggc 3300
aggggtattg gagaggggaa gggcagggag gttgcatgtg aggggctgca tggcaaagag 3360
gctgcagcgc aggtgttgct tgcagcactt cccctcggtg gcgcttgcat caaattttga 3420
atcctccccc gatgggcacg cccgtgtgtg ggggggggtg ggatggggga tgggggtggt 3480
tttgtggcat gtcgggcgct ttcatctacc cgggcccctg cccctgcctg tacgcgtgcg 3540
catgtgtgca gatgcccgcc ccagggatgg actttgcggg tttcaacggg tatggcaacg 3600
ctgcgggggg gctgatgttt ggcgggcagc agcaggcgca gcacgcgcag cagcacgcgt 3660
cagcgcaagc gggctcgctg gcgcagcagc aggcgcagca agtatccatg ggcttgggcc 3720
ttatgccccc cccgttgggg ttcccgccca cctcgctcgc cgcgccagcc ccgcgctccg 3780
cagcaactga gcccgccgca gccccactcc ccctgacgtc ctcgccgcca gctgcttcag 3840
caggcggcag cggcggccca gcagcagctg ctccgcagca cagcagcggc gccgcagcag 3900
cccaagcccc ccatcaccac ccacagtgct cggagcaggg agcggggggg ctcccgcccc 3960
cgctgcccgc gtccagcgcc ccgcagtcct atcccctccc tcccccctcc tcgcaggccg 4020
ctttgcatga cccggacgaa cactaccccc caggctcggc agaggtgagc acgtcccccc 4080
gccccctccc cccccccccc cccccttccc ttcaccctgg cttggcgtgc aatgaaaccc 4140
taaataaccc taaaacctca ttatcagttg caaattggac ccgtgaagcg ggcgggggca 4200
actgcgctct gctggtgtca gcgctgtctc tgccggttcc tgcccagcgt gcgcctgcat 4260
gcaagggggg atgggggggg ggaggcattt aacaataggc cagtcatctc caatccaccg 4320
tcaatttcag ccccctcccc ccccctccct catccccttg cagatgcacc accagcacct 4380
cccagggctg tgtggcttta acccggacga cctgctgggg gggcagctgg gggacatggg 4440
gttcctgggg gagctggggg gggcggtggg aggaaagcac gaacaggacg acttcctgga 4500
cctgctgctg aagggggagg aggagctgtg a 4531
<210> 2
<211> 1860
<212> DNA
<213> Parachlorella sp.
<400> 2
atgtctggtt cagctggatc gggccaggct actctcagac atgacggtgg ctctgctggc 60
ggcagtgggc ctgtctcaga cggtttttca ccggccggcc tgaaggttct ggtcgtggac 120
gacgacctca tgtgccttaa ggtggtgtca gccatgttga agaggtgcag ctatcaagtg 180
gccacttgta gcagtggcag cgaggcactg acacttctac gtgaacgcaa cgaggacgga 240
tcctccgacc agttcgacct cgtactgtca gatgtttaca tgccggacat ggacggtttc 300
aagctgcttg aacacatcgg tctagagttg gagcttcccg ttatcatgat gtcatccaac 360
ggggacacga atgtcgtgct gcggggggtc acccacgggg ctgtggactt tctgatcaag 420
cccgttcgaa ttgaggagct gcggaacgtg tggcagcacg tggtgcgtcg tcgttccatg 480
gcgctggcca ggacgccaga cgagggggga cactcggacg aggactctca gcggcacagt 540
gtgaaacgca aggagtcgga gcagagcccg ctgcagctca gcacagagca gggcgggaac 600
aagaagccaa gagtggtgtg gtcggtggag atgcaccaac agtttgtgaa cgcggtcaac 660
tccctgggca ttgacaaggc ggtgcccaag cggattctgg acctgatgaa cgtggagggg 720
ctgacgcgcg agaacgtggc cagccatctg cagaagtacc gcctgtacct gaagcgggtg 780
gagggagtgc aatcgggtgc ggcagcctcc aagcagcacc agcacccgca gtatcaccag 840
cagcagcagc agcagcaagc gcaacctcgt gcagctgtct cccctgcagc agcttccttt 900
ggtgcccttt ccttgggagc cccgcagcag gcgcagcagg gcatgccgca gctggggatg 960
cctgtgcagg gtctccctcc aaacttggca gccatgggat cccagccgcc gcacatcccc 1020
ttccagcagg ccctggccat gcaggcggcg gctgcggcgg ctgcagccag cggcgcgctc 1080
cccgggagtc tgccccccta catgccaccc ccggggatga tgccccccgg catgccgggg 1140
ggggtccccg gtatgggagg ggtggtgggg catcctcaga tgcccgcccc agggatggac 1200
tttgcgggtt tcaacgggta tggcaacgct gcgggggggc tgatgtttgg cgggcagcag 1260
caggcgcagc acgcgcagca gcacgcgtca gcgcaagcgg gctcgctggc gcagcagcag 1320
gcgcagcaag tatccatggg cttgggcctt atgccccccc cgttggggtt cccgcccacc 1380
tcgctcgccg cgccagcccc gcgctccgca gcaactgagc ccgccgcagc cccactcccc 1440
ctgacgtcct cgccgccagc tgcttcagca ggcggcagcg gcggcccagc agcagctgct 1500
ccgcagcaca gcagcggcgc cgcagcagcc caagcccccc atcaccaccc acagtgctcg 1560
gagcagggag cgggggggct cccgcccccg ctgcccgcgt ccagcgcccc gcagtcctat 1620
cccctccctc ccccctcctc gcaggccgct ttgcatgacc cggacgaaca ctacccccca 1680
ggctcggcag agatgcacca ccagcacctc ccagggctgt gtggctttaa cccggacgac 1740
ctgctggggg ggcagctggg ggacatgggg ttcctggggg agctgggggg ggcggtggga 1800
ggaaagcacg aacaggacga cttcctggac ctgctgctga agggggagga ggagctgtga 1860
<210> 3
<211> 619
<212> PRT
<213> Parachlorella sp.
<400> 3
Met Ser Gly Ser Ala Gly Ser Gly Gln Ala Thr Leu Arg His Asp Gly
1 5 10 15
Gly Ser Ala Gly Gly Ser Gly Pro Val Ser Asp Gly Phe Ser Pro Ala
20 25 30
Gly Leu Lys Val Leu Val Val Asp Asp Asp Leu Met Cys Leu Lys Val
35 40 45
Val Ser Ala Met Leu Lys Arg Cys Ser Tyr Gln Val Ala Thr Cys Ser
50 55 60
Ser Gly Ser Glu Ala Leu Thr Leu Leu Arg Glu Arg Asn Glu Asp Gly
65 70 75 80
Ser Ser Asp Gln Phe Asp Leu Val Leu Ser Asp Val Tyr Met Pro Asp
85 90 95
Met Asp Gly Phe Lys Leu Leu Glu His Ile Gly Leu Glu Leu Glu Leu
100 105 110
Pro Val Ile Met Met Ser Ser Asn Gly Asp Thr Asn Val Val Leu Arg
115 120 125
Gly Val Thr His Gly Ala Val Asp Phe Leu Ile Lys Pro Val Arg Ile
130 135 140
Glu Glu Leu Arg Asn Val Trp Gln His Val Val Arg Arg Arg Ser Met
145 150 155 160
Ala Leu Ala Arg Thr Pro Asp Glu Gly Gly His Ser Asp Glu Asp Ser
165 170 175
Gln Arg His Ser Val Lys Arg Lys Glu Ser Glu Gln Ser Pro Leu Gln
180 185 190
Leu Ser Thr Glu Gln Gly Gly Asn Lys Lys Pro Arg Val Val Trp Ser
195 200 205
Val Glu Met His Gln Gln Phe Val Asn Ala Val Asn Ser Leu Gly Ile
210 215 220
Asp Lys Ala Val Pro Lys Arg Ile Leu Asp Leu Met Asn Val Glu Gly
225 230 235 240
Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr
245 250 255
Leu Lys Arg Val Glu Gly Val Gln Ser Gly Ala Ala Ala Ser Lys Gln
260 265 270
His Gln His Pro Gln Tyr His Gln Gln Gln Gln Gln Gln Gln Ala Gln
275 280 285
Pro Arg Ala Ala Val Ser Pro Ala Ala Ala Ser Phe Gly Ala Leu Ser
290 295 300
Leu Gly Ala Pro Gln Gln Ala Gln Gln Gly Met Pro Gln Leu Gly Met
305 310 315 320
Pro Val Gln Gly Leu Pro Pro Asn Leu Ala Ala Met Gly Ser Gln Pro
325 330 335
Pro His Ile Pro Phe Gln Gln Ala Leu Ala Met Gln Ala Ala Ala Ala
340 345 350
Ala Ala Ala Ala Ser Gly Ala Leu Pro Gly Ser Leu Pro Pro Tyr Met
355 360 365
Pro Pro Pro Gly Met Met Pro Pro Gly Met Pro Gly Gly Val Pro Gly
370 375 380
Met Gly Gly Val Val Gly His Pro Gln Met Pro Ala Pro Gly Met Asp
385 390 395 400
Phe Ala Gly Phe Asn Gly Tyr Gly Asn Ala Ala Gly Gly Leu Met Phe
405 410 415
Gly Gly Gln Gln Gln Ala Gln His Ala Gln Gln His Ala Ser Ala Gln
420 425 430
Ala Gly Ser Leu Ala Gln Gln Gln Ala Gln Gln Val Ser Met Gly Leu
435 440 445
Gly Leu Met Pro Pro Pro Leu Gly Phe Pro Pro Thr Ser Leu Ala Ala
450 455 460
Pro Ala Pro Arg Ser Ala Ala Thr Glu Pro Ala Ala Ala Pro Leu Pro
465 470 475 480
Leu Thr Ser Ser Pro Pro Ala Ala Ser Ala Gly Gly Ser Gly Gly Pro
485 490 495
Ala Ala Ala Ala Pro Gln His Ser Ser Gly Ala Ala Ala Ala Gln Ala
500 505 510
Pro His His His Pro Gln Cys Ser Glu Gln Gly Ala Gly Gly Leu Pro
515 520 525
Pro Pro Leu Pro Ala Ser Ser Ala Pro Gln Ser Tyr Pro Leu Pro Pro
530 535 540
Pro Ser Ser Gln Ala Ala Leu His Asp Pro Asp Glu His Tyr Pro Pro
545 550 555 560
Gly Ser Ala Glu Met His His Gln His Leu Pro Gly Leu Cys Gly Phe
565 570 575
Asn Pro Asp Asp Leu Leu Gly Gly Gln Leu Gly Asp Met Gly Phe Leu
580 585 590
Gly Glu Leu Gly Gly Ala Val Gly Gly Lys His Glu Gln Asp Asp Phe
595 600 605
Leu Asp Leu Leu Leu Lys Gly Glu Glu Glu Leu
610 615
<210> 4
<211> 2802
<212> DNA
<213> Parachlorella sp.
<400> 4
atggctgccc ccccagtatc tatctcttcc aattttccaa aggttagtat ttacgtaaca 60
tttgccgaca gttgggcaat aacgctgagt tggagtgttg ccaacaagct tttgtgccgt 120
ttccagggtt tgcgggttct cttggtcgat caacagccaa gtaggagcca tattgaagcg 180
cagctgatgc agccggatct taattacaca ggtttgctgc agttttgcac attccaagct 240
tggcctttct ccgtgccaaa cccagcgcgc tgagctcttg ttgtttgttg cagttactgg 300
ttgcgagagc gtttctgaag ctctttcata ttgccgctcg ggagtaagca gctttgacgt 360
ggtgcttgcg gaggtgggtg gtagtgcttc gcgaggtgca cagtgcgcac cgggaaaaac 420
ttgaaaagta tttgtaaaat taattttgaa acttctgtat tattttacac ctcttaacaa 480
tgcacccaat gtttgttatg agcgccacgt taccggacgt ttgttgcagg caaggatcgt 540
tgccgtcgac gagacttcgg ggcgcgcatt tatcgactct cttgaagata caccggttat 600
tcttatgtcg gagggaagca cgacgggcga cgttcttcgt gcggtgaagc ttggagctgt 660
ggactggctg gataagcctc tctccgtcct gaagctcaag aacatctggc agcactcagt 720
gcgtaaggta ggtccggaaa caatatgata tctacaattg gaaagttacc agtgtcaact 780
atggaaaacg ttgtactggg tgctagtttc agttgagcca gttgcctgta tatgcatata 840
aggggcagtg acgcagtgtc tgccctgtga gctgtgatgc atcagggtgc acctgaagct 900
ggcagtggat cactccaccc aagatgttgt tgcaatccaa tgtgttgctg atgccttgct 960
tttctgactt gcaaacatgg tgtgggataa aagcgttgct agacagccac cgtgctccac 1020
gttgtcttct gcatgcaaaa ctgcagatga tgcagcgcac cacgttttac gacacttgct 1080
ccgagcagcc aacccagccg gcgcgcagca agctttcttc aggaatcgaa tcgccgagca 1140
cacccacgct gggagactct gtggacttgg acgccatctc ggcggcttcc ttcggcagca 1200
tcaaggactt gaccgatttt tcattttcca gcggagctga ggtgggcatc gctggttgtc 1260
cagcactgca gcatttccca ccagcttggt tggttgcctg tgttttagtg cagagcagag 1320
gccgaggcta ctggttcaac cagcctagtt actcaaaaca attttggcaa cctgctgact 1380
tctctcttaa cctgcagagc gtttcacagc atgtacactt ttcagtgggt ttcgtaattt 1440
cgtagcgcac ccgctggctt ttttctgcag gtcctgagag cctcctttga cagctgtgac 1500
ggctccgagg tcaacctagg cagcgctttg ggccagcctc gcccccctct ggcagtcaag 1560
cccagctcct ttggccccct ggtgagtggc atagctcagc aggagaccca caagtggctg 1620
gaacccacca tgttggcgcg caccctgccc tcgcacgcgc ggctgccgtc tgcgcagcgc 1680
gccgcggtgc gccgctgtgg agttgtggtg ttgcggagtc actcgcgggc cagtgcttca 1740
cagcccattc tcgccgcaca caccctgccc gcacaaatgg ctgccacccc cctaaagcgg 1800
tcctcaggcc acccggatgt gctcaggatg gatccgccga atctcgcacc cctcctccct 1860
caatcccggt tgttcagacg gtttggaacc cctccggctc tctacccctc tgcaggtacc 1920
cgtccctccc acctcccagt ggccccagct gcaggctggc tgcgtgtggg gcactcccgt 1980
gggcggcccg ctggcgcccc cctccatgac caacgcccag catggtgccc cccacagcgt 2040
gcccctggca gacgcacact tggccggcag cgccagttac atgtccctct cctctgtgag 2100
tctcctcccc tccaccccta catcttccaa tcgaacatgc gacgcacgca cacccatagt 2160
ccctaaacaa gtgctttggt gttttttcac ttgcaaaccc caaccctgac acctgaagcg 2220
tgacacaggc gactgcgctg ctccccgccc ccacacgccc ttggttgttt gtgccctgca 2280
cttctgccac gacatgcatg tcatgtcttt tcacgcctgc gatgtcgctg cttaaacttg 2340
aaactcattg tggccggggt gcagctcatg gaggaggaca ccccctgtcc cttggacatg 2400
gatgcaccag aggacgggat gcagcttcct gttgacttcc tgtctgttgc caacgtcagc 2460
agcaatggta ggtccagcac cagacgcctc tgtctgctat gagacgcacc tccagccgcc 2520
ccctctggac agacagcgcg ctgcacgctc tgcgcgctgg accttgccgc acacgcgcgc 2580
gacaaggcct ggtgtgatgc ttggatgtgg aaggttccag catggttgga caagatggta 2640
tcctggcaca catattggta tgcagcatac acccaggctg cccccttacc ctcgcacgcc 2700
ctacccctta ctgcaggcag cggtcccatt gggttgaagc tgaagaaaag caacagcctg 2760
ctgaacatga tcaacgcagc gctgatgtct ggtggtcagt ga 2802
<210> 5
<211> 359
<212> PRT
<213> Parachlorella sp.
<400> 5
Met Ala Ala Pro Pro Val Ser Ile Ser Ser Asn Phe Pro Lys Gly Leu
1 5 10 15
Arg Val Leu Leu Val Asp Gln Gln Pro Ser Arg Ser His Ile Glu Ala
20 25 30
Gln Leu Met Gln Pro Asp Leu Asn Tyr Thr Val Thr Gly Cys Glu Ser
35 40 45
Val Ser Glu Ala Leu Ser Tyr Cys Arg Ser Gly Val Ser Ser Phe Asp
50 55 60
Val Val Leu Ala Glu Ala Arg Ile Val Ala Val Asp Glu Thr Ser Gly
65 70 75 80
Arg Ala Phe Ile Asp Ser Leu Glu Asp Thr Pro Val Ile Leu Met Ser
85 90 95
Glu Gly Ser Thr Thr Gly Asp Val Leu Arg Ala Val Lys Leu Gly Ala
100 105 110
Val Asp Trp Leu Asp Lys Pro Leu Ser Val Leu Lys Leu Lys Asn Ile
115 120 125
Trp Gln His Ser Val Arg Lys Met Met Gln Arg Thr Thr Phe Tyr Asp
130 135 140
Thr Cys Ser Glu Gln Pro Thr Gln Pro Ala Arg Ser Lys Leu Ser Ser
145 150 155 160
Gly Ile Glu Ser Pro Ser Thr Pro Thr Leu Gly Asp Ser Val Asp Leu
165 170 175
Asp Ala Ile Ser Ala Ala Ser Phe Gly Ser Ile Lys Asp Leu Thr Asp
180 185 190
Phe Ser Phe Ser Ser Gly Ala Glu Val Leu Arg Ala Ser Phe Asp Ser
195 200 205
Cys Asp Gly Ser Glu Val Asn Leu Gly Ser Ala Leu Gly Gln Pro Arg
210 215 220
Pro Pro Leu Ala Val Lys Pro Ser Ser Phe Gly Pro Leu Val Pro Val
225 230 235 240
Pro Pro Thr Ser Gln Trp Pro Gln Leu Gln Ala Gly Cys Val Trp Gly
245 250 255
Thr Pro Val Gly Gly Pro Leu Ala Pro Pro Ser Met Thr Asn Ala Gln
260 265 270
His Gly Ala Pro His Ser Val Pro Leu Ala Asp Ala His Leu Ala Gly
275 280 285
Ser Ala Ser Tyr Met Ser Leu Ser Ser Leu Met Glu Glu Asp Thr Pro
290 295 300
Cys Pro Leu Asp Met Asp Ala Pro Glu Asp Gly Met Gln Leu Pro Val
305 310 315 320
Asp Phe Leu Ser Val Ala Asn Val Ser Ser Asn Gly Ser Gly Pro Ile
325 330 335
Gly Leu Lys Leu Lys Lys Ser Asn Ser Leu Leu Asn Met Ile Asn Ala
340 345 350
Ala Leu Met Ser Gly Gly Gln
355
<210> 6
<211> 119
<212> PRT
<213> Parachlorella sp.
<400> 6
Gly Leu Arg Val Leu Leu Val Asp Gln Gln Pro Ser Arg Ser His Ile
1 5 10 15
Glu Ala Gln Leu Met Gln Asp Leu Asn Tyr Thr Val Thr Gly Cys Glu
20 25 30
Ser Val Ser Glu Ala Leu Ser Tyr Cys Arg Ser Gly Val Ser Ser Phe
35 40 45
Asp Val Val Leu Ala Glu Ala Arg Ile Val Ala Val Asp Glu Thr Ser
50 55 60
Gly Arg Ala Phe Ile Asp Ser Leu Glu Asp Thr Pro Val Ile Leu Met
65 70 75 80
Ser Glu Gly Ser Thr Thr Gly Asp Val Leu Arg Ala Val Lys Leu Gly
85 90 95
Ala Val Asp Trp Leu Asp Lys Pro Leu Ser Val Leu Lys Leu Lys Asn
100 105 110
Ile Trp Gln His Ser Val Arg
115
<210> 7
<211> 1080
<212> DNA
<213> Parachlorella sp.
<400> 7
atggctgccc ccccagtatc tatctcttcc aattttccaa agggtttgcg ggttctcttg 60
gtcgatcaac agccaagtag gagccatatt gaagcgcagc tgatgcagcc ggatcttaat 120
tacacagtta ctggttgcga gagcgtttct gaagctcttt catattgccg ctcgggagta 180
agcagctttg acgtggtgct tgcggaggca aggatcgttg ccgtcgacga gacttcgggg 240
cgcgcattta tcgactctct tgaagataca ccggttattc ttatgtcgga gggaagcacg 300
acgggcgacg ttcttcgtgc ggtgaagctt ggagctgtgg actggctgga taagcctctc 360
tccgtcctga agctcaagaa catctggcag cactcagtgc gtaagatgat gcagcgcacc 420
acgttttacg acacttgctc cgagcagcca acccagccgg cgcgcagcaa gctttcttca 480
ggaatcgaat cgccgagcac acccacgctg ggagactctg tggacttgga cgccatctcg 540
gcggcttcct tcggcagcat caaggacttg accgattttt cattttccag cggagctgag 600
gtcctgagag cctcctttga cagctgtgac ggctccgagg tcaacctagg cagcgctttg 660
ggccagcctc gcccccctct ggcagtcaag cccagctcct ttggccccct ggtacccgtc 720
cctcccacct cccagtggcc ccagctgcag gctggctgcg tgtggggcac tcccgtgggc 780
ggcccgctgg cgcccccctc catgaccaac gcccagcatg gtgcccccca cagcgtgccc 840
ctggcagacg cacacttggc cggcagcgcc agttacatgt ccctctcctc tctcatggag 900
gaggacaccc cctgtccctt ggacatggat gcaccagagg acgggatgca gcttcctgtt 960
gacttcctgt ctgttgccaa cgtcagcagc aatggcagcg gtcccattgg gttgaagctg 1020
aagaaaagca acagcctgct gaacatgatc aacgcagcgc tgatgtctgg tggtcagtga 1080
<210> 8
<211> 1707
<212> DNA
<213> Parachlorella sp.
<400> 8
atgcttcggc agcagctgtt gcacagcggc aggcagccgg gtgcgacatg cagcttacta 60
acctgctcga catggcgacc gtctgccttg ttcggccgtc ctaagcccca aaaactgcac 120
agccagcgct tgcagcatca gggccgcccc tcccgcctcg tcgtgcgcag cgcaatgttc 180
gacaacctga gccgcagcct ggagagggcg tgggacatgg tgcgcaagga cgggcggcta 240
acggcggaca acatcaagga gcccatgcgg gagattcgca gggcgctgct tgaggcggat 300
gtgaggctgg gggcgccgct gatcagattc ttggtatcta cccccccccc ctcccaggtc 360
tccctccccg tggtgcgcaa gtttgtgaag gcggtggagg agaaggcgct gggttctgca 420
gtgaccaagg gtgtcacccc cgaccagcag ctggtgaagg tggtgtacga ccagctgcgg 480
gagctgatgg gggggcagca ggaagggctg gtgcccactt cgccagagga gccgcaggtg 540
atcttgatgg cggggctgca gggcacgggg aagacgacag ctgcggggaa gctggccttg 600
ttcctgcaga agaaggggca gaaggtgctg ctggtggcca ccgacatcta ccgccccgcc 660
gccatcgacc agctggtgaa gctgggcgac aggatagggg tgccggtgtt ccagctggga 720
acccaggtgc agccgccgga gattgcaagg caggggctgg agaaggcgcg agcagagggg 780
tttgacgccg tcatcgtcga cacggcgggg cggctgcaga tcgaccagag catgatggag 840
gagctggtgc agatcaagtc cacggtgaag ccctccgaca cgctgctagt ggtcgatgcg 900
atgacggggc aggaggcagc cgggctggtg aaggcgttca atgatgccgt ggacatcaca 960
ggcgccgtgc tgaccaagct tgacggggac agccgcggcg gcgccgcgct gagcgtgcgc 1020
caggtcagcg ggcggcccat caagtttgtg ggcatggggg agggcatgga ggcgctggag 1080
cccttctacc ccgagcgcat ggccagcagg attctgggca tgggtgacgt ggtcaccctg 1140
gtggagaagg ctgaggagag catcaaggaa gaggaggcgc aggagatatc gcggaagatg 1200
ctgtcggcca aatttgactt tgacgacttc ctgaagcagt acaagatggt ggcggggatg 1260
gggaacatgg cccaaatcat gaagatgctg ccaggcatga acaagtttac ggagaagcag 1320
ctggcgggcg ttgagaagca gtacaaggtg tacgagagca tgatccagag catgacggtg 1380
aaggagcgca agcagccgga gctgttggtg aagtcgccct ccaggaggcg gcgcatagcg 1440
cgcgggtcgg ggcgctcgga gcgggaggtc acagagctgc tgggggtgtt caccaacctg 1500
cggacgcaga tgcagagctt ctccaaaatg atggccatgg gggggatggg catgggctcc 1560
atgatgagcg acgaggagat gatgcaggcc acgctggcag gcgccggccc ccgccccgtg 1620
ccagctggca aggtgcggcg gaagaagctg gccgcggcgg gcgggtcgcg gggcatggct 1680
gagctggcat ccctgaaggc agaatga 1707
<210> 9
<211> 302
<212> PRT
<213> Coccomyxa
<400> 9
Met Gly Leu Lys Ala Arg Ala Ala Ser Val Ser Val His Ser Ser Ala
1 5 10 15
Asn Asn Thr Ala Ser Pro Leu Ser Ser Gly Arg Arg Gly Phe Pro His
20 25 30
Ser Gly Glu Met Ser Gly Glu Asp Leu Ala Arg Ser Asp Ser Trp Glu
35 40 45
Met Phe Pro Ala Gly Leu Lys Val Leu Val Val Asp Asp Asp Pro Leu
50 55 60
Cys Leu Lys Val Val Glu His Met Leu Arg Arg Cys Asn Tyr Gln Val
65 70 75 80
Thr Thr Cys Pro Asn Gly Lys Ala Ala Leu Glu Lys Leu Arg Asp Arg
85 90 95
Ser Val His Phe Asp Leu Val Leu Ser Asp Val Tyr Met Pro Asp Met
100 105 110
Asp Gly Phe Lys Leu Leu Glu His Ile Gly Leu Glu Leu Asp Leu Pro
115 120 125
Val Ile Met Met Ser Ser Asn Gly Glu Thr Asn Val Val Leu Arg Gly
130 135 140
Val Thr His Gly Ala Val Asp Phe Leu Ile Lys Pro Val Arg Val Glu
145 150 155 160
Glu Leu Arg Asn Val Trp Gln His Val Val Arg Arg Lys Arg Asp Gln
165 170 175
Ala Val Ser Gln Ala Arg Asp Ser Arg Asp Ile Ser Asp Glu Glu Gly
180 185 190
Thr Asp Asp Gly Lys Pro Arg Asp Lys Lys Arg Lys Glu Val Ile Leu
195 200 205
Val Leu Trp Trp Asp Met Gln Arg Arg Asp Ser Asp Asp Gly Val Ser
210 215 220
Ala Lys Lys Ala Arg Val Val Trp Ser Val Glu Met His Gln Gln Phe
225 230 235 240
Val Gln Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg
245 250 255
Ile Leu Asp Leu Met Asn Val Asp Gly Leu Thr Arg Glu Asn Val Ala
260 265 270
Ser His Leu Gln Val Pro His Leu Ser Ile Phe Ser Pro Leu Phe Ala
275 280 285
Glu Leu Met Ser Thr Leu Pro Arg Arg Cys Phe Tyr Asp Phe
290 295 300
<210> 10
<211> 269
<212> PRT
<213> Ostreococcus lucimarinus
<400> 10
Phe Pro Ala Gly Leu Gly Val Leu Val Val Asp Asp Asp Leu Leu Cys
1 5 10 15
Leu Lys Val Val Glu Lys Met Leu Lys Ala Cys Lys Tyr Lys Val Thr
20 25 30
Ala Cys Ser Thr Ala Lys Thr Ala Leu Glu Ile Leu Arg Thr Arg Lys
35 40 45
Glu Glu Phe Asp Ile Val Leu Ser Asp Val His Met Pro Asp Met Asp
50 55 60
Gly Phe Lys Leu Leu Glu Ile Ile Gln Phe Glu Leu Ala Leu Pro Val
65 70 75 80
Leu Met Met Ser Ala Asn Ser Asp Ser Ser Val Val Leu Arg Gly Ile
85 90 95
Ile His Gly Ala Val Asp Tyr Leu Leu Lys Pro Val Arg Ile Glu Glu
100 105 110
Leu Arg Asn Ile Trp Gln His Val Val Arg Arg Asp Tyr Ser Ser Ala
115 120 125
Lys Ser Ser Gly Ser Glu Asp Val Glu Ala Ser Ser Pro Ser Lys Arg
130 135 140
Ala Lys Thr Ser Gly Ser Asn Ser Lys Ser Glu Glu Val Asp Arg Thr
145 150 155 160
Ala Ser Glu Met Ser Ser Gly Lys Ala Arg Lys Lys Pro Thr Gly Lys
165 170 175
Lys Gly Gly Lys Ser Val Lys Glu Ala Glu Lys Lys Asp Val Val Asp
180 185 190
Asn Ser Asn Ser Lys Lys Pro Arg Val Val Trp Ser Ala Glu Leu His
195 200 205
Ala Gln Phe Val Thr Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val
210 215 220
Pro Lys Arg Ile Leu Asp Leu Met Gly Val Gln Gly Leu Thr Glu Asn
225 230 235 240
Val Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Leu Gln
245 250 255
Gly Asn Asp Ala Arg Gly Gly Gly Asn Ala Ser Ser Thr
260 265
<210> 11
<211> 941
<212> PRT
<213> Chlamydomonas reinhardtii
<400> 11
Met Asp Ser Gln Gly Val Lys Leu Glu Glu His Pro Gly His Thr Gly
1 5 10 15
Gly His Trp Gln Gly Phe Pro Ala Gly Leu Arg Leu Leu Val Val Asp
20 25 30
Asp Asp Pro Leu Cys Leu Lys Val Val Glu Gln Met Leu Arg Lys Cys
35 40 45
Ser Tyr Glu Val Thr Val Cys Ser Asn Ala Thr Thr Ala Leu Asn Ile
50 55 60
Leu Arg Asp Lys Asn Thr Glu Tyr Asp Leu Val Leu Ser Asp Val Tyr
65 70 75 80
Met Pro Asp Met Asp Gly Phe Arg Leu Leu Glu Leu Val Gly Leu Glu
85 90 95
Met Asp Leu Pro Val Ile Met Met Ser Ser Asn Gly Asp Thr Ser Asn
100 105 110
Val Leu Arg Gly Val Thr His Gly Ala Cys Asp Tyr Leu Ile Lys Pro
115 120 125
Val Arg Leu Glu Glu Leu Arg Asn Leu Trp Gln His Val Val Arg Arg
130 135 140
Arg Arg Gln His Ala Gln Glu Ile Asp Ser Asp Glu Gln Ser Gln Glu
145 150 155 160
Arg Asp Glu Asp Gln Thr Arg Asn Lys Arg Lys Ala Asp Ala Ala Gly
165 170 175
Val Thr Gly Asp Gln Cys Arg Leu Asn Gly Ser Gly Ser Gly Gly Ala
180 185 190
Ala Gly Pro Gly Ser Gly Gly Gly Ala Gly Gly Met Thr Asp Glu Met
195 200 205
Leu Met Met Ser Gly Gly Glu Asn Gly Ser Asn Lys Lys Ala Arg Val
210 215 220
Val Trp Ser Val Glu Met His Gln Gln Phe Val Asn Ala Val Asn Gln
225 230 235 240
Leu Gly Ile Asp Lys Ala Val Pro Lys Lys Ile Leu Glu Ile Met Gly
245 250 255
Val Asp Gly Ser Ala Gly Arg Leu Ala Asp Thr Ser Gly Arg Asp Val
260 265 270
Cys Gly Thr Val Tyr Arg Leu Tyr Leu Lys Arg Val Ser Gly Val Thr
275 280 285
Pro Ser Gly His His His Asn Ala Ala His Lys Ser Asn Lys Pro Ser
290 295 300
Pro His Thr Thr Pro Pro Pro Pro Ala Leu Pro Gly Gln Ala Gly Thr
305 310 315 320
His Pro Ala Asn Gln Ala Thr Ala Ile Pro Pro Pro Pro Gln Pro Gly
325 330 335
Ser Gly Thr Ala Ala Gly Ala Gly Ala Ala Ala Ala Gly Thr Gly Gly
340 345 350
Gly Ala Ala Ala Ala Asn Gly His Ala Ala Thr Thr Gly Ala Gly Thr
355 360 365
Pro Gly Ala Ala Pro Gly Ala Gly Gly Gly Val Gly Gly Thr Gly Ala
370 375 380
Gly Gly Leu Gly Ser Gly Pro Asp Gly Ala Ala Ala Ala Ala Gly Pro
385 390 395 400
Gly Pro Gly Ala Ala Val Pro Gly Gly Leu Gly Gly Leu Pro Leu Pro
405 410 415
Pro Gly Ala Gly Pro Gly Pro Gly Pro Gly Gly Phe Gly Gly Pro Ser
420 425 430
Pro Pro Pro Pro Pro His Pro Ala Ala Leu Leu Ala Asn Pro Met Ala
435 440 445
Ala Ala Val Ala Gly Leu Asn Gln Ser Leu Leu Asn Ala Met Gly Ser
450 455 460
Leu Gly Val Gly Val Gly Gly Met Ser Pro Leu Gly Pro Val Gly Pro
465 470 475 480
Leu Gly Pro Leu Gly Gly Leu Pro Gly Leu Pro Gly Met Gln Pro Pro
485 490 495
Pro Leu Gly Met Gly Gly Leu Gln Pro Gly Met Gly Pro Leu Gly Pro
500 505 510
Leu Gly Leu Pro Gly Met Gly Gly Leu Pro Gly Leu Pro Gly Met Asn
515 520 525
Pro Met Ala Asn Leu Met Gln Gly Met Ala Ala Gly Met Ala Ala Ala
530 535 540
Asn Gln Met Asn Gly Met Gly Gly His Met Gly Gly His Met Gly Gly
545 550 555 560
Met Asn Gly Pro Met Gly Ala Leu Ala Gly Met Asn Gly Leu Asn Gly
565 570 575
Ala Met Met Gly Gly Leu Pro Gly Met Gly Gly Pro Gln Asn Met Phe
580 585 590
Gln Ala Ala Ala Ala Ala Ala Ala Gln Gln Gln Gln Gln Gln Gln Glu
595 600 605
Gln Gln His Ala Met Met Gln Gln Ala Ala Ala Gly Leu Leu Ala Ser
610 615 620
Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Ala
625 630 635 640
Leu Gln Gln Gln Gln Gln Gln Gly Met Ala Val Ser Pro Pro Gly Pro
645 650 655
His Asn Ala Thr Pro Asn Gly Gln Leu His Thr His Pro Gln Ala His
660 665 670
His Pro His Gln His Gly Leu His Ala His Ala His Pro His Gln His
675 680 685
Leu Asn Thr Ala Pro Ala Gly Ala Leu Gly Leu Ser Pro Pro Gln Pro
690 695 700
Pro Ala Gly Leu Leu Ser Ala Ser Gly Leu Ser Ser Gly Pro Asp Gly
705 710 715 720
Ser Gly Leu Gly Ser Gly Val Gly Gly Leu Leu Asp Gly Leu Gln Gln
725 730 735
His Pro His His Pro Gln Leu Gln Leu Ala Gly Ser Leu Gly Thr Gly
740 745 750
Gly Thr Gly Arg Ser Ser Gly Ala Ala Gly Arg Gly Ser Leu Asp Leu
755 760 765
Pro Ala Asp Leu Met Gly Met Ala Leu Leu Asp Phe Pro Pro Val Pro
770 775 780
Val Pro Gly Gly Ala Asp Val Gly Met Ala Gly Ala Gly Gly Gly Ala
785 790 795 800
Ala Gly Ala His His His Gly His Gln Gly His Gln Gly Ile Gly Gly
805 810 815
Gly Ala Gly Val Gly Ile Ala Gly Gly Val Gly Cys Gly Val Pro Ala
820 825 830
Ala Ala His Gly Leu Glu Pro Ala Ile Leu Met Asp Asp Pro Ala Asp
835 840 845
Leu Gly Ala Val Phe Ser Asp Val Met Tyr Gly Thr Pro Gly Gly Gly
850 855 860
Gly Val Pro Gly Gly Val Pro Gly Gly Gly Val Gly Leu Gly Leu Gly
865 870 875 880
Ala Gly Gln Val Pro Ser Gly Pro Ala Gly Ala Gly Gly Leu His Ser
885 890 895
His His His Gln His His His His Gln His His Leu Gly His Val Val
900 905 910
Pro Val Gly Gly Val Asp Pro Leu Ala Gly Asp Ala Ala Lys Met Ala
915 920 925
Met Asn Asp Asp Asp Phe Phe Asn Phe Leu Leu Lys Asn
930 935 940
<210> 12
<211> 523
<212> PRT
<213> Chromochloris zofingiensis
<400> 12
Met Asp Gly Phe Lys Leu Leu Glu Thr Val Gly Leu Glu Leu Asp Leu
1 5 10 15
Pro Val Ile Met Met Ser Ser Asn Gly Glu His Thr Thr Val Met Arg
20 25 30
Gly Val Thr His Gly Ala Cys Asp Phe Leu Ile Lys Pro Val Arg Ile
35 40 45
Glu Glu Leu Arg Asn Ile Trp Gln His Val Ile Arg Arg Thr Arg His
50 55 60
Pro Val Phe Arg Asp Leu Glu Pro Asp Asp His Glu Gly Gly Asp Tyr
65 70 75 80
Glu Ala Ser Lys Lys Arg Lys Asp Leu Tyr Arg Gly Glu Asn Ser Ser
85 90 95
Gly Ser Gly Gly Ala Gly Gly Leu Glu Arg Asp Asp Asp Gly Ser Ala
100 105 110
Ser Lys Lys Pro Arg Val Val Trp Ser Val Glu Met His Gln Gln Phe
115 120 125
Val Gln Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Lys
130 135 140
Ile Leu Glu Leu Met Asn Val Asp Gly Leu Thr Arg Glu Asn Val Ala
145 150 155 160
Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Val Gln Gly Val
165 170 175
Gln Ala Pro Phe Gly Leu Pro Asn Ile Gln Leu Pro Arg Gln Thr Ser
180 185 190
Ser Lys Gly Ala Gly Ser Ser Ser Gln Gln Gln His His Gln Gln Gln
195 200 205
Gln His Gln Gln Gln His Gln His Gln His Gln Thr Ala Leu Gly Thr
210 215 220
Gly Gln Gln Gln Ser His Gln Leu Gln Pro Cys Pro Val Ser Thr Ala
225 230 235 240
Thr Pro Val Met Pro Ser Pro Asp Ala Met Val Ala Ala Ser Met Met
245 250 255
Ser Ser Gln Ala Met Ala Ala Met Ala Pro Gly Val Met Asn Pro Met
260 265 270
Thr Ala Met Asn Ser Met Met Ala Gly Leu Asn Pro Asn Met Met Gly
275 280 285
Met Ala Ala Gly Leu Gly Leu Ala Gly Leu Gly Ile Gly Gly Met Ala
290 295 300
Gly His Pro Val Pro Asn Pro Met Leu Ala Gly Met Gly Pro Met Gly
305 310 315 320
Leu Gly Leu Pro Pro Pro Pro Gly Met Pro Pro Pro Pro Pro Gly Met
325 330 335
Pro Pro Gly Met Pro Pro Gly Met Pro Pro Gly Met Pro Ala Met Met
340 345 350
Gln Gly Leu Ser Met Ala Gly Met Ser His Leu Ala Ala Ala Gly Met
355 360 365
Arg Pro Pro Pro Gly Ala Leu Gly Gly His Leu Gly Gly Pro Gly Leu
370 375 380
Ser Pro Phe Gly Pro Pro Pro Pro Pro Gly Ala Asp Pro Ala Asn Met
385 390 395 400
Met Ala Asn Met Ser Ser Met Met Ala Asn Met Gln Ala Ala Leu Ala
405 410 415
Phe Gln Ala Asp Ala Ala Ala Ala Ala Gln His Gln Ala Ala Ser Thr
420 425 430
Gly Ser Val Ala Pro Gly Arg Gln Gln Gln Val His Gln His Gln Gln
435 440 445
Ala Val Gly Met Ala Val Asp Asp Ala Ala Ala Phe Pro Ser Pro Gly
450 455 460
Cys Arg Pro Asn Gly Ser Ala Asp Ala Gly Ala Gln Ser Ala Ala Glu
465 470 475 480
Pro Asn Asp Phe Ser Arg Val Phe Asp Asp Pro Phe Ala Gln Pro Ala
485 490 495
Ala Ser Pro Ser Gly Ala Ala Ala Ala Gly Ser Asn Glu Ala Pro Gly
500 505 510
Met Asp Asp Phe Leu Asp Phe Phe Leu Lys Ser
515 520
<210> 13
<211> 834
<212> PRT
<213> Volvox carteri
<400> 13
Met Asp Gly Arg Ala Glu Gly Thr Val Ala Ile Lys Gln Glu Asp His
1 5 10 15
Ala Ser Gly His Trp His Asn Phe Pro Ala Gly Leu Arg Leu Leu Val
20 25 30
Val Asp Asp Asp Pro Leu Cys Leu Lys Val Val Glu Gln Met Leu Arg
35 40 45
Lys Cys Ser Tyr Asp Val Thr Thr Cys Thr Asn Ala Thr Met Ala Leu
50 55 60
Asn Leu Leu Arg Asp Lys Ser Thr Glu Tyr Asp Leu Val Leu Ser Asp
65 70 75 80
Val Tyr Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu Val Val Gly
85 90 95
Leu Glu Met Asp Leu Pro Val Ile Met Met Ser Ser Asn Gly Asp Thr
100 105 110
Ser Asn Val Leu Arg Gly Val Thr His Gly Ala Cys Asp Tyr Leu Ile
115 120 125
Lys Pro Val Arg Leu Glu Glu Leu Arg Asn Leu Trp Gln His Val Val
130 135 140
Arg Arg Arg Arg Gln Leu Asn Leu Asp Met Asp Ser Asp Glu His Ser
145 150 155 160
Gln Glu Arg Asp Asp Asp Gln Gly Arg Lys Arg Lys Ala Asp Thr Ala
165 170 175
Gly Cys Ile Gly Asp Gln Leu Arg Met Met Gly Ala Gly Cys Ser Gly
180 185 190
Gly Ala Asn Gly Leu Gly Ser Thr Gly Asn Leu Gly Ala Val Ala Thr
195 200 205
Gly Ser Ala Gly Leu Gly Leu Gly Leu Gly Thr Ala Ala Asp Glu Leu
210 215 220
Gly Leu Gly Leu Asp Asn Gly Ser Ser Lys Lys Ala Arg Val Val Trp
225 230 235 240
Ser Val Glu Met His Gln Gln Phe Val Asn Ala Val Asn Gln Leu Gly
245 250 255
Ile Asp Lys Ala Val Pro Lys Lys Ile Leu Glu Ile Met Asn Val Asp
260 265 270
Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr Arg Leu
275 280 285
Tyr Leu Lys Arg Val Ser Gly Ala Gln Gln Pro Gly Gln Asn Arg Val
290 295 300
Ser Arg Pro Ser Pro Pro Gln Pro Gln Ser Pro Gln Val Pro Ser Gln
305 310 315 320
Gln Gln Gln Ser Leu Pro Gly Gly Gly Gly Ala Ala Ala Ala Gly Ala
325 330 335
Gly Gln Leu Gln Gly Gly Gly Gly Ala Ala Ala Ala Ala Ala Ser Leu
340 345 350
Ala Ser Ile Leu Ala Gly Gly Gly Pro Ala Gly Gly Gly Ala Gly Ala
355 360 365
Gly Pro Pro Pro Gly Gly Gly Gln Leu Gly Ala Asp Gly Gly Gly Pro
370 375 380
Gly Pro Gly Leu Ser Ser Ala Val Ala Asn Ala Met Ser Ala Ala Ala
385 390 395 400
Ala Ala Gly Gly Phe Pro Thr Pro Pro Pro Pro Pro Pro Pro His Pro
405 410 415
Ala Ala Leu Leu Ala Ala Asn Pro Met Met Ala Ala Ala Ala Gly Leu
420 425 430
Asn Pro Leu Leu Gly Ala Met Gly Gly Leu Gly Val Gly Pro Leu Gly
435 440 445
Pro Leu Asn Pro Leu Asn Gly Met Pro Met Pro Gly Met Gln Pro Pro
450 455 460
Leu Gly Leu Leu Pro Gly Leu Pro Gly Pro Gly Gly Gln Leu Gly Leu
465 470 475 480
Gly Pro Leu Gly Pro Ile Gly Leu Pro Gly Pro Gly Pro Leu Pro Ser
485 490 495
Leu Pro Ala Gly Leu Pro Leu Asn Pro Met Ala Asn Gly Leu Gln Gln
500 505 510
Met Ala Ala Ala Asn Leu Met Gln Gly Met Ala Gly Met Gly Gln Leu
515 520 525
Pro Ala Leu Ser Met Asn Gly Met Asn Gly Ile Met Gly Pro Leu Pro
530 535 540
Gly Val Gly Leu Pro Gly Pro Gln Gln His Leu Phe Pro Gln Gln Gln
545 550 555 560
Gln Pro His Leu Gln Gln Gln Gln Gln Gln Gln Gln Gln Lys Asp Leu
565 570 575
Gln Met Ala Gln Lys Gln His Gln Ala Ala Ala Ala Ala Ala Ala Val
580 585 590
Ala Ala Ala Val Ala Ala Ala Gln His Gln Gln Gln Gln Pro Gln Ala
595 600 605
Gln Gln Gln Pro Gln Pro Gln Gln Gln Gln Gln Gln Pro Gly Lys Leu
610 615 620
Pro Gln Ala Thr Val Gly Thr Pro Ala Leu Ala Ser Pro Ala Gly Ala
625 630 635 640
Leu Pro Arg Gln Pro Ser Gly Gln His Pro His Thr Leu Ser Ser Ser
645 650 655
Ser Leu His Thr Gln Gln Pro His Gln Gln Gln Leu Leu His Ser Gln
660 665 670
Pro Ser Ser Thr His Leu Ala Thr Asn Asn Thr Leu Ala Met Ala Pro
675 680 685
Ala Leu Asn Gly Thr Leu Asp Val Gly Gly Lys Gly His Leu His Ala
690 695 700
Ala Gly Gly Gln Gly Ala Gly Ala Gly Ala Gly Ala Val Leu Asp Ile
705 710 715 720
Pro Pro Asp Leu Ile Gly Gly Leu Ile Glu Asp Gly Phe Gly Ala Pro
725 730 735
Pro Gly Pro Thr Ile Gln Leu Ala His Gly Thr Ala Ala Val Leu Asp
740 745 750
Pro Thr Met Leu Leu Asp Glu Gly Asp Asn Ser Asp Phe Ala Ala Val
755 760 765
Phe Gln Glu Met Ser Ser Tyr Gly Gly Gly Gly Val Ile Gly Gly Gly
770 775 780
Gly Ser Gly Ala Gly Ala Met Gly Val Leu Gly His Gly Leu Leu Ala
785 790 795 800
Ala Gly Gly Pro Val Met Val Asp Val Ala Ala Gly Leu Ala Gly Val
805 810 815
Thr Glu Thr Ala Thr Arg Val Asp Asp Asp Phe Leu Asn Phe Leu Leu
820 825 830
Lys Ser
<210> 14
<211> 446
<212> PRT
<213> Tetraselmis sp.
<400> 14
Met Ser Cys Thr Val Ala Ser Phe Pro Pro Ala Ala Gly Gly Gln Gly
1 5 10 15
Ser Pro Ala Thr Pro Val Pro Tyr Gln Asp Leu Leu Val Lys Arg Gln
20 25 30
Asp Gln Trp Ser Asn Phe Pro Ala Gly Leu Arg Val Leu Val Ala Asp
35 40 45
Asn Asp Pro Ala Ser Leu Gln Gln Val Glu Lys Met Leu Lys Lys Cys
50 55 60
Ser Tyr Gln Val Thr Leu Cys Ser Ser Gly Lys Asn Ser Leu Glu Ile
65 70 75 80
Leu Arg Lys Arg Arg Glu Glu Phe Asp Leu Val Leu Ala Asp Ala Asn
85 90 95
Leu Pro Asp Ile Asp Gly Phe Lys Leu Leu His Val Cys His Thr Glu
100 105 110
Leu Ser Leu Pro Val Val Leu Met Ser Gly Thr Ser Asp Thr Gln Leu
115 120 125
Val Met Arg Gly Val Met Asp Gly Ala Arg Asp Phe Leu Ile Lys Pro
130 135 140
Leu Arg Val Glu Glu Leu Lys Val Leu Trp Gln His Leu Val Arg Phe
145 150 155 160
Thr Ser Glu Ile Thr Lys Thr Asp Ala Gln Leu Asn Val Val Lys Val
165 170 175
Glu Leu Asp Gly Gly Arg Pro Ala Gly Glu Val Ser Thr Ser Gln Asn
180 185 190
Gly Ser Gln Cys Thr Glu Arg Glu Gly Glu Gly Asn Ser Ser Lys Lys
195 200 205
Gln Arg Met Asn Trp Ser Asp Glu Met His Gln Gln Phe Val Asn Ala
210 215 220
Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile Leu Asp
225 230 235 240
Leu Met Ser Val Glu Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu
245 250 255
Gln Lys Tyr Arg Ile Tyr Leu Lys Arg Met Ala Asn His Gln Glu Asn
260 265 270
Gly Lys Gln Ala Val Met Ser Thr Asp Thr Ile Ala Arg Ala Glu Ala
275 280 285
Ala Tyr Gln Gly Gly Met Pro Gln Gly Gln Gln Met Met Gln Gln Glu
290 295 300
His Ser Gly Gln Ala Val Gln Tyr Ser Gln Pro His Ala Pro Gly Gly
305 310 315 320
Leu His Gln Gln Ala Met Pro Ala Gln Met His Met Gly Met Met Pro
325 330 335
Ala Gly Pro Gln Pro Gly Ser Met Gln Met Ala Pro His His Val Met
340 345 350
Gln Met Pro Asn Gly Gln Val Met Val Met Gln Gln Met Gly Pro Arg
355 360 365
Pro Gly Met Pro Pro Gly Met Pro Gln Gln Met Met Ala Ser Ser Gln
370 375 380
Gln Met Gly Met Leu Gln Pro Gly Met Pro Ala Gly Gln Met Leu His
385 390 395 400
Phe Gln His Pro Gln Gln Val His Gln His Pro Pro Ser Ser Gly Pro
405 410 415
Met His Ala Val Gln His Met Glu Tyr Ala Tyr Ser Gln Pro Met Gln
420 425 430
Met Ala Gly Trp Pro Val Gln Gly Gln Pro Gly Asn Gln Ala
435 440 445
<210> 15
<211> 490
<212> PRT
<213> Tetraselmis sp.
<400> 15
Met Thr Pro Thr Pro Pro Met Ser Cys Thr Val Ala Ser Phe Pro Pro
1 5 10 15
Ala Ala Gly Gly Gln Gly Ser Pro Ala Thr Pro Val Pro Tyr Gln Asp
20 25 30
Leu Leu Val Lys Arg Gln Asp Gln Trp Ser Asn Phe Pro Ala Gly Leu
35 40 45
Arg Val Leu Val Ala Asp Asn Asp Pro Ala Ser Leu Gln Gln Val Glu
50 55 60
Lys Met Leu Lys Lys Cys Ser Tyr Gln Val Thr Leu Cys Ser Ser Gly
65 70 75 80
Lys Asn Ser Leu Glu Ile Leu Arg Lys Arg Arg Glu Glu Phe Asp Leu
85 90 95
Val Leu Ala Asp Ala Asn Leu Pro Asp Ile Asp Gly Phe Lys Leu Leu
100 105 110
His Val Cys His Thr Glu Leu Ser Leu Pro Val Val Leu Met Ser Gly
115 120 125
Thr Ser Asp Thr Gln Leu Val Met Arg Gly Val Met Asp Gly Ala Arg
130 135 140
Asp Phe Leu Ile Lys Pro Leu Arg Val Glu Glu Leu Lys Val Leu Trp
145 150 155 160
Gln His Leu Val Arg Phe Thr Ser Glu Ile Thr Lys Thr Asp Ala Gln
165 170 175
Leu Asn Val Val Lys Val Glu Leu Asp Gly Gly Arg Pro Ala Gly Glu
180 185 190
Val Ser Thr Ser Gln Asn Gly Ser Gln Cys Thr Glu Arg Glu Gly Glu
195 200 205
Gly Asn Ser Ser Lys Lys Gln Arg Met Asn Trp Ser Asp Glu Met His
210 215 220
Gln Gln Phe Val Asn Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val
225 230 235 240
Pro Lys Arg Ile Leu Asp Leu Met Ser Val Glu Gly Leu Thr Arg Glu
245 250 255
Asn Val Ala Ser His Leu Gln Lys Tyr Arg Ile Tyr Leu Lys Arg Met
260 265 270
Ala Asn His Gln Glu Asn Gly Lys Gln Ala Val Met Ser Thr Asp Thr
275 280 285
Ile Ala Arg Ala Glu Ala Ala Tyr Gln Gly Gly Met Pro Gln Gly Gln
290 295 300
Gln Met Met Gln Gln Glu His Ser Gly Gln Ala Val Gln Tyr Ser Gln
305 310 315 320
Pro His Ala Pro Gly Gly Leu His Gln Gln Ala Met Pro Ala Gln Met
325 330 335
His Met Gly Met Met Pro Ala Gly Pro Gln Pro Gly Ser Met Gln Met
340 345 350
Ala Pro His His Val Met Gln Met Pro Asn Gly Gln Val Met Val Met
355 360 365
Gln Gln Met Gly Pro Arg Pro Gly Met Pro Pro Gly Met Pro Gln Gln
370 375 380
Met Met Ala Ser Ser Gln Gln Met Gly Met Leu Gln Pro Gly Met Pro
385 390 395 400
Ala Gly Gln Met Leu His Phe Gln His Pro Gln Gln Val His Gln His
405 410 415
Pro Pro Ser Ser Gly Pro Met His Ala Gly Gly Glu Met Ile Asp Pro
420 425 430
Gly Ser Met Gln Arg Leu His Gln Gln Pro His Tyr Ile Gly Pro Asn
435 440 445
Gly Gln His Met Pro Ala Pro Ala Met Gly Met Pro Ser Gly Thr Val
450 455 460
Gln His Met Glu Tyr Ala Tyr Ser Gln Pro Met Gln Met Ala Gly Trp
465 470 475 480
Pro Val Gln Gly Gln Pro Gly Asn Gln Ala
485 490
<210> 16
<211> 574
<212> PRT
<213> Tetraselmis sp.
<400> 16
Met Thr Met Pro Leu Gly Gly Gly Leu Cys Met Lys Asp Arg Ile His
1 5 10 15
Gly Asp Glu Arg Tyr Arg Ser Lys Ala Lys Arg Gln Val Asn Thr Ile
20 25 30
Phe Ala Phe Thr Gln Arg Asn Thr Trp Arg Gly Arg Phe Arg Leu Cys
35 40 45
Ser Tyr Arg Thr Thr Glu Leu Leu Gly Gly Ser Lys Thr Thr Glu Pro
50 55 60
Gly Arg Gly Thr Phe Val Leu Gln Ile Phe Met Cys Val Lys Asn Ala
65 70 75 80
Ser Ile Asp Asp Gly Ser Arg His Ile Ser Thr Ser Arg Gly Leu Glu
85 90 95
Ser Val Leu Lys Arg Arg Gly Gly Gln Gly Ala Pro Ala Ala Pro Val
100 105 110
Pro Tyr His Asp Leu Leu Val Lys Arg Gln Asp Gln Trp Ser Asn Phe
115 120 125
Pro Ala Gly Leu Arg Val Leu Val Ala Asp Asn Asp Pro Ala Ser Leu
130 135 140
Gln Gln Val Glu Lys Met Leu Lys Lys Cys Ser Tyr Gln Val Thr Leu
145 150 155 160
Cys Ser Ser Gly Lys Asn Ser Leu Glu Ile Leu Arg Lys Arg Arg Glu
165 170 175
Glu Phe Asp Leu Val Leu Ala Asp Ala Asn Leu Pro Asp Ile Asp Gly
180 185 190
Phe Lys Leu Leu His Val Cys His Thr Glu Leu Ser Leu Pro Val Val
195 200 205
Leu Met Ser Gly Thr Ser Asp Thr Gln Leu Val Met Arg Gly Val Met
210 215 220
Asp Gly Ala Arg Asp Phe Leu Ile Lys Pro Leu Arg Val Glu Glu Leu
225 230 235 240
Lys Val Leu Trp Gln His Leu Val Arg Phe Thr Ser Glu Ile Thr Lys
245 250 255
Thr Asp Ala Gln Leu Asn Val Val Lys Val Glu Leu Asp Ser Gly Arg
260 265 270
Pro Ala Gly Glu Val Ser Thr Ser Gln Asn Gly Ser Gln Cys Ala Glu
275 280 285
Arg Glu Gly Glu Gly Asn Ser Ser Lys Lys Gln Arg Met Asn Trp Ser
290 295 300
Asp Glu Met His Gln Gln Phe Val Asn Ala Val Asn Gln Leu Gly Ile
305 310 315 320
Asp Lys Ala Val Pro Lys Arg Ile Leu Asp Leu Met Ser Val Glu Gly
325 330 335
Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr Arg Ile Tyr
340 345 350
Leu Lys Arg Met Ala Asn His Gln Glu Asn Gly Lys Gln Ala Val Met
355 360 365
Ser Thr Asp Thr Ile Ala Arg Ala Glu Ala Ala Tyr Gln Gly Gly Met
370 375 380
Pro Gln Gly Gln Gln Met Met Gln Gln Glu His Ser Gly Gln Ala Val
385 390 395 400
Gln Tyr Ser Gln Pro His Ala Pro Ser Gly Leu His Gln Gln Ala Met
405 410 415
Pro Ala Gln Met His Met Gly Met Met Pro Ala Gly Pro Gln Pro Gly
420 425 430
Ser Met Gln Met Ala Pro His His Val Met Gln Met Pro Asn Gly Gln
435 440 445
Val Met Val Met Gln Gln Met Gly Pro Arg Pro Gly Met Pro Pro Gly
450 455 460
Met Pro Gln Gln Met Met Ala Ser Ser Gln Gln Met Gly Met Leu Gln
465 470 475 480
Pro Gly Met Pro Ala Gly Gln Met Leu His Phe Gln His Pro Gln Gln
485 490 495
Val His Gln His Pro Pro Ser Ser Gly Pro Met His Ala Gly Gly Glu
500 505 510
Met Ile Asp Pro Gly Ser Met Gln Arg Leu His Gln Gln Pro His Tyr
515 520 525
Ile Val Pro Asn Ala Gln His Met Pro Ala Pro Ala Met Gly Met Pro
530 535 540
Pro Gly Ala Val Gln His Met Glu Tyr Ala Tyr Ser Gln Pro Met Gln
545 550 555 560
Met Ala Gly Trp Pro Val Gln Gly Gln Pro Gly Ser Gln Ala
565 570
<210> 17
<211> 674
<212> PRT
<213> Oocystis sp.
<400> 17
Met Leu Ala Phe Thr His Gln Arg Met Thr Thr Ala Pro Ala Leu Ala
1 5 10 15
Val Ala Thr Ser His Phe Phe Ala His Val Arg Val Thr Thr Gly Ser
20 25 30
Ser Ala Ile Ala Thr Val Phe Ala Ala Arg Ser Arg Gly Ser Gly Leu
35 40 45
Leu Ala Gly Phe Asn Thr Met Glu Asn Val Lys Val Glu Val Pro Glu
50 55 60
Val Val Pro Glu Asn Val Asn Phe Pro Ala Gly Leu Lys Val Leu Val
65 70 75 80
Val Asp Asp Asp Pro Leu Cys Leu Lys Val Ile Asp Gln Met Leu Arg
85 90 95
Arg Cys Asn Tyr Ala Ala Thr Thr Cys Gln Ser Ser Leu Glu Ala Leu
100 105 110
Glu Leu Leu Arg Ser Ser Lys Glu Asn His Phe Asp Leu Val Leu Ser
115 120 125
Asp Val Tyr Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu Ile Ile
130 135 140
Gly Leu Glu Met Gly Leu Pro Val Ile Met Met Ser Ser Asn Gly Glu
145 150 155 160
Thr Gly Val Val Phe Arg Gly Val Thr His Gly Ala Val Asp Phe Leu
165 170 175
Ile Lys Pro Val Arg Ile Glu Glu Leu Arg Asn Leu Trp Gln His Val
180 185 190
Val Arg Lys Thr Met Val Val Pro Ser Asn Asp Lys Ala Thr Ser Glu
195 200 205
Glu Asp Gly Glu Glu Ser Lys His Arg Val Asp Arg Lys Arg Lys Glu
210 215 220
Ser Phe His Ser Arg Ala Arg Glu Gln Val Glu Ile Ala Cys Ser Val
225 230 235 240
Val Pro Ala Leu Leu Trp Pro Thr Val Pro Pro Ser Ser Val His Pro
245 250 255
Thr Ser Ser Ser Phe Leu Arg Ser His Val Leu Leu Leu Gln Arg Ser
260 265 270
Ser Gly Gly Lys Asp Val Leu Asp Glu Gly Gly Ser Asn Ala Lys Lys
275 280 285
Pro Arg Val Val Trp Ser Val Glu Met His Gln Gln Phe Val Asn Ala
290 295 300
Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile Leu Asp
305 310 315 320
Leu Met Asn Val Asp Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu
325 330 335
Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Val Ala Gly Ile Asn Thr Ala
340 345 350
Thr Gly Ser Arg Asn Gly Lys Gly Arg Ser Asp Val Ser Gly Leu Ser
355 360 365
Gly Met Pro Asn Gly Ser Leu Pro Met Pro Gly Met Met Pro Pro His
370 375 380
Met Ala Ala Gly Met Leu Leu Ala Gly Met Ala Ala Asp Val Gly Pro
385 390 395 400
Arg Pro His Pro Phe Pro Ile Met Pro Met Pro Ala Met Ala Leu Gln
405 410 415
Gly Met His Gly Gly Met Ala Gln Met Met Gln Leu Pro Pro Gly Met
420 425 430
Pro Pro Pro Met Met Met Pro Met Ala Pro Leu Leu Pro Ser Gln Leu
435 440 445
Ala Ala Leu Gly Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Val Ala
450 455 460
Arg Ser Glu Ser Met Pro Ser Glu Asn Gly Val Ala Gly Pro Ser Gly
465 470 475 480
Ser Phe Thr Ala Met Leu Asn Gly Pro Ala Pro Met Glu Ser Ser Pro
485 490 495
Phe Ala Ala Leu Gln Val Phe Gly Pro Pro Gln Gly Met Glu Gln Leu
500 505 510
Thr Gln Gln Gln Gln Gln Gln Gln Gln Ala Gly Ala Ala Ala Phe Val
515 520 525
Ala Ala Phe Ala Ala Ala Asn Gly Gly Asp Met Gln Gly Gly Gly Gly
530 535 540
Gly Pro Gly Pro Met Leu Gly Gly Ala Gly Gly Ala Gly Pro Leu Leu
545 550 555 560
Gly Gly Val Gly Gly Gly Asp Pro Leu His Gly Gly Gly Gly Ser Ser
565 570 575
Ala Leu Gly Gly Arg Pro Met Met Ser Ala Glu Gln Pro Met Gly Gly
580 585 590
Ser Gly Gly Leu Ala Ser Asn Ser Leu Thr Val Gln Gln Asn Asp Leu
595 600 605
Ala Gln Met Cys Ser Gln Leu Asp Val Asn Gly Leu Gln Ala Val Ala
610 615 620
Ala Ala Ala Ala Ala Gly Ala Met Gly Ala Pro Gly Gly Ala Gly Gly
625 630 635 640
Ala Met Pro Pro Ser Ser Val Gly Gly Val Gly Pro Asp Met Lys Leu
645 650 655
Thr Glu Gln Asp Asp Phe Phe Ser Phe Leu Leu Lys Asp Ser Asn Leu
660 665 670
Ile Asp
<210> 18
<211> 488
<212> PRT
<213> Micromonas sp.
<400> 18
Met Ser Thr Pro Ala Val Ser Lys Gly Phe Pro Ile Gly Leu Arg Val
1 5 10 15
Leu Val Val Asp Asp Asp Pro Leu Cys Leu Lys Ile Val Glu Lys Met
20 25 30
Leu Lys Arg Cys Gln Tyr Glu Val Thr Thr Phe Ser Arg Gly Ala Glu
35 40 45
Ala Leu Lys Thr Leu Arg Glu Arg Lys Asp Asp Phe Asp Ile Val Leu
50 55 60
Ser Asp Val His Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu His
65 70 75 80
Ile Ala Leu Glu Leu Asp Ile Pro Val Met Met Met Ser Ala Asn Cys
85 90 95
Ala Thr Asp Val Val Leu Arg Gly Ile Ile His Gly Ala Val Asp Tyr
100 105 110
Leu Leu Lys Pro Val Arg Ile Glu Glu Leu Arg Asn Ile Trp Gln His
115 120 125
Val Val Arg Arg Lys Arg Glu Ser Ser Gln Gly Asn Leu Arg Ser Gly
130 135 140
Glu Gly Gly Ser Asn Gly Arg Thr Val Ser Gly Gly Ser Thr Gly Glu
145 150 155 160
Gly Gly Gly Lys Asp Ser Lys Gly Ser Ser Glu Gln His Gly Asp Ala
165 170 175
Lys Asp Lys Thr Gly Ser Ala Gly Gly Ser Gly Gly Ser Ser Lys Arg
180 185 190
Lys Lys Gly Ser Gly Lys Lys Gly Asp Glu Gly Thr Asp Glu Val Lys
195 200 205
Asp Gly Ser Gly Gly Asp Glu Asn Glu Asp Ser Ser Ala Leu Lys Lys
210 215 220
Pro Arg Val Val Trp Ser Ala Glu Leu His Gln Gln Phe Val Thr Ala
225 230 235 240
Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile Leu Asp
245 250 255
Leu Met Gly Val Gln Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu
260 265 270
Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Leu Gln Gly Val Asn Ser Gly
275 280 285
Gly Ala Pro Gly Gly Gly Pro Gly Phe Met Ser Pro Ile Ala Leu Asp
290 295 300
Gly Ser Met Val Gln Gly Gly Pro Gly Gly Arg Val Gly Ser Pro Ala
305 310 315 320
Ile Gly Gly Pro Asn Gly Pro Ile Met Val Gly His Gly His Ile Asp
325 330 335
Pro Ala Met Leu Ala Gly Gly Ala Pro Gln Thr Ile Gln Met Gly Met
340 345 350
Val Tyr Gly Gly Pro Gly Met Gly Pro Pro Gln Met Met Ala Pro Asn
355 360 365
Gly Lys Gly Gly Gly Gly Met Pro Gly Gly Tyr Val Met Gln Pro Gly
370 375 380
Gln Met Met Ala Pro Asn Gly Gln Met Met Pro Val Gly Gln Met Gly
385 390 395 400
Pro Gly Gly Met Met Val Gln Gly Pro Gly Gly Gly Met Met Gln Met
405 410 415
His Asp Gly Gly Met Met Asn Gly Asn Gly Ser Tyr Gly Ser Leu Gln
420 425 430
Asn Met Lys Gln Gly Asn Gly Val Val Met Met Pro Asn Gly Gly Met
435 440 445
Gly Gly Val Asp Gly Ala Ile Pro Asn Met Ala Thr Gly Leu Ile Asn
450 455 460
Gly Gln Gly Leu Pro Asp Asp Asp Val Leu Asp Met Phe Leu Lys Asp
465 470 475 480
Gly Leu Pro Glu Gly Glu Gly Phe
485
<210> 19
<211> 544
<212> PRT
<213> Micromonas pusilla
<400> 19
Met Thr Ala Glu Lys Lys Glu Leu Lys Val Phe Pro Ala Gly Leu Arg
1 5 10 15
Val Leu Val Val Asp Asp Asp Pro Leu Cys Leu Arg Ile Val Glu Lys
20 25 30
Met Leu Lys Arg Cys Gln Tyr Glu Val Thr Thr Phe Ser Arg Gly Ala
35 40 45
Glu Ala Leu Glu Thr Leu Arg Ala Arg Arg Asp Asp Phe Asp Ile Val
50 55 60
Leu Ser Asp Val His Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu
65 70 75 80
His Ile Ala Leu Glu Leu Asp Val Pro Val Met Met Met Ser Ala Asn
85 90 95
Cys Ala Thr Asp Val Val Leu Arg Gly Ile Ile His Gly Ala Val Asp
100 105 110
Tyr Leu Leu Lys Pro Val Arg Leu Glu Glu Leu Arg Asn Ile Trp Gln
115 120 125
His Val Val Arg Arg Gln Arg Glu Pro Ser Lys Asp Gly Ala Ala Gly
130 135 140
Lys Gly Gly Gly Ala Ser Gly Ala Pro Glu Val Ser Gly Asp Thr His
145 150 155 160
Ala Asn Thr Asp Asp Lys Gln Asp Gly Asn Ala Thr Asp Ser Lys Gly
165 170 175
Ser Gly Ser Gln Lys Arg Lys Ser Gly Lys Ser Gly Asp Asp Gly Gly
180 185 190
Lys Asp Gly Gly Gly Ser Gly Gly Lys Asp Gly Asp Ala Ser Asn Lys
195 200 205
Gly Asn Asn Asn Lys Arg Lys Lys Gly Lys Ser Asn Asp Ala Thr Glu
210 215 220
Thr Ala Gly Gly Ala Gly Val Glu Asp Asn Asp Asp Thr Ser Gly Leu
225 230 235 240
Lys Lys Pro Arg Val Val Trp Ser Pro Glu Leu His Gln Gln Phe Val
245 250 255
Thr Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile
260 265 270
Leu Asp Leu Met Gly Val Gln Gly Leu Thr Arg Glu Asn Val Ala Ser
275 280 285
His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Leu Gln Gly Val Asn
290 295 300
Asn Asn Gly Thr Val Pro Ser Gly Ala Ala Gly Phe Met Thr Gly Leu
305 310 315 320
Ala Ile Asp Gly Val Gly Gly Val Met Gly Pro Pro Thr Thr Gly Ser
325 330 335
Pro Ala Met Asn Gly Pro Gly Gly Pro Gly Gly Gly Leu Val Met Gly
340 345 350
Pro Gly His Met Gly Gly Pro His Met Asp Gly Ser Gly Met Met His
355 360 365
Met Gly Pro Gly Gly Pro Met Ala Gly Met Thr Val Val Tyr Gly Gly
370 375 380
Gly Met Pro Gly Gly Met Pro Gly Gly Ala Asp Ser Lys Asn Gly Ala
385 390 395 400
Ser Gly Gln Pro Pro Pro Gly Gly Tyr Val Val Met Gly Gly Pro His
405 410 415
Gly Gly Gly Pro Gly Gly Ala Pro Met Met Met Gln His Gly Gly Met
420 425 430
Val Pro Gly Pro Gly Pro Gly Leu Val Pro Gly Pro Gly Gly Ser Leu
435 440 445
Met Met Pro Ala Gly Met Met Pro Asp Gly Gly Gly Gly Met Val Gly
450 455 460
Val His Val Gly Pro Gly Val Val Met Gly Gln His Gln Leu Gly Gly
465 470 475 480
Lys His Ser Ser Gly Gly Ala Gly Met Ala Gly Gly Ser Ala Ala Gly
485 490 495
Lys Gly Ala Gln Arg Gly Gly Val Gly Gly Ala Phe Asp Val Pro Pro
500 505 510
Thr Asn Gly Ser Leu Asp Ala Asp Glu Ile Gly Asp Asp Val Leu Thr
515 520 525
Met Phe Leu Lys Asp Gly Leu Pro Glu Met Asn Asp Gly Asp Ala Leu
530 535 540
<210> 20
<211> 776
<212> PRT
<213> Sphagnum fallax
<400> 20
Met Ser Gly Gly Asp Leu Ser Arg Val Arg Glu Gly Thr Ala Asp Leu
1 5 10 15
Asp Pro Val Met Ala Ser His Gln His Pro Pro Pro Arg Gln Gln Ser
20 25 30
His Gln Gln Pro Lys Asn His Gln Gln Glu Ala His Gln Gln His Cys
35 40 45
Ser Ser Ala Glu Thr Thr Ser Pro Asn Asn Thr Ala Arg Gly Ala Gly
50 55 60
Ala Thr Tyr Gly Lys Met Glu Pro Ala Asp Asp Phe Pro Ala Gly Leu
65 70 75 80
Arg Ile Leu Val Val Asp Asp Asp Pro Thr Cys Leu Ala Ile Leu Lys
85 90 95
Lys Met Leu Gln Gln Cys Ser Tyr Gln Val Thr Thr Cys Gly Arg Ala
100 105 110
Thr Arg Ala Leu Glu Leu Leu Arg Glu Asp Lys Asp Lys Phe Asp Leu
115 120 125
Val Ile Ser Asp Val Tyr Met Pro Asp Met Asp Gly Phe Lys Leu Leu
130 135 140
Glu Leu Val Gly Leu Glu Met Asp Leu Pro Val Ile Met Met Ser Gly
145 150 155 160
Asn Gly Glu Thr Ser Val Val Met Lys Gly Ile Thr His Gly Ala Cys
165 170 175
Asp Tyr Leu Leu Lys Pro Val Arg Ile Glu Glu Leu Ser Asn Ile Trp
180 185 190
Gln His Val Val Arg Lys Leu Arg Ser Glu Pro Lys Glu His Ser Ala
195 200 205
Ser Leu Glu Asp Gly Asp Arg Gln Arg Arg Gly Gly Ala Glu Asp Ala
210 215 220
Asp Asn Thr Ser Ser Ala Ala Asp Thr Ala Asp Gly Ile Trp Arg Asn
225 230 235 240
Lys Lys Lys Lys Glu Ala Lys Glu Asp Glu Glu Asp Phe Glu Gln Asp
245 250 255
Asn Asp Asp Pro Ser Thr Leu Lys Lys Pro Arg Val Val Trp Ser Val
260 265 270
Glu Leu His Gln Gln Phe Val Ser Ala Val Asn Gln Leu Gly Ile Asp
275 280 285
Lys Ala Val Pro Lys Arg Ile Leu Glu Leu Met Ser Val Gln Gly Leu
290 295 300
Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu
305 310 315 320
Lys Arg Leu Ser Gly Val Thr Ser Gln Ser Asn Ser Leu Asn Val Ser
325 330 335
Phe Gly Gly Pro Asp Ala Gly Tyr Gly Gly Leu Phe Gly Leu Asp Glu
340 345 350
Met Ser Asp Tyr Arg Asn Leu Val Thr Asn Gly His Leu Pro Ala Gln
355 360 365
Thr Ile Ala Ala Leu His His Ala Asn Met Ala Gly Arg Leu Gly Ala
370 375 380
Ser Ser Gly Met Val Gly Pro Ser Ser Pro Leu Asp Pro Ser Val Leu
385 390 395 400
Ala Gln Ile Ala Ala Leu Gln Ser Gly Ser Leu Pro Arg Pro Gly Met
405 410 415
Asp Gly Ser Leu Gln Gly Asn Gln Ala Gly Leu Leu Gln Ser Leu Ser
420 425 430
Gly Ala Leu Asp Tyr Asn Ser Leu His Gln Ser His Leu Leu Pro Ala
435 440 445
Ile Gly Gln Leu Gly Gln Leu Asp Glu Leu Pro Ser Leu Lys Ser Met
450 455 460
Gln His Gln Leu Gly Met Gly Ser Leu Gly Gly Ser Thr Arg Asn Leu
465 470 475 480
Ala Gly Ser Pro Asn Glu Glu Leu Thr Met Gln Leu Leu Gln Gln Arg
485 490 495
Ala Gln Gln Gln Ser Gly Gly Ser Pro Ile Asn Leu Pro Gln Ala Thr
500 505 510
Gly Ile Leu Arg Pro Leu Ser Ser Asn Ile Asn Gln Gly Gly Ser Val
515 520 525
Pro Asn Leu Val Gly Val Ile Pro Gly Thr Ala Ile Gly Leu Ser Asn
530 535 540
Met Cys Ser Gly Gly Arg Glu Phe Gly Ser Ser Ser Gly Leu Leu Ser
545 550 555 560
Ala Ser Gly Ser Leu Met Gln Ser Ser Thr Val Glu Ala Gln Asn Leu
565 570 575
Asn Phe Gly Gly Ser Ser Gly Ser Ser Gly Cys Ser Phe Gln Ala Ser
580 585 590
Val Leu Ser Ser Lys Thr Gly Gly Leu Glu Asp Leu Asn Pro Ala Lys
595 600 605
Arg Val Arg Thr Thr Tyr Ser Ala Leu Ser His Ser Ser Pro Asp Leu
610 615 620
Gly Gln Ser Ser Arg Pro Ala Trp Leu Gly Ser Gln Glu Gly Leu Val
625 630 635 640
His Gly Asp Pro Val Tyr Ser Pro His Gln Leu Ser Leu Pro Arg Gln
645 650 655
Asp Ile Val Gly Gly Ile Gly Ser Ser Gly Arg Pro Ala Tyr Met Gly
660 665 670
Ser Gln Ser Met Gly Ser Leu Gly Met Asn Phe Pro Leu Ser Leu Ala
675 680 685
Val Asp Ala Gly Ala Val Arg Pro Ser Leu Thr Arg Gly Gln Ser Leu
690 695 700
Thr Glu Gln Val Ala Ala Asn Arg Glu Leu Lys Phe Pro Lys Glu Glu
705 710 715 720
Arg Gly Arg Asp Asn Leu Met Cys Ala Arg Leu Gly Gly Gly Met Ile
725 730 735
Thr Asn Glu Ser Ser Ser Glu Glu Leu Leu Asn Tyr Leu Lys Gln Ser
740 745 750
His Glu Gly Leu Gly Phe Met Glu Gly Asp Leu Val Ser Asp Gly Tyr
755 760 765
Pro Val Asp Asn Leu Tyr Val Lys
770 775
<210> 21
<211> 715
<212> PRT
<213> Physcomitrella patens
<400> 21
Met Gly Gly Gly Tyr Leu Ser Ser Thr Val Asn Met Gly Glu Ser Arg
1 5 10 15
Asp Gly Gly Ser Pro Ala Met Ala Thr Leu Gln Gln Gln Gln Lys His
20 25 30
Gln Pro Leu Asn Pro Asn His Gln Asn Pro Arg Asn Arg Ser Asn Ser
35 40 45
Ser Pro Thr Asn Cys Tyr Ser Asn Thr Ala Trp Gly Ala Lys Pro Ala
50 55 60
Lys Leu Asp Thr Pro Asp Glu Phe Pro Val Gly Met Arg Val Leu Val
65 70 75 80
Val Asp Asp Asn Pro Thr Cys Leu Met Ile Leu Glu Gln Met Leu Val
85 90 95
Arg Cys Ala Tyr Arg Val Thr Thr Cys Gly Lys Ala Thr Glu Ala Leu
100 105 110
Ser Met Leu Arg Glu Asp Ile Gly Lys Phe Asp Val Val Ile Ser Asp
115 120 125
Val Asp Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu Leu Val Gly
130 135 140
Leu Glu Met Asp Leu Pro Val Ile Met Val Ser Gly Asn Gly Glu Thr
145 150 155 160
Ser Ala Val Met Lys Gly Ile Thr His Gly Ala Cys Asp Tyr Leu Leu
165 170 175
Lys Pro Val Arg Ile Glu Glu Leu Arg Asn Ile Trp Gln His Val Val
180 185 190
Arg Lys Lys Arg Arg Glu Val Lys Ala Val Ala Thr Lys Ser Val Glu
195 200 205
Glu Ala Gly Gly Cys Glu Arg Pro Lys Arg Gly Gly Gly Ala Asp Asp
210 215 220
Ala Asp Tyr Thr Ser Ser Ala Thr Asp Thr Thr Asp Ser Asn Trp Lys
225 230 235 240
Leu Thr Lys Arg Arg Lys Gly Glu Phe Lys Asp Glu Asn Glu Glu Asp
245 250 255
Asn Glu Gln Glu Asn Asp Asp Pro Ser Thr Leu Lys Arg Pro Arg Val
260 265 270
Val Trp Ser Val Glu Leu His Gln Gln Phe Val Ser Ala Val Asn Gln
275 280 285
Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile Leu Glu Leu Met Gly
290 295 300
Val Gln Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr
305 310 315 320
Arg Leu Tyr Leu Lys Arg Leu Ser Gly Val Thr Ser Gln Gln Gly Asn
325 330 335
Met Ser Ala His Phe Gly Gly Ser Asp Pro Phe Cys Met Met Pro Pro
340 345 350
Asp Met Ser Leu Ala Asn Gly Gln Leu Thr Pro Gln Ala Leu Ala Lys
355 360 365
Phe His Met Leu Gly Arg Met Asn Ala Thr Asn Gly Ile Gly Phe Ser
370 375 380
Gly Gly Gly Leu Asp Pro Gly Met Asn Gln Met Phe Leu Gln Asp Leu
385 390 395 400
Pro Arg Pro Pro Gln Leu Asn Ser Met Leu Arg Asn Asn Thr Gly Leu
405 410 415
Leu Ala Ser Val Pro Asn Gly Leu Gln His Leu Glu Gln Leu Ser Glu
420 425 430
Pro His His Val His Val Val Asn Glu Leu Glu His Tyr Pro Ser Asn
435 440 445
Thr Lys Val Tyr Pro Gln Leu Asn Gly Asn Leu Asp Val Ser Val Gly
450 455 460
Pro Leu Gly Ala Ala Asn Gly Asn Leu Ala Ser Asn Pro Asn Ser Asp
465 470 475 480
Thr Leu Leu Met His Ile Leu His Ser Arg Ala Ser Gln Gln Gly Val
485 490 495
Gly Ser Pro Ser Thr Leu Pro Gln Pro Arg Cys Gly Leu Asn Pro Thr
500 505 510
His Leu Leu Ser Asn Asp Ile Asn Phe Ala Pro Val Gly Ser Leu Pro
515 520 525
Asn Leu Ala Gly Ser Leu Gly Pro Ala Val Gly Leu Ser Ala Ile Pro
530 535 540
Gly Ser Ala Gly Gly Arg Asp Leu Ser Pro Ser Val Gly Gly Ser Gly
545 550 555 560
Ala Ser Leu Ser Ser Pro Leu Gly Ser Leu Val Arg Arg Pro Leu Met
565 570 575
Ala Glu Glu Gln Ser Asn Pro Val Asn Ser Thr Asn Gly Thr Tyr Ser
580 585 590
Met Ala His Ser Gly Gln Ser Pro Lys Pro Ser Gly Asp Thr Leu Pro
595 600 605
Thr Pro Leu Asn Glu Gly Leu Glu Gln Gln Gln Pro Leu Trp Ala Leu
610 615 620
Tyr Gln Asn Pro Met Asn Gln Leu Ser His Gly Pro Ser Gln Gly Phe
625 630 635 640
Pro His Asp Ser Leu Gln Trp Ser Val Leu Thr Glu Asn Leu Ser Phe
645 650 655
Gly Asp Met Gly Gln Ser Leu Ser Ala Gly Leu Ile Ser Gln Phe Ser
660 665 670
Ser Gln Gly Gln Asp Asn Gly Ile Gly Phe Ala Pro Pro Ser Gln Arg
675 680 685
Gly Ser Tyr Thr Arg Gln Ser Val Ser Phe Pro Ala Ser Ser Ala Leu
690 695 700
Asp Gly Arg Met Val Arg Ser Ser Tyr Glu Pro
705 710 715
<210> 22
<211> 664
<212> PRT
<213> Arabidopsis thaliana
<400> 22
Met Val Asn Pro Gly His Gly Arg Gly Pro Asp Ser Gly Thr Ala Ala
1 5 10 15
Gly Gly Ser Asn Ser Asp Pro Phe Pro Ala Asn Leu Arg Val Leu Val
20 25 30
Val Asp Asp Asp Pro Thr Cys Leu Met Ile Leu Glu Arg Met Leu Met
35 40 45
Thr Cys Leu Tyr Arg Val Thr Lys Cys Asn Arg Ala Glu Ser Ala Leu
50 55 60
Ser Leu Leu Arg Lys Asn Lys Asn Gly Phe Asp Ile Val Ile Ser Asp
65 70 75 80
Val His Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu His Val Gly
85 90 95
Leu Glu Met Asp Leu Pro Val Ile Met Met Ser Ala Asp Asp Ser Lys
100 105 110
Ser Val Val Leu Lys Gly Val Thr His Gly Ala Val Asp Tyr Leu Ile
115 120 125
Lys Pro Val Arg Ile Glu Ala Leu Lys Asn Ile Trp Gln His Val Val
130 135 140
Arg Lys Lys Arg Asn Glu Trp Asn Val Ser Glu His Ser Gly Gly Ser
145 150 155 160
Ile Glu Asp Thr Gly Gly Asp Arg Asp Arg Gln Gln Gln His Arg Glu
165 170 175
Asp Ala Asp Asn Asn Ser Ser Ser Val Asn Glu Gly Asn Gly Arg Ser
180 185 190
Ser Arg Lys Arg Lys Glu Glu Glu Val Asp Asp Gln Gly Asp Asp Lys
195 200 205
Glu Asp Ser Ser Ser Leu Lys Lys Pro Arg Val Val Trp Ser Val Glu
210 215 220
Leu His Gln Gln Phe Val Ala Ala Val Asn Gln Leu Gly Val Asp Lys
225 230 235 240
Ala Val Pro Lys Lys Ile Leu Glu Met Met Asn Val Pro Gly Leu Thr
245 250 255
Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr Arg Ile Tyr Leu Arg
260 265 270
Arg Leu Gly Gly Val Ser Gln His Gln Gly Asn Met Asn His Ser Phe
275 280 285
Met Thr Gly Gln Asp Gln Ser Phe Gly Pro Leu Ser Ser Leu Asn Gly
290 295 300
Phe Asp Leu Gln Ser Leu Ala Val Thr Gly Gln Leu Pro Pro Gln Ser
305 310 315 320
Leu Ala Gln Leu Gln Ala Ala Gly Leu Gly Arg Pro Thr Leu Ala Lys
325 330 335
Pro Gly Met Ser Val Ser Pro Leu Val Asp Gln Arg Ser Ile Phe Asn
340 345 350
Phe Glu Asn Pro Lys Ile Arg Phe Gly Asp Gly His Gly Gln Thr Met
355 360 365
Asn Asn Gly Asn Leu Leu His Gly Val Pro Thr Gly Ser His Met Arg
370 375 380
Leu Arg Pro Gly Gln Asn Val Gln Ser Ser Gly Met Met Leu Pro Val
385 390 395 400
Ala Asp Gln Leu Pro Arg Gly Gly Pro Ser Met Leu Pro Ser Leu Gly
405 410 415
Gln Gln Pro Ile Leu Ser Ser Ser Val Ser Arg Arg Ser Asp Leu Thr
420 425 430
Gly Ala Leu Ala Val Arg Asn Ser Ile Pro Glu Thr Asn Ser Arg Val
435 440 445
Leu Pro Thr Thr His Ser Val Phe Asn Asn Phe Pro Ala Asp Leu Pro
450 455 460
Arg Ser Ser Phe Pro Leu Ala Ser Ala Pro Gly Ile Ser Val Pro Val
465 470 475 480
Ser Val Ser Tyr Gln Glu Glu Val Asn Ser Ser Asp Ala Lys Gly Gly
485 490 495
Ser Ser Ala Ala Thr Ala Gly Phe Gly Asn Pro Ser Tyr Asp Ile Phe
500 505 510
Asn Asp Phe Pro Gln His Gln Gln His Asn Lys Asn Ile Ser Asn Lys
515 520 525
Leu Asn Asp Trp Asp Leu Arg Asn Met Gly Leu Val Phe Ser Ser Asn
530 535 540
Gln Asp Ala Ala Thr Ala Thr Ala Thr Ala Ala Phe Ser Thr Ser Glu
545 550 555 560
Ala Tyr Ser Ser Ser Ser Thr Gln Arg Lys Arg Arg Glu Thr Asp Ala
565 570 575
Thr Val Val Gly Glu His Gly Gln Asn Leu Gln Ser Pro Ser Arg Asn
580 585 590
Leu Tyr His Leu Asn His Val Phe Met Asp Gly Gly Ser Val Arg Val
595 600 605
Lys Ser Glu Arg Val Ala Glu Thr Val Thr Cys Pro Pro Ala Asn Thr
610 615 620
Leu Phe His Glu Gln Tyr Asn Gln Glu Asp Leu Met Ser Ala Phe Leu
625 630 635 640
Lys Gln Glu Gly Ile Pro Ser Val Asp Asn Glu Phe Glu Phe Asp Gly
645 650 655
Tyr Ser Ile Asp Asn Ile Gln Val
660
<210> 23
<211> 1036
<212> PRT
<213> Arabidopsis halleri
<400> 23
Leu Ser Lys Lys Gln Asn Glu Asp Ala Ser Gly Arg Lys Glu Glu Asp
1 5 10 15
Gly Lys Gly Asn Glu His Asn Gly Met Glu Ser Cys Thr Arg Met Lys
20 25 30
Arg Thr Val Trp Thr Val Glu Leu His Gln Lys Phe Val Asn Ala Phe
35 40 45
Gln Gln Leu Gly Leu Asp Lys Ala Ser Pro Glu Gln Ile His Ala Leu
50 55 60
Met Asn Val Glu Gly Leu Pro Val Ile Asn Val Ala Ser His Leu Gln
65 70 75 80
Lys Tyr Arg Leu Phe Leu Lys Lys Ile Tyr Glu Gly Gln Gln Leu Asp
85 90 95
Met Ala Thr Ile Gln Leu Leu Leu Ser Ala Gly Ser His Phe Pro Gln
100 105 110
Thr Pro Trp Thr Asn His Cys Ser Ser Phe Ile Gln Gln Gly His His
115 120 125
Gln Asn Ser Ser Asn Ser Ser Glu Thr Tyr His Thr Thr Leu Ser Pro
130 135 140
Arg Val Gln Lys Val Asn Thr Phe Gln Pro Ser Ser Ser Pro Leu Lys
145 150 155 160
Pro Leu Leu Phe Pro Lys Ser Asn Ile Ser Ala Phe Lys Glu Asp Phe
165 170 175
Lys Ser Ile Lys Glu Pro Ala Ile Val Gly Asp Ser Ser Leu Asp Ser
180 185 190
Ser Lys Pro Arg Asn Ser Phe Gln Thr Ala Ser Lys Phe Pro Lys Thr
195 200 205
Asp Pro Cys Thr Gly Ser Tyr Ile Ile Glu Ile Met Thr Glu Pro Tyr
210 215 220
Tyr Gly Lys Ser Ser Arg Arg His Ser Asn Phe Ser Ala Tyr Met Gly
225 230 235 240
Asp Phe Lys Ser Ile Lys Asp Pro Glu Ile Val Gln Glu Ser Arg Thr
245 250 255
Arg Lys Asn His Gly Arg Val Val Trp Ser His Glu Leu His Gln Lys
260 265 270
Phe Leu Asn Ala Ile Asp Gln Leu Gly Gly Asn Glu Lys Ala Ile Pro
275 280 285
Lys Lys Ile Leu Ala Val Met Asn Val Glu Gly Leu Thr Arg Leu Asn
290 295 300
Val Ala Thr His Leu Gln Lys Tyr Arg Gln Cys Cys Ser Ala Glu Ala
305 310 315 320
Gln Gln Leu Asn Met Ala Thr Arg Lys Leu Pro Ser Ser Glu His Leu
325 330 335
Pro Gln Ser Pro Ser Thr Asn His His Ser Ser Leu Ser Pro Arg Val
340 345 350
Gln Asp Val Asn Ile Arg Leu Trp Ser Ser Ser Pro Lys Arg Gln Asp
355 360 365
Gln Ile Leu Val Tyr Val Leu Phe Ser Phe Glu Asn Asp Asn Gly Arg
370 375 380
Glu Glu Thr Thr Cys Arg Arg Ile Ala Ser Thr Met Glu Leu Gly Ser
385 390 395 400
Thr Glu Asp Gly Arg His Asp Lys Phe Pro Val Gly Met Arg Val Leu
405 410 415
Ala Val Asp Asp Asn Pro Thr Cys Leu Arg Lys Leu Glu Glu Leu Leu
420 425 430
Leu Arg Cys Lys Tyr His Val Thr Lys Thr Met Glu Ser Arg Lys Ala
435 440 445
Leu Glu Leu Leu Arg Glu Asn Ser Asn Met Phe Asp Leu Val Ile Ser
450 455 460
Asp Val Glu Met Pro Asp Thr Asp Gly Phe Lys Leu Leu Glu Ile Gly
465 470 475 480
Leu Glu Met Asp Leu Pro Val Ile Met Leu Ser Ala His Ser Asp Tyr
485 490 495
Asp Ser Val Met Lys Gly Ile Ile His Gly Ala Cys Asp Tyr Leu Val
500 505 510
Lys Pro Val Gly Leu Lys Glu Leu Gln Asn Ile Trp His His Val Val
515 520 525
Lys Lys Asn Ile Lys Ser Tyr Ala Lys Asn Ile Gly Pro Ser Arg Gln
530 535 540
Leu Leu Pro Pro Ser Glu Ser Asn Leu Val Pro Ser Ala Ser Lys Lys
545 550 555 560
Arg Lys Glu Lys Ala Ser Asp Ser Gly Asp Glu Asp Asp Ser Asp Arg
565 570 575
Glu Glu Asp Asp Gly Glu Gly Ser Glu Gln Asp Gly Glu Glu Ser Gly
580 585 590
Thr Arg Lys Lys Pro Arg Val Val Trp Ser Gln Glu Leu His Gln Lys
595 600 605
Phe Val Ser Ala Val Gln Gln Leu Gly Leu Asp Lys Ala Val Pro Lys
610 615 620
Lys Ile Leu Asp Leu Met Ser Ile Glu Gly Leu Thr Arg Glu Asn Val
625 630 635 640
Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu Lys Lys Ile Asp Glu
645 650 655
Gly Gln Gln Gln Asn Met Thr Pro Asp Ala Phe Gly Thr Arg Asp Ser
660 665 670
Ser Tyr Phe Gln Met Ala Gln Leu Asp Gly Leu Arg Asp Phe Thr Ala
675 680 685
Thr Arg Gln Ile Pro Ser Ser Gly Leu Leu Ser Arg Ser His Leu Thr
690 695 700
Lys Leu Gln Pro Pro Met Tyr Ser Ser Ile Asn Leu Gln Gly Met Asn
705 710 715 720
Ser Ser Ser Phe Ile Gln Gln Gly His His His Asn Ser Ser Asn Ser
725 730 735
Ala Asn Pro Phe Gly Thr Tyr His Thr Thr Leu Ser Pro Arg Ile Gln
740 745 750
Asn Val Asn Leu Leu Gln Arg Thr Ser Ser Pro Leu Glu Thr Leu Gln
755 760 765
Phe Pro Arg Ser Lys Ser Tyr Ile Gly Asp Phe Lys Gly Ile Gly Asp
770 775 780
Arg Ala Val Gly Gly Ser Phe Leu Asp Ser Cys Met Pro Phe Gly Ser
785 790 795 800
Ser Ser Thr Ser Leu Pro Ser Ala Ser Thr Asn Thr Leu Met Leu Gln
805 810 815
Ala Asn Tyr Thr Gln Pro Leu His Ile Ala Ser Asp Gly Asn Gln Pro
820 825 830
Cys Ile Glu Gly Thr Pro Ser Asn Ser Ala Ser Pro Asn Ile Ser Phe
835 840 845
Gln Gly Leu Ser Arg Phe Pro Ser His Ser Trp Gln Gly Asn Leu Asn
850 855 860
Thr Thr Arg Phe Pro Pro Ser Ser Leu Pro Leu Asn Gln Ala Phe Leu
865 870 875 880
Pro Asp Gln Val Thr Cys Ala Gly Asn Asn Leu Gly Asp Cys Thr Ser
885 890 895
Leu Val Ser Ala Gly Asn Pro Gly Gly Glu Met Gln Cys Glu Pro Gln
900 905 910
Leu Leu Gly Gly Phe Met Gln Asn Met Asn Pro Leu Asp Gly Gln Lys
915 920 925
Trp Glu Gln Gln Asn Ser Met Leu Asn Asn Pro Phe Gly Asn Ile Glu
930 935 940
Tyr Pro Leu Ser Ala Asp Asn Met Val Phe Arg Asp Asn Asn Ala Thr
945 950 955 960
Arg Asn Lys Gly Leu Asp Glu Ser Leu Met Asn Pro Ile Asp Asn Ser
965 970 975
Gln Glu Tyr Val Gly Lys Ala Thr Thr Met Leu Asp Pro Glu Met Lys
980 985 990
Ser Gly Lys Pro Glu Asn Asp Asn Gln His Asp Val Phe Asp Asp Ile
995 1000 1005
Met Asn Glu Met Met Lys Gln Glu Glu Asn Asn Gly Met Val Ser
1010 1015 1020
Val Ala Thr Arg Phe Gly Phe Asp Ser Phe Pro Pro Pro
1025 1030 1035
<210> 24
<211> 774
<212> PRT
<213> Arabidopsis lyrata
<400> 24
Met Gly Asp Phe Lys Ser Ile Lys Glu Pro Glu Ile Val Gln Glu Ser
1 5 10 15
Arg Thr Arg Lys Asn His Gly Arg Val Val Trp Ser His Glu Leu His
20 25 30
Gln Lys Phe Leu His Ala Ile Asp Gln Leu Gly Gly Asn Asp Lys Ala
35 40 45
Ile Pro Lys Lys Ile Leu Ala Val Met Asn Val Glu Gly Leu Thr Arg
50 55 60
Leu Asn Val Ala Thr His Leu Gln Lys Tyr Arg Gln Cys Cys Ser Thr
65 70 75 80
Glu Ala Gln Gln Leu Asn Met Ala Thr Arg Lys Leu Pro Ser Ser Glu
85 90 95
His Leu Pro Gln Ser Pro Ser Thr Asn His His Ser Ser Leu Ser Pro
100 105 110
Arg Val Gln Asp Asn Asp Asn Gly Arg Glu Glu Thr Thr Cys Arg Arg
115 120 125
Ile Ala Ser Thr Met Glu Leu Gly Ser Thr Glu Asp Gly Arg His Asp
130 135 140
Lys Phe Pro Val Gly Met Arg Val Leu Ala Val Asp Asp Asn Pro Thr
145 150 155 160
Cys Leu Arg Lys Leu Glu Glu Leu Leu Leu Arg Cys Lys Tyr His Val
165 170 175
Thr Lys Thr Met Glu Ser Arg Lys Ala Leu Glu Leu Leu Arg Glu Asn
180 185 190
Ser Asn Met Phe Asp Leu Val Ile Ser Asp Val Glu Met Pro Asp Thr
195 200 205
Asp Gly Phe Lys Leu Leu Glu Ile Gly Leu Glu Met Asp Leu Pro Val
210 215 220
Ile Met Leu Ser Ala His Ser Asp Tyr Asp Ser Val Met Lys Gly Ile
225 230 235 240
Ile His Gly Ala Cys Asp Tyr Leu Val Lys Pro Val Gly Leu Lys Glu
245 250 255
Leu Gln Asn Ile Trp His His Val Val Lys Lys Asn Ile Lys Ser Tyr
260 265 270
Ala Lys Asn Ile Gly Pro Ser Arg Gln Leu Leu Pro Pro Ser Glu Ser
275 280 285
Asn Leu Val Pro Ser Ala Ser Lys Lys Arg Lys Glu Lys Ala Asn Asp
290 295 300
Ser Gly Asp Glu Asp Asp Ser Asp Arg Glu Glu Asp Asp Gly Glu Gly
305 310 315 320
Ser Glu Gln Asp Gly Asp Glu Ala Gly Thr Arg Lys Lys Pro Arg Val
325 330 335
Val Trp Ser Gln Glu Leu His Gln Lys Phe Val Ser Ala Val Gln Gln
340 345 350
Leu Gly Leu Asp Lys Ala Val Pro Lys Lys Ile Leu Asp Leu Met Ser
355 360 365
Ile Glu Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr
370 375 380
Arg Leu Tyr Leu Lys Lys Ile Asp Glu Gly Gln Gln Gln Asn Met Thr
385 390 395 400
Pro Asp Ala Phe Gly Thr Arg Asp Ser Ser Tyr Phe Gln Met Ala Gln
405 410 415
Leu Asp Gly Leu Arg Asp Phe Thr Ala Thr Arg Gln Ile Pro Ser Ser
420 425 430
Gly Leu Leu Ser Arg Ser His Leu Thr Lys Leu Gln Pro Pro Met Tyr
435 440 445
Ser Ser Ile Asn Leu Gln Gly Met Asn Ser Ser Ser Phe Ile Gln Gln
450 455 460
Gly His His His Asn Ser Ser Asn Ser Ala Asn Pro Phe Gly Thr Tyr
465 470 475 480
His Thr Thr Leu Ser Pro Arg Ile Gln Asn Val Asn Leu Phe Gln Arg
485 490 495
Thr Ser Ser Pro Leu Glu Thr Leu Gln Phe Pro Arg Ser Lys Ser Tyr
500 505 510
Ile Gly Asp Phe Lys Gly Ile Gly Asp Arg Ala Val Gly Gly Ser Phe
515 520 525
Leu Asp Ser Cys Met Pro Phe Gly Ser Ser Ser Thr Ser Leu Pro Ser
530 535 540
Ala Ser Thr Asn Thr Leu Met Leu Gln Ala Asn Tyr Thr Gln Pro Leu
545 550 555 560
His Ile Ser Ser Asp Gly Asn Gln Pro Cys Ile Glu Gly Thr Pro Ser
565 570 575
Asn Ser Ala Ser Pro Asn Ile Ser Phe Gln Gly Leu Ser Arg Phe Pro
580 585 590
Ser His Ser Trp Gln Gly Asn Leu Asn Thr Thr Arg Phe Pro Pro Ser
595 600 605
Ser Leu Pro Leu Asn Pro Ala Phe Leu Pro Asp Gln Val Thr Cys Ala
610 615 620
Gly Asn Asn Leu Gly Asp Cys Thr Ser Leu Val Ser Ala Gly Asn Pro
625 630 635 640
Gly Gly Glu Ile Gln Cys Glu Pro Gln Leu Leu Gly Gly Phe Met Gln
645 650 655
Asn Met Asn Pro Leu Asp Gly Gln Lys Trp Glu Gln Gln Asn Cys Thr
660 665 670
Met Leu Asn Asn Pro Phe Gly Asn Ile Glu Tyr Pro Leu Pro Ala Asp
675 680 685
Asn Met Val Phe Arg Asp Asn Asn Ala Thr Arg Ser Lys Gly Leu Asp
690 695 700
Glu Ser Leu Met Asn Pro Ile Asp Asn Ser Gln Glu Tyr Val Gly Lys
705 710 715 720
Ala Thr Thr Met Leu Asp Pro Glu Met Lys Ser Gly Lys Pro Glu Asn
725 730 735
Asp Asn Gln His Asp Val Phe Asp Asp Leu Met Asn Glu Met Met Lys
740 745 750
Gln Glu Glu Asn Asn Gly Met Val Ser Val Ala Thr Arg Phe Gly Phe
755 760 765
Asp Ser Phe Pro Pro Pro
770
<210> 25
<211> 578
<212> PRT
<213> Helianthus annuus
<400> 25
Met Thr Thr Gly Ser Ser Phe Gly Ser Gly Ser Leu Gly Cys Lys Gln
1 5 10 15
Glu Thr Gly Val Pro Asp Gln Phe Pro Ala Gly Leu Arg Val Leu Val
20 25 30
Val Asp Asp Asp Val Ile Cys Leu Lys Ile Leu Glu Gln Met Leu Arg
35 40 45
Arg Cys Ser Tyr His Val Thr Thr Cys Ser Gln Ala Thr Ala Ala Leu
50 55 60
Asn Leu Leu Arg Glu Arg Lys Gly Cys Phe Asp Val Val Leu Ser Asp
65 70 75 80
Val His Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu Leu Val Gly
85 90 95
Leu Glu Met Asp Leu Pro Val Ile Met Met Ser Ala Asp Gly Arg Thr
100 105 110
Asn Leu Val Leu Arg Gly Ile Arg His Gly Ala Cys Asp Tyr Leu Ile
115 120 125
Lys Pro Ile Arg Glu Glu Gln Leu Lys Asn Ile Trp Gln His Val Ile
130 135 140
Arg Lys Lys Trp Asn Glu Asn Lys Glu His Glu His Ser Gly Ser Val
145 150 155 160
Asp Asp Lys Asp Arg His Lys Arg Gly Gly Asp Asp Asn Asp Tyr Ala
165 170 175
Ser Ser Val Asn Glu Gly Gly Asp Gly Ile Leu Thr Ser His Lys Lys
180 185 190
Lys Arg His Asn Asn Lys Glu Glu Asp Asp Gly Glu Leu Glu Thr Asp
195 200 205
Glu Pro Gly Gly Ser Lys Lys Ala Arg Val Val Trp Ser Val Glu Leu
210 215 220
His Gln Gln Phe Val Thr Ala Val Asn Gln Leu Gly Ile Asp Lys Ala
225 230 235 240
Val Pro Lys Arg Ile Leu Glu Leu Met Asn Val Pro Gly Leu Thr Arg
245 250 255
Glu Asn Val Ala Ser His Leu Gln Lys Phe Arg Leu Tyr Leu Lys Arg
260 265 270
Leu Ser Gly Val Ala Gln Gln Gly Gly Gly Pro Asn Ser Phe Cys Gly
275 280 285
Ser Ile Asp Gln Asn Pro Lys Leu Ala Ser Tyr Ala Arg Phe Glu Ile
290 295 300
Gln Ala Leu Ala Ala Ser Gly Gln Ile Pro Pro Gln Thr Leu Val Ala
305 310 315 320
Leu His Ala Glu Leu Leu Gly Gln Pro Thr Ala Asn Val Gly Met Pro
325 330 335
Val Leu Asp His Gln Pro Leu Met Gln Pro Ser Lys Cys Gly Pro Val
340 345 350
Asp His Val Met Ser Tyr Gly Gln Thr Leu Pro Ser Asn Val Thr Lys
355 360 365
Gln Val Pro Gln Pro Ala Ile Glu Asp Val His Ser Gly Leu Gly Ala
370 375 380
Trp His Ser Asn Asn Met Val Gly Gly Tyr Gly Gln Leu Gly Gly Gln
385 390 395 400
Asn Trp His Asn Met Leu Leu Gly Met Leu Gln Ser Gln Ser His Gln
405 410 415
Leu Gln Lys Gln Ser Ile Thr Val Gln Pro Ser Arg Leu Val Val Pro
420 425 430
Ser Gln Ser Ser Asn Phe Gln Ala Val Asn Asn Gly Val Pro Val Asn
435 440 445
Gln Thr Thr Gly Phe Asn Asn Ser Thr Val Ile Asn Tyr Ala Val Gly
450 455 460
Gln Arg Thr Glu Arg Asp Val Glu Asn Gln Ile Gly Gly Gln Ser Ser
465 470 475 480
Val Ser Asn Ile Ser Val Lys Glu Met Gly Glu Lys Gln Ile Ser Phe
485 490 495
Gly Glu Ser Val His Val Leu Asp Gln Gly Ser Leu Arg Asn Leu Gly
500 505 510
Phe Val Gly Lys Lys Ser Ser Ile Pro Ser Arg Phe Ala Val Tyr Glu
515 520 525
Ala Ala Glu Ser Leu Thr His Asn Leu Asn Tyr Gly Asp Asn Asn Gly
530 535 540
Glu Arg Arg Val Lys Gln Glu Pro Asn Ile Glu Phe Leu Glu Asn Ser
545 550 555 560
Lys Ala Gly Ala His Arg Val Ser Gln Asn Asp Leu Met Ser Lys Gln
565 570 575
Val Arg
<210> 26
<211> 428
<212> PRT
<213> Vitis vinifera
<400> 26
Met Ala Ala Leu Leu Lys Val Pro Pro Gln Ser Ser Gly Gly Thr Asn
1 5 10 15
Gly Ser Cys Lys Ala Asp Val Val Val Ser Asp Gln Phe Pro Ala Gly
20 25 30
Leu Arg Val Leu Val Val Asp Asp Asp Val Thr Cys Leu Lys Ile Leu
35 40 45
Glu Gln Met Leu Arg Arg Cys Leu Tyr His Val Thr Thr Cys Ser Gln
50 55 60
Ala Thr Ile Ala Leu Asn Ile Leu Arg Glu Lys Lys Gly Cys Phe Asp
65 70 75 80
Ile Val Leu Ser Asp Val His Met Pro Asp Met Asp Gly Tyr Lys Leu
85 90 95
Leu Glu His Val Gly Leu Glu Met Asp Leu Pro Val Ile Met Met Ser
100 105 110
Ala Asp Gly Arg Thr Ser Ala Val Met Arg Gly Ile Arg His Gly Ala
115 120 125
Cys Asp Tyr Leu Ile Lys Pro Ile Arg Glu Glu Glu Leu Lys Asn Ile
130 135 140
Trp Gln His Val Val Arg Lys Lys Trp Asn Glu Asn Lys Glu His Glu
145 150 155 160
His Ser Gly Ser Leu Glu Asp Asn Asp Arg His Lys Arg Gly Gly Glu
165 170 175
Asp Ala Glu Tyr Ala Ser Ser Val Asn Glu Gly Ala Glu Gly Ile Leu
180 185 190
Lys Gly Gln Lys Lys Arg Arg Asp Ser Lys Asp Glu Asp Asp Gly Glu
195 200 205
Leu Glu Asn Glu Asp Pro Ser Thr Ser Lys Lys Pro Arg Val Val Trp
210 215 220
Ser Val Glu Leu His Gln Gln Phe Val Ser Ala Val Asn Gln Leu Gly
225 230 235 240
Ile Asp Lys Ala Val Pro Lys Arg Ile Leu Glu Leu Met Asn Val Pro
245 250 255
Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Phe Arg Leu
260 265 270
Tyr Leu Lys Arg Leu Ser Gly Val Ala Gln Gln Gln Gly Gly Ile Pro
275 280 285
Asn Ser Phe Cys Gly Pro Val Glu Pro Asn Val Lys Leu Gly Ser Leu
290 295 300
Gly Arg Phe Asp Ile Gln Ala Leu Ala Ala Ser Gly Gln Ile Pro Pro
305 310 315 320
Gln Thr Leu Ala Ala Leu Gln Ala Glu Leu Leu Gly Arg Pro Thr Ser
325 330 335
Asn Leu Val Leu Pro Ala Met Asp Gln Pro Ala Leu Leu Gln Ala Ser
340 345 350
Leu Gln Gly Pro Lys Cys Ile Pro Val Glu His Gly Val Ala Phe Gly
355 360 365
Gln Pro Leu Val Lys Cys Gln Thr Asn Ile Ser Lys His Phe Pro Pro
370 375 380
Thr Val Val Ser Thr Glu Asp Val Pro Ser Gly Phe Gly Ala Trp Pro
385 390 395 400
Ser Asn Ser Leu Gly Thr Val Gly Thr Ser Gly Ser Leu Gly Gly Leu
405 410 415
Ser Ala Gln Asn Asn Asn Ile Leu Met Asp Met Lys
420 425
<210> 27
<211> 659
<212> PRT
<213> Amborella trichopoda
<400> 27
Met Ala Asn Val Gln Lys Leu Pro His Ser Ser Ile Ser Thr Ala Ser
1 5 10 15
Ser Tyr Gly Ser Cys Arg Gly Glu Gly Val Pro Asp Gln Phe Pro Ala
20 25 30
Gly Leu Arg Val Leu Val Val Asp Asp Asp Thr Thr Cys Leu Arg Ile
35 40 45
Leu Glu Gln Met Leu Arg Lys Cys Met Tyr Lys Val Thr Thr Cys Cys
50 55 60
Arg Ala Thr Asp Ala Leu Asp Thr Leu Arg Gly Ser Lys Gly Cys Phe
65 70 75 80
Asp Val Val Ile Ser Asp Val Tyr Met Pro Asp Met Asp Gly Phe Lys
85 90 95
Leu Leu Glu His Val Gly Leu Glu Met Asp Leu Pro Val Ile Met Met
100 105 110
Ser Ala Asp Ala Arg Phe Ser Ala Val Met Lys Gly Ile Lys His Gly
115 120 125
Ala Cys Asp Tyr Leu Ile Lys Pro Val Arg Ile Glu Glu Leu Lys Asn
130 135 140
Ile Trp Gln His Val Val Arg Lys Lys Trp Asn Glu Thr Lys Glu His
145 150 155 160
Asp Gln Ser Gly Ser Ile Glu Asp Asn Glu Arg His Lys Arg Gly Ser
165 170 175
Asp Asp Ala Glu Tyr Ala Ser Ser Val Asn Glu Gly Thr Asp Gly Asn
180 185 190
Trp Lys Val Gln Lys Lys Arg Lys Asp Ser Lys Glu Glu Glu Asp Asp
195 200 205
Gly Glu Gln Glu Asn Glu Asp Pro Ser Ala Ala Lys Lys Pro Arg Val
210 215 220
Val Trp Ser Val Glu Leu His Gln Gln Phe Val Asn Ala Val Asn Gln
225 230 235 240
Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile Leu Glu Leu Met Asn
245 250 255
Val Gln Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Phe
260 265 270
Arg Leu Tyr Leu Lys Arg Leu Ser Gly His Gln Ala Gly Val Ser Ser
275 280 285
Ser Phe Cys Gly Ser Val Asp Pro Asn Ser Lys Leu Gly Pro Leu Ser
290 295 300
Gln Leu Asp Ile Arg Ala Leu Thr Ala Ser Gly Gln Ile Pro Ser Gln
305 310 315 320
Thr Leu Ala Ala Leu Gln Ala Glu Leu Leu Gly Arg Pro Ser Asn Asn
325 330 335
Val Ala Met Pro Val Tyr Gly Gln Thr Leu Val Lys Cys Gln Pro Asn
340 345 350
Leu Pro Lys Gln Phe Pro Gln Pro Asn Leu Pro Val Asp Asp Val Gln
355 360 365
Ser Ser Leu Ser Ile Trp Gln His His Leu Ser Ser Gly Met Pro Leu
370 375 380
Gly Gly Leu Asn Pro Gln Asn Asn Gly Leu Leu Met Gln Gln Gln Gln
385 390 395 400
Gln Leu Thr Ile Glu Ser Asn Arg Pro Cys Asn Val Gln Pro Ser Cys
405 410 415
His Val Ala Pro Ser Asn Gly Gly Phe Thr Met Arg Asn Asn Pro Thr
420 425 430
Ser Ser Asn Ala Ser Ser Val Glu Tyr Asn Ser Leu Leu Ser Ser Gln
435 440 445
Gly Asp Val Gly Gln Ile Ser Gln Ala Ser Gly Ser Asp Leu Ala Thr
450 455 460
Thr Val Gln Ser Asn Gly Gly Phe Lys Ser Leu Asp Tyr Arg Asn Met
465 470 475 480
Gly Gln Val Ser Leu Glu Ser Thr Ser Asp Leu Val Ser Thr Gln Asn
485 490 495
Asn Gly Phe Lys Gly Met Glu Leu Arg Asn Val Gly Ser Leu Gly Gly
500 505 510
Tyr Pro Leu Ser Ser Ser Val Ser Ala Gly Ser Thr Lys Thr Glu Asn
515 520 525
Gly Gln Ser Phe Ser Gln Val Arg Thr Gly Pro Arg Met Ser Met Gly
530 535 540
Pro Thr Gly Gln Phe Val Gly Pro Pro Thr Ile Arg Arg Leu Pro Met
545 550 555 560
Val Asp Gly Gly Thr His Arg Asn Ser Leu Gly Phe Val Gly Lys Gly
565 570 575
Val Ser Ile Pro Ser Arg Phe Met Pro Asp Ser Gly Ser Pro Thr Gly
580 585 590
Val Gly Glu Glu Cys Thr Leu Pro Lys Gln Glu Val Asp Pro Asp Phe
595 600 605
Phe Asp Ser Leu Lys Val Gly Pro Val Gly Val Gln His Tyr Ala Ser
610 615 620
Gly Asp Leu Met Ser Val Leu Ser Lys Gln Gln Gln Ala Ser Thr Gly
625 630 635 640
Asn Leu Asp Cys Glu Phe Gly Ile Asp Gly Tyr Gln Leu Gly Asn Ile
645 650 655
His Val Lys
<210> 28
<211> 669
<212> PRT
<213> Ricinus communis
<400> 28
Met Ala Ala Leu Gln Arg Val Ala Ser Ser Val Ser Ala Thr Ala Ser
1 5 10 15
Asn Tyr Ser Ser Cys Lys Gly Asn Gly Val Val Thr Ala Thr Ala Asp
20 25 30
Val Ala Val Ser Asp Gln Phe Pro Ala Gly Leu Arg Val Leu Val Val
35 40 45
Asp Asp Asp Thr Thr Cys Leu Arg Ile Leu Glu Gln Met Leu Arg Arg
50 55 60
Cys Leu Tyr His Val Thr Thr Cys Ser Gln Ala Lys Val Ala Leu Asn
65 70 75 80
Leu Leu Arg Glu Arg Lys Gly Cys Phe Asp Val Val Leu Ser Asp Val
85 90 95
His Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu His Val Gly Leu
100 105 110
Glu Met Asp Leu Pro Val Ile Met Met Ser Ala Asp Gly Arg Thr Ser
115 120 125
Ala Val Met Arg Gly Ile Arg His Gly Ala Cys Asp Tyr Leu Ile Lys
130 135 140
Pro Ile Arg Glu Glu Glu Leu Lys Asn Ile Trp Gln His Val Val Arg
145 150 155 160
Lys Lys Trp His Glu Asn Lys Glu Ile Glu His Ser Gly Ser Leu Glu
165 170 175
Asp Asn Asp Arg His Lys Arg Gly Asn Glu Asp Ala Glu Tyr Thr Ser
180 185 190
Ser Val Asn Glu Gly Thr Glu Gly Val Leu Lys Gly Gln Lys Arg Arg
195 200 205
Ser Asn Ser Lys Asp Glu Asp Asp Gly Glu Pro Asp Ser Asp Asp Pro
210 215 220
Ser Thr Ser Lys Lys Pro Arg Val Val Trp Ser Val Glu Leu His Gln
225 230 235 240
Gln Phe Val Ser Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro
245 250 255
Lys Arg Ile Leu Glu Leu Met Asn Val Pro Gly Leu Thr Arg Glu Asn
260 265 270
Val Ala Ser His Leu Gln Lys Phe Arg Leu Tyr Leu Lys Arg Leu Ser
275 280 285
Gly Val Ala Gln Gln Gly Gly Ile Ser Ser Thr Phe Cys Gly Pro Met
290 295 300
Asp Ser Asn Val Lys Leu Asn Ser Leu Gly Arg Phe Asp Ile Gln Ala
305 310 315 320
Leu Ala Ala Ser Gly Gln Ile Pro Pro Gln Thr Leu Ala Ala Leu His
325 330 335
Ala Glu Leu Phe Gly Arg Pro Thr Gly Ser Leu Val Thr Thr Met Asp
340 345 350
Gln Pro Thr Leu Leu Gln Ala Ser Arg Gln Ser Pro Lys Cys Ile Pro
355 360 365
Val Glu His Gly Val Thr Phe Gly Gln Pro Ile Val Lys Cys Ser Ser
370 375 380
Gly Ile Ser Lys His Phe Pro Gln Asn Met Val Ser Val Glu Glu Val
385 390 395 400
Ser Ser Gly Tyr Gly Ala Trp Pro Ser Asn Ser Leu Gly Thr Val Gly
405 410 415
Pro Ser Thr Asn Leu Gly Gly Met Thr Thr Gln Asn Gly Asn Met Leu
420 425 430
Met Asp Ile Phe His Gln Gln Gln Lys Gln Gln Gln Pro Gln Gln Gln
435 440 445
Gln Ser Leu Ala Asp Pro Ser Arg Ser Ile Asn Val Gln Pro Ser Cys
450 455 460
Leu Val Val Pro Ser Gln Ser Ser Ala Cys Phe Gln Ala Gly Asn Ser
465 470 475 480
Pro Ala Ser Val Asn Gln Ser Asn Phe Asn Arg Asn Val Val Ile Asp
485 490 495
Tyr Ser Leu Leu Ser Ser Gln Ser Asn Asn Ser Ala Leu Asn Ile Gly
500 505 510
His Ile Pro Glu Gly Asp Leu Lys Thr Thr Gly Ala Val Asn Gly Tyr
515 520 525
Ser Ala Pro Gly Ser Leu Ser Pro Pro Ala Ser Ser Cys Ser Val Asn
530 535 540
Ala Asp Ser Gly Val Pro Arg Gln Val Gln Asn Pro Thr Leu Ala Phe
545 550 555 560
Gly Ala Val Arg Gln Leu Pro Ala Leu Ser Pro Asn Ile Phe Asn Ile
565 570 575
Gln Gly Ser Tyr Gly Val Arg Ser Asp Asp Ile Leu Asp Gln Gly Pro
580 585 590
Phe Phe Lys Asn Leu Gly Phe Val Gly Lys Gly Thr Cys Ile Pro Ser
595 600 605
Arg Phe Ala Val Asp Glu Phe Glu Thr Pro Ser Ser Asn Leu Ser His
610 615 620
Gly Lys Leu Tyr Val Glu Asn Asn Asp Asn Lys Val Lys Gln Glu Pro
625 630 635 640
Asn Ile Asp Phe Thr Asp Thr Ser Arg Val Gly Ile Pro Val Leu Gln
645 650 655
Gln Tyr Pro Pro Asn Asp Leu Met Ser Val Phe Thr Glu
660 665
<210> 29
<211> 654
<212> PRT
<213> Solanum lycopersicum
<400> 29
Met Val Ser Met Ser Gly Glu Val Ala Thr Cys Lys Ser Glu Ala Thr
1 5 10 15
Val Val Thr Asp His Phe Pro Val Gly Leu Arg Val Leu Val Val Asp
20 25 30
Asp Asp Val Val Cys Leu Arg Ile Ile Glu Gln Met Leu Arg Arg Cys
35 40 45
Lys Tyr Ser Val Thr Thr Cys Thr Gln Ala Met Val Ala Leu Asn Leu
50 55 60
Leu Arg Glu Lys Arg Gly Thr Phe Asp Ile Val Leu Ser Asp Val His
65 70 75 80
Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu Leu Val Gly Leu Glu
85 90 95
Met Asp Leu Pro Val Ile Met Met Ser Gly Asp Gly Arg Thr Asn Leu
100 105 110
Val Met Arg Gly Val Gln His Gly Ala Cys Asp Tyr Leu Ile Lys Pro
115 120 125
Ile Arg Asp Glu Glu Leu Lys Asn Ile Trp Gln His Val Val Arg Lys
130 135 140
Arg Tyr Asn Ser Ser Lys Glu Pro Glu Cys Ser Gly Ser Leu Asp Asp
145 150 155 160
Asn Asp Arg Tyr Arg Arg Arg Ser Asp Asp Ala Glu Cys Ala Ser Ser
165 170 175
Val Ile Glu Gly Ala Asp Gly Val Leu Lys Pro Gln Lys Lys Lys Arg
180 185 190
Glu Ala Lys Glu Asp Asp Thr Glu Met Glu Asn Asp Asp Pro Ser Thr
195 200 205
Thr Lys Lys Pro Arg Val Val Trp Ser Val Glu Leu His Gln Gln Phe
210 215 220
Val Ser Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg
225 230 235 240
Ile Leu Glu Leu Met Asn Val Pro Gly Leu Thr Arg Glu Asn Val Ala
245 250 255
Ser His Leu Gln Lys Phe Arg Leu Tyr Leu Lys Arg Leu Ser Gly Val
260 265 270
Val Gln Gln Gln Gly Gly Leu Pro Ser Thr Phe Cys Gly Pro Ile Glu
275 280 285
Gln Asn Ser Glu Leu Gly Ser Leu Gly Arg Phe Asp Ile Gln Ala Leu
290 295 300
Ala Ala Ser Gly Gln Ile Pro Pro Glu Thr Leu Thr Ala Leu His Ala
305 310 315 320
Glu Leu Leu Gly Arg Ser Thr Ser Asn Leu Val Leu Pro Ala Val Glu
325 330 335
Gln Gln Asn Leu Val Gln Val Ser Leu Gln Gln Ala Lys Cys Ile Pro
340 345 350
Val Asp Gln Val Met Ala Tyr Gly Gln Pro Leu Leu Lys Cys Pro Ala
355 360 365
Ser Ile Ser Asn Ser Lys His Leu Ser Gln Ala Ile Leu Ser Ala Glu
370 375 380
Asp Val His Ser Gly Phe Gly Ser Gln Arg Ala Lys Asn Ile Cys Met
385 390 395 400
Val Pro Ser Ser Asn Pro Ile Ala Pro Asn Ser Asn Met Leu Thr Ala
405 410 415
Met Met Gln Gln Gln Gln Trp Gln Lys Gln Gln Gln Ile Glu Leu Gln
420 425 430
His Arg Gln Ser Gly Pro Pro Glu Val Asn Arg Ser Ile Asn Val Gln
435 440 445
Pro Ser Cys Leu Val Leu Pro Ser Gln Leu Pro Gly His Phe Gln Val
450 455 460
Gly Asp Ser Pro Ala Ser Ile Ser Arg Ala Gly Ser Leu Ser Lys Ser
465 470 475 480
Ser Val Ile Asp Tyr Gly Val Leu Ser Pro Gln Ser Asn Asn Ser Ser
485 490 495
Gly Val Val Gln Val Leu Asp Arg Glu Leu Lys Pro Glu Cys Gly Leu
500 505 510
Asn Arg Leu Pro Ser Gly Gly Ser Leu Ser Arg Ser Cys Ser Ile Asn
515 520 525
Ala Asp Asn Ser Val Asp Leu Gln Leu His Asn Ser Ser Ser Ala Phe
530 535 540
Gly Ser Ser Lys Gln Leu Pro Gly Leu Ile Pro Ser His Leu Gly Ser
545 550 555 560
Pro Val Pro Tyr Cys Ile Asn Ser Ser Leu Val Leu Asp Gln Gly Arg
565 570 575
Met Lys Gly Ala Ser Ile Pro Ser Arg Phe Ala Val Asp Glu Ser Asp
580 585 590
Ser Pro Met Cys Asn Phe Asn Thr Ala Lys Ile Tyr Leu Glu Glu Thr
595 600 605
Lys Val Lys Gln Glu Pro Asn Met Asn Val Met Glu Asn Ala Lys Val
610 615 620
Gly Pro Ala Ile Phe Gln Lys Phe Gln Pro Gly Asp Leu Met Ser Val
625 630 635 640
Phe Arg Leu Ser Phe Ala Arg Val Lys Val Ser Ser Ser Pro
645 650
<210> 30
<211> 653
<212> PRT
<213> Solanum tuberosum
<400> 30
Met Ser Gly Asp Val Ala Thr Cys Lys Ser Glu Ala Thr Val Val Thr
1 5 10 15
Asp His Phe Pro Leu Gly Leu Arg Val Leu Val Val Asp Asp Asp Val
20 25 30
Val Cys Leu Arg Ile Ile Glu Gln Met Leu Arg Arg Cys Lys Tyr Ser
35 40 45
Val Thr Thr Cys Thr Gln Ala Met Val Ala Leu Asn Leu Leu Arg Glu
50 55 60
Lys Arg Gly Thr Phe Asp Ile Val Leu Ser Asp Val His Met Pro Asp
65 70 75 80
Met Asp Gly Phe Lys Leu Leu Glu Leu Val Gly Leu Glu Met Asp Leu
85 90 95
Pro Val Ile Met Met Ser Gly Asp Gly Arg Thr Asn Leu Val Met Arg
100 105 110
Gly Val Gln His Gly Ala Cys Asp Tyr Leu Ile Lys Pro Ile Arg Asp
115 120 125
Glu Glu Leu Lys Asn Ile Trp Gln His Val Val Arg Lys Arg Tyr Asn
130 135 140
Ser Ser Lys Glu Leu Glu Cys Ser Gly Ser Leu Asp Asp Asn Asp Arg
145 150 155 160
Tyr Lys Arg Gly Ser Asp Asp Ala Glu Cys Ala Ser Ser Val Ile Glu
165 170 175
Gly Ala Asp Gly Val Leu Lys Pro Gln Lys Lys Lys Arg Glu Ala Lys
180 185 190
Glu Glu Asp Asp Thr Glu Met Glu Asn Asp Asp Pro Ser Thr Ser Lys
195 200 205
Lys Pro Arg Val Val Trp Ser Val Glu Leu His Gln Gln Phe Val Ser
210 215 220
Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Arg Ile Leu
225 230 235 240
Glu Leu Met Asn Val Pro Gly Leu Thr Arg Glu Asn Val Ala Ser His
245 250 255
Leu Gln Glu Asn Gln Lys Phe Arg Leu Tyr Leu Lys Arg Leu Ser Gly
260 265 270
Val Val Gln Gln Gln Gly Gly Leu Pro Ser Thr Phe Cys Gly Pro Ile
275 280 285
Glu Gln Asn Ser Glu Leu Gly Ser Leu Gly Arg Phe Asp Ile Gln Ala
290 295 300
Leu Ala Ala Ser Gly Gln Ile Pro Pro Glu Thr Leu Thr Ala Leu His
305 310 315 320
Ala Glu Leu Leu Gly Arg Ser Thr Ser Asn Leu Val Leu Pro Ala Val
325 330 335
Glu Ile Gln Asn Leu Leu Gln Ala Ser Leu Gln Gln Ala Lys Cys Ile
340 345 350
Pro Ala Asp Gln Val Met Ala Tyr Gly Gln Pro Leu Leu Lys Cys His
355 360 365
Pro Ser Ile Ser Asn Ser Lys His Leu Ser Gln Ser Ile Leu Ser Ala
370 375 380
Glu Asp Val His Ser Gly Phe Gly Ser Gln Arg Ala Lys Asn Ile Cys
385 390 395 400
Leu Val Pro Ser Ser Asn Pro Ile Gly Leu Ala Ala Pro Asn Ser Asn
405 410 415
Met Leu Met Ala Met Met Gln Gln Gln Gln Trp Gln Lys Gln Gln Gln
420 425 430
Met Glu Leu Gln His Arg Arg Ser Gly Pro Pro Glu Val Asn His Ser
435 440 445
Ile Asn Val Gln Pro Ser Cys Leu Val Leu Pro Ser Gln Leu Pro Gly
450 455 460
Asn Phe Gln Val Gly Asp Ser Pro Ala Ser Ile Ser Arg Ala Gly Ser
465 470 475 480
Leu Ser Lys Ser Ser Val Ile Asp Tyr Gly Val Leu Ser Pro Gln Ser
485 490 495
Asn Asn Ser Ser Gly Val Val Gln Val Leu Asp Arg Glu Leu Lys Pro
500 505 510
Glu Cys Gly Leu Asn Arg Leu Pro Ser Gly Gly Ser Leu Ser Arg Ser
515 520 525
Cys Ser Ile Asn Ala Asp Asn Ser Val Gly Leu Gln Leu His Asn Ser
530 535 540
Ser Ser Ala Phe Gly Ser Ser Lys Gln Leu Pro Ala Leu Ile Pro Asn
545 550 555 560
His Leu Gly Ser Pro Val Pro Tyr Tyr Ile Asn Ser Ser Gln Val Leu
565 570 575
Asp Gln Gly His Thr Arg Asn Pro Gly Val Gly Lys Cys Ala Ser Ile
580 585 590
Pro Ser Arg Phe Ala Val Asp Glu Ser Asp Ser Pro Met Cys Asn Phe
595 600 605
Asn Thr Ala Lys Asn Tyr Leu Glu Glu Thr Lys Val Lys Gln Glu Pro
610 615 620
Asn Met Asn Val Met Glu Asn Ala Lys Val Gly Pro Ala Ile Phe Gln
625 630 635 640
Lys Phe Gln Pro Gly Asp Leu Met Ser Val Phe Ser Asp
645 650
<210> 31
<211> 669
<212> PRT
<213> Gossypium hirsutum
<400> 31
Met Ala Thr Met His Arg Val Val Gln Ser Ser Val Ser Thr Ser Asp
1 5 10 15
Ala Thr Thr Thr Ser Tyr Asp Gly Leu Thr Ser Cys Lys Ala Ala Asp
20 25 30
Ile Val Ile Ser Asp Gln Phe Pro Ala Gly Leu Arg Val Leu Val Val
35 40 45
Asp Asp Asp Ile Thr Cys Leu Lys Ile Leu Glu Lys Met Leu His Arg
50 55 60
Cys Arg Tyr His Val Thr Thr Cys Pro Gln Ala Lys Val Ala Leu Asn
65 70 75 80
Leu Leu Arg Glu Arg Lys Gly Cys Phe Asp Val Ile Leu Ser Asp Val
85 90 95
Tyr Met Pro Asp Met Asp Gly Tyr Lys Leu Leu Glu His Val Gly Leu
100 105 110
Glu Met Asp Leu Pro Val Ile Met Met Ser Ala Asp Gly Ser Thr Arg
115 120 125
Ala Val Met Lys Gly Ile Arg His Gly Ala Cys Asp Tyr Leu Ile Lys
130 135 140
Pro Ile Arg Glu Glu Glu Leu Lys Asn Ile Trp Gln His Val Val Arg
145 150 155 160
Lys Lys Trp Asn Glu Asn Lys Glu Leu Glu His Ser Gly Ser Leu Asp
165 170 175
Asp Thr Asp Gln His Lys Gln Arg His Asp Asp Ala Glu Tyr Ala Ser
180 185 190
Ser Val Asn Asp Ala Thr Glu Thr Ser Leu Lys Pro Leu Lys Lys Arg
195 200 205
Ser Asn Ser Lys Glu Glu Asp Asp Gly Glu Ile Asp Asn Asp Asp Pro
210 215 220
Ser Thr Ser Lys Lys Pro Arg Val Val Trp Ser Val Glu Leu His Gln
225 230 235 240
Gln Phe Val Ser Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro
245 250 255
Lys Arg Ile Leu Glu Leu Met Asn Val Pro Gly Leu Thr Arg Glu Asn
260 265 270
Val Ala Ser His Leu Gln Lys Phe Arg Leu Tyr Leu Lys Arg Ile Ser
275 280 285
Gly Val Ala Gln Gln Gly Gly Ile Ala Asn Pro Leu Cys Gly Pro Val
290 295 300
Glu Ala Asn Val Lys Ile Gly Ser Leu Gly Ser Phe Asn Ile Gln Ala
305 310 315 320
Leu Ala Ala Ser Gly Gln Ile Pro Pro Gln Thr Leu Ala Ala Val His
325 330 335
Ala Glu Leu Leu Gly Arg Ser Ala Gly Asn Leu Val Val Ala Thr Asp
340 345 350
Gln Pro Ala Leu Leu Gln Ala Thr Pro Gln Gly Ala Lys Cys Ile Gln
355 360 365
Val Asp Gln Gly Val Ala Phe Val Gln His Ser Val Lys Ser Glu Ser
370 375 380
Ser Ser Ser Lys His Phe Ser Gln Ser Phe Ala Pro Val Glu Asp Val
385 390 395 400
Ala Ser Gly Phe Arg Ser Trp Pro Ser Asn Asn Ile Gly Thr Ala Gly
405 410 415
Pro Ser Asn Ser Gly Gly Leu Ser Ser Gln Asn Gly Asn Met Leu Ile
420 425 430
Asp Leu Leu Gln Gln Gln Gln Gln Leu Gln Lys Pro Gln Gln Arg Ser
435 440 445
Thr Val Ser Glu Leu Arg Arg Ser Ile Asn Val Gln Pro Ser Cys His
450 455 460
Val Val Pro Ser Gln Ser Ser Ala Ser Phe Arg Ala Gly Asn Ser Pro
465 470 475 480
Val Ser Val Thr Gln Asn Gly Ser Tyr Ser Arg Thr Ala Val Ile Asp
485 490 495
Tyr Ser Leu Leu Ser Ser Gln Ser Asn Cys Pro Ser Leu Asn Ile Gly
500 505 510
Gln Val Ser Asp Val Asn Leu Gln Thr Thr Gly Val Leu Ser Gly Tyr
515 520 525
Ile Pro Pro Ala Ser Val Ser Pro Ser Val Ser Ser Cys Ser Val Asn
530 535 540
Ala Asp Asn Cys Ala Ser Gln Gln Val Gln Thr Ser Ser Met Thr Phe
545 550 555 560
Lys Ala Ser Arg His Leu Pro Gly Phe Val His Ser Thr Ser Asn Ile
565 570 575
Pro Asp Pro Tyr Gly Ser Thr Lys Ser Gly Asp Leu Leu Asn Gln Glu
580 585 590
Pro Phe Asn Asn Leu Gly Tyr Ile Asn Lys Gly Thr Cys Leu Pro Ala
595 600 605
Lys Phe Ala Val Asp Glu Phe Gln Ser His Leu Ser Ser Ser Ser His
610 615 620
Gly Lys Val Phe Ser Glu Asn Ile Gly Thr Arg Val Lys Gln Glu Pro
625 630 635 640
Ser Met Glu Phe Gly Asp Asn Ala Lys Val Gly Ile Pro Met Leu Gln
645 650 655
Gln Phe Arg Pro Asn Asp Leu Met Ser Val Phe Thr Glu
660 665
<210> 32
<211> 681
<212> PRT
<213> Theobroma cacao
<400> 32
Met Asn Ser Ser Ser Gly Lys Gly Ser Met Ser Ala Ala Ser Ser Ser
1 5 10 15
Ala Ala Trp Lys Ala Gly Asp Val Val Pro Asp Gln Phe Pro Ala Gly
20 25 30
Leu Arg Val Leu Val Val Asp Asp Asp Pro Thr Cys Leu Met Ile Leu
35 40 45
Glu Lys Met Leu Arg Thr Cys Leu Tyr Glu Val Thr Lys Cys Asn Arg
50 55 60
Ala Glu Thr Ala Leu Ser Leu Leu Arg Glu Asn Lys Asn Gly Phe Asp
65 70 75 80
Ile Val Ile Ser Asp Val His Met Pro Asp Met Asp Gly Phe Lys Leu
85 90 95
Leu Glu His Ile Gly Leu Glu Met Asp Leu Pro Val Ile Met Met Ser
100 105 110
Ala Asp Asp Gly Lys His Val Val Met Lys Gly Val Thr His Gly Ala
115 120 125
Cys Asp Tyr Leu Ile Lys Pro Val Arg Ile Glu Ala Leu Lys Asn Ile
130 135 140
Trp Gln His Val Val Arg Lys Arg Lys Asn Glu Trp Lys Asp Phe Glu
145 150 155 160
Gln Ser Gly Ser Val Glu Glu Gly Asp Arg Gln Pro Lys Gln Ser Glu
165 170 175
Glu Ala Asp Tyr Ser Ser Ser Ala Asn Glu Gly Asn Trp Lys Ser Ser
180 185 190
Lys Lys Arg Lys Asp Asp Asp Asp Glu Ala Glu Glu Arg Asp Asp Thr
195 200 205
Ser Thr Leu Lys Lys Pro Arg Val Val Trp Ser Val Glu Leu His Gln
210 215 220
Gln Phe Val Ala Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro
225 230 235 240
Lys Lys Ile Leu Glu Leu Met Asn Val Pro Gly Leu Thr Arg Glu Asn
245 250 255
Val Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu Arg Arg Leu Ser
260 265 270
Gly Val Ser Gln His Gln Ser Asn Leu Asn Asn Ser Phe Met Ser Pro
275 280 285
Gln Glu Ala Thr Phe Gly Pro Leu Ser Pro Leu Asn Gly Leu Asp Leu
290 295 300
Gln Thr Leu Ala Ala Thr Gly Gln Leu Pro Ala Gln Ser Leu Ala Thr
305 310 315 320
Phe Gln Ala Ala Gly Leu Gly Arg Ser Thr Ala Lys Ser Gly Ile Ala
325 330 335
Met Pro Leu Val Asp Gln Arg Asn Ile Phe Ser Phe Glu Asn Pro Lys
340 345 350
Leu Arg Phe Gly Glu Gly Gln Gln Gln His Met Asn Asn Asn Lys Gln
355 360 365
Leu Asn Leu Leu His Gly Ile Pro Thr Thr Met Glu Pro Lys Gln Leu
370 375 380
Ala Ser Leu His His Ser Ala Gln Ser Ile Gly Asn Ile Asn Met Gln
385 390 395 400
Val Thr Ser His Gly Val Gln Gly Ser Gln Asn Asn Ser Leu Leu Ile
405 410 415
Gln Met Ala Gln Pro Gln Pro Arg Gly Gln Ile Leu Asn Asp Ser Thr
420 425 430
Gly Ser His Ala Pro Arg Leu Pro Ser Thr Leu Gly Gln Pro Ile Leu
435 440 445
Ser Asn Gly Ile Ala Ala Asn Val Ser Thr Arg Asn Gly Ile Pro Glu
450 455 460
Asn Ile Arg Gly Pro Gly Tyr Asn Pro Val Ser Gln Thr Ser Ser Leu
465 470 475 480
Leu Asn Phe Pro Met Asn His Thr Ser Glu Leu Pro Gly Asn Ser Phe
485 490 495
Pro Leu Gly Thr Thr Pro Gly Ile Ser Ser Leu Thr Ser Lys Gly Ala
500 505 510
Phe Gln Glu Asp Ile Asn Ser Asp Val Lys Gly Ser Gly Gly Phe Met
515 520 525
Pro Ser Tyr Asp Ile Phe Asn Asp Leu Asn Gln His Lys Pro Gln Asn
530 535 540
Trp Glu Leu Gln Asn Val Gly Met Thr Phe Asp Ala Ser Gln His Ser
545 550 555 560
Asn Ser Leu Gln Gly Asn Leu Asp Leu Ala Gln Ser Ile Leu Val Gln
565 570 575
Gln Gly Phe Ser Ser Gly Gln Met Asn Gly Gln Asn Arg Ser Ala Ala
580 585 590
Val Val Ser Lys Ala Met Phe Ser Ala Gly Asp Cys Thr Glu Gln Gly
595 600 605
Asn Ala Gln Asn Val Asn His His Leu Asn Asn Leu Leu Val Asp Asn
610 615 620
Thr Ile Arg Ile Lys Ser Glu Arg Val Ala Asp Ala Gly Pro Ala Asn
625 630 635 640
Leu Phe Pro Asp His Phe Gly Gln Glu Asp Leu Met Ser Ala Leu Leu
645 650 655
Lys Gln Gln Asp Gly Ile Ala Pro Ala Glu Asn Glu Phe Asp Phe Asp
660 665 670
Gly Tyr Ser Met Asp Asn Ile Pro Val
675 680
<210> 33
<211> 579
<212> PRT
<213> Phaseolus vulgaris
<400> 33
Met Asn Leu Ser Asn Gly Lys Gly Ser Met Ser Thr Val Thr Thr Thr
1 5 10 15
Ala Val Met Lys Ser Gly Asp Ala Val Ser Asp Gln Phe Pro Ala Gly
20 25 30
Leu Arg Val Leu Val Val Asp Asp Asp Pro Thr Cys Leu Met Ile Leu
35 40 45
Glu Lys Met Leu Arg Thr Cys Leu Tyr Glu Val Thr Lys Cys Asn Arg
50 55 60
Ala Glu Thr Ala Leu Ser Leu Leu Arg Glu Asn Lys Asn Gly Phe Asp
65 70 75 80
Ile Val Ser Ala Asn Glu Gly Ser Trp Arg Asn Ser Lys Lys Arg Arg
85 90 95
Asp Glu Glu Glu Glu Ala Glu Asp Arg Asp Asp Thr Ser Thr Leu Lys
100 105 110
Lys Pro Arg Val Val Trp Ser Val Glu Leu His Gln Gln Phe Val Ala
115 120 125
Ala Val Asp Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Lys Ile Leu
130 135 140
Glu Leu Met Asn Val Pro Gly Leu Thr Arg Glu Asn Val Ala Ser His
145 150 155 160
Leu Gln Lys Tyr Arg Leu Tyr Leu Arg Arg Leu Ser Gly Val Ser Gln
165 170 175
His Gln Asn Asn Leu Asn Asn Ser Phe Leu Gly Ser Gln Glu Ala Thr
180 185 190
Phe Gly Thr Ile Ser Ser Ile Asn Gly Ile Asp Leu Gln Thr Leu Ala
195 200 205
Val Thr Gly Gln Leu Pro Ala Gln Ser Leu Ala Thr Leu Gln Ala Ala
210 215 220
Gly Leu Gly Arg Ser Thr Ala Lys Thr Gly Val Pro Met Pro Leu Met
225 230 235 240
Asp Gln Arg Asn Leu Phe Ser Phe Glu Asn Pro Arg Val Arg Phe Gly
245 250 255
Glu Gly Gln Gln Gln His Leu Ser Ser Ser Lys Pro Met Asn Leu Leu
260 265 270
Leu Gly Ile Pro Thr Asn Met Glu Pro Lys Gln Leu Ala Asn Leu His
275 280 285
Gln Ser Thr Gln Ser Ile Ala Ser Leu Asn Met Arg Val Asn Ala Ser
290 295 300
Ala Thr Gln Gly Asn Pro Leu Met Met Gln Met Pro Gln Ser Gln Pro
305 310 315 320
Arg Gly Gln Met Leu Ser Glu Asn Thr Gly Pro Arg Val Pro Arg Leu
325 330 335
Pro Ser Ser Leu Gly Gln Pro Thr Val Ser Asn Gly Ile Ser Asn Gly
340 345 350
Phe Leu Gly Arg Asn Gly Ile Ala Gly Asn Asn Arg Gly Pro Ala Tyr
355 360 365
Asn Pro Val Pro Pro Asn Ser Ser Leu Leu Ser Phe Pro Met Asn Gln
370 375 380
Ser Ser Glu Val Ser Val Asn Asn Ser Leu Pro Leu Gly Ser Ser Pro
385 390 395 400
Gly Ile Ser Ser Ile Thr Thr Lys Gly Ser Phe Gln Glu Glu Val Thr
405 410 415
Ser Gly Ile Lys Ala Thr Gly Gly Phe Pro Ser Tyr Asp Ile Phe Asn
420 425 430
Glu Leu His His Gln Lys Ser His Asp Trp Glu Ile Thr Asn Pro Ser
435 440 445
Leu Thr Tyr Ser Ala Ser His His Ala Asn Pro Leu Gln Gly Asn Ile
450 455 460
Asp Val Ser Pro Ser Val Leu Val His Gln Gly Phe Ser Ser Thr Gln
465 470 475 480
Gln Asn Gly Gln Ser Arg Asp Ala Thr Leu Ile Gly Lys Ala Met Phe
485 490 495
Ser Leu Gly Glu Gly Ser Glu Gln Asp Asn Leu Gln Asn Ala Val Gln
500 505 510
His Leu His Pro Leu Leu Val Asp Asn Ser Ile Arg Val Lys Ala Glu
515 520 525
Arg Ile Pro Asp Ala Ser Ser Gln Thr Asn Leu Phe Pro Asp His Tyr
530 535 540
Val Gln Glu Asp Leu Met Ser Ala Leu Leu Lys Gln Gln Glu Gly Met
545 550 555 560
Gly Pro Ala Glu Ser Glu Phe Glu Phe Asp Ala Tyr Ser Leu Asp Asn
565 570 575
Ile Pro Val
<210> 34
<211> 679
<212> PRT
<213> Glycine max
<400> 34
Met Asn Leu Ser Asn Gly Lys Gly Ser Met Ser Thr Leu Thr Ala Ser
1 5 10 15
Val Val Met Lys Ser Gly Asp Ala Val Ser Asp Gln Phe Pro Ala Gly
20 25 30
Leu Arg Val Leu Val Val Asp Asp Asp Pro Thr Cys Leu Met Ile Leu
35 40 45
Glu Lys Met Leu Arg Thr Cys Leu Tyr Glu Val Thr Lys Cys Asn Arg
50 55 60
Ala Glu Thr Ala Leu Ser Leu Leu Arg Glu Asn Lys Asn Gly Phe Asp
65 70 75 80
Ile Val Ile Ser Asp Val His Met Pro Asp Met Asp Gly Phe Lys Leu
85 90 95
Leu Glu His Ile Gly Leu Glu Met Asp Leu Pro Val Ile Met Met Ser
100 105 110
Ala Asp Asp Gly Lys Ser Val Val Met Lys Gly Val Thr His Gly Ala
115 120 125
Cys Asp Tyr Leu Ile Lys Pro Val Arg Ile Glu Ala Leu Lys Asn Ile
130 135 140
Trp Gln His Val Val Arg Lys Arg Lys Asn Glu Trp Lys Asp Ala Glu
145 150 155 160
Gln Ser Gly Ser Ala Glu Glu Gly Asp Arg Gln Pro Lys Ala Ser Asp
165 170 175
Glu Ala Asp Tyr Ser Ser Ser Ala Asn Glu Gly Ser Trp Arg Asn Ser
180 185 190
Lys Lys Arg Arg Asp Glu Glu Glu Glu Ala Glu Asp Arg Asp Asp Thr
195 200 205
Ser Thr Leu Lys Lys Pro Arg Val Val Trp Ser Val Glu Leu His Gln
210 215 220
Gln Phe Val Ala Ala Val Asp Gln Leu Gly Ile Asp Lys Ala Val Pro
225 230 235 240
Lys Lys Ile Leu Glu Leu Met Asn Val Pro Gly Leu Thr Arg Glu Asn
245 250 255
Val Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu Arg Arg Leu Ser
260 265 270
Gly Val Ser Gln His Gln Asn Asn Met Asn Asn Ser Phe Leu Ser Pro
275 280 285
Gln Glu Ala Thr Phe Gly Thr Ile Ser Ser Ile Asn Gly Ile Asp Leu
290 295 300
Gln Thr Leu Ala Val Ala Gly Gln Leu Pro Ala Gln Ser Leu Ala Thr
305 310 315 320
Leu Gln Ala Ala Gly Leu Gly Arg Pro Thr Gly Lys Ala Gly Val Pro
325 330 335
Met Pro Leu Met Asp Gln Arg Asn Leu Phe Ser Phe Glu Asn Pro Arg
340 345 350
Leu Arg Phe Gly Glu Gly Gln Gln Gln His Leu Ser Thr Ser Lys Pro
355 360 365
Met Asn Leu Leu His Gly Ile Pro Thr Asn Met Glu Pro Lys Gln Leu
370 375 380
Ala Asn Leu His Gln Ser Thr Gln Ser Ile Gly Ser Leu Asn Met Arg
385 390 395 400
Val Asn Ala Ser Ala Thr Gln Gly Ser Pro Leu Leu Met Gln Met Ala
405 410 415
Gln Ser Gln Pro Arg Gly Gln Met Leu Ser Glu Asn Ile Gly Pro Arg
420 425 430
Val Pro Arg Leu Pro Ser Ser Leu Gly Gln Pro Thr Val Ser Asn Gly
435 440 445
Ile Ser Asn Gly Leu Leu Gly Arg Asn Gly Ile Ala Gly Asn Asn Arg
450 455 460
Gly Pro Ala Tyr Asn Pro Val Pro Pro Ser Ser Ser Leu Leu Ser Phe
465 470 475 480
Pro Met Asn Gln Thr Ser Glu Met Ser Val Asn Asn Ser Phe Pro Leu
485 490 495
Gly Ser Thr Pro Gly Ile Ser Ser Ile Thr Thr Lys Gly Ser Phe Gln
500 505 510
Glu Glu Val Thr Ser Gly Ile Lys Gly Ser Gly Gly Phe Pro Ser Tyr
515 520 525
Asp Ile Phe Asn Glu Leu His His Gln Lys Pro His Asp Trp Glu Ile
530 535 540
Thr Asn Pro Asn Leu Thr Tyr Asn Ala Ser Gln His Ala Asn Pro Leu
545 550 555 560
Gln Gly Asn Ile Asp Val Thr Pro Ser Val Leu Val His Gln Gly Phe
565 570 575
Ser Ser Thr Gln Gln Thr Gly Gln Ser Arg Asp Ala Ala Leu Ile Gly
580 585 590
Lys Ala Met Phe Ser Met Gly Glu Gly Leu Glu Gln Asn Asn Phe Gln
595 600 605
Asn Ala Ser Gln Asn Leu Asn Ser Leu Leu Leu Asp Asn Ser Ile Arg
610 615 620
Val Lys Ala Glu Arg Ile Pro Asp Ala Ser Ser Gln Thr Asn Leu Phe
625 630 635 640
Pro Glu His Tyr Gly Gln Glu Asp Leu Met Ser Ala Leu Leu Lys Gln
645 650 655
Gln Glu Gly Met Gly Pro Ser Glu Asn Glu Phe Asp Phe Asp Gly Tyr
660 665 670
Ser Leu Asp Asn Ile Pro Val
675
<210> 35
<211> 668
<212> PRT
<213> Chenopodium quinoa
<400> 35
Met Asn Leu Gly Gly Gly Leu Met Gly Ser Met Ala Met Pro Ser Ser
1 5 10 15
Thr Val Ser Arg Lys Ser Ser Glu Val Val Thr Ala Asp Gln Phe Pro
20 25 30
Val Gly Leu Arg Val Leu Val Val Asp Asp Asp Pro Thr Cys Leu Thr
35 40 45
Ile Leu Glu Lys Met Leu Arg Thr Cys Arg Tyr Glu Val Thr Lys Thr
50 55 60
Asn Arg Ala Glu His Ala Leu Asn Met Leu Arg Glu Asn Lys Asn Gly
65 70 75 80
Phe Asp Val Val Ile Ser Asp Val His Met Pro Asp Met Asp Gly Phe
85 90 95
Lys Leu Leu Glu Gln Val Gly Leu Glu Met Asp Leu Pro Val Ile Met
100 105 110
Met Ser Ala Asp Asp Ser Lys Gln Val Val Met Lys Gly Val Thr His
115 120 125
Gly Ala Cys Asp Tyr Leu Ile Lys Pro Val Arg Ile Glu Ala Leu Lys
130 135 140
Asn Ile Trp Gln His Val Val Arg Lys Lys Lys Tyr Glu Tyr Asn Lys
145 150 155 160
Asp Val Glu Gln Ser Gly Ser Trp Asp Glu Gly Asp Arg Gln Leu Lys
165 170 175
His Asp Asp Ala Val Ser Ser Pro Ala Asn Asp Gly Ser Trp Lys Asn
180 185 190
Ser Lys Arg Lys Ser Gly Glu Asp Asp Glu Ala Asp Asp Lys Asp Asp
195 200 205
Thr Thr Thr Leu Lys Lys Pro Arg Val Val Trp Ser Val Glu Leu His
210 215 220
Gln Gln Phe Val Ala Ala Val Asn Gln Leu Gly Ile Asp Lys Ala Val
225 230 235 240
Pro Lys Lys Ile Leu Glu Leu Met Asn Val Pro Gly Leu Thr Arg Glu
245 250 255
Asn Val Ala Ser His Leu Gln Lys Tyr Arg Leu Tyr Leu Arg Arg Leu
260 265 270
Ser Gly Val Ser Gln His Gln Gly Gly Leu Asn Ser Ser Phe Met Pro
275 280 285
Gln Asp Pro Ser Phe Ser Thr Met Ser Ser Leu Gly Gly Ile Asp Leu
290 295 300
Gln Thr Leu Ala Ala Thr Gly Gln Leu Ser Ala Gln Thr Leu Ala Ala
305 310 315 320
Tyr Thr Arg Leu Pro Pro Thr Ile Lys Pro Gly Ile Ser Met Pro Phe
325 330 335
Val Asp Gln Arg Asn Leu Phe Ser Phe Glu Asn Ser Lys Leu Arg Tyr
340 345 350
Gly Asp Gly Gln Gln Ser Gln Ile Ser Asn Val Ser Lys Gln Met Asn
355 360 365
Leu Leu His Gly Phe Pro Thr Thr Met Glu Pro Lys Gln Leu Ala Val
370 375 380
Leu Asn Gln Ser Ala Gln Thr Leu Gly Ser Met Asn Met Gln Ala Asn
385 390 395 400
Ala Ser Ser Ser His Gln Ser Ser Ser Leu Leu Met Gln Gln Met Val
405 410 415
Pro Gln Gln Arg Gly His Ile Ser Asn Glu Ser Ile Ser Ser Gln Val
420 425 430
Pro Arg Ile Gln Pro Ser Val Gly Gln Pro Leu Gln Ser Asn Gly Asn
435 440 445
Ala Asn Ala Val Leu Ser Arg Asn Gly Ile Pro Tyr Asp Pro Val Asn
450 455 460
Gln Ser Ala Ser Val Val Asp Phe Ser Val Asn His Ile Pro Glu Leu
465 470 475 480
Pro Gly Asn Ser Phe Pro Leu Gly Ser Thr Pro Gly Ile Thr Ser Ile
485 490 495
Thr Ser Lys Gly Phe Asn Gln Glu Glu Ile Gly Ser Asp Ile Lys Val
500 505 510
Ser Arg Gly Phe Val Gly Ser Tyr Asp Met Phe Ser Glu Leu Gln His
515 520 525
Lys Pro Gln Glu Trp Gln Met Gln Asn Pro Asn Met Gly Phe Ala Gly
530 535 540
Ser Ser Gln His Val Pro Ser Val Gln Ser Gly Val Asn Val Ala Pro
545 550 555 560
Ser Ile Met Val Asn Gln Ser Tyr Val Ser Gly Gln Lys Asn Glu Gln
565 570 575
Asn Gly His Ser Met Ala Gly Lys Pro Met Tyr Ser Ala Gly Leu Glu
580 585 590
Asn Gln His Met Gly Met Gln Asn Val Asn Gln Asn Tyr Asn Ser Ile
595 600 605
His Val Asn Asn Ser Ser Arg Val Lys Ala Glu Ser Val Ser Asp Val
610 615 620
Val Asn Leu Gly Ala Asn Leu Phe Asp Tyr Ser Pro Glu Asp Met Leu
625 630 635 640
Ser Thr Ile Met Leu Lys Gln Gln Glu Gly Ile Gly Ser Gly Asp Phe
645 650 655
Asp Phe Asp Gly Tyr Thr Leu Asp Asn Ile Pro Val
660 665
<210> 36
<211> 670
<212> PRT
<213> Malus domestica
<220>
<221> MOD_RES
<222> (195)..(195)
<223> Any amino acid
<220>
<221> MOD_RES
<222> (215)..(215)
<223> Any amino acid
<220>
<221> MOD_RES
<222> (530)..(530)
<223> Any amino acid
<220>
<221> MOD_RES
<222> (540)..(540)
<223> Any amino acid
<220>
<221> MOD_RES
<222> (579)..(579)
<223> Any amino acid
<400> 36
Met Ala Ala Leu Gln Arg Val Ala Gln Ser Ser Val Ser Thr Thr Ala
1 5 10 15
Ser Ser Tyr Gly Ser Cys Lys Val Gly Gly Gly Val Leu Ser Pro Ser
20 25 30
Ala Gly Ile Glu Met Ala Val Pro Asn Gln Phe Pro Ala Gly Leu Arg
35 40 45
Val Leu Val Val Asp Asp Asp Thr Thr Cys Leu Arg Ile Leu Glu Leu
50 55 60
Met Leu Leu Arg Cys Leu Tyr Gln Val Thr Thr Cys Ser Glu Ala Thr
65 70 75 80
Val Ala Leu Asn Leu Leu Arg Glu Arg Lys Asp Cys Phe Asp Val Val
85 90 95
Leu Ser Asp Val His Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu
100 105 110
His Val Gly Leu Glu Met Asp Leu Pro Val Ile Met Met Ser Ala Asp
115 120 125
Gly Arg Thr Ser Val Val Met Arg Gly Ile Arg His Gly Ala Cys Asp
130 135 140
Phe Leu Ile Lys Pro Ile Ser Glu Ala Glu Leu Lys Asn Ile Trp Gln
145 150 155 160
His Val Val Arg Lys Lys Trp Asn Gly Ser Lys Glu Leu Glu His Ser
165 170 175
Gly Ser Leu Glu Asp Asn Asp Pro His Lys Arg Gly Asn Asn Asp Phe
180 185 190
Glu Tyr Xaa Ser Ser Val Asn Glu Gly Thr Glu Val Ser Leu Lys Gly
195 200 205
His Lys Lys Arg Ile Asn Xaa Lys Glu Asp Asp Asp Gly Asp Thr Glu
210 215 220
Asn Asp Asp Leu Ser Thr Ser Lys Lys Pro Arg Val Val Trp Ser Val
225 230 235 240
Glu Leu His Gln Gln Phe Val Thr Ala Val Asn Gln Leu Gly Leu Asp
245 250 255
Lys Ala Val Pro Lys Arg Ile Leu Glu Leu Met Asn Val Pro Gly Leu
260 265 270
Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Phe Arg Leu Tyr Leu
275 280 285
Lys Arg Leu Ser Gly Val Ala Gln Gln Gln Ser Gly Ile Ala Asn Pro
290 295 300
Leu Cys Gly Pro Val Asp Ser Asn Gly Lys Leu Gly Ser Leu Ser Arg
305 310 315 320
Phe Asp Phe Gln Ala Leu Ala Ala Ser Gly Gln Ile Pro Pro Gln Thr
325 330 335
Leu Ala Ala Leu Gln Ala Glu Leu Leu Gly Gln Pro Ala Gly Asn Leu
340 345 350
Val Pro Ala Met Asp Gln Pro Ala Leu Leu His Ala Ser Leu Gln Ala
355 360 365
Pro Lys Arg Pro Pro Val Glu His Gly Val Pro Phe Met Gln Pro Phe
370 375 380
Val Lys Ser Gln Ser Asn Val Ser Lys His Phe Pro Gln Ser Val Ile
385 390 395 400
Ser Ala Glu Asp Ala Ser Leu Gly Phe Gly Gln Trp Arg Ser Asn Ser
405 410 415
Arg Ser Thr Val Ala Pro Ser Asn Asp His Gly Gly Leu Ser Thr Gln
420 425 430
Asn Ser Asn Leu Leu Met Gly Ile Val Pro Gln Glu Gln Arg Gln His
435 440 445
Lys Arg Thr Gln Gln Gln Ser Val Leu Thr Glu Pro Ser Arg Ser Phe
450 455 460
Asn Val Gln Pro Ser Cys Leu Val Val Pro Ser Gln Ser Ser Thr Gly
465 470 475 480
Phe Gln Ala Gly Asn Ser Pro Ala Ser Val Asn Gln Ser Ser Ser Phe
485 490 495
Asn Arg Ser Thr Val Val Asp Tyr Ser Leu Pro Ser Asp Gln Ser Asn
500 505 510
Asn Ser Leu Asn Val Gly His Ile Pro Thr Gly Asn Pro Lys Thr Ser
515 520 525
Gly Xaa Leu Gly Gly Tyr Ser Gly Pro Gly Ser Xaa Cys Ala Thr Ser
530 535 540
Cys Leu Val Asn Ala Asp Asn Ser Thr Ser Tyr Gln Asn Ser Thr Ala
545 550 555 560
Thr Phe Ser Asp Ser Arg Glu Leu Pro Gly Phe Leu His Asn Thr Ala
565 570 575
Asn Ser Xaa Gly Phe Tyr Val Asp Lys Ser Gly Glu Met Leu Asp Gln
580 585 590
Gly Pro Leu Arg Asn Leu Gly Phe Val Gly Lys Glu Thr Cys Ile Pro
595 600 605
Ser Arg Phe Ala Val Asp Asp Phe Glu Ser Gln Met Ser Asn Leu Asn
610 615 620
Pro Gly Arg Ile His Val Glu Ser Ser Gly Thr Leu Val Lys Gln Glu
625 630 635 640
Pro Ser Glu Asp Tyr Val Asp Asn Ala Lys Leu Gly Ile Pro Ile Leu
645 650 655
His Gln Tyr Ser Ser Ser Asp Phe Met Ser Pro Phe Ala Asp
660 665 670
<210> 37
<211> 802
<212> PRT
<213> Zea mays
<400> 37
Pro Tyr Pro Thr His Thr Leu Leu Pro Gln Pro His Leu Ser Leu Ser
1 5 10 15
Ala Cys Val Leu Leu Val Leu Leu Ser Leu Ser Ser Pro Ala Leu Thr
20 25 30
Ser Pro Pro Phe Pro Ala Val Ser Trp Ile Ser Arg Ile Gln Thr Thr
35 40 45
Ala Leu Val Ser Leu Pro Ser Cys Leu Leu Pro Ala Tyr Val Gln Glu
50 55 60
Gly Pro Cys Leu Gly Asp Pro Gly Ala Trp Phe Leu Gly Ser Ala Ala
65 70 75 80
Ser Ala Ala Val Gly Phe Ala Glu Pro Glu Pro Pro Glu Met Thr Val
85 90 95
Asp Glu Leu Lys Leu Gln Ala Arg Ala Ser Gly Gly His Gly Ala Lys
100 105 110
Asp Gln Phe Pro Val Gly Met Arg Val Leu Ala Val Asp Asp Asp Pro
115 120 125
Thr Cys Leu Lys Ile Leu Glu Asn Leu Leu Leu Arg Cys Gln Tyr His
130 135 140
Val Thr Thr Thr Gly Gln Ala Ala Thr Ala Leu Lys Leu Leu Arg Glu
145 150 155 160
Lys Lys Asp Gln Phe Asp Leu Val Ile Ser Asp Val His Met Pro Asp
165 170 175
Met Asp Gly Phe Lys Leu Leu Glu Leu Val Gly Leu Glu Met Asp Leu
180 185 190
Pro Val Ile Met Leu Ser Ala Asn Gly Glu Thr Gln Thr Val Met Lys
195 200 205
Gly Ile Thr His Gly Ala Cys Asp Tyr Leu Leu Lys Pro Val Arg Ile
210 215 220
Glu Gln Leu Arg Thr Ile Trp Gln His Val Val Arg Arg Arg Ser Cys
225 230 235 240
Asp Ala Lys Asn Ser Gly Asn Asp Asn Asp Asp Ser Gly Lys Lys Leu
245 250 255
Gln Val Val Ser Ala Glu Gly Asp Asn Gly Gly Val Asn Arg Asn Lys
260 265 270
Arg Ile Ser Arg Lys Gly Arg Asp Asp Asn Gly Asp Asp Gly Asp Asp
275 280 285
Ser Asp Asp Asn Ser Asn Glu Asn Gly Asp Ser Ser Ser Gln Lys Lys
290 295 300
Pro Arg Val Val Trp Ser Val Glu Leu His Arg Lys Phe Val Ala Ala
305 310 315 320
Val Asn Gln Leu Gly Ile Asp Lys Ala Val Pro Lys Lys Ile Leu Asp
325 330 335
Leu Met Asn Val Glu Asn Ile Thr Arg Glu Asn Val Ala Ser His Leu
340 345 350
Gln Lys Tyr Arg Leu Tyr Leu Lys Arg Leu Ser Ala Asp Ala Ser Arg
355 360 365
Gln Ala Asn Leu Thr Ala Ala Phe Gly Gly Arg Asn Pro Ala Tyr Val
370 375 380
Asn Met Gly Leu Asp Ala Phe Arg Gln Tyr Asn Ala Tyr Gly Arg Tyr
385 390 395 400
Arg Pro Val Pro Thr Thr Asn His Ser Gln Pro Asn Asn Leu Leu Ala
405 410 415
Arg Met Asn Ser Pro Ala Phe Gly Met His Gly Leu Leu Pro Ser Gln
420 425 430
Pro Leu Gln Ile Gly His Asn Gln Asn Asn Leu Ser Thr Ser Leu Gly
435 440 445
Asn Val Gly Gly Met Asn Asn Gly Asn Leu Ile Arg Gly Ala His Met
450 455 460
Pro Leu Gln Asp Thr Ser Lys Cys Phe Pro Thr Gly Pro Ser Gly Asn
465 470 475 480
Ser Phe Ala Asn Ile Ser Asn Ser Thr Gln Leu Val Thr Thr Asn Asn
485 490 495
Leu Pro Leu Gln Ser Leu Glu Pro Ser Asn Gln Gln His Leu Gly Arg
500 505 510
Leu His Ser Ser Ala Asp Pro Phe Asn Ser Phe Val Gly Glu Pro Pro
515 520 525
Gln Phe Ala Asp Leu Gly Arg Cys Asn Thr Thr Trp Pro Thr Ala Val
530 535 540
Ser Ser Ser Asn Val Gln Glu Ile Gly Gln Lys Asp Arg Ile Val Asn
545 550 555 560
Arg Pro Lys Leu Glu Pro Leu Ser Ser Phe Thr Glu Ala Ser Ser Gln
565 570 575
Ile Pro Leu Leu Gly Asn Glu Met Gln Ser His Gln Val Ala Ser Leu
580 585 590
Ala Ser Asn Gly Leu Pro Met Pro Phe Thr Gln Glu Ala Val Pro Phe
595 600 605
Ala Tyr Gly Ser Ser Thr Asn Ser Arg Glu Met Leu Asn Asn Asn Leu
610 615 620
Ala Leu Ser Asn Ser Gly Val Asn Ser Thr Leu Pro Asn Leu Arg Ile
625 630 635 640
Asp Gly Ser Val Val Pro Gly Gln Thr Leu Gly Gly Ser Asn Ser Gly
645 650 655
Gly Cys Val Val Pro Pro Leu Gln Asp Gly Arg Ile Asp His Gln Ala
660 665 670
Val Ser Ser His Leu Asn Tyr Asn Asn Glu Leu Met Gly Thr Gly Arg
675 680 685
Leu Gln Arg Gly Leu Ser Gly Gly Leu Asp Asp Ile Val Val Asp Met
690 695 700
Phe Arg Pro Asp Arg Ala Asp Asp Gly Val Ser Phe Ile Asp Gly Asp
705 710 715 720
Trp Glu Leu Arg Pro Gly Ser Ser Val Thr Ser Glu Tyr Gln Leu Cys
725 730 735
Gly Ile Cys Tyr Leu Asn Ser Tyr Asp Tyr Val Phe Lys Ser Gly Val
740 745 750
Asn Cys Gly Tyr Arg Asp Ile Gln His Val Tyr Glu Pro Arg Asn Asp
755 760 765
Val Leu Phe Pro Leu Gly Asn Arg Phe Ala Val Pro Phe Val Asp Cys
770 775 780
His Cys Ile Val Ala Ser Leu Ala Glu Thr Glu Val Lys Gly Lys Asp
785 790 795 800
Gln Ala
<210> 38
<211> 591
<212> PRT
<213> Brassica rapa
<400> 38
Met Leu Asn Pro Gly Val Val Gly Gly Ser Ser Asn Ser Asp Pro Phe
1 5 10 15
Pro Ser Gly Leu Arg Val Leu Val Val Asp Asp Asp Pro Thr Cys Leu
20 25 30
Met Ile Leu Glu Arg Met Leu Lys Thr Cys Leu Tyr Arg Val Thr Lys
35 40 45
Cys Asn Arg Ala Glu Ile Ala Leu Ser Leu Leu Arg Lys Asn Lys Asn
50 55 60
Gly Phe Asp Ile Val Ile Ser Asp Val His Met Pro Asp Met Asn Gly
65 70 75 80
Phe Lys Leu Leu Glu His Val Gly Leu Glu Met Asp Leu Pro Val Ile
85 90 95
Met Met Ser Ala Asp Asp Ser Lys Ser Val Val Leu Lys Gly Val Thr
100 105 110
His Gly Ala Val Asp Tyr Leu Ile Lys Pro Val Arg Ile Glu Ala Leu
115 120 125
Lys Asn Ile Trp Gln His Val Val Arg Lys Lys Gln Asn Val Ser Glu
130 135 140
His Ser Gly Ser Val Glu Glu Thr Gly Gly Asp Arg Gln Gln Gln Gln
145 150 155 160
Arg Gly Asp Asp Asp Asp Asp Gly Asn Asn Ser Ser Ser Gly Asn Asn
165 170 175
Glu Gly Asn Leu Arg Lys Arg Lys Glu Glu Glu Gln Gly Asp Asp Lys
180 185 190
Glu Asp Thr Ser Ser Leu Lys Lys Pro Arg Val Val Trp Ser Val Glu
195 200 205
Leu His Gln Gln Phe Val Ala Ala Val Asn His Leu Gly Val Asp Lys
210 215 220
Ala Val Pro Lys Lys Ile Leu Glu Met Met Asn Val Gln Gly Leu Thr
225 230 235 240
Arg Glu Asn Val Ala Ser His Leu Gln Lys Tyr Arg Ile Tyr Leu Lys
245 250 255
Arg Leu Gly Gly Val Ser Gln Gly Asn Met Asn His Ser Phe Leu Thr
260 265 270
Gly Gln Asp Pro Ser Tyr Gly Pro Leu Asn Gly Phe Asp Leu Gln Gly
275 280 285
Leu Ala Thr Ala Gly Gln Leu Gln Ala Gln Ser Leu Ala Gln Leu Gln
290 295 300
Ala Val Gly Leu Gly Gln Ser Ser Ser Pro Leu Ile Lys Pro Gly Ile
305 310 315 320
Thr Ser Val Asp Gln Arg Ser Phe Phe Thr Phe Gln Asn Ser Lys Ser
325 330 335
Arg Phe Gly Asp Gly His Gly Pro Met Met Met Asn Gly Gly Gly Gly
340 345 350
Asn Lys Gln Thr Ser Leu Leu His Gly Val Pro Thr Gly His Met Arg
355 360 365
Leu Gln Gln Gln Gln Met Ala Gly Met Arg Val Ala Gly Pro Ser Met
370 375 380
Gln Gln Gln Gln Gln Gln Ser Met Leu Ser Arg Arg Ser Val Pro Glu
385 390 395 400
Thr Arg Ser Ser Arg Val Leu Pro Ala Ala Thr His Ser Ala Leu Asn
405 410 415
Asn Ser Phe Pro Leu Ala Ser Ala Pro Gly Met Met Ser Val Ser Asp
420 425 430
Thr Lys Gly Val Asn Glu Phe Cys Asn Pro Ser Tyr Asp Ile Leu Asn
435 440 445
Asn Phe Pro Gln Gln Gln His His Asn Asn Asn Asn Asn Arg Val Asn
450 455 460
Glu Trp Asp Leu Arg Asn Val Gly Met Val Phe Asn Ser His Gln Asp
465 470 475 480
Asn Thr Thr Ser Ala Ala Phe Ser Thr Ser Glu Ala Tyr Ser Ser Ser
485 490 495
Ser Thr His Lys Arg Lys Arg Glu Ala Glu Leu Val Val Glu His Gly
500 505 510
Gln Asn Gln Gln Gln Pro Gln Ser Arg Ser Val Lys Pro Met Asn Gln
515 520 525
Thr Tyr Met Asp Gly Gly Gly Ser Val Arg Met Lys Thr Glu Thr Val
530 535 540
Thr Cys Pro Pro Gln Ala Thr Thr Met Phe His Glu Gln Tyr Ser Asn
545 550 555 560
Gln Asp Asp Leu Leu Ser Asp Leu Leu Lys Gln Glu Gly Leu Leu Asp
565 570 575
Thr Glu Phe Asp Phe Glu Gly Tyr Ser Phe Asp Ser Ile Leu Val
580 585 590
<210> 39
<211> 691
<212> PRT
<213> Oryza sativa
<400> 39
Met Ala Pro Val Glu Asp Gly Gly Gly Val Glu Phe Pro Val Gly Met
1 5 10 15
Lys Val Leu Val Val Asp Asp Asp Pro Thr Cys Leu Ala Val Leu Lys
20 25 30
Arg Met Leu Leu Glu Cys Arg Tyr Asp Ala Thr Thr Cys Ser Gln Ala
35 40 45
Thr Arg Ala Leu Thr Met Leu Arg Glu Asn Arg Arg Gly Phe Asp Val
50 55 60
Ile Ile Ser Asp Val His Met Pro Asp Met Asp Gly Phe Arg Leu Leu
65 70 75 80
Glu Leu Val Gly Leu Glu Met Asp Leu Pro Val Ile Met Met Ser Ala
85 90 95
Asp Ser Arg Thr Asp Ile Val Met Lys Gly Ile Lys His Gly Ala Cys
100 105 110
Asp Tyr Leu Ile Lys Pro Val Arg Met Glu Glu Leu Lys Asn Ile Trp
115 120 125
Gln His Val Ile Arg Lys Lys Phe Asn Glu Asn Lys Glu His Glu His
130 135 140
Ser Gly Ser Leu Asp Asp Thr Asp Arg Thr Arg Pro Thr Asn Asn Asp
145 150 155 160
Asn Glu Tyr Ala Ser Ser Ala Asn Asp Gly Ala Glu Gly Ser Trp Lys
165 170 175
Ser Gln Lys Lys Lys Arg Asp Lys Asp Asp Asp Asp Gly Glu Leu Glu
180 185 190
Ser Gly Asp Pro Ser Ser Thr Ser Lys Lys Pro Arg Val Val Trp Ser
195 200 205
Val Glu Leu His Gln Gln Phe Val Asn Ala Val Asn His Leu Gly Ile
210 215 220
Asp Lys Ala Val Pro Lys Lys Ile Leu Glu Leu Met Asn Val Pro Gly
225 230 235 240
Leu Thr Arg Glu Asn Val Ala Ser His Leu Gln Lys Phe Arg Leu Tyr
245 250 255
Leu Lys Arg Ile Ala Gln His His Ala Gly Ile Ala Asn Pro Phe Cys
260 265 270
Pro Pro Ala Ser Ser Gly Lys Val Gly Ser Leu Gly Gly Leu Asp Phe
275 280 285
Gln Ala Leu Ala Ala Ser Gly Gln Ile Pro Pro Gln Ala Leu Ala Ala
290 295 300
Leu Gln Asp Glu Leu Leu Gly Arg Pro Thr Asn Ser Leu Val Leu Pro
305 310 315 320
Gly Arg Asp Gln Ser Ser Leu Arg Leu Ala Ala Val Lys Gly Asn Lys
325 330 335
Pro His Gly Glu Arg Glu Ile Ala Phe Gly Gln Pro Ile Tyr Lys Cys
340 345 350
Gln Asn Asn Ala Tyr Gly Ala Phe Pro Gln Ser Ser Pro Ala Val Gly
355 360 365
Gly Met Pro Ser Phe Ser Ala Trp Pro Asn Asn Lys Leu Gly Met Ala
370 375 380
Asp Ser Thr Gly Thr Leu Gly Gly Met Ser Asn Ser Gln Asn Ser Asn
385 390 395 400
Ile Val Leu His Glu Leu Gln Gln Gln Pro Asp Ala Met Leu Ser Gly
405 410 415
Thr Leu His Ser Leu Asp Val Lys Pro Ser Gly Ile Val Met Pro Ser
420 425 430
Gln Ser Leu Asn Thr Phe Ser Ala Ser Glu Gly Leu Ser Pro Asn Gln
435 440 445
Asn Thr Leu Met Ile Pro Ala Gln Ser Ser Gly Phe Leu Ala Ala Met
450 455 460
Pro Pro Ser Met Lys His Glu Pro Val Leu Ala Thr Ser Gln Pro Ser
465 470 475 480
Ser Ser Leu Leu Gly Gly Ile Asp Leu Val Asn Gln Ala Ser Thr Ser
485 490 495
Gln Pro Leu Ile Ser Ala His Gly Gly Gly Asn Leu Ser Gly Leu Val
500 505 510
Asn Arg Asn Pro Asn Val Val Pro Ser Gln Gly Ile Ser Thr Phe His
515 520 525
Thr Pro Asn Asn Pro Tyr Leu Val Ser Pro Asn Ser Met Gly Met Gly
530 535 540
Ser Lys Gln Pro Pro Gly Val Leu Lys Thr Glu Asn Ser Asp Ala Leu
545 550 555 560
Asn His Ser Tyr Gly Tyr Leu Gly Gly Ser Asn Pro Pro Met Asp Ser
565 570 575
Gly Leu Leu Ser Ser Gln Ser Lys Asn Thr Gln Phe Gly Leu Leu Gly
580 585 590
Gln Asp Asp Ile Thr Gly Ser Trp Ser Pro Leu Pro Asn Val Asp Ser
595 600 605
Tyr Gly Asn Thr Val Gly Leu Ser His Pro Gly Ser Ser Ser Ser Ser
610 615 620
Phe Gln Ser Ser Asn Val Ala Leu Gly Lys Leu Pro Asp Gln Gly Arg
625 630 635 640
Gly Lys Asn His Gly Phe Val Gly Lys Gly Thr Cys Ile Pro Ser Arg
645 650 655
Phe Ala Val Asp Glu Ile Glu Ser Pro Thr Asn Asn Leu Ser His Ser
660 665 670
Ile Gly Ser Ser Gly Asp Ile Met Ser Pro Asp Ile Phe Gly Phe Ser
675 680 685
Gly Gln Met
690
<210> 40
<211> 428
<212> PRT
<213> Oocystis sp.
<400> 40
Met Ala Leu Lys Arg Val Pro Ser Phe Ser Gly Arg Pro Asn Phe Pro
1 5 10 15
Ala Gly Leu Gln Ile Leu Val Val Asp Ser Asp Ser Ser Ser Arg Glu
20 25 30
Ala Val Glu Met Gln Leu Lys Ser His Ser Tyr Leu Ala Thr Cys Cys
35 40 45
Cys Thr Cys Gly Glu Ala Val Glu Gln Leu Gly Thr Ser Lys Tyr Asp
50 55 60
Ile Val Leu Ala Glu Ser Lys Leu Val Ala Ala Glu Cys Val Asp Ser
65 70 75 80
Thr Arg Leu Cys Glu Ala Ala Arg Ala Leu Pro Leu Val Leu Met Cys
85 90 95
Glu Asp Ser Thr Ala Asp Asp Val Leu Lys Gly Ile Arg Leu Gly Ala
100 105 110
Cys Asp Phe Leu Glu Lys Pro Leu Ser Pro Leu Lys Leu Lys Asn Ile
115 120 125
Trp Gln His Val Val Arg Lys Met Met Glu Gln Met His Val Arg Arg
130 135 140
Thr Asp Asp Ala Asp Thr Cys Thr Thr Lys Ser Ser Arg Asp Gln Ser
145 150 155 160
Cys Ala Ile Lys Gly Lys Ser Val Ala Ser Thr Pro Ser Cys Pro Lys
165 170 175
Thr Pro Ser Pro Ala Ala Ser Gly Ala Asp Ile Gly Cys Ser Ile Ala
180 185 190
Thr Ser Val Ser Lys Ala Gly Asp Val Val Gly Glu Ser Ser Ser Ser
195 200 205
Glu Thr Arg Lys Glu His Cys Ser Glu Thr Thr Glu Cys Ser Asp Leu
210 215 220
Lys Ser Cys Ala Ala Lys Ser Ala Val Ser Ala Gln Thr Pro Val Ser
225 230 235 240
Thr Ala Thr Val Ala Ala Thr Trp Gly Ala Ser Lys Lys Lys Ser Thr
245 250 255
Ala Ser Ala Thr Thr Ser Ser Val Ser Asn Arg Pro Pro Leu Ala Ile
260 265 270
Lys Met Pro Ala Pro Ala Val Ala Tyr Thr Ser Gly Leu Ala Pro Phe
275 280 285
Pro Pro Pro Met Phe Val Pro Gly Gly Trp Gly Gln Ser Ser Asn Pro
290 295 300
Cys Val Val Gly Thr Pro Met Met Pro Pro Pro Pro Gly Met Gly Met
305 310 315 320
Pro Pro His His His Ala Pro Tyr Gly Gln Val Pro Pro Pro Gly Tyr
325 330 335
Pro Val Ala Cys Met Pro Ser Ala Phe Val Pro Thr Pro Met Gly Pro
340 345 350
Gly Gly Val Ala Phe Ala Pro Pro Pro Gly Ala Ser Cys Thr Ser Ala
355 360 365
Ala Tyr Tyr Pro His Pro Ala Val Asp Ala Ser Ala Ser Ala Thr Ala
370 375 380
Thr Phe Thr Gly His Val Gln Ile Asp Leu Thr Asn Val Ser Ala Glu
385 390 395 400
Glu Pro Ala Pro Ile Gly Leu Ala Leu Arg Lys Thr Ala Ser Leu Leu
405 410 415
Asp Leu Val Ser Asp Arg Leu Gly Gln Arg Ala Cys
420 425
<210> 41
<211> 341
<212> PRT
<213> Tetraselmis sp.
<400> 41
Met Leu Cys Pro Ala Val Gln Val Ala Thr Met Ala Thr Val Leu Ala
1 5 10 15
Ser Thr His Phe Ser Glu Arg Pro Ser Phe Pro Ala Asp Leu Glu Val
20 25 30
Leu Leu Leu Asp Ser Ala Thr Gln Gly Ala Glu Thr Ala Ser Lys Leu
35 40 45
Leu Leu Ser Cys Ser Tyr Arg Val Thr Val Cys Arg Ser Val Ser Glu
50 55 60
Ala Leu Ser His Met Ala Cys Lys Ala Phe Asp Val Val Leu Val Glu
65 70 75 80
Gln Lys Leu Phe Ser Gly Arg Asp Ala Ala Ala Ala Gln Leu Lys Ala
85 90 95
Leu Ala Gly Val Ile Pro Thr Val Val Leu Ser Asp Ser Gly Ser Ala
100 105 110
Lys Asp Thr Trp Ala Ala Ile Val Gly Gln Ala Ala Asp Val Leu Ile
115 120 125
Arg Pro Leu Thr Lys Gln Lys Leu Gln Thr Leu Trp Gln His Thr Val
130 135 140
Arg Met Gln Arg Ala Ala Ser Ser Ala Ser Ala Ala Thr Ser Met Val
145 150 155 160
Ala Lys Pro Val Ala Val Leu Ser Ser Ala Leu Lys Pro Ala Ala Ser
165 170 175
Ser Ala Ser Leu Asp Lys Gly Gln Lys Arg Lys Leu Lys Asp His Met
180 185 190
Met Gly Pro Ile Met Ala His Pro Gln Val Ser Asn Pro Gly Phe Ile
195 200 205
Trp Gly Ala Pro Val Met Gly Val Pro Ala Gly Gln Gln Ala Pro Gln
210 215 220
Lys Ser Glu Ala Pro Val Thr Pro Gln Lys Pro Gly Ser Glu Met His
225 230 235 240
Pro Glu Leu Asp Ala Thr Ser His Ile Ala Met Gly Ser Ser Asp Asn
245 250 255
Phe Asn Val Pro Val Tyr Glu Ser Gly Thr Asp Ser Gln Glu Ser Gln
260 265 270
Pro Thr Cys Asp Pro Thr Ser Leu Asp Asp Ile Asn Glu Asp Asp Tyr
275 280 285
Ala Phe Ile Asp Phe Ala Leu Ser Asp Ser Phe Pro Thr Val Glu Glu
290 295 300
Asp Glu Ile Leu Pro Pro Ile Gly Leu Ser Leu Lys Lys Ser Ser Ser
305 310 315 320
Leu Leu Asn Met Leu Asn Gly Val Leu Leu Ser Ala His Ser Val Pro
325 330 335
Leu Gln Leu Pro Gln
340
<210> 42
<211> 558
<212> PRT
<213> Arabidopsis thaliana
<400> 42
Met Ser Ser Ser Glu Glu Val Val Glu Val Thr Val Val Lys Ala Pro
1 5 10 15
Glu Ala Gly Gly Gly Lys Leu Ser Arg Arg Lys Ile Arg Lys Lys Asp
20 25 30
Ala Gly Val Asp Gly Leu Val Lys Trp Glu Arg Phe Leu Pro Lys Ile
35 40 45
Ala Leu Arg Val Leu Leu Val Glu Ala Asp Asp Ser Thr Arg Gln Ile
50 55 60
Ile Ala Ala Leu Leu Arg Lys Cys Ser Tyr Arg Val Ala Ala Val Pro
65 70 75 80
Asp Gly Leu Lys Ala Trp Glu Met Leu Lys Gly Lys Pro Glu Ser Val
85 90 95
Asp Leu Ile Leu Thr Glu Val Asp Leu Pro Ser Ile Ser Gly Tyr Ala
100 105 110
Leu Leu Thr Leu Ile Met Glu His Asp Ile Cys Lys Asn Ile Pro Val
115 120 125
Ile Met Met Ser Thr Gln Asp Ser Val Asn Thr Val Tyr Lys Cys Met
130 135 140
Leu Lys Gly Ala Ala Asp Tyr Leu Val Lys Pro Leu Arg Arg Asn Glu
145 150 155 160
Leu Arg Asn Leu Trp Gln His Val Trp Arg Arg Gln Thr Ser Leu Ala
165 170 175
Pro Asp Ser Phe Pro Trp Asn Glu Ser Val Gly Gln Gln Lys Ala Glu
180 185 190
Gly Ala Ser Ala Asn Asn Ser Asn Gly Lys Arg Asp Asp His Val Val
195 200 205
Ile Gly Asn Gly Gly Asp Ala Gln Ser Ser Cys Thr Arg Pro Glu Met
210 215 220
Glu Gly Glu Ser Ala Asp Val Glu Val Ser Ala Arg Asp Ala Val Gln
225 230 235 240
Met Glu Cys Ala Lys Ser Gln Phe Asn Glu Thr Gln Leu Leu Ala Asn
245 250 255
Glu Leu Gln Ser Lys Gln Ala Glu Ala Ile Asp Phe Met Gly Ala Ser
260 265 270
Phe Arg Arg Thr Gly Arg Arg Asn Arg Glu Glu Ser Val Ala Gln Tyr
275 280 285
Glu Ser Arg Ile Glu Leu Asp Leu Ser Leu Arg Arg Pro Asn Ala Ser
290 295 300
Glu Asn Gln Ser Ser Gly Asp Arg Pro Ser Leu His Pro Ser Ser Ala
305 310 315 320
Ser Ala Phe Thr Arg Tyr Val His Arg Pro Leu Gln Thr Gln Cys Ser
325 330 335
Ala Ser Pro Val Val Pro Asp Gln Arg Lys Asn Val Ala Ala Ser Gln
340 345 350
Asp Asp Asn Ile Val Leu Met Asn Gln Tyr Asn Thr Ser Glu Pro Pro
355 360 365
Pro Asn Ala Pro Arg Arg Asn Asp Thr Ser Phe Tyr Thr Gly Thr Asp
370 375 380
Ser Pro Gly Pro Pro Phe Ser Asn Gln Met Asn Ser Trp Pro Gly Gln
385 390 395 400
Gly Ser Tyr Pro Thr Pro Thr Pro Ile Asn Asn Ile Gln Phe Arg Gly
405 410 415
Pro Asn Thr Ala Tyr Thr Ser Ala Met Ala Pro Ala Ser Leu Ser Pro
420 425 430
Ser Pro Ser Ser Val Ser Pro His Glu Tyr Ser Ser Met Phe His Pro
435 440 445
Phe Asn Ser Lys Pro Glu Gly Leu Gln Asp Arg Asp Cys Ser Met Asp
450 455 460
Val Asp Asp Arg Arg Tyr Val Ser Ser Ala Thr Glu His Ser Ala Ile
465 470 475 480
Gly Asn His Ile Asp Gln Leu Ile Glu Lys Lys Asn Glu Asp Gly Tyr
485 490 495
Ser Ser Ser Val Gly Lys Ile Gln Gln Ser Leu Gln Arg Glu Ala Ala
500 505 510
Leu Thr Lys Phe Arg Met Lys Arg Lys Asp Arg Cys Phe Glu Lys Lys
515 520 525
Val Arg Tyr Glu Ser Arg Lys Lys Leu Ala Glu Gln Arg Pro Arg Ile
530 535 540
Lys Gly Gln Phe Val Arg Gln Val Gln Ser Thr Gln Ala Pro
545 550 555
<210> 43
<211> 186
<212> PRT
<213> Arabidopsis thaliana
<400> 43
Met Ala Glu Val Met Leu Pro Arg Lys Met Glu Ile Leu Asn His Ser
1 5 10 15
Ser Lys Phe Gly Ser Pro Asp Pro Leu His Val Leu Ala Val Asp Asp
20 25 30
Ser His Val Asp Arg Lys Phe Ile Glu Arg Leu Leu Arg Val Ser Ser
35 40 45
Cys Lys Val Thr Val Val Asp Ser Ala Thr Arg Ala Leu Gln Tyr Leu
50 55 60
Gly Leu Asp Val Glu Glu Lys Ser Val Gly Phe Glu Asp Leu Lys Val
65 70 75 80
Asn Leu Ile Met Thr Asp Tyr Ser Met Pro Gly Met Thr Gly Tyr Glu
85 90 95
Leu Leu Lys Lys Ile Lys Glu Ser Ser Ala Phe Arg Glu Val Pro Val
100 105 110
Val Ile Met Ser Ser Glu Asn Ile Leu Pro Arg Ile Asp Arg Cys Leu
115 120 125
Glu Glu Gly Ala Glu Asp Phe Leu Leu Lys Pro Val Lys Leu Ser Asp
130 135 140
Val Lys Arg Leu Arg Asp Ser Leu Met Lys Val Glu Asp Leu Ser Phe
145 150 155 160
Thr Lys Ser Ile Gln Lys Arg Glu Leu Glu Thr Glu Asn Val Tyr Pro
165 170 175
Val His Ser Gln Leu Lys Arg Ala Lys Ile
180 185
<210> 44
<211> 727
<212> PRT
<213> Arabidopsis thaliana
<400> 44
Met Asn Ala Asn Glu Glu Gly Glu Gly Ser Arg Tyr Pro Ile Thr Asp
1 5 10 15
Arg Lys Thr Gly Glu Thr Lys Phe Asp Arg Val Glu Ser Arg Thr Glu
20 25 30
Lys His Ser Glu Glu Glu Lys Thr Asn Gly Ile Thr Met Asp Val Arg
35 40 45
Asn Gly Ser Ser Gly Gly Leu Gln Ile Pro Leu Ser Gln Gln Thr Ala
50 55 60
Ala Thr Val Cys Trp Glu Arg Phe Leu His Val Arg Thr Ile Arg Val
65 70 75 80
Leu Leu Val Glu Asn Asp Asp Cys Thr Arg Tyr Ile Val Thr Ala Leu
85 90 95
Leu Arg Asn Cys Ser Tyr Glu Val Val Glu Ala Ser Asn Gly Ile Gln
100 105 110
Ala Trp Lys Val Leu Glu Asp Leu Asn Asn His Ile Asp Ile Val Leu
115 120 125
Thr Glu Val Ile Met Pro Tyr Leu Ser Gly Ile Gly Leu Leu Cys Lys
130 135 140
Ile Leu Asn His Lys Ser Arg Arg Asn Ile Pro Val Ile Met Met Ser
145 150 155 160
Ser His Asp Ser Met Gly Leu Val Phe Lys Cys Leu Ser Lys Gly Ala
165 170 175
Val Asp Phe Leu Val Lys Pro Ile Arg Lys Asn Glu Leu Lys Ile Leu
180 185 190
Trp Gln His Val Trp Arg Arg Cys Gln Ser Ser Ser Gly Ser Gly Ser
195 200 205
Glu Ser Gly Thr His Gln Thr Gln Lys Ser Val Lys Ser Lys Ser Ile
210 215 220
Lys Lys Ser Asp Gln Asp Ser Gly Ser Ser Asp Glu Asn Glu Asn Gly
225 230 235 240
Ser Ile Gly Leu Asn Ala Ser Asp Gly Ser Ser Asp Gly Ser Gly Ala
245 250 255
Gln Ser Ser Trp Thr Lys Lys Ala Val Asp Val Asp Asp Ser Pro Arg
260 265 270
Ala Val Ser Leu Trp Asp Arg Val Asp Ser Thr Cys Ala Gln Val Val
275 280 285
His Ser Asn Pro Glu Phe Pro Ser Asn Gln Leu Val Ala Pro Pro Ala
290 295 300
Glu Lys Glu Thr Gln Glu His Asp Asp Lys Phe Glu Asp Val Thr Met
305 310 315 320
Gly Arg Asp Leu Glu Ile Ser Ile Arg Arg Asn Cys Asp Leu Ala Leu
325 330 335
Glu Pro Lys Asp Glu Pro Leu Ser Lys Thr Thr Gly Ile Met Arg Gln
340 345 350
Asp Asn Ser Phe Glu Lys Ser Ser Ser Lys Trp Lys Met Lys Val Gly
355 360 365
Lys Gly Pro Leu Asp Leu Ser Ser Glu Ser Pro Ser Ser Lys Gln Met
370 375 380
His Glu Asp Gly Gly Ser Ser Phe Lys Ala Met Ser Ser His Leu Gln
385 390 395 400
Asp Asn Arg Glu Pro Glu Ala Pro Asn Thr His Leu Lys Thr Leu Asp
405 410 415
Thr Asn Glu Ala Ser Val Lys Ile Ser Glu Glu Leu Met His Val Glu
420 425 430
His Ser Ser Lys Arg His Arg Gly Thr Lys Asp Asp Gly Thr Leu Val
435 440 445
Arg Asp Asp Arg Asn Val Leu Arg Arg Ser Glu Gly Ser Ala Phe Ser
450 455 460
Arg Tyr Asn Pro Ala Ser Asn Ala Asn Lys Ile Ser Gly Gly Asn Leu
465 470 475 480
Gly Ser Thr Ser Leu Gln Asp Asn Asn Ser Gln Asp Leu Ile Lys Lys
485 490 495
Thr Glu Ala Ala Tyr Asp Cys His Ser Asn Met Asn Glu Ser Leu Pro
500 505 510
His Asn His Arg Ser His Val Gly Ser Asn Asn Phe Asp Met Ser Ser
515 520 525
Thr Thr Glu Asn Asn Ala Phe Thr Lys Pro Gly Ala Pro Lys Val Ser
530 535 540
Ser Ala Gly Ser Ser Ser Val Lys His Ser Ser Phe Gln Pro Leu Pro
545 550 555 560
Cys Asp His His Asn Asn His Ala Ser Tyr Asn Leu Val His Val Ala
565 570 575
Glu Arg Lys Lys Leu Pro Pro Gln Cys Gly Ser Ser Asn Val Tyr Asn
580 585 590
Glu Thr Ile Glu Gly Asn Asn Asn Thr Val Asn Tyr Ser Val Asn Gly
595 600 605
Ser Val Ser Gly Ser Gly His Gly Ser Asn Gly Pro Tyr Gly Ser Ser
610 615 620
Asn Gly Met Asn Ala Gly Gly Met Asn Met Gly Ser Asp Asn Gly Ala
625 630 635 640
Gly Lys Asn Gly Asn Gly Asp Gly Ser Gly Ser Gly Ser Gly Ser Gly
645 650 655
Ser Gly Asn Leu Ala Asp Glu Asn Lys Ile Ser Gln Arg Glu Ala Ala
660 665 670
Leu Thr Lys Phe Arg Gln Lys Arg Lys Glu Arg Cys Phe Arg Lys Lys
675 680 685
Val Arg Tyr Gln Ser Arg Lys Lys Leu Ala Glu Gln Arg Pro Arg Val
690 695 700
Arg Gly Gln Phe Val Arg Lys Thr Ala Ala Ala Thr Asp Asp Asn Asp
705 710 715 720
Ile Lys Asn Ile Glu Asp Ser
725
<210> 45
<211> 444
<212> PRT
<213> Arabidopsis thaliana
<400> 45
Met Gly Glu Ile Val Val Leu Ser Ser Asp Asp Gly Met Glu Thr Ile
1 5 10 15
Lys Asn Arg Val Lys Ser Ser Glu Val Val Gln Trp Glu Lys Tyr Leu
20 25 30
Pro Lys Thr Val Leu Arg Val Leu Leu Val Glu Ser Asp Tyr Ser Thr
35 40 45
Arg Gln Ile Ile Thr Ala Leu Leu Arg Lys Cys Cys Tyr Lys Val Val
50 55 60
Ala Val Ser Asp Gly Leu Ala Ala Trp Glu Val Leu Lys Glu Lys Ser
65 70 75 80
His Asn Ile Asp Leu Ile Leu Thr Glu Leu Asp Leu Pro Ser Ile Ser
85 90 95
Gly Phe Ala Leu Leu Ala Leu Val Met Glu His Glu Ala Cys Lys Asn
100 105 110
Ile Pro Val Ile Met Met Ser Ser Gln Asp Ser Ile Lys Met Val Leu
115 120 125
Lys Cys Met Leu Arg Gly Ala Ala Asp Tyr Leu Ile Lys Pro Met Arg
130 135 140
Lys Asn Glu Leu Lys Asn Leu Trp Gln His Val Trp Arg Arg Leu Thr
145 150 155 160
Leu Arg Asp Asp Pro Thr Ala His Ala Gln Ser Leu Pro Ala Ser Gln
165 170 175
His Asn Leu Glu Asp Thr Asp Glu Thr Cys Glu Asp Ser Arg Tyr His
180 185 190
Ser Asp Gln Gly Ser Gly Ala Gln Ala Ile Asn Tyr Asn Gly His Asn
195 200 205
Lys Leu Met Glu Asn Gly Lys Ser Val Asp Glu Arg Asp Glu Phe Lys
210 215 220
Glu Thr Phe Asp Val Thr Met Asp Leu Ile Gly Gly Ile Asp Lys Arg
225 230 235 240
Pro Asp Ser Ile Tyr Lys Asp Lys Ser Arg Asp Glu Cys Val Gly Pro
245 250 255
Glu Leu Gly Leu Ser Leu Lys Arg Ser Cys Ser Val Ser Phe Glu Asn
260 265 270
Gln Asp Glu Ser Lys His Gln Lys Leu Ser Leu Ser Asp Ala Ser Ala
275 280 285
Phe Ser Arg Phe Glu Glu Ser Lys Ser Ala Glu Lys Ala Val Val Ala
290 295 300
Leu Glu Glu Ser Thr Ser Gly Glu Pro Lys Thr Pro Thr Glu Ser His
305 310 315 320
Glu Lys Leu Arg Lys Val Thr Ser Asp Gln Gly Ser Ala Thr Thr Ser
325 330 335
Ser Asn Gln Glu Asn Ile Gly Ser Ser Ser Val Ser Phe Arg Asn Gln
340 345 350
Val Leu Gln Ser Thr Val Thr Asn Gln Lys Gln Asp Ser Pro Ile Pro
355 360 365
Val Glu Ser Asn Arg Glu Lys Ala Ala Ser Lys Glu Val Glu Ala Gly
370 375 380
Ser Gln Ser Thr Asn Glu Gly Ile Ala Gly Gln Ser Ser Ser Thr Glu
385 390 395 400
Lys Pro Lys Glu Glu Glu Ser Ala Lys Gln Arg Trp Ser Arg Ser Gln
405 410 415
Arg Glu Ala Ala Leu Met Lys Phe Arg Leu Lys Arg Lys Asp Arg Cys
420 425 430
Phe Asp Lys Lys Val Arg Asp Thr Gln Ala Ser Ser
435 440
<210> 46
<211> 204
<212> PRT
<213> Arabidopsis thaliana
<400> 46
Met Ala Leu Arg Asp Leu Ser Ser Ser Ser Ser Ser Pro Glu Leu His
1 5 10 15
Val Leu Ala Val Asp Asp Ser Phe Val Asp Arg Lys Val Leu Glu Arg
20 25 30
Leu Leu Lys Ile Ser Ala Cys Lys Val Thr Thr Val Glu Ser Gly Thr
35 40 45
Arg Ala Leu Gln Tyr Leu Gly Leu Asp Gly Asp Asn Gly Ser Ser Gly
50 55 60
Leu Lys Asp Leu Lys Val Asn Leu Ile Val Thr Asp Tyr Ser Met Pro
65 70 75 80
Gly Leu Thr Gly Tyr Glu Leu Leu Lys Lys Ile Lys Glu Ser Ser Ala
85 90 95
Leu Arg Glu Ile Pro Val Val Ile Met Ser Ser Glu Asn Ile Gln Pro
100 105 110
Arg Ile Glu Gln Cys Met Ile Glu Gly Ala Glu Glu Phe Leu Leu Lys
115 120 125
Pro Val Lys Leu Ala Asp Val Lys Arg Leu Lys Glu Leu Ile Met Arg
130 135 140
Gly Gly Glu Ala Glu Glu Gly Lys Thr Lys Lys Leu Ser Pro Lys Arg
145 150 155 160
Ile Leu Gln Asn Asp Ile Asp Ser Ser Pro Ser Ser Ser Ser Ser Thr
165 170 175
Ser Ser Ser Ser Ser Ser His Asp Val Ser Ser Leu Asp Asp Asp Thr
180 185 190
Pro Ser Ser Lys Arg Ile Lys Leu Glu Ser Arg Gly
195 200
<210> 47
<211> 691
<212> PRT
<213> Glycine max
<400> 47
Met Gly Glu Val Val Ile Met Ser Gly Glu Lys Lys Ser Val Arg Val
1 5 10 15
Glu Gly Val Glu Lys Glu Asp Ser Gly Gly Ser Gly Ser Lys Ala Gly
20 25 30
Glu Phe Lys Gly Leu Met Arg Trp Glu Lys Phe Leu Pro Lys Met Val
35 40 45
Leu Arg Val Leu Leu Val Glu Ala Asp Asp Ser Thr Arg Gln Ile Ile
50 55 60
Ala Ala Leu Leu Arg Lys Cys Ser Tyr Lys Val Val Ala Val Pro Asp
65 70 75 80
Gly Leu Lys Ala Trp Glu Leu Leu Lys Gly Arg Pro His Asn Val Asp
85 90 95
Leu Ile Leu Thr Glu Val Asp Leu Pro Ser Ile Ser Gly Tyr Ala Leu
100 105 110
Leu Thr Leu Ile Met Glu His Glu Ile Cys Lys Asn Ile Pro Val Ile
115 120 125
Met Met Ser Ser Gln Asp Ser Ile Ser Thr Val Tyr Lys Cys Met Leu
130 135 140
Arg Gly Ala Ala Asp Tyr Leu Val Lys Pro Ile Arg Lys Asn Glu Leu
145 150 155 160
Arg Asn Leu Trp Gln His Val Trp Arg Arg Gln Ser Ser Thr Thr Gly
165 170 175
Ile Asn Gly Leu Gln Asp Glu Ser Val Ala Gln Gln Lys Val Glu Ala
180 185 190
Thr Ala Glu Asn Asn Ala Ala Ser Asn Arg Ser Ser Gly Asp Ala Ala
195 200 205
Cys Ile Gln Arg Asn Ile Glu Leu Ile Glu Lys Gly Ser Asp Ala Gln
210 215 220
Ser Ser Cys Thr Lys Pro Asp Cys Glu Ala Glu Ser Asp Pro Val Gly
225 230 235 240
Asn Met Gln Glu Phe Ser Leu Leu Lys Cys Gly Glu Ala Tyr Pro Ser
245 250 255
Gly Thr Glu Thr Gln Gln Val Glu Thr Ser Phe Arg Leu Gly Gln Thr
260 265 270
Leu Met Met His Asp Cys His Ala Gly Gly Leu Asn Val Ser Ile Arg
275 280 285
Lys Asn Gly Glu Ala Ser Thr Thr Asn Asp Lys Asp Thr Asp Thr Glu
290 295 300
His Phe Gly Asn Ala Ser Ile Ser Gly Glu Ala His Asp Asn Pro Tyr
305 310 315 320
Val Gln Ile Asn Ser Ser Lys Glu Ala Met Asp Leu Ile Gly Ala Phe
325 330 335
His Thr His Pro Asn Cys Ser Leu Lys Asn Ser Thr Val Asn Cys Thr
340 345 350
Gly Asn Phe Asp His Ser Pro Gln Leu Asp Leu Ser Leu Arg Arg Ser
355 360 365
Cys Pro Gly Ser Phe Glu Asn Lys Leu Thr Glu Glu Arg His Thr Leu
370 375 380
Met His Ser Asn Ala Ser Ala Phe Lys Arg Tyr Thr Thr Arg Gln Leu
385 390 395 400
Gln Ile Ser Met Pro Ala Val Leu Ile Asn Phe Ser Asp Gln Gln Arg
405 410 415
Glu Gln Ile Thr Asn Cys Glu Lys Asn Ile Ser His Ile Ala Thr Gly
420 425 430
Ser Asn Ser Asp Ser Ser Thr Pro Met Gln Arg Cys Ile Val Ser Pro
435 440 445
Thr Thr Val Gln Ser Lys Glu Ser Glu Leu Ala Thr Ser His Pro Pro
450 455 460
Gln Gly His Ser Leu Pro Ile Pro Val Lys Gly Val Arg Phe Asn Asp
465 470 475 480
Leu Cys Thr Ala Tyr Gly Ser Val Leu Pro Ser Val Phe His Thr Gln
485 490 495
Ser Gly Pro Pro Ala Met Pro Ser Pro Asn Ser Val Val Leu Leu Glu
500 505 510
Pro Asn Phe Gln Val Asn Ala Phe Tyr Gln Ser Asn Met Lys Glu Ser
515 520 525
Ser Ser Glu Gln Leu Tyr Glu Ser Arg Gly Pro Asn Gly Asn Thr Thr
530 535 540
Gln Asn His Ile Val Tyr Thr Gln Glu His Lys Ser Glu His Ala Glu
545 550 555 560
Asp Arg Gly His Ile Ser Pro Thr Thr Asp Gln Ser Val Ser Ser Ser
565 570 575
Phe Cys Asn Gly Asn Ala Ser His Leu Asn Ser Ile Gly Tyr Gly Ser
580 585 590
Asn Cys Gly Ser Ser Ser Asn Val Asp Gln Val Asn Thr Val Trp Ala
595 600 605
Ala Ser Glu Gly Lys His Glu Asp Leu Thr Asn Asn Ala Asn Ser His
610 615 620
Arg Ser Ile Gln Arg Glu Ala Ala Leu Asn Lys Phe Arg Leu Lys Arg
625 630 635 640
Lys Glu Arg Cys Tyr Glu Lys Lys Val Arg Tyr Glu Ser Arg Lys Lys
645 650 655
Leu Ala Glu Gln Arg Pro Arg Val Lys Gly Gln Phe Val Arg Gln Val
660 665 670
His Pro Asp Pro Leu Val Ala Glu Lys Asp Gly Lys Glu Tyr Asp His
675 680 685
Ser Asp Phe
690
<210> 48
<211> 747
<212> PRT
<213> Vitis vinifera
<400> 48
Met Gly Glu Val Val Val Ser Ser Glu Ala Gly Gly Gly Gly Met Glu
1 5 10 15
Gly Glu Val Glu Lys Lys Glu Val Gly Ser Gly Val Val Arg Trp Glu
20 25 30
Arg Phe Leu Pro Arg Met Val Leu Arg Val Leu Leu Val Glu Ala Asp
35 40 45
Asp Ser Thr Arg Gln Ile Ile Ala Ala Leu Leu Arg Lys Cys Ser Tyr
50 55 60
Lys Val Ala Ala Val Pro Asp Gly Leu Lys Ala Trp Glu Val Leu Lys
65 70 75 80
Ala Arg Pro His Asn Ile Asp Leu Ile Leu Thr Glu Val Glu Leu Pro
85 90 95
Ser Ile Ser Gly Phe Ala Leu Leu Thr Leu Val Met Glu His Glu Ile
100 105 110
Cys Lys Asn Ile Pro Val Ile Met Met Ser Ser His Gly Ser Ile Asn
115 120 125
Thr Val Tyr Lys Cys Met Leu Arg Gly Ala Ala Asp Phe Leu Val Lys
130 135 140
Pro Val Arg Arg Asn Glu Leu Lys Asn Leu Trp Gln His Val Trp Arg
145 150 155 160
Arg Gln Ser Ser Thr Val Ser Gly Asn Gly Pro Gln Asp Glu Ser Val
165 170 175
Ala Gln Gln Lys Val Glu Ala Thr Ser Glu Asn Asn Pro Thr Ser Asn
180 185 190
His Ser Ser Asp His Val Ala Cys Ile Gln Lys Asn Lys Glu Ala Leu
195 200 205
Asn Lys Val Ser Asp Ala Gln Ser Ser Cys Ser Lys Pro Asp Leu Glu
210 215 220
Ala Glu Ser Ala Tyr Met Glu Thr Met Gln Asp Phe Ser Asn Pro Thr
225 230 235 240
Trp Ser Arg Ser Leu Val Ser Asp Thr Lys Met Gln Lys Asn Glu Glu
245 250 255
Cys Ala Lys Leu Gly Pro Lys Phe Leu Met His Asn Lys Glu Ala Gly
260 265 270
Gly Thr Leu Glu Ala Ala Cys Arg Asp Val Asn Thr Met Thr Gln Pro
275 280 285
Glu Ala Val Glu Pro Glu Asn Asp Gly Gln Gly Ala Asn Ala Pro Ser
290 295 300
Glu Ala Cys Gly Asn Asn Ala Ile Leu Gly Ser Ser Ser Arg Glu Ala
305 310 315 320
Ile Asp Leu Ile Gly Val Phe Asp Asn Ser Lys Lys Cys Thr Tyr Gly
325 330 335
Asn Ser Ser Ser Asn Asn Gly Thr Lys Lys Ser Asp Ser Ile Pro Gln
340 345 350
Leu Asp Leu Ser Leu Arg Arg Ser His Pro Ser Ser Pro Glu Asn Gln
355 360 365
Val Ala Asp Glu Arg His Thr Leu Asn His Ser Asn Gly Ser Ala Phe
370 375 380
Ser Arg Tyr Ile Asn Arg Ser Leu Gln Pro Pro His Leu Pro Ser Thr
385 390 395 400
Gly Val Phe Asn Gln Gln Lys Asn Phe Gly Ala Asp Ser Asp Lys Arg
405 410 415
Leu Ser Gln Leu Val Thr Gly Tyr Asn Ser Asp Ile Thr Ser Pro Thr
420 425 430
Leu Ser Thr Gln Arg Ser Val Ile Ser Leu Ala Thr Ser Pro Ser Gly
435 440 445
Arg Val Glu Ile Ala Leu Cys Gly Pro Gln Gln Arg Ala Phe Pro Ala
450 455 460
Pro Val Pro Gln Asn Ala Asn Asn Ser Thr Ser Gln Thr Asn His Lys
465 470 475 480
Pro Glu His Lys Leu Asp Ser Leu Glu Gly Gln Gly His Phe Ser Pro
485 490 495
Ala Thr Asp Gln Asn Ser Ser Ser Ser Phe Gly Asn Gly Gly Ala Ser
500 505 510
Asn Leu Asn Ser Phe Gly Cys Gly Ser Ile Cys Gly Ser Asn Gly Asn
515 520 525
Ala Asn Thr Val Ala Val Val Gln Ala Ala Ala Glu Gly Lys Asn Glu
530 535 540
Glu Gly Ile Phe Ser His Glu Gly His Ser Gln Arg Ser Ile Gln Arg
545 550 555 560
Glu Ala Ala Leu Thr Lys Phe Arg Leu Lys Arg Lys Asp Arg Cys Phe
565 570 575
Glu Lys Lys Val Arg Tyr Glu Ser Arg Lys Lys Leu Ala Glu Gln Arg
580 585 590
Pro Arg Val Lys Gly Gln Phe Val Arg Gln Val His Thr Ile Pro Pro
595 600 605
Pro Ala Glu Pro Asp Thr Tyr Tyr Gly Ser Ser Phe Asp Val Gln Pro
610 615 620
Gln Arg Ser Arg Tyr Leu Ser Ala Gln Pro Leu Arg Ala Ser Ser Ser
625 630 635 640
Gln Leu Leu Tyr Pro Thr His Thr Pro Leu Gln Glu Ser Lys Tyr Glu
645 650 655
Gly His Glu Glu Ser Asn Leu Leu Thr Ala Ser Leu Val Gly Thr Ala
660 665 670
Leu Pro Val Ala Pro Ser Phe Gly Tyr Glu Val Gly Arg Asp Gln Thr
675 680 685
Ala Gly Lys Leu Val Leu Ser Leu Lys Leu Asp Gly Arg Val Arg Trp
690 695 700
Lys Val Gly Thr Trp Val Ser Gly Arg Tyr Arg Leu Asn Val Asn Cys
705 710 715 720
Val Ala Val Met Ala Phe Gly Pro Ser Ile Pro Ser Gly Pro Leu Ser
725 730 735
Ser Lys Glu Gly Thr Gln Cys Ser Thr Thr Val
740 745
<210> 49
<211> 799
<212> PRT
<213> Theobroma cacao
<400> 49
Met Gly Ile Val Gln Met Asn Asn Asn Gly Pro Val Ala Asn Gly Leu
1 5 10 15
Val Glu Leu Asn Thr His Ile His Asp Glu His Lys Lys Ile Arg Gly
20 25 30
Gly Val Ile Gly Glu Gly Gln Gly Leu Ser Val Glu Glu Glu Ser Trp
35 40 45
Ile Asn Glu Asp Val Glu Asp Arg Asn Asp Gly Lys Thr Glu Leu Val
50 55 60
Gln Val Gln Gly His Ala His Gly Glu Gln Glu Arg Ser Gln Gln Gln
65 70 75 80
Pro Gln Gly Pro Leu Val His Trp Glu Arg Phe Leu Pro Leu Arg Ser
85 90 95
Leu Lys Val Leu Leu Val Glu Asn Asp Asp Ser Thr Arg His Val Val
100 105 110
Cys Ala Leu Leu Arg Asn Cys Gly Phe Glu Val Thr Ala Val Ser Asn
115 120 125
Gly Leu Gln Ala Trp Lys Ile Leu Glu Asp Leu Thr Asn His Ile Asp
130 135 140
Leu Val Leu Thr Glu Val Val Met Pro Cys Leu Ser Gly Ile Gly Leu
145 150 155 160
Leu Cys Lys Ile Met Ser His Lys Thr Arg Met Asn Ile Pro Val Ile
165 170 175
Met Met Ser Ser His Asp Ser Met Ser Thr Val Phe Arg Cys Leu Ser
180 185 190
Lys Gly Ala Val Asp Phe Leu Val Lys Pro Ile Arg Lys Asn Glu Leu
195 200 205
Lys Asn Leu Trp Gln His Val Trp Arg Lys Cys His Ser Ser Ser Ser
210 215 220
Ser Gly Gly Gln Ser Gly Thr Gln Thr Gln Lys Ser Ser Lys Ser Lys
225 230 235 240
Gly Thr Asp Ser Asp Asn Asn Thr Gly Ser Asn Asp Glu Asp Asp Asn
245 250 255
Gly Ser Val Gly Leu Asn Val Gln Asp Gly Ser Asp Asn Gly Ser Gly
260 265 270
Thr Gln Ser Ser Trp Thr Lys Arg Ala Val Glu Val Asp Ser Ser Gln
275 280 285
Pro Ile Ser Pro Trp Asp Gln Leu Ala Asp Pro Pro His Ser Thr Cys
290 295 300
Ala Gln Val Ile His Ser Arg His Glu Val Leu Gly Asp Ser Trp Val
305 310 315 320
Pro Val Thr Ala Thr Arg Glu Tyr Asp Glu Leu Asp Asn Glu Leu Glu
325 330 335
Asn Val Val Met Gly Lys Asp Leu Glu Ile Gly Val Pro Lys Ile Thr
340 345 350
Ala Ser Gln Leu Glu Asp Pro Ser Glu Lys Val Met Thr Asn Ile Ala
355 360 365
Gly Val Asn Lys Asp Lys Leu Ser Ala Ile Asn Pro Lys Lys Asp Asp
370 375 380
Glu Lys Leu Glu Lys Ala Gln Leu Glu Leu Asn Ser Glu Lys Ser Gly
385 390 395 400
Gly Asp Leu Arg Asn Gln Ala Ala Asp Leu Ile Gly Val Ile Thr Asn
405 410 415
Asn Thr Glu Pro His Ile Glu Ser Ala Val Phe Asp Ile Pro Asn Gly
420 425 430
Leu Pro Lys Val Ser Asp Ala Lys Glu Lys Val Asn Tyr Asp Thr Lys
435 440 445
Glu Met Pro Phe Leu Glu Leu Ser Leu Lys Arg Leu Arg Asp Val Gly
450 455 460
Asp Thr Gly Thr Ser Ala His Glu Arg Asn Val Leu Arg His Ser Asp
465 470 475 480
Leu Ser Ala Phe Ser Arg Tyr Asn Ser Gly Ser Thr Ala Asn Gln Ala
485 490 495
Pro Thr Gly Asn Val Gly Ser Cys Ser Pro Leu Asp Asn Ser Ser Glu
500 505 510
Ala Val Lys Thr Asp Ser Met Lys Asn Phe Gln Ser Thr Ser Asn Ser
515 520 525
Ile Pro Pro Lys Gln Gln Ser Asn Gly Ser Ser Asn Asn Asn Asp Met
530 535 540
Gly Ser Thr Thr Asn Asn Ala Phe Ser Lys Pro Ala Val Leu Ser Asp
545 550 555 560
Lys Pro Ala Pro Lys Thr Ser Ala Lys Ser Phe His Pro Ser Ser Ala
565 570 575
Phe Gln Pro Val Gln Ser Gly His Gly Ser Ala Leu Gln Pro Val Ala
580 585 590
Gln Gly Lys Ala Asp Ala Ala Leu Gly Asn Met Ile Leu Val Lys Ala
595 600 605
Arg Gly Thr Asp Gln Gln Gly Lys Val Gln His His His His His Tyr
610 615 620
His His His His His His His Val His Asn Met Leu Pro Asn Gln Lys
625 630 635 640
Leu Gly Asn His Asp Asp Leu Ser Leu Glu Asn Met Ala Ala Ala Ala
645 650 655
Pro Gln Cys Gly Ser Ser Asn Leu Ser Ser Leu Pro His Val Glu Gly
660 665 670
Asn Ala Ala Asn His Ser Leu Thr Arg Ser Ala Ser Gly Ser Asn His
675 680 685
Gly Ser Asn Gly Gln Asn Gly Ser Ser Thr Val Leu Asn Thr Arg Gly
690 695 700
Met Asn Leu Glu Ser Glu Asn Gly Val Pro Gly Lys Gly Gly Ala Gly
705 710 715 720
Gly Gly Ile Gly Ser Gly Gly Arg Asn Val Val Asp Gln Asn Arg Phe
725 730 735
Ala Gln Arg Glu Ala Ala Leu Asn Lys Phe Arg Gln Lys Arg Lys Glu
740 745 750
Arg Cys Phe Glu Lys Lys Val Arg Tyr Gln Ser Arg Lys Lys Leu Ala
755 760 765
Glu Gln Arg Pro Arg Ile Arg Gly Gln Phe Val Arg Gln Ile Ser Thr
770 775 780
Thr Gly Lys Glu Ala Phe Arg Phe Arg Gly Ala Gly Leu Cys Thr
785 790 795
<210> 50
<211> 742
<212> PRT
<213> Oryza sativa
<400> 50
Met Met Gly Thr Ala His His Asn Gln Thr Ala Gly Ser Ala Leu Gly
1 5 10 15
Val Gly Val Gly Asp Ala Asn Asp Ala Val Pro Gly Ala Gly Gly Gly
20 25 30
Gly Tyr Ser Asp Pro Asp Gly Gly Pro Ile Ser Gly Val Gln Arg Pro
35 40 45
Pro Gln Val Cys Trp Glu Arg Phe Ile Gln Lys Lys Thr Ile Lys Val
50 55 60
Leu Leu Val Asp Ser Asp Asp Ser Thr Arg Gln Val Val Ser Ala Leu
65 70 75 80
Leu Arg His Cys Met Tyr Glu Val Ile Pro Ala Glu Asn Gly Gln Gln
85 90 95
Ala Trp Thr Tyr Leu Glu Asp Met Gln Asn Ser Ile Asp Leu Val Leu
100 105 110
Thr Glu Val Val Met Pro Gly Val Ser Gly Ile Ser Leu Leu Ser Arg
115 120 125
Ile Met Asn His Asn Ile Cys Lys Asn Ile Pro Val Ile Met Met Ser
130 135 140
Ser Asn Asp Ala Met Gly Thr Val Phe Lys Cys Leu Ser Lys Gly Ala
145 150 155 160
Val Asp Phe Leu Val Lys Pro Ile Arg Lys Asn Glu Leu Lys Asn Leu
165 170 175
Trp Gln His Val Trp Arg Arg Cys His Ser Ser Ser Gly Ser Gly Ser
180 185 190
Glu Ser Gly Ile Gln Thr Gln Lys Cys Ala Lys Ser Lys Ser Gly Asp
195 200 205
Glu Ser Asn Asn Asn Asn Gly Ser Asn Asp Asp Asp Asp Asp Asp Gly
210 215 220
Val Ile Met Gly Leu Asn Ala Arg Asp Gly Ser Asp Asn Gly Ser Gly
225 230 235 240
Thr Gln Ala Gln Ser Ser Trp Thr Lys Arg Ala Val Glu Ile Asp Ser
245 250 255
Pro Gln Ala Met Ser Pro Asp Gln Leu Ala Asp Pro Pro Asp Ser Thr
260 265 270
Cys Ala Gln Val Ile His Leu Lys Ser Asp Ile Cys Ser Asn Arg Trp
275 280 285
Leu Pro Cys Thr Ser Asn Lys Asn Ser Lys Lys Gln Lys Glu Thr Asn
290 295 300
Asp Asp Phe Lys Gly Lys Asp Leu Glu Ile Gly Ser Pro Arg Asn Leu
305 310 315 320
Asn Thr Ala Tyr Gln Ser Ser Pro Asn Glu Arg Ser Ile Lys Pro Thr
325 330 335
Asp Arg Arg Asn Glu Tyr Pro Leu Gln Asn Asn Ser Lys Glu Ala Ala
340 345 350
Met Glu Asn Leu Glu Glu Ser Ser Val Arg Ala Ala Asp Leu Ile Gly
355 360 365
Ser Met Ala Lys Asn Met Asp Ala Gln Gln Ala Ala Arg Ala Ala Asn
370 375 380
Ala Pro Asn Cys Ser Ser Lys Val Pro Glu Gly Lys Asp Lys Asn Arg
385 390 395 400
Asp Asn Ile Met Pro Ser Leu Glu Leu Ser Leu Lys Arg Ser Arg Ser
405 410 415
Thr Gly Asp Gly Ala Asn Ala Ile Gln Glu Glu Gln Arg Asn Val Leu
420 425 430
Arg Arg Ser Asp Leu Ser Ala Phe Thr Arg Tyr His Thr Pro Val Ala
435 440 445
Ser Asn Gln Gly Gly Thr Gly Phe Met Gly Ser Cys Ser Leu His Asp
450 455 460
Asn Ser Ser Glu Ala Met Lys Thr Asp Ser Ala Tyr Asn Met Lys Ser
465 470 475 480
Asn Ser Asp Ala Ala Pro Ile Lys Gln Gly Ser Asn Gly Ser Ser Asn
485 490 495
Asn Asn Asp Met Gly Ser Thr Thr Lys Asn Val Val Thr Lys Pro Ser
500 505 510
Thr Asn Lys Glu Arg Val Met Ser Pro Ser Ala Val Lys Ala Asn Gly
515 520 525
His Thr Ser Ala Phe His Pro Ala Gln His Trp Thr Ser Pro Ala Asn
530 535 540
Thr Thr Gly Lys Glu Lys Thr Asp Glu Val Ala Asn Asn Ala Ala Lys
545 550 555 560
Arg Ala Gln Pro Gly Glu Val Gln Ser Asn Leu Val Gln His Pro Arg
565 570 575
Pro Ile Leu His Tyr Val His Phe Asp Val Ser Arg Glu Asn Gly Gly
580 585 590
Ser Gly Ala Pro Gln Cys Gly Ser Ser Asn Val Phe Asp Pro Pro Val
595 600 605
Glu Gly His Ala Ala Asn Tyr Gly Val Asn Gly Ser Asn Ser Gly Ser
610 615 620
Asn Asn Gly Ser Asn Gly Gln Asn Gly Ser Thr Thr Ala Val Asn Ala
625 630 635 640
Glu Arg Pro Asn Met Glu Ile Ala Asn Gly Thr Ile Asn Lys Ser Gly
645 650 655
Pro Gly Gly Gly Asn Gly Ser Gly Ser Gly Ser Gly Asn Asp Met Tyr
660 665 670
Leu Lys Arg Phe Thr Gln Arg Glu His Arg Val Ala Ala Val Ile Lys
675 680 685
Phe Arg Gln Lys Arg Lys Glu Arg Asn Phe Gly Lys Lys Val Arg Tyr
690 695 700
Gln Ser Arg Lys Arg Leu Ala Glu Gln Arg Pro Arg Val Arg Gly Gln
705 710 715 720
Phe Val Arg Gln Ala Val Gln Asp Gln Gln Gln Gln Gly Gly Gly Arg
725 730 735
Glu Ala Ala Ala Asp Arg
740
<210> 51
<211> 766
<212> PRT
<213> Zea mays
<400> 51
Met Gly Ser Ala Cys Gln Ala Gly Thr Asp Gly Pro Ser Arg Lys Asp
1 5 10 15
Val Leu Gly Ile Gly Asn Ala Ala Leu Glu Asn Gly His His Gln Ala
20 25 30
Glu Ala Asp Ala Asp Glu Trp Arg Glu Lys Glu Glu Asp Leu Ala Asn
35 40 45
Asn Gly His Ser Ala Pro Pro Pro Gly Met Gln Gln Val Asp Glu His
50 55 60
Lys Glu Glu Gln Arg Gln Ser Ile His Trp Glu Arg Phe Leu Pro Val
65 70 75 80
Lys Thr Leu Arg Val Leu Leu Val Glu Asn Asp Asp Ser Thr Arg Gln
85 90 95
Val Val Ser Ala Leu Leu Arg Lys Cys Cys Tyr Glu Val Ile Pro Ala
100 105 110
Glu Asn Gly Leu His Ala Trp Arg Tyr Leu Glu Asp Leu Gln Asn Asn
115 120 125
Ile Asp Leu Val Leu Thr Glu Val Phe Met Pro Cys Leu Ser Gly Ile
130 135 140
Gly Leu Leu Ser Lys Ile Thr Ser His Lys Ile Cys Lys Asp Ile Pro
145 150 155 160
Val Ile Met Met Ser Thr Asn Asp Ser Met Ser Met Val Phe Lys Cys
165 170 175
Leu Ser Lys Gly Ala Val Asp Phe Leu Val Lys Pro Leu Arg Lys Asn
180 185 190
Glu Leu Lys Asn Leu Trp Gln His Val Trp Arg Arg Cys His Ser Ser
195 200 205
Ser Gly Ser Glu Ser Gly Ile Gln Thr Gln Lys Cys Ala Lys Leu Asn
210 215 220
Thr Gly Asp Glu Tyr Glu Asn Gly Ser Asp Ser Asn His Asp Asp Glu
225 230 235 240
Glu Asn Asp Asp Gly Asp Asp Asp Asp Phe Ser Val Gly Leu Asn Ala
245 250 255
Arg Asp Gly Ser Asp Asn Gly Ser Gly Thr Gln Ser Ser Trp Thr Lys
260 265 270
Arg Ala Val Glu Ile Asp Ser Pro Gln Pro Ile Ser Pro Asp Gln Leu
275 280 285
Val Asp Pro Pro Asp Ser Thr Cys Ala Gln Val Ile His Pro Arg Ser
290 295 300
Glu Ile Cys Ser Asn Lys Trp Leu Pro Thr Ala Asn Lys Arg Asn Val
305 310 315 320
Lys Lys Gln Lys Glu Asn Lys Asp Glu Ser Met Gly Arg Tyr Leu Gly
325 330 335
Ile Gly Ala Pro Arg Asn Ser Ser Ala Glu Tyr Gln Ser Ser Leu Asn
340 345 350
Asp Val Ser Val Asn Pro Ile Glu Lys Gly His Glu Asn His Met Ser
355 360 365
Lys Cys Lys Ser Lys Lys Glu Thr Met Ala Glu Asp Asp Cys Thr Asn
370 375 380
Met Pro Ser Ala Thr Asn Ala Glu Thr Ala Asp Leu Ile Ser Ser Ile
385 390 395 400
Ala Arg Asn Thr Glu Gly Gln Gln Ala Val Gln Ala Val Asp Ala Pro
405 410 415
Asp Gly Pro Ser Lys Met Ala Asn Gly Asn Asp Lys Asn His Asp Ser
420 425 430
His Ile Glu Val Thr Pro His Glu Leu Gly Leu Lys Arg Ser Arg Thr
435 440 445
Asn Gly Ala Thr Ala Glu Ile His Asp Glu Arg Asn Ile Leu Lys Arg
450 455 460
Ser Asp Gln Ser Ala Phe Thr Arg Tyr His Thr Ser Val Ala Ser Asn
465 470 475 480
Gln Gly Gly Ala Arg Tyr Gly Glu Ser Ser Ser Pro Gln Asp Asn Ser
485 490 495
Ser Glu Ala Met Lys Thr Asp Ser Thr Cys Lys Met Lys Ser Asn Ser
500 505 510
Asp Ala Ala Pro Ile Lys Gln Gly Ser Asn Gly Ser Ser Asn Asn Asp
515 520 525
Val Gly Ser Ser Thr Lys Asn Val Ala Ala Arg Pro Ser Gly Asp Arg
530 535 540
Glu Arg Val Ala Ser Pro Leu Ala Ile Lys Ser Thr Gln His Ala Ser
545 550 555 560
Ala Phe His Thr Ile Gln Asn Gln Thr Ser Pro Ala Asn Leu Ile Gly
565 570 575
Glu Asp Lys Ala Asp Glu Gly Ile Ser Asn Thr Val Lys Met Ser His
580 585 590
Pro Thr Glu Val Pro Gln Gly Cys Val Gln His His His His Val His
595 600 605
Tyr Tyr Leu His Val Met Thr Gln Lys Gln Pro Ser Thr Asp Arg Gly
610 615 620
Ser Ser Asp Val His Cys Gly Ser Ser Asn Val Phe Asp Pro Pro Val
625 630 635 640
Glu Gly His Ala Ala Asn Tyr Ser Val Asn Gly Gly Val Ser Val Gly
645 650 655
His Asn Gly Cys Asn Gly Gln Asn Gly Ser Ser Ala Val Pro Asn Ile
660 665 670
Ala Arg Pro Asn Ile Glu Ser Ile Asn Gly Thr Met Ser Gln Asn Ile
675 680 685
Ala Gly Gly Gly Ile Val Ser Gly Ser Gly Ser Gly Asn Asp Met Tyr
690 695 700
Gln Asn Arg Phe Leu Gln Arg Glu Ala Ala Leu Asn Lys Phe Arg Leu
705 710 715 720
Lys Arg Lys Asp Arg Asn Phe Gly Lys Lys Val Arg Tyr Gln Ser Arg
725 730 735
Lys Arg Leu Ala Glu Gln Arg Pro Arg Val Arg Gly Gln Phe Val Arg
740 745 750
Gln Ser Glu Gln Glu Asp Gln Thr Ala Gln Gly Ser Glu Arg
755 760 765
<210> 52
<211> 917
<212> PRT
<213> Physcomitrella patens
<400> 52
Met Thr Ala Asp Leu Cys Glu Phe Glu Ser Glu Ser Asp Pro Leu Gln
1 5 10 15
Pro Leu Ser Ala Val Gly Arg Ala Trp Val Glu Pro Ile Val Gly Thr
20 25 30
Pro Val Gly Ala Glu Trp Arg Ile Lys Gly Gly Phe Lys Ala His Lys
35 40 45
Glu Val Asp Arg Ser Arg Glu Gln Val Gly Ser Lys Arg Val Asp Asp
50 55 60
Arg Glu Lys Asn Ser Gly Arg Leu Glu Asn Gly Cys Arg Phe Ala Asp
65 70 75 80
Arg Thr Gly Gly Ala Val Leu Lys Ala Arg Glu Asp Pro Lys Asp Ile
85 90 95
Ala Glu Gln Ile Arg Arg Glu Leu Asp His Gln Phe Pro Val Asn Asp
100 105 110
Val Leu Arg Thr Ser Glu Ser Asp Glu Asp Gly Arg Arg Glu Asp Ser
115 120 125
Ala Glu Asp His Tyr Glu Glu Gly Asp Ala Val Ala Ala Val Val Phe
130 135 140
Glu Lys Gln Arg Pro Arg Glu Ile Ala Gln Thr Arg Glu Gln Gln Gln
145 150 155 160
Gly Gly Asn Ala Ala Ala Ala Ala Ala Gly Thr Gln Gly Gly Gly Gly
165 170 175
Trp Glu Ser Phe Leu Leu Lys Arg Asn Leu Lys Val Leu Leu Val Glu
180 185 190
Asp Asp Asp Ala Thr Arg His Val Val Gly Ala Leu Leu Arg Asn Cys
195 200 205
Asn Tyr Glu Val Thr Pro Val Ala Asn Gly Ser Leu Ala Trp Gly Leu
210 215 220
Leu Glu Glu Ala Asn Ser Asn Phe Asp Leu Val Leu Thr Asp Val Val
225 230 235 240
Met Pro Tyr Leu Ser Gly Val Gly Leu Leu Ser Lys Met Met Lys Arg
245 250 255
Glu Ala Cys Lys Arg Val Pro Ile Val Ile Met Ser Ser Tyr Asp Ser
260 265 270
Leu Gly Ile Val Phe Arg Cys Leu Ser Lys Gly Ala Cys Asp Tyr Leu
275 280 285
Val Lys Pro Val Arg Lys Asn Glu Leu Lys Asn Leu Trp Gln His Val
290 295 300
Trp Arg Lys Cys His Ser Ser Ser Gly Ser Arg Ser Gly Ser Gly Ser
305 310 315 320
Gln Thr Gly Glu Val Ala Lys Pro Arg Ser Arg Gly Val Ala Ala Ala
325 330 335
Asp Asn Pro Ser Gly Ser Asn Asp Gly Asn Gly Ser Ser Asp Gly Ser
340 345 350
Asp Asn Gly Ser Ser Arg Val Asn Ala Gln Gly Gly Ser Asp Asn Gly
355 360 365
Ser Gly Asn Gln Ala Cys Met Gln Pro Val Gln Val Leu Arg Asn Ser
370 375 380
Ala Ile Pro Glu Ala Val Asp Gly Asp Glu Glu Gly Gln Ala Thr Ser
385 390 395 400
Gln Asp Lys Gly Ala Asp Leu Asp Gly Glu Met Gly His Asp Leu Glu
405 410 415
Met Ala Thr Arg Arg Ser Ala Cys Val Thr Thr Gly Lys Asp Gln Gln
420 425 430
Pro Glu Asp Ala Gln Lys Gln Asp Glu Asp Ala Val Cys Ile Leu Gln
435 440 445
Asp Ala Gly Pro Ser Pro Asp Gly Ala Asn Ala Glu Ser Pro Ser Ser
450 455 460
Ser Gly Arg Asn Asp Ala Ala Glu Glu Ser Ser Pro Lys Ile Ile Asp
465 470 475 480
Leu Ile Asn Val Ile Ala Cys Gln Pro Gln Thr Gln Asp Ala Glu Pro
485 490 495
Gln Glu Ser Glu Asn Asp Asp Glu Glu Leu Asp Pro Arg Gly Arg Ser
500 505 510
Ser Pro Lys Asn Asn Ser Ala Ser Asp Ser Gly Thr Ser Leu Glu Leu
515 520 525
Ser Leu Lys Arg Pro Arg Ser Ala Val Gly Asn Gly Gly Glu Leu Glu
530 535 540
Glu Arg Gln Pro Leu Arg His Ser Gly Gly Ser Ala Phe Ser Arg Tyr
545 550 555 560
Gly Ser Gly Gly Thr Ile Ile Gln Gln Tyr His Gln Thr Gly Gly Ser
565 570 575
Leu Pro Leu Ser Gly Tyr Pro Val Ser Gly Gly Tyr Gly Val Tyr Gly
580 585 590
Met Ser Gly Gly Ser Pro Gly Gly Ser Leu Arg Leu Gly Met Gly Met
595 600 605
Asp Arg Ser Gly Ser Ser Lys Gly Ser Val Glu Gly Thr Thr Pro Pro
610 615 620
Pro Ser His Pro Gln Ser Met Glu Lys Val Gly Gly Gln Asp Gly Tyr
625 630 635 640
Gly Asn Ala Arg Gln Thr Thr Glu Asp Ala Met Ile Val Pro Gly Met
645 650 655
Pro Met Ala Ile Pro Leu Pro Pro Pro Gly Met Leu Ala Tyr Asp Gly
660 665 670
Val Ile Gly Thr Tyr Gly Pro Ala Met His Pro Met Tyr Tyr Ala His
675 680 685
Pro Ser Ala Trp Met Ala Ala Pro Ser Arg His Met Gly Glu Arg Gly
690 695 700
Asp Val Tyr Asn Gln Ser Pro Ala Phe Gln Glu Gln Asp Ser Gly Ser
705 710 715 720
Gly Asn His Ser Gln Ala Gly Gln Thr His Gln His Met His His His
725 730 735
Gln Gly Asn Gln His His His His His His His His His His Gly Ser
740 745 750
Gly Ala Gln Pro Ser Gly Asn Ala Gly Val Gln Asp Glu Gln Gln Gln
755 760 765
Ser Val Val Pro Pro Gly Ser Ser Ala Pro Arg Cys Gly Ser Thr Gly
770 775 780
Val Asp Gly Arg Ser Gly Ser Ser Asn Gly Tyr Gly Ser Thr Gly Asn
785 790 795 800
Gly Asn Gly Ser Met Asn Gly Ser Ala Ser Gly Ser Asn Thr Gly Val
805 810 815
Asn Asn Gly Gln Ser Gly Phe Gly Ala Thr Pro Met Leu Thr Asp Asn
820 825 830
Ser Gly Ser Asn Gly Val Gly Gly Thr Asp Ala Ala Met Asp Gly Val
835 840 845
Ser Gly Gly Asn Gly Leu Cys Thr Glu Gln Met Arg Phe Ala Arg Arg
850 855 860
Glu Ala Ala Leu Asn Lys Phe Arg Gln Lys Arg Lys Glu Arg Cys Phe
865 870 875 880
Glu Lys Lys Val Arg Tyr Gln Ser Arg Lys Arg Leu Ala Glu Gln Arg
885 890 895
Pro Arg Val Arg Gly Gln Phe Val Arg Gln Ala Val His Asp Pro Ser
900 905 910
Ala Gly Asp Ala Glu
915
<210> 53
<211> 1359
<212> PRT
<213> Volvox carteri
<400> 53
Met Glu Phe His Val Leu Leu Val Glu Asp Asp Arg Val Thr Leu Lys
1 5 10 15
Thr Val Glu Gln Leu Leu Arg Lys Cys Asn Tyr Lys Val Thr Cys Ala
20 25 30
Ala Asn Gly Arg Glu Ala Ile Lys Val Leu Thr Ala Cys Arg His Ser
35 40 45
Gly Val Lys Val Asp Leu Ile Leu Thr Asp Ile Leu Met Pro Glu Val
50 55 60
Thr Gly Phe Asp Leu Ile Asn Glu Val Val His Gly Asp Thr Phe Cys
65 70 75 80
Asp Val Pro Val Val Val Met Ser Ser Gln Asp Ser Gln Glu Asn Val
85 90 95
Leu Gln Ala Phe Gln Ala Gly Ala Ala Asp Tyr Leu Ile Lys Pro Ile
100 105 110
Arg Lys Asn Glu Leu Ala Thr Leu Trp Gln His Val Trp Arg Ala Asn
115 120 125
Lys Ala Lys Gly Ser Gly Ser Gly Thr Thr Thr Asn Val Thr Gly Gln
130 135 140
Pro Leu Ser Gly Arg Glu Asp Leu Glu Ala Gly Glu Ala Val Ala Val
145 150 155 160
Ala Ala Ala Ala Ala Ala Ala Ser Gly Lys Ala Cys Ala Ala Thr His
165 170 175
Gly His Leu Lys Asp Ser Ser Gly Gly Ser Ser Gly Ala Ala Ala Ser
180 185 190
Val Leu Gln Ser Thr Gly Gly Thr Leu Leu Pro Asp Arg Ala Ala Thr
195 200 205
Val Arg Tyr Pro Ala Ala Ala Ala Ala Pro Pro Pro Pro Gly Ala Ser
210 215 220
Glu Leu Ser Gly Asn Val Thr Ala Gly Glu Ala Gln Gly Ser Arg Thr
225 230 235 240
Gln His Leu Arg His Leu Ser Gly Leu Ala Gly Met Glu Ser Thr Ala
245 250 255
Ala Thr Ser Ala Ala Ala Gln Gly Ser Ser Ala Ala Gly Pro Leu Arg
260 265 270
Gly Cys Gly Gly Ala Gly Thr Ala Ile Ala Gly Gly Pro Arg Ala Pro
275 280 285
Leu Gly Pro Leu Ser Phe Ala Pro Phe Gly Thr Ser Val Ala Val His
290 295 300
Phe Asp Leu Asn Pro Ala Ser Gly Ala Ala Arg Arg Leu Val Asn Ser
305 310 315 320
Ser Gly Ala Ile Asp Ala Ser Thr Gly Ser Gly Thr Ala Gly Val Ala
325 330 335
Ala Ser Ser Arg Cys Ala Ala Gly Thr Ser Ala Thr Val Ile Ser Trp
340 345 350
Ser His Val Asp Pro Thr Glu Thr Asp Pro Ala Glu Ala Glu Pro Met
355 360 365
Tyr Asp Thr Asn Ala Asp Ala Thr Ala Ala Lys Ala Ala Ala Asp Gly
370 375 380
Val Ala Glu Ala Asp Asp Asp Asp Val Gly Asp Asp Gly Gly Ala Gly
385 390 395 400
Pro Asn His Asn Asp Asp Asp Asp Glu Gly Gly Gly Asp Asp Asp Val
405 410 415
Ser Gly Asp Gly Asp Glu Asp Gly Asn Arg Pro Arg Lys Arg Pro Arg
420 425 430
Leu Leu Gln Gly Ser Ser His His His Ser His Gln His Arg Leu His
435 440 445
Ser Leu Gly Gly Thr Thr Thr Asn Thr Thr Thr Thr Thr Thr Ala Ala
450 455 460
Lys Pro Lys Ser Thr Ala Gly Glu Arg Gly Gly Ala Ala Ala Leu Leu
465 470 475 480
Ala Cys Arg Thr Ala Ala Ala Ala Pro Leu Arg Gly Ser Gly Cys Ala
485 490 495
Thr Ala Gly Ala Thr Gly Ala Cys Arg Leu Ala Ala Ala Ala Ala Ala
500 505 510
Ala Glu Gly Ser Gln Gly Ser Arg Ala Ala Ser Ala Ser Ala Gly Pro
515 520 525
Asp Gly Gly Ala Arg Glu Ser Thr Ala Thr Pro Ser Gly Asp Thr Phe
530 535 540
Ala Glu Ser Pro Ser Ala Tyr Thr Ala Thr Ala Thr Thr Thr Ser Thr
545 550 555 560
Ala Thr Thr Ser Thr Thr Thr Gly Ser Gly Ile Glu Met Gln Asp Asp
565 570 575
Glu Gln Gln Gln Arg Gln Gln Pro Lys Gln Arg Pro Pro Ala Ser Gln
580 585 590
Pro Glu Leu Glu Gly His His His Gln Gln Gln Tyr His His Tyr Tyr
595 600 605
Arg Arg Thr Ser Leu Glu Gly Gly Cys Ala Asn Ala Pro Pro Leu Pro
610 615 620
Val Pro Ser Ser Ala Arg Gly Ala Ser Pro Ala Gly Thr Gly Pro Thr
625 630 635 640
Glu Ser Gly Ser Gly Arg Asp Ser Gly Cys Ala Arg Ile Thr Asn Gly
645 650 655
Thr Ala Ala Gly Ala Thr Ala Ala Met Pro Pro Ser His Val Ser Ser
660 665 670
Ala Ser Pro Pro Arg Cys Thr Ala Thr Ser Ala Ala Ala Thr Arg Gly
675 680 685
Ser Ser Gly Ala Ala Thr Ala Ala Ala Gly Ala Met Thr Thr Ala Leu
690 695 700
Ala Thr Ala Gly Ser Tyr Pro Arg Gly Val Asp Ala Ser Pro Pro Pro
705 710 715 720
Asn Arg Ser Met Gly Ser Ser Gly Gly Asp Gly Gly Gly Thr Ala Ala
725 730 735
Ala Ala Ala Gly Thr Ala Arg Gly Ser Ser Pro Ala Ala Ala Thr Pro
740 745 750
Pro Leu Pro Pro Ser Thr Gln Gln His Gly Leu Pro His Pro Ala Ala
755 760 765
Ala Pro Pro Pro Gly Ala Ala Ser Pro Gly Gly Ala Val Thr Leu Pro
770 775 780
Pro Ala Leu Gln Glu Leu Ala Ala Leu Gly Ala Ala Arg His Ala Gly
785 790 795 800
Leu Trp Thr Gln Arg Ala Leu Leu His Gln Gln Gln Leu Leu Leu Gln
805 810 815
Gln Gln Lys Gln Gln Lys Gln Gln Gln His Gln Gln Asp Gln Val Val
820 825 830
Gly Ala Glu Lys Ile His Gly Gly Ser Thr Ser Ala Val Ala Asn Ala
835 840 845
Ala Glu Gln Gln Gln Gln Gln Pro Leu Gly Ala Ala Ala Ala Arg Arg
850 855 860
Pro Ser Lys Ala Gly Val Asp Gly Thr Glu Ala Gly Ser Gly Ala Val
865 870 875 880
Gly Gly Cys Ala Ser Ala Thr Ala Ala Val Met Ala Met Glu Ala Ser
885 890 895
Glu Pro His Gly Ala Val Gly Ser Ser Phe Thr Ala Ala Asp Arg Gln
900 905 910
Glu Thr Pro Leu Gln Pro Leu His Ala Glu Ser Ala Ala Ala Gly Gly
915 920 925
Asp Met Asp Gly Asn Arg Ser Thr Pro Ala Thr Met Pro Ser Gly Pro
930 935 940
Thr Ala Ala Ala Ser Gly Pro Ser Gln Thr Ser Asn Ser Leu Thr Val
945 950 955 960
Leu Arg His Ser Asp Arg Ser Ala Phe Thr Ala Phe Thr Val Phe Leu
965 970 975
Pro Ser Arg Val Ala Gly Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala
980 985 990
Ala Arg Pro Pro Pro Pro Pro Ala Pro Val Gln Pro Pro Ala Pro Ile
995 1000 1005
Phe Thr His Pro Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala
1010 1015 1020
Ala Gly Ser Gly Gly Ala Ala Ser Val Trp Tyr Pro His Leu His
1025 1030 1035
His His His His Tyr Leu Gln Gln Gln Gln Thr His Met Gly Pro
1040 1045 1050
Leu Pro Pro Leu Pro Gly Ala Val His Val Leu Pro Ser Ile Met
1055 1060 1065
Gln Leu His Met Gly Val Leu Ala Pro Gly Pro Pro Pro Gln Gln
1070 1075 1080
Gln Gln Gln Gln His Leu Gln Ala Lys Ala Pro Gln Lys Pro His
1085 1090 1095
Asp Ser Ala Ala Ala Ala Gly Gly Ala Asn Gly Ser Leu Gly Pro
1100 1105 1110
Ala Thr Ser Ala Ala Ala Ala Thr His Met Ser Tyr Thr Gly Met
1115 1120 1125
Gln Gln Arg Pro Gly Ala Ser Ser Ala Thr Thr Thr Ser Ala Gly
1130 1135 1140
Ala Val Ala Phe Gly Gln Ser Pro Pro His Gly Leu Ala Ala Ala
1145 1150 1155
Ala Ala Ala Ala Ser Thr Pro Pro Pro Pro Pro Pro Pro Pro Val
1160 1165 1170
Cys Ile Pro Glu Ser Val Leu Gln Leu Ile Ala His Leu Ser Gly
1175 1180 1185
Arg Ala Ala Ala Glu Leu Pro Val Pro Glu Thr Val Thr Thr Ala
1190 1195 1200
Pro Leu Val Val Gln Lys Ala Pro Ser Ala Ala Arg Leu Ala Ala
1205 1210 1215
Val Ala Lys Tyr Leu Glu Lys Arg Lys His Arg Asn Phe Gln Lys
1220 1225 1230
Lys Val Arg Tyr Glu Ser Arg Lys Arg Leu Ala Glu Ala Arg Pro
1235 1240 1245
Arg Val Arg Gly Gln Phe Val Lys Ala Ser Thr Ser Ala Val Ala
1250 1255 1260
Ala Thr Thr Pro Ala Ala Thr Gly Ala Thr Val Thr Ser Thr Ser
1265 1270 1275
Leu Arg Gln Pro Val Tyr Thr Ala Ala Gly Pro Ala Gly Leu Ala
1280 1285 1290
Leu Pro Pro Ala Ala Ala Ala Ala Ala Ala Ser Ala Ala Ala Ala
1295 1300 1305
Arg Gly Val Pro Pro Pro Ser Ser Arg Ile Gly Ala Val Glu Leu
1310 1315 1320
Ala Glu Leu Val Pro Asp His Asp Ala Asp Ile Glu Asp Glu Gly
1325 1330 1335
Cys Asp Glu Pro Ala Ala Ala Glu Asp Ser Asp Gly Ser Val Ala
1340 1345 1350
Val Glu Leu Ala Glu Val
1355
<210> 54
<211> 1102
<212> PRT
<213> Chlamydomonas reinhardtii
<400> 54
Met Glu Ala Asn Gly Phe His Val Val Leu Val Glu Asp Asp Asn Ile
1 5 10 15
Cys Leu Lys Val Val Glu Gln Leu Leu Arg Lys Leu Ser Tyr Arg Val
20 25 30
Ser Thr Ala Ser Asp Gly Ala Ala Ala Leu Lys Val Leu Ala Asp Cys
35 40 45
Lys Gln Arg Gly Asp Lys Val Asp Leu Ile Leu Thr Asp Ile Leu Met
50 55 60
Pro Glu Val Thr Gly Phe Asp Leu Ile Asn Glu Val Val His Gly Glu
65 70 75 80
Thr Phe Ala Asp Ile Pro Val Val Val Met Ser Ser Gln Asp Ser Gln
85 90 95
Glu Ser Val Leu Gln Ala Phe Gln Ala Gly Ala Ala Asp Tyr Leu Ile
100 105 110
Lys Pro Ile Arg Lys Asn Glu Leu Ala Thr Leu Trp Gln His Val Trp
115 120 125
Arg Ala Asn Arg Ala Lys Gly Gly Gln Thr Ser Ser Gly Ala Ala His
130 135 140
Val Gly Ala Gly Gly Arg Gly Gly Thr Ser Ser Arg Asp Gly Gly Gly
145 150 155 160
Val Ala Gly Thr Arg Cys Gly Pro Gly Asp Arg Gly Gly Ser Gly Gly
165 170 175
Asp Ala Glu Gly Ser Gly Leu Gly Gly Gly Ala Gly Ala Val Lys Asp
180 185 190
Ser Ser Gly Gly Ser Thr Gly Ala Ala Thr Ser Val Leu His Ser Thr
195 200 205
Gly Gly Thr Thr Leu Pro Ser Arg Ala Ala Thr Gly Arg His Ala Ser
210 215 220
Thr Ser Ala Gly His Gly Val Thr Ser Ala Asp Pro Ser Asn Asn Gln
225 230 235 240
Thr Ser His Ala His Ala His Ala His Ala His Ala His Gly Asn Ala
245 250 255
His Ala His Ala His Leu His Met His Gly Ala Thr Asp Arg Ala Ala
260 265 270
Gln Gly Ser Ser Ala Asn Gly Pro Ala Asn His Gly Ala Ala Gly Thr
275 280 285
Gly Leu Gln Ser Ala Gly Met Ala Gly Ser Thr Ala Ala Gly Ala Ala
290 295 300
Ala Pro Ala Gly Glu Ser Leu Ala Lys Pro Pro Phe Ala Ser Leu Ala
305 310 315 320
Val His Phe Asp Leu His Ser Val Leu Ala Gly Ala Gly Ala Ala Ala
325 330 335
Ala Asn Gly Gly Ala Asn Ala Ala Ala His Thr Ala Gly Ala Thr Gly
340 345 350
Arg Glu Ser Gly Gln Ala Ala Gly Ala Ala Thr Gly Gly Ile Ala Ala
355 360 365
Ala Gly Thr Val Ile Gly Trp Ser His Ala Asp Met Asp Val Asp Gly
370 375 380
Gly Glu Ala Gly Ala Gln Asp Glu Asp Asp Glu Asp Glu Asp Asp Gly
385 390 395 400
Val Glu Ala Pro Ala Gly Thr Gln Asn Arg Lys Arg Ala Ala Asp Asp
405 410 415
Ser Gly Cys Asp Gly Ala Ala Ala Asn Asn Asn Gly Asn Thr Ala Ala
420 425 430
Lys Ala Gly Ala Ala Ala Ile Ala Ala Gly Gly Pro Gly Ser Ser Gly
435 440 445
Arg Ala Lys Ala Thr Asp Gly Ala Arg Ala Glu Ile Arg His Asn Gly
450 455 460
Gly Pro Met Ala Ala Arg Met Ala Ala Ala Glu Gly Ser Gln Gly Ser
465 470 475 480
Arg Ala Ala Ser Gly Ser Ala Ala Thr Gly Pro Gly Gly Ala Arg Glu
485 490 495
Gly Thr Ala Thr Pro Ser Gly Asp Thr Phe Ala Glu Ser Pro Ser Thr
500 505 510
Phe Thr Ser Ile Ile Asn Thr Thr Gly Ser Gly Ser Glu Ala Asp Glu
515 520 525
Gln Pro Val Pro Leu Lys His Gln Glu Gln Gln Gln Gln Gln Gln Gln
530 535 540
Gln Arg Val Gly Glu Gly Asp Arg Ala Lys Pro Glu Pro His Pro Gln
545 550 555 560
Asn Pro Ala Gln Ala Ala His Leu Pro His Pro Ser Ala Ala Pro Cys
565 570 575
Ser Gly Gly Gly Gly Ile Ala Gln Ala Ala Leu Pro Leu Gly Leu Gln
580 585 590
Glu Leu Ala Ala Leu Gly Ala Ala Arg His Lys Glu Leu Trp Thr Gln
595 600 605
Arg His Leu Met His Gln Arg Gln Ala Ala Ala Ala Ala Thr Ala Ala
610 615 620
Ala Ala Ser Ala Ala Ala Ala Ala Ala Met Pro Thr Ala Gly Ala Ser
625 630 635 640
Ala Ala Ala Pro Ala Gly Pro Pro Ser Ala Arg Pro Ser Ala Ser Leu
645 650 655
Ala Asp Thr Gly Gly Asp Gly Pro Ala Ala Ala Thr Ala Pro Glu Thr
660 665 670
Arg Ala Asp Gly Pro Ser Gly Pro Ala Thr Thr Gln Gly Pro Lys Arg
675 680 685
Asp Ala Val Ala Gly Ala Ala Ala Val Gly Ser Ser Ala Arg Ser Asp
690 695 700
Ser Pro Leu Pro Ala Ala Ala Ala Ala Thr Ala Gly Ala Asn Gly Ala
705 710 715 720
Ser Gly Ala Ala Ser Asp Val Leu Ala Gly Ala Gly Ser Leu Ala Leu
725 730 735
Leu Arg His Ser Asp Arg Ser Ala Phe Thr Ala Phe Thr Val Phe Leu
740 745 750
Pro Gly Arg Val Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala
755 760 765
Ala Ala Thr Ser Ala Gly Ala Ser Thr Gly Thr Ala Asn Gly Ala Pro
770 775 780
Pro Ala Pro Gly Thr Ala Leu Ala Ala Ala Ala Ala Ala Ala Ala Ala
785 790 795 800
Ala Ala Ser Ala Val Pro Leu Pro His Pro His Thr Ala Pro Pro Ala
805 810 815
Leu Phe Gly Val Pro Pro Pro Ser Ser Val Pro Pro Ser Ser Leu Ser
820 825 830
Val Leu Pro Pro Val Met Pro Leu His Pro Ala Ala Ala Ala Ala Ala
835 840 845
Ala Thr Ala Gly Gly Gly Lys Pro Ser Asp Ala Ala Thr Tyr Ala Ala
850 855 860
Ala Ala Ala Ala Gly Leu Val Pro Tyr Pro Gly Phe Ala Pro Ala Arg
865 870 875 880
Pro Gly Pro Phe Pro Pro Pro Pro Gly Ser Gly Gly Pro Gly Ala Pro
885 890 895
Pro Val Tyr Ile Pro Glu Ser Val Leu Gln Leu Ile Ala His Leu Ser
900 905 910
Gly Arg Ala Ala Ala Glu Ile Pro Ala Val Pro Ala Glu Ser Val Thr
915 920 925
Ala Ala Pro Val Val Val Gln Lys Ser Gly Gly Pro Ala Ser Ala Ala
930 935 940
Arg Leu Ala Ala Val Ala Lys Tyr Leu Glu Lys Arg Lys His Arg Asn
945 950 955 960
Phe Gln Lys Lys Val Arg Tyr Glu Ser Arg Lys Arg Leu Ala Glu Ala
965 970 975
Arg Pro Arg Val Arg Gly Gln Phe Val Lys Ala Gly Thr Ala Gly Ala
980 985 990
Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Gly Thr Ala
995 1000 1005
Ala Thr Ala Ala Gly Thr Gly Thr Ala Arg Gly Ala Ala Thr Ala
1010 1015 1020
Ser Gly Ala Ala Gly Lys Pro Glu Leu Gln Gly Pro Asp Thr Ala
1025 1030 1035
Glu Glu Ala Ala Ala Ala Thr Leu Leu Ser Ala Ala Ala Ala Met
1040 1045 1050
Ala Ala Ala Ala Ala Gly Thr Ser Gly Pro Ser Gly Ser Gly Ser
1055 1060 1065
Gly Ala Met Asp Val Asp Gly Ala Asp Pro Glu Ala Asp Ala Asp
1070 1075 1080
Val Met Asp Glu Asp Asp Gly Glu Asp Asp Gly Ser Asp Glu Ser
1085 1090 1095
Ala Gly Glu Pro
1100
<210> 55
<211> 936
<212> PRT
<213> Chromochloris zofingiensis
<400> 55
Met Ser Ala Asp Ala Gly Gly Gln Lys Pro Gly Val Ala Glu Pro Gly
1 5 10 15
Ala Arg Thr Gly Pro Gly Phe Ser Val Asn Ser Ser Phe His Val Leu
20 25 30
Leu Val Asp Asp Asp Ala Val Thr Leu Lys Tyr Val Glu Gln Leu Leu
35 40 45
Arg Lys Cys Ser Tyr Glu Val Thr Thr Ala Thr Asn Gly Arg Glu Ala
50 55 60
Ile Glu Val Leu Glu Gly Arg Arg Gly Gln Val His Ile Asp Leu Ile
65 70 75 80
Leu Thr Asp Ile Ser Met Pro Glu Val Asn Gly Val Gln Leu Ile Glu
85 90 95
Glu Val Val Asn Gly Gly Lys Trp Lys Asn Leu Pro Val Ile Val Met
100 105 110
Ser Ser His Glu Ala Gln Ala Asn Val Leu Glu Ala Phe Gln Ala Gly
115 120 125
Ala Ser Asp Tyr Leu Ile Lys Pro Leu Arg Arg Asn Glu Leu Ser Thr
130 135 140
Leu Trp Gln His Val Trp Lys Ala Ser Lys Leu His Gln Pro His Leu
145 150 155 160
His Gly Glu Asp Asp Glu Asp Glu Asp Asp Thr Ala Leu Asp Asn Gly
165 170 175
Lys Phe Asp Ser Ser Ala Gly Asn Asn Lys Gly Ser Ser Gly Ala Ser
180 185 190
Thr Ser Ala Ala Gly Asp Ala Thr Ala Leu Ser Met Ala Asp Ala Ser
195 200 205
Arg Ala Leu Tyr Glu His His Pro Ser His His His Asn His Ile Gly
210 215 220
Glu Pro Ser Ile Asp Thr Gln Ala Ser Gly Gln Val Gly Ser Asn Asp
225 230 235 240
Pro Ser Leu Val Ile His Pro Leu Asp Ile Ser Pro Leu Pro Ala Ala
245 250 255
Ala Pro Pro Leu Ala Val Ala Gly Asp Pro Ala Leu Ala Ala Ala Pro
260 265 270
Leu Gly Thr Gly Gly Gln Asp Thr Pro Gly Ser Gly Asp Glu Gln Ala
275 280 285
Thr Ala Gly Thr Ser Val Gln Gln His Gln His Ser Gln Ala His His
290 295 300
His Ser Arg His Ile Pro Ala Ser Gly Ser Gly Thr Thr Glu His Ala
305 310 315 320
Pro Gln Ser Leu Ser Gln His Pro His His His Asn His Gln His His
325 330 335
His Asn Ser His His His His His Gln His Asp Leu Ala Gln Gln Arg
340 345 350
Gln Gln His His His His His Ser Asn Gly Val Asn Gln Asp His Ser
355 360 365
Gln Pro Asn Pro Asp Leu Thr Gln Met Pro Ser Ala Asp Gln Gln Ser
370 375 380
Leu Leu Thr Leu Pro His Ser Pro Asn Gly Ala Met Pro Leu Phe Lys
385 390 395 400
Pro Ser Thr Ser Ser Ala Ala Met Asp Cys Ser Thr Gln Gln Pro Leu
405 410 415
Gln Gln Gln Gln Gln His Glu His Gly Ser Ser Ser Pro Ala Leu Ser
420 425 430
Arg Pro His Ala Glu Lys Ser Pro Phe Gly Val Arg Tyr Gly Gly Gly
435 440 445
Asn Gly Gly Tyr Ser Ser Ser Met Ser Gly Ala Ser Leu Pro Pro Gly
450 455 460
Leu Gln Glu Leu Ala Val Leu Gly Gln Gln Arg Gln Ala Ala Arg Glu
465 470 475 480
Lys Asp Leu Gln Gln Arg Gln Gln Gln Gln Gln Lys Gln Gln Gln Gln
485 490 495
Gln Gln Gln Thr Ser Ala Leu Arg His Ser Asp Ser Ser Ala Phe Thr
500 505 510
Ala Phe Thr Val Phe Leu Pro Lys Gly Ser Asn Gly Leu Asn Arg Ser
515 520 525
Ser Gly Val Gly Val His Gly Ser Asn Ser Gln Thr Ser Gly Gly Gly
530 535 540
Ala Ala Asp Leu Gly Arg Ser Ala Ser Ser Met Glu Ile Leu Ser Thr
545 550 555 560
Ala Glu Thr Leu Val Gly Gln Thr Ala Gly Gly Ala Gly Val Asn Gly
565 570 575
Val Gly Ser Ala Lys Pro Gly Gly Asp Cys Leu Lys Glu Glu Ser Pro
580 585 590
Asn Asp Ser Thr Pro Ser Ala Glu Glu Gly Asp Glu Gln Asp Val Lys
595 600 605
Pro Pro Gln Ser Thr Ser Gly Ala Ala Ala Ala Glu Pro Ala Val Ala
610 615 620
Thr Ala Ser Gly Arg Ala Ala Thr Ala Ala Ile Ala Val Val Ala Asp
625 630 635 640
Ala Thr Val Ala Lys Pro Asp Ala Pro Val Ala Thr Ser Asp Gln Lys
645 650 655
Gln Val Leu Pro Phe Pro Gly Val Asn Gly Ala Ala His Leu Thr Gly
660 665 670
Met Asn Asn Gly Val Ser His Ser Gly Thr Ala Gly Ser Tyr Ser Glu
675 680 685
Leu Thr Gln Met Leu Tyr Ala Gln Leu Pro His Gln Gly Gln Pro Leu
690 695 700
Pro Asp His Val Met His Phe Leu His Asn Phe Tyr Arg Thr Met Met
705 710 715 720
Glu His Gln His Gln Gln Gln Ser Gln Gln Met Asp Gln Leu His His
725 730 735
His Val Gln Gln Gln Gln Gln Gln Gln Val Gln Gln Gln Gln Arg His
740 745 750
Leu Gln Gln Phe Ala Thr Ala Pro Asn Gly Gln Ala Pro Pro Asn His
755 760 765
Ser Asn Thr Asn Gln His Leu Gln Gln Gln Gln Gln Ala Cys Gly Asn
770 775 780
Gln Pro Leu Gln His Thr Ser Gln Pro His Cys Asn Gly Ala Ala His
785 790 795 800
Leu Gln His Leu Gln Gln Ser His Ser Ala Pro Ser Leu His Thr Pro
805 810 815
Gly Phe Thr Cys Thr Thr Thr Ala Thr Gln Ser Asn Thr Glu Pro Ser
820 825 830
Cys Met Leu Thr Gln Ser Glu Gly Ala Pro Cys Ala Ser Ser Tyr Arg
835 840 845
Ala Ala Ala Val Ala Lys Tyr Arg Glu Lys Arg Lys Asn Arg Asn Tyr
850 855 860
Asp Lys Lys Val Arg Tyr Glu Ser Arg Lys Arg Leu Ala Glu Ser Arg
865 870 875 880
Pro Arg Val Lys Gly Gln Phe Val Lys Gln Glu Val Leu Ala Ala Ala
885 890 895
Gly Leu Thr Ala Leu Ala Glu Leu Ala Thr Ala Asn Lys Arg Ala Arg
900 905 910
Leu Asp Val Asp Tyr Val Thr Ala Thr Gly Met Thr Asp Ala Asp His
915 920 925
Met Asp Thr Ala Glu Glu Ser Ser
930 935
<210> 56
<211> 444
<212> PRT
<213> Coccomyxa
<400> 56
Met Ala Ala Gly Leu Lys Arg Ile Pro Ser Phe Ser Gly Arg Pro Gly
1 5 10 15
Phe Pro Asn Gly Leu Gln Val Leu Val Val Asp Gly Asp Thr Ser Ser
20 25 30
Ser Gln Cys Leu Arg Gln Lys Leu Glu Glu Leu Ala Tyr Glu Val Ser
35 40 45
Cys Cys Ser Ser Gly Ser Asp Ala Ser Ala Leu Leu Arg Lys Glu Asp
50 55 60
Ser Ser Tyr Asp Ile Leu Leu Val Glu Ala Lys Ala Leu Ala Lys Asp
65 70 75 80
Ala Thr Asp Gly Gly Ser Leu Arg Asp Ser Ala Ala His Leu Pro Leu
85 90 95
Val Leu Met Ser Glu Lys Ser Ser Ser Thr Asp Ala Val Trp Arg Gly
100 105 110
Ile Glu Leu Gly Ala Ala Asp Val Leu Glu Lys Pro Leu Ser Ser Leu
115 120 125
Lys Leu Arg Asn Ile Trp Gln His Val Val Arg Lys Met Met Ser Ser
130 135 140
Ser Gln Asp Ser Ser Arg Glu Ala Val Pro Cys Lys Met Glu Pro Lys
145 150 155 160
Ser Lys Gly Lys Gly Val Ser Ala Pro Ser Ser Pro Arg Thr Pro Ser
165 170 175
Pro Ala Ala Ser Leu Leu Thr Ile Ser Ser Gly Thr Met Thr Glu Lys
180 185 190
Ser Cys Lys Gly Gly Gly Asp Glu Ala Ser Phe Ser Gly Val Gly Asp
195 200 205
Val Lys Met Ser Cys Ser Ala Glu Ala Pro Glu Pro Cys Asp Ser Arg
210 215 220
Ala Thr Ala Glu Ser Pro Ala Ser Thr Gln Thr Lys Val Thr Phe Pro
225 230 235 240
Gly Cys Leu Asn Ser Gly Gly Thr Ala Leu Ala Ala Ser Lys Asn Cys
245 250 255
Ser Arg Lys Arg Lys Ala Lys Ala Pro Asp Thr Pro Ala Ser Val Ala
260 265 270
Ser Arg Pro Pro Leu Ala Ile Arg Pro Pro Ala Trp Ala Ser Pro Phe
275 280 285
Gly Pro Pro His Gln Gly Asn Thr His Val Val Gly Met Ala Pro Pro
290 295 300
Gln Cys Tyr Met Gln Gly Val Asp Pro Thr Asn Gly Cys Val Trp Gly
305 310 315 320
Thr Pro Ala Gly Gly Val Ser Gln Ala Pro Ala Tyr Met Pro Gly Trp
325 330 335
Gly Phe Ser Pro Gln Pro Met Leu Ser Gly Ser Phe Leu Gln His Pro
340 345 350
Ser Thr Ser Asp Leu His Lys Cys Pro Ser Val Gly Ala Ser Ser Leu
355 360 365
Ala Ser Ser Leu Asp Ser Ser Leu Thr Leu Cys Gly Phe Gly Ala Asp
370 375 380
Leu Pro Asp Asp Asp Leu Leu Leu Glu Asp Val Leu Leu Pro Asp Glu
385 390 395 400
Asp Leu Leu Asp Leu Ala Pro Asp Glu Pro Ala Thr Met Lys Ala Pro
405 410 415
Glu Gln Pro Pro Ile Gly Leu Lys Leu Lys Lys Ser Ala Ser Leu Ile
420 425 430
Asp Leu Ile Asn Ala Gln Leu Ser Ala Ala Thr Ala
435 440
<210> 57
<211> 1284
<212> DNA
<213> Oocystis sp.
<400> 57
atggcgctga agcgcgttcc tagcttttcc ggtcggccga actttcccgc cggtctgcag 60
atcctggtgg tggacagcga ttcttcctca agggaggctg tagagatgca actcaaatcg 120
cactcctatc tagcaacctg ttgttgcacc tgcggcgagg ctgtggagca gctcggcacg 180
tcaaagtatg acatcgtgct ggcagagtcc aagctggttg ctgcggagtg cgttgactcg 240
acacggttgt gcgaggccgc aagggctctg cctctggttt tgatgtgcga ggactcgacg 300
gcggacgacg tgttgaaggg aatcaggctc ggcgcttgcg actttctgga gaagccgctg 360
tccccactga agctcaagaa catatggcag cacgttgttc gcaagatgat ggagcagatg 420
cacgtccgcc gcacggacga cgcggatacg tgcactacta agagcagccg cgaccaaagc 480
tgcgcgatca agggcaagtc ggtggcttcc acgccctcgt gtcccaagac accttctccc 540
gcggcttctg gcgcagacat cggctgcagc atagccacgt cggtcagcaa ggccggggac 600
gtggtcggcg agtccagcag ttccgagacg cgcaaggagc attgcagcga gaccacggag 660
tgctccgacc tcaagagctg cgccgcaaag tcagctgtgt cggcgcaaac gccggtatcc 720
accgcgaccg ttgcagctac ctggggtgcg tcgaagaaga agtcgacagc atcagccact 780
accagcagtg tcagcaaccg gccgccgctg gcgatcaaga tgccggcgcc agctgtggca 840
tacacgtcag ggcttgcgcc ctttccgccg ccgatgtttg tacctggcgg ctggggccag 900
tcaagcaacc catgcgtggt gggcacgcca atgatgccac cgccgcccgg catgggcatg 960
ccgccccacc accacgcgcc ctatggccag gtgccgccgc cgggctatcc agtcgcatgc 1020
atgcccagcg cctttgtgcc gacgccgatg ggccctggcg gcgtggcgtt tgcgccgcca 1080
cctggcgcca gctgcacatc tgctgcgtac tacccccatc ctgctgtgga tgcaagcgcg 1140
tctgcaactg ccaccttcac gggccatgtg cagatcgacc tgactaacgt gtctgctgaa 1200
gagccggcgc ccattggttt ggcgctgcgc aagaccgcgt cgctgctcga cctggtcagc 1260
gatcgcctgg gccagcgtgc gtgc 1284
<210> 58
<211> 1026
<212> DNA
<213> Tetraselmis sp.
<400> 58
atgctgtgcc ctgctgtcca ggttgccacc atggccactg tcctggcttc cacgcatttt 60
tcggagcgcc ccagcttccc ggctgatctg gaggtgctgc ttctggattc agcaacgcag 120
ggcgcagaaa ctgcctcgaa gctgttgctg tcgtgttcct atcgtgtcac cgtgtgccga 180
tccgtgtctg aggctctgag ccacatggca tgcaaggctt tcgacgtggt cctggtggag 240
cagaaacttt tcagcggcag ggatgcggcc gctgcgcagc tcaaggccct ggcaggcgtc 300
atccccaccg tggtcctgag tgacagcggc agtgcgaagg atacctgggc tgccatcgtt 360
gggcaggcag ccgatgtcct catccgcccg ctgaccaagc agaagctgca gacgctgtgg 420
cagcacactg tccgtatgca gcgcgcagca tcttcggctt cggcggctac tagcatggtt 480
gccaagcctg ttgccgtgct ctcctcggct ctgaagcccg ctgcttccag tgcttcactg 540
gacaaggggc agaagcgcaa gttgaaggat catatgatgg ggcccatcat ggcacacccg 600
caagtgtcca accctggctt tatctggggc gcaccagtga tgggcgttcc ggctggacag 660
caggctcccc agaagtcaga ggccccggtc accccccaga agccaggctc agagatgcac 720
cccgagctgg atgccacaag ccacatcgcc atgggctcca gcgacaactt caacgtacct 780
gtgtatgaaa gcggcactga cagccaggag tcgcagccaa cctgcgaccc cacctctctt 840
gatgacatca atgaggatga ctacgcgttt atcgatttcg cgctcagcga ttcttttccc 900
actgtggagg aggatgagat ccttccaccc attggccttt cgctgaagaa gtccagctcc 960
ctcctgaaca tgctgaacgg tgtgcttctc tcggctcact ctgtaccgct gcagctgccc 1020
cagtag 1026
<210> 59
<211> 2076
<212> DNA
<213> Glycine max
<400> 59
atgggagagg tggtcatcat gagtggagag aagaagtcag ttagagtgga gggggtggag 60
aaggaagata gtggtggaag tgggagcaag gctggtgaat ttaaggggtt gatgaggtgg 120
gagaagttct tgcccaagat ggttttgagg gtgctgttgg ttgaagcaga tgattccaca 180
agacaaatta ttgccgcgct tctcagaaaa tgcagctaca aagtggttgc tgttcctgat 240
ggcttgaagg catgggaatt actcaaggga agaccgcaca atgttgatct aattctgaca 300
gaagtggatt tgccatccat atctggctat gcacttctca cattaattat ggagcacgag 360
atttgcaaaa acatccctgt tataatgatg tcttcccaag attcaattag cacagtatac 420
aaatgcatgt tgagaggtgc tgctgattat cttgttaagc ctattagaaa aaatgaactg 480
aggaacttgt ggcaacatgt ttggagaaga caatcatcaa ccactggtat taatggcctc 540
caagatgaga gtgttgcaca acagaaggtt gaagccactg cagaaaataa tgctgctagt 600
aatcgttcaa gtggtgatgc tgcttgcatt cagagaaata tagaactaat tgagaaggga 660
agtgatgcac agagctcttg taccaagcct gactgtgaag ctgagagtga ccctgtcggt 720
aacatgcagg aattttctct gctgaaatgt ggggaagcat atccaagtgg aacagagaca 780
caacaggttg aaacaagctt tcgcttaggc cagacattaa tgatgcatga ctgtcatgct 840
ggaggattaa atgtgagtat ccgcaaaaat ggtgaggcaa gcacgactaa tgacaaggat 900
actgatacag agcattttgg gaatgctagc atcagtggtg aggctcatga caatccctat 960
gttcaaatta actcttccaa ggaagctatg gacttgattg gagcatttca tactcatcca 1020
aactgttccc tgaaaaattc cacagttaat tgcacaggca actttgacca ttctcctcaa 1080
ttggatcttt ctttgagaag atcttgtccc ggaagctttg agaataaact cactgaagaa 1140
aggcacaccc tgatgcattc taatgcttca gctttcaagc ggtatactac caggcaattg 1200
caaatatcaa tgcctgcagt gttaattaac ttctctgatc aacaaagaga acagataaca 1260
aattgtgaga aaaacatctc acacatcgct actggcagca actcagatag ttcaacacct 1320
atgcaaagat gtattgtgtc tccaactaca gtccaatcaa aagaatctga acttgcaacc 1380
tcacaccccc cgcaaggaca ttctctccca attccagtaa agggtgtaag gttcaatgat 1440
ctatgcacag cctatggttc tgtacttcct tcagtgtttc atacacagtc aggtccacca 1500
gcaatgccaa gtccaaattc agttgtgctc cttgaaccaa actttcaagt aaatgcattt 1560
tatcagtcaa atatgaaaga gagtagttca gagcagcttt atgaatctcg tggtccaaat 1620
ggaaacacca cccaaaacca cattgtgtac acacaggagc acaaatcaga acatgcagaa 1680
gatcgaggac atatctctcc tacaactgat caaagtgtgt caagtagttt ctgcaatgga 1740
aatgcaagcc atcttaacag cattggttat ggaagcaact gtggaagtag cagcaatgtt 1800
gatcaagtta acactgtttg ggcagcttca gagggaaagc atgaagacct cacaaataat 1860
gcaaactctc atcgatctat ccaaagagaa gcagctctaa acaaatttcg cttgaaaagg 1920
aaagagagat gctatgagaa gaaggttcga tacgagagca gaaaaaaact agcagagcag 1980
cgtcccagag ttaaaggaca atttgttcgt caagtgcatc ctgatcctct tgttgcagaa 2040
aaagatggca aagaatatga tcattcagat ttctga 2076
<210> 60
<211> 2244
<212> DNA
<213> Vitis vinifera
<400> 60
atgggtgagg ttgtggtgag cagtgaggca ggaggaggag gcatggaggg tgaggtggag 60
aagaaggagg tgggcagtgg ggttgtgagg tgggagaggt ttcttcccag aatggttctc 120
agggttttgt tggttgaagc ggacgattcc accaggcaaa ttatcgctgc gcttctcagg 180
aaatgcagtt acaaagttgc tgctgttcct gatggcttaa aggcatggga ggtactgaag 240
gctagacccc acaacattga cctcatattg acagaagtgg agttgccatc aatatctggc 300
tttgctctcc tcaccttggt tatggaacat gagatctgca aaaacattcc tgttataatg 360
atgtcctcac atggttcgat aaacacggtt tataaatgca tgttgagagg tgcagctgac 420
tttcttgtta agcctgttag aagaaatgag ctgaagaatt tgtggcaaca tgtctggaga 480
agacaatcgt caactgttag cggaaatggc ccccaagatg agagtgttgc acaacagaag 540
gtcgaagcca cttctgaaaa caaccccaca agtaatcact caagtgatca tgttgcttgt 600
attcagaaaa ataaggaagc actcaataaa gtgagtgatg ctcagagctc ttgttcaaag 660
ccagacttgg aagctgagag tgcctacatg gaaactatgc aggatttctc aaatccgaca 720
tggagcagat ctcttgtgag tgacacaaaa atgcagaaga atgaagaatg tgccaaattg 780
ggcccgaaat ttcttatgca caataaagaa gctgggggaa cactggaggc tgcctgcagg 840
gatgtgaaca caatgactca gcctgaagca gtggaaccag aaaatgatgg gcaaggtgct 900
aacgctccta gtgaggcttg tggtaacaat gccatattgg gcagctcatc tagagaagcc 960
atcgacttga ttggagtatt tgataattct aaaaaatgca cttatggaaa ttcttcttca 1020
aataatggca ccaaaaagag tgattctatt ccacagttgg acctttcctt gagaagatct 1080
catcctagta gccctgagaa tcaagttgct gatgaaaggc atacactgaa ccattctaat 1140
ggctcggcct tttcacgcta cataaacagg tcattgcagc caccacatct accatcaaca 1200
ggtgttttca atcagcagaa aaactttgga gctgattctg ataaacgttt atctcagctg 1260
gttactggtt ataactctga tattactagt cccacactga gtactcaaag aagtgtgatc 1320
tctctagcta ctagtccatc tggacgagtt gaaattgcac tttgtggccc tcaacagaga 1380
gcttttcctg ctccagttcc acaaaatgcc aacaattcca ccagccagac taatcacaag 1440
ccagagcaca aattggactc actggagggt caagggcact tctctcctgc cactgatcag 1500
aattcaagta gtagttttgg taatggtggt gcaagtaatc tgaatagctt tgggtgtgga 1560
agcatttgtg gaagtaatgg gaatgccaat acagttgctg ttgttcaggc cgctgcagag 1620
ggcaagaatg aagaaggtat cttcagtcat gaaggacact ctcaacgatc tatccaaaga 1680
gaagctgctc taaccaagtt tcgcttgaag cggaaagaca gatgctttga gaagaaggtt 1740
cgttatgaaa gcagaaagaa gcttgcagag cagcgacccc gagtaaaagg acagtttgtt 1800
cgacaagtgc ataccatccc cccacctgca gagcctgata catactatgg cagttcgttt 1860
gatgttcagc ctcaaagaag ccgatatcta tcagctcaac ctctcagggc ctcatcttct 1920
caactcctct atccaactca cactcctctc caagaatcca aatacgaagg tcatgaagaa 1980
agcaatctct tgacggcgtc cttggttgga actgccctac cggtggctcc atcttttggt 2040
tatgaagttg gacgtgatca gacggcagga aaacttgttc tgagtttaaa gctcgatggc 2100
cgggttcgat ggaaggtggg gacttgggtt tctggccgat accgacttaa cgttaattgt 2160
gttgctgtga tggcatttgg accctccatc ccatctggtc cactgagttc aaaagaagga 2220
actcagtgct ctactactgt ttga 2244
<210> 61
<211> 2400
<212> DNA
<213> Theobroma cacao
<400> 61
atggggatag ttcaaatgaa taataatggt cctgtggcca atgggttggt tgaattgaat 60
acacatattc atgatgagca caagaaaata aggggtgggg tcataggtga ggggcagggc 120
ctctcagtgg aagaagagtc atggattaat gaggatgtgg aagacaggaa tgatgggaag 180
acagagttgg ttcaggttca gggccatgcg catggtgagc aagagaggtc acagcaacag 240
cctcaaggtc ctttggttca ctgggagagg tttttacctc taaggtcttt gaaggttcta 300
ctggtggaaa atgatgactc aactcgccat gttgtctgtg cattgctgcg aaattgtgga 360
tttgaagtta ctgctgtgtc aaatggactg caagcttgga agatcttgga agatctaacc 420
aatcatattg atcttgtttt aactgaggta gtgatgcctt gtttgtcagg cattggcctt 480
ttatgcaaga taatgagcca caaaactcgc atgaatattc cagtgattat gatgtcatct 540
catgattcta tgagtacagt ctttaggtgt ttgtccaagg gtgcagttga ctttttagtg 600
aagcctatac gaaagaatga gcttaaaaat ctttggcagc atgtttggag gaaatgccac 660
agctctagca gtagtggagg ccaaagtggt acacagaccc aaaaatcctc aaaatcaaaa 720
ggtactgatt cagacaacaa tactggaagt aatgatgagg atgacaacgg cagtgttggt 780
ttgaatgttc aggatggaag tgacaatgga agtggcactc agagctcatg gacaaagaga 840
gcagtagaag tcgacagctc ccagccaata tcaccatggg accagttagc tgatcctcct 900
catagcactt gtgcccaggt tatccattct agacatgaag tgttaggtga cagctgggtt 960
ccagtaacag cgacgaggga gtatgatgag ctggataatg aactagaaaa tgttgttatg 1020
ggcaaagact tggagatagg ggtacctaaa attacagctt cgcagcttga agacccaagt 1080
gaaaaagtaa tgaccaacat agctggtgtt aataaagaca aattatctgc aataaaccct 1140
aagaaagatg atgagaaact agagaaagcg caattggaac ttaacagtga gaaatcaggt 1200
ggtgatttga gaaatcaagc tgctgacctg ataggtgtca tcaccaataa tactgaacct 1260
catatagaaa gcgcagtctt tgacatccca aatggcctcc ctaaggtctc tgatgcaaaa 1320
gagaaggtga actacgacac gaaggaaatg ccttttcttg agctcagttt aaagagactg 1380
agagatgtag gagacactgg aacaagtgcc catgaacgaa atgtattgag acattcagac 1440
ctttcagcct tctcaagata caattctggt tcaactgcca atcaggctcc aacaggaaat 1500
gttggtagtt gttctccact tgataatagc tcggaggcag ttaaaacaga ttctatgaag 1560
aattttcagt ctacctcaaa tagcatacct ccaaagcaac agtccaatgg aagtagtaac 1620
aataatgaca tgggttccac cactaataat gccttcagca aaccagcggt actcagtgac 1680
aagccagcac ctaaaacttc agctaaatct ttccatccct cttctgcctt ccaaccagtg 1740
cagagtggcc atggttctgc cctgcaacct gtagcacaag gtaaggctga tgctgcacta 1800
ggtaacatga ttttagttaa agcaaggggc acagaccaac aggggaaagt gcagcatcac 1860
catcatcatt atcaccacca ccaccaccac catgtccata acatgctccc aaatcaaaag 1920
ttaggtaacc atgatgattt atctttggaa aatatggcag cagcagctcc ccagtgtggg 1980
tcatccaatc tgtcaagttt accacatgtt gaaggcaatg ctgctaacca cagtttgact 2040
agaagtgcat caggaagtaa ccatggaagc aatggacaga acgggagcag cactgtgtta 2100
aataccagag gaatgaatct tgaaagtgaa aatggggtgc ctgggaaagg tggagctggc 2160
ggtggaattg gatctggagg caggaatgta gttgatcaaa accgttttgc tcaaagagaa 2220
gctgctttga acaaattccg ccagaaaagg aaagaaagat gctttgagaa gaaggttcga 2280
tatcagagca gaaagaaact ggctgagcag agaccacgca ttcgaggaca gtttgtgcga 2340
cagattagca ctactgggaa ggaagcattc agatttcgtg gtgcaggatt gtgcacttag 2400
<210> 62
<211> 2229
<212> DNA
<213> Oryza sativa
<400> 62
atgatgggaa ccgctcatca caaccaaacc gccggctctg ccctcggagt cggagtcgga 60
gatgccaacg acgccgtgcc tggggctggg ggtgggggct acagcgaccc ggatggcgga 120
ccaatctccg gtgtgcagcg gccaccgcag gtctgctggg agcgcttcat ccagaagaag 180
actatcaaag tcttgctagt tgatagcgat gactccacca ggcaggtggt cagtgccctg 240
cttcgtcact gcatgtatga agtcatccct gctgaaaatg gccagcaagc atggacatat 300
ctagaagata tgcaaaacag cattgatctt gttttgacag aggttgttat gcctggtgta 360
tctggaattt ctctattgag taggatcatg aaccacaata tttgcaagaa tattccagtg 420
attatgatgt cttcaaatga tgctatgggt acagttttta agtgtttgtc aaagggcgct 480
gttgacttct tagtcaagcc catacgtaag aatgaactta agaacctatg gcagcatgtg 540
tggagacggt gccacagctc cagtggcagt ggaagtgaaa gtggcattca gacacaaaag 600
tgtgccaaat caaaaagtgg ggatgaatcc aataataaca atggcagcaa tgacgatgat 660
gacgacgatg gtgtaatcat gggacttaat gcaagagatg gcagtgataa cggcagtggc 720
actcaagcgc agagctcatg gacaaagcgc gctgttgaga ttgacagtcc acaggctatg 780
tctccagatc aattagctga tccacctgat agcacttgtg cacaagtgat ccacctgaag 840
tcagatatat gcagcaatag atggttacca tgtacaagca acaaaaattc caagaaacaa 900
aaagaaacta atgatgactt caaggggaag gacttggaaa taggttctcc tagaaattta 960
aacacagctt atcaatcctc tccgaatgag agatccatca aaccaacaga tagacggaat 1020
gaatatccac tgcaaaacaa ttcaaaggag gcagcgatgg aaaatctgga ggagtcaagt 1080
gttcgagctg ctgacttaat tggttcgatg gccaaaaaca tggatgcaca acaggcagca 1140
agagccgcaa atgcccctaa ttgctcctcc aaagtgccag aagggaaaga taagaaccgt 1200
gataatatta tgccatcact tgaattaagt ttgaaaaggt caagatcgac tggggatggt 1260
gcaaacgcaa tccaagagga acaacggaat gttttgagac gatcagatct ctcggcattt 1320
acgaggtacc atacacctgt ggcttccaat caaggtggga caggattcat gggaagctgt 1380
tcgctgcatg ataatagctc agaggctatg aaaacggatt ctgcttacaa catgaagtca 1440
aactcagatg ctgcaccaat aaaacaaggt tctaatggta gtagcaataa caatgacatg 1500
ggttccacta caaagaacgt tgtgacaaag cctagtacaa ataaggagag agtaatgtca 1560
ccctcagctg ttaaggctaa tggacacaca tcagcatttc atcctgcaca gcactggacg 1620
tctccagcta atacaacagg aaaagaaaag actgatgaag tggctaacaa tgcagcaaag 1680
agggctcagc ctggtgaagt acagagcaac ctcgtacaac accctcgccc aatacttcat 1740
tatgttcatt tcgatgtgtc acgtgagaat ggtggatccg gggcccctca atgtggttca 1800
tccaatgtat ttgatcctcc tgtcgaaggt catgctgcca actatggtgt caatggaagc 1860
aactcaggca gtaacaatgg aagcaatggg cagaatggga gtacgactgc tgtaaatgct 1920
gaacggccaa atatggagat cgctaatggc accatcaaca aaagtggacc tggaggtggc 1980
aatggaagtg gaagcggcag tggcaatgac atgtatctga aacgcttcac tcaacgagag 2040
catagagtgg ctgcagtgat caagtttaga cagaaaagga aagagcgcaa cttcggaaaa 2100
aaggtgcggt accagagcag aaagaggctg gccgagcagc ggccaagggt ccgcggacag 2160
ttcgtgcggc aagctgtgca agaccaacaa cagcagggtg gtgggcgcga agcggcagcg 2220
gacagatga 2229
<210> 63
<211> 2301
<212> DNA
<213> Zea mays
<400> 63
atgggcagtg cttgccaagc tggcacagac gggccttccc gcaaggatgt gttagggata 60
gggaatgccg ccttagagaa tggccaccat caggctgaag ctgacgcaga tgaatggagg 120
gaaaaggaag aggacttggc caacaacggg cacagtgcgc caccgccagg catgcagcag 180
gtggatgagc ataaggagga acaaagacaa agcattcact gggagaggtt cctacctgtg 240
aagacactga gagtcttgct ggtggagaat gatgactcta ctcgtcaggt ggtcagtgcc 300
ctgctccgta agtgctgcta tgaagttatt cctgctgaaa atggtttgca tgcatggcga 360
tatcttgaag atctgcagaa caacatcgac cttgtattga ctgaggtttt catgccttgt 420
ctatctggta tcggtctgct tagcaaaatc acaagtcaca aaatttgcaa agacattcct 480
gtgattatga tgtctacgaa tgattctatg agtatggtgt ttaagtgttt gtcgaaggga 540
gcagttgatt tcttggtaaa accactacgt aagaatgagc ttaagaacct ttggcagcat 600
gtttggaggc gatgccacag ttccagtgga agtgaaagtg gcatccagac acagaagtgt 660
gccaaactaa atactggcga cgagtatgag aacggcagtg acagcaatca tgatgatgaa 720
gaaaatgatg acggcgacga tgacgacttc agtgttggac tcaatgctag ggatggaagt 780
gacaatggca gtggtactca aagctcatgg acaaagcgtg ctgtggagat tgacagccca 840
caacctatat ctcccgatca actagttgat ccacctgata gtacatgtgc acaagtaatt 900
caccctagat cagagatatg cagtaacaag tggttaccga cagcaaacaa aaggaatgtc 960
aagaaacaga aggagaataa agatgaatct atgggaagat acttaggaat aggtgctcct 1020
aggaactcaa gtgcagaata tcaatcatct ctcaatgatg tatctgttaa tccaatagaa 1080
aaaggacatg agaatcacat gtccaaatgc aaatctaaaa aggaaacaat ggcagaagat 1140
gattgtacaa acatgcctag tgcaacaaat gctgaaactg ctgatttgat tagctcaata 1200
gccagaaaca cagaaggcca acaagcagta caagccgttg acgcaccaga tggcccttcc 1260
aaaatggcta atggaaatga taagaatcat gattctcata tcgaagtgac accccatgag 1320
ttgggtttga agagatcgag aacaaatgga gctacagcgg aaatccatga tgagcgaaat 1380
attctgaaaa gatcagatca gtcagccttc accaggtacc atacatctgt ggcttccaat 1440
caaggtggag caagatatgg ggaaagctct tcaccacaag ataacagttc tgaggccatg 1500
aaaacggact ctacatgcaa gatgaagtca aattcagatg ctgctccaat aaagcagggc 1560
tccaatggca gtagcaataa cgatgtggga tccagtacaa agaatgttgc tgcaaggcct 1620
tcgggtgaca gggagagagt agcgtcacca ttagccatca aatctaccca gcatgcctca 1680
gcatttcata ctatacagaa tcaaacgtca ccagctaatc tgattgggga agacaaagct 1740
gatgaaggaa tttccaatac agtgaaaatg agccacccaa cagaggttcc acaaggctgc 1800
gtccagcatc atcatcatgt gcattattac ctccatgtta tgacacagaa acagccatca 1860
acagaccgtg gatcatcaga tgttcactgt ggttcgtcaa atgtgtttga tcctcctgtt 1920
gaaggacatg ctgcaaacta cagtgtgaat gggggtgtct cagttggtca taatgggtgc 1980
aatgggcaga atggaagtag cgctgtcccc aatattgcaa gaccaaacat agagagtatt 2040
aatggtacca tgagccaaaa tattgccgga ggtggcattg taagtgggag tgggagtggc 2100
aatgacatgt atcagaatcg gttcctgcaa cgagaagctg cattgaacaa attcagactg 2160
aagcggaaag atcggaactt tggtaaaaag gttcgctacc aaagcaggaa gaggcttgct 2220
gagcagcggc cacgggtccg aggacagttt gtgcgacaat ctgagcaaga agatcaaaca 2280
gcgcaaggtt cagaaagatg a 2301
<210> 64
<211> 2034
<212> DNA
<213> Physcomitrella patens
<400> 64
atgccatatc tgtccggagt tgggcttctg tcgaagatga tgaagcggga agcatgcaag 60
agagtgccta ttgtcatcat gtcatcgtac gacagtcttg gcatcgtgtt ccgctgcctc 120
tcgaaaggag cttgcgacta tctcgtgaaa ccagttagga aaaacgagtt gaagaatctg 180
tggcagcacg tatggaggaa gtgccacagt tcgagtggga gcagaagtgg aagcggaagc 240
cagactgggg aagtagctaa gcctcggagt cgtggtgtag cagccgctga caatcctagt 300
ggaagcaatg atgggaatgg cagcagtgat gggagtgata atgggagcag ccgggtaaat 360
gcccagggtg gaagcgacaa tggtagtggc aatcaagctt gcatgcaacc tgtacaggtt 420
ctgaggaaca gcgcaattcc agaagcagta gacggggatg aggaggggca ggcgacatcg 480
caagataagg gtgctgactt ggatggagag atggggcatg atctggagat ggcaactcga 540
aggtctgctt gtgttaccac cggaaaagat cagcaaccag aggatgccca gaagcaagat 600
gaggatgctg tatgtatctt gcaagatgcg gggccatcac ctgatggggc taatgccgag 660
agcccatcat ctagcggtcg gaatgatgcc gcagaggagt cttctccaaa gatcattgac 720
ctgataaacg tcatagcgtg tcagccacag acccaggatg cagaacctca agaaagtgag 780
aacgatgacg aagaattgga tccgcgggga aggagcagcc ctaaaaacaa ctccgcttca 840
gattccggta cttcgctgga gttaagtttg aaacggccac gatcggcggt tggtaacggc 900
ggagaattag aagagcgtca accactgcga cattcaggag gctcggcctt ttctaggtat 960
ggcagcggag gaaccattat acagcaatac catcagactg gaggttcact ccctctcagt 1020
ggttatcctg tgtctggtgg atatggtgta tatggcatgt ccggcggtag ccctggagga 1080
tctcttcgtc tgggaatggg aatggatcga agtgggtcat cgaaaggaag tgtagagggg 1140
actacacccc caccctcgca tcctcagagc atggagaaag tgggtgggca agatgggtac 1200
ggcaatgcaa gacagactac ggaggatgca atgatcgtac ctggaatgcc catggctatt 1260
cctctcccac cacctgggat gcttgcatat gatggcgtta ttggaacgta tggtccggcg 1320
atgcacccga tgtattatgc tcaccctagc gcgtggatgg cagctccgtc tcgtcacatg 1380
ggagagcggg gagatgtcta caatcaatct cctgcatttc aagagcagga ttctgggtct 1440
gggaatcatt ctcaagcggg gcagactcac cagcacatgc accaccacca aggcaaccag 1500
caccaccatc atcatcacca tcaccaccat gggagtggcg cccagccttc tggaaatgca 1560
ggggtgcaag atgaacaaca gcaatcagtg gtaccgcctg ggtcgagtgc tcctcgctgc 1620
ggctcgaccg gtgtggatgg tcgaagtggt agcagcaacg gctacgggag caccgggaat 1680
gggaatgggt ccatgaacgg aagtgcttcg ggaagtaata ctggcgtgaa caacggtcag 1740
agtggatttg gtgcgacgcc gatgttaact gacaacagtg ggagtaacgg cgtcggtgga 1800
acggatgcag ccatggatgg ggtgagtggg ggcaatgggc tgtgcacaga gcaaatgcgt 1860
ttcgccagac gagaggctgc cttgaataag tttaggcaga agagaaagga gcgatgcttt 1920
gagaagaagg tgcgatacca aagcaggaaa cggcttgcag aacaaagacc acgagtccgc 1980
ggtcagtttg tgcggcaagc ggtacatgat ccgtctgctg gtgacgccga atag 2034
<210> 65
<211> 4080
<212> DNA
<213> Volvox carteri
<400> 65
atggagttcc acgtactgct ggtcgaagac gacagggtga cgctgaagac agttgagcag 60
ctactccgga aatgcaatta caaagttacc tgtgcagcaa atggacggga ggcaataaag 120
gtccttactg cctgccggca cagcggcgtc aaagtggacc ttattttgac cgatatactg 180
atgccggagg ttaccggctt tgacttaatc aatgaagtgg tacatgggga caccttttgc 240
gatgtgccag tggtcgtcat gtcctctcaa gactcgcagg agaacgtgtt acaggcattc 300
caagcaggcg ctgccgacta ccttataaag cccattcgca aaaatgagct ggctacgctc 360
tggcagcatg tctggcgcgc caacaaggcc aaggggtccg gcagcggcac caccactaac 420
gtcaccgggc agcccctttc cggtcgggag gatctggagg caggcgaagc cgtcgctgtc 480
gccgccgccg ccgccgctgc cagcggcaag gcctgtgcag caacgcatgg gcatttgaag 540
gacagcagcg gcggcagcag cggcgccgcc gcttctgtat tgcagtccac gggcggaaca 600
ctactgccgg accgtgctgc cactgtacgg tatccagctg cggcggcagc gccaccgcca 660
cctggcgcat ccgagctatc agggaacgtg acggcgggcg aagctcaagg gagccgtacg 720
cagcatctgc gccatctgtc cggcttggcg gggatggaaa gcacagcggc gacgtcagcg 780
gcggcgcaag gcagtagcgc agcagggccg ctgcggggct gcggcggtgc tggtactgct 840
atagctggtg ggccgcgcgc gcccttgggc ccactttcat tcgcgccctt cggcacttcc 900
gttgccgtac actttgacct gaaccccgca tccggcgcag ctcgacggct ggtcaactcc 960
agcggcgcca tcgatgcgtc gacgggcagc ggcactgctg gcgtcgccgc ttcatcgcgt 1020
tgcgccgccg gcacctccgc caccgtcatc agttggtcgc acgtcgatcc gacggagacg 1080
gacccagcgg aggcggagcc catgtacgac acgaacgcgg acgccaccgc ggcgaaggca 1140
gcggctgacg gtgtggcgga agctgacgac gacgatgttg gcgacgacgg cggtgctggg 1200
cccaaccaca atgacgatga tgacgagggt ggcggcgacg acgacgtcag cggcgacggt 1260
gacgaggacg gaaaccggcc tcgcaagcgt ccgcggctgc ttcagggatc ctcgcatcac 1320
cacagccacc agcatcgcct tcacagccta ggcggtacga ctaccaacac caccaccact 1380
acgacagccg cgaagcctaa gtcgacagcg ggagaacgcg gcggcgcggc ggcgctactc 1440
gcatgccgta ctgcggcggc cgcaccccta cgcggcagtg gctgcgccac cgctggcgcc 1500
accggagcat gtcgactggc ggcggcggca gcggcggcgg agggctccca gggttctcgc 1560
gccgcgtcgg cgtcggcagg ccctgacggc ggcgcgcgtg agagtacggc tacccccagt 1620
ggtgacacct ttgcagagag cccgtccgcg tacactgcaa ccgccacaac gaccagtacg 1680
gcaacaacca gtacgacaac gggatccggg attgagatgc aggacgacga gcaacagcag 1740
cgacagcagc ctaagcagcg tccgccggca tctcagccgg aactggaggg tcatcatcac 1800
caacaacaat atcaccatta ttatcgacgc accagcctgg agggcggttg cgccaacgca 1860
ccccctctcc ctgtcccttc atctgcacgg ggtgcttccc cggcaggcac gggtccgacg 1920
gaaagcggct ccgggaggga tagcggctgt gccaggatta caaatggtac ggcggcgggg 1980
gcgacggcgg caatgccgcc atctcacgtc agctcggcaa gccccccccg ctgtaccgcc 2040
acttccgcgg cggcgactcg cgggtcctct ggtgctgcta ctgcggcagc gggtgccatg 2100
acaacagcct tggcgacggc cggcagctat ccgcgaggag tggacgccag cccgccgccg 2160
aatagaagta tggggtccag cggcggtgat ggcggcggaa ccgccgctgc agctgccggt 2220
acggcacgag ggagctcgcc tgcggctgct acgccgccgc tgccaccttc tacgcagcag 2280
cacgggttgc cgcatcccgc ggcggcgccg ccgccgggcg ctgcatcgcc tggcggcgcc 2340
gtgacgctgc cgccagcgct tcaggagctg gcggcactgg gggcggcccg ccatgcgggg 2400
ctatggaccc agcgggcctt attgcatcag cagcaattgt tgctgcagca gcagaagcag 2460
cagaagcaac aacagcacca acaagaccag gtagtggggg cagagaagat tcatggtggg 2520
tcgacgtcgg ctgtagccaa cgccgccgag cagcagcagc agcagccgct gggggcggcg 2580
gcggcacgtc gtcccagcaa agcgggcgtg gacggaactg aggcgggaag tggcgcggtc 2640
ggcggatgcg catcggcgac agcggcggtc atggcgatgg aggcgtcgga gccgcatggc 2700
gcggttggca gctcctttac ggcggcagat cggcaggaga cgccgttgca gcctctgcat 2760
gctgaatctg cggcggcagg cggcgacatg gacggcaacc gcagtacacc cgcaactatg 2820
ccgtcggggc ctacggcagc cgcatcgggc ccttcgcaga cgtcgaacag cttgacggtg 2880
ctgcgacata gcgacagatc cgctttcacc gcattcaccg ttttcttgcc aagcagggtt 2940
gccggcgccg cggcggcggc ggcggcggca gcagctgctc ggccgccacc accgccggcg 3000
ccggtgcagc cgccggcgcc aatcttcacg caccctgctg ctgctgctgc agccgcggcg 3060
gcggctgccg ctggcagcgg cggtgcagcc tcagtgtggt atcctcacct ccatcatcac 3120
caccactact tgcagcagca gcagacgcac atgggtccct tgccgccact gccaggtgcc 3180
gtacatgttc tgccgtcgat catgcagctt cacatgggag tactggcgcc agggccgccg 3240
ccacagcagc agcagcagca gcaccttcag gccaaggcgc ctcagaagcc tcatgattcc 3300
gccgccgccg ccggcggagc taacggctcg ctaggtcccg cgacatcggc tgcagcggcc 3360
acgcacatgt cgtacactgg catgcaacag cgcccgggcg cctcatccgc caccaccacc 3420
agcgccggcg ctgtagcgtt cggtcaatct ccacctcacg ggctggcggc ggcggcggcc 3480
gccgctagca cgcctccgcc gcctccaccg ccgcctgttt gtattcccga atcggtacta 3540
cagctcattg cgcatctgtc tggtcgggcg gcggcggagc tgcccgtacc ggaaaccgtc 3600
acgacggcac cgttggtcgt acagaaggcg ccgtcggcag cgcgattggc tgctgtagcg 3660
aagtaccttg aaaagcggaa gcaccgaaac ttccaaaaga aggttcggta cgagagccgt 3720
aaacggctgg cggaggccag gcctcgcgta cgcggccaat tcgtcaaggc aagtacttcc 3780
gcggtggcgg caaccacccc tgccgccacg ggcgccaccg tcacctctac gtcgctccgt 3840
cagcccgttt atacggcggc cggcccggct ggcctggcgc tgccgccggc ggcggcagca 3900
gcggcggcca gcgccgccgc cgcgaggggg gttccgccgc cgtcatcccg catcggagcg 3960
gtggagctgg cggagttggt gcccgaccac gacgccgaca ttgaggacga ggggtgtgac 4020
gagcccgccg ccgccgagga ctccgacggg tccgtcgcgg tggagctggc ggaggtgtag 4080
<210> 66
<211> 3309
<212> DNA
<213> Chlamydomonas reinhardtii
<400> 66
atggaggcta acggcttcca cgtcgtatta gtcgaggatg ataacatttg cctgaaagtg 60
gtggagcagc tgctgcggaa gctttcgtac agagtcagca ccgcatccga tggtgccgca 120
gcgctcaaag tcctggctga ctgcaagcag aggggcgaca aagtagacct cattctcacg 180
gacatcctga tgccagaggt taccgggttt gacctcatca acgaggtcgt gcatggagag 240
acctttgccg atattccggt cgtggttatg tcgtctcaag actcgcagga aagtgtcttg 300
caggcatttc aggcgggcgc agcggactac ctcatcaagc ccattcggaa aaatgagctt 360
gcaacgctct ggcagcacgt ctggcgtgca aaccgcgcca agggtggaca gaccagcagc 420
ggcgccgcgc atgtgggcgc aggcggcagg gggggcacca gcagccgcga tggcggtggc 480
gttgccggga cgcggtgcgg cccaggcgac cgcggcggca gcggcggcga cgctgagggt 540
agtgggctag gcggcggcgc gggtgcagtc aaggacagca gcggcggcag taccggcgcc 600
gccacttcag tgctgcactc cactggtggc acgacgctgc cctcacgtgc ggccaccggt 660
cggcacgcta gcacctcagc tggacacggc gtcaccagcg ctgaccccag caacaaccaa 720
acctcgcacg cgcacgcgca tgcgcatgcg cacgctcacg ggaacgcgca cgcgcacgcg 780
caccttcata tgcacggcgc aacagatcgt gcggcgcagg gcagcagcgc taacggcccg 840
gccaaccacg gggccgctgg gacagggctg cagtccgctg ggatggcagg ttccacggct 900
gcaggcgcgg ctgcgcccgc cggtgagtcg ctggccaagc cgcccttcgc ctccctagcc 960
gtccacttcg acctgcactc agtcctggcg ggcgcgggag cggctgcagc caatggtggc 1020
gccaatgccg cagctcacac tgctggcgcc accgggcgag agagcggcca ggcggcgggc 1080
gcggccacag gcggcattgc cgccgccggc accgtcatcg gctggtcgca tgcggacatg 1140
gacgtggacg gaggggaggc cggcgcgcag gatgaagatg acgaggacga ggacgacggc 1200
gtggaggcgc cggcgggcac acagaaccgg aagcgcgccg cggatgactc gggttgcgac 1260
ggcgccgccg ccaacaacaa cggcaacact gccgcaaagg ctggcgcagc ggcaatcgcc 1320
gcgggcgggc ctgggagctc gggcagggcg aaggccacgg acggcgcccg cgctgagatt 1380
cgccacaacg gtgggccgat ggcggcgcgg atggcggctg cagagggctc tcaaggctcg 1440
cgcgctgcat cgggctcggc ggcaacggga ccgggaggag cgcgggaggg cactgcgacg 1500
cctagcggcg acacctttgc ggagagccct tccaccttca cttccatcat caacaccacc 1560
ggctcgggca gcgaggccga cgagcagcca gtgccgctga agcaccagga acagcaacag 1620
cagcaacagc agcagcgggt cggcgagggt gacagggcga agcccgaacc gcacccacag 1680
aaccctgccc aggcagcaca cctgccgcac ccgtccgcgg ccccatgctc gggcggtggc 1740
ggtattgcgc aagcggccct acccctaggg ctacaggagc tggcagcgct gggggcggct 1800
cggcacaaag agctgtggac gcagcggcac cttatgcatc agcggcaggc ggcggcagcg 1860
gcgacagcag cggcggcctc ggcagctgct gcagcggcaa tgcccacggc cggcgcgagc 1920
gccgcggctc ctgcaggccc accttcggcg cggccctccg cttccttggc agacacgggc 1980
ggcgacggcc ccgcggctgc gacggcgcct gagacgcgcg cagatgggcc ctctggccct 2040
gccacgacgc agggccccaa acgagatgcc gtcgcaggtg ccgcggctgt cggcagctct 2100
gcacggagcg acagtccgct gccggcagcc gccgccgcga cggcaggcgc caacggcgcg 2160
agcggcgccg cttctgacgt gttggcgggc gcaggcagcc ttgcgcttct ccggcacagc 2220
gatcggtctg ccttcaccgc gttcacggtc ttcctgcccg ggcgtgttgc cgccgccgcg 2280
gccgctgcag cggccgccgc cgcagctgct accagcgcgg gcgccagcac cggcactgcc 2340
aacggggctc cgccggcacc gggcaccgct ctggctgccg ctgccgcagc agctgccgcc 2400
gctgcgtcag cagtgccgct gccgcatcca cacacagcgc ccccagcgct gttcggcgtc 2460
cctccgccgt cctccgtgcc tcccagctcg ctttctgtgc tacctcctgt gatgccgctc 2520
catccggccg ctgccgctgc agcggcgacg gcgggtgggg gcaagcccag cgacgcagcc 2580
acgtatgccg cggctgctgc agctggattg gtgccgtatc cagggtttgc gccggcgcgg 2640
ccggggccat ttccgccgcc gccaggttct ggtggccccg gcgcgccgcc tgtgtacata 2700
cccgagtcag tcctgcagct gattgcgcac ctgtccggcc gcgcggctgc ggaaattccg 2760
gcggtgccgg cggagtcagt gacggcagca ccggtggttg tgcagaagag cggcggccct 2820
gcctcggcgg cgcgactggc ggcagtggcc aagtacctgg agaagcggaa gcaccgcaat 2880
ttccagaaga aggtgcgcta cgagagccgc aagcggctcg ccgaggcccg gccacgcgtc 2940
agggggcagt tcgtcaaggc gggcaccgcg ggtgcagcgg cagcggcagc ggcagcggca 3000
gccgcagccg cagccggcac tgccgctact gctgccggca ccggcacggc cagaggtgct 3060
gccaccgctt ctggggctgc tgggaagccg gagctacagg gccccgacac ggcagaagag 3120
gctgcggctg cgacgctgct tagcgcagca gctgctatgg cagcagcggc tgcgggcacc 3180
agtggcccca gcggctctgg gtccggcgcg atggatgtgg acggtgccga cccggaagca 3240
gatgcagacg tcatggatga ggacgatggc gaagacgacg ggtcggacga gtccgctggg 3300
gagccctag 3309
<210> 67
<211> 1335
<212> DNA
<213> Coccomyxa
<400> 67
atggctgcag gcctcaagcg gatacccagc ttctcggggc gaccaggatt ccccaacggt 60
ctgcaggtgt tggttgtgga cggggacacc agcagcagcc agtgcttgcg gcagaagctg 120
gaggagctgg catatgaagt cagctgctgc tcgtccggat ctgacgcttc ggcgctcctg 180
cgcaaggagg actccagcta cgacattctc ctagttgagg ccaaagctct ggcaaaggat 240
gctactgatg gaggcagtct cagagattct gcagcgcacc tgccgctggt cctcatgtca 300
gaaaagagca gcagcacaga cgctgtatgg cgaggcatag agctcggggc agcggacgtt 360
ctggagaagc cgctgtcctc cttgaagctg cgcaacatct ggcaacatgt cgttcgcaag 420
atgatgagct cgtcccagga cagcagcagg gaggcggtgc cctgcaagat ggagccgaag 480
agcaagggca agggcgtgtc agcgccctcc agccctcgca ctccctcccc tgcagcctcc 540
ctcctcacca tcagcagcgg cacgatgaca gagaagagct gcaagggcgg cggcgatgag 600
gcctccttct caggtgtggg agatgtgaag atgtcctgct cggcagaggc gccggagccc 660
tgcgattcgc gcgcgaccgc tgagtcaccc gccagcacgc agaccaaggt cacgttcccg 720
gggtgcttga atagcggcgg cacggcgctc gcggctagca agaattgcag ccgcaagaga 780
aaggcaaagg cgccggacac tcctgcatcg gtggcgagcc ggccgcctct ggccatcagg 840
ccccccgcat gggcctcccc atttggtccc ccccaccagg gcaacaccca cgtcgtcggc 900
atggccccgc cacagtgcta tatgcagggg gttgacccca cgaacgggtg cgtatggggc 960
acgccagcag ggggcgtcag ccaagcgcca gcctacatgc ccggctgggg cttctcgccg 1020
cagccaatgc tttccggcag cttcttgcag catccctcca ccagcgacct gcacaagtgc 1080
cccagcgtgg gtgccagcag cctggcaagc agcctggaca gcagcctgac gctgtgcggc 1140
tttggcgcgg acctgcctga cgacgatctc ctgttggagg acgtgcttct gccggacgag 1200
gatcttctgg acttggcccc agatgagccc gccaccatga aggcccccga gcagccgccc 1260
atcggcctca agctcaagaa gtccgcttca ctcatcgacc tcatcaatgc gcaactgtcc 1320
gccgccaccg cctga 1335
<210> 68
<211> 568
<212> PRT
<213> Parachlorella sp.
<400> 68
Met Leu Arg Gln Gln Leu Leu His Ser Gly Arg Gln Pro Gly Ala Thr
1 5 10 15
Cys Ser Leu Leu Thr Cys Ser Thr Trp Arg Pro Ser Ala Leu Phe Gly
20 25 30
Arg Pro Lys Pro Gln Lys Leu His Ser Gln Arg Leu Gln His Gln Gly
35 40 45
Arg Pro Ser Arg Leu Val Val Arg Ser Ala Met Phe Asp Asn Leu Ser
50 55 60
Arg Ser Leu Glu Arg Ala Trp Asp Met Val Arg Lys Asp Gly Arg Leu
65 70 75 80
Thr Ala Asp Asn Ile Lys Glu Pro Met Arg Glu Ile Arg Arg Ala Leu
85 90 95
Leu Glu Ala Asp Val Arg Leu Gly Ala Pro Leu Ile Arg Phe Leu Val
100 105 110
Ser Thr Pro Pro Pro Ser Gln Val Ser Leu Pro Val Val Arg Lys Phe
115 120 125
Val Lys Ala Val Glu Glu Lys Ala Leu Gly Ser Ala Val Thr Lys Gly
130 135 140
Val Thr Pro Asp Gln Gln Leu Val Lys Val Val Tyr Asp Gln Leu Arg
145 150 155 160
Glu Leu Met Gly Gly Gln Gln Glu Gly Leu Val Pro Thr Ser Pro Glu
165 170 175
Glu Pro Gln Val Ile Leu Met Ala Gly Leu Gln Gly Thr Gly Lys Thr
180 185 190
Thr Ala Ala Gly Lys Leu Ala Leu Phe Leu Gln Lys Lys Gly Gln Lys
195 200 205
Val Leu Leu Val Ala Thr Asp Ile Tyr Arg Pro Ala Ala Ile Asp Gln
210 215 220
Leu Val Lys Leu Gly Asp Arg Ile Gly Val Pro Val Phe Gln Leu Gly
225 230 235 240
Thr Gln Val Gln Pro Pro Glu Ile Ala Arg Gln Gly Leu Glu Lys Ala
245 250 255
Arg Ala Glu Gly Phe Asp Ala Val Ile Val Asp Thr Ala Gly Arg Leu
260 265 270
Gln Ile Asp Gln Ser Met Met Glu Glu Leu Val Gln Ile Lys Ser Thr
275 280 285
Val Lys Pro Ser Asp Thr Leu Leu Val Val Asp Ala Met Thr Gly Gln
290 295 300
Glu Ala Ala Gly Leu Val Lys Ala Phe Asn Asp Ala Val Asp Ile Thr
305 310 315 320
Gly Ala Val Leu Thr Lys Leu Asp Gly Asp Ser Arg Gly Gly Ala Ala
325 330 335
Leu Ser Val Arg Gln Val Ser Gly Arg Pro Ile Lys Phe Val Gly Met
340 345 350
Gly Glu Gly Met Glu Ala Leu Glu Pro Phe Tyr Pro Glu Arg Met Ala
355 360 365
Ser Arg Ile Leu Gly Met Gly Asp Val Val Thr Leu Val Glu Lys Ala
370 375 380
Glu Glu Ser Ile Lys Glu Glu Glu Ala Gln Glu Ile Ser Arg Lys Met
385 390 395 400
Leu Ser Ala Lys Phe Asp Phe Asp Asp Phe Leu Lys Gln Tyr Lys Met
405 410 415
Val Ala Gly Met Gly Asn Met Ala Gln Ile Met Lys Met Leu Pro Gly
420 425 430
Met Asn Lys Phe Thr Glu Lys Gln Leu Ala Gly Val Glu Lys Gln Tyr
435 440 445
Lys Val Tyr Glu Ser Met Ile Gln Ser Met Thr Val Lys Glu Arg Lys
450 455 460
Gln Pro Glu Leu Leu Val Lys Ser Pro Ser Arg Arg Arg Arg Ile Ala
465 470 475 480
Arg Gly Ser Gly Arg Ser Glu Arg Glu Val Thr Glu Leu Leu Gly Val
485 490 495
Phe Thr Asn Leu Arg Thr Gln Met Gln Ser Phe Ser Lys Met Met Ala
500 505 510
Met Gly Gly Met Gly Met Gly Ser Met Met Ser Asp Glu Glu Met Met
515 520 525
Gln Ala Thr Leu Ala Gly Ala Gly Pro Arg Pro Val Pro Ala Gly Lys
530 535 540
Val Arg Arg Lys Lys Leu Ala Ala Ala Gly Gly Ser Arg Gly Met Ala
545 550 555 560
Glu Leu Ala Ser Leu Lys Ala Glu
565
<210> 69
<211> 23
<212> DNA
<213> Parachlorella sp.
<400> 69
gggacatggt gcgcaaggac ggg 23
<210> 70
<211> 2667
<212> DNA
<213> Parachlorella sp.
<400> 70
atggccaaac tgacatccgc tgttcctgtg ttgacagcaa gagatgttgc aggtgcagtg 60
gagttttgtg agttctgaga agctgattgt tgtttaactt ctttgaaagc tttatcgaag 120
attctgcaag cgatgaacat tgcttgtcaa gaccgagagc tgcatgccca cttgacatcc 180
agctttgaac ggctcttcat gtttgatttg tttctgattg tagggacaga tagactgggg 240
tttagcaggg actttgtgga ggacgatttt gcaggagtgg tgagggatga tgtgacactg 300
tttatctcag cagtgcagga tcaagtgagt gcagcgtcag ctgtggcagt tgttggcttt 360
cgtctcagtc agtagtttgc tgggattgat tatggagggc acagttgcaa ttttgagttg 420
cacgttgcga caagcgtgtt gacaaagcgt ggtcaagccg gccagtcttg ccggtggcgg 480
gtggcttggt ctaacttccg ctctacagca atcgttttgt tcatggttac ggggctggcg 540
tgccagaaag tcctggtcag ccaccctcgc ttcaaagccg tagcccaaca actttgcgaa 600
tatgttcgat ttgcaggtgg tgcccgataa tacactggca tgggtttggg tgagaggtac 660
agctctgcgt gcaacaggtt gcaagatgca gcgcaggtct tccctggtca aacgatgtat 720
gcagagttga gaggcacttg agctgggtga atggcgtggg ctcgtaggta gtgtgcaggg 780
caggaagggc agccaatttt ggagttgtgg tccggtgtcg ttgcttcgag ccttattagg 840
actcttgctc atcaaagcgt tagttgtgaa taagttgatc tgaaaggatg ttatgtacag 900
caagcagcag cagttaagag tctggggagt agctgcacag ggcgaggtgt caagatggga 960
agggtcctgc ctccttatgt gtttttccct gtaggggagg aagcctctta tgggcaatgg 1020
ttgggcatat tttccagcca gcccttcttt ctataggggc cagggtgggc ccagctcgtc 1080
ttggcttcca ccaccaggag agtgagggca ttgaagggcc ataaatagtc ctcccatcta 1140
cgtgcaccag agggtgtcgt ctaggctgtg catgccacga ggggaaggag ccaagaatga 1200
gtgtatgggt tgttttcatg tttaggctgg gataaaactg ttttcaattg cgcctgccgg 1260
gtgaaaacca cagcagcatc agcaagcttg gagaaggcca gcccgcccag cacaggctca 1320
cgttcccact caggcggtca gtcgggcggg ggtgtgagtc aggcaggcga gggtgtctgt 1380
gcctgacatc agcacctctg cttagccact gcagcccctg gagcagggta gggcgtcatt 1440
tgcagcaatc acctgctgcc tcacacgtcg cagcttggaa tttcaacgac catcagcgct 1500
ggggttgttg agggatcata gcagattttg gtgcagcctg gttgtcatgc tctttgtgga 1560
atggcctcta tgttcgagca attcgttgga tgttgaggtg cttggggaca gagagtcgaa 1620
tgatgggcca gggtcaaaca tgcgagcgtt tggctgagtc agcggttttt gctggtcact 1680
ttttcttttg tttcttattt aggtttgatg gatgtgtttt gtgctgctgc cctgaagctg 1740
cagcagcgtg tctgccctgc gctactgcgg gcaccaaggc tatgtgctgg tgcactcggc 1800
tgcgctgcac ctgtgcacct cgcactccgt ccagcctcca tgcagcacac gtactcacgg 1860
tgtcctcctg acctgtcgta cgctattcca aacttgctct tttgctgccg ctgctctcgt 1920
acacaattgc tgttgattat cgatatctaa tcgagcgcct gctgactgaa ctccgcaggt 1980
ttggatgaac tgtatgcaga gtggtctgaa gtggtgagca ccaactttag gtgggtgggc 2040
tctgaaggag gaggagggag cgggtgatta aacagggcct gcatgaagag gagcaggggc 2100
tgcatggaca gcagggggaa ggtgcagaag ggagggtcaa gcggggttca ggtggctgtg 2160
ggtttctgca cgagcagtga aagaagctgt atccttccac ctgctttcac tggcgaaagg 2220
ttgaaaacag gatgtcgcag ctggaaagat gttgcgctgt caagtgcaag ccatggttga 2280
gggtatgcct gtgtgcatgt gcttcttaaa gttactcctg ttctatggtt ctgggtgctt 2340
gttgtttgtg gtgcagggat gcaagcggac ctgcaatgac agagattgga gaacaacctt 2400
ggggaaggga gtttgcattg agagatcctg caggtgaggg ggcatgtaag caatggcagg 2460
caattcaaga acgaatcatt gctgcaaatg ctgggatggt atgcagctga ggtatctatt 2520
gccttgtatt ttgtctcgca ttgcatcggt ggtgcgttct gtggcctgag gcacagttct 2580
tgctgtttga taagggttcg actgagttgt cgtgtgtgct gtgctgcagg caattgcgtg 2640
cactttgttg cagaagaaca ggactga 2667
<210> 71
<211> 530
<212> DNA
<213> Parachlorella sp.
<400> 71
ccaccatggg ggaggtttga agtgtgcgcc tgatataatc atacacctaa aagcaccact 60
tgctgattgt gaagggacta tgtcgtttat gacgggacgt tacgctggcc gatggtttga 120
atttggacgc tgtggtagaa tgttatatgg acgtaaaggt tggcatattg aaaatcgtct 180
tcgcaggcaa acttctagac gtgtgaccca ccggtaaaac gacaagcgtg gcgcgtcgat 240
tgcgctttga acgtcgtttg ttggactcca gatgaacctc aaaatcaaag cggtgattga 300
cgaaaatcaa atgacagccc gcaaaatttc atcagccttc ggatcggatt ctcagaatct 360
gattgtccct gctggctaca tttatgaaat ttcgtacatt ttggcagaaa tgtcccaata 420
ccatagcact gccgcctgag ctcacccgag caatgcatac tgggtacctc gcccatctcg 480
ccctctttcc aagcccagtg ctgttgtaat agccaaaggg ctcagtaaca 530
<210> 72
<211> 546
<212> DNA
<213> Parachlorella sp.
<400> 72
gcatagcatc agcctgtggc agggttgtgg tagggctgag tggcagggtt aaaggggttg 60
cctaccccac ccctactctc atgacaccag caacagcagc agctcatgca gtactcaaat 120
cactgatgtc aatggtgtga cacatttggt taaggctgct ttttaaagtg ctgctttggg 180
ggcagtgact gtgcagagct tggagcgtat ccccatgtaa tcagaaccga cgagagttcg 240
gggcaacctt tcatcttcac attttttgtg atcagctaca gagtctgaaa tcaaatagag 300
gctgccatct aaacgcagga gtcacaacga aggcgaaaac tccaattgct gtactcaatg 360
cactaagtga ttgttcaatg gataaataca ctatgctcaa ttcatgccag cagagctgct 420
ccttccagcc agctacaatg gctttttcca cgccttttga agtatgaatg ttcagcttgc 480
tgtgcttgat gcatcaccat aaacacaatt ctacaacatt tcatgccaac aacagtacgg 540
gctttc 546
<210> 73
<211> 23
<212> DNA
<213> Parachlorella sp.
<400> 73
tgcggtgaag cttggagctg tgg 23
<210> 74
<211> 23
<212> DNA
<213> Parachlorella sp.
<400> 74
acaccacctt aaggcacatg agg 23
<210> 75
<211> 549
<212> PRT
<213> Chlamydomonas reinhardtii
<400> 75
Met Gln Thr Ala Leu Arg Ala Arg Ser Ala Ala Pro Arg Gly Ala Cys
1 5 10 15
Asn Arg Thr Ala Val Ala Pro Val Ala Ser Ala His Leu Arg Gly Gln
20 25 30
Tyr Ala Pro Phe Ser Gly Ala Gln Ala Arg Pro Ala Leu Gly Arg Gln
35 40 45
Arg Gln Gln Gln Gln Gln Gln Arg Arg Gly Ala Leu Val Ile Arg Ser
50 55 60
Ala Met Phe Asp Ser Leu Ser Arg Ser Ile Glu Lys Ala Gln Arg Leu
65 70 75 80
Ile Gly Lys Ser Gly Thr Leu Thr Ala Glu Asn Met Lys Glu Pro Leu
85 90 95
Lys Glu Val Arg Arg Ala Leu Leu Glu Ala Asp Val Ser Leu Pro Val
100 105 110
Val Arg Arg Phe Ile Lys Lys Val Glu Glu Arg Ala Leu Gly Thr Lys
115 120 125
Val Arg Glu Gly Arg Ala Met Gly Thr Lys Trp Lys Ser Val Val Asn
130 135 140
Cys Pro Leu Gln Asp Gly Leu Gly Asn Arg Gly Val Gly Arg Ala Arg
145 150 155 160
Thr Glu Val Gly His Arg Ala Ala Cys Val His Gly Ala Arg Gly Val
165 170 175
Gly Lys Thr Thr Ala Ala Gly Lys Leu Ala Leu Tyr Leu Lys Lys Ala
180 185 190
Lys Lys Ser Cys Leu Leu Val Ala Thr Asp Val Tyr Arg Pro Ala Ala
195 200 205
Ile Asp Gln Leu Val Lys Leu Gly Ala Ala Ile Asp Val Pro Val Phe
210 215 220
Glu Met Gly Thr Asp Val Ser Pro Val Glu Ile Ala Lys Lys Gly Val
225 230 235 240
Glu Glu Ala Arg Arg Leu Gly Val Asp Ala Val Ile Ile Asp Thr Ala
245 250 255
Gly Arg Leu Gln Val Asp Glu Gly Met Met Ala Glu Leu Arg Asp Val
260 265 270
Lys Ser Ala Val Arg Pro Ser Asp Thr Leu Leu Val Val Asp Ala Met
275 280 285
Thr Gly Gln Glu Ala Ala Asn Leu Val Arg Ser Phe Asn Glu Ala Val
290 295 300
Asp Ile Ser Gly Ala Ile Leu Thr Lys Met Asp Gly Asp Ser Arg Gly
305 310 315 320
Gly Ala Ala Leu Ser Val Arg Glu Val Ser Gly Lys Pro Ile Lys Phe
325 330 335
Val Gly Val Gly Glu Lys Met Glu Ala Leu Glu Pro Phe Tyr Pro Glu
340 345 350
Arg Met Ala Ser Arg Ile Leu Gly Met Gly Asp Val Leu Thr Leu Tyr
355 360 365
Glu Lys Ala Glu Ala Ala Ile Lys Glu Glu Asp Ala Gln Lys Thr Met
370 375 380
Glu Arg Leu Met Glu Glu Lys Phe Asp Phe Asn Asp Phe Leu Asn Gln
385 390 395 400
Trp Lys Ala Met Asn Asn Met Gly Gly Leu Gln Met Leu Lys Met Met
405 410 415
Pro Gly Phe Asn Lys Ile Ser Glu Lys Gln Leu Tyr Glu Ala Glu Lys
420 425 430
Gln Phe Gly Val Tyr Glu Ala Ile Ile Gly Ala Met Asp Glu Glu Glu
435 440 445
Arg Ser Asn Pro Glu Val Leu Ile Lys Asn Leu Ala Arg Arg Arg Arg
450 455 460
Val Ala Gln Asp Ser Gly Lys Ser Glu Ala Glu Val Thr Lys Leu Met
465 470 475 480
Ala Ala Tyr Thr Ser Met Lys Ala Gln Val Gly Gly Met Ser Lys Leu
485 490 495
Leu Lys Leu Gln Lys Ala Gly Ala Asp Pro Gln Lys Ala Asn Ser Leu
500 505 510
Leu Gln Glu Leu Val Ala Ser Ala Gly Lys Lys Val Ala Pro Gly Lys
515 520 525
Val Arg Arg Lys Lys Glu Lys Glu Pro Leu Ser Lys Ala Arg Gly Phe
530 535 540
Gly Ser Ser Ser Lys
545
<210> 76
<211> 559
<212> PRT
<213> Micromonas pusilla
<400> 76
Met Arg His Leu Leu Ser Ser Ala Ser Ile Arg Gln Tyr Asp Lys Trp
1 5 10 15
Ser Leu Val Ser Ser His Ala Lys Lys Pro Ala Leu Val Cys Ala Ser
20 25 30
Lys His Thr Lys Ser Ala Val Lys Leu Gln Cys Thr Ser Arg Gly Ser
35 40 45
Ser Asn Arg Thr Ile Gln Leu Leu Leu Phe Gln Gln Phe Arg Pro Ala
50 55 60
Lys Arg Gly Lys Leu Leu Ile Thr Arg Ala Asp Ser Phe Gly Thr Leu
65 70 75 80
Ser Glu Arg Leu Asn Ser Ala Trp Ser Ala Leu Lys Asp Glu Asp Asp
85 90 95
Leu Ser Val Glu Asn Ile Ser Leu Pro Leu Lys Asp Ile Arg Arg Ala
100 105 110
Leu Leu Glu Ala Asp Val Ser Leu Pro Val Val Arg Arg Phe Ile Lys
115 120 125
Ser Val Glu Glu Lys Ser Ile Gly Val Lys Val Thr Lys Gly Val Ser
130 135 140
Ala Ser Gln Gln Leu Thr Lys Val Val Ala Asp Glu Leu Cys Glu Leu
145 150 155 160
Met Gly Gly Phe Gly Gly Asp Lys Leu Ile Phe Arg Lys Glu Gly Glu
165 170 175
Gly Pro Thr Val Ile Leu Met Ala Gly Leu Gln Gly Val Gly Lys Thr
180 185 190
Thr Ala Cys Gly Lys Leu Ala Leu Phe Leu Lys Ala Gln Gly Lys Gln
195 200 205
Ser Leu Leu Val Ala Thr Asp Val Tyr Arg Pro Ala Ala Ile Asp Gln
210 215 220
Leu Lys Lys Leu Gly Glu Gln Ile Asp Val Pro Val Phe Glu Leu Gly
225 230 235 240
Thr Asp Phe Ser Pro Pro Asp Ile Ala Arg Ser Gly Val Glu Lys Ala
245 250 255
Lys Leu Glu Asn Phe Asp Val Val Ile Val Asp Thr Ala Gly Arg Leu
260 265 270
Gln Val Asp Glu Met Leu Met Ala Glu Leu Leu Ala Thr Lys Ala Ala
275 280 285
Thr Arg Ala Asp Glu Thr Leu Leu Val Val Asp Ala Met Thr Gly Gln
290 295 300
Glu Ala Ala Ser Leu Thr Ala Ala Phe Asn Asp Ala Val Gly Ile Thr
305 310 315 320
Gly Ala Val Leu Thr Lys Met Asp Gly Asp Thr Arg Gly Gly Ala Ala
325 330 335
Leu Ser Val Arg Glu Val Ser Gly Lys Pro Ile Lys Phe Ile Gly Ser
340 345 350
Gly Glu Lys Leu Asp Ala Leu Glu Pro Phe Phe Pro Glu Arg Met Thr
355 360 365
Thr Arg Ile Leu Gly Met Gly Asp Val Val Ser Leu Val Glu Arg Ala
370 375 380
Gln Val Ala Val Lys Glu Glu Gln Ala Asn Leu Met Arg Asp Lys Ile
385 390 395 400
Leu Ser Ala Thr Phe Asp Phe Asn Asp Phe Leu Ser Gln Leu Glu Met
405 410 415
Met Gly Lys Met Gly Gly Met Gly Gly Leu Thr Lys Met Met Pro Gly
420 425 430
Met Asn Thr Met Ser Asp Lys Glu Leu Gln Asp Ala Glu Lys Ser Leu
435 440 445
Ser Val Ala Lys Ser Leu Ile Met Ser Met Thr Pro Arg Glu Arg Gln
450 455 460
Phe Pro Asp Leu Leu Val Ala Gly Ser Ser Ala Ala Ser Arg Arg Gly
465 470 475 480
Arg Val Val Glu Gly Ser Gly Arg Ser Asp Lys Asp Leu Ala Asn Leu
485 490 495
Ile Val Met Phe Gly Ser Met Arg Val Lys Met Gln Ser Leu Ser Ala
500 505 510
Gln Met Asn Gly Thr Ala Lys Glu Val Gly Leu Val Pro Gln Leu Ser
515 520 525
Glu Val Asp Leu Asn Lys Leu Ala Phe Glu Gly Val Gly Lys Arg Val
530 535 540
Ser Pro Gly Met Val Arg Arg Arg Lys Leu Asn Ala Ser Phe Gly
545 550 555
<210> 77
<211> 568
<212> PRT
<213> Micromonas sp.
<400> 77
Met Glu Ala Arg Thr Lys Gln Ala Arg Ala Pro Lys Gly Ser Ile Trp
1 5 10 15
Cys Ala Gln Arg Ala Arg Lys Asp Leu Arg Ala Arg Gly Cys Arg Gly
20 25 30
Leu Gly Ser Arg Ile Ser Lys Gly Gln Pro Phe Ser Pro Leu Thr Leu
35 40 45
Ser Thr Pro Ala Val Thr Glu Ile Gly Phe Gly Thr Leu Leu Tyr Gly
50 55 60
Ser Arg Leu Ser Ala Gly Gly Ser Arg Arg Gly Glu Thr Met Leu Arg
65 70 75 80
Arg Ala Ser Ala Phe Gly Ser Leu Thr Glu Arg Leu Asn Ser Val Trp
85 90 95
Ala Thr Leu Lys Asp Glu Asp Asp Leu Ser Leu Glu Asn Ile Lys Gly
100 105 110
Pro Leu Lys Asp Ile Arg Arg Ala Leu Leu Glu Ala Asp Val Ser Leu
115 120 125
Pro Val Val Arg Arg Phe Ile Lys Asn Ile Glu Gln Lys Ala Ile Gly
130 135 140
Thr Arg Val Thr Lys Gly Val Asn Ala Gly Gln Gln Leu Thr Lys Val
145 150 155 160
Val Ala Asp Glu Leu Cys Glu Leu Met Gly Gly Phe Gly Gly Asp Ser
165 170 175
Leu Ala Phe Lys Asp Pro Ser Met Gly Pro Thr Val Ile Leu Met Ala
180 185 190
Gly Leu Gln Gly Val Gly Lys Thr Thr Ala Cys Gly Lys Leu Ala Leu
195 200 205
Tyr Leu Lys Lys Gln Gly Lys Asp Ser Leu Leu Val Ala Thr Asp Val
210 215 220
Tyr Arg Pro Ala Ala Ile Glu Gln Leu Lys Arg Leu Gly Glu Gln Val
225 230 235 240
Lys Thr Pro Val Phe Asp Met Gly Val Arg Val Asp Pro Pro Glu Val
245 250 255
Ala Arg Leu Gly Leu Glu Lys Ala Arg Ala Glu Gly Ile Asp Val Val
260 265 270
Ile Ile Asp Thr Ala Gly Arg Leu Gln Val Asp Val His Leu Met Glu
275 280 285
Glu Leu Arg Ala Thr Lys Ile Ala Thr Ala Ala Asp Glu Ile Leu Leu
290 295 300
Val Val Asp Ala Met Thr Gly Gln Glu Ala Ala Ala Leu Thr Ala Ala
305 310 315 320
Phe Asp Glu Ala Val Gly Ile Thr Gly Ala Val Leu Thr Lys Met Asp
325 330 335
Gly Asp Thr Arg Gly Gly Ala Ala Leu Ser Val Arg Glu Val Ser Gly
340 345 350
Lys Pro Ile Lys Phe Thr Gly Val Gly Glu Lys Met Glu Ala Leu Glu
355 360 365
Pro Phe Tyr Pro Glu Arg Met Ala Ser Arg Ile Leu Gly Met Gly Asp
370 375 380
Val Val Thr Leu Val Glu Arg Ala Gln Gln Val Val Lys Asn Glu Glu
385 390 395 400
Ala Glu Gln Met Arg Asp Lys Ile Leu Ser Ala Thr Phe Asp Phe Asn
405 410 415
Asp Phe Ile Lys Gln Met Glu Met Met Gly Gln Met Gly Gly Met Asp
420 425 430
Gly Phe Met Lys Leu Leu Pro Gly Met Ser Gly Met Ser Glu Arg Glu
435 440 445
Met Gln Glu Ala Asp Lys Ser Leu Lys Val Ala Lys Ser Leu Ile Leu
450 455 460
Ser Met Thr Ser Lys Glu Arg Gln Phe Pro Asp Ile Leu Val Ala Gly
465 470 475 480
Ala Ser Ala Lys Ser Arg Arg Lys Arg Ile Ile Glu Gly Ala Gly Arg
485 490 495
Ser Glu Lys Asp Leu Ser Gln Leu Ile Val Leu Phe Gly Ser Met Arg
500 505 510
Val Lys Met Gln Lys Met Thr Ala Glu Ile Thr Gly Ala Ser Ala Glu
515 520 525
Val Gly Leu Thr Pro Gln Leu Ser Glu Glu Asp Met Asn Thr Leu Ala
530 535 540
Asn Glu Gly Leu Arg Lys Asn Val Ser Pro Gly Met Val Arg Arg Leu
545 550 555 560
Arg Ile Arg Arg Leu Thr Gly Ser
565
<210> 78
<211> 481
<212> PRT
<213> Paulinella chromatophora
<400> 78
Met Phe Asp Glu Leu Ser Ala Arg Phe Glu Glu Ala Val Lys Ser Leu
1 5 10 15
Lys Gly Leu Ser Ala Ile Thr Glu Asn Asn Val Glu Asn Ala Leu Lys
20 25 30
Gln Val Arg Arg Ala Leu Ile Glu Ala Asp Val Ser Leu Val Val Val
35 40 45
Lys Glu Phe Met Glu Glu Val Arg Ser Lys Ser Ile Gly Ile Glu Val
50 55 60
Val Arg Gly Ile Lys Pro Asp Gln Lys Phe Ile Gln Val Val Tyr Glu
65 70 75 80
Gln Leu Ile Glu Ile Met Gly Ala Asn Asn Thr Pro Leu His Lys Gln
85 90 95
Ser His Thr Val Thr Val Val Leu Met Ala Gly Leu Gln Gly Ala Gly
100 105 110
Lys Thr Thr Ala Ala Ala Lys Leu Ala Leu Tyr Leu Lys Asn Gln Gly
115 120 125
Glu Lys Val Leu Met Val Ala Ala Asp Val Tyr Arg Pro Ala Ala Ile
130 135 140
Asp Gln Leu Phe Val Leu Gly Lys Gln Ile Asp Val Glu Val Phe Thr
145 150 155 160
Leu Asn Pro Glu Ser Ile Pro Glu Asp Ile Ala Ala Ala Gly Leu Gln
165 170 175
Lys Ala Ile Arg Glu Gly Phe Asp Tyr Leu Ile Val Asp Thr Ala Gly
180 185 190
Arg Leu Gln Ile Asp Thr Ala Met Met Gln Glu Met Val Arg Ile Arg
195 200 205
Ser Ala Val Asn Pro Asn Glu Ile Leu Leu Val Val Asp Ser Met Ile
210 215 220
Gly Gln Glu Ala Ala Glu Leu Thr Arg Ala Phe His Glu Gln Ile Gly
225 230 235 240
Ile Thr Gly Ala Val Leu Thr Lys Leu Asp Gly Asp Ala Arg Gly Gly
245 250 255
Ala Ala Leu Ser Ile Arg Lys Val Ser Gly Ala Pro Ile Lys Phe Ile
260 265 270
Gly Thr Gly Glu Lys Val Glu Ala Leu Gln Pro Phe His Pro Glu Arg
275 280 285
Met Ala Ser Arg Ile Leu Gly Met Gly Asp Ile Val Thr Leu Val Glu
290 295 300
Lys Ala Gln Glu Glu Val Glu Leu Ala Asp Val Glu Lys Met Gln Arg
305 310 315 320
Lys Leu Gln Glu Ala Ser Phe Asp Phe Ser Asp Phe Leu Gln Gln Met
325 330 335
Arg Leu Val Lys Arg Met Gly Ser Leu Gly Gly Leu Met Lys Met Ile
340 345 350
Pro Gly Met Asn Lys Ile Asp Ser Thr Met Leu Arg Glu Gly Glu Ala
355 360 365
Gln Leu Lys Arg Ile Glu Ser Met Ile Gly Ser Met Thr Pro Thr Glu
370 375 380
Arg Glu Lys Pro Glu Leu Leu Ala Ser Gln Pro Ser Arg Arg Gly Arg
385 390 395 400
Ile Ala Lys Gly Ser Gly His Lys Ile Ala Asp Val Asp Lys Met Leu
405 410 415
Val Asp Phe Gln Lys Met Arg Gly Phe Met Gln Gln Met Thr Lys Gly
420 425 430
Asn Asn Phe Ala Asn Pro Leu Ser Met Gly Ala Asn Met Phe Ser Gln
435 440 445
Pro Asn Met Thr Val Pro Gln Thr Lys Ile Ser Asn Thr Asn Glu Ser
450 455 460
Arg Met Arg Asn Ser Arg Ala Thr Lys Lys Lys Lys Gly Phe Gly Gln
465 470 475 480
Leu
<210> 79
<211> 498
<212> PRT
<213> Ostreococcus lucimarinus
<400> 79
Met Thr Arg Ala Asp Ala Phe Ala Gly Met Ser Asp Lys Leu Asp Lys
1 5 10 15
Ala Trp Ala Arg Leu Gln Gly Glu Lys Asp Leu Asn Ala Asp Asn Val
20 25 30
Lys Ala Pro Leu Lys Asp Val Arg Arg Ala Leu Leu Glu Ala Asp Val
35 40 45
Ser Leu Pro Val Val Arg Arg Phe Ile Ala Arg Cys Glu Glu Lys Ala
50 55 60
Val Gly Met Lys Val Thr Lys Gly Val Glu Pro Gly Gln Met Leu Val
65 70 75 80
Lys Cys Val Ala Asp Glu Leu Cys Glu Leu Met Gly Gly Val Gly Ala
85 90 95
Glu Gly Ile Lys Phe Arg Asp Asp Gly Glu Pro Thr Val Val Leu Met
100 105 110
Ala Gly Leu Gln Gly Val Gly Lys Thr Thr Ala Cys Gly Lys Leu Ser
115 120 125
Leu Ala Leu Arg Lys Gln Gly Lys Ser Val Leu Leu Val Ala Thr Asp
130 135 140
Val Tyr Arg Pro Ala Ala Ile Asp Gln Leu Lys Thr Leu Gly Lys Gln
145 150 155 160
Ile Gly Val Pro Val Phe Asp Met Gly Val Asp Gly Asn Pro Pro Glu
165 170 175
Ile Ala Ala Arg Gly Val Arg Lys Ala Lys Asp Glu Asp Ile Asp Val
180 185 190
Val Ile Val Asp Thr Ala Gly Arg Leu Asn Ile Asp Glu Lys Leu Met
195 200 205
Gly Glu Leu Lys Ala Thr Lys Glu Ala Thr Ser Ala Asp Glu Thr Leu
210 215 220
Leu Val Val Asp Ala Met Thr Gly Gln Glu Ala Ala Thr Leu Thr Ala
225 230 235 240
Ser Phe Asn Glu Ala Val Glu Ile Thr Gly Ala Ile Leu Thr Lys Met
245 250 255
Asp Gly Asp Thr Arg Gly Gly Ala Ala Leu Ser Val Arg Glu Val Ser
260 265 270
Gly Lys Pro Ile Lys Phe Thr Gly Val Gly Glu Lys Met Asp Ala Leu
275 280 285
Glu Pro Phe Tyr Pro Glu Arg Met Thr Ser Arg Ile Leu Gly Met Gly
290 295 300
Asp Ile Val Ser Leu Val Glu Lys Val Gln Ala Gly Val Lys Glu Glu
305 310 315 320
Glu Ala Glu Lys Ile Lys Gln Lys Ile Met Ser Ala Thr Phe Asp Phe
325 330 335
Asn Asp Phe Val Gly Gln Leu Glu Met Met Asn Asn Met Gly Gly Met
340 345 350
Lys Gln Ile Met Gln Met Met Pro Gly Thr Ala Lys Leu Ser Glu Ala
355 360 365
Asp Met Glu Ala Ala Gly Lys Ser Met Thr Ile Ala Lys Ser Leu Ile
370 375 380
Asn Ser Met Thr Lys Glu Glu Arg Gln Tyr Pro Asp Met Leu Val Ala
385 390 395 400
Ser Thr Thr Ala Asp Ser Arg Arg Gln Arg Ile Val Lys Gly Ser Gly
405 410 415
Arg Thr Glu Ala Asp Leu Ala Gln Leu Ile Met Met Phe Gly Gly Met
420 425 430
Arg Thr Gln Met Gln Lys Met Ser Gly Gln Leu Gly Gly Gln Ala Gly
435 440 445
Asp Val Gly Leu Gln Pro Gln Leu Ser Glu Ala Glu Leu Ser Lys Leu
450 455 460
Ala Met Asn Lys Ile Arg Lys Thr Val Lys Pro Gly Met Val Arg Arg
465 470 475 480
Gln Lys Ala Lys Lys Val Pro Lys Phe Leu Ala Glu Arg Glu Ser Phe
485 490 495
Ser Gln
<210> 80
<211> 426
<212> PRT
<213> Ostreococcus tauri
<400> 80
Met Lys Val Thr Lys Gly Val Glu Pro Gly Gln Met Leu Val Lys Ala
1 5 10 15
Val Ala Asp Glu Leu Cys Glu Leu Met Gly Gly Val Gly Ala Glu Gly
20 25 30
Ile Lys Phe Arg Asp Asp Gly Glu Pro Thr Val Ile Leu Met Ala Gly
35 40 45
Leu Gln Gly Val Gly Lys Thr Thr Ala Cys Gly Lys Leu Ser Leu Ala
50 55 60
Met Arg Lys Gln Gly Lys Thr Val Leu Leu Val Ala Thr Asp Val Tyr
65 70 75 80
Arg Pro Ala Ala Ile Asp Gln Leu Lys Thr Leu Gly Thr Gln Ile Gly
85 90 95
Val Pro Val Phe Asp Met Gly Val Asp Ala Ser Pro Pro Glu Val Ala
100 105 110
Ala Arg Gly Val Arg Lys Ala Lys Glu Glu Asp Ile Asp Val Val Ile
115 120 125
Val Asp Thr Ala Gly Arg Leu Asn Ile Asp Glu Lys Leu Met Ser Glu
130 135 140
Leu Lys Asp Thr Lys Leu Ala Thr Lys Ala Asp Glu Thr Leu Leu Val
145 150 155 160
Val Asp Ala Met Thr Gly Gln Glu Ala Ala Asn Leu Thr Ala Ser Phe
165 170 175
Gln Arg Gly Asp Gly Arg Arg Thr Arg Arg Gly Gly Ala Ala Leu Ser
180 185 190
Val Ala Arg Ser Phe Arg Lys Ala His Gln Phe Thr Ala Ser Val Lys
195 200 205
Met Asp Ala Leu Glu Pro Phe Tyr Pro Glu Arg Met Thr Ser Arg Ile
210 215 220
Leu Gly Met Gly Asp Ile Val Ser Leu Val Glu Lys Val Gln Ser Glu
225 230 235 240
Val Lys Glu Ala Glu Ala Glu Lys Leu Lys Glu Lys Ile Leu Lys Ala
245 250 255
Thr Phe Asp Phe Asn Asp Phe Val Thr Gln Leu Glu Met Met Asn Asn
260 265 270
Met Gly Ser Met Lys Gln Ile Met Gln Met Leu Pro Gly Thr Thr Lys
275 280 285
Leu Ser Glu Ser Glu Met Glu Ala Ala Glu Lys Ser Phe Lys Ile Ala
290 295 300
Arg Ser Leu Ile Asn Ser Met Thr Lys Glu Glu Arg Gln Phe Pro Asp
305 310 315 320
Met Leu Val Ala Ser Thr Thr Ala Glu Ser Arg Arg Ala Arg Ile Val
325 330 335
Lys Gly Ser Gly Arg Thr Glu Ala Asp Leu Ala Gln Leu Ile Ile Met
340 345 350
Phe Gly Ser Met Arg Gly Lys Met Gln Gln Leu Ser Gly Glu Leu Gly
355 360 365
Gly Glu Ala Gly Asn Val Gly Leu Gln Pro Gln Leu Ser Ala Ala Glu
370 375 380
Leu Glu Lys Leu Thr Thr Asn Lys Leu Arg Lys Asn Ile Lys Pro Gly
385 390 395 400
Met Val Arg Arg Leu Lys Ser Lys Lys Ile Pro Ile Ala Lys Asn Gly
405 410 415
Asp Arg Met Gly Ile Ser Ala Ser Ala Asp
420 425
<210> 81
<211> 510
<212> PRT
<213> Volvox carteri
<400> 81
Met Ser Arg Pro Ala Ala Leu Arg Gly Ala Gly Asn Arg Lys Leu Thr
1 5 10 15
Ala Thr Val Thr Ala Ala His Leu Arg Gly Ile Ala Phe Thr Ser Ile
20 25 30
Arg Thr Cys Gln Gly Ala Lys Gly Gly Ser Leu Gly Leu Pro His Pro
35 40 45
Ser Pro Pro Leu Ala Leu Pro Arg Arg Gly Arg Gly Arg Gly Ala Ala
50 55 60
Val Val Val Arg Ala Ala Met Phe Asp Asn Leu Ser Lys Ser Leu Glu
65 70 75 80
Lys Ala Gln Arg Leu Ile Gly Gly Cys Glu Val Pro Gly Val Gly Val
85 90 95
Val Gly Lys Ser Gly Thr Leu Thr Ala Glu Asn Met Lys Glu Pro Leu
100 105 110
Lys Glu Val Arg Arg Ala Leu Leu Glu Ala Asp Val Ser Leu Pro Val
115 120 125
Val Arg Arg Phe Val Lys Lys Val Glu Glu Arg Ala Leu Gly Thr Lys
130 135 140
Val Ile Glu Gly Val Thr Pro Asp Val Gln Phe Ile Lys Val Val Ser
145 150 155 160
Asn Glu Leu Ile Glu Leu Met Gly Gly Gly Val Gly Ala Lys Asp Leu
165 170 175
Glu Pro Gly Phe Pro Gln Ile Ile Leu Met Ala Gly Leu Gln Gly Val
180 185 190
Gly Lys Thr Thr Ala Ala Gly Lys Leu Ala Leu Tyr Leu Lys Lys Ala
195 200 205
Lys Lys Ser Cys Leu Leu Val Ala Thr Asp Val Tyr Arg Pro Ala Ala
210 215 220
Ile Asp Gln Leu Val Lys Leu Gly Ala Ala Ile Asp Val Pro Val Phe
225 230 235 240
Glu Leu Gly Thr Gln Val Ser Gly Lys Pro Ile Lys Phe Val Gly Val
245 250 255
Gly Glu Lys Met Glu Ala Leu Glu Pro Phe Tyr Pro Glu Arg Met Ala
260 265 270
Ser Arg Ile Leu Gly Met Gly Asp Val Leu Thr Leu Tyr Glu Lys Ala
275 280 285
Glu Ala Ala Ile Lys Glu Glu Asp Ala Lys Ala Val Met Asp Arg Leu
290 295 300
Met Glu Glu Lys Phe Asp Phe Asn Asp Phe Leu Asn Gln Trp Lys Ser
305 310 315 320
Met Asn Asn Met Gly Gly Met Gln Ile Leu Lys Met Met Pro Gly Phe
325 330 335
Asn Lys Glu Arg Ser Asn Pro Glu Val Ile Ile Lys Ser Leu Ala Arg
340 345 350
Arg Arg Arg Val Ala Gln Asp Ser Gly His Ser Glu Ala Glu Val Ala
355 360 365
Lys Leu Met Thr Ala Tyr Thr Ala Met Arg Thr Gln Val Gly Gly Met
370 375 380
Ser Lys Leu Leu Lys Leu Gln Lys Ser Gly Gly Asp Pro Ser Gln Ala
385 390 395 400
Glu Lys Leu Leu Lys Glu Leu Val Ala Ser Ala Gly Lys Lys Val Ala
405 410 415
Pro Gly Lys Pro Pro Gly Asp Pro Ala Gly Ser Phe Ile Ser Thr Pro
420 425 430
Arg Thr Pro His Pro Pro Pro Gly Pro Leu Gly Pro Arg Ser Gln Val
435 440 445
Arg Arg Lys Lys Glu Lys Glu Pro Ile Ser Lys Ala Arg Gly Phe Gly
450 455 460
Ser Pro Ser Asn Phe Asn His Asp Leu Ser Pro Pro Gly Ser Ser Pro
465 470 475 480
Ala Ala Tyr Thr Tyr Thr Leu Ser Arg Leu Ser Cys Gln Arg Leu Cys
485 490 495
Asp Gly Gly Gly Leu Leu Asp Asp Trp Asn Leu Trp Arg Arg
500 505 510
<210> 82
<211> 448
<212> PRT
<213> Phaeodactylum tricornutum
<400> 82
Met Ser Glu Ala Ser Ile Gln Pro Ala Leu Arg Glu Val Arg Arg Ala
1 5 10 15
Leu Leu Asp Ala Asp Val Asn Val Asp Val Ala Asp Thr Leu Ile Glu
20 25 30
Gly Val Arg Ala Arg Ser Leu Gly Gln Glu Val Leu Glu Gly Val Thr
35 40 45
Ala Glu Gln Gln Phe Val Lys Ala Met Tyr Asp Glu Leu Leu Asp Met
50 55 60
Met Gly Gly Asp Ser Ser Val Pro Met Ser Asp Gly Pro Ser Asn Val
65 70 75 80
Pro Val Ala Thr Leu Ala Ser Gly Thr Ala Ala Asp Pro Ala Val Ile
85 90 95
Leu Leu Ala Gly Leu Gln Gly Ala Gly Lys Thr Thr Ala Ala Gly Lys
100 105 110
Leu Ala Leu Phe Leu Lys Glu Gln Arg Lys Val Leu Leu Val Ala Ala
115 120 125
Asp Ile Tyr Arg Pro Ala Ala Ile Lys Gln Leu Gln Val Leu Gly Glu
130 135 140
Ser Ile Gly Val Glu Val Phe Thr Lys Gly Thr Asp Val Asp Pro Val
145 150 155 160
Glu Ile Val Asn Ala Gly Ile Gln Lys Ala Arg Asp Glu Gly Tyr Asp
165 170 175
Thr Val Ile Val Asp Thr Ala Gly Arg Gln Val Ile Asp Thr Asp Leu
180 185 190
Met Asp Glu Leu Gln Arg Met Lys Arg Ala Ala Ser Pro Gln Glu Thr
195 200 205
Leu Leu Ile Val Asp Ala Met Thr Gly Gln Glu Ala Ala Ser Leu Thr
210 215 220
Ala Ala Phe Asp Ser Ala Ile Gly Leu Thr Gly Ala Ile Leu Thr Lys
225 230 235 240
Met Asp Gly Asp Ser Arg Gly Gly Ala Ala Val Ser Val Arg Gly Val
245 250 255
Ser Gly Lys Pro Ile Lys Phe Val Gly Thr Gly Glu Lys Thr Ala Asp
260 265 270
Leu Glu Pro Phe Tyr Pro Asp Arg Met Ala Ser Arg Ile Leu Gly Met
275 280 285
Gly Asp Val Val Ser Leu Val Glu Lys Ala Ala Ser Glu Val Ser Asp
290 295 300
Ala Asp Ala Leu Lys Met Gln Gln Lys Met Leu Asp Ala Ser Phe Asp
305 310 315 320
Phe Asp Asp Phe Val Lys Gln Ser Glu Leu Val Thr Lys Met Gly Ser
325 330 335
Val Ala Gly Ile Ala Lys Leu Met Pro Gly Met Ala Asn Gln Leu Asn
340 345 350
Met Asn Gln Ile Arg Glu Val Glu Ala Arg Leu Lys Lys Ser Lys Ser
355 360 365
Met Ile Ser Ser Met Thr Lys Lys Glu Arg Ala Asn Pro Glu Leu Leu
370 375 380
Ile Lys Asp Ser Ser Ala Arg Ser Arg Leu Ile Arg Ile Thr Lys Gly
385 390 395 400
Ser Gly Cys Gly Leu Asp Glu Gly Gln Gln Phe Met Ser Glu Phe Gln
405 410 415
Arg Met Lys Thr Met Met Ser Thr Arg Arg Phe Trp Arg Phe Trp Leu
420 425 430
Met Ile Gln Ser Leu Ala Leu Ala Val Thr Arg Pro Glu Asn Thr Val
435 440 445
<210> 83
<211> 486
<212> PRT
<213> Thalassiosira pseudonana
<400> 83
Met Phe Asp Gln Leu Ser Asn Ala Leu Thr Glu Val Ala Lys Asn Phe
1 5 10 15
Gly Gly Lys Gln Arg Met Thr Glu Asn Ser Ile Gln Pro Ala Leu Lys
20 25 30
Ser Val Arg Arg Ala Leu Leu Asp Ala Asp Val Asn Leu Asp Val Ala
35 40 45
Thr Ala Leu Ile Asp Gly Val Lys Arg Arg Ser Leu Gly Lys Glu Val
50 55 60
Thr Lys Gly Val Thr Ala Glu Gln Gln Phe Ile Lys Ala Met Tyr Asp
65 70 75 80
Glu Leu Leu Asp Met Met Gly Gly Glu Ala Asn Glu Ser Asn Thr Met
85 90 95
Ala Thr Leu Ala His Ser Ser Val Ala Asn Glu Pro Ala Val Ile Leu
100 105 110
Leu Ala Gly Leu Gln Gly Ala Gly Lys Thr Thr Ala Ala Gly Lys Leu
115 120 125
Ala Phe Arg Leu Pro Lys Arg Asn Arg Lys Val Leu Leu Val Ala Ala
130 135 140
Asp Val Tyr Arg Pro Ala Ala Ile Glu Gln Leu Gln Ile Leu Gly Lys
145 150 155 160
Gln Ile Gly Val Glu Val Phe Ser Met Gly Val Asp Ala Asp Pro Ala
165 170 175
Asp Ile Ala Lys Glu Ala Val Glu Lys Ala Lys Arg Glu Gly Phe Asp
180 185 190
Thr Val Val Val Asp Thr Ala Gly Arg Gln Val Val Asp Glu Glu Leu
195 200 205
Met Glu Glu Leu Arg Arg Val Lys Lys Thr Val Glu Pro Asp Glu Thr
210 215 220
Leu Leu Val Val Asp Ala Met Thr Gly Gln Ala Ala Ala Ser Leu Thr
225 230 235 240
Ala Ser Phe Asp Ala Ala Val Gly Ile Ser Gly Ala Ile Leu Thr Lys
245 250 255
Leu Asp Gly Asp Ser Arg Gly Gly Ala Ala Val Ser Ile Arg Gly Val
260 265 270
Ser Gly Lys Pro Ile Lys Phe Val Gly Val Gly Glu Lys Thr Asn Asp
275 280 285
Leu Glu Pro Phe Tyr Pro Asp Arg Met Ala Ser Arg Ile Leu Gly Met
290 295 300
Gly Asp Val Ile Ser Leu Val Glu Lys Ala Ser Met Glu Val Ser Asp
305 310 315 320
Ala Asp Ala Ala Lys Met Gln Glu Lys Met Ala Lys Ala Glu Phe Asp
325 330 335
Phe Asp Asp Phe Met Thr Gln Ser Arg Met Val Ser Lys Met Gly Ser
340 345 350
Met Ala Gly Val Ala Lys Met Leu Pro Gly Met Gly Asn Met Ile Asp
355 360 365
Ser Ser Gln Met Arg Gln Val Glu Glu Arg Ile Lys Arg Ser Glu Ala
370 375 380
Met Ile Cys Ser Met Asn Lys Lys Glu Arg Ala Asn Pro Gly Leu Leu
385 390 395 400
Leu Thr Asp Lys Ser Ala Arg Ser Arg Leu Met Arg Ile Thr Lys Gly
405 410 415
Ser Gly Leu Ala Phe Glu Asp Gly Leu Ala Phe Met Ser Glu Phe Gln
420 425 430
Lys Met Arg Thr Met Ile Ser Arg Met Ala Lys Gln Thr Gly Met Gly
435 440 445
Gln Pro Asp Gly Glu Gly Glu Met Glu Pro Ala Met Ala Gly Asn Arg
450 455 460
Asn Ala Arg Arg Ala Ala Lys Lys Lys Gly Lys Lys Gly Gly Arg Gly
465 470 475 480
Gly Gly Met Gly Phe Ala
485
<210> 84
<211> 530
<212> PRT
<213> Aureococcus anophagefferens
<400> 84
Met Thr Met Ala Arg Arg Ala Ala Thr Ala Ala Leu Val Leu Ala Ala
1 5 10 15
Ala Trp Ala Phe Ala Pro Pro Gln Thr Lys Arg Ala Thr Thr Gln Leu
20 25 30
Tyr Phe Phe Asp Lys Leu Ala Glu Ser Ile Thr Ala Ala Thr Asp Val
35 40 45
Leu Ser Gly Lys Ser Arg Met Thr Glu Ala Asn Thr Lys Ser Ala Leu
50 55 60
Arg Asp Val Arg Arg Ser Leu Leu Asp Ala Asp Val Ala Lys Val Val
65 70 75 80
Val Asp Gly Phe Val Glu Asn Val Gln Ala Ser Ala Leu Asp Gly Glu
85 90 95
Val Ala Glu Gly Val Asp Pro Gly Gln Gln Phe Val Lys Ile Val Tyr
100 105 110
Asp Glu Leu Lys Arg Val Met Gly Gly Asp Asp Asp Glu Leu Leu Phe
115 120 125
Ser Asp Asp Pro Glu Ala Ala Ala Lys Ala Arg Ala Gly Leu Ala Tyr
130 135 140
Arg Asp Asp Gly Ala Pro Thr Val Val Leu Leu Cys Gly Leu Gln Gly
145 150 155 160
Ala Gly Lys Thr Thr Ala Ala Ala Lys Leu Ala Leu Arg Leu Lys Glu
165 170 175
Glu Glu Gly Lys Thr Pro Met Leu Val Ala Ala Asp Val Tyr Arg Pro
180 185 190
Ala Ala Val Glu Gln Leu Gln Ile Leu Gly Glu Gln Val Gly Val Pro
195 200 205
Val Tyr Ala Glu Ala Phe Glu Ala Gly Ala Gly Asp Ala Val Ala Ile
210 215 220
Ala Thr Ala Gly Val Arg Ala Ala Lys Glu Arg Gly Ala Asp Val Val
225 230 235 240
Ile Val Asp Thr Ala Gly Arg Gln Val Ile Glu Glu Ser Leu Met Ala
245 250 255
Glu Leu Arg Ser Val Arg Ala Ala Thr Lys Pro Asp Glu Thr Leu Leu
260 265 270
Val Leu Asp Ala Met Thr Gly Gln Asp Ala Ala Ser Leu Ala Lys Arg
275 280 285
Phe Asp Asp Ala Cys Pro Leu Thr Gly Ser Val Leu Thr Lys Leu Asp
290 295 300
Gly Asp Ala Arg Gly Gly Ala Ala Leu Ser Val Arg Ala Val Ser Gly
305 310 315 320
Lys Pro Ile Lys Phe Val Gly Val Gly Glu Lys Val Gly Asp Leu Glu
325 330 335
Pro Phe Phe Pro Ala Arg Met Ala Ser Arg Ile Leu Gly Met Gly Asp
340 345 350
Val Val Ser Leu Val Glu Lys Ala Ser Lys Gln Gln Ser Ala Ala Glu
355 360 365
Ala Lys Ala Val Met Glu Arg Thr Lys Gln Ala Lys Phe Asn Phe Asp
370 375 380
Asp Tyr Leu Asp Gln Ala Arg Met Val Ser Asn Met Gly Ser Phe Gly
385 390 395 400
Ala Val Ala Lys Met Met Pro Gly Met Gly Gly Ile Asp Asn Asp Gln
405 410 415
Ile Ala Ala Ala Glu Ala Lys Ile Lys Ile Gln Ala Ser Leu Ile Asn
420 425 430
Ser Met Thr Pro Lys Glu Arg Gly Glu Pro Asp Leu Ile Ile Arg Asp
435 440 445
Lys Ser Ala Leu Ala Arg Gln Lys Arg Ile Ala Ala Gly Ser Gly Arg
450 455 460
Ser Val Asp Gln Ala Lys Gln Phe Leu Ser Glu Phe Gln Gln Met Arg
465 470 475 480
Thr Met Met Ala Lys Met Ala Gly Gln Ala Pro Pro Asp Gly Ala Asp
485 490 495
Ala Ala Ala Ala Pro Asp Pro Asp Ala Leu Leu Asn Arg Ala Ala Arg
500 505 510
Arg Ala Lys Lys Lys Lys Gly Gly Lys Arg Lys Leu Lys Thr Ala Gly
515 520 525
Phe Gly
530
<210> 85
<211> 556
<212> PRT
<213> Ectocarpus siliculosus
<400> 85
Met Ile Met Ala Ser Leu Lys His Arg Ser Pro Pro Arg Gly Gly Ala
1 5 10 15
Ala Ala Thr Leu Ser Phe Phe Cys Cys Val Cys Ala Leu Phe Ala Gln
20 25 30
Ser Ser Val Ala Phe Val Pro Ala Gly Gly Leu Ser Arg Cys Gly Val
35 40 45
Asn Asp Arg Ser Ser Ser Ser Cys Arg Ala Ala Ala Ile Gly Ala Ala
50 55 60
Gly Arg Ser Ser Leu Pro Val Ser Arg Ser Ser Ser Arg Arg Gly Arg
65 70 75 80
Arg Gly Gly Cys Ala Gly Gly Ala Ser Ser Pro Leu Gly Met Met Phe
85 90 95
Asp Thr Leu Ala Glu Asn Met Ala Gly Val Ala Asn Leu Phe Thr Gly
100 105 110
Gln Lys Thr Ile Thr Glu Ser Ser Val Glu Gly Ala Leu Asn Glu Val
115 120 125
Lys Arg Ala Leu Leu Asp Ala Asp Leu Asn Leu Met Val Thr Asn Thr
130 135 140
Leu Val Asp Ala Val Lys Ser Lys Ala Val Gly Met Lys Leu Val Asp
145 150 155 160
Gly Val Thr Ala Lys Gln Gln Phe Val Asn Val Met Asn Asp Glu Leu
165 170 175
Val Glu Ile Met Gly Ala Glu Gln Ala Pro Leu Ala Arg Arg Thr Asp
180 185 190
Gly Lys Pro Thr Val Ile Leu Leu Ala Gly Leu Gln Gly Thr Gly Lys
195 200 205
Thr Thr Ala Ala Ala Lys Leu Ala Lys Tyr Leu Gln Gln Glu Glu Glu
210 215 220
Pro Lys Lys Val Leu Leu Val Ala Gly Asp Val Tyr Arg Pro Ala Ile
225 230 235 240
Asp Gln Leu Ile Ser Leu Gly Lys Arg Ile Asp Val Glu Val Phe Ser
245 250 255
Met Gly Gln Gly Val Asp Pro Val Glu Ile Thr Lys Ala Gly Leu Glu
260 265 270
Arg Ala Val Glu Gly Glu Phe Asp Thr Val Ile Val Asp Thr Ala Gly
275 280 285
Arg Gln Val Val Asp Asp Thr Leu Met Thr Glu Leu Lys Asp Ile Gln
290 295 300
Val Ala Ser Glu Ala Asp Glu Val Leu Leu Val Val Asp Ala Met Thr
305 310 315 320
Gly Gln Glu Ala Ala Thr Leu Ala Ser Val Phe Asn Glu Lys Ile Gly
325 330 335
Ile Thr Gly Ala Val Leu Thr Lys Met Asp Gly Asp Thr Arg Gly Gly
340 345 350
Ala Ala Leu Ser Val Gln Gly Val Ser Gln Lys Pro Ile Lys Phe Val
355 360 365
Gly Ile Gly Glu Lys Met Ser Glu Glu Glu Ala Ala Lys Leu Ala Lys
370 375 380
Lys Met Ile Asn Ala Glu Phe Asp Phe Asn Asp Phe Leu Lys Gln Ala
385 390 395 400
Lys Met Met Lys Gly Met Gly Ser Leu Gly Gly Val Ala Asn Met Ile
405 410 415
Pro Gly Met Ala Gly Lys Ile Thr Pro Gln Gln Leu Asn Gln Ala Glu
420 425 430
Glu Gly Val Gln Arg Ala Glu Gly Leu Ile Lys Phe Met Thr Pro Glu
435 440 445
Glu Arg Arg Thr Pro Lys Leu Leu Ile Leu Asp Pro Thr Ser Gln Ala
450 455 460
Arg Cys Arg Arg Ile Ala Arg Asp Ala Gly Val Lys Leu Ser Ala Val
465 470 475 480
Ser Ala Phe Leu Lys Glu Phe Gln Ala Met Gln Ser Asn Met Ser Arg
485 490 495
Met Gly Lys Gln Met Ala Asp Gly Asp Pro Asn Ala Gly Pro Gly Gly
500 505 510
Gln Pro Ser Pro Phe Gln Gly Leu Gly Gly Asp Thr Ala Pro Gly Ala
515 520 525
Ala Pro Ser Met Asn Arg Gln Gln Arg Arg Gln Ser Lys Lys Asn Lys
530 535 540
Ala Gly Arg Ser Ala Ala Pro Ser Lys Gly Phe Gly
545 550 555
<210> 86
<211> 28452
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic polynucleotide
<400> 86
cgtctgatta aaccacgctg ggagattaga taatgaagcg tgcgcctgtt attccaaaac 60
atacgctcaa tactcaaccg gttgaagata cttcgttatc gacaccagct gccccgatgg 120
tggattcgtt aattgcgcgc gtaggagtaa tggctcgcgg taatgccatt actttgcctg 180
tatgtggtcg ggatgtgaag tttactcttg aagtgctccg gggtgatagt gttgagaaga 240
cctctcgggt atggtcaggt aatgaacgtg accaggagct gcttactgag gacgcactgg 300
atgatctcat cccttctttt ctactgactg gtcaacagac accggcgttc ggtcgaagag 360
tatctggtgt catagaaatt gccgatggga gtcgccgtcg taaagctgct gcacttaccg 420
aaagtgatta tcgtgttctg gttggcgagc tggatgatga gcagatggct gcattatcca 480
gattgggtaa cgattatcgc ccaacaagtg cttatgaacg tggtcagcgt tatgcaagcc 540
gattgcagaa tgaatttgct ggaaatattt ctgcgctggc tgatgcggaa aatatttcac 600
gtaagattat tacccgctgt atcaacaccg ccaaattgcc taaatcagtt gttgctcttt 660
tttctcaccc cggtgaacta tctgcccggt caggtgatgc acttcaaaaa gcctttacag 720
ataaagagga attacttaag cagcaggcat ctaaccttca tgagcagaaa aaagctgggg 780
tgatatttga agctgaagaa gttatcactc ttttaacttc tgtgcttaaa acgtcatctg 840
catcaagaac tagtttaagc tcacgacatc agtttgctcc tggagcgaca gtattgtata 900
agggcgataa aatggtgctt aacctggaca ggtctcgtgt tccaactgag tgtatagaga 960
aaattgaggc cattcttaag gaacttgaaa agccagcacc ctgatgcgac ctcgttttag 1020
tctacgttta tctgtcttta cttaatgtcc tttgttacag gccagaaagc ataactggcc 1080
tgaatattct ctctgggccc actgttccac ttgtatcgtc ggtctgataa tcagactggg 1140
accacggtcc cactcgtatc gtcggtctga ttattagtct gggaccacgg tcccactcgt 1200
atcgtcggtc tgattattag tctgggacca cggtcccact cgtatcgtcg gtctgataat 1260
cagactggga ccacggtccc actcgtatcg tcggtctgat tattagtctg ggaccatggt 1320
cccactcgta tcgtcggtct gattattagt ctgggaccac ggtcccactc gtatcgtcgg 1380
tctgattatt agtctggaac cacggtccca ctcgtatcgt cggtctgatt attagtctgg 1440
gaccacggtc ccactcgtat cgtcggtctg attattagtc tgggaccacg atcccactcg 1500
tgttgtcggt ctgattatcg gtctgggacc acggtcccac ttgtattgtc gatcagacta 1560
tcagcgtgag actacgattc catcaatgcc tgtcaagggc aagtattgac atgtcgtcgt 1620
aacctgtaga acggagtaac ctcggtgtgc ggttgtatgc ctgctgtgga ttgctgctgt 1680
gtcctgctta tccacaacat tttgcgcacg gttatgtgga caaaatacct ggttacccag 1740
gccgtgccgg cacgtgatcg cgcaggctca gctgcacagc agacgcaagg gacagctcag 1800
catctggaac cgccgacacc aggtgctgag gatgctgcac ctttggcaac cccaataggt 1860
gcttttgggc gtactgctgt gcctgcgcct agggacattg actggtggcg ggtcccgaag 1920
gagctgatgg gaagctacgc acaagctgaa gctggagaca gcagctccac caatgttgac 1980
ttctctgggg agcctccggc cagcagcgtg tacaacgaga ggggggacgc gttagtggag 2040
caggaggtga aggcagcaac ggcgggtgtg gattttgctg gcaggaggag ggccaggggg 2100
ttgttggaca atgctgagcc tcccgatctt gataatggcc ctggagagca gccagcaggg 2160
gcaacagtga gttggagctg gaggcgattc agggataggc agggataggc agtgtagtgg 2220
ccagaactgg ctgctggaac ctggggttac tcagggtgaa cgcaggcaaa ggggtgcagg 2280
tgtattgaag ctcttaatat aagagagatg cgtcgaacat atatggtgat agtcttgagt 2340
ggtgtgttgg gtggaaggct gctgtttacg gtgcaggaag ttttcctggt acggtcgtta 2400
tgtaatgcag cagcacgtat gtaagaacca gtcgacattt aacctatgca gtagcatata 2460
gttatgtgtc aaaatcataa attggcccta tttgtggcga gcctatcttt caaatactac 2520
tgttcctcgc actgtcctct aaaatttctc caaacaacgt tgtaaaggtg ctgatgttag 2580
catatcatct ctggcataac tggatgcacc cagccggcta acagtgggaa gatgaagagg 2640
ggcttgtaca ctacactttt cttgccaaga ctgttagctt gcccaagcca gcacagcgat 2700
tttcttgcaa caaacgtgag ccttgcatct gcttttgatt gcaacggccg actggtgagt 2760
tattgtgcaa gcagtgttct gctaaactgt tcagaccagg ttcgcagctg gctaagatcg 2820
gtatctggaa agctccaacg aacaggtttt caatacgtgc tgcgtcaata tgcccttctt 2880
gttcactaca gcgacctttc caatgttgga tgtgaacaaa tgtcgaagcg cacaataacc 2940
tgaaagacat tgttgctcat tcccttttct ttggtagcgt aggtttgtat atttagagtt 3000
ccagttctgt actagttgct ctgcggcaac gattgaagtg tgtaccttat actgcacgtt 3060
aaatatgata ggttcagcgc ggttctttaa atgacaaaat aaatagtatt caacaaaaaa 3120
aaatagttgt ttgacatgtc actttttctt ttacataggt agcatgtcgt caaatcgtca 3180
atgcaaacca gcttgcgact aacgtaagca gtacagggga tagtacaatg agtttttcac 3240
cagcaatttg gtccagtgtt ttcgcaccgc cgtgaagcgc attcacatta aagtagcatc 3300
gctacacctg ttctcatctt gttaggttca aattttgcaa cgtgtagcta caaagtggca 3360
acagcgcagg ctgttggtca ctcgctaagg cttgcattgg caccctcgtt gctctgtgta 3420
ggagcgtgca tttgtgctca agactgttat ttttgacttc aaaaacttta tcgatagcgc 3480
actgcctcgt ttttacaaga tagccttctg tgagcagccc tgccccatgc gcctttaggc 3540
tttctgtggc aatgtctggt tcagctggat cgggccaggc tactctcaga catgacggtg 3600
gctctgctgg cggcagtggg cctgtctcag acggtttttc accggccggc ctgaaggtaa 3660
agtagaaaga cactcataca catcttggtt cggcgttgaa agtaggtcat taacatactc 3720
tataaccaat atttgtaggt tctggtcgtg gacgacgagt taaccttgga aatccctacc 3780
aggctattct ccacagcccg aaccccttaa gctagacgaa cacagttagc ataacttcgt 3840
ataggatact ttatacgaag ttatgcggcc gcccaccatg ggggaggttt gaagtgtgcg 3900
cctgatataa tcatacacct aaaagcacca cttgctgatt gtgaagggac tatgtcgttt 3960
atgacgggac gttacgctgg ccgatggttt gaatttggac gctgtggtag aatgttatat 4020
ggacgtaaag gttggcatat tgaaaatcgt cttcacaggc aaacttctag acgtgtgacc 4080
caccggtaaa acgacaagcg tggcgcgtcg attgcgcttt gaacgtcgtt tgttggactc 4140
cagatgaacc tcaaaatcaa agcggtgatt gacgaaaatc aaatgacagc ccgcaaaatt 4200
tcatcagcct tcggatcgga ttctcagaat ctgattgtcc ctgctggcta catttatgaa 4260
atttcgtaca ttttggcaga aatgtcccaa taccatagca ctgccgcctg agctcacccg 4320
agcaatgcat actgggtacc tcgcccatct cgccctcttt ccaagcccag tgctgttgta 4380
aatagccaaa gggctcagta acaatggcca aactgacatc cgctgttcct gtgttgacag 4440
caagagatgt tgcaggtgca gtggagtttt gtgagttctg agaagctgat tgttgtttaa 4500
cttctttgaa agctttatcg aagattctgc aagcgatgaa cattgcttgt caagaccgag 4560
agctgcatgc ccacttgaca tccagctttg aacggctctt catgtttgat ttgtttctga 4620
ttgtagggac agatagactg gggtttagca gggactttgt ggaggacgat tttgcaggag 4680
tggtgaggga tgatgtgaca ctgtttatct cagcagtgca ggatcaagtg agtgcagcgt 4740
cagctgtggc agttgttggc tttcgtctca gtcagtagtt tgctgggatt gattatggag 4800
ggcacagttg caattttgag ttgcacgttg cgacaagcgt gttgacaaag cgtggtcaag 4860
ccggccagtc ttgccggtgg cgggtggctt ggtctaactt ccgctctaca gcaatcgttt 4920
tgttcatggt tacggggctg gcgtgccaga aagtcctggt cagccaccct cgcttcaaag 4980
ccgtagccca acaactttgc gaatatgttc gatttgcagg tggtgcccga taatacactg 5040
gcatgggttt gggtgagagg tacagctctg cgtgcaacag gttgcaagat gcagcgcagg 5100
tcttccctgg tcaaacgatg tatgcagagt tgagaggcac ttgagctggg tgaatggcgt 5160
gggctcgtag gtagtgtgca gggcaggaag ggcagccaat tttggagttg tggtccggtg 5220
tcgttgcttc gagccttatt aggactcttg ctcatcaaag cgttagttgt gaataagttg 5280
atctgaaagg atgttatgta cagcaagcag cagcagttaa gagtctgggg agtagctgca 5340
cagggcgagg tgtcaagatg ggaagggtcc tgcctcctta tgtgtttttc cctgtagggg 5400
aggaagcctc ttatgggcaa tggttgggca tattttccag ccagcccttc tttctatagg 5460
ggccagggtg ggcccagctc gtcttggctt ccaccaccag gagagtgagg gcattgaagg 5520
gccataaata gtcctcccat ctacgtgcac cagagggtgt cgtctaggct gtgcatgcca 5580
cgaggggaag gagccaagaa tgagtgtatg ggttgttttc atgtttaggc tgggataaaa 5640
ctgttttcaa ttgcgcctgc cgggtgaaaa ccacagcagc atcagcaagc ttggagaagg 5700
ccagcccgcc cagcacaggc tcacgttccc actcaggcgg tcagtcgggc gggggtgtga 5760
gtcaggcagg cgagggtgtc tgtgcctgac atcagcacct ctgcttagcc actgcagccc 5820
ctggagcagg gtagggcgtc atttgcagca atcacctgct gcctcacacg tcgcagcttg 5880
gaatttcaac gaccatcagc gctggggttg ttgagggatc atagcagatt ttggtgcagc 5940
ctggttgtca tgctctttgt ggaatggcct ctatgttcga gcaattcgtt ggatgttgag 6000
gtgcttgggg acagagagtc gaatgatggg ccagggtcaa acatgcgagc gtttggctga 6060
gtcagcggtt tttgctggtc actttttctt ttgtttctta tttaggtttg atggatgtgt 6120
tttgtgctgc tgccctgaag ctgcagcagc gtgtctgccc tgcgctactg cgggcaccaa 6180
ggctatgtgc tggtgcactc ggctgcgctg cacctgtgca cctcgcactc cgtccagcct 6240
ccatgcagca cacgtactca cggtgtcctc ctgacctgtc gtacgctatt ccaaacttgc 6300
tcttttgctg ccgctgctct cgtacacaat tgctgttgat tatcgatatc taatcgagcg 6360
cctgctgact gaactccgca ggtttggatg aactgtatgc agagtggtct gaagtggtga 6420
gcaccaactt taggtgggtg ggctctgaag gaggaggagg gagcgggtga ttaaacaggg 6480
cctgcatgaa gaggagcagg ggctgcatgg acagcagggg gaaggtgcag aagggagggt 6540
caagcggggt tcaggtggct gtgggtttct gcacgagcag tgaaagaagc tgtatccttc 6600
cacctgcttt cactggcgaa aggttgaaaa caggatgtcg cagctggaaa gatgttgcgc 6660
tgtcaagtgc aagccatggt tgagggtatg cctgtgtgca tgtgcttctt aaagttactc 6720
ctgttctatg gttctgggtg cttgttgttt gtggtgcagg gatgcaagcg gacctgcaat 6780
gacagagatt ggagaacaac cttggggaag ggagtttgca ttgagagatc ctgcaggtga 6840
gggggcatgt aagcaatggc aggcaattca agaacgaatc attgctgcaa atgctgggat 6900
ggtatgcagc tgaggtatct attgccttgt attttgtctc gcattgcatc ggtggtgcgt 6960
tctgtggcct gaggcacagt tcttgctgtt tgataagggt tcgactgagt tgtcgtgtgt 7020
gctgtgctgc aggcaattgc gtgcactttg ttgcagaaga acaggactga gcatagcatc 7080
agcctgtggc agggttgtgg tagggctgag tggcagggtt aaaggggttg cctaccccac 7140
ccctactctc atgacaccag caacagcagc agctcatgca gtactcaaat cactgatgtc 7200
aatggtgtga cacatttggt taaggctgct ttttaaagtg ctgctttggg ggcagtgact 7260
gtgcagagct tggagcgtat ccccatgtaa tcagaaccga cgagagttcg gggcaacctt 7320
tcatcttcac attttttgtg atcagctaca gagtctgaaa tcaaatagag gctgccatct 7380
aaacgcagga gtcacaacga aggcgaaaac tccaattgct gtactcaatg cactaagtga 7440
ttgttcaatg gataaataca ctatgctcaa ttcatgccag cagagctgct ccttccagcc 7500
agctacaatg gctttttcca cgccttttga agtatgaatg ttcagcttgc tgtgcttgat 7560
gcatcaccat aaacacaatt ctacaacatt tcatgccaac aacagtacgg gctttccctg 7620
caggcagttg gtacggcata ttatggttta aacatctatc ctccagatca ccagggccag 7680
tgaggccagt ttgcatagtt aagtatgctg gctattgcag taccttatat gcaaacaagt 7740
gctcaatctg tttcatcatt gtctgtgggc aaattgcctg ccaatattct ccagttattg 7800
cctgttgttt caaatgattg aaattggaag ttgtattgct ctacattttt gacttgtgat 7860
tttttcattt gttgatatct gacaactgtg aactgcactg aacttgctgt gcttataaat 7920
gcattttttt gttttgggcc acgttgattc cttgtgatac tttcctgcta tcaaaccaaa 7980
aatatactct catgactgac gtgcaacaaa tgcatggaag ctttcaacgt tacgacagct 8040
gcttgccccc catcagctat tctacatgtg taacctacct tgcatggcca ccacaacgct 8100
actgcatgca agatctggcg caactggatg tcccaatagt agaagtatcc ggattatctc 8160
cgagagtttt acatatgtaa tcgacgccat ttctgtcatc aactataaat ccattgctcc 8220
tgcatttctg gcactgacat tctaccacaa gcaataccaa tgttggagag cgacgagagc 8280
ggcctgcccg ccatggagat cgagtgccgc atcaccggca ccctgaacgg cgtggagttc 8340
gagctggtgg gcggcggaga gggcaccccc gagcagggcc gcatgaccaa caagatgaag 8400
agcaccaaag gcgccctgac cttcagcccc tacctgctga gccacgtgat gggctacggc 8460
ttctaccact tcggcaccta ccccagcggc tacgagaacc ccttcctgca cgccatcaac 8520
aacggcggct acaccaacac ccgcatcgag aagtacgagg acggcggcgt gctgcacgtg 8580
agcttcagct accgctacga ggccggccgc gtgatcggcg acttcaaggt gatgggcacc 8640
ggcttccccg aggacagcgt gatcttcacc gacaagatca tccgcagcaa cgccaccgtg 8700
gagcacctgc accccatggg cgataacgat ctggatggca gcttcacccg caccttcagc 8760
ctgcgcgacg gcggctacta cagctccgtg gtggacagcc acatgcactt caagagcgcc 8820
atccacccca gcatcctgca gaacgggggc cccatgttcg ccttccgccg cgtggaggag 8880
gatcacagca acaccgagct gggcatcgtg gagtaccagc acgccttcaa gaccccggat 8940
gcagatgccg gtgaagaata agcagcagct tgttatgcct tccccatggg catcagcatg 9000
ctgcaagctg tctagatatc cagctttcag tggaggttga gcgagggtca gcagcggttc 9060
cctggcgatg gcggtcagct tttctggaag ccttcactag gactgcgccc agcgcatgtg 9120
acgccaatcg aacttgtgtg caaggccaaa ttttgtgacc ctgtgctgca cttcatgtat 9180
tcaagaattg agaagaaatt tcattgctgc ccttctttca ctttaatttc catccctgga 9240
tccacctccc accattgtgg ttgatgggta ggggttttgg gtaggtgcag ttcgttgtgc 9300
acgttgacat gtgtaacggt gagcaaagga attgctgggc aagtagctat tgcagcttaa 9360
gggcatggtg aaacacttgt gctgtattta cagaggaagc cagacaggta aggagtgtgt 9420
ggcagcttgg aacaggaggg ctggtcgcaa caagtatgca tatcccatga ttgttgacat 9480
aagagcagca ggtgcatatt gccagccttt gtgaaagtgg attgaaaatc aattagttgg 9540
tgtgatagct gaggctaggc actgccaacc tgcagtgaaa tgaggctcca agaccgggta 9600
ataatacagg caatcgaatc cagttgaaat tacggcgatt aaatccaagc gagcgttgta 9660
agaacatctg cacctgtctg aagtagtgag cggataatga gcattgcttg ccttctatca 9720
ctatacctga cagttacgtg tcacacactc tcaagcacaa cacacagcgg caaagttact 9780
tgctaaacct cacagtcaag ctgaaaataa aggctaaatt acgtgagacc ggcgcgccat 9840
aacttcgtat aggatacttt atacgaagtt atcaccagat ataggtgacc cgataactta 9900
attaatcttg cgaagattga attgctaata gaaggttctc atctatacat gagttaccag 9960
tgaaccccat atctgctcta taatatagtc cccgctgagg cgcagtgctg aggttccagc 10020
tcgaacgagc cagtagggct tcgactcacg gctcatttat tttagagcta ggttgacttc 10080
ccagtctcat gcaatacatg agagcaggtg ttggtcgcac gcctctctca cggtgcctct 10140
tgattttcgg ccccttgcac ccgctctcat atgacatatt cgcgctgcac ccttgctcag 10200
agcaggcgca gcatgtggag tagcgggcgc aagccgtaat gaggagtctc agctcaacat 10260
gattgaggtc agcatcactg taacaataca aatcattgtg gtgccttata tatttggtaa 10320
atgctcgctg cagtattcaa atcgaccttc actgcaagca actcgattga actacgcgcg 10380
ttattgaagg cacatacaac cgggagttca gaggagtatg cccaagaaga agcggaaagt 10440
cgggagcaat ctgttgaccg tgcatcaggt atcgagaaga actaaagagc gttcaaacgc 10500
atcaatattt tgctaaagag ctttacatct ttttggggct attttctggc tactcggtag 10560
tgacttgacc actttcttcc caagtggggg caagccgata agccgctgtg accgttgatt 10620
tttttataaa agacgtagac atgttcaatc agccacaatt gatatgcttg aatacagaac 10680
ctgcccgcat tgcctgttga cgcaacatct ggtgagctgc ggttgctatc ctcccaatat 10740
aacctgaagt catgcatata ttcgcactaa tctacatccc atgttgtgtt gagctattcg 10800
gtattgatgc cagctcagtg aactaattat caaatgtata tcggtgctgc cagaatcgat 10860
ccatgtatca atgccacaag taactggaga tacatttgct acatgtagat gaggtgcgca 10920
agaacctgat ggacatgttt agggaccgcc aagccttcag cgagcataca tggaagatgc 10980
tgctgagcgt gtgcagatct tgggcagcat ggtgtaagct gaacaaccgc aagtggttcc 11040
cagcagaacc cgaaggtatg cctgggtaac tgtcaaaatc atgtatattc ccgcaatgca 11100
agtggttcat tgttgtgctt tacgttaaag acgtgtcagc tgcaggagaa ttattttgag 11160
gatgattgtc cgttgttggc gatgtcttgc attgtgaagt atgttttgaa gtcatacagg 11220
aagtgtgaaa tcccaaagca gctggctgcc gctgcatgcg accagtcatt cacctgcatt 11280
gtgtgtgctg tagatgtgag ggactatctg ctgtacctgc aagcaagggg actggcagtg 11340
aagaccattc agcagcatct gggacagctg aacatgctgc ataggaggtc tggactgcct 11400
aggccaagcg atagcaatgc agtgtctctg gtgatgcgcc gcattagaaa ggagaacgtg 11460
gatgctgggg agagggcaaa acaagcactg gcatttgagc gcaccgactt tgaccaagtg 11520
aggtgggctt cgcaactgct gcctgaactt cctgttcctg tgcatgtaca tgagagtcgg 11580
ttggaacagg ctcatactgc gcctgattga taggctgtcc cacattgttt tatttgctgt 11640
atcgatgtat tcattttgca ttgggtcctt tctgctcatg aagcaccaag aaggctggct 11700
gtcaatggca tgccagctca tgccatctgg atgacattat gcaagaccag tgttgactcg 11760
aacatgaatc ttactggaaa ctttaatgaa tgctttcgag ctttttgtgc aggtctctga 11820
tggagaactc agaccgctgc caagacatcc gcaatctggc atttctgggg atcgcctaca 11880
acacactgct gaggattgcc gagatcgcac gcattagggt gaaggacatt agccgcacag 11940
atggagggag gatgctgatc catatcggga ggacaaagac cctggtgagc acagctggag 12000
tggagaaagc actgtctctg ggagtgacca aggtaagctt accatgtgtt tatatgaagc 12060
tgatatttgg aagaaaggag gaagcaacga caacaagggc ggtgcacaat ctattgccgc 12120
ttttgaatct tgcccgcaaa ggcagtcgat gattgctcac tgtatcaggt tgatttagtt 12180
gatgaggtgt agctggggaa gctccaatcc ccagtccaga tagccttggt tatgaattgc 12240
ataatgtagg caccacttgc actggtccta aaccccagtt cattcctgtc cttctcgtgc 12300
attttgtcaa atgaacatgc aaccgagtgt gttttcctac tcgacatgtg tgcgattgcc 12360
cacgtgtgct gcagctggtg gaacggtgga ttagcgtgtc tggagtggca gatgacccca 12420
acaactacct gttttgccgc gtgcgcaaga atggagttgc tgcacctagc gcaaccagtc 12480
aactgtctac aagggcactg gaggggatct ttgaggcaac acatcgcctg atctacgggg 12540
caaaggatga ttctgggcag aggtatctgg cctggtctgg acattctgca agggttggag 12600
cagcaaggga catggcaaga gctggagtga gcattcccga gatcatgtga gaggccccag 12660
caaaaacaac agcactagct gttgctgctc agtttgtgct cgtgatgttt gaaaggaatg 12720
gacaaggttc atccatgatg ttcattatct gggctggtct tgtacatggg gttattctat 12780
actaaacagg agcgatacaa ataacaaaca atcaatgtct atatacacat atacttggct 12840
aaatttttct cccggcctta catacataac aaaggctaaa ctaattgacc caaaataatt 12900
gtatgaataa tcaaattgat gcatacaaat aatcctaaaa atgaaaaaaa tttcattgaa 12960
ataagtatag aaataacaaa tgtttgaccc acagccctca ctctccaacc caatcctgcc 13020
tctcacaaga cttgccatgt accaacttac aatgacagcg agctacaaca agttccatca 13080
aggtgtgggt tgctattagt tggtggaacg tttgtacatt tcacagttgg acatgcactt 13140
gcgaaaaagg cgttggcttc agtgaggcag tgcttgctcg tatcccctcc aagcatgcct 13200
tgtgcaccca ttttgcaacg caggcaagct ggagggtgga caaacgtgaa catcgtgatg 13260
aactacatcc gcaacctgga cagcgagact ggagcaatgg tgagactgct ggaggatggg 13320
gattaatcag gatgttttga gcggttgtag gttctgtagt tgtatggtag gttgcatgga 13380
ggaaataggc caacaacaat tccaaatcaa aggagattgt agcgttgctc ttggtccccc 13440
tgaaaatttt tgttgttatg tgtctataaa tctagttctg caccttgcaa actgtgggat 13500
gccctgtcca gagcagaagg taatcccaaa acagtcgaga aagtctcgtt gggtggttgt 13560
gtaaagtaca aatgtatgtt ttccaccttg tctttgtatt gtgcacgagc tacagcattg 13620
gtggaagggc ttatagctgc tgggtcatca tgctgtcctg ttcttgatgg tttaggtgtc 13680
atccctttca ctgactcagc gaaatcggat gcgtaccatt catgaacggt gttgcacttg 13740
ctgtttgtga aaggtactgc atgtgcattg tacaatagac tactataatg tctcatgcac 13800
gtggtcaatg atgtagattt ctggaatatg catcgtgtaa ttgattcgat gaacccctcg 13860
tttggaactc tatttgaaaa gcaatcgagt gtcattatcc ataatggatg atgatcatga 13920
gcattgcaaa tagcaccatt agaacaaact gaatattgta caccttgacc tggatatgca 13980
tccgtccttc atcccacttt attaaggcag gttataattg gcaaggagtc ggcagaatag 14040
tcgtttggtt ataccccagt tttagtgggg cctttggcag ctatattatg gtcgcgactg 14100
taaccgggtc cgtttaaagt tcgattacat ctcagaaata taattgggct gcatgttaga 14160
aacttttcgc cgggtataac cggggtataa tcggcatact gcccaatgac ggccagccgc 14220
tggtcagtga ccgtcaaacg gtcggacggt ctgcatcgca tgtgcgctga catgtcaagt 14280
gcatgcttct cttacattca ggcaaaagac tacaagtcat tgaagaattg tcaactcagt 14340
aagctgacaa ttacgttcat gaaggtcagt cgtatgaaac tcgtatttct ccctaagtcg 14400
ttactatgga aagtacatcg tgccacgtca tcgtcatcgt ggcaatgaca gatgatggat 14460
agggtggggt tggcattaat tgctatcatt ttctttgcag aaaacaaata cctggcacat 14520
aatttgttga taatcatatg tatgtatgtc cacatgtcaa cgttatatgt ataaaaatca 14580
agacttgttt gcttaactct aaatttaatg taagaatttc ggtaataatc tgatctacat 14640
tatcacttgt gattaatgtt gaaatttgtt atccttaatt atcgtgcttg gcacaacttt 14700
cagattttgt ctgctgtcac attcatgcag tttcatttgc agtaaattct caatcattta 14760
tgtagttgat aagaatattt gatctgcttt tcattaagca aattttgtta gctttctccc 14820
cttgattgtt cattcaatga gattacattg aatgatgtct acacatataa taagaacgca 14880
tgtctacaca aatctaaaaa tcagctgcac gctcccaatt actatcgcac actctgacac 14940
cagaccgtgc tgtgacaata taagctgcac tgacaaattt ggaaaacaca agattcagaa 15000
gaaaacaaat actggaaccc ctcacacacc acctttctac agcacaaaca cgaagcagta 15060
gccaaggtaa gaaaatccga tcaaaataca ttaaatcatg tctaatatac agcataagta 15120
tagctaatga aatcgttggt cgggccttaa taacacacag tctaccaaca cctagttggt 15180
aaataccgtt gctgatattg ctctgtacca gtaaaagagg gctgcgatga gcgtttttag 15240
tgcacttctt caacacggaa tatttttcac aaattggtat gagaaccaat tttgcaaaat 15300
gttcgccctg taaagtatcg ctctgggacg atcagcttga cgtaattgta ggcgaaaagg 15360
gcgttcaaag tgcagcttta tgtatgaacg tcataaaata taaagcatag cacaatcact 15420
gatagaaaat atttgtgcgc attaaaactc tcacttctgt tgcggataca acgacggaaa 15480
tgagaagctt gtgtaagaag caattcaagt tttcattttg tcatctaagg tgtgatcctc 15540
cgatattcat taccgaatgc tgatctgagt tggaaagatg gcaatattta gctgtgcaca 15600
ctttgacctc caggccttgg cgggaattta gtattctagc tttcctattg gaacgatagg 15660
ccagccaagt ctccagcttg tatacgctac accagcagac atgctctcaa tttagctgac 15720
agtgtcttca tatttgtatt atctgttgtg tctatgccga agaagaagcg caaggtgggc 15780
gactacaagg acgacgacga caagctggag ccaggtatgc ataacctttc aatagatgct 15840
gccgcgcctt gggttcgctg cctgtgtcct gaagtacttt tcaccaggtc tacatgcatg 15900
cagcaactaa tcgttagttg ttcctttgta aacagcgttt tctgtcttta ccatgattca 15960
ggcgagaagc cgtacaagtg tccagagtgc ggcaagagct tcagccagtc aggagcactg 16020
acccgccacc agagaacaca tacacgcgac aagaagtaca gcatcggcct ggacatcggc 16080
accaactctg ttggttgggc ggtgatcacc gacgagtaca aggtgccgag gtatgttatc 16140
tttgattgca ctacttgcag tcctggtggg cactattgtt gtgcataggc gctcttttgc 16200
attcatgtat tgaatgtaga gaagttgtac actcctccta ggagactagc tgatggagtc 16260
ctgtattaaa tttgttcaca tcatatgcct tacagcatga tccattagaa gtaactaaat 16320
ttctaagcac ccagtctgag aaaccagatc gatggcaagt tgctcttggc ttgctgtgct 16380
tgcagcaaga agttcaaggt gctgggcaac accgaccgcc acagcatcaa gaagaacctg 16440
atcggcgcgc tgctgttcga ttctggcgag acagcagagg cgacacgcct gaagagaaca 16500
gcacgcagac gctacacacg ccgcaagaac cgcatctgct acctccagga gatcttcagc 16560
aacgagatgg cgaaggtgga cgacagcttc ttccacaggc tggaggagtc gttcctggtg 16620
gaggaggaca agaagcacga gcgccacccg gtaagtcgcg tgccaagcac tagtttacca 16680
tcccacaaat gacaggtctg ggtgggacat ctgcacctga aaatggctta cgacagctgc 16740
ttctcaattc gagtgtgcat attgcaagca ttagattttt tcctgcagat cttcggcaac 16800
atcgtggatg aggtggcgta ccacgagaag tacccgacca tctaccacct gcgcaagaag 16860
ctggtggaca gcaccgacaa ggcggacctg agactgatct acctggcact ggcgcacatg 16920
atcaagttcc gcggccactt cctgatcgag ggtgagtgtg gaatgcatca cagtggaaac 16980
tgctttgtag tacaatttgt ttgtgaagtt tgtgtctaga tgtccatttg atctgtggaa 17040
tgaatgtgct agctctcatg cacagcagta tttggaatgc tgaattacag tgtttccttt 17100
gttggtgtca ggcgatctga acccggacaa cagcgacgtg gacaagctgt tcatccagct 17160
ggtgcagacc tacaaccagc tgttcgagga gaacccgatc aacgcaagcg gcgtggacgc 17220
aaaggtgtct tgatgtaaag tcgaacattg catttgaacg aaggagctcc cttgttggct 17280
aagcatgggt attgactcta ccccagcagg gaatcatctt gctgcaacag ctcacgtcgt 17340
atttgtatgt ggtgcaggcg attctgagcg caaggctgag caagagccgc agactggaga 17400
acctgatcgc gcaactgcca ggcgagaaga agaacggcct gttcggcaac ctgatcgcgc 17460
tgtcactggg cctgacgccg aacttcaaga gcaacttcga cctggcggag gacgcgaagc 17520
tgcaactgag caaggtgaac gtccccctcg gccctgtgct ggtgtgcctg ctgtccaatg 17580
gcacgtttgt gcttcacaat tctacaggtt gatgcaatgt aggttggttg tgctgatgcc 17640
agagatgcac tcaaccaaca ccgtgttgct ttgttggttc ccaaccagcc tgcaatgcaa 17700
cctgtgaatc gtgcaccata cgatctgcat gcaggacacc tacgacgacg acctggacaa 17760
cctgctggcg caaatcggcg accagtacgc agacctgttc ctggcagcga agaacctgag 17820
cgacgcgatt ctgctgagcg acattctgta agtctcagag cacatcacct gcatcacaca 17880
ggatttcttt tgtcagcata tcctgccttt tcgggtcatg tttggatgcc gtgcggctgt 17940
gtgccactgg tccaggcgta ctgggctttc tgacaagctg gatgttatgc ttatattgca 18000
ggcgcgtgaa caccgagatc accaaggtga gccgcacact tgctattgct cgctttcaca 18060
aaatacccgt cgtgaaaacg tcatgtgaag gttgctatca tcgggtcaga gagtatatta 18120
catcatgaac aggctgcaag ggtttgattc ctgcaggcac cactgagcgc gagcatgatc 18180
aagcggtacg acgagcacca ccaggacctg acactgctga aggcactggt gaggcagcag 18240
cttccggaga aatacaagga gatcttcttc gaccagagca agaacggcta cgcgggctac 18300
atcgatggcg gtgcatctca agaggagttc tacaaattca tcaaggtatg tttggcacac 18360
cattgacaga aggggcatgt cttgcccagt gtgcactgct gtcaggtcga tgagagaagt 18420
ggcaatgaaa aattttggtt tgacaacaaa tatgaggggg tactcgggac tgattggcaa 18480
tgcgttagaa actccgtaag atcaaatttc tgaagtggta gcagtggaag ttcctagctg 18540
agggtgtcac tcactcttat ttctgcagcc gatcctggag aagatggacg gcaccgagga 18600
gctgctggtg aagctgaacc gcgaggatct gctgcgcaag cagcgcacat tcgacaatgg 18660
cagcatcccg caccagatcc atctgggtga gctgcacgcg attctgagaa ggcaggagga 18720
cttctacccg ttcctgaagg acaaccgcga gaagatcgag aagatcctgg tacgtggccc 18780
gggttcacct gttgcgtgca tgttgacttc aggacaaagt tagcattatt acacagcggc 18840
agcacagtga gggtcatcat gtggctggct ttccaattgc tccgagggaa taatcggttg 18900
aatgtgtgtt tctcttgcca gtgtgtcctt ggaggtgcgt gcgtgcttcg caaaaaagga 18960
gtacccaata acccttgaaa caaccagttt tgggctgcaa caacacaaga ccgcggttta 19020
ctgcctgact atgcagacgt tccgcatccc gtactacgtc ggtccactgg cacgcggcaa 19080
cagcagattt gcgtggatga cccgcaagag cgaggagaca atcaccccgt ggaacttcga 19140
ggaggtggtg gataagggtg cgtggccagt accagctgca ccccacaggc ggttgttttg 19200
acatttaaac cgctttcagg aagcgtttgt acactcatgc gcttcatggt ctaccagcag 19260
gaggtctgga acacattcag atctaacatg aaatcaagct tgcatttcaa aagcggggca 19320
tccaagtgca gcggggatga actgctgtct catttctatg caggcgcgtc tgcacagagc 19380
ttcatcgagc gcatgaccaa cttcgacaag aacctgccga acgagaaggt gctgccgaag 19440
catagcctgc tgtacgagta tttcaccgtg tacaacgagc tgaccaaggt gaagtacgtg 19500
accgagggca tgcgcaagcc ggcatttctg agcggtgagc aaaagaaggt gggtggtgca 19560
caatgttgat gcagatttga cgctgtatca ctgctgtctc gctgtacagc atctgataca 19620
ctgctgttcc cgctccccgc aggccatcgt ggacctgctg ttcaagacca accgcaaggt 19680
gaccgtgaag cagctgaagg aggactactt caagaagatc gagtgcttcg acagcgtgga 19740
gatcagcggc gtggaggatc gctttaacgc gagtctgggc acctaccacg acctgctgaa 19800
gatcatcaag gacaaggact tcctggacaa cgaggaggtg attgtgggtg gagtgcaccg 19860
cgaatgaatg gggcactgca gcacaatgga gcacacatcc aatccgcaat gagctctcct 19920
gagacttttt ttggctcctg aagcaaacca gacaatgtgc gcctatttca cggacctggc 19980
gcatggaagt agtctggcaa ctatggctgg agcacaacaa tttctggtta ttttgattgg 20040
aatgattggg ggaaaaaaca atgtgttgcc cgcagcacag gccctggtgc agttgagtta 20100
gctgtagcag tagcagaagg catgtcatcg aaaaagtacc gaattgtgcc atcatcccca 20160
ccctgctgca gaacgaggac atcctggagg acatcgtgct gaccctgacc ctgttcgagg 20220
atcgcgagat gatcgaggag cgcctgaaga catacgcgca cctgtgagtg gttgccctgg 20280
acactggaga tttcttgcat gttgggtgtg gctgattgtg cctgcatcac tggatgattg 20340
tggcacattt tcggtttaat attcagggta ctgctgcaaa cgagcttggt tcaactgacg 20400
tacctgaacc agtcgttttg ctgcttgcag gttcgacgac aaggtaagct gtgacaggac 20460
aagctggcag attcttcact tgcacctgtc cagctgaatc tacaaccatg ggtgaaggat 20520
gctgccgttg ctggcagcca cacctgtttg aaactaaaat gggagcaacc tgtgcagcaa 20580
ggtcctacga tatcatacct gcttcttcaa ccatctgatg ccccttatca acaagcgcac 20640
cctgcaggaa ttacccttgc accaaaacct gggcacgttg cctgccgctt gccagaacta 20700
gctgtctgtg ccactcccaa catgtgccta gcatctgtga tatctgctac aggtgatgaa 20760
gcagctgaag cgccgccgct atacaggttg gggtagactg agccgcaagc tgatcaacgg 20820
catccgcgac aagcagagcg gcaagacaat cctggacttc ctgaagagcg acggcttcgc 20880
aaaccgcaac ttcatgcagc tgatccacga cgacaggtga gccaggggag gtgcattcct 20940
agcctgtgct tgcttgtgtg gaccctattt gggaggagga agattgacct ggtatgaaat 21000
gtgaggctag acaacacatg cgactatttc tctccagcag cactggcagg acgatgggac 21060
tgcatgtgag ggcatgtctt gacatgaaat gtcttgccac cagtttgatg tgttgacatc 21120
gaacatcagc cccccttccc cagctattat ctagttctgg tcctatcaga ccatgcgcaa 21180
tctgctggcg gtctcatctt taaaagcatt cttgtcatca ggctgtgcag tggagccagc 21240
aataaaacca acctattgtt ttgcagcctg accttcaagg aggacatcca gaaggcgcag 21300
gtgtctggtc agggcgatag ccttcacgag cacatcgcga acctggcagg ctcaccagcg 21360
atcaagaagg gcatcctgca gaccgtgaag gtggtggatg agctggtgaa ggtgatgggc 21420
cgccacaaac cggagaacat cgtgatcgag atggcgcgcg agaaccagac aacccaaaag 21480
ggccagaaga acagccgcga gcggtacgca gaactctggc gtagccacgc aaatcatgtt 21540
tgcagatgaa agttttgtca tatgcgcaag accagggacc ttctatgtat caaaaggctt 21600
aacagtgtgt tgttggttat gttgtgcagc atgaagcgca tcgaagaggg catcaaggag 21660
ctgggtgagt catgtggaaa ggtatcatac attagatggt gttcccctgt tgtacaagat 21720
ctggcagcat ttggatgctg ccattggaga tttcatgaga tattcagtta aactaaaagc 21780
gtgagttttc gcagcagagg atagagccaa actcacaaat cattttggct tggtgcaggc 21840
agccagatcc tgaaggagca tccagtggag aacacccagc tgcagaacga gaagctgtac 21900
ctgtactacc tgcagaacgg ccgcgacatg tacgtggatc aggagctgga catcaaccgc 21960
ctgagcgact acgacgtgga ccacattgtg ccgcagtcgt tcctgaagga cgacagcatc 22020
gacaacaagg tgctgacccg cagcgacaag aatcgcggca agagcgacaa cgtgccgtct 22080
gaagaggtga ggcatcgcac aggatataca gtgggttcca tgagtgctgt tgtgttgtgc 22140
attgcttcga cccgctttcc aacctgtgcg tggtgtatgg gtttgcacca tggcgtgcac 22200
gggcacaggc atgtcatgct gcaagcaaca gggccgccaa gcttccttca cctgctcggt 22260
gatctttgtc ccttcctcca ccctcccttt ttccccgccc caggtggtga agaagatgaa 22320
gaactactgg cgccagctgc tgaacgcgaa gctgatcaca cagcgcaagt tcgacaacct 22380
gaccaaggca gagaggggtg gcctgtctga gctggataag gcgggcttca tcaagcgcca 22440
gctggtggag acacgccaga tcacaaagca cgtggcgcag atcctggaca gccgcatgaa 22500
caccaagtac gacgagaacg acaagctgat ccgcgaggtg tgacccgggt gtattagaga 22560
gatgcgcaac gcgtgctggt tgttgttgcc gttgcaccta gggagtaggt cgaatgccgc 22620
gttggtgccc gctggggtgg ctgtatcatg ctggatgggg ttgcaatcag acccgggtaa 22680
gaatgaagtg tggagctcac tgttccgtcg agcgcttcag cctgcttgat ggtgatgccg 22740
gtttggcgca ggtgaaggtg atcaccctga agagcaagct gtgagtggcg tgctgcacaa 22800
ttgtttgtca agtgcacttg ttcttgatac aaagttgggc tcgccattga tagcaagaaa 22860
aagaacttgc cacctggata gctgcgtctg gaacatgttg catggaggga attttatggt 22920
gacacccatg gtgacactct tcatggaacc tgctggccac ctgctggtat gcctcttgag 22980
gctggatgat caacaaatga tgtgccgcag tctacagtca atttcagttc acccagtagc 23040
tgtttttcat tcgtgctgca gggtgagcga cttccgcaag gacttccagt tctacaaggc 23100
aagtgccttc tagggttcag atctaagcca gagcagtgaa caactggtgc tattatatcg 23160
tacatatggt gctaattcgc ctgcttgcag ctcagcaggc accattggtg cacaggaaaa 23220
tcggcgcatg atccaagtgc agctgcgcct cgcagcttgt acccctgctg agttttcttt 23280
cggctgttgc ccatgcaggt gcgcgagatc aacaactacc accacgcgca cgacgcctac 23340
ctgaatgcag tggttggcac cgcgctgatc aagaagtacc cgaagctgga gagcgagttc 23400
gtgtacggcg actacaaggt gtacgacgtg cgcaagatga tcgcgaagag cgaggtgagc 23460
actcacaggc agttctgtta ccaacatctg cgattttctt gggcagagag tgtatcttag 23520
acctcattca cctcagattc ctgagcgagc tgcaatgccc gttgtcagcc tgtgcaatga 23580
aggaaaaacc tgtcgtaatg cttgcagcag gagatcggca aggcaaccgc gaagtatttc 23640
ttctactcga acatcatgaa cttcttcaag accgagatca ccctggcgaa cggcgagatt 23700
gtgagtgtca cagtagtgtg catcttcgtt tgatccagtt tgatccacgt gcagctgccc 23760
atcaagtcca ggttgtggac cttcatcttt ggactggcag tgtatgaaaa gtccactggg 23820
aacctgctct ttttcatacc gcatcatgca tatcgtgtcc catcgtgcgt acttcatgag 23880
ttgtccctat ttttattact gtcgtcatca cttccaacgt ccacagagcc aacacgactt 23940
gtgctgaata aaggaatgaa atcgcctatt taatataaac tggtattgtg ggacaaagtc 24000
caattcgcaa gtctgatgcg cacctgtgca gaggaagagg ccgctgatcg agaccaacgg 24060
cgagacaggc gagatcgtgt gggataaggg ccgcgacttt gcgacagtgc gcaaggttct 24120
gagcatgcca caggtgaaca tcgtgaagaa gaccgaggtg cagaccggcg gcttcagcaa 24180
agagagcatc ctgccaaagc gcaacagcga caagctgatc gcgcgcaaga aggactggga 24240
cccgaagaag tatggcggct tcgacagccc aaccgtggca tatagcgtgc tggtggtggc 24300
gaaggtggag aagggcaaga gcaagaagct gaagagcgtg aaggagctgc tgggtgagcg 24360
gccagcacat gcacctaggt tgcctatcac atggcaccaa attgcatagc catttcaggg 24420
tgattcactt cccggtaaca ggcattgtct ggcagcctca tcgtatgcat gaatggagat 24480
gggtcaattc aagcttgcat ttcaaaagca gggcatccaa gtgcagctgg gatcaactgc 24540
tgtctcattt ctatgcaggc atcaccatca tggagaggag cagcttcgag aagaacccca 24600
tcgacttcct ggaggcgaag ggctacaagg aggtgaagaa ggacctgatc atcaagctgc 24660
cgaagtacag cctgttcgag ctggagaatg gccgcaagcg catgctggca tctgcaggtg 24720
ggtggtgcac aatgttgatg atagtgccct gatgtagtgc gcagatttga cgctgtatca 24780
ctgctgtctc gctgtacagc atctgataca ctgctgttcc cgctccccgc aggtgagctg 24840
caaaagggca acgagctggc actgccgagc aagtacgtga acttcctgta cctggcgagc 24900
cactacgaga agctgaaggg ctcaccggag gacaacgagc agaagcagct gttcgtggag 24960
cagcacaagc actacctgga cgagatcatc gagcagatca gcgagttcag caagcgcgtg 25020
atcctggcag acgcgaacct ggataaggta ggaattttcc cctccctgca ggtggccagg 25080
gaaatgaacc ggtcaccatg taccgggtag cacgggtgga cacacggcag tggccaggga 25140
atcgtactgc tgagggtccc cctgcatgca gactgtgggg gttccctcag gctccgtctt 25200
tgttgcacat gcaatggttt gatcggtctc agttggcatc tctattgaaa ctgctatatt 25260
cctatgccag tgacgcagag gtgaggatgg ttgacaaggt tttgacgtag tgggtgttga 25320
gggtgctgtg caggtgctga gcgcgtacaa caagcaccgc gacaagccaa tccgcgagca 25380
agcagagaac atcatccacc tgttcacgct gaccaacctg ggcgcaccag cagcgttcaa 25440
atatttcgac accaccatcg accgcaagcg gtacacaagc accaaggtac tacctgcctg 25500
cccaaatgct gttgggcttt gcagcacaaa ggaaaattct ccagccaggg tttttcctgc 25560
tgcaacactg ttgtatgatc gctcacaata agggggaaat aggtttccaa gtcatggttg 25620
tgacagtgga aaccaagtct tttttgcctc caccaagttt ttgtcctcaa atttaattca 25680
atggtggttt gtaggaggtg ctggacgcga ccctgatcca tcagagcatc acaggtacag 25740
tgcagcagca caatccctcg tcaagcttac ttgtgttgca ttgccaaatt gcccaatttc 25800
ctatgaagtt tgctgtacat ttgatcatgc gctaaattgc ttttacgttc tatcgctttg 25860
tatgcatgca ggcctgtacg agacccgcat cgacctgtct caactgggtg gcgactgagg 25920
tgcgaatagt gcttcagtaa aaaagtagca acttggtgca atatcgtcag ggtcgtgtgg 25980
tctgctcgcc agcaagtttt ttggcacagg agagcgcttt ttccgagtac cgccaaagtt 26040
caagcatgtg ctgtgattcg ctgttgcctc ttatgataat tgctcaaagt ttccaagcat 26100
tctatgtcca ccctgcacca ctaagttgta tggtgcttat tctgcagggg atgattcatg 26160
gtgcctaaaa attttgtgct gctgtcgcgt ctgttttctg tcgcagttta gtgaatgtaa 26220
ctccaaatac caaacttttc atcacaatca tattgatgcc tttgtaagtg aattacagcg 26280
ttttttgcca taaaaagaag taccgtgaca ttggggtcgt cataacaaga agctttatga 26340
acaagcagct tgatctacga gacttataca taaatggttt cgggtaactc ctaatacggg 26400
gctacgttag ttcagcagct gagaacgacc acgaacggga agaattccag ccatgttgaa 26460
gaggtgcagc tatcaaggtg aggtctttac tggtgtctgt tattgctgta acatcatttc 26520
gctgttgcac aatttaaaca tttgtaattt actgttgtta ttgcagtggc cacttgtagc 26580
agtggcagcg aggcactgac acttctacgt gaacgcaacg aggacggatc ctccgaccag 26640
ttcgacctcg tactgtcaga tgtttacatg ccgggtatgt cgtattcctt tgtaaacttt 26700
acaatatgcg tctagtttga cgcgtacact ttgtacactt tgcaaaaacg caccctgcga 26760
ggtctgccat ttggtcacta caacttggcc accttggttg caagtttgca agttcgctct 26820
acgtcaacgc tgcaaaatga accaattgtt ttgcactgac cctgccaacc ttcatttgtg 26880
gctgcagaca tggacggttt caagctgctt gaacacatcg gtctagagtt ggagcttccc 26940
gttatcagta agttgatcga gccgagtcca gagcgaagcc tgcttctata ctattagcag 27000
ctgtcttttg atatttgaca gcttgacttg atatggtcac agagcatact tgcaaccagg 27060
ttacctgttg aactagcaac tgtgcccaag catctcttca agcacctccg tcagtccata 27120
gggtactgtt gatttgtact ctgcaatact gcactgtaat gcgctgtgaa tcactgccct 27180
tcacctctag atggtgcttc cctggagccc tcccccacct ccgcctcaag cccctcacat 27240
gcctctcccc cccctgcagt gatgtcatcc aacggggaca cgaatgtcgt gctgcggggg 27300
gtcacccacg gggctgtgga ctttctgatc aagcccgttc gaattgagga gctgcggaac 27360
gtgtggcagc acgtggtgcg tcgtcgttcc atggcgctgg ccaggacgcc agacgagggg 27420
ggacactcgg acgaggactc tcaggtgccc ttggcagctt ctgggcggct tgctgtgtcg 27480
gatgccactt ggactgggga tgcacgaggg gtggggggac aatgggagat gggccatagt 27540
aggccagagt tgatggcagt ggtggtgggg gggagtaggc gggagagaag cagccatcct 27600
ggtgttggtt ttgatgattg agtgcatggg gatgatgcac aggtgagctg actggatgcc 27660
ttgtcttgct gtgctgcgct gcagcggcac agtgtgaaac gcaaggagtc ggagcagagc 27720
ccgctgcagc tcagcacaga gcagggcggg aacaagaagc caagagtggt gtggtcggtg 27780
gagatgcacc aacaggtgtg cttgcgggcg ggtgtatacg ggggaggggg gccagctgct 27840
ggctgacctg gcgtgcgcgg tgcattgcac ttggcgatga ggggcgtgct tcagtatgta 27900
gctgggacgc aattggttgt gctgtgtgac cagtgcacaa aatacatccc tgaattccag 27960
tgggttgaac agagttgtcc tggaggtggg aagcaaacgc gcacgtggta gaggggagca 28020
gggtgcagaa cagccgcagc aggggtgttg cgcagtgtgc aggtatcctg cctccatgcc 28080
ccgggccatg ggcatactac gctggtaccg tcaggatggg cgttgagcct ggcttggggg 28140
gcagggggcg agcgaatgcg gaatgggagc ggcaggtgct gggagggtgg ctgactggct 28200
tgcaggagcg caagtcctgt cgggggcgtc gtcctgttcc ctcctgcccg cttcacccac 28260
gttcactctc atgcctccac actcctgctg ctgacacacc tgtcgccacc tccgctgcag 28320
tttgtgaacg cggtcaactc cctgggcatt gacaaggcgg tgcccaagcg gattctggac 28380
ctgatgaacg tggaggggct gacgcgcgag aacgtggcca gccatctgca ggtgcctgcc 28440
atgacccgcg at 28452
<210> 87
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic primer
<400> 87
aggctactct cagacatgac ggtggctctg 30
<210> 88
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic primer
<400> 88
gccacaaatg aaggttggca gggtcagtgc 30
<210> 89
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic oligonucleotide
<400> 89
acaccacctt aaggcacatg agg 23
<210> 90
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic oligonucleotide
<400> 90
ggcgtgggac atggtgcgca agg 23
<210> 91
<211> 55
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic oligonucleotide
<400> 91
tgaagcaccc cccggcctct ccccccgcag ggccgcccct cccgcctcgt cgtgc 55
<210> 92
<211> 56
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic oligonucleotide
<400> 92
cgcaacgctc tccctcccca ccccccagcc tcacatccgc ctcaagcagc gccctg 56
<210> 93
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic primer
<400> 93
caagctatgc gaggaaggga gggtc 25
<210> 94
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic primer
<400> 94
ctgccgcaag tgagtgtgct gtc 23
<210> 95
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic primer
<400> 95
caccagatat aggtgacccg ataac 25
<210> 96
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic primer
<400> 96
aaaactccac tgcacctgca acat 24
<210> 97
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic oligonucleotide
<400> 97
tgcggtgaag cttggagctg 20
<210> 98
<211> 59
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic oligonucleotide
<400> 98
ttgccgtcga cgagacttcg gggcgcgcat ttatcgactc tcttgaagat acaccggtt 59
<210> 99
<211> 65
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic oligonucleotide
<400> 99
tccaattgta gatatcatat tgtttccgga cctaccttac gcactgagtg ctgccagatg 60
ttctt 65
<210> 100
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic primer
<400> 100
gaggtgggtg gtagtgcttc gcgaggtg 28
<210> 101
<211> 29
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic primer
<400> 101
atcacagctc acagggcaga cactgcgtc 29
<210> 102
<211> 7
<212> PRT
<213> Unknown
<220>
<223> Description of Unknown: 'SHAQKYF' class family protein sequence
<400> 102
Ser His Ala Gln Lys Tyr Phe
1 5
<210> 103
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic oligonucleotide
<400> 103
gggacatggt gcgcaaggac ggg 23
<210> 104
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic oligonucleotide
<400> 104
tgcggtgaag cttggagctg tgg 23
<210> 105
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Synthetic oligonucleotide
<400> 105
acaccacctt aaggcacatg agg 23
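The 23-nucleotide synthetic oligonucleotides in the listing above (SEQ ID NOs: 89, 90, 103, 104 and 105) each end in two G residues. That is consistent with a 20-nucleotide CRISPR target followed by an NGG protospacer-adjacent motif, but the listing itself does not say so, so treat that reading as an assumption. The Python sketch below, whose helper names (`clean`, `ends_with_ngg`, `length_mismatches`) are hypothetical, checks each declared <211> length against the <400> sequence and flags the entries that fit that pattern.

```python
# Illustrative consistency check for a few entries of the sequence listing.
# Mapping: SEQ ID NO -> (declared <211> length, <400> sequence as printed).
# The NGG/PAM interpretation is an assumption, not something the listing states.
ENTRIES = {
    89: (23, "acaccacctt aaggcacatg agg"),
    90: (23, "ggcgtgggac atggtgcgca agg"),
    97: (20, "tgcggtgaag cttggagctg"),
    103: (23, "gggacatggt gcgcaaggac ggg"),
    104: (23, "tgcggtgaag cttggagctg tgg"),
    105: (23, "acaccacctt aaggcacatg agg"),
}


def clean(seq: str) -> str:
    """Drop the spaces that split listing sequences into blocks of ten."""
    return "".join(seq.split()).lower()


def ends_with_ngg(seq: str) -> bool:
    """True for a 23-nt sequence whose last two bases are 'gg' (N-G-G suffix)."""
    s = clean(seq)
    return len(s) == 23 and s[-2:] == "gg"


def length_mismatches(entries: dict) -> list:
    """Return the SEQ ID NOs whose sequence length differs from <211>."""
    return [n for n, (declared, seq) in entries.items()
            if len(clean(seq)) != declared]


if __name__ == "__main__":
    print("length mismatches:", length_mismatches(ENTRIES))
    for n, (_, seq) in ENTRIES.items():
        label = "NGG-terminated 23-mer" if ends_with_ngg(seq) else "no NGG suffix"
        print(f"SEQ ID NO:{n}: {label}")
```

The same comparison shows that SEQ ID NO:105 repeats SEQ ID NO:89 and that SEQ ID NO:97 matches the first 20 nucleotides of SEQ ID NO:104.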

Claims (77)

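1. A mutant photosynthetic organism comprising a mutated or attenuated gene encoding chloroplastic signal recognition particle 54 (cpSRP54) and a mutated or attenuated significant growth improvement gene 2 (SGI2).
2. A mutant photosynthetic organism comprising a mutated or attenuated gene encoding chloroplastic signal recognition particle 54 (cpSRP54) and a mutated or attenuated significant growth improvement gene 1 (SGI1).
3. A mutant photosynthetic organism comprising a mutated or attenuated significant growth improvement gene 2 (SGI2).
4. A mutant photosynthetic organism comprising a mutated or attenuated gene encoding chloroplastic signal recognition particle 54 (cpSRP54), a mutated or attenuated significant growth improvement gene 1 (SGI1), and a mutated or attenuated significant growth improvement gene 2 (SGI2).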
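5. The mutant photosynthetic organism of any one of claims 1 to 4, wherein, relative to a control photosynthetic organism of the same species, the mutant exhibits reduced chlorophyll under low light conditions and a higher maximum quantum yield of photochemistry in photosystem II (Fv/Fm) at all physiologically relevant irradiances above 100 μE m⁻² s⁻¹.
6. The mutant photosynthetic organism of claim 5, wherein the mutant photosynthetic organism exhibits a chlorophyll reduction of at least 20% relative to a control photosynthetic organism of the same species.
7. The mutant photosynthetic organism of claim 6, wherein the chlorophyll reduction is at least 30% relative to a control photosynthetic organism of the same species.
8. The mutant photosynthetic organism of claim 7, wherein the chlorophyll reduction is at least 40% relative to a control photosynthetic organism of the same species.
9. The mutant photosynthetic organism of claim 7, wherein the chlorophyll reduction is at least 50% relative to a control photosynthetic organism of the same species.
10. The mutant photosynthetic organism of claim 9, wherein the chlorophyll reduction is at least 60% relative to a control photosynthetic organism of the same species.
11. The mutant photosynthetic organism of claim 10, wherein the chlorophyll reduction is at least 70% relative to a control photosynthetic organism of the same species.
12. The mutant photosynthetic organism of any one of claims 1 to 4, wherein, relative to a control photosynthetic organism of the same species, the mutant exhibits lower non-photochemical quenching (NPQ) at all physiologically relevant irradiances above 100 μE m⁻² s⁻¹.
13. The mutant photosynthetic organism of claim 12, wherein the mutant exhibits lower NPQ than a control photosynthetic organism of the same species at all physiological irradiances above 250 μE m⁻² s⁻¹.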
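14. The mutant photosynthetic organism of any one of claims 1 to 4, wherein the mutant exhibits a higher rate of carbon fixation on a per-chlorophyll basis than a control photosynthetic organism of the same species.
15. The mutant photosynthetic organism of claim 14, wherein the rate of carbon fixation is at least 50% higher than that of a control photosynthetic organism of the same species.
16. The mutant photosynthetic organism of claim 15, wherein the rate of carbon fixation is at least 100% higher than that of a control photosynthetic organism of the same species.
17. The mutant photosynthetic organism of any one of claims 1 to 4, wherein the rate of oxygen evolution is at least 100% higher than that of a control photosynthetic organism of the same species.
18. The mutant photosynthetic organism of claim 17, wherein the rate of oxygen evolution is at least 200% higher than that of a control photosynthetic organism of the same species.
19. The mutant photosynthetic organism of any one of claims 1 to 4, wherein a culture of the mutant exhibits greater biomass productivity than a culture of a control photosynthetic organism of the same species.
20. The mutant photosynthetic organism of claim 19, wherein the mutant exhibits greater biomass productivity in photoautotrophic culture.
21. The mutant photosynthetic organism of claim 20, wherein the mutant exhibits greater biomass activity under continuous light conditions.
22. The mutant photosynthetic organism of claim 20, wherein the mutant exhibits greater biomass activity under diel cycle conditions.
23. The mutant photosynthetic organism of claim 20, wherein the mutant exhibits greater biomass activity under diel cycle conditions in which the light profile mimics a natural daylight profile.
24. The mutant photosynthetic organism of any one of claims 1 to 4, wherein the mutant has been generated by UV irradiation, gamma irradiation, or chemical mutagenesis.
25. The mutant photosynthetic organism of any one of claims 1 to 4, wherein the mutant is a genetically engineered mutant.
26. The mutant photosynthetic organism of claim 25, wherein the mutant has been genetically engineered by insertional mutagenesis, gene replacement, RNAi, antisense RNA, meganuclease genome engineering, one or more ribozymes, and/or a CRISPR/Cas system.
27. The mutant photosynthetic organism of claim 26, wherein the mutant has been genetically engineered by a CRISPR/Cas system.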
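28. The mutant photosynthetic organism of any one of claims 1 to 2, wherein, prior to mutation or attenuation of the gene, the cpSRP54 comprises an amino acid sequence having at least 65% identity to an amino acid sequence selected from the group consisting of: SEQ ID NO:68, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, and SEQ ID NO:85.
29. The mutant photosynthetic organism of claim 26, wherein, prior to mutation or attenuation of the gene, the cpSRP54 gene has at least 50% identity to the nucleic acid sequence of SEQ ID NO:8.
30. The mutant photosynthetic organism of claim 28, wherein, prior to mutation or attenuation of the gene, the cpSRP54 has at least 65% identity to an amino acid sequence selected from the group consisting of: SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, and SEQ ID NO:14.
31. The mutant photosynthetic organism of any one of claims 1 to 2, wherein, prior to mutation or attenuation of the gene, the SGI1 polypeptide has at least 50% identity to an amino acid sequence selected from the group consisting of: SEQ ID NO:3, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, and SEQ ID NO:39.
32. The mutant photosynthetic organism of any one of claims 1 to 2, wherein the gene encoding the SRP54 protein comprises a mutation that occurs outside the sequence encoding the first 169 amino acids of the cpSRP54 GTPase domain.
33. The mutant photosynthetic organism of claim 32, wherein the mutation in the gene encoding the SRP54 protein occurs outside the sequence encoding the cpSRP54 GTPase domain.
34. The mutant photosynthetic organism of claim 33, wherein the gene encoding the SRP54 protein does not contain a gene-disrupting mutation in the cpSRP54 GTPase domain.
35. The mutant photosynthetic organism of claim 1, 3, or 4, wherein, prior to mutation or attenuation of the gene, the SGI2 gene comprises a nucleic acid sequence encoding an amino acid sequence having at least 65% identity to an amino acid sequence selected from the group consisting of: SEQ ID NO:5, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, and SEQ ID NO:56.
36. The mutant photosynthetic organism of claim 35, wherein, prior to mutation or attenuation of the gene, the SGI2 gene comprises a nucleic acid sequence having at least 80% identity to a nucleic acid sequence selected from the group consisting of: SEQ ID NO:7, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, and SEQ ID NO:66.
37. The mutant photosynthetic organism of claim 35, wherein, prior to mutation or attenuation of the gene, the SGI2 gene comprises a nucleic acid sequence encoding an amino acid sequence having at least 80% identity to an amino acid sequence selected from the group consisting of: SEQ ID NO:5, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, and SEQ ID NO:56.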
38. The mutant photosynthetic organism of any one of claims 1 to 4, wherein the photosynthetic organism is an alga, and wherein the mutant alga belongs to a genus selected from the group consisting of: Achnanthes, Amphiprora, Amphora, Ankistrodesmus, Asteromonas, Boekelovia, Bolidomonas, Borodinella, Botrydium, Botryococcus, Bracteococcus, Chaetoceros, Carteria, Chlamydomonas, Chlorococcum, Chlorogonium, Chlorella, Chroomonas, Chrysosphaera, Cricosphaera, Crypthecodinium, Cryptomonas, Cyclotella, Dunaliella, Ellipsoidion, Emiliania, Eremosphaera, Ernodesmius, Euglena, Eustigmatos, Franceia, Fragilaria, Gloeothamnion, Haematococcus, Halocafeteria, Heterosigma, Hymenomonas, Isochrysis, Lepocinclis, Micractinium, Monodus, Monoraphidium, Nannochloris, Nannochloropsis, Navicula, Neochloris, Nephrochloris, Nephroselmis, Nitzschia, Ochromonas, Oedogonium, Oocystis, Ostreococcus, Pavlova, Parachlorella, Pascheria, Pelagomonas, Phaeodactylum, Phagus, Picochlorum, Platymonas, Pleurochrysis, Pleurococcus, Prototheca, Pseudochlorella, Pseudoneochloris, Pseudostaurastrum, Pyramimonas, Pyrobotrys, Scenedesmus, Skeletonema, Spyrogyra, Stichococcus, Tetraselmis, Thalassiosira, Tribonema, Vaucheria, Viridiella, Vischeria, and Volvox.
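39. The mutant photosynthetic organism of any one of claims 1 to 4, wherein the photosynthetic organism is an alga, and wherein the mutant alga is selected from the group consisting of chlorophyte, bacillariophyte, prasinophyte, glaucophyte, haptophyte, chlorarachniophyte, euglenophyte, chromophyte, and dinoflagellate mutants.
40. The mutant photosynthetic organism of any one of claims 1 to 4, wherein the photosynthetic organism is an alga, and wherein the algal mutant is a chlorophyte.
41. The algal mutant of claim 37, wherein the mutant belongs to a genus selected from the group consisting of: Botryococcus, Bracteococcus, Carteria, Chlamydomonas, Chlorococcum, Chlorogonium, Chlorella, Chroomonas, Chrysosphaera, Cricosphaera, Crypthecodinium, Cryptomonas, Dunaliella, Emiliania, Eremosphaera, Ernodesmius, Franceia, Gloeothamnion, Haematococcus, Heterosigma, Hymenomonas, Isochrysis, Lepocinclis, Micractinium, Monoraphidium, Nannochloris, Neochloris, Nephrochloris, Nephroselmis, Ochromonas, Oedogonium, Oocystis, Ostreococcus, Parachlorella, Pascheria, Pelagomonas, Phagus, Picochlorum, Platymonas, Pleurochrysis, Pleurococcus, Prototheca, Pseudochlorella, Pseudoneochloris, Pseudostaurastrum, Pyramimonas, Pyrobotrys, Scenedesmus, Skeletonema, Spyrogyra, Stichococcus, Tetraselmis, Tribonema, Viridiella, and Volvox.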
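42. A biomass comprising the mutant photosynthetic organism of any one of claims 1 to 4.
43. The biomass of claim 42, wherein the photosynthetic organism is an alga.
44. A method of producing a bioproduct, the method comprising culturing the mutant photosynthetic organism of any one of claims 1 to 4 and isolating at least one product from the culture.
45. The method of claim 44, wherein the photosynthetic organism is an alga, and wherein the bioproduct is algal biomass.
46. The method of claim 44, wherein the bioproduct is a lipid, a protein, a peptide, one or more amino acids, an amino acid, one or more nucleotides, a vitamin, a cofactor, a hormone, an antioxidant, or a pigment or colorant.
47. The method of claim 46, wherein the bioproduct is a lipid.
48. The method of claim 47, wherein the mutant photosynthetic organism is engineered to comprise at least one exogenous gene encoding a polypeptide that participates in production of the lipid.
49. The method of claim 44, wherein the mutant photosynthetic organism is cultured phototrophically.
50. The method of claim 49, wherein the mutant photosynthetic organism is an alga, and wherein the alga is cultured in a pond or raceway.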
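51. A mutant photosynthetic organism having a mutated or attenuated gene encoding cytosolic signal recognition particle 54 (cytoSRP54) and a mutated or attenuated significant growth improvement gene 2 (SGI2).
52. A mutant photosynthetic organism having a mutated or attenuated gene encoding cytosolic signal recognition particle 54 (cytoSRP54) and a mutated or attenuated significant growth improvement gene 1 (SGI1).
53. A mutant photosynthetic organism having a mutated or attenuated gene encoding cytosolic signal recognition particle 54 (cytoSRP54), a mutated or attenuated significant growth improvement gene 1 (SGI1), and a mutated or attenuated significant growth improvement gene 2 (SGI2).
54. The mutant photosynthetic organism of any one of claims 1 to 53, wherein a culture of the mutant photosynthetic organism exhibits greater lipid productivity than a culture of a control photosynthetic organism of the same species.
55. The mutant photosynthetic organism of any one of claims 51 to 53, wherein the mutant exhibits greater lipid productivity in photoautotrophic culture.
56. The mutant photosynthetic organism of claim 55, wherein the mutant photosynthetic organism is an alga, and wherein the mutant alga exhibits greater biomass activity under diel cycle conditions.
57. The mutant alga of claim 56, wherein the mutant alga exhibits greater biomass activity under diel cycle conditions in which the light profile mimics a natural daylight profile.
58. The mutant photosynthetic organism of any one of claims 51 to 53, wherein the mutant photosynthetic organism has been generated by UV irradiation, gamma irradiation, or chemical mutagenesis.
59. The mutant photosynthetic organism of any one of claims 51 to 53, wherein the mutant photosynthetic organism is a genetically engineered mutant.
60. The mutant photosynthetic organism of claim 58, wherein the mutant photosynthetic organism has been genetically engineered by insertional mutagenesis, gene replacement, RNAi, antisense RNA, meganuclease genome engineering, one or more ribozymes, and/or a CRISPR/Cas system.
61. The mutant photosynthetic organism of claim 59, wherein the mutant has been genetically engineered by a CRISPR/Cas system.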
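62. The mutant photosynthetic organism of any one of claims 51 to 60, wherein the mutant photosynthetic organism is an alga, and wherein the mutant alga belongs to a genus selected from the group consisting of: Achnanthes, Amphiprora, Amphora, Ankistrodesmus, Asteromonas, Boekelovia, Bolidomonas, Borodinella, Botrydium, Botryococcus, Bracteococcus, Chaetoceros, Carteria, Chlamydomonas, Chlorococcum, Chlorogonium, Chlorella, Chroomonas, Chrysosphaera, Cricosphaera, Crypthecodinium, Cryptomonas, Cyclotella, Dunaliella, Ellipsoidion, Emiliania, Eremosphaera, Ernodesmius, Euglena, Eustigmatos, Franceia, Fragilaria, Gloeothamnion, Haematococcus, Halocafeteria, Heterosigma, Hymenomonas, Isochrysis, Lepocinclis, Micractinium, Monodus, Monoraphidium, Nannochloris, Nannochloropsis, Navicula, Neochloris, Nephrochloris, Nephroselmis, Nitzschia, Ochromonas, Oedogonium, Oocystis, Ostreococcus, Pavlova, Parachlorella, Pascheria, Pelagomonas, Phaeodactylum, Phagus, Picochlorum, Platymonas, Pleurochrysis, Pleurococcus, Prototheca, Pseudochlorella, Pseudoneochloris, Pseudostaurastrum, Pyramimonas, Pyrobotrys, Scenedesmus, Skeletonema, Spyrogyra, Stichococcus, Tetraselmis, Thalassiosira, Tribonema, Vaucheria, Viridiella, Vischeria, and Volvox.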
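63. The mutant photosynthetic organism of any one of claims 51 to 60, wherein the mutant photosynthetic organism is an alga, and wherein the mutant alga is selected from the group consisting of bacillariophyte, eustigmatophyte, and chromophyte mutants.
64. The mutant alga of claim 63, wherein the mutant is a eustigmatophyte.
65. The mutant alga of claim 64, wherein the mutant alga belongs to a genus selected from the group consisting of: Ellipsoidion, Eustigmatos, Vischeria, Monodus, Nannochloropsis, and Pseudostaurastrum.
66. A method of producing a lipid, the method comprising culturing the algal mutant of any one of claims 1 to 65 and isolating at least one lipid from the culture.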
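67. A method of increasing the biomass of a photosynthetic organism, the method comprising modulating chloroplastic signal recognition particle 54 (cpSRP54) and significant growth improvement gene 2 (SGI2).
68. A method of increasing the biomass of a photosynthetic organism, the method comprising modulating the genes chloroplastic signal recognition particle 54 (cpSRP54) and significant growth improvement gene 1 (SGI1).
69. A method of increasing the biomass of a photosynthetic organism, the method comprising modulating the genes chloroplastic signal recognition particle 54 (cpSRP54), significant growth improvement gene 1 (SGI1), and significant growth improvement gene 2 (SGI2).
70. The method of claim 67, wherein modulating the genes comprises base substitution mutation, insertional mutagenesis, gene replacement, RNAi, antisense RNA, meganuclease genome engineering, one or more ribozymes, and/or a CRISPR/Cas system in the cpSRP54 gene and the SGI2 gene.
71. The method of claim 68, wherein modulating the genes comprises base substitution mutation, insertional mutagenesis, gene replacement, RNAi, antisense RNA, meganuclease genome engineering, one or more ribozymes, and/or a CRISPR/Cas system in the cpSRP54 gene and the SGI1 gene.
72. The method of claim 69, wherein modulating the genes comprises base substitution mutation, insertional mutagenesis, gene replacement, RNAi, antisense RNA, meganuclease genome engineering, one or more ribozymes, and/or a CRISPR/Cas system in the cpSRP54 gene, the SGI1 gene, and the SGI2 gene.
73. The method of any one of claims 67 to 72, wherein increasing the biomass of the photosynthetic organism comprises an increase in total organic carbon.
74. The method of any one of claims 67 to 72, wherein increasing the biomass of the photosynthetic organism comprises an increase in total lipid content.
75. The method of any one of claims 67 to 72, wherein increasing the biomass of the photosynthetic organism comprises an increase in total nitrogen content.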
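76. The method of any one of claims 67 to 75, wherein the mutant photosynthetic organism is an alga, and wherein the mutant alga belongs to a genus selected from the group consisting of: Achnanthes, Amphiprora, Amphora, Ankistrodesmus, Asteromonas, Boekelovia, Bolidomonas, Borodinella, Botrydium, Botryococcus, Bracteococcus, Chaetoceros, Carteria, Chlamydomonas, Chlorococcum, Chlorogonium, Chlorella, Chroomonas, Chrysosphaera, Cricosphaera, Crypthecodinium, Cryptomonas, Cyclotella, Dunaliella, Ellipsoidion, Emiliania, Eremosphaera, Ernodesmius, Euglena, Eustigmatos, Franceia, Fragilaria, Gloeothamnion, Haematococcus, Halocafeteria, Heterosigma, Hymenomonas, Isochrysis, Lepocinclis, Micractinium, Monodus, Monoraphidium, Nannochloris, Nannochloropsis, Navicula, Neochloris, Nephrochloris, Nephroselmis, Nitzschia, Ochromonas, Oedogonium, Oocystis, Ostreococcus, Pavlova, Parachlorella, Pascheria, Pelagomonas, Phaeodactylum, Phagus, Picochlorum, Platymonas, Pleurochrysis, Pleurococcus, Prototheca, Pseudochlorella, Pseudoneochloris, Pseudostaurastrum, Pyramimonas, Pyrobotrys, Scenedesmus, Skeletonema, Spyrogyra, Stichococcus, Tetraselmis, Thalassiosira, Tribonema, Vaucheria, Viridiella, Vischeria, and Volvox.
77. The method of any one of claims 67 to 76, wherein the mutant photosynthetic organism is a plant.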
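Claims 5 to 11 compare the mutant with a control of the same species using two quantities: the percentage reduction in chlorophyll and the maximum quantum yield of photosystem II (Fv/Fm). The arithmetic can be sketched in Python as follows. The sketch assumes the conventional definition Fv/Fm = (Fm − F0)/Fm, which the claims do not spell out. The function names and any input values are hypothetical and are not measurements from the application.

```python
# Minimal sketch of the comparisons recited in claims 5 to 11.
# Assumption: Fv/Fm = (Fm - F0) / Fm (conventional definition, not stated in the claims).
# All names here are illustrative only.

# Chlorophyll-reduction thresholds (percent) recited in claims 6 to 11.
CLAIMED_THRESHOLDS = (20, 30, 40, 50, 60, 70)


def fv_fm(f0: float, fm: float) -> float:
    """Maximum PSII quantum yield from minimal (F0) and maximal (Fm) fluorescence."""
    if fm <= 0:
        raise ValueError("Fm must be positive")
    return (fm - f0) / fm


def percent_reduction(control: float, mutant: float) -> float:
    """Percentage by which the mutant value is below the control value."""
    if control <= 0:
        raise ValueError("control value must be positive")
    return 100.0 * (control - mutant) / control


def thresholds_met(reduction_pct: float) -> list:
    """Claimed 'at least X%' thresholds satisfied by a given reduction."""
    return [t for t in CLAIMED_THRESHOLDS if reduction_pct >= t]


if __name__ == "__main__":
    # Hypothetical chlorophyll values in arbitrary units.
    reduction = percent_reduction(control=10.0, mutant=4.0)
    print(reduction, thresholds_met(reduction))
```

Because each threshold is an "at least" limit, a single measured reduction can satisfy several of the dependent claims at once, which is why `thresholds_met` returns a list.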
CN201880084364.9A 2017-12-29 2018-12-27 Genetic modulation of photosynthetic organisms for improved growth Pending CN111527101A (zh)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201762612251P 2017-12-29 2017-12-29
US62/612,251 2017-12-29
US201862690205P 2018-06-26 2018-06-26
US62/690,205 2018-06-26
PCT/US2018/067712 WO2019133726A1 (en) 2017-12-29 2018-12-27 Genetic modulation of photosynthetic organisms for improved growth

Publications (1)

Publication Number Publication Date
CN111527101A true CN111527101A (zh) 2020-08-11

Family

ID=67059378

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201880084364.9A Pending CN111527101A (zh) 2017-12-29 2018-12-27 Genetic modulation of photosynthetic organisms for improved growth

Country Status (9)

Country Link
US (1) US11193132B2 (zh)
EP (1) EP3732189A4 (zh)
JP (1) JP7295864B2 (zh)
CN (1) CN111527101A (zh)
AU (1) AU2018395264B2 (zh)
BR (1) BR112020010329A2 (zh)
CA (1) CA3084828A1 (zh)
MX (1) MX2020006904A (zh)
WO (1) WO2019133726A1 (zh)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA3161930A1 (en) * 2019-12-17 2021-06-24 Eric R. Moellering Recombinant algae having high lipid productivity
EP4211143A1 (en) 2020-09-11 2023-07-19 Viridos, Inc. Recombinant algae having high lipid productivity
US20220348946A1 (en) * 2021-04-23 2022-11-03 Viridos, Inc. Chlorophyte algae having improved productivity
CN114989274B (zh) * 2022-05-30 2023-01-31 Ningbo University Phaeodactylum tricornutum Myb-like transcription factor PtMYB3 gene, its encoded protein, and uses thereof

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010020555A1 (en) * 2008-08-20 2010-02-25 Basf Plant Science Gmbh Plants having enhanced yield-related traits and a method for making the same
WO2011049243A1 (ja) * 2009-10-23 2011-04-28 RIKEN Transformed plant with increased biomass and improved environmental stress tolerance, and method for producing the same
WO2016168756A1 (en) * 2015-04-15 2016-10-20 Synthetic Genomics, Inc. Algal chloroplastic srp54 mutants

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4987071A (en) 1986-12-03 1991-01-22 University Patents, Inc. RNA ribozyme polymerases, dephosphorylases, restriction endoribonucleases and methods
US20130167263A1 (en) 2007-10-30 2013-06-27 Monsanto Technology Llc Nucleic acid molecules and other molecules associated with plants and uses thereof
WO2002099079A2 (en) * 2001-06-06 2002-12-12 The General Hospital Corporation Cytokinin response regulators and uses thereof
WO2005054439A2 (en) 2003-12-01 2005-06-16 North Carolina State University Small interfering rna (sirna)-mediated heritable gene manipulation in plants
EP2121935B1 (en) 2006-09-25 2011-07-13 Freie Universität Berlin Transcriptional repressors of cytokinin signaling and their use
AU2009255947B2 (en) 2008-06-06 2014-12-18 Aurora Algae, Inc. Vcp-based vectors for algal cell transformation
CN103635571B (zh) 2011-06-30 2016-12-14 艾克森美孚研究与工程公司 调控毒素和抗毒素基因用于生物防范
WO2013016267A2 (en) 2011-07-22 2013-01-31 Donald Danforth Plant Science Center Plants and algae capable of modulating antenna size based on light intensity
US9243207B2 (en) 2012-02-29 2016-01-26 Exxonmobil Research And Engineering Company Solvent extraction of products from algae
DK2800811T3 (en) 2012-05-25 2017-07-17 Univ Vienna METHODS AND COMPOSITIONS FOR RNA DIRECTIVE TARGET DNA MODIFICATION AND FOR RNA DIRECTIVE MODULATION OF TRANSCRIPTION
US10612034B2 (en) 2012-06-01 2020-04-07 Exxonmobil Research And Engineering Company Promoters and terminators for use in eukaryotic cells
US9546372B2 (en) 2012-06-12 2017-01-17 Synthetic Genomics, Inc. Regulatory elements and uses thereof
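JP2016013057A (ja) 2012-10-31 2016-01-28 Japan Tobacco Inc. Nucleic acid imparting high yield to plants, method for producing transformed plants with increased yield, and method for increasing plant yield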
IN2015DN04148A (zh) 2012-11-16 2015-10-16 Total Marketing Services
US8883993B2 (en) 2012-12-04 2014-11-11 Exxonmobil Research And Engineering Company Tetraselmis promoters and terminators for use in eukaryotic cells
BR112015013013A2 (pt) 2012-12-06 2017-09-12 Synthetic Genomics Inc Algal mutants having a locked-in high light acclimated phenotype
JP2016519579A (ja) 2013-03-15 2016-07-07 LARSSON, Marcus Kare, Torleif Cells, and methods and apparatus for amniotic fluid collection and cell isolation
EP3013939A1 (en) 2013-06-25 2016-05-04 Cellectis Modified diatoms for biofuel production
AU2015373893B2 (en) 2014-12-31 2021-07-29 Synthetic Genomics, Inc. Compositions and methods for high efficiency in vivo genome editing
US10227619B2 (en) 2015-07-14 2019-03-12 Synthetic Genomics, Inc. Microorganisms having increased lipid productivity
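JP6589605B2 (ja) * 2015-12-01 2019-10-16 DENSO Corporation Green alga mutant exhibiting resistance to intense light, and use thereof
JP7057786B2 (ja) * 2016-12-30 2022-04-20 Synthetic Genomics, Inc. High-productivity algal mutants having reduced photosynthetic antenna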

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
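CN112724215A (zh) * 2021-01-22 2021-04-30 Huazhong Agricultural University Gene and method for altering maize flowering time
CN114350678A (zh) * 2022-01-12 2022-04-15 China National Rice Research Institute Use of the gene OsLUX in promoting rice heading and improving plant disease resistance
CN114350678B (zh) * 2022-01-12 2023-11-21 China National Rice Research Institute Use of the gene OsLUX in promoting rice heading and improving plant disease resistance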

Also Published As

Publication number Publication date
US11193132B2 (en) 2021-12-07
EP3732189A4 (en) 2022-01-19
AU2018395264B2 (en) 2023-06-22
AU2018395264A1 (en) 2020-07-30
WO2019133726A1 (en) 2019-07-04
MX2020006904A (es) 2020-09-09
US20190203221A1 (en) 2019-07-04
CA3084828A1 (en) 2019-07-04
JP2021509018A (ja) 2021-03-18
JP7295864B2 (ja) 2023-06-21
BR112020010329A2 (pt) 2020-11-17
EP3732189A1 (en) 2020-11-04

Similar Documents

Publication Publication Date Title
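CN104837995B (zh) Algal mutants having a locked-in high light acclimated phenotype
CN111527101A (zh) Genetic modulation of photosynthetic organisms for improved growth
JP7057786B2 (ja) High-productivity algal mutants having reduced photosynthetic antenna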
AU2016349822B2 (en) Algal mutants with increased lipid productivity
US20200157558A1 (en) Algal chloroplastic srp54 mutants
EP3344776B1 (en) Recombinant algal microorganism with increased productivity
EP3322799B1 (en) Microorganisms having increased lipid productivity
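KR101627477B1 (ko) Algae with improved productivity of photosynthetic products, and use thereof
CN112384619A (zh) Microorganisms having increased lipid production, compositions, and methods of making and using the same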
AU2018378833A1 (en) Improving algal lipid productivity via genetic modification of a TPR domain containing protein
CN112961868A (zh) Biomass productivity regulator
CN111433219A (zh) Improving algal lipid productivity via genetic modification of a signaling protein
Lunde Expression of the bacterial ectA gene in the chloroplast of Chlamydomonas reinhardtii
CN116003563A (zh) Use of calmodulin-binding protein CaMBP13 in regulating plant cold tolerance

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination