CN114423862A - Production of oligosaccharides

Publication number: CN114423862A
Authority: CN (China)
Application number: CN202080065823.6A
Other languages: English (en)
Inventors: S. Agarwala (S·阿加瓦拉); M. G. Napolitano (M·G·纳波利塔诺)
Current assignee: Ginkgo Bioworks Inc
Original assignee: Ginkgo Bioworks Inc
Legal status: Pending (the legal status is an assumption and is not a legal conclusion)

Classifications

    • C12N9/1051 Hexosyltransferases (2.4.1)
    • C12N15/81 Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
    • C12N9/1055 Levansucrase (2.4.1.10)
    • C12P19/00 Preparation of compounds containing saccharide radicals
    • C12P19/04 Polysaccharides, i.e. compounds containing more than five saccharide radicals attached to each other by glycosidic bonds
    • C12P19/18 Preparation of compounds containing saccharide radicals produced by the action of a glycosyl transferase, e.g. alpha-, beta- or gamma-cyclodextrins
    • C12Y204/01099 Sucrose:sucrose fructosyltransferase (2.4.1.99)
    • C12Y204/011 2,1-Fructan:2,1-fructan 1-fructosyltransferase (2.4.1.100)
    • C12Y204/0101 Levansucrase (2.4.1.10)

Landscapes

  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Genetics & Genomics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Medicinal Chemistry (AREA)
  • Mycology (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Enzymes And Modification Thereof (AREA)

Abstract

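The present disclosure relates to methods and compositions for producing fructans using sucrose:sucrose 1-fructosyltransferase (1-SST), fructan:fructan 1-fructosyltransferase (1-FFT), and/or sucrose:fructan 6-fructosyltransferase (6-SFT).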

Description

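Production of oligosaccharides
Cross-reference to related applications
This application claims the benefit under 35 U.S.C. § 119(e) of U.S. Provisional Application Serial No. 62/905,246, filed September 24, 2019, entitled "Production of Oligosaccharides", the disclosure of which is incorporated herein by reference in its entirety.
Reference to a sequence listing submitted as a text file via EFS-Web
This application contains a sequence listing that has been submitted in ASCII format via EFS-Web and is hereby incorporated by reference in its entirety. The ASCII copy, created on September 23, 2020, is named G091970034WO00-SEQ-FL and is 276 kilobytes in size.
Field of the invention
The present disclosure relates to enzymes, nucleic acids, and cells useful for the conversion of sucrose to fructans.
Background
Polyfructans are oligosaccharides comprising fructose monomers. These oligosaccharides often also comprise glucose. Polyfructans have a variety of uses, including as prebiotics, fat substitutes, sugar substitutes, texture modifiers, and in industrial processes. Polyfructans may comprise β(2,6) linkages and/or β(2,1) linkages, where the type of polyfructan depends on the linkage position of the fructose residues. For example, graminan is a complex mixture of branched polyfructan oligosaccharides with β(2,1)-linked D-fructosyl backbones and β(2,6)-linked D-fructosyl side chains of varying degrees of polymerization. Three different classes of enzymes can be used to produce polyfructans: sucrose:sucrose 1-fructosyltransferase (1-SST), which generates branched polyfructans by introducing β(2,1) linkages into saccharides; fructan:fructan 1-fructosyltransferase (1-FFT), which promotes the polymerization of fructose monomers onto sugars by forming β(2,1) linkages; and sucrose:fructan 6-fructosyltransferase (6-SFT), which catalyzes the addition of fructose monomers via β(2,6) linkages to produce polyfructans.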
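Summary
The present disclosure relates, at least in part, to the generation of engineered cells containing enzymes for producing polyfructan oligosaccharides, for example by converting sucrose to polyfructans. These engineered cells are useful for producing complex branched polyfructans.
Aspects of the disclosure relate to host cells comprising one or more heterologous polynucleotides encoding: a sucrose:sucrose 1-fructosyltransferase (1-SST); a fructan:fructan 1-fructosyltransferase (1-FFT); and a sucrose:fructan 6-fructosyltransferase (6-SFT).
In some embodiments, the 1-SST enzyme comprises an amino acid sequence that is at least 90% identical to SEQ ID NO:1 or SEQ ID NO:24.
In some embodiments, the 1-FFT enzyme comprises an amino acid sequence that is at least 90% identical to SEQ ID NO:7 or SEQ ID NO:31.
In some embodiments, the 6-SFT enzyme comprises an amino acid sequence that is at least 90% identical to SEQ ID NO:13 or SEQ ID NO:38.
In some embodiments, the host cell comprises one or more heterologous polynucleotides encoding two or more of the 1-SST enzyme, the 1-FFT enzyme, and the 6-SFT enzyme.
In some embodiments, the host cell comprises one or more heterologous polynucleotides encoding the 1-SST enzyme, the 1-FFT enzyme, and the 6-SFT enzyme.
In some embodiments, at least two of the 1-SST enzyme, the 1-FFT enzyme, and the 6-SFT enzyme are expressed from the same heterologous polynucleotide.
In some embodiments, the host cell is a plant cell, an algal cell, a yeast cell, a bacterial cell, or an animal cell.
In some embodiments, the yeast cell is a Saccharomyces cell, a Yarrowia cell, or a Pichia cell. In some embodiments, the host cell is a Pichia pastoris cell.
In some embodiments, the 1-SST enzyme comprises the amino acid sequence of SEQ ID NO:1 or SEQ ID NO:24.
In some embodiments, the 1-FFT enzyme comprises the amino acid sequence of SEQ ID NO:7 or SEQ ID NO:31.
In some embodiments, the 6-SFT enzyme comprises the amino acid sequence of SEQ ID NO:13 or SEQ ID NO:38.
In some embodiments, one or more of the 1-SST enzyme, the 1-FFT enzyme, and the 6-SFT enzyme is secreted by the host cell.
Further aspects of the disclosure provide methods comprising culturing any of the host cells disclosed in this application.
In some embodiments, the method further comprises purifying one or more of the 1-SST enzyme, the 1-FFT enzyme, and the 6-SFT enzyme from the host cell.
Further aspects of the disclosure provide methods of producing a fructan. In some embodiments, the method comprises contacting sucrose with one or more of: (a) a 1-SST enzyme comprising an amino acid sequence that is at least 90% identical to SEQ ID NO:1 or SEQ ID NO:24; (b) a 1-FFT enzyme comprising an amino acid sequence that is at least 90% identical to SEQ ID NO:7 or SEQ ID NO:31; (c) a 6-SFT enzyme comprising an amino acid sequence that is at least 90% identical to SEQ ID NO:13 or SEQ ID NO:38.
In some embodiments, sucrose is contacted with two or more of the 1-SST enzyme, the 1-FFT enzyme, and the 6-SFT enzyme.
In some embodiments, sucrose is contacted with the 1-SST enzyme, the 1-FFT enzyme, and the 6-SFT enzyme.
In some embodiments, the fructan comprises β(2,1) linkages, β(2,6) linkages, or a combination thereof.
In some embodiments, the fructan is kestose, inulin, and/or graminan.
In some embodiments, the fructan has a degree of polymerization of at least 3.
In some embodiments, the method further comprises purifying the fructan.
In some embodiments, the 1-SST enzyme, the 1-FFT enzyme, and/or the 6-SFT enzyme is secreted by one or more host cells.
In some embodiments, the one or more host cells are cultured in a medium containing sucrose, wherein the sucrose is contacted with the 1-SST enzyme, the 1-FFT enzyme, and/or the 6-SFT enzyme in the medium.
In some embodiments, the fructan is purified from the medium.
In some embodiments, the 1-SST enzyme, the 1-FFT enzyme, and/or the 6-SFT enzyme is a purified enzyme.
In some embodiments, the kestose is 6-kestose.
In some embodiments, the kestose is 1-kestose.
In some embodiments, the fructan comprises levan.
Aspects of the disclosure provide methods of producing a fructan, the methods comprising (a) contacting sucrose with a 1-SST enzyme to produce kestose; and (b) contacting the kestose with a 1-FFT enzyme and/or a 6-SFT enzyme to produce the fructan.
In some embodiments, the kestose produced in (a) is purified, and the purified kestose is contacted with the 1-FFT enzyme and/or the 6-SFT enzyme in (b).
In some embodiments, the method further comprises purifying the fructan produced in (b).
In some embodiments, the 1-SST enzyme, the 1-FFT enzyme, and/or the 6-SFT enzyme is secreted by one or more host cells. In some embodiments, the one or more host cells are cultured in a medium containing sucrose, wherein the sucrose is contacted with the 1-SST enzyme in the medium. In some embodiments, the 1-SST enzyme, the 1-FFT enzyme, and/or the 6-SFT enzyme is a purified enzyme. In some embodiments, the fructan produced in (b) is inulin. In some embodiments, the fructan produced in (b) is branched inulin. In some embodiments, the fructan produced in (b) is graminan.
Aspects of the disclosure provide host cells comprising one or more heterologous polynucleotides encoding one or more of: (a) a 1-SST enzyme comprising an amino acid sequence that is at least 90% identical to a sequence selected from SEQ ID NO:1-4 and SEQ ID NO:24-28; (b) a 1-FFT enzyme comprising an amino acid sequence that is at least 90% identical to a sequence selected from SEQ ID NO:7-10 and SEQ ID NO:31-35; and (c) a 6-SFT enzyme comprising an amino acid sequence that is at least 90% identical to a sequence selected from SEQ ID NO:13-21 and SEQ ID NO:38-52.
Aspects of the disclosure provide methods of producing a fructan, the methods comprising contacting sucrose with one or more of: (a) a 1-SST enzyme comprising an amino acid sequence that is at least 90% identical to a sequence selected from SEQ ID NO:1-4 and SEQ ID NO:24-28; (b) a 1-FFT enzyme comprising an amino acid sequence that is at least 90% identical to a sequence selected from SEQ ID NO:7-10 and SEQ ID NO:31-35; and (c) a 6-SFT enzyme comprising an amino acid sequence that is at least 90% identical to a sequence selected from SEQ ID NO:13-21 and SEQ ID NO:38-52.
Each of the limitations of the invention can encompass various embodiments of the invention. It is therefore anticipated that each of the limitations of the invention involving any one element or combination of elements can be included in each aspect of the invention. The invention is not limited in its application to the details of construction and the arrangement of components set forth in the following description or illustrated in the drawings. The invention is capable of other embodiments and of being practiced or carried out in various ways. Also, the phraseology and terminology used in this disclosure are for the purpose of description and should not be regarded as limiting. The use of "including," "comprising," "having," "containing," "involving," and variations thereof is meant to encompass the items listed thereafter and equivalents thereof as well as additional items.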
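Brief description of the drawings
The accompanying drawings are not intended to be drawn to scale. In the drawings, each identical or nearly identical component that is illustrated in the various figures is represented by a like numeral. For purposes of clarity, not every component may be labeled in every drawing. In the drawings:
FIG. 1 depicts a schematic showing the chemical structures of selected fructans (inulin, levan, and graminan).
FIG. 2 depicts a schematic showing examples of the biosynthetic transformations and associated enzymes involved in fructan production in Agave tequilana.
FIGs. 3A-3B depict graphs showing data from the screening of an enzyme library. FIG. 3A shows a graph of individual enzymes and the resulting products formed from incubation with sucrose (β(2,6) fructan (labeled '2→6' on the y-axis) or β(2,1) fructan (labeled 'kestose' on the x-axis)). Based on product formation, individual enzymes were classified as: inactive; having invertase activity; having kestose transferase (1-SST) activity; or having β(2,6) branching (6-SFT) activity. FIG. 3B shows a graph of individual enzymes and the resulting products formed from incubation with kestose (β(2,1) inulin (labeled 'Nystose' on the y-axis) or β(2,1) fructan (labeled 'kestose' on the x-axis)). Based on product formation, individual enzymes were classified as: inactive; having kestase activity; or having 1-FFT activity. All reaction products in FIGs. 3A-3B were analyzed by HPLC and quantified using peak integration.
FIG. 4 depicts representative HPLC-RID traces of fructans. The top panel shows an example of an enzymatic bioconversion reaction (an individual enzyme incubated with sucrose). The bottom panel shows an example of a preparation of commercially available standards of nystose (A), 1-kestose (B), sucrose (C), glucose (D), and fructose (E).
FIG. 5 depicts a schematic showing the synthesis of branched inulin. Starting from sucrose (a dimer of glucose and fructose), kestose (comprising a β(2,1) linkage) is formed enzymatically using 1-SST activity. 1-FFT activity catalyzes the formation of linear inulin, which can react with an enzyme having 6-SFT activity to provide β(2,6)-branched inulin (G = glucose; F = fructose).
FIGs. 6A-6D show confirmation of the formation of branched inulin by bioconversion. FIG. 6A shows an HPLC-RID trace of a bioconversion reaction, indicating that branched inulin has been produced and can be distinguished from the starting material (sucrose) and the byproduct (glucose). FIG. 6B shows a schematic depicting the fragmentation products generated when branched inulin is subjected to analysis by GC/MS. These fragmentation products provide a unique mass spectral signature that indicates the presence of β(2,6) branching. FIG. 6C shows examples of GC/MS spectral analysis of: a bioconversion sample; a linear saccharide (chicory; Nicie); and a known branched sugar ('Best Ground'). FIG. 6D is a magnification of the GC/MS analysis in FIG. 6C between 28.0 and 29.6 min.
FIG. 7 is a non-limiting example of a sequence identity analysis of SEQ ID NO:2-4, SEQ ID NO:6, SEQ ID NO:8-10, SEQ ID NO:12, SEQ ID NO:14-21, and SEQ ID NO:63. The percent sequence identity between the indicated SEQ ID NOs is shown. SEQ ID NO:6 is tall fescue 1-SST. SEQ ID NO:12 is Echinops ritro 1-FFT. SEQ ID NO:63 corresponds to residues 60 to 623 of timothy 6-SFT (SEQ ID NO:23). Multiple Sequence Comparison by Log-Expectation (MUSCLE) was used for the sequence identity analysis.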
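Detailed description of the invention
In some aspects, the present disclosure provides cells and enzymes engineered for the production of polyfructans from sucrose. These enzymes include 1-SST enzymes, 1-FFT enzymes, and 6-SFT enzymes. The enzymes disclosed in this application, and host cells comprising such enzymes, can be used to promote the production of fructans, including branched fructans such as branched inulin. In some embodiments, the fructan comprises β(2,1) linkages, β(2,6) linkages, or a combination thereof.
Fructans
As used in this application, a "fructan" (which may also be referred to as a "polyfructan" or a "fructooligosaccharide") refers to an oligosaccharide comprising fructose monomers. Fructans often also comprise glucose. In some embodiments, a fructan comprises at least one β(2,1) linkage, at least one β(2,6) linkage, or a combination thereof. In some embodiments, the fructan is kestose (e.g., 1-kestose or 6-kestose), inulin, and/or graminan. In some embodiments, the fructan has a degree of polymerization (DP) of at least 3 (e.g., at least 3, at least 4, at least 5, at least 6), where the degree of polymerization refers to the total number of monosaccharide units (e.g., fructose units) in a fructan or the average number of monosaccharide units in a mixture of fructans. In some embodiments, the fructan comprises levan (e.g., linear levan or branched levan, e.g., comprising at least one β(2,1) linkage and/or at least one β(2,6) linkage). In some embodiments, the fructan is inulin. In some embodiments, the inulin is linear inulin or branched inulin (e.g., comprising at least one β(2,1) linkage and/or at least one β(2,6) linkage). In some embodiments, the fructan is graminan.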
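The definition of degree of polymerization above (unit count for a single fructan, mean unit count for a mixture) can be illustrated with a minimal Python sketch. The list-of-letters representation (G = glucose, F = fructose) and the function names are assumptions made for illustration only; they are not part of the disclosure.

```python
from statistics import mean

def degree_of_polymerization(units):
    """DP of one fructan: the total number of monosaccharide units."""
    return len(units)

def mean_degree_of_polymerization(mixture):
    """DP of a mixture: the average number of units per fructan."""
    return mean(degree_of_polymerization(fructan) for fructan in mixture)

# Illustrative molecules: G = glucose unit, F = fructose unit.
kestose = ["G", "F", "F"]        # trisaccharide
nystose = ["G", "F", "F", "F"]   # tetrasaccharide

if __name__ == "__main__":
    print(degree_of_polymerization(kestose))
    print(mean_degree_of_polymerization([kestose, nystose]))
```

Under this representation, a fructan meets the "DP of at least 3" condition when its unit list has three or more entries.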
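Formula 1 is an example of a fructan comprising a β(2,1) linkage:
[Structure image BDA0003553471780000061 not reproduced]
Formula 2 is an example of a fructan comprising a β(2,6) linkage:
[Structure image BDA0003553471780000062 not reproduced]
Formula 3 shows 1-kestose:
[Structure image BDA0003553471780000071 not reproduced]
Formula 4 shows 6-kestose:
[Structure image BDA0003553471780000072 not reproduced]
Formula 5 shows nystose:
[Structure image BDA0003553471780000073 not reproduced]
Formula 6 shows inulin, where n is any integer:
[Structure image BDA0003553471780000081 not reproduced]
Formula 7 shows an example of graminan, where n1 is any integer:
[Structure image BDA0003553471780000082 not reproduced]
Formula 8 shows an example of graminan, where n1 and n2 may independently be any integer:
[Structure image BDA0003553471780000091 not reproduced]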
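As one of ordinary skill in the art will appreciate, any of the fructans produced using the methods described in this application may have a variety of applications, including industrial uses. As a non-limiting example, long-chain fructans (e.g., levans) can be used in fermentation processes and in the production of vinegar. See also, e.g., Niness, J Nutr. 1999 Jul;129(7 Suppl):1402S-6S; Kolida et al., Br J Nutr. 2002; Koga et al., Pediatr Res. 2016 Dec;80(6):844-851; Roberfroid, J Nutr. 2007 Nov;137(11 Suppl):2493S-2502S; Suzuki et al., Bioscience Microflora Vol. 25(3), 109-116, 2006; Lopez and Urias-Silvas, Recent Advances in Fructooligosaccharides Research (pp. 297-310), 2007; and Vijn and Smeekens, Plant Physiology, June 1999, Vol. 120, pp. 351-359.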
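Sucrose:sucrose 1-fructosyltransferase (1-SST)
As used in this application, a "sucrose:sucrose 1-fructosyltransferase (1-SST)" refers to an enzyme that generates branched polyfructans by introducing β(2,1) linkages into saccharides (e.g., forming 1-kestose from sucrose). A 1-SST enzyme can use sucrose as a substrate. In some embodiments, the 1-SST exhibits specificity for sucrose compared to other saccharides. In some embodiments, the 1-SST produces 1-kestose from sucrose. In some embodiments, the 1-SST can use levan as a substrate to produce branched levan having β(2-6) linkages and β(2-1) linkages.
The host cells described in this application can comprise a 1-SST enzyme and/or a heterologous polynucleotide encoding such an enzyme. In some embodiments, the host cell comprises a heterologous polynucleotide encoding a 1-SST enzyme comprising an amino acid sequence that is at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical, or 100% identical (including all values in between), to any one of: SEQ ID NO:1-4, SEQ ID NO:6, and SEQ ID NO:24-28; a 1-SST enzyme in Table 2; or a 1-SST enzyme otherwise described in this application. In some embodiments, the host cell comprises a heterologous polynucleotide that is at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical, or 100% identical (including all values in between), to any one of: SEQ ID NO:5, SEQ ID NO:29-30, and SEQ ID NO:62; a polynucleotide encoding a 1-SST enzyme in Table 2; or a polynucleotide encoding a 1-SST enzyme otherwise described in this application.
In some embodiments, the host cell does not comprise a 1-SST derived from tall fescue. In some embodiments, the host cell does not comprise a 1-SST corresponding to SEQ ID NO:6.
In some embodiments, relative to a control, a host cell expressing a heterologous polynucleotide encoding a 1-SST enzyme can increase the conversion of sucrose to 1-kestose by 0.5-fold, 1-fold, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 3.5-fold, 4-fold, 4.5-fold, 5-fold, 5.5-fold, or 6-fold more (e.g., 2-fold to 6-fold more), and/or increase the introduction of β(2,1) linkages into oligosaccharides by 0.5-fold, 1-fold, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 3.5-fold, 4-fold, 4.5-fold, 5-fold, 5.5-fold, or 6-fold more (e.g., 2-fold to 6-fold more). In some embodiments, the control is a host cell expressing a heterologous polynucleotide encoding SEQ ID NO:6. In some embodiments, the control is a Pichia pastoris strain expressing a heterologous polynucleotide encoding SEQ ID NO:6 (as described in Lüscher, M. et al., "Cloning and Functional Analysis of Sucrose:Sucrose 1-Fructosyltransferase from Tall Fescue," Plant Physiology, 124:1217-1227 (2000), which is incorporated by reference).
In some embodiments, a host cell expressing a heterologous polynucleotide encoding a 1-SST enzyme can exhibit at least 0.5-fold, 1-fold, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 3.5-fold, 4-fold, 4.5-fold, 5-fold, 5.5-fold, or 6-fold higher (e.g., 2-fold to 6-fold higher) activity in the presence of sucrose relative to other saccharides. In some embodiments, the activity corresponds to the conversion of sucrose to 1-kestose and/or to increased introduction of β(2,1) linkages into oligosaccharides.
In some embodiments, the 1-SST comprises a sequence that is at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 71%, at least 72%, at least 73%, at least 74%, at least 75%, at least 76%, at least 77%, at least 78%, at least 79%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% (including all values in between) identical to any one of SEQ ID NO:1-4, SEQ ID NO:6, and SEQ ID NO:24-28.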
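Fructan:fructan 1-fructosyltransferase (1-FFT)
As used in this application, a "fructan:fructan 1-fructosyltransferase (1-FFT)" refers to an enzyme that catalyzes the conversion of an oligosaccharide comprising a β(2,1) linkage (e.g., 1-kestose) into a longer polymer chain of the oligosaccharide (e.g., the conversion of 1-kestose to inulin). A 1-FFT enzyme can use 1-kestose, sucrose, and/or fructose as a substrate. In some embodiments, a 1-FFT enzyme can use bifurcose or neokestose as a substrate. In some embodiments, the 1-FFT produces inulin (e.g., branched inulin) from 1-kestose.
The host cells described in this application can comprise a 1-FFT enzyme and/or a heterologous polynucleotide encoding such an enzyme. In some embodiments, the host cell comprises a heterologous polynucleotide encoding a 1-FFT enzyme comprising an amino acid sequence that is at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical, or 100% identical (including all values in between), to any one of: SEQ ID NO:7-10, SEQ ID NO:12, and SEQ ID NO:31-35; a 1-FFT enzyme in Table 2; or a 1-FFT enzyme otherwise described in this application. In some embodiments, the host cell comprises a heterologous polynucleotide that is at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical, or 100% identical (including all values in between), to any one of: SEQ ID NO:11, SEQ ID NO:36, and SEQ ID NO:37; a polynucleotide encoding a 1-FFT enzyme in Table 2; or a polynucleotide encoding a 1-FFT enzyme otherwise described in this application.
In some embodiments, the host cell does not comprise a 1-FFT enzyme derived from Echinops ritro. In some embodiments, the host cell does not comprise a 1-FFT enzyme corresponding to SEQ ID NO:12.
In some embodiments, relative to a control, a host cell expressing a heterologous polynucleotide encoding a 1-FFT enzyme can increase the conversion of 1-kestose to inulin by 0.5-fold, 1-fold, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 3.5-fold, 4-fold, 4.5-fold, 5-fold, 5.5-fold, or 6-fold more (e.g., 2-fold to 6-fold more), and/or increase the conversion of an oligosaccharide comprising a β(2,1) linkage into a longer polymer chain of the oligosaccharide by 0.5-fold, 1-fold, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 3.5-fold, 4-fold, 4.5-fold, 5-fold, 5.5-fold, or 6-fold more (e.g., 2-fold to 6-fold more). In some embodiments, the control is a host cell expressing a heterologous polynucleotide encoding SEQ ID NO:12. In some embodiments, the control is a Pichia pastoris strain expressing a heterologous polynucleotide encoding SEQ ID NO:12 (as described in Van den Ende, W. et al., "Cloning and Functional Analysis of a High DP Fructan:Fructan 1-Fructosyl transferase from Echinops ritro (Asteraceae): Comparison of the native and recombinant enzymes," Journal of Experimental Botany, 57(4):775-789 (2006), which is incorporated by reference).
In some embodiments, the 1-FFT enzyme comprises a sequence that is at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 71%, at least 72%, at least 73%, at least 74%, at least 75%, at least 76%, at least 77%, at least 78%, at least 79%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% (including all values in between) identical to any one of SEQ ID NO:7-10, SEQ ID NO:12, and SEQ ID NO:31-35.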
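Sucrose:fructan 6-fructosyltransferase (6-SFT)
As used in this application, a "sucrose:fructan 6-fructosyltransferase (6-SFT)" refers to an enzyme that generates a fructan by introducing a β(2,6) linkage into a saccharide (e.g., producing 6-kestose from sucrose) or that generates a more complex fructan by introducing a β(2,6) linkage into a precursor fructan (e.g., producing bifurcose from 1-kestose). A 6-SFT can use sucrose, 6-kestose, 1-kestose, bifurcose, and/or neokestose as a substrate. In some embodiments, the 6-SFT produces 6-kestose from sucrose. In some embodiments, the 6-SFT produces bifurcose from 1-kestose. In some embodiments, the 6-SFT produces graminan from bifurcose.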
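The substrate-to-product relationships stated in the three enzyme definitions above can be summarized as a small reaction map. The following Python sketch is only an illustration: the dictionary lists just the example conversions named in the text (not every possible substrate), and the function name and data layout are assumptions.

```python
from collections import deque

# (enzyme, substrate) -> product, restricted to the example conversions
# named in the enzyme definitions; not an exhaustive model.
REACTIONS = {
    ("1-SST", "sucrose"): "1-kestose",
    ("1-FFT", "1-kestose"): "inulin",
    ("6-SFT", "sucrose"): "6-kestose",
    ("6-SFT", "1-kestose"): "bifurcose",
    ("6-SFT", "bifurcose"): "graminan",
}

def reachable_products(start, enzymes):
    """Breadth-first search over REACTIONS using only the given enzymes."""
    seen = {start}
    queue = deque([start])
    while queue:
        substrate = queue.popleft()
        for (enzyme, sub), product in REACTIONS.items():
            if enzyme in enzymes and sub == substrate and product not in seen:
                seen.add(product)
                queue.append(product)
    return seen - {start}

if __name__ == "__main__":
    print(sorted(reachable_products("sucrose", {"1-SST", "1-FFT", "6-SFT"})))
```

In this toy map, graminan is reachable from sucrose only when both 1-SST and 6-SFT are available, which mirrors the stepwise route described in the text.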
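The host cells described in this application can comprise a 6-SFT enzyme and/or a heterologous polynucleotide encoding such an enzyme. In some embodiments, the host cell comprises a heterologous polynucleotide encoding a 6-SFT enzyme comprising an amino acid sequence that is at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical, or 100% identical (including all values in between), to any one of: SEQ ID NO:13-21, SEQ ID NO:23, and SEQ ID NO:38-52; a 6-SFT enzyme in Table 2; or a 6-SFT enzyme otherwise described in this application. In some embodiments, the host cell comprises a heterologous polynucleotide that is at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical, or 100% identical (including all values in between), to any one of: SEQ ID NO:22 and SEQ ID NO:53-59; a polynucleotide encoding a 6-SFT enzyme in Table 2; or a polynucleotide encoding a 6-SFT enzyme otherwise described in this application.
In some embodiments, the host cell does not comprise a 6-SFT enzyme derived from timothy. In some embodiments, the host cell does not comprise a 6-SFT enzyme corresponding to SEQ ID NO:23. In some embodiments, the host cell does not comprise a 6-SFT enzyme corresponding to SEQ ID NO:63.
In some embodiments, relative to a control, a host cell expressing a heterologous polynucleotide encoding a 6-SFT enzyme can increase the conversion of sucrose to 1-kestose by 0.5-fold, 1-fold, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 3.5-fold, 4-fold, 4.5-fold, 5-fold, 5.5-fold, or 6-fold more (e.g., 2-fold to 6-fold more); increase the conversion of 1-kestose to bifurcose by 0.5-fold, 1-fold, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 3.5-fold, 4-fold, 4.5-fold, 5-fold, 5.5-fold, or 6-fold more (e.g., 2-fold to 6-fold more); increase the conversion of bifurcose to graminan by 0.5-fold, 1-fold, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 3.5-fold, 4-fold, 4.5-fold, 5-fold, 5.5-fold, or 6-fold more (e.g., 2-fold to 6-fold more); and/or increase the introduction of β(2,6) linkages into fructans by 0.5-fold, 1-fold, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 3.5-fold, 4-fold, 4.5-fold, 5-fold, 5.5-fold, or 6-fold more (e.g., 2-fold to 6-fold more). In some embodiments, the control is a host cell expressing a heterologous polynucleotide encoding SEQ ID NO:23. In some embodiments, the control is a Pichia pastoris strain expressing a heterologous polynucleotide encoding SEQ ID NO:23 (as described in Tamura, K.I., et al., "Cloning and Functional Analysis of a Fructosyltransferase cDNA for Synthesis of Highly Polymerized Levans in Timothy (Phleum pratense L.)," Journal of Experimental Botany, 60(3), 893-905 (2009), which is incorporated by reference). In some embodiments, the control is a host cell expressing a heterologous polynucleotide encoding SEQ ID NO:63. In some embodiments, the control is a Pichia pastoris strain expressing a heterologous polynucleotide encoding SEQ ID NO:63.
In some embodiments, the 6-SFT comprises a sequence that is at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 71%, at least 72%, at least 73%, at least 74%, at least 75%, at least 76%, at least 77%, at least 78%, at least 79%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% (including all values in between) identical to any one of SEQ ID NO:13-21, SEQ ID NO:23, and SEQ ID NO:38-52.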
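Variants
The present disclosure also encompasses variants of the enzymes and proteins described in this application (e.g., 1-SST, 1-FFT, or 6-SFT), including variants of nucleic acid sequences and of amino acid sequences. A variant may share at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 71%, at least 72%, at least 73%, at least 74%, at least 75%, at least 76%, at least 77%, at least 78%, at least 79%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% (including all values in between) sequence identity with a reference sequence.
Unless otherwise indicated, the term "sequence identity," as known in the art, refers to a relationship between the sequences of two polypeptides or polynucleotides, as determined by sequence comparison (alignment). In some embodiments, sequence identity is determined across the entire length of a sequence (such as a reference sequence), while in other embodiments sequence identity is determined over a region of a sequence. In some embodiments, sequence identity is determined over a region (e.g., a stretch of amino acids or nucleic acids, e.g., a sequence spanning an active site) of a sequence (e.g., a 1-SST sequence, a 1-FFT sequence, or a 6-SFT sequence). For example, in some embodiments, sequence identity is determined over a region corresponding to at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or over 100% of the length of the reference sequence.
Identity measures the percentage of identical matches between the smaller of two or more sequences with gap alignments (if any) addressed by a particular mathematical model, algorithm, or computer program.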
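The identity of related polypeptides or nucleic acid sequences can be readily calculated by any of the methods known to one of ordinary skill in the art. The "percent identity" of two sequences (e.g., nucleic acid sequences or amino acid sequences) may, for example, be determined using the algorithm of Karlin and Altschul, Proc. Natl. Acad. Sci. USA 87:2264-68, 1990, modified as in Karlin and Altschul, Proc. Natl. Acad. Sci. USA 90:5873-77, 1993. Such an algorithm is incorporated into the NBLAST and XBLAST programs (version 2.0) of Altschul et al., J. Mol. Biol. 215:403-10, 1990. For example, BLAST protein searches can be performed with the XBLAST program (score = 50, word length = 3) to obtain amino acid sequences homologous to the proteins described in this application. Where gaps exist between two sequences, Gapped BLAST can be utilized, for example as described in Altschul et al., Nucleic Acids Res. 25(17):3389-3402, 1997. When utilizing the BLAST and Gapped BLAST programs, as one of ordinary skill in the art will appreciate, the default parameters of the respective programs (e.g., XBLAST and NBLAST) can be used, or the parameters can be adjusted appropriately.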
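Another local alignment technique that may be used, for example, is based on the Smith-Waterman algorithm (Smith, T.F. & Waterman, M.S. (1981) "Identification of common molecular subsequences." J. Mol. Biol. 147:195-197). A general global alignment technique that may be used, for example, is the Needleman-Wunsch algorithm (Needleman, S.B. & Wunsch, C.D. (1970) "A general method applicable to the search for similarities in the amino acid sequences of two proteins." J. Mol. Biol. 48:443-453), which is based on dynamic programming.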
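To make the dynamic-programming idea behind global alignment concrete, here is a minimal Needleman-Wunsch sketch in Python. The scoring values (match +1, mismatch -1, linear gap penalty -1) are assumptions chosen for illustration; real protein alignments normally use a substitution matrix and affine gap penalties, and the text does not prescribe any particular scoring scheme.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Global alignment of strings a and b with a linear gap penalty.

    Returns (aligned_a, aligned_b, score). Scoring values are illustrative.
    """
    n, m = len(a), len(b)
    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b)), score[n][m]

if __name__ == "__main__":
    print(needleman_wunsch("ACGT", "AGT"))
```

The Smith-Waterman local variant differs mainly in clamping cell scores at zero and starting the traceback from the highest-scoring cell rather than the corner.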
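More recently, a Fast Optimal Global Sequence Alignment Algorithm (FOGSAA) was developed that purportedly produces global alignments of nucleic acid and amino acid sequences faster than other optimal global alignment methods, including the Needleman-Wunsch algorithm. In some embodiments, the identity of two polypeptides is determined by aligning the two amino acid sequences, counting the number of identical amino acids, and dividing by the length of one of the amino acid sequences. In some embodiments, the identity of two nucleic acids is determined by aligning the two nucleotide sequences, counting the number of identical nucleotides, and dividing by the length of one of the nucleic acids.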
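The counting procedure just described (align, count identical residues, divide by the length of one of the sequences) can be written as a short Python sketch. The function name, the use of "-" as the gap character, and the `denominator` option are assumptions for illustration; the inputs are expected to be two already-aligned strings of equal length.

```python
def percent_identity(aligned_a, aligned_b, denominator="shorter"):
    """Percent identity from two pre-aligned, gapped sequences.

    denominator: "a", "b", or "shorter" selects which ungapped sequence
    length the match count is divided by (an illustrative option).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b)
                  if x == y and x != "-")
    len_a = len(aligned_a.replace("-", ""))
    len_b = len(aligned_b.replace("-", ""))
    lengths = {"a": len_a, "b": len_b, "shorter": min(len_a, len_b)}
    length = lengths[denominator]
    if length == 0:
        raise ValueError("cannot compute identity for an empty sequence")
    return 100.0 * matches / length

if __name__ == "__main__":
    print(percent_identity("ACGT", "A-GT"))                   # vs. shorter sequence
    print(percent_identity("ACGT", "A-GT", denominator="a"))  # vs. sequence a
```

The choice of denominator matters: the same alignment can give different percentages depending on which sequence length is used, so any reported value should state the convention.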
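For multiple sequence alignments, computer programs including Clustal Omega (Sievers et al., Mol Syst Biol. 2011 Oct 11;7:539) may be used.
In a preferred embodiment, a sequence (including a nucleic acid sequence or an amino acid sequence), such as a sequence disclosed in this application and/or defined in the claims, is found to have a specified percent identity to a reference sequence when sequence identity is determined using the algorithm of Karlin and Altschul, Proc. Natl. Acad. Sci. USA 87:2264-68, 1990, modified as in Karlin and Altschul, Proc. Natl. Acad. Sci. USA 90:5873-77, 1993 (e.g., the BLAST, NBLAST, XBLAST, or Gapped BLAST programs, using the default parameters of the respective programs).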
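In some embodiments, a sequence (including a nucleic acid sequence or an amino acid sequence), such as a sequence disclosed in this application and/or defined in the claims, is found to have a specified percent identity to a reference sequence when sequence identity is determined using the Smith-Waterman algorithm (Smith, T.F. & Waterman, M.S. (1981) "Identification of common molecular subsequences." J. Mol. Biol. 147:195-197) or the Needleman-Wunsch algorithm (Needleman, S.B. & Wunsch, C.D. (1970) "A general method applicable to the search for similarities in the amino acid sequences of two proteins." J. Mol. Biol. 48:443-453) with default parameters.
In some embodiments, a sequence (including a nucleic acid sequence or an amino acid sequence), such as a sequence disclosed in this application and/or defined in the claims, is found to have a specified percent identity to a reference sequence when sequence identity is determined using the Fast Optimal Global Sequence Alignment Algorithm (FOGSAA) with default parameters.
In some embodiments, a sequence (including a nucleic acid sequence or an amino acid sequence), such as a sequence disclosed in this application and/or defined in the claims, is found to have a specified percent identity to a reference sequence when sequence identity is determined using Clustal Omega (Sievers et al., Mol Syst Biol. 2011 Oct 11;7:539) with default parameters.
As used in this application, a residue (such as a nucleic acid residue or an amino acid residue) in sequence "X" is referred to as corresponding to a position or residue (such as a nucleic acid residue or an amino acid residue) "n" in a different sequence "Y" when the residue in sequence "X" is at the counterpart position of "n" in sequence "Y" when sequences X and Y are aligned using amino acid sequence alignment tools known in the art.
Variant sequences may be homologous sequences. As used in this application, homologous sequences are sequences (including nucleic acid sequences or amino acid sequences) that share a certain percent identity (e.g., at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 71%, at least 72%, at least 73%, at least 74%, at least 75%, at least 76%, at least 77%, at least 78%, at least 79%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% (including all values in between) percent identity). Homologous sequences include, but are not limited to, paralogous sequences, orthologous sequences, and sequences arising from convergent evolution. Paralogous sequences arise from duplication of a gene within the genome of a species, while orthologous sequences diverge after a speciation event. Through convergent evolution, two different species may have evolved independently but may each comprise a sequence that shares a certain percent identity with a sequence from the other species.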
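In some embodiments, a polypeptide variant (such as a 1-SST enzyme variant, a 1-FFT enzyme variant, or a 6-SFT enzyme variant) comprises a domain that shares a secondary structure (e.g., an α helix or a β sheet) with a reference polypeptide (e.g., a reference 1-SST enzyme, a reference 1-FFT enzyme, or a reference 6-SFT enzyme). In some embodiments, a polypeptide variant (such as a 1-SST enzyme variant, a 1-FFT enzyme variant, or a 6-SFT enzyme variant) shares a tertiary structure with a reference polypeptide (e.g., a reference 1-SST enzyme, a reference 1-FFT enzyme, or a reference 6-SFT enzyme). As a non-limiting example, a variant polypeptide (e.g., a 1-SST enzyme variant, a 1-FFT enzyme variant, or a 6-SFT enzyme variant) may have low primary sequence identity compared to a reference polypeptide (e.g., less than 80%, less than 75%, less than 70%, less than 65%, less than 60%, less than 55%, less than 50%, less than 45%, less than 40%, less than 35%, less than 30%, less than 25%, less than 20%, less than 15%, less than 10%, or less than 5% sequence identity) but share one or more secondary structures (e.g., including but not limited to loops, α helices, or β sheets) or have the same or a similar tertiary structure as the reference polypeptide. For example, a loop may be located between a β sheet and an α helix, between two α helices, or between two β sheets. Homology modeling may be used to compare two or more tertiary structures.
Mutations can be made in a nucleotide sequence by a variety of methods known to one of ordinary skill in the art. For example, mutations can be made by PCR-directed mutation, by site-directed mutagenesis according to the method of Kunkel (Kunkel, Proc. Nat. Acad. Sci. U.S.A. 82:488-492, 1985), by chemical synthesis of a gene encoding a polypeptide, by gene editing techniques, or by insertions, such as insertion of a tag (e.g., a HIS tag or a GFP tag). Mutations can include, for example, substitutions, deletions, and translocations generated by any method known in the art. Methods for producing mutations may be found in references such as Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds., Fourth Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 2012, or Current Protocols in Molecular Biology, F.M. Ausubel, et al., eds., John Wiley & Sons, Inc., New York, 2010.
In some embodiments, methods for producing variants include circular permutation (Yu and Lutz, Trends Biotechnol. 2011 Jan;29(1):18-25). In circular permutation, the linear primary sequence of a polypeptide can be circularized (e.g., by joining the N-terminus and the C-terminus of the sequence) and the polypeptide can be severed ("broken") at a different position. Thus, the linear primary sequence of the new polypeptide may have low sequence identity (e.g., less than 80%, less than 75%, less than 70%, less than 65%, less than 60%, less than 55%, less than 50%, less than 45%, less than 40%, less than 35%, less than 30%, less than 25%, less than 20%, less than 15%, less than 10%, or less than 5% (including all values in between)) as determined by linear sequence alignment methods (e.g., Clustal Omega or BLAST). Topological analysis of the two proteins, however, may reveal whether the tertiary structures of the two polypeptides are similar or dissimilar. Without being bound by a particular theory, a variant polypeptide created through circular permutation of a reference polypeptide and having a tertiary structure similar to that of the reference polypeptide can share similar functional characteristics (e.g., enzymatic activity, enzyme kinetics, substrate specificity, or product specificity). In some instances, circular permutation may alter the secondary structure, tertiary structure, or quaternary structure and produce an enzyme with different functional characteristics (e.g., increased or decreased enzymatic activity, different substrate specificity, or different product specificity). See, e.g., Yu and Lutz, Trends Biotechnol. 2011 Jan;29(1):18-25.
It should be appreciated that in a protein that has undergone circular permutation, the linear amino acid sequence of the protein will differ from that of a reference protein that has not undergone circular permutation. However, one of ordinary skill in the art would readily be able to determine which residues in the protein that has undergone circular permutation correspond to residues in the reference protein that has not undergone circular permutation, for example by aligning the sequences and detecting conserved motifs, and/or by comparing the structures or predicted structures of the proteins (e.g., by homology modeling).
In some embodiments, an algorithm described in this application for determining the percent identity between a sequence of interest and a reference sequence accounts for the presence of circular permutation between the sequences. The presence of circular permutation may be detected using any method known in the art, including, for example, RASPODOM (Weiner et al., Bioinformatics. 2005 Apr 1;21(7):932-7). In some embodiments, the presence of circular permutation is corrected for (e.g., the domains in at least one sequence are rearranged) before the percent identity between a sequence of interest and a sequence described in this application is calculated. It should be understood that the claims of this application encompass sequences for which the percent identity to a reference sequence is calculated after taking into account potential circular permutation of the sequence.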
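The effect of circular permutation on a naive position-by-position comparison, and the idea of correcting for it before computing identity, can be shown with a toy Python sketch. This handles only the idealized case where the query is an exact rotation of the reference; it is not how RASPODOM or other real detection tools work, and all function names here are hypothetical.

```python
def circular_permutation(seq, cut):
    """Join the termini of seq and reopen the chain at index `cut`."""
    if not seq:
        return seq
    cut %= len(seq)
    return seq[cut:] + seq[:cut]

def find_rotation(query, reference):
    """Return cut such that circular_permutation(reference, cut) == query,
    or None if query is not an exact rotation of reference."""
    if len(query) != len(reference):
        return None
    idx = (reference + reference).find(query)
    return idx if idx != -1 else None

def undo_circular_permutation(query, reference):
    """Rearrange query back into the reference frame when it is an exact
    rotation of the reference; otherwise return it unchanged."""
    cut = find_rotation(query, reference)
    if cut is None:
        return query
    return circular_permutation(query, len(query) - cut)

def positional_identity(a, b):
    """Fraction of identical residues at the same index (no alignment)."""
    return sum(x == y for x, y in zip(a, b)) / max(len(a), len(b))

if __name__ == "__main__":
    ref = "ABCDEFGH"  # placeholder letters, not a real protein
    variant = circular_permutation(ref, 3)
    print(variant, positional_identity(variant, ref))
    print(positional_identity(undo_circular_permutation(variant, ref), ref))
```

The example shows why a linear comparison can report low identity for a permuted protein even though every residue is conserved once the frame is restored.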
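The present disclosure also encompasses functional variants of the recombinant 1-SST enzymes, 1-FFT enzymes, or 6-SFT enzymes disclosed in this application. For example, a functional variant may bind one or more of the same substrates or produce one or more of the same products. Functional variants may be identified using any method known in the art. For example, the algorithm of Karlin and Altschul, Proc. Natl. Acad. Sci. USA 87:2264-68, 1990, described above may be used to identify homologous proteins with known functions.
Putative functional variants may also be identified by searching for polypeptides with functionally annotated domains. Databases including Pfam (Sonnhammer et al., Proteins. 1997 Jul;28(3):405-20) may be used to identify polypeptides with a particular domain.
Homology modeling may also be used to identify amino acid residues that are amenable to mutation without affecting function. A non-limiting example of such a method may include the use of a position-specific scoring matrix (PSSM) and an energy minimization protocol.
A position-specific scoring matrix (PSSM) uses a position weight matrix to identify consensus sequences (e.g., motifs). PSSM analysis can be conducted on nucleic acid or amino acid sequences. The method uses aligned sequences and takes into account the observed frequency of a particular residue (e.g., an amino acid or a nucleotide) at a particular position and the number of sequences analyzed. See, e.g., Stormo et al., Nucleic Acids Res. 1982 May 11;10(9):2997-3011. The likelihood of observing a particular residue at a given position can be calculated. Without being bound by a particular theory, positions in sequences with high variability may be amenable to mutation (e.g., PSSM score ≥0) to produce functional homologs.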
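As a rough illustration of how a PSSM is derived from aligned sequences, the Python sketch below builds a log-odds matrix from per-column residue frequencies. The uniform background frequency, the pseudocount of 1, the log base 2, and the assumption of a gap-free alignment are all simplifications chosen for this example, not choices stated in the text; the toy alignment is invented.

```python
import math
from collections import Counter

def build_pssm(aligned_seqs, alphabet, pseudocount=1.0):
    """Return a list (one dict per column) of log2-odds scores.

    Assumes a gap-free alignment and a uniform background distribution.
    """
    length = len(aligned_seqs[0])
    if any(len(s) != length for s in aligned_seqs):
        raise ValueError("all aligned sequences must have the same length")
    background = 1.0 / len(alphabet)
    total = len(aligned_seqs) + pseudocount * len(alphabet)
    pssm = []
    for pos in range(length):
        counts = Counter(s[pos] for s in aligned_seqs)
        column = {}
        for residue in alphabet:
            freq = (counts[residue] + pseudocount) / total
            column[residue] = math.log2(freq / background)
        pssm.append(column)
    return pssm

def tolerated_residues(pssm, position, threshold=0.0):
    """Residues whose score at `position` meets the threshold (score >= 0)."""
    return sorted(r for r, s in pssm[position].items() if s >= threshold)

if __name__ == "__main__":
    toy_alignment = ["ACGT", "ACGA", "ACGT", "TCGT"]  # invented DNA example
    pssm = build_pssm(toy_alignment, "ACGT")
    print(tolerated_residues(pssm, 0))
```

In this toy alignment the first column tolerates more than one residue at the score ≥0 threshold, while the fully conserved second column tolerates only the conserved residue.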
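PSSM may be paired with calculation of a Rosetta energy function, which determines the difference between the wild type and a single-point mutant. The Rosetta energy function calculates this difference as ΔΔGcalc. With the Rosetta function, the bonding interactions between the mutated residue and the surrounding atoms are used to determine whether the mutation increases or decreases protein stability. For example, a mutation designated as favorable by its PSSM score (e.g., PSSM score ≥0) can then be analyzed using the Rosetta energy function to determine the potential effect of the mutation on protein stability. Without being bound by a particular theory, potentially stabilizing mutations are desirable for protein engineering (e.g., production of functional homologs). In some embodiments, a potentially stabilizing mutation has a ΔΔGcalc value of less than -0.1 (e.g., less than -0.2, less than -0.3, less than -0.35, less than -0.4, less than -0.45, less than -0.5, less than -0.55, less than -0.6, less than -0.65, less than -0.7, less than -0.75, less than -0.8, less than -0.85, less than -0.9, less than -0.95, or less than -1.0) Rosetta energy units (R.e.u.). See, e.g., Goldenzweig et al., Mol Cell. 2016 Jul 21;63(2):337-346. doi:10.1016/j.molcel.2016.06.012.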
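The two-stage filter described above (keep mutations with PSSM score ≥0, then keep those whose ΔΔGcalc is below a cutoff such as -0.1 R.e.u.) can be sketched in Python as follows. The `Mutation` record, the function name, and every numeric value in the example list are hypothetical; in practice the scores would come from a PSSM calculation and from Rosetta, neither of which is invoked here.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Mutation:
    name: str          # e.g. "A123V" (hypothetical label)
    pssm_score: float  # position-specific score for the new residue
    ddg_calc: float    # computed delta-delta-G in Rosetta energy units

def select_stabilizing(mutations, pssm_min=0.0, ddg_max=-0.1):
    """Keep mutations with pssm_score >= pssm_min and ddg_calc < ddg_max."""
    return [m for m in mutations
            if m.pssm_score >= pssm_min and m.ddg_calc < ddg_max]

# Invented candidate values for illustration only.
CANDIDATES = [
    Mutation("A10V", pssm_score=1.2, ddg_calc=-0.45),
    Mutation("S25T", pssm_score=0.0, ddg_calc=-0.05),
    Mutation("L40P", pssm_score=-2.0, ddg_calc=-0.80),
    Mutation("K77R", pssm_score=0.5, ddg_calc=-1.10),
]

if __name__ == "__main__":
    for m in select_stabilizing(CANDIDATES):
        print(m.name)
```

Tightening `ddg_max` (for example to -1.0) narrows the selection to the mutations predicted to be most stabilizing.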
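In some embodiments, a 1-SST enzyme, 1-FFT enzyme, or 6-SFT enzyme coding sequence comprises a mutation at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, or more than 100 positions corresponding to a reference (e.g., 1-SST enzyme, 1-FFT enzyme, or 6-SFT enzyme) coding sequence. In some embodiments, the 1-SST enzyme, 1-FFT enzyme, or 6-SFT enzyme coding sequence comprises a mutation in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, or more codons of the coding sequence relative to a reference (e.g., 1-SST enzyme, 1-FFT enzyme, or 6-SFT enzyme) coding sequence. As one of ordinary skill in the art will appreciate, because of the degeneracy of the genetic code, a mutation within a codon may or may not change the amino acid that is encoded by the codon. In some embodiments, the one or more mutations in the coding sequence do not alter the amino acid sequence encoded by the coding sequence (e.g., of the 1-SST enzyme, 1-FFT enzyme, or 6-SFT enzyme) relative to the amino acid sequence of a reference polypeptide (e.g., 1-SST enzyme, 1-FFT enzyme, or 6-SFT enzyme).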
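The point about codon degeneracy (a codon change may or may not change the encoded amino acid) can be checked mechanically. The Python sketch below builds the standard genetic code table and labels each differing codon as synonymous or nonsynonymous; the short DNA strings in the example are invented, and the function names are assumptions for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence; trailing partial codons are ignored."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

def classify_codon_changes(reference, variant):
    """List (codon_number, ref_codon, var_codon, kind) for differing codons.

    Assumes equal-length, in-frame sequences (substitutions only).
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must have the same length")
    changes = []
    for i in range(0, len(reference) - len(reference) % 3, 3):
        ref_codon, var_codon = reference[i:i + 3], variant[i:i + 3]
        if ref_codon != var_codon:
            same_aa = CODON_TABLE[ref_codon] == CODON_TABLE[var_codon]
            kind = "synonymous" if same_aa else "nonsynonymous"
            changes.append((i // 3 + 1, ref_codon, var_codon, kind))
    return changes

if __name__ == "__main__":
    ref = "ATGCTGGAA"   # invented example: Met-Leu-Glu
    var = "ATGCTAGAT"   # codon 2 and codon 3 each carry one substitution
    print(translate(ref), translate(var))
    print(classify_codon_changes(ref, var))
```

A coding sequence whose changes are all classified as synonymous encodes the same amino acid sequence as the reference.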
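In some embodiments, one or more mutations in a recombinant 1-SST enzyme sequence, a recombinant 1-FFT enzyme sequence, or a recombinant 6-SFT enzyme sequence alter the amino acid sequence of the polypeptide (e.g., 1-SST enzyme, 1-FFT enzyme, or 6-SFT enzyme) relative to the amino acid sequence of a reference polypeptide (e.g., 1-SST enzyme, 1-FFT enzyme, or 6-SFT enzyme). In some embodiments, the one or more mutations alter the amino acid sequence of the recombinant polypeptide (e.g., 1-SST enzyme, 1-FFT enzyme, or 6-SFT enzyme) relative to the amino acid sequence of a reference polypeptide (e.g., 1-SST enzyme, 1-FFT enzyme, or 6-SFT enzyme) and alter (enhance or reduce) an activity of the polypeptide relative to the reference polypeptide.
The activity, including specific activity, of any of the recombinant polypeptides described in this application (e.g., 1-SST enzyme, 1-FFT enzyme, or 6-SFT enzyme) may be measured using routine methods. As a non-limiting example, the activity of a recombinant polypeptide may be determined by measuring its substrate specificity, the product or products produced, the concentration of the product or products produced, or any combination thereof. As used in this application, the "specific activity" of a recombinant polypeptide refers to the amount (e.g., concentration) of a particular product produced per unit time for a given amount (e.g., concentration) of the recombinant polypeptide.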
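The definition of specific activity above amounts to a simple ratio: product formed, divided by elapsed time, divided by the amount of enzyme. A minimal Python sketch follows; the units (µmol, min, mg) and the numbers in the example are assumptions made for illustration, since the text does not fix units.

```python
def specific_activity(product_amount, elapsed_time, enzyme_amount):
    """Product formed per unit time per unit of enzyme.

    Units are whatever the caller supplies (e.g. umol, min, mg gives
    umol/min/mg); the function only performs the division.
    """
    if elapsed_time <= 0 or enzyme_amount <= 0:
        raise ValueError("elapsed_time and enzyme_amount must be positive")
    return product_amount / (elapsed_time * enzyme_amount)

if __name__ == "__main__":
    # Hypothetical measurement: 12 umol of 1-kestose formed in 30 min
    # by 0.2 mg of enzyme.
    print(specific_activity(12.0, 30.0, 0.2))
```

Comparing the specific activities of a variant and a reference enzyme measured under the same conditions gives the kind of fold-change figures used elsewhere in the text.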
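The skilled artisan will also realize that mutations in a recombinant polypeptide (e.g., 1-SST enzyme, 1-FFT enzyme, or 6-SFT enzyme) coding sequence may result in conservative amino acid substitutions that provide functionally equivalent variants of the foregoing polypeptides (e.g., variants that retain the activities of the polypeptides). As used in this application, a "conservative amino acid substitution" refers to an amino acid substitution that does not alter the relative charge or size characteristics or the functional activity of the protein in which the amino acid substitution is made.
In some instances, an amino acid is characterized by its R group (see, e.g., Table 1). For example, an amino acid may comprise a nonpolar aliphatic R group, a positively charged R group, a negatively charged R group, a nonpolar aromatic R group, or a polar uncharged R group. Non-limiting examples of amino acids comprising a nonpolar aliphatic R group include alanine, glycine, valine, leucine, methionine, and isoleucine. Non-limiting examples of amino acids comprising a positively charged R group include lysine, arginine, and histidine. Non-limiting examples of amino acids comprising a negatively charged R group include aspartate and glutamate. Non-limiting examples of amino acids comprising a nonpolar aromatic R group include phenylalanine, tyrosine, and tryptophan. Non-limiting examples of amino acids comprising a polar uncharged R group include serine, threonine, cysteine, proline, asparagine, and glutamine.
Non-limiting examples of functionally equivalent variants of polypeptides may include conservative amino acid substitutions in the amino acid sequences of the proteins disclosed in this application. As used in this application, "conservative substitution" is used interchangeably with "conservative amino acid substitution" and refers to any one of the amino acid substitutions provided in Table 1.
In some embodiments, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, or more than 20 residues can be changed when preparing variant polypeptides. In some embodiments, amino acids are replaced by conservative amino acid substitutions.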
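Table 1. Conservative amino acid substitutions
Original residue    R group type                   Conservative amino acid substitutions
Ala                 Nonpolar aliphatic R group     Cys, Gly, Ser
Arg                 Positively charged R group     His, Lys
Asn                 Polar uncharged R group        Asp, Gln, Glu
Asp                 Negatively charged R group     Asn, Gln, Glu
Cys                 Polar uncharged R group        Ala, Ser
Gln                 Polar uncharged R group        Asn, Asp, Glu
Glu                 Negatively charged R group     Asn, Asp, Gln
Gly                 Nonpolar aliphatic R group     Ala, Ser
His                 Positively charged R group     Arg, Tyr, Trp
Ile                 Nonpolar aliphatic R group     Leu, Met, Val
Leu                 Nonpolar aliphatic R group     Ile, Met, Val
Lys                 Positively charged R group     Arg, His
Met                 Nonpolar aliphatic R group     Ile, Leu, Phe, Val
Pro                 Polar uncharged R group        (none listed)
Phe                 Nonpolar aromatic R group      Met, Trp, Tyr
Ser                 Polar uncharged R group        Ala, Gly, Thr
Thr                 Polar uncharged R group        Ala, Asn, Ser
Trp                 Nonpolar aromatic R group      His, Phe, Tyr, Met
Tyr                 Nonpolar aromatic R group      His, Phe, Trp
Val                 Nonpolar aliphatic R group     Ile, Leu, Met, Thr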
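Table 1 can be used as a lookup: a substitution counts as conservative if the new residue appears in the row of the original residue. The Python sketch below encodes the table as a dictionary using the three-letter codes from the table; the function name is hypothetical. Note that the table as given is directional (for example, Val lists Thr, but Thr does not list Val), and the sketch preserves that rather than symmetrizing it.

```python
# Conservative substitutions as listed in Table 1 (original -> allowed set).
CONSERVATIVE = {
    "Ala": {"Cys", "Gly", "Ser"},
    "Arg": {"His", "Lys"},
    "Asn": {"Asp", "Gln", "Glu"},
    "Asp": {"Asn", "Gln", "Glu"},
    "Cys": {"Ala", "Ser"},
    "Gln": {"Asn", "Asp", "Glu"},
    "Glu": {"Asn", "Asp", "Gln"},
    "Gly": {"Ala", "Ser"},
    "His": {"Arg", "Tyr", "Trp"},
    "Ile": {"Leu", "Met", "Val"},
    "Leu": {"Ile", "Met", "Val"},
    "Lys": {"Arg", "His"},
    "Met": {"Ile", "Leu", "Phe", "Val"},
    "Pro": set(),  # no substitutions listed in the table
    "Phe": {"Met", "Trp", "Tyr"},
    "Ser": {"Ala", "Gly", "Thr"},
    "Thr": {"Ala", "Asn", "Ser"},
    "Trp": {"His", "Phe", "Tyr", "Met"},
    "Tyr": {"His", "Phe", "Trp"},
    "Val": {"Ile", "Leu", "Met", "Thr"},
}

def is_conservative(original, replacement):
    """True if Table 1 lists `replacement` for `original` (directional)."""
    if original not in CONSERVATIVE:
        raise KeyError(f"unknown residue: {original}")
    return replacement in CONSERVATIVE[original]

if __name__ == "__main__":
    print(is_conservative("Ala", "Gly"))
    print(is_conservative("Thr", "Val"))
```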
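Amino acid substitutions in the amino acid sequence of a polypeptide can be made by altering the coding sequence of the polypeptide to produce a recombinant polypeptide variant having a desired property and/or activity. Similarly, conservative amino acid substitutions in the amino acid sequence of a polypeptide are typically made by altering the coding sequence of the recombinant polypeptide to produce functionally equivalent variants of the polypeptide.
A sequence encoding an enzyme of the present disclosure may also encode a secretion signal. As a non-limiting example, the secretion signal may be selected according to the host cell of interest. In some embodiments, the secretion signal may be a yeast secretion signal, a plant secretion signal, or a bacterial secretion signal.
In some embodiments, the secretion signal comprises a sequence that is at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 71%, at least 72%, at least 73%, at least 74%, at least 75%, at least 76%, at least 77%, at least 78%, at least 79%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to the following sequence:
[Sequence image BDA0003553471780000201 not reproduced]
In some embodiments, the nucleic acid sequence encoding the secretion signal comprises a sequence that is at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 71%, at least 72%, at least 73%, at least 74%, at least 75%, at least 76%, at least 77%, at least 78%, at least 79%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to the following sequence:
[Sequence image BDA0003553471780000202 not reproduced]
It should be understood that other secretion signals known to one of ordinary skill in the art would also be compatible with aspects of the present disclosure.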
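Nucleic acids encoding enzymes of the present disclosure
Aspects of the present disclosure relate to recombinant enzymes, functional modifications and variants thereof, and uses relating thereto. For example, the enzymes and cells described in this application may be used to promote the production of fructans (e.g., branched fructans, e.g., branched inulin). The methods may comprise using a host cell comprising one or more enzymes disclosed in this application, a cell lysate, an isolated enzyme, or any combination thereof. The present disclosure encompasses methods comprising recombinant expression, in a host cell, of a polynucleotide encoding an enzyme disclosed in this application. The present disclosure also encompasses in vitro methods comprising reacting one or more enzymes for producing polyfructans in a reaction mixture with a BCAA pathway enzyme disclosed in this application. In some embodiments, the BCAA pathway enzyme is a 1-SST enzyme, a 1-FFT enzyme, a 6-SFT enzyme, or a combination thereof.
Nucleic acids encoding any one or more of the recombinant polypeptides 1-SST, 1-FFT, and/or 6-SFT are encompassed by the present disclosure and may be comprised within a host cell. In some embodiments, the nucleic acid is in the form of an operon. In some embodiments, at least one ribosome binding site is present between one or more of the coding sequences present in the nucleic acid.
In some embodiments, a nucleic acid provided in this application is a nucleic acid that hybridizes under high or medium stringency conditions to a nucleic acid encoding 1-SST, 1-FFT, and/or 6-SFT and that is biologically active. For example, high stringency conditions can include 0.2×SSC to 1×SSC at 65 °C, followed by a wash in 0.2×SSC at 65 °C. In some embodiments, a nucleic acid provided in this application is a nucleic acid that hybridizes under low stringency conditions to a nucleic acid encoding 1-SST, 1-FFT, and/or 6-SFT and that is biologically active. For example, low stringency conditions can include 6×SSC at room temperature, followed by a wash in 2×SSC at room temperature. Other hybridization conditions include 3×SSC at 40 °C or 50 °C, followed by a wash in 1×SSC or 2×SSC at 20 °C, 30 °C, 40 °C, 50 °C, 60 °C, or 65 °C.
Hybridization can be conducted in the presence of formaldehyde (e.g., 10%, 20%, 30%, 40%, or 50%), which further increases the stringency of hybridization. The theory and practice of nucleic acid hybridization are described, for example, in S. Agrawal (ed.), Methods in Molecular Biology, volume 20; and Tijssen (1993), Laboratory Techniques in Biochemistry and Molecular Biology - Hybridization with Nucleic Acid Probes (e.g., Part I, Chapter 2, "Overview of principles of hybridization and the strategy of nucleic acid probe assays," Elsevier, New York). Exemplary proteins may have at least about 50%, 70%, 80%, 90%, 95%, 98%, or 99% homology or identity with a 1-SST protein, a 1-FFT protein, or a 6-SFT protein, or a domain thereof (e.g., a catalytic domain). Other exemplary proteins may be encoded by a nucleic acid that has at least about 50%, 70%, 80%, 90%, 95%, 98%, or 99% homology or identity with a nucleic acid encoding a 1-SST protein, a 1-FFT protein, or a 6-SFT protein, or a domain thereof (e.g., a catalytic domain).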
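A nucleic acid encoding any one or more of the recombinant polypeptides described in this application may be incorporated into any appropriate vector by any method known in the art. For example, the vector may be an expression vector, including but not limited to a viral vector (e.g., a lentiviral vector, a retroviral vector, an adenoviral vector, or an adeno-associated viral vector), any vector suitable for transient expression, any vector suitable for constitutive expression, or any vector suitable for inducible expression (e.g., a galactose-inducible vector or a doxycycline-inducible vector).
In some embodiments, the vector replicates autonomously in the cell. In some embodiments, the vector is integrated into a chromosome within the cell. A vector can contain one or more endonuclease restriction sites that are cut by a restriction endonuclease to insert and ligate a nucleic acid containing a gene described in this application, producing a recombinant vector that is able to replicate in a cell. Vectors are typically composed of DNA, although RNA vectors are also available. Cloning vectors include, but are not limited to: plasmids, fosmids, phagemids, virus genomes, and artificial chromosomes. As used in this application, the terms "expression vector" and "expression construct" refer to a nucleic acid construct, generated recombinantly or synthetically, with a series of specified nucleic acid elements that permit transcription of a particular nucleic acid in a host cell (e.g., a microorganism), such as a yeast cell. In some embodiments, the nucleic acid sequence of a gene described in this application is inserted into a cloning vector such that it is operably linked to regulatory sequences and, in some embodiments, expressed as an RNA transcript. In some embodiments, the vector contains one or more markers, such as a selectable marker, to identify cells transformed or transfected with the recombinant vector. In some embodiments, the nucleic acid sequence of a gene described in this application is recoded. Recoding may increase production of the gene product by at least 2%, at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 100% (including all values in between) relative to a reference sequence that is not recoded.
A coding sequence and a regulatory sequence are said to be "operably joined" or "operably linked" when the coding sequence and the regulatory sequence are covalently linked and the expression or transcription of the coding sequence is under the influence or control of the regulatory sequence. If the coding sequence is to be translated into a functional protein, the coding sequence and the regulatory sequence are said to be operably linked if induction of a promoter in the 5' regulatory sequence permits the coding sequence to be transcribed and if the nature of the linkage between the coding sequence and the regulatory sequence does not (1) result in the introduction of a frame-shift mutation, (2) interfere with the ability of the promoter region to direct the transcription of the coding sequence, or (3) interfere with the ability of the corresponding RNA transcript to be translated into a protein.
In some embodiments, a nucleic acid encoding any one or more of the proteins described in this application is under the control of regulatory sequences (e.g., enhancer sequences). In some embodiments, the nucleic acid is expressed under the control of a promoter. The promoter can be a native promoter (e.g., the promoter of the gene in its endogenous context, which provides normal regulation of expression of the gene). Alternatively, the promoter can be a promoter that is different from the native promoter of the gene, e.g., a promoter that is different from the promoter of the gene in its endogenous context.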
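The enzymes disclosed herein may be encoded by the same heterologous polynucleotide or by different heterologous polynucleotides. For example, at least 2, 3, 4, 5, 6, 7, 8, 9, 10, or more than 10 enzymes may be encoded by the same heterologous polynucleotide or by one or more different heterologous polynucleotides.
In some embodiments, the heterologous polynucleotide encoding a 1-SST enzyme also encodes a 1-FFT enzyme and/or a 6-SFT enzyme; the heterologous polynucleotide encoding a 1-FFT enzyme also encodes a 1-SST enzyme and/or a 6-SFT enzyme; or the heterologous polynucleotide encoding a 6-SFT enzyme also encodes a 1-SST enzyme and/or a 1-FFT enzyme.
In some embodiments, the heterologous polynucleotide comprises a single promoter operably linked to a polynucleotide encoding at least one enzyme. For example, a single nucleic acid encoding at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10 enzymes may be operably linked to a single promoter. Expression of the enzymes within a single heterologous polynucleotide may be controlled by any method known in the art, including, for example, an internal ribosome entry site (IRES) or a polypeptide cleavage signal such as a 2A sequence.
In some cases, the heterologous polynucleotide comprises more than one promoter. In some cases, a separate promoter is operably linked to at least two polynucleotide sequences, each of which encodes an enzyme for producing a polyfructan. In some cases, a separate promoter is operably linked to each polynucleotide sequence encoding an enzyme for producing a polyfructan.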
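The two layouts described above (one promoter driving several enzymes, or one promoter per enzyme) can be written down as ordered lists of parts. This is only an illustrative sketch: the part names "promoter", "2A", and "terminator" are placeholders rather than specific sequences from this application, and the 2A linker is just one of the options mentioned in the text.

```python
from typing import List


def design_cassettes(enzymes: List[str], single_promoter: bool) -> List[List[str]]:
    """Return expression cassettes as ordered lists of placeholder part names."""
    if not enzymes:
        raise ValueError("at least one enzyme is required")
    if single_promoter:
        # One promoter, coding sequences separated by a 2A cleavage signal.
        parts = ["promoter"]
        for index, enzyme in enumerate(enzymes):
            if index > 0:
                parts.append("2A")
            parts.append(enzyme)
        parts.append("terminator")
        return [parts]
    # One promoter and terminator per coding sequence.
    return [["promoter", enzyme, "terminator"] for enzyme in enzymes]


for layout in (True, False):
    print(design_cassettes(["1-SST", "1-FFT", "6-SFT"], single_promoter=layout))
```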
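In some embodiments, the promoter is a eukaryotic promoter. Non-limiting examples of eukaryotic promoters, as known to one of ordinary skill in the art, include TDH3, PGK1, PKC1, PDC1, TEF1, TEF2, RPL18B, SSA1, TDH2, PYK1, TPI1, GAL1, GAL10, GAL7, GAL3, GAL2, MET3, MET25, HXT3, HXT7, ACT1, 6-SFT1, 6-SFT2, CUP1-1, ENO2, pAOX1, pGAP1, and SOD1 (see, e.g., the Addgene website: blog.addgene.org/plasmids-101-the-promoter-region). In some embodiments, the promoter is a prokaryotic promoter (e.g., a bacteriophage promoter or a bacterial promoter). Non-limiting examples of bacteriophage promoters, as known to one of ordinary skill in the art, include Pls1con, T3, T7, SP6, and PL. Non-limiting examples of bacterial promoters include Pbad, PmgrB, Ptrc2, Plac/ara, Ptac, and Pm.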
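In some embodiments, the promoter is an inducible promoter. As used in this application, an "inducible promoter" is a promoter controlled by the presence or absence of a molecule. This may be used, for example, to controllably induce expression of an enzyme. In some cases, an inducible promoter is used to controllably repress expression of an enzyme. Non-limiting examples of inducible promoters include chemically regulated promoters and physically regulated promoters. For chemically regulated promoters, transcriptional activity may be regulated by one or more compounds, such as an alcohol, tetracycline, galactose, a steroid, a metal, or other compounds. For physically regulated promoters, transcriptional activity may be regulated by a phenomenon such as light or temperature. Non-limiting examples of tetracycline-regulated promoters include anhydrotetracycline (aTc)-responsive promoters and other tetracycline-responsive promoter systems (e.g., a tetracycline repressor protein (tetR), a tetracycline operator sequence (tetO), and a tetracycline transactivator fusion protein (tTA)). Non-limiting examples of steroid-regulated promoters include promoters based on the rat glucocorticoid receptor, the human estrogen receptor, and moth ecdysone receptors, as well as promoters from the steroid/retinoid/thyroid receptor superfamily. Non-limiting examples of metal-regulated promoters include promoters derived from metallothionein (a protein that binds and sequesters metal ions) genes. Non-limiting examples of pathogenesis-regulated promoters include promoters induced by salicylic acid, ethylene, or benzothiadiazole (BTH). Non-limiting examples of temperature/heat-inducible promoters include heat shock promoters. Non-limiting examples of light-regulated promoters include light-responsive promoters from plant cells. In certain embodiments, the inducible promoter is a galactose-inducible promoter. In some embodiments, an inducible promoter is induced by one or more physiological conditions (e.g., pH, temperature, radiation, osmotic pressure, saline gradients, cell surface binding, or the concentration of one or more extrinsic or intrinsic inducing agents). Non-limiting examples of extrinsic inducers or inducing agents include amino acids and amino acid analogs, saccharides and polysaccharides, nucleic acids, protein transcriptional activators and repressors, cytokines, toxins, petroleum-based compounds, metal-containing compounds, salts, ions, enzyme substrate analogs, hormones, or any combination thereof. In some embodiments, the inducible promoter is a pAOX1 promoter. In some embodiments, an inducible promoter is used to drive expression in a eukaryotic cell. In some embodiments, the eukaryotic cell is a yeast cell. In some embodiments, the yeast cell is a Pichia cell. In some embodiments, the yeast cell is a Saccharomyces cell.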
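In some embodiments, the promoter is a constitutive promoter. As used in this application, a "constitutive promoter" refers to an unregulated promoter that allows continuous transcription of a gene. Non-limiting examples of constitutive promoters include TDH3, PGK1, PKC1, PDC1, TEF1, TEF2, RPL18B, SSA1, TDH2, PYK1, TPI1, HXT3, HXT7, ACT1, 6-SFT1, 6-SFT2, ENO2, pGAP1, and SOD1. In some embodiments, a constitutive promoter is used to drive expression in a eukaryotic cell. In some embodiments, the eukaryotic cell is a yeast cell. In some embodiments, the yeast cell is a Pichia cell. In some embodiments, the yeast cell is a Saccharomyces cell.
Other inducible or constitutive promoters known to one of ordinary skill in the art are also compatible with aspects of the present disclosure.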
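The precise nature of the regulatory sequences needed for gene expression may vary between species or cell types, but generally may include, as necessary, 5' non-transcribed and 5' non-translated sequences involved in the initiation of transcription and translation, respectively, such as a TATA box, capping sequence, CAAT sequence, and the like. In particular, such 5' non-transcribed regulatory sequences may include a promoter region that includes a promoter sequence for transcriptional control of the operably linked gene. Regulatory sequences may also include enhancer sequences or upstream activator sequences. The vectors disclosed in this application may include 5' leader or signal sequences. Regulatory sequences may also include terminator sequences. In some embodiments, a terminator sequence marks the end of a gene in DNA during transcription. The choice and design of one or more appropriate vectors suitable for inducing expression of one or more genes described in this application in a heterologous organism is within the ability and discretion of one of ordinary skill in the art.
Expression vectors containing the elements necessary for expression are commercially available and known to one of ordinary skill in the art (see, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual, Fourth Edition, Cold Spring Harbor Laboratory Press, 2012).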
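Host cells
Any of the proteins or enzymes of the present disclosure may be expressed in a host cell. The term "host cell" refers to a cell that can be used to express a polynucleotide, such as a polynucleotide encoding an enzyme used in the production of oligosaccharides.
The disclosed methods, compositions, and host cells are exemplified with Pichia pastoris cells but are also applicable to other host cells. In this application, the term "Pichia pastoris" is used interchangeably with the term "Komagataella phaffii".
Suitable host cells include, but are not limited to: yeast cells, bacterial cells, algal cells, plant cells, fungal cells, insect cells, and animal cells, including mammalian cells. In one illustrative embodiment, suitable host cells include Pichia pastoris.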
Suitable yeast host cells include, but are not limited to: Candida, Escherichia, Hansenula, Saccharomyces, Schizosaccharomyces, Pichia, Kluyveromyces, and Yarrowia. In some embodiments, the yeast cell is Escherichia coli, Hansenula polymorpha, Saccharomyces cerevisiae, Saccharomyces carlsbergensis, Saccharomyces diastaticus, Saccharomyces norbensis, Saccharomyces kluyveri, Schizosaccharomyces pombe, Pichia finlandica, Pichia trehalophila, Pichia kodamae, Pichia membranaefaciens, Pichia opuntiae, Pichia thermotolerans, Pichia salictaria, Pichia quercuum, Pichia pijperi, Pichia stipitis, Pichia methanolica, Pichia angusta, Kluyveromyces lactis, Candida albicans, or Yarrowia lipolytica.
In some embodiments, the yeast strain is an industrial polyploid yeast strain. Other non-limiting examples of fungal cells include cells obtained from Aspergillus, Penicillium, Fusarium, Rhizopus, Acremonium, Neurospora, Sordaria, Magnaporthe, Allomyces, Ustilago, Botrytis, and Trichoderma.
In certain embodiments, the host cell is an algal cell, such as Chlamydomonas (e.g., C. reinhardtii) or Phormidium (Phormidium sp. ATCC29409).
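In other embodiments, the host cell is a prokaryotic cell. Suitable prokaryotic cells include gram-positive, gram-negative, and gram-variable bacterial cells. The host cell may be, but is not limited to: Agrobacterium, Alicyclobacillus, Anabaena, Anacystis, Acinetobacter, Acidothermus, Arthrobacter, Azotobacter, Bacillus, Bifidobacterium, Brevibacterium, Butyrivibrio, Buchnera, Campestris, Campylobacter, Clostridium, Corynebacterium, Chromatium, Coprococcus, Escherichia, Enterococcus, Enterobacter, Erwinia, Fusobacterium, Faecalibacterium, Francisella, Flavobacterium, Geobacillus, Haemophilus, Helicobacter, Klebsiella, Lactobacillus, Lactococcus, Ilyobacter, Micrococcus, Microbacterium, Mesorhizobium, Methylobacterium, Mycobacterium, Neisseria, Pantoea, Pseudomonas, Prochlorococcus, Rhodobacter, Rhodopseudomonas, Roseburia, Rhodospirillum, Rhodococcus, Scenedesmus, Streptomyces, Streptococcus, Synecoccus, Saccharomonospora, Saccharopolyspora, Staphylococcus, Serratia, Salmonella, Shigella, Thermoanaerobacterium, Tropheryma, Tularensis, Temecula, Thermosynechococcus, Thermococcus, Ureaplasma, Xylella, Yersinia, and Zymomonas.
In some embodiments, the bacterial host strain is an industrial strain. Numerous bacterial industrial strains are known and are suitable for the methods and compositions described in this application.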
In some embodiments, the bacterial host cell is Agrobacterium (e.g., A. radiobacter, A. rhizogenes, A. rubi), Arthrobacter (e.g., A. aurescens, A. citreus, A. globiformis, A. hydrocarboglutamicus, A. mysorens, A. nicotianae, A. paraffineus, A. protophormiae, A. roseoparaffinus, A. sulfureus, A. ureafaciens), or Bacillus (e.g., B. thuringiensis, B. anthracis, B. megaterium, B. subtilis, B. lentus, B. circulans, B. pumilus, B. lautus, B. coagulans, B. brevis, B. firmus, B. alcalophilus, B. licheniformis, B. clausii, B. stearothermophilus, B. halodurans, and B. amyloliquefaciens). In particular embodiments, the host cell is an industrial Bacillus strain, including but not limited to B. subtilis, B. pumilus, B. licheniformis, B. megaterium, B. clausii, B. stearothermophilus, and B. amyloliquefaciens. In some embodiments, the host cell is an industrial Clostridium (e.g., C. acetobutylicum, C. tetani E88, C. lituseburense, C. saccharobutylicum, C. perfringens, C. beijerinckii). In some embodiments, the host cell is an industrial Corynebacterium (e.g., C. glutamicum, C. acetoacidophilum). In some embodiments, the host cell is an industrial Escherichia (e.g., E. coli). In some embodiments, the host cell is an industrial Erwinia (e.g., E. uredovora, E. carotovora, E. ananas, E. herbicola, E. punctata, E. terreus). In some embodiments, the host cell is an industrial Pantoea (e.g., P. citrea, P. agglomerans). In some embodiments, the host cell is an industrial Pseudomonas (e.g., P. putida, P. aeruginosa, P. mevalonii). In some embodiments, the host cell is an industrial Streptococcus (e.g., S. equisimilis, S. pyogenes, S. uberis). In some embodiments, the host cell is an industrial Streptomyces (e.g., S. ambofaciens, S. achromogenes, S. avermitilis, S. coelicolor, S. aureofaciens, S. aureus, S. fungicidicus, S. griseus, S. lividans). In some embodiments, the host cell is an industrial Zymomonas (e.g., Z. mobilis, Z. lipolytica).
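The present disclosure is also suitable for use with a variety of animal cell types, including mammalian cells, for example, human cell lines (including 293, HeLa, WI38, PER.C6, and Bowes melanoma cells), mouse cell lines (including 3T3, NS0, NS1, Sp2/0), hamster cell lines (CHO, BHK), monkey cell lines (COS, FRhL, Vero), and hybridoma cell lines.
In various embodiments, strains that may be used in the practice of the present disclosure, including both prokaryotic and eukaryotic strains, are readily accessible to the public from a number of culture collections, such as the American Type Culture Collection (ATCC), Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSM), Centraalbureau Voor Schimmelcultures (CBS), and the Agricultural Research Service Patent Culture Collection, Northern Regional Research Center (NRRL). The present disclosure is also suitable for use with a variety of plant cell types.
As used in this application, the term "cell" may refer to a single cell or to a population of cells, such as a population of cells belonging to the same cell line or strain. Use of the singular term "cell" should not be construed as referring explicitly to a single cell rather than a population of cells. A host cell may comprise genetic modifications relative to a wild-type counterpart.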
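A vector encoding any one or more of the recombinant polypeptides described in this application (e.g., 1-SST, 1-FFT, and/or 6-SFT) may be introduced into a suitable host cell using any method known in the art. Host cells may be cultured under any suitable conditions, as would be understood by one of ordinary skill in the art. For example, any medium, temperature, and incubation conditions known in the art may be used. For host cells carrying an inducible vector, the cells may be cultured with an appropriate inducing agent to promote expression.
Any of the cells disclosed in this application may be cultured in medium of any type (rich or minimal) and any composition before, during, and/or after contact with and/or integration of a nucleic acid. As would be understood by one of ordinary skill in the art, the conditions of the culture or culturing process may be optimized through routine experimentation. In some embodiments, the selected medium is supplemented with various components. In some embodiments, the concentration and amount of a supplemental component are optimized. In some embodiments, other aspects of the medium and growth conditions (e.g., pH, temperature, etc.) are optimized through routine experimentation. In some embodiments, the frequency with which the medium is supplemented with one or more supplemental components and the amount of time the cells are cultured are optimized.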
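Culturing of the cells described in this application may be performed in culture vessels known and used in the art. In some embodiments, an aerated reaction vessel (e.g., a stirred tank reactor) is used to culture the cells. In some embodiments, a bioreactor or fermenter is used to culture the cells. Thus, in some embodiments, the cells are used in fermentation. The terms "bioreactor" and "fermenter" are used interchangeably in this application and refer to an enclosure, or partial enclosure, in which a biological, biochemical, and/or chemical reaction takes place that involves a living organism or part of a living organism, including one or more secreted enzymes. A "large-scale bioreactor" or "industrial-scale bioreactor" is a bioreactor that is used to generate a product on a commercial or quasi-commercial scale. Large-scale bioreactors typically have volumes in the range of liters, hundreds of liters, thousands of liters, or more.
In some embodiments, the methods of the present disclosure for culturing one or more cells include overexpression of an enzyme described in this application. In some embodiments, the method of culturing one or more cells further includes isolating or purifying an enzyme expressed by the one or more cells (e.g., isolating the enzyme after it has been secreted by the cells).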
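Non-limiting examples of bioreactors include: stirred tank fermenters, bioreactors agitated by rotating mixing devices, chemostats, bioreactors agitated by shaking devices, airlift fermenters, packed-bed reactors, fixed-bed reactors, fluidized bed bioreactors, bioreactors employing wave-induced agitation, centrifugal bioreactors, roller bottles, and hollow fiber bioreactors, roller apparatuses (e.g., benchtop, cart-mounted, and/or automated varieties), vertically stacked plates, spinner flasks, stirring or rocking flasks, shaken multiwell plates, MD bottles, square bottles, Roux bottles, multiple-surface tissue culture propagators, modified fermenters, and coated beads (e.g., beads coated with serum proteins, nitrocellulose, or carboxymethyl cellulose to prevent cell attachment).
In some embodiments, the bioreactor includes a cell culture system in which the cells (e.g., bacterial cells) are in contact with moving liquids and/or gas bubbles. In some embodiments, the cells or cell culture are grown in suspension. In other embodiments, the cells or cell culture are attached to a solid phase carrier. Non-limiting examples of carrier systems include microcarriers (e.g., polymer spheres, microbeads, and microdisks that may be porous or non-porous), cross-linked beads (e.g., dextran) charged with specific chemical groups (e.g., tertiary amine groups), 2D microcarriers (including cells trapped in nonporous polymer fibers), 3D carriers (e.g., carrier fibers, hollow fibers, multicartridge reactors, and semi-permeable membranes that may comprise porous fibers), microcarriers having reduced ion exchange capacity, microencapsulated cells, capillaries, and aggregates. In some embodiments, carriers are fabricated from materials such as dextran, gelatin, glass, or cellulose.
In some embodiments, industrial-scale processes are operated in continuous, semi-continuous, or non-continuous modes. Non-limiting examples of operation modes are batch, fed batch, extended batch, repetitive batch, draw/fill, rotating-wall, spinning flask, and/or perfusion modes of operation. In some embodiments, a bioreactor allows continuous or semi-continuous replenishment of the substrate stock (e.g., a carbohydrate source) and/or continuous or semi-continuous separation of the product from the bioreactor.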
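In some embodiments, the bioreactor or fermenter includes a sensor and/or a control system to measure and/or adjust reaction parameters. Non-limiting examples of reaction parameters include biological parameters (e.g., growth rate, cell size, cell number, cell density, cell type, or cell state), chemical parameters (e.g., pH, redox potential, concentration of reaction substrate and/or product, concentration of dissolved gases such as oxygen and CO2, nutrient concentrations, metabolite concentrations, concentration of an oligopeptide, concentration of an amino acid, concentration of a vitamin, concentration of a hormone, concentration of an additive, serum concentration, ionic strength, concentration of an ion, relative humidity, molarity, osmolarity, and concentration of other chemicals such as buffering agents, adjuvants, or reaction byproducts), and physical/mechanical parameters (e.g., density, conductivity, degree of agitation, pressure, flow rate, shear stress, shear rate, viscosity, color, turbidity, light absorption, mixing rate, conversion rate, and thermodynamic parameters such as temperature and light intensity/quality). Sensors that measure the parameters described in this application are well known to one of ordinary skill in the relevant mechanical and electronic arts. Control systems that adjust the parameters in a bioreactor based on inputs from the sensors described in this application are well known to one of ordinary skill in the art of bioreactor engineering.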
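The sensor-and-controller arrangement described above can be illustrated with a very small feedback rule. The sketch below is a proportional controller for pH that doses base when the reading falls below a setpoint; the setpoint, gain, and maximum pump rate are hypothetical values chosen for illustration, not parameters disclosed in this application.

```python
def base_pump_rate(ph_reading: float,
                   ph_setpoint: float = 5.0,   # hypothetical target pH
                   gain: float = 2.0,          # hypothetical mL/min per pH unit
                   max_rate: float = 10.0) -> float:  # hypothetical pump limit, mL/min
    """Proportional control: dose base only when the culture is too acidic."""
    error = ph_setpoint - ph_reading
    rate = gain * error
    # Clamp to the physical range of the pump: never negative, never above max.
    return min(max(rate, 0.0), max_rate)


for reading in (4.5, 5.0, 5.5):
    print(reading, base_pump_rate(reading))
```

A production controller would normally add integral and derivative terms and a separate acid pump; the single proportional term is kept here only to show how a sensor reading maps to an actuator setting.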
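In some embodiments, the method involves batch fermentation (e.g., shake flask fermentation). General considerations for batch fermentation (e.g., shake flask fermentation) include the levels of oxygen and glucose. For example, batch fermentation (e.g., shake flask fermentation) may be limited by oxygen and glucose, so in some embodiments the capability of a strain to perform in a well-designed fed-batch fermentation is underestimated. In addition, the final product may differ somewhat from the substrate in terms of solubility, toxicity, cellular accumulation, and secretion, and in some embodiments may have different fermentation kinetics.
In some embodiments, the cells of the present disclosure are adapted to consume sucrose and produce fructan in vivo. In some embodiments, the cells are adapted to produce one or more enzymes (e.g., 1-SST, 1-FFT, and/or 6-SFT) for consuming sucrose via conversion to 1-kestose, 6-kestose, and/or inulin. In such embodiments, the enzymes may catalyze the reactions for consuming sucrose by bioconversion in an in vitro process.
In some embodiments, one or more cells of the present disclosure (e.g., one or more host cells) comprise one or more heterologous polynucleotides encoding a 1-SST enzyme, a 1-FFT enzyme, and/or a 6-SFT enzyme. In some embodiments, the host cell comprises one or more heterologous polynucleotides encoding a 1-SST enzyme and a 1-FFT enzyme. In some embodiments, the host cell comprises one or more heterologous polynucleotides encoding a 1-SST enzyme and a 6-SFT enzyme. In some embodiments, the host cell comprises one or more heterologous polynucleotides encoding a 1-FFT enzyme and a 6-SFT enzyme. In some embodiments, the host cell comprises one or more heterologous polynucleotides encoding a 1-SST enzyme, a 1-FFT enzyme, and a 6-SFT enzyme.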
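With respect to a polynucleotide, such as a polynucleotide comprising a gene, the term "heterologous" is used interchangeably with the terms "exogenous" and "recombinant" and refers to: a polynucleotide that has been artificially supplied to a biological system; a polynucleotide that has been modified within a biological system; or a polynucleotide whose expression or regulation has been manipulated within a biological system. A heterologous polynucleotide that is introduced into or expressed in a host cell may be a polynucleotide from a different organism or species than the host cell, may be a synthetic polynucleotide, or may be a polynucleotide that is also endogenously expressed in the same organism or species as the host cell. For example, a polynucleotide that is endogenously expressed in a host cell may be considered heterologous when it is situated non-naturally in the host cell; expressed recombinantly in the host cell, either stably or transiently; modified within the host cell; selectively edited within the host cell; expressed in a copy number that differs from the naturally occurring copy number within the host cell; or expressed in a non-natural way within the host cell, such as by manipulating regulatory regions that control expression of the polynucleotide. In some embodiments, a heterologous polynucleotide is a polynucleotide that is endogenously expressed in a host cell but whose expression is driven by a promoter that does not naturally regulate its expression. In other embodiments, a heterologous polynucleotide is a polynucleotide that is endogenously expressed in a host cell and whose expression is driven by a promoter that does naturally regulate its expression, but the promoter or another regulatory region is modified. In some embodiments, the promoter is recombinantly activated or repressed. For example, gene-editing-based techniques may be used to regulate expression of a polynucleotide, including an endogenous polynucleotide, from a promoter, including an endogenous promoter. See, e.g., Chavez et al., Nat Methods. 2016 Jul;13(7):563-567. A heterologous polynucleotide may comprise a wild-type sequence or a mutant sequence as compared with a reference polynucleotide sequence.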
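Methods
In some aspects, the present disclosure provides methods that include culturing a host cell described in this application (e.g., a host cell comprising a heterologous polynucleotide encoding at least one enzyme selected from the group consisting of 1-SST, 1-FFT, and 6-SFT). In some embodiments, the present disclosure provides methods of producing a fructan (e.g., inulin) from sucrose, the methods including culturing a host cell described in this application (e.g., a host cell comprising a heterologous polynucleotide encoding 1-SST, 1-FFT, and/or 6-SFT). In some embodiments, the production and the culturing occur in vivo. In some embodiments, production of one or more products occurs in vitro. In some embodiments, a method of producing a fructan using a host cell includes secretion of the expressed enzymes (e.g., 1-SST, 1-FFT, and/or 6-SFT) by the cell. Methods involving secreted enzymes may include contacting the secreted enzymes with sucrose in the culture medium or in the solution surrounding the host cell.
In some aspects, the present disclosure provides methods that use isolated or purified enzymes. Non-limiting methods for protein purification may be found, for example, in Janson, Protein Purification: Principles, High Resolution Methods, and Applications, Third Edition (2011). In some embodiments, the present disclosure provides methods that include contacting (or incubating) a saccharide with one or more enzymes described in this application to produce a fructan. In some embodiments, a method of producing a fructan includes contacting a saccharide (e.g., sucrose) with one or more of the following: a 1-SST enzyme; a 1-FFT enzyme; and a 6-SFT enzyme. In some embodiments, a method of producing a fructan includes contacting or incubating a saccharide (e.g., sucrose) with a 1-SST enzyme and a 1-FFT enzyme. In some embodiments, a method of producing a fructan includes contacting or incubating a saccharide (e.g., sucrose) with a 1-SST enzyme and a 6-SFT enzyme. In some embodiments, a method of producing a fructan includes contacting or incubating a saccharide (e.g., sucrose) with a 1-FFT enzyme and a 6-SFT enzyme. In some embodiments, a method of producing a fructan includes contacting or incubating a saccharide (e.g., sucrose) with a 1-SST enzyme, a 1-FFT enzyme, and a 6-SFT enzyme.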
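Production of a fructan may be carried out by a process in which all reactions take place in one reactor (such as a bioreactor), which may be referred to as a "one-pot bioconversion". In some embodiments, at least two enzymes are used in a single reactor. In some embodiments, at least three enzymes are used in a single reactor.
As a non-limiting example of a one-pot bioconversion, in some embodiments a single strain may be used to secrete multiple enzymes into a medium containing sucrose to produce a polyfructan. In other embodiments, multiple strains, each encoding one or more enzymes, may be combined in a single fermentation, in which each strain secretes its enzymes into the medium. The secreted enzymes may convert sucrose to branched inulin. Without being bound by a particular theory, the glucose and sucrose released from this process may be used to increase the biomass of the strains and to provide additional substrate for the formation of branched inulin. In some cases, a one-pot bioconversion includes incubating one or more purified enzymes with a substrate in a single reactor to produce a polyfructan.
In some cases, multiple reactors are used to produce a polyfructan. The use of more than one reactor may be referred to as a multi-pot bioconversion. In some cases, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10 reactors are used. As a non-limiting example, a multi-pot bioconversion may include incubating isolated 1-SST with sucrose to form kestose. The kestose produced may then be isolated and incubated with 1-FFT and 6-SFT to convert the kestose to branched inulin. The resulting sucrose and glucose may also be isolated and used for host cell biomass accumulation, for bioconversion, or for alternative processes. In some embodiments, a multi-pot bioconversion includes purifying a product of interest from one reactor and subsequently introducing the purified product of interest into a second reactor as a substrate.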
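In some cases, one or more enzymes selected from 1-SST, 1-FFT, and 6-SFT do not include a secretion signal. In some cases, one or more enzymes (e.g., two or more enzymes, or three or more enzymes) catalyze intracellular production of a fructan by fermentation. For example, a fructan may be produced inside a cell and subsequently secreted from the cell, isolated from the cell, or purified from the cell. In some cases, the secreted fructan is a substrate for a further reaction. In some cases, the secreted fructan is imported by a cell as a substrate for a further reaction. In some cases, a fructan is produced inside a cell and subsequently isolated or purified from the cell. The isolated or purified fructan may be used as a substrate for a further reaction.
In some aspects, the present disclosure provides methods of producing a fructan, the methods including first contacting sucrose with a 1-SST enzyme to produce kestose (e.g., 1-kestose), and subsequently contacting the kestose (e.g., 1-kestose) with a 1-FFT enzyme and/or a 6-SFT enzyme to produce the fructan. In some embodiments, such a two-step method includes the use of host cells (e.g., comprising 1-SST, 1-FFT, and/or 6-SFT) and/or the use of isolated enzymes (e.g., 1-SST, 1-FFT, and/or 6-SFT). In some embodiments, the kestose produced by contacting sucrose with the 1-SST enzyme is purified before being contacted with the 1-FFT enzyme and/or the 6-SFT enzyme.
A method of producing a fructan may include isolating or purifying the fructan away from the host cells and/or the enzymes according to any isolation or purification technique known in the art.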
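The present invention is further illustrated by the following Examples, which should not be construed as limiting. The entire contents of all references cited throughout this application (including literature references, issued patents, published patent applications, and co-pending patent applications) are hereby expressly incorporated by reference. If a reference incorporated into this application contains a term whose definition is inconsistent or incompatible with the definition of the same term in the present disclosure, the meaning ascribed to the term in the present disclosure shall govern. The mention of any reference, article, publication, patent, patent publication, or patent application cited in this application is not, and should not be taken as, an acknowledgment or any form of suggestion that it constitutes valid prior art or forms part of the common general knowledge in any country in the world or of a person skilled in the art.
Examples
In order that the invention described in this application may be more fully understood, the following examples are set forth. The examples described in this application are provided to illustrate the systems and methods provided in this application and are not to be construed as limiting their scope.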
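Example 1: Enzyme library design and screening
Enzyme discovery
Machine-learning-based bioinformatics tools were used to identify candidate enzymes in public sequence databases (SwissProt and TrEMBL, together referred to as UniProt) for each of the three desired enzymatic activities (1-SST, 1-FFT, and 6-SFT). A single library of 152 enzymes was tested for each of the activities.
Library synthesis
The DNA sequences of all 1-SST, 1-FFT, and 6-SFT enzymes were recoded for expression in Pichia pastoris. The coding sequences were synthesized in an inducible Pichia pastoris expression vector under the control of a T7 promoter.
Cell growth and enzyme preparation
The library plasmids were transformed into Pichia pastoris expression host cells. The enzymes were secreted into the medium, separated from the cells, and concentrated.
Enzyme screening
Bioconversion reactions involved incubating individual enzymes with sucrose or 1-kestose for 96 hours. The reactions were then stopped by boiling. Samples were subjected to high-performance liquid chromatography and analyzed with a refractive index detector (HPLC-RID).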
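As shown in FIG. 3A, reactions involving incubation of individual enzymes with sucrose gave product mixtures that could be quantified for the concentration of fructans comprising β(2,6) linkages and the concentration of fructans comprising β(2,1) linkages (corresponding to 1-kestose). Incubation with sucrose identified enzymes with 6-SFT or 1-SST activity. 1-SST enzymes produced high levels of a 3-sugar oligosaccharide that co-migrated with kestose on HPLC. Incubation with 1-SST did not produce longer sugar polymers. 6-SFT enzymes produced high levels of high-molecular-weight oligosaccharides comprising β(2,6) linkages. Some enzymes that showed minimal activity in polymerizing sucrose exhibited invertase activity and produced high levels of glucose and fructose.
As shown in FIG. 3B, reactions involving incubation of individual enzymes with 1-kestose gave product mixtures that could be quantified for the concentration of inulin comprising β(2,1) linkages (labeled "nystose") and the concentration of higher-order kestose molecules. Incubation with kestose identified enzymes with 1-FFT activity. Reactions were assayed for high levels of oligosaccharides containing 4 or more sugars, whose formation yields sucrose as a byproduct. Many enzymes generated these high-molecular-weight species. Another class of enzymes (kestosases) formed sucrose but did not show any activity in polymerizing high-molecular-weight oligosaccharides.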
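The polyfructans produced were quantified by calculating the area under the curve of the HPLC chromatograms. An example HPLC chromatogram of a bioconversion reaction (an individual enzyme incubated with sucrose) is shown in FIG. 4 (top panel). An HPLC chromatogram of a preparation of commercially available standards is also shown in FIG. 4 (bottom panel).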
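The area-under-the-curve calculation mentioned above can be sketched with the trapezoidal rule. This is a minimal illustration, not the integration routine actually used in these examples (which is not disclosed); it assumes a baseline-corrected detector signal and a peak window chosen by the analyst, and the function name and sample values are hypothetical.

```python
from typing import Sequence


def peak_area(times: Sequence[float], signal: Sequence[float],
              t_start: float, t_end: float) -> float:
    """Trapezoidal area of the signal between t_start and t_end (inclusive)."""
    if len(times) != len(signal):
        raise ValueError("times and signal must have the same length")
    area = 0.0
    for i in range(1, len(times)):
        t0, t1 = times[i - 1], times[i]
        if t0 < t_start or t1 > t_end:
            continue  # skip intervals that fall outside the peak window
        area += 0.5 * (signal[i - 1] + signal[i]) * (t1 - t0)
    return area


# Hypothetical chromatogram: retention time in minutes, RID response in arbitrary units.
times = [0.0, 1.0, 2.0, 3.0, 4.0]
signal = [0.0, 0.0, 2.0, 0.0, 0.0]
print(peak_area(times, signal, t_start=1.0, t_end=3.0))
```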
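Example 2: Characterization of high-performing enzymes
The best-performing enzymes were selected for further development. Individual enzymes that showed 6-SFT, 1-SST, or 1-FFT activity in Example 1 were re-expressed, isolated, and assayed for the ability to produce fructans. Enzyme preparations were incubated with sucrose or 1-kestose before the bioconversion reactions were analyzed by HPLC-RID and compared with saccharide standards. Peaks were identified by HPLC retention time, and conversion of sucrose to other saccharides was quantified by the relative peak areas from HPLC integration. The enzymes provided in Table 2 represent the most active enzyme in each of the three classes of enzymes (6-SFT, 1-SST, and 1-FFT). "High activity" refers to the highest activity among the proteins tested. All proteins were tested for functionality and rank-ordered according to their activity in polymerizing saccharides. SEQ ID NOs: 3-4 were modified to include a Pichia pastoris secretion signal, and the modified constructs (SEQ ID NO: 25 and SEQ ID NO: 27, respectively) were also identified as having 1-SST activity. SEQ ID NOs: 9-10 were also modified to include a Pichia pastoris secretion signal, and the modified constructs (SEQ ID NO: 32 and SEQ ID NO: 34, respectively) were identified as having 1-FFT activity. SEQ ID NOs: 15-21 were also modified to include a Pichia pastoris secretion signal, and the modified constructs (SEQ ID NO: 39, SEQ ID NO: 41, SEQ ID NO: 43, SEQ ID NO: 45, SEQ ID NO: 47, SEQ ID NO: 49, and SEQ ID NO: 51, respectively) were identified as having 6-SFT activity.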
Table 2. Best-performing enzymes
Enzyme    SEQ ID NO (amino acid)    SEQ ID NO (DNA)
1-SST     1                         5
1-FFT     7                         11
6-SFT     13                        22
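The relative-peak-area quantification used in Example 2 can be sketched as follows. The sketch assumes that every saccharide gives the same detector response per unit mass, which is a simplification, and the peak areas shown are hypothetical values rather than data from this application.

```python
from typing import Dict


def relative_areas(areas: Dict[str, float]) -> Dict[str, float]:
    """Express each peak area as a percentage of the total integrated area."""
    total = sum(areas.values())
    if total <= 0:
        raise ValueError("total peak area must be positive")
    return {name: 100.0 * area / total for name, area in areas.items()}


def sucrose_conversion(areas: Dict[str, float]) -> float:
    """Percent of the total area that is no longer sucrose."""
    return 100.0 - relative_areas(areas).get("sucrose", 0.0)


# Hypothetical integrated peak areas (arbitrary units).
example = {"sucrose": 25.0, "glucose": 25.0, "1-kestose": 50.0}
print(relative_areas(example))
print(sucrose_conversion(example))
```

Ranking enzymes by a value such as `sucrose_conversion` is one way the rank ordering described above could be carried out; the actual ranking metric is not specified in the text.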
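Example 3: Bioconversion of sucrose to branched inulin - "one-pot" bioconversion
Bioconversion of sucrose to branched inulin was carried out using the enzymes described in Table 2. As shown in FIG. 5, sucrose (a dimer of glucose and fructose) can be converted to 1-kestose (comprising a β(2,1) linkage) using a 1-SST enzyme. A 1-FFT enzyme then catalyzes the formation of linear inulin, which can itself react with a 6-SFT enzyme to give β(2,6)-branched inulin.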
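The pathway described above can be followed with simple sugar-unit bookkeeping. The sketch below tracks only how many glucose and fructose units each molecule contains; it ignores linkage positions, kinetics, and side reactions, and the reaction rules are a textbook simplification of 1-SST, 1-FFT, and 6-SFT rather than a model taken from this application.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Sugar:
    glc: int  # number of glucose units
    fru: int  # number of fructose units


SUCROSE = Sugar(glc=1, fru=1)
GLUCOSE = Sugar(glc=1, fru=0)


def sst(donor: Sugar, acceptor: Sugar) -> Tuple[Sugar, Sugar]:
    """1-SST: two sucrose molecules give one kestose and one glucose."""
    if donor != SUCROSE or acceptor != SUCROSE:
        raise ValueError("1-SST is modeled here as acting on two sucrose molecules")
    return Sugar(1, 2), GLUCOSE


def fft(donor: Sugar, acceptor: Sugar) -> Tuple[Sugar, Sugar]:
    """1-FFT: move one fructose unit from a donor fructan to an acceptor."""
    if donor.fru < 2:
        raise ValueError("donor must be at least a trisaccharide fructan")
    return Sugar(donor.glc, donor.fru - 1), Sugar(acceptor.glc, acceptor.fru + 1)


def sft(donor: Sugar, acceptor: Sugar) -> Tuple[Sugar, Sugar]:
    """6-SFT: move the fructose of sucrose onto a fructan, releasing glucose."""
    if donor != SUCROSE:
        raise ValueError("6-SFT is modeled here with sucrose as the donor")
    return Sugar(acceptor.glc, acceptor.fru + 1), GLUCOSE


kestose, glucose = sst(SUCROSE, SUCROSE)
shortened, elongated = fft(kestose, kestose)
branched, glucose2 = sft(SUCROSE, elongated)
print(kestose, shortened, elongated, branched)
```

Each 1-SST and 6-SFT step releases one glucose, which is consistent with the increase in glucose reported for the one-pot reaction in this example.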
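The three enzymes (1-SST, 1-FFT, and 6-SFT) were mixed in a single reaction and incubated with sucrose for 96 hours. After 96 hours, the reaction was stopped by boiling. Bioconversion to branched inulin was assayed by HPLC-RID and gas chromatography/mass spectrometry (GC/MS). Saccharides were identified based on HPLC elution time. As shown in FIG. 6A, higher-molecular-weight saccharides (n=3 to n=6) were identified as HPLC peaks eluting before sucrose. The one-pot conversion reaction showed an increase in glucose formation and the formation of early-eluting high-molecular-weight material, consistent with the hypothesis that this peak represents branched inulin. Comparison of this material with standards indicated that it consists of material with a degree of polymerization greater than 3 (DP3). Glucose does not co-elute with inulin (branched or otherwise). HPLC assay of the reaction showed high release of glucose as a late-eluting peak in samples in which branched inulin was being produced as an early-eluting peak (see, e.g., FIG. 6A).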
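GC/MS was then used to identify the presence of both β(2,1) and β(2,6) linkages in the bioconversion product mixture. Derivatization prior to GC/MS analysis was carried out using a 4-step process consisting of: 1) methylation of free alcohol -OH groups; 2) hydrolysis of the sugar linkages; 3) reduction of keto and aldehyde groups; and 4) acylation of the alcohol -OH groups formed during step 3. Following this protocol, samples were analyzed by GC/MS, which showed a series of products with a recognized elution order and characteristic fragmentation patterns (FIGS. 6C-6D). GC/MS of the bioconversion sample gave a signature indicative of β(2,6)-branched inulin. The bioconversion sample included a peak at 28.71 minutes, which is characteristic of a known branched sugar ("best benchmark"). Notably, this characteristic peak was not found in GC/MS analysis of linear saccharides (chicory; Nicie).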
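Example 4: Bioconversion of sucrose to branched inulin - "two-pot" bioconversion
Isolated 1-SST enzyme is incubated with sucrose to form kestose. The kestose is isolated and then incubated with a 1-FFT enzyme and a 6-SFT enzyme, which convert the kestose to branched inulin.
The resulting sucrose and glucose may be isolated and used for host cell biomass accumulation, as material for bioconversion, or in alternative processes.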
Sequences
Non-limiting examples of 1-SST sequences
[Sequence image omitted] (SEQ ID NO: 1; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 2; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 3; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 25; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 4; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 27; secretion signal underlined)
[Sequence images omitted; no caption in the source]
1-SST from Festuca arundinacea:
[Sequence image omitted]
Non-limiting examples of 1-FFT sequences
[Sequence image omitted] (SEQ ID NO: 7; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 8; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 9; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 32; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 10; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 34; secretion signal underlined)
[Sequence images omitted; no caption in the source]
1-FFT from Echinops ritro:
[Sequence image omitted]
Non-limiting examples of 6-SFT sequences
[Sequence image omitted] (SEQ ID NO: 13; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 14; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 15; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 39; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 16; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 41; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 17; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 43; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 45; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 19; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 47; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 20; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 21; secretion signal underlined)
[Sequence image omitted] (SEQ ID NO: 51; secretion signal underlined)
[Sequence images omitted; no caption in the source]
6-SFT from Phleum pratense:
[Sequence image omitted]
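Equivalents
Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described in this application. Such equivalents are intended to be encompassed by the following claims.
All references disclosed in this application, including patent documents, are incorporated herein by reference in their entirety, particularly for the disclosure referenced in this application.
It should be understood that the sequences disclosed in this application may or may not contain a secretion signal. The sequences disclosed in this application encompass versions with or without secretion signals. It should also be understood that the protein sequences disclosed in this application may be depicted with or without a start codon (M). The sequences disclosed in this application encompass versions with or without start codons. Accordingly, in some instances amino acid numbering may correspond to protein sequences containing a start codon, while in other instances amino acid numbering may correspond to protein sequences that do not contain a start codon. It should also be understood that the sequences disclosed in this application may be depicted with or without a stop codon. The sequences disclosed in this application encompass versions with or without stop codons. Aspects of the present disclosure encompass host cells comprising any of the sequences described in this application and fragments thereof.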
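Because amino acid numbering may refer to a sequence with or without the initial methionine, converting a residue position between the two conventions is a one-position shift. The helper below is a minimal sketch of that bookkeeping; the function name and arguments are illustrative and are not defined in this application.

```python
def convert_position(position: int, from_has_met: bool, to_has_met: bool) -> int:
    """Convert a 1-based residue position between numbering with and without the start Met."""
    if position < 1:
        raise ValueError("positions are 1-based")
    # Adding the start Met shifts every residue up by one; removing it shifts down by one.
    offset = int(to_has_met) - int(from_has_met)
    new_position = position + offset
    if new_position < 1:
        raise ValueError("the start Met itself has no position in a sequence without it")
    return new_position


print(convert_position(10, from_has_met=True, to_has_met=False))
print(convert_position(10, from_has_met=False, to_has_met=True))
```

The same kind of offset applies when a secretion signal is present or absent, with the signal length in place of one.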
Sequence Listing
<110> Ginkgo Bioworks, Inc.
<120> Production of oligosaccharides
<130> G0919.70034WO00
<140> Not yet assigned
<141> Concurrently herewith
<150> US 62/905,246
<151> 2019-09-24
<160> 63
<170> PatentIn version 3.5
<210> 1
<211> 651
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 1
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Asn Leu Met Arg Leu Arg Glu
85 90 95
Asn Asp Tyr Pro Trp Thr Asn Asp Met Leu Arg Trp Gln Arg Thr Gly
100 105 110
Phe His Phe Gln Pro Gly Lys Asn Phe Gln Ala Asp Pro Asn Ala Ala
115 120 125
Met Phe Tyr Lys Gly Trp Tyr His Phe Phe Tyr Gln Tyr Asn Pro Thr
130 135 140
Gly Val Ala Trp Asp Tyr Thr Ile Ser Trp Gly His Ala Val Ser Lys
145 150 155 160
Asp Leu Leu His Trp Asn Tyr Leu Pro Met Ala Leu Arg Pro Asp His
165 170 175
Trp Tyr Asp Arg Lys Gly Val Trp Ser Gly Tyr Ser Thr Leu Leu Pro
180 185 190
Asp Gly Arg Ile Val Val Leu Tyr Thr Gly Gly Thr Lys Glu Leu Val
195 200 205
Gln Val Gln Asn Leu Ala Val Pro Val Asn Leu Ser Asp Pro Leu Leu
210 215 220
Leu Glu Trp Lys Lys Ser His Val Asn Pro Ile Leu Val Pro Pro Pro
225 230 235 240
Gly Ile Glu Asp His Asp Phe Arg Asp Pro Phe Pro Val Trp Tyr Asn
245 250 255
Glu Ser Asp Ser Arg Trp His Val Val Ile Gly Ser Lys Asp Pro Glu
260 265 270
His Tyr Gly Ile Val Leu Ile Tyr Thr Thr Lys Asp Phe Val Asn Phe
275 280 285
Thr Leu Leu Pro Asn Ile Leu His Ser Thr Lys Gln Pro Val Gly Met
290 295 300
Leu Glu Cys Val Asp Leu Phe Pro Val Ala Thr Thr Asp Ser Arg Ala
305 310 315 320
Asn Gln Ala Leu Asp Met Thr Thr Met Arg Pro Gly Pro Gly Leu Lys
325 330 335
Tyr Val Leu Lys Ala Ser Met Asp Asp Glu Arg His Asp Tyr Tyr Ala
340 345 350
Leu Gly Ser Phe Asp Leu Asp Ser Phe Thr Phe Thr Pro Asp Asp Glu
355 360 365
Thr Ile Asp Val Gly Ile Gly Leu Arg Tyr Asp Trp Gly Lys Phe Tyr
370 375 380
Ala Ser Lys Thr Phe Tyr Asp Gln Glu Lys Gln Arg Arg Val Leu Trp
385 390 395 400
Gly Tyr Val Gly Glu Val Asp Ser Lys Arg Asp Asp Ala Leu Lys Gly
405 410 415
Trp Ala Ser Leu Gln Asn Ile Pro Arg Thr Ile Leu Phe Asp Thr Lys
420 425 430
Thr Lys Ser Asn Leu Ile Leu Trp Pro Val Glu Glu Val Glu Ser Leu
435 440 445
Arg Thr Ile Asn Lys Asn Phe Asn Ser Ile Pro Leu Tyr Pro Gly Ser
450 455 460
Thr Tyr Gln Leu Asp Val Gly Glu Ala Thr Gln Leu Asp Ile Val Ala
465 470 475 480
Glu Phe Glu Val Asp Glu Lys Ala Ile Glu Ala Thr Ala Glu Ala Asp
485 490 495
Val Thr Tyr Asn Cys Ser Thr Ser Gly Gly Ala Ala Asn Arg Gly Val
500 505 510
Leu Gly Pro Phe Gly Leu Leu Val Leu Ala Asn Gln Glu Leu Ser Glu
515 520 525
Gln Thr Ala Thr Tyr Phe Tyr Val Ser Arg Gly Ile Asp Gly Asn Leu
530 535 540
Arg Thr His Phe Cys Gln Asp Glu Leu Arg Ser Ser Lys Ala Gly Ala
545 550 555 560
Ile Thr Lys Arg Val Val Gly Ser Thr Val Pro Val Leu His Gly Glu
565 570 575
Thr Trp Ala Leu Arg Ile Leu Val Asp His Ser Ile Val Glu Ser Phe
580 585 590
Ala Gln Arg Gly Arg Ala Val Ala Thr Ser Arg Val Tyr Pro Thr Glu
595 600 605
Ala Ile Tyr Ser Ser Ala Arg Val Phe Leu Phe Asn Asn Ala Thr Asp
610 615 620
Ala Ile Val Thr Ala Lys Thr Val Asn Val Trp His Ile Asn Ser Thr
625 630 635 640
Tyr Asn His Val Phe Pro Gly Leu Val Ala Pro
645 650
<210> 2
<211> 621
<212> PRT
<213> Agave tequilana
<400> 2
Met Ala Ser Ser Thr Lys Asp Val Glu Ala Pro Pro Thr Leu Asp Ala
1 5 10 15
Pro Leu Leu Gly Pro Ala Ala Pro Arg Ser Arg Leu Arg Val Ala Pro
20 25 30
Val Ser Leu Ser Val Met Ala Phe Leu Leu Val Ala Ile Ala Ala Ala
35 40 45
Val Leu Tyr Tyr Asn Pro Gly Gly Val Ala Ser Asn Leu Met Arg Leu
50 55 60
Arg Glu Asn Asp Tyr Pro Trp Thr Asn Asp Met Leu Arg Trp Gln Arg
65 70 75 80
Thr Gly Phe His Phe Gln Pro Gly Lys Asn Phe Gln Ala Asp Pro Asn
85 90 95
Ala Ala Met Phe Tyr Lys Gly Trp Tyr His Phe Phe Tyr Gln Tyr Asn
100 105 110
Pro Thr Gly Val Ala Trp Asp Tyr Thr Ile Ser Trp Gly His Ala Val
115 120 125
Ser Lys Asp Leu Leu His Trp Asn Tyr Leu Pro Met Ala Leu Arg Pro
130 135 140
Asp His Trp Tyr Asp Arg Lys Gly Val Trp Ser Gly Tyr Ser Thr Leu
145 150 155 160
Leu Pro Asp Gly Arg Ile Val Val Leu Tyr Thr Gly Gly Thr Lys Glu
165 170 175
Leu Val Gln Val Gln Asn Leu Ala Val Pro Val Asn Leu Ser Asp Pro
180 185 190
Leu Leu Leu Glu Trp Lys Lys Ser His Val Asn Pro Ile Leu Val Pro
195 200 205
Pro Pro Gly Ile Glu Asp His Asp Phe Arg Asp Pro Phe Pro Val Trp
210 215 220
Tyr Asn Glu Ser Asp Ser Arg Trp His Val Val Ile Gly Ser Lys Asp
225 230 235 240
Pro Glu His Tyr Gly Ile Val Leu Ile Tyr Thr Thr Lys Asp Phe Val
245 250 255
Asn Phe Thr Leu Leu Pro Asn Ile Leu His Ser Thr Lys Gln Pro Val
260 265 270
Gly Met Leu Glu Cys Val Asp Leu Phe Pro Val Ala Thr Thr Asp Ser
275 280 285
Arg Ala Asn Gln Ala Leu Asp Met Thr Thr Met Arg Pro Gly Pro Gly
290 295 300
Leu Lys Tyr Val Leu Lys Ala Ser Met Asp Asp Glu Arg His Asp Tyr
305 310 315 320
Tyr Ala Leu Gly Ser Phe Asp Leu Asp Ser Phe Thr Phe Thr Pro Asp
325 330 335
Asp Glu Thr Ile Asp Val Gly Ile Gly Leu Arg Tyr Asp Trp Gly Lys
340 345 350
Phe Tyr Ala Ser Lys Thr Phe Tyr Asp Gln Glu Lys Gln Arg Arg Val
355 360 365
Leu Trp Gly Tyr Val Gly Glu Val Asp Ser Lys Arg Asp Asp Ala Leu
370 375 380
Lys Gly Trp Ala Ser Leu Gln Asn Ile Pro Arg Thr Ile Leu Phe Asp
385 390 395 400
Thr Lys Thr Lys Ser Asn Leu Ile Leu Trp Pro Val Glu Glu Val Glu
405 410 415
Ser Leu Arg Thr Ile Asn Lys Asn Phe Asn Ser Ile Pro Leu Tyr Pro
420 425 430
Gly Ser Thr Tyr Gln Leu Asp Val Gly Glu Ala Thr Gln Leu Asp Ile
435 440 445
Val Ala Glu Phe Glu Val Asp Glu Lys Ala Ile Glu Ala Thr Ala Glu
450 455 460
Ala Asp Val Thr Tyr Asn Cys Ser Thr Ser Gly Gly Ala Ala Asn Arg
465 470 475 480
Gly Val Leu Gly Pro Phe Gly Leu Leu Val Leu Ala Asn Gln Glu Leu
485 490 495
Ser Glu Gln Thr Ala Thr Tyr Phe Tyr Val Ser Arg Gly Ile Asp Gly
500 505 510
Asn Leu Arg Thr His Phe Cys Gln Asp Glu Leu Arg Ser Ser Lys Ala
515 520 525
Gly Ala Ile Thr Lys Arg Val Val Gly Ser Thr Val Pro Val Leu His
530 535 540
Gly Glu Thr Trp Ala Leu Arg Ile Leu Val Asp His Ser Ile Val Glu
545 550 555 560
Ser Phe Ala Gln Arg Gly Arg Ala Val Ala Thr Ser Arg Val Tyr Pro
565 570 575
Thr Glu Ala Ile Tyr Ser Ser Ala Arg Val Phe Leu Phe Asn Asn Ala
580 585 590
Thr Asp Ala Ile Val Thr Ala Lys Thr Val Asn Val Trp His Ile Asn
595 600 605
Ser Thr Tyr Asn His Val Phe Pro Gly Leu Val Ala Pro
610 615 620
<210> 3
<211> 591
<212> PRT
<213> Arabidopsis thaliana
<400> 3
Met Ala Lys Leu Asn Arg Ser Asn Ile Gly Leu Ser Leu Leu Leu Ser
1 5 10 15
Met Phe Leu Ala Asn Phe Ile Thr Asp Leu Glu Ala Ser Ser His Gln
20 25 30
Asp Leu Asn Gln Pro Tyr Arg Thr Gly Tyr His Phe Gln Pro Leu Lys
35 40 45
Asn Trp Met Asn Gly Pro Met Ile Tyr Lys Gly Ile Tyr His Leu Phe
50 55 60
Tyr Gln Tyr Asn Pro Tyr Gly Ala Val Trp Asp Val Arg Ile Val Trp
65 70 75 80
Gly His Ser Thr Ser Val Asp Leu Val Asn Trp Ile Ser Gln Pro Pro
85 90 95
Ala Phe Asn Pro Ser Gln Pro Ser Asp Ile Asn Gly Cys Trp Ser Gly
100 105 110
Ser Val Thr Ile Leu Pro Asn Gly Lys Pro Val Ile Leu Tyr Thr Gly
115 120 125
Ile Asp Gln Asn Lys Gly Gln Val Gln Asn Val Ala Val Pro Val Asn
130 135 140
Ile Ser Asp Pro Tyr Leu Arg Glu Trp Ser Lys Pro Pro Gln Asn Pro
145 150 155 160
Leu Met Thr Thr Asn Ala Val Asn Gly Ile Asn Pro Asp Arg Phe Arg
165 170 175
Asp Pro Thr Thr Ala Trp Leu Gly Arg Asp Gly Glu Trp Arg Val Ile
180 185 190
Val Gly Ser Ser Thr Asp Asp Arg Arg Gly Leu Ala Ile Leu Tyr Lys
195 200 205
Ser Arg Asp Phe Phe Asn Trp Thr Gln Ser Met Lys Pro Leu His Tyr
210 215 220
Glu Asp Leu Thr Gly Met Trp Glu Cys Pro Asp Phe Phe Pro Val Ser
225 230 235 240
Ile Thr Gly Ser Asp Gly Val Glu Thr Ser Ser Val Gly Glu Asn Gly
245 250 255
Ile Lys His Val Leu Lys Val Ser Leu Ile Glu Thr Leu His Asp Tyr
260 265 270
Tyr Thr Ile Gly Ser Tyr Asp Arg Glu Lys Asp Val Tyr Val Pro Asp
275 280 285
Leu Gly Phe Val Gln Asn Glu Ser Ala Pro Arg Leu Asp Tyr Gly Lys
290 295 300
Tyr Tyr Ala Ser Lys Thr Phe Tyr Asp Asp Val Lys Lys Arg Arg Ile
305 310 315 320
Leu Trp Gly Trp Val Asn Glu Ser Ser Pro Ala Lys Asp Asp Ile Glu
325 330 335
Lys Gly Trp Ser Gly Leu Gln Ser Phe Pro Arg Lys Ile Trp Leu Asp
340 345 350
Glu Ser Gly Lys Glu Leu Leu Gln Trp Pro Ile Glu Glu Ile Glu Thr
355 360 365
Leu Arg Gly Gln Gln Val Asn Trp Gln Lys Lys Val Leu Lys Ala Gly
370 375 380
Ser Thr Leu Gln Val His Gly Val Thr Ala Ala Gln Ala Asp Val Glu
385 390 395 400
Val Ser Phe Lys Val Lys Glu Leu Glu Lys Ala Asp Val Ile Glu Pro
405 410 415
Ser Trp Thr Asp Pro Gln Lys Ile Cys Ser Gln Gly Asp Leu Ser Val
420 425 430
Met Ser Gly Leu Gly Pro Phe Gly Leu Met Val Leu Ala Ser Asn Asp
435 440 445
Met Glu Glu Tyr Thr Ser Val Tyr Phe Arg Ile Phe Lys Ser Asn Asp
450 455 460
Asp Thr Asn Lys Lys Thr Lys Tyr Val Val Leu Met Cys Ser Asp Gln
465 470 475 480
Ser Arg Ser Ser Leu Asn Asp Glu Asn Asp Lys Ser Thr Phe Gly Ala
485 490 495
Phe Val Ala Ile Asp Pro Ser His Gln Thr Ile Ser Leu Arg Thr Leu
500 505 510
Ile Asp His Ser Ile Val Glu Ser Tyr Gly Gly Gly Gly Arg Thr Cys
515 520 525
Ile Thr Ser Arg Val Tyr Pro Lys Leu Ala Ile Gly Glu Asn Ala Asn
530 535 540
Leu Phe Val Phe Asn Lys Gly Thr Gln Ser Val Asp Ile Leu Thr Leu
545 550 555 560
Ser Ala Trp Ser Leu Lys Ser Ala Gln Ile Asn Gly Asp Leu Met Ser
565 570 575
Pro Phe Ile Glu Arg Glu Glu Ser Arg Ser Pro Asn His Gln Phe
580 585 590
<210> 4
<211> 628
<212> PRT
<213> Asparagus officinalis
<400> 4
Met Ala Ser Pro Ser Asp Leu Glu Ser Pro Pro Thr Leu Ser Ala Gln
1 5 10 15
Leu Leu Glu Ser Arg Pro Pro Arg Ser Lys Leu Arg Leu Val Ala Leu
20 25 30
Thr Leu Thr Ala Ala Ala Phe Leu Val Ala Leu Ala Leu Phe Leu Ala
35 40 45
Asp Gly Ser Ala Ser Arg Phe Val Ser Gly Leu Ala Arg Lys Leu Arg
50 55 60
Ser Asp Pro Ile Lys Glu His Asp Tyr Pro Trp Thr Asn Glu Met Leu
65 70 75 80
Thr Trp Gln Arg Ser Gly Phe His Phe Gln Pro Ala Lys Asn Phe Gln
85 90 95
Ser Asp Pro Asn Ala Ala Met Tyr Tyr Lys Gly Trp Tyr His Phe Phe
100 105 110
Tyr Gln Tyr Asn Pro Thr Gly Thr Ala Trp Asp Tyr Thr Ile Ser Trp
115 120 125
Gly His Ala Val Ser Arg Asp Leu Ile His Trp Leu His Leu Pro Met
130 135 140
Ala Met Val Pro Asp His Trp Tyr Asp Ala Lys Gly Val Trp Ser Gly
145 150 155 160
Tyr Ser Thr Leu Leu Pro Asp Gly Arg Val Ile Val Leu Tyr Thr Gly
165 170 175
Gly Thr Pro Glu Leu Val Gln Val Gln Asn Leu Ala Val Pro Ala Asp
180 185 190
Ala Ser Asp Pro Leu Leu Leu Lys Trp Lys Lys Ser Ser Val Asn Pro
195 200 205
Ile Leu Val Pro Pro Pro Gly Ile Gly Thr Ser Asp Phe Arg Asp Pro
210 215 220
Phe Pro Ile Trp Tyr Asn Glu Thr Asp Ser Asn Trp His Val Leu Ile
225 230 235 240
Gly Ser Lys Asp Ser Asn His His Gly Ile Val Leu Leu Tyr Lys Thr
245 250 255
Lys Asp Phe Phe Asn Phe Thr Leu Leu Pro Ser Leu Leu His Thr Ser
260 265 270
Thr Gln Ser Val Gly Met Phe Glu Cys Val Asp Leu Tyr Pro Val Ala
275 280 285
Thr Gly Gly Pro Leu Ser Asn Arg Gly Leu Glu Met Ser Val Asp Leu
290 295 300
Ser Asn Gly Gly Ile Lys His Val Leu Lys Ala Ser Met Asp Glu Glu
305 310 315 320
Arg His Asp Tyr Tyr Ala Ile Gly Thr Phe Asp Leu Asp Ser Phe Lys
325 330 335
Trp Thr Pro Asp Asp Pro Ser Ile Asp Val Gly Val Gly Leu Arg Tyr
340 345 350
Asp Trp Gly Lys Phe Tyr Ala Ser Lys Thr Phe Phe Asp Thr Glu Lys
355 360 365
Gln Arg Arg Ile Leu Trp Gly Tyr Val Gly Glu Val Asp Ser Lys Asp
370 375 380
Asp Asp Lys Met Lys Gly Trp Ala Thr Leu Gln Asn Ile Pro Arg Thr
385 390 395 400
Ile Leu Leu Asp Thr Lys Thr Gln Ser Asn Leu Ile Ile Trp Pro Val
405 410 415
Glu Glu Val Glu Asp Leu Arg Thr Asp Gly Asn Ile Phe Asn Asp Ile
420 425 430
Lys Ile Gly Ala Gly Ser Ser Val Gln Leu Asp Ile Gly Ala Ala Ser
435 440 445
Gln Leu Asp Ile Glu Ala Glu Phe Glu Leu Asp Asn Ser Ala Leu Asp
450 455 460
Gly Ala Ile Glu Ala Asp Val Thr Tyr Asn Cys Ser Thr Ser Gly Gly
465 470 475 480
Ala Ala Asn Arg Gly Leu Leu Gly Pro Phe Gly Leu Leu Val Leu Ala
485 490 495
Asn Gln Asp Leu Thr Glu Gln Thr Ala Thr Tyr Phe Tyr Val Ser Arg
500 505 510
Gly Thr Asp Gly Asp Leu Arg Thr His Phe Cys Gln Asp Glu Leu Arg
515 520 525
Ser Ser Lys Ala Gly Asp Ile Val Lys Arg Val Val Gly Ser Val Val
530 535 540
Pro Val Leu His Gly Glu Thr Trp Ser Leu Arg Ile Leu Val Asp His
545 550 555 560
Ser Ile Ile Glu Ser Phe Ala Gln Arg Gly Arg Ala Val Ala Thr Ser
565 570 575
Arg Val Tyr Pro Thr Glu Ala Ile Tyr Asn Lys Ala Arg Leu Phe Leu
580 585 590
Phe Asn Asn Ala Thr Asp Ala Lys Val Thr Ala Lys Ser Val Lys Ile
595 600 605
Trp His Met Asn Ser Thr His Asn His Pro Phe Pro Gly Leu Glu Ser
610 615 620
Leu Phe Glu Ser
625
<210> 5
<211> 1953
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 5
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120
tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180
aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240
tctctcgaga aaagagaggc tgaagctaac ttgatgcgtt taagagagaa tgattatccc 300
tggactaacg acatgctaag atggcaacgc acgggatttc acttccagcc tggtaaaaac 360
ttccaagccg acccaaatgc agctatgttt tacaagggct ggtaccattt cttttatcaa 420
tacaacccga ccggtgtggc ttgggattac acaatctcct ggggtcacgc tgtcagtaag 480
gatttgctgc attggaatta tcttccaatg gccttgaggc ctgaccactg gtacgataga 540
aaaggtgttt ggagcggtta ctctacttta ttgccagacg gtagaattgt tgtcttgtac 600
accggtggaa ctaaggaatt agttcaagtc caaaacttgg ctgtcccagt aaacctttct 660
gacccattgc tattggaatg gaagaagtca cacgttaacc caatactcgt tccacctccg 720
gggatcgaag atcatgattt ccgagatcca ttcccagtgt ggtataatga atctgactcg 780
cggtggcacg ttgtaattgg ttccaaagat ccagagcact atggtattgt cttgatctac 840
actaccaagg acttcgttaa ctttacgtta ttaccaaaca tattgcattc caccaagcag 900
ccggttggta tgctggaatg tgtagacttg ttcccagttg ctacaactga ttctcgtgca 960
aatcaagctt tggatatgac taccatgagg cccggtcctg ggctcaaata tgtgttaaag 1020
gcgagtatgg atgacgaaag acacgattac tacgccctag gtagctttga cttggactcg 1080
ttcactttta caccagatga tgaaaccatt gacgtcggta ttggtcttag atacgactgg 1140
ggcaagttct acgcgtccaa gactttttac gaccaagaaa aacaaagaag agttttgtgg 1200
ggatacgtcg gtgaagttga ctcgaagcgt gatgatgctc tgaaaggttg ggcttctttg 1260
caaaatatcc cacgtacaat cttgttcgac accaaaacca agtccaacct aattttgtgg 1320
ccagttgaag aagtcgagtc tttaagaact attaacaaga atttcaattc aatccctttg 1380
tatcctggtt ctacttacca gcttgatgtg ggtgaagcta cccaattgga tattgtggcc 1440
gagttcgaag tcgatgaaaa ggctattgaa gctactgccg aagctgatgt tacatataac 1500
tgctccacct ccggtggtgc agctaataga ggggttttgg gtccattcgg tttgttagtt 1560
ttagctaacc aagagttgtc tgaacaaact gctacttact tctatgtctc tcgcggcata 1620
gatggtaact taagaacaca tttttgtcaa gacgaactgc gatcttccaa ggctggtgcc 1680
atcactaagc gggtagttgg ttctaccgtc ccagttctac atggcgaaac ctgggccttg 1740
agaattttgg tcgatcactc aatcgtagag tcttttgcac agagaggtag agctgttgcc 1800
acgagtagag tctatcctac agaagcaatt tatagctcag ctagagtctt tctattcaac 1860
aatgccactg acgctattgt taccgctaag acagtaaacg tttggcacat caactccacc 1920
tacaatcatg tttttccggg tctggtcgct cca 1953
<210> 6
<211> 654
<212> PRT
<213> Festuca arundinacea
<400> 6
Met Glu Ser Ser Ala Val Val Pro Gly Thr Thr Ala Pro Leu Leu Pro
1 5 10 15
Tyr Ala Tyr Ala Pro Leu Pro Ser Ser Ala Asp Asp Ala Arg Glu Asn
20 25 30
Gln Ser Ser Gly Gly Val Arg Trp Arg Val Cys Ala Ala Val Leu Ala
35 40 45
Ala Ser Ala Leu Ala Val Leu Ile Val Val Gly Leu Leu Ala Gly Gly
50 55 60
Arg Val Asp Arg Gly Pro Ala Gly Gly Asp Val Ala Ser Ala Ala Val
65 70 75 80
Pro Ala Val Pro Met Glu Ile Pro Arg Ser Arg Gly Lys Asp Phe Gly
85 90 95
Val Ser Glu Lys Ala Ser Gly Ala Tyr Ser Ala Asp Gly Gly Phe Pro
100 105 110
Trp Ser Asn Ala Met Leu Gln Trp Gln Arg Thr Gly Phe His Phe Gln
115 120 125
Pro Glu Lys His Tyr Met Asn Asp Pro Asn Gly Pro Val Tyr Tyr Gly
130 135 140
Gly Trp Tyr His Leu Phe Tyr Gln Tyr Asn Pro Lys Gly Asp Ser Trp
145 150 155 160
Gly Asn Ile Ala Trp Ala His Ala Val Ser Lys Asp Met Val Asn Trp
165 170 175
Arg His Leu Pro Leu Ala Met Val Pro Asp Gln Trp Tyr Asp Ser Asn
180 185 190
Gly Val Leu Thr Gly Ser Ile Thr Val Leu Pro Asp Gly Gln Val Ile
195 200 205
Leu Leu Tyr Thr Gly Asn Thr Asp Thr Leu Ala Gln Val Gln Cys Leu
210 215 220
Ala Thr Pro Ala Asp Pro Ser Asp Pro Leu Leu Arg Glu Trp Ile Lys
225 230 235 240
His Pro Ala Asn Pro Ile Leu Tyr Pro Pro Pro Gly Ile Gly Leu Lys
245 250 255
Asp Phe Arg Asp Pro Leu Thr Ala Trp Phe Asp His Ser Asp Asn Thr
260 265 270
Trp Arg Thr Val Ile Gly Ser Lys Asp Asp Asp Gly His Ala Gly Ile
275 280 285
Ile Leu Ser Tyr Lys Thr Lys Asp Phe Val Asn Tyr Glu Leu Met Pro
290 295 300
Gly Asn Met His Arg Gly Pro Asp Gly Thr Gly Met Tyr Glu Cys Ile
305 310 315 320
Asp Leu Tyr Pro Val Gly Gly Asn Ser Ser Glu Met Leu Gly Gly Asp
325 330 335
Asp Ser Pro Asp Val Leu Phe Val Leu Lys Glu Ser Ser Asp Asp Glu
340 345 350
Arg His Asp Tyr Tyr Ala Leu Gly Arg Phe Asp Ala Ala Ala Asn Ile
355 360 365
Trp Thr Pro Ile Asp Gln Glu Leu Asp Leu Gly Ile Gly Leu Arg Tyr
370 375 380
Asp Trp Gly Lys Tyr Tyr Ala Ser Lys Ser Phe Tyr Asp Gln Lys Lys
385 390 395 400
Asn Arg Arg Ile Val Trp Ala Tyr Ile Gly Glu Thr Asp Ser Glu Gln
405 410 415
Ala Asp Ile Thr Lys Gly Trp Ala Asn Leu Met Thr Ile Pro Arg Thr
420 425 430
Val Glu Leu Asp Lys Lys Thr Arg Thr Asn Leu Ile Gln Trp Pro Val
435 440 445
Glu Glu Leu Asp Thr Leu Arg Arg Asn Ser Thr Asp Leu Ser Gly Ile
450 455 460
Thr Val Asp Ala Gly Ser Val Ile Arg Leu Pro Leu His Gln Gly Ala
465 470 475 480
Gln Ile Asp Ile Glu Ala Ser Phe Gln Leu Asn Ser Ser Asp Val Asp
485 490 495
Ala Leu Thr Glu Ala Asp Val Ser Tyr Asn Cys Ser Thr Ser Gly Ala
500 505 510
Ala Val Arg Gly Ala Leu Gly Pro Phe Gly Leu Leu Val Leu Ala Asn
515 520 525
Gly Arg Thr Glu Gln Thr Ala Val Tyr Phe Tyr Val Ser Lys Gly Val
530 535 540
Asp Gly Ala Leu Gln Thr His Phe Cys His Asp Glu Ser Arg Ser Thr
545 550 555 560
Gln Ala Lys Asp Val Val Asn Arg Met Ile Gly Ser Ile Val Pro Val
565 570 575
Leu Asp Gly Glu Thr Phe Ser Val Arg Val Leu Val Asp His Ser Ile
580 585 590
Val Gln Ser Phe Ala Met Gly Gly Arg Ile Thr Ala Thr Ser Arg Ala
595 600 605
Tyr Pro Thr Glu Ala Ile Tyr Ala Ala Ala Gly Val Tyr Leu Phe Asn
610 615 620
Asn Ala Thr Gly Ala Thr Val Thr Ala Glu Arg Leu Val Val Tyr Glu
625 630 635 640
Met Ala Ser Ala Asp Asn His Ile Phe Thr Asn Asp Asp Leu
645 650
<210> 7
<211> 631
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 7
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Ser Ser Val Gln Pro Ser Ala
85 90 95
Ala Glu Arg Leu Thr Trp Glu Arg Thr Ala Phe His Phe Gln Pro Ala
100 105 110
Lys Asn Phe Ile Tyr Asp Pro Asn Gly Pro Leu Phe His Met Gly Trp
115 120 125
His His Leu Phe Tyr Gln Tyr Asn Pro Tyr Ala Pro Val Trp Gly Asn
130 135 140
Met Ser Trp Gly His Ala Val Ser Lys Asp Met Ile Asn Trp Phe Glu
145 150 155 160
Leu Pro Val Ala Leu Val Pro Thr Glu Trp Tyr Asp Ile Glu Gly Val
165 170 175
Leu Ser Gly Ser Thr Thr Ala Leu Pro Asn Gly Gln Ile Phe Ala Leu
180 185 190
Tyr Thr Gly Asn Ala Asn Asp Phe Ser Gln Leu Gln Cys Lys Ala Val
195 200 205
Pro Val Asp Val Ser Asp Pro Leu Leu Val Lys Trp Val Lys Tyr Asp
210 215 220
Gly Asn Pro Ile Leu Tyr Thr Pro Pro Gly Ile Gly Leu Lys Asp Tyr
225 230 235 240
Arg Asp Pro Ser Thr Val Trp Thr Gly Pro Asp Gly Lys His Arg Met
245 250 255
Ile Met Gly Thr Lys Arg Gly Thr Thr Gly Leu Val Leu Val Tyr His
260 265 270
Thr Thr Asp Phe Thr Asn Tyr Val Met Leu Asp Glu Pro Leu His Ser
275 280 285
Val Pro Asn Thr Asp Met Trp Glu Cys Val Asp Leu Phe Pro Val Ser
290 295 300
Thr Thr Asn Asp Ser Ala Leu Asp Ile Ala Ala Tyr Gly Ser Gly Ile
305 310 315 320
Lys His Val Leu Lys Glu Ser Trp Glu Gly His Ala Met Asp Phe Tyr
325 330 335
Ser Ile Gly Thr Tyr Asp Ala Ile Asn Asp Lys Trp Thr Pro Asp Asn
340 345 350
Pro Glu Leu Asp Val Gly Ile Gly Leu Arg Cys Asp Tyr Gly Arg Phe
355 360 365
Phe Ala Ser Lys Ser Leu Tyr Asp Pro Leu Lys Lys Arg Arg Val Thr
370 375 380
Trp Gly Tyr Val Ala Glu Ser Asp Ser Ala Asp Gln Asp Val Ser Arg
385 390 395 400
Gly Trp Ala Thr Ile Tyr Asn Val Ala Arg Thr Ile Val Leu Asp Arg
405 410 415
Lys Thr Gly Thr His Leu Leu Gln Trp Pro Val Glu Glu Leu Glu Ser
420 425 430
Leu Arg Ser Asn Val Arg Glu Phe Lys Glu Met Thr Leu Glu Pro Gly
435 440 445
Ser Ile Val Pro Leu Asp Ile Gly Ser Ala Thr Gln Leu Asp Ile Ile
450 455 460
Ala Thr Phe Glu Val Asp Gln Glu Ala Leu Lys Ala Thr Ser Asp Ala
465 470 475 480
Asn Asp Glu Tyr Ala Cys Thr Thr Ser Ser Gly Ala Ala Glu Arg Gly
485 490 495
Ser Phe Gly Pro Phe Gly Ile Ala Val Leu Ala Asp Gly Thr Leu Ser
500 505 510
Glu Leu Thr Pro Val Tyr Phe Tyr Ile Ala Lys Asn Thr Lys Gly Gly
515 520 525
Val Asp Thr His Phe Cys Thr Asp Lys Leu Arg Ser Ser Leu Asp Tyr
530 535 540
Asp Ser Glu Lys Val Val Tyr Gly Ser Thr Ile Pro Val Leu Asp Gly
545 550 555 560
Glu Gln Ile Thr Met Arg Val Leu Val Asp His Ser Val Val Glu Gly
565 570 575
Phe Ala Gln Gly Gly Arg Thr Val Ile Thr Ser Arg Val Tyr Pro Thr
580 585 590
Lys Ala Ile Tyr Glu Gly Ala Lys Leu Phe Val Phe Asn Asn Ala Thr
595 600 605
Thr Thr Asn Val Lys Ala Thr Leu Asn Val Trp Gln Met Ser His Ala
610 615 620
Leu Ile Gln Pro Tyr Pro Phe
625 630
<210> 8
<211> 617
<212> PRT
<213> Arctium lappa
<400> 8
Met Lys Thr Thr Glu Pro Leu Thr Asp Leu Glu His Ala Pro Asn His
1 5 10 15
Thr Pro Leu Leu Asp His Pro Gln Pro Pro Pro Ala Thr Val Ser Lys
20 25 30
Arg Leu Leu Ile Arg Val Leu Ser Ser Ile Thr Phe Val Ser Leu Phe
35 40 45
Phe Val Ser Ala Phe Leu Leu Ile Leu Leu Asn Gln His Glu Ser Ser
50 55 60
Tyr Thr Asp Asp Asn Leu Ala Pro Leu Asp Arg Ser Ser Val Gln Pro
65 70 75 80
Ser Ala Ala Glu Arg Leu Thr Trp Glu Arg Thr Ala Phe His Phe Gln
85 90 95
Pro Ala Lys Asn Phe Ile Tyr Asp Pro Asn Gly Pro Leu Phe His Met
100 105 110
Gly Trp His His Leu Phe Tyr Gln Tyr Asn Pro Tyr Ala Pro Val Trp
115 120 125
Gly Asn Met Ser Trp Gly His Ala Val Ser Lys Asp Met Ile Asn Trp
130 135 140
Phe Glu Leu Pro Val Ala Leu Val Pro Thr Glu Trp Tyr Asp Ile Glu
145 150 155 160
Gly Val Leu Ser Gly Ser Thr Thr Ala Leu Pro Asn Gly Gln Ile Phe
165 170 175
Ala Leu Tyr Thr Gly Asn Ala Asn Asp Phe Ser Gln Leu Gln Cys Lys
180 185 190
Ala Val Pro Val Asp Val Ser Asp Pro Leu Leu Val Lys Trp Val Lys
195 200 205
Tyr Asp Gly Asn Pro Ile Leu Tyr Thr Pro Pro Gly Ile Gly Leu Lys
210 215 220
Asp Tyr Arg Asp Pro Ser Thr Val Trp Thr Gly Pro Asp Gly Lys His
225 230 235 240
Arg Met Ile Met Gly Thr Lys Arg Gly Thr Thr Gly Leu Val Leu Val
245 250 255
Tyr His Thr Thr Asp Phe Thr Asn Tyr Val Met Leu Asp Glu Pro Leu
260 265 270
His Ser Val Pro Asn Thr Asp Met Trp Glu Cys Val Asp Leu Phe Pro
275 280 285
Val Ser Thr Thr Asn Asp Ser Ala Leu Asp Ile Ala Ala Tyr Gly Ser
290 295 300
Gly Ile Lys His Val Leu Lys Glu Ser Trp Glu Gly His Ala Met Asp
305 310 315 320
Phe Tyr Ser Ile Gly Thr Tyr Asp Ala Ile Asn Asp Lys Trp Thr Pro
325 330 335
Asp Asn Pro Glu Leu Asp Val Gly Ile Gly Leu Arg Cys Asp Tyr Gly
340 345 350
Arg Phe Phe Ala Ser Lys Ser Leu Tyr Asp Pro Leu Lys Lys Arg Arg
355 360 365
Val Thr Trp Gly Tyr Val Ala Glu Ser Asp Ser Ala Asp Gln Asp Val
370 375 380
Ser Arg Gly Trp Ala Thr Ile Tyr Asn Val Ala Arg Thr Ile Val Leu
385 390 395 400
Asp Arg Lys Thr Gly Thr His Leu Leu Gln Trp Pro Val Glu Glu Leu
405 410 415
Glu Ser Leu Arg Ser Asn Val Arg Glu Phe Lys Glu Met Thr Leu Glu
420 425 430
Pro Gly Ser Ile Val Pro Leu Asp Ile Gly Ser Ala Thr Gln Leu Asp
435 440 445
Ile Ile Ala Thr Phe Glu Val Asp Gln Glu Ala Leu Lys Ala Thr Ser
450 455 460
Asp Ala Asn Asp Glu Tyr Ala Cys Thr Thr Ser Ser Gly Ala Ala Glu
465 470 475 480
Arg Gly Ser Phe Gly Pro Phe Gly Ile Ala Val Leu Ala Asp Gly Thr
485 490 495
Leu Ser Glu Leu Thr Pro Val Tyr Phe Tyr Ile Ala Lys Asn Thr Lys
500 505 510
Gly Gly Val Asp Thr His Phe Cys Thr Asp Lys Leu Arg Ser Ser Leu
515 520 525
Asp Tyr Asp Ser Glu Lys Val Val Tyr Gly Ser Thr Ile Pro Val Leu
530 535 540
Asp Gly Glu Gln Ile Thr Met Arg Val Leu Val Asp His Ser Val Val
545 550 555 560
Glu Gly Phe Ala Gln Gly Gly Arg Thr Val Ile Thr Ser Arg Val Tyr
565 570 575
Pro Thr Lys Ala Ile Tyr Glu Gly Ala Lys Leu Phe Val Phe Asn Asn
580 585 590
Ala Thr Thr Thr Asn Val Lys Ala Thr Leu Asn Val Trp Gln Met Ser
595 600 605
His Ala Leu Ile Gln Pro Tyr Pro Phe
610 615
<210> 9
<211> 622
<212> PRT
<213> Taraxacum kok-saghyz
<400> 9
Met Lys Thr Ile Glu Pro Phe Ser Asp Val Glu Asn Ala Pro Asn Ser
1 5 10 15
Thr Pro Leu Leu Asn His Pro Glu Pro Pro Arg Ala Ala Val Arg Lys
20 25 30
Gln Ser Phe Val Arg Val Leu Ser Ser Ile Thr Leu Val Ser Leu Phe
35 40 45
Phe Val Leu Ala Phe Val Leu Ile Val Leu Asn Gln Gln Asp Ser Thr
50 55 60
Thr Thr Val Ala Asn Ser Ala Pro Pro Gly Ala Thr Val Pro Glu Lys
65 70 75 80
Ser Ser Val Lys His Ser Gln Ser Asp Arg Leu Arg Trp Glu Arg Thr
85 90 95
Ala Tyr His Phe Gln Pro Ala Lys Asn Phe Ile Tyr Asp Pro Asn Gly
100 105 110
Pro Leu Phe His Met Gly Trp Tyr His Leu Phe Tyr Gln Tyr Asn Pro
115 120 125
Tyr Ala Pro Ile Trp Gly Asn Met Ser Trp Gly His Ala Val Ser Lys
130 135 140
Asp Met Ile His Trp Phe Glu Leu Pro Val Ala Ile Val Pro Thr Glu
145 150 155 160
Trp Tyr Asp Ile Glu Gly Val Leu Ser Gly Ser Thr Thr Ala Leu Pro
165 170 175
Asn Gly Gln Ile Phe Ala Leu Tyr Thr Gly Asn Ala Lys Asp Phe Ser
180 185 190
Gln Leu Gln Cys Lys Ala Val Pro Leu Asn Ala Ser Asp Pro Leu Leu
195 200 205
Val Glu Trp Val Lys Tyr Glu Asp Asn Pro Ile Leu Tyr Ile Pro Pro
210 215 220
Gly Ile Gly Pro Lys Asp Tyr Arg Asp Pro Ser Thr Val Trp Thr Gly
225 230 235 240
Pro Asp Gly Lys His Arg Met Ile Met Gly Thr Lys Gln Asn Gly Thr
245 250 255
Gly Met Val His Val Tyr His Thr Thr Asp Phe Ile Asn Tyr Val Leu
260 265 270
Leu Asp Glu Pro Leu His Ser Val Pro Asn Thr Asp Met Trp Glu Cys
275 280 285
Val Asp Phe Tyr Pro Val Ser Thr Ile Asn Asp Ser Ala Leu Asp Ile
290 295 300
Ala Ala Tyr Gly Ser Asp Ile Lys His Val Ile Lys Glu Ser Trp Glu
305 310 315 320
Gly His Gly Met Asp Leu Tyr Ser Ile Gly Thr Tyr Asp Ala Tyr Lys
325 330 335
Asp Lys Trp Thr Pro Asp Asn Pro Glu Phe Asp Val Gly Ile Gly Leu
340 345 350
Arg Val Asp Tyr Gly Arg Phe Phe Ala Ser Lys Ser Leu Tyr Asp Pro
355 360 365
Leu Lys Lys Arg Arg Val Thr Trp Gly Tyr Val Ala Glu Ser Asp Ser
370 375 380
Ser Asp Gln Asp Leu Asn Arg Gly Trp Ala Thr Ile Tyr Asn Val Gly
385 390 395 400
Arg Thr Val Val Leu Asp Arg Lys Thr Gly Thr His Leu Leu His Trp
405 410 415
Pro Val Glu Glu Ile Glu Ser Leu Arg Ser Asn Val Arg Glu Phe Asn
420 425 430
Glu Ile Glu Leu Val Pro Gly Ser Ile Ile Pro Leu Asp Ile Gly Met
435 440 445
Ala Thr Gln Leu Asp Ile Val Ala Thr Phe Lys Val Asp Pro Glu Ala
450 455 460
Leu Met Ala Lys Ser Asp Ile Asn Ser Glu Tyr Gly Cys Thr Thr Ser
465 470 475 480
Ser Gly Ala Thr Gln Arg Gly Ser Leu Gly Pro Phe Gly Ile Val Val
485 490 495
Leu Ala Asp Val Ala Leu Ser Glu Leu Thr Pro Val Tyr Phe Tyr Ile
500 505 510
Ala Lys Asn Ile Asp Gly Gly Leu Val Thr His Phe Cys Thr Asp Lys
515 520 525
Leu Arg Ser Ser Leu Asp Tyr Asp Gly Glu Arg Val Val Tyr Gly Ser
530 535 540
Thr Val Pro Val Leu Asp Gly Glu Glu Leu Thr Met Arg Leu Leu Val
545 550 555 560
Asp His Ser Val Val Glu Gly Phe Ala Gln Gly Gly Arg Thr Val Met
565 570 575
Thr Ser Arg Val Tyr Pro Thr Asn Ala Ile Tyr Glu Glu Ala Lys Ile
580 585 590
Phe Leu Phe Asn Asn Ala Thr Gly Ala Ser Val Lys Ala Ser Leu Lys
595 600 605
Ile Trp Gln Met Gly Ser Ala Ser Ile Gln Ala Tyr Pro Phe
610 615 620
<210> 10
<211> 622
<212> PRT
<213> Taraxacum brevicorniculatum
<400> 10
Met Lys Thr Ile Glu Pro Phe Ser Asp Val Glu Asn Ala Pro Asn Ser
1 5 10 15
Thr Pro Leu Leu Asn His Pro Glu Pro Ser Arg Ala Ala Val Arg Lys
20 25 30
Gln Ser Phe Val Arg Val Leu Ser Ser Ile Thr Leu Val Ser Leu Phe
35 40 45
Phe Val Leu Ala Phe Val Leu Ile Val Leu Asn Gln Gln Asp Ser Thr
50 55 60
Asn Thr Val Ala Asn Ser Ala Pro Pro Gly Ala Thr Val Pro Glu Lys
65 70 75 80
Ser Ser Val Lys His Ser Gln Ser Asp Arg Leu Arg Trp Glu Arg Thr
85 90 95
Ala Tyr His Phe Gln Pro Ala Lys Asn Phe Ile Tyr Asp Pro Asn Gly
100 105 110
Pro Leu Phe His Met Gly Trp Tyr His Leu Phe Tyr Gln Tyr Asn Pro
115 120 125
Tyr Ala Pro Ile Trp Gly Asn Met Ser Trp Gly His Ala Val Ser Lys
130 135 140
Asp Met Ile His Trp Phe Glu Leu Pro Val Ala Met Val Pro Thr Glu
145 150 155 160
Trp Tyr Asp Ile Glu Gly Val Leu Ser Gly Ser Thr Thr Ala Leu Pro
165 170 175
Asn Gly Gln Ile Phe Ala Leu Tyr Thr Gly Asn Ala Lys Asp Phe Ser
180 185 190
Gln Leu Gln Cys Lys Ala Val Pro Leu Asn Ala Ser Asp Pro Leu Leu
195 200 205
Val Asp Trp Val Lys Tyr Glu Asp Asn Pro Ile Leu Tyr Ile Pro Pro
210 215 220
Gly Ile Gly Pro Lys Asp Tyr Arg Asp Pro Ser Thr Val Trp Thr Gly
225 230 235 240
Pro Asp Gly Lys His Arg Met Ile Met Gly Thr Lys Gln Asn Gly Thr
245 250 255
Gly Met Val His Val Tyr His Thr Thr Asp Phe Ile Asn Tyr Val Leu
260 265 270
Leu Asp Glu Pro Leu His Ser Val Pro Asn Thr Asp Met Trp Glu Cys
275 280 285
Val Asp Phe Tyr Pro Val Ser Thr Ile Asn Asp Ser Ala Leu Asp Ile
290 295 300
Ala Ala Tyr Gly Ser Asp Ile Lys His Val Ile Lys Glu Ser Trp Glu
305 310 315 320
Gly His Gly Met Asp Leu Tyr Ser Ile Gly Thr Tyr Asp Ala Tyr Lys
325 330 335
Asp Lys Trp Thr Pro Asp Asn Pro Glu Leu Asp Val Gly Ile Gly Leu
340 345 350
Arg Val Asp Tyr Gly Arg Leu Phe Ala Ser Lys Ser Leu Tyr Asp Pro
355 360 365
Leu Lys Lys Arg Arg Val Thr Trp Gly Tyr Val Gly Glu Ser Asp Ser
370 375 380
Pro Asp Gln Asp Ile Asn Arg Gly Trp Ala Thr Ile Tyr Asn Val Gly
385 390 395 400
Arg Thr Val Val Leu Asp Arg Lys Thr Gly Thr His Leu Leu His Trp
405 410 415
Pro Val Glu Glu Ile Glu Ser Leu Arg Ser Asn Val Arg Glu Phe Asn
420 425 430
Glu Ile Glu Leu Val Pro Gly Ser Ile Ile Pro Leu Asp Ile Gly Met
435 440 445
Ala Thr Gln Leu Asp Ile Val Ala Thr Phe Lys Val Asp Pro Glu Ala
450 455 460
Leu Met Ala Lys Ser Asp Ile Asn Ser Glu Tyr Gly Cys Thr Thr Ser
465 470 475 480
Ser Gly Ala Thr Gln Arg Gly Ser Leu Gly Pro Phe Gly Ile Val Val
485 490 495
Leu Ala Asp Leu Ala Leu Ser Glu Leu Thr Pro Leu Tyr Phe Tyr Ile
500 505 510
Ala Lys Asn Thr Asp Gly Gly Leu Val Thr His Phe Cys Thr Asp Lys
515 520 525
Leu Arg Ser Ser Leu Asp Tyr Asp Gly Glu Arg Val Val Tyr Gly Gly
530 535 540
Thr Val Pro Val Leu Asp Gly Glu Glu Leu Thr Met Arg Leu Leu Val
545 550 555 560
Asp His Ser Val Val Glu Gly Phe Ala Gln Gly Gly Arg Thr Val Ile
565 570 575
Thr Ser Arg Val Tyr Pro Thr Asn Ala Ile Tyr Glu Glu Ala Lys Ile
580 585 590
Phe Leu Phe Asn Asn Ala Thr Gly Ala Ser Val Lys Ala Ser Leu Lys
595 600 605
Ile Trp Gln Met Gly Ser Ala Ser Ile Gln Ala Tyr Pro Phe
610 615 620
<210> 11
<211> 1896
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 11
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120
tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180
aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240
tctctcgaga aaagagaggc tgaagctagt tccgttcaac cttctgccgc tgaacgttta 300
acctgggaga gaactgcatt ccattttcag ccagctaaaa atttcattta tgatccaaac 360
ggaccgctgt ttcacatggg ctggcaccat cttttctacc aatacaaccc ctacgctcca 420
gtctggggta atatgagctg gggtcacgcg gtgtcaaagg acatgataaa ctggttcgaa 480
ttgccagtag ccttagttcc aacggaatgg tatgatattg aaggtgttct atctggttct 540
actacagctt tgcctaatgg gcaaatcttt gctttgtaca ccggtaacgc caacgacttc 600
tcccaattgc aatgtaaggc tgtcccagtt gacgtgtcgg atccattatt ggtcaaatgg 660
gttaagtatg acggtaatcc gatcttgtac actccacctg gaatcggtct gaaggattat 720
agagatccat ctaccgtctg gactggtcca gacggtaagc ataggatgat tatgggtaca 780
aagagaggta ccactggctt ggttttagtt taccacacaa cggatttcac taactacgtc 840
atgttggacg aaccactcca ctcagtacca aacactgaca tgtgggaatg cgttgatctt 900
tttccggtca gcaccaccaa tgatagtgct ttggacatcg cggcttatgg ttccggtatt 960
aaacatgttt tgaaagagtc ttgggaaggt cacgcaatgg atttctactc cattgggact 1020
tacgatgcta taaacgacaa gtggactcct gacaacccag aactagacgt cggtattggt 1080
ttgagatgtg attacggtag atttttcgca tctaagtccc tatacgatcc tttaaagaaa 1140
cggagagtta cctggggata tgtcgccgaa tctgattcag ccgaccaaga cgtgtctcgc 1200
ggttgggcta caatctataa tgttgcaagg actattgttt tagaccgtaa gaccggcact 1260
catctgcttc agtggccagt cgaagaattg gagtccctta gatcgaacgt gagagaattt 1320
aaggaaatga ccttggaacc aggttccatc gttccattgg atataggttc tgctactcaa 1380
ttggatatta tcgctacgtt cgaagttgac caagaagctt tgaaagctac ctctgacgct 1440
aacgacgaat acgcctgtac aacatcttca ggtgctgcgg agcgtggttc gttcggtccc 1500
ttcggtatcg ctgtcctcgc cgatggtacc ttgtccgaac tgactccagt atacttctac 1560
attgctaaaa atactaaggg cggggtcgat acgcactttt gtactgataa gttgagaagc 1620
tctttagact atgacagtga aaaggttgtc tacgggagta ccattccagt tttagatggt 1680
gaacaaatca ctatgagagt tctcgtcgat cattccgttg tggaaggttt tgcccagggt 1740
ggtagaactg taattaccag tagagtttac cctaccaagg ctatatacga aggtgccaag 1800
ttgtttgtat tcaataacgc tacaactaca aatgttaagg caacgttgaa tgtatggcaa 1860
atgtcacacg ccctcatcca accataccca ttctaa 1896
<210> 12
<211> 608
<212> PRT
<213> Echinops ritro
<400> 12
Glu Pro Phe Ser Asp Leu Glu His Ala Pro Asn His Thr Pro Leu Leu
1 5 10 15
Asp Arg Pro Lys Thr Pro Pro Ala Ala Val Ser His Arg Leu Leu Ile
20 25 30
Arg Val Leu Ser Thr Ile Thr Val Val Ser Leu Phe Phe Val Ala Ala
35 40 45
Phe Leu Leu Val Leu Asn Gln Gln Asp Ser Gly Asn Asn Pro Leu Pro
50 55 60
Gln Asp Pro Pro Pro Gln Pro Ser Ala Ala Asp Arg Leu Arg Trp Glu
65 70 75 80
Arg Thr Ala Tyr His Tyr Gln Pro Ala Lys Asn Phe Met Tyr Asp Pro
85 90 95
Asn Gly Pro Ile Phe His Met Gly Trp Tyr His Leu Phe Tyr Gln Tyr
100 105 110
Asn Pro Tyr Ser Val Phe Trp Gly Asn Met Thr Trp Gly His Ala Val
115 120 125
Ser Lys Asp Met Ile Asn Trp Phe Glu Leu Pro Val Ala Leu Ala Pro
130 135 140
Val Glu Trp Tyr Asp Ile Glu Gly Val Leu Ser Gly Ser Thr Thr Val
145 150 155 160
Leu Pro Thr Gly Glu Ile Phe Ala Leu Tyr Thr Gly Asn Ala Asn Asp
165 170 175
Phe Ser Gln Leu Gln Cys Lys Ala Val Pro Val Asn Thr Ser Asp Pro
180 185 190
Leu Leu Ile Asp Trp Val Arg Tyr Glu Gly Asn Pro Ile Leu Tyr Thr
195 200 205
Pro Pro Gly Val Gly Leu Thr Asp Tyr Arg Asp Pro Ser Thr Val Trp
210 215 220
Thr Gly Pro Asp Asn Ile His Arg Met Ile Ile Gly Thr Arg Arg Asn
225 230 235 240
Asn Thr Gly Leu Val Leu Val Tyr His Thr Lys Asp Phe Ile Asn Tyr
245 250 255
Glu Leu Leu Asp Glu Pro Leu His Ser Val Pro Asp Ser Gly Met Trp
260 265 270
Glu Cys Val Asp Leu Tyr Pro Val Ser Thr Met Asn Asp Thr Ala Leu
275 280 285
Asp Val Ala Ala Tyr Gly Ser Gly Ile Lys His Val Leu Lys Glu Ser
290 295 300
Trp Glu Gly His Ala Lys Asp Phe Tyr Ser Ile Gly Thr Tyr Asp Ala
305 310 315 320
Ile Asn Asp Lys Trp Trp Pro Asp Asn Pro Glu Leu Asp Leu Gly Met
325 330 335
Gly Trp Arg Cys Asp Tyr Gly Arg Phe Phe Ala Ser Lys Thr Leu Tyr
340 345 350
Asp Pro Leu Lys Lys Arg Arg Val Thr Trp Gly Tyr Val Ala Glu Ser
355 360 365
Asp Ser Gly Asp Gln Asp Arg Ser Arg Gly Trp Ser Asn Ile Tyr Asn
370 375 380
Val Ala Arg Thr Val Met Leu Asp Arg Lys Thr Gly Thr Asn Leu Leu
385 390 395 400
Gln Trp Pro Val Glu Glu Ile Glu Ser Leu Arg Ser Lys Val His Glu
405 410 415
Phe Asn Glu Ile Glu Leu Gln Pro Gly Ser Ile Ile Pro Leu Glu Val
420 425 430
Gly Ser Thr Thr Gln Leu Asp Ile Val Ala Thr Phe Glu Val Asn Lys
435 440 445
Asp Ala Phe Glu Glu Thr Asn Val Asn Tyr Asn Glu Tyr Gly Cys Thr
450 455 460
Ser Ser Lys Gly Ala Ser Gln Arg Gly Arg Leu Gly Pro Phe Gly Ile
465 470 475 480
Ile Val Leu Ala Asp Gly Asn Leu Leu Glu Leu Thr Pro Val Tyr Phe
485 490 495
Tyr Ile Ala Lys Asn Asn Asp Gly Ser Leu Thr Thr His Phe Cys Thr
500 505 510
Asp Lys Leu Arg Ser Ser Phe Asp Tyr Asp Asp Glu Lys Val Val Tyr
515 520 525
Gly Ser Thr Val Pro Val Leu Glu Gly Glu Lys Leu Thr Ile Arg Leu
530 535 540
Met Val Asp His Ser Ile Ile Glu Gly Phe Ala Gln Gly Gly Arg Thr
545 550 555 560
Val Ile Thr Ser Arg Val Tyr Pro Thr Lys Ala Ile Tyr Asp Thr Ala
565 570 575
Lys Leu Phe Leu Phe Asn Asn Ala Thr Asp Ile Thr Val Lys Ala Ser
580 585 590
Leu Lys Val Trp His Met Ala Ser Ala Asn Ile Gln Met Tyr Pro Phe
595 600 605
<210> 13
<211> 638
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 13
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Val Pro Gly Lys Leu Glu Ser
85 90 95
Asn Ala Asp Val Glu Trp Gln Arg Ser Ala Tyr His Phe Gln Pro Asp
100 105 110
Lys Asn Phe Ile Ser Asp Pro Asp Gly Pro Met Tyr His Met Gly Trp
115 120 125
Tyr His Leu Phe Tyr Gln Tyr Asn Pro Glu Ser Ala Ile Trp Gly Asn
130 135 140
Ile Thr Trp Gly His Ser Val Ser Arg Asp Met Ile Asn Trp Phe His
145 150 155 160
Leu Pro Phe Ala Met Val Pro Asp His Trp Tyr Asp Ile Glu Gly Val
165 170 175
Met Thr Gly Ser Ala Thr Val Leu Pro Asn Gly Gln Ile Ile Met Leu
180 185 190
Tyr Thr Gly Asn Ala Tyr Asp Leu Ser Gln Leu Gln Cys Leu Ala Tyr
195 200 205
Ala Val Asn Ser Ser Asp Pro Leu Leu Leu Glu Trp Lys Lys Tyr Glu
210 215 220
Gly Asn Pro Ile Leu Phe Pro Pro Pro Gly Val Gly Tyr Lys Asp Phe
225 230 235 240
Arg Asp Pro Ser Thr Leu Trp Met Gly Pro Asp Gly Glu Trp Arg Met
245 250 255
Val Met Gly Ser Lys His Asn Glu Thr Ile Gly Cys Ala Leu Val Tyr
260 265 270
Arg Thr Thr Asn Phe Thr His Phe Glu Leu Asn Glu Glu Val Leu His
275 280 285
Ala Val Pro His Thr Gly Met Trp Glu Cys Val Asp Leu Tyr Pro Val
290 295 300
Ser Thr Thr His Thr Asn Gly Leu Glu Met Lys Asp Asn Gly Pro Asn
305 310 315 320
Val Lys Tyr Ile Leu Lys Gln Ser Gly Asp Glu Asp Arg His Asp Trp
325 330 335
Tyr Ala Ile Gly Thr Phe Asp Pro Glu Lys Asp Lys Trp Tyr Pro Asp
340 345 350
Asp Pro Glu Asn Asp Val Gly Ile Gly Leu Arg Tyr Asp Tyr Gly Lys
355 360 365
Phe Tyr Ala Ser Lys Thr Phe Tyr Asp Gln His Lys Lys Arg Arg Val
370 375 380
Leu Trp Gly Tyr Val Gly Glu Thr Asp Pro Pro Lys Ser Asp Leu Leu
385 390 395 400
Lys Gly Trp Ala Asn Ile Leu Asn Ile Pro Arg Ser Val Val Leu Asp
405 410 415
Thr Gln Thr Glu Thr Asn Leu Ile Gln Trp Pro Ile Glu Glu Val Glu
420 425 430
Lys Leu Arg Ser Lys Lys Tyr Asp Glu Phe Lys Asp Val Glu Leu Arg
435 440 445
Pro Gly Ser Leu Ile Pro Leu Glu Ile Gly Thr Ala Thr Gln Leu Asp
450 455 460
Ile Ser Ala Thr Phe Glu Ile Asp Glu Lys Lys Leu Glu Ser Thr Leu
465 470 475 480
Glu Ala Asp Val Leu Phe Asn Cys Thr Thr Ser Glu Gly Ser Val Gly
485 490 495
Arg Gly Val Leu Gly Pro Phe Gly Ile Val Val Leu Ala Asp Ala Asn
500 505 510
Arg Ser Glu Gln Leu Pro Val Tyr Phe Tyr Ile Ala Lys Asp Thr Asp
515 520 525
Gly Thr Ser Arg Thr Tyr Phe Cys Ala Asp Glu Ser Arg Ser Ser Lys
530 535 540
Asp Lys Asp Val Gly Lys Trp Val Tyr Gly Ser Ser Val Pro Val Leu
545 550 555 560
Glu Gly Glu Asn Tyr Asn Met Arg Leu Leu Val Asp His Ser Ile Val
565 570 575
Glu Gly Phe Ala Gln Gly Gly Arg Thr Val Val Thr Ser Arg Val Tyr
580 585 590
Pro Thr Met Ala Ile Tyr Gly Ala Ala Lys Ile Phe Leu Phe Asn Asn
595 600 605
Ala Thr Gly Ile Ser Val Lys Ala Ser Leu Lys Ile Trp Lys Met Ala
610 615 620
Glu Ala Gln Leu Asp Pro Phe Pro Leu Ser Gly Trp Ser Ser
625 630 635
<210> 14
<211> 640
<212> PRT
<213> Cichorium intybus
<400> 14
Met Ala Ser Ser Thr Thr Ala Thr Thr Pro Leu Ile Leu Arg Asp Glu
1 5 10 15
Thr Gln Ile Arg Pro Gln Leu Ala Gly Ser Ser Val Gly Arg Arg Leu
20 25 30
Ser Met Ala Lys Ile Leu Ser Gly Ile Leu Val Phe Val Leu Val Ile
35 40 45
Cys Ala Leu Val Ala Val Ile His Asp Gln Ser Gln Gln Thr Met Ala
50 55 60
Thr Asn Asn His Gln Gly Gly Asp Lys Pro Thr Ser Ala Ala Thr Phe
65 70 75 80
Thr Ala Pro Leu Pro Gln Val Gly Leu Lys Arg Val Pro Gly Lys Leu
85 90 95
Glu Ser Asn Ala Asp Val Glu Trp Gln Arg Ser Ala Tyr His Phe Gln
100 105 110
Pro Asp Lys Asn Phe Ile Ser Asp Pro Asp Gly Pro Met Tyr His Met
115 120 125
Gly Trp Tyr His Leu Phe Tyr Gln Tyr Asn Pro Glu Ser Ala Ile Trp
130 135 140
Gly Asn Ile Thr Trp Gly His Ser Val Ser Arg Asp Met Ile Asn Trp
145 150 155 160
Phe His Leu Pro Phe Ala Met Val Pro Asp His Trp Tyr Asp Ile Glu
165 170 175
Gly Val Met Thr Gly Ser Ala Thr Val Leu Pro Asn Gly Gln Ile Ile
180 185 190
Met Leu Tyr Thr Gly Asn Ala Tyr Asp Leu Ser Gln Leu Gln Cys Leu
195 200 205
Ala Tyr Ala Val Asn Ser Ser Asp Pro Leu Leu Leu Glu Trp Lys Lys
210 215 220
Tyr Glu Gly Asn Pro Ile Leu Phe Pro Pro Pro Gly Val Gly Tyr Lys
225 230 235 240
Asp Phe Arg Asp Pro Ser Thr Leu Trp Met Gly Pro Asp Gly Glu Trp
245 250 255
Arg Met Val Met Gly Ser Lys His Asn Glu Thr Ile Gly Cys Ala Leu
260 265 270
Val Tyr Arg Thr Thr Asn Phe Thr His Phe Glu Leu Asn Glu Glu Val
275 280 285
Leu His Ala Val Pro His Thr Gly Met Trp Glu Cys Val Asp Leu Tyr
290 295 300
Pro Val Ser Thr Thr His Thr Asn Gly Leu Glu Met Lys Asp Asn Gly
305 310 315 320
Pro Asn Val Lys Tyr Ile Leu Lys Gln Ser Gly Asp Glu Asp Arg His
325 330 335
Asp Trp Tyr Ala Ile Gly Thr Phe Asp Pro Glu Lys Asp Lys Trp Tyr
340 345 350
Pro Asp Asp Pro Glu Asn Asp Val Gly Ile Gly Leu Arg Tyr Asp Tyr
355 360 365
Gly Lys Phe Tyr Ala Ser Lys Thr Phe Tyr Asp Gln His Lys Lys Arg
370 375 380
Arg Val Leu Trp Gly Tyr Val Gly Glu Thr Asp Pro Pro Lys Ser Asp
385 390 395 400
Leu Leu Lys Gly Trp Ala Asn Ile Leu Asn Ile Pro Arg Ser Val Val
405 410 415
Leu Asp Thr Gln Thr Glu Thr Asn Leu Ile Gln Trp Pro Ile Glu Glu
420 425 430
Val Glu Lys Leu Arg Ser Lys Lys Tyr Asp Glu Phe Lys Asp Val Glu
435 440 445
Leu Arg Pro Gly Ser Leu Ile Pro Leu Glu Ile Gly Thr Ala Thr Gln
450 455 460
Leu Asp Ile Ser Ala Thr Phe Glu Ile Asp Glu Lys Lys Leu Glu Ser
465 470 475 480
Thr Leu Glu Ala Asp Val Leu Phe Asn Cys Thr Thr Ser Glu Gly Ser
485 490 495
Val Gly Arg Gly Val Leu Gly Pro Phe Gly Ile Val Val Leu Ala Asp
500 505 510
Ala Asn Arg Ser Glu Gln Leu Pro Val Tyr Phe Tyr Ile Ala Lys Asp
515 520 525
Thr Asp Gly Thr Ser Arg Thr Tyr Phe Cys Ala Asp Glu Ser Arg Ser
530 535 540
Ser Lys Asp Lys Asp Val Gly Lys Trp Val Tyr Gly Ser Ser Val Pro
545 550 555 560
Val Leu Glu Gly Glu Asn Tyr Asn Met Arg Leu Leu Val Asp His Ser
565 570 575
Ile Val Glu Gly Phe Ala Gln Gly Gly Arg Thr Val Val Thr Ser Arg
580 585 590
Val Tyr Pro Thr Met Ala Ile Tyr Gly Ala Ala Lys Ile Phe Leu Phe
595 600 605
Asn Asn Ala Thr Gly Ile Ser Val Lys Ala Ser Leu Lys Ile Trp Lys
610 615 620
Met Ala Glu Ala Gln Leu Asp Pro Phe Pro Leu Ser Gly Trp Ser Ser
625 630 635 640
<210> 15
<211> 616
<212> PRT
<213> Aegilops searsii
<400> 15
Met Gly Ser His Gly Lys Pro Pro Leu Pro Tyr Ala Tyr Lys Pro Leu
1 5 10 15
Pro Ser Asp Ala Asp Gly Glu Arg Thr Gly Cys Thr Arg Trp Arg Val
20 25 30
Cys Ala Thr Ala Leu Thr Ala Ser Ala Met Val Val Val Val Val Gly
35 40 45
Ala Thr Leu Leu Ala Gly Phe Arg Val Asp Gln Ala Val Asp Glu Glu
50 55 60
Ala Ala Gly Gly Phe Pro Trp Ser Asn Glu Met Leu Gln Trp Gln Arg
65 70 75 80
Ser Gly Tyr His Phe Gln Thr Ala Lys Asn Tyr Met Ser Asp Pro Asn
85 90 95
Gly Leu Met Tyr Tyr Arg Gly Trp Tyr His Met Phe Phe Gln Tyr Asn
100 105 110
Pro Val Gly Thr Asp Trp Asp Asp Gly Met Glu Trp Gly His Ala Val
115 120 125
Ser Arg Asn Leu Val Gln Trp Arg Thr Leu Pro Ile Ala Met Val Ala
130 135 140
Asp Gln Trp Tyr Asp Ile Leu Gly Val Leu Ser Gly Ser Met Thr Val
145 150 155 160
Leu Pro Asn Gly Thr Val Ile Met Ile Tyr Thr Gly Ala Thr Asn Ala
165 170 175
Ser Ala Val Glu Val Gln Cys Ile Ala Thr Pro Ala Asp Pro Asn Asp
180 185 190
Pro Leu Leu Arg Arg Trp Thr Lys His Pro Ala Asn Pro Val Ile Trp
195 200 205
Ser Pro Pro Gly Val Gly Thr Lys Asp Phe Arg Asp Ser Met Thr Ala
210 215 220
Trp Tyr Asp Glu Ser Asp Asp Thr Trp Arg Thr Leu Leu Gly Ser Lys
225 230 235 240
Asp Asp Asn Asn Gly His His Asp Gly Ile Ala Met Met Tyr Lys Thr
245 250 255
Lys Asp Phe Leu Asn Tyr Glu Leu Ile Pro Gly Ile Leu His Arg Val
260 265 270
Glu Arg Thr Gly Glu Trp Glu Cys Ile Asp Phe Tyr Pro Val Gly His
275 280 285
Arg Thr Ser Asp Asn Ser Ser Glu Met Leu His Val Leu Lys Ala Ser
290 295 300
Met Asp Asp Glu Arg His Asp Tyr Tyr Ser Leu Gly Thr Tyr Asp Ser
305 310 315 320
Ala Ala Asn Arg Trp Thr Pro Ile Asp Pro Glu Leu Asp Leu Gly Ile
325 330 335
Gly Leu Arg Tyr Asp Trp Gly Lys Phe Tyr Ala Ser Thr Ser Phe Tyr
340 345 350
Asp Pro Ala Lys Lys Arg Arg Val Leu Met Gly Tyr Val Gly Glu Val
355 360 365
Asp Ser Lys Arg Ala Asp Val Val Lys Gly Trp Ala Ser Ile Gln Ser
370 375 380
Val Pro Arg Thr Ile Ala Leu Asp Glu Lys Thr Arg Thr Asn Leu Leu
385 390 395 400
Leu Trp Pro Val Glu Glu Ile Glu Thr Leu Arg Leu Asn Ala Thr Gln
405 410 415
Leu Ser Asp Val Thr Leu Asn Thr Gly Ser Val Ile His Ile Pro Leu
420 425 430
Arg Gln Gly Thr Gln Leu Asp Ile Glu Ala Thr Phe His Leu Asp Ala
435 440 445
Ser Ala Val Ala Ala Leu Asn Glu Ala Asp Val Gly Tyr Asn Cys Ser
450 455 460
Ser Ser Gly Gly Ala Val Asn Arg Gly Ala Leu Gly Pro Phe Gly Leu
465 470 475 480
Leu Val Leu Ala Ala Gly Asp Arg Arg Gly Glu Gln Thr Ala Val Tyr
485 490 495
Phe Tyr Val Ser Arg Gly Leu Asp Gly Gly Leu His Thr Ser Phe Cys
500 505 510
Gln Asp Glu Leu Arg Ser Ser Arg Ala Lys Asp Val Thr Lys Arg Val
515 520 525
Ile Gly Ser Thr Val Pro Val Leu Asp Gly Glu Ala Phe Ser Met Arg
530 535 540
Val Leu Val Asp His Ser Ile Val Gln Gly Phe Ala Met Gly Gly Arg
545 550 555 560
Thr Thr Met Thr Ser Arg Val Tyr Pro Met Glu Ala Tyr Gln Glu Ala
565 570 575
Lys Val Tyr Leu Phe Asn Asn Ala Thr Gly Ala Ser Val Thr Ala Glu
580 585 590
Arg Leu Val Val His Asp Met Asp Ser Ala His Asn Gln Leu Ser Asn
595 600 605
Met Asp Asp Tyr Ser Tyr Val Gln
610 615
<210> 16
<211> 616
<212> PRT
<213> Unknown
<220>
<223> Triticum durum
<400> 16
Met Gly Ser His Gly Lys Pro Pro Leu Pro Tyr Ala Tyr Lys Pro Leu
1 5 10 15
Pro Ser Asp Ala Asp Gly Glu Arg Thr Gly Cys Thr Arg Trp Arg Val
20 25 30
Cys Ala Val Ala Leu Thr Ala Ser Ala Met Val Val Val Val Val Gly
35 40 45
Ala Thr Leu Leu Ala Gly Phe Arg Val Asp Gln Ala Val Asp Glu Glu
50 55 60
Ala Ala Gly Gly Phe Pro Trp Ser Asn Glu Met Leu Gln Trp Gln Arg
65 70 75 80
Ser Gly Tyr His Phe Gln Thr Ala Lys Asn Tyr Met Ser Asp Pro Asn
85 90 95
Gly Leu Met Tyr Tyr Arg Gly Trp Asn His Met Phe Phe Gln Tyr Asn
100 105 110
Pro Val Gly Thr Asp Trp Asp Asp Gly Met Glu Trp Gly His Ala Val
115 120 125
Ser Arg Asn Leu Val Gln Trp Arg Thr Leu Pro Ile Ala Met Val Ala
130 135 140
Asp Gln Trp Tyr Asp Ile Leu Gly Val Leu Ser Gly Ser Met Thr Val
145 150 155 160
Leu Pro Asn Gly Thr Val Ile Met Ile Tyr Thr Gly Ala Thr Asn Ala
165 170 175
Ser Ala Val Glu Val Gln Cys Ile Ala Thr Pro Ala Asp Pro Thr Asp
180 185 190
Pro Leu Leu Arg Arg Trp Thr Lys His Pro Ala Asn Pro Val Ile Trp
195 200 205
Ser Pro Pro Gly Val Gly Thr Lys Asp Phe Arg Asp Pro Met Thr Ala
210 215 220
Trp Tyr Asp Glu Ser Asp Asp Thr Trp Arg Thr Leu Leu Gly Ser Lys
225 230 235 240
Asp Asp Asn Asn Gly His His Asp Gly Ile Ala Met Met Tyr Lys Thr
245 250 255
Lys Asp Phe Leu Asn Tyr Glu Leu Ile Pro Gly Ile Leu His Arg Val
260 265 270
Glu Arg Thr Gly Glu Trp Glu Cys Ile Asp Phe Tyr Pro Val Gly Arg
275 280 285
Arg Thr Ser Asp Asn Ser Ser Glu Met Leu His Val Leu Lys Ala Ser
290 295 300
Met Asp Asp Glu Arg His Asp Tyr Tyr Ser Leu Gly Thr Tyr Asp Ser
305 310 315 320
Ala Ala Asn Arg Trp Thr Pro Ile Asp Pro Glu Leu Asp Leu Gly Ile
325 330 335
Gly Leu Arg Tyr Asp Trp Gly Lys Phe Tyr Ala Ser Thr Ser Phe Tyr
340 345 350
Asp Pro Ala Lys Lys Arg Arg Val Leu Met Gly Tyr Val Gly Glu Val
355 360 365
Asp Ser Lys Arg Ala Asp Val Val Lys Gly Trp Ala Ser Ile Gln Ser
370 375 380
Val Pro Arg Thr Ile Ala Leu Asp Glu Lys Thr Arg Thr Asn Leu Leu
385 390 395 400
Leu Trp Pro Val Glu Glu Ile Glu Thr Leu Arg Leu Asn Ala Thr Glu
405 410 415
Leu Ser Asp Val Thr Leu Asn Thr Gly Ser Val Ile His Ile Pro Leu
420 425 430
Arg Gln Gly Thr Gln Leu Asp Ile Glu Ala Thr Phe His Leu Asp Ala
435 440 445
Ser Ala Val Ala Ala Phe Asn Glu Ala Asp Val Gly Tyr Asn Cys Ser
450 455 460
Ser Ser Gly Gly Ala Val Asn Arg Gly Ala Leu Gly Pro Phe Gly Leu
465 470 475 480
Leu Val Leu Ala Ala Gly Asp Arg Arg Gly Glu Gln Thr Ala Val Tyr
485 490 495
Phe Tyr Val Ser Arg Gly Leu Asp Gly Gly Leu His Thr Ser Phe Cys
500 505 510
Gln Asp Glu Leu Arg Ser Ser Arg Ala Lys Asp Val Thr Lys Arg Val
515 520 525
Ile Gly Ser Thr Val Pro Val Leu Asp Gly Glu Ala Phe Ser Met Arg
530 535 540
Val Leu Val Asp His Ser Ile Val Gln Gly Phe Ala Met Gly Gly Arg
545 550 555 560
Thr Thr Met Thr Ser Arg Val Tyr Pro Met Glu Ala Tyr Gln Glu Ala
565 570 575
Lys Val Tyr Leu Phe Asn Asn Ala Thr Gly Ala Ser Val Thr Ala Glu
580 585 590
Arg Leu Val Val His Glu Met Asp Ser Ala His Asn Gln Leu Ser Asn
595 600 605
Met Asp Asp His Ser Tyr Val Gln
610 615
<210> 17
<211> 648
<212> PRT
<213> Unknown
<220>
<223> Triticum durum
<400> 17
Met Glu Ser Ser Arg Gly Ile Leu Ile Pro Gly Thr Pro Pro Leu Pro
1 5 10 15
Tyr Ala Tyr Glu Pro Leu Pro Ser Ser Leu Thr Asp Ala Asn Gly Gln
20 25 30
Glu Asp Arg Arg Ile Thr Gly Gly Val Arg Trp Arg Ala Trp Ala Ala
35 40 45
Val Leu Ala Val Gly Ala Leu Val Val Ala Ala Ala Val Phe Gly Ala
50 55 60
Ser Arg Val Asp Arg Asp Ala Val Ala Ser Ser Val Pro Ala Thr Ala
65 70 75 80
Glu His Gly Val Leu Glu Lys Ala Ser Gly Pro Tyr Ser Ala Ser Gly
85 90 95
Gly Phe Pro Trp Ser Asn Ala Met Leu Gln Trp Gln Arg Thr Gly Tyr
100 105 110
His Phe Gln Pro Glu Lys Asn Tyr Gln Asn Asp Pro Asn Gly Pro Val
115 120 125
Tyr Tyr Lys Gly Trp Tyr His Phe Phe Tyr Gln His Asn Pro Gly Gly
130 135 140
Thr Gly Trp Gly Asn Ile Ser Trp Gly His Ala Val Ser Arg Asp Met
145 150 155 160
Val His Trp Arg His Leu Pro Leu Ala Met Val Pro Glu His Trp Tyr
165 170 175
Asp Ile Glu Gly Val Leu Thr Gly Ser Ile Thr Val Leu Pro Asp Gly
180 185 190
Arg Val Ile Leu Leu Tyr Thr Gly Asn Thr Glu Thr Phe Ala Gln Val
195 200 205
Thr Cys Leu Ala Glu Ala Ala Asp Pro Ser Asp Pro Leu Leu Arg Glu
210 215 220
Trp Ala Lys His Pro Ala Asn Pro Val Val Tyr Pro Pro Pro Gly Ile
225 230 235 240
Gly Met Lys Asp Tyr Arg Asp Pro Thr Thr Ala Trp Phe Asp Asn Ser
245 250 255
Asp Asn Thr Trp Arg Ile Ile Ile Gly Ser Lys Asn Asp Thr Asp His
260 265 270
Ser Gly Ile Val Phe Thr Tyr Lys Thr Lys Asp Phe Val Ser Tyr Glu
275 280 285
Leu Ile Pro Gly Tyr Leu Tyr Arg Gly Pro Ala Gly Thr Gly Met Tyr
290 295 300
Glu Cys Ile Asp Leu Phe Ala Val Gly Gly Gly Arg Ala Ala Ser Asp
305 310 315 320
Met Tyr Asn Ser Thr Ala Glu Asp Val Leu Tyr Val Leu Lys Glu Ser
325 330 335
Ser Asp Asp Asp Arg Arg Asp Tyr Tyr Ala Leu Gly Arg Phe Asp Ala
340 345 350
Ala Ala Asn Thr Trp Thr Pro Ile Asp Thr Glu Arg Glu Leu Gly Val
355 360 365
Ala Leu Arg Tyr Asp Tyr Gly Arg Tyr Asp Thr Ser Lys Ser Phe Tyr
370 375 380
Asp Pro Val Lys Gln Arg Arg Ile Val Trp Gly Tyr Val Val Glu Thr
385 390 395 400
Asp Ser Trp Ser Ala Asp Ala Ala Lys Gly Trp Ala Asn Leu Gln Ser
405 410 415
Ile Pro Arg Thr Val Glu Leu Asp Glu Lys Thr Arg Thr Asn Leu Val
420 425 430
Gln Trp Pro Val Gly Glu Leu Asn Thr Leu Arg Ile Asn Thr Thr Asp
435 440 445
Leu Ser Asp Ile Thr Val Gly Ala Gly Ser Val Asp Ser Leu Pro Leu
450 455 460
His Gln Thr Ser Gln Leu Asp Ile Glu Ala Ser Phe Arg Ile Asn Ala
465 470 475 480
Ser Thr Ile Glu Ala Leu Asn Glu Val Asp Val Gly Tyr Asn Cys Thr
485 490 495
Met Thr Ser Gly Ala Ala Thr Arg Gly Ala Leu Gly Pro Phe Gly Ile
500 505 510
Leu Val Leu Ala Asn Val Ala Leu Thr Glu Gln Thr Ala Val Tyr Phe
515 520 525
Tyr Val Ser Lys Gly Leu Asp Gly Gly Leu Arg Thr His Phe Cys His
530 535 540
Asp Glu Leu Arg Ser Thr His Ala Thr Asp Val Ala Lys Glu Val Val
545 550 555 560
Gly Ser Thr Val Pro Val Leu Asp Gly Glu Asp Phe Ser Val Arg Val
565 570 575
Leu Val Asp His Ser Ile Val Gln Ser Phe Val Met Gly Gly Arg Met
580 585 590
Thr Ala Thr Ser Arg Ala Tyr Pro Thr Glu Ala Ile Tyr Ala Ala Ala
595 600 605
Gly Val Tyr Leu Phe Asn Asn Ala Thr Gly Ala Ser Ile Thr Ala Glu
610 615 620
Lys Leu Val Val His Asp Met Asp Ser Ser Tyr Asn Arg Ile Phe Thr
625 630 635 640
Asp Glu Asp Leu Leu Val Leu Asp
645
<210> 18
<211> 556
<212> PRT
<213> Triticum aestivum
<400> 18
Met Ala Asn Ala Phe Pro Trp Ser Asn Ala Met Leu Gln Trp Gln Arg
1 5 10 15
Thr Gly Phe His Phe Gln Pro Asp Lys Tyr Tyr Gln Asn Asp Pro Asn
20 25 30
Gly Pro Val Tyr Tyr Gly Gly Trp Tyr His Phe Phe Tyr Gln Tyr Asn
35 40 45
Pro Ser Gly Ser Val Trp Glu Pro Gln Ile Val Trp Gly His Ala Val
50 55 60
Ser Lys Asp Leu Ile His Trp Arg His Leu Pro Pro Ala Leu Val Pro
65 70 75 80
Asp Gln Trp Tyr Asp Ile Lys Gly Val Leu Thr Gly Ser Ile Thr Val
85 90 95
Leu Pro Asp Gly Lys Val Ile Leu Leu Tyr Thr Gly Asn Thr Glu Thr
100 105 110
Phe Ala Gln Val Thr Cys Leu Ala Glu Pro Ala Asp Pro Ser Asp Pro
115 120 125
Leu Leu Arg Glu Trp Val Lys His Pro Ala Asn Pro Val Val Phe Pro
130 135 140
Pro Pro Gly Ile Gly Met Lys Asp Phe Arg Asp Pro Thr Thr Ala Trp
145 150 155 160
Tyr Asp Glu Ser Asp Gly Thr Trp Arg Thr Ile Ile Gly Ser Lys Asn
165 170 175
Asp Ser Asp His Ser Gly Ile Val Phe Ser Tyr Lys Thr Lys Asp Phe
180 185 190
Ile Ser Tyr Glu Leu Met Pro Gly Tyr Met Tyr Arg Gly Pro Lys Gly
195 200 205
Thr Gly Glu Tyr Glu Cys Ile Asp Leu Tyr Ala Val Gly Gly Gly Arg
210 215 220
Lys Ala Ser Asp Met Tyr Asn Ser Thr Ala Glu Asp Val Leu Tyr Val
225 230 235 240
Leu Lys Glu Ser Ser Asp Asp Asp Arg His Asp Trp Tyr Ser Leu Gly
245 250 255
Arg Phe Asp Ala Ala Ala Asn Lys Trp Thr Pro Ile Asp Thr Glu Leu
260 265 270
Glu Leu Gly Val Gly Leu Arg Tyr Asp Trp Gly Lys Tyr Tyr Ala Ser
275 280 285
Lys Ser Phe Tyr Asp Pro Val Lys Lys Arg Arg Val Val Trp Ala Tyr
290 295 300
Val Gly Glu Thr Asp Ser Glu Arg Ala Asp Ile Thr Lys Gly Trp Ala
305 310 315 320
Asn Leu Gln Ser Ile Pro Arg Thr Val Glu Leu Asp Glu Lys Thr Arg
325 330 335
Thr Asn Leu Ile Gln Trp Pro Val Glu Glu Leu Asn Thr Leu Arg Ile
340 345 350
Asn Thr Thr Asp Leu Ser Gly Ile Thr Val Gly Ala Gly Ser Val Ala
355 360 365
Phe Leu Pro Leu His Gln Thr Ala Gln Leu Asp Ile Glu Ala Thr Phe
370 375 380
Arg Ile Asp Ala Ser Ala Ile Glu Ala Leu Asn Glu Ala Asp Val Ser
385 390 395 400
Tyr Asn Cys Thr Thr Ser Arg Gly Ala Ala Thr Arg Gly Ala Leu Gly
405 410 415
Pro Phe Gly Leu Leu Val Leu Ala Asn His Ala Leu Thr Glu Gln Thr
420 425 430
Gly Val Tyr Phe Tyr Val Ser Lys Gly Leu Asp Gly Gly Leu Arg Thr
435 440 445
His Phe Cys His Asp Glu Leu Arg Ser Ser His Ala Ser Asp Val Val
450 455 460
Lys Arg Val Val Gly Ser Thr Val Pro Val Leu Asp Gly Glu Asp Phe
465 470 475 480
Ser Val Arg Val Leu Val Asp His Ser Ile Val Gln Ser Phe Ala Met
485 490 495
Gly Gly Arg Leu Thr Ala Thr Ser Arg Ala Tyr Pro Thr Glu Ala Ile
500 505 510
Tyr Ala Ala Ala Gly Val Tyr Met Phe Asn Asn Ala Thr Gly Thr Ser
515 520 525
Val Thr Ala Glu Lys Leu Val Val His Asp Met Asp Ser Ser Tyr Asn
530 535 540
His Ile Tyr Thr Asp Gly Asp Leu Val Val Val Asp
545 550 555
<210> 19
<211> 623
<212> PRT
<213> Allium cepa
<400> 19
Met Glu Ser Arg Asp Ile Glu Ser Ser Pro Ala Leu Asn Ala Pro Leu
1 5 10 15
Leu Gln Ala Ser Pro Pro Ile Lys Ser Ser Lys Leu Lys Val Ala Leu
20 25 30
Leu Ala Thr Ser Thr Ser Val Leu Leu Leu Ile Ala Ala Phe Phe Ala
35 40 45
Val Lys Tyr Ser Val Phe Asp Ser Gly Ser Gly Leu Leu Lys Asp Asp
50 55 60
Pro Pro Ser Asp Ser Glu Asp Tyr Pro Trp Thr Asn Glu Met Leu Lys
65 70 75 80
Trp Gln Arg Thr Gly Tyr His Phe Gln Pro Pro Asn His Phe Met Ala
85 90 95
Asp Pro Asn Ala Ala Met Tyr Tyr Lys Gly Trp Tyr His Phe Phe Tyr
100 105 110
Gln Tyr Asn Pro Asn Gly Ser Ala Trp Asp Tyr Ser Ile Ser Trp Gly
115 120 125
His Ala Val Ser Lys Asp Met Ile His Trp Leu His Leu Pro Val Ala
130 135 140
Met Val Pro Asp His Trp Tyr Asp Ser Lys Gly Val Trp Ser Gly Tyr
145 150 155 160
Ala Thr Thr Leu Pro Asp Gly Arg Ile Ile Val Leu Tyr Thr Gly Gly
165 170 175
Thr Asp Gln Leu Val Gln Val Gln Asn Leu Ala Glu Pro Ala Asp Pro
180 185 190
Ser Asp Pro Leu Leu Ile Glu Trp Lys Lys Ser Asn Gly Asn Pro Ile
195 200 205
Leu Met Pro Pro Pro Gly Val Gly Pro His Asp Phe Arg Asp Pro Phe
210 215 220
Pro Val Trp Tyr Asn Glu Ser Asp Ser Thr Trp His Met Leu Ile Gly
225 230 235 240
Ser Lys Asp Asp Asn His Tyr Gly Thr Val Leu Ile Tyr Thr Thr Lys
245 250 255
Asp Phe Glu Thr Tyr Thr Leu Leu Pro Asp Ile Leu His Lys Thr Lys
260 265 270
Asp Ser Val Gly Met Leu Glu Cys Val Asp Leu Tyr Pro Val Ala Thr
275 280 285
Thr Gly Asn Gln Ile Gly Asn Gly Leu Glu Met Lys Gly Gly Ser Gly
290 295 300
Lys Gly Ile Lys His Val Leu Lys Ala Ser Met Asp Asp Glu Arg His
305 310 315 320
Asp Tyr Tyr Ala Ile Gly Thr Phe Asp Leu Glu Ser Phe Ser Trp Val
325 330 335
Pro Asp Asp Asp Thr Ile Asp Val Gly Val Gly Leu Arg Tyr Asp Tyr
340 345 350
Gly Lys Phe Tyr Ala Ser Lys Thr Phe Tyr Asp Gln Glu Lys Lys Arg
355 360 365
Arg Ile Leu Trp Gly Tyr Val Gly Glu Val Asp Ser Lys Ala Asp Asp
370 375 380
Ile Leu Lys Gly Trp Ala Ser Val Gln Asn Ile Ala Arg Thr Ile Leu
385 390 395 400
Phe Asp Ala Lys Thr Arg Ser Asn Leu Leu Val Trp Pro Val Glu Glu
405 410 415
Leu Asp Ala Leu Arg Thr Ser Gly Lys Glu Phe Asn Gly Val Val Val
420 425 430
Glu Pro Gly Ser Thr Tyr His Leu Asp Val Gly Thr Ala Thr Gln Leu
435 440 445
Asp Ile Glu Ala Glu Phe Glu Ile Asn Lys Glu Ala Val Asp Ala Val
450 455 460
Val Glu Ala Asp Val Thr Tyr Asn Cys Ser Thr Ser Asp Gly Ala Ala
465 470 475 480
His Arg Gly Leu Leu Gly Pro Phe Gly Leu Leu Val Leu Ala Asn Glu
485 490 495
Lys Met Thr Glu Lys Thr Ala Thr Tyr Phe Tyr Val Ser Arg Asn Val
500 505 510
Asp Gly Gly Leu Gln Thr His Phe Cys Gln Asp Glu Leu Arg Ser Ser
515 520 525
Lys Ala Asn Asp Ile Thr Lys Arg Val Val Gly His Thr Val Pro Val
530 535 540
Leu His Gly Glu Thr Phe Ser Leu Arg Ile Leu Val Asp His Ser Ile
545 550 555 560
Val Glu Ser Phe Ala Gln Lys Gly Arg Ala Val Ala Thr Ser Arg Val
565 570 575
Tyr Pro Thr Glu Ala Ile Tyr Asp Ser Thr Arg Val Phe Leu Phe Asn
580 585 590
Asn Ala Thr Ser Ala Thr Val Thr Ala Lys Ser Val Lys Ile Trp His
595 600 605
Met Asn Ser Thr His Asn His Pro Phe Pro Gly Phe Pro Ala Pro
610 615 620
<210> 20
<211> 621
<212> PRT
<213> Agave tequilana
<400> 20
Met Ala Ser Ser Thr Lys Asp Val Glu Ala Pro Pro Thr Leu Asp Ala
1 5 10 15
Pro Leu Leu Gly Ser Ala Ala Pro Arg Ser Arg Leu Arg Val Ala Ala
20 25 30
Val Ser Leu Ser Val Met Ala Phe Leu Leu Val Ala Ile Ala Ala Ala
35 40 45
Val Leu Tyr Tyr Asn Pro Gly Gly Val Ala Ser Asn Leu Met Arg Leu
50 55 60
Arg Glu Asn Asp Tyr Pro Trp Thr Asn Asp Met Leu Arg Trp Gln Arg
65 70 75 80
Thr Gly Phe His Phe Gln Pro Glu Lys Asn Phe Gln Ala Asp Pro Asn
85 90 95
Ala Ala Met Phe Tyr Lys Gly Trp Tyr His Phe Phe Tyr Gln Tyr Asn
100 105 110
Pro Thr Gly Val Ala Trp Asp Tyr Thr Ile Ser Trp Gly His Ala Val
115 120 125
Ser Lys Asp Leu Leu His Trp Asn Tyr Leu Pro Met Ala Leu Arg Pro
130 135 140
Asp His Trp Tyr Asp Arg Lys Gly Val Trp Ser Gly Tyr Ser Thr Leu
145 150 155 160
Leu Pro Asp Gly Arg Ile Val Val Leu Tyr Thr Gly Gly Thr Lys Glu
165 170 175
Leu Val Gln Val Gln Asn Leu Ala Val Pro Val Asn Leu Ser Asp Pro
180 185 190
Leu Leu Leu Glu Trp Lys Lys Ser His Val Asn Pro Ile Leu Val Pro
195 200 205
Pro Pro Gly Ile Glu Asp His Asp Phe Arg Asp Pro Phe Pro Val Trp
210 215 220
Tyr Asn Glu Ser Asp Ser Arg Trp His Val Val Ile Gly Ser Lys Asp
225 230 235 240
Pro Glu His Tyr Gly Ile Val Leu Ile Tyr Thr Thr Lys Asp Phe Val
245 250 255
Asn Phe Thr Leu Leu Pro Asn Ile Leu His Ser Thr Lys Gln Pro Val
260 265 270
Gly Met Leu Glu Cys Val Asp Leu Phe Pro Val Ala Thr Thr Asp Ser
275 280 285
Arg Ala Asn Gln Ala Leu Asp Met Thr Thr Met Arg Pro Gly Pro Gly
290 295 300
Leu Lys Tyr Val Leu Lys Ala Ser Met Asp Asp Glu Arg His Asp Tyr
305 310 315 320
Tyr Ala Leu Gly Ser Phe Asp Leu Asp Ser Phe Thr Phe Thr Pro Asp
325 330 335
Asp Glu Thr Ile Asp Val Gly Val Gly Leu Arg Tyr Asp Trp Gly Lys
340 345 350
Phe Tyr Ala Ser Lys Thr Phe Tyr Asp Gln Glu Lys His Arg Arg Val
355 360 365
Leu Trp Gly Tyr Val Gly Glu Val Asp Ser Lys Arg Asp Asp Ala Leu
370 375 380
Lys Gly Trp Ala Ser Leu Gln Asn Ile Pro Arg Thr Ile Leu Phe Asp
385 390 395 400
Thr Lys Thr Lys Ser Asn Leu Ile Leu Trp Pro Val Glu Glu Val Glu
405 410 415
Ser Leu Arg Thr Ile Asn Lys Asn Phe Asn Ser Ile Pro Leu Tyr Pro
420 425 430
Gly Ser Thr Tyr Gln Leu Asp Val Gly Glu Ala Thr Gln Leu Asp Ile
435 440 445
Val Ala Glu Phe Glu Val Asp Glu Lys Ala Ile Glu Ala Thr Ala Glu
450 455 460
Ala Asp Val Thr Tyr Asn Cys Ser Thr Ser Gly Gly Ala Ala Asn Arg
465 470 475 480
Gly Val Leu Gly Pro Phe Gly Leu Leu Val Leu Ala Asn Gln Glu Leu
485 490 495
Ser Glu Gln Thr Ala Thr Tyr Phe Tyr Val Ser Arg Gly Ile Asp Gly
500 505 510
Asn Leu Arg Thr His Phe Cys Gln Asp Glu Leu Arg Ser Ser Lys Ala
515 520 525
Gly Ala Ile Thr Lys Arg Val Val Gly Ser Thr Val Pro Val Leu His
530 535 540
Gly Glu Thr Trp Ala Leu Arg Ile Leu Val Asp His Ser Ile Val Glu
545 550 555 560
Ser Phe Ala Gln Arg Gly Arg Ala Val Ala Thr Ser Arg Val Tyr Pro
565 570 575
Thr Glu Ala Ile Tyr Ser Ser Ala Arg Val Phe Leu Phe Asn Asn Ala
580 585 590
Thr Asp Ala Ile Val Thr Ala Lys Thr Val Asn Val Trp His Met Asn
595 600 605
Ser Thr Tyr Asn His Val Phe Pro Gly Leu Val Ala Pro
610 615 620
<210> 21
<211> 623
<212> PRT
<213> Allium cepa
<400> 21
Met Glu Ser Arg Asp Ile Glu Ser Ser Pro Ala Leu Asn Ala Pro Leu
1 5 10 15
Leu Gln Thr Ser Pro Pro Ile Lys Ser Ser Lys Leu Lys Val Ala Leu
20 25 30
Leu Ala Thr Ser Thr Ser Val Leu Leu Leu Ile Ala Ala Phe Phe Ala
35 40 45
Val Lys Tyr Ser Val Phe Asp Ser Gly Ser Gly Leu Leu Lys Asp Asp
50 55 60
Pro Pro Ser Asp Ser Glu Asp Tyr Pro Trp Thr Asn Glu Met Leu Lys
65 70 75 80
Trp Gln Arg Thr Gly Tyr His Phe Gln Pro Pro Asn His Phe Met Ala
85 90 95
Asp Pro Asn Ala Ala Met Tyr Tyr Lys Gly Trp Tyr His Phe Phe Tyr
100 105 110
Gln Tyr Asn Pro Asn Gly Ser Ala Trp Asp Tyr Ser Ile Ser Trp Gly
115 120 125
His Ala Val Ser Lys Asp Met Ile His Trp Leu His Leu Pro Val Ala
130 135 140
Met Val Pro Asp His Trp Tyr Asp Ser Lys Gly Val Trp Ser Gly Tyr
145 150 155 160
Ala Thr Thr Leu Pro Asp Gly Arg Ile Ile Val Leu Tyr Thr Gly Gly
165 170 175
Thr Asp Gln Leu Val Gln Val Gln Asn Leu Ala Glu Pro Ala Asp Pro
180 185 190
Ser Asp Pro Leu Leu Ile Glu Trp Lys Lys Ser Asn Gly Asn Pro Ile
195 200 205
Leu Met Pro Pro Pro Gly Val Gly Pro His Asp Phe Arg Asp Pro Phe
210 215 220
Pro Val Trp Tyr Asn Glu Ser Asp Ser Thr Trp His Met Leu Ile Gly
225 230 235 240
Ser Lys Asp Asp Asn His Tyr Gly Thr Val Leu Ile Tyr Thr Thr Lys
245 250 255
Asp Phe Glu Thr Tyr Thr Leu Leu Pro Asp Ile Leu His Lys Thr Lys
260 265 270
Asp Ser Val Gly Met Leu Glu Cys Val Asp Leu Tyr Pro Val Ala Thr
275 280 285
Thr Gly Asn Gln Ile Gly Asn Gly Leu Glu Met Lys Gly Gly Ser Gly
290 295 300
Lys Gly Ile Lys His Val Leu Lys Ala Ser Met Asp Asp Glu Arg His
305 310 315 320
Asp Tyr Tyr Ala Ile Gly Thr Phe Asp Leu Glu Ser Phe Ser Trp Val
325 330 335
Pro Asp Asp Asp Thr Ile Asp Val Gly Val Gly Leu Arg Tyr Asp Tyr
340 345 350
Gly Lys Phe Tyr Ala Ser Lys Thr Phe Tyr Asp Gln Glu Lys Lys Arg
355 360 365
Arg Ile Leu Trp Gly Tyr Val Gly Glu Val Asp Ser Lys Ala Asp Asp
370 375 380
Ile Leu Lys Gly Trp Ala Ser Val Gln Asn Ile Ala Arg Thr Ile Leu
385 390 395 400
Phe Asp Ala Lys Thr Arg Ser Asn Leu Leu Val Trp Pro Val Glu Glu
405 410 415
Leu Asp Ala Leu Arg Thr Ser Gly Lys Glu Phe Asn Gly Val Val Val
420 425 430
Glu Pro Gly Ser Thr Tyr His Leu Asp Val Gly Thr Ala Thr Gln Leu
435 440 445
Asp Ile Glu Ala Glu Phe Glu Ile Asn Lys Glu Ala Val Asp Ala Val
450 455 460
Val Glu Ala Asp Val Thr Tyr Asn Cys Ser Thr Ser Asp Gly Ala Ala
465 470 475 480
His Arg Gly Leu Leu Gly Pro Phe Gly Leu Leu Val Leu Ala Asn Glu
485 490 495
Lys Met Thr Glu Lys Thr Ala Thr Tyr Phe Tyr Val Ser Arg Asn Ala
500 505 510
Asp Gly Gly Leu Gln Thr His Phe Cys Gln Asp Glu Leu Arg Ser Ser
515 520 525
Lys Ala Asn Asp Ile Thr Lys Arg Val Val Gly His Thr Val Pro Val
530 535 540
Leu His Gly Glu Thr Phe Ser Leu Arg Ile Leu Val Asp His Ser Ile
545 550 555 560
Val Glu Ser Phe Ala Gln Lys Gly Arg Ala Val Ala Thr Ser Arg Val
565 570 575
Tyr Pro Thr Glu Ala Ile Tyr Asp Ser Thr Arg Val Phe Leu Phe Asn
580 585 590
Asn Ala Thr Ser Ala Thr Val Thr Ala Lys Ser Val Lys Ile Trp His
595 600 605
Met Asn Ser Thr His Asn His Pro Phe Pro Gly Phe Pro Ala Pro
610 615 620
<210> 22
<211> 1917
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 22
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120
tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180
aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240
tctctcgaga aaagagaggc tgaagctgta cccggtaaat tagaatcgaa tgccgatgtc 300
gagtggcaac gttctgcata ccattttcag ccagacaaga acttcatatc cgatcctgac 360
ggcccaatgt atcacatggg atggtaccac ctattctacc aatataaccc ggaatcagct 420
atttggggga atatcacttg gggtcatagt gtgtctaggg acatgattaa ctggtttcac 480
ttgccattcg ctatggttcc agatcattgg tacgacatcg aaggtgttat gaccggtagc 540
gctacggttc ttcctaacgg tcaaatcatt atgttgtata ctggtaatgc gtacgatttg 600
tctcaattgc aatgcttagc ttatgccgtc aactcctcag atccactact cttggaatgg 660
aagaagtacg aaggtaatcc aatattgttc ccaccacccg gtgtcggtta caaagacttt 720
agagatcctt ccaccttatg gatgggccca gacggcgaat ggagaatggt tatgggtagt 780
aagcacaacg agacaatcgg atgtgctttg gtctatcgaa ctaccaattt cactcacttt 840
gaacttaacg aagaagtttt acatgctgta ccacacacag gaatgtggga atgtgtggat 900
ctctacccgg tcagcacgac ccatactaac gggttggaaa tgaaggacaa tggtccaaac 960
gttaaatata ttttaaagca atctggtgat gaggatagac acgactggta cgccattggt 1020
acattcgatc cagaaaagga caaatggtac cctgatgacc cagagaatga cgttggtatc 1080
ggtttgagat acgactatgg gaagttctat gccagtaaga ctttttacga tcaacataaa 1140
aagcggagag tattgtgggg ttacgttggt gaaactgatc caccaaagtc ggatctattg 1200
aaaggttggg ctaacattct caacatccct agatcagtcg ttttggatac ccagacagag 1260
actaatttga ttcaatggcc aatcgaagaa gttgaaaaac ttagatccaa gaagtacgac 1320
gaatttaagg acgtcgaact gcgtcctggt tctttgattc cattggaaat cggtaccgct 1380
acccaattgg atatatctgc aactttcgaa attgatgaaa agaaactgga gtctacttta 1440
gaagctgacg ttttattcaa ctgtacaact tcagaaggtt ccgtcggtag aggtgttcta 1500
ggccctttcg gtatcgttgt cttggctgat gctaacagat ccgaacaatt gccagtttac 1560
ttctacattg caaaggacac cgatggtact tctcgcacct atttctgtgc tgacgaatct 1620
cgttcttcga aggataagga tgtgggtaag tgggtttacg gatcttccgt accagtcctg 1680
gagggtgaaa actataatat gagattgctc gtcgatcatt cgattgtaga aggttttgcc 1740
caagggggta gaaccgttgt cacctctcgc gtttatccaa cgatggcaat ctacggtgcc 1800
gctaagatat ttttgttcaa caatgctacc ggtatttcag tgaaggctag tttaaaaatc 1860
tggaagatgg ctgaggccca attggacccc ttcccacttt ccggttggag cagttaa 1917
<210> 23
<211> 623
<212> PRT
<213> Phleum pratense
<400> 23
Met Ala Pro Pro Gln Ala Ile Ala Asn Gly Ala Pro Ala Pro Leu Pro
1 5 10 15
Tyr Ala Tyr Ala Arg Leu Pro Ser Ser Gly Asp Glu Lys Gln Asp Gln
20 25 30
Ser Lys Ser Gly Gly Ala Arg Tyr Cys Arg Ala Cys Val Ala Gly Val
35 40 45
Ala Ala Leu Leu Ile Val Ala Gly Ala Leu Ala Gly Ala Arg Val Gly
50 55 60
Leu Gly Gly Ile Tyr Asp Asp Ala Asp Ala Phe Ala Trp Asn Asn Ser
65 70 75 80
Met Leu Gln Trp Gln Arg Ala Gly Phe His Phe Gln Thr Glu Lys Asn
85 90 95
Phe Met Ser Asp Pro Asn Gly Pro Val Tyr Tyr Arg Gly Tyr Tyr His
100 105 110
Leu Phe Tyr Gln Tyr Asn Met Lys Gly Val Val Trp Asp Asp Gly Ile
115 120 125
Val Trp Gly His Val Val Ser Arg Asp Leu Val His Trp Arg His Leu
130 135 140
Pro Ile Ala Met Val Pro Asp His Trp Tyr Asp Ser Met Gly Val Leu
145 150 155 160
Ser Gly Ser Ile Thr Val Leu Gln Asn Gly Ser Leu Val Met Ile Tyr
165 170 175
Thr Gly Val Phe Ser Lys Thr Thr Asp Arg Ser Gly Met Met Glu Val
180 185 190
Gln Cys Leu Ala Val Pro Ala Asp Pro Asn Asp Pro Leu Leu Arg Ser
195 200 205
Trp Thr Lys His Pro Ala Asn Pro Val Leu Val His Pro Pro Gly Ile
210 215 220
Lys Asp Met Asp Phe Arg Asp Pro Thr Thr Ala Trp Phe Asp Glu Ser
225 230 235 240
Asp Ser Thr Tyr Arg Thr Val Ile Gly Thr Lys Asp Asp His His Gly
245 250 255
Ser His Ala Gly Phe Ala Met Val Tyr Lys Thr Lys Asp Phe Leu Ser
260 265 270
Phe Gln Arg Ile Pro Gly Ile Leu His Ser Val Glu His Thr Gly Met
275 280 285
Trp Glu Cys Met Asp Phe Tyr Pro Val Gly Gly Gly Asp Asn Ser Ser
290 295 300
Ser Glu Val Leu Tyr Val Ile Lys Ala Ser Met Asp Asp Glu Arg His
305 310 315 320
Asp Tyr Tyr Ala Leu Gly Met Tyr Asp Ala Ala Ala Asn Thr Trp Thr
325 330 335
Pro Leu Asp Gln Glu Leu Asp Leu Gly Ile Gly Leu Arg Tyr Asp Trp
340 345 350
Gly Lys Leu Tyr Ala Ser Thr Thr Phe Tyr Asp Pro Ala Lys Arg Arg
355 360 365
Arg Val Met Leu Gly Tyr Val Gly Glu Thr Asp Ser Arg Arg Ser Asp
370 375 380
Glu Ala Lys Gly Trp Ala Ser Ile Gln Ser Ile Pro Arg Thr Val Ala
385 390 395 400
Leu Asp Glu Lys Thr Arg Thr Asn Leu Leu Leu Trp Pro Val Glu Glu
405 410 415
Ile Glu Thr Leu Arg Leu Asn Ala Thr Glu Phe Asn Asp Ile Asn Ile
420 425 430
Asp Thr Gly Ser Val Phe His Leu Pro Ile Arg Gln Gly Asn Gln Leu
435 440 445
Asp Ile Glu Ala Ser Phe Arg Leu Asp Ala Ser Ala Val Ala Ala Ile
450 455 460
Asn Glu Ala Asp Val Gly Tyr Asn Cys Ser Ser Ser Gly Gly Ala Ala
465 470 475 480
Thr Arg Gly Ala Leu Gly Pro Phe Gly Leu Leu Val Leu Ala Ala Glu
485 490 495
Gly Ile Gly Glu Gln Thr Ala Val Tyr Phe Tyr Val Ser Arg Gly Leu
500 505 510
Asp Gly Gly Leu Arg Thr Ser Phe Cys Asn Asp Glu Leu Arg Ser Ser
515 520 525
Trp Ala Arg Asp Val Thr Lys Arg Val Val Gly Ser Thr Val Pro Val
530 535 540
Leu Asn Gly Glu Thr Leu Ser Met Arg Val Leu Val Asp His Ser Ile
545 550 555 560
Val Gln Ser Phe Ala Met Gly Gly Arg Val Thr Ala Thr Ser Arg Val
565 570 575
Tyr Pro Thr Glu Ala Ile Tyr Ala Ala Ala Gly Val Tyr Leu Phe Asn
580 585 590
Asn Ala Thr Asn Ala Ser Val Thr Ala Glu Arg Ile Ile Val His Glu
595 600 605
Met Asp Ser Ile Asp Asn Asn Gln Ile Phe Leu Ile Asp Asp Leu
610 615 620
<210> 24
<211> 562
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 24
Asn Leu Met Arg Leu Arg Glu Asn Asp Tyr Pro Trp Thr Asn Asp Met
1 5 10 15
Leu Arg Trp Gln Arg Thr Gly Phe His Phe Gln Pro Gly Lys Asn Phe
20 25 30
Gln Ala Asp Pro Asn Ala Ala Met Phe Tyr Lys Gly Trp Tyr His Phe
35 40 45
Phe Tyr Gln Tyr Asn Pro Thr Gly Val Ala Trp Asp Tyr Thr Ile Ser
50 55 60
Trp Gly His Ala Val Ser Lys Asp Leu Leu His Trp Asn Tyr Leu Pro
65 70 75 80
Met Ala Leu Arg Pro Asp His Trp Tyr Asp Arg Lys Gly Val Trp Ser
85 90 95
Gly Tyr Ser Thr Leu Leu Pro Asp Gly Arg Ile Val Val Leu Tyr Thr
100 105 110
Gly Gly Thr Lys Glu Leu Val Gln Val Gln Asn Leu Ala Val Pro Val
115 120 125
Asn Leu Ser Asp Pro Leu Leu Leu Glu Trp Lys Lys Ser His Val Asn
130 135 140
Pro Ile Leu Val Pro Pro Pro Gly Ile Glu Asp His Asp Phe Arg Asp
145 150 155 160
Pro Phe Pro Val Trp Tyr Asn Glu Ser Asp Ser Arg Trp His Val Val
165 170 175
Ile Gly Ser Lys Asp Pro Glu His Tyr Gly Ile Val Leu Ile Tyr Thr
180 185 190
Thr Lys Asp Phe Val Asn Phe Thr Leu Leu Pro Asn Ile Leu His Ser
195 200 205
Thr Lys Gln Pro Val Gly Met Leu Glu Cys Val Asp Leu Phe Pro Val
210 215 220
Ala Thr Thr Asp Ser Arg Ala Asn Gln Ala Leu Asp Met Thr Thr Met
225 230 235 240
Arg Pro Gly Pro Gly Leu Lys Tyr Val Leu Lys Ala Ser Met Asp Asp
245 250 255
Glu Arg His Asp Tyr Tyr Ala Leu Gly Ser Phe Asp Leu Asp Ser Phe
260 265 270
Thr Phe Thr Pro Asp Asp Glu Thr Ile Asp Val Gly Ile Gly Leu Arg
275 280 285
Tyr Asp Trp Gly Lys Phe Tyr Ala Ser Lys Thr Phe Tyr Asp Gln Glu
290 295 300
Lys Gln Arg Arg Val Leu Trp Gly Tyr Val Gly Glu Val Asp Ser Lys
305 310 315 320
Arg Asp Asp Ala Leu Lys Gly Trp Ala Ser Leu Gln Asn Ile Pro Arg
325 330 335
Thr Ile Leu Phe Asp Thr Lys Thr Lys Ser Asn Leu Ile Leu Trp Pro
340 345 350
Val Glu Glu Val Glu Ser Leu Arg Thr Ile Asn Lys Asn Phe Asn Ser
355 360 365
Ile Pro Leu Tyr Pro Gly Ser Thr Tyr Gln Leu Asp Val Gly Glu Ala
370 375 380
Thr Gln Leu Asp Ile Val Ala Glu Phe Glu Val Asp Glu Lys Ala Ile
385 390 395 400
Glu Ala Thr Ala Glu Ala Asp Val Thr Tyr Asn Cys Ser Thr Ser Gly
405 410 415
Gly Ala Ala Asn Arg Gly Val Leu Gly Pro Phe Gly Leu Leu Val Leu
420 425 430
Ala Asn Gln Glu Leu Ser Glu Gln Thr Ala Thr Tyr Phe Tyr Val Ser
435 440 445
Arg Gly Ile Asp Gly Asn Leu Arg Thr His Phe Cys Gln Asp Glu Leu
450 455 460
Arg Ser Ser Lys Ala Gly Ala Ile Thr Lys Arg Val Val Gly Ser Thr
465 470 475 480
Val Pro Val Leu His Gly Glu Thr Trp Ala Leu Arg Ile Leu Val Asp
485 490 495
His Ser Ile Val Glu Ser Phe Ala Gln Arg Gly Arg Ala Val Ala Thr
500 505 510
Ser Arg Val Tyr Pro Thr Glu Ala Ile Tyr Ser Ser Ala Arg Val Phe
515 520 525
Leu Phe Asn Asn Ala Thr Asp Ala Ile Val Thr Ala Lys Thr Val Asn
530 535 540
Val Trp His Ile Asn Ser Thr Tyr Asn His Val Phe Pro Gly Leu Val
545 550 555 560
Ala Pro
<210> 25
<211> 648
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 25
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Asp Leu Asn Gln Pro Tyr Arg
85 90 95
Thr Gly Tyr His Phe Gln Pro Leu Lys Asn Trp Met Asn Gly Pro Met
100 105 110
Ile Tyr Lys Gly Ile Tyr His Leu Phe Tyr Gln Tyr Asn Pro Tyr Gly
115 120 125
Ala Val Trp Asp Val Arg Ile Val Trp Gly His Ser Thr Ser Val Asp
130 135 140
Leu Val Asn Trp Ile Ser Gln Pro Pro Ala Phe Asn Pro Ser Gln Pro
145 150 155 160
Ser Asp Ile Asn Gly Cys Trp Ser Gly Ser Val Thr Ile Leu Pro Asn
165 170 175
Gly Lys Pro Val Ile Leu Tyr Thr Gly Ile Asp Gln Asn Lys Gly Gln
180 185 190
Val Gln Asn Val Ala Val Pro Val Asn Ile Ser Asp Pro Tyr Leu Arg
195 200 205
Glu Trp Ser Lys Pro Pro Gln Asn Pro Leu Met Thr Thr Asn Ala Val
210 215 220
Asn Gly Ile Asn Pro Asp Arg Phe Arg Asp Pro Thr Thr Ala Trp Leu
225 230 235 240
Gly Arg Asp Gly Glu Trp Arg Val Ile Val Gly Ser Ser Thr Asp Asp
245 250 255
Arg Arg Gly Leu Ala Ile Leu Tyr Lys Ser Arg Asp Phe Phe Asn Trp
260 265 270
Thr Gln Ser Met Lys Pro Leu His Tyr Glu Asp Leu Thr Gly Met Trp
275 280 285
Glu Cys Pro Asp Phe Phe Pro Val Ser Ile Thr Gly Ser Asp Gly Val
290 295 300
Glu Thr Ser Ser Val Gly Glu Asn Gly Ile Lys His Val Leu Lys Val
305 310 315 320
Ser Leu Ile Glu Thr Leu His Asp Tyr Tyr Thr Ile Gly Ser Tyr Asp
325 330 335
Arg Glu Lys Asp Val Tyr Val Pro Asp Leu Gly Phe Val Gln Asn Glu
340 345 350
Ser Ala Pro Arg Leu Asp Tyr Gly Lys Tyr Tyr Ala Ser Lys Thr Phe
355 360 365
Tyr Asp Asp Val Lys Lys Arg Arg Ile Leu Trp Gly Trp Val Asn Glu
370 375 380
Ser Ser Pro Ala Lys Asp Asp Ile Glu Lys Gly Trp Ser Gly Leu Gln
385 390 395 400
Ser Phe Pro Arg Lys Ile Trp Leu Asp Glu Ser Gly Lys Glu Leu Leu
405 410 415
Gln Trp Pro Ile Glu Glu Ile Glu Thr Leu Arg Gly Gln Gln Val Asn
420 425 430
Trp Gln Lys Lys Val Leu Lys Ala Gly Ser Thr Leu Gln Val His Gly
435 440 445
Val Thr Ala Ala Gln Ala Asp Val Glu Val Ser Phe Lys Val Lys Glu
450 455 460
Leu Glu Lys Ala Asp Val Ile Glu Pro Ser Trp Thr Asp Pro Gln Lys
465 470 475 480
Ile Cys Ser Gln Gly Asp Leu Ser Val Met Ser Gly Leu Gly Pro Phe
485 490 495
Gly Leu Met Val Leu Ala Ser Asn Asp Met Glu Glu Tyr Thr Ser Val
500 505 510
Tyr Phe Arg Ile Phe Lys Ser Asn Asp Asp Thr Asn Lys Lys Thr Lys
515 520 525
Tyr Val Val Leu Met Cys Ser Asp Gln Ser Arg Ser Ser Leu Asn Asp
530 535 540
Glu Asn Asp Lys Ser Thr Phe Gly Ala Phe Val Ala Ile Asp Pro Ser
545 550 555 560
His Gln Thr Ile Ser Leu Arg Thr Leu Ile Asp His Ser Ile Val Glu
565 570 575
Ser Tyr Gly Gly Gly Gly Arg Thr Cys Ile Thr Ser Arg Val Tyr Pro
580 585 590
Lys Leu Ala Ile Gly Glu Asn Ala Asn Leu Phe Val Phe Asn Lys Gly
595 600 605
Thr Gln Ser Val Asp Ile Leu Thr Leu Ser Ala Trp Ser Leu Lys Ser
610 615 620
Ala Gln Ile Asn Gly Asp Leu Met Ser Pro Phe Ile Glu Arg Glu Glu
625 630 635 640
Ser Arg Ser Pro Asn His Gln Phe
645
<210> 26
<211> 559
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 26
Asp Leu Asn Gln Pro Tyr Arg Thr Gly Tyr His Phe Gln Pro Leu Lys
1 5 10 15
Asn Trp Met Asn Gly Pro Met Ile Tyr Lys Gly Ile Tyr His Leu Phe
20 25 30
Tyr Gln Tyr Asn Pro Tyr Gly Ala Val Trp Asp Val Arg Ile Val Trp
35 40 45
Gly His Ser Thr Ser Val Asp Leu Val Asn Trp Ile Ser Gln Pro Pro
50 55 60
Ala Phe Asn Pro Ser Gln Pro Ser Asp Ile Asn Gly Cys Trp Ser Gly
65 70 75 80
Ser Val Thr Ile Leu Pro Asn Gly Lys Pro Val Ile Leu Tyr Thr Gly
85 90 95
Ile Asp Gln Asn Lys Gly Gln Val Gln Asn Val Ala Val Pro Val Asn
100 105 110
Ile Ser Asp Pro Tyr Leu Arg Glu Trp Ser Lys Pro Pro Gln Asn Pro
115 120 125
Leu Met Thr Thr Asn Ala Val Asn Gly Ile Asn Pro Asp Arg Phe Arg
130 135 140
Asp Pro Thr Thr Ala Trp Leu Gly Arg Asp Gly Glu Trp Arg Val Ile
145 150 155 160
Val Gly Ser Ser Thr Asp Asp Arg Arg Gly Leu Ala Ile Leu Tyr Lys
165 170 175
Ser Arg Asp Phe Phe Asn Trp Thr Gln Ser Met Lys Pro Leu His Tyr
180 185 190
Glu Asp Leu Thr Gly Met Trp Glu Cys Pro Asp Phe Phe Pro Val Ser
195 200 205
Ile Thr Gly Ser Asp Gly Val Glu Thr Ser Ser Val Gly Glu Asn Gly
210 215 220
Ile Lys His Val Leu Lys Val Ser Leu Ile Glu Thr Leu His Asp Tyr
225 230 235 240
Tyr Thr Ile Gly Ser Tyr Asp Arg Glu Lys Asp Val Tyr Val Pro Asp
245 250 255
Leu Gly Phe Val Gln Asn Glu Ser Ala Pro Arg Leu Asp Tyr Gly Lys
260 265 270
Tyr Tyr Ala Ser Lys Thr Phe Tyr Asp Asp Val Lys Lys Arg Arg Ile
275 280 285
Leu Trp Gly Trp Val Asn Glu Ser Ser Pro Ala Lys Asp Asp Ile Glu
290 295 300
Lys Gly Trp Ser Gly Leu Gln Ser Phe Pro Arg Lys Ile Trp Leu Asp
305 310 315 320
Glu Ser Gly Lys Glu Leu Leu Gln Trp Pro Ile Glu Glu Ile Glu Thr
325 330 335
Leu Arg Gly Gln Gln Val Asn Trp Gln Lys Lys Val Leu Lys Ala Gly
340 345 350
Ser Thr Leu Gln Val His Gly Val Thr Ala Ala Gln Ala Asp Val Glu
355 360 365
Val Ser Phe Lys Val Lys Glu Leu Glu Lys Ala Asp Val Ile Glu Pro
370 375 380
Ser Trp Thr Asp Pro Gln Lys Ile Cys Ser Gln Gly Asp Leu Ser Val
385 390 395 400
Met Ser Gly Leu Gly Pro Phe Gly Leu Met Val Leu Ala Ser Asn Asp
405 410 415
Met Glu Glu Tyr Thr Ser Val Tyr Phe Arg Ile Phe Lys Ser Asn Asp
420 425 430
Asp Thr Asn Lys Lys Thr Lys Tyr Val Val Leu Met Cys Ser Asp Gln
435 440 445
Ser Arg Ser Ser Leu Asn Asp Glu Asn Asp Lys Ser Thr Phe Gly Ala
450 455 460
Phe Val Ala Ile Asp Pro Ser His Gln Thr Ile Ser Leu Arg Thr Leu
465 470 475 480
Ile Asp His Ser Ile Val Glu Ser Tyr Gly Gly Gly Gly Arg Thr Cys
485 490 495
Ile Thr Ser Arg Val Tyr Pro Lys Leu Ala Ile Gly Glu Asn Ala Asn
500 505 510
Leu Phe Val Phe Asn Lys Gly Thr Gln Ser Val Asp Ile Leu Thr Leu
515 520 525
Ser Ala Trp Ser Leu Lys Ser Ala Gln Ile Asn Gly Asp Leu Met Ser
530 535 540
Pro Phe Ile Glu Arg Glu Glu Ser Arg Ser Pro Asn His Gln Phe
545 550 555
<210> 27
<211> 654
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 27
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Arg Ser Asp Pro Ile Lys Glu
85 90 95
His Asp Tyr Pro Trp Thr Asn Glu Met Leu Thr Trp Gln Arg Ser Gly
100 105 110
Phe His Phe Gln Pro Ala Lys Asn Phe Gln Ser Asp Pro Asn Ala Ala
115 120 125
Met Tyr Tyr Lys Gly Trp Tyr His Phe Phe Tyr Gln Tyr Asn Pro Thr
130 135 140
Gly Thr Ala Trp Asp Tyr Thr Ile Ser Trp Gly His Ala Val Ser Arg
145 150 155 160
Asp Leu Ile His Trp Leu His Leu Pro Met Ala Met Val Pro Asp His
165 170 175
Trp Tyr Asp Ala Lys Gly Val Trp Ser Gly Tyr Ser Thr Leu Leu Pro
180 185 190
Asp Gly Arg Val Ile Val Leu Tyr Thr Gly Gly Thr Pro Glu Leu Val
195 200 205
Gln Val Gln Asn Leu Ala Val Pro Ala Asp Ala Ser Asp Pro Leu Leu
210 215 220
Leu Lys Trp Lys Lys Ser Ser Val Asn Pro Ile Leu Val Pro Pro Pro
225 230 235 240
Gly Ile Gly Thr Ser Asp Phe Arg Asp Pro Phe Pro Ile Trp Tyr Asn
245 250 255
Glu Thr Asp Ser Asn Trp His Val Leu Ile Gly Ser Lys Asp Ser Asn
260 265 270
His His Gly Ile Val Leu Leu Tyr Lys Thr Lys Asp Phe Phe Asn Phe
275 280 285
Thr Leu Leu Pro Ser Leu Leu His Thr Ser Thr Gln Ser Val Gly Met
290 295 300
Phe Glu Cys Val Asp Leu Tyr Pro Val Ala Thr Gly Gly Pro Leu Ser
305 310 315 320
Asn Arg Gly Leu Glu Met Ser Val Asp Leu Ser Asn Gly Gly Ile Lys
325 330 335
His Val Leu Lys Ala Ser Met Asp Glu Glu Arg His Asp Tyr Tyr Ala
340 345 350
Ile Gly Thr Phe Asp Leu Asp Ser Phe Lys Trp Thr Pro Asp Asp Pro
355 360 365
Ser Ile Asp Val Gly Val Gly Leu Arg Tyr Asp Trp Gly Lys Phe Tyr
370 375 380
Ala Ser Lys Thr Phe Phe Asp Thr Glu Lys Gln Arg Arg Ile Leu Trp
385 390 395 400
Gly Tyr Val Gly Glu Val Asp Ser Lys Asp Asp Asp Lys Met Lys Gly
405 410 415
Trp Ala Thr Leu Gln Asn Ile Pro Arg Thr Ile Leu Leu Asp Thr Lys
420 425 430
Thr Gln Ser Asn Leu Ile Ile Trp Pro Val Glu Glu Val Glu Asp Leu
435 440 445
Arg Thr Asp Gly Asn Ile Phe Asn Asp Ile Lys Ile Gly Ala Gly Ser
450 455 460
Ser Val Gln Leu Asp Ile Gly Ala Ala Ser Gln Leu Asp Ile Glu Ala
465 470 475 480
Glu Phe Glu Leu Asp Asn Ser Ala Leu Asp Gly Ala Ile Glu Ala Asp
485 490 495
Val Thr Tyr Asn Cys Ser Thr Ser Gly Gly Ala Ala Asn Arg Gly Leu
500 505 510
Leu Gly Pro Phe Gly Leu Leu Val Leu Ala Asn Gln Asp Leu Thr Glu
515 520 525
Gln Thr Ala Thr Tyr Phe Tyr Val Ser Arg Gly Thr Asp Gly Asp Leu
530 535 540
Arg Thr His Phe Cys Gln Asp Glu Leu Arg Ser Ser Lys Ala Gly Asp
545 550 555 560
Ile Val Lys Arg Val Val Gly Ser Val Val Pro Val Leu His Gly Glu
565 570 575
Thr Trp Ser Leu Arg Ile Leu Val Asp His Ser Ile Ile Glu Ser Phe
580 585 590
Ala Gln Arg Gly Arg Ala Val Ala Thr Ser Arg Val Tyr Pro Thr Glu
595 600 605
Ala Ile Tyr Asn Lys Ala Arg Leu Phe Leu Phe Asn Asn Ala Thr Asp
610 615 620
Ala Lys Val Thr Ala Lys Ser Val Lys Ile Trp His Met Asn Ser Thr
625 630 635 640
His Asn His Pro Phe Pro Gly Leu Glu Ser Leu Phe Glu Ser
645 650
<210> 28
<211> 565
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 28
Arg Ser Asp Pro Ile Lys Glu His Asp Tyr Pro Trp Thr Asn Glu Met
1 5 10 15
Leu Thr Trp Gln Arg Ser Gly Phe His Phe Gln Pro Ala Lys Asn Phe
20 25 30
Gln Ser Asp Pro Asn Ala Ala Met Tyr Tyr Lys Gly Trp Tyr His Phe
35 40 45
Phe Tyr Gln Tyr Asn Pro Thr Gly Thr Ala Trp Asp Tyr Thr Ile Ser
50 55 60
Trp Gly His Ala Val Ser Arg Asp Leu Ile His Trp Leu His Leu Pro
65 70 75 80
Met Ala Met Val Pro Asp His Trp Tyr Asp Ala Lys Gly Val Trp Ser
85 90 95
Gly Tyr Ser Thr Leu Leu Pro Asp Gly Arg Val Ile Val Leu Tyr Thr
100 105 110
Gly Gly Thr Pro Glu Leu Val Gln Val Gln Asn Leu Ala Val Pro Ala
115 120 125
Asp Ala Ser Asp Pro Leu Leu Leu Lys Trp Lys Lys Ser Ser Val Asn
130 135 140
Pro Ile Leu Val Pro Pro Pro Gly Ile Gly Thr Ser Asp Phe Arg Asp
145 150 155 160
Pro Phe Pro Ile Trp Tyr Asn Glu Thr Asp Ser Asn Trp His Val Leu
165 170 175
Ile Gly Ser Lys Asp Ser Asn His His Gly Ile Val Leu Leu Tyr Lys
180 185 190
Thr Lys Asp Phe Phe Asn Phe Thr Leu Leu Pro Ser Leu Leu His Thr
195 200 205
Ser Thr Gln Ser Val Gly Met Phe Glu Cys Val Asp Leu Tyr Pro Val
210 215 220
Ala Thr Gly Gly Pro Leu Ser Asn Arg Gly Leu Glu Met Ser Val Asp
225 230 235 240
Leu Ser Asn Gly Gly Ile Lys His Val Leu Lys Ala Ser Met Asp Glu
245 250 255
Glu Arg His Asp Tyr Tyr Ala Ile Gly Thr Phe Asp Leu Asp Ser Phe
260 265 270
Lys Trp Thr Pro Asp Asp Pro Ser Ile Asp Val Gly Val Gly Leu Arg
275 280 285
Tyr Asp Trp Gly Lys Phe Tyr Ala Ser Lys Thr Phe Phe Asp Thr Glu
290 295 300
Lys Gln Arg Arg Ile Leu Trp Gly Tyr Val Gly Glu Val Asp Ser Lys
305 310 315 320
Asp Asp Asp Lys Met Lys Gly Trp Ala Thr Leu Gln Asn Ile Pro Arg
325 330 335
Thr Ile Leu Leu Asp Thr Lys Thr Gln Ser Asn Leu Ile Ile Trp Pro
340 345 350
Val Glu Glu Val Glu Asp Leu Arg Thr Asp Gly Asn Ile Phe Asn Asp
355 360 365
Ile Lys Ile Gly Ala Gly Ser Ser Val Gln Leu Asp Ile Gly Ala Ala
370 375 380
Ser Gln Leu Asp Ile Glu Ala Glu Phe Glu Leu Asp Asn Ser Ala Leu
385 390 395 400
Asp Gly Ala Ile Glu Ala Asp Val Thr Tyr Asn Cys Ser Thr Ser Gly
405 410 415
Gly Ala Ala Asn Arg Gly Leu Leu Gly Pro Phe Gly Leu Leu Val Leu
420 425 430
Ala Asn Gln Asp Leu Thr Glu Gln Thr Ala Thr Tyr Phe Tyr Val Ser
435 440 445
Arg Gly Thr Asp Gly Asp Leu Arg Thr His Phe Cys Gln Asp Glu Leu
450 455 460
Arg Ser Ser Lys Ala Gly Asp Ile Val Lys Arg Val Val Gly Ser Val
465 470 475 480
Val Pro Val Leu His Gly Glu Thr Trp Ser Leu Arg Ile Leu Val Asp
485 490 495
His Ser Ile Ile Glu Ser Phe Ala Gln Arg Gly Arg Ala Val Ala Thr
500 505 510
Ser Arg Val Tyr Pro Thr Glu Ala Ile Tyr Asn Lys Ala Arg Leu Phe
515 520 525
Leu Phe Asn Asn Ala Thr Asp Ala Lys Val Thr Ala Lys Ser Val Lys
530 535 540
Ile Trp His Met Asn Ser Thr His Asn His Pro Phe Pro Gly Leu Glu
545 550 555 560
Ser Leu Phe Glu Ser
565
<210> 29
<211> 1947
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 29
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120
tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180
aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240
tctctcgaga aaagagaggc tgaagctgac ttgaatcaac cttatagaac cggttaccac 300
ttccagccat taaaaaactg gatgaacggc ccaatgattt acaagggaat ctatcatctg 360
ttttaccaat acaacccata cggtgccgtg tgggatgtaa ggattgtctg gggtcacagt 420
acttccgtcg atttggttaa ttggataagc caacccccgg cattcaaccc atcacaacca 480
tctgacatca acggttgttg gtcgggttct gttacgattc tacctaatgg gaagccagtt 540
atcctttata caggtattga tcaaaacaag ggtcaagttc agaatgtcgc ggttccagtc 600
aatatctctg acccatattt gcgtgaatgg tccaaaccac ctcaaaaccc attgatgact 660
accaacgctg ttaacggtat caaccctgat agatttagag atccaactac agcttggcta 720
ggaagagatg gtgagtggag agtcattgtg ggttcatcta ccgacgaccg ccggggtttg 780
gccatattat acaagtcccg cgatttcttt aattggactc aatctatgaa accgttgcat 840
tacgaagatt tgaccggaat gtgggaatgc ccagacttct tcccagtttc aattacgggg 900
agtgatggtg tggaaacttc ttccgtaggt gaaaacggta taaagcacgt tctcaaggtc 960
agcttaatcg aaactttgca tgactactat accattggtt cgtatgacag agagaaggat 1020
gtctacgttc ctgacttagg tttcgtccaa aatgaatccg ctccacgttt ggattacggg 1080
aaatactacg cctctaagac attttatgac gacgtcaaaa agcggagaat tttatggggt 1140
tgggttaacg aatcttcgcc agctaaggac gatattgaaa agggctggtc tggtttgcag 1200
tcatttccaa gaaagatttg gttggacgag agcggtaaag aattgctgca atggccaatc 1260
gaagaaatag aaactctacg tggccaacaa gttaactggc aaaagaaggt tttgaaggct 1320
ggttctacct tacaagtcca cggtgttact gctgctcaag cggatgtaga ggtttccttc 1380
aaagtcaagg aattggaaaa agcagacgtc atcgaaccct cctggaccga tccccaaaaa 1440
atatgttcgc agggtgactt gtctgttatg tctggtttag gtccgttcgg tcttatggtt 1500
cttgcttcta atgatatgga agaatacact tccgtttact tcagaatctt caagagtaac 1560
gatgatacta ataaaaagac caagtatgtt gtgctcatgt gttccgatca atcaagaagt 1620
tctttgaacg atgagaacga taagtcaacc tttggggcct ttgttgctat tgatccatct 1680
catcagacca tctctctccg aacattgatt gaccactcca tagtcgaatc atacggtggt 1740
ggtggcagaa cttgtatcac gagtagagta tatccaaagt tggccatcgg tgaaaatgca 1800
aatttattcg tctttaacaa gggtactcaa tctgttgaca ttctgacttt aagcgcttgg 1860
tcccttaaga gtgctcaaat taacggagac ttgatgtctc ctttcatcga gagagaagaa 1920
agtagatcac ccaaccatca attctaa 1947
<210> 30
<211> 1965
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 30
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120
tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180
aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240
tctctcgaga aaagagaggc tgaagctaga tcagatccta ttaaagagca tgactatcca 300
tggactaatg aaatgttgac atggcaacgt agtggatttc acttccagcc cgctaagaac 360
ttccaatccg acccaaacgc agccatgtac tacaagggct ggtatcactt cttttaccaa 420
tacaatccga ccggtactgc ttgggattac acgatctctt ggggtcatgc tgtctcgcgg 480
gacttaatac actggcttca tctgccaatg gctatggtac cagatcactg gtatgatgcg 540
aagggtgtgt ggtccggtta ctctacccta ttgccagatg gtagagttat tgtcttatat 600
actggtggta ccccagaatt ggttcaagtt caaaacttgg ccgttcctgc tgacgcctct 660
gatccactgt tgttgaaatg gaagaagtcc tcagtcaacc ccatccttgt tccgccacca 720
gggattggaa ctagcgactt cagggatcca tttcctatct ggtacaatga aacagactcc 780
aactggcacg tcttgatagg ttctaaagac tccaaccacc atggtattgt attattgtat 840
aagactaagg acttctttaa cttcacattg cttccatctt tattgcacac cagtacccag 900
agcgttggta tgttcgaatg cgtggatctc tacccagtcg ctactggtgg gccactatct 960
aatagaggtt tggaaatgag cgttgatctc tcaaatggtg gtatcaaaca tgttttgaag 1020
gcttctatgg atgaggaaag acatgactac tatgcgattg gcacctttga cttagattct 1080
ttcaaatgga cgcccgacga tccaagtatc gacgttggtg tcggtctaag atacgattgg 1140
ggtaagttct acgcttctaa gacctttttt gatactgaaa agcaacgccg aattttatgg 1200
ggctatgtcg gtgaagttga ctccaaggat gatgacaaga tgaaaggttg ggcaacctta 1260
caaaatatac ctagaactat cttgcttgac acgaaaactc aatctaactt gattatctgg 1320
ccagtcgagg aagttgaaga tttgagaact gacggcaaca ttttcaacga tataaaaatt 1380
ggtgctggtt cttcagtaca attggatatt ggtgccgctt cgcagttgga catcgaagcc 1440
gaatttgaac tagataacag tgctttggac ggcgctattg aagctgatgt cacttacaat 1500
tgttcaactt cgggtggtgc cgcaaataga ggtttgctgg ggcctttcgg tttacttgtt 1560
ttagctaacc aagacttgac agaacaaacc gctacatact tctacgtgtc cagaggtacc 1620
gatggtgatt tgagaaccca cttctgtcaa gacgaattac gttcctccaa ggcaggagac 1680
attgtcaagc gcgttgttgg ttctgtggtg ccagttctac atggtgaaac ttggtccttg 1740
agaattttgg ttgaccactc tatcatcgaa agctttgcac aaagaggacg ggctgttgct 1800
acctctaggg tctacccaac tgaggcaatc tacaacaaag ccagactgtt tttgttcaac 1860
aatgctacag acgctaaggt tactgccaag agtgttaaaa tatggcatat gaactctaca 1920
cacaaccatc cattccctgg tttagaatcg ctattcgaat cataa 1965
<210> 31
<211> 542
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 31
Ser Ser Val Gln Pro Ser Ala Ala Glu Arg Leu Thr Trp Glu Arg Thr
1 5 10 15
Ala Phe His Phe Gln Pro Ala Lys Asn Phe Ile Tyr Asp Pro Asn Gly
20 25 30
Pro Leu Phe His Met Gly Trp His His Leu Phe Tyr Gln Tyr Asn Pro
35 40 45
Tyr Ala Pro Val Trp Gly Asn Met Ser Trp Gly His Ala Val Ser Lys
50 55 60
Asp Met Ile Asn Trp Phe Glu Leu Pro Val Ala Leu Val Pro Thr Glu
65 70 75 80
Trp Tyr Asp Ile Glu Gly Val Leu Ser Gly Ser Thr Thr Ala Leu Pro
85 90 95
Asn Gly Gln Ile Phe Ala Leu Tyr Thr Gly Asn Ala Asn Asp Phe Ser
100 105 110
Gln Leu Gln Cys Lys Ala Val Pro Val Asp Val Ser Asp Pro Leu Leu
115 120 125
Val Lys Trp Val Lys Tyr Asp Gly Asn Pro Ile Leu Tyr Thr Pro Pro
130 135 140
Gly Ile Gly Leu Lys Asp Tyr Arg Asp Pro Ser Thr Val Trp Thr Gly
145 150 155 160
Pro Asp Gly Lys His Arg Met Ile Met Gly Thr Lys Arg Gly Thr Thr
165 170 175
Gly Leu Val Leu Val Tyr His Thr Thr Asp Phe Thr Asn Tyr Val Met
180 185 190
Leu Asp Glu Pro Leu His Ser Val Pro Asn Thr Asp Met Trp Glu Cys
195 200 205
Val Asp Leu Phe Pro Val Ser Thr Thr Asn Asp Ser Ala Leu Asp Ile
210 215 220
Ala Ala Tyr Gly Ser Gly Ile Lys His Val Leu Lys Glu Ser Trp Glu
225 230 235 240
Gly His Ala Met Asp Phe Tyr Ser Ile Gly Thr Tyr Asp Ala Ile Asn
245 250 255
Asp Lys Trp Thr Pro Asp Asn Pro Glu Leu Asp Val Gly Ile Gly Leu
260 265 270
Arg Cys Asp Tyr Gly Arg Phe Phe Ala Ser Lys Ser Leu Tyr Asp Pro
275 280 285
Leu Lys Lys Arg Arg Val Thr Trp Gly Tyr Val Ala Glu Ser Asp Ser
290 295 300
Ala Asp Gln Asp Val Ser Arg Gly Trp Ala Thr Ile Tyr Asn Val Ala
305 310 315 320
Arg Thr Ile Val Leu Asp Arg Lys Thr Gly Thr His Leu Leu Gln Trp
325 330 335
Pro Val Glu Glu Leu Glu Ser Leu Arg Ser Asn Val Arg Glu Phe Lys
340 345 350
Glu Met Thr Leu Glu Pro Gly Ser Ile Val Pro Leu Asp Ile Gly Ser
355 360 365
Ala Thr Gln Leu Asp Ile Ile Ala Thr Phe Glu Val Asp Gln Glu Ala
370 375 380
Leu Lys Ala Thr Ser Asp Ala Asn Asp Glu Tyr Ala Cys Thr Thr Ser
385 390 395 400
Ser Gly Ala Ala Glu Arg Gly Ser Phe Gly Pro Phe Gly Ile Ala Val
405 410 415
Leu Ala Asp Gly Thr Leu Ser Glu Leu Thr Pro Val Tyr Phe Tyr Ile
420 425 430
Ala Lys Asn Thr Lys Gly Gly Val Asp Thr His Phe Cys Thr Asp Lys
435 440 445
Leu Arg Ser Ser Leu Asp Tyr Asp Ser Glu Lys Val Val Tyr Gly Ser
450 455 460
Thr Ile Pro Val Leu Asp Gly Glu Gln Ile Thr Met Arg Val Leu Val
465 470 475 480
Asp His Ser Val Val Glu Gly Phe Ala Gln Gly Gly Arg Thr Val Ile
485 490 495
Thr Ser Arg Val Tyr Pro Thr Lys Ala Ile Tyr Glu Gly Ala Lys Leu
500 505 510
Phe Val Phe Asn Asn Ala Thr Thr Thr Asn Val Lys Ala Thr Leu Asn
515 520 525
Val Trp Gln Met Ser His Ala Leu Ile Gln Pro Tyr Pro Phe
530 535 540
<210> 32
<211> 631
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 32
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Ser Ser Val Lys His Ser Gln
85 90 95
Ser Asp Arg Leu Arg Trp Glu Arg Thr Ala Tyr His Phe Gln Pro Ala
100 105 110
Lys Asn Phe Ile Tyr Asp Pro Asn Gly Pro Leu Phe His Met Gly Trp
115 120 125
Tyr His Leu Phe Tyr Gln Tyr Asn Pro Tyr Ala Pro Ile Trp Gly Asn
130 135 140
Met Ser Trp Gly His Ala Val Ser Lys Asp Met Ile His Trp Phe Glu
145 150 155 160
Leu Pro Val Ala Ile Val Pro Thr Glu Trp Tyr Asp Ile Glu Gly Val
165 170 175
Leu Ser Gly Ser Thr Thr Ala Leu Pro Asn Gly Gln Ile Phe Ala Leu
180 185 190
Tyr Thr Gly Asn Ala Lys Asp Phe Ser Gln Leu Gln Cys Lys Ala Val
195 200 205
Pro Leu Asn Ala Ser Asp Pro Leu Leu Val Glu Trp Val Lys Tyr Glu
210 215 220
Asp Asn Pro Ile Leu Tyr Ile Pro Pro Gly Ile Gly Pro Lys Asp Tyr
225 230 235 240
Arg Asp Pro Ser Thr Val Trp Thr Gly Pro Asp Gly Lys His Arg Met
245 250 255
Ile Met Gly Thr Lys Gln Asn Gly Thr Gly Met Val His Val Tyr His
260 265 270
Thr Thr Asp Phe Ile Asn Tyr Val Leu Leu Asp Glu Pro Leu His Ser
275 280 285
Val Pro Asn Thr Asp Met Trp Glu Cys Val Asp Phe Tyr Pro Val Ser
290 295 300
Thr Ile Asn Asp Ser Ala Leu Asp Ile Ala Ala Tyr Gly Ser Asp Ile
305 310 315 320
Lys His Val Ile Lys Glu Ser Trp Glu Gly His Gly Met Asp Leu Tyr
325 330 335
Ser Ile Gly Thr Tyr Asp Ala Tyr Lys Asp Lys Trp Thr Pro Asp Asn
340 345 350
Pro Glu Phe Asp Val Gly Ile Gly Leu Arg Val Asp Tyr Gly Arg Phe
355 360 365
Phe Ala Ser Lys Ser Leu Tyr Asp Pro Leu Lys Lys Arg Arg Val Thr
370 375 380
Trp Gly Tyr Val Ala Glu Ser Asp Ser Ser Asp Gln Asp Leu Asn Arg
385 390 395 400
Gly Trp Ala Thr Ile Tyr Asn Val Gly Arg Thr Val Val Leu Asp Arg
405 410 415
Lys Thr Gly Thr His Leu Leu His Trp Pro Val Glu Glu Ile Glu Ser
420 425 430
Leu Arg Ser Asn Val Arg Glu Phe Asn Glu Ile Glu Leu Val Pro Gly
435 440 445
Ser Ile Ile Pro Leu Asp Ile Gly Met Ala Thr Gln Leu Asp Ile Val
450 455 460
Ala Thr Phe Lys Val Asp Pro Glu Ala Leu Met Ala Lys Ser Asp Ile
465 470 475 480
Asn Ser Glu Tyr Gly Cys Thr Thr Ser Ser Gly Ala Thr Gln Arg Gly
485 490 495
Ser Leu Gly Pro Phe Gly Ile Val Val Leu Ala Asp Val Ala Leu Ser
500 505 510
Glu Leu Thr Pro Val Tyr Phe Tyr Ile Ala Lys Asn Ile Asp Gly Gly
515 520 525
Leu Val Thr His Phe Cys Thr Asp Lys Leu Arg Ser Ser Leu Asp Tyr
530 535 540
Asp Gly Glu Arg Val Val Tyr Gly Ser Thr Val Pro Val Leu Asp Gly
545 550 555 560
Glu Glu Leu Thr Met Arg Leu Leu Val Asp His Ser Val Val Glu Gly
565 570 575
Phe Ala Gln Gly Gly Arg Thr Val Met Thr Ser Arg Val Tyr Pro Thr
580 585 590
Asn Ala Ile Tyr Glu Glu Ala Lys Ile Phe Leu Phe Asn Asn Ala Thr
595 600 605
Gly Ala Ser Val Lys Ala Ser Leu Lys Ile Trp Gln Met Gly Ser Ala
610 615 620
Ser Ile Gln Ala Tyr Pro Phe
625 630
<210> 33
<211> 542
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 33
Ser Ser Val Lys His Ser Gln Ser Asp Arg Leu Arg Trp Glu Arg Thr
1 5 10 15
Ala Tyr His Phe Gln Pro Ala Lys Asn Phe Ile Tyr Asp Pro Asn Gly
20 25 30
Pro Leu Phe His Met Gly Trp Tyr His Leu Phe Tyr Gln Tyr Asn Pro
35 40 45
Tyr Ala Pro Ile Trp Gly Asn Met Ser Trp Gly His Ala Val Ser Lys
50 55 60
Asp Met Ile His Trp Phe Glu Leu Pro Val Ala Ile Val Pro Thr Glu
65 70 75 80
Trp Tyr Asp Ile Glu Gly Val Leu Ser Gly Ser Thr Thr Ala Leu Pro
85 90 95
Asn Gly Gln Ile Phe Ala Leu Tyr Thr Gly Asn Ala Lys Asp Phe Ser
100 105 110
Gln Leu Gln Cys Lys Ala Val Pro Leu Asn Ala Ser Asp Pro Leu Leu
115 120 125
Val Glu Trp Val Lys Tyr Glu Asp Asn Pro Ile Leu Tyr Ile Pro Pro
130 135 140
Gly Ile Gly Pro Lys Asp Tyr Arg Asp Pro Ser Thr Val Trp Thr Gly
145 150 155 160
Pro Asp Gly Lys His Arg Met Ile Met Gly Thr Lys Gln Asn Gly Thr
165 170 175
Gly Met Val His Val Tyr His Thr Thr Asp Phe Ile Asn Tyr Val Leu
180 185 190
Leu Asp Glu Pro Leu His Ser Val Pro Asn Thr Asp Met Trp Glu Cys
195 200 205
Val Asp Phe Tyr Pro Val Ser Thr Ile Asn Asp Ser Ala Leu Asp Ile
210 215 220
Ala Ala Tyr Gly Ser Asp Ile Lys His Val Ile Lys Glu Ser Trp Glu
225 230 235 240
Gly His Gly Met Asp Leu Tyr Ser Ile Gly Thr Tyr Asp Ala Tyr Lys
245 250 255
Asp Lys Trp Thr Pro Asp Asn Pro Glu Phe Asp Val Gly Ile Gly Leu
260 265 270
Arg Val Asp Tyr Gly Arg Phe Phe Ala Ser Lys Ser Leu Tyr Asp Pro
275 280 285
Leu Lys Lys Arg Arg Val Thr Trp Gly Tyr Val Ala Glu Ser Asp Ser
290 295 300
Ser Asp Gln Asp Leu Asn Arg Gly Trp Ala Thr Ile Tyr Asn Val Gly
305 310 315 320
Arg Thr Val Val Leu Asp Arg Lys Thr Gly Thr His Leu Leu His Trp
325 330 335
Pro Val Glu Glu Ile Glu Ser Leu Arg Ser Asn Val Arg Glu Phe Asn
340 345 350
Glu Ile Glu Leu Val Pro Gly Ser Ile Ile Pro Leu Asp Ile Gly Met
355 360 365
Ala Thr Gln Leu Asp Ile Val Ala Thr Phe Lys Val Asp Pro Glu Ala
370 375 380
Leu Met Ala Lys Ser Asp Ile Asn Ser Glu Tyr Gly Cys Thr Thr Ser
385 390 395 400
Ser Gly Ala Thr Gln Arg Gly Ser Leu Gly Pro Phe Gly Ile Val Val
405 410 415
Leu Ala Asp Val Ala Leu Ser Glu Leu Thr Pro Val Tyr Phe Tyr Ile
420 425 430
Ala Lys Asn Ile Asp Gly Gly Leu Val Thr His Phe Cys Thr Asp Lys
435 440 445
Leu Arg Ser Ser Leu Asp Tyr Asp Gly Glu Arg Val Val Tyr Gly Ser
450 455 460
Thr Val Pro Val Leu Asp Gly Glu Glu Leu Thr Met Arg Leu Leu Val
465 470 475 480
Asp His Ser Val Val Glu Gly Phe Ala Gln Gly Gly Arg Thr Val Met
485 490 495
Thr Ser Arg Val Tyr Pro Thr Asn Ala Ile Tyr Glu Glu Ala Lys Ile
500 505 510
Phe Leu Phe Asn Asn Ala Thr Gly Ala Ser Val Lys Ala Ser Leu Lys
515 520 525
Ile Trp Gln Met Gly Ser Ala Ser Ile Gln Ala Tyr Pro Phe
530 535 540
<210> 34
<211> 631
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 34
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Ser Ser Val Lys His Ser Gln
85 90 95
Ser Asp Arg Leu Arg Trp Glu Arg Thr Ala Tyr His Phe Gln Pro Ala
100 105 110
Lys Asn Phe Ile Tyr Asp Pro Asn Gly Pro Leu Phe His Met Gly Trp
115 120 125
Tyr His Leu Phe Tyr Gln Tyr Asn Pro Tyr Ala Pro Ile Trp Gly Asn
130 135 140
Met Ser Trp Gly His Ala Val Ser Lys Asp Met Ile His Trp Phe Glu
145 150 155 160
Leu Pro Val Ala Met Val Pro Thr Glu Trp Tyr Asp Ile Glu Gly Val
165 170 175
Leu Ser Gly Ser Thr Thr Ala Leu Pro Asn Gly Gln Ile Phe Ala Leu
180 185 190
Tyr Thr Gly Asn Ala Lys Asp Phe Ser Gln Leu Gln Cys Lys Ala Val
195 200 205
Pro Leu Asn Ala Ser Asp Pro Leu Leu Val Asp Trp Val Lys Tyr Glu
210 215 220
Asp Asn Pro Ile Leu Tyr Ile Pro Pro Gly Ile Gly Pro Lys Asp Tyr
225 230 235 240
Arg Asp Pro Ser Thr Val Trp Thr Gly Pro Asp Gly Lys His Arg Met
245 250 255
Ile Met Gly Thr Lys Gln Asn Gly Thr Gly Met Val His Val Tyr His
260 265 270
Thr Thr Asp Phe Ile Asn Tyr Val Leu Leu Asp Glu Pro Leu His Ser
275 280 285
Val Pro Asn Thr Asp Met Trp Glu Cys Val Asp Phe Tyr Pro Val Ser
290 295 300
Thr Ile Asn Asp Ser Ala Leu Asp Ile Ala Ala Tyr Gly Ser Asp Ile
305 310 315 320
Lys His Val Ile Lys Glu Ser Trp Glu Gly His Gly Met Asp Leu Tyr
325 330 335
Ser Ile Gly Thr Tyr Asp Ala Tyr Lys Asp Lys Trp Thr Pro Asp Asn
340 345 350
Pro Glu Leu Asp Val Gly Ile Gly Leu Arg Val Asp Tyr Gly Arg Leu
355 360 365
Phe Ala Ser Lys Ser Leu Tyr Asp Pro Leu Lys Lys Arg Arg Val Thr
370 375 380
Trp Gly Tyr Val Gly Glu Ser Asp Ser Pro Asp Gln Asp Ile Asn Arg
385 390 395 400
Gly Trp Ala Thr Ile Tyr Asn Val Gly Arg Thr Val Val Leu Asp Arg
405 410 415
Lys Thr Gly Thr His Leu Leu His Trp Pro Val Glu Glu Ile Glu Ser
420 425 430
Leu Arg Ser Asn Val Arg Glu Phe Asn Glu Ile Glu Leu Val Pro Gly
435 440 445
Ser Ile Ile Pro Leu Asp Ile Gly Met Ala Thr Gln Leu Asp Ile Val
450 455 460
Ala Thr Phe Lys Val Asp Pro Glu Ala Leu Met Ala Lys Ser Asp Ile
465 470 475 480
Asn Ser Glu Tyr Gly Cys Thr Thr Ser Ser Gly Ala Thr Gln Arg Gly
485 490 495
Ser Leu Gly Pro Phe Gly Ile Val Val Leu Ala Asp Leu Ala Leu Ser
500 505 510
Glu Leu Thr Pro Leu Tyr Phe Tyr Ile Ala Lys Asn Thr Asp Gly Gly
515 520 525
Leu Val Thr His Phe Cys Thr Asp Lys Leu Arg Ser Ser Leu Asp Tyr
530 535 540
Asp Gly Glu Arg Val Val Tyr Gly Gly Thr Val Pro Val Leu Asp Gly
545 550 555 560
Glu Glu Leu Thr Met Arg Leu Leu Val Asp His Ser Val Val Glu Gly
565 570 575
Phe Ala Gln Gly Gly Arg Thr Val Ile Thr Ser Arg Val Tyr Pro Thr
580 585 590
Asn Ala Ile Tyr Glu Glu Ala Lys Ile Phe Leu Phe Asn Asn Ala Thr
595 600 605
Gly Ala Ser Val Lys Ala Ser Leu Lys Ile Trp Gln Met Gly Ser Ala
610 615 620
Ser Ile Gln Ala Tyr Pro Phe
625 630
<210> 35
<211> 542
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 35
Ser Ser Val Lys His Ser Gln Ser Asp Arg Leu Arg Trp Glu Arg Thr
1 5 10 15
Ala Tyr His Phe Gln Pro Ala Lys Asn Phe Ile Tyr Asp Pro Asn Gly
20 25 30
Pro Leu Phe His Met Gly Trp Tyr His Leu Phe Tyr Gln Tyr Asn Pro
35 40 45
Tyr Ala Pro Ile Trp Gly Asn Met Ser Trp Gly His Ala Val Ser Lys
50 55 60
Asp Met Ile His Trp Phe Glu Leu Pro Val Ala Met Val Pro Thr Glu
65 70 75 80
Trp Tyr Asp Ile Glu Gly Val Leu Ser Gly Ser Thr Thr Ala Leu Pro
85 90 95
Asn Gly Gln Ile Phe Ala Leu Tyr Thr Gly Asn Ala Lys Asp Phe Ser
100 105 110
Gln Leu Gln Cys Lys Ala Val Pro Leu Asn Ala Ser Asp Pro Leu Leu
115 120 125
Val Asp Trp Val Lys Tyr Glu Asp Asn Pro Ile Leu Tyr Ile Pro Pro
130 135 140
Gly Ile Gly Pro Lys Asp Tyr Arg Asp Pro Ser Thr Val Trp Thr Gly
145 150 155 160
Pro Asp Gly Lys His Arg Met Ile Met Gly Thr Lys Gln Asn Gly Thr
165 170 175
Gly Met Val His Val Tyr His Thr Thr Asp Phe Ile Asn Tyr Val Leu
180 185 190
Leu Asp Glu Pro Leu His Ser Val Pro Asn Thr Asp Met Trp Glu Cys
195 200 205
Val Asp Phe Tyr Pro Val Ser Thr Ile Asn Asp Ser Ala Leu Asp Ile
210 215 220
Ala Ala Tyr Gly Ser Asp Ile Lys His Val Ile Lys Glu Ser Trp Glu
225 230 235 240
Gly His Gly Met Asp Leu Tyr Ser Ile Gly Thr Tyr Asp Ala Tyr Lys
245 250 255
Asp Lys Trp Thr Pro Asp Asn Pro Glu Leu Asp Val Gly Ile Gly Leu
260 265 270
Arg Val Asp Tyr Gly Arg Leu Phe Ala Ser Lys Ser Leu Tyr Asp Pro
275 280 285
Leu Lys Lys Arg Arg Val Thr Trp Gly Tyr Val Gly Glu Ser Asp Ser
290 295 300
Pro Asp Gln Asp Ile Asn Arg Gly Trp Ala Thr Ile Tyr Asn Val Gly
305 310 315 320
Arg Thr Val Val Leu Asp Arg Lys Thr Gly Thr His Leu Leu His Trp
325 330 335
Pro Val Glu Glu Ile Glu Ser Leu Arg Ser Asn Val Arg Glu Phe Asn
340 345 350
Glu Ile Glu Leu Val Pro Gly Ser Ile Ile Pro Leu Asp Ile Gly Met
355 360 365
Ala Thr Gln Leu Asp Ile Val Ala Thr Phe Lys Val Asp Pro Glu Ala
370 375 380
Leu Met Ala Lys Ser Asp Ile Asn Ser Glu Tyr Gly Cys Thr Thr Ser
385 390 395 400
Ser Gly Ala Thr Gln Arg Gly Ser Leu Gly Pro Phe Gly Ile Val Val
405 410 415
Leu Ala Asp Leu Ala Leu Ser Glu Leu Thr Pro Leu Tyr Phe Tyr Ile
420 425 430
Ala Lys Asn Thr Asp Gly Gly Leu Val Thr His Phe Cys Thr Asp Lys
435 440 445
Leu Arg Ser Ser Leu Asp Tyr Asp Gly Glu Arg Val Val Tyr Gly Gly
450 455 460
Thr Val Pro Val Leu Asp Gly Glu Glu Leu Thr Met Arg Leu Leu Val
465 470 475 480
Asp His Ser Val Val Glu Gly Phe Ala Gln Gly Gly Arg Thr Val Ile
485 490 495
Thr Ser Arg Val Tyr Pro Thr Asn Ala Ile Tyr Glu Glu Ala Lys Ile
500 505 510
Phe Leu Phe Asn Asn Ala Thr Gly Ala Ser Val Lys Ala Ser Leu Lys
515 520 525
Ile Trp Gln Met Gly Ser Ala Ser Ile Gln Ala Tyr Pro Phe
530 535 540
<210> 36
<211> 1896
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 36
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120
tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180
aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240
tctctcgaga aaagagaggc tgaagctagt tccgttaaac attctcagtc agatcgattg 300
aggtgggaac gtactgccta ccactttcaa ccagcaaaga acttcatata tgaccctaat 360
ggtccacttt tccacatggg atggtaccat ctattttacc aatataaccc gtatgctcca 420
atttggggca atatgtcttg gggtcacgct gtgtccaagg acatgatcca ttggttcgag 480
ctgcccgtcg ctatcgttcc aacggaatgg tacgatattg aaggtgtatt aagcggttcg 540
acaactgcgt tgccaaacgg tcaaattttc gccttgtaca ccggtaatgc taaggatttt 600
tctcaattac aatgcaaagc tgtccctttg aacgcttccg acccattgtt ggttgaatgg 660
gttaagtacg aagataaccc tatcctatat attccaccag gcatcggtcc taaggactac 720
agagatccat ctaccgtgtg gacaggtcca gatggtaaac acagaatgat tatgggaacc 780
aagcaaaacg gtactgggat ggttcatgtc taccacacca ctgactttat aaattatgtc 840
ttattagacg agccgttgca ctccgtccca aacaccgata tgtgggaatg tgtggacttc 900
tacccagtat ctactatcaa tgacagcgcg ttggatattg cagcctacgg ttcagacatc 960
aagcatgtta taaaagaatc ttgggaaggt catggtatgg atttatactc tattggtact 1020
tatgacgctt acaaggataa gtggacgcca gataaccccg agttcgatgt tgggattggt 1080
ctgagagttg attacggcag attctttgct tccaagagct tgtacgaccc gttgaagaag 1140
agaagagtca catggggtta tgttgctgaa agtgattctt ccgaccaaga cctcaataga 1200
ggttgggcca caatctataa cgttggtaga actgtcgtct tggaccggaa aaccggtaca 1260
cacctattac attggccagt ggaggaaatt gaatctctgc gttcgaacgt cagagaattt 1320
aatgaaattg aattggttcc aggatcgatc ataccattgg atattggtat ggctactcaa 1380
ttggacatcg ttgccacctt caaagtagac ccagaagctc ttatggctaa gtccgatatt 1440
aactctgaat acggttgtac cacttcctca ggtgctactc agcgtgggtc tttaggccct 1500
tttggtatcg ttgttttggc tgacgtagct ctatcggagt taaccccagt ttacttctat 1560
atcgcaaaga atatcgatgg tggtctggtc actcacttct gtaccgataa attgcgctct 1620
agtttggact acgatggaga aagagttgtt tacggttcaa ctgttccagt cttggacggt 1680
gaagaattaa ccatgagatt gctggtggat catagtgtag tcgaaggttt cgctcaaggt 1740
ggtagaactg ttatgacctc cagagtctac cccactaacg ccatctatga agaggcgaag 1800
atttttcttt tcaataacgc gactggcgct agtgttaaag catctttgaa gatttggcaa 1860
atgggttctg cctctattca ggcttatccc ttctaa 1896
<210> 37
<211> 1896
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 37
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120
tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180
aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240
tctctcgaga aaagagaggc tgaagctagt tccgttaaac attctcagtc agatcgattg 300
aggtgggaac gtactgccta ccactttcaa ccagcaaaga acttcatata tgaccctaat 360
ggtccacttt tccacatggg atggtaccat ctattttacc aatataaccc gtatgctcca 420
atttggggca atatgtcttg gggtcacgct gtgtccaagg acatgatcca ttggttcgag 480
ctgcccgtcg ctatggttcc aacggaatgg tacgatattg aaggtgtctt gtctgggagc 540
accacagctt tgcctaacgg tcaaatcttc gccttataca ctggtaatgc gaaagatttt 600
tcccaattac aatgcaaggc tgttccattg aacgcctcgg acccattgct cgtagattgg 660
gtcaagtacg aagataaccc aattttgtat atccccccag gtattggacc aaaggactac 720
agagatccga gtaccgtgtg gactggtcct gacggtaaac acagaatgat catgggtacc 780
aagcaaaacg gcactggtat ggttcacgta taccatacaa ccgactttat taattatgtt 840
ttattggacg aaccattgca ctctgttcca aatactgata tgtgggagtg tgtcgatttc 900
tacccagtct ctacgataaa cgacagcgca ctcgatatag ctgcttatgg tagtgatatt 960
aagcacgtta ttaaagaatc ttgggaaggt catggtatgg acttgtactc catcggtact 1020
tacgatgctt acaaggataa gtggacccca gacaaccctg aattagacgt tggtatcggg 1080
ctaagagtgg actatggtag attgttcgca tcgaaaagcc tttacgatcc actgaagaaa 1140
agaagagtca cttggggtta cgttggcgag tctgattctc cagatcagga cattaacaga 1200
ggttgggcga ccatctataa tgttggacgt accgtcgttt tggatagaaa gactggtact 1260
catctactgc actggcctgt cgaagaaatc gaatcattaa gaagtaatgt tagagaattt 1320
aacgaaattg agttggtacc aggttctata attcctttgg acattggtat ggccacacaa 1380
ttggacatcg ttgctacatt caaggttgat ccagaagctt taatggctaa gtctgacata 1440
aactccgaat acggttgtac cacttcctcc ggtgcgactc aaagaggttc gttgggtcca 1500
ttcggtatcg tcgttctagc cgatttggct ctctctgaat tgactccatt atacttttat 1560
atcgctaaga acaccgatgg gggcttggta acacacttct gtactgataa attaagatca 1620
agtttggatt acgacggtga acgcgtcgta tacggtggta cggttcccgt gttagacggg 1680
gaagaactca ccatgaggct attggtcgat cattctgttg ttgagggttt tgctcaaggt 1740
ggaagaaccg ttattactag ccgtgtctat cccacaaatg ctatttatga agaagccaag 1800
attttccttt ttaacaacgc taccggtgca tccgttaagg cttctttgaa gatatggcaa 1860
atgggtagcg cttctatcca agcctaccca ttctaa 1896
<210> 38
<211> 549
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 38
Val Pro Gly Lys Leu Glu Ser Asn Ala Asp Val Glu Trp Gln Arg Ser
1 5 10 15
Ala Tyr His Phe Gln Pro Asp Lys Asn Phe Ile Ser Asp Pro Asp Gly
20 25 30
Pro Met Tyr His Met Gly Trp Tyr His Leu Phe Tyr Gln Tyr Asn Pro
35 40 45
Glu Ser Ala Ile Trp Gly Asn Ile Thr Trp Gly His Ser Val Ser Arg
50 55 60
Asp Met Ile Asn Trp Phe His Leu Pro Phe Ala Met Val Pro Asp His
65 70 75 80
Trp Tyr Asp Ile Glu Gly Val Met Thr Gly Ser Ala Thr Val Leu Pro
85 90 95
Asn Gly Gln Ile Ile Met Leu Tyr Thr Gly Asn Ala Tyr Asp Leu Ser
100 105 110
Gln Leu Gln Cys Leu Ala Tyr Ala Val Asn Ser Ser Asp Pro Leu Leu
115 120 125
Leu Glu Trp Lys Lys Tyr Glu Gly Asn Pro Ile Leu Phe Pro Pro Pro
130 135 140
Gly Val Gly Tyr Lys Asp Phe Arg Asp Pro Ser Thr Leu Trp Met Gly
145 150 155 160
Pro Asp Gly Glu Trp Arg Met Val Met Gly Ser Lys His Asn Glu Thr
165 170 175
Ile Gly Cys Ala Leu Val Tyr Arg Thr Thr Asn Phe Thr His Phe Glu
180 185 190
Leu Asn Glu Glu Val Leu His Ala Val Pro His Thr Gly Met Trp Glu
195 200 205
Cys Val Asp Leu Tyr Pro Val Ser Thr Thr His Thr Asn Gly Leu Glu
210 215 220
Met Lys Asp Asn Gly Pro Asn Val Lys Tyr Ile Leu Lys Gln Ser Gly
225 230 235 240
Asp Glu Asp Arg His Asp Trp Tyr Ala Ile Gly Thr Phe Asp Pro Glu
245 250 255
Lys Asp Lys Trp Tyr Pro Asp Asp Pro Glu Asn Asp Val Gly Ile Gly
260 265 270
Leu Arg Tyr Asp Tyr Gly Lys Phe Tyr Ala Ser Lys Thr Phe Tyr Asp
275 280 285
Gln His Lys Lys Arg Arg Val Leu Trp Gly Tyr Val Gly Glu Thr Asp
290 295 300
Pro Pro Lys Ser Asp Leu Leu Lys Gly Trp Ala Asn Ile Leu Asn Ile
305 310 315 320
Pro Arg Ser Val Val Leu Asp Thr Gln Thr Glu Thr Asn Leu Ile Gln
325 330 335
Trp Pro Ile Glu Glu Val Glu Lys Leu Arg Ser Lys Lys Tyr Asp Glu
340 345 350
Phe Lys Asp Val Glu Leu Arg Pro Gly Ser Leu Ile Pro Leu Glu Ile
355 360 365
Gly Thr Ala Thr Gln Leu Asp Ile Ser Ala Thr Phe Glu Ile Asp Glu
370 375 380
Lys Lys Leu Glu Ser Thr Leu Glu Ala Asp Val Leu Phe Asn Cys Thr
385 390 395 400
Thr Ser Glu Gly Ser Val Gly Arg Gly Val Leu Gly Pro Phe Gly Ile
405 410 415
Val Val Leu Ala Asp Ala Asn Arg Ser Glu Gln Leu Pro Val Tyr Phe
420 425 430
Tyr Ile Ala Lys Asp Thr Asp Gly Thr Ser Arg Thr Tyr Phe Cys Ala
435 440 445
Asp Glu Ser Arg Ser Ser Lys Asp Lys Asp Val Gly Lys Trp Val Tyr
450 455 460
Gly Ser Ser Val Pro Val Leu Glu Gly Glu Asn Tyr Asn Met Arg Leu
465 470 475 480
Leu Val Asp His Ser Ile Val Glu Gly Phe Ala Gln Gly Gly Arg Thr
485 490 495
Val Val Thr Ser Arg Val Tyr Pro Thr Met Ala Ile Tyr Gly Ala Ala
500 505 510
Lys Ile Phe Leu Phe Asn Asn Ala Thr Gly Ile Ser Val Lys Ala Ser
515 520 525
Leu Lys Ile Trp Lys Met Ala Glu Ala Gln Leu Asp Pro Phe Pro Leu
530 535 540
Ser Gly Trp Ser Ser
545
<210> 39
<211> 644
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 39
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Asp Glu Glu Ala Ala Gly Gly
85 90 95
Phe Pro Trp Ser Asn Glu Met Leu Gln Trp Gln Arg Ser Gly Tyr His
100 105 110
Phe Gln Thr Ala Lys Asn Tyr Met Ser Asp Pro Asn Gly Leu Met Tyr
115 120 125
Tyr Arg Gly Trp Tyr His Met Phe Phe Gln Tyr Asn Pro Val Gly Thr
130 135 140
Asp Trp Asp Asp Gly Met Glu Trp Gly His Ala Val Ser Arg Asn Leu
145 150 155 160
Val Gln Trp Arg Thr Leu Pro Ile Ala Met Val Ala Asp Gln Trp Tyr
165 170 175
Asp Ile Leu Gly Val Leu Ser Gly Ser Met Thr Val Leu Pro Asn Gly
180 185 190
Thr Val Ile Met Ile Tyr Thr Gly Ala Thr Asn Ala Ser Ala Val Glu
195 200 205
Val Gln Cys Ile Ala Thr Pro Ala Asp Pro Asn Asp Pro Leu Leu Arg
210 215 220
Arg Trp Thr Lys His Pro Ala Asn Pro Val Ile Trp Ser Pro Pro Gly
225 230 235 240
Val Gly Thr Lys Asp Phe Arg Asp Ser Met Thr Ala Trp Tyr Asp Glu
245 250 255
Ser Asp Asp Thr Trp Arg Thr Leu Leu Gly Ser Lys Asp Asp Asn Asn
260 265 270
Gly His His Asp Gly Ile Ala Met Met Tyr Lys Thr Lys Asp Phe Leu
275 280 285
Asn Tyr Glu Leu Ile Pro Gly Ile Leu His Arg Val Glu Arg Thr Gly
290 295 300
Glu Trp Glu Cys Ile Asp Phe Tyr Pro Val Gly His Arg Thr Ser Asp
305 310 315 320
Asn Ser Ser Glu Met Leu His Val Leu Lys Ala Ser Met Asp Asp Glu
325 330 335
Arg His Asp Tyr Tyr Ser Leu Gly Thr Tyr Asp Ser Ala Ala Asn Arg
340 345 350
Trp Thr Pro Ile Asp Pro Glu Leu Asp Leu Gly Ile Gly Leu Arg Tyr
355 360 365
Asp Trp Gly Lys Phe Tyr Ala Ser Thr Ser Phe Tyr Asp Pro Ala Lys
370 375 380
Lys Arg Arg Val Leu Met Gly Tyr Val Gly Glu Val Asp Ser Lys Arg
385 390 395 400
Ala Asp Val Val Lys Gly Trp Ala Ser Ile Gln Ser Val Pro Arg Thr
405 410 415
Ile Ala Leu Asp Glu Lys Thr Arg Thr Asn Leu Leu Leu Trp Pro Val
420 425 430
Glu Glu Ile Glu Thr Leu Arg Leu Asn Ala Thr Gln Leu Ser Asp Val
435 440 445
Thr Leu Asn Thr Gly Ser Val Ile His Ile Pro Leu Arg Gln Gly Thr
450 455 460
Gln Leu Asp Ile Glu Ala Thr Phe His Leu Asp Ala Ser Ala Val Ala
465 470 475 480
Ala Leu Asn Glu Ala Asp Val Gly Tyr Asn Cys Ser Ser Ser Gly Gly
485 490 495
Ala Val Asn Arg Gly Ala Leu Gly Pro Phe Gly Leu Leu Val Leu Ala
500 505 510
Ala Gly Asp Arg Arg Gly Glu Gln Thr Ala Val Tyr Phe Tyr Val Ser
515 520 525
Arg Gly Leu Asp Gly Gly Leu His Thr Ser Phe Cys Gln Asp Glu Leu
530 535 540
Arg Ser Ser Arg Ala Lys Asp Val Thr Lys Arg Val Ile Gly Ser Thr
545 550 555 560
Val Pro Val Leu Asp Gly Glu Ala Phe Ser Met Arg Val Leu Val Asp
565 570 575
His Ser Ile Val Gln Gly Phe Ala Met Gly Gly Arg Thr Thr Met Thr
580 585 590
Ser Arg Val Tyr Pro Met Glu Ala Tyr Gln Glu Ala Lys Val Tyr Leu
595 600 605
Phe Asn Asn Ala Thr Gly Ala Ser Val Thr Ala Glu Arg Leu Val Val
610 615 620
His Asp Met Asp Ser Ala His Asn Gln Leu Ser Asn Met Asp Asp Tyr
625 630 635 640
Ser Tyr Val Gln
<210> 40
<211> 555
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 40
Asp Glu Glu Ala Ala Gly Gly Phe Pro Trp Ser Asn Glu Met Leu Gln
1 5 10 15
Trp Gln Arg Ser Gly Tyr His Phe Gln Thr Ala Lys Asn Tyr Met Ser
20 25 30
Asp Pro Asn Gly Leu Met Tyr Tyr Arg Gly Trp Tyr His Met Phe Phe
35 40 45
Gln Tyr Asn Pro Val Gly Thr Asp Trp Asp Asp Gly Met Glu Trp Gly
50 55 60
His Ala Val Ser Arg Asn Leu Val Gln Trp Arg Thr Leu Pro Ile Ala
65 70 75 80
Met Val Ala Asp Gln Trp Tyr Asp Ile Leu Gly Val Leu Ser Gly Ser
85 90 95
Met Thr Val Leu Pro Asn Gly Thr Val Ile Met Ile Tyr Thr Gly Ala
100 105 110
Thr Asn Ala Ser Ala Val Glu Val Gln Cys Ile Ala Thr Pro Ala Asp
115 120 125
Pro Asn Asp Pro Leu Leu Arg Arg Trp Thr Lys His Pro Ala Asn Pro
130 135 140
Val Ile Trp Ser Pro Pro Gly Val Gly Thr Lys Asp Phe Arg Asp Ser
145 150 155 160
Met Thr Ala Trp Tyr Asp Glu Ser Asp Asp Thr Trp Arg Thr Leu Leu
165 170 175
Gly Ser Lys Asp Asp Asn Asn Gly His His Asp Gly Ile Ala Met Met
180 185 190
Tyr Lys Thr Lys Asp Phe Leu Asn Tyr Glu Leu Ile Pro Gly Ile Leu
195 200 205
His Arg Val Glu Arg Thr Gly Glu Trp Glu Cys Ile Asp Phe Tyr Pro
210 215 220
Val Gly His Arg Thr Ser Asp Asn Ser Ser Glu Met Leu His Val Leu
225 230 235 240
Lys Ala Ser Met Asp Asp Glu Arg His Asp Tyr Tyr Ser Leu Gly Thr
245 250 255
Tyr Asp Ser Ala Ala Asn Arg Trp Thr Pro Ile Asp Pro Glu Leu Asp
260 265 270
Leu Gly Ile Gly Leu Arg Tyr Asp Trp Gly Lys Phe Tyr Ala Ser Thr
275 280 285
Ser Phe Tyr Asp Pro Ala Lys Lys Arg Arg Val Leu Met Gly Tyr Val
290 295 300
Gly Glu Val Asp Ser Lys Arg Ala Asp Val Val Lys Gly Trp Ala Ser
305 310 315 320
Ile Gln Ser Val Pro Arg Thr Ile Ala Leu Asp Glu Lys Thr Arg Thr
325 330 335
Asn Leu Leu Leu Trp Pro Val Glu Glu Ile Glu Thr Leu Arg Leu Asn
340 345 350
Ala Thr Gln Leu Ser Asp Val Thr Leu Asn Thr Gly Ser Val Ile His
355 360 365
Ile Pro Leu Arg Gln Gly Thr Gln Leu Asp Ile Glu Ala Thr Phe His
370 375 380
Leu Asp Ala Ser Ala Val Ala Ala Leu Asn Glu Ala Asp Val Gly Tyr
385 390 395 400
Asn Cys Ser Ser Ser Gly Gly Ala Val Asn Arg Gly Ala Leu Gly Pro
405 410 415
Phe Gly Leu Leu Val Leu Ala Ala Gly Asp Arg Arg Gly Glu Gln Thr
420 425 430
Ala Val Tyr Phe Tyr Val Ser Arg Gly Leu Asp Gly Gly Leu His Thr
435 440 445
Ser Phe Cys Gln Asp Glu Leu Arg Ser Ser Arg Ala Lys Asp Val Thr
450 455 460
Lys Arg Val Ile Gly Ser Thr Val Pro Val Leu Asp Gly Glu Ala Phe
465 470 475 480
Ser Met Arg Val Leu Val Asp His Ser Ile Val Gln Gly Phe Ala Met
485 490 495
Gly Gly Arg Thr Thr Met Thr Ser Arg Val Tyr Pro Met Glu Ala Tyr
500 505 510
Gln Glu Ala Lys Val Tyr Leu Phe Asn Asn Ala Thr Gly Ala Ser Val
515 520 525
Thr Ala Glu Arg Leu Val Val His Asp Met Asp Ser Ala His Asn Gln
530 535 540
Leu Ser Asn Met Asp Asp Tyr Ser Tyr Val Gln
545 550 555
<210> 41
<211> 644
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 41
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Asp Glu Glu Ala Ala Gly Gly
85 90 95
Phe Pro Trp Ser Asn Glu Met Leu Gln Trp Gln Arg Ser Gly Tyr His
100 105 110
Phe Gln Thr Ala Lys Asn Tyr Met Ser Asp Pro Asn Gly Leu Met Tyr
115 120 125
Tyr Arg Gly Trp Asn His Met Phe Phe Gln Tyr Asn Pro Val Gly Thr
130 135 140
Asp Trp Asp Asp Gly Met Glu Trp Gly His Ala Val Ser Arg Asn Leu
145 150 155 160
Val Gln Trp Arg Thr Leu Pro Ile Ala Met Val Ala Asp Gln Trp Tyr
165 170 175
Asp Ile Leu Gly Val Leu Ser Gly Ser Met Thr Val Leu Pro Asn Gly
180 185 190
Thr Val Ile Met Ile Tyr Thr Gly Ala Thr Asn Ala Ser Ala Val Glu
195 200 205
Val Gln Cys Ile Ala Thr Pro Ala Asp Pro Thr Asp Pro Leu Leu Arg
210 215 220
Arg Trp Thr Lys His Pro Ala Asn Pro Val Ile Trp Ser Pro Pro Gly
225 230 235 240
Val Gly Thr Lys Asp Phe Arg Asp Pro Met Thr Ala Trp Tyr Asp Glu
245 250 255
Ser Asp Asp Thr Trp Arg Thr Leu Leu Gly Ser Lys Asp Asp Asn Asn
260 265 270
Gly His His Asp Gly Ile Ala Met Met Tyr Lys Thr Lys Asp Phe Leu
275 280 285
Asn Tyr Glu Leu Ile Pro Gly Ile Leu His Arg Val Glu Arg Thr Gly
290 295 300
Glu Trp Glu Cys Ile Asp Phe Tyr Pro Val Gly Arg Arg Thr Ser Asp
305 310 315 320
Asn Ser Ser Glu Met Leu His Val Leu Lys Ala Ser Met Asp Asp Glu
325 330 335
Arg His Asp Tyr Tyr Ser Leu Gly Thr Tyr Asp Ser Ala Ala Asn Arg
340 345 350
Trp Thr Pro Ile Asp Pro Glu Leu Asp Leu Gly Ile Gly Leu Arg Tyr
355 360 365
Asp Trp Gly Lys Phe Tyr Ala Ser Thr Ser Phe Tyr Asp Pro Ala Lys
370 375 380
Lys Arg Arg Val Leu Met Gly Tyr Val Gly Glu Val Asp Ser Lys Arg
385 390 395 400
Ala Asp Val Val Lys Gly Trp Ala Ser Ile Gln Ser Val Pro Arg Thr
405 410 415
Ile Ala Leu Asp Glu Lys Thr Arg Thr Asn Leu Leu Leu Trp Pro Val
420 425 430
Glu Glu Ile Glu Thr Leu Arg Leu Asn Ala Thr Glu Leu Ser Asp Val
435 440 445
Thr Leu Asn Thr Gly Ser Val Ile His Ile Pro Leu Arg Gln Gly Thr
450 455 460
Gln Leu Asp Ile Glu Ala Thr Phe His Leu Asp Ala Ser Ala Val Ala
465 470 475 480
Ala Phe Asn Glu Ala Asp Val Gly Tyr Asn Cys Ser Ser Ser Gly Gly
485 490 495
Ala Val Asn Arg Gly Ala Leu Gly Pro Phe Gly Leu Leu Val Leu Ala
500 505 510
Ala Gly Asp Arg Arg Gly Glu Gln Thr Ala Val Tyr Phe Tyr Val Ser
515 520 525
Arg Gly Leu Asp Gly Gly Leu His Thr Ser Phe Cys Gln Asp Glu Leu
530 535 540
Arg Ser Ser Arg Ala Lys Asp Val Thr Lys Arg Val Ile Gly Ser Thr
545 550 555 560
Val Pro Val Leu Asp Gly Glu Ala Phe Ser Met Arg Val Leu Val Asp
565 570 575
His Ser Ile Val Gln Gly Phe Ala Met Gly Gly Arg Thr Thr Met Thr
580 585 590
Ser Arg Val Tyr Pro Met Glu Ala Tyr Gln Glu Ala Lys Val Tyr Leu
595 600 605
Phe Asn Asn Ala Thr Gly Ala Ser Val Thr Ala Glu Arg Leu Val Val
610 615 620
His Glu Met Asp Ser Ala His Asn Gln Leu Ser Asn Met Asp Asp His
625 630 635 640
Ser Tyr Val Gln
<210> 42
<211> 555
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 42
Asp Glu Glu Ala Ala Gly Gly Phe Pro Trp Ser Asn Glu Met Leu Gln
1 5 10 15
Trp Gln Arg Ser Gly Tyr His Phe Gln Thr Ala Lys Asn Tyr Met Ser
20 25 30
Asp Pro Asn Gly Leu Met Tyr Tyr Arg Gly Trp Asn His Met Phe Phe
35 40 45
Gln Tyr Asn Pro Val Gly Thr Asp Trp Asp Asp Gly Met Glu Trp Gly
50 55 60
His Ala Val Ser Arg Asn Leu Val Gln Trp Arg Thr Leu Pro Ile Ala
65 70 75 80
Met Val Ala Asp Gln Trp Tyr Asp Ile Leu Gly Val Leu Ser Gly Ser
85 90 95
Met Thr Val Leu Pro Asn Gly Thr Val Ile Met Ile Tyr Thr Gly Ala
100 105 110
Thr Asn Ala Ser Ala Val Glu Val Gln Cys Ile Ala Thr Pro Ala Asp
115 120 125
Pro Thr Asp Pro Leu Leu Arg Arg Trp Thr Lys His Pro Ala Asn Pro
130 135 140
Val Ile Trp Ser Pro Pro Gly Val Gly Thr Lys Asp Phe Arg Asp Pro
145 150 155 160
Met Thr Ala Trp Tyr Asp Glu Ser Asp Asp Thr Trp Arg Thr Leu Leu
165 170 175
Gly Ser Lys Asp Asp Asn Asn Gly His His Asp Gly Ile Ala Met Met
180 185 190
Tyr Lys Thr Lys Asp Phe Leu Asn Tyr Glu Leu Ile Pro Gly Ile Leu
195 200 205
His Arg Val Glu Arg Thr Gly Glu Trp Glu Cys Ile Asp Phe Tyr Pro
210 215 220
Val Gly Arg Arg Thr Ser Asp Asn Ser Ser Glu Met Leu His Val Leu
225 230 235 240
Lys Ala Ser Met Asp Asp Glu Arg His Asp Tyr Tyr Ser Leu Gly Thr
245 250 255
Tyr Asp Ser Ala Ala Asn Arg Trp Thr Pro Ile Asp Pro Glu Leu Asp
260 265 270
Leu Gly Ile Gly Leu Arg Tyr Asp Trp Gly Lys Phe Tyr Ala Ser Thr
275 280 285
Ser Phe Tyr Asp Pro Ala Lys Lys Arg Arg Val Leu Met Gly Tyr Val
290 295 300
Gly Glu Val Asp Ser Lys Arg Ala Asp Val Val Lys Gly Trp Ala Ser
305 310 315 320
Ile Gln Ser Val Pro Arg Thr Ile Ala Leu Asp Glu Lys Thr Arg Thr
325 330 335
Asn Leu Leu Leu Trp Pro Val Glu Glu Ile Glu Thr Leu Arg Leu Asn
340 345 350
Ala Thr Glu Leu Ser Asp Val Thr Leu Asn Thr Gly Ser Val Ile His
355 360 365
Ile Pro Leu Arg Gln Gly Thr Gln Leu Asp Ile Glu Ala Thr Phe His
370 375 380
Leu Asp Ala Ser Ala Val Ala Ala Phe Asn Glu Ala Asp Val Gly Tyr
385 390 395 400
Asn Cys Ser Ser Ser Gly Gly Ala Val Asn Arg Gly Ala Leu Gly Pro
405 410 415
Phe Gly Leu Leu Val Leu Ala Ala Gly Asp Arg Arg Gly Glu Gln Thr
420 425 430
Ala Val Tyr Phe Tyr Val Ser Arg Gly Leu Asp Gly Gly Leu His Thr
435 440 445
Ser Phe Cys Gln Asp Glu Leu Arg Ser Ser Arg Ala Lys Asp Val Thr
450 455 460
Lys Arg Val Ile Gly Ser Thr Val Pro Val Leu Asp Gly Glu Ala Phe
465 470 475 480
Ser Met Arg Val Leu Val Asp His Ser Ile Val Gln Gly Phe Ala Met
485 490 495
Gly Gly Arg Thr Thr Met Thr Ser Arg Val Tyr Pro Met Glu Ala Tyr
500 505 510
Gln Glu Ala Lys Val Tyr Leu Phe Asn Asn Ala Thr Gly Ala Ser Val
515 520 525
Thr Ala Glu Arg Leu Val Val His Glu Met Asp Ser Ala His Asn Gln
530 535 540
Leu Ser Asn Met Asp Asp His Ser Tyr Val Gln
545 550 555
<210> 43
<211> 649
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 43
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Ser Gly Pro Tyr Ser Ala Ser
85 90 95
Gly Gly Phe Pro Trp Ser Asn Ala Met Leu Gln Trp Gln Arg Thr Gly
100 105 110
Tyr His Phe Gln Pro Glu Lys Asn Tyr Gln Asn Asp Pro Asn Gly Pro
115 120 125
Val Tyr Tyr Lys Gly Trp Tyr His Phe Phe Tyr Gln His Asn Pro Gly
130 135 140
Gly Thr Gly Trp Gly Asn Ile Ser Trp Gly His Ala Val Ser Arg Asp
145 150 155 160
Met Val His Trp Arg His Leu Pro Leu Ala Met Val Pro Glu His Trp
165 170 175
Tyr Asp Ile Glu Gly Val Leu Thr Gly Ser Ile Thr Val Leu Pro Asp
180 185 190
Gly Arg Val Ile Leu Leu Tyr Thr Gly Asn Thr Glu Thr Phe Ala Gln
195 200 205
Val Thr Cys Leu Ala Glu Ala Ala Asp Pro Ser Asp Pro Leu Leu Arg
210 215 220
Glu Trp Ala Lys His Pro Ala Asn Pro Val Val Tyr Pro Pro Pro Gly
225 230 235 240
Ile Gly Met Lys Asp Tyr Arg Asp Pro Thr Thr Ala Trp Phe Asp Asn
245 250 255
Ser Asp Asn Thr Trp Arg Ile Ile Ile Gly Ser Lys Asn Asp Thr Asp
260 265 270
His Ser Gly Ile Val Phe Thr Tyr Lys Thr Lys Asp Phe Val Ser Tyr
275 280 285
Glu Leu Ile Pro Gly Tyr Leu Tyr Arg Gly Pro Ala Gly Thr Gly Met
290 295 300
Tyr Glu Cys Ile Asp Leu Phe Ala Val Gly Gly Gly Arg Ala Ala Ser
305 310 315 320
Asp Met Tyr Asn Ser Thr Ala Glu Asp Val Leu Tyr Val Leu Lys Glu
325 330 335
Ser Ser Asp Asp Asp Arg Arg Asp Tyr Tyr Ala Leu Gly Arg Phe Asp
340 345 350
Ala Ala Ala Asn Thr Trp Thr Pro Ile Asp Thr Glu Arg Glu Leu Gly
355 360 365
Val Ala Leu Arg Tyr Asp Tyr Gly Arg Tyr Asp Thr Ser Lys Ser Phe
370 375 380
Tyr Asp Pro Val Lys Gln Arg Arg Ile Val Trp Gly Tyr Val Val Glu
385 390 395 400
Thr Asp Ser Trp Ser Ala Asp Ala Ala Lys Gly Trp Ala Asn Leu Gln
405 410 415
Ser Ile Pro Arg Thr Val Glu Leu Asp Glu Lys Thr Arg Thr Asn Leu
420 425 430
Val Gln Trp Pro Val Gly Glu Leu Asn Thr Leu Arg Ile Asn Thr Thr
435 440 445
Asp Leu Ser Asp Ile Thr Val Gly Ala Gly Ser Val Asp Ser Leu Pro
450 455 460
Leu His Gln Thr Ser Gln Leu Asp Ile Glu Ala Ser Phe Arg Ile Asn
465 470 475 480
Ala Ser Thr Ile Glu Ala Leu Asn Glu Val Asp Val Gly Tyr Asn Cys
485 490 495
Thr Met Thr Ser Gly Ala Ala Thr Arg Gly Ala Leu Gly Pro Phe Gly
500 505 510
Ile Leu Val Leu Ala Asn Val Ala Leu Thr Glu Gln Thr Ala Val Tyr
515 520 525
Phe Tyr Val Ser Lys Gly Leu Asp Gly Gly Leu Arg Thr His Phe Cys
530 535 540
His Asp Glu Leu Arg Ser Thr His Ala Thr Asp Val Ala Lys Glu Val
545 550 555 560
Val Gly Ser Thr Val Pro Val Leu Asp Gly Glu Asp Phe Ser Val Arg
565 570 575
Val Leu Val Asp His Ser Ile Val Gln Ser Phe Val Met Gly Gly Arg
580 585 590
Met Thr Ala Thr Ser Arg Ala Tyr Pro Thr Glu Ala Ile Tyr Ala Ala
595 600 605
Ala Gly Val Tyr Leu Phe Asn Asn Ala Thr Gly Ala Ser Ile Thr Ala
610 615 620
Glu Lys Leu Val Val His Asp Met Asp Ser Ser Tyr Asn Arg Ile Phe
625 630 635 640
Thr Asp Glu Asp Leu Leu Val Leu Asp
645
<210> 44
<211> 560
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 44
Ser Gly Pro Tyr Ser Ala Ser Gly Gly Phe Pro Trp Ser Asn Ala Met
1 5 10 15
Leu Gln Trp Gln Arg Thr Gly Tyr His Phe Gln Pro Glu Lys Asn Tyr
20 25 30
Gln Asn Asp Pro Asn Gly Pro Val Tyr Tyr Lys Gly Trp Tyr His Phe
35 40 45
Phe Tyr Gln His Asn Pro Gly Gly Thr Gly Trp Gly Asn Ile Ser Trp
50 55 60
Gly His Ala Val Ser Arg Asp Met Val His Trp Arg His Leu Pro Leu
65 70 75 80
Ala Met Val Pro Glu His Trp Tyr Asp Ile Glu Gly Val Leu Thr Gly
85 90 95
Ser Ile Thr Val Leu Pro Asp Gly Arg Val Ile Leu Leu Tyr Thr Gly
100 105 110
Asn Thr Glu Thr Phe Ala Gln Val Thr Cys Leu Ala Glu Ala Ala Asp
115 120 125
Pro Ser Asp Pro Leu Leu Arg Glu Trp Ala Lys His Pro Ala Asn Pro
130 135 140
Val Val Tyr Pro Pro Pro Gly Ile Gly Met Lys Asp Tyr Arg Asp Pro
145 150 155 160
Thr Thr Ala Trp Phe Asp Asn Ser Asp Asn Thr Trp Arg Ile Ile Ile
165 170 175
Gly Ser Lys Asn Asp Thr Asp His Ser Gly Ile Val Phe Thr Tyr Lys
180 185 190
Thr Lys Asp Phe Val Ser Tyr Glu Leu Ile Pro Gly Tyr Leu Tyr Arg
195 200 205
Gly Pro Ala Gly Thr Gly Met Tyr Glu Cys Ile Asp Leu Phe Ala Val
210 215 220
Gly Gly Gly Arg Ala Ala Ser Asp Met Tyr Asn Ser Thr Ala Glu Asp
225 230 235 240
Val Leu Tyr Val Leu Lys Glu Ser Ser Asp Asp Asp Arg Arg Asp Tyr
245 250 255
Tyr Ala Leu Gly Arg Phe Asp Ala Ala Ala Asn Thr Trp Thr Pro Ile
260 265 270
Asp Thr Glu Arg Glu Leu Gly Val Ala Leu Arg Tyr Asp Tyr Gly Arg
275 280 285
Tyr Asp Thr Ser Lys Ser Phe Tyr Asp Pro Val Lys Gln Arg Arg Ile
290 295 300
Val Trp Gly Tyr Val Val Glu Thr Asp Ser Trp Ser Ala Asp Ala Ala
305 310 315 320
Lys Gly Trp Ala Asn Leu Gln Ser Ile Pro Arg Thr Val Glu Leu Asp
325 330 335
Glu Lys Thr Arg Thr Asn Leu Val Gln Trp Pro Val Gly Glu Leu Asn
340 345 350
Thr Leu Arg Ile Asn Thr Thr Asp Leu Ser Asp Ile Thr Val Gly Ala
355 360 365
Gly Ser Val Asp Ser Leu Pro Leu His Gln Thr Ser Gln Leu Asp Ile
370 375 380
Glu Ala Ser Phe Arg Ile Asn Ala Ser Thr Ile Glu Ala Leu Asn Glu
385 390 395 400
Val Asp Val Gly Tyr Asn Cys Thr Met Thr Ser Gly Ala Ala Thr Arg
405 410 415
Gly Ala Leu Gly Pro Phe Gly Ile Leu Val Leu Ala Asn Val Ala Leu
420 425 430
Thr Glu Gln Thr Ala Val Tyr Phe Tyr Val Ser Lys Gly Leu Asp Gly
435 440 445
Gly Leu Arg Thr His Phe Cys His Asp Glu Leu Arg Ser Thr His Ala
450 455 460
Thr Asp Val Ala Lys Glu Val Val Gly Ser Thr Val Pro Val Leu Asp
465 470 475 480
Gly Glu Asp Phe Ser Val Arg Val Leu Val Asp His Ser Ile Val Gln
485 490 495
Ser Phe Val Met Gly Gly Arg Met Thr Ala Thr Ser Arg Ala Tyr Pro
500 505 510
Thr Glu Ala Ile Tyr Ala Ala Ala Gly Val Tyr Leu Phe Asn Asn Ala
515 520 525
Thr Gly Ala Ser Ile Thr Ala Glu Lys Leu Val Val His Asp Met Asp
530 535 540
Ser Ser Tyr Asn Arg Ile Phe Thr Asp Glu Asp Leu Leu Val Leu Asp
545 550 555 560
<210> 45
<211> 644
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 45
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Ala Asn Ala Phe Pro Trp Ser
85 90 95
Asn Ala Met Leu Gln Trp Gln Arg Thr Gly Phe His Phe Gln Pro Asp
100 105 110
Lys Tyr Tyr Gln Asn Asp Pro Asn Gly Pro Val Tyr Tyr Gly Gly Trp
115 120 125
Tyr His Phe Phe Tyr Gln Tyr Asn Pro Ser Gly Ser Val Trp Glu Pro
130 135 140
Gln Ile Val Trp Gly His Ala Val Ser Lys Asp Leu Ile His Trp Arg
145 150 155 160
His Leu Pro Pro Ala Leu Val Pro Asp Gln Trp Tyr Asp Ile Lys Gly
165 170 175
Val Leu Thr Gly Ser Ile Thr Val Leu Pro Asp Gly Lys Val Ile Leu
180 185 190
Leu Tyr Thr Gly Asn Thr Glu Thr Phe Ala Gln Val Thr Cys Leu Ala
195 200 205
Glu Pro Ala Asp Pro Ser Asp Pro Leu Leu Arg Glu Trp Val Lys His
210 215 220
Pro Ala Asn Pro Val Val Phe Pro Pro Pro Gly Ile Gly Met Lys Asp
225 230 235 240
Phe Arg Asp Pro Thr Thr Ala Trp Tyr Asp Glu Ser Asp Gly Thr Trp
245 250 255
Arg Thr Ile Ile Gly Ser Lys Asn Asp Ser Asp His Ser Gly Ile Val
260 265 270
Phe Ser Tyr Lys Thr Lys Asp Phe Ile Ser Tyr Glu Leu Met Pro Gly
275 280 285
Tyr Met Tyr Arg Gly Pro Lys Gly Thr Gly Glu Tyr Glu Cys Ile Asp
290 295 300
Leu Tyr Ala Val Gly Gly Gly Arg Lys Ala Ser Asp Met Tyr Asn Ser
305 310 315 320
Thr Ala Glu Asp Val Leu Tyr Val Leu Lys Glu Ser Ser Asp Asp Asp
325 330 335
Arg His Asp Trp Tyr Ser Leu Gly Arg Phe Asp Ala Ala Ala Asn Lys
340 345 350
Trp Thr Pro Ile Asp Thr Glu Leu Glu Leu Gly Val Gly Leu Arg Tyr
355 360 365
Asp Trp Gly Lys Tyr Tyr Ala Ser Lys Ser Phe Tyr Asp Pro Val Lys
370 375 380
Lys Arg Arg Val Val Trp Ala Tyr Val Gly Glu Thr Asp Ser Glu Arg
385 390 395 400
Ala Asp Ile Thr Lys Gly Trp Ala Asn Leu Gln Ser Ile Pro Arg Thr
405 410 415
Val Glu Leu Asp Glu Lys Thr Arg Thr Asn Leu Ile Gln Trp Pro Val
420 425 430
Glu Glu Leu Asn Thr Leu Arg Ile Asn Thr Thr Asp Leu Ser Gly Ile
435 440 445
Thr Val Gly Ala Gly Ser Val Ala Phe Leu Pro Leu His Gln Thr Ala
450 455 460
Gln Leu Asp Ile Glu Ala Thr Phe Arg Ile Asp Ala Ser Ala Ile Glu
465 470 475 480
Ala Leu Asn Glu Ala Asp Val Ser Tyr Asn Cys Thr Thr Ser Arg Gly
485 490 495
Ala Ala Thr Arg Gly Ala Leu Gly Pro Phe Gly Leu Leu Val Leu Ala
500 505 510
Asn His Ala Leu Thr Glu Gln Thr Gly Val Tyr Phe Tyr Val Ser Lys
515 520 525
Gly Leu Asp Gly Gly Leu Arg Thr His Phe Cys His Asp Glu Leu Arg
530 535 540
Ser Ser His Ala Ser Asp Val Val Lys Arg Val Val Gly Ser Thr Val
545 550 555 560
Pro Val Leu Asp Gly Glu Asp Phe Ser Val Arg Val Leu Val Asp His
565 570 575
Ser Ile Val Gln Ser Phe Ala Met Gly Gly Arg Leu Thr Ala Thr Ser
580 585 590
Arg Ala Tyr Pro Thr Glu Ala Ile Tyr Ala Ala Ala Gly Val Tyr Met
595 600 605
Phe Asn Asn Ala Thr Gly Thr Ser Val Thr Ala Glu Lys Leu Val Val
610 615 620
His Asp Met Asp Ser Ser Tyr Asn His Ile Tyr Thr Asp Gly Asp Leu
625 630 635 640
Val Val Val Asp
<210> 46
<211> 555
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 46
Ala Asn Ala Phe Pro Trp Ser Asn Ala Met Leu Gln Trp Gln Arg Thr
1 5 10 15
Gly Phe His Phe Gln Pro Asp Lys Tyr Tyr Gln Asn Asp Pro Asn Gly
20 25 30
Pro Val Tyr Tyr Gly Gly Trp Tyr His Phe Phe Tyr Gln Tyr Asn Pro
35 40 45
Ser Gly Ser Val Trp Glu Pro Gln Ile Val Trp Gly His Ala Val Ser
50 55 60
Lys Asp Leu Ile His Trp Arg His Leu Pro Pro Ala Leu Val Pro Asp
65 70 75 80
Gln Trp Tyr Asp Ile Lys Gly Val Leu Thr Gly Ser Ile Thr Val Leu
85 90 95
Pro Asp Gly Lys Val Ile Leu Leu Tyr Thr Gly Asn Thr Glu Thr Phe
100 105 110
Ala Gln Val Thr Cys Leu Ala Glu Pro Ala Asp Pro Ser Asp Pro Leu
115 120 125
Leu Arg Glu Trp Val Lys His Pro Ala Asn Pro Val Val Phe Pro Pro
130 135 140
Pro Gly Ile Gly Met Lys Asp Phe Arg Asp Pro Thr Thr Ala Trp Tyr
145 150 155 160
Asp Glu Ser Asp Gly Thr Trp Arg Thr Ile Ile Gly Ser Lys Asn Asp
165 170 175
Ser Asp His Ser Gly Ile Val Phe Ser Tyr Lys Thr Lys Asp Phe Ile
180 185 190
Ser Tyr Glu Leu Met Pro Gly Tyr Met Tyr Arg Gly Pro Lys Gly Thr
195 200 205
Gly Glu Tyr Glu Cys Ile Asp Leu Tyr Ala Val Gly Gly Gly Arg Lys
210 215 220
Ala Ser Asp Met Tyr Asn Ser Thr Ala Glu Asp Val Leu Tyr Val Leu
225 230 235 240
Lys Glu Ser Ser Asp Asp Asp Arg His Asp Trp Tyr Ser Leu Gly Arg
245 250 255
Phe Asp Ala Ala Ala Asn Lys Trp Thr Pro Ile Asp Thr Glu Leu Glu
260 265 270
Leu Gly Val Gly Leu Arg Tyr Asp Trp Gly Lys Tyr Tyr Ala Ser Lys
275 280 285
Ser Phe Tyr Asp Pro Val Lys Lys Arg Arg Val Val Trp Ala Tyr Val
290 295 300
Gly Glu Thr Asp Ser Glu Arg Ala Asp Ile Thr Lys Gly Trp Ala Asn
305 310 315 320
Leu Gln Ser Ile Pro Arg Thr Val Glu Leu Asp Glu Lys Thr Arg Thr
325 330 335
Asn Leu Ile Gln Trp Pro Val Glu Glu Leu Asn Thr Leu Arg Ile Asn
340 345 350
Thr Thr Asp Leu Ser Gly Ile Thr Val Gly Ala Gly Ser Val Ala Phe
355 360 365
Leu Pro Leu His Gln Thr Ala Gln Leu Asp Ile Glu Ala Thr Phe Arg
370 375 380
Ile Asp Ala Ser Ala Ile Glu Ala Leu Asn Glu Ala Asp Val Ser Tyr
385 390 395 400
Asn Cys Thr Thr Ser Arg Gly Ala Ala Thr Arg Gly Ala Leu Gly Pro
405 410 415
Phe Gly Leu Leu Val Leu Ala Asn His Ala Leu Thr Glu Gln Thr Gly
420 425 430
Val Tyr Phe Tyr Val Ser Lys Gly Leu Asp Gly Gly Leu Arg Thr His
435 440 445
Phe Cys His Asp Glu Leu Arg Ser Ser His Ala Ser Asp Val Val Lys
450 455 460
Arg Val Val Gly Ser Thr Val Pro Val Leu Asp Gly Glu Asp Phe Ser
465 470 475 480
Val Arg Val Leu Val Asp His Ser Ile Val Gln Ser Phe Ala Met Gly
485 490 495
Gly Arg Leu Thr Ala Thr Ser Arg Ala Tyr Pro Thr Glu Ala Ile Tyr
500 505 510
Ala Ala Ala Gly Val Tyr Met Phe Asn Asn Ala Thr Gly Thr Ser Val
515 520 525
Thr Ala Glu Lys Leu Val Val His Asp Met Asp Ser Ser Tyr Asn His
530 535 540
Ile Tyr Thr Asp Gly Asp Leu Val Val Val Asp
545 550 555
<210> 47
<211> 650
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 47
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Asp Asp Pro Pro Ser Asp Ser
85 90 95
Glu Asp Tyr Pro Trp Thr Asn Glu Met Leu Lys Trp Gln Arg Thr Gly
100 105 110
Tyr His Phe Gln Pro Pro Asn His Phe Met Ala Asp Pro Asn Ala Ala
115 120 125
Met Tyr Tyr Lys Gly Trp Tyr His Phe Phe Tyr Gln Tyr Asn Pro Asn
130 135 140
Gly Ser Ala Trp Asp Tyr Ser Ile Ser Trp Gly His Ala Val Ser Lys
145 150 155 160
Asp Met Ile His Trp Leu His Leu Pro Val Ala Met Val Pro Asp His
165 170 175
Trp Tyr Asp Ser Lys Gly Val Trp Ser Gly Tyr Ala Thr Thr Leu Pro
180 185 190
Asp Gly Arg Ile Ile Val Leu Tyr Thr Gly Gly Thr Asp Gln Leu Val
195 200 205
Gln Val Gln Asn Leu Ala Glu Pro Ala Asp Pro Ser Asp Pro Leu Leu
210 215 220
Ile Glu Trp Lys Lys Ser Asn Gly Asn Pro Ile Leu Met Pro Pro Pro
225 230 235 240
Gly Val Gly Pro His Asp Phe Arg Asp Pro Phe Pro Val Trp Tyr Asn
245 250 255
Glu Ser Asp Ser Thr Trp His Met Leu Ile Gly Ser Lys Asp Asp Asn
260 265 270
His Tyr Gly Thr Val Leu Ile Tyr Thr Thr Lys Asp Phe Glu Thr Tyr
275 280 285
Thr Leu Leu Pro Asp Ile Leu His Lys Thr Lys Asp Ser Val Gly Met
290 295 300
Leu Glu Cys Val Asp Leu Tyr Pro Val Ala Thr Thr Gly Asn Gln Ile
305 310 315 320
Gly Asn Gly Leu Glu Met Lys Gly Gly Ser Gly Lys Gly Ile Lys His
325 330 335
Val Leu Lys Ala Ser Met Asp Asp Glu Arg His Asp Tyr Tyr Ala Ile
340 345 350
Gly Thr Phe Asp Leu Glu Ser Phe Ser Trp Val Pro Asp Asp Asp Thr
355 360 365
Ile Asp Val Gly Val Gly Leu Arg Tyr Asp Tyr Gly Lys Phe Tyr Ala
370 375 380
Ser Lys Thr Phe Tyr Asp Gln Glu Lys Lys Arg Arg Ile Leu Trp Gly
385 390 395 400
Tyr Val Gly Glu Val Asp Ser Lys Ala Asp Asp Ile Leu Lys Gly Trp
405 410 415
Ala Ser Val Gln Asn Ile Ala Arg Thr Ile Leu Phe Asp Ala Lys Thr
420 425 430
Arg Ser Asn Leu Leu Val Trp Pro Val Glu Glu Leu Asp Ala Leu Arg
435 440 445
Thr Ser Gly Lys Glu Phe Asn Gly Val Val Val Glu Pro Gly Ser Thr
450 455 460
Tyr His Leu Asp Val Gly Thr Ala Thr Gln Leu Asp Ile Glu Ala Glu
465 470 475 480
Phe Glu Ile Asn Lys Glu Ala Val Asp Ala Val Val Glu Ala Asp Val
485 490 495
Thr Tyr Asn Cys Ser Thr Ser Asp Gly Ala Ala His Arg Gly Leu Leu
500 505 510
Gly Pro Phe Gly Leu Leu Val Leu Ala Asn Glu Lys Met Thr Glu Lys
515 520 525
Thr Ala Thr Tyr Phe Tyr Val Ser Arg Asn Val Asp Gly Gly Leu Gln
530 535 540
Thr His Phe Cys Gln Asp Glu Leu Arg Ser Ser Lys Ala Asn Asp Ile
545 550 555 560
Thr Lys Arg Val Val Gly His Thr Val Pro Val Leu His Gly Glu Thr
565 570 575
Phe Ser Leu Arg Ile Leu Val Asp His Ser Ile Val Glu Ser Phe Ala
580 585 590
Gln Lys Gly Arg Ala Val Ala Thr Ser Arg Val Tyr Pro Thr Glu Ala
595 600 605
Ile Tyr Asp Ser Thr Arg Val Phe Leu Phe Asn Asn Ala Thr Ser Ala
610 615 620
Thr Val Thr Ala Lys Ser Val Lys Ile Trp His Met Asn Ser Thr His
625 630 635 640
Asn His Pro Phe Pro Gly Phe Pro Ala Pro
645 650
<210> 48
<211> 561
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 48
Asp Asp Pro Pro Ser Asp Ser Glu Asp Tyr Pro Trp Thr Asn Glu Met
1 5 10 15
Leu Lys Trp Gln Arg Thr Gly Tyr His Phe Gln Pro Pro Asn His Phe
20 25 30
Met Ala Asp Pro Asn Ala Ala Met Tyr Tyr Lys Gly Trp Tyr His Phe
35 40 45
Phe Tyr Gln Tyr Asn Pro Asn Gly Ser Ala Trp Asp Tyr Ser Ile Ser
50 55 60
Trp Gly His Ala Val Ser Lys Asp Met Ile His Trp Leu His Leu Pro
65 70 75 80
Val Ala Met Val Pro Asp His Trp Tyr Asp Ser Lys Gly Val Trp Ser
85 90 95
Gly Tyr Ala Thr Thr Leu Pro Asp Gly Arg Ile Ile Val Leu Tyr Thr
100 105 110
Gly Gly Thr Asp Gln Leu Val Gln Val Gln Asn Leu Ala Glu Pro Ala
115 120 125
Asp Pro Ser Asp Pro Leu Leu Ile Glu Trp Lys Lys Ser Asn Gly Asn
130 135 140
Pro Ile Leu Met Pro Pro Pro Gly Val Gly Pro His Asp Phe Arg Asp
145 150 155 160
Pro Phe Pro Val Trp Tyr Asn Glu Ser Asp Ser Thr Trp His Met Leu
165 170 175
Ile Gly Ser Lys Asp Asp Asn His Tyr Gly Thr Val Leu Ile Tyr Thr
180 185 190
Thr Lys Asp Phe Glu Thr Tyr Thr Leu Leu Pro Asp Ile Leu His Lys
195 200 205
Thr Lys Asp Ser Val Gly Met Leu Glu Cys Val Asp Leu Tyr Pro Val
210 215 220
Ala Thr Thr Gly Asn Gln Ile Gly Asn Gly Leu Glu Met Lys Gly Gly
225 230 235 240
Ser Gly Lys Gly Ile Lys His Val Leu Lys Ala Ser Met Asp Asp Glu
245 250 255
Arg His Asp Tyr Tyr Ala Ile Gly Thr Phe Asp Leu Glu Ser Phe Ser
260 265 270
Trp Val Pro Asp Asp Asp Thr Ile Asp Val Gly Val Gly Leu Arg Tyr
275 280 285
Asp Tyr Gly Lys Phe Tyr Ala Ser Lys Thr Phe Tyr Asp Gln Glu Lys
290 295 300
Lys Arg Arg Ile Leu Trp Gly Tyr Val Gly Glu Val Asp Ser Lys Ala
305 310 315 320
Asp Asp Ile Leu Lys Gly Trp Ala Ser Val Gln Asn Ile Ala Arg Thr
325 330 335
Ile Leu Phe Asp Ala Lys Thr Arg Ser Asn Leu Leu Val Trp Pro Val
340 345 350
Glu Glu Leu Asp Ala Leu Arg Thr Ser Gly Lys Glu Phe Asn Gly Val
355 360 365
Val Val Glu Pro Gly Ser Thr Tyr His Leu Asp Val Gly Thr Ala Thr
370 375 380
Gln Leu Asp Ile Glu Ala Glu Phe Glu Ile Asn Lys Glu Ala Val Asp
385 390 395 400
Ala Val Val Glu Ala Asp Val Thr Tyr Asn Cys Ser Thr Ser Asp Gly
405 410 415
Ala Ala His Arg Gly Leu Leu Gly Pro Phe Gly Leu Leu Val Leu Ala
420 425 430
Asn Glu Lys Met Thr Glu Lys Thr Ala Thr Tyr Phe Tyr Val Ser Arg
435 440 445
Asn Val Asp Gly Gly Leu Gln Thr His Phe Cys Gln Asp Glu Leu Arg
450 455 460
Ser Ser Lys Ala Asn Asp Ile Thr Lys Arg Val Val Gly His Thr Val
465 470 475 480
Pro Val Leu His Gly Glu Thr Phe Ser Leu Arg Ile Leu Val Asp His
485 490 495
Ser Ile Val Glu Ser Phe Ala Gln Lys Gly Arg Ala Val Ala Thr Ser
500 505 510
Arg Val Tyr Pro Thr Glu Ala Ile Tyr Asp Ser Thr Arg Val Phe Leu
515 520 525
Phe Asn Asn Ala Thr Ser Ala Thr Val Thr Ala Lys Ser Val Lys Ile
530 535 540
Trp His Met Asn Ser Thr His Asn His Pro Phe Pro Gly Phe Pro Ala
545 550 555 560
Pro
<210> 49
<211> 651
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 49
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Asn Leu Met Arg Leu Arg Glu
85 90 95
Asn Asp Tyr Pro Trp Thr Asn Asp Met Leu Arg Trp Gln Arg Thr Gly
100 105 110
Phe His Phe Gln Pro Glu Lys Asn Phe Gln Ala Asp Pro Asn Ala Ala
115 120 125
Met Phe Tyr Lys Gly Trp Tyr His Phe Phe Tyr Gln Tyr Asn Pro Thr
130 135 140
Gly Val Ala Trp Asp Tyr Thr Ile Ser Trp Gly His Ala Val Ser Lys
145 150 155 160
Asp Leu Leu His Trp Asn Tyr Leu Pro Met Ala Leu Arg Pro Asp His
165 170 175
Trp Tyr Asp Arg Lys Gly Val Trp Ser Gly Tyr Ser Thr Leu Leu Pro
180 185 190
Asp Gly Arg Ile Val Val Leu Tyr Thr Gly Gly Thr Lys Glu Leu Val
195 200 205
Gln Val Gln Asn Leu Ala Val Pro Val Asn Leu Ser Asp Pro Leu Leu
210 215 220
Leu Glu Trp Lys Lys Ser His Val Asn Pro Ile Leu Val Pro Pro Pro
225 230 235 240
Gly Ile Glu Asp His Asp Phe Arg Asp Pro Phe Pro Val Trp Tyr Asn
245 250 255
Glu Ser Asp Ser Arg Trp His Val Val Ile Gly Ser Lys Asp Pro Glu
260 265 270
His Tyr Gly Ile Val Leu Ile Tyr Thr Thr Lys Asp Phe Val Asn Phe
275 280 285
Thr Leu Leu Pro Asn Ile Leu His Ser Thr Lys Gln Pro Val Gly Met
290 295 300
Leu Glu Cys Val Asp Leu Phe Pro Val Ala Thr Thr Asp Ser Arg Ala
305 310 315 320
Asn Gln Ala Leu Asp Met Thr Thr Met Arg Pro Gly Pro Gly Leu Lys
325 330 335
Tyr Val Leu Lys Ala Ser Met Asp Asp Glu Arg His Asp Tyr Tyr Ala
340 345 350
Leu Gly Ser Phe Asp Leu Asp Ser Phe Thr Phe Thr Pro Asp Asp Glu
355 360 365
Thr Ile Asp Val Gly Val Gly Leu Arg Tyr Asp Trp Gly Lys Phe Tyr
370 375 380
Ala Ser Lys Thr Phe Tyr Asp Gln Glu Lys His Arg Arg Val Leu Trp
385 390 395 400
Gly Tyr Val Gly Glu Val Asp Ser Lys Arg Asp Asp Ala Leu Lys Gly
405 410 415
Trp Ala Ser Leu Gln Asn Ile Pro Arg Thr Ile Leu Phe Asp Thr Lys
420 425 430
Thr Lys Ser Asn Leu Ile Leu Trp Pro Val Glu Glu Val Glu Ser Leu
435 440 445
Arg Thr Ile Asn Lys Asn Phe Asn Ser Ile Pro Leu Tyr Pro Gly Ser
450 455 460
Thr Tyr Gln Leu Asp Val Gly Glu Ala Thr Gln Leu Asp Ile Val Ala
465 470 475 480
Glu Phe Glu Val Asp Glu Lys Ala Ile Glu Ala Thr Ala Glu Ala Asp
485 490 495
Val Thr Tyr Asn Cys Ser Thr Ser Gly Gly Ala Ala Asn Arg Gly Val
500 505 510
Leu Gly Pro Phe Gly Leu Leu Val Leu Ala Asn Gln Glu Leu Ser Glu
515 520 525
Gln Thr Ala Thr Tyr Phe Tyr Val Ser Arg Gly Ile Asp Gly Asn Leu
530 535 540
Arg Thr His Phe Cys Gln Asp Glu Leu Arg Ser Ser Lys Ala Gly Ala
545 550 555 560
Ile Thr Lys Arg Val Val Gly Ser Thr Val Pro Val Leu His Gly Glu
565 570 575
Thr Trp Ala Leu Arg Ile Leu Val Asp His Ser Ile Val Glu Ser Phe
580 585 590
Ala Gln Arg Gly Arg Ala Val Ala Thr Ser Arg Val Tyr Pro Thr Glu
595 600 605
Ala Ile Tyr Ser Ser Ala Arg Val Phe Leu Phe Asn Asn Ala Thr Asp
610 615 620
Ala Ile Val Thr Ala Lys Thr Val Asn Val Trp His Met Asn Ser Thr
625 630 635 640
Tyr Asn His Val Phe Pro Gly Leu Val Ala Pro
645 650
<210> 50
<211> 562
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 50
Asn Leu Met Arg Leu Arg Glu Asn Asp Tyr Pro Trp Thr Asn Asp Met
1 5 10 15
Leu Arg Trp Gln Arg Thr Gly Phe His Phe Gln Pro Glu Lys Asn Phe
20 25 30
Gln Ala Asp Pro Asn Ala Ala Met Phe Tyr Lys Gly Trp Tyr His Phe
35 40 45
Phe Tyr Gln Tyr Asn Pro Thr Gly Val Ala Trp Asp Tyr Thr Ile Ser
50 55 60
Trp Gly His Ala Val Ser Lys Asp Leu Leu His Trp Asn Tyr Leu Pro
65 70 75 80
Met Ala Leu Arg Pro Asp His Trp Tyr Asp Arg Lys Gly Val Trp Ser
85 90 95
Gly Tyr Ser Thr Leu Leu Pro Asp Gly Arg Ile Val Val Leu Tyr Thr
100 105 110
Gly Gly Thr Lys Glu Leu Val Gln Val Gln Asn Leu Ala Val Pro Val
115 120 125
Asn Leu Ser Asp Pro Leu Leu Leu Glu Trp Lys Lys Ser His Val Asn
130 135 140
Pro Ile Leu Val Pro Pro Pro Gly Ile Glu Asp His Asp Phe Arg Asp
145 150 155 160
Pro Phe Pro Val Trp Tyr Asn Glu Ser Asp Ser Arg Trp His Val Val
165 170 175
Ile Gly Ser Lys Asp Pro Glu His Tyr Gly Ile Val Leu Ile Tyr Thr
180 185 190
Thr Lys Asp Phe Val Asn Phe Thr Leu Leu Pro Asn Ile Leu His Ser
195 200 205
Thr Lys Gln Pro Val Gly Met Leu Glu Cys Val Asp Leu Phe Pro Val
210 215 220
Ala Thr Thr Asp Ser Arg Ala Asn Gln Ala Leu Asp Met Thr Thr Met
225 230 235 240
Arg Pro Gly Pro Gly Leu Lys Tyr Val Leu Lys Ala Ser Met Asp Asp
245 250 255
Glu Arg His Asp Tyr Tyr Ala Leu Gly Ser Phe Asp Leu Asp Ser Phe
260 265 270
Thr Phe Thr Pro Asp Asp Glu Thr Ile Asp Val Gly Val Gly Leu Arg
275 280 285
Tyr Asp Trp Gly Lys Phe Tyr Ala Ser Lys Thr Phe Tyr Asp Gln Glu
290 295 300
Lys His Arg Arg Val Leu Trp Gly Tyr Val Gly Glu Val Asp Ser Lys
305 310 315 320
Arg Asp Asp Ala Leu Lys Gly Trp Ala Ser Leu Gln Asn Ile Pro Arg
325 330 335
Thr Ile Leu Phe Asp Thr Lys Thr Lys Ser Asn Leu Ile Leu Trp Pro
340 345 350
Val Glu Glu Val Glu Ser Leu Arg Thr Ile Asn Lys Asn Phe Asn Ser
355 360 365
Ile Pro Leu Tyr Pro Gly Ser Thr Tyr Gln Leu Asp Val Gly Glu Ala
370 375 380
Thr Gln Leu Asp Ile Val Ala Glu Phe Glu Val Asp Glu Lys Ala Ile
385 390 395 400
Glu Ala Thr Ala Glu Ala Asp Val Thr Tyr Asn Cys Ser Thr Ser Gly
405 410 415
Gly Ala Ala Asn Arg Gly Val Leu Gly Pro Phe Gly Leu Leu Val Leu
420 425 430
Ala Asn Gln Glu Leu Ser Glu Gln Thr Ala Thr Tyr Phe Tyr Val Ser
435 440 445
Arg Gly Ile Asp Gly Asn Leu Arg Thr His Phe Cys Gln Asp Glu Leu
450 455 460
Arg Ser Ser Lys Ala Gly Ala Ile Thr Lys Arg Val Val Gly Ser Thr
465 470 475 480
Val Pro Val Leu His Gly Glu Thr Trp Ala Leu Arg Ile Leu Val Asp
485 490 495
His Ser Ile Val Glu Ser Phe Ala Gln Arg Gly Arg Ala Val Ala Thr
500 505 510
Ser Arg Val Tyr Pro Thr Glu Ala Ile Tyr Ser Ser Ala Arg Val Phe
515 520 525
Leu Phe Asn Asn Ala Thr Asp Ala Ile Val Thr Ala Lys Thr Val Asn
530 535 540
Val Trp His Met Asn Ser Thr Tyr Asn His Val Phe Pro Gly Leu Val
545 550 555 560
Ala Pro
<210> 51
<211> 650
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 51
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Asp Asp Pro Pro Ser Asp Ser
85 90 95
Glu Asp Tyr Pro Trp Thr Asn Glu Met Leu Lys Trp Gln Arg Thr Gly
100 105 110
Tyr His Phe Gln Pro Pro Asn His Phe Met Ala Asp Pro Asn Ala Ala
115 120 125
Met Tyr Tyr Lys Gly Trp Tyr His Phe Phe Tyr Gln Tyr Asn Pro Asn
130 135 140
Gly Ser Ala Trp Asp Tyr Ser Ile Ser Trp Gly His Ala Val Ser Lys
145 150 155 160
Asp Met Ile His Trp Leu His Leu Pro Val Ala Met Val Pro Asp His
165 170 175
Trp Tyr Asp Ser Lys Gly Val Trp Ser Gly Tyr Ala Thr Thr Leu Pro
180 185 190
Asp Gly Arg Ile Ile Val Leu Tyr Thr Gly Gly Thr Asp Gln Leu Val
195 200 205
Gln Val Gln Asn Leu Ala Glu Pro Ala Asp Pro Ser Asp Pro Leu Leu
210 215 220
Ile Glu Trp Lys Lys Ser Asn Gly Asn Pro Ile Leu Met Pro Pro Pro
225 230 235 240
Gly Val Gly Pro His Asp Phe Arg Asp Pro Phe Pro Val Trp Tyr Asn
245 250 255
Glu Ser Asp Ser Thr Trp His Met Leu Ile Gly Ser Lys Asp Asp Asn
260 265 270
His Tyr Gly Thr Val Leu Ile Tyr Thr Thr Lys Asp Phe Glu Thr Tyr
275 280 285
Thr Leu Leu Pro Asp Ile Leu His Lys Thr Lys Asp Ser Val Gly Met
290 295 300
Leu Glu Cys Val Asp Leu Tyr Pro Val Ala Thr Thr Gly Asn Gln Ile
305 310 315 320
Gly Asn Gly Leu Glu Met Lys Gly Gly Ser Gly Lys Gly Ile Lys His
325 330 335
Val Leu Lys Ala Ser Met Asp Asp Glu Arg His Asp Tyr Tyr Ala Ile
340 345 350
Gly Thr Phe Asp Leu Glu Ser Phe Ser Trp Val Pro Asp Asp Asp Thr
355 360 365
Ile Asp Val Gly Val Gly Leu Arg Tyr Asp Tyr Gly Lys Phe Tyr Ala
370 375 380
Ser Lys Thr Phe Tyr Asp Gln Glu Lys Lys Arg Arg Ile Leu Trp Gly
385 390 395 400
Tyr Val Gly Glu Val Asp Ser Lys Ala Asp Asp Ile Leu Lys Gly Trp
405 410 415
Ala Ser Val Gln Asn Ile Ala Arg Thr Ile Leu Phe Asp Ala Lys Thr
420 425 430
Arg Ser Asn Leu Leu Val Trp Pro Val Glu Glu Leu Asp Ala Leu Arg
435 440 445
Thr Ser Gly Lys Glu Phe Asn Gly Val Val Val Glu Pro Gly Ser Thr
450 455 460
Tyr His Leu Asp Val Gly Thr Ala Thr Gln Leu Asp Ile Glu Ala Glu
465 470 475 480
Phe Glu Ile Asn Lys Glu Ala Val Asp Ala Val Val Glu Ala Asp Val
485 490 495
Thr Tyr Asn Cys Ser Thr Ser Asp Gly Ala Ala His Arg Gly Leu Leu
500 505 510
Gly Pro Phe Gly Leu Leu Val Leu Ala Asn Glu Lys Met Thr Glu Lys
515 520 525
Thr Ala Thr Tyr Phe Tyr Val Ser Arg Asn Ala Asp Gly Gly Leu Gln
530 535 540
Thr His Phe Cys Gln Asp Glu Leu Arg Ser Ser Lys Ala Asn Asp Ile
545 550 555 560
Thr Lys Arg Val Val Gly His Thr Val Pro Val Leu His Gly Glu Thr
565 570 575
Phe Ser Leu Arg Ile Leu Val Asp His Ser Ile Val Glu Ser Phe Ala
580 585 590
Gln Lys Gly Arg Ala Val Ala Thr Ser Arg Val Tyr Pro Thr Glu Ala
595 600 605
Ile Tyr Asp Ser Thr Arg Val Phe Leu Phe Asn Asn Ala Thr Ser Ala
610 615 620
Thr Val Thr Ala Lys Ser Val Lys Ile Trp His Met Asn Ser Thr His
625 630 635 640
Asn His Pro Phe Pro Gly Phe Pro Ala Pro
645 650
<210> 52
<211> 561
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 52
Asp Asp Pro Pro Ser Asp Ser Glu Asp Tyr Pro Trp Thr Asn Glu Met
1 5 10 15
Leu Lys Trp Gln Arg Thr Gly Tyr His Phe Gln Pro Pro Asn His Phe
20 25 30
Met Ala Asp Pro Asn Ala Ala Met Tyr Tyr Lys Gly Trp Tyr His Phe
35 40 45
Phe Tyr Gln Tyr Asn Pro Asn Gly Ser Ala Trp Asp Tyr Ser Ile Ser
50 55 60
Trp Gly His Ala Val Ser Lys Asp Met Ile His Trp Leu His Leu Pro
65 70 75 80
Val Ala Met Val Pro Asp His Trp Tyr Asp Ser Lys Gly Val Trp Ser
85 90 95
Gly Tyr Ala Thr Thr Leu Pro Asp Gly Arg Ile Ile Val Leu Tyr Thr
100 105 110
Gly Gly Thr Asp Gln Leu Val Gln Val Gln Asn Leu Ala Glu Pro Ala
115 120 125
Asp Pro Ser Asp Pro Leu Leu Ile Glu Trp Lys Lys Ser Asn Gly Asn
130 135 140
Pro Ile Leu Met Pro Pro Pro Gly Val Gly Pro His Asp Phe Arg Asp
145 150 155 160
Pro Phe Pro Val Trp Tyr Asn Glu Ser Asp Ser Thr Trp His Met Leu
165 170 175
Ile Gly Ser Lys Asp Asp Asn His Tyr Gly Thr Val Leu Ile Tyr Thr
180 185 190
Thr Lys Asp Phe Glu Thr Tyr Thr Leu Leu Pro Asp Ile Leu His Lys
195 200 205
Thr Lys Asp Ser Val Gly Met Leu Glu Cys Val Asp Leu Tyr Pro Val
210 215 220
Ala Thr Thr Gly Asn Gln Ile Gly Asn Gly Leu Glu Met Lys Gly Gly
225 230 235 240
Ser Gly Lys Gly Ile Lys His Val Leu Lys Ala Ser Met Asp Asp Glu
245 250 255
Arg His Asp Tyr Tyr Ala Ile Gly Thr Phe Asp Leu Glu Ser Phe Ser
260 265 270
Trp Val Pro Asp Asp Asp Thr Ile Asp Val Gly Val Gly Leu Arg Tyr
275 280 285
Asp Tyr Gly Lys Phe Tyr Ala Ser Lys Thr Phe Tyr Asp Gln Glu Lys
290 295 300
Lys Arg Arg Ile Leu Trp Gly Tyr Val Gly Glu Val Asp Ser Lys Ala
305 310 315 320
Asp Asp Ile Leu Lys Gly Trp Ala Ser Val Gln Asn Ile Ala Arg Thr
325 330 335
Ile Leu Phe Asp Ala Lys Thr Arg Ser Asn Leu Leu Val Trp Pro Val
340 345 350
Glu Glu Leu Asp Ala Leu Arg Thr Ser Gly Lys Glu Phe Asn Gly Val
355 360 365
Val Val Glu Pro Gly Ser Thr Tyr His Leu Asp Val Gly Thr Ala Thr
370 375 380
Gln Leu Asp Ile Glu Ala Glu Phe Glu Ile Asn Lys Glu Ala Val Asp
385 390 395 400
Ala Val Val Glu Ala Asp Val Thr Tyr Asn Cys Ser Thr Ser Asp Gly
405 410 415
Ala Ala His Arg Gly Leu Leu Gly Pro Phe Gly Leu Leu Val Leu Ala
420 425 430
Asn Glu Lys Met Thr Glu Lys Thr Ala Thr Tyr Phe Tyr Val Ser Arg
435 440 445
Asn Ala Asp Gly Gly Leu Gln Thr His Phe Cys Gln Asp Glu Leu Arg
450 455 460
Ser Ser Lys Ala Asn Asp Ile Thr Lys Arg Val Val Gly His Thr Val
465 470 475 480
Pro Val Leu His Gly Glu Thr Phe Ser Leu Arg Ile Leu Val Asp His
485 490 495
Ser Ile Val Glu Ser Phe Ala Gln Lys Gly Arg Ala Val Ala Thr Ser
500 505 510
Arg Val Tyr Pro Thr Glu Ala Ile Tyr Asp Ser Thr Arg Val Phe Leu
515 520 525
Phe Asn Asn Ala Thr Ser Ala Thr Val Thr Ala Lys Ser Val Lys Ile
530 535 540
Trp His Met Asn Ser Thr His Asn His Pro Phe Pro Gly Phe Pro Ala
545 550 555 560
Pro
<210> 53
<211> 1935
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 53
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120
tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180
aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240
tctctcgaga aaagagaggc tgaagctgac gaagaggctg ccggtggatt tccctggtca 300
aacgaaatgt tacaatggca gagatccggt taccacttcc aaacagcaaa aaattatatg 360
tctgatccta acggcctaat gtactatagg ggttggtacc atatgttctt ccaatacaac 420
ccagtcggga ctgattggga cgacggtatg gaatggggtc acgctgtgtc gcgtaatttg 480
gtacaatgga gaacgttgcc aatagctatg gttgccgatc aatggtatga tattctgggt 540
gttctttctg gttctatgac cgtcttgcca aacggtactg ttatcatgat ctacaccggt 600
gctactaatg cgagcgctgt cgaagttcaa tgtattgcaa ccccagccga tccgaacgac 660
cctttgttaa gaagatggac taagcatcca gctaaccctg tgatctggag tccaccaggt 720
gtagggacaa aggattttcg agactccatg accgcttggt acgacgagtc agatgacact 780
tggagaacct tgttgggctc caaggacgat aacaatggtc accatgatgg tattgctatg 840
atgtataaaa ctaaggattt cctaaattac gaacttatcc caggcatact gcaccgtgtc 900
gaaaggacag gtgaatggga atgcatcgac ttttacccgg ttggtcatag aacgtctgat 960
aactctagcg aaatgttgca cgttttgaaa gcctctatgg atgacgaacg gcacgattat 1020
tactccttag gtacttacga tagtgctgcc aacagatgga ccccaattga ccccgaacta 1080
gacttgggta ttggattgag atatgattgg ggtaagtttt acgctagcac ttcattctac 1140
gatccagcaa agaaacgtcg agtcttaatg ggatatgttg gtgaggttga ctccaagaga 1200
gctgacgtcg tgaagggttg ggcttctatc caatctgttc caagaacaat tgcattggac 1260
gaaaagacta gaaccaacct gctgttatgg cccgttgagg aaatcgaaac attgagacta 1320
aatgctaccc aactctcgga tgtcaccttg aatactggtt ctgtcattca tattcctttg 1380
agacaaggta cccagttgga tatagaagct acattccacc ttgatgcctc cgctgttgcc 1440
gctttaaacg aagcggacgt cggttacaac tgttcctctt ctggtggtgc tgtgaataga 1500
ggagctttgg gtccattcgg tttgttagtt ctcgcggctg gagacagacg tggtgagcaa 1560
actgctgttt acttttatgt tagtagaggt ttggacggcg gtttgcatac ctccttctgt 1620
caagatgaac tcagaagttc ccgcgcgaag gatgttacta aaagagtcat cggttcgact 1680
gtcccggttc ttgacggcga agcattctct atgagggttt tagttgatca ttcgattgtc 1740
caaggttttg caatgggtgg tagaactacg atgacatctc gggtctatcc aatggaagct 1800
taccaggagg ccaaggttta cctctttaac aacgctaccg gagcatccgt taccgctgaa 1860
agacttgtag ttcacgatat ggactcagcc cataatcaat tgtctaacat ggacgactac 1920
tcatatgtac agtaa 1935
<210> 54
<211> 1935
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 54
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120
tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180
aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240
tctctcgaga aaagagaggc tgaagctgac gaagaggctg ccggtggatt tccctggtca 300
aacgaaatgt tacaatggca gagatccggt taccacttcc aaacagcaaa aaattatatg 360
tctgatccta acggcctaat gtactatagg ggttggaacc atatgttctt ccaatacaat 420
ccagtcggga ctgattggga cgacggtatg gaatggggtc acgctgtgtc gcgtaacttg 480
gtacaatgga gaacgttgcc aatagctatg gttgccgatc aatggtacga tattctgggt 540
gttctttctg gttctatgac cgtcttgcca aatggtactg ttatcatgat ctataccggt 600
gctactaacg cgagcgctgt cgaagttcaa tgtattgcaa ccccagccga tccgacggac 660
cctttgttaa gaagatggac taagcatcca gctaaccctg tgatctggag tccaccaggt 720
gtagggacaa aggattttcg agatccaatg accgcttggt acgacgaatc agacgatact 780
tggagaacgc tattgggctc taaggatgac aataatggtc accacgacgg tattgctatg 840
atgtacaaaa ctaaggattt cttgaactac gagctgattc ctggtatcct ccatagagtt 900
gaaagaacag gagaatggga atgcatagac ttttatccgg tcggtcgtag aacctctgat 960
aactcgtccg aaatgttgca tgttttaaag gcttccatgg atgacgagag acacgactac 1020
tactctctag gtacttatga tagtgccgcc aataggtgga ctccaattga cccagaattg 1080
gatttgggta ttggtttgag atatgactgg gggaaattct acgcttccac cagcttctat 1140
gatcccgcaa agaagagaag agttttgatg ggttacgtcg gtgaagtgga ctctaaacgc 1200
gctgacgttg ttaagggttg ggcctctatc caaagtgtcc cacgcaccat tgctctggac 1260
gaaaaaactc gtacaaacct tttattgtgg ccagtagaag aaatcgaaac cttaagattg 1320
aacgctactg agttgtccga cgttacttta aacactggtt ccgtcatcca cattccattg 1380
agacagggaa cccaattgga tattgaagca acctttcatc tcgatgcgag tgctgttgca 1440
gctttcaatg aagctgatgt cggttacaat tgttcatctt cgggtggtgc tgttaataga 1500
ggtgctctag ggcctttcgg cctcttagtc ttggctgccg gtgatagaag aggtgaacaa 1560
accgctgttt acttttacgt atctcgtggt ttggacggcg gtctacacac ctctttttgt 1620
caggatgagt taagatcctc aagggctaag gacgttacta agagagtcat aggatcaact 1680
gtgcccgttt tggatggtga agccttttct atgcgtgtac ttgttgatca ttccatagtc 1740
caaggtttcg caatgggtgg tagaacaact atgacgagca gagtttatcc aatggaagcg 1800
taccaagaag ctaaggttta tcttttcaac aacgcaacag gtgcctctgt tacagccgag 1860
agattggtcg tacacgaaat ggactccgcc cacaaccaat tgtcgaacat ggacgaccac 1920
tcgtatgttc aataa 1935
<210> 55
<211> 1950
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 55
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120
tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180
aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240
tctctcgaga aaagagaggc tgaagctagt ggcccttatt ctgcttcggg tggttttcca 300
tggtctaatg ccatgttgca gtggcaacgt acaggatacc acttccaacc cgaaaaaaac 360
taccaaaacg acccaaacgg tccagtctac tataagggtt ggtatcattt cttttaccaa 420
cataatccag gtggtaccgg gtggggtaac atctcatggg gtcacgcagt ttccagagat 480
atggtacact ggaggcattt accactagct atggttcctg agcattggta cgatatagaa 540
ggtgttttga ctggaagcat tactgtcctt ccagacggta gagtcatttt gttatatacc 600
ggcaatactg aaacgttcgc tcaagtgacc tgtttggcgg aggctgccga cccttccgat 660
ccactgttga gagaatgggc taagcacccg gccaacccag tagtttaccc gccaccaggt 720
atcggtatga aagactacag agatccaact acagcttggt tcgataactc agacaatacc 780
tggagaataa tcattggttc taagaatgat actgatcact ctggtatcgt ttttacttac 840
aagaccaagg acttcgtcag ctacgaactg attcctggat acctatatag aggtccagcc 900
gggacgggta tgtacgaatg cattgatttg ttcgctgttg gtggtgggcg tgctgcatca 960
gatatgtata actctaccgc tgaagatgtc ttatacgttt tgaaagaatc ctccgacgac 1020
gacagacggg attactatgc cttagggcga tttgacgctg ccgctaatac ttggacaccc 1080
atagatacag aaagagagtt gggtgtcgca ctcagatatg attacggtag atacgatact 1140
tctaagtctt tctacgaccc agttaagcaa aggagaattg tctggggtta cgttgtcgaa 1200
accgacagtt ggtccgctga cgctgcaaaa ggttgggcta acctgcaatc tatccctaga 1260
actgttgaat tggatgaaaa gactcgaaca aaccttgtac agtggccagt gggtgagttg 1320
aacaccctac gtatcaatac cactgatttg agtgacatta ccgttggtgc tggctcggtc 1380
gattctttac ccttgcacca aacttcccaa ctagacatcg aagcgtcatt tagaattaat 1440
gcctctacta tagaagcctt gaacgaagtt gatgtaggtt ataactgtac tatgacgtct 1500
ggtgctgcta ctagaggtgc tttgggtcca ttcggaattt tagtcttggc taacgtggcc 1560
ttgacagaac agaccgctgt ttatttttat gtttccaagg gtttagacgg tggtttacga 1620
acccacttct gtcatgacga attgaggtct acacacgcta ccgacgtcgc caaggaggtt 1680
gttgggtcta ctgttccagt tctcgatggt gaagatttta gcgtcagagt tttggtcgat 1740
cactcaatcg tacaatcttt cgtcatgggt ggcagaatga cagcaacttc cagagcttac 1800
ccgactgaag caatctatgc tgccgctggc gtttacctct tcaacaatgc tacaggtgct 1860
tccattaccg cagaaaaatt ggtggtacat gacatggatt cctcctacaa cagaatcttt 1920
actgacgagg atttattggt gcttgactaa 1950
<210> 56
<211> 1935
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 56
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120
tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180
aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240
tctctcgaga aaagagaggc tgaagctgca aatgcttttc cttggtcgaa cgctatgttg 300
cagtggcaac gtactggctt ccatttccaa ccagacaaat actatcaaaa cgatccaaac 360
ggtcccgtct actacggagg ttggtatcac tttttctacc aatataatcc gtctggtagt 420
gtttgggagc cacaaattgt atggggtcac gccgtttcca aggacctgat ccattggcgg 480
cacttaccac cagctttggt cccagatcaa tggtacgaca taaagggtgt tctaaccggg 540
tcaattacgg tccttcctga tggtaaggtg atcttgttat atactggtaa tacagaaacc 600
ttcgctcaag ttacttgctt ggccgaaccc gcagatccaa gcgatccatt gctcagagaa 660
tgggtaaagc atcctgctaa cccagttgtc tttccaccac ccggtattgg tatgaaagac 720
ttcagagatc caaccactgc ttggtacgac gaatctgacg gcacatggag aaccatcatt 780
ggatctaaaa acgactccga ccactctggt atcgtttttt cctacaagac taaggatttc 840
attagttatg agttgatgcc gggttacatg tacagaggcc caaaggggac cggtgaatac 900
gaatgtatag atttatacgc ggtgggtggt ggtaggaagg cttctgatat gtataactcc 960
actgcggaag atgtcctata tgttttaaaa gaatcatctg acgatgatag acatgactgg 1020
tactcattgg gtagatttga cgccgctgct aataagtgga cacctataga tactgagctt 1080
gaacttggcg ttggtttgcg atatgactgg ggtaagtact acgccagcaa gtctttctac 1140
gacccagtta aaaaaagacg tgtcgtgtgg gcttatgtcg gtgaaaccga ttccgaaaga 1200
gccgacatca ccaagggttg ggcaaatttg cagtctatcc cacgcactgt tgaattggac 1260
gaaaaaacta gaacgaactt aattcaatgg ccggttgagg aactaaatac actgcgtatt 1320
aacactacag atttgtcggg aatcaccgta ggtgctggta gtgtcgcttt cttgccattg 1380
caccaaactg cccagctcga cattgaagct acttttagaa ttgatgcttc tgcgatagaa 1440
gctctaaacg aagctgatgt ttcctacaat tgtaccacat cgcgaggagc tgctaccaga 1500
ggtgccttag gtccattcgg tttgttggta ttagccaacc atgccttgac cgaacaaact 1560
ggtgtttact tttacgtgtc taagggtttg gacggtggtt taagaactca cttctgtcac 1620
gatgaactaa gatcctctca tgcttcagat gtcgttaaga gagtcgtggg tagtacggtt 1680
cctgttttgg atggggagga ctttagcgtt cgtgtcttgg ttgaccactc tattgtccaa 1740
agtttcgcca tgggtggtag gttgacagct acctccagag cttatccaac tgaagcaatc 1800
tacgctgcgg caggcgtata catgttcaac aacgctacag gtacttccgt tacggctgaa 1860
aagcttgttg tccacgatat ggattcttcc tacaaccaca tctataccga cggtgacctg 1920
gtggtagttg attaa 1935
<210> 57
<211> 1953
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 57
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120
tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180
aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240
tctctcgaga aaagagaggc tgaagctgac gatcctccat ctgatagtga agattaccca 300
tggaccaatg agatgcttaa atggcaaagg acgggttatc acttccagcc cccaaaccat 360
tttatggcag acccaaacgc cgctatgtac tacaaggggt ggtatcactt cttttaccaa 420
tataacccta atggttcagc ttgggactac tccatctcgt ggggtcatgc tgtatctaag 480
gacatgattc actggctgca tttaccagtc gccatggttc cagatcattg gtacgatagc 540
aaaggagttt ggtccggcta cgctactact ttgccagatg gtagaataat tgtcttgtat 600
accggtggta cagaccaatt ggttcaagtg caaaatttag ccgaaccagc ggacccttct 660
gatccactat tgatcgaatg gaagaagtca aacggaaacc caattttgat gcctccgccg 720
ggtgtaggtc cacacgattt cagagatcca ttcccagttt ggtacaacga atctgactcc 780
acatggcaca tgttgatcgg ttctaaagat gacaatcact acggtaccgt tctaatttat 840
actactaagg attttgagac atacacttta ttgccagaca tcctacataa gaccaaggac 900
tcggttggta tgttggaatg tgtcgatctt tatccagtgg ctactaccgg gaatcaaatt 960
ggtaacggtt tagaaatgaa aggtggttcc ggcaagggta tcaagcacgt cctgaaggct 1020
tctatggacg atgaacgtca cgattattac gccataggta cgttcgactt ggaatccttt 1080
agttgggttc cggacgacga taccatagat gtcggcgtcg gcttgcgcta tgactacggt 1140
aagttctacg cttcaaaaac tttctatgat caggaaaaga agagaagaat tttgtgggga 1200
tacgttggtg aagtagactc taaggctgac gacatcttaa aaggttgggc gagcgttcaa 1260
aatattgcaa gaactatcct atttgatgca aaaactagaa gtaacttgct cgtctggccc 1320
gtcgaggaat tggacgcttt gcgaacctct ggtaaggaat ttaacggtgt ggttgttgaa 1380
cctggttcta cttaccattt agacgtaggt accgccaccc aattggatat tgaagctgaa 1440
tttgagatca ataaggaagc tgttgacgct gttgtcgaag ccgatgttac atacaactgc 1500
tccacatctg atggtgctgc tcacagaggt ttgttgggac cattcggtct tttggtttta 1560
gctaatgaaa agatgacaga aaaaaccgcc acttatttct acgtcagtcg taacgttgat 1620
gggggtctac aaactcattt ctgtcaagac gagcttagaa gctctaaagc taacgatatt 1680
accaaacgtg tcgttggcca cactgttcca gttctgcatg gtgaaacctt ctccttgaga 1740
attttagtag accactcgat cgttgaatcg tttgcgcaga agggtagagc agtcgctacg 1800
tctagggtgt atccaactga agctatctac gattctacaa gagttttcct cttcaacaac 1860
gccacttcag ctacggtcac tgccaagtcc gtaaagatat ggcatatgaa cagtacccat 1920
aaccaccctt ttccaggttt ccccgcacca taa 1953
<210> 58
<211> 1956
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 58
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120
tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180
aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240
tctctcgaga aaagagaggc tgaagctaac ttgatgcgtt taagagagaa tgattatccc 300
tggactaacg acatgctaag atggcaacgc acgggatttc acttccagcc tgaaaaaaac 360
ttccaagccg acccaaatgc agctatgttt tacaagggct ggtaccattt cttttatcaa 420
tacaacccga ccggtgtggc ttgggattac acaatctcct ggggtcacgc tgtcagtaag 480
gatttgctgc attggaatta tcttccaatg gccttgaggc ctgaccactg gtacgataga 540
aaaggtgttt ggagcggtta ctctacttta ttgccagacg gtagaattgt tgtcttgtac 600
accggtggaa ctaaggaatt agttcaagtc caaaacttgg ctgtcccagt aaacctttct 660
gacccattgc tattggaatg gaagaagtca cacgttaacc caatactcgt tccacctccg 720
gggatcgagg atcatgattt ccgagatcca ttcccagtgt ggtataatga atctgactcg 780
cggtggcacg ttgtaattgg ttccaaagat ccagaacact atggtattgt cttgatctac 840
actaccaagg acttcgttaa ctttacgtta ttaccaaaca tattgcattc caccaagcag 900
ccggttggta tgctggaatg tgtagacttg ttcccagttg ctacaactga ttctcgtgca 960
aatcaagctt tggatatgac taccatgagg cccggtcctg gcctcaaata tgtgttaaag 1020
gcgagtatgg atgacgaaag acacgattac tacgccctag gtagctttga cttggactcg 1080
ttcactttta caccagatga tgaaaccatt gacgtcggtg tcggtttgag atacgactgg 1140
ggtaagttct atgcttcaaa aactttctat gaccaagaaa agcatagaag agttttatgg 1200
ggttacgtgg gggaagttga ttctaagaga gatgacgcgt taaaaggctg ggcttccttg 1260
caaaacatcc caagaacaat tttgttcgat accaaaacta agtctaatct aatcttgtgg 1320
ccagttgaag aggtcgaatc attgagaact attaacaaga attttaactc tataccactt 1380
tacccaggtt ccacttacca attggatgtt ggggaagcca cccaactgga tattgtcgct 1440
gaatttgaag tcgatgagaa ggctattgaa gcaactgctg aagctgacgt tacatataac 1500
tgctctacca gcggtggtgc cgctaacaga ggtgttttgg gtcctttcgg tctattggtt 1560
ctagccaatc aagaactttc cgaacagact gccacttact tctatgtatc gcgtggtatc 1620
gacggcaacc tgagaaccca cttttgtcaa gacgaattga gatcctccaa agccggtgct 1680
atcaccaaga gggtcgtagg ttctacagtt cctgttttgc atggtgaaac gtgggcttta 1740
cgtatcctag ttgaccactc tattgtcgag tcttttgcac aacggggacg cgccgtcgct 1800
accagtagag tatacccaac tgaggctata tactcttcgg ctagagtctt tctcttcaat 1860
aacgcaaccg atgccattgt tacagctaaa acggtcaacg tttggcatat gaatagcact 1920
tacaaccacg tctttcctgg tttggttgct ccataa 1956
<210> 59
<211> 1953
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 59
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120
tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180
aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240
tctctcgaga aaagagaggc tgaagctgac gatcctccat ctgatagtga agattaccca 300
tggaccaatg agatgcttaa atggcaaagg acgggttatc acttccagcc cccaaaccat 360
tttatggcag acccaaacgc cgctatgtac tacaaggggt ggtatcactt cttttaccaa 420
tataacccta atggttcagc ttgggactac tccatctcgt ggggtcatgc tgtatctaag 480
gacatgattc actggctgca tttaccagtc gccatggttc cagatcattg gtacgatagc 540
aaaggagttt ggtccggcta cgctactact ttgccagatg gtagaataat tgtcttgtat 600
accggtggta cagaccaatt ggttcaagtg caaaatttag ccgaaccagc ggacccttct 660
gatccactat tgatcgaatg gaagaagtca aacggaaacc caattttgat gcctccgccg 720
ggtgtaggtc cacacgattt cagagatcca ttcccagttt ggtacaacga atctgactcc 780
acatggcaca tgttgatcgg ttctaaagat gacaatcact acggtaccgt tctaatttat 840
actactaagg attttgagac atacacttta ttgccagaca tcctacataa gaccaaggac 900
tcggttggta tgttggaatg tgtcgatctt tatccagtgg ctactaccgg gaatcaaatt 960
ggtaacggtt tagaaatgaa aggtggttcc ggcaagggta tcaagcacgt cctgaaggct 1020
tctatggacg atgaacgtca cgattattac gccataggta cgttcgactt ggaatccttt 1080
agttgggttc cggacgacga taccatagat gtcggcgtcg gcttgcgcta tgactacggt 1140
aagttctacg cttcaaaaac tttctatgat caggaaaaga agagaagaat tttgtgggga 1200
tacgttggtg aagtagactc taaggctgac gacatcttaa aaggttgggc gagcgttcaa 1260
aatattgcaa gaactatcct atttgatgca aaaactagaa gtaacttgct cgtctggccc 1320
gtcgaggaat tggacgcttt gcgaacctct ggtaaggaat ttaacggtgt ggttgttgaa 1380
cctggttcta cttaccattt agacgtaggt accgccaccc aattggatat tgaagctgaa 1440
tttgagatca ataaggaagc tgttgacgct gttgtcgaag ccgatgttac atacaactgc 1500
tccacatctg atggtgctgc tcacagaggt ttgttgggac cattcggtct tttggtttta 1560
gctaatgaaa agatgacaga aaaaaccgcc acttatttct acgtcagtcg taacgctgat 1620
gggggtctac aaactcattt ctgtcaagac gagcttagaa gctctaaagc taacgatatt 1680
accaaacgtg tcgttggcca cactgttcca gttctgcatg gtgaaacctt ctccttgaga 1740
attttagtcg atcactcaat tgtcgagtcc ttcgcgcaaa agggtagggc tgttgcaacc 1800
tctcgggtgt atccaactga agccatctac gattctacga gagtttttct cttcaacaac 1860
gctacttcgg caacggtaac tgctaagtcc gtaaagatat ggcatatgaa cagtacccat 1920
aaccaccctt ttccaggttt ccccgcgcca taa 1953
<210> 60
<211> 89
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 60
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala
85
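The protein entries in this listing use three-letter residue codes with position counts on alternating rows. As a minimal sketch, assuming only that layout, the following Python converts such rows to a one-letter string and checks the residue count against the `<211>` field. The function name `listing_to_one_letter` and the variable names are illustrative and are not part of the filing; the rows are copied from SEQ ID NO: 60 above.

```python
# Three-letter to one-letter codes for the 20 standard amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def listing_to_one_letter(rows):
    """Convert sequence-listing rows to a one-letter protein string.

    Rows made up only of numbers (the position counts) are skipped.
    """
    residues = []
    for row in rows:
        tokens = row.split()
        if tokens and all(token.isdigit() for token in tokens):
            continue  # position-count row, not residues
        residues.extend(THREE_TO_ONE[token] for token in tokens)
    return "".join(residues)


# Rows of SEQ ID NO: 60 as they appear in the listing.
SEQ60_ROWS = [
    "Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser",
    "1 5 10 15",
    "Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln",
    "20 25 30",
    "Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe",
    "35 40 45",
    "Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu",
    "50 55 60",
    "Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val",
    "65 70 75 80",
    "Ser Leu Glu Lys Arg Glu Ala Glu Ala",
    "85",
]

seq60 = listing_to_one_letter(SEQ60_ROWS)
print(len(seq60))  # 89, matching the <211> field of SEQ ID NO: 60
print(seq60)
```

The same helper can be applied to any other PRT entry in the listing, provided the rows are copied without the `<210>` to `<400>` header fields.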
<210> 61
<211> 267
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 61
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120
tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180
aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240
tctctcgaga aaagagaggc tgaagct 267
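SEQ ID NO: 61 is 267 nucleotides long, which corresponds to 89 codons, the same length as the protein of SEQ ID NO: 60. The sketch below, which assumes the standard genetic code and a reading frame starting at the first nucleotide, translates the rows of SEQ ID NO: 61 so the result can be compared with SEQ ID NO: 60. The helper names `clean` and `translate` are illustrative and are not part of the filing.

```python
# Standard genetic code, built in the conventional T, C, A, G codon order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [x + y + z for x in BASES for y in BASES for z in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(rows):
    """Join listing rows, dropping spaces and the trailing position counts."""
    return "".join(ch for row in rows for ch in row.lower() if ch in "acgt")


def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Rows of SEQ ID NO: 61 as they appear in the listing.
SEQ61_ROWS = [
    "atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60",
    "ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120",
    "tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180",
    "aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240",
    "tctctcgaga aaagagaggc tgaagct 267",
]

dna = clean(SEQ61_ROWS)
protein = translate(dna)
print(len(dna), len(protein))  # 267 89
print(protein)
```

The printed protein begins MRFPSIFTAV and ends SLEKREAEA, which agrees with the residues listed for SEQ ID NO: 60. The longer DNA entries (SEQ ID NOs: 53-59 and 62) begin with the same 267 nucleotides and can be checked the same way; their final codon is a stop, which appears as `*`.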
<210> 62
<211> 1956
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 62
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct 60
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 120
tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat 180
aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta 240
tctctcgaga aaagagaggc tgaagctaac ttgatgcgtt taagagagaa tgattatccc 300
tggactaacg acatgctaag atggcaacgc acgggatttc acttccagcc tggtaaaaac 360
ttccaagccg acccaaatgc agctatgttt tacaagggct ggtaccattt cttttatcaa 420
tacaacccga ccggtgtggc ttgggattac acaatctcct ggggtcacgc tgtcagtaag 480
gatttgctgc attggaatta tcttccaatg gccttgaggc ctgaccactg gtacgataga 540
aaaggtgttt ggagcggtta ctctacttta ttgccagacg gtagaattgt tgtcttgtac 600
accggtggaa ctaaggaatt agttcaagtc caaaacttgg ctgtcccagt aaacctttct 660
gacccattgc tattggaatg gaagaagtca cacgttaacc caatactcgt tccacctccg 720
gggatcgaag atcatgattt ccgagatcca ttcccagtgt ggtataatga atctgactcg 780
cggtggcacg ttgtaattgg ttccaaagat ccagagcact atggtattgt cttgatctac 840
actaccaagg acttcgttaa ctttacgtta ttaccaaaca tattgcattc caccaagcag 900
ccggttggta tgctggaatg tgtagacttg ttcccagttg ctacaactga ttctcgtgca 960
aatcaagctt tggatatgac taccatgagg cccggtcctg ggctcaaata tgtgttaaag 1020
gcgagtatgg atgacgaaag acacgattac tacgccctag gtagctttga cttggactcg 1080
ttcactttta caccagatga tgaaaccatt gacgtcggta ttggtcttag atacgactgg 1140
ggcaagttct acgcgtccaa gactttttac gaccaagaaa aacaaagaag agttttgtgg 1200
ggatacgtcg gtgaagttga ctcgaagcgt gatgatgctc tgaaaggttg ggcttctttg 1260
caaaatatcc cacgtacaat cttgttcgac accaaaacca agtccaacct aattttgtgg 1320
ccagttgaag aagtcgagtc tttaagaact attaacaaga atttcaattc aatccctttg 1380
tatcctggtt ctacttacca gcttgatgtg ggtgaagcta cccaattgga tattgtggcc 1440
gagttcgaag tcgatgaaaa ggctattgaa gctactgccg aagctgatgt tacatataac 1500
tgctccacct ccggtggtgc agctaataga ggggttttgg gtccattcgg tttgttagtt 1560
ttagctaacc aagagttgtc tgaacaaact gctacttact tctatgtctc tcgcggcata 1620
gatggtaact taagaacaca tttttgtcaa gacgaactgc gatcttccaa ggctggtgcc 1680
atcactaagc gggtagttgg ttctaccgtc ccagttctac atggcgaaac ctgggccttg 1740
agaattttgg tcgatcactc aatcgtagag tcttttgcac agagaggtag agctgttgcc 1800
acgagtagag tctatcctac agaagcaatt tatagctcag ctagagtctt tctattcaac 1860
aatgccactg acgctattgt taccgctaag acagtaaacg tttggcacat caactccacc 1920
tacaatcatg tttttccggg tctggtcgct ccataa 1956
<210> 63
<211> 564
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 63
Gly Ala Arg Val Gly Leu Gly Gly Ile Tyr Asp Asp Ala Asp Ala Phe
1 5 10 15
Ala Trp Asn Asn Ser Met Leu Gln Trp Gln Arg Ala Gly Phe His Phe
20 25 30
Gln Thr Glu Lys Asn Phe Met Ser Asp Pro Asn Gly Pro Val Tyr Tyr
35 40 45
Arg Gly Tyr Tyr His Leu Phe Tyr Gln Tyr Asn Met Lys Gly Val Val
50 55 60
Trp Asp Asp Gly Ile Val Trp Gly His Val Val Ser Arg Asp Leu Val
65 70 75 80
His Trp Arg His Leu Pro Ile Ala Met Val Pro Asp His Trp Tyr Asp
85 90 95
Ser Met Gly Val Leu Ser Gly Ser Ile Thr Val Leu Gln Asn Gly Ser
100 105 110
Leu Val Met Ile Tyr Thr Gly Val Phe Ser Lys Thr Thr Asp Arg Ser
115 120 125
Gly Met Met Glu Val Gln Cys Leu Ala Val Pro Ala Asp Pro Asn Asp
130 135 140
Pro Leu Leu Arg Ser Trp Thr Lys His Pro Ala Asn Pro Val Leu Val
145 150 155 160
His Pro Pro Gly Ile Lys Asp Met Asp Phe Arg Asp Pro Thr Thr Ala
165 170 175
Trp Phe Asp Glu Ser Asp Ser Thr Tyr Arg Thr Val Ile Gly Thr Lys
180 185 190
Asp Asp His His Gly Ser His Ala Gly Phe Ala Met Val Tyr Lys Thr
195 200 205
Lys Asp Phe Leu Ser Phe Gln Arg Ile Pro Gly Ile Leu His Ser Val
210 215 220
Glu His Thr Gly Met Trp Glu Cys Met Asp Phe Tyr Pro Val Gly Gly
225 230 235 240
Gly Asp Asn Ser Ser Ser Glu Val Leu Tyr Val Ile Lys Ala Ser Met
245 250 255
Asp Asp Glu Arg His Asp Tyr Tyr Ala Leu Gly Met Tyr Asp Ala Ala
260 265 270
Ala Asn Thr Trp Thr Pro Leu Asp Gln Glu Leu Asp Leu Gly Ile Gly
275 280 285
Leu Arg Tyr Asp Trp Gly Lys Leu Tyr Ala Ser Thr Thr Phe Tyr Asp
290 295 300
Pro Ala Lys Arg Arg Arg Val Met Leu Gly Tyr Val Gly Glu Thr Asp
305 310 315 320
Ser Arg Arg Ser Asp Glu Ala Lys Gly Trp Ala Ser Ile Gln Ser Ile
325 330 335
Pro Arg Thr Val Ala Leu Asp Glu Lys Thr Arg Thr Asn Leu Leu Leu
340 345 350
Trp Pro Val Glu Glu Ile Glu Thr Leu Arg Leu Asn Ala Thr Glu Phe
355 360 365
Asn Asp Ile Asn Ile Asp Thr Gly Ser Val Phe His Leu Pro Ile Arg
370 375 380
Gln Gly Asn Gln Leu Asp Ile Glu Ala Ser Phe Arg Leu Asp Ala Ser
385 390 395 400
Ala Val Ala Ala Ile Asn Glu Ala Asp Val Gly Tyr Asn Cys Ser Ser
405 410 415
Ser Gly Gly Ala Ala Thr Arg Gly Ala Leu Gly Pro Phe Gly Leu Leu
420 425 430
Val Leu Ala Ala Glu Gly Ile Gly Glu Gln Thr Ala Val Tyr Phe Tyr
435 440 445
Val Ser Arg Gly Leu Asp Gly Gly Leu Arg Thr Ser Phe Cys Asn Asp
450 455 460
Glu Leu Arg Ser Ser Trp Ala Arg Asp Val Thr Lys Arg Val Val Gly
465 470 475 480
Ser Thr Val Pro Val Leu Asn Gly Glu Thr Leu Ser Met Arg Val Leu
485 490 495
Val Asp His Ser Ile Val Gln Ser Phe Ala Met Gly Gly Arg Val Thr
500 505 510
Ala Thr Ser Arg Val Tyr Pro Thr Glu Ala Ile Tyr Ala Ala Ala Gly
515 520 525
Val Tyr Leu Phe Asn Asn Ala Thr Asn Ala Ser Val Thr Ala Glu Arg
530 535 540
Ile Ile Val His Glu Met Asp Ser Ile Asp Asn Asn Gln Ile Phe Leu
545 550 555 560
Ile Asp Asp Leu

Claims (40)

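1. A host cell comprising one or more heterologous polynucleotides encoding:
a) a sucrose:sucrose 1-fructosyltransferase (1-SST) comprising an amino acid sequence that is at least 90% identical to SEQ ID NO: 1 or SEQ ID NO: 24;
b) a fructan:fructan 1-fructosyltransferase (1-FFT) comprising an amino acid sequence that is at least 90% identical to SEQ ID NO: 7 or SEQ ID NO: 31; and/or
c) a sucrose:fructan-6-fructosyltransferase (6-SFT) comprising an amino acid sequence that is at least 90% identical to SEQ ID NO: 13 or SEQ ID NO: 38.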
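2. The host cell of claim 1, wherein the one or more heterologous polynucleotides encode two or more of a), b), and c).
3. The host cell of claim 1, wherein the one or more heterologous polynucleotides encode a), b), and c).
4. The host cell of any one of claims 1-3, wherein the host cell is a plant cell, an algal cell, a yeast cell, a bacterial cell, or an animal cell.
5. The host cell of claim 4, wherein the host cell is a yeast cell.
6. The host cell of claim 5, wherein the yeast cell is a Saccharomyces cell, a Yarrowia cell, or a Pichia cell.
7. The host cell of claim 6, wherein the host cell is a Pichia pastoris cell.
8. The host cell of any one of claims 1-7, wherein the 1-SST enzyme comprises the amino acid sequence of SEQ ID NO: 1 or SEQ ID NO: 24.
9. The host cell of any one of claims 1-8, wherein the 1-FFT enzyme comprises the amino acid sequence of SEQ ID NO: 7 or SEQ ID NO: 31.
10. The host cell of any one of claims 1-9, wherein the 6-SFT enzyme comprises the amino acid sequence of SEQ ID NO: 13 or SEQ ID NO: 38.
11. The host cell of any one of claims 1-10, wherein one or more of the 1-SST enzyme, the 1-FFT enzyme, and the 6-SFT enzyme is secreted from the host cell.
12. The host cell of any one of claims 1-11, wherein at least two of the 1-SST enzyme, the 1-FFT enzyme, and the 6-SFT enzyme are encoded by the same heterologous polynucleotide.
13. A method comprising culturing the host cell of any one of claims 1-12.
14. The method of claim 13, further comprising purifying one or more of the 1-SST enzyme, the 1-FFT enzyme, and the 6-SFT enzyme from the host cell.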
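15. A method of producing a fructan, the method comprising contacting sucrose with one or more of:
a) a sucrose:sucrose 1-fructosyltransferase (1-SST) comprising an amino acid sequence that is at least 90% identical to SEQ ID NO: 1 or SEQ ID NO: 24;
b) a fructan:fructan 1-fructosyltransferase (1-FFT) comprising an amino acid sequence that is at least 90% identical to SEQ ID NO: 7 or SEQ ID NO: 31; and
c) a sucrose:fructan-6-fructosyltransferase (6-SFT) comprising an amino acid sequence that is at least 90% identical to SEQ ID NO: 13 or SEQ ID NO: 38.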
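16. The method of claim 15, wherein the sucrose is contacted with two or more of the 1-SST enzyme, the 1-FFT enzyme, and the 6-SFT enzyme.
17. The method of claim 15, wherein the sucrose is contacted with the 1-SST enzyme, the 1-FFT enzyme, and the 6-SFT enzyme.
18. The method of any one of claims 15-17, wherein the fructan comprises β(2,1) linkages, β(2,6) linkages, or a combination thereof.
19. The method of any one of claims 15-18, wherein the fructan is kestose, inulin, and/or graminan.
20. The method of any one of claims 15-19, wherein the fructan has a degree of polymerization of at least 3.
21. The method of any one of claims 15-20, further comprising purifying the fructan.
22. The method of any one of claims 15-21, wherein the 1-SST enzyme, the 1-FFT enzyme, and/or the 6-SFT enzyme is secreted by one or more host cells.
23. The method of claim 22, wherein the one or more host cells are cultured in a medium containing sucrose, and wherein the sucrose is contacted with the 1-SST enzyme, the 1-FFT enzyme, and/or the 6-SFT enzyme in the medium.
24. The method of claim 23, wherein the fructan is purified from the medium.
25. The method of any one of claims 15-21, wherein the 1-SST enzyme, the 1-FFT enzyme, and/or the 6-SFT enzyme is a purified enzyme.
26. The method of any one of claims 19-25, wherein the kestose is 6-kestose.
27. The method of any one of claims 19-25, wherein the kestose is 1-kestose.
28. The method of any one of claims 15-25, wherein the fructan comprises levan.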
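29. A method of producing a fructan, the method comprising:
a) contacting sucrose with a sucrose:sucrose 1-fructosyltransferase (1-SST) to produce kestose; and
b) contacting the kestose with a fructan:fructan 1-fructosyltransferase (1-FFT) and/or a sucrose:fructan-6-fructosyltransferase (6-SFT) to produce the fructan.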
30.如权利要求29所述的方法,其中a)中产生的所述蔗果三糖被纯化,并且其中纯化的蔗果三糖在b)中与1-FFT酶和/或6-SFT酶接触。
31.如权利要求29或30所述的方法,所述方法还包括纯化b)中产生的所述果聚糖。
32.如权利要求29-31中任一项所述的方法,其中1-SST酶、1-FFT酶和/或6-SFT酶由一种或更多种宿主细胞分泌。
33.如权利要求32所述的方法,其中在含有蔗糖的培养基中培养所述一种或更多种宿主细胞,并且其中所述蔗糖与所述培养基中的所述1-SST酶接触。
34.如权利要求29-31中任一项所述的方法,其中1-SST酶、1-FFT酶和/或6-SFT酶是纯化的酶。
35.如权利要求29-34中任一项所述的方法,其中b)中产生的所述果聚糖是菊粉。
36.如权利要求29-35中任一项所述的方法,其中b)中产生的所述果聚糖是支链菊粉。
37.如权利要求29-34中任一项所述的方法,其中b)中产生的所述果聚糖是革兰明糖。
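38. A host cell comprising one or more heterologous polynucleotides encoding:
a) a sucrose:sucrose 1-fructosyltransferase (1-SST) comprising an amino acid sequence that is at least 90% identical to a sequence selected from SEQ ID NOs: 1-4 and SEQ ID NOs: 24-28;
b) a fructan:fructan 1-fructosyltransferase (1-FFT) comprising an amino acid sequence that is at least 90% identical to a sequence selected from SEQ ID NOs: 7-10 and SEQ ID NOs: 31-35; and/or
c) a sucrose:fructan 6-fructosyltransferase (6-SFT) comprising an amino acid sequence that is at least 90% identical to a sequence selected from SEQ ID NOs: 13-21 and SEQ ID NOs: 38-52.
39. The host cell of claim 38, wherein at least two of the 1-SST enzyme, the 1-FFT enzyme, and the 6-SFT enzyme are encoded by the same heterologous polynucleotide.
40. A method of producing a fructan, the method comprising contacting sucrose with one or more of the following:
a) a sucrose:sucrose 1-fructosyltransferase (1-SST) comprising an amino acid sequence that is at least 90% identical to a sequence selected from SEQ ID NOs: 1-4 and SEQ ID NOs: 24-28;
b) a fructan:fructan 1-fructosyltransferase (1-FFT) comprising an amino acid sequence that is at least 90% identical to a sequence selected from SEQ ID NOs: 7-10 and SEQ ID NOs: 31-35; and
c) a sucrose:fructan 6-fructosyltransferase (6-SFT) comprising an amino acid sequence that is at least 90% identical to a sequence selected from SEQ ID NOs: 13-21 and SEQ ID NOs: 38-52.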
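
Illustrative note (not part of the claims): several claims above recite an amino acid sequence that is "at least 90% identical" to a reference sequence. The claims themselves do not state how identity is calculated, so the following is only a minimal sketch under stated assumptions: the two sequences have already been pairwise aligned by some external tool, gaps are written as `-`, and identity is counted over all alignment columns in which at least one sequence has a residue. The function names and the toy sequences are hypothetical and are not taken from the sequence listing.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two sequences taken from one pairwise alignment.

    Assumptions (not from the patent): '-' marks a gap, columns where both
    sequences have a gap are ignored, and the denominator is the number of
    remaining alignment columns. Other conventions (e.g. dividing by the
    shorter sequence length) give different values.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep every column in which at least one sequence has a residue.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("alignment has no residue columns")
    # A column counts as identical only when both residues match and neither is a gap.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 90.0) -> bool:
    """True when the computed identity is at least `threshold` percent."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Toy 10-residue sequences, purely hypothetical.
    reference = "MKTLLVAGSD"
    candidate = "MKTLLVAGSE"  # one substitution in the last column
    print(percent_identity(reference, candidate))
    print(meets_identity_threshold(reference, candidate))
```

In practice the alignment step, the scoring matrix, and the choice of denominator all affect the resulting percentage, so any real comparison would need to follow whatever definition of sequence identity the specification provides.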
CN202080065823.6A 2019-09-24 2020-09-24 Production of oligosaccharides Pending CN114423862A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201962905246P 2019-09-24 2019-09-24
US62/905,246 2019-09-24
PCT/US2020/052390 WO2021061910A1 (en) 2019-09-24 2020-09-24 Production of oligosaccharides

Publications (1)

Publication Number Publication Date
CN114423862A 2022-04-29

Family

ID=75166173

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202080065823.6A Pending CN114423862A (zh) 2019-09-24 2020-09-24 Production of oligosaccharides

Country Status (6)

Country Link
US (1) US20220372501A1 (zh)
EP (1) EP4034647A4 (zh)
JP (1) JP2022549314A (zh)
KR (1) KR20220094189A (zh)
CN (1) CN114423862A (zh)
WO (1) WO2021061910A1 (zh)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5952205A (en) * 1998-02-06 1999-09-14 Neose Technologies, Inc. Process for processing sucrose into glucose and fructose
US6664444B1 (en) * 1998-04-17 2003-12-16 Tiense Suikerraffinaderij N.V. Transgenic plants presenting a modified inulin producing profile
US20040064852A1 (en) * 2001-06-25 2004-04-01 Guy Weyens Double fructan beets
US20040073975A1 (en) * 2002-08-21 2004-04-15 Stoop Johan M. Product of novel fructose polymers in embryos of transgenic plants
WO2006137574A1 (ja) * 2005-06-22 2006-12-28 Incorporated Administrative Agency National Agriculture And Food Research Organization Cold-tolerant plant and method for developing the same

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19749122A1 (de) * 1997-11-06 1999-06-10 Max Planck Gesellschaft Nucleinsäuremoleküle codierend Enzyme, die Fructosyltransferaseaktivität besitzen
US5988177A (en) * 1998-09-08 1999-11-23 Celebrity Signatures International, Inc. Wig foundation with contoured front hairline
AU2015264827B2 (en) * 2008-09-15 2017-12-07 Agriculture Victoria Services Pty Ltd Modification of fructan biosynthesis, increasing plant biomass, and enhancing productivity of biochemical pathways in a plant (2)

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
ÁNGELA et al.: "Molecular characterization of sucrose: sucrose 1-fructosyltransferase (1-SST) from Agave tequilana Weber var. azul.", PLANT SCIENCE, vol. 173, no. 4, 31 October 2007 (2007-10-31), pages 480 *
ABE et al.: "Purification, Cloning and Functional Characterization of Fructan:Fructan 1-Fructosyltransferase from Edible Burdock (Arctium lappa L.)", J. APPL. GLYCOSCI., vol. 56, 31 December 2009 (2009-12-31), pages 241 *
IRMA VIJN et al.: "Cloning of Sucrose:Sucrose 1-Fructosyltransferase from Onion and Synthesis of Structurally Defined Fructan Molecules from Sucrose", PLANT PHYSIOL, vol. 117, no. 4, 31 December 1998 (1998-12-31), pages 4, XP002970800, DOI: 10.1104/pp.117.4.1507 *
NORBERT SPRENGER et al.: "Fructan synthesis in transgenic tobacco and chicory plants expressing barley sucrose :fructan 6-fructosyltransferase", FEBS LETTERS, vol. 400, no. 3, 6 January 1997 (1997-01-06), pages 3 *

Also Published As

Publication number Publication date
KR20220094189A (ko) 2022-07-05
EP4034647A4 (en) 2023-11-08
WO2021061910A1 (en) 2021-04-01
JP2022549314A (ja) 2022-11-24
EP4034647A1 (en) 2022-08-03
US20220372501A1 (en) 2022-11-24

Similar Documents

Publication Publication Date Title
CN112877307B (zh) Amino acid dehydrogenase mutant and application thereof
EP3680340A1 (en) Method for enzymatic preparation of r-3-aminobutyric acid
CN112831488B (zh) Glutamate decarboxylase and high-yield γ-aminobutyric acid strain
CN114207121A (zh) Methanol utilization
US20220348933A1 (en) Biosynthesis of enzymes for use in treatment of maple syrup urine disease (msud)
US20240158451A1 (en) Biosynthesis of mogrosides
US20230065419A1 (en) Enhanced production of histidine, purine pathway metabolites, and plasmid dna
CN113337495A (zh) Method for increasing sialic acid yield and application thereof
CN115335514A (zh) Biosynthesis of mogrosides
WO2023173066A1 (en) Biosynthesis of abscisic acid and abscisic acid precursors
CN114423862A (zh) Production of oligosaccharides
US11760988B2 (en) L-aspartate alpha-decarboxylase mutant and application thereof
CN113122563A (zh) Method for constructing R-3-aminobutyric acid-producing bacteria
US20240182877A1 (en) Production of vaccinia capping enzyme
CN111172143A (zh) D-xylonate dehydratase and application thereof
US20230174993A1 (en) Biosynthesis of mogrosides
CN114854717B (zh) Lipase, gene encoding same, and application thereof
EP3757209A1 (en) Enzymatic production of levan-based, prebiotic fructooligosaccharides
EP3844179A1 (en) Xylr mutant for improved xylose utilization or improved co-utilization of glucose and xylose
CN112680482A (zh) Biological method for preparing mannitol
CN116103360A (zh) Method for enzymatically preparing seleno-amino acids

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination