CN107002109A - 含氧萜烯的生产方法 - Google Patents

含氧萜烯的生产方法 Download PDF

Info

Publication number
CN107002109A
CN107002109A CN201580057470.4A CN201580057470A CN107002109A CN 107002109 A CN107002109 A CN 107002109A CN 201580057470 A CN201580057470 A CN 201580057470A CN 107002109 A CN107002109 A CN 107002109A
Authority
CN
China
Prior art keywords
leu
glu
ala
lys
val
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201580057470.4A
Other languages
English (en)
Inventor
阿吉库玛·帕拉伊尔·库马兰
林敬尧
李立伟
苏维克·高希
克里斯多夫·皮里
安东尼·库阿利
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Manus Biosynthesis Ltd By Share Ltd
Original Assignee
Manus Biosynthesis Ltd By Share Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Manus Biosynthesis Ltd By Share Ltd filed Critical Manus Biosynthesis Ltd By Share Ltd
Publication of CN107002109A publication Critical patent/CN107002109A/zh
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P7/00Preparation of oxygen-containing organic compounds
    • C12P7/24Preparation of oxygen-containing organic compounds containing a carbonyl group
    • C12P7/26Ketones
    • AHUMAN NECESSITIES
    • A01AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
    • A01NPRESERVATION OF BODIES OF HUMANS OR ANIMALS OR PLANTS OR PARTS THEREOF; BIOCIDES, e.g. AS DISINFECTANTS, AS PESTICIDES OR AS HERBICIDES; PEST REPELLANTS OR ATTRACTANTS; PLANT GROWTH REGULATORS
    • A01N27/00Biocides, pest repellants or attractants, or plant growth regulators containing hydrocarbons
    • AHUMAN NECESSITIES
    • A23FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
    • A23LFOODS, FOODSTUFFS, OR NON-ALCOHOLIC BEVERAGES, NOT COVERED BY SUBCLASSES A21D OR A23B-A23J; THEIR PREPARATION OR TREATMENT, e.g. COOKING, MODIFICATION OF NUTRITIVE QUALITIES, PHYSICAL TREATMENT; PRESERVATION OF FOODS OR FOODSTUFFS, IN GENERAL
    • A23L27/00Spices; Flavouring agents or condiments; Artificial sweetening agents; Table salts; Dietetic salt substitutes; Preparation or treatment thereof
    • A23L27/10Natural spices, flavouring agents or condiments; Extracts thereof
    • A23L27/12Natural spices, flavouring agents or condiments; Extracts thereof from fruit, e.g. essential oils
    • A23L27/13Natural spices, flavouring agents or condiments; Extracts thereof from fruit, e.g. essential oils from citrus fruits
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0071Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
    • C12N9/0073Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14) with NADH or NADPH as one donor, and incorporation of one atom of oxygen 1.14.13
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P5/00Preparation of hydrocarbons or halogenated hydrocarbons
    • C12P5/002Preparation of hydrocarbons or halogenated hydrocarbons cyclic
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y114/00Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
    • C12Y114/13Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with NADH or NADPH as one donor, and incorporation of one atom of oxygen (1.14.13)
    • C12Y114/13078Ent-kaurene oxidase (1.14.13.78)
    • AHUMAN NECESSITIES
    • A23FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
    • A23VINDEXING SCHEME RELATING TO FOODS, FOODSTUFFS OR NON-ALCOHOLIC BEVERAGES AND LACTIC OR PROPIONIC ACID BACTERIA USED IN FOODSTUFFS OR FOOD PREPARATION
    • A23V2002/00Food compositions, function of food ingredients or processes for food or foodstuffs
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/01Fusion polypeptide containing a localisation/targetting motif
    • C07K2319/03Fusion polypeptide containing a localisation/targetting motif containing a transmembrane segment

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Genetics & Genomics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Biochemistry (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Medicinal Chemistry (AREA)
  • Dentistry (AREA)
  • Agronomy & Crop Science (AREA)
  • Environmental Sciences (AREA)
  • Plant Pathology (AREA)
  • Pest Control & Pesticides (AREA)
  • Oil, Petroleum & Natural Gas (AREA)
  • Nutrition Science (AREA)
  • Food Science & Technology (AREA)
  • Polymers & Plastics (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Enzymes And Modification Thereof (AREA)

Abstract

本发明涉及生产含氧的萜类的方法。还提供了在这些方法中使用的多核苷酸、衍生酶和宿主细胞。

Description

含氧萜烯的生产方法
优先权
本申请要求2015年8月21日提交的美国临时申请号62/040,284的权益和优先权,其据此通过引用整体并入本文。
发明领域
本发明涉及含氧的倍半萜烯(例如诺卡酮)及其生产和使用方法。本发明还提供了用于含氧的倍半萜烯(例如诺卡酮)的生产的酶,以及用于识别、选择、制备和使用这些酶的方法。
发明背景
食品和饮料行业以及其他行业诸如香水、化妆品和保健行业通常使用萜烯和/或类萜产品作为风味物和芳香物。作为示例,许多倍半萜烯化合物被用在香料(例如广藿香醇)和风味行业(例如诺卡酮),并且许多是从植物中提取。然而,诸如以下因素:(i)植物原材料的可获得性和高价格;(ii)植物中相对低的萜烯含量;和(iii)为了在工业规模上生产足够量的萜烯产品的繁琐和低效的提取过程都激发了使用不依赖植物的系统的萜烯的生物合成的研究。因此,已经在开发用于将可再生资源诸如葡萄糖转化为类萜产品的技术到工程微生物上做出了努力。对比于传统方法,微生物具有无需土地以持续发展的快速生长的优势。
许多微生物使用甲基赤藓糖醇4-磷酸(MEP)途径或甲羟戊酸(MVA)途径以提供生产类萜产品所必要的中间体。这些MEP或MVA途径可以包括内源性或工程化的MEP或MVA途径,或两者都包括。类异戊二烯途径工程和优化的详细理解公开在WO 2011/060057、US2011/0189717、US 2012/107893、US 8,512,988和Ajikumar等(2010)Science 330 70-74中,这些文献中公开了包括诸如诺卡酮的倍半萜烯化合物的各种类萜化合物的生产,诺卡酮是产自朱栾倍半萜倍半萜烯底物的氧化的倍半萜烯。
诺卡酮(4,4a,5,6,7,8-六氢-6-异丙烯基-4,4a-二甲基-2(3II)-萘酮)是葡萄柚的重要风味成分,并且在商业上用于给软饮品和其他饮料提供风味,也被用于香料中。用于诺卡酮制备的常规方法是通过朱栾倍半萜的氧化(参见US 6,200,786和US 8,097,442)。原料朱栾倍半萜是昂贵的,并且因此消耗朱栾倍半萜的方法在商业上接受度不高。由于这些缺点,需要具有商业可行性和可持续的方法以制备诺卡酮和相关产品。
发明概述
本发明的一个目的是提供含氧的倍半萜烯产物的可持续生产。特别地,本发明提供了用于某些含氧的倍半萜烯的离体或体内生产的酶催化剂。在一些实施方案中,本发明提供了经工程化以用于含氧的倍半萜烯的生物合成的宿主细胞。本发明的另一个目的是提供用于合成含氧的倍半萜烯(包括在一些实施方案中功能表达)的工程化的细胞色素P450(CYP450)酶,以及大肠杆菌(E.coli)、酵母或其他宿主细胞中的还原酶对应物。本发明从而利用这种酶的独特性能进行氧化化学。
一方面,本发明提供了制备倍半萜烯的含氧产物的方法。所述方法包括使倍半萜烯与具有倍半萜烯氧化活性的甜菊(Stevia rebaudiana)贝壳杉烯氧化酶(SrKO)或其衍生物接触。令人惊奇的是,野生型SrKO酶显示对于倍半萜烯底物的活性,但是它的天然活性被理解为作用于二萜烯底物。进一步地,SrKO酶对于酮、诺卡酮)显示出包括氧化的独特活性,这需要两个氧合循环,并且生产不同的含氧的萜烯产品,包括羟基大根香叶-1(10)5-二烯(hydroxygermacra-1(10)5-diene)和依兰烷-3,9(11)二烯-10-过氧化物(murolan-3,9(11)diene-10-peroxy)。这些活性区别于其他测试的P450酶,其他测试的P450酶仅产生羟基化产物(例如β-诺特卡醇)的一种立体异构体作为主要产品和/或仅产生少量的诺卡酮。
在一些实施方案中,所述方法在离体(例如,无细胞)系统中发生。在其他实施方案中,倍半萜烯底物和SrKO或其衍生物在表达SrKO的细胞中接触,诸如细菌(例如大肠杆菌)。可以回收倍半萜烯的含氧产物,或倍半萜烯的含氧产物可以是用于进一步化学转化的底物。野生型的细胞色素P450在大肠杆菌中的功能表达具有归因于细菌平台的固有限制(诸如缺乏电子转移机制和细胞色素P450还原酶,以及由于缺乏内质网,P450酶的膜信号模块的翻译不相容性)。因此,在一些实施方案中,SrKO酶被修饰而在大肠杆菌宿主细胞中进行功能表达,例如,通过用稳定与大肠杆菌内膜的相互作用和/或降低细胞应激的短肽序列替换SrKO的N端跨膜区的一部分。
在一些实施方案中,SrKO衍生物相对于野生型SrKO具有增加朱栾倍半萜氧化酶活性(例如,增加诺卡酮的产量)的至少一个突变。例如,SrKO相对于野生型SrKO(SEQ ID NO:37)或针对大肠杆菌中的表达和活性被修饰的SrKO(例如,SEQ ID NO:38_或55)可以具有1-50个独立选自替换、缺失或插入的突变。例如,SrKO衍生物相对于SrKO(SEQ ID NO:37、38或55)可以具有1-40个突变、1-30个突变、1-20个突变或1-10个突变。在这些或其他实施方案中,SrKO衍生物可以包含与SrKO(SEQ ID NO:37、38或55)具有至少50%序列同一性、或至少60%序列同一性、或至少70%序列同一性、或至少80%序列同一性、或至少90%序列同一性的氨基酸序列,并且具有朱栾倍半萜氧化酶活性。在各种实施方案中,SrKO在离体或细菌系统(例如,大肠杆菌)中保持朱栾倍半萜氧化酶活性,或对比于野生型酶具有增加的朱栾倍半萜氧化酶活性。可以维持或提高朱栾倍半萜氧化酶活性的SrKO的各种突变列于表2和表6。因此,在各种实施方案中,SrKO可以具有选自表2和/或表6的至少2个、至少3个、至少4个、至少5个、至少6个、至少7个、至少8个、至少9个或至少10个突变。SrKO的示例衍生物,在本文中也被称为“朱栾倍半萜氧化酶”或“VO”,由SEQ ID NO:104和105表示,它们可以进一步被衍生化以改善所需活性。可以根据经验选择突变,或通过计算机评估(in silicoevaluation)选择突变,或通过两者选择突变,以增加含氧的倍半萜烯滴度。
根据本发明的方面,含氧的倍半萜烯产物可通过将倍半萜烯底物与具有朱栾倍半萜氧化活性的甜菊贝壳杉烯氧化酶(SrKO)或其衍生物接触而获得。不像其他的CYP450酶,当SrKO酶与朱栾倍半萜倍半萜烯底物一起使用时,产生可包括羟基大根香叶-1(10)5-二烯、依兰烷-3,9(11)二烯-10-过氧化物、诺特卡醇和诺卡酮的不同的含氧的萜烯产物谱。通过对比,其他的具有使朱栾倍半萜羟基化的活性的CYP450不生产大量的酮(诺卡酮),这需要两个氧化循环。参见表4和图7。
在各种实施方案中,倍半萜烯底物是(或主要的倍半萜烯底物是)朱栾倍半萜、大根香叶烯(A、B、C、D或E)、法尼烯、法尼醇、诺特卡醇、广藿香醇、杜松烯、雪松醇、蛇麻烯、长叶烯、和/或佛手柑油烯、β-衣兰烯、β-檀香醇、β-檀香烯、α-檀香烯、α-檀香醇、β-香根酮、a-香根酮、客烯醇、红没药烯、β-丁香烯(β-aryophyllene)、长叶烯;α-甜橙醛;α-红没药醇、(-)-β-可巴烯、(-)-α-可巴烯、4(Z),7(Z)-癸二烯醛(4(Z),7(Z)-ecadienal)、雪松醇、雪松烯、雪松醇、愈创木醇、(-)-6,9-愈创木二烯、布藜醇、愈创木醇、喇叭烯、喇叭茶醇、香樟烯和α-佛手柑油烯。在一些实施方案中,主要的倍半萜烯底物是朱栾倍半萜,并且主要的含氧产物是诺卡酮和/或诺特卡醇。
本发明,当体内应用时,适用于广泛的宿主细胞。在一些实施方案中,宿主细胞是微生物宿主,诸如选自大肠杆菌、枯草芽孢杆菌(Bacillus subtillus)或恶臭假单胞菌(Pseudomonas putida)的细菌;或酵母,诸如酵母属(Saccharomyces)、毕赤酵母属(Pichia)或耶氏酵母属(Yarrowia)的种类,包括酿酒酵母(Saccharomyces cerevisiae)、巴斯德毕赤酵母(Pichia pastoris)和解脂耶氏酵母(Yarrowia lipolytica)。
在一些实施方案中,宿主细胞产生异戊烯焦磷酸(IPP),IPP用作合成倍半萜烯的底物。在一些实施方案中,IPP经内源性或外源性甲基赤藓糖醇磷酸(MEP)或甲羟戊酸(MVA)途径由代谢通量生产。在一些实施方案中,倍半萜烯至少部分经MEP途径由代谢通量生产,并且其中宿主细胞具有至少一个额外拷贝的dxs、ispD、ispF和/或idi基因。
在一些实施方案中,宿主细胞表达法尼基焦磷酸合成酶(FPPS),其由IPP或DMAPP生产法尼基焦磷酸(FPP)。宿主细胞可以进一步表达异源倍半萜烯合成酶以生产所需的倍半萜烯支架。例如,在一些实施方案中,细胞表达朱栾倍半萜合成酶。已知几种朱栾倍半萜合成酶,包括葡萄(Vitis vinifera)朱栾倍半萜合成酶(VvVS)(SEQ ID NO:1)或甜橙(Citrus sinensus)朱栾倍半萜合成酶(CsVS)(SEQ ID NO:12),本发明可以采用VvVS或CsVS,或VvVS或CsVS的衍生物。本文公开了示例性的衍生物VvVS酶。在某些实施方案中,如本文所公开,倍半萜烯合成酶是选自Vv1M1(SEQ ID NO:3)、Vv2M1(SEQ ID NO:5)、Vv1M5(SEQ ID NO:7)、Vv2M5(SEQ ID NO:9)或VS2(SEQ ID NO:11)的朱栾倍半萜合成酶。
SrKO或其衍生物作用于倍半萜烯(例如,朱栾倍半萜)以生产含氧的萜烯产品。在一些实施方案中,SrKO是与细胞色素P450还原酶配偶体的融合蛋白(例如,SrCPR),允许辅因子有效再生。在其他的实施方案中,在宿主细胞中P450还原酶单独地提供(例如,到体外系统)或表达,并且在一些实施方案中,P450还原酶可以在与SrKO相同的操纵子中表达。在一些实施方案中,CPR酶单独地表达,并且在一些实施方案中,基因可以整合到宿主细胞基因组中。本文中公开了各种示例性CPR酶,并且该CPR酶可以被衍生化以改善含氧的类倍半萜烯滴度和/或改善P450效率。
在一些实施方案中,宿主细胞表达一种或多种进一步将含氧的产物导向为诺卡酮的酶,诸如一种或多种醇脱氢酶(ADH)的表达。本文公开了示例性的ADH酶。
在其他方面,本发明提供了用于制备含有含氧的倍半萜烯的产品的方法,所述方法包括将根据本文描述的方法制备并回收的含氧的倍半萜烯掺入消费品或工业产品。例如,产品可以是风味产品、芳香产品、化妆品、清洁产品、洗涤剂或肥皂或害虫防治产品。在一些实施方案中,回收的含氧产物包含诺卡酮,并且所述产品是选自饮料、口香糖、糖果或风味添加剂的风味产品。
在其他方面,本发明提供了与野生型对比具有提高的朱栾倍半萜氧化酶活性的工程化SrKO酶,以及生产本文所述的含氧的倍半萜烯并且表达所有用于从异戊烯焦磷酸(IPP)生产所需的含氧的倍半萜烯的酶组分的宿主细胞。例如,在各种实施方案中,宿主细胞表达法尼基焦磷酸合成酶、倍半萜烯合成酶和SrKO或其衍生物。IPP可以通过MEP和/或MVA途径生产,IPP对于宿主细胞来说可以是内源的,并且可以通过途径中的异源酶的表达或某些酶的复制来提高。宿主细胞包括如本文所述的各种细菌和酵母。含氧的倍半萜烯(例如,诺卡酮和/或诺特卡醇)可以从培养物中回收,和/或任选地可以用作在细胞中或离体系统中进一步化学转化的底物。
在另一方面,本发明提供了由本文所述的方法和宿主细胞生产的含倍半萜烯的油。在一些实施方案中,所述油包含羟基大根香叶-1(10)5-二烯、依兰烷-3,9(11)二烯-10-过氧化物、诺特卡醇和诺卡酮。在一些实施方案中,主要的朱栾倍半萜的含氧产物是诺卡酮和/或诺特卡醇。
在另一方面,提供了基于P45017A1(其催化雄激素的生物合成)的结构坐标的SrKO晶体模型结构(CMS)。图8A和8B中说明CMS,包括包含萜烯结合口袋(TBP)的萜烯结合口袋结构域(TBD)和结合到TBD的萜烯(例如,朱栾倍半萜)。这种SrKO晶体模型结构(CMS)促进SrKO衍生物的计算机测试(in-silico testing)。在由这个同源模型的部分辅助中,本公开说明了使用几种突变策略来识别倍半萜烯氧化活性的增加或改善,突变策略包括回归共识诱变(back-to-consensus mutagenesis)、位点饱和诱变和重组文库筛选。
本发明的其它方面和实施方案将从以下详细描述中显而易见。
附图简述
图1显示了用于朱栾倍半萜的生物合成的流程,朱栾倍半萜是根据本公开的SrKO的底物。
图2描述了用于对VvVS进行的定点突变体的倍数生产力。225个点突变中的46个与野生型WT VvVS相比,传达了朱栾倍半萜的生产力平均提高至少20%。图2显示了相对于野生型,显示出一定水平的生产力(x轴)的VvVS突变体的数量(y轴)。
图3(A和B)提供了朱栾倍半萜合成酶的氨基酸和核苷酸序列。图3A显示了葡萄野生型(WT)(VvVS)(SEQ ID NO:1和2)和衍生物Vv1M1(SEQ ID NO:3和4)、Vv2M1(SEQ ID NO:5和6)、Vv1M5(SEQ ID NO:7和8)、Vv2M5(SEQ ID NO:9和10)的氨基酸和核苷酸序列,以及衍生物VS2的氨基酸序列(SEQ ID NO:11);以及甜橙野生型(CsVS)的氨基酸序列(SEQ ID NO:12)。图3B显示了野生型VvVS和CsVS序列以及工程化的Vv2M5和VS2序列的比对。
图4(A和B)提供了对于倍半萜烯支架具有活性的各种CYP450(细胞色素P450)酶的氨基酸和核苷酸序列。图4A显示了野生型氨基酸序列以及工程化用于细菌表达的氨基酸和核苷酸序列的序列:ZzHO(分别地,SEQ ID NO:13、14和15)、BsGAO(分别地,SEQ ID NO:16、17和18)、HmPO(分别地,SEQ ID NO:19、20和21)、LsGAO(分别地,SEQ ID NO:22、23和24)、NtEAO(分别地,SEQ ID NO:25、26和27)、CpVO(分别地,SEQ ID NO:28、29和30)、AaAO(分别地,SEQ ID NO:31、32和33)、AtKO(分别地,SEQ ID NO:34、35和36)、SrKO(分别地,SEQ IDNO:37、38和39)、PpKO(分别地,SEQ ID NO:40、41和42)、BmVO(分别地,SEQ ID NO:43和SEQID NO:44)、PsVO(分别地,SEQ ID NO:45和SEQ ID NO:46)、PoLO(分别地,SEQ ID NO:47和SEQ ID NO:48)、CiVO(分别地,SEQ ID NO:49、50和51)、HaGAO(分别地,SEQ ID NO:52、53和54)。图4B显示了基于SrKO支架的工程化的朱栾倍半萜氧化酶的氨基酸序列(SEQ ID NO:55-61)。
图5A和5B描述了大肠杆菌中表达MEP、萜烯和类萜合成酶以及P450酶的构建体设计。图5A显示了上游MEP途径基因的菌株构型和具有下游途径基因的两种质粒。图5B显示了P450融合体的构建,由此P450和CPR(细胞色素P450还原酶)的N端区域被截短,并且添加到示例性前导序列(MALLLAVF--SEQ ID NO:112)(8RP),同时两者与短连接子肽融合。
图6(A-D)提供了利用序列比对的各种CPR(细胞色素P450还原酶)的氨基酸和核苷酸序列。在图6A中:甜菊(Sr)CPR(SEQ ID NO:62和63)、甜菊(Sr)CPR1(SEQ ID NO:76和77)、拟南芥(Arabidopsis thaliana)(At)CPR(SEQ ID NO:64和65)、红豆杉(Taxus cuspidata)(Tc)CPR(SEQ ID NO:66和67)、青蒿(Artemisia annua)(Aa)CPR(SEQ ID NO:68和69)、拟南芥(At)CPR1(SEQ ID NO:70和71)、拟南芥(At)CPR2(SEQ ID NO:72和73)、拟南芥(At)R2(SEQ ID NO:74和75);甜菊(Sr)CPR2(SEQ ID NO:78和79);甜菊(Sr)CPR3(SEQ ID NO:80和81);香叶天竺葵(Pelargonium graveolens)(Pg)CPR(SEQ ID NO:82和83)。图6B显示了拟南芥和青蒿CPR序列的氨基酸序列(SEQ ID NO:72、74、68、64和70)的比对。图6C显示了甜菊CPR序列(SEQ ID NO:78、80、62和76)的比对。图6D显示了8个CPR氨基酸序列(SEQ ID NO:74、72、82、68、80、62、78和76)的比对。
图7提供了GC色谱图,其显示了各种CYP450酶的不同活性,如实施例2中所述,与CPR配偶体表达在生产朱栾倍半萜的大肠杆菌中。将菌株培养4天,并用甲基叔丁基醚(MTBE)提取。通过GC-MS注射1μl的MTBE,并通过与MS文库比较来监测产物谱。从顶部到底部:紫杉属5-α羟化酶、菊苣(Cichorium intybus)(CiVO)P450(SEQ ID NO:50)、莨菪(Hyoscyamus muticus)(HmPO)P450(SEQ ID NO:20)和SrKO(SEQ ID NO:38)。
图8A和8B说明了SrKO的同源性模型及其活性位点。SrKO同源模型基于已知的突变体P45017A1(如在DeVore NM和Scott EE(Nature,482,116-119,2012)中公开的膜结合细胞色素P450 17A1的晶体结构,其催化了人中雄激素的生物合成。血红素的位置以条棒显示。图8B描述了SrKO活性位点的结构模型,朱栾倍半萜以其α结合模式停靠。显示了二级结构基序(B-C环和I-螺旋)和靶向诱变的氨基酸。
图9显示了优化朱栾倍半萜氧化酶(VO)N端膜锚定。选择大肠杆菌yhcB的N端作为膜锚定序列,其提供单次跨膜螺旋。筛选锚定的长度(20-24个氨基酸)和VO N端截短长度(28-32个氨基酸)以改善氧合滴度。
图10显示了29的截短长度和基于大肠杆菌yhcB的20个氨基酸的N端锚定导致总氧合滴度相比于对照的平均值增加1.2倍。
图11说明了在宿主细胞中表达的示例性下游途径,该途径用于将法尼基二磷酸转化为诺卡酮。法尼基二磷酸(通过表达的法尼基焦磷酸合成酶由IPP/DMAPP产生)通过朱栾倍半萜合成酶(VS)的作用转化为朱栾倍半萜,朱栾倍半萜通过朱栾倍半萜氧化酶(VO)诸如SrKO或本文所述的工程化衍生物来氧化。VO辅因子通过细胞色素P450还原酶(CPR)再生。通过VO氧化的产物可以包括诺特卡醇和诺卡酮,可通过醇脱氢酶(ADH)的作用进一步被导向为诺卡酮。
图12显示了表达VO1-L-SrCPR的菌株的氧合谱。氧合谱包括β-诺特卡醇和α-诺特卡醇的单一氧合产物以及两步氧合产物诺卡酮。
图13(A和B)显示了在野生型SrKO中使用回归共识策略识别的突变,翻译成工程化的朱栾倍半萜氧化酶背景(n22yhcB_t30VO1)的评估。超过50%的突变导致总氧合滴度提高1.2-1.45倍。图(A)显示了以mg/L计的滴度。图(B)显示了氧合滴度的倍数变化。
图14显示了回归共识突变、N端锚定优化和位点饱和诱变(SSM)的二级筛选的结果。识别出显示氧合滴度改善1.1-1.4倍的几个突变。
图15显示了在33℃下选择VO1变体的性能。识别出在33℃下保持改善的生产力的六个突变。
图16(A和B)显示了来自重组文库的初步筛选的结果。几个变体(所示的)在氧合产物滴度上表现出高达1.35倍的改善。在谱中有转变为更多的(+)-诺卡酮和更高的氧合能力用于选择变体。图(A)显示了以mg/L计的氧合产物。图(B)绘制了氧合能力的倍数变化(诺特卡醇仅需要来自朱栾倍半萜的一个氧合循环,而诺卡酮需要两个氧合循环)。
图17显示了在34℃和37℃下对于选择的VO重组文库变体的氧合能力。
图18显示了在重新筛选前导VO变体后在34℃和37℃下的氧合滴度。C6(1)(R76K、M94V、T131Q、I390L、T468I)在37℃下具有最高的氧合能力,并被命名为VO2。
图19显示了用于提高朱栾倍半萜氧化酶活性(30℃)的细胞色素P450还原酶(CPR)直系同源物的筛选。SrCPR3显示了增加的氧合滴度和更高的诺卡酮产量。
图20显示了在34℃下的CPR直系同源物的筛选。SrCPR3和AaCPR表现出约1.3倍的氧合滴度的改善,即使在更高的温度下。
图21显示了在醇脱氢酶的作用下诺特卡醇向诺卡酮的转化。识别了四种ADH直系同源物(vvDH、csABA2、bdDH和zzSDR),它们将诺特卡醇转化为(+)-诺卡酮,导致(+)-诺卡酮滴度增加3倍。
图22(A和B)描述了醇脱氢酶。图22A显示了氨基酸和核苷酸序列,包括红串红球菌(Rhodococcus erythropolis)(Re)CDH(SEQ ID NO:84和85)、甜橙(Cs)DH(SEQ ID NO:86和87)、甜橙(Cs)DH1(SEQ ID NO:88和89)、甜橙(Cs)DH2(SEQ ID NO:90和91)、甜橙(Cs)DH3(SEQ ID NO:92和93)、葡萄(Vv)DH(SEQ ID NO:94和95)、葡萄(Vv)DH1(SEQ ID NO:96和97、甜橙(Cs)ABA2(SEQ ID NO:98和99)、短柄草(Brachypodium distachyon)(Bd)DH(SEQ IDNO:100和101)、红球姜(Zingiber zerumbet)(Zz)SDR(SEQ ID NO:102和103)的那些氨基酸和核苷酸序列。图22B显示了氨基酸序列的比对。
图23(A和B)显示了几种工程化的朱栾倍半萜氧化酶(VO)变体的比对。在图23A中:8rp-t20SrKO(SEQ ID NO:106)是在N端具有20个氨基酸截短,并且添加了8个氨基酸的膜锚定的SrKO序列。8rp-t20VO0(SEQ ID NO:107)具有SrKO的N端的20个氨基酸的截短、8个氨基酸的N端锚定的添加和499位的单突变(根据野生型SrKO编号)。n22yhcB-t30VO1(SEQ IDNO:104)具有SrKO的N端的30个氨基酸的截短、基于来自大肠杆菌yhcB的22个氨基酸的膜锚定、以及在位置46、231、284、383、400、444、488和499的8个点突变(相对于SrKO野生型)。n22yhcB-t30VO2(SEQ ID NO:105)具有SrKO的N端的30个氨基酸的截短、基于来自大肠杆菌yhcB的22个氨基酸的膜锚定、以及在位置76、94、131、231、284、383、390、468和499的9个点突变(相对于SrKO野生型)。图23B,显示了针对野生型SrKO(SEQ ID NO:108)的VO0(SEQ IDNO:109)、VO1(SEQ ID NO:110)和VO2(SEQ ID NO:111)中的点突变(为了方便,所有都用野生型SrKO的N端显示)。
发明详述
本发明在各个方面提供了在离体或在细胞系统中用于制备含氧的萜烯或类萜的方法。本发明进一步提供了用于这些方法的工程化的或修饰的酶、多核苷酸和宿主细胞。本发明在各种实施方案中涉及使用SrKO酶生产诺卡酮的方法。令人惊奇的是,发现SrKO酶可用于催化倍半萜烯氧化(例如,朱栾倍半萜氧化成诺特卡醇和诺卡酮)。
如本文所用,SrKO是指具有登录号AAQ63464.1(SEQ ID NO:37)的对映-贝壳杉烯氧化酶CYP701A5[甜菊]。SrKO和其对双萜(尤其是贝壳杉烯)的活性是已知的并且描述于,例如,US2012/0164678,其据此通过引用整体并入本文。它是细胞色素p450酶(CYP450)的CYP70家族的成员。经修饰以在大肠杆菌中表达的SrKO序列显示为SEQ ID NO:38。如本文所示,SrKO对于倍半萜烯底物(例如,朱栾倍半萜)有活性,生产诺特卡醇和诺卡酮,它们是有价值的类萜化合物。这些氧合活性和产物谱(例如,增加的诺卡酮的产量)可以通过使用本文详细描述的工艺(和通过计算机模型(in silico models)辅助)诱变SrKO来进一步细化。
如本文所用,术语“SrKO衍生物”或“工程化的SrKO”是指与SrKO具有实质性结构和/或序列同一性并催化倍半萜烯支架(诸如,朱栾倍半萜)氧合的氨基酸序列。经工程化用于朱栾倍半萜的氧合的SrKO酶在本文中也称为“朱栾倍半萜氧化酶”或“VO”酶。通常,衍生物包含具有至少一个突变的SrKO的突变形式,该突变增加用于朱栾倍半萜底物或用于诺卡酮和/或其它产物的生产的酶的活性。在表2中提供了一些SrKO突变。一些这样的另外的SrKO突变提供在表6中。
术语“接触”是指体内通过相关蛋白产物(例如,倍半萜烯合成酶和CYP450)在宿主细胞中的共表达,或通过向表达SrKO或其衍生物的宿主细胞添加或供给感兴趣的底物,或体外(或“离体”)通过将倍半萜烯底物添加到纯化的P450酶或细胞提取物或含有其的部分纯化的提取物中,来将组分物理地聚集在一起。术语体外和离体是指无细胞系统,并且可以在反应管或孔中进行。
如本文所用,“萜烯”是具有简单统一特征的大型且不同类别的烃,尽管它们具有结构多样性。根据“异戊二烯规则”,所有萜烯由异戊二烯(C5)单元组成。这个事实被用于根据这种单元的数量的合理分类。单萜包含2个异戊二烯单元并被分类为(C10)萜烯,倍半萜烯包含3个异戊二烯单元并被分类为(C15)萜烯,二萜包含4个异戊二烯单元并被分类为(C20)萜烯、二倍半萜(C25)、三萜(C30)和橡胶(C5)n。它们作为具有醇、醚、酯、醛或酮基团的无环或单环至五环的衍生物(所谓的“类萜化合物”)在生物体中,特别是在高等植物中各处存在,并且是个别类型的植物的特征。萜烯诸如单萜(C10)、倍半萜烯(C15)和二萜(C20),通过称为萜烯(类萜化合物)合成酶的非常大的一组酶的作用,分别源自异戊基二磷酸底物、香叶基焦磷酸(GPP)、法尼基焦磷酸(FPP)和香叶基香叶基焦磷酸(GGPP)。这些酶通常被称为萜烯环化酶,因为反应的产物被环化成各种单萜、倍半萜烯和二萜碳骨架产物。许多所得碳骨架通过细胞色素p450水解酶进行亚序列氧合,以产生衍生物的大家族。畅销的风味物和芳香物的技术合成可以从萜烯开始,萜烯也可以用作染料和清漆的优异的溶剂或稀释剂。使用萜烯的天然的或合成的树脂,并且维生素和杀虫剂的许多药物合成也从萜烯开始。如本文所用,术语“萜烯”或“倍半萜烯”(例如)包括相应的类萜或类倍半萜烯化合物。
如本文所用,术语“含氧的倍半萜烯”是指具有一个或多个氧合事件,产生相应的醇、醛、羧酸和/或酮的倍半萜烯支架。
如本文所用,术语“MEP途径”是指(2-C-甲基-D-赤藓醇4-磷酸)途径,也称为MEP/DOXP(2-C-甲基-D-赤藓醇4-磷酸/l-脱氧-D-木酮糖5-磷酸)途径或非甲羟戊酸途径或甲羟戊酸非依赖途径。在MEP途径中,丙酮酸和D-甘油醛-3-磷酸通过一系列反应转化为IPP和DMAPP。该途径通常涉及以下酶的作用:1-脱氧-D-木酮糖-5-磷酸合成酶(Dxs)、1-脱氧-D-木酮糖-5-磷酸还原异构酶(IspC)、4-二磷酸胞苷-2-C-甲基-D-赤藓醇合成酶(IspD)、4-二磷酸胞苷-2-C-甲基-D-赤藓醇激酶(IspE)、2C-甲基-D-赤藓醇2,4-环二磷酸合成酶(IspF)、1-羟基-2-甲基-2-(E)-丁烯基4-二磷酸合成酶(IspG)和异戊烯基二磷酸异构酶(IspH)。MEP途径以及构成MEP途径的基因和酶描述于US 8,512,988中,其据此通过引用整体并入本文。例如,构成MEP途径的基因包括dxs、ispC、ispD、ispE、ispF、ispG、ispH、idi和ispA。
如本文所用,MVA途径是指将乙酰辅酶A转化为IPP的生物合成途径。甲羟戊酸途径通常包括催化以下步骤的酶:(a)将两分子的乙酰辅酶A缩合成乙酰乙酰辅酶A(例如,通过乙酰乙酰辅酶A硫解酶的作用);(b)将乙酰乙酰辅酶A与乙酰辅酶A缩合以形成羟甲基戊二酰辅酶A(HMG-CoA)(例如,通过HMG-CoA合成酶(HMGS)的作用);(c)将HMG-CoA转化为甲羟戊酸(例如,通过HMG-CoA还原酶(HMGR)的作用);(d)将甲羟戊酸磷酸化成甲羟戊酸5-磷酸(例如,通过甲羟戊酸激酶(MK)的作用);(e)将甲羟戊酸5-磷酸转化为甲羟戊酸5-焦磷酸(例如,通过磷酸甲羟戊酸激酶(PMK)的作用);和(f)将甲羟戊酸5-焦磷酸转化成异戊烯焦磷酸(例如,通过甲羟戊酸焦磷酸脱羧酶(MPD)的作用)。MVA途径以及构成MEP途径的基因和酶描述于US 7,667,017中,其据此通过引用整体并入本文。
如本文所用,术语“细胞色素P450还原酶配偶体”或“CPR配偶体”是指能够再生用于氧化化学的感兴趣的细胞色素P450氧化酶(例如,SrKO)的辅因子组分的细胞色素P450还原酶。例如,SrCPR是SrKO的天然的CPR配偶体。在一些实施方案中,CPR配偶体不是SrKO的天然的CPR配偶体。在采用体内生产含氧的倍半萜烯的一些实施方案中,SrKO和SrCPR共表达为分离蛋白,或在一些实施方案中表达为融合蛋白。
核苷酸和氨基酸序列的相似性,即序列同一性的百分比,可以通过序列比对来确定。这样的比对可以用几种本领域已知的算法进行,诸如用Karlin和Altschul的数学算法(Karlin&Altschul(1993)Proc.Natl.Acad.Sci.USA 90:5873-5877)、用hmmalign(HMMERpackage,http://hmmer.wustl.edu/)或用CLUSTAL算法(Thompson,J.D.,Higgins,D.G.&Gibson,T.J.(1994)Nucleic Acids Res.22,4673-80)。序列同一性的等级(序列匹配)可以使用例如BLAST、BLAT或BlastZ(或BlastX)来计算。将类似的算法并入Altschul等人(1990)J.Mol.Biol.215:403-410的BLASTN和BLASTP程序中。BLAST多核苷酸搜索可以用BLASTN程序进行,得分=100,字长=12。
BLAST蛋白搜索可以用BLASTP程序进行,得分=50,字长=3。为了获得用于比较目的的空位比对,使用Gapped BLAST,如Altschul等(1997)Nucleic Acids Res.25:3389-3402中所描述。当使用BLAST和Gapped BLAST程序时,使用相应程序的默认参数。序列匹配分析可以通过已建立的同源性作图技术如Shuffle-LAGAN(Brudno M.,Bioinformatics2003b,19增刊1:154-162)或Markov随机场来补充。
“保守取代”可以例如基于所涉及的氨基酸残基的极性、电荷、大小、溶解度、疏水性、亲水性和/或两亲性中的相似性进行。20种天然存在的氨基酸可以分为以下六种标准氨基酸组:
(1)疏水性:Met、Ala、Val、Leu、Ile;
(2)中性亲水性:Cys、Ser、Thr;Asn、Gin;
(3)酸性:Asp、Glu;
(4)碱性:His、Lys、Arg;
(5)影响链取向的残基:Gly、Pro;和
(6)芳香性:Trp、Tyr、Phe。
如本文所用,“保守取代”定义为氨基酸与上文所示的六种标准氨基酸组的相同组内所列的另一氨基酸的交换。例如,Asp被Glu交换在如此修饰的多肽中保留一个负电荷。此外,甘氨酸和脯氨酸可以基于它们破坏α-螺旋的能力彼此替换。上述六组内的一些优选的保守取代是以下亚组内的交换:(i)Ala、Val、Leu和He;(ii)Ser和Thr;(ii)Asn和Gin;(iv)Lys和Arg;和(v)Tyr和Phe。
如本文所用,“非保守性取代”或“非保守性氨基酸交换”定义为氨基酸被上述六个标准氨基酸组(1)至(6)的不同组中所列的另一氨基酸交换。
一方面,本发明提供了用于制备倍半萜烯的含氧产物的方法。在各种实施方案中,倍半萜烯底物是(或主要的倍半萜烯底物是)朱栾倍半萜、大根香叶烯(A、B、C、D或E)、法尼烯、法尼醇、诺特卡醇、广藿香醇、杜松烯、雪松醇、蛇麻烯、长叶烯和/或佛手柑油烯、β-衣兰烯、β-檀香醇、β-檀香烯、α-檀香烯、α-檀香醇、β-香根酮、a-香根酮、客烯醇、红没药烯、β-丁香烯、长叶烯、α-甜橙醛;α-红没药醇、(-)-β-可巴烯、(-)-α-可巴烯、4(Z),7(Z)-癸二烯醛、雪松醇、雪松烯、雪松醇、愈创木醇、(-)-6,9-愈创木二烯、布藜醇、愈创木醇、喇叭烯、喇叭茶醇、香樟烯和α-佛手柑油烯。在一些实施方案中,主要的倍半萜烯底物是朱栾倍半萜,并且主要的含氧产物是诺卡酮和/或诺特卡醇。在上下文中,术语“主要的”是指特定的倍半萜烯单独地以高于所有其它萜烯或类萜物质的水平存在。在一些实施方案中,主要的倍半萜烯(底物或反应后的含氧的产物)构成组合物的至少25%、至少40%、至少50%或至少75%的萜烯或类萜组分。在涉及含氧的倍半萜烯的体内生产的各种实施方案中,从培养基中回收氧合产物,并且可以分级以分离或富集产物的各种组分,诸如诺卡酮。在一些实施方案中,诺卡酮被分离和/或富集,使得其组成倍半萜烯组分的至少约25%、至少约40%、至少约50%、至少约60%、至少约70%、至少约80%或至少约90%(以重量计),或构成诺特卡醇和诺卡酮的总量的至少约25%、至少约40%、至少约50%、至少约60%、至少约70%、至少约80%、至少约90%或约100%(以重量计)。
在各种实施方案中,本发明包括使倍半萜烯与萜烯氧化性P450酶或其衍生物接触。接触可以在宿主细胞或无细胞系统中进行。用于氧化的底物(例如,倍半萜烯)可以由细胞产生(例如,通过经MEP或MVA途径的代谢通量),或供给到表达P450酶的宿主细胞。含氧的产物可以回收,或作为在细胞系统或无细胞系统中进一步化学转化的底物。下表1提供了示例性P450酶的列表。虽然在某些实施方案中,本发明涉及使用以下P450酶(任选工程化以增加朱栾倍半萜氧合为诺卡酮和/或诺特卡醇),但根据本公开优选的酶是SrKO。通过根据本公开的这些反应获得的示例性含氧的倍半萜烯产物显示在表4中。
表1
在各种实施方案中,方法包括使倍半萜烯与包含甜菊贝壳杉烯氧化酶(SrKO)或其衍生物的蛋白质接触。在一些实施方案中,SrKO在如下所述的宿主细胞中表达,或在无细胞系统中提供。例如,用于用P450酶氧化萜烯的某些体外和体内系统公开于据此通过引用并入本文的US 7,211,420中。McDougle DR,Palaria A,Magnetta E,Meling DD,DasA.Functional Studies of N-terminally modified CYP2J2epoxygenase in Model Lipid Bilayers,Protein Sci.201322:964-79;Luthra,A.,Gregory,M.,Grinkova,Y.V.,Denisov,I.G.,Sligar,S.G.(2013)"Nanodiscs in the studies of membrane-boundcytochrome P450 enzymes."Methods Mol.Biol.,987,115-127)。
在一些实施方案中,SrKO衍生物包含相对于SrKO(SEQ ID NO:37)或相对于在N端修饰用于大肠杆菌中功能性表达的SrKO酶(SEQ ID NO:38或55)具有约1至约50个独立选自替换、缺失或插入的突变的氨基酸序列。在各种实施方案中,突变或突变的组合增强用于朱栾倍半萜的氧合的酶的活性,诸如诺卡酮的产量。本文所述的蛋白建模可用于指导SrKO序列中的这种替换、缺失或插入。例如,可以使用P45017A1的坐标建立SrKO氨基酸序列的结构模型。如本文所证明的,这种同源模型可用于指导用于朱栾倍半萜氧合的SrKO的改善。因此,在各种实施方案中,SrKO衍生物相对于SrKO(SEQ ID NO:37、38或55)可具有约1至约45个突变、约1至约40个突变、约1至约35个突变、约1至约30个突变、约1至约25个突变、约1至约20个突变、约1至约15个突变、约1至约10个突变或约1至约5个突变。在各种实施方案中,SrKO包含相对于SEQ ID NO:37、38或55具有至少5或至少10个突变但不超过约20或30个突变的序列。在各种实施方案中,SrKO衍生物相对于SrKO(SEQ ID NO:37、-38或55)可具有约1个突变、约2个突变、约3个突变、约4个突变、约5个突变、约6个突变、约7个突变、约8个突变、约9个突变、约10个突变、约11个突变、约12个突变、约13个突变、约14个突变、约15个突变、约16个突变、约17个突变、约18个突变、约19个突变、约20个突变、约21个突变、约22个突变、约23个突变、约24个突变、约25个突变、约26个突变、约27个突变、约28个突变、约29个突变、约30个突变、约31个突变、约32个突变、约33个突变、约34个突变、约35个突变、约36个突变、约37个突变、约38个突变、约39个突变、约40个突变、约41个突变、约42个突变、约43个突变、约44个突变、约45个突变、约46个突变、约47个突变、约48个突变、约49个突变或约50个突变。SEQ ID NO:37和本文公开的其它WT酶可以任选地包含野生型中不存在的位置2的Ala。
在这些或其他实施方案中,SrKO衍生物可以包含与SrKO(SEQ ID NO:37、38或55)具有至少约50%序列同一性、至少约55%序列同一性、至少约60%序列同一性、至少约65%序列同一性、至少约70%序列同一性、至少约75%序列同一性、至少约80%序列同一性、至少约85%序列同一性、或至少90%序列同一性、或至少91%序列同一性、或至少92%序列同一性、或至少93%序列同一性、或至少94%序列同一性、或至少95%序列同一性、或至少96%序列同一性、或至少97%序列同一性、或至少98%同一性、或至少99%序列同一性的氨基酸序列。在各种实施方案中,SrKO衍生物对于朱栾倍半萜的氧合活性比野生型酶高,诸如在与朱栾倍半萜底物接触时比野生型酶(SEQ ID NO:37)或经修饰用于大肠杆菌中的功能性表达的野生型酶有更高产量的含氧的油。例如,SrKO衍生物可以包含与SrKO(SEQ ID NO:37、38或55)具有至少:约50%同一性、约51%同一性、约52%同一性、约53%同一性、约54%同一性、约55%同一性、约56%同一性、约57%同一性、约58%同一性、约59%同一性、约60%同一性、约61%同一性、约62%同一性、约63%同一性、约64%同一性、约65%同一性、约66%同一性、约67%同一性、约68%同一性、约69%同一性、约70%同一性、约71%同一性、约72%同一性、约73%同一性、约74%同一性、约75%同一性、约76%同一性、约77%同一性、约78%同一性、约79%同一性、约80%同一性、约81%同一性、约82%同一性、约83%同一性、约84%同一性、约85%同一性、约86%同一性、约87%同一性、约88%同一性、约89%同一性、约90%同一性、约91%序列同一性、约92%序列同一性、约93%序列同一性、约94%序列同一性、约95%序列同一性、约96%序列同一性、约97%序列同一性、约98%序列同一性或约99%序列同一性的氨基酸序列。
在一些实施方案中,选择突变体用于增加含氧的朱栾倍半萜(诸如诺卡酮)的产量。例如,相对于SEQ ID NO:37,SrKO衍生物可以在选自46、76、94、131、231、284、383、390、400、444、468、488和499的位置具有一个、两个、三个、四个或更多个突变。例如,在一些实施方案中,SrKO是包含相对于SEQ ID NO:37具有一个或多个(例如,2、3、4个或全部)选自R76K、M94V、T131Q、F231L、H284Q、R383K、I390L、T468I和T499N的突变的氨基酸序列的衍生物。在一些实施方案中,SrKO衍生物包含选自SEQ ID NO:55-61、104或105的氨基酸序列,其根据本公开被工程化以改善用于朱栾倍半萜氧合的活性(例如,诺卡酮的产量)。在一些实施方案中,衍生物包含相对于选自SEQ ID NO:55-61、104和105的序列具有1至20、或1至10、或1至5个突变的氨基酸序列,附带条件是氨基酸序列相对于SEQ ID NO:37在选自46、76、94、131、231、284、383、390、400、444、468、488和499的位置具有一个或多个突变,或附带条件是SrKO衍生物包含相对于SEQ ID NO:37具有选自R76K、M94V、T131Q、F231L、H284Q、R383K、I390L、T468I和T499N的1个、2个、3个或更多个(或全部)突变的氨基酸序列。
在一些实施方案中,本发明提供了编码上述SrKO衍生物的重组多核苷酸,其可以被插入用于表达和任选纯化的表达载体中。在一些实施方案中,将多核苷酸并入产生朱栾倍半萜的细胞(诸如产生朱栾倍半萜的大肠杆菌细胞)的基因组中。
各种实施方案中的SrKO或衍生物具有朱栾倍半萜氧化酶活性。用于测定和定量朱栾倍半萜氧化酶活性的检测在本文中被描述并且是本领域已知的。检测包括在产生朱栾倍半萜的细胞(例如,表达FPPS和朱栾倍半萜合成酶的大肠杆菌)中表达SrKO(或衍生物),以及从水性反应介质中提取氧化的油。类萜产物的谱可以通过GC/MS定量测定。测试对朱栾倍半萜氧化酶活性的影响的SrKO的各种突变列于表2或表6中。因此,在各种实施方案中,SrKO可以具有至少约1个、至少约2个、至少约3个、至少约4个、至少约5个、至少约6个、至少约7个、至少约8个、至少约9个、或至少约10个选自表2或表6的突变。在一些实施方案中,SrKO衍生物是修饰的SrKO多肽,其包含与根据SEQ ID NO:37的野生型蛋白相比具有多至25个突变的氨基酸序列(或经修饰用于在大肠杆菌中表达的其对应物,并且包括至少与V375F、V375A、V375M、M120L、M120I、M120V、F129L、F129I、L114V、L114F和V121A(根据SEQ ID NO:38编号)中的至少任一个或多个组合的替换I310V、V375I或T487N,并任选地包含支持大肠杆菌中功能性表达的前导序列(如SEQ ID NO:38所示)。
表2.1根据野生型SrKO(SEQ ID NO:37)和8rp-t20SrKO(SEQ ID NO:38)编号的一些被测试的甜菊贝壳杉烯氧化酶突变的概述。
表2.2:在VO1背景(n22-yhcB-t30-VO1,SEQ ID NO:110)中评估以下突变体。位置维持SEQ ID NO:37的编号。
SrKO可在各种宿主细胞中表达,用于重组蛋白生产或用于倍半萜烯(例如,朱栾倍半萜)氧化。例如,宿主细胞包括US 8,512,988中描述的那些,其据此通过引用整体并入本文。宿主细胞可以是原核或真核细胞。在一些实施方案中,细胞是细菌细胞,诸如埃希氏菌属(Escherichia spp.)、链霉菌属(Streptomyces spp.)、发酵单胞菌属(Zymonas spp.)、醋酸杆菌属(Acetobacter spp.)、枸橼酸杆菌属(Citrobacter spp.)、集胞藻属(Synechocystis spp.)、根瘤菌属(Rhizobium spp.)、梭菌属(Clostridium spp.)、棒状杆菌属(Cory neb acterium spp.)、链球菌属(Streptococcus spp.)、黄单胞菌属(Xanthomonas spp.)、乳杆菌属(Lactobacillus spp.)、乳球菌属(Lactococcus spp.)、芽孢杆菌属(Bacillus spp.)、产碱杆菌属(Alcaligenes spp.)、假单胞菌属(Pseudomonasspp.)、气单胞菌属(Aeromonas spp.)、固氮菌属(Azotobacter spp.)、丛毛单胞菌属(Comamonas spp.)、分支杆菌属(Mycobacterium spp.)、红球菌属(Rhodococcus spp.)、葡萄糖酸菌属(Gluconobacter spp.)、青枯菌属(Ralstonia spp.)、嗜酸硫杆菌属(Acidithiobacillus spp.)、小月菌属(Microlunatus spp.)、地杆菌属(Geobacterspp.)、地芽孢杆菌属(Geobacillus spp.)、节杆菌属(Arthrobacter spp.)、黄杆菌属(Flavobacterium spp.)、沙雷氏菌属(Serratia spp.)、糖多孢菌属(Saccharopolysporaspp.)、栖热菌属(Thermus spp.)、寡养单胞菌属(Stenotrophomonas spp.)、色杆菌属(Chromobacterium spp.)、中华根瘤菌属(Sinorhizobium spp.)、糖多孢菌属(Saccharopolyspora spp.)、土壤杆菌属(Agrobacterium spp.)和泛菌属(Pantoeaspp.)。细菌细胞可以是革兰氏阴性细胞诸如大肠杆菌(E.coli)细胞、或革兰氏阳性细胞诸如芽孢杆菌属的种类。在其他实施方案中,细胞是真菌细胞,诸如酵母细胞,诸如,例如,酵母属、裂殖酵母属(Schizosaccharomyces spp.)、毕赤酵母属、法夫酵母属(Paffia spp.)、克鲁维酵母属(Kluyveromyces spp.)、念珠菌属(Candida spp.)、踝节菌属(Talaromycesspp.)、酒香酵母属(Brettanomyces spp.)、管囊酵母属(Pachysolen spp.)、德巴利酵母属(Debaryomyces spp.)、耶氏酵母属和工业多倍体酵母菌株。在一个实施方案中,宿主细胞是选自大肠杆菌、枯草芽孢杆菌或恶臭假单胞菌中的细菌。在一个实施方案中,宿主细胞是酵母,并且可以是酵母属、毕赤酵母属或耶氏酵母属的种类,包括酿酒酵母、巴斯德毕赤酵母和解脂耶氏酵母。
在一些实施方案中,宿主细胞产生异戊基焦磷酸(IPP),其用作合成倍半萜烯的底物。在一些实施方案中,通过内源或异源的甲基赤藓糖醇磷酸(MEP)或甲羟戊酸(MVA)途径的代谢通量(例如,用供给至细胞的碳源开始)产生IPP。在某些实施方案中,可以通过异源酶的表达或途径中某些酶的复制来增强MEP或MVA途径。
MEP(2-C-甲基-D-赤藓醇4-磷酸)途径,也称为MEP/DOXP(2-C-甲基-D-赤藓醇4-磷酸/1-脱氧-D-木酮糖5-磷酸)途径或非甲羟戊酸途径或甲羟戊酸非依赖性途径,是指将甘油醛-3-磷酸和丙酮酸转化为IPP和DMAPP的途径。该途径通常涉及以下酶的作用:1-脱氧-D-木酮糖-5-磷酸合成酶(Dxs)、1-脱氧-D-木酮糖-5-磷酸还原异构酶(IspC)、4-二磷酸胞苷-2-C-甲基-D-赤藓糖醇合成酶(IspD)、4-二磷酸胞苷-2-C-甲基-D-赤藓糖醇激酶(IspE)、2C-甲基-D-赤藓醇2,4-环二磷酸合成酶(IspF)、1-羟基-2-甲基-2-(E)-丁烯基4-二磷酸合成酶(IspG)和异戊烯基二磷酸异构酶(IspH)。MEP途径以及构成MEP途径的基因和酶描述于US 8,512,988中,其据此通过引用整体并入本文。例如,构成MEP途径的基因包括dxs、ispC、ispD、ispE、ispF、ispG、ispH、idi和ispA。在一些实施方案中,倍半萜烯至少部分地通过经MEP途径的代谢通量产生,并且其中宿主细胞具有dxs、ispD、ispF和/或idi基因的至少一个另外的拷贝(例如,dxs和idi;或dxs、ispD、ispF和/或idi)。
MVA途径是指将乙酰辅酶A转化为IPP的生物合成途径。甲羟戊酸途径通常包括催化以下步骤的酶:(a)将两分子乙酰辅酶A缩合成乙酰乙酰辅酶A(例如,通过乙酰乙酰辅酶A硫解酶的作用);(b)将乙酰乙酰辅酶A与乙酰辅酶A缩合以形成羟甲基戊二酰辅酶A(HMG-CoA)(例如,通过HMG-CoA合成酶(HMGS)的作用);(c)将HMG-CoA转化为甲羟戊酸(例如,通过HMG-CoA还原酶(HMGR)的作用);(d)将甲羟戊酸磷酸化成甲羟戊酸5-磷酸(例如,通过甲羟戊酸激酶(MK)的作用);(e)将甲羟戊酸5-磷酸转化为甲羟戊酸5-焦磷酸(例如,通过磷酸甲羟戊酸激酶(PMK)的作用);和(f)将甲羟戊酸5-焦磷酸转化为异戊烯焦磷酸(例如,通过甲羟戊酸焦磷酸脱羧酶(MPD)的作用)。MVA途径以及构成MEP途径的基因和酶描述于US7,667,017中,其据此通过引用整体并入本文。
在一些实施方案中,宿主细胞表达法尼基焦磷酸合成酶(FPPS),其从IPP或DMAPP产生法尼基焦磷酸。如图1所示,法尼基焦磷酸是用于生产朱栾倍半萜的中间体。示例性的法尼基焦磷酸合成酶是酿酒酵母的ERG20(NCBI登录号P08524)和大肠杆菌ispA。各种其他原核、酵母、植物和哺乳动物FPPS酶是已知的,并且可以根据这方面使用。
宿主细胞可以进一步表达异源倍半萜烯合成酶(诸如朱栾倍半萜合成酶)以产生所需的倍半萜烯。已知几种朱栾倍半萜合成酶,包括来自葡萄柚或来自甜橙(Citrussinensis)的朱栾倍半萜合成酶。甜橙VS(例如,AAQ04608.1)及其各种衍生物描述于US2012/0246767中,其据此通过引用并入本文。例如,本发明可以采用甜橙朱栾倍半萜合成酶的氨基酸序列(SEQ ID NO:12),或相对于野生型氨基酸序列(SEQ ID NO:12)具有1至30个突变或1至20个或1至10个突变的衍生物。这样的序列与野生型序列(SEQ ID NO:12)可以具有至少60%的序列同一性、至少70%的序列同一性、至少80%的序列同一性、至少90%的序列同一性、至少95%的序列同一性、或至少约96%、约97%、约98%或约99%的序列同一性。进一步地,Licker等(Phytochemistry(2004)65:2649-2659)描述了来自葡萄(VvVS)(SEQID NO:1)的朱栾倍半萜合成酶。在一个实施方案中,包括VvVS的氨基酸序列的朱栾倍半萜合成酶或其工程化的衍生物可用于本发明。各种倍半萜烯合成酶诸如朱栾倍半萜合成酶是已知的,并且描述于例如US 2012/0107893、US 2012/0246767和US 7,273,735中,其据此通过引用整体并入本文。
例如,在一些实施方案中,朱栾倍半萜合成酶是VvVS衍生物,其包含相对于VvVS(SEQ ID NO:1)具有独立地选自替换、缺失或插入的约1至约40个突变、约1至约35个突变、约1至约30个突变、约1至约25个突变、约1至约20个突变、约1至约15个突变、或约1至约10个突变的氨基酸序列。例如,VvVS衍生物可以包含相对于SEQ ID NO:1具有至少约5个或至少约10个,但少于约30个或约20个突变的氨基酸序列。在各种实施方案中,VvVS衍生物包含相对于VvVS(SEQ ID NO:1)具有约1个突变、约2个突变、约3个突变、约4个突变、约5个突变、约6个突变、约7个突变、约8个突变、约9个突变、约10个突变、约11个突变、约12个突变、约13个突变、约14个突变、约15个突变、约16个突变、约17个突变、约18个突变、约19个突变、约20个突变、约21个突变、约22个突变、约23个突变、约24个突变、约25个突变、约26个突变、约27个突变、约28个突变、约29个突变、约30个突变、约31个突变、约32个突变、约33个突变、约34个突变、约35个突变、约36个突变、约37个突变、约38个突变、约39个突变或约40个突变的氨基酸序列。这样的序列可以与SEQ ID NO:1具有至少60%的序列同一性、至少70%的序列同一性、至少80%的序列同一性、至少90%的序列同一性、至少95%的序列同一性、或至少约96%、约97%、约98%或约99%的序列同一性。VvVS的示例性突变示于表3中。基于作为模板的5-非手性-马兜铃烯合成酶晶体结构(PDB:5EAT)的葡萄朱栾倍半萜合成酶(VvVS)的同源性模型可以指导突变。
表3.1相对于野生型(SEQ ID NO:1)的葡萄朱栾倍半萜合成酶突变的概述
表3.2:在Vv2M5背景(SEQ ID NO:9)中评估的突变的概述。
表3.3:在VS2背景(SEQ ID NO:11)中评估的突变的概述。
因此,在各种实施方案中,工程化的VvVS可以具有选自表3的至少约1个突变、约2个突变、约3个突变、约4个突变、约5个突变、约6个突变、约7个突变、约8个突变、约9个突变、约10个突变、约11个突变、约12个突变、约13个突变、约14个突变、约15个突变、约16个突变、约17个突变、约18个突变、约19个突变、约20个突变、约21个突变、约22个突变、约23个突变、约24个突变、约25个突变、约26个突变、约27个突变、约28个突变、约29个突变、约30个突变、约31个突变、约32个突变、约33个突变、约34个突变、约35个突变、约36个突变、约37个突变、约38个突变、约39个突变或约40个突变。图3中进一步描述了示例性的重组朱栾倍半萜合成酶Vv1M1(SEQ ID NO:3)、Vv2M1(SEQ ID NO:5)、Vv1M5(SEQ ID NO:7)、Vv2M5(SEQ ID NO:9)和VS2(SEQ ID NO:11),包括图3B中的比对。
在某些方面,本发明提供了多核苷酸,所述多核苷酸包含编码经修饰用于增加如上所述的朱栾倍半萜的表达的朱栾倍半萜合成酶的核苷酸序列。这样的多核苷酸可以在宿主细胞中表达,在染色体外元件诸如质粒上,或可以染色体整合。
在各种实施方案中,SrKO与P450还原酶一起表达以再生酶,或者,SrKO或衍生物与P450还原酶表达为嵌合P450酶。由于细菌平台的固有限制,诸如缺乏电子转移机制和细胞色素P450还原酶,以及由于缺乏内质网导致的P450酶的膜信号模块的翻译不相容性,细胞色素P450的功能性表达已被认为是具有挑战性的。
因此,在一些实施方案中,SrKO表达为具有细胞色素P450还原酶配偶体的融合蛋白。细胞色素P450还原酶是在内质网中发现的膜蛋白。它催化吡啶核苷酸脱水和电子转移到膜结合的细胞色素P450。在人、植物、其它哺乳动物和昆虫中发现了类似结构的同功酶。示例性的P450还原酶配偶体包括,例如,甜菊(Sr)CPR(SEQ ID NO:62和63)、甜菊(Sr)CPR1(SEQ ID NO:76和77)、拟南芥(At)CPR(SEQ ID NO:64和65)、红豆杉(Tc)CPR(SEQ ID NO:66和67)、青蒿(Aa)CPR(SEQ ID NO:68和69)、拟南芥(At)CPR1(SEQ ID NO:70和71)、拟南芥(At)CPR2(SEQ ID NO:72和73)、拟南芥(At)R2(SEQ ID NO:74和75)、甜菊(Sr)CPR2(SEQ IDNO:78和79);甜菊(Sr)CPR3(SEQ ID NO:80和81);香叶天竺葵(Pg)CPR(SEQ ID NO:82和83)。在一些实施方案中,这些P450中的任何一个都可以被衍生化,例如,以引入1至约20个突变或约1至约10个突变。图6B显示了拟南芥和青蒿CPR序列(SEQ ID NO:72、74、68、64和70)的氨基酸序列的比对。图6C显示了甜菊CPR序列(SEQ ID NO:78、80、62和76)的比对。图6D显示了8个CPR氨基酸序列(SEQ ID NO:74、72、82、68、80、62、78和76)的比对。
P450融合蛋白的工程化公开于例如US 2012/0107893和US2012/0164678中,其据此通过引用整体并入本文。在某些实施方案中,SrKO通过连接子与细胞色素P450还原酶配偶体融合。示例性的连接子序列主要是丝氨酸、甘氨酸和/或丙氨酸,以及任选地1至5个带电荷的氨基酸(诸如赖氨酸或精氨酸),所述示例性连接子序列包括例如,GSG、GSGGGGS(SEQID NO:113)、GSGEAAAK(SEQ ID NO:114)、GSGEAAAKEAAAK(SEQ ID NO:115)、GSGMGSSSN(SEQID NO:116)和GSTGS(SEQ ID NO:117)。连接子通常是柔性的,并且包含不超过1个、2个或3个疏水残基,并且通常长度为3至50个氨基酸,诸如长度为3至20个氨基酸。在其他实施方案中,P450还原酶在宿主细胞中单独表达,并且在一些实施方案中可以在与SrKO相同的操纵子中表达。在一些实施方案中,P450还原酶在宿主细胞中单独表达,并且基因任选地整合到基因组中或从质粒表达。
在某些实施方案中,P450酶的N端可以被工程化以增加它们的功能表达。膜结合P450的N端在酶表达、膜缔合和底物接近中起重要作用。据报道,在P450的N端使用稀有密码子显着改善P450的表达水平。进一步地,因为大多数植物P450酶是膜结合的,并且疏水底物被认为通过在P450和膜之间动态建立的通道进入酶,N端工程化可以影响膜和P450的缔合,并因此底物接近酶。因此,在一个实施方案中,SrKO的N端工程化产生SrKO衍生物,SrKO衍生物在宿主系统诸如大肠杆菌或酵母中维持或显示增强的朱栾倍半萜氧化酶活性。示例性的N端序列是MALLLAVF(SEQ ID NO:112),其他示例性序列包括主要为疏水性的4至20个氨基酸(诸如4至15个氨基酸,或4至10个氨基酸,或约8个氨基酸)的序列,例如,优选由选自亮氨酸、缬氨酸、丙氨酸、异亮氨酸和苯丙氨酸的(至少50%或至少75%)氨基酸构成。
在一些实施方案中,SrKO是缺失其N端跨膜区的至少一部分并且添加来自大肠杆菌yhcB或其衍生物的内膜跨膜结构域的衍生物。在这些实施方案中,P450酶与大肠杆菌内膜具有更稳定和/或有成效的缔合,这减少了膜缔合的P450酶的表达诱导的细胞应激。在一些实施方案中,SrKO是缺失其N端跨膜结构域的15至35个氨基酸并且添加来自大肠杆菌yhcB或其衍生物的跨膜结构域的15至25个氨基酸的衍生物。在一些实施方案中,衍生物的N端跨膜结构域包含氨基酸序列MAWEYALIGLVVGIIIGAVA(SEQ ID_NO:118)或相对于SEQ IDNO:118具有1至10个或1至5个氨基酸突变的氨基酸序列。
在一些实施方案中,宿主细胞进一步表达一种或多种将产物转换为诺卡酮的酶。例如,宿主细胞可以表达从诺特卡醇产生诺卡酮的醇脱氢酶,其实例包括红串红球菌CDH(SEQ ID NO:84)、甜橙DH(SEQ ID NO:86)、甜橙DH1(SEQ ID NO:88)、甜橙DH2(SEQ ID NO:90)、甜橙DH3(SEQ ID NO:92)、葡萄DH(SEQ ID NO:94)、葡萄DH1(SEQ ID NO:96)、甜橙ABA2(SEQ ID NO:98)、短柄草DH(SEQ ID NO:100)和红球姜SDR(SEQ ID NO:102)。醇脱氢酶可以包含与本段落中描述的一种或多种酶具有至少70%、至少80%或至少90%序列同一性的并具有将诺特卡醇转化为诺卡酮的活性的氨基酸序列。
倍半萜烯(例如,朱栾倍半萜及其含氧产物)可以作为大肠杆菌中非甲羟戊酸途径的生物合成产物产生,其包括两个模块:形成异戊烯焦磷酸(IPP)的天然上游途径和形成类萜的异源下游途径。可以采用代谢途径工程化的多变量模块方法来优化工程化的大肠杆菌中倍半萜烯的生产。多变量模块途径工程化的方法基于系统多变量搜索,以识别最佳平衡两个途径模块的条件,从而使抑制性中间体的积累和通量转移至副产物减至最小。
WO 2011/060057、US 2011/0189717、US 2012/107893和US 8512988(其各自据此通过引用并入本文)描述了通过控制参与上游途径和下游途径的基因或蛋白的表达来优化细胞中类萜生产的方法和组合物。这可以通过将酶途径分组为两个模块来实现:上游(MEP)途径模块(例如,包括MEP途径的一个或多个基因)和倍半萜烯生产的下游异源途径。使用这种基本构型,就其对萜烯和类萜(例如,倍半萜烯)生产的影响来评估参数,诸如质粒拷贝数对细胞生理学的影响、表达盒中的基因顺序和启动子强度、以及染色体整合。因此可以在模块化方法中调节MEP途径内的基因表达。如本文所用,通过模块化方法调节是指多个基因一起调节。通过实例,MEP途径内的多个基因可以在DNA的连续区域诸如操纵子上重组表达。应当理解,与本发明的方面一致,MEP途径内的基因模块可以以任何顺序包含MEP途径内的任何基因。在一些实施方案中,MEP途径内的基因是以下之一:dxs、ispC、ispD、ispE、ispF、ispG、ispH、idi、ispA或ispB。MEP途径内的基因模块的非限制性实例是包含基因dxs、idi、ispD和ispF的模块,并且称为dxs-idi-ispDF。
可以通过本领域普通技术人员已知的方法实现基因和/或蛋白(包括诸如dxs-idi-ispDF操纵子和FPPS-VS操纵子的模块)表达的操作。例如,可以通过选择具有不同强度的启动子(诸如诱导型启动子)来调节基因或操纵子的表达。启动子的几个非限制性实例包括Trc、T5和T7。另外,可以通过操作细胞中基因或操纵子的拷贝数来调节基因或操纵子的表达。
MEP途径内的一种或多种基因和/或蛋白的表达可以被上调和/或下调。在某些实施方案中,MEP途径内的一种或多种基因和/或蛋白的上调可以与MEP途径内的一种或多种基因和/或蛋白的下调组合。通过实例,在一些实施方案中,过表达非甲羟戊酸(MEP)途径的一种或多种组分的细胞至少部分用于扩增异戊基二磷酸(IPP)和二甲基烯丙基二磷酸(DMAPP),GGPPS的底物。在一些实施方案中,非甲羟戊酸(MEP)途径的一种或多种组分的过表达通过增加非甲羟戊酸(MEP)途径的一种或多种组分的拷贝数来实现。在这方面,诸如通过额外的游离型表达,可以扩增MEP途径中限速步骤的组分诸如(dxs、ispD、ispF、idi)的拷贝数。
在一些实施方案中,吲哚的生产用作倍半萜烯生产的替代标记,和/或控制培养物中吲哚的积累以增加倍半萜烯的产量。例如,在各种实施方案中,将培养物中吲哚的积累控制到低于约100mg/L、或低于约75mg/L、或低于约50mg/L、或低于约25mg/L、或低于约10mg/L。可以通过使用上述的多元模块化方法平衡蛋白表达和活性来控制吲哚的积累,和/或通过化学方法来控制。
在其它方面,本发明提供了制备含有含氧的倍半萜烯(如所述)的产品的方法,其包括将根据上述方法制备和回收的含氧的倍半萜烯掺入消费品或工业产品中。例如,产品可以是风味产品、芳香产品、化妆品、清洁产品、洗涤剂或肥皂,或害虫防治产品(例如驱虫剂)。在一些实施方案中,通过分级(例如分馏)回收和任选富集的含氧的产物是诺卡酮,并且产品是选自饮料、口香糖、糖果或风味添加剂的风味产品,或是驱虫剂。
氧化产物可以通过任何合适的方法回收,包括将所需产物分配到有机相中。可以例如通过气相色谱法(例如,GC-MS)测定和/或定量所需产物的产量。可以在间歇或连续生物反应器系统中生产所需产物。产物的生产、回收和/或产物的分析可以如US 2012/0246767中所述进行,其据此通过引用整体并入本文。例如,在一些实施方案中,从水性反应介质中提取氧化油,提取可以通过分配到有机相中进行,例如,使用有机溶剂诸如烷烃诸如庚烷,然后分馏。可以通过GC/MS定量测量级分的倍半萜烯和类倍半萜烯组分,然后混合级分以产生用于风味(或其它)应用的所需的含诺卡酮的成分。
在其他方面,本发明提供了包含编码本文所述的P450衍生物的核苷酸序列的多核苷酸。在一些实施方案中,多核苷酸可以被密码子优化以在大肠杆菌或酵母中表达。在另一个实例中,多核苷酸可以包含编码SrKO融合蛋白的核苷酸序列,任选地具有本文所述的P450还原酶配偶体。在其他实施方案中,本发明提供了包含编码本文所述的倍半萜烯合成酶变体的核苷酸序列的多核苷酸,所述多核苷酸同样可以被密码子优化用于在大肠杆菌或酵母中表达。除了编码P450或倍半萜烯合成酶的序列之外,这样的多核苷酸可以进一步包含一个或多个表达控制元件。例如,多核苷酸可以包含一个或多个启动子或转录增强子、核糖体结合位点、转录终止信号和聚腺苷酸化信号作为表达控制元件。多核苷酸可以被插入任何合适的载体中,包括表达载体,并且多核苷酸可以被包含在用于表达的任何合适的宿主细胞中。多核苷酸可以被设计成用于引入任何合适的宿主细胞(包括细菌细胞和酵母细胞)中和/或表达蛋白质,并且可以从质粒表达,或可以被染色体整合。在一些实施方案中,重组核酸分子编码与野生型酶(SEQ ID NO:37)相比具有更高的朱栾倍半萜氧化活性的SrKO衍生物,并具有所述的前导序列,诸如前导序列MALLLAVF(SEQ ID NO:117)或源自大肠杆菌yhcB的前导序列。在某些实施方案中,重组核酸分子进一步编码作为操纵子或作为与能够再生SrKO酶的SrKO衍生物、SrCPR或其衍生物的框内融合。当作为融合蛋白存在时,SrKO衍生物和SrCPR可以通过3至10个氨基酸(例如,5个氨基酸)的连接序列连接。在一些实施方案中,连接序列主要为甘氨酸、丝氨酸和/或丙氨酸,并且可以包含序列GSTGS。
在其它方面,本发明提供了产生如本文所述的含氧的倍半萜烯的宿主细胞,其表达用于从异戊基焦磷酸(IPP)产生所需含氧的倍半萜烯的所有酶组分。例如,在各种实施方案中,宿主细胞表达法尼基焦磷酸合成酶、倍半萜烯合成酶和SrKO或其衍生物。IPP可以通过MEP和/或MVA途径产生,其对于宿主细胞可以是内源性的或通过异源酶的表达或途径中某些酶的复制来修饰。宿主细胞包括本文所述的各种细菌和酵母。
在其它方面,本发明提供了通过本文所述的方法和宿主细胞生产的倍半萜烯产物。如本文所公开,SrKO酶通过产生诺特卡醇并进一步氧化成酮(诺卡酮),显示出独特的活性,并产生不同的含氧的萜烯产物,包括羟基大根香叶-1(10)5-二烯和依兰烷-3,9(11)二烯-10-过氧化物。
此外,测试的其它P450酶,包括先前已知的对朱栾倍半萜底物具有羟化活性的倍半萜烯CYP450或P450,产生一种立体异构体(β-诺特卡醇)和仅少量的酮(诺卡酮)。具体地,其他倍半萜烯CYP450酶产生β-诺特卡醇和羟基朱栾倍半萜作为主要产物,而紫杉醇CYP450酶不产生任何含氧的朱栾倍半萜(表4和图7)。
在某些方面,本发明涉及SrKO衍生物酶。例如,SrKO衍生物可以包含相对于SEQ IDNO:37在选自46、76、94、131、231、284、383、390、400、444、468、488和499的位置具有一个或多个突变的氨基酸序列。例如,在一些实施方案中,SrKO是包含相对于SEQ ID NO:37具有选自R76K、M94V、T131Q、F231L、H284Q、R383K、I390L、T468I和T499N的一个或多个(2个、3个、4个或全部)突变的氨基酸序列的衍生物。在一些实施方案中,SrKO衍生物包含选自SEQ IDNO:55-61、104或105的氨基酸序列,其根据本公开被工程化以提高朱栾倍半萜的氧合活性(例如,诺卡酮的产量)。在一些实施方案中,衍生物包含相对于选自SEQ ID NO:55-61、104或105的序列具有1至20个突变、或1至10个突变、或1至5个突变的氨基酸序列,附带条件是氨基酸序列相对于SEQ ID NO:37在选自46、76、94、131、231、284、383、390、400、444、468、488和499的位置具有1个、2个、3个或更多个突变,或附带条件是SrKO衍生物包含相对于SEQID NO:37具有选自R76K、M94V、T131Q、F231L、H284Q、R383K、I390L、T468I和T499N的1个、2个、3个或更多个(或全部)突变的氨基酸序列。如本文所示,这些突变提高SrKO朱栾倍半萜氧化活性的水平。
在这些或其他实施方案中,SrKO是缺失其N端跨膜区的至少一部分并添加来自大肠杆菌yhcB或其衍生物的内膜跨膜结构域的衍生物。在一些实施方案中,SrKO是缺失其N端跨膜结构域的15至35个氨基酸(相对于SEQ ID NO:37)并添加来自大肠杆菌yhcB或其衍生物的跨膜结构域的15至25个氨基酸的衍生物。在一些实施方案中,衍生物的N端跨膜结构域包含氨基酸序列MAWEYALIGLVVGIIIGAVA(SEQ ID NO:118)或相对于SEQ ID NO:118具有1至10个或1至5个氨基酸突变的氨基酸序列。
在其它方面,本发明提供了制备经修饰的SrKO多肽的方法,其中所述方法包括以下步骤:(i)在允许多肽表达的条件下培养表达经修饰的多肽的宿主细胞;和(ii)任选地回收多肽。
在其他方面,本发明提供了一种制备含氧的倍半萜烯的方法,包括以下步骤:(i)提供经修饰的SrKO多肽,(ii)使倍半萜烯与经修饰的SrKO多肽接触,和(iii)回收所产生的含氧的倍半萜烯。所述方法可以进一步包括提供用于再生SrKO辅因子(例如,SrCPR)的CPR酶。在一些实施方案中,回收含氧的倍半萜烯作为油。在一些实施方案中,倍半萜烯是朱栾倍半萜。在一些实施方案中,含氧的倍半萜烯包括羟基大根香叶-1(10)5-二烯、依兰烷-3,9(11)二烯-10-过氧化物、诺特卡醇和诺卡酮。在一些实施方案中,主要的含氧产物是诺卡酮和/或诺特卡醇。
在另一方面,提供了基于P45017A1的结构坐标的SrKO晶体模型结构(CMS),其具有本文所述的SrKO或衍生物的氨基酸序列。CMS包含萜烯结合口袋结构域(TBD),TBD包含萜烯结合口袋(TBP)和结合到TBD的萜烯(例如,朱栾倍半萜)。图8A和8B。这种SrKO晶体模型结构(CMS)促进SrKO衍生物的计算机测试。
因此,在其它实施方案中,本发明提供了筛选能够结合TBD的萜烯的方法,其中所述方法包括使用SrKO CMS。在另一方面,本发明提供了筛选能够结合TBP的萜烯的方法,所述方法包括使TBP与测试化合物接触,以及确定所述测试化合物是否结合到所述TBP。在一些实施方案中,所述方法是要筛选可用于调节SrKO酶的活性的测试化合物(例如,萜烯)。
在另一方面,本发明提供了用于预测、模拟或建模萜烯结合结构域(TBD)的分子特征和/或分子相互作用的方法,包括使用计算机模型,所述计算机模型包括使用或描述如上定义的萜烯结合结构域的结构坐标,以提供所述配体结合结构域的图像并任选地显示所述图像。
实施例
实施例1:生产大肠杆菌菌株的倍半萜烯前体(朱栾倍半萜)的构建
产生过表达上游MEP途径基因dxs、ispD、ispF和idi的大肠杆菌,其促进通量至支持超过1g/L滴度的异源二类萜产物(3)的类异戊二烯前体异戊基焦磷酸(IPP)。通过用法尼基焦磷酸合成酶(FPPS)和倍半萜烯合成酶或香叶基焦磷酸合成酶(GPPS)和单萜合成酶代替香叶基香叶基焦磷酸合成酶(GGPS)和二萜合成酶,构建产生各种类萜包括单萜和倍半萜烯的菌株。为了开发倍半萜烯生产菌株以测试CYP450用于新型含氧的萜烯,朱栾倍半萜合成酶被克隆并在MEP途径过表达的大肠杆菌菌株中表达。高底物通量有助于识别CYP450的活性。以前,对含氧的紫杉二烯生产菌株的研究表明,当将CYP450途径转移到紫杉二烯生产菌株(300mg/L至~10mg/L)时,生产力显著降低。
进一步地,应用多变量模块化代谢工程(MMME)来平衡朱栾倍半萜的高水平生产的途径。与获得100's的mg/L二类萜的先前结果相比,天然存在的朱栾倍半萜合成酶(诸如来自葡萄的朱栾倍半萜合成酶),甚至在MMME优化后经常表现出次优化(~5mg/L)。涉及倍半萜烯生物合成的酶可能难以在大肠杆菌中表达,并且相对于涉及初级代谢(17)的那些,在动力学上也是有缺陷的。
一种葡萄朱栾倍半萜合成酶(VvVS)的同源性模型使用软件包(Schrodinger,Inc.)构建,用5-非手性-马兜铃烯合成酶的晶体结构作为模板(PDB:5EAT)。进一步地,为了识别萜烯合成酶的自然突变景观,产生了结合数百个相关萜烯合成酶序列的广泛的多序列比对。使用此信息,使用回归共识、计算机能量学(in silico energetics)和结构分析的组合设计突变。已显示回归共识突变是改善稳定性(19、20)和表达(21)的重要工具。使用基于BioLuminate中的原子力场模型的能量计算来评估预测用于具有低溶剂可接近的表面积的位置的单个突变的折叠的ΔΔG,其预期影响折叠和稳定性。
通过应用MMME方法,在具有p15A复制起点和T7启动子的质粒上结合密码子优化形式的VvVS,识别平衡的上游和下游朱栾倍半萜生产菌株。然后将该菌株背景用于筛选设计的合成酶突变。使用上述蛋白工程工具,我们设计了超过200个独特的点突变(表3),然后使用定点诱变在p15A-T7筛选质粒中构建该点突变。将突变的酶变体转化到筛选菌株中,将一式三份的菌落在选择性LB细胞培养基中培养过夜,然后接种到最小R培养基中并在22℃下培养4天。为了朱栾倍半萜的生产力,使用甲基叔丁基醚(MTBE)提取培养物,并通过组合气相色谱/质谱法分析培养物。
大约五分之一的设计点突变在我们的筛选菌株中增加朱栾倍半萜生产力至少20%(图2)。然后将有益点突变策略性地组合以赋予越来越有利的表型。提供了重组的朱栾倍半萜合成酶序列如Vv1M1(突变-R331K、I334E、N335S、V371I、A374L、T418V、S482T、S512P、K356N、Q491K、E394D、A428V、Y348F、T318S、L352I、I442L、A554P)、Vv2M1(突变-R331K、I334E、N335S、V371I、A374L、T418V、S482T、S512P、K356N、Q491K、E394D、A428V、V542T、G480A、M305L、K441R、A554P)、Vv1M5(突变-R331K、I334E、N335S、V371I、A374L、T418V、S482T、S512P、K356N、Q491K、E394D、A428V、Y348F、T318S、L352I、I442L、A554P、H284M、C46K、F448T、Q533E)和Vv2M5(突变-R331K、I334E、N335S、V371I、A374L、T418V、S482T、S512P、K356N、Q491K、E394D、A428V、V542T、G480A、M305L、K441R、A554P、H284M、C46K、F448T、Q533E)(图3)。当这些酶中的任一种在具有dxs-idi-ispDF过表达且使用MMME平衡的我们的MEP途径菌株中过表达时,所获得的朱栾倍半萜的滴度足以激发P450酶的掺入以测试其催化含氧的朱栾倍半萜形成的能力。掺入P450之前朱栾倍半萜的滴度为约30mg/L。
实施例2:CYP450文库对朱栾倍半萜支架的功能活性
使用朱栾倍半萜作为模型系统以验证基于CYP450的氧合化学用于生产萜烯化学品的能力。
使用朱栾倍半萜生产大肠杆菌菌株作为宿主背景进行CYPP450候选物筛选。为了构建用于功能性表达的CYP450,使用专有的质粒系统p5Trc(源自pSC101的质粒)构建含有通过柔性5-氨基酸连接子(GSTGS,SEQ IDNO:117)融合至N端截短的甜菊细胞色素P450还原酶(SrCPR)的候选P450的质粒。各种候选P450的序列显示在图4中。分析候选CYP450用于N端膜缔合区,N端膜结合区被截短,并向融合体中加入8个氨基酸的前导序列(MALLLAVF,SEQID NO:112)(图5A和5B)。也在类似的遗传构建中制备来自拟南芥和红豆杉的CPR氧化还原配偶体(red/ox partner)。由于天然SrCPR是有效的,因此没有测定这些构建体的活性水平。各种CPR氧化还原配偶体的序列显示在图6中。在p5Trc-CYP450-L-SrCPR转化到朱栾倍半萜生产菌株后,菌株在抗生素选择性LB培养基中在30℃下培养过夜。然后将这些培养物用于接种无菌试管中的具有15g/L甘油和0.1mM IPTG的2mL抗生素选择性R培养基培养物,随后在22℃下培养4天,然后用甲基叔丁基醚(MTBE)提取。
从表4列出的那些中选择一组CYP450酶,并在这种大肠杆菌系统中分类为倍半萜烯和二萜氧合。在测试用于朱栾倍半萜氧合的各种CYP450酶中,发现来自甜菊(SrKO)(16)的贝壳杉烯氧化酶在朱栾倍半萜支架上具有独特的氧合化学。SrKO将C19位置的二萜(-)-贝壳杉烯自然地氧化为(-)-贝壳杉烯酸。SrKO酶在本研究中通过产生羟基化产物的不同的立体异构体(α和β诺特卡醇并进一步氧化成酮(诺卡酮))显示出独特的活性,并且产生不同的含氧的萜烯产物,除了α-诺特卡醇、β-诺特卡醇和诺卡酮之外,还包括羟基大根香叶-1(10)5-二烯、依兰烷-3,9(11)二烯-10-过氧化物。其他P450,包括先前已知的用于羟基化朱栾倍半萜的倍半萜烯CYP450,产生仅一种异构体(β诺特卡醇)和仅可检测量的酮(诺卡酮)。具体地,其它倍半萜烯CYP450酶产生β-诺特卡醇和羟基朱栾倍半萜作为主要产物,而另一种二萜CYP450酶(例如,紫杉属5-α羟化酶)产生诺特卡醇作为仅少量的(可检测的)产物(表4和图7)。
表4:通过在大肠杆菌中选择P450酶从朱栾倍半萜形成的主要产物
实施例3:SrKO的结构和突变研究
一旦识别了SrKO的独特活性,进行实验以提高其进行朱栾倍半萜的不同氧化的能力。没有描述SrKO的晶体结构。针对RCSB蛋白数据库的SrKO的Blast搜索显示了SrKO对于具有晶体结构的P450酶的序列同一性是低的(~20%)。鉴于P450的保守折叠结构,而不管其低的序列同一性,使用最先进技术的(state-of-the-art)蛋白建模工具建立在SrKO上。选择催化人雄激素生物合成的膜结合细胞色素P45017A1的晶体结构(参见DeVore N.M.,Scott E.E.,Nature,482,116-119,2012)作为模板用于模型开发。使用BioLuminate蛋白建模软件,开发同源性模型(图8A),使得关键残基和特征基序的定位(参见Gotoh O.,J.BiolChem,267,83-90,1992)与模板良好比对。此外,还构建了包括修复的血红素铁复合物的SrKO的同源模型。然后使用AutoDock VINA在SrKO活性位点中产生用于朱栾倍半萜的可能的结合模型的集合(图8B)(29)。
此外,针对NCBI非冗余蛋白序列文库的SrKO的Blast搜索没有返回序列同一性大于80%的直系同源物(SrKO本身除外)。顶部命中物列在表5中。
表5:在制备同源模型中用SrKO的BLAST搜索
一旦识别了SrKO的独特活性,进行实验以提高其进行朱栾倍半萜的不同氧化的能力。使用回归共识诱变策略,构建P450酶的多序列比对,包括来自使用4种种子贝壳杉烯氧化酶基因的Uniref100数据库的BLAST搜索的序列(在聚集并消除具有大于90%同一性的序列之后),来自使用P450BM3、P450CAM和P450eryF作为种子基因的细菌蛋白质组的BLAST搜索的序列,以及最密切相关的SrKO同源物。基于同源性模型、多重序列比对和文献,设计并测试了各种点突变和双突变。在上述体内测试系统中评估这些细胞色素P450衍生物在总的含氧萜烯生产力(例如,通过GC/MS观察到的主峰的总数)中的改善。由模型指导的活性位点位置上的诱变揭示了具有显著改善的含氧产物的几种变体(下表6和下表7)。
表6:与SEQ ID NO:38的SrKO相比的总含氧油的结合口袋突变及其倍数生产力
表7:与SEQ ID NO:38的SrKO相比的总含氧油的非结合口袋点突变和生产力
实施例4:诺卡酮的分离和评估
通过GC/MS(Agilent 6800;柱:Rtx-5,0.32mm×60m×1.0μm膜厚度;GC温度。程序:40℃持续5分钟,以4℃/min升至300℃并保持30分钟)分析由细胞色素P450酶SrKO(SEQ IDNO:38)的朱栾倍半萜的氧化得到的产物,得到表8A和8B中提供的数据。
表8A.朱栾倍半萜的SrKO氧化
表8B.朱栾倍半萜的SrKO氧化
对由SrKO衍生物产生的产物进行类似的分析。证实了产物谱是可比的,并且可以基于SrKO的诱变以更高的水平产生诺卡酮、诺特卡醇的主要产物。
然后可以使用合适的溶剂(例如庚烷)从水性反应介质中提取氧化的油产物,随后分馏。每个级分的化学组成可以通过GC/MS定量测量。可以混合级分以产生用于风味剂或其它应用的所需的诺特卡醇和/或诺卡酮成分。
通过用表9中提供的分析直接与参考的诺卡酮调味产品(例如,从花臣香精(Frutarom)获得的现有天然调味商业产品)比较,可以进行可接受的验证。
表9.来自花臣香精的市售天然调味剂诺卡酮的分析
实施例5:N端锚定工程化
为了优化初始SrKO变体(在这些实施例中称为朱栾倍半萜氧化酶1或VO1)的膜相互作用,识别了锚定在具有胞质C端的内膜中的大肠杆菌蛋白。选择大肠杆菌yhcB的N端序列,其提供单次跨膜结构域。将来自yhcB的N端的20-24个氨基酸交换为原始膜锚定序列MALLLAVF(SEQ ID NO:112),并且SrKO N端截短的大小从28变为32。参见图9。VO1在p5质粒上的T7启动子的控制下表达。SrCPR独立于染色体表达。如已经描述的,将菌株在96深孔板中在30℃下在R培养基加甘油和十二烷覆盖中培养48小时。
如图10所示,与对照的平均值相比,n20yhcB_t29VO1在总含氧滴度中表现出1.2倍的生产力。N20yhcB_t29VO1表现出的总含氧滴度约为原始8RP锚的1.8倍(未显示)。
实施例6:VO1的突变分析
尽力进行VO1的突变分析以增加含氧滴度。使用菌株MB2509(MP6-MEP MP1-ScFPPSFab46-VS2MP6-ScCPR)作为背景,当用p5-T7-yhcB-VO1质粒转化时,产生约18%诺卡酮。为了诺卡酮的更高产量,评估菌株。
在基于P450 17A1(实施例3)的同源模型的指导下,在18位置进行VO活性位点的位点饱和诱变,构建5个成对位置文库。通过底物对接识别第一壳残基,并且基于用于改变结合口袋几何形状的相对接近性和位置选择非保守的第一壳残基。通过重叠延伸PCR和Gibson组装构建成对位置文库。
表12:成对位置文库(根据SEQ ID NO:37编号)
为了朱栾倍半萜的总氧合作用,按照实施例4评估菌株。在30℃和22℃下评估菌株。
成对位置文库的初步筛选揭示了一些变体失去活性。文库3包含在22℃而不是30℃具有改善的活性的变体。因此,在第一壳残基中同时引入两个或多个突变可能对活性有影响。
表13:进行以下单一位置SSM(根据SEQ ID NO:37编号)
几个变体改善含氧滴度高达1.7倍。在位置E323、I390和Q500的突变显示出具有改善的氧合滴度的几个命中物,并且选择这些位置用于二次筛选。
接下来,在VO1背景中筛选回归共识突变(19个突变体)。使用实施例3中所述的筛选方法,筛选以下突变:A2T、I389L、I389V、I389A、M94V、T488D、E491K、E52A、H46R、D191N、L150M、I495V、T468I、K344D、Q268T、R351Q、R76K、V400Q和I444A(根据SEQ ID NO:37编号)。如图13A所示,超过50%的突变导致1.2至1.45倍的含氧滴度(显示为mg/L),而产物谱中没有显著变化。用A2T、M94V、T488D、E52A、H46R、L150M、T468I、K344D、Q268T、R351Q、R76K、V400Q和I444A,观察到改善,选择它们用于二次筛选。图13B显示了相对于倍数总含氧产物变化标绘的相同的筛选。
选择来自活性位点SSM(L231M、I390L、I390M、T131K和T131Q)的前导变体、N端锚定变体n20yhcB_t29VOR1和回归共识诱变,并重新筛选。该二次筛选的结果显示在图14中。几个突变显示含氧滴度改善1.1-1.4倍。为了缩小用于重组的突变列表,在33℃筛选相同的突变以区分稳定的突变,稳定的突变可以使过程转换到更高的温度。如图15所示,6个突变(M94V、L150M、T468I、R76K、I390L和T131Q)在33℃保持改善的生产力。除了前导N端锚定,选择这六个突变用于重组。
实施例7:SrKO重组文库筛选
通过在每个位点允许变体或野生型,将在二次筛选后选择的七个突变(实施例6)随机掺入到VO重组文库中。背景菌株是MB2509(EGV G2MP6-CPR)+pBAC-T7-BCD7-yhcB-VO。
在30℃下初步筛选(使用实施例5中所述的相同方法)识别了几种变体,与VO1相比,在含氧产物滴度中具有高达1.35倍的改善。此外,选择变体显示生产转移到诺卡酮,表明了更高的P450活性(因为诺卡酮的生产需求两个氧合循环)。初级筛选的结果显示在图16A中(菌株对比以mg/L计的滴度)。图16B呈现了基于氧合能力(诺卡酮和诺特卡醇的总量)显示的相同筛选。
然后在34℃和37℃下筛选重组变体以选择在更高温度下具有改善的活性和稳定性的前导序列。二级筛选的结果显示在图17中。虽然对照在37℃几乎完全无活性,但6个前导序列在较高温度下显示出有希望的活性,并被选择用于进一步筛选(c11(8)、b4(7)、c6(1)、c12(3)、b6(2)和c9(6))。基于该进一步筛选(图18),基于氧合能力选择c6(1)作为最佳变体。六个前导序列包括以下组的突变。
表14:来自重组文库的前导变体中的突变的组
图23(A和B)显示了本文所述的几种工程化的朱栾倍半萜氧化酶(VO)变体的比对,并且突出了在筛选过程中评估的选择突变。在图23A中:8rp-t20SrKO(SEQ ID NO:106)是在N端具有20个氨基酸截短的SrKO序列,并且添加了8个氨基酸的膜锚定。8rp-t20VO0(SEQ IDNO:107)具有SrKO N端的20个氨基酸的截短、8个氨基酸的N端锚定的添加和位置499的单突变(根据野生型SrKO编号)。n22yhcB-t30VO1(SEQ ID NO:104)具有SrKO N端的30个氨基酸的截短,基于来自大肠杆菌yhcB的22个氨基酸的膜锚定,以及在位置46、231、284、383、400、488和499的8个点突变(相对于SrKO野生型)。n22yhcB-t30VO2(SEQ ID NO:105)具有SrKO N端的30个氨基酸截短,基于来自大肠杆菌yhcB的22个氨基酸的膜锚定,以及在位置76、94、131、231、284、383、390、468和499的9个点突变(相对于SrKO野生型)。在图24B中,显示针对野生型SrKO(SEQ ID NO:108)的VO0(SEQ ID NO:109)、VO1(SEQ ID NO:110)和VO2(SEQ IDNO:111)中的点突变(为了方便起见,所有用野生型SrKO N端显示)。
实施例8:细胞色素P450还原酶筛选
用VO1筛选一组细胞色素P450还原酶用于改善活性。使用菌株MB2459作为背景,用pBAC-T7-BCD7-VO1(I382L)-T7BCDx-CPRx进行该实施例。BCD代表BiCistronic Design,并描述于Mutalik等人Nature Methods 2013(10)4:354中。较低的BCD数字是指较高的翻译率。CPR包括SrCPR(SEQ ID NO:62)、SrCPR3(SEQ ID NO:80)、AaCPR(SEQ ID NO:68)、PgCPR(SEQ ID NO:82)、AtCPR2(SEQ ID NO:72)、AtCPR1(SEQ ID NO:70)、eSrCPR1(SEQ ID NO:76)和eATR2(SEQ ID NO:74)。如实施例5中那样在30℃下测试菌株。
如图20所示,通过RNA测序研究获得的SrCPR3在含氧滴度方面表现出1.3倍的改善。
在34℃下重新测试CPR直系同源物。结果示于图20中。即使在较高温度下,SrCPR3(SEQ ID NO:80)和AaCPR(SEQ ID NO:68)都显示含氧滴度的1.3倍改善。含氧滴度与在30℃下获得的那些相当。
实施例9:改变产物谱的醇脱氢酶
评估了醇脱氢酶将诺特卡醇转化为诺卡酮的能力。评估了以下ADH酶:
表15:CPR酶
使用MB2490作为背景菌株(MP6-MEP FAB46-ScFPPS-L-VS1MP6-VO1-o-SrCPR+p5-T7-BCD14-ADH),如实施例5中那样评估菌株。简言之,MP6、Fab46和T7是指用于连接的基因或操纵子的启动子。这里MEP是过表达大肠杆菌dxs、idi和ispDF基因的操纵子。ScFPPS和VS1之间的L是指编码(GSTGS)的短多肽连接子,而VO1和SrCPR之间的-o-是指操纵子构建体,在该操纵子构建体中RBS序列插入在两个基因之间。加号表示质粒,加号后面描述为具有启动子、BCD(如上所述)和所讨论的ADH的p5(5个拷贝)质粒。
识别了四种直系同源物(vvDH、csABA2、bdDH和zzSDR),其将诺特卡醇转化成诺卡酮,导致诺卡酮滴度增加超过3倍。图21。
参考文献
1.Qualley A,Dudareva N.Plant Volatiles.Encycl.Life Sci.2010;1-9.
2.Ajikumar P,Tyo K,Carlsen S.Terpenoids:opportunities forbiosynthesis of natural product drugs using engineeredmicroorganisms.Mol.Pharm.[Internet].2008[cited 2013May 16];5(2):167-90.
3.Ajikumar PK,Xiao W-H,Tyo KEJ,Wang Y,Simeon F,Leonard E,etal.Isoprenoid pathway optimization for Taxol precursor overproduction inEscherichia coli.Science[Internet].2010Oct 1[cited 2013May 22];330(6000):70-4.
4.Ro D-K,Paradisc EM,Ouellet M,Fishcr KJ,Newman KL,Ndungu JM,ctal.Production of the antimalarial drug precursor artemisinic acid inengineered yeast.Nature[Internet].2006Apr 13[cited 2013Mar 1];440(7086):940-3.
5.Sevrioukova IF,Li H,Zhang H,Peterson J a,Poulos TL.Structure of acytochromc P450-rcdox partncr elcctron-transfcrcomplcx.Proc.Natl.Acad.Sci.U.S.A.[Intcrnct].1999Mar 2;96(5):1863-8.
6.Sevrioukova IF,Poulos TL.Structural biology of redox partnerinteractions in P450cam monooxygcnasc:a fresh look at an oldsystcm.Arch.Biochcm.Biophys.[Internet].Elsevier Inc.;2011Mar 1[cited 2013Mar26];507(1):66-74.
7.Ekroos M, T.Structural basis for ligand promiscuity incytochrome P450 3A4.Proc.Natl.Acad.Sci.U.S.A.[Internet].2006Sep 12;103(37):13682-7.
8.Takahashi S,Yeo Y-S,Zhao Y,O’Maille PE,Greenhagen BT,Noel JP,etal.Functional characterization ofpremnaspirodiene oxygenase,a cytochromeP450catalyzing regio-and stereo-specific hydroxylations of diversesesqaiterpene substrates.J.Biol.Chem.[Internet].2007Oct 26[cited 2013Mar 10];282(43):31744-54.
9.Morrone D,Chen X,Coates RM,Peters RJ.Characterization of thekaurene oxidasc CYP701A3,a multifunctional cytochromc P450from gibbcrcllinbiosynthesis.Biochem.J.[Internet].2010Nov 1[cited 2013Feb 6];431(3):337-44.
10.Zhang Z,Sibbesen O.The substrate specificity ofcytochromeP450cam.Bioorganic Med....[Internet].1998[cited 2013Sep 24];6:1501-8.
11.Stjernschantz E,van Vugt-Lussenburg BM a,Bonifacio A,de Beer SB a,van der Zwan G,Gooijer C,et al.Structural rationalization of novel drugmetabolizing mutants of cytochrome P450BM3.Proteins[Internet].2008Apr[cited2013May 16];71(1):336-52.
12.Chen MMY,Snow CD,Vizearra CL,Mayo SL,Arnold FH.Comparison ofrandom mutagcncsis and scmi-rational designcd librarics for improvedcytochrome P450BM3-catalyzed hydroxylation of small alkanes.ProteinEng.Des.Sel.[Internet].2012Apr[cited 2013Mar 11];25(4):171-8.
13.Harford-Cross CF,Carmichael a B,Allan FK,England P a,Rouch D a,Wong LL.Protein engineering of cytochrome p450(cam)(CYP101)for the oxidationof polycyclic aromatic hydrocarbons.Protein Eng.[Internet].2000 Fcb;13(2):121-8.
14.Bell SG,Chen X,Sowden RJ,Xu F,Williams JN,Wong L,et al.MolecularRecognition in(+)-r-Pinene Oxidation by CytochromeP450cam.J.Am.Chem.Soc.2003;125:705-14.
15.Sowden R,Yasmin S,Rees N,Bell SG,Wong L-L.Biotransformation ofthesesquiterpene(+)-valencene by cytochrome P450cam and P450BM-3.Org.Biomol.Chcm.[Intcmct].2005[citcd 2013 May 16];3:57-64.
16.Brandle JE,Richman A,Swanson AK,Chapman BP.Leaf Ests from Steviarebaudiana:a resource for gene discovery in diterpene synthesis.PlantMol.Biol.[Internet].2002 Nov[cited 2013 Feb 12];50(4-5):613-22.
17.Bar-even A,Noor E,Savir Y,Liebermeister W,Davidi D,Tawfik DS,etal.The Moderately Efficient Enzyme:Evolutionary and Physicochemical TrendsShaping Enzyme Parameters.Biochemistry.2011;
18.Pleiss J.Protein design in metabolic engineering and syntheticbiology.Curr.Opin.Biotechnol.[Internet].2011 Oct[cited 2013 Mar 5];22(5):611-7.
19.Lehmann M,Pasamontes L,Lassen SF,Wyss M.The consensus concept forthermostability engineering of proteins.Biochim.Biophys.Acta[Intemet].2000Dec 29[cited 2013 Oct 17];1543(2):408-15.
20.Vazqucz-Figucroa E,Yeh V,Brocring JM,Chaparro-Riggcrs JF,BommariusAS.Thermostable variants constructed via the structure-guided consensusmethod also show increased stability in salts solutions and homogeneousaqueous-organic media.Protein Eng.Des.Sel.[Internet].2008 Nov[cited 2013 Oct17];21(11):673-80.
21.Dai M,Fisher HE,Temirov J,Kiss C,Phipps ME,Pavlik P,et al.Thecreation of a novel fluorcsccnt protcin by guidcd consensuscngincering.Protcin Eng.Dcs.Scl.[Internct].2007 Feb[cited 2013 Scp 19];20(2):69-79.
22.Fraatz M a.,Riemer SJL, R,Kaspera R,Nimtz M,Berger RG,etal.Anovel oxygenase from Pleurotus sapidus transforms valencene tonootkatone.J.Mol.Catal.B Enzym.[Intemet].2009 Dec[cited 2013 Apr 17];61(3-4):202-7.
23.Krügener S,Krings U,Zorn H,Berger RG.A dioxygenase of Pleurotussapidus transforms(+)-valencene regio-specifically to(+)-nootkatone via astereo-spccific allylic hydropcroxidation.Biorcsour.Tcchnol.[Intcrnct].Elscvicr Ltd;2010 Jan[cited 2013 Apr 12];101(2):457-62.
24.Zorn H,Fraatz MA,Riemer SJL,Takenberg M.Enzymatic synthesis ofnootkatonc.GERMANY;2010.
25.Kaspcra R,Krings U,Nanzad T,Bcrgcr RG.Bioconvcrsion of(+)-valcnccnc in submerged cultures of the ascomycete Chaetomiumglobosum.Appl.Microbiol.Biotechnol.[Internet].2005 Jun[cited 2013 May 16];67(4):477-83.
26.Cankar K,van Houwelingen A,Bosch D,Sonke T,Bouwmeester H,Beekwilder J.A chicory cytochrome P450 mono-oxygenase CYP71AV8 for theoxidation of (+)-valencene.FEBS Lett.[Internet].Federation of EuropeanBiochemical Societies;2011 Jan 3[cited 2013 Mar 6];585(1):178-82.
27.Girhard M,Machida K,Itoh M,Schmid RD,Arisawa A,UrlacherVB.Regioselective biooxidation of(+)-valencene by recombinant E.coliexpressing CYP 109B1 from Bacillus subtilis in a two-liquid-phasesystem.Microb.Cell Fact.[Internet].2009 Jan[cited 2013 May 16];8(4):36.
28.Bm-P,Sowden R J,Yasmin S,Rccs NH,Bcll SG,Wong L.Biotransformationof the sesquiterpene(+)-valencene by cytochrome P450.2005;57-64.
29.Trott O,Olson A.AutoDock Vina:improving the speed and accuracy ofdocking with a new scoring function,efficient optimization,andmultithreading.J.Comput.Chem.[Intcrnct].2010[citcd 2013 Jul 26];31(2):455-61.1
序列表
<110> 马努斯生物合成股份有限公司
<120> 含氧萜烯的生产方法
<130> FCT170093-99
<150> PCT/US2015/046369
<151> 2015-08-21
<150> US 62/040,284
<151> 2014-08-21
<160> 118
<170> PatentIn version 3.5
<210> 1
<211> 556
<212> PRT
<213> 葡萄
<400> 1
Met Ser Thr Gln Val Ser Ala Ser Ser Leu Ala Gln Ile Pro Gln Pro
1 5 10 15
Lys Asn Arg Pro Val Ala Asn Phe His Pro Asn Ile Trp Gly Asp Gln
20 25 30
Phe Ile Thr Tyr Thr Pro Glu Asp Lys Val Thr Arg Ala Cys Lys Glu
35 40 45
Glu Gln Ile Glu Asp Leu Lys Lys Glu Val Lys Arg Lys Leu Thr Ala
50 55 60
Ala Ala Val Ala Asn Pro Ser Gln Leu Leu Asn Phe Ile Asp Ala Val
65 70 75 80
Gln Arg Leu Gly Val Ala Tyr His Phe Glu Gln Glu Ile Glu Glu Ala
85 90 95
Leu Gln His Ile Cys Asn Ser Phe His Asp Cys Asn Asp Met Asp Gly
100 105 110
Asp Leu Tyr Asn Ile Ala Leu Gly Phe Arg Leu Leu Arg Gln Gln Gly
115 120 125
Tyr Thr Ile Ser Cys Asp Ile Phe Asn Lys Phe Thr Asp Glu Arg Gly
130 135 140
Arg Phe Lys Glu Ala Leu Ile Ser Asp Val Arg Gly Met Leu Gly Leu
145 150 155 160
Tyr Glu Ala Ala His Leu Arg Val His Gly Glu Asp Ile Leu Ala Lys
165 170 175
Ala Leu Ala Phe Thr Thr Thr His Leu Lys Ala Met Val Glu Ser Leu
180 185 190
Gly Tyr His Leu Ala Glu Gln Val Ala His Ala Leu Asn Arg Pro Ile
195 200 205
Arg Lys Gly Leu Glu Arg Leu Glu Ala Arg Trp Tyr Ile Ser Val Tyr
210 215 220
Gln Asp Glu Ala Phe His Asp Lys Thr Leu Leu Glu Leu Ala Lys Leu
225 230 235 240
Asp Phe Asn Leu Val Gln Ser Leu His Lys Glu Glu Leu Ser Asn Leu
245 250 255
Ala Arg Trp Trp Lys Glu Leu Asp Phe Ala Thr Lys Leu Pro Phe Ala
260 265 270
Arg Asp Arg Leu Val Glu Gly Tyr Phe Trp Met His Gly Val Tyr Phe
275 280 285
Glu Pro Gln Tyr Leu Arg Gly Arg Arg Ile Leu Thr Lys Val Ile Ala
290 295 300
Met Thr Ser Ile Leu Asp Asp Ile His Asp Ala Tyr Gly Thr Pro Glu
305 310 315 320
Glu Leu Lys Leu Phe Ile Glu Ala Ile Glu Arg Trp Asp Ile Asn Ser
325 330 335
Ile Asn Gln Leu Pro Glu Tyr Met Lys Leu Cys Tyr Val Ala Leu Leu
340 345 350
Asp Val Tyr Lys Glu Ile Glu Glu Glu Met Glu Lys Glu Gly Asn Gln
355 360 365
Tyr Arg Val His Tyr Ala Lys Glu Val Met Lys Asn Gln Val Arg Ala
370 375 380
Tyr Phe Ala Glu Ala Lys Trp Leu His Glu Glu His Val Pro Ala Phe
385 390 395 400
Glu Glu Tyr Met Arg Val Ala Leu Ala Ser Ser Gly Tyr Cys Leu Leu
405 410 415
Ala Thr Thr Ser Phe Val Gly Met Gly Glu Ile Ala Thr Lys Glu Ala
420 425 430
Phe Asp Trp Val Thr Ser Asp Pro Lys Ile Met Ser Ser Ser Asn Phe
435 440 445
Ile Thr Arg Leu Met Asp Asp Ile Lys Ser His Lys Phe Glu Gln Lys
450 455 460
Arg Gly His Val Thr Ser Ala Val Glu Cys Tyr Met Lys Gln Tyr Gly
465 470 475 480
Val Ser Glu Glu Gln Val Tyr Ser Glu Phe Gln Lys Gln Ile Glu Asn
485 490 495
Ala Trp Leu Asp Ile Asn Gln Glu Cys Leu Lys Pro Thr Ala Val Ser
500 505 510
Met Pro Leu Leu Ala Arg Leu Leu Asn Phe Thr Arg Thr Met Asp Val
515 520 525
Ile Tyr Lys Glu Gln Asp Ser Tyr Thr His Val Gly Lys Val Met Arg
530 535 540
Asp Asn Ile Ala Ser Val Phe Ile Asn Ala Val Ile
545 550 555
<210> 2
<211> 1671
<212> DNA
<213> 葡萄
<400> 2
atggctacgc aggtctcagc ctcgtcactg gcacaaatcc cgcagccgaa aaatcgtccg 60
gtggcgaact tccatccgaa tatctggggt gaccagttta tcacgtatac cccggaagat 120
aaagtgaccc gtgcgtgcaa ggaagaacaa attgaagacc tgaaaaagga agtcaaacgc 180
aagctgaccg cagcagcagt ggcaaacccg tctcagctgc tgaattttat cgatgcggtt 240
caacgtctgg gcgtcgccta tcatttcgaa caggaaattg aagaagcact gcaacatatc 300
tgcaacagct ttcacgattg taatgatatg gacggcgatc tgtataacat tgctctgggt 360
ttccgtctgc tgcgccagca aggctacacg atttcctgtg acatctttaa taaattcacc 420
gatgaacgtg gtcgctttaa ggaagcgctg atctcagacg ttcgtggcat gctgggtctg 480
tatgaagctg cgcatctgcg cgtccacggc gaagatattc tggccaaagc actggctttc 540
accacgaccc acctgaaggc gatggtcgaa tctctgggtt accatctggc agaacaggtg 600
gcacacgccc tgaaccgtcc gatccgcaaa ggcctggaac gtctggaagc gcgctggtat 660
attagtgtgt accaggacga agcatttcat gataaaaccc tgctggaact ggctaagctg 720
gatttcaacc tggttcaatc tctgcacaaa gaagaactga gtaatctggc ccgttggtgg 780
aaagaactgg actttgcgac caagctgccg ttcgcccgtg atcgcctggt tgaaggctat 840
ttttggatgc atggtgtcta tttcgaaccg cagtacctgc gcggtcgtcg cattctgacg 900
aaagtgatcg caatgacctc gattctggat gacatccacg acgcttacgg caccccggaa 960
gaactgaaac tgtttattga agcgatcgaa cgttgggata ttaacagcat caatcagctg 1020
ccggaatata tgaaactgtg ctacgtggcc ctgctggatg tttacaagga aatcgaagaa 1080
gaaatggaaa aggaaggtaa ccagtatcgt gttcattacg cgaaagaagt catgaagaat 1140
caagtgcgcg cctactttgc agaagctaaa tggctgcatg aagaacacgt gccggcgttc 1200
gaagaatata tgcgcgttgc gctggccagc tctggctact gtctgctggc cacgacctct 1260
tttgtgggca tgggtgaaat tgcaacgaaa gaagcgtttg actgggttac cagtgatccg 1320
aagattatga gttcctcaaa ctttatcacc cgtctgatgg atgacattaa atcccataag 1380
ttcgaacaga aacgcggtca cgtcacctca gccgtggaat gctatatgaa acagtacggc 1440
gtttcggaag aacaagtcta tagcgaattt cagaaacaaa tcgaaaacgc atggctggat 1500
attaatcagg aatgtctgaa accgacggca gtctccatgc cgctgctggc tcgtctgctg 1560
aattttacgc gcacgatgga tgtgatctat aaagaacagg attcgtacac ccatgtgggc 1620
aaggttatgc gcgataacat tgcaagcgtg ttcattaatg ctgttatcta a 1671
<210> 3
<211> 556
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜合成酶
<400> 3
Met Ala Thr Gln Val Ser Ala Ser Ser Leu Ala Gln Ile Pro Gln Pro
1 5 10 15
Lys Asn Arg Pro Val Ala Asn Phe His Pro Asn Ile Trp Gly Asp Gln
20 25 30
Phe Ile Thr Tyr Thr Pro Glu Asp Lys Val Thr Arg Ala Cys Lys Glu
35 40 45
Glu Gln Ile Glu Asp Leu Lys Lys Glu Val Lys Arg Lys Leu Thr Ala
50 55 60
Ala Ala Val Ala Asn Pro Ser Gln Leu Leu Asn Phe Ile Asp Ala Val
65 70 75 80
Gln Arg Leu Gly Val Ala Tyr His Phe Glu Gln Glu Ile Glu Glu Ala
85 90 95
Leu Gln His Ile Cys Asn Ser Phe His Asp Cys Asn Asp Met Asp Gly
100 105 110
Asp Leu Tyr Asn Ile Ala Leu Gly Phe Arg Leu Leu Arg Gln Gln Gly
115 120 125
Tyr Thr Ile Ser Cys Asp Ile Phe Asn Lys Phe Thr Asp Glu Arg Gly
130 135 140
Arg Phe Lys Glu Ala Leu Ile Ser Asp Val Arg Gly Met Leu Gly Leu
145 150 155 160
Tyr Glu Ala Ala His Leu Arg Val His Gly Glu Asp Ile Leu Ala Lys
165 170 175
Ala Leu Ala Phe Thr Thr Thr His Leu Lys Ala Met Val Glu Ser Leu
180 185 190
Gly Tyr His Leu Ala Glu Gln Val Ala His Ala Leu Asn Arg Pro Ile
195 200 205
Arg Lys Gly Leu Glu Arg Leu Glu Ala Arg Trp Tyr Ile Ser Val Tyr
210 215 220
Gln Asp Glu Ala Phe His Asp Lys Thr Leu Leu Glu Leu Ala Lys Leu
225 230 235 240
Asp Phe Asn Leu Val Gln Ser Leu His Lys Glu Glu Leu Ser Asn Leu
245 250 255
Ala Arg Trp Trp Lys Glu Leu Asp Phe Ala Thr Lys Leu Pro Phe Ala
260 265 270
Arg Asp Arg Leu Val Glu Gly Tyr Phe Trp Met His Gly Val Tyr Phe
275 280 285
Glu Pro Gln Tyr Leu Arg Gly Arg Arg Ile Leu Thr Lys Val Ile Ala
290 295 300
Met Thr Ser Ile Leu Asp Asp Ile His Asp Ala Tyr Gly Ser Pro Glu
305 310 315 320
Glu Leu Lys Leu Phe Ile Glu Ala Ile Glu Lys Trp Asp Glu Ser Ser
325 330 335
Ile Asn Gln Leu Pro Glu Tyr Met Lys Leu Cys Phe Val Ala Leu Ile
340 345 350
Asp Val Tyr Asn Glu Ile Glu Glu Glu Met Glu Lys Glu Gly Asn Gln
355 360 365
Tyr Arg Ile His Tyr Leu Lys Glu Val Met Lys Asn Gln Val Arg Ala
370 375 380
Tyr Phe Ala Glu Ala Lys Trp Leu His Asp Glu His Val Pro Ala Phe
385 390 395 400
Glu Glu Tyr Met Arg Val Ala Leu Ala Ser Ser Gly Tyr Cys Leu Leu
405 410 415
Ala Val Thr Ser Phe Val Gly Met Gly Glu Ile Val Thr Lys Glu Ala
420 425 430
Phe Asp Trp Val Thr Ser Asp Pro Lys Leu Met Ser Ser Ser Asn Phe
435 440 445
Ile Thr Arg Leu Met Asp Asp Ile Lys Ser His Lys Phe Glu Gln Lys
450 455 460
Arg Gly His Val Thr Ser Ala Val Glu Cys Tyr Met Lys Gln Tyr Gly
465 470 475 480
Val Thr Glu Glu Gln Val Tyr Ser Glu Phe Lys Lys Gln Ile Glu Asn
485 490 495
Ala Trp Leu Asp Ile Asn Gln Glu Cys Leu Lys Pro Thr Ala Val Pro
500 505 510
Met Pro Leu Leu Ala Arg Leu Leu Asn Phe Thr Arg Thr Met Asp Val
515 520 525
Ile Tyr Lys Glu Gln Asp Ser Tyr Thr His Val Gly Lys Val Met Arg
530 535 540
Asp Asn Ile Ala Ser Val Phe Ile Asn Pro Val Ile
545 550 555
<210> 4
<211> 1671
<212> DNA
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜合成酶
<400> 4
atggcaaccc aggtgagtgc aagcagcctg gcccaaatcc ctcagccgaa aaaccgcccg 60
gttgcaaact tccaccctaa tatctggggc gatcagttca tcacctatac cccggaagat 120
aaagtgacaa gggcctgcaa agaggagcag atcgaggacc tgaaaaaaga ggtgaagcgc 180
aagctgaccg cagccgcagt ggcaaacccg agccaactgt taaacttcat cgatgccgtg 240
cagcgcctgg gcgttgccta tcacttcgag caggaaatcg aagaagccct acagcacatc 300
tgtaacagct tccacgattg taacgacatg gatggcgact tatacaacat agcattaggt 360
ttccgcttac tgcgtcagca gggctacacc ataagctgcg acatctttaa caagtttacc 420
gacgagcgcg gtcgttttaa agaggcgctg attagcgacg ttcgcggcat gttaggtctg 480
tacgaagccg cacatctgcg cgtgcacggc gaagacattc tggcgaaggc gctggcattc 540
acaaccacac acctgaaggc aatggtggaa agtctgggct accacttagc cgagcaggtt 600
gcccatgcac tgaatcgccc gattcgtaag ggcctggaac gcctggaagc ccgctggtac 660
atcagtgttt atcaggatga agcctttcat gataagaccc tgctggagct ggcaaagctg 720
gatttcaacc tggttcagag cctgcataag gaagagctga gcaacctggc ccgttggtgg 780
aaggagctgg atttcgcaac caagctgccg ttcgccaggg acaggttagt tgaaggctac 840
ttctggatgc acggcgttta cttcgagccg caatacctgc gtggccgccg catcctgacg 900
aaggtgatcg ccatgaccag cattctggac gacatccacg atgcgtacgg gagccctgag 960
gagttaaagc tgttcatcga ggcaatcgag aagtgggatg agagtagcat caaccaactg 1020
ccggagtata tgaaactgtg cttcgtggcc ctgattgatg tttacaatga gattgaagag 1080
gagatggaga aagaggggaa ccagtaccgc atccactacc tgaaagaggt gatgaagaat 1140
caggtgcgcg catacttcgc agaggccaaa tggctgcatg atgagcatgt tcctgccttc 1200
gaggagtaca tgcgcgtggc attagccagc agtggttact gtctgttagc cgttacgagc 1260
ttcgtgggta tgggcgagat cgtgaccaaa gaggcattcg actgggtgac gagcgacccg 1320
aagctgatga gcagcagcaa cttcatcacc cgtctgatgg acgacatcaa gagccacaag 1380
ttcgagcaga aacgcggtca cgttaccagc gccgtggagt gctacatgaa gcagtacggc 1440
gtgacagagg agcaagtgta cagcgagttc aagaaacaaa tcgagaacgc ctggctggac 1500
atcaaccaag agtgcctgaa accgaccgca gtgccgatgc ctctgttagc ccgtctgctg 1560
aatttcacac gcacgatgga cgttatctac aaggagcagg atagctacac ccacgttggt 1620
aaggtgatgc gcgacaacat cgccagtgtg ttcatcaacc cggtgatcta a 1671
<210> 5
<211> 556
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜合成酶
<400> 5
Met Ala Thr Gln Val Ser Ala Ser Ser Leu Ala Gln Ile Pro Gln Pro
1 5 10 15
Lys Asn Arg Pro Val Ala Asn Phe His Pro Asn Ile Trp Gly Asp Gln
20 25 30
Phe Ile Thr Tyr Thr Pro Glu Asp Lys Val Thr Arg Ala Cys Lys Glu
35 40 45
Glu Gln Ile Glu Asp Leu Lys Lys Glu Val Lys Arg Lys Leu Thr Ala
50 55 60
Ala Ala Val Ala Asn Pro Ser Gln Leu Leu Asn Phe Ile Asp Ala Val
65 70 75 80
Gln Arg Leu Gly Val Ala Tyr His Phe Glu Gln Glu Ile Glu Glu Ala
85 90 95
Leu Gln His Ile Cys Asn Ser Phe His Asp Cys Asn Asp Met Asp Gly
100 105 110
Asp Leu Tyr Asn Ile Ala Leu Gly Phe Arg Leu Leu Arg Gln Gln Gly
115 120 125
Tyr Thr Ile Ser Cys Asp Ile Phe Asn Lys Phe Thr Asp Glu Arg Gly
130 135 140
Arg Phe Lys Glu Ala Leu Ile Ser Asp Val Arg Gly Met Leu Gly Leu
145 150 155 160
Tyr Glu Ala Ala His Leu Arg Val His Gly Glu Asp Ile Leu Ala Lys
165 170 175
Ala Leu Ala Phe Thr Thr Thr His Leu Lys Ala Met Val Glu Ser Leu
180 185 190
Gly Tyr His Leu Ala Glu Gln Val Ala His Ala Leu Asn Arg Pro Ile
195 200 205
Arg Lys Gly Leu Glu Arg Leu Glu Ala Arg Trp Tyr Ile Ser Val Tyr
210 215 220
Gln Asp Glu Ala Phe His Asp Lys Thr Leu Leu Glu Leu Ala Lys Leu
225 230 235 240
Asp Phe Asn Leu Val Gln Ser Leu His Lys Glu Glu Leu Ser Asn Leu
245 250 255
Ala Arg Trp Trp Lys Glu Leu Asp Phe Ala Thr Lys Leu Pro Phe Ala
260 265 270
Arg Asp Arg Leu Val Glu Gly Tyr Phe Trp Met His Gly Val Tyr Phe
275 280 285
Glu Pro Gln Tyr Leu Arg Gly Arg Arg Ile Leu Thr Lys Val Ile Ala
290 295 300
Leu Thr Ser Ile Leu Asp Asp Ile His Asp Ala Tyr Gly Thr Pro Glu
305 310 315 320
Glu Leu Lys Leu Phe Ile Glu Ala Ile Glu Lys Trp Asp Glu Ser Ser
325 330 335
Ile Asn Gln Leu Pro Glu Tyr Met Lys Leu Cys Tyr Val Ala Leu Leu
340 345 350
Asp Val Tyr Asn Glu Ile Glu Glu Glu Met Glu Lys Glu Gly Asn Gln
355 360 365
Tyr Arg Ile His Tyr Leu Lys Glu Val Met Lys Asn Gln Val Arg Ala
370 375 380
Tyr Phe Ala Glu Ala Lys Trp Leu His Asp Glu His Val Pro Ala Phe
385 390 395 400
Glu Glu Tyr Met Arg Val Ala Leu Ala Ser Ser Gly Tyr Cys Leu Leu
405 410 415
Ala Val Thr Ser Phe Val Gly Met Gly Glu Ile Val Thr Lys Glu Ala
420 425 430
Phe Asp Trp Val Thr Ser Asp Pro Arg Ile Met Ser Ser Ser Asn Phe
435 440 445
Ile Thr Arg Leu Met Asp Asp Ile Lys Ser His Lys Phe Glu Gln Lys
450 455 460
Arg Gly His Val Thr Ser Ala Val Glu Cys Tyr Met Lys Gln Tyr Ala
465 470 475 480
Val Thr Glu Glu Gln Val Tyr Ser Glu Phe Lys Lys Gln Ile Glu Asn
485 490 495
Ala Trp Leu Asp Ile Asn Gln Glu Cys Leu Lys Pro Thr Ala Val Pro
500 505 510
Met Pro Leu Leu Ala Arg Leu Leu Asn Phe Thr Arg Thr Met Asp Val
515 520 525
Ile Tyr Lys Glu Gln Asp Ser Tyr Thr His Val Gly Lys Thr Met Arg
530 535 540
Asp Asn Ile Ala Ser Val Phe Ile Asn Pro Val Ile
545 550 555
<210> 6
<211> 1671
<212> DNA
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜合成酶
<400> 6
atggcaaccc aggtgagtgc aagcagcctg gcccaaatcc ctcagccgaa aaaccgcccg 60
gttgcaaact tccaccctaa tatctggggc gatcagttca tcacctatac cccggaagat 120
aaagtgacaa gggcctgcaa agaggagcag atcgaggacc tgaaaaaaga ggtgaagcgc 180
aagctgaccg cagccgcagt ggcaaacccg agccaactgt taaacttcat cgatgccgtg 240
cagcgcctgg gcgttgccta tcacttcgag caggaaatcg aagaagccct acagcacatc 300
tgtaacagct tccacgattg taacgacatg gatggcgact tatacaacat agcattaggt 360
ttccgcttac tgcgtcagca gggctacacc ataagctgcg acatctttaa caagtttacc 420
gacgagcgcg gtcgttttaa agaggcgctg attagcgacg ttcgcggcat gttaggtctg 480
tacgaagccg cacatctgcg cgtgcacggc gaagacattc tggcgaaggc gctggcattc 540
acaaccacac acctgaaggc aatggtggaa agtctgggct accacttagc cgagcaggtt 600
gcccatgcac tgaatcgccc gattcgtaag ggcctggaac gcctggaagc ccgctggtac 660
atcagtgttt atcaggatga agcctttcat gataagaccc tgctggagct ggcaaagctg 720
gatttcaacc tggttcagag cctgcataag gaagagctga gcaacctggc ccgttggtgg 780
aaggagctgg atttcgcaac caagctgccg ttcgccaggg acaggttagt tgaaggctac 840
ttctggatgc acggcgttta cttcgagccg caatacctgc gtggccgccg catcctgacg 900
aaggtgatcg ccctgaccag cattctggac gacatccacg atgcgtacgg gacccctgag 960
gagttaaagc tgttcatcga ggcaatcgag aagtgggatg agagtagcat caaccaactg 1020
ccggagtata tgaaactgtg ctatgtggcc ctgctggatg tttacaatga gattgaagag 1080
gagatggaga aagaggggaa ccagtaccgc atccactacc tgaaagaggt gatgaagaat 1140
caggtgcgcg catacttcgc agaggccaaa tggctgcatg atgagcatgt tcctgccttc 1200
gaggagtaca tgcgcgtggc attagccagc agtggttact gtctgttagc cgttacgagc 1260
ttcgtgggta tgggcgagat cgtgaccaaa gaggcattcg actgggtgac gagcgacccg 1320
cgtattatga gcagcagcaa cttcatcacc cgtctgatgg acgacatcaa gagccacaag 1380
ttcgagcaga aacgcggtca cgttaccagc gccgtggagt gctacatgaa gcagtacgca 1440
gtgacagagg agcaagtgta cagcgagttc aagaaacaaa tcgagaacgc ctggctggac 1500
atcaaccaag agtgcctgaa accgaccgca gtgccgatgc ctctgttagc ccgtctgctg 1560
aatttcacac gcacgatgga cgttatctac aaggagcagg atagctacac ccacgttggt 1620
aagaccatgc gcgacaacat cgccagtgtg ttcatcaacc cggtgatcta a 1671
<210> 7
<211> 556
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜合成酶
<400> 7
Met Ala Thr Gln Val Ser Ala Ser Ser Leu Ala Gln Ile Pro Gln Pro
1 5 10 15
Lys Asn Arg Pro Val Ala Asn Phe His Pro Asn Ile Trp Gly Asp Gln
20 25 30
Phe Ile Thr Tyr Thr Pro Glu Asp Lys Val Thr Arg Ala Lys Lys Glu
35 40 45
Glu Gln Ile Glu Asp Leu Lys Lys Glu Val Lys Arg Lys Leu Thr Ala
50 55 60
Ala Ala Val Ala Asn Pro Ser Gln Leu Leu Asn Phe Ile Asp Ala Val
65 70 75 80
Gln Arg Leu Gly Val Ala Tyr His Phe Glu Gln Glu Ile Glu Glu Ala
85 90 95
Leu Gln His Ile Cys Asn Ser Phe His Asp Cys Asn Asp Met Asp Gly
100 105 110
Asp Leu Tyr Asn Ile Ala Leu Gly Phe Arg Leu Leu Arg Gln Gln Gly
115 120 125
Tyr Thr Ile Ser Cys Asp Ile Phe Asn Lys Phe Thr Asp Glu Arg Gly
130 135 140
Arg Phe Lys Glu Ala Leu Ile Ser Asp Val Arg Gly Met Leu Gly Leu
145 150 155 160
Tyr Glu Ala Ala His Leu Arg Val His Gly Glu Asp Ile Leu Ala Lys
165 170 175
Ala Leu Ala Phe Thr Thr Thr His Leu Lys Ala Met Val Glu Ser Leu
180 185 190
Gly Tyr His Leu Ala Glu Gln Val Ala His Ala Leu Asn Arg Pro Ile
195 200 205
Arg Lys Gly Leu Glu Arg Leu Glu Ala Arg Trp Tyr Ile Ser Val Tyr
210 215 220
Gln Asp Glu Ala Phe His Asp Lys Thr Leu Leu Glu Leu Ala Lys Leu
225 230 235 240
Asp Phe Asn Leu Val Gln Ser Leu His Lys Glu Glu Leu Ser Asn Leu
245 250 255
Ala Arg Trp Trp Lys Glu Leu Asp Phe Ala Thr Lys Leu Pro Phe Ala
260 265 270
Arg Asp Arg Leu Val Glu Gly Tyr Phe Trp Met Met Gly Val Tyr Phe
275 280 285
Glu Pro Gln Tyr Leu Arg Gly Arg Arg Ile Leu Thr Lys Val Ile Ala
290 295 300
Met Thr Ser Ile Leu Asp Asp Ile His Asp Ala Tyr Gly Ser Pro Glu
305 310 315 320
Glu Leu Lys Leu Phe Ile Glu Ala Ile Glu Lys Trp Asp Glu Ser Ser
325 330 335
Ile Asn Gln Leu Pro Glu Tyr Met Lys Leu Cys Phe Val Ala Leu Ile
340 345 350
Asp Val Tyr Asn Glu Ile Glu Glu Glu Met Glu Lys Glu Gly Asn Gln
355 360 365
Tyr Arg Ile His Tyr Leu Lys Glu Val Met Lys Asn Gln Val Arg Ala
370 375 380
Tyr Phe Ala Glu Ala Lys Trp Leu His Asp Glu His Val Pro Ala Phe
385 390 395 400
Glu Glu Tyr Met Arg Val Ala Leu Ala Ser Ser Gly Tyr Cys Leu Leu
405 410 415
Ala Val Thr Ser Phe Val Gly Met Gly Glu Ile Val Thr Lys Glu Ala
420 425 430
Phe Asp Trp Val Thr Ser Asp Pro Lys Leu Met Ser Ser Ser Asn Thr
435 440 445
Ile Thr Arg Leu Met Asp Asp Ile Lys Ser His Lys Phe Glu Gln Lys
450 455 460
Arg Gly His Val Thr Ser Ala Val Glu Cys Tyr Met Lys Gln Tyr Gly
465 470 475 480
Val Thr Glu Glu Gln Val Tyr Ser Glu Phe Lys Lys Gln Ile Glu Asn
485 490 495
Ala Trp Leu Asp Ile Asn Gln Glu Cys Leu Lys Pro Thr Ala Val Pro
500 505 510
Met Pro Leu Leu Ala Arg Leu Leu Asn Phe Thr Arg Thr Met Asp Val
515 520 525
Ile Tyr Lys Glu Glu Asp Ser Tyr Thr His Val Gly Lys Val Met Arg
530 535 540
Asp Asn Ile Ala Ser Val Phe Ile Asn Pro Val Ile
545 550 555
<210> 8
<211> 1671
<212> DNA
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜合成酶
<400> 8
atggcaaccc aggtgagtgc aagcagcctg gcccaaatcc ctcagccgaa aaaccgcccg 60
gttgcaaact tccaccctaa tatctggggc gatcagttca tcacctatac cccggaagat 120
aaagtgacaa gggccaaaaa agaggagcag atcgaggacc tgaaaaaaga ggtgaagcgc 180
aagctgaccg cagccgcagt ggcaaacccg agccaactgt taaacttcat cgatgccgtg 240
cagcgcctgg gcgttgccta tcacttcgag caggaaatcg aagaagccct acagcacatc 300
tgtaacagct tccacgattg taacgacatg gatggcgact tatacaacat agcattaggt 360
ttccgcttac tgcgtcagca gggctacacc ataagctgcg acatctttaa caagtttacc 420
gacgagcgcg gtcgttttaa agaggcgctg attagcgacg ttcgcggcat gttaggtctg 480
tacgaagccg cacatctgcg cgtgcacggc gaagacattc tggcgaaggc gctggcattc 540
acaaccacac acctgaaggc aatggtggaa agtctgggct accacttagc cgagcaggtt 600
gcccatgcac tgaatcgccc gattcgtaag ggcctggaac gcctggaagc ccgctggtac 660
atcagtgttt atcaggatga agcctttcat gataagaccc tgctggagct ggcaaagctg 720
gatttcaacc tggttcagag cctgcataag gaagagctga gcaacctggc ccgttggtgg 780
aaggagctgg atttcgcaac caagctgccg ttcgccaggg acaggttagt tgaaggctac 840
ttctggatga tgggcgttta cttcgagccg caatacctgc gtggccgccg catcctgacg 900
aaggtgatcg ccatgaccag cattctggac gacatccacg atgcgtacgg gagccctgag 960
gagttaaagc tgttcatcga ggcaatcgag aagtgggatg agagtagcat caaccaactg 1020
ccggagtata tgaaactgtg cttcgtggcc ctgattgatg tttacaatga gattgaagag 1080
gagatggaga aagaggggaa ccagtaccgc atccactacc tgaaagaggt gatgaagaat 1140
caggtgcgcg catacttcgc agaggccaaa tggctgcatg atgagcatgt tcctgccttc 1200
gaggagtaca tgcgcgtggc attagccagc agtggttact gtctgttagc cgttacgagc 1260
ttcgtgggta tgggcgagat cgtgaccaaa gaggcattcg actgggtgac gagcgacccg 1320
aagctgatga gcagcagcaa caccatcacc cgtctgatgg acgacatcaa gagccacaag 1380
ttcgagcaga aacgcggtca cgttaccagc gccgtggagt gctacatgaa gcagtacggc 1440
gtgacagagg agcaagtgta cagcgagttc aagaaacaaa tcgagaacgc ctggctggac 1500
atcaaccaag agtgcctgaa accgaccgca gtgccgatgc ctctgttagc ccgtctgctg 1560
aatttcacac gcacgatgga cgttatctac aaggaggaag atagctacac ccacgttggt 1620
aaggtgatgc gcgacaacat cgccagtgtg ttcatcaacc cggtgatcta a 1671
<210> 9
<211> 556
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜合成酶
<400> 9
Met Ala Thr Gln Val Ser Ala Ser Ser Leu Ala Gln Ile Pro Gln Pro
1 5 10 15
Lys Asn Arg Pro Val Ala Asn Phe His Pro Asn Ile Trp Gly Asp Gln
20 25 30
Phe Ile Thr Tyr Thr Pro Glu Asp Lys Val Thr Arg Ala Lys Lys Glu
35 40 45
Glu Gln Ile Glu Asp Leu Lys Lys Glu Val Lys Arg Lys Leu Thr Ala
50 55 60
Ala Ala Val Ala Asn Pro Ser Gln Leu Leu Asn Phe Ile Asp Ala Val
65 70 75 80
Gln Arg Leu Gly Val Ala Tyr His Phe Glu Gln Glu Ile Glu Glu Ala
85 90 95
Leu Gln His Ile Cys Asn Ser Phe His Asp Cys Asn Asp Met Asp Gly
100 105 110
Asp Leu Tyr Asn Ile Ala Leu Gly Phe Arg Leu Leu Arg Gln Gln Gly
115 120 125
Tyr Thr Ile Ser Cys Asp Ile Phe Asn Lys Phe Thr Asp Glu Arg Gly
130 135 140
Arg Phe Lys Glu Ala Leu Ile Ser Asp Val Arg Gly Met Leu Gly Leu
145 150 155 160
Tyr Glu Ala Ala His Leu Arg Val His Gly Glu Asp Ile Leu Ala Lys
165 170 175
Ala Leu Ala Phe Thr Thr Thr His Leu Lys Ala Met Val Glu Ser Leu
180 185 190
Gly Tyr His Leu Ala Glu Gln Val Ala His Ala Leu Asn Arg Pro Ile
195 200 205
Arg Lys Gly Leu Glu Arg Leu Glu Ala Arg Trp Tyr Ile Ser Val Tyr
210 215 220
Gln Asp Glu Ala Phe His Asp Lys Thr Leu Leu Glu Leu Ala Lys Leu
225 230 235 240
Asp Phe Asn Leu Val Gln Ser Leu His Lys Glu Glu Leu Ser Asn Leu
245 250 255
Ala Arg Trp Trp Lys Glu Leu Asp Phe Ala Thr Lys Leu Pro Phe Ala
260 265 270
Arg Asp Arg Leu Val Glu Gly Tyr Phe Trp Met Met Gly Val Tyr Phe
275 280 285
Glu Pro Gln Tyr Leu Arg Gly Arg Arg Ile Leu Thr Lys Val Ile Ala
290 295 300
Leu Thr Ser Ile Leu Asp Asp Ile His Asp Ala Tyr Gly Thr Pro Glu
305 310 315 320
Glu Leu Lys Leu Phe Ile Glu Ala Ile Glu Lys Trp Asp Glu Ser Ser
325 330 335
Ile Asn Gln Leu Pro Glu Tyr Met Lys Leu Cys Tyr Val Ala Leu Leu
340 345 350
Asp Val Tyr Asn Glu Ile Glu Glu Glu Met Glu Lys Glu Gly Asn Gln
355 360 365
Tyr Arg Ile His Tyr Leu Lys Glu Val Met Lys Asn Gln Val Arg Ala
370 375 380
Tyr Phe Ala Glu Ala Lys Trp Leu His Asp Glu His Val Pro Ala Phe
385 390 395 400
Glu Glu Tyr Met Arg Val Ala Leu Ala Ser Ser Gly Tyr Cys Leu Leu
405 410 415
Ala Val Thr Ser Phe Val Gly Met Gly Glu Ile Val Thr Lys Glu Ala
420 425 430
Phe Asp Trp Val Thr Ser Asp Pro Arg Ile Met Ser Ser Ser Asn Thr
435 440 445
Ile Thr Arg Leu Met Asp Asp Ile Lys Ser His Lys Phe Glu Gln Lys
450 455 460
Arg Gly His Val Thr Ser Ala Val Glu Cys Tyr Met Lys Gln Tyr Ala
465 470 475 480
Val Thr Glu Glu Gln Val Tyr Ser Glu Phe Lys Lys Gln Ile Glu Asn
485 490 495
Ala Trp Leu Asp Ile Asn Gln Glu Cys Leu Lys Pro Thr Ala Val Pro
500 505 510
Met Pro Leu Leu Ala Arg Leu Leu Asn Phe Thr Arg Thr Met Asp Val
515 520 525
Ile Tyr Lys Glu Glu Asp Ser Tyr Thr His Val Gly Lys Thr Met Arg
530 535 540
Asp Asn Ile Ala Ser Val Phe Ile Asn Pro Val Ile
545 550 555
<210> 10
<211> 1671
<212> DNA
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜合成酶
<400> 10
atggcaaccc aggtgagtgc aagcagcctg gcccaaatcc ctcagccgaa aaaccgcccg 60
gttgcaaact tccaccctaa tatctggggc gatcagttca tcacctatac cccggaagat 120
aaagtgacaa gggccaaaaa agaggagcag atcgaggacc tgaaaaaaga ggtgaagcgc 180
aagctgaccg cagccgcagt ggcaaacccg agccaactgt taaacttcat cgatgccgtg 240
cagcgcctgg gcgttgccta tcacttcgag caggaaatcg aagaagccct acagcacatc 300
tgtaacagct tccacgattg taacgacatg gatggcgact tatacaacat agcattaggt 360
ttccgcttac tgcgtcagca gggctacacc ataagctgcg acatctttaa caagtttacc 420
gacgagcgcg gtcgttttaa agaggcgctg attagcgacg ttcgcggcat gttaggtctg 480
tacgaagccg cacatctgcg cgtgcacggc gaagacattc tggcgaaggc gctggcattc 540
acaaccacac acctgaaggc aatggtggaa agtctgggct accacttagc cgagcaggtt 600
gcccatgcac tgaatcgccc gattcgtaag ggcctggaac gcctggaagc ccgctggtac 660
atcagtgttt atcaggatga agcctttcat gataagaccc tgctggagct ggcaaagctg 720
gatttcaacc tggttcagag cctgcataag gaagagctga gcaacctggc ccgttggtgg 780
aaggagctgg atttcgcaac caagctgccg ttcgccaggg acaggttagt tgaaggctac 840
ttctggatga tgggcgttta cttcgagccg caatacctgc gtggccgccg catcctgacg 900
aaggtgatcg ccctgaccag cattctggac gacatccacg atgcgtacgg gacccctgag 960
gagttaaagc tgttcatcga ggcaatcgag aagtgggatg agagtagcat caaccaactg 1020
ccggagtata tgaaactgtg ctatgtggcc ctgctggatg tttacaatga gattgaagag 1080
gagatggaga aagaggggaa ccagtaccgc atccactacc tgaaagaggt gatgaagaat 1140
caggtgcgcg catacttcgc agaggccaaa tggctgcatg atgagcatgt tcctgccttc 1200
gaggagtaca tgcgcgtggc attagccagc agtggttact gtctgttagc cgttacgagc 1260
ttcgtgggta tgggcgagat cgtgaccaaa gaggcattcg actgggtgac gagcgacccg 1320
cgtattatga gcagcagcaa caccatcacc cgtctgatgg acgacatcaa gagccacaag 1380
ttcgagcaga aacgcggtca cgttaccagc gccgtggagt gctacatgaa gcagtacgca 1440
gtgacagagg agcaagtgta cagcgagttc aagaaacaaa tcgagaacgc ctggctggac 1500
atcaaccaag agtgcctgaa accgaccgca gtgccgatgc ctctgttagc ccgtctgctg 1560
aatttcacac gcacgatgga cgttatctac aaggaggaag atagctacac ccacgttggt 1620
aagaccatgc gcgacaacat cgccagtgtg ttcatcaacc cggtgatcta a 1671
<210> 11
<211> 556
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜合成酶
<400> 11
Met Ala Thr Gln Val Ser Ala Ser Ser Leu Ala Gln Ile Pro Gln Pro
1 5 10 15
Lys Asn Arg Pro Val Ala Asn Phe His Pro Asn Ile Trp Gly Asp Gln
20 25 30
Phe Ile Thr Tyr Thr Pro Glu Asp Lys Val Thr Arg Ala Lys Lys Glu
35 40 45
Glu Gln Ile Glu Asp Leu Lys Lys Glu Val Lys Arg Lys Leu Thr Ala
50 55 60
Ala Ala Val Ala Asn Pro Ser Gln Leu Leu Asn Phe Ile Asp Ala Val
65 70 75 80
Gln Arg Leu Gly Val Ala Tyr His Phe Glu Gln Glu Ile Glu Glu Ala
85 90 95
Leu Gln His Ile Cys Asn Ser Phe His Asp Cys Asn Asp Met Asp Gly
100 105 110
Asp Leu Tyr Asn Ile Ala Leu Gly Phe Arg Leu Leu Arg Gln Gln Gly
115 120 125
Tyr Thr Ile Ser Cys Asp Ile Phe Asn Lys Phe Thr Asp Glu Arg Gly
130 135 140
Arg Phe Lys Glu Ala Leu Ile Ser Asp Val Arg Gly Met Leu Gly Leu
145 150 155 160
Tyr Glu Ala Ala His Leu Arg Val His Gly Glu Asp Ile Leu Ala Lys
165 170 175
Ala Leu Ala Phe Thr Thr Thr His Leu Lys Ala Met Val Glu Ser Leu
180 185 190
Gly Tyr His Leu Ala Glu Gln Val Ala His Ala Leu Asn Arg Pro Ile
195 200 205
Arg Lys Gly Leu Glu Arg Leu Glu Ala Arg Trp Tyr Ile Ser Val Tyr
210 215 220
Gln Asp Glu Ala Phe His Asp Lys Thr Leu Leu Glu Leu Ala Lys Leu
225 230 235 240
Asp Phe Asn Leu Val Gln Ser Leu His Lys Glu Glu Leu Ser Asn Leu
245 250 255
Ala Arg Trp Trp Lys Glu Leu Asp Phe Ala Thr Lys Leu Pro Phe Ala
260 265 270
Arg Asp Arg Leu Val Glu Gly Tyr Phe Trp Met Met Gly Val Tyr Phe
275 280 285
Glu Pro Gln Tyr Leu Arg Gly Arg Arg Ile Leu Thr Lys Val Ile Ala
290 295 300
Leu Thr Ser Ile Leu Asp Asp Ile His Asp Ala Tyr Gly Thr Pro Glu
305 310 315 320
Glu Leu Lys Leu Phe Ile Glu Ala Ile Glu Lys Trp Asp Glu Ser Ser
325 330 335
Ile Asn Gln Leu Pro Glu Tyr Met Lys Leu Cys Tyr Val Ala Leu Leu
340 345 350
Asp Val Tyr Asn Glu Ile Glu Glu Glu Met Glu Lys Glu Gly Asn Gln
355 360 365
Tyr Arg Ile His Tyr Leu Lys Glu Val Met Lys Asn Gln Val Arg Ala
370 375 380
Tyr Phe Ala Glu Ala Lys Trp Leu His Asp Glu His Val Pro Ala Phe
385 390 395 400
Glu Glu Tyr Met Arg Val Ala Leu Ala Ser Ser Gly Tyr Cys Leu Leu
405 410 415
Ala Val Thr Ser Phe Val Gly Met Gly Glu Ile Val Thr Lys Glu Ala
420 425 430
Phe Asp Trp Val Thr Ser Asp Pro Arg Ile Met Ser Ser Ser Asn Thr
435 440 445
Ile Thr Arg Leu Met Asp Asp Ile Lys Ser His Lys Phe Glu Gln Lys
450 455 460
Arg Gly His Val Thr Ser Ala Val Glu Cys Tyr Met Lys Gln Tyr Ala
465 470 475 480
Val Thr Glu Glu Gln Val Tyr Ser Glu Phe Lys Lys Gln Ile Glu Asn
485 490 495
Ala Trp Leu Asp Ile Asn Gln Glu Cys Leu Lys Pro Thr Ala Val Pro
500 505 510
Met Pro Leu Leu Ala Arg Leu Leu Asn Phe Thr Arg Thr Met Asp Val
515 520 525
Ile Tyr Lys Glu Glu Asp Ser Tyr Thr His Val Gly Lys Thr Met Arg
530 535 540
Asp Asn Ile Ala Ser Val Phe Ile Asn Pro Val Ile
545 550 555
<210> 12
<211> 548
<212> PRT
<213> 甜橙
<400> 12
Met Ala Ser Gly Glu Thr Phe Arg Pro Thr Ala Asp Phe His Pro Ser
1 5 10 15
Leu Trp Arg Asn His Phe Leu Lys Gly Ala Ser Asp Phe Lys Thr Val
20 25 30
Asp His Thr Ala Thr Gln Glu Arg His Glu Ala Leu Lys Glu Glu Val
35 40 45
Arg Arg Met Ile Thr Asp Ala Glu Asp Lys Pro Val Gln Lys Leu Arg
50 55 60
Leu Ile Asp Glu Val Gln Arg Leu Gly Val Ala Tyr His Phe Glu Lys
65 70 75 80
Glu Ile Gly Asp Ala Ile Gln Lys Leu Cys Pro Ile Tyr Ile Asp Ser
85 90 95
Asn Arg Ala Asp Leu His Thr Val Ser Leu His Phe Arg Leu Leu Arg
100 105 110
Gln Gln Gly Ile Lys Ile Ser Cys Asp Val Phe Glu Lys Phe Lys Asp
115 120 125
Asp Glu Gly Arg Phe Lys Ser Ser Leu Ile Asn Asp Val Gln Gly Met
130 135 140
Leu Ser Leu Tyr Glu Ala Ala Tyr Met Ala Val Arg Gly Glu His Ile
145 150 155 160
Leu Asp Glu Ala Ile Ala Phe Thr Thr Thr His Leu Lys Ser Leu Val
165 170 175
Ala Gln Asp His Val Thr Pro Lys Leu Ala Glu Gln Ile Asn His Ala
180 185 190
Leu Tyr Arg Pro Leu Arg Lys Thr Leu Pro Arg Leu Glu Ala Arg Tyr
195 200 205
Phe Met Ser Met Ile Asn Ser Thr Ser Asp His Leu Cys Asn Lys Thr
210 215 220
Leu Leu Asn Phe Ala Lys Leu Asp Phe Asn Ile Leu Leu Glu Leu His
225 230 235 240
Lys Glu Glu Leu Asn Glu Leu Thr Lys Trp Trp Lys Asp Leu Asp Phe
245 250 255
Thr Thr Lys Leu Pro Tyr Ala Arg Asp Arg Leu Val Glu Leu Tyr Phe
260 265 270
Trp Asp Leu Gly Thr Tyr Phe Glu Pro Gln Tyr Ala Phe Gly Arg Lys
275 280 285
Ile Met Thr Gln Leu Asn Tyr Ile Leu Ser Ile Ile Asp Asp Thr Tyr
290 295 300
Asp Ala Tyr Gly Thr Leu Glu Glu Leu Ser Leu Phe Thr Glu Ala Val
305 310 315 320
Gln Arg Trp Asn Ile Glu Ala Val Asp Met Leu Pro Glu Tyr Met Lys
325 330 335
Leu Ile Tyr Arg Thr Leu Leu Asp Ala Phe Asn Glu Ile Glu Glu Asp
340 345 350
Met Ala Lys Gln Gly Arg Ser His Cys Val Arg Tyr Ala Lys Glu Glu
355 360 365
Asn Gln Lys Val Ile Gly Ala Tyr Ser Val Gln Ala Lys Trp Phe Ser
370 375 380
Glu Gly Tyr Val Pro Thr Ile Glu Glu Tyr Met Pro Ile Ala Leu Thr
385 390 395 400
Ser Cys Ala Tyr Thr Phe Val Ile Thr Asn Ser Phe Leu Gly Met Gly
405 410 415
Asp Phe Ala Thr Lys Glu Val Phe Glu Trp Ile Ser Asn Asn Pro Lys
420 425 430
Val Val Lys Ala Ala Ser Val Ile Cys Arg Leu Met Asp Asp Met Gln
435 440 445
Gly His Glu Phe Glu Gln Lys Arg Gly His Val Ala Ser Ala Ile Glu
450 455 460
Cys Tyr Thr Lys Gln His Gly Val Ser Lys Glu Glu Ala Ile Lys Met
465 470 475 480
Phe Glu Glu Glu Val Ala Asn Ala Trp Lys Asp Ile Asn Glu Glu Leu
485 490 495
Met Met Lys Pro Thr Val Val Ala Arg Pro Leu Leu Gly Thr Ile Leu
500 505 510
Asn Leu Ala Arg Ala Ile Asp Phe Ile Tyr Lys Glu Asp Asp Gly Tyr
515 520 525
Thr His Ser Tyr Leu Ile Lys Asp Gln Ile Ala Ser Val Leu Gly Asp
530 535 540
His Val Pro Phe
545
<210> 13
<211> 513
<212> PRT
<213> 红球姜
<400> 13
Met Glu Ala Ile Ser Leu Phe Ser Pro Phe Phe Phe Ile Thr Leu Phe
1 5 10 15
Leu Gly Phe Phe Ile Thr Leu Leu Ile Lys Arg Ser Ser Arg Ser Ser
20 25 30
Val His Lys Gln Gln Val Leu Leu Ala Ser Leu Pro Pro Ser Pro Pro
35 40 45
Arg Leu Pro Leu Ile Gly Asn Ile His Gln Leu Val Gly Gly Asn Pro
50 55 60
His Arg Ile Leu Leu Gln Leu Ala Arg Thr His Gly Pro Leu Ile Cys
65 70 75 80
Leu Arg Leu Gly Gln Val Asp Gln Val Val Ala Ser Ser Val Glu Ala
85 90 95
Val Glu Glu Ile Ile Lys Arg His Asp Leu Lys Phe Ala Asp Arg Pro
100 105 110
Arg Asp Leu Thr Phe Ser Arg Ile Phe Phe Tyr Asp Gly Asn Ala Val
115 120 125
Val Met Thr Pro Tyr Gly Gly Glu Trp Lys Gln Met Arg Lys Ile Tyr
130 135 140
Ala Met Glu Leu Leu Asn Ser Arg Arg Val Lys Ser Phe Ala Ala Ile
145 150 155 160
Arg Glu Asp Val Ala Arg Lys Leu Thr Gly Glu Ile Ala His Lys Ala
165 170 175
Phe Ala Gln Thr Pro Val Ile Asn Leu Ser Glu Met Val Met Ser Met
180 185 190
Ile Asn Ala Ile Val Ile Arg Val Ala Phe Gly Asp Lys Cys Lys Gln
195 200 205
Gln Ala Tyr Phe Leu His Leu Val Lys Glu Ala Met Ser Tyr Val Ser
210 215 220
Ser Phe Ser Val Ala Asp Met Tyr Pro Ser Leu Lys Phe Leu Asp Thr
225 230 235 240
Leu Thr Gly Leu Lys Ser Lys Leu Glu Gly Val His Gly Lys Leu Asp
245 250 255
Lys Val Phe Asp Glu Ile Ile Ala Gln Arg Gln Ala Ala Leu Ala Ala
260 265 270
Glu Gln Ala Glu Glu Asp Leu Ile Ile Asp Val Leu Leu Lys Leu Lys
275 280 285
Asp Glu Gly Asn Gln Glu Phe Pro Ile Thr Tyr Thr Ser Val Lys Ala
290 295 300
Ile Val Met Glu Ile Phe Leu Ala Gly Thr Glu Thr Ser Ser Ser Val
305 310 315 320
Ile Asp Trp Val Met Ser Glu Leu Ile Lys Asn Pro Lys Ala Met Glu
325 330 335
Lys Val Gln Lys Glu Met Arg Glu Ala Met Gln Gly Lys Thr Lys Leu
340 345 350
Glu Glu Ser Asp Ile Pro Lys Phe Ser Tyr Leu Asn Leu Val Ile Lys
355 360 365
Glu Thr Leu Arg Leu His Pro Pro Gly Pro Leu Leu Phe Pro Arg Glu
370 375 380
Cys Arg Glu Thr Cys Glu Val Met Gly Tyr Arg Val Pro Ala Gly Ala
385 390 395 400
Arg Leu Leu Ile Asn Ala Phe Ala Leu Ser Arg Asp Glu Lys Tyr Trp
405 410 415
Gly Ser Asp Ala Glu Ser Phe Lys Pro Glu Arg Phe Glu Gly Ile Ser
420 425 430
Val Asp Phe Lys Gly Ser Asn Phe Glu Phe Met Pro Phe Gly Ala Gly
435 440 445
Arg Arg Ile Cys Pro Gly Met Thr Phe Gly Ile Ser Ser Val Glu Val
450 455 460
Ala Leu Ala His Leu Leu Phe His Phe Asp Trp Gln Leu Pro Gln Gly
465 470 475 480
Met Lys Ile Glu Asp Leu Asp Met Met Glu Val Ser Gly Met Ser Ala
485 490 495
Thr Arg Arg Ser Pro Leu Leu Val Leu Ala Lys Leu Ile Ile Pro Leu
500 505 510
Pro
<210> 14
<211> 512
<212> PRT
<213> 红球姜
<400> 14
Met Ala Leu Leu Leu Ala Val Phe Phe Phe Phe Ile Thr Leu Phe Leu
1 5 10 15
Gly Phe Phe Ile Thr Leu Leu Ile Lys Arg Ser Ser Arg Ser Ser Val
20 25 30
His Lys Gln Gln Val Leu Leu Ala Ser Leu Pro Pro Ser Pro Pro Arg
35 40 45
Leu Pro Leu Ile Gly Asn Ile His Gln Leu Val Gly Gly Asn Pro His
50 55 60
Arg Ile Leu Leu Gln Leu Ala Arg Thr His Gly Pro Leu Ile Cys Leu
65 70 75 80
Arg Leu Gly Gln Val Asp Gln Val Val Ala Ser Ser Val Glu Ala Val
85 90 95
Glu Glu Ile Ile Lys Arg His Asp Leu Lys Phe Ala Asp Arg Pro Arg
100 105 110
Asp Leu Thr Phe Ser Arg Ile Phe Phe Tyr Asp Gly Asn Ala Val Val
115 120 125
Met Thr Pro Tyr Gly Gly Glu Trp Lys Gln Met Arg Lys Ile Tyr Ala
130 135 140
Met Glu Leu Leu Asn Ser Arg Arg Val Lys Ser Phe Ala Ala Ile Arg
145 150 155 160
Glu Asp Val Ala Arg Lys Leu Thr Gly Glu Ile Ala His Lys Ala Phe
165 170 175
Ala Gln Thr Pro Val Ile Asn Leu Ser Glu Met Val Met Ser Met Ile
180 185 190
Asn Ala Ile Val Ile Arg Val Ala Phe Gly Asp Lys Cys Lys Gln Gln
195 200 205
Ala Tyr Phe Leu His Leu Val Lys Glu Ala Met Ser Tyr Val Ser Ser
210 215 220
Phe Ser Val Ala Asp Met Tyr Pro Ser Leu Lys Phe Leu Asp Thr Leu
225 230 235 240
Thr Gly Leu Lys Ser Lys Leu Glu Gly Val His Gly Lys Leu Asp Lys
245 250 255
Val Phe Asp Glu Ile Ile Ala Gln Arg Gln Ala Ala Leu Ala Ala Glu
260 265 270
Gln Ala Glu Glu Asp Leu Ile Ile Asp Val Leu Leu Lys Leu Lys Asp
275 280 285
Glu Gly Asn Gln Glu Phe Pro Ile Thr Tyr Thr Ser Val Lys Ala Ile
290 295 300
Val Met Glu Ile Phe Leu Ala Gly Thr Glu Thr Ser Ser Ser Val Ile
305 310 315 320
Asp Trp Val Met Ser Glu Leu Ile Lys Asn Pro Lys Ala Met Glu Lys
325 330 335
Val Gln Lys Glu Met Arg Glu Ala Met Gln Gly Lys Thr Lys Leu Glu
340 345 350
Glu Ser Asp Ile Pro Lys Phe Ser Tyr Leu Asn Leu Val Ile Lys Glu
355 360 365
Thr Leu Arg Leu His Pro Pro Gly Pro Leu Leu Phe Pro Arg Glu Cys
370 375 380
Arg Glu Thr Cys Glu Val Met Gly Tyr Arg Val Pro Ala Gly Ala Arg
385 390 395 400
Leu Leu Ile Asn Ala Phe Ala Leu Ser Arg Asp Glu Lys Tyr Trp Gly
405 410 415
Ser Asp Ala Glu Ser Phe Lys Pro Glu Arg Phe Glu Gly Ile Ser Val
420 425 430
Asp Phe Lys Gly Ser Asn Phe Glu Phe Met Pro Phe Gly Ala Gly Arg
435 440 445
Arg Ile Cys Pro Gly Met Thr Phe Gly Ile Ser Ser Val Glu Val Ala
450 455 460
Leu Ala His Leu Leu Phe His Phe Asp Trp Gln Leu Pro Gln Gly Met
465 470 475 480
Lys Ile Glu Asp Leu Asp Met Met Glu Val Ser Gly Met Ser Ala Thr
485 490 495
Arg Arg Ser Pro Leu Leu Val Leu Ala Lys Leu Ile Ile Pro Leu Pro
500 505 510
<210> 15
<211> 1536
<212> DNA
<213> 红球姜
<400> 15
atggctctgt tattagcagt gttcttcttt ttcattacgc tgtttctggg tttctttatt 60
acgctgctga ttaaacgctc gtcccgtagc tctgtccata aacagcaagt gctgctggcc 120
tctctgccgc cgagtccgcc gcgcctgccg ctgattggca acatccatca actggtgggc 180
ggcaacccgc atcgtattct gctgcaactg gcgcgtaccc acggcccgct gatctgcctg 240
cgtctgggtc aggttgatca agtggttgca agttccgtgg aagctgttga agaaattatc 300
aaacgtcacg acctgaaatt tgcagatcgt ccgcgcgacc tgacctttag ccgtattttc 360
ttttatgatg gtaacgctgt cgtgatgacg ccgtacggcg gtgaatggaa acagatgcgt 420
aaaatctatg caatggaact gctgaacagc cgtcgtgtga aatcttttgc ggccattcgt 480
gaagacgttg cacgcaaact gaccggcgaa atcgctcaca aagcattcgc tcagacgccg 540
gtcattaacc tgtctgaaat ggtgatgagt atgatcaatg cgattgtcat ccgcgtggcc 600
tttggtgata aatgtaaaca gcaagcatac ttcctgcatc tggtgaaaga agctatgtcc 660
tatgtttcat cgttttcagt cgcggatatg tacccgtccc tgaaattcct ggacaccctg 720
acgggcctga aaagcaaact ggaaggcgtt cacggtaaac tggataaagt cttcgacgaa 780
atcatcgcac agcgtcaagc agcgctggcg gcggaacagg ctgaagaaga tctgattatc 840
gacgtgctgc tgaaactgaa agatgaaggc aaccaggaat ttccgattac ctatacgtca 900
gttaaagcga ttgtcatgga aatcttcctg gccggcaccg aaaccagcag cagcgtgatt 960
gactgggtta tgagtgaact gatcaaaaac ccgaaagcga tggaaaaagt gcagaaagaa 1020
atgcgtgaag ccatgcaagg caaaaccaaa ctggaagaat cggatattcc gaaatttagc 1080
tacctgaatc tggttatcaa agaaaccctg cgtctgcatc cgccgggtcc gctgctgttc 1140
ccgcgtgaat gccgcgaaac ctgcgaagtg atgggctatc gtgttccggc gggtgcccgc 1200
ctgctgatta acgcatttgc tctgtctcgt gatgaaaaat actggggttc cgacgccgaa 1260
tcatttaaac cggaacgctt tgaaggcatc tctgtggatt tcaaaggtag taattttgaa 1320
tttatgccgt tcggcgcggg ccgtcgtatt tgtccgggca tgacctttgg tatctcctca 1380
gttgaagtcg cgctggccca tctgctgttt cacttcgatt ggcaactgcc gcaaggcatg 1440
aaaattgaag atctggacat gatggaagtc tcgggtatga gcgcaacccg tcgtagcccg 1500
ctgctggttc tggccaaact gattatcccg ctgccg 1536
<210> 16
<211> 496
<212> PRT
<213> 巴伐尼亚棘
<400> 16
Met Glu Leu Thr Leu Thr Thr Ser Leu Gly Leu Ala Val Phe Val Phe
1 5 10 15
Ile Leu Phe Lys Leu Leu Thr Gly Ser Lys Ser Thr Lys Asn Ser Leu
20 25 30
Pro Glu Ala Trp Arg Leu Pro Ile Ile Gly His Met His His Leu Val
35 40 45
Gly Thr Leu Pro His Arg Gly Val Thr Asp Met Ala Arg Lys Tyr Gly
50 55 60
Ser Leu Met His Leu Gln Leu Gly Glu Val Ser Thr Ile Val Val Ser
65 70 75 80
Ser Pro Arg Trp Ala Lys Glu Val Leu Thr Thr Tyr Asp Ile Thr Phe
85 90 95
Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu Ile Val Ala Tyr His Asn
100 105 110
Thr Asp Ile Val Leu Ser Pro Tyr Gly Glu Tyr Trp Arg Gln Leu Arg
115 120 125
Lys Leu Cys Thr Leu Glu Leu Leu Ser Ala Lys Lys Val Lys Ser Phe
130 135 140
Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn Leu Val Lys Glu Val Arg
145 150 155 160
Ser Ser Gly Ser Gly Ser Pro Val Asp Leu Ser Glu Ser Ile Phe Lys
165 170 175
Leu Ile Ala Thr Ile Leu Ser Arg Ala Ala Phe Gly Lys Gly Ile Lys
180 185 190
Asp Gln Arg Glu Phe Thr Glu Ile Val Lys Glu Ile Leu Arg Leu Thr
195 200 205
Gly Gly Phe Asp Val Ala Asp Ile Phe Pro Ser Lys Lys Ile Leu His
210 215 220
His Leu Ser Gly Lys Arg Ala Lys Leu Thr Asn Ile His Asn Lys Leu
225 230 235 240
Asp Ser Leu Ile Asn Asn Ile Val Ser Glu His Pro Gly Ser Arg Thr
245 250 255
Ser Ser Ser Gln Glu Ser Leu Leu Asp Val Leu Leu Arg Leu Lys Asp
260 265 270
Ser Ala Glu Leu Pro Leu Thr Ser Asp Asn Val Lys Ala Val Ile Leu
275 280 285
Asp Met Phe Gly Ala Gly Thr Asp Thr Ser Ser Ala Thr Ile Glu Trp
290 295 300
Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg Ala Met Glu Lys Val Gln
305 310 315 320
Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys Glu Arg Ile Gln Glu Glu
325 330 335
Asp Ile Gln Glu Leu Ser Tyr Leu Lys Leu Val Ile Lys Glu Thr Leu
340 345 350
Arg Leu His Pro Pro Leu Pro Leu Val Met Pro Arg Glu Cys Arg Glu
355 360 365
Pro Cys Val Leu Ala Gly Tyr Glu Ile Pro Thr Lys Thr Lys Leu Ile
370 375 380
Val Asn Val Phe Ala Ile Asn Arg Asp Pro Glu Tyr Trp Lys Asp Ala
385 390 395 400
Glu Thr Phe Met Pro Glu Arg Phe Glu Asn Ser Pro Ile Asn Ile Met
405 410 415
Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly Ala Gly Arg Arg Met Cys
420 425 430
Pro Gly Ala Ala Leu Gly Leu Ala Asn Val Glu Leu Pro Leu Ala His
435 440 445
Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro Asn Gly Ala Arg Leu Asp
450 455 460
Glu Leu Asp Met Ser Glu Cys Phe Gly Ala Thr Val Gln Arg Lys Ser
465 470 475 480
Glu Leu Leu Leu Val Pro Thr Ala Tyr Lys Thr Ala Asn Asn Ser Ala
485 490 495
<210> 17
<211> 496
<212> PRT
<213> 巴伐尼亚棘
<400> 17
Met Ala Leu Leu Leu Ala Val Phe Leu Gly Leu Ala Val Phe Val Phe
1 5 10 15
Ile Leu Phe Lys Leu Leu Thr Gly Ser Lys Ser Thr Lys Asn Ser Leu
20 25 30
Pro Glu Ala Trp Arg Leu Pro Ile Ile Gly His Met His His Leu Val
35 40 45
Gly Thr Leu Pro His Arg Gly Val Thr Asp Met Ala Arg Lys Tyr Gly
50 55 60
Ser Leu Met His Leu Gln Leu Gly Glu Val Ser Thr Ile Val Val Ser
65 70 75 80
Ser Pro Arg Trp Ala Lys Glu Val Leu Thr Thr Tyr Asp Ile Thr Phe
85 90 95
Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu Ile Val Ala Tyr His Asn
100 105 110
Thr Asp Ile Val Leu Ser Pro Tyr Gly Glu Tyr Trp Arg Gln Leu Arg
115 120 125
Lys Leu Cys Thr Leu Glu Leu Leu Ser Ala Lys Lys Val Lys Ser Phe
130 135 140
Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn Leu Val Lys Glu Val Arg
145 150 155 160
Ser Ser Gly Ser Gly Ser Pro Val Asp Leu Ser Glu Ser Ile Phe Lys
165 170 175
Leu Ile Ala Thr Ile Leu Ser Arg Ala Ala Phe Gly Lys Gly Ile Lys
180 185 190
Asp Gln Arg Glu Phe Thr Glu Ile Val Lys Glu Ile Leu Arg Leu Thr
195 200 205
Gly Gly Phe Asp Val Ala Asp Ile Phe Pro Ser Lys Lys Ile Leu His
210 215 220
His Leu Ser Gly Lys Arg Ala Lys Leu Thr Asn Ile His Asn Lys Leu
225 230 235 240
Asp Ser Leu Ile Asn Asn Ile Val Ser Glu His Pro Gly Ser Arg Thr
245 250 255
Ser Ser Ser Gln Glu Ser Leu Leu Asp Val Leu Leu Arg Leu Lys Asp
260 265 270
Ser Ala Glu Leu Pro Leu Thr Ser Asp Asn Val Lys Ala Val Ile Leu
275 280 285
Asp Met Phe Gly Ala Gly Thr Asp Thr Ser Ser Ala Thr Ile Glu Trp
290 295 300
Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg Ala Met Glu Lys Val Gln
305 310 315 320
Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys Glu Arg Ile Gln Glu Glu
325 330 335
Asp Ile Gln Glu Leu Ser Tyr Leu Lys Leu Val Ile Lys Glu Thr Leu
340 345 350
Arg Leu His Pro Pro Leu Pro Leu Val Met Pro Arg Glu Cys Arg Glu
355 360 365
Pro Cys Val Leu Ala Gly Tyr Glu Ile Pro Thr Lys Thr Lys Leu Ile
370 375 380
Val Asn Val Phe Ala Ile Asn Arg Asp Pro Glu Tyr Trp Lys Asp Ala
385 390 395 400
Glu Thr Phe Met Pro Glu Arg Phe Glu Asn Ser Pro Ile Asn Ile Met
405 410 415
Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly Ala Gly Arg Arg Met Cys
420 425 430
Pro Gly Ala Ala Leu Gly Leu Ala Asn Val Glu Leu Pro Leu Ala His
435 440 445
Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro Asn Gly Ala Arg Leu Asp
450 455 460
Glu Leu Asp Met Ser Glu Cys Phe Gly Ala Thr Val Gln Arg Lys Ser
465 470 475 480
Glu Leu Leu Leu Val Pro Thr Ala Tyr Lys Thr Ala Asn Asn Ser Ala
485 490 495
<210> 18
<211> 1488
<212> DNA
<213> 巴伐尼亚棘
<400> 18
atggctctgt tattagcagt tttcctgggc ctggctgtct tcgtctttat cctgttcaaa 60
ctgctgaccg gctcaaaatc aaccaaaaat tcactgccgg aagcatggcg tctgccgatc 120
attggccaca tgcatcacct ggttggcacg ctgccgcatc gcggtgtgac cgacatggcg 180
cgtaaatacg gcagcctgat gcatctgcaa ctgggcgaag tgagcaccat tgtcgtctca 240
tcgccgcgtt gggcaaaaga agtgctgacg acgtatgata ttacctttgc gaatcgcccg 300
gaaaccctga ccggcgaaat tgttgcgtac cacaacacgg atattgtgct gtcaccgtat 360
ggcgaatact ggcgccaact gcgtaaactg tgcacgctgg aactgctgag cgccaaaaaa 420
gtgaaaagtt ttcagtcgct gcgtgaagaa gaatgctgga atctggtgaa agaagtgcgt 480
tcgagcggct caggttcccc ggtcgatctg tcggaatcca tctttaaact gattgcaacc 540
attctgagcc gcgcagcgtt tggcaaaggt atcaaagatc agcgtgaatt taccgaaatt 600
gtgaaagaaa tcctgcgcct gacgggcggt tttgatgtgg cggatatttt cccgtccaaa 660
aagatcctgc accacctgag cggcaaacgt gcgaaactga ccaacatcca caacaaactg 720
gattccctga ttaataacat tgtttctgaa catccgggtt cgcgtacctc gtcgagccag 780
gaaagcctgc tggatgtgct gctgcgcctg aaagattccg cggaactgcc gctgacctcg 840
gacaatgtta aagccgtgat cctggatatg ttcggtgcgg gcacggatac gtcgagcgcc 900
acgattgaat gggcgatcag cgaactgatc cgctgcccgc gtgcaatgga aaaagtgcaa 960
acggaactgc gtcaagcgct gaatggtaaa gaacgcattc aggaagaaga tattcaggaa 1020
ctgtcctatc tgaaactggt cattaaagaa accctgcgcc tgcatccgcc gctgccgctg 1080
gtgatgccgc gtgaatgtcg tgaaccgtgt gtcctggcgg gttacgaaat cccgaccaaa 1140
acgaaactga ttgtgaatgt ctttgccatc aatcgtgacc cggaatactg gaaagatgca 1200
gaaaccttca tgccggaacg ctttgaaaac agcccgatta acatcatggg tagtgaatat 1260
gaatacctgc cgtttggcgc aggccgccgt atgtgtccgg gtgcagctct gggtctggcg 1320
aatgtggaac tgccgctggc gcacatcctg tattatttta actggaaact gccgaatggc 1380
gctcgcctgg atgaactgga tatgtcggaa tgctttggcg cgacggtcca acgcaaaagc 1440
gaactgctgc tggtcccgac ggcatacaaa acggcaaaca actccgca 1488
<210> 19
<211> 502
<212> PRT
<213> 莨菪
<400> 19
Met Gln Phe Phe Ser Leu Val Ser Ile Phe Leu Phe Leu Ser Phe Leu
1 5 10 15
Phe Leu Leu Arg Lys Trp Lys Asn Ser Asn Ser Gln Ser Lys Lys Leu
20 25 30
Pro Pro Gly Pro Trp Lys Leu Pro Leu Leu Gly Ser Met Leu His Met
35 40 45
Val Gly Gly Leu Pro His His Val Leu Arg Asp Leu Ala Lys Lys Tyr
50 55 60
Gly Pro Leu Met His Leu Gln Leu Gly Glu Val Ser Ala Val Val Val
65 70 75 80
Thr Ser Pro Asp Met Ala Lys Glu Val Leu Lys Thr His Asp Ile Ala
85 90 95
Phe Ala Ser Arg Pro Lys Leu Leu Ala Pro Glu Ile Val Cys Tyr Asn
100 105 110
Arg Ser Asp Ile Ala Phe Cys Pro Tyr Gly Asp Tyr Trp Arg Gln Met
115 120 125
Arg Lys Ile Cys Val Leu Glu Val Leu Ser Ala Lys Asn Val Arg Ser
130 135 140
Phe Ser Ser Ile Arg Arg Asp Glu Val Leu Arg Leu Val Asn Phe Val
145 150 155 160
Arg Ser Ser Thr Ser Glu Pro Val Asn Phe Thr Glu Arg Leu Phe Leu
165 170 175
Phe Thr Ser Ser Met Thr Cys Arg Ser Ala Phe Gly Lys Val Phe Lys
180 185 190
Glu Gln Glu Thr Phe Ile Gln Leu Ile Lys Glu Val Ile Gly Leu Ala
195 200 205
Gly Gly Phe Asp Val Ala Asp Ile Phe Pro Ser Leu Lys Phe Leu His
210 215 220
Val Leu Thr Gly Met Glu Gly Lys Ile Met Lys Ala His His Lys Val
225 230 235 240
Asp Ala Ile Val Glu Asp Val Ile Asn Glu His Lys Lys Asn Leu Ala
245 250 255
Met Gly Lys Thr Asn Gly Ala Leu Gly Gly Glu Asp Leu Ile Asp Val
260 265 270
Leu Leu Arg Leu Met Asn Asp Gly Gly Leu Gln Phe Pro Ile Thr Asn
275 280 285
Asp Asn Ile Lys Ala Ile Ile Phe Asp Met Phe Ala Ala Gly Thr Glu
290 295 300
Thr Ser Ser Ser Thr Leu Val Trp Ala Met Val Gln Met Met Arg Asn
305 310 315 320
Pro Thr Ile Leu Ala Lys Ala Gln Ala Glu Val Arg Glu Ala Phe Lys
325 330 335
Gly Lys Glu Thr Phe Asp Glu Asn Asp Val Glu Glu Leu Lys Tyr Leu
340 345 350
Lys Leu Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Val Pro Leu
355 360 365
Leu Val Pro Arg Glu Cys Arg Glu Glu Thr Glu Ile Asn Gly Tyr Thr
370 375 380
Ile Pro Val Lys Thr Lys Val Met Val Asn Val Trp Ala Leu Gly Arg
385 390 395 400
Asp Pro Lys Tyr Trp Asp Asp Ala Asp Asn Phe Lys Pro Glu Arg Phe
405 410 415
Glu Gln Cys Ser Val Asp Phe Ile Gly Asn Asn Phe Glu Tyr Leu Pro
420 425 430
Phe Gly Gly Gly Arg Arg Ile Cys Pro Gly Ile Ser Phe Gly Leu Ala
435 440 445
Asn Val Tyr Leu Pro Leu Ala Gln Leu Leu Tyr His Phe Asp Trp Lys
450 455 460
Leu Pro Thr Gly Met Glu Pro Lys Asp Leu Asp Leu Thr Glu Leu Val
465 470 475 480
Gly Val Thr Ala Ala Arg Lys Ser Asp Leu Met Leu Val Ala Thr Pro
485 490 495
Tyr Gln Pro Ser Arg Glu
500
<210> 20
<211> 508
<212> PRT
<213> 莨菪
<400> 20
Met Ala Leu Leu Leu Ala Val Phe Phe Phe Ser Leu Val Ser Ile Phe
1 5 10 15
Leu Phe Leu Ser Phe Leu Phe Leu Leu Arg Lys Trp Lys Asn Ser Asn
20 25 30
Ser Gln Ser Lys Lys Leu Pro Pro Gly Pro Trp Lys Leu Pro Leu Leu
35 40 45
Gly Ser Met Leu His Met Val Gly Gly Leu Pro His His Val Leu Arg
50 55 60
Asp Leu Ala Lys Lys Tyr Gly Pro Leu Met His Leu Gln Leu Gly Glu
65 70 75 80
Val Ser Ala Val Val Val Thr Ser Pro Asp Met Ala Lys Glu Val Leu
85 90 95
Lys Thr His Asp Ile Ala Phe Ala Ser Arg Pro Lys Leu Leu Ala Pro
100 105 110
Glu Ile Val Cys Tyr Asn Arg Ser Asp Ile Ala Phe Cys Pro Tyr Gly
115 120 125
Asp Tyr Trp Arg Gln Met Arg Lys Ile Cys Val Leu Glu Val Leu Ser
130 135 140
Ala Lys Asn Val Arg Ser Phe Ser Ser Ile Arg Arg Asp Glu Val Leu
145 150 155 160
Arg Leu Val Asn Phe Val Arg Ser Ser Thr Ser Glu Pro Val Asn Phe
165 170 175
Thr Glu Arg Leu Phe Leu Phe Thr Ser Ser Met Thr Cys Arg Ser Ala
180 185 190
Phe Gly Lys Val Phe Lys Glu Gln Glu Thr Phe Ile Gln Leu Ile Lys
195 200 205
Glu Val Ile Gly Leu Ala Gly Gly Phe Asp Val Ala Asp Ile Phe Pro
210 215 220
Ser Leu Lys Phe Leu His Val Leu Thr Gly Met Glu Gly Lys Ile Met
225 230 235 240
Lys Ala His His Lys Val Asp Ala Ile Val Glu Asp Val Ile Asn Glu
245 250 255
His Lys Lys Asn Leu Ala Met Gly Lys Thr Asn Gly Ala Leu Gly Gly
260 265 270
Glu Asp Leu Ile Asp Val Leu Leu Arg Leu Met Asn Asp Gly Gly Leu
275 280 285
Gln Phe Pro Ile Thr Asn Asp Asn Ile Lys Ala Ile Ile Phe Asp Met
290 295 300
Phe Ala Ala Gly Thr Glu Thr Ser Ser Ser Thr Leu Val Trp Ala Met
305 310 315 320
Val Gln Met Met Arg Asn Pro Thr Ile Leu Ala Lys Ala Gln Ala Glu
325 330 335
Val Arg Glu Ala Phe Lys Gly Lys Glu Thr Phe Asp Glu Asn Asp Val
340 345 350
Glu Glu Leu Lys Tyr Leu Lys Leu Val Ile Lys Glu Thr Leu Arg Leu
355 360 365
His Pro Pro Val Pro Leu Leu Val Pro Arg Glu Cys Arg Glu Glu Thr
370 375 380
Glu Ile Asn Gly Tyr Thr Ile Pro Val Lys Thr Lys Val Met Val Asn
385 390 395 400
Val Trp Ala Leu Gly Arg Asp Pro Lys Tyr Trp Asp Asp Ala Asp Asn
405 410 415
Phe Lys Pro Glu Arg Phe Glu Gln Cys Ser Val Asp Phe Ile Gly Asn
420 425 430
Asn Phe Glu Tyr Leu Pro Phe Gly Gly Gly Arg Arg Ile Cys Pro Gly
435 440 445
Ile Ser Phe Gly Leu Ala Asn Val Tyr Leu Pro Leu Ala Gln Leu Leu
450 455 460
Tyr His Phe Asp Trp Lys Leu Pro Thr Gly Met Glu Pro Lys Asp Leu
465 470 475 480
Asp Leu Thr Glu Leu Val Gly Val Thr Ala Ala Arg Lys Ser Asp Leu
485 490 495
Met Leu Val Ala Thr Pro Tyr Gln Pro Ser Arg Glu
500 505
<210> 21
<211> 1524
<212> DNA
<213> 莨菪
<400> 21
atggctctgt tattagcagt tttcttcttc tccctggtct caatctttct gttcctgtcc 60
tttctgttcc tgctgcgtaa atggaaaaac tcaaactccc aatcgaaaaa actgccgccg 120
ggtccgtgga aactgccgct gctgggctct atgctgcaca tggttggcgg cctgccgcat 180
cacgttctgc gtgatctggc gaaaaaatat ggtccgctga tgcatctgca actgggcgaa 240
gtctccgccg tggttgtcac ctcaccggat atggcaaaag aagtgctgaa aacgcatgac 300
attgcgttcg cctcccgtcc gaaactgctg gccccggaaa ttgtgtgcta caaccgctca 360
gatattgcat tttgtccgta tggtgactac tggcgtcaaa tgcgcaaaat ttgcgtcctg 420
gaagtgctgt cggccaaaaa tgtgcgcagc tttagctcta ttcgtcgtga tgaagttctg 480
cgtctggtta acttcgtccg cagttccacc tcggagccgg tgaattttac ggaacgtctg 540
tttctgttca cctcatcgat gacctgccgt agcgcatttg gtaaagtttt caaagaacag 600
gaaaccttca ttcaactgat caaagaagtc attggcctgg ccggcggttt tgatgtggca 660
gacatctttc cgagtctgaa attcctgcat gttctgaccg gcatggaagg caaaattatg 720
aaagctcatc acaaagtcga tgcgattgtg gaagacgtta tcaacgaaca caagaaaaac 780
ctggcgatgg gcaaaacgaa cggcgcactg ggcggtgaag atctgatcga cgttctgctg 840
cgtctgatga atgatggcgg cctgcaattt ccgatcacca acgataatat caaagctatt 900
atctttgata tgtttgcggc gggcaccgaa accagcagca gcaccctggt gtgggcgatg 960
gtgcagatga tgcgtaaccc gacgattctg gcaaaagctc aagcggaagt gcgcgaagcc 1020
ttcaaaggca aagaaacctt tgatgaaaat gacgttgaag aactgaaata tctgaaactg 1080
gtcatcaaag aaacgctgcg tctgcatccg ccggttccgc tgctggtccc gcgtgaatgc 1140
cgcgaagaaa ccgaaattaa cggttatacc atcccggtta aaacgaaagt gatggttaat 1200
gtctgggctc tgggccgtga tccgaaatac tgggatgacg cggacaactt taaaccggaa 1260
cgctttgaac agtgctctgt ggatttcatc ggcaacaact ttgaatatct gccgtttggc 1320
ggtggccgtc gcatttgtcc gggtatcagc ttcggcctgg ctaatgttta tctgccgctg 1380
gcgcaactgc tgtaccactt tgattggaaa ctgccgaccg gcatggaacc gaaagatctg 1440
gacctgaccg aactggtggg cgttacggca gctcgtaaat ctgatctgat gctggttgcg 1500
accccgtacc agccgagccg tgaa 1524
<210> 22
<211> 488
<212> PRT
<213> 莴苣
<400> 22
Met Glu Leu Ser Ile Thr Thr Ser Ile Ala Leu Ala Thr Ile Val Phe
1 5 10 15
Phe Leu Tyr Lys Leu Ala Thr Arg Pro Lys Ser Thr Lys Lys Gln Leu
20 25 30
Pro Glu Ala Ser Arg Leu Pro Ile Ile Gly His Met His His Leu Ile
35 40 45
Gly Thr Met Pro His Arg Gly Val Met Asp Leu Ala Arg Lys His Gly
50 55 60
Ser Leu Met His Leu Gln Leu Gly Glu Val Ser Thr Ile Val Val Ser
65 70 75 80
Ser Pro Lys Trp Ala Lys Glu Ile Leu Thr Thr Tyr Asp Ile Thr Phe
85 90 95
Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu Ile Ile Ala Tyr His Asn
100 105 110
Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu Tyr Trp Arg Gln Leu Arg
115 120 125
Lys Leu Cys Thr Leu Glu Leu Leu Ser Val Lys Lys Val Lys Ser Phe
130 135 140
Gln Ser Ile Arg Glu Glu Glu Cys Trp Asn Leu Val Lys Glu Val Lys
145 150 155 160
Glu Ser Gly Ser Gly Lys Pro Ile Asn Leu Ser Glu Ser Ile Phe Thr
165 170 175
Met Ile Ala Thr Ile Leu Ser Arg Ala Ala Phe Gly Lys Gly Ile Lys
180 185 190
Asp Gln Arg Glu Phe Thr Glu Ile Val Lys Glu Ile Leu Arg Gln Thr
195 200 205
Gly Gly Phe Asp Val Ala Asp Ile Phe Pro Ser Lys Lys Phe Leu His
210 215 220
His Leu Ser Gly Lys Arg Ala Arg Leu Thr Ser Ile His Lys Lys Leu
225 230 235 240
Asp Asn Leu Ile Asn Asn Ile Val Ala Glu His His Val Ser Thr Ser
245 250 255
Ser Lys Ala Asn Glu Thr Leu Leu Asp Val Leu Leu Arg Leu Lys Asp
260 265 270
Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn Val Lys Ala Ile Ile Leu
275 280 285
Asp Met Phe Gly Ala Gly Thr Asp Thr Ser Ser Ala Thr Val Glu Trp
290 295 300
Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg Ala Met Glu Lys Val Gln
305 310 315 320
Ala Glu Leu Arg Gln Ala Leu Asn Gly Lys Glu Lys Ile Gln Glu Glu
325 330 335
Asp Ile Gln Asp Leu Ala Tyr Leu Asn Leu Val Ile Arg Glu Thr Leu
340 345 350
Arg Leu His Pro Pro Leu Pro Leu Val Met Pro Arg Glu Cys Arg Glu
355 360 365
Pro Val Asn Leu Ala Gly Tyr Glu Ile Ala Asn Lys Thr Lys Leu Ile
370 375 380
Val Asn Val Phe Ala Ile Asn Arg Asp Pro Glu Tyr Trp Lys Asp Ala
385 390 395 400
Glu Ala Phe Ile Pro Glu Arg Phe Glu Asn Asn Pro Asn Asn Ile Met
405 410 415
Gly Ala Asp Tyr Glu Tyr Leu Pro Phe Gly Ala Gly Arg Arg Met Cys
420 425 430
Pro Gly Ala Ala Leu Gly Leu Ala Asn Val Gln Leu Pro Leu Ala Asn
435 440 445
Ile Leu Tyr His Phe Asn Trp Lys Leu Pro Asn Gly Ala Ser His Asp
450 455 460
Gln Leu Asp Met Thr Glu Ser Phe Gly Ala Thr Val Gln Arg Lys Thr
465 470 475 480
Glu Leu Leu Leu Val Pro Ser Phe
485
<210> 23
<211> 488
<212> PRT
<213> 莴苣
<400> 23
Met Ala Leu Leu Leu Ala Val Phe Ile Ala Leu Ala Thr Ile Val Phe
1 5 10 15
Phe Leu Tyr Lys Leu Ala Thr Arg Pro Lys Ser Thr Lys Lys Gln Leu
20 25 30
Pro Glu Ala Ser Arg Leu Pro Ile Ile Gly His Met His His Leu Ile
35 40 45
Gly Thr Met Pro His Arg Gly Val Met Asp Leu Ala Arg Lys His Gly
50 55 60
Ser Leu Met His Leu Gln Leu Gly Glu Val Ser Thr Ile Val Val Ser
65 70 75 80
Ser Pro Lys Trp Ala Lys Glu Ile Leu Thr Thr Tyr Asp Ile Thr Phe
85 90 95
Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu Ile Ile Ala Tyr His Asn
100 105 110
Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu Tyr Trp Arg Gln Leu Arg
115 120 125
Lys Leu Cys Thr Leu Glu Leu Leu Ser Val Lys Lys Val Lys Ser Phe
130 135 140
Gln Ser Ile Arg Glu Glu Glu Cys Trp Asn Leu Val Lys Glu Val Lys
145 150 155 160
Glu Ser Gly Ser Gly Lys Pro Ile Asn Leu Ser Glu Ser Ile Phe Thr
165 170 175
Met Ile Ala Thr Ile Leu Ser Arg Ala Ala Phe Gly Lys Gly Ile Lys
180 185 190
Asp Gln Arg Glu Phe Thr Glu Ile Val Lys Glu Ile Leu Arg Gln Thr
195 200 205
Gly Gly Phe Asp Val Ala Asp Ile Phe Pro Ser Lys Lys Phe Leu His
210 215 220
His Leu Ser Gly Lys Arg Ala Arg Leu Thr Ser Ile His Lys Lys Leu
225 230 235 240
Asp Asn Leu Ile Asn Asn Ile Val Ala Glu His His Val Ser Thr Ser
245 250 255
Ser Lys Ala Asn Glu Thr Leu Leu Asp Val Leu Leu Arg Leu Lys Asp
260 265 270
Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn Val Lys Ala Ile Ile Leu
275 280 285
Asp Met Phe Gly Ala Gly Thr Asp Thr Ser Ser Ala Thr Val Glu Trp
290 295 300
Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg Ala Met Glu Lys Val Gln
305 310 315 320
Ala Glu Leu Arg Gln Ala Leu Asn Gly Lys Glu Lys Ile Gln Glu Glu
325 330 335
Asp Ile Gln Asp Leu Ala Tyr Leu Asn Leu Val Ile Arg Glu Thr Leu
340 345 350
Arg Leu His Pro Pro Leu Pro Leu Val Met Pro Arg Glu Cys Arg Glu
355 360 365
Pro Val Asn Leu Ala Gly Tyr Glu Ile Ala Asn Lys Thr Lys Leu Ile
370 375 380
Val Asn Val Phe Ala Ile Asn Arg Asp Pro Glu Tyr Trp Lys Asp Ala
385 390 395 400
Glu Ala Phe Ile Pro Glu Arg Phe Glu Asn Asn Pro Asn Asn Ile Met
405 410 415
Gly Ala Asp Tyr Glu Tyr Leu Pro Phe Gly Ala Gly Arg Arg Met Cys
420 425 430
Pro Gly Ala Ala Leu Gly Leu Ala Asn Val Gln Leu Pro Leu Ala Asn
435 440 445
Ile Leu Tyr His Phe Asn Trp Lys Leu Pro Asn Gly Ala Ser His Asp
450 455 460
Gln Leu Asp Met Thr Glu Ser Phe Gly Ala Thr Val Gln Arg Lys Thr
465 470 475 480
Glu Leu Leu Leu Val Pro Ser Phe
485
<210> 24
<211> 1464
<212> DNA
<213> 莴苣
<400> 24
atggctctgt tattagcagt tttcatcgca ctggctacca tcgtcttctt cctgtataaa 60
ctggcaacgc gcccgaaatc taccaaaaaa caactgccgg aagcgagccg tctgccgatt 120
atcggccaca tgcatcacct gattggcacc atgccgcacc gtggtgtcat ggatctggcc 180
cgcaaacatg gctcgctgat gcatctgcaa ctgggcgaag tgagcaccat tgtggttagc 240
tctccgaaat gggcaaaaga aattctgacc acctatgata ttacctttgc taaccgcccg 300
gaaaccctga cgggcgaaat tatcgcgtac cataatacgg acattgtgct ggccccgtat 360
ggtgaatact ggcgtcaact gcgtaaactg tgcaccctgg aactgctgtc cgttaaaaaa 420
gtcaaatcat ttcaatcgat tcgtgaagaa gaatgttgga acctggtgaa agaagttaaa 480
gaaagcggct ctggtaaacc gattaatctg agtgaatcca tcttcaccat gattgcgacg 540
atcctgagtc gtgcggcctt tggcaaaggt attaaagatc agcgcgaatt taccgaaatt 600
gtcaaagaaa tcctgcgtca aacgggcggt ttcgatgtgg cagacatttt tccgagcaaa 660
aaattcctgc atcacctgtc tggcaaacgt gctcgcctga ccagtatcca taaaaaactg 720
gataacctga tcaacaatat cgtcgcggaa catcatgtga gcaccagcag caaagcgaat 780
gaaacgctgc tggatgttct gctgcgcctg aaagacagtg ccgaatttcc gctgaccgca 840
gacaacgtca aagctattat cctggatatg ttcggtgcag gcaccgatac cagcagcgca 900
acggtggaat gggccattag cgaactgatc cgttgcccgc gcgcaatgga aaaagttcag 960
gcagaactgc gtcaagctct gaacggtaaa gaaaaaatcc aggaagaaga tattcaagac 1020
ctggcctatc tgaatctggt gattcgtgaa accctgcgtc tgcacccgcc gctgccgctg 1080
gttatgccgc gtgaatgccg tgagccggtg aacctggcgg gctatgaaat cgccaataaa 1140
accaaactga tcgtcaatgt gtttgcgatt aaccgtgacc cggaatactg gaaagacgcg 1200
gaagccttta tcccggaacg ttttgaaaac aatccgaaca atatcatggg tgcagattat 1260
gaatacctgc cgtttggcgc tggtcgtcgc atgtgtccgg gcgcagctct gggtctggca 1320
aacgttcaac tgccgctggc gaacattctg taccatttca actggaaact gccgaatggc 1380
gcgtcccacg atcaactgga catgaccgaa tcatttggtg ccaccgtgca acgtaaaacg 1440
gaactgctgc tggttccgag cttc 1464
<210> 25
<211> 504
<212> PRT
<213> 烟草(Nicotiani tabacum)
<400> 25
Met Gln Phe Phe Ser Leu Val Ser Ile Phe Leu Phe Leu Ser Phe Leu
1 5 10 15
Phe Leu Leu Arg Lys Trp Lys Asn Ser Asn Ser Gln Ser Lys Lys Leu
20 25 30
Pro Pro Gly Pro Trp Lys Ile Pro Ile Leu Gly Ser Met Leu His Met
35 40 45
Ile Gly Gly Glu Pro His His Val Leu Arg Asp Leu Ala Lys Lys Tyr
50 55 60
Gly Pro Leu Met His Leu Gln Leu Gly Glu Ile Ser Ala Val Val Val
65 70 75 80
Thr Ser Arg Asp Met Ala Lys Glu Val Leu Lys Thr His Asp Val Val
85 90 95
Phe Ala Ser Arg Pro Lys Ile Val Ala Met Asp Ile Ile Cys Tyr Asn
100 105 110
Gln Ser Asp Ile Ala Phe Ser Pro Tyr Gly Asp His Trp Arg Gln Met
115 120 125
Arg Lys Ile Cys Val Met Glu Leu Leu Asn Ala Lys Asn Val Arg Ser
130 135 140
Phe Ser Ser Ile Arg Arg Asp Glu Val Val Arg Leu Ile Asp Ser Ile
145 150 155 160
Arg Ser Asp Ser Ser Ser Gly Glu Leu Val Asn Phe Thr Gln Arg Ile
165 170 175
Ile Trp Phe Ala Ser Ser Met Thr Cys Arg Ser Ala Phe Gly Gln Val
180 185 190
Leu Lys Gly Gln Asp Ile Phe Ala Lys Lys Ile Arg Glu Val Ile Gly
195 200 205
Leu Ala Glu Gly Phe Asp Val Val Asp Ile Phe Pro Thr Tyr Lys Phe
210 215 220
Leu His Val Leu Ser Gly Met Lys Arg Lys Leu Leu Asn Ala His Leu
225 230 235 240
Lys Val Asp Ala Ile Val Glu Asp Val Ile Asn Glu His Lys Lys Asn
245 250 255
Leu Ala Ala Gly Lys Ser Asn Gly Ala Leu Gly Gly Glu Asp Leu Ile
260 265 270
Asp Val Leu Leu Arg Leu Met Asn Asp Thr Ser Leu Gln Phe Pro Ile
275 280 285
Thr Asn Asp Asn Ile Lys Ala Val Ile Val Asp Met Phe Ala Ala Gly
290 295 300
Thr Glu Thr Ser Ser Thr Thr Thr Val Trp Ala Met Ala Glu Met Met
305 310 315 320
Lys Asn Pro Ser Val Phe Thr Lys Ala Gln Ala Glu Val Arg Glu Ala
325 330 335
Phe Arg Asp Lys Val Ser Phe Asp Glu Asn Asp Val Glu Glu Leu Lys
340 345 350
Tyr Leu Lys Leu Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Ser
355 360 365
Pro Leu Leu Val Pro Arg Glu Cys Arg Glu Asp Thr Asp Ile Asn Gly
370 375 380
Tyr Thr Ile Pro Ala Lys Thr Lys Val Met Val Asn Val Trp Ala Leu
385 390 395 400
Gly Arg Asp Pro Lys Tyr Trp Asp Asp Ala Glu Ser Phe Lys Pro Glu
405 410 415
Arg Phe Glu Gln Cys Ser Val Asp Phe Phe Gly Asn Asn Phe Glu Phe
420 425 430
Leu Pro Phe Gly Gly Gly Arg Arg Ile Cys Pro Gly Met Ser Phe Gly
435 440 445
Leu Ala Asn Leu Tyr Leu Pro Leu Ala Gln Leu Leu Tyr His Phe Asp
450 455 460
Trp Lys Leu Pro Thr Gly Ile Met Pro Arg Asp Leu Asp Leu Thr Glu
465 470 475 480
Leu Ser Gly Ile Thr Ile Ala Arg Lys Gly Gly Leu Tyr Leu Asn Ala
485 490 495
Thr Pro Tyr Gln Pro Ser Arg Glu
500
<210> 26
<211> 510
<212> PRT
<213> 烟草(Nicotiani tabacum)
<400> 26
Met Ala Leu Leu Leu Ala Val Phe Phe Phe Ser Leu Val Ser Ile Phe
1 5 10 15
Leu Phe Leu Ser Phe Leu Phe Leu Leu Arg Lys Trp Lys Asn Ser Asn
20 25 30
Ser Gln Ser Lys Lys Leu Pro Pro Gly Pro Trp Lys Ile Pro Ile Leu
35 40 45
Gly Ser Met Leu His Met Ile Gly Gly Glu Pro His His Val Leu Arg
50 55 60
Asp Leu Ala Lys Lys Tyr Gly Pro Leu Met His Leu Gln Leu Gly Glu
65 70 75 80
Ile Ser Ala Val Val Val Thr Ser Arg Asp Met Ala Lys Glu Val Leu
85 90 95
Lys Thr His Asp Val Val Phe Ala Ser Arg Pro Lys Ile Val Ala Met
100 105 110
Asp Ile Ile Cys Tyr Asn Gln Ser Asp Ile Ala Phe Ser Pro Tyr Gly
115 120 125
Asp His Trp Arg Gln Met Arg Lys Ile Cys Val Met Glu Leu Leu Asn
130 135 140
Ala Lys Asn Val Arg Ser Phe Ser Ser Ile Arg Arg Asp Glu Val Val
145 150 155 160
Arg Leu Ile Asp Ser Ile Arg Ser Asp Ser Ser Ser Gly Glu Leu Val
165 170 175
Asn Phe Thr Gln Arg Ile Ile Trp Phe Ala Ser Ser Met Thr Cys Arg
180 185 190
Ser Ala Phe Gly Gln Val Leu Lys Gly Gln Asp Ile Phe Ala Lys Lys
195 200 205
Ile Arg Glu Val Ile Gly Leu Ala Glu Gly Phe Asp Val Val Asp Ile
210 215 220
Phe Pro Thr Tyr Lys Phe Leu His Val Leu Ser Gly Met Lys Arg Lys
225 230 235 240
Leu Leu Asn Ala His Leu Lys Val Asp Ala Ile Val Glu Asp Val Ile
245 250 255
Asn Glu His Lys Lys Asn Leu Ala Ala Gly Lys Ser Asn Gly Ala Leu
260 265 270
Gly Gly Glu Asp Leu Ile Asp Val Leu Leu Arg Leu Met Asn Asp Thr
275 280 285
Ser Leu Gln Phe Pro Ile Thr Asn Asp Asn Ile Lys Ala Val Ile Val
290 295 300
Asp Met Phe Ala Ala Gly Thr Glu Thr Ser Ser Thr Thr Thr Val Trp
305 310 315 320
Ala Met Ala Glu Met Met Lys Asn Pro Ser Val Phe Thr Lys Ala Gln
325 330 335
Ala Glu Val Arg Glu Ala Phe Arg Asp Lys Val Ser Phe Asp Glu Asn
340 345 350
Asp Val Glu Glu Leu Lys Tyr Leu Lys Leu Val Ile Lys Glu Thr Leu
355 360 365
Arg Leu His Pro Pro Ser Pro Leu Leu Val Pro Arg Glu Cys Arg Glu
370 375 380
Asp Thr Asp Ile Asn Gly Tyr Thr Ile Pro Ala Lys Thr Lys Val Met
385 390 395 400
Val Asn Val Trp Ala Leu Gly Arg Asp Pro Lys Tyr Trp Asp Asp Ala
405 410 415
Glu Ser Phe Lys Pro Glu Arg Phe Glu Gln Cys Ser Val Asp Phe Phe
420 425 430
Gly Asn Asn Phe Glu Phe Leu Pro Phe Gly Gly Gly Arg Arg Ile Cys
435 440 445
Pro Gly Met Ser Phe Gly Leu Ala Asn Leu Tyr Leu Pro Leu Ala Gln
450 455 460
Leu Leu Tyr His Phe Asp Trp Lys Leu Pro Thr Gly Ile Met Pro Arg
465 470 475 480
Asp Leu Asp Leu Thr Glu Leu Ser Gly Ile Thr Ile Ala Arg Lys Gly
485 490 495
Gly Leu Tyr Leu Asn Ala Thr Pro Tyr Gln Pro Ser Arg Glu
500 505 510
<210> 27
<211> 1530
<212> DNA
<213> 烟草
<400> 27
atggctctgt tattagcagt tttcttcttc tccctggtct caatcttcct gttcctgtcc 60
tttctgttcc tgctgcgtaa atggaaaaac tctaatagcc aatccaaaaa actgccgccg 120
ggtccgtgga aaattccgat cctgggctct atgctgcaca tgattggcgg tgaaccgcat 180
catgtgctgc gtgatctggc gaaaaaatat ggtccgctga tgcatctgca actgggcgaa 240
atctctgcgg tggttgtcac gagtcgtgac atggccaaag aagtgctgaa aacccatgat 300
gtggtttttg catctcgccc gaaaatcgtt gctatggata ttatctgcta taaccagtcg 360
gacatcgcgt tcagcccgta cggtgatcac tggcgtcaaa tgcgcaaaat ttgtgtcatg 420
gaactgctga acgccaaaaa tgtgcgcagt tttagctcta ttcgtcgtga tgaagtcgtg 480
cgtctgattg attccatccg ctcagacagt tcctcaggcg aactggtgaa ttttacgcag 540
cgtattatct ggttcgcatc gagcatgacc tgccgctcgg cttttggtca ggttctgaaa 600
ggccaagata tttttgcgaa gaaaattcgt gaagtgatcg gtctggccga aggcttcgat 660
gttgtggata tttttccgac ctataaattc ctgcatgtcc tgagcggtat gaaacgcaaa 720
ctgctgaacg cgcacctgaa agttgatgcc attgtcgaag acgtgatcaa cgaacataag 780
aaaaacctgg cggcgggtaa atccaacggc gcactgggcg gtgaagatct gattgacgtg 840
ctgctgcgtc tgatgaatga taccagcctg caatttccga tcaccaacga caacattaaa 900
gcggtgatcg ttgatatgtt cgcggcgggc accgaaacct ctagtaccac gaccgtttgg 960
gcgatggccg aaatgatgaa aaacccgtcg gtgtttacca aagcacaagc ggaagtgcgt 1020
gaagcgtttc gtgataaagt tagcttcgat gaaaatgatg tggaagaact gaaatacctg 1080
aaactggtga ttaaagaaac gctgcgtctg catccgccga gcccgctgct ggttccgcgt 1140
gaatgccgtg aagataccga cattaacggt tatacgatcc cggcaaaaac caaagtcatg 1200
gtgaatgttt gggctctggg ccgtgacccg aaatactggg atgacgcaga atcctttaaa 1260
ccggaacgct ttgaacagtg ctcagtggat ttctttggta acaactttga atttctgccg 1320
tttggcggtg gccgtcgcat ttgtccgggt atgtccttcg gcctggcgaa cctgtatctg 1380
ccgctggccc aactgctgta ccactttgat tggaaactgc cgacgggtat tatgccgcgt 1440
gatctggacc tgacggaact gtctggcatt accatcgcac gcaaaggtgg cctgtatctg 1500
aatgctaccc cgtaccagcc gagtcgtgaa 1530
<210> 28
<211> 522
<212> PRT
<213> 葡萄柚
<400> 28
Met Glu Leu Pro Leu Lys Ser Ile Ala Leu Thr Ile Val Ile Val Thr
1 5 10 15
Val Leu Thr Trp Ala Trp Arg Val Leu Asn Trp Val Trp Leu Arg Pro
20 25 30
Lys Lys Leu Glu Lys Phe Leu Arg Gln Gln Gly Leu Lys Gly Asn Ser
35 40 45
Tyr Arg Leu Leu Phe Gly Asp Leu Lys Glu Asn Ser Ile Glu Leu Lys
50 55 60
Glu Ala Lys Ala Arg Pro Leu Ser Leu Asp Asp Asp Ile Ala Ile Arg
65 70 75 80
Val Asn Pro Phe Leu His Lys Leu Val Asn Asp Tyr Gly Lys Asn Ser
85 90 95
Phe Met Trp Phe Gly Pro Thr Pro Arg Val Asn Ile Met Asn Pro Asp
100 105 110
Gln Ile Lys Ala Ile Phe Thr Lys Ile Asn Asp Phe Gln Lys Val Asn
115 120 125
Ser Ile Pro Leu Ala Arg Leu Leu Ile Val Gly Leu Ala Thr Leu Glu
130 135 140
Gly Glu Lys Trp Ala Lys His Arg Lys Leu Ile Asn Pro Ala Phe His
145 150 155 160
Gln Glu Lys Leu Lys Leu Met Leu Pro Ala Phe Tyr Leu Ser Cys Ile
165 170 175
Glu Ile Ile Thr Lys Trp Glu Lys Gln Met Ser Val Glu Gly Ser Ser
180 185 190
Glu Leu Asp Val Trp Pro Tyr Leu Ala Asn Leu Thr Ser Asp Val Ile
195 200 205
Ser Arg Thr Ala Phe Gly Ser Ser Tyr Glu Glu Gly Arg Arg Ile Phe
210 215 220
Gln Leu Gln Ala Glu Leu Ala Glu Leu Thr Met Gln Val Phe Arg Ser
225 230 235 240
Val His Ile Pro Gly Trp Arg Phe Leu Pro Thr Lys Arg Asn Arg Arg
245 250 255
Met Lys Glu Ile Asp Lys Glu Ile Arg Ala Ser Leu Met Gly Ile Ile
260 265 270
Lys Asn Arg Glu Lys Ala Met Arg Ala Gly Glu Ala Ala Asn Asn Asp
275 280 285
Leu Leu Gly Ile Leu Met Glu Thr Ser Phe Arg Glu Ile Glu Glu His
290 295 300
Gly Asn Asn Lys Asn Val Gly Phe Ser Met Asn Asp Val Ile Glu Glu
305 310 315 320
Cys Lys Leu Phe Tyr Phe Ala Gly Gln Glu Thr Thr Ser Val Leu Leu
325 330 335
Asn Trp Thr Met Val Leu Leu Ser Lys His Gln Asp Trp Gln Glu Arg
340 345 350
Ala Arg Gln Glu Val Leu Gln Val Phe Gly Asn Asn Lys Pro Asp Tyr
355 360 365
Asp Gly Leu Asn His Leu Lys Ile Val Gln Met Ile Leu Tyr Glu Val
370 375 380
Leu Arg Leu Tyr Pro Pro Val Thr Val Leu Ser Arg Ala Val Phe Lys
385 390 395 400
Glu Thr Lys Leu Gly Asn Leu Thr Leu Pro Ala Gly Val Gln Ile Gly
405 410 415
Leu Pro Met Ile Leu Val His Gln Asp Pro Glu Leu Trp Gly Asp Asp
420 425 430
Ala Val Glu Phe Lys Pro Glu Arg Phe Ala Glu Gly Ile Ser Lys Ala
435 440 445
Ala Lys Asn Gln Val Ser Tyr Phe Pro Phe Ala Leu Gly Pro Arg Ile
450 455 460
Cys Val Gly Gln Asn Phe Ala Leu Val Glu Ala Lys Met Ala Thr Ala
465 470 475 480
Met Ile Leu Gln Asn Tyr Ser Phe Glu Leu Ser Pro Ser Tyr Val His
485 490 495
Ala Pro Thr Ala Val Pro Thr Leu His Pro Glu Leu Gly Thr Gln Leu
500 505 510
Ile Leu Arg Lys Leu Trp Cys Lys Asn Asn
515 520
<210> 29
<211> 523
<212> PRT
<213> 葡萄柚
<400> 29
Met Ala Leu Leu Leu Ala Val Phe Ile Ala Leu Thr Ile Val Ile Val
1 5 10 15
Thr Val Leu Thr Trp Ala Trp Arg Val Leu Asn Trp Val Trp Leu Arg
20 25 30
Pro Lys Lys Leu Glu Lys Phe Leu Arg Gln Gln Gly Leu Lys Gly Asn
35 40 45
Ser Tyr Arg Leu Leu Phe Gly Asp Leu Lys Glu Asn Ser Ile Glu Leu
50 55 60
Lys Glu Ala Lys Ala Arg Pro Leu Ser Leu Asp Asp Asp Ile Ala Ile
65 70 75 80
Arg Val Asn Pro Phe Leu His Lys Leu Val Asn Asp Tyr Gly Lys Asn
85 90 95
Ser Phe Met Trp Phe Gly Pro Thr Pro Arg Val Asn Ile Met Asn Pro
100 105 110
Asp Gln Ile Lys Ala Ile Phe Thr Lys Ile Asn Asp Phe Gln Lys Val
115 120 125
Asn Ser Ile Pro Leu Ala Arg Leu Leu Ile Val Gly Leu Ala Thr Leu
130 135 140
Glu Gly Glu Lys Trp Ala Lys His Arg Lys Leu Ile Asn Pro Ala Phe
145 150 155 160
His Gln Glu Lys Leu Lys Leu Met Leu Pro Ala Phe Tyr Leu Ser Cys
165 170 175
Ile Glu Ile Ile Thr Lys Trp Glu Lys Gln Met Ser Val Glu Gly Ser
180 185 190
Ser Glu Leu Asp Val Trp Pro Tyr Leu Ala Asn Leu Thr Ser Asp Val
195 200 205
Ile Ser Arg Thr Ala Phe Gly Ser Ser Tyr Glu Glu Gly Arg Arg Ile
210 215 220
Phe Gln Leu Gln Ala Glu Leu Ala Glu Leu Thr Met Gln Val Phe Arg
225 230 235 240
Ser Val His Ile Pro Gly Trp Arg Phe Leu Pro Thr Lys Arg Asn Arg
245 250 255
Arg Met Lys Glu Ile Asp Lys Glu Ile Arg Ala Ser Leu Met Gly Ile
260 265 270
Ile Lys Asn Arg Glu Lys Ala Met Arg Ala Gly Glu Ala Ala Asn Asn
275 280 285
Asp Leu Leu Gly Ile Leu Met Glu Thr Ser Phe Arg Glu Ile Glu Glu
290 295 300
His Gly Asn Asn Lys Asn Val Gly Phe Ser Met Asn Asp Val Ile Glu
305 310 315 320
Glu Cys Lys Leu Phe Tyr Phe Ala Gly Gln Glu Thr Thr Ser Val Leu
325 330 335
Leu Asn Trp Thr Met Val Leu Leu Ser Lys His Gln Asp Trp Gln Glu
340 345 350
Arg Ala Arg Gln Glu Val Leu Gln Val Phe Gly Asn Asn Lys Pro Asp
355 360 365
Tyr Asp Gly Leu Asn His Leu Lys Ile Val Gln Met Ile Leu Tyr Glu
370 375 380
Val Leu Arg Leu Tyr Pro Pro Val Thr Val Leu Ser Arg Ala Val Phe
385 390 395 400
Lys Glu Thr Lys Leu Gly Asn Leu Thr Leu Pro Ala Gly Val Gln Ile
405 410 415
Gly Leu Pro Met Ile Leu Val His Gln Asp Pro Glu Leu Trp Gly Asp
420 425 430
Asp Ala Val Glu Phe Lys Pro Glu Arg Phe Ala Glu Gly Ile Ser Lys
435 440 445
Ala Ala Lys Asn Gln Val Ser Tyr Phe Pro Phe Ala Leu Gly Pro Arg
450 455 460
Ile Cys Val Gly Gln Asn Phe Ala Leu Val Glu Ala Lys Met Ala Thr
465 470 475 480
Ala Met Ile Leu Gln Asn Tyr Ser Phe Glu Leu Ser Pro Ser Tyr Val
485 490 495
His Ala Pro Thr Ala Val Pro Thr Leu His Pro Glu Leu Gly Thr Gln
500 505 510
Leu Ile Leu Arg Lys Leu Trp Cys Lys Asn Asn
515 520
<210> 30
<211> 1569
<212> DNA
<213> 葡萄柚
<400> 30
atggctctgt tattagcagt tttcattgct ctgacgattg ttattgttac ggtgctgacc 60
tgggcgtggc gtgtgctgaa ctgggtttgg ctgcgtccga aaaaactgga aaaatttctg 120
cgccagcaag gcctgaaggg taacagctat cgtctgctgt tcggcgatct gaaagaaaat 180
tctattgaac tgaaagaagc gaaagcccgt ccgctgagtc tggatgacga tattgcaatc 240
cgcgttaacc cgtttctgca taaactggtc aacgattacg gcaaaaattc ttttatgtgg 300
ttcggtccga ccccgcgcgt gaacattatg aacccggatc agattaaagc gatctttacg 360
aaaatcaacg atttccaaaa agttaatagc attccgctgg cgcgtctgct gatcgtcggc 420
ctggccaccc tggaaggtga aaaatgggca aaacatcgca aactgattaa cccggctttt 480
caccaagaaa aactgaaact gatgctgccg gcgttctatc tgtcctgcat cgaaattatc 540
acgaaatggg aaaaacagat gtcagtggaa ggtagctctg aactggacgt ttggccgtat 600
ctggccaatc tgaccagcga tgttatttct cgtacggcat ttggcagttc ctacgaagaa 660
ggtcgtcgca tcttccagtt acaggcggaa ctggccgaac tgaccatgca ggtttttcgt 720
tctgtccata ttccgggctg gcgtttcctg ccgacgaaac gcaaccgtcg catgaaagaa 780
attgacaaag aaatccgcgc cagtctgatg ggtattatca aaaatcgtga aaaagcaatg 840
cgcgctggcg aagcggccaa caatgatctg ctgggtattc tgatggaaac cagctttcgt 900
gaaatcgaag aacacggcaa caataaaaac gtcggtttca gcatgaatga cgtgatcgaa 960
gaatgtaaac tgttttattt cgctggccag gaaaccacgt cagttctgct gaactggacg 1020
atggtgctgc tgtcgaaaca tcaggattgg caagaacgtg cccgccagga agtcctgcaa 1080
gtgtttggca acaataaacc ggactacgat ggtctgaacc acctgaaaat tgtgcagatg 1140
atcctgtatg aagttctgcg tctgtatccg ccggtgacgg tgctgagccg tgcggtgttt 1200
aaagaaacca aactgggtaa tctgacgctg ccggcaggcg tccagattgg tctgccgatg 1260
atcctggtgc accaggaccc ggaactgtgg ggcgacgatg ctgtggaatt taaaccggaa 1320
cgtttcgcgg aaggtattag taaagcagct aaaaatcagg tttcctattt tccgttcgcg 1380
ctgggtccgc gtatttgcgt cggtcaaaac tttgcactgg tggaagctaa aatggcaacc 1440
gctatgatcc tgcaaaatta tagctttgaa ctgtcaccga gctatgttca tgcgccgacc 1500
gccgttccga cgctgcaccc ggaactgggc acgcaactga ttctgcgtaa actgtggtgt 1560
aaaaacaat 1569
<210> 31
<211> 495
<212> PRT
<213> 黄花蒿
<400> 31
Met Lys Ser Ile Leu Lys Ala Met Ala Leu Ser Leu Thr Thr Ser Ile
1 5 10 15
Ala Leu Ala Thr Ile Leu Leu Phe Val Tyr Lys Phe Ala Thr Arg Ser
20 25 30
Lys Ser Thr Lys Lys Ser Leu Pro Glu Pro Trp Arg Leu Pro Ile Ile
35 40 45
Gly His Met His His Leu Ile Gly Thr Thr Pro His Arg Gly Val Arg
50 55 60
Asp Leu Ala Arg Lys Tyr Gly Ser Leu Met His Leu Gln Leu Gly Glu
65 70 75 80
Val Pro Thr Ile Val Val Ser Ser Pro Lys Trp Ala Lys Glu Ile Leu
85 90 95
Thr Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly
100 105 110
Glu Ile Val Leu Tyr His Asn Thr Asp Val Val Leu Ala Pro Tyr Gly
115 120 125
Glu Tyr Trp Arg Gln Leu Arg Lys Ile Cys Thr Leu Glu Leu Leu Ser
130 135 140
Val Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp
145 150 155 160
Asn Leu Val Gln Glu Ile Lys Ala Ser Gly Ser Gly Arg Pro Val Asn
165 170 175
Leu Ser Glu Asn Val Phe Lys Leu Ile Ala Thr Ile Leu Ser Arg Ala
180 185 190
Ala Phe Gly Lys Gly Ile Lys Asp Gln Lys Glu Leu Thr Glu Ile Val
195 200 205
Lys Glu Ile Leu Arg Gln Thr Gly Gly Phe Asp Val Ala Asp Ile Phe
210 215 220
Pro Ser Lys Lys Phe Leu His His Leu Ser Gly Lys Arg Ala Arg Leu
225 230 235 240
Thr Ser Leu Arg Lys Lys Ile Asp Asn Leu Ile Asp Asn Leu Val Ala
245 250 255
Glu His Thr Val Asn Thr Ser Ser Lys Thr Asn Glu Thr Leu Leu Asp
260 265 270
Val Leu Leu Arg Leu Lys Asp Ser Ala Glu Phe Pro Leu Thr Ser Asp
275 280 285
Asn Ile Lys Ala Ile Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr
290 295 300
Ser Ser Ser Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Lys Cys Pro
305 310 315 320
Lys Ala Met Glu Lys Val Gln Ala Glu Leu Arg Lys Ala Leu Asn Gly
325 330 335
Lys Glu Lys Ile His Glu Glu Asp Ile Gln Glu Leu Ser Tyr Leu Asn
340 345 350
Met Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Leu Pro Leu Val
355 360 365
Leu Pro Arg Glu Cys Arg Gln Pro Val Asn Leu Ala Gly Tyr Asn Ile
370 375 380
Pro Asn Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp
385 390 395 400
Pro Glu Tyr Trp Lys Asp Ala Glu Ala Phe Ile Pro Glu Arg Phe Glu
405 410 415
Asn Ser Ser Ala Thr Val Met Gly Ala Glu Tyr Glu Tyr Leu Pro Phe
420 425 430
Gly Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn
435 440 445
Val Gln Leu Pro Leu Ala Asn Ile Leu Tyr His Phe Asn Trp Lys Leu
450 455 460
Pro Asn Gly Val Ser Tyr Asp Gln Ile Asp Met Thr Glu Ser Ser Gly
465 470 475 480
Ala Thr Met Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Ser Phe
485 490 495
<210> 32
<211> 488
<212> PRT
<213> 黄花蒿
<400> 32
Met Ala Leu Leu Leu Ala Val Phe Ile Ala Leu Ala Thr Ile Leu Leu
1 5 10 15
Phe Val Tyr Lys Phe Ala Thr Arg Ser Lys Ser Thr Lys Lys Ser Leu
20 25 30
Pro Glu Pro Trp Arg Leu Pro Ile Ile Gly His Met His His Leu Ile
35 40 45
Gly Thr Thr Pro His Arg Gly Val Arg Asp Leu Ala Arg Lys Tyr Gly
50 55 60
Ser Leu Met His Leu Gln Leu Gly Glu Val Pro Thr Ile Val Val Ser
65 70 75 80
Ser Pro Lys Trp Ala Lys Glu Ile Leu Thr Thr Tyr Asp Ile Thr Phe
85 90 95
Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu Ile Val Leu Tyr His Asn
100 105 110
Thr Asp Val Val Leu Ala Pro Tyr Gly Glu Tyr Trp Arg Gln Leu Arg
115 120 125
Lys Ile Cys Thr Leu Glu Leu Leu Ser Val Lys Lys Val Lys Ser Phe
130 135 140
Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn Leu Val Gln Glu Ile Lys
145 150 155 160
Ala Ser Gly Ser Gly Arg Pro Val Asn Leu Ser Glu Asn Val Phe Lys
165 170 175
Leu Ile Ala Thr Ile Leu Ser Arg Ala Ala Phe Gly Lys Gly Ile Lys
180 185 190
Asp Gln Lys Glu Leu Thr Glu Ile Val Lys Glu Ile Leu Arg Gln Thr
195 200 205
Gly Gly Phe Asp Val Ala Asp Ile Phe Pro Ser Lys Lys Phe Leu His
210 215 220
His Leu Ser Gly Lys Arg Ala Arg Leu Thr Ser Leu Arg Lys Lys Ile
225 230 235 240
Asp Asn Leu Ile Asp Asn Leu Val Ala Glu His Thr Val Asn Thr Ser
245 250 255
Ser Lys Thr Asn Glu Thr Leu Leu Asp Val Leu Leu Arg Leu Lys Asp
260 265 270
Ser Ala Glu Phe Pro Leu Thr Ser Asp Asn Ile Lys Ala Ile Ile Leu
275 280 285
Asp Met Phe Gly Ala Gly Thr Asp Thr Ser Ser Ser Thr Ile Glu Trp
290 295 300
Ala Ile Ser Glu Leu Ile Lys Cys Pro Lys Ala Met Glu Lys Val Gln
305 310 315 320
Ala Glu Leu Arg Lys Ala Leu Asn Gly Lys Glu Lys Ile His Glu Glu
325 330 335
Asp Ile Gln Glu Leu Ser Tyr Leu Asn Met Val Ile Lys Glu Thr Leu
340 345 350
Arg Leu His Pro Pro Leu Pro Leu Val Leu Pro Arg Glu Cys Arg Gln
355 360 365
Pro Val Asn Leu Ala Gly Tyr Asn Ile Pro Asn Lys Thr Lys Leu Ile
370 375 380
Val Asn Val Phe Ala Ile Asn Arg Asp Pro Glu Tyr Trp Lys Asp Ala
385 390 395 400
Glu Ala Phe Ile Pro Glu Arg Phe Glu Asn Ser Ser Ala Thr Val Met
405 410 415
Gly Ala Glu Tyr Glu Tyr Leu Pro Phe Gly Ala Gly Arg Arg Met Cys
420 425 430
Pro Gly Ala Ala Leu Gly Leu Ala Asn Val Gln Leu Pro Leu Ala Asn
435 440 445
Ile Leu Tyr His Phe Asn Trp Lys Leu Pro Asn Gly Val Ser Tyr Asp
450 455 460
Gln Ile Asp Met Thr Glu Ser Ser Gly Ala Thr Met Gln Arg Lys Thr
465 470 475 480
Glu Leu Leu Leu Val Pro Ser Phe
485
<210> 33
<211> 1464
<212> DNA
<213> 黄花蒿
<400> 33
atggctctgt tattagcagt tttcatcgca ctggcaacca ttctgctgtt tgtgtataaa 60
ttcgctaccc gttccaaatc aacgaaaaaa tcactgccgg aaccgtggcg cctgccgatt 120
atcggtcaca tgcatcacct gatcggcacc accccgcatc gtggcgtgcg tgatctggca 180
cgcaaatatg gctcgctgat gcatctgcaa ctgggtgaag tcccgaccat tgtggttagc 240
tctccgaaat gggcgaaaga aatcctgacc acctatgata ttacctttgc caaccgcccg 300
gaaaccctga cgggcgaaat cgtgctgtac cacaatacgg atgtggtgct ggcgccgtat 360
ggtgaatact ggcgtcaact gcgtaaaatt tgcaccctgg aactgctgag tgtgaaaaaa 420
gttaaatctt tccagagcct gcgtgaagaa gaatgttgga acctggttca agaaattaaa 480
gcatcgggca gcggtcgccc ggttaacctg agtgaaaatg tctttaaact gattgctacc 540
atcctgtccc gtgcggcctt cggcaaaggt atcaaagatc agaaagaact gaccgaaatt 600
gtcaaagaaa tcctgcgcca aacgggcggt tttgatgtgg cggacatttt tccgtcgaaa 660
aaattcctgc atcacctgag cggtaaacgt gcccgcctga ccagcctgcg taagaaaatt 720
gataacctga tcgacaatct ggtcgcggaa cataccgtga acacgagttc caaaaccaat 780
gaaacgctgc tggatgtgct gctgcgcctg aaagactccg ccgaatttcc gctgacctca 840
gataatatca aagcgattat cctggatatg ttcggtgcag gcaccgatac cagcagcagc 900
accattgaat gggcaatctc agaactgatt aaatgcccga aagctatgga aaaagtccag 960
gcagaactgc gcaaagctct gaacggcaaa gaaaaaatcc atgaagaaga tattcaagaa 1020
ctgtcttacc tgaacatggt tatcaaagaa accctgcgtc tgcacccgcc gctgccgctg 1080
gtgctgccgc gtgaatgtcg ccagccggtt aacctggcag gctataacat cccgaataaa 1140
acgaaactga tcgttaacgt ctttgctatt aaccgtgacc cggaatactg gaaagacgcg 1200
gaagccttta tcccggaacg ctttgaaaac agcagcgcga ccgtgatggg tgccgaatat 1260
gaatacctgc cgtttggcgc gggtcgtcgc atgtgtccgg gcgcagctct gggtctggca 1320
aacgtgcaac tgccgctggc taatatcctg tatcacttca actggaaact gccgaatggc 1380
gttagctacg atcaaattga catgaccgaa agctcaggtg ccacgatgca acgcaaaacc 1440
gaactgctgc tggtgccgtc cttc 1464
<210> 34
<211> 509
<212> PRT
<213> 拟南芥
<400> 34
Met Ala Phe Phe Ser Met Ile Ser Ile Leu Leu Gly Phe Val Ile Ser
1 5 10 15
Ser Phe Ile Phe Ile Phe Phe Phe Lys Lys Leu Leu Ser Phe Ser Arg
20 25 30
Lys Asn Met Ser Glu Val Ser Thr Leu Pro Ser Val Pro Val Val Pro
35 40 45
Gly Phe Pro Val Ile Gly Asn Leu Leu Gln Leu Lys Glu Lys Lys Pro
50 55 60
His Lys Thr Phe Thr Arg Trp Ser Glu Ile Tyr Gly Pro Ile Tyr Ser
65 70 75 80
Ile Lys Met Gly Ser Ser Ser Leu Ile Val Leu Asn Ser Thr Glu Thr
85 90 95
Ala Lys Glu Ala Met Val Thr Arg Phe Ser Ser Ile Ser Thr Arg Lys
100 105 110
Leu Ser Asn Ala Leu Thr Val Leu Thr Cys Asp Lys Ser Met Val Ala
115 120 125
Thr Ser Asp Tyr Asp Asp Phe His Lys Leu Val Lys Arg Cys Leu Leu
130 135 140
Asn Gly Leu Leu Gly Ala Asn Ala Gln Lys Arg Lys Arg His Tyr Arg
145 150 155 160
Asp Ala Leu Ile Glu Asn Val Ser Ser Lys Leu His Ala His Ala Arg
165 170 175
Asp His Pro Gln Glu Pro Val Asn Phe Arg Ala Ile Phe Glu His Glu
180 185 190
Leu Phe Gly Val Ala Leu Lys Gln Ala Phe Gly Lys Asp Val Glu Ser
195 200 205
Ile Tyr Val Lys Glu Leu Gly Val Thr Leu Ser Lys Asp Glu Ile Phe
210 215 220
Lys Val Leu Val His Asp Met Met Glu Gly Ala Ile Asp Val Asp Trp
225 230 235 240
Arg Asp Phe Phe Pro Tyr Leu Lys Trp Ile Pro Asn Lys Ser Phe Glu
245 250 255
Ala Arg Ile Gln Gln Lys His Lys Arg Arg Leu Ala Val Met Asn Ala
260 265 270
Leu Ile Gln Asp Arg Leu Lys Gln Asn Gly Ser Glu Ser Asp Asp Asp
275 280 285
Cys Tyr Leu Asn Phe Leu Met Ser Glu Ala Lys Thr Leu Thr Lys Glu
290 295 300
Gln Ile Ala Ile Leu Val Trp Glu Thr Ile Ile Glu Thr Ala Asp Thr
305 310 315 320
Thr Leu Val Thr Thr Glu Trp Ala Ile Tyr Glu Leu Ala Lys His Pro
325 330 335
Ser Val Gln Asp Arg Leu Cys Lys Glu Ile Gln Asn Val Cys Gly Gly
340 345 350
Glu Lys Phe Lys Glu Glu Gln Leu Ser Gln Val Pro Tyr Leu Asn Gly
355 360 365
Val Phe His Glu Thr Leu Arg Lys Tyr Ser Pro Ala Pro Leu Val Pro
370 375 380
Ile Arg Tyr Ala His Glu Asp Thr Gln Ile Gly Gly Tyr His Val Pro
385 390 395 400
Ala Gly Ser Glu Ile Ala Ile Asn Ile Tyr Gly Cys Asn Met Asp Lys
405 410 415
Lys Arg Trp Glu Arg Pro Glu Asp Trp Trp Pro Glu Arg Phe Leu Asp
420 425 430
Asp Gly Lys Tyr Glu Thr Ser Asp Leu His Lys Thr Met Ala Phe Gly
435 440 445
Ala Gly Lys Arg Val Cys Ala Gly Ala Leu Gln Ala Ser Leu Met Ala
450 455 460
Gly Ile Ala Ile Gly Arg Leu Val Gln Glu Phe Glu Trp Lys Leu Arg
465 470 475 480
Asp Gly Glu Glu Glu Asn Val Asp Thr Tyr Gly Leu Thr Ser Gln Lys
485 490 495
Leu Tyr Pro Leu Met Ala Ile Ile Asn Pro Arg Arg Ser
500 505
<210> 35
<211> 513
<212> PRT
<213> 拟南芥
<400> 35
Met Ala Leu Leu Leu Ala Val Phe Ser Met Ile Ser Ile Leu Leu Gly
1 5 10 15
Phe Val Ile Ser Ser Phe Ile Phe Ile Phe Phe Phe Lys Lys Leu Leu
20 25 30
Ser Phe Ser Arg Lys Asn Met Ser Glu Val Ser Thr Leu Pro Ser Val
35 40 45
Pro Val Val Pro Gly Phe Pro Val Ile Gly Asn Leu Leu Gln Leu Lys
50 55 60
Glu Lys Lys Pro His Lys Thr Phe Thr Arg Trp Ser Glu Ile Tyr Gly
65 70 75 80
Pro Ile Tyr Ser Ile Lys Met Gly Ser Ser Ser Leu Ile Val Leu Asn
85 90 95
Ser Thr Glu Thr Ala Lys Glu Ala Met Val Thr Arg Phe Ser Ser Ile
100 105 110
Ser Thr Arg Lys Leu Ser Asn Ala Leu Thr Val Leu Thr Cys Asp Lys
115 120 125
Ser Met Val Ala Thr Ser Asp Tyr Asp Asp Phe His Lys Leu Val Lys
130 135 140
Arg Cys Leu Leu Asn Gly Leu Leu Gly Ala Asn Ala Gln Lys Arg Lys
145 150 155 160
Arg His Tyr Arg Asp Ala Leu Ile Glu Asn Val Ser Ser Lys Leu His
165 170 175
Ala His Ala Arg Asp His Pro Gln Glu Pro Val Asn Phe Arg Ala Ile
180 185 190
Phe Glu His Glu Leu Phe Gly Val Ala Leu Lys Gln Ala Phe Gly Lys
195 200 205
Asp Val Glu Ser Ile Tyr Val Lys Glu Leu Gly Val Thr Leu Ser Lys
210 215 220
Asp Glu Ile Phe Lys Val Leu Val His Asp Met Met Glu Gly Ala Ile
225 230 235 240
Asp Val Asp Trp Arg Asp Phe Phe Pro Tyr Leu Lys Trp Ile Pro Asn
245 250 255
Lys Ser Phe Glu Ala Arg Ile Gln Gln Lys His Lys Arg Arg Leu Ala
260 265 270
Val Met Asn Ala Leu Ile Gln Asp Arg Leu Lys Gln Asn Gly Ser Glu
275 280 285
Ser Asp Asp Asp Cys Tyr Leu Asn Phe Leu Met Ser Glu Ala Lys Thr
290 295 300
Leu Thr Lys Glu Gln Ile Ala Ile Leu Val Trp Glu Thr Ile Ile Glu
305 310 315 320
Thr Ala Asp Thr Thr Leu Val Thr Thr Glu Trp Ala Ile Tyr Glu Leu
325 330 335
Ala Lys His Pro Ser Val Gln Asp Arg Leu Cys Lys Glu Ile Gln Asn
340 345 350
Val Cys Gly Gly Glu Lys Phe Lys Glu Glu Gln Leu Ser Gln Val Pro
355 360 365
Tyr Leu Asn Gly Val Phe His Glu Thr Leu Arg Lys Tyr Ser Pro Ala
370 375 380
Pro Leu Val Pro Ile Arg Tyr Ala His Glu Asp Thr Gln Ile Gly Gly
385 390 395 400
Tyr His Val Pro Ala Gly Ser Glu Ile Ala Ile Asn Ile Tyr Gly Cys
405 410 415
Asn Met Asp Lys Lys Arg Trp Glu Arg Pro Glu Asp Trp Trp Pro Glu
420 425 430
Arg Phe Leu Asp Asp Gly Lys Tyr Glu Thr Ser Asp Leu His Lys Thr
435 440 445
Met Ala Phe Gly Ala Gly Lys Arg Val Cys Ala Gly Ala Leu Gln Ala
450 455 460
Ser Leu Met Ala Gly Ile Ala Ile Gly Arg Leu Val Gln Glu Phe Glu
465 470 475 480
Trp Lys Leu Arg Asp Gly Glu Glu Glu Asn Val Asp Thr Tyr Gly Leu
485 490 495
Thr Ser Gln Lys Leu Tyr Pro Leu Met Ala Ile Ile Asn Pro Arg Arg
500 505 510
Ser
<210> 36
<211> 1539
<212> DNA
<213> 拟南芥
<400> 36
atggctctgt tattagcagt tttttcgatg atttctatcc tgctgggctt tgttatctcg 60
tcctttatct ttatcttctt cttcaaaaaa ctgctgtcgt tttctcgtaa aaacatgtcc 120
gaagtttcaa ccctgccgag tgtcccggtg gttccgggtt ttccggttat cggtaatctg 180
ctgcagctga aagaaaagaa accgcataag accttcacgc gctggtccga aatctatggc 240
ccgatctact caattaaaat gggtagctct agtctgattg tgctgaactc taccgaaacg 300
gcaaaagaag ctatggttac ccgtttttcc tcaatttcga cgcgcaagct gagcaatgcg 360
ctgaccgtcc tgacgtgcga caaatctatg gtggccacca gtgattacga tgacttccat 420
aaactggtta agcgttgtct gctgaacggc ctgctgggtg cgaatgccca gaagcgtaag 480
cgccactatc gcgacgccct gattgaaaac gtgtcgagca aactgcatgc acacgctcgt 540
gatcatccgc aggaaccggt caattttcgc gcaatcttcg aacacgaact gtttggcgtg 600
gcgctgaaac aagccttcgg caaggatgtt gaatcgattt acgtcaaaga actgggcgtg 660
accctgagca aagacgaaat ctttaaggtc ctggtgcatg atatgatgga aggtgcaatt 720
gacgttgatt ggcgtgattt ctttccgtat ctgaaatgga ttccgaacaa gtcattcgaa 780
gctcgcattc agcaaaaaca caagcgtcgc ctggcagtga tgaacgctct gattcaggat 840
cgtctgaaac aaaatggctc tgaaagtgat gacgattgct atctgaattt tctgatgtcc 900
gaagcaaaaa ccctgacgaa ggaacagatt gctatcctgg tttgggaaac cattatcgaa 960
acggcggaca ccacgctggt caccacggaa tgggcgatct acgaactggc caagcatccg 1020
agcgttcagg atcgcctgtg caaagaaatt caaaacgtct gtggcggtga aaaatttaag 1080
gaagaacagc tgtcgcaagt gccgtatctg aatggtgttt tccacgaaac cctgcgtaaa 1140
tatagcccgg caccgctggt cccgatccgt tacgcccatg aagataccca gattggcggt 1200
tatcacgtgc cggcaggcag tgaaattgct atcaacattt acggttgcaa tatggacaaa 1260
aagcgttggg aacgcccgga agattggtgg ccggaacgtt ttctggacga tggcaaatat 1320
gaaacctctg atctgcataa gacgatggcg ttcggtgcag gtaaacgtgt gtgtgcaggt 1380
gcactgcaag caagtctgat ggcaggcatc gctattggtc gtctggtgca agaatttgaa 1440
tggaaactgc gcgacggcga agaagaaaac gttgatacct atggtctgac gtcccagaaa 1500
ctgtacccgc tgatggccat tatcaatccg cgtcgctca 1539
<210> 37
<211> 513
<212> PRT
<213> 甜菊
<400> 37
Met Asp Ala Val Thr Gly Leu Leu Thr Val Pro Ala Thr Ala Ile Thr
1 5 10 15
Ile Gly Gly Thr Ala Val Ala Leu Ala Val Ala Leu Ile Phe Trp Tyr
20 25 30
Leu Lys Ser Tyr Thr Ser Ala Arg Arg Ser Gln Ser Asn His Leu Pro
35 40 45
Arg Val Pro Glu Val Pro Gly Val Pro Leu Leu Gly Asn Leu Leu Gln
50 55 60
Leu Lys Glu Lys Lys Pro Tyr Met Thr Phe Thr Arg Trp Ala Ala Thr
65 70 75 80
Tyr Gly Pro Ile Tyr Ser Ile Lys Thr Gly Ala Thr Ser Met Val Val
85 90 95
Val Ser Ser Asn Glu Ile Ala Lys Glu Ala Leu Val Thr Arg Phe Gln
100 105 110
Ser Ile Ser Thr Arg Asn Leu Ser Lys Ala Leu Lys Val Leu Thr Ala
115 120 125
Asp Lys Thr Met Val Ala Met Ser Asp Tyr Asp Asp Tyr His Lys Thr
130 135 140
Val Lys Arg His Ile Leu Thr Ala Val Leu Gly Pro Asn Ala Gln Lys
145 150 155 160
Lys His Arg Ile His Arg Asp Ile Met Met Asp Asn Ile Ser Thr Gln
165 170 175
Leu His Glu Phe Val Lys Asn Asn Pro Glu Gln Glu Glu Val Asp Leu
180 185 190
Arg Lys Ile Phe Gln Ser Glu Leu Phe Gly Leu Ala Met Arg Gln Ala
195 200 205
Leu Gly Lys Asp Val Glu Ser Leu Tyr Val Glu Asp Leu Lys Ile Thr
210 215 220
Met Asn Arg Asp Glu Ile Phe Gln Val Leu Val Val Asp Pro Met Met
225 230 235 240
Gly Ala Ile Asp Val Asp Trp Arg Asp Phe Phe Pro Tyr Leu Lys Trp
245 250 255
Val Pro Asn Lys Lys Phe Glu Asn Thr Ile Gln Gln Met Tyr Ile Arg
260 265 270
Arg Glu Ala Val Met Lys Ser Leu Ile Lys Glu His Lys Lys Arg Ile
275 280 285
Ala Ser Gly Glu Lys Leu Asn Ser Tyr Ile Asp Tyr Leu Leu Ser Glu
290 295 300
Ala Gln Thr Leu Thr Asp Gln Gln Leu Leu Met Ser Leu Trp Glu Pro
305 310 315 320
Ile Ile Glu Ser Ser Asp Thr Thr Met Val Thr Thr Glu Trp Ala Met
325 330 335
Tyr Glu Leu Ala Lys Asn Pro Lys Leu Gln Asp Arg Leu Tyr Arg Asp
340 345 350
Ile Lys Ser Val Cys Gly Ser Glu Lys Ile Thr Glu Glu His Leu Ser
355 360 365
Gln Leu Pro Tyr Ile Thr Ala Ile Phe His Glu Thr Leu Arg Arg His
370 375 380
Ser Pro Val Pro Ile Ile Pro Leu Arg His Val His Glu Asp Thr Val
385 390 395 400
Leu Gly Gly Tyr His Val Pro Ala Gly Thr Glu Leu Ala Val Asn Ile
405 410 415
Tyr Gly Cys Asn Met Asp Lys Asn Val Trp Glu Asn Pro Glu Glu Trp
420 425 430
Asn Pro Glu Arg Phe Met Lys Glu Asn Glu Thr Ile Asp Phe Gln Lys
435 440 445
Thr Met Ala Phe Gly Gly Gly Lys Arg Val Cys Ala Gly Ser Leu Gln
450 455 460
Ala Leu Leu Thr Ala Ser Ile Gly Ile Gly Arg Met Val Gln Glu Phe
465 470 475 480
Glu Trp Lys Leu Lys Asp Met Thr Gln Glu Glu Val Asn Thr Ile Gly
485 490 495
Leu Thr Thr Gln Met Leu Arg Pro Leu Arg Ala Ile Ile Lys Pro Arg
500 505 510
Ile
<210> 38
<211> 501
<212> PRT
<213> 甜菊
<400> 38
Met Ala Leu Leu Leu Ala Val Phe Ala Val Ala Leu Ala Val Ala Leu
1 5 10 15
Ile Phe Trp Tyr Leu Lys Ser Tyr Thr Ser Ala Arg Arg Ser Gln Ser
20 25 30
Asn His Leu Pro Arg Val Pro Glu Val Pro Gly Val Pro Leu Leu Gly
35 40 45
Asn Leu Leu Gln Leu Lys Glu Lys Lys Pro Tyr Met Thr Phe Thr Arg
50 55 60
Trp Ala Ala Thr Tyr Gly Pro Ile Tyr Ser Ile Lys Thr Gly Ala Thr
65 70 75 80
Ser Met Val Val Val Ser Ser Asn Glu Ile Ala Lys Glu Ala Leu Val
85 90 95
Thr Arg Phe Gln Ser Ile Ser Thr Arg Asn Leu Ser Lys Ala Leu Lys
100 105 110
Val Leu Thr Ala Asp Lys Thr Met Val Ala Met Ser Asp Tyr Asp Asp
115 120 125
Tyr His Lys Thr Val Lys Arg His Ile Leu Thr Ala Val Leu Gly Pro
130 135 140
Asn Ala Gln Lys Lys His Arg Ile His Arg Asp Ile Met Met Asp Asn
145 150 155 160
Ile Ser Thr Gln Leu His Glu Phe Val Lys Asn Asn Pro Glu Gln Glu
165 170 175
Glu Val Asp Leu Arg Lys Ile Phe Gln Ser Glu Leu Phe Gly Leu Ala
180 185 190
Met Arg Gln Ala Leu Gly Lys Asp Val Glu Ser Leu Tyr Val Glu Asp
195 200 205
Leu Lys Ile Thr Met Asn Arg Asp Glu Ile Phe Gln Val Leu Val Val
210 215 220
Asp Pro Met Met Gly Ala Ile Asp Val Asp Trp Arg Asp Phe Phe Pro
225 230 235 240
Tyr Leu Lys Trp Val Pro Asn Lys Lys Phe Glu Asn Thr Ile Gln Gln
245 250 255
Met Tyr Ile Arg Arg Glu Ala Val Met Lys Ser Leu Ile Lys Glu His
260 265 270
Lys Lys Arg Ile Ala Ser Gly Glu Lys Leu Asn Ser Tyr Ile Asp Tyr
275 280 285
Leu Leu Ser Glu Ala Gln Thr Leu Thr Asp Gln Gln Leu Leu Met Ser
290 295 300
Leu Trp Glu Pro Ile Ile Glu Ser Ser Asp Thr Thr Met Val Thr Thr
305 310 315 320
Glu Trp Ala Met Tyr Glu Leu Ala Lys Asn Pro Lys Leu Gln Asp Arg
325 330 335
Leu Tyr Arg Asp Ile Lys Ser Val Cys Gly Ser Glu Lys Ile Thr Glu
340 345 350
Glu His Leu Ser Gln Leu Pro Tyr Ile Thr Ala Ile Phe His Glu Thr
355 360 365
Leu Arg Arg His Ser Pro Val Pro Ile Ile Pro Leu Arg His Val His
370 375 380
Glu Asp Thr Val Leu Gly Gly Tyr His Val Pro Ala Gly Thr Glu Leu
385 390 395 400
Ala Val Asn Ile Tyr Gly Cys Asn Met Asp Lys Asn Val Trp Glu Asn
405 410 415
Pro Glu Glu Trp Asn Pro Glu Arg Phe Met Lys Glu Asn Glu Thr Ile
420 425 430
Asp Phe Gln Lys Thr Met Ala Phe Gly Gly Gly Lys Arg Val Cys Ala
435 440 445
Gly Ser Leu Gln Ala Leu Leu Thr Ala Ser Ile Gly Ile Gly Arg Met
450 455 460
Val Gln Glu Phe Glu Trp Lys Leu Lys Asp Met Thr Gln Glu Glu Val
465 470 475 480
Asn Thr Ile Gly Leu Thr Thr Gln Met Leu Arg Pro Leu Arg Ala Ile
485 490 495
Ile Lys Pro Arg Ile
500
<210> 39
<211> 1503
<212> DNA
<213> 甜菊
<400> 39
atggctctgt tattagcagt ttttgccgtc gctctggcgg tagcactgat cttctggtat 60
ctgaaatctt acactagcgc gcgccgctct cagtccaacc acctgccgcg tgtgccggaa 120
gttccgggtg tgccactgct gggcaacctg ctgcaactga aagaaaagaa accgtacatg 180
acctttaccc gctgggcagc gacttatggt cctatctaca gcattaaaac cggcgctacg 240
tctatggttg tggtttcttc caacgaaatc gcgaaagaag ccctggtgac tcgtttccag 300
tccattagca cccgcaacct gtccaaagcg ctgaaggttc tgacggctga caagactatg 360
gtggctatga gcgactatga tgactaccac aaaaccgtta aacgtcacat cctgaccgca 420
gtactgggtc cgaacgcaca gaaaaaacat cgcatccacc gcgacattat gatggataac 480
atctccacgc agctgcatga gttcgttaag aacaatccag aacaggaaga ggtagatctg 540
cgtaaaattt ttcagtccga actgttcggt ctggctatgc gtcaggcgct gggcaaagac 600
gttgaaagcc tgtatgtcga agacctgaaa attaccatga accgtgatga gatcttccag 660
gttctggttg tagatccgat gatgggcgcc atcgacgtgg attggcgtga cttctttccg 720
tacctgaaat gggtcccgaa caagaagttc gaaaacacca tccagcaaat gtacatccgt 780
cgtgaagcgg tgatgaaaag cctgatcaaa gaacacaaaa agcgtattgc ttctggtgag 840
aaactgaact cctacatcga ttatctgctg tccgaagcgc agaccctgac cgaccaacag 900
ctgctgatgt ctctgtggga accgattatc gaaagcagcg acaccactat ggtcactacc 960
gaatgggcaa tgtatgagct ggccaaaaac ccgaaactgc aggatcgtct gtaccgtgac 1020
atcaaaagcg tttgcggctc cgagaaaatc actgaagaac acctgtctca gctgccgtac 1080
atcactgcta ttttccacga aaccctgcgt cgccattctc cggttccgat cattccgctg 1140
cgtcacgttc acgaagatac tgtgctgggt ggttaccatg taccggcagg cactgaactg 1200
gctgtcaaca tctacggctg taacatggat aaaaacgttt gggagaatcc tgaagaatgg 1260
aacccggaac gcttcatgaa agagaacgaa accatcgact tccagaaaac gatggctttc 1320
ggcggtggta aacgtgtgtg cgcaggttct ctgcaggcgc tgctgacggc gtccattggt 1380
atcggtcgca tggtacagga atttgaatgg aagctgaaag acatgaccca agaagaggtg 1440
aataccattg gtctgactac ccagatgctg cgtccactgc gtgcaatcat caaacctcgt 1500
att 1503
<210> 40
<211> 546
<212> PRT
<213> 小立碗藓
<400> 40
Met Ala Lys His Leu Ala Thr Gln Leu Leu Gln Gln Trp Asn Glu Ala
1 5 10 15
Leu Lys Thr Met Pro Pro Gly Phe Arg Thr Ala Gly Lys Ile Leu Val
20 25 30
Trp Glu Glu Leu Ala Ser Asn Lys Val Leu Ile Thr Ile Ala Leu Ala
35 40 45
Trp Val Leu Leu Phe Val Ala Arg Thr Cys Leu Arg Asn Lys Lys Arg
50 55 60
Leu Pro Pro Ala Ile Pro Gly Gly Leu Pro Val Leu Gly Asn Leu Leu
65 70 75 80
Gln Leu Thr Glu Lys Lys Pro His Arg Thr Phe Thr Ala Trp Ser Lys
85 90 95
Glu His Gly Pro Ile Phe Thr Ile Lys Val Gly Ser Val Pro Gln Ala
100 105 110
Val Val Asn Asn Ser Glu Ile Ala Lys Glu Val Leu Val Thr Lys Phe
115 120 125
Ala Ser Ile Ser Lys Arg Gln Met Pro Met Ala Leu Arg Val Leu Thr
130 135 140
Arg Asp Lys Thr Met Val Ala Met Ser Asp Tyr Gly Glu Glu His Arg
145 150 155 160
Met Leu Lys Lys Leu Val Met Thr Asn Leu Leu Gly Pro Thr Thr Gln
165 170 175
Asn Lys Asn Arg Ser Leu Arg Asp Asp Ala Leu Ile Gly Met Ile Glu
180 185 190
Gly Val Leu Ala Glu Leu Lys Ala Ser Pro Thr Ser Pro Lys Val Val
195 200 205
Asn Val Arg Asp Tyr Val Gln Arg Ser Leu Phe Pro Phe Ala Leu Gln
210 215 220
Gln Val Phe Gly Tyr Ile Pro Asp Gln Val Glu Val Leu Glu Leu Gly
225 230 235 240
Thr Cys Val Ser Thr Trp Asp Met Phe Asp Ala Leu Val Val Ala Pro
245 250 255
Leu Ser Ala Val Ile Asn Val Asp Trp Arg Asp Phe Phe Pro Ala Leu
260 265 270
Arg Trp Ile Pro Asn Arg Ser Val Glu Asp Leu Val Arg Thr Val Asp
275 280 285
Phe Lys Arg Asn Ser Ile Met Lys Ala Leu Ile Arg Ala Gln Arg Met
290 295 300
Arg Leu Ala Asn Leu Lys Glu Pro Pro Arg Cys Tyr Ala Asp Ile Ala
305 310 315 320
Leu Thr Glu Ala Thr His Leu Thr Glu Lys Gln Leu Glu Met Ser Leu
325 330 335
Trp Glu Pro Ile Ile Glu Ser Ala Asp Thr Thr Leu Val Thr Ser Glu
340 345 350
Trp Ala Met Tyr Glu Ile Ala Lys Asn Pro Asp Cys Gln Asp Arg Leu
355 360 365
Tyr Arg Glu Ile Val Ser Val Ala Gly Thr Glu Arg Met Val Thr Glu
370 375 380
Asp Asp Leu Pro Asn Met Pro Tyr Leu Gly Ala Ile Ile Lys Glu Thr
385 390 395 400
Leu Arg Lys Tyr Thr Pro Val Pro Leu Ile Pro Ser Arg Phe Val Glu
405 410 415
Glu Asp Ile Thr Leu Gly Gly Tyr Asp Ile Pro Lys Gly Tyr Gln Ile
420 425 430
Leu Val Asn Leu Phe Ala Ile Ala Asn Asp Pro Ala Val Trp Ser Asn
435 440 445
Pro Glu Lys Trp Asp Pro Glu Arg Met Leu Ala Asn Lys Lys Val Asp
450 455 460
Met Gly Phe Arg Asp Phe Ser Leu Met Pro Phe Gly Ala Gly Lys Arg
465 470 475 480
Met Cys Ala Gly Ile Thr Gln Ala Met Phe Ile Ile Pro Met Asn Val
485 490 495
Ala Ala Leu Val Gln His Cys Glu Trp Arg Leu Ser Pro Gln Glu Ile
500 505 510
Ser Asn Ile Asn Asn Lys Ile Glu Asp Val Val Tyr Leu Thr Thr His
515 520 525
Lys Leu Ser Pro Leu Ser Cys Glu Ala Thr Pro Arg Ile Ser His Arg
530 535 540
Leu Pro
545
<210> 41
<211> 548
<212> PRT
<213> 小立碗藓
<400> 41
Met Ala Leu Leu Leu Ala Val Phe Thr Gln Leu Leu Gln Gln Trp Asn
1 5 10 15
Glu Ala Leu Lys Thr Met Pro Pro Gly Phe Arg Thr Ala Gly Lys Ile
20 25 30
Leu Val Trp Glu Glu Leu Ala Ser Asn Lys Val Leu Ile Thr Ile Ala
35 40 45
Leu Ala Trp Val Leu Leu Phe Val Ala Arg Thr Cys Leu Arg Asn Lys
50 55 60
Lys Arg Leu Pro Pro Ala Ile Pro Gly Gly Leu Pro Val Leu Gly Asn
65 70 75 80
Leu Leu Gln Leu Thr Glu Lys Lys Pro His Arg Thr Phe Thr Ala Trp
85 90 95
Ser Lys Glu His Gly Pro Ile Phe Thr Ile Lys Val Gly Ser Val Pro
100 105 110
Gln Ala Val Val Asn Asn Ser Glu Ile Ala Lys Glu Val Leu Val Thr
115 120 125
Lys Phe Ala Ser Ile Ser Lys Arg Gln Met Pro Met Ala Leu Arg Val
130 135 140
Leu Thr Arg Asp Lys Thr Met Val Ala Met Ser Asp Tyr Gly Glu Glu
145 150 155 160
His Arg Met Leu Lys Lys Leu Val Met Thr Asn Leu Leu Gly Pro Thr
165 170 175
Thr Gln Asn Lys Asn Arg Ser Leu Arg Asp Asp Ala Leu Ile Gly Met
180 185 190
Ile Glu Gly Val Leu Ala Glu Leu Lys Ala Ser Pro Thr Ser Pro Lys
195 200 205
Val Val Asn Val Arg Asp Tyr Val Gln Arg Ser Leu Phe Pro Phe Ala
210 215 220
Leu Gln Gln Val Phe Gly Tyr Ile Pro Asp Gln Val Glu Val Leu Glu
225 230 235 240
Leu Gly Thr Cys Val Ser Thr Trp Asp Met Phe Asp Ala Leu Val Val
245 250 255
Ala Pro Leu Ser Ala Val Ile Asn Val Asp Trp Arg Asp Phe Phe Pro
260 265 270
Ala Leu Arg Trp Ile Pro Asn Arg Ser Val Glu Asp Leu Val Arg Thr
275 280 285
Val Asp Phe Lys Arg Asn Ser Ile Met Lys Ala Leu Ile Arg Ala Gln
290 295 300
Arg Met Arg Leu Ala Asn Leu Lys Glu Pro Pro Arg Cys Tyr Ala Asp
305 310 315 320
Ile Ala Leu Thr Glu Ala Thr His Leu Thr Glu Lys Gln Leu Glu Met
325 330 335
Ser Leu Trp Glu Pro Ile Ile Glu Ser Ala Asp Thr Thr Leu Val Thr
340 345 350
Ser Glu Trp Ala Met Tyr Glu Ile Ala Lys Asn Pro Asp Cys Gln Asp
355 360 365
Arg Leu Tyr Arg Glu Ile Val Ser Val Ala Gly Thr Glu Arg Met Val
370 375 380
Thr Glu Asp Asp Leu Pro Asn Met Pro Tyr Leu Gly Ala Ile Ile Lys
385 390 395 400
Glu Thr Leu Arg Lys Tyr Thr Pro Val Pro Leu Ile Pro Ser Arg Phe
405 410 415
Val Glu Glu Asp Ile Thr Leu Gly Gly Tyr Asp Ile Pro Lys Gly Tyr
420 425 430
Gln Ile Leu Val Asn Leu Phe Ala Ile Ala Asn Asp Pro Ala Val Trp
435 440 445
Ser Asn Pro Glu Lys Trp Asp Pro Glu Arg Met Leu Ala Asn Lys Lys
450 455 460
Val Asp Met Gly Phe Arg Asp Phe Ser Leu Met Pro Phe Gly Ala Gly
465 470 475 480
Lys Arg Met Cys Ala Gly Ile Thr Gln Ala Met Phe Ile Ile Pro Met
485 490 495
Asn Val Ala Ala Leu Val Gln His Cys Glu Trp Arg Leu Ser Pro Gln
500 505 510
Glu Ile Ser Asn Ile Asn Asn Lys Ile Glu Asp Val Val Tyr Leu Thr
515 520 525
Thr His Lys Leu Ser Pro Leu Ser Cys Glu Ala Thr Pro Arg Ile Ser
530 535 540
His Arg Leu Pro
545
<210> 42
<211> 1644
<212> DNA
<213> 小立碗藓
<400> 42
atggctctgt tattagcagt ttttacgcaa ctgctgcaac aatggaatga agctctgaag 60
acgatgccgc cgggttttcg caccgctggc aaaattctgg tgtgggaaga actggcaagc 120
aataaagttc tgattacgat cgcactggct tgggtcctgc tgtttgtggc tcgtacctgc 180
ctgcgcaata aaaagcgtct gccgccggca atcccgggcg gtctgccggt gctgggcaac 240
ctgctgcagc tgacggaaaa gaaaccgcat cgtaccttta cggcgtggag caaggaacac 300
ggcccgattt tcaccatcaa agtcggttcg gtgccgcagg ctgtggttaa caatagcgaa 360
attgcgaaag aagtcctggt gaccaagttc gccagcatct ctaaacgtca aatgccgatg 420
gcactgcgcg tcctgacgcg tgataaaacg atggtggcta tgtctgacta tggcgaagaa 480
catcgcatgc tgaaaaagct ggtgatgacg aatctgctgg gtccgaccac gcagaacaaa 540
aatcgtagtc tgcgcgatga cgcactgatt ggcatgatcg aaggtgttct ggcggaactg 600
aaggccagtc cgacctcccc gaaagtcgtg aacgttcgcg attatgtcca gcgttctctg 660
tttccgttcg cgctgcagca agtgtttggc tacattccgg atcaagttga agtcctggaa 720
ctgggcacgt gtgtttctac ctgggatatg ttcgacgcac tggttgtcgc tccgctgagt 780
gcggttatta acgtcgattg gcgtgacttt ttcccggccc tgcgctggat tccgaatcgt 840
tccgtggaag atctggtgcg caccgttgac tttaagcgta actcaattat gaaagccctg 900
atccgtgcac agcgtatgcg cctggctaac ctgaaggaac cgccgcgctg ctacgcagat 960
attgctctga ccgaagcgac gcacctgacc gaaaaacaac tggaaatgag tctgtgggaa 1020
ccgattatcg aatccgccga taccacgctg gtgacctcag aatgggctat gtatgaaatt 1080
gcgaaaaatc cggattgtca ggaccgtctg taccgcgaaa tcgtgtccgt tgccggcacg 1140
gaacgcatgg ttaccgaaga tgacctgccg aacatgccgt atctgggtgc aattatcaaa 1200
gaaacgctgc gcaagtacac cccggttccg ctgattccga gtcgttttgt cgaagaagat 1260
atcaccctgg gcggttatga cattccgaaa ggttaccaga tcctggtcaa cctgttcgcg 1320
attgccaatg atccggccgt ttggtcgaac ccggaaaaat gggacccgga acgcatgctg 1380
gcaaataaaa aggtggatat gggctttcgt gacttcagcc tgatgccgtt tggcgccggt 1440
aaacgcatgt gcgccggtat cacccaagca atgttcatta tcccgatgaa tgtggcggcc 1500
ctggttcagc attgtgaatg gcgcctgagc ccgcaagaaa tctctaacat caacaacaag 1560
atcgaagatg tggtttacct gaccacgcat aaactgtcac cgctgtcgtg cgaagcaacc 1620
ccgcgtatca gccaccgtct gccg 1644
<210> 43
<211> 1049
<212> PRT
<213> 巨大芽孢杆菌
<400> 43
Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys
1 5 10 15
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys
20 25 30
Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg
35 40 45
Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp
50 55 60
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg
65 70 75 80
Asp Phe Ala Gly Asp Gly Leu Ala Thr Ser Trp Thr His Glu Lys Asn
85 90 95
Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala
100 105 110
Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val
115 120 125
Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu
130 135 140
Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn
145 150 155 160
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr
165 170 175
Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala
180 185 190
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu
195 200 205
Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg
210 215 220
Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn
225 230 235 240
Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg
245 250 255
Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly
260 265 270
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu
275 280 285
Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro
290 295 300
Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn
305 310 315 320
Glu Ala Leu Arg Leu Trp Pro Thr Ile Pro Ala Phe Ser Leu Tyr Ala
325 330 335
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp
340 345 350
Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp
355 360 365
Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser
370 375 380
Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala
385 390 395 400
Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly
405 410 415
Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu
420 425 430
Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys
435 440 445
Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr
450 455 460
Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn
465 470 475 480
Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly
485 490 495
Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro
500 505 510
Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly
515 520 525
Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn
530 535 540
Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val
545 550 555 560
Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala
565 570 575
Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala
580 585 590
Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp
595 600 605
Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp
610 615 620
Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys
625 630 635 640
Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu
645 650 655
Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu
660 665 670
Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu
675 680 685
Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile
690 695 700
Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly
705 710 715 720
Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu
725 730 735
Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln
740 745 750
Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met
755 760 765
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu
770 775 780
Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr
785 790 795 800
Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser
805 810 815
Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile
820 825 830
Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser
835 840 845
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile
850 855 860
Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys
865 870 875 880
Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu
885 890 895
Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg
900 905 910
Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu
915 920 925
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr
930 935 940
Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr
945 950 955 960
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val
965 970 975
Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp
980 985 990
Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro
995 1000 1005
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln
1010 1015 1020
Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu
1025 1030 1035
Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
1040 1045
<210> 44
<211> 3150
<212> DNA
<213> 巨大芽孢杆菌
<400> 44
atgacgatta aagaaatgcc gcaaccgaag acgtttggcg aactgaagaa cctgccgctg 60
ctgaacacgg ataagccggt gcaagccctg atgaagattg ctgatgaact gggcgaaatc 120
tttaaattcg aagcgccggg tcgtgtgacc cgttatctga gcagccagcg tctgattaaa 180
gaagcctgcg atgaatcgcg ctttgacaag aacctgagcc aggcactgaa atttgttcgt 240
gatttcgcag gtgacggtct ggccaccagc tggacgcatg aaaagaactg gaaaaaggcc 300
cacaatattc tgctgccgtc gttcagccag caagcaatga aaggctacca tgctatgatg 360
gtcgatatcg cggttcagct ggtccaaaaa tgggaacgtc tgaatgcgga cgaacacatt 420
gaagtgccgg aagatatgac ccgcctgacg ctggacacca tcggtctgtg tggctttaac 480
tatcgtttta attcgttcta ccgcgatcag ccgcatccgt tcattaccag catggtgcgt 540
gcgctggacg aagccatgaa caaactgcag cgtgcaaacc cggatgaccc ggcgtatgat 600
gaaaacaagc gtcagtttca agaagacatc aaagtgatga atgatctggt tgacaagatt 660
atcgcagatc gcaaagcgag cggcgaacag tcagatgacc tgctgacgca catgctgaac 720
ggcaaagacc cggaaaccgg tgaaccgctg gatgacgaaa acatccgtta tcagatcatc 780
acctttctga tcgcaggcca tgaaaccacg tcgggtctgc tgagctttgc gctgtacttc 840
ctggtcaaga acccgcacgt gctgcagaaa gcggccgaag aagcagctcg tgtgctggtt 900
gatccggttc cgtcgtataa acaggtcaag caactgaaat acgtgggtat ggttctgaat 960
gaagcgctgc gcctgtggcc gacgattccg gcatttagcc tgtatgctaa ggaagatacc 1020
gttctgggcg gtgaataccc gctggaaaaa ggcgatgaac tgatggtcct gattccgcag 1080
ctgcatcgcg acaaaaccat ctggggtgat gacgtggaag aatttcgccc ggaacgcttc 1140
gaaaacccga gcgcgattcc gcagcatgcc tttaaaccgt tcggcaatgg tcaacgtgcg 1200
tgcatcggcc agcaatttgc gctgcacgaa gccacgctgg ttctgggtat gatgctgaaa 1260
cattttgatt tcgaagacca caccaactat gaactggata ttaaggaaac cctgacgctg 1320
aaaccggaag gcttcgtggt taaagcgaag tctaaaaaga ttccgctggg cggtatcccg 1380
tctccgagta cggaacagag tgccaaaaag gtccgtaaaa aggcggaaaa cgcccataat 1440
accccgctgc tggtgctgta tggttctaac atgggcacgg cagaaggcac cgctcgcgat 1500
ctggcagaca ttgctatgtc taaaggtttt gcgccgcagg tggccacgct ggatagtcat 1560
gcaggcaatc tgccgcgtga aggtgctgtc ctgatcgtga ccgcaagcta caacggtcac 1620
ccgccggata atgcgaagca gttcgttgat tggctggacc aagcgtcggc cgatgaagtt 1680
aaaggtgtcc gctatagcgt gtttggctgt ggtgacaaga actgggctac cacgtaccag 1740
aaagttccgg cgttcattga tgaaacgctg gcggccaaag gcgcagaaaa tatcgctgat 1800
cgtggtgaag cagacgcttc cgatgacttt gaaggcacct atgaagaatg gcgcgaacac 1860
atgtggtcgg atgtggcagc ttacttcaac ctggatattg aaaacagcga agacaataaa 1920
tccaccctgt cactgcagtt tgttgatagt gcggccgaca tgccgctggc aaagatgcac 1980
ggcgctttct ccacgaatgt cgtggcttca aaagaactgc agcaaccggg ttcggcacgt 2040
agcacccgcc atctggaaat tgaactgccg aaagaagcca gctatcagga aggcgatcac 2100
ctgggtgtga ttccgcgtaa ctacgaaggc atcgtgaatc gtgttacggc ccgctttggt 2160
ctggatgcat cccagcaaat ccgcctggaa gcggaagaag aaaagctggc gcatctgccg 2220
ctggccaaaa ccgtctcagt ggaagaactg ctgcagtatg tggaactgca agatccggtt 2280
acccgtacgc agctgcgtgc gatggcggct aagaccgtct gcccgccgca caaagtggaa 2340
ctggaagctc tgctggaaaa gcaggcgtat aaagaacaag tgctggcgaa acgcctgacc 2400
atgctggaac tgctggaaaa gtacccggcc tgtgaaatga agttctctga atttatcgca 2460
ctgctgccgt ctatccgtcc gcgttattac agtattagtt cctcaccgcg tgtggatgaa 2520
aaacaggcca gtatcaccgt ttctgttgtc agtggcgaag catggtctgg ctatggtgaa 2580
tacaagggta tcgcaagtaa ctacctggct gaactgcagg aaggcgatac cattacgtgc 2640
tttatctcta cgccgcaaag tgaatttacc ctgccgaaag acccggaaac gccgctgatc 2700
atggttggcc cgggcaccgg tgtcgcaccg tttcgtggtt tcgtgcaggc acgcaagcaa 2760
ctgaaagaac agggccaatc cctgggtgaa gcgcatctgt attttggctg tcgctcaccg 2820
cacgaagatt atctgtacca ggaagaactg gaaaacgcgc aatccgaagg tattatcacg 2880
ctgcataccg ccttctcacg tatgccgaat cagccgaaaa cctacgtcca gcacgtgatg 2940
gaacaagatg gcaaaaagct gattgaactg ctggaccagg gtgcgcattt ttatatctgc 3000
ggtgatggca gccaaatggc accggcagtg gaagcaaccc tgatgaaatc ctacgcagat 3060
gttcaccagg tctcagaagc agacgctcgt ctgtggctgc agcaactgga agaaaagggc 3120
cgctatgcga aagatgtttg ggccggttaa 3150
<210> 45
<211> 396
<212> PRT
<213> 紫孢侧耳
<400> 45
Met Arg Tyr Gly Cys Ala Ala Val Ala Leu Phe Tyr Leu Thr Ala Met
1 5 10 15
Gly Lys Leu His Pro Leu Ala Ile Ile Pro Asp Tyr Lys Gly Ser Met
20 25 30
Ala Ala Ser Val Thr Ile Phe Asn Lys Arg Thr Asn Pro Leu Asp Ile
35 40 45
Ser Val Asn Gln Ala Asn Asp Trp Pro Trp Arg Tyr Ala Lys Thr Cys
50 55 60
Val Leu Ser Ser Asp Trp Ala Leu His Glu Met Ile Ile His Leu Asn
65 70 75 80
Asn Thr His Leu Val Glu Glu Ala Val Ile Val Ala Ala Gln Arg Lys
85 90 95
Leu Ser Pro Ser His Ile Val Phe Arg Leu Leu Glu Pro His Trp Val
100 105 110
Val Thr Leu Ser Leu Asn Ala Leu Ala Arg Ser Val Leu Ile Pro Glu
115 120 125
Val Ile Val Pro Ile Ala Gly Phe Ser Ala Pro His Ile Phe Gln Phe
130 135 140
Ile Arg Glu Ser Phe Thr Asn Phe Asp Trp Lys Ser Leu Tyr Val Pro
145 150 155 160
Ala Asp Leu Glu Ser Arg Gly Phe Pro Val Asp Gln Leu Asn Ser Pro
165 170 175
Lys Phe His Asn Tyr Ala Tyr Ala Arg Asp Ile Asn Asp Met Trp Thr
180 185 190
Thr Leu Lys Lys Phe Val Ser Ser Val Leu Gln Asp Ala Gln Tyr Tyr
195 200 205
Pro Asp Asp Ala Ser Val Ala Gly Asp Thr Gln Ile Gln Ala Trp Cys
210 215 220
Asp Glu Met Arg Ser Gly Met Gly Ala Gly Met Thr Asn Phe Pro Glu
225 230 235 240
Ser Ile Thr Thr Val Asp Asp Leu Val Asn Met Val Thr Met Cys Ile
245 250 255
His Ile Ala Ala Pro Gln His Thr Ala Val Asn Tyr Leu Gln Gln Tyr
260 265 270
Tyr Gln Thr Phe Val Ser Asn Lys Pro Ser Ala Leu Phe Ser Pro Leu
275 280 285
Pro Thr Ser Ile Ala Gln Leu Gln Lys Tyr Thr Glu Ser Asp Leu Met
290 295 300
Ala Ala Leu Pro Leu Asn Ala Lys Arg Gln Trp Leu Leu Met Ala Gln
305 310 315 320
Ile Pro Tyr Leu Leu Ser Met Gln Val Gln Glu Asp Glu Asn Ile Val
325 330 335
Thr Tyr Ala Ala Asn Ala Ser Thr Asp Lys Asp Pro Ile Ile Ala Ser
340 345 350
Ala Gly Arg Gln Leu Ala Ala Asp Leu Lys Lys Leu Ala Ala Val Phe
355 360 365
Leu Val Asn Ser Ala Gln Leu Asp Asp Gln Asn Thr Pro Tyr Asp Val
370 375 380
Leu Ala Pro Glu Gln Leu Ala Asn Ala Ile Val Ile
385 390 395
<210> 46
<211> 1191
<212> DNA
<213> 紫孢侧耳
<400> 46
atgcgttatg gctgtgctgc tgtggctctg ttctatctga ccgctatggg caaactgcac 60
ccgctggcta ttatcccgga ctacaagggt agcatggcgg cctctgtcac catttttaac 120
aaacgtacga atccgctgga tatcagcgtt aaccaggcaa atgactggcc gtggcgctat 180
gctaagacgt gcgtgctgag cagcgattgg gcgctgcatg aaatgattat ccacctgaac 240
aatacccatc tggtggaaga agccgtcatt gtggcagctc agcgtaaact gtcaccgtcg 300
cacatcgttt ttcgcctgct ggaaccgcat tgggtggtta ccctgtcgct gaacgcactg 360
gctcgtagcg tgctgatccc ggaagttatt gtcccgatcg cgggtttctc tgccccgcac 420
atttttcagt tcatccgcga atcttttacc aatttcgatt ggaaaagtct gtacgtcccg 480
gcggacctgg aatcgcgtgg ctttccggtg gatcagctga acagcccgaa gttccataat 540
tatgcgtacg cccgcgatat caacgacatg tggaccacgc tgaaaaagtt tgtgagttcc 600
gttctgcagg atgcccaata ttacccggat gacgcaagtg tggctggtga tacgcagatt 660
caagcatggt gcgacgaaat gcgttccggc atgggtgcgg gcatgaccaa cttcccggaa 720
tcaatcacca cggttgatga cctggtcaat atggtgacca tgtgtattca catcgcggcc 780
ccgcagcata cggcggttaa ctatctgcag caatactacc aaaccttcgt cagtaacaag 840
ccgtccgcac tgttctcacc gctgccgacc tctattgctc agctgcaaaa atacacggaa 900
agtgatctga tggcagctct gccgctgaac gcgaagcgtc agtggctgct gatggcccaa 960
attccgtatc tgctgtcgat gcaggtgcaa gaagatgaaa acatcgttac ctacgcggcc 1020
aatgcgtcca cggataaaga cccgattatc gcatcagctg gccgccagct ggcagctgac 1080
ctgaaaaagc tggcggccgt ttttctggtc aactcagccc agctggatga ccaaaatacc 1140
ccgtatgatg tgctggcacc ggaacagctg gcgaatgcca ttgttatcta a 1191
<210> 47
<211> 640
<212> PRT
<213> 糙皮侧耳
<400> 47
Met Ala Pro Thr Met Ser Leu Ser Arg Ser Ala Leu Lys Asn Val His
1 5 10 15
Leu Pro Tyr Met Val Gln His Pro Glu Pro Thr Asp Cys Ser Thr Ala
20 25 30
Met Lys His Ala Ala Glu Gly Tyr Asp Arg Ala Arg Gln Met Ile Ala
35 40 45
Phe Leu Phe Asp Ile Leu Asp Tyr Glu Ser Ser Val Pro Gln Lys Phe
50 55 60
Thr Pro Glu Glu Lys Lys Glu Lys Tyr Thr Trp Ser His Ser Asp Lys
65 70 75 80
Phe Pro Pro His Leu Ala Ile Ile Pro Glu Asp Ile Asp Val Pro Ala
85 90 95
Tyr Ile Ile Phe Ser Ile Val Arg Leu Val Gln Thr Leu Ser Ile Met
100 105 110
Ser Gly Ile Gln Cys Asn Glu Arg Leu Ala Pro Gly Pro Glu Gln Asn
115 120 125
Thr Met Glu Lys Leu Thr Lys Trp Asn Ala Glu Arg His Lys Asn Gln
130 135 140
Gly Trp Val Lys Asp Met Phe Asn Glu Pro Asn Ile Gly Leu Arg Asn
145 150 155 160
Asp Trp Tyr Thr Asp Ala Val Phe Ala Gln Gln Phe Phe Thr Gly Pro
165 170 175
Asn Pro Thr Thr Ile Thr Leu Ala Ser Asp Thr Trp Met Lys Ala Phe
180 185 190
Thr Glu Glu Ala Ala Ser Gln Gly Lys Arg Asp Leu Ile Ser Leu Phe
195 200 205
Arg Ser Ala Pro Pro Asn Ser Phe Tyr Val Gln Asp Phe Ser Asp Phe
210 215 220
Arg Ala Arg Met Gly Ala Lys Pro Asp Glu Glu Leu Cys Ala Thr Ser
225 230 235 240
Asp Gly Gly Val Thr Arg Tyr Gly Cys Ala Ala Val Ala Leu Phe Tyr
245 250 255
Leu Pro Pro Thr Gly Glu Leu His Pro Leu Ala Ile Val Pro Asp Tyr
260 265 270
Lys Gly Ser Met Ala Ala Ser Ile Thr Leu Phe Asn Lys Arg Val Asp
275 280 285
Pro Ser Asp Ala Ser Val Asp Gln Ala Asn Asp Trp Pro Trp Arg Tyr
290 295 300
Ala Lys Thr Cys Val Leu Ser Ala Asp Trp Val Leu His Glu Met Ile
305 310 315 320
Ile His Leu Asn Asn Thr His Leu Val Gln Glu Ala Val Ile Val Ala
325 330 335
Val Gln Arg Thr Leu Pro Asp Ser His Ile Val Phe Arg Leu Leu Lys
340 345 350
Pro His Trp Val Val Thr Leu Ser Leu Asn Ala Gln Ala Arg Ser Val
355 360 365
Leu Ile Pro Glu Val Ile Val Pro Ile Ala Gly Phe Ser Glu Leu Arg
370 375 380
Ile Phe Gln Phe Val Gly His Ala Phe Thr Asn Phe Asp Trp Lys Ala
385 390 395 400
Leu Tyr Val Pro Thr Asp Leu Glu Phe Arg Gly Phe Pro Leu Asp Arg
405 410 415
Leu Asp Asp Asp Lys Phe His Asn Tyr Ala Tyr Ala Lys Asp Ile Lys
420 425 430
Asp Met Trp Met Ala Leu Arg Lys Phe Val Ser Ser Val Leu Lys Asp
435 440 445
Gly Lys Tyr Tyr Pro Asp Asp Ser Ala Val Ala Ala Asp Ala Gln Ile
450 455 460
Gln Asp Trp Cys Asp Glu Met Arg Ser Glu Lys Gly Ala Gly Met Lys
465 470 475 480
Lys Phe Pro Glu Ser Ile Ser Thr Leu Asp Asp Leu Ile Asp Met Val
485 490 495
Thr Met Cys Ile His Ile Ala Ala Pro Gln His Thr Ala Val Asn Tyr
500 505 510
Leu Gln Gln Tyr Tyr Gln Thr Phe Val Pro Asn Lys Pro Ser Ala Leu
515 520 525
Phe Ser Pro Leu Pro Thr Leu Leu Ser Gln Leu Glu Ser Tyr Thr Glu
530 535 540
Ser Asp Leu Met Ala Ala Leu Pro Leu Gly Ala Lys Gln Glu Trp Leu
545 550 555 560
Leu Met Ala Gln Val Pro Tyr Leu Leu Ser Lys Glu Val Glu Gln Asp
565 570 575
Gly Asn Ile Val Thr Tyr Ala Gly Thr Ala Ser Asn Asn Glu Asp Pro
580 585 590
Ile Ile Ala Ala Ala Gly Lys Glu Leu Ser Ala Asp Leu Val Ile Leu
595 600 605
Ala Gly Val Phe Leu Lys Asn Ser Glu Lys Leu Asp Asp Gln Asn Thr
610 615 620
Ala Tyr Asn Val Leu Ala Pro Asp Gln Leu Ala Asn Ala Ile Val Ile
625 630 635 640
<210> 48
<211> 1923
<212> DNA
<213> 糙皮侧耳
<400> 48
atggccccga cgatgtcact gtctcgctcc gcactgaaga atgtccacct gccgtatatg 60
gtccaacacc cggaaccgac cgattgcagc accgcgatga aacacgcggc cgaaggttat 120
gatcgtgctc gccagatgat tgcgtttctg ttcgacatcc tggattacga aagctctgtt 180
ccgcaaaaat ttaccccgga agaaaagaaa gaaaaatata cgtggtcaca ctcggataag 240
ttcccgccgc atctggccat tatcccggaa gacattgatg tgccggcata cattatcttt 300
agcatcgttc gtctggtcca gaccctgagt attatgtccg gcatccaatg caacgaacgt 360
ctggcaccgg ggccggaaca gaatacgatg gaaaaactga cgaagtggaa cgcggaacgt 420
cataaaaatc aaggctgggt caaggatatg tttaacgaac cgaatattgg tctgcgcaac 480
gactggtata ccgatgctgt gttcgcgcag caatttttca cgggtccgaa tccgaccacg 540
attaccctgg cctctgatac gtggatgaaa gcatttaccg aagaagcagc tagtcagggc 600
aagcgtgacc tgatcagcct gtttcgctct gccccgccga actccttcta cgttcaggac 660
ttttcagatt tccgtgctcg catgggcgcg aaaccggacg aagaactgtg cgcgacctct 720
gatggcggtg ttacccgtta tggctgtgca gcagtcgcac tgttttacct gccgccgacc 780
ggtgaactgc atccgctggc cattgtgccg gattataaag gcagtatggc agcttccatc 840
acgctgttca acaagcgtgt ggacccgtca gatgcctcgg ttgaccaggc aaatgattgg 900
ccgtggcgct acgctaaaac ctgtgttctg tccgcggatt gggtcctgca tgaaatgatt 960
atccacctga acaataccca tctggtgcag gaagccgtca ttgtggcagt tcaacgtacg 1020
ctgccggatt cacacatcgt ttttcgcctg ctgaaaccgc attgggtggt taccctgtcg 1080
ctgaatgccc aggcacgtag cgttctgatc ccggaagtca ttgtgccgat cgcgggcttc 1140
agtgaactgc gcatctttca gttcgttggt cacgccttta ccaacttcga ctggaaagca 1200
ctgtatgtcc cgacggatct ggaatttcgt ggtttcccgc tggaccgcct ggatgacgat 1260
aagttccata actatgctta cgcgaaggac attaaggata tgtggatggc cctgcgtaag 1320
ttcgtgagtt ccgttctgaa agatggcaag tattacccgg acgattcggc tgttgcagca 1380
gacgcgcaga ttcaagactg gtgcgatgaa atgcgcagcg aaaaaggcgc gggtatgaaa 1440
aagttcccgg aaagcatttc taccctggac gatctgatcg atatggtgac gatgtgtatt 1500
cacatcgcag ctccgcagca taccgccgtg aactatctgc agcaatatta ccaaacgttt 1560
gttccgaata aaccgtcagc actgttctcg ccgctgccga ccctgctgag ccagctggaa 1620
tcttacacgg aaagtgatct gatggcggcc ctgccgctgg gtgctaaaca ggaatggctg 1680
ctgatggcgc aagtgccgta tctgctgtct aaggaagtcg aacaggatgg caacattgtg 1740
acctacgccg gtacggcaag taacaatgaa gatccgatta tcgcagctgc gggcaaagaa 1800
ctgtccgctg acctggtcat cctggcgggt gtgtttctga aaaactcaga aaagctggac 1860
gatcagaaca ccgcctataa tgtcctggca ccggatcaac tggccaatgc aattgtgatc 1920
taa 1923
<210> 49
<211> 495
<212> PRT
<213> 菊苣
<400> 49
Met Glu Ile Ser Ile Pro Thr Thr Leu Gly Leu Ala Val Ile Ile Phe
1 5 10 15
Ile Ile Phe Lys Leu Leu Thr Arg Thr Thr Ser Lys Lys Asn Leu Leu
20 25 30
Pro Glu Pro Trp Arg Leu Pro Ile Ile Gly His Met His His Leu Ile
35 40 45
Gly Thr Met Pro His Arg Gly Val Met Glu Leu Ala Arg Lys His Gly
50 55 60
Ser Leu Met His Leu Gln Leu Gly Glu Val Ser Thr Ile Val Val Ser
65 70 75 80
Ser Pro Arg Trp Ala Lys Glu Val Leu Thr Thr Tyr Asp Ile Thr Phe
85 90 95
Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu Ile Val Ala Tyr His Asn
100 105 110
Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu Tyr Trp Arg Gln Leu Arg
115 120 125
Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn Lys Lys Val Lys Ser Phe
130 135 140
Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn Leu Val Lys Asp Ile Arg
145 150 155 160
Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu Ser Glu Asn Ile Phe Lys
165 170 175
Met Ile Ala Thr Ile Leu Ser Arg Ala Ala Phe Gly Lys Gly Ile Lys
180 185 190
Asp Gln Met Lys Phe Thr Glu Leu Val Lys Glu Ile Leu Arg Leu Thr
195 200 205
Gly Gly Phe Asp Val Ala Asp Ile Phe Pro Ser Lys Lys Leu Leu His
210 215 220
His Leu Ser Gly Lys Arg Ala Lys Leu Thr Asn Ile His Asn Lys Leu
225 230 235 240
Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu His Pro Gly Asn Arg Thr
245 250 255
Ser Ser Ser Gln Glu Thr Leu Leu Asp Val Leu Leu Arg Leu Lys Glu
260 265 270
Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn Val Lys Ala Val Ile Leu
275 280 285
Asp Met Phe Gly Ala Gly Thr Asp Thr Ser Ser Ala Thr Ile Glu Trp
290 295 300
Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg Ala Met Glu Lys Val Gln
305 310 315 320
Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys Glu Arg Ile Gln Glu Glu
325 330 335
Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu Val Ile Lys Glu Thr Leu
340 345 350
Arg Leu His Pro Pro Leu Pro Leu Val Met Pro Arg Glu Cys Arg Glu
355 360 365
Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro Ser Lys Thr Lys Leu Ile
370 375 380
Val Asn Val Phe Ala Ile Asn Arg Asp Pro Glu Tyr Trp Lys Asp Ala
385 390 395 400
Glu Thr Phe Met Pro Glu Arg Phe Glu Asn Ser Pro Ile Thr Val Met
405 410 415
Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly Ala Gly Arg Arg Met Cys
420 425 430
Pro Gly Ala Ala Leu Gly Leu Ala Asn Val Glu Leu Pro Leu Ala His
435 440 445
Ile Leu Tyr Phe Asn Trp Lys Leu Pro Asn Gly Lys Thr Phe Glu Asp
450 455 460
Leu Asp Met Thr Glu Ser Phe Gly Ala Thr Val Gln Arg Lys Thr Glu
465 470 475 480
Leu Leu Leu Val Pro Thr Asp Phe Gln Thr Leu Thr Ala Ser Thr
485 490 495
<210> 50
<211> 493
<212> PRT
<213> 菊苣
<400> 50
Met Ala Leu Leu Leu Ala Val Phe Leu Ala Val Ile Ile Phe Ile Ile
1 5 10 15
Phe Lys Leu Leu Thr Arg Thr Thr Ser Lys Lys Asn Leu Leu Pro Glu
20 25 30
Pro Trp Arg Leu Pro Ile Ile Gly His Met His His Leu Ile Gly Thr
35 40 45
Met Pro His Arg Gly Val Met Glu Leu Ala Arg Lys His Gly Ser Leu
50 55 60
Met His Leu Gln Leu Gly Glu Val Ser Thr Ile Val Val Ser Ser Pro
65 70 75 80
Arg Trp Ala Lys Glu Val Leu Thr Thr Tyr Asp Ile Thr Phe Ala Asn
85 90 95
Arg Pro Glu Thr Leu Thr Gly Glu Ile Val Ala Tyr His Asn Thr Asp
100 105 110
Ile Val Leu Ala Pro Tyr Gly Glu Tyr Trp Arg Gln Leu Arg Lys Leu
115 120 125
Cys Thr Leu Glu Leu Leu Ser Asn Lys Lys Val Lys Ser Phe Gln Ser
130 135 140
Leu Arg Glu Glu Glu Cys Trp Asn Leu Val Lys Asp Ile Arg Ser Thr
145 150 155 160
Gly Gln Gly Ser Pro Ile Asn Leu Ser Glu Asn Ile Phe Lys Met Ile
165 170 175
Ala Thr Ile Leu Ser Arg Ala Ala Phe Gly Lys Gly Ile Lys Asp Gln
180 185 190
Met Lys Phe Thr Glu Leu Val Lys Glu Ile Leu Arg Leu Thr Gly Gly
195 200 205
Phe Asp Val Ala Asp Ile Phe Pro Ser Lys Lys Leu Leu His His Leu
210 215 220
Ser Gly Lys Arg Ala Lys Leu Thr Asn Ile His Asn Lys Leu Asp Asn
225 230 235 240
Leu Ile Asn Asn Ile Ile Ala Glu His Pro Gly Asn Arg Thr Ser Ser
245 250 255
Ser Gln Glu Thr Leu Leu Asp Val Leu Leu Arg Leu Lys Glu Ser Ala
260 265 270
Glu Phe Pro Leu Thr Ala Asp Asn Val Lys Ala Val Ile Leu Asp Met
275 280 285
Phe Gly Ala Gly Thr Asp Thr Ser Ser Ala Thr Ile Glu Trp Ala Ile
290 295 300
Ser Glu Leu Ile Arg Cys Pro Arg Ala Met Glu Lys Val Gln Thr Glu
305 310 315 320
Leu Arg Gln Ala Leu Asn Gly Lys Glu Arg Ile Gln Glu Glu Asp Leu
325 330 335
Gln Glu Leu Asn Tyr Leu Lys Leu Val Ile Lys Glu Thr Leu Arg Leu
340 345 350
His Pro Pro Leu Pro Leu Val Met Pro Arg Glu Cys Arg Glu Pro Cys
355 360 365
Val Leu Gly Gly Tyr Asp Ile Pro Ser Lys Thr Lys Leu Ile Val Asn
370 375 380
Val Phe Ala Ile Asn Arg Asp Pro Glu Tyr Trp Lys Asp Ala Glu Thr
385 390 395 400
Phe Met Pro Glu Arg Phe Glu Asn Ser Pro Ile Thr Val Met Gly Ser
405 410 415
Glu Tyr Glu Tyr Leu Pro Phe Gly Ala Gly Arg Arg Met Cys Pro Gly
420 425 430
Ala Ala Leu Gly Leu Ala Asn Val Glu Leu Pro Leu Ala His Ile Leu
435 440 445
Tyr Phe Asn Trp Lys Leu Pro Asn Gly Lys Thr Phe Glu Asp Leu Asp
450 455 460
Met Thr Glu Ser Phe Gly Ala Thr Val Gln Arg Lys Thr Glu Leu Leu
465 470 475 480
Leu Val Pro Thr Asp Phe Gln Thr Leu Thr Ala Ser Thr
485 490
<210> 51
<211> 1482
<212> DNA
<213> 菊苣
<400> 51
atggctctgt tattagcagt ttttctggct gtcattatct ttatcatctt caaactgctg 60
acccgcacca cctcgaagaa aaacctgctg ccggaaccgt ggcgtctgcc gattatcggc 120
cacatgcatc acctgattgg caccatgccg caccgtggtg tgatggaact ggcgcgcaaa 180
catggctcac tgatgcacct gcagctgggt gaagtgagca ccatcgtggt tagctctccg 240
cgttgggcga aagaagttct gaccacgtat gatattacct ttgccaaccg cccggaaacc 300
ctgacgggcg aaatcgtggc ataccataat acggacattg ttctggctcc gtatggtgaa 360
tactggcgtc agctgcgcaa actgtgcacc ctggaactgc tgagtaacaa aaaagtcaaa 420
tcttttcaaa gtctgcgtga agaagaatgt tggaatctgg tgaaagatat ccgctccacc 480
ggccagggtt caccgatcaa cctgtcggaa aacatcttca aaatgatcgc gacgatcctg 540
tctcgtgcgg cctttggcaa aggtattaaa gaccaaatga aattcaccga actggttaaa 600
gaaatcctgc gcctgacggg cggttttgat gtcgcagaca ttttcccgag taaaaaactg 660
ctgcatcacc tgtccggcaa acgtgctaaa ctgaccaaca tccataacaa actggataac 720
ctgatcaaca acattatcgc cgaacacccg ggtaatcgta ccagttcctc acaggaaacg 780
ctgctggatg ttctgctgcg cctgaaagaa agcgcagaat ttccgctgac cgcggacaat 840
gttaaagccg tcattctgga tatgttcggt gcaggcaccg acacgtcgag cgcaaccatt 900
gaatgggcta tctctgaact gattcgttgc ccgcgcgcga tggaaaaagt gcagacggaa 960
ctgcgtcaag ccctgaacgg caaagaacgc atccaggaag aagatctgca agaactgaac 1020
tacctgaaac tggttatcaa agaaaccctg cgcctgcatc cgccgctgcc gctggtcatg 1080
ccgcgtgaat gccgcgaacc gtgtgtgctg ggcggttatg atatcccgag caaaaccaaa 1140
ctgatcgtca acgtgtttgc aattaatcgt gacccggaat actggaaaga cgctgaaacc 1200
tttatgccgg aacgcttcga aaacagcccg attacggtta tgggttctga atatgaatac 1260
ctgccgtttg gtgcaggtcg tcgcatgtgt ccgggtgcag ctctgggtct ggcgaatgtc 1320
gaactgccgc tggcccacat cctgtattac ttcaactgga aactgccgaa tggcaaaacc 1380
tttgaagatc tggacatgac cgaatccttc ggtgcaacgg tgcaacgcaa aaccgaactg 1440
ctgctggtgc cgacggattt ccaaacgctg accgcatcaa cg 1482
<210> 52
<211> 488
<212> PRT
<213> 向日葵
<400> 52
Met Glu Val Ser Leu Thr Thr Ser Ile Ala Leu Ala Thr Ile Val Phe
1 5 10 15
Phe Leu Tyr Lys Leu Leu Thr Arg Pro Thr Ser Ser Lys Asn Arg Leu
20 25 30
Pro Glu Pro Trp Arg Leu Pro Ile Ile Gly His Met His His Leu Ile
35 40 45
Gly Thr Met Pro His Arg Gly Val Met Asp Leu Ala Arg Lys Tyr Gly
50 55 60
Ser Leu Met His Leu Gln Leu Gly Glu Val Ser Ala Ile Val Val Ser
65 70 75 80
Ser Pro Lys Trp Ala Lys Glu Ile Leu Thr Thr Tyr Asp Ile Pro Phe
85 90 95
Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu Ile Ile Ala Tyr His Asn
100 105 110
Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu Tyr Trp Arg Gln Leu Arg
115 120 125
Lys Leu Cys Thr Leu Glu Leu Leu Ser Val Lys Lys Val Lys Ser Phe
130 135 140
Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn Leu Val Gln Glu Ile Lys
145 150 155 160
Ala Ser Gly Ser Gly Thr Pro Phe Asn Leu Ser Glu Gly Ile Phe Lys
165 170 175
Val Ile Ala Thr Val Leu Ser Arg Ala Ala Phe Gly Lys Gly Ile Lys
180 185 190
Asp Gln Lys Gln Phe Thr Glu Ile Val Lys Glu Ile Leu Arg Glu Thr
195 200 205
Gly Gly Phe Asp Val Ala Asp Ile Phe Pro Ser Lys Lys Phe Leu His
210 215 220
His Leu Ser Gly Lys Arg Gly Arg Leu Thr Ser Ile His Asn Lys Leu
225 230 235 240
Asp Ser Leu Ile Asn Asn Leu Val Ala Glu His Thr Val Ser Lys Ser
245 250 255
Ser Lys Val Asn Glu Thr Leu Leu Asp Val Leu Leu Arg Leu Lys Asn
260 265 270
Ser Glu Glu Phe Pro Leu Thr Ala Asp Asn Val Lys Ala Ile Ile Leu
275 280 285
Asp Met Phe Gly Ala Gly Thr Asp Thr Ser Ser Ala Thr Val Glu Trp
290 295 300
Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg Ala Met Glu Lys Val Gln
305 310 315 320
Ala Glu Leu Arg Gln Ala Leu Asn Gly Lys Glu Arg Ile Lys Glu Glu
325 330 335
Glu Ile Gln Asp Leu Pro Tyr Leu Asn Leu Val Ile Arg Glu Thr Leu
340 345 350
Arg Leu His Pro Pro Leu Pro Leu Val Met Pro Arg Glu Cys Arg Gln
355 360 365
Ala Met Asn Leu Ala Gly Tyr Asp Val Ala Asn Lys Thr Lys Leu Ile
370 375 380
Val Asn Val Phe Ala Ile Asn Arg Asp Pro Glu Tyr Trp Lys Asp Ala
385 390 395 400
Glu Ser Phe Asn Pro Glu Arg Phe Glu Asn Ser Asn Thr Thr Ile Met
405 410 415
Gly Ala Asp Tyr Glu Tyr Leu Pro Phe Gly Ala Gly Arg Arg Met Cys
420 425 430
Pro Gly Ser Ala Leu Gly Leu Ala Asn Val Gln Leu Pro Leu Ala Asn
435 440 445
Ile Leu Tyr Tyr Phe Lys Trp Lys Leu Pro Asn Gly Ala Ser His Asp
450 455 460
Gln Leu Asp Met Thr Glu Ser Phe Gly Ala Thr Val Gln Arg Lys Thr
465 470 475 480
Glu Leu Met Leu Val Pro Ser Phe
485
<210> 53
<211> 488
<212> PRT
<213> 向日葵
<400> 53
Met Ala Leu Leu Leu Ala Val Phe Ile Ala Leu Ala Thr Ile Val Phe
1 5 10 15
Phe Leu Tyr Lys Leu Leu Thr Arg Pro Thr Ser Ser Lys Asn Arg Leu
20 25 30
Pro Glu Pro Trp Arg Leu Pro Ile Ile Gly His Met His His Leu Ile
35 40 45
Gly Thr Met Pro His Arg Gly Val Met Asp Leu Ala Arg Lys Tyr Gly
50 55 60
Ser Leu Met His Leu Gln Leu Gly Glu Val Ser Ala Ile Val Val Ser
65 70 75 80
Ser Pro Lys Trp Ala Lys Glu Ile Leu Thr Thr Tyr Asp Ile Pro Phe
85 90 95
Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu Ile Ile Ala Tyr His Asn
100 105 110
Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu Tyr Trp Arg Gln Leu Arg
115 120 125
Lys Leu Cys Thr Leu Glu Leu Leu Ser Val Lys Lys Val Lys Ser Phe
130 135 140
Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn Leu Val Gln Glu Ile Lys
145 150 155 160
Ala Ser Gly Ser Gly Thr Pro Phe Asn Leu Ser Glu Gly Ile Phe Lys
165 170 175
Val Ile Ala Thr Val Leu Ser Arg Ala Ala Phe Gly Lys Gly Ile Lys
180 185 190
Asp Gln Lys Gln Phe Thr Glu Ile Val Lys Glu Ile Leu Arg Glu Thr
195 200 205
Gly Gly Phe Asp Val Ala Asp Ile Phe Pro Ser Lys Lys Phe Leu His
210 215 220
His Leu Ser Gly Lys Arg Gly Arg Leu Thr Ser Ile His Asn Lys Leu
225 230 235 240
Asp Ser Leu Ile Asn Asn Leu Val Ala Glu His Thr Val Ser Lys Ser
245 250 255
Ser Lys Val Asn Glu Thr Leu Leu Asp Val Leu Leu Arg Leu Lys Asn
260 265 270
Ser Glu Glu Phe Pro Leu Thr Ala Asp Asn Val Lys Ala Ile Ile Leu
275 280 285
Asp Met Phe Gly Ala Gly Thr Asp Thr Ser Ser Ala Thr Val Glu Trp
290 295 300
Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg Ala Met Glu Lys Val Gln
305 310 315 320
Ala Glu Leu Arg Gln Ala Leu Asn Gly Lys Glu Arg Ile Lys Glu Glu
325 330 335
Glu Ile Gln Asp Leu Pro Tyr Leu Asn Leu Val Ile Arg Glu Thr Leu
340 345 350
Arg Leu His Pro Pro Leu Pro Leu Val Met Pro Arg Glu Cys Arg Gln
355 360 365
Ala Met Asn Leu Ala Gly Tyr Asp Val Ala Asn Lys Thr Lys Leu Ile
370 375 380
Val Asn Val Phe Ala Ile Asn Arg Asp Pro Glu Tyr Trp Lys Asp Ala
385 390 395 400
Glu Ser Phe Asn Pro Glu Arg Phe Glu Asn Ser Asn Thr Thr Ile Met
405 410 415
Gly Ala Asp Tyr Glu Tyr Leu Pro Phe Gly Ala Gly Arg Arg Met Cys
420 425 430
Pro Gly Ser Ala Leu Gly Leu Ala Asn Val Gln Leu Pro Leu Ala Asn
435 440 445
Ile Leu Tyr Tyr Phe Lys Trp Lys Leu Pro Asn Gly Ala Ser His Asp
450 455 460
Gln Leu Asp Met Thr Glu Ser Phe Gly Ala Thr Val Gln Arg Lys Thr
465 470 475 480
Glu Leu Met Leu Val Pro Ser Phe
485
<210> 54
<211> 1464
<212> DNA
<213> 向日葵
<400> 54
atggctctgt tattagcagt tttcatcgcc ctggcaacca ttgtcttttt cctgtataaa 60
ctgctgaccc gtccgacctc atctaaaaac cgtctgccgg aaccgtggcg cctgccgatt 120
atcggccaca tgcatcacct gattggcacc atgccgcacc gtggtgtcat ggatctggca 180
cgcaaatatg gcagcctgat gcatctgcaa ctgggtgaag tttctgcgat tgtggttagc 240
tctccgaaat gggccaaaga aattctgacc acctatgata ttccgtttgc gaaccgcccg 300
gaaaccctga cgggcgaaat tatcgcatac cacaataccg acattgtgct ggctccgtat 360
ggtgaatact ggcgtcaact gcgtaaactg tgcacgctgg aactgctgag tgttaaaaaa 420
gtcaaaagtt tccagagcct gcgtgaagaa gaatgttgga acctggttca agaaattaaa 480
gcgagcggca gcggcacccc gtttaatctg agtgaaggta ttttcaaagt gattgcgacc 540
gtgctgagcc gtgcggcatt tggtaaaggt atcaaagatc agaaacaatt caccgaaatt 600
gtcaaagaaa tcctgcgcga aacgggcggt tttgatgtgg cggacatctt tccgagcaaa 660
aaattcctgc atcacctgtc tggcaaacgt ggtcgcctga cctcaattca taacaaactg 720
gattcgctga tcaacaatct ggtcgccgaa cataccgtga gcaaaagcag caaagtgaat 780
gaaacgctgc tggatgtcct gctgcgtctg aaaaactcgg aagaatttcc gctgaccgca 840
gacaatgtga aagctattat cctggatatg ttcggtgcag gcaccgatac cagcagcgca 900
acggtggaat gggccattag cgaactgatc cgttgcccgc gcgcaatgga aaaagttcag 960
gcagaactgc gtcaagctct gaacggcaaa gaacgcatta aagaagaaga aatccaggat 1020
ctgccgtatc tgaatctggt tattcgtgaa accctgcgtc tgcatccgcc gctgccgctg 1080
gtcatgccgc gtgaatgtcg ccaagcaatg aacctggctg gctatgacgt ggcaaataaa 1140
accaaactga tcgtcaatgt gtttgcgatt aaccgtgacc cggaatactg gaaagacgcg 1200
gaaagtttta acccggaacg ctttgaaaac agcaacacca cgattatggg tgcggattat 1260
gaatacctgc cgtttggcgc cggtcgtcgc atgtgtccgg gcagcgcgct gggtctggcc 1320
aacgttcaac tgccgctggc caatatcctg tattacttca aatggaaact gccgaatggc 1380
gcctcacacg atcaactgga catgaccgaa tcgtttggtg caaccgtgca acgcaaaacg 1440
gaactgatgc tggttccgtc tttc 1464
<210> 55
<211> 504
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜氧化酶
<400> 55
Met Ala Trp Glu Tyr Ala Leu Ile Gly Leu Val Val Gly Ile Ile Ile
1 5 10 15
Gly Ala Val Ala Ala Trp Tyr Leu Lys Ser Tyr Thr Ser Ala Arg Arg
20 25 30
Ser Gln Ser Asn His Leu Pro Arg Val Pro Glu Val Pro Gly Val Pro
35 40 45
Leu Leu Gly Asn Leu Leu Gln Leu Lys Glu Lys Lys Pro Tyr Met Thr
50 55 60
Phe Thr Arg Trp Ala Ala Thr Tyr Gly Pro Ile Tyr Ser Ile Lys Thr
65 70 75 80
Gly Ala Thr Ser Met Val Val Val Ser Ser Asn Glu Ile Ala Lys Glu
85 90 95
Ala Leu Val Thr Arg Phe Gln Ser Ile Ser Thr Arg Asn Leu Ser Lys
100 105 110
Ala Leu Lys Val Leu Thr Ala Asp Lys Thr Met Val Ala Met Ser Asp
115 120 125
Tyr Asp Asp Tyr His Lys Thr Val Lys Arg His Ile Leu Thr Ala Val
130 135 140
Leu Gly Pro Asn Ala Gln Lys Lys His Arg Ile His Arg Asp Ile Met
145 150 155 160
Met Asp Asn Ile Ser Thr Gln Leu His Glu Phe Val Lys Asn Asn Pro
165 170 175
Glu Gln Glu Glu Val Asp Leu Arg Lys Ile Phe Gln Ser Glu Leu Phe
180 185 190
Gly Leu Ala Met Arg Gln Ala Leu Gly Lys Asp Val Glu Ser Leu Tyr
195 200 205
Val Glu Asp Leu Lys Ile Thr Met Asn Arg Asp Glu Ile Phe Gln Val
210 215 220
Leu Val Val Asp Pro Met Met Gly Ala Ile Asp Val Asp Trp Arg Asp
225 230 235 240
Phe Phe Pro Tyr Leu Lys Trp Val Pro Asn Lys Lys Phe Glu Asn Thr
245 250 255
Ile Gln Gln Met Tyr Ile Arg Arg Glu Ala Val Met Lys Ser Leu Ile
260 265 270
Lys Glu His Lys Lys Arg Ile Ala Ser Gly Glu Lys Leu Asn Ser Tyr
275 280 285
Ile Asp Tyr Leu Leu Ser Glu Ala Gln Thr Leu Thr Asp Gln Gln Leu
290 295 300
Leu Met Ser Leu Trp Glu Pro Ile Ile Glu Ser Ser Asp Thr Thr Met
305 310 315 320
Val Thr Thr Glu Trp Ala Met Tyr Glu Leu Ala Lys Asn Pro Lys Leu
325 330 335
Gln Asp Arg Leu Tyr Arg Asp Ile Lys Ser Val Cys Gly Ser Glu Lys
340 345 350
Ile Thr Glu Glu His Leu Ser Gln Leu Pro Tyr Ile Thr Ala Ile Phe
355 360 365
His Glu Thr Leu Arg Arg His Ser Pro Val Pro Ile Ile Pro Leu Arg
370 375 380
His Val His Glu Asp Thr Val Leu Gly Gly Tyr His Val Pro Ala Gly
385 390 395 400
Thr Glu Leu Ala Val Asn Ile Tyr Gly Cys Asn Met Asp Lys Asn Val
405 410 415
Trp Glu Asn Pro Glu Glu Trp Asn Pro Glu Arg Phe Met Lys Glu Asn
420 425 430
Glu Thr Ile Asp Phe Gln Lys Thr Met Ala Phe Gly Gly Gly Lys Arg
435 440 445
Val Cys Ala Gly Ser Leu Gln Ala Leu Leu Thr Ala Ser Ile Gly Ile
450 455 460
Gly Arg Met Val Gln Glu Phe Glu Trp Lys Leu Lys Asp Met Thr Gln
465 470 475 480
Glu Glu Val Asn Thr Ile Gly Leu Thr Thr Gln Met Leu Arg Pro Leu
485 490 495
Arg Ala Ile Ile Lys Pro Arg Ile
500
<210> 56
<211> 505
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜氧化酶
<400> 56
Met Ala Trp Glu Tyr Ala Leu Ile Gly Leu Val Val Gly Ile Ile Ile
1 5 10 15
Gly Ala Val Ala Met Arg Trp Tyr Leu Lys Ser Tyr Thr Ser Ala Arg
20 25 30
Arg Ser Gln Ser Asn His Leu Pro Arg Val Pro Glu Val Pro Gly Val
35 40 45
Pro Leu Leu Gly Asn Leu Leu Gln Leu Lys Glu Lys Lys Pro Tyr Met
50 55 60
Thr Phe Thr Arg Trp Ala Ala Thr Tyr Gly Pro Ile Tyr Ser Ile Lys
65 70 75 80
Thr Gly Ala Thr Ser Val Val Val Val Ser Ser Asn Glu Ile Ala Lys
85 90 95
Glu Ala Met Val Thr Arg Phe Gln Ser Ile Ser Thr Arg Asn Leu Ser
100 105 110
Lys Ala Leu Lys Val Leu Thr Ala Asp Lys Thr Met Val Ala Met Ser
115 120 125
Asp Tyr Asp Asp Tyr His Lys Thr Val Lys Arg His Ile Leu Thr Ala
130 135 140
Val Leu Gly Pro Asn Ala Gln Lys Lys His Arg Ile His Arg Asp Ile
145 150 155 160
Met Met Asp Asn Ile Ser Thr Gln Leu His Glu Phe Val Lys Asn Asn
165 170 175
Pro Glu Gln Glu Glu Val Asp Leu Arg Lys Ile Phe Gln Ser Glu Leu
180 185 190
Phe Gly Leu Ala Met Arg Gln Ala Leu Gly Lys Asp Val Glu Ser Leu
195 200 205
Tyr Val Glu Asp Leu Lys Ile Thr Met Asn Arg Asp Glu Ile Leu Gln
210 215 220
Val Leu Val Val Asp Pro Met Met Gly Ala Ile Asp Val Asp Trp Arg
225 230 235 240
Asp Phe Phe Pro Tyr Leu Lys Trp Val Pro Asn Lys Lys Phe Glu Asn
245 250 255
Thr Ile Gln Gln Met Tyr Ile Arg Arg Glu Ala Val Met Lys Ser Leu
260 265 270
Ile Lys Glu His Lys Lys Arg Ile Ala Ser Gly Glu Lys Leu Asn Ser
275 280 285
Tyr Ile Asp Tyr Leu Leu Ser Glu Ala Gln Thr Leu Thr Asp Gln Gln
290 295 300
Leu Leu Met Ser Leu Trp Glu Pro Ile Ile Glu Ser Ser Asp Thr Thr
305 310 315 320
Met Val Thr Thr Glu Trp Ala Met Tyr Glu Leu Ala Lys Asn Pro Lys
325 330 335
Leu Gln Asp Arg Leu Tyr Arg Asp Ile Lys Ser Val Cys Gly Ser Glu
340 345 350
Lys Ile Thr Glu Glu His Leu Ser Gln Leu Pro Tyr Ile Thr Ala Ile
355 360 365
Phe His Glu Thr Leu Arg Lys His Ser Pro Val Pro Ile Leu Pro Leu
370 375 380
Arg His Val His Glu Asp Thr Val Leu Gly Gly Tyr His Val Pro Ala
385 390 395 400
Gly Thr Glu Leu Ala Val Asn Ile Tyr Gly Cys Asn Met Asp Lys Asn
405 410 415
Val Trp Glu Asn Pro Glu Glu Trp Asn Pro Glu Arg Phe Met Lys Glu
420 425 430
Asn Glu Thr Ile Asp Phe Gln Lys Thr Met Ala Phe Gly Gly Gly Lys
435 440 445
Arg Val Cys Ala Gly Ser Leu Gln Ala Leu Leu Ile Ala Ser Ile Gly
450 455 460
Ile Gly Arg Met Val Gln Glu Phe Glu Trp Lys Leu Lys Asp Met Thr
465 470 475 480
Gln Glu Glu Val Asn Thr Ile Gly Leu Thr Asn Gln Met Leu Arg Pro
485 490 495
Leu Arg Ala Ile Ile Lys Pro Arg Ile
500 505
<210> 57
<211> 505
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜氧化酶
<400> 57
Met Ala Trp Glu Tyr Ala Leu Ile Gly Leu Val Val Gly Ile Ile Ile
1 5 10 15
Gly Ala Val Ala Met Arg Trp Tyr Leu Lys Ser Tyr Thr Ser Ala Arg
20 25 30
Arg Ser Gln Ser Asn His Leu Pro Arg Val Pro Glu Val Pro Gly Val
35 40 45
Pro Leu Leu Gly Asn Leu Leu Gln Leu Lys Glu Lys Lys Pro Tyr Met
50 55 60
Thr Phe Thr Lys Trp Ala Ala Thr Tyr Gly Pro Ile Tyr Ser Ile Lys
65 70 75 80
Thr Gly Ala Thr Ser Met Val Val Val Ser Ser Asn Glu Ile Ala Lys
85 90 95
Glu Ala Met Val Thr Arg Phe Gln Ser Ile Ser Thr Arg Asn Leu Ser
100 105 110
Lys Ala Leu Lys Val Leu Thr Ala Asp Lys Gln Met Val Ala Met Ser
115 120 125
Asp Tyr Asp Asp Tyr His Lys Thr Val Lys Arg His Ile Leu Thr Ala
130 135 140
Val Leu Gly Pro Asn Ala Gln Lys Lys His Arg Ile His Arg Asp Ile
145 150 155 160
Met Met Asp Asn Ile Ser Thr Gln Leu His Glu Phe Val Lys Asn Asn
165 170 175
Pro Glu Gln Glu Glu Val Asp Leu Arg Lys Ile Phe Gln Ser Glu Leu
180 185 190
Phe Gly Leu Ala Met Arg Gln Ala Leu Gly Lys Asp Val Glu Ser Leu
195 200 205
Tyr Val Glu Asp Leu Lys Ile Thr Met Asn Arg Asp Glu Ile Leu Gln
210 215 220
Val Leu Val Val Asp Pro Met Met Gly Ala Ile Asp Val Asp Trp Arg
225 230 235 240
Asp Phe Phe Pro Tyr Leu Lys Trp Val Pro Asn Lys Lys Phe Glu Asn
245 250 255
Thr Ile Gln Gln Met Tyr Ile Arg Arg Glu Ala Val Met Lys Ser Leu
260 265 270
Ile Lys Glu His Lys Lys Arg Ile Ala Ser Gly Glu Lys Leu Asn Ser
275 280 285
Tyr Ile Asp Tyr Leu Leu Ser Glu Ala Gln Thr Leu Thr Asp Gln Gln
290 295 300
Leu Leu Met Ser Leu Trp Glu Pro Ile Ile Glu Ser Ser Asp Thr Thr
305 310 315 320
Met Val Thr Thr Glu Trp Ala Met Tyr Glu Leu Ala Lys Asn Pro Lys
325 330 335
Leu Gln Asp Arg Leu Tyr Arg Asp Ile Lys Ser Val Cys Gly Ser Glu
340 345 350
Lys Ile Thr Glu Glu His Leu Ser Gln Leu Pro Tyr Ile Thr Ala Ile
355 360 365
Phe His Glu Thr Leu Arg Lys His Ser Pro Val Pro Ile Leu Pro Leu
370 375 380
Arg His Val His Glu Asp Thr Val Leu Gly Gly Tyr His Val Pro Ala
385 390 395 400
Gly Thr Glu Leu Ala Val Asn Ile Tyr Gly Cys Asn Met Asp Lys Asn
405 410 415
Val Trp Glu Asn Pro Glu Glu Trp Asn Pro Glu Arg Phe Met Lys Glu
420 425 430
Asn Glu Thr Ile Asp Phe Gln Lys Thr Met Ala Phe Gly Gly Gly Lys
435 440 445
Arg Val Cys Ala Gly Ser Leu Gln Ala Leu Leu Ile Ala Ser Ile Gly
450 455 460
Ile Gly Arg Met Val Gln Glu Phe Glu Trp Lys Leu Lys Asp Met Thr
465 470 475 480
Gln Glu Glu Val Asn Thr Ile Gly Leu Thr Asn Gln Met Leu Arg Pro
485 490 495
Leu Arg Ala Ile Ile Lys Pro Arg Ile
500 505
<210> 58
<211> 505
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜氧化酶
<400> 58
Met Ala Trp Glu Tyr Ala Leu Ile Gly Leu Val Val Gly Ile Ile Ile
1 5 10 15
Gly Ala Val Ala Met Arg Trp Tyr Leu Lys Ser Tyr Thr Ser Ala Arg
20 25 30
Arg Ser Gln Ser Asn His Leu Pro Arg Val Pro Glu Val Pro Gly Val
35 40 45
Pro Leu Leu Gly Asn Leu Leu Gln Leu Lys Glu Lys Lys Pro Tyr Met
50 55 60
Thr Phe Thr Arg Trp Ala Ala Thr Tyr Gly Pro Ile Tyr Ser Ile Lys
65 70 75 80
Thr Gly Ala Thr Ser Val Val Val Val Ser Ser Asn Glu Ile Ala Lys
85 90 95
Glu Ala Met Val Thr Arg Phe Gln Ser Ile Ser Thr Arg Asn Leu Ser
100 105 110
Lys Ala Leu Lys Val Leu Thr Ala Asp Lys Gln Met Val Ala Met Ser
115 120 125
Asp Tyr Asp Asp Tyr His Lys Thr Val Lys Arg His Ile Leu Thr Ala
130 135 140
Val Leu Gly Pro Asn Ala Gln Lys Lys His Arg Ile His Arg Asp Ile
145 150 155 160
Met Met Asp Asn Ile Ser Thr Gln Leu His Glu Phe Val Lys Asn Asn
165 170 175
Pro Glu Gln Glu Glu Val Asp Leu Arg Lys Ile Phe Gln Ser Glu Leu
180 185 190
Phe Gly Leu Ala Met Arg Gln Ala Leu Gly Lys Asp Val Glu Ser Leu
195 200 205
Tyr Val Glu Asp Leu Lys Ile Thr Met Asn Arg Asp Glu Ile Leu Gln
210 215 220
Val Leu Val Val Asp Pro Met Met Gly Ala Ile Asp Val Asp Trp Arg
225 230 235 240
Asp Phe Phe Pro Tyr Leu Lys Trp Val Pro Asn Lys Lys Phe Glu Asn
245 250 255
Thr Ile Gln Gln Met Tyr Ile Arg Arg Glu Ala Val Met Lys Ser Leu
260 265 270
Ile Lys Glu His Lys Lys Arg Ile Ala Ser Gly Glu Lys Leu Asn Ser
275 280 285
Tyr Ile Asp Tyr Leu Leu Ser Glu Ala Gln Thr Leu Thr Asp Gln Gln
290 295 300
Leu Leu Met Ser Leu Trp Glu Pro Ile Ile Glu Ser Ser Asp Thr Thr
305 310 315 320
Met Val Thr Thr Glu Trp Ala Met Tyr Glu Leu Ala Lys Asn Pro Lys
325 330 335
Leu Gln Asp Arg Leu Tyr Arg Asp Ile Lys Ser Val Cys Gly Ser Glu
340 345 350
Lys Ile Thr Glu Glu His Leu Ser Gln Leu Pro Tyr Ile Thr Ala Ile
355 360 365
Phe His Glu Thr Leu Arg Lys His Ser Pro Val Pro Ile Leu Pro Leu
370 375 380
Arg His Val His Glu Asp Thr Val Leu Gly Gly Tyr His Val Pro Ala
385 390 395 400
Gly Thr Glu Leu Ala Val Asn Ile Tyr Gly Cys Asn Met Asp Lys Asn
405 410 415
Val Trp Glu Asn Pro Glu Glu Trp Asn Pro Glu Arg Phe Met Lys Glu
420 425 430
Asn Glu Thr Ile Asp Phe Gln Lys Thr Met Ala Phe Gly Gly Gly Lys
435 440 445
Arg Val Cys Ala Gly Ser Leu Gln Ala Leu Leu Ile Ala Ser Ile Gly
450 455 460
Ile Gly Arg Met Val Gln Glu Phe Glu Trp Lys Leu Lys Asp Met Thr
465 470 475 480
Gln Glu Glu Val Asn Thr Ile Gly Leu Thr Asn Gln Met Leu Arg Pro
485 490 495
Leu Arg Ala Ile Ile Lys Pro Arg Ile
500 505
<210> 59
<211> 504
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜氧化酶
<400> 59
Met Ala Trp Glu Tyr Ala Leu Ile Gly Leu Val Val Gly Ile Ile Ile
1 5 10 15
Gly Ala Val Ala Ala Trp Tyr Leu Lys Ser Tyr Thr Ser Ala Arg Arg
20 25 30
Ser Gln Ser Asn His Leu Pro Arg Val Pro Glu Val Pro Gly Val Pro
35 40 45
Leu Leu Gly Asn Leu Leu Gln Leu Lys Glu Lys Lys Pro Tyr Met Thr
50 55 60
Phe Thr Lys Trp Ala Ala Thr Tyr Gly Pro Ile Tyr Ser Ile Lys Thr
65 70 75 80
Gly Ala Thr Ser Met Val Val Val Ser Ser Asn Glu Ile Ala Lys Glu
85 90 95
Ala Met Val Thr Arg Phe Gln Ser Ile Ser Thr Arg Asn Leu Ser Lys
100 105 110
Ala Leu Lys Val Leu Thr Ala Asp Lys Gln Met Val Ala Met Ser Asp
115 120 125
Tyr Asp Asp Tyr His Lys Thr Val Lys Arg His Ile Leu Thr Ala Val
130 135 140
Leu Gly Pro Asn Ala Gln Lys Lys His Arg Ile His Arg Asp Ile Met
145 150 155 160
Met Asp Asn Ile Ser Thr Gln Leu His Glu Phe Val Lys Asn Asn Pro
165 170 175
Glu Gln Glu Glu Val Asp Leu Arg Lys Ile Phe Gln Ser Glu Leu Phe
180 185 190
Gly Leu Ala Met Arg Gln Ala Leu Gly Lys Asp Val Glu Ser Leu Tyr
195 200 205
Val Glu Asp Leu Lys Ile Thr Met Asn Arg Asp Glu Ile Leu Gln Val
210 215 220
Leu Val Val Asp Pro Met Met Gly Ala Ile Asp Val Asp Trp Arg Asp
225 230 235 240
Phe Phe Pro Tyr Leu Lys Trp Val Pro Asn Lys Lys Phe Glu Asn Thr
245 250 255
Ile Gln Gln Met Tyr Ile Arg Arg Glu Ala Val Met Lys Ser Leu Ile
260 265 270
Lys Glu His Lys Lys Arg Ile Ala Ser Gly Glu Lys Leu Asn Ser Tyr
275 280 285
Ile Asp Tyr Leu Leu Ser Glu Ala Gln Thr Leu Thr Asp Gln Gln Leu
290 295 300
Leu Met Ser Leu Trp Glu Pro Ile Ile Glu Ser Ser Asp Thr Thr Met
305 310 315 320
Val Thr Thr Glu Trp Ala Met Tyr Glu Leu Ala Lys Asn Pro Lys Leu
325 330 335
Gln Asp Arg Leu Tyr Arg Asp Ile Lys Ser Val Cys Gly Ser Glu Lys
340 345 350
Ile Thr Glu Glu His Leu Ser Gln Leu Pro Tyr Ile Thr Ala Ile Phe
355 360 365
His Glu Thr Leu Arg Lys His Ser Pro Val Pro Ile Leu Pro Leu Arg
370 375 380
His Val His Glu Asp Thr Val Leu Gly Gly Tyr His Val Pro Ala Gly
385 390 395 400
Thr Glu Leu Ala Val Asn Ile Tyr Gly Cys Asn Met Asp Lys Asn Val
405 410 415
Trp Glu Asn Pro Glu Glu Trp Asn Pro Glu Arg Phe Met Lys Glu Asn
420 425 430
Glu Thr Ile Asp Phe Gln Lys Thr Met Ala Phe Gly Gly Gly Lys Arg
435 440 445
Val Cys Ala Gly Ser Leu Gln Ala Leu Leu Ile Ala Ser Ile Gly Ile
450 455 460
Gly Arg Met Val Gln Glu Phe Glu Trp Lys Leu Lys Asp Met Thr Gln
465 470 475 480
Glu Glu Val Asn Thr Ile Gly Leu Thr Asn Gln Met Leu Arg Pro Leu
485 490 495
Arg Ala Ile Ile Lys Pro Arg Ile
500
<210> 60
<211> 505
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜氧化酶
<400> 60
Met Ala Trp Glu Tyr Ala Leu Ile Gly Leu Val Val Gly Ile Ile Ile
1 5 10 15
Gly Ala Val Ala Met Arg Trp Tyr Leu Lys Ser Tyr Thr Ser Ala Arg
20 25 30
Arg Ser Gln Ser Asn His Leu Pro Arg Val Pro Glu Val Pro Gly Val
35 40 45
Pro Leu Leu Gly Asn Leu Leu Gln Leu Lys Glu Lys Lys Pro Tyr Met
50 55 60
Thr Phe Thr Lys Trp Ala Ala Thr Tyr Gly Pro Ile Tyr Ser Ile Lys
65 70 75 80
Thr Gly Ala Thr Ser Val Val Val Val Ser Ser Asn Glu Ile Ala Lys
85 90 95
Glu Ala Met Val Thr Arg Phe Gln Ser Ile Ser Thr Arg Asn Leu Ser
100 105 110
Lys Ala Leu Lys Val Leu Thr Ala Asp Lys Thr Met Val Ala Met Ser
115 120 125
Asp Tyr Asp Asp Tyr His Lys Thr Val Lys Arg His Ile Leu Thr Ala
130 135 140
Val Leu Gly Pro Asn Ala Gln Lys Lys His Arg Ile His Arg Asp Ile
145 150 155 160
Met Met Asp Asn Ile Ser Thr Gln Leu His Glu Phe Val Lys Asn Asn
165 170 175
Pro Glu Gln Glu Glu Val Asp Leu Arg Lys Ile Phe Gln Ser Glu Leu
180 185 190
Phe Gly Leu Ala Met Arg Gln Ala Leu Gly Lys Asp Val Glu Ser Leu
195 200 205
Tyr Val Glu Asp Leu Lys Ile Thr Met Asn Arg Asp Glu Ile Leu Gln
210 215 220
Val Leu Val Val Asp Pro Met Met Gly Ala Ile Asp Val Asp Trp Arg
225 230 235 240
Asp Phe Phe Pro Tyr Leu Lys Trp Val Pro Asn Lys Lys Phe Glu Asn
245 250 255
Thr Ile Gln Gln Met Tyr Ile Arg Arg Glu Ala Val Met Lys Ser Leu
260 265 270
Ile Lys Glu His Lys Lys Arg Ile Ala Ser Gly Glu Lys Leu Asn Ser
275 280 285
Tyr Ile Asp Tyr Leu Leu Ser Glu Ala Gln Thr Leu Thr Asp Gln Gln
290 295 300
Leu Leu Met Ser Leu Trp Glu Pro Ile Ile Glu Ser Ser Asp Thr Thr
305 310 315 320
Met Val Thr Thr Glu Trp Ala Met Tyr Glu Leu Ala Lys Asn Pro Lys
325 330 335
Leu Gln Asp Arg Leu Tyr Arg Asp Ile Lys Ser Val Cys Gly Ser Glu
340 345 350
Lys Ile Thr Glu Glu His Leu Ser Gln Leu Pro Tyr Ile Thr Ala Ile
355 360 365
Phe His Glu Thr Leu Arg Lys His Ser Pro Val Pro Ile Ile Pro Leu
370 375 380
Arg His Val His Glu Asp Thr Val Leu Gly Gly Tyr His Val Pro Ala
385 390 395 400
Gly Thr Glu Leu Ala Val Asn Ile Tyr Gly Cys Asn Met Asp Lys Asn
405 410 415
Val Trp Glu Asn Pro Glu Glu Trp Asn Pro Glu Arg Phe Met Lys Glu
420 425 430
Asn Glu Thr Ile Asp Phe Gln Lys Thr Met Ala Phe Gly Gly Gly Lys
435 440 445
Arg Val Cys Ala Gly Ser Leu Gln Ala Leu Leu Ile Ala Ser Ile Gly
450 455 460
Ile Gly Arg Met Val Gln Glu Phe Glu Trp Lys Leu Lys Asp Met Thr
465 470 475 480
Gln Glu Glu Val Asn Thr Ile Gly Leu Thr Asn Gln Met Leu Arg Pro
485 490 495
Leu Arg Ala Ile Ile Lys Pro Arg Ile
500 505
<210> 61
<211> 505
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜氧化酶
<400> 61
Met Ala Trp Glu Tyr Ala Leu Ile Gly Leu Val Val Gly Ile Ile Ile
1 5 10 15
Gly Ala Val Ala Met Arg Trp Tyr Leu Lys Ser Tyr Thr Ser Ala Arg
20 25 30
Arg Ser Gln Ser Asn His Leu Pro Arg Val Pro Glu Val Pro Gly Val
35 40 45
Pro Leu Leu Gly Asn Leu Leu Gln Leu Lys Glu Lys Lys Pro Tyr Met
50 55 60
Thr Phe Thr Lys Trp Ala Ala Thr Tyr Gly Pro Ile Tyr Ser Ile Lys
65 70 75 80
Thr Gly Ala Thr Ser Val Val Val Val Ser Ser Asn Glu Ile Ala Lys
85 90 95
Glu Ala Leu Val Thr Arg Phe Gln Ser Ile Ser Thr Arg Asn Leu Ser
100 105 110
Lys Ala Leu Lys Val Leu Thr Ala Asp Lys Gln Met Val Ala Met Ser
115 120 125
Asp Tyr Asp Asp Tyr His Lys Thr Val Lys Arg His Ile Leu Thr Ala
130 135 140
Val Leu Gly Pro Asn Ala Gln Lys Lys His Arg Ile His Arg Asp Ile
145 150 155 160
Met Met Asp Asn Ile Ser Thr Gln Leu His Glu Phe Val Lys Asn Asn
165 170 175
Pro Glu Gln Glu Glu Val Asp Leu Arg Lys Ile Phe Gln Ser Glu Leu
180 185 190
Phe Gly Leu Ala Met Arg Gln Ala Leu Gly Lys Asp Val Glu Ser Leu
195 200 205
Tyr Val Glu Asp Leu Lys Ile Thr Met Asn Arg Asp Glu Ile Leu Gln
210 215 220
Val Leu Val Val Asp Pro Met Met Gly Ala Ile Asp Val Asp Trp Arg
225 230 235 240
Asp Phe Phe Pro Tyr Leu Lys Trp Val Pro Asn Lys Lys Phe Glu Asn
245 250 255
Thr Ile Gln Gln Met Tyr Ile Arg Arg Glu Ala Val Met Lys Ser Leu
260 265 270
Ile Lys Glu Gln Lys Lys Arg Ile Ala Ser Gly Glu Lys Leu Asn Ser
275 280 285
Tyr Ile Asp Tyr Leu Leu Ser Glu Ala Gln Thr Leu Thr Asp Gln Gln
290 295 300
Leu Leu Met Ser Leu Trp Glu Pro Ile Ile Glu Ser Ser Asp Thr Thr
305 310 315 320
Met Val Thr Thr Glu Trp Ala Met Tyr Glu Leu Ala Lys Asn Pro Lys
325 330 335
Leu Gln Asp Arg Leu Tyr Arg Asp Ile Lys Ser Val Cys Gly Ser Glu
340 345 350
Lys Ile Thr Glu Glu His Leu Ser Gln Leu Pro Tyr Ile Thr Ala Ile
355 360 365
Phe His Glu Thr Leu Arg Lys His Ser Pro Val Pro Ile Leu Pro Leu
370 375 380
Arg His Val His Glu Asp Thr Val Leu Gly Gly Tyr His Val Pro Ala
385 390 395 400
Gly Thr Glu Leu Ala Val Asn Ile Tyr Gly Cys Asn Met Asp Lys Asn
405 410 415
Val Trp Glu Asn Pro Glu Glu Trp Asn Pro Glu Arg Phe Met Lys Glu
420 425 430
Asn Glu Thr Ile Asp Phe Gln Lys Thr Met Ala Phe Gly Gly Gly Lys
435 440 445
Arg Val Cys Ala Gly Ser Leu Gln Ala Leu Leu Ile Ala Ser Ile Gly
450 455 460
Ile Gly Arg Met Val Gln Glu Phe Glu Trp Lys Leu Lys Asp Met Thr
465 470 475 480
Gln Glu Glu Val Asn Thr Ile Gly Leu Thr Asn Gln Met Leu Arg Pro
485 490 495
Leu Arg Ala Ile Ile Lys Pro Arg Ile
500 505
<210> 62
<211> 710
<212> PRT
<213> 甜菊
<400> 62
Met Gln Ser Asp Ser Val Lys Val Ser Pro Phe Asp Leu Val Ser Ala
1 5 10 15
Ala Met Asn Gly Lys Ala Met Glu Lys Leu Asn Ala Ser Glu Ser Glu
20 25 30
Asp Pro Thr Thr Leu Pro Ala Leu Lys Met Leu Val Glu Asn Arg Glu
35 40 45
Leu Leu Thr Leu Phe Thr Thr Ser Phe Ala Val Leu Ile Gly Cys Leu
50 55 60
Val Phe Leu Met Trp Arg Arg Ser Ser Ser Lys Lys Leu Val Gln Asp
65 70 75 80
Pro Val Pro Gln Val Ile Val Val Lys Lys Lys Glu Lys Glu Ser Glu
85 90 95
Val Asp Asp Gly Lys Lys Lys Val Ser Ile Phe Tyr Gly Thr Gln Thr
100 105 110
Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Val Glu Glu Ala Lys Val
115 120 125
Arg Tyr Glu Lys Thr Ser Phe Lys Val Ile Asp Leu Asp Asp Tyr Ala
130 135 140
Ala Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Ser Leu Ala
145 150 155 160
Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala
165 170 175
Ala Asn Phe Tyr Lys Trp Phe Thr Glu Gly Asp Asp Lys Gly Glu Trp
180 185 190
Leu Lys Lys Leu Gln Tyr Gly Val Phe Gly Leu Gly Asn Arg Gln Tyr
195 200 205
Glu His Phe Asn Lys Ile Ala Ile Val Val Asp Asp Lys Leu Thr Glu
210 215 220
Met Gly Ala Lys Arg Leu Val Pro Val Gly Leu Gly Asp Asp Asp Gln
225 230 235 240
Cys Ile Glu Asp Asp Phe Thr Ala Trp Lys Glu Leu Val Trp Pro Glu
245 250 255
Leu Asp Gln Leu Leu Arg Asp Glu Asp Asp Thr Ser Val Thr Thr Pro
260 265 270
Tyr Thr Ala Ala Val Leu Glu Tyr Arg Val Val Tyr His Asp Lys Pro
275 280 285
Ala Asp Ser Tyr Ala Glu Asp Gln Thr His Thr Asn Gly His Val Val
290 295 300
His Asp Ala Gln His Pro Ser Arg Ser Asn Val Ala Phe Lys Lys Glu
305 310 315 320
Leu His Thr Ser Gln Ser Asp Arg Ser Cys Thr His Leu Glu Phe Asp
325 330 335
Ile Ser His Thr Gly Leu Ser Tyr Glu Thr Gly Asp His Val Gly Val
340 345 350
Tyr Ser Glu Asn Leu Ser Glu Val Val Asp Glu Ala Leu Lys Leu Leu
355 360 365
Gly Leu Ser Pro Asp Thr Tyr Phe Ser Val His Ala Asp Lys Glu Asp
370 375 380
Gly Thr Pro Ile Gly Gly Ala Ser Leu Pro Pro Pro Phe Pro Pro Cys
385 390 395 400
Thr Leu Arg Asp Ala Leu Thr Arg Tyr Ala Asp Val Leu Ser Ser Pro
405 410 415
Lys Lys Val Ala Leu Leu Ala Leu Ala Ala His Ala Ser Asp Pro Ser
420 425 430
Glu Ala Asp Arg Leu Lys Phe Leu Ala Ser Pro Ala Gly Lys Asp Glu
435 440 445
Tyr Ala Gln Trp Ile Val Ala Asn Gln Arg Ser Leu Leu Glu Val Met
450 455 460
Gln Ser Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala Ala
465 470 475 480
Val Ala Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro
485 490 495
Lys Met Ser Pro Asn Arg Ile His Val Thr Cys Ala Leu Val Tyr Glu
500 505 510
Thr Thr Pro Ala Gly Arg Ile His Arg Gly Leu Cys Ser Thr Trp Met
515 520 525
Lys Asn Ala Val Pro Leu Thr Glu Ser Pro Asp Cys Ser Gln Ala Ser
530 535 540
Ile Phe Val Arg Thr Ser Asn Phe Arg Leu Pro Val Asp Pro Lys Val
545 550 555 560
Pro Val Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly
565 570 575
Phe Leu Gln Glu Arg Leu Ala Leu Lys Glu Ser Gly Thr Glu Leu Gly
580 585 590
Ser Ser Ile Phe Phe Phe Gly Cys Arg Asn Arg Lys Val Asp Phe Ile
595 600 605
Tyr Glu Asp Glu Leu Asn Asn Phe Val Glu Thr Gly Ala Leu Ser Glu
610 615 620
Leu Ile Val Ala Phe Ser Arg Glu Gly Thr Ala Lys Glu Tyr Val Gln
625 630 635 640
His Lys Met Ser Gln Lys Ala Ser Asp Ile Trp Lys Leu Leu Ser Glu
645 650 655
Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Lys Asp
660 665 670
Val His Arg Thr Leu His Thr Ile Val Gln Glu Gln Gly Ser Leu Asp
675 680 685
Ser Ser Lys Ala Glu Leu Tyr Val Lys Asn Leu Gln Met Ser Gly Arg
690 695 700
Tyr Leu Arg Asp Val Trp
705 710
<210> 63
<211> 2133
<212> DNA
<213> 甜菊
<400> 63
atgcagagcg attctgttaa agtatccccg ttcgacctgg tctctgcggc tatgaacggc 60
aaagcaatgg agaaactgaa cgcgagcgaa tctgaagatc caaccaccct gccggcactg 120
aaaatgctgg tagaaaaccg tgaactgctg actctgttca ccacctcctt cgccgttctg 180
attggttgcc tggtcttcct gatgtggcgc cgttcctctt ccaagaagct ggtacaggac 240
ccggttcctc aggtgatcgt cgttaaaaag aaagagaagg aaagcgaagt cgatgacggc 300
aaaaagaagg tttccatttt ctacggtact cagaccggca ccgctgaggg ttttgccaaa 360
gcactggttg aagaggcaaa agtgcgttac gaaaaaactt ccttcaaagt gattgacctg 420
gacgactatg ctgcggatga tgatgaatac gaggaaaaac tgaaaaaaga aagcctggcc 480
ttcttcttcc tggcaaccta tggcgatggt gaaccgaccg acaacgcggc gaacttctac 540
aaatggttta ccgaaggcga cgacaaaggt gaatggctga agaaactgca gtatggtgtt 600
ttcggtctgg gcaatcgcca gtacgaacat tttaacaaaa tcgcaatcgt tgttgatgac 660
aaactgactg aaatgggtgc gaaacgtctg gtgccggttg gcctgggtga cgatgatcaa 720
tgcatcgaag atgacttcac cgcatggaaa gaactggttt ggccggaact ggatcagctg 780
ctgcgcgacg aagacgacac ttccgtgacc accccgtata ccgctgcagt gctggagtac 840
cgtgttgttt accacgataa accggcggac tcttacgccg aagatcagac tcacactaac 900
ggtcacgtcg tacatgacgc acagcacccg tctcgtagca atgttgcgtt taagaaagag 960
ctgcacacga gccagtccga ccgctcttgt acgcacctgg agttcgatat ctcccacacc 1020
ggtctgtcct atgaaaccgg tgaccatgtt ggcgtttaca gcgaaaacct gagcgaggta 1080
gttgatgaag cgctgaaact gctgggcctg tctccagaca cctactttag cgtgcatgct 1140
gacaaggaag atggtactcc gattggcggc gcttccctgc cgccaccgtt tccaccttgc 1200
actctgcgtg atgctctgac tcgttacgct gatgttctgt ctagcccgaa aaaggttgcg 1260
ctgctggcgc tggccgcaca tgcttctgac ccgtctgaag ctgaccgtct gaaattcctg 1320
gcgtctccgg ccggcaaaga cgaatacgcg cagtggattg tcgctaacca gcgctctctg 1380
ctggaagtga tgcagtcctt cccgtctgcc aaaccgccac tgggcgtgtt tttcgcagct 1440
gtggctccgc gcctgcagcc gcgctactat tctatctcta gctccccgaa aatgagcccg 1500
aaccgcatcc acgttacttg tgctctggtt tacgaaacca cccctgcggg ccgtatccac 1560
cgtggtctgt gctctacgtg gatgaaaaat gccgtgccgc tgaccgaatc cccggactgc 1620
tctcaggcgt ccatcttcgt gcgtacctct aacttccgtc tgccggtgga cccgaaagtt 1680
cctgttatca tgatcggtcc tggcacgggt ctggccccgt ttcgtggttt tctgcaggag 1740
cgtctggctc tgaaagaatc cggtactgag ctgggctctt ccatcttttt cttcggttgt 1800
cgtaaccgca aagtcgattt catctatgaa gacgaactga acaacttcgt agagactggt 1860
gcactgtccg aactgattgt ggcattctct cgtgaaggca cggcgaaaga atacgttcaa 1920
cacaaaatgt ctcagaaagc gagcgatatc tggaaactgc tgtccgaggg tgcgtatctg 1980
tatgtttgtg gcgacgcgaa aggcatggct aaagatgtac accgcaccct gcacaccatt 2040
gtacaagaac aaggctctct ggatagctcc aaggcagaac tgtacgtgaa aaacctgcag 2100
atgtctggcc gttacctgcg tgatgtatgg taa 2133
<210> 64
<211> 692
<212> PRT
<213> 拟南芥
<400> 64
Met Thr Ser Ala Leu Tyr Ala Ser Asp Leu Phe Lys Gln Leu Lys Ser
1 5 10 15
Ile Met Gly Thr Asp Ser Leu Ser Asp Asp Val Val Leu Val Ile Ala
20 25 30
Thr Thr Ser Leu Ala Leu Val Ala Gly Phe Val Val Leu Leu Trp Lys
35 40 45
Lys Thr Thr Ala Asp Arg Ser Gly Glu Leu Lys Pro Leu Met Ile Pro
50 55 60
Lys Ser Leu Met Ala Lys Asp Glu Asp Asp Asp Leu Asp Leu Gly Ser
65 70 75 80
Gly Lys Thr Arg Val Ser Ile Phe Phe Gly Thr Gln Thr Gly Thr Ala
85 90 95
Glu Gly Phe Ala Lys Ala Leu Ser Glu Glu Ile Lys Ala Arg Tyr Glu
100 105 110
Lys Ala Ala Val Lys Val Ile Asp Leu Asp Asp Tyr Ala Ala Asp Asp
115 120 125
Asp Gln Tyr Glu Glu Lys Leu Lys Lys Glu Thr Leu Ala Phe Phe Cys
130 135 140
Val Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg Phe
145 150 155 160
Ser Lys Trp Phe Thr Glu Glu Asn Glu Arg Asp Ile Lys Leu Gln Gln
165 170 175
Leu Ala Tyr Gly Val Phe Ala Leu Gly Asn Arg Gln Tyr Glu His Phe
180 185 190
Asn Lys Ile Gly Ile Val Leu Asp Glu Glu Leu Cys Lys Lys Gly Ala
195 200 205
Lys Arg Leu Ile Glu Val Gly Leu Gly Asp Asp Asp Gln Ser Ile Glu
210 215 220
Asp Asp Phe Asn Ala Trp Lys Glu Ser Leu Trp Ser Glu Leu Asp Lys
225 230 235 240
Leu Leu Lys Asp Glu Asp Asp Lys Ser Val Ala Thr Pro Tyr Thr Ala
245 250 255
Val Ile Pro Glu Tyr Arg Val Val Thr His Asp Pro Arg Phe Thr Thr
260 265 270
Gln Lys Ser Met Glu Ser Asn Val Ala Asn Gly Asn Thr Thr Ile Asp
275 280 285
Ile His His Pro Cys Arg Val Asp Val Ala Val Gln Lys Glu Leu His
290 295 300
Thr His Glu Ser Asp Arg Ser Cys Ile His Leu Glu Phe Asp Ile Ser
305 310 315 320
Arg Thr Gly Ile Thr Tyr Glu Thr Gly Asp His Val Gly Val Tyr Ala
325 330 335
Glu Asn His Val Glu Ile Val Glu Glu Ala Gly Lys Leu Leu Gly His
340 345 350
Ser Leu Asp Leu Val Phe Ser Ile His Ala Asp Lys Glu Asp Gly Ser
355 360 365
Pro Leu Glu Ser Ala Val Pro Pro Pro Phe Pro Gly Pro Cys Thr Leu
370 375 380
Gly Thr Gly Leu Ala Arg Tyr Ala Asp Leu Leu Asn Pro Pro Arg Lys
385 390 395 400
Ser Ala Leu Val Ala Leu Ala Ala Tyr Ala Thr Glu Pro Ser Glu Ala
405 410 415
Glu Lys Leu Lys His Leu Thr Ser Pro Asp Gly Lys Asp Glu Tyr Ser
420 425 430
Gln Trp Ile Val Ala Ser Gln Arg Ser Leu Leu Glu Val Met Ala Ala
435 440 445
Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala Ala Ile Ala
450 455 460
Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Cys Gln Asp Trp
465 470 475 480
Ala Pro Ser Arg Val His Val Thr Ser Ala Leu Val Tyr Gly Pro Thr
485 490 495
Pro Thr Gly Arg Ile His Lys Gly Val Cys Ser Thr Trp Met Lys Asn
500 505 510
Ala Val Pro Ala Glu Lys Ser His Glu Cys Ser Gly Ala Pro Ile Phe
515 520 525
Ile Arg Ala Ser Asn Phe Lys Leu Pro Ser Asn Pro Ser Thr Pro Ile
530 535 540
Val Met Val Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe Leu
545 550 555 560
Gln Glu Arg Met Ala Leu Lys Glu Asp Gly Glu Glu Leu Gly Ser Ser
565 570 575
Leu Leu Phe Phe Gly Cys Arg Asn Arg Gln Met Asp Phe Ile Tyr Glu
580 585 590
Asp Glu Leu Asn Asn Phe Val Asp Gln Gly Val Ile Ser Glu Leu Ile
595 600 605
Met Ala Phe Ser Arg Glu Gly Ala Gln Lys Glu Tyr Val Gln His Lys
610 615 620
Met Met Glu Lys Ala Ala Gln Val Trp Asp Leu Ile Lys Glu Glu Gly
625 630 635 640
Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Arg Asp Val His
645 650 655
Arg Thr Leu His Thr Ile Val Gln Glu Gln Glu Gly Val Ser Ser Ser
660 665 670
Glu Ala Glu Ala Ile Val Lys Lys Leu Gln Thr Glu Gly Arg Tyr Leu
675 680 685
Arg Asp Val Trp
690
<210> 65
<211> 2079
<212> DNA
<213> 拟南芥
<400> 65
atgaccagcg cactgtacgc aagcgacctg tttaagcaac tgaagagcat tatgggcacc 60
gatagcctga gcgatgatgt tgtcctggtc attgcgacca cgagcctggc actggtggct 120
ggttttgtgg ttctgctgtg gaaaaagacc acggccgatc gttctggcga actgaaaccg 180
ctgatgattc cgaaaagtct gatggcaaag gacgaagatg acgatctgga tctgggctcc 240
ggtaaaaccc gtgtgtcaat ctttttcggt acccagacgg gcaccgcaga aggtttcgca 300
aaagctctgt ctgaagaaat taaggcgcgc tatgaaaaag cggccgttaa ggtcatcgat 360
ctggacgatt atgcagctga cgatgaccag tacgaagaaa aactgaaaaa ggaaaccctg 420
gcgtttttct gcgttgccac ctacggcgac ggtgaaccga cggataacgc ggcccgtttt 480
agtaaatggt tcaccgaaga aaatgaacgc gacattaagc tgcagcaact ggcgtatggc 540
gtgtttgctc tgggtaaccg tcagtacgaa catttcaaca agatcggtat cgtcctggat 600
gaagaactgt gtaaaaaggg cgcgaagcgc ctgattgaag tgggcctggg tgatgacgat 660
caatccatcg aagacgattt taacgcctgg aaagaatctc tgtggagtga actggacaaa 720
ctgctgaagg atgaagacga taagagcgtg gcgacgccgt ataccgccgt tattccggaa 780
taccgtgtcg tgacccatga tccgcgcttc accacgcaga aaagcatgga atcaaatgtt 840
gcgaacggta ataccacgat tgacatccat cacccgtgcc gtgtggatgt tgccgtccaa 900
aaagaactgc atacccacga atcggaccgt agctgtatcc acctggaatt tgatattagc 960
cgcacgggca tcacctatga aacgggcgac catgtgggtg tttacgcaga aaaccacgtg 1020
gaaattgttg aagaagctgg caaactgctg ggtcattcgc tggatctggt ttttagcatc 1080
cacgcggaca aggaagatgg ttcgccgctg gaaagcgcag tgccgccgcc gttcccgggt 1140
ccgtgcaccc tgggtacggg tctggcacgt tatgcagatc tgctgaatcc gccgcgcaaa 1200
tccgcactgg tggctctggc agcttacgca accgaaccgt cagaagctga aaaactgaag 1260
catctgacgt cgccggacgg taaagatgaa tatagccagt ggattgttgc gtctcaacgc 1320
agtctgctgg aagtcatggc agcatttccg tcggcaaaac cgccgctggg cgtgtttttc 1380
gcagctattg caccgcgtct gcagccgcgc tattacagca tcagctcttg tcaagattgg 1440
gcgccgtctc gtgtccatgt gaccagtgca ctggtgtatg gtccgacgcc gaccggtcgc 1500
attcacaaag gcgtgtgctc tacctggatg aaaaacgcgg ttccggccga aaagtctcac 1560
gaatgtagtg gtgcgccgat ttttatccgt gccagtaact tcaaactgcc gtccaatccg 1620
tcaaccccga tcgttatggt cggtccgggt acgggtctgg caccgtttcg tggtttcctg 1680
caggaacgca tggctctgaa agaagatggc gaagaactgg gtagttccct gctgtttttc 1740
ggctgccgta atcgccagat ggacttcatc tacgaagatg aactgaacaa cttcgtcgat 1800
caaggtgtga tttccgaact gatcatggca ttttcacgcg aaggcgctca gaaagaatac 1860
gtccaacata aaatgatgga aaaggcggcc caagtgtggg atctgatcaa agaagaaggc 1920
tatctgtacg tttgtggcga cgcaaagggt atggctcgtg atgtccatcg caccctgcac 1980
acgattgttc aggaacaaga aggtgtctca tcgagcgaag cggaagccat cgtgaaaaag 2040
ctgcagaccg aaggccgtta tctgcgcgat gtttggtaa 2079
<210> 66
<211> 717
<212> PRT
<213> 红豆杉(Taxus cuspidata)
<400> 66
Met Gln Ala Asn Ser Asn Thr Val Glu Gly Ala Ser Gln Gly Lys Ser
1 5 10 15
Leu Leu Asp Ile Ser Arg Leu Asp His Ile Phe Ala Leu Leu Leu Asn
20 25 30
Gly Lys Gly Gly Asp Leu Gly Ala Met Thr Gly Ser Ala Leu Ile Leu
35 40 45
Thr Glu Asn Ser Gln Asn Leu Met Ile Leu Thr Thr Ala Leu Ala Val
50 55 60
Leu Val Ala Cys Val Phe Phe Phe Val Trp Arg Arg Gly Gly Ser Asp
65 70 75 80
Thr Gln Lys Pro Ala Val Arg Pro Thr Pro Leu Val Lys Glu Glu Asp
85 90 95
Glu Glu Glu Glu Asp Asp Ser Ala Lys Lys Lys Val Thr Ile Phe Phe
100 105 110
Gly Thr Gln Thr Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Ala Glu
115 120 125
Glu Ala Lys Ala Arg Tyr Glu Lys Ala Val Phe Lys Val Val Asp Leu
130 135 140
Asp Asn Tyr Ala Ala Asp Asp Glu Gln Tyr Glu Glu Lys Leu Lys Lys
145 150 155 160
Glu Lys Leu Ala Phe Phe Met Leu Ala Thr Tyr Gly Asp Gly Glu Pro
165 170 175
Thr Asp Asn Ala Ala Arg Phe Tyr Lys Trp Phe Leu Glu Gly Lys Glu
180 185 190
Arg Glu Pro Trp Leu Ser Asp Leu Thr Tyr Gly Val Phe Gly Leu Gly
195 200 205
Asn Arg Gln Tyr Glu His Phe Asn Lys Val Ala Lys Ala Val Asp Glu
210 215 220
Val Leu Ile Glu Gln Gly Ala Lys Arg Leu Val Pro Val Gly Leu Gly
225 230 235 240
Asp Asp Asp Gln Cys Ile Glu Asp Asp Phe Thr Ala Trp Arg Glu Gln
245 250 255
Val Trp Pro Glu Leu Asp Gln Leu Leu Arg Asp Glu Asp Asp Glu Pro
260 265 270
Thr Ser Ala Thr Pro Tyr Thr Ala Ala Ile Pro Glu Tyr Arg Val Glu
275 280 285
Ile Tyr Asp Ser Val Val Ser Val Tyr Glu Glu Thr His Ala Leu Lys
290 295 300
Gln Asn Gly Gln Ala Val Tyr Asp Ile His His Pro Cys Arg Ser Asn
305 310 315 320
Val Ala Val Arg Arg Glu Leu His Thr Pro Leu Ser Asp Arg Ser Cys
325 330 335
Ile His Leu Glu Phe Asp Ile Ser Asp Thr Gly Leu Ile Tyr Glu Thr
340 345 350
Gly Asp His Val Gly Val His Thr Glu Asn Ser Ile Glu Thr Val Glu
355 360 365
Glu Ala Ala Lys Leu Leu Gly Tyr Gln Leu Asp Thr Ile Phe Ser Val
370 375 380
His Gly Asp Lys Glu Asp Gly Thr Pro Leu Gly Gly Ser Ser Leu Pro
385 390 395 400
Pro Pro Phe Pro Gly Pro Cys Thr Leu Arg Thr Ala Leu Ala Arg Tyr
405 410 415
Ala Asp Leu Leu Asn Pro Pro Arg Lys Ala Ala Phe Leu Ala Leu Ala
420 425 430
Ala His Ala Ser Asp Pro Ala Glu Ala Glu Arg Leu Lys Phe Leu Ser
435 440 445
Ser Pro Ala Gly Lys Asp Glu Tyr Ser Gln Trp Val Thr Ala Ser Gln
450 455 460
Arg Ser Leu Leu Glu Ile Met Ala Glu Phe Pro Ser Ala Lys Pro Pro
465 470 475 480
Leu Gly Val Phe Phe Ala Ala Ile Ala Pro Arg Leu Gln Pro Arg Tyr
485 490 495
Tyr Ser Ile Ser Ser Ser Pro Arg Phe Ala Pro Ser Arg Ile His Val
500 505 510
Thr Cys Ala Leu Val Tyr Gly Pro Ser Pro Thr Gly Arg Ile His Lys
515 520 525
Gly Val Cys Ser Asn Trp Met Lys Asn Ser Leu Pro Ser Glu Glu Thr
530 535 540
His Asp Cys Ser Trp Ala Pro Val Phe Val Arg Gln Ser Asn Phe Lys
545 550 555 560
Leu Pro Ala Asp Ser Thr Thr Pro Ile Val Met Val Gly Pro Gly Thr
565 570 575
Gly Phe Ala Pro Phe Arg Gly Phe Leu Gln Glu Arg Ala Lys Leu Gln
580 585 590
Glu Ala Gly Glu Lys Leu Gly Pro Ala Val Leu Phe Phe Gly Cys Arg
595 600 605
Asn Arg Gln Met Asp Tyr Ile Tyr Glu Asp Glu Leu Lys Gly Tyr Val
610 615 620
Glu Lys Gly Ile Leu Thr Asn Leu Ile Val Ala Phe Ser Arg Glu Gly
625 630 635 640
Ala Thr Lys Glu Tyr Val Gln His Lys Met Leu Glu Lys Ala Ser Asp
645 650 655
Thr Trp Ser Leu Ile Ala Gln Gly Gly Tyr Leu Tyr Val Cys Gly Asp
660 665 670
Ala Lys Gly Met Ala Arg Asp Val His Arg Thr Leu His Thr Ile Val
675 680 685
Gln Glu Gln Glu Ser Val Asp Ser Ser Lys Ala Glu Phe Leu Val Lys
690 695 700
Lys Leu Gln Met Asp Gly Arg Tyr Leu Arg Asp Ile Trp
705 710 715
<210> 67
<211> 2154
<212> DNA
<213> 红豆杉(Taxus cuspidata)
<400> 67
atgcaggcta attccaacac ggtggaaggt gcctcccagg ggaagagcct gctggacata 60
tctcggctgg accatatttt tgcgctgctg ttgaacggca agggaggaga tctgggagcc 120
atgaccggct cggctttgat tttgacagag aattcgcaga atttgatgat tttgaccacg 180
gctttggctg ttttggtcgc gtgtgttttc ttcttcgttt ggaggagggg aggatcggat 240
acgcagaagc cggcggtgag accgacgcct ctggtgaagg aggaagatga ggaggaagaa 300
gacgattctg caaagaagaa agtcacgatt ttctttggga cacagactgg gacggccgag 360
ggatttgcca aggctctagc agaagaggca aaggcaagat atgagaaagc tgtgtttaaa 420
gtcgtagatt tggacaacta tgcagcagat gatgagcagt atgaagaaaa attgaaaaag 480
gaaaaattag cattttttat gctagcaacg tatggagatg gggagcccac tgacaatgca 540
gcaagatttt ataagtggtt tcttgagggc aaggagaggg agccatggct ttctgatctc 600
acttatgggg tgtttggatt aggcaacaga caatatgaac attttaataa ggtggctaaa 660
gcagtagatg aagtcttaat tgaacaaggt gcaaagcgac ttgttccagt gggccttggt 720
gatgatgacc aatgcattga agatgacttt actgcttggc gagagcaggt ttggcctgaa 780
ctggatcagt tactccggga tgaagatgat gagcccacaa gtgctacacc ttatacagct 840
gccatacctg agtatagggt tgaaatttat gattccgtgg tttcagtgta cgaggaaact 900
catgctctca agcaaaatgg ccaagctgtt tatgatatcc atcacccctg cagatctaat 960
gtggcagtga gaagagagct tcatacacct ttgtctgacc gctcttgcat ccatttggaa 1020
tttgatatat cagacactgg ccttatatat gagacaggag atcatgttgg tgtccataca 1080
gaaaacagca ttgaaactgt ggaggaagca gcaaagctac taggctacca attggacact 1140
atattctcag tccacggtga caaagaagat ggcacgccac ttggagggtc ttctttgcca 1200
ccacctttcc ctggtccatg caccctacga actgctcttg ctcgttatgc tgatttgctg 1260
aatcctcctc ggaaggccgc ctttcttgca ttggcagctc atgcatctga tccagcagag 1320
gcagagcggt tgaagttcct ctcatcacca gctggaaagg atgaatattc tcaatgggtc 1380
actgcaagtc agagaagtct tttagaaata atggcagaat ttccatcagc aaaaccaccc 1440
cttggtgttt tctttgcagc aatagcccct cgtctgcaac cccgatatta ttctatttct 1500
tcctctccca ggtttgcacc ctcaagaata catgtgacat gtgctcttgt ttacgggccc 1560
agtccaaccg gtagaattca caaaggtgtt tgttctaact ggatgaagaa ttcgctaccc 1620
tcagaagaaa cccatgactg tagctgggct ccagtctttg tcaggcaatc aaattttaaa 1680
ttgccagcag attctactac tcctattgtc atggtgggtc ctggaactgg ttttgcacct 1740
tttagaggtt ttttgcagga aagagcaaaa cttcaagaag ctggtgagaa gctcggtccg 1800
gctgttttat tttttgggtg caggaatcgc caaatggact acatttatga agatgagctg 1860
aagggctatg tggagaaagg aatactgacc aatctcattg ttgctttctc tcgtgaagga 1920
gcaaccaaag agtatgtcca gcacaagatg ctggaaaagg catccgatac ctggagtctc 1980
attgctcagg gtgggtatct ttatgtatgt ggtgatgcca agggtatggc tagggatgta 2040
cacaggacac tgcacactat tgtccaagag caggaatctg tggatagcag caaagcagag 2100
tttctagtga agaaattaca gatggatgga agatacttac gagatatatg gtga 2154
<210> 68
<211> 705
<212> PRT
<213> 青蒿
<400> 68
Met Ala Gln Ser Thr Thr Ser Val Lys Leu Ser Pro Phe Asp Leu Met
1 5 10 15
Thr Ala Leu Leu Asn Gly Lys Val Ser Phe Asp Thr Ser Asn Thr Ser
20 25 30
Asp Thr Asn Ile Pro Leu Ala Val Phe Met Glu Asn Arg Glu Leu Leu
35 40 45
Met Ile Leu Thr Thr Ser Val Ala Val Leu Ile Gly Cys Val Val Val
50 55 60
Leu Val Trp Arg Arg Ser Ser Ser Ala Ala Lys Lys Ala Ala Glu Ser
65 70 75 80
Pro Val Ile Val Val Pro Lys Lys Val Thr Glu Asp Glu Val Asp Asp
85 90 95
Gly Arg Lys Lys Val Thr Val Phe Phe Gly Thr Gln Thr Gly Thr Ala
100 105 110
Glu Gly Phe Ala Lys Ala Leu Val Glu Glu Ala Lys Ala Arg Tyr Glu
115 120 125
Lys Ala Val Phe Lys Val Ile Asp Leu Asp Asp Tyr Ala Ala Glu Asp
130 135 140
Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Ser Leu Ala Phe Phe Phe
145 150 155 160
Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg Phe
165 170 175
Tyr Lys Trp Phe Thr Glu Gly Glu Glu Lys Gly Glu Trp Leu Asp Lys
180 185 190
Leu Gln Tyr Ala Val Phe Gly Leu Gly Asn Arg Gln Tyr Glu His Phe
195 200 205
Asn Lys Ile Ala Lys Val Val Asp Glu Lys Leu Val Glu Gln Gly Ala
210 215 220
Lys Arg Leu Val Pro Val Gly Met Gly Asp Asp Asp Gln Cys Ile Glu
225 230 235 240
Asp Asp Phe Thr Ala Trp Lys Glu Leu Val Trp Pro Glu Leu Asp Gln
245 250 255
Leu Leu Arg Asp Glu Asp Asp Thr Ser Val Ala Thr Pro Tyr Thr Ala
260 265 270
Ala Val Ala Glu Tyr Arg Val Val Phe His Asp Lys Pro Glu Thr Tyr
275 280 285
Asp Gln Asp Gln Leu Thr Asn Gly His Ala Val His Asp Ala Gln His
290 295 300
Pro Cys Arg Ser Asn Val Ala Val Lys Lys Glu Leu His Ser Pro Leu
305 310 315 320
Ser Asp Arg Ser Cys Thr His Leu Glu Phe Asp Ile Ser Asn Thr Gly
325 330 335
Leu Ser Tyr Glu Thr Gly Asp His Val Gly Val Tyr Val Glu Asn Leu
340 345 350
Ser Glu Val Val Asp Glu Ala Glu Lys Leu Ile Gly Leu Pro Pro His
355 360 365
Thr Tyr Phe Ser Val His Ala Asp Asn Glu Asp Gly Thr Pro Leu Gly
370 375 380
Gly Ala Ser Leu Pro Pro Pro Phe Pro Pro Cys Thr Leu Arg Lys Ala
385 390 395 400
Leu Ala Ser Tyr Ala Asp Val Leu Ser Ser Pro Lys Lys Ser Ala Leu
405 410 415
Leu Ala Leu Ala Ala His Ala Thr Asp Ser Thr Glu Ala Asp Arg Leu
420 425 430
Lys Phe Leu Ala Ser Pro Ala Gly Lys Asp Glu Tyr Ala Gln Trp Ile
435 440 445
Val Ala Ser His Arg Ser Leu Leu Glu Val Met Glu Ala Phe Pro Ser
450 455 460
Ala Lys Pro Pro Leu Gly Val Phe Phe Ala Ser Val Ala Pro Arg Leu
465 470 475 480
Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro Arg Phe Ala Pro Asn
485 490 495
Arg Ile His Val Thr Cys Ala Leu Val Tyr Glu Gln Thr Pro Ser Gly
500 505 510
Arg Val His Lys Gly Val Cys Ser Thr Trp Met Lys Asn Ala Val Pro
515 520 525
Met Thr Glu Ser Gln Asp Cys Ser Trp Ala Pro Ile Tyr Val Arg Thr
530 535 540
Ser Asn Phe Arg Leu Pro Ser Asp Pro Lys Val Pro Val Ile Met Ile
545 550 555 560
Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe Leu Gln Glu Arg
565 570 575
Leu Ala Gln Lys Glu Ala Gly Thr Glu Leu Gly Thr Ala Ile Leu Phe
580 585 590
Phe Gly Cys Arg Asn Arg Lys Val Asp Phe Ile Tyr Glu Asp Glu Leu
595 600 605
Asn Asn Phe Val Glu Thr Gly Ala Leu Ser Glu Leu Val Thr Ala Phe
610 615 620
Ser Arg Glu Gly Ala Thr Lys Glu Tyr Val Gln His Lys Met Thr Gln
625 630 635 640
Lys Ala Ser Asp Ile Trp Asn Leu Leu Ser Glu Gly Ala Tyr Leu Tyr
645 650 655
Val Cys Gly Asp Ala Lys Gly Met Ala Lys Asp Val His Arg Thr Leu
660 665 670
His Thr Ile Val Gln Glu Gln Gly Ser Leu Asp Ser Ser Lys Ala Glu
675 680 685
Leu Tyr Val Lys Asn Leu Gln Met Ala Gly Arg Tyr Leu Arg Asp Val
690 695 700
Trp
705
<210> 69
<211> 2118
<212> DNA
<213> 青蒿
<400> 69
atggcgcagt ctaccaccag cgtgaaattg tccccttttg atctcatgac cgctctgctg 60
aatgggaaag tctcattcga tacaagcaac acaagtgata ccaacatccc tttggcggtt 120
tttatggaga atagagagtt actcatgatt ttaacgactt ccgtggccgt gctcattggg 180
tgcgtcgtcg tacttgtctg gcgccggtca agtagcgcag caaagaaggc ggcggagtcc 240
ccggttatcg tcgtcccaaa gaaggttaca gaggacgagg tggacgacgg acgcaaaaaa 300
gtgacggtat tctttggtac acagaccgga accgcggaag gatttgctaa agcgctggtg 360
gaagaagcta aagcccgtta cgaaaaagcg gtattcaaag tgatagacct ggatgactat 420
gcggcagagg acgacgagta cgaggaaaaa ttgaaaaaag aatctcttgc gttctttttt 480
ctcgccactt acggcgatgg agaacctact gataatgcgg ctcggtttta taagtggttc 540
actgagggtg aagaaaaagg tgaatggctg gacaaattgc agtacgcagt atttggactc 600
gggaatcgtc aatatgaaca ttttaacaaa attgctaagg tcgtcgatga aaaactggtt 660
gagcagggtg cgaaacgtct ggtcccggtt ggaatgggcg atgacgacca gtgcattgaa 720
gacgacttta cagcatggaa ggaactggtg tggccggaac tggaccaact tttgcgtgac 780
gaggatgaca catctgtagc tacgccgtac actgctgcgg tagccgagta tagggtcgtt 840
tttcacgata aaccggaaac ctacgaccaa gaccagctca caaatggtca tgcagtacat 900
gatgcgcaac atccttgcag gtcaaatgtg gcggtgaaga aagagctgca cagtcctctg 960
tcagatcgtt cttgcaccca cctggaattt gacatatcca atacgggcct ttcgtatgaa 1020
accggagatc acgttggtgt ctatgttgaa aatctgtcgg aagtggttga tgaggcggaa 1080
aaacttatcg gtctgccgcc tcatacgtac ttttcagtcc acgctgataa tgaagacgga 1140
accccgctgg gtggcgcatc gttaccccca ccctttccac catgcactct gcgtaaggcg 1200
cttgccagtt atgctgatgt tttgtctagt cccaaaaaga gtgcacttct cgcactggcg 1260
gcccatgcca ctgatagtac agaggccgac aggctgaaat ttctggcgtc accagcggga 1320
aaagacgaat acgcccaatg gatcgttgcc agtcatcggt ctttactgga agtgatggaa 1380
gcgttccctt ccgctaagcc acctctgggg gtctttttcg ctagtgtggc accccgtcta 1440
cagccccggt attactcaat atctagctca cccagatttg ctccgaatag aatacacgta 1500
acatgcgcgc tggtctatga gcagacccca agtggacggg tgcataaagg ggtttgttct 1560
acctggatga agaacgccgt cccaatgacc gagtctcagg attgttcctg ggcacctata 1620
tatgttagaa catcaaactt tcgactgcca agtgacccga aagttccggt aattatgata 1680
ggtccaggaa cagggctggc tccctttcgc gggttcctcc aggaacgtct ggcgcagaag 1740
gaggcgggaa ctgaactggg gacggcgatt ttatttttcg ggtgtagaaa tcgtaaagtc 1800
gattttatat atgaagatga gttgaacaat ttcgtggaaa ccggggcatt atcggaatta 1860
gttacggctt ttagcaggga gggggcgact aaagagtatg tccagcacaa gatgactcag 1920
aaagcctcag atatatggaa cctgctgtcg gagggagcct atctttatgt ttgcggtgat 1980
gcaaaaggaa tggccaaaga tgtccaccgg accctccaca ctattgtgca ggaacagggt 2040
tcattagact caagtaaagc cgaactttac gtaaaaaatc tacagatggc gggccgttac 2100
ctccgtgacg tttggtaa 2118
<210> 70
<211> 693
<212> PRT
<213> 拟南芥
<400> 70
Met Ala Thr Ser Ala Leu Tyr Ala Ser Asp Leu Phe Lys Gln Leu Lys
1 5 10 15
Ser Ile Met Gly Thr Asp Ser Leu Ser Asp Asp Val Val Leu Val Ile
20 25 30
Ala Thr Thr Ser Leu Ala Leu Val Ala Gly Phe Val Val Leu Leu Trp
35 40 45
Lys Lys Thr Thr Ala Asp Arg Ser Gly Glu Leu Lys Pro Leu Met Ile
50 55 60
Pro Lys Ser Leu Met Ala Lys Asp Glu Asp Asp Asp Leu Asp Leu Gly
65 70 75 80
Ser Gly Lys Thr Arg Val Ser Ile Phe Phe Gly Thr Gln Thr Gly Thr
85 90 95
Ala Glu Gly Phe Ala Lys Ala Leu Ser Glu Glu Ile Lys Ala Arg Tyr
100 105 110
Glu Lys Ala Ala Val Lys Val Ile Asp Leu Asp Asp Tyr Ala Ala Asp
115 120 125
Asp Asp Gln Tyr Glu Glu Lys Leu Lys Lys Glu Thr Leu Ala Phe Phe
130 135 140
Cys Val Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg
145 150 155 160
Phe Tyr Lys Trp Phe Thr Glu Glu Asn Glu Arg Asp Ile Lys Leu Gln
165 170 175
Gln Leu Ala Tyr Gly Val Phe Ala Leu Gly Asn Arg Gln Tyr Glu His
180 185 190
Phe Asn Lys Ile Gly Ile Val Leu Asp Glu Glu Leu Cys Lys Lys Gly
195 200 205
Ala Lys Arg Leu Ile Glu Val Gly Leu Gly Asp Asp Asp Gln Ser Ile
210 215 220
Glu Asp Asp Phe Asn Ala Trp Lys Glu Ser Leu Trp Ser Glu Leu Asp
225 230 235 240
Lys Leu Leu Lys Asp Glu Asp Asp Lys Ser Val Ala Thr Pro Tyr Thr
245 250 255
Ala Val Ile Pro Glu Tyr Arg Val Val Thr His Asp Pro Arg Phe Thr
260 265 270
Thr Gln Lys Ser Met Glu Ser Asn Val Ala Asn Gly Asn Thr Thr Ile
275 280 285
Asp Ile His His Pro Cys Arg Val Asp Val Ala Val Gln Lys Glu Leu
290 295 300
His Thr His Glu Ser Asp Arg Ser Cys Ile His Leu Glu Phe Asp Ile
305 310 315 320
Ser Arg Thr Gly Ile Thr Tyr Glu Thr Gly Asp His Val Gly Val Tyr
325 330 335
Ala Glu Asn His Val Glu Ile Val Glu Glu Ala Gly Lys Leu Leu Gly
340 345 350
His Ser Leu Asp Leu Val Phe Ser Ile His Ala Asp Lys Glu Asp Gly
355 360 365
Ser Pro Leu Glu Ser Ala Val Pro Pro Pro Phe Pro Gly Pro Cys Thr
370 375 380
Leu Gly Thr Gly Leu Ala Arg Tyr Ala Asp Leu Leu Asn Pro Pro Arg
385 390 395 400
Lys Ser Ala Leu Val Ala Leu Ala Ala Tyr Ala Thr Glu Pro Ser Glu
405 410 415
Ala Glu Lys Leu Lys His Leu Thr Ser Pro Asp Gly Lys Asp Glu Tyr
420 425 430
Ser Gln Trp Ile Val Ala Ser Gln Arg Ser Leu Leu Glu Val Met Ala
435 440 445
Ala Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala Ala Ile
450 455 460
Ala Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro Arg
465 470 475 480
Leu Ala Pro Ser Arg Val His Val Thr Ser Ala Leu Val Tyr Gly Pro
485 490 495
Thr Pro Thr Gly Arg Ile His Lys Gly Val Cys Ser Thr Trp Met Lys
500 505 510
Asn Ala Val Pro Ala Glu Lys Ser His Glu Cys Ser Gly Ala Pro Ile
515 520 525
Phe Ile Arg Ala Ser Asn Phe Lys Leu Pro Ser Asn Pro Ser Thr Pro
530 535 540
Ile Val Met Val Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe
545 550 555 560
Leu Gln Glu Arg Met Ala Leu Lys Glu Asp Gly Glu Glu Leu Gly Ser
565 570 575
Ser Leu Leu Phe Phe Gly Cys Arg Asn Arg Gln Met Asp Phe Ile Tyr
580 585 590
Glu Asp Glu Leu Asn Asn Phe Val Asp Gln Gly Val Ile Ser Glu Leu
595 600 605
Ile Met Ala Phe Ser Arg Glu Gly Ala Gln Lys Glu Tyr Val Gln His
610 615 620
Lys Met Met Glu Lys Ala Ala Gln Val Trp Asp Leu Ile Lys Glu Glu
625 630 635 640
Gly Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Arg Asp Val
645 650 655
His Arg Thr Leu His Thr Ile Val Gln Glu Gln Glu Gly Val Ser Ser
660 665 670
Ser Glu Ala Glu Ala Ile Val Lys Lys Leu Gln Thr Glu Gly Arg Tyr
675 680 685
Leu Arg Asp Val Trp
690
<210> 71
<211> 2082
<212> DNA
<213> 拟南芥
<400> 71
atggcgacca gcgctctgta tgctagtgac ctttttaaac agctcaaaag catcatgggc 60
actgatagcc tgtccgacga tgttgtcctg gtaatcgcaa ccacttccct tgcgcttgtt 120
gcgggctttg tggtgttact gtggaagaag actaccgcag ataggagtgg tgaattgaaa 180
ccgctgatga tcccaaaaag tctgatggcc aaagatgagg atgatgatct ggatcttgga 240
tcagggaaga cgcgagtcag tatttttttc gggacccaga cgggcaccgc ggagggcttc 300
gccaaagctc tgtccgagga aataaaggcc agatacgaga aagccgccgt aaaggttata 360
gacctagatg attacgccgc tgatgacgat cagtatgagg agaaacttaa aaaggagact 420
ctggcgtttt tttgcgtggc aacttacgga gacggcgagc ccaccgataa tgcagctagg 480
ttttacaagt ggtttaccga ggagaacgaa cgagatataa agttacagca gttggcctat 540
ggcgtgtttg ccctgggtaa tcggcaatat gagcatttca acaaaattgg catcgttctg 600
gatgaggaat tgtgcaaaaa gggtgcaaaa cggctgatag aggtgggtct aggtgacgat 660
gatcaatcta tagaagacga ttttaatgcg tggaaagaga gcttatggag tgaactggat 720
aagctcttga aagatgaaga cgacaagtca gtggcgaccc cttataccgc ggtaatcccg 780
gaataccgcg tcgtgacaca cgatccgagg tttacaaccc aaaaatctat ggagtctaat 840
gtcgccaatg gcaacacaac gattgatatt caccacccct gtcgtgttga cgtggctgtt 900
caaaaagaac ttcatacaca cgaaagtgac cgaagttgca tacacttgga atttgacatt 960
agtcgcaccg gaattacgta tgaaactggt gatcacgtgg gtgtatacgc agaaaatcat 1020
gtcgaaatag tagaagaagc tggcaaactg ctgggacatt cactcgatct agtgtttagt 1080
atacatgccg ataaagagga tggcagccca ttggaaagtg ccgtccctcc gccgtttcct 1140
ggaccgtgta ctctggggac gggactcgcc cgctatgctg acctgttaaa cccccctcgt 1200
aaaagcgccc ttgtggccct ggcggcatac gcaactgaac cgagcgaagc ggagaagctg 1260
aaacatctga catcaccgga tggcaaagac gagtatagtc agtggatagt agcctctcag 1320
cgctctctgc tggaagtgat ggccgcattt ccgtccgcca aaccaccttt gggagtattt 1380
ttcgctgcta tcgcacctcg gctccagccg cgctattaca gcatatcttc aagtccccgc 1440
ttagcaccgt ctcgtgtcca tgtcacttct gcgttggttt atggtccgac tccaacaggt 1500
cgcatccaca aaggtgtctg ttcaacctgg atgaaaaacg cggtgcccgc ggagaaatct 1560
catgagtgca gtggtgcacc tatttttatc cgcgcaagta acttcaaact cccttctaat 1620
ccgagcacgc ccattgtgat ggttggccca ggcactggcc ttgctccgtt tcgcggtttt 1680
ctacaggagc ggatggccct taaagaagat ggggaagaat tgggatcatc gttgctcttt 1740
tttggctgcc gaaatcgcca gatggatttt atctacgaag acgagttgaa taactttgtc 1800
gatcaaggag taatttcgga gttgattatg gcattttcac gcgaaggggc tcagaaagag 1860
tatgtccaac acaagatgat ggaaaaagcg gcacaagtgt gggatcttat taaagaagaa 1920
ggctatcttt atgtatgtgg ggatgcgaaa ggtatggccc gtgatgtcca tcgcaccctg 1980
cacacgattg tacaggaaca ggaaggtgtg tcctcgtccg aagcagaagc aatcgttaaa 2040
aaactgcaaa cagagggtcg ttaccttcgc gacgtgtggt aa 2082
<210> 72
<211> 712
<212> PRT
<213> 拟南芥
<400> 72
Met Ala Ser Ser Ser Ser Ser Ser Ser Thr Ser Met Ile Asp Leu Met
1 5 10 15
Ala Ala Ile Ile Lys Gly Glu Pro Val Ile Val Ser Asp Pro Ala Asn
20 25 30
Ala Ser Ala Tyr Glu Ser Val Ala Ala Glu Leu Ser Ser Met Leu Ile
35 40 45
Glu Asn Arg Gln Phe Ala Met Ile Val Thr Thr Ser Ile Ala Val Leu
50 55 60
Ile Gly Cys Ile Val Met Leu Val Trp Arg Arg Ser Gly Ser Gly Asn
65 70 75 80
Ser Lys Arg Val Glu Pro Leu Lys Pro Leu Val Ile Lys Pro Arg Glu
85 90 95
Glu Glu Ile Asp Asp Gly Arg Lys Lys Val Thr Ile Phe Phe Gly Thr
100 105 110
Gln Thr Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Gly Glu Glu Ala
115 120 125
Lys Ala Arg Tyr Glu Lys Thr Arg Phe Lys Ile Val Asp Leu Asp Asp
130 135 140
Tyr Ala Ala Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Asp
145 150 155 160
Val Ala Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp
165 170 175
Asn Ala Ala Arg Phe Tyr Lys Trp Phe Thr Glu Gly Asn Asp Arg Gly
180 185 190
Glu Trp Leu Lys Asn Leu Lys Tyr Gly Val Phe Gly Leu Gly Asn Arg
195 200 205
Gln Tyr Glu His Phe Asn Lys Val Ala Lys Val Val Asp Asp Ile Leu
210 215 220
Val Glu Gln Gly Ala Gln Arg Leu Val Gln Val Gly Leu Gly Asp Asp
225 230 235 240
Asp Gln Cys Ile Glu Asp Asp Phe Thr Ala Trp Arg Glu Ala Leu Trp
245 250 255
Pro Glu Leu Asp Thr Ile Leu Arg Glu Glu Gly Asp Thr Ala Val Ala
260 265 270
Thr Pro Tyr Thr Ala Ala Val Leu Glu Tyr Arg Val Ser Ile His Asp
275 280 285
Ser Glu Asp Ala Lys Phe Asn Asp Ile Asn Met Ala Asn Gly Asn Gly
290 295 300
Tyr Thr Val Phe Asp Ala Gln His Pro Tyr Lys Ala Asn Val Ala Val
305 310 315 320
Lys Arg Glu Leu His Thr Pro Glu Ser Asp Arg Ser Cys Ile His Leu
325 330 335
Glu Phe Asp Ile Ala Gly Ser Gly Leu Thr Tyr Glu Thr Gly Asp His
340 345 350
Val Gly Val Leu Cys Asp Asn Leu Ser Glu Thr Val Asp Glu Ala Leu
355 360 365
Arg Leu Leu Asp Met Ser Pro Asp Thr Tyr Phe Ser Leu His Ala Glu
370 375 380
Lys Glu Asp Gly Thr Pro Ile Ser Ser Ser Leu Pro Pro Pro Phe Pro
385 390 395 400
Pro Cys Asn Leu Arg Thr Ala Leu Thr Arg Tyr Ala Cys Leu Leu Ser
405 410 415
Ser Pro Lys Lys Ser Ala Leu Val Ala Leu Ala Ala His Ala Ser Asp
420 425 430
Pro Thr Glu Ala Glu Arg Leu Lys His Leu Ala Ser Pro Ala Gly Lys
435 440 445
Asp Glu Tyr Ser Lys Trp Val Val Glu Ser Gln Arg Ser Leu Leu Glu
450 455 460
Val Met Ala Glu Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe
465 470 475 480
Ala Gly Val Ala Pro Arg Leu Gln Pro Arg Phe Tyr Ser Ile Ser Ser
485 490 495
Ser Pro Lys Ile Ala Glu Thr Arg Ile His Val Thr Cys Ala Leu Val
500 505 510
Tyr Glu Lys Met Pro Thr Gly Arg Ile His Lys Gly Val Cys Ser Thr
515 520 525
Trp Met Lys Asn Ala Val Pro Tyr Glu Lys Ser Glu Asn Cys Ser Ser
530 535 540
Ala Pro Ile Phe Val Arg Gln Ser Asn Phe Lys Leu Pro Ser Asp Ser
545 550 555 560
Lys Val Pro Ile Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe
565 570 575
Arg Gly Phe Leu Gln Glu Arg Leu Ala Leu Val Glu Ser Gly Val Glu
580 585 590
Leu Gly Pro Ser Val Leu Phe Phe Gly Cys Arg Asn Arg Arg Met Asp
595 600 605
Phe Ile Tyr Glu Glu Glu Leu Gln Arg Phe Val Glu Ser Gly Ala Leu
610 615 620
Ala Glu Leu Ser Val Ala Phe Ser Arg Glu Gly Pro Thr Lys Glu Tyr
625 630 635 640
Val Gln His Lys Met Met Asp Lys Ala Ser Asp Ile Trp Asn Met Ile
645 650 655
Ser Gln Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala
660 665 670
Arg Asp Val His Arg Ser Leu His Thr Ile Ala Gln Glu Gln Gly Ser
675 680 685
Met Asp Ser Thr Lys Ala Glu Gly Phe Val Lys Asn Leu Gln Thr Ser
690 695 700
Gly Arg Tyr Leu Arg Asp Val Trp
705 710
<210> 73
<211> 2139
<212> DNA
<213> 拟南芥
<400> 73
atggcgtcca gcagttcatc gagttctaca agcatgatcg atctgatggc cgctattatc 60
aaaggggaac ccgtaattgt ctctgatcca gcaaatgcct cggcatacga gtcggtggct 120
gccgaattat catctatgtt aattgaaaat agacaatttg ccatgattgt gacaacttct 180
attgctgtgc tgataggttg catcgtcatg ctcgtgtggc gccgtagcgg atcaggcaac 240
tcaaagcgcg tcgagccttt gaaacccctg gttatcaaac cgcgagagga ggaaatcgat 300
gatggcagaa aaaaggttac tatctttttt ggcacacaga cggggacagc ggaaggtttc 360
gcgaaagcac tcggagagga agcgaaagcc cgatacgaga aaacacggtt caaaattgtg 420
gatctggatg actatgcggc tgatgatgat gagtatgaag aaaaactgaa aaaagaagat 480
gtggcgtttt tttttcttgc cacttatggc gacggagagc ccaccgataa tgcagcgcgg 540
ttttacaagt ggttcaccga aggaaatgat cggggagaat ggttaaaaaa tctgaaatac 600
ggtgtgttcg gtcttggcaa tcgccaatat gagcatttta ataaagtcgc gaaagtggtc 660
gatgatatat tggtagaaca gggcgctcag cgcctcgtcc aggtggggct tggcgacgat 720
gatcagtgca tagaagatga ttttactgca tggcgtgaag cgctgtggcc ggagctggac 780
accattttac gtgaagaggg cgatacagca gtggcaaccc cgtacacggc tgccgtctta 840
gagtatcgtg tgtccattca tgatagcgag gatgccaaat tcaatgacat caatatggcg 900
aatggaaatg ggtacaccgt gtttgacgcg cagcacccgt ataaggcaaa cgttgcagtc 960
aagagggaac tgcatactcc tgaaagtgat cgcagttgca tccacctgga gttcgatatt 1020
gcgggatcag gtttaacgta cgaaacgggc gaccacgtag gtgtgctgtg cgacaatctt 1080
tcagagacag tggacgaagc tctgcgcctg ctggatatga gcccggatac ctattttagc 1140
ttgcacgctg agaaagaaga tgggactcca attagcagta gcttacctcc accctttccg 1200
ccgtgtaatt tgcgtaccgc ccttacgcgc tatgcgtgtc tgctgagttc gccaaagaag 1260
tcggcccttg tggcactggc ggcacatgca agtgacccga ccgaggcgga gaggctgaaa 1320
catctggctt ctccagcggg caaagatgaa tacagcaaat gggtggtaga atcacagcgt 1380
tccctactag aagtaatggc cgaatttccc tcagctaaac caccgctggg agtgttcttt 1440
gcgggcgttg ctccccgctt gcaaccacgc ttttattcaa ttagctcaag tcctaagata 1500
gcggaaacac ggatacatgt aacttgcgca ttggtttatg aaaaaatgcc aaccgggagg 1560
atacataaag gcgtatgttc aacctggatg aaaaatgctg tgccatacga aaagtcggag 1620
aattgctcct ctgccccaat tttcgtgcgt caaagcaact ttaaactgcc gagtgattca 1680
aaggtgccta ttattatgat aggccctggt acaggactcg ccccgtttcg tggttttctt 1740
caagaaagac tggctctggt cgaatcaggc gtggaattag gaccctccgt gttatttttt 1800
ggctgccgca accgtcgaat ggacttcatc tatgaagaag aattgcaacg ttttgtggag 1860
tcaggcgctc tggcggaact atccgtcgcc tttagtagag aaggcccaac caaagaatac 1920
gtacagcata agatgatgga taaagcgagc gacatttgga atatgatctc acaaggggcg 1980
tacctgtacg tatgtggaga tgccaaagga atggcacgag acgtacatag atcgttgcat 2040
actattgctc aagaacaggg aagcatggat tcgactaaag cagaaggctt tgttaaaaat 2100
ctacagacat ctggtcgcta tctgcgtgac gtgtggtaa 2139
<210> 74
<211> 713
<212> PRT
<213> 拟南芥
<400> 74
Met Ala Ser Ser Ser Ser Ser Ser Ser Thr Ser Met Ile Asp Leu Met
1 5 10 15
Ala Ala Ile Ile Lys Gly Glu Pro Val Ile Val Ser Asp Pro Ala Asn
20 25 30
Ala Ser Ala Tyr Glu Ser Val Ala Ala Glu Leu Ser Ser Met Leu Ile
35 40 45
Glu Asn Arg Gln Phe Ala Met Ile Val Thr Thr Ser Ile Ala Val Leu
50 55 60
Ile Gly Cys Ile Val Met Leu Val Trp Arg Arg Ser Gly Ser Gly Asn
65 70 75 80
Ser Lys Arg Val Glu Pro Leu Lys Pro Leu Val Ile Lys Pro Arg Glu
85 90 95
Glu Glu Ile Asp Asp Gly Arg Lys Lys Val Thr Ile Phe Phe Gly Thr
100 105 110
Gln Thr Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Gly Glu Glu Ala
115 120 125
Lys Ala Arg Tyr Glu Lys Thr Arg Phe Lys Ile Val Asp Leu Asp Asp
130 135 140
Tyr Ala Ala Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Asp
145 150 155 160
Val Ala Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp
165 170 175
Asn Ala Ala Arg Phe Tyr Lys Trp Phe Thr Glu Gly Asn Asp Arg Gly
180 185 190
Glu Trp Leu Lys Asn Leu Lys Tyr Gly Val Phe Gly Leu Gly Asn Arg
195 200 205
Gln Tyr Glu His Phe Asn Lys Val Ala Lys Val Val Asp Asp Ile Leu
210 215 220
Val Glu Gln Gly Ala Gln Arg Leu Val Gln Val Gly Leu Gly Asp Asp
225 230 235 240
Asp Gln Cys Ile Glu Asp Asp Phe Thr Ala Trp Arg Glu Ala Leu Trp
245 250 255
Pro Glu Leu Asp Thr Ile Leu Arg Glu Glu Gly Asp Thr Ala Val Ala
260 265 270
Thr Pro Tyr Thr Ala Ala Val Leu Glu Tyr Arg Val Ser Ile His Asp
275 280 285
Ser Glu Asp Ala Lys Phe Asn Asp Ile Thr Leu Ala Asn Gly Asn Gly
290 295 300
Tyr Thr Val Phe Asp Ala Gln His Pro Tyr Lys Ala Asn Val Ala Val
305 310 315 320
Lys Arg Glu Leu His Thr Pro Glu Ser Asp Arg Ser Cys Ile His Leu
325 330 335
Glu Phe Asp Ile Ala Gly Ser Gly Leu Thr Met Lys Leu Gly Asp His
340 345 350
Val Gly Val Leu Cys Asp Asn Leu Ser Glu Thr Val Asp Glu Ala Leu
355 360 365
Arg Leu Leu Asp Met Ser Pro Asp Thr Tyr Phe Ser Leu His Ala Glu
370 375 380
Lys Glu Asp Gly Thr Pro Ile Ser Ser Ser Leu Pro Pro Pro Phe Pro
385 390 395 400
Pro Cys Asn Leu Arg Thr Ala Leu Thr Arg Tyr Ala Cys Leu Leu Ser
405 410 415
Ser Pro Lys Lys Ser Ala Leu Val Ala Leu Ala Ala His Ala Ser Asp
420 425 430
Pro Thr Glu Ala Glu Arg Leu Lys His Leu Ala Ser Pro Ala Gly Lys
435 440 445
Asp Glu Tyr Ser Lys Trp Val Val Glu Ser Gln Arg Ser Leu Leu Glu
450 455 460
Val Met Ala Glu Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe
465 470 475 480
Ala Gly Val Ala Pro Arg Leu Gln Pro Arg Phe Tyr Ser Ile Ser Ser
485 490 495
Ser Pro Lys Ile Ala Glu Thr Arg Ile His Val Thr Cys Ala Leu Val
500 505 510
Tyr Glu Lys Met Pro Thr Gly Arg Ile His Lys Gly Val Cys Ser Thr
515 520 525
Trp Met Lys Asn Ala Val Pro Tyr Glu Lys Ser Glu Lys Leu Phe Leu
530 535 540
Gly Arg Pro Ile Phe Val Arg Gln Ser Asn Phe Lys Leu Pro Ser Asp
545 550 555 560
Ser Lys Val Pro Ile Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro
565 570 575
Phe Arg Gly Phe Leu Gln Glu Arg Leu Ala Leu Val Glu Ser Gly Val
580 585 590
Glu Leu Gly Pro Ser Val Leu Phe Phe Gly Cys Arg Asn Arg Arg Met
595 600 605
Asp Phe Ile Tyr Glu Glu Glu Leu Gln Arg Phe Val Glu Ser Gly Ala
610 615 620
Leu Ala Glu Leu Ser Val Ala Phe Ser Arg Glu Gly Pro Thr Lys Glu
625 630 635 640
Tyr Val Gln His Lys Met Met Asp Lys Ala Ser Asp Ile Trp Asn Met
645 650 655
Ile Ser Gln Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met
660 665 670
Ala Arg Asp Val His Arg Ser Leu His Thr Ile Ala Gln Glu Gln Gly
675 680 685
Ser Met Asp Ser Thr Lys Ala Glu Gly Phe Val Lys Asn Leu Gln Thr
690 695 700
Ser Gly Arg Tyr Leu Arg Asp Val Trp
705 710
<210> 75
<211> 2142
<212> DNA
<213> 拟南芥
<400> 75
atggcgagca gttcgtcctc ctcttctacc agtatgatcg atctgatggc cgctattata 60
aaaggagaac cagtcattgt gtctgatcct gcaaacgcat cagcctacga atctgtggct 120
gctgaactgt cctcgatgct gatcgaaaat cgccaatttg caatgattgt tacaaccagc 180
atcgctgttc ttatcgggtg tattgtcatg ctggtttggc ggcggagtgg cagcggcaat 240
tctaaaagag tggagccact gaagcctctg gtaatcaaac cccgcgaaga agaaatcgat 300
gatggacgta agaaagttac aatttttttt ggtacacaga caggtacagc agagggcttt 360
gccaaagctc ttggagaaga agcaaaagct cgatatgaga aaacacgctt caagatcgtc 420
gatctggatg actacgcggc agacgacgat gagtacgaag aaaaactcaa aaaagaggat 480
gtggcttttt ttttcctggc aacttatggg gacggcgagc ctaccgacaa tgcagcgcgg 540
ttttacaaat ggtttaccga aggcaatgat agaggggagt ggctcaaaaa tctcaaatac 600
ggagttttcg gattggggaa tagacaatac gaacacttta ataaggttgc gaaagtggta 660
gatgatattc tggtcgagca gggcgcgcaa cgtttagtac aggtcggcct gggtgatgac 720
gaccagtgca tcgaagatga ctttacggcc tggcgagaag cgctctggcc ggaattagat 780
acaatccttc gggaagaggg ggacactgct gtcgctaccc cgtacactgc cgcagtgctg 840
gaatatcgtg tttcaataca tgattcggaa gatgccaagt ttaatgacat caccctggca 900
aacggcaacg gatataccgt atttgacgct caacatccgt ataaggccaa tgtagcagta 960
aagcgggaac tccatactcc cgaaagtgac agaagttgca tccatctgga gttcgatata 1020
gcgggaagcg gactgactat gaaactggga gatcatgtag gggtcctgtg cgataatttg 1080
agcgaaaccg ttgacgaagc gctccggctt ttagatatgt cccctgatac ttatttctct 1140
ttgcacgccg agaaggaaga tggtacacct atatcctcct cgctgccgcc gccttttcca 1200
ccatgtaatc tgcgtacggc cttgactagg tatgcatgtc ttcttagctc cccgaaaaag 1260
tccgcactgg tagcgttggc agctcatgcc agcgatccca cggaggcaga gcgtttaaaa 1320
cacctggcga gtcctgctgg caaagatgaa tacagcaaat gggtggttga gtcgcagagg 1380
tccctgctgg aagtcatggc tgaatttccg tctgcgaaac cgcctctggg agttttcttc 1440
gcaggagtag ccccacgttt acaaccgcgt ttctattcta tttcttcctc ccccaagatc 1500
gcggaaactc gaatacacgt aacgtgcgca ttggtgtatg aaaagatgcc aactggtcgt 1560
atccacaagg gagtgtgctc aacctggatg aaaaacgccg ttccgtatga aaaatcggaa 1620
aaattgtttt tgggtagacc catattcgtt cggcagtcaa actttaaact accttctgat 1680
agcaaggttc cgattattat gattggaccg ggtactggcc tggcgccgtt ccgtggtttc 1740
ctgcaagaac ggttggcgct ggtggaatcc ggcgtggaac ttgggccatc ggttttgttt 1800
ttcgggtgcc gcaatcgtcg catggacttc atctacgagg aagaactcca gcgttttgtc 1860
gaaagcggtg cccttgctga attgtccgtt gcattcagcc gcgaaggtcc aactaaggag 1920
tatgtgcagc acaaaatgat ggacaaagcg agcgatattt ggaatatgat tagccagggc 1980
gcataccttt atgtgtgcgg tgatgctaag ggaatggcgc gcgatgtcca tagatcttta 2040
cataccattg cacaggagca gggctctatg gattcaacaa aagctgaagg ttttgtgaaa 2100
aaccttcaga ccagcgggcg gtatcttcgc gatgtttggt aa 2142
<210> 76
<211> 711
<212> PRT
<213> 甜菊
<400> 76
Met Ala Gln Ser Asp Ser Val Lys Val Ser Pro Phe Asp Leu Val Ser
1 5 10 15
Ala Ala Met Asn Gly Lys Ala Met Glu Lys Leu Asn Ala Ser Glu Ser
20 25 30
Glu Asp Pro Thr Thr Leu Pro Ala Leu Lys Met Leu Val Glu Asn Arg
35 40 45
Glu Leu Leu Thr Leu Phe Thr Thr Ser Phe Ala Val Leu Ile Gly Cys
50 55 60
Leu Val Phe Leu Met Trp Arg Arg Ser Ser Ser Lys Lys Leu Val Gln
65 70 75 80
Asp Pro Val Pro Gln Val Ile Val Val Lys Lys Lys Glu Lys Glu Ser
85 90 95
Glu Val Asp Asp Gly Lys Lys Lys Val Ser Ile Phe Tyr Gly Thr Gln
100 105 110
Thr Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Val Glu Glu Ala Lys
115 120 125
Val Arg Tyr Glu Lys Thr Ser Phe Lys Val Ile Asp Leu Asp Asp Tyr
130 135 140
Ala Ala Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Ser Leu
145 150 155 160
Ala Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn
165 170 175
Ala Ala Asn Phe Tyr Lys Trp Phe Thr Glu Gly Asp Asp Lys Gly Glu
180 185 190
Leu Leu Lys Lys Leu Gln Tyr Gly Val Phe Gly Leu Gly Asn Arg Gln
195 200 205
Tyr Glu His Phe Asn Lys Ile Ala Ile Val Val Asp Asp Lys Leu Thr
210 215 220
Glu Met Gly Ala Lys Arg Leu Val Pro Val Gly Leu Gly Asp Asp Asp
225 230 235 240
Gln Cys Ile Glu Asp Asp Phe Thr Ala Trp Lys Glu Leu Val Trp Pro
245 250 255
Glu Leu Asp Gln Leu Leu Arg Asp Glu Asp Asp Thr Ser Val Thr Thr
260 265 270
Pro Tyr Thr Ala Ala Val Leu Glu Tyr Arg Val Val Tyr His Asp Lys
275 280 285
Pro Ala Asp Ser Tyr Ala Glu Asp Gln Thr His Thr Asn Gly His Val
290 295 300
Val His Asp Ala Gln His Pro Ser Arg Ser Asn Val Ala Phe Lys Lys
305 310 315 320
Glu Leu His Thr Ser Gln Ser Asp Arg Ser Cys Thr His Leu Glu Phe
325 330 335
Asp Ile Ser His Thr Gly Leu Ser Tyr Glu Thr Gly Asp His Val Gly
340 345 350
Val Tyr Ser Glu Asn Leu Ser Glu Val Val Asp Glu Ala Leu Lys Leu
355 360 365
Leu Gly Leu Ser Pro Asp Thr Tyr Phe Ser Val His Ala Asp Lys Glu
370 375 380
Asp Gly Thr Pro Ile Gly Gly Ala Ser Leu Pro Pro Pro Phe Pro Pro
385 390 395 400
Cys Thr Leu Arg Asp Ala Leu Thr Arg Tyr Ala Asp Val Leu Ser Ser
405 410 415
Pro Lys Lys Val Ala Leu Leu Ala Leu Ala Ala His Ala Ser Asp Pro
420 425 430
Ser Glu Ala Asp Arg Leu Lys Phe Leu Ala Ser Pro Ala Gly Lys Asp
435 440 445
Glu Tyr Ala Gln Trp Ile Val Ala Asn Gln Arg Ser Leu Leu Glu Val
450 455 460
Met Gln Ser Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala
465 470 475 480
Ala Val Ala Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser
485 490 495
Pro Lys Met Ser Pro Asn Arg Ile His Val Thr Cys Ala Leu Val Tyr
500 505 510
Glu Thr Thr Pro Ala Gly Arg Ile His Arg Gly Leu Cys Ser Thr Trp
515 520 525
Met Lys Asn Ala Val Pro Leu Thr Glu Ser Pro Asp Cys Ser Gln Ala
530 535 540
Ser Ile Phe Val Arg Thr Ser Asn Phe Arg Leu Pro Val Asp Pro Lys
545 550 555 560
Val Pro Val Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg
565 570 575
Gly Phe Leu Gln Glu Arg Leu Ala Leu Lys Glu Ser Gly Thr Glu Leu
580 585 590
Gly Ser Ser Ile Phe Phe Phe Gly Cys Arg Asn Arg Lys Val Asp Phe
595 600 605
Ile Tyr Glu Asp Glu Leu Asn Asn Phe Val Glu Thr Gly Ala Leu Ser
610 615 620
Glu Leu Ile Val Ala Phe Ser Arg Glu Gly Thr Ala Lys Glu Tyr Val
625 630 635 640
Gln His Lys Met Ser Gln Lys Ala Ser Asp Ile Trp Lys Leu Leu Ser
645 650 655
Glu Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Lys
660 665 670
Asp Val His Arg Thr Leu His Thr Ile Val Gln Glu Gln Gly Ser Leu
675 680 685
Asp Ser Ser Lys Ala Glu Leu Tyr Val Lys Asn Leu Gln Met Ser Gly
690 695 700
Arg Tyr Leu Arg Asp Val Trp
705 710
<210> 77
<211> 2136
<212> DNA
<213> 甜菊
<400> 77
atggctcaga gcgattctgt taaagtatcc ccgttcgacc tggtctctgc ggctatgaac 60
ggcaaagcaa tggagaaact gaacgcgagc gaatctgaag atccaaccac cctgccggca 120
ctgaaaatgc tggtagaaaa ccgtgaactg ctgactctgt tcaccacctc cttcgccgtt 180
ctgattggtt gcctggtctt cctgatgtgg cgccgttcct cttccaagaa gctggtacag 240
gacccggttc ctcaggtgat cgtcgttaaa aagaaagaga aggaaagcga agtcgatgac 300
ggcaaaaaga aggtttccat tttctacggt actcagaccg gcaccgctga gggttttgcc 360
aaagcactgg ttgaagaggc aaaagtgcgt tacgaaaaaa cttccttcaa agtgattgac 420
ctggacgact atgctgcgga tgatgatgaa tacgaggaaa aactgaaaaa agaaagcctg 480
gccttcttct tcctggcaac ctatggcgat ggtgaaccga ccgacaacgc ggcgaacttc 540
tacaaatggt ttaccgaagg cgacgacaaa ggtgaattgc tgaagaaact gcagtatggt 600
gttttcggtc tgggcaatcg ccagtacgaa cattttaaca aaatcgcaat cgttgttgat 660
gacaaactga ctgaaatggg tgcgaaacgt ctggtgccgg ttggcctggg tgacgatgat 720
caatgcatcg aagatgactt caccgcatgg aaagaactgg tttggccgga actggatcag 780
ctgctgcgcg acgaagacga cacttccgtg accaccccgt ataccgctgc agtgctggag 840
taccgtgttg tttaccacga taaaccggcg gactcttacg ccgaagatca gactcacact 900
aacggtcacg tcgtacatga cgcacagcac ccgtctcgta gcaatgttgc gtttaagaaa 960
gagctgcaca cgagccagtc cgaccgctct tgtacgcacc tggagttcga tatctcccac 1020
accggtctgt cctatgaaac cggtgaccat gttggcgttt acagcgaaaa cctgagcgag 1080
gtagttgatg aagcgctgaa actgctgggc ctgtctccag acacctactt tagcgtgcat 1140
gctgacaagg aagatggtac tccgattggc ggcgcttccc tgccgccacc gtttccacct 1200
tgcactctgc gtgatgctct gactcgttac gctgatgttc tgtctagccc gaaaaaggtt 1260
gcgctgctgg cgctggccgc acatgcttct gacccgtctg aagctgaccg tctgaaattc 1320
ctggcgtctc cggccggcaa agacgaatac gcgcagtgga ttgtcgctaa ccagcgctct 1380
ctgctggaag tgatgcagtc cttcccgtct gccaaaccgc cactgggcgt gtttttcgca 1440
gctgtggctc cgcgcctgca gccgcgctac tattctatct ctagctcccc gaaaatgagc 1500
ccgaaccgca tccacgttac ttgtgctctg gtttacgaaa ccacccctgc gggccgtatc 1560
caccgtggtc tgtgctctac gtggatgaaa aatgccgtgc cgctgaccga atccccggac 1620
tgctctcagg cgtccatctt cgtgcgtacc tctaacttcc gtctgccggt ggacccgaaa 1680
gttcctgtta tcatgatcgg tcctggcacg ggtctggccc cgtttcgtgg ttttctgcag 1740
gagcgtctgg ctctgaaaga atccggtact gagctgggct cttccatctt tttcttcggt 1800
tgtcgtaacc gcaaagtcga tttcatctat gaagacgaac tgaacaactt cgtagagact 1860
ggtgcactgt ccgaactgat tgtggcattc tctcgtgaag gcacggcgaa agaatacgtt 1920
caacacaaaa tgtctcagaa agcgagcgat atctggaaac tgctgtccga gggtgcgtat 1980
ctgtatgttt gtggcgacgc gaaaggcatg gctaaagatg tacaccgcac cctgcacacc 2040
attgtacaag aacaaggctc tctggatagc tccaaggcag aactgtacgt gaaaaacctg 2100
cagatgtctg gccgttacct gcgtgatgta tggtaa 2136
<210> 78
<211> 710
<212> PRT
<213> 甜菊
<400> 78
Met Ala Gln Ser Glu Ser Val Glu Ala Ser Thr Ile Asp Leu Met Thr
1 5 10 15
Ala Val Leu Lys Asp Thr Val Ile Asp Thr Ala Asn Ala Ser Asp Asn
20 25 30
Gly Asp Ser Lys Met Pro Pro Ala Leu Ala Met Met Phe Glu Ile Arg
35 40 45
Asp Leu Leu Leu Ile Leu Thr Thr Ser Val Ala Val Leu Val Gly Cys
50 55 60
Phe Val Val Leu Val Trp Lys Arg Ser Ser Gly Lys Lys Ser Gly Lys
65 70 75 80
Glu Leu Glu Pro Pro Lys Ile Val Val Pro Lys Arg Arg Leu Glu Gln
85 90 95
Glu Val Asp Asp Gly Lys Lys Lys Val Thr Ile Phe Phe Gly Thr Gln
100 105 110
Thr Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Phe Glu Glu Ala Lys
115 120 125
Ala Arg Tyr Glu Lys Ala Ala Phe Lys Val Ile Asp Leu Asp Asp Tyr
130 135 140
Ala Ala Asp Leu Asp Glu Tyr Ala Glu Lys Leu Lys Lys Glu Thr Tyr
145 150 155 160
Ala Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn
165 170 175
Ala Ala Lys Phe Tyr Lys Trp Phe Thr Glu Gly Asp Glu Lys Gly Val
180 185 190
Trp Leu Gln Lys Leu Gln Tyr Gly Val Phe Gly Leu Gly Asn Arg Gln
195 200 205
Tyr Glu His Phe Asn Lys Ile Gly Ile Val Val Asp Asp Gly Leu Thr
210 215 220
Glu Gln Gly Ala Lys Arg Ile Val Pro Val Gly Leu Gly Asp Asp Asp
225 230 235 240
Gln Ser Ile Glu Asp Asp Phe Ser Ala Trp Lys Glu Leu Val Trp Pro
245 250 255
Glu Leu Asp Leu Leu Leu Arg Asp Glu Asp Asp Lys Ala Ala Ala Thr
260 265 270
Pro Tyr Thr Ala Ala Ile Pro Glu Tyr Arg Val Val Phe His Asp Lys
275 280 285
Pro Asp Ala Phe Ser Asp Asp His Thr Gln Thr Asn Gly His Ala Val
290 295 300
His Asp Ala Gln His Pro Cys Arg Ser Asn Val Ala Val Lys Lys Glu
305 310 315 320
Leu His Thr Pro Glu Ser Asp Arg Ser Cys Thr His Leu Glu Phe Asp
325 330 335
Ile Ser His Thr Gly Leu Ser Tyr Glu Thr Gly Asp His Val Gly Val
340 345 350
Tyr Cys Glu Asn Leu Ile Glu Val Val Glu Glu Ala Gly Lys Leu Leu
355 360 365
Gly Leu Ser Thr Asp Thr Tyr Phe Ser Leu His Ile Asp Asn Glu Asp
370 375 380
Gly Ser Pro Leu Gly Gly Pro Ser Leu Gln Pro Pro Phe Pro Pro Cys
385 390 395 400
Thr Leu Arg Lys Ala Leu Thr Asn Tyr Ala Asp Leu Leu Ser Ser Pro
405 410 415
Lys Lys Ser Thr Leu Leu Ala Leu Ala Ala His Ala Ser Asp Pro Thr
420 425 430
Glu Ala Asp Arg Leu Arg Phe Leu Ala Ser Arg Glu Gly Lys Asp Glu
435 440 445
Tyr Ala Glu Trp Val Val Ala Asn Gln Arg Ser Leu Leu Glu Val Met
450 455 460
Glu Ala Phe Pro Ser Ala Arg Pro Pro Leu Gly Val Phe Phe Ala Ala
465 470 475 480
Val Ala Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro
485 490 495
Lys Met Glu Pro Asn Arg Ile His Val Thr Cys Ala Leu Val Tyr Glu
500 505 510
Lys Thr Pro Ala Gly Arg Ile His Lys Gly Ile Cys Ser Thr Trp Met
515 520 525
Lys Asn Ala Val Pro Leu Thr Glu Ser Gln Asp Cys Ser Trp Ala Pro
530 535 540
Ile Phe Val Arg Thr Ser Asn Phe Arg Leu Pro Ile Asp Pro Lys Val
545 550 555 560
Pro Val Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly
565 570 575
Phe Leu Gln Glu Arg Leu Ala Leu Lys Glu Ser Gly Thr Glu Leu Gly
580 585 590
Ser Ser Ile Leu Phe Phe Gly Cys Arg Asn Arg Lys Val Asp Tyr Ile
595 600 605
Tyr Glu Asn Glu Leu Asn Asn Phe Val Glu Asn Gly Ala Leu Ser Glu
610 615 620
Leu Asp Val Ala Phe Ser Arg Asp Gly Pro Thr Lys Glu Tyr Val Gln
625 630 635 640
His Lys Met Thr Gln Lys Ala Ser Glu Ile Trp Asn Met Leu Ser Glu
645 650 655
Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Lys Asp
660 665 670
Val His Arg Thr Leu His Thr Ile Val Gln Glu Gln Gly Ser Leu Asp
675 680 685
Ser Ser Lys Ala Glu Leu Tyr Val Lys Asn Leu Gln Met Ser Gly Arg
690 695 700
Tyr Leu Arg Asp Val Trp
705 710
<210> 79
<211> 2133
<212> DNA
<213> 甜菊
<400> 79
atggcgcaat ctgaaagtgt tgaggccagt accatcgacc ttatgacggc agtgttgaag 60
gatacagtta ttgacactgc aaatgcttca gataacggcg attctaaaat gcctcctgcg 120
cttgcgatga tgttcgagat ccgcgatctt ctgctgatcc ttaccacatc agtagcggtg 180
ctggtgggat gctttgtggt actcgtgtgg aaacgttcgt cgggcaaaaa atcaggtaag 240
gagctggaac cgcctaagat tgtcgtaccg aaacgccgac tggaacaaga agttgatgat 300
ggcaaaaaaa aagtgactat attttttggg acacagacag gcacagcgga gggatttgcg 360
aaagccttat tcgaggaggc gaaggcacgt tacgagaaag cagcttttaa agtcattgat 420
ctggatgact atgcagcaga cctagatgaa tacgcagaga aactgaaaaa agaaacttat 480
gcgtttttct tcctggccac atacggagac ggtgaaccga cggacaatgc cgccaagttt 540
tataagtggt tcactgaagg ggatgagaaa ggtgtatggc ttcagaaatt gcaatacgga 600
gtgttcggac taggaaatcg gcaatatgag cactttaata aaataggcat agtagtagac 660
gatgggctaa ccgagcaggg ggccaaacgg attgtacccg tgggcctggg ggacgatgat 720
cagtctattg aggatgattt tagtgcttgg aaagagcttg tttggcctga actggactta 780
ctcctgcgtg atgaagacga taaagcggca gcgactccat acacggcagc aatccccgag 840
tatagagtcg tattccatga taaaccggat gctttctctg atgaccatac ccaaactaat 900
ggtcatgcgg tccatgatgc acaacatccc tgccgcagca atgtagcggt gaaaaaggag 960
ctgcatacgc ctgaaagtga tcgctcatgt acgcatctgg agtttgatat ttcacacaca 1020
ggtcttagct acgagactgg agatcacgtc ggagtctatt gcgaaaatct gatcgaagtg 1080
gttgaagagg ccgggaaact gttgggacta agtacagata cttatttttc tttacatata 1140
gataacgagg atggttcccc acttggcggt ccatctcttc agcctccatt cccaccatgt 1200
accttacgca aagcgctgac taactacgca gatctgctgt ctagcccaaa gaaatcaacg 1260
cttctggcgt tggctgctca tgcctcagat ccgaccgaag ctgatcgcct tcgttttctg 1320
gcatcccgag aaggtaaaga tgaatatgca gaatgggtgg tagcgaatca gcgttctttg 1380
ctggaagtca tggaggcatt ccccagcgcg cgccctccgc tgggtgtttt cttcgcagcg 1440
gtggccccgc ggctccagcc gcgttattat tcaattagca gttctcctaa gatggaacct 1500
aatcgaatcc atgtaacatg tgcattggtc tatgagaaaa cgccggctgg ccgcatccat 1560
aaaggtatat gtagcacatg gatgaaaaat gcagtacccc tcacggagtc ccaggattgt 1620
agttgggcgc cgatatttgt tcggacgagc aattttagac ttcctataga cccaaaggtt 1680
ccagttatta tgattggtcc tggcaccgga cttgcgccat tccgggggtt tctgcaagaa 1740
agactggctc tgaaagaaag cggtacagaa ctcggctcca gtatattgtt tttcggctgt 1800
cgcaaccgga aagtagatta tatatatgaa aacgagctga ataacttcgt tgaaaatggt 1860
gccctgtctg aactcgatgt cgctttttcg cgagatggcc cgacaaaaga atacgtgcag 1920
cataaaatga cccagaaagc aagtgaaatc tggaatatgc tgtcagaagg ggcatatctg 1980
tatgtgtgcg gagatgcaaa gggcatggcc aaagacgttc acagaacctt gcataccata 2040
gtacaagagc agggctctct ggatagctca aaagccgagc tgtacgtgaa aaatctccag 2100
atgagtggac gctacctgag ggatgtttgg taa 2133
<210> 80
<211> 708
<212> PRT
<213> 甜菊
<400> 80
Met Ala Gln Ser Asn Ser Val Lys Ile Ser Pro Leu Asp Leu Val Thr
1 5 10 15
Ala Leu Phe Ser Gly Lys Val Leu Asp Thr Ser Asn Ala Ser Glu Ser
20 25 30
Gly Glu Ser Ala Met Leu Pro Thr Ile Ala Met Ile Met Glu Asn Arg
35 40 45
Glu Leu Leu Met Ile Leu Thr Thr Ser Val Ala Val Leu Ile Gly Cys
50 55 60
Val Val Val Leu Val Trp Arg Arg Ser Ser Thr Lys Lys Ser Ala Leu
65 70 75 80
Glu Pro Pro Val Ile Val Val Pro Lys Arg Val Gln Glu Glu Glu Val
85 90 95
Asp Asp Gly Lys Lys Lys Val Thr Val Phe Phe Gly Thr Gln Thr Gly
100 105 110
Thr Ala Glu Gly Phe Ala Lys Ala Leu Val Glu Glu Ala Lys Ala Arg
115 120 125
Tyr Glu Lys Ala Val Phe Lys Val Ile Asp Leu Asp Asp Tyr Ala Ala
130 135 140
Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Ser Leu Ala Phe
145 150 155 160
Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala
165 170 175
Arg Phe Tyr Lys Trp Phe Thr Glu Gly Asp Ala Lys Gly Glu Trp Leu
180 185 190
Asn Lys Leu Gln Tyr Gly Val Phe Gly Leu Gly Asn Arg Gln Tyr Glu
195 200 205
His Phe Asn Lys Ile Ala Lys Val Val Asp Asp Gly Leu Val Glu Gln
210 215 220
Gly Ala Lys Arg Leu Val Pro Val Gly Leu Gly Asp Asp Asp Gln Cys
225 230 235 240
Ile Glu Asp Asp Phe Thr Ala Trp Lys Glu Leu Val Trp Pro Glu Leu
245 250 255
Asp Gln Leu Leu Arg Asp Glu Asp Asp Thr Thr Val Ala Thr Pro Tyr
260 265 270
Thr Ala Ala Val Ala Glu Tyr Arg Val Val Phe His Glu Lys Pro Asp
275 280 285
Ala Leu Ser Glu Asp Tyr Ser Tyr Thr Asn Gly His Ala Val His Asp
290 295 300
Ala Gln His Pro Cys Arg Ser Asn Val Ala Val Lys Lys Glu Leu His
305 310 315 320
Ser Pro Glu Ser Asp Arg Ser Cys Thr His Leu Glu Phe Asp Ile Ser
325 330 335
Asn Thr Gly Leu Ser Tyr Glu Thr Gly Asp His Val Gly Val Tyr Cys
340 345 350
Glu Asn Leu Ser Glu Val Val Asn Asp Ala Glu Arg Leu Val Gly Leu
355 360 365
Pro Pro Asp Thr Tyr Phe Ser Ile His Thr Asp Ser Glu Asp Gly Ser
370 375 380
Pro Leu Gly Gly Ala Ser Leu Pro Pro Pro Phe Pro Pro Cys Thr Leu
385 390 395 400
Arg Lys Ala Leu Thr Cys Tyr Ala Asp Val Leu Ser Ser Pro Lys Lys
405 410 415
Ser Ala Leu Leu Ala Leu Ala Ala His Ala Thr Asp Pro Ser Glu Ala
420 425 430
Asp Arg Leu Lys Phe Leu Ala Ser Pro Ala Gly Lys Asp Glu Tyr Ser
435 440 445
Gln Trp Ile Val Ala Ser Gln Arg Ser Leu Leu Glu Val Met Glu Ala
450 455 460
Phe Pro Ser Ala Lys Pro Ser Leu Gly Val Phe Phe Ala Ser Val Ala
465 470 475 480
Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro Lys Met
485 490 495
Ala Pro Asp Arg Ile His Val Thr Cys Ala Leu Val Tyr Glu Lys Thr
500 505 510
Pro Ala Gly Arg Ile His Lys Gly Val Cys Ser Thr Trp Met Lys Asn
515 520 525
Ala Val Pro Met Thr Glu Ser Gln Asp Cys Ser Trp Ala Pro Ile Tyr
530 535 540
Val Arg Thr Ser Asn Phe Arg Leu Pro Ser Asp Pro Lys Val Pro Val
545 550 555 560
Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe Leu
565 570 575
Gln Glu Arg Leu Ala Leu Lys Glu Ala Gly Thr Asp Leu Gly Leu Ser
580 585 590
Ile Leu Phe Phe Gly Cys Arg Asn Arg Lys Val Asp Phe Ile Tyr Glu
595 600 605
Asn Glu Leu Asn Asn Phe Val Glu Thr Gly Ala Leu Ser Glu Leu Ile
610 615 620
Val Ala Phe Ser Arg Glu Gly Pro Thr Lys Glu Tyr Val Gln His Lys
625 630 635 640
Met Ser Glu Lys Ala Ser Asp Ile Trp Asn Leu Leu Ser Glu Gly Ala
645 650 655
Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Lys Asp Val His
660 665 670
Arg Thr Leu His Thr Ile Val Gln Glu Gln Gly Ser Leu Asp Ser Ser
675 680 685
Lys Ala Glu Leu Tyr Val Lys Asn Leu Gln Met Ser Gly Arg Tyr Leu
690 695 700
Arg Asp Val Trp
705
<210> 81
<211> 2127
<212> DNA
<213> 甜菊
<400> 81
atggcgcaat ctaattctgt gaaaatctct ccattggatc tggttacagc actctttagc 60
gggaaggtac tggatacaag taacgccagt gaaagcgggg aatccgcgat gctgccaaca 120
atcgcgatga tcatggaaaa tcgggaactg ctaatgattc tgacaacgtc tgtagcagtt 180
ttaatcggtt gcgttgtggt tctggtgtgg cgtcgatcat ccacgaaaaa gagcgcatta 240
gaaccgcctg ttatcgtagt accaaaaaga gttcaggagg aagaggtgga tgatgggaaa 300
aaaaaagtca ccgttttctt cgggacccaa actggtacgg cagaaggttt tgcgaaagca 360
ctggtcgaag aggcgaaagc ccgctatgag aaggcggttt ttaaggttat tgaccttgat 420
gactatgcgg cggacgatga tgaatacgaa gaaaaattaa agaaagaatc acttgccttt 480
ttttttttgg caacatacgg tgatggcgag ccgactgata acgcggcacg gttttacaaa 540
tggtttaccg aaggcgacgc gaagggggag tggttgaaca agttacagta cggtgtgttc 600
ggcttgggga accgccagta cgagcacttt aacaagatag ctaaagttgt cgatgatggt 660
ctggtagaac agggagcgaa gcgtctcgtg ccagtagggc tgggcgatga tgatcagtgt 720
atagaagatg attttacggc ttggaaggag ttagtttggc cggaactgga ccaactgctg 780
cgcgatgagg atgatacaac tgtcgctacc ccgtatacag cagcggtagc tgaatacagg 840
gtggtttttc acgagaaacc tgatgcgctg agtgaggact attcgtatac taacggccat 900
gccgttcacg atgcacagca cccgtgccgt tctaatgtcg ccgtaaaaaa ggaactgcat 960
agcccggagt cggaccgcag ttgtacccat ctggagtttg atatttcaaa taccgggctg 1020
agttacgaaa cgggcgatca cgttggcgtg tactgtgaga atctaagcga ggttgttaac 1080
gatgcagaac gactggtcgg tttgcctcca gatacttatt tctcgatcca cactgatagc 1140
gaagatggct ctccactcgg gggggcgagt ctgccgcccc cgtttccccc gtgtacgctg 1200
agaaaggccc ttacatgtta tgcagatgta ctctcttccc ccaaaaaaag tgccttgctc 1260
gcattagcag cccacgctac cgatccctcg gaagcagatc gtctgaaatt cttggcatcg 1320
ccggcgggca aagatgaata cagccaatgg atagttgcaa gtcagcgcag tctcttagaa 1380
gtgatggaag cgtttccgtc cgcaaagccg tccttaggtg tgtttttcgc gtccgtggca 1440
ccgcgtcttc agcctagata ttacagcatt agttcctctc caaaaatggc cccggaccgt 1500
attcacgtga cttgtgctct tgtatatgag aaaaccccgg caggtcgtat tcacaaaggc 1560
gtgtgcagca cctggatgaa gaatgcagtg ccgatgaccg aaagccagga ttgttcatgg 1620
gcgccaatct atgtcaggac aagtaatttc agacttccgt ctgatcctaa agttccagtc 1680
ataatgattg gccccggcac gggactggct ccttttcgtg gtttcctgca agagcgcttg 1740
gcactgaaag aagcaggcac tgacctggga ctgtccatcc tgttctttgg gtgccgtaat 1800
cgtaaggtcg attttatata tgaaaatgaa ttgaacaact ttgtagaaac aggcgcatta 1860
tccgaactga tcgtagcttt tagtagagag gggccgacga aagaatatgt acaacacaag 1920
atgtctgaga aggcttcgga tatatggaac ctgctctctg agggtgccta tctgtacgtt 1980
tgcggtgatg ccaaaggaat ggccaaagat gtgcaccgca ctttacatac aatcgtccaa 2040
gagcagggta gcttggactc atctaaagct gaactgtatg tgaagaactt acagatgagc 2100
gggcgctatt tgcgagatgt ttggtaa 2127
<210> 82
<211> 706
<212> PRT
<213> 香叶天竺葵
<400> 82
Met Ala Gln Ser Ser Ser Gly Ser Met Ser Pro Phe Asp Phe Met Thr
1 5 10 15
Ala Ile Ile Lys Gly Lys Met Glu Pro Ser Asn Ala Ser Leu Gly Ala
20 25 30
Ala Gly Glu Val Thr Ala Met Ile Leu Asp Asn Arg Glu Leu Val Met
35 40 45
Ile Leu Thr Thr Ser Ile Ala Val Leu Ile Gly Cys Val Val Val Phe
50 55 60
Ile Trp Arg Arg Ser Ser Ser Gln Thr Pro Thr Ala Val Gln Pro Leu
65 70 75 80
Lys Pro Leu Leu Ala Lys Glu Thr Glu Ser Glu Val Asp Asp Gly Lys
85 90 95
Gln Lys Val Thr Ile Phe Phe Gly Thr Gln Thr Gly Thr Ala Glu Gly
100 105 110
Phe Ala Lys Ala Leu Ala Asp Glu Ala Lys Ala Arg Tyr Asp Lys Val
115 120 125
Thr Phe Lys Val Val Asp Leu Asp Asp Tyr Ala Ala Asp Asp Glu Glu
130 135 140
Tyr Glu Glu Lys Leu Lys Lys Glu Thr Leu Ala Phe Phe Phe Leu Ala
145 150 155 160
Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg Phe Tyr Lys
165 170 175
Trp Phe Leu Glu Gly Lys Glu Arg Gly Glu Trp Leu Gln Asn Leu Lys
180 185 190
Phe Gly Val Phe Gly Leu Gly Asn Arg Gln Tyr Glu His Phe Asn Lys
195 200 205
Ile Ala Ile Val Val Asp Glu Ile Leu Ala Glu Gln Gly Gly Lys Arg
210 215 220
Leu Ile Ser Val Gly Leu Gly Asp Asp Asp Gln Cys Ile Glu Asp Asp
225 230 235 240
Phe Thr Ala Trp Arg Glu Ser Leu Trp Pro Glu Leu Asp Gln Leu Leu
245 250 255
Arg Asp Glu Asp Asp Thr Thr Val Ser Thr Pro Tyr Thr Ala Ala Val
260 265 270
Leu Glu Tyr Arg Val Val Phe His Asp Pro Ala Asp Ala Pro Thr Leu
275 280 285
Glu Lys Ser Tyr Ser Asn Ala Asn Gly His Ser Val Val Asp Ala Gln
290 295 300
His Pro Leu Arg Ala Asn Val Ala Val Arg Arg Glu Leu His Thr Pro
305 310 315 320
Ala Ser Asp Arg Ser Cys Thr His Leu Glu Phe Asp Ile Ser Gly Thr
325 330 335
Gly Ile Ala Tyr Glu Thr Gly Asp His Val Gly Val Tyr Cys Glu Asn
340 345 350
Leu Ala Glu Thr Val Glu Glu Ala Leu Glu Leu Leu Gly Leu Ser Pro
355 360 365
Asp Thr Tyr Phe Ser Val His Ala Asp Lys Glu Asp Gly Thr Pro Leu
370 375 380
Ser Gly Ser Ser Leu Pro Pro Pro Phe Pro Pro Cys Thr Leu Arg Thr
385 390 395 400
Ala Leu Thr Leu His Ala Asp Leu Leu Ser Ser Pro Lys Lys Ser Ala
405 410 415
Leu Leu Ala Leu Ala Ala His Ala Ser Asp Pro Thr Glu Ala Asp Arg
420 425 430
Leu Arg His Leu Ala Ser Pro Ala Gly Lys Asp Glu Tyr Ala Gln Trp
435 440 445
Ile Val Ala Ser Gln Arg Ser Leu Leu Glu Val Met Ala Glu Phe Pro
450 455 460
Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala Ser Val Ala Pro Arg
465 470 475 480
Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro Arg Ile Ala Pro
485 490 495
Ser Arg Ile His Val Thr Cys Ala Leu Val Tyr Glu Lys Thr Pro Thr
500 505 510
Gly Arg Val His Lys Gly Val Cys Ser Thr Trp Met Lys Asn Ser Val
515 520 525
Pro Ser Glu Lys Ser Asp Glu Cys Ser Trp Ala Pro Ile Phe Val Arg
530 535 540
Gln Ser Asn Phe Lys Leu Pro Ala Asp Ala Lys Val Pro Ile Ile Met
545 550 555 560
Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe Leu Gln Glu
565 570 575
Arg Leu Ala Leu Lys Glu Ala Gly Thr Glu Leu Gly Pro Ser Ile Leu
580 585 590
Phe Phe Gly Cys Arg Asn Ser Lys Met Asp Tyr Ile Tyr Glu Asp Glu
595 600 605
Leu Asp Asn Phe Val Gln Asn Gly Ala Leu Ser Glu Leu Val Leu Ala
610 615 620
Phe Ser Arg Glu Gly Pro Thr Lys Glu Tyr Val Gln His Lys Met Met
625 630 635 640
Glu Lys Ala Ser Asp Ile Trp Asn Leu Ile Ser Gln Gly Ala Tyr Leu
645 650 655
Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Arg Asp Val His Arg Thr
660 665 670
Leu His Thr Ile Ala Gln Glu Gln Gly Ser Leu Asp Ser Ser Lys Ala
675 680 685
Glu Ser Met Val Lys Asn Leu Gln Met Ser Gly Arg Tyr Leu Arg Asp
690 695 700
Val Trp
705
<210> 83
<211> 2121
<212> DNA
<213> 香叶天竺葵
<400> 83
atggcgcagt caagcagtgg atcaatgagc cctttcgatt ttatgaccgc tataataaaa 60
ggtaaaatgg agccaagtaa tgcgtcttta ggagcggcag gtgaagtcac agcaatgata 120
cttgataata gggagctggt tatgattctg acgaccagca ttgcagtgct gatcggttgc 180
gttgtagtgt tcatttggcg tcgttcatca tcccagaccc ctaccgcggt gcagccatta 240
aaaccacttt tagcgaagga aacagagagc gaagtagacg atggcaaaca gaaagtaact 300
atcttttttg gtactcaaac tggaaccgct gaaggtttcg cgaaagcgct cgcagacgag 360
gccaaagcac ggtatgataa agtcactttt aaagtggttg atctggacga ttatgccgca 420
gatgacgaag aatatgaaga aaagctgaag aaggaaacgt tagcattctt ttttcttgcg 480
acgtatggag atggtgaacc tactgacaat gctgcaaggt tttataagtg gtttctggaa 540
ggtaaagaac gcggagaatg gcttcagaat ctaaaatttg gtgtgtttgg tttaggcaac 600
cgtcagtatg agcatttcaa taaaattgcc attgtggttg atgaaatcct tgcagaacaa 660
ggtggtaagc gtctcatttc agttggcctg ggcgatgatg atcagtgtat tgaggatgac 720
tttactgcct ggagggaatc gctgtggccg gagctagatc agttattacg cgatgaggat 780
gatactacgg tttctacgcc gtataccgcc gcggtgctgg aatacagagt cgtttttcat 840
gatccggcag atgccccaac tctcgaaaaa agctacagca acgctaacgg gcatagcgtg 900
gttgatgcgc aacatccgtt acgggcaaat gttgccgtca gacgggagtt gcatactcct 960
gcgtctgacc gctcatgtac ccatctggaa tttgatatat ctggtactgg catcgcatac 1020
gagacgggtg atcatgttgg cgtgtattgc gagaatcttg cagagacggt agaagaagcg 1080
ttggaacttt taggtctttc cccggataca tacttctccg tacacgcaga taaagaggac 1140
ggtacgcctc tctcaggctc atctctcccg ccgccatttc caccgtgcac tttacgtaca 1200
gccctgacgt tacatgctga cttactgtct tccccaaaga aatctgcatt gctcgcgctt 1260
gcagctcatg catcagaccc cactgaagct gatcgattgc ggcacctagc aagccctgcg 1320
ggcaaggacg aatacgctca gtggatagtt gctagtcagc gttccttgct ggaagtgatg 1380
gcggagttcc ccagtgccaa gcccccgctg ggagtattct tcgcatcggt tgctccaaga 1440
ttgcagcccc ggtactactc tatttcttct tccccaagaa tagcgccgtc tcgcatacac 1500
gtgacctgcg cgttagttta cgagaaaaca cctacgggca gagtacacaa aggagtttgc 1560
tccacttgga tgaaaaactc agtgccctct gaaaagagtg atgaatgttc atgggcacca 1620
attttcgtac gacagagcaa ctttaaactg cccgccgatg cgaaagtacc cataattatg 1680
attggtccag gaacgggtct ggcaccattt cgcggcttcc tccaggagcg gcttgcattg 1740
aaagaagcag ggacagaact gggaccttcc atattatttt ttgggtgccg caacagcaaa 1800
atggactata tatacgagga tgaactggat aattttgtac agaatggggc actctctgaa 1860
ctcgtgttgg cgttctcacg tgaaggtcct accaaagagt atgtgcaaca taagatgatg 1920
gagaaagcct cagatatatg gaaccttatt tcacagggag cttatttgta tgtgtgcggg 1980
gacgcaaaag gcatggcgcg tgatgtgcac cgcacgttac ataccatcgc tcaggagcag 2040
gggtcattag atagctcaaa agcagagagt atggtgaaga atcttcagat gtcaggcaga 2100
tacctgcgcg atgtctggta a 2121
<210> 84
<211> 277
<212> PRT
<213> 红串红球菌
<400> 84
Met Ala Arg Val Glu Gly Gln Val Ala Leu Ile Thr Gly Ala Ala Arg
1 5 10 15
Gly Gln Gly Arg Ser His Ala Ile Lys Leu Ala Glu Glu Gly Ala Asp
20 25 30
Val Ile Leu Val Asp Val Pro Asn Asp Val Val Asp Ile Gly Tyr Pro
35 40 45
Leu Gly Thr Ala Asp Glu Leu Asp Gln Thr Ala Lys Asp Val Glu Asn
50 55 60
Leu Gly Arg Lys Ala Ile Val Ile His Ala Asp Val Arg Asp Leu Glu
65 70 75 80
Ser Leu Thr Ala Glu Val Asp Arg Ala Val Ser Thr Leu Gly Arg Leu
85 90 95
Asp Ile Val Ser Ala Asn Ala Gly Ile Ala Ser Val Pro Phe Leu Ser
100 105 110
His Asp Ile Pro Asp Asn Thr Trp Arg Gln Met Ile Asp Ile Asn Leu
115 120 125
Thr Gly Val Trp His Thr Ala Lys Val Ala Val Pro His Ile Leu Ala
130 135 140
Gly Glu Arg Gly Gly Ser Ile Val Leu Thr Ser Ser Ala Ala Gly Leu
145 150 155 160
Lys Gly Tyr Ala Gln Ile Ser His Tyr Ser Ala Ala Lys His Gly Val
165 170 175
Val Gly Leu Met Arg Ser Leu Ala Leu Glu Leu Ala Pro His Arg Val
180 185 190
Arg Val Asn Ser Leu His Pro Thr Gln Val Asn Thr Pro Met Ile Gln
195 200 205
Asn Glu Gly Thr Tyr Arg Ile Phe Ser Pro Asp Leu Glu Asn Pro Thr
210 215 220
Arg Glu Asp Phe Glu Ile Ala Ser Thr Thr Thr Asn Ala Leu Pro Ile
225 230 235 240
Pro Trp Val Glu Ser Val Asp Val Ser Asn Ala Leu Leu Phe Leu Val
245 250 255
Ser Glu Asp Ala Arg Tyr Ile Thr Gly Ala Ala Ile Pro Val Asp Ala
260 265 270
Gly Thr Thr Leu Lys
275
<210> 85
<211> 834
<212> DNA
<213> 红串红球菌
<400> 85
atggcccgtg tggaaggtca agtggctctg attaccggcg ctgctcgtgg tcaaggtcgt 60
agtcatgcga ttaaactggc ggaagaaggc gcggatgtga ttctggttga cgtcccgaat 120
gatgtggttg acatcggcta tccgctgggt acggcagatg aactggacca gaccgctaaa 180
gatgttgaaa acctgggtcg taaggcgatt gtcatccatg ccgatgtgcg cgacctggaa 240
tcactgacgg cagaagtgga tcgtgctgtt agtaccctgg gccgcctgga cattgtttcc 300
gcaaatgctg gtatcgccag cgtcccgttt ctgtctcacg atattccgga caacacctgg 360
cgtcagatga ttgatatcaa tctgacgggc gtctggcata ccgcgaaagt ggccgttccg 420
cacattctgg ccggtgaacg cggcggttcc atcgttctga ccagctctgc ggccggcctg 480
aaaggttatg cacaaattag tcattactcc gcagctaagc acggcgtcgt gggtctgatg 540
cgttcactgg cactggaact ggctccgcat cgtgtccgcg tgaactcgct gcacccgacg 600
caggtgaaca ccccgatgat tcaaaatgaa ggcacgtatc gtatctttag cccggatctg 660
gaaaacccga cccgcgaaga cttcgaaatt gcgtctacca cgaccaatgc cctgccgatc 720
ccgtgggtgg aatcagttga tgtctcgaac gcactgctgt tcctggttag cgaagacgca 780
cgttacatta ccggtgcagc aatcccggtg gatgccggta cgaccctgaa gtaa 834
<210> 86
<211> 280
<212> PRT
<213> 甜橙
<400> 86
Met Ala Thr Pro Pro Ile Ser Ser Leu Ile Ser Gln Arg Leu Leu Gly
1 5 10 15
Lys Val Ala Leu Val Thr Gly Gly Ala Ser Gly Ile Gly Glu Gly Ile
20 25 30
Val Arg Leu Phe His Arg His Gly Ala Lys Val Cys Phe Val Asp Val
35 40 45
Gln Asp Glu Leu Gly Tyr Arg Leu Gln Glu Ser Leu Val Gly Asp Lys
50 55 60
Asp Ser Asn Ile Phe Tyr Ser His Cys Asp Val Thr Val Glu Asp Asp
65 70 75 80
Val Arg Arg Ala Val Asp Leu Thr Val Thr Lys Phe Gly Thr Leu Asp
85 90 95
Ile Met Val Asn Asn Ala Gly Ile Ser Gly Thr Pro Ser Ser Asp Ile
100 105 110
Arg Asn Val Asp Val Ser Glu Phe Glu Lys Val Phe Asp Ile Asn Val
115 120 125
Lys Gly Val Phe Met Gly Met Lys Tyr Ala Ala Ser Val Met Ile Pro
130 135 140
Arg Lys Gln Gly Ser Ile Ile Ser Leu Gly Ser Val Gly Ser Val Ile
145 150 155 160
Gly Gly Ile Gly Pro His His Tyr Ile Ser Ser Lys His Ala Val Val
165 170 175
Gly Leu Thr Arg Ser Ile Ala Ala Glu Leu Gly Gln His Gly Ile Arg
180 185 190
Val Asn Cys Val Ser Pro Tyr Ala Val Pro Thr Asn Leu Ala Val Ala
195 200 205
His Leu Pro Glu Asp Glu Arg Thr Glu Asp Met Phe Thr Gly Phe Arg
210 215 220
Glu Phe Ala Lys Lys Asn Ala Asn Leu Gln Gly Val Glu Leu Thr Val
225 230 235 240
Glu Asp Val Ala Asn Ala Val Leu Phe Leu Ala Ser Glu Asp Ala Arg
245 250 255
Tyr Ile Ser Gly Asp Asn Leu Ile Val Asp Gly Gly Phe Thr Arg Val
260 265 270
Asn His Ser Phe Arg Val Phe Arg
275 280
<210> 87
<211> 843
<212> DNA
<213> 甜橙
<400> 87
atggcaacgc cgccgatttc atccctgatt tcacaacgcc tgctgggtaa agtcgccctg 60
gtcacgggtg gtgcttctgg tattggtgaa ggcatcgtgc gtctgtttca ccgtcatggc 120
gcgaaagtgt gctttgttga tgtgcaggat gaactgggct accgtctgca agaatctctg 180
gtgggcgaca aagattcaaa catcttttat agccactgtg atgtcaccgt ggaagacgat 240
gtgcgccgcg ctgtggatct gaccgtgacg aaattcggta cgctggatat tatggtcaat 300
aacgcgggta ttagtggcac cccgtccagc gatattcgta atgttgatgt gagcgaattt 360
gaaaaagtgt ttgatattaa cgtcaaaggc gtgtttatgg gcatgaaata tgccgcgagc 420
gtgatgatcc cgcgcaaaca gggtagcatc atctccctgg gttctgttgg cagcgtgatc 480
ggtggcattg gcccgcacca ttatatcagc tcgaaacatg cggttgtggg cctgacccgc 540
agcattgcag cggaactggg tcagcatggc attcgtgtga actgtgtgtc tccgtatgcg 600
gttccgacca atctggcggt tgcacacctg ccggaagatg aacgtaccga agatatgttt 660
acgggcttcc gtgaatttgc gaaaaagaat gccaacctgc aaggtgttga actgaccgtc 720
gaagatgtgg ccaatgcggt gctgtttctg gccagcgaag atgcacgcta cattagcggt 780
gataatctga tcgttgatgg cggctttacc cgtgtgaacc actcatttcg tgttttccgt 840
taa 843
<210> 88
<211> 262
<212> PRT
<213> 甜橙
<400> 88
Met Ser Lys Pro Arg Leu Gln Gly Lys Val Ala Ile Ile Met Gly Ala
1 5 10 15
Ala Ser Gly Ile Gly Glu Ala Thr Ala Lys Leu Phe Ala Glu His Gly
20 25 30
Ala Phe Val Ile Ile Ala Asp Ile Gln Asp Glu Leu Gly Asn Gln Val
35 40 45
Val Ser Ser Ile Gly Pro Glu Lys Ala Ser Tyr Arg His Cys Asp Val
50 55 60
Arg Asp Glu Lys Gln Val Glu Glu Thr Val Ala Tyr Ala Ile Glu Lys
65 70 75 80
Tyr Gly Ser Leu Asp Ile Met Tyr Ser Asn Ala Gly Val Ala Gly Pro
85 90 95
Val Gly Thr Ile Leu Asp Leu Asp Met Ala Gln Phe Asp Arg Thr Ile
100 105 110
Ala Thr Asn Leu Ala Gly Ser Val Met Ala Val Lys Tyr Ala Ala Arg
115 120 125
Val Met Val Ala Asn Lys Ile Arg Gly Ser Ile Ile Cys Thr Thr Ser
130 135 140
Thr Ala Ser Thr Val Gly Gly Ser Gly Pro His Ala Tyr Thr Ile Ser
145 150 155 160
Lys His Gly Leu Leu Gly Leu Val Arg Ser Ala Ala Ser Glu Leu Gly
165 170 175
Lys His Gly Ile Arg Val Asn Cys Val Ser Pro Phe Gly Val Ala Thr
180 185 190
Pro Phe Ser Ala Gly Thr Ile Asn Asp Val Glu Gly Phe Val Cys Lys
195 200 205
Val Ala Asn Leu Lys Gly Ile Val Leu Lys Ala Lys His Val Ala Glu
210 215 220
Ala Ala Leu Phe Leu Ala Ser Asp Glu Ser Ala Tyr Val Ser Gly His
225 230 235 240
Asp Leu Val Val Asp Gly Gly Phe Thr Ala Val Thr Asn Val Met Ser
245 250 255
Met Leu Glu Gly His Gly
260
<210> 89
<211> 789
<212> DNA
<213> 甜橙
<400> 89
atgtcaaaac cgcgtctgca aggcaaagtg gctattatta tgggtgctgc gtctggcatc 60
ggtgaagcta cggctaaact gttcgctgaa catggcgcat ttgtgattat cgctgatatt 120
caggacgaac tgggcaacca ggtggttagc tctatcggcc cggaaaaagc gtcttatcgt 180
cactgcgatg tgcgtgatga aaaacaggtt gaagaaaccg tcgcgtatgc gattgaaaaa 240
tacggcagcc tggatattat gtactccaat gcgggcgtgg ccggtccggt tggcacgatt 300
ctggatctgg acatggccca attcgaccgt accatcgcaa cgaacctggc tggtagtgtt 360
atggcagtca aatatgcggc ccgtgtcatg gtggcgaata aaattcgcgg tagcattatc 420
tgtaccacga gtaccgcctc cacggtgggc ggcagcggcc cgcacgccta taccattagc 480
aaacacggtc tgctgggcct ggttcgttca gcagcttcgg aactgggtaa acatggcatc 540
cgcgtgaact gcgttagccc gtttggtgtt gcgaccccgt tctctgccgg tacgattaac 600
gatgtcgaag gctttgtctg taaagtggcg aatctgaaag gcatcgtcct gaaagcgaag 660
catgtggccg aagcggccct gttcctggca agcgatgaat ctgcttatgt gagcggtcac 720
gacctggtgg tggatggtgg ctttacggca gttacgaatg tcatgtcaat gctggaaggt 780
cacggctaa 789
<210> 90
<211> 263
<212> PRT
<213> 甜橙
<400> 90
Met Ser Asn Pro Arg Met Glu Gly Lys Val Ala Leu Ile Thr Gly Ala
1 5 10 15
Ala Ser Gly Ile Gly Glu Ala Ala Val Arg Leu Phe Ala Glu His Gly
20 25 30
Ala Phe Val Val Ala Ala Asp Val Gln Asp Glu Leu Gly His Gln Val
35 40 45
Ala Ala Ser Val Gly Thr Asp Gln Val Cys Tyr His His Cys Asp Val
50 55 60
Arg Asp Glu Lys Gln Val Glu Glu Thr Val Arg Tyr Thr Leu Glu Lys
65 70 75 80
Tyr Gly Lys Leu Asp Val Leu Phe Ser Asn Ala Gly Ile Met Gly Pro
85 90 95
Leu Thr Gly Ile Leu Glu Leu Asp Leu Thr Gly Phe Gly Asn Thr Met
100 105 110
Ala Thr Asn Val Cys Gly Val Ala Ala Thr Ile Lys His Ala Ala Arg
115 120 125
Ala Met Val Asp Lys Asn Ile Arg Gly Ser Ile Ile Cys Thr Thr Ser
130 135 140
Val Ala Ser Ser Leu Gly Gly Thr Ala Pro His Ala Tyr Thr Thr Ser
145 150 155 160
Lys His Ala Leu Val Gly Leu Val Arg Thr Ala Cys Ser Glu Leu Gly
165 170 175
Ala Tyr Gly Ile Arg Val Asn Cys Ile Ser Pro Phe Gly Val Ala Thr
180 185 190
Pro Leu Ser Cys Thr Ala Tyr Asn Leu Arg Pro Asp Glu Val Glu Ala
195 200 205
Asn Ser Cys Ala Leu Ala Asn Leu Lys Gly Ile Val Leu Lys Ala Lys
210 215 220
His Ile Ala Glu Ala Ala Leu Phe Leu Ala Ser Asp Glu Ser Ala Tyr
225 230 235 240
Ile Ser Gly His Asn Leu Ala Val Asp Gly Gly Phe Thr Val Val Asn
245 250 255
His Ser Ser Ser Ser Ala Thr
260
<210> 91
<211> 792
<212> DNA
<213> 甜橙
<400> 91
atgtcaaacc cgcgtatgga aggcaaagtc gcactgatta cgggcgcagc atctggtatc 60
ggtgaagcag cagtccgtct gttcgctgaa catggtgcgt ttgtcgtggc ggcagatgtg 120
caagacgaac tgggtcatca ggtggcggca tctgtgggta cggaccaggt gtgctaccat 180
cactgcgatg tgcgcgatga aaaacaagtg gaagaaaccg tgcgttatac cctggaaaaa 240
tacggcaaac tggatgtcct gttttcaaac gcgggcatca tgggtccgct gaccggcatt 300
ctggaactgg atctgaccgg cttcggtaac acgatggcaa ccaatgtgtg cggtgttgcc 360
gcgaccatta aacacgcggc acgcgcaatg gtggacaaaa acattcgcgg tagcattatc 420
tgcaccacca gcgtggcttc atcgctgggt ggcaccgcgc cgcacgcata caccacgagc 480
aaacacgcac tggtgggcct ggttcgtacg gcatgttcgg aactgggtgc gtatggcatt 540
cgtgtgaact gtatcagccc gtttggtgtt gcaacgccgc tgtcttgcac ggcctataac 600
ctgcgcccgg atgaagtgga agcaaactca tgcgcactgg cgaacctgaa aggtattgtg 660
ctgaaagcga aacacattgc ggaagcagcg ctgttcctgg cgagcgatga aagcgcgtat 720
attagcggtc ataatctggc ggtggatggt ggtttcacgg tggttaatca ttcaagttcg 780
tcggcgacgt aa 792
<210> 92
<211> 278
<212> PRT
<213> 甜橙
<400> 92
Met Thr Thr Ala Gly Ser Arg Asp Ser Pro Leu Val Ala Gln Arg Leu
1 5 10 15
Leu Gly Lys Val Ala Leu Val Thr Gly Gly Ala Thr Gly Ile Gly Glu
20 25 30
Ser Ile Val Arg Leu Phe His Lys His Gly Ala Lys Val Cys Val Val
35 40 45
Asp Ile Asn Asp Asp Leu Gly Gln His Leu Cys Gln Thr Leu Gly Pro
50 55 60
Thr Thr Arg Phe Ile His Gly Asp Val Ala Ile Glu Asp Asp Val Ser
65 70 75 80
Arg Ala Val Asp Phe Thr Val Ala Asn Phe Gly Thr Leu Asp Ile Met
85 90 95
Val Asn Asn Ala Gly Met Gly Gly Pro Pro Cys Pro Asp Ile Arg Glu
100 105 110
Phe Pro Ile Ser Thr Phe Glu Lys Val Phe Asp Ile Asn Thr Lys Gly
115 120 125
Thr Phe Ile Gly Met Lys His Ala Ala Arg Val Met Ile Pro Ser Lys
130 135 140
Lys Gly Ser Ile Val Ser Ile Ser Ser Val Thr Ser Ala Ile Gly Gly
145 150 155 160
Ala Gly Pro His Ala Tyr Thr Ala Ser Lys His Ala Val Leu Gly Leu
165 170 175
Thr Lys Ser Val Ala Ala Glu Leu Gly Gln His Gly Ile Arg Val Asn
180 185 190
Cys Val Ser Pro Tyr Ala Ile Leu Thr Asn Leu Ala Leu Ala His Leu
195 200 205
His Glu Asp Glu Arg Thr Asp Asp Ala Arg Ala Gly Phe Arg Ala Phe
210 215 220
Ile Gly Lys Asn Ala Asn Leu Gln Gly Val Asp Leu Val Glu Asp Asp
225 230 235 240
Val Ala Asn Ala Val Leu Phe Leu Ala Ser Asp Asp Ala Arg Tyr Ile
245 250 255
Ser Gly Asp Asn Leu Phe Val Asp Gly Gly Phe Thr Cys Thr Asn His
260 265 270
Ser Leu Arg Val Phe Arg
275
<210> 93
<211> 837
<212> DNA
<213> 甜橙
<400> 93
atgacgacgg ctggttcgcg tgacagtccg ctggtcgctc aacgcctgct gggcaaagtg 60
gccctggtta cgggtggtgc taccggcatt ggtgaaagta tcgtgcgtct gtttcataaa 120
cacggcgcga aagtttgcgt ggttgatatt aacgatgacc tgggccagca tctgtgtcaa 180
accctgggtc cgaccacccg tttcattcac ggcgatgttg caatcgaaga tgatgtgagc 240
cgtgcggttg attttaccgt cgccaacttc ggtacgctgg acattatggt gaacaatgcc 300
ggtatgggcg gtccgccgtg cccggatatt cgtgaatttc cgatctcgac ctttgaaaaa 360
gtcttcgaca ttaacaccaa aggcacgttc atcggtatga aacatgcggc ccgcgtgatg 420
attccgagta aaaaaggtag tattgtcagc attagcagcg tgaccagcgc gattggcggc 480
gcgggtccgc acgcctatac cgcgagcaaa catgcggtgc tgggcctgac gaaatctgtc 540
gcggcggaac tgggccagca cggtattcgt gtcaactgtg tgtctccgta cgccatcctg 600
accaatctgg cgctggccca tctgcacgaa gatgaacgta cggatgacgc gcgtgcgggt 660
tttcgtgcat tcattggtaa aaacgctaat ctgcaaggtg ttgatctggt cgaagatgac 720
gtggcgaatg ccgttctgtt tctggcatca gatgacgctc gctatatctc gggcgataac 780
ctgttcgtgg atggcggctt cacctgtacc aatcactccc tgcgtgtgtt ccgttaa 837
<210> 94
<211> 280
<212> PRT
<213> 葡萄
<400> 94
Met Ala Ala Thr Ser Ile Asp Asn Ser Pro Leu Pro Ser Gln Arg Leu
1 5 10 15
Leu Gly Lys Val Ala Leu Val Thr Gly Gly Ala Thr Gly Ile Gly Glu
20 25 30
Ser Ile Val Arg Leu Phe Leu Lys Gln Gly Ala Lys Val Cys Ile Val
35 40 45
Asp Val Gln Asp Asp Leu Gly Gln Lys Leu Cys Asp Thr Leu Gly Gly
50 55 60
Asp Pro Asn Val Ser Phe Phe His Cys Asp Val Thr Ile Glu Asp Asp
65 70 75 80
Val Cys His Ala Val Asp Phe Thr Val Thr Lys Phe Gly Thr Leu Asp
85 90 95
Ile Met Val Asn Asn Ala Gly Met Ala Gly Pro Pro Cys Ser Asp Ile
100 105 110
Arg Asn Val Glu Val Ser Met Phe Glu Lys Val Phe Asp Val Asn Val
115 120 125
Lys Gly Val Phe Leu Gly Met Lys His Ala Ala Arg Ile Met Ile Pro
130 135 140
Leu Lys Lys Gly Thr Ile Ile Ser Leu Cys Ser Val Ser Ser Ala Ile
145 150 155 160
Ala Gly Val Gly Pro His Ala Tyr Thr Gly Ser Lys Cys Ala Val Ala
165 170 175
Gly Leu Thr Gln Ser Val Ala Ala Glu Met Gly Gly His Gly Ile Arg
180 185 190
Val Asn Cys Ile Ser Pro Tyr Ala Ile Ala Thr Gly Leu Ala Leu Ala
195 200 205
His Leu Pro Glu Asp Glu Arg Thr Glu Asp Ala Met Ala Gly Phe Arg
210 215 220
Ala Phe Val Gly Lys Asn Ala Asn Leu Gln Gly Val Glu Leu Thr Val
225 230 235 240
Asp Asp Val Ala His Ala Ala Val Phe Leu Ala Ser Asp Glu Ala Arg
245 250 255
Tyr Ile Ser Gly Leu Asn Leu Met Leu Asp Gly Gly Phe Ser Cys Thr
260 265 270
Asn His Ser Leu Arg Val Phe Arg
275 280
<210> 95
<211> 843
<212> DNA
<213> 葡萄
<400> 95
atggccgcaa cgagcattga taattctccg ctgccgagtc aacgtctgct gggtaaagtc 60
gcactggtca cgggtggcgc tacgggtatt ggcgaaagca tcgtgcgtct gtttctgaaa 120
cagggtgcta aagtgtgcat tgtggacgtg caagatgacc tgggccagaa actgtgcgat 180
accctgggtg gcgatccgaa cgttagcttt ttccattgcg atgtgaccat cgaagatgat 240
gtgtgccatg cagttgattt taccgtcacg aaattcggca ccctggatat tatggtgaac 300
aatgcgggta tggcaggtcc gccgtgctcg gacatccgca acgtggaagt cagcatgttt 360
gaaaaagtgt ttgatgtgaa tgtgaaaggt gttttcctgg gcatgaaaca tgcagcccgc 420
attatgattc cgctgaaaaa aggcaccatt atcagcctgt gttcagtttc cagcgctatc 480
gcgggcgttg gtccgcacgc atatacgggt agcaaatgcg cagtggcggg tctgacgcaa 540
tcggtcgcag cagaaatggg tggtcatggc attcgcgtga actgtatcag cccgtatgca 600
atcgcaacgg gtctggcgct ggcacatctg ccggaagatg aacgcacgga agatgcaatg 660
gcgggtttcc gtgcgtttgt gggtaaaaat gcgaatctgc aaggtgttga actgaccgtg 720
gatgatgtgg cgcacgcagc ggtgtttctg gcaagcgatg aagcacgtta catctctggt 780
ctgaatctga tgctggacgg cggcttttcg tgtaccaacc actcgctgcg tgtctttcgc 840
taa 843
<210> 96
<211> 279
<212> PRT
<213> 葡萄
<400> 96
Met Ser Thr Ala Ser Ser Gly Asp Val Ser Leu Leu Ser Gln Arg Leu
1 5 10 15
Val Gly Lys Val Ala Leu Ile Thr Gly Gly Ala Thr Gly Ile Gly Glu
20 25 30
Ser Ile Ala Arg Leu Phe Tyr Arg His Gly Ala Lys Val Cys Ile Val
35 40 45
Asp Ile Gln Asp Asn Pro Gly Gln Asn Leu Cys Arg Glu Leu Gly Thr
50 55 60
Asp Asp Ala Cys Phe Phe His Cys Asp Val Ser Ile Glu Ile Asp Val
65 70 75 80
Ile Arg Ala Val Asp Phe Val Val Asn Arg Phe Gly Lys Leu Asp Ile
85 90 95
Met Val Asn Asn Ala Gly Ile Ala Asp Pro Pro Cys Pro Asp Ile Arg
100 105 110
Asn Thr Asp Leu Ser Ile Phe Glu Lys Val Phe Asp Val Asn Val Lys
115 120 125
Gly Thr Phe Gln Cys Met Lys His Ala Ala Arg Val Met Val Pro Gln
130 135 140
Lys Lys Gly Ser Ile Ile Ser Leu Thr Ser Val Ala Ser Val Ile Gly
145 150 155 160
Gly Ala Gly Pro His Ala Tyr Thr Gly Ser Lys His Ala Val Leu Gly
165 170 175
Leu Thr Lys Ser Val Ala Ala Glu Leu Gly Leu His Gly Ile Arg Val
180 185 190
Asn Cys Val Ser Pro Tyr Ala Val Pro Thr Gly Met Pro Leu Ala His
195 200 205
Leu Pro Glu Ser Glu Lys Thr Glu Asp Ala Met Met Gly Met Arg Ala
210 215 220
Phe Val Gly Arg Asn Ala Asn Leu Gln Gly Ile Glu Leu Thr Val Asp
225 230 235 240
Asp Val Ala Asn Ser Val Val Phe Leu Ala Ser Asp Glu Ala Arg Tyr
245 250 255
Val Ser Gly Leu Asn Leu Met Leu Asp Gly Gly Phe Ser Cys Val Asn
260 265 270
His Ser Leu Arg Val Phe Arg
275
<210> 97
<211> 840
<212> DNA
<213> 葡萄
<400> 97
atgtcaacgg cttcctcggg tgatgtgtcg ctgctgtcgc aacgcctggt cggtaaagtc 60
gctctgatta cgggtggtgc aacgggcatt ggtgaatcga ttgcgcgtct gttttaccgt 120
catggtgcga aagtgtgcat cgttgacatt caggataatc cgggtcaaaa cctgtgccgt 180
gaactgggca ccgacgatgc gtgcttcttt cactgcgatg tgagcattga aatcgatgtg 240
attcgtgctg ttgactttgt ggttaaccgc tttggtaaac tggacattat ggttaataac 300
gcgggcatcg cagatccgcc gtgcccggat attcgcaaca ccgatctgag catttttgaa 360
aaagtgttcg atgtgaacgt gaaaggcacc tttcagtgta tgaaacacgc agcgcgcgtt 420
atggtgccgc agaaaaaagg tagcattatc agcctgacct cggtggcgag cgtgattggt 480
ggcgcgggtc cgcacgccta tacgggtagc aaacacgcgg ttctgggtct gacgaaaagc 540
gttgcggcag aactgggtct gcatggtatt cgcgtgaact gtgtgagtcc gtatgcagtt 600
ccgacgggta tgccgctggc acatctgccg gaatcggaaa aaaccgaaga tgcgatgatg 660
ggtatgcgtg catttgtggg tcgtaatgcc aacctgcaag gtattgaact gaccgtggac 720
gatgtcgcaa atagcgtcgt gtttctggcg tcggatgaag cgcgttatgt tagcggtctg 780
aacctgatgc tggacggcgg cttctcgtgt gtcaaccact cgctgcgtgt gtttcgctaa 840
<210> 98
<211> 280
<212> PRT
<213> 甜橙
<400> 98
Met Ser Asn Ser Asn Ser Thr Asp Ser Ser Pro Ala Val Gln Arg Leu
1 5 10 15
Val Gly Arg Val Ala Leu Ile Thr Gly Gly Ala Thr Gly Ile Gly Glu
20 25 30
Ser Thr Val Arg Leu Phe His Lys His Gly Ala Lys Val Cys Ile Ala
35 40 45
Asp Val Gln Asp Asn Leu Gly Gln Gln Val Cys Gln Ser Leu Gly Gly
50 55 60
Glu Pro Asp Thr Phe Phe Cys His Cys Asp Val Thr Lys Glu Glu Asp
65 70 75 80
Val Cys Ser Ala Val Asp Leu Thr Val Glu Lys Phe Gly Thr Leu Asp
85 90 95
Ile Met Val Asn Asn Ala Gly Ile Ser Gly Ala Pro Cys Pro Asp Ile
100 105 110
Arg Glu Ala Asp Leu Ser Glu Phe Glu Lys Val Phe Asp Ile Asn Val
115 120 125
Lys Gly Val Phe His Gly Met Lys His Ala Ala Arg Ile Met Ile Pro
130 135 140
Gln Thr Lys Gly Thr Ile Ile Ser Ile Cys Ser Val Ala Gly Ala Ile
145 150 155 160
Gly Gly Leu Gly Pro His Ala Tyr Thr Gly Ser Lys His Ala Val Leu
165 170 175
Gly Leu Asn Lys Asn Val Ala Ala Glu Leu Gly Lys Tyr Gly Ile Arg
180 185 190
Val Asn Cys Val Ser Pro Tyr Ala Val Ala Thr Gly Leu Ala Leu Ala
195 200 205
His Leu Pro Glu Glu Glu Arg Thr Glu Asp Ala Met Val Gly Phe Arg
210 215 220
Asn Phe Val Ala Arg Asn Ala Asn Met Gln Gly Thr Glu Leu Thr Ala
225 230 235 240
Asn Asp Val Ala Asn Ala Val Leu Phe Leu Ala Ser Asp Glu Ala Arg
245 250 255
Tyr Ile Ser Gly Thr Asn Leu Met Val Asp Gly Gly Phe Thr Ser Val
260 265 270
Asn His Ser Leu Arg Val Phe Arg
275 280
<210> 99
<211> 843
<212> DNA
<213> 甜橙
<400> 99
atgtccaata gcaactctac ggattcgtcg ccggcagtcc aacgcctggt cggtcgtgtc 60
gccctgatta cgggtggtgc aacgggtatt ggcgaaagca cggtgcgcct gtttcataaa 120
catggcgcga aagtgtgtat tgccgacgtt caggataacc tgggtcagca agtgtgtcag 180
agtctgggtg gcgaaccgga tacctttttc tgccattgtg atgtgacgaa agaagaagat 240
gtgtgtagcg cagttgatct gaccgtggaa aaatttggca ccctggacat tatggtgaac 300
aatgcgggta ttagcggcgc accgtgcccg gacattcgtg aagccgatct gagcgaattt 360
gaaaaagttt tcgacatcaa cgtgaaaggc gtgtttcacg gcatgaaaca tgcagcgcgt 420
attatgatcc cgcaaaccaa aggcaccatt atcagcattt gctccgtggc tggtgcgatt 480
ggtggcctgg gtccgcacgc atataccggc tccaaacatg cagtcctggg cctgaacaaa 540
aacgtggccg cggaactggg caaatacggt atccgtgtga attgcgtcag cccgtatgct 600
gttgccaccg gcctggctct ggcacacctg ccggaagaag aacgtaccga agatgcaatg 660
gtgggctttc gtaattttgt ggcacgcaac gcgaatatgc aaggcaccga actgacggcg 720
aatgatgtgg caaacgcggt cctgtttctg gcctctgatg aagcccgtta tatcagcggc 780
acgaatctga tggtggatgg cggttttacc tcggtcaatc actcgctgcg tgtcttccgt 840
taa 843
<210> 100
<211> 282
<212> PRT
<213> 短柄草
<400> 100
Met Ser Ala Ala Ala Ala Val Ser Ser Ser Ser Ser Pro Arg Leu Glu
1 5 10 15
Gly Lys Val Ala Leu Val Thr Gly Gly Ala Ser Gly Ile Gly Glu Ala
20 25 30
Ile Val Arg Leu Phe Arg Gln His Gly Ala Lys Val Cys Ile Ala Asp
35 40 45
Val Gln Asp Glu Ala Gly Gln Gln Val Arg Asp Ser Leu Gly Asp Asp
50 55 60
Ala Gly Thr Asp Val Leu Phe Val His Cys Asp Val Thr Val Glu Glu
65 70 75 80
Asp Val Ser Arg Ala Val Asp Ala Ala Ala Glu Lys Phe Gly Thr Leu
85 90 95
Asp Ile Met Val Asn Asn Ala Gly Ile Thr Gly Asp Lys Val Thr Asp
100 105 110
Ile Arg Asn Leu Asp Phe Ala Glu Val Arg Lys Val Phe Asp Ile Asn
115 120 125
Val His Gly Met Leu Leu Gly Met Lys His Ala Ala Arg Val Met Ile
130 135 140
Pro Gly Lys Lys Gly Ser Ile Val Ser Leu Ala Ser Val Ala Ser Val
145 150 155 160
Met Gly Gly Met Gly Pro His Ala Tyr Thr Ala Ser Lys His Ala Val
165 170 175
Val Gly Leu Thr Lys Ser Val Ala Leu Glu Leu Gly Lys His Gly Ile
180 185 190
Arg Val Asn Cys Val Ser Pro Tyr Ala Val Pro Thr Ala Leu Ser Met
195 200 205
Pro His Leu Pro Gln Gly Glu His Lys Gly Asp Ala Val Arg Asp Phe
210 215 220
Leu Ala Phe Val Gly Gly Glu Ala Asn Leu Lys Gly Val Asp Leu Leu
225 230 235 240
Pro Lys Asp Val Ala Gln Ala Val Leu Tyr Leu Ala Ser Asp Glu Ala
245 250 255
Arg Tyr Ile Ser Ala Leu Asn Leu Val Val Asp Gly Gly Phe Thr Ser
260 265 270
Val Asn Pro Asn Leu Lys Ala Phe Glu Asp
275 280
<210> 101
<211> 849
<212> DNA
<213> 短柄草
<400> 101
atgtccgctg ctgccgccgt gtcctcctca tcgtcgccgc gtctggaagg caaagtcgct 60
ctggttacgg gtggtgcgtc aggtatcggc gaagccattg tgcgcctgtt ccgtcaacat 120
ggtgccaaag tgtgtatcgc ggatgtccaa gacgaagcgg gccaacaggt ccgtgatagc 180
ctgggtgacg atgccggtac ggatgtgctg tttgtgcatt gcgacgttac cgtggaagaa 240
gatgtgtcac gcgcggtgga tgccgctgcg gaaaaattcg gcaccctgga cattatggtg 300
aacaacgcag gtattacggg cgacaaagtg acggacattc gcaacctgga tttcgctgaa 360
gtccgtaaag tgttcgacat caatgtgcac ggtatgctgc tgggcatgaa acatgcggcc 420
cgcgtgatga ttccgggtaa aaaaggctcg attgtgagcc tggcatcggt cgcaagcgtt 480
atgggtggta tgggtccgca cgcatatacc gcaagcaaac acgcggttgt gggtctgacg 540
aaaagcgttg cactggaact gggcaaacat ggtattcgtg tcaactgtgt gagcccgtat 600
gcagttccga ccgcactgtc aatgccgcac ctgccgcagg gcgaacataa aggtgatgcg 660
gtgcgtgatt tcctggcgtt tgttggcggt gaagcgaatc tgaaaggtgt cgatctgctg 720
ccgaaagatg ttgcacaggc ggttctgtat ctggcaagcg acgaagcgcg ctatatttct 780
gcgctgaatc tggtggttga tggcggtttt acgagcgtga atccgaatct gaaagcattt 840
gaagactaa 849
<210> 102
<211> 267
<212> PRT
<213> 红球姜
<400> 102
Met Arg Leu Glu Gly Lys Val Ala Leu Val Thr Gly Gly Ala Ser Gly
1 5 10 15
Ile Gly Glu Ser Ile Ala Arg Leu Phe Ile Glu His Gly Ala Lys Ile
20 25 30
Cys Ile Val Asp Val Gln Asp Glu Leu Gly Gln Gln Val Ser Gln Arg
35 40 45
Leu Gly Gly Asp Pro His Ala Cys Tyr Phe His Cys Asp Val Thr Val
50 55 60
Glu Asp Asp Val Arg Arg Ala Val Asp Phe Thr Ala Glu Lys Tyr Gly
65 70 75 80
Thr Ile Asp Ile Met Val Asn Asn Ala Gly Ile Thr Gly Asp Lys Val
85 90 95
Ile Asp Ile Arg Asp Ala Asp Phe Asn Glu Phe Lys Lys Val Phe Asp
100 105 110
Ile Asn Val Asn Gly Val Phe Leu Gly Met Lys His Ala Ala Arg Ile
115 120 125
Met Ile Pro Lys Met Lys Gly Ser Ile Val Ser Leu Ala Ser Val Ser
130 135 140
Ser Val Ile Ala Gly Ala Gly Pro His Gly Tyr Thr Gly Ala Lys His
145 150 155 160
Ala Val Val Gly Leu Thr Lys Ser Val Ala Ala Glu Leu Gly Arg His
165 170 175
Gly Ile Arg Val Asn Cys Val Ser Pro Tyr Ala Val Pro Thr Arg Leu
180 185 190
Ser Met Pro Tyr Leu Pro Glu Ser Glu Met Gln Glu Asp Ala Leu Arg
195 200 205
Gly Phe Leu Thr Phe Val Arg Ser Asn Ala Asn Leu Lys Gly Val Asp
210 215 220
Leu Met Pro Asn Asp Val Ala Glu Ala Val Leu Tyr Leu Ala Thr Glu
225 230 235 240
Glu Ser Lys Tyr Val Ser Gly Leu Asn Leu Val Ile Asp Gly Gly Phe
245 250 255
Ser Ile Ala Asn His Thr Leu Gln Val Phe Glu
260 265
<210> 103
<211> 804
<212> DNA
<213> 红球姜
<400> 103
atgcgtctgg aaggcaaagt ggctctggtc acgggcggtg cgtcgggtat tggcgaatct 60
attgctcgtc tgtttattga acacggtgca aaaatttgca tcgtggatgt ccaggatgaa 120
ctgggtcaac aggtctctca gcgtctgggt ggcgatccgc acgcctgtta tttccactgt 180
gatgtgaccg tggaagatga cgttcgtcgc gcggtggatt ttacggcgga aaaatatggc 240
accattgaca ttatggttaa caatgcgggc attacgggcg ataaagtgat cgatattcgt 300
gatgcggatt tcaacgaatt taaaaaagtg ttcgacatta acgtgaatgg tgtctttctg 360
ggcatgaaac acgcagcgcg tattatgatc ccgaaaatga aaggctccat cgtttcgctg 420
gcgtccgtta gctcggtgat tgctggtgca ggtccgcatg gctataccgg cgcaaaacat 480
gcggttgtgg gtctgaccaa aagcgttgca gccgaactgg gtcgtcatgg tattcgcgtg 540
aactgcgttt cgccgtatgc ggtgccgacg cgcctgtcaa tgccgtatct gccggaatcg 600
gaaatgcagg aagatgcact gcgcggcttt ctgacctttg tgcgtagcaa tgcgaacctg 660
aaaggcgttg atctgatgcc gaatgatgtg gcggaagctg ttctgtatct ggcgaccgaa 720
gaaagcaaat atgtttcagg tctgaatctg gttattgacg gcggcttctc catcgctaat 780
cataccctgc aagtgtttga ataa 804
<210> 104
<211> 505
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜氧化酶变体
<400> 104
Met Ala Trp Glu Tyr Ala Leu Ile Gly Leu Val Val Gly Ile Ile Ile
1 5 10 15
Gly Ala Val Ala Met Arg Trp Tyr Leu Lys Ser Tyr Thr Ser Ala Arg
20 25 30
Arg Ser Gln Ser Asn Arg Leu Pro Arg Val Pro Glu Val Pro Gly Val
35 40 45
Pro Leu Leu Gly Asn Leu Leu Gln Leu Lys Glu Lys Lys Pro Tyr Met
50 55 60
Thr Phe Thr Arg Trp Ala Ala Thr Tyr Gly Pro Ile Tyr Ser Ile Lys
65 70 75 80
Thr Gly Ala Thr Ser Met Val Val Val Ser Ser Asn Glu Ile Ala Lys
85 90 95
Glu Ala Leu Val Thr Arg Phe Gln Ser Ile Ser Thr Arg Asn Leu Ser
100 105 110
Lys Ala Leu Lys Val Leu Thr Ala Asp Lys Thr Met Val Ala Met Ser
115 120 125
Asp Tyr Asp Asp Tyr His Lys Thr Val Lys Arg His Ile Leu Thr Ala
130 135 140
Val Leu Gly Pro Asn Ala Gln Lys Lys His Arg Ile His Arg Asp Ile
145 150 155 160
Met Met Asp Asn Ile Ser Thr Gln Leu His Glu Phe Val Lys Asn Asn
165 170 175
Pro Glu Gln Glu Glu Val Asp Leu Arg Lys Ile Phe Gln Ser Glu Leu
180 185 190
Phe Gly Leu Ala Met Arg Gln Ala Leu Gly Lys Asp Val Glu Ser Leu
195 200 205
Tyr Val Glu Asp Leu Lys Ile Thr Met Asn Arg Asp Glu Ile Leu Gln
210 215 220
Val Leu Val Val Asp Pro Met Met Gly Ala Ile Asp Val Asp Trp Arg
225 230 235 240
Asp Phe Phe Pro Tyr Leu Lys Trp Val Pro Asn Lys Lys Phe Glu Asn
245 250 255
Thr Ile Gln Gln Met Tyr Ile Arg Arg Glu Ala Val Met Lys Ser Leu
260 265 270
Ile Lys Glu Gln Lys Lys Arg Ile Ala Ser Gly Glu Lys Leu Asn Ser
275 280 285
Tyr Ile Asp Tyr Leu Leu Ser Glu Ala Gln Thr Leu Thr Asp Gln Gln
290 295 300
Leu Leu Met Ser Leu Trp Glu Pro Ile Ile Glu Ser Ser Asp Thr Thr
305 310 315 320
Met Val Thr Thr Glu Trp Ala Met Tyr Glu Leu Ala Lys Asn Pro Lys
325 330 335
Leu Gln Asp Arg Leu Tyr Arg Asp Ile Lys Ser Val Cys Gly Ser Glu
340 345 350
Lys Ile Thr Glu Glu His Leu Ser Gln Leu Pro Tyr Ile Thr Ala Ile
355 360 365
Phe His Glu Thr Leu Arg Lys His Ser Pro Val Pro Ile Ile Pro Leu
370 375 380
Arg His Val His Glu Asp Thr Gln Leu Gly Gly Tyr His Val Pro Ala
385 390 395 400
Gly Thr Glu Leu Ala Val Asn Ile Tyr Gly Cys Asn Met Asp Lys Asn
405 410 415
Val Trp Glu Asn Pro Glu Glu Trp Asn Pro Glu Arg Phe Met Lys Glu
420 425 430
Asn Glu Thr Ala Asp Phe Gln Lys Thr Met Ala Phe Gly Gly Gly Lys
435 440 445
Arg Val Cys Ala Gly Ser Leu Gln Ala Leu Leu Thr Ala Ser Ile Gly
450 455 460
Ile Gly Arg Met Val Gln Glu Phe Glu Trp Lys Leu Lys Asp Met Asp
465 470 475 480
Gln Glu Glu Val Asn Thr Ile Gly Leu Thr Asn Gln Met Leu Arg Pro
485 490 495
Leu Arg Ala Ile Ile Lys Pro Arg Ile
500 505
<210> 105
<211> 505
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜氧化酶变体
<400> 105
Met Ala Trp Glu Tyr Ala Leu Ile Gly Leu Val Val Gly Ile Ile Ile
1 5 10 15
Gly Ala Val Ala Met Arg Trp Tyr Leu Lys Ser Tyr Thr Ser Ala Arg
20 25 30
Arg Ser Gln Ser Asn His Leu Pro Arg Val Pro Glu Val Pro Gly Val
35 40 45
Pro Leu Leu Gly Asn Leu Leu Gln Leu Lys Glu Lys Lys Pro Tyr Met
50 55 60
Thr Phe Thr Lys Trp Ala Ala Thr Tyr Gly Pro Ile Tyr Ser Ile Lys
65 70 75 80
Thr Gly Ala Thr Ser Val Val Val Val Ser Ser Asn Glu Ile Ala Lys
85 90 95
Glu Ala Leu Val Thr Arg Phe Gln Ser Ile Ser Thr Arg Asn Leu Ser
100 105 110
Lys Ala Leu Lys Val Leu Thr Ala Asp Lys Gln Met Val Ala Met Ser
115 120 125
Asp Tyr Asp Asp Tyr His Lys Thr Val Lys Arg His Ile Leu Thr Ala
130 135 140
Val Leu Gly Pro Asn Ala Gln Lys Lys His Arg Ile His Arg Asp Ile
145 150 155 160
Met Met Asp Asn Ile Ser Thr Gln Leu His Glu Phe Val Lys Asn Asn
165 170 175
Pro Glu Gln Glu Glu Val Asp Leu Arg Lys Ile Phe Gln Ser Glu Leu
180 185 190
Phe Gly Leu Ala Met Arg Gln Ala Leu Gly Lys Asp Val Glu Ser Leu
195 200 205
Tyr Val Glu Asp Leu Lys Ile Thr Met Asn Arg Asp Glu Ile Leu Gln
210 215 220
Val Leu Val Val Asp Pro Met Met Gly Ala Ile Asp Val Asp Trp Arg
225 230 235 240
Asp Phe Phe Pro Tyr Leu Lys Trp Val Pro Asn Lys Lys Phe Glu Asn
245 250 255
Thr Ile Gln Gln Met Tyr Ile Arg Arg Glu Ala Val Met Lys Ser Leu
260 265 270
Ile Lys Glu Gln Lys Lys Arg Ile Ala Ser Gly Glu Lys Leu Asn Ser
275 280 285
Tyr Ile Asp Tyr Leu Leu Ser Glu Ala Gln Thr Leu Thr Asp Gln Gln
290 295 300
Leu Leu Met Ser Leu Trp Glu Pro Ile Ile Glu Ser Ser Asp Thr Thr
305 310 315 320
Met Val Thr Thr Glu Trp Ala Met Tyr Glu Leu Ala Lys Asn Pro Lys
325 330 335
Leu Gln Asp Arg Leu Tyr Arg Asp Ile Lys Ser Val Cys Gly Ser Glu
340 345 350
Lys Ile Thr Glu Glu His Leu Ser Gln Leu Pro Tyr Ile Thr Ala Ile
355 360 365
Phe His Glu Thr Leu Arg Lys His Ser Pro Val Pro Ile Leu Pro Leu
370 375 380
Arg His Val His Glu Asp Thr Val Leu Gly Gly Tyr His Val Pro Ala
385 390 395 400
Gly Thr Glu Leu Ala Val Asn Ile Tyr Gly Cys Asn Met Asp Lys Asn
405 410 415
Val Trp Glu Asn Pro Glu Glu Trp Asn Pro Glu Arg Phe Met Lys Glu
420 425 430
Asn Glu Thr Ile Asp Phe Gln Lys Thr Met Ala Phe Gly Gly Gly Lys
435 440 445
Arg Val Cys Ala Gly Ser Leu Gln Ala Leu Leu Ile Ala Ser Ile Gly
450 455 460
Ile Gly Arg Met Val Gln Glu Phe Glu Trp Lys Leu Lys Asp Met Thr
465 470 475 480
Gln Glu Glu Val Asn Thr Ile Gly Leu Thr Asn Gln Met Leu Arg Pro
485 490 495
Leu Arg Ala Ile Ile Lys Pro Arg Ile
500 505
<210> 106
<211> 501
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜氧化酶变体
<400> 106
Met Ala Leu Leu Leu Ala Val Phe Ala Val Ala Leu Ala Val Ala Leu
1 5 10 15
Ile Phe Trp Tyr Leu Lys Ser Tyr Thr Ser Ala Arg Arg Ser Gln Ser
20 25 30
Asn His Leu Pro Arg Val Pro Glu Val Pro Gly Val Pro Leu Leu Gly
35 40 45
Asn Leu Leu Gln Leu Lys Glu Lys Lys Pro Tyr Met Thr Phe Thr Arg
50 55 60
Trp Ala Ala Thr Tyr Gly Pro Ile Tyr Ser Ile Lys Thr Gly Ala Thr
65 70 75 80
Ser Met Val Val Val Ser Ser Asn Glu Ile Ala Lys Glu Ala Leu Val
85 90 95
Thr Arg Phe Gln Ser Ile Ser Thr Arg Asn Leu Ser Lys Ala Leu Lys
100 105 110
Val Leu Thr Ala Asp Lys Thr Met Val Ala Met Ser Asp Tyr Asp Asp
115 120 125
Tyr His Lys Thr Val Lys Arg His Ile Leu Thr Ala Val Leu Gly Pro
130 135 140
Asn Ala Gln Lys Lys His Arg Ile His Arg Asp Ile Met Met Asp Asn
145 150 155 160
Ile Ser Thr Gln Leu His Glu Phe Val Lys Asn Asn Pro Glu Gln Glu
165 170 175
Glu Val Asp Leu Arg Lys Ile Phe Gln Ser Glu Leu Phe Gly Leu Ala
180 185 190
Met Arg Gln Ala Leu Gly Lys Asp Val Glu Ser Leu Tyr Val Glu Asp
195 200 205
Leu Lys Ile Thr Met Asn Arg Asp Glu Ile Phe Gln Val Leu Val Val
210 215 220
Asp Pro Met Met Gly Ala Ile Asp Val Asp Trp Arg Asp Phe Phe Pro
225 230 235 240
Tyr Leu Lys Trp Val Pro Asn Lys Lys Phe Glu Asn Thr Ile Gln Gln
245 250 255
Met Tyr Ile Arg Arg Glu Ala Val Met Lys Ser Leu Ile Lys Glu His
260 265 270
Lys Lys Arg Ile Ala Ser Gly Glu Lys Leu Asn Ser Tyr Ile Asp Tyr
275 280 285
Leu Leu Ser Glu Ala Gln Thr Leu Thr Asp Gln Gln Leu Leu Met Ser
290 295 300
Leu Trp Glu Pro Ile Ile Glu Ser Ser Asp Thr Thr Met Val Thr Thr
305 310 315 320
Glu Trp Ala Met Tyr Glu Leu Ala Lys Asn Pro Lys Leu Gln Asp Arg
325 330 335
Leu Tyr Arg Asp Ile Lys Ser Val Cys Gly Ser Glu Lys Ile Thr Glu
340 345 350
Glu His Leu Ser Gln Leu Pro Tyr Ile Thr Ala Ile Phe His Glu Thr
355 360 365
Leu Arg Arg His Ser Pro Val Pro Ile Ile Pro Leu Arg His Val His
370 375 380
Glu Asp Thr Val Leu Gly Gly Tyr His Val Pro Ala Gly Thr Glu Leu
385 390 395 400
Ala Val Asn Ile Tyr Gly Cys Asn Met Asp Lys Asn Val Trp Glu Asn
405 410 415
Pro Glu Glu Trp Asn Pro Glu Arg Phe Met Lys Glu Asn Glu Thr Ile
420 425 430
Asp Phe Gln Lys Thr Met Ala Phe Gly Gly Gly Lys Arg Val Cys Ala
435 440 445
Gly Ser Leu Gln Ala Leu Leu Thr Ala Ser Ile Gly Ile Gly Arg Met
450 455 460
Val Gln Glu Phe Glu Trp Lys Leu Lys Asp Met Thr Gln Glu Glu Val
465 470 475 480
Asn Thr Ile Gly Leu Thr Thr Gln Met Leu Arg Pro Leu Arg Ala Ile
485 490 495
Ile Lys Pro Arg Ile
500
<210> 107
<211> 501
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜氧化酶变体
<400> 107
Met Ala Leu Leu Leu Ala Val Phe Ala Val Ala Leu Ala Val Ala Leu
1 5 10 15
Ile Phe Trp Tyr Leu Lys Ser Tyr Thr Ser Ala Arg Arg Ser Gln Ser
20 25 30
Asn His Leu Pro Arg Val Pro Glu Val Pro Gly Val Pro Leu Leu Gly
35 40 45
Asn Leu Leu Gln Leu Lys Glu Lys Lys Pro Tyr Met Thr Phe Thr Arg
50 55 60
Trp Ala Ala Thr Tyr Gly Pro Ile Tyr Ser Ile Lys Thr Gly Ala Thr
65 70 75 80
Ser Met Val Val Val Ser Ser Asn Glu Ile Ala Lys Glu Ala Leu Val
85 90 95
Thr Arg Phe Gln Ser Ile Ser Thr Arg Asn Leu Ser Lys Ala Leu Lys
100 105 110
Val Leu Thr Ala Asp Lys Thr Met Val Ala Met Ser Asp Tyr Asp Asp
115 120 125
Tyr His Lys Thr Val Lys Arg His Ile Leu Thr Ala Val Leu Gly Pro
130 135 140
Asn Ala Gln Lys Lys His Arg Ile His Arg Asp Ile Met Met Asp Asn
145 150 155 160
Ile Ser Thr Gln Leu His Glu Phe Val Lys Asn Asn Pro Glu Gln Glu
165 170 175
Glu Val Asp Leu Arg Lys Ile Phe Gln Ser Glu Leu Phe Gly Leu Ala
180 185 190
Met Arg Gln Ala Leu Gly Lys Asp Val Glu Ser Leu Tyr Val Glu Asp
195 200 205
Leu Lys Ile Thr Met Asn Arg Asp Glu Ile Phe Gln Val Leu Val Val
210 215 220
Asp Pro Met Met Gly Ala Ile Asp Val Asp Trp Arg Asp Phe Phe Pro
225 230 235 240
Tyr Leu Lys Trp Val Pro Asn Lys Lys Phe Glu Asn Thr Ile Gln Gln
245 250 255
Met Tyr Ile Arg Arg Glu Ala Val Met Lys Ser Leu Ile Lys Glu His
260 265 270
Lys Lys Arg Ile Ala Ser Gly Glu Lys Leu Asn Ser Tyr Ile Asp Tyr
275 280 285
Leu Leu Ser Glu Ala Gln Thr Leu Thr Asp Gln Gln Leu Leu Met Ser
290 295 300
Leu Trp Glu Pro Ile Ile Glu Ser Ser Asp Thr Thr Met Val Thr Thr
305 310 315 320
Glu Trp Ala Met Tyr Glu Leu Ala Lys Asn Pro Lys Leu Gln Asp Arg
325 330 335
Leu Tyr Arg Asp Ile Lys Ser Val Cys Gly Ser Glu Lys Ile Thr Glu
340 345 350
Glu His Leu Ser Gln Leu Pro Tyr Ile Thr Ala Ile Phe His Glu Thr
355 360 365
Leu Arg Arg His Ser Pro Val Pro Ile Ile Pro Leu Arg His Val His
370 375 380
Glu Asp Thr Val Leu Gly Gly Tyr His Val Pro Ala Gly Thr Glu Leu
385 390 395 400
Ala Val Asn Ile Tyr Gly Cys Asn Met Asp Lys Asn Val Trp Glu Asn
405 410 415
Pro Glu Glu Trp Asn Pro Glu Arg Phe Met Lys Glu Asn Glu Thr Ile
420 425 430
Asp Phe Gln Lys Thr Met Ala Phe Gly Gly Gly Lys Arg Val Cys Ala
435 440 445
Gly Ser Leu Gln Ala Leu Leu Thr Ala Ser Ile Gly Ile Gly Arg Met
450 455 460
Val Gln Glu Phe Glu Trp Lys Leu Lys Asp Met Thr Gln Glu Glu Val
465 470 475 480
Asn Thr Ile Gly Leu Thr Asn Gln Met Leu Arg Pro Leu Arg Ala Ile
485 490 495
Ile Lys Pro Arg Ile
500
<210> 108
<211> 513
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜氧化酶变体
<400> 108
Met Asp Ala Val Thr Gly Leu Leu Thr Val Pro Ala Thr Ala Ile Thr
1 5 10 15
Ile Gly Gly Thr Ala Val Ala Leu Ala Val Ala Leu Ile Phe Trp Tyr
20 25 30
Leu Lys Ser Tyr Thr Ser Ala Arg Arg Ser Gln Ser Asn His Leu Pro
35 40 45
Arg Val Pro Glu Val Pro Gly Val Pro Leu Leu Gly Asn Leu Leu Gln
50 55 60
Leu Lys Glu Lys Lys Pro Tyr Met Thr Phe Thr Arg Trp Ala Ala Thr
65 70 75 80
Tyr Gly Pro Ile Tyr Ser Ile Lys Thr Gly Ala Thr Ser Met Val Val
85 90 95
Val Ser Ser Asn Glu Ile Ala Lys Glu Ala Leu Val Thr Arg Phe Gln
100 105 110
Ser Ile Ser Thr Arg Asn Leu Ser Lys Ala Leu Lys Val Leu Thr Ala
115 120 125
Asp Lys Thr Met Val Ala Met Ser Asp Tyr Asp Asp Tyr His Lys Thr
130 135 140
Val Lys Arg His Ile Leu Thr Ala Val Leu Gly Pro Asn Ala Gln Lys
145 150 155 160
Lys His Arg Ile His Arg Asp Ile Met Met Asp Asn Ile Ser Thr Gln
165 170 175
Leu His Glu Phe Val Lys Asn Asn Pro Glu Gln Glu Glu Val Asp Leu
180 185 190
Arg Lys Ile Phe Gln Ser Glu Leu Phe Gly Leu Ala Met Arg Gln Ala
195 200 205
Leu Gly Lys Asp Val Glu Ser Leu Tyr Val Glu Asp Leu Lys Ile Thr
210 215 220
Met Asn Arg Asp Glu Ile Phe Gln Val Leu Val Val Asp Pro Met Met
225 230 235 240
Gly Ala Ile Asp Val Asp Trp Arg Asp Phe Phe Pro Tyr Leu Lys Trp
245 250 255
Val Pro Asn Lys Lys Phe Glu Asn Thr Ile Gln Gln Met Tyr Ile Arg
260 265 270
Arg Glu Ala Val Met Lys Ser Leu Ile Lys Glu His Lys Lys Arg Ile
275 280 285
Ala Ser Gly Glu Lys Leu Asn Ser Tyr Ile Asp Tyr Leu Leu Ser Glu
290 295 300
Ala Gln Thr Leu Thr Asp Gln Gln Leu Leu Met Ser Leu Trp Glu Pro
305 310 315 320
Ile Ile Glu Ser Ser Asp Thr Thr Met Val Thr Thr Glu Trp Ala Met
325 330 335
Tyr Glu Leu Ala Lys Asn Pro Lys Leu Gln Asp Arg Leu Tyr Arg Asp
340 345 350
Ile Lys Ser Val Cys Gly Ser Glu Lys Ile Thr Glu Glu His Leu Ser
355 360 365
Gln Leu Pro Tyr Ile Thr Ala Ile Phe His Glu Thr Leu Arg Arg His
370 375 380
Ser Pro Val Pro Ile Ile Pro Leu Arg His Val His Glu Asp Thr Val
385 390 395 400
Leu Gly Gly Tyr His Val Pro Ala Gly Thr Glu Leu Ala Val Asn Ile
405 410 415
Tyr Gly Cys Asn Met Asp Lys Asn Val Trp Glu Asn Pro Glu Glu Trp
420 425 430
Asn Pro Glu Arg Phe Met Lys Glu Asn Glu Thr Ile Asp Phe Gln Lys
435 440 445
Thr Met Ala Phe Gly Gly Gly Lys Arg Val Cys Ala Gly Ser Leu Gln
450 455 460
Ala Leu Leu Thr Ala Ser Ile Gly Ile Gly Arg Met Val Gln Glu Phe
465 470 475 480
Glu Trp Lys Leu Lys Asp Met Thr Gln Glu Glu Val Asn Thr Ile Gly
485 490 495
Leu Thr Thr Gln Met Leu Arg Pro Leu Arg Ala Ile Ile Lys Pro Arg
500 505 510
Ile
<210> 109
<211> 513
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜氧化酶变体
<400> 109
Met Asp Ala Val Thr Gly Leu Leu Thr Val Pro Ala Thr Ala Ile Thr
1 5 10 15
Ile Gly Gly Thr Ala Val Ala Leu Ala Val Ala Leu Ile Phe Trp Tyr
20 25 30
Leu Lys Ser Tyr Thr Ser Ala Arg Arg Ser Gln Ser Asn His Leu Pro
35 40 45
Arg Val Pro Glu Val Pro Gly Val Pro Leu Leu Gly Asn Leu Leu Gln
50 55 60
Leu Lys Glu Lys Lys Pro Tyr Met Thr Phe Thr Arg Trp Ala Ala Thr
65 70 75 80
Tyr Gly Pro Ile Tyr Ser Ile Lys Thr Gly Ala Thr Ser Met Val Val
85 90 95
Val Ser Ser Asn Glu Ile Ala Lys Glu Ala Leu Val Thr Arg Phe Gln
100 105 110
Ser Ile Ser Thr Arg Asn Leu Ser Lys Ala Leu Lys Val Leu Thr Ala
115 120 125
Asp Lys Thr Met Val Ala Met Ser Asp Tyr Asp Asp Tyr His Lys Thr
130 135 140
Val Lys Arg His Ile Leu Thr Ala Val Leu Gly Pro Asn Ala Gln Lys
145 150 155 160
Lys His Arg Ile His Arg Asp Ile Met Met Asp Asn Ile Ser Thr Gln
165 170 175
Leu His Glu Phe Val Lys Asn Asn Pro Glu Gln Glu Glu Val Asp Leu
180 185 190
Arg Lys Ile Phe Gln Ser Glu Leu Phe Gly Leu Ala Met Arg Gln Ala
195 200 205
Leu Gly Lys Asp Val Glu Ser Leu Tyr Val Glu Asp Leu Lys Ile Thr
210 215 220
Met Asn Arg Asp Glu Ile Phe Gln Val Leu Val Val Asp Pro Met Met
225 230 235 240
Gly Ala Ile Asp Val Asp Trp Arg Asp Phe Phe Pro Tyr Leu Lys Trp
245 250 255
Val Pro Asn Lys Lys Phe Glu Asn Thr Ile Gln Gln Met Tyr Ile Arg
260 265 270
Arg Glu Ala Val Met Lys Ser Leu Ile Lys Glu His Lys Lys Arg Ile
275 280 285
Ala Ser Gly Glu Lys Leu Asn Ser Tyr Ile Asp Tyr Leu Leu Ser Glu
290 295 300
Ala Gln Thr Leu Thr Asp Gln Gln Leu Leu Met Ser Leu Trp Glu Pro
305 310 315 320
Ile Ile Glu Ser Ser Asp Thr Thr Met Val Thr Thr Glu Trp Ala Met
325 330 335
Tyr Glu Leu Ala Lys Asn Pro Lys Leu Gln Asp Arg Leu Tyr Arg Asp
340 345 350
Ile Lys Ser Val Cys Gly Ser Glu Lys Ile Thr Glu Glu His Leu Ser
355 360 365
Gln Leu Pro Tyr Ile Thr Ala Ile Phe His Glu Thr Leu Arg Arg His
370 375 380
Ser Pro Val Pro Ile Ile Pro Leu Arg His Val His Glu Asp Thr Val
385 390 395 400
Leu Gly Gly Tyr His Val Pro Ala Gly Thr Glu Leu Ala Val Asn Ile
405 410 415
Tyr Gly Cys Asn Met Asp Lys Asn Val Trp Glu Asn Pro Glu Glu Trp
420 425 430
Asn Pro Glu Arg Phe Met Lys Glu Asn Glu Thr Ile Asp Phe Gln Lys
435 440 445
Thr Met Ala Phe Gly Gly Gly Lys Arg Val Cys Ala Gly Ser Leu Gln
450 455 460
Ala Leu Leu Thr Ala Ser Ile Gly Ile Gly Arg Met Val Gln Glu Phe
465 470 475 480
Glu Trp Lys Leu Lys Asp Met Thr Gln Glu Glu Val Asn Thr Ile Gly
485 490 495
Leu Thr Asn Gln Met Leu Arg Pro Leu Arg Ala Ile Ile Lys Pro Arg
500 505 510
Ile
<210> 110
<211> 513
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜氧化酶变体
<400> 110
Met Asp Ala Val Thr Gly Leu Leu Thr Val Pro Ala Thr Ala Ile Thr
1 5 10 15
Ile Gly Gly Thr Ala Val Ala Leu Ala Val Ala Leu Ile Phe Trp Tyr
20 25 30
Leu Lys Ser Tyr Thr Ser Ala Arg Arg Ser Gln Ser Asn Arg Leu Pro
35 40 45
Arg Val Pro Glu Val Pro Gly Val Pro Leu Leu Gly Asn Leu Leu Gln
50 55 60
Leu Lys Glu Lys Lys Pro Tyr Met Thr Phe Thr Arg Trp Ala Ala Thr
65 70 75 80
Tyr Gly Pro Ile Tyr Ser Ile Lys Thr Gly Ala Thr Ser Met Val Val
85 90 95
Val Ser Ser Asn Glu Ile Ala Lys Glu Ala Leu Val Thr Arg Phe Gln
100 105 110
Ser Ile Ser Thr Arg Asn Leu Ser Lys Ala Leu Lys Val Leu Thr Ala
115 120 125
Asp Lys Thr Met Val Ala Met Ser Asp Tyr Asp Asp Tyr His Lys Thr
130 135 140
Val Lys Arg His Ile Leu Thr Ala Val Leu Gly Pro Asn Ala Gln Lys
145 150 155 160
Lys His Arg Ile His Arg Asp Ile Met Met Asp Asn Ile Ser Thr Gln
165 170 175
Leu His Glu Phe Val Lys Asn Asn Pro Glu Gln Glu Glu Val Asp Leu
180 185 190
Arg Lys Ile Phe Gln Ser Glu Leu Phe Gly Leu Ala Met Arg Gln Ala
195 200 205
Leu Gly Lys Asp Val Glu Ser Leu Tyr Val Glu Asp Leu Lys Ile Thr
210 215 220
Met Asn Arg Asp Glu Ile Leu Gln Val Leu Val Val Asp Pro Met Met
225 230 235 240
Gly Ala Ile Asp Val Asp Trp Arg Asp Phe Phe Pro Tyr Leu Lys Trp
245 250 255
Val Pro Asn Lys Lys Phe Glu Asn Thr Ile Gln Gln Met Tyr Ile Arg
260 265 270
Arg Glu Ala Val Met Lys Ser Leu Ile Lys Glu Gln Lys Lys Arg Ile
275 280 285
Ala Ser Gly Glu Lys Leu Asn Ser Tyr Ile Asp Tyr Leu Leu Ser Glu
290 295 300
Ala Gln Thr Leu Thr Asp Gln Gln Leu Leu Met Ser Leu Trp Glu Pro
305 310 315 320
Ile Ile Glu Ser Ser Asp Thr Thr Met Val Thr Thr Glu Trp Ala Met
325 330 335
Tyr Glu Leu Ala Lys Asn Pro Lys Leu Gln Asp Arg Leu Tyr Arg Asp
340 345 350
Ile Lys Ser Val Cys Gly Ser Glu Lys Ile Thr Glu Glu His Leu Ser
355 360 365
Gln Leu Pro Tyr Ile Thr Ala Ile Phe His Glu Thr Leu Arg Lys His
370 375 380
Ser Pro Val Pro Ile Ile Pro Leu Arg His Val His Glu Asp Thr Gln
385 390 395 400
Leu Gly Gly Tyr His Val Pro Ala Gly Thr Glu Leu Ala Val Asn Ile
405 410 415
Tyr Gly Cys Asn Met Asp Lys Asn Val Trp Glu Asn Pro Glu Glu Trp
420 425 430
Asn Pro Glu Arg Phe Met Lys Glu Asn Glu Thr Ala Asp Phe Gln Lys
435 440 445
Thr Met Ala Phe Gly Gly Gly Lys Arg Val Cys Ala Gly Ser Leu Gln
450 455 460
Ala Leu Leu Thr Ala Ser Ile Gly Ile Gly Arg Met Val Gln Glu Phe
465 470 475 480
Glu Trp Lys Leu Lys Asp Met Asp Gln Glu Glu Val Asn Thr Ile Gly
485 490 495
Leu Thr Asn Gln Met Leu Arg Pro Leu Arg Ala Ile Ile Lys Pro Arg
500 505 510
Ile
<210> 111
<211> 513
<212> PRT
<213> 人工序列
<220>
<223> 合成的工程化朱栾倍半萜氧化酶变体
<400> 111
Met Asp Ala Val Thr Gly Leu Leu Thr Val Pro Ala Thr Ala Ile Thr
1 5 10 15
Ile Gly Gly Thr Ala Val Ala Leu Ala Val Ala Leu Ile Phe Trp Tyr
20 25 30
Leu Lys Ser Tyr Thr Ser Ala Arg Arg Ser Gln Ser Asn His Leu Pro
35 40 45
Arg Val Pro Glu Val Pro Gly Val Pro Leu Leu Gly Asn Leu Leu Gln
50 55 60
Leu Lys Glu Lys Lys Pro Tyr Met Thr Phe Thr Lys Trp Ala Ala Thr
65 70 75 80
Tyr Gly Pro Ile Tyr Ser Ile Lys Thr Gly Ala Thr Ser Val Val Val
85 90 95
Val Ser Ser Asn Glu Ile Ala Lys Glu Ala Leu Val Thr Arg Phe Gln
100 105 110
Ser Ile Ser Thr Arg Asn Leu Ser Lys Ala Leu Lys Val Leu Thr Ala
115 120 125
Asp Lys Gln Met Val Ala Met Ser Asp Tyr Asp Asp Tyr His Lys Thr
130 135 140
Val Lys Arg His Ile Leu Thr Ala Val Leu Gly Pro Asn Ala Gln Lys
145 150 155 160
Lys His Arg Ile His Arg Asp Ile Met Met Asp Asn Ile Ser Thr Gln
165 170 175
Leu His Glu Phe Val Lys Asn Asn Pro Glu Gln Glu Glu Val Asp Leu
180 185 190
Arg Lys Ile Phe Gln Ser Glu Leu Phe Gly Leu Ala Met Arg Gln Ala
195 200 205
Leu Gly Lys Asp Val Glu Ser Leu Tyr Val Glu Asp Leu Lys Ile Thr
210 215 220
Met Asn Arg Asp Glu Ile Leu Gln Val Leu Val Val Asp Pro Met Met
225 230 235 240
Gly Ala Ile Asp Val Asp Trp Arg Asp Phe Phe Pro Tyr Leu Lys Trp
245 250 255
Val Pro Asn Lys Lys Phe Glu Asn Thr Ile Gln Gln Met Tyr Ile Arg
260 265 270
Arg Glu Ala Val Met Lys Ser Leu Ile Lys Glu Gln Lys Lys Arg Ile
275 280 285
Ala Ser Gly Glu Lys Leu Asn Ser Tyr Ile Asp Tyr Leu Leu Ser Glu
290 295 300
Ala Gln Thr Leu Thr Asp Gln Gln Leu Leu Met Ser Leu Trp Glu Pro
305 310 315 320
Ile Ile Glu Ser Ser Asp Thr Thr Met Val Thr Thr Glu Trp Ala Met
325 330 335
Tyr Glu Leu Ala Lys Asn Pro Lys Leu Gln Asp Arg Leu Tyr Arg Asp
340 345 350
Ile Lys Ser Val Cys Gly Ser Glu Lys Ile Thr Glu Glu His Leu Ser
355 360 365
Gln Leu Pro Tyr Ile Thr Ala Ile Phe His Glu Thr Leu Arg Lys His
370 375 380
Ser Pro Val Pro Ile Leu Pro Leu Arg His Val His Glu Asp Thr Val
385 390 395 400
Leu Gly Gly Tyr His Val Pro Ala Gly Thr Glu Leu Ala Val Asn Ile
405 410 415
Tyr Gly Cys Asn Met Asp Lys Asn Val Trp Glu Asn Pro Glu Glu Trp
420 425 430
Asn Pro Glu Arg Phe Met Lys Glu Asn Glu Thr Ile Asp Phe Gln Lys
435 440 445
Thr Met Ala Phe Gly Gly Gly Lys Arg Val Cys Ala Gly Ser Leu Gln
450 455 460
Ala Leu Leu Ile Ala Ser Ile Gly Ile Gly Arg Met Val Gln Glu Phe
465 470 475 480
Glu Trp Lys Leu Lys Asp Met Thr Gln Glu Glu Val Asn Thr Ile Gly
485 490 495
Leu Thr Asn Gln Met Leu Arg Pro Leu Arg Ala Ile Ile Lys Pro Arg
500 505 510
Ile
<210> 112
<211> 8
<212> PRT
<213> 人工序列
<220>
<223> 合成的前导序列
<400> 112
Met Ala Leu Leu Leu Ala Val Phe
1 5
<210> 113
<211> 7
<212> PRT
<213> 人工序列
<220>
<223> 合成的连接子序列
<400> 113
Gly Ser Gly Gly Gly Gly Ser
1 5
<210> 114
<211> 8
<212> PRT
<213> 人工序列
<220>
<223> 合成的连接子序列
<400> 114
Gly Ser Gly Glu Ala Ala Ala Lys
1 5
<210> 115
<211> 13
<212> PRT
<213> 人工序列
<220>
<223> 合成的连接子序列
<400> 115
Gly Ser Gly Glu Ala Ala Ala Lys Glu Ala Ala Ala Lys
1 5 10
<210> 116
<211> 9
<212> PRT
<213> 人工序列
<220>
<223> 合成的连接子序列
<400> 116
Gly Ser Gly Met Gly Ser Ser Ser Asn
1 5
<210> 117
<211> 5
<212> PRT
<213> 人工序列
<220>
<223> 合成的连接子序列
<400> 117
Gly Ser Thr Gly Ser
1 5
<210> 118
<211> 20
<212> PRT
<213> 人工序列
<220>
<223> 合成序列
<400> 118
Met Ala Trp Glu Tyr Ala Leu Ile Gly Leu Val Val Gly Ile Ile Ile
1 5 10 15
Gly Ala Val Ala
20
权利要求书(按照条约第19条的修改)
1.一种制备倍半萜烯的含氧产物的方法,其包括使倍半萜烯与具有倍半萜烯氧化活性的甜菊贝壳杉烯氧化酶(SrKO)或其衍生物接触,以及回收所述含氧产物。
2.如权利要求1所述的方法,其中所述SrKO衍生物包含相对于SrKO(SEQ ID NO:37、38或55)具有1-50个独立选自替换、缺失或插入的突变的氨基酸序列。
3.如权利要求2所述的方法,其中所述SrKO衍生物包含相对于SrKO(SEQ ID NO:37、38或55)具有1-40个突变的氨基酸序列。
4.如权利要求1-3中任一项所述的方法,其中所述SrKO衍生物包含与SrKO(SEQ ID NO:37、38或55)具有至少70%序列同一性的氨基酸序列,并且具有朱栾倍半萜氧化酶活性。
5.如权利要求2-4中任一项所述的方法,其中相对于SrKO(SEQ ID NO:37、38或55)的一个或多个突变选自表2或表6。
6.如权利要求5所述的方法,其中至少2个突变选自表2和/或表6。
7.如权利要求2所述的方法,其中所述SrKO是缺失其N端跨膜区的至少一部分且添加来自大肠杆菌yhcB或其衍生物的内膜跨膜结构域的衍生物。
8.如权利要求7所述的方法,其中所述SrKO是缺失其N端跨膜结构域的15-35个氨基酸且添加来自大肠杆菌yhcB或其衍生物的跨膜结构域的15-25个氨基酸的衍生物。
9.如权利要求8所述的方法,其中来自yhcB的所述N端跨膜结构域包含SEQ ID NO:118的氨基酸序列或相对于SEQ ID NO:118具有1-5个氨基酸突变的氨基酸序列。
10.如权利要求1-9中任一项所述的方法,其中所述SrKO是相对于SEQ ID NO:37在选自46、76、94、131、231、284、383、390、400、444、468、488和499的位置具有一个或多个突变的衍生物。
11.如权利要求10所述的方法,其中所述SrKO是相对于SEQ ID NO:37具有选自R76K、M94V、T131Q、F231L、H284Q、R383K、I390L、T468I和T499N的一个或多个突变的衍生物。
12.如权利要求1所述的方法,其中所述SrKO是包含选自SEQ ID NO:55-61、104和105的氨基酸序列或相对于选自SEQ ID NO:55-61、104和105的序列具有1-20个突变的氨基酸序列的衍生物。
13.如权利要求1-12中任一项所述的方法,其中所述倍半萜烯是朱栾倍半萜、大根香叶烯(A、B、C、D或E)、法尼烯、法尼醇、广藿香醇、杜松烯、雪松醇、蛇麻烯、长叶烯和/或佛手柑油烯、β-衣兰烯、β-檀香醇、β-檀香烯、α-檀香烯、α-檀香醇、β-香根酮、a-香根酮、客烯醇、红没药烯、β-丁香烯、长叶烯;α-甜橙醛;α-红没药醇、(-)-β-可巴烯、(-)-α-可巴烯、4(Z),7(Z)-癸二烯醛、雪松醇、雪松烯、雪松醇、愈创木醇、(-)-6,9-愈创木二烯、布藜醇、愈创木醇、喇叭烯、喇叭茶醇、香樟烯和α-佛手柑油烯。
14.如权利要求1-13所述的方法,其中主要的倍半萜烯是朱栾倍半萜、大根香叶烯(A、B、C、D或E)、法尼烯、法尼醇、广藿香醇、杜松烯、雪松醇、蛇麻烯、长叶烯和/或佛手柑油烯、β-衣兰烯、β-檀香醇、β-檀香烯、α-檀香烯、α-檀香醇、β-香根酮、a-香根酮、客烯醇、红没药烯、β-丁香烯、长叶烯;α-甜橙醛;α-红没药醇、(-)-β-可巴烯、(-)-α-可巴烯、4(Z),7(Z)-癸二烯醛、雪松醇、雪松烯、雪松醇、愈创木醇、(-)-6,9-愈创木二烯、布藜醇、愈创木醇、喇叭烯、喇叭茶醇、香樟烯和α-佛手柑油烯。
15.如权利要求14所述的方法,其中所述主要的倍半萜烯是朱栾倍半萜。
16.如权利要求15所述的方法,其中主要的含氧产物是诺卡酮。
17.如权利要求1-16中任一项所述的方法,其中所述接触发生在宿主细胞中,所述宿主细胞是细菌。
18.如权利要求17所述的方法,其中所述宿主细胞是大肠杆菌、枯草芽孢杆菌或恶臭假单胞菌。
19.如权利要求1-16中任一项所述的方法,其中所述接触发生在宿主细胞中,所述宿主细胞是酵母。
20.如权利要求19所述的方法,其中所述宿主细胞是酵母属、毕赤酵母属或耶氏酵母属的种类,包括酿酒酵母、巴斯德毕赤酵母和解脂耶氏酵母。
21.如权利要求17-20中任一项所述的方法,其中所述宿主细胞产生异戊基焦磷酸(IPP)。
22.如权利要求21所述的方法,其中所述IPP经内源性或外源性MEP或MVA途径由代谢通量产生。
23.如权利要求22所述的方法,其中所述倍半萜烯至少部分经MEP途径由代谢通量产生,并且其中所述宿主细胞具有至少一个额外拷贝的dxs、ispD、ispF和/或idi基因。
24.如权利要求17-23中任一项所述的方法,其中所述宿主细胞进一步表达法尼基焦磷酸合成酶。
25.如权利要求17-24中任一项所述的方法,其中所述宿主细胞进一步表达异源倍半萜烯合成酶。
26.如权利要求25所述的方法,其中所述倍半萜烯合成酶是朱栾倍半萜合成酶。
27.如权利要求26所述的方法,其中所述朱栾倍半萜合成酶是甜橙或葡萄朱栾倍半萜合成酶或其衍生物。
28.如权利要求21所述的方法,其中VvVS包含相对于VvVS(SEQ ID NO:1)具有1-40个独立选自替换、缺失或插入的突变的氨基酸序列。
29.如权利要求27所述的方法,其中CsVS包含相对于CsVS(SEQ ID NO:12)具有1-40个独立选自替换、缺失或插入的突变的氨基酸序列。
30.如权利要求28所述的方法,其中一个或多个突变选自表3。
31.如权利要求1-30中任一项所述的方法,其中所述SrKO在共表达细胞色素P450还原酶的宿主细胞中表达。
32.如权利要求31所述的方法,其中所述P450还原酶是来自甜菊属的CPR。
33.如权利要求31所述的方法,其中所述P450还原酶包含选自SEQ ID NO:62、64、66、68、70、72、74、76、78、80和82的氨基酸序列或其衍生物。
34.如权利要求1-33中任一项所述的方法,其中所述SrKO表达为具有细胞色素P450还原酶配偶体的融合体。
35.如权利要求1-34中任一项所述的方法,其中所述宿主细胞进一步表达生产诺卡酮的醇脱氢酶。
36.如权利要求1-35中任一项所述的方法,其中从水性反应介质中提取氧化油,然后进行分馏。
37.如权利要求36所述的方法,其中使用有机溶剂从水性反应介质中提取所述氧化油。
38.如权利要求36所述的方法,其中通过GC/MS定量测定级分中的倍半萜烯和类倍半萜烯组分,然后混合所述级分以生产所需的含诺卡酮的成分。
39.如权利要求38所述的方法,其中定量测定诺卡酮组分。
40.如权利要求36或37所述的方法,其中回收诺卡酮级分。
41.一种制备含有含氧的倍半萜烯的产品的方法,其包括:将由权利要求1-40中任一项所述的方法制备的含氧的倍半萜烯掺入所述产品中。
42.如权利要求41所述的方法,其中所述产品是风味产品、芳香产品、化妆品、清洁产品、洗涤剂或肥皂或害虫防治产品。
43.如权利要求42所述的方法,其中所述产品是选自饮料、口香糖、糖果或人工风味剂的风味产品。
44.如权利要求42所述的方法,其中所述产品是驱虫剂。
45.一种SrKO衍生物,其包含相对于SEQ ID NO:37、38或55增加朱栾倍半萜氧化酶活性的至少一个突变。
46.如权利要求45所述的SrKO衍生物,其中所述SrKO衍生物缺失其N端跨膜区的至少一部分,并且添加来自大肠杆菌yhcB或其衍生物的内膜跨膜结构域。
47.如权利要求46所述的SrKO衍生物,其中所述SrKO衍生物缺失其N端跨膜结构域的15-35个氨基酸,并且添加来自大肠杆菌yhcB或其衍生物的跨膜结构域的15-25个氨基酸。
48.如权利要求46所述的SrKO衍生物,其中所述N端跨膜结构域包含SEQ ID NO:118的氨基酸序列或相对于SEQ ID NO:118具有1-5个氨基酸突变的氨基酸序列。
49.如权利要求45-48所述的SrKO衍生物,其中所述SrKO衍生物相对于SEQ ID NO:37在选自46、76、94、131、231、284、383、390、400、444、468、488和499的位置具有一个或多个突变。
50.如权利要求49所述的SrKO衍生物,其中所述SrKO衍生物相对于SEQ ID NO:37具有选自R76K、M94V、L107M、T131Q、F231L、H284Q、R383K、I390L、T468I和T499N的一个或多个突变。
51.如权利要求50所述的SrKO衍生物,其中所述SrKO衍生物包含选自SEQ ID NO:55-61、104和105的氨基酸序列。
52.如权利要求50所述的SrKO衍生物,其中所述SrKO衍生物包含相对于选自SEQ IDNO:55-61、104或105的氨基酸序列具有1-20个突变的氨基酸序列,附带条件是所述氨基酸序列相对于SEQ ID NO:37在选自46、76、94、131、231、284、383、390、400、444、468、488和499的位置具有一个或多个突变,或附带条件是所述氨基酸序列相对于SEQ ID NO:37具有选自R76K、M94V、T131Q、F231L、H284Q、R383K、I390L、T468I和T499N的一个或多个突变。
53.如权利要求45-52中任一项所述的SrKO衍生物,其中所述衍生物被分离或部分纯化,或在宿主细胞中异源表达。
54.一种生产含氧的倍半萜烯的宿主细胞,其中所述宿主细胞产生异戊基焦磷酸(IPP);并且所述宿主细胞表达法尼基焦磷酸合成酶、倍半萜烯合成酶和具有朱栾倍半萜氧化酶活性的SrKO或其衍生物。
55.如权利要求54所述的宿主细胞,其中所述SrKO衍生物包含相对于SEQ ID NO:37、38或55增加朱栾倍半萜氧化酶活性的至少一个突变
56.如权利要求54或55所述的宿主细胞,其中所述宿主细胞表达MVA或MEP途径。
57.如权利要求56所述的宿主细胞,其中所述宿主细胞是细菌或酵母。
58.如权利要求57所述的宿主细胞,其中所述宿主细胞是大肠杆菌、枯草芽孢杆菌或恶臭假单胞菌。
59.如权利要求57所述的宿主细胞,其中所述宿主细胞是酵母属、毕赤酵母属或耶氏酵母属的种类,包括酿酒酵母、巴斯德毕赤酵母和解脂耶氏酵母。
60.如权利要求54-59中任一项所述的宿主细胞,其中所述SrKO衍生物相对于SrKO(SEQID NO:37、38或55)具有1-50个独立选自替换、缺失或插入的突变。
61.如权利要求60所述的宿主细胞,其中所述SrKO衍生物相对于SrKO(SEQ ID NO:37、38或55)具有1-40个突变。
62.如权利要求60或61所述的宿主细胞,其中所述SrKO衍生物包含与SrKO(SEQ ID NO:37、38或55)具有至少70%序列同一性的氨基酸序列,并且具有朱栾倍半萜氧化酶活性。
63.如权利要求54-62中任一项所述的宿主细胞,其中相对于SrKO(SEQ ID NO:37或38)的一个或多个突变选自表2或表6。
64.如权利要求63所述的宿主细胞,其中至少2个突变选自表2和/或表6。
65.如权利要求54所述的宿主细胞,其中所述SrKO衍生物缺失其N端跨膜区的至少一部分,并且添加来自大肠杆菌yhcB或其衍生物的内膜跨膜结构域。
66.如权利要求65所述的宿主细胞,其中所述SrKO衍生物缺失其N端跨膜结构域的15-35个氨基酸,并且添加来自大肠杆菌yhcB或其衍生物的跨膜结构域的15-25个氨基酸。
67.如权利要求66所述的宿主细胞,其中所述N端跨膜结构域包含SEQ ID NO:118的氨基酸序列或相对于SEQ ID NO:118具有1-5个氨基酸突变的氨基酸序列。
68.如权利要求54-67中任一项所述的宿主细胞,其中所述SrKO衍生物相对于SEQ IDNO:37在选自46、76、94、131、231、284、383、390、400、444、468、488和499的位置具有一个或多个突变。
69.如权利要求68所述的宿主细胞,其中所述SrKO衍生物相对于SEQ ID NO:37具有选自R76K、M94V、L107M、T131Q、F231L、H284Q、R383K、I390L、T468I和T499N的一个或多个突变。
70.如权利要求68或69所述的宿主细胞,其中所述SrKO衍生物包含选自SEQ ID NO:55-61、104和105的氨基酸序列;或相对于选自SEQ ID NO:55-61、104和105的序列具有1-20个突变的氨基酸序列,附带条件是所述氨基酸序列相对于SEQ ID NO:37在选自46、76、94、131、231、284、383、390、400、444、468、488和499的位置具有一个或多个突变,或附带条件是所述氨基酸序列相对于SEQ ID NO:37具有选自R76K、M94V、T131Q、F231L、H284Q、R383K、I390L、T468I和T499N的一个或多个突变。
71.如权利要求54-70中任一项所述的宿主细胞,其中所述宿主细胞生产诺卡酮。
72.如权利要求71所述的宿主细胞,其中所述宿主细胞是具有至少一个额外拷贝的dxs、ispD、ispF和/或idi基因的大肠杆菌。
73.如权利要求54-72中任一项所述的宿主细胞,其中所述倍半萜烯合成酶是朱栾倍半萜合成酶。
74.如权利要求73所述的宿主细胞,其中所述朱栾倍半萜合成酶是在大肠杆菌中具有增加的表达和/或活性的甜橙或葡萄朱栾倍半萜合成酶或其衍生物。
75.如权利要求74所述的宿主细胞,其中VvVS衍生物相对于VvVS(SEQ ID NO:1)具有1-40个独立选自替换、缺失或插入的突变。
76.如权利要求74所述的宿主细胞,其中CsVS相对于CsVS(SEQ ID NO:12)具有1-40个独立选自替换、缺失或插入的突变。
77.如权利要求75所述的宿主细胞,其中一个或多个突变选自表3。
78.如权利要求54-77中任一项所述的宿主细胞,其中所述SrKO或其衍生物在共表达细胞色素P450还原酶的宿主细胞中表达。
79.如权利要求78所述的宿主细胞,其中所述SrKO表达为具有细胞色素P450还原酶配偶体的融合体。
80.如权利要求54-79中任一项所述的宿主细胞,其中所述P450还原酶是来自甜菊属的CPR。
81.如权利要求54-79中任一项所述的宿主细胞,其中所述P450还原酶包含选自SEQ IDNO:62、64、66、68、70、72、74、76、78、80和82的氨基酸序列或其衍生物。
82.如权利要求54-79中任一项所述的宿主细胞,其中所述宿主细胞进一步表达将诺特卡醇转化为诺卡酮的醇脱氢酶。
83.一种含氧的倍半萜烯产物,其由权利要求1-40中任一项所述的方法获得。
84.一种风味产品或芳香产品,其包含如权利要求83中所限定的含氧的倍半萜烯产物。
85.一种驱虫剂产品,其包含如权利要求83中所限定的含氧的倍半萜烯产物。
86.一种朱栾倍半萜合成酶,其包含在大肠杆菌中具有增加的活性的选自SEQ ID NO:3(Vv1MI)、SEQ ID NO:5(Vv2MI)、SEQ ID NO:7(Vv1M5)、SEQ ID NO:9(Vv2M5)和SEQ ID NO:11(VS2)的氨基酸序列或其衍生物。
87.一种重组核酸分子,其包含核苷酸序列,所述核苷酸序列编码具有增加朱栾倍半萜氧化酶活性的至少一个突变的SrKO衍生物;前导序列,所述前导序列支持在大肠杆菌中的表达和活性;连接子序列;和SrCPR或其衍生物,所述SrCPR或其衍生物足以再生所述SrKO衍生物。
88.一种重组核酸分子,其包含编码权利要求45-52中任一项所述的SrKO衍生物的核苷酸序列。
89.一种分离的核苷酸序列、重组载体或宿主细胞,其包含编码根据权利要求88所述的修饰的SrKO多肽的核苷酸序列。
90.一种权利要求88所述的分离的核苷酸序列,其中所述核苷酸序列在宿主细胞中表达,并且其中所述宿主细胞是大肠杆菌细胞。
91.一种制备根据权利要求45-52所述的修饰的SrKO多肽的方法,其中所述方法包括如下步骤:(i)在允许多肽表达的条件下,培养表达修饰多肽的宿主细胞;和(ii)任选地回收所述多肽。
92.一种生产含氧的倍半萜烯的方法,其包括如下步骤:
(i)提供根据权利要求45或52所述的修饰的SrKO多肽
(ii)在细胞色素P450还原酶的存在下,使倍半萜烯与修饰的SrKO多肽接触,和
(iii)回收生产的含氧的倍半萜烯。
93.根据权利要求92所述的方法,其中回收所述含氧的倍半萜烯作为油。
94.根据权利要求91-93所述的方法,其中所述倍半萜烯是朱栾倍半萜。

Claims (94)

1.一种制备倍半萜烯的含氧产物的方法,其包括使倍半萜烯与具有倍半萜烯氧化活性的甜菊贝壳杉烯氧化酶(SrKO)或其衍生物接触,以及回收所述含氧产物。
2.如权利要求1所述的方法,其中所述SrKO衍生物包含相对于SrKO(SEQ ID NO:37、38或55)具有1-50个独立选自替换、缺失或插入的突变的氨基酸序列。
3.如权利要求2所述的方法,其中所述SrKO衍生物包含相对于SrKO(SEQ ID NO:37、38或55)具有1-40个突变的氨基酸序列。
4.如权利要求1-3中任一项所述的方法,其中所述SrKO衍生物包含与SrKO(SEQ ID NO:37、38或55)具有至少70%序列同一性的氨基酸序列,并且具有朱栾倍半萜氧化酶活性。
5.如权利要求2-4中任一项所述的方法,其中相对于SrKO(SEQ ID NO:37、38或55)的一个或多个突变选自表2或表6。
6.如权利要求5所述的方法,其中至少2个突变选自表2和/或表6。
7.如权利要求2所述的方法,其中所述SrKO是缺失其N端跨膜区的至少一部分且添加来自大肠杆菌yhcB或其衍生物的内膜跨膜结构域的衍生物。
8.如权利要求7所述的方法,其中所述SrKO是缺失其N端跨膜结构域的15-35个氨基酸且添加来自大肠杆菌yhcB或其衍生物的跨膜结构域的15-25个氨基酸的衍生物。
9.如权利要求8所述的方法,其中来自yhcB的所述N端跨膜结构域包含SEQ ID NO:118的氨基酸序列或相对于SEQ ID NO:118具有1-5个氨基酸突变的氨基酸序列。
10.如权利要求1-9中任一项所述的方法,其中所述SrKO是相对于SEQ ID NO:37在选自46、76、94、131、231、284、383、390、400、444、468、488和499的位置具有一个或多个突变的衍生物。
11.如权利要求10所述的方法,其中所述SrKO是相对于SEQ ID NO:37具有选自R76K、M94V、T131Q、F231L、H284Q、R383K、I390L、T468I和T499N的一个或多个突变的衍生物。
12.如权利要求1所述的方法,其中所述SrKO是包含选自SEQ ID NO:55-61、104和105的氨基酸序列或相对于选自SEQ ID NO:55-61、104和105的序列具有1-20个突变的氨基酸序列的衍生物。
13.如权利要求1-12中任一项所述的方法,其中所述倍半萜烯是朱栾倍半萜、大根香叶烯(A、B、C、D或E)、法尼烯、法尼醇、诺特卡醇、广藿香醇、杜松烯、雪松醇、蛇麻烯、长叶烯和/或佛手柑油烯、β-衣兰烯、β-檀香醇、β-檀香烯、α-檀香烯、α-檀香醇、β-香根酮、a-香根酮、客烯醇、红没药烯、β-丁香烯、长叶烯;α-甜橙醛;α-红没药醇、(-)-β-可巴烯、(-)-α-可巴烯、4(Z),7(Z)-癸二烯醛、雪松醇、雪松烯、雪松醇、愈创木醇、(-)-6,9-愈创木二烯、布藜醇、愈创木醇、喇叭烯、喇叭茶醇、香樟烯和α-佛手柑油烯。
14.如权利要求1-13所述的方法,其中主要的倍半萜烯是朱栾倍半萜、大根香叶烯(A、B、C、D或E)、法尼烯、法尼醇、诺特卡醇、广藿香醇、杜松烯、雪松醇、蛇麻烯、长叶烯和/或佛手柑油烯、β-衣兰烯、β-檀香醇、β-檀香烯、α-檀香烯、α-檀香醇、β-香根酮、a-香根酮、客烯醇、红没药烯、β-丁香烯、长叶烯;α-甜橙醛;α-红没药醇、(-)-β-可巴烯、(-)-α-可巴烯、4(Z),7(Z)-癸二烯醛、雪松醇、雪松烯、雪松醇、愈创木醇、(-)-6,9-愈创木二烯、布藜醇、愈创木醇、喇叭烯、喇叭茶醇、香樟烯和α-佛手柑油烯。
15.如权利要求14所述的方法,其中所述主要的倍半萜烯是朱栾倍半萜。
16.如权利要求15所述的方法,其中主要的含氧产物是诺卡酮和任选地诺特卡醇。
17.如权利要求1-16中任一项所述的方法,其中所述接触发生在宿主细胞中,所述宿主细胞是细菌。
18.如权利要求17所述的方法,其中所述宿主细胞是大肠杆菌、枯草芽孢杆菌或恶臭假单胞菌。
19.如权利要求1-16中任一项所述的方法,其中所述接触发生在宿主细胞中,所述宿主细胞是酵母。
20.如权利要求19所述的方法,其中所述宿主细胞是酵母属、毕赤酵母属或耶氏酵母属的种类,包括酿酒酵母、巴斯德毕赤酵母和解脂耶氏酵母。
21.如权利要求17-20中任一项所述的方法,其中所述宿主细胞产生异戊基焦磷酸(IPP)。
22.如权利要求21所述的方法,其中所述IPP经内源性或外源性MEP或MVA途径由代谢通量产生。
23.如权利要求22所述的方法,其中所述倍半萜烯至少部分经MEP途径由代谢通量产生,并且其中所述宿主细胞具有至少一个额外拷贝的dxs、ispD、ispF和/或idi基因。
24.如权利要求17-23中任一项所述的方法,其中所述宿主细胞进一步表达法尼基焦磷酸合成酶。
25.如权利要求17-24中任一项所述的方法,其中所述宿主细胞进一步表达异源倍半萜烯合成酶。
26.如权利要求25所述的方法,其中所述倍半萜烯合成酶是朱栾倍半萜合成酶。
27.如权利要求26所述的方法,其中所述朱栾倍半萜合成酶是甜橙或葡萄朱栾倍半萜合成酶或其衍生物。
28.如权利要求21所述的方法,其中VvVS包含相对于VvVS(SEQ ID NO:1)具有1-40个独立选自替换、缺失或插入的突变的氨基酸序列。
29.如权利要求27所述的方法,其中CsVS包含相对于CsVS(SEQ ID NO:12)具有1-40个独立选自替换、缺失或插入的突变的氨基酸序列。
30.如权利要求28所述的方法,其中一个或多个突变选自表3。
31.如权利要求1-30中任一项所述的方法,其中所述SrKO在共表达细胞色素P450还原酶的宿主细胞中表达。
32.如权利要求31所述的方法,其中所述P450还原酶是来自甜菊属的CPR。
33.如权利要求31所述的方法,其中所述P450还原酶包含选自SEQ ID NO:62、64、66、68、70、72、74、76、78、80和82的氨基酸序列或其衍生物。
34.如权利要求1-33中任一项所述的方法,其中所述SrKO表达为具有细胞色素P450还原酶配偶体的融合体。
35.如权利要求1-34中任一项所述的方法,其中所述宿主细胞进一步表达从诺特卡醇生产诺卡酮的醇脱氢酶。
36.如权利要求1-35中任一项所述的方法,其中从水性反应介质中提取氧化油,然后进行分馏。
37.如权利要求36所述的方法,其中使用有机溶剂从水性反应介质中提取所述氧化油。
38.如权利要求36所述的方法,其中通过GC/MS定量测定级分中的倍半萜烯和类倍半萜烯组分,然后混合所述级分以生产所需的含诺卡酮的成分。
39.如权利要求38所述的方法,其中定量测定诺卡酮组分。
40.如权利要求36或37所述的方法,其中回收诺卡酮级分。
41.一种制备含有含氧的倍半萜烯的产品的方法,其包括:将由权利要求1-40中任一项所述的方法制备的含氧的倍半萜烯掺入所述产品中。
42.如权利要求41所述的方法,其中所述产品是风味产品、芳香产品、化妆品、清洁产品、洗涤剂或肥皂或害虫防治产品。
43.如权利要求42所述的方法,其中所述产品是选自饮料、口香糖、糖果或人工风味剂的风味产品。
44.如权利要求42所述的方法,其中所述产品是驱虫剂。
45.一种SrKO衍生物,其包含相对于SEQ ID NO:37、38或55增加朱栾倍半萜氧化酶活性的至少一个突变。
46.如权利要求45所述的SrKO衍生物,其中所述SrKO衍生物缺失其N端跨膜区的至少一部分,并且添加来自大肠杆菌yhcB或其衍生物的内膜跨膜结构域。
47.如权利要求46所述的SrKO衍生物,其中所述SrKO衍生物缺失其N端跨膜结构域的15-35个氨基酸,并且添加来自大肠杆菌yhcB或其衍生物的跨膜结构域的15-25个氨基酸。
48.如权利要求46所述的SrKO衍生物,其中所述N端跨膜结构域包含SEQ ID NO:118的氨基酸序列或相对于SEQ ID NO:118具有1-5个氨基酸突变的氨基酸序列。
49.如权利要求45-48所述的SrKO衍生物,其中所述SrKO衍生物相对于SEQ ID NO:37在选自46、76、94、131、231、284、383、390、400、444、468、488和499的位置具有一个或多个突变。
50.如权利要求49所述的SrKO衍生物,其中所述SrKO衍生物相对于SEQ ID NO:37具有选自R76K、M94V、L107M、T131Q、F231L、H284Q、R383K、I390L、T468I和T499N的一个或多个突变。
51.如权利要求50所述的SrKO衍生物,其中所述SrKO衍生物包含选自SEQ ID NO:55-61、104和105的氨基酸序列。
52.如权利要求50所述的SrKO衍生物,其中所述SrKO衍生物包含相对于选自SEQ IDNO:55-61、104或105的氨基酸序列具有1-20个突变的氨基酸序列,附带条件是所述氨基酸序列相对于SEQ ID NO:37在选自46、76、94、131、231、284、383、390、400、444、468、488和499的位置具有一个或多个突变,或附带条件是所述氨基酸序列相对于SEQ ID NO:37具有选自R76K、M94V、T131Q、F231L、H284Q、R383K、I390L、T468I和T499N的一个或多个突变。
53.如权利要求45-52中任一项所述的SrKO衍生物,其中所述衍生物被分离或部分纯化,或在宿主细胞中异源表达。
54.一种生产含氧的倍半萜烯的宿主细胞,其中所述宿主细胞产生异戊基焦磷酸(IPP);并且所述宿主细胞表达法尼基焦磷酸合成酶、倍半萜烯合成酶和具有朱栾倍半萜氧化酶活性的SrKO或其衍生物。
55.如权利要求54所述的宿主细胞,其中所述SrKO衍生物包含相对于SEQ ID NO:37、38或55增加朱栾倍半萜氧化酶活性的至少一个突变。
56.如权利要求54或55所述的宿主细胞,其中所述宿主细胞表达MVA或MEP途径。
57.如权利要求56所述的宿主细胞,其中所述宿主细胞是细菌或酵母。
58.如权利要求57所述的宿主细胞,其中所述宿主细胞是大肠杆菌、枯草芽孢杆菌或恶臭假单胞菌。
59.如权利要求57所述的宿主细胞,其中所述宿主细胞是酵母属、毕赤酵母属或耶氏酵母属的种类,包括酿酒酵母、巴斯德毕赤酵母和解脂耶氏酵母。
60.如权利要求54-59中任一项所述的宿主细胞,其中所述SrKO衍生物相对于SrKO(SEQID NO:37、38或55)具有1-50个独立选自替换、缺失或插入的突变。
61.如权利要求60所述的宿主细胞,其中所述SrKO衍生物相对于SrKO(SEQ ID NO:37、38或55)具有1-40个突变。
62.如权利要求60或61所述的宿主细胞,其中所述SrKO衍生物包含与SrKO(SEQ ID NO:37、38或55)具有至少70%序列同一性的氨基酸序列,并且具有朱栾倍半萜氧化酶活性。
63.如权利要求54-62中任一项所述的宿主细胞,其中相对于SrKO(SEQ ID NO:37或38)的一个或多个突变选自表2或表6。
64.如权利要求63所述的宿主细胞,其中至少2个突变选自表2和/或表6。
65.如权利要求54所述的宿主细胞,其中所述SrKO衍生物缺失其N端跨膜区的至少一部分,并且添加来自大肠杆菌yhcB或其衍生物的内膜跨膜结构域。
66.如权利要求65所述的宿主细胞,其中所述SrKO衍生物缺失其N端跨膜结构域的15-35个氨基酸,并且添加来自大肠杆菌yhcB或其衍生物的跨膜结构域的15-25个氨基酸。
67.如权利要求66所述的宿主细胞,其中所述N端跨膜结构域包含SEQ ID NO:118的氨基酸序列或相对于SEQ ID NO:118具有1-5个氨基酸突变的氨基酸序列。
68.如权利要求54-67中任一项所述的宿主细胞,其中所述SrKO衍生物相对于SEQ IDNO:37在选自46、76、94、131、231、284、383、390、400、444、468、488和499的位置具有一个或多个突变。
69.如权利要求68所述的宿主细胞,其中所述SrKO衍生物相对于SEQ ID NO:37具有选自R76K、M94V、L107M、T131Q、F231L、H284Q、R383K、I390L、T468I和T499N的一个或多个突变。
70.如权利要求68或69所述的宿主细胞,其中所述SrKO衍生物包含选自SEQ ID NO:55-61、104和105的氨基酸序列;或相对于选自SEQ ID NO:55-61、104和105的序列具有1-20个突变的氨基酸序列,附带条件是所述氨基酸序列相对于SEQ ID NO:37在选自46、76、94、131、231、284、383、390、400、444、468、488和499的位置具有一个或多个突变,或附带条件是所述氨基酸序列相对于SEQ ID NO:37具有选自R76K、M94V、T131Q、F231L、H284Q、R383K、I390L、T468I和T499N的一个或多个突变。
71.如权利要求54-70中任一项所述的宿主细胞,其中所述宿主细胞生产诺卡酮。
72.如权利要求71所述的宿主细胞,其中所述宿主细胞是具有至少一个额外拷贝的dxs、ispD、ispF和/或idi基因的大肠杆菌。
73.如权利要求54-72中任一项所述的宿主细胞,其中所述倍半萜烯合成酶是朱栾倍半萜合成酶。
74.如权利要求73所述的宿主细胞,其中所述朱栾倍半萜合成酶是在大肠杆菌中具有增加的表达和/或活性的甜橙或葡萄朱栾倍半萜合成酶或其衍生物。
75.如权利要求74所述的宿主细胞,其中VvVS衍生物相对于VvVS(SEQ ID NO:1)具有1-40个独立选自替换、缺失或插入的突变。
76.如权利要求74所述的宿主细胞,其中CsVS相对于CsVS(SEQ ID NO:12)具有1-40个独立选自替换、缺失或插入的突变。
77.如权利要求75所述的宿主细胞,其中一个或多个突变选自表3。
78.如权利要求54-77中任一项所述的宿主细胞,其中所述SrKO或其衍生物在共表达细胞色素P450还原酶的宿主细胞中表达。
79.如权利要求78所述的宿主细胞,其中所述SrKO表达为具有细胞色素P450还原酶配偶体的融合体。
80.如权利要求54-79中任一项所述的宿主细胞,其中所述P450还原酶是来自甜菊属的CPR。
81.如权利要求54-79中任一项所述的宿主细胞,其中所述P450还原酶包含选自SEQ IDNO:62、64、66、68、70、72、74、76、78、80和82的氨基酸序列或其衍生物。
82.如权利要求54-79中任一项所述的宿主细胞,其中所述宿主细胞进一步表达将诺特卡醇转化为诺卡酮的醇脱氢酶。
83.一种含氧的倍半萜烯产物,其由权利要求1-40中任一项所述的方法获得。
84.一种风味产品或芳香产品,其包含如权利要求83中所限定的含氧的倍半萜烯产物。
85.一种驱虫剂产品,其包含如权利要求83中所限定的含氧的倍半萜烯产物。
86.一种朱栾倍半萜合成酶,其包含在大肠杆菌中具有增加的活性的选自SEQ ID NO:3(Vv1MI)、SEQ ID NO:5(Vv2MI)、SEQ ID NO:7(Vv1M5)、SEQ ID NO:9(Vv2M5)和SEQ ID NO:11(VS2)的氨基酸序列或其衍生物。
87.一种重组核酸分子,其包含核苷酸序列,所述核苷酸序列编码具有增加朱栾倍半萜氧化酶活性的至少一个突变的SrKO衍生物;前导序列,所述前导序列支持在大肠杆菌中的表达和活性;连接子序列;和SrCPR或其衍生物,所述SrCPR或其衍生物足以再生所述SrKO衍生物。
88.一种重组核酸分子,其包含编码权利要求45-52中任一项所述的SrKO衍生物的核苷酸序列。
89.一种分离的核苷酸序列、重组载体或宿主细胞,其包含编码根据权利要求88所述的修饰的SrKO多肽的核苷酸序列。
90.一种权利要求88所述的分离的核苷酸序列,其中所述核苷酸序列在宿主细胞中表达,并且其中所述宿主细胞是大肠杆菌细胞。
91.一种制备根据权利要求45-52所述的修饰的SrKO多肽的方法,其中所述方法包括如下步骤:(i)在允许多肽表达的条件下,培养表达修饰多肽的宿主细胞;和(ii)任选地回收所述多肽。
92.一种生产含氧的倍半萜烯的方法,其包括如下步骤:
(i)提供根据权利要求45或52所述的修饰的SrKO多肽
(ii)在细胞色素P450还原酶的存在下,使倍半萜烯与修饰的SrKO多肽接触,和
(iii)回收生产的含氧的倍半萜烯。
93.根据权利要求92所述的方法,其中回收所述含氧的倍半萜烯作为油。
94.根据权利要求91-93所述的方法,其中所述倍半萜烯是朱栾倍半萜。
CN201580057470.4A 2014-08-21 2015-08-21 含氧萜烯的生产方法 Pending CN107002109A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201462040284P 2014-08-21 2014-08-21
US62/040,284 2014-08-21
PCT/US2015/046369 WO2016029153A1 (en) 2014-08-21 2015-08-21 Methods for production of oxygenated terpenes

Publications (1)

Publication Number Publication Date
CN107002109A true CN107002109A (zh) 2017-08-01

Family

ID=54035332

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201580057470.4A Pending CN107002109A (zh) 2014-08-21 2015-08-21 含氧萜烯的生产方法

Country Status (7)

Country Link
US (6) US10501760B2 (zh)
EP (2) EP3183353A2 (zh)
JP (1) JP6735750B2 (zh)
CN (1) CN107002109A (zh)
MY (1) MY185817A (zh)
SG (1) SG11201701278RA (zh)
WO (2) WO2016029153A1 (zh)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107723252A (zh) * 2017-09-22 2018-02-23 天津大学 生产巴伦西亚橘烯和诺卡酮的重组解脂耶氏酵母菌及构建方法
CN109486695A (zh) * 2018-12-26 2019-03-19 浙江医药股份有限公司 用于生产法尼烯的解脂耶氏酵母
CN110923249A (zh) * 2019-10-15 2020-03-27 贵州省烟草科学研究院 烟草CyP71及在调控植物表皮毛发育方面的应用
CN111235046A (zh) * 2020-02-05 2020-06-05 天津大学 异源合成α-檀香烯的重组解脂耶氏酵母及其构建方法
CN111394290A (zh) * 2020-03-26 2020-07-10 北京化工大学 产长叶烯的基因工程菌及其构建方法与应用
CN111527203A (zh) * 2018-01-18 2020-08-11 弗门尼舍有限公司 细胞色素p450单加氧酶催化的倍半萜的氧化
CN112391360A (zh) * 2020-11-04 2021-02-23 江南大学 黄酮3β-羟化酶还原酶辅酶突变体及其应用
CN112513280A (zh) * 2018-05-25 2021-03-16 约翰·英尼斯中心 用于产生单萜类化合物的方法
CN113195726A (zh) * 2018-09-06 2021-07-30 马努斯生物合成股份有限公司 莎草奥酮的微生物生产
CN114480512A (zh) * 2021-12-13 2022-05-13 华南理工大学 氧化还原酶及其突变体在生物合成圆柚酮中的应用

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3183353A2 (en) 2014-08-21 2017-06-28 Givaudan S.A. Process for producing oxygenated products of valencene
ES2959560T3 (es) * 2015-08-21 2024-02-27 Manus Bio Inc Aumento de la productividad de células hospedadoras de E. coli que expresan funcionalmente enzimas P450
BR112019000892A2 (pt) * 2016-07-20 2019-04-30 Firmenich & Cie vetiver
CN109890217A (zh) 2016-09-16 2019-06-14 百事可乐公司 用于改善非营养性甜味剂的味道的组合物和方法
CN108060092B (zh) * 2016-11-04 2021-08-27 中国科学院天津工业生物技术研究所 一种重组菌及其用途
WO2018140778A1 (en) 2017-01-26 2018-08-02 Manus Bio, Inc. Metabolic engineering for microbial production of terpenoid products
EP3577212A4 (en) 2017-02-03 2020-12-16 Manus Bio, Inc. METABOLIC ENGINEERING FOR THE MICROBIAL MANUFACTURING OF TERPENOID PRODUCTS
WO2019059337A1 (ja) * 2017-09-22 2019-03-28 味の素株式会社 ヌートカトンの製造方法
US20210032669A1 (en) * 2018-02-27 2021-02-04 Manus Bio, Inc. Microbial production of triterpenoids including mogrosides
EP3778910A4 (en) 2018-04-06 2022-01-19 Ajinomoto Co., Inc. PROCESS FOR THE PREPARATION OF 13-HYDROXY-9(Z)-OCTADECEONIC ACID
EP3861101A4 (en) * 2018-10-05 2022-07-20 Manus Bio Inc. BIOSYNTHESIS AND RECOVERY OF SECONDARY METABOLITES
CN109913380B (zh) * 2019-03-25 2021-12-10 南京工业大学 生产(-)-α-红没药醇的重组解脂耶氏酵母菌及其构建方法和应用
CN111019852B (zh) * 2019-12-31 2021-11-23 江南大学 一种提高酿酒酵母工程菌株合成葡萄糖二酸效率的方法
CN111154665B (zh) * 2020-01-21 2021-02-02 南京工业大学 一株重组解脂耶罗维亚酵母及其构建方法和应用
US11939618B2 (en) * 2020-03-23 2024-03-26 The Regents Of The University Of California Fusion proteins useful for modifying terpenes
US11932587B2 (en) 2020-10-30 2024-03-19 Givaudan Sa Method of causing conversion of volatile terpene species
EP4237396A1 (en) 2020-10-30 2023-09-06 Givaudan SA Method of mitigating conversion of volatile terpene species
EP4337768A1 (en) * 2021-05-11 2024-03-20 Manus Bio Inc. Enzymes, host cells, and methods for production of rotundone and other terpenoids
CN114181964B (zh) * 2021-11-02 2023-06-09 云南大学 一种表达盒组合、重组载体和重组酿酒酵母及其应用

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1703507A (zh) * 2002-10-04 2005-11-30 弗门尼舍有限公司 倍半萜烯合成酶及其使用方法
WO2006079020A2 (en) * 2005-01-19 2006-07-27 University Of Kentucky Research Foundation Functional identification of the hyoscyamus muticus gene coding for premnaspirodiene hydroxylase activity
CN105121647A (zh) * 2012-11-01 2015-12-02 不列颠哥伦比亚大学 细胞色素p450和细胞色素p450还原酶多肽、编码核酸分子及其用途

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IL121437A (en) * 1995-12-18 2000-10-31 Firmenich & Cie Process for the preparation of nootkatone or nootkatol or of mixtures of these compounds
WO1999025196A1 (fr) 1997-11-17 1999-05-27 Taisho Pharmaceutical Co., Ltd. Insectifuge contre insectes hematophages
GB9825421D0 (en) 1998-11-19 1999-01-13 Isis Innovation Process for oxidising terpenes
US6200786B1 (en) 1999-09-08 2001-03-13 Givaudan S.A. Process for the preparation of nootkatone by laccase catalysis
US6890960B1 (en) * 1999-10-19 2005-05-10 Board Of Supervisors Of Louisiana State University And Agricultural And Mechanical College Vetiver oil extracts as termite repellent and toxicant
US7172886B2 (en) 2001-12-06 2007-02-06 The Regents Of The University Of California Biosynthesis of isopentenyl pyrophosphate
JP4139646B2 (ja) 2002-08-01 2008-08-27 長谷川香料株式会社 香気香味賦与剤
BRPI0314979B1 (pt) 2002-10-04 2019-10-22 Firmenich & Cie método de produção de um valenceno e microorganismo transgênico
US7741089B2 (en) 2003-08-11 2010-06-22 Verenium Corporation Laccases, nucleic acids encoding them and methods for making and using them
DE102004006825A1 (de) 2004-02-11 2005-09-01 Maxens Gmbh Verfahren zur Herstellung von aromaaktiven Terpenen
US8927241B2 (en) 2009-11-10 2015-01-06 Massachusetts Institute Of Technology Microbial engineering for the production of chemical and pharmaceutical products from the isoprenoid pathway
EP2499257A1 (en) 2009-11-10 2012-09-19 Massachusetts Institute of Technology Microbial engineering for the production of chemical and pharmaceutical products from the isoprenoid pathway
JP2013544084A (ja) 2010-10-29 2013-12-12 アリリクス・インコーポレイテッド 改変されたバレンセンシンターゼポリペプチド、コーディング核酸分子およびその使用
WO2012075030A1 (en) 2010-11-30 2012-06-07 Massachusetts Institute Of Technology Microbial production of natural sweeteners, diterpenoid steviol glycosides
KR102021914B1 (ko) 2011-11-09 2019-09-17 아미리스 인코퍼레이티드 아세틸-코엔자임 a로부터 유래되는 이소프레노이드의 제조 방법
KR101559489B1 (ko) 2014-04-08 2015-10-12 한국해양과학기술원 방향타를 이용한 요제어방식 조류발전장치 및 이의 제어방법
EP3183353A2 (en) 2014-08-21 2017-06-28 Givaudan S.A. Process for producing oxygenated products of valencene
CN107109453A (zh) 2014-11-05 2017-08-29 马努斯生物合成股份有限公司 甜菊醇糖苷的微生物产生
ES2959560T3 (es) 2015-08-21 2024-02-27 Manus Bio Inc Aumento de la productividad de células hospedadoras de E. coli que expresan funcionalmente enzimas P450
WO2020051488A1 (en) * 2018-09-06 2020-03-12 Manus Bio, Inc. Microbial production of rotundone

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1703507A (zh) * 2002-10-04 2005-11-30 弗门尼舍有限公司 倍半萜烯合成酶及其使用方法
WO2006079020A2 (en) * 2005-01-19 2006-07-27 University Of Kentucky Research Foundation Functional identification of the hyoscyamus muticus gene coding for premnaspirodiene hydroxylase activity
CN105121647A (zh) * 2012-11-01 2015-12-02 不列颠哥伦比亚大学 细胞色素p450和细胞色素p450还原酶多肽、编码核酸分子及其用途

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
HUMPHREY,T.V.: "ent-kaurene oxidase CYP701A5 [Stevia rebaudiana]", 《NCBI DATABASE》 *
TAMARA WRIESSNEGGER: "Production of the sesquiterpenoid (+)-nootkatone by metabolic engineering of Pichia pastoris", 《METABOLIC ENGINEERING》 *
王佳 等: "倍半萜生物转化的研究进展", 《沈阳药科大学学报》 *

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107723252A (zh) * 2017-09-22 2018-02-23 天津大学 生产巴伦西亚橘烯和诺卡酮的重组解脂耶氏酵母菌及构建方法
CN111527203A (zh) * 2018-01-18 2020-08-11 弗门尼舍有限公司 细胞色素p450单加氧酶催化的倍半萜的氧化
CN112513280A (zh) * 2018-05-25 2021-03-16 约翰·英尼斯中心 用于产生单萜类化合物的方法
CN113195726A (zh) * 2018-09-06 2021-07-30 马努斯生物合成股份有限公司 莎草奥酮的微生物生产
CN109486695A (zh) * 2018-12-26 2019-03-19 浙江医药股份有限公司 用于生产法尼烯的解脂耶氏酵母
CN109486695B (zh) * 2018-12-26 2021-04-06 浙江医药股份有限公司 用于生产法尼烯的解脂耶氏酵母
CN110923249A (zh) * 2019-10-15 2020-03-27 贵州省烟草科学研究院 烟草CyP71及在调控植物表皮毛发育方面的应用
CN111235046A (zh) * 2020-02-05 2020-06-05 天津大学 异源合成α-檀香烯的重组解脂耶氏酵母及其构建方法
CN111394290A (zh) * 2020-03-26 2020-07-10 北京化工大学 产长叶烯的基因工程菌及其构建方法与应用
CN112391360A (zh) * 2020-11-04 2021-02-23 江南大学 黄酮3β-羟化酶还原酶辅酶突变体及其应用
CN112391360B (zh) * 2020-11-04 2022-09-06 江南大学 黄酮3β-羟化酶还原酶突变体及其应用
CN114480512A (zh) * 2021-12-13 2022-05-13 华南理工大学 氧化还原酶及其突变体在生物合成圆柚酮中的应用
WO2023109530A1 (zh) * 2021-12-13 2023-06-22 华南理工大学 氧化还原酶及其突变体在生物合成圆柚酮中的应用

Also Published As

Publication number Publication date
JP6735750B2 (ja) 2020-08-05
EP3183357B1 (en) 2020-03-04
MY185817A (en) 2021-06-10
EP3183353A2 (en) 2017-06-28
WO2016029187A2 (en) 2016-02-25
US20180135081A1 (en) 2018-05-17
SG11201701278RA (en) 2017-03-30
JP2017525395A (ja) 2017-09-07
US20180327789A1 (en) 2018-11-15
US11807890B2 (en) 2023-11-07
US20240141392A1 (en) 2024-05-02
US20200224224A1 (en) 2020-07-16
US20220073955A1 (en) 2022-03-10
US20210292798A1 (en) 2021-09-23
EP3183357A1 (en) 2017-06-28
US11180782B2 (en) 2021-11-23
US10934564B2 (en) 2021-03-02
US10501760B2 (en) 2019-12-10
WO2016029153A1 (en) 2016-02-25
WO2016029187A3 (en) 2016-05-06
EP3183357A4 (en) 2018-07-04
US11952608B2 (en) 2024-04-09

Similar Documents

Publication Publication Date Title
CN107002109A (zh) 含氧萜烯的生产方法
AU2015326892B2 (en) Methods and materials for biosynthesis of mogroside compounds
Netzer et al. Biosynthetic pathway for γ-cyclic sarcinaxanthin in Micrococcus luteus: Heterologous expression and evidence for diverse and multiple catalytic functions of C50 carotenoid cyclases
PT2970934T (pt) Polipèptidos de valenceno sintase, moléculas de ácido nucleico que os codificam e suas utilizações
JP4700617B2 (ja) パッチュリからのセスキテルペン合成酵素
HUE034344T2 (en) Cytochrome P450 and its use for enzymatic oxidation of terpenes
US9714440B2 (en) Method for producing patchoulol and 7-epi-α-selinene
CA2996711C (en) Method of fermentative alpha-ionone production
Liu et al. Systems metabolic engineering of Escherichia coli coculture for de novo production of genistein
CN106987578A (zh) 一种生产koraiol的萜类合酶及其应用
Chen et al. Expanding the structural diversity of terpenes by synthetic biology approaches
Liu et al. Systematic optimization of HPO-CPR to boost (+)-nootkatone synthesis in engineered Saccharomyces cerevisiae
JP5526381B2 (ja) セスキテルペン変換酵素遺伝子及びそれを利用した酸化セスキテルペンの製造方法
JP5457159B2 (ja) 新規セスキテルペン合成酵素遺伝子及びそれを利用したセスキテルペンの製造方法
EP1641931A1 (en) Improved production of coenzyme q-10
JP2023520900A (ja) ゲラニル二リン酸誘導化合物の産生
Mohedano Metabolic Engineering of Saccharomyces Cerevisiae for the Production of Flavonoids
Xu et al. Metabolic Engineering of Candida Tropicalis for the De Novo Synthesis of Β-Ionone
Chappell et al. Sesquiterpene Synthase Gene and Protein

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination