CN115151643A - Biosynthetic platform for producing olivetolic acid and olivetolic acid analogs - Google Patents

Biosynthetic platform for producing olivetolic acid and olivetolic acid analogs

Publication number
CN115151643A
Authority
CN
China
Legal status
Pending
Application number
CN202180016363.2A
Other languages
English (en)
Inventor
唐奕
陈梦玢
I·奥科拉夫
Current Assignee
University of California
Original Assignee
University of California

Classifications

    • C12N15/52 Genes encoding for enzymes or proenzymes
    • C12N15/70 Vectors or expression systems specially adapted for E. coli
    • C12N15/78 Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, for Pseudomonas
    • C12N15/80 Vectors or expression systems specially adapted for eukaryotic hosts, for fungi
    • C12N15/815 Vectors or expression systems specially adapted for eukaryotic hosts, for fungi, for yeasts other than Saccharomyces
    • C12N9/1029 Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
    • C12N9/16 Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/93 Ligases (6)
    • C12P21/02 Preparation of peptides or proteins having a known sequence of two or more amino acids, e.g. glutathione
    • C12P7/22 Preparation of oxygen-containing organic compounds containing a hydroxy group, aromatic
    • C12P7/42 Hydroxy-carboxylic acids
    • C12N2800/102 Plasmid DNA for yeast
    • C12Y203/01 Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
    • C12Y301/02 Thioester hydrolases (3.1.2)

Abstract

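The present disclosure provides biosynthetic platforms for producing olivetolic acid and its analogs at high titers from microorganisms and in cell-free systems.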

Description

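Biosynthetic platform for producing olivetolic acid and olivetolic acid analogs
Cross-reference to related applications
This application claims priority under 35 U.S.C. §119 to Provisional Application Serial No. 62/959,849, filed January 10, 2020, the disclosure of which is incorporated herein by reference.
Statement regarding government support
This invention was made with U.S. government support under Grant No. 1R35GM11805 awarded by the National Institutes of Health. The U.S. government has certain rights in the invention.
Technical field
The present disclosure provides biosynthetic platforms for producing olivetolic acid and its analogs at high titers from microorganisms and in cell-free systems.
Incorporation by reference of the sequence listing
Accompanying this application is a sequence listing entitled "Sequence-Listing_ST25", created on January 7, 2021 and containing 350,174 bytes of data, machine-formatted on an IBM-PC under the MS-Windows operating system. The sequence listing is hereby incorporated by reference in its entirety for all purposes.
Background
Cannabinoids are a large class of plant-derived bioactive natural products that modulate the cannabinoid receptors (CB1 and CB2) of the human endocannabinoid system as well as other biological systems. Cannabinoids are promising drugs: more than 100 clinical trials are under way to study their therapeutic benefit as anticancer agents, antiemetics, anticonvulsants, analgesics, and antidepressants. In addition, three cannabinoid therapies have been approved by the FDA to treat chemotherapy-induced nausea, MS spasticity, and seizures associated with severe epilepsy. Despite the important role cannabinoids play in medicine, their low abundance in the native plant and the legal scheduling of cannabis have hindered in-depth studies of cannabinoid biology, and thereby broader medical application.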
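Summary
The present disclosure provides a biosynthetic platform comprising a series of enzymes that produce olivetolic acid and its analogs from simpler metabolites, the series of enzymes comprising: a non-reducing polyketide synthase (NRPKS) that converts a group of metabolites including hexanoyl-CoA, hexanoic acid, octanoyl-CoA, octanoic acid, and/or analogs thereof into an aromatic diol metabolite; and a thioesterase (TE) that converts the aromatic diol metabolite into olivetolic acid and its analogs. In one embodiment, the platform further comprises a highly reducing polyketide synthase (HRPKS) that uses acetyl-CoA, malonyl-CoA, and NADPH to synthesize the group of metabolites selected from hexanoyl-CoA, hexanoic acid, octanoyl-CoA, octanoic acid, and/or analogs thereof (e.g., butyric acid, hexenoic acid, octenoic acid, decanoic acid, decenoic acid, lauric acid, nonanoic acid, and the related CoA equivalents). In one embodiment, the analogs of hexanoyl-CoA, hexanoic acid, octanoyl-CoA, and octanoic acid differ in the C6 or C8 acyl chain. In another or further embodiment, the analogs of olivetolic acid include 2-heptyl-4,6-dihydroxybenzoic acid, (E)-2-(hept-1-en-1-yl)-4,6-dihydroxybenzoic acid, and (E)-2,4-dihydroxy-6-(pent-1-en-1-yl)benzoic acid. In yet another or further embodiment, one or more of the enzymes making up the biosynthetic platform are from a fungus. In yet another embodiment of any of the foregoing, the series of enzymes making up the biosynthetic platform is from Metarhizium anisopliae. In still another embodiment of any of the foregoing, the NRPKS has a sequence that is at least 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% identical to the sequence of SEQ ID NO:4. In a further embodiment, the NRPKS has a sequence that is at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:4 and contains 1 to 20 conservative amino acid substitutions. In still a further embodiment, the NRPKS comprises the sequence of SEQ ID NO:4. In another embodiment of any of the foregoing, the TE has a sequence that is at least 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% identical to the sequence of SEQ ID NO:6. In a further embodiment, the TE has a sequence that is at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:6 and contains 1 to 20 conservative amino acid substitutions. In yet a further embodiment, the TE comprises the sequence of SEQ ID NO:6. In still another embodiment of any of the foregoing embodiments, the HRPKS has a sequence that is at least 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% identical to the sequence of SEQ ID NO:2. In a further embodiment, the HRPKS has a sequence that is at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:2 and contains 1 to 20 conservative amino acid substitutions. In still a further embodiment, the HRPKS comprises the sequence of SEQ ID NO:2.
The disclosure also provides a linear expression template (LET) for expressing the biosynthetic platform in a cell-free system, the LET comprising polynucleotide sequences encoding the series of enzymes that make up the biosynthetic platform of any of the foregoing embodiments. In yet another embodiment, the LET comprises: a polynucleotide sequence encoding a polypeptide that has HRPKS activity and a sequence at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:2; a polynucleotide sequence encoding a polypeptide that has NRPKS activity and a sequence at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:4; and a polynucleotide sequence encoding a polypeptide that has TE activity and a sequence at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:6.
The disclosure also provides one or more plasmids or one or more vectors comprising polynucleotide sequences encoding the series of enzymes that make up the biosynthetic platform as described herein. In one embodiment, a first plasmid comprises a polynucleotide sequence encoding a polypeptide that has HRPKS activity and a sequence at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:2; a second plasmid comprises a polynucleotide sequence encoding a polypeptide that has NRPKS activity and a sequence at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:4; and a third plasmid comprises a polynucleotide sequence encoding a polypeptide that has TE activity and a sequence at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:6.
The disclosure also provides a recombinant microorganism comprising one or more plasmids or one or more vectors of the disclosure. In one embodiment, the recombinant microorganism is a bacterium, an archaeon, or a fungus. In a further embodiment, the recombinant microorganism is a bacterium selected from: Escherichia coli, Rhodobacter sphaeroides, Pseudoalteromonas haloplanktis, Shewanella sp. strain Ac10, Pseudomonas fluorescens, Pseudomonas putida, Pseudomonas aeruginosa, Halomonas elongata, Chromohalobacter salexigens, Streptomyces lividans, Streptomyces griseus, Nocardia lactamdurans, Mycobacterium smegmatis, Corynebacterium glutamicum, Corynebacterium ammoniagenes, Brevibacterium lactofermentum, Bacillus subtilis, Bacillus brevis, Bacillus megaterium, Bacillus licheniformis, Bacillus amyloliquefaciens, Lactococcus lactis, Lactobacillus plantarum, Lactobacillus casei, Lactobacillus reuteri, and Lactobacillus gasseri. In another embodiment, the recombinant microorganism is of the genus Escherichia or is Pseudomonas putida. In still another embodiment, the recombinant microorganism is a fungus selected from: Saccharomyces cerevisiae, Kluyveromyces lactis, Pichia pastoris, Hansenula polymorpha, Yarrowia lipolytica, Aspergillus nidulans, Trichoderma reesei, Fusarium oxysporum, Phanerochaete chrysosporium, Ashbya gossypii, A. oryzae, and Chrysosporium lucknowense. In another embodiment, the recombinant microorganism is Aspergillus nidulans or Saccharomyces cerevisiae.
The disclosure also provides a method of producing olivetolic acid and its analogs, the method comprising culturing a recombinant microorganism of the disclosure. In one embodiment, the method further comprises isolating and purifying the olivetolic acid and its analogs.
For example, in the studies presented herein, a biosynthetic platform or cluster derived from, e.g., the fungus Metarhizium anisopliae provides olivetolic acid and its analogs in high yield. The biosynthetic platform comprises: (1) a highly reducing polyketide synthase (HRPKS), (2) a non-reducing polyketide synthase (NRPKS), and (3) a thioesterase (TE). The HRPKS uses acetyl-CoA, malonyl-CoA, and NADPH to synthesize a C6 or C8 acyl chain tethered to an acyl carrier protein (ACP). The ACP domain of the HRPKS shuttles the acyl thioester to the starter unit:ACP transacylase (SAT) domain of the NRPKS. After three decarboxylative condensations catalyzed by the ketosynthase (KS) domain and aromatization catalyzed by the product template (PT) domain, the TE hydrolyzes the product from the NRPKS so that the next catalytic cycle can begin. Because of the synthetic capability of the HRPKS and the relaxed substrate selectivity of the NRPKS SAT domain, olivetolic acid is obtained at high titer together with three analogs that differ in acyl chain length and degree of saturation. The above enzymes were heterologously expressed in Aspergillus nidulans, and olivetolic acid and its analogs were obtained at a combined titer of >4 g/L without any metabolic optimization. In practice, the biosynthetic platform can also be expressed in other microbial systems, including E. coli and yeast, to produce olivetolic acid and its analogs.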
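The chain-length bookkeeping in the preceding paragraph can be checked with simple arithmetic. The following is a minimal sketch, not part of the disclosure: it assumes only that each decarboxylative condensation with malonyl-CoA adds two carbons to the starter acyl chain. The function name `polyketide_carbons` is hypothetical. A C6 starter gives a C12 product, matching olivetolic acid (a pentyl side chain plus the seven carbons of the dihydroxybenzoic acid core), and a C8 starter gives a C14 product, matching 2-heptyl-4,6-dihydroxybenzoic acid.

```python
def polyketide_carbons(starter_carbons: int, extensions: int = 3) -> int:
    """Carbon count of the polyketide product.

    Assumption for illustration: every decarboxylative condensation with
    malonyl-CoA extends the chain by exactly two carbons (the third carbon
    of malonate is lost as CO2).
    """
    if starter_carbons < 1 or extensions < 0:
        raise ValueError("starter_carbons must be >= 1 and extensions >= 0")
    return starter_carbons + 2 * extensions


# Starter units named in the text: C6 (hexanoyl) and C8 (octanoyl).
for starter in (6, 8):
    print(f"C{starter} starter -> C{polyketide_carbons(starter)} product")
```

The default of three extensions mirrors the three KS-catalyzed condensations described above; other values are accepted only to make the arithmetic explicit.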
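Brief description of the drawings
Figure 1 shows that olivetolic acid plays a central role in cannabinoid biosynthesis.
Figures 2A-B provide an embodiment of the biosynthetic pathway of the disclosure for producing olivetolic acid and its analogs. (A) The biosynthetic pathway of the disclosure, comprising OVA from Metarhizium anisopliae ARSEF23. (B) Heterologous expression of the biosynthetic pathway provides olivetolic acid and its analogs.
Figure 3 provides liquid chromatography (LC) traces showing production of olivetolic acid and its analogs from liquid flask cultures.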
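Detailed description
As used herein and in the appended claims, the singular forms "a," "an," and "the" include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to "a polyketide synthase" includes a plurality of such polyketide synthases, and reference to "the cannabinoid intermediate" includes reference to one or more cannabinoid intermediates and equivalents thereof known to those skilled in the art, and so forth.
Also, the use of "or" means "and/or" unless stated otherwise. Similarly, "comprise," "comprises," "comprising," "include," "includes," and "including" are interchangeable and are not intended to be limiting.
It is to be further understood that where descriptions of various embodiments use the term "comprising," those skilled in the art will understand that in some specific instances an embodiment can alternatively be described using the language "consisting essentially of" or "consisting of."
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. Although many methods and reagents are similar or equivalent to those described herein, exemplary methods and materials are disclosed herein.
All publications mentioned herein are incorporated herein by reference in full for the purpose of describing and disclosing methodologies that might be used in connection with the description herein. Moreover, with respect to any term presented in one or more publications that is similar to, or identical with, a term that has been expressly defined in this disclosure, the definition of the term as expressly provided in this disclosure will control in all respects.
It should be understood that this invention is not limited to the particular methodology, protocols, and reagents described herein, and as such may vary. The terminology used herein is for the purpose of describing particular embodiments only and is not intended to limit the scope of the invention, which is defined solely by the claims.
Other than in the operating examples, or where otherwise indicated, all numbers expressing quantities of ingredients or reaction conditions used herein should be understood as modified in all instances by the term "about." The term "about," when used in connection with percentages to describe the invention, means ±1%.
As used herein, the "activity" of an enzyme is a measure of its ability to catalyze a reaction that produces a metabolite (i.e., to "function"), and may be expressed as the rate at which the metabolite of the reaction is produced. For example, enzyme activity can be expressed as the amount of metabolite produced per unit of time or per unit of enzyme (e.g., concentration or weight), or in terms of affinity or dissociation constants.
The term "biosynthetic pathway" refers to a multi-step, enzyme-catalyzed process in which a substrate is converted into a more complex product or is degraded in a stepwise manner. The prerequisite elements of a biosynthetic pathway typically include: precursor compounds (substrates), optionally chemical energy (e.g., ATP), and catalytic enzymes that may require coenzymes (e.g., NADH, NADPH). The disclosure provides biosynthetic pathways that produce olivetolic acid and olivetolic acid analogs from simpler precursor compounds such as acetyl-CoA and malonyl-CoA. The disclosure also provides recombinant microorganisms that express the biosynthetic pathways disclosed herein for producing olivetolic acid and olivetolic acid analogs. In one particular embodiment, a biosynthetic pathway disclosed herein comprises one or more polyketide synthases. In a further embodiment, a biosynthetic pathway disclosed herein comprises one or more thioesterases. In certain embodiments, an engineered microorganism comprising a biosynthetic pathway of the disclosure comprises at least one enzyme, selected from a polyketide synthase or a thioesterase, that is heterologous to the engineered microorganism.
"Enzyme" means any substance, typically composed wholly or largely of the amino acids that make up a protein or polypeptide, that catalyzes or promotes, more or less specifically, one or more chemical or biochemical reactions.
The term "expression" with respect to a gene or polynucleotide refers to transcription of the gene or polynucleotide and, as appropriate, translation of the resulting mRNA transcript into a protein or polypeptide. Thus, as will be clear from the context, expression of a protein or polypeptide results from transcription and translation of the open reading frame.
"Metabolite" refers to any substance produced by metabolism, or a substance necessary for or taking part in a particular metabolic process that gives rise to a desired metabolite, chemical, alcohol, polyketide, or the like. A metabolite can be an organic compound that is a starting material (e.g., a carbohydrate, a sugar phosphate, pyruvate), an intermediate (e.g., acetyl-CoA), or an end product (e.g., olivetolic acid) of metabolism. Metabolites can be used to construct more complex molecules, or they can be broken down into simpler ones. Intermediate metabolites may be synthesized from other metabolites, perhaps used to make more complex substances, or broken down into simpler compounds, sometimes with the release of chemical energy.
As used herein, the term "metabolically engineered" or "metabolic engineering" involves rational pathway design and assembly of biosynthetic genes, genes associated with operons, and control elements of polynucleotides, for the production of a desired metabolite, such as hexanoyl-CoA or an end product (such as olivetolic acid or its analogs), in a microorganism or in a cell-free system. Rational pathway design and assembly of a biosynthetic pathway can include cofactors for the production of the desired metabolite. "Metabolic engineering" can further include optimization of metabolic flux by regulating and optimizing transcription, translation, protein stability, and protein functionality using genetic engineering and appropriate culture conditions, including the reduction, disruption, or knockout of competing metabolic pathways that compete for an intermediate leading to the desired pathway. For example, in a cell-free system, the host cell that expresses one or more of the enzymes used in the cell-free system can be further engineered to eliminate or remove competing pathway enzymes, thereby removing contaminants or enzymes that may be present in a disrupted or cell-free preparation.
A biosynthetic gene can be heterologous to the host microorganism, either because it is foreign to the host, or because it has been modified by mutagenesis, recombination, and/or association with a heterologous expression control sequence in an endogenous host cell. In one embodiment, where the polynucleotide is xenogenic to the host organism, the polynucleotide can be codon optimized.
The terms "polynucleotide," "nucleic acid," or "recombinant nucleic acid" refer to polynucleotides such as deoxyribonucleic acid (DNA) and, where appropriate, ribonucleic acid (RNA). It will be recognized that, unless expressly indicated otherwise, any sequence that includes "T" can be modified by replacing "T" with "U."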
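As a small illustration of the T-to-U convention stated above, the sketch below rewrites a DNA-alphabet sequence in the RNA alphabet. The function name `dna_to_rna` and the accepted alphabet (A, C, G, T, plus N for an unspecified base) are assumptions made for this example and do not come from the sequence listing.

```python
def dna_to_rna(sequence: str) -> str:
    """Rewrite a DNA-alphabet sequence in the RNA alphabet (T -> U).

    Illustrative helper only: it changes notation and does not model
    transcription of a particular strand.
    """
    seq = sequence.upper()
    unexpected = set(seq) - set("ACGTN")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return seq.replace("T", "U")


print(dna_to_rna("ATGTAA"))
```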
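A "protein" or "polypeptide" (the terms are used interchangeably herein) comprises one or more chains of chemical building blocks called amino acids that are linked together by chemical bonds called peptide bonds. A protein or polypeptide can function as an enzyme.
The terms "recombinant microorganism" and "recombinant host cell" are used interchangeably herein and refer to microorganisms that have been genetically modified to express a heterologous polynucleotide, to over-express an endogenous polynucleotide, or to express a non-expressed endogenous polynucleotide. The polynucleotide generally encodes a target enzyme involved in a metabolic pathway for producing a desired metabolite as described herein, but may also include protein factors necessary for regulation, activity, or transcription. Accordingly, the recombinant microorganisms described herein have been genetically engineered to express or over-express target enzymes not previously expressed or over-expressed by the parental microorganism. It is understood that the terms "recombinant microorganism" and "recombinant host cell" refer not only to the particular recombinant microorganism but also to the progeny or potential progeny of such a microorganism. It should also be understood that a recombinant microorganism can be used as a source of a polypeptide, and that the recombinant microorganism need not have the complete pathway for producing the desired metabolite. Rather, a plurality of recombinant microorganisms, each having one or more but not all of the polypeptides of a metabolic pathway, can be co-cultured to produce the desired metabolite, or can be disrupted and used in a cell-free environment, or the expressed polypeptides can be isolated from each of the recombinant microorganisms.
The term "substrate" or "suitable substrate" refers to any substance or compound that is converted, or meant to be converted, into another compound by the action of an enzyme. The term includes not only a single compound but also combinations of compounds, such as solutions, mixtures, and other materials that contain at least one substrate or a derivative thereof. Further, the term "substrate" encompasses not only compounds that provide a carbon source suitable for use as a starting material, but also intermediate and end-product metabolites used in a pathway as described herein. In addition, a substrate can be an oxidized or reduced cofactor, or a factor that is phosphorylated or dephosphorylated.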
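Because of the structural complexity of cannabinoids, scalable chemical synthesis of cannabinoids remains a challenging task. Nevertheless, given the unprecedented demand for cannabinoids as a treatment option for many disorders, sustainable methods for obtaining cannabinoids at high titers would be of great utility. Cannabinoids are derived from a combination of fatty acid, polyketide, and terpene biosynthetic pathways, which generate the key building blocks geranyl pyrophosphate (GPP) and olivetolic acid (OA). The bottleneck in microbial fermentation of cannabinoids is olivetolic acid (OA), the core intermediate that connects simple building blocks with the complex late-stage compounds (see, e.g., Figure 1). To date, a number of fermentation-based processes for synthesizing olivetolic acid have been developed. For example, it has been proposed to construct olivetolic acid starting from hexanoyl-CoA through the tandem action of a polyketide acyl-CoA thiolase and olivetolic acid cyclase (OAC). Other proposals have suggested that certain geranyltransferases can be used to convert olivetolic acid into cannabigerolic acid. Both proposals rely on the tandem use of two enzymes, tetraketide synthase (TKS) and olivetolic acid cyclase (OAC), both of which are derived from Cannabis sativa. The drawbacks of such a strategy are twofold: (i) the availability of the starter unit hexanoyl-CoA is typically low in microbial hosts, which limits the final yield of cannabinoids; and (ii) the tandem use of TKS and OAC provides only one product, olivetolic acid, whereas producing additional cannabinoid analogs would be particularly useful for determining structure/activity relationships for small changes to the cannabinoid scaffold.
The disclosure provides a technical solution to the above problems by providing a biosynthetic platform/cluster that can generate hexanoyl-CoA in situ, and it further exhibits product flexibility by producing not only olivetolic acid (OA) but also its close structural analogs. Moreover, the biosynthetic pathways described herein can use an OA synthase and an OA cyclase derived from non-plant organisms, thereby facilitating production of OA and OA analogs from microbial production systems.
In one particular embodiment, the disclosure provides a cell-free biosystem for producing olivetolic acid and its analogs, the cell-free biosystem comprising the biosynthetic platform disclosed herein. Microbial systems can be hampered by a variety of technical challenges that make cost competitiveness difficult to achieve, including low yields due to competing pathways; low productivity caused by slow growth rates or difficulties in pathway optimization; contaminating microbial growth; product toxicity; and expensive product isolation. In contrast, cell-free biosystems can avoid many of these problems. For example, cell-free biosystems have several advantages suitable for industrial applications: a higher level of flexibility in pathway design; better control over component optimization; more rapid design-build-test cycles; and freedom from the cellular toxicity of intermediates or products. In vitro biosystems can carry out biological reactions that living microorganisms or chemical catalysts cannot. Enzymatic systems, which lack the barrier of a cell membrane, usually have faster reaction rates than microbial systems. For example, enzymatic fuel cells usually have much higher power outputs than microbial fuel cells. Enzyme mixtures also tolerate toxic compounds better than microorganisms do. Enzyme mixtures usually function under a broad range of reaction conditions, such as high temperature, low pH, or the presence of organic solvents or ionic liquids. Building a single dedicated pathway in vitro can eliminate the side reactions that occur in cells, so that nearly 100% yields and fast reaction times are possible.
Common components of a cell-free biosystem include a cell extract, an energy source, a supply of amino acids, cofactors (such as magnesium), and DNA carrying the desired genes. The cell extract is obtained by lysing the cells of interest and centrifuging out the cell walls, genomic DNA, and other debris. What remains is the necessary cell machinery, including ribosomes, aminoacyl-tRNA synthetases, translation initiation and elongation factors, nucleases, and the like.
Two types of DNA are commonly used in cell-free biosystems: plasmids and linear expression templates (LETs). Plasmids are circular and are made only inside cells. LETs can be made much more efficiently via PCR, which replicates DNA far faster than growing cells in an incubator. Although LETs are easier and faster to make, plasmid yields in cell-free preparations are usually much higher. For this reason, much research has focused on optimizing LET yields in cell-free preparations so that they approach the yields obtained with plasmids. An energy source is usually part of a cell-free reaction. Typically, a separate mixture containing the required energy source, together with a supply of amino acids, is added to the extract for the reaction. Common sources are phosphoenolpyruvate, acetyl phosphate, and creatine phosphate.
In one embodiment, the disclosure provides for the use of a plasmid-based cell-free biosystem that comprises the biosynthetic platform disclosed herein for producing olivetolic acid and analogs. The plasmids or vectors used for such a system can be the same as the vectors used in the examples presented below, or can be the constructs, described more fully below, that comprise polynucleotides encoding the HRPKS, NRPKS, and ΨACP-TE polypeptides. In another embodiment, the cell-free system is produced from cell-free extracts. In this embodiment, the various enzymes of the biosynthetic platform of the disclosure (e.g., HRPKS, NRPKS, and ΨACP-TE) are expressed in a microorganism, extracted, and used as a crude extract in the cell-free biosystem. Alternatively, the various enzymes can be further purified before use in the cell-free biosystem. Many techniques known in the art can be used to purify the enzymes of the biosystems disclosed herein, including affinity chromatography (e.g., metal binding, immunoaffinity, protein tags); electrophoresis; HPLC- and LC-based chromatographic methods (e.g., size-exclusion chromatography, ion-exchange chromatography, reversed-phase chromatography, cation-exchange chromatography); filtration techniques (e.g., gel filtration); immunoblotting; and centrifugation.
The disclosure provides for the use of a LET-based cell-free biosystem that comprises the biosynthetic platform disclosed herein for producing olivetolic acid and analogs. Linear expression templates can be produced rapidly by PCR, allowing quick and easy screening of multiple constructs. Expression vectors allow protein production to be scaled up. The gene of interest is either inserted directly into an expression vector, or a linear template is first produced by PCR and subsequently cloned. The protein yields achieved are sufficient for extended functional and structural analyses, or for producing labeled proteins for NMR spectroscopy or X-ray crystallography studies. LET kits are commercially available from various suppliers (such as biotechrabbit).
The disclosure further provides engineered microorganisms that comprise and/or are used to express the biosynthetic platform of the disclosure.
The term "microorganism" includes prokaryotic and eukaryotic microbial species from the domains Archaea, Bacteria, and Eukarya, the latter including yeast and filamentous fungi, protozoa, algae, or higher protists. The terms "microbial cell" and "microbe" are used interchangeably with the term microorganism.
The term "prokaryote" is art recognized and refers to cells that contain no nucleus or other cell organelles. Prokaryotes are generally classified into one of two domains, the Bacteria and the Archaea. The definitive difference between organisms of the Archaea and Bacteria domains is based on fundamental differences in the nucleotide base sequence of the 16S ribosomal RNA.
The term "Archaea" refers to a categorization of organisms of the division Mendosicutes, typically found in unusual environments and distinguished from the rest of the prokaryotes by several criteria, including the number of ribosomal proteins and the lack of muramic acid in cell walls. On the basis of ssrRNA analysis, the Archaea consist of two phylogenetically distinct groups: Crenarchaeota and Euryarchaeota. On the basis of their physiology, the Archaea can be organized into three types: methanogens (prokaryotes that produce methane); extreme halophiles (prokaryotes that live at very high concentrations of salt ([NaCl])); and extreme (hyper)thermophiles (prokaryotes that live at very high temperatures). Besides the unifying archaeal features that distinguish them from Bacteria (i.e., no murein in the cell wall, ester-linked membrane lipids, etc.), these prokaryotes exhibit unique structural or biochemical attributes that adapt them to their particular habitats. The Crenarchaeota consist mainly of hyperthermophilic sulfur-dependent prokaryotes, and the Euryarchaeota contain the methanogens and extreme halophiles.
"Bacteria," or "eubacteria," refers to a domain of prokaryotic organisms. Bacteria include at least 11 distinct groups as follows: (1) Gram-positive (gram+) bacteria, of which there are two major subdivisions: (a) the high G+C group (Actinomycetes, Mycobacteria, Micrococcus, and others) and (b) the low G+C group (Bacillus, Clostridia, Lactobacillus, Staphylococci, Streptococci, Mycoplasmas); (2) Proteobacteria, e.g., purple photosynthetic and non-photosynthetic Gram-negative bacteria (including most "common" Gram-negative bacteria); (3) Cyanobacteria, e.g., oxygenic phototrophs; (4) Spirochetes and related species; (5) Planctomyces; (6) Bacteroides, Flavobacteria; (7) Chlamydia; (8) green sulfur bacteria; (9) green non-sulfur bacteria (also anaerobic phototrophs); (10) radioresistant micrococci and relatives; and (11) Thermotoga and Thermosipho thermophiles.
"Gram-negative bacteria" include cocci, nonenteric rods, and enteric rods. The genera of Gram-negative bacteria include, for example, Neisseria, Spirillum, Pasteurella, Brucella, Yersinia, Francisella, Haemophilus, Bordetella, Escherichia, Salmonella, Shigella, Klebsiella, Proteus, Vibrio, Pseudomonas, Bacteroides, Acetobacter, Aerobacter, Agrobacterium, Azotobacter, Spirilla, Serratia, Vibrio, Rhizobium, Chlamydia, Rickettsia, Treponema, and Fusobacterium.
"Gram-positive bacteria" include cocci, nonsporulating rods, and sporulating rods. The genera of Gram-positive bacteria include, for example, Actinomyces, Bacillus, Clostridium, Corynebacterium, Erysipelothrix, Lactobacillus, Listeria, Mycobacterium, Myxococcus, Nocardia, Staphylococcus, Streptococcus, and Streptomyces.
Accordingly, the disclosure provides "engineered" or "modified" microorganisms that are produced by introducing genetic material into a chosen host or parental microorganism, thereby modifying or altering the cellular physiology and biochemistry of the microorganism. Through the introduction of genetic material, the parental microorganism acquires new properties, e.g., the ability to produce a new, or greater quantities of an, intracellular metabolite. The genetic material introduced into the parental microorganism contains one or more genes, or parts of one or more genes, coding for one or more of the enzymes involved in the biosynthetic platform, including one or more genes, or parts of one or more genes, coding for one or more of the enzymes involved in producing olivetolic acid and its analogs, and may also include additional elements for the expression and/or regulation of expression of these genes, e.g., promoter sequences. Microorganisms that can be engineered to express the biosynthetic pathways disclosed herein include bacteria, archaea, algae, and fungi. Examples of suitable bacteria that can be engineered to express the biosynthetic pathways disclosed herein include Escherichia coli, Rhodobacter sphaeroides, Pseudoalteromonas haloplanktis, Shewanella sp. strain Ac10, Pseudomonas fluorescens, Pseudomonas putida, Pseudomonas aeruginosa, Halomonas elongata, Chromohalobacter salexigens, Streptomyces lividans, Streptomyces griseus, Nocardia lactamdurans, Mycobacterium smegmatis, Corynebacterium glutamicum, Corynebacterium ammoniagenes, Brevibacterium lactofermentum, Bacillus subtilis, Bacillus brevis, Bacillus megaterium, Bacillus licheniformis, Bacillus amyloliquefaciens, Lactococcus lactis, Lactobacillus plantarum, Lactobacillus casei, Lactobacillus reuteri, and Lactobacillus gasseri. Examples of suitable archaea that can be engineered to express the biosynthetic pathways disclosed herein include Methanocaldococcus (Methanococcus) jannaschii and Sulfolobus solfataricus. Examples of suitable fungi that can be engineered to express the biosynthetic pathways disclosed herein include Saccharomyces cerevisiae, Kluyveromyces lactis, Pichia pastoris, Hansenula polymorpha, Yarrowia lipolytica, Aspergillus nidulans, Trichoderma reesei, Fusarium oxysporum, Phanerochaete chrysosporium, Ashbya gossypii, Aspergillus oryzae, and Chrysosporium lucknowense.
As an alternative to, or in addition to, introducing genetic material into a host or parental microorganism, an engineered or modified microorganism can also include the disruption, deletion, or knockout of a gene or polynucleotide to alter the cellular physiology and biochemistry of the microorganism. Through the reduction, disruption, or knockout of a gene or polynucleotide, the microorganism acquires new or improved properties (e.g., the ability to produce a new, or greater quantities of an, intracellular metabolite, to improve the flux of a metabolite down a desired pathway, and/or to reduce the production of undesirable by-products). For example, it may be desirable to engineer an organism to express a desired set of enzymes in a metabolic pathway while eliminating the enzymes of competing pathways. This engineering can be applicable in vitro (where, upon disruption or purification, undesirable enzymes are absent) or in vivo.
A "native" or "wild-type" protein, enzyme, polynucleotide, gene, or cell means a protein, enzyme, polynucleotide, gene, or cell that occurs in nature.
A "parental microorganism" refers to a cell used to generate a recombinant microorganism. In one embodiment, the term "parental microorganism" describes a cell that occurs in nature, i.e., a "wild-type" cell that has not been genetically modified. The term "parental microorganism" further describes a cell that serves as the "parent" for further engineering. In this latter embodiment, the cell may already have been genetically engineered but serves as a source for further genetic engineering.
For example, a wild-type microorganism can be genetically modified to express or over-express a first target enzyme, such as an HRPKS. This microorganism can act as a parental microorganism in the generation of a microorganism modified to express or over-express a second target enzyme. As used herein, "express" or "over-express" refers to the phenotypic expression of a desired gene product. In one embodiment, a naturally occurring gene in the organism can be engineered such that it is linked to a heterologous promoter or regulatory domain, wherein the regulatory domain causes expression of the gene, thereby modifying its normal expression relative to the wild-type organism. Alternatively, the organism can be engineered to remove or reduce a repressor function on the gene, thereby modifying its expression. In yet another embodiment, a cassette comprising the gene sequence operably linked to a desired expression control/regulatory element is engineered into the microorganism.
Accordingly, a parental microorganism functions as a reference cell for successive genetic modification events. Each modification event can be accomplished by introducing one or more nucleic acid molecules into the reference cell. The introduction facilitates the expression or over-expression of one or more target enzymes, or the reduction or elimination of one or more target enzymes. It is understood that the term "facilitates" encompasses the activation of endogenous polynucleotides encoding a target enzyme through genetic modification of, e.g., a promoter sequence in the parental microorganism. It is further understood that the term "facilitates" encompasses the introduction of exogenous polynucleotides encoding a target enzyme into the parental microorganism.
Polynucleotides that encode enzymes useful for producing olivetolic acid and its analogs (including homologs, variants, fragments, related fusion proteins, or functional equivalents thereof) are used in recombinant nucleic acid molecules that direct the expression of such polypeptides in appropriate host cells, such as bacterial or yeast cells.
It is understood that the polynucleotides described herein include "genes" and that the nucleic acid molecules described above include "vectors" or "plasmids." Accordingly, the term "gene," also called a "structural gene," refers to a polynucleotide that codes for a particular polypeptide comprising a sequence of amino acids, which comprises all or part of one or more proteins or enzymes, and may include regulatory (non-transcribed) DNA sequences, such as promoter regions or expression control elements, which determine, for example, the conditions under which the gene is expressed. The transcribed region of the gene may include untranslated regions, including introns, the 5'-untranslated region (UTR), and the 3'-UTR, as well as the coding sequence.
Those skilled in the art will recognize that, because of the degenerate nature of the genetic code, a variety of codons differing in their nucleotide sequences can be used to encode a given amino acid. A particular polynucleotide or gene sequence encoding a biosynthetic enzyme or polypeptide described herein (e.g., SEQ ID NO:2) is referenced merely to illustrate an embodiment of the disclosure, and the disclosure includes polynucleotides of any sequence that encode a polypeptide comprising the same amino acid sequence as the polypeptides and proteins of the enzymes used in the methods of the disclosure, or a polypeptide sequence that is at least 50%-99% identical thereto and has the same biological activity as the sequence having 100% identity. For example, a polypeptide can typically tolerate one or more amino acid substitutions, deletions, and insertions in its amino acid sequence without loss or significant loss of a desired activity. The disclosure includes such polypeptides with alternate amino acid sequences, and the amino acid sequences shown herein merely illustrate exemplary embodiments of the disclosure.
As described in more detail elsewhere herein, the disclosure provides polynucleotides in the form of recombinant DNA expression vectors or plasmids that encode one or more target enzymes. Generally, such vectors can either replicate in the cytoplasm of the host microorganism or integrate into the chromosomal DNA of the host microorganism, or can be used in a cell-free system. In either case, the vector can be a stable vector (i.e., the vector remains present over many cell divisions, even if only with selective pressure) or a transient vector (i.e., the vector is gradually lost by the host microorganism with increasing numbers of cell divisions). The disclosure provides DNA molecules in isolated form (i.e., not pure, but present in a preparation in an abundance and/or concentration not found in nature) and in purified form (i.e., substantially free of contaminating materials or substantially free of materials with which the corresponding DNA would be found in nature).
A polynucleotide of the disclosure can be amplified using cDNA, mRNA, or alternatively genomic DNA as a template, together with appropriate oligonucleotide primers, according to standard PCR amplification techniques and the procedures described in the Examples section below. The nucleic acid so amplified can be cloned into an appropriate vector and characterized by DNA sequence analysis. Furthermore, oligonucleotides corresponding to nucleotide sequences can be prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer.
It is also understood that an isolated polynucleotide molecule encoding a polypeptide homologous to the enzymes described herein can be created by introducing one or more nucleotide substitutions, additions, or deletions into the nucleotide sequence encoding the particular polypeptide, such that one or more amino acid substitutions, additions, or deletions are introduced into the encoded protein. Mutations can be introduced into the polynucleotide by standard techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. In contrast to those positions where it may be desirable to make a non-conservative amino acid substitution, in some positions it is preferable to make conservative amino acid substitutions.
As will be understood by those of skill in the art, it can be advantageous to modify a coding sequence to enhance its expression in a particular host. The genetic code is redundant, with 64 possible codons, but most organisms typically use a subset of these codons. The codons that are used most often in a species are called optimal codons, and those used less often are classified as rare or low-usage codons. Codons can be substituted to reflect the preferred codon usage of the host, a process sometimes called "codon optimization" or "controlling for species codon bias."
Optimized coding sequences containing codons preferred by a particular prokaryotic or eukaryotic host (see also Murray et al. (1989) Nucl. Acids Res. 17:477-508) can be prepared, for example, to increase the rate of translation or to produce recombinant RNA transcripts having desirable properties, such as a longer half-life, compared with transcripts produced from a non-optimized sequence. Translation stop codons can also be modified to reflect host preference. For example, typical stop codons for S. cerevisiae and mammals are UAA and UGA, respectively. The typical stop codon for monocotyledonous plants is UGA, whereas insects and E. coli commonly use UAA as the stop codon (Dalphin et al. (1996) Nucl. Acids Res. 24:216-218).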
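The codon substitution described above can be sketched as a synonymous rewrite of a coding sequence. This is a minimal illustration under stated assumptions, not a procedure from the disclosure: the function names `translate` and `optimize_codons` are hypothetical, and `HYPOTHETICAL_PREFERRED` is a made-up, partial preference table rather than measured codon usage for any host. Its one source-grounded entry is the use of TAA (UAA) as the stop codon, which the text gives as common in E. coli. The codon table itself is the standard genetic code.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Hypothetical, partial preference table (amino acid -> preferred codon).
# Illustrative values only; real work would use host codon-usage data.
HYPOTHETICAL_PREFERRED = {"L": "CTG", "K": "AAA", "*": "TAA"}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence with the standard code."""
    cds = cds.upper().replace("U", "T")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def optimize_codons(cds: str, preferred: dict) -> str:
    """Replace each codon with the preferred synonymous codon, if one is listed."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    for amino_acid, codon in preferred.items():
        if CODON_TABLE[codon] != amino_acid:
            raise ValueError(f"{codon} does not encode {amino_acid}")
    return "".join(
        preferred.get(CODON_TABLE[cds[i:i + 3]], cds[i:i + 3])
        for i in range(0, len(cds), 3)
    )


original = "ATGTTAAAGTGA"
optimized = optimize_codons(original, HYPOTHETICAL_PREFERRED)
# The encoded protein must be unchanged by a synonymous rewrite.
assert translate(original) == translate(optimized)
print(original, "->", optimized)
```

Codons whose amino acid has no entry in the preference table are left untouched, which keeps the rewrite strictly synonymous even with a partial table.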
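"Transformation" refers to the process by which a vector is introduced into a host cell. Transformation (or transduction, or transfection) can be achieved by any one of a number of means, including electroporation, microinjection, biolistics (or particle bombardment-mediated delivery), or agrobacterium-mediated transformation.
A "vector" generally refers to a polynucleotide that can be propagated and/or transferred between organisms, cells, or cellular components. Vectors include viruses, bacteriophage, pro-viruses, plasmids, phagemids, transposons, and artificial chromosomes such as YACs (yeast artificial chromosomes), BACs (bacterial artificial chromosomes), and PLACs (plant artificial chromosomes), and the like, that are "episomes," that is, that replicate autonomously or can integrate into a chromosome of a host cell. A vector can also be a naked RNA polynucleotide, a naked DNA polynucleotide, a polynucleotide composed of both DNA and RNA within the same strand, a polylysine-conjugated DNA or RNA, a peptide-conjugated DNA or RNA, a liposome-conjugated DNA, or the like, that are not episomal in nature, or it can be an organism that comprises one or more of the above polynucleotide constructs, such as a bacterium or a fungus.
The various components of an expression vector can vary widely, depending on the intended use of the vector and the host cell or cells in which the vector is intended to replicate or drive expression. Expression vector components suitable for the expression of genes and maintenance of vectors in bacteria, yeast, filamentous fungi, and other commonly used cells are widely known and commercially available. For example, suitable promoters for inclusion in the expression vectors of the disclosure include those that function in eukaryotic or prokaryotic host microorganisms. Promoters can comprise regulatory sequences that allow expression to be regulated relative to the growth of the host microorganism, or that cause the expression of a gene to be turned on or off in response to a chemical or physical stimulus. For E. coli and certain other bacterial host cells, promoters derived from genes for biosynthetic enzymes, antibiotic-resistance-conferring enzymes, and phage proteins can be used, and include, for example, the galactose, lactose (lac), maltose, tryptophan (trp), β-lactamase (bla), bacteriophage lambda PL, and T5 promoters. In addition, synthetic promoters, such as the tac promoter (U.S. Pat. No. 4,551,433, which is incorporated herein by reference in its entirety), can also be used. For E. coli expression vectors, it is useful to include an E. coli origin of replication, such as from pUC, p1P, p1, and pBR.
Thus, recombinant expression vectors contain at least one expression system for the biosynthetic platform disclosed herein, which in turn is composed of at least a portion of a gene coding sequence operably linked to a promoter and, optionally, termination sequences that operate to effect expression of the coding sequence in compatible host cells. The host cells are modified by transformation with the recombinant DNA expression vectors of the disclosure to contain the expression system sequences, either as extrachromosomal elements or integrated into the chromosome.
Examples of protocols sufficient to direct persons of skill through in vitro amplification methods, including the polymerase chain reaction (PCR), the ligase chain reaction (LCR), Qβ-replicase amplification, and other RNA polymerase-mediated techniques (e.g., NASBA), e.g., for the production of the homologous nucleic acids of the disclosure, are found in Berger, Sambrook, and Ausubel, as well as in Mullis et al. (1987) U.S. Pat. No. 4,683,202; Innis et al., eds. (1990) PCR Protocols: A Guide to Methods and Applications (Academic Press Inc., San Diego, Calif.) ("Innis"); Arnheim and Levinson (Oct. 1, 1990) C&EN 36-47; The Journal Of NIH Research (1991) 3:81-94; Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86:1173; Guatelli et al. (1990) Proc. Nat'l. Acad. Sci. USA 87:1874; Lomell et al. (1989) J. Clin. Chem 35:1826; Landegren et al. (1988) Science 241:1077-1080; Van Brunt (1990) Biotechnology 8:291-294; Wu and Wallace (1989) Gene 4:560; Barringer et al. (1990) Gene 89:117; and Sooknanan and Malek (1995) Biotechnology 13:563-564.
Improved methods for cloning in vitro amplified nucleic acids are described in Wallace et al., U.S. Pat. No. 5,426,039.
Improved methods for amplifying large nucleic acids by PCR are summarized in Cheng et al. (1994) Nature 369:684-685 and the references cited therein, in which PCR amplicons of up to 40 kb are generated. One of skill will appreciate that essentially any RNA can be converted into a double-stranded DNA suitable for restriction digestion, PCR amplification, and sequencing using reverse transcriptase and a polymerase. See, e.g., Ausubel, Sambrook, and Berger, all supra.
In addition, and as mentioned above, homologs of the enzymes of the biosynthetic platform of the disclosure that are useful for production (e.g., HRPKS, NRPKS, and TE) are encompassed by the microorganisms and methods provided herein. The term "homolog," used with respect to an original enzyme or gene of a first family or species, refers to a distinct enzyme or gene of a second family or species that is determined by functional, structural, or genomic analyses to correspond to the original enzyme or gene of the first family or species. Most often, homologs will have functional, structural, or genomic similarities. Techniques are known by which homologs of an enzyme or gene can readily be cloned using genetic probes and PCR. The identity of cloned sequences as homologs can be confirmed using functional assays and/or by genomic mapping of the genes.
A protein has "homology" or is "homologous" to a second protein if the nucleic acid sequence that encodes the protein has a sequence similar to the nucleic acid sequence that encodes the second protein. Alternatively, a protein has homology to a second protein if the two proteins have "similar" amino acid sequences. (Thus, the term "homologous proteins" is defined to mean that the two proteins have similar amino acid sequences.)
As used herein, two proteins (or a region of the proteins) are substantially homologous when the amino acid sequences have at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity. To determine the percent identity of two amino acid sequences, or of two nucleic acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid or nucleic acid sequence for optimal alignment, and non-homologous sequences can be disregarded for comparison purposes). In one embodiment, the length of a reference sequence aligned for comparison purposes is at least 30%, typically at least 40%, more typically at least 50%, even more typically at least 60%, and even more typically at least 70%, 80%, 90%, or 100% of the length of the reference sequence. The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position (as used herein, amino acid or nucleic acid "identity" is equivalent to amino acid or nucleic acid "homology"). The percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, that need to be introduced for optimal alignment of the two sequences.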
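The position-by-position comparison described above can be sketched for two sequences that have already been aligned. This is a minimal illustration, not the GCG or BLAST calculation: the function name `percent_identity` is hypothetical, gaps are assumed to be written as "-", and the denominator is taken to be the number of aligned columns that are not gap-versus-gap. Other conventions for the denominator exist (for example, the length of the shorter sequence), so reported values can differ between tools.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the columns of a pre-computed pairwise alignment.

    A column counts as identical when both sequences carry the same residue.
    A column with a gap in one sequence counts as a mismatch, and columns
    that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


# Toy alignment: five identical columns out of six.
print(round(percent_identity("MKT-AL", "MKTGAL"), 1))
```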
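Sequence homology for polypeptides, which may also be referred to as percent sequence identity, is typically measured using sequence analysis software. See, e.g., the Sequence Analysis Software Package of the Genetics Computer Group (GCG), University of Wisconsin Biotechnology Center, 910 University Avenue, Madison, Wis. 53705. Protein analysis software matches similar sequences using measures of homology assigned to various substitutions, deletions, and other modifications, including conservative amino acid substitutions. For instance, GCG contains programs such as "Gap" and "Bestfit" that can be used with default parameters to determine sequence homology or sequence identity between closely related polypeptides, such as homologous polypeptides from different species of organisms, or between a wild-type protein and a mutein thereof. See, e.g., GCG Version 6.1.
A typical algorithm for comparing a molecule sequence with a database containing a large number of sequences from different organisms is the computer program BLAST (Altschul, 1990; Gish, 1993; Madden, 1996; Altschul, 1997; Zhang, 1997), especially blastp or tblastn (Altschul, 1997). Typical parameters for BLASTp are: Expectation value: 10 (default); Filter: seg (default); Cost to open a gap: 11 (default); Cost to extend a gap: 1 (default); Max. alignments: 100 (default); Word size: 11 (default); No. of descriptions: 100 (default); Penalty matrix: BLOSUM62.
When searching a database containing sequences from a large number of different organisms, it is typical to compare amino acid sequences. Database searching using amino acid sequences can be measured by algorithms other than blastp that are known in the art. For instance, polypeptide sequences can be compared using FASTA, a program in GCG Version 6.1. FASTA provides alignments and percent sequence identity of the regions of best overlap between the query and search sequences (Pearson, 1990, hereby incorporated herein by reference). For example, percent sequence identity between amino acid sequences can be determined using FASTA with its default parameters (a word size of 2 and the PAM250 scoring matrix), as provided in GCG Version 6.1, hereby incorporated herein by reference.
When "homologous" is used in reference to proteins or peptides, it is recognized that residue positions that are not identical often differ by conservative amino acid substitutions. A "conservative amino acid substitution" is one in which an amino acid residue is replaced by another amino acid residue having a side chain (R group) with similar chemical properties (e.g., charge or hydrophobicity). In general, a conservative amino acid substitution will not substantially change the functional properties of a protein. In cases where two or more amino acid sequences differ from each other by conservative substitutions, the percent sequence identity or degree of homology may be adjusted upward to correct for the conservative nature of the substitution. Means for making this adjustment are well known to those of skill in the art (see, e.g., Pearson et al., 1994, hereby incorporated herein by reference).
A "conservative amino acid substitution" is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine), and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). The following six groups each contain amino acids that are conservative substitutions for one another: 1) Serine (S), Threonine (T); 2) Aspartic acid (D), Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L), Methionine (M), Alanine (A), Valine (V); and 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).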
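The six substitution groups listed above can be applied mechanically, for instance to check a candidate variant against a limit such as "1 to 20 conservative amino acid substitutions." The sketch below is one way to do so, under stated assumptions: the names `is_conservative` and `count_substitutions` are hypothetical, the two sequences are assumed to have equal length with no insertions or deletions, and any change that falls outside the six groups is simply counted as non-conservative.

```python
# The six groups of mutually conservative residues listed in the text.
CONSERVATIVE_GROUPS = [
    set("ST"),     # serine, threonine
    set("DE"),     # aspartic acid, glutamic acid
    set("NQ"),     # asparagine, glutamine
    set("RK"),     # arginine, lysine
    set("ILMAV"),  # isoleucine, leucine, methionine, alanine, valine
    set("FYW"),    # phenylalanine, tyrosine, tryptophan
]


def is_conservative(original: str, replacement: str) -> bool:
    """True if the two different residues share one of the six groups."""
    o, r = original.upper(), replacement.upper()
    if o == r:
        return False  # identical residues are not a substitution
    return any(o in group and r in group for group in CONSERVATIVE_GROUPS)


def count_substitutions(reference: str, variant: str) -> tuple:
    """Return (conservative, non_conservative) counts for equal-length sequences."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have equal length (no indels handled)")
    conservative = non_conservative = 0
    for ref, var in zip(reference.upper(), variant.upper()):
        if ref == var:
            continue
        if is_conservative(ref, var):
            conservative += 1
        else:
            non_conservative += 1
    return conservative, non_conservative


# Toy sequences: S->T, K->R, and L->V are all within-group changes.
print(count_substitutions("MSDKL", "MTDRV"))
```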
在一些情况下,可以使用“同工酶”,它们进行相同的功能转化/反应,但是在结构上如此不同,以致于它们通常被确定为不是“同源的”。
本公开文本提供了系统和/或重组微生物,所述系统和/或重组微生物包括高还原性聚酮合酶(HRPKS)。可以将此酶与本文公开的生物合成平台中的其他酶组合,用于产生如上文和下文所述的橄榄醇酸及其类似物。所述酶产生包括己酰辅酶A、己酸和/或其类似物的代谢物。高还原性聚酮合酶可以由HRPKS基因、多核苷酸或其同源物编码。HRPKS基因或多核苷酸可以来源于各种微生物,包括金龟子绿僵菌。
除了前述内容之外,术语“高还原性聚酮合酶”或“HRPKS”是指能够催化从乙酰辅酶A和丙二酰辅酶A形成己酰辅酶A、己酸和/或其类似物并且如使用默认参数由NCBI BLAST计算的与SEQ ID NO:2具有至少约50%、55%、60%、65%、70%、75%、80%、85%、90%、95%、96%、97%、98%、99%或更高序列同一性或者至少约50%、60%、70%、80%、90%、95%、96%、97%、98%、99%或更高序列相似性的蛋白质。另外的同源物包括表1中呈现的与具有SEQ ID NO:2的序列的HRPKS同源的那些序列。将与所呈现的登录号相关的序列通过引用并入本文。
表1:金龟子绿僵菌HRPKS(SEQ ID NO:2)的同源物。
Figure BDA0003811277370000121
在另一个实施方案中,本文提供的系统或重组微生物包括非还原性聚酮合酶(NRPKS)。可以将此酶与本文公开的生物合成平台中的其他酶组合,用于产生如上文和下文所述的橄榄醇酸及其类似物。所述酶产生包括来自己酰辅酶A、己酸和/或其类似物的芳族二醇代谢物的代谢物。非还原性聚酮合酶可以由NRPKS基因、多核苷酸或其同源物编码。NRPKS基因或多核苷酸可以来源于各种微生物,包括金龟子绿僵菌。
除了前述内容之外,术语“非还原性聚酮合酶”或“NRPKS”是指能够催化从己酰辅酶A、己酸或其类似物形成芳族二醇代谢物并且如使用默认参数由NCBI BLAST计算的与SEQID NO:4具有至少约50%、55%、60%、65%、70%、75%、80%、85%、90%、95%、96%、97%、98%、99%或更高序列同一性或者至少约50%、60%、70%、80%、90%、95%、96%、97%、98%、99%或更高序列相似性的蛋白质。另外的同源物包括表2中呈现的与具有SEQ ID NO:4的序列的HRPKS同源的那些序列。将与前述登录号相关的序列通过引用并入本文。
表2:金龟子绿僵菌NRPKS(SEQ ID NO:4)的同源物。
Figure BDA0003811277370000131
在另一个实施方案中,本文提供的系统或重组微生物包括硫酯酶(TE)。可以将此酶与本文公开的生物合成平台中的其他酶组合,用于产生如上文和下文所述的橄榄醇酸及其类似物。所述酶从芳族二醇代谢物产生橄榄醇酸和类似物。硫酯酶可以由TE基因、多核苷酸或其同源物编码。TE基因或多核苷酸可以来源于各种微生物,包括金龟子绿僵菌。
除了前述内容之外,术语“硫酯酶”或“TE”或“ΨACP-TE”是指能够催化从芳族二醇代谢物形成橄榄醇酸及其类似物并且如使用默认参数由NCBI BLAST计算的与SEQ ID NO:6具有至少约50%、55%、60%、65%、70%、75%、80%、85%、90%、95%、96%、97%、98%、99%或更高序列同一性或者至少约50%、60%、70%、80%、90%、95%、96%、97%、98%、99%或更高序列相似性的蛋白质。另外的同源物包括表3中呈现的与具有SEQ ID NO:6的序列的ΨACP-TE同源的那些序列。将与前述登录号相关的序列通过引用并入本文。
表3:金龟子绿僵菌TE(SEQ ID NO:6)的同源物。
Figure BDA0003811277370000132
Figure BDA0003811277370000141
在本文提出的研究中,发现源自真菌金龟子绿僵菌的生物合成平台以高产率提供橄榄醇酸及其类似物。特别地,在构巢曲霉中异源表达上述生物合成平台,并且在没有进行任何代谢优化的情况下以>4g/L的总效价获得橄榄醇酸及其类似物。在实践中,也可以在大肠杆菌、酵母和其他异源微生物宿主中表达所述生物合成平台用于产生橄榄醇酸及其类似物。
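In another embodiment, the disclosure demonstrates the use of the biosynthetic platform of the disclosure to produce olivetolic acid and analogs thereof. The biosynthetic platform includes the expression or overexpression of one or more heterologous polynucleotides encoding: (i) a polypeptide that catalyzes the production of hexanoyl-CoA, hexanoic acid and/or analogs thereof from acetyl-CoA and malonyl-CoA; (ii) a polypeptide that catalyzes the production of an aromatic diol metabolite from hexanoyl-CoA, hexanoic acid or analogs thereof; and (iii) a polypeptide that catalyzes the production of olivetolic acid and analogs thereof from the aromatic diol. In one embodiment, the biosynthetic platform includes (i) a cell-free system comprising a set of polypeptides that includes (1) a polypeptide having 50%-100% sequence identity to SEQ ID NO: 2, (2) a polypeptide having 50%-100% sequence identity to SEQ ID NO: 4 and (3) a polypeptide having 45%-100% sequence identity to SEQ ID NO: 6, such that the cell-free system can convert acetyl-CoA and malonyl-CoA to olivetolic acid or an analog thereof; or (ii) at least one recombinant cell expressing a heterologous polypeptide selected from (1) a polypeptide having 50%-100% sequence identity to SEQ ID NO: 2, (2) a polypeptide having 50%-100% sequence identity to SEQ ID NO: 4, (3) a polypeptide having 45%-100% sequence identity to SEQ ID NO: 6 and (4) any combination of (1)-(3), such that the recombinant cell can convert acetyl-CoA and malonyl-CoA to olivetolic acid or an analog thereof.
In one embodiment, the cell-free system comprises a first polypeptide having a sequence selected from SEQ ID NOs: 2, 8 and 14; a second polypeptide having a sequence selected from SEQ ID NOs: 4, 10 and 16; and a third polypeptide having a sequence selected from SEQ ID NOs: 6, 12 and 18, wherein the cell-free system can convert acetyl-CoA and malonyl-CoA to olivetolic acid.
In another embodiment, the at least one recombinant microorganism expresses a first heterologous polypeptide having a sequence selected from SEQ ID NOs: 2, 8 and 14; a second heterologous polypeptide having a sequence selected from SEQ ID NOs: 4, 10 and 16; and/or a third heterologous polypeptide having a sequence selected from SEQ ID NOs: 6, 12 and 18, wherein a culture comprising the at least one recombinant microorganism expresses the first, second and third heterologous polypeptides and can convert acetyl-CoA and malonyl-CoA to olivetolic acid.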
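The "first, second and third polypeptide" language above describes a combinatorial choice: one HRPKS, one NRPKS and one TE, each drawn from three listed sequences. The short Python sketch below simply enumerates those choices. The constant and function names are hypothetical, and nothing here implies that every mixed combination has been tested experimentally.

```python
from itertools import product

# SEQ ID NOs of the polypeptides named in the embodiments above.
HRPKS_IDS = (2, 8, 14)
NRPKS_IDS = (4, 10, 16)
TE_IDS = (6, 12, 18)


def enzyme_combinations():
    """Return every (HRPKS, NRPKS, TE) triple permitted by the three lists."""
    return list(product(HRPKS_IDS, NRPKS_IDS, TE_IDS))


if __name__ == "__main__":
    combos = enzyme_combinations()
    print(f"{len(combos)} possible three-enzyme sets")
    # Triples whose members share a position in each list, i.e. the sets the
    # disclosure pairs together: (2, 4, 6), (8, 10, 12) and (14, 16, 18).
    same_source = list(zip(HRPKS_IDS, NRPKS_IDS, TE_IDS))
    print("same-position sets:", same_source)
```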
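One skilled in the art will recognize that the various metabolites identified above can serve as substrates for other catabolic or anabolic pathways.
It will be recognized that a subsystem or organism having one or more (but not all) of the foregoing enzymes can be used and then combined with an organism or other subsystem comprising the remaining enzyme members of the pathway.
As previously noted, the target enzymes described throughout this disclosure generally produce metabolites. In addition, the target enzymes described throughout this disclosure are encoded by polynucleotides.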
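Thus, in one embodiment, a system or recombinant microorganism provided herein comprises a highly reducing polyketide synthase (HRPKS) or a homolog or variant thereof. This expression can be combined with enzymes of the biosynthetic pathway and can further include additional downstream enzymes for producing olivetolic acid or additional metabolites. The HRPKS can be derived from Metarhizium anisopliae, Tolypocladium inflatum, Metarhizium rileyi and/or Talaromyces islandicus (or other organisms identified in Table 1 above). In another embodiment, an engineered variant of HRPKS can be used, so long as it has highly reducing polyketide synthase activity and can convert acetyl-CoA and malonyl-CoA to hexanoyl-CoA, hexanoic acid and/or analogs thereof. Such engineered variants can be obtained by site-directed mutagenesis, directed evolution and the like. Thus, included within the disclosure are polypeptides that are at least 85%-99% identical to the sequence of the HRPKS from M. anisopliae (e.g., 85%-100% identical to SEQ ID NO: 2) and have highly reducing polyketide synthase activity. Also included are polypeptides that are at least 85%-99% identical to the sequence of the HRPKS from T. inflatum and have highly reducing polyketide synthase activity. Also included are polypeptides that are at least 85%-99% identical to the sequence of the HRPKS from M. rileyi (e.g., 85%-100% identical to SEQ ID NO: 8) and have highly reducing polyketide synthase activity. Also included are polypeptides that are at least 85%-99% identical to the sequence of the HRPKS from T. islandicus (e.g., 85%-100% identical to SEQ ID NO: 14) and have highly reducing polyketide synthase activity.
In another or further embodiment, a system or recombinant microorganism provided herein includes expression of a non-reducing polyketide synthase or a homolog or variant thereof. This expression can be combined with enzymes of the biosynthetic pathway and can further include additional downstream enzymes for producing olivetolic acid or additional metabolites. The NRPKS can be derived from Metarhizium anisopliae, Tolypocladium inflatum, Metarhizium rileyi and/or Talaromyces islandicus (or other organisms identified in Table 2 above). In another embodiment, an engineered variant of NRPKS can be used, so long as it has non-reducing polyketide synthase activity and can convert hexanoyl-CoA, hexanoic acid and/or analogs thereof to an aromatic diol metabolite. Such engineered variants can be obtained by site-directed mutagenesis, directed evolution and the like. Thus, included within the disclosure are polypeptides that are at least 85%-99% identical to the sequence of the NRPKS from M. anisopliae (e.g., 85%-100% identical to SEQ ID NO: 4) and have non-reducing polyketide synthase activity. Also included are polypeptides that are at least 85%-99% identical to the sequence of the NRPKS from T. inflatum and have non-reducing polyketide synthase activity. Also included are polypeptides that are at least 85%-99% identical to the sequence of the NRPKS from M. rileyi (e.g., 85%-100% identical to SEQ ID NO: 10) and have non-reducing polyketide synthase activity. Also included are polypeptides that are at least 85%-99% identical to the sequence of the NRPKS from T. islandicus (e.g., 85%-100% identical to SEQ ID NO: 16) and have non-reducing polyketide synthase activity.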
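In another or further embodiment, a system or recombinant microorganism provided herein includes expression of a thioesterase or a homolog or variant thereof. This expression can be combined with enzymes of the biosynthetic pathway and can further include additional downstream enzymes for producing olivetolic acid or additional metabolites. The TE can be derived from Metarhizium anisopliae, Tolypocladium inflatum, Metarhizium rileyi and/or Talaromyces islandicus (or other organisms identified in Table 3 above). In another embodiment, an engineered variant of TE can be used, so long as it has thioesterase activity and can convert an aromatic diol metabolite to olivetolic acid and analogs thereof. Such engineered variants can be obtained by site-directed mutagenesis, directed evolution and the like. Thus, included within the disclosure are polypeptides that are at least 85%-99% identical to the sequence of the TE from M. anisopliae (e.g., 85%-100% identical to SEQ ID NO: 6) and have thioesterase activity. Also included are polypeptides that are at least 85%-99% identical to the sequence of the TE from T. inflatum and have thioesterase activity. Also included are polypeptides that are at least 85%-99% identical to the sequence of the TE from M. rileyi (e.g., 85%-100% identical to SEQ ID NO: 12) and have thioesterase activity. Also included are polypeptides that are at least 85%-99% identical to the sequence of the TE from T. islandicus (e.g., 85%-100% identical to SEQ ID NO: 18) and have thioesterase activity.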
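As indicated above, the disclosure further provides for producing variants of the enzymes that make up the biosynthetic platform disclosed herein. Such enzyme variants can broaden substrate specificity, alter substrate specificity, improve reaction kinetics, increase enzyme stability and so on. For example, variants of the enzymes that make up the biosynthetic platform disclosed herein can change the ratio of the products, such as yielding more C6 than C8 OA. Enzyme variants can be based on changes made to the sequence of an enzyme (e.g., SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16 or 18) using mutational methods or directed evolution methods.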
Mutational methods of generating enzyme variants include, for example, site-directed mutagenesis (Ling et al. (1997) "Approaches to DNA mutagenesis: an overview" Anal Biochem. 254(2):157-178; Dale et al. (1996) "Oligonucleotide-directed random mutagenesis using the phosphorothioate method" Methods Mol. Biol. 57:369-374; Smith (1985) "In vitro mutagenesis" Ann. Rev. Genet. 19:423-462; Botstein and Shortle (1985) "Strategies and applications of in vitro mutagenesis" Science 229:1193-1201; Carter (1986) "Site-directed mutagenesis" Biochem. J. 237:1-7; and Kunkel (1987) "The efficiency of oligonucleotide directed mutagenesis" in Nucleic Acids & Molecular Biology (Eckstein, F. and Lilley, D.M.J. eds., Springer Verlag, Berlin)); mutagenesis using uracil-containing templates (Kunkel (1985) "Rapid and efficient site-specific mutagenesis without phenotypic selection" Proc. Natl. Acad. Sci. USA 82:488-492; Kunkel et al. (1987) "Rapid and efficient site-specific mutagenesis without phenotypic selection" Methods in Enzymol. 154:367-382; and Bass et al. (1988) "Mutant Trp repressors with new DNA-binding specificities" Science 242:240-245); oligonucleotide-directed mutagenesis (Methods in Enzymol. 100:468-500 (1983); Methods in Enzymol. 154:329-350 (1987); Zoller and Smith (1982) "Oligonucleotide-directed mutagenesis using M13-derived vectors: an efficient and general procedure for the production of point mutations in any DNA fragment" Nucleic Acids Res. 10:6487-6500; Zoller and Smith (1983) "Oligonucleotide-directed mutagenesis of DNA fragments cloned into M13 vectors" Methods in Enzymol. 100:468-500; and Zoller and Smith (1987) "Oligonucleotide-directed mutagenesis: a simple method using two oligonucleotide primers and a single-stranded DNA template" Methods in Enzymol. 154:329-350); phosphorothioate-modified DNA mutagenesis (Taylor et al. (1985) "The use of phosphorothioate-modified DNA in restriction enzyme reactions to prepare nicked DNA" Nucl. Acids Res. 13:8749-8764; Taylor et al. (1985) "The rapid generation of oligonucleotide-directed mutations at high frequency using phosphorothioate-modified DNA" Nucl. Acids Res. 13:8765-8787; Nakamaye and Eckstein (1986) "Inhibition of restriction endonuclease Nci I cleavage by phosphorothioate groups and its application to oligonucleotide-directed mutagenesis" Nucl. Acids Res. 14:9679-9698; Sayers et al. (1988) "Y-T Exonucleases in phosphorothioate-based oligonucleotide-directed mutagenesis" Nucl. Acids Res. 16:791-802; and Sayers et al. (1988) "Strand specific cleavage of phosphorothioate-containing DNA by reaction with restriction endonucleases in the presence of ethidium bromide" Nucl. Acids Res. 16:803-814); and mutagenesis using gapped duplex DNA (Kramer et al. (1984) "The gapped duplex DNA approach to oligonucleotide-directed mutation construction" Nucl. Acids Res. 12:9441-9456; Kramer and Fritz (1987) "Oligonucleotide-directed construction of mutations via gapped duplex DNA" Methods in Enzymol. 154:350-367; Kramer et al. (1988) "Improved enzymatic in vitro reactions in the gapped duplex DNA approach to oligonucleotide-directed construction of mutations" Nucl. Acids Res. 16:7207; and Fritz et al. (1988) "Oligonucleotide-directed construction of mutations: a gapped duplex DNA procedure without enzymatic reactions in vitro" Nucl. Acids Res. 16:6987-6999) (each of which is incorporated by reference).
Additional suitable methods include point mismatch repair (Kramer et al. (1984) "Point Mismatch Repair" Cell 38:879-887); mutagenesis using repair-deficient host strains (Carter et al. (1985) "Improved oligonucleotide site-directed mutagenesis using M13 vectors" Nucl. Acids Res. 13:4431-4443; and Carter (1987) "Improved oligonucleotide-directed mutagenesis using M13 vectors" Methods in Enzymol. 154:382-403); deletion mutagenesis (Eghtedarzadeh and Henikoff (1986) "Use of oligonucleotides to generate large deletions" Nucl. Acids Res. 14:5115); restriction-selection and restriction-purification (Wells et al. (1986) "Importance of hydrogen-bond formation in stabilizing the transition state of subtilisin" Phil. Trans. R. Soc. Lond. A 317:415-423); mutagenesis by total gene synthesis (Nambiar et al. (1984) "Total synthesis and cloning of a gene coding for the ribonuclease S protein" Science 223:1299-1301; Sakmar and Khorana (1988) "Total synthesis and expression of a gene for the α-subunit of bovine rod outer segment guanine nucleotide-binding protein (transducin)" Nucl. Acids Res. 14:6361-6372; Wells et al. (1985) "Cassette mutagenesis: an efficient method for generation of multiple mutations at defined sites" Gene 34:315-323; and Grundstrom et al. (1985) "Oligonucleotide-directed mutagenesis by microscale 'shot-gun' gene synthesis" Nucl. Acids Res. 13:3305-3316); and double-strand break repair (Mandecki (1986) "Oligonucleotide-directed double-strand break repair in plasmids of Escherichia coli: a method for site-specific mutagenesis" Proc. Natl. Acad. Sci. USA 83:7177-7181; and Arnold (1993) "Protein engineering for unusual environments" Current Opinion in Biotechnology 4:450-455) (each of which is incorporated by reference). Additional details on many of the above methods can be found in Methods in Enzymology Volume 154, which also describes useful controls for troubleshooting problems with various mutagenesis methods.
Additional details regarding various diversity-generating methods can be found in the following U.S. patents, PCT publications and EPO publications: U.S. Pat. No. 5,605,793 to Stemmer (Feb. 25, 1997), "Methods for In Vitro Recombination"; U.S. Pat. No. 5,811,238 to Stemmer et al. (Sep. 22, 1998), "Methods for Generating Polynucleotides having Desired Characteristics by Iterative Selection and Recombination"; U.S. Pat. No. 5,830,721 to Stemmer et al. (Nov. 3, 1998), "DNA Mutagenesis by Random Fragmentation and Reassembly"; U.S. Pat. No. 5,834,252 to Stemmer et al. (Nov. 10, 1998), "End-Complementary Polymerase Reaction"; U.S. Pat. No. 5,837,458 to Minshull et al. (Nov. 17, 1998), "Methods and Compositions for Cellular and Metabolic Engineering"; WO 95/22625, Stemmer and Crameri, "Mutagenesis by Random Fragmentation and Reassembly"; WO 96/33207 by Stemmer and Lipschutz, "End Complementary Polymerase Chain Reaction"; WO 97/20078 by Stemmer and Crameri, "Methods for Generating Polynucleotides having Desired Characteristics by Iterative Selection and Recombination"; WO 97/35966 by Minshull and Stemmer, "Methods and Compositions for Cellular and Metabolic Engineering"; WO 99/41402 by Punnonen et al., "Targeting of Genetic Vaccine Vectors"; WO 99/41383 by Punnonen et al., "Antigen Library Immunization"; WO 99/41369 by Punnonen et al., "Genetic Vaccine Vector Engineering"; WO 99/41368 by Punnonen et al., "Optimization of Immunomodulatory Properties of Genetic Vaccines"; EP 752008 by Stemmer and Crameri, "DNA Mutagenesis by Random Fragmentation and Reassembly"; EP 0932670 by Stemmer, "Evolving Cellular DNA Uptake by Recursive Sequence Recombination"; WO 99/23107 by Stemmer et al., "Modification of Virus Tropism and Host Range by Viral Genome Shuffling"; WO 99/21979 by Apt et al., "Human Papillomavirus Vectors"; WO 98/31837 by del Cardayre et al., "Evolution of Whole Cells and Organisms by Recursive Sequence Recombination"; WO 98/27230 by Patten and Stemmer, "Methods and Compositions for Polypeptide Engineering"; WO 98/13487 by Stemmer et al., "Methods for Optimization of Gene Therapy by Recursive Sequence Shuffling and Selection"; WO 00/00632, "Methods for Generating Highly Diverse Libraries"; WO 00/09679, "Methods for Obtaining in vitro Recombined Polynucleotide Sequence Banks and Resulting Sequences"; WO 98/42832 by Arnold et al., "Recombination of Polynucleotide Sequences Using Random or Defined Primers"; WO 99/29902 by Arnold et al., "Method for Creating Polynucleotide and Polypeptide Sequences"; WO 98/41653 by Vind, "An in vitro Method for Construction of a DNA Library"; WO 98/41622 by Borchert et al., "Method for Constructing a Library Using DNA Shuffling"; WO 98/42727 by Pati and Zarling, "Sequence Alterations using Homologous Recombination"; WO 00/18906 by Patten et al., "Shuffling of Codon-Altered Genes"; WO 00/04190 by del Cardayre et al., "Evolution of Whole Cells and Organisms by Recursive Recombination"; WO 00/42561 by Crameri et al., "Oligonucleotide Mediated Nucleic Acid Recombination"; WO 00/42559 by Selifonov and Stemmer, "Methods of Populating Data Structures for Use in Evolutionary Simulations"; WO 00/42560 by Selifonov et al., "Methods for Making Character Strings, Polynucleotides & Polypeptides Having Desired Characteristics"; WO 01/23401 by Welch et al., "Use of Codon-Varied Oligonucleotide Synthesis for Synthetic Shuffling"; and WO 01/64864 by Affholter, "Single-Stranded Nucleic Acid Template-Mediated Recombination and Nucleic Acid Fragment Isolation" (each of which is incorporated by reference).
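Directed evolution can also be used to generate variants of the enzymes of the biosynthetic platform disclosed herein. Directed evolution (DE) is a method used in protein engineering that mimics the process of natural selection to steer a protein or nucleic acid toward a user-defined goal. It consists of subjecting a gene to iterative rounds of mutagenesis (creating a library of variants), selection (expressing those variants and isolating the members with the desired function) and amplification (generating a template for the next round). These steps are typically repeated over multiple rounds, using the best variant from one round as the template for the next, to achieve stepwise improvement. The first step of a directed evolution cycle is generating a library of variant genes. The starting gene can be mutagenized by random point mutations (using chemical mutagens or error-prone PCR) and by insertions and deletions (using transposons). Gene recombination can be mimicked by DNA shuffling of several sequences (usually sharing more than 70% sequence identity) in order to jump into regions of sequence space lying between the shuffled parent genes. Finally, specific regions of a gene can be systematically randomized, based on structural and functional knowledge, for a more focused approach.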
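The mutagenesis, selection and amplification cycle described above can be illustrated with a toy simulation. The Python sketch below is purely illustrative and makes several assumptions that are not part of the disclosure: sequences are short amino-acid strings, mutation is uniform random substitution, and "fitness" is a hypothetical stand-in for a screening assay (here, the number of positions matching an arbitrary target string).

```python
import random

ALPHABET = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids


def mutate(seq: str, rate: float, rng: random.Random) -> str:
    """Random point mutagenesis: each position is replaced with probability `rate`."""
    return "".join(rng.choice(ALPHABET) if rng.random() < rate else aa for aa in seq)


def fitness(seq: str, target: str) -> int:
    """Hypothetical screen: number of positions that match the target."""
    return sum(a == b for a, b in zip(seq, target))


def directed_evolution(parent, target, rounds=10, library_size=200, rate=0.05, seed=0):
    """Iterate library generation and selection, carrying the best variant forward."""
    rng = random.Random(seed)
    best = parent
    history = [fitness(best, target)]
    for _ in range(rounds):
        # Mutagenesis: build a library of variants from the current template.
        library = [mutate(best, rate, rng) for _ in range(library_size)]
        library.append(best)  # keep the template so fitness never decreases
        # Selection: keep the top-scoring member as the next round's template.
        best = max(library, key=lambda s: fitness(s, target))
        history.append(fitness(best, target))
    return best, history


if __name__ == "__main__":
    start = "MQAPAPSRDDIAVVGL"    # first 16 residues of SEQ ID NO: 2
    goal = "MQAPTPSRDEIAVIGL"     # arbitrary hypothetical target
    winner, scores = directed_evolution(start, goal)
    print(winner, scores)
```

Keeping the template in each library mirrors the practice of carrying the best variant of one round into the next; a real campaign would replace the scoring function with an activity or titer measurement.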
The disclosure further provides for making additional forms of the enzymes that make up the biosynthetic pathway disclosed herein, including but not limited to (i) splitting the HRPKS and NRPKS; (ii) fusing enzymes such as the NRPKS and TE; and (iii) reshuffling domains, such as fusing part of one enzyme to part of another enzyme (see, e.g., Kolkman et al., "Directed evolution of proteins by exon shuffling". Nature Biotechnology 19(5):423-8 (2001); Morgante et al., "Gene duplication and exon shuffling by helitron-like transposons generate intraspecies diversity in maize". Nature Genetics 37(9):997-1002 (2005); Van Rijk, "Molecular mechanisms of exon shuffling: Illegitimate recombination". Genetica 118(2-3):245-9 (2003); Elleuche, S., "Bringing functions together with fusion enzymes--from nature's inventions to biotechnological applications". Appl Microbiol Biotechnol. 99(4):1545-56 (2015); Aalbers et al., "Enzyme Fusions in Biocatalysis: Coupling Reactions by Pairing Enzymes". Chembiochem. 20(1):20-28 (2019)).
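Polynucleotides encoding the polypeptides of the disclosure are provided in SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15 and 17. It will be readily recognized that the sequences are listed as DNA; however, RNA in which "T" is replaced by "U" is contemplated as being encompassed by the sequence listing accompanying this disclosure. As mentioned above, the polynucleotides of the disclosure can be cloned into vectors for expression. Vectors suitable for expression are known in the art and are described elsewhere herein. In some embodiments, a cell or vector of the disclosure can comprise at least one polynucleotide selected from SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15 and 17. In another embodiment, the cell or vector comprises a first polynucleotide selected from SEQ ID NOs: 1, 7 and 13; a second polynucleotide selected from SEQ ID NOs: 3, 9 and 15; and a third polynucleotide selected from SEQ ID NOs: 5, 11 and 17. In another embodiment, the disclosure contemplates polynucleotides that hybridize under moderately stringent to stringent conditions to a polynucleotide consisting of a sequence selected from SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15 and 17. Stringent hybridization conditions are well known in the art. In addition, the disclosure contemplates polynucleotides that are at least 70%, 80%, 85%, 90%, 92%, 95%, 97% or 99% identical to any one of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15 or 17 and that encode a polypeptide having the sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16 or 18, respectively.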
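The relationship between the listed DNA sequences, their RNA counterparts and the encoded polypeptides can be checked with a few lines of Python. The sketch below assumes the standard genetic code and uses only the first 16 codons of SEQ ID NO: 1 as printed in the sequence listing; the function names are hypothetical.

```python
# Standard genetic code, ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def to_rna(dna: str) -> str:
    """Return the RNA form of a DNA sequence (T replaced by U)."""
    return dna.upper().replace(" ", "").replace("T", "U")


def translate(dna: str) -> str:
    """Translate a DNA coding sequence into one-letter amino acids ('*' = stop)."""
    seq = dna.upper().replace(" ", "")
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


if __name__ == "__main__":
    # First 16 codons of SEQ ID NO: 1 (positions 1-48 of the CDS).
    first_48 = "atg caa gcg cca gca cca tca aga gac gac att gcc gtc gtc ggc ttg"
    print(to_rna(first_48))
    print(translate(first_48))  # should agree with the start of SEQ ID NO: 2
```

The translated fragment can be compared with the three-letter residues printed beneath the first line of the CDS (Met Gln Ala Pro Ala Pro Ser Arg Asp Asp Ile Ala Val Val Gly Leu).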
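As previously discussed, general texts describing molecular biological techniques useful herein (including the use of vectors and promoters and many other relevant topics) include Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology Volume 152 (Academic Press, Inc., San Diego, Calif.) ("Berger"); Sambrook et al., Molecular Cloning--A Laboratory Manual, 2d ed., Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1989 ("Sambrook"); and Current Protocols in Molecular Biology, F.M. Ausubel et al., eds., Current Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc. (supplemented through 1999) ("Ausubel") (each of which is incorporated by reference). Examples of protocols sufficient to direct persons of skill through in vitro amplification methods, including the polymerase chain reaction (PCR), the ligase chain reaction (LCR), Qβ-replicase amplification and other RNA polymerase mediated techniques (e.g., NASBA), e.g., for the production of the homologous nucleic acids of the disclosure, are found in Berger, Sambrook and Ausubel, as well as in Mullis et al. (1987) U.S. Pat. No. 4,683,202; Innis et al., eds. (1990) PCR Protocols: A Guide to Methods and Applications (Academic Press Inc., San Diego, Calif.) ("Innis"); Arnheim and Levinson (Oct. 1, 1990) C&EN 36-47; The Journal Of NIH Research (1991) 3:81-94; Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86:1173; Guatelli et al. (1990) Proc. Nat'l. Acad. Sci. USA 87:1874; Lomell et al. (1989) J. Clin. Chem 35:1826; Landegren et al. (1988) Science 241:1077-1080; Van Brunt (1990) Biotechnology 8:291-294; Wu and Wallace (1989) Gene 4:560; Barringer et al. (1990) Gene 89:117; and Sooknanan and Malek (1995) Biotechnology 13:563-564 (each of which is incorporated by reference). Improved methods for cloning in vitro amplified nucleic acids are described in Wallace et al., U.S. Pat. No. 5,426,039. Improved methods for amplifying large nucleic acids by PCR are summarized in Cheng et al. (1994) Nature 369:684-685 and the references cited therein (incorporated herein by reference), in which PCR amplicons of up to 40 kb were generated. One of skill will appreciate that essentially any RNA can be converted into a double-stranded DNA suitable for restriction digestion, PCR amplification and sequencing using reverse transcriptase and a polymerase. See, e.g., Ausubel, Sambrook and Berger, all supra.
The following examples are intended to illustrate but not limit the disclosure. While they are typical of those that might be used, other procedures known to those skilled in the art may alternatively be used.
EXAMPLES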
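Plasmid construction and expression: Plasmids pYTU, pYTP and pYTR were digested with PacI and SwaI. The genes encoding HRPKS, NRPKS and ΨACP-TE (i.e., HRPKS (SEQ ID NO: 2), NRPKS (SEQ ID NO: 4) and ΨACP-TE (SEQ ID NO: 6)) were inserted into the plasmids using these restriction sites. The genes were amplified by PCR using genomic DNA of Metarhizium anisopliae ARSEF23 as the template. The glaA promoter and trpC terminator were amplified by PCR using pYTR as the template. The PCR fragments were transformed into yeast, and plasmids pYTU-glaA-NRPKS-trpC, pYTP-glaA-ΨACP-TE and pYTR-glaA-HRPKS-trpC were generated by homologous recombination. Yeast transformation was performed using the Frozen-EZ Yeast Transformation II Kit™ (Zymo Research). The plasmids were extracted from yeast and transformed into E. coli TOP10 by electroporation to isolate single plasmids. After extraction from E. coli, the plasmid sequences were confirmed by sequencing. All three plasmids (pYTU-glaA-NRPKS-trpC, pYTP-glaA-ΨACP-TE and pYTR-glaA-HRPKS-trpC) were transformed into Aspergillus nidulans (A. nidulans) using the method described by Liu et al. (Org Lett. 19:3560-3563 (2017)) to form the olivetolic acid production strain.
The strain was then cultured in 10 mL of CD-ST medium (20 g/L starch, 20 g/L casein, 50 mL/L nitrate salts and 1 mL/L trace elements) in a 50 mL Falcon tube and kept overnight in a shaker at 28°C and 250 rpm. The next day, 25 μL of the culture was inoculated into 25 mL of CD-ST medium in a 125 mL flask and kept overnight in a shaker at 28°C and 250 rpm.
Three other clusters homologous to the M. anisopliae cluster were identified, all of which contain HRPKS, NRPKS and ΨACP-TE genes. Although these clusters all have the same three genes, their sequence identities differ, suggesting diversity in the products produced. Because the genes of the homologous clusters differ in sequence identity from the genes of the M. anisopliae cluster, expanded product diversity can be envisioned through differing methyltransferase and enoyl reductase domain activities.
Plasmids as described above for M. anisopliae were generated from the genes of these homologous clusters and heterologously expressed in A. nidulans. The product profiles were then analyzed.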
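Detection and isolation: LC-MS analysis was performed on a Shimadzu 2020 EV LC-MS (Kinetex, 1.7 μm, 2.0 × 100 mm, C18 column) using positive- and negative-mode electrospray ionization. The elution method consisted of a linear gradient of 5%-95% (v/v) acetonitrile/water over 13.25 minutes, followed by 95% (v/v) acetonitrile/water for 4.75 minutes, at a flow rate of 0.3 mL/min. The LC mobile phases were supplemented with 0.1% formic acid (v/v).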
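The elution program just described is a simple piecewise function of time. As an illustration only, the Python sketch below encodes the stated gradient (5% to 95% acetonitrile over 13.25 min, then a 4.75 min hold at 95%, at 0.3 mL/min) and derives the total run time and mobile-phase volume per injection. The function and constant names are hypothetical, and re-equilibration time is not stated in the text and is therefore not modeled.

```python
GRADIENT_MIN = 13.25      # duration of the linear gradient, minutes
HOLD_MIN = 4.75           # duration of the 95% hold, minutes
START_PCT, END_PCT = 5.0, 95.0
FLOW_ML_PER_MIN = 0.3


def percent_acetonitrile(t_min: float) -> float:
    """Acetonitrile content (% v/v) of the mobile phase at time t_min."""
    total = GRADIENT_MIN + HOLD_MIN
    if t_min < 0 or t_min > total:
        raise ValueError(f"time must lie within 0 to {total} min")
    if t_min <= GRADIENT_MIN:
        # Linear ramp from START_PCT to END_PCT.
        return START_PCT + (END_PCT - START_PCT) * (t_min / GRADIENT_MIN)
    return END_PCT  # isocratic hold


def run_summary():
    """Total run time (min) and mobile-phase volume (mL) for one injection."""
    total_time = GRADIENT_MIN + HOLD_MIN
    return total_time, total_time * FLOW_ML_PER_MIN


if __name__ == "__main__":
    for t in (0.0, 6.625, 13.25, 18.0):
        print(f"t = {t:6.3f} min -> {percent_acetonitrile(t):.1f}% acetonitrile")
    minutes, volume = run_summary()
    print(f"run time {minutes:.2f} min, about {volume:.1f} mL mobile phase")
```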
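Large-scale production of the compounds was carried out by culturing the transformants on 1 liter of solid CD-ST agar divided among 20 Petri dishes. After 4-5 days of growth at 28°C, the agar was extracted thoroughly with acetone. The extract was concentrated under reduced pressure and further extracted three times with acidified ethyl acetate. Olivetolic acid and its derivatives remained in the ethyl acetate layer, which was subsequently dried under reduced pressure. The residue was loaded onto a RediSep Rf Gold reversed-phase C18 column on a Teledyne CombiFlash system. HPLC purification was then performed on a Shimadzu UFLC system with a Kinetex column (5 μm, 10.0 × 250 mm, C18). For HPLC purification, solvents A (0.1% formic acid in water) and B (0.1% formic acid in acetonitrile) were used at a flow rate of 4 mL/min.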
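Biosynthetic pathway to olivetolic acid and analogs. As shown in FIG. 2A, A. nidulans was recombinantly engineered to express various heterologous genes from Metarhizium anisopliae, Metarhizium rileyi and Talaromyces islandicus, respectively (i.e., HRPKS (SEQ ID NO: 2, 8 or 14), NRPKS (SEQ ID NO: 4, 10 or 16) and ΨACP-TE (SEQ ID NO: 6, 12 or 18)), thereby providing olivetolic acid (OA) and OA analogs in high yield. The HRPKS uses acetyl-CoA, malonyl-CoA and NADPH to synthesize a C6 or C8 acyl chain tethered to its acyl carrier protein (ACP). The ACP domain of the HRPKS then shuttles the acyl thioester to the starter unit:ACP transacylase (SAT) domain of the non-reducing polyketide synthase (NRPKS). After three decarboxylative condensations catalyzed by the ketosynthase (KS) domain and aromatization catalyzed by the product template (PT) domain, the thioesterase (TE) hydrolyzes the product from the NRPKS so that the next catalytic cycle can begin. Owing to the synthetic capability of the HRPKS and the relaxed substrate selectivity of the NRPKS SAT domain, olivetolic acid is obtained at high titer together with three analogs that differ in acyl chain length and degree of saturation (see FIG. 2B).
Heterologous expression of the above biosynthetic pathway in A. nidulans gave olivetolic acid and its analogs at a total titer of >4 g/L without any metabolic optimization. As shown by the liquid chromatography (LC) traces presented in FIG. 3, compound 2 was produced in a higher amount than the other compounds, with little to no formation of other contaminating products. The actual yields are presented in FIG. 2B and are as follows: compound 2 (about 4 g/L) > compound 3 (about 800 mg/L) > compound 1 (about 450 mg/L) >> compound 4 (80 mg/L). Compound 1: olivetolic acid; compound 2: 2-heptyl-4,6-dihydroxybenzoic acid; compound 3: (E)-2-(hept-1-en-1-yl)-4,6-dihydroxybenzoic acid; and compound 4: (E)-2,4-dihydroxy-6-(pent-1-en-1-yl)benzoic acid.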
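The reported per-compound titers can be tallied to check them against the stated total of >4 g/L. The Python sketch below does this arithmetic, treating the approximate values given in the text ("about 4 g/L", "about 800 mg/L", and so on) as exact for illustration; the variable names are hypothetical.

```python
# Approximate titers from the text, in mg/L, keyed by compound number.
TITERS_MG_PER_L = {
    1: 450,    # olivetolic acid
    2: 4000,   # 2-heptyl-4,6-dihydroxybenzoic acid
    3: 800,    # (E)-2-(hept-1-en-1-yl)-4,6-dihydroxybenzoic acid
    4: 80,     # (E)-2,4-dihydroxy-6-(pent-1-en-1-yl)benzoic acid
}

total_mg_per_l = sum(TITERS_MG_PER_L.values())

# Compounds ordered from highest to lowest titer.
ranking = sorted(TITERS_MG_PER_L, key=TITERS_MG_PER_L.get, reverse=True)

# Share of the total contributed by each compound.
fractions = {c: t / total_mg_per_l for c, t in TITERS_MG_PER_L.items()}

if __name__ == "__main__":
    print(f"total: {total_mg_per_l / 1000:.2f} g/L")
    for compound in ranking:
        print(f"compound {compound}: {TITERS_MG_PER_L[compound]} mg/L "
              f"({100 * fractions[compound]:.1f}% of total)")
```

With these inputs the sum is 5.33 g/L, which is consistent with the stated total of >4 g/L, and the ordering reproduces the ranking given in the text (2 > 3 > 1 >> 4).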
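Similar LC traces were obtained with the genes cloned from Tolypocladium inflatum (T. inflatum), Metarhizium rileyi (M. rileyi) and Talaromyces islandicus (T. islandicus). For the T. inflatum cluster, when the genes were heterologously expressed in A. nidulans, review of the LC showed the same profile as when the genes from M. anisopliae were expressed, with comparable titers. For the T. islandicus cluster, when heterologously expressed in A. nidulans, the LC showed predominantly olivetolic acid.
It will be understood that various modifications may be made without departing from the spirit and scope of the disclosure. Accordingly, other embodiments are within the scope of the following claims.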
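SEQUENCE LISTING
<110> The Regents of the University of California
<120> Biosynthetic platform for producing olivetolic acid and olivetolic acid analogs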
<130> 00011-091CN1
<140> PCT/US2021/012866
<141> 2021-01-09
<150> US 62/959,849
<151> 2020-01-10
<160> 18
<170> PatentIn version 3.5
<210> 1
<211> 7548
<212> DNA
<213> Metarhizium anisopliae
<220>
<221> CDS
<222> (1)..(7548)
<400> 1
atg caa gcg cca gca cca tca aga gac gac att gcc gtc gtc ggc ttg 48
Met Gln Ala Pro Ala Pro Ser Arg Asp Asp Ile Ala Val Val Gly Leu
1 5 10 15
tcg tgc cgc ttc ccg ggc gaa gca gat acc gcc gag cac ttt tgg gat 96
Ser Cys Arg Phe Pro Gly Glu Ala Asp Thr Ala Glu His Phe Trp Asp
20 25 30
ttc atc tgc aat gga cgt aat gca tac tct gag aat ccg gat cgg tgg 144
Phe Ile Cys Asn Gly Arg Asn Ala Tyr Ser Glu Asn Pro Asp Arg Trp
35 40 45
acg ccg gat gct ttt cac tac ggt gag aaa aaa atc aac acc agt ctg 192
Thr Pro Asp Ala Phe His Tyr Gly Glu Lys Lys Ile Asn Thr Ser Leu
50 55 60
ccc cgg gga ggg cat ttt atg aag caa gat gtg gcc gcc ttt gac gcc 240
Pro Arg Gly Gly His Phe Met Lys Gln Asp Val Ala Ala Phe Asp Ala
65 70 75 80
aac ttc ttc aac ctc tcc aag gtc gag gcc gag tcc atg gac ccc cag 288
Asn Phe Phe Asn Leu Ser Lys Val Glu Ala Glu Ser Met Asp Pro Gln
85 90 95
cag cgc atc atg atg gag gtg acg tac gag tcc atg gag agc gcc ggc 336
Gln Arg Ile Met Met Glu Val Thr Tyr Glu Ser Met Glu Ser Ala Gly
100 105 110
ctc cgc gtc gac cag ctc gcg ggc tcg cgg acg ggc gtc ttc atg gcc 384
Leu Arg Val Asp Gln Leu Ala Gly Ser Arg Thr Gly Val Phe Met Ala
115 120 125
agc ttc acg agc gac tac cgc gag atg ctg tac cgc gat gcc gag acg 432
Ser Phe Thr Ser Asp Tyr Arg Glu Met Leu Tyr Arg Asp Ala Glu Thr
130 135 140
gcg cct ctc tac acc gca acg ggc acc agc aac acg tcg acg tcg aac 480
Ala Pro Leu Tyr Thr Ala Thr Gly Thr Ser Asn Thr Ser Thr Ser Asn
145 150 155 160
cgc gtc tcg tgg ttc ttc gac ctg cgc ggg ccc agc ttc acc gtc aac 528
Arg Val Ser Trp Phe Phe Asp Leu Arg Gly Pro Ser Phe Thr Val Asn
165 170 175
acg gcc tgc tcg tcc agt ctg gtc gcc tgc cat ctc gcc tgc caa agc 576
Thr Ala Cys Ser Ser Ser Leu Val Ala Cys His Leu Ala Cys Gln Ser
180 185 190
cta tgg agc ggc gag acg gag agc gcc att gtc ggc ggc acc agc ctg 624
Leu Trp Ser Gly Glu Thr Glu Ser Ala Ile Val Gly Gly Thr Ser Leu
195 200 205
ctg ctg aac ccc gac atg ttc ctg tac ctt tcc aac cag cag ttc ctg 672
Leu Leu Asn Pro Asp Met Phe Leu Tyr Leu Ser Asn Gln Gln Phe Leu
210 215 220
gcc ccc gac ggc cag tgc aag agc ttt gac gag tcg ggc gac ggc tac 720
Ala Pro Asp Gly Gln Cys Lys Ser Phe Asp Glu Ser Gly Asp Gly Tyr
225 230 235 240
gcc agg ggc gac ggc atc ggc gtc gtc att ctg aag cga gtt gcc gac 768
Ala Arg Gly Asp Gly Ile Gly Val Val Ile Leu Lys Arg Val Ala Asp
245 250 255
gcc ctc cgc gac ggc gac ccg atc cgc gcc gtc atc cgt ggc agc gga 816
Ala Leu Arg Asp Gly Asp Pro Ile Arg Ala Val Ile Arg Gly Ser Gly
260 265 270
tgc aac cag gac ggc cat aca aag ggc ttc acc atc ccc agc gtc gac 864
Cys Asn Gln Asp Gly His Thr Lys Gly Phe Thr Ile Pro Ser Val Asp
275 280 285
gcg caa gcc tcc ctc att gca gaa acg tac cgc aac gcc ggc ctc tca 912
Ala Gln Ala Ser Leu Ile Ala Glu Thr Tyr Arg Asn Ala Gly Leu Ser
290 295 300
ctt gcg gag aca cgc tac gtc gag gct cac gga acg ggc acc cag gcc 960
Leu Ala Glu Thr Arg Tyr Val Glu Ala His Gly Thr Gly Thr Gln Ala
305 310 315 320
ggc gac acg cgt gag atg gaa ggc att gcc cgc aca ttc agc cag cac 1008
Gly Asp Thr Arg Glu Met Glu Gly Ile Ala Arg Thr Phe Ser Gln His
325 330 335
cgc acg gcg tcg gac gag ctg ctg gtg gga tca gtc aag gca aat atc 1056
Arg Thr Ala Ser Asp Glu Leu Leu Val Gly Ser Val Lys Ala Asn Ile
340 345 350
ggg cat ctc gaa gcc tgc gcg gga ctg gcc tcg ctc ata aag tgc gtc 1104
Gly His Leu Glu Ala Cys Ala Gly Leu Ala Ser Leu Ile Lys Cys Val
355 360 365
tac atc ctg gaa acg ggc gtg ata ccc ccg acg ccg agc gtc cgc gtc 1152
Tyr Ile Leu Glu Thr Gly Val Ile Pro Pro Thr Pro Ser Val Arg Val
370 375 380
ctg aac ccc aag atc cgc tgg gag gaa tgg cat ctc aag gtg cct gcg 1200
Leu Asn Pro Lys Ile Arg Trp Glu Glu Trp His Leu Lys Val Pro Ala
385 390 395 400
aca caa aca act tgg ccg acc gag ggc ctg cgg cgg atc agc acc caa 1248
Thr Gln Thr Thr Trp Pro Thr Glu Gly Leu Arg Arg Ile Ser Thr Gln
405 410 415
ggt ttt gga tat ggc ggt aca aac gcg cat ctg att ctc gac gac gcg 1296
Gly Phe Gly Tyr Gly Gly Thr Asn Ala His Leu Ile Leu Asp Asp Ala
420 425 430
gcc cat tat ctc gag gca cgc aaa ctc agg ggc cac cac tat acc cgt 1344
Ala His Tyr Leu Glu Ala Arg Lys Leu Arg Gly His His Tyr Thr Arg
435 440 445
aca cat ccc cag aca cag aga ctt ttg acc tcg gca atg cag gaa gac 1392
Thr His Pro Gln Thr Gln Arg Leu Leu Thr Ser Ala Met Gln Glu Asp
450 455 460
gtg tca aac gac cat ccg cca cgg tta ttt ctg ttc cgc gca aat gat 1440
Val Ser Asn Asp His Pro Pro Arg Leu Phe Leu Phe Arg Ala Asn Asp
465 470 475 480
cgc gag ggc ctg gga cgc gtc cgc tcg tcg ctg gcc cag cat ctc gag 1488
Arg Glu Gly Leu Gly Arg Val Arg Ser Ser Leu Ala Gln His Leu Glu
485 490 495
cag ctc ctc aag tcg tgg ccg cag gat tcg aga gac ggc ggc gca tac 1536
Gln Leu Leu Lys Ser Trp Pro Gln Asp Ser Arg Asp Gly Gly Ala Tyr
500 505 510
cta cac aat ctg gcc ttc acc cta gcc agt cga cgg tcc cat ctc caa 1584
Leu His Asn Leu Ala Phe Thr Leu Ala Ser Arg Arg Ser His Leu Gln
515 520 525
tgg cag acg tac gcc acg gcc tcg acg ccc tcg gag ctg ctc caa gcg 1632
Trp Gln Thr Tyr Ala Thr Ala Ser Thr Pro Ser Glu Leu Leu Gln Ala
530 535 540
ctc cag cac gag ggc agc gcg tgg gcg gct ccc gag act cgc ctc gcc 1680
Leu Gln His Glu Gly Ser Ala Trp Ala Ala Pro Glu Thr Arg Leu Ala
545 550 555 560
gcc tcg ccc ccc cgg ctc ggc ttc atc ttc acc ggc cag ggc gcg cag 1728
Ala Ser Pro Pro Arg Leu Gly Phe Ile Phe Thr Gly Gln Gly Ala Gln
565 570 575
tgg gct cgc atg ggc gtc gag ctg atg gcg tac ccc gtg ttc cgc cag 1776
Trp Ala Arg Met Gly Val Glu Leu Met Ala Tyr Pro Val Phe Arg Gln
580 585 590
agc gtc gag gcg tcg gac ggg ttt ctg cgc agc gcc ctc ggg tgc ccc 1824
Ser Val Glu Ala Ser Asp Gly Phe Leu Arg Ser Ala Leu Gly Cys Pro
595 600 605
tgg tct gcc gtc gac gag ctg gcc cag ccg cag gct acg tcg cgg ctc 1872
Trp Ser Ala Val Asp Glu Leu Ala Gln Pro Gln Ala Thr Ser Arg Leu
610 615 620
tcc gag gcg gcc tac agc cag acg ctc tgc acg gtg ctc caa atc gcc 1920
Ser Glu Ala Ala Tyr Ser Gln Thr Leu Cys Thr Val Leu Gln Ile Ala
625 630 635 640
acc gtc gac ctg ctc gag gac tgg aac gtc tgt ccc acg cgc gtg gcc 1968
Thr Val Asp Leu Leu Glu Asp Trp Asn Val Cys Pro Thr Arg Val Ala
645 650 655
ggg cac tcg agc ggc gag atc gcc gcc gcc tac tgc ctg ggc gcc ctg 2016
Gly His Ser Ser Gly Glu Ile Ala Ala Ala Tyr Cys Leu Gly Ala Leu
660 665 670
agc aag cac gac agt ctg cgg gtg gcc tac tac cgc ggg att ctg tcc 2064
Ser Lys His Asp Ser Leu Arg Val Ala Tyr Tyr Arg Gly Ile Leu Ser
675 680 685
tcg gag atg cag cag aca cac gcg gat cgc agg gga gcc atg atg gcc 2112
Ser Glu Met Gln Gln Thr His Ala Asp Arg Arg Gly Ala Met Met Ala
690 695 700
gtc ggg gct tcc ccc gaa gag gtc gag gcg tgg ctg gcc aag ctg acc 2160
Val Gly Ala Ser Pro Glu Glu Val Glu Ala Trp Leu Ala Lys Leu Thr
705 710 715 720
cgg gga cga gtc gtc gtc gcc tgc atc aac tcg ccg acc agc gtc acg 2208
Arg Gly Arg Val Val Val Ala Cys Ile Asn Ser Pro Thr Ser Val Thr
725 730 735
gca tcc ggg gac gcc gcg ggc gtc gac gag ctt ctc gcc atg gtc caa 2256
Ala Ser Gly Asp Ala Ala Gly Val Asp Glu Leu Leu Ala Met Val Gln
740 745 750
cag gcc ggc gtg ttt ggg cgc aag ctg cag gtg gac gtg gcc tat cac 2304
Gln Ala Gly Val Phe Gly Arg Lys Leu Gln Val Asp Val Ala Tyr His
755 760 765
tct cac cac atg cag tcg gtt tct tcc gcg tac tct gag ctc ctc aag 2352
Ser His His Met Gln Ser Val Ser Ser Ala Tyr Ser Glu Leu Leu Lys
770 775 780
gat ctt gcg ccg ctg ccg gcg cgt ccg gga cgc acc atg cac tcg agc 2400
Asp Leu Ala Pro Leu Pro Ala Arg Pro Gly Arg Thr Met His Ser Ser
785 790 795 800
gtc ttg ggc cgt gtc att gac gcc gcg gag ctc ggc gcc tcc aac tgg 2448
Val Leu Gly Arg Val Ile Asp Ala Ala Glu Leu Gly Ala Ser Asn Trp
805 810 815
gtg caa aac ctc gtc tcc ccg gtg cgc ttc tcc gaa gcc gtg tcg agc 2496
Val Gln Asn Leu Val Ser Pro Val Arg Phe Ser Glu Ala Val Ser Ser
820 825 830
ctc ctc tcc gcc ggg gac aag ccg gcc gtc gat gtg ctc gtc gag att 2544
Leu Leu Ser Ala Gly Asp Lys Pro Ala Val Asp Val Leu Val Glu Ile
835 840 845
gga ccg cac gcc gcg ctc aag ggg ccc gtc cag cag atc ctc cag gcc 2592
Gly Pro His Ala Ala Leu Lys Gly Pro Val Gln Gln Ile Leu Gln Ala
850 855 860
cag ggc gtg tcc gcg gtc aag tac acg agt gtc ctc tcc cgg gga cag 2640
Gln Gly Val Ser Ala Val Lys Tyr Thr Ser Val Leu Ser Arg Gly Gln
865 870 875 880
agc gcc gta aag acg gct ctg gcg tgc gcc ggc gag ctc gtc ctg tcg 2688
Ser Ala Val Lys Thr Ala Leu Ala Cys Ala Gly Glu Leu Val Leu Ser
885 890 895
agt gtg ccc gtc gcc gtg tct cgc gta aac ttg gag tcc ggg ccg ccg 2736
Ser Val Pro Val Ala Val Ser Arg Val Asn Leu Glu Ser Gly Pro Pro
900 905 910
ccg agt ccg ttg gtc gac ctg ccc ccc tat ccc tgg aac cga tca act 2784
Pro Ser Pro Leu Val Asp Leu Pro Pro Tyr Pro Trp Asn Arg Ser Thr
915 920 925
cga ttc tgg gcc gag tcg cgt ctt tcc cga gag tat cgg ctt cgc aag 2832
Arg Phe Trp Ala Glu Ser Arg Leu Ser Arg Glu Tyr Arg Leu Arg Lys
930 935 940
cac gcc cgc ctg ccg ctg ctg gga agt ccg tgt ccc acg atg ggc gcc 2880
His Ala Arg Leu Pro Leu Leu Gly Ser Pro Cys Pro Thr Met Gly Ala
945 950 955 960
cgc gag aga tac tgg cgc ggc atg gtg agg ttg gag gag gag ccc tgg 2928
Arg Glu Arg Tyr Trp Arg Gly Met Val Arg Leu Glu Glu Glu Pro Trp
965 970 975
atc cgg gac cat gag atc cag ggg tcc atc ctg tat ccc ggg gcc ggc 2976
Ile Arg Asp His Glu Ile Gln Gly Ser Ile Leu Tyr Pro Gly Ala Gly
980 985 990
ttc ttg atc atg gcc att gaa gct gcc tcc cag cag gca ggc gag cag 3024
Phe Leu Ile Met Ala Ile Glu Ala Ala Ser Gln Gln Ala Gly Glu Gln
995 1000 1005
cgc aaa gta agc gca ttc cga ctg cgc gac gtg cac ctc gac gcc 3069
Arg Lys Val Ser Ala Phe Arg Leu Arg Asp Val His Leu Asp Ala
1010 1015 1020
gcc ttg gtg gtg acc gag gac agc acc gcc gag gcc att ctg caa 3114
Ala Leu Val Val Thr Glu Asp Ser Thr Ala Glu Ala Ile Leu Gln
1025 1030 1035
ctc cga ccg cat ctt ctc gcg ccg ggc agc agc cag tcg tcc tgg 3159
Leu Arg Pro His Leu Leu Ala Pro Gly Ser Ser Gln Ser Ser Trp
1040 1045 1050
atg gag ttt acc gtc aat tca tct att gac ggc ggt gac ttg cgt 3204
Met Glu Phe Thr Val Asn Ser Ser Ile Asp Gly Gly Asp Leu Arg
1055 1060 1065
cag aac tgc tcc ggc ctc atc atg atc gag tat gcc gcc gac gcc 3249
Gln Asn Cys Ser Gly Leu Ile Met Ile Glu Tyr Ala Ala Asp Ala
1070 1075 1080
gac tcg gcc atg gac cgc gag cgt gcc ctg gag tcg gac atg gtt 3294
Asp Ser Ala Met Asp Arg Glu Arg Ala Leu Glu Ser Asp Met Val
1085 1090 1095
tgt gac tgg tac aag aaa acg tac gtc tct tgc cag cag tct gtc 3339
Cys Asp Trp Tyr Lys Lys Thr Tyr Val Ser Cys Gln Gln Ser Val
1100 1105 1110
gat gtg ggc aaa ttc tac tcg cgc ctt gct tct ctc ggc ctt gtt 3384
Asp Val Gly Lys Phe Tyr Ser Arg Leu Ala Ser Leu Gly Leu Val
1115 1120 1125
tac gga cca acc ttt gca aac gtg acg gag att cgg agg acg ggc 3429
Tyr Gly Pro Thr Phe Ala Asn Val Thr Glu Ile Arg Arg Thr Gly
1130 1135 1140
cag ggc cag tgt atc ggt gcc gtc cgt atc ccg gcc gtg gac agc 3474
Gln Gly Gln Cys Ile Gly Ala Val Arg Ile Pro Ala Val Asp Ser
1145 1150 1155
ctc gtg ccg ccc gca tac cgc agc cat cct cac gtc atc cat ccg 3519
Leu Val Pro Pro Ala Tyr Arg Ser His Pro His Val Ile His Pro
1160 1165 1170
ggg acg ttg gat gcc gtc ttc cac ctc gcc ttt gcg gcg ctc gag 3564
Gly Thr Leu Asp Ala Val Phe His Leu Ala Phe Ala Ala Leu Glu
1175 1180 1185
gac tcg ttg ctt ccg ggc ccc atg gtc cca acg aca atc gac gag 3609
Asp Ser Leu Leu Pro Gly Pro Met Val Pro Thr Thr Ile Asp Glu
1190 1195 1200
ctg gtc gtg gca gca gat aca cca aac acc cct ggc act ctg ctt 3654
Leu Val Val Ala Ala Asp Thr Pro Asn Thr Pro Gly Thr Leu Leu
1205 1210 1215
cgg gga gtc tca cgc tct tct cct cac ggc ttc aga gag ctc atc 3699
Arg Gly Val Ser Arg Ser Ser Pro His Gly Phe Arg Glu Leu Ile
1220 1225 1230
tcc gac att gac atg ctg gac gac caa agc agc aga gca ctt gtg 3744
Ser Asp Ile Asp Met Leu Asp Asp Gln Ser Ser Arg Ala Leu Val
1235 1240 1245
caa atc aag ggg ttc cgt tgc gcc gac gta tcc ggg ggg cgc atg 3789
Gln Ile Lys Gly Phe Arg Cys Ala Asp Val Ser Gly Gly Arg Met
1250 1255 1260
acg tcg tcg gag gcg gcg tca gca gag agc cgg ccg att ggc ttc 3834
Thr Ser Ser Glu Ala Ala Ser Ala Glu Ser Arg Pro Ile Gly Phe
1265 1270 1275
cgt ctc gag tgg aag ccg gca atc gac ttg ctg acc ggt gag cag 3879
Arg Leu Glu Trp Lys Pro Ala Ile Asp Leu Leu Thr Gly Glu Gln
1280 1285 1290
cta cgg aca cat ctt gac cgt cgt gtc aag cag gag ggt gcg tcc 3924
Leu Arg Thr His Leu Asp Arg Arg Val Lys Gln Glu Gly Ala Ser
1295 1300 1305
aac gtc gcc cgc gcc aca gag ctg aac aat cat gtc cat cac ctt 3969
Asn Val Ala Arg Ala Thr Glu Leu Asn Asn His Val His His Leu
1310 1315 1320
gaa gaa act tta cct cgc gtt gcc gtg gat cct gcc atg gca aac 4014
Glu Glu Thr Leu Pro Arg Val Ala Val Asp Pro Ala Met Ala Asn
1325 1330 1335
ttg tct gac tgg ctg tcg gcc aag tct gca aaa ctc acg aat ggt 4059
Leu Ser Asp Trp Leu Ser Ala Lys Ser Ala Lys Leu Thr Asn Gly
1340 1345 1350
act act tca tca tcc aaa cgt cta tcc cca ggg ggt gac atg ctc 4104
Thr Thr Ser Ser Ser Lys Arg Leu Ser Pro Gly Gly Asp Met Leu
1355 1360 1365
gca atg aga gac gcc ttg acc gcc gtg cga gca ggg agc att cca 4149
Ala Met Arg Asp Ala Leu Thr Ala Val Arg Ala Gly Ser Ile Pro
1370 1375 1380
tca cca gaa caa caa gac agg atg ctg aga gag gtg gag caa aac 4194
Ser Pro Glu Gln Gln Asp Arg Met Leu Arg Glu Val Glu Gln Asn
1385 1390 1395
ggc gct ctg tcc att cta ttc aag ccg ctc gac gca tat atc gac 4239
Gly Ala Leu Ser Ile Leu Phe Lys Pro Leu Asp Ala Tyr Ile Asp
1400 1405 1410
ctt cgc cat cat gcc aag ccc aac ctg tcg att ctt gag ctg agc 4284
Leu Arg His His Ala Lys Pro Asn Leu Ser Ile Leu Glu Leu Ser
1415 1420 1425
ctg gat tcg gtg cca tac tct gtc ttt gca gcc ctg ccc agt cga 4329
Leu Asp Ser Val Pro Tyr Ser Val Phe Ala Ala Leu Pro Ser Arg
1430 1435 1440
cac aag att ctc cag aca gcg cag tac gcc att aga gta tcg caa 4374
His Lys Ile Leu Gln Thr Ala Gln Tyr Ala Ile Arg Val Ser Gln
1445 1450 1455
gag ggc gtc gcc gac cga gtc agg gcc cag ttt ggg tct cag gct 4419
Glu Gly Val Ala Asp Arg Val Arg Ala Gln Phe Gly Ser Gln Ala
1460 1465 1470
tcc gac att gac gtc tcc gtc aca gac ttt aca aag aaa ctc gac 4464
Ser Asp Ile Asp Val Ser Val Thr Asp Phe Thr Lys Lys Leu Asp
1475 1480 1485
gag ggc ttg gga aag cat gat gtc att ctc ata ttt gac cct ggc 4509
Glu Gly Leu Gly Lys His Asp Val Ile Leu Ile Phe Asp Pro Gly
1490 1495 1500
ttc gta cac gca aag cta gag gtc gtt ttg cgc aac gcg cgc aag 4554
Phe Val His Ala Lys Leu Glu Val Val Leu Arg Asn Ala Arg Lys
1505 1510 1515
ctg ttg aac cca ggg ggc agg atc gtc gtc gca gaa gtc agc gac 4599
Leu Leu Asn Pro Gly Gly Arg Ile Val Val Ala Glu Val Ser Asp
1520 1525 1530
cct ggg ctc tac ttg ggc aca gca ctg ggc tgt ctt cag tgg aca 4644
Pro Gly Leu Tyr Leu Gly Thr Ala Leu Gly Cys Leu Gln Trp Thr
1535 1540 1545
aga aac cta gac gtt gcc cag agc agc agc agc tgg aca tcg tgt 4689
Arg Asn Leu Asp Val Ala Gln Ser Ser Ser Ser Trp Thr Ser Cys
1550 1555 1560
ctc gcg cgc tcg gga ctg acg cct gct ctc aaa ctc atc gac atg 4734
Leu Ala Arg Ser Gly Leu Thr Pro Ala Leu Lys Leu Ile Asp Met
1565 1570 1575
gac aca gag tcc gcc gtt cac gga cac ttc cgc ctg agt ctc aca 4779
Asp Thr Glu Ser Ala Val His Gly His Phe Arg Leu Ser Leu Thr
1580 1585 1590
ggc aat gcc gcc gag tcg acc aac agt gac aat cgc cag ccg cag 4824
Gly Asn Ala Ala Glu Ser Thr Asn Ser Asp Asn Arg Gln Pro Gln
1595 1600 1605
caa gtc acc ctc ata gaa gcc gcc aat cca tct gcc acg gcg caa 4869
Gln Val Thr Leu Ile Glu Ala Ala Asn Pro Ser Ala Thr Ala Gln
1610 1615 1620
gat atc gcg gca gcc gtg gcc cag aat ctt gac aag gcg tcg att 4914
Asp Ile Ala Ala Ala Val Ala Gln Asn Leu Asp Lys Ala Ser Ile
1625 1630 1635
ccc aca aag cgc atc cgt tgg ggc tcc gac gtg tcg cag ctc aag 4959
Pro Thr Lys Arg Ile Arg Trp Gly Ser Asp Val Ser Gln Leu Lys
1640 1645 1650
ggc cag cct tgc atc gtc ctg acg gac ttg gag tct gcg ctt ctc 5004
Gly Gln Pro Cys Ile Val Leu Thr Asp Leu Glu Ser Ala Leu Leu
1655 1660 1665
aag gac ccg gca cca gag gat ctc gcg gcc ctg cag tcg ctg ttc 5049
Lys Asp Pro Ala Pro Glu Asp Leu Ala Ala Leu Gln Ser Leu Phe
1670 1675 1680
gcg cat gcc gag agc acc ctc tgg gtc agt ggc ccc ctg gga cct 5094
Ala His Ala Glu Ser Thr Leu Trp Val Ser Gly Pro Leu Gly Pro
1685 1690 1695
gat gct gct ctg atc acg ggc ctg tct cgc agc gtt tgc aac gag 5139
Asp Ala Ala Leu Ile Thr Gly Leu Ser Arg Ser Val Cys Asn Glu
1700 1705 1710
gcg gcc gac gtc cat ata cgc acg ctt gag gtg act gat ctg cct 5184
Ala Ala Asp Val His Ile Arg Thr Leu Glu Val Thr Asp Leu Pro
1715 1720 1725
ggc ccc ggg gcc gac agc tac gcc gac ctg gtc act cgc gtc ttc 5229
Gly Pro Gly Ala Asp Ser Tyr Ala Asp Leu Val Thr Arg Val Phe
1730 1735 1740
cgg tat agc ggt ccc gat aca gag ttt cgg tgg cat tca gac gcg 5274
Arg Tyr Ser Gly Pro Asp Thr Glu Phe Arg Trp His Ser Asp Ala
1745 1750 1755
ctg ctt gtc agc cgc ctg gtc gag gat gag gcc cga aac aag gag 5319
Leu Leu Val Ser Arg Leu Val Glu Asp Glu Ala Arg Asn Lys Glu
1760 1765 1770
att gca cag ctg ctg ggc cag gga gaa aag gcc gcg gtt gcg act 5364
Ile Ala Gln Leu Leu Gly Gln Gly Glu Lys Ala Ala Val Ala Thr
1775 1780 1785
acg cta cag gag aag cca gag gga ctg aag cta tgc atg cgc cag 5409
Thr Leu Gln Glu Lys Pro Glu Gly Leu Lys Leu Cys Met Arg Gln
1790 1795 1800
att ggc atg ctg gac tct gtt tgc ttt gag ccc gac ttg ttg gct 5454
Ile Gly Met Leu Asp Ser Val Cys Phe Glu Pro Asp Leu Leu Ala
1805 1810 1815
ttg gag cca ctg gaa gca ggc gag gtg gaa gtc gac gtc aag gcc 5499
Leu Glu Pro Leu Glu Ala Gly Glu Val Glu Val Asp Val Lys Ala
1820 1825 1830
tcc gga gtc aac ttc cga gat gtc atg gtc gcc ttg gga cag att 5544
Ser Gly Val Asn Phe Arg Asp Val Met Val Ala Leu Gly Gln Ile
1835 1840 1845
cca gac cgg gca ttc ggg ttc gag ggc gct ggt gtc gtt cgc cgt 5589
Pro Asp Arg Ala Phe Gly Phe Glu Gly Ala Gly Val Val Arg Arg
1850 1855 1860
gta cat gct tca gag acg cgc ctc cgc cca gga gac cga gtc gtc 5634
Val His Ala Ser Glu Thr Arg Leu Arg Pro Gly Asp Arg Val Val
1865 1870 1875
ttc ctc gct cac gga gca cac cgt aca gtc cat cgc gta cgc gcc 5679
Phe Leu Ala His Gly Ala His Arg Thr Val His Arg Val Arg Ala
1880 1885 1890
gac tac gcc atg cct atg cct gat acc atg agc ttt gaa gag ggc 5724
Asp Tyr Ala Met Pro Met Pro Asp Thr Met Ser Phe Glu Glu Gly
1895 1900 1905
gcg gcc att ctc ctc gtc cac acg aca gct tgg tac gca ctc gtc 5769
Ala Ala Ile Leu Leu Val His Thr Thr Ala Trp Tyr Ala Leu Val
1910 1915 1920
aag tcg gcg cgc gca aca gcc ggc cag tca gtc ctc gtt cac gct 5814
Lys Ser Ala Arg Ala Thr Ala Gly Gln Ser Val Leu Val His Ala
1925 1930 1935
gcc gca ggt ggt gtt ggc cag gcc gtc ctc atg ctt gct cga cat 5859
Ala Ala Gly Gly Val Gly Gln Ala Val Leu Met Leu Ala Arg His
1940 1945 1950
cta ggt cta cag gtt ttc gcg acg gtt ggt tcc gag gag aag agg 5904
Leu Gly Leu Gln Val Phe Ala Thr Val Gly Ser Glu Glu Lys Arg
1955 1960 1965
aag ctt gtg cac gaa acg tac ggg gtt ccc cac gac cac atc ttc 5949
Lys Leu Val His Glu Thr Tyr Gly Val Pro His Asp His Ile Phe
1970 1975 1980
aac tcg cga gac gcc agc ttt gcc atg ggc gtg aag cgc atg acc 5994
Asn Ser Arg Asp Ala Ser Phe Ala Met Gly Val Lys Arg Met Thr
1985 1990 1995
aaa ggc cgc ggg gtc gat att gtt gtc aat tcg ctg gct ggg gaa 6039
Lys Gly Arg Gly Val Asp Ile Val Val Asn Ser Leu Ala Gly Glu
2000 2005 2010
gct ctc cgg cag acg tgg cac tgc ctg gcc ccc ttt ggc acc ttt 6084
Ala Leu Arg Gln Thr Trp His Cys Leu Ala Pro Phe Gly Thr Phe
2015 2020 2025
gtc gag ctc ggc atg aag gac atc ttg gac aac gca cgc ctg gac 6129
Val Glu Leu Gly Met Lys Asp Ile Leu Asp Asn Ala Arg Leu Asp
2030 2035 2040
atg aag ccc ttc ctc cag gat gcc aca ttc gtc ttc ttt aac ctg 6174
Met Lys Pro Phe Leu Gln Asp Ala Thr Phe Val Phe Phe Asn Leu
2045 2050 2055
aac cgt gtc caa aag gag cgg cca gac ctc atg gga gag gct ctc 6219
Asn Arg Val Gln Lys Glu Arg Pro Asp Leu Met Gly Glu Ala Leu
2060 2065 2070
cga gag aca atg gcc ctt gta cgc tcc ggc gct ctc aag ccc gcg 6264
Arg Glu Thr Met Ala Leu Val Arg Ser Gly Ala Leu Lys Pro Ala
2075 2080 2085
acg ccg ctc acc tcg tat ccc gcc tct cag gtg gaa gcg gca ttc 6309
Thr Pro Leu Thr Ser Tyr Pro Ala Ser Gln Val Glu Ala Ala Phe
2090 2095 2100
cgc aag att caa acg ggc cag cac cta ggg aag ctc gtg ctg aca 6354
Arg Lys Ile Gln Thr Gly Gln His Leu Gly Lys Leu Val Leu Thr
2105 2110 2115
ttc cag gag gga gat gtt gtc ccc gtc gtc aga cca gac ctc agc 6399
Phe Gln Glu Gly Asp Val Val Pro Val Val Arg Pro Asp Leu Ser
2120 2125 2130
cta agt gac tct ggc acc tac ctt ctc gtc gga gga ctc ggc ggc 6444
Leu Ser Asp Ser Gly Thr Tyr Leu Leu Val Gly Gly Leu Gly Gly
2135 2140 2145
ttg ggc cgg agt ctt gca cgg ctc ctg gtg cag ctt ggg gcg cgc 6489
Leu Gly Arg Ser Leu Ala Arg Leu Leu Val Gln Leu Gly Ala Arg
2150 2155 2160
cgg ctg tgc ttc ctc tct cgc tcc ggc gca gca agc agc gag gcg 6534
Arg Leu Cys Phe Leu Ser Arg Ser Gly Ala Ala Ser Ser Glu Ala
2165 2170 2175
cgc gcc ctc gtc aag gaa ctg gag atg cag cat cga gta cgc gtc 6579
Arg Ala Leu Val Lys Glu Leu Glu Met Gln His Arg Val Arg Val
2180 2185 2190
ctc gtc tgc aaa ggg gac gtg tcc gac gcc gac acc gta tcc cgc 6624
Leu Val Cys Lys Gly Asp Val Ser Asp Ala Asp Thr Val Ser Arg
2195 2200 2205
gtc gtc cag caa tgc cgg gcg gct ctg ggg ccc atc cgg ggc gtc 6669
Val Val Gln Gln Cys Arg Ala Ala Leu Gly Pro Ile Arg Gly Val
2210 2215 2220
att cag tgt gcc atg gtc ctc cgt gac ggt ctc ttt gag agg atg 6714
Ile Gln Cys Ala Met Val Leu Arg Asp Gly Leu Phe Glu Arg Met
2225 2230 2235
gct cac gat cag tgg acc gaa agc acg cgg ccc aag gtg cag ggc 6759
Ala His Asp Gln Trp Thr Glu Ser Thr Arg Pro Lys Val Gln Gly
2240 2245 2250
acg tgg aac ctg cac gag cag atc cca gtg tcc gac ttt ttc atc 6804
Thr Trp Asn Leu His Glu Gln Ile Pro Val Ser Asp Phe Phe Ile
2255 2260 2265
acg ctg agt tcc ttt gcg ggc gtc ttt gga agc cgt ggg cag agc 6849
Thr Leu Ser Ser Phe Ala Gly Val Phe Gly Ser Arg Gly Gln Ser
2270 2275 2280
aac tac gcc gct gcg ggt gcg tac gag gat gcc atg gca cac cat 6894
Asn Tyr Ala Ala Ala Gly Ala Tyr Glu Asp Ala Met Ala His His
2285 2290 2295
cgg gag tct ctg ggc cag agg gcc atc acc atc gac ttg ggc atc 6939
Arg Glu Ser Leu Gly Gln Arg Ala Ile Thr Ile Asp Leu Gly Ile
2300 2305 2310
atg cga gac gtg ggt gtt ctc gcc gag aac ggc atc acc gac tat 6984
Met Arg Asp Val Gly Val Leu Ala Glu Asn Gly Ile Thr Asp Tyr
2315 2320 2325
ctc cgc gag tgg gag gag ccg ttt gga atc cgc gag ccc gag ttc 7029
Leu Arg Glu Trp Glu Glu Pro Phe Gly Ile Arg Glu Pro Glu Phe
2330 2335 2340
cat gcg ctc atc aag tca gcc atc atg tcg acg acg cag ccc ctg 7074
His Ala Leu Ile Lys Ser Ala Ile Met Ser Thr Thr Gln Pro Leu
2345 2350 2355
act gaa cgc tcc gtg gtg cag atc cca acc ggc ctg gcc acg gcc 7119
Thr Glu Arg Ser Val Val Gln Ile Pro Thr Gly Leu Ala Thr Ala
2360 2365 2370
cgg tct gcg cag gca gcc ggt ata agc aca ccg ttc tac ttt gat 7164
Arg Ser Ala Gln Ala Ala Gly Ile Ser Thr Pro Phe Tyr Phe Asp
2375 2380 2385
gat gcc cgt ttc tcc atc ctg gcc cag aca cgc gcc tcg gcc ggt 7209
Asp Ala Arg Phe Ser Ile Leu Ala Gln Thr Arg Ala Ser Ala Gly
2390 2395 2400
gcc tcg tct gca gct ggg tct ggt gac gcc gat gcc ggc aag gtt 7254
Ala Ser Ser Ala Ala Gly Ser Gly Asp Ala Asp Ala Gly Lys Val
2405 2410 2415
tct gtg cgg acg cag ctt tcc cag gct cat tcc gtg gct gaa gcc 7299
Ser Val Arg Thr Gln Leu Ser Gln Ala His Ser Val Ala Glu Ala
2420 2425 2430
gcc gcc gcc gtc cag acg gtg ctt ctt gag cgc gtg gca agg acc 7344
Ala Ala Ala Val Gln Thr Val Leu Leu Glu Arg Val Ala Arg Thr
2435 2440 2445
ctt cag agc tcc gtg gcg gaa atc gat ccc tcc cgg cca ctg cac 7389
Leu Gln Ser Ser Val Ala Glu Ile Asp Pro Ser Arg Pro Leu His
2450 2455 2460
tcg tac ggt gta gat tcc ttg gtg gcc gtg gaa acg gtc aag tgg 7434
Ser Tyr Gly Val Asp Ser Leu Val Ala Val Glu Thr Val Lys Trp
2465 2470 2475
atg ttt aag acg ctg gac gct aag atg acg gtg ttt gat gtt ctt 7479
Met Phe Lys Thr Leu Asp Ala Lys Met Thr Val Phe Asp Val Leu
2480 2485 2490
tcc aac gtg tcc atc acg gcg ctg tgc gag aag att gca tcc atg 7524
Ser Asn Val Ser Ile Thr Ala Leu Cys Glu Lys Ile Ala Ser Met
2495 2500 2505
tct act ttg gtg aaa ttg aac tag 7548
Ser Thr Leu Val Lys Leu Asn
2510 2515
<210> 2
<211> 2515
<212> PRT
<213> Metarhizium anisopliae
<400> 2
Met Gln Ala Pro Ala Pro Ser Arg Asp Asp Ile Ala Val Val Gly Leu
1 5 10 15
Ser Cys Arg Phe Pro Gly Glu Ala Asp Thr Ala Glu His Phe Trp Asp
20 25 30
Phe Ile Cys Asn Gly Arg Asn Ala Tyr Ser Glu Asn Pro Asp Arg Trp
35 40 45
Thr Pro Asp Ala Phe His Tyr Gly Glu Lys Lys Ile Asn Thr Ser Leu
50 55 60
Pro Arg Gly Gly His Phe Met Lys Gln Asp Val Ala Ala Phe Asp Ala
65 70 75 80
Asn Phe Phe Asn Leu Ser Lys Val Glu Ala Glu Ser Met Asp Pro Gln
85 90 95
Gln Arg Ile Met Met Glu Val Thr Tyr Glu Ser Met Glu Ser Ala Gly
100 105 110
Leu Arg Val Asp Gln Leu Ala Gly Ser Arg Thr Gly Val Phe Met Ala
115 120 125
Ser Phe Thr Ser Asp Tyr Arg Glu Met Leu Tyr Arg Asp Ala Glu Thr
130 135 140
Ala Pro Leu Tyr Thr Ala Thr Gly Thr Ser Asn Thr Ser Thr Ser Asn
145 150 155 160
Arg Val Ser Trp Phe Phe Asp Leu Arg Gly Pro Ser Phe Thr Val Asn
165 170 175
Thr Ala Cys Ser Ser Ser Leu Val Ala Cys His Leu Ala Cys Gln Ser
180 185 190
Leu Trp Ser Gly Glu Thr Glu Ser Ala Ile Val Gly Gly Thr Ser Leu
195 200 205
Leu Leu Asn Pro Asp Met Phe Leu Tyr Leu Ser Asn Gln Gln Phe Leu
210 215 220
Ala Pro Asp Gly Gln Cys Lys Ser Phe Asp Glu Ser Gly Asp Gly Tyr
225 230 235 240
Ala Arg Gly Asp Gly Ile Gly Val Val Ile Leu Lys Arg Val Ala Asp
245 250 255
Ala Leu Arg Asp Gly Asp Pro Ile Arg Ala Val Ile Arg Gly Ser Gly
260 265 270
Cys Asn Gln Asp Gly His Thr Lys Gly Phe Thr Ile Pro Ser Val Asp
275 280 285
Ala Gln Ala Ser Leu Ile Ala Glu Thr Tyr Arg Asn Ala Gly Leu Ser
290 295 300
Leu Ala Glu Thr Arg Tyr Val Glu Ala His Gly Thr Gly Thr Gln Ala
305 310 315 320
Gly Asp Thr Arg Glu Met Glu Gly Ile Ala Arg Thr Phe Ser Gln His
325 330 335
Arg Thr Ala Ser Asp Glu Leu Leu Val Gly Ser Val Lys Ala Asn Ile
340 345 350
Gly His Leu Glu Ala Cys Ala Gly Leu Ala Ser Leu Ile Lys Cys Val
355 360 365
Tyr Ile Leu Glu Thr Gly Val Ile Pro Pro Thr Pro Ser Val Arg Val
370 375 380
Leu Asn Pro Lys Ile Arg Trp Glu Glu Trp His Leu Lys Val Pro Ala
385 390 395 400
Thr Gln Thr Thr Trp Pro Thr Glu Gly Leu Arg Arg Ile Ser Thr Gln
405 410 415
Gly Phe Gly Tyr Gly Gly Thr Asn Ala His Leu Ile Leu Asp Asp Ala
420 425 430
Ala His Tyr Leu Glu Ala Arg Lys Leu Arg Gly His His Tyr Thr Arg
435 440 445
Thr His Pro Gln Thr Gln Arg Leu Leu Thr Ser Ala Met Gln Glu Asp
450 455 460
Val Ser Asn Asp His Pro Pro Arg Leu Phe Leu Phe Arg Ala Asn Asp
465 470 475 480
Arg Glu Gly Leu Gly Arg Val Arg Ser Ser Leu Ala Gln His Leu Glu
485 490 495
Gln Leu Leu Lys Ser Trp Pro Gln Asp Ser Arg Asp Gly Gly Ala Tyr
500 505 510
Leu His Asn Leu Ala Phe Thr Leu Ala Ser Arg Arg Ser His Leu Gln
515 520 525
Trp Gln Thr Tyr Ala Thr Ala Ser Thr Pro Ser Glu Leu Leu Gln Ala
530 535 540
Leu Gln His Glu Gly Ser Ala Trp Ala Ala Pro Glu Thr Arg Leu Ala
545 550 555 560
Ala Ser Pro Pro Arg Leu Gly Phe Ile Phe Thr Gly Gln Gly Ala Gln
565 570 575
Trp Ala Arg Met Gly Val Glu Leu Met Ala Tyr Pro Val Phe Arg Gln
580 585 590
Ser Val Glu Ala Ser Asp Gly Phe Leu Arg Ser Ala Leu Gly Cys Pro
595 600 605
Trp Ser Ala Val Asp Glu Leu Ala Gln Pro Gln Ala Thr Ser Arg Leu
610 615 620
Ser Glu Ala Ala Tyr Ser Gln Thr Leu Cys Thr Val Leu Gln Ile Ala
625 630 635 640
Thr Val Asp Leu Leu Glu Asp Trp Asn Val Cys Pro Thr Arg Val Ala
645 650 655
Gly His Ser Ser Gly Glu Ile Ala Ala Ala Tyr Cys Leu Gly Ala Leu
660 665 670
Ser Lys His Asp Ser Leu Arg Val Ala Tyr Tyr Arg Gly Ile Leu Ser
675 680 685
Ser Glu Met Gln Gln Thr His Ala Asp Arg Arg Gly Ala Met Met Ala
690 695 700
Val Gly Ala Ser Pro Glu Glu Val Glu Ala Trp Leu Ala Lys Leu Thr
705 710 715 720
Arg Gly Arg Val Val Val Ala Cys Ile Asn Ser Pro Thr Ser Val Thr
725 730 735
Ala Ser Gly Asp Ala Ala Gly Val Asp Glu Leu Leu Ala Met Val Gln
740 745 750
Gln Ala Gly Val Phe Gly Arg Lys Leu Gln Val Asp Val Ala Tyr His
755 760 765
Ser His His Met Gln Ser Val Ser Ser Ala Tyr Ser Glu Leu Leu Lys
770 775 780
Asp Leu Ala Pro Leu Pro Ala Arg Pro Gly Arg Thr Met His Ser Ser
785 790 795 800
Val Leu Gly Arg Val Ile Asp Ala Ala Glu Leu Gly Ala Ser Asn Trp
805 810 815
Val Gln Asn Leu Val Ser Pro Val Arg Phe Ser Glu Ala Val Ser Ser
820 825 830
Leu Leu Ser Ala Gly Asp Lys Pro Ala Val Asp Val Leu Val Glu Ile
835 840 845
Gly Pro His Ala Ala Leu Lys Gly Pro Val Gln Gln Ile Leu Gln Ala
850 855 860
Gln Gly Val Ser Ala Val Lys Tyr Thr Ser Val Leu Ser Arg Gly Gln
865 870 875 880
Ser Ala Val Lys Thr Ala Leu Ala Cys Ala Gly Glu Leu Val Leu Ser
885 890 895
Ser Val Pro Val Ala Val Ser Arg Val Asn Leu Glu Ser Gly Pro Pro
900 905 910
Pro Ser Pro Leu Val Asp Leu Pro Pro Tyr Pro Trp Asn Arg Ser Thr
915 920 925
Arg Phe Trp Ala Glu Ser Arg Leu Ser Arg Glu Tyr Arg Leu Arg Lys
930 935 940
His Ala Arg Leu Pro Leu Leu Gly Ser Pro Cys Pro Thr Met Gly Ala
945 950 955 960
Arg Glu Arg Tyr Trp Arg Gly Met Val Arg Leu Glu Glu Glu Pro Trp
965 970 975
Ile Arg Asp His Glu Ile Gln Gly Ser Ile Leu Tyr Pro Gly Ala Gly
980 985 990
Phe Leu Ile Met Ala Ile Glu Ala Ala Ser Gln Gln Ala Gly Glu Gln
995 1000 1005
Arg Lys Val Ser Ala Phe Arg Leu Arg Asp Val His Leu Asp Ala
1010 1015 1020
Ala Leu Val Val Thr Glu Asp Ser Thr Ala Glu Ala Ile Leu Gln
1025 1030 1035
Leu Arg Pro His Leu Leu Ala Pro Gly Ser Ser Gln Ser Ser Trp
1040 1045 1050
Met Glu Phe Thr Val Asn Ser Ser Ile Asp Gly Gly Asp Leu Arg
1055 1060 1065
Gln Asn Cys Ser Gly Leu Ile Met Ile Glu Tyr Ala Ala Asp Ala
1070 1075 1080
Asp Ser Ala Met Asp Arg Glu Arg Ala Leu Glu Ser Asp Met Val
1085 1090 1095
Cys Asp Trp Tyr Lys Lys Thr Tyr Val Ser Cys Gln Gln Ser Val
1100 1105 1110
Asp Val Gly Lys Phe Tyr Ser Arg Leu Ala Ser Leu Gly Leu Val
1115 1120 1125
Tyr Gly Pro Thr Phe Ala Asn Val Thr Glu Ile Arg Arg Thr Gly
1130 1135 1140
Gln Gly Gln Cys Ile Gly Ala Val Arg Ile Pro Ala Val Asp Ser
1145 1150 1155
Leu Val Pro Pro Ala Tyr Arg Ser His Pro His Val Ile His Pro
1160 1165 1170
Gly Thr Leu Asp Ala Val Phe His Leu Ala Phe Ala Ala Leu Glu
1175 1180 1185
Asp Ser Leu Leu Pro Gly Pro Met Val Pro Thr Thr Ile Asp Glu
1190 1195 1200
Leu Val Val Ala Ala Asp Thr Pro Asn Thr Pro Gly Thr Leu Leu
1205 1210 1215
Arg Gly Val Ser Arg Ser Ser Pro His Gly Phe Arg Glu Leu Ile
1220 1225 1230
Ser Asp Ile Asp Met Leu Asp Asp Gln Ser Ser Arg Ala Leu Val
1235 1240 1245
Gln Ile Lys Gly Phe Arg Cys Ala Asp Val Ser Gly Gly Arg Met
1250 1255 1260
Thr Ser Ser Glu Ala Ala Ser Ala Glu Ser Arg Pro Ile Gly Phe
1265 1270 1275
Arg Leu Glu Trp Lys Pro Ala Ile Asp Leu Leu Thr Gly Glu Gln
1280 1285 1290
Leu Arg Thr His Leu Asp Arg Arg Val Lys Gln Glu Gly Ala Ser
1295 1300 1305
Asn Val Ala Arg Ala Thr Glu Leu Asn Asn His Val His His Leu
1310 1315 1320
Glu Glu Thr Leu Pro Arg Val Ala Val Asp Pro Ala Met Ala Asn
1325 1330 1335
Leu Ser Asp Trp Leu Ser Ala Lys Ser Ala Lys Leu Thr Asn Gly
1340 1345 1350
Thr Thr Ser Ser Ser Lys Arg Leu Ser Pro Gly Gly Asp Met Leu
1355 1360 1365
Ala Met Arg Asp Ala Leu Thr Ala Val Arg Ala Gly Ser Ile Pro
1370 1375 1380
Ser Pro Glu Gln Gln Asp Arg Met Leu Arg Glu Val Glu Gln Asn
1385 1390 1395
Gly Ala Leu Ser Ile Leu Phe Lys Pro Leu Asp Ala Tyr Ile Asp
1400 1405 1410
Leu Arg His His Ala Lys Pro Asn Leu Ser Ile Leu Glu Leu Ser
1415 1420 1425
Leu Asp Ser Val Pro Tyr Ser Val Phe Ala Ala Leu Pro Ser Arg
1430 1435 1440
His Lys Ile Leu Gln Thr Ala Gln Tyr Ala Ile Arg Val Ser Gln
1445 1450 1455
Glu Gly Val Ala Asp Arg Val Arg Ala Gln Phe Gly Ser Gln Ala
1460 1465 1470
Ser Asp Ile Asp Val Ser Val Thr Asp Phe Thr Lys Lys Leu Asp
1475 1480 1485
Glu Gly Leu Gly Lys His Asp Val Ile Leu Ile Phe Asp Pro Gly
1490 1495 1500
Phe Val His Ala Lys Leu Glu Val Val Leu Arg Asn Ala Arg Lys
1505 1510 1515
Leu Leu Asn Pro Gly Gly Arg Ile Val Val Ala Glu Val Ser Asp
1520 1525 1530
Pro Gly Leu Tyr Leu Gly Thr Ala Leu Gly Cys Leu Gln Trp Thr
1535 1540 1545
Arg Asn Leu Asp Val Ala Gln Ser Ser Ser Ser Trp Thr Ser Cys
1550 1555 1560
Leu Ala Arg Ser Gly Leu Thr Pro Ala Leu Lys Leu Ile Asp Met
1565 1570 1575
Asp Thr Glu Ser Ala Val His Gly His Phe Arg Leu Ser Leu Thr
1580 1585 1590
Gly Asn Ala Ala Glu Ser Thr Asn Ser Asp Asn Arg Gln Pro Gln
1595 1600 1605
Gln Val Thr Leu Ile Glu Ala Ala Asn Pro Ser Ala Thr Ala Gln
1610 1615 1620
Asp Ile Ala Ala Ala Val Ala Gln Asn Leu Asp Lys Ala Ser Ile
1625 1630 1635
Pro Thr Lys Arg Ile Arg Trp Gly Ser Asp Val Ser Gln Leu Lys
1640 1645 1650
Gly Gln Pro Cys Ile Val Leu Thr Asp Leu Glu Ser Ala Leu Leu
1655 1660 1665
Lys Asp Pro Ala Pro Glu Asp Leu Ala Ala Leu Gln Ser Leu Phe
1670 1675 1680
Ala His Ala Glu Ser Thr Leu Trp Val Ser Gly Pro Leu Gly Pro
1685 1690 1695
Asp Ala Ala Leu Ile Thr Gly Leu Ser Arg Ser Val Cys Asn Glu
1700 1705 1710
Ala Ala Asp Val His Ile Arg Thr Leu Glu Val Thr Asp Leu Pro
1715 1720 1725
Gly Pro Gly Ala Asp Ser Tyr Ala Asp Leu Val Thr Arg Val Phe
1730 1735 1740
Arg Tyr Ser Gly Pro Asp Thr Glu Phe Arg Trp His Ser Asp Ala
1745 1750 1755
Leu Leu Val Ser Arg Leu Val Glu Asp Glu Ala Arg Asn Lys Glu
1760 1765 1770
Ile Ala Gln Leu Leu Gly Gln Gly Glu Lys Ala Ala Val Ala Thr
1775 1780 1785
Thr Leu Gln Glu Lys Pro Glu Gly Leu Lys Leu Cys Met Arg Gln
1790 1795 1800
Ile Gly Met Leu Asp Ser Val Cys Phe Glu Pro Asp Leu Leu Ala
1805 1810 1815
Leu Glu Pro Leu Glu Ala Gly Glu Val Glu Val Asp Val Lys Ala
1820 1825 1830
Ser Gly Val Asn Phe Arg Asp Val Met Val Ala Leu Gly Gln Ile
1835 1840 1845
Pro Asp Arg Ala Phe Gly Phe Glu Gly Ala Gly Val Val Arg Arg
1850 1855 1860
Val His Ala Ser Glu Thr Arg Leu Arg Pro Gly Asp Arg Val Val
1865 1870 1875
Phe Leu Ala His Gly Ala His Arg Thr Val His Arg Val Arg Ala
1880 1885 1890
Asp Tyr Ala Met Pro Met Pro Asp Thr Met Ser Phe Glu Glu Gly
1895 1900 1905
Ala Ala Ile Leu Leu Val His Thr Thr Ala Trp Tyr Ala Leu Val
1910 1915 1920
Lys Ser Ala Arg Ala Thr Ala Gly Gln Ser Val Leu Val His Ala
1925 1930 1935
Ala Ala Gly Gly Val Gly Gln Ala Val Leu Met Leu Ala Arg His
1940 1945 1950
Leu Gly Leu Gln Val Phe Ala Thr Val Gly Ser Glu Glu Lys Arg
1955 1960 1965
Lys Leu Val His Glu Thr Tyr Gly Val Pro His Asp His Ile Phe
1970 1975 1980
Asn Ser Arg Asp Ala Ser Phe Ala Met Gly Val Lys Arg Met Thr
1985 1990 1995
Lys Gly Arg Gly Val Asp Ile Val Val Asn Ser Leu Ala Gly Glu
2000 2005 2010
Ala Leu Arg Gln Thr Trp His Cys Leu Ala Pro Phe Gly Thr Phe
2015 2020 2025
Val Glu Leu Gly Met Lys Asp Ile Leu Asp Asn Ala Arg Leu Asp
2030 2035 2040
Met Lys Pro Phe Leu Gln Asp Ala Thr Phe Val Phe Phe Asn Leu
2045 2050 2055
Asn Arg Val Gln Lys Glu Arg Pro Asp Leu Met Gly Glu Ala Leu
2060 2065 2070
Arg Glu Thr Met Ala Leu Val Arg Ser Gly Ala Leu Lys Pro Ala
2075 2080 2085
Thr Pro Leu Thr Ser Tyr Pro Ala Ser Gln Val Glu Ala Ala Phe
2090 2095 2100
Arg Lys Ile Gln Thr Gly Gln His Leu Gly Lys Leu Val Leu Thr
2105 2110 2115
Phe Gln Glu Gly Asp Val Val Pro Val Val Arg Pro Asp Leu Ser
2120 2125 2130
Leu Ser Asp Ser Gly Thr Tyr Leu Leu Val Gly Gly Leu Gly Gly
2135 2140 2145
Leu Gly Arg Ser Leu Ala Arg Leu Leu Val Gln Leu Gly Ala Arg
2150 2155 2160
Arg Leu Cys Phe Leu Ser Arg Ser Gly Ala Ala Ser Ser Glu Ala
2165 2170 2175
Arg Ala Leu Val Lys Glu Leu Glu Met Gln His Arg Val Arg Val
2180 2185 2190
Leu Val Cys Lys Gly Asp Val Ser Asp Ala Asp Thr Val Ser Arg
2195 2200 2205
Val Val Gln Gln Cys Arg Ala Ala Leu Gly Pro Ile Arg Gly Val
2210 2215 2220
Ile Gln Cys Ala Met Val Leu Arg Asp Gly Leu Phe Glu Arg Met
2225 2230 2235
Ala His Asp Gln Trp Thr Glu Ser Thr Arg Pro Lys Val Gln Gly
2240 2245 2250
Thr Trp Asn Leu His Glu Gln Ile Pro Val Ser Asp Phe Phe Ile
2255 2260 2265
Thr Leu Ser Ser Phe Ala Gly Val Phe Gly Ser Arg Gly Gln Ser
2270 2275 2280
Asn Tyr Ala Ala Ala Gly Ala Tyr Glu Asp Ala Met Ala His His
2285 2290 2295
Arg Glu Ser Leu Gly Gln Arg Ala Ile Thr Ile Asp Leu Gly Ile
2300 2305 2310
Met Arg Asp Val Gly Val Leu Ala Glu Asn Gly Ile Thr Asp Tyr
2315 2320 2325
Leu Arg Glu Trp Glu Glu Pro Phe Gly Ile Arg Glu Pro Glu Phe
2330 2335 2340
His Ala Leu Ile Lys Ser Ala Ile Met Ser Thr Thr Gln Pro Leu
2345 2350 2355
Thr Glu Arg Ser Val Val Gln Ile Pro Thr Gly Leu Ala Thr Ala
2360 2365 2370
Arg Ser Ala Gln Ala Ala Gly Ile Ser Thr Pro Phe Tyr Phe Asp
2375 2380 2385
Asp Ala Arg Phe Ser Ile Leu Ala Gln Thr Arg Ala Ser Ala Gly
2390 2395 2400
Ala Ser Ser Ala Ala Gly Ser Gly Asp Ala Asp Ala Gly Lys Val
2405 2410 2415
Ser Val Arg Thr Gln Leu Ser Gln Ala His Ser Val Ala Glu Ala
2420 2425 2430
Ala Ala Ala Val Gln Thr Val Leu Leu Glu Arg Val Ala Arg Thr
2435 2440 2445
Leu Gln Ser Ser Val Ala Glu Ile Asp Pro Ser Arg Pro Leu His
2450 2455 2460
Ser Tyr Gly Val Asp Ser Leu Val Ala Val Glu Thr Val Lys Trp
2465 2470 2475
Met Phe Lys Thr Leu Asp Ala Lys Met Thr Val Phe Asp Val Leu
2480 2485 2490
Ser Asn Val Ser Ile Thr Ala Leu Cys Glu Lys Ile Ala Ser Met
2495 2500 2505
Ser Thr Leu Val Lys Leu Asn
2510 2515
<210> 3
<211> 5148
<212> DNA
<213> Metarhizium anisopliae
<220>
<221> CDS
<222> (1)..(5148)
<400> 3
atg aaa ctg cgt gtc gca aac ttc ctc ctc ttt ggg gat cag acc gta 48
Met Lys Leu Arg Val Ala Asn Phe Leu Leu Phe Gly Asp Gln Thr Val
1 5 10 15
gag aag ctc cca gcc att cgg cac ctg gtg agc cat ggc gcg tcc tca 96
Glu Lys Leu Pro Ala Ile Arg His Leu Val Ser His Gly Ala Ser Ser
20 25 30
cct ctt gtc cag aga ttc ctg cgt caa gtg tgc gat gca gta cag ctc 144
Pro Leu Val Gln Arg Phe Leu Arg Gln Val Cys Asp Ala Val Gln Leu
35 40 45
cag gtc agc aag ctg cct ctg cac tcg gag caa cgc agc aac att ggg 192
Gln Val Ser Lys Leu Pro Leu His Ser Glu Gln Arg Ser Asn Ile Gly
50 55 60
aac ttc gac agt atc ctt cga cta gcc gag aac aat gcc cgg ctg gag 240
Asn Phe Asp Ser Ile Leu Arg Leu Ala Glu Asn Asn Ala Arg Leu Glu
65 70 75 80
gag ccc aac gag atc att gcc acc gtc ttg atg aat atc gca cgt cta 288
Glu Pro Asn Glu Ile Ile Ala Thr Val Leu Met Asn Ile Ala Arg Leu
85 90 95
gga gag ctc att cta tat gca gag caa gac cct acc gtt ctc gcc tcc 336
Gly Glu Leu Ile Leu Tyr Ala Glu Gln Asp Pro Thr Val Leu Ala Ser
100 105 110
aaa ggc aac cgc aac tgt att ctg gga ttc tgc acc ggc gag gtg gcc 384
Lys Gly Asn Arg Asn Cys Ile Leu Gly Phe Cys Thr Gly Glu Val Ala
115 120 125
gct gct gtg gcc gcc gtc gcg cag gac acc aac gaa ctc gtc gag ctg 432
Ala Ala Val Ala Ala Val Ala Gln Asp Thr Asn Glu Leu Val Glu Leu
130 135 140
gga gtc gag gtg aca cac atc atc ttt cgc atg gcc cgc gaa ctc aat 480
Gly Val Glu Val Thr His Ile Ile Phe Arg Met Ala Arg Glu Leu Asn
145 150 155 160
cgc cgg tct ctc atg gtt gac cgt acc aat ggc ccc tgg gcc cgg aca 528
Arg Arg Ser Leu Met Val Asp Arg Thr Asn Gly Pro Trp Ala Arg Thr
165 170 175
ata ctg ggc att tca gtc gat cgc gtc cgg gaa atc cta caa gac ttc 576
Ile Leu Gly Ile Ser Val Asp Arg Val Arg Glu Ile Leu Gln Asp Phe
180 185 190
cac gag aac cag tct att cct cgc gcg cga caa gtc tgc att ggc ttc 624
His Glu Asn Gln Ser Ile Pro Arg Ala Arg Gln Val Cys Ile Gly Phe
195 200 205
gtc tca gat ggc tgg tta aca ctc ttt ggc ccg ccc aca act ctg caa 672
Val Ser Asp Gly Trp Leu Thr Leu Phe Gly Pro Pro Thr Thr Leu Gln
210 215 220
cgg ctt tta gaa tgg tcg gca gag ctg gaa gac gct ccg caa atc gac 720
Arg Leu Leu Glu Trp Ser Ala Glu Leu Glu Asp Ala Pro Gln Ile Asp
225 230 235 240
acc gac gcc cgc gga ggc gtg cac atg gag acg ttg cca gaa gtt gac 768
Thr Asp Ala Arg Gly Gly Val His Met Glu Thr Leu Pro Glu Val Asp
245 250 255
ccg gat cgg att ctt ggc tca tcg cca tgg ctg gac cgg gcc ccc gtg 816
Pro Asp Arg Ile Leu Gly Ser Ser Pro Trp Leu Asp Arg Ala Pro Val
260 265 270
cac acg gcc acc ata atc tcg ccc tac acg tgc aaa ccg cgg cag cag 864
His Thr Ala Thr Ile Ile Ser Pro Tyr Thr Cys Lys Pro Arg Gln Gln
275 280 285
aag acg ttg cgg ggg ctt ctg gag gaa ata att gca gat gtc ggg cag 912
Lys Thr Leu Arg Gly Leu Leu Glu Glu Ile Ile Ala Asp Val Gly Gln
290 295 300
agg acg ttg aat ttg gcc acg tca atg aac gct gct gtt gag ctc gca 960
Arg Thr Leu Asn Leu Ala Thr Ser Met Asn Ala Ala Val Glu Leu Ala
305 310 315 320
cag gca gac aag ctc cgt ctt gtt atg ccc ggc tac act agt cac gac 1008
Gln Ala Asp Lys Leu Arg Leu Val Met Pro Gly Tyr Thr Ser His Asp
325 330 335
gtc tac ttt caa aga tta ctg caa aaa cgc ggc ata gag tat tcc gtc 1056
Val Tyr Phe Gln Arg Leu Leu Gln Lys Arg Gly Ile Glu Tyr Ser Val
340 345 350
atg tca cat ggg gac cat ttg tcg tca ggt ccc agc cga cag ggt tca 1104
Met Ser His Gly Asp His Leu Ser Ser Gly Pro Ser Arg Gln Gly Ser
355 360 365
gga ctt gtg gct gtc gtc ggc atg tct ggg agg ttc cca ggg agc ggc 1152
Gly Leu Val Ala Val Val Gly Met Ser Gly Arg Phe Pro Gly Ser Gly
370 375 380
gac atc aac gca ttt tgg gag ggt ctt tta gag ggc aaa aga tat atc 1200
Asp Ile Asn Ala Phe Trp Glu Gly Leu Leu Glu Gly Lys Arg Tyr Ile
385 390 395 400
caa gag att cca aat aca cga ttt gac ctg gag caa tgg tac gat gcc 1248
Gln Glu Ile Pro Asn Thr Arg Phe Asp Leu Glu Gln Trp Tyr Asp Ala
405 410 415
acg gga aaa caa aag aat tct acc atg gcg cgg aca gga gcc ttc ctc 1296
Thr Gly Lys Gln Lys Asn Ser Thr Met Ala Arg Thr Gly Ala Phe Leu
420 425 430
gac aag ccg ggc atg ttc gac aac cgc cta ttc gac atg tcg ccc agg 1344
Asp Lys Pro Gly Met Phe Asp Asn Arg Leu Phe Asp Met Ser Pro Arg
435 440 445
gag gcc atg cag aca gac gtc cag cac cgg ctg ctc atg aca acc agc 1392
Glu Ala Met Gln Thr Asp Val Gln His Arg Leu Leu Met Thr Thr Ser
450 455 460
tac gag gca ctg gag atg tcg ggc tac tat ccc gat ggc acg ctt tcg 1440
Tyr Glu Ala Leu Glu Met Ser Gly Tyr Tyr Pro Asp Gly Thr Leu Ser
465 470 475 480
aca aac aag gac cgc gtc gcc tcc ttc ttt ggc cag acg tct gat gat 1488
Thr Asn Lys Asp Arg Val Ala Ser Phe Phe Gly Gln Thr Ser Asp Asp
485 490 495
tgg cga gaa gtg gtg gtc cac caa ggg gta gac atc tac ttc gcc acg 1536
Trp Arg Glu Val Val Val His Gln Gly Val Asp Ile Tyr Phe Ala Thr
500 505 510
gga agc tgc cgc gct ttc gga cca ggc agg ctg cac cac cac ttc aaa 1584
Gly Ser Cys Arg Ala Phe Gly Pro Gly Arg Leu His His His Phe Lys
515 520 525
tgg gga ggt ccg tct tat agc gtc gac tcg gcc tgc tct tcc agc atc 1632
Trp Gly Gly Pro Ser Tyr Ser Val Asp Ser Ala Cys Ser Ser Ser Ile
530 535 540
gca gcc gtc ggt tta gcg tgc tcg gcg ctc ctc ggc cgc gaa tgc gac 1680
Ala Ala Val Gly Leu Ala Cys Ser Ala Leu Leu Gly Arg Glu Cys Asp
545 550 555 560
atg gct ctg gct ggt gga gga tcc ctc ctc ctc tcc cca tca ccc ttc 1728
Met Ala Leu Ala Gly Gly Gly Ser Leu Leu Leu Ser Pro Ser Pro Phe
565 570 575
tcg ggg tta agc cgt ggc ggt ttc ctg tcc gct cat gga ggg tgc cag 1776
Ser Gly Leu Ser Arg Gly Gly Phe Leu Ser Ala His Gly Gly Cys Gln
580 585 590
acg ttc cac gac aat gcc gac ggt tac gtc cgt gga gag gga gtt ggc 1824
Thr Phe His Asp Asn Ala Asp Gly Tyr Val Arg Gly Glu Gly Val Gly
595 600 605
gtg gtc gtt ctc aaa cgg ttg gag gac gcg ctg gac gac caa gac aac 1872
Val Val Val Leu Lys Arg Leu Glu Asp Ala Leu Asp Asp Gln Asp Asn
610 615 620
atc ctc ggc gtc gtc cgg gga tcc gga cgc aac tac agc agt gat gct 1920
Ile Leu Gly Val Val Arg Gly Ser Gly Arg Asn Tyr Ser Ser Asp Ala
625 630 635 640
tct tcc atg atg cat ccc tcg gca aat gct cag aaa aag ctg tac tgc 1968
Ser Ser Met Met His Pro Ser Ala Asn Ala Gln Lys Lys Leu Tyr Cys
645 650 655
gat gtg ctg gag caa agc ggt gta gac gcc aac agc atc tcg tac gtg 2016
Asp Val Leu Glu Gln Ser Gly Val Asp Ala Asn Ser Ile Ser Tyr Val
660 665 670
gag atg cat gga acc ggg aca cag gcg gga gac ttt atg gaa atg tcc 2064
Glu Met His Gly Thr Gly Thr Gln Ala Gly Asp Phe Met Glu Met Ser
675 680 685
tcg gtc ttg tca aca ttt gca gaa aag cga ggc tcg gat aat ccg ctc 2112
Ser Val Leu Ser Thr Phe Ala Glu Lys Arg Gly Ser Asp Asn Pro Leu
690 695 700
att gtt ggg gcc ctc aaa gca aat att ggc cac ggg gaa gct gcg gcc 2160
Ile Val Gly Ala Leu Lys Ala Asn Ile Gly His Gly Glu Ala Ala Ala
705 710 715 720
ggt gtt tgc gct ctt atc aaa acc ctc atg atg ctc cag tct cga cag 2208
Gly Val Cys Ala Leu Ile Lys Thr Leu Met Met Leu Gln Ser Arg Gln
725 730 735
att ccc ccc cag ccc gat ctt cct gga cct att aac cac cgc ttt cct 2256
Ile Pro Pro Gln Pro Asp Leu Pro Gly Pro Ile Asn His Arg Phe Pro
740 745 750
gat cta gca gcg cgt aat gta tac atc gcg gcc cgc aat atg aga ctg 2304
Asp Leu Ala Ala Arg Asn Val Tyr Ile Ala Ala Arg Asn Met Arg Leu
755 760 765
gag gcc agt cca gtg gct aag ggc acg cta cgc gtc ttt ctc aac agc 2352
Glu Ala Ser Pro Val Ala Lys Gly Thr Leu Arg Val Phe Leu Asn Ser
770 775 780
ttc gac gcc tcg gga gga aat tcg tgc ttg gtg ctt gaa gaa gct ccg 2400
Phe Asp Ala Ser Gly Gly Asn Ser Cys Leu Val Leu Glu Glu Ala Pro
785 790 795 800
cca cgg gcc gtc aag gat gca gac cct cga ggt cac cac gtc gtg acg 2448
Pro Arg Ala Val Lys Asp Ala Asp Pro Arg Gly His His Val Val Thr
805 810 815
ctt tca gcc cgt tcc cag aag tca ctt att ggc atc aaa gag agg tat 2496
Leu Ser Ala Arg Ser Gln Lys Ser Leu Ile Gly Ile Lys Glu Arg Tyr
820 825 830
ctc gct cat ctg cgc caa cat cct gac acc aaa ctg gcc gac ttg gcc 2544
Leu Ala His Leu Arg Gln His Pro Asp Thr Lys Leu Ala Asp Leu Ala
835 840 845
tat acc aca agc gct cga cgc att cac ggg tta ttg cgg tac gcc att 2592
Tyr Thr Thr Ser Ala Arg Arg Ile His Gly Leu Leu Arg Tyr Ala Ile
850 855 860
gcc gca tct tcc att gac gag gtc gtg caa tgc ctg gag acg gat ctc 2640
Ala Ala Ser Ser Ile Asp Glu Val Val Gln Cys Leu Glu Thr Asp Leu
865 870 875 880
gcc cag ggg aaa aca cca cgt cag cct ccg gca aca cca acg gta gtc 2688
Ala Gln Gly Lys Thr Pro Arg Gln Pro Pro Ala Thr Pro Thr Val Val
885 890 895
ttt aca ttt act ggc caa ggc gca cac tat atc ggc atg ggg gca aac 2736
Phe Thr Phe Thr Gly Gln Gly Ala His Tyr Ile Gly Met Gly Ala Asn
900 905 910
ttg tgg gag acg tct gcc aca ttc cgc aat acg ctt cac gac tac cag 2784
Leu Trp Glu Thr Ser Ala Thr Phe Arg Asn Thr Leu His Asp Tyr Gln
915 920 925
aca atg gcc agc gct caa ggc ctc ccc cat ttc ctg cat ctc atc acg 2832
Thr Met Ala Ser Ala Gln Gly Leu Pro His Phe Leu His Leu Ile Thr
930 935 940
gac agc agc aca ccc gcg cca cag tcg ggc ccg gat acc gtg cag gta 2880
Asp Ser Ser Thr Pro Ala Pro Gln Ser Gly Pro Asp Thr Val Gln Val
945 950 955 960
cag ctg gcc atg gta agc ttg gaa ctg gcc ctg gcc aag ctc tgg cgc 2928
Gln Leu Ala Met Val Ser Leu Glu Leu Ala Leu Ala Lys Leu Trp Arg
965 970 975
tcc tgg ggc atc cag cca gcc atg gtc ttg ggc cac agc ctg ggc gaa 2976
Ser Trp Gly Ile Gln Pro Ala Met Val Leu Gly His Ser Leu Gly Glu
980 985 990
tac gcg gcc ttg tgc gtg gcc gga gtc ttg tcc gtg agc gac act ctg 3024
Tyr Ala Ala Leu Cys Val Ala Gly Val Leu Ser Val Ser Asp Thr Leu
995 1000 1005
tac ctc gtc gcc aag cga gca caa atc atg gct gga gcc ctg acg 3069
Tyr Leu Val Ala Lys Arg Ala Gln Ile Met Ala Gly Ala Leu Thr
1010 1015 1020
ccg cac gaa tac gga atg ctg gct gtg aat cta agc gtt gct gac 3114
Pro His Glu Tyr Gly Met Leu Ala Val Asn Leu Ser Val Ala Asp
1025 1030 1035
acg cgg gaa gtg ctc tcg tct ggc cag cat act tcc tgc gcc gtg 3159
Thr Arg Glu Val Leu Ser Ser Gly Gln His Thr Ser Cys Ala Val
1040 1045 1050
gct tgc atc aac gcg ccc aag atg aca gtc gtg agc ggc ttg cgc 3204
Ala Cys Ile Asn Ala Pro Lys Met Thr Val Val Ser Gly Leu Arg
1055 1060 1065
tcg aag ctg gac gat ctc cag gac caa ctc aag tcg gac ggc acc 3249
Ser Lys Leu Asp Asp Leu Gln Asp Gln Leu Lys Ser Asp Gly Thr
1070 1075 1080
cgg tgc act ccc cta tct gtt ccc tat ggc ttc cac tcc agc cag 3294
Arg Cys Thr Pro Leu Ser Val Pro Tyr Gly Phe His Ser Ser Gln
1085 1090 1095
ctt gat ccc atc ttg ggc cag ttc gaa gag gcc tgc cag ggc gtg 3339
Leu Asp Pro Ile Leu Gly Gln Phe Glu Glu Ala Cys Gln Gly Val
1100 1105 1110
acc ttt tcc gcg ccg agt gtc ccg gtc gtt tcc acg ctc ttg gct 3384
Thr Phe Ser Ala Pro Ser Val Pro Val Val Ser Thr Leu Leu Ala
1115 1120 1125
acg aca gtc cgg gaa gaa gga aca ttc tct ccg gag tac ctg gca 3429
Thr Thr Val Arg Glu Glu Gly Thr Phe Ser Pro Glu Tyr Leu Ala
1130 1135 1140
cga cag gcg cgc gaa ccc gtc gac ttt gtc ggg gca ttg ggc gcg 3474
Arg Gln Ala Arg Glu Pro Val Asp Phe Val Gly Ala Leu Gly Ala
1145 1150 1155
gtg cag gag cac aag ttt ccc ggc ctg acc ttc ctc gag att ggg 3519
Val Gln Glu His Lys Phe Pro Gly Leu Thr Phe Leu Glu Ile Gly
1160 1165 1170
ccc gat ccc gtg tgc tcg ggt ctt gtg aat gct acg cta ggt gcc 3564
Pro Asp Pro Val Cys Ser Gly Leu Val Asn Ala Thr Leu Gly Ala
1175 1180 1185
gat gag gct gca ttg cgc tgc gtt gcc tcg atg cac cgc gga aag 3609
Asp Glu Ala Ala Leu Arg Cys Val Ala Ser Met His Arg Gly Lys
1190 1195 1200
gcc aac tgg gcg tcg ata tcg tgc agc ttg agg gat ctc tat acg 3654
Ala Asn Trp Ala Ser Ile Ser Cys Ser Leu Arg Asp Leu Tyr Thr
1205 1210 1215
gcg ggt gcc gcc att gac tgg cca gcc cat cac cgg gat ttc aaa 3699
Ala Gly Ala Ala Ile Asp Trp Pro Ala His His Arg Asp Phe Lys
1220 1225 1230
tca tcg gta tcc ctg ctg gac ctc cca aag tac tcg ttt gac gag 3744
Ser Ser Val Ser Leu Leu Asp Leu Pro Lys Tyr Ser Phe Asp Glu
1235 1240 1245
aag gaa ttc tgg gcg tcg ttc ccc gat cga gac ctt cag acc att 3789
Lys Glu Phe Trp Ala Ser Phe Pro Asp Arg Asp Leu Gln Thr Ile
1250 1255 1260
gga gac gtc gag acc aag cac agc caa ccg cct gcc att gtt cct 3834
Gly Asp Val Glu Thr Lys His Ser Gln Pro Pro Ala Ile Val Pro
1265 1270 1275
tcg gta caa ggg tat tgc aca acg act ctg cag cgg atc acg agg 3879
Ser Val Gln Gly Tyr Cys Thr Thr Thr Leu Gln Arg Ile Thr Arg
1280 1285 1290
gaa aca atc gag ccc gat ggg ttg tcg gtt aca ttc tca tca gac 3924
Glu Thr Ile Glu Pro Asp Gly Leu Ser Val Thr Phe Ser Ser Asp
1295 1300 1305
cta gcc gac cag cac cta cgg gca gcc gtg cga ggc cac gcc gtg 3969
Leu Ala Asp Gln His Leu Arg Ala Ala Val Arg Gly His Ala Val
1310 1315 1320
gcc gat gtg gaa att tgt tcc agc agt ctg ctc ttg gac atg gcg 4014
Ala Asp Val Glu Ile Cys Ser Ser Ser Leu Leu Leu Asp Met Ala
1325 1330 1335
ctc tcc gcg gcc caa tat gcc tac atg aag cat tct cct ggt cag 4059
Leu Ser Ala Ala Gln Tyr Ala Tyr Met Lys His Ser Pro Gly Gln
1340 1345 1350
aag atg cca gtg cca tta acc gtc cgc aac tgc tat ttc cac cgg 4104
Lys Met Pro Val Pro Leu Thr Val Arg Asn Cys Tyr Phe His Arg
1355 1360 1365
ggt gtt gtc ttg acg gac aag gcc cag acg gtg gaa gtc acc gtc 4149
Gly Val Val Leu Thr Asp Lys Ala Gln Thr Val Glu Val Thr Val
1370 1375 1380
act ctt aca tcc tcg gcc aag act gcg gat atc cgg tac cac tgc 4194
Thr Leu Thr Ser Ser Ala Lys Thr Ala Asp Ile Arg Tyr His Cys
1385 1390 1395
cgc act cct gac gag tat tac gag gtt ggc gcc tgc cag gtc gtc 4239
Arg Thr Pro Asp Glu Tyr Tyr Glu Val Gly Ala Cys Gln Val Val
1400 1405 1410
ttg aag cca gca agc aaa ccg gac caa gcc ggc ttc ctg gtt cgg 4284
Leu Lys Pro Ala Ser Lys Pro Asp Gln Ala Gly Phe Leu Val Arg
1415 1420 1425
tcc cgc atg gct gct ctc aag gcg tcc gca agt cac cgg cta ggc 4329
Ser Arg Met Ala Ala Leu Lys Ala Ser Ala Ser His Arg Leu Gly
1430 1435 1440
aga cgc gca gtc tat cga tta ttc gac aac gtt gtg cgt tat tcc 4374
Arg Arg Ala Val Tyr Arg Leu Phe Asp Asn Val Val Arg Tyr Ser
1445 1450 1455
gaa caa tac cag ggg cta gaa aat gtc cac ttg tca gag gac atg 4419
Glu Gln Tyr Gln Gly Leu Glu Asn Val His Leu Ser Glu Asp Met
1460 1465 1470
cag gat gcc gtg gcg gaa atc aac atg gcc cac gtc cca gcc gca 4464
Gln Asp Ala Val Ala Glu Ile Asn Met Ala His Val Pro Ala Ala
1475 1480 1485
ggc ggc cat tac ctc cac cac cca ttc ttg ctc gac tcg att gtt 4509
Gly Gly His Tyr Leu His His Pro Phe Leu Leu Asp Ser Ile Val
1490 1495 1500
cac ttg tcg ggg ttc ttg gtg aac aat ggg ctt cgc tat tcc agc 4554
His Leu Ser Gly Phe Leu Val Asn Asn Gly Leu Arg Tyr Ser Ser
1505 1510 1515
gag tgg gct tgc ctt tcc acc ggc ttt gac gag tgg cac ctg ctc 4599
Glu Trp Ala Cys Leu Ser Thr Gly Phe Asp Glu Trp His Leu Leu
1520 1525 1530
aag ccg ctt gat ccc acc act gtg tac acc agc tac acc ttc atg 4644
Lys Pro Leu Asp Pro Thr Thr Val Tyr Thr Ser Tyr Thr Phe Met
1535 1540 1545
gag gac tct cgg tcg acg agc aat ctg gta acg ggc gat gta tac 4689
Glu Asp Ser Arg Ser Thr Ser Asn Leu Val Thr Gly Asp Val Tyr
1550 1555 1560
gtc tat gac gga gag gag ctg gtc tcg gtg ctg acg ggg ctg cag 4734
Val Tyr Asp Gly Glu Glu Leu Val Ser Val Leu Thr Gly Leu Gln
1565 1570 1575
ttc caa aag atg aag agg acg gca ctc act cat cta ctg agc ccc 4779
Phe Gln Lys Met Lys Arg Thr Ala Leu Thr His Leu Leu Ser Pro
1580 1585 1590
ccg aca gtc ggt acc atg gcc gcc aag ccg agt aca tgt atg cca 4824
Pro Thr Val Gly Thr Met Ala Ala Lys Pro Ser Thr Cys Met Pro
1595 1600 1605
act atg gga caa acg gag ccg ttg ccg gct caa gcc aga gtg gcc 4869
Thr Met Gly Gln Thr Glu Pro Leu Pro Ala Gln Ala Arg Val Ala
1610 1615 1620
ggc ttg ccg gtt ccc aca cca ccg gct aca gca agt gtt gat gat 4914
Gly Leu Pro Val Pro Thr Pro Pro Ala Thr Ala Ser Val Asp Asp
1625 1630 1635
ggc gag ggg gag aag ttc gac ttg gtc aat aca ctc ttt tcc att 4959
Gly Glu Gly Glu Lys Phe Asp Leu Val Asn Thr Leu Phe Ser Ile
1640 1645 1650
atc gca cgc gag gtg ggc gtg gag cca agc gat ttg gag ggc gac 5004
Ile Ala Arg Glu Val Gly Val Glu Pro Ser Asp Leu Glu Gly Asp
1655 1660 1665
gtc aac ctg gcg aat ttg ggc ata gac tcc ctg atg gcc att acc 5049
Val Asn Leu Ala Asn Leu Gly Ile Asp Ser Leu Met Ala Ile Thr
1670 1675 1680
ata att tca gtc atg cag cag gaa aca ggt gtc gag ttg ccg ggg 5094
Ile Ile Ser Val Met Gln Gln Glu Thr Gly Val Glu Leu Pro Gly
1685 1690 1695
acc ttt ttc ctc gac aat gcc act aca acg gca gtg att gcg gcg 5139
Thr Phe Phe Leu Asp Asn Ala Thr Thr Thr Ala Val Ile Ala Ala
1700 1705 1710
gta ggg tag 5148
Val Gly
1715
<210> 4
<211> 1715
<212> PRT
<213> Metarhizium anisopliae
<400> 4
Met Lys Leu Arg Val Ala Asn Phe Leu Leu Phe Gly Asp Gln Thr Val
1 5 10 15
Glu Lys Leu Pro Ala Ile Arg His Leu Val Ser His Gly Ala Ser Ser
20 25 30
Pro Leu Val Gln Arg Phe Leu Arg Gln Val Cys Asp Ala Val Gln Leu
35 40 45
Gln Val Ser Lys Leu Pro Leu His Ser Glu Gln Arg Ser Asn Ile Gly
50 55 60
Asn Phe Asp Ser Ile Leu Arg Leu Ala Glu Asn Asn Ala Arg Leu Glu
65 70 75 80
Glu Pro Asn Glu Ile Ile Ala Thr Val Leu Met Asn Ile Ala Arg Leu
85 90 95
Gly Glu Leu Ile Leu Tyr Ala Glu Gln Asp Pro Thr Val Leu Ala Ser
100 105 110
Lys Gly Asn Arg Asn Cys Ile Leu Gly Phe Cys Thr Gly Glu Val Ala
115 120 125
Ala Ala Val Ala Ala Val Ala Gln Asp Thr Asn Glu Leu Val Glu Leu
130 135 140
Gly Val Glu Val Thr His Ile Ile Phe Arg Met Ala Arg Glu Leu Asn
145 150 155 160
Arg Arg Ser Leu Met Val Asp Arg Thr Asn Gly Pro Trp Ala Arg Thr
165 170 175
Ile Leu Gly Ile Ser Val Asp Arg Val Arg Glu Ile Leu Gln Asp Phe
180 185 190
His Glu Asn Gln Ser Ile Pro Arg Ala Arg Gln Val Cys Ile Gly Phe
195 200 205
Val Ser Asp Gly Trp Leu Thr Leu Phe Gly Pro Pro Thr Thr Leu Gln
210 215 220
Arg Leu Leu Glu Trp Ser Ala Glu Leu Glu Asp Ala Pro Gln Ile Asp
225 230 235 240
Thr Asp Ala Arg Gly Gly Val His Met Glu Thr Leu Pro Glu Val Asp
245 250 255
Pro Asp Arg Ile Leu Gly Ser Ser Pro Trp Leu Asp Arg Ala Pro Val
260 265 270
His Thr Ala Thr Ile Ile Ser Pro Tyr Thr Cys Lys Pro Arg Gln Gln
275 280 285
Lys Thr Leu Arg Gly Leu Leu Glu Glu Ile Ile Ala Asp Val Gly Gln
290 295 300
Arg Thr Leu Asn Leu Ala Thr Ser Met Asn Ala Ala Val Glu Leu Ala
305 310 315 320
Gln Ala Asp Lys Leu Arg Leu Val Met Pro Gly Tyr Thr Ser His Asp
325 330 335
Val Tyr Phe Gln Arg Leu Leu Gln Lys Arg Gly Ile Glu Tyr Ser Val
340 345 350
Met Ser His Gly Asp His Leu Ser Ser Gly Pro Ser Arg Gln Gly Ser
355 360 365
Gly Leu Val Ala Val Val Gly Met Ser Gly Arg Phe Pro Gly Ser Gly
370 375 380
Asp Ile Asn Ala Phe Trp Glu Gly Leu Leu Glu Gly Lys Arg Tyr Ile
385 390 395 400
Gln Glu Ile Pro Asn Thr Arg Phe Asp Leu Glu Gln Trp Tyr Asp Ala
405 410 415
Thr Gly Lys Gln Lys Asn Ser Thr Met Ala Arg Thr Gly Ala Phe Leu
420 425 430
Asp Lys Pro Gly Met Phe Asp Asn Arg Leu Phe Asp Met Ser Pro Arg
435 440 445
Glu Ala Met Gln Thr Asp Val Gln His Arg Leu Leu Met Thr Thr Ser
450 455 460
Tyr Glu Ala Leu Glu Met Ser Gly Tyr Tyr Pro Asp Gly Thr Leu Ser
465 470 475 480
Thr Asn Lys Asp Arg Val Ala Ser Phe Phe Gly Gln Thr Ser Asp Asp
485 490 495
Trp Arg Glu Val Val Val His Gln Gly Val Asp Ile Tyr Phe Ala Thr
500 505 510
Gly Ser Cys Arg Ala Phe Gly Pro Gly Arg Leu His His His Phe Lys
515 520 525
Trp Gly Gly Pro Ser Tyr Ser Val Asp Ser Ala Cys Ser Ser Ser Ile
530 535 540
Ala Ala Val Gly Leu Ala Cys Ser Ala Leu Leu Gly Arg Glu Cys Asp
545 550 555 560
Met Ala Leu Ala Gly Gly Gly Ser Leu Leu Leu Ser Pro Ser Pro Phe
565 570 575
Ser Gly Leu Ser Arg Gly Gly Phe Leu Ser Ala His Gly Gly Cys Gln
580 585 590
Thr Phe His Asp Asn Ala Asp Gly Tyr Val Arg Gly Glu Gly Val Gly
595 600 605
Val Val Val Leu Lys Arg Leu Glu Asp Ala Leu Asp Asp Gln Asp Asn
610 615 620
Ile Leu Gly Val Val Arg Gly Ser Gly Arg Asn Tyr Ser Ser Asp Ala
625 630 635 640
Ser Ser Met Met His Pro Ser Ala Asn Ala Gln Lys Lys Leu Tyr Cys
645 650 655
Asp Val Leu Glu Gln Ser Gly Val Asp Ala Asn Ser Ile Ser Tyr Val
660 665 670
Glu Met His Gly Thr Gly Thr Gln Ala Gly Asp Phe Met Glu Met Ser
675 680 685
Ser Val Leu Ser Thr Phe Ala Glu Lys Arg Gly Ser Asp Asn Pro Leu
690 695 700
Ile Val Gly Ala Leu Lys Ala Asn Ile Gly His Gly Glu Ala Ala Ala
705 710 715 720
Gly Val Cys Ala Leu Ile Lys Thr Leu Met Met Leu Gln Ser Arg Gln
725 730 735
Ile Pro Pro Gln Pro Asp Leu Pro Gly Pro Ile Asn His Arg Phe Pro
740 745 750
Asp Leu Ala Ala Arg Asn Val Tyr Ile Ala Ala Arg Asn Met Arg Leu
755 760 765
Glu Ala Ser Pro Val Ala Lys Gly Thr Leu Arg Val Phe Leu Asn Ser
770 775 780
Phe Asp Ala Ser Gly Gly Asn Ser Cys Leu Val Leu Glu Glu Ala Pro
785 790 795 800
Pro Arg Ala Val Lys Asp Ala Asp Pro Arg Gly His His Val Val Thr
805 810 815
Leu Ser Ala Arg Ser Gln Lys Ser Leu Ile Gly Ile Lys Glu Arg Tyr
820 825 830
Leu Ala His Leu Arg Gln His Pro Asp Thr Lys Leu Ala Asp Leu Ala
835 840 845
Tyr Thr Thr Ser Ala Arg Arg Ile His Gly Leu Leu Arg Tyr Ala Ile
850 855 860
Ala Ala Ser Ser Ile Asp Glu Val Val Gln Cys Leu Glu Thr Asp Leu
865 870 875 880
Ala Gln Gly Lys Thr Pro Arg Gln Pro Pro Ala Thr Pro Thr Val Val
885 890 895
Phe Thr Phe Thr Gly Gln Gly Ala His Tyr Ile Gly Met Gly Ala Asn
900 905 910
Leu Trp Glu Thr Ser Ala Thr Phe Arg Asn Thr Leu His Asp Tyr Gln
915 920 925
Thr Met Ala Ser Ala Gln Gly Leu Pro His Phe Leu His Leu Ile Thr
930 935 940
Asp Ser Ser Thr Pro Ala Pro Gln Ser Gly Pro Asp Thr Val Gln Val
945 950 955 960
Gln Leu Ala Met Val Ser Leu Glu Leu Ala Leu Ala Lys Leu Trp Arg
965 970 975
Ser Trp Gly Ile Gln Pro Ala Met Val Leu Gly His Ser Leu Gly Glu
980 985 990
Tyr Ala Ala Leu Cys Val Ala Gly Val Leu Ser Val Ser Asp Thr Leu
995 1000 1005
Tyr Leu Val Ala Lys Arg Ala Gln Ile Met Ala Gly Ala Leu Thr
1010 1015 1020
Pro His Glu Tyr Gly Met Leu Ala Val Asn Leu Ser Val Ala Asp
1025 1030 1035
Thr Arg Glu Val Leu Ser Ser Gly Gln His Thr Ser Cys Ala Val
1040 1045 1050
Ala Cys Ile Asn Ala Pro Lys Met Thr Val Val Ser Gly Leu Arg
1055 1060 1065
Ser Lys Leu Asp Asp Leu Gln Asp Gln Leu Lys Ser Asp Gly Thr
1070 1075 1080
Arg Cys Thr Pro Leu Ser Val Pro Tyr Gly Phe His Ser Ser Gln
1085 1090 1095
Leu Asp Pro Ile Leu Gly Gln Phe Glu Glu Ala Cys Gln Gly Val
1100 1105 1110
Thr Phe Ser Ala Pro Ser Val Pro Val Val Ser Thr Leu Leu Ala
1115 1120 1125
Thr Thr Val Arg Glu Glu Gly Thr Phe Ser Pro Glu Tyr Leu Ala
1130 1135 1140
Arg Gln Ala Arg Glu Pro Val Asp Phe Val Gly Ala Leu Gly Ala
1145 1150 1155
Val Gln Glu His Lys Phe Pro Gly Leu Thr Phe Leu Glu Ile Gly
1160 1165 1170
Pro Asp Pro Val Cys Ser Gly Leu Val Asn Ala Thr Leu Gly Ala
1175 1180 1185
Asp Glu Ala Ala Leu Arg Cys Val Ala Ser Met His Arg Gly Lys
1190 1195 1200
Ala Asn Trp Ala Ser Ile Ser Cys Ser Leu Arg Asp Leu Tyr Thr
1205 1210 1215
Ala Gly Ala Ala Ile Asp Trp Pro Ala His His Arg Asp Phe Lys
1220 1225 1230
Ser Ser Val Ser Leu Leu Asp Leu Pro Lys Tyr Ser Phe Asp Glu
1235 1240 1245
Lys Glu Phe Trp Ala Ser Phe Pro Asp Arg Asp Leu Gln Thr Ile
1250 1255 1260
Gly Asp Val Glu Thr Lys His Ser Gln Pro Pro Ala Ile Val Pro
1265 1270 1275
Ser Val Gln Gly Tyr Cys Thr Thr Thr Leu Gln Arg Ile Thr Arg
1280 1285 1290
Glu Thr Ile Glu Pro Asp Gly Leu Ser Val Thr Phe Ser Ser Asp
1295 1300 1305
Leu Ala Asp Gln His Leu Arg Ala Ala Val Arg Gly His Ala Val
1310 1315 1320
Ala Asp Val Glu Ile Cys Ser Ser Ser Leu Leu Leu Asp Met Ala
1325 1330 1335
Leu Ser Ala Ala Gln Tyr Ala Tyr Met Lys His Ser Pro Gly Gln
1340 1345 1350
Lys Met Pro Val Pro Leu Thr Val Arg Asn Cys Tyr Phe His Arg
1355 1360 1365
Gly Val Val Leu Thr Asp Lys Ala Gln Thr Val Glu Val Thr Val
1370 1375 1380
Thr Leu Thr Ser Ser Ala Lys Thr Ala Asp Ile Arg Tyr His Cys
1385 1390 1395
Arg Thr Pro Asp Glu Tyr Tyr Glu Val Gly Ala Cys Gln Val Val
1400 1405 1410
Leu Lys Pro Ala Ser Lys Pro Asp Gln Ala Gly Phe Leu Val Arg
1415 1420 1425
Ser Arg Met Ala Ala Leu Lys Ala Ser Ala Ser His Arg Leu Gly
1430 1435 1440
Arg Arg Ala Val Tyr Arg Leu Phe Asp Asn Val Val Arg Tyr Ser
1445 1450 1455
Glu Gln Tyr Gln Gly Leu Glu Asn Val His Leu Ser Glu Asp Met
1460 1465 1470
Gln Asp Ala Val Ala Glu Ile Asn Met Ala His Val Pro Ala Ala
1475 1480 1485
Gly Gly His Tyr Leu His His Pro Phe Leu Leu Asp Ser Ile Val
1490 1495 1500
His Leu Ser Gly Phe Leu Val Asn Asn Gly Leu Arg Tyr Ser Ser
1505 1510 1515
Glu Trp Ala Cys Leu Ser Thr Gly Phe Asp Glu Trp His Leu Leu
1520 1525 1530
Lys Pro Leu Asp Pro Thr Thr Val Tyr Thr Ser Tyr Thr Phe Met
1535 1540 1545
Glu Asp Ser Arg Ser Thr Ser Asn Leu Val Thr Gly Asp Val Tyr
1550 1555 1560
Val Tyr Asp Gly Glu Glu Leu Val Ser Val Leu Thr Gly Leu Gln
1565 1570 1575
Phe Gln Lys Met Lys Arg Thr Ala Leu Thr His Leu Leu Ser Pro
1580 1585 1590
Pro Thr Val Gly Thr Met Ala Ala Lys Pro Ser Thr Cys Met Pro
1595 1600 1605
Thr Met Gly Gln Thr Glu Pro Leu Pro Ala Gln Ala Arg Val Ala
1610 1615 1620
Gly Leu Pro Val Pro Thr Pro Pro Ala Thr Ala Ser Val Asp Asp
1625 1630 1635
Gly Glu Gly Glu Lys Phe Asp Leu Val Asn Thr Leu Phe Ser Ile
1640 1645 1650
Ile Ala Arg Glu Val Gly Val Glu Pro Ser Asp Leu Glu Gly Asp
1655 1660 1665
Val Asn Leu Ala Asn Leu Gly Ile Asp Ser Leu Met Ala Ile Thr
1670 1675 1680
Ile Ile Ser Val Met Gln Gln Glu Thr Gly Val Glu Leu Pro Gly
1685 1690 1695
Thr Phe Phe Leu Asp Asn Ala Thr Thr Thr Ala Val Ile Ala Ala
1700 1705 1710
Val Gly
1715
<210> 5
<211> 1128
<212> DNA
<213> Metarhizium anisopliae
<220>
<221> CDS
<222> (1)..(1128)
<400> 5
atg gcc gtc acc gtg tgg caa gat gcg ctc aac atc att gcg cag gag 48
Met Ala Val Thr Val Trp Gln Asp Ala Leu Asn Ile Ile Ala Gln Glu
1 5 10 15
agc ggg ctg gag ccc gca gaa atc atc gag acg gac gac acg gcg ttt 96
Ser Gly Leu Glu Pro Ala Glu Ile Ile Glu Thr Asp Asp Thr Ala Phe
20 25 30
ctc acg ctc ggc atc aat cag atc ctc gcc aca gcc atc ttg gcg cac 144
Leu Thr Leu Gly Ile Asn Gln Ile Leu Ala Thr Ala Ile Leu Ala His
35 40 45
ctc aga ggg cct cgt gga gag cct ctc cca cga gac atc ttt gac cag 192
Leu Arg Gly Pro Arg Gly Glu Pro Leu Pro Arg Asp Ile Phe Asp Gln
50 55 60
aag ccc acg gtt ggt gcg ttc cgc cgc ttc tat gag acc cct att cac 240
Lys Pro Thr Val Gly Ala Phe Arg Arg Phe Tyr Glu Thr Pro Ile His
65 70 75 80
ctg gac att gct ccc gtc gcg gca ccg gcg ccg ccc aag ctg aag cgc 288
Leu Asp Ile Ala Pro Val Ala Ala Pro Ala Pro Pro Lys Leu Lys Arg
85 90 95
gtg ccg tcg tct tct gtc ccg ctg tcc atc gtc ttg caa aac aac ccg 336
Val Pro Ser Ser Ser Val Pro Leu Ser Ile Val Leu Gln Asn Asn Pro
100 105 110
gcg tcg agc cgg cac acg gtg ttc ctc ctc ccg gac ggc agc ggc tcg 384
Ala Ser Ser Arg His Thr Val Phe Leu Leu Pro Asp Gly Ser Gly Ser
115 120 125
gcc atg gcc tac gcc aac ctc ccg ccc gtc cac ccg gcc gtc tgc atc 432
Ala Met Ala Tyr Ala Asn Leu Pro Pro Val His Pro Ala Val Cys Ile
130 135 140
gtc ggc atg aac agc ccg tac ctg cgc gac gcc ggc tcg tac cgc tgc 480
Val Gly Met Asn Ser Pro Tyr Leu Arg Asp Ala Gly Ser Tyr Arg Cys
145 150 155 160
tcc gtg gaa gac ctg gca tcg caa tgg gtc cgt gaa gtc tac cgc cgc 528
Ser Val Glu Asp Leu Ala Ser Gln Trp Val Arg Glu Val Tyr Arg Arg
165 170 175
caa cca cgg ggg ccg tac att gtc ggc ggg tgg tca gcg gga ggc tac 576
Gln Pro Arg Gly Pro Tyr Ile Val Gly Gly Trp Ser Ala Gly Gly Tyr
180 185 190
tac tcg tac gaa gtg gcc aag cgc ctc ctg cag gac gga cac gcc gtc 624
Tyr Ser Tyr Glu Val Ala Lys Arg Leu Leu Gln Asp Gly His Ala Val
195 200 205
gcc aag ctg atc ctg atc gac tcg ccg tgc cgc acc gtc ttt gag tcc 672
Ala Lys Leu Ile Leu Ile Asp Ser Pro Cys Arg Thr Val Phe Glu Ser
210 215 220
ctg tcc atg gac gtc gtc aac tac ctc tcc tct cga aac ctc atg ggc 720
Leu Ser Met Asp Val Val Asn Tyr Leu Ser Ser Arg Asn Leu Met Gly
225 230 235 240
aac tgg ggc tct ccg gaa atg ccc gag tgg ctg gtg cag cat ttc cgc 768
Asn Trp Gly Ser Pro Glu Met Pro Glu Trp Leu Val Gln His Phe Arg
245 250 255
tcg acg ctc gcc gcc gtg ggc aag tac cgc ccg cgg ccc atc gac tcg 816
Ser Thr Leu Ala Ala Val Gly Lys Tyr Arg Pro Arg Pro Ile Asp Ser
260 265 270
gct ggc aag atg cag acg tac atc atc tgg agc cga gac ggc gtg ctg 864
Ala Gly Lys Met Gln Thr Tyr Ile Ile Trp Ser Arg Asp Gly Val Leu
275 280 285
gac caa gac gcg ctg gcc agg tct gga ctc gac acg agc gtc aag gtg 912
Asp Gln Asp Ala Leu Ala Arg Ser Gly Leu Asp Thr Ser Val Lys Val
290 295 300
tcg cga ttt ctg ctg cag ggc aag gat gac ctg ggg ccg aat gga tgg 960
Ser Arg Phe Leu Leu Gln Gly Lys Asp Asp Leu Gly Pro Asn Gly Trp
305 310 315 320
gac gac ctg ctg ccc agc aag gac atg gct att gcg acg caa tcg ggg 1008
Asp Asp Leu Leu Pro Ser Lys Asp Met Ala Ile Ala Thr Gln Ser Gly
325 330 335
acg cac ttc acc atg att aac aag cct cat gtg gcc cag atg agc gat 1056
Thr His Phe Thr Met Ile Asn Lys Pro His Val Ala Gln Met Ser Asp
340 345 350
ctc ttg cgt gat gcc gtg att ggc atc ggc tct gac cga cag gcg cac 1104
Leu Leu Arg Asp Ala Val Ile Gly Ile Gly Ser Asp Arg Gln Ala His
355 360 365
tgg cag cga gtg agc cag tca tga 1128
Trp Gln Arg Val Ser Gln Ser
370 375
<210> 6
<211> 375
<212> PRT
<213> Metarhizium anisopliae
<400> 6
Met Ala Val Thr Val Trp Gln Asp Ala Leu Asn Ile Ile Ala Gln Glu
1 5 10 15
Ser Gly Leu Glu Pro Ala Glu Ile Ile Glu Thr Asp Asp Thr Ala Phe
20 25 30
Leu Thr Leu Gly Ile Asn Gln Ile Leu Ala Thr Ala Ile Leu Ala His
35 40 45
Leu Arg Gly Pro Arg Gly Glu Pro Leu Pro Arg Asp Ile Phe Asp Gln
50 55 60
Lys Pro Thr Val Gly Ala Phe Arg Arg Phe Tyr Glu Thr Pro Ile His
65 70 75 80
Leu Asp Ile Ala Pro Val Ala Ala Pro Ala Pro Pro Lys Leu Lys Arg
85 90 95
Val Pro Ser Ser Ser Val Pro Leu Ser Ile Val Leu Gln Asn Asn Pro
100 105 110
Ala Ser Ser Arg His Thr Val Phe Leu Leu Pro Asp Gly Ser Gly Ser
115 120 125
Ala Met Ala Tyr Ala Asn Leu Pro Pro Val His Pro Ala Val Cys Ile
130 135 140
Val Gly Met Asn Ser Pro Tyr Leu Arg Asp Ala Gly Ser Tyr Arg Cys
145 150 155 160
Ser Val Glu Asp Leu Ala Ser Gln Trp Val Arg Glu Val Tyr Arg Arg
165 170 175
Gln Pro Arg Gly Pro Tyr Ile Val Gly Gly Trp Ser Ala Gly Gly Tyr
180 185 190
Tyr Ser Tyr Glu Val Ala Lys Arg Leu Leu Gln Asp Gly His Ala Val
195 200 205
Ala Lys Leu Ile Leu Ile Asp Ser Pro Cys Arg Thr Val Phe Glu Ser
210 215 220
Leu Ser Met Asp Val Val Asn Tyr Leu Ser Ser Arg Asn Leu Met Gly
225 230 235 240
Asn Trp Gly Ser Pro Glu Met Pro Glu Trp Leu Val Gln His Phe Arg
245 250 255
Ser Thr Leu Ala Ala Val Gly Lys Tyr Arg Pro Arg Pro Ile Asp Ser
260 265 270
Ala Gly Lys Met Gln Thr Tyr Ile Ile Trp Ser Arg Asp Gly Val Leu
275 280 285
Asp Gln Asp Ala Leu Ala Arg Ser Gly Leu Asp Thr Ser Val Lys Val
290 295 300
Ser Arg Phe Leu Leu Gln Gly Lys Asp Asp Leu Gly Pro Asn Gly Trp
305 310 315 320
Asp Asp Leu Leu Pro Ser Lys Asp Met Ala Ile Ala Thr Gln Ser Gly
325 330 335
Thr His Phe Thr Met Ile Asn Lys Pro His Val Ala Gln Met Ser Asp
340 345 350
Leu Leu Arg Asp Ala Val Ile Gly Ile Gly Ser Asp Arg Gln Ala His
355 360 365
Trp Gln Arg Val Ser Gln Ser
370 375
<210> 7
<211> 7533
<212> DNA
<213> Metarhizium rileyi
<220>
<221> CDS
<222> (1)..(7533)
<400> 7
atg gag gct tcg tca caa tca aga gac gac atc gcc gtc att ggg tta 48
Met Glu Ala Ser Ser Gln Ser Arg Asp Asp Ile Ala Val Ile Gly Leu
1 5 10 15
tcg tgc cgc ttc ccg ggt gaa gca gac aca gcc gag cac ttt tgg gac 96
Ser Cys Arg Phe Pro Gly Glu Ala Asp Thr Ala Glu His Phe Trp Asp
20 25 30
ttc att tgc aac gga cgc aat gca tac tct gaa aac ccg gat cgg tgg 144
Phe Ile Cys Asn Gly Arg Asn Ala Tyr Ser Glu Asn Pro Asp Arg Trp
35 40 45
aat ccg gat gct ttc cac tac ggc gag aag aag ctc aac acc agc ttg 192
Asn Pro Asp Ala Phe His Tyr Gly Glu Lys Lys Leu Asn Thr Ser Leu
50 55 60
ccc cgg gga gga cat ttc atg aag caa gat gtg gcc gcc ttt gat gcc 240
Pro Arg Gly Gly His Phe Met Lys Gln Asp Val Ala Ala Phe Asp Ala
65 70 75 80
aac ttc ttc aac ctc tcc aag gtc gag gcg gag tcc atg gac ccc cag 288
Asn Phe Phe Asn Leu Ser Lys Val Glu Ala Glu Ser Met Asp Pro Gln
85 90 95
cag cgc atc gtc atg gag gtg acg tac gag tcc atg gag agc gca ggg 336
Gln Arg Ile Val Met Glu Val Thr Tyr Glu Ser Met Glu Ser Ala Gly
100 105 110
ctc cgc gtc gac cgg ctc gct ggc tct cgc acc ggc gtc ttc atg gcc 384
Leu Arg Val Asp Arg Leu Ala Gly Ser Arg Thr Gly Val Phe Met Ala
115 120 125
agt ttc acc agc gac tac cga gaa atg ctc tat cgt gat gct gag acg 432
Ser Phe Thr Ser Asp Tyr Arg Glu Met Leu Tyr Arg Asp Ala Glu Thr
130 135 140
gcg cct ctc tac acc gcg acg ggc act agc aac aca tca acc tcg aac 480
Ala Pro Leu Tyr Thr Ala Thr Gly Thr Ser Asn Thr Ser Thr Ser Asn
145 150 155 160
cgt gtc tcg tgg ttt ttc gac ttg cgc ggg cct agc ttt acc gtg aac 528
Arg Val Ser Trp Phe Phe Asp Leu Arg Gly Pro Ser Phe Thr Val Asn
165 170 175
aca gcc tgc tcc tcc agt ctg gta gca tgc cat ctc gcc tgc cag agt 576
Thr Ala Cys Ser Ser Ser Leu Val Ala Cys His Leu Ala Cys Gln Ser
180 185 190
ctg tgg aat ggc gag acg gag agc gcc atc gtc ggc ggc acc agc ctg 624
Leu Trp Asn Gly Glu Thr Glu Ser Ala Ile Val Gly Gly Thr Ser Leu
195 200 205
ctg ctc aac ccc gac atg ttt ctg tac ctc tcc aac cag cgg ttc ctg 672
Leu Leu Asn Pro Asp Met Phe Leu Tyr Leu Ser Asn Gln Arg Phe Leu
210 215 220
gcc ccc gac ggc cag tgc aaa agc ttc gac gag tcc ggc gac ggc tac 720
Ala Pro Asp Gly Gln Cys Lys Ser Phe Asp Glu Ser Gly Asp Gly Tyr
225 230 235 240
gcc aga ggt gat ggc atc ggc gtt gtc att ctg aag cgc gtt gct gac 768
Ala Arg Gly Asp Gly Ile Gly Val Val Ile Leu Lys Arg Val Ala Asp
245 250 255
gcc gtt cgc gat ggc gat ccg atc cga gcc gtg atc cgt ggc agc gga 816
Ala Val Arg Asp Gly Asp Pro Ile Arg Ala Val Ile Arg Gly Ser Gly
260 265 270
tgc aac caa gac ggc cac aca aag ggc ttc acc atc ccc agt gtt gag 864
Cys Asn Gln Asp Gly His Thr Lys Gly Phe Thr Ile Pro Ser Val Glu
275 280 285
gcg caa gcc tct ctt atc gag gag acg tac cgc aaa gca ggt ctt tca 912
Ala Gln Ala Ser Leu Ile Glu Glu Thr Tyr Arg Lys Ala Gly Leu Ser
290 295 300
ctt gca gag acg cgt tac gta gag gcc cac ggg acc ggc acc cag gcg 960
Leu Ala Glu Thr Arg Tyr Val Glu Ala His Gly Thr Gly Thr Gln Ala
305 310 315 320
ggc gac acg tgt gag atg gag ggt atc gca cga aca ttc ggc cag cac 1008
Gly Asp Thr Cys Glu Met Glu Gly Ile Ala Arg Thr Phe Gly Gln His
325 330 335
cgg ggc gac tca gat gat ctg cta gtc gga tct gtc aag tca aat att 1056
Arg Gly Asp Ser Asp Asp Leu Leu Val Gly Ser Val Lys Ser Asn Ile
340 345 350
gga cat ctc gaa gct tgc gct gga ctg gcc tcg ctc ata aag tgc atc 1104
Gly His Leu Glu Ala Cys Ala Gly Leu Ala Ser Leu Ile Lys Cys Ile
355 360 365
ttc att ctg gaa aca ggc gtg ata cca ccg acg ccc agt gtc cgc gtt 1152
Phe Ile Leu Glu Thr Gly Val Ile Pro Pro Thr Pro Ser Val Arg Val
370 375 380
ctc aac ccc aag atc cgc tgg gag gaa tgg cat ctc aag gtt ccc tcg 1200
Leu Asn Pro Lys Ile Arg Trp Glu Glu Trp His Leu Lys Val Pro Ser
385 390 395 400
aaa caa act cct tgg cca acc gac ggc cta cgg cga gtg agc aca cag 1248
Lys Gln Thr Pro Trp Pro Thr Asp Gly Leu Arg Arg Val Ser Thr Gln
405 410 415
ggt ttc gga tac ggt ggt aca aac gcc cat ctg att ctc gac gat gca 1296
Gly Phe Gly Tyr Gly Gly Thr Asn Ala His Leu Ile Leu Asp Asp Ala
420 425 430
gcc cac tat ctc gag ggg cga agt ctc agg ggt cat cat tac act cgc 1344
Ala His Tyr Leu Glu Gly Arg Ser Leu Arg Gly His His Tyr Thr Arg
435 440 445
aca cat cct cag gcg cag agg ctt ttg acc tct gca atc cac ggg gct 1392
Thr His Pro Gln Ala Gln Arg Leu Leu Thr Ser Ala Ile His Gly Ala
450 455 460
tcg cca aag gaa cag ctg ccg cgt ttg ttt ctg ttc cgc gcg aat gat 1440
Ser Pro Lys Glu Gln Leu Pro Arg Leu Phe Leu Phe Arg Ala Asn Asp
465 470 475 480
cgt gag ggc ctt ggg cgt gtc cgg gcg tct ttg gca caa cat ctc gac 1488
Arg Glu Gly Leu Gly Arg Val Arg Ala Ser Leu Ala Gln His Leu Asp
485 490 495
caa ctc ctg ccc tcg tgg tcc cag gac tcg agc ggc cgt gat gca tac 1536
Gln Leu Leu Pro Ser Trp Ser Gln Asp Ser Ser Gly Arg Asp Ala Tyr
500 505 510
ctc cag aac ttg gcc ttt acc ctc gcc agc cga cga tcc aat ctc aaa 1584
Leu Gln Asn Leu Ala Phe Thr Leu Ala Ser Arg Arg Ser Asn Leu Lys
515 520 525
tgg cag acg tat gcc acg gct tct acc ccg gac gag ttg ctt caa gtg 1632
Trp Gln Thr Tyr Ala Thr Ala Ser Thr Pro Asp Glu Leu Leu Gln Val
530 535 540
ctc aag acc aag ggc gac gca tgg gcg agt ccc gag gct cgc ctt gcc 1680
Leu Lys Thr Lys Gly Asp Ala Trp Ala Ser Pro Glu Ala Arg Leu Ala
545 550 555 560
gcg tca tcc ccc cgt ctt ggc ttt att ttc acc ggc cag ggc gct caa 1728
Ala Ser Ser Pro Arg Leu Gly Phe Ile Phe Thr Gly Gln Gly Ala Gln
565 570 575
tgg gct cgc atg ggt gtt gag ctc atg gga tat ccc gtg ttt cgc caa 1776
Trp Ala Arg Met Gly Val Glu Leu Met Gly Tyr Pro Val Phe Arg Gln
580 585 590
agc gtc gag gag tcg gag cac ttc ctg cgc gag act ctc ggc tgt ccc 1824
Ser Val Glu Glu Ser Glu His Phe Leu Arg Glu Thr Leu Gly Cys Pro
595 600 605
tgg tct gcc atc gat gag ctg gcc aag ccg cag acc acg tcc cgt ctc 1872
Trp Ser Ala Ile Asp Glu Leu Ala Lys Pro Gln Thr Thr Ser Arg Leu
610 615 620
tcc gag gca gcc tac agt cag acg ctg tgc acc gta ctt caa att gcc 1920
Ser Glu Ala Ala Tyr Ser Gln Thr Leu Cys Thr Val Leu Gln Ile Ala
625 630 635 640
att gta gac ttg ctt caa gac tgg aat gtc tct ccc act cgc gtt gcc 1968
Ile Val Asp Leu Leu Gln Asp Trp Asn Val Ser Pro Thr Arg Val Ala
645 650 655
ggg cac tca agt ggc gaa ata gcg gcg gca tat tgc cta ggc gcc ctg 2016
Gly His Ser Ser Gly Glu Ile Ala Ala Ala Tyr Cys Leu Gly Ala Leu
660 665 670
acc aag cag gac agt ctg aga gtc gcc tac tac cga gga atc ctg tcg 2064
Thr Lys Gln Asp Ser Leu Arg Val Ala Tyr Tyr Arg Gly Ile Leu Ser
675 680 685
tca gag atg caa gaa aca cac aag gac caa aag gga gcc atg atg gcc 2112
Ser Glu Met Gln Glu Thr His Lys Asp Gln Lys Gly Ala Met Met Ala
690 695 700
atc ggg gcc tcc ccc gag acg gta gca cag tgg ttg gca cag ctg act 2160
Ile Gly Ala Ser Pro Glu Thr Val Ala Gln Trp Leu Ala Gln Leu Thr
705 710 715 720
cgg gga aaa gtc gtc gtt gcc tgc atc aac tcg ccg acg agt gtc acg 2208
Arg Gly Lys Val Val Val Ala Cys Ile Asn Ser Pro Thr Ser Val Thr
725 730 735
gca tcc ggc gac gca gcg ggc atc gac gag ctc ctt tcc ata gta caa 2256
Ala Ser Gly Asp Ala Ala Gly Ile Asp Glu Leu Leu Ser Ile Val Gln
740 745 750
gag gcg gga gtc ttt gga cgc aag ttg aaa gtg gac gtg gca tat cac 2304
Glu Ala Gly Val Phe Gly Arg Lys Leu Lys Val Asp Val Ala Tyr His
755 760 765
tcg cat cat atg cag tcg gtt tct gcg gcc tac tct gcg ctc ctg aag 2352
Ser His His Met Gln Ser Val Ser Ala Ala Tyr Ser Ala Leu Leu Lys
770 775 780
gac ctc aag ccg ctg cca gcg cac gag ggc cgc acc atg cat tcg agc 2400
Asp Leu Lys Pro Leu Pro Ala His Glu Gly Arg Thr Met His Ser Ser
785 790 795 800
gta ttg ggt ggc ttg ata gac acc gca gag ctt ggt gcg tcc aac tgg 2448
Val Leu Gly Gly Leu Ile Asp Thr Ala Glu Leu Gly Ala Ser Asn Trp
805 810 815
gtg cgg aac ctg att tca ccg gtg cgt ttc tct gaa gcc gtc tcg agc 2496
Val Arg Asn Leu Ile Ser Pro Val Arg Phe Ser Glu Ala Val Ser Ser
820 825 830
ctc atc ttg gac ggg gac aag cca gcc gtc gat atg ctc atc gag atc 2544
Leu Ile Leu Asp Gly Asp Lys Pro Ala Val Asp Met Leu Ile Glu Ile
835 840 845
ggg cca cac gct gcg ctc aag gga ccc gtc cag gaa aca cta gag gcc 2592
Gly Pro His Ala Ala Leu Lys Gly Pro Val Gln Glu Thr Leu Glu Ala
850 855 860
aag ggc gtc tcc gcg gtc aag tac acg agc gtc gtg tct cgg ggc cag 2640
Lys Gly Val Ser Ala Val Lys Tyr Thr Ser Val Val Ser Arg Gly Gln
865 870 875 880
aat gct gtc aag acg gct ttg gcc tgc gcg ggc gag ctc gtc aac tcg 2688
Asn Ala Val Lys Thr Ala Leu Ala Cys Ala Gly Glu Leu Val Asn Ser
885 890 895
agc gtc ccc gtt gca atg gat cgt gta aat ctc gag tcg gag ctg caa 2736
Ser Val Pro Val Ala Met Asp Arg Val Asn Leu Glu Ser Glu Leu Gln
900 905 910
ccg agc ccg ctg gtc gat ctt cca tca tac cca tgg aac cgc tcg acc 2784
Pro Ser Pro Leu Val Asp Leu Pro Ser Tyr Pro Trp Asn Arg Ser Thr
915 920 925
cgg ttc tgg gcc gag tca cgt ctt tct caa gaa tat cgg ctt cgc aag 2832
Arg Phe Trp Ala Glu Ser Arg Leu Ser Gln Glu Tyr Arg Leu Arg Lys
930 935 940
cat gcc cgc ctg ccc ctg ctg gga agt ccg tgt ccc acg atg ggc gcc 2880
His Ala Arg Leu Pro Leu Leu Gly Ser Pro Cys Pro Thr Met Gly Ala
945 950 955 960
cgt gag aga tac tgg cgc ggc atg gtg agg ctg gac gag gag ccc tgg 2928
Arg Glu Arg Tyr Trp Arg Gly Met Val Arg Leu Asp Glu Glu Pro Trp
965 970 975
atc cga gac cat gag atc caa ggg tct atc ctg tat cct ggt gcc ggt 2976
Ile Arg Asp His Glu Ile Gln Gly Ser Ile Leu Tyr Pro Gly Ala Gly
980 985 990
ttc ctg atc atg gcc atc gaa gcc gct tct cag caa gca aac gaa cag 3024
Phe Leu Ile Met Ala Ile Glu Ala Ala Ser Gln Gln Ala Asn Glu Gln
995 1000 1005
cgc aaa gtg agc gcg ttt cgt ctg cgc gat gtg cac ctt gat gcc 3069
Arg Lys Val Ser Ala Phe Arg Leu Arg Asp Val His Leu Asp Ala
1010 1015 1020
gcc ttg gtg gtc acg gac aac agc act gcc gag gca att cta caa 3114
Ala Leu Val Val Thr Asp Asn Ser Thr Ala Glu Ala Ile Leu Gln
1025 1030 1035
ctt cgc ccg cat ctc ctc gcg ccg gga agc agc cag tcg tct tgg 3159
Leu Arg Pro His Leu Leu Ala Pro Gly Ser Ser Gln Ser Ser Trp
1040 1045 1050
atg gag ttt acc gtc aac tca tcc att gat ggc ggt gcc ctg cgt 3204
Met Glu Phe Thr Val Asn Ser Ser Ile Asp Gly Gly Ala Leu Arg
1055 1060 1065
cag aac tgc tcc ggc ctc atc atg atc gag tac gag gct gac gca 3249
Gln Asn Cys Ser Gly Leu Ile Met Ile Glu Tyr Glu Ala Asp Ala
1070 1075 1080
gac tcg gcc atg gcc cgt gaa cgt agc ttg gag tca gac acg gtt 3294
Asp Ser Ala Met Ala Arg Glu Arg Ser Leu Glu Ser Asp Thr Val
1085 1090 1095
tgt gat ttg tac aag aag acg tac att tcc tgc cgg cag tct gtc 3339
Cys Asp Leu Tyr Lys Lys Thr Tyr Ile Ser Cys Arg Gln Ser Val
1100 1105 1110
gat gtg gcc aag ttc tac tcc cgt ctc gcc tct ctt ggc ctc acc 3384
Asp Val Ala Lys Phe Tyr Ser Arg Leu Ala Ser Leu Gly Leu Thr
1115 1120 1125
tac ggg ccg gcg ttt gca aac ttg aca gag atc cgg agg acg ggc 3429
Tyr Gly Pro Ala Phe Ala Asn Leu Thr Glu Ile Arg Arg Thr Gly
1130 1135 1140
aac ggc cag tgt acc ggc gcc gtt cgt gtt ccc gct gtc gaa agc 3474
Asn Gly Gln Cys Thr Gly Ala Val Arg Val Pro Ala Val Glu Ser
1145 1150 1155
ctg gtg cct cca gca tac cgc agc cat cct cat gtc atc cat ccg 3519
Leu Val Pro Pro Ala Tyr Arg Ser His Pro His Val Ile His Pro
1160 1165 1170
ggg acg ttg gac gcc atc ttc cat ctt gcc ttt gcg gcc ctc gag 3564
Gly Thr Leu Asp Ala Ile Phe His Leu Ala Phe Ala Ala Leu Glu
1175 1180 1185
gac tct ctg ctt ccc ggt ccc atg gtc cca acg aca atc gat ggg 3609
Asp Ser Leu Leu Pro Gly Pro Met Val Pro Thr Thr Ile Asp Gly
1190 1195 1200
cta gtc gtt gca gca aac act cca aac gag ccc ggc act ttg ctt 3654
Leu Val Val Ala Ala Asn Thr Pro Asn Glu Pro Gly Thr Leu Leu
1205 1210 1215
cgc gga gtt tcg cag tct tct cca cat gga ttc agg gag ctc atc 3699
Arg Gly Val Ser Gln Ser Ser Pro His Gly Phe Arg Glu Leu Ile
1220 1225 1230
tcc gac att gac gtg ctg gat gat cag agc agc aga gcc gtt gta 3744
Ser Asp Ile Asp Val Leu Asp Asp Gln Ser Ser Arg Ala Val Val
1235 1240 1245
cag atc aag ggc ttc cgc tgc gcc gac gtc tcc gga ggc agc gcg 3789
Gln Ile Lys Gly Phe Arg Cys Ala Asp Val Ser Gly Gly Ser Ala
1250 1255 1260
aat tcg tca gac gcg gag cct gca gag gct cgt ccg atc agc ttc 3834
Asn Ser Ser Asp Ala Glu Pro Ala Glu Ala Arg Pro Ile Ser Phe
1265 1270 1275
cgt ctc aac tgg aag cca gca atc gac ctg ctt tct gct gag cag 3879
Arg Leu Asn Trp Lys Pro Ala Ile Asp Leu Leu Ser Ala Glu Gln
1280 1285 1290
ctg cgg aaa tat gtt ggt cgt gtt gcc aaa caa gca gat gct tct 3924
Leu Arg Lys Tyr Val Gly Arg Val Ala Lys Gln Ala Asp Ala Ser
1295 1300 1305
tcc cat ctc att cgt gcc acg gaa cta aac aac cag gtt gga aat 3969
Ser His Leu Ile Arg Ala Thr Glu Leu Asn Asn Gln Val Gly Asn
1310 1315 1320
ctt ccg gaa act gca cca tca gct gca ttg gat gcc gtc acg gaa 4014
Leu Pro Glu Thr Ala Pro Ser Ala Ala Leu Asp Ala Val Thr Glu
1325 1330 1335
aaa gcc act cga tgg ttc gct gcc aag tct gcg aag ctc gtc gac 4059
Lys Ala Thr Arg Trp Phe Ala Ala Lys Ser Ala Lys Leu Val Asp
1340 1345 1350
ggt gct gcc acg gca tcc agc gct tca tcc tca ggg ggc tac gtc 4104
Gly Ala Ala Thr Ala Ser Ser Ala Ser Ser Ser Gly Gly Tyr Val
1355 1360 1365
gac gca acg aga gac gca tgg gca gca gtg cga gaa ggc cgt atc 4149
Asp Ala Thr Arg Asp Ala Trp Ala Ala Val Arg Glu Gly Arg Ile
1370 1375 1380
cca tca cca gag aaa caa gac agg gtg ttg aga gag gta gag aag 4194
Pro Ser Pro Glu Lys Gln Asp Arg Val Leu Arg Glu Val Glu Lys
1385 1390 1395
aac ggc gca ctg tcc acc tta ctg ggg gcg ctc gac gcg tac atg 4239
Asn Gly Ala Leu Ser Thr Leu Leu Gly Ala Leu Asp Ala Tyr Met
1400 1405 1410
gat ctt cgc cat cat gcg aag ccc aac ttg tca gtt ctc gag ctg 4284
Asp Leu Arg His His Ala Lys Pro Asn Leu Ser Val Leu Glu Leu
1415 1420 1425
agc tta gac gcg gtg ccg tac tct att ttc gca gcc ctg ccc agt 4329
Ser Leu Asp Ala Val Pro Tyr Ser Ile Phe Ala Ala Leu Pro Ser
1430 1435 1440
cgg cag agc att ctc cag aca gcc cag tat gct att cga gta tct 4374
Arg Gln Ser Ile Leu Gln Thr Ala Gln Tyr Ala Ile Arg Val Ser
1445 1450 1455
caa gac ggc gtc cag gat cga att agg agt caa ttc ggg tcc caa 4419
Gln Asp Gly Val Gln Asp Arg Ile Arg Ser Gln Phe Gly Ser Gln
1460 1465 1470
gga tct ggc atc gac gtt gcc gtc acg gat ttc acc caa aag atc 4464
Gly Ser Gly Ile Asp Val Ala Val Thr Asp Phe Thr Gln Lys Ile
1475 1480 1485
gac gag aca ttg ggg aag cat gat gta att ctc ata ttt gat cct 4509
Asp Glu Thr Leu Gly Lys His Asp Val Ile Leu Ile Phe Asp Pro
1490 1495 1500
ggc ttc tta cac gcc aag ctc gag gtc gtc ttg cga aac gcc cgc 4554
Gly Phe Leu His Ala Lys Leu Glu Val Val Leu Arg Asn Ala Arg
1505 1510 1515
aag ctg ctg aac ccc gga ggc aag atc atc gtg gca gag gtc aac 4599
Lys Leu Leu Asn Pro Gly Gly Lys Ile Ile Val Ala Glu Val Asn
1520 1525 1530
gag ccc gga cta tat ctg ggc aca gca ctg ggc tgt ctt cac tgg 4644
Glu Pro Gly Leu Tyr Leu Gly Thr Ala Leu Gly Cys Leu His Trp
1535 1540 1545
aca aga aac ctc gac gtc tcg cag agt agc tgg aca tcg tgc ctc 4689
Thr Arg Asn Leu Asp Val Ser Gln Ser Ser Trp Thr Ser Cys Leu
1550 1555 1560
tcg cgc ttc gga ctg acg cct gcc ctg gaa ctc atc gac gca aac 4734
Ser Arg Phe Gly Leu Thr Pro Ala Leu Glu Leu Ile Asp Ala Asn
1565 1570 1575
aca gat gcc acc ggt cat ggg aag ttt cag ctc cgt ctt aca ggc 4779
Thr Asp Ala Thr Gly His Gly Lys Phe Gln Leu Arg Leu Thr Gly
1580 1585 1590
agt gcc gcg gag tcg aat ggg agt agc agc cat cag ccg cag caa 4824
Ser Ala Ala Glu Ser Asn Gly Ser Ser Ser His Gln Pro Gln Gln
1595 1600 1605
gtc acc ctc ata gaa tct gcc gat gca tct gag atg gcg caa ggc 4869
Val Thr Leu Ile Glu Ser Ala Asp Ala Ser Glu Met Ala Gln Gly
1610 1615 1620
gtc gca gaa gcg gta gcc cag cgt ctt caa gag gct tct att ccc 4914
Val Ala Glu Ala Val Ala Gln Arg Leu Gln Glu Ala Ser Ile Pro
1625 1630 1635
aca aag cgc gtc cat tgg ggc tgc gat gtc tcg caa ctc aag ggc 4959
Thr Lys Arg Val His Trp Gly Cys Asp Val Ser Gln Leu Lys Gly
1640 1645 1650
cag ccc tgc atc gtc ctg acg gac ctg cag tct gcg ctg ctg aaa 5004
Gln Pro Cys Ile Val Leu Thr Asp Leu Gln Ser Ala Leu Leu Lys
1655 1660 1665
gat ctg gca cca gag gac ctc gcg gcc ttg caa tca ctt ttc ttg 5049
Asp Leu Ala Pro Glu Asp Leu Ala Ala Leu Gln Ser Leu Phe Leu
1670 1675 1680
cat gct gag agc act ctt tgg gtg acc ggt ccc ctt ggc cca gac 5094
His Ala Glu Ser Thr Leu Trp Val Thr Gly Pro Leu Gly Pro Asp
1685 1690 1695
gcg gct ctg ata aca ggt ttg gct cgc agc gtt tgc aac gag gca 5139
Ala Ala Leu Ile Thr Gly Leu Ala Arg Ser Val Cys Asn Glu Ala
1700 1705 1710
gct gga gtt cag atc cgc acg ctt gag gtg act gat ttg ccg ata 5184
Ala Gly Val Gln Ile Arg Thr Leu Glu Val Thr Asp Leu Pro Ile
1715 1720 1725
tct gca gcc gcc ggc tat gcc gac atg gta gct cgt gtt ttc cgc 5229
Ser Ala Ala Ala Gly Tyr Ala Asp Met Val Ala Arg Val Phe Arg
1730 1735 1740
tat cgt ggc tcg gat aca gag ttt cag tgg cat tca gac gct ctg 5274
Tyr Arg Gly Ser Asp Thr Glu Phe Gln Trp His Ser Asp Ala Leu
1745 1750 1755
cta gtc agc cgg ctg act gag gat gag gac cga aac gag gag atc 5319
Leu Val Ser Arg Leu Thr Glu Asp Glu Asp Arg Asn Glu Glu Ile
1760 1765 1770
gcg cag ctg ctg gga cag gga gaa acg gcc gcg gct gag act acg 5364
Ala Gln Leu Leu Gly Gln Gly Glu Thr Ala Ala Ala Glu Thr Thr
1775 1780 1785
cta cag gag aca cca gag gga ctg aaa ctg tgc gtg agg caa ata 5409
Leu Gln Glu Thr Pro Glu Gly Leu Lys Leu Cys Val Arg Gln Ile
1790 1795 1800
ggc atg ctc gac tct gcc tgc tac gag cca gat ccg ttg gca ttg 5454
Gly Met Leu Asp Ser Ala Cys Tyr Glu Pro Asp Pro Leu Ala Leu
1805 1810 1815
gaa cca cta gag gcc ggc gag gtg gaa gtc gac gtg aag gct tca 5499
Glu Pro Leu Glu Ala Gly Glu Val Glu Val Asp Val Lys Ala Ser
1820 1825 1830
ggg gtc aac ttc cga gat gtc atg gtc gcc ctg ggg cag atc cca 5544
Gly Val Asn Phe Arg Asp Val Met Val Ala Leu Gly Gln Ile Pro
1835 1840 1845
gat cgg gct ttc gga ttc gag ggc gcc ggt gtc gtc cgc cgt gtc 5589
Asp Arg Ala Phe Gly Phe Glu Gly Ala Gly Val Val Arg Arg Val
1850 1855 1860
cac gct gaa gag tcg cgg ctt cgc cct gga gat cga gtc gtc ttc 5634
His Ala Glu Glu Ser Arg Leu Arg Pro Gly Asp Arg Val Val Phe
1865 1870 1875
ctt gct cac gga gcg cac cgc act gtt cat cgt gta cgc gcg gac 5679
Leu Ala His Gly Ala His Arg Thr Val His Arg Val Arg Ala Asp
1880 1885 1890
tat gcc atg cct atg ccc gat acc atg tcc ttt gaa gag ggc gcg 5724
Tyr Ala Met Pro Met Pro Asp Thr Met Ser Phe Glu Glu Gly Ala
1895 1900 1905
gct gtt ctc ctt gtc cac aca aca gcc tgg tac gcc ctc gtc aaa 5769
Ala Val Leu Leu Val His Thr Thr Ala Trp Tyr Ala Leu Val Lys
1910 1915 1920
tcg gca cgc gca aca gcc ggt cag tca gtc ctt gtt cat gcc gct 5814
Ser Ala Arg Ala Thr Ala Gly Gln Ser Val Leu Val His Ala Ala
1925 1930 1935
gca ggc ggt gtt ggc cag gca gtc ctc atg ctt gcc cga cat ctg 5859
Ala Gly Gly Val Gly Gln Ala Val Leu Met Leu Ala Arg His Leu
1940 1945 1950
ggc ctg gag gtt ttt gcg acg gtt ggc tcc gag gag aag agg aag 5904
Gly Leu Glu Val Phe Ala Thr Val Gly Ser Glu Glu Lys Arg Lys
1955 1960 1965
ctt gta cac gaa acg tac ggg att cct cac gac cac atg ttc aac 5949
Leu Val His Glu Thr Tyr Gly Ile Pro His Asp His Met Phe Asn
1970 1975 1980
tcg cgg gac tcc agc ttt gca atg ggc gtg aag cgg atg acc aac 5994
Ser Arg Asp Ser Ser Phe Ala Met Gly Val Lys Arg Met Thr Asn
1985 1990 1995
ggc cgc gga gtt gac att gtt gtc aat tcg ctc gct ggg gaa gct 6039
Gly Arg Gly Val Asp Ile Val Val Asn Ser Leu Ala Gly Glu Ala
2000 2005 2010
ctc cgg cag acg tgg cat tgc ctg gca ccg ttt ggc acc ttt gtc 6084
Leu Arg Gln Thr Trp His Cys Leu Ala Pro Phe Gly Thr Phe Val
2015 2020 2025
gag ctc ggc atg aag gac ata ttg gac aac gca cgc tta gac atg 6129
Glu Leu Gly Met Lys Asp Ile Leu Asp Asn Ala Arg Leu Asp Met
2030 2035 2040
aaa ccc ttc ctg cag gac gca acc ttt gtc ttc ttc aac ctg aac 6174
Lys Pro Phe Leu Gln Asp Ala Thr Phe Val Phe Phe Asn Leu Asn
2045 2050 2055
cgg gtc caa aag gag cgg cca gat ctc atg aag gag gct ctc agg 6219
Arg Val Gln Lys Glu Arg Pro Asp Leu Met Lys Glu Ala Leu Arg
2060 2065 2070
gaa acg atg gcc ctt gta tcc tct ggg gcg ctg aag cca gca acg 6264
Glu Thr Met Ala Leu Val Ser Ser Gly Ala Leu Lys Pro Ala Thr
2075 2080 2085
ccg ctc acc gca tac gca gct tct caa gtg gaa aca gca ttc cgg 6309
Pro Leu Thr Ala Tyr Ala Ala Ser Gln Val Glu Thr Ala Phe Arg
2090 2095 2100
aaa atc cag act ggg cag cac ctg ggt aag ctc gtg cta acg ttc 6354
Lys Ile Gln Thr Gly Gln His Leu Gly Lys Leu Val Leu Thr Phe
2105 2110 2115
cag acc gga gac gtt ctc cgc gtc atc aga ccg gat ctc agc ctg 6399
Gln Thr Gly Asp Val Leu Arg Val Ile Arg Pro Asp Leu Ser Leu
2120 2125 2130
ggc gac tcc ggc gcg tac ctc ctt gtt gga gga ctc ggc gga tta 6444
Gly Asp Ser Gly Ala Tyr Leu Leu Val Gly Gly Leu Gly Gly Leu
2135 2140 2145
ggt cgt agt ctt gca cgg ctg ctg gta cat ctc ggt gcc cgc cgg 6489
Gly Arg Ser Leu Ala Arg Leu Leu Val His Leu Gly Ala Arg Arg
2150 2155 2160
cta tgt ttc ttg tct cgg tct ggt gca aaa agc agc gag gca cag 6534
Leu Cys Phe Leu Ser Arg Ser Gly Ala Lys Ser Ser Glu Ala Gln
2165 2170 2175
gcg ctc gtc cag gaa ctc gag ttg cag cac cga gtt cgc gtg ctt 6579
Ala Leu Val Gln Glu Leu Glu Leu Gln His Arg Val Arg Val Leu
2180 2185 2190
gtc tgc caa ggg gat gtg tcc gac agc gac acg gtg gct cgc gtc 6624
Val Cys Gln Gly Asp Val Ser Asp Ser Asp Thr Val Ala Arg Val
2195 2200 2205
gtt cag caa tgc acc acg acc ctc ggg ccc atc cgt ggc gtc gtc 6669
Val Gln Gln Cys Thr Thr Thr Leu Gly Pro Ile Arg Gly Val Val
2210 2215 2220
cag tgt gcc atg att ctc cgg gat ggc ctg ttt gag aga atg aca 6714
Gln Cys Ala Met Ile Leu Arg Asp Gly Leu Phe Glu Arg Met Thr
2225 2230 2235
cac gag cag tgg acc gag agc acg cgg ccg aag gtg cag ggc acg 6759
His Glu Gln Trp Thr Glu Ser Thr Arg Pro Lys Val Gln Gly Thr
2240 2245 2250
tgg aac ttg cat gag cag atc cca tcg gcc gac ttc ttc atc acg 6804
Trp Asn Leu His Glu Gln Ile Pro Ser Ala Asp Phe Phe Ile Thr
2255 2260 2265
ctg agc tcc ttt gca ggc gtg ttt gga agc cgc ggg cag agc aac 6849
Leu Ser Ser Phe Ala Gly Val Phe Gly Ser Arg Gly Gln Ser Asn
2270 2275 2280
tac gcc gct gcg ggt gcg tac gag gat gcc ttg gca cat ttc cga 6894
Tyr Ala Ala Ala Gly Ala Tyr Glu Asp Ala Leu Ala His Phe Arg
2285 2290 2295
acg tct ctg gga cag agg gct atc acc atc gac ttg ggc atc atg 6939
Thr Ser Leu Gly Gln Arg Ala Ile Thr Ile Asp Leu Gly Ile Met
2300 2305 2310
cgt gac gtg ggc gtc ctc gcc gag cag ggc atc acg gac tac ctc 6984
Arg Asp Val Gly Val Leu Ala Glu Gln Gly Ile Thr Asp Tyr Leu
2315 2320 2325
cgg gag tgg gag gag ccc ttt gga ata cga gag cat gag ttt cat 7029
Arg Glu Trp Glu Glu Pro Phe Gly Ile Arg Glu His Glu Phe His
2330 2335 2340
gcc ctc atc aag tcg gcc atc atg tcg gcc acg gaa ccg ccg act 7074
Ala Leu Ile Lys Ser Ala Ile Met Ser Ala Thr Glu Pro Pro Thr
2345 2350 2355
gag cgc tcc gtg gtg cag atc cct acc ggc ttg gcc acc gcc cgt 7119
Glu Arg Ser Val Val Gln Ile Pro Thr Gly Leu Ala Thr Ala Arg
2360 2365 2370
tcc gcg caa gca gcc ggt ata agc aca cca ttc tac ttt gac gac 7164
Ser Ala Gln Ala Ala Gly Ile Ser Thr Pro Phe Tyr Phe Asp Asp
2375 2380 2385
gcc cgt ttc tcg atc ctc gcc cag aca cgc acc gcg gcc ggt gcg 7209
Ala Arg Phe Ser Ile Leu Ala Gln Thr Arg Thr Ala Ala Gly Ala
2390 2395 2400
tcg tcg gcg aac gct gat gat ggc aag gtt tcc atc cga aca cag 7254
Ser Ser Ala Asn Ala Asp Asp Gly Lys Val Ser Ile Arg Thr Gln
2405 2410 2415
ctc tct cag gcc cag tcg gtg gct gaa gca gcc tcc gcc gtt cag 7299
Leu Ser Gln Ala Gln Ser Val Ala Glu Ala Ala Ser Ala Val Gln
2420 2425 2430
acg gtg ctg ctt gag cgg gta gca aag acg ctc cag agc tct gta 7344
Thr Val Leu Leu Glu Arg Val Ala Lys Thr Leu Gln Ser Ser Val
2435 2440 2445
tcg gaa ata gat cca tct cag cca ctg cat tcg tat ggt gtc gat 7389
Ser Glu Ile Asp Pro Ser Gln Pro Leu His Ser Tyr Gly Val Asp
2450 2455 2460
tcc ctg gtc gcc gtg gaa acg gtc aag tgg atg ttt aaa acg cta 7434
Ser Leu Val Ala Val Glu Thr Val Lys Trp Met Phe Lys Thr Leu
2465 2470 2475
gag gct aag ctg acg gtg ttt gat gtt ctc tcc aac gtg tct att 7479
Glu Ala Lys Leu Thr Val Phe Asp Val Leu Ser Asn Val Ser Ile
2480 2485 2490
gtt gta tta tgc gag aag att gct acc acg tct act cta gta aag 7524
Val Val Leu Cys Glu Lys Ile Ala Thr Thr Ser Thr Leu Val Lys
2495 2500 2505
ttg agc tag 7533
Leu Ser
2510
<210> 8
<211> 2510
<212> PRT
<213> Metarhizium rileyi
<400> 8
Met Glu Ala Ser Ser Gln Ser Arg Asp Asp Ile Ala Val Ile Gly Leu
1 5 10 15
Ser Cys Arg Phe Pro Gly Glu Ala Asp Thr Ala Glu His Phe Trp Asp
20 25 30
Phe Ile Cys Asn Gly Arg Asn Ala Tyr Ser Glu Asn Pro Asp Arg Trp
35 40 45
Asn Pro Asp Ala Phe His Tyr Gly Glu Lys Lys Leu Asn Thr Ser Leu
50 55 60
Pro Arg Gly Gly His Phe Met Lys Gln Asp Val Ala Ala Phe Asp Ala
65 70 75 80
Asn Phe Phe Asn Leu Ser Lys Val Glu Ala Glu Ser Met Asp Pro Gln
85 90 95
Gln Arg Ile Val Met Glu Val Thr Tyr Glu Ser Met Glu Ser Ala Gly
100 105 110
Leu Arg Val Asp Arg Leu Ala Gly Ser Arg Thr Gly Val Phe Met Ala
115 120 125
Ser Phe Thr Ser Asp Tyr Arg Glu Met Leu Tyr Arg Asp Ala Glu Thr
130 135 140
Ala Pro Leu Tyr Thr Ala Thr Gly Thr Ser Asn Thr Ser Thr Ser Asn
145 150 155 160
Arg Val Ser Trp Phe Phe Asp Leu Arg Gly Pro Ser Phe Thr Val Asn
165 170 175
Thr Ala Cys Ser Ser Ser Leu Val Ala Cys His Leu Ala Cys Gln Ser
180 185 190
Leu Trp Asn Gly Glu Thr Glu Ser Ala Ile Val Gly Gly Thr Ser Leu
195 200 205
Leu Leu Asn Pro Asp Met Phe Leu Tyr Leu Ser Asn Gln Arg Phe Leu
210 215 220
Ala Pro Asp Gly Gln Cys Lys Ser Phe Asp Glu Ser Gly Asp Gly Tyr
225 230 235 240
Ala Arg Gly Asp Gly Ile Gly Val Val Ile Leu Lys Arg Val Ala Asp
245 250 255
Ala Val Arg Asp Gly Asp Pro Ile Arg Ala Val Ile Arg Gly Ser Gly
260 265 270
Cys Asn Gln Asp Gly His Thr Lys Gly Phe Thr Ile Pro Ser Val Glu
275 280 285
Ala Gln Ala Ser Leu Ile Glu Glu Thr Tyr Arg Lys Ala Gly Leu Ser
290 295 300
Leu Ala Glu Thr Arg Tyr Val Glu Ala His Gly Thr Gly Thr Gln Ala
305 310 315 320
Gly Asp Thr Cys Glu Met Glu Gly Ile Ala Arg Thr Phe Gly Gln His
325 330 335
Arg Gly Asp Ser Asp Asp Leu Leu Val Gly Ser Val Lys Ser Asn Ile
340 345 350
Gly His Leu Glu Ala Cys Ala Gly Leu Ala Ser Leu Ile Lys Cys Ile
355 360 365
Phe Ile Leu Glu Thr Gly Val Ile Pro Pro Thr Pro Ser Val Arg Val
370 375 380
Leu Asn Pro Lys Ile Arg Trp Glu Glu Trp His Leu Lys Val Pro Ser
385 390 395 400
Lys Gln Thr Pro Trp Pro Thr Asp Gly Leu Arg Arg Val Ser Thr Gln
405 410 415
Gly Phe Gly Tyr Gly Gly Thr Asn Ala His Leu Ile Leu Asp Asp Ala
420 425 430
Ala His Tyr Leu Glu Gly Arg Ser Leu Arg Gly His His Tyr Thr Arg
435 440 445
Thr His Pro Gln Ala Gln Arg Leu Leu Thr Ser Ala Ile His Gly Ala
450 455 460
Ser Pro Lys Glu Gln Leu Pro Arg Leu Phe Leu Phe Arg Ala Asn Asp
465 470 475 480
Arg Glu Gly Leu Gly Arg Val Arg Ala Ser Leu Ala Gln His Leu Asp
485 490 495
Gln Leu Leu Pro Ser Trp Ser Gln Asp Ser Ser Gly Arg Asp Ala Tyr
500 505 510
Leu Gln Asn Leu Ala Phe Thr Leu Ala Ser Arg Arg Ser Asn Leu Lys
515 520 525
Trp Gln Thr Tyr Ala Thr Ala Ser Thr Pro Asp Glu Leu Leu Gln Val
530 535 540
Leu Lys Thr Lys Gly Asp Ala Trp Ala Ser Pro Glu Ala Arg Leu Ala
545 550 555 560
Ala Ser Ser Pro Arg Leu Gly Phe Ile Phe Thr Gly Gln Gly Ala Gln
565 570 575
Trp Ala Arg Met Gly Val Glu Leu Met Gly Tyr Pro Val Phe Arg Gln
580 585 590
Ser Val Glu Glu Ser Glu His Phe Leu Arg Glu Thr Leu Gly Cys Pro
595 600 605
Trp Ser Ala Ile Asp Glu Leu Ala Lys Pro Gln Thr Thr Ser Arg Leu
610 615 620
Ser Glu Ala Ala Tyr Ser Gln Thr Leu Cys Thr Val Leu Gln Ile Ala
625 630 635 640
Ile Val Asp Leu Leu Gln Asp Trp Asn Val Ser Pro Thr Arg Val Ala
645 650 655
Gly His Ser Ser Gly Glu Ile Ala Ala Ala Tyr Cys Leu Gly Ala Leu
660 665 670
Thr Lys Gln Asp Ser Leu Arg Val Ala Tyr Tyr Arg Gly Ile Leu Ser
675 680 685
Ser Glu Met Gln Glu Thr His Lys Asp Gln Lys Gly Ala Met Met Ala
690 695 700
Ile Gly Ala Ser Pro Glu Thr Val Ala Gln Trp Leu Ala Gln Leu Thr
705 710 715 720
Arg Gly Lys Val Val Val Ala Cys Ile Asn Ser Pro Thr Ser Val Thr
725 730 735
Ala Ser Gly Asp Ala Ala Gly Ile Asp Glu Leu Leu Ser Ile Val Gln
740 745 750
Glu Ala Gly Val Phe Gly Arg Lys Leu Lys Val Asp Val Ala Tyr His
755 760 765
Ser His His Met Gln Ser Val Ser Ala Ala Tyr Ser Ala Leu Leu Lys
770 775 780
Asp Leu Lys Pro Leu Pro Ala His Glu Gly Arg Thr Met His Ser Ser
785 790 795 800
Val Leu Gly Gly Leu Ile Asp Thr Ala Glu Leu Gly Ala Ser Asn Trp
805 810 815
Val Arg Asn Leu Ile Ser Pro Val Arg Phe Ser Glu Ala Val Ser Ser
820 825 830
Leu Ile Leu Asp Gly Asp Lys Pro Ala Val Asp Met Leu Ile Glu Ile
835 840 845
Gly Pro His Ala Ala Leu Lys Gly Pro Val Gln Glu Thr Leu Glu Ala
850 855 860
Lys Gly Val Ser Ala Val Lys Tyr Thr Ser Val Val Ser Arg Gly Gln
865 870 875 880
Asn Ala Val Lys Thr Ala Leu Ala Cys Ala Gly Glu Leu Val Asn Ser
885 890 895
Ser Val Pro Val Ala Met Asp Arg Val Asn Leu Glu Ser Glu Leu Gln
900 905 910
Pro Ser Pro Leu Val Asp Leu Pro Ser Tyr Pro Trp Asn Arg Ser Thr
915 920 925
Arg Phe Trp Ala Glu Ser Arg Leu Ser Gln Glu Tyr Arg Leu Arg Lys
930 935 940
His Ala Arg Leu Pro Leu Leu Gly Ser Pro Cys Pro Thr Met Gly Ala
945 950 955 960
Arg Glu Arg Tyr Trp Arg Gly Met Val Arg Leu Asp Glu Glu Pro Trp
965 970 975
Ile Arg Asp His Glu Ile Gln Gly Ser Ile Leu Tyr Pro Gly Ala Gly
980 985 990
Phe Leu Ile Met Ala Ile Glu Ala Ala Ser Gln Gln Ala Asn Glu Gln
995 1000 1005
Arg Lys Val Ser Ala Phe Arg Leu Arg Asp Val His Leu Asp Ala
1010 1015 1020
Ala Leu Val Val Thr Asp Asn Ser Thr Ala Glu Ala Ile Leu Gln
1025 1030 1035
Leu Arg Pro His Leu Leu Ala Pro Gly Ser Ser Gln Ser Ser Trp
1040 1045 1050
Met Glu Phe Thr Val Asn Ser Ser Ile Asp Gly Gly Ala Leu Arg
1055 1060 1065
Gln Asn Cys Ser Gly Leu Ile Met Ile Glu Tyr Glu Ala Asp Ala
1070 1075 1080
Asp Ser Ala Met Ala Arg Glu Arg Ser Leu Glu Ser Asp Thr Val
1085 1090 1095
Cys Asp Leu Tyr Lys Lys Thr Tyr Ile Ser Cys Arg Gln Ser Val
1100 1105 1110
Asp Val Ala Lys Phe Tyr Ser Arg Leu Ala Ser Leu Gly Leu Thr
1115 1120 1125
Tyr Gly Pro Ala Phe Ala Asn Leu Thr Glu Ile Arg Arg Thr Gly
1130 1135 1140
Asn Gly Gln Cys Thr Gly Ala Val Arg Val Pro Ala Val Glu Ser
1145 1150 1155
Leu Val Pro Pro Ala Tyr Arg Ser His Pro His Val Ile His Pro
1160 1165 1170
Gly Thr Leu Asp Ala Ile Phe His Leu Ala Phe Ala Ala Leu Glu
1175 1180 1185
Asp Ser Leu Leu Pro Gly Pro Met Val Pro Thr Thr Ile Asp Gly
1190 1195 1200
Leu Val Val Ala Ala Asn Thr Pro Asn Glu Pro Gly Thr Leu Leu
1205 1210 1215
Arg Gly Val Ser Gln Ser Ser Pro His Gly Phe Arg Glu Leu Ile
1220 1225 1230
Ser Asp Ile Asp Val Leu Asp Asp Gln Ser Ser Arg Ala Val Val
1235 1240 1245
Gln Ile Lys Gly Phe Arg Cys Ala Asp Val Ser Gly Gly Ser Ala
1250 1255 1260
Asn Ser Ser Asp Ala Glu Pro Ala Glu Ala Arg Pro Ile Ser Phe
1265 1270 1275
Arg Leu Asn Trp Lys Pro Ala Ile Asp Leu Leu Ser Ala Glu Gln
1280 1285 1290
Leu Arg Lys Tyr Val Gly Arg Val Ala Lys Gln Ala Asp Ala Ser
1295 1300 1305
Ser His Leu Ile Arg Ala Thr Glu Leu Asn Asn Gln Val Gly Asn
1310 1315 1320
Leu Pro Glu Thr Ala Pro Ser Ala Ala Leu Asp Ala Val Thr Glu
1325 1330 1335
Lys Ala Thr Arg Trp Phe Ala Ala Lys Ser Ala Lys Leu Val Asp
1340 1345 1350
Gly Ala Ala Thr Ala Ser Ser Ala Ser Ser Ser Gly Gly Tyr Val
1355 1360 1365
Asp Ala Thr Arg Asp Ala Trp Ala Ala Val Arg Glu Gly Arg Ile
1370 1375 1380
Pro Ser Pro Glu Lys Gln Asp Arg Val Leu Arg Glu Val Glu Lys
1385 1390 1395
Asn Gly Ala Leu Ser Thr Leu Leu Gly Ala Leu Asp Ala Tyr Met
1400 1405 1410
Asp Leu Arg His His Ala Lys Pro Asn Leu Ser Val Leu Glu Leu
1415 1420 1425
Ser Leu Asp Ala Val Pro Tyr Ser Ile Phe Ala Ala Leu Pro Ser
1430 1435 1440
Arg Gln Ser Ile Leu Gln Thr Ala Gln Tyr Ala Ile Arg Val Ser
1445 1450 1455
Gln Asp Gly Val Gln Asp Arg Ile Arg Ser Gln Phe Gly Ser Gln
1460 1465 1470
Gly Ser Gly Ile Asp Val Ala Val Thr Asp Phe Thr Gln Lys Ile
1475 1480 1485
Asp Glu Thr Leu Gly Lys His Asp Val Ile Leu Ile Phe Asp Pro
1490 1495 1500
Gly Phe Leu His Ala Lys Leu Glu Val Val Leu Arg Asn Ala Arg
1505 1510 1515
Lys Leu Leu Asn Pro Gly Gly Lys Ile Ile Val Ala Glu Val Asn
1520 1525 1530
Glu Pro Gly Leu Tyr Leu Gly Thr Ala Leu Gly Cys Leu His Trp
1535 1540 1545
Thr Arg Asn Leu Asp Val Ser Gln Ser Ser Trp Thr Ser Cys Leu
1550 1555 1560
Ser Arg Phe Gly Leu Thr Pro Ala Leu Glu Leu Ile Asp Ala Asn
1565 1570 1575
Thr Asp Ala Thr Gly His Gly Lys Phe Gln Leu Arg Leu Thr Gly
1580 1585 1590
Ser Ala Ala Glu Ser Asn Gly Ser Ser Ser His Gln Pro Gln Gln
1595 1600 1605
Val Thr Leu Ile Glu Ser Ala Asp Ala Ser Glu Met Ala Gln Gly
1610 1615 1620
Val Ala Glu Ala Val Ala Gln Arg Leu Gln Glu Ala Ser Ile Pro
1625 1630 1635
Thr Lys Arg Val His Trp Gly Cys Asp Val Ser Gln Leu Lys Gly
1640 1645 1650
Gln Pro Cys Ile Val Leu Thr Asp Leu Gln Ser Ala Leu Leu Lys
1655 1660 1665
Asp Leu Ala Pro Glu Asp Leu Ala Ala Leu Gln Ser Leu Phe Leu
1670 1675 1680
His Ala Glu Ser Thr Leu Trp Val Thr Gly Pro Leu Gly Pro Asp
1685 1690 1695
Ala Ala Leu Ile Thr Gly Leu Ala Arg Ser Val Cys Asn Glu Ala
1700 1705 1710
Ala Gly Val Gln Ile Arg Thr Leu Glu Val Thr Asp Leu Pro Ile
1715 1720 1725
Ser Ala Ala Ala Gly Tyr Ala Asp Met Val Ala Arg Val Phe Arg
1730 1735 1740
Tyr Arg Gly Ser Asp Thr Glu Phe Gln Trp His Ser Asp Ala Leu
1745 1750 1755
Leu Val Ser Arg Leu Thr Glu Asp Glu Asp Arg Asn Glu Glu Ile
1760 1765 1770
Ala Gln Leu Leu Gly Gln Gly Glu Thr Ala Ala Ala Glu Thr Thr
1775 1780 1785
Leu Gln Glu Thr Pro Glu Gly Leu Lys Leu Cys Val Arg Gln Ile
1790 1795 1800
Gly Met Leu Asp Ser Ala Cys Tyr Glu Pro Asp Pro Leu Ala Leu
1805 1810 1815
Glu Pro Leu Glu Ala Gly Glu Val Glu Val Asp Val Lys Ala Ser
1820 1825 1830
Gly Val Asn Phe Arg Asp Val Met Val Ala Leu Gly Gln Ile Pro
1835 1840 1845
Asp Arg Ala Phe Gly Phe Glu Gly Ala Gly Val Val Arg Arg Val
1850 1855 1860
His Ala Glu Glu Ser Arg Leu Arg Pro Gly Asp Arg Val Val Phe
1865 1870 1875
Leu Ala His Gly Ala His Arg Thr Val His Arg Val Arg Ala Asp
1880 1885 1890
Tyr Ala Met Pro Met Pro Asp Thr Met Ser Phe Glu Glu Gly Ala
1895 1900 1905
Ala Val Leu Leu Val His Thr Thr Ala Trp Tyr Ala Leu Val Lys
1910 1915 1920
Ser Ala Arg Ala Thr Ala Gly Gln Ser Val Leu Val His Ala Ala
1925 1930 1935
Ala Gly Gly Val Gly Gln Ala Val Leu Met Leu Ala Arg His Leu
1940 1945 1950
Gly Leu Glu Val Phe Ala Thr Val Gly Ser Glu Glu Lys Arg Lys
1955 1960 1965
Leu Val His Glu Thr Tyr Gly Ile Pro His Asp His Met Phe Asn
1970 1975 1980
Ser Arg Asp Ser Ser Phe Ala Met Gly Val Lys Arg Met Thr Asn
1985 1990 1995
Gly Arg Gly Val Asp Ile Val Val Asn Ser Leu Ala Gly Glu Ala
2000 2005 2010
Leu Arg Gln Thr Trp His Cys Leu Ala Pro Phe Gly Thr Phe Val
2015 2020 2025
Glu Leu Gly Met Lys Asp Ile Leu Asp Asn Ala Arg Leu Asp Met
2030 2035 2040
Lys Pro Phe Leu Gln Asp Ala Thr Phe Val Phe Phe Asn Leu Asn
2045 2050 2055
Arg Val Gln Lys Glu Arg Pro Asp Leu Met Lys Glu Ala Leu Arg
2060 2065 2070
Glu Thr Met Ala Leu Val Ser Ser Gly Ala Leu Lys Pro Ala Thr
2075 2080 2085
Pro Leu Thr Ala Tyr Ala Ala Ser Gln Val Glu Thr Ala Phe Arg
2090 2095 2100
Lys Ile Gln Thr Gly Gln His Leu Gly Lys Leu Val Leu Thr Phe
2105 2110 2115
Gln Thr Gly Asp Val Leu Arg Val Ile Arg Pro Asp Leu Ser Leu
2120 2125 2130
Gly Asp Ser Gly Ala Tyr Leu Leu Val Gly Gly Leu Gly Gly Leu
2135 2140 2145
Gly Arg Ser Leu Ala Arg Leu Leu Val His Leu Gly Ala Arg Arg
2150 2155 2160
Leu Cys Phe Leu Ser Arg Ser Gly Ala Lys Ser Ser Glu Ala Gln
2165 2170 2175
Ala Leu Val Gln Glu Leu Glu Leu Gln His Arg Val Arg Val Leu
2180 2185 2190
Val Cys Gln Gly Asp Val Ser Asp Ser Asp Thr Val Ala Arg Val
2195 2200 2205
Val Gln Gln Cys Thr Thr Thr Leu Gly Pro Ile Arg Gly Val Val
2210 2215 2220
Gln Cys Ala Met Ile Leu Arg Asp Gly Leu Phe Glu Arg Met Thr
2225 2230 2235
His Glu Gln Trp Thr Glu Ser Thr Arg Pro Lys Val Gln Gly Thr
2240 2245 2250
Trp Asn Leu His Glu Gln Ile Pro Ser Ala Asp Phe Phe Ile Thr
2255 2260 2265
Leu Ser Ser Phe Ala Gly Val Phe Gly Ser Arg Gly Gln Ser Asn
2270 2275 2280
Tyr Ala Ala Ala Gly Ala Tyr Glu Asp Ala Leu Ala His Phe Arg
2285 2290 2295
Thr Ser Leu Gly Gln Arg Ala Ile Thr Ile Asp Leu Gly Ile Met
2300 2305 2310
Arg Asp Val Gly Val Leu Ala Glu Gln Gly Ile Thr Asp Tyr Leu
2315 2320 2325
Arg Glu Trp Glu Glu Pro Phe Gly Ile Arg Glu His Glu Phe His
2330 2335 2340
Ala Leu Ile Lys Ser Ala Ile Met Ser Ala Thr Glu Pro Pro Thr
2345 2350 2355
Glu Arg Ser Val Val Gln Ile Pro Thr Gly Leu Ala Thr Ala Arg
2360 2365 2370
Ser Ala Gln Ala Ala Gly Ile Ser Thr Pro Phe Tyr Phe Asp Asp
2375 2380 2385
Ala Arg Phe Ser Ile Leu Ala Gln Thr Arg Thr Ala Ala Gly Ala
2390 2395 2400
Ser Ser Ala Asn Ala Asp Asp Gly Lys Val Ser Ile Arg Thr Gln
2405 2410 2415
Leu Ser Gln Ala Gln Ser Val Ala Glu Ala Ala Ser Ala Val Gln
2420 2425 2430
Thr Val Leu Leu Glu Arg Val Ala Lys Thr Leu Gln Ser Ser Val
2435 2440 2445
Ser Glu Ile Asp Pro Ser Gln Pro Leu His Ser Tyr Gly Val Asp
2450 2455 2460
Ser Leu Val Ala Val Glu Thr Val Lys Trp Met Phe Lys Thr Leu
2465 2470 2475
Glu Ala Lys Leu Thr Val Phe Asp Val Leu Ser Asn Val Ser Ile
2480 2485 2490
Val Val Leu Cys Glu Lys Ile Ala Thr Thr Ser Thr Leu Val Lys
2495 2500 2505
Leu Ser
2510
<210> 9
<211> 5166
<212> DNA
<213> Metarhizium rileyi
<220>
<221> CDS
<222> (1)..(5166)
<400> 9
atg aaa atc cgg gct aca aac ttc ctc ctt ttt gga gat cag act gta 48
Met Lys Ile Arg Ala Thr Asn Phe Leu Leu Phe Gly Asp Gln Thr Val
1 5 10 15
gag aag ctt cca gcc att cgg cag ctg gta ggg cac gct gcg tcc tca 96
Glu Lys Leu Pro Ala Ile Arg Gln Leu Val Gly His Ala Ala Ser Ser
20 25 30
gct ctg ctt cag agg ttt ctg cgt caa gtt tgc gat gcg gtg cag ctc 144
Ala Leu Leu Gln Arg Phe Leu Arg Gln Val Cys Asp Ala Val Gln Leu
35 40 45
gaa gtc gcc aag ttg cct atg cac tcg gag caa cgc agc aac att gac 192
Glu Val Ala Lys Leu Pro Met His Ser Glu Gln Arg Ser Asn Ile Asp
50 55 60
aag ttt gac agc atc att cga cta gcc gaa aac aat gcc cgg ctg gac 240
Lys Phe Asp Ser Ile Ile Arg Leu Ala Glu Asn Asn Ala Arg Leu Asp
65 70 75 80
gag ccc aat gag atc gtt gcc acc gtc ttg atg aat atc gcc cgg ata 288
Glu Pro Asn Glu Ile Val Ala Thr Val Leu Met Asn Ile Ala Arg Ile
85 90 95
ggc gag ctc att ctg tat gca gaa gaa gac cct acc gtc ctc gtc tcc 336
Gly Glu Leu Ile Leu Tyr Ala Glu Glu Asp Pro Thr Val Leu Val Ser
100 105 110
aaa ggc aac cgc aac tgt att ctg gga ttc tgc act ggc gag gtg gct 384
Lys Gly Asn Arg Asn Cys Ile Leu Gly Phe Cys Thr Gly Glu Val Ala
115 120 125
gct gcc gcg gcc act atc gcg cag gac tcc aat gag ctg gtt gag ctg 432
Ala Ala Ala Ala Thr Ile Ala Gln Asp Ser Asn Glu Leu Val Glu Leu
130 135 140
ggc gtg gag atg act cac atc atc ttt cgc atg gcc cga gag cta aat 480
Gly Val Glu Met Thr His Ile Ile Phe Arg Met Ala Arg Glu Leu Asn
145 150 155 160
cac cgg tct ctc atg gtt gac cgt acc aac ggc ccc tgg gca aag aca 528
His Arg Ser Leu Met Val Asp Arg Thr Asn Gly Pro Trp Ala Lys Thr
165 170 175
atc ttg ggc att tca gtt gag cgc gtc cag gag att cta cat gag ttc 576
Ile Leu Gly Ile Ser Val Glu Arg Val Gln Glu Ile Leu His Glu Phe
180 185 190
cac gag agc gag tca att cct cgt gtc cga cga gtc tgc gtc ggg ttc 624
His Glu Ser Glu Ser Ile Pro Arg Val Arg Arg Val Cys Val Gly Phe
195 200 205
atc gca gaa ggc tgg ttg acg ctc ttc ggt ccc ccg aca acc ctg caa 672
Ile Ala Glu Gly Trp Leu Thr Leu Phe Gly Pro Pro Thr Thr Leu Gln
210 215 220
cga ctt ttc gaa tgg tca gta gag ctg gaa gac gct cca cag att gcc 720
Arg Leu Phe Glu Trp Ser Val Glu Leu Glu Asp Ala Pro Gln Ile Ala
225 230 235 240
aca gac gct cgt gga ggt gtg cac atg aag acg atg ccc gac gtt gac 768
Thr Asp Ala Arg Gly Gly Val His Met Lys Thr Met Pro Asp Val Asp
245 250 255
gtg gac tgg att ctt ggc tcg tcc gta tgg ctc gac cga acc ccc gtt 816
Val Asp Trp Ile Leu Gly Ser Ser Val Trp Leu Asp Arg Thr Pro Val
260 265 270
cac aca gct acc atc ttc tct ccc tat acg tgt cag cct cgg cag caa 864
His Thr Ala Thr Ile Phe Ser Pro Tyr Thr Cys Gln Pro Arg Gln Gln
275 280 285
cag act ctg cga ggg ctt ctg agg gaa atc att acc gac gtt gcg cag 912
Gln Thr Leu Arg Gly Leu Leu Arg Glu Ile Ile Thr Asp Val Ala Gln
290 295 300
cgg acg ttg tat ttg gcc aag gca atg aac gcg gct ctt gag ttt acc 960
Arg Thr Leu Tyr Leu Ala Lys Ala Met Asn Ala Ala Leu Glu Phe Thr
305 310 315 320
aag gca gac gag ctg cga gtc gtc atg ccc ggt cac acg agc cac gac 1008
Lys Ala Asp Glu Leu Arg Val Val Met Pro Gly His Thr Ser His Asp
325 330 335
gtc tat ttc ctc aag tcg ctt cag aaa cgt ggc ata gag tac tca gtc 1056
Val Tyr Phe Leu Lys Ser Leu Gln Lys Arg Gly Ile Glu Tyr Ser Val
340 345 350
atg tca cat ggc gat agc cca ccg tca gct ccg ggt agg caa ggt tca 1104
Met Ser His Gly Asp Ser Pro Pro Ser Ala Pro Gly Arg Gln Gly Ser
355 360 365
ggc ctt gtt gct gtc gtc ggc atg tcc ggc agg ttc ccg gga agc ggc 1152
Gly Leu Val Ala Val Val Gly Met Ser Gly Arg Phe Pro Gly Ser Gly
370 375 380
gac atc aat gcc ttc tgg gag ggt ctt ttg gag ggg aaa aga tat att 1200
Asp Ile Asn Ala Phe Trp Glu Gly Leu Leu Glu Gly Lys Arg Tyr Ile
385 390 395 400
caa gag att cca aat acc cga ttc gat ctg gag aag tgg tat gac gcg 1248
Gln Glu Ile Pro Asn Thr Arg Phe Asp Leu Glu Lys Trp Tyr Asp Ala
405 410 415
acg ggc aaa gta aag aac tcg aca att gcg cga acg gga gcc ttc ctt 1296
Thr Gly Lys Val Lys Asn Ser Thr Ile Ala Arg Thr Gly Ala Phe Leu
420 425 430
gat aag cca ggt atg ttc gac aac cgc ctg ttc gac atg tcg cca agg 1344
Asp Lys Pro Gly Met Phe Asp Asn Arg Leu Phe Asp Met Ser Pro Arg
435 440 445
gag gcc atg cag acg gac gtc cag cac cga cta ctc atg aca acc ggc 1392
Glu Ala Met Gln Thr Asp Val Gln His Arg Leu Leu Met Thr Thr Gly
450 455 460
tac gag gca ctg gag atg tcg gga tac tcc ccc gac ggg act ccc tca 1440
Tyr Glu Ala Leu Glu Met Ser Gly Tyr Ser Pro Asp Gly Thr Pro Ser
465 470 475 480
act gac acg agt cgc atc gca tca tac ttt gga cag acg tca gac gat 1488
Thr Asp Thr Ser Arg Ile Ala Ser Tyr Phe Gly Gln Thr Ser Asp Asp
485 490 495
tgg cgg gaa gtg gtg gtc cat cag ggg gtc gac atc tac ttc gcc acg 1536
Trp Arg Glu Val Val Val His Gln Gly Val Asp Ile Tyr Phe Ala Thr
500 505 510
gga agt tgc cgt gcc ttc ggg cca ggc aga ctg cat cac cat ttc aaa 1584
Gly Ser Cys Arg Ala Phe Gly Pro Gly Arg Leu His His His Phe Lys
515 520 525
tgg gga ggc ccg tct tac agt gtc gac tcg gca tgc tcc tcg agc atc 1632
Trp Gly Gly Pro Ser Tyr Ser Val Asp Ser Ala Cys Ser Ser Ser Ile
530 535 540
gca gcc gtc ggt ctg gca tgc tca gcg ctc ctc ggg cgc gaa tgc gac 1680
Ala Ala Val Gly Leu Ala Cys Ser Ala Leu Leu Gly Arg Glu Cys Asp
545 550 555 560
atg gcc ctg gct ggc gga gga tct cta ctt ctc tcc ccg tcg ccc ttc 1728
Met Ala Leu Ala Gly Gly Gly Ser Leu Leu Leu Ser Pro Ser Pro Phe
565 570 575
tca ggc ttg agc cgt ggt ggt ttc tta tcc gcc caa gga ggg tgc cag 1776
Ser Gly Leu Ser Arg Gly Gly Phe Leu Ser Ala Gln Gly Gly Cys Gln
580 585 590
aca ttc cac gac aac gcc gat ggc tac gtc cga gga gag ggc gtc gga 1824
Thr Phe His Asp Asn Ala Asp Gly Tyr Val Arg Gly Glu Gly Val Gly
595 600 605
gtg gtt gtt ctc aag cgc tta gaa gat gcg ctg gac gac cag gac aac 1872
Val Val Val Leu Lys Arg Leu Glu Asp Ala Leu Asp Asp Gln Asp Asn
610 615 620
ata ctc ggc gtt gtc cgc ggg tcc gga cgc aac tac agc agc gat gcc 1920
Ile Leu Gly Val Val Arg Gly Ser Gly Arg Asn Tyr Ser Ser Asp Ala
625 630 635 640
tct tcg atg atg cac ccc tcg gca aac gcc cag aaa cag ctg tac cgt 1968
Ser Ser Met Met His Pro Ser Ala Asn Ala Gln Lys Gln Leu Tyr Arg
645 650 655
gat gtt ctg gag cag agt ggt gta gag gcc aac agc atc tcc tac gtg 2016
Asp Val Leu Glu Gln Ser Gly Val Glu Ala Asn Ser Ile Ser Tyr Val
660 665 670
gaa atg cac ggg aca ggc acg cag gcc ggg gac ttt atg gaa atg tct 2064
Glu Met His Gly Thr Gly Thr Gln Ala Gly Asp Phe Met Glu Met Ser
675 680 685
tcc gtc ctg tca acg ttt gcg gag aag cga ggc gcg gat aat ccg ctc 2112
Ser Val Leu Ser Thr Phe Ala Glu Lys Arg Gly Ala Asp Asn Pro Leu
690 695 700
att gta gga gcc ctc aaa gca agt att ggc cac gga gaa gca gcg gcc 2160
Ile Val Gly Ala Leu Lys Ala Ser Ile Gly His Gly Glu Ala Ala Ala
705 710 715 720
ggc gtc tgc gct ctc atc aaa acc ctg atg atg ctt cag tgt cga cgg 2208
Gly Val Cys Ala Leu Ile Lys Thr Leu Met Met Leu Gln Cys Arg Arg
725 730 735
att cca cct caa ccc gac ctt cct ggg cct atc aac cat cga ttc cct 2256
Ile Pro Pro Gln Pro Asp Leu Pro Gly Pro Ile Asn His Arg Phe Pro
740 745 750
gat ctt gca gcc cgc aat gtg tac att gcg gcc cgc aac ttg aag ttg 2304
Asp Leu Ala Ala Arg Asn Val Tyr Ile Ala Ala Arg Asn Leu Lys Leu
755 760 765
gag gcc agc ccg atg gcc aaa ggg gtt ctt cgg atg ttt ctg aac agc 2352
Glu Ala Ser Pro Met Ala Lys Gly Val Leu Arg Met Phe Leu Asn Ser
770 775 780
ttc gat gct tcg ggt gga aat tcg tgt ttg ctg ctt gaa gaa gct ccg 2400
Phe Asp Ala Ser Gly Gly Asn Ser Cys Leu Leu Leu Glu Glu Ala Pro
785 790 795 800
ccg cgg gcc gtc aag gac gaa gac gct cga agt cat cat gtt gtg acc 2448
Pro Arg Ala Val Lys Asp Glu Asp Ala Arg Ser His His Val Val Thr
805 810 815
ctt tca gcc cgc tct cag aag tca ctc atc gga atc aaa gag aag tac 2496
Leu Ser Ala Arg Ser Gln Lys Ser Leu Ile Gly Ile Lys Glu Lys Tyr
820 825 830
ctg gcc cat ctg agt caa aat ccg ggc atc aag ctg gcg gac ctg gca 2544
Leu Ala His Leu Ser Gln Asn Pro Gly Ile Lys Leu Ala Asp Leu Ala
835 840 845
tac tcg aca act gct cgg cga atg cat gga ctg ttg cgg tat gcc atc 2592
Tyr Ser Thr Thr Ala Arg Arg Met His Gly Leu Leu Arg Tyr Ala Ile
850 855 860
gcc gca tcc tcc gtt gac gag gtc atg aac tct ctg gag acg gat ctc 2640
Ala Ala Ser Ser Val Asp Glu Val Met Asn Ser Leu Glu Thr Asp Leu
865 870 875 880
gcc cag ggg aaa aca cct cgt cag cct ccg gta gcg ccg agt ata gtt 2688
Ala Gln Gly Lys Thr Pro Arg Gln Pro Pro Val Ala Pro Ser Ile Val
885 890 895
ttc att ttt aca ggc cag ggc gca cat tac ctc ggt atg ggc tcg gaa 2736
Phe Ile Phe Thr Gly Gln Gly Ala His Tyr Leu Gly Met Gly Ser Glu
900 905 910
ctg tgg aag aca tct gcc atg ttc cgc aac acg ctt caa aag tac cag 2784
Leu Trp Lys Thr Ser Ala Met Phe Arg Asn Thr Leu Gln Lys Tyr Gln
915 920 925
acg atg gcc agt gcc gaa ggc ctc ccc tac ttc ctc gat ctc atc gta 2832
Thr Met Ala Ser Ala Glu Gly Leu Pro Tyr Phe Leu Asp Leu Ile Val
930 935 940
gga aac agc acg tcc acg caa cag tca ggg ccg gat act gta cag gta 2880
Gly Asn Ser Thr Ser Thr Gln Gln Ser Gly Pro Asp Thr Val Gln Val
945 950 955 960
cag ctg gcc atg gtc agc ttg gaa cta gcc ctc gct gag ctt tgg cgt 2928
Gln Leu Ala Met Val Ser Leu Glu Leu Ala Leu Ala Glu Leu Trp Arg
965 970 975
tcc tgg ggc atc cag cct gcc atg gtc ttg ggc cac agc cta ggc gaa 2976
Ser Trp Gly Ile Gln Pro Ala Met Val Leu Gly His Ser Leu Gly Glu
980 985 990
tac gcc gcc ctg tgc gtg gcc gga gtg ctc tcg gtg agc gat gct ctg 3024
Tyr Ala Ala Leu Cys Val Ala Gly Val Leu Ser Val Ser Asp Ala Leu
995 1000 1005
tac ctc gtg tac agg cga gct caa atc atg act gag gcc ctg act 3069
Tyr Leu Val Tyr Arg Arg Ala Gln Ile Met Thr Glu Ala Leu Thr
1010 1015 1020
gct agc gag tac ggc atg ttg gcc gtc aat cta agc gtc tgt gac 3114
Ala Ser Glu Tyr Gly Met Leu Ala Val Asn Leu Ser Val Cys Asp
1025 1030 1035
acg cgg gag gtg ctg tcg tct ggc cag cat gcc tca tgt gcc gtg 3159
Thr Arg Glu Val Leu Ser Ser Gly Gln His Ala Ser Cys Ala Val
1040 1045 1050
gct tgc atc aat gcc ccg aag atg acg gtg gtg agc ggt ccg ctg 3204
Ala Cys Ile Asn Ala Pro Lys Met Thr Val Val Ser Gly Pro Leu
1055 1060 1065
ccg aag ctt gaa gag ctc cag aat caa ctc aag tcg gac ggc act 3249
Pro Lys Leu Glu Glu Leu Gln Asn Gln Leu Lys Ser Asp Gly Thr
1070 1075 1080
cga tgt acg cct ctt tct gtt ccc tac ggc ttt cac tcg agt caa 3294
Arg Cys Thr Pro Leu Ser Val Pro Tyr Gly Phe His Ser Ser Gln
1085 1090 1095
ctt gac ccc atc ctg gac cag ttc gaa gct gcc tgc caa ggc gtc 3339
Leu Asp Pro Ile Leu Asp Gln Phe Glu Ala Ala Cys Gln Gly Val
1100 1105 1110
acc ttc tct gca ccg aaa gtc ccc gtg gtc tct acg ctc ttg gct 3384
Thr Phe Ser Ala Pro Lys Val Pro Val Val Ser Thr Leu Leu Ala
1115 1120 1125
act gtg gtc cga gaa gaa ggg act ttc tct ccg ggg tat ctg gcc 3429
Thr Val Val Arg Glu Glu Gly Thr Phe Ser Pro Gly Tyr Leu Ala
1130 1135 1140
cgg cag gcc cgc gaa cca gtc gac ttt gtc gga gct ttg ggc atg 3474
Arg Gln Ala Arg Glu Pro Val Asp Phe Val Gly Ala Leu Gly Met
1145 1150 1155
gtg cag gag cag agt ctt gcc tcc ctg gtg ttt ctc gaa gtt gga 3519
Val Gln Glu Gln Ser Leu Ala Ser Leu Val Phe Leu Glu Val Gly
1160 1165 1170
cct gaa cct gta tgt tcc ggt ctt gtg aac gcc acg cta agt gcc 3564
Pro Glu Pro Val Cys Ser Gly Leu Val Asn Ala Thr Leu Ser Ala
1175 1180 1185
ggg gag acc aag gca cgc tgc ttt gct tcg atg cat cgg ggt cat 3609
Gly Glu Thr Lys Ala Arg Cys Phe Ala Ser Met His Arg Gly His
1190 1195 1200
gaa aac tgg gcg tcg ata tca tca agc ttg aga gat ctc tac atg 3654
Glu Asn Trp Ala Ser Ile Ser Ser Ser Leu Arg Asp Leu Tyr Met
1205 1210 1215
gcg ggt gct ccc atc gac tgg cca gcc ttc cac cac gac ttc aag 3699
Ala Gly Ala Pro Ile Asp Trp Pro Ala Phe His His Asp Phe Lys
1220 1225 1230
tcg tcc gtc tcc ctt ctt gac ctt ccc aag tac tct ttc gac gag 3744
Ser Ser Val Ser Leu Leu Asp Leu Pro Lys Tyr Ser Phe Asp Glu
1235 1240 1245
aag gag ttc tgg gcg tca ttc cct aac aga gac atg cag ggc acg 3789
Lys Glu Phe Trp Ala Ser Phe Pro Asn Arg Asp Met Gln Gly Thr
1250 1255 1260
gga gag gtc gag ccc aag caa agc caa ccg ccc gtc atc gtt ccg 3834
Gly Glu Val Glu Pro Lys Gln Ser Gln Pro Pro Val Ile Val Pro
1265 1270 1275
tct gtg caa gga tac tgc acg acg act ctg cag cga atc gta aaa 3879
Ser Val Gln Gly Tyr Cys Thr Thr Thr Leu Gln Arg Ile Val Lys
1280 1285 1290
gaa acc gac cag ccg gac ggg cta tcg gtc acg ttt aca tcc gac 3924
Glu Thr Asp Gln Pro Asp Gly Leu Ser Val Thr Phe Thr Ser Asp
1295 1300 1305
ctg gca gaa cag cac cta cgt gcg gct gta cga ggt cat gcc gtg 3969
Leu Ala Glu Gln His Leu Arg Ala Ala Val Arg Gly His Ala Val
1310 1315 1320
gcc gac ata gaa atc tgt tcc agc agc ctg ctc ctg gac atg gca 4014
Ala Asp Ile Glu Ile Cys Ser Ser Ser Leu Leu Leu Asp Met Ala
1325 1330 1335
ctt tct gca gcc caa tat gcc tat ctg aaa cat tcc ccc ggc cag 4059
Leu Ser Ala Ala Gln Tyr Ala Tyr Leu Lys His Ser Pro Gly Gln
1340 1345 1350
aag atg cct gtt cca ttg acc gtc cgc aac tgc ttc ttc cac cgg 4104
Lys Met Pro Val Pro Leu Thr Val Arg Asn Cys Phe Phe His Arg
1355 1360 1365
gct gtc gtc ttg acc gag gaa gcg cag acc gtg gaa gtc acc gtc 4149
Ala Val Val Leu Thr Glu Glu Ala Gln Thr Val Glu Val Thr Val
1370 1375 1380
aca ttc agg tcc tcg acc aag act gcg gat att cag tac tac tgc 4194
Thr Phe Arg Ser Ser Thr Lys Thr Ala Asp Ile Gln Tyr Tyr Cys
1385 1390 1395
cga act tcc gac gag tac tac gag ttc gga tcc tgc cag gtg acg 4239
Arg Thr Ser Asp Glu Tyr Tyr Glu Phe Gly Ser Cys Gln Val Thr
1400 1405 1410
ttg gaa gca cca aga aaa cca gac cag gct gga ttt ctg gtt cgg 4284
Leu Glu Ala Pro Arg Lys Pro Asp Gln Ala Gly Phe Leu Val Arg
1415 1420 1425
tcc cgt att gct gca ctc aag gag tcg gca agt cac cgg cta ggc 4329
Ser Arg Ile Ala Ala Leu Lys Glu Ser Ala Ser His Arg Leu Gly
1430 1435 1440
aag cac gca gtt tac cgg ttg ttt gac aac att gtg cgg tat tca 4374
Lys His Ala Val Tyr Arg Leu Phe Asp Asn Ile Val Arg Tyr Ser
1445 1450 1455
gag caa tac cag ggg cta aag aac gtc cat ctt tcg gaa gac atg 4419
Glu Gln Tyr Gln Gly Leu Lys Asn Val His Leu Ser Glu Asp Met
1460 1465 1470
cgc gac gct gtg gcg gag atc aac atg acg caa gtc cca gcg gca 4464
Arg Asp Ala Val Ala Glu Ile Asn Met Thr Gln Val Pro Ala Ala
1475 1480 1485
ggc ggt cat tat ctt cac cac ccg ttt ttg atg gac tcg att gtt 4509
Gly Gly His Tyr Leu His His Pro Phe Leu Met Asp Ser Ile Val
1490 1495 1500
cat ctt tca gga ttc ttg gtg aac aac ggc ctc cgt tac tcc agc 4554
His Leu Ser Gly Phe Leu Val Asn Asn Gly Leu Arg Tyr Ser Ser
1505 1510 1515
gaa tgg gct tgt ctt tcc acc ggt ttc gag gag ctt cac ctg ctc 4599
Glu Trp Ala Cys Leu Ser Thr Gly Phe Glu Glu Leu His Leu Leu
1520 1525 1530
aag ccg cta gat cct gcc act gta tac acc agc tat act ttt atg 4644
Lys Pro Leu Asp Pro Ala Thr Val Tyr Thr Ser Tyr Thr Phe Met
1535 1540 1545
gaa gat tcc ccg acg acg agc aat gtc att ggc gat gtg tac gtc 4689
Glu Asp Ser Pro Thr Thr Ser Asn Val Ile Gly Asp Val Tyr Val
1550 1555 1560
tac gat ggg gca gag tta gtc tcc gtg gtg aca gga ttg cag ttt 4734
Tyr Asp Gly Ala Glu Leu Val Ser Val Val Thr Gly Leu Gln Phe
1565 1570 1575
caa aag atg aag agg aca gca ctc act cat ctg ctg agt ccc gcg 4779
Gln Lys Met Lys Arg Thr Ala Leu Thr His Leu Leu Ser Pro Ala
1580 1585 1590
acg gcg cgc aac acg gcg gcc aag acg atc cct cat aga ccg acg 4824
Thr Ala Arg Asn Thr Ala Ala Lys Thr Ile Pro His Arg Pro Thr
1595 1600 1605
acg gcc ccg gcg aag gcg ctt tct gac caa cag ccc atc act acc 4869
Thr Ala Pro Ala Lys Ala Leu Ser Asp Gln Gln Pro Ile Thr Thr
1610 1615 1620
att caa gcc gaa gca gct gtc ccg cag gct agt act cct tcg acc 4914
Ile Gln Ala Glu Ala Ala Val Pro Gln Ala Ser Thr Pro Ser Thr
1625 1630 1635
gcg gca agt gtc aat ggt ggt gag ggg gag aag ttt gac ctg gtg 4959
Ala Ala Ser Val Asn Gly Gly Glu Gly Glu Lys Phe Asp Leu Val
1640 1645 1650
gaa acg ctg ttt tcc atc atc gca cgc gag gtc ggc gtc gac tcg 5004
Glu Thr Leu Phe Ser Ile Ile Ala Arg Glu Val Gly Val Asp Ser
1655 1660 1665
agc gat ttg aag ggc gac gtg aac ctg gcg aat ctg ggc ata gac 5049
Ser Asp Leu Lys Gly Asp Val Asn Leu Ala Asn Leu Gly Ile Asp
1670 1675 1680
tcc ctc atg gcc atc aca atc atc tcg gtc atg cag cag gaa aca 5094
Ser Leu Met Ala Ile Thr Ile Ile Ser Val Met Gln Gln Glu Thr
1685 1690 1695
gga att gag ttg ccc ggg acg ttt ttc ctc gac aat tcc acg acg 5139
Gly Ile Glu Leu Pro Gly Thr Phe Phe Leu Asp Asn Ser Thr Thr
1700 1705 1710
acg gca gta atc gcg gca gtg gga tag 5166
Thr Ala Val Ile Ala Ala Val Gly
1715 1720
<210> 10
<211> 1721
<212> PRT
<213> Metarhizium rileyi
<400> 10
Met Lys Ile Arg Ala Thr Asn Phe Leu Leu Phe Gly Asp Gln Thr Val
1 5 10 15
Glu Lys Leu Pro Ala Ile Arg Gln Leu Val Gly His Ala Ala Ser Ser
20 25 30
Ala Leu Leu Gln Arg Phe Leu Arg Gln Val Cys Asp Ala Val Gln Leu
35 40 45
Glu Val Ala Lys Leu Pro Met His Ser Glu Gln Arg Ser Asn Ile Asp
50 55 60
Lys Phe Asp Ser Ile Ile Arg Leu Ala Glu Asn Asn Ala Arg Leu Asp
65 70 75 80
Glu Pro Asn Glu Ile Val Ala Thr Val Leu Met Asn Ile Ala Arg Ile
85 90 95
Gly Glu Leu Ile Leu Tyr Ala Glu Glu Asp Pro Thr Val Leu Val Ser
100 105 110
Lys Gly Asn Arg Asn Cys Ile Leu Gly Phe Cys Thr Gly Glu Val Ala
115 120 125
Ala Ala Ala Ala Thr Ile Ala Gln Asp Ser Asn Glu Leu Val Glu Leu
130 135 140
Gly Val Glu Met Thr His Ile Ile Phe Arg Met Ala Arg Glu Leu Asn
145 150 155 160
His Arg Ser Leu Met Val Asp Arg Thr Asn Gly Pro Trp Ala Lys Thr
165 170 175
Ile Leu Gly Ile Ser Val Glu Arg Val Gln Glu Ile Leu His Glu Phe
180 185 190
His Glu Ser Glu Ser Ile Pro Arg Val Arg Arg Val Cys Val Gly Phe
195 200 205
Ile Ala Glu Gly Trp Leu Thr Leu Phe Gly Pro Pro Thr Thr Leu Gln
210 215 220
Arg Leu Phe Glu Trp Ser Val Glu Leu Glu Asp Ala Pro Gln Ile Ala
225 230 235 240
Thr Asp Ala Arg Gly Gly Val His Met Lys Thr Met Pro Asp Val Asp
245 250 255
Val Asp Trp Ile Leu Gly Ser Ser Val Trp Leu Asp Arg Thr Pro Val
260 265 270
His Thr Ala Thr Ile Phe Ser Pro Tyr Thr Cys Gln Pro Arg Gln Gln
275 280 285
Gln Thr Leu Arg Gly Leu Leu Arg Glu Ile Ile Thr Asp Val Ala Gln
290 295 300
Arg Thr Leu Tyr Leu Ala Lys Ala Met Asn Ala Ala Leu Glu Phe Thr
305 310 315 320
Lys Ala Asp Glu Leu Arg Val Val Met Pro Gly His Thr Ser His Asp
325 330 335
Val Tyr Phe Leu Lys Ser Leu Gln Lys Arg Gly Ile Glu Tyr Ser Val
340 345 350
Met Ser His Gly Asp Ser Pro Pro Ser Ala Pro Gly Arg Gln Gly Ser
355 360 365
Gly Leu Val Ala Val Val Gly Met Ser Gly Arg Phe Pro Gly Ser Gly
370 375 380
Asp Ile Asn Ala Phe Trp Glu Gly Leu Leu Glu Gly Lys Arg Tyr Ile
385 390 395 400
Gln Glu Ile Pro Asn Thr Arg Phe Asp Leu Glu Lys Trp Tyr Asp Ala
405 410 415
Thr Gly Lys Val Lys Asn Ser Thr Ile Ala Arg Thr Gly Ala Phe Leu
420 425 430
Asp Lys Pro Gly Met Phe Asp Asn Arg Leu Phe Asp Met Ser Pro Arg
435 440 445
Glu Ala Met Gln Thr Asp Val Gln His Arg Leu Leu Met Thr Thr Gly
450 455 460
Tyr Glu Ala Leu Glu Met Ser Gly Tyr Ser Pro Asp Gly Thr Pro Ser
465 470 475 480
Thr Asp Thr Ser Arg Ile Ala Ser Tyr Phe Gly Gln Thr Ser Asp Asp
485 490 495
Trp Arg Glu Val Val Val His Gln Gly Val Asp Ile Tyr Phe Ala Thr
500 505 510
Gly Ser Cys Arg Ala Phe Gly Pro Gly Arg Leu His His His Phe Lys
515 520 525
Trp Gly Gly Pro Ser Tyr Ser Val Asp Ser Ala Cys Ser Ser Ser Ile
530 535 540
Ala Ala Val Gly Leu Ala Cys Ser Ala Leu Leu Gly Arg Glu Cys Asp
545 550 555 560
Met Ala Leu Ala Gly Gly Gly Ser Leu Leu Leu Ser Pro Ser Pro Phe
565 570 575
Ser Gly Leu Ser Arg Gly Gly Phe Leu Ser Ala Gln Gly Gly Cys Gln
580 585 590
Thr Phe His Asp Asn Ala Asp Gly Tyr Val Arg Gly Glu Gly Val Gly
595 600 605
Val Val Val Leu Lys Arg Leu Glu Asp Ala Leu Asp Asp Gln Asp Asn
610 615 620
Ile Leu Gly Val Val Arg Gly Ser Gly Arg Asn Tyr Ser Ser Asp Ala
625 630 635 640
Ser Ser Met Met His Pro Ser Ala Asn Ala Gln Lys Gln Leu Tyr Arg
645 650 655
Asp Val Leu Glu Gln Ser Gly Val Glu Ala Asn Ser Ile Ser Tyr Val
660 665 670
Glu Met His Gly Thr Gly Thr Gln Ala Gly Asp Phe Met Glu Met Ser
675 680 685
Ser Val Leu Ser Thr Phe Ala Glu Lys Arg Gly Ala Asp Asn Pro Leu
690 695 700
Ile Val Gly Ala Leu Lys Ala Ser Ile Gly His Gly Glu Ala Ala Ala
705 710 715 720
Gly Val Cys Ala Leu Ile Lys Thr Leu Met Met Leu Gln Cys Arg Arg
725 730 735
Ile Pro Pro Gln Pro Asp Leu Pro Gly Pro Ile Asn His Arg Phe Pro
740 745 750
Asp Leu Ala Ala Arg Asn Val Tyr Ile Ala Ala Arg Asn Leu Lys Leu
755 760 765
Glu Ala Ser Pro Met Ala Lys Gly Val Leu Arg Met Phe Leu Asn Ser
770 775 780
Phe Asp Ala Ser Gly Gly Asn Ser Cys Leu Leu Leu Glu Glu Ala Pro
785 790 795 800
Pro Arg Ala Val Lys Asp Glu Asp Ala Arg Ser His His Val Val Thr
805 810 815
Leu Ser Ala Arg Ser Gln Lys Ser Leu Ile Gly Ile Lys Glu Lys Tyr
820 825 830
Leu Ala His Leu Ser Gln Asn Pro Gly Ile Lys Leu Ala Asp Leu Ala
835 840 845
Tyr Ser Thr Thr Ala Arg Arg Met His Gly Leu Leu Arg Tyr Ala Ile
850 855 860
Ala Ala Ser Ser Val Asp Glu Val Met Asn Ser Leu Glu Thr Asp Leu
865 870 875 880
Ala Gln Gly Lys Thr Pro Arg Gln Pro Pro Val Ala Pro Ser Ile Val
885 890 895
Phe Ile Phe Thr Gly Gln Gly Ala His Tyr Leu Gly Met Gly Ser Glu
900 905 910
Leu Trp Lys Thr Ser Ala Met Phe Arg Asn Thr Leu Gln Lys Tyr Gln
915 920 925
Thr Met Ala Ser Ala Glu Gly Leu Pro Tyr Phe Leu Asp Leu Ile Val
930 935 940
Gly Asn Ser Thr Ser Thr Gln Gln Ser Gly Pro Asp Thr Val Gln Val
945 950 955 960
Gln Leu Ala Met Val Ser Leu Glu Leu Ala Leu Ala Glu Leu Trp Arg
965 970 975
Ser Trp Gly Ile Gln Pro Ala Met Val Leu Gly His Ser Leu Gly Glu
980 985 990
Tyr Ala Ala Leu Cys Val Ala Gly Val Leu Ser Val Ser Asp Ala Leu
995 1000 1005
Tyr Leu Val Tyr Arg Arg Ala Gln Ile Met Thr Glu Ala Leu Thr
1010 1015 1020
Ala Ser Glu Tyr Gly Met Leu Ala Val Asn Leu Ser Val Cys Asp
1025 1030 1035
Thr Arg Glu Val Leu Ser Ser Gly Gln His Ala Ser Cys Ala Val
1040 1045 1050
Ala Cys Ile Asn Ala Pro Lys Met Thr Val Val Ser Gly Pro Leu
1055 1060 1065
Pro Lys Leu Glu Glu Leu Gln Asn Gln Leu Lys Ser Asp Gly Thr
1070 1075 1080
Arg Cys Thr Pro Leu Ser Val Pro Tyr Gly Phe His Ser Ser Gln
1085 1090 1095
Leu Asp Pro Ile Leu Asp Gln Phe Glu Ala Ala Cys Gln Gly Val
1100 1105 1110
Thr Phe Ser Ala Pro Lys Val Pro Val Val Ser Thr Leu Leu Ala
1115 1120 1125
Thr Val Val Arg Glu Glu Gly Thr Phe Ser Pro Gly Tyr Leu Ala
1130 1135 1140
Arg Gln Ala Arg Glu Pro Val Asp Phe Val Gly Ala Leu Gly Met
1145 1150 1155
Val Gln Glu Gln Ser Leu Ala Ser Leu Val Phe Leu Glu Val Gly
1160 1165 1170
Pro Glu Pro Val Cys Ser Gly Leu Val Asn Ala Thr Leu Ser Ala
1175 1180 1185
Gly Glu Thr Lys Ala Arg Cys Phe Ala Ser Met His Arg Gly His
1190 1195 1200
Glu Asn Trp Ala Ser Ile Ser Ser Ser Leu Arg Asp Leu Tyr Met
1205 1210 1215
Ala Gly Ala Pro Ile Asp Trp Pro Ala Phe His His Asp Phe Lys
1220 1225 1230
Ser Ser Val Ser Leu Leu Asp Leu Pro Lys Tyr Ser Phe Asp Glu
1235 1240 1245
Lys Glu Phe Trp Ala Ser Phe Pro Asn Arg Asp Met Gln Gly Thr
1250 1255 1260
Gly Glu Val Glu Pro Lys Gln Ser Gln Pro Pro Val Ile Val Pro
1265 1270 1275
Ser Val Gln Gly Tyr Cys Thr Thr Thr Leu Gln Arg Ile Val Lys
1280 1285 1290
Glu Thr Asp Gln Pro Asp Gly Leu Ser Val Thr Phe Thr Ser Asp
1295 1300 1305
Leu Ala Glu Gln His Leu Arg Ala Ala Val Arg Gly His Ala Val
1310 1315 1320
Ala Asp Ile Glu Ile Cys Ser Ser Ser Leu Leu Leu Asp Met Ala
1325 1330 1335
Leu Ser Ala Ala Gln Tyr Ala Tyr Leu Lys His Ser Pro Gly Gln
1340 1345 1350
Lys Met Pro Val Pro Leu Thr Val Arg Asn Cys Phe Phe His Arg
1355 1360 1365
Ala Val Val Leu Thr Glu Glu Ala Gln Thr Val Glu Val Thr Val
1370 1375 1380
Thr Phe Arg Ser Ser Thr Lys Thr Ala Asp Ile Gln Tyr Tyr Cys
1385 1390 1395
Arg Thr Ser Asp Glu Tyr Tyr Glu Phe Gly Ser Cys Gln Val Thr
1400 1405 1410
Leu Glu Ala Pro Arg Lys Pro Asp Gln Ala Gly Phe Leu Val Arg
1415 1420 1425
Ser Arg Ile Ala Ala Leu Lys Glu Ser Ala Ser His Arg Leu Gly
1430 1435 1440
Lys His Ala Val Tyr Arg Leu Phe Asp Asn Ile Val Arg Tyr Ser
1445 1450 1455
Glu Gln Tyr Gln Gly Leu Lys Asn Val His Leu Ser Glu Asp Met
1460 1465 1470
Arg Asp Ala Val Ala Glu Ile Asn Met Thr Gln Val Pro Ala Ala
1475 1480 1485
Gly Gly His Tyr Leu His His Pro Phe Leu Met Asp Ser Ile Val
1490 1495 1500
His Leu Ser Gly Phe Leu Val Asn Asn Gly Leu Arg Tyr Ser Ser
1505 1510 1515
Glu Trp Ala Cys Leu Ser Thr Gly Phe Glu Glu Leu His Leu Leu
1520 1525 1530
Lys Pro Leu Asp Pro Ala Thr Val Tyr Thr Ser Tyr Thr Phe Met
1535 1540 1545
Glu Asp Ser Pro Thr Thr Ser Asn Val Ile Gly Asp Val Tyr Val
1550 1555 1560
Tyr Asp Gly Ala Glu Leu Val Ser Val Val Thr Gly Leu Gln Phe
1565 1570 1575
Gln Lys Met Lys Arg Thr Ala Leu Thr His Leu Leu Ser Pro Ala
1580 1585 1590
Thr Ala Arg Asn Thr Ala Ala Lys Thr Ile Pro His Arg Pro Thr
1595 1600 1605
Thr Ala Pro Ala Lys Ala Leu Ser Asp Gln Gln Pro Ile Thr Thr
1610 1615 1620
Ile Gln Ala Glu Ala Ala Val Pro Gln Ala Ser Thr Pro Ser Thr
1625 1630 1635
Ala Ala Ser Val Asn Gly Gly Glu Gly Glu Lys Phe Asp Leu Val
1640 1645 1650
Glu Thr Leu Phe Ser Ile Ile Ala Arg Glu Val Gly Val Asp Ser
1655 1660 1665
Ser Asp Leu Lys Gly Asp Val Asn Leu Ala Asn Leu Gly Ile Asp
1670 1675 1680
Ser Leu Met Ala Ile Thr Ile Ile Ser Val Met Gln Gln Glu Thr
1685 1690 1695
Gly Ile Glu Leu Pro Gly Thr Phe Phe Leu Asp Asn Ser Thr Thr
1700 1705 1710
Thr Ala Val Ile Ala Ala Val Gly
1715 1720
<210> 11
<211> 1149
<212> DNA
<213> Metarhizium rileyi
<220>
<221> CDS
<222> (1)..(1149)
<400> 11
atg gct gtc act gtg tgg caa gat gcg ctc gag atc atc gct cag gag 48
Met Ala Val Thr Val Trp Gln Asp Ala Leu Glu Ile Ile Ala Gln Glu
1 5 10 15
agc ggg ctg gaa ccc gca gag atc atc gag acg gac gac atg gag ttc 96
Ser Gly Leu Glu Pro Ala Glu Ile Ile Glu Thr Asp Asp Met Glu Phe
20 25 30
gcc aga ctt ggc atc aat cat att ctc gcc acg gcc atc ttg tcg cac 144
Ala Arg Leu Gly Ile Asn His Ile Leu Ala Thr Ala Ile Leu Ser His
35 40 45
ctc aga ggg cct cgc gga gag cct ctc cca cga gac att ttt gat cag 192
Leu Arg Gly Pro Arg Gly Glu Pro Leu Pro Arg Asp Ile Phe Asp Gln
50 55 60
aag cgc aca gtt gga gct ttc cgg cgt ttc tac gag acg tct att cac 240
Lys Arg Thr Val Gly Ala Phe Arg Arg Phe Tyr Glu Thr Ser Ile His
65 70 75 80
ctt gag act tct ccc atc acc ccc atc ctc gca ccc aag cga gct cag 288
Leu Glu Thr Ser Pro Ile Thr Pro Ile Leu Ala Pro Lys Arg Ala Gln
85 90 95
ctg aag cgt gag aag tcg ttt act gtt ccg ctc tcc atc gtc ttg cag 336
Leu Lys Arg Glu Lys Ser Phe Thr Val Pro Leu Ser Ile Val Leu Gln
100 105 110
aat agc ccg gct tcg agc cgg cac acc gta ttc ctc ctc cca gac ggc 384
Asn Ser Pro Ala Ser Ser Arg His Thr Val Phe Leu Leu Pro Asp Gly
115 120 125
agc ggc tct gcc atg gcg tac gca aac ctg cca cca gtc cac cca acc 432
Ser Gly Ser Ala Met Ala Tyr Ala Asn Leu Pro Pro Val His Pro Thr
130 135 140
gtc tgt gtc gtt ggg atg aac agt ccc tac ctc cgt gac gcc aac tca 480
Val Cys Val Val Gly Met Asn Ser Pro Tyr Leu Arg Asp Ala Asn Ser
145 150 155 160
tat cgc tgc tct gtc gag aat ctg gcg tcg caa tgg gtc cag gaa atc 528
Tyr Arg Cys Ser Val Glu Asn Leu Ala Ser Gln Trp Val Gln Glu Ile
165 170 175
tat cgc cgc cag cca cgc gga cct tat atc gtc ggt gga tgg tcg gcg 576
Tyr Arg Arg Gln Pro Arg Gly Pro Tyr Ile Val Gly Gly Trp Ser Ala
180 185 190
gga ggt tac tac tcg tac gaa gtg gcc caa cgc ctc ctg caa gat ggt 624
Gly Gly Tyr Tyr Ser Tyr Glu Val Ala Gln Arg Leu Leu Gln Asp Gly
195 200 205
cac gtc gtg gac aag ctg att ctg ata gac tcg cct tgc cgc act gtc 672
His Val Val Asp Lys Leu Ile Leu Ile Asp Ser Pro Cys Arg Thr Val
210 215 220
ttc gag tct ctc tcg atg gaa gtc gtc aac tat ctc tca aag cat aac 720
Phe Glu Ser Leu Ser Met Glu Val Val Asn Tyr Leu Ser Lys His Asn
225 230 235 240
cta atg ggc aac tgg ggc tcc caa gga ctt ccg gac tgg cta gtc cag 768
Leu Met Gly Asn Trp Gly Ser Gln Gly Leu Pro Asp Trp Leu Val Gln
245 250 255
cat ttc cgc tcc acg ctc gcc gcc gtg ggc aag tat cgt cca agg cca 816
His Phe Arg Ser Thr Leu Ala Ala Val Gly Lys Tyr Arg Pro Arg Pro
260 265 270
ctg cat tcg gtt ggg gaa atg gag acg tac atc atc tgg agt cgc gat 864
Leu His Ser Val Gly Glu Met Glu Thr Tyr Ile Ile Trp Ser Arg Asp
275 280 285
ggt gtg ctg gaa cac gat gct ttg gtc gag tct ggt ctc gac atg agc 912
Gly Val Leu Glu His Asp Ala Leu Val Glu Ser Gly Leu Asp Met Ser
290 295 300
atc aag gta tcc agg ttt ctg ctc gaa ggc aag gac gat ctg gga ccc 960
Ile Lys Val Ser Arg Phe Leu Leu Glu Gly Lys Asp Asp Leu Gly Pro
305 310 315 320
aac gga tgg gat gag ctg ctg ccc agc aag gat att gcg att gcc act 1008
Asn Gly Trp Asp Glu Leu Leu Pro Ser Lys Asp Ile Ala Ile Ala Thr
325 330 335
cag tcg ggg acg cat ttc acc atg atc aac aag cct cac gtg gca cag 1056
Gln Ser Gly Thr His Phe Thr Met Ile Asn Lys Pro His Val Ala Gln
340 345 350
atg agc gat ctt tta cgc gat gcg gtg act ggc atc act acc gac aga 1104
Met Ser Asp Leu Leu Arg Asp Ala Val Thr Gly Ile Thr Thr Asp Arg
355 360 365
cta tcg cag tgg cag aga gta aga aag gac gag cag gga aag tag 1149
Leu Ser Gln Trp Gln Arg Val Arg Lys Asp Glu Gln Gly Lys
370 375 380
<210> 12
<211> 382
<212> PRT
<213> Metarhizium rileyi
<400> 12
Met Ala Val Thr Val Trp Gln Asp Ala Leu Glu Ile Ile Ala Gln Glu
1 5 10 15
Ser Gly Leu Glu Pro Ala Glu Ile Ile Glu Thr Asp Asp Met Glu Phe
20 25 30
Ala Arg Leu Gly Ile Asn His Ile Leu Ala Thr Ala Ile Leu Ser His
35 40 45
Leu Arg Gly Pro Arg Gly Glu Pro Leu Pro Arg Asp Ile Phe Asp Gln
50 55 60
Lys Arg Thr Val Gly Ala Phe Arg Arg Phe Tyr Glu Thr Ser Ile His
65 70 75 80
Leu Glu Thr Ser Pro Ile Thr Pro Ile Leu Ala Pro Lys Arg Ala Gln
85 90 95
Leu Lys Arg Glu Lys Ser Phe Thr Val Pro Leu Ser Ile Val Leu Gln
100 105 110
Asn Ser Pro Ala Ser Ser Arg His Thr Val Phe Leu Leu Pro Asp Gly
115 120 125
Ser Gly Ser Ala Met Ala Tyr Ala Asn Leu Pro Pro Val His Pro Thr
130 135 140
Val Cys Val Val Gly Met Asn Ser Pro Tyr Leu Arg Asp Ala Asn Ser
145 150 155 160
Tyr Arg Cys Ser Val Glu Asn Leu Ala Ser Gln Trp Val Gln Glu Ile
165 170 175
Tyr Arg Arg Gln Pro Arg Gly Pro Tyr Ile Val Gly Gly Trp Ser Ala
180 185 190
Gly Gly Tyr Tyr Ser Tyr Glu Val Ala Gln Arg Leu Leu Gln Asp Gly
195 200 205
His Val Val Asp Lys Leu Ile Leu Ile Asp Ser Pro Cys Arg Thr Val
210 215 220
Phe Glu Ser Leu Ser Met Glu Val Val Asn Tyr Leu Ser Lys His Asn
225 230 235 240
Leu Met Gly Asn Trp Gly Ser Gln Gly Leu Pro Asp Trp Leu Val Gln
245 250 255
His Phe Arg Ser Thr Leu Ala Ala Val Gly Lys Tyr Arg Pro Arg Pro
260 265 270
Leu His Ser Val Gly Glu Met Glu Thr Tyr Ile Ile Trp Ser Arg Asp
275 280 285
Gly Val Leu Glu His Asp Ala Leu Val Glu Ser Gly Leu Asp Met Ser
290 295 300
Ile Lys Val Ser Arg Phe Leu Leu Glu Gly Lys Asp Asp Leu Gly Pro
305 310 315 320
Asn Gly Trp Asp Glu Leu Leu Pro Ser Lys Asp Ile Ala Ile Ala Thr
325 330 335
Gln Ser Gly Thr His Phe Thr Met Ile Asn Lys Pro His Val Ala Gln
340 345 350
Met Ser Asp Leu Leu Arg Asp Ala Val Thr Gly Ile Thr Thr Asp Arg
355 360 365
Leu Ser Gln Trp Gln Arg Val Arg Lys Asp Glu Gln Gly Lys
370 375 380
<210> 13
<211> 7524
<212> DNA
<213> Talaromyces islandicus
<220>
<221> CDS
<222> (1)..(7524)
<400> 13
atg gcg aca acg aat gaa gtc cgg tgg gct caa gat att gcc att gtt 48
Met Ala Thr Thr Asn Glu Val Arg Trp Ala Gln Asp Ile Ala Ile Val
1 5 10 15
ggc atg tcc tgc cga ttc gcc gat gac gcg gat tca ttc cct cgg ttc 96
Gly Met Ser Cys Arg Phe Ala Asp Asp Ala Asp Ser Phe Pro Arg Phe
20 25 30
tgg gat ttc att tgc aat gga aga tat gcg ttc cac tac cct gga aaa 144
Trp Asp Phe Ile Cys Asn Gly Arg Tyr Ala Phe His Tyr Pro Gly Lys
35 40 45
aaa aca aac aca agt ttg cct cgc ggt gca cat ttc ttc aaa gat gac 192
Lys Thr Asn Thr Ser Leu Pro Arg Gly Ala His Phe Phe Lys Asp Asp
50 55 60
atc gca gag ttc gat gcc aat ttc ttc aac atc tcc aaa gtc gag gcc 240
Ile Ala Glu Phe Asp Ala Asn Phe Phe Asn Ile Ser Lys Val Glu Ala
65 70 75 80
gaa tcg att gat ccg caa cag cgc atg gtg atg gaa aca acg ttc gaa 288
Glu Ser Ile Asp Pro Gln Gln Arg Met Val Met Glu Thr Thr Phe Glu
85 90 95
gcc cta gaa aat gct gga att act ata gac aaa gtg gca gga acc cgc 336
Ala Leu Glu Asn Ala Gly Ile Thr Ile Asp Lys Val Ala Gly Thr Arg
100 105 110
gct ggt gtc tgg atg gcc aat ttt act agc gat tat cgt gag atg cta 384
Ala Gly Val Trp Met Ala Asn Phe Thr Ser Asp Tyr Arg Glu Met Leu
115 120 125
tac cga gat tca gag aca gca ccg atg tat acc ctg tca ggc gcc agc 432
Tyr Arg Asp Ser Glu Thr Ala Pro Met Tyr Thr Leu Ser Gly Ala Ser
130 135 140
aac aca tcc acg tca aac cgt gta tca tgg ttc ttt gat ctc aaa ggc 480
Asn Thr Ser Thr Ser Asn Arg Val Ser Trp Phe Phe Asp Leu Lys Gly
145 150 155 160
cca agc ttt acc ttg aac act gca tgc tct tca agt atg gtg gct acc 528
Pro Ser Phe Thr Leu Asn Thr Ala Cys Ser Ser Ser Met Val Ala Thr
165 170 175
cat cta gct tgc cag agc ctt gct ctg ggt gaa tcc agc agt gcg ata 576
His Leu Ala Cys Gln Ser Leu Ala Leu Gly Glu Ser Ser Ser Ala Ile
180 185 190
gtt ggc ggg aca agt ctc ctc ttg aat cca gac cta ttc ctc ttt ttg 624
Val Gly Gly Thr Ser Leu Leu Leu Asn Pro Asp Leu Phe Leu Phe Leu
195 200 205
tcg aat cag cat ttc tta gca gct gat ggt aaa tct aaa gcc ttt gat 672
Ser Asn Gln His Phe Leu Ala Ala Asp Gly Lys Ser Lys Ala Phe Asp
210 215 220
gcc agt ggt gat gga tac ggc cgg ggt gaa ggc gtt gct gtt gtt gtc 720
Ala Ser Gly Asp Gly Tyr Gly Arg Gly Glu Gly Val Ala Val Val Val
225 230 235 240
tta aag cgt gtt gcg gac gcc atc gct gat ggt gat ccc att cga gca 768
Leu Lys Arg Val Ala Asp Ala Ile Ala Asp Gly Asp Pro Ile Arg Ala
245 250 255
gtg atc cgt ggg act gcc atc aat caa gat gga agg aca aag gga atg 816
Val Ile Arg Gly Thr Ala Ile Asn Gln Asp Gly Arg Thr Lys Gly Met
260 265 270
aca tta cct agt gta gat gct caa gaa caa ttg atc aag gat gcc tat 864
Thr Leu Pro Ser Val Asp Ala Gln Glu Gln Leu Ile Lys Asp Ala Tyr
275 280 285
cgc aat gca gga ctg tcc atg aag gac act cga tat gtc gaa gct cac 912
Arg Asn Ala Gly Leu Ser Met Lys Asp Thr Arg Tyr Val Glu Ala His
290 295 300
gga aca gga act caa gct ggt gac aag tgt gag acg gag gca tta tct 960
Gly Thr Gly Thr Gln Ala Gly Asp Lys Cys Glu Thr Glu Ala Leu Ser
305 310 315 320
cga act ttt agc cca tac cgt act gca tcc gaa cga ctc att ctt ggg 1008
Arg Thr Phe Ser Pro Tyr Arg Thr Ala Ser Glu Arg Leu Ile Leu Gly
325 330 335
tct gtc aag acc aac att ggg cat ttg gag gca tgt gcc ggt tta gcg 1056
Ser Val Lys Thr Asn Ile Gly His Leu Glu Ala Cys Ala Gly Leu Ala
340 345 350
tcc atg ata aaa tgc gtt ggt att ctt gaa gcc gga gtg att cct cca 1104
Ser Met Ile Lys Cys Val Gly Ile Leu Glu Ala Gly Val Ile Pro Pro
355 360 365
aat cca tta tac aaa aaa ggt aac ccg gga ata aaa ttc gac gac tgg 1152
Asn Pro Leu Tyr Lys Lys Gly Asn Pro Gly Ile Lys Phe Asp Asp Trp
370 375 380
aaa ctc cat gta cct act agc tca ata caa tgg ccg acc agt ggc ctg 1200
Lys Leu His Val Pro Thr Ser Ser Ile Gln Trp Pro Thr Ser Gly Leu
385 390 395 400
cgg cgc atc agc acc caa gga ttt ggg tat gga gga acc aat gcg cat 1248
Arg Arg Ile Ser Thr Gln Gly Phe Gly Tyr Gly Gly Thr Asn Ala His
405 410 415
atc atc atg gac gac gct cac aac tat ctg gta tct cgt gac ata act 1296
Ile Ile Met Asp Asp Ala His Asn Tyr Leu Val Ser Arg Asp Ile Thr
420 425 430
gcg ata cac aat aca tgc ctg ctc aat ctg aca aat gga acc act tat 1344
Ala Ile His Asn Thr Cys Leu Leu Asn Leu Thr Asn Gly Thr Thr Tyr
435 440 445
ata gag cat aaa gag gct cct cgg cca agg att ttc cat ttt agt gcc 1392
Ile Glu His Lys Glu Ala Pro Arg Pro Arg Ile Phe His Phe Ser Ala
450 455 460
cag gac aag gac ggg cta ggg agg gta cga gac gcc act tgc cag tat 1440
Gln Asp Lys Asp Gly Leu Gly Arg Val Arg Asp Ala Thr Cys Gln Tyr
465 470 475 480
ctc aag tca ggt gca tta gag gct ggg aaa atg cgc cag aat gaa gat 1488
Leu Lys Ser Gly Ala Leu Glu Ala Gly Lys Met Arg Gln Asn Glu Asp
485 490 495
aaa tac ctt aga gat cta gct tat aca ctg tca gag aga cgt tct cgg 1536
Lys Tyr Leu Arg Asp Leu Ala Tyr Thr Leu Ser Glu Arg Arg Ser Arg
500 505 510
ttg caa tgg cag aca ttt gcg gtg gcc tca tct gtc gaa gga ttg att 1584
Leu Gln Trp Gln Thr Phe Ala Val Ala Ser Ser Val Glu Gly Leu Ile
515 520 525
gaa aca tta cag acc aag cca tgg gcc agt cca gag aca cgc tca gcg 1632
Glu Thr Leu Gln Thr Lys Pro Trp Ala Ser Pro Glu Thr Arg Ser Ala
530 535 540
tca aaa gta cct cgc ata ggc ttc ata ttt act ggt caa ggg gct cag 1680
Ser Lys Val Pro Arg Ile Gly Phe Ile Phe Thr Gly Gln Gly Ala Gln
545 550 555 560
tgg cca cgg atg gga atc gag ctg atg gaa tat gac att ttc cga aaa 1728
Trp Pro Arg Met Gly Ile Glu Leu Met Glu Tyr Asp Ile Phe Arg Lys
565 570 575
agc gtg gaa aga tca gat gtt tac ttg cgc gag gga ttg gac tgc tcc 1776
Ser Val Glu Arg Ser Asp Val Tyr Leu Arg Glu Gly Leu Asp Cys Ser
580 585 590
tgg tct gcc atc gaa gaa ctt gct aaa cct gat tcc tcg tct aac ctg 1824
Trp Ser Ala Ile Glu Glu Leu Ala Lys Pro Asp Ser Ser Ser Asn Leu
595 600 605
ggc gca gcg gaa tac agc caa gca ctc tgt tcc gtt ctt cag att gcc 1872
Gly Ala Ala Glu Tyr Ser Gln Ala Leu Cys Ser Val Leu Gln Ile Ala
610 615 620
cta ata gac ctg ctc gat agc tgg aac atc aga cca agc gca gta gcc 1920
Leu Ile Asp Leu Leu Asp Ser Trp Asn Ile Arg Pro Ser Ala Val Ala
625 630 635 640
ggc cat tct agt gga gaa ata gcg gcg gcc tac tgc ctt ggg gtt ctc 1968
Gly His Ser Ser Gly Glu Ile Ala Ala Ala Tyr Cys Leu Gly Val Leu
645 650 655
tct tgg gag gat gcc cta aaa gta gct tac ttt cga ggg tcg cta tcg 2016
Ser Trp Glu Asp Ala Leu Lys Val Ala Tyr Phe Arg Gly Ser Leu Ser
660 665 670
gca gag atg aag gga aat gac agc tcg ctc aat gga gca atg atg gct 2064
Ala Glu Met Lys Gly Asn Asp Ser Ser Leu Asn Gly Ala Met Met Ala
675 680 685
gtc ggc tct tca cca gcg gat att gaa aag tgg ctc gac aaa gtt act 2112
Val Gly Ser Ser Pro Ala Asp Ile Glu Lys Trp Leu Asp Lys Val Thr
690 695 700
gca ggg gag gtt gta gtt gca tgc gtg aac tcc cct gcc agc att act 2160
Ala Gly Glu Val Val Val Ala Cys Val Asn Ser Pro Ala Ser Ile Thr
705 710 715 720
ttg tct ggt gat gct gcc ggt atc aac gaa ttg gag tcc atg ttg aaa 2208
Leu Ser Gly Asp Ala Ala Gly Ile Asn Glu Leu Glu Ser Met Leu Lys
725 730 735
gaa gca ggg ata ttt gca agg aaa cta aag gtg gac acg gca tac cac 2256
Glu Ala Gly Ile Phe Ala Arg Lys Leu Lys Val Asp Thr Ala Tyr His
740 745 750
tct cca cat atg cag acc att gcc ggc caa tac ttt gaa gcc att gcc 2304
Ser Pro His Met Gln Thr Ile Ala Gly Gln Tyr Phe Glu Ala Ile Ala
755 760 765
gac att tct ata tta ccg gtg agg aat ggg tgc caa atg cat tct agc 2352
Asp Ile Ser Ile Leu Pro Val Arg Asn Gly Cys Gln Met His Ser Ser
770 775 780
gtg cga ggt ggc tac att gat ccg aat gaa ctc ggt gcc gcg aat tgg 2400
Val Arg Gly Gly Tyr Ile Asp Pro Asn Glu Leu Gly Ala Ala Asn Trp
785 790 795 800
gta cgg aat ttg gta tcg act gtt cag ttt gct gat gct gtt cac gat 2448
Val Arg Asn Leu Val Ser Thr Val Gln Phe Ala Asp Ala Val His Asp
805 810 815
ctt ctt cga cca tta gtt tat ggt gag cgt gca gcg cac aat gct gtg 2496
Leu Leu Arg Pro Leu Val Tyr Gly Glu Arg Ala Ala His Asn Ala Val
820 825 830
gac att ctg gtt gaa gtc ggg ccg cat tct gct tta cag gga ccg gta 2544
Asp Ile Leu Val Glu Val Gly Pro His Ser Ala Leu Gln Gly Pro Val
835 840 845
aac cag aca atg aag gcc cat gga atc aat agt atc aat tat tgt aca 2592
Asn Gln Thr Met Lys Ala His Gly Ile Asn Ser Ile Asn Tyr Cys Thr
850 855 860
atg ctc tca cgt ggg aaa aat gcc atc aat acg gct cta tca tgt gcc 2640
Met Leu Ser Arg Gly Lys Asn Ala Ile Asn Thr Ala Leu Ser Cys Ala
865 870 875 880
gcc act ttg tat gtg gaa ggc ctc gca gtc gat ctt cgc agg gcc aac 2688
Ala Thr Leu Tyr Val Glu Gly Leu Ala Val Asp Leu Arg Arg Ala Asn
885 890 895
cag gat gaa agc ttt gcg gtt gag cct atc ttc gat atg cct tcg tac 2736
Gln Asp Glu Ser Phe Ala Val Glu Pro Ile Phe Asp Met Pro Ser Tyr
900 905 910
cct tgg aac cac tca att cga tat tgg gcc gaa tct cgt gtg gaa aag 2784
Pro Trp Asn His Ser Ile Arg Tyr Trp Ala Glu Ser Arg Val Glu Lys
915 920 925
gaa tat cga cag cgg aag tat ccc cgt aca cct tta ctc ggt gct cct 2832
Glu Tyr Arg Gln Arg Lys Tyr Pro Arg Thr Pro Leu Leu Gly Ala Pro
930 935 940
tgt ccg tct atg aat gcg ggt gag aag gtc tgg aga ggc ttt att cga 2880
Cys Pro Ser Met Asn Ala Gly Glu Lys Val Trp Arg Gly Phe Ile Arg
945 950 955 960
cct agt gag gag ccg tgg gtt cgc gat cat gtt att caa ggc tcc att 2928
Pro Ser Glu Glu Pro Trp Val Arg Asp His Val Ile Gln Gly Ser Ile
965 970 975
tta tat cca gct gcc gga ttc tta gca atg gcc att gaa gct gca agg 2976
Leu Tyr Pro Ala Ala Gly Phe Leu Ala Met Ala Ile Glu Ala Ala Arg
980 985 990
cag ggg act gag acg gga agg tca att gac ggt ttc aga ctt cgt gat 3024
Gln Gly Thr Glu Thr Gly Arg Ser Ile Asp Gly Phe Arg Leu Arg Asp
995 1000 1005
gtc cag att aat gct gcc ctg gtt att gag gaa aat gtc gaa cca 3069
Val Gln Ile Asn Ala Ala Leu Val Ile Glu Glu Asn Val Glu Pro
1010 1015 1020
gaa gtg ata ttg agg ttg cag cca cac aga atg ggc acc ctg gat 3114
Glu Val Ile Leu Arg Leu Gln Pro His Arg Met Gly Thr Leu Asp
1025 1030 1035
gcg ggt tca gta tcc tgg cag gaa ttc act gtt tca tct tca aca 3159
Ala Gly Ser Val Ser Trp Gln Glu Phe Thr Val Ser Ser Ser Thr
1040 1045 1050
gat gga aca gat cta cga caa aat tgt tca gga ctg ctt gcc atc 3204
Asp Gly Thr Asp Leu Arg Gln Asn Cys Ser Gly Leu Leu Ala Ile
1055 1060 1065
gat tat gaa ccc gct gag gga tct tct atg cac atc gag aaa atc 3249
Asp Tyr Glu Pro Ala Glu Gly Ser Ser Met His Ile Glu Lys Ile
1070 1075 1080
aag gag gtc gag act atc aaa gga aaa ttg gtc aag gcg aag gaa 3294
Lys Glu Val Glu Thr Ile Lys Gly Lys Leu Val Lys Ala Lys Glu
1085 1090 1095
cag tgt aga gct gct atc aat gtc gat gaa ttt tat gcc cat ctt 3339
Gln Cys Arg Ala Ala Ile Asn Val Asp Glu Phe Tyr Ala His Leu
1100 1105 1110
gac acc gtt ggc cta aca tat ggc gag act ttc gct aac ctg acc 3384
Asp Thr Val Gly Leu Thr Tyr Gly Glu Thr Phe Ala Asn Leu Thr
1115 1120 1125
gag gtt cac acc aat gca gca aca gga gaa tgt aca ggt cgt ttg 3429
Glu Val His Thr Asn Ala Ala Thr Gly Glu Cys Thr Gly Arg Leu
1130 1135 1140
ctc gta cct gat gtt gag tca gcc atc cct ccg cat atg agg gaa 3474
Leu Val Pro Asp Val Glu Ser Ala Ile Pro Pro His Met Arg Glu
1145 1150 1155
cgg cca cac atc ata cac cca aca acc tta gat gcc att ttt cac 3519
Arg Pro His Ile Ile His Pro Thr Thr Leu Asp Ala Ile Phe His
1160 1165 1170
tta gca ttt gct gca atc agc gaa cat cca ttc tca ctc aag agt 3564
Leu Ala Phe Ala Ala Ile Ser Glu His Pro Phe Ser Leu Lys Ser
1175 1180 1185
gcc atg gtt cct att tcg ata aca gag gta gtc att tca aac gaa 3609
Ala Met Val Pro Ile Ser Ile Thr Glu Val Val Ile Ser Asn Glu
1190 1195 1200
gtg ccc cac aga aag gga tcc cag ctc gaa gga ttc gct cag tct 3654
Val Pro His Arg Lys Gly Ser Gln Leu Glu Gly Phe Ala Gln Ser
1205 1210 1215
tct cgg ttt gga ttt cga gaa ttg gtc acc aat atc aac att ttt 3699
Ser Arg Phe Gly Phe Arg Glu Leu Val Thr Asn Ile Asn Ile Phe
1220 1225 1230
gac gag caa ctc aca gat gcc gtt gtc aag atc agc gga ttt aga 3744
Asp Glu Gln Leu Thr Asp Ala Val Val Lys Ile Ser Gly Phe Arg
1235 1240 1245
tgt gca gat gtg tct ggt tca agc caa agt acg agc agc ggt gag 3789
Cys Ala Asp Val Ser Gly Ser Ser Gln Ser Thr Ser Ser Gly Glu
1250 1255 1260
gca gcc aag cca att acg ttt aaa gaa atc cat cga cct gct ctg 3834
Ala Ala Lys Pro Ile Thr Phe Lys Glu Ile His Arg Pro Ala Leu
1265 1270 1275
gag ctt ctt gac tat gag gat ctc caa aga gct gtc aac gca aat 3879
Glu Leu Leu Asp Tyr Glu Asp Leu Gln Arg Ala Val Asn Ala Asn
1280 1285 1290
gcg gac gaa att gct agt gga ata ttt gaa cag gat acc tct ctc 3924
Ala Asp Glu Ile Ala Ser Gly Ile Phe Glu Gln Asp Thr Ser Leu
1295 1300 1305
gac aaa tcc gcc ctc gcc att gtt aag cgg aca ctg tct aac gtt 3969
Asp Lys Ser Ala Leu Ala Ile Val Lys Arg Thr Leu Ser Asn Val
1310 1315 1320
cca cgg tca tct gta cat aaa gat ttg ctc ggt ttc tac gat tgg 4014
Pro Arg Ser Ser Val His Lys Asp Leu Leu Gly Phe Tyr Asp Trp
1325 1330 1335
atg cag agg caa gtt tca tcg gca gac aaa gca tca ggt gct ggt 4059
Met Gln Arg Gln Val Ser Ser Ala Asp Lys Ala Ser Gly Ala Gly
1340 1345 1350
caa aga gac agc acg ggc tat aca aat ata tct gtg aag gac cta 4104
Gln Arg Asp Ser Thr Gly Tyr Thr Asn Ile Ser Val Lys Asp Leu
1355 1360 1365
gaa ggt att ctg tct ggt gaa aaa att gct gca cag gcc atg gat 4149
Glu Gly Ile Leu Ser Gly Glu Lys Ile Ala Ala Gln Ala Met Asp
1370 1375 1380
gag aac gtc att ctt atg cct gct ctc act agc tct gcg aac ttc 4194
Glu Asn Val Ile Leu Met Pro Ala Leu Thr Ser Ser Ala Asn Phe
1385 1390 1395
caa caa ata atg aaa aaa ttg agc cag tat tta ctt att ctg cag 4239
Gln Gln Ile Met Lys Lys Leu Ser Gln Tyr Leu Leu Ile Leu Gln
1400 1405 1410
cac aca tac cca gaa ctc tcc gtt ctc gag atc att cat tcg gcg 4284
His Thr Tyr Pro Glu Leu Ser Val Leu Glu Ile Ile His Ser Ala
1415 1420 1425
gaa aat tca act act gga tct att tta ccc cag ttg caa tct gct 4329
Glu Asn Ser Thr Thr Gly Ser Ile Leu Pro Gln Leu Gln Ser Ala
1430 1435 1440
gaa gtt att ctt gat aca agc aaa tac act gtg ctt gtg caa aat 4374
Glu Val Ile Leu Asp Thr Ser Lys Tyr Thr Val Leu Val Gln Asn
1445 1450 1455
gag aag gct gcc aaa aca gtg gaa agc cag cta ggt acc ctg acg 4419
Glu Lys Ala Ala Lys Thr Val Glu Ser Gln Leu Gly Thr Leu Thr
1460 1465 1470
gat ctt ata tcg ctt gaa gtg agc gcc aca gac aat agt gta caa 4464
Asp Leu Ile Ser Leu Glu Val Ser Ala Thr Asp Asn Ser Val Gln
1475 1480 1485
gac cat gga cgc cag tat gat ctt gct ctt gtg gta aac att gct 4509
Asp His Gly Arg Gln Tyr Asp Leu Ala Leu Val Val Asn Ile Ala
1490 1495 1500
cat aaa gac cct gat gta ctt ctc tgc gaa gca aaa tca tcc ctg 4554
His Lys Asp Pro Asp Val Leu Leu Cys Glu Ala Lys Ser Ser Leu
1505 1510 1515
aaa gaa ggg ggc cgt gtt tgc att atc gaa ata ggc gag cct ctc 4599
Lys Glu Gly Gly Arg Val Cys Ile Ile Glu Ile Gly Glu Pro Leu
1520 1525 1530
ttg aat ctt gga ata ggg ttg gcc gct tta cag cac act cat ttc 4644
Leu Asn Leu Gly Ile Gly Leu Ala Ala Leu Gln His Thr His Phe
1535 1540 1545
att att agt agc caa aac aca gac gag tct cac ttg aat cgt gct 4689
Ile Ile Ser Ser Gln Asn Thr Asp Glu Ser His Leu Asn Arg Ala
1550 1555 1560
ggg ttt acg aaa gag ctt ctt ctt gga gat gcc tta cca ccc aag 4734
Gly Phe Thr Lys Glu Leu Leu Leu Gly Asp Ala Leu Pro Pro Lys
1565 1570 1575
aac gag ttc cgg ctc ata gcc gga aat aca tcg aag cga tta gca 4779
Asn Glu Phe Arg Leu Ile Ala Gly Asn Thr Ser Lys Arg Leu Ala
1580 1585 1590
gtt act att caa gga gag ata gtc att gta cag gcg cct gag ccg 4824
Val Thr Ile Gln Gly Glu Ile Val Ile Val Gln Ala Pro Glu Pro
1595 1600 1605
tca aaa tct gct caa aat gtt gct gat gcc ctt act gaa gtg ctt 4869
Ser Lys Ser Ala Gln Asn Val Ala Asp Ala Leu Thr Glu Val Leu
1610 1615 1620
gag aaa caa tgt gtg cgc gcc att cgt gtt gat tgg agc tta ccc 4914
Glu Lys Gln Cys Val Arg Ala Ile Arg Val Asp Trp Ser Leu Pro
1625 1630 1635
gag tat att tcg gtc ata gaa ggc aag gaa tgt atc gtc ttg gct 4959
Glu Tyr Ile Ser Val Ile Glu Gly Lys Glu Cys Ile Val Leu Ala
1640 1645 1650
gat ctg gag aag tca cac cta cta gaa gca tct cag gag gaa ttc 5004
Asp Leu Glu Lys Ser His Leu Leu Glu Ala Ser Gln Glu Glu Phe
1655 1660 1665
cca ata ata caa cag acc atc ctg aag gct gga ggc atc ctt tgg 5049
Pro Ile Ile Gln Gln Thr Ile Leu Lys Ala Gly Gly Ile Leu Trp
1670 1675 1680
gtt agt ggc tct atc gga cca gac gcg gca tta gtc act gga ttg 5094
Val Ser Gly Ser Ile Gly Pro Asp Ala Ala Leu Val Thr Gly Leu
1685 1690 1695
gct cga aca att cgc aac gag ata cca ggc agc aag ctg cga gtt 5139
Ala Arg Thr Ile Arg Asn Glu Ile Pro Gly Ser Lys Leu Arg Val
1700 1705 1710
ctt cag aca aat gag ctc tcg tta gct tca ccg acc acg tgg tca 5184
Leu Gln Thr Asn Glu Leu Ser Leu Ala Ser Pro Thr Thr Trp Ser
1715 1720 1725
aat tat att ttg cga ttg cta caa tca cca acg cta gat agt gag 5229
Asn Tyr Ile Leu Arg Leu Leu Gln Ser Pro Thr Leu Asp Ser Glu
1730 1735 1740
ttc acc atc aaa gat ggt ttt ctc caa atc agt cgc gtc gta gaa 5274
Phe Thr Ile Lys Asp Gly Phe Leu Gln Ile Ser Arg Val Val Glu
1745 1750 1755
tat tac act cga aac gac gct ttg gcg gtt tct ctc ggg cgg cag 5319
Tyr Tyr Thr Arg Asn Asp Ala Leu Ala Val Ser Leu Gly Arg Gln
1760 1765 1770
gag cct aaa acg gtg cat atg cct ctt agt gaa act tca agc cca 5364
Glu Pro Lys Thr Val His Met Pro Leu Ser Glu Thr Ser Ser Pro
1775 1780 1785
gtc aaa ctg tgt atc aag aat cct ggg atg ctt gat tca cta tat 5409
Val Lys Leu Cys Ile Lys Asn Pro Gly Met Leu Asp Ser Leu Tyr
1790 1795 1800
ttt gaa ccg gat gat atc ctt aat agt cct cta gcc tcc ggg caa 5454
Phe Glu Pro Asp Asp Ile Leu Asn Ser Pro Leu Ala Ser Gly Gln
1805 1810 1815
gtc gaa ata gaa gtg aaa gca tcg ggt gtc aat ttc cgc gat gtc 5499
Val Glu Ile Glu Val Lys Ala Ser Gly Val Asn Phe Arg Asp Val
1820 1825 1830
atg gtt tgt atg ggt cag att cca gat agt ttg cta ggc ttc gag 5544
Met Val Cys Met Gly Gln Ile Pro Asp Ser Leu Leu Gly Phe Glu
1835 1840 1845
gca gct gga ata gtt cgt cga gtt ggt gag aat gtt caa aac atc 5589
Ala Ala Gly Ile Val Arg Arg Val Gly Glu Asn Val Gln Asn Ile
1850 1855 1860
aaa gca ggt gat cga gtt tgt ttt atc gca cac ggt tct cat cga 5634
Lys Ala Gly Asp Arg Val Cys Phe Ile Ala His Gly Ser His Arg
1865 1870 1875
act gtc cat cgt gtg aga aat gag tat gtg gta cac atc cca gat 5679
Thr Val His Arg Val Arg Asn Glu Tyr Val Val His Ile Pro Asp
1880 1885 1890
gaa atg tcc ttc gca gag gct tct ggc gtg ctt ctt gtt cat ggc 5724
Glu Met Ser Phe Ala Glu Ala Ser Gly Val Leu Leu Val His Gly
1895 1900 1905
acg gcg tgg tat ggt ctg gtc aag att gcc cag atc aaa gca ggg 5769
Thr Ala Trp Tyr Gly Leu Val Lys Ile Ala Gln Ile Lys Ala Gly
1910 1915 1920
caa acg att ctc atc cat gcc gct gcg ggt ggt gtt gga caa gca 5814
Gln Thr Ile Leu Ile His Ala Ala Ala Gly Gly Val Gly Gln Ala
1925 1930 1935
gca gtg atg ttg gcc cag cat ttt ggt ctc gag ata ttt gca aca 5859
Ala Val Met Leu Ala Gln His Phe Gly Leu Glu Ile Phe Ala Thr
1940 1945 1950
gtt ggc tcc gat gac aaa agg caa ctc atc cag gac ctt tat aag 5904
Val Gly Ser Asp Asp Lys Arg Gln Leu Ile Gln Asp Leu Tyr Lys
1955 1960 1965
atc cca gaa gac cac att ttc aat tct cgt gac ctg agt ttt gcc 5949
Ile Pro Glu Asp His Ile Phe Asn Ser Arg Asp Leu Ser Phe Ala
1970 1975 1980
aag gga gtg ctg cgt atg aca aat ggt cgt ggt gtg gat gtt atc 5994
Lys Gly Val Leu Arg Met Thr Asn Gly Arg Gly Val Asp Val Ile
1985 1990 1995
ctt aat tct cta tct ggg gag act ctt cgc caa aca tgg cac tgc 6039
Leu Asn Ser Leu Ser Gly Glu Thr Leu Arg Gln Thr Trp His Cys
2000 2005 2010
gtc gct cca ttt gga aca ttc atc gaa atc ggt att aaa gat atc 6084
Val Ala Pro Phe Gly Thr Phe Ile Glu Ile Gly Ile Lys Asp Ile
2015 2020 2025
ctc agc aat acc cga cta gac atg cgc cct ttc ctt caa gat gcc 6129
Leu Ser Asn Thr Arg Leu Asp Met Arg Pro Phe Leu Gln Asp Ala
2030 2035 2040
cga ttt gcc ttt ttt aat ttg aac cgt atc gag aac gag cga cca 6174
Arg Phe Ala Phe Phe Asn Leu Asn Arg Ile Glu Asn Glu Arg Pro
2045 2050 2055
gac ttg atg agc gag gca tta aat gaa agt atg gct ttc atc agc 6219
Asp Leu Met Ser Glu Ala Leu Asn Glu Ser Met Ala Phe Ile Ser
2060 2065 2070
tcg ggt gct aca cga cct gtt tca ccc ctg atg aac ttc cct gtc 6264
Ser Gly Ala Thr Arg Pro Val Ser Pro Leu Met Asn Phe Pro Val
2075 2080 2085
tcg cag gta gaa gat gcc ttc cgt ctc atg cag acg ggc aag cac 6309
Ser Gln Val Glu Asp Ala Phe Arg Leu Met Gln Thr Gly Lys His
2090 2095 2100
cgg ggg aaa cta tcc ctg aca tac tca tct tct gac gta gta ccc 6354
Arg Gly Lys Leu Ser Leu Thr Tyr Ser Ser Ser Asp Val Val Pro
2105 2110 2115
att cag agc cga cct act cgc tct att cgt ctg gat gaa act agt 6399
Ile Gln Ser Arg Pro Thr Arg Ser Ile Arg Leu Asp Glu Thr Ser
2120 2125 2130
gcc tat gtt ctc gta ggt ggt ctt ggt ggg ctt ggg cgc agt ctt 6444
Ala Tyr Val Leu Val Gly Gly Leu Gly Gly Leu Gly Arg Ser Leu
2135 2140 2145
gca caa ctt ttt gtc cga ctc ggg tgc aag aaa cta tgc ttt ctt 6489
Ala Gln Leu Phe Val Arg Leu Gly Cys Lys Lys Leu Cys Phe Leu
2150 2155 2160
tct cga tca gga ggg gca agt gaa aag gca cag aag ctc ctc aaa 6534
Ser Arg Ser Gly Gly Ala Ser Glu Lys Ala Gln Lys Leu Leu Lys
2165 2170 2175
gac ctt cag cag caa ggg gtc aaa act ctt gct ctt aga tgc gac 6579
Asp Leu Gln Gln Gln Gly Val Lys Thr Leu Ala Leu Arg Cys Asp
2180 2185 2190
gtt tct gat gca cag tct gtc aaa gcg gct att aat gaa tgc gcg 6624
Val Ser Asp Ala Gln Ser Val Lys Ala Ala Ile Asn Glu Cys Ala
2195 2200 2205
act cgc ttg gga cct gtc ctg ggt gtg gta cag tgt gca atg gtg 6669
Thr Arg Leu Gly Pro Val Leu Gly Val Val Gln Cys Ala Met Val
2210 2215 2220
ctt cga gat ggc cta ttc gag aag atg acc cac caa cag tgg gtt 6714
Leu Arg Asp Gly Leu Phe Glu Lys Met Thr His Gln Gln Trp Val
2225 2230 2235
gag ggt act cgg ccc aag gtc cag ggg tct tgg aac cta cat gtg 6759
Glu Gly Thr Arg Pro Lys Val Gln Gly Ser Trp Asn Leu His Val
2240 2245 2250
aac cta cca aat gtt gat ttc ttt att att ctc agt tcc ttt gct 6804
Asn Leu Pro Asn Val Asp Phe Phe Ile Ile Leu Ser Ser Phe Ala
2255 2260 2265
gga att ttt gga agc cga ggc caa agc aac tat acc gca gcg gga 6849
Gly Ile Phe Gly Ser Arg Gly Gln Ser Asn Tyr Thr Ala Ala Gly
2270 2275 2280
gcg tat gag gat gcg ctt gca aat tat cga cga tcg ctg ggt ctc 6894
Ala Tyr Glu Asp Ala Leu Ala Asn Tyr Arg Arg Ser Leu Gly Leu
2285 2290 2295
aaa gcg gtg aca gtt gac ttg ggt att atg cgc gat gtg ggc gtt 6939
Lys Ala Val Thr Val Asp Leu Gly Ile Met Arg Asp Val Gly Val
2300 2305 2310
ctt gcc gag caa ggt ata aca gat tat ctg cga gag tgg gag gag 6984
Leu Ala Glu Gln Gly Ile Thr Asp Tyr Leu Arg Glu Trp Glu Glu
2315 2320 2325
cca tgc ggt att cga gaa gct gaa ttc cat gcg ctt atg gaa aat 7029
Pro Cys Gly Ile Arg Glu Ala Glu Phe His Ala Leu Met Glu Asn
2330 2335 2340
gtc ttg act agt gaa gtt ctt gga gat cag gag cct cta ccg gca 7074
Val Leu Thr Ser Glu Val Leu Gly Asp Gln Glu Pro Leu Pro Ala
2345 2350 2355
cac att ccg acg ggc ttt gct acc gca aag aca gtt caa caa ttt 7119
His Ile Pro Thr Gly Phe Ala Thr Ala Lys Thr Val Gln Gln Phe
2360 2365 2370
ggt atc acc acg cca ttt tac ttt gat gat cct cgg ttt tca att 7164
Gly Ile Thr Thr Pro Phe Tyr Phe Asp Asp Pro Arg Phe Ser Ile
2375 2380 2385
cta tcc gcc gcc ggc tct agt aag aca gga gct ggt gat agc acg 7209
Leu Ser Ala Ala Gly Ser Ser Lys Thr Gly Ala Gly Asp Ser Thr
2390 2395 2400
gat tct aac aag gcc atc tca gtg caa aat caa att gcg cag tct 7254
Asp Ser Asn Lys Ala Ile Ser Val Gln Asn Gln Ile Ala Gln Ser
2405 2410 2415
ata tct att tca gag gca gca tca gcc gtc acc aat gct ctt gtt 7299
Ile Ser Ile Ser Glu Ala Ala Ser Ala Val Thr Asn Ala Leu Val
2420 2425 2430
gca cgc gtg gcc aaa tcg ctt caa agc gct ttg tcc gac atc gac 7344
Ala Arg Val Ala Lys Ser Leu Gln Ser Ala Leu Ser Asp Ile Asp
2435 2440 2445
cca tcc cgg ccg ctg cat gcc ttc ggt gtg gat tct ctt gtc gcc 7389
Pro Ser Arg Pro Leu His Ala Phe Gly Val Asp Ser Leu Val Ala
2450 2455 2460
gtc gaa gtg gtg aac tgg gtg ttc aaa gaa atc aag gcc aaa gtt 7434
Val Glu Val Val Asn Trp Val Phe Lys Glu Ile Lys Ala Lys Val
2465 2470 2475
acc gta ttt gac gtt ctt tct agt att cct att aca tct ctt gcc 7479
Thr Val Phe Asp Val Leu Ser Ser Ile Pro Ile Thr Ser Leu Ala
2480 2485 2490
gag aag att gcg ctg aag tct agt ctt ttg ccg caa ttg act tga 7524
Glu Lys Ile Ala Leu Lys Ser Ser Leu Leu Pro Gln Leu Thr
2495 2500 2505
<210> 14
<211> 2507
<212> PRT
<213> Talaromyces islandicus
<400> 14
Met Ala Thr Thr Asn Glu Val Arg Trp Ala Gln Asp Ile Ala Ile Val
1 5 10 15
Gly Met Ser Cys Arg Phe Ala Asp Asp Ala Asp Ser Phe Pro Arg Phe
20 25 30
Trp Asp Phe Ile Cys Asn Gly Arg Tyr Ala Phe His Tyr Pro Gly Lys
35 40 45
Lys Thr Asn Thr Ser Leu Pro Arg Gly Ala His Phe Phe Lys Asp Asp
50 55 60
Ile Ala Glu Phe Asp Ala Asn Phe Phe Asn Ile Ser Lys Val Glu Ala
65 70 75 80
Glu Ser Ile Asp Pro Gln Gln Arg Met Val Met Glu Thr Thr Phe Glu
85 90 95
Ala Leu Glu Asn Ala Gly Ile Thr Ile Asp Lys Val Ala Gly Thr Arg
100 105 110
Ala Gly Val Trp Met Ala Asn Phe Thr Ser Asp Tyr Arg Glu Met Leu
115 120 125
Tyr Arg Asp Ser Glu Thr Ala Pro Met Tyr Thr Leu Ser Gly Ala Ser
130 135 140
Asn Thr Ser Thr Ser Asn Arg Val Ser Trp Phe Phe Asp Leu Lys Gly
145 150 155 160
Pro Ser Phe Thr Leu Asn Thr Ala Cys Ser Ser Ser Met Val Ala Thr
165 170 175
His Leu Ala Cys Gln Ser Leu Ala Leu Gly Glu Ser Ser Ser Ala Ile
180 185 190
Val Gly Gly Thr Ser Leu Leu Leu Asn Pro Asp Leu Phe Leu Phe Leu
195 200 205
Ser Asn Gln His Phe Leu Ala Ala Asp Gly Lys Ser Lys Ala Phe Asp
210 215 220
Ala Ser Gly Asp Gly Tyr Gly Arg Gly Glu Gly Val Ala Val Val Val
225 230 235 240
Leu Lys Arg Val Ala Asp Ala Ile Ala Asp Gly Asp Pro Ile Arg Ala
245 250 255
Val Ile Arg Gly Thr Ala Ile Asn Gln Asp Gly Arg Thr Lys Gly Met
260 265 270
Thr Leu Pro Ser Val Asp Ala Gln Glu Gln Leu Ile Lys Asp Ala Tyr
275 280 285
Arg Asn Ala Gly Leu Ser Met Lys Asp Thr Arg Tyr Val Glu Ala His
290 295 300
Gly Thr Gly Thr Gln Ala Gly Asp Lys Cys Glu Thr Glu Ala Leu Ser
305 310 315 320
Arg Thr Phe Ser Pro Tyr Arg Thr Ala Ser Glu Arg Leu Ile Leu Gly
325 330 335
Ser Val Lys Thr Asn Ile Gly His Leu Glu Ala Cys Ala Gly Leu Ala
340 345 350
Ser Met Ile Lys Cys Val Gly Ile Leu Glu Ala Gly Val Ile Pro Pro
355 360 365
Asn Pro Leu Tyr Lys Lys Gly Asn Pro Gly Ile Lys Phe Asp Asp Trp
370 375 380
Lys Leu His Val Pro Thr Ser Ser Ile Gln Trp Pro Thr Ser Gly Leu
385 390 395 400
Arg Arg Ile Ser Thr Gln Gly Phe Gly Tyr Gly Gly Thr Asn Ala His
405 410 415
Ile Ile Met Asp Asp Ala His Asn Tyr Leu Val Ser Arg Asp Ile Thr
420 425 430
Ala Ile His Asn Thr Cys Leu Leu Asn Leu Thr Asn Gly Thr Thr Tyr
435 440 445
Ile Glu His Lys Glu Ala Pro Arg Pro Arg Ile Phe His Phe Ser Ala
450 455 460
Gln Asp Lys Asp Gly Leu Gly Arg Val Arg Asp Ala Thr Cys Gln Tyr
465 470 475 480
Leu Lys Ser Gly Ala Leu Glu Ala Gly Lys Met Arg Gln Asn Glu Asp
485 490 495
Lys Tyr Leu Arg Asp Leu Ala Tyr Thr Leu Ser Glu Arg Arg Ser Arg
500 505 510
Leu Gln Trp Gln Thr Phe Ala Val Ala Ser Ser Val Glu Gly Leu Ile
515 520 525
Glu Thr Leu Gln Thr Lys Pro Trp Ala Ser Pro Glu Thr Arg Ser Ala
530 535 540
Ser Lys Val Pro Arg Ile Gly Phe Ile Phe Thr Gly Gln Gly Ala Gln
545 550 555 560
Trp Pro Arg Met Gly Ile Glu Leu Met Glu Tyr Asp Ile Phe Arg Lys
565 570 575
Ser Val Glu Arg Ser Asp Val Tyr Leu Arg Glu Gly Leu Asp Cys Ser
580 585 590
Trp Ser Ala Ile Glu Glu Leu Ala Lys Pro Asp Ser Ser Ser Asn Leu
595 600 605
Gly Ala Ala Glu Tyr Ser Gln Ala Leu Cys Ser Val Leu Gln Ile Ala
610 615 620
Leu Ile Asp Leu Leu Asp Ser Trp Asn Ile Arg Pro Ser Ala Val Ala
625 630 635 640
Gly His Ser Ser Gly Glu Ile Ala Ala Ala Tyr Cys Leu Gly Val Leu
645 650 655
Ser Trp Glu Asp Ala Leu Lys Val Ala Tyr Phe Arg Gly Ser Leu Ser
660 665 670
Ala Glu Met Lys Gly Asn Asp Ser Ser Leu Asn Gly Ala Met Met Ala
675 680 685
Val Gly Ser Ser Pro Ala Asp Ile Glu Lys Trp Leu Asp Lys Val Thr
690 695 700
Ala Gly Glu Val Val Val Ala Cys Val Asn Ser Pro Ala Ser Ile Thr
705 710 715 720
Leu Ser Gly Asp Ala Ala Gly Ile Asn Glu Leu Glu Ser Met Leu Lys
725 730 735
Glu Ala Gly Ile Phe Ala Arg Lys Leu Lys Val Asp Thr Ala Tyr His
740 745 750
Ser Pro His Met Gln Thr Ile Ala Gly Gln Tyr Phe Glu Ala Ile Ala
755 760 765
Asp Ile Ser Ile Leu Pro Val Arg Asn Gly Cys Gln Met His Ser Ser
770 775 780
Val Arg Gly Gly Tyr Ile Asp Pro Asn Glu Leu Gly Ala Ala Asn Trp
785 790 795 800
Val Arg Asn Leu Val Ser Thr Val Gln Phe Ala Asp Ala Val His Asp
805 810 815
Leu Leu Arg Pro Leu Val Tyr Gly Glu Arg Ala Ala His Asn Ala Val
820 825 830
Asp Ile Leu Val Glu Val Gly Pro His Ser Ala Leu Gln Gly Pro Val
835 840 845
Asn Gln Thr Met Lys Ala His Gly Ile Asn Ser Ile Asn Tyr Cys Thr
850 855 860
Met Leu Ser Arg Gly Lys Asn Ala Ile Asn Thr Ala Leu Ser Cys Ala
865 870 875 880
Ala Thr Leu Tyr Val Glu Gly Leu Ala Val Asp Leu Arg Arg Ala Asn
885 890 895
Gln Asp Glu Ser Phe Ala Val Glu Pro Ile Phe Asp Met Pro Ser Tyr
900 905 910
Pro Trp Asn His Ser Ile Arg Tyr Trp Ala Glu Ser Arg Val Glu Lys
915 920 925
Glu Tyr Arg Gln Arg Lys Tyr Pro Arg Thr Pro Leu Leu Gly Ala Pro
930 935 940
Cys Pro Ser Met Asn Ala Gly Glu Lys Val Trp Arg Gly Phe Ile Arg
945 950 955 960
Pro Ser Glu Glu Pro Trp Val Arg Asp His Val Ile Gln Gly Ser Ile
965 970 975
Leu Tyr Pro Ala Ala Gly Phe Leu Ala Met Ala Ile Glu Ala Ala Arg
980 985 990
Gln Gly Thr Glu Thr Gly Arg Ser Ile Asp Gly Phe Arg Leu Arg Asp
995 1000 1005
Val Gln Ile Asn Ala Ala Leu Val Ile Glu Glu Asn Val Glu Pro
1010 1015 1020
Glu Val Ile Leu Arg Leu Gln Pro His Arg Met Gly Thr Leu Asp
1025 1030 1035
Ala Gly Ser Val Ser Trp Gln Glu Phe Thr Val Ser Ser Ser Thr
1040 1045 1050
Asp Gly Thr Asp Leu Arg Gln Asn Cys Ser Gly Leu Leu Ala Ile
1055 1060 1065
Asp Tyr Glu Pro Ala Glu Gly Ser Ser Met His Ile Glu Lys Ile
1070 1075 1080
Lys Glu Val Glu Thr Ile Lys Gly Lys Leu Val Lys Ala Lys Glu
1085 1090 1095
Gln Cys Arg Ala Ala Ile Asn Val Asp Glu Phe Tyr Ala His Leu
1100 1105 1110
Asp Thr Val Gly Leu Thr Tyr Gly Glu Thr Phe Ala Asn Leu Thr
1115 1120 1125
Glu Val His Thr Asn Ala Ala Thr Gly Glu Cys Thr Gly Arg Leu
1130 1135 1140
Leu Val Pro Asp Val Glu Ser Ala Ile Pro Pro His Met Arg Glu
1145 1150 1155
Arg Pro His Ile Ile His Pro Thr Thr Leu Asp Ala Ile Phe His
1160 1165 1170
Leu Ala Phe Ala Ala Ile Ser Glu His Pro Phe Ser Leu Lys Ser
1175 1180 1185
Ala Met Val Pro Ile Ser Ile Thr Glu Val Val Ile Ser Asn Glu
1190 1195 1200
Val Pro His Arg Lys Gly Ser Gln Leu Glu Gly Phe Ala Gln Ser
1205 1210 1215
Ser Arg Phe Gly Phe Arg Glu Leu Val Thr Asn Ile Asn Ile Phe
1220 1225 1230
Asp Glu Gln Leu Thr Asp Ala Val Val Lys Ile Ser Gly Phe Arg
1235 1240 1245
Cys Ala Asp Val Ser Gly Ser Ser Gln Ser Thr Ser Ser Gly Glu
1250 1255 1260
Ala Ala Lys Pro Ile Thr Phe Lys Glu Ile His Arg Pro Ala Leu
1265 1270 1275
Glu Leu Leu Asp Tyr Glu Asp Leu Gln Arg Ala Val Asn Ala Asn
1280 1285 1290
Ala Asp Glu Ile Ala Ser Gly Ile Phe Glu Gln Asp Thr Ser Leu
1295 1300 1305
Asp Lys Ser Ala Leu Ala Ile Val Lys Arg Thr Leu Ser Asn Val
1310 1315 1320
Pro Arg Ser Ser Val His Lys Asp Leu Leu Gly Phe Tyr Asp Trp
1325 1330 1335
Met Gln Arg Gln Val Ser Ser Ala Asp Lys Ala Ser Gly Ala Gly
1340 1345 1350
Gln Arg Asp Ser Thr Gly Tyr Thr Asn Ile Ser Val Lys Asp Leu
1355 1360 1365
Glu Gly Ile Leu Ser Gly Glu Lys Ile Ala Ala Gln Ala Met Asp
1370 1375 1380
Glu Asn Val Ile Leu Met Pro Ala Leu Thr Ser Ser Ala Asn Phe
1385 1390 1395
Gln Gln Ile Met Lys Lys Leu Ser Gln Tyr Leu Leu Ile Leu Gln
1400 1405 1410
His Thr Tyr Pro Glu Leu Ser Val Leu Glu Ile Ile His Ser Ala
1415 1420 1425
Glu Asn Ser Thr Thr Gly Ser Ile Leu Pro Gln Leu Gln Ser Ala
1430 1435 1440
Glu Val Ile Leu Asp Thr Ser Lys Tyr Thr Val Leu Val Gln Asn
1445 1450 1455
Glu Lys Ala Ala Lys Thr Val Glu Ser Gln Leu Gly Thr Leu Thr
1460 1465 1470
Asp Leu Ile Ser Leu Glu Val Ser Ala Thr Asp Asn Ser Val Gln
1475 1480 1485
Asp His Gly Arg Gln Tyr Asp Leu Ala Leu Val Val Asn Ile Ala
1490 1495 1500
His Lys Asp Pro Asp Val Leu Leu Cys Glu Ala Lys Ser Ser Leu
1505 1510 1515
Lys Glu Gly Gly Arg Val Cys Ile Ile Glu Ile Gly Glu Pro Leu
1520 1525 1530
Leu Asn Leu Gly Ile Gly Leu Ala Ala Leu Gln His Thr His Phe
1535 1540 1545
Ile Ile Ser Ser Gln Asn Thr Asp Glu Ser His Leu Asn Arg Ala
1550 1555 1560
Gly Phe Thr Lys Glu Leu Leu Leu Gly Asp Ala Leu Pro Pro Lys
1565 1570 1575
Asn Glu Phe Arg Leu Ile Ala Gly Asn Thr Ser Lys Arg Leu Ala
1580 1585 1590
Val Thr Ile Gln Gly Glu Ile Val Ile Val Gln Ala Pro Glu Pro
1595 1600 1605
Ser Lys Ser Ala Gln Asn Val Ala Asp Ala Leu Thr Glu Val Leu
1610 1615 1620
Glu Lys Gln Cys Val Arg Ala Ile Arg Val Asp Trp Ser Leu Pro
1625 1630 1635
Glu Tyr Ile Ser Val Ile Glu Gly Lys Glu Cys Ile Val Leu Ala
1640 1645 1650
Asp Leu Glu Lys Ser His Leu Leu Glu Ala Ser Gln Glu Glu Phe
1655 1660 1665
Pro Ile Ile Gln Gln Thr Ile Leu Lys Ala Gly Gly Ile Leu Trp
1670 1675 1680
Val Ser Gly Ser Ile Gly Pro Asp Ala Ala Leu Val Thr Gly Leu
1685 1690 1695
Ala Arg Thr Ile Arg Asn Glu Ile Pro Gly Ser Lys Leu Arg Val
1700 1705 1710
Leu Gln Thr Asn Glu Leu Ser Leu Ala Ser Pro Thr Thr Trp Ser
1715 1720 1725
Asn Tyr Ile Leu Arg Leu Leu Gln Ser Pro Thr Leu Asp Ser Glu
1730 1735 1740
Phe Thr Ile Lys Asp Gly Phe Leu Gln Ile Ser Arg Val Val Glu
1745 1750 1755
Tyr Tyr Thr Arg Asn Asp Ala Leu Ala Val Ser Leu Gly Arg Gln
1760 1765 1770
Glu Pro Lys Thr Val His Met Pro Leu Ser Glu Thr Ser Ser Pro
1775 1780 1785
Val Lys Leu Cys Ile Lys Asn Pro Gly Met Leu Asp Ser Leu Tyr
1790 1795 1800
Phe Glu Pro Asp Asp Ile Leu Asn Ser Pro Leu Ala Ser Gly Gln
1805 1810 1815
Val Glu Ile Glu Val Lys Ala Ser Gly Val Asn Phe Arg Asp Val
1820 1825 1830
Met Val Cys Met Gly Gln Ile Pro Asp Ser Leu Leu Gly Phe Glu
1835 1840 1845
Ala Ala Gly Ile Val Arg Arg Val Gly Glu Asn Val Gln Asn Ile
1850 1855 1860
Lys Ala Gly Asp Arg Val Cys Phe Ile Ala His Gly Ser His Arg
1865 1870 1875
Thr Val His Arg Val Arg Asn Glu Tyr Val Val His Ile Pro Asp
1880 1885 1890
Glu Met Ser Phe Ala Glu Ala Ser Gly Val Leu Leu Val His Gly
1895 1900 1905
Thr Ala Trp Tyr Gly Leu Val Lys Ile Ala Gln Ile Lys Ala Gly
1910 1915 1920
Gln Thr Ile Leu Ile His Ala Ala Ala Gly Gly Val Gly Gln Ala
1925 1930 1935
Ala Val Met Leu Ala Gln His Phe Gly Leu Glu Ile Phe Ala Thr
1940 1945 1950
Val Gly Ser Asp Asp Lys Arg Gln Leu Ile Gln Asp Leu Tyr Lys
1955 1960 1965
Ile Pro Glu Asp His Ile Phe Asn Ser Arg Asp Leu Ser Phe Ala
1970 1975 1980
Lys Gly Val Leu Arg Met Thr Asn Gly Arg Gly Val Asp Val Ile
1985 1990 1995
Leu Asn Ser Leu Ser Gly Glu Thr Leu Arg Gln Thr Trp His Cys
2000 2005 2010
Val Ala Pro Phe Gly Thr Phe Ile Glu Ile Gly Ile Lys Asp Ile
2015 2020 2025
Leu Ser Asn Thr Arg Leu Asp Met Arg Pro Phe Leu Gln Asp Ala
2030 2035 2040
Arg Phe Ala Phe Phe Asn Leu Asn Arg Ile Glu Asn Glu Arg Pro
2045 2050 2055
Asp Leu Met Ser Glu Ala Leu Asn Glu Ser Met Ala Phe Ile Ser
2060 2065 2070
Ser Gly Ala Thr Arg Pro Val Ser Pro Leu Met Asn Phe Pro Val
2075 2080 2085
Ser Gln Val Glu Asp Ala Phe Arg Leu Met Gln Thr Gly Lys His
2090 2095 2100
Arg Gly Lys Leu Ser Leu Thr Tyr Ser Ser Ser Asp Val Val Pro
2105 2110 2115
Ile Gln Ser Arg Pro Thr Arg Ser Ile Arg Leu Asp Glu Thr Ser
2120 2125 2130
Ala Tyr Val Leu Val Gly Gly Leu Gly Gly Leu Gly Arg Ser Leu
2135 2140 2145
Ala Gln Leu Phe Val Arg Leu Gly Cys Lys Lys Leu Cys Phe Leu
2150 2155 2160
Ser Arg Ser Gly Gly Ala Ser Glu Lys Ala Gln Lys Leu Leu Lys
2165 2170 2175
Asp Leu Gln Gln Gln Gly Val Lys Thr Leu Ala Leu Arg Cys Asp
2180 2185 2190
Val Ser Asp Ala Gln Ser Val Lys Ala Ala Ile Asn Glu Cys Ala
2195 2200 2205
Thr Arg Leu Gly Pro Val Leu Gly Val Val Gln Cys Ala Met Val
2210 2215 2220
Leu Arg Asp Gly Leu Phe Glu Lys Met Thr His Gln Gln Trp Val
2225 2230 2235
Glu Gly Thr Arg Pro Lys Val Gln Gly Ser Trp Asn Leu His Val
2240 2245 2250
Asn Leu Pro Asn Val Asp Phe Phe Ile Ile Leu Ser Ser Phe Ala
2255 2260 2265
Gly Ile Phe Gly Ser Arg Gly Gln Ser Asn Tyr Thr Ala Ala Gly
2270 2275 2280
Ala Tyr Glu Asp Ala Leu Ala Asn Tyr Arg Arg Ser Leu Gly Leu
2285 2290 2295
Lys Ala Val Thr Val Asp Leu Gly Ile Met Arg Asp Val Gly Val
2300 2305 2310
Leu Ala Glu Gln Gly Ile Thr Asp Tyr Leu Arg Glu Trp Glu Glu
2315 2320 2325
Pro Cys Gly Ile Arg Glu Ala Glu Phe His Ala Leu Met Glu Asn
2330 2335 2340
Val Leu Thr Ser Glu Val Leu Gly Asp Gln Glu Pro Leu Pro Ala
2345 2350 2355
His Ile Pro Thr Gly Phe Ala Thr Ala Lys Thr Val Gln Gln Phe
2360 2365 2370
Gly Ile Thr Thr Pro Phe Tyr Phe Asp Asp Pro Arg Phe Ser Ile
2375 2380 2385
Leu Ser Ala Ala Gly Ser Ser Lys Thr Gly Ala Gly Asp Ser Thr
2390 2395 2400
Asp Ser Asn Lys Ala Ile Ser Val Gln Asn Gln Ile Ala Gln Ser
2405 2410 2415
Ile Ser Ile Ser Glu Ala Ala Ser Ala Val Thr Asn Ala Leu Val
2420 2425 2430
Ala Arg Val Ala Lys Ser Leu Gln Ser Ala Leu Ser Asp Ile Asp
2435 2440 2445
Pro Ser Arg Pro Leu His Ala Phe Gly Val Asp Ser Leu Val Ala
2450 2455 2460
Val Glu Val Val Asn Trp Val Phe Lys Glu Ile Lys Ala Lys Val
2465 2470 2475
Thr Val Phe Asp Val Leu Ser Ser Ile Pro Ile Thr Ser Leu Ala
2480 2485 2490
Glu Lys Ile Ala Leu Lys Ser Ser Leu Leu Pro Gln Leu Thr
2495 2500 2505
<210> 15
<211> 7809
<212> DNA
<213> Talaromyces islandicus
<220>
<221> CDS
<222> (1)..(7809)
<400> 15
atg gct ttg gat ttc gac tac atc att gtc ggc ggg ggc act gca gga 48
Met Ala Leu Asp Phe Asp Tyr Ile Ile Val Gly Gly Gly Thr Ala Gly
1 5 10 15
tgt gtt ctc gca agc cgc ctt tct gaa tac cta ccg gac gcc tct att 96
Cys Val Leu Ala Ser Arg Leu Ser Glu Tyr Leu Pro Asp Ala Ser Ile
20 25 30
cta ttg atc gaa gcc ggt atc gag cat gac cct cgc gtg aaa cca acc 144
Leu Leu Ile Glu Ala Gly Ile Glu His Asp Pro Arg Val Lys Pro Thr
35 40 45
ctt ggg ttg act ggc caa gca gcg aac gaa att aaa tgg aac ata cag 192
Leu Gly Leu Thr Gly Gln Ala Ala Asn Glu Ile Lys Trp Asn Ile Gln
50 55 60
agt gct cct caa tct gct gtt ggc aac aag act atc gat cta gtg cag 240
Ser Ala Pro Gln Ser Ala Val Gly Asn Lys Thr Ile Asp Leu Val Gln
65 70 75 80
ggt aaa gtg ctc ggg ggc acc tcc ggt att aac cac caa gta tgg tcc 288
Gly Lys Val Leu Gly Gly Thr Ser Gly Ile Asn His Gln Val Trp Ser
85 90 95
cgc ggt gca gct gga gac ttc aat cgc tgg gca gca gaa gtt ggc gac 336
Arg Gly Ala Ala Gly Asp Phe Asn Arg Trp Ala Ala Glu Val Gly Asp
100 105 110
ccg cga tgg tca tgg aat gga cag ctc ccc ttc ttc aag aac acc gag 384
Pro Arg Trp Ser Trp Asn Gly Gln Leu Pro Phe Phe Lys Asn Thr Glu
115 120 125
aca ttc cat cca ggg gct gac cta cag ggc aaa gat tta agc gcc ctt 432
Thr Phe His Pro Gly Ala Asp Leu Gln Gly Lys Asp Leu Ser Ala Leu
130 135 140
cat ggc ttc gat ggt cct atc aag gtg tct caa act tca tcc tgt gga 480
His Gly Phe Asp Gly Pro Ile Lys Val Ser Gln Thr Ser Ser Cys Gly
145 150 155 160
cgc ccg cgc aac tac cca ctg aaa gga gcc att gct tcc atg tac aag 528
Arg Pro Arg Asn Tyr Pro Leu Lys Gly Ala Ile Ala Ser Met Tyr Lys
165 170 175
agt gcc ggc gta tcc caa ggt gaa gat ttg aat tct gga aat att ctt 576
Ser Ala Gly Val Ser Gln Gly Glu Asp Leu Asn Ser Gly Asn Ile Leu
180 185 190
ggc ttc agt gaa gca acg gcc ggg tcc tac gac ggt atc cgg caa tgg 624
Gly Phe Ser Glu Ala Thr Ala Gly Ser Tyr Asp Gly Ile Arg Gln Trp
195 200 205
gcg gga gga aac tac aaa ttt ggt ccc aac gtg act ttg tgg acg gaa 672
Ala Gly Gly Asn Tyr Lys Phe Gly Pro Asn Val Thr Leu Trp Thr Glu
210 215 220
acc cat gta tca aaa atc atc tca cag ggt tct cga gcc acg gga gtc 720
Thr His Val Ser Lys Ile Ile Ser Gln Gly Ser Arg Ala Thr Gly Val
225 230 235 240
gag tac ttg cgg cct gac aga agc act agt tcc tca gta tca gct aaa 768
Glu Tyr Leu Arg Pro Asp Arg Ser Thr Ser Ser Ser Val Ser Ala Lys
245 250 255
aaa gaa gtc atc gtc tca agc ggt gct cag ggc tca ccc aag cta cta 816
Lys Glu Val Ile Val Ser Ser Gly Ala Gln Gly Ser Pro Lys Leu Leu
260 265 270
ctg tta agt gga att gga ccc tcg gca gag cta caa aag cat agc att 864
Leu Leu Ser Gly Ile Gly Pro Ser Ala Glu Leu Gln Lys His Ser Ile
275 280 285
cag caa gta gtc gaa ctc cct gtg ggg gaa aac tac agc gac cac ccc 912
Gln Gln Val Val Glu Leu Pro Val Gly Glu Asn Tyr Ser Asp His Pro
290 295 300
atg atg gca aca tac tgg aac cta gaa aag cgc ggt ctg gct ctt ggt 960
Met Met Ala Thr Tyr Trp Asn Leu Glu Lys Arg Gly Leu Ala Leu Gly
305 310 315 320
gat gtt gaa atg cgt tca gct gaa tgc gat tgg act tct ggg ttg ccg 1008
Asp Val Glu Met Arg Ser Ala Glu Cys Asp Trp Thr Ser Gly Leu Pro
325 330 335
gtt gac tgg ttg gca ttc cac cgt cac gat caa gac cca acc att gct 1056
Val Asp Trp Leu Ala Phe His Arg His Asp Gln Asp Pro Thr Ile Ala
340 345 350
gca ttg gct gag agc cag tta agc tca aat gaa ttg gaa cgc ttt cag 1104
Ala Leu Ala Glu Ser Gln Leu Ser Ser Asn Glu Leu Glu Arg Phe Gln
355 360 365
gag caa aat agg gct cac act gag tct gtg gtc tta tac ggt cat att 1152
Glu Gln Asn Arg Ala His Thr Glu Ser Val Val Leu Tyr Gly His Ile
370 375 380
gac ttc tcg ggc aag gcg ggc cct cca cct cca gga tct aac gtg tgt 1200
Asp Phe Ser Gly Lys Ala Gly Pro Pro Pro Pro Gly Ser Asn Val Cys
385 390 395 400
gta atg aac att cta gtc act cca tcg tct cgg gga aca gtg acg ctc 1248
Val Met Asn Ile Leu Val Thr Pro Ser Ser Arg Gly Thr Val Thr Leu
405 410 415
aaa tcc acc aat cca ttc gat gca cct gtg tgt gac ccg aac atg tta 1296
Lys Ser Thr Asn Pro Phe Asp Ala Pro Val Cys Asp Pro Asn Met Leu
420 425 430
tcc aac gaa ctc gat aag caa ctt ctt tgg tct gtg acc cgt ttg aca 1344
Ser Asn Glu Leu Asp Lys Gln Leu Leu Trp Ser Val Thr Arg Leu Thr
435 440 445
agc caa ggt ctt gag cga act att tct cca gag tac gga ctt tct gag 1392
Ser Gln Gly Leu Glu Arg Thr Ile Ser Pro Glu Tyr Gly Leu Ser Glu
450 455 460
tat gcc att gat gac gat tta cgc ggt gac tac ggc gat gag gcc atg 1440
Tyr Ala Ile Asp Asp Asp Leu Arg Gly Asp Tyr Gly Asp Glu Ala Met
465 470 475 480
atg cga cgt gct gtc cga att gtt cgc acc gtg aat cat gga agt ggt 1488
Met Arg Arg Ala Val Arg Ile Val Arg Thr Val Asn His Gly Ser Gly
485 490 495
aca tgc tca atg ggc act gtc gtt gac aca gag tgt cga gta aag ggc 1536
Thr Cys Ser Met Gly Thr Val Val Asp Thr Glu Cys Arg Val Lys Gly
500 505 510
gtt gag ggc ttg cga gta gtc gac tcc agc gtc att cct ctt cca ctc 1584
Val Glu Gly Leu Arg Val Val Asp Ser Ser Val Ile Pro Leu Pro Leu
515 520 525
tgc gcg cac tac cag gcg tct gtg tac gca ttg gcg gaa cag gat cag 1632
Cys Ala His Tyr Gln Ala Ser Val Tyr Ala Leu Ala Glu Gln Asp Gln
530 535 540
aca gag cag ttc ctc ctc cta tat gga gat cag acg gta gag aag ctg 1680
Thr Glu Gln Phe Leu Leu Leu Tyr Gly Asp Gln Thr Val Glu Lys Leu
545 550 555 560
cct gct gtt cgg gct ctt gta gaa cat gcc cag agg tcg ccg gct ggt 1728
Pro Ala Val Arg Ala Leu Val Glu His Ala Gln Arg Ser Pro Ala Gly
565 570 575
cgt cgt ttt ctc cgc gat gcg tgc gat atc att caa ata gaa ata ttc 1776
Arg Arg Phe Leu Arg Asp Ala Cys Asp Ile Ile Gln Ile Glu Ile Phe
580 585 590
agc ctt gat aca gat gag aga gct cac gtc ggg cat ttt gat act ctg 1824
Ser Leu Asp Thr Asp Glu Arg Ala His Val Gly His Phe Asp Thr Leu
595 600 605
ttg cag ctg gca gaa agt aat gcc cag gct gac cag ccc agt gag atc 1872
Leu Gln Leu Ala Glu Ser Asn Ala Gln Ala Asp Gln Pro Ser Glu Ile
610 615 620
gtg gct aca att ctc atg aac gtg acc cga ctg gga gag ttt att ctt 1920
Val Ala Thr Ile Leu Met Asn Val Thr Arg Leu Gly Glu Phe Ile Leu
625 630 635 640
tac gca gaa gaa cac cca aac gtc tta ggc tct ata gag caa ccg aca 1968
Tyr Ala Glu Glu His Pro Asn Val Leu Gly Ser Ile Glu Gln Pro Thr
645 650 655
cat att gtt gct ttt tgc aca gga gaa att ccg gca gct gtg gca gca 2016
His Ile Val Ala Phe Cys Thr Gly Glu Ile Pro Ala Ala Val Ala Ala
660 665 670
gcc gcg cgc gat agc atc gaa tta tat aat ttg tca atc gag aca gtc 2064
Ala Ala Arg Asp Ser Ile Glu Leu Tyr Asn Leu Ser Ile Glu Thr Val
675 680 685
cgc ata ata tgt cgt ttt gcg cgg aac atc atc cgt cgg tcg gtc cta 2112
Arg Ile Ile Cys Arg Phe Ala Arg Asn Ile Ile Arg Arg Ser Val Leu
690 695 700
gta gac agg act aat ggt agc tgg gcg acc acc atc gtt ggt gtt tcg 2160
Val Asp Arg Thr Asn Gly Ser Trp Ala Thr Thr Ile Val Gly Val Ser
705 710 715 720
ccc gga cga gtt cag acc ata ctt gac act ttt cac cag agt cag aat 2208
Pro Gly Arg Val Gln Thr Ile Leu Asp Thr Phe His Gln Ser Gln Asn
725 730 735
att gct cct aca aga caa atc aac atc ggc atc atg gca gca ggt tgg 2256
Ile Ala Pro Thr Arg Gln Ile Asn Ile Gly Ile Met Ala Ala Gly Trp
740 745 750
cta aca ctt ttc ggg cct cct atc act aca gaa caa ctt ttt aac tgg 2304
Leu Thr Leu Phe Gly Pro Pro Ile Thr Thr Glu Gln Leu Phe Asn Trp
755 760 765
tct aag gag ctt gat ggg gca tct cgc atc aag aca gat gct ggc ggt 2352
Ser Lys Glu Leu Asp Gly Ala Ser Arg Ile Lys Thr Asp Ala Gly Gly
770 775 780
ggt gtt cat ctt ccc aat ctc ccc gag cta gac ttg gat gag gtc gta 2400
Gly Val His Leu Pro Asn Leu Pro Glu Leu Asp Leu Asp Glu Val Val
785 790 795 800
gga tta tca ccg ctt tta gat gtc cct att acc ccc aag gcg agg ctt 2448
Gly Leu Ser Pro Leu Leu Asp Val Pro Ile Thr Pro Lys Ala Arg Leu
805 810 815
tgg tca ccg tac agt tgc gag att cgc aat gca gca aca ctc aga gat 2496
Trp Ser Pro Tyr Ser Cys Glu Ile Arg Asn Ala Ala Thr Leu Arg Asp
820 825 830
tta atc cgt cag gtc atc cca gat atc acc caa tac tca cta cga tta 2544
Leu Ile Arg Gln Val Ile Pro Asp Ile Thr Gln Tyr Ser Leu Arg Leu
835 840 845
agt gat acc ata gag acg gcc gtt aaa ggg cta agc aat gga tca gtc 2592
Ser Asp Thr Ile Glu Thr Ala Val Lys Gly Leu Ser Asn Gly Ser Val
850 855 860
aag gtt gtc tgc gtt ggt tat act gct cac ctg ata tct ctg cag aaa 2640
Lys Val Val Cys Val Gly Tyr Thr Ala His Leu Ile Ser Leu Gln Lys
865 870 875 880
tca ctg cag cgc gaa aga cgc gag gcc act gtc tta caa cat tcc agc 2688
Ser Leu Gln Arg Glu Arg Arg Glu Ala Thr Val Leu Gln His Ser Ser
885 890 895
gca ggt tca aca ttt ttc aca tcg ccg cgc gga ggc tcg gag tcc att 2736
Ala Gly Ser Thr Phe Phe Thr Ser Pro Arg Gly Gly Ser Glu Ser Ile
900 905 910
gct att gta gga atg tct gga aga ttt cct ggt agc gac aat ata caa 2784
Ala Ile Val Gly Met Ser Gly Arg Phe Pro Gly Ser Asp Asn Ile Gln
915 920 925
gag tat tgg caa tcc ctg ttg gat gga gaa agg cat att aaa gag atc 2832
Glu Tyr Trp Gln Ser Leu Leu Asp Gly Glu Arg His Ile Lys Glu Ile
930 935 940
cct aaa aac cgg ttc gac ttg agc aaa tgg tac gat gag acc gga aaa 2880
Pro Lys Asn Arg Phe Asp Leu Ser Lys Trp Tyr Asp Glu Thr Gly Lys
945 950 955 960
cag aaa aac gcc acg atg aat cgc tcg ggc gcg ttt tta gat cga ccc 2928
Gln Lys Asn Ala Thr Met Asn Arg Ser Gly Ala Phe Leu Asp Arg Pro
965 970 975
ggt tac ttt gac aac cgg ttg ttc aat atg tca ccc cgg gaa gcc ctt 2976
Gly Tyr Phe Asp Asn Arg Leu Phe Asn Met Ser Pro Arg Glu Ala Leu
980 985 990
cag acc gat cct ctt cat cgt atg ttc ctc acc gtg agc tat gag gct 3024
Gln Thr Asp Pro Leu His Arg Met Phe Leu Thr Val Ser Tyr Glu Ala
995 1000 1005
ctc gag atg gca ggc tat tct cca gag gca aca ttg gca aca aac 3069
Leu Glu Met Ala Gly Tyr Ser Pro Glu Ala Thr Leu Ala Thr Asn
1010 1015 1020
agt aac cgc atc gca acc tat ttt ggt caa aca tca gat gac tgg 3114
Ser Asn Arg Ile Ala Thr Tyr Phe Gly Gln Thr Ser Asp Asp Trp
1025 1030 1035
aga gac att gtg ctt acc cag ggc gtg gat ata tac tac gct ccg 3159
Arg Asp Ile Val Leu Thr Gln Gly Val Asp Ile Tyr Tyr Ala Pro
1040 1045 1050
ggt att tgc cgt gcc ttt gca cct ggt cgt ctc aac tat cac ttt 3204
Gly Ile Cys Arg Ala Phe Ala Pro Gly Arg Leu Asn Tyr His Phe
1055 1060 1065
aag tgg gga ggg cca tcg tat agt gtt gat gca gct tgc gca tcg 3249
Lys Trp Gly Gly Pro Ser Tyr Ser Val Asp Ala Ala Cys Ala Ser
1070 1075 1080
agc atc gcc aca att tcc ctg gct tgt tct gcc ttg ctg gct cgc 3294
Ser Ile Ala Thr Ile Ser Leu Ala Cys Ser Ala Leu Leu Ala Arg
1085 1090 1095
gaa tgc gac acc gct ctc gca ggt gga ggc tcc att ctt gac tct 3339
Glu Cys Asp Thr Ala Leu Ala Gly Gly Gly Ser Ile Leu Asp Ser
1100 1105 1110
cct gca cca ttt gct ggt tta agc cga ggt ggc ttt ctc tcc ccg 3384
Pro Ala Pro Phe Ala Gly Leu Ser Arg Gly Gly Phe Leu Ser Pro
1115 1120 1125
gag aaa ggt tgt gaa act ttc cat gac gat gct gat ggt tac gtg 3429
Glu Lys Gly Cys Glu Thr Phe His Asp Asp Ala Asp Gly Tyr Val
1130 1135 1140
cgt ggc gaa ggc gtg ggt gtc gtt gtt ctt aag cgg ctc gaa gat 3474
Arg Gly Glu Gly Val Gly Val Val Val Leu Lys Arg Leu Glu Asp
1145 1150 1155
gct gtt gcg gat aac gac aac atc cta ggt gtc atc cgc gga tca 3519
Ala Val Ala Asp Asn Asp Asn Ile Leu Gly Val Ile Arg Gly Ser
1160 1165 1170
gcg aga aac tat agc aag gga gct tct tct att aca cat cca tct 3564
Ala Arg Asn Tyr Ser Lys Gly Ala Ser Ser Ile Thr His Pro Ser
1175 1180 1185
tcg gaa gcg cag cag cgt ctc tat cgg cag gtc ttg aat cag aat 3609
Ser Glu Ala Gln Gln Arg Leu Tyr Arg Gln Val Leu Asn Gln Asn
1190 1195 1200
gcc ata gac gcg gcg agt gtt tcc tat gtg gaa atg cac ggc acc 3654
Ala Ile Asp Ala Ala Ser Val Ser Tyr Val Glu Met His Gly Thr
1205 1210 1215
gga aca caa gcc ggt gac tct aca gag atg tcc tca gta ttg tct 3699
Gly Thr Gln Ala Gly Asp Ser Thr Glu Met Ser Ser Val Leu Ser
1220 1225 1230
aca ttt ggt caa tct cgt tcc aaa gat aac cca ctg gtt gtc ggc 3744
Thr Phe Gly Gln Ser Arg Ser Lys Asp Asn Pro Leu Val Val Gly
1235 1240 1245
gct gtt aag gca aat att ggt cat gga gaa gcc gct gct ggt gtt 3789
Ala Val Lys Ala Asn Ile Gly His Gly Glu Ala Ala Ala Gly Val
1250 1255 1260
tgt gcc ctc atc aag acc ttg atg atg ttt cag aag cat acc atc 3834
Cys Ala Leu Ile Lys Thr Leu Met Met Phe Gln Lys His Thr Ile
1265 1270 1275
cca ccg caa cct gga atg cct ttt aaa ctt aat cat cat ttc ccc 3879
Pro Pro Gln Pro Gly Met Pro Phe Lys Leu Asn His His Phe Pro
1280 1285 1290
gat ctg gag aag atg aac gtg cat ata cca gca act gca att ccg 3924
Asp Leu Glu Lys Met Asn Val His Ile Pro Ala Thr Ala Ile Pro
1295 1300 1305
cta acg agt gct agt aac gcc gcc aaa cga agg atc ttt ctc aac 3969
Leu Thr Ser Ala Ser Asn Ala Ala Lys Arg Arg Ile Phe Leu Asn
1310 1315 1320
agc ttc gat gcc tct ggg ggg aac tct tgc ctt cta tta gag gag 4014
Ser Phe Asp Ala Ser Gly Gly Asn Ser Cys Leu Leu Leu Glu Glu
1325 1330 1335
gcg cct cta aag cac tcc aag gct agt gat ccc cga aat cac cac 4059
Ala Pro Leu Lys His Ser Lys Ala Ser Asp Pro Arg Asn His His
1340 1345 1350
gtc gtg acc ttt tct gct cga act ccc ttc tct ctt cga gca att 4104
Val Val Thr Phe Ser Ala Arg Thr Pro Phe Ser Leu Arg Ala Ile
1355 1360 1365
aaa gaa aaa tac ctt caa tat att cgg ctc aac ccg aat aca tcg 4149
Lys Glu Lys Tyr Leu Gln Tyr Ile Arg Leu Asn Pro Asn Thr Ser
1370 1375 1380
ctg gct gat ctt gcc tac acc acg act gca cgc cgc atg cac caa 4194
Leu Ala Asp Leu Ala Tyr Thr Thr Thr Ala Arg Arg Met His Gln
1385 1390 1395
agc tcg gcc cgg tca aca ttt acc gct acg agt atc gaa gat ttt 4239
Ser Ser Ala Arg Ser Thr Phe Thr Ala Thr Ser Ile Glu Asp Phe
1400 1405 1410
gcc aat aag ctt gaa act gac ttg aag aaa gaa gat tcc cct gtc 4284
Ala Asn Lys Leu Glu Thr Asp Leu Lys Lys Glu Asp Ser Pro Val
1415 1420 1425
aaa aag agt aag ggg gct tct agt ggg cct aac gtt gta ttt gct 4329
Lys Lys Ser Lys Gly Ala Ser Ser Gly Pro Asn Val Val Phe Ala
1430 1435 1440
ttt acc ggt cag ggg tcc cag tat gca ggg atg gct cat caa ctc 4374
Phe Thr Gly Gln Gly Ser Gln Tyr Ala Gly Met Ala His Gln Leu
1445 1450 1455
tgg cac gac agt gcg gta ttc cgg cgg cta ata gac tcg atc caa 4419
Trp His Asp Ser Ala Val Phe Arg Arg Leu Ile Asp Ser Ile Gln
1460 1465 1470
tcc ata gcg act gct ttg gat ttg cct aag ttt gtt gac ctg att 4464
Ser Ile Ala Thr Ala Leu Asp Leu Pro Lys Phe Val Asp Leu Ile
1475 1480 1485
gct tcc caa agc ttc gat ttg tct aaa gcc agc cca att cag aca 4509
Ala Ser Gln Ser Phe Asp Leu Ser Lys Ala Ser Pro Ile Gln Thr
1490 1495 1500
caa cta gct ata gtg gcg ctt gaa att ggc ctg gct cag cta tgg 4554
Gln Leu Ala Ile Val Ala Leu Glu Ile Gly Leu Ala Gln Leu Trp
1505 1510 1515
gca tca tgg gga gtg cag cca agc ctt gtc att ggc cac agc ttg 4599
Ala Ser Trp Gly Val Gln Pro Ser Leu Val Ile Gly His Ser Leu
1520 1525 1530
gga gag tat gct gca tta tgc ata tca ggg gtt ctg acg gtc agc 4644
Gly Glu Tyr Ala Ala Leu Cys Ile Ser Gly Val Leu Thr Val Ser
1535 1540 1545
gat act ctc tat cta gtc gga aag agg gca atg atg tta gtt gag 4689
Asp Thr Leu Tyr Leu Val Gly Lys Arg Ala Met Met Leu Val Glu
1550 1555 1560
tct gtt gcg caa aac gaa tac gcc atg ctg gca atc aat gat gaa 4734
Ser Val Ala Gln Asn Glu Tyr Ala Met Leu Ala Ile Asn Asp Glu
1565 1570 1575
gtt gat atc att cgt cag cgc ctc gca aca gac gca tat aat aca 4779
Val Asp Ile Ile Arg Gln Arg Leu Ala Thr Asp Ala Tyr Asn Thr
1580 1585 1590
tgt gag atc gca tgc atc aac gca ccc aaa tcg acc gtg gta agt 4824
Cys Glu Ile Ala Cys Ile Asn Ala Pro Lys Ser Thr Val Val Ser
1595 1600 1605
ggc gct cta tca gaa atc aaa atc atg caa aag gag tta gag gaa 4869
Gly Ala Leu Ser Glu Ile Lys Ile Met Gln Lys Glu Leu Glu Glu
1610 1615 1620
caa ggg tat cgg tcc act ctt ctc cat gta cca ttc gga ttc cac 4914
Gln Gly Tyr Arg Ser Thr Leu Leu His Val Pro Phe Gly Phe His
1625 1630 1635
tcg aag caa atg gac cca atc cta gat tcg tac gag tcg tgt gta 4959
Ser Lys Gln Met Asp Pro Ile Leu Asp Ser Tyr Glu Ser Cys Val
1640 1645 1650
cag gga gtt ggt att tca tcc cct cgg gtt cca ata gca tcc act 5004
Gln Gly Val Gly Ile Ser Ser Pro Arg Val Pro Ile Ala Ser Thr
1655 1660 1665
ctc cta ggt gat att att cag gac aag tca acg gtt tct tct gtc 5049
Leu Leu Gly Asp Ile Ile Gln Asp Lys Ser Thr Val Ser Ser Val
1670 1675 1680
tac ctt aga cga cag acc cga gaa tct gtt aat ttt gtc gga gct 5094
Tyr Leu Arg Arg Gln Thr Arg Glu Ser Val Asn Phe Val Gly Ala
1685 1690 1695
cta caa gcg gca cag gtc tcc aat ttc ctg cgg gat gac aca ctc 5139
Leu Gln Ala Ala Gln Val Ser Asn Phe Leu Arg Asp Asp Thr Leu
1700 1705 1710
ttt ctc gag atg ggg ccc gat cca gtt tgt atg tcg ttg gtt cgc 5184
Phe Leu Glu Met Gly Pro Asp Pro Val Cys Met Ser Leu Val Arg
1715 1720 1725
tca act ctg ggg aca att gca acg cct cga ctt cta cct gcc ctt 5229
Ser Thr Leu Gly Thr Ile Ala Thr Pro Arg Leu Leu Pro Ala Leu
1730 1735 1740
cgc cgg aac gaa aac aat tgg ttg acc acg tca aat aca cta gca 5274
Arg Arg Asn Glu Asn Asn Trp Leu Thr Thr Ser Asn Thr Leu Ala
1745 1750 1755
gca gtc cac cag gcc ggt gtg ccc gtc aac tgg cca gac tat cac 5319
Ala Val His Gln Ala Gly Val Pro Val Asn Trp Pro Asp Tyr His
1760 1765 1770
cgg gag ttt aca aac tgt ctg aca ctg cta gat ttg ccc aca tat 5364
Arg Glu Phe Thr Asn Cys Leu Thr Leu Leu Asp Leu Pro Thr Tyr
1775 1780 1785
gtg ttt gat gaa aag gag ttc tgg aca tca tac ccg gac ccc gag 5409
Val Phe Asp Glu Lys Glu Phe Trp Thr Ser Tyr Pro Asp Pro Glu
1790 1795 1800
cag cta agt ggt gtt gag caa aag cat ttg tca cca cca cca gtt 5454
Gln Leu Ser Gly Val Glu Gln Lys His Leu Ser Pro Pro Pro Val
1805 1810 1815
cct gca gta cag gga ttc ccc aca aca act ctt caa agg cta acc 5499
Pro Ala Val Gln Gly Phe Pro Thr Thr Thr Leu Gln Arg Leu Thr
1820 1825 1830
caa gaa gca ttc gag gac ggt aaa atc tcg gtc act ttc gag tcc 5544
Gln Glu Ala Phe Glu Asp Gly Lys Ile Ser Val Thr Phe Glu Ser
1835 1840 1845
agc aca tct gat cct cac ctt ttt gaa gcg ata atg ggc cat gct 5589
Ser Thr Ser Asp Pro His Leu Phe Glu Ala Ile Met Gly His Ala
1850 1855 1860
gtg gcc gga gtc acg att tgt tcc agt agt atc ttc agc gac atg 5634
Val Ala Gly Val Thr Ile Cys Ser Ser Ser Ile Phe Ser Asp Met
1865 1870 1875
gca tta tcg gcc gct cgg tac acg tgc gaa cgg cta cag cca ggc 5679
Ala Leu Ser Ala Ala Arg Tyr Thr Cys Glu Arg Leu Gln Pro Gly
1880 1885 1890
agg tgg tct gaa gag cta ctt acc atc agc ggc ctg gat att cag 5724
Arg Trp Ser Glu Glu Leu Leu Thr Ile Ser Gly Leu Asp Ile Gln
1895 1900 1905
cgg cca ata gtg gtc ctt gat cga aaa gac tca cat atc att cag 5769
Arg Pro Ile Val Val Leu Asp Arg Lys Asp Ser His Ile Ile Gln
1910 1915 1920
atc aac gct aaa ctt gat gca aaa acc gaa gag gtt tat atc agc 5814
Ile Asn Ala Lys Leu Asp Ala Lys Thr Glu Glu Val Tyr Ile Ser
1925 1930 1935
ttt caa gac cag gtt ggg aaa ccc ata ggg tcc tgc aag atc tca 5859
Phe Gln Asp Gln Val Gly Lys Pro Ile Gly Ser Cys Lys Ile Ser
1940 1945 1950
ttt cac gac gct gcg agc tgg aag cag aac atc tcg cgt att ctg 5904
Phe His Asp Ala Ala Ser Trp Lys Gln Asn Ile Ser Arg Ile Leu
1955 1960 1965
tat ctt gtc tct ttc agg att gat gta cta aaa gag gca act atc 5949
Tyr Leu Val Ser Phe Arg Ile Asp Val Leu Lys Glu Ala Thr Ile
1970 1975 1980
act ggt caa gga cat cga ttc ttg cgg cca gtg atc tac cga ctc 5994
Thr Gly Gln Gly His Arg Phe Leu Arg Pro Val Ile Tyr Arg Leu
1985 1990 1995
ttc tcc aat gtc gtg aat tat ggg gaa cgt ttt caa ggg tta gaa 6039
Phe Ser Asn Val Val Asn Tyr Gly Glu Arg Phe Gln Gly Leu Glu
2000 2005 2010
gag gtt ttc ctc gat tcc gag tgt aac gat gtt gtt ggt caa gtt 6084
Glu Val Phe Leu Asp Ser Glu Cys Asn Asp Val Val Gly Gln Val
2015 2020 2025
aga ctt ccg gac ttg cca tcc agt aaa tca gga cat ttc cta tat 6129
Arg Leu Pro Asp Leu Pro Ser Ser Lys Ser Gly His Phe Leu Tyr
2030 2035 2040
agc ccc tat tta ctt gat gcc gtt gta cat gtt gcc ggc ttc ctg 6174
Ser Pro Tyr Leu Leu Asp Ala Val Val His Val Ala Gly Phe Leu
2045 2050 2055
gtc aac tgc ggc ttg aaa tat ccc gag gat ata ggg ttc ctg gct 6219
Val Asn Cys Gly Leu Lys Tyr Pro Glu Asp Ile Gly Phe Leu Ala
2060 2065 2070
tcc agc ttc gaa tcc tgg cac ata ttg aag cct atc tta cct aat 6264
Ser Ser Phe Glu Ser Trp His Ile Leu Lys Pro Ile Leu Pro Asn
2075 2080 2085
aaa act tac act agc tat tcc cac atg gaa gaa tca tct aac gga 6309
Lys Thr Tyr Thr Ser Tyr Ser His Met Glu Glu Ser Ser Asn Gly
2090 2095 2100
tcc tct ttg ttg gga gac gtg tac gtc ttt gat ggg aaa gat ctg 6354
Ser Ser Leu Leu Gly Asp Val Tyr Val Phe Asp Gly Lys Asp Leu
2105 2110 2115
gtc ggc tca cta act gga ctc cgc ttt caa aag atg aaa aag att 6399
Val Gly Ser Leu Thr Gly Leu Arg Phe Gln Lys Met Lys Lys Ile
2120 2125 2130
gct ctc aca aga att ttg caa tcg gca gcc cct cac tct tct atg 6444
Ala Leu Thr Arg Ile Leu Gln Ser Ala Ala Pro His Ser Ser Met
2135 2140 2145
aaa ata ggc gca gga gtc ttt cga cca gat ctt ctt ggg tca agt 6489
Lys Ile Gly Ala Gly Val Phe Arg Pro Asp Leu Leu Gly Ser Ser
2150 2155 2160
gaa aaa cag tct tca aga aat aag cag ttg gct agg gat gtt gat 6534
Glu Lys Gln Ser Ser Arg Asn Lys Gln Leu Ala Arg Asp Val Asp
2165 2170 2175
ttc gat aca cta cct tca tcg gtc gag ccg tct gct ttc acc act 6579
Phe Asp Thr Leu Pro Ser Ser Val Glu Pro Ser Ala Phe Thr Thr
2180 2185 2190
ccc aaa cct tcg tca tct gtc acc tct atc ata ggt cat gat gaa 6624
Pro Lys Pro Ser Ser Ser Val Thr Ser Ile Ile Gly His Asp Glu
2195 2200 2205
ccc ggg gtt gga gat aag ttt ctt gct gcc gtt gca gca gag gta 6669
Pro Gly Val Gly Asp Lys Phe Leu Ala Ala Val Ala Ala Glu Val
2210 2215 2220
ggc tgc gaa atc tcc gac ttg gaa ccc gac aca gta ttt gga gat 6714
Gly Cys Glu Ile Ser Asp Leu Glu Pro Asp Thr Val Phe Gly Asp
2225 2230 2235
cta ggg gta gac tcg ttg atg gca att acg gtt att gcc tca atc 6759
Leu Gly Val Asp Ser Leu Met Ala Ile Thr Val Ile Ala Ser Ile
2240 2245 2250
aga aat gac act gga gtc gaa ttg cca ggg tcg ttt ttc ctc gac 6804
Arg Asn Asp Thr Gly Val Glu Leu Pro Gly Ser Phe Phe Leu Asp
2255 2260 2265
aac ccg acc gtt gca gaa gct aca aaa gca ttg cgt ggg gat agc 6849
Asn Pro Thr Val Ala Glu Ala Thr Lys Ala Leu Arg Gly Asp Ser
2270 2275 2280
gac gct ggc atc tcc acg cct cag tct tct cct ccg aat ctt tcc 6894
Asp Ala Gly Ile Ser Thr Pro Gln Ser Ser Pro Pro Asn Leu Ser
2285 2290 2295
ccc aaa att cgt ggt gaa gaa gtg aac ggt gag tct tcg gtt cct 6939
Pro Lys Ile Arg Gly Glu Glu Val Asn Gly Glu Ser Ser Val Pro
2300 2305 2310
ttt gag ccg tta gag aca aca cca tct att acc aca gac ttc gaa 6984
Phe Glu Pro Leu Glu Thr Thr Pro Ser Ile Thr Thr Asp Phe Glu
2315 2320 2325
gtt gga agg gcg acg gaa aca ccg ttg tta ata gat aaa cca gct 7029
Val Gly Arg Ala Thr Glu Thr Pro Leu Leu Ile Asp Lys Pro Ala
2330 2335 2340
gct acc ctg tta ttg cag ggg tct gtg gct tca acg gag ccc cct 7074
Ala Thr Leu Leu Leu Gln Gly Ser Val Ala Ser Thr Glu Pro Pro
2345 2350 2355
ctt ttc ctc cta gct gat ggc acc ggt tca gtt tct tcc tac ata 7119
Leu Phe Leu Leu Ala Asp Gly Thr Gly Ser Val Ser Ser Tyr Ile
2360 2365 2370
cag ctt cct gcg ctt tca ggc ggt cgt cga atc tat ggg gtg gag 7164
Gln Leu Pro Ala Leu Ser Gly Gly Arg Arg Ile Tyr Gly Val Glu
2375 2380 2385
tct cca ttt gct cgc gat ccg tcg gcc ttc gtt gat atc agc gtg 7209
Ser Pro Phe Ala Arg Asp Pro Ser Ala Phe Val Asp Ile Ser Val
2390 2395 2400
ggt gat tta gca gac gct ttt att ttc tcc ata cgc aaa gtt cag 7254
Gly Asp Leu Ala Asp Ala Phe Ile Phe Ser Ile Arg Lys Val Gln
2405 2410 2415
cct gtt ggt cca tat gtt att gga ggt tcc tcg ttg ggt gct att 7299
Pro Val Gly Pro Tyr Val Ile Gly Gly Ser Ser Leu Gly Ala Ile
2420 2425 2430
cat gcg ttt gag gtt agc cat cgt tta ctc aat gct ggt gag act 7344
His Ala Phe Glu Val Ser His Arg Leu Leu Asn Ala Gly Glu Thr
2435 2440 2445
gtc tct gag ttg ctt ctc atc gca aat gca gca cca att cct gcc 7389
Val Ser Glu Leu Leu Leu Ile Ala Asn Ala Ala Pro Ile Pro Ala
2450 2455 2460
cca gct cat ctg aga cat ttg gaa att tcc acc gaa atg att gag 7434
Pro Ala His Leu Arg His Leu Glu Ile Ser Thr Glu Met Ile Glu
2465 2470 2475
aaa agt gga att gct tat ggc acc ggc cgg aag aag tta tcc acc 7479
Lys Ser Gly Ile Ala Tyr Gly Thr Gly Arg Lys Lys Leu Ser Thr
2480 2485 2490
cta tct gca aga caa aaa cag cat ctt acg gct tct gtt cga tct 7524
Leu Ser Ala Arg Gln Lys Gln His Leu Thr Ala Ser Val Arg Ser
2495 2500 2505
cac gta ctc tac gag ccc cag gcc ttt acc gaa acc cat cgg cca 7569
His Val Leu Tyr Glu Pro Gln Ala Phe Thr Glu Thr His Arg Pro
2510 2515 2520
gta cat aca acg ttg atc gtt gcc tca aag ggt ctt ggg ggt ggg 7614
Val His Thr Thr Leu Ile Val Ala Ser Lys Gly Leu Gly Gly Gly
2525 2530 2535
aca agc tcg cca gaa tgt cca tta act ccc tgg ata cag gct aat 7659
Thr Ser Ser Pro Glu Cys Pro Leu Thr Pro Trp Ile Gln Ala Asn
2540 2545 2550
tgg gga tcg tcg gag act ctg ggg tgg gat ggc ctg gtc ggc gag 7704
Trp Gly Ser Ser Glu Thr Leu Gly Trp Asp Gly Leu Val Gly Glu
2555 2560 2565
att cac tct att cac cgc gaa gac act gac agt ttc tca tta ctg 7749
Ile His Ser Ile His Arg Glu Asp Thr Asp Ser Phe Ser Leu Leu
2570 2575 2580
aag tat cct aac att acc aag tta ggc caa att atc aat gac cgc 7794
Lys Tyr Pro Asn Ile Thr Lys Leu Gly Gln Ile Ile Asn Asp Arg
2585 2590 2595
gtt tgt cat gca tag 7809
Val Cys His Ala
2600
<210> 16
<211> 2602
<212> PRT
<213> Talaromyces islandicus
<400> 16
Met Ala Leu Asp Phe Asp Tyr Ile Ile Val Gly Gly Gly Thr Ala Gly
1 5 10 15
Cys Val Leu Ala Ser Arg Leu Ser Glu Tyr Leu Pro Asp Ala Ser Ile
20 25 30
Leu Leu Ile Glu Ala Gly Ile Glu His Asp Pro Arg Val Lys Pro Thr
35 40 45
Leu Gly Leu Thr Gly Gln Ala Ala Asn Glu Ile Lys Trp Asn Ile Gln
50 55 60
Ser Ala Pro Gln Ser Ala Val Gly Asn Lys Thr Ile Asp Leu Val Gln
65 70 75 80
Gly Lys Val Leu Gly Gly Thr Ser Gly Ile Asn His Gln Val Trp Ser
85 90 95
Arg Gly Ala Ala Gly Asp Phe Asn Arg Trp Ala Ala Glu Val Gly Asp
100 105 110
Pro Arg Trp Ser Trp Asn Gly Gln Leu Pro Phe Phe Lys Asn Thr Glu
115 120 125
Thr Phe His Pro Gly Ala Asp Leu Gln Gly Lys Asp Leu Ser Ala Leu
130 135 140
His Gly Phe Asp Gly Pro Ile Lys Val Ser Gln Thr Ser Ser Cys Gly
145 150 155 160
Arg Pro Arg Asn Tyr Pro Leu Lys Gly Ala Ile Ala Ser Met Tyr Lys
165 170 175
Ser Ala Gly Val Ser Gln Gly Glu Asp Leu Asn Ser Gly Asn Ile Leu
180 185 190
Gly Phe Ser Glu Ala Thr Ala Gly Ser Tyr Asp Gly Ile Arg Gln Trp
195 200 205
Ala Gly Gly Asn Tyr Lys Phe Gly Pro Asn Val Thr Leu Trp Thr Glu
210 215 220
Thr His Val Ser Lys Ile Ile Ser Gln Gly Ser Arg Ala Thr Gly Val
225 230 235 240
Glu Tyr Leu Arg Pro Asp Arg Ser Thr Ser Ser Ser Val Ser Ala Lys
245 250 255
Lys Glu Val Ile Val Ser Ser Gly Ala Gln Gly Ser Pro Lys Leu Leu
260 265 270
Leu Leu Ser Gly Ile Gly Pro Ser Ala Glu Leu Gln Lys His Ser Ile
275 280 285
Gln Gln Val Val Glu Leu Pro Val Gly Glu Asn Tyr Ser Asp His Pro
290 295 300
Met Met Ala Thr Tyr Trp Asn Leu Glu Lys Arg Gly Leu Ala Leu Gly
305 310 315 320
Asp Val Glu Met Arg Ser Ala Glu Cys Asp Trp Thr Ser Gly Leu Pro
325 330 335
Val Asp Trp Leu Ala Phe His Arg His Asp Gln Asp Pro Thr Ile Ala
340 345 350
Ala Leu Ala Glu Ser Gln Leu Ser Ser Asn Glu Leu Glu Arg Phe Gln
355 360 365
Glu Gln Asn Arg Ala His Thr Glu Ser Val Val Leu Tyr Gly His Ile
370 375 380
Asp Phe Ser Gly Lys Ala Gly Pro Pro Pro Pro Gly Ser Asn Val Cys
385 390 395 400
Val Met Asn Ile Leu Val Thr Pro Ser Ser Arg Gly Thr Val Thr Leu
405 410 415
Lys Ser Thr Asn Pro Phe Asp Ala Pro Val Cys Asp Pro Asn Met Leu
420 425 430
Ser Asn Glu Leu Asp Lys Gln Leu Leu Trp Ser Val Thr Arg Leu Thr
435 440 445
Ser Gln Gly Leu Glu Arg Thr Ile Ser Pro Glu Tyr Gly Leu Ser Glu
450 455 460
Tyr Ala Ile Asp Asp Asp Leu Arg Gly Asp Tyr Gly Asp Glu Ala Met
465 470 475 480
Met Arg Arg Ala Val Arg Ile Val Arg Thr Val Asn His Gly Ser Gly
485 490 495
Thr Cys Ser Met Gly Thr Val Val Asp Thr Glu Cys Arg Val Lys Gly
500 505 510
Val Glu Gly Leu Arg Val Val Asp Ser Ser Val Ile Pro Leu Pro Leu
515 520 525
Cys Ala His Tyr Gln Ala Ser Val Tyr Ala Leu Ala Glu Gln Asp Gln
530 535 540
Thr Glu Gln Phe Leu Leu Leu Tyr Gly Asp Gln Thr Val Glu Lys Leu
545 550 555 560
Pro Ala Val Arg Ala Leu Val Glu His Ala Gln Arg Ser Pro Ala Gly
565 570 575
Arg Arg Phe Leu Arg Asp Ala Cys Asp Ile Ile Gln Ile Glu Ile Phe
580 585 590
Ser Leu Asp Thr Asp Glu Arg Ala His Val Gly His Phe Asp Thr Leu
595 600 605
Leu Gln Leu Ala Glu Ser Asn Ala Gln Ala Asp Gln Pro Ser Glu Ile
610 615 620
Val Ala Thr Ile Leu Met Asn Val Thr Arg Leu Gly Glu Phe Ile Leu
625 630 635 640
Tyr Ala Glu Glu His Pro Asn Val Leu Gly Ser Ile Glu Gln Pro Thr
645 650 655
His Ile Val Ala Phe Cys Thr Gly Glu Ile Pro Ala Ala Val Ala Ala
660 665 670
Ala Ala Arg Asp Ser Ile Glu Leu Tyr Asn Leu Ser Ile Glu Thr Val
675 680 685
Arg Ile Ile Cys Arg Phe Ala Arg Asn Ile Ile Arg Arg Ser Val Leu
690 695 700
Val Asp Arg Thr Asn Gly Ser Trp Ala Thr Thr Ile Val Gly Val Ser
705 710 715 720
Pro Gly Arg Val Gln Thr Ile Leu Asp Thr Phe His Gln Ser Gln Asn
725 730 735
Ile Ala Pro Thr Arg Gln Ile Asn Ile Gly Ile Met Ala Ala Gly Trp
740 745 750
Leu Thr Leu Phe Gly Pro Pro Ile Thr Thr Glu Gln Leu Phe Asn Trp
755 760 765
Ser Lys Glu Leu Asp Gly Ala Ser Arg Ile Lys Thr Asp Ala Gly Gly
770 775 780
Gly Val His Leu Pro Asn Leu Pro Glu Leu Asp Leu Asp Glu Val Val
785 790 795 800
Gly Leu Ser Pro Leu Leu Asp Val Pro Ile Thr Pro Lys Ala Arg Leu
805 810 815
Trp Ser Pro Tyr Ser Cys Glu Ile Arg Asn Ala Ala Thr Leu Arg Asp
820 825 830
Leu Ile Arg Gln Val Ile Pro Asp Ile Thr Gln Tyr Ser Leu Arg Leu
835 840 845
Ser Asp Thr Ile Glu Thr Ala Val Lys Gly Leu Ser Asn Gly Ser Val
850 855 860
Lys Val Val Cys Val Gly Tyr Thr Ala His Leu Ile Ser Leu Gln Lys
865 870 875 880
Ser Leu Gln Arg Glu Arg Arg Glu Ala Thr Val Leu Gln His Ser Ser
885 890 895
Ala Gly Ser Thr Phe Phe Thr Ser Pro Arg Gly Gly Ser Glu Ser Ile
900 905 910
Ala Ile Val Gly Met Ser Gly Arg Phe Pro Gly Ser Asp Asn Ile Gln
915 920 925
Glu Tyr Trp Gln Ser Leu Leu Asp Gly Glu Arg His Ile Lys Glu Ile
930 935 940
Pro Lys Asn Arg Phe Asp Leu Ser Lys Trp Tyr Asp Glu Thr Gly Lys
945 950 955 960
Gln Lys Asn Ala Thr Met Asn Arg Ser Gly Ala Phe Leu Asp Arg Pro
965 970 975
Gly Tyr Phe Asp Asn Arg Leu Phe Asn Met Ser Pro Arg Glu Ala Leu
980 985 990
Gln Thr Asp Pro Leu His Arg Met Phe Leu Thr Val Ser Tyr Glu Ala
995 1000 1005
Leu Glu Met Ala Gly Tyr Ser Pro Glu Ala Thr Leu Ala Thr Asn
1010 1015 1020
Ser Asn Arg Ile Ala Thr Tyr Phe Gly Gln Thr Ser Asp Asp Trp
1025 1030 1035
Arg Asp Ile Val Leu Thr Gln Gly Val Asp Ile Tyr Tyr Ala Pro
1040 1045 1050
Gly Ile Cys Arg Ala Phe Ala Pro Gly Arg Leu Asn Tyr His Phe
1055 1060 1065
Lys Trp Gly Gly Pro Ser Tyr Ser Val Asp Ala Ala Cys Ala Ser
1070 1075 1080
Ser Ile Ala Thr Ile Ser Leu Ala Cys Ser Ala Leu Leu Ala Arg
1085 1090 1095
Glu Cys Asp Thr Ala Leu Ala Gly Gly Gly Ser Ile Leu Asp Ser
1100 1105 1110
Pro Ala Pro Phe Ala Gly Leu Ser Arg Gly Gly Phe Leu Ser Pro
1115 1120 1125
Glu Lys Gly Cys Glu Thr Phe His Asp Asp Ala Asp Gly Tyr Val
1130 1135 1140
Arg Gly Glu Gly Val Gly Val Val Val Leu Lys Arg Leu Glu Asp
1145 1150 1155
Ala Val Ala Asp Asn Asp Asn Ile Leu Gly Val Ile Arg Gly Ser
1160 1165 1170
Ala Arg Asn Tyr Ser Lys Gly Ala Ser Ser Ile Thr His Pro Ser
1175 1180 1185
Ser Glu Ala Gln Gln Arg Leu Tyr Arg Gln Val Leu Asn Gln Asn
1190 1195 1200
Ala Ile Asp Ala Ala Ser Val Ser Tyr Val Glu Met His Gly Thr
1205 1210 1215
Gly Thr Gln Ala Gly Asp Ser Thr Glu Met Ser Ser Val Leu Ser
1220 1225 1230
Thr Phe Gly Gln Ser Arg Ser Lys Asp Asn Pro Leu Val Val Gly
1235 1240 1245
Ala Val Lys Ala Asn Ile Gly His Gly Glu Ala Ala Ala Gly Val
1250 1255 1260
Cys Ala Leu Ile Lys Thr Leu Met Met Phe Gln Lys His Thr Ile
1265 1270 1275
Pro Pro Gln Pro Gly Met Pro Phe Lys Leu Asn His His Phe Pro
1280 1285 1290
Asp Leu Glu Lys Met Asn Val His Ile Pro Ala Thr Ala Ile Pro
1295 1300 1305
Leu Thr Ser Ala Ser Asn Ala Ala Lys Arg Arg Ile Phe Leu Asn
1310 1315 1320
Ser Phe Asp Ala Ser Gly Gly Asn Ser Cys Leu Leu Leu Glu Glu
1325 1330 1335
Ala Pro Leu Lys His Ser Lys Ala Ser Asp Pro Arg Asn His His
1340 1345 1350
Val Val Thr Phe Ser Ala Arg Thr Pro Phe Ser Leu Arg Ala Ile
1355 1360 1365
Lys Glu Lys Tyr Leu Gln Tyr Ile Arg Leu Asn Pro Asn Thr Ser
1370 1375 1380
Leu Ala Asp Leu Ala Tyr Thr Thr Thr Ala Arg Arg Met His Gln
1385 1390 1395
Ser Ser Ala Arg Ser Thr Phe Thr Ala Thr Ser Ile Glu Asp Phe
1400 1405 1410
Ala Asn Lys Leu Glu Thr Asp Leu Lys Lys Glu Asp Ser Pro Val
1415 1420 1425
Lys Lys Ser Lys Gly Ala Ser Ser Gly Pro Asn Val Val Phe Ala
1430 1435 1440
Phe Thr Gly Gln Gly Ser Gln Tyr Ala Gly Met Ala His Gln Leu
1445 1450 1455
Trp His Asp Ser Ala Val Phe Arg Arg Leu Ile Asp Ser Ile Gln
1460 1465 1470
Ser Ile Ala Thr Ala Leu Asp Leu Pro Lys Phe Val Asp Leu Ile
1475 1480 1485
Ala Ser Gln Ser Phe Asp Leu Ser Lys Ala Ser Pro Ile Gln Thr
1490 1495 1500
Gln Leu Ala Ile Val Ala Leu Glu Ile Gly Leu Ala Gln Leu Trp
1505 1510 1515
Ala Ser Trp Gly Val Gln Pro Ser Leu Val Ile Gly His Ser Leu
1520 1525 1530
Gly Glu Tyr Ala Ala Leu Cys Ile Ser Gly Val Leu Thr Val Ser
1535 1540 1545
Asp Thr Leu Tyr Leu Val Gly Lys Arg Ala Met Met Leu Val Glu
1550 1555 1560
Ser Val Ala Gln Asn Glu Tyr Ala Met Leu Ala Ile Asn Asp Glu
1565 1570 1575
Val Asp Ile Ile Arg Gln Arg Leu Ala Thr Asp Ala Tyr Asn Thr
1580 1585 1590
Cys Glu Ile Ala Cys Ile Asn Ala Pro Lys Ser Thr Val Val Ser
1595 1600 1605
Gly Ala Leu Ser Glu Ile Lys Ile Met Gln Lys Glu Leu Glu Glu
1610 1615 1620
Gln Gly Tyr Arg Ser Thr Leu Leu His Val Pro Phe Gly Phe His
1625 1630 1635
Ser Lys Gln Met Asp Pro Ile Leu Asp Ser Tyr Glu Ser Cys Val
1640 1645 1650
Gln Gly Val Gly Ile Ser Ser Pro Arg Val Pro Ile Ala Ser Thr
1655 1660 1665
Leu Leu Gly Asp Ile Ile Gln Asp Lys Ser Thr Val Ser Ser Val
1670 1675 1680
Tyr Leu Arg Arg Gln Thr Arg Glu Ser Val Asn Phe Val Gly Ala
1685 1690 1695
Leu Gln Ala Ala Gln Val Ser Asn Phe Leu Arg Asp Asp Thr Leu
1700 1705 1710
Phe Leu Glu Met Gly Pro Asp Pro Val Cys Met Ser Leu Val Arg
1715 1720 1725
Ser Thr Leu Gly Thr Ile Ala Thr Pro Arg Leu Leu Pro Ala Leu
1730 1735 1740
Arg Arg Asn Glu Asn Asn Trp Leu Thr Thr Ser Asn Thr Leu Ala
1745 1750 1755
Ala Val His Gln Ala Gly Val Pro Val Asn Trp Pro Asp Tyr His
1760 1765 1770
Arg Glu Phe Thr Asn Cys Leu Thr Leu Leu Asp Leu Pro Thr Tyr
1775 1780 1785
Val Phe Asp Glu Lys Glu Phe Trp Thr Ser Tyr Pro Asp Pro Glu
1790 1795 1800
Gln Leu Ser Gly Val Glu Gln Lys His Leu Ser Pro Pro Pro Val
1805 1810 1815
Pro Ala Val Gln Gly Phe Pro Thr Thr Thr Leu Gln Arg Leu Thr
1820 1825 1830
Gln Glu Ala Phe Glu Asp Gly Lys Ile Ser Val Thr Phe Glu Ser
1835 1840 1845
Ser Thr Ser Asp Pro His Leu Phe Glu Ala Ile Met Gly His Ala
1850 1855 1860
Val Ala Gly Val Thr Ile Cys Ser Ser Ser Ile Phe Ser Asp Met
1865 1870 1875
Ala Leu Ser Ala Ala Arg Tyr Thr Cys Glu Arg Leu Gln Pro Gly
1880 1885 1890
Arg Trp Ser Glu Glu Leu Leu Thr Ile Ser Gly Leu Asp Ile Gln
1895 1900 1905
Arg Pro Ile Val Val Leu Asp Arg Lys Asp Ser His Ile Ile Gln
1910 1915 1920
Ile Asn Ala Lys Leu Asp Ala Lys Thr Glu Glu Val Tyr Ile Ser
1925 1930 1935
Phe Gln Asp Gln Val Gly Lys Pro Ile Gly Ser Cys Lys Ile Ser
1940 1945 1950
Phe His Asp Ala Ala Ser Trp Lys Gln Asn Ile Ser Arg Ile Leu
1955 1960 1965
Tyr Leu Val Ser Phe Arg Ile Asp Val Leu Lys Glu Ala Thr Ile
1970 1975 1980
Thr Gly Gln Gly His Arg Phe Leu Arg Pro Val Ile Tyr Arg Leu
1985 1990 1995
Phe Ser Asn Val Val Asn Tyr Gly Glu Arg Phe Gln Gly Leu Glu
2000 2005 2010
Glu Val Phe Leu Asp Ser Glu Cys Asn Asp Val Val Gly Gln Val
2015 2020 2025
Arg Leu Pro Asp Leu Pro Ser Ser Lys Ser Gly His Phe Leu Tyr
2030 2035 2040
Ser Pro Tyr Leu Leu Asp Ala Val Val His Val Ala Gly Phe Leu
2045 2050 2055
Val Asn Cys Gly Leu Lys Tyr Pro Glu Asp Ile Gly Phe Leu Ala
2060 2065 2070
Ser Ser Phe Glu Ser Trp His Ile Leu Lys Pro Ile Leu Pro Asn
2075 2080 2085
Lys Thr Tyr Thr Ser Tyr Ser His Met Glu Glu Ser Ser Asn Gly
2090 2095 2100
Ser Ser Leu Leu Gly Asp Val Tyr Val Phe Asp Gly Lys Asp Leu
2105 2110 2115
Val Gly Ser Leu Thr Gly Leu Arg Phe Gln Lys Met Lys Lys Ile
2120 2125 2130
Ala Leu Thr Arg Ile Leu Gln Ser Ala Ala Pro His Ser Ser Met
2135 2140 2145
Lys Ile Gly Ala Gly Val Phe Arg Pro Asp Leu Leu Gly Ser Ser
2150 2155 2160
Glu Lys Gln Ser Ser Arg Asn Lys Gln Leu Ala Arg Asp Val Asp
2165 2170 2175
Phe Asp Thr Leu Pro Ser Ser Val Glu Pro Ser Ala Phe Thr Thr
2180 2185 2190
Pro Lys Pro Ser Ser Ser Val Thr Ser Ile Ile Gly His Asp Glu
2195 2200 2205
Pro Gly Val Gly Asp Lys Phe Leu Ala Ala Val Ala Ala Glu Val
2210 2215 2220
Gly Cys Glu Ile Ser Asp Leu Glu Pro Asp Thr Val Phe Gly Asp
2225 2230 2235
Leu Gly Val Asp Ser Leu Met Ala Ile Thr Val Ile Ala Ser Ile
2240 2245 2250
Arg Asn Asp Thr Gly Val Glu Leu Pro Gly Ser Phe Phe Leu Asp
2255 2260 2265
Asn Pro Thr Val Ala Glu Ala Thr Lys Ala Leu Arg Gly Asp Ser
2270 2275 2280
Asp Ala Gly Ile Ser Thr Pro Gln Ser Ser Pro Pro Asn Leu Ser
2285 2290 2295
Pro Lys Ile Arg Gly Glu Glu Val Asn Gly Glu Ser Ser Val Pro
2300 2305 2310
Phe Glu Pro Leu Glu Thr Thr Pro Ser Ile Thr Thr Asp Phe Glu
2315 2320 2325
Val Gly Arg Ala Thr Glu Thr Pro Leu Leu Ile Asp Lys Pro Ala
2330 2335 2340
Ala Thr Leu Leu Leu Gln Gly Ser Val Ala Ser Thr Glu Pro Pro
2345 2350 2355
Leu Phe Leu Leu Ala Asp Gly Thr Gly Ser Val Ser Ser Tyr Ile
2360 2365 2370
Gln Leu Pro Ala Leu Ser Gly Gly Arg Arg Ile Tyr Gly Val Glu
2375 2380 2385
Ser Pro Phe Ala Arg Asp Pro Ser Ala Phe Val Asp Ile Ser Val
2390 2395 2400
Gly Asp Leu Ala Asp Ala Phe Ile Phe Ser Ile Arg Lys Val Gln
2405 2410 2415
Pro Val Gly Pro Tyr Val Ile Gly Gly Ser Ser Leu Gly Ala Ile
2420 2425 2430
His Ala Phe Glu Val Ser His Arg Leu Leu Asn Ala Gly Glu Thr
2435 2440 2445
Val Ser Glu Leu Leu Leu Ile Ala Asn Ala Ala Pro Ile Pro Ala
2450 2455 2460
Pro Ala His Leu Arg His Leu Glu Ile Ser Thr Glu Met Ile Glu
2465 2470 2475
Lys Ser Gly Ile Ala Tyr Gly Thr Gly Arg Lys Lys Leu Ser Thr
2480 2485 2490
Leu Ser Ala Arg Gln Lys Gln His Leu Thr Ala Ser Val Arg Ser
2495 2500 2505
His Val Leu Tyr Glu Pro Gln Ala Phe Thr Glu Thr His Arg Pro
2510 2515 2520
Val His Thr Thr Leu Ile Val Ala Ser Lys Gly Leu Gly Gly Gly
2525 2530 2535
Thr Ser Ser Pro Glu Cys Pro Leu Thr Pro Trp Ile Gln Ala Asn
2540 2545 2550
Trp Gly Ser Ser Glu Thr Leu Gly Trp Asp Gly Leu Val Gly Glu
2555 2560 2565
Ile His Ser Ile His Arg Glu Asp Thr Asp Ser Phe Ser Leu Leu
2570 2575 2580
Lys Tyr Pro Asn Ile Thr Lys Leu Gly Gln Ile Ile Asn Asp Arg
2585 2590 2595
Val Cys His Ala
2600
<210> 17
<211> 1143
<212> DNA
<213> Talaromyces islandicus
<220>
<221> CDS
<222> (1)..(1143)
<400> 17
atg tct gcg agc gta gaa aca gcg tgg tcg cag tgt ctg cga ata att 48
Met Ser Ala Ser Val Glu Thr Ala Trp Ser Gln Cys Leu Arg Ile Ile
1 5 10 15
gca aag gag aca ggg ttt agt atc gac gat atc gat gac gag gat gaa 96
Ala Lys Glu Thr Gly Phe Ser Ile Asp Asp Ile Asp Asp Glu Asp Glu
20 25 30
ttc acc aca gat ctc ggt gtc aac ccg att gtc gca cgg tca att ata 144
Phe Thr Thr Asp Leu Gly Val Asn Pro Ile Val Ala Arg Ser Ile Ile
35 40 45
cgt tct ttc gaa agc gtc ttg aaa aga gac att ccc tcg act gta ttt 192
Arg Ser Phe Glu Ser Val Leu Lys Arg Asp Ile Pro Ser Thr Val Phe
50 55 60
acc cag tgt cca act atc aaa gaa ttt cgc ggc gga tac ttt cag tca 240
Thr Gln Cys Pro Thr Ile Lys Glu Phe Arg Gly Gly Tyr Phe Gln Ser
65 70 75 80
tgc att gat agt atc acg gag cca aag gac gac ctg gca gta aag aaa 288
Cys Ile Asp Ser Ile Thr Glu Pro Lys Asp Asp Leu Ala Val Lys Lys
85 90 95
gct gca aca gcg cac gga gga aat aaa aaa tca act acg aac act act 336
Ala Ala Thr Ala His Gly Gly Asn Lys Lys Ser Thr Thr Asn Thr Thr
100 105 110
cgt acg cga gct cgc gtc ccg atc tcc att gtg ctc caa ggc aag cca 384
Arg Thr Arg Ala Arg Val Pro Ile Ser Ile Val Leu Gln Gly Lys Pro
115 120 125
acg atg gat tgt gcc gag aag acc aac atc ttc ctt cta cct gac ggc 432
Thr Met Asp Cys Ala Glu Lys Thr Asn Ile Phe Leu Leu Pro Asp Gly
130 135 140
agt ggt tcc ggg atg gct tat gtg gaa atg cca ctt atc gat cct tct 480
Ser Gly Ser Gly Met Ala Tyr Val Glu Met Pro Leu Ile Asp Pro Ser
145 150 155 160
act gtc tgt ctt gtt gcg ttg aat agt ccc tat ctc aac cgc gcc tcg 528
Thr Val Cys Leu Val Ala Leu Asn Ser Pro Tyr Leu Asn Arg Ala Ser
165 170 175
gag tac tgt tgt tca atc gaa gaa att gca aga gag tac gtg caa gag 576
Glu Tyr Cys Cys Ser Ile Glu Glu Ile Ala Arg Glu Tyr Val Gln Glu
180 185 190
att cgt aaa cgc caa cct cac gga cct tac gtg ctt ggg ggc tgg tct 624
Ile Arg Lys Arg Gln Pro His Gly Pro Tyr Val Leu Gly Gly Trp Ser
195 200 205
gcc ggt ggt tat tac tca tat gaa gtg gcg tgt gaa ctc atc cgt caa 672
Ala Gly Gly Tyr Tyr Ser Tyr Glu Val Ala Cys Glu Leu Ile Arg Gln
210 215 220
ggt gaa cgt gtg aaa aag ctc att ttg ctc gat tct cct tgt cgg cca 720
Gly Glu Arg Val Lys Lys Leu Ile Leu Leu Asp Ser Pro Cys Arg Pro
225 230 235 240
gat ttt gag gag ctt cca atg gaa gtg gtg cag tat tta tcc aaa aag 768
Asp Phe Glu Glu Leu Pro Met Glu Val Val Gln Tyr Leu Ser Lys Lys
245 250 255
aac ctt atg ggc aac tgg gac cgc agt gct cga cat aca agt gtt cct 816
Asn Leu Met Gly Asn Trp Asp Arg Ser Ala Arg His Thr Ser Val Pro
260 265 270
tct tgg gtc atc gag cat ttc cgc tcg act ctt cgg gcg gta cgt gag 864
Ser Trp Val Ile Glu His Phe Arg Ser Thr Leu Arg Ala Val Arg Glu
275 280 285
tat gtg cca gtg ccg atg gac gct gct gat gct cca gac gaa gtt tgc 912
Tyr Val Pro Val Pro Met Asp Ala Ala Asp Ala Pro Asp Glu Val Cys
290 295 300
atc atc tgg agt cga gaa ggt gta atg cca gca aac cag ctt cga aga 960
Ile Ile Trp Ser Arg Glu Gly Val Met Pro Ala Asn Gln Leu Arg Arg
305 310 315 320
acg ggt ttg gat ctc cgc gtc cgc gtc gca cgt ttt ctt ctc gaa gga 1008
Thr Gly Leu Asp Leu Arg Val Arg Val Ala Arg Phe Leu Leu Glu Gly
325 330 335
aaa cct gat ctc acc agt gca tac ggg tgg gac cgg ctt ttc ccc gga 1056
Lys Pro Asp Leu Thr Ser Ala Tyr Gly Trp Asp Arg Leu Phe Pro Gly
340 345 350
gcg cac atc agc att tcg tct atc tcg ggc aat cac ttc acc ctg atc 1104
Ala His Ile Ser Ile Ser Ser Ile Ser Gly Asn His Phe Thr Leu Ile
355 360 365
aac aaa ccc aac gta agc gtc tgt tcc ttt ccc gag tag 1143
Asn Lys Pro Asn Val Ser Val Cys Ser Phe Pro Glu
370 375 380
<210> 18
<211> 380
<212> PRT
<213> Talaromyces islandicus
<400> 18
Met Ser Ala Ser Val Glu Thr Ala Trp Ser Gln Cys Leu Arg Ile Ile
1 5 10 15
Ala Lys Glu Thr Gly Phe Ser Ile Asp Asp Ile Asp Asp Glu Asp Glu
20 25 30
Phe Thr Thr Asp Leu Gly Val Asn Pro Ile Val Ala Arg Ser Ile Ile
35 40 45
Arg Ser Phe Glu Ser Val Leu Lys Arg Asp Ile Pro Ser Thr Val Phe
50 55 60
Thr Gln Cys Pro Thr Ile Lys Glu Phe Arg Gly Gly Tyr Phe Gln Ser
65 70 75 80
Cys Ile Asp Ser Ile Thr Glu Pro Lys Asp Asp Leu Ala Val Lys Lys
85 90 95
Ala Ala Thr Ala His Gly Gly Asn Lys Lys Ser Thr Thr Asn Thr Thr
100 105 110
Arg Thr Arg Ala Arg Val Pro Ile Ser Ile Val Leu Gln Gly Lys Pro
115 120 125
Thr Met Asp Cys Ala Glu Lys Thr Asn Ile Phe Leu Leu Pro Asp Gly
130 135 140
Ser Gly Ser Gly Met Ala Tyr Val Glu Met Pro Leu Ile Asp Pro Ser
145 150 155 160
Thr Val Cys Leu Val Ala Leu Asn Ser Pro Tyr Leu Asn Arg Ala Ser
165 170 175
Glu Tyr Cys Cys Ser Ile Glu Glu Ile Ala Arg Glu Tyr Val Gln Glu
180 185 190
Ile Arg Lys Arg Gln Pro His Gly Pro Tyr Val Leu Gly Gly Trp Ser
195 200 205
Ala Gly Gly Tyr Tyr Ser Tyr Glu Val Ala Cys Glu Leu Ile Arg Gln
210 215 220
Gly Glu Arg Val Lys Lys Leu Ile Leu Leu Asp Ser Pro Cys Arg Pro
225 230 235 240
Asp Phe Glu Glu Leu Pro Met Glu Val Val Gln Tyr Leu Ser Lys Lys
245 250 255
Asn Leu Met Gly Asn Trp Asp Arg Ser Ala Arg His Thr Ser Val Pro
260 265 270
Ser Trp Val Ile Glu His Phe Arg Ser Thr Leu Arg Ala Val Arg Glu
275 280 285
Tyr Val Pro Val Pro Met Asp Ala Ala Asp Ala Pro Asp Glu Val Cys
290 295 300
Ile Ile Trp Ser Arg Glu Gly Val Met Pro Ala Asn Gln Leu Arg Arg
305 310 315 320
Thr Gly Leu Asp Leu Arg Val Arg Val Ala Arg Phe Leu Leu Glu Gly
325 330 335
Lys Pro Asp Leu Thr Ser Ala Tyr Gly Trp Asp Arg Leu Phe Pro Gly
340 345 350
Ala His Ile Ser Ile Ser Ser Ile Ser Gly Asn His Phe Thr Leu Ile
355 360 365
Asn Lys Pro Asn Val Ser Val Cys Ser Phe Pro Glu
370 375 380
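
As an illustrative reading aid for the sequence listing above, the following Python sketch shows how the three-letter residue rows can be collapsed into one-letter code, and how a CDS length can be checked against the length of its translated protein. The helper names `residues_from_listing`, `to_one_letter`, and `cds_matches_protein` are hypothetical and do not come from the patent. The length check assumes that each CDS ends in exactly one stop codon, which is consistent with the terminal `tag` codons shown at nucleotide positions 7809 and 1143.

```python
# Illustrative sketch only; helper names are hypothetical, not from the patent.

# Standard three-letter to one-letter amino acid codes (20 common residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def residues_from_listing(lines):
    """Collect three-letter residues from listing rows.

    Position-number rows, lowercase codon rows, and header fields are
    skipped because none of their tokens are keys of THREE_TO_ONE.
    """
    residues = []
    for line in lines:
        for token in line.split():
            if token in THREE_TO_ONE:
                residues.append(token)
    return residues


def to_one_letter(residues):
    """Join three-letter residues into a one-letter protein string."""
    return "".join(THREE_TO_ONE[r] for r in residues)


def cds_matches_protein(cds_length_nt, protein_length_aa):
    """True when the CDS is three nucleotides per residue plus one stop codon."""
    return cds_length_nt == 3 * (protein_length_aa + 1)


# First residue row of SEQ ID NO:18, with its position-number row.
first_rows = [
    "Met Ser Ala Ser Val Glu Thr Ala Trp Ser Gln Cys Leu Arg Ile Ile",
    "1 5 10 15",
]
print(to_one_letter(residues_from_listing(first_rows)))  # MSASVETAWSQCLRII

# SEQ ID NO:17 (1143 nt) against SEQ ID NO:18 (380 residues).
print(cds_matches_protein(1143, 380))   # True
# CDS ending at nucleotide 7809 against SEQ ID NO:16 (2602 residues).
print(cds_matches_protein(7809, 2602))  # True
```

A check of this kind only confirms that the lengths are consistent; it does not verify that each codon actually encodes the residue listed beneath it.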

Claims (32)

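1. A biosynthetic platform comprising a series of isolated enzymes that produce olivetolic acid and analogues thereof from simpler metabolites, the series of enzymes comprising:
a non-reducing polyketide synthase (NRPKS) that converts a set of metabolites comprising hexanoyl-CoA, hexanoic acid, octanoyl-CoA, octanoic acid, and/or analogues thereof into an aromatic diol metabolite; and
a thioesterase that converts the aromatic diol metabolite into olivetolic acid and analogues thereof.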
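2. The biosynthetic platform of claim 1, further comprising:
a highly reducing polyketide synthase (HRPKS) that uses acetyl-CoA, malonyl-CoA, and NADPH to synthesize the set of metabolites selected from hexanoyl-CoA, hexanoic acid, octanoyl-CoA, octanoic acid, and/or analogues thereof.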
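3. The biosynthetic platform of claim 1 or claim 2, wherein the analogue of olivetolic acid is selected from 2-heptyl-4,6-dihydroxybenzoic acid, (E)-2-(hept-1-en-1-yl)-4,6-dihydroxybenzoic acid, and (E)-2,4-dihydroxy-6-(pent-1-en-1-yl)benzoic acid.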
4. The biosynthetic platform of claim 1, wherein one or more of the enzymes making up the biosynthetic platform are from a fungus.
5. The biosynthetic platform of any one of the preceding claims, wherein the series of enzymes making up the biosynthetic platform is from Metarhizium anisopliae, Tolypocladium inflatum, Metarhizium rileyi, and/or Talaromyces islandicus.
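6. The biosynthetic platform of claim 1, wherein the NRPKS has a sequence that is at least 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% identical to the sequence of SEQ ID NO:4.
7. The biosynthetic platform of claim 6, wherein the NRPKS has a sequence that is at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:4, 10, or 16 and contains 1 to 20 conservative amino acid substitutions.
8. The biosynthetic platform of claim 6, wherein the NRPKS comprises the sequence of SEQ ID NO:4, 10, or 16.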
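9. The biosynthetic platform of claim 1, wherein the TE has a sequence that is at least 45%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% identical to the sequence of SEQ ID NO:6.
10. The biosynthetic platform of claim 9, wherein the TE has a sequence that is at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:6, 12, or 18 and contains 1 to 20 conservative amino acid substitutions.
11. The biosynthetic platform of claim 9, wherein the TE comprises the sequence of SEQ ID NO:6, 12, or 18.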
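12. The biosynthetic platform of claim 2, wherein the HRPKS has a sequence that is at least 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% identical to the sequence of SEQ ID NO:2.
13. The biosynthetic platform of claim 12, wherein the HRPKS has a sequence that is at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:2, 8, or 14 and contains 1 to 20 conservative amino acid substitutions.
14. The biosynthetic platform of claim 12, wherein the HRPKS comprises the sequence of SEQ ID NO:2, 8, or 14.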
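15. A linear expression template (LET) for expressing the biosynthetic platform in a cell-free system, the LET comprising polynucleotide sequences encoding the series of enzymes making up the biosynthetic platform of any one of the preceding claims.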
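16. The LET of claim 15, wherein the polynucleotide encodes a polypeptide selected from:
(i) a polypeptide that is at least 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% identical to the sequence of SEQ ID NO:2, 8, or 14;
(ii) a polypeptide that is at least 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% identical to the sequence of SEQ ID NO:4, 10, or 16;
(iii) a polypeptide that is at least 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% identical to the sequence of SEQ ID NO:6, 12, or 18; and
(iv) any combination of (i), (ii), and (iii).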
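17. The LET of claim 15, wherein the LET comprises: a polynucleotide sequence encoding a polypeptide that has HRPKS activity and has a sequence at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:2, 8, or 14;
a polynucleotide sequence encoding a polypeptide that has NRPKS activity and has a sequence at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:4, 10, or 16;
a polynucleotide sequence encoding a polypeptide that has TE activity and has a sequence at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:6, 12, or 18.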
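18. The LET of claim 15, wherein the LET comprises a polynucleotide sequence selected from SEQ ID NO:1, 7, and 13.
19. The LET of claim 15 or 18, wherein the LET comprises a polynucleotide sequence selected from SEQ ID NO:3, 9, and 15.
20. The LET of claim 15, 18, or 19, wherein the LET comprises a polynucleotide sequence selected from SEQ ID NO:5, 11, and 17.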
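21. One or more plasmids or one or more vectors comprising polynucleotide sequences encoding the series of enzymes making up the biosynthetic platform of any one of the preceding claims.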
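22. The one or more plasmids or one or more vectors of claim 21, wherein:
a first plasmid comprises a polynucleotide sequence encoding a polypeptide having HRPKS activity, and wherein the polypeptide has a sequence at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:2, 8, or 14;
wherein a second plasmid comprises a polynucleotide sequence encoding a polypeptide having NRPKS activity, and wherein the polypeptide has a sequence at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:4, 10, or 16;
and wherein a third plasmid comprises a polynucleotide sequence encoding a polypeptide having TE activity, and wherein the polypeptide has a sequence at least 95%, 98%, or 99% identical to the sequence of SEQ ID NO:6, 12, or 18.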
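23. A recombinant microorganism comprising the one or more plasmids or one or more vectors of claim 21 or claim 22.
24. The recombinant microorganism of claim 23, wherein the recombinant microorganism is a bacterium, an archaeon, or a fungus.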
25. The recombinant microorganism of claim 24, wherein the recombinant microorganism is a bacterium selected from: Escherichia coli, Rhodobacter sphaeroides, Pseudoalteromonas haloplanktis, Shewanella sp. strain Ac10, Pseudomonas fluorescens, Pseudomonas putida, Pseudomonas aeruginosa, Halomonas elongata, Chromohalobacter salexigens, Streptomyces lividans, Streptomyces griseus, Nocardia lactamdurans, Mycobacterium smegmatis, Corynebacterium glutamicum, Corynebacterium ammoniagenes, Brevibacterium lactofermentum, Bacillus subtilis, Bacillus brevis, Bacillus megaterium, Bacillus licheniformis, Bacillus amyloliquefaciens, Lactococcus lactis, Lactobacillus plantarum, Lactobacillus casei, Lactobacillus reuteri, and Lactobacillus gasseri.
26. The recombinant microorganism of claim 25, wherein the recombinant microorganism is of the genus Escherichia or is Pseudomonas putida.
27. The recombinant microorganism of claim 24, wherein the recombinant microorganism is a fungus selected from: Saccharomyces cerevisiae, Kluyveromyces lactis, Pichia pastoris, Hansenula polymorpha, Yarrowia lipolytica, Aspergillus nidulans, Trichoderma reesei, Fusarium oxysporum, Phanerochaete chrysosporium, Ashbya gossypii, A. oryzae, and Chrysosporium lucknowense.
28. The recombinant microorganism of claim 27, wherein the recombinant microorganism is Aspergillus nidulans or Saccharomyces cerevisiae.
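29. A cell-free method of producing olivetolic acid and analogues thereof, the cell-free method comprising:
supplying acetyl-CoA, malonyl-CoA, and NADPH to a cell-free biological system comprising the LET of any one of claims 15-20, or to a crude extract or purified extract comprising the biosynthetic platform, wherein the biosynthetic platform is extracted from the recombinant microorganism of any one of claims 23 to 28.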
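30. A method of producing olivetolic acid and analogues thereof, the method comprising:
culturing the recombinant microorganism of any one of claims 23 to 28.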
31. The method of claim 29, further comprising:
isolating and purifying the olivetolic acid and analogues thereof.
32. The method of claim 30, further comprising:
isolating and purifying the olivetolic acid and analogues thereof.
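
Several of the claims above recite percent sequence identity thresholds (for example, at least 95%, 98%, or 99% identical to a SEQ ID NO). The claims as reproduced here do not specify how identity is computed, so the following Python sketch is only one common convention, stated as an assumption: two sequences that have already been aligned (equal length, with `-` marking gaps) are compared column by column, and identity is the number of matching residue columns divided by the number of alignment columns. The function names and the example variant are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences ('-' marks a gap).

    Assumption: identity = identical residue columns / alignment columns,
    ignoring columns where both sequences have a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b)
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no alignment columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def highest_threshold_met(identity, thresholds=(50, 60, 70, 80, 90, 95, 98, 99)):
    """Return the largest recited threshold that the identity value reaches."""
    met = [t for t in thresholds if identity >= t]
    return max(met) if met else None


# First 16 residues of SEQ ID NO: 18 in one-letter code, and a hypothetical
# variant carrying a single Arg-to-Lys substitution at position 14.
reference = "MSASVETAWSQCLRII"
variant = "MSASVETAWSQCLKII"

identity = percent_identity(reference, variant)
print(f"{identity:.2f}% identical; highest threshold met: "
      f"{highest_threshold_met(identity)}%")
```

In practice the alignment itself would come from a dedicated alignment tool, and different tools normalize identity differently (for example, by the shorter sequence length), so the numbers from this sketch are illustrative only.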
CN202180016363.2A 2020-01-10 2021-01-09 Biosynthetic platform for the production of olivetolic acid and analogues of olivetolic acid Pending CN115151643A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US202062959849P 2020-01-10 2020-01-10
US62/959,849 2020-01-10
PCT/US2021/012866 WO2021142393A1 (en) 2020-01-10 2021-01-09 Biosynthetic platform for the production of olivetolic acid and analogues of olivetolic acid

Publications (1)

Publication Number Publication Date
CN115151643A (zh) 2022-10-04

Family

ID=76788339

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202180016363.2A Pending CN115151643A (zh) 2020-01-10 2021-01-09 Biosynthetic platform for the production of olivetolic acid and analogues of olivetolic acid

Country Status (9)

Country Link
US (1) US20230051453A1 (zh)
EP (1) EP4087932A4 (zh)
JP (1) JP2023509662A (zh)
KR (1) KR20220126740A (zh)
CN (1) CN115151643A (zh)
BR (1) BR112022013503A2 (zh)
CA (1) CA3163708A1 (zh)
MX (1) MX2022008463A (zh)
WO (1) WO2021142393A1 (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220106616A1 (en) * 2019-02-10 2022-04-07 Dyadic International (Usa), Inc. Production of cannabinoids in filamentous fungi

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ES2898272T3 (es) * 2017-04-27 2022-03-04 Univ California Microorganisms and methods for producing cannabinoids and cannabinoid derivatives

Also Published As

Publication number Publication date
EP4087932A1 (en) 2022-11-16
KR20220126740A (ko) 2022-09-16
MX2022008463A (es) 2022-10-18
JP2023509662A (ja) 2023-03-09
EP4087932A4 (en) 2024-01-17
US20230051453A1 (en) 2023-02-16
BR112022013503A2 (pt) 2022-09-13
WO2021142393A1 (en) 2021-07-15
CA3163708A1 (en) 2021-07-15

Similar Documents

Publication Publication Date Title
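CN110651047B (zh) Methods and cell lines for producing phytocannabinoids and phytocannabinoid analogues in yeast
CN112789505B (zh) Biosynthetic platform for producing cannabinoids and other prenylated compounds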
EP2935566B1 (en) Cyanobacterium sp. for production of compounds
KR20100087695A (ko) Microorganisms engineered to produce isopropanol
WO2010132845A1 (en) Organisms for the production of cyclohexanone
US11473110B2 (en) Xylitol producing Metschnikowia species
US20220333142A1 (en) Engineered trans-enoyl coa reductase and methods of making and using
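CN107231807B (zh) Genetically modified phenylpyruvate decarboxylase, and preparation method and use thereof
CN110730820A (zh) Aldehyde dehydrogenase variants and methods of use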
WO2021042057A1 (en) Systems and methods for preparing cannabinoids and derivatives
CN115003823A (zh) Biosynthetic platform for producing cannabinoids and other prenylated compounds
US20220411766A1 (en) Compositions and methods for using genetically modified orthologous enzymes
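CN115151643A (zh) Biosynthetic platform for the production of olivetolic acid and analogues of olivetolic acid
CN105940111B (zh) Production of alkenes from 3-hydroxycarboxylic acids via 3-hydroxycarboxyl-nucleotides
KR20190020121A (ko) Host cells and methods for producing hydroxytyrosol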
US20230348865A1 (en) Engineered enzymes and methods of making and using
WO2023076966A1 (en) Engineered enzymes and methods of making and using
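JP5947470B2 (ja) Method for improving the substance productivity of microorganisms and kit for use in said method
CN113906045A (zh) Kaurenoic acid 13-hydroxylase (KAH) variants and uses thereof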

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination