CN116615554A - Preparation of oxygenated diterpenoid compounds


Info

Publication number
CN116615554A
CN116615554A CN202180051016.3A CN202180051016A CN116615554A CN 116615554 A CN116615554 A CN 116615554A CN 202180051016 A CN202180051016 A CN 202180051016A CN 116615554 A CN116615554 A CN 116615554A
Authority
CN
China
Prior art keywords
leu
ala
glu
ser
lys
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202180051016.3A
Other languages
English (en)
Inventor
J. Andersen-Ranberg
N. L. Hansen
V. Forman
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kobenhavns Universitet
Original Assignee
Kobenhavns Universitet
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kobenhavns Universitet
Publication of CN116615554A


Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P5/00Preparation of hydrocarbons or halogenated hydrocarbons
    • C12P5/007Preparation of hydrocarbons or halogenated hydrocarbons containing one or more isoprene units, i.e. terpenes
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07CACYCLIC OR CARBOCYCLIC COMPOUNDS
    • C07C35/00Compounds having at least one hydroxy or O-metal group bound to a carbon atom of a ring other than a six-membered aromatic ring
    • C07C35/22Compounds having at least one hydroxy or O-metal group bound to a carbon atom of a ring other than a six-membered aromatic ring polycyclic, at least one hydroxy group bound to a condensed ring system
    • C07C35/37Compounds having at least one hydroxy or O-metal group bound to a carbon atom of a ring other than a six-membered aromatic ring polycyclic, at least one hydroxy group bound to a condensed ring system with a hydroxy group on a condensed system having three rings
    • C07C35/42Compounds having at least one hydroxy or O-metal group bound to a carbon atom of a ring other than a six-membered aromatic ring polycyclic, at least one hydroxy group bound to a condensed ring system with a hydroxy group on a condensed system having three rings derived from the phenanthrene skeleton
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07CACYCLIC OR CARBOCYCLIC COMPOUNDS
    • C07C39/00Compounds having at least one hydroxy or O-metal group bound to a carbon atom of a six-membered aromatic ring
    • C07C39/12Compounds having at least one hydroxy or O-metal group bound to a carbon atom of a six-membered aromatic ring polycyclic with no unsaturation outside the aromatic rings
    • C07C39/17Compounds having at least one hydroxy or O-metal group bound to a carbon atom of a six-membered aromatic ring polycyclic with no unsaturation outside the aromatic rings containing other rings in addition to the six-membered aromatic rings, e.g. cyclohexylphenol
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07CACYCLIC OR CARBOCYCLIC COMPOUNDS
    • C07C39/00Compounds having at least one hydroxy or O-metal group bound to a carbon atom of a six-membered aromatic ring
    • C07C39/205Compounds having at least one hydroxy or O-metal group bound to a carbon atom of a six-membered aromatic ring polycyclic, containing only six-membered aromatic rings as cyclic parts with unsaturation outside the rings
    • C07C39/225Compounds having at least one hydroxy or O-metal group bound to a carbon atom of a six-membered aromatic ring polycyclic, containing only six-membered aromatic rings as cyclic parts with unsaturation outside the rings with at least one hydroxy group on a condensed ring system
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07CACYCLIC OR CARBOCYCLIC COMPOUNDS
    • C07C403/00Derivatives of cyclohexane or of a cyclohexene or of cyclohexadiene, having a side-chain containing an acyclic unsaturated part of at least four carbon atoms, this part being directly attached to the cyclohexane or cyclohexene or cyclohexadiene rings, e.g. vitamin A, beta-carotene, beta-ionone
    • C07C403/06Derivatives of cyclohexane or of a cyclohexene or of cyclohexadiene, having a side-chain containing an acyclic unsaturated part of at least four carbon atoms, this part being directly attached to the cyclohexane or cyclohexene or cyclohexadiene rings, e.g. vitamin A, beta-carotene, beta-ionone having side-chains substituted by singly-bound oxygen atoms
    • C07C403/08Derivatives of cyclohexane or of a cyclohexene or of cyclohexadiene, having a side-chain containing an acyclic unsaturated part of at least four carbon atoms, this part being directly attached to the cyclohexane or cyclohexene or cyclohexadiene rings, e.g. vitamin A, beta-carotene, beta-ionone having side-chains substituted by singly-bound oxygen atoms by hydroxy groups
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07CACYCLIC OR CARBOCYCLIC COMPOUNDS
    • C07C47/00Compounds having —CHO groups
    • C07C47/38Unsaturated compounds having —CHO groups bound to carbon atoms of rings other than six—membered aromatic rings
    • C07C47/46Unsaturated compounds having —CHO groups bound to carbon atoms of rings other than six—membered aromatic rings containing hydroxy groups
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07CACYCLIC OR CARBOCYCLIC COMPOUNDS
    • C07C50/00Quinones
    • C07C50/26Quinones containing groups having oxygen atoms singly bound to carbon atoms
    • C07C50/34Quinones containing groups having oxygen atoms singly bound to carbon atoms the quinoid structure being part of a condensed ring system having three rings
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07CACYCLIC OR CARBOCYCLIC COMPOUNDS
    • C07C69/00Esters of carboxylic acids; Esters of carbonic or haloformic acids
    • C07C69/02Esters of acyclic saturated monocarboxylic acids having the carboxyl group bound to an acyclic carbon atom or to hydrogen
    • C07C69/12Acetic acid esters
    • C07C69/16Acetic acid esters of dihydroxylic compounds
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07DHETEROCYCLIC COMPOUNDS
    • C07D209/00Heterocyclic compounds containing five-membered rings, condensed with other rings, with one nitrogen atom as the only ring hetero atom
    • C07D209/56Ring systems containing three or more rings
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07DHETEROCYCLIC COMPOUNDS
    • C07D307/00Heterocyclic compounds containing five-membered rings having one oxygen atom as the only ring hetero atom
    • C07D307/77Heterocyclic compounds containing five-membered rings having one oxygen atom as the only ring hetero atom ortho- or peri-condensed with carbocyclic rings or ring systems
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07DHETEROCYCLIC COMPOUNDS
    • C07D493/00Heterocyclic compounds containing oxygen atoms as the only ring hetero atoms in the condensed system
    • C07D493/22Heterocyclic compounds containing oxygen atoms as the only ring hetero atoms in the condensed system in which the condensed system contains four or more hetero rings
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/52Genes encoding for enzymes or proenzymes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8242Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
    • C12N15/8243Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0071Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1085Transferases (2.) transferring alkyl or aryl groups other than methyl groups (2.5)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/88Lyases (4.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/90Isomerases (5.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P15/00Preparation of compounds containing at least three condensed carbocyclic rings
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P17/00Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
    • C12P17/02Oxygen as only ring hetero atoms
    • C12P17/04Oxygen as only ring hetero atoms containing a five-membered hetero ring, e.g. griseofulvin, vitamin C
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P17/00Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
    • C12P17/18Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms containing at least two hetero rings condensed among themselves or condensed with a common carbocyclic ring system, e.g. rifamycin
    • C12P17/181Heterocyclic compounds containing oxygen atoms as the only ring heteroatoms in the condensed system, e.g. Salinomycin, Septamycin
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P7/00Preparation of oxygen-containing organic compounds
    • C12P7/02Preparation of oxygen-containing organic compounds containing a hydroxy group
    • C12P7/04Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
    • C12P7/18Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic polyhydric
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P7/00Preparation of oxygen-containing organic compounds
    • C12P7/02Preparation of oxygen-containing organic compounds containing a hydroxy group
    • C12P7/22Preparation of oxygen-containing organic compounds containing a hydroxy group aromatic
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P7/00Preparation of oxygen-containing organic compounds
    • C12P7/24Preparation of oxygen-containing organic compounds containing a carbonyl group
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07BGENERAL METHODS OF ORGANIC CHEMISTRY; APPARATUS THEREFOR
    • C07B2200/00Indexing scheme relating to specific properties of organic compounds
    • C07B2200/07Optical isomers
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07CACYCLIC OR CARBOCYCLIC COMPOUNDS
    • C07C2601/00Systems containing only non-condensed rings
    • C07C2601/12Systems containing only non-condensed rings with a six-membered ring
    • C07C2601/14The ring being saturated
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07CACYCLIC OR CARBOCYCLIC COMPOUNDS
    • C07C2602/00Systems containing two condensed rings
    • C07C2602/02Systems containing two condensed rings the rings having only two atoms in common
    • C07C2602/14All rings being cycloaliphatic
    • C07C2602/26All rings being cycloaliphatic the ring system containing ten carbon atoms
    • C07C2602/28Hydrogenated naphthalenes
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07CACYCLIC OR CARBOCYCLIC COMPOUNDS
    • C07C2603/00Systems containing at least three condensed rings
    • C07C2603/02Ortho- or ortho- and peri-condensed systems
    • C07C2603/04Ortho- or ortho- and peri-condensed systems containing three rings
    • C07C2603/22Ortho- or ortho- and peri-condensed systems containing three rings containing only six-membered rings
    • C07C2603/26Phenanthrenes; Hydrogenated phenanthrenes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12RINDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
    • C12R2001/00Microorganisms ; Processes using microorganisms
    • C12R2001/645Fungi ; Processes using fungi
    • C12R2001/85Saccharomyces
    • C12R2001/865Saccharomyces cerevisiae

Landscapes

  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Genetics & Genomics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Nutrition Science (AREA)
  • Cell Biology (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Organic Low-Molecular-Weight Compounds And Preparation Thereof (AREA)
  • Enzymes And Modification Thereof (AREA)

Abstract

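The invention discloses a method for preparing oxygenated diterpenoid compounds, such as triptophenolide, triptonide and triptolide, by inserting genes encoding specific cytochrome P450 enzymes into a selected host cell and expressing those genes so that the compounds are synthesized. Specific cytochrome P450 enzymes suitable for this synthesis are also disclosed.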

Description

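Preparation of oxygenated diterpenoid compounds
Reference to a sequence listing
This application contains a sequence listing in computer-readable form, which is incorporated herein by reference.
Technical field
The present invention relates to the production of oxygenated diterpenoid compounds in recombinant cells, such as yeast cells. Oxygenated diterpenoid compounds can serve as intermediates or end products in the synthesis of useful bioactive compounds, for example pharmaceuticals for treating diseases such as cancer. The invention also relates to genes, enzymes and cells, for example yeast cells, that are particularly suited to producing such compounds.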
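Background
Terpenes are a diverse group of compounds built from the basic five-carbon unit isoprene (2-methyl-1,3-butadiene). Diterpenes are compounds with a 20-carbon skeleton. They are produced by diterpene synthases, enzymes that convert geranylgeranyl diphosphate (GGPP) into a diterpene skeleton, which can be modified further to give a wide range of diterpene and diterpenoid compounds.
Diterpenes, diterpenoids and their derivatives are widely used as pharmaceuticals, cosmetics, nutraceuticals, flavours, fragrances and pesticides. Methods for increasing the production of these compounds in natural or engineered cells are abundant in the art.
The Chinese medicinal plant Tripterygium wilfordii is known to produce several sesquiterpenoids, diterpenoids and triterpenoids with potential pharmacological properties, including the diterpenoid compounds triptonide and triptolide. Triptolide, an oxygenated diterpenoid compound, and its derivatives have been identified as potentially valuable pharmacological compounds and are being investigated as immunosuppressants and for cancer treatment. Triptolide may further be useful for treating COVID-19. Triptonide may be useful as a male contraceptive.
Producing valuable molecules from renewable feedstocks with engineered microorganisms is an attractive alternative to traditional production methods. However, achieving economically viable yields, titers and productivities is a major obstacle to industrialization.
N. L. Hansen et al. (2017), The Plant Journal 2017, 89, 429-441, describe a diterpene synthase capable of converting GGPP into the diterpene miltiradiene, a precursor of triptolide. P. Su et al. (2018), The Plant Journal 2018, 93, 50-65, and J. Guo et al. (2013), PNAS 2013, 110, 12108-12113, confirmed this finding.
The complete pathway that converts miltiradiene into other diterpenoid compounds, such as triptolide, has not yet been elucidated.
Cytochrome P450 enzymes (CYPs) are involved in terpenoid biosynthesis. For many cytochrome P450 enzymes, nothing is known about the substrates they act on, the compounds they produce, or their role in the biosynthesis of particular compounds.
US 20190270971A1 discloses methods for improving the productivity of microbial host cells that functionally express P450 enzymes. The document describes how to modify P450 genes to improve their performance in microorganisms such as yeast, and mentions that co-expression with a cytochrome P450 reductase helps increase yield. Triptolide is mentioned as a possible subject of P450 chemistry, but the document provides no link between triptolide and any specific P450 enzyme or cytochrome.
CN 108395997A describes yeast with increased GGPP production. The yeast is transformed with different diterpene synthases and P450 enzymes to synthesize diterpenoid compounds. The team of scientists behind that patent is also behind further patents and patent applications disclosing the synthesis of different diterpenoid and triterpenoid compounds using suitable terpene synthases and P450 enzymes, for example CN 108866029 (friedelin), CN 107058419 (kaurene-type) and WO 2020029564 (friedelin and amyrins).
CN 110747178A describes the P450 gene TwCYP728B70 as encoding a cytochrome P450 enzyme with a role in triptolide synthesis.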
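Summary of the invention
The inventors have solved the problem of providing an improved method for preparing oxygenated diterpenoid compounds such as triptophenolide, triptonide and triptolide.
In a first aspect, the invention relates to a method for preparing an oxygenated diterpenoid compound, the method comprising the steps of:
a. providing a host cell capable of producing miltiradiene and/or dehydroabietadiene;
b. transforming the host cell with a first gene encoding an enzyme having cytochrome P450 activity;
c. growing the transformed cell under conditions leading to expression of the transformed gene, whereby the oxygenated diterpenoid compound is formed;
wherein:
the first gene encoding an enzyme having cytochrome P450 activity encodes a polypeptide comprising an amino acid sequence having at least 80% sequence identity, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity to SEQ ID NO:1 (TwCYP82D274v1), SEQ ID NO:2 (TwCYP82D274v2), SEQ ID NO:74 (TwCYP82D274v3) or SEQ ID NO:75 (TwCYP82D274v4), or a mature polypeptide thereof.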
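In a second aspect, the invention relates to a method for preparing an oxygenated diterpenoid compound that comprises transforming the host cell with the first gene encoding an enzyme having cytochrome P450 activity, and further with a second gene encoding a second enzyme having cytochrome P450 activity and a third gene encoding a third enzyme having cytochrome P450 activity,
wherein:
the second gene encoding an enzyme having cytochrome P450 activity encodes a polypeptide comprising an amino acid sequence having at least 80% sequence identity, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity to SEQ ID NO:3 (TwCYP71BE85), or a mature polypeptide thereof; and
the third gene encoding an enzyme having cytochrome P450 activity encodes a polypeptide comprising an amino acid sequence having at least 80% sequence identity, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity to SEQ ID NO:4 (TwCYP71BE86), or a mature polypeptide thereof.
In a third aspect, the invention relates to a method for preparing an oxygenated diterpenoid compound, wherein the method comprises transforming the host cell with the first, second and third genes encoding enzymes having cytochrome P450 activity, and further transforming the host cell with a fourth gene encoding a fourth enzyme having cytochrome P450 activity;
wherein:
the fourth gene encoding an enzyme having cytochrome P450 activity encodes a polypeptide comprising an amino acid sequence having at least 80% sequence identity, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity to SEQ ID NO:5 (TwCYP82D213v1) or SEQ ID NO:76 (TwCYP82D213v2), or a mature polypeptide thereof.
According to the invention, the useful oxygenated diterpenoid compounds triptophenolide, triptonide or triptolide can be provided.
The invention also relates to polypeptides, polynucleotides, plasmids, expression constructs and recombinant host cells for use in the methods of the invention.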
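Brief description of the drawings
Figure 1 shows LC-MS profiles of extracts from N. benthamiana leaves expressing miltiradiene biosynthesis genes and selected T. wilfordii CYPs.
The TwCYPs were co-expressed with CfDXS, CfGGPPS, CfTPS1 and CfTPS3. "3xSTD" denotes an LC-MS run of a mixture of three authentic standards: triptolide, triptonide and triptophenolide. Solid lines show ion chromatograms over the range m/z 280-380. Dashed lines (-------) show extracted ion chromatograms at m/z 313.1800±0.015, corresponding to the parent ion of triptophenolide, [M+H]+. Dashed lines (---) show extracted ion chromatograms at m/z 359.1490±0.015, corresponding to the parent ion of triptonide, [M+H]+. LC protocol 1 was used. See Example 1 for details.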
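The two extraction windows quoted for Figure 1 can be sanity-checked from the elemental composition of the analytes. The sketch below is illustrative only: the molecular formulas (C20H24O3 for triptophenolide, C20H22O6 for triptonide) and the monoisotopic masses are taken from general chemical references rather than from this document, and the helper names are hypothetical.

```python
# Monoisotopic masses (u) of the most abundant isotopes; standard reference values.
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "O": 15.99491462}
PROTON = 1.00727646  # mass of a proton (a hydrogen atom minus one electron)

# Assumption: molecular formulas from general references, not from the patent text.
FORMULAS = {
    "triptophenolide": {"C": 20, "H": 24, "O": 3},
    "triptonide": {"C": 20, "H": 22, "O": 6},
}


def mz_protonated(formula):
    """Return the m/z of the singly charged [M+H]+ ion for an elemental formula."""
    neutral = sum(MONOISOTOPIC[element] * count for element, count in formula.items())
    return neutral + PROTON


def in_window(mz, centre, tolerance=0.015):
    """Return True if mz lies inside the extraction window centre +/- tolerance."""
    return abs(mz - centre) <= tolerance


if __name__ == "__main__":
    # Window centres as quoted in the description of Figure 1.
    for name, centre in (("triptophenolide", 313.1800), ("triptonide", 359.1490)):
        mz = mz_protonated(FORMULAS[name])
        print(f"{name}: [M+H]+ = {mz:.4f}, inside window: {in_window(mz, centre)}")
```

Under these assumptions both calculated values fall well inside the ±0.015 windows, which is consistent with the chromatograms being extracted at the protonated parent ions.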
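Figure 2 shows LC-MS profiles of extracts from genetically engineered Saccharomyces cerevisiae (S. cerevisiae) strains.
In the background strain (-), genes encoding the diterpene biosynthesis enzymes SpGGPPs7, CftTPS1, CftTPS3 and TwCPR1 were integrated into the genome of wild-type S. cerevisiae. In the TwCYP82D274v1 strain, the diterpene biosynthesis enzymes were expressed together with TwCYP82D274v1, leading to formation of compound (3), identified as 14-OH-dehydroabietadiene and marked in grey. LC method 1 was used for the analysis. See Example 3 for details.
Figure 3 shows the 1H NMR spectrum of 14-OH-dehydroabietadiene in CDCl3 at 599.85 MHz. See Example 4 for details.
Figure 4 shows the 13C NMR spectrum of 14-OH-dehydroabietadiene in CDCl3 at 150.83 MHz. See Example 4 for details.
Figure 5 shows LC-MS profiles of extracts from yeast with the indicated gene combinations integrated into the genome.
All yeast strains carry genomically integrated SpGGPPs7, CftTPS1, CftTPS3 and TwCPR1. "0.5 ppm 3xSTD" denotes an LC-MS run of a mixture of three authentic standards: triptolide, triptonide and triptophenolide. Non-dashed lines show ion chromatograms over the range m/z 280-380. Dashed lines show extracted ion chromatograms at m/z 359.1490±0.015, corresponding to the parent ion of triptonide ([M+H]+). LC protocol 2 was used. See Example 5 for details.
Figure 6 shows co-expression of different variants of TwCYP and B5 proteins isolated from Tripterygium wilfordii.
Levels of triptophenolide, triptonide and 14-OH-dehydroabietadiene quantified from cultures of engineered S. cerevisiae strains. Each column represents an engineered yeast strain and its production of the selected compounds. The genes integrated into each individual strain are indicated in the panel below. Quantification is based on the peak area representing each compound of interest. Each compound has its own scale. See Example 5 for details.
Figure 7 shows the relative amounts (bars) of key intermediates in the proposed biosynthetic pathway to triptonide when the pathway is established in vivo by heterologous gene expression in N. benthamiana (panels A and D) and S. cerevisiae (panels B and E; strains listed in Table 3). Gene expression is indicated by black squares on the left, and relative amounts by bars (mean of 3-4 biological replicates; black diamonds), with white and grey fill distinguishing expression and no expression of TwB5#1. Error bars show the standard deviation. "DiTPS" denotes CftTPS1 and CftTPS3. For peak-area quantification, the tolerances on the characteristic masses were ±0.1 m/z for GC-MS (miltiradiene and 14-OH-dehydroabietadiene) and ±0.005 m/z for LC-MS (all other compounds). Panel C: hypothetical biosynthetic pathway from miltiradiene to triptonide in vivo in N. benthamiana and S. cerevisiae, including a Wagner-Meerwein rearrangement to account for the shift of the C-19 or C-18 methyl group to C-3 in the abietane carbon skeleton.
Figure 8 shows the accumulation of triptophenolide and triptonide produced by yeast strain NVJ8.15 over 7 days of growth in a bioreactor. The levels of triptonide (solid black line) and triptophenolide (dashed black line) are shown as absolute amounts (ppm, w/v) in culture samples taken daily. Biomass was quantified by absorbance at 600 nm (dashed grey line).
Figure 9 compares yeast strains that express the genes required for triptonide biosynthesis, but with gene variants substituted for TwCYP82D274v1 or TwCYP82D213v1, in terms of their ability to produce triptophenolide (panel A) and triptonide (panel B); the strains give similar LC-MS profiles (panel C). Genes present in the engineered strains are indicated by black squares. Panels A and B: bars show mean relative amounts (2-3 biological replicates, crosses), and error bars show the standard error. From left to right, the bars represent yeast strains NVJ10-1, NVJ10-3, NVJ10-6 and NVJ10-8 (see Table 3). Panel C: EICs (m/z 280-360) from LC-MS analysis of yeast cultures. From top to bottom, the paired chromatograms represent yeast strains NVJ10-1, NVJ10-3, NVJ10-6 and NVJ10-8.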
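Brief description of the sequences
SEQ ID NO:1 shows the amino acid sequence of a cytochrome P450 enzyme derived from Tripterygium wilfordii. The enzyme is also called TwCYP82D274v1.
SEQ ID NO:2 shows the amino acid sequence of a cytochrome P450 enzyme derived from Tripterygium wilfordii. The enzyme is also called TwCYP82D274v2. SEQ ID NO:2 differs from SEQ ID NO:1 at only three positions, so SEQ ID NO:1 and SEQ ID NO:2 are assumed to represent different alleles of the same gene.
SEQ ID NO:3 shows the amino acid sequence of a cytochrome P450 enzyme derived from Tripterygium wilfordii. The enzyme is also called TwCYP71BE85.
SEQ ID NO:4 shows the amino acid sequence of a cytochrome P450 enzyme derived from T. wilfordii. The enzyme is also called TwCYP71BE86.
SEQ ID NO:5 shows the amino acid sequence of a cytochrome P450 enzyme derived from T. wilfordii. The enzyme is also called TwCYP82D213v1.
SEQ ID NO:6 shows the amino acid sequence of a cytochrome P450 enzyme derived from T. wilfordii. The enzyme is also called TwCYP82D217.
SEQ ID NO:7 shows the amino acid sequence of a cytochrome P450 enzyme derived from Tripterygium wilfordii. The enzyme is also called TwCYP82D275.
SEQ ID NO:8 shows the amino acid sequence of a cytochrome B5 enzyme derived from Tripterygium wilfordii. The enzyme is also called TwB5#1.
SEQ ID NO:9 shows the amino acid sequence of a cytochrome P450 reductase derived from Tripterygium wilfordii. The enzyme is also called TwCPR1.
SEQ ID NO:10-66 show the PCR primers described further in Example 2.
SEQ ID NO:67 shows the amino acid sequence of the diterpene synthase TPS1 derived from Plectranthus barbatus. The enzyme is also called CfTPS1.
SEQ ID NO:68 shows the amino acid sequence of the diterpene synthase TPS3 derived from Plectranthus barbatus. The enzyme is also called CfTPS3.
SEQ ID NO:69 shows the amino acid sequence of the terpene synthase TPS9 derived from Tripterygium wilfordii. The enzyme is also called TwTPS9.
SEQ ID NO:70 shows the amino acid sequence of a terpene synthase derived from Tripterygium wilfordii. The enzyme is also called TwTPS27.
SEQ ID NO:71 shows the amino acid sequence of the copalyl diphosphate synthase CPS1 derived from Salvia miltiorrhiza. The enzyme is also called SmCPS.
SEQ ID NO:72 shows the amino acid sequence of the miltiradiene synthase KSL1 derived from Salvia miltiorrhiza. The enzyme is also called SmKSL.
SEQ ID NO:73 shows the amino acid sequence of a geranylgeranyl diphosphate synthase derived from Synechococcus sp. The enzyme is also called SpGGPPs7v1.
SEQ ID NO:74 shows the amino acid sequence of a cytochrome P450 enzyme derived from Tripterygium wilfordii. The enzyme is also called TwCYP82D274v3.
SEQ ID NO:75 shows the amino acid sequence of a cytochrome P450 enzyme derived from Tripterygium wilfordii. The enzyme is also called TwCYP82D274v4.
SEQ ID NO:76 shows the amino acid sequence of a cytochrome P450 enzyme derived from Tripterygium wilfordii. The enzyme is also called TwCYP82D213v2.
SEQ ID NO:77 shows the truncated amino acid sequence of the diterpene synthase TPS1 derived from Plectranthus barbatus. The sequence is truncated to remove the transit peptide. The enzyme is also called CftTPS1.
SEQ ID NO:78 shows the truncated amino acid sequence of the diterpene synthase TPS3 derived from Plectranthus barbatus. The sequence is truncated to remove the transit peptide. The enzyme is also called CftTPS3.
SEQ ID NO:79 shows the amino acid sequence of the DXS enzyme derived from Plectranthus barbatus. The enzyme is also called CfDXS.
SEQ ID NO:80 shows the amino acid sequence of a truncated HMGR enzyme derived from Saccharomyces cerevisiae. The enzyme is also called SctHMGR.
SEQ ID NO:81 shows the amino acid sequence of a geranylgeranyl diphosphate synthase derived from Synechococcus sp. The enzyme is also called SpGGPPs7v2.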
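Detailed description
According to the first aspect of the invention, there is provided a method for preparing an oxygenated diterpenoid compound, the method comprising the steps of:
a. providing a host cell capable of producing miltiradiene and/or dehydroabietadiene;
b. transforming the host cell with a first gene encoding an enzyme having cytochrome P450 activity;
c. growing the transformed cell under conditions leading to expression of the transformed gene, whereby the oxygenated diterpenoid compound is formed;
wherein:
the first gene encoding an enzyme having cytochrome P450 activity encodes a polypeptide comprising or consisting of SEQ ID NO:1, or an amino acid sequence having at least 80% sequence identity, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity to SEQ ID NO:1 (TwCYP82D274v1), or a mature polypeptide thereof.
The polypeptide having SEQ ID NO:1 is a preferred example of a polypeptide having cytochrome P450 activity encoded by the first gene; the polypeptides having SEQ ID NO:2, SEQ ID NO:74 and SEQ ID NO:75 are further examples of such polypeptides.
Accordingly, the enzyme encoded by the first gene is able to convert miltiradiene and/or dehydroabietadiene into 14-OH-dehydroabietadiene by inserting an OH group at position 14 of the diterpene skeleton.
In some embodiments, the synthesis of 14-OH-dehydroabietadiene proceeds via the compound 14-OH-miltiradiene, which is subsequently converted into 14-OH-dehydroabietadiene. However, the invention is not limited to any particular mechanism for converting miltiradiene into 14-OH-dehydroabietadiene.
Use of the method according to the first aspect of the invention results in formation of the oxygenated diterpenoid compound 14-OH-dehydroabietadiene,
14-OH-dehydroabietadiene
which is a useful intermediate for synthesizing pharmaceutically useful oxygenated diterpenoid compounds, including well-known compounds such as triptophenolide, triptonide and triptolide.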
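In the second aspect of the invention, method step b. comprises transforming the host cell with the first gene encoding an enzyme having cytochrome P450 activity, and further transforming the host cell with a second gene encoding a second enzyme having cytochrome P450 activity and a third gene encoding a third enzyme having cytochrome P450 activity,
wherein:
the second gene encoding an enzyme having cytochrome P450 activity encodes a polypeptide comprising or consisting of SEQ ID NO:4, or an amino acid sequence having at least 80% sequence identity, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity to SEQ ID NO:4 (TwCYP71BE86), or a mature polypeptide thereof; and
the third gene encoding an enzyme having cytochrome P450 activity encodes a polypeptide comprising or consisting of SEQ ID NO:3, or an amino acid sequence having at least 80% sequence identity, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity to SEQ ID NO:3 (TwCYP71BE85), or a mature polypeptide thereof.
In this second aspect, the host cell preferably further produces the oxygenated diterpenoid compound triptophenolide, (3bR,9bS)-6-hydroxy-9b-methyl-7-propan-2-yl-3,3b,4,5,10,11-hexahydronaphtho[2,1-e]isobenzofuran-1-one.
Triptophenolide
Triptophenolide is a valuable compound that has been identified as an antiandrogen. It can also serve as a starting point for further modification to give additional bioactive compounds.
In a preferred embodiment of the second aspect of the invention, the host cell is further transformed with a fifth gene encoding a polypeptide having cytochrome B5 activity, the polypeptide comprising or consisting of SEQ ID NO:8, or an amino acid sequence having at least 80% sequence identity, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity to SEQ ID NO:8 (TwB5#1), or a mature polypeptide thereof. Surprisingly, it was found that expressing a polypeptide having cytochrome B5 activity in the same cell that expresses the first, second and third genes encoding enzymes having cytochrome P450 activity leads to a markedly increased yield of oxygenated diterpenoid compounds. Compared with production in a similar cell lacking the polypeptide having cytochrome B5 activity, production is increased by at least 50%, preferably by at least 100%, preferably by at least 200% or even more.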
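In the third aspect of the invention, the host cell is transformed with the first, second and third genes encoding enzymes having cytochrome P450 activity, and is further transformed with a fourth gene encoding a fourth enzyme having cytochrome P450 activity;
wherein:
the fourth gene encoding a fourth enzyme having cytochrome P450 activity encodes a polypeptide comprising or consisting of SEQ ID NO:5, or an amino acid sequence having at least 80% sequence identity, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity to SEQ ID NO:5 (TwCYP82D213v1), or a mature polypeptide thereof. The polypeptide having SEQ ID NO:5 is a preferred example of a polypeptide having cytochrome P450 activity encoded by the fourth gene; the polypeptide having SEQ ID NO:76 is another example of such a polypeptide.
In the third aspect of the invention, the transformed eukaryotic cell preferably produces the oxygenated diterpenoid compound triptonide.
Triptonide
Triptonide has been reported to have strong inhibitory activity in cancer (Fulu Dong et al. 2019, The Prostate, Volume 19, issue 11, pp. 1284-1293). The compound may also be useful as a male contraceptive. In addition, it can serve as a starting point for further modification to give additional bioactive compounds.
In a preferred embodiment of the third aspect of the invention, the host cell is further transformed with a fifth gene encoding a polypeptide having cytochrome B5 activity, the polypeptide comprising or consisting of SEQ ID NO:8, or an amino acid sequence having at least 80% sequence identity, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity to SEQ ID NO:8 (TwB5#1), or a mature polypeptide thereof. Surprisingly, it was found that expressing a polypeptide having cytochrome B5 activity in the same cell that expresses the first, second and third genes encoding enzymes having cytochrome P450 activity leads to a markedly increased yield of oxygenated diterpenoid compounds. Compared with production in a similar cell lacking the polypeptide having cytochrome B5 activity, production is increased by at least 50%, preferably by at least 100%, preferably by at least 200% or even more.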
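In another preferred embodiment of the third aspect of the invention, the host cell is further transformed with a sixth gene encoding a fifth enzyme having cytochrome P450 activity and/or a seventh gene encoding a sixth enzyme having cytochrome P450 activity,
wherein:
the sixth gene encoding a fifth enzyme having cytochrome P450 activity encodes a polypeptide comprising or consisting of SEQ ID NO:6, or an amino acid sequence having at least 80% sequence identity, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity to SEQ ID NO:6 (TwCYP82D217), or a mature polypeptide thereof; and
the seventh gene encoding a sixth enzyme having cytochrome P450 activity encodes a polypeptide comprising or consisting of an amino acid sequence having at least 80% sequence identity, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity to SEQ ID NO:7 (TwCYP82D275), or a mature polypeptide thereof. Surprisingly, it was found that expressing the sixth and/or seventh gene encoding an enzyme having cytochrome P450 activity leads to a higher yield of oxygenated diterpenoid compounds. Preferably, compared with production in a similar cell lacking the sixth and/or seventh gene, production is increased by at least 10%, preferably by at least 20%, even more preferably by at least 50% or even more.
The first, second, third, fourth, fifth, sixth and seventh genes may be contained in one or more nucleic acid molecules, for example one or more heterologous nucleic acids. A heterologous nucleic acid encoding the first enzyme having cytochrome P450 activity may be referred to herein as the "first heterologous nucleic acid". A heterologous nucleic acid encoding the second enzyme having cytochrome P450 activity may be referred to herein as the "second heterologous nucleic acid". A heterologous nucleic acid encoding the third enzyme having cytochrome P450 activity may be referred to herein as the "third heterologous nucleic acid". A heterologous nucleic acid encoding the fourth enzyme having cytochrome P450 activity may be referred to herein as the "fourth heterologous nucleic acid". A heterologous nucleic acid encoding the fifth enzyme having cytochrome P450 activity may be referred to herein as the "fifth heterologous nucleic acid". A heterologous nucleic acid encoding the sixth enzyme having cytochrome P450 activity may be referred to herein as the "sixth heterologous nucleic acid". A heterologous nucleic acid encoding the enzyme having cytochrome B5 activity may be referred to herein as the "seventh heterologous nucleic acid". This does not mean that the recombinant host cell must contain seven heterologous nucleic acids in total; in some embodiments, the cell contains only one or more of the first, second, third, fourth, fifth, sixth and seventh heterologous nucleic acids.
The oxygenated diterpenoid compounds prepared by the method of the invention can be modified further by biological or chemical synthesis. In this context, biological synthesis is understood as a method in which the host cell containing the genes of the invention is additionally provided with one or more further genes encoding other enzymes capable of modifying the oxygenated diterpenoid compound prepared by the method of the invention.
Chemical modification of the oxygenated diterpenoid compound prepared by the method of the invention can be carried out directly on the culture broth, before the oxygenated diterpenoid compound is recovered, or on the recovered oxygenated diterpenoid compound.
Triptonide can be reduced to triptolide by organic synthesis. One example of such a synthesis is reduction by nucleophilic attack of a hydride on the C-14 ketone. Sodium borohydride is a suitable reagent for this reaction, used in an appropriate solvent such as water or methanol at neutral pH.
In a preferred embodiment, the triptonide produced by the method of the invention is converted into triptolide, a compound reported to be an immunosuppressant and under investigation for cancer treatment.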
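The three aspects described above build on one another: each adds genes to the previous set and moves the preferred product one step further along the pathway. The following minimal sketch captures that relationship as a lookup. It is a simplification for illustration only: the function and table names are hypothetical, gene labels are written without variant suffixes, the host is assumed to already produce miltiradiene and/or dehydroabietadiene, and the optional yield-enhancing genes (TwB5#1, TwCYP82D217, TwCYP82D275) are ignored.

```python
# Ordered pathway stages as described for the first, second and third aspects.
# Each stage lists the P450 genes it adds and the product the text associates with it.
PATHWAY = [
    ({"TwCYP82D274"}, "14-OH-dehydroabietadiene"),        # first aspect
    ({"TwCYP71BE86", "TwCYP71BE85"}, "triptophenolide"),  # second aspect
    ({"TwCYP82D213"}, "triptonide"),                      # third aspect
]


def expected_product(genes):
    """Return the furthest product reachable with the given P450 genes.

    Stages must be completed in order; the walk stops at the first stage
    whose genes are not all present. Returns None if even the first gene
    is missing.
    """
    present = set(genes)
    product = None
    for required, compound in PATHWAY:
        if not required <= present:
            break
        product = compound
    return product


if __name__ == "__main__":
    print(expected_product(["TwCYP82D274"]))
    print(expected_product(["TwCYP82D274", "TwCYP71BE85", "TwCYP71BE86"]))
    print(expected_product(["TwCYP82D274", "TwCYP71BE85", "TwCYP71BE86", "TwCYP82D213"]))
```

Triptolide is not in the table because the text obtains it by chemical reduction of triptonide rather than by a further gene.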
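Host cells
The host cell capable of producing miltiradiene and/or dehydroabietadiene can in principle be any such cell. It may be a cell that naturally produces miltiradiene and/or dehydroabietadiene, or a cell engineered to produce one or both compounds.
It is believed that, at least in some cases, miltiradiene can convert spontaneously into dehydroabietadiene. The invention can therefore be carried out with a miltiradiene-producing cell in which miltiradiene converts spontaneously into dehydroabietadiene, or in a cell containing an enzyme that promotes the conversion of miltiradiene into dehydroabietadiene (see J. Zi et al., Organic & Biomolecular Chemistry 2013, 11, 7650-7652).
The synthesis of miltiradiene usually begins with the formation of GGPP. GGPP can be synthesized by condensation of one molecule of dimethylallyl pyrophosphate (DMAPP) with three molecules of isopentenyl pyrophosphate (IPP), a reaction usually catalysed by a geranylgeranyl diphosphate synthase, for example the SpGGPPs7 enzyme derived from Synechococcus sp., which has the amino acid sequence shown in SEQ ID NO:73 or SEQ ID NO:81.
GGPP is converted into miltiradiene by the action of a diterpene synthase or by the combined action of two or more diterpene synthases. Examples are: the combination of the two diterpene synthases CfTPS1 and CfTPS3, derived from Plectranthus barbatus and having the amino acid sequences of SEQ ID NO:67 and 68, or the combination of CftTPS1 as set out in SEQ ID NO:77 and CftTPS3 as set out in SEQ ID NO:78; the combination of the two diterpene synthases TwTPS9 and TwTPS27, derived from Tripterygium wilfordii and having the amino acid sequences of SEQ ID NO:69 and 70; or the copalyl diphosphate synthase SmCPS, derived from Salvia miltiorrhiza and having the amino acid sequence of SEQ ID NO:71, together with the miltiradiene synthase SmKSL, derived from Salvia miltiorrhiza and having the amino acid sequence of SEQ ID NO:72.
Published prior art:
N. L. Hansen et al., The Plant Journal 2017, 89, 429-441.
P. Su et al., The Plant Journal 2018, 93, 50-65.
J. Guo et al., PNAS 2013, 110, 12108-12113.
One preferred way of providing a host cell that produces miltiradiene and/or dehydroabietadiene is to select a host cell that produces GGPP and transform it with diterpene synthases that catalyse the conversion of GGPP into miltiradiene. Alternatively, a host cell that has been genetically modified to produce GGPP can be used as the starting point.
Techniques for transforming host cells with diterpene synthases to catalyse the conversion of GGPP into miltiradiene are known in the prior art, for example from N. L. Hansen et al. (2017), The Plant Journal 2017, 89, 429-441 (incorporated herein by reference); P. Su et al. (2018), The Plant Journal 2018, 93, 50-65; and J. Guo et al., Proceedings of the National Academy of Sciences 2013, 110, 12108-12113. The procedures and methods disclosed in these publications can also be used to provide host cells for use in the invention.
The host cell may be a prokaryotic cell, for example a eubacterial or archaeal cell, or a eukaryotic cell, for example a plant cell, animal cell, insect cell, fungal cell or yeast cell.
Virtually all eukaryotic cells produce GGPP for their own biosynthesis. In some embodiments, however, the eukaryotic cell produces an increased amount of GGPP compared with a similar eukaryotic cell that does not, which may increase miltiradiene production. Methods for increasing GGPP production in eukaryotic cells have also been described in the prior art.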
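The host cell may be a unicellular organism or may be part of a multicellular organism, for example a plant. Examples of suitable plants or plant cells for use as host cells according to the invention include corn (Zea mays), canola (Brassica napus, Brassica rapa ssp.), alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale cereale), sorghum (Sorghum bicolor, Sorghum vulgare), sunflower (Helianthus annuus), wheat (Triticum aestivum and other species), triticale, rye (Secale), soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanut (Arachis hypogaea), cotton (Gossypium hirsutum), sweet potato (Ipomoea batatas), cassava (Manihot esculenta), coffee (Coffea spp.), coconut (Cocos nucifera), pineapple (Ananas comosus), citrus (Citrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), banana (Musa spp.), avocado (Persea americana), fig (Ficus carica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia integrifolia), almond (Prunus amygdalus), apple (Malus spp.), pear (Pyrus spp.), plum and cherry trees (Prunus spp.), Ribes (currant etc.), grapevine, Jerusalem artichoke (Helianthemum spp.), non-cereal grasses (Gramineae), sugar and fodder beets (Beta), chicory, oats, barley, vegetables or ornamentals, and crop plants (for example cereals and pulses, maize, wheat, potatoes, tapioca, rice, sorghum, millet, cassava, barley, peas, sugar beet, sugar cane, soybean, oilseed rape, sunflower and other root, tuber or seed crops). Other important plants may be fruit trees, crop trees, forest trees, or plants used as spices or pharmaceutical products (Mentha spp., clove, Artemisia spp., Thymus spp., Lavandula spp., Allium spp., Hypericum, Catharanthus spp., Vinca spp., Papaver spp., Digitalis spp., Rauwolfia spp., Vanilla spp., Petroselinum spp., Eucalyptus, tea tree, Picea spp., Pinus spp., Abies spp., Juniperus spp.). Horticultural plants that can be used in the invention include lettuce, endive, and vegetable brassicas including cabbage, broccoli and cauliflower, as well as carrots, carnations and geraniums.
The plant may also be tobacco, cucurbits, carrot, strawberry, sunflower, tomato, pepper or chrysanthemum.
Further examples of plants include grain plants, oilseed plants and leguminous plants. Seeds of interest include grain seeds such as corn, wheat, barley, sorghum and rye. Oilseed plants include cotton, soybean, safflower, sunflower, Brassica, maize, alfalfa, palm and coconut. Leguminous plants include beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, mung bean, lima bean, fava bean, lentils and chickpea.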
特别优选的植物种类包括Physcomitrella sp.,如P.patens;拟南芥属,如拟南芥;烟草属(Nicotiana sp.),如N.benthamiana;衣藻属(Chlamydomonas sp.),如C.reinhardtii;以及Nannochloropsis sp.(N.oceanica)等南绿球藻(Nannochloropsissp.)。
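Examples of suitable eukaryotic cells for use according to the invention include fungal cells, such as cells of Agaricus, Aspergillus, Candida, Eremothecium, Fusarium/Gibberella, Kluyveromyces, Laetiporus, Lentinus, Phaffia, Phanerochaete, Pichia, Physcomitrella, Rhodotorula, Saccharomyces, Schizosaccharomyces, Sphaceloma, Xanthophyllomyces or Yarrowia. Exemplary species from these genera include Lentinus tigrinus, Laetiporus sulphureus, Phanerochaete chrysosporium, Pichia pastoris, Cyberlindnera jadinii, Physcomitrella patens, Rhodotorula glutinis, Rhodotorula mucilaginosa, Phaffia rhodozyma, Xanthophyllomyces dendrorhous, Fusarium fujikuroi/Gibberella fujikuroi, Candida utilis, Candida glabrata, Candida albicans and Yarrowia lipolytica.
In some embodiments, the host cell may be an Ascomycete, such as Gibberella fujikuroi, Kluyveromyces lactis, Schizosaccharomyces pombe, Aspergillus niger, Yarrowia lipolytica, Ashbya gossypii or Saccharomyces cerevisiae.
In some embodiments, the host cell may be an algal cell, such as Blakeslea trispora, Dunaliella salina, Haematococcus pluvialis, Chlorella sp., Undaria pinnatifida, Sargassum, Laminaria japonica or Scenedesmus almeriensis.
In some embodiments, the host cell may be a prokaryote, such as a Bacillus cell, for example Bacillus subtilis; an Escherichia cell, for example an Escherichia coli cell; a Lactobacillus cell; a Lactococcus cell; a Streptomyces cell; a Streptococcus cell; a Corynebacterium cell; an Acetobacter cell; an Acinetobacter cell; or a Pseudomonas cell.
In some embodiments, the host cell may be a cyanobacterial cell, for example of the genus Synechocystis or Synechococcus.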
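In one embodiment, a host cell suitable for growth in a fermenter is selected. Growing the recombinant host cell according to the invention in a fermenter is a convenient way of producing the oxygenated diterpenoid compounds of the invention.
In another embodiment, the host cell is a phototrophic cell, and the cell is grown in a greenhouse or a photobioreactor.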
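Genes and enzymes
The recombinant host cell of the invention is capable of producing an abietane-type diterpene and/or dehydroabietadiene. The abietane-type diterpene may convert spontaneously into dehydroabietadiene, or the conversion may be catalysed by an enzyme that promotes the conversion of the abietane-type diterpene into dehydroabietadiene.
As described above, synthesis of the abietane-type diterpene typically begins with the condensation of one dimethylallyl pyrophosphate (DMAPP) molecule and three isopentenyl pyrophosphate (IPP) molecules by geranylgeranyl diphosphate synthase (GGPPS) to form GGPP.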
重组宿主细胞和编码在重组宿主细胞中催化GGPP合成的酶的异源核酸在本领域中是公知的,参见例如WO 2015/113570。此外,许多宿主生物能够产生GGPP,因此异源核酸可能并不总是GGPP生产所必需的。
在一些实施方案中,重组宿主细胞包含编码双(牻牛儿基)二磷酸合酶的异源核酸,如SEQ ID NO:73或SEQ ID NO:81中所述双(牻牛儿基)二磷酸合酶SpGGPPs7,或与其具有至少80%、例如至少81%、例如至少82%、例如至少83%、例如至少84%、例如至少85%、例如至少86%、例如至少87%、例如至少88%、例如至少89%、例如至少90%、如至少91%、例如至少92%、例如至少93%、例如至少94%、例如至少95%、例如至少96%、例如至少97%、例如至少98%、例如至少99%的序列同一性的功能同源物,或其成熟多肽。
随后,GGPP可通过一个或多个二萜合酶、柯巴基焦磷酸合酶和/或松香烷型二萜烯合酶的作用转化为松香烷型二萜烯。
在一些实施方案中,重组宿主细胞包含一种或多种编码一种或多种二萜合酶的异源核酸,如二萜合酶CfTPS1(SEQ ID NO:67)和CfTPS3(SEQ ID NO:68)、或CftTPS1(SEQ IDNO:77)和CftTPS3(SEQ ID NO:78)、或与其具有至少80%、例如至少81%、例如至少82%、例如至少83%、例如至少84%、例如至少85%、例如至少86%、例如至少87%、例如至少88%、例如至少89%、例如至少90%、如至少91%、例如至少92%、例如至少93%、例如至少94%、例如至少95%、例如至少96%、例如至少97%、例如至少98%、例如至少99%的序列同一性的相应功能同源物,或其成熟多肽。
在一些实施方案中,重组宿主细胞包含一种或多种编码一种或多种二萜合酶的异源核酸,如二萜合酶TwTPS9(SEQ ID NO:69)和TwTPS27(SEQ ID NO:70)、或与其具有至少80%、例如至少81%、例如至少82%、例如至少83%、例如至少84%、例如至少85%、例如至少86%、例如至少87%、例如至少88%、例如至少89%、例如至少90%、如至少91%、例如至少92%、例如至少93%、例如至少94%、例如至少95%、例如至少96%、例如至少97%、例如至少98%、例如至少99%的序列同一性的相应功能同源物,或其成熟多肽。
在一些实施方案中,重组宿主细胞包含一种或多种柯巴基焦磷酸合酶和一种或多种松香烷型二萜烯合酶的组合,如柯巴基焦磷酸合酶SmCPS(SEQ ID NO:71)和松香烷型二萜烯合酶SmKSL(SEQ ID NO:72)的组合、或与其具有至少80%、例如至少81%、例如至少82%、例如至少83%、例如至少84%、例如至少85%、例如至少86%、例如至少87%、例如至少88%、例如至少89%、例如至少90%、如至少91%、例如至少92%、例如至少93%、例如至少94%、例如至少95%、例如至少96%、例如至少97%、例如至少98%、例如至少99%的序列同一性的相应功能同源物,或其成熟多肽。
在又一方面,本发明涉及具有细胞色素P450酶活性的多肽,并且所述多肽包含与SEQ ID NO:1、SEQ ID NO:2、SEQ ID NO:3、SEQ ID NO:4、SEQ ID NO:5、SEQ ID NO:6和SEQID NO:7中的一种具有至少80%序列同一性、优选至少85%序列同一性、优选至少90%序列同一性,优选至少95%序列同一性、优选至少96%序列同一性、优选至少97%序列同一性、优选至少98%序列同一性、或甚至100%序列同一性的氨基酸序列,或其成熟多肽。
在又一方面,本发明涉及具有细胞色素B5酶活性的多肽,并且所述多肽包含与SEQID NO:8具有至少80%序列同一性、优选至少85%序列同一性、优选至少90%序列同一性,优选至少95%序列同一性、优选至少96%序列同一性、优选至少97%序列同一性、优选至少98%序列同一性、或甚至100%序列同一性的氨基酸序列,或其成熟多肽。
本发明还涉及编码具有细胞色素P450酶活性的多肽的多核苷酸序列或基因,并且包含与SEQ ID NO:1、SEQ ID NO:2、SEQ ID NO:3、SEQ ID NO:4、SEQ ID NO:5、SEQ ID NO:6和SEQ ID NO:7中的一种具有至少80%序列同一性、优选至少85%序列同一性、优选至少90%序列同一性,优选至少95%序列同一性、优选至少96%序列同一性、优选至少97%序列同一性、优选至少98%序列同一性、或甚至100%序列同一性的氨基酸序列,或其成熟多肽;或者编码具有细胞色素B5活性的多肽,所述多肽包含与SEQ ID NO:8具有至少80%序列同一性、优选至少85%序列同一性、优选至少90%序列同一性,优选至少95%序列同一性、优选至少96%序列同一性、优选至少97%序列同一性、优选至少98%序列同一性、或甚至100%序列同一性的氨基酸序列,或其成熟多肽。
在优选实施方案中,具有细胞色素P450活性的第一、第二、第三、第四、第五和第六酶中的一种或多种包含或由根据以下任一项的氨基酸序列组成:SEQ ID NO:1(TwCYP82D274v1)、SEQ ID NO:2(TwCYP82D274v2)、SEQ ID NO:74(TwCYP82D274v3)、SEQ IDNO:75(TwCYP82D274v4)、SEQ ID NO:3(TwCYP71BE85)、SEQ ID NO:4(TwCYP71BE86)、SEQ IDNO:5(TwCYP82D213v1)和SEQ ID NO:76(TwCYP82D213v2),与其具有至少80%序列同一性、优选至少85%序列同一性、优选至少90%序列同一性,优选至少95%序列同一性、优选至少96%序列同一性、优选至少97%序列同一性、优选至少98%序列同一性、优选至少99%序列同一性的相应功能同系物。
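In some embodiments, the first heterologous nucleic acid encoding a first enzyme having cytochrome P450 activity encodes TwCYP82D274 as set forth in SEQ ID NO:1 (TwCYP82D274v1), SEQ ID NO:2 (TwCYP82D274v2), SEQ ID NO:74 (TwCYP82D274v3) or SEQ ID NO:75 (TwCYP82D274v4), or a functional homologue thereof having at least 80%, such as at least 81%, such as at least 82%, such as at least 83%, such as at least 84%, such as at least 85%, such as at least 86%, such as at least 87%, such as at least 88%, such as at least 89%, such as at least 90%, such as at least 91%, such as at least 92%, such as at least 93%, such as at least 94%, such as at least 95%, such as at least 96%, such as at least 97%, such as at least 98%, such as at least 99% sequence identity thereto.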
在一些实施方案中,编码具有细胞色素P450活性的第二酶的第二异源核酸编码下列细胞色素P450酶TwCYP71BE86:SEQ ID NO:4,或与其具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物。
在一些实施方案中,编码具有细胞色素P450活性的第三酶的第三异源核酸编码下列细胞色素P450酶TwCYP71BE85:SEQ ID NO:3,或与其具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物。
在一些实施方案中,编码具有细胞色素P450活性的第四酶的第四异源核酸编码下列细胞色素P450酶:SEQ ID NO:5中所述TwCYP82D213v1或SEQ ID NO:76(TwCYP82D213v2)中所述TwCYP82D213v2,或与其具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物。
在一些实施方案中,编码具有细胞色素P450活性的第五酶的第五异源核酸编码下列细胞色素P450酶TwCYP82D217:SEQ ID NO:6,或与其具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物。
在一些实施方案中,编码具有细胞色素P450活性的第六酶的第六异源核酸编码下列细胞色素P450酶TwCYP82D275:SEQ ID NO:7,或与其具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物。
在一些实施方案中,编码具有细胞色素B5活性的酶的第七异源核酸编码下列细胞色素B5酶TwB5#1:SEQ ID NO:8,或与其具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物。
在一些实施方案中,提供了重组宿主细胞:
i.其中所述宿主细胞能够产生松香烷型二萜烯和/或脱氢松香二烯;和
ii.包含编码下列异源核酸:SEQ ID NO:1、SEQ ID NO:2、SEQ ID NO:74或SEQ IDNO:75的TwCYP82D274,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽,
其中所述细胞能够产生14-羟基脱氢松香二烯。
在一些实施方案中,提供重组宿主细胞:
i.其中所述宿主细胞能够产生松香烷型二萜烯和/或脱氢松香二烯;和
ii.包含编码下列异源核酸:SEQ ID NO:1、SEQ ID NO:2、SEQ ID NO:74或SEQ IDNO:75的TwCYP82D274,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;和
iii.包含编码下列异源核酸:SEQ ID NO:4的TwCYP71BE86,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;
其中所述细胞能够生产14-羟基脱氢松香二烯、3,14-二羟基脱氢松香二烯、3,14-dihydroxyabeodiene和/或14-羟基-18-醛-abeodiene。
在一些实施方案中,提供了重组宿主细胞:
i.其中所述宿主细胞能够产生松香烷型二萜烯和/或脱氢松香二烯;和
ii.包含编码下列异源核酸:SEQ ID NO:1、SEQ ID NO:2、SEQ ID NO:74或SEQ IDNO:75的TwCYP82D274,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;和
iii.包含编码下列异源核酸:SEQ ID NO:4的TwCYP71BE86,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;和
iv.包含编码下列异源核酸:SEQ ID NO:3的TwCYP71BE85,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;
其中所述细胞能够生产14-羟基脱氢松香二烯,3,14-二羟基脱氢松香二烯,3,14-dihydroxyabeodiene,14-羟基-18-醛-abeodiene和/或雷酚内酯。
在一些实施方案中,提供了重组宿主细胞:
i.其中所述宿主细胞能够产生松香烷型二萜烯和/或脱氢松香二烯;和
ii.包含编码下列异源核酸:SEQ ID NO:1、SEQ ID NO:2、SEQ ID NO:74或SEQ IDNO:75的TwCYP82D274,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;和
iii.包含编码下列异源核酸:SEQ ID NO:4的TwCYP71BE86,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;和
iv.包含编码下列异源核酸:SEQ ID NO:3的TwCYP71BE85,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;和
v.包含编码下列异源核酸:SEQ ID NO:5或SEQ ID NO:76的TwCYP82D213,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;
其中所述细胞能够生产14-羟基脱氢松香二烯,3,14-二羟基脱氢松香二烯,3,14-dihydroxyabeodiene,14-羟基-18-醛-abeodiene,雷酚内酯和/或雷公藤内酯酮。
在一些实施方案中,提供了重组宿主细胞:
i.其中所述宿主细胞能够产生松香烷型二萜烯和/或脱氢松香二烯;和
ii.包含编码下列异源核酸:SEQ ID NO:1、SEQ ID NO:2、SEQ ID NO:74或SEQ IDNO:75的TwCYP82D274,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;和
iii.包含编码下列异源核酸:SEQ ID NO:4的TwCYP71BE86,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;和
iv.包含编码下列异源核酸:SEQ ID NO:3的TwCYP71BE85,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;和
v.包含编码下列异源核酸:SEQ ID NO:5或SEQ ID NO:76的TwCYP82D213,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;和
vi.包含编码下列异源核酸:SEQ ID NO:8的TwB5#1,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;
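wherein said cell is capable of producing triptonide at a titre that is at least 2-fold higher, such as at least 3-fold higher, such as at least 4-fold higher, such as at least 5-fold higher, than that of an identical yeast cell except that said yeast cell does not express said TwB5#1 or a functional homologue thereof.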
在一些实施方案中,提供了重组宿主细胞:
i.其中所述宿主细胞能够产生松香烷型二萜烯和/或脱氢松香二烯;和
ii.包含编码下列异源核酸:SEQ ID NO:1、SEQ ID NO:2、SEQ ID NO:74或SEQ IDNO:75的TwCYP82D274,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;和
iii.包含编码下列异源核酸:SEQ ID NO:4的TwCYP71BE86,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;和
iv.包含编码下列异源核酸:SEQ ID NO:3的TwCYP71BE85,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;和
v.包含编码下列异源核酸:SEQ ID NO:5或SEQ ID NO:76的TwCYP82D213,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;和
vi.包含编码下列异源核酸:SEQ ID NO:8的TwB5#1,或具有至少80%,如至少81%,如至少82%,如至少83%,如至少84%,如至少85%,如至少86%,如至少87%,如至少88%,如至少89%,如至少90%,如至少91%,如至少92%,如至少93%,如至少94%,如至少95%,如至少96%,如至少97%,如至少98%,如至少99%序列同一性的功能同系物,或其成熟多肽;
其中所述细胞能够在发酵培养基中生长,并且其中发酵7天后所述发酵培养基包括:
-至少3ppm雷公藤内酯酮和/或
-至少1ppm雷酚内酯。
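The above recombinant host cells may be capable of producing the abietane-type diterpene and/or dehydroabietadiene for several different reasons. For example, the host cell may be endogenously capable of producing the abietane-type diterpene. Alternatively, the recombinant host cell may comprise one or more heterologous nucleic acid sequences encoding one or more enzymes involved in the production of the abietane-type diterpene, for example the diterpene biosynthesis enzymes SpGGPPS7 of SEQ ID NO:73 or SEQ ID NO:81, CfTPS1 of SEQ ID NO:67, CftTPS1 of SEQ ID NO:77, CfTPS3 of SEQ ID NO:68, CftTPS3 of SEQ ID NO:78 and/or TwCPR1 of SEQ ID NO:9, or a corresponding functional homologue having at least 80% sequence identity, preferably at least 85% sequence identity, preferably at least 90% sequence identity, preferably at least 95% sequence identity, preferably at least 96% sequence identity, preferably at least 97% sequence identity, preferably at least 98% sequence identity, preferably at least 99% sequence identity thereto, or a mature polypeptide thereof.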
具有细胞色素P450活性的第一(例如TwCYP82D274)、第二(例如TwCYP71BE86)、第三(例如,TwCYP71BE85)、第四(例如:TwCYP82D213)、第五(例如TwCYP82D217)和第六(例如,TwCYP82D275)酶以及具有细胞色素B5活性的酶(例如,TwB5#1)的功能同系物可以通过在酵母细胞中表达相关蛋白并评估它们是否能够产生下文所述的特定化合物来验证。
一种酵母细胞,表达TwCYP82D274(SEQ ID NO:1、SEQ ID NO:2、SEQ ID NO:74或SEQ ID NO:75)的功能同系物,并且还表达:
i.二萜生物合成酶SPGGPPS7v2(SEQ ID NO:81)、CftTPS1(SEQ ID NO:77)、CftTPS3(SEQ ID NO:78)和TwCPR1(SEQ ID NO:9),
优选能够生产14-羟基脱氢松香二烯。
一种酵母细胞,表达TwCYP71BE86(SEQ ID NO:4)的功能同系物,并且还表达:
i.二萜生物合成酶SPGGPPS7v2(SEQ ID NO:81)、CftTPS1(SEQ ID NO:77)、CftTPS3(SEQ ID NO:78)和TwCPR1(SEQ ID NO:9);和
ii.TwCYP82D274(SEQ ID NO:1或SEQ ID NO:2),
优选能够生产14-羟基脱氢松香二烯,3,14-二羟基脱氢松香二烯,3,14-dihydroxyabeodiene和14-羟基-18-醛-abeodiene。
一种酵母细胞,表达TwCYP71BE85(SEQ ID NO:3)的功能同系物,并且还表达:
i.二萜生物合成酶SPGGPPS7v2(SEQ ID NO:81)、CftTPS1(SEQ ID NO:77)、CftTPS3(SEQ ID NO:78)和TwCPR1(SEQ ID NO:9);和
ii.TwCYP82D274(SEQ ID NO:1或SEQ ID NO:2);和
iii.TwCYP71BE86(SEQ ID NO:4),
优选能够生产14-羟基脱氢松香二烯,3,14-二羟基脱氢松香二烯,3,14-dihydroxyabeodiene,14-羟基-18-醛-abeodiene和雷酚内酯。
一种酵母细胞,表达TwCYP82D213(SEQ ID NO:5或SEQ ID NO:76)的功能同系物,并且还表达:
i.二萜生物合成酶SPGGPPS7v2(SEQ ID NO:81)、CftTPS1(SEQ ID NO:77)、CftTPS3(SEQ ID NO:78)和TwCPR1(SEQ ID NO:9);和
ii.TwCYP82D274(SEQ ID NO:1或SEQ ID NO:2);
iii.TwCYP71BE86(SEQ ID NO:4);和
iv.TwCYP71BE85(SEQ ID NO:3),
优选能够生产14-羟基脱氢松香二烯,3,14-二羟基脱氢松香二烯,3,14-dihydroxyabeodiene,14-羟基-18-醛-abeodiene,雷酚内酯和雷公藤内酯酮。
一种酵母细胞,表达TwB5#1(SEQ ID NO:8)的功能同系物,并且还表达:
i.二萜生物合成酶SPGGPPS7v2(SEQ ID NO:81)、CftTPS1(SEQ ID NO:77)、CftTPS3(SEQ ID NO:78)和TwCPR1(SEQ ID NO:9);
ii.TwCYP82D274(SEQ ID NO:1或SEQ ID NO:2);
iii.TwCYP71BE86(SEQ ID NO:4);
iv.TwCYP71BE85(SEQ ID NO:3);和
v.TwCYP82D213(SEQ ID NO:5或SEQ ID NO:76)
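is preferably capable of producing triptonide at a titre that is at least 2-fold higher, such as at least 3-fold higher, such as at least 4-fold higher, such as at least 5-fold higher, than that of an identical yeast cell except that said yeast cell does not express said functional homologue of TwB5#1.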
在优选实施方案中,具有细胞色素B5活性的酶包含或由下列组成:根据SEQ IDNO:8(TwB5#1)的氨基酸序列,或与其具有至少80%序列同一性、优选至少85%序列同一性、优选至少90%序列同一性,优选至少95%序列同一性、优选至少96%序列同一性、优选至少97%序列同一性、优选至少98%序列同一性、优选至少99%序列同一性的相应功能同系物,或其成熟多肽。
本发明的多核苷酸可以通过从天然产生多肽的生物体(如植物雷公藤或近缘植物)中克隆来提供,或者可以通过基于本领域已知技术的多核苷酸序列的化学合成来提供,或者其可以具有自然界中未发现的序列,例如该序列可以针对特定选择的宿主细胞进行密码子优化。
本发明的多肽可由天然产生多肽的生物提供,例如植物雷公藤或相关生物;或者它们可以通过将编码多肽的多核苷酸插入和表达到合适的宿主细胞中并从包含用相应基因转化的宿主细胞的培养液中回收多肽来提供。优选从合适的选择的重组宿主细胞提供本发明的多肽。
为了在合适的宿主细胞中转化和表达基因,该基因通常与合适的调节元件可操作地连接,并插入适合于特定选择的宿主细胞的表达载体中。选择合适的调节元件、构建合适的表达载体和转化所选择的宿主细胞是普通从业者的技能范围内的,并且本发明不受这些元件的任何特定选择的限制。
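The resulting host cell comprising the genes of the invention is suitable for growth in a vessel, for example a fermenter or a shake flask, under conditions in which the genes are expressed and the oxygenated diterpenoid compound is formed. When growth has ceased, or when a sufficiently high amount of the oxygenated diterpenoid compound has accumulated in the culture broth, the oxygenated diterpenoid compound may be further modified and recovered from the culture broth.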
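Sequence identity is understood as a measure of the similarity between two amino acid or nucleotide sequences. It is calculated by first aligning the two sequences, counting the number of positions at which the two sequences contain the same amino acid residue or nucleotide, and expressing that number as a percentage of the full alignment length.
Several algorithms for this purpose have been developed and are available to the skilled person.
In the present specification and claims, sequence identity of amino acid sequences is calculated using the NCBI BLAST+ pairwise alignment algorithm with default parameters (BLOSUM62 matrix; gap open penalty 11; gap extension penalty 1; expectation threshold 10), and sequence identity of nucleotide sequences is calculated using the same algorithm with default parameters (match/mismatch scores 1, -3; gap open penalty 5; gap extension penalty 2; expectation threshold 10). The NCBI BLAST+ program is further described in Madeira F et al. (2019) NAR 47:W636-W641.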
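The identity calculation described above can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned (for instance by BLAST+) and are supplied as equal-length strings with "-" marking gaps; the function names `percent_identity` and `meets_threshold` are hypothetical and are not part of BLAST+ or of the specification.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Return identical positions as a percentage of the full alignment length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("alignment must not be empty")
    # A column counts as identical only if both residues match and neither is a gap.
    identical = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap)
    return 100.0 * identical / len(aligned_a)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 80.0) -> bool:
    """Check an alignment against an identity threshold such as 'at least 80%'."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Toy alignment: the first ten residues of SEQ ID NO:1 in one-letter code,
# aligned against a copy with one artificial gap (illustration only).
toy_a = "MEFLLSLPTN"
toy_b = "MEFLL-LPTN"
toy_identity = percent_identity(toy_a, toy_b)
```

Dividing by the full alignment length, gap columns included, follows the definition given in the text; other conventions (for example dividing by the shorter sequence) would give different percentages.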
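Examples
Materials and methods
Genetic engineering of Nicotiana benthamiana
Tripterygium wilfordii CYP genes were cloned from plant material and co-expressed in Nicotiana benthamiana with the diterpene biosynthesis genes CfDXS (SEQ ID NO:79) or SctHMGR (SEQ ID NO:80), CfGGPPS or SpGGPPS7 (SEQ ID NO:81), CfTPS1 (SEQ ID NO:67) or CftTPS1 (SEQ ID NO:77), and CfTPS3 (SEQ ID NO:68) or CftTPS3 (SEQ ID NO:78), using constructs and methods described previously (1-4). The gene silencing suppressor p19 was also co-expressed. Briefly, binary vectors, each containing an individual diterpene biosynthesis gene or a T. wilfordii CYP (TwCYP), were transformed into Agrobacterium. Liquid cultures of Agrobacterium, each harbouring a specific plasmid, were mixed in order to co-express specific combinations of TwCYPs.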
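Genetic engineering of Saccharomyces cerevisiae and growth conditions for engineered S. cerevisiae
Media
YPD medium: 20 g/L Bacto™ peptone, 10 g/L Bacto™ yeast extract, 20 g/L glucose.
Synthetic complete (SC) medium without uracil: 1.92 g/L yeast synthetic drop-out medium supplement without uracil (Sigma-Aldrich Co. LLC, catalogue no. Y1501), 6.7 g/L yeast nitrogen base without amino acids (Sigma-Aldrich Co. LLC, catalogue no. Y0626), 20 g/L glucose. Feed-in-time (FIT) medium was based on EnPump 200 (Enpresso GmbH) and was prepared according to the protocol supplied with the product. Agar plates: SC medium including agar (15 g/L).
Uracil auxotrophy was introduced into the parental strain by selecting for loss of URA3 function on agar plates of SC medium without uracil supplemented with 5-fluoroorotic acid (5-FOA, 0.74 g/L) and uracil (30 mg/L).
Yeast transformants were isolated on SC agar plates without uracil.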
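Fed-batch fermentation of engineered S. cerevisiae strains for isolation of abietane-type diterpene-derived diterpenoids
All engineered S. cerevisiae strains were cultivated in 96-deep-well plates using a feed-in-time (FIT; m2p-labs) approach similar to that described previously (Forman et al. 2018). For isolation and purification of key intermediates in the triptonide pathway, selected engineered S. cerevisiae strains were cultivated by fed-batch fermentation in a 2 L bioreactor (Sartorius AG). Fed-batch fermentation was started by adding 100 mL of starter culture to the reactor tank (fitted with impellers), which had been prepared in advance by autoclaving and contained 200 mL of batch glucose and 300 mL of batch salt mixture. In addition, 5 mL of vitamin mix, 5 mL of micro-element solution and 0.5 mL of trace-element solution were added. Cultivation in the bioreactor was started under the following conditions (monitored and automatically controlled): pH 5, temperature 30 °C, dissolved oxygen (DO) 20%. The pH was controlled by supplying ammonium hydroxide (32%) and sulfuric acid (10%), while dissolved oxygen was controlled by air supply and stirring. In addition, the foam level was regulated by adding antifoam emulsion (35119, Serva Electrophoresis GmbH). After 18 hours of initial cultivation in the bioreactor, feeding with the feed solution was started at a rate of 1.3%. The fermentation ran for 7 days, and the culture was sampled daily.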
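Extraction of engineered S. cerevisiae for LC-MS analysis
Genetically engineered S. cerevisiae strains were transferred to 0.5 mL of medium in 96-well plates and grown for 3 days at 30 °C with orbital shaking at 350 rpm. For extraction, 0.1 mL of S. cerevisiae culture was transferred to a 1.5 mL glass vial and 0.4 mL of uHPLC-grade MeOH was added. The S. cerevisiae extract was filtered through a 0.22 µm 96-well filter plate (Merck Millipore, Darmstadt, Germany) and stored at 4 °C prior to LC-MS analysis.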
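Extraction of diterpenoid metabolites for LC-MS analysis
Yeast culture samples for LC-MS analysis were prepared in 1.5 mL glass vials by mixing yeast culture with methanol spiked with 5 ppm andrographolide (internal standard; FA17902, CarboSynth) at a ratio of 1:19 (v/v) for daily bioreactor samples and 1:4 (v/v) for 96-deep-well cultures. The samples were mixed by shaking at room temperature for 30 minutes. For tobacco samples, 2 leaf discs were placed in a 1.5 mL glass vial and extracted with 1 mL of the methanol extraction solution with shaking at room temperature for 1 hour.
Prior to LC-MS analysis, samples were passed through a 0.22 µm 96-well filter plate (Merck Millipore, Darmstadt, Germany) and stored at 5 °C.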
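As an aid to reading the mixing ratios above, the following sketch converts a culture:methanol ratio into a dilution factor and back-calculates a culture concentration from a value measured in the extract. It assumes volumes are additive and that the analyte is fully recovered in the extract; the function names and the example reading of 0.15 ppm are hypothetical and are not taken from the specification.

```python
def dilution_factor(sample_parts: float, solvent_parts: float) -> float:
    """Fold dilution of a sample mixed sample_parts:solvent_parts (v/v)."""
    if sample_parts <= 0 or solvent_parts < 0:
        raise ValueError("parts must be positive (sample) and non-negative (solvent)")
    return (sample_parts + solvent_parts) / sample_parts


def culture_concentration(measured_ppm: float, sample_parts: float,
                          solvent_parts: float) -> float:
    """Back-calculate the concentration in the undiluted culture."""
    return measured_ppm * dilution_factor(sample_parts, solvent_parts)


bioreactor_fold = dilution_factor(1, 19)  # daily bioreactor samples, 1:19 (v/v)
deep_well_fold = dilution_factor(1, 4)    # 96-deep-well cultures, 1:4 (v/v)

# Hypothetical reading of 0.15 ppm in a bioreactor extract.
example_culture_ppm = culture_concentration(0.15, 1, 19)
```

Under these assumptions a 1:19 mixture is a 20-fold dilution and a 1:4 mixture a 5-fold dilution, which is one way to compare bioreactor and deep-well measurements on the same scale.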
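LC-MS analysis
Methanol (MeOH) extracts were analysed on an Ultimate 3000 UHPLC+ Focused system (Dionex Corporation, Sunnyvale, CA) coupled to a Bruker Compact ESI-QTOF-MS (Bruker). Samples were separated on a Kinetex XB-C18 column (100 x 2.1 mm ID, 1.7 µm particle size, 100 Å pore size; Phenomenex Inc., Torrance, CA) maintained at 40 °C, with a flow rate of 0.3 mL/min and a mobile phase consisting of 0.05% (v/v) formic acid in water (solvent A) and 0.05% (v/v) formic acid in acetonitrile (solvent B).
Three LC methods were used:
LC method 1: 0-0.5 min, 10% B; 0.5-21 min, linear increase from 10% B to 80% B; 21-31 min, to 90% B; 31-34 min, to 100% B; 34-39 min, 100% B; 39-40 min, linear decrease from 100% B to 10% B.
LC method 2: 0-0.5 min, 20% B; 0.5-11 min, linear increase from 20% B to 80% B; 11-20 min, to 90% B; 20-22 min, to 100% B; 22-27 min, 100% B; 27-28 min, linear decrease from 100% B to 20% B.
LC method 3: 0-0.5 min, 20% B; 0.5-9 min, linear increase from 20% B to 100% B; 9-11 min, 100% B; 11-11.5 min, linear decrease from 100% B to 20% B; 11.5-15 min, 20% B.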
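The gradient programs above are piecewise linear, so the solvent B fraction at any time point can be obtained by interpolating between breakpoints. The sketch below encodes LC method 3 as a list of (time in minutes, % B) pairs; the names `METHOD_3` and `percent_b` are hypothetical, and the code is only an illustration of the gradient table, not instrument control software.

```python
from bisect import bisect_right

# Breakpoints of LC method 3 as (time in min, % solvent B).
METHOD_3 = [(0.0, 20.0), (0.5, 20.0), (9.0, 100.0),
            (11.0, 100.0), (11.5, 20.0), (15.0, 20.0)]


def percent_b(t_min: float, program: list[tuple[float, float]]) -> float:
    """Linearly interpolate % B at time t_min within a gradient program."""
    times = [t for t, _ in program]
    if not times[0] <= t_min <= times[-1]:
        raise ValueError("time lies outside the gradient program")
    i = bisect_right(times, t_min)
    if i == len(times):  # exactly at the final breakpoint
        return program[-1][1]
    (t0, b0), (t1, b1) = program[i - 1], program[i]
    return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)


mid_ramp = percent_b(4.75, METHOD_3)    # halfway through the 0.5-9 min ramp
plateau = percent_b(10.0, METHOD_3)     # within the 100% B hold
re_equilibration = percent_b(15.0, METHOD_3)
```

The same function can be applied to LC methods 1 and 2 by writing their breakpoints in the same form.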
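Extraction of diterpenoid metabolites for GC-MS analysis
Yeast culture samples for GC-MS analysis were prepared in 1.5 mL glass vials by mixing yeast culture and pure methanol at a ratio of 1:4 (v/v). After brief mixing, non-polar components were extracted by liquid-liquid extraction into hexane spiked with 10 ppm 1-eicosene, mixed at a ratio of 1:1 (v/v) and shaken for 1 hour. The hexane layer was transferred to a new vial prior to GC-MS analysis.
Gas chromatography-mass spectrometry (GC-MS) analysis
GC-MS analysis was performed on a Shimadzu GCMS-QP2010 Ultra (Shimadzu Corp.) using an Agilent HP-5MS column (Agilent Technologies; 20 m x 0.18 mm i.d., 0.18 µm film thickness). Hydrogen was used as carrier gas at a constant linear velocity of 50 cm/s, and the injection volume was 1 µL at 250 °C (splitless mode). The oven program was 80 °C for 2 min, ramp at 20 °C/min to 180 °C, ramp at 10 °C/min to 300 °C, ramp at 20 °C/min to 310 °C, and hold for 3 min. Data were stored in .CDF format and processed in MZmine2.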
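The total run time implied by the oven program can be worked out from the hold times and ramp rates. The sketch below does this arithmetic; the function name `oven_runtime_min` and the tuple layout are hypothetical, and equilibration or cool-down time between injections is not included.

```python
def oven_runtime_min(start_c: float, initial_hold_min: float,
                     ramps: list[tuple[float, float]],
                     final_hold_min: float) -> float:
    """Sum hold times and ramp durations of a GC oven program.

    ramps is a list of (rate in degC/min, target temperature in degC).
    """
    total = initial_hold_min
    current = start_c
    for rate, target in ramps:
        if rate <= 0 or target < current:
            raise ValueError("ramps must heat at a positive rate")
        total += (target - current) / rate
        current = target
    return total + final_hold_min


# Oven program from the text: 80 degC for 2 min, then three ramps, then 3 min hold.
gc_runtime = oven_runtime_min(
    start_c=80.0,
    initial_hold_min=2.0,
    ramps=[(20.0, 180.0), (10.0, 300.0), (20.0, 310.0)],
    final_hold_min=3.0,
)
```

With the values stated in the text the segments last 2, 5, 12, 0.5 and 3 minutes, giving a run time of 22.5 minutes per injection.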
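Relative quantification of abietane-type diterpene-derived diterpenoids
Relative compound amounts in yeast cultures were based on normalized peak areas of characteristic ions (data obtained using targeted feature detection in the MZmine2 software). Signals of the following ions were quantified: 1: abietane-type diterpene, m/z 91.1; 2: 14-hydroxy-abietadiene, m/z 189.1; 3: F15P1, m/z 303.2318; 4: F20P2, m/z 283.2059; 5: F15P2, m/z 299.2002; 6: triptophenolide, m/z 313.1794; 8: triptonide, m/z 359.1481. The mass tolerance was 5 ppm for LC-MS data and 100 ppm for GC-MS data.
The peak area of the base peak ion of the internal standard andrographolide (m/z 315.1947) was used for normalization.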
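The two operations described above, extracting a target ion within a ppm tolerance and normalizing its peak area to the internal standard, can be sketched as follows. This is not MZmine2 code; the function names are hypothetical and the peak areas in the example are invented for illustration.

```python
def mz_window(mz: float, tolerance_ppm: float) -> tuple[float, float]:
    """Lower and upper m/z bounds for a target ion at a given ppm tolerance."""
    delta = mz * tolerance_ppm / 1e6
    return mz - delta, mz + delta


def normalized_area(peak_area: float, internal_standard_area: float) -> float:
    """Peak area relative to the internal standard peak area."""
    if internal_standard_area <= 0:
        raise ValueError("internal standard area must be positive")
    return peak_area / internal_standard_area


# Triptonide target ion (m/z 359.1481) with the 5 ppm LC-MS tolerance.
triptonide_low, triptonide_high = mz_window(359.1481, 5.0)

# Invented peak areas: analyte 2.0e5, andrographolide (m/z 315.1947) 4.0e5.
example_ratio = normalized_area(2.0e5, 4.0e5)
```

At 5 ppm the window around m/z 359.1481 is roughly 0.0018 wide on each side, which illustrates why the much looser 100 ppm tolerance is used for the unit-resolution GC-MS data.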
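Absolute quantification
Triptophenolide (FT65732, CarboSynth) and triptonide (FT65197, CarboSynth) were quantified absolutely by co-analysing authentic standards prepared in methanol together with the internal standard (andrographolide) at a final concentration of 5 ppm. Quantification was based on normalized peak areas and calculated from the slope of a linear fit to the standard response curve (triptophenolide: 0.05, 0.5, 1 and 2 ppm; triptonide: 0.5, 1, 2, 10 and 20 ppm).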
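A calibration of this kind can be sketched as an ordinary least-squares line through the standard points, followed by inversion of the line for an unknown sample. The concentrations below are the triptophenolide levels stated in the text; the normalized responses are invented for illustration, and the function names are hypothetical.

```python
def fit_line(x: list[float], y: list[float]) -> tuple[float, float]:
    """Ordinary least-squares fit y = slope * x + intercept."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need at least two paired calibration points")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def concentration_ppm(norm_area: float, slope: float, intercept: float) -> float:
    """Invert the calibration line to obtain a concentration."""
    return (norm_area - intercept) / slope


# Triptophenolide standard levels from the text (ppm) and invented responses.
standards_ppm = [0.05, 0.5, 1.0, 2.0]
responses = [0.02, 0.2, 0.4, 0.8]  # hypothetical normalized peak areas

cal_slope, cal_intercept = fit_line(standards_ppm, responses)
unknown_ppm = concentration_ppm(0.6, cal_slope, cal_intercept)
```

Samples whose response falls outside the calibrated range would normally be diluted and re-measured rather than extrapolated; that choice is a general practice note and is not stated in the text.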
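Isolation and purification of abietane-type diterpene derivatives from engineered S. cerevisiae strains for NMR analysis
The compounds of the invention were isolated from bioreactor cultures of yeast strains NVJ8.15 and NVJ3.10 and structurally characterized by NMR. Initially, the combined ethyl acetate extracts of broth and methanol-lysed cells (cells:methanol = 1:4, v/v) were dried by rotary evaporation in the presence of Celite (06858, Sigma-Aldrich). The compounds were subsequently separated by successive fractionation on a 5.250 instrument (Interchim, France) and detected by UV absorption and evaporative light scattering detection (ELSD). The instrument was equipped with a (C1) PF-15SIHP-F0025 (OV002A, Interchim) column and a (C2) US5C18HQ-100/300 (SSP750, Interchim) column for normal-phase and reversed-phase separation, respectively.
The dry mixture of Celite and crude extract was first pre-fractionated on a column (reference 9), loaded from a manually packed dry-load column. Separation was carried out with hexane (A) and ethyl acetate (B) as mobile phase at a constant flow rate of 15 mL/min, followed by a final wash step with 100% methanol. Compounds of interest were detected by UV and ELSD and collected. Collected fractions were continuously evaluated by LC-MS (LC method 3) and TLC analysis before further fractionation or NMR studies. Compounds of interest in fractions containing several compounds were further purified by additional normal-phase fractionation on C1 or by reversed-phase fractionation on C2.
For reversed-phase purification on C2, samples were evaporated by rotary evaporation and resuspended in 2 mL of methanol. Samples were injected directly onto the pre-conditioned column C2. The mobile phase for C2 consisted of solvent C (deionized water) and solvent D (acetonitrile), each acidified with 0.05% (v/v) formic acid. A constant flow rate of 32 mL/min was used with a linear solvent gradient of increasing solvent D concentration. Compounds of interest were detected by ELSD and UV and collected.
Additional reversed-phase purification was achieved by multiple 100 µL injections onto a semi-preparative Phenomenex Luna 5 µm C18(2) 100 Å 50 x 10 mm (fully porous) column (Phenomenex, Inc., Torrance, CA, USA) on a Shimadzu HPLC (SPD-M20A diode array detector, FRC-10A fraction collector, DGU-20A5 degasser, LC-20AT pump, CBM-20A system controller, CTO-10AS VP column oven, SIL-10AP autosampler). The mobile phase was a linear gradient between C and D, with the proportion of D increasing from 50% to 100%. Compounds of interest were detected by UV absorption at 210 nm and collected.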
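Mass spectrometry
Mass spectra were acquired in positive ion mode over a scan range of m/z 50-1200 with the following ESI and MS settings: capillary voltage, 4000 V; end plate offset, 500 V; dry gas temperature, 220 °C; dry gas flow, 8 L/min; nebulizer pressure, 2 bar; in-source CID energy, 0 eV; hexapole RF, 50 Vpp; quadrupole ion energy, 4 eV; collision cell energy, 7 eV. Raw chromatogram data were calibrated using an internal sodium formate standard and subsequently exported in mzML format using DataAnalysis 4.3 (Build 110.102.1532, 64-bit; Bruker). MZmine ver. 2.53 was used to visualize LC-MS chromatograms.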
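Media recipes for bioreactor starter medium and feed medium
Batch glucose:
Glucose monohydrate 55 g/L
Batch salt mixture:
Ammonium sulfate 25 g/L
Potassium dihydrogen phosphate 5 g/L
Magnesium sulfate heptahydrate 1.7 g/L
Feed glucose:
Glucose monohydrate 880 g/L
Feed salt mixture:
Potassium dihydrogen phosphate 21.6 g/L
Magnesium sulfate heptahydrate 24.24 g/L
Potassium sulfate 8.4 g/L
Sodium sulfate 0.672 g/L
Preparation instructions:
Batch and feed salt mixtures and batch and feed glucose were prepared in separate BlueCap bottles by dissolving the components in Milli-Q water and autoclaving.
Feed solution:
The feed solution was prepared by mixing 500 mL of feed glucose with 500 mL of feed salt mixture, 10 mL of vitamin mix, 10 mL of micro-element solution and 1 mL of trace-element solution.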
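The final concentrations in the mixed feed solution follow from the stock concentrations and volumes listed above. The sketch below performs this calculation under the assumption that volumes are additive on mixing; the dictionary keys and the function name `final_concentration` are hypothetical labels for illustration.

```python
# Component volumes of the feed solution in mL, as stated in the text.
feed_volumes_ml = {
    "feed glucose": 500.0,
    "feed salt mixture": 500.0,
    "vitamin mix": 10.0,
    "micro-element solution": 10.0,
    "trace-element solution": 1.0,
}


def final_concentration(stock_g_per_l: float, stock_ml: float,
                        total_ml: float) -> float:
    """Concentration after dilution into total_ml, assuming additive volumes."""
    if total_ml <= 0:
        raise ValueError("total volume must be positive")
    return stock_g_per_l * stock_ml / total_ml


total_feed_ml = sum(feed_volumes_ml.values())

# Glucose monohydrate (880 g/L stock) and KH2PO4 (21.6 g/L stock) in the feed.
glucose_g_per_l = final_concentration(880.0, feed_volumes_ml["feed glucose"],
                                      total_feed_ml)
kh2po4_g_per_l = final_concentration(21.6, feed_volumes_ml["feed salt mixture"],
                                     total_feed_ml)
```

Under this assumption the mixed feed has a volume of 1021 mL and contains roughly 431 g/L glucose monohydrate, a little under half of the stock concentration.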
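Example 1: Expression in Nicotiana benthamiana
N. benthamiana leaf material co-expressing specific combinations of genes of interest (GOI) was harvested 7 days after Agrobacterium infiltration. 1 mL of methanol (MeOH) was added to 2 leaf discs (2 cm). Extraction was carried out at room temperature with orbital shaking at 200 rpm. 200 µL of extract was filtered through a 0.22 µm 96-well filter plate (Merck Millipore, Darmstadt, Germany) and kept at 4 °C for LC-MS analysis.
Figure 1 shows the LC-MS profiles obtained. The results show that N. benthamiana cells transformed with CYP82D274V1, encoding the enzyme having SEQ ID NO:1, produce 14-OH-dehydroabietadiene; that triptophenolide is formed when the cells are further transformed with CYP71BE85 and CYP71BE86, encoding enzymes having the amino acid sequences of SEQ ID NO:3 and SEQ ID NO:4, respectively; and that triptonide is formed when the cells are further transformed with CYP82D213, encoding an enzyme having the amino acid sequence of SEQ ID NO:5. In addition, it can be seen that the enzyme having the sequence of SEQ ID NO:6 and encoded by the gene CYP82D217 increases the yield of triptophenolide and triptonide.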
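Example 2: Construction of S. cerevisiae strains
Strain construction
The parental yeast strain was S. cerevisiae S288C (NCYC 3608; National Collection of Yeast Cultures, Norwich, UK).
Genotypes and origins of the strains are listed in Table 3.
The constructed yeast strains were prepared using the lithium acetate transformation method (8). Parental strains lacking a functional URA3 were made competent by the following procedure: a glycerol stock was inoculated into 5 mL of YPD medium and grown overnight (O/N) at 30 °C. Then 3 mL of the O/N culture was transferred to 50 mL of YPD medium and grown for a further 4-5 hours, followed by centrifugation at 4000 rpm for 10 minutes, after which the supernatant was discarded. The cells were then washed twice in sterile water (first in 25 mL, then in 1 mL) and resuspended in 0.4 mL of sterile water, at which point they were ready for transformation.
Competent yeast cells were transformed as follows: 10 µL of competent cells were added to a mixture of NotI-digested plasmids (2 µL each) and mixed with 60 µL of PEG 3350 (50% w/v), 9 µL of LiAc (1 M) and 12.5 µL of pre-boiled salmon sperm DNA (10 mg/mL). The resulting mixture was incubated at 42 °C for 40 minutes, after which the cells were collected by centrifugation (3000 rpm, 5 minutes) and the supernatant was removed. The cells were then resuspended in 100 µL of sterile water and spread on SC agar plates without uracil. After 2 days of incubation at 30 °C, isolated transformants appeared as single colonies. Insertion of the gene constructs was confirmed by colony PCR using the gene- and construct-specific primers in Table 1. For colony PCR, a yeast colony was resuspended in 50 µL of 20 mM NaOH and incubated at 99 °C for 15 minutes; 1 µL of the colony suspension was used for PCR.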
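Table 1: List of primers used (table not reproduced in this text)
Assembly of gene constructs for S. cerevisiae genome engineering
Plasmid names and the gene constructs they encode are listed in Table 2. All plasmids were generated by USER cloning as described previously (5). In addition, parental vectors named assembler-1, -2 and -3, which allow simultaneous integration of up to six gene constructs and contain an AsiSI/Nb.BsmI USER cassette, were prepared for USER cloning as described previously (6). Primers used for PCR amplification with the USER-compatible PfuX7 polymerase (7) are listed in Table 1. Table 3 lists the vectors used and generated in this work.
Codon-optimized genes for S. cerevisiae were obtained from TWIST Biosciences, San Francisco, USA. All genes with the prefix "CO_" in the tables below are codon-optimized. Codon-optimized genes were amplified using the same primers as described in Table 1 above, except that the primers were modified to accommodate hybridization to any nucleotide changes in the codon-optimized genes. Primers for amplification of codon-optimized genes are also disclosed in J. Andersen-Ranberg et al., Expanding the Landscape of Diterpene Structural Diversity through Stereochemically Controlled Combinatorial Biosynthesis. Angewandte Chemie International Edition, n/a (2016).
Table 2: Vectors and plasmids generated and used (table not reproduced in this text)
Table 3: List of S. cerevisiae strains used and generated (table not reproduced in this text)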
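Example 3: Expression in the yeast S. cerevisiae
Extraction and metabolite analysis
Genetically engineered S. cerevisiae strains were transferred to 0.5 mL of medium in 96-well plates and grown for 3 days at 30 °C with orbital shaking at 350 rpm. For extraction, 0.1 mL of S. cerevisiae culture was transferred to a 1.5 mL glass vial and 0.4 mL of uHPLC-grade MeOH was added. The S. cerevisiae extract was filtered through a 0.22 µm 96-well filter plate (Merck Millipore, Darmstadt, Germany) and stored at 4 °C prior to LC-MS analysis.
The LC-MS profiles of the extracts are shown in Figure 2, where it can be observed that transformation of the background strain with TwCYP82D274V1, encoding an enzyme having the amino acid sequence of SEQ ID NO:1, leads to formation of 14-OH-dehydroabietadiene.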
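Example 4: Detection of 14-OH-dehydroabietadiene by NMR analysis
The compound identified as 14-OH-dehydroabietadiene in Example 3 was analysed by NMR to confirm its identity.
Purification for NMR
Purification of triptolide intermediates from engineered yeast
Engineered yeast producing the desired target compound was inoculated from SC agar into 10 mL of YPD and grown at 30 °C. 5 mL of overnight (O/N) culture was inoculated into 500 mL of FIT medium and grown at 30 °C for 5 days. Target compounds were extracted from the culture with 500 mL of EtAc. The solvent was removed by rotary evaporation and the analytes were resuspended in hexane. The extraction was repeated 3 times. The hexane extract was applied to a Supelclean™ /Na2SO4 SPE tube (Sigma-Aldrich), and the analytes were eluted from the column using a step gradient of 1:99-5:95 EtAc:hexane. Each fraction was analysed by LC-MS or GC-MS, and fractions containing the compound of interest were selected for NMR analysis.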
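NMR analysis
NMR data were acquired on a Bruker Avance III HD 600 MHz NMR spectrometer (1H operating frequency 599.85 MHz) equipped with a 5 mm cryogenically cooled DCH probe optimized for 13C and 1H (Bruker Biospin, Karlsruhe, Germany). NMR data were recorded in CDCl3 (Euriso-top, 99.8 atom% D) in 5 mm tubes, with temperature equilibration to 300 K, optimization of lock parameters, gradient shimming and setting of receiver gain all controlled automatically by Topspin ver. 3.2 and IconNMR ver. 4.7.5 (Bruker Biospin, Karlsruhe, Germany). 1H and 13C chemical shifts were referenced to the residual solvent signals at δH 7.26 ppm and δC 77.16 ppm, respectively. 1H spectra were acquired with 30° pulses and 64k data points, zero-filled to 256k data points, with a spectral width of 12 kHz, a relaxation delay of 1 s and an acquisition time of 2.7 s. 13C spectra were acquired with 1H decoupling using the Waltz-16 composite pulse decoupling scheme. 2D homo- and heteronuclear experiments were acquired with 4096 (HMBC), 2048 (DQF-COSY and ROESY) or 1024 (multiplicity-edited HSQC) data points in the direct dimension and 256 (DQF-COSY, HMBC and ROESY) or 128 (multiplicity-edited HSQC) data points in the indirect dimension. 2D NMR data were zero-filled to 1k in F1 and to twice the number of points in F2, with forward linear prediction (LPBIN = 0) applied in F1. NMR data were processed using Topspin ver. 4.0.9 (Bruker Biospin, Karlsruhe, Germany).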
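The stated 1H acquisition parameters can be cross-checked against each other. On Bruker instruments the acquisition time is the number of acquired points multiplied by the dwell time, where the dwell time is 1/(2 x spectral width in Hz). The sketch below applies this relation, assuming that "64k" means 65536 points; the function name is hypothetical.

```python
def acquisition_time_s(total_points: int, spectral_width_hz: float) -> float:
    """Acquisition time from point count and spectral width (Bruker convention).

    The dwell time is 1 / (2 * spectral width), so AQ = TD / (2 * SW).
    """
    if total_points <= 0 or spectral_width_hz <= 0:
        raise ValueError("points and spectral width must be positive")
    return total_points / (2.0 * spectral_width_hz)


# 64k data points (assumed to mean 65536) at a 12 kHz spectral width.
aq_1h = acquisition_time_s(64 * 1024, 12_000.0)
```

The result of about 2.73 s agrees with the acquisition time of 2.7 s given in the text, which suggests the reported point count, spectral width and acquisition time are mutually consistent.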
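NMR spectroscopic data for 14-OH-dehydroabietadiene are shown in Table 4.
Table 4: 1H and 13C NMR spectroscopic data for 1 (14-OH-dehydroabietadiene) (table not reproduced in this text)
a 1H NMR (599.85 MHz) and 13C NMR (150.83 MHz) data were obtained in CDCl3. b nH = number of hydrogens. Multiplicities are reported as apparent splittings: s = singlet, d = doublet, t = triplet, sep = septet, m = multiplet (also used for overlap), br = broad. A' denotes the highest chemical shift value and B' the lowest chemical shift value.
Figure 3 shows the 1H NMR spectrum of 14-OH-dehydroabietadiene in CDCl3 at 599.85 MHz, and Figure 4 shows the 13C NMR spectrum of 14-OH-dehydroabietadiene in CDCl3 at 150.83 MHz, confirming the identity of the compound.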
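Example 5: Expression in S. cerevisiae of genes leading to production of triptophenolide and triptonide
This was a preliminary study to assess the effect of expressing genes leading to production of triptophenolide and triptonide in S. cerevisiae.
The background yeast strain generated in Example 2 was further transformed with vectors each containing an individual diterpene biosynthesis gene or T. wilfordii CYPs (TwCYPs).
The LC-MS profiles of the extracts are shown in Figure 5, where it can be seen that transformation of the background strain with TwCYP82D274V1, TwCYP71BE85 and TwCYP71BE86, encoding enzymes having the amino acid sequences of SEQ ID NO:1, SEQ ID NO:3 and SEQ ID NO:4, respectively, leads to formation of triptophenolide, and that further transformation with CYP82D213, encoding an enzyme having the amino acid sequence of SEQ ID NO:5, leads to formation of triptonide.
Figure 6 shows the content profile of oxygenated diterpenoid compounds detected in extracts of the resulting transformants.
The left panel shows the content of triptophenolide and triptonide, and the right panel shows the content of 14-OH-dehydroabietadiene. Expression of the gene TwB5#1, encoding an enzyme having the amino acid sequence of SEQ ID NO:8, leads to significantly higher production of triptophenolide and triptonide.
The TwB5#2-6 genes are other T. wilfordii cytochrome B5 genes that do not increase production of triptophenolide or triptonide (sequences not provided).
Example 6: Production of oxygenated diterpenoid compounds in S. cerevisiae and N. benthamiana
All engineered S. cerevisiae strains and N. benthamiana plants were cultivated as described in the "Materials and methods" section above. Likewise, diterpenoid metabolites were extracted, analysed by LC-MS, GC-MS and NMR, and quantified as described above.
The preferred experimental organism is yeast in which the heterologous genes have been stably introduced, as this gives the most precise and reproducible results.
The results are shown in Figures 7-9. It is clear from the figures that organisms as different as yeast cells and tobacco plants are both capable of producing, at high titres and according to the method of the invention, the key intermediates required in the proposed triptonide biosynthetic pathway.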
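Figures 10-26 show NMR spectra of further key compounds. NMR spectroscopic data for the compounds produced are shown in Tables 5-21 below.
Table 5. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F1-14
Table 6. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F1-15
Table 7. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F1-18
Table 8. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F1-23 (F15P1)
Table 9. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F1-31
Table 10. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F2-X
Table 11. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F2-10
Table 12. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F20P1
Table 13. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F20P2
Table 14. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F20P3
Table 15. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F20P4
Table 16. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F15P2
Table 17. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F55P2
Table 18. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F55P3
Table 19. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F15P4
Table 20. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F20P5
Table 21. 1H and 13C NMR data and 2D HMBC and ROESY correlations for F60P1
a 1H NMR (600.13 MHz) and 13C NMR (150.90 MHz) data were obtained on samples in CDCl3. b Assignments based on HSQC and HMBC experiments. c Multiplicities reported as apparent splittings: s = singlet, d = doublet, t = triplet, sext = sextet, m = multiplet (including overlapping resonances), br = broad. α denotes a Me pointing into the plane and β denotes a Me pointing out of the plane. A denotes the lowest chemical shift value and B the highest chemical shift value.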
References
1. J. Andersen-Ranberg et al., Expanding the Landscape of Diterpene Structural Diversity through Stereochemically Controlled Combinatorial Biosynthesis. Angewandte Chemie International Edition, n/a (2016).
2. I. Pateraki et al., Total biosynthesis of the cyclic AMP booster forskolin from Coleus forskohlii. Elife 6, e23001 (2017).
3. I. Pateraki et al., Manoyl Oxide (13R), the Biosynthetic Precursor of Forskolin, Is Synthesized in Specialized Root Cork Cells in Coleus forskohlii. Plant Physiology 164, 1222-1236 (2014).
4. N. L. Hansen et al., The terpene synthase gene family in Tripterygium wilfordii harbors a labdane-type diterpene synthase among the monoterpene synthase TPS-b subfamily. The Plant Journal 89, 429-441 (2017).
5. H. H. Nour-Eldin, B. G. Hansen, M. H. H. Nørholm, J. K. Jensen, B. A. Halkier, Advancing uracil-excision based cloning towards an ideal technique for cloning PCR fragments. Nucleic Acids Research 34, e122 (2006).
6. N. B. Jensen et al., EasyClone: method for iterative chromosomal integration of multiple genes in Saccharomyces cerevisiae. FEMS Yeast Research 14, 238-248 (2014).
7. M. H. H. Nørholm, A mutant Pfu DNA polymerase designed for advanced uracil-excision DNA engineering. BMC Biotechnology 10, 21 (2010).
8. R. D. Gietz, R. H. Schiestl, High-efficiency yeast transformation using the LiAc/SS carrier DNA/PEG method. Nature Protocols 2, 31-34 (2007).
9. N. L. Hansen et al., Integrating pathway elucidation with yeast engineering to produce polpunonic acid the precursor of the anti-obesity agent celastrol. Microb Cell Fact 19(1), 15 (2020).
10. O. Voinnet, S. Rivas, P. Mestre, D. Baulcombe, An enhanced transient expression system in plants based on suppression of gene silencing by the p19 protein of tomato bushy stunt virus [retracted in: Plant J. 2015 Nov;84(4):846]. Plant J. 33(5), 949-956 (2003). doi:10.1046/j.1365-313x.2003.01676.x
项目
1.一种能够产生含氧二萜化合物的重组宿主细胞,其中所述宿主细胞能够产生松香烷型二萜烯和/或脱氢松香二烯,并且已经用编码具有细胞色素P450活性的酶的第一基因转化,该酶能够将松香烷型二萜烯和/或脱氢松香二烯转化为14-羟基脱氢松香二烯。
2.根据项目1所述的重组宿主细胞,其中所述第一基因编码多肽,所述多肽包含与SEQ ID NO:1(TwCYP82D274V1)具有至少80%序列同一性、优选至少85%序列同一性、优选至少90%序列同一性、优选至少95%序列同一性、更优选至少98%序列同一性的氨基酸序列、或其成熟多肽。
3.根据项目1或2所述的重组宿主细胞,其中所述重组宿主细胞进一步包含:
编码具有细胞色素P450活性的第二酶的第二基因和编码具有细胞色素P450活性第三酶的第三基因,其中:
所述第二基因编码多肽,所述多肽包含与SEQ ID NO:4(TwCYP71BE86)具有至少80%序列同一性、优选至少85%序列同一性、优选至少90%序列同一性、优选至少95%序列同一性、更优选至少98%序列同一性的氨基酸序列、或其成熟多肽;
所述第三基因编码多肽,所述多肽包含与SEQ ID NO:3(TwCYP71BE85)具有至少80%序列同一性、优选至少85%序列同一性、优选至少90%序列同一性、优选至少95%序列同一性、更优选至少98%序列同一性的氨基酸序列、或其成熟多肽。
4.根据项目3所述的重组宿主细胞,其中所述宿主细胞还包含编码具有细胞色素B5活性的多肽的基因,所述多肽包含与SEQ ID NO:8(TwB5#1)具有至少80%序列同一性、优选至少85%序列同一性、优选至少90%序列同一性、优选至少95%序列同一性、更优选至少98%序列同一性的氨基酸序列、或其成熟多肽。
5.根据项目3或4所述的重组宿主细胞,其中所述重组宿主细胞进一步包含:
编码具有细胞色素P450活性的第四酶的第四基因,其中:
所述第四基因编码多肽,所述多肽包含与SEQ ID NO:5(TwCYP82D213)具有至少80%序列同一性、优选至少85%序列同一性、优选至少90%序列同一性、优选至少95%序列同一性、更优选至少98%序列同一性的氨基酸序列、或其成熟多肽。
6.根据项目5所述的重组宿主细胞,其中所述重组宿主细胞进一步包含:
编码具有细胞色素P450活性的第五酶的第五基因和编码具有细胞色素P450活性第六酶的第六基因,其中:
所述第五基因编码多肽,所述多肽包含与SEQ ID NO:6(TwCYP82D217)具有至少80%序列同一性、优选至少85%序列同一性、优选至少90%序列同一性、优选至少95%序列同一性、更优选至少98%序列同一性的氨基酸序列、或其成熟多肽;
所述第六基因编码多肽,所述多肽包含与SEQ ID NO:7(TwCYP82D275)具有至少80%序列同一性、优选至少85%序列同一性、优选至少90%序列同一性、优选至少95%序列同一性、更优选至少98%序列同一性的氨基酸序列、或其成熟多肽。
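7. The recombinant host cell according to any one of the preceding items, wherein the host cell capable of producing the abietane-type diterpene and/or dehydroabietadiene is a recombinant cell transformed with one or more genes encoding:
a. a geranylgeranyl diphosphate synthase;
b. a diterpene synthase capable of converting GGPP into the abietane-type diterpene;
c. a combination of two or more diterpene synthases jointly capable of converting GGPP into the abietane-type diterpene; or
d. a copalyl pyrophosphate synthase and an abietane-type diterpene synthase.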
8.根据项目7所述的重组宿主细胞,其中所述双(牻牛儿基)二磷酸合酶是包含SEQID NO:73或SEQ ID NO:81的氨基酸序列的多肽。
9.根据项目7所述的重组宿主细胞,其中能够将GGPP转化为松香烷型二萜烯的两种或多种二萜合酶的组合是包含SEQ ID NO:67的氨基酸序列的多肽和包含SEQ ID NO:68的氨基酸序列的组合;或是包含SEQ ID NO:69的氨基酸序列的多肽和包含SEQ ID NO:70的氨基酸序列的组合。
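10. The recombinant host cell according to item 7, wherein the combination of the copalyl pyrophosphate synthase and the abietane-type diterpene synthase is a combination of a polypeptide comprising the amino acid sequence of SEQ ID NO:71 and a polypeptide comprising SEQ ID NO:72.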
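11. The recombinant host cell according to any one of the preceding items, wherein the recombinant host cell is selected from prokaryotic cells and eukaryotic cells.
12. The recombinant host cell according to item 11, which is a prokaryotic cell selected from Escherichia coli, Bacillus, Lactobacillus and Corynebacterium.
13. The recombinant host cell according to item 11, which is a eukaryotic cell selected from Saccharomyces cerevisiae, Schizosaccharomyces, Kluyveromyces, Pichia, Candida and Yarrowia species.
14. The recombinant host cell according to item 11, wherein the cell is a Saccharomyces cerevisiae cell.
15. Use of the recombinant host cell according to any one of the preceding items for the production of an oxygenated diterpenoid compound.
16. The use according to item 15, wherein the oxygenated diterpenoid compound is selected from 14-OH-dehydroabietadiene, triptophenolide and triptonide.
17. The use according to item 16, wherein the oxygenated diterpenoid compound is triptonide, and wherein the triptonide is further converted into triptolide.
18. The use according to any one of items 15 to 17, wherein the oxygenated diterpenoid compound is recovered using one or more separation and/or chromatography steps.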
19.具有细胞色素P450酶活性的多肽,所述多肽包含与序列SEQ ID NO:1、SEQ IDNO:2、SEQ ID NO:3、SEQ ID NO:4、SEQ ID NO:5、SEQ ID NO:6、SEQ ID NO:7、SEQ ID NO:8中的一种具有至少80%序列同一性、优选至少85%序列同一性、优选至少90%序列同一性,优选至少95%序列同一性、优选至少96%序列同一性、优选至少97%序列同一性、优选至少98%序列同一性、或甚至100%序列同一性的氨基酸序列,或其成熟多肽。
20.编码项目19所述的多肽的多核苷酸。
21.包含项目20所述的多核苷酸的质粒、表达载体、表达构建体或重组宿主细胞。
22.化合物14-OH-脱氢松香二烯。
序列表
<110> 哥本哈根大学
<120> 含氧二萜化合物的制备
<130> P6004PC00
<160> 81
<170> PatentIn version 3.5
<210> 1
<211> 533
<212> PRT
<213> Tripterygium wilfordii
<400> 1
Met Glu Phe Leu Leu Ser Leu Pro Thr Asn Thr Ile Ala Pro Lys Ile
1 5 10 15
Phe Ala Val Leu Leu Leu Phe Ile Cys Leu Arg Ile Leu Thr Asn Val
20 25 30
Leu Lys Pro Lys Lys Ser Lys Thr Ser Pro Pro Gln Ala Ser Ala Ala
35 40 45
Trp Pro Leu Ile Gly His Leu Leu His Leu Arg Gly Pro Gln Ala Pro
50 55 60
His Ile Thr Leu Gly Lys Met Ala Asp Lys Tyr Gly Pro Ile Phe Lys
65 70 75 80
Ile Lys Leu Gly Val His Pro Thr Leu Val Ile Ser Asp Ser Glu Val
85 90 95
Ala Lys Glu Cys Leu Thr Thr His Asp Ile Ala Leu Ala Gly Arg Pro
100 105 110
Ala Thr Val Ala Met Glu Ile Met Gly Tyr Asn His Ala Met Phe Ala
115 120 125
Phe Ser Pro Tyr Gly Pro Tyr Trp Arg His Met Arg Lys Leu Ala Thr
130 135 140
Val Glu Leu Leu Ser Ala Gln Arg Leu Glu Thr Phe Lys His Ile Arg
145 150 155 160
Glu Ser Glu Leu Lys Arg Ser Met Lys Glu Met Tyr Gln Ser Trp Val
165 170 175
His Asn Lys Ser Gly Ser Gly Asp Ser Asn His Val Thr Val Asp Met
180 185 190
Thr Arg Ile Leu Gly Asp Ile Ile Ala Asn Val Ile Tyr Arg Met Val
195 200 205
Val Gly Lys Val Tyr Ala Ser Lys Gly Glu Glu Asp Ala Arg Trp Lys
210 215 220
Gln Val Val Trp Glu Tyr Ile Lys Leu Leu Ser His Phe Gly Val Gly
225 230 235 240
Asp Ala Leu Pro Phe Leu Arg Trp Leu Asp Leu Gly Gly Val Glu Lys
245 250 255
Ser Met Lys Lys Ala Ala Lys Glu Leu Asp Ile Tyr Val Glu Glu Trp
260 265 270
Leu Glu Glu His Lys Lys Lys Arg Ser Glu Arg Lys Ser Asp Asn Gly
275 280 285
Ile Val Glu Glu Asp Phe Met Asp Val Met Leu Ser Val Phe Asp Asp
290 295 300
Asp Asp Gln Leu Glu Asn Phe Ala His His Ser Ala His Thr Ile Asn
305 310 315 320
Lys Ala Met Cys Leu Ala Ile Ile Leu Ala Ala Ser Asp Thr Thr Lys
325 330 335
Thr Thr Leu Thr Trp Ala Leu Ser Leu Leu Leu Asn His Pro Asp Val
340 345 350
Met Lys Lys Val Gln Gln Glu Leu Ala Ala His Ile Gly Pro Asp Lys
355 360 365
Pro Val Lys Glu Ser Asp Val Lys Ser Leu Val Tyr Leu Glu Ala Val
370 375 380
Val Lys Glu Thr Leu Arg Leu Tyr Pro Pro Gly Pro Leu Gly Leu Pro
385 390 395 400
His Glu Ser Met Glu Asp Cys Thr Val Ala Gly Tyr His Val Pro Ser
405 410 415
Gly Thr Arg Ile Leu Tyr Asn Leu Trp Lys Ile Gln Gln Asp Pro Gln
420 425 430
Val Trp Glu Asn Pro Ser Glu Phe Lys Pro Asp Arg Phe Leu Thr Thr
435 440 445
His Lys Asp Val Asp Val Arg Gly Arg Asn Phe Glu Tyr Leu Pro Phe
450 455 460
Gly Ser Gly Arg Arg Met Cys Pro Gly Met Ser Phe Ala Leu Gln Val
465 470 475 480
Met Glu Val Ser Leu Ala Asn Met Leu His Gly Phe Asp Phe Ala Thr
485 490 495
Pro Asn Gly Lys Pro Val Asp Met Thr Glu Val Asn Gly Leu Val Thr
500 505 510
Asp Arg Ala Thr Pro Leu Glu Ala Leu Ile Thr Pro Arg Leu Pro Ala
515 520 525
His Leu Tyr Met Gly
530
<210> 2
<211> 533
<212> PRT
<213> Tripterygium wilfordii
<400> 2
Met Glu Phe Leu Leu Ser Leu Pro Thr Asn Thr Ile Ala Pro Lys Ile
1 5 10 15
Phe Ala Val Leu Leu Leu Phe Ile Cys Leu Arg Ile Leu Thr Asn Val
20 25 30
Leu Lys Pro Lys Lys Ser Lys Thr Ser Pro Pro Gln Ala Ser Gly Ala
35 40 45
Trp Pro Leu Ile Gly His Leu Leu His Leu Arg Gly Pro Gln Ala Pro
50 55 60
His Ile Thr Leu Gly Lys Met Ala Asp Lys Tyr Gly Pro Ile Phe Lys
65 70 75 80
Ile Lys Leu Gly Val His Pro Thr Leu Val Ile Ser Asp Ser Glu Phe
85 90 95
Ala Lys Glu Cys Leu Thr Thr His Asp Ile Ala Leu Ala Gly Arg Pro
100 105 110
Ala Thr Val Ala Met Glu Ile Met Gly Tyr Asn His Ala Met Phe Ala
115 120 125
Phe Ser Pro Tyr Gly Pro Tyr Cys Arg His Met Arg Lys Leu Ala Thr
130 135 140
Val Glu Leu Leu Ser Ala Gln Arg Leu Glu Thr Phe Lys His Ile Arg
145 150 155 160
Glu Ser Glu Leu Lys Arg Ser Met Lys Glu Met Tyr Gln Ser Trp Val
165 170 175
His Asn Lys Ser Gly Ser Gly Asp Ser Asn His Val Thr Val Asp Met
180 185 190
Thr Arg Ile Leu Gly Asp Ile Ile Ala Asn Val Ile Tyr Arg Met Val
195 200 205
Val Gly Lys Val Tyr Ala Ser Lys Gly Glu Glu Asp Ala Arg Trp Lys
210 215 220
Gln Val Val Trp Glu Tyr Ile Lys Leu Leu Ser His Phe Gly Val Gly
225 230 235 240
Asp Ala Leu Pro Phe Leu Arg Trp Leu Asp Leu Gly Gly Val Glu Lys
245 250 255
Ser Met Lys Lys Ala Ala Lys Glu Leu Asp Ile Tyr Val Glu Glu Trp
260 265 270
Leu Glu Glu His Lys Lys Lys Arg Ser Glu Arg Lys Ser Asp Asn Gly
275 280 285
Ile Val Glu Glu Asp Phe Met Asp Val Met Leu Ser Val Phe Asp Asp
290 295 300
Asp Asp Gln Leu Glu Asn Phe Ala His His Ser Ala His Thr Ile Asn
305 310 315 320
Lys Ala Met Cys Leu Ala Ile Ile Leu Ala Ala Ser Asp Thr Thr Lys
325 330 335
Thr Thr Leu Thr Trp Ala Leu Ser Leu Leu Leu Asn His Pro Asp Val
340 345 350
Met Lys Lys Val Gln Gln Glu Leu Ala Ala His Ile Gly Pro Asp Lys
355 360 365
Pro Val Lys Glu Ser Asp Val Lys Ser Leu Val Tyr Leu Glu Ala Val
370 375 380
Val Lys Glu Thr Leu Arg Leu Tyr Pro Pro Gly Pro Leu Gly Leu Pro
385 390 395 400
His Glu Ser Met Glu Asp Cys Thr Val Ala Gly Tyr His Val Pro Ser
405 410 415
Gly Thr Arg Ile Leu Tyr Asn Leu Trp Lys Ile Gln Gln Asp Pro Gln
420 425 430
Val Trp Glu Asn Pro Ser Glu Phe Lys Pro Asp Arg Phe Leu Thr Thr
435 440 445
His Lys Asp Val Asp Val Arg Gly Arg Asn Phe Glu Tyr Leu Pro Phe
450 455 460
Gly Ser Gly Arg Arg Met Cys Pro Gly Met Ser Phe Ala Leu Gln Val
465 470 475 480
Met Glu Val Ser Leu Ala Asn Met Leu His Gly Phe Asp Phe Ala Thr
485 490 495
Pro Asn Gly Lys Pro Val Asp Met Thr Glu Val Asn Gly Leu Val Thr
500 505 510
Asp Arg Ala Thr Pro Leu Glu Ala Leu Ile Thr Pro Arg Leu Pro Ala
515 520 525
His Leu Tyr Met Gly
530
<210> 3
<211> 508
<212> PRT
<213> Tripterygium wilfordii
<400> 3
Met Asp Leu Leu Gln Phe Pro Ser Val Ser Ile Leu Leu Gly Phe Val
1 5 10 15
Phe Phe Met Phe Met Val Leu Lys Val Trp Lys Arg Phe Glu Ala Asn
20 25 30
Gly Ser Thr Ser Asn Leu Pro Pro Gly Pro Trp Lys Leu Pro Ile Ile
35 40 45
Gly Asn Leu His Gln Leu Gly Gly Ser Asp Pro Pro His Arg Ala Leu
50 55 60
Gly Glu Leu Ala Lys Lys Tyr Gly Pro Leu Met Phe Leu Gln Leu Gly
65 70 75 80
Glu Ile Gln Thr Leu Val Val Ser Ser Ala Glu Tyr Ala Glu Glu Val
85 90 95
Leu Lys Thr His Asp Thr Val Phe Ala Ser Arg Pro Gln Met His Ser
100 105 110
Leu Glu Ile Met Ser Tyr Asp Tyr Lys Asp Ile Thr Phe Ser Pro Ser
115 120 125
Asp Gly Ser Trp Arg Arg Arg Arg Lys Ile Cys Val Gln Glu Leu Leu
130 135 140
Ser Ala Lys Arg Val Gln Ser Phe Arg Ser Thr Arg Glu Lys Glu Leu
145 150 155 160
Ser Lys Leu Ile Gln Trp Ile Phe Ser Gln Ala Gly Thr Ser Ile Asn
165 170 175
Leu Thr Thr Lys Ile Tyr Ser Ser Thr Cys Thr Leu Ser Ser Arg Met
180 185 190
Ala Phe Ser Asp Glu Cys Lys Tyr Gln Glu Glu Phe Ile Ser Ile Leu
195 200 205
Lys Asp Leu Leu Lys Ile Ala Ser Gly Phe Asn Ile Glu Asp Met Phe
210 215 220
Pro Ser Met Lys Phe Leu His Leu Ile Ser Gly Ala Ser Ser Lys Ile
225 230 235 240
Glu Lys Leu His Lys Gln Leu Asp Arg Ile Val Gly Ser Ile Ile Asp
245 250 255
Glu His Ile Asn Leu Asn Thr Arg Lys Ser Glu Gly Asn Glu Asp Leu
260 265 270
Val Asp Val Leu Leu Lys Tyr His Glu Gln Gly Asp Ser Glu Phe Ser
275 280 285
Leu Ser Met Glu Glu Ile Lys Ala Ile Ile Cys Asp Ile Tyr Leu Ala
290 295 300
Gly Thr Glu Thr Ser Ser Thr Thr Val Asp Trp Thr Met Ala Glu Leu
305 310 315 320
Ile Lys Asn Pro Arg Val Met Lys Lys Ala Gln Ala Glu Val Arg Gln
325 330 335
Val Phe Asp Ser Arg Gly Ser Val Asp Glu Thr Gly Ile Pro Glu Leu
340 345 350
Lys Tyr Leu Lys Leu Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro
355 360 365
Gly Pro Leu Leu Leu Pro Arg Glu Asn Ala Lys Ser Cys Glu Ile Asn
370 375 380
Glu Tyr Val Ile Pro Ala Lys Thr Arg Val Met Val Asn Gly Trp Ala
385 390 395 400
Ile Gly Arg Asp Pro Lys Tyr Trp Pro Lys Glu Pro Glu Lys Phe Tyr
405 410 415
Pro Glu Arg Phe Ile Asp Asn Pro Ile Asp Tyr Lys Gly Thr Asn Phe
420 425 430
Glu Tyr Ile Pro Phe Gly Ala Gly Arg Arg Met Cys Pro Gly Met Ala
435 440 445
Phe Gly Leu Ala Asn Val Glu Leu Pro Leu Ser Gln Phe Leu Tyr Tyr
450 455 460
Phe Asp Trp Lys Leu Ala Asp Gly Met Val Pro Glu Asn Leu Asn Met
465 470 475 480
Ala Glu Ala Phe Ala Ala Thr Val Cys Arg Lys Asp Asp Leu Tyr Leu
485 490 495
Ile Pro Thr Pro Tyr Cys Pro Ser Pro Ala Phe Asn
500 505
<210> 4
<211> 499
<212> PRT
<213> Tripterygium wilfordii
<400> 4
Met Asp Leu Gln Leu Pro Ser Phe Pro Ile Leu Ser Ser Ile Ile Leu
1 5 10 15
Leu Ile Leu Val Val Leu Lys Ser Val Leu Arg Pro Ser Lys Leu Pro
20 25 30
Pro Gly Pro Trp Lys Leu Pro Leu Ile Gly Asn Leu His Gln Leu Ala
35 40 45
Gln Asp Leu Pro His Arg Ala Leu Gln Lys Leu Ala Lys Lys His Gly
50 55 60
Pro Leu Met His Leu His Phe Gly Glu Val Pro Thr Leu Val Val Thr
65 70 75 80
Ser Pro Glu Tyr Ala Lys Glu Val Met Lys Thr His Asp Ile Thr Phe
85 90 95
Ala Ser Arg Pro Leu Leu Asn Ala Met Lys Val Met Thr Tyr Asp His
100 105 110
Thr Asp Ile Ala Phe Ala Pro Tyr Gly Glu Tyr Trp Arg Gln Leu Arg
115 120 125
Lys Ile Cys Thr Ile Glu Leu Leu Ser Val Lys Arg Val Gln Ser Phe
130 135 140
Arg Pro Ile Arg Glu Gln Glu Thr Ser Asn Val Ile Glu Trp Ile Gly
145 150 155 160
Ser Asn Ala Gly Ser Ser Ile Asn Leu Thr Glu Arg Leu Tyr Thr Thr
165 170 175
Ile Tyr Ala Leu Val Ser Lys Val Ala Phe Gly Arg Thr Cys Gly Arg
180 185 190
Gly Glu His Glu Glu Phe Ile Glu Tyr Ser Lys Ala Ser Gln Asn Arg
195 200 205
Ala Ser Gly Phe Asn Ile Val Asp Val Phe Pro Ser Leu Lys Leu Val
210 215 220
His Trp Ile Met Gly Glu Gly Lys Lys Thr Glu Arg Leu His Lys Gln
225 230 235 240
Gly Asp Met Leu Leu Gly Asn Ile Ile Asn Gln His Val Lys Lys Pro
245 250 255
Val Thr Gly Lys Gly Asp Asp Glu His Glu Asp Leu Val Asp Val Leu
260 265 270
Leu Lys Phe His Glu Glu Gly Asp Phe Pro Leu Thr Ile Asn Asn Ile
275 280 285
Lys Ser Val Ile Gln Asp Ile Phe Val Ala Gly Gly Glu Thr Ser Ala
290 295 300
Thr Thr Ile Asp Trp Ala Met Arg Glu Met Met Lys Asn Pro Arg Val
305 310 315 320
Met Lys Lys Ala Gln Ala Glu Val Arg Gln Val Phe Asp Ser Arg Gly
325 330 335
Arg Val Asp Glu Thr Ala Val Pro Glu Leu Lys Tyr Leu Lys Leu Val
340 345 350
Leu Lys Glu Thr Leu Arg Leu His Pro Pro Leu Pro Phe Leu Leu Pro
355 360 365
Arg Ile Asn Trp Glu Arg Cys Glu Ile Asn Gly Tyr Glu Ile Ala Ala
370 375 380
Asn Thr Lys Val Ile Val Asn Ala Trp Ala Ile Gly Arg Asp Pro Asn
385 390 395 400
Tyr Trp Thr Glu Ala Glu Arg Phe Tyr Pro Glu Arg Phe Leu Glu Lys
405 410 415
Ser Ala Asp Tyr Lys Gly Thr Ser Phe Glu Tyr Thr Pro Phe Gly Ala
420 425 430
Gly Arg Arg Leu Cys Pro Gly Met Ser Phe Gly Leu Ala Asn Val Glu
435 440 445
Phe Pro Leu Ser Gln Leu Leu Tyr His Phe Asp Trp Asn Leu Thr Gly
450 455 460
Gly Met Lys Pro Glu Asp Leu Asn Met Ile Glu Ser Phe Asp Val Thr
465 470 475 480
Met Arg Ala Lys Asp Asp Leu His Leu Val Pro Thr Pro Tyr Arg Ser
485 490 495
Leu Ser Gly
<210> 5
<211> 530
<212> PRT
<213> Tripterygium wilfordii
<400> 5
Met Glu Phe Leu Leu Ser Leu Pro Thr Asn Thr Ile Ala Thr Lys Ile
1 5 10 15
Phe Ala Val Leu Leu Leu Tyr Leu Phe Leu Arg Ile Phe Thr Asn Val
20 25 30
Leu Lys Pro Lys Lys Ser Lys Thr Ser Pro Pro Gln Ala Gly Gly Ala
35 40 45
Trp Pro Leu Ile Gly His Leu His Leu Leu Ile Gly Pro Gln Ala Ser
50 55 60
Tyr Ile Thr Leu Ser Lys Met Ala Asp Lys Tyr Gly Pro Ile Phe Lys
65 70 75 80
Ile Lys Leu Gly Val His Pro Thr Leu Val Ile Ser Asn Ser Glu Val
85 90 95
Ala Lys Glu Cys Leu Thr Thr His Asp Lys Val Leu Ala Asn Arg Pro
100 105 110
Ala Thr Val Ala Met Glu Ile Met Gly Tyr Asn His Ala Met Phe Gly
115 120 125
Trp Ser Pro Tyr Gly Pro Tyr Trp Arg Gln Leu Arg Lys Leu Val Thr
130 135 140
Val Glu Leu Leu Ser Asn Gln Arg Leu Lys Thr Phe Lys His Ile Arg
145 150 155 160
Glu Ser Glu Val Lys Asn Ser Leu Lys Glu Met Tyr Gln Ser Trp Val
165 170 175
His Asn Lys Ser Gly Asp Ser Asn His Val Ser Val Asp Met Thr Arg
180 185 190
Ile Phe Gly Asp Ile Thr Gly Asn Leu Ile Tyr Arg Ile Val Val Gly
195 200 205
Lys Val Tyr Ala Arg Lys Gly Glu Gly Val Val Arg Trp Lys Gln Val
210 215 220
Val Gly Asp Tyr Met Lys Leu Leu Thr His Phe Asn Val Gly Asp Ala
225 230 235 240
Met Pro Phe Met Arg Trp Phe Asp Leu Gly Gly Leu Glu Lys Ala Met
245 250 255
Lys Ile Thr Phe Lys Glu Leu Asp Gly Tyr Val Glu Glu Trp Leu Glu
260 265 270
Glu His Lys Lys Lys Arg Ser Asn Ser Gly Gly His Gly Ile Val Glu
275 280 285
Glu Asp Phe Met Asp Val Met Leu Ser Ile Phe Asp Asp Gly Gly Gln
290 295 300
Gln Glu Tyr Cys Thr Asp Asn Ser Thr His Thr Thr Asn Lys Ala Met
305 310 315 320
Cys Met Ala Leu Ile Leu Gly Ala Ser Glu Thr Thr Lys Thr Thr Leu
325 330 335
Thr Trp Ser Leu Ser Leu Leu Leu Asn Asn Leu Asp Val Leu Lys Lys
340 345 350
Val Lys Gln Glu Leu Ala Ala His Ile Gly Pro Glu Thr Leu Val Thr
355 360 365
Glu Ser Asp Val Asn Ser Leu Val Tyr Leu Asp Ala Val Ile Thr Glu
370 375 380
Thr Leu Arg Leu Tyr Pro Leu Gly Pro Leu Gly Leu Pro His Glu Ser
385 390 395 400
Ile Glu Asp Cys Thr Ile Ala Gly Tyr His Val Pro Ala Arg Thr Arg
405 410 415
Ile Leu Phe Asn Leu Trp Lys Ile His Gln Asp Pro Arg Val Trp Glu
420 425 430
Asn Pro Leu Glu Phe Lys Pro Glu Arg Phe Leu Lys Glu His Asn Asn
435 440 445
Ile Asp Val Arg Gly Gly His Phe Glu Leu Leu Pro Phe Gly Ser Gly
450 455 460
Arg Arg Met Cys Pro Gly Val Ser Phe Ala Leu Gln Val Leu Lys Leu
465 470 475 480
Thr Leu Ala Asn Met Leu His Gly Phe Asp Phe Ala Thr Pro Asn Asp
485 490 495
Glu Pro Val Asp Met Thr Glu Val Asn His Met Ala Thr Thr Arg Ala
500 505 510
Thr Pro Leu Glu Thr Leu Ile Ser Pro Arg Leu Pro Ser His Leu Tyr
515 520 525
Met Gly
530
<210> 6
<211> 533
<212> PRT
<213> Tripterygium wilfordii
<400> 6
Met Glu Phe Leu Leu Ser Leu Pro Thr Asn Thr Ile Ala Thr Thr Ser
1 5 10 15
Phe Ala Val Leu Leu Leu Tyr Leu Cys Leu Arg Ile Phe Thr Asn Val
20 25 30
Leu Lys Pro Asn Lys Ser Lys Thr Ser Pro Pro Gln Ala Gly Gly Ala
35 40 45
Trp Pro Leu Ile Gly His Leu His Leu Phe Arg Gly Pro Gln Pro Pro
50 55 60
His Ile Thr Leu Gly Lys Met Ala Asp Lys His Gly Pro Ile Phe Lys
65 70 75 80
Ile Lys Leu Gly Val Ser Pro Thr Leu Val Ile Ser Asp Ser Gln Ile
85 90 95
Ala Lys Glu Cys Phe Thr Thr His Asp Lys Ile Leu Ala Gly Arg Pro
100 105 110
Ala Tyr Val Ala Leu Glu Ile Met Gly Tyr Asn Asn Ala Met Phe Gly
115 120 125
Phe Ser Pro Tyr Gly Pro Tyr Trp Arg Tyr Ile Arg Lys Leu Ala Thr
130 135 140
Ile Glu Leu Leu Ser Asn Lys Arg Leu Glu Thr Phe Lys His Ile Arg
145 150 155 160
Glu Ser Glu Val Lys Asn Ala Met Lys Glu Met Tyr Gln Ser Trp Val
165 170 175
Val His Asn Lys Ser Ala Ser Gly His Ser Asn His Val Ser Val Asp
180 185 190
Met Ser Lys Ile Leu Gly Asp Ile Ser Ser Asn Val Thr Tyr Arg Ala
195 200 205
Met Val Gly Lys Val Tyr Ala Ser Lys Gly Glu Glu Asp Val Arg Trp
210 215 220
Lys Gln Val Leu Ser Glu Tyr Met Lys Leu Leu Ser Asn Phe Ser Ser
225 230 235 240
Cys Asp Ala Leu Pro Phe Leu Arg Trp Phe Asp Phe Gly Gly Leu Glu
245 250 255
Lys Ser Met Lys Arg Thr Phe Lys Glu Leu Asp Asn Tyr Val Glu Glu
260 265 270
Trp Leu Gln Glu His Arg Lys Lys Arg Ser Ser Ser Gly Asp Gly Gly
275 280 285
Ile Val Val Glu Asp Phe Met Asp Val Met Leu Ser Ile Phe Asp Asn
290 295 300
Val Gly Glu His Glu Asn Phe Thr Asp Tyr Ser Pro His Thr Ile Asn
305 310 315 320
Lys Ala Thr Cys Met Ser Leu Leu Leu Gly Ala Ser Asp Thr Thr Lys
325 330 335
Ser Thr Met Ile Trp Ser Leu Ser Leu Leu Leu Asn His Pro Asp Val
340 345 350
Leu Lys Lys Val Gln Gln Glu Leu Asp Ala His Ile Gly Pro Glu Thr
355 360 365
Leu Val Asn Glu Ser Asp Val Lys Ser Phe Val Tyr Leu Asp Ala Val
370 375 380
Ile Lys Glu Thr Leu Arg Leu Tyr Ser Pro Gly Pro Leu Gly Leu Pro
385 390 395 400
His Glu Ala Met Glu Asp Cys Thr Val Ala Gly Tyr His Val Pro Ala
405 410 415
Gly Thr Gln Leu Leu Phe Asn Gln Trp Lys Met His Gln Asp Pro Asn
420 425 430
Val Trp Glu Asp Pro Ser Glu Phe Lys Pro Glu Arg Phe Leu Thr Thr
435 440 445
His Lys Asp Ile Asp Phe Arg Gly Arg His Phe Glu Tyr Leu Pro Phe
450 455 460
Ala Ser Gly Arg Arg Ile Cys Pro Gly Ile Ser Phe Ala His Gln Ile
465 470 475 480
Leu Met Leu Ser Leu Ala Asn Met Leu His Gly Phe Asp Phe Thr Thr
485 490 495
Pro Asn Gly Glu Pro Val Asp Met Ala Gln Val Ser Gly Gly Thr Leu
500 505 510
Ile Arg Ala Thr Pro Leu Glu Ala Leu Ile Ser Pro Arg Leu Pro Gly
515 520 525
His Val Tyr Met Gly
530
<210> 7
<211> 530
<212> PRT
<213> Tripterygium wilfordii
<400> 7
Met Glu Phe Leu Leu Ser Ile Pro Ala Asn Thr Ile Ala Thr Gln Ile
1 5 10 15
Phe Ala Leu Leu Leu Leu Tyr Leu Cys Phe Arg Lys Phe Thr Asp Val
20 25 30
Leu Lys Pro Lys Gln Ser Lys Thr Ser Pro Pro Gln Val Gly Gly Ala
35 40 45
Trp Pro Leu Ile Gly His Leu His Arg Leu Arg Gly Pro Pro Ala Pro
50 55 60
His Ile Thr Leu Gly Lys Met Ala Asp Lys Tyr Gly Pro Ile Phe Lys
65 70 75 80
Ile Lys Leu Gly Leu His Pro Thr Leu Val Ile Ser Asn Ser Glu Ile
85 90 95
Ala Lys Glu Cys Leu Thr Thr His Asp Lys Val Leu Ala Gly Arg Pro
100 105 110
Ala Thr Val Ala Thr Glu Ile Met Ser Tyr Asn His Ala Met Phe Thr
115 120 125
Phe Ser Ser Tyr Gly Pro Tyr Trp Ser His Thr Arg Lys Leu Val Thr
130 135 140
Val Glu Leu Leu Ser Asn Lys Arg Leu Glu Thr Phe Lys His Ile Arg
145 150 155 160
Glu Ser Glu Val Lys Asn Ser Val Lys Glu Met Tyr Gln Ser Trp Val
165 170 175
His Asn Lys Thr Gly Asp Ser Asn Gln Val Leu Val Asp Met Thr Arg
180 185 190
Ile Phe Gly Asp Ile Ile Ala Asn Val Ile Tyr Arg Ile Val Val Gly
195 200 205
Lys Val Tyr Ala Ser Lys Gly Glu Gly His Val Arg Trp Lys Gln Val
210 215 220
Val Ser Glu Tyr Val Asn Leu Leu Ser His Phe Gly Val Gly Asp Ala
225 230 235 240
Leu Pro Phe Leu Arg Trp Leu Asp Leu Gly Gly Lys Glu Lys Ala Met
245 250 255
Lys Lys Thr Ala Lys Glu Leu Asp Asn Tyr Val Glu Glu Trp Leu Gln
260 265 270
Glu His Lys Lys Lys Arg Ser Ser Ala Gly Asp His Gly Ile Val Glu
275 280 285
Glu Asp Phe Met Asp Val Met Leu Ser Ile Phe Tyr Asp Asp Asp Gln
290 295 300
Glu Glu Ser Phe Ala Asp His Ser Ala His Thr Ile Asn Lys Ala Leu
305 310 315 320
Cys Leu Ser Leu Ile Leu Ala Ala Ser Asp Thr Thr Lys Thr Thr Leu
325 330 335
Thr Trp Val Leu Ser Leu Leu Leu Asn His Arg Asp Ile Leu Asn Lys
340 345 350
Val Gln Gln Glu Leu Ile Ala His Ile Gly Pro Glu Thr Pro Val Asn
355 360 365
Glu Ser Asp Ile Lys Ser Phe Val Tyr Leu Glu Ala Val Ile Lys Glu
370 375 380
Thr Leu Arg Leu Tyr Pro Pro Gly Pro Leu Gly Leu Pro His Glu Ser
385 390 395 400
Met Glu Asp Cys Thr Ile Ala Gly Tyr His Val Pro Ala Gly Thr Arg
405 410 415
Val Leu Phe Asn Gln Trp Lys Ile His His Asp Pro Gln Val Trp Glu
420 425 430
Asn Pro Ser Glu Phe Lys Pro Glu Arg Phe Leu Arg Thr His Lys Glu
435 440 445
Val Asp Val Arg Gly Arg His Phe Glu Leu Leu Pro Phe Gly Ser Gly
450 455 460
Arg Arg Met Cys Pro Gly Ile Ser Phe Ala Leu Gln Val Met Glu Leu
465 470 475 480
Ala Leu Ala Asn Met Leu His Gly Phe Asp Phe Ala Thr Pro Asn Gly
485 490 495
Glu Pro Val Asp Met Thr Glu Asp Asn Gly Phe Val Thr Leu Arg Ala
500 505 510
Thr Pro Leu Glu Ala Leu Ile Ser Pro Arg Leu Pro Gly His Val Tyr
515 520 525
Met Gly
530
<210> 8
<211> 134
<212> PRT
<213> Tripterygium wilfordii
<400> 8
Met Ala Ser Asp Arg Lys Ile Tyr Met Phe Lys Glu Val Glu Thr His
1 5 10 15
Asn Lys Thr Lys Asp Cys Trp Leu Ile Ile Ser Gly Lys Val Tyr Asp
20 25 30
Val Thr Pro Phe Met Glu Asp His Pro Gly Gly Asp Glu Val Leu Leu
35 40 45
Ser Ser Thr Gly Lys Asp Ala Thr Asn Asp Phe Glu Asp Val Gly His
50 55 60
Ser Asp Asn Ala Arg Asp Met Met Asp Gln Tyr Cys Ile Gly Glu Ile
65 70 75 80
Asp Gly Lys Thr Val Pro Glu Lys Arg Asn Tyr Ile Pro Ala Gln Thr
85 90 95
Pro Ala Tyr Asn Gln Asp Lys Thr Pro Glu Phe Val Val Lys Val Leu
100 105 110
Gln Phe Leu Val Pro Leu Leu Ile Leu Gly Leu Ala Phe Ala Val Arg
115 120 125
His Phe Thr Lys Lys Glu
130
<210> 9
<211> 708
<212> PRT
<213> Tripterygium wilfordii
<400> 9
Met Gln Ser Ser Ser Asn Ser Met Lys Ala Ser Pro Leu Asp Leu Met
1 5 10 15
Ser Ala Ile Ile Lys Gly Lys Val Asp Pro Ser Asn Val Ser Ser Glu
20 25 30
Val Ser Gly Glu Val Thr Ser Ile Ile Phe Glu Asn Arg Glu Phe Val
35 40 45
Met Ile Leu Thr Thr Ser Ile Ala Val Leu Ile Gly Cys Val Val Val
50 55 60
Leu Ile Trp Arg Arg Ser Gly Ala Gln Lys Ser Lys Ala Leu Val Pro
65 70 75 80
Pro Lys Pro Leu Ala Val Lys Leu Pro Glu Pro Glu Val Asp Asp Gly
85 90 95
Lys Ser Lys Ile Thr Val Phe Tyr Gly Thr Gln Thr Gly Thr Ala Glu
100 105 110
Gly Phe Ala Lys Ala Leu Val Glu Glu Ala Lys Ala Arg Tyr Glu Lys
115 120 125
Ala Val Phe Lys Ile Val Asp Leu Asp Asp Tyr Ala Glu Asp Asp Asp
130 135 140
Glu Tyr Glu Glu Lys Leu Lys Lys Glu Lys Leu Ala Ile Phe Phe Leu
145 150 155 160
Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg Phe Tyr
165 170 175
Lys Trp Phe Leu Glu Gly Lys Glu Arg Gly Glu Cys Phe Gln Asn Met
180 185 190
Lys Phe Ala Val Phe Gly Leu Gly Asn Arg Gln Tyr Glu His Phe Asn
195 200 205
Lys Val Ala Lys Glu Val Asp Gln Ile Leu Ser Glu Gln Gly Ala Thr
210 215 220
Arg Leu Val Pro Val Gly Leu Gly Asp Asp Asp Gln Cys Leu Glu Asp
225 230 235 240
Asp Phe Thr Ala Trp Arg Glu Leu Val Trp Pro Glu Leu Asp Gln Leu
245 250 255
Leu Arg Asp Lys Asp Gly Ala Thr Thr Val Ser Thr Pro Tyr Thr Ala
260 265 270
Thr Ile Pro Glu Tyr Arg Val Lys Cys Tyr Asp Thr Ser Asp Ala Ser
275 280 285
Val Glu Glu Lys Ser Trp Ser Asn Ala Asn Gly His Ala Val Val Asp
290 295 300
Ala Gln His Pro Cys Arg Ser Asn Val Ala Val Lys Arg Glu Leu His
305 310 315 320
Thr Pro Ala Ser Asp Arg Ser Cys Thr His Leu Glu Phe Asp Ile Ala
325 330 335
Gly Thr Gly Leu Ser Tyr Glu Thr Gly Asp His Val Gly Val Tyr Cys
340 345 350
Glu Asn Leu Thr Glu Thr Val Glu Glu Ala Val Arg Leu Leu Gly Leu
355 360 365
Ser Pro Asp Thr Tyr Phe Ser Leu His Ser Asp Lys Glu Asp Gly Thr
370 375 380
Pro Leu Ser Ala Ser Ser Leu Pro Pro Thr Phe Pro Pro Cys Ser Leu
385 390 395 400
Lys Thr Ala Leu Ala Arg Tyr Ala Asp Leu Leu Asn Ser Pro Lys Lys
405 410 415
Ser Ala Leu Leu Ala Leu Ala Ala His Ala Ser Asp Pro Thr Glu Ala
420 425 430
Asp Arg Leu Arg His Leu Ala Ser Pro Ala Gly Lys Asp Glu Tyr Ala
435 440 445
Gln Trp Val Ile Ala Ser Gln Arg Ser Leu Leu Glu Ile Met Ala Glu
450 455 460
Phe Pro Ser Ala Arg Pro Pro Leu Gly Val Phe Phe Ala Ala Val Ala
465 470 475 480
Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro Arg Met
485 490 495
Ala Pro Ser Arg Ile His Val Thr Cys Ala Leu Val Tyr Glu Lys Thr
500 505 510
Pro Ala Gly Arg Ile His Lys Gly Val Cys Ser Thr Trp Met Lys Asn
515 520 525
Ala Val Pro Leu Glu Lys Ser His Glu Ser Cys Trp Ala Pro Ile Phe
530 535 540
Val Arg Gln Ser Asn Phe Lys Leu Pro Val Asp Thr Lys Val Pro Ile
545 550 555 560
Ile Met Ile Gly Pro Gly Thr Gly Phe Ala Pro Phe Arg Gly Phe Leu
565 570 575
Gln Glu Arg Leu Ala Leu Lys Glu Ser Gly Ala Glu Leu Gly Ser Ser
580 585 590
Ile Leu Phe Phe Gly Cys Arg Asn Arg Arg Leu Asp Tyr Ile Tyr Glu
595 600 605
Glu Glu Leu Asn Asn Phe Val Glu Ser Ala Ala Leu Ser Glu Leu Ile
610 615 620
Val Ala Phe Ser Arg Glu Gly Pro Thr Lys Glu Tyr Val Gln His Lys
625 630 635 640
Met Met Glu Lys Ala Ser Asp Ile Trp Asn Met Ile Asn Gln Gly Ala
645 650 655
Tyr Ile Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Arg Asp Val His
660 665 670
Arg Thr Leu His Thr Ile Val Gln Glu Gln Gly Ser Leu Asp Ser Ser
675 680 685
Lys Ala Glu Ser Met Val Lys Asn Leu Gln Thr Ser Gly Arg Tyr Leu
690 695 700
Arg Asp Val Trp
705
<210> 10
<211> 38
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer CO_TwCYP71BE85v1_TEF-F
<220>
<221> misc_feature
<222> (10)..(10)
<223> A, C, G or T
<400> 10
agcgatacgn aaaatggact tattgcaatt tccatctg 38
<210> 11
<211> 29
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer CO_TwCYP71BE85v1_TEF-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 11
cacgcgantc agttaaatgc gggtgatgg 29
<210> 12
<211> 34
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer CO_TwGA3OX1_TEF-F
<220>
<221> misc_feature
<222> (10)..(10)
<223> A, C, G or T
<400> 12
agcgatacgn aaaatgagtc ctccgcctac aata 34
<210> 13
<211> 32
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer CO_TwGA3OX1_TEF-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 13
cacgcgantt aaatacctaa aagcgagacg gg 32
<210> 14
<211> 38
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer CO_AcoUGT2_TEF-F
<220>
<221> misc_feature
<222> (10)..(10)
<223> A, C, G or T
<400> 14
agcgatacgn aaaatggctg ttagcttaaa aaataccg 38
<210> 15
<211> 30
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer CO_AcoUGT2_TEF-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 15
cacgcgantt aacgactgat atgagcgacg 30
<210> 16
<211> 35
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer CO_TwCYP82D213_PGK-F
<220>
<221> misc_feature
<222> (10)..(10)
<223> A, C, G or T
<400> 16
atcaacgggn aaaatggaat tccttctgtc attgc 35
<210> 17
<211> 32
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer CO_TwCYP82D213_PGK-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 17
cgtgcganct aacccatgta aagatgtgat gg 32
<210> 18
<211> 38
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer CO_TwCYP71BE86_PGK-F
<220>
<221> misc_feature
<222> (10)..(10)
<223> A, C, G or T
<400> 18
atcaacgggn aaaatggact tacaattacc tagcttcc 38
<210> 19
<211> 33
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer CO_TwCYP71BE86_PGK-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 19
cgtgcgantt aaccagataa actacgatat ggg 33
<210> 20
<211> 40
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer TwCYP82D217_pLife-F
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 20
ggcttaanaa gcatcttctc tcctaactag ctttctaaat 40
<210> 21
<211> 35
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer TwCYP82D217_pLife-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 21
ggtttaanct attgcaattc accccatgta gacaa 35
<210> 22
<211> 29
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer pLifeUP_TEF-F
<220>
<221> misc_feature
<222> (10)..(10)
<223> A, C, G or T
<400> 22
agcgatacgn gacctgcagg ctgaggctt 29
<210> 23
<211> 27
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer pLife_TEF-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 23
cacgcgancc cggggctgag gtttaat 27
<210> 24
<211> 35
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer TwCYP82D274v1_pLife-F
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G, or T
<400> 24
ggcttaanat ggagtttctt ctttcactcc caaca 35
<210> 25
<211> 35
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer TwCYP82D274v1_pLife-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 25
ggtttaantc agcccatata gagatgagct gggag 35
<210> 26
<211> 31
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer pLife_TEF-F
<220>
<221> misc_feature
<222> (10)..(10)
<223> A, C, G or T
<400> 26
agcgatacgn tgcaggctga ggcttaatat g 31
<210> 27
<211> 27
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer pLife_TEF-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 27
cacgcgancc cggggctgag gtttaat 27
<210> 28
<211> 36
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer TwCPR1_pLife-F
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 28
ggcttaanat gcaatcttct tcaaattcta tgaagg 36
<210> 29
<211> 29
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer TwCPR1_pLife-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 29
ggtttaantt accacacatc ccggagata 29
<210> 30
<211> 31
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer pLife_PGK-F
<220>
<221> misc_feature
<222> (10)..(10)
<223> A, C, G or T
<400> 30
atcaacgggn tgcaggctga ggcttaatat g 31
<210> 31
<211> 27
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer pLife_PGK-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 31
cgtgcgancc cggggctgag gtttaat 27
<210> 32
<211> 29
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer Twb5#1_pLife-F
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 32
ggcttaanat ggcttcggat cggaagata 29
<210> 33
<211> 34
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer Twb5#1_pLife-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 33
ggtttaanct attctttctt ggtgaagtga cgta 34
<210> 34
<211> 31
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer pLife_PGK-F
<220>
<221> misc_feature
<222> (10)..(10)
<223> A, C, G or T
<400> 34
atcaacgggn tgcaggctga ggcttaatat g 31
<210> 35
<211> 27
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer pLife_PGK-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 35
cgtgcgancc cggggctgag gtttaat 27
<210> 36
<211> 29
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer Twb5#2_pLife-F
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 36
ggcttaanat gggtggagac ggaaaggtt 29
<210> 37
<211> 32
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer Twb5#2_pLife-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 37
ggtttaantt aagcaggagg agctgatttg gt 32
<210> 38
<211> 31
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer pLife_PGK-F
<220>
<221> misc_feature
<222> (10)..(10)
<223> A, C, G or T
<400> 38
atcaacgggn tgcaggctga ggcttaatat g 31
<210> 39
<211> 27
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer pLife_PGK-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 39
cgtgcgancc cggggctgag gtttaat 27
<210> 40
<211> 31
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer Twb5#3_pLife-F
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 40
ggcttaanat ggctggtcag agagttttca c 31
<210> 41
<211> 33
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer Twb5#3_pLife-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G, or T
<400> 41
ggtttaantt agaagatctg ctcaggcctt gta 33
<210> 42
<211> 31
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer pLife_PGK-F
<220>
<221> misc_feature
<222> (10)..(10)
<223> A, C, G or T
<400> 42
atcaacgggn tgcaggctga ggcttaatat g 31
<210> 43
<211> 27
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer pLife_PGK-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 43
cgtgcgancc cggggctgag gtttaat 27
<210> 44
<211> 40
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer Twb5#4_PGK-F
<220>
<221> misc_feature
<222> (10)..(10)
<223> A, C, G or T
<400> 44
atcaacgggn aaaatggcta aacttctttc atttgctgag 40
<210> 45
<211> 36
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer Twb5#4_PGK-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 45
cgtgcgantt agaaaaggta tcgcaaacca aatgcc 36
<210> 46
<211> 38
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer Twb5#5_PGK-F
<220>
<221> misc_feature
<222> (10)..(10)
<223> A, C, G or T
<400> 46
atcaacgggn aaaatgatta ttgttgcggt ggctctga 38
<210> 47
<211> 42
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer Twb5#5_PGK-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<400> 47
cgtgcgantt acttctctag atccccaatg taaaaatcat cg 42
<210> 48
<211> 37
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer Twb5#6_PGK-F
<220>
<221> misc_feature
<222> (10)..(10)
<223> A, C, G or T
<220>
<221> misc_feature
<222> (10)..(10)
<223> n is a, c, g, or t
<400> 48
atcaacgggn aaaatgccga ctttaacgaa gctgcac 37
<210> 49
<211> 34
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer Twb5#6_PGK-R
<220>
<221> misc_feature
<222> (8)..(8)
<223> A, C, G or T
<220>
<221> misc_feature
<222> (8)..(8)
<223> n is a, c, g, or t
<400> 49
cgtgcganct acttcttccg caagtacagg agtc 34
<210> 50
<211> 27
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer YEA85_UP_Genotyping_Fw
<400> 50
tctcaggtat agcatgaggt cgctcat 27
<210> 51
<211> 28
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> primer YEA86_DW_Genotyping_Fw
<400> 51
cctgcaggac tagtgctgag gcattaat 28
<210> 52
<211> 20
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> primer YEA87_X-2_Genotyping_UP
<400> 52
gtttgtagtt ggcggtggag 20
<210> 53
<211> 20
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> primer YEA88_X-2_Genotyping_DW
<400> 53
gagacaagat ggggcaagac 20
<210> 54
<211> 20
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> primer YEA89_X-3_Genotyping_UP
<400> 54
tgacgaatcg ttaggcacag 20
<210> 55
<211> 21
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> primer YEA90_X-3_Genotyping_DW
<400> 55
ccgtgcaata ccaaaatcga g 21
<210> 56
<211> 20
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> primer YEA91_X-4_Genotyping_UP
<400> 56
ctcacaaagg gacgaatcct 20
<210> 57
<211> 19
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> primer YEA92_X-4_Genotyping_DW
<400> 57
gacggtacgt tgaccagag 19
<210> 58
<211> 23
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> primer YEA93_XI-1_Genotyping_UP
<400> 58
cttaatgggt agtgcttgac acg 23
<210> 59
<211> 20
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> primer YEA94_XI-2_Genotyping_UP
<400> 59
gtttgtagtt ggcggtggag 20
<210> 60
<211> 20
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> primer YEA95_XI-2_Genotyping_DW
<400> 60
gagacaagat ggggcaagac 20
<210> 61
<211> 25
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> primer YEA96_XI-5_Genotyping_UP
<400> 61
ctcaatgatc aaaatcctga atgca 25
<210> 62
<211> 20
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> primer YEA97_XI-5_Genotyping_DW
<400> 62
gcatggtcac cgctatcagc 20
<210> 63
<211> 19
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> primer YEA98_XII-2_Genotyping_UP
<400> 63
cgaagaaggc ctgcaattc 19
<210> 64
<211> 19
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> primer YEA99_XII-2_Genotyping_DW
<400> 64
ggccctgata aggttgttg 19
<210> 65
<211> 20
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer YEA100_XII-5_Genotyping_UP
<400> 65
ccaccgaagt tgatttgctt 20
<210> 66
<211> 20
<212> DNA
<213> 人工(Artificial)
<220>
<221>
<222>
<223> Primer YEA101_XII-5_Genotyping_DW
<400> 66
gtgggagtaa gggatcctgt 20
<210> 67
<211> 786
<212> PRT
<213> Plectranthus barbatus
<400> 67
Met Gly Ser Leu Ser Thr Met Asn Leu Asn His Ser Pro Met Ser Tyr
1 5 10 15
Ser Gly Ile Leu Pro Ser Ser Ser Ala Lys Ala Lys Leu Leu Leu Pro
20 25 30
Gly Cys Phe Ser Ile Ser Ala Trp Met Asn Asn Gly Lys Asn Leu Asn
35 40 45
Cys Gln Leu Thr His Lys Lys Ile Ser Lys Val Ala Glu Ile Arg Val
50 55 60
Ala Thr Val Asn Ala Pro Pro Val His Asp Gln Asp Asp Ser Thr Glu
65 70 75 80
Asn Gln Cys His Asp Ala Val Asn Asn Ile Glu Asp Pro Ile Glu Tyr
85 90 95
Ile Arg Thr Leu Leu Arg Thr Thr Gly Asp Gly Arg Ile Ser Val Ser
100 105 110
Pro Tyr Asp Thr Ala Trp Val Ala Leu Ile Lys Asp Leu Gln Gly Arg
115 120 125
Asp Ala Pro Glu Phe Pro Ser Ser Leu Glu Trp Ile Ile Gln Asn Gln
130 135 140
Leu Ala Asp Gly Ser Trp Gly Asp Ala Lys Phe Phe Cys Val Tyr Asp
145 150 155 160
Arg Leu Val Asn Thr Ile Ala Cys Val Val Ala Leu Arg Ser Trp Asp
165 170 175
Val His Ala Glu Lys Val Glu Arg Gly Val Arg Tyr Ile Asn Glu Asn
180 185 190
Val Glu Lys Leu Arg Asp Gly Asn Glu Glu His Met Thr Cys Gly Phe
195 200 205
Glu Val Val Phe Pro Ala Leu Leu Gln Arg Ala Lys Ser Leu Gly Ile
210 215 220
Gln Asp Leu Pro Tyr Asp Ala Pro Val Ile Gln Glu Ile Tyr His Ser
225 230 235 240
Arg Glu Gln Lys Ser Lys Arg Ile Pro Leu Glu Met Met His Lys Val
245 250 255
Pro Thr Ser Leu Leu Phe Ser Leu Glu Gly Leu Glu Asn Leu Glu Trp
260 265 270
Asp Lys Leu Leu Lys Leu Gln Ser Ala Asp Gly Ser Phe Leu Thr Ser
275 280 285
Pro Ser Ser Thr Ala Phe Ala Phe Met Gln Thr Arg Asp Pro Lys Cys
290 295 300
Tyr Gln Phe Ile Lys Asn Thr Ile Gln Thr Phe Asn Gly Gly Ala Pro
305 310 315 320
His Thr Tyr Pro Val Asp Val Phe Gly Arg Leu Trp Ala Ile Asp Arg
325 330 335
Leu Gln Arg Leu Gly Ile Ser Arg Phe Phe Glu Ser Glu Ile Ala Asp
340 345 350
Cys Ile Ala His Ile His Arg Phe Trp Thr Glu Lys Gly Val Phe Ser
355 360 365
Gly Arg Glu Ser Glu Phe Cys Asp Ile Asp Asp Thr Ser Met Gly Val
370 375 380
Arg Leu Met Arg Met His Gly Tyr Asp Val Asp Pro Asn Val Leu Lys
385 390 395 400
Asn Phe Lys Lys Asp Asp Lys Phe Ser Cys Tyr Gly Gly Gln Met Ile
405 410 415
Glu Ser Pro Ser Pro Ile Tyr Asn Leu Tyr Arg Ala Ser Gln Leu Arg
420 425 430
Phe Pro Gly Glu Gln Ile Leu Glu Asp Ala Asn Lys Phe Ala Tyr Asp
435 440 445
Phe Leu Gln Glu Lys Leu Ala His Asn Gln Ile Leu Asp Lys Trp Val
450 455 460
Ile Ser Lys His Leu Pro Asp Glu Ile Lys Leu Gly Leu Glu Met Pro
465 470 475 480
Trp Tyr Ala Thr Leu Pro Arg Val Glu Ala Arg Tyr Tyr Ile Gln Tyr
485 490 495
Tyr Ala Gly Ser Gly Asp Val Trp Ile Gly Lys Thr Leu Tyr Arg Met
500 505 510
Pro Glu Ile Ser Asn Asp Thr Tyr His Glu Leu Ala Lys Thr Asp Phe
515 520 525
Lys Arg Cys Gln Ala Gln His Gln Phe Glu Trp Ile Tyr Met Gln Glu
530 535 540
Trp Tyr Glu Ser Cys Asn Met Glu Glu Phe Gly Ile Ser Arg Lys Glu
545 550 555 560
Leu Leu Val Ala Tyr Phe Leu Ala Thr Ala Ser Ile Phe Glu Leu Glu
565 570 575
Arg Ala Asn Glu Arg Ile Ala Trp Ala Lys Ser Gln Ile Ile Ser Thr
580 585 590
Ile Ile Ala Ser Phe Phe Asn Asn Gln Asn Thr Ser Pro Glu Asp Lys
595 600 605
Leu Ala Phe Leu Thr Asp Phe Lys Asn Gly Asn Ser Thr Asn Met Ala
610 615 620
Leu Val Thr Leu Thr Gln Phe Leu Glu Gly Phe Asp Arg Tyr Thr Ser
625 630 635 640
His Gln Leu Lys Asn Ala Trp Ser Val Trp Leu Arg Lys Leu Gln Gln
645 650 655
Gly Glu Gly Asn Gly Gly Ala Asp Ala Glu Leu Leu Val Asn Thr Leu
660 665 670
Asn Ile Cys Ala Gly His Ile Ala Phe Arg Glu Glu Ile Leu Ala His
675 680 685
Asn Asp Tyr Lys Thr Leu Ser Asn Leu Thr Ser Lys Ile Cys Arg Gln
690 695 700
Leu Ser Gln Ile Gln Asn Glu Lys Glu Leu Glu Thr Glu Gly Gln Lys
705 710 715 720
Thr Ser Ile Lys Asn Lys Glu Leu Glu Glu Asp Met Gln Arg Leu Val
725 730 735
Lys Leu Val Leu Glu Lys Ser Arg Val Gly Ile Asn Arg Asp Met Lys
740 745 750
Lys Thr Phe Leu Ala Val Val Lys Thr Tyr Tyr Tyr Lys Ala Tyr His
755 760 765
Ser Ala Gln Ala Ile Asp Asn His Met Phe Lys Val Leu Phe Glu Pro
770 775 780
Val Ala
785
<210> 68
<211> 598
<212> PRT
<213> Plectranthus barbatus
<400> 68
Met Ser Ser Leu Ala Gly Asn Leu Arg Val Ile Pro Phe Ser Gly Asn
1 5 10 15
Arg Val Gln Thr Arg Thr Gly Ile Leu Pro Val His Gln Thr Pro Met
20 25 30
Ile Thr Ser Lys Ser Ser Ala Ala Val Lys Cys Ser Leu Thr Thr Pro
35 40 45
Thr Asp Leu Met Gly Lys Ile Lys Glu Val Phe Asn Arg Glu Val Asp
50 55 60
Thr Ser Pro Ala Ala Met Thr Thr His Ser Thr Asp Ile Pro Ser Asn
65 70 75 80
Leu Cys Ile Ile Asp Thr Leu Gln Arg Leu Gly Ile Asp Gln Tyr Phe
85 90 95
Gln Ser Glu Ile Asp Ala Val Leu His Asp Thr Tyr Arg Leu Trp Gln
100 105 110
Leu Lys Lys Lys Asp Ile Phe Ser Asp Ile Thr Thr His Ala Met Ala
115 120 125
Phe Arg Leu Leu Arg Val Lys Gly Tyr Glu Val Ala Ser Asp Glu Leu
130 135 140
Ala Pro Tyr Ala Asp Gln Glu Arg Ile Asn Leu Gln Thr Ile Asp Val
145 150 155 160
Pro Thr Val Val Glu Leu Tyr Arg Ala Ala Gln Glu Arg Leu Thr Glu
165 170 175
Glu Asp Ser Thr Leu Glu Lys Leu Tyr Val Trp Thr Ser Ala Phe Leu
180 185 190
Lys Gln Gln Leu Leu Thr Asp Ala Ile Pro Asp Lys Lys Leu His Lys
195 200 205
Gln Val Glu Tyr Tyr Leu Lys Asn Tyr His Gly Ile Leu Asp Arg Met
210 215 220
Gly Val Arg Arg Asn Leu Asp Leu Tyr Asp Ile Ser His Tyr Lys Ser
225 230 235 240
Leu Lys Ala Ala His Arg Phe Tyr Asn Leu Ser Asn Glu Asp Ile Leu
245 250 255
Ala Phe Ala Arg Gln Asp Phe Asn Ile Ser Gln Ala Gln His Gln Lys
260 265 270
Glu Leu Gln Gln Leu Gln Arg Trp Tyr Ala Asp Cys Arg Leu Asp Thr
275 280 285
Leu Lys Phe Gly Arg Asp Val Val Arg Ile Gly Asn Phe Leu Thr Ser
290 295 300
Ala Met Ile Gly Asp Pro Glu Leu Ser Asp Leu Arg Leu Ala Phe Ala
305 310 315 320
Lys His Ile Val Leu Val Thr Arg Ile Asp Asp Phe Phe Asp His Gly
325 330 335
Gly Pro Lys Glu Glu Ser Tyr Glu Ile Leu Glu Leu Val Lys Glu Trp
340 345 350
Lys Glu Lys Pro Ala Gly Glu Tyr Val Ser Glu Glu Val Glu Ile Leu
355 360 365
Phe Thr Ala Val Tyr Asn Thr Val Asn Glu Leu Ala Glu Met Ala His
370 375 380
Ile Glu Gln Gly Arg Ser Val Lys Asp Leu Leu Val Lys Leu Trp Val
385 390 395 400
Glu Ile Leu Ser Val Phe Arg Ile Glu Leu Asp Thr Trp Thr Asn Asp
405 410 415
Thr Ala Leu Thr Leu Glu Glu Tyr Leu Ser Gln Ser Trp Val Ser Ile
420 425 430
Gly Cys Arg Ile Cys Ile Leu Ile Ser Met Gln Phe Gln Gly Val Lys
435 440 445
Leu Ser Asp Glu Met Leu Gln Ser Glu Glu Cys Thr Asp Leu Cys Arg
450 455 460
Tyr Val Ser Met Val Asp Arg Leu Leu Asn Asp Val Gln Thr Phe Glu
465 470 475 480
Lys Glu Arg Lys Glu Asn Thr Gly Asn Ser Val Ser Leu Leu Gln Ala
485 490 495
Ala His Lys Asp Glu Arg Val Ile Asn Glu Glu Glu Ala Cys Ile Lys
500 505 510
Val Lys Glu Leu Ala Glu Tyr Asn Arg Arg Lys Leu Met Gln Ile Val
515 520 525
Tyr Lys Thr Gly Thr Ile Phe Pro Arg Lys Cys Lys Asp Leu Phe Leu
530 535 540
Lys Ala Cys Arg Ile Gly Cys Tyr Leu Tyr Ser Ser Gly Asp Glu Phe
545 550 555 560
Thr Ser Pro Gln Gln Met Met Glu Asp Met Lys Ser Leu Val Tyr Glu
565 570 575
Pro Leu Pro Ile Ser Pro Pro Glu Ala Asn Asn Ala Ser Gly Glu Lys
580 585 590
Met Ser Cys Val Ser Asn
595
<210> 69
<211> 807
<212> PRT
<213> Tripterygium wilfordii
<400> 69
Met His Ser Leu Leu Met Lys Lys Val Ile Met Tyr Ser Ser Gln Thr
1 5 10 15
Thr His Val Phe Pro Ser Pro Leu His Cys Thr Ile Pro Lys Ser Ser
20 25 30
Ser Phe Phe Leu Asp Ala Pro Val Ala Arg Leu His Cys Leu Ser Gly
35 40 45
His Gly Ala Lys Lys Lys Arg Leu His Phe Asp Ile Gln Gln Gly Arg
50 55 60
Asn Ala Val Ser Lys Thr His Thr Pro Asp Asp Leu Tyr Ala Lys Gln
65 70 75 80
Glu Tyr Ser Val Pro Glu Ile Val Lys Asp Asp Asp Lys Glu Glu Glu
85 90 95
Val Val Lys Ile Lys Glu His Val Asp Ile Ile Lys Ser Met Leu Ser
100 105 110
Ser Met Glu Asp Gly Glu Ile Ser Ile Ser Ala Tyr Asp Thr Ala Trp
115 120 125
Val Ala Leu Ile Gln Asp Ile His Asn Asn Gly Ala Pro Gln Phe Pro
130 135 140
Ser Ser Leu Leu Trp Ile Ala Glu Asn Gln Leu Pro Asp Gly Ser Trp
145 150 155 160
Gly Asp Ser Arg Val Phe Leu Ala Phe Asp Arg Ile Ile Asn Thr Leu
165 170 175
Ala Cys Val Val Ala Leu Lys Ser Trp Asn Val His Pro Asp Lys Cys
180 185 190
Glu Arg Gly Ile Ser Phe Leu Lys Glu Asn Ile Ser Met Leu Glu Lys
195 200 205
Asp Asp Ser Glu His Met Leu Val Gly Phe Glu Phe Gly Phe Pro Val
210 215 220
Leu Leu Asp Met Ala Arg Arg Leu Gly Ile Asp Val Pro Asp Asp Ser
225 230 235 240
Pro Phe Leu Gln Glu Ile Tyr Val Gln Arg Asp Leu Lys Leu Lys Arg
245 250 255
Ile Pro Lys Asp Ile Leu His Asn Val Pro Thr Thr Leu Leu His Ser
260 265 270
Leu Glu Ala Ile Pro Asp Leu Asp Trp Thr Lys Leu Leu Lys Leu Gln
275 280 285
Cys Gln Asp Gly Ser Leu Leu Phe Ser Pro Ser Ser Thr Ala Met Ala
290 295 300
Phe Ile Asn Thr Lys Asp Glu Asn Cys Leu Arg Tyr Leu Asn Tyr Val
305 310 315 320
Val Gln Arg Phe Asn Gly Gly Ala Pro Thr Val Tyr Pro Tyr Asp Leu
325 330 335
Phe Glu His Asn Trp Ala Val Asp Arg Leu Gln Arg Leu Gly Ile Ser
340 345 350
Arg Phe Phe Gln Pro Glu Ile Arg Glu Cys Met Ser Tyr Val Tyr Arg
355 360 365
Tyr Trp Thr Lys Asp Gly Ile Phe Cys Thr Arg Asn Ser Arg Val His
370 375 380
Asp Val Asp Asp Thr Ala Met Gly Phe Arg Leu Leu Arg Leu His Gly
385 390 395 400
Tyr Glu Val His Pro Asp Ala Phe Arg Gln Phe Lys Lys Gly Cys Glu
405 410 415
Phe Ile Cys Tyr Glu Gly Gln Ser His Pro Thr Val Thr Val Met Tyr
420 425 430
Asn Leu Tyr Arg Ala Ser Gln Leu Met Phe Pro Glu Glu Lys Ile Leu
435 440 445
Asp Glu Ala Lys Gln Phe Thr Glu Lys Phe Leu Gly Glu Lys Arg Ser
450 455 460
Ala Asn Lys Leu Leu Asp Lys Trp Ile Ile Thr Lys Asp Leu Pro Gly
465 470 475 480
Glu Val Gly Phe Ala Leu Asp Val Pro Trp Tyr Ala Ser Leu Pro Arg
485 490 495
Val Glu Ala Arg Phe Phe Ile Gln His Tyr Gly Gly Glu Asp Asp Val
500 505 510
Trp Leu Asp Lys Ala Leu Tyr Arg Met Pro Tyr Val Asn Asn Asn Val
515 520 525
Tyr Leu Glu Leu Ala Lys Leu Asp Tyr Asn Tyr Cys Gln Ala Leu His
530 535 540
Arg Thr Glu Trp Gly Arg Ile Gln Lys Trp Tyr Glu Glu Cys Lys Pro
545 550 555 560
Arg Asp Phe Gly Ile Ser Arg Glu Cys Leu Leu Arg Ala Tyr Phe Met
565 570 575
Ala Ala Ala Ser Ile Phe Glu Pro Glu Arg Ser Met Glu Arg Leu Ala
580 585 590
Trp Ala Lys Thr Ala Ile Leu Leu Glu Ile Ile Val Ser Tyr Phe Ser
595 600 605
Glu Val Gly Asn Ser Thr Glu Gln Arg Ile Ala Phe Thr Thr Glu Phe
610 615 620
Ser Ile Arg Ala Ser Pro Met Gly Gly Tyr Ile Asn Gly Arg Lys Leu
625 630 635 640
Asp Lys Ile Gly Thr Thr Gln Glu Leu Ile Gln Met Leu Leu Ala Thr
645 650 655
Ile Asp Gln Phe Ser Gln Asp Ala Phe Ala Ala Tyr Gly His Asp Ile
660 665 670
Thr Arg His Leu His Asn Ser Trp Lys Met Trp Leu Leu Lys Trp Gln
675 680 685
Glu Glu Gly Asp Arg Trp Leu Gly Glu Ala Glu Leu Leu Ile Gln Thr
690 695 700
Ile Asn Leu Met Ala Asp His Lys Ile Ala Glu Lys Leu Phe Met Gly
705 710 715 720
His Thr Asn Tyr Glu Gln Leu Phe Ser Leu Thr Asn Lys Val Cys Tyr
725 730 735
Ser Leu Gly His His Glu Leu Gln Asn Asn Arg Glu Leu Glu His Asp
740 745 750
Met Gln Arg Leu Val Gln Leu Val Leu Thr Asn Ser Ser Asp Gly Ile
755 760 765
Asp Ser Asp Ile Lys Lys Thr Phe Leu Ala Val Ala Lys Arg Phe Tyr
770 775 780
Tyr Thr Ala Phe Val Asp Pro Glu Thr Val Asn Val His Ile Ala Lys
785 790 795 800
Val Leu Phe Glu Arg Val Asp
805
<210> 70
<211> 589
<212> PRT
<213> Tripterygium wilfordii
<400> 70
Met Ala Pro Leu Val Val Ser Leu Thr Ile Ser His Phe Val Ile Gln
1 5 10 15
Thr Gly Ser Thr Ala Leu His Tyr Ser Ala Leu Pro Glu Thr Arg Thr
20 25 30
Lys His Cys His Ser Ser Arg Pro Phe Ala Ser Ile Asn Ser Asn Ser
35 40 45
Leu Gln Met Asn Gln Arg Pro Leu Thr Asp Tyr Arg Pro Ala Ile Trp
50 55 60
Asn Pro Glu Leu Ile Asp Ser Leu Asn Thr Pro Tyr Ser Tyr Gln Ser
65 70 75 80
His Gly Thr Gln Leu Asp Lys Leu Arg Gln Asp Ala Lys Arg Leu Leu
85 90 95
Ser Ser Thr Ser Asp Pro Cys Leu Leu Leu Asn His Val Glu Ser Met
100 105 110
Gln Arg Leu Gly Ile Ala Tyr His Phe Gln Glu Glu Ile Asp Tyr Leu
115 120 125
Leu Asn Thr Arg Ile Gln Pro Tyr Ser Pro Asp Asp His Asp Leu His
130 135 140
Thr Thr Ala Leu Arg Phe Arg Ile Leu Arg Asp Asn Asn Phe Pro Ile
145 150 155 160
Ser Ser Asp Val Phe Gly Lys Phe Met Ser Arg Glu Gly Lys Phe Leu
165 170 175
Asp Ser Leu Ser Arg Asp Val Lys Gly Leu Leu Ser Leu Tyr Glu Ala
180 185 190
Ser Phe Leu Gly Val Asp Gly Glu Val Ile Leu Asp Glu Ala Lys Glu
195 200 205
Phe Ser Ser Lys Asn Leu Arg Ala Leu Leu Gly Arg Leu Glu Ser Thr
210 215 220
Ser Ile Asp Val Ala Glu Gln Val Lys Gln Ser Leu Gln Ile Pro Leu
225 230 235 240
Phe Trp Arg Met Pro Arg Val Glu Ala Arg Asn Phe Ile Asp Phe Tyr
245 250 255
Gln Lys Lys Asp Ala Lys Ser Ser Thr Leu Leu Glu Leu Ala Lys Leu
260 265 270
Asp Phe Asn Leu Val Gln Ser Thr Tyr Gln Gln Glu Leu Lys Glu Leu
275 280 285
Ser Lys Trp Trp Glu Asn Leu Gly Phe Lys Gln Lys Leu Ser Phe Thr
290 295 300
Arg Asp Arg Leu Met Gln Ser Tyr Phe Ser Thr Thr Gly Ile Thr Phe
305 310 315 320
Lys Pro Gln Phe Ser Lys Ala Arg Ile Ala Ala Thr Lys Phe Ile Asn
325 330 335
Ile Val Asn Thr Ile Asp Asp Ile His Asp Tyr Tyr Gly Ser Gln Asp
340 345 350
Asp Leu Lys Leu Phe Asp Ser Ala Val Lys Arg Trp Asp Leu Ala Ala
355 360 365
Met Glu Glu Leu Pro Asp Tyr Met Lys Ile Cys Tyr Phe Ala Met Tyr
370 375 380
Asn Leu Val Asn Glu Leu Ala Tyr Asp Val Leu Ile Asn Gln Gly Ile
385 390 395 400
Asp Val Leu Pro Cys Leu Arg Glu Ala Trp Thr Lys Phe Cys Gly Ala
405 410 415
Ala Phe Val Glu Ser Gln Trp Cys Tyr Thr Gly Tyr Thr Pro Ser Met
420 425 430
Asp Asp Tyr Leu Lys Asn Cys Trp Ile Ser Ile Gly Val His Gly Ser
435 440 445
Leu Asn Phe Ala Arg Ala His Gln Gln Gly Ser Arg Ser Pro Ile Ala
450 455 460
Asn Thr Pro Leu His Cys Leu Glu Asp Pro Leu Leu Tyr Trp Ser Ser
465 470 475 480
Val Ile Cys Arg Leu Asn Asn Asp Leu Ala Thr Phe Gln His Glu Ser
485 490 495
Lys Thr Gly Glu Val Val Ser Phe Val Lys Cys Tyr Met Val Glu Lys
500 505 510
Gly Val Ser Gln Glu Gln Ala Cys Asp Glu Ile Arg Glu Leu Ile Lys
515 520 525
His Ala Trp Lys Met Leu Asn Thr Glu Arg Arg Arg Ser Asp Leu Pro
530 535 540
Pro Leu Met Val Glu Met Cys Met Asp Thr Pro Lys Leu Ser Gln Cys
545 550 555 560
Leu Tyr Gln His Gly Asp Gly Phe Gly Val Ala Ile Asp Leu Thr Lys
565 570 575
Asp Val Met Ser Ser Leu Ile Phe Arg Gln Ile Pro Ile
580 585
<210> 71
<211> 793
<212> PRT
<213> Salvia miltiorrhiza
<400> 71
Met Ala Ser Leu Ser Ser Thr Ile Leu Ser Arg Ser Pro Ala Ala Arg
1 5 10 15
Arg Arg Ile Thr Pro Ala Ser Ala Lys Leu His Arg Pro Glu Cys Phe
20 25 30
Ala Thr Ser Ala Trp Met Gly Ser Ser Ser Lys Asn Leu Ser Leu Ser
35 40 45
Tyr Gln Leu Asn His Lys Lys Ile Ser Val Ala Thr Val Asp Ala Pro
50 55 60
Gln Val His Asp His Asp Gly Thr Thr Val His Gln Gly His Asp Ala
65 70 75 80
Val Lys Asn Ile Glu Asp Pro Ile Glu Tyr Ile Arg Thr Leu Leu Arg
85 90 95
Thr Thr Gly Asp Gly Arg Ile Ser Val Ser Pro Tyr Asp Thr Ala Trp
100 105 110
Val Ala Met Ile Lys Asp Val Glu Gly Arg Asp Gly Pro Gln Phe Pro
115 120 125
Ser Ser Leu Glu Trp Ile Val Gln Asn Gln Leu Glu Asp Gly Ser Trp
130 135 140
Gly Asp Gln Lys Leu Phe Cys Val Tyr Asp Arg Leu Val Asn Thr Ile
145 150 155 160
Ala Cys Val Val Ala Leu Arg Ser Trp Asn Val His Ala His Lys Val
165 170 175
Lys Arg Gly Val Thr Tyr Ile Lys Glu Asn Val Asp Lys Leu Met Glu
180 185 190
Gly Asn Glu Glu His Met Thr Cys Gly Phe Glu Val Val Phe Pro Ala
195 200 205
Leu Leu Gln Lys Ala Lys Ser Leu Gly Ile Glu Asp Leu Pro Tyr Asp
210 215 220
Ser Pro Ala Val Gln Glu Val Tyr His Val Arg Glu Gln Lys Leu Lys
225 230 235 240
Arg Ile Pro Leu Glu Ile Met His Lys Ile Pro Thr Ser Leu Leu Phe
245 250 255
Ser Leu Glu Gly Leu Glu Asn Leu Asp Trp Asp Lys Leu Leu Lys Leu
260 265 270
Gln Ser Ala Asp Gly Ser Phe Leu Thr Ser Pro Ser Ser Thr Ala Phe
275 280 285
Ala Phe Met Gln Thr Lys Asp Glu Lys Cys Tyr Gln Phe Ile Lys Asn
290 295 300
Thr Ile Asp Thr Phe Asn Gly Gly Ala Pro His Thr Tyr Pro Val Asp
305 310 315 320
Val Phe Gly Arg Leu Trp Ala Ile Asp Arg Leu Gln Arg Leu Gly Ile
325 330 335
Ser Arg Phe Phe Glu Pro Glu Ile Ala Asp Cys Leu Ser His Ile His
340 345 350
Lys Phe Trp Thr Asp Lys Gly Val Phe Ser Gly Arg Glu Ser Glu Phe
355 360 365
Cys Asp Ile Asp Asp Thr Ser Met Gly Met Arg Leu Met Arg Met His
370 375 380
Gly Tyr Asp Val Asp Pro Asn Val Leu Arg Asn Phe Lys Gln Lys Asp
385 390 395 400
Gly Lys Phe Ser Cys Tyr Gly Gly Gln Met Ile Glu Ser Pro Ser Pro
405 410 415
Ile Tyr Asn Leu Tyr Arg Ala Ser Gln Leu Arg Phe Pro Gly Glu Glu
420 425 430
Ile Leu Glu Asp Ala Lys Arg Phe Ala Tyr Asp Phe Leu Lys Glu Lys
435 440 445
Leu Ala Asn Asn Gln Ile Leu Asp Lys Trp Val Ile Ser Lys His Leu
450 455 460
Pro Asp Glu Ile Lys Leu Gly Leu Glu Met Pro Trp Leu Ala Thr Leu
465 470 475 480
Pro Arg Val Glu Ala Lys Tyr Tyr Ile Gln Tyr Tyr Ala Gly Ser Gly
485 490 495
Asp Val Trp Ile Gly Lys Thr Leu Tyr Arg Met Pro Glu Ile Ser Asn
500 505 510
Asp Thr Tyr His Asp Leu Ala Lys Thr Asp Phe Lys Arg Cys Gln Ala
515 520 525
Lys His Gln Phe Glu Trp Leu Tyr Met Gln Glu Trp Tyr Glu Ser Cys
530 535 540
Gly Ile Glu Glu Phe Gly Ile Ser Arg Lys Asp Leu Leu Leu Ser Tyr
545 550 555 560
Phe Leu Ala Thr Ala Ser Ile Phe Glu Leu Glu Arg Thr Asn Glu Arg
565 570 575
Ile Ala Trp Ala Lys Ser Gln Ile Ile Ala Lys Met Ile Thr Ser Phe
580 585 590
Phe Asn Lys Glu Thr Thr Ser Glu Glu Asp Lys Arg Ala Leu Leu Asn
595 600 605
Glu Leu Gly Asn Ile Asn Gly Leu Asn Asp Thr Asn Gly Ala Gly Arg
610 615 620
Glu Gly Gly Ala Gly Ser Ile Ala Leu Ala Thr Leu Thr Gln Phe Leu
625 630 635 640
Glu Gly Phe Asp Arg Tyr Thr Arg His Gln Leu Lys Asn Ala Trp Ser
645 650 655
Val Trp Leu Thr Gln Leu Gln His Gly Glu Ala Asp Asp Ala Glu Leu
660 665 670
Leu Thr Asn Thr Leu Asn Ile Cys Ala Gly His Ile Ala Phe Arg Glu
675 680 685
Glu Ile Leu Ala His Asn Glu Tyr Lys Ala Leu Ser Asn Leu Thr Ser
690 695 700
Lys Ile Cys Arg Gln Leu Ser Phe Ile Gln Ser Glu Lys Glu Met Gly
705 710 715 720
Val Glu Gly Glu Ile Ala Ala Lys Ser Ser Ile Lys Asn Lys Glu Leu
725 730 735
Glu Glu Asp Met Gln Met Leu Val Lys Leu Val Leu Glu Lys Tyr Gly
740 745 750
Gly Ile Asp Arg Asn Ile Lys Lys Ala Phe Leu Ala Val Ala Lys Thr
755 760 765
Tyr Tyr Tyr Arg Ala Tyr His Ala Ala Asp Thr Ile Asp Thr His Met
770 775 780
Phe Lys Val Leu Phe Glu Pro Val Ala
785 790
<210> 72
<211> 595
<212> PRT
<213> Salvia miltiorrhiza
<220>
<221> UNSURE
<222> (221)..(221)
<223> Xaa can be any naturally occurring amino acid
<400> 72
Met Ser Leu Ala Phe Asn Pro Ala Ala Thr Ala Phe Ser Gly Asn Gly
1 5 10 15
Ala Arg Ser Arg Arg Glu Asn Phe Pro Val Lys His Val Thr Val Arg
20 25 30
Gly Phe Pro Met Ile Thr Asn Lys Ser Ser Phe Ala Val Lys Cys Asn
35 40 45
Leu Thr Thr Thr Asp Leu Met Gly Lys Ile Ala Glu Lys Phe Lys Gly
50 55 60
Glu Asp Ser Asn Phe Pro Ala Ala Ala Ala Val Gln Pro Ala Ala Asp
65 70 75 80
Met Pro Ser Asn Leu Cys Ile Ile Asp Thr Leu Gln Arg Leu Gly Val
85 90 95
Asp Arg Tyr Phe Arg Ser Glu Ile Asp Thr Ile Leu Glu Asp Thr Tyr
100 105 110
Arg Leu Trp Gln Arg Lys Glu Arg Ala Ile Phe Ser Asp Thr Ala Ile
115 120 125
His Ala Met Ala Phe Arg Leu Leu Arg Val Lys Gly Tyr Glu Val Ser
130 135 140
Ser Glu Glu Leu Ala Pro Tyr Ala Asp Gln Glu His Val Asp Leu Gln
145 150 155 160
Thr Ile Glu Val Ala Thr Val Ile Glu Leu Tyr Arg Ala Ala Gln Glu
165 170 175
Arg Thr Gly Glu Asp Glu Ser Ser Leu Lys Lys Leu His Ala Trp Thr
180 185 190
Thr Thr Phe Leu Lys Gln Lys Leu Leu Thr Asn Ser Ile Pro Asp Lys
195 200 205
Lys Leu His Lys Leu Val Glu Tyr Tyr Leu Lys Asn Xaa His Gly Ile
210 215 220
Leu Asp Arg Met Gly Val Arg Gln Asn Leu Asp Leu Tyr Asp Ile Ser
225 230 235 240
Tyr Tyr Arg Thr Ser Lys Ala Ala Asn Arg Phe Ser Asn Leu Cys Ser
245 250 255
Glu Asp Phe Leu Ala Phe Ala Arg Gln Asp Phe Asn Ile Cys Gln Ala
260 265 270
Gln His Gln Lys Glu Leu Gln Gln Leu Gln Arg Trp Tyr Ala Asp Cys
275 280 285
Lys Leu Asp Thr Leu Lys Tyr Gly Arg Asp Val Val Arg Val Ala Asn
290 295 300
Phe Leu Thr Ser Ala Ile Ile Gly Asp Pro Glu Leu Ser Asp Val Arg
305 310 315 320
Ile Val Phe Ala Gln His Ile Val Leu Val Thr Arg Ile Asp Asp Phe
325 330 335
Phe Asp His Arg Gly Ser Arg Glu Glu Ser Tyr Lys Ile Leu Glu Leu
340 345 350
Ile Lys Glu Trp Lys Glu Lys Pro Ala Ala Glu Tyr Gly Ser Glu Glu
355 360 365
Val Glu Ile Leu Phe Thr Ala Val Tyr Asn Thr Val Asn Glu Leu Ala
370 375 380
Glu Arg Ala His Val Glu Gln Gly Arg Ser Val Lys Asp Phe Leu Ile
385 390 395 400
Lys Leu Trp Val Gln Ile Leu Ser Ile Phe Lys Arg Glu Leu Asp Thr
405 410 415
Trp Ser Asp Asp Thr Ala Leu Thr Leu Asp Asp Tyr Leu Ser Ala Ser
420 425 430
Trp Val Ser Ile Gly Cys Arg Ile Cys Ile Leu Met Ser Met Gln Phe
435 440 445
Ile Gly Ile Lys Leu Ser Asp Glu Met Leu Leu Ser Glu Glu Cys Ile
450 455 460
Asp Leu Cys Arg His Val Ser Met Val Asp Arg Leu Leu Asn Asp Val
465 470 475 480
Gln Thr Phe Glu Lys Glu Arg Lys Glu Asn Thr Gly Asn Ser Val Thr
485 490 495
Leu Leu Leu Ala Ala Asn Lys Asp Asp Ser Ser Phe Thr Glu Glu Glu
500 505 510
Ala Ile Arg Ile Ala Lys Glu Met Ala Glu Cys Asn Arg Arg Gln Leu
515 520 525
Met Gln Ile Val Tyr Lys Thr Gly Thr Ile Phe Pro Arg Gln Cys Lys
530 535 540
Asp Met Phe Leu Lys Val Cys Arg Ile Gly Cys Tyr Leu Tyr Ala Ser
545 550 555 560
Gly Asp Glu Phe Thr Ser Pro Gln Gln Met Met Glu Asp Met Lys Ser
565 570 575
Leu Val Tyr Glu Pro Leu Thr Ile His Pro Leu Val Ala Asn Asn Val
580 585 590
Arg Gly Lys
595
<210> 73
<211> 300
<212> PRT
<213> Synechococcus sp.
<400> 73
Met Val Val Ala Asp Ala His Thr Gln Gly Phe Ser Leu Ala Gln Tyr
1 5 10 15
Leu Gln Glu Gln Lys Thr Ile Val Glu Thr Ala Leu Asp Gln Ser Leu
20 25 30
Val Ile Thr Glu Pro Val Thr Ile Tyr Glu Ala Met Arg Tyr Ser Leu
35 40 45
Leu Ala Gly Gly Lys Arg Leu Arg Pro Ile Leu Cys Leu Ala Ala Cys
50 55 60
Glu Met Leu Gly Gly Thr Ala Ala Met Ala Met Asn Thr Ala Cys Ala
65 70 75 80
Leu Glu Met Ile His Thr Met Ser Leu Ile His Asp Asp Leu Pro Ala
85 90 95
Met Asp Asn Asp Asp Leu Arg Arg Gly Lys Pro Thr Asn His Lys Val
100 105 110
Tyr Gly Glu Asp Ile Ala Ile Leu Ala Gly Asp Ala Leu Leu Ser Tyr
115 120 125
Ala Phe Glu Tyr Val Ala Arg Thr Pro Asp Val Pro Ala Glu Arg Leu
130 135 140
Leu Gln Val Ile Val Arg Leu Gly Gln Ala Val Gly Ala Glu Gly Leu
145 150 155 160
Val Gly Gly Gln Val Val Asp Leu Glu Ser Glu Gly Lys Thr Asp Val
165 170 175
Ala Val Glu Thr Leu Asn Phe Ile His Thr His Lys Thr Gly Ala Leu
180 185 190
Leu Glu Val Cys Val Thr Ala Gly Ala Ile Leu Ala Gly Ala Lys Pro
195 200 205
Glu Glu Val Gln Leu Leu Ser Arg Tyr Ala Gln Asn Ile Gly Leu Ala
210 215 220
Phe Gln Ile Val Asp Asp Ile Leu Asp Ile Thr Ala Thr Ala Glu Glu
225 230 235 240
Leu Gly Lys Thr Ala Gly Lys Asp Leu Glu Ala Gln Lys Val Thr Tyr
245 250 255
Pro Ser Leu Trp Gly Ile Glu Lys Ser Gln Ala Glu Ala Gln Lys Leu
260 265 270
Val Ala Glu Ala Ile Ala Ser Leu Glu Pro Tyr Gly Glu Lys Ala Asn
275 280 285
Pro Leu Lys Ala Leu Ala Glu Tyr Ile Val Asn Arg
290 295 300
<210> 74
<211> 533
<212> PRT
<213> Tripterygium wilfordii
<400> 74
Met Glu Phe Leu Leu Ser Leu Pro Thr Asn Thr Ile Ala Pro Lys Ile
1 5 10 15
Phe Ala Val Leu Leu Leu Phe Ile Cys Leu Arg Ile Leu Thr Asn Val
20 25 30
Leu Lys Pro Lys Lys Ser Lys Thr Ser Pro Pro Gln Ala Ser Gly Ala
35 40 45
Trp Pro Leu Ile Gly His Leu Leu His Leu Arg Gly Pro Gln Ala Pro
50 55 60
His Ile Thr Leu Gly Lys Met Ala Asp Lys Tyr Gly Pro Ile Phe Lys
65 70 75 80
Ile Lys Leu Gly Val His Pro Thr Leu Val Ile Ser Asp Ser Glu Val
85 90 95
Ala Lys Glu Cys Leu Thr Thr His Asp Ile Ala Leu Ala Gly Arg Pro
100 105 110
Ala Thr Val Ala Met Glu Ile Met Gly Tyr Asn His Ala Met Phe Ala
115 120 125
Phe Ser Pro Tyr Gly Pro Tyr Trp Arg His Met Arg Lys Leu Ala Thr
130 135 140
Val Glu Leu Leu Ser Ala Gln Arg Leu Glu Thr Phe Lys His Ile Arg
145 150 155 160
Glu Ser Glu Leu Lys Arg Ser Met Lys Glu Met Tyr Gln Ser Trp Val
165 170 175
His Asn Lys Ser Gly Ser Gly Asp Ser Asn His Val Thr Val Asp Met
180 185 190
Thr Arg Ile Leu Gly Asp Ile Ile Ala Asn Val Ile Tyr Arg Met Val
195 200 205
Val Gly Lys Val Tyr Ala Ser Lys Gly Glu Glu Asp Ala Arg Trp Lys
210 215 220
Gln Val Val Trp Glu Tyr Ile Lys Leu Leu Ser His Phe Gly Val Gly
225 230 235 240
Asp Ala Leu Pro Phe Leu Arg Trp Leu Asp Leu Gly Gly Val Glu Lys
245 250 255
Ser Met Lys Lys Ala Ala Lys Glu Leu Asp Ile Tyr Val Glu Glu Trp
260 265 270
Leu Glu Glu His Lys Lys Lys Arg Ser Glu Arg Lys Ser Asp Asn Gly
275 280 285
Ile Val Glu Glu Asp Phe Met Asp Val Met Leu Ser Val Phe Asp Asp
290 295 300
Asp Asp Gln Leu Glu Asn Phe Ala His His Ser Ala His Thr Ile Asn
305 310 315 320
Lys Ala Met Cys Leu Ala Ile Ile Leu Ala Ala Ser Asp Thr Thr Lys
325 330 335
Thr Thr Leu Thr Trp Ala Leu Ser Leu Leu Leu Asn His Pro Asp Val
340 345 350
Met Lys Lys Val Gln Gln Glu Leu Ala Ala His Ile Gly Pro Asp Lys
355 360 365
Pro Val Lys Glu Ser Asp Val Lys Ser Leu Val Tyr Leu Glu Ala Val
370 375 380
Val Lys Glu Thr Leu Arg Leu Tyr Pro Pro Gly Pro Leu Gly Leu Pro
385 390 395 400
His Glu Ser Met Glu Asp Cys Thr Val Ala Gly Tyr His Val Pro Ser
405 410 415
Gly Thr Arg Ile Leu Tyr Asn Leu Trp Lys Ile Gln Gln Asp Pro Gln
420 425 430
Val Trp Glu Asn Pro Ser Glu Phe Lys Pro Asp Arg Phe Leu Thr Thr
435 440 445
His Lys Asp Val Asp Val Arg Gly Arg Asn Phe Glu Tyr Leu Pro Phe
450 455 460
Gly Ser Gly Arg Arg Met Cys Pro Gly Met Ser Phe Ala Leu Gln Val
465 470 475 480
Met Glu Val Ser Leu Ala Asn Met Leu His Gly Phe Asp Phe Ala Thr
485 490 495
Pro Asn Gly Lys Pro Val Asp Met Thr Glu Val Asn Gly Leu Val Thr
500 505 510
Asp Arg Ala Thr Pro Leu Glu Ala Leu Ile Thr Pro Arg Leu Pro Ala
515 520 525
His Leu Tyr Met Gly
530
<210> 75
<211> 533
<212> PRT
<213> Tripterygium wilfordii
<400> 75
Met Glu Phe Leu Leu Ser Leu Pro Thr Asn Thr Ile Ala Thr Thr Ser
1 5 10 15
Phe Ala Val Leu Leu Leu Tyr Leu Cys Leu Arg Ile Phe Thr Asn Val
20 25 30
Leu Lys Pro Lys Lys Ser Lys Thr Ser Pro Pro Gln Ala Gly Gly Ala
35 40 45
Trp Pro Leu Ile Gly His Leu His Leu Leu Ile Gly Pro Gln Ala Ser
50 55 60
Tyr Ile Thr Leu Ser Lys Met Ala Asp Lys Tyr Gly Pro Ile Phe Lys
65 70 75 80
Ile Lys Leu Gly Val His Pro Thr Leu Val Ile Ser Asn Ser Glu Val
85 90 95
Ala Lys Glu Cys Leu Thr Thr His Asp Lys Val Leu Ala Asn Arg Pro
100 105 110
Ala Thr Val Ala Met Glu Ile Met Gly Tyr Asn His Ala Met Phe Gly
115 120 125
Trp Ser Pro Tyr Gly Pro Tyr Trp Arg His Met Arg Lys Leu Ala Thr
130 135 140
Val Glu Leu Leu Ser Ala Gln Arg Leu Glu Thr Phe Lys His Ile Arg
145 150 155 160
Glu Ser Glu Leu Lys Arg Ser Met Lys Glu Met Tyr Gln Ser Trp Val
165 170 175
His Asn Lys Ser Gly Ser Gly Asp Ser Asn His Val Thr Val Asp Met
180 185 190
Thr Arg Ile Leu Gly Asp Ile Ile Ala Asn Val Ile Tyr Arg Met Val
195 200 205
Val Gly Lys Val Tyr Ala Ser Lys Gly Glu Glu Asp Ala Arg Trp Lys
210 215 220
Gln Val Val Trp Glu Tyr Ile Lys Leu Leu Ser His Phe Gly Val Gly
225 230 235 240
Asp Ala Leu Pro Phe Leu Arg Trp Leu Asp Leu Gly Gly Val Glu Lys
245 250 255
Ser Met Lys Lys Ala Ala Lys Glu Leu Asp Ile Tyr Val Glu Glu Trp
260 265 270
Leu Glu Glu His Lys Lys Lys Arg Ser Glu Arg Lys Ser Asp Asn Gly
275 280 285
Ile Val Glu Glu Asp Phe Met Asp Val Met Leu Ser Val Phe Asp Asp
290 295 300
Asp Asp Gln Leu Glu Asn Phe Ala His His Ser Ala His Thr Ile Asn
305 310 315 320
Lys Ala Met Cys Leu Ala Ile Ile Leu Ala Ala Ser Asp Thr Thr Lys
325 330 335
Thr Thr Leu Thr Trp Ala Leu Ser Leu Leu Leu Asn His Pro Asp Val
340 345 350
Met Lys Lys Val Gln Gln Glu Leu Ala Ala His Ile Gly Pro Asp Lys
355 360 365
Pro Val Lys Glu Ser Asp Val Lys Ser Leu Val Tyr Leu Glu Ala Val
370 375 380
Val Lys Glu Thr Leu Arg Leu Tyr Pro Pro Gly Pro Leu Gly Leu Pro
385 390 395 400
His Glu Ser Met Glu Asp Cys Thr Val Ala Gly Tyr His Val Pro Ser
405 410 415
Gly Thr Arg Ile Leu Tyr Asn Leu Trp Lys Ile Gln Gln Asp Pro Gln
420 425 430
Val Trp Glu Asn Pro Ser Glu Phe Lys Pro Asp Arg Phe Leu Thr Thr
435 440 445
His Lys Asp Val Asp Val Arg Gly Arg Asn Phe Glu Tyr Leu Pro Phe
450 455 460
Gly Ser Gly Arg Arg Met Cys Pro Gly Met Ser Phe Ala Leu Gln Val
465 470 475 480
Met Glu Val Ser Leu Ala Asn Met Leu His Gly Phe Asp Phe Ala Thr
485 490 495
Pro Asn Gly Lys Pro Val Asp Met Thr Glu Val Asn Gly Leu Val Thr
500 505 510
Asp Arg Ala Thr Pro Leu Glu Ala Leu Ile Thr Pro Arg Leu Pro Ala
515 520 525
His Leu Tyr Met Gly
530
<210> 76
<211> 530
<212> PRT
<213> Tripterygium wilfordii
<400> 76
Met Glu Phe Leu Leu Ser Leu Pro Thr Asn Thr Ile Ala Thr Lys Ile
1 5 10 15
Phe Ala Val Leu Leu Leu Tyr Leu Phe Leu Arg Ile Phe Thr Asn Val
20 25 30
Leu Lys Pro Lys Lys Ser Lys Thr Ser Pro Pro Gln Ala Gly Gly Ala
35 40 45
Trp Pro Leu Ile Gly His Leu His Leu Leu Ile Gly Pro Gln Ala Ser
50 55 60
Tyr Ile Thr Leu Ser Lys Met Ala Asp Lys Tyr Gly Pro Ile Phe Lys
65 70 75 80
Ile Lys Leu Gly Val His Pro Thr Leu Val Ile Ser Asn Ser Glu Val
85 90 95
Ala Lys Glu Cys Leu Thr Thr His Asp Lys Val Leu Ala Asn Arg Pro
100 105 110
Ala Thr Val Ala Met Glu Ile Met Gly Tyr Asn His Ala Met Phe Gly
115 120 125
Trp Ser Pro Tyr Gly Pro Tyr Trp Arg Gln Leu Arg Lys Leu Val Thr
130 135 140
Val Glu Leu Leu Ser Asn Gln Arg Leu Lys Thr Phe Lys His Ile Arg
145 150 155 160
Glu Ser Glu Val Lys Asn Ser Leu Lys Glu Met Tyr Gln Ser Trp Val
165 170 175
His Asn Lys Ser Gly Asp Ser Asn His Val Ser Val Asp Met Thr Arg
180 185 190
Ile Phe Gly Asp Ile Thr Gly Asn Leu Ile Tyr Arg Ile Val Val Gly
195 200 205
Lys Val Tyr Ala Arg Lys Gly Glu Gly Val Val Arg Trp Lys Gln Val
210 215 220
Val Gly Asp Tyr Met Lys Leu Leu Thr His Phe Asn Val Gly Asp Ala
225 230 235 240
Met Pro Phe Met Arg Trp Phe Asp Leu Gly Gly Leu Glu Lys Ala Met
245 250 255
Lys Ile Thr Phe Lys Glu Leu Asp Gly Tyr Val Glu Glu Trp Leu Glu
260 265 270
Glu His Lys Lys Lys Arg Ser Asn Ser Gly Gly His Gly Ile Val Glu
275 280 285
Glu Asp Phe Met Asp Val Met Leu Ser Ile Phe Asp Asp Gly Gly Gln
290 295 300
Gln Glu Tyr Cys Thr Asp Asn Ser Thr His Thr Thr Asn Lys Ala Met
305 310 315 320
Cys Met Ala Leu Ile Leu Gly Ala Ser Glu Thr Thr Lys Thr Thr Leu
325 330 335
Thr Trp Ser Leu Ser Leu Leu Leu Asn Asn Leu Asp Val Leu Lys Lys
340 345 350
Val Lys Gln Glu Leu Ala Ala His Ile Gly Pro Glu Thr Leu Val Thr
355 360 365
Glu Ser Asp Val Asn Ser Leu Val Tyr Leu Asp Ala Val Ile Thr Glu
370 375 380
Thr Leu Arg Leu Tyr Pro Leu Gly Pro Leu Gly Leu Pro His Glu Ser
385 390 395 400
Ile Glu Asp Cys Thr Ile Ala Gly Tyr His Val Pro Ala Arg Thr Arg
405 410 415
Ile Leu Phe Asn Leu Trp Lys Ile His Gln Asp Pro Arg Val Trp Glu
420 425 430
Asn Pro Leu Glu Phe Lys Pro Glu Arg Phe Leu Lys Glu His Asn Asn
435 440 445
Ile Asp Val Arg Gly Gly His Phe Glu Leu Leu Pro Phe Gly Ser Gly
450 455 460
Arg Arg Met Cys Pro Gly Val Ser Phe Ala Leu Gln Val Leu Lys Leu
465 470 475 480
Ile Leu Ala Asn Met Leu His Gly Phe Asp Phe Ala Thr Pro Asn Asp
485 490 495
Glu Pro Val Asp Met Thr Glu Val Asn His Met Ala Thr Thr Arg Ala
500 505 510
Thr Pro Leu Glu Thr Leu Ile Ser Pro Arg Leu Pro Ser His Leu Tyr
515 520 525
Met Gly
530
<210> 77
<211> 749
<212> PRT
<213> Plectranthus barbatus
<400> 77
Met Ala Trp Met Asn Asn Gly Lys Asn Leu Asn Cys Gln Leu Thr His
1 5 10 15
Lys Lys Ile Ser Lys Val Ala Glu Ile Arg Val Ala Thr Val Asn Ala
20 25 30
Pro Pro Val His Asp Gln Asp Asp Ser Thr Glu Asn Gln Cys His Asp
35 40 45
Ala Val Asn Asn Ile Glu Asp Pro Ile Glu Tyr Ile Arg Thr Leu Leu
50 55 60
Arg Thr Thr Gly Asp Gly Arg Ile Ser Val Ser Pro Tyr Asp Thr Ala
65 70 75 80
Trp Val Ala Leu Ile Lys Asp Leu Gln Gly Arg Asp Ala Pro Glu Phe
85 90 95
Pro Ser Ser Leu Glu Trp Ile Ile Gln Asn Gln Leu Ala Asp Gly Ser
100 105 110
Trp Gly Asp Ala Lys Phe Phe Cys Val Tyr Asp Arg Leu Val Asn Thr
115 120 125
Ile Ala Cys Val Val Ala Leu Arg Ser Trp Asp Val His Ala Glu Lys
130 135 140
Val Glu Arg Gly Val Arg Tyr Ile Asn Glu Asn Val Glu Lys Leu Arg
145 150 155 160
Asp Gly Asn Glu Glu His Met Thr Cys Gly Phe Glu Val Val Phe Pro
165 170 175
Ala Leu Leu Gln Arg Ala Lys Ser Leu Gly Ile Gln Asp Leu Pro Tyr
180 185 190
Asp Ala Pro Val Ile Gln Glu Ile Tyr His Ser Arg Glu Gln Lys Ser
195 200 205
Lys Arg Ile Pro Leu Glu Met Met His Lys Val Pro Thr Ser Leu Leu
210 215 220
Phe Ser Leu Glu Gly Leu Glu Asn Leu Glu Trp Asp Lys Leu Leu Lys
225 230 235 240
Leu Gln Ser Ala Asp Gly Ser Phe Leu Thr Ser Pro Ser Ser Thr Ala
245 250 255
Phe Ala Phe Met Gln Thr Arg Asp Pro Lys Cys Tyr Gln Phe Ile Lys
260 265 270
Asn Thr Ile Gln Thr Phe Asn Gly Gly Ala Pro His Thr Tyr Pro Val
275 280 285
Asp Val Phe Gly Arg Leu Trp Ala Ile Asp Arg Leu Gln Arg Leu Gly
290 295 300
Ile Ser Arg Phe Phe Glu Ser Glu Ile Ala Asp Cys Ile Ala His Ile
305 310 315 320
His Arg Phe Trp Thr Glu Lys Gly Val Phe Ser Gly Arg Glu Ser Glu
325 330 335
Phe Cys Asp Ile Asp Asp Thr Ser Met Gly Val Arg Leu Met Arg Met
340 345 350
His Gly Tyr Asp Val Asp Pro Asn Val Leu Lys Asn Phe Lys Lys Asp
355 360 365
Asp Lys Phe Ser Cys Tyr Gly Gly Gln Met Ile Glu Ser Pro Ser Pro
370 375 380
Ile Tyr Asn Leu Tyr Arg Ala Ser Gln Leu Arg Phe Pro Gly Glu Gln
385 390 395 400
Ile Leu Glu Asp Ala Asn Lys Phe Ala Tyr Asp Phe Leu Gln Glu Lys
405 410 415
Leu Ala His Asn Gln Ile Leu Asp Lys Trp Val Ile Ser Lys His Leu
420 425 430
Pro Asp Glu Ile Lys Leu Gly Leu Glu Met Pro Trp Tyr Ala Thr Leu
435 440 445
Pro Arg Val Glu Ala Arg Tyr Tyr Ile Gln Tyr Tyr Ala Gly Ser Gly
450 455 460
Asp Val Trp Ile Gly Lys Thr Leu Tyr Arg Met Pro Glu Ile Ser Asn
465 470 475 480
Asp Thr Tyr His Glu Leu Ala Lys Thr Asp Phe Lys Arg Cys Gln Ala
485 490 495
Gln His Gln Phe Glu Trp Ile Tyr Met Gln Glu Trp Tyr Glu Ser Cys
500 505 510
Asn Met Glu Glu Phe Gly Ile Ser Arg Lys Glu Leu Leu Val Ala Tyr
515 520 525
Phe Leu Ala Thr Ala Ser Ile Phe Glu Leu Glu Arg Ala Asn Glu Arg
530 535 540
Ile Ala Trp Ala Lys Ser Gln Ile Ile Ser Thr Ile Ile Ala Ser Phe
545 550 555 560
Phe Asn Asn Gln Asn Thr Ser Pro Glu Asp Lys Leu Ala Phe Leu Thr
565 570 575
Asp Phe Lys Asn Gly Asn Ser Thr Asn Met Ala Leu Val Thr Leu Thr
580 585 590
Gln Phe Leu Glu Gly Phe Asp Arg Tyr Thr Ser His Gln Leu Lys Asn
595 600 605
Ala Trp Ser Val Trp Leu Arg Lys Leu Gln Gln Gly Glu Gly Asn Gly
610 615 620
Gly Ala Asp Ala Glu Leu Leu Val Asn Thr Leu Asn Ile Cys Ala Gly
625 630 635 640
His Ile Ala Phe Arg Glu Glu Ile Leu Ala His Asn Asp Tyr Lys Thr
645 650 655
Leu Ser Asn Leu Thr Ser Lys Ile Cys Arg Gln Leu Ser Gln Ile Gln
660 665 670
Asn Glu Lys Glu Leu Glu Thr Glu Gly Gln Lys Thr Ser Ile Lys Asn
675 680 685
Lys Glu Leu Glu Glu Asp Met Gln Arg Leu Val Lys Leu Val Leu Glu
690 695 700
Lys Ser Arg Val Gly Ile Asn Arg Asp Met Lys Lys Thr Phe Leu Ala
705 710 715 720
Val Val Lys Thr Tyr Tyr Tyr Lys Ala Tyr His Ser Ala Gln Ala Ile
725 730 735
Asp Asn His Met Phe Lys Val Leu Phe Glu Pro Val Ala
740 745
<210> 78
<211> 567
<212> PRT
<213> Plectranthus barbatus
<400> 78
Met Ile Thr Ser Lys Ser Ser Ala Ala Val Lys Cys Ser Leu Thr Thr
1 5 10 15
Pro Thr Asp Leu Met Gly Lys Ile Lys Glu Val Phe Asn Arg Glu Val
20 25 30
Asp Thr Ser Pro Ala Ala Met Thr Thr His Ser Thr Asp Ile Pro Ser
35 40 45
Asn Leu Cys Ile Ile Asp Thr Leu Gln Arg Leu Gly Ile Asp Gln Tyr
50 55 60
Phe Gln Ser Glu Ile Asp Ala Val Leu His Asp Thr Tyr Arg Leu Trp
65 70 75 80
Gln Leu Lys Lys Lys Asp Ile Phe Ser Asp Ile Thr Thr His Ala Met
85 90 95
Ala Phe Arg Leu Leu Arg Val Lys Gly Tyr Glu Val Ala Ser Asp Glu
100 105 110
Leu Ala Pro Tyr Ala Asp Gln Glu Arg Ile Asn Leu Gln Thr Ile Asp
115 120 125
Val Pro Thr Val Val Glu Leu Tyr Arg Ala Ala Gln Glu Arg Leu Thr
130 135 140
Glu Glu Asp Ser Thr Leu Glu Lys Leu Tyr Val Trp Thr Ser Ala Phe
145 150 155 160
Leu Lys Gln Gln Leu Leu Thr Asp Ala Ile Pro Asp Lys Lys Leu His
165 170 175
Lys Gln Val Glu Tyr Tyr Leu Lys Asn Tyr His Gly Ile Leu Asp Arg
180 185 190
Met Gly Val Arg Arg Asn Leu Asp Leu Tyr Asp Ile Ser His Tyr Lys
195 200 205
Ser Leu Lys Ala Ala His Arg Phe Tyr Asn Leu Ser Asn Glu Asp Ile
210 215 220
Leu Ala Phe Ala Arg Gln Asp Phe Asn Ile Ser Gln Ala Gln His Gln
225 230 235 240
Lys Glu Leu Gln Gln Leu Gln Arg Trp Tyr Ala Asp Cys Arg Leu Asp
245 250 255
Thr Leu Lys Phe Gly Arg Asp Val Val Arg Ile Gly Asn Phe Leu Thr
260 265 270
Ser Ala Met Ile Gly Asp Pro Glu Leu Ser Asp Leu Arg Leu Ala Phe
275 280 285
Ala Lys His Ile Val Leu Val Thr Arg Ile Asp Asp Phe Phe Asp His
290 295 300
Gly Gly Pro Lys Glu Glu Ser Tyr Glu Ile Leu Glu Leu Val Lys Glu
305 310 315 320
Trp Lys Glu Lys Pro Ala Gly Glu Tyr Val Ser Glu Glu Val Glu Ile
325 330 335
Leu Phe Thr Ala Val Tyr Asn Thr Val Asn Glu Leu Ala Glu Met Ala
340 345 350
His Ile Glu Gln Gly Arg Ser Val Lys Asp Leu Leu Val Lys Leu Trp
355 360 365
Val Glu Ile Leu Ser Val Phe Arg Ile Glu Leu Asp Thr Trp Thr Asn
370 375 380
Asp Thr Ala Leu Thr Leu Glu Glu Tyr Leu Ser Gln Ser Trp Val Ser
385 390 395 400
Ile Gly Cys Arg Ile Cys Ile Leu Ile Ser Met Gln Phe Gln Gly Val
405 410 415
Lys Leu Ser Asp Glu Met Leu Gln Ser Glu Glu Cys Thr Asp Leu Cys
420 425 430
Arg Tyr Val Ser Met Val Asp Arg Leu Leu Asn Asp Val Gln Thr Phe
435 440 445
Glu Lys Glu Arg Lys Glu Asn Thr Gly Asn Ser Val Ser Leu Leu Gln
450 455 460
Ala Ala His Lys Asp Glu Arg Val Ile Asn Glu Glu Glu Ala Cys Ile
465 470 475 480
Lys Val Lys Glu Leu Ala Glu Tyr Asn Arg Arg Lys Leu Met Gln Ile
485 490 495
Val Tyr Lys Thr Gly Thr Ile Phe Pro Arg Lys Cys Lys Asp Leu Phe
500 505 510
Leu Lys Ala Cys Arg Ile Gly Cys Tyr Leu Tyr Ser Ser Gly Asp Glu
515 520 525
Phe Thr Ser Pro Gln Gln Met Met Glu Asp Met Lys Ser Leu Val Tyr
530 535 540
Glu Pro Leu Pro Ile Ser Pro Pro Glu Ala Asn Asn Ala Ser Gly Glu
545 550 555 560
Lys Met Ser Cys Val Ser Asn
565
<210> 79
<211> 722
<212> PRT
<213> Plectranthus barbatus
<400> 79
Met Ala Ser Cys Gly Ala Ile Gly Ser Ser Phe Leu Pro Leu Leu His
1 5 10 15
Ser Asp Glu Ser Ser Leu Leu Ser Arg Pro Thr Ala Ala Leu His Ile
20 25 30
Lys Lys Gln Lys Phe Ser Val Gly Ala Ala Leu Tyr Gln Asp Asn Thr
35 40 45
Asn Asp Val Val Pro Ser Gly Glu Gly Leu Thr Arg Gln Lys Pro Arg
50 55 60
Thr Leu Ser Phe Thr Gly Glu Lys Pro Ser Thr Pro Ile Leu Asp Thr
65 70 75 80
Ile Asn Tyr Pro Ile His Met Lys Asn Leu Ser Val Glu Glu Leu Glu
85 90 95
Ile Leu Ala Asp Glu Leu Arg Glu Glu Ile Val Tyr Thr Val Ser Lys
100 105 110
Thr Gly Gly His Leu Ser Ser Ser Leu Gly Val Ser Glu Leu Thr Val
115 120 125
Ala Leu His His Val Phe Asn Thr Pro Asp Asp Lys Ile Ile Trp Asp
130 135 140
Val Gly His Gln Ala Tyr Pro His Lys Ile Leu Thr Gly Arg Arg Ser
145 150 155 160
Arg Met His Thr Ile Arg Gln Thr Phe Gly Leu Ala Gly Phe Pro Lys
165 170 175
Arg Asp Glu Ser Pro His Asp Ala Phe Gly Ala Gly His Ser Ser Thr
180 185 190
Ser Ile Ser Ala Gly Leu Gly Met Ala Val Gly Arg Asp Leu Leu Gln
195 200 205
Lys Asn Asn His Val Ile Ser Val Ile Gly Asp Gly Ala Met Thr Ala
210 215 220
Gly Gln Ala Tyr Glu Ala Met Asn Asn Ala Gly Phe Leu Asp Ser Asn
225 230 235 240
Leu Ile Ile Val Leu Asn Asp Asn Lys Gln Val Ser Leu Pro Thr Ala
245 250 255
Thr Val Asp Gly Pro Ala Pro Pro Val Gly Ala Leu Ser Lys Ala Leu
260 265 270
Thr Lys Leu Gln Ala Ser Arg Lys Phe Arg Gln Leu Arg Glu Ala Ala
275 280 285
Lys Gly Met Thr Lys Gln Met Gly Asn Gln Ala His Glu Ile Ala Ser
290 295 300
Lys Val Asp Thr Tyr Val Lys Gly Met Met Gly Lys Pro Gly Ala Ser
305 310 315 320
Leu Phe Glu Glu Leu Gly Ile Tyr Tyr Ile Gly Pro Val Asp Gly His
325 330 335
Asn Ile Glu Asp Leu Val Tyr Ile Phe Lys Lys Val Lys Glu Met Pro
340 345 350
Ala Pro Gly Pro Val Leu Ile His Ile Ile Thr Glu Lys Gly Lys Gly
355 360 365
Tyr Pro Pro Ala Glu Val Ala Ala Asp Lys Met His Gly Val Val Lys
370 375 380
Phe Asp Pro Thr Thr Gly Lys Gln Met Lys Val Lys Thr Lys Thr Gln
385 390 395 400
Ser Tyr Thr Gln Tyr Phe Ala Glu Ser Leu Val Ala Glu Ala Glu Gln
405 410 415
Asp Glu Lys Val Val Ala Ile His Ala Ala Met Gly Gly Gly Thr Gly
420 425 430
Leu Asn Ile Phe Gln Lys Arg Phe Pro Asp Arg Cys Phe Asp Val Gly
435 440 445
Ile Ala Glu Gln His Ala Val Thr Phe Ala Ala Gly Leu Ala Thr Glu
450 455 460
Gly Leu Lys Pro Phe Cys Thr Ile Tyr Ser Ser Phe Leu Gln Arg Gly
465 470 475 480
Tyr Asp Gln Val Val His Asp Val Asp Leu Gln Lys Leu Pro Val Arg
485 490 495
Phe Met Met Asp Arg Ala Gly Leu Val Gly Ala Asp Gly Pro Thr His
500 505 510
Cys Gly Ala Phe Asp Thr Thr Tyr Met Ala Cys Leu Pro Asn Met Val
515 520 525
Val Met Ala Pro Ser Asp Glu Ala Glu Leu Met His Met Val Ala Thr
530 535 540
Ala Ala Val Ile Asp Asp Arg Pro Ser Cys Val Arg Tyr Pro Arg Gly
545 550 555 560
Asn Gly Ile Gly Val Pro Leu Pro Pro Asn Asn Lys Gly Ile Pro Leu
565 570 575
Glu Val Gly Lys Gly Arg Ile Leu Lys Glu Gly Asn Arg Val Ala Ile
580 585 590
Leu Gly Phe Gly Thr Ile Val Gln Asn Cys Leu Ala Ala Ala Gln Leu
595 600 605
Leu Gln Glu His Gly Ile Ser Val Ser Val Ala Asp Ala Arg Phe Cys
610 615 620
Lys Pro Leu Asp Gly Asp Leu Ile Lys Asn Leu Val Lys Glu His Glu
625 630 635 640
Val Leu Ile Thr Val Glu Glu Gly Ser Ile Gly Gly Phe Ser Ala His
645 650 655
Val Ser His Phe Leu Ser Leu Asn Gly Leu Leu Asp Gly Asn Leu Lys
660 665 670
Trp Arg Pro Met Val Leu Pro Asp Arg Tyr Ile Asp His Gly Ala Tyr
675 680 685
Pro Asp Gln Ile Glu Glu Ala Gly Leu Ser Ser Lys His Ile Ala Gly
690 695 700
Thr Val Leu Ser Leu Ile Gly Gly Gly Lys Asp Ser Leu His Leu Ile
705 710 715 720
Asn Met
<210> 80
<211> 525
<212> PRT
<213> Saccharomyces cerevisiae
<400> 80
Met Asp Gln Leu Val Lys Thr Glu Val Thr Lys Lys Ser Phe Thr Ala
1 5 10 15
Pro Val Gln Lys Ala Ser Thr Pro Val Leu Thr Asn Lys Thr Val Ile
20 25 30
Ser Gly Ser Lys Val Lys Ser Leu Ser Ser Ala Gln Ser Ser Ser Ser
35 40 45
Gly Pro Ser Ser Ser Ser Glu Glu Asp Asp Ser Arg Asp Ile Glu Ser
50 55 60
Leu Asp Lys Lys Ile Arg Pro Leu Glu Glu Leu Glu Ala Leu Leu Ser
65 70 75 80
Ser Gly Asn Thr Lys Gln Leu Lys Asn Lys Glu Val Ala Ala Leu Val
85 90 95
Ile His Gly Lys Leu Pro Leu Tyr Ala Leu Glu Lys Lys Leu Gly Asp
100 105 110
Thr Thr Arg Ala Val Ala Val Arg Arg Lys Ala Leu Ser Ile Leu Ala
115 120 125
Glu Ala Pro Val Leu Ala Ser Asp Arg Leu Pro Tyr Lys Asn Tyr Asp
130 135 140
Tyr Asp Arg Val Phe Gly Ala Cys Cys Glu Asn Val Ile Gly Tyr Met
145 150 155 160
Pro Leu Pro Val Gly Val Ile Gly Pro Leu Val Ile Asp Gly Thr Ser
165 170 175
Tyr His Ile Pro Met Ala Thr Thr Glu Gly Cys Leu Val Ala Ser Ala
180 185 190
Met Arg Gly Cys Lys Ala Ile Asn Ala Gly Gly Gly Ala Thr Thr Val
195 200 205
Leu Thr Lys Asp Gly Met Thr Arg Gly Pro Val Val Arg Phe Pro Thr
210 215 220
Leu Lys Arg Ser Gly Ala Cys Lys Ile Trp Leu Asp Ser Glu Glu Gly
225 230 235 240
Gln Asn Ala Ile Lys Lys Ala Phe Asn Ser Thr Ser Arg Phe Ala Arg
245 250 255
Leu Gln His Ile Gln Thr Cys Leu Ala Gly Asp Leu Leu Phe Met Arg
260 265 270
Phe Arg Thr Thr Thr Gly Asp Ala Met Gly Met Asn Met Ile Ser Lys
275 280 285
Gly Val Glu Tyr Ser Leu Lys Gln Met Val Glu Glu Tyr Gly Trp Glu
290 295 300
Asp Met Glu Val Val Ser Val Ser Gly Asn Tyr Cys Thr Asp Lys Lys
305 310 315 320
Pro Ala Ala Ile Asn Trp Ile Glu Gly Arg Gly Lys Ser Val Val Ala
325 330 335
Glu Ala Thr Ile Pro Gly Asp Val Val Arg Lys Val Leu Lys Ser Asp
340 345 350
Val Ser Ala Leu Val Glu Leu Asn Ile Ala Lys Asn Leu Val Gly Ser
355 360 365
Ala Met Ala Gly Ser Val Gly Gly Phe Asn Ala His Ala Ala Asn Leu
370 375 380
Val Thr Ala Val Phe Leu Ala Leu Gly Gln Asp Pro Ala Gln Asn Val
385 390 395 400
Glu Ser Ser Asn Cys Ile Thr Leu Met Lys Glu Val Asp Gly Asp Leu
405 410 415
Arg Ile Ser Val Ser Met Pro Ser Ile Glu Val Gly Thr Ile Gly Gly
420 425 430
Gly Thr Val Leu Glu Pro Gln Gly Ala Met Leu Asp Leu Leu Gly Val
435 440 445
Arg Gly Pro His Ala Thr Ala Pro Gly Thr Asn Ala Arg Gln Leu Ala
450 455 460
Arg Ile Val Ala Cys Ala Val Leu Ala Gly Glu Leu Ser Leu Cys Ala
465 470 475 480
Ala Leu Ala Ala Gly His Leu Val Gln Ser His Met Thr His Asn Arg
485 490 495
Lys Pro Ala Glu Pro Thr Lys Pro Asn Asn Leu Asp Ala Thr Asp Ile
500 505 510
Asn Arg Leu Lys Asp Gly Ser Val Thr Cys Ile Lys Ser
515 520 525
<210> 81
<211> 297
<212> PRT
<213> Synechococcus sp
<400> 81
Met Val Ala Gln Thr Phe Asn Leu Asp Thr Tyr Leu Ser Gln Arg Gln
1 5 10 15
Gln Gln Val Glu Glu Ala Leu Ser Ala Ala Leu Val Pro Ala Tyr Pro
20 25 30
Glu Arg Ile Tyr Glu Ala Met Arg Tyr Ser Leu Leu Ala Gly Gly Lys
35 40 45
Arg Leu Arg Pro Ile Leu Cys Leu Ala Ala Cys Glu Leu Ala Gly Gly
50 55 60
Ser Val Glu Gln Ala Met Pro Thr Ala Cys Ala Leu Glu Met Ile His
65 70 75 80
Thr Met Ser Leu Ile His Asp Asp Leu Pro Ala Met Asp Asn Asp Asp
85 90 95
Phe Arg Arg Gly Lys Pro Thr Asn His Lys Val Phe Gly Glu Asp Ile
100 105 110
Ala Ile Leu Ala Gly Asp Ala Leu Leu Ala Tyr Ala Phe Glu His Ile
115 120 125
Ala Ser Gln Thr Arg Gly Val Pro Pro Gln Leu Val Leu Gln Val Ile
130 135 140
Ala Arg Ile Gly His Ala Val Ala Ala Thr Gly Leu Val Gly Gly Gln
145 150 155 160
Val Val Asp Leu Glu Ser Glu Gly Lys Ala Ile Ser Leu Glu Thr Leu
165 170 175
Glu Tyr Ile His Ser His Lys Thr Gly Ala Leu Leu Glu Ala Ser Val
180 185 190
Val Ser Gly Gly Ile Leu Ala Gly Ala Asp Glu Glu Leu Leu Ala Arg
195 200 205
Leu Ser His Tyr Ala Arg Asp Ile Gly Leu Ala Phe Gln Ile Val Asp
210 215 220
Asp Ile Leu Asp Val Thr Ala Thr Ser Glu Gln Leu Gly Lys Thr Ala
225 230 235 240
Gly Lys Asp Gln Ala Ala Ala Lys Ala Thr Tyr Pro Ser Leu Leu Gly
245 250 255
Leu Glu Ala Ser Arg Gln Lys Ala Glu Glu Leu Ile Gln Ser Ala Lys
260 265 270
Glu Ala Leu Arg Pro Tyr Gly Ser Gln Ala Glu Pro Leu Leu Ala Leu
275 280 285
Ala Asp Phe Ile Thr Arg Arg Gln His
290 295
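The listing above writes each sequence in three-letter residue codes, with position counters on alternating lines. Below is a minimal sketch, assuming Python, of how such a block could be collapsed into a one-letter string for use with alignment tools. The names `THREE_TO_ONE` and `listing_to_one_letter` are illustrative and not part of the patent; only the 20 standard residues are handled.

```python
# Standard three-letter to one-letter amino acid codes (20 canonical residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def listing_to_one_letter(lines):
    """Convert sequence-listing lines to a one-letter string.

    Lines made up only of integers are treated as position counters
    and skipped; every other token must be a three-letter residue code.
    """
    residues = []
    for line in lines:
        tokens = line.split()
        if tokens and all(token.isdigit() for token in tokens):
            continue  # position-counter line such as "1 5 10 15"
        for token in tokens:
            residues.append(THREE_TO_ONE[token])  # KeyError on unknown codes
    return "".join(residues)


# First row of SEQ ID NO: 81 as it appears in the listing.
first_row = [
    "Met Val Ala Gln Thr Phe Asn Leu Asp Thr Tyr Leu Ser Gln Arg Gln",
    "1 5 10 15",
]
print(listing_to_one_letter(first_row))
```

Applied to a complete entry, the length of the returned string can be compared with the value given in that entry's `<211>` field as a quick consistency check.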

Claims (37)

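1. A recombinant host cell capable of producing an oxygenated diterpenoid compound, wherein the host cell:
i. is capable of producing an abietane-type diterpene and/or dehydroabietadiene; and
ii. comprises a first heterologous nucleic acid encoding a first enzyme having cytochrome P450 activity, wherein the first enzyme having cytochrome P450 activity is the cytochrome P450 enzyme TwCYP82D274 as set forth in SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 74 or SEQ ID NO: 75, or a functional homologue thereof having at least 80%, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity thereto, or a mature polypeptide thereof, whereby the host cell is capable of converting the abietane-type diterpene and/or dehydroabietadiene into 14-hydroxy-dehydroabietadiene.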
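2. The recombinant host cell according to claim 1, wherein the recombinant host cell further comprises a second heterologous nucleic acid encoding a second enzyme having cytochrome P450 activity, wherein the second enzyme having cytochrome P450 activity is the cytochrome P450 enzyme TwCYP71BE86 as set forth in SEQ ID NO: 4, or a functional homologue thereof having at least 80%, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity thereto, or a mature polypeptide thereof.
3. The recombinant host cell according to claim 2, wherein the recombinant host cell comprises and expresses the first heterologous nucleic acid and the second heterologous nucleic acid, whereby the cell is capable of producing 14-hydroxy-dehydroabietadiene, 3,14-dihydroxy-dehydroabietadiene, 3,14-dihydroxyabeodiene and 14-hydroxy-18-al-abeodiene.
4. The recombinant host cell according to any one of the preceding claims, wherein the recombinant host cell further comprises a third heterologous nucleic acid encoding a third enzyme having cytochrome P450 activity, wherein the third enzyme having cytochrome P450 activity is the cytochrome P450 enzyme TwCYP71BE85 as set forth in SEQ ID NO: 3, or a functional homologue thereof having at least 80%, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity thereto, or a mature polypeptide thereof.
5. The recombinant host cell according to claim 4, wherein the recombinant host cell comprises and expresses the first heterologous nucleic acid and the third heterologous nucleic acid, whereby the cell is capable of producing 14-hydroxy-dehydroabietadiene.
6. The recombinant host cell according to claim 4, wherein the recombinant host cell comprises and expresses the first heterologous nucleic acid, the second heterologous nucleic acid and the third heterologous nucleic acid, whereby the cell is capable of producing 14-hydroxy-dehydroabietadiene, 3,14-dihydroxy-dehydroabietadiene, 3,14-dihydroxyabeodiene, 14-hydroxy-18-al-abeodiene and triptophenolide.
7. The recombinant host cell according to any one of the preceding claims, wherein the recombinant host cell further comprises a fourth heterologous nucleic acid encoding a fourth enzyme having cytochrome P450 activity, wherein the fourth enzyme having cytochrome P450 activity is the cytochrome P450 enzyme TwCYP82D213 as set forth in SEQ ID NO: 5, or a functional homologue thereof having at least 80%, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity thereto, or a mature polypeptide thereof.
8. The recombinant host cell according to claim 7, wherein the recombinant host cell comprises and expresses the first heterologous nucleic acid, the second heterologous nucleic acid, the third heterologous nucleic acid and the fourth heterologous nucleic acid, whereby the cell is capable of producing 14-hydroxy-dehydroabietadiene, 3,14-dihydroxy-dehydroabietadiene, 3,14-dihydroxyabeodiene, 14-hydroxy-18-al-abeodiene, triptophenolide and triptonide.
9. The recombinant host cell according to any one of the preceding claims, wherein the host cell further comprises a fifth heterologous nucleic acid encoding a fifth enzyme having cytochrome P450 activity, wherein the fifth enzyme having cytochrome P450 activity is the cytochrome P450 enzyme TwCYP82D217 as set forth in SEQ ID NO: 6, or a functional homologue thereof having at least 80%, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity thereto, or a mature polypeptide thereof.
10. The recombinant host cell according to any one of the preceding claims, wherein the host cell further comprises a sixth heterologous nucleic acid encoding a sixth enzyme having cytochrome P450 activity, wherein the sixth enzyme having cytochrome P450 activity is the cytochrome P450 enzyme TwCYP82D275 as set forth in SEQ ID NO: 7, or a functional homologue thereof having at least 80%, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity thereto, or a mature polypeptide thereof.
11. The recombinant host cell according to any one of the preceding claims, wherein the host cell further comprises a seventh heterologous nucleic acid encoding an enzyme having cytochrome B5 activity, wherein the enzyme having cytochrome B5 activity is the cytochrome B5 TwB5#1 as set forth in SEQ ID NO: 8, or a functional homologue thereof having at least 80%, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity thereto, or a mature polypeptide thereof.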
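12. The recombinant host cell according to any one of the preceding claims, wherein the recombinant host cell expresses one or more of the following:
i. a geranylgeranyl diphosphate synthase;
ii. a diterpene synthase capable of converting geranylgeranyl diphosphate (GGPP) into an abietane-type diterpene;
iii. a combination of two or more diterpene synthases, the combination being capable of converting GGPP into an abietane-type diterpene; or
iv. a copalyl diphosphate synthase and an abietane-type diterpene synthase,
whereby the cell is capable of producing an abietane-type diterpene and/or dehydroabietadiene.
13. The recombinant host cell according to claim 12, wherein the geranylgeranyl diphosphate synthase comprises a polypeptide having the amino acid sequence of SEQ ID NO: 73 or SEQ ID NO: 81, or a functional homologue thereof having at least 80%, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity thereto, or a mature polypeptide thereof.
14. The recombinant host cell according to any one of claims 12 to 13, wherein the combination of two or more diterpene synthases capable of converting GGPP into an abietane-type diterpene is a combination of CfTPS1 as set forth in SEQ ID NO: 67 and CfTPS3 as set forth in SEQ ID NO: 68, or a combination of CftTPS1 as set forth in SEQ ID NO: 77 and CftTPS3 as set forth in SEQ ID NO: 78, or a combination of respective functional homologues thereof having at least 80%, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity thereto; or a combination of TwTPS9 as set forth in SEQ ID NO: 69 and TwTPS27 as set forth in SEQ ID NO: 70, or a combination of respective functional homologues thereof having at least 80%, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity thereto.
15. The recombinant host cell according to any one of claims 12 to 14, wherein the combination of a copalyl diphosphate synthase and an abietane-type diterpene synthase is a combination of SmCPS as set forth in SEQ ID NO: 71 and SmKSL as set forth in SEQ ID NO: 72, or a combination of respective functional homologues thereof having at least 80%, preferably at least 85%, preferably at least 90%, preferably at least 95%, more preferably at least 98% sequence identity thereto.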
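16. The recombinant host cell according to any one of the preceding claims, wherein the recombinant host cell is a prokaryotic cell or a eukaryotic cell.
17. The recombinant host cell according to any one of the preceding claims, wherein the recombinant host cell is a eukaryotic cell of a species selected from the group consisting of Saccharomyces, Schizosaccharomyces, Kluyveromyces, Pichia, Candida and Yarrowia.
18. The recombinant host cell according to any one of the preceding claims, wherein the recombinant host cell is a Saccharomyces cerevisiae cell.
19. The recombinant host cell according to any one of claims 1 to 16, wherein the recombinant host cell is a prokaryotic cell of a species selected from Escherichia coli, Bacillus, Lactobacillus and Corynebacterium.
20. The recombinant host cell according to any one of claims 1 to 16, wherein the recombinant host cell is a plant cell or is comprised in a plant, wherein the plant may be tobacco, and/or the host cell is a cell from another multicellular host.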
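21. A method for producing an oxygenated diterpenoid compound, such as triptonide, the method comprising the steps of:
i. providing a recombinant host cell according to any one of the preceding claims; and
ii. culturing the recombinant host cell under conditions suitable for producing the oxygenated diterpenoid compound.
22. The method according to claim 21, wherein the oxygenated diterpenoid compound is selected from 14-OH-dehydroabietadiene, triptophenolide and triptonide.
23. The method according to any one of claims 21 to 22, wherein the oxygenated diterpenoid compound is triptonide.
24. The method according to any one of claims 22 to 23, further comprising a step of recovering and optionally purifying the triptonide.
25. A method for producing triptolide, the method comprising:
i. producing triptonide by the method according to any one of claims 22 to 24;
ii. converting the triptonide into triptolide; and
iii. optionally recovering and/or purifying the triptolide.
26. Use of the recombinant host cell according to any one of claims 1 to 20 for producing an oxygenated diterpenoid compound.
27. The use according to claim 26, wherein the oxygenated diterpenoid compound is selected from 14-OH-dehydroabietadiene, triptophenolide and triptonide.
28. The use according to claim 26, wherein the oxygenated diterpenoid compound is triptonide, and wherein the triptonide is further converted into triptolide.
29. The use according to any one of claims 26 to 28, wherein the oxygenated diterpenoid compound is recovered using one or more separation and/or chromatography steps.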
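30. A polypeptide having cytochrome P450 enzyme activity, the polypeptide comprising an amino acid sequence having at least 80%, preferably at least 85%, preferably at least 90%, preferably at least 95%, preferably at least 96%, preferably at least 97%, preferably at least 98%, or even 100% sequence identity to one of the sequences SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6 and SEQ ID NO: 7, or a mature polypeptide thereof; or
a polypeptide having cytochrome B5 enzyme activity, the polypeptide comprising an amino acid sequence having at least 80%, preferably at least 85%, preferably at least 90%, preferably at least 95%, preferably at least 96%, preferably at least 97%, preferably at least 98%, or even 100% sequence identity to SEQ ID NO: 8.
31. A polynucleotide encoding the polypeptide of claim 30.
32. A plasmid, expression vector, expression construct or recombinant host cell comprising the polynucleotide of claim 31.
33. The compound 14-OH-dehydroabietadiene.
34. A compound selected from the following formulae (1) to (17):
35. The compound according to claim 34, wherein the compound is the compound of formula (6) (F20P2).
36. The compound according to claim 34, wherein the compound is the compound according to formula (10) (F15P1).
37. The compound according to claim 34, wherein the compound is the compound of formula (15) (F15P2).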
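Several of the claims above define functional homologues by tiered sequence-identity thresholds (at least 80%, 85%, 90%, 95% or 98%). Below is a minimal sketch, assuming Python, of one common way to compute percent identity from two sequences that have already been aligned with `-` as the gap character. The choice of denominator (all aligned columns, including gapped ones) is an assumption made for illustration; this section of the claims does not fix an alignment algorithm or counting convention, and `percent_identity` and `thresholds_met` are hypothetical helper names.

```python
# Identity tiers recited in the claims (percent).
CLAIM_THRESHOLDS = (80, 85, 90, 95, 98)


def percent_identity(aligned_a, aligned_b):
    """Percent identity over the columns of a pre-computed pairwise alignment.

    Both strings must have equal length and use '-' for gaps. Columns that
    are gaps in both sequences are ignored; a gap opposite a residue counts
    as a mismatch (an assumed convention, not one taken from the patent).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b)
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment has no residue columns")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def thresholds_met(identity):
    """Return the claim tiers (in percent) that a given identity satisfies."""
    return [t for t in CLAIM_THRESHOLDS if identity >= t]


# Toy example: ten aligned residues with one substitution.
identity = percent_identity("MVAQTFNLDT", "MVAQTFNLDS")
print(identity, thresholds_met(identity))
```

In practice the alignment itself would come from a dedicated tool, and the reported identity depends on that tool's scoring parameters and on how gaps and terminal overhangs are counted.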
CN202180051016.3A 2020-08-27 2021-08-26 Production of oxygenated diterpenoid compounds Pending CN116615554A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
DKPA202000964 2020-08-27
DKPA202000964 2020-08-27
PCT/EP2021/073656 WO2022043461A1 (en) 2020-08-27 2021-08-26 Production of oxygenated diterpenoid compounds

Publications (1)

Publication Number Publication Date
CN116615554A (zh) 2023-08-18

Family

ID=80354714

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202180051016.3A Pending CN116615554A (zh) 2020-08-27 2021-08-26 Production of oxygenated diterpenoid compounds

Country Status (11)

Country Link
EP (1) EP4204576A1 (zh)
JP (1) JP2023539092A (zh)
KR (1) KR20230058053A (zh)
CN (1) CN116615554A (zh)
AU (1) AU2021335016A1 (zh)
BR (1) BR112023002950A2 (zh)
CA (1) CA3192028A1 (zh)
CL (1) CL2023000475A1 (zh)
IL (1) IL300574A (zh)
MX (1) MX2023001925A (zh)
WO (1) WO2022043461A1 (zh)

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040031072A1 (en) * 1999-05-06 2004-02-12 La Rosa Thomas J. Soy nucleic acid molecules and other molecules associated with transcription plants and uses thereof for plant improvement
WO2015113570A1 (en) 2014-01-31 2015-08-06 University Of Copenhagen Methods for producing diterpenes
WO2015197075A1 (en) * 2014-06-23 2015-12-30 University Of Copenhagen Methods and materials for production of terpenoids
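CN107058419B (zh) 2016-12-12 2018-04-13 Capital Medical University Use of Tripterygium wilfordii TwKS and TwCPS3 in the preparation of kaurane-type diterpenoid compounds
EP3404107A1 (de) * 2017-05-19 2018-11-21 Jowat SE Method for the fermentative de novo synthesis of resin acids
US20190270971A1 (en) 2018-03-01 2019-09-05 Jason Donald Increasing productivity of microbial host cells that functionally express p450 enzymes
CN108395997A (zh) 2018-03-05 2018-08-14 Capital Medical University An engineered yeast strain with high production of diterpenoid components
CN108866029B (zh) 2018-08-08 2019-06-04 Capital Medical University Tripterygium wilfordii triterpene synthase TwOSC3, its encoding gene and use
CN108866030B (zh) 2018-08-08 2019-05-07 Capital Medical University Tripterygium wilfordii triterpene synthase TwOSC1, its encoding gene and use
CN110747178B (zh) 2019-11-08 2021-05-07 Capital Medical University Use of Tripterygium wilfordii cytochrome P450 oxidase in the preparation of abietane-type diterpenoid compounds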

Also Published As

Publication number Publication date
EP4204576A1 (en) 2023-07-05
BR112023002950A2 (pt) 2023-03-21
JP2023539092A (ja) 2023-09-13
KR20230058053A (ko) 2023-05-02
MX2023001925A (es) 2023-08-03
AU2021335016A1 (en) 2023-03-02
IL300574A (en) 2023-04-01
WO2022043461A1 (en) 2022-03-03
CL2023000475A1 (es) 2023-07-21
CA3192028A1 (en) 2022-03-03

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination