CN114717170A - 异源合成黄酮类化合物的宿主细胞及其应用 - Google Patents

异源合成黄酮类化合物的宿主细胞及其应用 Download PDF

Info

Publication number
CN114717170A
CN114717170A CN202110009696.4A CN202110009696A CN114717170A CN 114717170 A CN114717170 A CN 114717170A CN 202110009696 A CN202110009696 A CN 202110009696A CN 114717170 A CN114717170 A CN 114717170A
Authority
CN
China
Prior art keywords
leu
ala
val
gly
ser
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110009696.4A
Other languages
English (en)
Other versions
CN114717170B (zh
Inventor
王勇
李建华
计东尼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Center for Excellence in Molecular Plant Sciences of CAS
Original Assignee
Center for Excellence in Molecular Plant Sciences of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Center for Excellence in Molecular Plant Sciences of CAS filed Critical Center for Excellence in Molecular Plant Sciences of CAS
Priority to CN202110009696.4A priority Critical patent/CN114717170B/zh
Priority to PCT/CN2022/070316 priority patent/WO2022148377A1/zh
Publication of CN114717170A publication Critical patent/CN114717170A/zh
Application granted granted Critical
Publication of CN114717170B publication Critical patent/CN114717170B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1025Acyltransferases (2.3)
    • C12N9/1029Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
    • C12N9/1037Naringenin-chalcone synthase (2.3.1.74), i.e. chalcone synthase
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/415Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/52Genes encoding for enzymes or proenzymes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/70Vectors or expression systems specially adapted for E. coli
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/74Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0012Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7)
    • C12N9/0036Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7) acting on NADH or NADPH (1.6)
    • C12N9/0038Oxidoreductases (1.) acting on nitrogen containing compounds as donors (1.4, 1.5, 1.6, 1.7) acting on NADH or NADPH (1.6) with a heme protein as acceptor (1.6.2)
    • C12N9/0042NADPH-cytochrome P450 reductase (1.6.2.4)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0071Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/88Lyases (4.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/90Isomerases (5.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/93Ligases (6)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P17/00Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
    • C12P17/02Oxygen as only ring hetero atoms
    • C12P17/06Oxygen as only ring hetero atoms containing a six-membered hetero ring, e.g. fluorescein
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y106/00Oxidoreductases acting on NADH or NADPH (1.6)
    • C12Y106/02Oxidoreductases acting on NADH or NADPH (1.6) with a heme protein as acceptor (1.6.2)
    • C12Y106/02004NADPH-hemoprotein reductase (1.6.2.4), i.e. NADP-cytochrome P450-reductase
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y114/00Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
    • C12Y114/11Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors (1.14.11)
    • C12Y114/11022Flavone synthase (1.14.11.22)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y114/00Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
    • C12Y114/11Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors (1.14.11)
    • C12Y114/11023Flavonol synthase (1.14.11.23)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y203/00Acyltransferases (2.3)
    • C12Y203/01Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
    • C12Y203/01074Naringenin-chalcone synthase (2.3.1.74), i.e. chalcone synthase
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y403/00Carbon-nitrogen lyases (4.3)
    • C12Y403/01Ammonia-lyases (4.3.1)
    • C12Y403/01024Phenylalanine ammonia-lyase (4.3.1.24)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y505/00Intramolecular lyases (5.5)
    • C12Y505/01Intramolecular lyases (5.5.1)
    • C12Y505/01006Chalcone isomerase (5.5.1.6)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y602/00Ligases forming carbon-sulfur bonds (6.2)
    • C12Y602/01Acid-Thiol Ligases (6.2.1)
    • C12Y602/010124-Coumarate-CoA ligase (6.2.1.12)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12RINDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
    • C12R2001/00Microorganisms ; Processes using microorganisms
    • C12R2001/01Bacteria or Actinomycetales ; using bacteria or Actinomycetales
    • C12R2001/185Escherichia
    • C12R2001/19Escherichia coli

Landscapes

  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Biotechnology (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Medicinal Chemistry (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Botany (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)

Abstract

本发明提供了一种异源合成黄芩素、野黄芩素类或白杨素类化合物的宿主细胞及其应用。本发明提供了新型的黄芩素、野黄芩素类化合物或白杨素类化合物生物合成的优化改造,可实现以原核生物为底盘利用酶自组装技术合成黄芩素、野黄芩素和白杨素类化合物,以及实现利用葡萄糖从头合成黄芩素类化合物。本发明也揭示了优化改造后的宿主细胞及其应用。

Description

异源合成黄酮类化合物的宿主细胞及其应用
技术领域
本发明涉及合成生物学及医药技术领域,具体地,本发明涉及异源合成黄酮类化合物的宿主细胞及其应用。
背景技术
黄芩素和野黄芩素是黄酮类化合物,主要存在于中药黄芩中。这两种活性黄酮仅在黄芩等相关药用植物的根中积累较少。黄芩素和黄芩素都是通过类黄酮生物合成途径合成的。黄芩素和野黄芩素具有抗氧化、抗肿瘤、抗菌、护心等重要生理活性。最近,黄芩素在体外被报道为SARS-CoV-2 3Clpro的抑制剂,显示了中药的巨大潜力。
目前,黄酮类化合物的主要来源是从植物中提取和化学合成。然而,由于使用有毒化学物质和极端的反应条件,植物萃取或化学合成无法提供大规模生产的绿色路线。因此,微生物合成黄酮类化合物的研究已经深入开展。由于从植物引入复杂的异质途径,酶的失衡和中间代谢物的积累,通常会导致产物滴度低。为了解决这些问题,采用多变量模块化方法,通过调节启动子强度和质粒拷贝数来合成黄酮类化合物。然而,它是费时的,总是需要大量的工作。之前的工作报道了在工程酵母和大肠杆菌中实现黄芩素和野黄芩素的合成,但黄芩素和野黄芩素的产量仍处于很低水平。
建立底物通道是提高酶在体内催化效率的有效途径。利用相互作用蛋白对组装多酶已被证明是提高催化效率的有效方法。脚手架也是增加产品滴度的有效策略,许多不同构象的连接子被开发用于构建各种相互作用蛋白。
在本领域中,多种天然产物的合成元件经过组装后实现了在微生物中的异源合成。但是利用生物体内酶组装策略生产黄芩素和野黄芩素这两种活性黄酮类化合物还尚未见报道;从葡萄糖合成黄芩素和野黄芩素的大肠杆菌还尚未见报道。
因此,本领域亟待优化能够高效地异源合成黄芩素和野黄芩素或类似化合物的微生物菌株。
发明内容
本发明的目的在于提供异源合成黄芩素、野黄芩素类或白杨素类化合物的宿主细胞及其应用。
在本发明的第一方面,提供一种用于合成黄芩素、野黄芩素类化合物(如黄芩素或野黄芩素)的原核细胞,其包括外源的下组酶的编码基因:黄酮6-羟化酶(F6H),细胞色素P450氧化还原酶(CPR),苯丙氨酸解氨酶(PAL)、4-香豆酸辅酶A连接酶(4CL)、查尔酮合成酶(CHS)、查尔酮异构酶(CHI)和黄酮合成酶I(FNSI);且所述酶被表达后,苯丙氨酸解氨酶(PAL)和4-香豆酸辅酶A连接酶(4CL)构成复合体(复合反应器)。
在本发明的另一方面,提供一种用于合成白杨素类化合物(如白杨素或芹菜素)的原核细胞,其包括外源的下组酶的编码基因:苯丙氨酸解氨酶(PAL)、4-香豆酸辅酶A连接酶(4CL)、查尔酮合成酶(CHS)、查尔酮异构酶(CHI)和黄酮合成酶I(FNSI);且所述酶被表达后,苯丙氨酸解氨酶(PAL)和4-香豆酸辅酶A连接酶(4CL)构成复合体(复合反应器)。
在一个优选例中,所述的苯丙氨酸解氨酶和4-香豆酸辅酶A连接酶的复合体包括:苯丙氨酸解氨酶和4-香豆酸辅酶A通过蛋白-蛋白相互作用结构域及其配体的结合而靠近,获得复合体。
在另一优选例中,所述的苯丙氨酸解氨酶和4-香豆酸辅酶A连接酶直接连接或通过连接子连接、获得融合蛋白形式的复合体。
在另一优选例中,所述蛋白-蛋白相互作用结构域包括选自下组的结构域:PDZ结构域,SH3结构域,WW结构域,LIM结构域,DD结构域,PH结构域,EH结构域,GBD结构域。
在另一优选例中,所述蛋白-蛋白相互作用结构域包括PDZ结构域,其配体为PDZligand;所述苯丙氨酸解氨酶和4-香豆酸辅酶A分别与所述PDZ结构域及其配体融合;较佳地,所述苯丙氨酸解氨酶与PDZ融合、所述4-香豆酸辅酶A与PDZ ligand融合;更佳地,所述苯丙氨酸解氨酶与PDZ融合时还包括以ER/K连接子连接(PAL-ER/K-PDZ),所述4-香豆酸辅酶A与PDZ ligand融合时还包括以(GGGGS)2连接子连接(PDZlig-(GGGGS)2-4CL)。
在另一优选例中,所述蛋白-蛋白相互作用结构域包括SH3结构域,其配体为SH3ligand;所述苯丙氨酸解氨酶和4-香豆酸辅酶A分别与所述SH3结构域及其配体融合;较佳地,所述苯丙氨酸解氨酶与SH3融合、所述4-香豆酸辅酶A与SH3 ligand融合;更佳地,所述苯丙氨酸解氨酶与SH3融合时还包括以ER/K连接子连接(PAL-ER/K-SH3),所述4-香豆酸辅酶A与SH3 ligand融合时还包括以(GGGGS)2连接子连接(SH3lig-(GGGGS)2-4CL)。
在另一优选例中,所述苯丙氨酸解氨酶与PDZ融合时,所述苯丙氨酸解氨酶位于N端,所述PDZ位于C端。
在另一优选例中,所述4-香豆酸辅酶A与PDZ ligand融合时,所述PDZ ligand位于N端,所述4-香豆酸辅酶A位于C端。
在另一优选例中,所述苯丙氨酸解氨酶与SH3融合时;所述苯丙氨酸解氨酶位于N端,所述SH3位于C端。
在另一优选例中,所述4-香豆酸辅酶A与SH3 ligand融合时,所述SH3 ligand位于N端,所述4-香豆酸辅酶A位于C端。
在另一优选例中,所述细胞中还包括外源的促进丙二酰CoA生成的酶的编码基因;较佳地,包括matC,matB,ACS,FabF。
在另一优选例中,所述的原核细胞为大肠杆菌细胞。
在另一优选例中,所述细胞中还包括外源的促进苯丙氨酸合成的酶的编码基因;较佳地,包括:aroG,pheA;更佳地,所述pheA为第976位由A突变为C的基因;更佳地,所述aroG为第436位由G突变为A的基因。
在另一优选例中,所述“促进”为统计学意义的“促进”,例如促进5%以上,10%以上,20%以上,50%以上,80%以上,100%以上或更高。
在本发明的另一方面,提供所述的原核细胞的应用,用于合成黄芩素或野黄芩素类化合物。
在本发明的另一方面,提供所述的原核细胞的应用,用于合成白杨素类化合物。
在本发明的另一方面,提供一种合成黄芩素或野黄芩素类化合物的方法,包括:提供所述的原核细胞(含有F6H和CPR),以式(I)为底物,合成黄芩素或野黄芩素类化合物;
Figure BDA0002884548880000031
其中,R包括H或OH。
在本发明的另一方面,提供一种合成白杨素类化合物的方法,包括:提供所述的原核细胞(可不含有F6H和CPR),以式(I)为底物,合成白杨素类化合物。
在本发明的另一方面,提供一种合成黄芩素或野黄芩素类化合物或白杨素类化合物的方法,包括:提供所述的原核细胞,以葡萄糖为底物,合成黄芩素或野黄芩素类化合物或白杨素类化合物。
在一个优选例中,在引入细胞时,所述PDZligand、4-香豆酸辅酶A连接酶、苯丙氨酸解氨酶、ER/K、PDZ、黄酮合成酶I、查尔酮合成酶、查尔酮异构酶的编码基因位于一个构建体(质粒)中。
在另一优选例中,所述黄酮6-羟化酶,细胞色素P450氧化还原酶的编码基因位于一个构建体中,较佳地还包括2B1(细胞色素P450 2B1家族可溶性蛋白)基因。
在另一优选例中,所述matC,matB,ACS,FabF的编码基因位于一个构建体中。
在另一优选例中,所述SH3lig,4-香豆酸辅酶A连接酶,苯丙氨酸解氨酶,ER/K,SH3,查尔酮合成酶的编码基因位于一个构建体中。
在另一优选例中,所述查尔酮异构酶,黄酮合成酶I的编码基因位于一个构建体中。
在另一优选例中,所述matC、matB、ACS、FabF的编码基因,第976位由A突变为C的pheA基因(pheAfbr),第436位由G突变为A的aroG基因(aroGfbr)位于一个构建体中。
在本发明的另一方面,提供一种用于生产黄芩素或野黄芩素类化合物(试剂盒中含有F6H和CPR)或白杨素类化合物(试剂盒中可不含有F6H和CPR)的试剂盒,其包括所述的重组的宿主细胞。
在本发明的另一方面,提供一种用于建立合成黄芩素或野黄芩素类化合物或白杨素类化合物的宿主细胞的试剂盒,其包括:包含PDZligand、4-香豆酸辅酶A连接酶、苯丙氨酸解氨酶、ER/K、PDZ、黄酮合成酶I、查尔酮合成酶、查尔酮异构酶的编码基因的构建体;包含matC,matB,ACS,FabF的编码基因的构建体;包含SH3lig,4-香豆酸辅酶A连接酶,苯丙氨酸解氨酶,ER/K,SH3,查尔酮合成酶的编码基因的构建体;包含查尔酮异构酶,黄酮合成酶I的编码基因的构建体;包含matC、matB、ACS、FabF的编码基因,第976位由A突变为C的pheA基因(pheAfbr),第436位由G突变为A的aroG基因(aroGfbr)的构建体;可选地,还包含黄酮6-羟化酶,细胞色素P450氧化还原酶的编码基因的构建体;较佳地所述构建体还包含2B1基因。
在一个优选例中,所述的试剂盒中还包括:葡萄糖;或,式(I)底物。
本发明的其它方面由于本文的公开内容,对本领域的技术人员而言是显而易见的。
附图说明
图1、质粒pZZ41的构建示意图。
图2、质粒pZZ55的构建示意图。
图3、非自组装菌株DN-1、自组装菌株DN-2的黄芩素产量的柱形图。
图4、非自组装菌株DN-1、自组装菌株DN-2产生的黄芩素的HPLC检测图谱。
图5、对照菌株DN-3相比,自组装菌株DN-4的黄芩素产量的柱形图。
图6、非自组装菌株DN-1、自组装菌株DN-2的野黄芩素产量的柱形图。
图7、自组装工程菌株DN-6、非自组装工程菌株DN-5的黄芩素产量的柱形图。
图8、以苯丙氨酸为前体,进行发酵生成黄芩素、野黄岑素的合成途径示意图。
图9、以葡萄糖为前体,进行发酵生成黄芩素的合成途径示意图。
图10、非自组装的菌株JH-0相比与自组装菌株DN-0的白杨素产量比较。
具体实施方式
本发明人经过深入的研究,提供了新型的黄芩素或野黄芩素类化合物/白杨素类化合物生物合成的优化改造,可实现以原核生物为底盘利用酶自组装技术合成黄芩素或野黄芩素类化合物/白杨素类化合物,以及实现利用葡萄糖从头合成黄芩素或野黄芩类化合物/白杨素类化合物。本发明也揭示了优化改造后的宿主细胞及其应用。
如本文所用,“外源的”或“异源的”是指来自不同来源的两条或多条核酸或蛋白质序列之间的关系。
如本文所用,所述的“可操作地连接(相连)”或“操作性连接(相连)”是指两个或多个核酸区域或核酸序列的功能性的空间排列。例如:启动子区被置于相对于目的基因核酸序列的特定位置,使得核酸序列的转录受到该启动子区域的引导,从而,启动子区域被“可操作地连接”到该核酸序列上。
如本文所用,所述的“表达构建物”是指重组DNA分子,它包含预期的核酸编码序列,其可以包含一个或多个基因表达盒。所述的“构建物”通常被包含在表达载体中。
如本文所用,所述的PAL、4CL、CHS、CHI和FNSI蛋白是在表达系统中形成白杨素或芹菜素合成途径的蛋白。
如本文所用,所述的F6H和CPR蛋白是在表达系统中转化白杨素或芹菜素、生成黄芩素或野黄芩素类化合物的蛋白。
如本文所用,所述的matC、matB、ACS和/或FabF蛋白在表达系统中促进丙二酰CoA生成的酶。
如本文所用,所述的aroG或其突变体,pheA或其突变体在表达系统中促进苯丙氨酸合成。
野生型的上述蛋白或基因为本领域已经鉴定的,因此,可以从公众途径获得和制备。作为本发明的优选方式,PAL来源于红景天(Rhodotorula toruloides),其具有GenBank登录号AAA33883.1所示的序列;4CL来源于欧芹(Petroselium crispum),其具有GenBank登录号KF765780.1所示的序列;CHS来源于矮牵牛(Petunia X hybrida),其具有GenBank登录号KF765781.1所示的序列;CHI基因来源于苜蓿(Medicago sativa),其具有GenBank登录号KF765782.1所示的序列;FNSI来源于欧芹(Petroselium crispum),其具有Swiss-Prot登录号Q7XZQ8.1所示的序列。
野生型的F6H和CPR也是本领域已经鉴定的。作为本发明的优选方式,F6H来源于黄岑(Scutellaria baicalensis),其具有GenBank登录号ASW21050.1所示的序列。作为本发明的优选方式,CPR来自于拟南芥(Arabidopsis thaliana),其具有GenBank登录号NP_849472.2所示的序列。
野生型的matC、matB、ACS、FabF蛋白也是本领域已经鉴定的。作为本发明的优选方式,matC来源于豆科根瘤菌(Rhizobium leguminosarum),其具有GenBank登录号KF765784.1所示的序列;matB来源于豆科根瘤菌(Rhizobium leguminosarum),其具有GenBank登录号AGZ04579.1所示的序列;ACS来源于大肠杆菌(Escherichia coli),其具有GenBank登录号CP062211.1所示的序列;FabF来源于大肠杆菌(Escherichia coli),其具有GenBank登录号AP023237.1所示的序列。
黄芩素和野黄芩素是二个结构相似且重要的黄酮类化合物。黄芩素的分子式为C15H10O5,分子量为270.24,而野黄芩素的分子量为C15H10O6,分子量为286.24。它们的结构如下所示:
Figure BDA0002884548880000061
本发明人发现,利用宿主细胞生产黄芩素、野黄岑素类化合物(如黄芩素或野黄芩素)或其前体白杨素类化合物(如白杨素或芹菜素)的过程中,仍然存在产物含量不够高的情形,因此对多个参与反应的蛋白进行了分析,经过大量筛选和实验,获得了一种优选的改造方案,极为显著地提高了微生物,尤其是原核表达系统(原核细胞)如大肠杆菌中化合物的产量。
作为本发明的改造方案的一个方面,本发明人利用酶组装技术发酵生产黄芩素或野黄芩素。该方案的原理为:利用相互作用的蛋白对(例如PDZ和PDZ ligand)与黄芩素合成途径中的酶PAL和4CL进行融合,使PAL和4CL能够在大肠杆菌体内进行自发组装,形成双酶复合反应器,从而提目标化合物的产量。
本发明人首次在合成黄岑类化合物/白杨素类化合物的原核表达系统中发现将PAL与4CL构建成复合体(复合反应器),可极为有效地提高表达系统的产量。适用于使得PAL与4CL构成有活性的复合体的任何生物材料或技术手段可被应用于本发明中。作为本发明的优选方式,所述的蛋白-蛋白相互作用结构域可包括选自下组的结构域:PDZ结构域,SH3结构域,WW结构域,LIM结构域,DD结构域,PH结构域,EH结构域。作为本发明的更优选的方式,所述的蛋白-蛋白相互作用结构域可包括选自下组的结构域:PDZ结构域,SH3结构域;它们的相应的配体为PDZ ligand(PDZlig)或SH3 ligand(SH3lig)。
蛋白质-蛋白质相互作用主要由蛋白质结构域来高效介导。PDZ、SH3、WW等结构域可通过一个或多个识别“口袋”来识别和结合配体蛋白的一段保守的短肽序列。就PDZ结构域而言,它通常结合配体蛋白C末端4-5个氨基酸残基,其也能够结合配体蛋白的中间序列,与自身或其他结构域聚合,或与膜上的脂类结合。
本发明中,也可包括其它的将PAL与4CL构成复合体、且能保留所述PAL与4CL的生物学活性的方法,例如将它们进行融合,构成具有适合的空间结构的融合蛋白;可通过实验测试来确定融合蛋白的活性。PAL与4CL之间的融合可以是直接连接,也可以利用连接子(Linker)来进行连接。
作为本发明的改造方案的另一个方面,本发明人在原核表达系统中过表达aroG、特别是其aroGfbr,以及pheA、特别是其pheAfbr基因,构建获得高产苯丙氨酸原核表达系统,在该原核表达系统中引入外源的黄芩素或野黄芩素类化合物/白杨素类化合物合成途径,使该菌株能够利用葡萄糖从头合成黄芩素化合物/白杨素类化合物。
常用的原核表达系统包括大肠杆菌、枯草杆菌等;例如可为大肠杆菌细胞(E.coli),如大肠杆菌BL21(DE3)。
在上述优选的蛋白(包括上述野生型的蛋白,突变型的蛋白)的基础上,本发明还包括它们的类似物。这些类似物与天然蛋白的差别可以是氨基酸序列上的差异,也可以是不影响序列的修饰形式上的差异,或者兼而有之。这些蛋白包括天然或诱导的遗传变异体。诱导变异体可以通过各种技术得到,如通过辐射或暴露于诱变剂而产生随机诱变,还可通过定点诱变法或其他已知分子生物学的技术。类似物还包括具有不同于天然L-氨基酸的残基(如D-氨基酸)的类似物,以及具有非天然存在的或合成的氨基酸(如β、γ-氨基酸)的类似物。应理解,本发明的蛋白并不限于上述例举的代表性的蛋白。
在上述优选的蛋白(包括上述野生型的蛋白,突变型的蛋白)的基础上,本发明还包括与所述的蛋白同源性高(比如与所列举的具体蛋白序列的同源性为70%或更高;优选地同源性为80%或更高;更优选地同源性为90%或更高,如同源性95%,98%或99%)的、且具有相应多肽相同功能的蛋白也包括在本发明内。
本发明中列举了来自特定物种的蛋白或基因。应理解,虽然本发明中优选研究了获自特定物种的蛋白或基因,但是获自其它物种的与所述蛋白或基因高度同源(如具有60%以上,如70%,80%,85%、90%、95%、甚至98%序列相同性)的其它蛋白或基因也在本发明考虑的范围之内。
发明还涉及本发明还提供了编码本发明的蛋白或其保守性变异蛋白的多核苷酸序列。本发明的多核苷酸可以是DNA形式或RNA形式。DNA形式包括cDNA、基因组DNA或人工合成的DNA。DNA可以是单链的或是双链的。DNA可以是编码链或非编码链。编码本发明的突变体成熟蛋白的多核苷酸包括:只编码成熟蛋白的编码序列;成熟蛋白的编码序列和各种附加编码序列;成熟蛋白的编码序列(和任选的附加编码序列)以及非编码序列。
本发明还包括针对所述基因的序列,进行密码子优化后形成的多核苷酸序列,例如,根据宿主细胞的偏好进行密码子优化。
本发明中,还构建了高产黄芩素或野黄芩素类化合物的工程菌株,其中包括外源的下组酶的编码基因:F6H,CPR,PAL、4CL、CHS、CHI和FNSI;且所述酶被表达后,PAL和4CL构成复合体(复合反应器)。培养该重组菌株,并以苯丙氨酸或酪氨酸为底物,生产黄芩素或野黄芩素类化合物。以苯丙氨酸或酪氨酸为底物的生产,适合于规模化的化合物生产。
本发明中,还构建了高产白杨素类化合物的工程菌株,其中包括外源的下组酶的编码基因:PAL、4CL、CHS、CHI和FNSI;且所述酶被表达后,PAL和4CL构成复合体(复合反应器)。培养该重组菌株,并以苯丙氨酸或酪氨酸为底物,生产黄芩素或野黄芩素类化合物。以苯丙氨酸或酪氨酸为底物的生产,适合于规模化的化合物生产。
为本发明的优选方式,所述F6H还包括与之融合的多肽标签,所述的多肽标签例如选自:8RP,Sumo,MBP,2B1,或它们的组合;较佳地为2B1。所述的多肽标签与所述F6H之间,可以包含或不包含连接肽,所述的连接肽不影响两者的生物学活性。F6H于2B1连接,可获得一种改进的F6H突变体2B1trF6H。
还可以在上述的工程菌株中,进一步引入上述底物(苯丙氨酸或酪氨酸)的上游生成途径,例如包括:由葡萄糖或甘油通过糖酵解、磷酸戊糖途径、莽草酸途径生成苯丙氨酸或酪氨酸。应理解,基于此类途径来形成苯丙氨酸或酪氨酸的方案也包含在本发明中。通过本领域已知手段来加强所述形成苯丙氨酸或酪氨酸途径的方法可包含在本发明中。
作为一种优选方式,可以在上述以苯丙氨酸或酪氨酸为底物的工程菌株中,进一步引入外源的aroG、特别是其aroGfbr,以及pheA、特别是其pheAfbr,获得另一种重组菌株,该菌株能够以葡萄糖为底物,生产黄芩素类化合物/白杨素类化合物。以葡萄糖为底物的生产,成本低廉,非常适合于规模化的化合物生产。
在建立如本发明优化的表达系统以及利用其进行生产的基础上,本领域人员还可系统研提高黄芩素、野黄芩素类化合物或白杨素类化合物产量的一系列因素,包括基因的效率和适宜性、基因剂量和培养基。此外,也可通过扩大生产规模来提高目标化合物的产量。例如,在摇瓶规模、简单培养条件下的产量基础上,当进一步扩大生产规模、进行培养基流加方案(可以源源不断提供充沛的底物)或给予良好的发酵罐水平生产条件(如温度的优化控制、溶氧的优化控制等)时,其产量通常可增加2~1000倍。这些操作和优化方式也应被包含在本发明中。可以预期,本发明的重组原核细胞,在一些优化的设备和操作工艺中,目标产物的量会发生长足的增长。
在获得了发酵产物后,从发酵产物中提取目标化合物可以采用本领域已知的技术。可以采用一些公知技术如高效液相色谱来对产物进行分析鉴定,以确定获得了所需的化合物。
本发明的菌株稳定性好,并可实现在生物反应器中规模性培养及生产黄芩素或野黄芩素类化合物/白杨素类化合物。本发明优选的菌株的目标化合物得率非常高。
相对于传统的植物提取手段,微生物发酵具有速度快、受外界因素影响较小等优势;部分化合物通过微生物合成的产量远高于植物提取,已经成为天然产物获得的一种重要手段。本发明中,通过大肠杆菌生产黄芩素或野黄芩素类化合物/白杨素类化合物,实现目标化合物更经济、更方便的制造。
本发明还提供了用于生产黄芩素或野黄芩素类化合物工程菌株的试剂盒。此外,其中还可包括原核细胞的培养基,用于合成的底物如苯丙氨酸、酪氨酸或葡萄糖,黄芩素或野黄芩素类化合物分离或检测试剂。更佳地,所述试剂盒中还可包括说明进行生物合成黄层素的方法的使用说明书等。
本发明还提供了用于构建所述生产黄芩素或野黄芩素类化合物/白杨素类化合物,工程菌株的试剂盒,所述试剂盒中可包括一系列构建体,例如可参考本发明的实施例中所提供的构建体,也可以为含有所述基因但基因排列或串联方式不同的其它构建体。表达载体(表达构建物)的建立可以采用本领域技术人员熟悉的技术。在得知了所需选择的酶以及所需表达的细胞体系之后,本领域技术人员可以进行表达构建物的建立。基因序列可以被插入到不同的表达构建物(如表达载体)中,也可以被插入到同一表达构建物中,只要在转入到细胞后其编码的多肽能够被有效地表达和发挥活性即可。所述试剂盒中还可包括原核细胞,原核细胞的培养基,用于合成的底物如苯丙氨酸、酪氨酸或葡萄糖,黄芩素或野黄芩素类化合物分离或检测试剂。更佳地,所述试剂盒中还可包括说明进行生物合成黄芩素或野黄芩素的方法的使用说明书等。
下面结合具体实施例,进一步阐述本发明。应理解,这些实施例仅用于说明本发明而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法,通常按照常规条件如J.萨姆布鲁克等编著,分子克隆实验指南,第三版,科学出版社,2002中所述的条件,或按照制造厂商所建议的条件。
1.实验材料
多聚酶链式反应(PCR)胶回收试剂盒,质粒抽提试剂盒均为美国Axygen产品;聚合酶链式反应(PCR)高保真酶PrimeSTAR Max DNA Polymerase为日本宝生物公司(TAKARA)产品;限制性内切酶均为NEB产品。
标准品化合物黄芩素和野黄芩素购自上海源叶生物科技有限公司。其他试剂为国产分析纯或色谱纯试剂,购自国药集团化学试剂有限公司。
PCR使用Arktik Thermal Cycler(Thermo Fisher Scientific);恒温培养使用ZXGP-A2050恒温培养箱和ZWY-211G恒温培养振荡器;离心使用5418R高速冷冻式离心机和5418小型离心机(Eppendorf)。真空浓缩使用Concentrator plus浓缩仪(Eppendorf);OD600使用UV-1200紫外可见分光光度计检测(上海美谱达仪器有限公司)。旋转蒸发系统由IKA RV 10digital旋转蒸发仪(IKA)和MZ 2C NT化学隔膜泵、CVC3000真空控制器(vacuubrand)组成。高效液相色谱使用Dionex UltiMate 3000液相色谱系统(ThermoFisher Scientific)。
2、本发明所涉及的菌株及质粒
大肠杆菌DH10B用于基因克隆,大肠杆菌BL21(DE3)菌株用于蛋白表达和黄芩素、野黄芩素的生产。
pCDFDuet-1、pETDuet-1、pACYCDuet-1载体用于代谢途径基因装配。
5nm rigid linker ER/K序列:KAKLKEEEERKQREEEERIKRLEELAKRKEEERKGT。
3、酶的选择
野生型的上述蛋白或基因均为本领域已经鉴定的,因此,可以从公众途径获得和制备,具体如下:
PAL:来源于红景天(Rhodotorula toruloides),其具有GenBank登录号AAA33883.1所示的序列(RtPAL);
4CL:来源于欧芹(Petroselium crispum),其具有GenBank登录号KF765780.1所示的序列(Pc4CL);
CHS:来源于矮牵牛(Petunia X hybrida),其具有GenBank登录号KF765781.1所示的序列;
CHI基因:来源于苜蓿(Medicago sativa),其具有GenBank登录号KF765782.1所示的序列;
FNS I:来源于欧芹(Petroselium crispum),其具有Swiss-Prot登录号Q7XZQ8.1所示的序列;
F6H:来源于黄岑(Scutellaria baicalensis),其具有GenBank登录号ASW21050.1所示的序列;
CPR:来自于拟南芥(Arabidopsis thaliana),其具有GenBank登录号NP_849472.2所示的序列。
PDZ结构域:来自于小鼠Mouseα-syntrophin(syn),77-171氨基酸序列,其具有GenBank登录号EDL06069所示的序列。
matB基因:来源于豆科根瘤菌(Rhizobium leguminosarum),其具有GenBank登录号AGZ04579.1所示的序列。
matC基因:来源于豆科根瘤菌(Rhizobium leguminosarum),其具有GenBank登录号KF765784.1所示的序列。
ACS基因:来源于大肠杆菌(Escherichia coli),其具有GenBank登录号CP062211.1所示的序列。
PDZ ligand:序列为GVKESLV(SEQ ID NO:12)。
SH3结构域:AEYVRALFDFNGNDEEDLPFKKGDILRIRDKPEEQWWNAEDSEGKRGMIPVPYVEKY(SEQ ID NO:13)。
SH3lig:PPPALPPKRRR(SEQ ID NO:14)。
4、质粒的构建
构建含单个基因的质粒
(1)PAL和PDZ的融合蛋白质粒的构建
首先在PAL基因N端增加NcoI酶切位点,PDZ序列C端增加EcoRI酶切位点,利用Over-Lap PCR方法将PAL,5nm rigid linker ER/K,PDZ进行基因融合,载体骨架选择pCDFDuet-1,将Over-Lap PCR后得到的融合基因PAL-ER/K-PDZ连接到pCDFDuet-1的NcoI和EcoRI位点,得到pCDFDuet1-T7 PAL-ER/K-PDZ。
(2)4CL和PDZ ligand的融合蛋白质粒的构建
将PDZ ligand序列设计在上游引物中,N端增加NcoI酶切位点,4CL序列C端增加BamHI酶切位点,PCR得到PDZlig-4CL融合基因,将融合基因构建到pCDFDuet-1的NcoI和BamHI位点,得到pCDFDuet1-T7 PDZlig-4CL。
(3)4CL和PDZ ligand的融合蛋白质粒的构建
将PDZ ligand序列设计在下游引物中,4CL N端增加NcoI酶切位点,PDZ ligand C端增加BamHI酶切位点,PCR得到4CL-PDZlig融合基因,将融合基因构建到pCDFDuet-1的NcoI和BamHI位点,得到pCDFDuet1-T7 4CL-PDZlig。
(4)pheA基因从BL21(DE3)基因组克隆得到,在pheA基因N端增加NcoI酶切位点,C端增加BamHI酶切位点。利用PCR引物,将pheA基因的976位由A突变为C得到pheAfbr,载体骨架选择pETDuet-1,将pheAfbr基因连接到pCDFDuet-1的NcoI和BamHI位点,得到pCDFDuet1-T7 pheAfbr
(5)aroG基因从BL21(DE3)基因组克隆得到,在aroG基因N端增加NcoI酶切位点,C端增加BamHI酶切位点。利用PCR引物,将aroG基因的436位由G突变为A得到aroGfbr,载体骨架选择pETDuet-1,将aroGfbr基因连接到pCDFDuet-1的NcoI和BamHI位点,得到pCDFDuet1-T7 aroGfbr
5、构建携带多基因的质粒
pYH57(pCDFDuet1-T74CL-T7PAL-T7FNSI-T7CHS-T7CHI)的建立:参见引用文献1:Production of plant-specific flavones baicalein and scutellarein in anengineered E.coli from available phenylalanine and tyrosine。
pYH66(pETDuet1-T72B1trF6H-T7CPR)的建立:参见引用文献1:Production ofplant-specific flavones baicalein and scutellarein in an engineered E.colifrom available phenylalanine and tyrosine。
pYH38(pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF)的建立:利用PCR得到matC基因和matB基因;在matC基因N端加上NcoI酶切位点,C端加上HindIII酶切位点,将matC基因连到pACYCDuet1的NcoI和HindIII位点(该位点前具有载体自带的T7启动子),获得T7matC。在T7matC基因位点N端加上HindIII位点,C端加上Acc65I位点,利用PCR得到T7matC基因,将T7matC基因连到pACYCDuet1的HindIII和Acc65I位点。在T7ACS基因N端加上Acc65I位点,C端加上NotI位点,利用PCR得到T7ACS基因,将T7ACS基因连接到pACYCDuet1的Acc65I和NotI位点。在T7FabF基因的N端加上NotI位点,在C端加上XbaI位点,PCR得到T7 FabF基因,将T7FabF基因连接到pACYCDuet1的NotI和XbaI位点。得到pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF质粒。
pZZ12(pCDFDuet1-T7-CHI-FNSI)的建立:在CHI基因N端加上NcoI位点,C端加上HindIII位点,PCR得到CHI基因,将CHI基因连接到pCDFDuet1的NcoI和HindIII位点。在FNSI基因N端加上HindIII位点,C端加上EcoRI位点,PCR得到FNSI基因,连接到pCDFDuet1的HindIII和EcoRI位点。得到pCDFDuet1-T7-CHI-FNSI。
pZZ22(pET28a-SH3lig-T7-4CL-PAL-ER/K-SH3-CHS)的建立:在SH3lig基因N端加上NcoI位点,4CL基因C端加上BamHI位点,以pET28a为载体,利用Over-lap PCR将SH3lig和4CL融合得到融合基因SH3lig-4CL,将SH3lig-4CL连接到pET28a的NcoI和BamHI位点。Over-Lap PCR得到融合基因PAL-ER/K,在PAL-ER/K基因N端加上BamHI位点,SH3基因C端加上EcoRI位点,利用Over-lap PCR将PAL-ER/K基因和SH3基因融合,得到PAL-ER/K-SH3基因,将PAL-ER/K-SH3连接到pET28a的BamHI和EcoRI位点。在CHS基因N端加上EcoRI位点,C端加上SalI位点,将CHS连接到pET28a的EcoRI和SalI位点。得到质粒pZZ22(pET28a-SH3lig-T7-4CL-PAL-ER/K-SH3-CHS)。
pZZ23(pET28a-T7-4CL-PAL-ER/K-SH3-CHS)的建立:在4CL基因N端加上NcoI位点,4CL基因C端加上BamHI位点,以pET28a为载体,将4CL连接到pET28a的NcoI和BamHI位点。在PAL-ER/K基因N端加上BamHI位点,SH3基因C端加上EcoRI位点,利用Over-Lap PCR将PAL-ER/K基因和SH3基因融合,得到PAL-ER/K-SH3基因,将PAL-ER/K-SH3连接到pET28a的BamHI和EcoRI位点。在CHS基因N端加上EcoRI位点,C端加上SalI位点,将CHS连接到pET28a的EcoRI和SalI位点。得到质粒pZZ23(pET28a-T7-4CL-PAL-ER/K-SH3-CHS)。
pZZ41(pCDFDuet1-T7PDZlig-4CL-T7PAL-ER/K-PDZ-T7FNSI-T7CHS-T7CHI)的建立:将4(1)中构建的PAL-ER/K-PDZ融合基因N端增加BamHI酶切位点,C端增加EcoRI酶切位点,插入到pCDFDuet1中,获得pCDFDuet1-T7PAL-ER/K-PDZ。以pCDFDuet1-T7PAL-ER/K-PDZ为模板,PCR后将基因PAL-ER/K-PDZ构建到pCDFDuet1-T7PDZlig-4CL质粒的BamHI和EcoRI位点,得到pCDFDuet1-T7PDZlig-4CL-T7PAL-ER/K-PDZ。将T7FNSI-T7CHS-T7CHI基因利用HindIII和AvrII双酶切从pYH57质粒上酶切下来,连接到上述质粒的HindIII和AvrII酶切位点得到pZZ41(pCDFDuet1-T7PDZlig-4CL-T7PAL-ER/K-PDZ-T7FNSI-T7CHS-T7CHI)质粒。
pZZ52(pETDuet1-T7pheAfbr-T7aroGfbr)的建立:在pCDFDuet1-T7 aroGfbr质粒T7启动子前设计引物,增加BamHI酶切位点,C端增加EcoRI酶切位点,以pETDuet1-T7aroGfbr为模板,PCR得到T7aroGfbr片段,将该片段连接到pCDFDuet1-T7pheAfbr质粒的BamHI和EcoRI位点,得到pZZ52(pETDuet1-T7pheAfbr-T7aroGfbr)质粒。
pZZ55(pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF-T7pheAfbr-T7aroGfbr)的建立:以pZZ52(pETDuet1-T7pheAfbr-T7aroGfbr)质粒为模板,T7启动子上游设计引物增加AvrII质粒,在C端增加AvrII酶切位点,PCR克隆得到T7pheAfbr-T7aroGfbr片段,利用一步克隆试剂盒将T7pheAfbr-T7aroGfbr连接到pYH38(pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF)的AvrII位点,得到pZZ55(pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF-T7pheAfbr-T7aroGfbr)质粒。
质粒详细信息见表1,细胞信息见表2,质粒构建示意图见图1~图2。
表1
Figure BDA0002884548880000141
表2
Figure BDA0002884548880000142
Figure BDA0002884548880000151
5、大肠杆菌摇瓶发酵黄芩素和野黄芩素
发酵菌株的准备:构建好的质粒转化大肠杆菌BL21(DE3),37℃倒置培养12h后挑阳性克隆于2mL LB抗性的培养基中,37℃,250rpm培养10h制备发酵种子菌,发酵工程菌的详细信息见表2。
种子液1%转接至10mL含2%葡萄糖的M9培养基中(培养基中加入相应的抗生素),37℃,250rpm培养至菌株OD=0.5,加入0.1mM的IPTG及500mg/L的苯丙氨酸,22℃,220rpm发酵3天,取样1mL,菌液超声破碎3次,等体积乙酸乙酯混匀萃取两次,12000rpm,2min离心转移有机相至新管,室温或30℃旋转蒸干后,加200μL甲醇复溶(浓缩5倍)充分混匀,12000rpm,2min后转移上清HPLC检测。
6、HPLC检测
液相检测条件:A相:0.1%甲酸水,B相:乙腈;分离条件:0-20min 20%B相-55%B相,20-22min 55%B相-100%B相,22-27min 100%B相,27-35min 100%B相-20%B相,35-40min,20%B相;检测波长:340nm,柱温:30℃。
色谱柱:Thermo syncronis C18反相柱(250mm×4.6mm,5μm)。
实施例
实施例1、蛋白的优化改造
1、PAL蛋白序列的优化改造
来自红景天(Rhodotorula toruloides)的PAL(RtPAL)长度为693aa(GenBank登录号AAA33883.1),具体序列如下(SEQ ID NO:1):
MAPRPTSQSQARTCPTTQVTQVDIVEKMLAAPTDSTLELDGYSLNLGDVVSAARKGRPVRVKDSDEIRSKIDKSVEFLRSQLSMSVYGVTTGFGGSADTRTEDAISLQKALLEHQLCGVLPSSFDSFRLGRGLENSLPLEVVRGAMTIRVNSLTRGHSAVRLVVLEALTNFLNHGITPIVPLRGTISASGDLSPLSYIAAAISGHPDSKVHVVHEGKEKILYAREAMALFNLEPVVLGPKEGLGLVNGTAVSASMATLALHDAHMLSLLSQSLTAMTVEAMVGHAGSFHPFLHDVTRPHPTQIEVAGNIRKLLEGSRFAVHHEEEVKVKDDEGILRQDRYPLRTSPQWLGPLVSDLIHAHAVLTIEAGQSTTDNPLIDVENKTSHHGGNFQAAAVANTMEKTRLGLAQIGKLNFTQLTEMLNAGMNRGLPSCLAAEDPSLSYHCKGLDIAAAAYTSELGHLANPVTTHVQPAEMANQAVNSLALISARRTTESNDVLSLLLATHLYCVLQAIDLRAIEFEFKKQFGPAIVSLIDQHFGSAMTGSNLRDELVEKVNKTLAKRLEQTNSYDLVPRWHDAFSFAAGTVVEVLSSTSLSLAAVNAWKVAAAESAISLTRQVRETFWSAASTSSPALSYLSPRTQILYAFVREELGVKARRGDVFLGKQEVTIGSNVSKIYEAIKSGRINNVLLKMLA*
改造1:本发明人针对SEQ ID NO:1进行序列改造,氨基酸,在C端加上5nm rigidlinker ER/K连接子,获得改进的PAL突变体PAL-ER/K,具体序列如下(SEQ ID NO:2):
Figure BDA0002884548880000161
改造2:本发明人针对SEQ ID NO:2进行序列改造,再在C端加上PDZ的氨基酸序列,获得改进的PAL-ER/K突变体PAL-ER/K-PDZ,具体序列如下(SEQ ID NO:3):
Figure BDA0002884548880000162
Figure BDA0002884548880000171
其中,PAL位于SEQ ID NO:3的第1~693位;ER/K位于SEQ ID NO:3的第694~729位;PDZ位于SEQ ID NO:3的第730~824位。
改造3:本发明人针对SEQ ID NO:2进行序列改造,再在C端加上SH3的氨基酸序列,获得改进的PAL-ER/K突变体PAL-ER/K-SH3,具体序列如下(SEQ ID NO:4):
Figure BDA0002884548880000172
其中,PAL位于SEQ ID NO:4的第1~693位;ER/K位于SEQ ID NO:3的第694~729位;PDZ位于SEQ ID NO:3的第730~786位。
2、4CL蛋白序列的优化改造
来自欧芹(Petroselium crispum)的4CL(Pc4CL)长度为544aa(GenBank登录号KF765780.1),具体序列如下(SEQ ID NO:5):
MGDCVAPKEDLIFRSKLPDIYIPKHLPLHTYCFENISKVGDKSCLINGATGETFTYSQVELLSRKVASGLNKLGIQQGDTIMLLLPNSPEYFFAFLGASYRGAISTMANPFFTSAEVIKQLKASQAKLIITQACYVDKVKDYAAEKNIQIICIDDAPQDCLHFSKLMEADESEMPEVVINSDDVVALPYSSGTTGLPKGVMLTHKGLVTSVAQQVDGDNPNLYMHSEDVMICILPLFHIYSLNAVLCCGLRAGVTILIMQKFDIVPFLELIQKYKVTIGPFVPPIVLAIAKSPVVDKYDLSSVRTVMSGAAPLGKELEDAVRAKFPNAKLGQGYGMTEAGPVLAMCLAFAKEPYEIKSGACGTVVRNAEMKIVDPETNASLPRNQRGEICIRGDQIMKGYLNDPESTRTTIDEEGWLHTGDIGFIDDDDELFIVDRLKEIIKYKGFQVAPAELEALLLTHPTISDAAVVPMIDEKAGEVPVAFVVRTNGFTTTEEEIKQFVSKQVVFYKRIFRVFFVDAIPKSPSGKILRKDLRARIASGDLPK*
改造1:本发明人针对SEQ ID NO:5进行序列改造,去除其中第1位氨基酸,再在N端加上(GGGGS)2的氨基酸序列,获得改进的4CL突变体(GGGGS)2-4CL,具体序列如下(SEQ IDNO:6):
Figure BDA0002884548880000181
改造2:本发明人针对SEQ ID NO:6进行序列改造,再在N端加上PDZlig的氨基酸序列,再N端加上M氨基酸,获得改进的(GGGGS)2-4CL突变体PDZlig-(GGGGS)2-4CL,具体序列如下(SEQ ID NO:7):
Figure BDA0002884548880000182
其中,PDZlig位于SEQ ID NO:7的第1~8位;(GGGGS)2位于SEQ ID NO:7的第9~18位;4CL位于SEQ ID NO:7的第19~561位。
改造3:本发明人针对SEQ ID NO:5进行序列改造,在C端加上(GGGGS)2的氨基酸序列,获得改进的4CL突变体4CL-(GGGGS)2,具体序列如下(SEQ ID NO:8):
Figure BDA0002884548880000183
改造4:本发明人针对SEQ ID NO:8进行序列改造,在C端加上PDZlig的氨基酸序列,获得改进的4CL突变体4CL-(GGGGS)2-PDZlig,具体序列如下(SEQ ID NO:9):
Figure BDA0002884548880000191
其中,4CL位于SEQ ID NO:9的第1~544位;(GGGGS)2位于SEQ ID NO:10的第545~554位;PDZlig位于SEQ ID NO:9的第555~561位。
改造5:本发明人针对SEQ ID NO:6进行序列改造,再在N端加上SH3lig的氨基酸序列,再N端加上M氨基酸,获得改进的(GGGGS)2-4CL突变体SH3lig-(GGGGS)2-4CL(简称SH3lig-4CL),具体序列如下(SEQ ID NO:10):
Figure BDA0002884548880000192
其中,SH3lig位于SEQ ID NO:10的第1~12位;(GGGGS)2位于SEQ ID NO:10的第13~22位;4CL位于SEQ ID NO:10的第23~565位。
改造6:本发明人针对SEQ ID NO:8进行序列改造,在C端加上SH3lig的氨基酸序列,获得改进的4CL突变体4CL-(GGGGS)2-SH3lig,具体序列如下(SEQ ID NO:11):
Figure BDA0002884548880000193
Figure BDA0002884548880000201
其中,4CL位于SEQ ID NO:11的第1~544位;(GGGGS)2位于SEQ ID NO:10的第545~554位;SH3lig位于SEQ ID NO:10的第555~565位。
实施例2、发酵工程菌检测白杨素
1、非自组装菌株(JH-0)
将pYH57(pCDFDuet1-T74CL-T7PAL-T7FNSI-T7CHS-T7CHI)质粒转化入BL21(DE3),得到工程菌DN-1,用于以苯丙氨酸为前体,发酵白杨素。
2、4CL-改造2、PAL-改造2菌株(DN-0)
将pZZ41(pCDFDuet1-T7PDZlig-4CL-T7PAL-ER/K-PDZ-T7FNSI-T7CHS-T7CHI)质粒转化入BL21(DE3),得到自组装工程菌DN-0,用于以苯丙氨酸为前体,发酵白杨素。
发酵方法如下:菌株LB固体培养基(壮观霉素80μg/mL)37℃培养过夜。挑取单个克隆到2mL LB液体培养基(壮观霉素80μg/mL),转接过夜培养的菌液到新的10mL M9Y液体抗性培养基中37℃,250r/min培养至OD600=0.5~0.6,水浴降温至16℃左右,然后加入诱导剂IPTG至终浓度0.2mM,加入终浓度为500mg/L经灭菌的苯丙氨酸并转至22℃低温诱导培养,在摇床转速220r/min条件下继续培养72h。取样检测黄芩素产量。
结果显示,与非自组装的菌株JH-0相比,自组装菌株DN-0的白杨素产量,提高到1.6倍(图10)。
实施例3、发酵工程菌检测黄芩素
本实施例中,以苯丙氨酸为前体,进行发酵,合成途径如图8。
1、非自组装菌株(DN-1)
将pYH57(pCDFDuet1-T74CL-T7PAL-T7FNSI-T7CHS-T7CHI)质粒,pYH66(pETDuet1-T72B1-trF6H-T7CPR)质粒,pYH38(pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF)质粒共同转化入BL21(DE3),得到工程菌DN-1,用于以苯丙氨酸为前体,发酵黄芩素。
2、4CL-改造2、PAL-改造2菌株(DN-2)
将pZZ41(pCDFDuet1-T7PDZlig-4CL-T7PAL-ER/K-PDZ-T7FNSI-T7CHS-T7CHI)质粒,pYH66(pETDuet1-T72B1-trF6H-T7CPR)质粒,pYH38(pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF)质粒共同转化入BL21(DE3),得到自组装工程菌DN-2,用于以苯丙氨酸为前体发酵黄芩素。
3、4CL无改造、PAL-改造3菌株(DN-3,作为对照菌株)
将pZZ23(pET28a-T7-4CL-PAL-ER/K-SH3-CHS)质粒,pZZ12(pCDFDuet1-CHI-FNSI)质粒,质粒共同转化入BL21(DE3),得到对照工程菌DN-3,以苯丙氨酸为前体,用于以苯丙氨酸为前体发酵黄芩素。
4、4CL-改造5、PAL-改造3菌株(DN-4)
将pZZ22(pET28a-SH3lig-T7-4CL-PAL-ER/K-SH3-CHS)质粒,pZZ12(pCDFDuet1-T7-CHI-FNSI)质粒,质粒共同转化入BL21(DE3),得到自组装工程菌DN-4,用于以苯丙氨酸为前体发酵黄芩素。
上述四种菌株进行发酵,发酵方法如下:菌株LB固体培养基(壮观霉素80μg/mL,氨苄青霉素100μg/mL,氯霉素34μg/mL)37℃培养过夜。挑取单个克隆到2mL LB液体培养基(壮观霉素80μg/mL,氨苄青霉素100μg/mL,氯霉素34μg/mL),转接过夜培养的菌液到新的10mLM9Y液体抗性培养基中37℃,250r/min培养至OD600=0.5~0.6,水浴降温至16℃左右,然后加入诱导剂IPTG至终浓度0.2mM,加入终浓度为500mg/L经灭菌的苯丙氨酸并转至22℃低温诱导培养,在摇床转速220r/min条件下继续培养72h。取样检测黄芩素产量。
结果显示,与非自组装的菌株DN-1相比,自组装菌株DN-2的黄芩素产量,提高6.6倍(图3)。HPLC检测图谱如图4。
结果显示,与对照菌株DN-3相比,自组装菌株DN-4的黄芩素产量,提高2.5倍(图5)。
根据上述结果,说明本发明的改造方案能够显著提高黄芩素产量,如表3。
表3
Figure BDA0002884548880000211
对于多种改造方案进行比较,本发明人优选PAL改造2、4CL改造2或PAL改造3、4CL改造5。
实施例4、发酵工程菌检测野黄芩素
本实施例中,以酪氨酸为前体,进行发酵。
如前一实施例获得非组装的工程菌DN-1,用于以酪氨酸为前体发酵黄芩素。
如前一实施例得到自组装工程菌DN-2,用于以酪氨酸为前体发酵黄芩素。
上述两种菌株,在LB固体培养基(壮观霉素80μg/mL,氨苄青霉素100μg/mL,氯霉素34μg/mL)37℃培养过夜。挑取单个克隆到2mL LB液体培养基(壮观霉素80μg/mL,氨苄青霉素100μg/mL,氯霉素34μg/mL),转接过夜培养的菌液到新的10mL M9Y液体抗性培养基中37℃,250r/min培养至OD600=0.5-0.6,水浴降温至16℃左右,然后加入诱导剂IPTG至终浓度0.2mM,加入终浓度为500mg/L经灭菌的酪氨酸并转至22℃低温诱导培养,在摇床转速220r/min条件下继续培养72h。
取样检测生长情况,野黄芩素产量。与菌株DN-1相比,自组装菌株DN-2的野黄芩素产量,提高1.43倍(图6)。
实施例5、从葡萄糖合成黄芩素
本实施例中,以葡萄糖为前体,进行发酵,合成途径如图9。
1、非自组装菌株(DN-5)
将pYH57(pCDFDuet1-T74CL-T7PAL-T7FNSI-T7CHS-T7CHI)质粒,pYH66(pETDuet1-T72B1trF6H-T7CPR)质粒,pZZ55(pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF-T7pheAfbr-T7aroGfbr)质粒共同转化入BL21(DE3),得到非自组装工程菌DN-5,用于从葡萄糖合成黄芩素。
2、4CL改造2,PAL改造2菌株(DN-5)
将pZZ41(pCDFDuet1-T7PDZlig-4CL-T7PAL-ER/K-PDZ-T7FNSI-T7CHS-T7CHI)质粒,pYH66(pETDuet1-T72B1trF6H-T7CPR)质粒,pZZ55(pACYCDuet1-T7matC-T7matB-T7ACS-T7FabF-T7pheAfbr-T7aroGfbr)质粒共同转化入BL21(DE3),得到采用PDZ和PDZlig互作方案的自组装工程菌DN-6,用于从葡萄糖合成黄芩素。
上述建立的两种菌株,LB固体培养基(壮观霉素80μg/mL,氨苄青霉素100μg/mL,氯霉素34μg/mL)37℃培养过夜。挑取单个克隆到2mL LB液体培养基(壮观霉素80μg/mL,氨苄青霉素100μg/mL,氯霉素34μg/mL),转接过夜培养的菌液到新的10mL M9Y液体抗性培养基中37℃,250r/min培养至OD600=0.5-0.6,水浴降温至16℃左右,然后加入诱导剂IPTG至终浓度0.2mM,并转至22℃低温诱导培养,在摇床转速220r/min条件下继续培养72h。
结果显示,非自组装工程菌DN-5能够从葡萄糖中合成黄芩素,自组装工程菌株DN-6能够从葡萄糖中合成黄芩素,自组装工程菌株DN-6比非自组装工程菌株DN-5提高了111.7%(图7)。
在本发明提及的所有文献都在本申请中引用作为参考,就如同每一篇文献被单独引用作为参考那样。此外应理解,在阅读了本发明的上述讲授内容之后,本领域技术人员可以对本发明作各种改动或修改,这些等价形式同样落于本申请所附权利要求书所限定的范围。
序列表
<110> 中国科学院分子植物科学卓越创新中心
<120> 异源合成黄酮类化合物的宿主细胞及其应用
<130> 209377
<160> 14
<170> SIPOSequenceListing 1.0
<210> 1
<211> 693
<212> PRT
<213> 红景天(Rhodotorula toruloides)
<400> 1
Met Ala Pro Arg Pro Thr Ser Gln Ser Gln Ala Arg Thr Cys Pro Thr
1 5 10 15
Thr Gln Val Thr Gln Val Asp Ile Val Glu Lys Met Leu Ala Ala Pro
20 25 30
Thr Asp Ser Thr Leu Glu Leu Asp Gly Tyr Ser Leu Asn Leu Gly Asp
35 40 45
Val Val Ser Ala Ala Arg Lys Gly Arg Pro Val Arg Val Lys Asp Ser
50 55 60
Asp Glu Ile Arg Ser Lys Ile Asp Lys Ser Val Glu Phe Leu Arg Ser
65 70 75 80
Gln Leu Ser Met Ser Val Tyr Gly Val Thr Thr Gly Phe Gly Gly Ser
85 90 95
Ala Asp Thr Arg Thr Glu Asp Ala Ile Ser Leu Gln Lys Ala Leu Leu
100 105 110
Glu His Gln Leu Cys Gly Val Leu Pro Ser Ser Phe Asp Ser Phe Arg
115 120 125
Leu Gly Arg Gly Leu Glu Asn Ser Leu Pro Leu Glu Val Val Arg Gly
130 135 140
Ala Met Thr Ile Arg Val Asn Ser Leu Thr Arg Gly His Ser Ala Val
145 150 155 160
Arg Leu Val Val Leu Glu Ala Leu Thr Asn Phe Leu Asn His Gly Ile
165 170 175
Thr Pro Ile Val Pro Leu Arg Gly Thr Ile Ser Ala Ser Gly Asp Leu
180 185 190
Ser Pro Leu Ser Tyr Ile Ala Ala Ala Ile Ser Gly His Pro Asp Ser
195 200 205
Lys Val His Val Val His Glu Gly Lys Glu Lys Ile Leu Tyr Ala Arg
210 215 220
Glu Ala Met Ala Leu Phe Asn Leu Glu Pro Val Val Leu Gly Pro Lys
225 230 235 240
Glu Gly Leu Gly Leu Val Asn Gly Thr Ala Val Ser Ala Ser Met Ala
245 250 255
Thr Leu Ala Leu His Asp Ala His Met Leu Ser Leu Leu Ser Gln Ser
260 265 270
Leu Thr Ala Met Thr Val Glu Ala Met Val Gly His Ala Gly Ser Phe
275 280 285
His Pro Phe Leu His Asp Val Thr Arg Pro His Pro Thr Gln Ile Glu
290 295 300
Val Ala Gly Asn Ile Arg Lys Leu Leu Glu Gly Ser Arg Phe Ala Val
305 310 315 320
His His Glu Glu Glu Val Lys Val Lys Asp Asp Glu Gly Ile Leu Arg
325 330 335
Gln Asp Arg Tyr Pro Leu Arg Thr Ser Pro Gln Trp Leu Gly Pro Leu
340 345 350
Val Ser Asp Leu Ile His Ala His Ala Val Leu Thr Ile Glu Ala Gly
355 360 365
Gln Ser Thr Thr Asp Asn Pro Leu Ile Asp Val Glu Asn Lys Thr Ser
370 375 380
His His Gly Gly Asn Phe Gln Ala Ala Ala Val Ala Asn Thr Met Glu
385 390 395 400
Lys Thr Arg Leu Gly Leu Ala Gln Ile Gly Lys Leu Asn Phe Thr Gln
405 410 415
Leu Thr Glu Met Leu Asn Ala Gly Met Asn Arg Gly Leu Pro Ser Cys
420 425 430
Leu Ala Ala Glu Asp Pro Ser Leu Ser Tyr His Cys Lys Gly Leu Asp
435 440 445
Ile Ala Ala Ala Ala Tyr Thr Ser Glu Leu Gly His Leu Ala Asn Pro
450 455 460
Val Thr Thr His Val Gln Pro Ala Glu Met Ala Asn Gln Ala Val Asn
465 470 475 480
Ser Leu Ala Leu Ile Ser Ala Arg Arg Thr Thr Glu Ser Asn Asp Val
485 490 495
Leu Ser Leu Leu Leu Ala Thr His Leu Tyr Cys Val Leu Gln Ala Ile
500 505 510
Asp Leu Arg Ala Ile Glu Phe Glu Phe Lys Lys Gln Phe Gly Pro Ala
515 520 525
Ile Val Ser Leu Ile Asp Gln His Phe Gly Ser Ala Met Thr Gly Ser
530 535 540
Asn Leu Arg Asp Glu Leu Val Glu Lys Val Asn Lys Thr Leu Ala Lys
545 550 555 560
Arg Leu Glu Gln Thr Asn Ser Tyr Asp Leu Val Pro Arg Trp His Asp
565 570 575
Ala Phe Ser Phe Ala Ala Gly Thr Val Val Glu Val Leu Ser Ser Thr
580 585 590
Ser Leu Ser Leu Ala Ala Val Asn Ala Trp Lys Val Ala Ala Ala Glu
595 600 605
Ser Ala Ile Ser Leu Thr Arg Gln Val Arg Glu Thr Phe Trp Ser Ala
610 615 620
Ala Ser Thr Ser Ser Pro Ala Leu Ser Tyr Leu Ser Pro Arg Thr Gln
625 630 635 640
Ile Leu Tyr Ala Phe Val Arg Glu Glu Leu Gly Val Lys Ala Arg Arg
645 650 655
Gly Asp Val Phe Leu Gly Lys Gln Glu Val Thr Ile Gly Ser Asn Val
660 665 670
Ser Lys Ile Tyr Glu Ala Ile Lys Ser Gly Arg Ile Asn Asn Val Leu
675 680 685
Leu Lys Met Leu Ala
690
<210> 2
<211> 729
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(729)
<223> PAL-ER/K
<400> 2
Met Ala Pro Arg Pro Thr Ser Gln Ser Gln Ala Arg Thr Cys Pro Thr
1 5 10 15
Thr Gln Val Thr Gln Val Asp Ile Val Glu Lys Met Leu Ala Ala Pro
20 25 30
Thr Asp Ser Thr Leu Glu Leu Asp Gly Tyr Ser Leu Asn Leu Gly Asp
35 40 45
Val Val Ser Ala Ala Arg Lys Gly Arg Pro Val Arg Val Lys Asp Ser
50 55 60
Asp Glu Ile Arg Ser Lys Ile Asp Lys Ser Val Glu Phe Leu Arg Ser
65 70 75 80
Gln Leu Ser Met Ser Val Tyr Gly Val Thr Thr Gly Phe Gly Gly Ser
85 90 95
Ala Asp Thr Arg Thr Glu Asp Ala Ile Ser Leu Gln Lys Ala Leu Leu
100 105 110
Glu His Gln Leu Cys Gly Val Leu Pro Ser Ser Phe Asp Ser Phe Arg
115 120 125
Leu Gly Arg Gly Leu Glu Asn Ser Leu Pro Leu Glu Val Val Arg Gly
130 135 140
Ala Met Thr Ile Arg Val Asn Ser Leu Thr Arg Gly His Ser Ala Val
145 150 155 160
Arg Leu Val Val Leu Glu Ala Leu Thr Asn Phe Leu Asn His Gly Ile
165 170 175
Thr Pro Ile Val Pro Leu Arg Gly Thr Ile Ser Ala Ser Gly Asp Leu
180 185 190
Ser Pro Leu Ser Tyr Ile Ala Ala Ala Ile Ser Gly His Pro Asp Ser
195 200 205
Lys Val His Val Val His Glu Gly Lys Glu Lys Ile Leu Tyr Ala Arg
210 215 220
Glu Ala Met Ala Leu Phe Asn Leu Glu Pro Val Val Leu Gly Pro Lys
225 230 235 240
Glu Gly Leu Gly Leu Val Asn Gly Thr Ala Val Ser Ala Ser Met Ala
245 250 255
Thr Leu Ala Leu His Asp Ala His Met Leu Ser Leu Leu Ser Gln Ser
260 265 270
Leu Thr Ala Met Thr Val Glu Ala Met Val Gly His Ala Gly Ser Phe
275 280 285
His Pro Phe Leu His Asp Val Thr Arg Pro His Pro Thr Gln Ile Glu
290 295 300
Val Ala Gly Asn Ile Arg Lys Leu Leu Glu Gly Ser Arg Phe Ala Val
305 310 315 320
His His Glu Glu Glu Val Lys Val Lys Asp Asp Glu Gly Ile Leu Arg
325 330 335
Gln Asp Arg Tyr Pro Leu Arg Thr Ser Pro Gln Trp Leu Gly Pro Leu
340 345 350
Val Ser Asp Leu Ile His Ala His Ala Val Leu Thr Ile Glu Ala Gly
355 360 365
Gln Ser Thr Thr Asp Asn Pro Leu Ile Asp Val Glu Asn Lys Thr Ser
370 375 380
His His Gly Gly Asn Phe Gln Ala Ala Ala Val Ala Asn Thr Met Glu
385 390 395 400
Lys Thr Arg Leu Gly Leu Ala Gln Ile Gly Lys Leu Asn Phe Thr Gln
405 410 415
Leu Thr Glu Met Leu Asn Ala Gly Met Asn Arg Gly Leu Pro Ser Cys
420 425 430
Leu Ala Ala Glu Asp Pro Ser Leu Ser Tyr His Cys Lys Gly Leu Asp
435 440 445
Ile Ala Ala Ala Ala Tyr Thr Ser Glu Leu Gly His Leu Ala Asn Pro
450 455 460
Val Thr Thr His Val Gln Pro Ala Glu Met Ala Asn Gln Ala Val Asn
465 470 475 480
Ser Leu Ala Leu Ile Ser Ala Arg Arg Thr Thr Glu Ser Asn Asp Val
485 490 495
Leu Ser Leu Leu Leu Ala Thr His Leu Tyr Cys Val Leu Gln Ala Ile
500 505 510
Asp Leu Arg Ala Ile Glu Phe Glu Phe Lys Lys Gln Phe Gly Pro Ala
515 520 525
Ile Val Ser Leu Ile Asp Gln His Phe Gly Ser Ala Met Thr Gly Ser
530 535 540
Asn Leu Arg Asp Glu Leu Val Glu Lys Val Asn Lys Thr Leu Ala Lys
545 550 555 560
Arg Leu Glu Gln Thr Asn Ser Tyr Asp Leu Val Pro Arg Trp His Asp
565 570 575
Ala Phe Ser Phe Ala Ala Gly Thr Val Val Glu Val Leu Ser Ser Thr
580 585 590
Ser Leu Ser Leu Ala Ala Val Asn Ala Trp Lys Val Ala Ala Ala Glu
595 600 605
Ser Ala Ile Ser Leu Thr Arg Gln Val Arg Glu Thr Phe Trp Ser Ala
610 615 620
Ala Ser Thr Ser Ser Pro Ala Leu Ser Tyr Leu Ser Pro Arg Thr Gln
625 630 635 640
Ile Leu Tyr Ala Phe Val Arg Glu Glu Leu Gly Val Lys Ala Arg Arg
645 650 655
Gly Asp Val Phe Leu Gly Lys Gln Glu Val Thr Ile Gly Ser Asn Val
660 665 670
Ser Lys Ile Tyr Glu Ala Ile Lys Ser Gly Arg Ile Asn Asn Val Leu
675 680 685
Leu Lys Met Leu Ala Lys Ala Lys Leu Lys Glu Glu Glu Glu Arg Lys
690 695 700
Gln Arg Glu Glu Glu Glu Arg Ile Lys Arg Leu Glu Glu Leu Ala Lys
705 710 715 720
Arg Lys Glu Glu Glu Arg Lys Gly Thr
725
<210> 3
<211> 824
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(786)
<223> PAL-ER/K-SH3
<400> 3
Met Ala Pro Arg Pro Thr Ser Gln Ser Gln Ala Arg Thr Cys Pro Thr
1 5 10 15
Thr Gln Val Thr Gln Val Asp Ile Val Glu Lys Met Leu Ala Ala Pro
20 25 30
Thr Asp Ser Thr Leu Glu Leu Asp Gly Tyr Ser Leu Asn Leu Gly Asp
35 40 45
Val Val Ser Ala Ala Arg Lys Gly Arg Pro Val Arg Val Lys Asp Ser
50 55 60
Asp Glu Ile Arg Ser Lys Ile Asp Lys Ser Val Glu Phe Leu Arg Ser
65 70 75 80
Gln Leu Ser Met Ser Val Tyr Gly Val Thr Thr Gly Phe Gly Gly Ser
85 90 95
Ala Asp Thr Arg Thr Glu Asp Ala Ile Ser Leu Gln Lys Ala Leu Leu
100 105 110
Glu His Gln Leu Cys Gly Val Leu Pro Ser Ser Phe Asp Ser Phe Arg
115 120 125
Leu Gly Arg Gly Leu Glu Asn Ser Leu Pro Leu Glu Val Val Arg Gly
130 135 140
Ala Met Thr Ile Arg Val Asn Ser Leu Thr Arg Gly His Ser Ala Val
145 150 155 160
Arg Leu Val Val Leu Glu Ala Leu Thr Asn Phe Leu Asn His Gly Ile
165 170 175
Thr Pro Ile Val Pro Leu Arg Gly Thr Ile Ser Ala Ser Gly Asp Leu
180 185 190
Ser Pro Leu Ser Tyr Ile Ala Ala Ala Ile Ser Gly His Pro Asp Ser
195 200 205
Lys Val His Val Val His Glu Gly Lys Glu Lys Ile Leu Tyr Ala Arg
210 215 220
Glu Ala Met Ala Leu Phe Asn Leu Glu Pro Val Val Leu Gly Pro Lys
225 230 235 240
Glu Gly Leu Gly Leu Val Asn Gly Thr Ala Val Ser Ala Ser Met Ala
245 250 255
Thr Leu Ala Leu His Asp Ala His Met Leu Ser Leu Leu Ser Gln Ser
260 265 270
Leu Thr Ala Met Thr Val Glu Ala Met Val Gly His Ala Gly Ser Phe
275 280 285
His Pro Phe Leu His Asp Val Thr Arg Pro His Pro Thr Gln Ile Glu
290 295 300
Val Ala Gly Asn Ile Arg Lys Leu Leu Glu Gly Ser Arg Phe Ala Val
305 310 315 320
His His Glu Glu Glu Val Lys Val Lys Asp Asp Glu Gly Ile Leu Arg
325 330 335
Gln Asp Arg Tyr Pro Leu Arg Thr Ser Pro Gln Trp Leu Gly Pro Leu
340 345 350
Val Ser Asp Leu Ile His Ala His Ala Val Leu Thr Ile Glu Ala Gly
355 360 365
Gln Ser Thr Thr Asp Asn Pro Leu Ile Asp Val Glu Asn Lys Thr Ser
370 375 380
His His Gly Gly Asn Phe Gln Ala Ala Ala Val Ala Asn Thr Met Glu
385 390 395 400
Lys Thr Arg Leu Gly Leu Ala Gln Ile Gly Lys Leu Asn Phe Thr Gln
405 410 415
Leu Thr Glu Met Leu Asn Ala Gly Met Asn Arg Gly Leu Pro Ser Cys
420 425 430
Leu Ala Ala Glu Asp Pro Ser Leu Ser Tyr His Cys Lys Gly Leu Asp
435 440 445
Ile Ala Ala Ala Ala Tyr Thr Ser Glu Leu Gly His Leu Ala Asn Pro
450 455 460
Val Thr Thr His Val Gln Pro Ala Glu Met Ala Asn Gln Ala Val Asn
465 470 475 480
Ser Leu Ala Leu Ile Ser Ala Arg Arg Thr Thr Glu Ser Asn Asp Val
485 490 495
Leu Ser Leu Leu Leu Ala Thr His Leu Tyr Cys Val Leu Gln Ala Ile
500 505 510
Asp Leu Arg Ala Ile Glu Phe Glu Phe Lys Lys Gln Phe Gly Pro Ala
515 520 525
Ile Val Ser Leu Ile Asp Gln His Phe Gly Ser Ala Met Thr Gly Ser
530 535 540
Asn Leu Arg Asp Glu Leu Val Glu Lys Val Asn Lys Thr Leu Ala Lys
545 550 555 560
Arg Leu Glu Gln Thr Asn Ser Tyr Asp Leu Val Pro Arg Trp His Asp
565 570 575
Ala Phe Ser Phe Ala Ala Gly Thr Val Val Glu Val Leu Ser Ser Thr
580 585 590
Ser Leu Ser Leu Ala Ala Val Asn Ala Trp Lys Val Ala Ala Ala Glu
595 600 605
Ser Ala Ile Ser Leu Thr Arg Gln Val Arg Glu Thr Phe Trp Ser Ala
610 615 620
Ala Ser Thr Ser Ser Pro Ala Leu Ser Tyr Leu Ser Pro Arg Thr Gln
625 630 635 640
Ile Leu Tyr Ala Phe Val Arg Glu Glu Leu Gly Val Lys Ala Arg Arg
645 650 655
Gly Asp Val Phe Leu Gly Lys Gln Glu Val Thr Ile Gly Ser Asn Val
660 665 670
Ser Lys Ile Tyr Glu Ala Ile Lys Ser Gly Arg Ile Asn Asn Val Leu
675 680 685
Leu Lys Met Leu Ala Lys Ala Lys Leu Lys Glu Glu Glu Glu Arg Lys
690 695 700
Gln Arg Glu Glu Glu Glu Arg Ile Lys Arg Leu Glu Glu Leu Ala Lys
705 710 715 720
Arg Lys Glu Glu Glu Arg Lys Gly Thr Leu Gln Arg Arg Arg Val Thr
725 730 735
Val Arg Lys Ala Asp Ala Gly Gly Leu Gly Ile Ser Ile Lys Gly Gly
740 745 750
Arg Glu Asn Lys Met Pro Ile Leu Ile Ser Lys Ile Phe Lys Gly Leu
755 760 765
Ala Ala Asp Gln Thr Glu Ala Leu Phe Val Gly Asp Ala Ile Leu Ser
770 775 780
Val Asn Gly Glu Asp Leu Ser Ser Ala Thr His Asp Glu Ala Val Gln
785 790 795 800
Ala Leu Lys Lys Thr Gly Lys Glu Val Val Leu Glu Val Lys Tyr Met
805 810 815
Lys Glu Val Ser Pro Tyr Phe Lys
820
<210> 4
<211> 786
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(786)
<223> PAL-ER/K-SH3
<400> 4
Met Ala Pro Arg Pro Thr Ser Gln Ser Gln Ala Arg Thr Cys Pro Thr
1 5 10 15
Thr Gln Val Thr Gln Val Asp Ile Val Glu Lys Met Leu Ala Ala Pro
20 25 30
Thr Asp Ser Thr Leu Glu Leu Asp Gly Tyr Ser Leu Asn Leu Gly Asp
35 40 45
Val Val Ser Ala Ala Arg Lys Gly Arg Pro Val Arg Val Lys Asp Ser
50 55 60
Asp Glu Ile Arg Ser Lys Ile Asp Lys Ser Val Glu Phe Leu Arg Ser
65 70 75 80
Gln Leu Ser Met Ser Val Tyr Gly Val Thr Thr Gly Phe Gly Gly Ser
85 90 95
Ala Asp Thr Arg Thr Glu Asp Ala Ile Ser Leu Gln Lys Ala Leu Leu
100 105 110
Glu His Gln Leu Cys Gly Val Leu Pro Ser Ser Phe Asp Ser Phe Arg
115 120 125
Leu Gly Arg Gly Leu Glu Asn Ser Leu Pro Leu Glu Val Val Arg Gly
130 135 140
Ala Met Thr Ile Arg Val Asn Ser Leu Thr Arg Gly His Ser Ala Val
145 150 155 160
Arg Leu Val Val Leu Glu Ala Leu Thr Asn Phe Leu Asn His Gly Ile
165 170 175
Thr Pro Ile Val Pro Leu Arg Gly Thr Ile Ser Ala Ser Gly Asp Leu
180 185 190
Ser Pro Leu Ser Tyr Ile Ala Ala Ala Ile Ser Gly His Pro Asp Ser
195 200 205
Lys Val His Val Val His Glu Gly Lys Glu Lys Ile Leu Tyr Ala Arg
210 215 220
Glu Ala Met Ala Leu Phe Asn Leu Glu Pro Val Val Leu Gly Pro Lys
225 230 235 240
Glu Gly Leu Gly Leu Val Asn Gly Thr Ala Val Ser Ala Ser Met Ala
245 250 255
Thr Leu Ala Leu His Asp Ala His Met Leu Ser Leu Leu Ser Gln Ser
260 265 270
Leu Thr Ala Met Thr Val Glu Ala Met Val Gly His Ala Gly Ser Phe
275 280 285
His Pro Phe Leu His Asp Val Thr Arg Pro His Pro Thr Gln Ile Glu
290 295 300
Val Ala Gly Asn Ile Arg Lys Leu Leu Glu Gly Ser Arg Phe Ala Val
305 310 315 320
His His Glu Glu Glu Val Lys Val Lys Asp Asp Glu Gly Ile Leu Arg
325 330 335
Gln Asp Arg Tyr Pro Leu Arg Thr Ser Pro Gln Trp Leu Gly Pro Leu
340 345 350
Val Ser Asp Leu Ile His Ala His Ala Val Leu Thr Ile Glu Ala Gly
355 360 365
Gln Ser Thr Thr Asp Asn Pro Leu Ile Asp Val Glu Asn Lys Thr Ser
370 375 380
His His Gly Gly Asn Phe Gln Ala Ala Ala Val Ala Asn Thr Met Glu
385 390 395 400
Lys Thr Arg Leu Gly Leu Ala Gln Ile Gly Lys Leu Asn Phe Thr Gln
405 410 415
Leu Thr Glu Met Leu Asn Ala Gly Met Asn Arg Gly Leu Pro Ser Cys
420 425 430
Leu Ala Ala Glu Asp Pro Ser Leu Ser Tyr His Cys Lys Gly Leu Asp
435 440 445
Ile Ala Ala Ala Ala Tyr Thr Ser Glu Leu Gly His Leu Ala Asn Pro
450 455 460
Val Thr Thr His Val Gln Pro Ala Glu Met Ala Asn Gln Ala Val Asn
465 470 475 480
Ser Leu Ala Leu Ile Ser Ala Arg Arg Thr Thr Glu Ser Asn Asp Val
485 490 495
Leu Ser Leu Leu Leu Ala Thr His Leu Tyr Cys Val Leu Gln Ala Ile
500 505 510
Asp Leu Arg Ala Ile Glu Phe Glu Phe Lys Lys Gln Phe Gly Pro Ala
515 520 525
Ile Val Ser Leu Ile Asp Gln His Phe Gly Ser Ala Met Thr Gly Ser
530 535 540
Asn Leu Arg Asp Glu Leu Val Glu Lys Val Asn Lys Thr Leu Ala Lys
545 550 555 560
Arg Leu Glu Gln Thr Asn Ser Tyr Asp Leu Val Pro Arg Trp His Asp
565 570 575
Ala Phe Ser Phe Ala Ala Gly Thr Val Val Glu Val Leu Ser Ser Thr
580 585 590
Ser Leu Ser Leu Ala Ala Val Asn Ala Trp Lys Val Ala Ala Ala Glu
595 600 605
Ser Ala Ile Ser Leu Thr Arg Gln Val Arg Glu Thr Phe Trp Ser Ala
610 615 620
Ala Ser Thr Ser Ser Pro Ala Leu Ser Tyr Leu Ser Pro Arg Thr Gln
625 630 635 640
Ile Leu Tyr Ala Phe Val Arg Glu Glu Leu Gly Val Lys Ala Arg Arg
645 650 655
Gly Asp Val Phe Leu Gly Lys Gln Glu Val Thr Ile Gly Ser Asn Val
660 665 670
Ser Lys Ile Tyr Glu Ala Ile Lys Ser Gly Arg Ile Asn Asn Val Leu
675 680 685
Leu Lys Met Leu Ala Lys Ala Lys Leu Lys Glu Glu Glu Glu Arg Lys
690 695 700
Gln Arg Glu Glu Glu Glu Arg Ile Lys Arg Leu Glu Glu Leu Ala Lys
705 710 715 720
Arg Lys Glu Glu Glu Arg Lys Gly Thr Ala Glu Tyr Val Arg Ala Leu
725 730 735
Phe Asp Phe Asn Gly Asn Asp Glu Glu Asp Leu Pro Phe Lys Lys Gly
740 745 750
Asp Ile Leu Arg Ile Arg Asp Lys Pro Glu Glu Gln Trp Trp Asn Ala
755 760 765
Glu Asp Ser Glu Gly Lys Arg Gly Met Ile Pro Val Pro Tyr Val Glu
770 775 780
Lys Tyr
785
<210> 5
<211> 544
<212> PRT
<213> 欧芹(Petroselium crispum)
<400> 5
Met Gly Asp Cys Val Ala Pro Lys Glu Asp Leu Ile Phe Arg Ser Lys
1 5 10 15
Leu Pro Asp Ile Tyr Ile Pro Lys His Leu Pro Leu His Thr Tyr Cys
20 25 30
Phe Glu Asn Ile Ser Lys Val Gly Asp Lys Ser Cys Leu Ile Asn Gly
35 40 45
Ala Thr Gly Glu Thr Phe Thr Tyr Ser Gln Val Glu Leu Leu Ser Arg
50 55 60
Lys Val Ala Ser Gly Leu Asn Lys Leu Gly Ile Gln Gln Gly Asp Thr
65 70 75 80
Ile Met Leu Leu Leu Pro Asn Ser Pro Glu Tyr Phe Phe Ala Phe Leu
85 90 95
Gly Ala Ser Tyr Arg Gly Ala Ile Ser Thr Met Ala Asn Pro Phe Phe
100 105 110
Thr Ser Ala Glu Val Ile Lys Gln Leu Lys Ala Ser Gln Ala Lys Leu
115 120 125
Ile Ile Thr Gln Ala Cys Tyr Val Asp Lys Val Lys Asp Tyr Ala Ala
130 135 140
Glu Lys Asn Ile Gln Ile Ile Cys Ile Asp Asp Ala Pro Gln Asp Cys
145 150 155 160
Leu His Phe Ser Lys Leu Met Glu Ala Asp Glu Ser Glu Met Pro Glu
165 170 175
Val Val Ile Asn Ser Asp Asp Val Val Ala Leu Pro Tyr Ser Ser Gly
180 185 190
Thr Thr Gly Leu Pro Lys Gly Val Met Leu Thr His Lys Gly Leu Val
195 200 205
Thr Ser Val Ala Gln Gln Val Asp Gly Asp Asn Pro Asn Leu Tyr Met
210 215 220
His Ser Glu Asp Val Met Ile Cys Ile Leu Pro Leu Phe His Ile Tyr
225 230 235 240
Ser Leu Asn Ala Val Leu Cys Cys Gly Leu Arg Ala Gly Val Thr Ile
245 250 255
Leu Ile Met Gln Lys Phe Asp Ile Val Pro Phe Leu Glu Leu Ile Gln
260 265 270
Lys Tyr Lys Val Thr Ile Gly Pro Phe Val Pro Pro Ile Val Leu Ala
275 280 285
Ile Ala Lys Ser Pro Val Val Asp Lys Tyr Asp Leu Ser Ser Val Arg
290 295 300
Thr Val Met Ser Gly Ala Ala Pro Leu Gly Lys Glu Leu Glu Asp Ala
305 310 315 320
Val Arg Ala Lys Phe Pro Asn Ala Lys Leu Gly Gln Gly Tyr Gly Met
325 330 335
Thr Glu Ala Gly Pro Val Leu Ala Met Cys Leu Ala Phe Ala Lys Glu
340 345 350
Pro Tyr Glu Ile Lys Ser Gly Ala Cys Gly Thr Val Val Arg Asn Ala
355 360 365
Glu Met Lys Ile Val Asp Pro Glu Thr Asn Ala Ser Leu Pro Arg Asn
370 375 380
Gln Arg Gly Glu Ile Cys Ile Arg Gly Asp Gln Ile Met Lys Gly Tyr
385 390 395 400
Leu Asn Asp Pro Glu Ser Thr Arg Thr Thr Ile Asp Glu Glu Gly Trp
405 410 415
Leu His Thr Gly Asp Ile Gly Phe Ile Asp Asp Asp Asp Glu Leu Phe
420 425 430
Ile Val Asp Arg Leu Lys Glu Ile Ile Lys Tyr Lys Gly Phe Gln Val
435 440 445
Ala Pro Ala Glu Leu Glu Ala Leu Leu Leu Thr His Pro Thr Ile Ser
450 455 460
Asp Ala Ala Val Val Pro Met Ile Asp Glu Lys Ala Gly Glu Val Pro
465 470 475 480
Val Ala Phe Val Val Arg Thr Asn Gly Phe Thr Thr Thr Glu Glu Glu
485 490 495
Ile Lys Gln Phe Val Ser Lys Gln Val Val Phe Tyr Lys Arg Ile Phe
500 505 510
Arg Val Phe Phe Val Asp Ala Ile Pro Lys Ser Pro Ser Gly Lys Ile
515 520 525
Leu Arg Lys Asp Leu Arg Ala Arg Ile Ala Ser Gly Asp Leu Pro Lys
530 535 540
<210> 6
<211> 553
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(553)
<223> (GGGGS)2-4CL
<400> 6
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Asp Cys Val Ala Pro
1 5 10 15
Lys Glu Asp Leu Ile Phe Arg Ser Lys Leu Pro Asp Ile Tyr Ile Pro
20 25 30
Lys His Leu Pro Leu His Thr Tyr Cys Phe Glu Asn Ile Ser Lys Val
35 40 45
Gly Asp Lys Ser Cys Leu Ile Asn Gly Ala Thr Gly Glu Thr Phe Thr
50 55 60
Tyr Ser Gln Val Glu Leu Leu Ser Arg Lys Val Ala Ser Gly Leu Asn
65 70 75 80
Lys Leu Gly Ile Gln Gln Gly Asp Thr Ile Met Leu Leu Leu Pro Asn
85 90 95
Ser Pro Glu Tyr Phe Phe Ala Phe Leu Gly Ala Ser Tyr Arg Gly Ala
100 105 110
Ile Ser Thr Met Ala Asn Pro Phe Phe Thr Ser Ala Glu Val Ile Lys
115 120 125
Gln Leu Lys Ala Ser Gln Ala Lys Leu Ile Ile Thr Gln Ala Cys Tyr
130 135 140
Val Asp Lys Val Lys Asp Tyr Ala Ala Glu Lys Asn Ile Gln Ile Ile
145 150 155 160
Cys Ile Asp Asp Ala Pro Gln Asp Cys Leu His Phe Ser Lys Leu Met
165 170 175
Glu Ala Asp Glu Ser Glu Met Pro Glu Val Val Ile Asn Ser Asp Asp
180 185 190
Val Val Ala Leu Pro Tyr Ser Ser Gly Thr Thr Gly Leu Pro Lys Gly
195 200 205
Val Met Leu Thr His Lys Gly Leu Val Thr Ser Val Ala Gln Gln Val
210 215 220
Asp Gly Asp Asn Pro Asn Leu Tyr Met His Ser Glu Asp Val Met Ile
225 230 235 240
Cys Ile Leu Pro Leu Phe His Ile Tyr Ser Leu Asn Ala Val Leu Cys
245 250 255
Cys Gly Leu Arg Ala Gly Val Thr Ile Leu Ile Met Gln Lys Phe Asp
260 265 270
Ile Val Pro Phe Leu Glu Leu Ile Gln Lys Tyr Lys Val Thr Ile Gly
275 280 285
Pro Phe Val Pro Pro Ile Val Leu Ala Ile Ala Lys Ser Pro Val Val
290 295 300
Asp Lys Tyr Asp Leu Ser Ser Val Arg Thr Val Met Ser Gly Ala Ala
305 310 315 320
Pro Leu Gly Lys Glu Leu Glu Asp Ala Val Arg Ala Lys Phe Pro Asn
325 330 335
Ala Lys Leu Gly Gln Gly Tyr Gly Met Thr Glu Ala Gly Pro Val Leu
340 345 350
Ala Met Cys Leu Ala Phe Ala Lys Glu Pro Tyr Glu Ile Lys Ser Gly
355 360 365
Ala Cys Gly Thr Val Val Arg Asn Ala Glu Met Lys Ile Val Asp Pro
370 375 380
Glu Thr Asn Ala Ser Leu Pro Arg Asn Gln Arg Gly Glu Ile Cys Ile
385 390 395 400
Arg Gly Asp Gln Ile Met Lys Gly Tyr Leu Asn Asp Pro Glu Ser Thr
405 410 415
Arg Thr Thr Ile Asp Glu Glu Gly Trp Leu His Thr Gly Asp Ile Gly
420 425 430
Phe Ile Asp Asp Asp Asp Glu Leu Phe Ile Val Asp Arg Leu Lys Glu
435 440 445
Ile Ile Lys Tyr Lys Gly Phe Gln Val Ala Pro Ala Glu Leu Glu Ala
450 455 460
Leu Leu Leu Thr His Pro Thr Ile Ser Asp Ala Ala Val Val Pro Met
465 470 475 480
Ile Asp Glu Lys Ala Gly Glu Val Pro Val Ala Phe Val Val Arg Thr
485 490 495
Asn Gly Phe Thr Thr Thr Glu Glu Glu Ile Lys Gln Phe Val Ser Lys
500 505 510
Gln Val Val Phe Tyr Lys Arg Ile Phe Arg Val Phe Phe Val Asp Ala
515 520 525
Ile Pro Lys Ser Pro Ser Gly Lys Ile Leu Arg Lys Asp Leu Arg Ala
530 535 540
Arg Ile Ala Ser Gly Asp Leu Pro Lys
545 550
<210> 7
<211> 561
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(561)
<223> PDZlig-(GGGGS)2-4CL
<400> 7
Met Gly Val Lys Glu Ser Leu Val Gly Gly Gly Gly Ser Gly Gly Gly
1 5 10 15
Gly Ser Gly Asp Cys Val Ala Pro Lys Glu Asp Leu Ile Phe Arg Ser
20 25 30
Lys Leu Pro Asp Ile Tyr Ile Pro Lys His Leu Pro Leu His Thr Tyr
35 40 45
Cys Phe Glu Asn Ile Ser Lys Val Gly Asp Lys Ser Cys Leu Ile Asn
50 55 60
Gly Ala Thr Gly Glu Thr Phe Thr Tyr Ser Gln Val Glu Leu Leu Ser
65 70 75 80
Arg Lys Val Ala Ser Gly Leu Asn Lys Leu Gly Ile Gln Gln Gly Asp
85 90 95
Thr Ile Met Leu Leu Leu Pro Asn Ser Pro Glu Tyr Phe Phe Ala Phe
100 105 110
Leu Gly Ala Ser Tyr Arg Gly Ala Ile Ser Thr Met Ala Asn Pro Phe
115 120 125
Phe Thr Ser Ala Glu Val Ile Lys Gln Leu Lys Ala Ser Gln Ala Lys
130 135 140
Leu Ile Ile Thr Gln Ala Cys Tyr Val Asp Lys Val Lys Asp Tyr Ala
145 150 155 160
Ala Glu Lys Asn Ile Gln Ile Ile Cys Ile Asp Asp Ala Pro Gln Asp
165 170 175
Cys Leu His Phe Ser Lys Leu Met Glu Ala Asp Glu Ser Glu Met Pro
180 185 190
Glu Val Val Ile Asn Ser Asp Asp Val Val Ala Leu Pro Tyr Ser Ser
195 200 205
Gly Thr Thr Gly Leu Pro Lys Gly Val Met Leu Thr His Lys Gly Leu
210 215 220
Val Thr Ser Val Ala Gln Gln Val Asp Gly Asp Asn Pro Asn Leu Tyr
225 230 235 240
Met His Ser Glu Asp Val Met Ile Cys Ile Leu Pro Leu Phe His Ile
245 250 255
Tyr Ser Leu Asn Ala Val Leu Cys Cys Gly Leu Arg Ala Gly Val Thr
260 265 270
Ile Leu Ile Met Gln Lys Phe Asp Ile Val Pro Phe Leu Glu Leu Ile
275 280 285
Gln Lys Tyr Lys Val Thr Ile Gly Pro Phe Val Pro Pro Ile Val Leu
290 295 300
Ala Ile Ala Lys Ser Pro Val Val Asp Lys Tyr Asp Leu Ser Ser Val
305 310 315 320
Arg Thr Val Met Ser Gly Ala Ala Pro Leu Gly Lys Glu Leu Glu Asp
325 330 335
Ala Val Arg Ala Lys Phe Pro Asn Ala Lys Leu Gly Gln Gly Tyr Gly
340 345 350
Met Thr Glu Ala Gly Pro Val Leu Ala Met Cys Leu Ala Phe Ala Lys
355 360 365
Glu Pro Tyr Glu Ile Lys Ser Gly Ala Cys Gly Thr Val Val Arg Asn
370 375 380
Ala Glu Met Lys Ile Val Asp Pro Glu Thr Asn Ala Ser Leu Pro Arg
385 390 395 400
Asn Gln Arg Gly Glu Ile Cys Ile Arg Gly Asp Gln Ile Met Lys Gly
405 410 415
Tyr Leu Asn Asp Pro Glu Ser Thr Arg Thr Thr Ile Asp Glu Glu Gly
420 425 430
Trp Leu His Thr Gly Asp Ile Gly Phe Ile Asp Asp Asp Asp Glu Leu
435 440 445
Phe Ile Val Asp Arg Leu Lys Glu Ile Ile Lys Tyr Lys Gly Phe Gln
450 455 460
Val Ala Pro Ala Glu Leu Glu Ala Leu Leu Leu Thr His Pro Thr Ile
465 470 475 480
Ser Asp Ala Ala Val Val Pro Met Ile Asp Glu Lys Ala Gly Glu Val
485 490 495
Pro Val Ala Phe Val Val Arg Thr Asn Gly Phe Thr Thr Thr Glu Glu
500 505 510
Glu Ile Lys Gln Phe Val Ser Lys Gln Val Val Phe Tyr Lys Arg Ile
515 520 525
Phe Arg Val Phe Phe Val Asp Ala Ile Pro Lys Ser Pro Ser Gly Lys
530 535 540
Ile Leu Arg Lys Asp Leu Arg Ala Arg Ile Ala Ser Gly Asp Leu Pro
545 550 555 560
Lys
<210> 8
<211> 554
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(554)
<223> 4CL-(GGGGS)2
<400> 8
Met Gly Asp Cys Val Ala Pro Lys Glu Asp Leu Ile Phe Arg Ser Lys
1 5 10 15
Leu Pro Asp Ile Tyr Ile Pro Lys His Leu Pro Leu His Thr Tyr Cys
20 25 30
Phe Glu Asn Ile Ser Lys Val Gly Asp Lys Ser Cys Leu Ile Asn Gly
35 40 45
Ala Thr Gly Glu Thr Phe Thr Tyr Ser Gln Val Glu Leu Leu Ser Arg
50 55 60
Lys Val Ala Ser Gly Leu Asn Lys Leu Gly Ile Gln Gln Gly Asp Thr
65 70 75 80
Ile Met Leu Leu Leu Pro Asn Ser Pro Glu Tyr Phe Phe Ala Phe Leu
85 90 95
Gly Ala Ser Tyr Arg Gly Ala Ile Ser Thr Met Ala Asn Pro Phe Phe
100 105 110
Thr Ser Ala Glu Val Ile Lys Gln Leu Lys Ala Ser Gln Ala Lys Leu
115 120 125
Ile Ile Thr Gln Ala Cys Tyr Val Asp Lys Val Lys Asp Tyr Ala Ala
130 135 140
Glu Lys Asn Ile Gln Ile Ile Cys Ile Asp Asp Ala Pro Gln Asp Cys
145 150 155 160
Leu His Phe Ser Lys Leu Met Glu Ala Asp Glu Ser Glu Met Pro Glu
165 170 175
Val Val Ile Asn Ser Asp Asp Val Val Ala Leu Pro Tyr Ser Ser Gly
180 185 190
Thr Thr Gly Leu Pro Lys Gly Val Met Leu Thr His Lys Gly Leu Val
195 200 205
Thr Ser Val Ala Gln Gln Val Asp Gly Asp Asn Pro Asn Leu Tyr Met
210 215 220
His Ser Glu Asp Val Met Ile Cys Ile Leu Pro Leu Phe His Ile Tyr
225 230 235 240
Ser Leu Asn Ala Val Leu Cys Cys Gly Leu Arg Ala Gly Val Thr Ile
245 250 255
Leu Ile Met Gln Lys Phe Asp Ile Val Pro Phe Leu Glu Leu Ile Gln
260 265 270
Lys Tyr Lys Val Thr Ile Gly Pro Phe Val Pro Pro Ile Val Leu Ala
275 280 285
Ile Ala Lys Ser Pro Val Val Asp Lys Tyr Asp Leu Ser Ser Val Arg
290 295 300
Thr Val Met Ser Gly Ala Ala Pro Leu Gly Lys Glu Leu Glu Asp Ala
305 310 315 320
Val Arg Ala Lys Phe Pro Asn Ala Lys Leu Gly Gln Gly Tyr Gly Met
325 330 335
Thr Glu Ala Gly Pro Val Leu Ala Met Cys Leu Ala Phe Ala Lys Glu
340 345 350
Pro Tyr Glu Ile Lys Ser Gly Ala Cys Gly Thr Val Val Arg Asn Ala
355 360 365
Glu Met Lys Ile Val Asp Pro Glu Thr Asn Ala Ser Leu Pro Arg Asn
370 375 380
Gln Arg Gly Glu Ile Cys Ile Arg Gly Asp Gln Ile Met Lys Gly Tyr
385 390 395 400
Leu Asn Asp Pro Glu Ser Thr Arg Thr Thr Ile Asp Glu Glu Gly Trp
405 410 415
Leu His Thr Gly Asp Ile Gly Phe Ile Asp Asp Asp Asp Glu Leu Phe
420 425 430
Ile Val Asp Arg Leu Lys Glu Ile Ile Lys Tyr Lys Gly Phe Gln Val
435 440 445
Ala Pro Ala Glu Leu Glu Ala Leu Leu Leu Thr His Pro Thr Ile Ser
450 455 460
Asp Ala Ala Val Val Pro Met Ile Asp Glu Lys Ala Gly Glu Val Pro
465 470 475 480
Val Ala Phe Val Val Arg Thr Asn Gly Phe Thr Thr Thr Glu Glu Glu
485 490 495
Ile Lys Gln Phe Val Ser Lys Gln Val Val Phe Tyr Lys Arg Ile Phe
500 505 510
Arg Val Phe Phe Val Asp Ala Ile Pro Lys Ser Pro Ser Gly Lys Ile
515 520 525
Leu Arg Lys Asp Leu Arg Ala Arg Ile Ala Ser Gly Asp Leu Pro Lys
530 535 540
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
545 550
<210> 9
<211> 561
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(561)
<223> 4CL-(GGGGS)2- PDZlig
<400> 9
Met Gly Asp Cys Val Ala Pro Lys Glu Asp Leu Ile Phe Arg Ser Lys
1 5 10 15
Leu Pro Asp Ile Tyr Ile Pro Lys His Leu Pro Leu His Thr Tyr Cys
20 25 30
Phe Glu Asn Ile Ser Lys Val Gly Asp Lys Ser Cys Leu Ile Asn Gly
35 40 45
Ala Thr Gly Glu Thr Phe Thr Tyr Ser Gln Val Glu Leu Leu Ser Arg
50 55 60
Lys Val Ala Ser Gly Leu Asn Lys Leu Gly Ile Gln Gln Gly Asp Thr
65 70 75 80
Ile Met Leu Leu Leu Pro Asn Ser Pro Glu Tyr Phe Phe Ala Phe Leu
85 90 95
Gly Ala Ser Tyr Arg Gly Ala Ile Ser Thr Met Ala Asn Pro Phe Phe
100 105 110
Thr Ser Ala Glu Val Ile Lys Gln Leu Lys Ala Ser Gln Ala Lys Leu
115 120 125
Ile Ile Thr Gln Ala Cys Tyr Val Asp Lys Val Lys Asp Tyr Ala Ala
130 135 140
Glu Lys Asn Ile Gln Ile Ile Cys Ile Asp Asp Ala Pro Gln Asp Cys
145 150 155 160
Leu His Phe Ser Lys Leu Met Glu Ala Asp Glu Ser Glu Met Pro Glu
165 170 175
Val Val Ile Asn Ser Asp Asp Val Val Ala Leu Pro Tyr Ser Ser Gly
180 185 190
Thr Thr Gly Leu Pro Lys Gly Val Met Leu Thr His Lys Gly Leu Val
195 200 205
Thr Ser Val Ala Gln Gln Val Asp Gly Asp Asn Pro Asn Leu Tyr Met
210 215 220
His Ser Glu Asp Val Met Ile Cys Ile Leu Pro Leu Phe His Ile Tyr
225 230 235 240
Ser Leu Asn Ala Val Leu Cys Cys Gly Leu Arg Ala Gly Val Thr Ile
245 250 255
Leu Ile Met Gln Lys Phe Asp Ile Val Pro Phe Leu Glu Leu Ile Gln
260 265 270
Lys Tyr Lys Val Thr Ile Gly Pro Phe Val Pro Pro Ile Val Leu Ala
275 280 285
Ile Ala Lys Ser Pro Val Val Asp Lys Tyr Asp Leu Ser Ser Val Arg
290 295 300
Thr Val Met Ser Gly Ala Ala Pro Leu Gly Lys Glu Leu Glu Asp Ala
305 310 315 320
Val Arg Ala Lys Phe Pro Asn Ala Lys Leu Gly Gln Gly Tyr Gly Met
325 330 335
Thr Glu Ala Gly Pro Val Leu Ala Met Cys Leu Ala Phe Ala Lys Glu
340 345 350
Pro Tyr Glu Ile Lys Ser Gly Ala Cys Gly Thr Val Val Arg Asn Ala
355 360 365
Glu Met Lys Ile Val Asp Pro Glu Thr Asn Ala Ser Leu Pro Arg Asn
370 375 380
Gln Arg Gly Glu Ile Cys Ile Arg Gly Asp Gln Ile Met Lys Gly Tyr
385 390 395 400
Leu Asn Asp Pro Glu Ser Thr Arg Thr Thr Ile Asp Glu Glu Gly Trp
405 410 415
Leu His Thr Gly Asp Ile Gly Phe Ile Asp Asp Asp Asp Glu Leu Phe
420 425 430
Ile Val Asp Arg Leu Lys Glu Ile Ile Lys Tyr Lys Gly Phe Gln Val
435 440 445
Ala Pro Ala Glu Leu Glu Ala Leu Leu Leu Thr His Pro Thr Ile Ser
450 455 460
Asp Ala Ala Val Val Pro Met Ile Asp Glu Lys Ala Gly Glu Val Pro
465 470 475 480
Val Ala Phe Val Val Arg Thr Asn Gly Phe Thr Thr Thr Glu Glu Glu
485 490 495
Ile Lys Gln Phe Val Ser Lys Gln Val Val Phe Tyr Lys Arg Ile Phe
500 505 510
Arg Val Phe Phe Val Asp Ala Ile Pro Lys Ser Pro Ser Gly Lys Ile
515 520 525
Leu Arg Lys Asp Leu Arg Ala Arg Ile Ala Ser Gly Asp Leu Pro Lys
530 535 540
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Val Lys Glu Ser Leu
545 550 555 560
Val
<210> 10
<211> 565
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(565)
<223> SH3lig-(GGGGS)2-4CL
<400> 10
Met Pro Pro Pro Ala Leu Pro Pro Lys Arg Arg Arg Gly Gly Gly Gly
1 5 10 15
Ser Gly Gly Gly Gly Ser Gly Asp Cys Val Ala Pro Lys Glu Asp Leu
20 25 30
Ile Phe Arg Ser Lys Leu Pro Asp Ile Tyr Ile Pro Lys His Leu Pro
35 40 45
Leu His Thr Tyr Cys Phe Glu Asn Ile Ser Lys Val Gly Asp Lys Ser
50 55 60
Cys Leu Ile Asn Gly Ala Thr Gly Glu Thr Phe Thr Tyr Ser Gln Val
65 70 75 80
Glu Leu Leu Ser Arg Lys Val Ala Ser Gly Leu Asn Lys Leu Gly Ile
85 90 95
Gln Gln Gly Asp Thr Ile Met Leu Leu Leu Pro Asn Ser Pro Glu Tyr
100 105 110
Phe Phe Ala Phe Leu Gly Ala Ser Tyr Arg Gly Ala Ile Ser Thr Met
115 120 125
Ala Asn Pro Phe Phe Thr Ser Ala Glu Val Ile Lys Gln Leu Lys Ala
130 135 140
Ser Gln Ala Lys Leu Ile Ile Thr Gln Ala Cys Tyr Val Asp Lys Val
145 150 155 160
Lys Asp Tyr Ala Ala Glu Lys Asn Ile Gln Ile Ile Cys Ile Asp Asp
165 170 175
Ala Pro Gln Asp Cys Leu His Phe Ser Lys Leu Met Glu Ala Asp Glu
180 185 190
Ser Glu Met Pro Glu Val Val Ile Asn Ser Asp Asp Val Val Ala Leu
195 200 205
Pro Tyr Ser Ser Gly Thr Thr Gly Leu Pro Lys Gly Val Met Leu Thr
210 215 220
His Lys Gly Leu Val Thr Ser Val Ala Gln Gln Val Asp Gly Asp Asn
225 230 235 240
Pro Asn Leu Tyr Met His Ser Glu Asp Val Met Ile Cys Ile Leu Pro
245 250 255
Leu Phe His Ile Tyr Ser Leu Asn Ala Val Leu Cys Cys Gly Leu Arg
260 265 270
Ala Gly Val Thr Ile Leu Ile Met Gln Lys Phe Asp Ile Val Pro Phe
275 280 285
Leu Glu Leu Ile Gln Lys Tyr Lys Val Thr Ile Gly Pro Phe Val Pro
290 295 300
Pro Ile Val Leu Ala Ile Ala Lys Ser Pro Val Val Asp Lys Tyr Asp
305 310 315 320
Leu Ser Ser Val Arg Thr Val Met Ser Gly Ala Ala Pro Leu Gly Lys
325 330 335
Glu Leu Glu Asp Ala Val Arg Ala Lys Phe Pro Asn Ala Lys Leu Gly
340 345 350
Gln Gly Tyr Gly Met Thr Glu Ala Gly Pro Val Leu Ala Met Cys Leu
355 360 365
Ala Phe Ala Lys Glu Pro Tyr Glu Ile Lys Ser Gly Ala Cys Gly Thr
370 375 380
Val Val Arg Asn Ala Glu Met Lys Ile Val Asp Pro Glu Thr Asn Ala
385 390 395 400
Ser Leu Pro Arg Asn Gln Arg Gly Glu Ile Cys Ile Arg Gly Asp Gln
405 410 415
Ile Met Lys Gly Tyr Leu Asn Asp Pro Glu Ser Thr Arg Thr Thr Ile
420 425 430
Asp Glu Glu Gly Trp Leu His Thr Gly Asp Ile Gly Phe Ile Asp Asp
435 440 445
Asp Asp Glu Leu Phe Ile Val Asp Arg Leu Lys Glu Ile Ile Lys Tyr
450 455 460
Lys Gly Phe Gln Val Ala Pro Ala Glu Leu Glu Ala Leu Leu Leu Thr
465 470 475 480
His Pro Thr Ile Ser Asp Ala Ala Val Val Pro Met Ile Asp Glu Lys
485 490 495
Ala Gly Glu Val Pro Val Ala Phe Val Val Arg Thr Asn Gly Phe Thr
500 505 510
Thr Thr Glu Glu Glu Ile Lys Gln Phe Val Ser Lys Gln Val Val Phe
515 520 525
Tyr Lys Arg Ile Phe Arg Val Phe Phe Val Asp Ala Ile Pro Lys Ser
530 535 540
Pro Ser Gly Lys Ile Leu Arg Lys Asp Leu Arg Ala Arg Ile Ala Ser
545 550 555 560
Gly Asp Leu Pro Lys
565
<210> 11
<211> 565
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(565)
<223> 4CL-(GGGGS)2-SH3lig
<400> 11
Met Gly Asp Cys Val Ala Pro Lys Glu Asp Leu Ile Phe Arg Ser Lys
1 5 10 15
Leu Pro Asp Ile Tyr Ile Pro Lys His Leu Pro Leu His Thr Tyr Cys
20 25 30
Phe Glu Asn Ile Ser Lys Val Gly Asp Lys Ser Cys Leu Ile Asn Gly
35 40 45
Ala Thr Gly Glu Thr Phe Thr Tyr Ser Gln Val Glu Leu Leu Ser Arg
50 55 60
Lys Val Ala Ser Gly Leu Asn Lys Leu Gly Ile Gln Gln Gly Asp Thr
65 70 75 80
Ile Met Leu Leu Leu Pro Asn Ser Pro Glu Tyr Phe Phe Ala Phe Leu
85 90 95
Gly Ala Ser Tyr Arg Gly Ala Ile Ser Thr Met Ala Asn Pro Phe Phe
100 105 110
Thr Ser Ala Glu Val Ile Lys Gln Leu Lys Ala Ser Gln Ala Lys Leu
115 120 125
Ile Ile Thr Gln Ala Cys Tyr Val Asp Lys Val Lys Asp Tyr Ala Ala
130 135 140
Glu Lys Asn Ile Gln Ile Ile Cys Ile Asp Asp Ala Pro Gln Asp Cys
145 150 155 160
Leu His Phe Ser Lys Leu Met Glu Ala Asp Glu Ser Glu Met Pro Glu
165 170 175
Val Val Ile Asn Ser Asp Asp Val Val Ala Leu Pro Tyr Ser Ser Gly
180 185 190
Thr Thr Gly Leu Pro Lys Gly Val Met Leu Thr His Lys Gly Leu Val
195 200 205
Thr Ser Val Ala Gln Gln Val Asp Gly Asp Asn Pro Asn Leu Tyr Met
210 215 220
His Ser Glu Asp Val Met Ile Cys Ile Leu Pro Leu Phe His Ile Tyr
225 230 235 240
Ser Leu Asn Ala Val Leu Cys Cys Gly Leu Arg Ala Gly Val Thr Ile
245 250 255
Leu Ile Met Gln Lys Phe Asp Ile Val Pro Phe Leu Glu Leu Ile Gln
260 265 270
Lys Tyr Lys Val Thr Ile Gly Pro Phe Val Pro Pro Ile Val Leu Ala
275 280 285
Ile Ala Lys Ser Pro Val Val Asp Lys Tyr Asp Leu Ser Ser Val Arg
290 295 300
Thr Val Met Ser Gly Ala Ala Pro Leu Gly Lys Glu Leu Glu Asp Ala
305 310 315 320
Val Arg Ala Lys Phe Pro Asn Ala Lys Leu Gly Gln Gly Tyr Gly Met
325 330 335
Thr Glu Ala Gly Pro Val Leu Ala Met Cys Leu Ala Phe Ala Lys Glu
340 345 350
Pro Tyr Glu Ile Lys Ser Gly Ala Cys Gly Thr Val Val Arg Asn Ala
355 360 365
Glu Met Lys Ile Val Asp Pro Glu Thr Asn Ala Ser Leu Pro Arg Asn
370 375 380
Gln Arg Gly Glu Ile Cys Ile Arg Gly Asp Gln Ile Met Lys Gly Tyr
385 390 395 400
Leu Asn Asp Pro Glu Ser Thr Arg Thr Thr Ile Asp Glu Glu Gly Trp
405 410 415
Leu His Thr Gly Asp Ile Gly Phe Ile Asp Asp Asp Asp Glu Leu Phe
420 425 430
Ile Val Asp Arg Leu Lys Glu Ile Ile Lys Tyr Lys Gly Phe Gln Val
435 440 445
Ala Pro Ala Glu Leu Glu Ala Leu Leu Leu Thr His Pro Thr Ile Ser
450 455 460
Asp Ala Ala Val Val Pro Met Ile Asp Glu Lys Ala Gly Glu Val Pro
465 470 475 480
Val Ala Phe Val Val Arg Thr Asn Gly Phe Thr Thr Thr Glu Glu Glu
485 490 495
Ile Lys Gln Phe Val Ser Lys Gln Val Val Phe Tyr Lys Arg Ile Phe
500 505 510
Arg Val Phe Phe Val Asp Ala Ile Pro Lys Ser Pro Ser Gly Lys Ile
515 520 525
Leu Arg Lys Asp Leu Arg Ala Arg Ile Ala Ser Gly Asp Leu Pro Lys
530 535 540
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Pro Pro Pro Ala Leu Pro
545 550 555 560
Pro Lys Arg Arg Arg
565
<210> 12
<211> 7
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(7)
<223> PDZ ligand
<400> 12
Gly Val Lys Glu Ser Leu Val
1 5
<210> 13
<211> 57
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(57)
<223> SH3结构域
<400> 13
Ala Glu Tyr Val Arg Ala Leu Phe Asp Phe Asn Gly Asn Asp Glu Glu
1 5 10 15
Asp Leu Pro Phe Lys Lys Gly Asp Ile Leu Arg Ile Arg Asp Lys Pro
20 25 30
Glu Glu Gln Trp Trp Asn Ala Glu Asp Ser Glu Gly Lys Arg Gly Met
35 40 45
Ile Pro Val Pro Tyr Val Glu Lys Tyr
50 55
<210> 14
<211> 11
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<221> PEPTIDE
<222> (1)..(11)
<223> SH3lig
<400> 14
Pro Pro Pro Ala Leu Pro Pro Lys Arg Arg Arg
1 5 10

Claims (20)

1.一种用于合成黄芩素、野黄芩素类化合物的原核细胞,其包括外源的下组酶的编码基因:黄酮6-羟化酶,细胞色素P450氧化还原酶,苯丙氨酸解氨酶、4-香豆酸辅酶A连接酶、查尔酮合成酶、查尔酮异构酶和黄酮合成酶I;且所述酶被表达后,苯丙氨酸解氨酶和4-香豆酸辅酶A连接酶构成复合体。
2.一种用于合成白杨素类化合物的原核细胞,其包括外源的下组酶的编码基因:苯丙氨酸解氨酶、4-香豆酸辅酶A连接酶、查尔酮合成酶、查尔酮异构酶和黄酮合成酶I;且所述酶被表达后,苯丙氨酸解氨酶和4-香豆酸辅酶A连接酶构成复合体。
3.如权利要求1或2所述的原核细胞,其特征在于,所述的苯丙氨酸解氨酶和4-香豆酸辅酶A连接酶的复合体包括:
苯丙氨酸解氨酶和4-香豆酸辅酶A通过蛋白-蛋白相互作用结构域及其配体的结合而靠近,获得复合体;或
苯丙氨酸解氨酶和4-香豆酸辅酶A连接酶直接连接或通过连接子连接、获得融合蛋白形式的复合体。
4.如权利要求3所述的原核细胞,其特征在于,所述蛋白-蛋白相互作用结构域包括选自下组的结构域:PDZ结构域,SH3结构域,WW结构域,LIM结构域,DD结构域,PH结构域,EH结构域,GBD结构域。
5.如权利要求3所述的原核细胞,其特征在于,所述蛋白-蛋白相互作用结构域包括PDZ结构域,其配体为PDZ ligand;所述苯丙氨酸解氨酶和4-香豆酸辅酶A分别与所述PDZ结构域及其配体融合;较佳地,所述苯丙氨酸解氨酶与PDZ融合、所述4-香豆酸辅酶A与PDZligand融合;更佳地,所述苯丙氨酸解氨酶与PDZ融合时还包括以ER/K连接子连接,所述4-香豆酸辅酶A与PDZ ligand融合时还包括以(GGGGS)2连接子连接。
6.如权利要求3所述的原核细胞,其特征在于,所述蛋白-蛋白相互作用结构域包括SH3结构域,其配体为SH3 ligand;所述苯丙氨酸解氨酶和4-香豆酸辅酶A分别与所述SH3结构域及其配体融合;较佳地,所述苯丙氨酸解氨酶与SH3融合、所述4-香豆酸辅酶A与SH3ligand融合;更佳地,所述苯丙氨酸解氨酶与SH3融合时还包括以ER/K连接子连接,所述4-香豆酸辅酶A与SH3 ligand融合时还包括以(GGGGS)2连接子连接。
7.如权利要求5或6所述的原核细胞,其特征在于,所述苯丙氨酸解氨酶与PDZ融合时,所述苯丙氨酸解氨酶位于N端,所述PDZ位于C端;或
所述4-香豆酸辅酶A与PDZ ligand融合时,所述PDZ ligand位于N端,所述4-香豆酸辅酶A位于C端;或
所述苯丙氨酸解氨酶与SH3融合时;所述苯丙氨酸解氨酶位于N端,所述SH3位于C端;或
所述4-香豆酸辅酶A与SH3 ligand融合时,所述SH3 ligand位于N端,所述4-香豆酸辅酶A位于C端。
8.如权利要求1所述的原核细胞,其特征在于,所述细胞中还包括外源的促进丙二酰CoA生成的酶的编码基因;较佳地,包括matC,matB,ACS,FabF。
9.如权利要求1所述的原核细胞,其特征在于,所述的原核细胞为大肠杆菌细胞。
10.如权利要求1~9任一所述的原核细胞,其特征在于,所述细胞中还包括外源的促进苯丙氨酸合成的酶的编码基因;较佳地,包括:aroG,pheA;更佳地,所述pheA为第976位由A突变为C的基因;更佳地,所述aroG为第436位由G突变为A的基因。
11.权利要求1、3~10任一所述的原核细胞的应用,用于合成黄芩素或野黄芩素类化合物。
12.权利要求2、3~10任一所述的原核细胞的应用,用于合成白杨素类化合物。
13.一种合成黄芩素或野黄芩素类化合物的方法,包括:提供权利要求1、3~10任一所述的原核细胞,以式(I)为底物,合成黄芩素或野黄芩素类化合物;
Figure FDA0002884548870000031
其中,R包括H或OH。
14.一种合成白杨素类化合物的方法,包括:提供权利要求2、3~10任一所述的原核细胞,以式(I)为底物,合成白杨素类化合物;
Figure FDA0002884548870000032
其中,R包括H或OH。
15.一种合成黄芩素类化合物或白杨素类化合物的方法,包括:提供权利要求10所述的原核细胞,以葡萄糖为底物,合成黄芩素类化合物或白杨素类化合物。
16.如权利要求13~15任一所述的方法,其特征在于,在引入细胞时,
所述PDZligand、4-香豆酸辅酶A连接酶、苯丙氨酸解氨酶、ER/K、PDZ、黄酮合成酶I、查尔酮合成酶、查尔酮异构酶的编码基因位于一个构建体中;
所述黄酮6-羟化酶,细胞色素P450氧化还原酶的编码基因位于一个构建体中,较佳地还包括2B1基因;
所述matC,matB,ACS,FabF的编码基因位于一个构建体中;
所述SH3lig,4-香豆酸辅酶A连接酶,苯丙氨酸解氨酶,ER/K,SH3,查尔酮合成酶的编码基因位于一个构建体中;
所述查尔酮异构酶,黄酮合成酶I的编码基因位于一个构建体中;或
所述matC、matB、ACS、FabF的编码基因,第976位由A突变为C的pheA基因,第436位由G突变为A的aroG基因位于一个构建体中。
17.一种试剂盒,其包括权利要求1~10任一所述的重组的宿主细胞。
18.一种试剂盒,其包括:包含PDZligand、4-香豆酸辅酶A连接酶、苯丙氨酸解氨酶、ER/K、PDZ、黄酮合成酶I、查尔酮合成酶、查尔酮异构酶的编码基因的构建体;
包含黄酮6-羟化酶,细胞色素P450氧化还原酶的编码基因的构建体;较佳地所述构建体还包含2B1基因;
包含matC,matB,ACS,FabF的编码基因的构建体;
包含SH3lig,4-香豆酸辅酶A连接酶,苯丙氨酸解氨酶,ER/K,SH3,查尔酮合成酶的编码基因的构建体;
包含查尔酮异构酶,黄酮合成酶I的编码基因的构建体;
包含matC、matB、ACS、FabF的编码基因,第976位由A突变为C的pheA基因,第436位由G突变为A的aroG基因的构建体。
19.一种试剂盒,其包括:包含PDZligand、4-香豆酸辅酶A连接酶、苯丙氨酸解氨酶、ER/K、PDZ、黄酮合成酶I、查尔酮合成酶、查尔酮异构酶的编码基因的构建体;
包含matC,matB,ACS,FabF的编码基因的构建体;
包含SH3lig,4-香豆酸辅酶A连接酶,苯丙氨酸解氨酶,ER/K,SH3,查尔酮合成酶的编码基因的构建体;
包含查尔酮异构酶,黄酮合成酶I的编码基因的构建体;
包含matC、matB、ACS、FabF的编码基因,第976位由A突变为C的pheA基因,第436位由G突变为A的aroG基因的构建体。
20.如权利要求17~19任一所述的试剂盒,其特征在于,其中还包括:葡萄糖;或,式(I)底物;
Figure FDA0002884548870000041
其中,R包括H或OH。
CN202110009696.4A 2021-01-05 2021-01-05 异源合成黄酮类化合物的宿主细胞及其应用 Active CN114717170B (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202110009696.4A CN114717170B (zh) 2021-01-05 2021-01-05 异源合成黄酮类化合物的宿主细胞及其应用
PCT/CN2022/070316 WO2022148377A1 (zh) 2021-01-05 2022-01-05 异源合成黄酮类化合物的宿主细胞及其应用

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110009696.4A CN114717170B (zh) 2021-01-05 2021-01-05 异源合成黄酮类化合物的宿主细胞及其应用

Publications (2)

Publication Number Publication Date
CN114717170A true CN114717170A (zh) 2022-07-08
CN114717170B CN114717170B (zh) 2024-06-04

Family

ID=82234033

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110009696.4A Active CN114717170B (zh) 2021-01-05 2021-01-05 异源合成黄酮类化合物的宿主细胞及其应用

Country Status (2)

Country Link
CN (1) CN114717170B (zh)
WO (1) WO2022148377A1 (zh)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107723317A (zh) * 2017-11-29 2018-02-23 华东理工大学 一种在大肠杆菌中生产衣康酸的方法
CN110885846A (zh) * 2018-09-07 2020-03-17 中国科学院上海生命科学研究院 合成黄芩素和野黄芩素的微生物、其制备方法及其应用

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013067173A1 (en) * 2011-11-02 2013-05-10 The Board Of Trustees Of The Leland Stanford Junior University Systematic control of protein interaction using a modular er/k linker
WO2018226757A1 (en) * 2017-06-05 2018-12-13 Sivaramakrishnan Sivaraj Screening system and methods for identifying enzyme substrates and modulators of enzyme activity
CN113621530A (zh) * 2020-04-07 2021-11-09 华东理工大学 生产汉黄芩素类化合物的基因工程酵母菌、其构建方法及应用

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107723317A (zh) * 2017-11-29 2018-02-23 华东理工大学 一种在大肠杆菌中生产衣康酸的方法
CN110885846A (zh) * 2018-09-07 2020-03-17 中国科学院上海生命科学研究院 合成黄芩素和野黄芩素的微生物、其制备方法及其应用

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LI,J.等: "Production of plant-specific flavones baicalein and scutellarein in an engineered E.coli from available phenylalanine and tyrosine", METABOLIC ENGINEERING *
方从兵,宛晓春,江昌俊: "黄酮类化合物生物合成的研究进展(综述)", 安徽农业大学学报 *

Also Published As

Publication number Publication date
CN114717170B (zh) 2024-06-04
WO2022148377A1 (zh) 2022-07-14

Similar Documents

Publication Publication Date Title
US9181539B2 (en) Strains for the production of flavonoids from glucose
CN113322288B (zh) 新型黄酮羟基化酶、合成黄酮碳苷类化合物的微生物及其应用
WO2020048523A1 (zh) 合成黄芩素和野黄芩素的微生物、其制备方法及其应用
CN108913672A (zh) 一种新型异戊烯转移酶及其应用
CN113265433A (zh) 双功能碳苷糖基转移酶及其应用
CN112391300B (zh) 水飞蓟来源的黄酮3β-羟化酶及其辅酶的应用
CN114717170B (zh) 异源合成黄酮类化合物的宿主细胞及其应用
CN110904062B (zh) 一株高产l-丙氨酸的菌株
CN108220260B (zh) 一种催化柚皮素生成山萘酚的融合酶及其应用
CN114657160B (zh) 一种糖基转移酶突变体及其应用
CN112877349B (zh) 一种重组表达载体、包含其的基因工程菌及其应用
CN114277024B (zh) 一种新型三萜合酶及其应用
CN114032222B (zh) 糖链延伸糖基转移酶突变体及其编码基因以及基因工程菌和它们的应用
KR20240032944A (ko) 람노스가 고도로 특이적인 글리코실트랜스퍼라제 및 이의 응용
CN116656641A (zh) 咖啡酸o-甲基转移酶突变体及其应用
CN113621629A (zh) 一种基于丙二酰辅酶a再生的柚皮素体外酶促合成方法
WO2023138679A1 (zh) 异源合成黄酮类化合物的调控方法与应用
CN113583983A (zh) 一种融合蛋白或其变体及其在制备骨化二醇中的应用
CN106906192B (zh) 一种葡萄糖基转移酶及在合成藏红花酸葡萄糖酯中的应用
CN114806999B (zh) 一种基因工程菌及其在制备二氢大豆苷元中的应用
CN113528471B (zh) 一种从头合成黄烷酮的三功能酶及其合成方法和应用
CN114250237B (zh) 一种酪氨酸酚裂解酶突变体、工程菌及在催化合成左旋多巴中的应用
CN114774442B (zh) 一种产灯盏乙素的重组解脂耶氏酵母及其构建方法和用途
CN109486789B (zh) 一种立体选择性提高的菜豆环氧化物水解酶突变体
CN117946988A (zh) 小檗碱桥酶突变体及其应用

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant