CN113862166A - 一种产柚皮素的酿酒酵母菌 - Google Patents

一种产柚皮素的酿酒酵母菌 Download PDF

Info

Publication number
CN113862166A
CN113862166A CN202111126967.0A CN202111126967A CN113862166A CN 113862166 A CN113862166 A CN 113862166A CN 202111126967 A CN202111126967 A CN 202111126967A CN 113862166 A CN113862166 A CN 113862166A
Authority
CN
China
Prior art keywords
plasmid
gene
naringenin
saccharomyces cerevisiae
derived
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202111126967.0A
Other languages
English (en)
Other versions
CN113862166B (zh
Inventor
范文超
刘映淼
高书良
施鑫磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Huarui Biotechnology Co ltd
Original Assignee
Zhejiang Huarui Biotechnology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Huarui Biotechnology Co ltd filed Critical Zhejiang Huarui Biotechnology Co ltd
Priority to CN202111126967.0A priority Critical patent/CN113862166B/zh
Publication of CN113862166A publication Critical patent/CN113862166A/zh
Application granted granted Critical
Publication of CN113862166B publication Critical patent/CN113862166B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/80Vectors or expression systems specially adapted for eukaryotic hosts for fungi
    • C12N15/81Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/001Oxidoreductases (1.) acting on the CH-CH group of donors (1.3)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1025Acyltransferases (2.3)
    • C12N9/1029Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
    • C12N9/1037Naringenin-chalcone synthase (2.3.1.74), i.e. chalcone synthase
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/88Lyases (4.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/90Isomerases (5.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/93Ligases (6)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P17/00Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
    • C12P17/02Oxygen as only ring hetero atoms
    • C12P17/06Oxygen as only ring hetero atoms containing a six-membered hetero ring, e.g. fluorescein
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y103/00Oxidoreductases acting on the CH-CH group of donors (1.3)
    • C12Y103/01Oxidoreductases acting on the CH-CH group of donors (1.3) with NAD+ or NADP+ as acceptor (1.3.1)
    • C12Y103/01012Prephenate dehydrogenase (1.3.1.12)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y203/00Acyltransferases (2.3)
    • C12Y203/01Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
    • C12Y203/01074Naringenin-chalcone synthase (2.3.1.74), i.e. chalcone synthase
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y401/00Carbon-carbon lyases (4.1)
    • C12Y401/01Carboxy-lyases (4.1.1)
    • C12Y401/01043Phenylpyruvate decarboxylase (4.1.1.43)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y402/00Carbon-oxygen lyases (4.2)
    • C12Y402/03Carbon-oxygen lyases (4.2) acting on phosphates (4.2.3)
    • C12Y402/03005Chorismate synthase (4.2.3.5)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y403/00Carbon-nitrogen lyases (4.3)
    • C12Y403/01Ammonia-lyases (4.3.1)
    • C12Y403/01025Phenylalanine-tyrosine ammonia-lyase (4.3.1.25)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y504/00Intramolecular transferases (5.4)
    • C12Y504/99Intramolecular transferases (5.4) transferring other groups (5.4.99)
    • C12Y504/99005Chorismate mutase (5.4.99.5)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y505/00Intramolecular lyases (5.5)
    • C12Y505/01Intramolecular lyases (5.5.1)
    • C12Y505/01006Chalcone isomerase (5.5.1.6)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y602/00Ligases forming carbon-sulfur bonds (6.2)
    • C12Y602/01Acid-Thiol Ligases (6.2.1)
    • C12Y602/010124-Coumarate-CoA ligase (6.2.1.12)

Landscapes

  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Medicinal Chemistry (AREA)
  • Mycology (AREA)
  • Plant Pathology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

本发明公开了一种构建柚皮素生产菌的方法,包括以下步骤:构建表达芳香族氨基酸合成模块的质粒A,构建表达香豆酸和桂皮酸生成模块的质粒B,构建表达柚皮素合成模块的质粒C,将质粒A、质粒B、质粒C转化入酿酒酵母BY4742中,从转化子中筛选得到能通过发酵生产柚皮素的工程菌。

Description

一种产柚皮素的酿酒酵母菌
技术领域
本发明属于代谢工程领域,具体地说,涉及一种产柚皮素的酵母菌、尤其是酿酒酵母工程菌的构建方法。
背景技术
黄酮类化合物(flavonoids)泛指两个苯环通过三个碳原子相互连接而成的一系列化合物的总称,即具有C6-C3-C6结构的一类化合物的总称。柚皮素是柚皮甙的甙元,属二氢黄酮类化合物,具有抗菌、抗炎、清除自由基、抗氧化、止咳祛痰、降血脂、抗癌抗肿瘤、抗粥样动脉硬化等作用,可被广泛地应用于医药、食品等领域。柚皮素是重要的黄酮类化合物合成的平台化合物,众多的黄酮类化合物都是以柚皮素为前体,经过羟基化、甲基化、异戊烯基化、糖基化等修饰后合成。
目前,柚皮素主要通过从柚皮中提取制备。近年来,发酵法制备柚皮素受到了极大的关注,柚皮素主要由芳香族氨基酸苯丙氨酸和酪氨酸代谢而来。2014年,江南大学吴军军等报道了基因工程大肠杆菌发酵制备柚皮素,摇瓶发酵48h,可产柚皮素391mg/L(Fine-Tuning of the Fatty Acid Pathway by Synthetic Antisense RNA for Enhanced(2S)-Naringenin Production from l-Tyrosine in Escherichia coli.Applied andEnvironmental Microbiology,2014,80(23):7283-92)。2020年,魏文平等在解脂耶氏酵母中过表达木糖利用途径和柚皮素生成路径,摇瓶发酵可产柚皮素728mg/L(Wei W,Zhang P,Shang Y,et al.Metabolically engineering of Yarrowia lipolytica for thebiosynthesis of naringenin from a mixture of glucose and xylose.BioresourceTechnology,2020,314:123726.)2021年,吴梅等利用内生真菌Phomopsis liquidambaris的黄酮合成路径在酿酒酵母中进行了重构,构建菌种通过补料发酵产柚皮素和山奈酚分别为121.53mg/L和75.38mg/L(Wu M,Gong D C,Yang Q,et al.,Activation of Naringeninand Kaempferol through Pathway Refactoring in the Endophyte PhomopsisLiquidambaris.ACS Synthetic Biology,2021.)
发明内容
为了探索在公认安全微生物(GRAS)比如酿酒酵母、乳酸菌、枯草芽孢杆菌等微生物中通过发酵来生产柚皮素的可行性,我们尝试着在一些微生物宿主中增强或者导入外源芳香族氨基酸合成模块(用于提高苯丙氨酸和酪氨酸合成能力)、香豆酸和桂皮酸生成模块(用于建立苯丙氨酸代谢和酪氨酸双途径)和柚皮素合成模块(用于构建柚皮素合成途径),并且/或者弱化分支代谢途径(例如敲除编码苯丙酮酸脱羧酶aro10的基因和/或编码丙酮酸脱羧酶PDC5的基因),终于在宿主酿酒酵母BY4742中实现了本发明目的,构建出了一株产柚皮素工程菌。更有利的是,该工程菌的发酵液中无苯丙氨酸和酪氨酸积累。具体而言,本发明包括如下技术方案。
一种构建柚皮素生产菌的方法,其包括以下步骤:
A.构建表达芳香族氨基酸合成模块的质粒A,该质粒A用于中断酿酒酵母基因组中编码苯丙酮酸脱羧酶aro10的aro10基因,并且在酿酒酵母基因组中aro10基因位点整合编码DHAP合成酶aro4的aro4基因、编码分支酸变位酶aro7的aro7基因、编码五功能蛋白aro1的aro1基因、编码分支酸合酶aro2的aro2基因;芳香族氨基酸合成模块用于建立苯丙氨酸代谢和酪氨酸双途径;
B.构建表达香豆酸和桂皮酸生成模块的质粒B,该质粒B用于中断酿酒酵母基因组中编码丙酮酸脱羧酶PDC5的pdc5基因,并且在酿酒酵母基因组中pdc5基因位点整合编码预苯酸脱氢酶TYR1的tyr1基因、编码苯丙氨酸解氨酶PAL2的PAL2基因、编码酪氨酸解氨酶TAL的TAL基因;该香豆酸和桂皮酸生成模块用于建立苯丙氨酸代谢和酪氨酸双途径;
C.构建表达柚皮素合成模块的质粒C,该质粒C用于在酿酒酵母基因组中整合编码肉桂酸-4-羟化酶C4H的C4H基因、编码4-香豆酸-CoA连接酶4CL的4CL基因、编码查尔酮黄烷酮异构酶CHI的CHI基因、编码查尔酮合酶CHS的CHS基因;该柚皮素合成模块用于构建柚皮素合成途径,从而打通柚皮素从头合成的生物合成路线;
D.将步骤A中得到的质粒A、步骤B中得到的质粒B、步骤C中得到的质粒C分别地、或者两种以上同时地转化入酿酒酵母感受态细胞中,从转化子中筛选得到阳性克隆;
E.从阳性克隆中筛选出通过发酵生产柚皮素的工程菌。
可选地,步骤A中所述aro10基因和/或步骤B中所述pdc5基因的中断比如敲除可以通过基因编辑技术实施。
上述基因编辑技术可以采用CRISPR-Cas9系统、CRISPR-Cpf1系统、CRISPR-Cas相关的转座系统INTEGRATE系统或者CAST系统。
上述INTEGRATE系统是指Sam Sternberg研究组开发的基因编辑工具(Insertionof transposable elements by guide RNA-assisted targeting,引导RNA辅助靶向的转座元件插入);CAST系统是指张锋研究组开发的基因编辑工具(CRISPR-associatedtransposase,CRISPR相关转座酶)。
优选地,上述步骤D中宿主酿酒酵母可以是酿酒酵母BY4742。
在一种优选的实施方式中,步骤A中所述DHAP合成酶(NCBI-Gene ID:852551,含K229L突变)、分支酸变位酶(NCBI Gene ID:856173,含G141S突变)、五功能蛋白(NCBI GeneID:851705)、分支酸合酶(NCBI Gene ID:852729)都是来源于酿酒酵母(Saccharomycescerevisiae)自身,它们的编码基因分别简写为SCaro4K229L、SCaro7G141S、SCaro1、SCaro2,所述芳香族氨基酸合成模块表示为Δaro10::SCaro4K229L-SCaro7G141S-SCaro1-SCaro2;并且/或者
步骤B中所述预苯酸脱氢酶TYR1来源于酿酒酵母自身(NCBI Gene ID:852464),苯丙氨酸解氨酶来源于拟南芥(Arabidopsis thaliana)(NCBI Gene ID:824493)、酪氨酸解氨酶来源于约氏黄杆菌(Flavobacterium johnsoniaeu)(GenBank:KR095306.1),它们的编码基因分别简写为SCtyr1、atPAL2、FjTAL,所述香豆酸和桂皮酸生成模块表示为Δpdc5::SCtyr1-atPAL2-FjTAL;并且/或者
步骤C中所述肉桂酸-4-羟化酶来源于拟南芥(NCBI Gene ID:817599),4-香豆酸-CoA连接酶来源于拟南芥(NCBI Gene ID:841593,含I250L和I461V突变),查尔酮黄烷酮异构酶来源于拟南芥(NCBI Gene ID:824678),查尔酮合酶来源于矮牵牛(GenBank:KF765781.1),它们的编码基因分别简写为atC4H、at4CLm、atCHI、PhCHS,所述柚皮素合成模块表示为atC4H-at4CLm-atCHI-PhCHS。
上述质粒A的骨架质粒是载体pUC或者pYES2,但不限于此,可以是任何适合于在酿酒酵母中表达的质粒;质粒B的骨架质粒是载体pUC或者pYES2,但不限于此,可以是任何适合于在酿酒酵母中表达的质粒;质粒C的骨架质粒是载体pUC或者pYES2,但不限于此,可以是任何适合于在酿酒酵母中表达的质粒。
上述质粒A的核苷酸序列优选为SEQ ID NO:1,在本文中称为pUC-aro4712。
上述质粒B的核苷酸序列优选为SEQ ID NO:2,在本文中称为pUC-TPT;
上述质粒C的核苷酸序列优选为SEQ ID NO:4,在本文中称为pYES2-C4CC。
在一种实施方式中,步骤D中质粒A、质粒B和质粒C对于酿酒酵母细胞的转化可以分步骤转化,例如先将质粒A转入酿酒酵母比如酿酒酵母BY4742感受态细胞中,得到菌株A,基因型为BY4742(Δaro10::aro4-aro7-aro1-aro2);在采用基因编辑技术的情况下,基因型为BY4742(Δaro10::aro4-aro7-aro1-aro2/pHCas-Nour)。再将质粒B转入菌株A,得到菌株B,基因型为BY4742(Δaro10::aro4-aro7-aro1-aro2ΔPDC5::scTYR1-atPAL2-fjTAL)。然后将质粒C转入菌株B,得到菌株C,基因型为BY4742(Δaro10::aro4-aro7-aro1-aro2ΔPDC5::scTYR1-atPAL2–fjTAL/pYES2-CTCC)。
优选地,上述步骤E中筛选出的工程菌发酵时不产生苯丙氨酸和/或酪氨酸。
本发明的第二个方面在于提供一种柚皮素生产菌,其通过上述的方法构建得到。
本发明的第三个方面在于提供上述柚皮素生产菌在生产柚皮素中的应用。具体地,通过菌株的发酵来生产柚皮素。
本发明在公认的安全模式微生物酿酒酵母中构建出了完整的从葡萄糖至柚皮素的途径,得到的工程菌能够通过发酵直接生产出柚皮素,易于被广大食品和药品销售商和消费者所接受,以酿酒酵母BY4742为宿主构建的产柚皮素工程菌的发酵液中柚皮素含量高达850mg/L,并且检测不到苯丙氨酸和酪氨酸,值得进一步开发并探索工业化生产的可行性。
附图说明
图1是本发明构建的酿酒酵母中生物合成柚皮素的代谢路线图。
图2是本发明构建的整合质粒pUC-aro4712图谱,其核苷酸序列为SEQ ID NO:1,由南京金唯智生物技术有限公司合成。
图3是本发明构建的整合质粒pUC-TPT图谱,其核苷酸序列为SEQ ID NO:2,由南京金唯智生物技术有限公司合成。
图4是本发明构建的合成质粒pUC-C4CC图谱,其核苷酸序列为SEQ ID NO:3,由南京金唯智生物技术有限公司合成。
图5是本发明构建的穿梭质粒pYES2-C4CC图谱,其核苷酸序列为SEQ ID NO:4,由浙江华睿生物技术有限公司研发中心构建。
具体实施方式
本发明根据以酿酒酵母为出发菌株,运用代谢工程对其基因组进行改造,构建了柚皮素合成路线(如图1所示),使得酿酒酵母可通过发酵将葡萄糖经过一系列生化反应合成柚皮素,并排出体外。
在上述的工程菌构建步骤A和B中,术语“中断(disruption)”,是某一个基因的功能丧失或者大幅度下调即基因敲减,包括该基因的敲除、突变或缺失,一般可以通过敲除、插入失活、移码突变、引入终止密码子提前翻译终止等常规的基因操作手段实现。
在步骤D中,质粒A、质粒B和质粒C可以分步骤地转化入宿主感受态细胞中,逐步完成基因组的改造;也可以两种质粒三种质粒一起转化入同一个感受态细胞中。
应理解,在构建本发明的基因工程菌的具体操作中,步骤A、步骤B和步骤C、以及质粒A、质粒B和质粒C的转化步骤的排序并非完全根据英文字母顺序由前到后地固定不变,它们可以交叉、颠倒地操作,只要每个步骤能实现各自的功能、完成宿主细胞基因型的定向改变即可。
在本文中,有时为了描述简便,会将酶比如苯丙酮酸脱羧酶(aro10)蛋白质名称与其编码基因(DNA)名称混用,本领域技术人员应能理解它们在不同描述场合表示不同的物质。例如,对于苯丙酮酸脱羧酶,用于描述苯丙酮酸脱羧酶功能或类别时,指的是aro10蛋白质;在作为一种基因描述时,指的是编码该酶的基因aro10,以此类推,这是本领域技术人员容易理解的。
在本文中,对于本发明,术语“柚皮素生产菌”、“基因工程菌”、“柚皮素工程菌”,表示相同的含义,可以互换使用,例如表示构建的酿酒酵母工程菌BY4742(Δaro10::aro4-aro7-aro1-aro2ΔPDC5::scTYR1-atPAL2–fjTAL/pYES2-CTCC)。
为了在酿酒酵母比如BY4742中最佳地表达外源的酶比如atC4H、PhCHS等,本发明对其表达基因进行了密码子优化。
密码子优化是可用于通过增加感兴趣基因的翻译效率使生物体中蛋白质表达最大化的一种技术。不同的生物体由于突变倾向和天然选择而通常示出对于编码相同氨基酸的一些密码子之一的特殊偏好性。例如,在选定的微生物如酿酒酵母中,优化密码子反映出其各自的基因组tRNA库的组成。因此,在生长快速的微生物中,氨基酸的低频率密码子可以用用于相同氨基酸的但高频率的密码子置换。因此,优化的DNA序列的表达在选定的微生物中得以改良。
经过密码子优化,设计了质粒pUC-aro4712、pUC-TPT、pUC-C4CC和pYES2-C4CC的核苷酸序列SEQ ID NOs:1-4,通过转化酿酒酵母BY4742,终于构建得到了产柚皮素、但几乎不产苯丙氨酸和酪氨酸的工程菌。有趣的是,当转入上述质粒pUC-aro4712、pUC-TPT、pUC-C4CC和pYES2-C4CC的宿主菌是另一个酿酒酵母亚种CCTCC M94055时,构建得到的转化子的发酵产物中不仅包含柚皮素,还包含苯丙氨酸和酪氨酸,可能与菌株内的酶系差异导致柚皮素代谢途径不同有关,具体原因有待于进一步研究。
以下结合具体实施例对本发明做进一步详细说明。应理解,以下实施例仅用于说明本发明而非用于限定本发明的范围。
实施例
本文中涉及到多种物质的添加量、含量及浓度,其中所述的百分含量,除特别说明外,皆指质量百分含量。
材料和方法
实施例中的基因合成、引物合成及测序皆委托南京金唯智生物技术有限公司完成。
本文中的分子生物学实验包括质粒构建、酶切、感受态细胞制备、转化等主要参照《分子克隆实验指南》(第三版),J.萨姆布鲁克,D.W.拉塞尔(美)编著,黄培堂等译,科学出版社,北京,2002)进行。比如感受态细胞转化方法及感受态制备方法均参照《分子克隆实验指南》(第三版)第1章96页进行。必要时可以通过简单试验确定具体实验条件。
PCR扩增实验根据质粒或DNA模板供应商提供的反应条件或试剂盒说明书进行。必要时可以通过简单试验予以调整。
培养基:
LB培养基:5g/L酵母提取物,10g/L胰蛋白胨,10g/L氯化钠。(LB固体培养基另加20g/L琼脂粉。)
YPD20培养基:10g/L酵母提取物,20g/L胰蛋白胨,20g/L葡萄糖。(固体培养基另加20g/L琼脂粉。)
2×YPAD培养基:20g/L酵母提取物,40g/L胰蛋白胨,20g/L葡萄糖。
腺嘌呤:80mg/L。121℃灭菌20min。将葡萄糖制备400g/L母液,115℃15min灭菌,2.5mL分装备用。
YNB-URA培养基:YNB 6.7g;腺嘌呤0.3g;Dropout(不含尿嘧啶)1.26g。115℃,15min灭菌;固体培养基加入20g琼脂粉。
摇瓶发酵培养基(1L):YNB 6.7g,Dropout(不含尿嘧啶)1.26g;40g葡萄糖,20gCaCO3
20X电转母液:80g/L甘氨酸,2%吐温80。
以下实施例中,使用含氨苄霉素(amp)培养基时,所述抗生素在培养基中的终浓度为100μg/ml;使用诺尔丝菌素(Nour)和潮霉素(hyg)时终浓度为200μg/ml。
柚皮素的HPLC测定条件:
Figure BDA0003278882540000071
以下实施例中使用的引物序列信息如下表所示。
表1、引物序列
引物 序列(5’→3’)
pYES-gRNA-aro10-F cagtgaaagataaatgatcACATCAATGAAATTAATAAgttttagagctagaaatagc
gRNA-univer-R gatcatttatctttcactgcggagaagtttcgaacgccgaaacatg
pYES-gRNA-PDC5-F cagtgaaagataaatgatcGCTGATTTGATATTGTCTATgttttagagctagaaatagc
pYES2-XhoI-F Ccgctcgagatttaaatatttgcttatacaatcttc
pYES2-NotI-R ataagaatgcggccgcacgcgtgtacgcatgtaacattatac
pYES2-C4CC-V-F gccgcaaattaaagccttcgag
pYES2-C4CC-V-R CTCAACACCTTGTGTAAGAAGC
Aro10-V-F CAGTTTTCGGTGTTCCTGGTGAC
Aro10-V-R CTCTCATAATGATGGATAAATC
PDC5-V-F GAACGCTGCCTATGCTGCTGATG
PDC5-V-R CATCATAATATTCTTCCCTATC
表1中,名称中的“-F”代表正向;“-R”代表反向。
实施例中所使用的出发菌株酿酒酵母BY4742由上海工业生物技术研发中心惠赠;质粒pUC-aro4712、pUC-TPT、pUC-C4CC由浙江华睿生物技术有限公司设计,委托南京金唯智生物技术有限公司合成;质粒pYES2-C4CC由浙江华睿生物技术有限公司构建。任何单位和个人都可以获得这些质粒用于验证本发明,但未经允许不得用作其他用途,包括开发利用、新药申报、科学研究和教学。
实施例1:相关质粒构建
1.1构建用于CRISPR-Cas9系统的质粒pYES2-gRNA-aro10、pYES2-gRNA-PDC5:以pYES2-gRNA-MCS(addgene DI:107734,中国科学院分子植物科学卓越创新中心杨晟研究员惠赠)为模板,使用引物对pYES-gRNA-aro10-F/gRNA-univer-R进行PCR环扩,PCR扩增长度6kb。
PCR扩增体系为:KOD FX 1μl、2×KOD FX buffer 25μl、dNTP 3μl、模板质粒0.5μl、上下游引物各1μl(50μM)、ddH2O 18.5μl。
PCR扩增程序为:95℃5min;94℃30s,60℃30s,68℃6min,31个循环;68℃10min;16℃10min。
PCR产物用DpnI酶37℃消化2h,然后取3μl转化DH5α感受态细胞(上海唯地生物),涂布amp平板,37℃培养箱中培养过夜,选取3个转化子抽质粒送测序,获得pYES2-gRNA-aro10测序阳性质粒,用于进行下步反应。
同样方法,使用引物对pYES-gRNA-PDC5-F/gRNA-univer-R进行pYES2-gRNA-PDC5质粒构建。
1.2构建整合质粒pUC-aro4712
设计整合质粒pUC-aro4712的结构,如图1所示,核苷酸序列为SEQ ID NO:1,委托南京金唯智生物技术有限公司合成。pUC-aro4712质粒用限制性内切酶NotI线性化,线性化片段直接过柱回收备用。pUC-aro4712质粒用于aro10基因(编码苯丙酮酸脱羧酶,NCBIGene ID:851987)敲除,同时在aro10位点整合aro4K229L基因(编码DHAP合成酶,NCBI GeneID:852551)、aro7G141S基因(编码分支酸变位酶,NCBI Gene ID:856173)、aro1基因(编码五功能蛋白,NCBI Gene ID:851705)、aro2基因(编码分支酸合酶,NCBI Gene ID:852729)。
1.3构建整合质粒pUC-TPT
设计整合质粒pUC-TPT的结构,如图2所示,核苷酸序列为SEQ ID NO:2,委托南京金唯智生物技术有限公司合成。pUC-TPT质粒用限制性内切酶NotI线性化,线性化片段直接过柱回收备用。pUC-TPT质粒用于丙酮酸脱羧酶(PDC5,NCBI Gene ID:850825)基因敲除,同时在此位点过表达酿酒酵母自身的预苯酸脱氢酶(TYR1,NCBI Gene ID:852464)、拟南芥来源苯丙氨酸解氨酶(atPAL2,NCBI Gene ID:824493)和来源于约氏黄杆菌(Flavobacteriumjohnsoniaeu)来源的酪氨酸解氨酶(fjTAL,GenBank:KR095306.1)。
1.4构建合成质粒pUC-C4CC
设计合成质粒pUC-C4CC的结构,如图3所示,核苷酸序列为SEQ ID NO:3,委托南京金唯智生物技术有限公司合成。质粒pUC-C4CC用于提供C4CC片段,用来构建穿梭质粒pYES2-C4CC。
1.5构建穿梭质粒pYES2-C4CC
质粒pUC-C4CC用NotI和XhoI酶切,回收9.8kb大小C4CC片段。以pYES2质粒为模板,使用引物对pYES2-XhoI-F/pYES2-NotI-R进行PCR扩增,得4.6kb大小片段,琼脂糖凝胶回收(Axygen DNA凝胶回收试剂盒AP-GX-50),用NotI和XhoI酶切,直接过柱回收,得pYES2酶切片段。上述C4CC片段和pYES2酶切片段用T4连接酶连接,转化DH5α感受态细胞(上海唯地生物)。转化子用引物对pYES2-C4CC-V-F/pYES2-C4CC-V-R进行鉴定,阳性质粒可扩增出800bp片段,为穿梭质粒pYES2-C4CC。pYES2-C4CC主要用于过表达拟南芥来源的肉桂酸-4-羟化酶(atC4H,NCBI Gene ID:817599)、拟南芥来源的4-香豆酸-CoA连接酶(at4CL,NCBI GeneID:841593,I250L和I461V氨基酸突变)、拟南芥来源的查尔酮黄烷酮异构酶(atCHI,NCBIGene ID:824678)和矮牵牛来源的查尔酮合酶(CHS,GenBank:KF765781.1),核苷酸序列是SEQ ID NO:4,其结构参见图4。
实施例2:基因工程菌构建
2.1感受态细胞制备
使用质粒pHCas-Nour(addgene ID:107733,中国科学院分子植物科学卓越创新中心杨晟研究员惠赠)转化酿酒酵母BY4742,阳性克隆BY4742/pHCas-Nour菌种YPD20平板(诺尔丝抗性,200μg/ml)划线;挑单克隆到YPD20(Nour,200μg/ml)试管过夜培养;按初始OD600=0.1-0.2转接入50ml 2×YPAD摇瓶,30℃,240rpm培养至OD600为0.8-1.0,约4-5小时;20℃,3000g离心5min,收集菌体,用1/2体积无菌水重悬离心,3000g 5min。1/2体积无菌水再洗一次,离心收菌。用1ml无菌水重悬后转至1.5ml EP管,12000g,30秒收集菌体;用1ml无菌水重悬,每管100微升分装备用。
2.2质粒pUC-aro4712整合
取分装好的酵母菌,12000g离心30s,丢弃上清,按顺序依次加入以下成分:
PEG3350(50%(w/v)),240微升;
LiAc(1M),36微升;
鱼精ssDNA(2mg/L,使用之前100℃放置5min后立马插入冰上),50微升;
DNA(pYES2-gRNA-aro10和pUCaro4712线性化质粒)+H2O,34微升。
涡旋振荡混匀,42℃热激15min;12000g离心30s去上清,加入1ml YPD培养基重悬复苏1~2h,取适量复苏菌液涂抗性平板(Nour+hyg),平板30℃培养,得转化子。
获得的转化子以Aro10-V-F/Aro10-V-R引物对进行菌落PCR鉴定,阳性转化子扩增获得1.24kb大小片段,野生型菌种无条带。阳性菌种在YPD试管连续传代3~5次,YPD平板(Nour抗性)划线,长出单克隆到YPD平板(hyg抗性)鉴定,潮霉素平板不能生长菌株即BY4742(Δaro10::aro4-aro7-aro1-aro2)/pHCas-Nour,进行下步构建。
2.3质粒pUC-TPT整合
pUC-TPT质粒用限制性内切酶NotI线性化,线性化片段直接过柱回收备用,将其整合入菌株BY4742(Δaro10::aro4-aro7-aro1-aro2)/pHCas-Nour中,使用引物对PDC5-V-F/PDC5-V-R进行菌落PCR鉴定,阳性转化子出现1.09kb条带,野生型菌种无条带。相同的方法获得BY4742(Δaro10::aro4-aro7-aro1-aro2ΔPDC5::scTYR1-atPAL2-fjTAL)菌株。
2.4穿梭质粒pYES2-C4CC转化
将pYES2-C4CC质粒电转入BY4742(Δaro10::aro4-aro7-aro1-aro2ΔPDC5::scTYR1-atPAL2-fjTAL)菌株中,转化子涂布YNB-URA3平板筛选,得到阳性克隆,其基因型为BY4742(Δaro10::aro4-aro7-aro1-aro2ΔPDC5::scTYR1-atPAL2-fjTAL)/pYES2-CTCC,命名为ZHR5601。
实施例3:工程菌发酵验证实验
将基因工程菌ZHR5601在YNB-URA3平板上培养,挑单克隆到YNB-URA3试管,30℃过夜培养,然后按照10v/v%接种量转接到摇瓶发酵培养基中,30℃230rpm振荡培养48h,取样进行HPLC检测,发酵液中柚皮素含量高达850mg/L,并且检测不到苯丙氨酸和酪氨酸。提示工程菌ZHR5601的柚皮素代谢途径构建成功,发酵结果甚至超过预期。
应理解,上述实施例仅用于举例说明目的,而不是对本发明的限制。本领域技术人员在阅读了本发明的构思之后,对其做出的各种改变或调整,均应落入本发明的保护范围内,这些等价形式同样属于权利要求书限定的范围。
序列表
<110> 浙江华睿生物技术有限公司
<120> 一种产柚皮素的酿酒酵母菌
<130> SHPI2110336
<160> 4
<170> SIPOSequenceListing 1.0
<210> 1
<211> 14129
<212> DNA
<213> 人工序列()
<400> 1
gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg 60
tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa 120
agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg 180
cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga 240
ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg 300
tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg 360
gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc 420
gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg 480
gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca 540
ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt 600
ggcctaacta cggctacact agaagaacag tatttggtat ctgcgctctg ctgaagccag 660
ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg 720
gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc 780
ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt 840
tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt 900
ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca 960
gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg 1020
tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac 1080
cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg 1140
ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc 1200
gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta 1260
caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac 1320
gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc 1380
ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac 1440
tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact 1500
caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa 1560
tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt 1620
cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca 1680
ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa 1740
aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac 1800
tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg 1860
gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc 1920
gaaaagtgcc acctgacgtc ctatacaagc gcattgacgt ttctaaactt tctttgcaat 1980
atgattcaaa tgtaactcaa tatacgaacg aaacaatgcg gttagaagat cctaccaatg 2040
gacaatcaag cattattaca caagttcact tacaaaagac gatgcctaaa tttttgaacc 2100
ctggtgatgt tgtcgtttgt gaaacaggct cttttcaatt ctctgttcgt gatttcgcgt 2160
ttccttcgca attaaaatat atatcgcaag gatttttcct ttccattggc atggcccttc 2220
ctgccgccct aggtgttgga attgccatgc aagaccactc aaacgctcac atcaatggtg 2280
gcggccgcgc gttcaagtaa ctttagtgat cggaacctac atcatttggt cccacagcta 2340
catgattcaa attttaaagg gccaaatcat aaagtatatc atgatatggt aaaagataga 2400
gtcgcttgct cggtagccta cttggaggat attgaaactg catgtgacca agtcgataat 2460
gttatccgcg atatttacaa gtattctaaa cctggttata tttttgttcc tgcagatttt 2520
gcggatatgt ctgttacatg tgataatttg gttaatgttc cacgtatatc tcaacaagat 2580
tgtatagtat acccttctga aaaccaattg tctgacataa tcaacaagcc cacacaccat 2640
agcttcaaaa tgtttctact ccttttttac tcttccagat tttctcggac tccgcgcatc 2700
gccgtaccac ttcaaaacac ccaagcacag catactaaat ttcccctctt tcttcctcta 2760
gggtgtcgtt aattacccgt actaaaggtt tggaaaagaa aaaagagacc gcctcgtttc 2820
tttttcttcg tcgaaaaagg caataaaaat ttttatcacg tttctttttc ttgaaaattt 2880
ttttttttga tttttttctc tttcgatgac ctcccattga tatttaagtt aataaacggt 2940
cttcaatttc tcaagtttca gtttcatttt tcttgttcta ttacaacttt ttttacttct 3000
tgctcattag aaagaaagca tagcaatcta atctaagttt taattacaaa atgagtgaat 3060
ctccaatgtt cgctgccaac ggcatgccaa aggtaaatca aggtgctgaa gaagatgtca 3120
gaattttagg ttacgaccca ttagcttctc cagctctcct tcaagtgcaa atcccagcca 3180
caccaacttc tttggaaact gccaagagag gtagaagaga agctatagat attattaccg 3240
gtaaagacga cagagttctt gtcattgtcg gtccttgttc catccatgat ctagaagccg 3300
ctcaagaata cgctttgaga ttaaagaaat tgtcagatga attaaaaggt gatttatcca 3360
tcattatgag agcatacttg gagaagccaa gaacaaccgt cggctggaaa ggtctaatta 3420
atgaccctga tgttaacaac actttcaaca tcaacaaggg tttgcaatcc gctagacaat 3480
tgtttgtcaa cttgacaaat atcggtttgc caattggttc tgaaatgctt gataccattt 3540
ctcctcaata cttggctgat ttggtctcct tcggtgccat tggtgccaga accaccgaat 3600
ctcaactgca cagagaattg gcctccggtt tgtctttccc agttggtttc aagaacggta 3660
ccgatggtac cttaaatgtt gctgtggatg cttgtcaagc cgctgctcat tctcaccatt 3720
tcatgggtgt tactttgcat ggtgttgctg ctatcaccac tactaagggt aacgaacact 3780
gcttcgttat tctaagaggt ggtaaaaagg gtaccaacta cgacgctaag tccgttgcag 3840
aagctaaggc tcaattgcct gccggttcca acggtctaat gattgactac tctcacggta 3900
actccaataa ggatttcaga aaccaaccaa aggtcaatga cgttgtttgt gagcaaatcg 3960
ctaacggtga aaacgccatt accggtgtca tgattgaatc aaacatcaac gaaggtaacc 4020
aaggcatccc agccgaaggt aaagccggct tgaaatatgg tgtttccatc actgatgctt 4080
gtataggttg ggaaactact gaagacgtct tgaggaaatt ggctgctgct gtcagacaaa 4140
gaagagaagt taacaagaaa tagcatgtaa ttagttatgt cacgcttaca ttcacgccct 4200
ccccccacat ccgctctaac cgaaaaggaa ggagttagac aacctgaagt ctaggtccct 4260
atttattttt ttatagttat gttagtatta agaacgttat ttatatttca aatttttctt 4320
ttttttctgt acagacgcgt gtacgcatgt aacattatac tgaaaacctt gcttgagaag 4380
gttttgggac gctcgaaggc tttacgcaca gatattataa catctgcata ataggcattt 4440
gcaagaatta ctcgtgagta aggaaagagt gaggaactat cgcatacctg catttaaaga 4500
tgccgatttg ggcgcgaatc ctttattttg gcttcaccct catactatta tcagggccag 4560
aaaaaggaag tgtttccctc cttcttgaat tgatgttacc ctcataaagc acgtggcctc 4620
ttatcgagaa agaaattacc gtcgctcgtg atttgtttgc aaaaagaaca aaactgaaaa 4680
aacccagaca cgctcgactt cctgtcttcc tattgattgc agcttccaat ttcgtcacac 4740
aacaaggtcc tagcgacggc tcacaggttt tgtaacaagc aatcgaaggt tctggaatgg 4800
cgggaaaggg tttagtacca catgctatga tgcccactgt gatctccaga gcaaagttcg 4860
ttcgatcgta ctgttactct ctctctttca aacagaattg tccgaatcgt gtgacaacaa 4920
cagcctgttc tcacacactc ttttcttcta accaaggggg tggtttagtt tagtagaacc 4980
tcgtgaaact tacatttaca tatatataaa cttgcataaa ttggtcaatg caagaaatac 5040
atatttggtc ttttctaatt cgtagttttt caagttctta gatgctttct ttttctcttt 5100
tttacagatc atcaaggaag taattatcta ctttttacaa caaatataaa acaatggatt 5160
tcacaaaacc agaaactgtt ttaaatctac aaaatattag agatgaatta gttagaatgg 5220
aggattcgat catcttcaaa tttattgaga ggtcgcattt cgccacatgt ccttcagttt 5280
atgaggcaaa ccatccaggt ttagaaattc cgaattttaa aggatctttc ttggattggg 5340
ctctttcaaa tcttgaaatt gcgcattctc gcatcagaag attcgaatca cctgatgaaa 5400
ctcccttctt tcctgacaag attcagaaat cattcttacc gagcattaac tacccacaaa 5460
ttttggcgcc ttatgcccca gaagttaatt acaatgataa aataaaaaaa gtttatattg 5520
aaaagattat accattaatt tcgaaaagag atggtgatga taagaataac ttctgttctg 5580
ttgccactag agatatagaa tgtttgcaaa gcttgagtag gagaatccac tttggcaagt 5640
ttgttgctga agccaagttc caatcggata tcccgctata cacaaagctg atcaaaagta 5700
aagatgtcga ggggataatg aagaatatca ccaattctgc cgttgaagaa aagattctag 5760
aaagattaac taagaaggct gaagtctatg gtgtggaccc taccaacgag tcaggtgaaa 5820
gaaggattac tccagaatat ttggtaaaaa tttataagga aattgttata cctatcacta 5880
aggaagttga ggtggaatac ttgctaagaa ggttggaaga gtaagcgaat ttcttatgat 5940
ttatgatttt tattattaaa taagttataa aaaaaataag tgtatacaaa ttttaaagtg 6000
actcttaggt tttaaaacga aaattcttat tcttgagtaa ctctttcctg taggtcaggt 6060
tgctttctca ggtatagcat gaggtcgctc ttattgacca cacctctacc ggcatgccga 6120
gcaaatgcct gcaaatcgct ccccatttca cccaattgta gatatgctaa ctccagcaat 6180
gagttgatga atctcggtgt gtattttatg tcctcagagg acaacacctg ttgtaatcgt 6240
tcttccacac ggatccacag cctagccttc agttgggctc tatcttcatc gtcttacaca 6300
gaatatataa catcgtaggt gtctgggtga acagtttatt cctggcatcc actaaatata 6360
atggagcccg ctttttaagc tggcatccag aaaaaaaaag aatcccagca ccaaaatatt 6420
gttttcttca ccaaccatca gttcataggt ccattctctt agcgcaacta cagagaacag 6480
gggcacaaac aggcaaaaaa cgggcacaac ctcaatggag tgatgcaacc tgcctggagt 6540
aaatgatgac acaaggcaat tgacccacgc atgtatctat ctcattttct tacaccttct 6600
attaccttct gctctctctg atttggaaaa agctgaaaaa aaaggttgaa accagttccc 6660
tgaaattatt cccctacttg actaataagt atataaagac ggtaggtatt gattgtaatt 6720
ctgtaaatct atttcttaaa cttcttaaat tctactttta tagttagtct tttttttagt 6780
tttaaaacac caagaactta gtttcgaata aacacacata aacaaacaaa atggtgcagt 6840
tagccaaagt cccaattcta ggaaatgata ttatccacgt tgggtataac attcatgacc 6900
atttggttga aaccataatt aaacattgtc cttcttcgac atacgttatt tgcaatgata 6960
cgaacttgag taaagttcca tactaccagc aattagtcct ggaattcaag gcttctttgc 7020
cagaaggctc tcgtttactt acttatgttg ttaaaccagg tgagacaagt aaaagtagag 7080
aaaccaaagc gcagctagaa gattatcttt tagtggaagg atgtactcgt gatacggtta 7140
tggtagcgat cggtggtggt gttattggtg acatgattgg gttcgttgca tctacattta 7200
tgagaggtgt tcgtgttgtc caagtaccaa catccttatt ggcaatggtc gattcctcca 7260
ttggtggtaa aactgctatt gacactcctc taggtaaaaa ctttattggt gcattttggc 7320
aaccaaaatt tgtccttgta gatattaaat ggctagaaac gttagccaag agagagttta 7380
tcaatgggat ggcagaagtt atcaagactg cttgtatttg gaacgctgac gaatttacta 7440
gattagaatc aaacgcttcg ttgttcttaa atgttgttaa tggggcaaaa aatgtcaagg 7500
ttaccaatca attgacaaac gagattgacg agatatcgaa tacagatatt gaagctatgt 7560
tggatcatac atataagtta gttcttgaga gtattaaggt caaagcggaa gttgtctctt 7620
cggatgaacg tgaatccagt ctaagaaacc ttttgaactt cggacattct attggtcatg 7680
cttatgaagc tatactaacc ccacaagcat tacatggtga atgtgtgtcc attggtatgg 7740
ttaaagaggc ggaattatcc cgttatttcg gtattctctc ccctacccaa gttgcacgtc 7800
tatccaagat tttggttgcc tacgggttgc ctgtttcgcc tgatgagaaa tggtttaaag 7860
agctaacctt acataagaaa acaccattgg atatcttatt gaagaaaatg agtattgaca 7920
agaaaaacga gggttccaaa aagaaggtgg tcattttaga aagtattggt aagtgctatg 7980
gtgactccgc tcaatttgtt agcgatgaag acctgagatt tattctaaca gatgaaaccc 8040
tcgtttaccc cttcaaggac atccctgctg atcaacagaa agttgttatc ccccctggtt 8100
ctaagtccat ctccaatcgt gctttaattc ttgctgccct cggtgaaggt caatgtaaaa 8160
tcaagaactt attacattct gatgatacta aacatatgtt aaccgctgtt catgaattga 8220
aaggtgctac gatatcatgg gaagataatg gtgagacggt agtggtggaa ggacatggtg 8280
gttccacatt gtcagcttgt gctgacccct tatatctagg taatgcaggt actgcatcta 8340
gatttttgac ttccttggct gccttggtca attctacttc aagccaaaag tatatcgttt 8400
taactggtaa cgcaagaatg caacaaagac caattgctcc tttggtcgat tctttgcgtg 8460
ctaatggtac taaaattgag tacttgaata atgaaggttc cctgccaatc aaagtttata 8520
ctgattcggt attcaaaggt ggtagaattg aattagctgc tacagtttct tctcagtacg 8580
tatcctctat cttgatgtgt gccccatacg ctgaagaacc tgtaactttg gctcttgttg 8640
gtggtaagcc aatctctaaa ttgtacgtcg atatgacaat aaaaatgatg gaaaaattcg 8700
gtatcaatgt tgaaacttct actacagaac cttacactta ttatattcca aagggacatt 8760
atattaaccc atcagaatac gtcattgaaa gtgatgcctc aagtgctaca tacccattgg 8820
ccttcgccgc aatgactggt actaccgtaa cggttccaaa cattggtttt gagtcgttac 8880
aaggtgatgc cagatttgca agagatgtct tgaaacctat gggttgtaaa ataactcaaa 8940
cggcaacttc aactactgtt tcgggtcctc ctgtaggtac tttaaagcca ttaaaacatg 9000
ttgatatgga gccaatgact gatgcgttct taactgcatg tgttgttgcc gctatttcgc 9060
acgacagtga tccaaattct gcaaatacaa ccaccattga aggtattgca aaccagcgtg 9120
tcaaagagtg taacagaatt ttggccatgg ctacagagct cgccaaattt ggcgtcaaaa 9180
ctacagaatt accagatggt attcaagtcc atggtttaaa ctcgataaaa gatttgaagg 9240
ttccttccga ctcttctgga cctgtcggtg tatgcacata tgatgatcat cgtgtggcca 9300
tgagtttctc gcttcttgca ggaatggtaa attctcaaaa tgaacgtgac gaagttgcta 9360
atcctgtaag aatacttgaa agacattgta ctggtaaaac ctggcctggc tggtgggatg 9420
tgttacattc cgaactaggt gccaaattag atggtgcaga acctttagag tgcacatcca 9480
aaaagaactc aaagaaaagc gttgtcatta ttggcatgag agcagctggc aaaactacta 9540
taagtaaatg gtgcgcatcc gctctgggtt acaaattagt tgacctagac gagctgtttg 9600
agcaacagca taacaatcaa agtgttaaac aatttgttgt ggagaacggt tgggagaagt 9660
tccgtgagga agaaacaaga attttcaagg aagttattca aaattacggc gatgatggat 9720
atgttttctc aacaggtggc ggtattgttg aaagcgctga gtctagaaaa gccttaaaag 9780
attttgcctc atcaggtgga tacgttttac acttacatag ggatattgag gagacaattg 9840
tctttttaca aagtgatcct tcaagacctg cctatgtgga agaaattcgt gaagtttgga 9900
acagaaggga ggggtggtat aaagaatgct caaatttctc tttctttgct cctcattgct 9960
ccgcagaagc tgagttccaa gctctaagaa gatcgtttag taagtacatt gcaaccatta 10020
caggtgtcag agaaatagaa attccaagcg gaagatctgc ctttgtgtgt ttaacctttg 10080
atgacttaac tgaacaaact gagaatttga ctccaatctg ttatggttgt gaggctgtag 10140
aggtcagagt agaccatttg gctaattact ctgctgattt cgtgagtaaa cagttatcta 10200
tattgcgtaa agccactgac agtattccta tcatttttac tgtgcgaacc atgaagcaag 10260
gtggcaactt tcctgatgaa gagttcaaaa ccttgagaga gctatacgat attgccttga 10320
agaatggtgt tgaattcctt gacttagaac taactttacc tactgatatc caatatgagg 10380
ttattaacaa aaggggcaac accaagatca ttggttccca tcatgacttc caaggattat 10440
actcctggga cgacgctgaa tgggaaaaca gattcaatca agcgttaact cttgatgtgg 10500
atgttgtaaa atttgtgggt acggctgtta atttcgaaga taatttgaga ctggaacact 10560
ttagggatac acacaagaat aagcctttaa ttgcagttaa tatgacttct aaaggtagca 10620
tttctcgtgt tttgaataat gttttaacac ctgtgacatc agatttattg cctaactccg 10680
ctgcccctgg ccaattgaca gtagcacaaa ttaacaagat gtatacatct atgggaggta 10740
tcgagcctaa ggaactgttt gttgttggaa agccaattgg ccactctaga tcgccaattt 10800
tacataacac tggctatgaa attttaggtt tacctcacaa gttcgataaa tttgaaactg 10860
aatccgcaca attggtgaaa gaaaaacttt tggacggaaa caagaacttt ggcggtgctg 10920
cagtcacaat tcctctgaaa ttagatataa tgcagtacat ggatgaattg actgatgctg 10980
ctaaagttat tggtgctgta aacacagtta taccattggg taacaagaag tttaagggtg 11040
ataataccga ctggttaggt atccgtaatg ccttaattaa caatggcgtt cccgaatatg 11100
ttggtcatac cgctggtttg gttatcggtg caggtggcac ttctagagcc gccctttacg 11160
ccttgcacag tttaggttgc aaaaagatct tcataatcaa caggacaact tcgaaattga 11220
agccattaat agagtcactt ccatctgaat tcaacattat tggaatagag tccactaaat 11280
ctatagaaga gattaaggaa cacgttggcg ttgctgtcag ctgtgtacca gccgacaaac 11340
cattagatga cgaactttta agtaagctgg agagattcct tgtgaaaggt gcccatgctg 11400
cttttgtacc aaccttattg gaagccgcat acaaaccaag cgttactccc gttatgacaa 11460
tttcacaaga caaatatcaa tggcacgttg tccctggatc acaaatgtta gtacaccaag 11520
gtgtagctca gtttgaaaag tggacaggat tcaagggccc tttcaaggcc atttttgatg 11580
ccgttacgaa agagtagatt taactcctta agttacttta atgatttagt ttttattatt 11640
aataattcat gctcatgaca tctcatatac acgtttataa aacttaaata gattgaaaat 11700
gtattaaaga ttcctcaggg attcgatttt tttggaagtt tttgtttttt tttccttgag 11760
atgctgtagt atttgggaac aattatacaa tcgaaagata tatgcttaca ttcgaccgtt 11820
ttagccgtga tcatccaact ggcaccgctg gcttgaacaa caataccagc cttccaactt 11880
ctgtaaataa cggcggtacg ccagtgccac cagtaccgtt acctttcggt atacctcctt 11940
tccccatgtt tccaatgccc ttcatgcctc caacggctac tatcacaaat cctcatcaag 12000
ctgacgcaag ccctaagaaa tgaataacaa tactgacagt actaaataat tgcctacttg 12060
gcttcacata cgttgcatac gtcgatatag ataataatga taatgacagc aggattatcg 12120
taatacgtaa tagttgaaaa tctcaaaaat gtgtgggtca ttacgtaaat aatgatagga 12180
atgggattct tctatttttc ctttttccat tctagcagcc gtcgggaaaa cgtggcatcc 12240
tctctttcgg gctcaattgg agtcacgctg ccgtgagcat cctctctttc catatctaac 12300
aactgagcac gtaaccaatg gaaaagcatg agcttagcgt tgctccaaaa aagtattgga 12360
tggttaatac catttgtctg ttctcttctg actttgactc ctcaaaaaaa aaaaatctac 12420
aatcaacaga tcgcttcaat tacgccctca caaaaacttt tttccttctt cttcgcccac 12480
gttaaatttt atccctcatg ttgtctaacg gatttctgca cttgatttat tataaaaaga 12540
caaagacata atacttctct atcaatttca gttattgttc ttccttgcgt tattcttctg 12600
ttcttctttt tcttttgtca tatataacca taaccaagta atacatattc aaaatgtcaa 12660
cgtttgggaa actgttccgc gtcaccacat atggtgaatc gcattgtaag tctgtcggtt 12720
gcattgtcga cggtgttcct ccaggaatgt cattaaccga agctgacatt cagccacaat 12780
tgaccagaag aagaccgggt caatctaagc tatcgacccc tagagacgaa aaggatagag 12840
tggaaatcca gtccggtacc gagttcggca agactctagg tacacccatc gccatgatga 12900
tcaaaaacga ggaccaaaga cctcacgact actccgacat ggacaagttc cctagacctt 12960
cccatgcgga cttcacgtac tcggaaaagt acggtatcaa ggcctcctct ggtggtggca 13020
gagcttctgc tagagaaacg attggccgtg tcgcttcagg tgccattgct gagaagttct 13080
tagctcagaa ctctaatgtc gagatcgtag cctttgtgac acaaatcggg gaaatcaaga 13140
tgaacagaga ctctttcgat cctgaatttc agcatctgtt gaacaccatc accagggaaa 13200
aagtggactc aatgggtcct atcagatgtc cagacgcctc cgttgctggt ttgatggtca 13260
aggaaatcga aaagtacaga ggcaacaagg actctatcgg tggtgtcgtc acttgtgtcg 13320
tgagaaactt gcctaccggt ctcggtgagc catgctttga caagttggaa gccatgttgg 13380
ctcatgctat gttgtccatt ccagcatcca agggtttcga aattggctca ggttttcagg 13440
gtgtctctgt tccagggtcc aagcacaatg acccatttta ctttgaaaaa gaaacaaaca 13500
gattaagaac aaagaccaac aattcaggtg gtgtacaagg tggtatctct aatggtgaga 13560
acatctattt ctctgtccca ttcaagtcag tggccactat ctctcaagaa caaaaaaccg 13620
ccacttacga tggtgaagaa ggtatcttag ccgctaaggg tagacatgac cctgctgtca 13680
ctccaagagc tattcctatt gtggaagcca tgaccgctct ggtgttggct gacgcgcttt 13740
tgatccaaaa ggcaagagat ttctccagat ccgtggttca ttaatgcgtt tgaagtgaga 13800
cgctccatca tctctcttaa tttttcatga ctgacgtttt ttcttcattt taattatcat 13860
agtatttgtt tgaaaaaaaa aaaaaaaaat ttcccttatc aatgatatcc ttacgattat 13920
ataaattcct tacctaaacc tattatttgt gtacatatat cagagtatta ttacatatat 13980
aacctttttc tctaaaacag gaaaaaaaaa agaaaacgat aacatgctct gccatccttt 14040
gttcaccgag caaaattaaa aacgcaaaat gaattgtccc tatgaaatta ttaaaggacc 14100
acatcaccag acttatctct ggggggtcc 14129
<210> 2
<211> 10136
<212> DNA
<213> 人工序列()
<400> 2
gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg 60
tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa 120
agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg 180
cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga 240
ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg 300
tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg 360
gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc 420
gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg 480
gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca 540
ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt 600
ggcctaacta cggctacact agaagaacag tatttggtat ctgcgctctg ctgaagccag 660
ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg 720
gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc 780
ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt 840
tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt 900
ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca 960
gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg 1020
tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac 1080
cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg 1140
ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc 1200
gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta 1260
caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac 1320
gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc 1380
ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac 1440
tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact 1500
caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa 1560
tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt 1620
cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca 1680
ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa 1740
aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac 1800
tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg 1860
gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc 1920
gaaaagtgcc acctgacgtc gtcgtcaagg actacaaacc tgttgctgtc ccagctagag 1980
ttccaattac caagtctact ccagctaaca ctccaatgaa gcaagaatgg atgtggaacc 2040
atttgggtaa cttcttgaga gaaggtgata ttgttattgc tgaaaccggt acttccgcct 2100
tcggtattaa ccaaactact ttcccaacag atgtatacgc tatcgtccaa gtcttgtggg 2160
gttccattgg tttcacagtc ggcgctctat tgggtgctac tatggccgct gaagaacttg 2220
atccaaagaa gagagttatt gcggccgctc cacagaatgt ctgccaacat ttctgaaacc 2280
actgccatga tcactgatat tgctaacgct ccagctgaaa ttgacagatg tatcagaacc 2340
acctacacta cccaaagacc agtctacttg ggtttgccag ctaacttggt tgacttgaac 2400
gtcccagcca agttattgga aactccaatt gacttgtctt tgaagccaaa cgacgctgaa 2460
gctgaagctg aagttgttag aactgttgtt gaattgatca aggatgctaa gaacccagtt 2520
atcttggctg atgcttgtgc ttctagaccc cacacaccat agcttcaaaa tgtttctact 2580
ccttttttac tcttccagat tttctcggac tccgcgcatc gccgtaccac ttcaaaacac 2640
ccaagcacag catactaaat ttcccctctt tcttcctcta gggtgtcgtt aattacccgt 2700
actaaaggtt tggaaaagaa aaaagagacc gcctcgtttc tttttcttcg tcgaaaaagg 2760
caataaaaat ttttatcacg tttctttttc ttgaaaattt ttttttttga tttttttctc 2820
tttcgatgac ctcccattga tatttaagtt aataaacggt cttcaatttc tcaagtttca 2880
gtttcatttt tcttgttcta ttacaacttt ttttacttct tgctcattag aaagaaagca 2940
tagcaatcta atctaagttt taattacaaa atggtatcag aggataagat tgagcaatgg 3000
aaagccacaa aagtcattgg tataattggt ctgggtgata tgggcctatt atacgctaat 3060
aaatttacag atgctggatg gggtgttata tgttgtgata gggaagaata ttatgatgaa 3120
ctgaaagaaa aatatgcctc agctaaattc gaactggtga aaaatggtca tttggtatcc 3180
aggcaaagcg actatattat ctatagtgtt gaagcatcca atattagtaa gatcgtcgca 3240
acgtatggac catcttctaa ggttggaaca attgttgggg gtcaaacgag ttgtaagctg 3300
ccggaaatcg aggctttcga aaagtattta cccaaggact gcgacatcat taccgtgcat 3360
tcccttcatg ggcctaaagt taatactgaa ggccaaccac tagttattat caatcacaga 3420
tcacagtacc cagaatcttt tgagttcgtt aattctgtta tggcatgttt gaaaagtaag 3480
caagtttatt tgacatatga agagcatgac aagattaccg ctgatacaca agctgtgaca 3540
catgctgctt tcttaagtat gggatctgcg tgggcaaaga taaagattta tccttggact 3600
ctgggtgtaa acaaatggta cggtggccta gaaaatgtga aagttaatat atcactaaga 3660
atctattcga acaagtggca tgtttacgca ggattagcca taacaaaccc aagtgcacat 3720
cagcaaattc ttcaatatgc aaccagtgca acagaactat ttagtttaat gatagataac 3780
aaagaacaag aacttactga tagactatta aaagctaagc aatttgtatt tggaaagcat 3840
actggtctct tactattgga tgacacgatt ttagagaaat attcgctatc aaaaagcagc 3900
attggtaaca gcaacaattg caagccagtg ccgaattcac atttatcatt gttggcgatt 3960
gttgattcgt ggtttcaact tggtattgat ccatatgatc atatgatttg ttcgacgcca 4020
ttattcagaa tattcctggg tgtgtccgaa tatctttttt taaaacctgg cttattagaa 4080
cagacaattg atgcagctat ccatgataaa tcattcataa aagatgattt agaatttgtt 4140
atttcggcta gagaatggag ctcggttgtt tcttttgcca attttgatat atacaaaaag 4200
caatttcaga gtgttcaaaa gttctttgag ccaatgcttc cagaggctaa tctcattggc 4260
aacgagatga taaaaaccat tctgagtcat tctagtgacc gttcggccgc tgaaaaaaga 4320
aatacataac atgtaattag ttatgtcacg cttacattca cgccctcccc ccacatccgc 4380
tctaaccgaa aaggaaggag ttagacaacc tgaagtctag gtccctattt atttttttat 4440
agttatgtta gtattaagaa cgttatttat atttcaaatt tttctttttt ttctgtacag 4500
acgcgtgtac gcatgtaaca ttatactgaa aaccttgctt gagaaggttt tgggacgctc 4560
gaaggcttta cgcacagata ttataacatc tgcataatag gcatttgcaa gaattactcg 4620
tgagtaagga aagagtgagg aactatcgca tacctgcatt taaagatgcc gatttgggcg 4680
cgaatccttt attttggctt caccctcata ctattatcag ggccagaaaa aggaagtgtt 4740
tccctccttc ttgaattgat gttaccctca taaagcacgt ggcctcttat cgagaaagaa 4800
attaccgtcg ctcgtgattt gtttgcaaaa agaacaaaac tgaaaaaacc cagacacgct 4860
cgacttcctg tcttcctatt gattgcagct tccaatttcg tcacacaaca aggtcctagc 4920
gacggctcac aggttttgta acaagcaatc gaaggttctg gaatggcggg aaagggttta 4980
gtaccacatg ctatgatgcc cactgtgatc tccagagcaa agttcgttcg atcgtactgt 5040
tactctctct ctttcaaaca gaattgtccg aatcgtgtga caacaacagc ctgttctcac 5100
acactctttt cttctaacca agggggtggt ttagtttagt agaacctcgt gaaacttaca 5160
tttacatata tataaacttg cataaattgg tcaatgcaag aaatacatat ttggtctttt 5220
ctaattcgta gtttttcaag ttcttagatg ctttcttttt ctctttttta cagatcatca 5280
aggaagtaat tatctacttt ttacaacaaa tataaaacaa tggatcaaat tgaagctatg 5340
ctttgtggtg ggggtgaaaa gaccaaagtc gcggttacta cgaaaactct agcagaccca 5400
ctaaactggg gcctagccgc tgaccagatg aaaggtagcc atttagacga ggttaagaaa 5460
atggtggaag agtaccgtcg tccggtcgtg aatttgggtg gcgaaacttt aacaattgga 5520
caggtggctg cgataagcac tgtaggcgga tctgttaagg tagaattggc ggaaacctct 5580
agagctggtg tgaaagcttc ttctgattgg gtgatggaat ccatgaataa gggtactgat 5640
tcctatggcg ttactactgg cttcggagcc actagccata gaaggacaaa gaacggtaca 5700
gcacttcaga ccgaattaat tagattttta aacgcgggta tattcggcaa tactaaggaa 5760
acctgtcata cgctgccaca gtctgccacc agggcggcca tgttggtacg tgtaaatacc 5820
ctattacaag gctattcagg tattaggttc gagattctgg aagctattac ctcattattg 5880
aatcacaaca tttcaccttc attaccactt agaggaacta ttaccgcttc tggcgacctt 5940
gttccattat catatatcgc tggtttgttg acgggtagac caaatagtaa ggcaacggga 6000
ccagatggtg aaagtttgac ggccaaggaa gcttttgaaa aagctggtat ttcaacaggt 6060
tttttcgatt tgcaaccaaa ggaaggctta gccttagtaa acggcacagc ggttggttct 6120
ggtatggcat ctatggtttt atttgaagca aacgtccagg cagtattagc tgaggtactt 6180
tctgcaattt tcgcggaagt aatgtctgga aaaccagaat ttaccgatca tttgactcac 6240
agactgaaac accatccggg gcaaattgaa gcagcagcta ttatggaaca tatcttagac 6300
gggagttctt atatgaaact agcgcaaaaa gtgcatgaaa tggatccatt gcaaaagcca 6360
aaacaagata gatatgctct tagaaccagc ccccaatggt tgggtccaca aatcgaagtg 6420
attagacagg cgactaaatc tatagagaga gaaattaatt cagtcaatga taatccactg 6480
attgatgtaa gtagaaataa agctatccac ggcggtaatt tccagggtac tccaatcggt 6540
gtgtcaatgg ataacacaag actggcaatc gcagccatcg gtaagctaat gtttgcccaa 6600
tttagcgaac ttgtcaatga cttctacaac aacggtttgc caagcaattt gacagcttca 6660
tcaaatccat ctttggacta cggcttcaaa ggtgctgaaa tagccatggc ttcctactgc 6720
tctgaattgc aatacttagc caatcctgta acatctcatg tacaatcagc tgaacaacac 6780
aatcaagatg tcaatagttt gggtttgata tcctctagaa agacatccga ggccgttgat 6840
atcttaaaat taatgtcaac tacctttttg gttggtattt gccaagctgt tgatttacgt 6900
cacttggaag aaaatttgag gcaaacagtt aaaaacactg taagtcaggt cgccaagaaa 6960
gtgcttacaa ctggaattaa tggtgaattg cacccttcta gattctgtga aaaagattta 7020
ttaaaagtgg tagacaggga acaggtcttt acctatgttg acgatccgtg ctcagcaact 7080
tatccattaa tgcaaagatt gaggcaagtc atagttgatc acgcattgag caatggtgaa 7140
acagagaaaa atgctgttac ttcaattttt caaaagattg gtgcatttga agaagagtta 7200
aaggctgttc taccaaagga agttgaggct gctagagctg cctatggtaa tggtactgca 7260
ccgatcccaa accgtatcaa agaatgcaga tcttaccctt tatatagatt tgttagagaa 7320
gagttaggta ccaagttgtt aactggcgaa aaggttgtct cccccggtga agaatttgat 7380
aaggttttta cagctatgtg tgaaggtaag ctgatagatc cgttgatgga ttgtcttaaa 7440
gaatggaacg gagcccccat tccaatttgt taagcgaatt tcttatgatt tatgattttt 7500
attattaaat aagttataaa aaaaataagt gtatacaaat tttaaagtga ctcttaggtt 7560
ttaaaacgaa aattcttatt cttgagtaac tctttcctgt aggtcaggtt gctttctcag 7620
gtatagcatg aggtcgctct tattgaccac acctctaccg gcatgccgag caaatgcctg 7680
caaatcgctc cccatttcac ccaattgtag atatgctaac tccagcaatg agttgatgaa 7740
tctcggtgtg tattttatgt cctcagagga caacacctgt tgtaatcgtt cttccacacg 7800
gatccacagc ctagccttca gttgggctct atcttcatcg tcttacacag aatatataac 7860
atcgtaggtg tctgggtgaa cagtttattc ctggcatcca ctaaatataa tggagcccgc 7920
tttttaagct ggcatccaga aaaaaaaaga atcccagcac caaaatattg ttttcttcac 7980
caaccatcag ttcataggtc cattctctta gcgcaactac agagaacagg ggcacaaaca 8040
ggcaaaaaac gggcacaacc tcaatggagt gatgcaacct gcctggagta aatgatgaca 8100
caaggcaatt gacccacgca tgtatctatc tcattttctt acaccttcta ttaccttctg 8160
ctctctctga tttggaaaaa gctgaaaaaa aaggttgaaa ccagttccct gaaattattc 8220
ccctacttga ctaataagta tataaagacg gtaggtattg attgtaattc tgtaaatcta 8280
tttcttaaac ttcttaaatt ctacttttat agttagtctt ttttttagtt ttaaaacacc 8340
aagaacttag tttcgaataa acacacataa acaaacaaaa tgaatacaat caatgaatat 8400
ctatctttgg aagaattcga ggccataatt ttcggcaatc agaaggtcac catttctgat 8460
gttgtggtca accgtgtaaa cgaatcattc aattttttga aagaattctc aggaaataag 8520
gtaatttatg gtgtcaacac cgggtttgga cccatggctc agtatagaat taaggaatct 8580
gatcagattc aattgcaata caacttaata aggtcacata gttctgggac cggaaaaccc 8640
ttgtcaccgg tgtgtgccaa agctgccatt ctagcaagat taaatacatt atcccttggt 8700
aattcaggag ttcacccctc tgtaattaat ttgatgagtg agttgattaa caaagacatt 8760
actcctctaa tcttcgaaca tggaggagtt ggtgcatctg gggatttggt ccaactgagc 8820
catttagctt tggtgctgat cggcgaagga gaagtctttt ataaaggcga aagaaggcca 8880
accccagagg tttttgaaat cgaaggtctt aaaccaatcc aagttgagat tagagaaggc 8940
ttggcgctaa ttaacggtac ttctgttatg acgggtatcg gcgttgtcaa tgtttatcac 9000
gctaagaagc tgctagactg gtcattaaaa tcctcctgcg caataaatga gttggtccaa 9060
gcttatgacg atcattttag tgccgaattg aatcaaacca aaaggcataa aggtcagcaa 9120
gaaattgctt taaaaatgag acaaaaccta tctgattcaa ctctgattag aaaaagggaa 9180
gatcacttat attctggtga aaatacagaa gaaattttta aggaaaaagt tcaggaatat 9240
tattctctta gatgtgttcc acaaatttta ggtccagtac tagaaactat aaacaatgtt 9300
gccagtattt tggaagatga atttaatagt gcgaatgaca atccaatcat cgatgttaaa 9360
aatcaacacg tttatcatgg tggtaacttt catggagatt atattagctt ggagatggac 9420
aaactgaaaa tcgttattac caaattaact atgttagctg agcgtcaatt aaattacttg 9480
ttaaatagca aaattaacga acttttgcca ccatttgtta atttgggcac attgggtttt 9540
aattttggca tgcaaggcgt ccaatttact gcaacctcaa caaccgctga gtcccaaatg 9600
ttgtctaatc ccatgtatgt acattccatt ccaaacaata acgacaatca agacattgtg 9660
tccatgggta ctaattctgc tgtaattacg tctaaagtta ttgaaaatgc ttttgaagta 9720
ttagctattg aaatgataac aattgttcaa gctatagatt atcttggtca gaaagataaa 9780
ataagttccg tttcaaaaaa gtggtatgac gaaatcagga acatcatacc tacctttaag 9840
gaagaccaag tcatgtaccc atttgtccaa aaggtgaaag atcatttgat taataactaa 9900
atttaactcc ttaagttact ttaatgattt agtttttatt attaataatt catgctcatg 9960
acatctcata tacacgttta taaaacttaa atagattgaa aatgtattaa agattcctca 10020
gggattcgat ttttttggaa gtttttgttt ttttttcctt gagatgctgt agtatttggg 10080
aacaattata caatcgaaag atatatgctt acattcgacc gttttagccg tgatca 10136
<210> 3
<211> 11733
<212> DNA
<213> 人工序列()
<400> 3
gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg 60
tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa 120
agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg 180
cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga 240
ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg 300
tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg 360
gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc 420
gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg 480
gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca 540
ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt 600
ggcctaacta cggctacact agaagaacag tatttggtat ctgcgctctg ctgaagccag 660
ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg 720
gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc 780
ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt 840
tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt 900
ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca 960
gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg 1020
tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac 1080
cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg 1140
ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc 1200
gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta 1260
caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac 1320
gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc 1380
ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac 1440
tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact 1500
caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa 1560
tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt 1620
cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca 1680
ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa 1740
aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac 1800
tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg 1860
gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc 1920
gaaaagtgcc acctgacgtc gcggccgccc cacacaccat agcttcaaaa tgtttctact 1980
ccttttttac tcttccagat tttctcggac tccgcgcatc gccgtaccac ttcaaaacac 2040
ccaagcacag catactaaat ttcccctctt tcttcctcta gggtgtcgtt aattacccgt 2100
actaaaggtt tggaaaagaa aaaagagacc gcctcgtttc tttttcttcg tcgaaaaagg 2160
caataaaaat ttttatcacg tttctttttc ttgaaaattt ttttttttga tttttttctc 2220
tttcgatgac ctcccattga tatttaagtt aataaacggt cttcaatttc tcaagtttca 2280
gtttcatttt tcttgttcta ttacaacttt ttttacttct tgctcattag aaagaaagca 2340
tagcaatcta atctaagttt taattacaaa atggatctgt tgttattaga gaaatccttg 2400
atcgctgtat tcgtggccgt tattctggct acagttattt ctaagttaag agggaagaaa 2460
ttaaaattgc ccccaggccc tatcccaatt ccaatctttg gcaattggtt gcaagttggc 2520
gatgatctaa accataggaa tcttgttgat tacgctaaga aatttggaga cttatttctg 2580
ttgaggatgg gtcagagaaa tctggttgtg gtttctagcc cggatcttac aaaggaagtg 2640
cttcttacac aaggtgttga gttcggttca agaactagaa acgtcgtttt cgacattttt 2700
acgggtaaag ggcaggatat ggtgtttact gtctatggag agcattggag aaaaatgaga 2760
cgtattatga ctgttccatt ctttacaaat aaagttgtcc aacagaatag agaagggtgg 2820
gagtttgagg ctgctagcgt tgtggaggac gtgaagaaga atccagacag tgcgaccaaa 2880
ggtattgttt tgagaaaacg tctgcaattg atgatgtata acaatatgtt cagaataatg 2940
ttcgaccgtc gtttcgagag tgaagatgat cccttgtttc ttaggcttaa agcacttaat 3000
ggagaaagat cacgtttggc tcaaagcttt gagtataatt atggcgactt cattccaatc 3060
ctacgtcctt ttttaagagg atacttaaaa atctgccaag atgttaaaga tagaagaatt 3120
gcattattta aaaaatactt tgtagatgaa agaaaacaaa tagcatcttc taaacctact 3180
gggtccgaag gccttaagtg cgctatcgat cacattttag aggctgagca aaagggtgaa 3240
atcaacgagg ataatgtctt gtacattgtc gagaacatta atgttgcggc tattgaaact 3300
acattgtggt ctatagaatg gggtatagct gaactggtta atcatccaga gattcaatca 3360
aaattgagaa acgaattaga tactgtctta ggtcctggtg tccaagtaac tgagccagat 3420
ttgcacaagc tgccttattt gcaagcagta gtaaaagaaa ctctaagatt gagaatggcc 3480
atcccattac tggttccaca catgaaccta catgatgcaa agttggcagg ttatgacatt 3540
ccagctgaat caaaaatatt agttaatgct tggtggctag caaataatcc aaattcatgg 3600
aagaagcctg aagaatttcg tcccgaaagg ttctttgaag aagaatcaca cgtagaagcg 3660
aatggtaatg actttaggta tgtccccttt ggtgtgggtc gtagatcttg tcccggtatt 3720
attttggctt tacccatttt aggtattaca attggtcgta tggtacaaaa tttcgagtta 3780
ctgccacccc caggtcaatc taaagttgac acgtctgaaa aaggcggtca attttcacta 3840
catatactaa atcattcaat aattgtaatg aagccacgta attgttaaca tgtaattagt 3900
tatgtcacgc ttacattcac gccctccccc cacatccgct ctaaccgaaa aggaaggagt 3960
tagacaacct gaagtctagg tccctattta tttttttata gttatgttag tattaagaac 4020
gttatttata tttcaaattt ttcttttttt tctgtacaga cgcgtgtacg catgtaacat 4080
tatactgaaa accttgcttg agaaggtttt gggacgctcg aaggctttac gcacagatat 4140
tataacatct gcataatagg catttgcaag aattactcgt gagtaaggaa agagtgagga 4200
actatcgcat acctgcattt aaagatgccg atttgggcgc gaatccttta ttttggcttc 4260
accctcatac tattatcagg gccagaaaaa ggaagtgttt ccctccttct tgaattgatg 4320
ttaccctcat aaagcacgtg gcctcttatc gagaaagaaa ttaccgtcgc tcgtgatttg 4380
tttgcaaaaa gaacaaaact gaaaaaaccc agacacgctc gacttcctgt cttcctattg 4440
attgcagctt ccaatttcgt cacacaacaa ggtcctagcg acggctcaca ggttttgtaa 4500
caagcaatcg aaggttctgg aatggcggga aagggtttag taccacatgc tatgatgccc 4560
actgtgatct ccagagcaaa gttcgttcga tcgtactgtt actctctctc tttcaaacag 4620
aattgtccga atcgtgtgac aacaacagcc tgttctcaca cactcttttc ttctaaccaa 4680
gggggtggtt tagtttagta gaacctcgtg aaacttacat ttacatatat ataaacttgc 4740
ataaattggt caatgcaaga aatacatatt tggtcttttc taattcgtag tttttcaagt 4800
tcttagatgc tttctttttc tcttttttac agatcatcaa ggaagtaatt atctactttt 4860
tacaacaaat ataaaacaat ggcgccacaa gaacaagcag tttctcaggt gatggagaaa 4920
cagagcaaca acaacaacag tgacgtcatt ttccgatcaa agttaccgga tatttacatc 4980
ccgaaccacc tatctctcca cgactacatc ttccaaaaca tctccgaatt cgccactaag 5040
ccttgcctaa tcaacggacc aaccggccac gtgtacactt actccgacgt ccacgtcatc 5100
tcccgccaaa tcgccgccaa ttttcacaaa ctcggcgtta accaaaacga cgtcgtcatg 5160
ctcctcctcc caaactgtcc cgaattcgtc ctctctttcc tcgccgcctc cttccgcggc 5220
gcaaccgcca ccgccgcaaa ccctttcttc actccggcgg agatagctaa acaagccaaa 5280
gcctccaaca ccaaactcat aatcaccgaa gctcgttacg tcgacaaaat caaaccactt 5340
caaaacgacg acggagtagt catcgtctgc atcgacgaca acgaatccgt gccaatccct 5400
gaaggctgcc tccgcttcac cgagttgact cagtcgacaa ccgaggcatc agaagtcatc 5460
gactcggtgg agatttcacc ggacgacgtg gtggcactac cttactcctc tggcacgacg 5520
ggattaccaa aaggagtgat gctgactcac aagggactag tcacgagcgt tgctcagcaa 5580
gtcgacggcg agaacccgaa tctttatttc cacagcgatg acgtcctact ctgtgttttg 5640
cccatgtttc atatctacgc tttgaactcg atcatgttgt gtggtcttag agttggtgcg 5700
gcgattctga taatgccgaa gtttgagatc aatctgctat tggagctgat ccagaggtgt 5760
aaagtgacgg tggctccgat ggttccgccg attgtgttgg ccattgcgaa gtcttcggag 5820
acggagaagt atgatttgag ctcgataaga gtggtgaaat ctggtgctgc tcctcttggt 5880
aaagaacttg aagatgccgt taatgccaag tttcctaatg ccaaactcgg tcagggatac 5940
ggaatgacgg aagcaggtcc agtgctagca atgtcgttag gttttgcaaa ggaacctttt 6000
ccggttaagt caggagcttg tggtactgtt gtaagaaatg ctgagatgaa aatagttgat 6060
ccagacaccg gagattctct ttcgaggaat caacccggtg agatttgtat tcgtggtcac 6120
cagatcatga aaggttacct caacaatccg gcagctacag cagagaccat tgataaagac 6180
ggttggcttc atactggaga tattggattg atcgatgacg atgacgagct tttcatcgtt 6240
gatcgattga aagaacttgt caagtataaa ggttttcagg tagctccggc tgagctagag 6300
gctttgctca tcggtcatcc tgacattact gatgttgctg ttgtagcaat gaaagaagaa 6360
gcagctggtg aagttcctgt tgcatttgtg gtgaaatcga aggattcgga gttatcagaa 6420
gatgatgtga agcaattcgt gtcgaaacag gttgtgtttt acaagagaat caacaaagtg 6480
ttcttcactg aatccattcc taaagctcca tcagggaaga tattgaggaa agatctgagg 6540
gcaaaactag caaatggatt gtaagcgaat ttcttatgat ttatgatttt tattattaaa 6600
taagttataa aaaaaataag tgtatacaaa ttttaaagtg actcttaggt tttaaaacga 6660
aaattcttat tcttgagtaa ctctttcctg taggtcaggt tgctttctca ggtatagcat 6720
gaggtcgctc ttattgacca cacctctacc ggcatgccga gcaaatgcct gcaaatcgct 6780
ccccatttca cccaattgta gatatgctaa ctccagcaat gagttgatga atctcggtgt 6840
gtattttatg tcctcagagg acaacacctg ttgtaatcgt tcttccacac ggatccacag 6900
cctagccttc agttgggctc tatcttcatc gtcttacaca gaatatataa catcgtaggt 6960
gtctgggtga acagtttatt cctggcatcc actaaatata atggagcccg ctttttaagc 7020
tggcatccag aaaaaaaaag aatcccagca ccaaaatatt gttttcttca ccaaccatca 7080
gttcataggt ccattctctt agcgcaacta cagagaacag gggcacaaac aggcaaaaaa 7140
cgggcacaac ctcaatggag tgatgcaacc tgcctggagt aaatgatgac acaaggcaat 7200
tgacccacgc atgtatctat ctcattttct tacaccttct attaccttct gctctctctg 7260
atttggaaaa agctgaaaaa aaaggttgaa accagttccc tgaaattatt cccctacttg 7320
actaataagt atataaagac ggtaggtatt gattgtaatt ctgtaaatct atttcttaaa 7380
cttcttaaat tctactttta tagttagtct tttttttagt tttaaaacac caagaactta 7440
gtttcgaata aacacacata aacaaacaaa atggcgccac aagaacaagc agtttctcag 7500
gtgatggaga aacagagcaa caacaacaac agtgacgtca ttttccgatc aaagttaccg 7560
gatatttaca tcccgaacca cctatctctc cacgactaca tcttccaaaa catctccgaa 7620
ttcgccacta agccttgcct aatcaacgga ccaaccggcc acgtgtacac ttactccgac 7680
gtccacgtca tctcccgcca aatcgccgcc aattttcaca aactcggcgt taaccaaaac 7740
gacgtcgtca tgctcctcct cccaaactgt cccgaattcg tcctctcttt cctcgccgcc 7800
tccttccgcg gcgcaaccgc caccgccgca aaccctttct tcactccggc ggagatagct 7860
aaacaagcca aagcctccaa caccaaactc ataatcaccg aagctcgtta cgtcgacaaa 7920
atcaaaccac ttcaaaacga cgacggagta gtcatcgtct gcatcgacga caacgaatcc 7980
gtgccaatcc ctgaaggctg cctccgcttc accgagttga ctcagtcgac aaccgaggca 8040
tcagaagtca tcgactcggt ggagatttca ccggacgacg tggtggcact accttactcc 8100
tctggcacga cgggattacc aaaaggagtg atgctgactc acaagggact agtcacgagc 8160
gttgctcagc aagtcgacgg cgagaacccg aatctttatt tccacagcga tgacgtcata 8220
ctctgtgttt tgcccatgtt tcatatctac gctttgaact cgatcatgtt gtgtggtctt 8280
agagttggtg cggcgattct gataatgccg aagtttgaga tcaatctgct attggagctg 8340
atccagaggt gtaaagtgac ggtggctccg atggttccgc cgattgtgtt ggccattgcg 8400
aagtcttcgg agacggagaa gtatgatttg agctcgataa gagtggtgaa atctggtgct 8460
gctcctcttg gtaaagaact tgaagatgcc gttaatgcca agtttcctaa tgccaaactc 8520
ggtcagggat acggaatgac ggaagcaggt ccagtgctag caatgtcgtt aggttttgca 8580
aaggaacctt ttccggttaa gtcaggagct tgtggtactg ttgtaagaaa tgctgagatg 8640
aaaatagttg atccagacac cggagattct ctttcgagga atcaacccgg tgagatttgt 8700
attcgtggtc accagatcat gaaaggttac ctcaacaatc cggcagctac agcagagacc 8760
attgataaag acggttggct tcatactgga gatattggat tgatcgatga cgatgacgag 8820
cttttcatcg ttgatcgatt gaaagaactt atcaagtata aaggttttca ggtagctccg 8880
gctgagctag aggctttgct catcggtcat cctgacatta ctgatgttgc tgttgtcgca 8940
atgaaagaag aagcagctgg tgaagttcct gttgcatttg tggtgaaatc gaaggattcg 9000
gagttatcag aagatgatgt gaagcaattc gtgtcgaaac aggttgtgtt ttacaagaga 9060
atcaacaaag tgttcttcac tgaatccatt cctaaagctc catcagggaa gatattgagg 9120
aaagatctga gggcaaaact agcaaatgga ttgtaaattt aactccttaa gttactttaa 9180
tgatttagtt tttattatta ataattcatg ctcatgacat ctcatataca cgtttataaa 9240
acttaaatag attgaaaatg tattaaagat tcctcaggga ttcgattttt ttggaagttt 9300
ttgttttttt ttccttgaga tgctgtagta tttgggaaca attatacaat cgaaagatat 9360
atgcttacat tcgaccgttt tagccgtgat catccaactg gcaccgctgg cttgaacaac 9420
aataccagcc ttccaacttc tgtaaataac ggcggtacgc cagtgccacc agtaccgtta 9480
cctttcggta tacctccttt ccccatgttt ccaatgccct tcatgcctcc aacggctact 9540
atcacaaatc ctcatcaagc tgacgcaagc cctaagaaat gaataacaat actgacagta 9600
ctaaataatt gcctacttgg cttcacatac gttgcatacg tcgatataga taataatgat 9660
aatgacagca ggattatcgt aatacgtaat agttgaaaat ctcaaaaatg tgtgggtcat 9720
tacgtaaata atgataggaa tgggattctt ctatttttcc tttttccatt ctagcagccg 9780
tcgggaaaac gtggcatcct ctctttcggg ctcaattgga gtcacgctgc cgtgagcatc 9840
ctctctttcc atatctaaca actgagcacg taaccaatgg aaaagcatga gcttagcgtt 9900
gctccaaaaa agtattggat ggttaatacc atttgtctgt tctcttctga ctttgactcc 9960
tcaaaaaaaa aaaatctaca atcaacagat cgcttcaatt acgccctcac aaaaactttt 10020
ttccttcttc ttcgcccacg ttaaatttta tccctcatgt tgtctaacgg atttctgcac 10080
ttgatttatt ataaaaagac aaagacataa tacttctcta tcaatttcag ttattgttct 10140
tccttgcgtt attcttctgt tcttcttttt cttttgtcat atataaccat aaccaagtaa 10200
tacatattca aaatggttac ggtggaagaa taccgcaaag ctcaacgcgc tgaaggcccg 10260
gcgacggtga tggcgattgg cacggcaacc ccgacgaact gtgttgatca gagcacctat 10320
ccggactatt actttcgtat caccaactct gaacataaaa cggatctgaa agaaaaattc 10380
aaacgtatgt gcgaaaaaag catgatcaaa aaacgctata tgcacctgac cgaagaaatt 10440
ctgaaagaaa atccgagcat gtgtgaatac atggcaccgt ctctggatgc tcgccaggac 10500
attgtggttg tcgaagtgcc gaaactgggt aaagaagcgg cccagaaagc gatcaaagaa 10560
tggggccaac cgaaatcaaa aattacccat ctggtctttt gcaccacgtc gggtgtggat 10620
atgccgggtt gtgactatca actgacgaaa ctgctgggtc tgcgtccgag cgtgaaacgc 10680
ctgatgatgt accagcaagg ctgcttcgca ggcggtaccg ttctgcgtct ggcgaaagat 10740
ctggccgaaa acaataaagg tgcgcgtgtt ctggtggtgt gtagtgaaat caccgctgtt 10800
acgtttcgtg gtccgaacga tacgcacctg gactccctgg ttggccaggc cctgttcggt 10860
gatggtgcag gtgccattat cattggtagc gacccgattc cgggcgttga acgtccgctg 10920
tttgaactgg tcagcgcagc tcaaaccctg ctgccggata gccacggcgc aattgacggt 10980
cacctgcgtg aagtcggtct gacgttccat ctgctgaaag atgtgccggg cctgatctca 11040
aaaaacattg aaaaaagcct ggaagaagcg tttcgcccgc tgagtatctc cgattggaac 11100
agcctgttct ggattgcaca tccgggcggc ccggcaatcc tggaccaggt cgaaattaaa 11160
ctgggtctga aaccggaaaa actgaaagcg acccgtaatg ttctgtcaaa ctacggcaat 11220
atgagctctg cctgcgtcct gtttattctg gatgaaatgc gcaaagcatc ggctaaagaa 11280
ggtctgggca ccacgggtga aggcctggaa tggggcgtgc tgttcggctt tggtccgggt 11340
ctgacggtgg aaacggtggt tctgcatagt gtggctacct aatgcgtttg aagtgagacg 11400
ctccatcatc tctcttaatt tttcatgact gacgtttttt cttcatttta attatcatag 11460
tatttgtttg aaaaaaaaaa aaaaaaattt cccttatcaa tgatatcctt acgattatat 11520
aaattcctta cctaaaccta ttatttgtgt acatatatca gagtattatt acatatataa 11580
cctttttctc taaaacagga aaaaaaaaag aaaacgataa catgctctgc catcctttgt 11640
tcaccgagca aaattaaaaa cgcaaaatga attgtcccta tgaaattatt aaaggaccac 11700
atcaccagac ttatctctgg ggggtccctc gag 11733
<210> 4
<211> 14400
<212> DNA
<213> 人工序列()
<400> 4
ggccgcaaat taaagccttc gagcgtccca aaaccttctc aagcaaggtt ttcagtataa 60
tgttacatgc gtacacgcgt gcggccgccc cacacaccat agcttcaaaa tgtttctact 120
ccttttttac tcttccagat tttctcggac tccgcgcatc gccgtaccac ttcaaaacac 180
ccaagcacag catactaaat ttcccctctt tcttcctcta gggtgtcgtt aattacccgt 240
actaaaggtt tggaaaagaa aaaagagacc gcctcgtttc tttttcttcg tcgaaaaagg 300
caataaaaat ttttatcacg tttctttttc ttgaaaattt ttttttttga tttttttctc 360
tttcgatgac ctcccattga tatttaagtt aataaacggt cttcaatttc tcaagtttca 420
gtttcatttt tcttgttcta ttacaacttt ttttacttct tgctcattag aaagaaagca 480
tagcaatcta atctaagttt taattacaaa atggatctgt tgttattaga gaaatccttg 540
atcgctgtat tcgtggccgt tattctggct acagttattt ctaagttaag agggaagaaa 600
ttaaaattgc ccccaggccc tatcccaatt ccaatctttg gcaattggtt gcaagttggc 660
gatgatctaa accataggaa tcttgttgat tacgctaaga aatttggaga cttatttctg 720
ttgaggatgg gtcagagaaa tctggttgtg gtttctagcc cggatcttac aaaggaagtg 780
cttcttacac aaggtgttga gttcggttca agaactagaa acgtcgtttt cgacattttt 840
acgggtaaag ggcaggatat ggtgtttact gtctatggag agcattggag aaaaatgaga 900
cgtattatga ctgttccatt ctttacaaat aaagttgtcc aacagaatag agaagggtgg 960
gagtttgagg ctgctagcgt tgtggaggac gtgaagaaga atccagacag tgcgaccaaa 1020
ggtattgttt tgagaaaacg tctgcaattg atgatgtata acaatatgtt cagaataatg 1080
ttcgaccgtc gtttcgagag tgaagatgat cccttgtttc ttaggcttaa agcacttaat 1140
ggagaaagat cacgtttggc tcaaagcttt gagtataatt atggcgactt cattccaatc 1200
ctacgtcctt ttttaagagg atacttaaaa atctgccaag atgttaaaga tagaagaatt 1260
gcattattta aaaaatactt tgtagatgaa agaaaacaaa tagcatcttc taaacctact 1320
gggtccgaag gccttaagtg cgctatcgat cacattttag aggctgagca aaagggtgaa 1380
atcaacgagg ataatgtctt gtacattgtc gagaacatta atgttgcggc tattgaaact 1440
acattgtggt ctatagaatg gggtatagct gaactggtta atcatccaga gattcaatca 1500
aaattgagaa acgaattaga tactgtctta ggtcctggtg tccaagtaac tgagccagat 1560
ttgcacaagc tgccttattt gcaagcagta gtaaaagaaa ctctaagatt gagaatggcc 1620
atcccattac tggttccaca catgaaccta catgatgcaa agttggcagg ttatgacatt 1680
ccagctgaat caaaaatatt agttaatgct tggtggctag caaataatcc aaattcatgg 1740
aagaagcctg aagaatttcg tcccgaaagg ttctttgaag aagaatcaca cgtagaagcg 1800
aatggtaatg actttaggta tgtccccttt ggtgtgggtc gtagatcttg tcccggtatt 1860
attttggctt tacccatttt aggtattaca attggtcgta tggtacaaaa tttcgagtta 1920
ctgccacccc caggtcaatc taaagttgac acgtctgaaa aaggcggtca attttcacta 1980
catatactaa atcattcaat aattgtaatg aagccacgta attgttaaca tgtaattagt 2040
tatgtcacgc ttacattcac gccctccccc cacatccgct ctaaccgaaa aggaaggagt 2100
tagacaacct gaagtctagg tccctattta tttttttata gttatgttag tattaagaac 2160
gttatttata tttcaaattt ttcttttttt tctgtacaga cgcgtgtacg catgtaacat 2220
tatactgaaa accttgcttg agaaggtttt gggacgctcg aaggctttac gcacagatat 2280
tataacatct gcataatagg catttgcaag aattactcgt gagtaaggaa agagtgagga 2340
actatcgcat acctgcattt aaagatgccg atttgggcgc gaatccttta ttttggcttc 2400
accctcatac tattatcagg gccagaaaaa ggaagtgttt ccctccttct tgaattgatg 2460
ttaccctcat aaagcacgtg gcctcttatc gagaaagaaa ttaccgtcgc tcgtgatttg 2520
tttgcaaaaa gaacaaaact gaaaaaaccc agacacgctc gacttcctgt cttcctattg 2580
attgcagctt ccaatttcgt cacacaacaa ggtcctagcg acggctcaca ggttttgtaa 2640
caagcaatcg aaggttctgg aatggcggga aagggtttag taccacatgc tatgatgccc 2700
actgtgatct ccagagcaaa gttcgttcga tcgtactgtt actctctctc tttcaaacag 2760
aattgtccga atcgtgtgac aacaacagcc tgttctcaca cactcttttc ttctaaccaa 2820
gggggtggtt tagtttagta gaacctcgtg aaacttacat ttacatatat ataaacttgc 2880
ataaattggt caatgcaaga aatacatatt tggtcttttc taattcgtag tttttcaagt 2940
tcttagatgc tttctttttc tcttttttac agatcatcaa ggaagtaatt atctactttt 3000
tacaacaaat ataaaacaat ggcgccacaa gaacaagcag tttctcaggt gatggagaaa 3060
cagagcaaca acaacaacag tgacgtcatt ttccgatcaa agttaccgga tatttacatc 3120
ccgaaccacc tatctctcca cgactacatc ttccaaaaca tctccgaatt cgccactaag 3180
ccttgcctaa tcaacggacc aaccggccac gtgtacactt actccgacgt ccacgtcatc 3240
tcccgccaaa tcgccgccaa ttttcacaaa ctcggcgtta accaaaacga cgtcgtcatg 3300
ctcctcctcc caaactgtcc cgaattcgtc ctctctttcc tcgccgcctc cttccgcggc 3360
gcaaccgcca ccgccgcaaa ccctttcttc actccggcgg agatagctaa acaagccaaa 3420
gcctccaaca ccaaactcat aatcaccgaa gctcgttacg tcgacaaaat caaaccactt 3480
caaaacgacg acggagtagt catcgtctgc atcgacgaca acgaatccgt gccaatccct 3540
gaaggctgcc tccgcttcac cgagttgact cagtcgacaa ccgaggcatc agaagtcatc 3600
gactcggtgg agatttcacc ggacgacgtg gtggcactac cttactcctc tggcacgacg 3660
ggattaccaa aaggagtgat gctgactcac aagggactag tcacgagcgt tgctcagcaa 3720
gtcgacggcg agaacccgaa tctttatttc cacagcgatg acgtcctact ctgtgttttg 3780
cccatgtttc atatctacgc tttgaactcg atcatgttgt gtggtcttag agttggtgcg 3840
gcgattctga taatgccgaa gtttgagatc aatctgctat tggagctgat ccagaggtgt 3900
aaagtgacgg tggctccgat ggttccgccg attgtgttgg ccattgcgaa gtcttcggag 3960
acggagaagt atgatttgag ctcgataaga gtggtgaaat ctggtgctgc tcctcttggt 4020
aaagaacttg aagatgccgt taatgccaag tttcctaatg ccaaactcgg tcagggatac 4080
ggaatgacgg aagcaggtcc agtgctagca atgtcgttag gttttgcaaa ggaacctttt 4140
ccggttaagt caggagcttg tggtactgtt gtaagaaatg ctgagatgaa aatagttgat 4200
ccagacaccg gagattctct ttcgaggaat caacccggtg agatttgtat tcgtggtcac 4260
cagatcatga aaggttacct caacaatccg gcagctacag cagagaccat tgataaagac 4320
ggttggcttc atactggaga tattggattg atcgatgacg atgacgagct tttcatcgtt 4380
gatcgattga aagaacttgt caagtataaa ggttttcagg tagctccggc tgagctagag 4440
gctttgctca tcggtcatcc tgacattact gatgttgctg ttgtagcaat gaaagaagaa 4500
gcagctggtg aagttcctgt tgcatttgtg gtgaaatcga aggattcgga gttatcagaa 4560
gatgatgtga agcaattcgt gtcgaaacag gttgtgtttt acaagagaat caacaaagtg 4620
ttcttcactg aatccattcc taaagctcca tcagggaaga tattgaggaa agatctgagg 4680
gcaaaactag caaatggatt gtaagcgaat ttcttatgat ttatgatttt tattattaaa 4740
taagttataa aaaaaataag tgtatacaaa ttttaaagtg actcttaggt tttaaaacga 4800
aaattcttat tcttgagtaa ctctttcctg taggtcaggt tgctttctca ggtatagcat 4860
gaggtcgctc ttattgacca cacctctacc ggcatgccga gcaaatgcct gcaaatcgct 4920
ccccatttca cccaattgta gatatgctaa ctccagcaat gagttgatga atctcggtgt 4980
gtattttatg tcctcagagg acaacacctg ttgtaatcgt tcttccacac ggatccacag 5040
cctagccttc agttgggctc tatcttcatc gtcttacaca gaatatataa catcgtaggt 5100
gtctgggtga acagtttatt cctggcatcc actaaatata atggagcccg ctttttaagc 5160
tggcatccag aaaaaaaaag aatcccagca ccaaaatatt gttttcttca ccaaccatca 5220
gttcataggt ccattctctt agcgcaacta cagagaacag gggcacaaac aggcaaaaaa 5280
cgggcacaac ctcaatggag tgatgcaacc tgcctggagt aaatgatgac acaaggcaat 5340
tgacccacgc atgtatctat ctcattttct tacaccttct attaccttct gctctctctg 5400
atttggaaaa agctgaaaaa aaaggttgaa accagttccc tgaaattatt cccctacttg 5460
actaataagt atataaagac ggtaggtatt gattgtaatt ctgtaaatct atttcttaaa 5520
cttcttaaat tctactttta tagttagtct tttttttagt tttaaaacac caagaactta 5580
gtttcgaata aacacacata aacaaacaaa atggcgccac aagaacaagc agtttctcag 5640
gtgatggaga aacagagcaa caacaacaac agtgacgtca ttttccgatc aaagttaccg 5700
gatatttaca tcccgaacca cctatctctc cacgactaca tcttccaaaa catctccgaa 5760
ttcgccacta agccttgcct aatcaacgga ccaaccggcc acgtgtacac ttactccgac 5820
gtccacgtca tctcccgcca aatcgccgcc aattttcaca aactcggcgt taaccaaaac 5880
gacgtcgtca tgctcctcct cccaaactgt cccgaattcg tcctctcttt cctcgccgcc 5940
tccttccgcg gcgcaaccgc caccgccgca aaccctttct tcactccggc ggagatagct 6000
aaacaagcca aagcctccaa caccaaactc ataatcaccg aagctcgtta cgtcgacaaa 6060
atcaaaccac ttcaaaacga cgacggagta gtcatcgtct gcatcgacga caacgaatcc 6120
gtgccaatcc ctgaaggctg cctccgcttc accgagttga ctcagtcgac aaccgaggca 6180
tcagaagtca tcgactcggt ggagatttca ccggacgacg tggtggcact accttactcc 6240
tctggcacga cgggattacc aaaaggagtg atgctgactc acaagggact agtcacgagc 6300
gttgctcagc aagtcgacgg cgagaacccg aatctttatt tccacagcga tgacgtcata 6360
ctctgtgttt tgcccatgtt tcatatctac gctttgaact cgatcatgtt gtgtggtctt 6420
agagttggtg cggcgattct gataatgccg aagtttgaga tcaatctgct attggagctg 6480
atccagaggt gtaaagtgac ggtggctccg atggttccgc cgattgtgtt ggccattgcg 6540
aagtcttcgg agacggagaa gtatgatttg agctcgataa gagtggtgaa atctggtgct 6600
gctcctcttg gtaaagaact tgaagatgcc gttaatgcca agtttcctaa tgccaaactc 6660
ggtcagggat acggaatgac ggaagcaggt ccagtgctag caatgtcgtt aggttttgca 6720
aaggaacctt ttccggttaa gtcaggagct tgtggtactg ttgtaagaaa tgctgagatg 6780
aaaatagttg atccagacac cggagattct ctttcgagga atcaacccgg tgagatttgt 6840
attcgtggtc accagatcat gaaaggttac ctcaacaatc cggcagctac agcagagacc 6900
attgataaag acggttggct tcatactgga gatattggat tgatcgatga cgatgacgag 6960
cttttcatcg ttgatcgatt gaaagaactt atcaagtata aaggttttca ggtagctccg 7020
gctgagctag aggctttgct catcggtcat cctgacatta ctgatgttgc tgttgtcgca 7080
atgaaagaag aagcagctgg tgaagttcct gttgcatttg tggtgaaatc gaaggattcg 7140
gagttatcag aagatgatgt gaagcaattc gtgtcgaaac aggttgtgtt ttacaagaga 7200
atcaacaaag tgttcttcac tgaatccatt cctaaagctc catcagggaa gatattgagg 7260
aaagatctga gggcaaaact agcaaatgga ttgtaaattt aactccttaa gttactttaa 7320
tgatttagtt tttattatta ataattcatg ctcatgacat ctcatataca cgtttataaa 7380
acttaaatag attgaaaatg tattaaagat tcctcaggga ttcgattttt ttggaagttt 7440
ttgttttttt ttccttgaga tgctgtagta tttgggaaca attatacaat cgaaagatat 7500
atgcttacat tcgaccgttt tagccgtgat catccaactg gcaccgctgg cttgaacaac 7560
aataccagcc ttccaacttc tgtaaataac ggcggtacgc cagtgccacc agtaccgtta 7620
cctttcggta tacctccttt ccccatgttt ccaatgccct tcatgcctcc aacggctact 7680
atcacaaatc ctcatcaagc tgacgcaagc cctaagaaat gaataacaat actgacagta 7740
ctaaataatt gcctacttgg cttcacatac gttgcatacg tcgatataga taataatgat 7800
aatgacagca ggattatcgt aatacgtaat agttgaaaat ctcaaaaatg tgtgggtcat 7860
tacgtaaata atgataggaa tgggattctt ctatttttcc tttttccatt ctagcagccg 7920
tcgggaaaac gtggcatcct ctctttcggg ctcaattgga gtcacgctgc cgtgagcatc 7980
ctctctttcc atatctaaca actgagcacg taaccaatgg aaaagcatga gcttagcgtt 8040
gctccaaaaa agtattggat ggttaatacc atttgtctgt tctcttctga ctttgactcc 8100
tcaaaaaaaa aaaatctaca atcaacagat cgcttcaatt acgccctcac aaaaactttt 8160
ttccttcttc ttcgcccacg ttaaatttta tccctcatgt tgtctaacgg atttctgcac 8220
ttgatttatt ataaaaagac aaagacataa tacttctcta tcaatttcag ttattgttct 8280
tccttgcgtt attcttctgt tcttcttttt cttttgtcat atataaccat aaccaagtaa 8340
tacatattca aaatggttac ggtggaagaa taccgcaaag ctcaacgcgc tgaaggcccg 8400
gcgacggtga tggcgattgg cacggcaacc ccgacgaact gtgttgatca gagcacctat 8460
ccggactatt actttcgtat caccaactct gaacataaaa cggatctgaa agaaaaattc 8520
aaacgtatgt gcgaaaaaag catgatcaaa aaacgctata tgcacctgac cgaagaaatt 8580
ctgaaagaaa atccgagcat gtgtgaatac atggcaccgt ctctggatgc tcgccaggac 8640
attgtggttg tcgaagtgcc gaaactgggt aaagaagcgg cccagaaagc gatcaaagaa 8700
tggggccaac cgaaatcaaa aattacccat ctggtctttt gcaccacgtc gggtgtggat 8760
atgccgggtt gtgactatca actgacgaaa ctgctgggtc tgcgtccgag cgtgaaacgc 8820
ctgatgatgt accagcaagg ctgcttcgca ggcggtaccg ttctgcgtct ggcgaaagat 8880
ctggccgaaa acaataaagg tgcgcgtgtt ctggtggtgt gtagtgaaat caccgctgtt 8940
acgtttcgtg gtccgaacga tacgcacctg gactccctgg ttggccaggc cctgttcggt 9000
gatggtgcag gtgccattat cattggtagc gacccgattc cgggcgttga acgtccgctg 9060
tttgaactgg tcagcgcagc tcaaaccctg ctgccggata gccacggcgc aattgacggt 9120
cacctgcgtg aagtcggtct gacgttccat ctgctgaaag atgtgccggg cctgatctca 9180
aaaaacattg aaaaaagcct ggaagaagcg tttcgcccgc tgagtatctc cgattggaac 9240
agcctgttct ggattgcaca tccgggcggc ccggcaatcc tggaccaggt cgaaattaaa 9300
ctgggtctga aaccggaaaa actgaaagcg acccgtaatg ttctgtcaaa ctacggcaat 9360
atgagctctg cctgcgtcct gtttattctg gatgaaatgc gcaaagcatc ggctaaagaa 9420
ggtctgggca ccacgggtga aggcctggaa tggggcgtgc tgttcggctt tggtccgggt 9480
ctgacggtgg aaacggtggt tctgcatagt gtggctacct aatgcgtttg aagtgagacg 9540
ctccatcatc tctcttaatt tttcatgact gacgtttttt cttcatttta attatcatag 9600
tatttgtttg aaaaaaaaaa aaaaaaattt cccttatcaa tgatatcctt acgattatat 9660
aaattcctta cctaaaccta ttatttgtgt acatatatca gagtattatt acatatataa 9720
cctttttctc taaaacagga aaaaaaaaag aaaacgataa catgctctgc catcctttgt 9780
tcaccgagca aaattaaaaa cgcaaaatga attgtcccta tgaaattatt aaaggaccac 9840
atcaccagac ttatctctgg ggggtccctc gagatttaaa tatttgctta tacaatcttc 9900
ctgtttttgg ggcttttctg attatcaacc ggggtggagc ttcccattgc gaataccgct 9960
tccacaaaca ttgctcaaaa gtatctcttt gctatatatc tctgtgctat atccctatat 10020
aacctaccca tccacctttc gctccttgaa cttgcatcta aactcgacct ctacattttt 10080
tatgtttatc tctagtatta ctctttagac aaaaaaattg tagtaagaac tattcataga 10140
gtgaatcgaa aacaatacga aaatgtaaac atttcctata cgtagtatat agagacaaaa 10200
tagaagaaac cgttcataat tttctgacca atgaagaatc atcaacgcta tcactttctg 10260
ttcacaaagt atgcgcaatc cacatcggta tagaatataa tcggggatgc ctttatcttg 10320
aaaaaatgca cccgcagctt cgctagtaat cagtaaacgc gggaagtgga gtcaggcttt 10380
ttttatggaa gagaaaatag acaccaaagt agccttcttc taaccttaac ggacctacag 10440
tgcaaaaagt tatcaagaga ctgcattata gagcgcacaa aggagaaaaa aagtaatcta 10500
agatgctttg ttagaaaaat agcgctctcg ggatgcattt ttgtagaaca aaaaagaagt 10560
atagattctt tgttggtaaa atagcgctct cgcgttgcat ttctgttctg taaaaatgca 10620
gctcagattc tttgtttgaa aaattagcgc tctcgtcgcg ttgcattttt gttttacaaa 10680
aatgaagcac agattcttcg ttggtaaaat agcgctttcg cgttgcattt ctgttctgta 10740
aaaatgcagc tcagattctt tgtttgaaaa attagcgctc tcgcgttgca tttttgttct 10800
acaaaatgaa gcacagatgc ttcgttaaca aagatatgct attgaagtgc aagatggaaa 10860
cgcagaaaat gaaccgggga tgcgacgtgc aagattacct atgcaataga tgcaatagtt 10920
tctccaggaa ccgaaataca tacattgtct tccgtaaagc gctagactat atattattat 10980
acaggttcaa atatactatc tgtttcaggg aaaactccca ggttcggatg ttcaaaattc 11040
aatgatgggt aacaagtacg atcgtaaatc tgtaaaacag tttgtcggat attaggctgt 11100
atctcctcaa agcgtattcg aatatcattg agaagctgca gcgtcacatc ggataataat 11160
gatggcagcc attgtagaag tgccttttgc atttctagtc tctttctcgg tctagctagt 11220
tttactacat cgcgaagata gaatcttaga tcacactgcc tttgctgagc tggatcaata 11280
gagtaacaaa agagtggtaa ggcctcgtta aaggacaagg acctgagcgg aagtgtatcg 11340
tacagtagac ggagtatcta gtatagtcta tagtccgtgg aattaattct catctttgac 11400
agcttatcat cgataagcta gcttttcaat tcaattcatc attttttttt tattcttttt 11460
tttgatttcg gtttctttga aatttttttg attcggtaat ctccgaacag aaggaagaac 11520
gaaggaagga gcacagactt agattggtat atatacgcat atgtagtgtt gaagaaacat 11580
gaaattgccc agtattctta acccaactgc acagaacaaa aacctgcagg aaacgaagat 11640
aaatcatgtc gaaagctaca tataaggaac gtgctgctac tcatcctagt cctgttgctg 11700
ccaagctatt taatatcatg cacgaaaagc aaacaaactt gtgtgcttca ttggatgttc 11760
gtaccaccaa ggaattactg gagttagttg aagcattagg tcccaaaatt tgtttactaa 11820
aaacacatgt ggatatcttg actgattttt ccatggaggg cacagttaag ccgctaaagg 11880
cattatccgc caagtacaat tttttactct tcgaagacag aaaatttgct gacattggta 11940
atacagtcaa attgcagtac tctgcgggtg tatacagaat agcagaatgg gcagacatta 12000
cgaatgcaca cggtgtggtg ggcccaggta ttgttagcgg tttgaagcag gcggcagaag 12060
aagtaacaaa ggaacctaga ggccttttga tgttagcaga attgtcatgc aagggctccc 12120
tatctactgg agaatatact aagggtactg ttgacattgc gaagagcgac aaagattttg 12180
ttatcggctt tattgctcaa agagacatgg gtggaagaga tgaaggttac gattggttga 12240
ttatgacacc cggtgtgggt ttagatgaca agggagacgc attgggtcaa cagtatagaa 12300
ccgtggatga tgtggtctct acaggatctg acattattat tgttggaaga ggactatttg 12360
caaagggaag ggatgctaag gtagagggtg aacgttacag aaaagcaggc tgggaagcat 12420
atttgagaag atgcggccag caaaactaaa aaactgtatt ataagtaaat gcatgtatac 12480
taaactcaca aattagagct tcaatttaat tatatcagtt attacccatt gaaaaaggaa 12540
gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct 12600
tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 12660
tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg 12720
ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg atacactatt 12780
atcccgtatt gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga 12840
cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga 12900
attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 12960
gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg 13020
ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgaga gtgacaccac 13080
gatgcctgta gcaatgccaa caacgttgcg caaactatta actggcgaac tacttactct 13140
agcttcccgg caacaattaa tagactgaat ggaggcggat aaagttgcag gaccacttct 13200
gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg 13260
gtctcgcggt atcattgcag cactggggcc agatggtaag cgctcccgta tcgtagttat 13320
ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg 13380
tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat 13440
tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct 13500
catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 13560
gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 13620
aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 13680
gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag tgtagccgta 13740
gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct 13800
gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg 13860
atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag 13920
cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc 13980
cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg 14040
agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 14100
tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 14160
gaaaaacgcc agcaacgcgg cctttttacg gttcctgggc ttttgctggc cttttgctca 14220
catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg cctttgagtg 14280
agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc 14340
ggaagagcgc ccaatacgca aaccgcctct ccccgcgcgt tggccgattc attaatgcag 14400

Claims (10)

1.一种构建柚皮素生产菌的方法,其特征在于,包括以下步骤:
A.构建表达芳香族氨基酸合成模块的质粒A,该质粒A用于中断酿酒酵母基因组中编码苯丙酮酸脱羧酶aro10的aro10基因,并且在酿酒酵母基因组中aro10基因位点整合编码DHAP合成酶aro4的aro4基因、编码分支酸变位酶aro7的aro7基因、编码五功能蛋白aro1的aro1基因、编码分支酸合酶aro2的aro2基因;
B.构建表达香豆酸和桂皮酸生成模块的质粒B,该质粒B用于中断酿酒酵母基因组中编码丙酮酸脱羧酶PDC5的pdc5基因,并且在酿酒酵母基因组中pdc5基因位点整合编码预苯酸脱氢酶TYR1的tyr1基因、编码苯丙氨酸解氨酶PAL2的PAL2基因、编码酪氨酸解氨酶TAL的TAL基因;
C.构建表达柚皮素合成模块的质粒C,该质粒C用于在酿酒酵母基因组中整合编码肉桂酸-4-羟化酶C4H的C4H基因、编码4-香豆酸-CoA连接酶4CL的4CL基因、编码查尔酮黄烷酮异构酶CHI的CHI基因、编码查尔酮合酶CHS的CHS基因;
D.将步骤A中得到的质粒A、步骤B中得到的质粒B、步骤C中得到的质粒C转化入酿酒酵母中,从转化子中筛选得到阳性克隆;
E.从阳性克隆中筛选出产柚皮素的工程菌。
2.如权利要求1所述的方法,其特征在于,步骤A中所述aro10基因和/或步骤B中所述pdc5基因的中断通过基因编辑技术实施。
3.如权利要求1所述的方法,其特征在于,步骤D中宿主酿酒酵母是酿酒酵母BY4742。
4.如权利要求1所述的方法,其特征在于,步骤A中所述DHAP合成酶(NCBI-Gene ID:852551,含K229L突变)、分支酸变位酶(NCBI Gene ID:856173,含G141S突变)、五功能蛋白(NCBI Gene ID:851705)、分支酸合酶(NCBI Gene ID:852729)都是来源于酿酒酵母自身;并且/或者
步骤B中所述预苯酸脱氢酶TYR1来源于酿酒酵母自身(NCBI Gene ID:852464),苯丙氨酸解氨酶来源于拟南芥(NCBI Gene ID:824493)、酪氨酸解氨酶来源于约氏黄杆菌(GenBank:KR095306.1);并且/或者
步骤C中所述肉桂酸-4-羟化酶来源于拟南芥(NCBI Gene ID:817599),4-香豆酸-CoA连接酶来源于拟南芥(NCBI Gene ID:841593,含I250L和I461V突变),查尔酮黄烷酮异构酶来源于拟南芥(NCBI Gene ID:824678),查尔酮合酶来源于矮牵牛(GenBank:KF765781.1)。
5.如权利要求1所述的方法,其特征在于,质粒A的骨架质粒是载体pUC或者pYES2,质粒B的骨架质粒是载体pUC或者pYES2,质粒C的骨架质粒是载体pUC或者pYES2。
6.如权利要求4所述的方法,其特征在于,质粒A的核苷酸序列为SEQ ID NO:1;质粒B的核苷酸序列为SEQ ID NO:2;质粒C的核苷酸序列为SEQ ID NO:4。
7.如权利要求1所述的方法,其特征在于,步骤E中筛选出的工程菌发酵时不产生苯丙氨酸和/或酪氨酸。
8.一种柚皮素生产菌,其特征在于,通过如权利要求1-7中任一项所述的方法构建得到。
9.如权利要求8所述的柚皮素生产菌在生产柚皮素中的应用。
10.如权利要求9所述的应用,其特征在于,通过菌株的发酵来生产柚皮素。
CN202111126967.0A 2021-09-26 2021-09-26 一种产柚皮素的酿酒酵母菌 Active CN113862166B (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111126967.0A CN113862166B (zh) 2021-09-26 2021-09-26 一种产柚皮素的酿酒酵母菌

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111126967.0A CN113862166B (zh) 2021-09-26 2021-09-26 一种产柚皮素的酿酒酵母菌

Publications (2)

Publication Number Publication Date
CN113862166A true CN113862166A (zh) 2021-12-31
CN113862166B CN113862166B (zh) 2024-04-02

Family

ID=78994303

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111126967.0A Active CN113862166B (zh) 2021-09-26 2021-09-26 一种产柚皮素的酿酒酵母菌

Country Status (1)

Country Link
CN (1) CN113862166B (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114941001A (zh) * 2022-04-29 2022-08-26 浙江工业大学 酿酒酵母产樱花素代谢工程菌株的构建方法及其应用
CN117363504A (zh) * 2023-12-04 2024-01-09 潍坊医学院 一种同时生产棕矢车菊素、泽兰林素的酿酒酵母工程菌及其构建方法与应用

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106929439A (zh) * 2017-04-11 2017-07-07 天津大学 一种重组酿酒酵母及其构建方法与应用
WO2021053513A1 (en) * 2019-09-18 2021-03-25 Eleszto Genetika, Inc. Methods and microorganisms for producing flavonoids
CN113403334A (zh) * 2021-06-11 2021-09-17 江南大学 一组用于酿酒酵母多拷贝整合的质粒工具包

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106929439A (zh) * 2017-04-11 2017-07-07 天津大学 一种重组酿酒酵母及其构建方法与应用
WO2021053513A1 (en) * 2019-09-18 2021-03-25 Eleszto Genetika, Inc. Methods and microorganisms for producing flavonoids
CN113403334A (zh) * 2021-06-11 2021-09-17 江南大学 一组用于酿酒酵母多拷贝整合的质粒工具包

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LYU X ET AL: "Enhancement of Naringenin Biosynthesis from Tyrosine by Metabolic Engineering of Saccharomyces cerevisiae", 《J AGRIC FOOD CHEM》, vol. 65, no. 31 *
张伟: "产对香豆酸及其衍生物的酿酒酵母菌株的构建与优化", 《中国优秀硕士学位论文全文数据库工程科技Ⅰ辑》, no. 09, pages 024 - 69 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114941001A (zh) * 2022-04-29 2022-08-26 浙江工业大学 酿酒酵母产樱花素代谢工程菌株的构建方法及其应用
CN114941001B (zh) * 2022-04-29 2024-05-10 浙江工业大学 酿酒酵母产樱花素代谢工程菌株的构建方法及其应用
CN117363504A (zh) * 2023-12-04 2024-01-09 潍坊医学院 一种同时生产棕矢车菊素、泽兰林素的酿酒酵母工程菌及其构建方法与应用
CN117363504B (zh) * 2023-12-04 2024-02-23 潍坊医学院 一种同时生产棕矢车菊素、泽兰林素的酿酒酵母工程菌及其构建方法与应用

Also Published As

Publication number Publication date
CN113862166B (zh) 2024-04-02

Similar Documents

Publication Publication Date Title
CN113862166B (zh) 一种产柚皮素的酿酒酵母菌
CN112921054B (zh) 一种用于治疗β-地中海贫血的慢病毒载体及其制备方法和应用
US5286636A (en) DNA cloning vectors with in vivo excisable plasmids
CN105368732B (zh) 一株产木糖醇的工业酿酒酵母菌株及构建方法
CN104593413A (zh) 利用家蚕后部丝腺合成分泌人血清白蛋白的方法
IL174489A (en) A method for preparing packaging cells containing adenovirus virus sequences encoding e1a, and e1b, recombinant molecules used in the method and cells containing these molecules
CN114085858B (zh) L-丝氨酸生产菌及其构建方法
CN104962576B (zh) 一种柱状黄杆菌基因定向敲除质粒及应用
CN112266914B (zh) 一种熊蜂生假丝酵母强组成型启动子及其应用
CN101838663A (zh) 一种大肠杆菌-棒状杆菌穿梭组成型表达载体及其构建方法
CN109234318B (zh) 一种提高红曲霉菌胞外色素的方法
CN110804559B (zh) 一株重组产黄青霉基因工程菌及其构建方法与应用
CN112760342B (zh) 一种用于glp-1及其类似物活性测定的慢病毒载体及细胞株
CN107267538B (zh) 一种植物质体表达载体的构建方法及应用
CN113755442B (zh) 一种用于药物活性测定的细胞株及其制备方法与应用
CN110452893B (zh) 一种高保真CRISPR/AsCpf1突变体的构建及其应用
CN111378684B (zh) 一种热诱导的基因编辑系统CRISPR-Cas12b在陆地棉中的应用
CN114107369A (zh) 一种myc标签融合表达载体的制备方法及其应用
CN114540355A (zh) Hhex软骨组织特异性敲除小鼠动物模型及其构建方法
CN113151276A (zh) 一种il-4基因缺失斑马鱼
CN110331170A (zh) 一种双重gRNA的基因表达元件及其构建方法与应用
CN109777829A (zh) 一种基因编辑U6启动子驱动的sgRNA表达组件的构建方法
CN113614229A (zh) 遗传修饰的梭菌属细菌、其制备和用途
CN107501406A (zh) 一种重组牛β‑乳球蛋白及其制备方法和应用
CN112760241B (zh) 一株重组产黄青霉基因工程菌及其构建方法与应用

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant