CN102695796A - 细胞、核酸、酶和它们用于生产槐糖脂的用途以及方法 - Google Patents

细胞、核酸、酶和它们用于生产槐糖脂的用途以及方法 Download PDF

Info

Publication number
CN102695796A
CN102695796A CN2010800616564A CN201080061656A CN102695796A CN 102695796 A CN102695796 A CN 102695796A CN 2010800616564 A CN2010800616564 A CN 2010800616564A CN 201080061656 A CN201080061656 A CN 201080061656A CN 102695796 A CN102695796 A CN 102695796A
Authority
CN
China
Prior art keywords
seq
enzyme
glucopyranosyl
sequence
octadecenoic acid
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2010800616564A
Other languages
English (en)
Other versions
CN102695796B (zh
Inventor
S.沙费尔
M.韦塞尔
A.蒂森胡森
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Evonik Operations GmbH
Original Assignee
Evonik Degussa GmbH
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Evonik Degussa GmbH filed Critical Evonik Degussa GmbH
Publication of CN102695796A publication Critical patent/CN102695796A/zh
Application granted granted Critical
Publication of CN102695796B publication Critical patent/CN102695796B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/44Preparation of O-glycosides, e.g. glucosides
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K31/00Medicinal preparations containing organic active ingredients
    • A61K31/70Carbohydrates; Sugars; Derivatives thereof
    • A61K31/7028Compounds having saccharide radicals attached to non-saccharide compounds by glycosidic linkages
    • A61K31/7034Compounds having saccharide radicals attached to non-saccharide compounds by glycosidic linkages attached to a carbocyclic compound, e.g. phloridzin
    • A61K31/704Compounds having saccharide radicals attached to non-saccharide compounds by glycosidic linkages attached to a carbocyclic compound, e.g. phloridzin attached to a condensed carbocyclic ring system, e.g. sennosides, thiocolchicosides, escin, daunorubicin
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0071Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1025Acyltransferases (2.3)
    • C12N9/1029Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1048Glycosyltransferases (2.4)
    • C12N9/1051Hexosyltransferases (2.4.1)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/14Preparation of compounds containing saccharide radicals produced by the action of a carbohydrase (EC 3.2.x), e.g. by alpha-amylase, e.g. by cellulase, hemicellulase

Landscapes

  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Engineering & Computer Science (AREA)
  • Wood Science & Technology (AREA)
  • Genetics & Genomics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Medicinal Chemistry (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Epidemiology (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Animal Behavior & Ethology (AREA)
  • Public Health (AREA)
  • Veterinary Medicine (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Cosmetics (AREA)
  • Peptides Or Proteins (AREA)
  • Catching Or Destruction (AREA)

Abstract

本发明涉及细胞、核酸和酶,涉及它们用于生产槐糖脂的用途,也涉及用于生产槐糖脂的方法。

Description

细胞、核酸、酶和它们用于生产槐糖脂的用途以及方法
技术领域
本发明涉及核酸、酶和细胞以及它们用于生产槐糖脂的用途,也涉及用于生产槐糖脂的方法。
背景技术
目前,表面活性剂基本上基于石化原料的基础来生产。由于可预见的石化原料的短缺以及日益增加的对基于可再生原料的和/或生物可降解的产品的需求,基于可再生原料的表面活性剂的使用是一种合适的替代方案。
槐糖脂具有用作表面活性剂所需要的表面活性性质。
这些脂类目前使用多种酵母菌、特别是假丝酵母菌Candida bombicola)的野生型分离物来生产。
迄今为止,仅通过过程控制(pH值、氧供应、培养基组成、饲喂策略、氮供应、温度、底物的选择等)的优化进行产物形成的性能参数的改善,所述性能参数诸如碳产率、空间时间产率、产物浓度、产物同质性(乙酰化程度, 脂肪酸种类, 内酯形式对开链形式)。
唯一的例外是如下的假丝酵母菌的遗传修饰,其中β-氧化已经被排除,使得作为底物而补料的甘油三酯、脂肪酸、脂肪醇等不能够再被用作碳源,换而言之,被降解(Van Bogaert等人FEMS Yeast Res. 2009年6月;9(4):610-7)。以此方式,通过选择底物,应该能够有针对性地控制槐糖脂的脂肪酸比例,从而影响产物性质。
由于通过优化过程控制仅能有限地改善槐糖脂的生物技术生产的性能参数,所以还必须进行遗传修饰。所述遗传修饰一方面包括增强参与槐糖脂合成的酶:细胞色素P450单加氧酶、糖基转移酶I、糖基转移酶II、乙酰基转移酶、槐糖脂输出体(Exporter),目的是改善产物形成的性能参数,诸如碳产率、空间时间产率、产物浓度、产物同质性(乙酰化程度、脂肪酸物质)等。所述遗传修饰另一方面包括减弱一些参与槐糖脂合成的酶:糖基转移酶II、乙酰基转移酶,目的是修饰生产的槐糖脂的结构和性质:糖基转移酶II:生产单糖基槐糖脂;乙酰基转移酶:生产未乙酰化的槐糖脂。
如果要在清洁用途、化妆品用途和其它用途中大规模地使用槐糖脂作为表面活性剂,它们必须与目前使用的表面活性剂相竞争。目前使用的表面活性剂是可以以非常低的成本生产的大宗化学品。因此,必须以尽可能最低的成本来生产槐糖脂。这不能仅仅通过过程优化来优化性能参数来实现。
因此,存在日益增加的对具有高产品产率的槐糖脂有效生产的需求。
因此,本发明以这样的目的为基础,即提供工具和/或方法,借助于所述工具和/或方法,可以以简单且大量地合成特定槐糖脂。
发明内容
令人惊奇地,已经发现,在下文中所述的细胞、核酸、多肽和方法能够达成上述目的。
因此,本发明的主题是,具有经改变的酶装置的遗传修饰的细胞,所述细胞用于合成槐糖脂。
本发明的另一个主题是,如在权利要求11和12中所述的新的核酸和载体。
本发明还有一个主题是,可用于槐糖脂生物合成中的新的酶。
本发明的优点是,不仅改善了槐糖脂形成的性能参数,诸如碳产率和空间时间产率,而且也可以改善例如乙酰化程度和脂肪酸种类方面的产物同质性。
本发明的一个主题是能够形成槐糖脂的细胞,对所述细胞已如此进行了遗传工程改变,使得所述细胞相对较于它的野生型而言具有一种经改变的如分别在下文中列举的选自下列所述组的至少一种酶的活性:
至少一种酶E1,该酶具有多肽序列Seq ID Nr. 7、Seq ID Nr. 53、Seq ID Nr. 55、Seq ID Nr. 57、Seq ID Nr. 59、Seq ID Nr. 61或Seq ID Nr. 63,尤其是Seq ID Nr. 7,或具有这样的多肽序列:其中相对于各个参照序列Seq ID Nr. 7、Seq ID Nr. 53、Seq ID Nr. 55、Seq ID Nr. 57、Seq ID Nr. 59、Seq ID Nr. 61或Seq ID Nr. 63、尤其是Seq ID Nr. 7,最多25%、优选最多20%、特别优选最多15%、尤其最多10、9、8、7、6、5、4、3、2、1%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有带有各参照序列Seq ID Nr. 7、Seq ID Nr. 53、Seq ID Nr. 55、Seq ID Nr. 57、Seq ID Nr. 59、Seq ID Nr. 61或Seq ID Nr. 63的酶的至少50%、优选65%、特别优选80%、尤其多于90%的酶的活性,其中将酶E1的酶的活性理解为将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的能力,
至少一种酶E2,该酶具有多肽序列Seq ID Nr. 8或Seq ID Nr. 11,或具有这样的多肽序列:其中相对于Seq ID Nr. 8或Seq ID Nr. 11,最多60%、优选最多25%、特别优选最多15%、尤其最多10、9、8、7、6、5、4、3、2、1%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有带有各参照序列Seq ID Nr. 8或Seq ID Nr. 11的酶的至少50%、优选65%、特别优选80%、尤其多于90%的酶的活性,其中将酶E2的酶的活性理解为将UDP-葡萄糖和17-羟基-Z-9-十八碳烯酸转化成17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸的能力,
至少一种酶E3,该酶具有多肽序列Seq ID Nr. 11,或具有这样的多肽序列:其中相对于Seq ID Nr. 11,最多60%、优选最多25%、特别优选最多15%、尤其最多10、9、8、7、6、5、4、3、2、1%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有带有参照序列Seq ID Nr. 11的酶的至少50%、优选地65%、特别优选地80%、尤其多于90%的酶的活性,其中将酶E3的酶的活性理解为将17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸和UDP-葡萄糖转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸的能力,
至少一种酶E4,该酶具有多肽序列Seq ID Nr. 9,或具有这样的多肽序列:其中相对于Seq ID Nr. 9,最多50%、优选最多25%、特别优选最多15%、尤其最多10、9、8、7、6、5、4、3、2、1%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有带有Seq ID Nr. 9的酶的至少50%、优选65%、特别优选80%、尤其多于90%的酶活性,其中将酶E4的酶的活性理解为将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯单乙酸酯,或将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯单乙酸酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯二乙酸酯,或将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯二乙酸酯的能力,其中第一个选项是优选的,
至少一种酶E5,该酶具有多肽序列Seq ID Nr. 10,或具有这样的多肽序列:其中相对于Seq ID Nr. 10,最多45%、优选最多25%、特别优选最多15%、尤其最多10、9、8、7、6、5、4、3、2、1%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有带有Seq ID Nr. 10的酶的至少50%、优选65%、特别优选80%、尤其多于90%的酶活性,其中酶E5的酶活性理解为将槐糖脂从细胞转移到周围的培养基中的能力。
在本发明的上下文中,表述“槐糖脂”理解为通式(Ia)和(Ib)的化合物
式(Ia)
Figure 155433DEST_PATH_IMAGE002
式(Ib)
其中
R1 = H或CO-CH3
R2 = H或CO-CH3
R3 =含有6-32个碳原子的,未置换或具有羟基官能置换的,无分支的,任选包含1-3个双键或三键的二价有机残基,
R4 = H、CH3或包含2-10个碳原子的,未置换或具有羟基官能置换的,无分支的,任选包含1-3个双键或三键的一价有机残基,且
n = 1或0。
在本发明上下文中,细胞的“野生型”优选理解为起始菌株,由所述起始菌株通过遗传工程操纵遗传元素产生根据本发明的细胞,所述细胞对上述Seq ID Nr.的酶的活性负责。
表述“酶的经改变的活性”优选地理解为表示,经改变的细胞内活性。
不导致给定多肽的性质和功能显著改变的给定多肽序列的氨基酸残基的改变是本领域技术人员已知的。因而,例如,可将所谓的保守氨基酸彼此交换;这样的合适的氨基酸置换的实例是:Ala替换Ser;Arg替换Lys;Asn替换Gln或His;Asp替换Glu;Cys替换Ser;Gln替换Asn;Glu替换Asp;Gly替换Pro;His替换Asn或Gln;Ile替换Leu或Val;Leu替换Met或Val;Lys替换Arg或Gln或Glu;Met替换Leu或Ile;Phe替换Met或Leu或Tyr;Ser替换Thr;Thr替换Ser;Trp替换Tyr;Tyr替换Trp或Phe;Val替换Ile或Leu。同样已知,尤其在多肽的N-或C-端例如以氨基酸插入或删除形式的改变通常对多肽的功能没有显著影响。
可以通过如下方法测定酶E1的活性:以技术人员已知的方式,破碎包含该活性的细胞,例如借助于球磨机、弗氏压碎器或超声波粉碎器,随后将完整的细胞、细胞碎片和破碎辅助物(例如,玻璃珠)通过在13 000 rpm和4℃下的10分钟的离心分离去除。然后,可以用得到的无细胞的粗提取物进行酶测定,并继之以产物的LC-ESI-MS检测。或者以本领域技术人员已知的方式,通过色谱法(诸如镍/次氮基三乙酸亲和色谱法、抗生蛋白链菌素亲和色谱法、凝胶过滤色谱法或离子交换色谱法)可以富集或者也可以纯化酶最高到均质。在总体积为200µl的200 mM磷酸钠缓冲液(pH 7.4)、0.5 mM NADPH、0.5 mM二硫苏糖醇、3 mM葡萄糖6-磷酸和0.5 U葡萄糖-6-磷酸脱氢酶和50 μl蛋白质粗提物(约1 mg总蛋白质)或纯化的蛋白质溶液(10µg纯化的蛋白质)中,可以进行标准测定。通过加入下述物质,开始反应:a)5 μl底物(Z-9-十八碳烯酸)在乙醇中的10 mM溶液,或b)5µl底物(Z-9-十八碳烯酸)在0.1% Triton X-100中的10 mM溶液,所述溶液事先已经通过2次超声波处理(每次30秒)进行预处理,并在30℃温育30分钟。此后,将该反应用200µl乙酸乙酯萃取。沉淀未溶解的组分,这里通过短暂离心(在16 100 g 5分钟)进行相分离,并借助于LC-ESI-MS来分析乙酸乙酯相。产物的鉴别通过分析相应的质量痕迹和MS2波谱进行。
可以通过如下方法测定酶E2的活性:以技术人员已知的方式,破碎包含该活性的细胞,例如借助于球磨机、弗氏压碎器或超声波粉碎器,随后将完整的细胞、细胞碎片和破碎辅助物(例如,玻璃珠)通过在13 000 rpm和4℃下的10分钟的离心分离去除。然后,可以用得到的无细胞的粗提取物进行酶测定,并继之以产物的LC-ESI-MS检测。或者以本领域技术人员已知的方式,通过色谱法(诸如镍/次氮基三乙酸亲和色谱法、抗生蛋白链菌素亲和色谱法、凝胶过滤色谱法或离子交换色谱法)可以富集或者也可以纯化酶最高到均质。标准测定可以由185 μl 10 mM Tris-HCl(pH 7.5)、10 μl 125 mM UDP-葡萄糖和50 μl蛋白质粗提物(约1 mg总蛋白)或纯化的蛋白溶液(10µg纯化的蛋白)组成。通过加入下述物质,开始反应:a)5 μl底物(例如,18-羟基-Z-9-十八碳烯酸)在乙醇中的10 mM溶液,或b)5µl底物(例如,18-羟基-Z-9-十八碳烯酸)在0.1% Triton X-100中的10 mM溶液,所述溶液事先已经通过2次超声波处理(每次30秒)进行预处理,并在30℃温育30分钟。此后,将该反应用200µl乙酸乙酯萃取。沉淀未溶解的组分,这里通过短暂离心(在16 100 g 5分钟)进行相分离,并借助于LC-ESI-MS来分析乙酸乙酯相。产物的鉴别通过分析相应的质量迹线和MS2波谱进行。在该测定中,优选使用18-羟基-Z-9-十八碳烯酸作为底物,因为它是商业可得到的,也因为已经在不同地方证实,在槐糖脂生物合成过程中,槐糖脂生物合成的酶不仅接受18-羟基-Z-9-十八碳烯酸、17-羟基-Z-9-十八碳烯酸也接受其它链长的(饱和的或不饱和的)和在ω-或ω-1-碳上羟基化的羟基脂肪酸作为底物,如同也接受在槐糖脂生物合成进程中从它们产生的单葡糖苷和二葡糖苷作为底物(Asmer, H.J., Lang, S., Wagner, F., Wray, V. (1988). Microbial production, structure elucidation and bioconversion of sophorose lipids. J. Am. Oil Chem. Soc. 65:1460–1466; Nunez, A., Ashby, R., Foglia, T.A. 等人(2001). Analysis and characterization of sophorolipids by liquid chromatography with atmospheric pressure chemical ionization. Chromatographia 53:673–677; Ashby, R.D., Solaiman, D.K., Foglia, T.A. (2008). Property control of sophorolipids: influence of fatty acid substrate and blending. Biotechnology Letters 30:1093-1100)。
可以通过如下方法测定酶E3的活性:以技术人员已知的方式,破碎包含该活性的细胞,例如借助于球磨机、弗氏压碎器或超声波粉碎器,随后将完整的细胞、细胞碎片和破碎辅助物(例如,玻璃珠)通过在13 000 rpm和4℃下的10分钟的离心分离去除。然后,可以用得到的无细胞的粗提取物进行酶测定,并继之以产物的LC-ESI-MS检测。或者以本领域技术人员已知的方式,通过色谱法(诸如镍/次氮基三乙酸亲和色谱法、抗生蛋白链菌素亲和色谱法、凝胶过滤色谱法或离子交换色谱法)可以富集或者也可以纯化酶最高到均质。标准测定可以由185 μl 10 mM Tris-HCl(pH 7.5)、10 μl 125 mM UDP-葡萄糖和50 μl蛋白质粗提物(约1 mg总蛋白)或纯化的蛋白溶液(10µg纯化的蛋白)组成。通过加入下述物质,开始反应:a)5 μl底物(例如,18-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸)在乙醇中的10 mM溶液,或b)5µl底物(18-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸)在0.1% Triton X-100中的10 mM溶液,所述溶液事先已经通过2次超声波处理(每次30秒)进行预处理,或c)通过加入关于酶E2的活性测定所述的反应混合物,并在30℃温育30分钟。此后,将反应用200µl(如在a)和b)中所述,加入的底物)或400µl(如在c)中所述,加入的底物)的乙酸乙酯萃取反应物。沉淀未溶解的组分,这里通过短暂离心(在16 100 g 5分钟)进行相分离,并借助于LC-ESI-MS来分析乙酸乙酯相。产物的鉴别通过分析相应的质量迹线和MS2波谱进行。在该测定中,优选使用18-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸作为底物,因为它的前体分子18-羟基-Z-9-十八碳烯酸是商业可得到的,也因为已经在不同地方证实,在槐糖脂生物合成过程中,槐糖脂生物合成的酶不仅接受18-羟基-Z-9-十八碳烯酸、17-羟基-Z-9-十八碳烯酸也接受其它链长的(饱和的或不饱和的)和在ω-或ω-1-碳上羟基化的羟基脂肪酸作为底物,如同也接受在槐糖脂生物合成进程中从它们产生的单葡糖苷和二葡糖苷作为底物。
可以通过如下方法测定酶E4的活性:以技术人员已知的方式,破碎包含该活性的细胞,例如借助于球磨机、弗氏压碎器或超声波粉碎器,随后将完整的细胞、细胞碎片和破碎辅助物(例如,玻璃珠)通过在13 000 rpm和4℃下的10分钟的离心分离去除。然后,可以用得到的无细胞的粗提取物进行酶测定,并继之以产物的LC-ESI-MS检测。或者以本领域技术人员已知的方式,通过色谱法(诸如镍/次氮基三乙酸亲和色谱法、抗生蛋白链菌素亲和色谱法、凝胶过滤色谱法或离子交换色谱法)可以富集或者也可以纯化酶最高到均质。标准测定可以由185 μl 10 mM Tris-HCl(pH 7.5)、2.5 μl 100 mM乙酰基-辅酶A和50 μl蛋白质粗提物(约1 mg总蛋白)或纯化的蛋白溶液(10µg纯化的蛋白)组成。通过加入下述物质,开始反应:a)5 μl底物(化学上去乙酰化的槐糖脂)在乙醇中的10 mM溶液,或b)5µl底物(化学上去乙酰化的槐糖脂)在0.1% Triton X-100中的10 mM溶液,所述溶液事先已经通过2次超声波处理(每次30秒)进行预处理,或c)通过加入用于测定酶E3的活性所述的反应混合物(按照该处在c以下)所述的底物加入的方式,随后在30℃温育30分钟。此后,将反应用200µl(如在a)和b)中所述,加入的底物)或600µl(如在c)中所述,加入的底物)的乙酸乙酯萃取反应物。沉淀未溶解的组分,这里通过短暂离心(在16 100 g 5分钟)进行相分离,并借助于LC-ESI-MS来分析乙酸乙酯相。产物的鉴别通过分析相应的质量迹线和MS2波谱进行。根据本发明优选的是,酶E4不仅接受槐糖脂的内酯形式(如在这里选作参照活性)作为底物,而且也能够在合适的位置至少一次将槐糖脂的酸形式乙酰化,如式(Ia)一般地所示,其中R1和R2 = H。
通过每个细胞的酶E5的绝对量,可以以最简单的方式间接地测定酶E5与它的野生型相比改变的活性,因为可以认为,基于细胞,存在增加会造成活性增加,存在减少会造成活性减少,并且这些关系直接地彼此依赖。与野生型相比改变的酶E5的存在可以用常规方法测定。因而,可以使用对待检测蛋白质特异性的抗体通过蛋白印迹杂交(Sambrook等人, Molecular Cloning: a laboratory manual, 第2版Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. USA, 1989),随后用适合的用于浓度测定的软件进行光学评价(Lohaus和Meyer(1989)Biospektrum, 5: 32-39; Lottspeich(1999), Angewandte Chemie 111: 2630-2647)来分析蛋白质浓度。
根据本发明优选的细胞是微生物,优选细菌细胞、酵母菌细胞或真菌细胞,其中假丝酵母属和Wickerhamiella属的子囊菌,尤其是假丝酵母菌Candida bogoriensisCandida batistaeCandida apicola拟威 克酵母Wickerhamiella domericqiae ,是特别优选的。
尤其是菌株 丝酵母菌ATCC 22214、Candida bogoriensis NRRL Y-5980、Candida batistae CBS 8550、Candida apicola IMET 42747和拟威 克酵母是特别合适的细胞。
由于槐糖脂由根据本发明的细胞从葡萄糖和脂肪酸形成,所以如果根据本发明的细胞在其β-氧化中被至少部分地阻断,则是有利的,因为这会防止底物的流失,并从而使得更高的产物浓度和碳产率成为可能。例如在WO 03/100013中描述了在β-氧化中被阻断的假丝酵母属细胞,在Van Bogaert等人FEMS Yeast Res. 2009年6月;9(4):610-7中描述了β-氧化被阻断的假丝酵母菌细胞。
在根据本发明优选的细胞中,改变的酶的活性优选是酶的活性增加。
根据本发明,优选具有活性增加的下列酶的组合的细胞:
E1E2、E1E3、E1E4、E1E5、E2E3、E2E4、E2E5、E3E4、E3E5、E4E5、E1E2E3、E1E2E4、E1E2E5、E1E3E4、E1E3E5、E1E4E5、E2E3E4、E2E4E5、E3E4E5、E1E2E3E4、E2E3E4E5、E1E3E4E5、E1E2E4E5、E1E2E3E5、E1E2E3E4和E1E2E3E4E5
其中组合
E1E2、E1E3、E1E4、E1E5、E2E3、E2E4、E2E5、E3E4、E3E5、E4E5、E1E2E3、E1E2E4、E1E2E5、E1E3E4、E1E3E5、E1E4E5、E2E3E4、E2E4E5、E3E4E5和E1E2E3E4E5
特别是
E1E2、E1E3、E1E4、E1E5、E2E3、E2E4、E2E5、E3E4、E3E5、E4E5和E1E2E3E4E5
是优选的。
为了制备通式(Ia)(其中n = 0)的槐糖脂,在细胞中应当存在尽可能少的酶E3的酶的活性。因而,在根据本发明的细胞的一个确定的实施方式中,改变的酶E3的活性是减少的活性。
在这样的情况下,根据本发明优选的细胞是这样的细胞:其具有的酶E3的活性减少,并任选同时具有酶E1、E2、E4和E5的至少一种的活性增加,且其特别地除了酶E3的活性减少以外,具有下述酶组合的活性增加:
E1E2、E1E4、E1E5、E2E4、E2E5、E4E5、E1E2E4、E1E2E5、E1E4E5和E1E2E4E5
特别优选
E1E2、E1E4、E1E5、E2E4、E2E5、E4E5和E1E2E4E5
在这样的情况下,根据本发明的细胞优选是假丝酵母菌 -Candida bogoriensis-Candida batistae-Candida apicola-拟威克 酵母细胞。
此外这里优选的是这样的根据本发明的细胞,其中通过基因的修饰来实现酶的活性的降低,所述基因包含选自下述的核酸序列:Seq ID Nr. 6,和与参照序列Seq ID Nr. 6至少80%、特别优选至少90%、此外优选至少95%和最优选至少99%相同的序列,
其中所述修饰选自优选由下述修饰组成的组:
在所述基因中插入外源DNA、所述基因的至少部分删除、所述基因序列中的点突变、RNA干扰(siRNA)、反义RNA或侧接所述基因的调节序列的修饰(插入、删除或点突变)。
适用于制备这样的细胞的核酸例如是具有Seq ID Nr. 16的核酸,其也是本发明的主题。
为了制备通式(Ia)或(Ib)(其中R1和R2为H)的槐糖脂,在细胞中应当存在尽可能少的酶E4的酶的活性。因而,在根据本发明的细胞的一个确定的实施方式中,改变的酶E4的活性是减少的活性。
在这种情况下,根据本发明优选的细胞是这样的细胞:其具有的至少一种酶E4的活性减少,并任选同时具有的酶E1、E2、E3和E5的至少一种的活性增加,且其特别地除了具有酶E4活性减少以外,还具有下述酶组合的活性增加:
E1E2、E1E3、E1E5、E2E3、E2E5、E3E5、E1E2E3、E1E2E5、E1E3E5和E1E2E3E5
特别优选
E1E2、E1E3、E1E5、E2E3、E2E5、E3E5和E1E2E3E5
在这种情况下,根据本发明的细胞优选是假丝酵母菌 -Candida bogoriensis-Candida batistae-Candida apicola-或拟威克酵母细胞。
此外这里优选的是这样的根据本发明的细胞,其中通过基因的修饰来实现酶的活性的降低,所述基因包含选自下述的核酸序列:Seq ID Nr. 4,和与Seq ID Nr. 4至少80%、特别优选至少90%、此外优选至少95%和最优选至少99%相同的序列,
其中所述修饰选自优选由下述修饰组成的组:
在所述基因中插入外源DNA、所述基因的至少部分删除、所述基因序列中的点突变、RNA干扰(siRNA)、反义RNA或侧接所述基因的调节序列的修饰(插入、删除或点突变)。
适用于制备这样的细胞的核酸例如是具有Seq ID Nr. 14的核酸,其也是本发明的主题。
为了制备通式(Ia)(其中n = 0和R1为H)的槐糖脂,在细胞中应当存在尽可能少的酶E3和E4的酶的活性。因而,在根据本发明的细胞的一个确定的实施方式中,修饰的酶E3和E4的活性是减少的活性。
在这种情况下,根据本发明优选的细胞是这样的细胞:其具有的酶E3和E4的各至少一种的活性减少,并同时具有的酶E1、E2和E5的至少一种的活性增加,且其尤其除了酶E3和E4的各至少一种的活性减少以外,还具有下述酶组合的活性增加:
E1E2、E1E5、E2E5、E1E2E5
特别优选
E1E2、E1E5和E2E5
在这种情况下,根据本发明的细胞优选是假丝酵母菌 -Candida bogoriensis-Candida batistae-Candida apicola-或拟威克酵母细胞。
此外这里优选的是这样的根据本发明的细胞,其中通过基因的修饰来实现酶的活性的降低,所述基因包含选自下述的核酸序列:Seq ID Nr. 4,和与Seq ID Nr. 4至少80%、特别优选至少90%、此外优选至少95%和最优选至少99%相同的序列,
包含选自下述的核酸序列的基因:Seq ID Nr. 6,和与参照序列Seq ID Nr. 6至少80%、特别优选至少90%、此外优选至少95%和最优选至少99%相同的序列,
其中所述修饰选自优选由下述修饰组成的组:
在所述基因中插入外源DNA、所述基因的至少部分删除、所述基因序列中的点突变、RNA干扰(siRNA)、反义RNA或侧接所述基因的调节序列的修饰(插入、删除或点突变)。
适用于制备这样的细胞的核酸是例如具有Seq ID Nr. 14和16的那些。
在下文中用于增加细胞中酶的活性所述的内容,既适用于增加酶E1至E5的活性,也适用于在下文中所有提及的其活性可以任选增加的酶。
原则上,可以这样实现酶的活性的增加:增加编码所述酶的一个或多个基因序列的拷贝数、使用强启动子、改变所述基因的密码子使用、以不同的方式增加mRNA或酶的半衰期、修饰基因表达的调节或使用编码具有增加的活性的合适酶的基因或等位基因,和任选地组合这些措施。例如,使用载体通过转化、转导、接合或这些方法的组合制备根据本发明遗传修饰的细胞,所述载体包含希望的基因、该基因的等位基因或其部分和启动子,所述启动子使得该基因的表达成为可能。特别地,通过将所述基因或所述等位基因整合进细胞的染色体中或染色体外复制的载体中,实现异源表达。
关于增加细胞中酶(例如酶异柠檬酸裂合酶)的活性的可能性的综述由EP0839211(这里将它作为引用并入本文)给出,且它的关于增加细胞中酶的活性的可能性的公开内容,形成本发明的公开内容的一部分。
上文所述的以及下文提及的所有酶或基因的表达可以借助于1维和2维蛋白凝胶分离,和随后使用合适的评价软件在凝胶中对蛋白质浓度进行光学识别。如果酶的活性的增加仅基于相应基因的表达的增加,则以简单的方式通过在野生型和经遗传工程改变的细胞之间对比1维和2维的蛋白质分离,可以定量测定酶的活性增加。制备棒状杆菌的蛋白质凝胶和鉴定蛋白质的常规方法,是由Hermann等人(Electrophoresis, 22: 1712.23(2001))所述的做法。同样可以使用对待检测蛋白质特异性的抗体通过蛋白印迹杂交(Sambrook等人, Molecular Cloning: a laboratory manual, 第2版Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. USA, 1989),随后用适合的用于浓度测定的软件进行光学评价(Lohaus和Meyer(1989)Biospektrum, 5: 32-39; Lottspeich(1999), Angewandte Chemie 111: 2630-2647)来分析蛋白质浓度。借助于DNA条带移位分析(也称作凝胶阻滞)(Wilson等人(2001)Journal of Bacteriology, 183: 2151-2155),可以测量DNA结合蛋白的活性。通过不同的充分描述的报高基因分析的方法(Sambrook等人, Molecular Cloning: a laboratory manual, 第2版Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. USA, 1989),可以证明DNA结合蛋白对其它基因的表达的影响。可以根据不同的描述的方法(Donahue等人(2000)Journal of Bacteriology 182(19): 5624-5627; Ray等人(2000)Journal of Bacteriology 182(8): 2277-2284; Freedberg等人(1973)Journal of Bacteriology 115 (3): 816-823)测定细胞内酶的(比)活性。只要在下文中没有阐明用于测定某种酶的活性的具体方法,优选通过在下述文献中描述的方法,测定酶的活性的增加以及酶的活性的减少:Hermann等人, Electrophoresis, 22: 1712-23(2001), Lohaus等人, Biospektrum 5 32-39(1998), Lottspeich, Angewandte Chemie 111: 2630-2647(1999)和Wilson等人, Journal of Bacteriology 183: 2151-2155(2001)。
如果通过内源基因的突变来增加酶的活性,则可以如下产生这样的突变:使用传统的方法以非定向方式产生,例如,通过紫外线辐照或通过触发诱变的化学试剂,或者,有针对性地借助于遗传工程方法,诸如删除、插入和/或核苷酸替代。通过这些突变得到改变的细胞。特别优选的酶的突变体尤其也是这样的酶:其不能再进行反馈抑制,或与野生型酶相比至少可减少反馈抑制。
如果要通过增加酶的合成来实现酶的活性的增加,则例如可以增加相应基因的拷贝数,或者突变位于结构基因上游的启动子区和调节区或核糖体结合位点。构建在结构基因上游的表达盒以同样的方式起作用。另外,通过可诱导的启动子可以在每一个任何意的时间点增加表达。此外,还可以将酶基因归入为调节序列(也称作“增强子”),其通过改善RNA聚合酶和DNA之间的相互作用,同样起增加基因表达的作用。通过延长mRNA的寿命的措施同样改善表达。此外,通过防止酶降解,同样增加酶的活性。在这里,基因或基因构建存在于具有不同拷贝数的质粒中,或者整合在染色体中并扩增。此外,作为一个替代方案,可以通过改变培养基组成和培养管理,可以实现相关基因的过表达。为此,本领域技术人员尤其在下述文献中找到说明:Martin等人(Bio/Technology 5, 137-146(1987)),Guerrero等人(Gene 138, 35-41(1994)), Tsuchiya和Morinaga(Bio/Technology 6, 428-430(1988)),Eikmanns等人(Gene 102, 93-98(1991)),在EP-A-0 472 869中,在US 4,601,893中,Schwarzer和Pühler(Bio/Technology 9, 84-87(1991)),Reinscheid等人(Applied and Environmental Microbiology 60, 126-132(1994)),LaBarre等人(Journal of Bacteriology 175, 1001-1007(1993)),在WO-A-96/15246中,Malumbres等人(Gene 134, 15-24(1993)),JP-A-10-229891,Jensen和Hammer(Biotechnology and Bioengineering 58, 191-195(1998)),以及遗传学和分子生物学的已知教科书中。上述的措施与突变一样导致遗传工程改变的细胞。
例如,使用附加体质粒来增加各基因的表达。原则上,所有供本领域技术人员用于此目的的实施方式都适合作为质粒和载体。这样的质粒和载体可以由例如Novagen、Promega、Promega、New England Biolabs、Clontech或Gibco BRL公司的小册子获知。其它优选的质粒和载体可以见于:Glover, D. M. (1985), DNA cloning: a practical approach, Vol. I-III, IRL Press Ltd., Oxford; Rodriguez, R.L. 和Denhardt, D. T(编)(1988), Vectors: a survey of molecular cloning vectors and their uses, 179-204, Butterworth, Stoneham; Goeddel, D. V. (1990), Systems for heterologous gene expression, Methods Enzymol. 185, 3-7; Sambrook, J.; Fritsch, E. F. 和Maniatis, T. (1989), Molecular cloning: a laboratory manual, 第2版, Cold Spring Harbor Laboratory Press, New York。
随后通过转化,将包含要扩增的基因或要灭活的基因的部分的载体(诸如表达载体、基因删除盒、基因插入盒或基因过表达盒)转移进希望的菌株中。用于转化的方法,尤其是电穿孔、乙酸锂介导的转化、冻融转化例如描述在下列文献中:在Gietz, R.D., Schiestl, R.H. (2007).Frozen competent yeast cells that can be transformed with high efficiency using the LiAc/SS carrier DNA/PEG method. Nat Protoc. 2:1-4; Suga, M., Hatakeyama, T. (2003). High-efficiency electroporation by freezing intact yeast cells with addition of calcium. Curr Genet. 43:206-211; Hubberstey, A.V., Wildeman, A.G. (1991). Transformation of Saccharomyces cerevisiae by use of frozen spheroplasts. Trends Genet. 7:41; Bröker, M. (1993). Rapid transformation of cryopreserved competent Schizosaccharomyces pombe cells. Biotechniques. 15:598 - 600; Gietz, R.D., Schiestl, R.H. (1989). High efficiency transformation of intact yeast cells using single stranded nucleic acids as a carrier. Curr Genet. 16:339-346中和在“Nonconventional yeast in biotechnology”(Klaus Wolf编, Springer-Verlag Berlin, 1996)中。转化以后,载体、尤其是基因删除盒、基因插入盒或基因过表达盒借助于交换(“crossover”-Ereigniss)通过同源的或异源的、优选通过同源的重组整合进希望的菌株的染色体中。作为一个替代方案,载体、尤其是表达载体也可以是附加型,即作为独立的复制单元在希望的菌株的细胞中复制。在所有情况下,这样来确保载体(诸如表达载体、基因删除盒、基因插入盒或基因过表达盒)在细胞分裂时也传递给子细胞。
在上文中和在下文中使用的措辞“与它的野生型相比增加的酶Ex的活性”优选总是理解为,至少1.5、特别优选至少10、此外优选至少100、此外还更优选至少1000和最优选至少10 000的因数的增加的各种酶Ex的活性。此外,根据本发明的具有“与它的野生型相比增加的酶Ex的活性”的细胞尤其也包括这样的细胞:其野生型不具有或至少不具有可检测的酶Ex的活性,且其仅在增加酶的活性(例如通过过表达)以后才显示出可检测的酶Ex的活性。在这种情况下,术语“过表达”或在下文中使用的措辞“表达的增加”也包括这样的情况:其中起始细胞(例如野生型细胞)不具有或至少不具有可检测的表达,且仅通过重组方法来诱导可检测的酶Ex的合成。
与此相应的,使用的措辞“减少的酶Ex的活性”优选理解为,至少0.5、特别优选至少0.1、此外优选至少0.01、此外还更优选至少0.001和最优选至少0.0001的因数的减少的活性。措辞“减少的活性”也包括不可检测的活性(“零活性”)。某种酶的活性的减少例如可以通过定向突变,或通过其它的本领域技术人员已知的用于减少某种酶的活性的措施来进行。
减少微生物中的酶的活性的方法是本领域技术人员已知的。
分子生物学的技术在这里尤其适合。本领域技术人员在WO91/006660和WO03/100013中,可以找到用于修饰和减少蛋白质表达以及由此随之出现的酶的活性的减少(特别对于假丝酵母属)、尤其是用于中断某种基因的说明。
根据本发明优选的细胞的特征在于,通过修饰包含上述核酸序列之一的基因来实现酶的活性的减少,其中所述修饰选自优选由下述修饰组成的组:在所述基因中插入外源DNA、所述基因的至少部分删除、所述基因序列中的点突变、RNA干扰(siRNA)、反义RNA或侧接所述基因的调节序列的修饰(插入、删除或点突变)。
在这种情况下,外源DNA可理解为,对于所述基因而言(并非对于所述生物体而言)是“外来”的任意DNA序列,换而言之,在这种情况下Candida-bombicola-内源DNA序列也可以作为“外源DNA”起作用。
在这种情况下,特别优选的是,将所述基因通过选择标记基因的插入来中断,因而所述外源DNA是选择标记基因,其中所述插入优选通过同源重组进基因座中来进行。
根据本发明优选的细胞的特征在于,它们已经被至少一种在下文中所述的根据本发明的核酸和/或在下文中所述的根据本发明的载体转化。
根据本发明的细胞可以有利地用于生产槐糖脂。
因而,本发明的另一个主题是,根据本发明的细胞用于生产通式(Ia)和(Ib)的化合物的用途,
其中
R1 = H或CO-CH3
R2 = H或CO-CH3
R3 =含有6-32个碳原子的,优选7-19个碳原子的,未取代或具有羟基官能取代的,无分支的,任选包含1-3个双键或三键的二价有机残基,
R4 =H、CH3或包含2-10个碳原子的,未取代或具有羟基官能取代的,无分支的,任选包含1-3个双键或三键的一价有机残基,
n = 0或1,
用于生产尤其是这样的通式(Ia)和(Ib)的化合物的用途,
其中
R1 = H或CO-CH3
R2 = H或CO-CH3
R3 =含有6-32个碳原子的,优选7-19个碳原子的,未取代或具有羟基官能取代的,无分支的,任选包含1-3个双键或三键的二价有机残基,
R4 = H、CH3或C9H19,且
n = 0或1,
和用于生产最特别优选的通式(Ia)和(Ib)的化合物的用途,
其中
R1 = H或CO-CH3
R2 = H或CO-CH3
R3 =含有6-32个碳原子的,优选7-19个碳原子的,未取代或具有羟基官能取代的,无分支的,任选包含1-3个双键或三键的二价有机残基,尤其是C8H15=C7H14
R4 = H、CH3或C9H19,尤其是H或CH3,且
n = 1。
本发明的另一个主题是用于生产槐糖脂、优选通式(Ia)和(Ib)的化合物的方法,
其中
R1 = H或CO-CH3
R2 = H或CO-CH3
R3 =含有6-32个碳原子的,优选7-19个碳原子的,未取代或具有羟基官能取代的,无分支的,任选包含1-3个双键或三键的二价有机残基,
R4 = H、CH3或包含2-10个碳原子的,未取代或具有羟基官能取代的,无分支的,任选包含1-3个双键或三键的一价有机残基,且
n = 0或1,
用于生产尤其是这样的通式(Ia)和(Ib)的化合物的方法,
其中
R1 = H或CO-CH3
R2 = H或CO-CH3
R3 =含有6-32个碳原子的,优选7-19个碳原子的,未取代或具有羟基官能取代的,无分支的,任选包含1-3个双键或三键的二价有机残基,
R4 = H、CH3或C9H19,且
n = 0或1,
和用于生产最特别优选的通式(Ia)和(Ib)的化合物的方法,
其中
R1 = H或CO-CH3
R2 = H或CO-CH3
R3 =含有6-32个碳原子的,优选7-19个碳原子的,未取代或具有羟基官能取代的,无分支的,任选包含1-3个双键或三键的二价有机残基,尤其是C8H15=C7H14
R4 = H、CH3或C9H19,尤其是H或CH3,且
n = 1,
所述方法包括下述工艺步骤:
I) 使根据本发明的细胞与包含碳源的培养基接触,
II) 在使所述细胞能够从所述碳源形成槐糖脂的条件下,培养所述细胞,且
III) 任选分离形成的槐糖脂。
为了生产上述产物的目的,可以连续地或不连续地以分批方法(分批培养)或分批补料方法(进料法(Zulaufverfahren))或重复分批补料法(重复进料法(repetitives Zulaufverfahren))使根据本发明的遗传改变的细胞与营养培养基接触,并从而进行培养。如在GB-A-1009370中描述的半连续方法也可行的。已知的培养方法的综述,可以参见:Chmiel的教科书(“Bioprozesstechnik 1. Einführung in die Bioverfahrenstechnik”(Gustav Fischer Verlag, Stuttgart, 1991))或Storhas的教科书(“Bioreaktoren und periphere Einrichtungen”, Vieweg Verlag, Brunswick/Wiesbaden, 1994)。
要使用的培养基必须以合适的方式满足各菌株的要求。在“Nonconventional yeast in biotechnology”中(Klaus Wolf编, Springer-Verlag Berlin, 1996)含有用于不同酵母菌株的培养基的描述。
作为碳源可以使用的例如是:碳水化合物例如葡萄糖、蔗糖、阿拉伯糖、木糖、乳糖、果糖、麦芽糖、糖蜜、淀粉、纤维素和半纤维素,植物油和动物油和脂肪例如豆油、红花油、花生油、大麻油、麻风树油、椰子脂、南瓜子油、亚麻子油、玉米油、罂粟子油、月见草油、橄榄油、棕榈核油、棕榈油、菜籽油、芝麻油、葵花籽油、葡萄籽油、核桃油、小麦胚芽油和椰子油,脂肪酸例如辛酸、癸酸、月桂酸、肉豆蔻酸、棕榈酸、棕榈油酸、硬脂酸、花生四烯酸、山萮酸、油酸、亚油酸、亚麻酸、γ-亚麻酸和它们的甲基-或乙基酯,和脂肪酸混合物,具有前述脂肪酸的甘油单酯、甘油二酯和甘油三酯,醇类例如甘油、乙醇和甲醇,烃类诸如甲烷、含碳的气体和气体混合物,诸如CO、CO2、合成气、烟道气,氨基酸诸如L-谷氨酸盐或L-缬氨酸,或有机酸例如醋酸。这些物质可以单独地或作为混合物使用。特别优选使用碳水化合物、尤其是单糖、寡糖或多糖作为碳源(如在US 6,01,494和US 6,136,576中所述),和烃类、尤其是烷烃、烯烃和炔烃和它们衍生的单羧酸和由这些单羧酸衍生的甘油单酯、甘油二酯和甘油三酯以及甘油和乙酸酯。最特别优选的是包含甘油与辛酸、癸酸、月桂酸、肉豆蔻酸、棕榈酸、棕榈油酸、硬脂酸、花生四烯酸、山萮酸、油酸、亚油酸、亚麻酸和/或γ-亚油酸的酯化产物的甘油单酯、甘油二酯和甘油三酯。
作为氮源可以使用的是包含氮的有机化合物,诸如蛋白胨、酵母萃取物、肉膏、麦芽膏、玉米浆、大豆粉和尿素,或无机化合物诸如硫酸铵、氯化铵、磷酸铵、碳酸铵和硝酸铵、氨、氢氧化铵或氨水。氮源可以单独地或作为混合物使用。
作为磷源可以使用:磷酸、磷酸二氢钾或磷酸氢二钾或相应的含钠盐。此外,所述培养基必须包含所生长必需的金属盐,例如,硫酸镁或硫酸铁。最后,除了上述物质以外,可以使用必需的生长物质诸如氨基酸和维生素。此外,可以将合适的前体加入培养基中。可以将所述的原料作为单批加入培养物中,或在培养过程中以合适的方式补料。
以合适的方式使用碱性化合物(诸如氢氧化钠、氢氧化钾、氨或氨水)或酸性化合物(诸如磷酸和硫酸)以控制培养物的pH。使用消泡剂(例如,脂肪酸聚乙二醇酯)以控制泡沫产生。为了维持质粒的稳定性,可以向培养基中加入合适的选择物质,例如,抗生素。为了维持好氧条件,将氧或含氧的气体混合物(例如,空气)导入培养物中。
培养物的温度通常是高于20℃,优选高于25℃,它也可以是高于40℃,其中不超过95℃、特别优选不超过90℃和最优选不超过80℃的培养温度是有利的。
在根据本发明的方法的步骤III)中,可以任选地从细胞和/或营养培养基中分离出通过细胞形成的槐糖脂,其中本领域技术人员已知的从复杂组合物中分离出低分子量物质的所有方法均适用于分离,例如,过滤、萃取、吸附(色谱法)或结晶。通常,取决于产物形式进行槐糖脂的后处理。在槐糖脂以水不溶性的内酯形式存在的情况下,适合的是下述方法:从水相中分离内酯形式的产物通过离心进行。
另外,产物相包括生物质和不同杂质的残余物,诸如油、脂肪酸和其它营养培养基组分。例如,通过使用合适的溶剂萃取,可以有利地借助于有机溶剂分离油残余物。烷烃(例如,正己烷)是优选的溶剂。从水相分离产物可以例如使用合适的酯,例如借助于乙酸乙酯进行。上述萃取步骤可以以任意次序进行。
或者,从营养培养基中分离槐糖脂可以通过将内酯形式转化成水溶性的开链酸形式进行。例如,借助于水解,有利地通过碱性水解,进行向开链酸形式的转化。此后,将开链的槐糖脂溶解在酸的水溶液(例如硫酸的水溶液)中,以便分离在溶液中可能已经形成的盐。借助于萃取,进行产物的进一步纯化。在这里,优选使用溶剂,尤其是有机溶剂。正戊醇是优选的溶剂。为了去除溶剂,例如进行蒸馏。此后,例如借助于色谱法,可以进一步纯化低压冻干的产物。在这里可以提及的实例是:借助于合适的溶剂进行沉淀、借助于合适的溶剂进行萃取、络合(例如借助于环糊精或环糊精衍生物)、结晶、借助于色谱法进行纯化或分离或者将槐糖脂转化成可以容易分离的衍生物。
用根据本发明的方法生产的槐糖脂可以有利地用于清洁剂、化妆品制剂或药物制剂以及作物保护制剂中。
因而,本发明的另一个主题是,用根据本发明的方法得到的槐糖脂制备化妆品制剂、皮肤病学制剂或药物制剂、作物保护制剂和护理剂及清洁剂和表面活性剂浓缩物的用途。
术语“护理剂”在这里理解为,满足下述目的的制剂:使物品保持在它的初始形式,减少或避免外部影响(例如时间、光、温度、压力、污损、与所述物品接触的其它反应性化合物的化学反应)的作用(例如,老化、污损、材料疲劳、漂白),或甚至改善对象的希望的有利的性质。对于最后一点,可以提及例如,被考虑的对象的改善的毛发光泽或更大的弹性。
“作物保护制剂”应当理解为这样的制剂,其明显地用于保护植物的制剂,这取决于它们的制品的性质;如果在所述制剂中包含至少一种选自下述类别的化合物:除草剂、杀真菌剂、杀虫剂、杀螨剂、杀线虫剂、驱鸟剂、植物营养物和土壤改良剂,则其尤其属于这种情况。
根据本发明,优选将根据本发明的方法制备的槐糖脂用于家务、工业、尤其是用于硬表面、皮革或纺织品的护理剂及清洁剂中。
一种分离的DNA为达成该目的作出了贡献,所述DNA选自下述序列:
A1a) 根据Seq ID Nr. 2、Seq ID Nr. 52或Seq ID Nr. 54,尤其是Seq ID Nr. 2的序列,其中该序列编码能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的蛋白质,
B1a) 一种不含内含子的序列,其源自根据A1a)的序列,且其与根据Seq ID Nr. 2、Seq ID Nr. 52或Seq ID Nr. 54,尤其是根据Seq ID Nr. 2的序列编码同样的蛋白质或肽,
C1a) 一种序列,其编码蛋白质或肽,所述蛋白质或肽包含根据Seq ID Nr. 7、Seq ID Nr. 53或Seq ID Nr. 55,尤其是Seq ID Nr. 7的氨基酸序列,且优选能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸,
D1a) 一种序列,其与根据组A1a)至C1a)中的任一组的、特别优选根据组A1a)的序列具有至少80%、特别优选至少90%、更优选至少95%和最优选至少99%相同,其中该序列优选编码能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的蛋白质或肽,
E1a) 一种序列,其与根据组A1a)至D1a)中的任一组的、特别优选根据组A1a)的序列的反链杂交,或在考虑遗传密码的简并性的情况下与所述反链杂交,其中所述序列优选编码能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的蛋白质或肽,
F1a) 根据组A1a)至E1a)中的任一组的、特别优选根据组A1a)的序列的衍生物,所述衍生物通过置换、加成、反转和/或删除至少1个碱基、优选至少2个碱基、更优选至少5个碱基和最优选至少10个碱基、但是优选不多于100个碱基、特别优选不多于50个碱基和最优选不多于25个碱基而得到,其中该衍生物优选编码能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的蛋白质或肽,和
G1a) 与根据组A1a)至F1a)中的任一组的、特别优选根据组A1a)的序列互补的序列。
一种分离的DNA为达成该目的作出了进一步的贡献,所述DNA选自下述序列:A1b) 根据Seq ID Nr. 56、Seq ID Nr. 58或Seq ID Nr. 60的序列,其中该序列编码能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的蛋白质,
B1b) 一种不含内含子的序列,其源自根据A1b)的序列,且其与根据Seq ID Nr. 56、Seq ID Nr. 58或Seq ID Nr. 60的序列编码同样的蛋白质或肽,
C1b) 一种序列,其编码蛋白质或肽,所述蛋白质或肽包含根据Seq ID Nr. 57、Seq ID Nr. 59或Seq ID Nr. 61的氨基酸序列,且所述蛋白质或肽优选能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸,
D1b) 一种序列,其与根据组A1b)至C1b)中的任一组的、特别优选根据组A1b)的序列至少80%、特别优选至少86%、更优选至少95%和最优选至少99% 相同,其中该序列优选编码能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的蛋白质或肽,
E1b) 一种序列,其与根据组A1b)至D1b)中的任一组的、特别优选根据组A1b)的序列的反链杂交,或在考虑遗传密码的简并性的情况下与所述反链杂交,其中该序列优选编码能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的蛋白质或肽,
F1b) 根据组A1b)至E1b)中的任一组的、特别优选根据组A1b)的序列的衍生物,所述衍生物通过置换、加成、反转和/或删除至少1个碱基、优选至少2个碱基、更优选至少5个碱基和最优选至少10个碱基、但是优选不多于100个碱基、特别优选不多于50个碱基和最优选不多于25个碱基而得到,其中该衍生物优选编码能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的蛋白质或肽,和
G1b) 与根据组A1b)至F1b)中的任一组的、特别优选根据组A1b)的序列互补的序列。
一种分离的DNA为达成该目的作出了进一步的贡献,所述DNA选自下述序列:A1c) 根据Seq ID Nr. 62的序列,其中该序列编码能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的蛋白质,
B1c) 一种不含内含子的序列,其源自根据A1c)的序列,且其与根据Seq ID Nr. 62的序列编码同样的蛋白质或肽,
C1c) 一种序列,其编码蛋白质或肽,所述蛋白质或肽包含根据Seq ID Nr. 63的氨基酸序列,且所述蛋白质或肽优选能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸,
D1c) 一种序列,其与根据组A1c)至C1c)中的任一组的、特别优选根据组A1c)的序列至少60%、特别优选至少85%、更优选至少90%和最优选至少99% 相同,其中该序列优选编码能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的蛋白质或肽,
E1c) 一种序列,其与根据组A1c)至D1c)中的任一组的、特别优选根据组A1c)的序列的反链杂交,或在考虑遗传密码的简并性的情况下与所述反链杂交,其中该序列优选编码能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的蛋白质或肽,
F1c) 根据组A1c)至E1c)中的任一组的、特别优选根据组A1c)的序列的衍生物,所述衍生物通过置换、加成、反转和/或删除至少1个碱基、优选至少2个碱基、更优选至少5个碱基和最优选至少10个碱基、但是优选不多于100个碱基、特别优选不多于50个碱基和最优选不多于25个碱基而得到,其中该衍生物优选编码能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的蛋白质或肽,和
G1c) 与根据组A1c)至F1c)中的任一组的、特别优选根据组A1c)的序列互补的序列。
本发明的另一个主题是,一种分离的DNA,所述DNA选自下述序列:
A2) 根据Seq ID Nr. 3的序列,其中该序列编码能够将UDP-葡萄糖和17-羟基-Z-9-十八碳烯酸转化成17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸的蛋白质,
B2) 一种不含内含子的序列,其源自根据A2)的序列,且其与根据Seq ID Nr. 3的序列编码同样的蛋白质或肽,
C2) 一种序列,其编码蛋白质或肽,所述蛋白质或肽包含根据Seq ID Nr. 8的氨基酸序列,且其优选能够将UDP-葡萄糖和17-羟基-Z-9-十八碳烯酸转化成17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸,
D2) 一种序列,其与根据组A2)至C2)中的任一组、特别优选根据组A2)的序列至少80%、特别优选至少90%、更优选至少95%和最优选至少99%相同,其中该序列优选编码能够将UDP-葡萄糖和17-羟基-Z-9-十八碳烯酸转化成17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸的蛋白质或肽,
E2) 一种序列,其与根据组A2)至D2)中的任一组、特别优选根据组A2)的序列的反链杂交,或在考虑遗传密码的简并性的情况下与所述反链杂交,其中该序列优选编码能够将UDP-葡萄糖和17-羟基-Z-9-十八碳烯酸转化成17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸的蛋白质或肽,
F2) 根据组A2)至E2)中的任一组的、特别优选根据组A2)的序列的衍生物,所述衍生物通过置换、加成、反转和/或删除至少1个碱基、优选至少2个碱基、更优选至少5个碱基和最优选至少10个碱基、但是优选不多于100个碱基、特别优选不多于50个碱基和最优选不多于25个碱基而得到,其中该衍生物优选编码能够将UDP-葡萄糖和17-羟基-Z-9-十八碳烯酸转化成17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸的蛋白质或肽,和
G2) 与根据组A2)至F2)中的任一组的、特别优选根据组A2)的序列互补的序列。
本发明的另一个主题是,一种分离的DNA,所述DNA选自下述序列:
A3) 根据Seq ID Nr. 4的序列,其中该序列编码
能够将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯单乙酸酯,
或能够将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯单乙酸酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯二乙酸酯,
或能够将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯二乙酸酯的蛋白质,其中第一个选项是优选的,
B3) 一种不含内含子的序列,其源自根据A3)的序列,且其与根据Seq ID Nr. 4的序列编码同样的蛋白质或肽,
C3) 一种序列,其编码蛋白质或肽,所述蛋白质或肽包含根据Seq ID Nr. 9的氨基酸序列,且其优选
能够将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯单乙酸酯,
或能够将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯单乙酸酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯二乙酸酯,
或能够将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯二乙酸酯的蛋白质或肽,其中第一个选项是优选的,
D3) 一种序列,其与根据组A3)至C3)中的任一组、特别优选根据组A3)的序列至少80%、特别优选至少90%、更优选至少95%和最优选至少99%相同,其中该序列优选编码
能够将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯单乙酸酯,
或能够将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯单乙酸酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯二乙酸酯,
或能够将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯二乙酸酯的蛋白质或肽,其中第一个选项是优选的,
E3) 一种序列,其与根据组A3)至D3)中的任一组、特别优选根据组A3)的序列的反链杂交,或在考虑遗传密码的简并性的情况下与所述反链杂交,其中该序列优选编码
能够将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯单乙酸酯,
或能够将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯单乙酸酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯二乙酸酯,
或能够将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯二乙酸酯的蛋白质或肽,其中第一个选项是优选的,
F3) 根据组A3)至E3)中的任一组的、特别优选根据组A3)的序列的衍生物,所述衍生物通过置换、加成、反转和/或删除至少1个碱基、优选至少2个碱基、更优选至少5个碱基和最优选至少10个碱基、但是优选不多于100个碱基、特别优选不多于50个碱基和最优选不多于25个碱基而得到,其中该衍生物优选编码
能够将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯单乙酸酯,
或能够将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯单乙酸酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯二乙酸酯,
或能够将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯二乙酸酯的蛋白质或肽,其中第一个选项是优选的,和
G3) 与根据组A3)至F3)中的任一组的、特别优选根据组A3)的序列互补的序列。
本发明的另一个主题是,一种分离的DNA,所述DNA选自下述序列:
A4) 根据Seq ID Nr. 5的序列,其中该序列编码能够将槐糖脂从细胞转移到周围的培养基中的蛋白质,
B4) 一种不含内含子的序列,其源自根据A4)的序列,且其与根据Seq ID Nr. 5的序列编码同样的蛋白质或肽,
C4) 一种序列,其编码蛋白质或肽,所述蛋白质或肽包含根据Seq ID Nr. 10的氨基酸序列,且其优选能够将槐糖脂从细胞转移到周围的培养基中,
D4) 一种序列,其与根据组A4)至C4)中的任一组的、特别优选根据组A4)的序列至少80%、特别优选至少90%、更优选至少95%和最优选至少99%相同,其中该序列优选编码能够将槐糖脂从细胞转移到周围的培养基中的蛋白质或肽,
E4) 一种序列,其与根据组A4)至D4)中的任一组的、特别优选根据组A4)的序列的反链杂交,或在考虑遗传密码的简并性的情况下与所述反链杂交,其中该序列优选编码能够将槐糖脂从细胞转移到周围的培养基中的蛋白质或肽,
F4) 根据组A4)至E4)中的任一组的、特别优选根据组A4)的序列的衍生物,其已经通过置换、加成、反转和/或删除至少1个碱基、优选至少2个碱基、更优选至少5个碱基和最优选至少10个碱基、但是优选不多于100个碱基、特别优选不多于50个碱基和最优选不多于25个碱基而得到,其中该衍生物优选编码能够将槐糖脂从细胞转移到周围的培养基中的蛋白质或肽,和
G4) 与根据组A4)至F4)中的任一组的、特别优选根据组A4)的序列互补的序列。
本发明的另一个主题是,一种分离的DNA,所述DNA选自下述序列:
A5) 根据Seq ID Nr. 6的序列,其中该序列编码能够将UDP-葡萄糖和17-羟基-Z-9-十八碳烯酸转化成17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸,或将17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸和UDP-葡萄糖转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸的蛋白质,其中后一种选项是优选的,
B5) 一种不含内含子的序列,其源自根据A5)的序列,且其与根据Seq ID Nr. 6的序列编码相同的蛋白质或肽,
C5) 一种序列,其编码蛋白质或肽,所述蛋白质或肽包含根据Seq ID Nr. 11的氨基酸序列,且其优选能够将UDP-葡萄糖和17-羟基-Z-9-十八碳烯酸转化成17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸,或将17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸和UDP-葡萄糖转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸,其中后一种选项是优选的,
D5) 一种序列,其与根据组A5)至C5)中的任一组的、特别优选根据组A5)的序列至少80%、特别优选至少90%、更优选至少95%和最优选至少99%相同,其中该序列优选编码能够将UDP-葡萄糖和17-羟基-Z-9-十八碳烯酸转化成17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸,或将17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸和UDP-葡萄糖转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸的蛋白质或肽,其中后一种选项是优选的,
E5) 一种序列,其与根据组A5)至D5)中的任一组的、特别优选根据组A5)的序列的反链杂交,或在考虑遗传密码的简并性的情况下与所述反链杂交,其中该序列优选编码能够将UDP-葡萄糖和17-羟基-Z-9-十八碳烯酸转化成17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸,或将17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸和UDP-葡萄糖转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸的蛋白质或肽,其中后一种选项是优选的,
F5) 根据组A5)至E5)中的任一组的、特别优选根据组A5)的序列的衍生物,其已经通过置换、加成、反转和/或删除至少1个碱基、优选至少2个碱基、更优选至少5个碱基和最优选至少10个碱基、但是优选不多于100个碱基、特别优选不多于50个碱基和最优选不多于25个碱基而得到,其中该衍生物优选编码能够将UDP-葡萄糖和17-羟基-Z-9-十八碳烯酸转化成17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸,或将17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸和UDP-葡萄糖转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸的蛋白质或肽,其中后一种选项是优选的,和
G5) 与根据组A5)至F5)中的任一组的、特别优选根据组A5)的序列互补的序列。
在这里,借助于已知的方法来测定“核苷酸一致性”或“氨基酸一致性”。一般而言,考虑到具体要求,使用具有算法的特殊的计算机程序。
优选的测定一致性的方法首先在要对比的序列之间产生最大匹配。用于测定一致性的计算机程序包括、但不限于于此:GCG软件包,包括GAP(Deveroy, J. 等人, Nucleic Acid Research 12(1984), 第387页, Genetics Computer Group University of Wisconsin, Medicine(Wi),和BLASTP、BLASTN和FASTA(Altschul, S. 等人, Journal of Molecular Biology 215(1990), 第403-410页)。BLAST程序可以从国家生物技术信息中心(NCBI)和其它来源(BLAST Handbuch, Altschul S. 等人, NCBI NLM NIH Bethesda ND 22894; Altschul S. 等人, 出处同上)得到。
同样地,已知的Smith-Waterman算法可以用于测定核苷酸一致性。
在使用BLASTN程序(Altschul, S. 等人, Journal of Molecular Biology 215(1990), 第403-410页)时,用于测定“核苷酸一致性”的优选参数是:
期望阈值: 10
字长: 28
匹配评分: 1
错配评分: -2
空位权重(Gap Costs): 线性的
上述参数是用于核苷酸序列比对的默认参数。
GAP程序同样适合使用上述参数。
在使用BLASTP程序(Altschul, S. 等人, Journal of Molecular Biology 215(1990), 第403-410页)时,用于测定“氨基酸一致性”的优选参数是:
期望阈值: 10
字长: 3
矩阵: BLOSUM62
空位权重: 存在: 11;延伸: 1
组合调整(Compositional adjustments): 有条件的组合评分矩阵调整
上述参数是用于对比氨基酸序列的默认参数。
GAP程序同样适合使用上述参数。
根据上述算法,80%的一致性意味着与本发明80%的一致性。同样适用于更高的一致性。
所述特征“与某一序列的反链杂交或在考虑遗传密码的简并性的情况下与所述反链杂交的序列”指,在优选严格的条件下,与参照序列的反链杂交或在考虑遗传密码的简并性的条件下与所述反链杂交的序列。例如,所述杂交可以在2 x SSC中在68℃进行,或根据Boehringer公司(Mannheim)的地高辛标记试剂盒的方案进行。优选的杂交条件是,例如,在7% SDS、1% BSA、1 mM EDTA、250 mM磷酸钠缓冲液(pH 7.2)中在65℃温育过夜,随后用2 x SSC、0.1% SDS在65℃洗涤。
根据本发明的分离的DNA的衍生物,尤其包括这样的序列:所述序列在它们编码的蛋白质中,产生保守的氨基酸置换,例如,用甘氨酸置换丙氨酸,或用天冬氨酸置换谷氨酸,所述衍生物可以根据替代的F1a)、F1b)、F1b)、F1c)、F2)、F3)、F4)或F5),通过将根据组A1a)至E1a)、A1b)至E1b)、A1c)至E1c)、A2)至E2)、A3)至E3)、A4)至E4)和A5)至E5)中的任一组的一个序列置换、添加、反转和/或删除一个或多个碱基来得到。这样的功能中性的突变称作有义突变(sense mutation),且不会导致多肽活性的根本性变化。此外已知,多肽的N-和/或C-端末端的变化对它的功能没有显著损害,或者甚至能够使它稳定化,因此相应地,本发明也包括这样的DNA序列:其中在具有根据本发明的核酸的序列的3’-末端处或5’-末端处添加碱基。为此,本领域技术人员尤其可以在下述文献中找到说明:Ben-Bassat等人(Journal of Bacteriology 169:751-757(1987)),O'Regan等人(Gene 77:237-251(1989)),Sahin-Toth等人(Protein Sciences 3:240-247(1994)),Hochuli等人(Bio/Technology 6:1321-1325(1988)),以及已知的遗传学和分子生物学的教科书。
此外,载体,优选表达载体、基因删除盒、基因插入盒或基因过表达盒,为达成开始所述的目的作出了贡献,所述载体包括这样的DNA,所述DNA具有根据上文定义的组A1a)至G1a)、A1b)至G1b)、A1c)至G1c)、A2)至G2)、A3)至G3)、A4)至G4)和A5)至G5)中的任一组的序列。作为载体,本领域技术人员已知的所有通常用于将DNA导入宿主细胞中的载体均是合适的。这些载体不仅能够自主复制(因为它们具有复制起点,例如2µ质粒或ARS(自主复制序列)的复制起点),而且也能够整合进染色体中(非复制质粒)。载体也被理解为绝对没有复制起点的线性DNA片段,例如,基因删除盒、基因插入盒或基因过表达盒。基因删除盒通常由选择标志物和与要删除的区域侧接的DNA片段组成。基因插入盒通常由标志物和要灭活的基因的片段组成,基因过表达盒通常由标志物、要过表达的基因和与所述基因的表达有关的调节区(例如,启动子和终止子)组成。优选的载体选自质粒和盒,例如,大肠杆菌-酵母穿梭质粒;特别优选的是表达载体、基因删除盒、基因插入盒或基因过表达盒,尤其是在下文中所述的具有Seq ID Nr. 12、Seq ID Nr. 13、Seq ID Nr. 14、Seq ID Nr. 15和Seq ID Nr. 16的基因删除盒和具有Seq ID Nr. 70、Seq ID Nr. 71、Seq ID Nr. 72、Seq ID Nr. 73和Seq ID Nr. 74的表达盒。按照根据本发明的载体的一个优选实施方式,具有根据组A1)至F5)中的任一组的序列的DNA处在组成型启动子或可调节启动子的控制下,所述启动子适用于在微生物细胞中表达由这些DNA序列编码的多肽,所述微生物细胞优选是细菌细胞、酵母菌细胞或真菌细胞,特别优选酵母菌细胞、最优选假丝酵母菌 -Candida bogoriensis- Candida batistae-Candida apicola-或拟威克酵母细胞。这样的组成型启动子的实例是,例如,TSC3启动子、ENO1启动子、FBA1启动子、GPD启动子、GPM启动子、FBA1启动子、ICL1启动子或ACT1启动子。这样的可调节启动子的实例是,例如,GAL1启动子、GAL2启动子、GAL7启动子、MEL1启动子、GAL10启动子、SBG1启动子、SBG2启动子、SBG3启动子、SBG4启动子、SBG5启动子或MAL2启动子。
除了启动子以外,根据本发明的载体应当优选包含核糖体结合位点和终止子。在这种情况下,特别优选的是,根据本发明的DNA嵌入包含启动子、核糖体结合位点和终止子的载体的表达盒中。除了上述结构元件以外,所述载体可以进一步包含本领域技术人员已知的选择标记基因。
在实施例中描述的核酸Seq ID Nr. 12、Seq ID Nr. 13、Seq ID Nr. 14、Seq ID Nr. 15、Seq ID Nr. 16、IntEx-CbSBG1(Seq ID Nr. 70)、IntEx-CbSBG2(Seq ID Nr. 71)、IntEx-CbSBG3(Seq ID Nr. 72)、IntEx-CbSBG4(Seq ID Nr. 73)和IntEx-CbSBG5(Seq ID Nr. 74),是根据本发明优选的载体。
新的酶E1至E5对达成所述目的作出了进一步的贡献。
因而,本发明的另一个主题是,一种分离的多肽,其选自:
酶E1,该酶具有多肽序列Seq ID Nr. 7、Seq ID Nr. 53、Seq ID Nr. 55、Seq ID Nr. 57、Seq ID Nr. 59、Seq ID Nr. 61或Seq ID Nr. 63、特别是Seq ID Nr. 7,或具有这样的多肽序列:其中相对于各参照序列Seq ID Nr. 7、Seq ID Nr. 53、Seq ID Nr. 55、Seq ID Nr. 57、Seq ID Nr. 59、Seq ID Nr. 61或Seq ID Nr. 63、特别是Seq ID Nr. 7,最多25%、优选最多20%、特别优选最多15%、尤其是最多10、9、8、7、6、5、4、3、2、1%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有各个参照序列的酶的至少50%、优选65%、特别优选80%、尤其是多于90%的酶的活性,其中酶E1的酶的活性理解为,将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的能力,
酶E2,该酶具有多肽序列Seq ID Nr. 8或Seq ID Nr. 11,或具有这样的多肽序列:其中相对于Seq ID Nr. 8或Seq ID Nr. 11,最多60%、优选最多25%、特别优选最多15%、尤其是最多10、9、8、7、6、5、4、3、2、1%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有各个参照序列Nr. 8或11的酶的至少50%、优选65%、特别优选80%、尤其是多于90%的酶的活性,其中酶E2的酶的活性理解为,将UDP-葡萄糖和17-羟基-Z-9-十八碳烯酸转化成17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸的能力,
酶E3,该酶具有多肽序列Seq ID Nr. 11,或具有这样的多肽序列:其中相对于Seq ID Nr. 11,最多60%、优选最多25%、特别优选最多15%、尤其是最多10、9、8、7、6、5、4、3、2、1%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有参照序列Seq ID Nr. 11的酶的至少50%、优选65%、特别优选80%、尤其是超过90%的酶的活性,其中酶E3的酶的活性理解为,将17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸和UDP-葡萄糖转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸的能力,
酶E4,该酶具有多肽序列Seq ID Nr. 9,或具有这样的多肽序列:其中相对于Seq ID Nr. 9,最多25%、特别优选最多15%、尤其是最多10、9、8、7、6、5、4、3、2、1%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有Seq ID Nr. 9的酶的少50%、优选65%、特别优选80%、尤其是超过90%的酶的活性,其中酶E4的酶的活性理解为,
将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯单乙酸酯,
或将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯单乙酸酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯二乙酸酯,
或将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯二乙酸酯的能力,其中第一个选项是优选的,
酶E5,该酶具有多肽序列Seq ID Nr. 10,或具有这样的多肽序列:其中相对于Seq ID Nr. 10,最多45%、优选最多25%、特别优选最多15%、尤其是最多10、9、8、7、6、5、4、3、2、1%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有Seq ID Nr. 10的酶的至少50%、优选65%、特别优选80%、尤其是超过90%的酶的活性,其中酶E5的酶的活性理解为,将槐糖脂从细胞转移到周围的培养基中的能力。
在下文给出的实施例中,示例性地描述了本发明,而不应将由整个说明书和权利要求中得到的本发明的应用范围限制在实施例中提及的实施方式中。
下述附图是实施例的组成部分:
图1: 17-L-[(6'-O-乙酰基-2'-O-β-D-吡喃葡萄糖基-6''-O-乙酰基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八烯4''-O-内酯的精确的质量迹线
图2: 17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八烯4''-O-内酯的精确质量迹线。
实施例:
实施例1: 假丝酵母菌ATCC 22214的尿嘧啶-营养缺陷型突变体的生成
如上文所述(van Bogaert等人Yeast. 2007. 24(3):201-8),生成 丝酵母菌ATCC 22214的尿嘧啶-营养缺陷型突变体。该菌株被命名为C. bombicola ATCC 22214 ura-
实施例2: 在假丝酵母菌ATCC 22214中参与槐糖脂生物合成的酶的结构基因的灭活
为了能够鉴别参与槐糖脂生物合成的酶,首先借助于GLS Flex钛技术,对假丝酵母菌ATCC 22214的基因组进行测序。在检查假丝酵母菌ATCC 22214的遗传信息时,鉴别出一簇5个基因(Seq ID Nr. 01),它们的编码区(Seq ID Nr. 02、Seq ID Nr. 03、Seq ID Nr. 04、Seq ID Nr. 05、Seq ID Nr. 06)编码基因产物(Seq ID Nr. 07、Seq ID Nr. 08、Seq ID Nr. 09、Seq ID Nr. 10、Seq ID Nr. 11)。
所述5个基因被命名为SBG1(Seq ID Nr. 02)、SBG2(Seq ID Nr. 03)、SBG3(Seq ID Nr. 04)、SBG4(Seq ID Nr. 05)和SBG5(Seq ID Nr. 06)(SBG代表槐糖脂生物合成基因)。
它们编码下述蛋白质:Sbg1p(Seq ID Nr. 07)、Sbg2p(Seq ID Nr. 08)、Sbg3p(Seq ID Nr. 09)、Sbg4p(Seq ID Nr. 10)和Sbg5p(Seq ID Nr. 11)。
表1:Sbg1p、Sbg2p、Sbg3p、Sbg4p和Sbg5p和它们在槐糖脂的生物合成和输出中的功能。
Seq ID Nr. 蛋白质 PFAM 域 NCBI 保守域 功能
07 Sbg1p P450 (PFAM PF00067) cytochrome P450 脂肪酸[ω,ω-1, ω-2, ω-3]-羟基化的单加氧酶
08 Sbg2p UDP 糖基转移酶 (PFAM PF00201) 糖基转移酶 UDP-葡萄糖: [ω,ω-1, ω-2, ω-3]-羟基脂肪酸葡萄糖转移酶
09 Sbg3p 麦芽糖 O-乙酰基转移酶 (PRK10092) 乙酰辅酶A: 槐糖脂乙酰基转移酶
10 Sbg4p ABC 运载体 (PFAM 00667) ABC transporter 槐糖脂输出蛋白
11 Sbg5p UDP 糖基转移酶 (PFAM PF00201) 糖基转移酶 UDP-葡萄糖: [ω,ω-1, ω-2, ω-3]-羟基脂肪酸葡萄糖转移酶;UDP-葡萄糖: [ω,ω-1, ω-2, ω-3]-(β-D-吡喃葡萄糖基)氧基脂肪酸葡萄糖基转移酶
将基因SBG1SBG2SBG3SBG4SBG5逐个灭活,并且表征与槐糖脂生物合成相关的相应突变体的表型。为了在C. bombicola ATCC 22214中构建相应的突变体,首先通过GeneArt AG(Regensburg)合成删除盒。这些删除盒(Seq ID Nr. 12、Seq ID Nr. 13、Seq ID Nr. 14、Seq ID Nr. 15、Seq ID Nr. 16)由前述的基因CbURA3(van Bogaert等人Yeast. 2007. 24(3):201-8)构成,所述基因编码C. bombicola ATCC 22214的乳清酸核苷-5-磷酸脱羧酶,且其在上游和下游分别侧接大约1000碱基对的要灭活的基因的侧接区。分别将loxP-基因座(其任选地通过暂时导入重组酶Cre编码基因及其功能性表达允许删除CbURA3基因)(关于综述,参见Kühn & Torres. Methods Mol Biol. 2002. 180:175-204)嵌入在侧接区和CbURA3基因之间。这里,构建对应于表2中所示数据的各个删除盒:
表2: 编码Sbg1p、Sbg2p、Sbg3p、Sbg4p和Sbg5p的C. bombicola ATCC 22214的结构基因的删除盒的结构。
Seq ID Nr. 基因 5’-侧接区 loxP-基因座1 CbURA3 loxP-基因座2 3’-侧接区
12 SBG1 1 - 1003 1004 - 1037 1038 - 3106 3107 - 3140 3141 - 4143
13 SBG2 1 - 0999 1000 - 1033 1034 - 3102 3103 - 3136 3137 - 4143
14 SBG3 1 - 1002 1003 - 1036 1037 - 3105 3106 - 3139 3140 - 4140
15 SBG4 1 - 0997 0998 - 1031 1032 -3100 3101 - 3134 3135 - 4130
16 SBG5 1 - 1002 1003 - 1036 1037 - 3105 3106 - 3139 3140 - 4141
为了以足够的量提供删除盒用于随后的C. bombicola ATCC 22214 ura- 的转化,通过PCR扩增所述删除盒。这里使用下述的寡核苷酸:
用于灭活CbSBG1的删除盒的扩增:
SBG1-fw: 5'- AAT TGT TCG ATG GAT AGC TTT GGA GTC -3'
(Seq ID Nr. 17)
SBG1-rv: 5'- TTC GGG GCT CCT GTC GTT GTC -3'
(Seq ID Nr. 18)
用于灭活CbSBG2的删除盒的扩增:
SBG2-fw: 5'- GAA ATC TGA TCA ATT CTG CAA ACC TG -3'
(Seq ID Nr. 19)
SBG2-rv: 5'- ATG ACT CCT AGA AAA GAA ATT GAC CAG -3'
(Seq ID Nr. 20)
用于灭活CbSBG3的删除盒的扩增:
SBG3-fw: 5'- TGC AGA CAA GTT CCT GCA GCT G -3'
(Seq ID Nr. 21)
SBG3-rv: 5'- ATG CTT TAT TCA GGC ACG CTA CG -3'
(Seq ID Nr. 22)
用于灭活CbSBG4的删除盒的扩增:
SBG4-fw: 5'- GGA TGA GTC GCA GTC ACG AAC -3'
(Seq ID Nr. 23)
SBG4-rv: 5'- TCA ATC ATT GGC TCA AGA CTA GGA AC -3'
(Seq ID Nr. 24)
用于灭活CbSBG5的删除盒的扩增:
SBG5-fw: 5'- ATT CTG GTG CTG ACC TCG CCA C -3'
(Seq ID Nr. 25)
SBG5-rv: 5'- ACT CAT GTC GTA CTT GCA AGA ACT G -3'
(Seq ID Nr. 26)。
将下述参数用于PCR:1 x:预变性,98℃,3 min;25 x:变性,98℃,0:10 min,退火,60℃,0:30 min;延伸,72℃,2:00 min;1 x:最后一次延伸,72℃,10 min。按照生产商的推荐,使用New England Biolabs(Frankfurt)的PhusionTM High-Fidelity Master Mix进行扩增。按照生产商的说明书,使用QIAquick PCR纯化试剂盒(Qiagen, Hilden)纯化PCR产物。PCR的操作、借助于琼脂糖凝胶电泳的成功的PCR扩增的验证、DNA的溴化乙锭染色、测定PCR片段大小、纯化PCR产物和测定DNA浓度,都以技术人员已知的方式进行。
如以前所述进行C. bombicola ATCC 22214 ura - 的转化(van Bogaert等人Yeast. 2008. 25:273-278; van Bogaert等人FEMS Yeast Res. 2009. 9:610-617)。
为了验证在用CbSBG1(Seq ID Nr. 12)、CbSBG2(Seq ID Nr. 13)、CbSBG3(Seq ID Nr. 14)、CbSBG4(Seq ID Nr. 15)和CbSBG5(Seq ID Nr. 16)的删除盒转化以后基因SBG1SBG2SBG3SBG4SBG5C. bombicola ATCC 22214 ura 转化体中的删除,借助于菌落PCR,分别扩增5个转化体和C. bombicola ATCC 22214 ura- 的各个基因座。这里使用下述寡核苷酸:
CbSBG1的基因组删除的验证:
SBG1-KO-fw: 5'- GTG TCG ACT CGC CAA ATT CCA TCG GAG -3'
(Seq ID Nr. 27)
SBG1-KO-rv: 5'- GGT TCA TAG CGA GTT TCT TTG CAT GTG C -3'(Seq ID Nr. 28)
CbSBG2的基因组删除的验证:
SBG2-KO-fw: 5'- CTC CTT TAT TAA CTC CGC AGC ATG ACT G -3'(Seq ID Nr. 29)
SBG2-KO-rv: 5'- CTC CTC GAA GGA CCC TCA AAA CAA AGG -3'
(Seq ID Nr. 30)
CbSBG3的基因组删除的验证:
SBG3-KO-fw: 5'- CAA ATT TAT CTG GGA GCA CAG TTA CAT TGC -3'(Seq ID Nr. 31)
SBG3-KO-rv: 5'- CAC ACA TTG CTT TAG TCC AGC AAG AAC C -3'(Seq ID Nr. 32)
CbSBG4的基因组删除的验证:
SBG4-KO-fw: 5'- ATT CTC CTC GCA CGT TTC TCG GGG C -3'
(Seq ID Nr. 33)
SBG4-KO-rv: 5'- GGT TGA AAT ACT TGT TGC CGC ACT AAA G -3'(Seq ID Nr. 34)
CbSBG5的基因组删除的验证:
SBG5-KO-fw: 5'- CGC TTC CTG AAT TGA GTT GGT ATC GTT AAT G -3'(Seq ID Nr. 35)
SBG5-KO-rv: 5'- GAC ATT GTT GGA ATT GGC TGC TTA GTG G -3'(Seq ID Nr. 36)。
将下述参数用于PCR:1 x:预变性,94℃,3 min;25 x:变性,94℃,1:00 min,退火,60℃,1:00 min;延伸,72℃,5:00 min;1 x:最后一次延伸,72℃,10 min。按照生产商的推荐,使用Qiagen(Hilden)的Taq PCR Master Mix试剂盒进行扩增。随后在0.8% 的琼脂糖凝胶上各分离10µl PCR反应物。PCR操作、琼脂糖凝胶电泳操作、DNA的溴化乙锭染色和测定PCR片段大小,都以技术人员已知的方式进行。
在扩增对应的基因座时应产生在表3中给出的PCR片段大小:
表3:在成功删除后和在野生型情形中,在染色体SBG1-SBG2-SBG3-SBG4-SBG5基因座扩增时预期的PCR片段大小。
基因 在染色体删除后PCR产物的大小 在野生型情形中PCR产物的大小
SBG1 4201 bp 3678 bp
SBG2 4199 bp 3451 bp
SBG3 4199 bp 2839 bp
SBG4 4190 bp 5950 bp
SBG5 4201 bp 3360 bp
在扩增C. bombicola ATCC 22214 ura- CbSBG1-CbSBG2-CbSBG3-CbSBG4-CbSBG5基因座的情况中,仅得到在野生型情形存在时预期的片段大小,即分别是3.7 kbp、3.5 kbp、2.8 kbp、5.9 kbp和3.4 kbp。
在转化CbSBG1的删除盒以后扩增转化体的SBG1基因座的情况中,仅得到在成功的CbSBG1染色体删除以后预期的片段大小,即大约4.2 kbp。
在转化CbSBG2的删除盒以后扩增转化体的SBG2基因座的情况中,仅得到在成功的CbSBG2染色体删除以后预期的片段大小,即大约4.2 kbp。
在转化CbSBG3的删除盒以后扩增转化体的SBG3基因座的情况中,仅得到在成功的CbSBG3染色体删除以后预期的片段大小,即大约4.2 kbp。
在转化CbSBG4的删除盒以后扩增转化体的SBG4基因座的情况中,仅得到在成功的CbSBG4染色体删除以后预期的片段大小,即大约4.2 kbp。
在转化CbSBG5的删除盒以后扩增转化体的SBG5基因座的情况中,仅得到在成功的CbSBG5染色体删除以后预期的片段大小,即大约4.2 kbp。
因而,在所有5种情况下,能够鉴定出这样的克隆:其中基因CbSBG1CbSBG2CbSBG3CbSBG4CbSBG5已经发生染色体删除。相应的菌株在下文中被称作C. bombicola ATCC 22214 sbg1C. bombicola ATCC 22214 sbg2C. bombicola ATCC 22214 sbg3C. bombicola ATCC 22214 sbg4C. bombicola ATCC 22214 sbg5
实施例3:通过C. bombicola ATCC 22214、C. bombicola ATCC 22214 sbg1、C. bombicola ATCC 22214 sbg2、C. bombicola ATCC 22214 sbg3、C. bombicola ATCC 22214 sbg4和C. bombicola ATCC 22214 sbg5表征槐糖脂的形成。
在YPD琼脂平板上进行菌株C. bombicola ATCC 22214、C. bombicola ATCC 22214 sbg1C. bombicola ATCC 22214 sbg2C. bombicola ATCC 22214 sbg3C. bombicola ATCC 22214 sbg4C. bombicola ATCC 22214 sbg5的增殖。使用在下文中称作SL生产培养基的培养基用于槐糖脂的生产。所述培养基由0.1% KH2PO4、0.5% MgSO4•7 H2O、0.01% FeCl3、0.01% NaCl、0.01% 尿嘧啶、0.4% 酵母浸出物、0.1% 尿素、10.5%菜籽油和10% 葡萄糖构成。将pH调至4.5,然后在高压灭菌釜(121℃, 20 min)中将培养基灭菌。在培养过程中,不需要调节pH。
为了在摇瓶中研究槐糖脂生产,首先制备预培养物。为此,使用一环新鲜涂在YPD琼脂平板上的菌株并给在100 ml锥形瓶中的10 ml YPD培养基接种。在30℃和200 rpm培养过夜。该预培养物在下文中用于接种在1000 ml锥形瓶中的100 ml SL培养基(起始OD600 0.2)。将所述培养物在200 rpm和30℃培养7天,每天取出2 ml培养液样品,其中应当注意,在取样之前彻底混合培养基。
如下制备用于随后的色谱分析的样品:使用正位移移液器(Combitip),将800µl丙酮预先放入2ml反应器中,并立即密封反应器,以使蒸发最小化。随后加入200µl培养液。在涡旋培养液/丙酮混合物以后,以13000 rpm离心所述混合物1 min,并将800µl上清液转移进HPLC容器中。
使用蒸发光散射检测器(ELSD)进行槐糖脂和/或油酸的检测和定量测定。借助于Agilent Technologies 1200系列(Santa Clara, 加利福尼亚)和Zorbax SB-C8快速拆分柱(4.6 x 150 mm, 3.5µm, Agilent),进行实际测量。注射体积是5µl,该方法的运行时间是20 min。作为流动相使用H2O和0.1% 的TFA(三氟醋酸, 溶液A)和甲醇(溶液B)。柱温是40℃。作为检测器使用ELSD(检测器温度60℃)和DAD(二极管阵列, 210 nm)。在该方法中使用的梯度如表4中所示。
表4:要用于基于HPLC的槐糖脂定量测定的流动相的梯度形式描述。
t [min] 溶液B% 流速[ml/min]
0.00 70% 1.00
15.00 100% 1.00
15.01 70% 1.00
20.00 70% 1.00
尽管C. bombicola ATCC 22214生产了槐糖脂,但在菌株C. bombicola ATCC 22214 sbg1C. bombicola ATCC 22214 sbg2C. bombicola ATCC 22214 sbg4中没有可检测的槐糖脂形成。这清楚地证实,这些基因参与槐糖脂形成,并且在这种情况中它们满足上述的功能。菌株C. bombicola ATCC 22214 sbg3C. bombicola ATCC 22214 sbg5能够形成槐糖脂,但它们在HPLC分析中显示出改变的保留时间。
通过LC-MS2可以证实,与由C. bombicola ATCC 22214形成的槐糖脂相反,由C. bombicola ATCC 22214 sbg3形成的槐糖脂仅对应着通式(Ia)和(Ib)的化合物,其中R1 = H和R2 = H。
这证实了Sbg3p在槐糖脂生物合成中作为乙酰基转移酶(E4)的功能。
同样地,通过LC-MS可以证实,与由C. bombicola ATCC 22214形成的槐糖脂相反,由C. bombicola ATCC 22214 sbg5形成的槐糖脂仅对应着通式(Ia)的化合物,其中n = 0。
这证实了Sbg5p在槐糖脂生物合成中作为糖基转移酶II(E3)的功能。
实施例4: 过度生产参与槐糖脂生物合成的酶的假丝酵母菌ATCC 22214菌株的构建
为了使过度生产参与槐糖脂生物合成的酶的假丝酵母菌ATCC 22214菌株的构建成为可能,首先通过GeneArt AG合成整合/过表达盒(Seq ID Nr. 75)。
该整合/过表达盒包含了在表5中说明的组分:
表5: 用于研发 丝酵母菌ATCC 22214的整合/过表达盒中存在的模块和重要的限制性切割位点的概揽。
位置(bp) 组分
1 - 8 NotI识别位点
9 - 507 C. bombicola ATCC 22214 LEU2基因的上游DNA区段
508 - 513 PciI识别位点
514 - 1217 C. bombicola ATCC 22214 URA3基因的启动子区
1217 - 2005 C. bombicola ATCC 22214 URA3基因的编码区
2006 - 2586 C. bombicola ATCC 22214 URA3基因的终止子区
2587 - 2592 PciI识别位点
2593 - 2600 AsiSI识别位点
2601 - 3012 C. bombicola ATCC 22214 TSC3基因的启动子区
3011 - 3016 NdeI识别位点
3025 - 3032 FseI识别位点
3033 - 3210 C. bombicola ATCC 22214 TSC3基因的终止子区
3211 - 3218 AsiSI识别位点
3219 - 3224 MluI识别位点
3225 - 3724 C. bombicola ATCC 22214 LEU2基因的下游DNA区段
3725 - 3732 SbfI识别位点
该整合/过表达盒使得通过NdeI和FseI将从起始密码子至终止密码子的任意结构基因插入在C. bombicola ATCC 22214 TSC3基因的启动子区和终止子区之间成为可能,所述TSC3基因编码甘油醛-3-磷酸脱氢酶(van Bogaert等人; 2008)。甘油醛-3-磷酸脱氢酶是在许多酵母中大量出现的蛋白质,所以可以认为,以此方式可以实现插入的基因的强表达。选择C. bombicola ATCC 22214 URA3基因作为选择标志物,这样该整合/过表达盒可以仅用于转化C. bombicola ATCC 22214的尿嘧啶-营养缺陷型菌株。已经描述了它的生产和C. bombicola ATCC 22214 URA3基因(van Bogaert等人, 2007; van Bogaert等人, 2008)。5’-和3’-端DNA区段允许所述盒插入在C. bombicola ATCC 22214 LEU2基因座(Seq ID Nr. 37)处,其中LEU2基因被灭活。LEU2C. bombicola ATCC 22214中仅编码异丙基苹果酸脱氢酶。由于异丙基苹果酸脱氢酶是亮氨酸生物合成的重要组分,所以通过它们的亮氨酸营养缺陷现象,可以鉴定具有整合/过表达盒的正确整合的转化体。各种独特的且丰富的识别序列(NotI、PciI、AseSI、MluI、SbfI)允许置换整合/过表达盒的单个模块。所述盒被GeneArt AG克隆进专有的载体pMA中,所述载体不包含上述的切割位点,所以这些切割位点可在它们的完整范围内被使用。
为了将基因CbSBG1CbSBG3CbSBG5插入所述的整合/过表达盒中,通过PCR,从C. bombicola ATCC 22214的染色体DNA扩增该基因,并且同时,通过使用的寡核苷酸,将NdeI切割位点导入在起始密码子的上游,和将FseI切割位点导入在终止密码子的下游。为了将基因CbSBG2CbSBG4插入所述的整合/过表达盒中,首先由GeneArt AG(Regensburg)重新合成所述基因,以便如此修饰它们的序列,使得内部FseI和NotI切割位点(CbSBG2)和NdeI切割位点(CbSBG4)被去除,而没有改变被编码的蛋白质的氨基酸序列。此后,通过PCR,扩增由GeneArt AG(Regensburg)提供的修饰的基因CbSBG2modCbSBG4mod,并通过使用的寡核苷酸,同时将NdeI切割位点导入在起始密码子的上游,将FseI切割位点导入在终止密码子的下游。这里使用下述寡核苷酸:
CbSBG1
SBG1-OE-fw: 5'- ATA TAT ATA CAT ATG TTA ATC AAA GAC ATT ATT CTA ACT CCA ATG-3'(Seq ID Nr. 38)
SBG1-OE-rv: 5'- ATA TAT GGC CGG CCA ACT TAA GAA AAC CGC ACA ACC ACA CCG-3'(Seq ID Nr. 39)
CbSBG2mod
SBG2-OE-fw: 5'- ATA TAT ATA CAT ATG AGC CCT TCA TCA CAC AAA CCC CTG -3'(Seq ID Nr. 40)
SBG2-OE-rv: 5'- ATA TAT GGC CGG CCA TTC TAA GAA CTC ACC GCT AAG GCC -3'(Seq ID Nr. 41)
CbSBG3
SBG3-OE-fw: 5'- ATA TAT ATA CAT ATG GTT GTA AAC TCC TCG AAG GAC CC-3'(Seq ID Nr. 42)
SBG3-OE-rv: 5'- ATA TAT GGC CGG CCT ACC TAG ACC TTC TGG TTA GCG GTA TTG -3'(Seq ID Nr. 43)
CbSBG4mod
SBG4-OE-fw: 5'- ATA TAT ATA CAT ATG GTG GAT GAT ATA CAG GTA GAG AAG C-3'(Seq ID Nr. 44)
SBG4-OE-rv: 5'- ATA TAT GGC CGG CCA CGT CAA ATC TCT CCG AGA CCT TGC AAG -3'(Seq ID Nr. 45)
CbSBG5
SBG5-OE-fw: 5'- ATA TAT ATA CAT ATG GCC ATC GAG AAA CCA GTG ATA GTT G -3'(Seq ID Nr. 46)
SBG5-OE-rv: 5'- ATA TAT GGC CGG CCA GGT TAA GAA GCT AAT TCA CTA ATT GCC GAC -3'(Seq ID Nr. 47)。
将下述参数用于PCR:1 x:预变性,98℃,3 min;25 x:变性,98℃,0:10 min,退火,60℃,0:30 min;延伸,72℃,2:00 min;1 x:最后一次延伸,72℃,10 min。按照生产商的推荐,使用New England Biolabs(Frankfurt)的PhusionTM Fidelity Master Mix进行扩增。随后在0.8% 的琼脂糖凝胶上各分离10µl PCR反应物。PCR操作、琼脂糖凝胶电泳操作、DNA的溴化乙锭染色和测定PCR片段大小,都以本领域技术人员已知的方式进行。
在所有情况下,均可以扩增预期大小的PCR片段。这些大小是:对于CbSBG1,为1646 bp;对于CbSBG2,为1421 bp;对于CbSBG3,为809 bp;对于CbSBG4,为3929 bp;和对于CbSBG5,为1328 bp。按照限制性内切核酸酶的生产商(New England Biolabs; Frankfurt/Main)的推荐,用NdeI和FseI消化PCR产物,并连接进NdeI-和FseI-切割的载体pMA-ExCat(Seq ID Nr. 64)中。以技术人员已知的方式,进行化学感受态大肠杆菌DH5α细胞(New England Biolabs; Frankfurt/Main)的连接和转化。通过用NdeI和FseI限制酶切,验证和证实CbSBG1-CbSBG2-CbSBG3-CbSBG4-CbSBG5片段在pMA-ExCat中的正确插入。得到的载体被命名为pMA_ExCat-CbSBG1(Seq ID Nr. 65)、pMA_ExCat-CbSBG2(Seq ID Nr. 66)、pMA_ExCat-CbSBG3(Seq ID Nr. 67)、pMA_ExCat-CbSBG4(Seq ID Nr. 68)和pMA_ExCat-CbSBG5(Seq ID Nr. 69)
为了以足够的量提供各整合/过表达盒和对照盒ExCat用于随后的C. bombicola ATCC 22214 ura- 的转化,将其通过PCR扩增。使用下述寡核苷酸:
OEx-LEU2-fw: 5'- GGA CCT GCG CCC TAA AAT GGG AC -3'
(Seq ID Nr. 48)
OEx-LEU2-rv: 5'- ATC CTA GAA AAC AGC TGG ATA TGG ATA AAC-3'(Seq ID Nr. 49)。
按照生产商的说明,借助于QIAquick PCR纯化试剂盒(Qiagen, Hilden),纯化PCR产物。PCR的操作、借助于琼脂糖凝胶电泳验证成功的PCR扩增、DNA的溴化乙锭染色、测定PCR片段大小、纯化PCR产物和测定DNA浓度,都以技术人员已知的方式进行。
得到的整合/过表达盒被命名为IntEx-CbSBG1(Seq ID Nr. 70)、IntEx-CbSBG2(Seq ID Nr. 71)、IntEx-CbSBG3(Seq ID Nr. 72)、IntEx-CbSBG4(Seq ID Nr. 73)和IntEx-CbSBG5(Seq ID Nr. 74)。也得到对照盒ExCat(Seq ID Nr. 75)。
如前所述进行C. bombicola ATCC 22214 ura -的转化(van Bogaert等人Yeast. 2008. 25:273-278; van Bogaert等人FEMS Yeast Res. 2009. 9:610-617)。
为了验证整合/过表达盒(用于过表达CbSBG1CbSBG2CbSBG3CbSBG4CbSBG5)和对照盒ExCat在C. bombicola ATCC 22214 ura- LEU2基因座中的插入,借助于菌落PCR分别扩增5个转化体(在转化CbSBG1CbSBG2CbSBG3CbSBG4CbSBG5的整合/过表达盒对照盒ExCat以后)和C. bombicola ATCC 22214 ura- LEU2基因座。这里使用下述寡核苷酸:
LEU2-KI-fw: 5'- GTG CCC GAC CAC CAT GAG CTG TC -3'
(Seq ID Nr. 50)
LEU2-KI-rv: 5'- CCC AAG CAT GAG GGT CGT GCC GG -3'
(Seq ID Nr. 51)。
将下述参数用于PCR:1 x:预变性,94℃,3 min;25 x:变性,94℃,1:00 min,退火,60℃,1:00 min;延伸,72℃,5:00 min;1 x:最后一次延伸,72℃,10 min。按照生产商的推荐,使用Qiagen(Hilden)的Taq PCR Master Mix试剂盒进行扩增。随后在0.8% 的琼脂糖凝胶上各分离10µl PCR反应物。PCR操作、琼脂糖凝胶电泳操作、DNA的溴化乙锭染色和测定PCR片段大小,都以技术人员熟知的方式进行。
对应的基因座的扩增应产生在表6中指出的PCR片段大小:
表6:在SBG1-SBG2-SBG3-SBG4-SBG5表达盒和对照盒ExCat同源重组进染色体C. bombicola LEU2基因座中以后和在非同源整合以后,在扩增染色体LEU2基因座时预期的PCR片段大小。
基因 在同源整合进CbLEU2基因座以后,PCR产物的大小 在非同源整合在基因组的其它位点处以后,PCR产物的大小
SBG1 5452 bp 2235 bp
SBG2 5227 bp 2235 bp
SBG3 4615 bp 2235 bp
SBG4 7735 bp 2235 bp
SBG5 5125 bp 2235 bp
ExCat 3844 bp 2235 bp
在扩增C. bombicola ATCC 22214 ura- LEU2基因座的情况中,仅得到在野生型情形存在时预期的片段,其具有2.2 kbp的大小。
在用整合/过表达盒(用于过表达CbSBG1CbSBG2 modCbSBG3CbSBG4 modCbSBG5)转化以后扩增C. bombicola ATCC 22214转化体的LEU2基因座的情况中,仅得到在整合/过表达盒IntEx-CbSBG1(Seq ID Nr. 70)、IntEx-CbSBG2(Seq ID Nr. 71)、IntEx-CbSBG3(Seq ID Nr. 72)、IntEx-CbSBG4(Seq ID Nr. 73)和IntEx-CbSBG5(Seq ID Nr. 74)成功的染色体整合情况中预期的片段大小,它们分别是大约5.5 kbp、5.2 kbp、4.6 kbp、7.7 kbp和5.1 kbp。
因而,在所有5种情况下,可能鉴定出这样的克隆:其中可能使基因CbSBG1CbSBG2CbSBG3CbSBG4CbSBG5C. bombicola ATCC 22214 TSC3启动子控制,从而能够假定过表达。
相应的菌株在下文中称作C. bombicola ATCC 22214 PTSC3-SBG1-TTSC3 C. bombicola ATCC 22214 PTSC3-SBG2-TTSC3 C. bombicola ATCC 22214 PTSC3-SBG3-TTSC3 C. bombicola ATCC 22214 PTSC3-SBG4-TTSC3 C. bombicola ATCC 22214 PTSC3-SBG5-TTSC3
实施例5:表征通过C. bombicola ATCC 22214 ExCat、C. bombicola ATCC 22214 PTSC3-SBG1-TTSC3、C. bombicola ATCC 22214 PTSC3-SBG2-TTSC3、C. bombicola ATCC 22214 PTSC3-SBG3-TTSC3、C. bombicola ATCC 22214 PTSC3-SBG4-TTSC3和C. bombicola ATCC 22214 PTSC3-SBG5-TTSC3的槐糖脂形成
在YPD琼脂平板上繁殖菌株C. bombicola ATCC 22214 ExCat、C. bombicola ATCC 22214PTSC3-SBG1-TTSC3C. bombicola ATCC 22214PTSC3-SBG2-TTSC3C. bombicola ATCC 22214 PTSC3-SBG3-TTSC3C. bombicola ATCC 22214PTSC3-SBG4-TTSC3C. bombicola ATCC 22214PTSC3-SBG5-TTSC3。使用在下文中被称作SL生产培养基的培养基用于生产槐糖脂。该培养基由0.1% KH2PO4、0.5% MgSO4•7 H2O、0.01% FeCl3、0.01% NaCl、0.01% 尿嘧啶、0.4% 酵母浸出物、0.1% 尿素、10.5%菜籽油和10% 葡萄糖组成。将pH调至4.5,然后用高压灭菌釜(121℃, 20 min)将培养基灭菌。在培养过程中,不需要调节pH。
为了在摇瓶中研究槐糖脂生产,首先制备预培养物。为此,使用一环新鲜涂在YPD琼脂平板上的菌株并给在100 ml锥形瓶中的10 ml YPD培养基接种。在30℃和200 rpm培养过夜。该预培养物在下文中用于接种在1000 ml锥形瓶中的100 ml SL培养基(起始OD600 0.2)。将所述培养物在200 rpm和30℃培养7天,每天取出2 ml培养液样品,其中应当注意,在取样之前彻底混合培养基。
如下制备用于随后的色谱分析的样品:使用正位移移液器(Combitip),将800µl丙酮预先放入2ml反应器中,并立即密封反应器,以使蒸发最小化。随后加入200µl培养液。在涡旋培养液/丙酮混合物以后,以13000 rpm离心所述混合物1 min,并将800µl上清液转移进HPLC容器中。
使用蒸发光散射检测器(ELSD)进行槐糖脂和/或油酸的检测和定量测定。借助于Agilent Technologies 1200系列(Santa Clara, 加利福尼亚)和Zorbax SB-C8快速拆分柱(4.6 x 150 mm, 3.5µm, Agilent),进行实际测量。注射体积是5µl,该方法的运行时间是20 min。作为流动相使用H2O和0.1% 的TFA(三氟醋酸, 溶液A)和甲醇(溶液B)。柱温是40℃。使用的检测器是ELSD(检测器温度60℃)和DAD(二极管阵列, 210 nm)。在该方法中使用的梯度如表3中所示。
如同对照菌株C. bombicola ATCC 22214 ExCat一样,所有菌株均生产槐糖脂。但是,相对于C. bombicola ATCC 22214 ExCat,菌株C. bombicola ATCC 22214 PTSC3-SBG1-TTSC3 C. bombicola ATCC 22214 PTSC3-SBG2-TTSC3 C. bombicola ATCC 22214 PTSC3-SBG3-TTSC3 C. bombicola ATCC 22214 PTSC3-SBG4-TTSC3 C. bombicola ATCC 22214 PTSC3-SBG5-TTSC3 具有增加的槐糖脂形成的空间时间产率。C. bombicola ATCC 22214 ExCat在选择的条件下生产大约2 mg槐糖脂/升•小时•OD600,而在菌株C. bombicola ATCC 22214 PTSC3-SBG1-TTSC3 C. bombicola ATCC 22214 PTSC3-SBG2-TTSC3 C. bombicola ATCC 22214 PTSC3-SBG3-TTSC3 C. bombicola ATCC 22214 PTSC3-SBG4-TTSC3 C. bombicola ATCC 22214 PTSC3-SBG5-TTSC3 的情况中,该参数是在2.5 mg至6 mg之间。因而,能够证实,增强C. bombicola ATCC 22214中的酶CbSBG1CbSBG2CbSBG3CbSBG4CbSBG5会导致槐糖脂的形成增加。
实施例 6: 用于过表达具有 N- His- 标签的假丝酵母菌基因 SBG2 的载体 pTZ_E02_His-GlcTrI
为了在大肠杆菌中过表达假丝酵母菌ATCC22214基因SBG2(Seq ID Nr. 03),构建了质粒pTZ_E02_His-GlcTrI。按照生产商的说明,使用假丝酵母菌ATCC22214的染色体DNA作为PCR的模板,所述PCR使用来自Roche Diagnostics(Mannheim)的“ExpandTM High Fidelity”-PCR试剂盒。借助于寡核苷酸1373_GlcTrI_BsmBI_His_fp(Seq ID Nr. 76)和1373_GlcTrI_AscI_rp(Seq ID Nr. 77),从染色体DNA扩增SBG2基因(“PCR protocols. A guide to methods and applications”, 1990, Academic Press),并以此方式在5'末端处配备6重N-端组氨酸标签。另外,导入BsmBI和AscI切割位点。使用下述寡核苷酸:
1373_GlcTrI_BsmBI_His_fp(Seq ID Nr. 76):
5'-AAACGTCTCAGATGCACCACCACCACCACCACATGGTTGTAAACTCCTCG-3'
1373_GlcTrI_AscI_rp(Seq ID Nr. 77):
5'-AAAGGCGCGCCCTAGACCTTCTGGTTAGCG-3'。
按照生产商的说明书,借助于QIAquick PCR纯化试剂盒(Qiagen, Hilden),纯化PCR产物(1435 bp),用BsmBI和AscI切割,并且随后连接进已经以相同的方式切割的Trenzyme GmbH(Konstanz)的表达载体pTZ_E02(基于pET24d的载体; Merck Chemicals, Darmstadt)中。得到的质粒pTZ_E02_His-GlcTrI(Seq ID Nr. 78)是6700碱基对大的。以技术人员已知的方式,进行化学感受态大肠杆菌DH5α细胞(Gibco-BRL, Karlsruhe)的连接和转化。通过DNA序列分析,验证插入物的可靠性。
借助于转化,将质粒pTZ_E02_His-GlcTrI导入菌株大肠杆菌BL21(DE3)和大肠杆菌Rosetta(DE3)(二者来自Merck Chemicals, Darmstadt)中。得到的菌株被称作大肠杆菌BL21(DE3)/pTZ_E02_His-GlcTrI和大肠杆菌Rosetta(DE3)/pTZ_E02_His-GlcTrI。
实施例7: 用于过表达具有N-端His-标签的假丝酵母菌基因SBG5的载体pTZ_E02_His-GlcTrII
为了在大肠杆菌中过表达假丝酵母菌ATCC22214基因SBG5(Seq ID Nr. 06),构建了质粒pTZ_E02_His-GlcTrII。按照生产商的说明,使用假丝酵母菌ATCC22214的染色体DNA作为PCR的模板,所述PCR使用Roche Diagnostics(Mannheim)的“ExpandTM High Fidelity”-PCR试剂盒。借助于寡核苷酸1373_GlcTrII_BsmBI_His_fp(Seq ID Nr. 79)和1373_GlcTrII_AscI_rp(Seq ID Nr. 80),从染色体DNA扩增SBG5基因(“PCR protocols. A guide to methods and applications”, 1990, Academic Press),并以此方式在5’末端处配备6重N-端组氨酸标签。此外,导入BsmBI和AscI切割位点。使用下述寡核苷酸:
1373_GlcTrII_BsmBI_His_fp(Seq ID Nr. 79):
5'-AAACGTCTCAGATGCACCACCACCACCACCACATGGCCATCGAGAAACCAG-3'
1373_GlcTrII_AscI_rp(Seq ID Nr. 80):
5'-AAAGGCGCGCCTTAAGAAGCTAATTCACTAATTGCC-3'。
按照生产商的说明书,借助于QIAquick PCR纯化试剂盒(Qiagen, Hilden),纯化PCR产物(1342 bp),用BsmBI和AscI切割,并且随后连接进Trenzyme GmbH,Konstanz的已经以相同的方式切割的表达载体pTZ_E02(基于pET24d的载体; Merck Chemicals, Darmstadt)中。得到的质粒pTZ_E02_His-GlcTrII(Seq ID Nr. 81)是6607碱基对大的。以技术人员已知的方式,进行化学感受态大肠杆菌DH5α细胞(Gibco-BRL, Karlsruhe)的连接和转化。通过DNA序列分析,验证插入物的可靠性。
借助于转化,将质粒pTZ_E02_His-GlcTrII导入菌株大肠杆菌BL21(DE3)和大肠杆菌Rosetta(DE3)(二者来自Merck Chemicals, Darmstadt)中。得到的菌株被称作大肠杆菌BL21(DE3)/pTZ_E02_His-GlcTrII和大肠杆菌Rosetta(DE3)/pTZ_E02_His-GlcTrII。
实施例 8: 用于过表达具有N-端His-标签的假丝酵母菌基因SBG3的载体 pTZ_E02_His-AcTr
为了在大肠杆菌中过表达假丝酵母菌ATCC22214基因SBG3(Seq ID Nr. 04),构建了质粒pTZ_E02_His-AcTr。按照生产商的说明,使用假丝酵母菌ATCC22214的染色体DNA作为PCR的模板,所述PCR使用Roche Diagnostics(Mannheim)的“ExpandTM High Fidelity”-PCR试剂盒。借助于寡核苷酸1373_AcTr_BsmBI_His_fp(Seq ID Nr. 82)和1373_AcTr_AscI_rp(Seq ID Nr. 83),从染色体DNA扩增SBG3基因(“PCR protocols. A guide to methods and applications”, 1990, Academic Press),并以此方式在5’末端处配备6重N-端组氨酸标签。此外,导入BsmBI和AscI切割位点。使用下述寡核苷酸:
1373_AcTr_BsmBI_His_fp(Seq ID Nr. 82):
5'-AAACGTCTCAGATGCACCACCACCACCACCACATGGTTGTAAACTCCTCG-3'
1373_AcTr_AscI_rp(Seq ID Nr. 83):
5'-AAAGGCGCGCCCTAGACCTTCTGGTTAGCG-3'。
按照生产商的说明书,借助于QIAquick PCR纯化试剂盒(Qiagen, Hilden),纯化PCR产物(823 bp),用BsmBI和AscI切割,并且随后连接进已经以相同的方式切割的Trenzyme GmbH(Konstanz)的表达载体pTZ_E02(基于pET24d的载体; Merck Chemicals, Darmstadt)中。得到的质粒pTZ_E02_His-AcTr(Seq ID Nr. 84)是6088碱基对大的。以技术人员已知的方式,进行化学感受态大肠杆菌DH5α细胞(Gibco-BRL, Karlsruhe)的连接和转化。通过DNA序列分析,验证插入物的可靠性。
借助于转化,将质粒pTZ_E02_His-AcTr导入菌株大肠杆菌BL21(DE3)和大肠杆菌Rosetta(DE3)(二者来自Merck Chemicals, Darmstadt)中。得到的菌株被称作大肠杆菌BL21(DE3)/ pTZ_E02_His-AcTr和大肠杆菌Rosetta(DE3)/ pTZ_E02_His-AcTr。
实施例 9: 参与槐糖脂生物合成的酶 SBG2 SBG3 SBG5 的异源表达
在每种情况下,首先在5 ml含有50µg/ml卡那霉素的LB培养基(10 g/l胰蛋白质胨、5 g/l酵母浸出物、10 g/l NaCl)中,在37℃和175 rpm,培养在第1-3项下构建的大肠杆菌菌株的单一菌落8小时。此后,给在500 ml摇瓶中的100 ml LB培养基接种第一种预培养物,并在37℃和175 rpm培养过夜。次日早晨,给1 l LB培养基(具有0.1的起始OD600)接种第二种预培养物(5-l摇瓶)。在37℃和175 rpm温育所有培养物。通过表观光密度(OD600),跟踪培养物的生长。当达到约0.3的OD600时,将培养温度从37℃降至20℃。在0.6的OD600时,通过加入0.5 mM IPTG(终浓度),诱导各目标基因的表达。在所有培养步骤中,加入合适的抗生素(卡那霉素50µg/ml)。在加入IPTG之前和在诱导以后24 h,采取用于分析的样品。按照生产商的说明书,用Bugbuster(Merck Chemicals, Darmstadt)破碎细胞,以使可溶性的和不溶性的蛋白质彼此分离。借助于SDS-PAGE,分离可比较量的细胞提取物,随后用胶态考马斯对凝胶染色。对于所有3种重组生产的蛋白质(具有His标签的Sbg2p、Sbg3p和Sbg5p)而言,在细胞提取物的可溶性的级分中均可检测到过量生产。
实施例 10 :参与槐糖脂生物合成的酶 Sbg2p Sbg3p Sbg5p 的纯化
在诱导基因表达24 h后,借助于离心(8000 g, 20 min, 4℃)收获细胞。1升培养物产生约5 g湿生物质。将细胞团块重新悬浮于100 ml缓冲液A(100 mM Tris、pH 7.8、50 mM NaCl、20 mM咪唑)中,所述缓冲液A另外包含蛋白酶抑制剂(Roche, 定货号11 873 580 001)。通过在微射流机(Microfluidizer)中经过6次,破碎重新悬浮的细胞。在另外的离心步骤(10000 g, 20 min, 4℃)以后,过滤(孔径: 0.45µm)上清液,以得到可溶性的蛋白质级分。通过His-标签亲和色谱柱(GE, HisTrap FF 1 ml柱, 定货号17-5319-01),纯化目标蛋白质。流速为1 ml/min。使用缓冲液B(100 mM Tris、pH 7.8、50 mM NaCl、500 mM咪唑)进行0-100%的线性洗脱。为此,采用20倍柱体积的缓冲液B,并收集2 ml级分。合并含有蛋白质的洗脱液级分,并借助于过滤单元(Amicon Ultra-15, NMWL 10 kDa Centricons, Millipore, 定货号UFC901024)进行浓缩。此后,借助于使用Sephadex 25(PD-10柱, GE, 定货号17-0851-01)的凝胶过滤更换缓冲液,将各个蛋白质级分进到最终的缓冲液(100 mM Tris、pH 7.8、50 mM NaCl)中。借助于SDS-PAGE,验证蛋白质纯化。从1 l培养物中纯化出3.3 mg Sbg2p(蛋白质浓度1.0µg/µl)、7.3 mg Sbg5p(蛋白质浓度2.2µg/µl)和6.9 mg Sbg3p(蛋白质浓度2.1µg/µl)。
实施例 11 :参与槐糖脂生物合成的酶 Sbg2p Sbg3p Sbg5p 的表征
为了证明参与槐糖脂生物合成的酶Sbg2p、Sbg3p和Sbg5p的功能,使用3种纯化的酶Sbg2p、Sbg3p和Sbg5p分别单独地和以所有可能的组合进行酶测定。该测定以350µl的总体积按照下表进行:
表7:酶测定混合物的组成(µl)
Figure 89498DEST_PATH_IMAGE003
通过加入14 μl 13.4 mM的底物(18-羟基-Z-9-十八碳烯酸)在乙醇中的溶液,开始反应,并在振摇下(600 rpm)在30℃温育6 h。此后,通过加入1.4 ml丙酮,终止反应。通过离心(16100 g, 5 min, RT),沉淀出未溶解的组分。随后将上清液转移进新容器中,并通过真空蒸发器(25℃)浓缩至最初的反应体积(350µl)。借助于LC-ESI-MS分析样品,并通过分析相应的质量迹线和MS波谱,鉴定产物。
为了鉴定所形成的产物,将5µl产物注射进UPLC系统Accela(Thermo Scientific, Dreieich)中。用半-UPLC柱“Pursuit XRs ULTRA”(C8, 2.8µm, 2.1 x 100 mm)(Varian, Darmstadt),分析要研究的物质。使用由流动相A1(H2O, 0.1% (v/v)TFA)和流动相B1(甲醇, 0.1% (v/v)TFA)组成的梯度,以0.3 ml/min的流速,在40℃在25 min内进行分离。梯度随时间的进程示于表8中。
表8:HPLC梯度的进程
时间[min] 流动相A1 [%] 流动相B1 [%]
0 30 70
15 0 100
25 0 100
25.01 30 70
32 30 70
在200 - 600 nm的波长范围内,借助于DAD检测器进行检测,并用高分辨率FT-ICR质谱仪LTQ-FT(Thermo Scientific, Dreieich)在扫描范围m/z 100 - 1000内进行质量选择。用ESI(电喷射离子化)进行离子化。借助于FT-ICR质量分析仪(具有R = 100000的分辨率和≤2 ppm的质量准确度),测定精确的质量和化学经验式。
使用的对照反应是,仅包含底物UDP-葡萄糖、乙酰辅酶A和18-羟基-Z-9-十八碳烯酸、但是没有酶的混合物(参见表7)。在该样品中,通过MS仅能够检测到底物18-羟基-Z-9-十八碳烯酸(C18H34O3; 298.2502 g/mol)。
混合物2(参见表7)除了包括底物以外,还包含105µg Sbg3p。如同在混合物1中一样,在该样品中仅能够检测到18-羟基-Z-9-十八碳烯酸。
混合物3(参见表7)除了包括底物以外,还包含100µg Sbg2p。除了底物18-羟基-Z-9-十八碳烯酸以外,检测到18-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸(经验式C24H44O8;分子量460.3031 g/mol)。这表明,Sbg2p能够将UDP-葡萄糖和18-羟基-Z-9-十八碳烯酸转化成18-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸。
混合物4(参见表7)除了底物以外,另外包含110µg Sbg5p。除了底物18-羟基-Z-9-十八碳烯酸以外,检测到18-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸和18-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸(经验式C30H54O13;分子量622.3559 g/mol)。这表明,Sbg5p能够将UDP-葡萄糖和18-羟基-Z-9-十八碳烯酸转化成18-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸,并进一步转化成18-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸。
混合物5(参见表7)除了底物以外,另外包含100µg Sbg2p和105µg Sbg3p。除了底物18-羟基-Z-9-十八碳烯酸以外,检测到18-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸和18-(6-O-乙酰基-β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸(经验式C26H46O9;分子量502.3136 g/mol)。这证实,如同在混合物3中已经证实的,Sbg2p能够将UDP-葡萄糖和18-羟基-Z-9-十八碳烯酸转化成18-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸,并进一步证实,在乙酰辅酶A存在下,Sbg3p能够乙酰化18-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸,得到18-(6-O-乙酰基-β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸。
混合物6(参见表7)除了底物以外,另外包含110µg Sbg5p和105µg Sbg3p。除了底物18-羟基-Z-9-十八碳烯酸以外,检测到18-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸、18-(6-O-乙酰基-β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸、18-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸、18-L-[(6'-O-乙酰基-2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸(经验式C32H56O14;分子量664.3665 g/mol)和18-L-[(6'-O-乙酰基-2'-O-β-D-吡喃葡萄糖基-6''-O-乙酰基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸(经验式C34H58O15;分子量706.3770 g/mol)。这证实,如同在混合物4中已经证实的,Sbg5p能够将UDP-葡萄糖和18-羟基-Z-9-十八碳烯酸转化成18-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸,并进一步转化成18-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸,并且进一步证实,所形成的产物可以在乙酰辅酶A存在下被Sbg3p乙酰化,成为18-L-[(6'-O-乙酰基-2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸和/或18-L-[(2'-O-β-D-吡喃葡萄糖基-6''-O-乙酰基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸以及18-L-[(6'-O-乙酰基-2'-O-β-D-吡喃葡萄糖基-6''-O-乙酰基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸。
混合物7(参见表7)除了底物以外,另外包含100µg Sbg2p和110µg Sbg5p。除了底物18-羟基-Z-9-十八碳烯酸以外,检测到18-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸和18-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸。这证实,Sbg2p和Sbg5p在混合物中能够将UDP-葡萄糖和18-羟基-Z-9-十八碳烯酸转化成18-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸,并进一步转化成18-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸。
混合物8(参见表7)除了底物以外,另外包含100µg Sbg2p、105µg Sbg3p和110µg Sbg5p。除了底物18-羟基-Z-9-十八碳烯酸以外,检测到18-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸、18-(6-O-乙酰基-β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸、18-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸、18-L-[(6'-O-乙酰基-2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸和18-L-[(6'-O-乙酰基-2'-O-β-D-吡喃葡萄糖基-6''-O-乙酰基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸。这证实,如同在混合物7中已经证实的,Sbg2p和Sbg5p一起能够将UDP-葡萄糖和18-羟基-Z-9-十八碳烯酸转化成18-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸,并进一步转化成18-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸,并且进一步证实,如同在混合物5和6中已经证实的,所形成的产物能够在乙酰辅酶A存在下被Sbg3p乙酰化,成为18-L-[(6'-O-乙酰基-2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸和/或18-L-[(2'-O-β-D-吡喃葡萄糖基-6''-O-乙酰基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸以及18-L-[(6'-O-乙酰基-2'-O-β-D-吡喃葡萄糖基-6''-O-乙酰基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸。
实施例 12 :灭活假丝酵母菌 ATCC 22214 中的乙酰基转移酶( SBG3 )的替代途径
在一个替代途径中,单独地灭活基因SBG3,并表征对应的突变体在槐糖脂生物合成方面的表型。为了在C. bombicola ATCC 22214中构建对应的突变体,首先由GeneArt AG(Regensburg)合成CbSBG3的删除盒(Seq ID Nr. 14;参见实施例2)。此后,在Trenzyme GmbH(Konstanz)将基因CbURA3通过潮霉素抗性盒置换,该基因编码C. bombicola ATCC 22214的乳清酸核苷-5-磷酸脱羧酶(van Bogaert等人Yeast. 2007. 24(3):201-8)。为此,使用下述寡核苷酸,从载体p-Col-5的DNA(Seq ID Nr. 85)扩增潮霉素盒:
1390_hygR_fp_EcoRV: 5'- AAA GAT ATC TCT ATG CGC ACC CGT TCT C -3'
(Seq ID Nr. 86)
1390_hygR_rp_Hind/Bgl: 5'- TTT AGA TCT AAG CTT GAG ACA CCT CAG CAT GCA CCA TTC -3'
(Seq ID Nr. 87)。
将下述参数用于PCR:1 x:预变性,98℃,3 min;25 x:变性,98℃,0:10 min,退火,60℃,0:30 min;延伸,72℃,2:00 min;1 x:最后一次延伸,72℃,10 min。按照生产商的推荐,使用New England Biolabs(Frankfurt)的PhusionTM Fidelity Master Mix进行扩增。按照生产商的说明书,使用QIAquick PCR纯化试剂盒(Qiagen, Hilden)纯化PCR产物。得到的PCR产物具有1831 bp的大小。PCR操作、借助于琼脂糖凝胶电泳验证成功的PCR扩增、DNA的溴化乙锭染色、测定PCR片段大小、纯化PCR产物和测定DNA浓度,都以技术人员已知的方式进行。将潮霉素盒克隆进载体pCR4_AcTr_URA(Seq ID Nr. 88)中,其中将所述载体用限制性内切核酸酶BglII和PmlI线性化。用限制性内切核酸酶EcoRV和BglII准备插入物用于随后的连接。以技术人员已知的方式进行连接,随后转化进大肠杆菌DH5α细胞中。通过DNA序列分析,验证插入物的可靠性。
产生的质粒被命名为pCR4_AcTr_HygR(Seq ID Nr. 89),且是8578 bp大的。
删除盒CbSbg3-hyg(Seq ID Nr. 90)由肺炎克雷伯菌(Klebsiella pneumoniae)潮霉素抗性基因(hph)组成,该基因编码潮霉素B磷酸酶(Gritz L和Davies J 1983 Plasmid-encoded hygromycin B resistance: the sequence of hygromycin B phosphotransferase gene and its expression in Escherichia coli and Saccharomyces cerevisiae. Gene 25 (2-3): 179-188)。该抗性基因的启动子是组成型杂合启动子hp4d(Madzak等人2000, Strong hybrid promoters and integrative expression/secretion vectors for quasi-constitutive expression of heterologous proteins in the yeast Yarrowia lipolytica. J. Mol. Microbiol. Biotechnol. 2, 207–216)。该抗性基因侧接XPR2基因的终止子,所述XPR2基因编码来自解脂假丝酵母(Y. lipolytica)的胞外蛋白酶(Nicaud等人1989a. Cloning, sequencing and amplification of the alkaline extracellular protease (XPR2) gene of the yeast Yarrowia lipolytica. J. Biotechnol. 12, 285–298)。该抗性基因在上游和下游侧接大约1000 bp的要灭活的基因的邻接区域。
在每种情况下,将loxP-基因座(其任选地通过暂时导入重组酶Cre编码基因及其功能性表达允许删除CbURA3基因)(关于综述,参见Kühn & Torres. Methods Mol Biol. 2002. 180:175-204)嵌入在侧接区和hph基因之间。构建对应于下表9中所示数据的删除盒:
表9:C. bombicola ATCC 22214的编码Sbg3p的结构基因的删除盒的结构。
Seq ID Nr. 基因 5’-侧接区 loxP基因座1 hph loxP基因座2 3’-侧接区
90 SBG3 1 - 1033 1034 - 1066 1067 - 3599 3600 - 3633 3634 - 4635
为了以足够的量提供删除盒用于随后的C. bombicola ATCC 22214的转化,通过PCR扩增所述删除盒。这里使用下述的寡核苷酸:
用于灭活CbSBG3的删除盒的扩增:
SBG3-fw: 5'- TGC AGA CAA GTT CCT GCA GCT G -3'
(Seq ID Nr. 21)
SBG3-rv: 5'- ATG CTT TAT TCA GGC ACG CTA CG -3'
(Seq ID Nr. 22)。
将下述参数用于PCR:1 x:预变性,98℃,3 min;25 x:变性,98℃,0:10 min,退火,60℃,0:30 min;延伸,72℃,2:00 min;1 x:最后一次延伸,72℃,10 min。按照生产商的推荐,使用New England Biolabs(Frankfurt)的PhusionTM Fidelity Master Mix进行扩增。按照生产商的说明书,使用QIAquick PCR纯化试剂盒(Qiagen, Hilden)纯化PCR产物。PCR操作、借助于琼脂糖凝胶电泳验证成功的PCR扩增、DNA的溴化乙锭染色、测定PCR片段大小、纯化PCR产物和测定DNA浓度,都以技术人员已知的方式进行。
如前所述进行C. bombicola ATCC 22214的转化(van Bogaert等人Yeast. 2008. 25:273-278; van Bogaert等人FEMS Yeast Res. 2009. 9:610-617)。
为了验证在用CbSBG3(Seq ID Nr. 90)的删除盒转化以后基因SBG3C. bombicola ATCC 22214转化体中的删除,借助于菌落PCR,分别扩增5个转化体和C. bombicola ATCC 22214的各基因座。这里使用下述寡核苷酸:
CbSBG3的基因组删除的验证:
SBG3-KO-fw: 5'- CAA ATT TAT CTG GGA GCA CAG TTA CAT TGC -3'(Seq ID Nr. 31)
SBG3-KO-rv: 5'- CAC ACA TTG CTT TAG TCC AGC AAG AAC C -3'(Seq ID Nr. 32)。
将下述参数用于PCR:1 x:预变性,94℃,3 min;25 x:变性,94℃,1:00 min,退火,60℃,1:00 min;延伸,72℃,5:00 min;1 x:最后一次延伸,72℃,10 min。按照生产商的推荐,使用Qiagen(Hilden)的Taq PCR Master Mix试剂盒进行扩增。随后在0.8% 的琼脂糖凝胶上各分离10µl PCR反应物。PCR操作、琼脂糖凝胶电泳操作、DNA的溴化乙锭染色和测定PCR片段大小,都以技术人员已知的方式进行。
在扩增C. bombicola ATCC 22214的CbSBG3基因座时,仅测得在野生型情形存在时预期的片段大小,即2839 bp。
在转化删除盒CbSBG3-hyg以后扩增转化体的SBG3基因座时,仅得到在从染色体成功删除CbSBG3以后预期的片段大小,即4693 bp。
因此,可以鉴定出这样的克隆:其中染色体删除了基因CbSBG3。相应的菌株此后被称作C. bombicola ATCC 22214 sbg3-hyg
实施例 13 :经由 C. bombicola ATCC 22214 sbg3-hyg 的槐糖脂形成的表征。
在YPD琼脂平板上繁殖菌株C. bombicola ATCC 22214和C. bombicola ATCC 22214 sbg3-hyg。使用在下文中称作SL生产培养基的培养基用于生产槐糖脂。该培养基由0.1% KH2PO4、0.5% MgSO4•7 H2O、0.01% FeCl3、0.01% NaCl、0.4% 酵母浸出物、0.1% 尿素、10.5%菜籽油和10% 葡萄糖组成。将pH调至4.5,然后借助于高压灭菌釜(121℃, 20 min)将培养基灭菌。在培养过程中,不需要调节pH。
为了在摇瓶中研究槐糖脂生产,首先制备预培养物。为此,使用一环新鲜涂在YPD琼脂平板上的菌株并给在100 ml锥形瓶中的10 ml YPD培养基接种。在30℃和200 rpm培养过夜。该预培养物在下文中用于接种在1000 ml锥形瓶中的100 ml SL培养基(起始OD600 0.2)。将所述培养物在200 rpm和30℃培养7天,每天取出2 ml培养液样品,其中应当注意,在取样之前彻底混合培养基。
如下制备用于随后的色谱分析的样品:使用正位移移液器(Combitip),将800µl丙酮预先放入2ml反应器中,并立即密封反应器,以使蒸发最小化。随后加入200µl培养液。在涡旋培养液/丙酮混合物以后,以13000 rpm离心所述混合物1 min,并将800µl上清液转移进HPLC容器中。
使用蒸发光散射检测器(ELSD)进行槐糖脂和/或油酸的检测和定量测定。借助于Agilent Technologies 1200系列(Santa Clara, 加利福尼亚)和Zorbax SB-C8快速拆分柱(4.6 x 150 mm, 3.5µm, Agilent),进行实际测量。注射体积是5µl,该方法的运行时间是20 min。作为流动相使用H2O和0.1% 的TFA(三氟醋酸, 溶液A)和甲醇(溶液B)。柱温是40℃。使用的检测器是ELSD(检测器温度60℃)和DAD(二极管阵列, 210 nm)。在该方法中使用的梯度如在下表10中所示。
表10:要用于基于HPLC的槐糖脂定量测定的流动相的梯度形式描述。
t [min] 溶液B% 流速[ml/min]
0.00 70% 1.00
15.00 100% 1.00
15.01 70% 1.00
20.00 70% 1.00
分析表明,C. bombicola ATCC 22214和C. bombicola ATCC 22214 sbg3-hyg都生产槐糖脂。
借助于LC-MS2证实,与由C. bombicola ATCC 22214形成的槐糖脂相反,由C. bombicola ATCC 22214 sbg3-hyg形成的槐糖脂仅对应通式(Ia)和(Ib)的化合物,其中R1 = H,且R2 = H(参见图1和2),并且这些化合物的浓度与C. bombicola ATCC 22214相比增加了因数10。这证实了Sbg3p在槐糖脂生物合成中作为乙酰基转移酶的功能。
序列表
<110> Evonik Degussa GmbH
<120> 细胞、核酸、酶和它们的用途以及用于生产槐糖脂的方法
<130> 200900338
<160> 90
<170> PatentIn 3.4版
<210> 1
<211> 18013
<212> DNA
<213> 假丝酵母菌
<400> 1
caaactcgac gctaaacaga ccttaaatga caccaatcaa tgtgaaaaaa tcaagttttt 60
ttgttcactc tatattgact gtttccgatg tgtgctatgc agccctcttt gaatcggtgg 120
aagcatgtag ttgaagaaag atggacgtag gagaaacatc aaactgaaca atagtaactt 180
aaacgtggtt tagaatgcaa gagcaggctc gctgctatgg cattcatagc caggaaagaa 240
acacggatga tctcacactt tgttggatcg acagtcggat ttttttgaaa atttatactt 300
ggcatacatc ttaatacagg ggtagaagga gaagtcgcga gagcgatttc tccgtcattt 360
attcgccgac aaatgtggat ccgtatttag cagattcgaa gtaaattgca ctcgacacca 420
cccacgtgat cgacactgtc gcgtcgatct ccatatatgt acgtgcctat ataaacaagc 480
aacacgcaga ttttgaaatc acatagggag ttgcccgtat gaatccggtt caaataataa 540
tactttgttt tcagatagga gaaacaaaac acccttggta ctcagaagac aaataacgat 600
ccattgtttt caactggaag aaataataca cattgatatt cagaagacaa ataactatcc 660
catttcttta gtatgtgcga aggtaaacag ttctatttca ccttaaaaac actactgaaa 720
gtgcgacata ctgtcgtacg taaaatataa aagcaatcac tatcatttcg ccattatcct 780
tgtcttgtaa taatccaaaa ctgagatcgg gaacggttcc cgttcttgac ataagcagga 840
gctgagaaca ggaacggttc ctggtcttga aatcagcagt aatagagaac gggattggtt 900
cccgttcttg acataagcag gaattggaaa caggaacggt tcccggtctt gacatcagca 960
gggatcgaaa acaggagcgg tttccggtct tgacatgata caaagaatga ttctttgtat 1020
cgggtctatg ggaggaaaaa cagctcattt tcacagaaaa tacagagaac aaaataattg 1080
aaagcgcgac ataatgtcgt acgtagaatt tagaagcaat tactcttatt tttccattat 1140
ccgcgctatt gtacacacac ccaaaccaga acgcgacttg agtgcaatgc ttactaacgc 1200
gcacattaat aagcaaatat agatacgcgg agagcacgcg aaatttgttt accagtacac 1260
tagtgcttag cacaatgaaa tagaccgtac tccggctgag gctcaaagtc cagaagttag 1320
agatttgcca gtttcgttac tagacggttc gttgtgccag gtatgtcgta cagcgcattt 1380
atcagggacg gaaatgggtc ttccatccct gttttggaat gcgctgtcga tccggacgca 1440
gcctcagccg cgtctatttc aaccccccat tagacaggcg gtacattagc tgtttggcct 1500
tcacgctaca gcataattct ccgtcatgtg tgtttccatg accaagaatt gttttggccc 1560
acgaaccaag atcatcgccg tcatataaac ccacattgga gtgttgactc tccatagctt 1620
gtcgttgaat gcaaacttga tgcccgcaaa agtgcttatt agcctacgca ctgattcgcc 1680
ccactctgcg agccacattt ccgctagctt aacatcaggc accgcaatcg gtgcctggac 1740
tgtctccggg ctcggccgag cccggttgag accatcttct tcaaattcat cttctgatag 1800
ctcatctaac atcctagagc tgttcctctt tttccttctt tttgttaatt ggtatttaaa 1860
ccaccaagtg tgtaaacttg tatttttgtc atccgagaga tatctaatag caagtttgcg 1920
attagttaca aatttgttgc gctcttgttc ggtactctta ttgaaacaag ggtgtcgact 1980
cgccaaattc catcggagaa aattgttcga tggatagctt tggagtctgt cccatcatga 2040
tacgaaaagc gtgaagctcc tctgacaatc aaaactttgt ttcaatgggg tgtaggatgg 2100
accccggatc caaacgaccg cgagtcaaaa aacctacggg tgcatttacc cgtagttgat 2160
ctggaaagtc gagatcaact ttttgtagtt tagttacatt catttcacgg tcgaaaaact 2220
cacacacaac gattgcagta tatttaccaa aatcgtctga agagaagcat ctgattgaga 2280
gttcaccatg acgaatccca taaacgacta ctccactgga cacaccgaca gacgccctgg 2340
ggatagtgaa actgaatttg tcggtataat ggcccgtctc acaggccggg cagaacactt 2400
tcatgtcctt tcgcaggtct cgacattgga caagtatgtt gtcgtgggtg acgacaaatt 2460
ggtcctcatc cttgaataag atgctccctt tgttctcagg aactggcacc attccattat 2520
gggcgaataa tttctgctca tcttcgggac tgatgccata ttcttctaac agaagacggc 2580
gctcacatgg gacctggtgc tctcgccggc ctctcaaatc gccggtgcat ctccacacgc 2640
aaattcacgg gtgtataccc ctgatcaaac gtatcttgcg cgttctgtta ttcattggag 2700
cgagggcccg atcctgtcct atcaaatgat ttcatgtggg aataatccat caattgttct 2760
ggattgaggt atacttcgag ctgtaaagat gtcgcttcta tgtcaagaat agtcggttaa 2820
acgcactcct tcaagattta catgatttac atgattcttc ataaagagca taaataaaga 2880
actgcagcca ttcttgagta aagtgctcag aataataaaa aggttgccac aggttgagtt 2940
aacatgggtt gattgaacca attaaggagg gaacgtttct tccatgggag gctaagaaac 3000
ttaagaaaac cgcacaacca caccgggagg agcgtgttga gctgtaagcg ttgttgagaa 3060
acgaggggac tctgggaagt cgggacccat ctcaatcttg gaatactcct gtaagagtct 3120
caccagagtt agcgaaagct ctgtcagggc gaattgttgg ccgagacaaa ttcggggacc 3180
gccattgaag ggcaagaatg cccacacatt atctagcttc aagttctccc atcgattggg 3240
attgaattcg tgggcgtcag gaccccaata cttgatgtcc ctgtggacca tgtaaattga 3300
atagtaaact gcggtgccct taggaacgaa gatcggatcc ttctgctcgg gaccaccacc 3360
tatgggtaga gttgtatctc tcacagcagt acggaagttc aatggcaata ccggcgcaag 3420
acgcaagact tcatttataa cttgcttcaa ataaggtgct tgcttcagaa gttcgaatga 3480
taaaggcctt tgctcctcct tggttccaaa atgatcgagg acctcctcac gtagtttgtt 3540
gaatacgtca ggatttctgg caaggaaatg aatagcgaag ctcaacgtag cagctgttgt 3600
atctctacca gcaatgagaa tgttgaaaat ttgatcacgt atcgtcactg ggtctcgggt 3660
aactttagcc atctcaagcg agaacacata gatgccacta gactctgcag cagcatcctt 3720
ctctgcaata gagttctcag cagcgaaaga tgtggcgtaa agagccttat caacgtagta 3780
gtcaatatag gactgagcac gtttcttgtg atctcggaat tccttagagt tgaacaacca 3840
gtagactttg cttgataggg tccgtttgaa agcgtaattc agtagaaagt tgtaggactc 3900
cacgaattgt tcggcagtaa tctccgaacc atcacgggct acaatacatg actgattctc 3960
agggttcaag ctctcgcagg actccccaaa taggaattca gtcgctgtat ccagcgtaag 4020
tttgtggaaa taatgttgaa catcaataaa ttggtccact ttcattgcac ggttcatctc 4080
ctttattaac tccgcagcat gactggaaat ctgatcaatt ctgcaaacct gatctttagt 4140
gaactgaggt ctcaacatcg atcgagactg tttccatcca tttccgctga gtgtaaatat 4200
cccttggcca aacacttttc ccactgtgtg gaaacgtgct ccaagaccaa aatcattgaa 4260
tttggttgcc aggattgtct taatgttttc tggctcgatt gtgaagattt ggtattgaag 4320
gggagcttgt cgaagatacg tccgtgcttt gaacttattg aagactctgt cgtattgaac 4380
ttccagtaag gtgtatgact tggccgtctt gatcatgtcc atggttcttt gtattcccag 4440
tgggaacgat ttctcaatga agcgaggcat actacacttg tgcctacgtg ctgcatagcg 4500
gtaccatagg agccagatag gctcgtgtag aactaagaaa gctacgaaga gcagtggcaa 4560
caagccagca acagcggata aactcattgg agttagaata atgtctttga ttaacatata 4620
tgtacttttc aatatgataa acggagaaat aacgcccggc tctatatgca agctgcatca 4680
accctaatat atattagcga gtttctcatg caggctgtag tttgagtcgc tgtaacctca 4740
gcctcaagac tcttacacca taggtagagt ttcgtcactg ggaaactcag ttactatcta 4800
aaccaaactg tgctaatgct caaacctatc actcagaatt tagattgaat caatctaagt 4860
ctgttgagaa acagatatgc atcaggggca cagactaaaa gctgctctca gcgagtaccc 4920
ttacctcttg agaaccctca aaatttaccc agcctgcagc atatcatgca ccatggttaa 4980
attcggaaat gaatttaccg gtggccttga accacgttcc tccaattatt taaggcaata 5040
acctgccact ctcttgattt gattaagaaa gactttcaat ttagcttctc cctacgaata 5100
ttcaatgagc ccttcatcac acaaacccct gattctcgct tgcggcttgc ctctttcagg 5160
ccatataatg cccgttttga gtctggtaca cggccttacg gacgacggat acgaagctac 5220
tgttgtgaca ggcagagcgt ttgaacaaaa agttcgagat gtgggtgcag actttgttcc 5280
tttagaaggg aacgcagatt ttgatgacca caccttagac gatctggtcc cgggccgtaa 5340
agacatggcc ccaagcttcg atcgtacagt tcaagatgtg gagcacatga tggtagctac 5400
tcttcctgag cagtttgccg ctattcagag ggctttcaaa aagctcagcg caagcggccg 5460
ccctgtcgtt cttgtcagtg aagtgctgtt tttcggtgca caccctatca gcctcggtgc 5520
tcctggtttc aaacccgctg gctggatttg tttaggggtt ttgcctcttt tgatccgcag 5580
tgatcatacc ttaggacttg acaacgacag gagccccgaa gcacatgcaa agaaactcgc 5640
tatgaaccac gctcttgagc accaaatttt cgttaaagcc actgctaagc acaaggaaat 5700
ctgccgagag ttaggttgca ctgaagatcc caaatttatc tgggagcaca gttacattgc 5760
tgcagacaag ttcctgcagc tgtgcccgcc ttctcttgag ttcagcagag accatctgcc 5820
tagcaacttc aaattcgccg gctcaacgcc caagcaccga actcaattca cccctccttc 5880
ctggtggggg gatgttctga gtgccaagcg agtcatcatg gtcactcaag gaacttttgc 5940
tgtcagttac aagcatctta ttgtgcctac tcttgaggcc ttgaaggacg agcctgacac 6000
tttaacagta gccatattgg gccgccgcgg tgccaagcta ccggatgatg ttgtggttcc 6060
tgagaatgct cgcgtgatcg actacttcaa ctacgatgct ctacttcctc acgttgatgc 6120
tcttgtctac aatggtggat atggcggact tcagcacagc ttaagccact ctgttccagt 6180
tgttattgct ggtgactctg aagacaagcc aatggtggca tcgagagctg aggccgctgg 6240
cgtggcaatt gatttgaaaa ctggcttgcc tacagtggag caaatcaaag aagctgttga 6300
ttcgataatt ggaaatccga aattccacga agcctcgaag aaggttcaaa tggagttgga 6360
aagccacaac tccttgaaaa ttcttgagga aagcatcgag gaaatcgcca gccatgactt 6420
tggtcttttg accaagagtg acgaggaaac tgaagatata cctgtcaaag ggccggcctt 6480
agcggtgagt tcttagaatc gtacgatcaa atcagatcag ggaagagagg tagggttttt 6540
tttatttatg tctttgtttt tattgattga aatttacaat acaacaacca tcaaattaat 6600
ttgaacaaac aacaacacac acacacactg caactttcaa aaaaataagt aaaaggaaga 6660
gaggagtttg ccaatatatt taccttcttc taattctgtt atttttttta attgttttgt 6720
ggaaagaaag aagaaaaggc tgtcatgaat ttagtttacc tagaccttct ggttagcggt 6780
attgacgttc atttcaactg gaagaaggaa ttccagttcc tctccttcag cctcgtcggg 6840
atcctcctct ggaatatgct tgaggattcg cgcagggact cctcccacca cagtacgagg 6900
aggaacatct tctcgaacga cagcaccagc cgcaattgtt gagccatctc caatcgtaac 6960
acccggcagg acagtcacat tcgcaccaat ccatacatta ttccccacct tgataggaag 7020
agcatacaca attctcctcg cacgtttctc ggggctaata ggatgagtcg cagtcacgaa 7080
cgttgtattg ggccctacaa tcacctcatc accaaagatt attggagccg agtccaagaa 7140
gcaaacgttg aagttggcgt aaaagtgctc gcctacgctg atgttgaatc caaaatcaac 7200
tgagaatgga gcggtcagcc agacaatatc ctttgtttga ccaaaagtgt ctttgagaat 7260
ctcgaccttc ttgatataag cagcgtgatt tgactcaaaa gtacgacttt cacttgcaat 7320
ggtattgaac tccctaactt tctcactagt agccagggct ctaaacataa gatctggatc 7380
gtatggattg taaggaactc ctgagaccat cttctcatag ttttcattgc caggggtgtt 7440
tttgaggttt tttttggccc aagagaccat ttcctggtca atttcttttc taggagtcat 7500
tcctttgttt tgagggtcct tcgaggagtt tacaaccatt gaattctaga atgtgaggtg 7560
gaatgaggca aggaaggagg aacgtattga gttgtacctt aagatatctc aaagtgctta 7620
tctccgacta ccggaatatg ctccgggtaa tgcaagtcag tgtgcatatg ggtaaggtga 7680
tgcaagctaa ccctcagggc atatctaatt cgcgtgaggg ttattattgg tctacattac 7740
ctcagtcata gcccgtcaaa gcaaaagccc aaaatcagca cgaaatccca gagatagatt 7800
gttgctgtct cttcaagtac tacgacagtt ccctatatct acagattatc gtcacgagtg 7860
aattatgcag gataggtgac tcaggggtca taatcagagg aatccaatgt gctatttcaa 7920
ttaacgagtc cctttaatca gacaatgtat ggtgactcag gggccataac tagagaaatt 7980
cgatatgcta tttcaattaa tgagtgcctt taatcaaata atgtatgcaa gcagtggcca 8040
aaaataaatg aacgtcaaat ctctccgaga ccttgcaagt tcaccaattc agcgtaccat 8100
ccattgagtt caaggaggct ctgatggtcg ccctgctcca cgatgcgccc tcctgagaac 8160
acatatatga catctgcttt ctgaattgtt gataatctat gcgcaacggc gattgtagta 8220
cggcccttcg ctgctgcgtc gagtgctgct tgaactactt tctcagattc ggaatccaga 8280
gctgaggtgg cctcatcgag gaggagtacc tttggatttc tgatcagggc ccttgcaatt 8340
gcaattcgct gcttttgccc cccagatagc aacgatcccc tagatccgct gagcgtttcg 8400
tagccatcag gcaacgacat gatgaattcg tgaatgttcg ctttgcgagc ggcatcctca 8460
atcatctcct gcgttacttc agactcaggg ccagaccatc ccattagaat attctcacgt 8520
agcgtgcctg aataaagcat tggttcttgc tggactaaag caatgtgtga tctcaatgca 8580
ttcaggttat attcgcgtaa atctttccca tcgaaaagta cttgacctgc taatggatca 8640
taaaatcttt ccaccagtcc aatagtagta gacttaccgc atccactggc tccaactaga 8700
gcgatgtatt ggcccttttt gactgttaag ttgagatctt gtaaaactgg tacttgaggt 8760
cgagtaggat atcggaaatt cacatgacgg aactcaatat ctcctctcac cgactcctcg 8820
ggagcaacgt aaccttcctc actccataca tctatagaag gagtggcagt caagattctg 8880
taaatgttac gcgctgcatc tttggctgag ttcatgtttg gagcatagct gaaaatttgg 8940
ccagcggctt gagaacctgt aataatagcc atgaagacag tcatatatcc tgcgaccgaa 9000
gcttcacctc gtctcattac agtgcttccc caccaaaaaa cgagggctac cacccagggt 9060
gtcattcctt ccgagagtgc gtagtacaat gctgagcggg caatggcaat tctggagctg 9120
aaaatctgag agtctactgt ctttgtgtat tttacgacca cgtctaactc acgagttaag 9180
gactggactg tgcggacagc acttgtatac tcagatgcca tggagccact tcgttcgtaa 9240
acttctctcg cacgatccga taattgggta agaacccaga ctctgacgaa gccacacacc 9300
aacatgacag gaacaacaga cgtagccacg agtccaattc tccaattgaa aggtatacca 9360
gtaactatgc cgccaatcaa ggtcaccaga ctctgttgaa tttgaccgag ggtggcccca 9420
ctcaaaccct cgatcatttt agcttccttc gccaaaattg aggttagcgc acccggcgtg 9480
ttgtttttgt ggtcgaagaa tgcaatatcc attcgcatca attggcggaa caaagctaat 9540
ctgatatttt tgaccaactt atcagatgca agtgataaag cagctatagt gataaaagcc 9600
gtcatgaatg aaatgcagcc tacgaaaaaa taccaccatc ccatgatatt caccacatgc 9660
cgcatttttc cgtattcact gggaggtaga accatgcttc cagtggtttg gccagttatt 9720
attgccattg caggatagca atagcccaaa ataatggagg ctaaactacc aatgagaatg 9780
taaccccatt ctttcctatt cagcccccaa accagtttgg tattggtcat caacgtgcta 9840
tgtggggggt tgcgcacacc agggatgtca ttttcttgat attcaggagg ttgagtggtc 9900
tgagtacctg cactgtgaac actcaatgtg ctcacatcct tgggattgaa cttttcgttc 9960
agtgagtcca gaggcgaaat gtctagagct tcaatatcga ggacctcaac gttagtgctc 10020
tttgctttag ttactctttg agcatcaacc aaagctttat aaggcccttc tcgctgtatg 10080
agctcattgt gagtaccctg ctctatgacg ttacctttag acatgacaac tatcttgttg 10140
gcatccttga tcgtagagag tctgtgtgca acgactatag tggtacgacc ttcggccgct 10200
ttgtcgagcg catcttgaac gataccttca gatttggtat ccagagcaga agtcgcttca 10260
tcgagcagca gaattttagg gtctgagacg attgctcttg ctattgcaat gcgttgtttc 10320
tgaccaccgc tgagaagaaa tcctcgatct ccaacattgg tttggatgcc ttctgagaga 10380
gtctgaatga aatcccaggc attggcatct ttacaagctt gaatgatttt agcttcctta 10440
acatgctcgt cagcgaactc aatgtcagtg ccaatcaaac catagctgat attctcatat 10500
attgactctg aaaagagtac tggttcctgc tgaacataac caatttgttg acggagccat 10560
cttgtgttca ggtcgctaat ctcctggcca tccagagtaa cgcttccttc gagaggtaaa 10620
tagaacctct caagaatacc tacaattgta gacttccctg atcccgaggc acctaccagt 10680
gccacagtag atccagcagg aacttcaagg ctaaaatcgg agaggaccaa aacgtctggg 10740
cgactaggat atcggaactt gacatttttg agctcaattc tgccaacggc cttagtttgg 10800
gggacaattc ctttatctat ggactggcca tcgatgactg ggacacgatc aatggcctca 10860
ttgagaatgc tcgcggcagt gagacccttg acaagaaacc tcacgtttgg cgcgatattc 10920
ccaagctgga agcttccaag taacatagct gtgattacaa ctattatctt tccaacgtca 10980
gcactcccac taacgatttc tctggaaccc tgccacagag ctaaggcata cacccaaaaa 11040
gtactagccc atatgcacgc taacatgacc cccaatgagt aactgctccg cttcgattcc 11100
ttcacaacac gatcaagtac cttttcatac ttgacggcga gatgaggttg agcgccaaat 11160
gctactgtag tcctgacagc actgagagcc tcctccgcaa cggtagctcc agactgcgaa 11220
tatatcgcgt cagatctgag ctgatatttg gccatgaagg tggcgccagt tcccattgtg 11280
attaccatga accctacagc actcaggagg atgcaagcca gtttccattg cgaagcaaaa 11340
cttataacgg tggccgcaat gaaggaagct attccctgta cgacgtttcc aagcttgtcg 11400
ctgatcgctt cctgaattga gttggtatcg ttaatgattc tggtgctgac ctcgccacca 11460
cctagtttgt cgtaaaacgc gatattctgg cgaataacag cactcagata atgctttcgg 11520
taacgtcctg ccaacacttc gcctctgtcc acaagcagga agctctcgag aaacgcactg 11580
ccgagcatac caatgccaat atagacaaaa tagagagaca ggtgattcac cttatgctgg 11640
aactcattgc ccttgaggtc atatgaagtg aagtctctga atgtgttgaa gatggcgccc 11700
actactaacg tgaacattgg aagcgcggct ccatgcaccg ctgcaaaaaa aagcgcaagt 11760
atctccaaga aaacgtcaag gggagtgcaa aatctgaaca acctgaaaaa gcttgtggcg 11820
actctctttg tttcaagctg acttcgcaat acattggcct catgtggatc taacgcagag 11880
agcttctcct cgagaagctt gtccttagtc tcgatgagtt tctcacgctt ctctacctgt 11940
atatcatcca ccataagcca aaatcagaga gtgggacctg attcagaatc acacggaccc 12000
gtatatataa caatcacttt ccaacaatat agcgagtatt aatatatttc cgggtaaggg 12060
ttgttccgga cttatgcatt taatcacagg ttgcatcagc taaatatgtc agggccgacg 12120
gcgtaaattt agaaggttag gtcaagatcc atcggtcagg ccaatggagc tctactatga 12180
taggcagctg aagcgagaca agatatactt cagttgcgct ctctgaaaaa attattttgt 12240
gattctcact cagtggatgt ggcgacacac ggaaccaata atctcgccgg aaaggcggct 12300
gaacatcagt cttgcataag tgtgcaagtg gcctgagcac agcgtgcatt acccttacca 12360
tacattcggg gcaagttaaa tccagcatta tataaacttg attgacacaa atgggcataa 12420
aacaataaag tctcctatat ggccatcgag aaaccagtga tagttgcttg tgcctgccca 12480
ctagcggggc acgtgggccc agtgctcagc ctggtccgcg gtctactcaa tagaggatat 12540
gaggtgactt tcgtaacagg gaacgcattc aaggagaaag ttattgaggc aggatgcact 12600
ttcgtccctc tccaaggacg agctgactac catgaataca atctccctga aatcgctcca 12660
ggattgctca cgattcctcc aggccttgag cagaccggtt actcaatgaa tgagattttt 12720
gtgaaggcga ttcctgagca gtacgatgca cttcaaactg ctctaaaaca ggttgaggct 12780
gaaaataaat cagctgtggt gattggcgag accatgtttc taggggtgca tccgatatca 12840
ctgggtgccc caggtctcaa gccccaaggc gtaatcacgt taggaactat tccgtgcatg 12900
ctgaaagcag agaaggcgcc tggagttcct agtcttgagc caatgattga tactttagtg 12960
cggcaacaag tatttcaacc aggaactgac tctgagaagg agatcatgaa gacgctcggg 13020
gccacgaagg agcccgaatt tctcctggag aatatataca gcagccctga cagatttttg 13080
caactgtgcc ctccatctct tgaatttcac ttgacttcgc ctcctcctgg cttctcgttc 13140
gctggtagtg caccgcatgt aaagtctgct ggattagcaa ctccacctca cctgccgtct 13200
tggtggcctg atgtgctgag tgcgaagcgt ctgattgttg ttacacaagg aacagcagcc 13260
atcaactatg aagatctgct cattccagca ttgcaggcct ttgctgacga agaagacact 13320
ctcgtagttg gtatattggg cgtcaaaggg gcgtcacttc ctgatagcgt taaagttcct 13380
gcaaacgctc gaattgttga ttattttcct tacgatgagc tactaccgca tgcctctgtt 13440
ttcatataca acggtggata cggaggtctg cagcacagtt tgagccatgg cgttcccgtc 13500
atcatcggag gaggaatgtt ggtagacaag ccagctgttg cttcacgagc tgtatgggct 13560
ggtgttggtt atgatcttca aaccttgcag gcaacttctg agctagtctc cacggccgtt 13620
aaggaggtgt tggctactcc ctcgtatcac gagaaagcca tggcagtcaa gaaagagctt 13680
gaaaaataca agtctcttga tattctagag tcggcaatta gtgaattagc ttcttaacct 13740
ggctcttttt ctagatatgt ctgcgccctg ctcactgctt actggcctaa gctggtatta 13800
cggaccttaa tcaagtatca ccccaaggca atcgagagtc ttatcgagtc tctaggtaga 13860
tagatacacg ttttgatttt tcggcccact ttgtagaaaa atctcagtga tttcatggaa 13920
ttcagttaca aatactaatc tgataaacca agaactacac tcggtgttga gagcagaatt 13980
aaagggactt ggcgtctagc acaaaacgat acttgacgtc accactgtga acgcgcttcc 14040
aagcttcggc gatatagctg tactcaatca gctcaacatc acaggtgatg ttattttcac 14100
cacagaagtc cagcatctcc tgagtctctg gcaagccacc aatgtttgag taagtgatag 14160
atttatttcc agccaaatga gaggtcagaa ccttgagggg tccaatttga ccaacaacaa 14220
cgagacaccc accaatatca agggacttga ggtatggctc gaagtcgtgt tcaaagggaa 14280
tggtgtcgat gatcaggtca aatgtgccag cgaccgcctc gagctcattc ggatcagagg 14340
aagcaactac gcggctagca ccttgtgctt tcgctcctgc ggctttggcg tgactcctgc 14400
tgaacagtgt gacttcagag cccatggctg aggcaaattt gatagccatg gaaccaaggc 14460
ctccgagacc aactacaccg actctttttc caggtccggc gccgtgagcc ctcagaggag 14520
agtaggtagt gataccagca cagagaaggg gcgcagaagc tgccaagtcg aggttggagg 14580
ggattttgag cacaaactcc tcgcgagcaa gaatgtgttg cgaataccct cccttcgtga 14640
cttccccgtt ctttccgctg gaattgtaag tttgagtgcg tgaaacacac caattttctt 14700
tgcctaattt acagttcttg caagtacgac atgagtccac taagcagcca attccaacaa 14760
tgtcgccagc ttggaacttc ttgacggccg ggccgacagc agtggccctt ccaataatct 14820
catgcccacc aacaaaggga aattttgcat tgttccagtc gttgtgcgct gtatggagtt 14880
cactgtgaca aattccacaa taaaggatct cgatgcttac gtcgttgggt cggggatcgc 14940
gacgctcaat agtgccagga actgggtcgc tagttgtatc gtggactatg taggccttgc 15000
aagttgaagg catcgtgaat tttgactgat ccgagcgcag tactctacgt ttagcttgaa 15060
gtcgggagaa gggtccggat tagaagataa gcggcatcct gtgacaagca gtaaaaaaat 15120
gcacccaaaa taaaagttgt gctaaggacc aagagttaga ttaaattcac tacctgatta 15180
tgagctgttt agttttagaa ctttgttgct aaacaattat acgtggctat acaacctacc 15240
caaaatttac aacgccgctt agctaatgac tacgcaaccc tactggatta ggctagggct 15300
ccgagatagc gaaacgtggg gtagcgggcg acaggtcata tagagcccct accctactcg 15360
gtgcaggtta ccgacggacg acatttggag tagtgatttt gactttccaa agatggaatt 15420
tcctctgtag tgaaagatta ctgtatatat ttattggtcg catcgcttgc tcagtttgtg 15480
atccaaccca gggttaatag tggtttaagc tgaactgcgg tgggaagccc agccggtgaa 15540
aggagctttc tggagagcat acggcactaa tgagagcctc tgacaggctg cattcctttt 15600
cccgcacgta cctgatatcc catcatgcgg gaccaggtta gggagtgggt tcagggttta 15660
gatagtggag ctcattggta gctcaccagc gagctctgag tagatggctg tgtcacacat 15720
tgaggcagaa gtttttctgt ctgaagtact gaagatttct tgctttggca acagtaatgg 15780
ggccaggtcc gaaggctcgg caaacttaag ctcgaaatta gatgagcgta agattcactt 15840
aacaacaaat tcgcgaagtc ctaggaagcg cgactgacag aggagtgttt cgttcaacaa 15900
tttcgcgaag gattgcacta ctcaccaact catattaatt cagctaatgt ttctaatttt 15960
caaaactagt acggaagtct gcagttagac agctcttgcg tttgaagaac ttaggcgcga 16020
gatttctcag ctgtatctac acgtcttggg tcgacgcagc tgttggagcg aaccaacgca 16080
caactaacaa caaatcaagt agactaggga tacaagatta aaatcatacg taaagcatca 16140
tttatcatta ttgacaggca ctcaacaagc acaacggctc ggagatgaaa gcacactgct 16200
ctctgcattt taaaagggac atctagatga ggagggcagc agcagcaata gcaccgacag 16260
caacagggac ttggaggacc gaagcagcat taggggcagc tgacgcagtg cccttgctag 16320
agccagaagc cttaggagtg ccagaactct tagagttgcc agaagcagaa gatttgccgg 16380
atgcgctagc atcagcagca gaactcagag aagatgagga accggagtca gtggaggtcg 16440
attttatggg agtgaacttg tagagcatgt tcttagaact cttgtcagtg acaaagacgt 16500
ctccattggg ggcaacctcg atgtggttgg gagttgtgac gttgagctga gtgataatac 16560
tatagtcttc aggatcaata acaaccacgg agtggcccgc acggcaggca acgtaaacaa 16620
cgtcataaac gggatcgtaa cgagcgttga gaggacgtcc aggcatatcg atgctcttca 16680
caaccttgcc ggacttgggg ttgacaataa cagtattgtt ggagccttgg ttcgtaacga 16740
aaagttgctc acgtcgggag tcccaagcaa caccacttga aaacttgaca ttgtccccga 16800
gatcgaagga tttgacagag tagtcgttga ggtcgatggc tgcggcaagg ggctgtttca 16860
aagccaccgt gtagagtact ttgttgacct cgtcagcaac aagactcata ctgctggaga 16920
aattcttgcc gagagactca gatatattga tactcttggc ggatttgtca gtggtgctag 16980
catcgaatac ggctatgaca ctagacctcg cagaagagac gtaagcaagc ccagtgctct 17040
ggtcaacata gacatcacgc ggatgcggct gaatgtcatc ggggtactga acaccaaggc 17100
tgaggtcctt accattataa taggaaacag tgccctggcg ggtgttggta acccaaacac 17160
ggttgttatc gtagtcgcta tcaacgccat aaactgcgta gcgttgggta acatttccag 17220
tggtaccgat ggcaggctga acgtccctga caacagccag gctcttaggg tcaacctcga 17280
taaggtcgga ctggttcaca gggggacgac caacagagtt ggtaaggaaa agcctgtcat 17340
tggttctgtc ataagtgctt tggtagagac cgccgtactt actaaagtca gcgctttgag 17400
tctcgtaaga gagggtgcga gcatcaatcc cgacggcgag gagaagaaca gcaagagagt 17460
ggatagcaat cattagagct cagtaaaaac gctgttatgg tcaaaataac atttgtgaga 17520
tagtttccct atttatattt ctcgagaaag agccgtttgc gaaaatgggc gccaggcata 17580
attggccaag ggtaaatatg ggtcagggta tctttgggct cgggcggatt ctgcagatgg 17640
cccagagaga ttttcatcat cgaggcaagt tcaaagctcg aaactggcca cattgagcac 17700
cgtggtaaag attgaacgac tatatagtga tttcaattat gtcctgcatt agggcttggt 17760
tttttttctg actgcagcag tgcctattga ggaattcgca atgagagagc cctacggtct 17820
gtgctagatg taaaagatac gatcgagact tagatgcatc taccccagcc cttaccatct 17880
tatatgaggt tgagagattt atttttgttt ttagagatga ttcttcagca aaccagaagg 17940
gaatccggaa ggagttaggg ttaatgatcc agttagtgtt tgtagatatt atccagctcg 18000
tagatgagaa gcg 18013
<210> 2
<211> 1617
<212> DNA
<213> 假丝酵母菌
<400> 2
atgttaatca aagacattat tctaactcca atgagtttat ccgctgttgc tggcttgttg 60
ccactgctct tcgtagcttt cttagttcta cacgagccta tctggctcct atggtaccgc 120
tatgcagcac gtaggcacaa gtgtagtatg cctcgcttca ttgagaaatc gttcccactg 180
ggaatacaaa gaaccatgga catgatcaag acggccaagt catacacctt actggaagtt 240
caatacgaca gagtcttcaa taagttcaaa gcacggacgt atcttcgaca agctcccctt 300
caataccaaa tcttcacaat cgagccagaa aacattaaga caatcctggc aaccaaattc 360
aatgattttg gtcttggagc acgtttccac acagtgggaa aagtgtttgg ccaagggata 420
tttacactca gcggaaatgg atggaaacag tctcgatcga tgttgagacc tcagttcact 480
aaagatcagg tttgcagaat tgatcagatt tccagtcatg ctgcggagtt aataaaggag 540
atgaaccgtg caatgaaagt ggaccaattt attgatgttc aacattattt ccacaaactt 600
acgctggata cagcgactga attcctattt ggggagtcct gcgagagctt gaaccctgag 660
aatcagtcat gtattgtagc ccgtgatggt tcggagatta ctgccgaaca attcgtggag 720
tcctacaact ttctactgaa ttacgctttc aaacggaccc tatcaagcaa agtctactgg 780
ttgttcaact ctaaggaatt ccgagatcac aagaaacgtg ctcagtccta tattgactac 840
tacgttgata aggctcttta cgccacatct ttcgctgctg agaactctat tgcagagaag 900
gatgctgctg cagagtctag tggcatctat gtgttctcgc ttgagatggc taaagttacc 960
cgagacccag tgacgatacg tgatcaaatt ttcaacattc tcattgctgg tagagataca 1020
acagctgcta cgttgagctt cgctattcat ttccttgcca gaaatcctga cgtattcaac 1080
aaactacgtg aggaggtcct cgatcatttt ggaaccaagg aggagcaaag gcctttatca 1140
ttcgaacttc tgaagcaagc accttatttg aagcaagtta taaatgaagt cttgcgtctt 1200
gcgccggtat tgccattgaa cttccgtact gctgtgagag atacaactct acccataggt 1260
ggtggtcccg agcagaagga tccgatcttc gttcctaagg gcaccgcagt ttactattca 1320
atttacatgg tccacaggga catcaagtat tggggtcctg acgcccacga attcaatccc 1380
aatcgatggg agaacttgaa gctagataat gtgtgggcat tcttgccctt caatggcggt 1440
ccccgaattt gtctcggcca acaattcgcc ctgacagagc tttcgctaac tctggtgaga 1500
ctcttacagg agtattccaa gattgagatg ggtcccgact tcccagagtc ccctcgtttc 1560
tcaacaacgc ttacagctca acacgctcct cccggtgtgg ttgtgcggtt ttcttaa 1617
<210> 3
<211> 1392
<212> DNA
<213> 假丝酵母菌
<400> 3
atgagccctt catcacacaa acccctgatt ctcgcttgcg gcttgcctct ttcaggccat 60
ataatgcccg ttttgagtct ggtacacggc cttacggacg acggatacga agctactgtt 120
gtgacaggca gagcgtttga acaaaaagtt cgagatgtgg gtgcagactt tgttccttta 180
gaagggaacg cagattttga tgaccacacc ttagacgatc tggtcccggg ccgtaaagac 240
atggccccaa gcttcgatcg tacagttcaa gatgtggagc acatgatggt agctactctt 300
cctgagcagt ttgccgctat tcagagggct ttcaaaaagc tcagcgcaag cggccgccct 360
gtcgttcttg tcagtgaagt gctgtttttc ggtgcacacc ctatcagcct cggtgctcct 420
ggtttcaaac ccgctggctg gatttgttta ggggttttgc ctcttttgat ccgcagtgat 480
cataccttag gacttgacaa cgacaggagc cccgaagcac atgcaaagaa actcgctatg 540
aaccacgctc ttgagcacca aattttcgtt aaagccactg ctaagcacaa ggaaatctgc 600
cgagagttag gttgcactga agatcccaaa tttatctggg agcacagtta cattgctgca 660
gacaagttcc tgcagctgtg cccgccttct cttgagttca gcagagacca tctgcctagc 720
aacttcaaat tcgccggctc aacgcccaag caccgaactc aattcacccc tccttcctgg 780
tggggggatg ttctgagtgc caagcgagtc atcatggtca ctcaaggaac ttttgctgtc 840
agttacaagc atcttattgt gcctactctt gaggccttga aggacgagcc tgacacttta 900
acagtagcca tattgggccg ccgcggtgcc aagctaccgg atgatgttgt ggttcctgag 960
aatgctcgcg tgatcgacta cttcaactac gatgctctac ttcctcacgt tgatgctctt 1020
gtctacaatg gtggatatgg cggacttcag cacagcttaa gccactctgt tccagttgtt 1080
attgctggtg actctgaaga caagccaatg gtggcatcga gagctgaggc cgctggcgtg 1140
gcaattgatt tgaaaactgg cttgcctaca gtggagcaaa tcaaagaagc tgttgattcg 1200
ataattggaa atccgaaatt ccacgaagcc tcgaagaagg ttcaaatgga gttggaaagc 1260
cacaactcct tgaaaattct tgaggaaagc atcgaggaaa tcgccagcca tgactttggt 1320
cttttgacca agagtgacga ggaaactgaa gatatacctg tcaaagggcc ggccttagcg 1380
gtgagttctt ag 1392
<210> 4
<211> 780
<212> DNA
<213> 假丝酵母菌
<400> 4
atggttgtaa actcctcgaa ggaccctcaa aacaaaggaa tgactcctag aaaagaaatt 60
gaccaggaaa tggtctcttg ggccaaaaaa aacctcaaaa acacccctgg caatgaaaac 120
tatgagaaga tggtctcagg agttccttac aatccatacg atccagatct tatgtttaga 180
gccctggcta ctagtgagaa agttagggag ttcaatacca ttgcaagtga aagtcgtact 240
tttgagtcaa atcacgctgc ttatatcaag aaggtcgaga ttctcaaaga cacttttggt 300
caaacaaagg atattgtctg gctgaccgct ccattctcag ttgattttgg attcaacatc 360
agcgtaggcg agcactttta cgccaacttc aacgtttgct tcttggactc ggctccaata 420
atctttggtg atgaggtgat tgtagggccc aatacaacgt tcgtgactgc gactcatcct 480
attagccccg agaaacgtgc gaggagaatt gtgtatgctc ttcctatcaa ggtggggaat 540
aatgtatgga ttggtgcgaa tgtgactgtc ctgccgggtg ttacgattgg agatggctca 600
acaattgcgg ctggtgctgt cgttcgagaa gatgttcctc ctcgtactgt ggtgggagga 660
gtccctgcgc gaatcctcaa gcatattcca gaggaggatc ccgacgaggc tgaaggagag 720
gaactggaat tccttcttcc agttgaaatg aacgtcaata ccgctaacca gaaggtctag 780
<210> 5
<211> 3900
<212> DNA
<213> 假丝酵母菌
<400> 5
atggtggatg atatacaggt agagaagcgt gagaaactca tcgagactaa ggacaagctt 60
ctcgaggaga agctctctgc gttagatcca catgaggcca atgtattgcg aagtcagctt 120
gaaacaaaga gagtcgccac aagctttttc aggttgttca gattttgcac tccccttgac 180
gttttcttgg agatacttgc gctttttttt gcagcggtgc atggagccgc gcttccaatg 240
ttcacgttag tagtgggcgc catcttcaac acattcagag acttcacttc atatgacctc 300
aagggcaatg agttccagca taaggtgaat cacctgtctc tctattttgt ctatattggc 360
attggtatgc tcggcagtgc gtttctcgag agcttcctgc ttgtggacag aggcgaagtg 420
ttggcaggac gttaccgaaa gcattatctg agtgctgtta ttcgccagaa tatcgcgttt 480
tacgacaaac taggtggtgg cgaggtcagc accagaatca ttaacgatac caactcaatt 540
caggaagcga tcagcgacaa gcttggaaac gtcgtacagg gaatagcttc cttcattgcg 600
gccaccgtta taagttttgc ttcgcaatgg aaactggctt gcatcctcct gagtgctgta 660
gggttcatgg taatcacaat gggaactggc gccaccttca tggccaaata tcagctcaga 720
tctgacgcga tatattcgca gtctggagct accgttgcgg aggaggctct cagtgctgtc 780
aggactacag tagcatttgg cgctcaacct catctcgccg tcaagtatga aaaggtactt 840
gatcgtgttg tgaaggaatc gaagcggagc agttactcat tgggggtcat gttagcgtgc 900
atatgggcta gtactttttg ggtgtatgcc ttagctctgt ggcagggttc cagagaaatc 960
gttagtggga gtgctgacgt tggaaagata atagttgtaa tcacagctat gttacttgga 1020
agcttccagc ttgggaatat cgcgccaaac gtgaggtttc ttgtcaaggg tctcactgcc 1080
gcgagcattc tcaatgaggc cattgatcgt gtcccagtca tcgatggcca gtccatagat 1140
aaaggaattg tcccccaaac taaggccgtt ggcagaattg agctcaaaaa tgtcaagttc 1200
cgatatccta gtcgcccaga cgttttggtc ctctccgatt ttagccttga agttcctgct 1260
ggatctactg tggcactggt aggtgcctcg ggatcaggga agtctacaat tgtaggtatt 1320
cttgagaggt tctatttacc tctcgaagga agcgttactc tggatggcca ggagattagc 1380
gacctgaaca caagatggct ccgtcaacaa attggttatg ttcagcagga accagtactc 1440
ttttcagagt caatatatga gaatatcagc tatggtttga ttggcactga cattgagttc 1500
gctgacgagc atgttaagga agctaaaatc attcaagctt gtaaagatgc caatgcctgg 1560
gatttcattc agactctctc agaaggcatc caaaccaatg ttggagatcg aggatttctt 1620
ctcagcggtg gtcagaaaca acgcattgca atagcaagag caatcgtctc agaccctaaa 1680
attctgctgc tcgatgaagc gacttctgct ctggatacca aatctgaagg tatcgttcaa 1740
gatgcgctcg acaaagcggc cgaaggtcgt accactatag tcgttgcaca cagactctct 1800
acgatcaagg atgccaacaa gatagttgtc atgtctaaag gtaacgtcat agagcagggt 1860
actcacaatg agctcataca gcgagaaggg ccttataaag ctttggttga tgctcaaaga 1920
gtaactaaag caaagagcac taacgttgag gtcctcgata ttgaagctct agacatttcg 1980
cctctggact cactgaacga aaagttcaat cccaaggatg tgagcacatt gagtgttcac 2040
agtgcaggta ctcagaccac tcaacctcct gaatatcaag aaaatgacat ccctggtgtg 2100
cgcaaccccc cacatagcac gttgatgacc aataccaaac tggtttgggg gctgaatagg 2160
aaagaatggg gttacattct cattggtagt ttagcctcca ttattttggg ctattgctat 2220
cctgcaatgg caataataac tggccaaacc actggaagca tggttctacc tcccagtgaa 2280
tacggaaaaa tgcggcatgt ggtgaatatc atgggatggt ggtatttttt cgtaggctgc 2340
atttcattca tgacggcttt tatcactata gctgctttat cacttgcatc tgataagttg 2400
gtcaaaaata tcagattagc tttgttccgc caattgatgc gaatggatat tgcattcttc 2460
gaccacaaaa acaacacgcc gggtgcgcta acctcaattt tggcgaagga agctaaaatg 2520
atcgagggtt tgagtggggc caccctcggt caaattcaac agagtctggt gaccttgatt 2580
ggcggcatag ttactggtat acctttcaat tggagaattg gactcgtggc tacgtctgtt 2640
gttcctgtca tgttggtgtg tggcttcgtc agagtctggg ttcttaccca attatcggat 2700
cgtgcgagag aagtttacga acgaagtggc tccatggcat ctgagtatac aagtgctgtc 2760
cgcacagtcc agtccttaac tcgtgagtta gacgtggtcg taaaatacac aaagacagta 2820
gactctcaga ttttcagctc cagaattgcc attgcccgct cagcattgta ctacgcactc 2880
tcggaaggaa tgacaccctg ggtggtagcc ctcgtttttt ggtggggaag cactgtaatg 2940
agacgaggtg aagcttcggt cgcaggatat atgactgtct tcatggctat tattacaggt 3000
tctcaagccg ctggccaaat tttcagctat gctccaaaca tgaactcagc caaagatgca 3060
gcgcgtaaca tttacagaat cttgactgcc actccttcta tagatgtatg gagtgaggaa 3120
ggttacgttg ctcccgagga gtcggtgaga ggagatattg agttccgtca tgtgaatttc 3180
cgatatccta ctcgacctca agtaccagtt ttacaagatc tcaacttaac agtcaaaaag 3240
ggccaataca tcgctctagt tggagccagt ggatgcggta agtctactac tattggactg 3300
gtggaaagat tttatgatcc attagcaggt caagtacttt tcgatgggaa agatttacgc 3360
gaatataacc tgaatgcatt gagatcacac attgctttag tccagcaaga accaatgctt 3420
tattcaggca cgctacgtga gaatattcta atgggatggt ctggccctga gtctgaagta 3480
acgcaggaga tgattgagga tgccgctcgc aaagcgaaca ttcacgaatt catcatgtcg 3540
ttgcctgatg gctacgaaac gctcagcgga tctaggggat cgttgctatc tggggggcaa 3600
aagcagcgaa ttgcaattgc aagggccctg atcagaaatc caaaggtact cctcctcgat 3660
gaggccacct cagctctgga ttccgaatct gagaaagtag ttcaagcagc actcgacgca 3720
gcagcgaagg gccgtactac aatcgccgtt gcgcatagat tatcaacaat tcagaaagca 3780
gatgtcatat atgtgttctc aggagggcgc atcgtggagc agggcgacca tcagagcctc 3840
cttgaactca atggatggta cgctgaattg gtgaacttgc aaggtctcgg agagatttga 3900
<210> 6
<211> 1299
<212> DNA
<213> 假丝酵母菌
<400> 6
atggccatcg agaaaccagt gatagttgct tgtgcctgcc cactagcggg gcacgtgggc 60
ccagtgctca gcctggtccg cggtctactc aatagaggat atgaggtgac tttcgtaaca 120
gggaacgcat tcaaggagaa agttattgag gcaggatgca ctttcgtccc tctccaagga 180
cgagctgact accatgaata caatctccct gaaatcgctc caggattgct cacgattcct 240
ccaggccttg agcagaccgg ttactcaatg aatgagattt ttgtgaaggc gattcctgag 300
cagtacgatg cacttcaaac tgctctaaaa caggttgagg ctgaaaataa atcagctgtg 360
gtgattggcg agaccatgtt tctaggggtg catccgatat cactgggtgc cccaggtctc 420
aagccccaag gcgtaatcac gttaggaact attccgtgca tgctgaaagc agagaaggcg 480
cctggagttc ctagtcttga gccaatgatt gatactttag tgcggcaaca agtatttcaa 540
ccaggaactg actctgagaa ggagatcatg aagacgctcg gggccacgaa ggagcccgaa 600
tttctcctgg agaatatata cagcagccct gacagatttt tgcaactgtg ccctccatct 660
cttgaatttc acttgacttc gcctcctcct ggcttctcgt tcgctggtag tgcaccgcat 720
gtaaagtctg ctggattagc aactccacct cacctgccgt cttggtggcc tgatgtgctg 780
agtgcgaagc gtctgattgt tgttacacaa ggaacagcag ccatcaacta tgaagatctg 840
ctcattccag cattgcaggc ctttgctgac gaagaagaca ctctcgtagt tggtatattg 900
ggcgtcaaag gggcgtcact tcctgatagc gttaaagttc ctgcaaacgc tcgaattgtt 960
gattattttc cttacgatga gctactaccg catgcctctg ttttcatata caacggtgga 1020
tacggaggtc tgcagcacag tttgagccat ggcgttcccg tcatcatcgg aggaggaatg 1080
ttggtagaca agccagctgt tgcttcacga gctgtatggg ctggtgttgg ttatgatctt 1140
caaaccttgc aggcaacttc tgagctagtc tccacggccg ttaaggaggt gttggctact 1200
ccctcgtatc acgagaaagc catggcagtc aagaaagagc ttgaaaaata caagtctctt 1260
gatattctag agtcggcaat tagtgaatta gcttcttaa 1299
<210> 7
<211> 538
<212> PRT
<213> 假丝酵母菌
<400> 7
Met Leu Ile Lys Asp Ile Ile Leu Thr Pro Met Ser Leu Ser Ala Val
1 5 10 15
Ala Gly Leu Leu Pro Leu Leu Phe Val Ala Phe Leu Val Leu His Glu
20 25 30
Pro Ile Trp Leu Leu Trp Tyr Arg Tyr Ala Ala Arg Arg His Lys Cys
35 40 45
Ser Met Pro Arg Phe Ile Glu Lys Ser Phe Pro Leu Gly Ile Gln Arg
50 55 60
Thr Met Asp Met Ile Lys Thr Ala Lys Ser Tyr Thr Leu Leu Glu Val
65 70 75 80
Gln Tyr Asp Arg Val Phe Asn Lys Phe Lys Ala Arg Thr Tyr Leu Arg
85 90 95
Gln Ala Pro Leu Gln Tyr Gln Ile Phe Thr Ile Glu Pro Glu Asn Ile
100 105 110
Lys Thr Ile Leu Ala Thr Lys Phe Asn Asp Phe Gly Leu Gly Ala Arg
115 120 125
Phe His Thr Val Gly Lys Val Phe Gly Gln Gly Ile Phe Thr Leu Ser
130 135 140
Gly Asn Gly Trp Lys Gln Ser Arg Ser Met Leu Arg Pro Gln Phe Thr
145 150 155 160
Lys Asp Gln Val Cys Arg Ile Asp Gln Ile Ser Ser His Ala Ala Glu
165 170 175
Leu Ile Lys Glu Met Asn Arg Ala Met Lys Val Asp Gln Phe Ile Asp
180 185 190
Val Gln His Tyr Phe His Lys Leu Thr Leu Asp Thr Ala Thr Glu Phe
195 200 205
Leu Phe Gly Glu Ser Cys Glu Ser Leu Asn Pro Glu Asn Gln Ser Cys
210 215 220
Ile Val Ala Arg Asp Gly Ser Glu Ile Thr Ala Glu Gln Phe Val Glu
225 230 235 240
Ser Tyr Asn Phe Leu Leu Asn Tyr Ala Phe Lys Arg Thr Leu Ser Ser
245 250 255
Lys Val Tyr Trp Leu Phe Asn Ser Lys Glu Phe Arg Asp His Lys Lys
260 265 270
Arg Ala Gln Ser Tyr Ile Asp Tyr Tyr Val Asp Lys Ala Leu Tyr Ala
275 280 285
Thr Ser Phe Ala Ala Glu Asn Ser Ile Ala Glu Lys Asp Ala Ala Ala
290 295 300
Glu Ser Ser Gly Ile Tyr Val Phe Ser Leu Glu Met Ala Lys Val Thr
305 310 315 320
Arg Asp Pro Val Thr Ile Arg Asp Gln Ile Phe Asn Ile Leu Ile Ala
325 330 335
Gly Arg Asp Thr Thr Ala Ala Thr Leu Ser Phe Ala Ile His Phe Leu
340 345 350
Ala Arg Asn Pro Asp Val Phe Asn Lys Leu Arg Glu Glu Val Leu Asp
355 360 365
His Phe Gly Thr Lys Glu Glu Gln Arg Pro Leu Ser Phe Glu Leu Leu
370 375 380
Lys Gln Ala Pro Tyr Leu Lys Gln Val Ile Asn Glu Val Leu Arg Leu
385 390 395 400
Ala Pro Val Leu Pro Leu Asn Phe Arg Thr Ala Val Arg Asp Thr Thr
405 410 415
Leu Pro Ile Gly Gly Gly Pro Glu Gln Lys Asp Pro Ile Phe Val Pro
420 425 430
Lys Gly Thr Ala Val Tyr Tyr Ser Ile Tyr Met Val His Arg Asp Ile
435 440 445
Lys Tyr Trp Gly Pro Asp Ala His Glu Phe Asn Pro Asn Arg Trp Glu
450 455 460
Asn Leu Lys Leu Asp Asn Val Trp Ala Phe Leu Pro Phe Asn Gly Gly
465 470 475 480
Pro Arg Ile Cys Leu Gly Gln Gln Phe Ala Leu Thr Glu Leu Ser Leu
485 490 495
Thr Leu Val Arg Leu Leu Gln Glu Tyr Ser Lys Ile Glu Met Gly Pro
500 505 510
Asp Phe Pro Glu Ser Pro Arg Phe Ser Thr Thr Leu Thr Ala Gln His
515 520 525
Ala Pro Pro Gly Val Val Val Arg Phe Ser
530 535
<210> 8
<211> 463
<212> PRT
<213> 假丝酵母菌
<400> 8
Met Ser Pro Ser Ser His Lys Pro Leu Ile Leu Ala Cys Gly Leu Pro
1 5 10 15
Leu Ser Gly His Ile Met Pro Val Leu Ser Leu Val His Gly Leu Thr
20 25 30
Asp Asp Gly Tyr Glu Ala Thr Val Val Thr Gly Arg Ala Phe Glu Gln
35 40 45
Lys Val Arg Asp Val Gly Ala Asp Phe Val Pro Leu Glu Gly Asn Ala
50 55 60
Asp Phe Asp Asp His Thr Leu Asp Asp Leu Val Pro Gly Arg Lys Asp
65 70 75 80
Met Ala Pro Ser Phe Asp Arg Thr Val Gln Asp Val Glu His Met Met
85 90 95
Val Ala Thr Leu Pro Glu Gln Phe Ala Ala Ile Gln Arg Ala Phe Lys
100 105 110
Lys Leu Ser Ala Ser Gly Arg Pro Val Val Leu Val Ser Glu Val Leu
115 120 125
Phe Phe Gly Ala His Pro Ile Ser Leu Gly Ala Pro Gly Phe Lys Pro
130 135 140
Ala Gly Trp Ile Cys Leu Gly Val Leu Pro Leu Leu Ile Arg Ser Asp
145 150 155 160
His Thr Leu Gly Leu Asp Asn Asp Arg Ser Pro Glu Ala His Ala Lys
165 170 175
Lys Leu Ala Met Asn His Ala Leu Glu His Gln Ile Phe Val Lys Ala
180 185 190
Thr Ala Lys His Lys Glu Ile Cys Arg Glu Leu Gly Cys Thr Glu Asp
195 200 205
Pro Lys Phe Ile Trp Glu His Ser Tyr Ile Ala Ala Asp Lys Phe Leu
210 215 220
Gln Leu Cys Pro Pro Ser Leu Glu Phe Ser Arg Asp His Leu Pro Ser
225 230 235 240
Asn Phe Lys Phe Ala Gly Ser Thr Pro Lys His Arg Thr Gln Phe Thr
245 250 255
Pro Pro Ser Trp Trp Gly Asp Val Leu Ser Ala Lys Arg Val Ile Met
260 265 270
Val Thr Gln Gly Thr Phe Ala Val Ser Tyr Lys His Leu Ile Val Pro
275 280 285
Thr Leu Glu Ala Leu Lys Asp Glu Pro Asp Thr Leu Thr Val Ala Ile
290 295 300
Leu Gly Arg Arg Gly Ala Lys Leu Pro Asp Asp Val Val Val Pro Glu
305 310 315 320
Asn Ala Arg Val Ile Asp Tyr Phe Asn Tyr Asp Ala Leu Leu Pro His
325 330 335
Val Asp Ala Leu Val Tyr Asn Gly Gly Tyr Gly Gly Leu Gln His Ser
340 345 350
Leu Ser His Ser Val Pro Val Val Ile Ala Gly Asp Ser Glu Asp Lys
355 360 365
Pro Met Val Ala Ser Arg Ala Glu Ala Ala Gly Val Ala Ile Asp Leu
370 375 380
Lys Thr Gly Leu Pro Thr Val Glu Gln Ile Lys Glu Ala Val Asp Ser
385 390 395 400
Ile Ile Gly Asn Pro Lys Phe His Glu Ala Ser Lys Lys Val Gln Met
405 410 415
Glu Leu Glu Ser His Asn Ser Leu Lys Ile Leu Glu Glu Ser Ile Glu
420 425 430
Glu Ile Ala Ser His Asp Phe Gly Leu Leu Thr Lys Ser Asp Glu Glu
435 440 445
Thr Glu Asp Ile Pro Val Lys Gly Pro Ala Leu Ala Val Ser Ser
450 455 460
<210> 9
<211> 259
<212> PRT
<213> 假丝酵母菌
<400> 9
Met Val Val Asn Ser Ser Lys Asp Pro Gln Asn Lys Gly Met Thr Pro
1 5 10 15
Arg Lys Glu Ile Asp Gln Glu Met Val Ser Trp Ala Lys Lys Asn Leu
20 25 30
Lys Asn Thr Pro Gly Asn Glu Asn Tyr Glu Lys Met Val Ser Gly Val
35 40 45
Pro Tyr Asn Pro Tyr Asp Pro Asp Leu Met Phe Arg Ala Leu Ala Thr
50 55 60
Ser Glu Lys Val Arg Glu Phe Asn Thr Ile Ala Ser Glu Ser Arg Thr
65 70 75 80
Phe Glu Ser Asn His Ala Ala Tyr Ile Lys Lys Val Glu Ile Leu Lys
85 90 95
Asp Thr Phe Gly Gln Thr Lys Asp Ile Val Trp Leu Thr Ala Pro Phe
100 105 110
Ser Val Asp Phe Gly Phe Asn Ile Ser Val Gly Glu His Phe Tyr Ala
115 120 125
Asn Phe Asn Val Cys Phe Leu Asp Ser Ala Pro Ile Ile Phe Gly Asp
130 135 140
Glu Val Ile Val Gly Pro Asn Thr Thr Phe Val Thr Ala Thr His Pro
145 150 155 160
Ile Ser Pro Glu Lys Arg Ala Arg Arg Ile Val Tyr Ala Leu Pro Ile
165 170 175
Lys Val Gly Asn Asn Val Trp Ile Gly Ala Asn Val Thr Val Leu Pro
180 185 190
Gly Val Thr Ile Gly Asp Gly Ser Thr Ile Ala Ala Gly Ala Val Val
195 200 205
Arg Glu Asp Val Pro Pro Arg Thr Val Val Gly Gly Val Pro Ala Arg
210 215 220
Ile Leu Lys His Ile Pro Glu Glu Asp Pro Asp Glu Ala Glu Gly Glu
225 230 235 240
Glu Leu Glu Phe Leu Leu Pro Val Glu Met Asn Val Asn Thr Ala Asn
245 250 255
Gln Lys Val
<210> 10
<211> 1299
<212> PRT
<213> 假丝酵母菌
<400> 10
Met Val Asp Asp Ile Gln Val Glu Lys Arg Glu Lys Leu Ile Glu Thr
1 5 10 15
Lys Asp Lys Leu Leu Glu Glu Lys Leu Ser Ala Leu Asp Pro His Glu
20 25 30
Ala Asn Val Leu Arg Ser Gln Leu Glu Thr Lys Arg Val Ala Thr Ser
35 40 45
Phe Phe Arg Leu Phe Arg Phe Cys Thr Pro Leu Asp Val Phe Leu Glu
50 55 60
Ile Leu Ala Leu Phe Phe Ala Ala Val His Gly Ala Ala Leu Pro Met
65 70 75 80
Phe Thr Leu Val Val Gly Ala Ile Phe Asn Thr Phe Arg Asp Phe Thr
85 90 95
Ser Tyr Asp Leu Lys Gly Asn Glu Phe Gln His Lys Val Asn His Leu
100 105 110
Ser Leu Tyr Phe Val Tyr Ile Gly Ile Gly Met Leu Gly Ser Ala Phe
115 120 125
Leu Glu Ser Phe Leu Leu Val Asp Arg Gly Glu Val Leu Ala Gly Arg
130 135 140
Tyr Arg Lys His Tyr Leu Ser Ala Val Ile Arg Gln Asn Ile Ala Phe
145 150 155 160
Tyr Asp Lys Leu Gly Gly Gly Glu Val Ser Thr Arg Ile Ile Asn Asp
165 170 175
Thr Asn Ser Ile Gln Glu Ala Ile Ser Asp Lys Leu Gly Asn Val Val
180 185 190
Gln Gly Ile Ala Ser Phe Ile Ala Ala Thr Val Ile Ser Phe Ala Ser
195 200 205
Gln Trp Lys Leu Ala Cys Ile Leu Leu Ser Ala Val Gly Phe Met Val
210 215 220
Ile Thr Met Gly Thr Gly Ala Thr Phe Met Ala Lys Tyr Gln Leu Arg
225 230 235 240
Ser Asp Ala Ile Tyr Ser Gln Ser Gly Ala Thr Val Ala Glu Glu Ala
245 250 255
Leu Ser Ala Val Arg Thr Thr Val Ala Phe Gly Ala Gln Pro His Leu
260 265 270
Ala Val Lys Tyr Glu Lys Val Leu Asp Arg Val Val Lys Glu Ser Lys
275 280 285
Arg Ser Ser Tyr Ser Leu Gly Val Met Leu Ala Cys Ile Trp Ala Ser
290 295 300
Thr Phe Trp Val Tyr Ala Leu Ala Leu Trp Gln Gly Ser Arg Glu Ile
305 310 315 320
Val Ser Gly Ser Ala Asp Val Gly Lys Ile Ile Val Val Ile Thr Ala
325 330 335
Met Leu Leu Gly Ser Phe Gln Leu Gly Asn Ile Ala Pro Asn Val Arg
340 345 350
Phe Leu Val Lys Gly Leu Thr Ala Ala Ser Ile Leu Asn Glu Ala Ile
355 360 365
Asp Arg Val Pro Val Ile Asp Gly Gln Ser Ile Asp Lys Gly Ile Val
370 375 380
Pro Gln Thr Lys Ala Val Gly Arg Ile Glu Leu Lys Asn Val Lys Phe
385 390 395 400
Arg Tyr Pro Ser Arg Pro Asp Val Leu Val Leu Ser Asp Phe Ser Leu
405 410 415
Glu Val Pro Ala Gly Ser Thr Val Ala Leu Val Gly Ala Ser Gly Ser
420 425 430
Gly Lys Ser Thr Ile Val Gly Ile Leu Glu Arg Phe Tyr Leu Pro Leu
435 440 445
Glu Gly Ser Val Thr Leu Asp Gly Gln Glu Ile Ser Asp Leu Asn Thr
450 455 460
Arg Trp Leu Arg Gln Gln Ile Gly Tyr Val Gln Gln Glu Pro Val Leu
465 470 475 480
Phe Ser Glu Ser Ile Tyr Glu Asn Ile Ser Tyr Gly Leu Ile Gly Thr
485 490 495
Asp Ile Glu Phe Ala Asp Glu His Val Lys Glu Ala Lys Ile Ile Gln
500 505 510
Ala Cys Lys Asp Ala Asn Ala Trp Asp Phe Ile Gln Thr Leu Ser Glu
515 520 525
Gly Ile Gln Thr Asn Val Gly Asp Arg Gly Phe Leu Leu Ser Gly Gly
530 535 540
Gln Lys Gln Arg Ile Ala Ile Ala Arg Ala Ile Val Ser Asp Pro Lys
545 550 555 560
Ile Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp Thr Lys Ser Glu
565 570 575
Gly Ile Val Gln Asp Ala Leu Asp Lys Ala Ala Glu Gly Arg Thr Thr
580 585 590
Ile Val Val Ala His Arg Leu Ser Thr Ile Lys Asp Ala Asn Lys Ile
595 600 605
Val Val Met Ser Lys Gly Asn Val Ile Glu Gln Gly Thr His Asn Glu
610 615 620
Leu Ile Gln Arg Glu Gly Pro Tyr Lys Ala Leu Val Asp Ala Gln Arg
625 630 635 640
Val Thr Lys Ala Lys Ser Thr Asn Val Glu Val Leu Asp Ile Glu Ala
645 650 655
Leu Asp Ile Ser Pro Leu Asp Ser Leu Asn Glu Lys Phe Asn Pro Lys
660 665 670
Asp Val Ser Thr Leu Ser Val His Ser Ala Gly Thr Gln Thr Thr Gln
675 680 685
Pro Pro Glu Tyr Gln Glu Asn Asp Ile Pro Gly Val Arg Asn Pro Pro
690 695 700
His Ser Thr Leu Met Thr Asn Thr Lys Leu Val Trp Gly Leu Asn Arg
705 710 715 720
Lys Glu Trp Gly Tyr Ile Leu Ile Gly Ser Leu Ala Ser Ile Ile Leu
725 730 735
Gly Tyr Cys Tyr Pro Ala Met Ala Ile Ile Thr Gly Gln Thr Thr Gly
740 745 750
Ser Met Val Leu Pro Pro Ser Glu Tyr Gly Lys Met Arg His Val Val
755 760 765
Asn Ile Met Gly Trp Trp Tyr Phe Phe Val Gly Cys Ile Ser Phe Met
770 775 780
Thr Ala Phe Ile Thr Ile Ala Ala Leu Ser Leu Ala Ser Asp Lys Leu
785 790 795 800
Val Lys Asn Ile Arg Leu Ala Leu Phe Arg Gln Leu Met Arg Met Asp
805 810 815
Ile Ala Phe Phe Asp His Lys Asn Asn Thr Pro Gly Ala Leu Thr Ser
820 825 830
Ile Leu Ala Lys Glu Ala Lys Met Ile Glu Gly Leu Ser Gly Ala Thr
835 840 845
Leu Gly Gln Ile Gln Gln Ser Leu Val Thr Leu Ile Gly Gly Ile Val
850 855 860
Thr Gly Ile Pro Phe Asn Trp Arg Ile Gly Leu Val Ala Thr Ser Val
865 870 875 880
Val Pro Val Met Leu Val Cys Gly Phe Val Arg Val Trp Val Leu Thr
885 890 895
Gln Leu Ser Asp Arg Ala Arg Glu Val Tyr Glu Arg Ser Gly Ser Met
900 905 910
Ala Ser Glu Tyr Thr Ser Ala Val Arg Thr Val Gln Ser Leu Thr Arg
915 920 925
Glu Leu Asp Val Val Val Lys Tyr Thr Lys Thr Val Asp Ser Gln Ile
930 935 940
Phe Ser Ser Arg Ile Ala Ile Ala Arg Ser Ala Leu Tyr Tyr Ala Leu
945 950 955 960
Ser Glu Gly Met Thr Pro Trp Val Val Ala Leu Val Phe Trp Trp Gly
965 970 975
Ser Thr Val Met Arg Arg Gly Glu Ala Ser Val Ala Gly Tyr Met Thr
980 985 990
Val Phe Met Ala Ile Ile Thr Gly Ser Gln Ala Ala Gly Gln Ile Phe
995 1000 1005
Ser Tyr Ala Pro Asn Met Asn Ser Ala Lys Asp Ala Ala Arg Asn
1010 1015 1020
Ile Tyr Arg Ile Leu Thr Ala Thr Pro Ser Ile Asp Val Trp Ser
1025 1030 1035
Glu Glu Gly Tyr Val Ala Pro Glu Glu Ser Val Arg Gly Asp Ile
1040 1045 1050
Glu Phe Arg His Val Asn Phe Arg Tyr Pro Thr Arg Pro Gln Val
1055 1060 1065
Pro Val Leu Gln Asp Leu Asn Leu Thr Val Lys Lys Gly Gln Tyr
1070 1075 1080
Ile Ala Leu Val Gly Ala Ser Gly Cys Gly Lys Ser Thr Thr Ile
1085 1090 1095
Gly Leu Val Glu Arg Phe Tyr Asp Pro Leu Ala Gly Gln Val Leu
1100 1105 1110
Phe Asp Gly Lys Asp Leu Arg Glu Tyr Asn Leu Asn Ala Leu Arg
1115 1120 1125
Ser His Ile Ala Leu Val Gln Gln Glu Pro Met Leu Tyr Ser Gly
1130 1135 1140
Thr Leu Arg Glu Asn Ile Leu Met Gly Trp Ser Gly Pro Glu Ser
1145 1150 1155
Glu Val Thr Gln Glu Met Ile Glu Asp Ala Ala Arg Lys Ala Asn
1160 1165 1170
Ile His Glu Phe Ile Met Ser Leu Pro Asp Gly Tyr Glu Thr Leu
1175 1180 1185
Ser Gly Ser Arg Gly Ser Leu Leu Ser Gly Gly Gln Lys Gln Arg
1190 1195 1200
Ile Ala Ile Ala Arg Ala Leu Ile Arg Asn Pro Lys Val Leu Leu
1205 1210 1215
Leu Asp Glu Ala Thr Ser Ala Leu Asp Ser Glu Ser Glu Lys Val
1220 1225 1230
Val Gln Ala Ala Leu Asp Ala Ala Ala Lys Gly Arg Thr Thr Ile
1235 1240 1245
Ala Val Ala His Arg Leu Ser Thr Ile Gln Lys Ala Asp Val Ile
1250 1255 1260
Tyr Val Phe Ser Gly Gly Arg Ile Val Glu Gln Gly Asp His Gln
1265 1270 1275
Ser Leu Leu Glu Leu Asn Gly Trp Tyr Ala Glu Leu Val Asn Leu
1280 1285 1290
Gln Gly Leu Gly Glu Ile
1295
<210> 11
<211> 432
<212> PRT
<213> 假丝酵母菌
<400> 11
Met Ala Ile Glu Lys Pro Val Ile Val Ala Cys Ala Cys Pro Leu Ala
1 5 10 15
Gly His Val Gly Pro Val Leu Ser Leu Val Arg Gly Leu Leu Asn Arg
20 25 30
Gly Tyr Glu Val Thr Phe Val Thr Gly Asn Ala Phe Lys Glu Lys Val
35 40 45
Ile Glu Ala Gly Cys Thr Phe Val Pro Leu Gln Gly Arg Ala Asp Tyr
50 55 60
His Glu Tyr Asn Leu Pro Glu Ile Ala Pro Gly Leu Leu Thr Ile Pro
65 70 75 80
Pro Gly Leu Glu Gln Thr Gly Tyr Ser Met Asn Glu Ile Phe Val Lys
85 90 95
Ala Ile Pro Glu Gln Tyr Asp Ala Leu Gln Thr Ala Leu Lys Gln Val
100 105 110
Glu Ala Glu Asn Lys Ser Ala Val Val Ile Gly Glu Thr Met Phe Leu
115 120 125
Gly Val His Pro Ile Ser Leu Gly Ala Pro Gly Leu Lys Pro Gln Gly
130 135 140
Val Ile Thr Leu Gly Thr Ile Pro Cys Met Leu Lys Ala Glu Lys Ala
145 150 155 160
Pro Gly Val Pro Ser Leu Glu Pro Met Ile Asp Thr Leu Val Arg Gln
165 170 175
Gln Val Phe Gln Pro Gly Thr Asp Ser Glu Lys Glu Ile Met Lys Thr
180 185 190
Leu Gly Ala Thr Lys Glu Pro Glu Phe Leu Leu Glu Asn Ile Tyr Ser
195 200 205
Ser Pro Asp Arg Phe Leu Gln Leu Cys Pro Pro Ser Leu Glu Phe His
210 215 220
Leu Thr Ser Pro Pro Pro Gly Phe Ser Phe Ala Gly Ser Ala Pro His
225 230 235 240
Val Lys Ser Ala Gly Leu Ala Thr Pro Pro His Leu Pro Ser Trp Trp
245 250 255
Pro Asp Val Leu Ser Ala Lys Arg Leu Ile Val Val Thr Gln Gly Thr
260 265 270
Ala Ala Ile Asn Tyr Glu Asp Leu Leu Ile Pro Ala Leu Gln Ala Phe
275 280 285
Ala Asp Glu Glu Asp Thr Leu Val Val Gly Ile Leu Gly Val Lys Gly
290 295 300
Ala Ser Leu Pro Asp Ser Val Lys Val Pro Ala Asn Ala Arg Ile Val
305 310 315 320
Asp Tyr Phe Pro Tyr Asp Glu Leu Leu Pro His Ala Ser Val Phe Ile
325 330 335
Tyr Asn Gly Gly Tyr Gly Gly Leu Gln His Ser Leu Ser His Gly Val
340 345 350
Pro Val Ile Ile Gly Gly Gly Met Leu Val Asp Lys Pro Ala Val Ala
355 360 365
Ser Arg Ala Val Trp Ala Gly Val Gly Tyr Asp Leu Gln Thr Leu Gln
370 375 380
Ala Thr Ser Glu Leu Val Ser Thr Ala Val Lys Glu Val Leu Ala Thr
385 390 395 400
Pro Ser Tyr His Glu Lys Ala Met Ala Val Lys Lys Glu Leu Glu Lys
405 410 415
Tyr Lys Ser Leu Asp Ile Leu Glu Ser Ala Ile Ser Glu Leu Ala Ser
420 425 430
<210> 12
<211> 4143
<212> DNA
<213> 人工的
<220>
<223> 整合构建体
<400> 12
aattgttcga tggatagctt tggagtctgt cccatcatga tacgaaaagc gtgaagctcc 60
tctgacaatc aaaactttgt ttcaatgggg tgtaggatgg accccggatc caaacgaccg 120
cgagtcaaaa aacctacggg tgcatttacc cgtagttgat ctggaaagtc gagatcaact 180
ttttgtagtt tagttacatt catttcacgg tcgaaaaact cacacacaac gattgcagta 240
tatttaccaa aatcgtctga agagaagcat ctgattgaga gttcaccatg acgaatccca 300
taaacgacta ctccactgga cacaccgaca gacgccctgg ggatagtgaa actgaatttg 360
tcggtataat ggcccgtctc acaggccggg cagaacactt tcatgtcctt tcgcaggtct 420
cgacattgga caagtatgtt gtcgtgggtg acgacaaatt ggtcctcatc cttgaataag 480
atgctccctt tgttctcagg aactggcacc attccattat gggcgaataa tttctgctca 540
tcttcgggac tgatgccata ttcttctaac agaagacggc gctcacatgg gacctggtgc 600
tctcgccggc ctctcaaatc gccggtgcat ctccacacgc aaattcacgg gtgtataccc 660
ctgatcaaac gtatcttgcg cgttctgtta ttcattggag cgagggcccg atcctgtcct 720
atcaaatgat ttcatgtggg aataatccat caattgttct ggattgaggt atacttcgag 780
ctgtaaagat gtcgcttcta tgtcaagaat agtcggttaa acgcactcct tcaagattta 840
catgatttac atgattcttc ataaagagca taaataaaga actgcagcca ttcttgagta 900
aagtgctcag aataataaaa aggttgccac aggttgagtt aacatgggtt gattgaacca 960
attaaggagg gaacgtttct tccatgggag gctaagaaac ttaataactt cgtataatgt 1020
atgctatacg aagttattaa ttaactgacg ggcggatagt acaggctttg ccaaaagcct 1080
ataaggctaa agaaagtaaa caagtgaggt tgaaccatga tggcagtgtt cgaattctga 1140
tcaatgaagt acactgcgaa gggaatcccc gaaacggcga acaaaaagaa catcagagga 1200
ggaacgccct cgcaatcccg aacataccag tttcgcagaa cctggggtat caactggatg 1260
caccagcata ctgttcccac tgttgccaat gctgtagacg ctccattgtt gtcagtcatt 1320
ttagcatttt acagtaacca actccaaaaa acagcccgct ctgctgggaa gacttcgcaa 1380
ttatttatcc actactgctg cggttatata cttctcgatc tcagtctcgg ttataattgc 1440
cgcttgacag cctggagaaa ttcggatact ccacgtgata attgccatag ggcataattt 1500
tcgaaacagc tcgcaacgat ctcggctagt tttccccttt tttgacccat atcgacgctg 1560
agactcactc acttgatgcc taccgttagg gtaaattttt caagcctgca gaatatcgcg 1620
ggacgcagtc tcctgcacgc gcgtgacttc atcttactta catcaaacag cccgattaat 1680
ttgaaaagtc ctagctgatc gagggcacgg gcactactgt agagaaataa tatgaagctg 1740
agctatgagg agcgccgaga gaggctgccg gctgtagcag cccggctatt cgacatcatt 1800
gtgagcaagc aaacaaatct ttgcgcaagc ttggatgtgc gaactacctc tgagttactg 1860
agtatcctgg accgcattgg accttacatt tgtatggtta agacccacat tgacataatt 1920
gacgacttcg aatacgacac aactgtcagc ggtttgaaac agctttcaac gaagcacaat 1980
tttctcattt ttgaagaccg aaagttcgca gacatcggtt ccactgttaa ggcccaatat 2040
gcaggtggag tgtttaagat cgctcaatgg gctgatataa caaatgctca cggtgttcct 2100
gggccgggaa ttgtgagcgg actagaagag gctgcgaagg aaactacgga tgaacctcgc 2160
ggccttgtca tgcttgcaga actgagttcg aagggcacac tggctcacgg cgaatactcg 2220
caagcgacag tagacatcgc tcgcagtaac cgcgcatttg tgtttggttt catcgctcag 2280
caaaaagtcg gaaagccaga ggaagactgg gtcattatga ctcctggggt gggcctggac 2340
gacaaaggtg atggattggg gcagcagtat cgtactgtgg acgacgtcat agagaccggc 2400
acagacgtta ttatcgtcgg acgcgggctc tatagcaagg gacgagatcc tgtgcacgaa 2460
gctcagcgtt accaaaaggc gggctggaat gcatatctga gaaaagttca gtcaagatga 2520
ttttctcaaa cagttccttc aatgcaactt gcacatgaat acctataaaa tctgattaaa 2580
ttaccataaa aggtacagat taaaatatat atgccttcaa tggcatcctt cgcgattctg 2640
attcgtcagc acacttcaac cttcctacta tgagtgacag tgatgatgat ctgctggcat 2700
tggccgacgt tggctccgac tccgaagagg aaatctcgct gccgtcgccg ccaagcaatg 2760
aggtcgtcaa tccctatcct ctagaaggca aatatctcga tgctgaagac agggcgaagt 2820
tggacgcgct gccagagatt gagcgagaag agatcttgta tgaccgagct caggagatgc 2880
agcggtacga ggagagaagg tatcttgctc agcgaaggaa gcagatgacg cgggttgctg 2940
acgaggacga agccccctcc gccaagcgtc aacggggtac aacaggcgtc tcttcgggta 3000
cgaagtcatc tcttgaggca ttaaagaaac gaagggccca gcagtctcgg aagtcctcac 3060
gccatggagt tgatgacgat gtgtatagtg acgatgatgt taattaataa cttcgtataa 3120
tgtatgctat acgaagttat atatgtactt ttcaatatga taaacggaga aataacgccc 3180
ggctctatat gcaagctgca tcaaccctaa tatatattag cgagtttctc atgcaggctg 3240
tagtttgagt cgctgtaacc tcagcctcaa gactcttaca ccataggtag agtttcgtca 3300
ctgggaaact cagttactat ctaaaccaaa ctgtgctaat gctcaaacct atcactcaga 3360
atttagattg aatcaatcta agtctgttga gaaacagata tgcatcaggg gcacagacta 3420
aaagctgctc tcagcgagta cccttacctc ttgagaaccc tcaaaattta cccagcctgc 3480
agcatatcat gcaccatggt taaattcgga aatgaattta ccggtggcct tgaaccacgt 3540
tcctccaatt atttaaggca ataacctgcc actctcttga tttgattaag aaagactttc 3600
aatttagctt ctccctacga atattcaatg agcccttcat cacacaaacc cctgattctc 3660
gcttgcggct tgcctctttc aggccatata atgcccgttt tgagtctggt acacggcctt 3720
acggacgacg gatacgaagc tactgttgtg acaggcagag cgtttgaaca aaaagttcga 3780
gatgtgggtg cagactttgt tcctttagaa gggaacgcag attttgatga ccacacctta 3840
gacgatctgg tcccgggccg taaagacatg gccccaagct tcgatcgtac agttcaagat 3900
gtggagcaca tgatggtagc tactcttcct gagcagtttg ccgctattca gagggctttc 3960
aaaaagctca gcgcaagcgg ccgccctgtc gttcttgtca gtgaagtgct gtttttcggt 4020
gcacacccta tcagcctcgg tgctcctggt ttcaaacccg ctggctggat ttgtttaggg 4080
gttttgcctc ttttgatccg cagtgatcat accttaggac ttgacaacga caggagcccc 4140
gaa 4143
<210> 13
<211> 4143
<212> DNA
<213> 人工的
<220>
<223> 整合构建体
<400> 13
gaaatctgat caattctgca aacctgatct ttagtgaact gaggtctcaa catcgatcga 60
gactgtttcc atccatttcc gctgagtgta aatatccctt ggccaaacac ttttcccact 120
gtgtggaaac gtgctccaag accaaaatca ttgaatttgg ttgccaggat tgtcttaatg 180
ttttctggct cgattgtgaa gatttggtat tgaaggggag cttgtcgaag atacgtccgt 240
gctttgaact tattgaagac tctgtcgtat tgaacttcca gtaaggtgta tgacttggcc 300
gtcttgatca tgtccatggt tctttgtatt cccagtggga acgatttctc aatgaagcga 360
ggcatactac acttgtgcct acgtgctgca tagcggtacc ataggagcca gataggctcg 420
tgtagaacta agaaagctac gaagagcagt ggcaacaagc cagcaacagc ggataaactc 480
attggagtta gaataatgtc tttgattaac atatatgtac ttttcaatat gataaacgga 540
gaaataacgc ccggctctat atgcaagctg catcaaccct aatatatatt agcgagtttc 600
tcatgcaggc tgtagtttga gtcgctgtaa cctcagcctc aagactctta caccataggt 660
agagtttcgt cactgggaaa ctcagttact atctaaacca aactgtgcta atgctcaaac 720
ctatcactca gaatttagat tgaatcaatc taagtctgtt gagaaacaga tatgcatcag 780
gggcacagac taaaagctgc tctcagcgag tacccttacc tcttgagaac cctcaaaatt 840
tacccagcct gcagcatatc atgcaccatg gttaaattcg gaaatgaatt taccggtggc 900
cttgaaccac gttcctccaa ttatttaagg caataacctg ccactctctt gatttgatta 960
agaaagactt tcaatttagc ttctccctac gaatattcaa taacttcgta taatgtatgc 1020
tatacgaagt tattaattaa ctgacgggcg gatagtacag gctttgccaa aagcctataa 1080
ggctaaagaa agtaaacaag tgaggttgaa ccatgatggc agtgttcgaa ttctgatcaa 1140
tgaagtacac tgcgaaggga atccccgaaa cggcgaacaa aaagaacatc agaggaggaa 1200
cgccctcgca atcccgaaca taccagtttc gcagaacctg gggtatcaac tggatgcacc 1260
agcatactgt tcccactgtt gccaatgctg tagacgctcc attgttgtca gtcattttag 1320
cattttacag taaccaactc caaaaaacag cccgctctgc tgggaagact tcgcaattat 1380
ttatccacta ctgctgcggt tatatacttc tcgatctcag tctcggttat aattgccgct 1440
tgacagcctg gagaaattcg gatactccac gtgataattg ccatagggca taattttcga 1500
aacagctcgc aacgatctcg gctagttttc cccttttttg acccatatcg acgctgagac 1560
tcactcactt gatgcctacc gttagggtaa atttttcaag cctgcagaat atcgcgggac 1620
gcagtctcct gcacgcgcgt gacttcatct tacttacatc aaacagcccg attaatttga 1680
aaagtcctag ctgatcgagg gcacgggcac tactgtagag aaataatatg aagctgagct 1740
atgaggagcg ccgagagagg ctgccggctg tagcagcccg gctattcgac atcattgtga 1800
gcaagcaaac aaatctttgc gcaagcttgg atgtgcgaac tacctctgag ttactgagta 1860
tcctggaccg cattggacct tacatttgta tggttaagac ccacattgac ataattgacg 1920
acttcgaata cgacacaact gtcagcggtt tgaaacagct ttcaacgaag cacaattttc 1980
tcatttttga agaccgaaag ttcgcagaca tcggttccac tgttaaggcc caatatgcag 2040
gtggagtgtt taagatcgct caatgggctg atataacaaa tgctcacggt gttcctgggc 2100
cgggaattgt gagcggacta gaagaggctg cgaaggaaac tacggatgaa cctcgcggcc 2160
ttgtcatgct tgcagaactg agttcgaagg gcacactggc tcacggcgaa tactcgcaag 2220
cgacagtaga catcgctcgc agtaaccgcg catttgtgtt tggtttcatc gctcagcaaa 2280
aagtcggaaa gccagaggaa gactgggtca ttatgactcc tggggtgggc ctggacgaca 2340
aaggtgatgg attggggcag cagtatcgta ctgtggacga cgtcatagag accggcacag 2400
acgttattat cgtcggacgc gggctctata gcaagggacg agatcctgtg cacgaagctc 2460
agcgttacca aaaggcgggc tggaatgcat atctgagaaa agttcagtca agatgatttt 2520
ctcaaacagt tccttcaatg caacttgcac atgaatacct ataaaatctg attaaattac 2580
cataaaaggt acagattaaa atatatatgc cttcaatggc atccttcgcg attctgattc 2640
gtcagcacac ttcaaccttc ctactatgag tgacagtgat gatgatctgc tggcattggc 2700
cgacgttggc tccgactccg aagaggaaat ctcgctgccg tcgccgccaa gcaatgaggt 2760
cgtcaatccc tatcctctag aaggcaaata tctcgatgct gaagacaggg cgaagttgga 2820
cgcgctgcca gagattgagc gagaagagat cttgtatgac cgagctcagg agatgcagcg 2880
gtacgaggag agaaggtatc ttgctcagcg aaggaagcag atgacgcggg ttgctgacga 2940
ggacgaagcc ccctccgcca agcgtcaacg gggtacaaca ggcgtctctt cgggtacgaa 3000
gtcatctctt gaggcattaa agaaacgaag ggcccagcag tctcggaagt cctcacgcca 3060
tggagttgat gacgatgtgt atagtgacga tgatgttaat taataacttc gtataatgta 3120
tgctatacga agttattaga atcgtacgat caaatcagat cagggaagag aggtagggtt 3180
ttttttattt atgtctttgt ttttattgat tgaaatttac aatacaacaa ccatcaaatt 3240
aatttgaaca aacaacaaca cacacacaca ctgcaacttt caaaaaaata agtaaaagga 3300
agagaggagt ttgccaatat atttaccttc ttctaattct gttatttttt ttaattgttt 3360
tgtggaaaga aagaagaaaa ggctgtcatg aatttagttt acctagacct tctggttagc 3420
ggtattgacg ttcatttcaa ctggaagaag gaattccagt tcctctcctt cagcctcgtc 3480
gggatcctcc tctggaatat gcttgaggat tcgcgcaggg actcctccca ccacagtacg 3540
aggaggaaca tcttctcgaa cgacagcacc agccgcaatt gttgagccat ctccaatcgt 3600
aacacccggc aggacagtca cattcgcacc aatccataca ttattcccca ccttgatagg 3660
aagagcatac acaattctcc tcgcacgttt ctcggggcta ataggatgag tcgcagtcac 3720
gaacgttgta ttgggcccta caatcacctc atcaccaaag attattggag ccgagtccaa 3780
gaagcaaacg ttgaagttgg cgtaaaagtg ctcgcctacg ctgatgttga atccaaaatc 3840
aactgagaat ggagcggtca gccagacaat atcctttgtt tgaccaaaag tgtctttgag 3900
aatctcgacc ttcttgatat aagcagcgtg atttgactca aaagtacgac tttcacttgc 3960
aatggtattg aactccctaa ctttctcact agtagccagg gctctaaaca taagatctgg 4020
atcgtatgga ttgtaaggaa ctcctgagac catcttctca tagttttcat tgccaggggt 4080
gtttttgagg ttttttttgg cccaagagac catttcctgg tcaatttctt ttctaggagt 4140
cat 4143
<210> 14
<211> 4140
<212> DNA
<213> 人工的
<220>
<223> 整合构建体
<400> 14
tgcagacaag ttcctgcagc tgtgcccgcc ttctcttgag ttcagcagag accatctgcc 60
tagcaacttc aaattcgccg gctcaacgcc caagcaccga actcaattca cccctccttc 120
ctggtggggg gatgttctga gtgccaagcg agtcatcatg gtcactcaag gaacttttgc 180
tgtcagttac aagcatctta ttgtgcctac tcttgaggcc ttgaaggacg agcctgacac 240
tttaacagta gccatattgg gccgccgcgg tgccaagcta ccggatgatg ttgtggttcc 300
tgagaatgct cgcgtgatcg actacttcaa ctacgatgct ctacttcctc acgttgatgc 360
tcttgtctac aatggtggat atggcggact tcagcacagc ttaagccact ctgttccagt 420
tgttattgct ggtgactctg aagacaagcc aatggtggca tcgagagctg aggccgctgg 480
cgtggcaatt gatttgaaaa ctggcttgcc tacagtggag caaatcaaag aagctgttga 540
ttcgataatt ggaaatccga aattccacga agcctcgaag aaggttcaaa tggagttgga 600
aagccacaac tccttgaaaa ttcttgagga aagcatcgag gaaatcgcca gccatgactt 660
tggtcttttg accaagagtg acgaggaaac tgaagatata cctgtcaaag ggccggcctt 720
agcggtgagt tcttagaatc gtacgatcaa atcagatcag ggaagagagg tagggttttt 780
tttatttatg tctttgtttt tattgattga aatttacaat acaacaacca tcaaattaat 840
ttgaacaaac aacaacacac acacacactg caactttcaa aaaaataagt aaaaggaaga 900
gaggagtttg ccaatatatt taccttcttc taattctgtt atttttttta attgttttgt 960
ggaaagaaag aagaaaaggc tgtcatgaat ttagtttacc taataacttc gtataatgta 1020
tgctatacga agttattaat taactgacgg gcggatagta caggctttgc caaaagccta 1080
taaggctaaa gaaagtaaac aagtgaggtt gaaccatgat ggcagtgttc gaattctgat 1140
caatgaagta cactgcgaag ggaatccccg aaacggcgaa caaaaagaac atcagaggag 1200
gaacgccctc gcaatcccga acataccagt ttcgcagaac ctggggtatc aactggatgc 1260
accagcatac tgttcccact gttgccaatg ctgtagacgc tccattgttg tcagtcattt 1320
tagcatttta cagtaaccaa ctccaaaaaa cagcccgctc tgctgggaag acttcgcaat 1380
tatttatcca ctactgctgc ggttatatac ttctcgatct cagtctcggt tataattgcc 1440
gcttgacagc ctggagaaat tcggatactc cacgtgataa ttgccatagg gcataatttt 1500
cgaaacagct cgcaacgatc tcggctagtt ttcccctttt ttgacccata tcgacgctga 1560
gactcactca cttgatgcct accgttaggg taaatttttc aagcctgcag aatatcgcgg 1620
gacgcagtct cctgcacgcg cgtgacttca tcttacttac atcaaacagc ccgattaatt 1680
tgaaaagtcc tagctgatcg agggcacggg cactactgta gagaaataat atgaagctga 1740
gctatgagga gcgccgagag aggctgccgg ctgtagcagc ccggctattc gacatcattg 1800
tgagcaagca aacaaatctt tgcgcaagct tggatgtgcg aactacctct gagttactga 1860
gtatcctgga ccgcattgga ccttacattt gtatggttaa gacccacatt gacataattg 1920
acgacttcga atacgacaca actgtcagcg gtttgaaaca gctttcaacg aagcacaatt 1980
ttctcatttt tgaagaccga aagttcgcag acatcggttc cactgttaag gcccaatatg 2040
caggtggagt gtttaagatc gctcaatggg ctgatataac aaatgctcac ggtgttcctg 2100
ggccgggaat tgtgagcgga ctagaagagg ctgcgaagga aactacggat gaacctcgcg 2160
gccttgtcat gcttgcagaa ctgagttcga agggcacact ggctcacggc gaatactcgc 2220
aagcgacagt agacatcgct cgcagtaacc gcgcatttgt gtttggtttc atcgctcagc 2280
aaaaagtcgg aaagccagag gaagactggg tcattatgac tcctggggtg ggcctggacg 2340
acaaaggtga tggattgggg cagcagtatc gtactgtgga cgacgtcata gagaccggca 2400
cagacgttat tatcgtcgga cgcgggctct atagcaaggg acgagatcct gtgcacgaag 2460
ctcagcgtta ccaaaaggcg ggctggaatg catatctgag aaaagttcag tcaagatgat 2520
tttctcaaac agttccttca atgcaacttg cacatgaata cctataaaat ctgattaaat 2580
taccataaaa ggtacagatt aaaatatata tgccttcaat ggcatccttc gcgattctga 2640
ttcgtcagca cacttcaacc ttcctactat gagtgacagt gatgatgatc tgctggcatt 2700
ggccgacgtt ggctccgact ccgaagagga aatctcgctg ccgtcgccgc caagcaatga 2760
ggtcgtcaat ccctatcctc tagaaggcaa atatctcgat gctgaagaca gggcgaagtt 2820
ggacgcgctg ccagagattg agcgagaaga gatcttgtat gaccgagctc aggagatgca 2880
gcggtacgag gagagaaggt atcttgctca gcgaaggaag cagatgacgc gggttgctga 2940
cgaggacgaa gccccctccg ccaagcgtca acggggtaca acaggcgtct cttcgggtac 3000
gaagtcatct cttgaggcat taaagaaacg aagggcccag cagtctcgga agtcctcacg 3060
ccatggagtt gatgacgatg tgtatagtga cgatgatgtt aattaataac ttcgtataat 3120
gtatgctata cgaagttatt gaattctaga atgtgaggtg gaatgaggca aggaaggagg 3180
aacgtattga gttgtacctt aagatatctc aaagtgctta tctccgacta ccggaatatg 3240
ctccgggtaa tgcaagtcag tgtgcatatg ggtaaggtga tgcaagctaa ccctcagggc 3300
atatctaatt cgcgtgaggg ttattattgg tctacattac ctcagtcata gcccgtcaaa 3360
gcaaaagccc aaaatcagca cgaaatccca gagatagatt gttgctgtct cttcaagtac 3420
tacgacagtt ccctatatct acagattatc gtcacgagtg aattatgcag gataggtgac 3480
tcaggggtca taatcagagg aatccaatgt gctatttcaa ttaacgagtc cctttaatca 3540
gacaatgtat ggtgactcag gggccataac tagagaaatt cgatatgcta tttcaattaa 3600
tgagtgcctt taatcaaata atgtatgcaa gcagtggcca aaaataaatg aacgtcaaat 3660
ctctccgaga ccttgcaagt tcaccaattc agcgtaccat ccattgagtt caaggaggct 3720
ctgatggtcg ccctgctcca cgatgcgccc tcctgagaac acatatatga catctgcttt 3780
ctgaattgtt gataatctat gcgcaacggc gattgtagta cggcccttcg ctgctgcgtc 3840
gagtgctgct tgaactactt tctcagattc ggaatccaga gctgaggtgg cctcatcgag 3900
gaggagtacc tttggatttc tgatcagggc ccttgcaatt gcaattcgct gcttttgccc 3960
cccagatagc aacgatcccc tagatccgct gagcgtttcg tagccatcag gcaacgacat 4020
gatgaattcg tgaatgttcg ctttgcgagc ggcatcctca atcatctcct gcgttacttc 4080
agactcaggg ccagaccatc ccattagaat attctcacgt agcgtgcctg aataaagcat 4140
<210> 15
<211> 4130
<212> DNA
<213> 人工的
<220>
<223> 整合构建体
<400> 15
ggatgagtcg cagtcacgaa cgttgtattg ggccctacaa tcacctcatc accaaagatt 60
attggagccg agtccaagaa gcaaacgttg aagttggcgt aaaagtgctc gcctacgctg 120
atgttgaatc caaaatcaac tgagaatgga gcggtcagcc agacaatatc ctttgtttga 180
ccaaaagtgt ctttgagaat ctcgaccttc ttgatataag cagcgtgatt tgactcaaaa 240
gtacgacttt cacttgcaat ggtattgaac tccctaactt tctcactagt agccagggct 300
ctaaacataa gatctggatc gtatggattg taaggaactc ctgagaccat cttctcatag 360
ttttcattgc caggggtgtt tttgaggttt tttttggccc aagagaccat ttcctggtca 420
atttcttttc taggagtcat tcctttgttt tgagggtcct tcgaggagtt tacaaccatt 480
gaattctaga atgtgaggtg gaatgaggca aggaaggagg aacgtattga gttgtacctt 540
aagatatctc aaagtgctta tctccgacta ccggaatatg ctccgggtaa tgcaagtcag 600
tgtgcatatg ggtaaggtga tgcaagctaa ccctcagggc atatctaatt cgcgtgaggg 660
ttattattgg tctacattac ctcagtcata gcccgtcaaa gcaaaagccc aaaatcagca 720
cgaaatccca gagatagatt gttgctgtct cttcaagtac tacgacagtt ccctatatct 780
acagattatc gtcacgagtg aattatgcag gataggtgac tcaggggtca taatcagagg 840
aatccaatgt gctatttcaa ttaacgagtc cctttaatca gacaatgtat ggtgactcag 900
gggccataac tagagaaatt cgatatgcta tttcaattaa tgagtgcctt taatcaaata 960
atgtatgcaa gcagtggcca aaaataaatg aacgtcaata acttcgtata atgtatgcta 1020
tacgaagtta ttaattaact gacgggcgga tagtacaggc tttgccaaaa gcctataagg 1080
ctaaagaaag taaacaagtg aggttgaacc atgatggcag tgttcgaatt ctgatcaatg 1140
aagtacactg cgaagggaat ccccgaaacg gcgaacaaaa agaacatcag aggaggaacg 1200
ccctcgcaat cccgaacata ccagtttcgc agaacctggg gtatcaactg gatgcaccag 1260
catactgttc ccactgttgc caatgctgta gacgctccat tgttgtcagt cattttagca 1320
ttttacagta accaactcca aaaaacagcc cgctctgctg ggaagacttc gcaattattt 1380
atccactact gctgcggtta tatacttctc gatctcagtc tcggttataa ttgccgcttg 1440
acagcctgga gaaattcgga tactccacgt gataattgcc atagggcata attttcgaaa 1500
cagctcgcaa cgatctcggc tagttttccc cttttttgac ccatatcgac gctgagactc 1560
actcacttga tgcctaccgt tagggtaaat ttttcaagcc tgcagaatat cgcgggacgc 1620
agtctcctgc acgcgcgtga cttcatctta cttacatcaa acagcccgat taatttgaaa 1680
agtcctagct gatcgagggc acgggcacta ctgtagagaa ataatatgaa gctgagctat 1740
gaggagcgcc gagagaggct gccggctgta gcagcccggc tattcgacat cattgtgagc 1800
aagcaaacaa atctttgcgc aagcttggat gtgcgaacta cctctgagtt actgagtatc 1860
ctggaccgca ttggacctta catttgtatg gttaagaccc acattgacat aattgacgac 1920
ttcgaatacg acacaactgt cagcggtttg aaacagcttt caacgaagca caattttctc 1980
atttttgaag accgaaagtt cgcagacatc ggttccactg ttaaggccca atatgcaggt 2040
ggagtgttta agatcgctca atgggctgat ataacaaatg ctcacggtgt tcctgggccg 2100
ggaattgtga gcggactaga agaggctgcg aaggaaacta cggatgaacc tcgcggcctt 2160
gtcatgcttg cagaactgag ttcgaagggc acactggctc acggcgaata ctcgcaagcg 2220
acagtagaca tcgctcgcag taaccgcgca tttgtgtttg gtttcatcgc tcagcaaaaa 2280
gtcggaaagc cagaggaaga ctgggtcatt atgactcctg gggtgggcct ggacgacaaa 2340
ggtgatggat tggggcagca gtatcgtact gtggacgacg tcatagagac cggcacagac 2400
gttattatcg tcggacgcgg gctctatagc aagggacgag atcctgtgca cgaagctcag 2460
cgttaccaaa aggcgggctg gaatgcatat ctgagaaaag ttcagtcaag atgattttct 2520
caaacagttc cttcaatgca acttgcacat gaatacctat aaaatctgat taaattacca 2580
taaaaggtac agattaaaat atatatgcct tcaatggcat ccttcgcgat tctgattcgt 2640
cagcacactt caaccttcct actatgagtg acagtgatga tgatctgctg gcattggccg 2700
acgttggctc cgactccgaa gaggaaatct cgctgccgtc gccgccaagc aatgaggtcg 2760
tcaatcccta tcctctagaa ggcaaatatc tcgatgctga agacagggcg aagttggacg 2820
cgctgccaga gattgagcga gaagagatct tgtatgaccg agctcaggag atgcagcggt 2880
acgaggagag aaggtatctt gctcagcgaa ggaagcagat gacgcgggtt gctgacgagg 2940
acgaagcccc ctccgccaag cgtcaacggg gtacaacagg cgtctcttcg ggtacgaagt 3000
catctcttga ggcattaaag aaacgaaggg cccagcagtc tcggaagtcc tcacgccatg 3060
gagttgatga cgatgtgtat agtgacgatg atgttaatta ataacttcgt ataatgtatg 3120
ctatacgaag ttataagcca aaatcagaga gtgggacctg attcagaatc acacggaccc 3180
gtatatataa caatcacttt ccaacaatat agcgagtatt aatatatttc cgggtaaggg 3240
ttgttccgga cttatgcatt taatcacagg ttgcatcagc taaatatgtc agggccgacg 3300
gcgtaaattt agaaggttag gtcaagatcc atcggtcagg ccaatggagc tctactatga 3360
taggcagctg aagcgagaca agatatactt cagttgcgct ctctgaaaaa attattttgt 3420
gattctcact cagtggatgt ggcgacacac ggaaccaata atctcgccgg aaaggcggct 3480
gaacatcagt cttgcataag tgtgcaagtg gcctgagcac agcgtgcatt acccttacca 3540
tacattcggg gcaagttaaa tccagcatta tataaacttg attgacacaa atgggcataa 3600
aacaataaag tctcctatat ggccatcgag aaaccagtga tagttgcttg tgcctgccca 3660
ctagcggggc acgtgggccc agtgctcagc ctggtccgcg gtctactcaa tagaggatat 3720
gaggtgactt tcgtaacagg gaacgcattc aaggagaaag ttattgaggc aggatgcact 3780
ttcgtccctc tccaaggacg agctgactac catgaataca atctccctga aatcgctcca 3840
ggattgctca cgattcctcc aggccttgag cagaccggtt actcaatgaa tgagattttt 3900
gtgaaggcga ttcctgagca gtacgatgca cttcaaactg ctctaaaaca ggttgaggct 3960
gaaaataaat cagctgtggt gattggcgag accatgtttc taggggtgca tccgatatca 4020
ctgggtgccc caggtctcaa gccccaaggc gtaatcacgt taggaactat tccgtgcatg 4080
ctgaaagcag agaaggcgcc tggagttcct agtcttgagc caatgattga 4130
<210> 16
<211> 4141
<212> DNA
<213> 人工的
<220>
<223> 整合构建体
<400> 16
attctggtgc tgacctcgcc accacctagt ttgtcgtaaa acgcgatatt ctggcgaata 60
acagcactca gataatgctt tcggtaacgt cctgccaaca cttcgcctct gtccacaagc 120
aggaagctct cgagaaacgc actgccgagc ataccaatgc caatatagac aaaatagaga 180
gacaggtgat tcaccttatg ctggaactca ttgcccttga ggtcatatga agtgaagtct 240
ctgaatgtgt tgaagatggc gcccactact aacgtgaaca ttggaagcgc ggctccatgc 300
accgctgcaa aaaaaagcgc aagtatctcc aagaaaacgt caaggggagt gcaaaatctg 360
aacaacctga aaaagcttgt ggcgactctc tttgtttcaa gctgacttcg caatacattg 420
gcctcatgtg gatctaacgc agagagcttc tcctcgagaa gcttgtcctt agtctcgatg 480
agtttctcac gcttctctac ctgtatatca tccaccataa gccaaaatca gagagtggga 540
cctgattcag aatcacacgg acccgtatat ataacaatca ctttccaaca atatagcgag 600
tattaatata tttccgggta agggttgttc cggacttatg catttaatca caggttgcat 660
cagctaaata tgtcagggcc gacggcgtaa atttagaagg ttaggtcaag atccatcggt 720
caggccaatg gagctctact atgataggca gctgaagcga gacaagatat acttcagttg 780
cgctctctga aaaaattatt ttgtgattct cactcagtgg atgtggcgac acacggaacc 840
aataatctcg ccggaaaggc ggctgaacat cagtcttgca taagtgtgca agtggcctga 900
gcacagcgtg cattaccctt accatacatt cggggcaagt taaatccagc attatataaa 960
cttgattgac acaaatgggc ataaaacaat aaagtctcct atataacttc gtataatgta 1020
tgctatacga agttattaat taactgacgg gcggatagta caggctttgc caaaagccta 1080
taaggctaaa gaaagtaaac aagtgaggtt gaaccatgat ggcagtgttc gaattctgat 1140
caatgaagta cactgcgaag ggaatccccg aaacggcgaa caaaaagaac atcagaggag 1200
gaacgccctc gcaatcccga acataccagt ttcgcagaac ctggggtatc aactggatgc 1260
accagcatac tgttcccact gttgccaatg ctgtagacgc tccattgttg tcagtcattt 1320
tagcatttta cagtaaccaa ctccaaaaaa cagcccgctc tgctgggaag acttcgcaat 1380
tatttatcca ctactgctgc ggttatatac ttctcgatct cagtctcggt tataattgcc 1440
gcttgacagc ctggagaaat tcggatactc cacgtgataa ttgccatagg gcataatttt 1500
cgaaacagct cgcaacgatc tcggctagtt ttcccctttt ttgacccata tcgacgctga 1560
gactcactca cttgatgcct accgttaggg taaatttttc aagcctgcag aatatcgcgg 1620
gacgcagtct cctgcacgcg cgtgacttca tcttacttac atcaaacagc ccgattaatt 1680
tgaaaagtcc tagctgatcg agggcacggg cactactgta gagaaataat atgaagctga 1740
gctatgagga gcgccgagag aggctgccgg ctgtagcagc ccggctattc gacatcattg 1800
tgagcaagca aacaaatctt tgcgcaagct tggatgtgcg aactacctct gagttactga 1860
gtatcctgga ccgcattgga ccttacattt gtatggttaa gacccacatt gacataattg 1920
acgacttcga atacgacaca actgtcagcg gtttgaaaca gctttcaacg aagcacaatt 1980
ttctcatttt tgaagaccga aagttcgcag acatcggttc cactgttaag gcccaatatg 2040
caggtggagt gtttaagatc gctcaatggg ctgatataac aaatgctcac ggtgttcctg 2100
ggccgggaat tgtgagcgga ctagaagagg ctgcgaagga aactacggat gaacctcgcg 2160
gccttgtcat gcttgcagaa ctgagttcga agggcacact ggctcacggc gaatactcgc 2220
aagcgacagt agacatcgct cgcagtaacc gcgcatttgt gtttggtttc atcgctcagc 2280
aaaaagtcgg aaagccagag gaagactggg tcattatgac tcctggggtg ggcctggacg 2340
acaaaggtga tggattgggg cagcagtatc gtactgtgga cgacgtcata gagaccggca 2400
cagacgttat tatcgtcgga cgcgggctct atagcaaggg acgagatcct gtgcacgaag 2460
ctcagcgtta ccaaaaggcg ggctggaatg catatctgag aaaagttcag tcaagatgat 2520
tttctcaaac agttccttca atgcaacttg cacatgaata cctataaaat ctgattaaat 2580
taccataaaa ggtacagatt aaaatatata tgccttcaat ggcatccttc gcgattctga 2640
ttcgtcagca cacttcaacc ttcctactat gagtgacagt gatgatgatc tgctggcatt 2700
ggccgacgtt ggctccgact ccgaagagga aatctcgctg ccgtcgccgc caagcaatga 2760
ggtcgtcaat ccctatcctc tagaaggcaa atatctcgat gctgaagaca gggcgaagtt 2820
ggacgcgctg ccagagattg agcgagaaga gatcttgtat gaccgagctc aggagatgca 2880
gcggtacgag gagagaaggt atcttgctca gcgaaggaag cagatgacgc gggttgctga 2940
cgaggacgaa gccccctccg ccaagcgtca acggggtaca acaggcgtct cttcgggtac 3000
gaagtcatct cttgaggcat taaagaaacg aagggcccag cagtctcgga agtcctcacg 3060
ccatggagtt gatgacgatg tgtatagtga cgatgatgtt aattaataac ttcgtataat 3120
gtatgctata cgaagttatt aacctggctc tttttctaga tatgtctgcg ccctgctcac 3180
tgcttactgg cctaagctgg tattacggac cttaatcaag tatcacccca aggcaatcga 3240
gagtcttatc gagtctctag gtagatagat acacgttttg atttttcggc ccactttgta 3300
gaaaaatctc agtgatttca tggaattcag ttacaaatac taatctgata aaccaagaac 3360
tacactcggt gttgagagca gaattaaagg gacttggcgt ctagcacaaa acgatacttg 3420
acgtcaccac tgtgaacgcg cttccaagct tcggcgatat agctgtactc aatcagctca 3480
acatcacagg tgatgttatt ttcaccacag aagtccagca tctcctgagt ctctggcaag 3540
ccaccaatgt ttgagtaagt gatagattta tttccagcca aatgagaggt cagaaccttg 3600
aggggtccaa tttgaccaac aacaacgaga cacccaccaa tatcaaggga cttgaggtat 3660
ggctcgaagt cgtgttcaaa gggaatggtg tcgatgatca ggtcaaatgt gccagcgacc 3720
gcctcgagct cattcggatc agaggaagca actacgcggc tagcaccttg tgctttcgct 3780
cctgcggctt tggcgtgact cctgctgaac agtgtgactt cagagcccat ggctgaggca 3840
aatttgatag ccatggaacc aaggcctccg agaccaacta caccgactct ttttccaggt 3900
ccggcgccgt gagccctcag aggagagtag gtagtgatac cagcacagag aaggggcgca 3960
gaagctgcca agtcgaggtt ggaggggatt ttgagcacaa actcctcgcg agcaagaatg 4020
tgttgcgaat accctccctt cgtgacttcc ccgttctttc cgctggaatt gtaagtttga 4080
gtgcgtgaaa cacaccaatt ttctttgcct aatttacagt tcttgcaagt acgacatgag 4140
t 4141
<210> 17
<211> 27
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 17
aattgttcga tggatagctt tggagtc 27
<210> 18
<211> 21
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 18
ttcggggctc ctgtcgttgt c 21
<210> 19
<211> 26
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 19
gaaatctgat caattctgca aacctg 26
<210> 20
<211> 27
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 20
atgactccta gaaaagaaat tgaccag 27
<210> 21
<211> 22
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 21
tgcagacaag ttcctgcagc tg 22
<210> 22
<211> 23
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 22
atgctttatt caggcacgct acg 23
<210> 23
<211> 21
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 23
ggatgagtcg cagtcacgaa c 21
<210> 24
<211> 26
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 24
tcaatcattg gctcaagact aggaac 26
<210> 25
<211> 22
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 25
attctggtgc tgacctcgcc ac 22
<210> 26
<211> 25
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 26
actcatgtcg tacttgcaag aactg 25
<210> 27
<211> 27
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 27
gtgtcgactc gccaaattcc atcggag 27
<210> 28
<211> 28
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 28
ggttcatagc gagtttcttt gcatgtgc 28
<210> 29
<211> 28
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 29
ctcctttatt aactccgcag catgactg 28
<210> 30
<211> 27
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 30
ctcctcgaag gaccctcaaa acaaagg 27
<210> 31
<211> 30
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 31
caaatttatc tgggagcaca gttacattgc 30
<210> 32
<211> 28
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 32
cacacattgc tttagtccag caagaacc 28
<210> 33
<211> 25
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 33
attctcctcg cacgtttctc ggggc 25
<210> 34
<211> 28
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 34
ggttgaaata cttgttgccg cactaaag 28
<210> 35
<211> 31
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 35
cgcttcctga attgagttgg tatcgttaat g 31
<210> 36
<211> 28
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 36
gacattgttg gaattggctg cttagtgg 28
<210> 37
<211> 3107
<212> DNA
<213> 人工的
<220>
<223> 表达盒
<400> 37
acaaacgacc ccgccccacc cctcacacgg ccttaccagc ccaggaagca atggcccgaa 60
cctcgtgggc taccgcactc cgtttggaaa cccaatagga actgcagcag cagggaactc 120
agctgctact ccagctggaa accctctagg gaaggtaaga gcagactctt caacgagcct 180
tactactcag ggacagcgaa gggtccgcgt gcatgtccag ggcgacacat ttctcatttt 240
ggtgccaccg gacctgaagt ttgagcatct ttccaatcgt gttgagcgca agctccgact 300
atgtgggaaa atgccgcctt caggccaggc aggctcactc atttttgaat acatggatga 360
agacgaggac cgcgtgcgac tggagagcga cgaggaccta agtgtggcgt ttgaggctgt 420
gcccgaccac catgagctgt ccgtctacgt caaaaactga cgattatgat ctaatgatat 480
ttaaaagata tgtaaaacgg ttattttttg gacctgcgcc ctaaaatggg actttgtcaa 540
aaaaagaacg gcctcctgcg cgatggagag caatcaagaa ttcggagttc cgatgcgaat 600
ccatcaagaa aacggcccct aggcaatcta aaaccgtggc cgacatacta taagtcaatt 660
ccgctgtaca aataacaagc gatcaatcca taatctgagg ctcatttcat acggactttt 720
ctaagttcac ataattctat gatgcatact aacaaatacg atgcacaaat gggtacaagg 780
cctaaagagg gccacaatcg cgatttactc gatacggcaa atcagttcca caagtaattc 840
gctatcgtcg gtgttgttat acacctctcg gcttgagtca atatcgagca tgcaaggttg 900
acgcattctg gggaaatgta tccacgtgat cgccgatatc ggagcggata cgctgtgtag 960
tcttcagttg taagatttct tatacagcga cgcaaccatc atgtctgtgc aaacgaaaac 1020
aattgttctt cttcctggag accactgtgg cccagaagtc gttgccgaag cagtgaaagt 1080
actcaaagcc gtggaaactg ctttaccatc ggttaccttc gagtttcagc accatttgat 1140
tggcggtgct gccatagatg ctgctggtgt tcccattacg gaagagactc ttgctgcctc 1200
tagaaaggct gacgctgttt tgcttggtgc tgtaggaggg cccaagtggg gcactggctc 1260
agtgagaccc gaacagggtc tcctcaagat tcgcaaggag cttcaattgt acgcgaatct 1320
gcgtccctgt aacatcattg ctccaaagtt tgccaagctc agtcctctga aggaggagaa 1380
tgttttggga accgacatta tgattgtacg agaactcaca ggtggaatct acttcggaga 1440
tcgcgaagaa gccgatatga gcacggccga ccctcatgcc acagatactg agaagtacag 1500
cgttagtgaa attacgcgca tcgctcgtat ggcaggcttt ttggctctgc aggcccaacc 1560
tccgctacct gtttggagct tggacaaggc caatgtgctt gcttccagcc gtttgtggcg 1620
cgaaaccgtc accaaggtgt tcaaagagga attccctcag ctcaaattgg agcatcagct 1680
cattgattcg gcggccatga ttttggtgaa gaaccctcga cagctcaatg gtgtcgttat 1740
caccaccaac atgttcggag acattttcag cgacgaggcg agtgttattc ctggctctct 1800
gggtctgcta ccctcagctt cgctcagtgg actgcctgac acaaactctg cctttggtct 1860
gtacgagcct tgtcacggct ctgctcccga cctcgctgct aacaaggcaa atccagtcgc 1920
taccattctc agcgcagcaa tgatgcttcg tctttcacta ggtcttcctg aagctgctga 1980
tgctgttgag aaagctgttt ccaacgtttt gaactcagtc gcggccacgg cagacattgg 2040
tggaacagcc tccaccacag aggtaggcga tgcaattgcc gcagagacgt tgaagcttct 2100
caaatagtct gctataaatt gacggagttt cgtacagtgc gctcgtacag tgcgctgcca 2160
aatacaattt agtgtagcca gattggatgg ttgaattgct cttcacggtt gcacgctatt 2220
ggcaaaaaag agagagccgc tctgaactgg ttcatccgca gctgaccttc gaaactcttt 2280
aatatttaat aatattgcag caaaatctat agcttatgcc acatctatac ggaagaggta 2340
ttcaacatta gagcttgtgt cgcccattct ctacacgagc ccacgcatca gcagtgaggg 2400
gcttgtagct cgtgccctct aaccagtaga ttgtttgtcc tgctggggcg ggaatctgct 2460
ggtttcggaa ttctttcttc tgaactttgt tgttgccggt gatggtgacg gtgtcgacga 2520
acttaatgaa tatcggcacg gcatagcgtg gcagcctttc caaaagatgc ttgccgagtt 2580
tatccatatc cagctgtttt ctaggattgt tgagcttgat cacagcaaat ccggcacgac 2640
cctcatgctt gggaacctgc acacctacac agacacacag atcgactcca ccgaagtcca 2700
caactgcttc ctcgacttcg tttgtgctaa cgttctcgct cttccatcga aacgtatccc 2760
cgagtcgatc aacaaagtag acgctatgat ctttatcagc cctcagaagg tctccgctgc 2820
gcacccaggc atctcccttc ttgaaaacat caaacacaag cttctcatcc gtggctgatt 2880
ggttgccgac atagccctgg aaatcgagtt tgatattctt cgggtcgagt ttgaaaagga 2940
attcacccgg ctcgtccgag tgtgtctcac ggcacaggcc ggttttggga tcgcgccata 3000
aatcctgcgt gtcaacatca atcgcggcga tgttccacct ggtacgatgc agcacgcggg 3060
tggccacagt accataatgg ccacatgcac caacaccata tgcacct 3107
<210> 38
<211> 45
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 38
atatatatac atatgttaat caaagacatt attctaactc caatg 45
<210> 39
<211> 42
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 39
atatatggcc ggccaactta agaaaaccgc acaaccacac cg 42
<210> 40
<211> 39
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 40
atatatatac atatgagccc ttcatcacac aaacccctg 39
<210> 41
<211> 39
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 41
atatatggcc ggccattcta agaactcacc gctaaggcc 39
<210> 42
<211> 38
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 42
atatatatac atatggttgt aaactcctcg aaggaccc 38
<210> 43
<211> 42
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 43
atatatggcc ggcctaccta gaccttctgg ttagcggtat tg 42
<210> 44
<211> 40
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 44
atatatatac atatggtgga tgatatacag gtagagaagc 40
<210> 45
<211> 42
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 45
atatatggcc ggccacgtca aatctctccg agaccttgca ag 42
<210> 46
<211> 40
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 46
atatatatac atatggccat cgagaaacca gtgatagttg 40
<210> 47
<211> 45
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 47
atatatggcc ggccaggtta agaagctaat tcactaattg ccgac 45
<210> 48
<211> 23
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 48
ggacctgcgc cctaaaatgg gac 23
<210> 49
<211> 30
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 49
atcctagaaa acagctggat atggataaac 30
<210> 50
<211> 23
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 50
gtgcccgacc accatgagct gtc 23
<210> 51
<211> 23
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 51
cccaagcatg agggtcgtgc cgg 23
<210> 52
<211> 1572
<212> DNA
<213> 假丝酵母菌
<220>
<221> CDS
<222> (1)..(1572)
<400> 52
atg att ctt tat gct gtg ctg ggc gca ttc gcc gcc ttc ttg ctt tac 48
Met Ile Leu Tyr Ala Val Leu Gly Ala Phe Ala Ala Phe Leu Leu Tyr
1 5 10 15
atg gat gta ctt tac cct ttc gtg att tac cct ctg aga gcg cga tgg 96
Met Asp Val Leu Tyr Pro Phe Val Ile Tyr Pro Leu Arg Ala Arg Trp
20 25 30
cac aaa tgt ggt tac atc cct aga gat ttg agc tgg cca ttg ggg att 144
His Lys Cys Gly Tyr Ile Pro Arg Asp Leu Ser Trp Pro Leu Gly Ile
35 40 45
cca ctc acc ctg gta gtt ctc tcg aag ttg agg aaa gat atg ctg ctg 192
Pro Leu Thr Leu Val Val Leu Ser Lys Leu Arg Lys Asp Met Leu Leu
50 55 60
caa ttc atg gca gcg caa gac ctt agt cgc cct tac aag aca tcc tta 240
Gln Phe Met Ala Ala Gln Asp Leu Ser Arg Pro Tyr Lys Thr Ser Leu
65 70 75 80
cgt caa ttt ctg ggt aaa tgg gta atc gcc act aga gat cct gag aac 288
Arg Gln Phe Leu Gly Lys Trp Val Ile Ala Thr Arg Asp Pro Glu Asn
85 90 95
atc aag gct gtt cta tcc acc aag ttc aat gac ttc tcg ctg aaa gaa 336
Ile Lys Ala Val Leu Ser Thr Lys Phe Asn Asp Phe Ser Leu Lys Glu
100 105 110
aga ggg aat agg atg agg cat gta atc ggt gat gga att ttt acc caa 384
Arg Gly Asn Arg Met Arg His Val Ile Gly Asp Gly Ile Phe Thr Gln
115 120 125
gat ggc gca cca tgg aag cac tcg cga gat atg ctc agg cct cag ttc 432
Asp Gly Ala Pro Trp Lys His Ser Arg Asp Met Leu Arg Pro Gln Phe
130 135 140
acc aag gat caa atc agc cga gtg gaa ttg ttg agc cac cac atc gac 480
Thr Lys Asp Gln Ile Ser Arg Val Glu Leu Leu Ser His His Ile Asp
145 150 155 160
gtt ttg att cgt gaa atc agg aag tcg gga ggt aac gtc gag ttg caa 528
Val Leu Ile Arg Glu Ile Arg Lys Ser Gly Gly Asn Val Glu Leu Gln
165 170 175
cgt tta ttc cac ctc atg act atg gac acc gcc act cac ttt cta ttc 576
Arg Leu Phe His Leu Met Thr Met Asp Thr Ala Thr His Phe Leu Phe
180 185 190
ggc gag tcc gtt ggc tcg ttg gag gtc agt ggc gaa agc aag ggc att 624
Gly Glu Ser Val Gly Ser Leu Glu Val Ser Gly Glu Ser Lys Gly Ile
195 200 205
gag atc acc gac cca aag act gga gag att gtg aac acc gtt gat ttt 672
Glu Ile Thr Asp Pro Lys Thr Gly Glu Ile Val Asn Thr Val Asp Phe
210 215 220
gtt gag tct tat act ttt gca aac aag ttt gct ctc aag aag att atc 720
Val Glu Ser Tyr Thr Phe Ala Asn Lys Phe Ala Leu Lys Lys Ile Ile
225 230 235 240
ctc aac gac ttg gag ttt tta gcc gac ttg acg gag ccc tcg tat aag 768
Leu Asn Asp Leu Glu Phe Leu Ala Asp Leu Thr Glu Pro Ser Tyr Lys
245 250 255
tgg cat ctg cgc cgt gtc cac aca gtc atg gat cac tac gtt cag ctg 816
Trp His Leu Arg Arg Val His Thr Val Met Asp His Tyr Val Gln Leu
260 265 270
gct ttg aag gct act gag aag tat gat cct gat gat gat agc gag aag 864
Ala Leu Lys Ala Thr Glu Lys Tyr Asp Pro Asp Asp Asp Ser Glu Lys
275 280 285
gga gaa tac tac ttt agc cat gag ctg gcg aaa ctc acg aga gac ccc 912
Gly Glu Tyr Tyr Phe Ser His Glu Leu Ala Lys Leu Thr Arg Asp Pro
290 295 300
ttg tcg ttg aga gat cag ctt ttc aat att ctc att gct ggc cgc gac 960
Leu Ser Leu Arg Asp Gln Leu Phe Asn Ile Leu Ile Ala Gly Arg Asp
305 310 315 320
act acc gca gca act ttg tcc tat gcc ttc cac tat cta acg aag aat 1008
Thr Thr Ala Ala Thr Leu Ser Tyr Ala Phe His Tyr Leu Thr Lys Asn
325 330 335
ccc gct atc tac gcc aag gtc cgc gaa gat gtg ctc acg gtc ttc cct 1056
Pro Ala Ile Tyr Ala Lys Val Arg Glu Asp Val Leu Thr Val Phe Pro
340 345 350
aat gga gac gca tca ttg gcg act tac gag gac ttg cga aag gct aag 1104
Asn Gly Asp Ala Ser Leu Ala Thr Tyr Glu Asp Leu Arg Lys Ala Lys
355 360 365
tat ctc caa atg gtg atc aag gag gta ttg cgt ctt gcg cct gcg gtt 1152
Tyr Leu Gln Met Val Ile Lys Glu Val Leu Arg Leu Ala Pro Ala Val
370 375 380
ccc ttg aac acg cgt gcc gcg gtt cgt gac aca tat ctg cca cgg ggc 1200
Pro Leu Asn Thr Arg Ala Ala Val Arg Asp Thr Tyr Leu Pro Arg Gly
385 390 395 400
gga ggc cca gcc gga aac ctg ccc gtt ttt gtt ccc aag ggc act gct 1248
Gly Gly Pro Ala Gly Asn Leu Pro Val Phe Val Pro Lys Gly Thr Ala
405 410 415
gtc aac tac cct aca tat att ttg cac cgc gat cca gat atc tat ggt 1296
Val Asn Tyr Pro Thr Tyr Ile Leu His Arg Asp Pro Asp Ile Tyr Gly
420 425 430
gcc gac gcg tac gag ttc aac ccc gag aga tgg agg cct gag aat aag 1344
Ala Asp Ala Tyr Glu Phe Asn Pro Glu Arg Trp Arg Pro Glu Asn Lys
435 440 445
ctt ccg aat agc cca atg tac tct tgg gga tac att ccc ttc aat ggt 1392
Leu Pro Asn Ser Pro Met Tyr Ser Trp Gly Tyr Ile Pro Phe Asn Gly
450 455 460
ggc cct cgc atc tgc att gga cag cag ttc gcc ttg act gag atc gct 1440
Gly Pro Arg Ile Cys Ile Gly Gln Gln Phe Ala Leu Thr Glu Ile Ala
465 470 475 480
ttg acg atg atc aag ctg gtt ctg gaa ttt gag agg ctg gag cct gcc 1488
Leu Thr Met Ile Lys Leu Val Leu Glu Phe Glu Arg Leu Glu Pro Ala
485 490 495
gac gac ttt gag ccc aat ctt caa gac aag tcc tct tta act gtc atg 1536
Asp Asp Phe Glu Pro Asn Leu Gln Asp Lys Ser Ser Leu Thr Val Met
500 505 510
gtc gga ggg tcg ggc gtc cga gtg aaa ctg agt taa 1572
Val Gly Gly Ser Gly Val Arg Val Lys Leu Ser
515 520
<210> 53
<211> 523
<212> PRT
<213> 假丝酵母菌
<400> 53
Met Ile Leu Tyr Ala Val Leu Gly Ala Phe Ala Ala Phe Leu Leu Tyr
1 5 10 15
Met Asp Val Leu Tyr Pro Phe Val Ile Tyr Pro Leu Arg Ala Arg Trp
20 25 30
His Lys Cys Gly Tyr Ile Pro Arg Asp Leu Ser Trp Pro Leu Gly Ile
35 40 45
Pro Leu Thr Leu Val Val Leu Ser Lys Leu Arg Lys Asp Met Leu Leu
50 55 60
Gln Phe Met Ala Ala Gln Asp Leu Ser Arg Pro Tyr Lys Thr Ser Leu
65 70 75 80
Arg Gln Phe Leu Gly Lys Trp Val Ile Ala Thr Arg Asp Pro Glu Asn
85 90 95
Ile Lys Ala Val Leu Ser Thr Lys Phe Asn Asp Phe Ser Leu Lys Glu
100 105 110
Arg Gly Asn Arg Met Arg His Val Ile Gly Asp Gly Ile Phe Thr Gln
115 120 125
Asp Gly Ala Pro Trp Lys His Ser Arg Asp Met Leu Arg Pro Gln Phe
130 135 140
Thr Lys Asp Gln Ile Ser Arg Val Glu Leu Leu Ser His His Ile Asp
145 150 155 160
Val Leu Ile Arg Glu Ile Arg Lys Ser Gly Gly Asn Val Glu Leu Gln
165 170 175
Arg Leu Phe His Leu Met Thr Met Asp Thr Ala Thr His Phe Leu Phe
180 185 190
Gly Glu Ser Val Gly Ser Leu Glu Val Ser Gly Glu Ser Lys Gly Ile
195 200 205
Glu Ile Thr Asp Pro Lys Thr Gly Glu Ile Val Asn Thr Val Asp Phe
210 215 220
Val Glu Ser Tyr Thr Phe Ala Asn Lys Phe Ala Leu Lys Lys Ile Ile
225 230 235 240
Leu Asn Asp Leu Glu Phe Leu Ala Asp Leu Thr Glu Pro Ser Tyr Lys
245 250 255
Trp His Leu Arg Arg Val His Thr Val Met Asp His Tyr Val Gln Leu
260 265 270
Ala Leu Lys Ala Thr Glu Lys Tyr Asp Pro Asp Asp Asp Ser Glu Lys
275 280 285
Gly Glu Tyr Tyr Phe Ser His Glu Leu Ala Lys Leu Thr Arg Asp Pro
290 295 300
Leu Ser Leu Arg Asp Gln Leu Phe Asn Ile Leu Ile Ala Gly Arg Asp
305 310 315 320
Thr Thr Ala Ala Thr Leu Ser Tyr Ala Phe His Tyr Leu Thr Lys Asn
325 330 335
Pro Ala Ile Tyr Ala Lys Val Arg Glu Asp Val Leu Thr Val Phe Pro
340 345 350
Asn Gly Asp Ala Ser Leu Ala Thr Tyr Glu Asp Leu Arg Lys Ala Lys
355 360 365
Tyr Leu Gln Met Val Ile Lys Glu Val Leu Arg Leu Ala Pro Ala Val
370 375 380
Pro Leu Asn Thr Arg Ala Ala Val Arg Asp Thr Tyr Leu Pro Arg Gly
385 390 395 400
Gly Gly Pro Ala Gly Asn Leu Pro Val Phe Val Pro Lys Gly Thr Ala
405 410 415
Val Asn Tyr Pro Thr Tyr Ile Leu His Arg Asp Pro Asp Ile Tyr Gly
420 425 430
Ala Asp Ala Tyr Glu Phe Asn Pro Glu Arg Trp Arg Pro Glu Asn Lys
435 440 445
Leu Pro Asn Ser Pro Met Tyr Ser Trp Gly Tyr Ile Pro Phe Asn Gly
450 455 460
Gly Pro Arg Ile Cys Ile Gly Gln Gln Phe Ala Leu Thr Glu Ile Ala
465 470 475 480
Leu Thr Met Ile Lys Leu Val Leu Glu Phe Glu Arg Leu Glu Pro Ala
485 490 495
Asp Asp Phe Glu Pro Asn Leu Gln Asp Lys Ser Ser Leu Thr Val Met
500 505 510
Val Gly Gly Ser Gly Val Arg Val Lys Leu Ser
515 520
<210> 54
<211> 1671
<212> DNA
<213> 假丝酵母菌
<220>
<221> CDS
<222> (1)..(1671)
<400> 54
atg agg ccc ctg ttg cgg gaa caa gac aca tca cac cca gag cta ttg 48
Met Arg Pro Leu Leu Arg Glu Gln Asp Thr Ser His Pro Glu Leu Leu
1 5 10 15
ttg gca agc aat act att ttt aac ccc ctt tcc aag agt gtc caa act 96
Leu Ala Ser Asn Thr Ile Phe Asn Pro Leu Ser Lys Ser Val Gln Thr
20 25 30
gtt caa tac ggc ctc atg aac att aat ttc tct gac gtg ctc gtg cta 144
Val Gln Tyr Gly Leu Met Asn Ile Asn Phe Ser Asp Val Leu Val Leu
35 40 45
gga ggc atc agc gtg agc ttt ttg ctc gcc tac cag gcg att tac ttt 192
Gly Gly Ile Ser Val Ser Phe Leu Leu Ala Tyr Gln Ala Ile Tyr Phe
50 55 60
tat ttc att tac tcg cca cga gcc aaa aag ctc ggt tgc gct ctt cca 240
Tyr Phe Ile Tyr Ser Pro Arg Ala Lys Lys Leu Gly Cys Ala Leu Pro
65 70 75 80
ccg gtc ttc ttc tct ttc cca ctc gga ata ccg gag gtc ata cgt ctt 288
Pro Val Phe Phe Ser Phe Pro Leu Gly Ile Pro Glu Val Ile Arg Leu
85 90 95
gtg aac gcc tgg ttc aac gat gat ctc ctt gag tat ttc acc ttc aaa 336
Val Asn Ala Trp Phe Asn Asp Asp Leu Leu Glu Tyr Phe Thr Phe Lys
100 105 110
ttc gag gag ttc cag cgc aaa acc gga ttc caa tca gtc gct ggg caa 384
Phe Glu Glu Phe Gln Arg Lys Thr Gly Phe Gln Ser Val Ala Gly Gln
115 120 125
cta tgg att ggg act att gag ccc gag aac atc aag act atg ctc gct 432
Leu Trp Ile Gly Thr Ile Glu Pro Glu Asn Ile Lys Thr Met Leu Ala
130 135 140
act tca ttt aaa gac tac tcc cta ggc ttc cgt tac gag gcc atg tac 480
Thr Ser Phe Lys Asp Tyr Ser Leu Gly Phe Arg Tyr Glu Ala Met Tyr
145 150 155 160
ggc ctt ctc gga aat ggc att ttc act ctc agt ggt gag ggc tgg aag 528
Gly Leu Leu Gly Asn Gly Ile Phe Thr Leu Ser Gly Glu Gly Trp Lys
165 170 175
cac agc cgc gct ttg ttg cgt ccg caa ttt agt cgt gag caa gtc tct 576
His Ser Arg Ala Leu Leu Arg Pro Gln Phe Ser Arg Glu Gln Val Ser
180 185 190
cac ctt gaa tca atg cgc aca cac atc aat atg ttg atc aac aac cac 624
His Leu Glu Ser Met Arg Thr His Ile Asn Met Leu Ile Asn Asn His
195 200 205
ttc aag ggt ggc aaa gtc gtc gat gct cag gtt ttg ttc cac aat cta 672
Phe Lys Gly Gly Lys Val Val Asp Ala Gln Val Leu Phe His Asn Leu
210 215 220
acc att gat act gct acc gaa ttc cta ttc gga gag agc acc aac act 720
Thr Ile Asp Thr Ala Thr Glu Phe Leu Phe Gly Glu Ser Thr Asn Thr
225 230 235 240
ctt gac cct gct ctt gct cag cat gga ttc cct gga cct aag ggt ctt 768
Leu Asp Pro Ala Leu Ala Gln His Gly Phe Pro Gly Pro Lys Gly Leu
245 250 255
gta acc ggt gag cag ttt gct gag gct ttt acc tct gct ctc gaa ttg 816
Val Thr Gly Glu Gln Phe Ala Glu Ala Phe Thr Ser Ala Leu Glu Leu
260 265 270
ctt tct gtg cga gtt atg gcc ggc gcc gca tgg ttc ctc gtt tgg acc 864
Leu Ser Val Arg Val Met Ala Gly Ala Ala Trp Phe Leu Val Trp Thr
275 280 285
ccc aaa ttc tgg cgc tca tgc aaa gtc tgc cac aac ttc att gat tac 912
Pro Lys Phe Trp Arg Ser Cys Lys Val Cys His Asn Phe Ile Asp Tyr
290 295 300
ttc gtt ttc aag gct ctg gcc act cct atg gag aag gac cag gaa gct 960
Phe Val Phe Lys Ala Leu Ala Thr Pro Met Glu Lys Asp Gln Glu Ala
305 310 315 320
gat cgc tac gtc ttt att cga gaa ctc aca aag gag acc tct gac cca 1008
Asp Arg Tyr Val Phe Ile Arg Glu Leu Thr Lys Glu Thr Ser Asp Pro
325 330 335
cgg gtc atc cgc gac cag gcc ctc aac atc ctc ttg gct ggt cgt gat 1056
Arg Val Ile Arg Asp Gln Ala Leu Asn Ile Leu Leu Ala Gly Arg Asp
340 345 350
acc act gcg gca ctt ctc agc ttc acc acc tac tac ctt ggt gcc tac 1104
Thr Thr Ala Ala Leu Leu Ser Phe Thr Thr Tyr Tyr Leu Gly Ala Tyr
355 360 365
cct gag gtc tac gat gag ctt cgc gag gct gtt att gcg gac ttc ggc 1152
Pro Glu Val Tyr Asp Glu Leu Arg Glu Ala Val Ile Ala Asp Phe Gly
370 375 380
aag gaa gat gct gag ccc cct acg ttt gag cag ctt aag cag tgc aag 1200
Lys Glu Asp Ala Glu Pro Pro Thr Phe Glu Gln Leu Lys Gln Cys Lys
385 390 395 400
gtg cta cag aac gtc att cgg gaa gtt ttg cga ttg cac ccg aat gtg 1248
Val Leu Gln Asn Val Ile Arg Glu Val Leu Arg Leu His Pro Asn Val
405 410 415
ccc ctc aac ttc cgc gag gcc att acc gat act aag ttc ccc aca gga 1296
Pro Leu Asn Phe Arg Glu Ala Ile Thr Asp Thr Lys Phe Pro Thr Gly
420 425 430
ggc ggc ccg aat gga gac cag ccc gtt ttc gtt ccc aag gga cag aaa 1344
Gly Gly Pro Asn Gly Asp Gln Pro Val Phe Val Pro Lys Gly Gln Lys
435 440 445
gtg ttt tac gcc acc tac gtc atg cag cga aat gag ggt ctc tgg ggt 1392
Val Phe Tyr Ala Thr Tyr Val Met Gln Arg Asn Glu Gly Leu Trp Gly
450 455 460
cct gac tcc aca aca ttc cgc cct gac cgc tgg aac gag tca aga gag 1440
Pro Asp Ser Thr Thr Phe Arg Pro Asp Arg Trp Asn Glu Ser Arg Glu
465 470 475 480
gcc atc gca tcc gga tgg gac tac att cct ttc aac ggc ggc cct cgt 1488
Ala Ile Ala Ser Gly Trp Asp Tyr Ile Pro Phe Asn Gly Gly Pro Arg
485 490 495
att tgc ctg ggt cag cag ttc gct ctc aca gag gcg agc tac acg ctc 1536
Ile Cys Leu Gly Gln Gln Phe Ala Leu Thr Glu Ala Ser Tyr Thr Leu
500 505 510
gtg cgt atc tgc caa gag ttc tcc agg att gag gtt ctc cac cct gat 1584
Val Arg Ile Cys Gln Glu Phe Ser Arg Ile Glu Val Leu His Pro Asp
515 520 525
gtt att acc tcc agg aac gtg atg aaa cag cgc atg cgt ttg acc aac 1632
Val Ile Thr Ser Arg Asn Val Met Lys Gln Arg Met Arg Leu Thr Asn
530 535 540
tct tcc agc ggc ggc gtc ata gcg aag ttc att cgc tag 1671
Ser Ser Ser Gly Gly Val Ile Ala Lys Phe Ile Arg
545 550 555
<210> 55
<211> 556
<212> PRT
<213> 假丝酵母菌
<400> 55
Met Arg Pro Leu Leu Arg Glu Gln Asp Thr Ser His Pro Glu Leu Leu
1 5 10 15
Leu Ala Ser Asn Thr Ile Phe Asn Pro Leu Ser Lys Ser Val Gln Thr
20 25 30
Val Gln Tyr Gly Leu Met Asn Ile Asn Phe Ser Asp Val Leu Val Leu
35 40 45
Gly Gly Ile Ser Val Ser Phe Leu Leu Ala Tyr Gln Ala Ile Tyr Phe
50 55 60
Tyr Phe Ile Tyr Ser Pro Arg Ala Lys Lys Leu Gly Cys Ala Leu Pro
65 70 75 80
Pro Val Phe Phe Ser Phe Pro Leu Gly Ile Pro Glu Val Ile Arg Leu
85 90 95
Val Asn Ala Trp Phe Asn Asp Asp Leu Leu Glu Tyr Phe Thr Phe Lys
100 105 110
Phe Glu Glu Phe Gln Arg Lys Thr Gly Phe Gln Ser Val Ala Gly Gln
115 120 125
Leu Trp Ile Gly Thr Ile Glu Pro Glu Asn Ile Lys Thr Met Leu Ala
130 135 140
Thr Ser Phe Lys Asp Tyr Ser Leu Gly Phe Arg Tyr Glu Ala Met Tyr
145 150 155 160
Gly Leu Leu Gly Asn Gly Ile Phe Thr Leu Ser Gly Glu Gly Trp Lys
165 170 175
His Ser Arg Ala Leu Leu Arg Pro Gln Phe Ser Arg Glu Gln Val Ser
180 185 190
His Leu Glu Ser Met Arg Thr His Ile Asn Met Leu Ile Asn Asn His
195 200 205
Phe Lys Gly Gly Lys Val Val Asp Ala Gln Val Leu Phe His Asn Leu
210 215 220
Thr Ile Asp Thr Ala Thr Glu Phe Leu Phe Gly Glu Ser Thr Asn Thr
225 230 235 240
Leu Asp Pro Ala Leu Ala Gln His Gly Phe Pro Gly Pro Lys Gly Leu
245 250 255
Val Thr Gly Glu Gln Phe Ala Glu Ala Phe Thr Ser Ala Leu Glu Leu
260 265 270
Leu Ser Val Arg Val Met Ala Gly Ala Ala Trp Phe Leu Val Trp Thr
275 280 285
Pro Lys Phe Trp Arg Ser Cys Lys Val Cys His Asn Phe Ile Asp Tyr
290 295 300
Phe Val Phe Lys Ala Leu Ala Thr Pro Met Glu Lys Asp Gln Glu Ala
305 310 315 320
Asp Arg Tyr Val Phe Ile Arg Glu Leu Thr Lys Glu Thr Ser Asp Pro
325 330 335
Arg Val Ile Arg Asp Gln Ala Leu Asn Ile Leu Leu Ala Gly Arg Asp
340 345 350
Thr Thr Ala Ala Leu Leu Ser Phe Thr Thr Tyr Tyr Leu Gly Ala Tyr
355 360 365
Pro Glu Val Tyr Asp Glu Leu Arg Glu Ala Val Ile Ala Asp Phe Gly
370 375 380
Lys Glu Asp Ala Glu Pro Pro Thr Phe Glu Gln Leu Lys Gln Cys Lys
385 390 395 400
Val Leu Gln Asn Val Ile Arg Glu Val Leu Arg Leu His Pro Asn Val
405 410 415
Pro Leu Asn Phe Arg Glu Ala Ile Thr Asp Thr Lys Phe Pro Thr Gly
420 425 430
Gly Gly Pro Asn Gly Asp Gln Pro Val Phe Val Pro Lys Gly Gln Lys
435 440 445
Val Phe Tyr Ala Thr Tyr Val Met Gln Arg Asn Glu Gly Leu Trp Gly
450 455 460
Pro Asp Ser Thr Thr Phe Arg Pro Asp Arg Trp Asn Glu Ser Arg Glu
465 470 475 480
Ala Ile Ala Ser Gly Trp Asp Tyr Ile Pro Phe Asn Gly Gly Pro Arg
485 490 495
Ile Cys Leu Gly Gln Gln Phe Ala Leu Thr Glu Ala Ser Tyr Thr Leu
500 505 510
Val Arg Ile Cys Gln Glu Phe Ser Arg Ile Glu Val Leu His Pro Asp
515 520 525
Val Ile Thr Ser Arg Asn Val Met Lys Gln Arg Met Arg Leu Thr Asn
530 535 540
Ser Ser Ser Gly Gly Val Ile Ala Lys Phe Ile Arg
545 550 555
<210> 56
<211> 1560
<212> DNA
<213> 假丝酵母菌
<220>
<221> CDS
<222> (1)..(1560)
<400> 56
atg att att gat ctt tca gac gcg ctg ata ata gga ggc atc gcc ctg 48
Met Ile Ile Asp Leu Ser Asp Ala Leu Ile Ile Gly Gly Ile Ala Leu
1 5 10 15
tgc ttc ttg ctc tcc tac cag gcg atc tac ttt tac ttt att tac tcg 96
Cys Phe Leu Leu Ser Tyr Gln Ala Ile Tyr Phe Tyr Phe Ile Tyr Ser
20 25 30
cca cgg gcc aag aag ctt gga tgc gct cct cct ctc att gtg cac gct 144
Pro Arg Ala Lys Lys Leu Gly Cys Ala Pro Pro Leu Ile Val His Ala
35 40 45
ttc cca ctg ggt ttg ccg aca att ttc gga ctt ata aga gct tgg cgc 192
Phe Pro Leu Gly Leu Pro Thr Ile Phe Gly Leu Ile Arg Ala Trp Arg
50 55 60
aac gac gat ctt ctc cag tac ttg agc gac aac ttc gct aga atc agg 240
Asn Asp Asp Leu Leu Gln Tyr Leu Ser Asp Asn Phe Ala Arg Ile Arg
65 70 75 80
acc aga acc gga atg caa gta atg gcc ggt cag ctg tgg ctc aac acc 288
Thr Arg Thr Gly Met Gln Val Met Ala Gly Gln Leu Trp Leu Asn Thr
85 90 95
att gag cca gaa aac atc aag gcc atg ctt gcc act tcg ttc aag gat 336
Ile Glu Pro Glu Asn Ile Lys Ala Met Leu Ala Thr Ser Phe Lys Asp
100 105 110
ttc tcg ctt ggg ttc cgc tat gaa gtc atg cat ggc ctc ctc gga gat 384
Phe Ser Leu Gly Phe Arg Tyr Glu Val Met His Gly Leu Leu Gly Asp
115 120 125
ggt atc ttc act ctc agt ggt gag ggc tgg aaa cac agc cgt gcc ttg 432
Gly Ile Phe Thr Leu Ser Gly Glu Gly Trp Lys His Ser Arg Ala Leu
130 135 140
cta cgt cca cag ttc agc cgt gag caa gtc tct cac ttg gac tca atg 480
Leu Arg Pro Gln Phe Ser Arg Glu Gln Val Ser His Leu Asp Ser Met
145 150 155 160
cgc aca cac atc aat ttg atg atc aac aac cac ttc aaa ggt ggc cag 528
Arg Thr His Ile Asn Leu Met Ile Asn Asn His Phe Lys Gly Gly Gln
165 170 175
gtc gtc gac gct cag gtt cta tac cat aac ctg aca atc gac act gcc 576
Val Val Asp Ala Gln Val Leu Tyr His Asn Leu Thr Ile Asp Thr Ala
180 185 190
act gaa ttc ctg ttc ggt gag agc acc aac act ctt gac cct gtt ctt 624
Thr Glu Phe Leu Phe Gly Glu Ser Thr Asn Thr Leu Asp Pro Val Leu
195 200 205
gca cag cag gga cta ccg ggt cct agg ggc gtt gtt act ggt gag cag 672
Ala Gln Gln Gly Leu Pro Gly Pro Arg Gly Val Val Thr Gly Glu Gln
210 215 220
ttc gct aac gct ttc acc tac gct caa gag ttg ctc agt att cga gtc 720
Phe Ala Asn Ala Phe Thr Tyr Ala Gln Glu Leu Leu Ser Ile Arg Val
225 230 235 240
atg gcc ggc tca gca tgg ttc ctc gtc tgg act cct aag ttc agg cgc 768
Met Ala Gly Ser Ala Trp Phe Leu Val Trp Thr Pro Lys Phe Arg Arg
245 250 255
tcg tgc aag gtg tgc cac aac ttt att gac tac ttc gtc ttt aag gct 816
Ser Cys Lys Val Cys His Asn Phe Ile Asp Tyr Phe Val Phe Lys Ala
260 265 270
ctg gcc act cct atg gag aaa gac cag gag gct gat cgc tat gta ttc 864
Leu Ala Thr Pro Met Glu Lys Asp Gln Glu Ala Asp Arg Tyr Val Phe
275 280 285
atc cga gaa ctc act aag gag act tct gac cca aag gtt ata cgt gac 912
Ile Arg Glu Leu Thr Lys Glu Thr Ser Asp Pro Lys Val Ile Arg Asp
290 295 300
cag gct ctc aac atc ctt tta gct ggc cgc gat acc act gca gca ctc 960
Gln Ala Leu Asn Ile Leu Leu Ala Gly Arg Asp Thr Thr Ala Ala Leu
305 310 315 320
ctc agc ttc acc act tac tac ctt ggc gca tat cct gag gtc tac gac 1008
Leu Ser Phe Thr Thr Tyr Tyr Leu Gly Ala Tyr Pro Glu Val Tyr Asp
325 330 335
gag ctt cgc gag gca gtt ctt gca gac ttc ggc cct gcc gat tct gag 1056
Glu Leu Arg Glu Ala Val Leu Ala Asp Phe Gly Pro Ala Asp Ser Glu
340 345 350
ccc cct acc ttt gag agg ctc aag cag tgc aag gtg ttg cag aat gtc 1104
Pro Pro Thr Phe Glu Arg Leu Lys Gln Cys Lys Val Leu Gln Asn Val
355 360 365
atc cgc gag gtt ctg cga ttg cac ccg aat gtg ccc ctc aac ttc cgc 1152
Ile Arg Glu Val Leu Arg Leu His Pro Asn Val Pro Leu Asn Phe Arg
370 375 380
cag gcc atc gtt gat act aag ttc cct act ggt ggt ggc ccg aat aga 1200
Gln Ala Ile Val Asp Thr Lys Phe Pro Thr Gly Gly Gly Pro Asn Arg
385 390 395 400
gac cag ccc atc ttt gtt cca aaa gga cag aag gtg ttc tac tcc acg 1248
Asp Gln Pro Ile Phe Val Pro Lys Gly Gln Lys Val Phe Tyr Ser Thr
405 410 415
tac gtc atg cag cga agc aag gac atc tgg ggc gct gac tcc aca tcg 1296
Tyr Val Met Gln Arg Ser Lys Asp Ile Trp Gly Ala Asp Ser Thr Ser
420 425 430
ttc cga cca gaa cgc tgg aac gag ccc aga gaa gct ctt gca tca ggt 1344
Phe Arg Pro Glu Arg Trp Asn Glu Pro Arg Glu Ala Leu Ala Ser Gly
435 440 445
tgg gat tac att cct ttc aat ggt ggc cct cgc att tgt atc ggt cag 1392
Trp Asp Tyr Ile Pro Phe Asn Gly Gly Pro Arg Ile Cys Ile Gly Gln
450 455 460
cag ttc gct ctc act gag gct agc tac acg ctt gtc cgt att tgc cag 1440
Gln Phe Ala Leu Thr Glu Ala Ser Tyr Thr Leu Val Arg Ile Cys Gln
465 470 475 480
gag ttt acc aga att gag gtt ctt cat ccc gat gtc att act tct agg 1488
Glu Phe Thr Arg Ile Glu Val Leu His Pro Asp Val Ile Thr Ser Arg
485 490 495
aaa gag atg aag cag cgc atg cgc ttg acc aac tcg gct agc ggt ggc 1536
Lys Glu Met Lys Gln Arg Met Arg Leu Thr Asn Ser Ala Ser Gly Gly
500 505 510
gtg atg gcg aga ttc att cgt tag 1560
Val Met Ala Arg Phe Ile Arg
515
<210> 57
<211> 519
<212> PRT
<213> 假丝酵母菌
<400> 57
Met Ile Ile Asp Leu Ser Asp Ala Leu Ile Ile Gly Gly Ile Ala Leu
1 5 10 15
Cys Phe Leu Leu Ser Tyr Gln Ala Ile Tyr Phe Tyr Phe Ile Tyr Ser
20 25 30
Pro Arg Ala Lys Lys Leu Gly Cys Ala Pro Pro Leu Ile Val His Ala
35 40 45
Phe Pro Leu Gly Leu Pro Thr Ile Phe Gly Leu Ile Arg Ala Trp Arg
50 55 60
Asn Asp Asp Leu Leu Gln Tyr Leu Ser Asp Asn Phe Ala Arg Ile Arg
65 70 75 80
Thr Arg Thr Gly Met Gln Val Met Ala Gly Gln Leu Trp Leu Asn Thr
85 90 95
Ile Glu Pro Glu Asn Ile Lys Ala Met Leu Ala Thr Ser Phe Lys Asp
100 105 110
Phe Ser Leu Gly Phe Arg Tyr Glu Val Met His Gly Leu Leu Gly Asp
115 120 125
Gly Ile Phe Thr Leu Ser Gly Glu Gly Trp Lys His Ser Arg Ala Leu
130 135 140
Leu Arg Pro Gln Phe Ser Arg Glu Gln Val Ser His Leu Asp Ser Met
145 150 155 160
Arg Thr His Ile Asn Leu Met Ile Asn Asn His Phe Lys Gly Gly Gln
165 170 175
Val Val Asp Ala Gln Val Leu Tyr His Asn Leu Thr Ile Asp Thr Ala
180 185 190
Thr Glu Phe Leu Phe Gly Glu Ser Thr Asn Thr Leu Asp Pro Val Leu
195 200 205
Ala Gln Gln Gly Leu Pro Gly Pro Arg Gly Val Val Thr Gly Glu Gln
210 215 220
Phe Ala Asn Ala Phe Thr Tyr Ala Gln Glu Leu Leu Ser Ile Arg Val
225 230 235 240
Met Ala Gly Ser Ala Trp Phe Leu Val Trp Thr Pro Lys Phe Arg Arg
245 250 255
Ser Cys Lys Val Cys His Asn Phe Ile Asp Tyr Phe Val Phe Lys Ala
260 265 270
Leu Ala Thr Pro Met Glu Lys Asp Gln Glu Ala Asp Arg Tyr Val Phe
275 280 285
Ile Arg Glu Leu Thr Lys Glu Thr Ser Asp Pro Lys Val Ile Arg Asp
290 295 300
Gln Ala Leu Asn Ile Leu Leu Ala Gly Arg Asp Thr Thr Ala Ala Leu
305 310 315 320
Leu Ser Phe Thr Thr Tyr Tyr Leu Gly Ala Tyr Pro Glu Val Tyr Asp
325 330 335
Glu Leu Arg Glu Ala Val Leu Ala Asp Phe Gly Pro Ala Asp Ser Glu
340 345 350
Pro Pro Thr Phe Glu Arg Leu Lys Gln Cys Lys Val Leu Gln Asn Val
355 360 365
Ile Arg Glu Val Leu Arg Leu His Pro Asn Val Pro Leu Asn Phe Arg
370 375 380
Gln Ala Ile Val Asp Thr Lys Phe Pro Thr Gly Gly Gly Pro Asn Arg
385 390 395 400
Asp Gln Pro Ile Phe Val Pro Lys Gly Gln Lys Val Phe Tyr Ser Thr
405 410 415
Tyr Val Met Gln Arg Ser Lys Asp Ile Trp Gly Ala Asp Ser Thr Ser
420 425 430
Phe Arg Pro Glu Arg Trp Asn Glu Pro Arg Glu Ala Leu Ala Ser Gly
435 440 445
Trp Asp Tyr Ile Pro Phe Asn Gly Gly Pro Arg Ile Cys Ile Gly Gln
450 455 460
Gln Phe Ala Leu Thr Glu Ala Ser Tyr Thr Leu Val Arg Ile Cys Gln
465 470 475 480
Glu Phe Thr Arg Ile Glu Val Leu His Pro Asp Val Ile Thr Ser Arg
485 490 495
Lys Glu Met Lys Gln Arg Met Arg Leu Thr Asn Ser Ala Ser Gly Gly
500 505 510
Val Met Ala Arg Phe Ile Arg
515
<210> 58
<211> 1572
<212> DNA
<213> 假丝酵母菌
<220>
<221> CDS
<222> (1)..(1572)
<400> 58
atg att ttt tat gct gtg ctt ggc gct gtg gtc acc ttc tta ctt tac 48
Met Ile Phe Tyr Ala Val Leu Gly Ala Val Val Thr Phe Leu Leu Tyr
1 5 10 15
gta gat gtg atc tac cct ttc gtg ata tat cct tta aaa gca cga tgg 96
Val Asp Val Ile Tyr Pro Phe Val Ile Tyr Pro Leu Lys Ala Arg Trp
20 25 30
cac aaa tgt ggc tcc gta cct cga gag ctt agc tgg cca ttg ggg att 144
His Lys Cys Gly Ser Val Pro Arg Glu Leu Ser Trp Pro Leu Gly Ile
35 40 45
cca acc acc ata gga gtt ttt tcg aac ata aag aag gat cta cat ctt 192
Pro Thr Thr Ile Gly Val Phe Ser Asn Ile Lys Lys Asp Leu His Leu
50 55 60
caa gtc ctg gca gcg tac gac ctc agc cgg tct tat aag aca agc ttg 240
Gln Val Leu Ala Ala Tyr Asp Leu Ser Arg Ser Tyr Lys Thr Ser Leu
65 70 75 80
cgt caa agt ctc ggc aca tgg gta gtt gct acg cgg gat cct gag aac 288
Arg Gln Ser Leu Gly Thr Trp Val Val Ala Thr Arg Asp Pro Glu Asn
85 90 95
atc aag gcc gtt ttg tct acc aag ttc aat gac ttt tca ctg aaa gag 336
Ile Lys Ala Val Leu Ser Thr Lys Phe Asn Asp Phe Ser Leu Lys Glu
100 105 110
aga gga att cgg tta agg cat gta att ggt gat ggt atc ttt acc caa 384
Arg Gly Ile Arg Leu Arg His Val Ile Gly Asp Gly Ile Phe Thr Gln
115 120 125
gat ggt gca ccg tgg aag cac tcg cga gat atg ctc aga cct caa ttc 432
Asp Gly Ala Pro Trp Lys His Ser Arg Asp Met Leu Arg Pro Gln Phe
130 135 140
agt agg gaa caa atc agc cgc gtg gag gtg ttg agt cac cac atc gat 480
Ser Arg Glu Gln Ile Ser Arg Val Glu Val Leu Ser His His Ile Asp
145 150 155 160
gtt ttg att cgt gag atc aaa aag tcg gga ggt aat gtt gag ttg caa 528
Val Leu Ile Arg Glu Ile Lys Lys Ser Gly Gly Asn Val Glu Leu Gln
165 170 175
cga cta ttc cac ctc atg act atg gac acc gcc aca cag ttt ctt ttc 576
Arg Leu Phe His Leu Met Thr Met Asp Thr Ala Thr Gln Phe Leu Phe
180 185 190
ggc gaa tca att ggc tcg cta gaa gtc agt ggc gac agc aag ggc att 624
Gly Glu Ser Ile Gly Ser Leu Glu Val Ser Gly Asp Ser Lys Gly Ile
195 200 205
gag att act gac cca aat act gga gat att gtg agt acc gtt gac ttc 672
Glu Ile Thr Asp Pro Asn Thr Gly Asp Ile Val Ser Thr Val Asp Phe
210 215 220
gtt gag tct tat act ttc aca aac aga ttt gct atg aag aag gta ttc 720
Val Glu Ser Tyr Thr Phe Thr Asn Arg Phe Ala Met Lys Lys Val Phe
225 230 235 240
ctg aac aaa tgg gaa ttc ttg gca aac ttg tcg aac ccc tca tat gag 768
Leu Asn Lys Trp Glu Phe Leu Ala Asn Leu Ser Asn Pro Ser Tyr Glu
245 250 255
agg cat atg cgg cgt gtc cac aca gtc ctg gat cac tac gtt cag ctg 816
Arg His Met Arg Arg Val His Thr Val Leu Asp His Tyr Val Gln Leu
260 265 270
gct ttg aag gct act gag aag tat gat cct gaa gat gac agc gag aaa 864
Ala Leu Lys Ala Thr Glu Lys Tyr Asp Pro Glu Asp Asp Ser Glu Lys
275 280 285
gga gaa tac tac ttt agc cat gag ctg gct aaa ctc acg aga gac ccc 912
Gly Glu Tyr Tyr Phe Ser His Glu Leu Ala Lys Leu Thr Arg Asp Pro
290 295 300
ttg tcg ttg cgc aat cag ctt ttt aat atc ctg att gct ggc cgc gac 960
Leu Ser Leu Arg Asn Gln Leu Phe Asn Ile Leu Ile Ala Gly Arg Asp
305 310 315 320
act acc gca gca aca ttg tcc tat gcc ttc cat tac tta acg aag aac 1008
Thr Thr Ala Ala Thr Leu Ser Tyr Ala Phe His Tyr Leu Thr Lys Asn
325 330 335
cca gcc atc tac gcc aag gtt cgc gaa gat gtg ctc acc gtc ttc ccc 1056
Pro Ala Ile Tyr Ala Lys Val Arg Glu Asp Val Leu Thr Val Phe Pro
340 345 350
gat gga gac gcc tca ttg gcg acc ttt gag gac ttg cga aag gcc aag 1104
Asp Gly Asp Ala Ser Leu Ala Thr Phe Glu Asp Leu Arg Lys Ala Lys
355 360 365
tat ctc caa atg gta atc aag gag gta ttg cgc ctt gcg cct gcg gtt 1152
Tyr Leu Gln Met Val Ile Lys Glu Val Leu Arg Leu Ala Pro Ala Val
370 375 380
ccc aca aat tcg cgt act gcg gtt cgt gac acc tat ctg cca cgg ggt 1200
Pro Thr Asn Ser Arg Thr Ala Val Arg Asp Thr Tyr Leu Pro Arg Gly
385 390 395 400
gga ggc cca gct gga aac cta ccc gtt ttc gtt ccc aag ggc act att 1248
Gly Gly Pro Ala Gly Asn Leu Pro Val Phe Val Pro Lys Gly Thr Ile
405 410 415
atc agg tat cct gca tat atc ttg cac cgc gat cct gat ata tat ggt 1296
Ile Arg Tyr Pro Ala Tyr Ile Leu His Arg Asp Pro Asp Ile Tyr Gly
420 425 430
gcc gac tcg tat gac ttc aac cct gag agg tgg aga ccc gag aat aag 1344
Ala Asp Ser Tyr Asp Phe Asn Pro Glu Arg Trp Arg Pro Glu Asn Lys
435 440 445
ctc cca ggt agc cca atg tac tca tgg ggc tat att ccc ttt aat ggc 1392
Leu Pro Gly Ser Pro Met Tyr Ser Trp Gly Tyr Ile Pro Phe Asn Gly
450 455 460
ggc cct cgc att tgc gtt gga cag cag ttt gcc ttg act gaa atc gct 1440
Gly Pro Arg Ile Cys Val Gly Gln Gln Phe Ala Leu Thr Glu Ile Ala
465 470 475 480
ttg aca atg atc aag ctg gtt ttg gaa ttt gag agg ctg gag cct gct 1488
Leu Thr Met Ile Lys Leu Val Leu Glu Phe Glu Arg Leu Glu Pro Ala
485 490 495
gat gac ttt gag ccc aat ctt cga gat agg acc tca tta act tcc atg 1536
Asp Asp Phe Glu Pro Asn Leu Arg Asp Arg Thr Ser Leu Thr Ser Met
500 505 510
gtc gga ggg tcg ggc gtc cga gta aaa ctg agt taa 1572
Val Gly Gly Ser Gly Val Arg Val Lys Leu Ser
515 520
<210> 59
<211> 523
<212> PRT
<213> 假丝酵母菌
<400> 59
Met Ile Phe Tyr Ala Val Leu Gly Ala Val Val Thr Phe Leu Leu Tyr
1 5 10 15
Val Asp Val Ile Tyr Pro Phe Val Ile Tyr Pro Leu Lys Ala Arg Trp
20 25 30
His Lys Cys Gly Ser Val Pro Arg Glu Leu Ser Trp Pro Leu Gly Ile
35 40 45
Pro Thr Thr Ile Gly Val Phe Ser Asn Ile Lys Lys Asp Leu His Leu
50 55 60
Gln Val Leu Ala Ala Tyr Asp Leu Ser Arg Ser Tyr Lys Thr Ser Leu
65 70 75 80
Arg Gln Ser Leu Gly Thr Trp Val Val Ala Thr Arg Asp Pro Glu Asn
85 90 95
Ile Lys Ala Val Leu Ser Thr Lys Phe Asn Asp Phe Ser Leu Lys Glu
100 105 110
Arg Gly Ile Arg Leu Arg His Val Ile Gly Asp Gly Ile Phe Thr Gln
115 120 125
Asp Gly Ala Pro Trp Lys His Ser Arg Asp Met Leu Arg Pro Gln Phe
130 135 140
Ser Arg Glu Gln Ile Ser Arg Val Glu Val Leu Ser His His Ile Asp
145 150 155 160
Val Leu Ile Arg Glu Ile Lys Lys Ser Gly Gly Asn Val Glu Leu Gln
165 170 175
Arg Leu Phe His Leu Met Thr Met Asp Thr Ala Thr Gln Phe Leu Phe
180 185 190
Gly Glu Ser Ile Gly Ser Leu Glu Val Ser Gly Asp Ser Lys Gly Ile
195 200 205
Glu Ile Thr Asp Pro Asn Thr Gly Asp Ile Val Ser Thr Val Asp Phe
210 215 220
Val Glu Ser Tyr Thr Phe Thr Asn Arg Phe Ala Met Lys Lys Val Phe
225 230 235 240
Leu Asn Lys Trp Glu Phe Leu Ala Asn Leu Ser Asn Pro Ser Tyr Glu
245 250 255
Arg His Met Arg Arg Val His Thr Val Leu Asp His Tyr Val Gln Leu
260 265 270
Ala Leu Lys Ala Thr Glu Lys Tyr Asp Pro Glu Asp Asp Ser Glu Lys
275 280 285
Gly Glu Tyr Tyr Phe Ser His Glu Leu Ala Lys Leu Thr Arg Asp Pro
290 295 300
Leu Ser Leu Arg Asn Gln Leu Phe Asn Ile Leu Ile Ala Gly Arg Asp
305 310 315 320
Thr Thr Ala Ala Thr Leu Ser Tyr Ala Phe His Tyr Leu Thr Lys Asn
325 330 335
Pro Ala Ile Tyr Ala Lys Val Arg Glu Asp Val Leu Thr Val Phe Pro
340 345 350
Asp Gly Asp Ala Ser Leu Ala Thr Phe Glu Asp Leu Arg Lys Ala Lys
355 360 365
Tyr Leu Gln Met Val Ile Lys Glu Val Leu Arg Leu Ala Pro Ala Val
370 375 380
Pro Thr Asn Ser Arg Thr Ala Val Arg Asp Thr Tyr Leu Pro Arg Gly
385 390 395 400
Gly Gly Pro Ala Gly Asn Leu Pro Val Phe Val Pro Lys Gly Thr Ile
405 410 415
Ile Arg Tyr Pro Ala Tyr Ile Leu His Arg Asp Pro Asp Ile Tyr Gly
420 425 430
Ala Asp Ser Tyr Asp Phe Asn Pro Glu Arg Trp Arg Pro Glu Asn Lys
435 440 445
Leu Pro Gly Ser Pro Met Tyr Ser Trp Gly Tyr Ile Pro Phe Asn Gly
450 455 460
Gly Pro Arg Ile Cys Val Gly Gln Gln Phe Ala Leu Thr Glu Ile Ala
465 470 475 480
Leu Thr Met Ile Lys Leu Val Leu Glu Phe Glu Arg Leu Glu Pro Ala
485 490 495
Asp Asp Phe Glu Pro Asn Leu Arg Asp Arg Thr Ser Leu Thr Ser Met
500 505 510
Val Gly Gly Ser Gly Val Arg Val Lys Leu Ser
515 520
<210> 60
<211> 1572
<212> DNA
<213> 假丝酵母菌
<220>
<221> CDS
<222> (1)..(1572)
<400> 60
atg att ttt tat gct gtg ctt ggc act gtg gtc gcc ttc tta ctt tac 48
Met Ile Phe Tyr Ala Val Leu Gly Thr Val Val Ala Phe Leu Leu Tyr
1 5 10 15
gta gat gtg atc tac cct ttc gtg ata tat cct tta aag gca cga tgg 96
Val Asp Val Ile Tyr Pro Phe Val Ile Tyr Pro Leu Lys Ala Arg Trp
20 25 30
cac aaa tgt ggc ttc gtc cct cga gag ctg agc tgg cca ttg ggg att 144
His Lys Cys Gly Phe Val Pro Arg Glu Leu Ser Trp Pro Leu Gly Ile
35 40 45
cca gac acc ata gca gtt ttt tcg agg ata aag aag gat cta cat ctt 192
Pro Asp Thr Ile Ala Val Phe Ser Arg Ile Lys Lys Asp Leu His Leu
50 55 60
caa ttc ctg gca gcg cac gac ctc agc cgg tct tat aag aca agc ttg 240
Gln Phe Leu Ala Ala His Asp Leu Ser Arg Ser Tyr Lys Thr Ser Leu
65 70 75 80
cgt caa act ctc ggc aca tgg gta gtt gat acg cga gat cct gag aat 288
Arg Gln Thr Leu Gly Thr Trp Val Val Asp Thr Arg Asp Pro Glu Asn
85 90 95
atc aag gcc gtt ttg tct acc aag ttc aat gac ttt tca ctg aaa gat 336
Ile Lys Ala Val Leu Ser Thr Lys Phe Asn Asp Phe Ser Leu Lys Asp
100 105 110
aga gga att cgg tta agg caa gta att ggt gat ggt att ttt acc caa 384
Arg Gly Ile Arg Leu Arg Gln Val Ile Gly Asp Gly Ile Phe Thr Gln
115 120 125
gat ggt gca ccg tgg aag cac tcg cga gat atg ctc aga cct caa ttc 432
Asp Gly Ala Pro Trp Lys His Ser Arg Asp Met Leu Arg Pro Gln Phe
130 135 140
agt agg gaa caa att agc cgc gtg gag gtg ttg agt cac cac atc gat 480
Ser Arg Glu Gln Ile Ser Arg Val Glu Val Leu Ser His His Ile Asp
145 150 155 160
gtt ttg att cgt gag atc aaa aag tcg gga ggt aat gtt gag ttg caa 528
Val Leu Ile Arg Glu Ile Lys Lys Ser Gly Gly Asn Val Glu Leu Gln
165 170 175
cga cta ttc cac ctc atg act atg gac act gct aca cag ttt ctt ttc 576
Arg Leu Phe His Leu Met Thr Met Asp Thr Ala Thr Gln Phe Leu Phe
180 185 190
ggc gaa tca att ggc tcg cta gaa gtc agt ggc gac agc aag ggc att 624
Gly Glu Ser Ile Gly Ser Leu Glu Val Ser Gly Asp Ser Lys Gly Ile
195 200 205
gag att act gac cca aat act gga gat att gtg aat acc gtt gac ttc 672
Glu Ile Thr Asp Pro Asn Thr Gly Asp Ile Val Asn Thr Val Asp Phe
210 215 220
gtt gag tct tat act ttt gca aac aga ttt gct atg aaa aag ata tta 720
Val Glu Ser Tyr Thr Phe Ala Asn Arg Phe Ala Met Lys Lys Ile Leu
225 230 235 240
ctg aac aaa tgg gaa ttc gtg gta aac ttg tcg aac ccc tca tat gag 768
Leu Asn Lys Trp Glu Phe Val Val Asn Leu Ser Asn Pro Ser Tyr Glu
245 250 255
agg cat atg cga cgt gtc cac aca gtc ctg gat cac tac gtt cag ctg 816
Arg His Met Arg Arg Val His Thr Val Leu Asp His Tyr Val Gln Leu
260 265 270
gct ttg aag gct act gag aag tat gat cct gaa gat gac tgc gag aaa 864
Ala Leu Lys Ala Thr Glu Lys Tyr Asp Pro Glu Asp Asp Cys Glu Lys
275 280 285
gga gaa tac tac ttt agc cat gag ctg gct aaa ctc acg aga gac ccc 912
Gly Glu Tyr Tyr Phe Ser His Glu Leu Ala Lys Leu Thr Arg Asp Pro
290 295 300
ttg tgc ttg cgc aat cag ctt ttt aat atc ctg att gct ggc cgc gac 960
Leu Cys Leu Arg Asn Gln Leu Phe Asn Ile Leu Ile Ala Gly Arg Asp
305 310 315 320
act acc gca gca aca ttg gcc tat gcc ttc cat tac ttg acg aag aac 1008
Thr Thr Ala Ala Thr Leu Ala Tyr Ala Phe His Tyr Leu Thr Lys Asn
325 330 335
cca gcc atc tac gcc aag gtg cgc gaa gat gtg ctc acc gtc ttc ccc 1056
Pro Ala Ile Tyr Ala Lys Val Arg Glu Asp Val Leu Thr Val Phe Pro
340 345 350
aat gga gat gcc tca ttg gcg acc ttt gag gac ttg cga aag gcc aag 1104
Asn Gly Asp Ala Ser Leu Ala Thr Phe Glu Asp Leu Arg Lys Ala Lys
355 360 365
tat ctc caa atg gta atc aag gag gta ttg cgc ctt gcg cct gtg gtt 1152
Tyr Leu Gln Met Val Ile Lys Glu Val Leu Arg Leu Ala Pro Val Val
370 375 380
ccc aca aat tcg cgt act gcg gtt cgt gac acc tat ctg cca cgg ggt 1200
Pro Thr Asn Ser Arg Thr Ala Val Arg Asp Thr Tyr Leu Pro Arg Gly
385 390 395 400
gga ggc cca gct gga aac cta ccc gtt ttc gtt ccc aag ggc aca aat 1248
Gly Gly Pro Ala Gly Asn Leu Pro Val Phe Val Pro Lys Gly Thr Asn
405 410 415
gtc agg tat tct gca tat gtc ttg cac cgc gat cct gat ata tat ggt 1296
Val Arg Tyr Ser Ala Tyr Val Leu His Arg Asp Pro Asp Ile Tyr Gly
420 425 430
gcc gac tcg tat gac ttc aac cct gag agg tgg aga ccc gag aat aag 1344
Ala Asp Ser Tyr Asp Phe Asn Pro Glu Arg Trp Arg Pro Glu Asn Lys
435 440 445
ctc cca ggt agc cca atg tac tca tgg ggc tat att ccc ttt aat ggc 1392
Leu Pro Gly Ser Pro Met Tyr Ser Trp Gly Tyr Ile Pro Phe Asn Gly
450 455 460
ggc cct cgc att tgc gtt gga cag cag ttt gcc ttg act gaa ttc gct 1440
Gly Pro Arg Ile Cys Val Gly Gln Gln Phe Ala Leu Thr Glu Phe Ala
465 470 475 480
ttg aca atg atc aag ctg gtt tta gaa ttt gag agg ctg gag cct gct 1488
Leu Thr Met Ile Lys Leu Val Leu Glu Phe Glu Arg Leu Glu Pro Ala
485 490 495
gat gac ttt gag ccc aat ctt cta gat agg acc tca tta act gcc atg 1536
Asp Asp Phe Glu Pro Asn Leu Leu Asp Arg Thr Ser Leu Thr Ala Met
500 505 510
gtc gga ggg tcg ggc gtc cga gta aaa ctg agt taa 1572
Val Gly Gly Ser Gly Val Arg Val Lys Leu Ser
515 520
<210> 61
<211> 523
<212> PRT
<213> 假丝酵母菌
<400> 61
Met Ile Phe Tyr Ala Val Leu Gly Thr Val Val Ala Phe Leu Leu Tyr
1 5 10 15
Val Asp Val Ile Tyr Pro Phe Val Ile Tyr Pro Leu Lys Ala Arg Trp
20 25 30
His Lys Cys Gly Phe Val Pro Arg Glu Leu Ser Trp Pro Leu Gly Ile
35 40 45
Pro Asp Thr Ile Ala Val Phe Ser Arg Ile Lys Lys Asp Leu His Leu
50 55 60
Gln Phe Leu Ala Ala His Asp Leu Ser Arg Ser Tyr Lys Thr Ser Leu
65 70 75 80
Arg Gln Thr Leu Gly Thr Trp Val Val Asp Thr Arg Asp Pro Glu Asn
85 90 95
Ile Lys Ala Val Leu Ser Thr Lys Phe Asn Asp Phe Ser Leu Lys Asp
100 105 110
Arg Gly Ile Arg Leu Arg Gln Val Ile Gly Asp Gly Ile Phe Thr Gln
115 120 125
Asp Gly Ala Pro Trp Lys His Ser Arg Asp Met Leu Arg Pro Gln Phe
130 135 140
Ser Arg Glu Gln Ile Ser Arg Val Glu Val Leu Ser His His Ile Asp
145 150 155 160
Val Leu Ile Arg Glu Ile Lys Lys Ser Gly Gly Asn Val Glu Leu Gln
165 170 175
Arg Leu Phe His Leu Met Thr Met Asp Thr Ala Thr Gln Phe Leu Phe
180 185 190
Gly Glu Ser Ile Gly Ser Leu Glu Val Ser Gly Asp Ser Lys Gly Ile
195 200 205
Glu Ile Thr Asp Pro Asn Thr Gly Asp Ile Val Asn Thr Val Asp Phe
210 215 220
Val Glu Ser Tyr Thr Phe Ala Asn Arg Phe Ala Met Lys Lys Ile Leu
225 230 235 240
Leu Asn Lys Trp Glu Phe Val Val Asn Leu Ser Asn Pro Ser Tyr Glu
245 250 255
Arg His Met Arg Arg Val His Thr Val Leu Asp His Tyr Val Gln Leu
260 265 270
Ala Leu Lys Ala Thr Glu Lys Tyr Asp Pro Glu Asp Asp Cys Glu Lys
275 280 285
Gly Glu Tyr Tyr Phe Ser His Glu Leu Ala Lys Leu Thr Arg Asp Pro
290 295 300
Leu Cys Leu Arg Asn Gln Leu Phe Asn Ile Leu Ile Ala Gly Arg Asp
305 310 315 320
Thr Thr Ala Ala Thr Leu Ala Tyr Ala Phe His Tyr Leu Thr Lys Asn
325 330 335
Pro Ala Ile Tyr Ala Lys Val Arg Glu Asp Val Leu Thr Val Phe Pro
340 345 350
Asn Gly Asp Ala Ser Leu Ala Thr Phe Glu Asp Leu Arg Lys Ala Lys
355 360 365
Tyr Leu Gln Met Val Ile Lys Glu Val Leu Arg Leu Ala Pro Val Val
370 375 380
Pro Thr Asn Ser Arg Thr Ala Val Arg Asp Thr Tyr Leu Pro Arg Gly
385 390 395 400
Gly Gly Pro Ala Gly Asn Leu Pro Val Phe Val Pro Lys Gly Thr Asn
405 410 415
Val Arg Tyr Ser Ala Tyr Val Leu His Arg Asp Pro Asp Ile Tyr Gly
420 425 430
Ala Asp Ser Tyr Asp Phe Asn Pro Glu Arg Trp Arg Pro Glu Asn Lys
435 440 445
Leu Pro Gly Ser Pro Met Tyr Ser Trp Gly Tyr Ile Pro Phe Asn Gly
450 455 460
Gly Pro Arg Ile Cys Val Gly Gln Gln Phe Ala Leu Thr Glu Phe Ala
465 470 475 480
Leu Thr Met Ile Lys Leu Val Leu Glu Phe Glu Arg Leu Glu Pro Ala
485 490 495
Asp Asp Phe Glu Pro Asn Leu Leu Asp Arg Thr Ser Leu Thr Ala Met
500 505 510
Val Gly Gly Ser Gly Val Arg Val Lys Leu Ser
515 520
<210> 62
<211> 1206
<212> DNA
<213> 假丝酵母菌
<220>
<221> CDS
<222> (1)..(1206)
<400> 62
atg ttt gcg aaa gct tta tgg gag gat gat gtt ttg gag tac gcc tgc 48
Met Phe Ala Lys Ala Leu Trp Glu Asp Asp Val Leu Glu Tyr Ala Cys
1 5 10 15
cgc agg ttt gca ggc atg aag gtc aga act ggg ctt caa act gtc gct 96
Arg Arg Phe Ala Gly Met Lys Val Arg Thr Gly Leu Gln Thr Val Ala
20 25 30
ggc cag cta tgg ata gca act atc gag ccg gag aac atc aag acc gta 144
Gly Gln Leu Trp Ile Ala Thr Ile Glu Pro Glu Asn Ile Lys Thr Val
35 40 45
ctt gcc acc tcg ttc aat gac tac tcc ctt ggc ttc cgt tat aat gcc 192
Leu Ala Thr Ser Phe Asn Asp Tyr Ser Leu Gly Phe Arg Tyr Asn Ala
50 55 60
cta tac ggc ctt ctc gga aat ggt att ttc acc ctt agt ggt gat ggc 240
Leu Tyr Gly Leu Leu Gly Asn Gly Ile Phe Thr Leu Ser Gly Asp Gly
65 70 75 80
tgg aag cac agt cgt gct ttg ttg cgt ccg cag ttc agt cgt gag caa 288
Trp Lys His Ser Arg Ala Leu Leu Arg Pro Gln Phe Ser Arg Glu Gln
85 90 95
gtt tct cac ttg gac tcc atg cgt aca cac atc aac ttg atg atc aac 336
Val Ser His Leu Asp Ser Met Arg Thr His Ile Asn Leu Met Ile Asn
100 105 110
aac cat ttc aaa ggc ggc cac gtc gtt gac gca cag gct cga tac cac 384
Asn His Phe Lys Gly Gly His Val Val Asp Ala Gln Ala Arg Tyr His
115 120 125
aat ttg acc atc gat act gcg act gaa ttc ctt ttc ggt gag agc act 432
Asn Leu Thr Ile Asp Thr Ala Thr Glu Phe Leu Phe Gly Glu Ser Thr
130 135 140
aac aca ctc gac cct gtt ctt gca cag caa gga ctc cct ggt cct aag 480
Asn Thr Leu Asp Pro Val Leu Ala Gln Gln Gly Leu Pro Gly Pro Lys
145 150 155 160
ggc acc gtt acc gga gag cag ttt gct gaa gct ttc acc tcc gct ctt 528
Gly Thr Val Thr Gly Glu Gln Phe Ala Glu Ala Phe Thr Ser Ala Leu
165 170 175
caa gtg ctg agt gtc cga gtt atg gcc ggc tcc gca tgg ttc ctc att 576
Gln Val Leu Ser Val Arg Val Met Ala Gly Ser Ala Trp Phe Leu Ile
180 185 190
tgg act cct aaa ttc tgg cgc tcg tgc aag gtg tgc cac aac ttc att 624
Trp Thr Pro Lys Phe Trp Arg Ser Cys Lys Val Cys His Asn Phe Ile
195 200 205
gac tac ttc gta tac aag gcc ttg gcc act ccg atg gag aag ggc caa 672
Asp Tyr Phe Val Tyr Lys Ala Leu Ala Thr Pro Met Glu Lys Gly Gln
210 215 220
gag gct gat cgc tat gtt ttt att cga gag ctc aca aag gag act tct 720
Glu Ala Asp Arg Tyr Val Phe Ile Arg Glu Leu Thr Lys Glu Thr Ser
225 230 235 240
gac cca aga gtc atc cgt gac cag gct cta aat atc ctg ctg gct ggt 768
Asp Pro Arg Val Ile Arg Asp Gln Ala Leu Asn Ile Leu Leu Ala Gly
245 250 255
cgt gat acc act gcg gca ctc ctc atc att gcg gac ttt ggc tct gag 816
Arg Asp Thr Thr Ala Ala Leu Leu Ile Ile Ala Asp Phe Gly Ser Glu
260 265 270
gac gct gag ccc cct acc ttt gag cag ctc aag cag tgc aag gta ctg 864
Asp Ala Glu Pro Pro Thr Phe Glu Gln Leu Lys Gln Cys Lys Val Leu
275 280 285
cag aat gtc att cgc gag gtt tta cgt ttg cac cct aat gtg ccg ctc 912
Gln Asn Val Ile Arg Glu Val Leu Arg Leu His Pro Asn Val Pro Leu
290 295 300
aac ttc cgc cag gct ata act gat act aag ctc ccc act ggt ggt ggc 960
Asn Phe Arg Gln Ala Ile Thr Asp Thr Lys Leu Pro Thr Gly Gly Gly
305 310 315 320
ccg aac aga gac cag cct gtc ttt gtt cca aag gga cag aaa gtg ttc 1008
Pro Asn Arg Asp Gln Pro Val Phe Val Pro Lys Gly Gln Lys Val Phe
325 330 335
tac gcc acc tac gtc atg cag cga gat ccg gaa ata tgg ggc ccc gac 1056
Tyr Ala Thr Tyr Val Met Gln Arg Asp Pro Glu Ile Trp Gly Pro Asp
340 345 350
tct aca agc ttc cgc cct gat cga tgg aat gag ccg aga gag gct ctt 1104
Ser Thr Ser Phe Arg Pro Asp Arg Trp Asn Glu Pro Arg Glu Ala Leu
355 360 365
gca tca ggt tgg gat tat att cct ttc aat ggc ggc cct cgc att tgt 1152
Ala Ser Gly Trp Asp Tyr Ile Pro Phe Asn Gly Gly Pro Arg Ile Cys
370 375 380
atc ggt cag cag ttc gct ctc act gag gct agc tac aca ctt gtc cgt 1200
Ile Gly Gln Gln Phe Ala Leu Thr Glu Ala Ser Tyr Thr Leu Val Arg
385 390 395 400
atc tag 1206
Ile
<210> 63
<211> 401
<212> PRT
<213> 假丝酵母菌
<400> 63
Met Phe Ala Lys Ala Leu Trp Glu Asp Asp Val Leu Glu Tyr Ala Cys
1 5 10 15
Arg Arg Phe Ala Gly Met Lys Val Arg Thr Gly Leu Gln Thr Val Ala
20 25 30
Gly Gln Leu Trp Ile Ala Thr Ile Glu Pro Glu Asn Ile Lys Thr Val
35 40 45
Leu Ala Thr Ser Phe Asn Asp Tyr Ser Leu Gly Phe Arg Tyr Asn Ala
50 55 60
Leu Tyr Gly Leu Leu Gly Asn Gly Ile Phe Thr Leu Ser Gly Asp Gly
65 70 75 80
Trp Lys His Ser Arg Ala Leu Leu Arg Pro Gln Phe Ser Arg Glu Gln
85 90 95
Val Ser His Leu Asp Ser Met Arg Thr His Ile Asn Leu Met Ile Asn
100 105 110
Asn His Phe Lys Gly Gly His Val Val Asp Ala Gln Ala Arg Tyr His
115 120 125
Asn Leu Thr Ile Asp Thr Ala Thr Glu Phe Leu Phe Gly Glu Ser Thr
130 135 140
Asn Thr Leu Asp Pro Val Leu Ala Gln Gln Gly Leu Pro Gly Pro Lys
145 150 155 160
Gly Thr Val Thr Gly Glu Gln Phe Ala Glu Ala Phe Thr Ser Ala Leu
165 170 175
Gln Val Leu Ser Val Arg Val Met Ala Gly Ser Ala Trp Phe Leu Ile
180 185 190
Trp Thr Pro Lys Phe Trp Arg Ser Cys Lys Val Cys His Asn Phe Ile
195 200 205
Asp Tyr Phe Val Tyr Lys Ala Leu Ala Thr Pro Met Glu Lys Gly Gln
210 215 220
Glu Ala Asp Arg Tyr Val Phe Ile Arg Glu Leu Thr Lys Glu Thr Ser
225 230 235 240
Asp Pro Arg Val Ile Arg Asp Gln Ala Leu Asn Ile Leu Leu Ala Gly
245 250 255
Arg Asp Thr Thr Ala Ala Leu Leu Ile Ile Ala Asp Phe Gly Ser Glu
260 265 270
Asp Ala Glu Pro Pro Thr Phe Glu Gln Leu Lys Gln Cys Lys Val Leu
275 280 285
Gln Asn Val Ile Arg Glu Val Leu Arg Leu His Pro Asn Val Pro Leu
290 295 300
Asn Phe Arg Gln Ala Ile Thr Asp Thr Lys Leu Pro Thr Gly Gly Gly
305 310 315 320
Pro Asn Arg Asp Gln Pro Val Phe Val Pro Lys Gly Gln Lys Val Phe
325 330 335
Tyr Ala Thr Tyr Val Met Gln Arg Asp Pro Glu Ile Trp Gly Pro Asp
340 345 350
Ser Thr Ser Phe Arg Pro Asp Arg Trp Asn Glu Pro Arg Glu Ala Leu
355 360 365
Ala Ser Gly Trp Asp Tyr Ile Pro Phe Asn Gly Gly Pro Arg Ile Cys
370 375 380
Ile Gly Gln Gln Phe Ala Leu Thr Glu Ala Ser Tyr Thr Leu Val Arg
385 390 395 400
Ile
<210> 64
<211> 6084
<212> DNA
<213> 人工的
<220>
<223> 载体
<400> 64
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc 60
attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga 120
gatagggttg agtggccgct acagggcgct cccattcgcc attcaggctg cgcaactgtt 180
gggaagggcg tttcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt 240
gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg 300
acggccagtg agcgcgacgt aatacgactc actatagggc gaattggcgg aaggccgtca 360
aggcctaggc gcgcctgcag gatcctagaa aacagctgga tatggataaa ctcggcaagc 420
atcttttgga aaggctgcca cgctatgccg tgccgatatt cattaagttc gtcgacaccg 480
tcaccatcac cggcaacaac aaagttcaga agaaagaatt ccgaaaccag cagattcccg 540
ccccagcagg acaaacaatc tactggttag agggcacgag ctacaagccc ctcactgctg 600
atgcgtgggc tcgtgtagag aatgggcgac acaagctcta atgttgaata cctcttccgt 660
atagatgtgg cataagctat agattttgct gcaatattat taaatattaa agagtttcga 720
aggtcagctg cggatgaacc agttcagagc ggctctctct tttttgccaa tagcgtgcaa 780
ccgtgaagag caattcaacc atccaatctg gctacactaa attgtatttg gcagcgcact 840
gtacgagcgc actgtacgaa actccgtcaa tttatagcag aacgcgtgcg atcgcgggcc 900
ccagcgatat gacgagaatc aggcaataat agcttaagct gaagtgtttt tagatttagt 960
tcggagtgcg cttctcaaaa gtgctgggat caacaagttt ttaacatggg ttttgattta 1020
tattgtttta tatgagcgcc tcagatatgc gctgacagcc tattaggaga aatggccggc 1080
ctacaagtcc atatgtgtag agttgttttt gttgttaagt ctttctttaa gagcttgacc 1140
gactataacc gttcaacggc gcattatata ctttgggtat cggccagtgc tgacaactca 1200
cacgttgcga ccccttaccc agaagcatac ccagcgcgat gtcgatcgtg ttatatcgta 1260
gacgcacacc ctgcaatgac gggtaggctc taaatcggga tgcgaaaaag aggttgcctt 1320
gctttttgcc ctggtagatg gcatgctgag cgtgcgcttg ccgcctaatt tttgtgtgtc 1380
gcctgctatt tattgctgaa gctagcccgc cgcatctttc cccaaggctt cgattgctcg 1440
tattggggca gggattggta ctcaaccttg cagatgagac tccagcaaca acgtcgtact 1500
gcttagcgat cgcacatgtt tcatcatcgt cactatacac atcgtcatca actccatggc 1560
gtgaggactt ccgagactgc tgggcccttc gtttctttaa tgcctcaaga gatgacttcg 1620
tacccgaaga gacgcctgtt gtaccccgtt gacgcttggc ggagggggct tcgtcctcgt 1680
cagcaacccg cgtcatctgc ttccttcgct gagcaagata ccttctctcc tcgtaccgct 1740
gcatctcctg agctcggtca tacaagatct cttctcgctc aatctctggc agcgcgtcca 1800
acttcgccct gtcttcagca tcgagatatt tgccttctag aggataggga ttgacgacct 1860
cattgcttgg cggcgacggc agcgagattt cctcttcgga gtcggagcca acgtcggcca 1920
atgccagcag atcatcatca ctgtcactca tagtaggaag gttgaagtgt gctgacgaat 1980
cagaatcgcg aaggatgcca ttgaaggcat atatatttta atctgtacct tttatggtaa 2040
tttaatcaga ttttataggt attcatgtgc aagttgcatt gaaggaactg tttgagaaaa 2100
tcatcttgac tgaacttttc tcagatatgc attccagccc gccttttggt aacgctgagc 2160
ttcgtgcaca ggatctcgtc ccttgctata gagcccgcgt ccgacgataa taacgtctgt 2220
gccggtctct atgacgtcgt ccacagtacg atactgctgc cccaatccat cacctttgtc 2280
gtccaggccc accccaggag tcataatgac ccagtcttcc tctggctttc cgactttttg 2340
ctgagcgatg aaaccaaaca caaatgcgcg gttactgcga gcgatgtcta ctgtcgcttg 2400
cgagtattcg ccgtgagcca gtgtgccctt cgaactcagt tctgcaagca tgacaaggcc 2460
gcgaggttca tccgtagttt ccttcgcagc ctcttctagt ccgctcacaa ttcccggccc 2520
aggaacaccg tgagcatttg ttatatcagc ccattgagcg atcttaaaca ctccacctgc 2580
atattgggcc ttaacagtgg aaccgatgtc tgcgaacttt cggtcttcaa aaatgagaaa 2640
attgtgcttc gttgaaagct gtttcaaacc gctgacagtt gtgtcgtatt cgaagtcgtc 2700
aattatgtca atgtgggtct taaccataca aatgtaaggt ccaatgcggt ccaggatact 2760
cagtaactca gaggtagttc gcacatccaa gcttgcgcaa agatttgttt gcttgctcac 2820
aatgatgtcg aatagccggg ctgctacagc cggcagcctc tctcggcgct cctcatagct 2880
cagcttcata ttatttctct acagtagtgc ccgtgccctc gatcagctag gacttttcaa 2940
attaatcggg ctgtttgatg taagtaagat gaagtcacgc gcgtgcagga gactgcgtcc 3000
cgcgatattc tgcaggcttg aaaaatttac cctaacggta ggcatcaagt gagtgagtct 3060
cagcgtcgat atgggtcaaa aaaggggaaa actagccgag atcgttgcga gctgtttcga 3120
aaattatgcc ctatggcaat tatcacgtgg agtatccgaa tttctccagg ctgtcaagcg 3180
gcaattataa ccgagactga gatcgagaag tatataaccg cagcagtagt ggataaataa 3240
ttgcgaagtc ttcccagcag agcgggctgt tttttggagt tggttactgt aaaatgctaa 3300
aatgactgac aacaatggag cgtctacagc attggcaaca gtgggaacag tatgctggtg 3360
catccagttg ataccccagg ttctgcgaaa ctggtatgtt cgggattgcg agggcgttcc 3420
tcctctgatg ttctttttgt tcgccgtttc ggggattccc ttcgcagtgt acttcattga 3480
tcagaattcg aacactgcca tcatggttca acctcacttg tttactttct ttagccttat 3540
aggcttttgg caaagcctgt actatccgcc cgtcagacca gcacgggccg tcacatgtat 3600
ggttgcgtcg ctgtataaga aatcttacaa ctgaagacta cacagcgtat ccgctccgat 3660
atcggcgatc acgtggatac atttccccag aatgcgtcaa ccttgcatgc tcgatattga 3720
ctcaagccga gaggtgtata acaacaccga cgatagcgaa ttacttgtgg aactgatttg 3780
ccgtatcgag taaatcgcga ttgtggccct ctttaggcct tgtacccatt tgtgcatcgt 3840
atttgttagt atgcatcata gaattatgtg aacttagaaa agtccgtatg aaatgagcct 3900
cagattatgg attgatcgct tgttatttgt acagcggaat tgacttatag tatgtcggcc 3960
acggttttag attgcctagg ggccgttttc ttgatggatt cgcatcggaa ctccgaattc 4020
ttgattgctc tccatcgcgc aggaggccgt tctttttttg acaaagtccc attttagggc 4080
gcaggtccaa aaaataagcg gccgcttaat taactggcct catgggcctt ccgctcactg 4140
cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt aacatggtca tagctgtttc 4200
cttgcgtatt gggcgctctc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg 4260
gtaaagcctg gggtgcctaa tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg 4320
ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac 4380
gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg 4440
gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct 4500
ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg 4560
tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct 4620
gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac 4680
tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt 4740
tcttgaagtg gtggcctaac tacggctaca ctagaagaac agtatttggt atctgcgctc 4800
tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca 4860
ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat 4920
ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gaaaactcac 4980
gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt 5040
aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc 5100
aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagttg 5160
cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg 5220
ctgcaatgat accgcgagaa ccacgctcac cggctccaga tttatcagca ataaaccagc 5280
cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta 5340
ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg cgcaacgttg 5400
ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct 5460
ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta 5520
gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg 5580
ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga 5640
ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt 5700
gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca 5760
ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt 5820
cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt 5880
ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga 5940
aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat cagggttatt 6000
gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc 6060
gcacatttcc ccgaaaagtg ccac 6084
<210> 65
<211> 7693
<212> DNA
<213> 人工的
<220>
<223> 载体
<400> 65
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc 60
attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga 120
gatagggttg agtggccgct acagggcgct cccattcgcc attcaggctg cgcaactgtt 180
gggaagggcg tttcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt 240
gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg 300
acggccagtg agcgcgacgt aatacgactc actatagggc gaattggcgg aaggccgtca 360
aggcctaggc gcgcctgcag gatcctagaa aacagctgga tatggataaa ctcggcaagc 420
atcttttgga aaggctgcca cgctatgccg tgccgatatt cattaagttc gtcgacaccg 480
tcaccatcac cggcaacaac aaagttcaga agaaagaatt ccgaaaccag cagattcccg 540
ccccagcagg acaaacaatc tactggttag agggcacgag ctacaagccc ctcactgctg 600
atgcgtgggc tcgtgtagag aatgggcgac acaagctcta atgttgaata cctcttccgt 660
atagatgtgg cataagctat agattttgct gcaatattat taaatattaa agagtttcga 720
aggtcagctg cggatgaacc agttcagagc ggctctctct tttttgccaa tagcgtgcaa 780
ccgtgaagag caattcaacc atccaatctg gctacactaa attgtatttg gcagcgcact 840
gtacgagcgc actgtacgaa actccgtcaa tttatagcag aacgcgtgcg atcgcgggcc 900
ccagcgatat gacgagaatc aggcaataat agcttaagct gaagtgtttt tagatttagt 960
tcggagtgcg cttctcaaaa gtgctgggat caacaagttt ttaacatggg ttttgattta 1020
tattgtttta tatgagcgcc tcagatatgc gctgacagcc tattaggaga aatggccggc 1080
caacttaaga aaaccgcaca accacaccgg gaggagcgtg ttgagctgta agcgttgttg 1140
agaaacgagg ggactctggg aagtcgggac ccatctcaat cttggaatac tcctgtaaga 1200
gtctcaccag agttagcgaa agctctgtca gggcgaattg ttggccgaga caaattcggg 1260
gaccgccatt gaagggcaag aatgcccaca cattatctag cttcaagttc tcccatcgat 1320
tgggattgaa ttcgtgggcg tcaggacccc aatacttgat gtccctgtgg accatgtaaa 1380
ttgaatagta aactgcggtg cccttaggaa cgaagatcgg atccttctgc tcgggaccac 1440
cacctatggg tagagttgta tctctcacag cagtacggaa gttcaatggc aataccggcg 1500
caagacgcaa gacttcattt ataacttgct tcaaataagg tgcttgcttc agaagttcga 1560
atgataaagg cctttgctcc tccttggttc caaaatgatc gaggacctcc tcacgtagtt 1620
tgttgaatac gtcaggattt ctggcaagga aatgaatagc gaagctcaac gtagcagctg 1680
ttgtatctct accagcaatg agaatgttga aaatttgatc acgtatcgtc actgggtctc 1740
gggtaacttt agccatctca agcgagaaca catagatgcc actagactct gcagcagcat 1800
ccttctctgc aatagagttc tcagcagcga aagatgtggc gtaaagagcc ttatcaacgt 1860
agtagtcaat ataggactga gcacgtttct tgtgatctcg gaattcctta gagttgaaca 1920
accagtagac tttgcttgat agggtccgtt tgaaagcgta attcagtaga aagttgtagg 1980
actccacgaa ttgttcggca gtaatctccg aaccatcacg ggctacaata catgactgat 2040
tctcagggtt caagctctcg caggactccc caaataggaa ttcagtcgct gtatccagcg 2100
taagtttgtg gaaataatgt tgaacatcaa taaattggtc cactttcatt gcacggttca 2160
tctcctttat taactccgca gcatgactgg aaatctgatc aattctgcaa acctgatctt 2220
tagtgaactg aggtctcaac atcgatcgag actgtttcca tccatttccg ctgagtgtaa 2280
atatcccttg gccaaacact tttcccactg tgtggaaacg tgctccaaga ccaaaatcat 2340
tgaatttggt tgccaggatt gtcttaatgt tttctggctc gattgtgaag atttggtatt 2400
gaaggggagc ttgtcgaaga tacgtccgtg ctttgaactt attgaagact ctgtcgtatt 2460
gaacttccag taaggtgtat gacttggccg tcttgatcat gtccatggtt ctttgtattc 2520
ccagtgggaa cgatttctca atgaagcgag gcatactaca cttgtgccta cgtgctgcat 2580
agcggtacca taggagccag ataggctcgt gtagaactaa gaaagctacg aagagcagtg 2640
gcaacaagcc agcaacagcg gataaactca ttggagttag aataatgtct ttgattaaca 2700
tatgtgtaga gttgtttttg ttgttaagtc tttctttaag agcttgaccg actataaccg 2760
ttcaacggcg cattatatac tttgggtatc ggccagtgct gacaactcac acgttgcgac 2820
cccttaccca gaagcatacc cagcgcgatg tcgatcgtgt tatatcgtag acgcacaccc 2880
tgcaatgacg ggtaggctct aaatcgggat gcgaaaaaga ggttgccttg ctttttgccc 2940
tggtagatgg catgctgagc gtgcgcttgc cgcctaattt ttgtgtgtcg cctgctattt 3000
attgctgaag ctagcccgcc gcatctttcc ccaaggcttc gattgctcgt attggggcag 3060
ggattggtac tcaaccttgc agatgagact ccagcaacaa cgtcgtactg cttagcgatc 3120
gcacatgttt catcatcgtc actatacaca tcgtcatcaa ctccatggcg tgaggacttc 3180
cgagactgct gggcccttcg tttctttaat gcctcaagag atgacttcgt acccgaagag 3240
acgcctgttg taccccgttg acgcttggcg gagggggctt cgtcctcgtc agcaacccgc 3300
gtcatctgct tccttcgctg agcaagatac cttctctcct cgtaccgctg catctcctga 3360
gctcggtcat acaagatctc ttctcgctca atctctggca gcgcgtccaa cttcgccctg 3420
tcttcagcat cgagatattt gccttctaga ggatagggat tgacgacctc attgcttggc 3480
ggcgacggca gcgagatttc ctcttcggag tcggagccaa cgtcggccaa tgccagcaga 3540
tcatcatcac tgtcactcat agtaggaagg ttgaagtgtg ctgacgaatc agaatcgcga 3600
aggatgccat tgaaggcata tatattttaa tctgtacctt ttatggtaat ttaatcagat 3660
tttataggta ttcatgtgca agttgcattg aaggaactgt ttgagaaaat catcttgact 3720
gaacttttct cagatatgca ttccagcccg ccttttggta acgctgagct tcgtgcacag 3780
gatctcgtcc cttgctatag agcccgcgtc cgacgataat aacgtctgtg ccggtctcta 3840
tgacgtcgtc cacagtacga tactgctgcc ccaatccatc acctttgtcg tccaggccca 3900
ccccaggagt cataatgacc cagtcttcct ctggctttcc gactttttgc tgagcgatga 3960
aaccaaacac aaatgcgcgg ttactgcgag cgatgtctac tgtcgcttgc gagtattcgc 4020
cgtgagccag tgtgcccttc gaactcagtt ctgcaagcat gacaaggccg cgaggttcat 4080
ccgtagtttc cttcgcagcc tcttctagtc cgctcacaat tcccggccca ggaacaccgt 4140
gagcatttgt tatatcagcc cattgagcga tcttaaacac tccacctgca tattgggcct 4200
taacagtgga accgatgtct gcgaactttc ggtcttcaaa aatgagaaaa ttgtgcttcg 4260
ttgaaagctg tttcaaaccg ctgacagttg tgtcgtattc gaagtcgtca attatgtcaa 4320
tgtgggtctt aaccatacaa atgtaaggtc caatgcggtc caggatactc agtaactcag 4380
aggtagttcg cacatccaag cttgcgcaaa gatttgtttg cttgctcaca atgatgtcga 4440
atagccgggc tgctacagcc ggcagcctct ctcggcgctc ctcatagctc agcttcatat 4500
tatttctcta cagtagtgcc cgtgccctcg atcagctagg acttttcaaa ttaatcgggc 4560
tgtttgatgt aagtaagatg aagtcacgcg cgtgcaggag actgcgtccc gcgatattct 4620
gcaggcttga aaaatttacc ctaacggtag gcatcaagtg agtgagtctc agcgtcgata 4680
tgggtcaaaa aaggggaaaa ctagccgaga tcgttgcgag ctgtttcgaa aattatgccc 4740
tatggcaatt atcacgtgga gtatccgaat ttctccaggc tgtcaagcgg caattataac 4800
cgagactgag atcgagaagt atataaccgc agcagtagtg gataaataat tgcgaagtct 4860
tcccagcaga gcgggctgtt ttttggagtt ggttactgta aaatgctaaa atgactgaca 4920
acaatggagc gtctacagca ttggcaacag tgggaacagt atgctggtgc atccagttga 4980
taccccaggt tctgcgaaac tggtatgttc gggattgcga gggcgttcct cctctgatgt 5040
tctttttgtt cgccgtttcg gggattccct tcgcagtgta cttcattgat cagaattcga 5100
acactgccat catggttcaa cctcacttgt ttactttctt tagccttata ggcttttggc 5160
aaagcctgta ctatccgccc gtcagaccag cacgggccgt cacatgtatg gttgcgtcgc 5220
tgtataagaa atcttacaac tgaagactac acagcgtatc cgctccgata tcggcgatca 5280
cgtggataca tttccccaga atgcgtcaac cttgcatgct cgatattgac tcaagccgag 5340
aggtgtataa caacaccgac gatagcgaat tacttgtgga actgatttgc cgtatcgagt 5400
aaatcgcgat tgtggccctc tttaggcctt gtacccattt gtgcatcgta tttgttagta 5460
tgcatcatag aattatgtga acttagaaaa gtccgtatga aatgagcctc agattatgga 5520
ttgatcgctt gttatttgta cagcggaatt gacttatagt atgtcggcca cggttttaga 5580
ttgcctaggg gccgttttct tgatggattc gcatcggaac tccgaattct tgattgctct 5640
ccatcgcgca ggaggccgtt ctttttttga caaagtccca ttttagggcg caggtccaaa 5700
aaataagcgg ccgcttaatt aactggcctc atgggccttc cgctcactgc ccgctttcca 5760
gtcgggaaac ctgtcgtgcc agctgcatta acatggtcat agctgtttcc ttgcgtattg 5820
ggcgctctcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggg taaagcctgg 5880
ggtgcctaat gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg 5940
gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag 6000
aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc 6060
gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg 6120
ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt 6180
cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc 6240
ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc 6300
actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg 6360
tggcctaact acggctacac tagaagaaca gtatttggta tctgcgctct gctgaagcca 6420
gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 6480
ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat 6540
cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt 6600
ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt 6660
tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc 6720
agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc 6780
gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata 6840
ccgcgagaac cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg 6900
gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc 6960
cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct 7020
acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa 7080
cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt 7140
cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca 7200
ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac 7260
tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca 7320
atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt 7380
tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc 7440
actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca 7500
aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata 7560
ctcatactct tcctttttca atattattga agcatttatc agggttattg tctcatgagc 7620
ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc 7680
cgaaaagtgc cac 7693
<210> 66
<211> 7465
<212> DNA
<213> 人工的
<220>
<223> 载体
<400> 66
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc 60
attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga 120
gatagggttg agtggccgct acagggcgct cccattcgcc attcaggctg cgcaactgtt 180
gggaagggcg tttcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt 240
gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg 300
acggccagtg agcgcgacgt aatacgactc actatagggc gaattggcgg aaggccgtca 360
aggcctaggc gcgcctgcag gatcctagaa aacagctgga tatggataaa ctcggcaagc 420
atcttttgga aaggctgcca cgctatgccg tgccgatatt cattaagttc gtcgacaccg 480
tcaccatcac cggcaacaac aaagttcaga agaaagaatt ccgaaaccag cagattcccg 540
ccccagcagg acaaacaatc tactggttag agggcacgag ctacaagccc ctcactgctg 600
atgcgtgggc tcgtgtagag aatgggcgac acaagctcta atgttgaata cctcttccgt 660
atagatgtgg cataagctat agattttgct gcaatattat taaatattaa agagtttcga 720
aggtcagctg cggatgaacc agttcagagc ggctctctct tttttgccaa tagcgtgcaa 780
ccgtgaagag caattcaacc atccaatctg gctacactaa attgtatttg gcagcgcact 840
gtacgagcgc actgtacgaa actccgtcaa tttatagcag aacgcgtgcg atcgcgggcc 900
ccagcgatat gacgagaatc aggcaataat agcttaagct gaagtgtttt tagatttagt 960
tcggagtgcg cttctcaaaa gtgctgggat caacaagttt ttaacatggg ttttgattta 1020
tattgtttta tatgagcgcc tcagatatgc gctgacagcc tattaggaga aatggccggc 1080
cctaagaact caccgctaag gccggacctt tgacaggtat atcttcagtt tcctcgtcac 1140
tcttggtcaa aagaccaaag tcatggctgg cgatttcctc gatgctttcc tcaagaattt 1200
tcaaggagtt gtggctttcc aactccattt gaaccttctt cgaggcttcg tggaatttcg 1260
gatttccaat tatcgaatca acagcttctt tgatttgctc cactgtaggc aagccagttt 1320
tcaaatcaat tgccacgcca gcggcctcag ctctcgatgc caccattggc ttgtcttcag 1380
agtcaccagc aataacaact ggaacagagt ggcttaagct gtgctgaagt ccgccatatc 1440
caccattgta gacaagagca tcaacgtgag gaagtagagc atcgtagttg aagtagtcga 1500
tcacgcgagc attctcagga accacaacat catccggtag cttggcaccg cggcggccca 1560
atatggctac tgttaaagtg tcaggctcgt ccttcaaggc ctcaagagta ggcacaataa 1620
gatgcttgta actgacagca aaagttcctt gagtgaccat gatgactcgc ttggcactca 1680
gaacatcccc ccaccaggaa ggaggggtga attgagttcg gtgcttgggc gttgagccgg 1740
cgaatttgaa gttgctaggc agatggtctc tgctgaactc aagagaaggc gggcacagct 1800
gcaggaactt gtctgcagca atgtaactgt gctcccagat aaatttggga tcttcagtgc 1860
aacctaactc tcggcagatt tccttgtgct tagcagtggc tttaacgaaa atttggtgct 1920
caagagcgtg gttcatagcg agtttctttg catgtgcttc ggggctcctg tcgttgtcaa 1980
gtcctaaggt atgatcactg cggatcaaaa gaggcaaaac ccctaaacaa atccagccag 2040
cgggtttgaa accaggagca ccgaggctga tagggtgtgc accgaaaaac agcacttcac 2100
tgacaagaac gacagggcga ccgcttgcgc tgagcttttt gaaagccctc tgaatagcgg 2160
caaactgctc aggaagagta gctaccatca tgtgctccac atcttgaact gtacgatcga 2220
agcttggggc catgtcttta cggcccggga ccagatcgtc taaggtgtgg tcatcaaaat 2280
ctgcgttccc ttctaaagga acaaagtctg cacccacatc tcgaactttt tgttcaaacg 2340
ctctgcctgt cacaacagta gcttcgtatc cgtcgtccgt aaggccgtgt accagactca 2400
aaacgggcat tatatggcct gaaagaggca agccgcaagc gagaatcagg ggtttgtgtg 2460
atgaagggct catatgtgta gagttgtttt tgttgttaag tctttcttta agagcttgac 2520
cgactataac cgttcaacgg cgcattatat actttgggta tcggccagtg ctgacaactc 2580
acacgttgcg accccttacc cagaagcata cccagcgcga tgtcgatcgt gttatatcgt 2640
agacgcacac cctgcaatga cgggtaggct ctaaatcggg atgcgaaaaa gaggttgcct 2700
tgctttttgc cctggtagat ggcatgctga gcgtgcgctt gccgcctaat ttttgtgtgt 2760
cgcctgctat ttattgctga agctagcccg ccgcatcttt ccccaaggct tcgattgctc 2820
gtattggggc agggattggt actcaacctt gcagatgaga ctccagcaac aacgtcgtac 2880
tgcttagcga tcgcacatgt ttcatcatcg tcactataca catcgtcatc aactccatgg 2940
cgtgaggact tccgagactg ctgggccctt cgtttcttta atgcctcaag agatgacttc 3000
gtacccgaag agacgcctgt tgtaccccgt tgacgcttgg cggagggggc ttcgtcctcg 3060
tcagcaaccc gcgtcatctg cttccttcgc tgagcaagat accttctctc ctcgtaccgc 3120
tgcatctcct gagctcggtc atacaagatc tcttctcgct caatctctgg cagcgcgtcc 3180
aacttcgccc tgtcttcagc atcgagatat ttgccttcta gaggataggg attgacgacc 3240
tcattgcttg gcggcgacgg cagcgagatt tcctcttcgg agtcggagcc aacgtcggcc 3300
aatgccagca gatcatcatc actgtcactc atagtaggaa ggttgaagtg tgctgacgaa 3360
tcagaatcgc gaaggatgcc attgaaggca tatatatttt aatctgtacc ttttatggta 3420
atttaatcag attttatagg tattcatgtg caagttgcat tgaaggaact gtttgagaaa 3480
atcatcttga ctgaactttt ctcagatatg cattccagcc cgccttttgg taacgctgag 3540
cttcgtgcac aggatctcgt cccttgctat agagcccgcg tccgacgata ataacgtctg 3600
tgccggtctc tatgacgtcg tccacagtac gatactgctg ccccaatcca tcacctttgt 3660
cgtccaggcc caccccagga gtcataatga cccagtcttc ctctggcttt ccgacttttt 3720
gctgagcgat gaaaccaaac acaaatgcgc ggttactgcg agcgatgtct actgtcgctt 3780
gcgagtattc gccgtgagcc agtgtgccct tcgaactcag ttctgcaagc atgacaaggc 3840
cgcgaggttc atccgtagtt tccttcgcag cctcttctag tccgctcaca attcccggcc 3900
caggaacacc gtgagcattt gttatatcag cccattgagc gatcttaaac actccacctg 3960
catattgggc cttaacagtg gaaccgatgt ctgcgaactt tcggtcttca aaaatgagaa 4020
aattgtgctt cgttgaaagc tgtttcaaac cgctgacagt tgtgtcgtat tcgaagtcgt 4080
caattatgtc aatgtgggtc ttaaccatac aaatgtaagg tccaatgcgg tccaggatac 4140
tcagtaactc agaggtagtt cgcacatcca agcttgcgca aagatttgtt tgcttgctca 4200
caatgatgtc gaatagccgg gctgctacag ccggcagcct ctctcggcgc tcctcatagc 4260
tcagcttcat attatttctc tacagtagtg cccgtgccct cgatcagcta ggacttttca 4320
aattaatcgg gctgtttgat gtaagtaaga tgaagtcacg cgcgtgcagg agactgcgtc 4380
ccgcgatatt ctgcaggctt gaaaaattta ccctaacggt aggcatcaag tgagtgagtc 4440
tcagcgtcga tatgggtcaa aaaaggggaa aactagccga gatcgttgcg agctgtttcg 4500
aaaattatgc cctatggcaa ttatcacgtg gagtatccga atttctccag gctgtcaagc 4560
ggcaattata accgagactg agatcgagaa gtatataacc gcagcagtag tggataaata 4620
attgcgaagt cttcccagca gagcgggctg ttttttggag ttggttactg taaaatgcta 4680
aaatgactga caacaatgga gcgtctacag cattggcaac agtgggaaca gtatgctggt 4740
gcatccagtt gataccccag gttctgcgaa actggtatgt tcgggattgc gagggcgttc 4800
ctcctctgat gttctttttg ttcgccgttt cggggattcc cttcgcagtg tacttcattg 4860
atcagaattc gaacactgcc atcatggttc aacctcactt gtttactttc tttagcctta 4920
taggcttttg gcaaagcctg tactatccgc ccgtcagacc agcacgggcc gtcacatgta 4980
tggttgcgtc gctgtataag aaatcttaca actgaagact acacagcgta tccgctccga 5040
tatcggcgat cacgtggata catttcccca gaatgcgtca accttgcatg ctcgatattg 5100
actcaagccg agaggtgtat aacaacaccg acgatagcga attacttgtg gaactgattt 5160
gccgtatcga gtaaatcgcg attgtggccc tctttaggcc ttgtacccat ttgtgcatcg 5220
tatttgttag tatgcatcat agaattatgt gaacttagaa aagtccgtat gaaatgagcc 5280
tcagattatg gattgatcgc ttgttatttg tacagcggaa ttgacttata gtatgtcggc 5340
cacggtttta gattgcctag gggccgtttt cttgatggat tcgcatcgga actccgaatt 5400
cttgattgct ctccatcgcg caggaggccg ttcttttttt gacaaagtcc cattttaggg 5460
cgcaggtcca aaaaataagc ggccgcttaa ttaactggcc tcatgggcct tccgctcact 5520
gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat taacatggtc atagctgttt 5580
ccttgcgtat tgggcgctct ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg 5640
ggtaaagcct ggggtgccta atgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag 5700
gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga 5760
cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct 5820
ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc 5880
tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg 5940
gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc 6000
tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca 6060
ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag 6120
ttcttgaagt ggtggcctaa ctacggctac actagaagaa cagtatttgg tatctgcgct 6180
ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc 6240
accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga 6300
tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca 6360
cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat 6420
taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac 6480
caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt 6540
gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt 6600
gctgcaatga taccgcgaga accacgctca ccggctccag atttatcagc aataaaccag 6660
ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct 6720
attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt 6780
gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc 6840
tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt 6900
agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg 6960
gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg 7020
actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct 7080
tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc 7140
attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt 7200
tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt 7260
tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg 7320
aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta tcagggttat 7380
tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg 7440
cgcacatttc cccgaaaagt gccac 7465
<210> 67
<211> 6856
<212> DNA
<213> 人工的
<220>
<223> 载体
<400> 67
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc 60
attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga 120
gatagggttg agtggccgct acagggcgct cccattcgcc attcaggctg cgcaactgtt 180
gggaagggcg tttcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt 240
gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg 300
acggccagtg agcgcgacgt aatacgactc actatagggc gaattggcgg aaggccgtca 360
aggcctaggc gcgcctgcag gatcctagaa aacagctgga tatggataaa ctcggcaagc 420
atcttttgga aaggctgcca cgctatgccg tgccgatatt cattaagttc gtcgacaccg 480
tcaccatcac cggcaacaac aaagttcaga agaaagaatt ccgaaaccag cagattcccg 540
ccccagcagg acaaacaatc tactggttag agggcacgag ctacaagccc ctcactgctg 600
atgcgtgggc tcgtgtagag aatgggcgac acaagctcta atgttgaata cctcttccgt 660
atagatgtgg cataagctat agattttgct gcaatattat taaatattaa agagtttcga 720
aggtcagctg cggatgaacc agttcagagc ggctctctct tttttgccaa tagcgtgcaa 780
ccgtgaagag caattcaacc atccaatctg gctacactaa attgtatttg gcagcgcact 840
gtacgagcgc actgtacgaa actccgtcaa tttatagcag aacgcgtgcg atcgcgggcc 900
ccagcgatat gacgagaatc aggcaataat agcttaagct gaagtgtttt tagatttagt 960
tcggagtgcg cttctcaaaa gtgctgggat caacaagttt ttaacatggg ttttgattta 1020
tattgtttta tatgagcgcc tcagatatgc gctgacagcc tattaggaga aatggccggc 1080
ctacctagac cttctggtta gcggtattga cgttcatttc aactggaaga aggaattcca 1140
gttcctctcc ttcagcctcg tcgggatcct cctctggaat atgcttgagg attcgcgcag 1200
ggactcctcc caccacagta cgaggaggaa catcttctcg aacgacagca ccagccgcaa 1260
ttgttgagcc atctccaatc gtaacacccg gcaggacagt cacattcgca ccaatccata 1320
cattattccc caccttgata ggaagagcat acacaattct cctcgcacgt ttctcggggc 1380
taataggatg agtcgcagtc acgaacgttg tattgggccc tacaatcacc tcatcaccaa 1440
agattattgg agccgagtcc aagaagcaaa cgttgaagtt ggcgtaaaag tgctcgccta 1500
cgctgatgtt gaatccaaaa tcaactgaga atggagcggt cagccagaca atatcctttg 1560
tttgaccaaa agtgtctttg agaatctcga ccttcttgat ataagcagcg tgatttgact 1620
caaaagtacg actttcactt gcaatggtat tgaactccct aactttctca ctagtagcca 1680
gggctctaaa cataagatct ggatcgtatg gattgtaagg aactcctgag accatcttct 1740
catagttttc attgccaggg gtgtttttga ggtttttttt ggcccaagag accatttcct 1800
ggtcaatttc ttttctagga gtcattcctt tgttttgagg gtccttcgag gagtttacaa 1860
ccatatgtgt agagttgttt ttgttgttaa gtctttcttt aagagcttga ccgactataa 1920
ccgttcaacg gcgcattata tactttgggt atcggccagt gctgacaact cacacgttgc 1980
gaccccttac ccagaagcat acccagcgcg atgtcgatcg tgttatatcg tagacgcaca 2040
ccctgcaatg acgggtaggc tctaaatcgg gatgcgaaaa agaggttgcc ttgctttttg 2100
ccctggtaga tggcatgctg agcgtgcgct tgccgcctaa tttttgtgtg tcgcctgcta 2160
tttattgctg aagctagccc gccgcatctt tccccaaggc ttcgattgct cgtattgggg 2220
cagggattgg tactcaacct tgcagatgag actccagcaa caacgtcgta ctgcttagcg 2280
atcgcacatg tttcatcatc gtcactatac acatcgtcat caactccatg gcgtgaggac 2340
ttccgagact gctgggccct tcgtttcttt aatgcctcaa gagatgactt cgtacccgaa 2400
gagacgcctg ttgtaccccg ttgacgcttg gcggaggggg cttcgtcctc gtcagcaacc 2460
cgcgtcatct gcttccttcg ctgagcaaga taccttctct cctcgtaccg ctgcatctcc 2520
tgagctcggt catacaagat ctcttctcgc tcaatctctg gcagcgcgtc caacttcgcc 2580
ctgtcttcag catcgagata tttgccttct agaggatagg gattgacgac ctcattgctt 2640
ggcggcgacg gcagcgagat ttcctcttcg gagtcggagc caacgtcggc caatgccagc 2700
agatcatcat cactgtcact catagtagga aggttgaagt gtgctgacga atcagaatcg 2760
cgaaggatgc cattgaaggc atatatattt taatctgtac cttttatggt aatttaatca 2820
gattttatag gtattcatgt gcaagttgca ttgaaggaac tgtttgagaa aatcatcttg 2880
actgaacttt tctcagatat gcattccagc ccgccttttg gtaacgctga gcttcgtgca 2940
caggatctcg tcccttgcta tagagcccgc gtccgacgat aataacgtct gtgccggtct 3000
ctatgacgtc gtccacagta cgatactgct gccccaatcc atcacctttg tcgtccaggc 3060
ccaccccagg agtcataatg acccagtctt cctctggctt tccgactttt tgctgagcga 3120
tgaaaccaaa cacaaatgcg cggttactgc gagcgatgtc tactgtcgct tgcgagtatt 3180
cgccgtgagc cagtgtgccc ttcgaactca gttctgcaag catgacaagg ccgcgaggtt 3240
catccgtagt ttccttcgca gcctcttcta gtccgctcac aattcccggc ccaggaacac 3300
cgtgagcatt tgttatatca gcccattgag cgatcttaaa cactccacct gcatattggg 3360
ccttaacagt ggaaccgatg tctgcgaact ttcggtcttc aaaaatgaga aaattgtgct 3420
tcgttgaaag ctgtttcaaa ccgctgacag ttgtgtcgta ttcgaagtcg tcaattatgt 3480
caatgtgggt cttaaccata caaatgtaag gtccaatgcg gtccaggata ctcagtaact 3540
cagaggtagt tcgcacatcc aagcttgcgc aaagatttgt ttgcttgctc acaatgatgt 3600
cgaatagccg ggctgctaca gccggcagcc tctctcggcg ctcctcatag ctcagcttca 3660
tattatttct ctacagtagt gcccgtgccc tcgatcagct aggacttttc aaattaatcg 3720
ggctgtttga tgtaagtaag atgaagtcac gcgcgtgcag gagactgcgt cccgcgatat 3780
tctgcaggct tgaaaaattt accctaacgg taggcatcaa gtgagtgagt ctcagcgtcg 3840
atatgggtca aaaaagggga aaactagccg agatcgttgc gagctgtttc gaaaattatg 3900
ccctatggca attatcacgt ggagtatccg aatttctcca ggctgtcaag cggcaattat 3960
aaccgagact gagatcgaga agtatataac cgcagcagta gtggataaat aattgcgaag 4020
tcttcccagc agagcgggct gttttttgga gttggttact gtaaaatgct aaaatgactg 4080
acaacaatgg agcgtctaca gcattggcaa cagtgggaac agtatgctgg tgcatccagt 4140
tgatacccca ggttctgcga aactggtatg ttcgggattg cgagggcgtt cctcctctga 4200
tgttcttttt gttcgccgtt tcggggattc ccttcgcagt gtacttcatt gatcagaatt 4260
cgaacactgc catcatggtt caacctcact tgtttacttt ctttagcctt ataggctttt 4320
ggcaaagcct gtactatccg cccgtcagac cagcacgggc cgtcacatgt atggttgcgt 4380
cgctgtataa gaaatcttac aactgaagac tacacagcgt atccgctccg atatcggcga 4440
tcacgtggat acatttcccc agaatgcgtc aaccttgcat gctcgatatt gactcaagcc 4500
gagaggtgta taacaacacc gacgatagcg aattacttgt ggaactgatt tgccgtatcg 4560
agtaaatcgc gattgtggcc ctctttaggc cttgtaccca tttgtgcatc gtatttgtta 4620
gtatgcatca tagaattatg tgaacttaga aaagtccgta tgaaatgagc ctcagattat 4680
ggattgatcg cttgttattt gtacagcgga attgacttat agtatgtcgg ccacggtttt 4740
agattgccta ggggccgttt tcttgatgga ttcgcatcgg aactccgaat tcttgattgc 4800
tctccatcgc gcaggaggcc gttctttttt tgacaaagtc ccattttagg gcgcaggtcc 4860
aaaaaataag cggccgctta attaactggc ctcatgggcc ttccgctcac tgcccgcttt 4920
ccagtcggga aacctgtcgt gccagctgca ttaacatggt catagctgtt tccttgcgta 4980
ttgggcgctc tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc gggtaaagcc 5040
tggggtgcct aatgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg 5100
ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt 5160
cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc 5220
ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct 5280
tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc ggtgtaggtc 5340
gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta 5400
tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca 5460
gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag 5520
tggtggccta actacggcta cactagaaga acagtatttg gtatctgcgc tctgctgaag 5580
ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt 5640
agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa 5700
gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg 5760
attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga 5820
agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgctta 5880
atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt tgcctgactc 5940
cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag tgctgcaatg 6000
ataccgcgag aaccacgctc accggctcca gatttatcag caataaacca gccagccgga 6060
agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc tattaattgt 6120
tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt 6180
gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg cttcattcag ctccggttcc 6240
caacgatcaa ggcgagttac atgatccccc atgttgtgca aaaaagcggt tagctccttc 6300
ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt tatcactcat ggttatggca 6360
gcactgcata attctcttac tgtcatgcca tccgtaagat gcttttctgt gactggtgag 6420
tactcaacca agtcattctg agaatagtgt atgcggcgac cgagttgctc ttgcccggcg 6480
tcaatacggg ataataccgc gccacatagc agaactttaa aagtgctcat cattggaaaa 6540
cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt tgagatccag ttcgatgtaa 6600
cccactcgtg cacccaactg atcttcagca tcttttactt tcaccagcgt ttctgggtga 6660
gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg gaaatgttga 6720
atactcatac tcttcctttt tcaatattat tgaagcattt atcagggtta ttgtctcatg 6780
agcggataca tatttgaatg tatttagaaa aataaacaaa taggggttcc gcgcacattt 6840
ccccgaaaag tgccac 6856
<210> 68
<211> 9973
<212> DNA
<213> 人工的
<220>
<223> 载体
<400> 68
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc 60
attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga 120
gatagggttg agtggccgct acagggcgct cccattcgcc attcaggctg cgcaactgtt 180
gggaagggcg tttcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt 240
gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg 300
acggccagtg agcgcgacgt aatacgactc actatagggc gaattggcgg aaggccgtca 360
aggcctaggc gcgcctgcag gatcctagaa aacagctgga tatggataaa ctcggcaagc 420
atcttttgga aaggctgcca cgctatgccg tgccgatatt cattaagttc gtcgacaccg 480
tcaccatcac cggcaacaac aaagttcaga agaaagaatt ccgaaaccag cagattcccg 540
ccccagcagg acaaacaatc tactggttag agggcacgag ctacaagccc ctcactgctg 600
atgcgtgggc tcgtgtagag aatgggcgac acaagctcta atgttgaata cctcttccgt 660
atagatgtgg cataagctat agattttgct gcaatattat taaatattaa agagtttcga 720
aggtcagctg cggatgaacc agttcagagc ggctctctct tttttgccaa tagcgtgcaa 780
ccgtgaagag caattcaacc atccaatctg gctacactaa attgtatttg gcagcgcact 840
gtacgagcgc actgtacgaa actccgtcaa tttatagcag aacgcgtgcg atcgcgggcc 900
ccagcgatat gacgagaatc aggcaataat agcttaagct gaagtgtttt tagatttagt 960
tcggagtgcg cttctcaaaa gtgctgggat caacaagttt ttaacatggg ttttgattta 1020
tattgtttta tatgagcgcc tcagatatgc gctgacagcc tattaggaga aatggccggc 1080
ctcaaatctc tccgagacct tgcaagttca ccaattcagc gtaccatcca ttgagttcaa 1140
ggaggctctg atggtcgccc tgctccacga tgcgccctcc tgagaacaca tatatgacat 1200
ctgctttctg aattgttgat aatctatgcg caacggcgat tgtagtacgg cccttcgctg 1260
ctgcgtcgag tgctgcttga actactttct cagattcgga atccagagct gaggtggcct 1320
catcgaggag gagtaccttt ggatttctga tcagggccct tgcaattgca attcgctgct 1380
tttgcccccc agatagcaac gatcccctag atccgctgag cgtttcgtag ccatcaggca 1440
acgacatgat gaattcgtga atgttcgctt tgcgagcggc atcctcaatc atctcctgcg 1500
ttacttcaga ctcagggcca gaccatccca ttagaatatt ctcacgtagc gtgcctgaat 1560
aaagcattgg ttcttgctgg actaaagcaa tgtgtgatct caatgcattc aggttatatt 1620
cgcgtaaatc tttcccatcg aaaagtactt gacctgctaa tggatcataa aatctttcca 1680
ccagtccaat agtagtagac ttaccgcatc cactggctcc aactagagcg atgtattggc 1740
cctttttgac tgttaagttg agatcttgta aaactggtac ttgaggtcga gtaggatatc 1800
ggaaattcac atgacggaac tcaatatctc ctctcaccga ctcctcggga gcaacgtaac 1860
cttcctcact ccatacatct atagaaggag tggcagtcaa gattctgtaa atgttacgcg 1920
ctgcatcttt ggctgagttc atgtttggag catagctgaa aatttggcca gcggcttgag 1980
aacctgtaat aatagccatg aagacagtca tatatcctgc gaccgaagct tcacctcgtc 2040
tcattacagt gcttccccac caaaaaacga gggctaccac ccagggtgtc attccttccg 2100
agagtgcgta gtacaatgct gagcgggcaa tggcaattct ggagctgaaa atctgagagt 2160
ctactgtctt tgtgtatttt acgaccacgt ctaactcacg agttaaggac tggactgtgc 2220
ggacagcact tgtatactca gatgccatgg agccacttcg ttcgtaaact tctctcgcac 2280
gatccgataa ttgggtaaga acccagactc tgacgaagcc acacaccaac atgacaggaa 2340
caacagacgt agccacgagt ccaattctcc aattgaaagg tataccagta actatgccgc 2400
caatcaaggt caccagactc tgttgaattt gaccgagggt ggccccactc aaaccctcga 2460
tcattttagc ttccttcgcc aaaattgagg ttagcgcacc cggcgtgttg tttttgtggt 2520
cgaagaatgc aatatccatt cgcatcaatt ggcggaacaa agctaatctg atatttttga 2580
ccaacttatc agatgcaagt gataaagcag ctatagtgat aaaagccgtc atgaatgaaa 2640
tgcagcctac gaaaaaatac caccatccca tgatattcac cacatgccgc atttttccgt 2700
attcactggg aggtagaacc atgcttccag tggtttggcc agttattatt gccattgcag 2760
gatagcaata gcccaaaata atggaggcta aactaccaat gagaatgtaa ccccattctt 2820
tcctattcag cccccaaacc agtttggtat tggtcatcaa cgtgctatgt ggggggttgc 2880
gcacaccagg gatgtcattt tcttgatatt caggaggttg agtggtctga gtacctgcac 2940
tgtgaacact caatgtgctc acatccttgg gattgaactt ttcgttcagt gagtccagag 3000
gcgaaatgtc tagagcttca atatcgagga cctcaacgtt agtgctcttt gctttagtta 3060
ctctttgagc atcaaccaaa gctttataag gcccttctcg ctgtatgagc tcattgtgag 3120
taccctgctc tatgacgtta cctttagaca tgacaactat cttgttggca tccttgatcg 3180
tagagagtct gtgtgcaacg actatagtgg tacgaccttc ggccgctttg tcgagcgcat 3240
cttgaacgat accttcagat ttggtatcca gagcagaagt cgcttcatcg agcagcagaa 3300
ttttagggtc tgagacgatt gctcttgcta ttgcaatgcg ttgtttctga ccaccgctga 3360
gaagaaatcc tcgatctcca acattggttt ggatgccttc tgagagagtc tgaatgaaat 3420
cccaggcatt ggcatcttta caagcttgaa tgattttagc ttccttaaca tgctcgtcag 3480
cgaactcaat gtcagtgcca atcaaaccat agctgatatt ctcatatatt gactctgaaa 3540
agagtactgg ttcctgctga acataaccaa tttgttgacg gagccatctt gtgttcaggt 3600
cgctaatctc ctggccatcc agagtaacgc ttccttcgag aggtaaatag aacctctcaa 3660
gaatacctac aattgtagac ttccctgatc ccgaggcacc taccagtgcc acagtagatc 3720
cagcaggaac ttcaaggcta aaatcggaga ggaccaaaac gtctgggcga ctaggatatc 3780
ggaacttgac atttttgagc tcaattctgc caacggcctt agtttggggg acaattcctt 3840
tatctatgga ctggccatcg atgactggga cacgatcaat ggcctcattg agaatgctcg 3900
cggcagtgag acccttgaca agaaacctca cgtttggcgc gatattccca agctggaagc 3960
ttccaagtaa catagctgtg attacaacta ttatctttcc aacgtcagca ctcccactaa 4020
cgatttctct ggaaccctgc cacagagcta aggcatacac ccaaaaagta ctagcccaaa 4080
tgcacgctaa catgaccccc aatgagtaac tgctccgctt cgattccttc acaacacgat 4140
caagtacctt ttcatacttg acggcgagat gaggttgagc gccaaatgct actgtagtcc 4200
tgacagcact gagagcctcc tccgcaacgg tagctccaga ctgcgaatat atcgcgtcag 4260
atctgagctg atatttggcc atgaaggtgg cgccagttcc cattgtgatt accatgaacc 4320
ctacagcact caggaggatg caagccagtt tccattgcga agcaaaactt ataacggtgg 4380
ccgcaatgaa ggaagctatt ccctgtacga cgtttccaag cttgtcgctg atcgcttcct 4440
gaattgagtt ggtatcgtta atgattctgg tgctgacctc gccaccacct agtttgtcgt 4500
aaaacgcgat attctggcga ataacagcac tcagataatg ctttcggtaa cgtcctgcca 4560
acacttcgcc tctgtccaca agcaggaagc tctcgagaaa cgcactgccg agcataccaa 4620
tgccaatata gacaaaatag agagacaggt gattcacctt atgctggaac tcattgccct 4680
tgaggtcata gctagtgaag tctctgaatg tgttgaagat ggcgcccact actaacgtga 4740
acattggaag cgcggctcca tgcaccgctg caaaaaaaag cgcaagtatc tccaagaaaa 4800
cgtcaagggg agtgcaaaat ctgaacaacc tgaaaaagct tgtggcgact ctctttgttt 4860
caagctgact tcgcaataca ttggcctcat gtggatctaa cgcagagagc ttctcctcga 4920
gaagcttgtc cttagtctcg atgagtttct cacgcttctc tacctgtata tcatccacca 4980
tatgtgtaga gttgtttttg ttgttaagtc tttctttaag agcttgaccg actataaccg 5040
ttcaacggcg cattatatac tttgggtatc ggccagtgct gacaactcac acgttgcgac 5100
cccttaccca gaagcatacc cagcgcgatg tcgatcgtgt tatatcgtag acgcacaccc 5160
tgcaatgacg ggtaggctct aaatcgggat gcgaaaaaga ggttgccttg ctttttgccc 5220
tggtagatgg catgctgagc gtgcgcttgc cgcctaattt ttgtgtgtcg cctgctattt 5280
attgctgaag ctagcccgcc gcatctttcc ccaaggcttc gattgctcgt attggggcag 5340
ggattggtac tcaaccttgc agatgagact ccagcaacaa cgtcgtactg cttagcgatc 5400
gcacatgttt catcatcgtc actatacaca tcgtcatcaa ctccatggcg tgaggacttc 5460
cgagactgct gggcccttcg tttctttaat gcctcaagag atgacttcgt acccgaagag 5520
acgcctgttg taccccgttg acgcttggcg gagggggctt cgtcctcgtc agcaacccgc 5580
gtcatctgct tccttcgctg agcaagatac cttctctcct cgtaccgctg catctcctga 5640
gctcggtcat acaagatctc ttctcgctca atctctggca gcgcgtccaa cttcgccctg 5700
tcttcagcat cgagatattt gccttctaga ggatagggat tgacgacctc attgcttggc 5760
ggcgacggca gcgagatttc ctcttcggag tcggagccaa cgtcggccaa tgccagcaga 5820
tcatcatcac tgtcactcat agtaggaagg ttgaagtgtg ctgacgaatc agaatcgcga 5880
aggatgccat tgaaggcata tatattttaa tctgtacctt ttatggtaat ttaatcagat 5940
tttataggta ttcatgtgca agttgcattg aaggaactgt ttgagaaaat catcttgact 6000
gaacttttct cagatatgca ttccagcccg ccttttggta acgctgagct tcgtgcacag 6060
gatctcgtcc cttgctatag agcccgcgtc cgacgataat aacgtctgtg ccggtctcta 6120
tgacgtcgtc cacagtacga tactgctgcc ccaatccatc acctttgtcg tccaggccca 6180
ccccaggagt cataatgacc cagtcttcct ctggctttcc gactttttgc tgagcgatga 6240
aaccaaacac aaatgcgcgg ttactgcgag cgatgtctac tgtcgcttgc gagtattcgc 6300
cgtgagccag tgtgcccttc gaactcagtt ctgcaagcat gacaaggccg cgaggttcat 6360
ccgtagtttc cttcgcagcc tcttctagtc cgctcacaat tcccggccca ggaacaccgt 6420
gagcatttgt tatatcagcc cattgagcga tcttaaacac tccacctgca tattgggcct 6480
taacagtgga accgatgtct gcgaactttc ggtcttcaaa aatgagaaaa ttgtgcttcg 6540
ttgaaagctg tttcaaaccg ctgacagttg tgtcgtattc gaagtcgtca attatgtcaa 6600
tgtgggtctt aaccatacaa atgtaaggtc caatgcggtc caggatactc agtaactcag 6660
aggtagttcg cacatccaag cttgcgcaaa gatttgtttg cttgctcaca atgatgtcga 6720
atagccgggc tgctacagcc ggcagcctct ctcggcgctc ctcatagctc agcttcatat 6780
tatttctcta cagtagtgcc cgtgccctcg atcagctagg acttttcaaa ttaatcgggc 6840
tgtttgatgt aagtaagatg aagtcacgcg cgtgcaggag actgcgtccc gcgatattct 6900
gcaggcttga aaaatttacc ctaacggtag gcatcaagtg agtgagtctc agcgtcgata 6960
tgggtcaaaa aaggggaaaa ctagccgaga tcgttgcgag ctgtttcgaa aattatgccc 7020
tatggcaatt atcacgtgga gtatccgaat ttctccaggc tgtcaagcgg caattataac 7080
cgagactgag atcgagaagt atataaccgc agcagtagtg gataaataat tgcgaagtct 7140
tcccagcaga gcgggctgtt ttttggagtt ggttactgta aaatgctaaa atgactgaca 7200
acaatggagc gtctacagca ttggcaacag tgggaacagt atgctggtgc atccagttga 7260
taccccaggt tctgcgaaac tggtatgttc gggattgcga gggcgttcct cctctgatgt 7320
tctttttgtt cgccgtttcg gggattccct tcgcagtgta cttcattgat cagaattcga 7380
acactgccat catggttcaa cctcacttgt ttactttctt tagccttata ggcttttggc 7440
aaagcctgta ctatccgccc gtcagaccag cacgggccgt cacatgtatg gttgcgtcgc 7500
tgtataagaa atcttacaac tgaagactac acagcgtatc cgctccgata tcggcgatca 7560
cgtggataca tttccccaga atgcgtcaac cttgcatgct cgatattgac tcaagccgag 7620
aggtgtataa caacaccgac gatagcgaat tacttgtgga actgatttgc cgtatcgagt 7680
aaatcgcgat tgtggccctc tttaggcctt gtacccattt gtgcatcgta tttgttagta 7740
tgcatcatag aattatgtga acttagaaaa gtccgtatga aatgagcctc agattatgga 7800
ttgatcgctt gttatttgta cagcggaatt gacttatagt atgtcggcca cggttttaga 7860
ttgcctaggg gccgttttct tgatggattc gcatcggaac tccgaattct tgattgctct 7920
ccatcgcgca ggaggccgtt ctttttttga caaagtccca ttttagggcg caggtccaaa 7980
aaataagcgg ccgcttaatt aactggcctc atgggccttc cgctcactgc ccgctttcca 8040
gtcgggaaac ctgtcgtgcc agctgcatta acatggtcat agctgtttcc ttgcgtattg 8100
ggcgctctcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggg taaagcctgg 8160
ggtgcctaat gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg 8220
gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag 8280
aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc 8340
gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg 8400
ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt 8460
cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc 8520
ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc 8580
actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg 8640
tggcctaact acggctacac tagaagaaca gtatttggta tctgcgctct gctgaagcca 8700
gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 8760
ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat 8820
cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt 8880
ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt 8940
tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc 9000
agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc 9060
gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata 9120
ccgcgagaac cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg 9180
gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc 9240
cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct 9300
acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa 9360
cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt 9420
cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca 9480
ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac 9540
tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca 9600
atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt 9660
tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc 9720
actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca 9780
aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata 9840
ctcatactct tcctttttca atattattga agcatttatc agggttattg tctcatgagc 9900
ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc 9960
cgaaaagtgc cac 9973
<210> 69
<211> 7375
<212> DNA
<213> 人工的
<220>
<223> 载体
<400> 69
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc 60
attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga 120
gatagggttg agtggccgct acagggcgct cccattcgcc attcaggctg cgcaactgtt 180
gggaagggcg tttcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt 240
gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg 300
acggccagtg agcgcgacgt aatacgactc actatagggc gaattggcgg aaggccgtca 360
aggcctaggc gcgcctgcag gatcctagaa aacagctgga tatggataaa ctcggcaagc 420
atcttttgga aaggctgcca cgctatgccg tgccgatatt cattaagttc gtcgacaccg 480
tcaccatcac cggcaacaac aaagttcaga agaaagaatt ccgaaaccag cagattcccg 540
ccccagcagg acaaacaatc tactggttag agggcacgag ctacaagccc ctcactgctg 600
atgcgtgggc tcgtgtagag aatgggcgac acaagctcta atgttgaata cctcttccgt 660
atagatgtgg cataagctat agattttgct gcaatattat taaatattaa agagtttcga 720
aggtcagctg cggatgaacc agttcagagc ggctctctct tttttgccaa tagcgtgcaa 780
ccgtgaagag caattcaacc atccaatctg gctacactaa attgtatttg gcagcgcact 840
gtacgagcgc actgtacgaa actccgtcaa tttatagcag aacgcgtgcg atcgcgggcc 900
ccagcgatat gacgagaatc aggcaataat agcttaagct gaagtgtttt tagatttagt 960
tcggagtgcg cttctcaaaa gtgctgggat caacaagttt ttaacatggg ttttgattta 1020
tattgtttta tatgagcgcc tcagatatgc gctgacagcc tattaggaga aatggccggc 1080
caggttaaga agctaattca ctaattgccg actctagaat atcaagagac ttgtattttt 1140
caagctcttt cttgactgcc atggctttct cgtgatacga gggagtagcc aacacctcct 1200
taacggccgt ggagactagc tcagaagttg cctgcaaggt ttgaagatca taaccaacac 1260
cagcccatac agctcgtgaa gcaacagctg gcttgtctac caacattcct cctccgatga 1320
tgacgggaac gccatggctc aaactgtgct gcagacctcc gtatccaccg ttgtatatga 1380
aaacagaggc atgcggtagt agctcatcgt aaggaaaata atcaacaatt cgagcgtttg 1440
caggaacttt aacgctatca ggaagtgacg cccctttgac gcccaatata ccaactacga 1500
gagtgtcttc ttcgtcagca aaggcctgca atgctggaat gagcagatct tcatagttga 1560
tggctgctgt tccttgtgta acaacaatca gacgcttcgc actcagcaca tcaggccacc 1620
aagacggcag gtgaggtgga gttgctaatc cagcagactt tacatgcggt gcactaccag 1680
cgaacgagaa gccaggagga ggcgaagtca agtgaaattc aagagatgga gggcacagtt 1740
gcaaaaatct gtcagggctg ctgtatatat tctccaggag aaattcgggc tccttcgtgg 1800
ccccgagcgt cttcatgatc tccttctcag agtcagttcc tggttgaaat acttgttgcc 1860
gcactaaagt atcaatcatt ggctcaagac taggaactcc aggcgccttc tctgctttca 1920
gcatgcacgg aatagttcct aacgtgatta cgccttgggg cttgagacct ggggcaccca 1980
gtgatatcgg atgcacccct agaaacatgg tctcgccaat caccacagct gatttatttt 2040
cagcctcaac ctgttttaga gcagtttgaa gtgcatcgta ctgctcagga atcgccttca 2100
caaaaatctc attcattgag taaccggtct gctcaaggcc tggaggaatc gtgagcaatc 2160
ctggagcgat ttcagggaga ttgtattcat ggtagtcagc tcgtccttgg agagggacga 2220
aagtgcatcc tgcctcaata actttctcct tgaatgcgtt ccctgttacg aaagtcacct 2280
catatcctct attgagtaga ccgcggacca ggctgagcac tgggcccacg tgccccgcta 2340
gtgggcaggc acaagcaact atcactggtt tctcgatggc catatgtgta gagttgtttt 2400
tgttgttaag tctttcttta agagcttgac cgactataac cgttcaacgg cgcattatat 2460
actttgggta tcggccagtg ctgacaactc acacgttgcg accccttacc cagaagcata 2520
cccagcgcga tgtcgatcgt gttatatcgt agacgcacac cctgcaatga cgggtaggct 2580
ctaaatcggg atgcgaaaaa gaggttgcct tgctttttgc cctggtagat ggcatgctga 2640
gcgtgcgctt gccgcctaat ttttgtgtgt cgcctgctat ttattgctga agctagcccg 2700
ccgcatcttt ccccaaggct tcgattgctc gtattggggc agggattggt actcaacctt 2760
gcagatgaga ctccagcaac aacgtcgtac tgcttagcga tcgcacatgt ttcatcatcg 2820
tcactataca catcgtcatc aactccatgg cgtgaggact tccgagactg ctgggccctt 2880
cgtttcttta atgcctcaag agatgacttc gtacccgaag agacgcctgt tgtaccccgt 2940
tgacgcttgg cggagggggc ttcgtcctcg tcagcaaccc gcgtcatctg cttccttcgc 3000
tgagcaagat accttctctc ctcgtaccgc tgcatctcct gagctcggtc atacaagatc 3060
tcttctcgct caatctctgg cagcgcgtcc aacttcgccc tgtcttcagc atcgagatat 3120
ttgccttcta gaggataggg attgacgacc tcattgcttg gcggcgacgg cagcgagatt 3180
tcctcttcgg agtcggagcc aacgtcggcc aatgccagca gatcatcatc actgtcactc 3240
atagtaggaa ggttgaagtg tgctgacgaa tcagaatcgc gaaggatgcc attgaaggca 3300
tatatatttt aatctgtacc ttttatggta atttaatcag attttatagg tattcatgtg 3360
caagttgcat tgaaggaact gtttgagaaa atcatcttga ctgaactttt ctcagatatg 3420
cattccagcc cgccttttgg taacgctgag cttcgtgcac aggatctcgt cccttgctat 3480
agagcccgcg tccgacgata ataacgtctg tgccggtctc tatgacgtcg tccacagtac 3540
gatactgctg ccccaatcca tcacctttgt cgtccaggcc caccccagga gtcataatga 3600
cccagtcttc ctctggcttt ccgacttttt gctgagcgat gaaaccaaac acaaatgcgc 3660
ggttactgcg agcgatgtct actgtcgctt gcgagtattc gccgtgagcc agtgtgccct 3720
tcgaactcag ttctgcaagc atgacaaggc cgcgaggttc atccgtagtt tccttcgcag 3780
cctcttctag tccgctcaca attcccggcc caggaacacc gtgagcattt gttatatcag 3840
cccattgagc gatcttaaac actccacctg catattgggc cttaacagtg gaaccgatgt 3900
ctgcgaactt tcggtcttca aaaatgagaa aattgtgctt cgttgaaagc tgtttcaaac 3960
cgctgacagt tgtgtcgtat tcgaagtcgt caattatgtc aatgtgggtc ttaaccatac 4020
aaatgtaagg tccaatgcgg tccaggatac tcagtaactc agaggtagtt cgcacatcca 4080
agcttgcgca aagatttgtt tgcttgctca caatgatgtc gaatagccgg gctgctacag 4140
ccggcagcct ctctcggcgc tcctcatagc tcagcttcat attatttctc tacagtagtg 4200
cccgtgccct cgatcagcta ggacttttca aattaatcgg gctgtttgat gtaagtaaga 4260
tgaagtcacg cgcgtgcagg agactgcgtc ccgcgatatt ctgcaggctt gaaaaattta 4320
ccctaacggt aggcatcaag tgagtgagtc tcagcgtcga tatgggtcaa aaaaggggaa 4380
aactagccga gatcgttgcg agctgtttcg aaaattatgc cctatggcaa ttatcacgtg 4440
gagtatccga atttctccag gctgtcaagc ggcaattata accgagactg agatcgagaa 4500
gtatataacc gcagcagtag tggataaata attgcgaagt cttcccagca gagcgggctg 4560
ttttttggag ttggttactg taaaatgcta aaatgactga caacaatgga gcgtctacag 4620
cattggcaac agtgggaaca gtatgctggt gcatccagtt gataccccag gttctgcgaa 4680
actggtatgt tcgggattgc gagggcgttc ctcctctgat gttctttttg ttcgccgttt 4740
cggggattcc cttcgcagtg tacttcattg atcagaattc gaacactgcc atcatggttc 4800
aacctcactt gtttactttc tttagcctta taggcttttg gcaaagcctg tactatccgc 4860
ccgtcagacc agcacgggcc gtcacatgta tggttgcgtc gctgtataag aaatcttaca 4920
actgaagact acacagcgta tccgctccga tatcggcgat cacgtggata catttcccca 4980
gaatgcgtca accttgcatg ctcgatattg actcaagccg agaggtgtat aacaacaccg 5040
acgatagcga attacttgtg gaactgattt gccgtatcga gtaaatcgcg attgtggccc 5100
tctttaggcc ttgtacccat ttgtgcatcg tatttgttag tatgcatcat agaattatgt 5160
gaacttagaa aagtccgtat gaaatgagcc tcagattatg gattgatcgc ttgttatttg 5220
tacagcggaa ttgacttata gtatgtcggc cacggtttta gattgcctag gggccgtttt 5280
cttgatggat tcgcatcgga actccgaatt cttgattgct ctccatcgcg caggaggccg 5340
ttcttttttt gacaaagtcc cattttaggg cgcaggtcca aaaaataagc ggccgcttaa 5400
ttaactggcc tcatgggcct tccgctcact gcccgctttc cagtcgggaa acctgtcgtg 5460
ccagctgcat taacatggtc atagctgttt ccttgcgtat tgggcgctct ccgcttcctc 5520
gctcactgac tcgctgcgct cggtcgttcg ggtaaagcct ggggtgccta atgagcaaaa 5580
ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc 5640
cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca 5700
ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg 5760
accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct 5820
catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt 5880
gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag 5940
tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc 6000
agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac 6060
actagaagaa cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga 6120
gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc 6180
aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg 6240
gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca 6300
aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt 6360
atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca 6420
gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta gataactacg 6480
atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga accacgctca 6540
ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg cagaagtggt 6600
cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc tagagtaagt 6660
agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca 6720
cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag gcgagttaca 6780
tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat cgttgtcaga 6840
agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa ttctcttact 6900
gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa gtcattctga 6960
gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga taataccgcg 7020
ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg gcgaaaactc 7080
tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc acccaactga 7140
tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg aaggcaaaat 7200
gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact cttccttttt 7260
caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat atttgaatgt 7320
atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt gccac 7375
<210> 70
<211> 5316
<212> DNA
<213> 人工的
<220>
<223> 敲除盒
<400> 70
ggacctgcgc cctaaaatgg gactttgtca aaaaaagaac ggcctcctgc gcgatggaga 60
gcaatcaaga attcggagtt ccgatgcgaa tccatcaaga aaacggcccc taggcaatct 120
aaaaccgtgg ccgacatact ataagtcaat tccgctgtac aaataacaag cgatcaatcc 180
ataatctgag gctcatttca tacggacttt tctaagttca cataattcta tgatgcatac 240
taacaaatac gatgcacaaa tgggtacaag gcctaaagag ggccacaatc gcgatttact 300
cgatacggca aatcagttcc acaagtaatt cgctatcgtc ggtgttgtta tacacctctc 360
ggcttgagtc aatatcgagc atgcaaggtt gacgcattct ggggaaatgt atccacgtga 420
tcgccgatat cggagcggat acgctgtgta gtcttcagtt gtaagatttc ttatacagcg 480
acgcaaccat acatgtgacg gcccgtgctg gtctgacggg cggatagtac aggctttgcc 540
aaaagcctat aaggctaaag aaagtaaaca agtgaggttg aaccatgatg gcagtgttcg 600
aattctgatc aatgaagtac actgcgaagg gaatccccga aacggcgaac aaaaagaaca 660
tcagaggagg aacgccctcg caatcccgaa cataccagtt tcgcagaacc tggggtatca 720
actggatgca ccagcatact gttcccactg ttgccaatgc tgtagacgct ccattgttgt 780
cagtcatttt agcattttac agtaaccaac tccaaaaaac agcccgctct gctgggaaga 840
cttcgcaatt atttatccac tactgctgcg gttatatact tctcgatctc agtctcggtt 900
ataattgccg cttgacagcc tggagaaatt cggatactcc acgtgataat tgccataggg 960
cataattttc gaaacagctc gcaacgatct cggctagttt tccccttttt tgacccatat 1020
cgacgctgag actcactcac ttgatgccta ccgttagggt aaatttttca agcctgcaga 1080
atatcgcggg acgcagtctc ctgcacgcgc gtgacttcat cttacttaca tcaaacagcc 1140
cgattaattt gaaaagtcct agctgatcga gggcacgggc actactgtag agaaataata 1200
tgaagctgag ctatgaggag cgccgagaga ggctgccggc tgtagcagcc cggctattcg 1260
acatcattgt gagcaagcaa acaaatcttt gcgcaagctt ggatgtgcga actacctctg 1320
agttactgag tatcctggac cgcattggac cttacatttg tatggttaag acccacattg 1380
acataattga cgacttcgaa tacgacacaa ctgtcagcgg tttgaaacag ctttcaacga 1440
agcacaattt tctcattttt gaagaccgaa agttcgcaga catcggttcc actgttaagg 1500
cccaatatgc aggtggagtg tttaagatcg ctcaatgggc tgatataaca aatgctcacg 1560
gtgttcctgg gccgggaatt gtgagcggac tagaagaggc tgcgaaggaa actacggatg 1620
aacctcgcgg ccttgtcatg cttgcagaac tgagttcgaa gggcacactg gctcacggcg 1680
aatactcgca agcgacagta gacatcgctc gcagtaaccg cgcatttgtg tttggtttca 1740
tcgctcagca aaaagtcgga aagccagagg aagactgggt cattatgact cctggggtgg 1800
gcctggacga caaaggtgat ggattggggc agcagtatcg tactgtggac gacgtcatag 1860
agaccggcac agacgttatt atcgtcggac gcgggctcta tagcaaggga cgagatcctg 1920
tgcacgaagc tcagcgttac caaaaggcgg gctggaatgc atatctgaga aaagttcagt 1980
caagatgatt ttctcaaaca gttccttcaa tgcaacttgc acatgaatac ctataaaatc 2040
tgattaaatt accataaaag gtacagatta aaatatatat gccttcaatg gcatccttcg 2100
cgattctgat tcgtcagcac acttcaacct tcctactatg agtgacagtg atgatgatct 2160
gctggcattg gccgacgttg gctccgactc cgaagaggaa atctcgctgc cgtcgccgcc 2220
aagcaatgag gtcgtcaatc cctatcctct agaaggcaaa tatctcgatg ctgaagacag 2280
ggcgaagttg gacgcgctgc cagagattga gcgagaagag atcttgtatg accgagctca 2340
ggagatgcag cggtacgagg agagaaggta tcttgctcag cgaaggaagc agatgacgcg 2400
ggttgctgac gaggacgaag ccccctccgc caagcgtcaa cggggtacaa caggcgtctc 2460
ttcgggtacg aagtcatctc ttgaggcatt aaagaaacga agggcccagc agtctcggaa 2520
gtcctcacgc catggagttg atgacgatgt gtatagtgac gatgatgaaa catgtgcgat 2580
cgctaagcag tacgacgttg ttgctggagt ctcatctgca aggttgagta ccaatccctg 2640
ccccaatacg agcaatcgaa gccttgggga aagatgcggc gggctagctt cagcaataaa 2700
tagcaggcga cacacaaaaa ttaggcggca agcgcacgct cagcatgcca tctaccaggg 2760
caaaaagcaa ggcaacctct ttttcgcatc ccgatttaga gcctacccgt cattgcaggg 2820
tgtgcgtcta cgatataaca cgatcgacat cgcgctgggt atgcttctgg gtaaggggtc 2880
gcaacgtgtg agttgtcagc actggccgat acccaaagta tataatgcgc cgttgaacgg 2940
ttatagtcgg tcaagctctt aaagaaagac ttaacaacaa aaacaactct acacatatgt 3000
taatcaaaga cattattcta actccaatga gtttatccgc tgttgctggc ttgttgccac 3060
tgctcttcgt agctttctta gttctacacg agcctatctg gctcctatgg taccgctatg 3120
cagcacgtag gcacaagtgt agtatgcctc gcttcattga gaaatcgttc ccactgggaa 3180
tacaaagaac catggacatg atcaagacgg ccaagtcata caccttactg gaagttcaat 3240
acgacagagt cttcaataag ttcaaagcac ggacgtatct tcgacaagct ccccttcaat 3300
accaaatctt cacaatcgag ccagaaaaca ttaagacaat cctggcaacc aaattcaatg 3360
attttggtct tggagcacgt ttccacacag tgggaaaagt gtttggccaa gggatattta 3420
cactcagcgg aaatggatgg aaacagtctc gatcgatgtt gagacctcag ttcactaaag 3480
atcaggtttg cagaattgat cagatttcca gtcatgctgc ggagttaata aaggagatga 3540
accgtgcaat gaaagtggac caatttattg atgttcaaca ttatttccac aaacttacgc 3600
tggatacagc gactgaattc ctatttgggg agtcctgcga gagcttgaac cctgagaatc 3660
agtcatgtat tgtagcccgt gatggttcgg agattactgc cgaacaattc gtggagtcct 3720
acaactttct actgaattac gctttcaaac ggaccctatc aagcaaagtc tactggttgt 3780
tcaactctaa ggaattccga gatcacaaga aacgtgctca gtcctatatt gactactacg 3840
ttgataaggc tctttacgcc acatctttcg ctgctgagaa ctctattgca gagaaggatg 3900
ctgctgcaga gtctagtggc atctatgtgt tctcgcttga gatggctaaa gttacccgag 3960
acccagtgac gatacgtgat caaattttca acattctcat tgctggtaga gatacaacag 4020
ctgctacgtt gagcttcgct attcatttcc ttgccagaaa tcctgacgta ttcaacaaac 4080
tacgtgagga ggtcctcgat cattttggaa ccaaggagga gcaaaggcct ttatcattcg 4140
aacttctgaa gcaagcacct tatttgaagc aagttataaa tgaagtcttg cgtcttgcgc 4200
cggtattgcc attgaacttc cgtactgctg tgagagatac aactctaccc ataggtggtg 4260
gtcccgagca gaaggatccg atcttcgttc ctaagggcac cgcagtttac tattcaattt 4320
acatggtcca cagggacatc aagtattggg gtcctgacgc ccacgaattc aatcccaatc 4380
gatgggagaa cttgaagcta gataatgtgt gggcattctt gcccttcaat ggcggtcccc 4440
gaatttgtct cggccaacaa ttcgccctga cagagctttc gctaactctg gtgagactct 4500
tacaggagta ttccaagatt gagatgggtc ccgacttccc agagtcccct cgtttctcaa 4560
caacgcttac agctcaacac gctcctcccg gtgtggttgt gcggttttct taagttggcc 4620
ggccatttct cctaataggc tgtcagcgca tatctgaggc gctcatataa aacaatataa 4680
atcaaaaccc atgttaaaaa cttgttgatc ccagcacttt tgagaagcgc actccgaact 4740
aaatctaaaa acacttcagc ttaagctatt attgcctgat tctcgtcata tcgctggggc 4800
ccgcgatcgc acgcgttctg ctataaattg acggagtttc gtacagtgcg ctcgtacagt 4860
gcgctgccaa atacaattta gtgtagccag attggatggt tgaattgctc ttcacggttg 4920
cacgctattg gcaaaaaaga gagagccgct ctgaactggt tcatccgcag ctgaccttcg 4980
aaactcttta atatttaata atattgcagc aaaatctata gcttatgcca catctatacg 5040
gaagaggtat tcaacattag agcttgtgtc gcccattctc tacacgagcc cacgcatcag 5100
cagtgagggg cttgtagctc gtgccctcta accagtagat tgtttgtcct gctggggcgg 5160
gaatctgctg gtttcggaat tctttcttct gaactttgtt gttgccggtg atggtgacgg 5220
tgtcgacgaa cttaatgaat atcggcacgg catagcgtgg cagcctttcc aaaagatgct 5280
tgccgagttt atccatatcc agctgttttc taggat 5316
<210> 71
<211> 5088
<212> DNA
<213> 人工的
<220>
<223> 敲除盒
<400> 71
ggacctgcgc cctaaaatgg gactttgtca aaaaaagaac ggcctcctgc gcgatggaga 60
gcaatcaaga attcggagtt ccgatgcgaa tccatcaaga aaacggcccc taggcaatct 120
aaaaccgtgg ccgacatact ataagtcaat tccgctgtac aaataacaag cgatcaatcc 180
ataatctgag gctcatttca tacggacttt tctaagttca cataattcta tgatgcatac 240
taacaaatac gatgcacaaa tgggtacaag gcctaaagag ggccacaatc gcgatttact 300
cgatacggca aatcagttcc acaagtaatt cgctatcgtc ggtgttgtta tacacctctc 360
ggcttgagtc aatatcgagc atgcaaggtt gacgcattct ggggaaatgt atccacgtga 420
tcgccgatat cggagcggat acgctgtgta gtcttcagtt gtaagatttc ttatacagcg 480
acgcaaccat acatgtgacg gcccgtgctg gtctgacggg cggatagtac aggctttgcc 540
aaaagcctat aaggctaaag aaagtaaaca agtgaggttg aaccatgatg gcagtgttcg 600
aattctgatc aatgaagtac actgcgaagg gaatccccga aacggcgaac aaaaagaaca 660
tcagaggagg aacgccctcg caatcccgaa cataccagtt tcgcagaacc tggggtatca 720
actggatgca ccagcatact gttcccactg ttgccaatgc tgtagacgct ccattgttgt 780
cagtcatttt agcattttac agtaaccaac tccaaaaaac agcccgctct gctgggaaga 840
cttcgcaatt atttatccac tactgctgcg gttatatact tctcgatctc agtctcggtt 900
ataattgccg cttgacagcc tggagaaatt cggatactcc acgtgataat tgccataggg 960
cataattttc gaaacagctc gcaacgatct cggctagttt tccccttttt tgacccatat 1020
cgacgctgag actcactcac ttgatgccta ccgttagggt aaatttttca agcctgcaga 1080
atatcgcggg acgcagtctc ctgcacgcgc gtgacttcat cttacttaca tcaaacagcc 1140
cgattaattt gaaaagtcct agctgatcga gggcacgggc actactgtag agaaataata 1200
tgaagctgag ctatgaggag cgccgagaga ggctgccggc tgtagcagcc cggctattcg 1260
acatcattgt gagcaagcaa acaaatcttt gcgcaagctt ggatgtgcga actacctctg 1320
agttactgag tatcctggac cgcattggac cttacatttg tatggttaag acccacattg 1380
acataattga cgacttcgaa tacgacacaa ctgtcagcgg tttgaaacag ctttcaacga 1440
agcacaattt tctcattttt gaagaccgaa agttcgcaga catcggttcc actgttaagg 1500
cccaatatgc aggtggagtg tttaagatcg ctcaatgggc tgatataaca aatgctcacg 1560
gtgttcctgg gccgggaatt gtgagcggac tagaagaggc tgcgaaggaa actacggatg 1620
aacctcgcgg ccttgtcatg cttgcagaac tgagttcgaa gggcacactg gctcacggcg 1680
aatactcgca agcgacagta gacatcgctc gcagtaaccg cgcatttgtg tttggtttca 1740
tcgctcagca aaaagtcgga aagccagagg aagactgggt cattatgact cctggggtgg 1800
gcctggacga caaaggtgat ggattggggc agcagtatcg tactgtggac gacgtcatag 1860
agaccggcac agacgttatt atcgtcggac gcgggctcta tagcaaggga cgagatcctg 1920
tgcacgaagc tcagcgttac caaaaggcgg gctggaatgc atatctgaga aaagttcagt 1980
caagatgatt ttctcaaaca gttccttcaa tgcaacttgc acatgaatac ctataaaatc 2040
tgattaaatt accataaaag gtacagatta aaatatatat gccttcaatg gcatccttcg 2100
cgattctgat tcgtcagcac acttcaacct tcctactatg agtgacagtg atgatgatct 2160
gctggcattg gccgacgttg gctccgactc cgaagaggaa atctcgctgc cgtcgccgcc 2220
aagcaatgag gtcgtcaatc cctatcctct agaaggcaaa tatctcgatg ctgaagacag 2280
ggcgaagttg gacgcgctgc cagagattga gcgagaagag atcttgtatg accgagctca 2340
ggagatgcag cggtacgagg agagaaggta tcttgctcag cgaaggaagc agatgacgcg 2400
ggttgctgac gaggacgaag ccccctccgc caagcgtcaa cggggtacaa caggcgtctc 2460
ttcgggtacg aagtcatctc ttgaggcatt aaagaaacga agggcccagc agtctcggaa 2520
gtcctcacgc catggagttg atgacgatgt gtatagtgac gatgatgaaa catgtgcgat 2580
cgctaagcag tacgacgttg ttgctggagt ctcatctgca aggttgagta ccaatccctg 2640
ccccaatacg agcaatcgaa gccttgggga aagatgcggc gggctagctt cagcaataaa 2700
tagcaggcga cacacaaaaa ttaggcggca agcgcacgct cagcatgcca tctaccaggg 2760
caaaaagcaa ggcaacctct ttttcgcatc ccgatttaga gcctacccgt cattgcaggg 2820
tgtgcgtcta cgatataaca cgatcgacat cgcgctgggt atgcttctgg gtaaggggtc 2880
gcaacgtgtg agttgtcagc actggccgat acccaaagta tataatgcgc cgttgaacgg 2940
ttatagtcgg tcaagctctt aaagaaagac ttaacaacaa aaacaactct acacatatga 3000
gcccttcatc acacaaaccc ctgattctcg cttgcggctt gcctctttca ggccatataa 3060
tgcccgtttt gagtctggta cacggcctta cggacgacgg atacgaagct actgttgtga 3120
caggcagagc gtttgaacaa aaagttcgag atgtgggtgc agactttgtt cctttagaag 3180
ggaacgcaga ttttgatgac cacaccttag acgatctggt cccgggccgt aaagacatgg 3240
ccccaagctt cgatcgtaca gttcaagatg tggagcacat gatggtagct actcttcctg 3300
agcagtttgc cgctattcag agggctttca aaaagctcag cgcaagcggt cgccctgtcg 3360
ttcttgtcag tgaagtgctg tttttcggtg cacaccctat cagcctcggt gctcctggtt 3420
tcaaacccgc tggctggatt tgtttagggg ttttgcctct tttgatccgc agtgatcata 3480
ccttaggact tgacaacgac aggagccccg aagcacatgc aaagaaactc gctatgaacc 3540
acgctcttga gcaccaaatt ttcgttaaag ccactgctaa gcacaaggaa atctgccgag 3600
agttaggttg cactgaagat cccaaattta tctgggagca cagttacatt gctgcagaca 3660
agttcctgca gctgtgcccg ccttctcttg agttcagcag agaccatctg cctagcaact 3720
tcaaattcgc cggctcaacg cccaagcacc gaactcaatt cacccctcct tcctggtggg 3780
gggatgttct gagtgccaag cgagtcatca tggtcactca aggaactttt gctgtcagtt 3840
acaagcatct tattgtgcct actcttgagg ccttgaagga cgagcctgac actttaacag 3900
tagccatatt gggccgccgc ggtgccaagc taccggatga tgttgtggtt cctgagaatg 3960
ctcgcgtgat cgactacttc aactacgatg ctctacttcc tcacgttgat gctcttgtct 4020
acaatggtgg atatggcgga cttcagcaca gcttaagcca ctctgttcca gttgttattg 4080
ctggtgactc tgaagacaag ccaatggtgg catcgagagc tgaggccgct ggcgtggcaa 4140
ttgatttgaa aactggcttg cctacagtgg agcaaatcaa agaagctgtt gattcgataa 4200
ttggaaatcc gaaattccac gaagcctcga agaaggttca aatggagttg gaaagccaca 4260
actccttgaa aattcttgag gaaagcatcg aggaaatcgc cagccatgac tttggtcttt 4320
tgaccaagag tgacgaggaa actgaagata tacctgtcaa aggtccggcc ttagcggtga 4380
gttcttaggg ccggccattt ctcctaatag gctgtcagcg catatctgag gcgctcatat 4440
aaaacaatat aaatcaaaac ccatgttaaa aacttgttga tcccagcact tttgagaagc 4500
gcactccgaa ctaaatctaa aaacacttca gcttaagcta ttattgcctg attctcgtca 4560
tatcgctggg gcccgcgatc gcacgcgttc tgctataaat tgacggagtt tcgtacagtg 4620
cgctcgtaca gtgcgctgcc aaatacaatt tagtgtagcc agattggatg gttgaattgc 4680
tcttcacggt tgcacgctat tggcaaaaaa gagagagccg ctctgaactg gttcatccgc 4740
agctgacctt cgaaactctt taatatttaa taatattgca gcaaaatcta tagcttatgc 4800
cacatctata cggaagaggt attcaacatt agagcttgtg tcgcccattc tctacacgag 4860
cccacgcatc agcagtgagg ggcttgtagc tcgtgccctc taaccagtag attgtttgtc 4920
ctgctggggc gggaatctgc tggtttcgga attctttctt ctgaactttg ttgttgccgg 4980
tgatggtgac ggtgtcgacg aacttaatga atatcggcac ggcatagcgt ggcagccttt 5040
ccaaaagatg cttgccgagt ttatccatat ccagctgttt tctaggat 5088
<210> 72
<211> 4479
<212> DNA
<213> 人工的
<220>
<223> 敲除盒
<400> 72
ggacctgcgc cctaaaatgg gactttgtca aaaaaagaac ggcctcctgc gcgatggaga 60
gcaatcaaga attcggagtt ccgatgcgaa tccatcaaga aaacggcccc taggcaatct 120
aaaaccgtgg ccgacatact ataagtcaat tccgctgtac aaataacaag cgatcaatcc 180
ataatctgag gctcatttca tacggacttt tctaagttca cataattcta tgatgcatac 240
taacaaatac gatgcacaaa tgggtacaag gcctaaagag ggccacaatc gcgatttact 300
cgatacggca aatcagttcc acaagtaatt cgctatcgtc ggtgttgtta tacacctctc 360
ggcttgagtc aatatcgagc atgcaaggtt gacgcattct ggggaaatgt atccacgtga 420
tcgccgatat cggagcggat acgctgtgta gtcttcagtt gtaagatttc ttatacagcg 480
acgcaaccat acatgtgacg gcccgtgctg gtctgacggg cggatagtac aggctttgcc 540
aaaagcctat aaggctaaag aaagtaaaca agtgaggttg aaccatgatg gcagtgttcg 600
aattctgatc aatgaagtac actgcgaagg gaatccccga aacggcgaac aaaaagaaca 660
tcagaggagg aacgccctcg caatcccgaa cataccagtt tcgcagaacc tggggtatca 720
actggatgca ccagcatact gttcccactg ttgccaatgc tgtagacgct ccattgttgt 780
cagtcatttt agcattttac agtaaccaac tccaaaaaac agcccgctct gctgggaaga 840
cttcgcaatt atttatccac tactgctgcg gttatatact tctcgatctc agtctcggtt 900
ataattgccg cttgacagcc tggagaaatt cggatactcc acgtgataat tgccataggg 960
cataattttc gaaacagctc gcaacgatct cggctagttt tccccttttt tgacccatat 1020
cgacgctgag actcactcac ttgatgccta ccgttagggt aaatttttca agcctgcaga 1080
atatcgcggg acgcagtctc ctgcacgcgc gtgacttcat cttacttaca tcaaacagcc 1140
cgattaattt gaaaagtcct agctgatcga gggcacgggc actactgtag agaaataata 1200
tgaagctgag ctatgaggag cgccgagaga ggctgccggc tgtagcagcc cggctattcg 1260
acatcattgt gagcaagcaa acaaatcttt gcgcaagctt ggatgtgcga actacctctg 1320
agttactgag tatcctggac cgcattggac cttacatttg tatggttaag acccacattg 1380
acataattga cgacttcgaa tacgacacaa ctgtcagcgg tttgaaacag ctttcaacga 1440
agcacaattt tctcattttt gaagaccgaa agttcgcaga catcggttcc actgttaagg 1500
cccaatatgc aggtggagtg tttaagatcg ctcaatgggc tgatataaca aatgctcacg 1560
gtgttcctgg gccgggaatt gtgagcggac tagaagaggc tgcgaaggaa actacggatg 1620
aacctcgcgg ccttgtcatg cttgcagaac tgagttcgaa gggcacactg gctcacggcg 1680
aatactcgca agcgacagta gacatcgctc gcagtaaccg cgcatttgtg tttggtttca 1740
tcgctcagca aaaagtcgga aagccagagg aagactgggt cattatgact cctggggtgg 1800
gcctggacga caaaggtgat ggattggggc agcagtatcg tactgtggac gacgtcatag 1860
agaccggcac agacgttatt atcgtcggac gcgggctcta tagcaaggga cgagatcctg 1920
tgcacgaagc tcagcgttac caaaaggcgg gctggaatgc atatctgaga aaagttcagt 1980
caagatgatt ttctcaaaca gttccttcaa tgcaacttgc acatgaatac ctataaaatc 2040
tgattaaatt accataaaag gtacagatta aaatatatat gccttcaatg gcatccttcg 2100
cgattctgat tcgtcagcac acttcaacct tcctactatg agtgacagtg atgatgatct 2160
gctggcattg gccgacgttg gctccgactc cgaagaggaa atctcgctgc cgtcgccgcc 2220
aagcaatgag gtcgtcaatc cctatcctct agaaggcaaa tatctcgatg ctgaagacag 2280
ggcgaagttg gacgcgctgc cagagattga gcgagaagag atcttgtatg accgagctca 2340
ggagatgcag cggtacgagg agagaaggta tcttgctcag cgaaggaagc agatgacgcg 2400
ggttgctgac gaggacgaag ccccctccgc caagcgtcaa cggggtacaa caggcgtctc 2460
ttcgggtacg aagtcatctc ttgaggcatt aaagaaacga agggcccagc agtctcggaa 2520
gtcctcacgc catggagttg atgacgatgt gtatagtgac gatgatgaaa catgtgcgat 2580
cgctaagcag tacgacgttg ttgctggagt ctcatctgca aggttgagta ccaatccctg 2640
ccccaatacg agcaatcgaa gccttgggga aagatgcggc gggctagctt cagcaataaa 2700
tagcaggcga cacacaaaaa ttaggcggca agcgcacgct cagcatgcca tctaccaggg 2760
caaaaagcaa ggcaacctct ttttcgcatc ccgatttaga gcctacccgt cattgcaggg 2820
tgtgcgtcta cgatataaca cgatcgacat cgcgctgggt atgcttctgg gtaaggggtc 2880
gcaacgtgtg agttgtcagc actggccgat acccaaagta tataatgcgc cgttgaacgg 2940
ttatagtcgg tcaagctctt aaagaaagac ttaacaacaa aaacaactct acacatatgg 3000
ttgtaaactc ctcgaaggac cctcaaaaca aaggaatgac tcctagaaaa gaaattgacc 3060
aggaaatggt ctcttgggcc aaaaaaaacc tcaaaaacac ccctggcaat gaaaactatg 3120
agaagatggt ctcaggagtt ccttacaatc catacgatcc agatcttatg tttagagccc 3180
tggctactag tgagaaagtt agggagttca ataccattgc aagtgaaagt cgtacttttg 3240
agtcaaatca cgctgcttat atcaagaagg tcgagattct caaagacact tttggtcaaa 3300
caaaggatat tgtctggctg accgctccat tctcagttga ttttggattc aacatcagcg 3360
taggcgagca cttttacgcc aacttcaacg tttgcttctt ggactcggct ccaataatct 3420
ttggtgatga ggtgattgta gggcccaata caacgttcgt gactgcgact catcctatta 3480
gccccgagaa acgtgcgagg agaattgtgt atgctcttcc tatcaaggtg gggaataatg 3540
tatggattgg tgcgaatgtg actgtcctgc cgggtgttac gattggagat ggctcaacaa 3600
ttgcggctgg tgctgtcgtt cgagaagatg ttcctcctcg tactgtggtg ggaggagtcc 3660
ctgcgcgaat cctcaagcat attccagagg aggatcccga cgaggctgaa ggagaggaac 3720
tggaattcct tcttccagtt gaaatgaacg tcaataccgc taaccagaag gtctaggtag 3780
gccggccatt tctcctaata ggctgtcagc gcatatctga ggcgctcata taaaacaata 3840
taaatcaaaa cccatgttaa aaacttgttg atcccagcac ttttgagaag cgcactccga 3900
actaaatcta aaaacacttc agcttaagct attattgcct gattctcgtc atatcgctgg 3960
ggcccgcgat cgcacgcgtt ctgctataaa ttgacggagt ttcgtacagt gcgctcgtac 4020
agtgcgctgc caaatacaat ttagtgtagc cagattggat ggttgaattg ctcttcacgg 4080
ttgcacgcta ttggcaaaaa agagagagcc gctctgaact ggttcatccg cagctgacct 4140
tcgaaactct ttaatattta ataatattgc agcaaaatct atagcttatg ccacatctat 4200
acggaagagg tattcaacat tagagcttgt gtcgcccatt ctctacacga gcccacgcat 4260
cagcagtgag gggcttgtag ctcgtgccct ctaaccagta gattgtttgt cctgctgggg 4320
cgggaatctg ctggtttcgg aattctttct tctgaacttt gttgttgccg gtgatggtga 4380
cggtgtcgac gaacttaatg aatatcggca cggcatagcg tggcagcctt tccaaaagat 4440
gcttgccgag tttatccata tccagctgtt ttctaggat 4479
<210> 73
<211> 7596
<212> DNA
<213> 人工的
<220>
<223> 敲除盒
<400> 73
ggacctgcgc cctaaaatgg gactttgtca aaaaaagaac ggcctcctgc gcgatggaga 60
gcaatcaaga attcggagtt ccgatgcgaa tccatcaaga aaacggcccc taggcaatct 120
aaaaccgtgg ccgacatact ataagtcaat tccgctgtac aaataacaag cgatcaatcc 180
ataatctgag gctcatttca tacggacttt tctaagttca cataattcta tgatgcatac 240
taacaaatac gatgcacaaa tgggtacaag gcctaaagag ggccacaatc gcgatttact 300
cgatacggca aatcagttcc acaagtaatt cgctatcgtc ggtgttgtta tacacctctc 360
ggcttgagtc aatatcgagc atgcaaggtt gacgcattct ggggaaatgt atccacgtga 420
tcgccgatat cggagcggat acgctgtgta gtcttcagtt gtaagatttc ttatacagcg 480
acgcaaccat acatgtgacg gcccgtgctg gtctgacggg cggatagtac aggctttgcc 540
aaaagcctat aaggctaaag aaagtaaaca agtgaggttg aaccatgatg gcagtgttcg 600
aattctgatc aatgaagtac actgcgaagg gaatccccga aacggcgaac aaaaagaaca 660
tcagaggagg aacgccctcg caatcccgaa cataccagtt tcgcagaacc tggggtatca 720
actggatgca ccagcatact gttcccactg ttgccaatgc tgtagacgct ccattgttgt 780
cagtcatttt agcattttac agtaaccaac tccaaaaaac agcccgctct gctgggaaga 840
cttcgcaatt atttatccac tactgctgcg gttatatact tctcgatctc agtctcggtt 900
ataattgccg cttgacagcc tggagaaatt cggatactcc acgtgataat tgccataggg 960
cataattttc gaaacagctc gcaacgatct cggctagttt tccccttttt tgacccatat 1020
cgacgctgag actcactcac ttgatgccta ccgttagggt aaatttttca agcctgcaga 1080
atatcgcggg acgcagtctc ctgcacgcgc gtgacttcat cttacttaca tcaaacagcc 1140
cgattaattt gaaaagtcct agctgatcga gggcacgggc actactgtag agaaataata 1200
tgaagctgag ctatgaggag cgccgagaga ggctgccggc tgtagcagcc cggctattcg 1260
acatcattgt gagcaagcaa acaaatcttt gcgcaagctt ggatgtgcga actacctctg 1320
agttactgag tatcctggac cgcattggac cttacatttg tatggttaag acccacattg 1380
acataattga cgacttcgaa tacgacacaa ctgtcagcgg tttgaaacag ctttcaacga 1440
agcacaattt tctcattttt gaagaccgaa agttcgcaga catcggttcc actgttaagg 1500
cccaatatgc aggtggagtg tttaagatcg ctcaatgggc tgatataaca aatgctcacg 1560
gtgttcctgg gccgggaatt gtgagcggac tagaagaggc tgcgaaggaa actacggatg 1620
aacctcgcgg ccttgtcatg cttgcagaac tgagttcgaa gggcacactg gctcacggcg 1680
aatactcgca agcgacagta gacatcgctc gcagtaaccg cgcatttgtg tttggtttca 1740
tcgctcagca aaaagtcgga aagccagagg aagactgggt cattatgact cctggggtgg 1800
gcctggacga caaaggtgat ggattggggc agcagtatcg tactgtggac gacgtcatag 1860
agaccggcac agacgttatt atcgtcggac gcgggctcta tagcaaggga cgagatcctg 1920
tgcacgaagc tcagcgttac caaaaggcgg gctggaatgc atatctgaga aaagttcagt 1980
caagatgatt ttctcaaaca gttccttcaa tgcaacttgc acatgaatac ctataaaatc 2040
tgattaaatt accataaaag gtacagatta aaatatatat gccttcaatg gcatccttcg 2100
cgattctgat tcgtcagcac acttcaacct tcctactatg agtgacagtg atgatgatct 2160
gctggcattg gccgacgttg gctccgactc cgaagaggaa atctcgctgc cgtcgccgcc 2220
aagcaatgag gtcgtcaatc cctatcctct agaaggcaaa tatctcgatg ctgaagacag 2280
ggcgaagttg gacgcgctgc cagagattga gcgagaagag atcttgtatg accgagctca 2340
ggagatgcag cggtacgagg agagaaggta tcttgctcag cgaaggaagc agatgacgcg 2400
ggttgctgac gaggacgaag ccccctccgc caagcgtcaa cggggtacaa caggcgtctc 2460
ttcgggtacg aagtcatctc ttgaggcatt aaagaaacga agggcccagc agtctcggaa 2520
gtcctcacgc catggagttg atgacgatgt gtatagtgac gatgatgaaa catgtgcgat 2580
cgctaagcag tacgacgttg ttgctggagt ctcatctgca aggttgagta ccaatccctg 2640
ccccaatacg agcaatcgaa gccttgggga aagatgcggc gggctagctt cagcaataaa 2700
tagcaggcga cacacaaaaa ttaggcggca agcgcacgct cagcatgcca tctaccaggg 2760
caaaaagcaa ggcaacctct ttttcgcatc ccgatttaga gcctacccgt cattgcaggg 2820
tgtgcgtcta cgatataaca cgatcgacat cgcgctgggt atgcttctgg gtaaggggtc 2880
gcaacgtgtg agttgtcagc actggccgat acccaaagta tataatgcgc cgttgaacgg 2940
ttatagtcgg tcaagctctt aaagaaagac ttaacaacaa aaacaactct acacatatgg 3000
tggatgatat acaggtagag aagcgtgaga aactcatcga gactaaggac aagcttctcg 3060
aggagaagct ctctgcgtta gatccacatg aggccaatgt attgcgaagt cagcttgaaa 3120
caaagagagt cgccacaagc tttttcaggt tgttcagatt ttgcactccc cttgacgttt 3180
tcttggagat acttgcgctt ttttttgcag cggtgcatgg agccgcgctt ccaatgttca 3240
cgttagtagt gggcgccatc ttcaacacat tcagagactt cactagctat gacctcaagg 3300
gcaatgagtt ccagcataag gtgaatcacc tgtctctcta ttttgtctat attggcattg 3360
gtatgctcgg cagtgcgttt ctcgagagct tcctgcttgt ggacagaggc gaagtgttgg 3420
caggacgtta ccgaaagcat tatctgagtg ctgttattcg ccagaatatc gcgttttacg 3480
acaaactagg tggtggcgag gtcagcacca gaatcattaa cgataccaac tcaattcagg 3540
aagcgatcag cgacaagctt ggaaacgtcg tacagggaat agcttccttc attgcggcca 3600
ccgttataag ttttgcttcg caatggaaac tggcttgcat cctcctgagt gctgtagggt 3660
tcatggtaat cacaatggga actggcgcca ccttcatggc caaatatcag ctcagatctg 3720
acgcgatata ttcgcagtct ggagctaccg ttgcggagga ggctctcagt gctgtcagga 3780
ctacagtagc atttggcgct caacctcatc tcgccgtcaa gtatgaaaag gtacttgatc 3840
gtgttgtgaa ggaatcgaag cggagcagtt actcattggg ggtcatgtta gcgtgcattt 3900
gggctagtac tttttgggtg tatgccttag ctctgtggca gggttccaga gaaatcgtta 3960
gtgggagtgc tgacgttgga aagataatag ttgtaatcac agctatgtta cttggaagct 4020
tccagcttgg gaatatcgcg ccaaacgtga ggtttcttgt caagggtctc actgccgcga 4080
gcattctcaa tgaggccatt gatcgtgtcc cagtcatcga tggccagtcc atagataaag 4140
gaattgtccc ccaaactaag gccgttggca gaattgagct caaaaatgtc aagttccgat 4200
atcctagtcg cccagacgtt ttggtcctct ccgattttag ccttgaagtt cctgctggat 4260
ctactgtggc actggtaggt gcctcgggat cagggaagtc tacaattgta ggtattcttg 4320
agaggttcta tttacctctc gaaggaagcg ttactctgga tggccaggag attagcgacc 4380
tgaacacaag atggctccgt caacaaattg gttatgttca gcaggaacca gtactctttt 4440
cagagtcaat atatgagaat atcagctatg gtttgattgg cactgacatt gagttcgctg 4500
acgagcatgt taaggaagct aaaatcattc aagcttgtaa agatgccaat gcctgggatt 4560
tcattcagac tctctcagaa ggcatccaaa ccaatgttgg agatcgagga tttcttctca 4620
gcggtggtca gaaacaacgc attgcaatag caagagcaat cgtctcagac cctaaaattc 4680
tgctgctcga tgaagcgact tctgctctgg ataccaaatc tgaaggtatc gttcaagatg 4740
cgctcgacaa agcggccgaa ggtcgtacca ctatagtcgt tgcacacaga ctctctacga 4800
tcaaggatgc caacaagata gttgtcatgt ctaaaggtaa cgtcatagag cagggtactc 4860
acaatgagct catacagcga gaagggcctt ataaagcttt ggttgatgct caaagagtaa 4920
ctaaagcaaa gagcactaac gttgaggtcc tcgatattga agctctagac atttcgcctc 4980
tggactcact gaacgaaaag ttcaatccca aggatgtgag cacattgagt gttcacagtg 5040
caggtactca gaccactcaa cctcctgaat atcaagaaaa tgacatccct ggtgtgcgca 5100
accccccaca tagcacgttg atgaccaata ccaaactggt ttgggggctg aataggaaag 5160
aatggggtta cattctcatt ggtagtttag cctccattat tttgggctat tgctatcctg 5220
caatggcaat aataactggc caaaccactg gaagcatggt tctacctccc agtgaatacg 5280
gaaaaatgcg gcatgtggtg aatatcatgg gatggtggta ttttttcgta ggctgcattt 5340
cattcatgac ggcttttatc actatagctg ctttatcact tgcatctgat aagttggtca 5400
aaaatatcag attagctttg ttccgccaat tgatgcgaat ggatattgca ttcttcgacc 5460
acaaaaacaa cacgccgggt gcgctaacct caattttggc gaaggaagct aaaatgatcg 5520
agggtttgag tggggccacc ctcggtcaaa ttcaacagag tctggtgacc ttgattggcg 5580
gcatagttac tggtatacct ttcaattgga gaattggact cgtggctacg tctgttgttc 5640
ctgtcatgtt ggtgtgtggc ttcgtcagag tctgggttct tacccaatta tcggatcgtg 5700
cgagagaagt ttacgaacga agtggctcca tggcatctga gtatacaagt gctgtccgca 5760
cagtccagtc cttaactcgt gagttagacg tggtcgtaaa atacacaaag acagtagact 5820
ctcagatttt cagctccaga attgccattg cccgctcagc attgtactac gcactctcgg 5880
aaggaatgac accctgggtg gtagccctcg ttttttggtg gggaagcact gtaatgagac 5940
gaggtgaagc ttcggtcgca ggatatatga ctgtcttcat ggctattatt acaggttctc 6000
aagccgctgg ccaaattttc agctatgctc caaacatgaa ctcagccaaa gatgcagcgc 6060
gtaacattta cagaatcttg actgccactc cttctataga tgtatggagt gaggaaggtt 6120
acgttgctcc cgaggagtcg gtgagaggag atattgagtt ccgtcatgtg aatttccgat 6180
atcctactcg acctcaagta ccagttttac aagatctcaa cttaacagtc aaaaagggcc 6240
aatacatcgc tctagttgga gccagtggat gcggtaagtc tactactatt ggactggtgg 6300
aaagatttta tgatccatta gcaggtcaag tacttttcga tgggaaagat ttacgcgaat 6360
ataacctgaa tgcattgaga tcacacattg ctttagtcca gcaagaacca atgctttatt 6420
caggcacgct acgtgagaat attctaatgg gatggtctgg ccctgagtct gaagtaacgc 6480
aggagatgat tgaggatgcc gctcgcaaag cgaacattca cgaattcatc atgtcgttgc 6540
ctgatggcta cgaaacgctc agcggatcta ggggatcgtt gctatctggg gggcaaaagc 6600
agcgaattgc aattgcaagg gccctgatca gaaatccaaa ggtactcctc ctcgatgagg 6660
ccacctcagc tctggattcc gaatctgaga aagtagttca agcagcactc gacgcagcag 6720
cgaagggccg tactacaatc gccgttgcgc atagattatc aacaattcag aaagcagatg 6780
tcatatatgt gttctcagga gggcgcatcg tggagcaggg cgaccatcag agcctccttg 6840
aactcaatgg atggtacgct gaattggtga acttgcaagg tctcggagag atttgaggcc 6900
ggccatttct cctaataggc tgtcagcgca tatctgaggc gctcatataa aacaatataa 6960
atcaaaaccc atgttaaaaa cttgttgatc ccagcacttt tgagaagcgc actccgaact 7020
aaatctaaaa acacttcagc ttaagctatt attgcctgat tctcgtcata tcgctggggc 7080
ccgcgatcgc acgcgttctg ctataaattg acggagtttc gtacagtgcg ctcgtacagt 7140
gcgctgccaa atacaattta gtgtagccag attggatggt tgaattgctc ttcacggttg 7200
cacgctattg gcaaaaaaga gagagccgct ctgaactggt tcatccgcag ctgaccttcg 7260
aaactcttta atatttaata atattgcagc aaaatctata gcttatgcca catctatacg 7320
gaagaggtat tcaacattag agcttgtgtc gcccattctc tacacgagcc cacgcatcag 7380
cagtgagggg cttgtagctc gtgccctcta accagtagat tgtttgtcct gctggggcgg 7440
gaatctgctg gtttcggaat tctttcttct gaactttgtt gttgccggtg atggtgacgg 7500
tgtcgacgaa cttaatgaat atcggcacgg catagcgtgg cagcctttcc aaaagatgct 7560
tgccgagttt atccatatcc agctgttttc taggat 7596
<210> 74
<211> 4998
<212> DNA
<213> 人工的
<220>
<223> 敲除盒
<400> 74
ggacctgcgc cctaaaatgg gactttgtca aaaaaagaac ggcctcctgc gcgatggaga 60
gcaatcaaga attcggagtt ccgatgcgaa tccatcaaga aaacggcccc taggcaatct 120
aaaaccgtgg ccgacatact ataagtcaat tccgctgtac aaataacaag cgatcaatcc 180
ataatctgag gctcatttca tacggacttt tctaagttca cataattcta tgatgcatac 240
taacaaatac gatgcacaaa tgggtacaag gcctaaagag ggccacaatc gcgatttact 300
cgatacggca aatcagttcc acaagtaatt cgctatcgtc ggtgttgtta tacacctctc 360
ggcttgagtc aatatcgagc atgcaaggtt gacgcattct ggggaaatgt atccacgtga 420
tcgccgatat cggagcggat acgctgtgta gtcttcagtt gtaagatttc ttatacagcg 480
acgcaaccat acatgtgacg gcccgtgctg gtctgacggg cggatagtac aggctttgcc 540
aaaagcctat aaggctaaag aaagtaaaca agtgaggttg aaccatgatg gcagtgttcg 600
aattctgatc aatgaagtac actgcgaagg gaatccccga aacggcgaac aaaaagaaca 660
tcagaggagg aacgccctcg caatcccgaa cataccagtt tcgcagaacc tggggtatca 720
actggatgca ccagcatact gttcccactg ttgccaatgc tgtagacgct ccattgttgt 780
cagtcatttt agcattttac agtaaccaac tccaaaaaac agcccgctct gctgggaaga 840
cttcgcaatt atttatccac tactgctgcg gttatatact tctcgatctc agtctcggtt 900
ataattgccg cttgacagcc tggagaaatt cggatactcc acgtgataat tgccataggg 960
cataattttc gaaacagctc gcaacgatct cggctagttt tccccttttt tgacccatat 1020
cgacgctgag actcactcac ttgatgccta ccgttagggt aaatttttca agcctgcaga 1080
atatcgcggg acgcagtctc ctgcacgcgc gtgacttcat cttacttaca tcaaacagcc 1140
cgattaattt gaaaagtcct agctgatcga gggcacgggc actactgtag agaaataata 1200
tgaagctgag ctatgaggag cgccgagaga ggctgccggc tgtagcagcc cggctattcg 1260
acatcattgt gagcaagcaa acaaatcttt gcgcaagctt ggatgtgcga actacctctg 1320
agttactgag tatcctggac cgcattggac cttacatttg tatggttaag acccacattg 1380
acataattga cgacttcgaa tacgacacaa ctgtcagcgg tttgaaacag ctttcaacga 1440
agcacaattt tctcattttt gaagaccgaa agttcgcaga catcggttcc actgttaagg 1500
cccaatatgc aggtggagtg tttaagatcg ctcaatgggc tgatataaca aatgctcacg 1560
gtgttcctgg gccgggaatt gtgagcggac tagaagaggc tgcgaaggaa actacggatg 1620
aacctcgcgg ccttgtcatg cttgcagaac tgagttcgaa gggcacactg gctcacggcg 1680
aatactcgca agcgacagta gacatcgctc gcagtaaccg cgcatttgtg tttggtttca 1740
tcgctcagca aaaagtcgga aagccagagg aagactgggt cattatgact cctggggtgg 1800
gcctggacga caaaggtgat ggattggggc agcagtatcg tactgtggac gacgtcatag 1860
agaccggcac agacgttatt atcgtcggac gcgggctcta tagcaaggga cgagatcctg 1920
tgcacgaagc tcagcgttac caaaaggcgg gctggaatgc atatctgaga aaagttcagt 1980
caagatgatt ttctcaaaca gttccttcaa tgcaacttgc acatgaatac ctataaaatc 2040
tgattaaatt accataaaag gtacagatta aaatatatat gccttcaatg gcatccttcg 2100
cgattctgat tcgtcagcac acttcaacct tcctactatg agtgacagtg atgatgatct 2160
gctggcattg gccgacgttg gctccgactc cgaagaggaa atctcgctgc cgtcgccgcc 2220
aagcaatgag gtcgtcaatc cctatcctct agaaggcaaa tatctcgatg ctgaagacag 2280
ggcgaagttg gacgcgctgc cagagattga gcgagaagag atcttgtatg accgagctca 2340
ggagatgcag cggtacgagg agagaaggta tcttgctcag cgaaggaagc agatgacgcg 2400
ggttgctgac gaggacgaag ccccctccgc caagcgtcaa cggggtacaa caggcgtctc 2460
ttcgggtacg aagtcatctc ttgaggcatt aaagaaacga agggcccagc agtctcggaa 2520
gtcctcacgc catggagttg atgacgatgt gtatagtgac gatgatgaaa catgtgcgat 2580
cgctaagcag tacgacgttg ttgctggagt ctcatctgca aggttgagta ccaatccctg 2640
ccccaatacg agcaatcgaa gccttgggga aagatgcggc gggctagctt cagcaataaa 2700
tagcaggcga cacacaaaaa ttaggcggca agcgcacgct cagcatgcca tctaccaggg 2760
caaaaagcaa ggcaacctct ttttcgcatc ccgatttaga gcctacccgt cattgcaggg 2820
tgtgcgtcta cgatataaca cgatcgacat cgcgctgggt atgcttctgg gtaaggggtc 2880
gcaacgtgtg agttgtcagc actggccgat acccaaagta tataatgcgc cgttgaacgg 2940
ttatagtcgg tcaagctctt aaagaaagac ttaacaacaa aaacaactct acacatatgg 3000
ccatcgagaa accagtgata gttgcttgtg cctgcccact agcggggcac gtgggcccag 3060
tgctcagcct ggtccgcggt ctactcaata gaggatatga ggtgactttc gtaacaggga 3120
acgcattcaa ggagaaagtt attgaggcag gatgcacttt cgtccctctc caaggacgag 3180
ctgactacca tgaatacaat ctccctgaaa tcgctccagg attgctcacg attcctccag 3240
gccttgagca gaccggttac tcaatgaatg agatttttgt gaaggcgatt cctgagcagt 3300
acgatgcact tcaaactgct ctaaaacagg ttgaggctga aaataaatca gctgtggtga 3360
ttggcgagac catgtttcta ggggtgcatc cgatatcact gggtgcccca ggtctcaagc 3420
cccaaggcgt aatcacgtta ggaactattc cgtgcatgct gaaagcagag aaggcgcctg 3480
gagttcctag tcttgagcca atgattgata ctttagtgcg gcaacaagta tttcaaccag 3540
gaactgactc tgagaaggag atcatgaaga cgctcggggc cacgaaggag cccgaatttc 3600
tcctggagaa tatatacagc agccctgaca gatttttgca actgtgccct ccatctcttg 3660
aatttcactt gacttcgcct cctcctggct tctcgttcgc tggtagtgca ccgcatgtaa 3720
agtctgctgg attagcaact ccacctcacc tgccgtcttg gtggcctgat gtgctgagtg 3780
cgaagcgtct gattgttgtt acacaaggaa cagcagccat caactatgaa gatctgctca 3840
ttccagcatt gcaggccttt gctgacgaag aagacactct cgtagttggt atattgggcg 3900
tcaaaggggc gtcacttcct gatagcgtta aagttcctgc aaacgctcga attgttgatt 3960
attttcctta cgatgagcta ctaccgcatg cctctgtttt catatacaac ggtggatacg 4020
gaggtctgca gcacagtttg agccatggcg ttcccgtcat catcggagga ggaatgttgg 4080
tagacaagcc agctgttgct tcacgagctg tatgggctgg tgttggttat gatcttcaaa 4140
ccttgcaggc aacttctgag ctagtctcca cggccgttaa ggaggtgttg gctactccct 4200
cgtatcacga gaaagccatg gcagtcaaga aagagcttga aaaatacaag tctcttgata 4260
ttctagagtc ggcaattagt gaattagctt cttaacctgg ccggccattt ctcctaatag 4320
gctgtcagcg catatctgag gcgctcatat aaaacaatat aaatcaaaac ccatgttaaa 4380
aacttgttga tcccagcact tttgagaagc gcactccgaa ctaaatctaa aaacacttca 4440
gcttaagcta ttattgcctg attctcgtca tatcgctggg gcccgcgatc gcacgcgttc 4500
tgctataaat tgacggagtt tcgtacagtg cgctcgtaca gtgcgctgcc aaatacaatt 4560
tagtgtagcc agattggatg gttgaattgc tcttcacggt tgcacgctat tggcaaaaaa 4620
gagagagccg ctctgaactg gttcatccgc agctgacctt cgaaactctt taatatttaa 4680
taatattgca gcaaaatcta tagcttatgc cacatctata cggaagaggt attcaacatt 4740
agagcttgtg tcgcccattc tctacacgag cccacgcatc agcagtgagg ggcttgtagc 4800
tcgtgccctc taaccagtag attgtttgtc ctgctggggc gggaatctgc tggtttcgga 4860
attctttctt ctgaactttg ttgttgccgg tgatggtgac ggtgtcgacg aacttaatga 4920
atatcggcac ggcatagcgt ggcagccttt ccaaaagatg cttgccgagt ttatccatat 4980
ccagctgttt tctaggat 4998
<210> 75
<211> 3732
<212> DNA
<213> 人工的
<220>
<223> 整合盒
<400> 75
gcggccgctt attttttgga cctgcgccct aaaatgggac tttgtcaaaa aaagaacggc 60
ctcctgcgcg atggagagca atcaagaatt cggagttccg atgcgaatcc atcaagaaaa 120
cggcccctag gcaatctaaa accgtggccg acatactata agtcaattcc gctgtacaaa 180
taacaagcga tcaatccata atctgaggct catttcatac ggacttttct aagttcacat 240
aattctatga tgcatactaa caaatacgat gcacaaatgg gtacaaggcc taaagagggc 300
cacaatcgcg atttactcga tacggcaaat cagttccaca agtaattcgc tatcgtcggt 360
gttgttatac acctctcggc ttgagtcaat atcgagcatg caaggttgac gcattctggg 420
gaaatgtatc cacgtgatcg ccgatatcgg agcggatacg ctgtgtagtc ttcagttgta 480
agatttctta tacagcgacg caaccataca tgtgacggcc cgtgctggtc tgacgggcgg 540
atagtacagg ctttgccaaa agcctataag gctaaagaaa gtaaacaagt gaggttgaac 600
catgatggca gtgttcgaat tctgatcaat gaagtacact gcgaagggaa tccccgaaac 660
ggcgaacaaa aagaacatca gaggaggaac gccctcgcaa tcccgaacat accagtttcg 720
cagaacctgg ggtatcaact ggatgcacca gcatactgtt cccactgttg ccaatgctgt 780
agacgctcca ttgttgtcag tcattttagc attttacagt aaccaactcc aaaaaacagc 840
ccgctctgct gggaagactt cgcaattatt tatccactac tgctgcggtt atatacttct 900
cgatctcagt ctcggttata attgccgctt gacagcctgg agaaattcgg atactccacg 960
tgataattgc catagggcat aattttcgaa acagctcgca acgatctcgg ctagttttcc 1020
ccttttttga cccatatcga cgctgagact cactcacttg atgcctaccg ttagggtaaa 1080
tttttcaagc ctgcagaata tcgcgggacg cagtctcctg cacgcgcgtg acttcatctt 1140
acttacatca aacagcccga ttaatttgaa aagtcctagc tgatcgaggg cacgggcact 1200
actgtagaga aataatatga agctgagcta tgaggagcgc cgagagaggc tgccggctgt 1260
agcagcccgg ctattcgaca tcattgtgag caagcaaaca aatctttgcg caagcttgga 1320
tgtgcgaact acctctgagt tactgagtat cctggaccgc attggacctt acatttgtat 1380
ggttaagacc cacattgaca taattgacga cttcgaatac gacacaactg tcagcggttt 1440
gaaacagctt tcaacgaagc acaattttct catttttgaa gaccgaaagt tcgcagacat 1500
cggttccact gttaaggccc aatatgcagg tggagtgttt aagatcgctc aatgggctga 1560
tataacaaat gctcacggtg ttcctgggcc gggaattgtg agcggactag aagaggctgc 1620
gaaggaaact acggatgaac ctcgcggcct tgtcatgctt gcagaactga gttcgaaggg 1680
cacactggct cacggcgaat actcgcaagc gacagtagac atcgctcgca gtaaccgcgc 1740
atttgtgttt ggtttcatcg ctcagcaaaa agtcggaaag ccagaggaag actgggtcat 1800
tatgactcct ggggtgggcc tggacgacaa aggtgatgga ttggggcagc agtatcgtac 1860
tgtggacgac gtcatagaga ccggcacaga cgttattatc gtcggacgcg ggctctatag 1920
caagggacga gatcctgtgc acgaagctca gcgttaccaa aaggcgggct ggaatgcata 1980
tctgagaaaa gttcagtcaa gatgattttc tcaaacagtt ccttcaatgc aacttgcaca 2040
tgaataccta taaaatctga ttaaattacc ataaaaggta cagattaaaa tatatatgcc 2100
ttcaatggca tccttcgcga ttctgattcg tcagcacact tcaaccttcc tactatgagt 2160
gacagtgatg atgatctgct ggcattggcc gacgttggct ccgactccga agaggaaatc 2220
tcgctgccgt cgccgccaag caatgaggtc gtcaatccct atcctctaga aggcaaatat 2280
ctcgatgctg aagacagggc gaagttggac gcgctgccag agattgagcg agaagagatc 2340
ttgtatgacc gagctcagga gatgcagcgg tacgaggaga gaaggtatct tgctcagcga 2400
aggaagcaga tgacgcgggt tgctgacgag gacgaagccc cctccgccaa gcgtcaacgg 2460
ggtacaacag gcgtctcttc gggtacgaag tcatctcttg aggcattaaa gaaacgaagg 2520
gcccagcagt ctcggaagtc ctcacgccat ggagttgatg acgatgtgta tagtgacgat 2580
gatgaaacat gtgcgatcgc taagcagtac gacgttgttg ctggagtctc atctgcaagg 2640
ttgagtacca atccctgccc caatacgagc aatcgaagcc ttggggaaag atgcggcggg 2700
ctagcttcag caataaatag caggcgacac acaaaaatta ggcggcaagc gcacgctcag 2760
catgccatct accagggcaa aaagcaaggc aacctctttt tcgcatcccg atttagagcc 2820
tacccgtcat tgcagggtgt gcgtctacga tataacacga tcgacatcgc gctgggtatg 2880
cttctgggta aggggtcgca acgtgtgagt tgtcagcact ggccgatacc caaagtatat 2940
aatgcgccgt tgaacggtta tagtcggtca agctcttaaa gaaagactta acaacaaaaa 3000
caactctaca catatggact tgtaggccgg ccatttctcc taataggctg tcagcgcata 3060
tctgaggcgc tcatataaaa caatataaat caaaacccat gttaaaaact tgttgatccc 3120
agcacttttg agaagcgcac tccgaactaa atctaaaaac acttcagctt aagctattat 3180
tgcctgattc tcgtcatatc gctggggccc gcgatcgcac gcgttctgct ataaattgac 3240
ggagtttcgt acagtgcgct cgtacagtgc gctgccaaat acaatttagt gtagccagat 3300
tggatggttg aattgctctt cacggttgca cgctattggc aaaaaagaga gagccgctct 3360
gaactggttc atccgcagct gaccttcgaa actctttaat atttaataat attgcagcaa 3420
aatctatagc ttatgccaca tctatacgga agaggtattc aacattagag cttgtgtcgc 3480
ccattctcta cacgagccca cgcatcagca gtgaggggct tgtagctcgt gccctctaac 3540
cagtagattg tttgtcctgc tggggcggga atctgctggt ttcggaattc tttcttctga 3600
actttgttgt tgccggtgat ggtgacggtg tcgacgaact taatgaatat cggcacggca 3660
tagcgtggca gcctttccaa aagatgcttg ccgagtttat ccatatccag ctgttttcta 3720
ggatcctgca gg 3732
<210> 76
<211> 50
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 76
aaacgtctca gatgcaccac caccaccacc acatggttgt aaactcctcg 50
<210> 77
<211> 30
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 77
aaaggcgcgc cctagacctt ctggttagcg 30
<210> 78
<211> 6700
<212> DNA
<213> 人工的
<220>
<223> 载体
<400> 78
tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540
tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat 600
tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa 660
actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc 720
gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga 780
aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc 840
agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac 900
cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960
aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020
tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080
tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 1140
taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200
ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 1260
tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca 1320
tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac 1380
cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 1440
cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 1500
gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 1560
gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 1620
agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 1680
aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 1740
agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 1800
cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 1860
accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 1920
aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 1980
ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 2040
cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 2100
gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 2160
tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 2220
agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg 2280
tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta 2340
caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 2400
ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 2460
gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 2520
gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 2580
gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 2640
aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 2700
ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 2760
acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 2820
ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 2880
tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 2940
tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3000
cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3060
gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3120
ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggggccgc 3180
catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa 3240
ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc 3300
gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac 3360
gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca 3420
ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc ccggtgccta 3480
atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa 3540
cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 3600
tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga ttgcccttca 3660
ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc agcaggcgaa 3720
aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt 3780
atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg gcgcgcattg 3840
cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg ccctcattca 3900
gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc cgttccgcta 3960
tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc agacgcgccg 4020
agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat gcgaccagat 4080
gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg atgggtgtct 4140
ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc acagcaatgg 4200
catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc gcgagaagat 4260
tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac accaccacgc 4320
tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca 4380
gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg 4440
ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt 4500
tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg 4560
catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct 4620
cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga 4680
tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 4740
ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 4800
ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 4860
cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 4920
gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgagatctc gatcccgcga 4980
aattaatacg actcactata ggggaattgt gagcggataa caattcccct ctagaaataa 5040
ttttgtttaa ctttaagaag gagatatacg atgcaccacc accaccacca catgagccct 5100
tcatcacaca aacccctgat tctcgcttgc ggcttgcctc tttcaggcca tataatgccc 5160
gttttgagtc tggtacacgg ccttacggac gacggatacg aagctactgt tgtgacaggc 5220
agagcgtttg aacaaaaagt tcgagatgtg ggtgcagact ttgttccttt agaagggaac 5280
gcagattttg atgaccacac cttagacgat ctggtcccgg gccgtaaaga catggcccca 5340
agcttcgatc gtacagttca agatgtggag cacatgatgg tagctactct tcctgagcag 5400
tttgccgcta ttcagagggc tttcaaaaag ctcagcgcaa gcggtcgccc tgtcgttctt 5460
gtcagtgaag tgctgttttt cggtgcacac cctatcagcc tcggtgctcc tggtttcaaa 5520
cccgctggct ggatttgttt aggggttttg cctcttttga tccgcagtga tcatacctta 5580
ggacttgaca acgacaggag ccccgaagca catgcaaaga aactcgctat gaaccacgct 5640
cttgagcacc aaattttcgt taaagccact gctaagcaca aggaaatctg ccgagagtta 5700
ggttgcactg aagatcccaa atttatctgg gagcacagtt acattgctgc agacaagttc 5760
ctgcagctgt gcccgccttc tcttgagttc agcagagacc atctgcctag caacttcaaa 5820
ttcgccggct caacgcccaa gcaccgaact caattcaccc ctccttcctg gtggggggat 5880
gttctgagtg ccaagcgagt catcatggtc actcaaggaa cttttgctgt cagttacaag 5940
catcttattg tgcctactct tgaggccttg aaggacgagc ctgacacttt aacagtagcc 6000
atattgggcc gccgcggtgc caagctaccg gatgatgttg tggttcctga gaatgctcgc 6060
gtgatcgact acttcaacta cgatgctcta cttcctcacg ttgatgctct tgtctacaat 6120
ggtggatatg gcggacttca gcacagctta agccactctg ttccagttgt tattgctggt 6180
gactctgaag acaagccaat ggtggcatcg agagctgagg ccgctggcgt ggcaattgat 6240
ttgaaaactg gcttgcctac agtggagcaa atcaaagaag ctgttgattc gataattgga 6300
aatccgaaat tccacgaagc ctcgaagaag gttcaaatgg agttggaaag ccacaactcc 6360
ttgaaaattc ttgaggaaag catcgaggaa atcgccagcc atgactttgg tcttttgacc 6420
aagagtgacg aggaaactga agatatacct gtcaaaggtc cggccttagc ggtgagttct 6480
tagggcgcgc cctcgaggga tccgaattcg agctccgtcg acaagcttgc ggccgcactc 6540
gagcaccacc accaccacca ctgagatccg gctgctaaca aagcccgaaa ggaagctgag 6600
ttggctgctg ccaccgctga gcaataacta gcataacccc ttggggcctc taaacgggtc 6660
ttgaggggtt ttttgctgaa aggaggaact atatccggat 6700
<210> 79
<211> 51
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 79
aaacgtctca gatgcaccac caccaccacc acatggccat cgagaaacca g 51
<210> 80
<211> 36
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 80
aaaggcgcgc cttaagaagc taattcacta attgcc 36
<210> 81
<211> 6607
<212> DNA
<213> 人工的
<220>
<223> 载体
<400> 81
tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540
tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat 600
tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa 660
actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc 720
gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga 780
aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc 840
agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac 900
cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960
aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020
tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080
tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 1140
taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200
ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 1260
tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca 1320
tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac 1380
cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 1440
cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 1500
gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 1560
gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 1620
agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 1680
aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 1740
agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 1800
cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 1860
accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 1920
aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 1980
ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 2040
cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 2100
gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 2160
tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 2220
agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg 2280
tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta 2340
caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 2400
ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 2460
gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 2520
gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 2580
gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 2640
aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 2700
ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 2760
acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 2820
ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 2880
tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 2940
tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3000
cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3060
gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3120
ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggggccgc 3180
catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa 3240
ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc 3300
gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac 3360
gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca 3420
ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc ccggtgccta 3480
atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa 3540
cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 3600
tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga ttgcccttca 3660
ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc agcaggcgaa 3720
aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt 3780
atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg gcgcgcattg 3840
cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg ccctcattca 3900
gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc cgttccgcta 3960
tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc agacgcgccg 4020
agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat gcgaccagat 4080
gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg atgggtgtct 4140
ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc acagcaatgg 4200
catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc gcgagaagat 4260
tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac accaccacgc 4320
tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca 4380
gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg 4440
ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt 4500
tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg 4560
catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct 4620
cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga 4680
tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 4740
ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 4800
ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 4860
cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 4920
gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgagatctc gatcccgcga 4980
aattaatacg actcactata ggggaattgt gagcggataa caattcccct ctagaaataa 5040
ttttgtttaa ctttaagaag gagatatacg atgcaccacc accaccacca catggccatc 5100
gagaaaccag tgatagttgc ttgtgcctgc ccactagcgg ggcacgtggg cccagtgctc 5160
agcctggtcc gcggtctact caatagagga tatgaggtga ctttcgtaac agggaacgca 5220
ttcaaggaga aagttattga ggcaggatgc actttcgtcc ctctccaagg acgagctgac 5280
taccatgaat acaatctccc tgaaatcgct ccaggattgc tcacgattcc tccaggcctt 5340
gagcagaccg gttactcaat gaatgagatt tttgtgaagg cgattcctga gcagtacgat 5400
gcacttcaaa ctgctctaaa acaggttgag gctgaaaata aatcagctgt ggtgattggc 5460
gagaccatgt ttctaggggt gcatccgata tcactgggtg ccccaggtct caagccccaa 5520
ggcgtaatca cgttaggaac tattccgtgc atgctgaaag cagagaaggc gcctggagtt 5580
cctagtcttg agccaatgat tgatacttta gtgcggcaac aagtatttca accaggaact 5640
gactctgaga aggagatcat gaagacgctc ggggccacga aggagcccga atttctcctg 5700
gagaatatat acagcagccc tgacagattt ttgcaactgt gccctccatc tcttgaattt 5760
cacttgactt cgcctcctcc tggcttctcg ttcgctggta gtgcaccgca tgtaaagtct 5820
gctggattag caactccacc tcacctgccg tcttggtggc ctgatgtgct gagtgcgaag 5880
cgtctgattg ttgttacaca aggaacagca gccatcaact atgaagatct gctcattcca 5940
gcattgcagg cctttgctga cgaagaagac actctcgtag ttggtatatt gggcgtcaaa 6000
ggggcgtcac ttcctgatag cgttaaagtt cctgcaaacg ctcgaattgt tgattatttt 6060
ccttacgatg agctactacc gcatgcctct gttttcatat acaacggtgg atacggaggt 6120
ctgcagcaca gtttgagcca tggcgttccc gtcatcatcg gaggaggaat gttggtagac 6180
aagccagctg ttgcttcacg agctgtatgg gctggtgttg gttatgatct tcaaaccttg 6240
caggcaactt ctgagctagt ctccacggcc gttaaggagg tgttggctac tccctcgtat 6300
cacgagaaag ccatggcagt caagaaagag cttgaaaaat acaagtctct tgatattcta 6360
gagtcggcaa ttagtgaatt agcttcttaa ggcgcgccct cgagggatcc gaattcgagc 6420
tccgtcgaca agcttgcggc cgcactcgag caccaccacc accaccactg agatccggct 6480
gctaacaaag cccgaaagga agctgagttg gctgctgcca ccgctgagca ataactagca 6540
taaccccttg gggcctctaa acgggtcttg aggggttttt tgctgaaagg aggaactata 6600
tccggat 6607
<210> 82
<211> 50
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 82
aaacgtctca gatgcaccac caccaccacc acatggttgt aaactcctcg 50
<210> 83
<211> 30
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 83
aaaggcgcgc cctagacctt ctggttagcg 30
<210> 84
<211> 6088
<212> DNA
<213> 人工的
<220>
<223> 载体
<400> 84
tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540
tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat 600
tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa 660
actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc 720
gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga 780
aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc 840
agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac 900
cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960
aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020
tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080
tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 1140
taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200
ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 1260
tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca 1320
tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac 1380
cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 1440
cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 1500
gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 1560
gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 1620
agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 1680
aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 1740
agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 1800
cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 1860
accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 1920
aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 1980
ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 2040
cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 2100
gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 2160
tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 2220
agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg 2280
tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta 2340
caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 2400
ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 2460
gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 2520
gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 2580
gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 2640
aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 2700
ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 2760
acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 2820
ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 2880
tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 2940
tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3000
cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3060
gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3120
ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggggccgc 3180
catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa 3240
ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc 3300
gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac 3360
gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca 3420
ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc ccggtgccta 3480
atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa 3540
cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 3600
tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga ttgcccttca 3660
ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc agcaggcgaa 3720
aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt 3780
atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg gcgcgcattg 3840
cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg ccctcattca 3900
gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc cgttccgcta 3960
tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc agacgcgccg 4020
agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat gcgaccagat 4080
gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg atgggtgtct 4140
ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc acagcaatgg 4200
catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc gcgagaagat 4260
tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac accaccacgc 4320
tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca 4380
gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg 4440
ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt 4500
tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg 4560
catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct 4620
cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga 4680
tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 4740
ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 4800
ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 4860
cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 4920
gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgagatctc gatcccgcga 4980
aattaatacg actcactata ggggaattgt gagcggataa caattcccct ctagaaataa 5040
ttttgtttaa ctttaagaag gagatatacg atgcaccacc accaccacca catggttgta 5100
aactcctcga aggaccctca aaacaaagga atgactccta gaaaagaaat tgaccaggaa 5160
atggtctctt gggccaaaaa aaacctcaaa aacacccctg gcaatgaaaa ctatgagaag 5220
atggtctcag gagttcctta caatccatac gatccagatc ttatgtttag agccctggct 5280
actagtgaga aagttaggga gttcaatacc attgcaagtg aaagtcgtac ttttgagtca 5340
aatcacgctg cttatatcaa gaaggtcgag attctcaaag acacttttgg tcaaacaaag 5400
gatattgtct ggctgaccgc tccattctca gttgattttg gattcaacat cagcgtaggc 5460
gagcactttt acgccaactt caacgtttgc ttcttggact cggctccaat aatctttggt 5520
gatgaggtga ttgtagggcc caatacaacg ttcgtgactg cgactcatcc tattagcccc 5580
gagaaacgtg cgaggagaat tgtgtatgct cttcctatca aggtggggaa taatgtatgg 5640
attggtgcga atgtgactgt cctgccgggt gttacgattg gagatggctc aacaattgcg 5700
gctggtgctg tcgttcgaga agatgttcct cctcgtactg tggtgggagg agtccctgcg 5760
cgaatcctca agcatattcc agaggaggat cccgacgagg ctgaaggaga ggaactggaa 5820
ttccttcttc cagttgaaat gaacgtcaat accgctaacc agaaggtcta gggcgcgccc 5880
tcgagggatc cgaattcgag ctccgtcgac aagcttgcgg ccgcactcga gcaccaccac 5940
caccaccact gagatccggc tgctaacaaa gcccgaaagg aagctgagtt ggctgctgcc 6000
accgctgagc aataactagc ataacccctt ggggcctcta aacgggtctt gaggggtttt 6060
ttgctgaaag gaggaactat atccggat 6088
<210> 85
<211> 10065
<212> DNA
<213> 人工的
<220>
<223> 载体
<220>
<221> misc_feature
<222> (3998)..(3998)
<223> n是a、c、g或t
<400> 85
gaattctcat gtttgacagc ttatcatcga taagctttaa tgcggtagtt tatcacagtt 60
aaattgctaa cgcagtcagg caccgtgtat gaaatctaac aatgcgctca tcgtcatcct 120
cggcaccgtc accctggatg ctgtaggcat aggcttggtt atgccggtac tgccgggcct 180
cttgcgggat atcgtccatt ccgacagcat cgccagtcac tatggcgtgc tgctagcgct 240
atatgcgttg atgcaatttc tatgcgcacc cgttctcgga gcactgtccg accgctttgg 300
ccgccgccca gtcctgctcg cttcgctact tggagccact atcgactacg cgatcatggc 360
gaccacaccc gtcctgtgga tccaggccgt tgagcaccgc cgccgcaagg aatggtgcat 420
gctgaggtgt ctcacaagtg ccgtgcagtc ccgcccccac ttgcttctct ttgtgtgtag 480
tgtacgtaca ttatcgagac cgttgttccc gcccacctcg atccggcatg ctgaggtgtc 540
tcacaagtgc cgtgcagtcc cgcccccact tgcttctctt tgtgtgtagt gtacgtacat 600
tatcgagacc gttgttcccg cccacctcga tccggcatgc tgaggtgtct cacaagtgcc 660
gtgcagtccc gcccccactt gcttctcttt gtgtgtagtg tacgtacatt atcgagaccg 720
ttgttcccgc ccacctcgat ccggcatgct gaggtgtctc acaagtgccg tgcagtcccg 780
cccccacttg cttctctttg tgtgtagtgt acgtacatta tcgagaccgt tgttcccgcc 840
cacctcgatc cggcatgcac tgatcacggg caaaagtgcg tatatataca agagcgtttg 900
ccagccacag attttcactc cacacaccac atcacacata caaccacaca catccacaat 960
gaaaaagcct gaactcaccg cgacgagcgt cgagaagttt ctgatcgaaa agttcgacag 1020
cgtctccgac ctgatgcagc tctcggaggg cgaagaatct cgtgctttca gcttcgatgt 1080
aggagggcgt ggatatgtcc tgcgggtaaa tagctgcgcc gatggtttct acaaagatcg 1140
ttatgtttat cggcactttg catcggccgc gctcccgatt ccggaagtgc ttgacattgg 1200
ggagttcagc gagagcctga cctattgcat ctcccgccgt gcacagggtg tcacgttgca 1260
agacctgcct gaaaccgaac tgcccgctgt tctgcagccg gtcgcggagg ccatggatgc 1320
gatcgctgcg gccgatctta gccagacgag cgggttcggc ccattcggac cgcaaggaat 1380
cggtcaatac actacatggc gtgatttcat atgcgcgatt gctgatcccc atgtgtatca 1440
ctggcaaact gtgatggacg acaccgtcag tgcgtccgtc gcgcaggctc tcgatgagct 1500
gatgctttgg gccgaggact gccccgaagt ccggcacctc gtgcacgcgg atttcggctc 1560
caacaatgtc ctgacggaca atggccgcat aacagcggtc attgactgga gcgaggcgat 1620
gttcggggat tcccaatacg aggtcgccaa catcttcttc tggaggccgt ggttggcttg 1680
tatggagcag cagacgcgct acttcgagcg gaggcatccg gagcttgcag gatcgccgcg 1740
gctccgggcg tatatgctcc gcattggtct tgaccaactc tatcagagct tggttgacgg 1800
caatttcgat gatgcagctt gggcgcaggg tcgatgcgac gcaatcgtcc gatccggagc 1860
cgggactgtc gggcgtacac aaatcgcccg cagaagcgcg gccgtctgga ccgatggctg 1920
tgtagaagta ctcgccgata gtggaaaccg acgccccagc actcgtccga gggcaaagga 1980
atagtcgacg ctctccctta tgcgactcct gcattaggaa gcagcccagt agtaggttga 2040
ggccgttgag caccgccgcc gcaaggaatg gtgcatgctg aggtgtctca caagtgccgt 2100
gcagtcccgc ccccacttgc ttctctttgt gtgtagtgta cgtacattat cgagaccgtt 2160
gttcccgccc acctcgatcc ggcatgctga ggtgtctcac aagtgccgtg cagtcccgcc 2220
cccacttgct tctctttgtg tgtagtgtac gtacattatc gagaccgttg ttcccgccca 2280
cctcgatccg gcatgctgag gtgtctcaca agtgccgtgc agtcccgccc ccacttgctt 2340
ctctttgtgt gtagtgtacg tacattatcg agaccgttgt tcccgcccac ctcgatccgg 2400
catgctgagg tgtctcacaa gtgccgtgca gtcccgcccc cacttgcttc tctttgtgtg 2460
tagtgtacgt acattatcga gaccgttgtt cccgcccacc tcgatccggc atgcactgat 2520
cacgggcaaa agtgcgtata tatacaagag cgtttgccag ccacagattt tcactccaca 2580
caccacatca cacatacaac cacacacatc cacgggctgc aggaattcga tatcaagctt 2640
atcgataccg tcgaggggca gagccgatcc tgtacacttt acttaaaacc attatctgag 2700
tgttaaatgt ccaatttact gaccgtacac caaaatttgc ctgcattacc ggtcgatgca 2760
acgagtgatg aggttcgcaa gaacctgatg gacatgttca gggatcgcca ggcgttttct 2820
gagcatacct ggaaaatgct tctgtccgtt tgccggtcgt gggcggcatg gtgcaagttg 2880
aataaccgga aatggtttcc cgcagaacct gaagatgttc gcgattatct tctatatctt 2940
caggcgcgcg gtctggcagt aaaaactatc cagcaacatt tgggccagct aaacatgctt 3000
catcgtcggt ccgggctgcc acgaccaagt gacagcaatg ctgtttcact ggttatgcgg 3060
cggatccgaa aagaaaacgt tgatgccggt gaacgtgcaa aacaggctct agcgttcgaa 3120
cgcactgatt tcgaccaggt tcgttcactc atggaaaata gcgatcgctg ccaggatata 3180
cgtaatctgg catttctggg gattgcttat aacaccctgt tacgtatagc cgaaattgcc 3240
aggatcaggg ttaaagatat ctcacgtact gacggtggga gaatgttaat ccatattggc 3300
agaacgaaaa cgctggttag caccgcaggt gtagagaagg cacttagcct gggggtaact 3360
aaactggtcg agcgatggat ttccgtctct ggtgtagctg atgatccgaa taactacctg 3420
ttttgccggg tcagaaaaaa tggtgttgcc gcgccatctg ccaccagcca gctatcaact 3480
cgcgccctgg aagggatttt tgaagcaact catcgattga tttacggcgc taaggatgac 3540
tctggtcaga gatacctggc ctggtctgga cacagtgccc gtgtcggagc cgcgcgagat 3600
atggcccgcg ctggagtttc aataccggag atcatgcaag ctggtggctg gaccaatgta 3660
aatattgtca tgaactatat ccgtaccctg gatagtgaaa caggggcaat ggtgcgcctg 3720
ctggaagatg gcgattagcc attaacgcgt aaatgattgc tataattatt tgatatttat 3780
ggtgacatat gagaaaggat ttcaacatcg acggaaaata tgtagtgctg tctgtaagca 3840
ctaatattca gtcgccagcc gtcattgtca ctgtaaagct gagcgataga atgcctgata 3900
ttgactcaat atccgttgcg tttcctgtca aaagtatgcg tagtgctgaa catttcgtga 3960
tgaatgccac cgaggaagaa gcacggcgcg gttttgcnta aagtgatgtc tgagtttggc 4020
gaactcttgg gtaaggttgg aattgtcgac cgatgccctt gagagccttc aacccagtca 4080
gctccttccg gtgggcgcgg ggcatgacta tcgtcgccgc acttatgact gtcttcttta 4140
tcatgcaact cgtaggacag gtgccggcag cgctctgggt cattttcggc gaggaccgct 4200
ttcgctggag cgcgacgatg atcggcctgt cgcttgcggt attcggaatc ttgcacgccc 4260
tcgctcaagc cttcgtcact ggtcccgcca ccaaacgttt cggcgagaag caggccatta 4320
tcgccggcat ggcggccgac gcgctgggct acgtcttgct ggcgttcgcg acgcgaggct 4380
ggatggcctt ccccattatg attcttctcg cttccggcgg catcgggatg cccgcgttgc 4440
aggccatgct gtccaggcag gtagatgacg accatcaggg acagcttcaa ggatcgctcg 4500
cggctcttac cagcctaact tcgatcactg gaccgctgat cgtcacggcg atttatgccg 4560
cctcggcgag cacatggaac gggttggcat ggattgtagg cgccgcccta taccttgtct 4620
gcctccccgc gttgcgtcgc ggtgcatgga gccgggccac ctcgacctga atggaagccg 4680
gcggcacctc gctaacggat tcaccactcc aagaattgga gccaatcaat tcttgcggag 4740
aactgtgaat gcgcaaacca acccttggca gaacatatcc atcgcgtccg ccatctccag 4800
cagccgcacg cggcgcatct cgggcagcgt tgggtcctgg ccacgggtgc gcatgatcgt 4860
gctcctgtcg ttgaggaccc ggctaggctg gcggggttgc cttactggtt agcagaatga 4920
atcaccgata cgcgagcgaa cgtgaagcga ctgctgctgc aaaacgtctg cgacctgagc 4980
aacaacatga atggtcttcg gtttccgtgt ttcgtaaagt ctggaaacgc ggaagtcagc 5040
gccctgcacc attatgttcc ggatctgcat cgcaggatgc tgctggctac cctgtggaac 5100
acctacatct gtattaacga agcgctggca ttgaccctga gtgatttttc tctggtcccg 5160
ccgcatccat accgccagtt gtttaccctc acaacgttcc agtaaccggg catgttcatc 5220
atcagtaacc cgtatcgtga gcatcctctc tcgtttcatc ggtatcatta cccccatgaa 5280
cagaaattcc cccttacacg gaggcatcaa gtgaccaaac aggaaaaaac cgcccttaac 5340
atggcccgct ttatcagaag ccagacatta acgcttctgg agaaactcaa cgagctggac 5400
gcggatgaac aggcagacat ctgtgaatcg cttcacgacc acgctgatga gctttaccgc 5460
agcagatctg tatatatata tatatatgca agccattttt tttctctcac catctatttt 5520
aatatataaa attagatcat ctatctaaac tttttcatta aataaattag atggcgaaaa 5580
taatggagac gtattccatt ataatatata aaaacctaaa actatgtttc attataacaa 5640
tttacttcct aatttggaaa attcgaagtt ggttattata tgtgcatata tactgaatgt 5700
tcataacttc tagtcaacag atataattta ttcctcgtag taacttgccc gcaaacattt 5760
tatatctaaa ttaatttcaa gggaagttct tgtaaatata tatttatctc aagtaaacag 5820
ttagaaatat cagccatgat gacattttcc aggatggcaa tgactcatga tcacactgag 5880
atttttaata gatatttcgt tagagatgat ggtatctcaa aacaaaacga ctgtagctct 5940
tttaccacct catttacaat ttcatctttc atcaaattta gggatgccat caactttcag 6000
ttcataatta atatcttacc aaattaggta atctgcaaaa gttcagactg tgaaatgtaa 6060
cattttatat atcaagctct atttaatgcc tcacagtagt taacataaag agatacagaa 6120
ttgtcgtgtc agtgtatact atccatgtgt atactctgga tatccatttg tattccatta 6180
tctacgaaaa gcacttagat aaatactaaa ttgttatttg gtatgtatcg tataagttga 6240
aagttttgag cccatcttgt tgttttcttt tattaaataa aataaaataa ctaacgttat 6300
gatactttga tgtgtttttt aatttaatta taccagtact tgtttgaaat ttttttctgc 6360
agaattttgg ccggctcatt tctatttgtt gtaagtacga gtatttgaac ttttagtcag 6420
atactggtag ttatatattt attttgtttt tgtttatttt gttgggtttt ttgtttgttg 6480
ttttttttcg gggggttgtg ttccaacttc gtttttggaa ttttaattta gtttctcgat 6540
cttcgctttt ggaatttatt taatttatcc ctccccttga ggtgtgaata acttaaaaat 6600
gctagaagga gctacacagg tgtttgtaca gtaaaaacta tcagcaggat accatcgcaa 6660
gatgttcata tcgctttgtt gagtcactgc aggggaccgc tgaggtattc gctggttcgg 6720
tgagggcggc cgtccctgtg attcgtacga ataaattctt tgtacaagta ccagtgctac 6780
aattgtaggt ggtgctcata caggtacacc ccgtgtgtaa gtaaactcca attatgttat 6840
gtctgataaa aggatgtaac ataggcaagc tgctcgtgag tgttgagtac gaaccttaga 6900
tccaaatcac ccgcacccta cggatatact tgcttgaata tacttgtaat aaggctgtct 6960
gctgacatcg gtgcgcgtat gttctgggcg gcgactctct ccgaaccatc gaacagttcc 7020
tgaacacgac gagctagcta caacatgact cgcaagagct ctgtgcgtgt acacaacgag 7080
ccgtgcccgt gtaacagtct tcggttccga cccccaaaaa acccaccata caccgaaata 7140
gcacatcctt acgaccagta gcagcagagt gcgctacagt aagtattcgt caatacaagt 7200
aaatcacgag tacgacagtt gccgacacgg acagaaagga actacagatt taaatatacc 7260
aaacaataat tcattactaa tgtcaatcct tacagctgga taaaaaaact gggggatttt 7320
gttaacgagc tcattcgcaa atgaaacggg aaaagttctt cgatttagtg ttaaatctcc 7380
gttaaaaacc gcttatttgg atcgagctcg gaccttgcgg cgctttcgct tgagtcgtct 7440
gactctcttc tttctccact tagctctcat tctgggttag ttccatgttc tccgctggcg 7500
ggggcgacca ccgctaatcg agccgacttg tattgaaagg caggcaagaa ggtatcgaag 7560
gggaagaacc gttttgtggt tgctgcacca cggcttccaa tgctctccca atgaagaacc 7620
aaggtcggta attaatactc acttgaaaga tcaagacaag aacctgatga atgtgaggaa 7680
aaaaagacaa gaaggggaaa gtttgaccat ttttaagctg tgcgagccac aggccgggta 7740
acagataaat taggttctga aaattcggat ctgctgcctc gcgcgtttcg gtgatgacgg 7800
tgaaaacctc tgacacatgc agctcccgga gacggtcaca gcttgtctgt aagcggatgc 7860
cgggagcaga caagcccgtc agggcgcgtc agcgggtgtt ggcgggtgtc ggggcgcagc 7920
catgacccag tcacgtagcg atagcggagt gtatactggc ttaactatgc ggcatcagag 7980
cagattgtac tgagagtgca ccatatgcgg tgtgaaatac cgcacagatg cgtaaggaga 8040
aaataccgca tcaggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 8100
cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 8160
ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 8220
aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 8280
cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 8340
cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 8400
gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 8460
tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 8520
cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 8580
ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 8640
gagttcttga agtggtggcc taactacggc tacactagaa ggacagtatt tggtatctgc 8700
gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 8760
accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 8820
ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac 8880
tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta 8940
aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt 9000
taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata 9060
gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc atctggcccc 9120
agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc agcaataaac 9180
cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag 9240
tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag tttgcgcaac 9300
gttgttgcca ttgctgcagg catcgtggtg tcacgctcgt cgtttggtat ggcttcattc 9360
agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg 9420
gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt gttatcactc 9480
atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct 9540
gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc 9600
tcttgcccgg cgtcaacacg ggataatacc gcgccacata gcagaacttt aaaagtgctc 9660
atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc 9720
agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc 9780
gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca 9840
cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt 9900
tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt 9960
ccgcgcacat ttccccgaaa agtgccacct gacgtctaag aaaccattat tatcatgaca 10020
ttaacctata aaaataggcg tatcacgagg ccctttcgtc ttcaa 10065
<210> 86
<211> 28
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 86
aaagatatct ctatgcgcac ccgttctc 28
<210> 87
<211> 39
<212> DNA
<213> 人工的
<220>
<223> 引物
<400> 87
tttagatcta agcttgagac acctcagcat gcaccattc 39
<210> 88
<211> 8114
<212> DNA
<213> 人工的
<220>
<223> 载体
<400> 88
agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc 60
acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc 120
tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa 180
ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac gccaagctca 240
gaattaaccc tcactaaagg gactagtcct gcaggtttaa acgaattcgc cctttcatct 300
cgagatgctt tattcaggca cgctacgtga gaatattcta atgggatggt ctggccctga 360
gtctgaagta acgcaggaga tgattgagga tgccgctcgc aaagcgaaca ttcacgaatt 420
catcatgtcg ttgcctgatg gctacgaaac gctcagcgga tctaggggat cgttgctatc 480
tggggggcaa aagcagcgaa ttgcaattgc aagggccctg atcagaaatc caaaggtact 540
cctcctcgat gaggccacct cagctctgga ttccgaatct gagaaagtag ttcaagcagc 600
actcgacgca gcagcgaagg gccgtactac aatcgccgtt gcgcatagat tatcaacaat 660
tcagaaagca gatgtcatat atgtgttctc aggagggcgc atcgtggagc agggcgacca 720
tcagagcctc cttgaactca atggatggta cgctgaattg gtgaacttgc aaggtctcgg 780
agagatttga cgttcattta tttttggcca ctgcttgcat acattatttg attaaaggca 840
ctcattaatt gaaatagcat atcgaatttc tctagttatg gcccctgagt caccatacat 900
tgtctgatta aagggactcg ttaattgaaa tagcacattg gattcctctg attatgaccc 960
ctgagtcacc tatcctgcat aattcactcg tgacgataat ctgtagatat agggaactgt 1020
cgtagtactt gaagagacag caacaatcta tctctgggat ttcgtgctga ttttgggctt 1080
ttgctttgac gggctatgac tgaggtaatg tagaccaata ataaccctca cgcgaattag 1140
atatgccctg agggttagct tgcatcacct tacccatatg cacactgact tgcattaccc 1200
ggagcatatt ccggtagtcg gagataagca ctttgagata tcttaaggta caactcaata 1260
cgttcctcct tccttgcctc attccacctc acattctaga attcaataac ttcgtatagc 1320
atacattata cgaagttatt aattaacatc atcgtcacta tacacatcgt catcaactcc 1380
atggcgtgag gacttccgag actgctgggc ccttcgtttc tttaatgcct caagagatga 1440
cttcgtaccc gaagagacgc ctgttgtacc ccgttgacgc ttggcggagg gggcttcgtc 1500
ctcgtcagca acccgcgtca tctgcttcct tcgctgagca agataccttc tctcctcgta 1560
ccgctgcatc tcctgagctc ggtcatacaa gatctcttct cgctcaatct ctggcagcgc 1620
gtccaacttc gccctgtctt cagcatcgag atatttgcct tctagaggat agggattgac 1680
gacctcattg cttggcggcg acggcagcga gatttcctct tcggagtcgg agccaacgtc 1740
ggccaatgcc agcagatcat catcactgtc actcatagta ggaaggttga agtgtgctga 1800
cgaatcagaa tcgcgaagga tgccattgaa ggcatatata ttttaatctg taccttttat 1860
ggtaatttaa tcagatttta taggtattca tgtgcaagtt gcattgaagg aactgtttga 1920
gaaaatcatc ttgactgaac ttttctcaga tatgcattcc agcccgcctt ttggtaacgc 1980
tgagcttcgt gcacaggatc tcgtcccttg ctatagagcc cgcgtccgac gataataacg 2040
tctgtgccgg tctctatgac gtcgtccaca gtacgatact gctgccccaa tccatcacct 2100
ttgtcgtcca ggcccacccc aggagtcata atgacccagt cttcctctgg ctttccgact 2160
ttttgctgag cgatgaaacc aaacacaaat gcgcggttac tgcgagcgat gtctactgtc 2220
gcttgcgagt attcgccgtg agccagtgtg cccttcgaac tcagttctgc aagcatgaca 2280
aggccgcgag gttcatccgt agtttccttc gcagcctctt ctagtccgct cacaattccc 2340
ggcccaggaa caccgtgagc atttgttata tcagcccatt gagcgatctt aaacactcca 2400
cctgcatatt gggccttaac agtggaaccg atgtctgcga actttcggtc ttcaaaaatg 2460
agaaaattgt gcttcgttga aagctgtttc aaaccgctga cagttgtgtc gtattcgaag 2520
tcgtcaatta tgtcaatgtg ggtcttaacc atacaaatgt aaggtccaat gcggtccagg 2580
atactcagta actcagaggt agttcgcaca tccaagcttg cgcaaagatt tgtttgcttg 2640
ctcacaatga tgtcgaatag ccgggctgct acagccggca gcctctctcg gcgctcctca 2700
tagctcagct tcatattatt tctctacagt agtgcccgtg ccctcgatca gctaggactt 2760
ttcaaattaa tcgggctgtt tgatgtaagt aagatgaagt cacgcgcgtg caggagactg 2820
cgtcccgcga tattctgcag gcttgaaaaa tttaccctaa cggtaggcat caagtgagtg 2880
agtctcagcg tcgatatggg tcaaaaaagg ggaaaactag ccgagatcgt tgcgagctgt 2940
ttcgaaaatt atgccctatg gcaattatca cgtggagtat ccgaatttct ccaggctgtc 3000
aagcggcaat tataaccgag actgagatcg agaagtatat aaccgcagca gtagtggata 3060
aataattgcg aagtcttccc agcagagcgg gctgtttttt ggagttggtt actgtaaaat 3120
gctaaaatga ctgacaacaa tggagcgtct acagcattgg caacagtggg aacagtatgc 3180
tggtgcatcc agttgatacc ccaggttctg cgaaactggt atgttcggga ttgcgagggc 3240
gttcctcctc tgatgttctt tttgttcgcc gtttcgggga ttcccttcgc agtgtacttc 3300
attgatcaga attcgaacac tgccatcatg gttcaacctc acttgtttac tttctttagc 3360
cttataggct tttggcaaag cctgtactat ccgcccgtca gttaattaat aacttcgtat 3420
agcatacatt atacgaagtt attaggtaaa ctaaattcat gacagccttt tcttctttct 3480
ttccacaaaa caattaaaaa aaataacaga attagaagaa ggtaaatata ttggcaaact 3540
cctctcttcc ttttacttat ttttttgaaa gttgcagtgt gtgtgtgtgt tgttgtttgt 3600
tcaaattaat ttgatggttg ttgtattgta aatttcaatc aataaaaaca aagacataaa 3660
taaaaaaaac cctacctctc ttccctgatc tgatttgatc gtacgattct aagaactcac 3720
cgctaaggcc ggccctttga caggtatatc ttcagtttcc tcgtcactct tggtcaaaag 3780
accaaagtca tggctggcga tttcctcgat gctttcctca agaattttca aggagttgtg 3840
gctttccaac tccatttgaa ccttcttcga ggcttcgtgg aatttcggat ttccaattat 3900
cgaatcaaca gcttctttga tttgctccac tgtaggcaag ccagttttca aatcaattgc 3960
cacgccagcg gcctcagctc tcgatgccac cattggcttg tcttcagagt caccagcaat 4020
aacaactgga acagagtggc ttaagctgtg ctgaagtccg ccatatccac cattgtagac 4080
aagagcatca acgtgaggaa gtagagcatc gtagttgaag tagtcgatca cgcgagcatt 4140
ctcaggaacc acaacatcat ccggtagctt ggcaccgcgg cggcccaata tggctactgt 4200
taaagtgtca ggctcgtcct tcaaggcctc aagagtaggc acaataagat gcttgtaact 4260
gacagcaaaa gttccttgag tgaccatgat gactcgcttg gcactcagaa catcccccca 4320
ccaggaagga ggggtgaatt gagttcggtg cttgggcgtt gagccggcga atttgaagtt 4380
gctaggcaga tggtctctgc tgaactcaag agaaggcggg cacagctgca ggaacttgtc 4440
tgcaggtacc tcaagggcga attcgcggcc gctaaattca attcgcccta tagtgagtcg 4500
tattacaatt cactggccgt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc 4560
caacttaatc gccttgcagc acatccccct ttcgccagct ggcgtaatag cgaagaggcc 4620
cgcaccgatc gcccttccca acagttgcgc agcctatacg tacggcagtt taaggtttac 4680
acctataaaa gagagagccg ttatcgtctg tttgtggatg tacagagtga tattattgac 4740
acgccggggc gacggatggt gatccccctg gccagtgcac gtctgctgtc agataaagtc 4800
tcccgtgaac tttacccggt ggtgcatatc ggggatgaaa gctggcgcat gatgaccacc 4860
gatatggcca gtgtgccggt ctccgttatc ggggaagaag tggctgatct cagccaccgc 4920
gaaaatgaca tcaaaaacgc cattaacctg atgttctggg gaatataaat gtcaggcatg 4980
agattatcaa aaaggatctt cacctagatc cttttcacgt agaaagccag tccgcagaaa 5040
cggtgctgac cccggatgaa tgtcagctac tgggctatct ggacaaggga aaacgcaagc 5100
gcaaagagaa agcaggtagc ttgcagtggg cttacatggc gatagctaga ctgggcggtt 5160
ttatggacag caagcgaacc ggaattgcca gctggggcgc cctctggtaa ggttgggaag 5220
ccctgcaaag taaactggat ggctttcttg ccgccaagga tctgatggcg caggggatca 5280
agctctgatc aagagacagg atgaggatcg tttcgcatga ttgaacaaga tggattgcac 5340
gcaggttctc cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca 5400
atcggctgct ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt 5460
gtcaagaccg acctgtccgg tgccctgaat gaactgcaag acgaggcagc gcggctatcg 5520
tggctggcca cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga 5580
agggactggc tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct 5640
cctgccgaga aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg 5700
gctacctgcc cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg 5760
gaagccggtc ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc 5820
gaactgttcg ccaggctcaa ggcgagcatg cccgacggcg aggatctcgt cgtgacccat 5880
ggcgatgcct gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac 5940
tgtggccggc tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt 6000
gctgaagagc ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct 6060
cccgattcgc agcgcatcgc cttctatcgc cttcttgacg agttcttctg aattattaac 6120
gcttacaatt tcctgatgcg gtattttctc cttacgcatc tgtgcggtat ttcacaccgc 6180
atcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 6240
acattcaaat atgtatccgc tcatgagatt atcaaaaagg atcttcacct agatcctttt 6300
aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt ggtctgacag 6360
ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc gttcatccat 6420
agttgcctga ctccccgtcg tgtagataac tacgatacgg gagggcttac catctggccc 6480
cagtgctgca atgataccgc gagacccacg ctcaccggct ccagatttat cagcaataaa 6540
ccagccagcc ggaagggccg agcgcagaag tggtcctgca actttatccg cctccatcca 6600
gtctattaat tgttgccggg aagctagagt aagtagttcg ccagttaata gtttgcgcaa 6660
cgttgttgcc attgctacag gcatcgtggt gtcacgctcg tcgtttggta tggcttcatt 6720
cagctccggt tcccaacgat caaggcgagt tacatgatcc cccatgttgt gcaaaaaagc 6780
ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag tgttatcact 6840
catggttatg gcagcactgc ataattctct tactgtcatg ccatccgtaa gatgcttttc 6900
tgtgactggt gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc gaccgagttg 6960
ctcttgcccg gcgtcaatac gggataatac cgcgccacat agcagaactt taaaagtgct 7020
catcattgga aaacgttctt cggggcgaaa actctcaagg atcttaccgc tgttgagatc 7080
cagttcgatg taacccactc gtgcacccaa ctgatcttca gcatctttta ctttcaccag 7140
cgtttctggg tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa taagggcgac 7200
acggaaatgt tgaatactca tactcttcct ttttcaatat tattgaagca tttatcaggg 7260
ttattgtctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc 7320
cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt 7380
gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac 7440
tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg ttcttctagt 7500
gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct 7560
gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga 7620
ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac 7680
acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagctatg 7740
agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt 7800
cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc 7860
tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg 7920
gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc 7980
ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc 8040
ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag 8100
cgaggaagcg gaag 8114
<210> 89
<211> 8578
<212> DNA
<213> 人工的
<220>
<223> 载体
<400> 89
agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc 60
acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc 120
tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa 180
ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac gccaagctca 240
gaattaaccc tcactaaagg gactagtcct gcaggtttaa acgaattcgc cctttcatct 300
cgagatgctt tattcaggca cgctacgtga gaatattcta atgggatggt ctggccctga 360
gtctgaagta acgcaggaga tgattgagga tgccgctcgc aaagcgaaca ttcacgaatt 420
catcatgtcg ttgcctgatg gctacgaaac gctcagcgga tctaggggat cgttgctatc 480
tggggggcaa aagcagcgaa ttgcaattgc aagggccctg atcagaaatc caaaggtact 540
cctcctcgat gaggccacct cagctctgga ttccgaatct gagaaagtag ttcaagcagc 600
actcgacgca gcagcgaagg gccgtactac aatcgccgtt gcgcatagat tatcaacaat 660
tcagaaagca gatgtcatat atgtgttctc aggagggcgc atcgtggagc agggcgacca 720
tcagagcctc cttgaactca atggatggta cgctgaattg gtgaacttgc aaggtctcgg 780
agagatttga cgttcattta tttttggcca ctgcttgcat acattatttg attaaaggca 840
ctcattaatt gaaatagcat atcgaatttc tctagttatg gcccctgagt caccatacat 900
tgtctgatta aagggactcg ttaattgaaa tagcacattg gattcctctg attatgaccc 960
ctgagtcacc tatcctgcat aattcactcg tgacgataat ctgtagatat agggaactgt 1020
cgtagtactt gaagagacag caacaatcta tctctgggat ttcgtgctga ttttgggctt 1080
ttgctttgac gggctatgac tgaggtaatg tagaccaata ataaccctca cgcgaattag 1140
atatgccctg agggttagct tgcatcacct tacccatatg cacactgact tgcattaccc 1200
ggagcatatt ccggtagtcg gagataagca ctttgagata tcttaaggta caactcaata 1260
cgttcctcct tccttgcctc attccacctc acattctaga attcaataac ttcgtatagc 1320
atacattata cgaagttatt aattaacatc atcgtcacta tacacatcgt catcaactcc 1380
atggcgtgag gacttccgag actgctgggc ccttcgtttc tttaatgcct caagagatga 1440
cttcgtaccc gaagagacgc ctgttgtacc ccgttgacgc ttggcggagg gggcttcgtc 1500
ctcgtcagca acccgcgtca tctgcttcct tcgctgagca agataccttc tctcctcgta 1560
ccgctgcatc tcctgagctc ggtcatacaa gatctaagct tgagacacct cagcatgcac 1620
cattccttgc ggcggcggtg ctcaacggcc tcaacctact actgggctgc ttcctaatgc 1680
aggagtcgca taagggagag cgtcgactat tcctttgccc tcggacgagt gctggggcgt 1740
cggtttccac tatcggcgag tacttctaca cagccatcgg tccagacggc cgcgcttctg 1800
cgggcgattt gtgtacgccc gacagtcccg gctccggatc ggacgattgc gtcgcatcga 1860
ccctgcgccc aagctgcatc atcgaaattg ccgtcaacca agctctgata gagttggtca 1920
agaccaatgc ggagcatata cgcccggagc cgcggcgatc ctgcaagctc cggatgcctc 1980
cgctcgaagt agcgcgtctg ctgctccata caagccaacc acggcctcca gaagaagatg 2040
ttggcgacct cgtattggga atccccgaac atcgcctcgc tccagtcaat gaccgctgtt 2100
atgcggccat tgtccgtcag gacattgttg gagccgaaat ccgcgtgcac gaggtgccgg 2160
acttcggggc agtcctcggc ccaaagcatc agctcatcga gagcctgcgc gacggacgca 2220
ctgacggtgt cgtccatcac agtttgccag tgatacacat ggggatcagc aatcgcgcat 2280
atgaaatcac gccatgtagt gtattgaccg attccttgcg gtccgaatgg gccgaacccg 2340
ctcgtctggc taagatcggc cgcagcgatc gcatccatgg cctccgcgac cggctgcaga 2400
acagcgggca gttcggtttc aggcaggtct tgcaacgtga caccctgtgc acggcgggag 2460
atgcaatagg tcaggctctc gctaaattcc ccaatgtcaa gcacttccgg aatcgggagc 2520
gcggccgatg caaagtgccg ataaacataa cgatctttgt agaaaccatc ggcgcagcta 2580
tttacccgca ggacatatcc acgccctcct acatcgaagc tgaaagcacg agattcttcg 2640
ccctccgaga gctgcatcag gtcggagacg ctgtcgaact tttcgatcag aaacttctcg 2700
acagacgtgg cggtgagttc aggctttttc attgtggatg tgtgtggttg tatgtgtgat 2760
gtggtgtgtg gagtgaaaat ctgtggctgg caaacgctct tgtatatata cgcacttttg 2820
cccgtgatca gtgcatgccg gatcgaggtg ggcgggaaca acggtctcga taatgtacgt 2880
acactacaca caaagagaag caagtggggg cgggactgca cggcacttgt gagacacctc 2940
agcatgccgg atcgaggtgg gcgggaacaa cggtctcgat aatgtacgta cactacacac 3000
aaagagaagc aagtgggggc gggactgcac ggcacttgtg agacacctca gcatgccgga 3060
tcgaggtggg cgggaacaac ggtctcgata atgtacgtac actacacaca aagagaagca 3120
agtgggggcg ggactgcacg gcacttgtga gacacctcag catgccggat cgaggtgggc 3180
gggaacaacg gtctcgataa tgtacgtaca ctacacacaa agagaagcaa gtgggggcgg 3240
gactgcacgg cacttgtgag acacctcagc atgcaccatt ccttgcggcg gcggtgctca 3300
acggcctgga tccacaggac gggtgtggtc gccatgatcg cgtagtcgat agtggctcca 3360
agtagcgaag cgagcaggac tgggcggcgg ccaaagcggt cggacagtgc tccgagaacg 3420
ggtgcgcata gagatgtgga gtatccgaat ttctccaggc tgtcaagcgg caattataac 3480
cgagactgag atcgagaagt atataaccgc agcagtagtg gataaataat tgcgaagtct 3540
tcccagcaga gcgggctgtt ttttggagtt ggttactgta aaatgctaaa atgactgaca 3600
acaatggagc gtctacagca ttggcaacag tgggaacagt atgctggtgc atccagttga 3660
taccccaggt tctgcgaaac tggtatgttc gggattgcga gggcgttcct cctctgatgt 3720
tctttttgtt cgccgtttcg gggattccct tcgcagtgta cttcattgat cagaattcga 3780
acactgccat catggttcaa cctcacttgt ttactttctt tagccttata ggcttttggc 3840
aaagcctgta ctatccgccc gtcagttaat taataacttc gtatagcata cattatacga 3900
agttattagg taaactaaat tcatgacagc cttttcttct ttctttccac aaaacaatta 3960
aaaaaaataa cagaattaga agaaggtaaa tatattggca aactcctctc ttccttttac 4020
ttattttttt gaaagttgca gtgtgtgtgt gtgttgttgt ttgttcaaat taatttgatg 4080
gttgttgtat tgtaaatttc aatcaataaa aacaaagaca taaataaaaa aaaccctacc 4140
tctcttccct gatctgattt gatcgtacga ttctaagaac tcaccgctaa ggccggccct 4200
ttgacaggta tatcttcagt ttcctcgtca ctcttggtca aaagaccaaa gtcatggctg 4260
gcgatttcct cgatgctttc ctcaagaatt ttcaaggagt tgtggctttc caactccatt 4320
tgaaccttct tcgaggcttc gtggaatttc ggatttccaa ttatcgaatc aacagcttct 4380
ttgatttgct ccactgtagg caagccagtt ttcaaatcaa ttgccacgcc agcggcctca 4440
gctctcgatg ccaccattgg cttgtcttca gagtcaccag caataacaac tggaacagag 4500
tggcttaagc tgtgctgaag tccgccatat ccaccattgt agacaagagc atcaacgtga 4560
ggaagtagag catcgtagtt gaagtagtcg atcacgcgag cattctcagg aaccacaaca 4620
tcatccggta gcttggcacc gcggcggccc aatatggcta ctgttaaagt gtcaggctcg 4680
tccttcaagg cctcaagagt aggcacaata agatgcttgt aactgacagc aaaagttcct 4740
tgagtgacca tgatgactcg cttggcactc agaacatccc cccaccagga aggaggggtg 4800
aattgagttc ggtgcttggg cgttgagccg gcgaatttga agttgctagg cagatggtct 4860
ctgctgaact caagagaagg cgggcacagc tgcaggaact tgtctgcagg tacctcaagg 4920
gcgaattcgc ggccgctaaa ttcaattcgc cctatagtga gtcgtattac aattcactgg 4980
ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt aatcgccttg 5040
cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc gatcgccctt 5100
cccaacagtt gcgcagccta tacgtacggc agtttaaggt ttacacctat aaaagagaga 5160
gccgttatcg tctgtttgtg gatgtacaga gtgatattat tgacacgccg gggcgacgga 5220
tggtgatccc cctggccagt gcacgtctgc tgtcagataa agtctcccgt gaactttacc 5280
cggtggtgca tatcggggat gaaagctggc gcatgatgac caccgatatg gccagtgtgc 5340
cggtctccgt tatcggggaa gaagtggctg atctcagcca ccgcgaaaat gacatcaaaa 5400
acgccattaa cctgatgttc tggggaatat aaatgtcagg catgagatta tcaaaaagga 5460
tcttcaccta gatccttttc acgtagaaag ccagtccgca gaaacggtgc tgaccccgga 5520
tgaatgtcag ctactgggct atctggacaa gggaaaacgc aagcgcaaag agaaagcagg 5580
tagcttgcag tgggcttaca tggcgatagc tagactgggc ggttttatgg acagcaagcg 5640
aaccggaatt gccagctggg gcgccctctg gtaaggttgg gaagccctgc aaagtaaact 5700
ggatggcttt cttgccgcca aggatctgat ggcgcagggg atcaagctct gatcaagaga 5760
caggatgagg atcgtttcgc atgattgaac aagatggatt gcacgcaggt tctccggccg 5820
cttgggtgga gaggctattc ggctatgact gggcacaaca gacaatcggc tgctctgatg 5880
ccgccgtgtt ccggctgtca gcgcaggggc gcccggttct ttttgtcaag accgacctgt 5940
ccggtgccct gaatgaactg caagacgagg cagcgcggct atcgtggctg gccacgacgg 6000
gcgttccttg cgcagctgtg ctcgacgttg tcactgaagc gggaagggac tggctgctat 6060
tgggcgaagt gccggggcag gatctcctgt catctcacct tgctcctgcc gagaaagtat 6120
ccatcatggc tgatgcaatg cggcggctgc atacgcttga tccggctacc tgcccattcg 6180
accaccaagc gaaacatcgc atcgagcgag cacgtactcg gatggaagcc ggtcttgtcg 6240
atcaggatga tctggacgaa gagcatcagg ggctcgcgcc agccgaactg ttcgccaggc 6300
tcaaggcgag catgcccgac ggcgaggatc tcgtcgtgac ccatggcgat gcctgcttgc 6360
cgaatatcat ggtggaaaat ggccgctttt ctggattcat cgactgtggc cggctgggtg 6420
tggcggaccg ctatcaggac atagcgttgg ctacccgtga tattgctgaa gagcttggcg 6480
gcgaatgggc tgaccgcttc ctcgtgcttt acggtatcgc cgctcccgat tcgcagcgca 6540
tcgccttcta tcgccttctt gacgagttct tctgaattat taacgcttac aatttcctga 6600
tgcggtattt tctccttacg catctgtgcg gtatttcaca ccgcatcagg tggcactttt 6660
cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat 6720
ccgctcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt 6780
tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc 6840
agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc 6900
gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata 6960
ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg 7020
gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc 7080
cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct 7140
acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa 7200
cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt 7260
cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca 7320
ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac 7380
tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca 7440
atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt 7500
tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc 7560
actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca 7620
aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata 7680
ctcatactct tcctttttca atattattga agcatttatc agggttattg tctcatgacc 7740
aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa 7800
ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca 7860
ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta 7920
actggcttca gcagagcgca gataccaaat actgttcttc tagtgtagcc gtagttaggc 7980
caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca 8040
gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta 8100
ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag 8160
cgaacgacct acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt 8220
cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc 8280
acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac 8340
ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac 8400
gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc 8460
tttcctgcgt tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat 8520
accgctcgcc gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaag 8578
<210> 90
<211> 4657
<212> DNA
<213> 人工的
<220>
<223> 载体
<400> 90
ggtttaaacg aattcgccct ttcatctcga gatgctttat tcaggcacgc tacgtgagaa 60
tattctaatg ggatggtctg gccctgagtc tgaagtaacg caggagatga ttgaggatgc 120
cgctcgcaaa gcgaacattc acgaattcat catgtcgttg cctgatggct acgaaacgct 180
cagcggatct aggggatcgt tgctatctgg ggggcaaaag cagcgaattg caattgcaag 240
ggccctgatc agaaatccaa aggtactcct cctcgatgag gccacctcag ctctggattc 300
cgaatctgag aaagtagttc aagcagcact cgacgcagca gcgaagggcc gtactacaat 360
cgccgttgcg catagattat caacaattca gaaagcagat gtcatatatg tgttctcagg 420
agggcgcatc gtggagcagg gcgaccatca gagcctcctt gaactcaatg gatggtacgc 480
tgaattggtg aacttgcaag gtctcggaga gatttgacgt tcatttattt ttggccactg 540
cttgcataca ttatttgatt aaaggcactc attaattgaa atagcatatc gaatttctct 600
agttatggcc cctgagtcac catacattgt ctgattaaag ggactcgtta attgaaatag 660
cacattggat tcctctgatt atgacccctg agtcacctat cctgcataat tcactcgtga 720
cgataatctg tagatatagg gaactgtcgt agtacttgaa gagacagcaa caatctatct 780
ctgggatttc gtgctgattt tgggcttttg ctttgacggg ctatgactga ggtaatgtag 840
accaataata accctcacgc gaattagata tgccctgagg gttagcttgc atcaccttac 900
ccatatgcac actgacttgc attacccgga gcatattccg gtagtcggag ataagcactt 960
tgagatatct taaggtacaa ctcaatacgt tcctccttcc ttgcctcatt ccacctcaca 1020
ttctagaatt caataacttc gtatagcata cattatacga agttattaat taacatcatc 1080
gtcactatac acatcgtcat caactccatg gcgtgaggac ttccgagact gctgggccct 1140
tcgtttcttt aatgcctcaa gagatgactt cgtacccgaa gagacgcctg ttgtaccccg 1200
ttgacgcttg gcggaggggg cttcgtcctc gtcagcaacc cgcgtcatct gcttccttcg 1260
ctgagcaaga taccttctct cctcgtaccg ctgcatctcc tgagctcggt catacaagat 1320
ctaagcttga gacacctcag catgcaccat tccttgcggc ggcggtgctc aacggcctca 1380
acctactact gggctgcttc ctaatgcagg agtcgcataa gggagagcgt cgactattcc 1440
tttgccctcg gacgagtgct ggggcgtcgg tttccactat cggcgagtac ttctacacag 1500
ccatcggtcc agacggccgc gcttctgcgg gcgatttgtg tacgcccgac agtcccggct 1560
ccggatcgga cgattgcgtc gcatcgaccc tgcgcccaag ctgcatcatc gaaattgccg 1620
tcaaccaagc tctgatagag ttggtcaaga ccaatgcgga gcatatacgc ccggagccgc 1680
ggcgatcctg caagctccgg atgcctccgc tcgaagtagc gcgtctgctg ctccatacaa 1740
gccaaccacg gcctccagaa gaagatgttg gcgacctcgt attgggaatc cccgaacatc 1800
gcctcgctcc agtcaatgac cgctgttatg cggccattgt ccgtcaggac attgttggag 1860
ccgaaatccg cgtgcacgag gtgccggact tcggggcagt cctcggccca aagcatcagc 1920
tcatcgagag cctgcgcgac ggacgcactg acggtgtcgt ccatcacagt ttgccagtga 1980
tacacatggg gatcagcaat cgcgcatatg aaatcacgcc atgtagtgta ttgaccgatt 2040
ccttgcggtc cgaatgggcc gaacccgctc gtctggctaa gatcggccgc agcgatcgca 2100
tccatggcct ccgcgaccgg ctgcagaaca gcgggcagtt cggtttcagg caggtcttgc 2160
aacgtgacac cctgtgcacg gcgggagatg caataggtca ggctctcgct aaattcccca 2220
atgtcaagca cttccggaat cgggagcgcg gccgatgcaa agtgccgata aacataacga 2280
tctttgtaga aaccatcggc gcagctattt acccgcagga catatccacg ccctcctaca 2340
tcgaagctga aagcacgaga ttcttcgccc tccgagagct gcatcaggtc ggagacgctg 2400
tcgaactttt cgatcagaaa cttctcgaca gacgtggcgg tgagttcagg ctttttcatt 2460
gtggatgtgt gtggttgtat gtgtgatgtg gtgtgtggag tgaaaatctg tggctggcaa 2520
acgctcttgt atatatacgc acttttgccc gtgatcagtg catgccggat cgaggtgggc 2580
gggaacaacg gtctcgataa tgtacgtaca ctacacacaa agagaagcaa gtgggggcgg 2640
gactgcacgg cacttgtgag acacctcagc atgccggatc gaggtgggcg ggaacaacgg 2700
tctcgataat gtacgtacac tacacacaaa gagaagcaag tgggggcggg actgcacggc 2760
acttgtgaga cacctcagca tgccggatcg aggtgggcgg gaacaacggt ctcgataatg 2820
tacgtacact acacacaaag agaagcaagt gggggcggga ctgcacggca cttgtgagac 2880
acctcagcat gccggatcga ggtgggcggg aacaacggtc tcgataatgt acgtacacta 2940
cacacaaaga gaagcaagtg ggggcgggac tgcacggcac ttgtgagaca cctcagcatg 3000
caccattcct tgcggcggcg gtgctcaacg gcctggatcc acaggacggg tgtggtcgcc 3060
atgatcgcgt agtcgatagt ggctccaagt agcgaagcga gcaggactgg gcggcggcca 3120
aagcggtcgg acagtgctcc gagaacgggt gcgcatagag atgtggagta tccgaatttc 3180
tccaggctgt caagcggcaa ttataaccga gactgagatc gagaagtata taaccgcagc 3240
agtagtggat aaataattgc gaagtcttcc cagcagagcg ggctgttttt tggagttggt 3300
tactgtaaaa tgctaaaatg actgacaaca atggagcgtc tacagcattg gcaacagtgg 3360
gaacagtatg ctggtgcatc cagttgatac cccaggttct gcgaaactgg tatgttcggg 3420
attgcgaggg cgttcctcct ctgatgttct ttttgttcgc cgtttcgggg attcccttcg 3480
cagtgtactt cattgatcag aattcgaaca ctgccatcat ggttcaacct cacttgttta 3540
ctttctttag ccttataggc ttttggcaaa gcctgtacta tccgcccgtc agttaattaa 3600
taacttcgta tagcatacat tatacgaagt tattaggtaa actaaattca tgacagcctt 3660
ttcttctttc tttccacaaa acaattaaaa aaaataacag aattagaaga aggtaaatat 3720
attggcaaac tcctctcttc cttttactta tttttttgaa agttgcagtg tgtgtgtgtg 3780
ttgttgtttg ttcaaattaa tttgatggtt gttgtattgt aaatttcaat caataaaaac 3840
aaagacataa ataaaaaaaa ccctacctct cttccctgat ctgatttgat cgtacgattc 3900
taagaactca ccgctaaggc cggccctttg acaggtatat cttcagtttc ctcgtcactc 3960
ttggtcaaaa gaccaaagtc atggctggcg atttcctcga tgctttcctc aagaattttc 4020
aaggagttgt ggctttccaa ctccatttga accttcttcg aggcttcgtg gaatttcgga 4080
tttccaatta tcgaatcaac agcttctttg atttgctcca ctgtaggcaa gccagttttc 4140
aaatcaattg ccacgccagc ggcctcagct ctcgatgcca ccattggctt gtcttcagag 4200
tcaccagcaa taacaactgg aacagagtgg cttaagctgt gctgaagtcc gccatatcca 4260
ccattgtaga caagagcatc aacgtgagga agtagagcat cgtagttgaa gtagtcgatc 4320
acgcgagcat tctcaggaac cacaacatca tccggtagct tggcaccgcg gcggcccaat 4380
atggctactg ttaaagtgtc aggctcgtcc ttcaaggcct caagagtagg cacaataaga 4440
tgcttgtaac tgacagcaaa agttccttga gtgaccatga tgactcgctt ggcactcaga 4500
acatcccccc accaggaagg aggggtgaat tgagttcggt gcttgggcgt tgagccggcg 4560
aatttgaagt tgctaggcag atggtctctg ctgaactcaa gagaaggcgg gcacagctgc 4620
aggaacttgt ctgcaggtac ctcaagggcg aattcgc 4657

Claims (14)

1.形成槐糖脂的细胞,其已如此进行了遗传工程改变,使得所述细胞相较于它的野生型而言具有一种经改变的如分别在以下列举的选自下列所述组的至少一种酶的活性:
至少一种酶E1,该酶具有多肽序列Seq ID Nr. 7、Seq ID Nr. 53、Seq ID Nr. 55、Seq ID Nr. 57、Seq ID Nr. 59、Seq ID Nr. 61或Seq ID Nr. 63,或具有这样的多肽序列:其中相对于Seq ID Nr. 7、Seq ID Nr. 53、Seq ID Nr. 55、Seq ID Nr. 57、Seq ID Nr. 59、Seq ID Nr. 61或Seq ID Nr. 63,最多25%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有带有各参照序列Seq ID Nr. 7、Seq ID Nr. 53、Seq ID Nr. 55、Seq ID Nr. 57、Seq ID Nr. 59、Seq ID Nr. 61或Seq ID Nr. 63的酶的至少50%的酶的活性,其中将酶E1的酶的活性理解为将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的能力,
至少一种酶E2,该酶具有多肽序列Seq ID Nr. 8或Seq ID Nr. 11,或具有这样的多肽序列:其中相对于Seq ID Nr. 8或Seq ID Nr. 11,最多60%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有带有各参照序列Seq ID Nr. 8或Seq ID Nr. 11的酶的至少50%的酶的活性,其中将酶E2的酶的活性理解为将UDP-葡萄糖和17-羟基-Z-9-十八碳烯酸转化成17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸的能力,
至少一种酶E3,该酶具有多肽序列Seq ID Nr. 11,或具有这样的多肽序列:其中相对于Seq ID Nr. 11,最多60%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有带有参照序列Seq ID Nr. 11的酶的至少50%的酶的活性,其中将酶E3的酶的活性理解为将17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸和UDP-葡萄糖转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸的能力,
至少一种酶E4,该酶具有多肽序列Seq ID Nr. 9,或具有这样的多肽序列:其中相对于Seq ID Nr. 9,最多50%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有带有Seq ID Nr. 9的酶的至少50%的酶的活性,其中将酶E4的酶的活性理解为将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯单乙酸酯,或将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯单乙酸酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯二乙酸酯,或将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯二乙酸酯的能力,
至少一种酶E5,该酶具有多肽序列Seq ID Nr. 10,或具有这样的多肽序列:其中相对于Seq ID Nr. 10,最多45%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有带有Seq ID Nr. 10的酶的至少50%的酶的活性,其中将酶E5的酶活性理解为将槐糖脂从细胞转移到周围的培养基中的能力。
2.根据权利要求1的细胞,其特征在于,在其β-氧化中所述细胞被至少部分地阻断。
3.根据权利要求1或2的细胞,其特征在于,所述改变的活性是增加的活性。
4.根据权利要求3的细胞,其特征在于,所述细胞具有增加的下述酶的组合的活性:
E1E2、E1E3、E1E4、E1E5、E2E3、E2E4、E2E5、E3E4、E3E5、E4E5、E1E2E3、E1E2E4、E1E2E5、E1E3E4、E1E3E5、E1E4E5、E2E3E4、E2E4E5、E3E4E5、E1E2E3E4、E2E3E4E5、E1E3E4E5、E1E2E4E5、E1E2E3E5、E1E2E3E4和E1E2E3E4E5
5.根据权利要求1或2的细胞,其特征在于,所述细胞具有减少的酶E3的活性和任选增加的下述酶的组合的活性:
E1E2、E1E4、E1E5、E2E4、E2E5、E4E5、E1E2E4、E1E2E5、E1E4E5和E1E2E4E5
6.根据权利要求1或2的细胞,其特征在于,所述细胞具有减少的酶E4的活性和任选增加的下述酶的组合的活性:
E1E2、E1E3、E1E5、E2E3、E2E5、E3E5、E1E2E3、E1E2E5、E1E3E5和E1E2E3E5
7.根据权利要求1或2的细胞,其特征在于,所述细胞具有减少的酶E3和E4的活性和任选增加的下述酶的组合的活性:
E1E2、E1E5、E2E5、E1E2E5
8.根据权利要求1-7的至少一项的细胞,其特征在于,将所述细胞用根据权利要求10或11的至少一种核酸转化。
9.用于生产槐糖脂的方法,所述方法包括下述工艺步骤:
I) 使根据权利要求1-8的至少一项的细胞与包含碳源的培养基接触,
II) 在使所述细胞能够从所述碳源形成槐糖脂的条件下,培养所述细胞,且
III) 任选分离形成的槐糖脂。
10.用根据权利要求9的方法得到的槐糖脂制备化妆品制剂、皮肤病学制剂或药物制剂、作物保护制剂和护理剂及清洁剂和表面活性剂浓缩物的用途。
11.分离的DNA,所述DNA选自下述序列:
A) 根据Seq ID Nr. 2、Seq ID Nr. 3、Seq ID Nr. 4、Seq ID Nr. 5、Seq ID Nr. 6、Seq ID Nr. 52、Seq ID Nr. 54、Seq ID Nr. 56、Seq ID Nr. 58、Seq ID Nr. 60或Seq ID Nr. 62的序列,
其中根据Seq ID Nr. 2、Seq ID Nr. 52、Seq ID Nr. 54、Seq ID Nr. 56、Seq ID Nr. 58、Seq ID Nr. 60或Seq ID Nr. 62的序列编码能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的蛋白质,
其中所述序列Seq ID Nr. 3编码能够将UDP-葡萄糖和17-羟基-Z-9-十八碳烯酸转化成17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸的蛋白质,
其中所述序列Seq ID Nr. 4编码能够将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯单乙酸酯,或将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯单乙酸酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯二乙酸酯,或将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯二乙酸酯的蛋白质,
其中所述序列Seq ID Nr. 5编码能够将槐糖脂从细胞转移到周围的培养基中的蛋白质,
其中所述序列Seq ID Nr. 6编码能够将UDP-葡萄糖和17-羟基-Z-9-十八碳烯酸转化成17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸,或将17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸和UDP-葡萄糖转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸的蛋白质,
B) 一种不含内含子的序列,其源自根据A)的序列,且其与根据Seq ID Nr. 2、Seq ID Nr. 3、Seq ID Nr. 4、Seq ID Nr. 5、Seq ID Nr. 6、Seq ID Nr. 52、Seq ID Nr. 54、Seq ID Nr. 56、Seq ID Nr. 58、Seq ID Nr. 60或Seq ID Nr. 62的序列编码相同的蛋白质或肽,
C) 一种序列,其编码蛋白质或肽,所述蛋白质或肽包含根据Seq ID Nr. 7、Seq ID Nr. 8、Seq ID Nr. 9、Seq ID Nr. 10、Seq ID Nr. 11、Seq ID Nr. 53、Seq ID Nr. 55、Seq ID Nr. 57、Seq ID Nr. 59、Seq ID Nr. 61或Seq ID Nr. 63的氨基酸序列, 其中所述包含根据Seq ID Nr. 7、Seq ID Nr. 53、Seq ID Nr. 55、Seq ID Nr. 57、Seq ID Nr. 59、Seq ID Nr. 61或Seq ID Nr. 63的氨基酸序列的蛋白质或肽能够将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸,
D) 一种序列,其与根据组A)至C)中的任一组的序列至少80%相同,
E) 一种序列,其与根据组A)至D)中的任一组的序列的反链杂交,或在考虑遗传密码的简并性的情况下与所述反链杂交,
F) 根据组A)至E)中的任一组的序列的衍生物,其通过置换、添加、反转和/或删除一个或多个碱基而得到,和
G) 与根据组A)至F)中的任一组的序列互补的序列。
12.载体,其包含如在权利要求11中定义的、根据组A)至G)中的任一组的DNA序列。
13.根据权利要求12的载体用于转化细胞的用途。
14.分离的多肽,其选自:
酶E1,该酶具有多肽序列Seq ID Nr. 7、Seq ID Nr. 53、Seq ID Nr. 55、Seq ID Nr. 57、Seq ID Nr. 59、Seq ID Nr. 61或Seq ID Nr. 63、特别是Seq ID Nr. 7,或具有这样的多肽序列:其中相对于各个参照序列Seq ID Nr. 7、Seq ID Nr. 53、Seq ID Nr. 55、Seq ID Nr. 57、Seq ID Nr. 59、Seq ID Nr. 61或Seq ID Nr. 63、特别是Seq ID Nr. 7,最多25%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有带有各个参照序列的酶的至少50%的酶的活性,其中将酶E1的酶的活性理解为将Z-9-十八碳烯酸转化成17-羟基-Z-9-十八碳烯酸的能力,
酶E2,该酶具有多肽序列Seq ID Nr. 8或Seq ID Nr. 11,或具有这样的多肽序列:其中相对于Seq ID Nr. 8或Seq ID Nr. 11,最多60%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有带有各个参照序列Seq ID Nr. 8或Seq ID Nr. 11的酶的至少50%的酶的活性,其中将酶E2的酶的活性理解为将UDP-葡萄糖和17-羟基-Z-9-十八碳烯酸转化成17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸的能力,
酶E3,该酶具有多肽序列Seq ID Nr. 11,或具有这样的多肽序列:其中相对于Seq ID Nr. 11,最多60%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有带有参照序列Seq ID Nr. 11的酶的至少50%的酶的活性,其中将酶E3的酶的活性理解为将17-(β-D-吡喃葡萄糖基氧基)-Z-9-十八碳烯酸和UDP-葡萄糖转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸的能力,
酶E4,该酶具有多肽序列Seq ID Nr. 9,或具有这样的多肽序列:其中相对于Seq ID Nr. 9,最多50%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有带有Seq ID Nr. 9的酶的至少50%的酶的活性,其中将酶E4的酶的活性理解为将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯单乙酸酯,或将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯单乙酸酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯二乙酸酯,或将17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸1',4''-内酯和乙酰基-辅酶A转化成17-L-[(2'-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基)氧基]-Z-9-十八碳烯酸-1',4''-内酯二乙酸酯的能力,和
酶E5,该酶具有多肽序列Seq ID Nr. 10,或具有这样的多肽序列:其中相对于Seq ID Nr. 10,最多45%的氨基酸残基通过删除、插入、置换或这些的组合被改变,且该酶还具有带有Seq ID Nr. 10的酶的至少50%的酶的活性,其中将酶E5的酶的活性理解为将槐糖脂从细胞转移到周围的培养基中的能力。
CN201080061656.4A 2009-11-18 2010-10-19 细胞、核酸、酶和它们用于生产槐糖脂的用途以及方法 Active CN102695796B (zh)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
DE102009046799 2009-11-18
DE102009046799.8 2009-11-18
DE102010014680.3 2010-04-12
DE102010014680A DE102010014680A1 (de) 2009-11-18 2010-04-12 Zellen, Nukleinsäuren, Enzyme und deren Verwendung sowie Verfahren zur Herstellung von Sophorolipiden
PCT/EP2010/065713 WO2011061032A2 (de) 2009-11-18 2010-10-19 Zellen, nukleinsäuren, enzyme und deren verwendung sowie verfahren zur herstellung von sophorolipiden

Publications (2)

Publication Number Publication Date
CN102695796A true CN102695796A (zh) 2012-09-26
CN102695796B CN102695796B (zh) 2017-06-13

Family

ID=44060100

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201080061656.4A Active CN102695796B (zh) 2009-11-18 2010-10-19 细胞、核酸、酶和它们用于生产槐糖脂的用途以及方法

Country Status (7)

Country Link
US (5) US8911982B2 (zh)
EP (1) EP2501813B1 (zh)
JP (1) JP5936548B2 (zh)
CN (1) CN102695796B (zh)
BR (1) BR112012012008A2 (zh)
DE (1) DE102010014680A1 (zh)
WO (1) WO2011061032A2 (zh)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103275139A (zh) * 2013-06-17 2013-09-04 山东大学 十六碳双乙酰化一个双键内酯型槐糖脂及其应用
CN103275140A (zh) * 2013-06-17 2013-09-04 山东大学 十八碳双乙酰化一个双键内酯型槐糖脂及其应用
CN113966378A (zh) * 2019-06-11 2022-01-21 莎罗雅株式会社 褐变被抑制的含酸型槐糖脂的组合物
CN116555269A (zh) * 2023-06-28 2023-08-08 百葵锐(深圳)生物科技有限公司 熊蜂生假丝酵母诱导型启动子及其应用

Families Citing this family (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102006025821A1 (de) * 2006-06-02 2007-12-06 Degussa Gmbh Ein Enzym zur Herstellung von Mehylmalonatsemialdehyd oder Malonatsemialdehyd
DE102007060705A1 (de) * 2007-12-17 2009-06-18 Evonik Degussa Gmbh ω-Aminocarbonsäuren oder ihre Lactame, herstellende, rekombinante Zellen
DE102008002715A1 (de) * 2008-06-27 2009-12-31 Evonik Röhm Gmbh 2-Hydroxyisobuttersäure produzierende rekombinante Zelle
DE102009009580A1 (de) 2009-02-19 2010-08-26 Evonik Degussa Gmbh Verfahren zur Herstellung freier Säuren aus ihren Salzen
DE102009046623A1 (de) 2009-11-11 2011-05-12 Evonik Röhm Gmbh Verwendung eines zu einem MeaB-Protein homologen Proteins zur Erhöhung der enzymatischen Aktivität einer 3-Hydroxycarbonsäure-CoA-Mutase
DE102009046626A1 (de) 2009-11-11 2011-05-12 Evonik Degussa Gmbh Candida tropicalis Zellen und deren Verwendung
DE102010014680A1 (de) 2009-11-18 2011-08-18 Evonik Degussa GmbH, 45128 Zellen, Nukleinsäuren, Enzyme und deren Verwendung sowie Verfahren zur Herstellung von Sophorolipiden
GB0921691D0 (en) * 2009-12-11 2010-01-27 Univ Gent Sophorolpid transporter protien
DE102010015807A1 (de) 2010-04-20 2011-10-20 Evonik Degussa Gmbh Biokatalytisches Oxidationsverfahren mit alkL-Genprodukt
GB201009882D0 (en) 2010-06-11 2010-07-21 Univ Gent Yeast strains modified in their sophorolipid production and uses thereof
WO2012080116A1 (en) * 2010-12-15 2012-06-21 Universiteit Gent Producing unacetylated sophorolipids by fermentation
DE102012102875B4 (de) * 2011-04-04 2024-04-18 Wisconsin Alumni Research Foundation Vorläuferauswahl mit einem Artificial-Intelligence-Algorithmus erhöht Abdeckung und Reproduzierbarkeit von proteomischen Proben
US8691231B2 (en) 2011-06-03 2014-04-08 Merrimack Pharmaceuticals, Inc. Methods of treatment of tumors expressing predominantly high affinity EGFR ligands or tumors expressing predominantly low affinity EGFR ligands with monoclonal and oligoclonal anti-EGFR antibodies
RU2014106109A (ru) 2011-07-20 2015-08-27 Эвоник Дегусса Гмбх Окисление и аминирование первичных спиртов
EP2602328A1 (de) 2011-12-05 2013-06-12 Evonik Industries AG Verfahren zur Oxidation von Alkanen unter Verwendung einer AlkB Alkan 1-Monooxygenase
EP2607479A1 (en) 2011-12-22 2013-06-26 Evonik Industries AG Biotechnological production of alcohols and derivatives thereof
EP2631298A1 (en) 2012-02-22 2013-08-28 Evonik Industries AG Biotechnological method for producing butanol and butyric acid
EP2821495B1 (en) * 2012-03-02 2016-10-05 Saraya Co., Ltd. High-purity acid-form sophorolipid (sl) containing composition and process for preparing same
EP2639308A1 (de) 2012-03-12 2013-09-18 Evonik Industries AG Enzymatische omega-Oxidation und -Aminierung von Fettsäuren
EP2647696A1 (de) 2012-04-02 2013-10-09 Evonik Degussa GmbH Verfahren zur aeroben Herstellung von Alanin oder einer unter Verbrauch von Alanin entstehenden Verbindung
CA2886893C (en) 2012-09-29 2019-05-07 Yong Wang Method for producing stevioside compounds by microorganism
JP6053124B2 (ja) * 2012-10-18 2016-12-27 花王株式会社 糖脂質の製造方法
EP2746400A1 (de) 2012-12-21 2014-06-25 Evonik Industries AG Herstellung von Aminen und Diaminen aus einer Carbonsäure oder Dicarbonsäure oder eines Monoesters davon
DE102013205755A1 (de) 2013-04-02 2014-10-02 Evonik Industries Ag Waschmittelformulierung für Textilien enthaltend Rhamnolipide mit einem überwiegenden Gehalt an di-Rhamnolipiden
DE102013205756A1 (de) 2013-04-02 2014-10-02 Evonik Industries Ag Mischungszusammensetzung enthaltend Rhamnolipide
US10752650B2 (en) 2013-08-09 2020-08-25 Saraya Co., Ltd. Sophorolipid compound and composition comprising same
US9650658B2 (en) 2013-08-26 2017-05-16 Universiteit Gent Methods to produce bolaamphiphilic glycolipids
EP3042940B1 (en) * 2013-09-04 2021-07-21 Saraya Co., Ltd. Low-toxicity sophorolipid-containing composition and use therefor
JP6389400B2 (ja) * 2013-11-21 2018-09-12 花王株式会社 アセチル化スフィンゴイドの製造方法
JP6278555B2 (ja) * 2013-12-26 2018-02-14 花王株式会社 脂肪族ジオールの製造方法
JP6323940B2 (ja) * 2013-12-26 2018-05-16 花王株式会社 アルキルポリグリコシドの製造方法
EP3117838B1 (en) 2014-03-10 2020-09-16 Saraya Co., Ltd. Composition containing sophorolipid, physiologically active substance and oil and fat, and method of producing said composition
EP2949214A1 (en) 2014-05-26 2015-12-02 Evonik Degussa GmbH Methods of producing rhamnolipids
EP3002328A1 (de) 2014-09-30 2016-04-06 Evonik Degussa GmbH Biotensidhaltige Formulierung
CA2937594A1 (en) 2015-02-26 2016-08-26 Evonik Degussa Gmbh Alkene production
JP6157524B2 (ja) * 2015-03-04 2017-07-05 サラヤ株式会社 低毒性ソホロリピッド含有組成物及びその用途
EP3070155A1 (de) 2015-03-18 2016-09-21 Evonik Degussa GmbH Zusammensetzung enthaltend peptidase und biotensid
US10590428B2 (en) 2015-07-22 2020-03-17 Kao Corporation Sophorolipid highly-productive mutant strain
JP2016160264A (ja) * 2016-01-04 2016-09-05 サラヤ株式会社 低毒性ソホロリピッド含有組成物及びその用途
WO2017157659A1 (de) 2016-03-18 2017-09-21 Evonik Degussa Gmbh Granulat umfassend einen anorganischen, festen träger, auf dem mindestens ein biotensid enthalten ist
WO2018065314A1 (de) 2016-10-07 2018-04-12 Evonik Degussa Gmbh Zusammensetzung enthaltend glykolipide und konservierungsmittel
EP3529259A1 (en) 2016-10-24 2019-08-28 Evonik Degussa GmbH Rhamnolipid-producing cell having reduced glucose dehydrogenase activity
US11464717B2 (en) 2017-02-10 2022-10-11 Evonik Operations Gmbh Oral care composition containing at least one biosurfactant and fluoride
ES2973974T3 (es) 2017-03-07 2024-06-25 Saraya Co Ltd Composición de detergente
EP3749679A1 (en) 2018-02-09 2020-12-16 Evonik Operations GmbH Mixture composition comprising glucolipids
EP4117616A1 (en) 2020-03-11 2023-01-18 Evonik Operations GmbH Mixture composition comprising glycolipids and triethyl citrate
JP2023525870A (ja) * 2020-05-13 2023-06-19 アンフィスター オメガ-グリコシド及びアルキルグリコシドの効率的な合成
KR102691533B1 (ko) * 2021-09-03 2024-08-05 대한민국 신종 리조스파에라(Rhizosphaera) sp. JAF-11 효모로부터 분리된 생물학적 계면활성제
WO2023198511A1 (en) 2022-04-13 2023-10-19 Evonik Operations Gmbh Process for the fermentative production of a biosurfactant
CN114891649B (zh) * 2022-06-07 2024-05-24 天津大学 复合菌及其在降解长链烷烃中的应用

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US601494A (en) 1898-03-29 Spar for vessels
NL301993A (zh) 1962-12-18
US4601893A (en) 1984-02-08 1986-07-22 Pfizer Inc. Laminate device for controlled and prolonged release of substances to an ambient environment and method of use
US5254466A (en) 1989-11-06 1993-10-19 Henkel Research Corporation Site-specific modification of the candida tropicals genome
DE4027453A1 (de) 1990-08-30 1992-03-05 Degussa Neue plasmide aus corynebacterium glutamicum und davon abgeleitete plasmidvektoren
DE4440118C1 (de) 1994-11-11 1995-11-09 Forschungszentrum Juelich Gmbh Die Genexpression in coryneformen Bakterien regulierende DNA
DK0839211T3 (da) 1995-07-13 2001-09-03 Basf Ag Fremgangsmåde til fremstilling af riboflavin ved hjælp af mikroorganismer med ændret isocitratlyaseaktivitet
ID21487A (id) 1996-11-13 1999-06-17 Du Pont Metoda pembuatan 1,3 - propandiol dengan organisme rekombinan
JPH10229891A (ja) 1997-02-20 1998-09-02 Mitsubishi Rayon Co Ltd マロン酸誘導体の製造法
AU2003247411A1 (en) 2002-05-23 2003-12-12 Cognis Corporation NON-REVERTIBLE Beta-OXIDATION BLOCKED CANDIDA TROPICALIS
DE102006025821A1 (de) 2006-06-02 2007-12-06 Degussa Gmbh Ein Enzym zur Herstellung von Mehylmalonatsemialdehyd oder Malonatsemialdehyd
DE102007005072A1 (de) 2007-02-01 2008-08-07 Evonik Degussa Gmbh Verfahren zur fermentativen Herstellung von Cadaverin
DE102007015583A1 (de) 2007-03-29 2008-10-02 Albert-Ludwigs-Universität Freiburg Ein Enzym zur Herstellung von Methylmalonyl-Coenzym A oder Ethylmalonyl-Coenzym A sowie dessen Verwendung
DE102007060705A1 (de) 2007-12-17 2009-06-18 Evonik Degussa Gmbh ω-Aminocarbonsäuren oder ihre Lactame, herstellende, rekombinante Zellen
DE102008002715A1 (de) 2008-06-27 2009-12-31 Evonik Röhm Gmbh 2-Hydroxyisobuttersäure produzierende rekombinante Zelle
DE102008040193A1 (de) 2008-07-04 2010-01-07 Evonik Röhm Gmbh Verfahren zur Herstellung freier Carbonsäuren
DE102009009580A1 (de) 2009-02-19 2010-08-26 Evonik Degussa Gmbh Verfahren zur Herstellung freier Säuren aus ihren Salzen
DE102009046626A1 (de) 2009-11-11 2011-05-12 Evonik Degussa Gmbh Candida tropicalis Zellen und deren Verwendung
DE102009046623A1 (de) 2009-11-11 2011-05-12 Evonik Röhm Gmbh Verwendung eines zu einem MeaB-Protein homologen Proteins zur Erhöhung der enzymatischen Aktivität einer 3-Hydroxycarbonsäure-CoA-Mutase
DE102010014680A1 (de) 2009-11-18 2011-08-18 Evonik Degussa GmbH, 45128 Zellen, Nukleinsäuren, Enzyme und deren Verwendung sowie Verfahren zur Herstellung von Sophorolipiden
GB0921691D0 (en) * 2009-12-11 2010-01-27 Univ Gent Sophorolpid transporter protien
DE102010002809A1 (de) 2010-03-12 2011-11-17 Evonik Degussa Gmbh Verfahren zur Herstellung von linearen alpha,omega-Dicarbonsäurediestern
DE102010015807A1 (de) 2010-04-20 2011-10-20 Evonik Degussa Gmbh Biokatalytisches Oxidationsverfahren mit alkL-Genprodukt
GB201009882D0 (en) * 2010-06-11 2010-07-21 Univ Gent Yeast strains modified in their sophorolipid production and uses thereof
DE102010026196A1 (de) 2010-06-25 2011-12-29 Evonik Degussa Gmbh Synthese von omega-Aminocarbonsäuren und deren Estern aus ungesättigten Fettsäurederivaten
DE102010032484A1 (de) 2010-07-28 2012-02-02 Evonik Goldschmidt Gmbh Zellen und Verfahren zur Herstellung von Rhamnolipiden

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
INGE N. A. VAN BOGAERT ET AL.: "Microbial production and application of sophorolipids", 《APPL MICROBIOL BIOTECHNOL》 *
INGE N.A. VAN BOGAERT ET AL.: "Importance of the cytochromeP450monooxygenaseCYP52 family for the sophorolipid-producing yeastCandida bombicola", 《FEMS YEAST RES》 *
INGE N.A. VAN BOGAERT ET AL.: "Knocking outtheMFE-2 gene ofCandida bombicola leads to improvedmedium-chain sophorolipid production", 《FEMS YEAST RES》 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103275139A (zh) * 2013-06-17 2013-09-04 山东大学 十六碳双乙酰化一个双键内酯型槐糖脂及其应用
CN103275140A (zh) * 2013-06-17 2013-09-04 山东大学 十八碳双乙酰化一个双键内酯型槐糖脂及其应用
CN103275139B (zh) * 2013-06-17 2015-12-23 山东大学 十六碳双乙酰化一个双键内酯型槐糖脂及其应用
CN103275140B (zh) * 2013-06-17 2016-05-25 山东大学 十八碳双乙酰化一个双键内酯型槐糖脂及其应用
CN113966378A (zh) * 2019-06-11 2022-01-21 莎罗雅株式会社 褐变被抑制的含酸型槐糖脂的组合物
CN113966378B (zh) * 2019-06-11 2024-03-26 莎罗雅株式会社 褐变被抑制的含酸型槐糖脂的组合物
CN116555269A (zh) * 2023-06-28 2023-08-08 百葵锐(深圳)生物科技有限公司 熊蜂生假丝酵母诱导型启动子及其应用
CN116555269B (zh) * 2023-06-28 2023-09-15 百葵锐(深圳)生物科技有限公司 熊蜂生假丝酵母诱导型启动子及其应用

Also Published As

Publication number Publication date
CN102695796B (zh) 2017-06-13
US20150056661A1 (en) 2015-02-26
JP5936548B2 (ja) 2016-06-22
BR112012012008A2 (pt) 2016-11-29
WO2011061032A3 (de) 2011-09-15
US9068211B2 (en) 2015-06-30
US20150056658A1 (en) 2015-02-26
WO2011061032A2 (de) 2011-05-26
US20150056660A1 (en) 2015-02-26
US20150056659A1 (en) 2015-02-26
EP2501813A2 (de) 2012-09-26
US9102968B2 (en) 2015-08-11
DE102010014680A1 (de) 2011-08-18
EP2501813B1 (de) 2018-02-21
JP2013511266A (ja) 2013-04-04
US8911982B2 (en) 2014-12-16
US9157108B2 (en) 2015-10-13
US20130035403A1 (en) 2013-02-07
US9085787B2 (en) 2015-07-21

Similar Documents

Publication Publication Date Title
CN102695796B (zh) 细胞、核酸、酶和它们用于生产槐糖脂的用途以及方法
CN107002020B (zh) 使用rna引导的内切核酸酶在非常规酵母中基因靶向
CN101365788B (zh) Δ-9延伸酶及其在制备多不饱和脂肪酸中的用途
KR102628801B1 (ko) 세포내 유전자 변형 및 증가된 상동 재조합을 위한 보호 dna 주형 및 이용 방법
DK2087105T3 (da) Delta 17-desaturase og anvendelse heraf ved fremstilling af flerumættede fedtsyrer
DK2927316T3 (en) Total fermentation of oligosaccharides
DK2576605T3 (en) PREPARATION OF METABOLITES
DK2087106T3 (en) MUTATING DELTA8 DESATURATION GENES CONSTRUCTED BY TARGETED MUTAGENES AND USE THEREOF IN THE MANUFACTURE OF MULTI-Saturated FAT ACIDS
CN101563356B (zh) 高山被孢霉c16/18脂肪酸延伸酶
CN108779480A (zh) 生产鞘氨醇碱和鞘脂类的方法
KR20230165368A (ko) Cpf1 또는 csm1을 사용하여 게놈을 변형하기 위한 조성물 및 방법
DK2324119T3 (en) Mutant DELTA5 Desaturases AND USE THEREOF FOR THE PRODUCTION OF polyunsaturated fatty acids
KR20130138760A (ko) 고농도의 에이코사펜타엔산 생성을 위한 재조합 미생물 숙주 세포
CN106687578B (zh) 螺旋藻中的靶向诱变
KR20110038087A (ko) 재생가능 자원으로부터의 이소프렌 중합체
KR20070085669A (ko) 고농도의 아라키돈산을 생성하는 야로위아 리폴리티카 균주
KR20100118973A (ko) 이소프렌을 생성하기 위한 조성물 및 방법
CN109843909B (zh) 利用替代的葡萄糖转运蛋白产生鼠李糖脂的细胞和方法
CN111465689B (zh) Cas9变体和使用方法
KR20210006966A (ko) 조작된 캐스케이드 구성성분 및 캐스케이드 복합체
CN106661573B (zh) 多核苷酸文库的重组酶介导的整合
CN109996874A (zh) 10-甲基硬脂酸的异源性产生
AU2022402777A1 (en) C2c9 nuclease-based novel genome editing system and application thereof
CN115698297A (zh) 多模块生物合成酶基因组合文库的制备方法
DK2935601T3 (en) RECOMBINANT MICROBELL CELLS PRODUCING AT LEAST 28% EICOSAPENTAIC ACID AS DRY WEIGHT

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP01 Change in the name or title of a patent holder

Address after: Essen, Germany

Patentee after: Evonik Operations Limited

Address before: Essen, Germany

Patentee before: EVONIK DEGUSSA GmbH

CP01 Change in the name or title of a patent holder