CN109661403A - 前导序列修饰的葡糖淀粉酶多肽和具有增强的生物产物产生的工程化的酵母菌株 - Google Patents

前导序列修饰的葡糖淀粉酶多肽和具有增强的生物产物产生的工程化的酵母菌株 Download PDF

Info

Publication number
CN109661403A
CN109661403A CN201780048781.3A CN201780048781A CN109661403A CN 109661403 A CN109661403 A CN 109661403A CN 201780048781 A CN201780048781 A CN 201780048781A CN 109661403 A CN109661403 A CN 109661403A
Authority
CN
China
Prior art keywords
seq
amino acid
glucoamylase
bigger
sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201780048781.3A
Other languages
English (en)
Inventor
克里斯托弗·K·米勒
格雷戈里·迈克尔·波因特
阿米特·瓦斯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Cargill Inc
Original Assignee
Cargill Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Cargill Inc filed Critical Cargill Inc
Publication of CN109661403A publication Critical patent/CN109661403A/zh
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/02Monosaccharides
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/37Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi
    • C07K14/39Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi from yeasts
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/24Hydrolases (3) acting on glycosyl compounds (3.2)
    • C12N9/2402Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
    • C12N9/2405Glucanases
    • C12N9/2408Glucanases acting on alpha -1,4-glucosidic bonds
    • C12N9/2411Amylases
    • C12N9/2428Glucan 1,4-alpha-glucosidase (3.2.1.3), i.e. glucoamylase
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/37Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi
    • C07K14/39Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi from yeasts
    • C07K14/395Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi from yeasts from Saccharomyces
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/37Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi
    • C07K14/39Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi from yeasts
    • C07K14/40Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi from yeasts from Candida
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P13/00Preparation of nitrogen-containing organic compounds
    • C12P13/04Alpha- or beta- amino acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P17/00Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
    • C12P17/02Oxygen as only ring hetero atoms
    • C12P17/04Oxygen as only ring hetero atoms containing a five-membered hetero ring, e.g. griseofulvin, vitamin C
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/14Preparation of compounds containing saccharide radicals produced by the action of a carbohydrase (EC 3.2.x), e.g. by alpha-amylase, e.g. by cellulase, hemicellulase
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P7/00Preparation of oxygen-containing organic compounds
    • C12P7/02Preparation of oxygen-containing organic compounds containing a hydroxy group
    • C12P7/04Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
    • C12P7/06Ethanol, i.e. non-beverage
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y302/00Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
    • C12Y302/01Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
    • C12Y302/01003Glucan 1,4-alpha-glucosidase (3.2.1.3), i.e. glucoamylase
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02EREDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
    • Y02E50/00Technologies for the production of fuel of non-fossil origin
    • Y02E50/10Biofuels, e.g. bio-diesel

Abstract

本发明涉及能够将显著量的葡糖淀粉酶分泌到发酵培养基中的非天然酵母。所述葡糖淀粉酶可以促进淀粉材料的降解,从而产生葡萄糖以用于发酵成所期望的生物产物,如乙醇。所述葡糖淀粉酶可以具有分泌信号的葡糖淀粉酶融合蛋白的形式提供,所述分泌信号衍生自SEQ ID NO:73的至少AA 1‑19;(ii)SEQ ID NO:74的至少AA 1‑19的氨基酸序列;(iii)SEQ ID NO:77(An aa);(iv)SEQ ID NO:75(Sc IV);(v)SEQ ID NO:76(Gg LZ);或(vi)SEQ ID NO:78(Hs SA)。

Description

前导序列修饰的葡糖淀粉酶多肽和具有增强的生物产物产生 的工程化的酵母菌株
相关申请的交叉引用
本申请要求2016年8月05日提交并且名称为“前导序列修饰的葡糖淀粉酶多肽和具有增强的生物产物产生的工程化的酵母菌株(LEADER-MODIFIED GLUCOAMYLASEPOLYPEPTIDES AND ENGINEERED YEAST STRAIN HAVING ENHANCED BIOPRODUCTPRODUCTION)”的美国临时专利申请序列号62/371,681的权益,该申请在此以引用的方式整体并入本文。
序列表
在2017年8月04日创建的名称为“N00485_ST25.txt”并且具有440千字节的大小的ASCII文本文件的全部内容以引用的方式整体并入本文。
技术领域
本发明涉及修饰的葡糖淀粉酶(GA)、表达这些酶的微生物以及用于生产乙醇的发酵方法。
背景技术
通过发酵生产乙醇是公知的工业过程。然而,增加乙醇收率在技术上可能是困难的。有各种因素使得微生物在被设计用于增加乙醇产量的发酵条件下生长具有挑战性。举例来说,发酵培养基可以具有更高的底物浓度以促进乙醇产生,但是这些条件可能对细胞生长具有负面影响。此外,乙醇浓度增加和不期望的副产物的积累也可能对细胞健康有害。已经针对对这些条件的耐受性来选择酵母菌株,这可以引起乙醇收率提高。具体来说,酵母酿酒酵母(Saccharomyces cerevisiae)的乙醇耐受性菌株已经在工业环境中用作主力微生物以用于生产乙醇。
分子技术已经使得能够鉴定出与乙醇耐受性有关的基因。举例来说,Kajiwara(Appl Microbiol Biotechnol.2000;53:568-74)报道了参与不饱和脂肪酸(UFA)合成的OLE1基因的过表达引起细胞中不饱和脂肪酸水平更高和更高的乙醇产量。其它研究已经发现,通过破坏海藻糖水解酶酸性海藻糖酶(ATH)来积累海藻糖(Kim等,Appl EnvironMicrobiol.1996;62:1563-1569)或通过携带PRO1γ-谷氨酰激酶突变的菌株来积累脯氨酸L-脯氨酸(Takagi等,Appl Environ Microbiol.2005;71:8656-8662)提高了酵母的乙醇耐受性。麦角固醇与酿酒酵母的乙醇耐受性密切相关(Inoue等,Biosci BiotechnolBiochem.2000;64:229-236)。虽然在该领域中已经取得了进展,但是使用表现出乙醇耐受性的遗传修饰的菌株单独可能不足以在发酵过程期间提供所期望的水平的乙醇。
除了发酵微生物的遗传谱之外,发酵培养基的组分也可能对乙醇产生具有显著的影响。在发酵过程中,在培养基中存在碳水化合物或碳水化合物混合物。淀粉是广泛可用并且廉价的碳水化合物来源。它可从多种植物来源获得,如玉米、小麦、水稻、大麦等。许多生物体不能够直接代谢淀粉,或者缓慢并且低效地代谢淀粉。
因此,通常在将淀粉供给到发酵过程中之前对淀粉进行处理以将它分解成生物体可以容易发酵的单糖。通常,将淀粉水解以形成主要含有葡萄糖(即右旋糖)的混合物。然而,预处理淀粉组合物以为发酵作准备可能是昂贵的和劳动密集的,这是因为它通常涉及将纯化的淀粉降解酶添加到淀粉材料中,并且在进行发酵之前需要另外的步骤。此外,完全水解成葡萄糖会增加显著的成本,因此大部分的可商购获得的葡萄糖产品倾向于含有少量的各种低聚多糖。
生产基于淀粉的乙醇的成本的很大一部分是将淀粉分解成可发酵糖的酶。已经在酿酒酵母中尝试了各种分子技术以减少或消除向发酵培养基中添加淀粉分解酶的需要,但是这些方法已经取得了不同程度的成功。影响工程化的菌株的商业可行性的潜在限制因素是酿酒酵母分泌大量外来蛋白质的能力以及所述蛋白质在它被分泌之后在发酵培养基中以所期望的方式起作用的能力。
发明内容
本发明涉及工程化的酵母和发酵方法,其中所述工程化的酵母能够将修饰的葡糖淀粉酶分泌到发酵培养基中并且对发酵底物提供葡糖淀粉酶活性。本发明还涉及葡糖淀粉酶(E.C.3.2.1.3),其被修饰以用异源性分泌序列部分或完全置换它们的天然分泌序列。本发明还涉及编码这些分泌序列修饰的葡糖淀粉酶的基因以及表达这些基因的微生物。本发明还涉及用于生产由生物体制造的生物衍生产品(发酵产品),如乙醇的方法。本发明还涉及发酵副产品,其可以用于其它类型的组合物和方法,如动物饲料组合物和相关方法。
在与本申请相关的实验研究中,已经发现,源自于构巢曲霉(Aspergillusnidulans)α淀粉酶(An AA)、酿酒酵母α交配因子(Sc FAKS、Sc AKS、Sc AK以及Sc MFα1)、酿酒酵母转化酶(Sc IV)、原鸡(Gallus gallus)溶菌酶(Gg LZ)以及智人白蛋白(Hs SA)的N末端氨基酸序列可以用作用于葡糖淀粉酶融合多肽的异源性分泌信号,并且这些异源性分泌信号能够促进所述融合多肽分泌到发酵培养基中。所述融合体对淀粉产品具有酶活性,从而引起葡萄糖形成和发酵成所期望的生物产物,如乙醇。
还已经发现,这些N末端氨基酸序列共有被认为促进所述葡糖淀粉酶融合体的分泌的共同的结构特征,并且不干扰所述融合蛋白具有葡糖淀粉酶活性的能力。具体来说,这些分泌序列包括一段5-8个连续疏水性氨基酸残基。在该段中至少一次发现的共同疏水性氨基酸是亮氨酸。此外,在一些分泌信号中,发现该段通常紧邻一个或两个极性氨基酸残基,如丝氨酸。
因此,本发明的方面提供了一种工程化的多肽,所述工程化的多肽包括(a)包含5-8个连续疏水性氨基酸残基的分泌信号氨基酸序列;以及(b)来自酵母、真菌或细菌葡糖淀粉酶多肽的葡糖淀粉酶氨基酸序列,其中所述分泌信号氨基酸序列与所述葡糖淀粉酶氨基酸序列是异源的,并且所述工程化的多肽具有葡糖淀粉酶活性。举例来说,这段5-8个连续疏水性氨基酸残基可以存在于具有约15个至约30个氨基酸的异源性前导序列内。
因此,在本发明的方面,所述工程化的多肽包括(a)分泌信号氨基酸序列,所述分泌信号氨基酸序列与以下各项具有80%或更大的序列同一性:(i)SEQ ID NO:73的至少AA1-19的氨基酸序列;(ii)SEQ ID NO:74的至少AA 1-19的氨基酸序列;(iii)SEQ ID NO:77(An aa);(iv)SEQ ID NO:75(Sc IV);(v)SEQ ID NO:76(Gg LZ);或(vi)SEQ ID NO:78(HsSA);以及(b)来自酵母、真菌或细菌葡糖淀粉酶多肽的葡糖淀粉酶氨基酸序列,其中所述多肽具有葡糖淀粉酶活性。
本发明的方面还提供了一种核酸序列,所述核酸序列编码工程化的多肽,所述工程化的多肽包括包含5-8个连续疏水性氨基酸残基的分泌信号氨基酸序列;以及(b)来自酵母、真菌或细菌葡糖淀粉酶多肽的葡糖淀粉酶氨基酸序列,其中所述分泌信号氨基酸序列与所述葡糖淀粉酶氨基酸序列是异源的,并且所述工程化的多肽具有葡糖淀粉酶活性。这些方面包括构建体,其中所述核酸存在于载体构建体上,所述载体构建体可以包括以下序列中的一个或多个:启动子序列、终止子序列、选择标记序列、基因组整合序列和/或复制起点序列。所述核酸可以整合到宿主基因组DNA的一个或多个位置中,或可以存在于细胞内,但是不整合,如在质粒或游离型构建体上。本发明还提供了核酸,如DNA低聚物(例如单链DNA PCR引物或更长的线性DNA片段),其可以用于检测细胞中具有如所述的分泌序列的葡糖淀粉酶基因。
本发明的方面还提供了宿主细胞,所述宿主细胞包括编码所述分泌信号修饰的葡糖淀粉酶的核酸序列。在一些方面,所述宿主细胞表达所述信号修饰的葡糖淀粉酶并且能够将所述酶分泌到其中存在所述细胞的培养基中。示例性宿主细胞包括酵母,如酵母菌属(Saccharomyces)的菌种(例如酿酒酵母)。所述工程化的酵母可以对所述细胞的生物衍生产物,如乙醇或衍生自由所述酶的淀粉分解活性产生的前体的另外的产物具有耐受性。举例来说,所述宿主细胞可以是可商购获得的菌株或具有一个或多个特定遗传修饰的菌株,所述特定遗传修饰使得对生物衍生产物的耐受性增加,如乙醇耐受性增加,如乙醇耐受性酿酒酵母。
本发明的另一个方面提供了一种工程化的酵母,所述工程化的酵母表达多肽,所述多肽包含(a)分泌信号氨基酸序列,所述分泌信号氨基酸序列与以下各项具有80%或更大的序列同一性:(i)SEQ ID NO:73的至少AA 1-19的氨基酸序列;(ii)SEQ ID NO:74的至少AA 1-19的氨基酸序列;(iii)SEQ ID NO:77(An aa);(iv)SEQ ID NO:75(Sc IV);(v)SEQID NO:76(Gg LZ);或(vi)SEQ ID NO:78(Hs SA);(vii)SEQ ID NO:80(MFα2);或(viii)SEQID NO:81(Pho5);以及(b)葡糖淀粉酶氨基酸序列,所述葡糖淀粉酶氨基酸序列与选自由以下组成的组的葡糖淀粉酶序列具有至少50%的序列同一性:(1)SEQ ID NO:42(米根霉(Rhizopus oryzae)GA)的氨基酸26-604;(ii)SEQ ID NO:43(白宇佐美曲霉(Aspergillusshirousami)GA)的氨基酸19-639;(iii)SEQ ID NO:44(土曲霉(Aspergillus terreus)GA)的氨基酸21-636。
本发明的方面还提供了一种用于通过发酵生产乙醇的方法,其中所述乙醇以90g/L或更大的浓度存在于发酵培养基中。在所述方法中,将包含淀粉材料和如本文所述的工程化的酵母的液体培养基发酵。发酵可以在液体培养基中提供约90g/L或更大的乙醇浓度,如在约90g/L至约170g/L的范围。在一些方面,在发酵期间的至少一个时间点期间,所述发酵培养基具有大于32℃的温度,并且发酵在发酵培养基中提供110g/L或更大量的乙醇。
在另一个方面,本发明提供了可以用于制备饲料组合物的方法和组合物。所述饲料组合物包括从源自于本公开的非天然酵母的发酵培养基中获得的发酵培养基副产品。举例来说,在已经完成发酵过程之后,可以从发酵培养基中去除一些或所有生物产物以提供包含非生物产物固体的精制组合物。所述非生物产物固体可以包括所述非天然酵母、培养基中未被所述酵母利用的原料材料以及发酵副产物。所述精制组合物可以用于形成饲料组合物,如牲畜饲料组合物。所述包含非生物产物固体的精制组合物可以提供碳水化合物和蛋白质补充剂以提高饲料组合物的营养含量。
附图说明
图1是酵母培养板的照片,示出了表达米根霉葡糖淀粉酶的不同型式的菌株的生长。
图2是来自用含有修饰的来自米根霉的葡糖淀粉酶、野生型米根霉葡糖淀粉酶的菌株以及缺乏葡糖淀粉酶的菌株进行发酵的乙醇产量随时间推移的图表。
图3是表达修饰的米根霉葡糖淀粉酶的菌株在最终时间点时的乙醇滴度的图表。
图4是表达分泌信号修饰的葡糖淀粉酶的多个拷贝的菌株的乙醇和葡萄糖谱的图表。
图5是来自玉米醪发酵的乙醇水平的图表,比较了表达分泌信号修饰的葡糖淀粉酶的菌株。
图6是来自玉米醪发酵的乙醇水平的图表,与不表达葡糖淀粉酶的菌株相比,比较了表达分泌信号修饰的葡糖淀粉酶的菌株。
图7是在30℃和33.3℃下来自玉米醪发酵的乙醇滴度的图表,与不表达葡糖淀粉酶的菌株相比,比较了表达修饰的米根霉葡糖淀粉酶或修饰的白宇佐美曲霉葡糖淀粉酶的菌株。
图8是在30℃和33.3℃下玉米醪发酵中残留葡萄糖水平的图表,与不表达葡糖淀粉酶的菌株相比,比较了表达修饰的米根霉葡糖淀粉酶或修饰的白宇佐美曲霉葡糖淀粉酶的菌株。
图9是在33.3℃下来自玉米醪发酵的乙醇水平的图表,比较了突变菌株1-28与亲本菌株1-25。
具体实施方式
下文所述的本发明的方面不意图是穷举性的或将本发明限于以下详细说明中所公开的精确形式。相反,所选择和描述的方面的目的是使得可以促进本领域技术人员对本发明的原理和实施的理解和了解。
本发明的方面涉及葡糖淀粉酶基因,所述葡糖淀粉酶基因被修饰以用异源性分泌序列置换它们的天然分泌序列。通过用基于SEQ ID NO:73-78的异源性前导序列置换葡糖淀粉酶的天然前导序列,异源性前导序列-GA融合体能够被分泌到发酵培养基中并且对淀粉产品具有酶活性,从而引起葡萄糖形成和发酵成所期望的生物产物,如乙醇。能够用作模板以表达这些酶的核酸也是本发明的方面。
本发明的方面还涉及表达这些酶的微生物,特别是真菌生物体,如酵母(例如酿酒酵母)。这些生物体可以表达具有基于衍生自SEQ ID NO:73-78中的一个或多个的序列的分泌信号的葡糖淀粉酶。
所述葡糖淀粉酶可以从细胞分泌到发酵培养基中,其中所述酶可以对发酵培养基中存在的葡萄糖聚合物具有淀粉分解活性。进而,所述酶可以引起葡萄糖聚合物降解成葡萄糖,所述葡萄糖可以进入细胞中并且用作碳源以用于产生目标化合物,如乙醇。
如本文所用的术语“外源性”意指将分子(如核酸)或活性(如酶活性)引入宿主生物体中。外源性核酸可以通过公知技术引入到宿主生物体中并且可以被维持在宿主染色体物质外部(例如维持在非整合载体上),或可以被整合到宿主的染色体中,如通过重组事件。外源性核酸可以编码与宿主生物体同源或异源的酶或其部分。
术语“异源性”指的是来自不同于所提到的分子或生物体的来源的分子或活性。举例来说,在本公开的上下文中,“异源性信号序列”指的是与所提到的多肽或酶的序列不同的信号序列。举例来说,可以将天然信号序列从葡糖淀粉酶中去除并且用来自不同多肽的信号序列置换,修饰的葡糖淀粉酶具有“异源性信号序列”。因此,与所提到的生物体异源的基因或蛋白质是不存在于该生物体中的基因或蛋白质。举例来说,存在于第一真菌菌种中并且外源引入作为宿主生物体的第二真菌菌种中的特定葡糖淀粉酶基因与第二真菌生物体是“异源的”。
葡糖淀粉酶(E.C.3.2.1.3)是将1,4-连接的a-D-葡糖基残基连续从低聚糖链和多糖链的非还原末端水解,同时释放D-葡萄糖的淀粉分解酶。
葡糖淀粉酶还可以切割支链淀粉分支点上的α-1,6键。如本文所用的术语“淀粉分解活性”涉及这些酶促机制。葡糖淀粉酶多肽可以是天然存在的葡糖淀粉酶的变体或天然存在的葡糖淀粉酶的一部分(如在它的N末端、它的C末端或这两者处被截短的葡糖淀粉酶),而所述葡糖淀粉酶多肽保留淀粉分解活性。
葡糖淀粉酶的替代名称包括淀粉葡萄糖苷酶;γ-淀粉酶;溶酶体α-葡萄糖苷酶;酸性麦芽糖酶;外切-1,4-α-葡萄糖苷酶;葡萄糖淀粉酶;γ-1,4-葡聚糖葡萄糖水解酶;酸性麦芽糖酶;1,4-α-D-葡聚糖葡萄糖水解酶。
大部分的葡糖淀粉酶是多结构域酶。许多葡糖淀粉酶包括经由O-糖基化接头区与催化结构域连接的淀粉结合结构域。所述淀粉结合结构域可以折叠成反向平行β-桶状结构并且可以具有淀粉或β-环糊精的两个结合位点。然而,一些葡糖淀粉酶不包括淀粉结合结构域(例如参见Hostinova等,Archives of Biochemistry and Biophysics,411:189-195,2003),或包括非规范淀粉结合结构域。举例来说,米根霉葡糖淀粉酶具有N末端生淀粉结合结构域,并且扣囊复膜孢酵母(Saccharomycopsis fibuligera)IFO 0111葡糖淀粉酶缺乏明确的淀粉结合结构域(Hostinova等(同上))。因此,本发明的一些方面涉及不包括淀粉结合结构域并且具有被异源性分泌信号修饰的N末端的葡糖淀粉酶,并且其它方面涉及包括淀粉结合结构域并且具有被异源性分泌信号修饰的N末端的葡糖淀粉酶。
葡糖淀粉酶还可以具有催化结构域,所述催化结构域具有构造的扭曲(α/α)(6)-桶状结构的构型,其具有中心漏斗形活性位点。葡糖淀粉酶可以具有约450个残基的结构上保守的催化结构域。在一些葡糖淀粉酶中,在催化结构域之后一般是接头区,所述接头区由30个至80个残基组成,所述残基与具有约100个残基的淀粉结合结构域连接。
葡糖淀粉酶特性可以与它们的结构特征有关。使用来自催化结构域和淀粉结合结构域模型的信息来构建基于结构的多序列比对(参见例如Coutinho,P.M.和Reilly,P.J.,1994.Protein Eng.7:393-400和749-760)。已经证实,基于结构-功能关系研究,催化结构域和淀粉结合结构域在功能上是独立的,并且在微生物葡糖淀粉酶中存在结构相似性。根据其它研究,已经证实特定葡糖淀粉酶残基参与引导蛋白质构象变化、底物结合、热稳定性以及催化活性(参见例如Sierks,M.R.等,1993.Protein Eng.6:75-79;以及Sierks,M.R.和Svensson,B.1993.Biochemistry 32:1113-1117)。因此,葡糖淀粉酶序列与蛋白质功能之间的相关性在本领域中是有所了解的,并且本领域技术人员可以设计和表达具有一个或多个氨基酸缺失、取代和/或添加的具有淀粉分解活性的葡糖淀粉酶的变体。举例来说,在一些方面,异源性分泌信号-葡糖淀粉酶的葡糖淀粉酶部分可以含有天然存在的葡糖淀粉酶的截短型式,所述截短型式至少具有催化结构域和任选的淀粉结合结构域而具有如本文所述的淀粉分解活性。
Shibuya,I.等(Agric.Biol.Chem.,58:1905-1914,1990)描述了来自白宇佐美曲霉的葡糖淀粉酶(GAase)基因的核苷酸序列(参见表1中的glaA,登录号P22832)。推导的GAase的氨基酸序列含有639个氨基酸残基,具有约68,000道尔顿的相对分子质量(非糖基化形式)。白宇佐美曲霉GA的氨基酸19-639如SEQ ID NO:43所示。
Ghose,A.等(FEMS Microbiol Lett.54:345-349,1990)描述了具有细胞外淀粉分解活性的来自土曲霉的菌株的葡糖淀粉酶,其在pH 5.0下具有最佳的活性并且在pH 3.0-8.0是稳定的。Ventura,L.等(Appl.Environ.Microbiol.61:399-402 1995)描述了通过与黑曲霉(A.niger)glaA基因的同源性来克隆土曲霉gla1基因。gla1编码序列含有被四个内含子中断的具有2,132bp的开放阅读框(基因库登录号L15383),从而提供具有推导的67,789的Mr的具有636个氨基酸的长度的多肽(UniProt Q0CPK9)。将Gla1氨基酸序列与其它真菌葡糖淀粉酶进行比较,这鉴定出具有28个氨基酸的推定前导肽、含有酶活性区的N末端催化结构域、具有高Thr和Ser含量的接头区以及C末端淀粉结合结构域。土曲霉GA的氨基酸21-636如SEQ ID NO:44所示。
Ashikari,T等(Agric.Biol.Chem.49:2521-2523,1985)描述了通过确定纯化蛋白的N末端和C末端序列和产生用于鉴定编码具有604个氨基酸长度的蛋白质的cDNA的探针寡核苷酸来克隆根霉属菌种葡糖淀粉酶基因。如由Lin,S.-C.等(BMC Biochemistry 8:9,2007)所综述,米根霉葡糖淀粉酶(Ro GA)被合成为含有具有25个氨基酸的分泌信号的前体,Ro GA的成熟形式由SBD结构域(残基26-131)、富含Thr/Ser的接头区(残基132-167)以及催化结构域(残基168-604)组成。所述SBD结构域属于碳水化合物结合模块(CBM)家族21并且Ro GA的C末端催化结构域水解淀粉并且与其它真菌GA的催化结构域具有高度序列相似性。米根霉GA的氨基酸26-604如SEQ ID NO:42所示。
Hostinova等(Archives of Biochemistry and Biophysics,411:189-195,2003)描述了酵母菌株扣囊复膜孢酵母IFO 0111中的葡糖淀粉酶基因Glm(在本文被称作“Sf GA-1”)的核苷酸序列。根据Hostinova等,扣囊复膜孢酵母Glm基因被转录成1.7kb的RNA转录物,其编码具有515个氨基酸的蛋白质并且由SEQ ID NO:1表示。在515个氨基酸长的多肽链中,26个N末端氨基酸残基构成信号肽并且随后的489个氨基酸残基构成成熟蛋白。缺乏信号序列并且具有489个氨基酸的长度的成熟Glm在脱糖基化形式下具有54,590Da的预测分子量。在与其它葡糖淀粉酶进行比对时,Glm被证实在催化结构域中具有同源性。
Itoh等(J.Bacteriol.169:4171-4176)描述了酵母扣囊复膜孢酵母中的另一种葡糖淀粉酶基因GLU1(在本文被称作“Sf GA-2”)的核苷酸序列。扣囊复膜孢酵母GLU1基因被转录成2.1kb的RNA转录物,其编码具有519个氨基酸的蛋白质并且具有57,000Da的分子量。GLU1具有四个潜在的糖基化位点(对于具有2000Da的分子量的天冬酰胺连接的糖苷)。GLU1具有四个潜在的糖基化位点(对于具有2000Da的分子量的天冬酰胺连接的糖苷)。GLU1具有用于分泌的天然信号序列,其可能在蛋白质输出期间被切除。在切割位点之前是碱性氨基酸Lys-Arg,其被认为是蛋白水解加工信号以产生成熟蛋白。
Itoh等(同上)还描述了来自酵母和真菌的葡糖淀粉酶的氨基酸序列的比对。对扣囊复膜孢酵母、黑曲霉、米根霉和糖化酵母(Saccharomyces diastaticus)以及酿酒酵母进行比对,显示出五个高度同源的片段(S1-S5)。对应的保守片段的这些部分被证实在构象上是彼此相似的。一般位于羧基末端处的S5片段似乎对于淀粉分解活性是非必需的,这是因为来自酵母菌属菌种的葡糖淀粉酶缺乏该区域。
在这方面,本发明还考虑了具有葡糖淀粉酶活性的多肽的变体和部分。表1和表2呈现了各种真菌和细菌葡糖淀粉酶基因的列表,包括葡糖淀粉酶的天然信号序列的氨基酸位置以及在一些序列中前肽的氨基酸位置。
表1:真菌葡糖淀粉酶
表2:细菌葡糖淀粉酶
如本文以及表1和表2中所述,来自各种真菌和细菌菌种的葡糖淀粉酶一般还包括天然“信号序列”。各种其它术语可以用于表示如本领域已知的“信号序列”,如其中词语“信号”被“分泌”或“靶向”或“定位”或“转运”或“前导”替换,并且词语“序列”被“肽”或“信号”替换。一般来说,信号序列是位于新合成的蛋白质的氨基末端处的短氨基酸段(长度通常在5-30个氨基酸的范围内)。大部分的信号肽包括碱性N末端区(n区)、中心疏水区(h区)以及极性C末端区(c区)(例如参见von Heijne,G.(1986)Nucleic Acids Res.14,4683-4690)。信号序列可以将蛋白质靶向到细胞的某个部分,或可以靶向蛋白质以从细胞中分泌。举例来说,已经证实,糖化酵母葡糖淀粉酶STAI基因的天然N末端信号序列可以将它靶向到分泌部的内质网(例如参见Yamashita,I.等,(1985)J.Bacteriol.161,567-573)。
在一个方面,本发明提供了葡糖淀粉酶的天然信号序列被基于An aa、Sc FAKS、ScAKS、Sc MFα1、Sc IV、Gg LZ以及Hs SA的N末端部分处的序列的分泌信号部分或完全置换,所述分泌信号代表葡糖淀粉酶背景下的异源性分泌信号。这些分泌信号可以用作葡糖淀粉酶的天然分泌信号的替代物,或除了天然分泌信号之外还可以使用这些分泌信号。鉴于异源性分泌信号的添加,所述蛋白质可以被称为“融合蛋白”并且如下注释:[An aa-SS]-[GA]、[Sc IV-SS]-[GA]等。
在一些方面,本公开的融合蛋白可以包括信号序列,所述信号序列与SEQ ID NO:73(Sc-FAKS)的至少AA 1-19的氨基酸序列具有80%或更大、85%或更大、90%或更大、95%或更大、96%或更大、97%或更大、98%或更大或99%或更大的序列同一性。SEQ ID NO:73是源自于酿酒酵母肽交配信息素α-因子的N末端部分的90个氨基酸的序列(例如参见Brake,A.等,Proc.Natl.Acad.Sci.,81:4642-4646,1984;Kurjan,J.和Herskowitz,I.,Cell 30:933-943,1982),并且所述信号序列可以选自1-x,其中x是19至89范围内的整数。
对于短于SEQ ID NO:73的异源性信号序列,所述信号序列可以选自1-x,其中x是19至88范围内的整数(19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40、41、42、43、44、45、46、47、48、49、50、51、52、53、54、55、56、57、58、59、60、61、62、63、64、65、66、67、68、69、70、71、72、73、74、75、76、77、78、79、80、81、82、83、84、85、86、87或88)。
可以用作异源性信号序列的SEQ ID NO:73的一部分的实例是酿酒酵母α交配因子(Sc-MFα1),它是SEQ ID NO:73的氨基酸1-19。
SEQ ID NO:73的氨基酸1-19中的示例性氨基酸取代可以包括位置7(F→L、V、I、A、G;非极性)和10(V→L、F、I、A、G;非极性)处的保守氨基酸取代。
本公开的融合蛋白可以包括信号序列,所述信号序列与包括SEQ ID NO:73(Sc-FAKS)的AA 1-19和AA 20-89的一个或多个部分的氨基酸序列具有80%或更大、85%或更大、90%或更大、95%或更大、96%或更大、97%或更大、98%或更大或99%或更大的序列同一性。在实施方案中,所述信号序列可以与其中SEQ ID NO:73的AA 20-89的一个或多个部分缺失的SEQ ID NO:73具有同一性。举例来说,所述信号序列可以与没有氨基酸29-33、57-70或这两者的SEQ ID NO:73具有同一性。示例性信号序列是氨基酸SEQ ID NO:74(Sc-AKS;即没有氨基酸29-33和57-70的SEQ ID NO:73)。另一个示例性信号序列是SEQ ID NO:74的一部分(被称为Sc-AK;即没有氨基酸29-33、57-70以及86-89的SEQ ID NO:73)。
本公开的融合蛋白可以包括信号序列,所述信号序列与源自于酿酒酵母转化酶(Sc IV)的N末端的SEQ ID NO:75具有80%或更大、85%或更大、90%或更大或95%或更大的序列同一性。Sc IV是具有532个氨基酸的蔗糖水解酶,其具有具19个氨基酸的N末端信号肽(例如参见Carlson M.等,(1983)Mol.Cell.Biol.3:439-447)。
SEQ ID NO:75中的示例性氨基酸取代可以包括位置6(F→L、V、I、A、G;非极性)和9(L→V、F、I、A、G;非极性)处的保守氨基酸取代。
本公开的融合蛋白可以包括信号序列,所述信号序列与源自于原鸡溶菌酶(GgLZ)的N末端的SEQ ID NO:76具有80%或更大、85%或更大、90%或更大或95%或更大的序列同一性。Gg LZ(也被称为蛋清溶菌酶)是具有129个氨基酸的糖苷水解酶,其具有具18个氨基酸的N末端信号肽(例如参见Jigami等(1986)Gene 43:273-279)。
SEQ ID NO:76中的示例性氨基酸取代可以包括位置10(L→F、V、I、A、G;非极性)和13(V→L、F、I、A、G;非极性)处的保守氨基酸取代。
本公开的融合蛋白可以包括信号序列,所述信号序列与源自于智人白蛋白(HsSA)的N末端的SEQ ID NO:78具有80%或更大、85%或更大、90%或更大或95%或更大的序列同一性。Hs SA是具有609个氨基酸的血清蛋白,其具有具18个氨基酸的N末端信号肽(例如参见Kober等(2013)Biotechnology and Bioengineering;110:1164-1173)。
SEQ ID NO:78中的示例性氨基酸取代可以包括位置6(F→L、V、I、A、G;非极性)和9(L→V、F、I、A、G;非极性)处的保守氨基酸取代。
在一些方面,本公开的融合蛋白可以包括信号序列,所述信号序列与源自于酿酒酵母交配因子α2基因(Sc MFα2)的N末端的SEQ ID NO:80具有80%或更大、85%或更大、90%或更大或95%或更大的序列同一性。Sc MFα2分泌信号修饰的葡糖淀粉酶多肽和表达所述Sc MFα2分泌信号修饰的葡糖淀粉酶多肽的工程化的酵母菌株描述于国际申请序列号PCT/US2016/016822中,并且在2016年2月5日提交(Miller等)。
酿酒酵母交配因子α2(Sc MFα2)分泌信号描述于美国专利号4,546,082(Kurjan等)中。Sc MFα2SS序列如下:MKFISTFLTFILAAVSVTA(SEQ ID NO:80)。Sc MFα2序列来自基因YGL089C(YGL089C),而MFα1是由基因YPL187W MFα1编码的并且MFα2是由MATa细胞分泌的信息素。
基于SEQ ID NO:73-78和80的比对,鉴定出促进GA分泌和活性的分泌信号中的共同结构特征。该共同的结构特征是分泌信号内的一段5-8个连续疏水性氨基酸残基。该共同的结构特征是分泌信号内的一段5-8个连续疏水性氨基酸残基。举例来说,该段可以具有5个、6个、7个或8个疏水性氨基酸。疏水性氨基酸是丙氨酸、异亮氨酸、亮氨酸、缬氨酸和苯丙氨酸、甲硫氨酸、色氨酸以及酪氨酸。优选的是,该段包括丙氨酸残基、异亮氨酸残基、亮氨酸残基、苯丙氨酸残基以及缬氨酸残基中的一个或多个。甚至更优选的是,该段包括一个或多个亮氨酸残基。
该段可以存在于具有至少15个、16个、17个、18个或19个或更多个氨基酸残基,如约15个至约30个氨基酸的异源性分泌信号氨基酸序列内。所述异源性分泌信号氨基酸序列具有与和所述异源性序列融合的葡糖淀粉酶序列的天然分泌序列不同的氨基酸序列。
该段通常由不具疏水性的两个氨基酸残基界定。在一些实施方案中,该段疏水性氨基酸残基紧邻一个或两个极性氨基酸残基。在优选的方面,所述极性氨基酸残基是丝氨酸残基。
在一些方面,所述5-8个连续疏水性氨基酸残基包含选自由以下组成的组的序列:AVLFAA(SEQ ID NO:82)、AFLFLL(SEQ ID NO:83)、LVLVLL(SEQ ID NO:84)、LLFLF(SEQ IDNO:85)以及FILAAV(SEQ ID NO:86)。
在其它方面,本公开的融合蛋白可以包括信号序列,所述信号序列与源自于酿酒酵母阻遏酸性磷酸酶(Sc PHO5)的N末端的SEQ ID NO:81具有80%或更大、85%或更大、90%或更大或95%或更大的序列同一性。Sc PHO5分泌信号描述于美国专利号5,521,086(Scott等)和Meyhack等(EMBO J.6:675-680,1982)中。Sc PHO5SS序列如下:MFKSVVYSILAASLANA(SEQ ID NO:81)。Sc PHO5序列来自PHO5,所述PHO5是编码酿酒酵母酸性磷酸酶的结构基因,由培养基中的浓度或无机磷酸盐(Pi)调节。Sc PHO5分泌信号修饰的葡糖淀粉酶多肽和表达所述Sc PHO5分泌信号修饰的葡糖淀粉酶多肽的工程化的酵母菌株描述于国际申请序列号PCT/US2016/016822中,并且在2016年2月5日提交(Miller等)。
可以进行分子技术以产生核酸序列,所述核酸序列是用于表达具有异源性信号序列的葡糖淀粉酶基因的模板(如果葡糖淀粉酶蛋白质/核苷酸序列是本领域已知的话)。一般来说,制备核酸以编码包含异源性信号序列和葡糖淀粉酶序列的蛋白质。
可以使用编码功能性葡糖淀粉酶多肽的任何序列。在一些方面,所述葡糖淀粉酶序列可以是葡糖淀粉酶基因的天然(“野生型”)序列,其中所述异源性信号序列-葡糖淀粉酶基因的葡糖淀粉酶部分的序列在任何氨基酸位置处与天然序列没有差异。在其它方面,所述异源性信号序列-葡糖淀粉酶基因的葡糖淀粉酶部分的序列在一个或多个氨基酸位置处不同于天然序列。所述差异可以是例如(a)从野生型序列中去除一个或多个氨基酸;(b)将一个或多个氨基酸添加到野生型序列中;(c)取代野生型序列;(a)和(c)的组合;或(b)和(c)的组合。
举例来说,在一个方面,在添加异源性信号序列之前,可以将葡糖淀粉酶的天然序列在它的N末端处进行改变。在一些方面,在连接异源性信号序列之前,去除天然葡糖淀粉酶信号序列的全部或一部分。举例来说,可以通过使天然分泌信号的一个或多个,而非所有的氨基酸缺失(例如天然前导序列的最多50%、60%、70%、80%、90%或95%缺失)来改变葡糖淀粉酶的天然前导序列的一部分。这样的天然前导序列的一部分的缺失可以使得天然葡糖淀粉酶前导序列丧失它的天然功能,它的天然功能被异源性信号序列(基于衍生自SEQID NO:73-78和80的序列的分泌信号)提供的功能替代。在其它方面,全部的天然分泌信号可以从葡糖淀粉酶中去除并且被异源性信号序列置换。
举例来说并且参考表1,在制备融合蛋白构建体时,去除米根霉葡糖淀粉酶(RoGA;P07683)的前25个氨基酸中的一些或全部,其对应于使用CBS预测服务器预测的前导序列(即天然蛋白质的氨基酸1-25)。因此,米根霉葡糖淀粉酶天然分泌信号的全部或一部分被异源性分泌信号序列置换,所述异源性分泌信号序列具有(a)分泌信号氨基酸序列,所述分泌信号氨基酸序列与以下各项具有80%或更大的序列同一性:(i)SEQ ID NO:73的至少AA 1-19的氨基酸序列;(ii)SEQ ID NO:74的至少AA 1-19的氨基酸序列;(iii)SEQ ID NO:77(An aa);(iv)SEQ ID NO:75(Sc IV);(v)SEQ ID NO:76(Gg LZ);或(vi)SEQ ID NO:78(Hs SA)。
所述异源性分泌信号序列可以与Ro GA多肽的其余部分(例如氨基酸26-604;SEQID NO:42)直接或间接连接。当本公开的异源性分泌信号序列与Ro GA多肽的其余部分直接融合时,提供了具有约597个至约668个氨基酸范围内的长度的融合蛋白。
在一些方面,本公开提供了一种多肽,所述多肽与选自由以下组成的组的多肽具有50%或更大、60%或更大、70%或更大、80%或更大、90%或更大、95%或更大、96%或更大、97%或更大、98%或更大或99%或更大的序列同一性:(i)SEQ ID NO:52(Sc-FAKS)-米根霉GA;(ii)SEQ ID NO:53(Sc-AKS)-米根霉GA;(iii)SEQ ID NO:54(An aa)-米根霉GA;(iv)SEQ ID NO:55(Sc IV)-米根霉GA;(v)SEQ ID NO:56(Gg LZ)-米根霉GA;(vi)SEQ IDNO:57(Hs SA)-米根霉GA;以及(vii)SEQ ID NO:58(Sc MFα1)-米根霉GA。
在一些方面,本公开提供了一种多肽,所述多肽与选自由以下组成的组的多肽具有50%或更大、60%或更大、70%或更大、80%或更大、90%或更大、95%或更大、96%或更大、97%或更大、98%或更大或99%或更大的序列同一性:(i)SEQ ID NO:45(Sc-FAKS)-扣囊复膜孢酵母GA;(ii)SEQ ID NO:46(Sc-AKS)-扣囊复膜孢酵母GA;(iii)SEQ ID NO:47(Anaa)-扣囊复膜孢酵母GA;(iv)SEQ ID NO:48(ScIV)-扣囊复膜孢酵母GA;(v)SEQ ID NO:49(Gg LZ)-扣囊复膜孢酵母GA;(vi)SEQ ID NO:50(Hs SA)-扣囊复膜孢酵母GA;以及(vii)SEQ ID NO:51(Sc MFα1)-扣囊复膜孢酵母GA。
在一些方面,本公开提供了一种多肽,所述多肽与选自由以下组成的组的多肽具有50%或更大、60%或更大、70%或更大、80%或更大、90%或更大、95%或更大、96%或更大、97%或更大、98%或更大或99%或更大的序列同一性:(i)SEQ ID NO:59(Sc-FAKS)-白宇佐美曲霉GA;
(i)SEQ ID NO:60(Sc-AKS)-白宇佐美曲霉GA;(ii)SEQ ID NO:61(An aa)-白宇佐美曲霉GA;(iii)SEQ ID NO:62(Sc IV)-白宇佐美曲霉GA;(iv)SEQ ID NO:63(Gg LZ)-白宇佐美曲霉GA;(vi)SEQ ID NO:64(Hs SA)-白宇佐美曲霉GA;以及(vii)SEQ ID NO:65(Sc MFα1)-白宇佐美曲霉GA。
在一些方面,本公开提供了一种多肽,所述多肽与选自由以下组成的组的多肽具有50%或更大、60%或更大、70%或更大、80%或更大、90%或更大、95%或更大、96%或更大、97%或更大、98%或更大或99%或更大的序列同一性:(i)SEQ ID NO:66(Sc-FAKS)-土曲霉GA;(ii)SEQ ID NO:67(Sc-AKS)-土曲霉GA;(iii)SEQ ID NO:68(An aa)-土曲霉GA;(iv)SEQ ID NO:69(Sc IV)-土曲霉GA;(v)SEQ ID NO:70(Gg LZ)-土曲霉GA;(vi)SEQ IDNO:71(Hs SA)-土曲霉GA;以及(vii)SEQ ID NO:72(Sc MFα1)-土曲霉GA。
作为另一个实例,可以通过取代来改变葡糖淀粉酶的天然前导序列的一个或多个氨基酸,所述取代是用与天然氨基酸不同的氨基酸置换天然葡糖淀粉酶前导序列中特定位置处的天然氨基酸。举例来说,可以通过取代天然分泌信号的一个或多个氨基酸来改变葡糖淀粉酶的天然前导序列的一部分(例如天然前导序列氨基酸的最多50%、60%、70%、80%、90%或95%可以被取代)。一个或多个氨基酸的取代可以使得天然葡糖淀粉酶前导序列丧失它的天然功能,它的天然功能被异源性分泌信号序列提供的功能替代。
在其它方面,包含异源性分泌信号序列和葡糖淀粉酶序列的融合多肽任选地包含天然葡糖淀粉酶多肽或异源性分泌信号序列中不存在的另外的序列。在一些方面,所述另外的序列可以为分泌信号修饰的葡糖淀粉酶提供天然多肽中不存在的功能。另外的功能包括例如其它蛋白质或材料的蛋白酶位点或结合位点或异源性分泌信号与葡糖淀粉酶部分之间的接头区。
天然葡糖淀粉酶多肽或异源性分泌信号序列中可能不存在,但是可以添加的另外的序列的实例是接头或间隔序列。接头序列可以位于异源性分泌序列与葡糖淀粉酶序列之间。这样的融合多肽[分泌信号修饰的多肽]可以如下注释:[SS]-[L]-[GA],其中“L”表示将信号序列与葡糖淀粉酶连接的一个或多个氨基酸。示例性接头包括一个或多个氨基酸,如最多5个、10个、15个、20个、25个、30个、35个、50个、100个或200个氨基酸。接头可以包括使接头具有刚性并且防止分泌信号与葡糖淀粉酶的其它部分之间的相互作用的氨基酸。刚性接头可以包括诸如Pro、Arg、Phe、Thr、Glu以及Gln的残基,并且经常形成α-螺旋结构。
或者,所述融合多肽可以包括柔性接头。柔性接头可以包括甘氨酸残基并且使信号序列与融合蛋白的葡糖淀粉酶部分连接而不干扰它们对应的功能。在一些接头序列中,大部分(>50%)的氨基酸残基是甘氨酸。示例性接头序列包括一个或多个接头嵌段,每一个嵌段具有一个或多个甘氨酸残基和选自丝氨酸、谷氨酸、天冬氨酸以及赖氨酸的一个氨基酸。举例来说,接头区可以包括式[GaX]n,其中a是1-6范围内的整数,X是S、E、D或K,并且n是1-10范围内的整数。
在一些方面,所述多肽包括具有蛋白酶切割序列的接头。示例性蛋白酶切割序列包括凝血酶、因子Xa、鼻病毒3C、TEV蛋白酶、Ssp DnaB、内含肽、Sce VMA1内含肽、肠激酶以及KEX2的蛋白酶切割序列(参见例如Waugh,D.S.,Protein Expr Purif.80(2):283-293,2011;Zhou等,Microbial Cell Factories 13:44,2014;以及Bourbonnais等,J.Bio.Chem.263(30):15342,1988)。
天然葡糖淀粉酶多肽或异源性分泌信号序列中可能不存在,但是可以添加的另外的序列的另一个实例是标签序列。标签序列可以位于葡糖淀粉酶序列的C末端处,并且这样的蛋白质可以如下注释:[SS]-[GA]-[T]和[SS]-[L]-[GA]-[T],其中“T”表示提供标签序列的一个或多个氨基酸。示例性肽标签包括最多5个、10个、15个或20个氨基酸。肽标签可以用于多种目的中的任一种或多种。举例来说,所述标签可以允许通过标签结合成员与标签特异性相互作用的能力来从培养基中纯化酶。所述标签还可以允许使用具有可检测标记的标签结合成员来检测或鉴定蛋白质。示例性短肽标签是多聚Arg、FLAG、多聚His、c-myc、S以及Strep II。
本公开的分泌信号修饰的多肽还可以具有天然葡糖淀粉酶多肽中除天然分泌序列以外的一个或多个区域的缺失,其中所述缺失不影响所述多肽的淀粉分解活性。所述缺失可以基于关于天然葡糖淀粉酶的结构和功能的已知信息,包括突变研究和序列比对(例如参见Coutinho(同上)和Sierks(同上))。在一些方面,所述分泌信号修饰的多肽有最多1%、最多2%、最多4%、最多6%、最多8%、最多10%、最多12%、最多14%、最多16%、最多18%、最多20%或最多25%的葡糖淀粉酶多肽的序列缺失。在一些方面,本公开的分泌信号修饰的多肽有对应于天然葡糖淀粉酶多肽的C末端的一部分的缺失。
已经产生了葡糖淀粉酶的截短形式并且已经证实其具有酶活性。举例来说,Evans等(Gene,91:131;1990)产生了葡糖淀粉酶的一系列截短形式以研究O-糖基化区域中有多少是为GAII的活性或稳定性所必需的,所述GAII是缺乏生淀粉结合结构域的酶的完全活性形式。发现C末端的一大部分可以从GAII中缺失而对所述酶的活性、热稳定性或分泌没有显著的影响。
与引起葡糖淀粉酶活性发生变化相关的各种氨基酸取代也是本领域已知的。葡糖淀粉酶序列中各个位置处一个或多个氨基酸的取代已经被证实影响特性,如热稳定性、淀粉水解活性、底物利用以及蛋白酶抗性。因而,本公开考虑将异源性分泌信号与葡糖淀粉酶序列一起使用,所述葡糖淀粉酶序列包括多肽的葡糖淀粉酶部分中的一个或多个氨基酸取代,其中所述取代不同于葡糖淀粉酶的野生型序列。
举例来说,美国专利号8,809,023描述了一种用于降低淀粉水解期间异麦芽糖合成与淀粉水解活性之间的比率(IS/SH比)的方法。具体来说,描述了里氏木霉葡糖淀粉酶(Tr GA)(总长度是632个氨基酸,具有具信号肽的N末端),其在如下氨基酸位置处被修饰:D44R和A539R;或D44R、N61I以及A539R。据报道,该葡糖淀粉酶变体与所述亲本葡糖淀粉酶相比在淀粉水解期间表现出降低的IS/SH比。举例来说,本公开考虑用本公开的异源性分泌信号置换所期望的葡糖淀粉酶(例如不同的葡糖淀粉酶)的天然前导序列,其中所述所期望的葡糖淀粉酶还具有对应于修饰的Tr GA的D44R和A539R;或D44R、N61I以及A539R取代的氨基酸取代。在更广泛的意义上,所述异源性分泌信号可以与具有以下氨基酸取代的葡糖淀粉酶变体一起使用:D44R和A539R;或D44R、N61I以及A539R,所述位置对应于TrGA序列中的对应位置,其中所述葡糖淀粉酶变体与TrGA序列的整个长度具有至少90%的氨基酸序列同一性。模板葡糖淀粉酶序列与TrGA序列的相应的“对应位置”可以通过将例如已知的葡糖淀粉酶多肽序列(用于构建异源性分泌信号葡糖淀粉酶融合体的模板)与TrGA序列进行序列比对来了解。
作为另一个实例,美国专利号8,592,194描述了与野生型葡糖淀粉酶变体相比具有增加的热稳定性的葡糖淀粉酶变体。在本公开中还描述了里氏木霉葡糖淀粉酶,但是作为替代,在以下位置处存在对天然Tr GA序列的一个或多个氨基酸取代:位置10、14、15、23、42、45、46、59、60、61、67、68、72、73、97、98、99、102、108、110、113、114、122、124、125、133、140、144、145、147、152、153、164、175、182、204、205、214、216、219、228、229、230、231、236、239、240、241、242、244、263、264、265、268、269、276、284、291、300、301、303、310、311、313、316、338、342、344、346、349、359、361、364、379、382、390、391、393、394、408、410、415、417以及418。举例来说,本公开考虑用异源性分泌信号置换所期望的葡糖淀粉酶的天然前导序列,其中所述所期望的葡糖淀粉酶还具有被证实为提供增加的热稳定性的氨基酸取代中的任何一个或多个。在更广泛的意义上,所述异源性分泌信号可以与具有提供增加的热稳定性的氨基酸取代的葡糖淀粉酶变体一起使用,所述位置对应于TrGA序列中的对应位置。
来自两种或更多种葡糖淀粉酶的“相应”氨基酸的确定可以通过比对它们的氨基酸序列的全部或部分来确定。序列比对和序列同一性的产生包括全局比对和局部比对,这通常使用计算方法。为了提供全局比对,使用全局优化,所述全局优化强制跨越所有查询序列的整个长度的序列比对。相比之下,在局部比对中,鉴定长序列内较短的具有相似性的区域。
如本文所用的“等同位置”意指两个序列(例如模板GA序列和具有一个或多个所期望的取代的GA序列)共有的位置,所述位置是基于一种葡糖淀粉酶的氨基酸序列的比对或作为三维结构的比对。因此,序列比对或结构比对或这两者都可以用于确定等同性。
在一些实施方式中,使用BLAST算法来比较和确定序列相似性或同一性。此外,可以确定序列中可以被指定权重或分数的空位的存在或显著性。也可以使用这些算法来确定核苷酸序列相似性或同一性。用于确定相关性的参数是基于用于计算所确定的匹配的统计相似性和显著性的本领域已知的方法来计算的。相关的基因产物预期具有高度相似性,如大于50%的序列同一性。使用BLAST算法确定两个或更多个序列的相关性的示例性参数可以如下。
在一些实施方式中,使用BLAST(国家生物信息中心(National Center forBiological Information,NCBI)基本局部比对搜索工具)2.2.29版软件,使用默认参数进行比对。使用BLAST 2.2.29版算法,使用默认参数,相对于参考序列具有XX%(例如80%)的同一性分数的序列被认为与参考序列是至少XX%同一的或等同地具有XX%的序列同一性。如果使用葡糖淀粉酶变体,那么全局比对可以比对与例如米根霉葡糖淀粉酶具有显著同一性的序列以确定靶序列(例如葡糖淀粉酶直系同源物)中哪个或哪些相应氨基酸位置可以被一个或多个氨基酸取代。
在一些实施方式中,编码异源性分泌信号-葡糖淀粉酶多肽的核酸序列以及任何调节序列(例如终止子、启动子等)和载体序列(例如包括选择标记、整合标记、复制序列等)可以使用已知的分子技术制备。用于制备DNA构建体(例如包括异源性分泌信号-葡糖淀粉酶基因的DNA构建体)的方法的一般指导可以见于Sambrook等,Molecular Cloning,ALaboratory Manual,纽约州冷泉港的冷泉港实验室出版社(Cold Spring HarborLaboratory Press,Cold Spring Harbor,N.Y.),1989;以及Ausubel等,CurrentProtocols in Molecular Biology,纽约州纽约的Greene Publishing and Wiley-Interscience公司(Greene Publishing and Wiley-Interscience,New York,N.Y.),1993中。
当使用少量的葡糖淀粉酶模板DNA作为PCR中的起始材料时,可以使用包括异源性分泌信号序列和葡糖淀粉酶序列的在它的天然信号序列的3'的一部分的引物来产生相对大量的特定DNA片段,所述特定DNA片段包括所述异源性分泌信号序列和所述葡糖淀粉酶基因。
可以使用PCR技术来修饰天然葡糖淀粉酶核酸序列以添加异源性分泌信号序列或在葡糖淀粉酶核酸序列中引入一个或多个突变以提供变体。PCR技术描述于例如Higuchi,(1990),PCR Protocols,第177-183页,学术出版社(Academic Press);Ito等(1991)Gene102:67-70;Bernhard等(1994)Bioconjugate Chem.5:126-132;以及Vallette等(1989)Nuc.Acids Res.17:723-733中。所述技术可以任选地包括先前制备的编码葡糖淀粉酶多肽的DNA的定点(或寡核苷酸介导的)诱变、PCR诱变以及盒式诱变。
或者,核酸分子可以由定制基因合成提供商,如DNA2.0(加利福尼亚州的门洛帕克(Menlo Park,CA))或GeneArt(Life Technologies公司,Thermo Fisher Scientific公司)产生。
可以构建表达载体以包括与在宿主生物体中具有功能的表达控制序列可操作地连接的异源性分泌信号-葡糖淀粉酶核酸序列。适用于宿主生物体中的表达载体包括例如质粒、游离体以及人工染色体。所述载体可以包括可操作以用于稳定整合到宿主染色体中的选择序列或选择标记。此外,所述载体可以包括一个或多个选择标记基因和适当的表达控制序列。还可以包括选择标记基因,所述选择标记基因例如提供对抗生素或毒素的抗性、补充营养缺陷型缺陷或提供培养基中没有的关键营养素。表达控制序列可以包括本领域公知的组成型和诱导型启动子、转录增强子、转录终止子等。
在一些方面,所述核酸可以是密码子优化的。用于异源性分泌信号序列-葡糖淀粉酶的葡糖淀粉酶部分的核酸模板可以是编码葡糖淀粉酶的天然DNA序列,或所述模板可以是被优化用于在所期望的宿主细胞中表达的密码子优化型式。提供有关在特定宿主生物体中所期望的密码子使用的信息的数据库是本领域已知的。
根据本公开的一个方面,包含异源性分泌信号序列-葡糖淀粉酶的DNA构建体与启动子序列可操作地连接,其中所述启动子序列在所选择的宿主细胞中具有功能。在一些方面,所述启动子在真菌宿主细胞中显示出转录活性并且可以衍生自编码与宿主细胞同源或异源的蛋白质的基因。在一些方面,所述启动子可用于在酿酒酵母中表达。公知的组成型启动子的实例包括但不限于细胞色素c启动子(pCYC)、翻译延伸因子启动子(pTEF)、甘油醛-3-磷酸脱氢酶启动子(pGPD)、磷酸甘油酸激酶启动子(PGK)以及醇脱氢酶启动子(pADH)。任选地,在载体上还可以包括控制表达的另外的因子,如增强子等。
包括异源性分泌信号序列-葡糖淀粉酶基因的表达载体还可以包括在宿主细胞中具有功能的任何终止序列。举例来说,终止序列和启动子序列可以来自相同的细胞,或终止序列与宿主细胞是同源的。终止序列可以对应于使用的任何启动子。
可以使用载体将DNA构建体引入宿主细胞中。所述载体可以是当引入宿主细胞中时被稳定引入的任何载体。在一些方面,所述载体整合到宿主细胞基因组中并且被复制。载体包括克隆载体、表达载体、穿梭载体、质粒、噬菌体颗粒、盒等。在一些方面,所述载体是表达载体,所述表达载体包含与葡糖淀粉酶编码序列可操作地连接的调节序列。如本文所述的SEQ ID NO可以通过总共构成特定SEQ ID NO的具有重叠同源性的多个较小的DNA片段(例如“SEQ ID NO亚片段”)的转化而在细胞中组装。举例来说,所期望的SEQ ID NO或其部分在细胞中的基因座处的整合可以通过两个至五个DNA亚片段的共转化而实现,这些亚片段进行彼此重组并且整合到细胞中与所述亚片段的部分具有同源性的基因座中。
包含异源性分泌信号序列-葡糖淀粉酶基因的DNA构建体还可以包括选择标记,从而促进在宿主细胞中的选择。举例来说,所述选择标记可以用于转化的酵母。酵母选择标记的实例包括通常用于选择转化的酵母细胞的标记。可以使用控制营养缺陷型的基因来使用营养缺陷型标记,这意味着所述基因使得酵母能够产生酵母生长所需的营养素。控制营养缺陷型的基因的实例包括亮氨酸营养缺陷型(LEU2)、组氨酸营养缺陷型(HIS3)、尿嘧啶营养缺陷型(URA3、URA5)以及色氨酸营养缺陷型(TRP1)。
DNA构建体可以是整合到基因组中并且连同当中已经整合有它的一条或多条染色体一起复制的DNA构建体。举例来说,可以将真菌细胞用编码葡糖淀粉酶的DNA构建体转化,并且将所述DNA构建体以一个或多个拷贝整合到一条或多条宿主染色体中。这种整合一般被认为是一个优点,这是因为DNA序列更可能被稳定地维持。DNA构建体向宿主染色体中的整合可以根据常规方法,如通过同源重组或异源重组来进行。
本公开的工程化的酵母可以包括具有多个拷贝(两个或更多个)的具有异源性分泌信号序列的葡糖淀粉酶基因。举例来说,所述工程化的酵母可以是工程化的酵母菌属,其至少具有第一、第二、第三以及第四外源性核酸,所述外源性核酸各自包括编码本公开的至少一种具有异源性分泌信号序列的葡糖淀粉酶多肽的序列。如果所述工程化的酵母包括多个拷贝的编码具有异源性分泌信号序列的葡糖淀粉酶基因的基因,那么所述拷贝的核酸序列可以彼此相同或不同。示例性方法和已经被工程化以包括多个拷贝的葡糖淀粉酶基因的酵母菌株描述于国际申请序列号PCT/US16/24249中,并且在2016年3月25日提交(Miller等)。
工程化的酵母还可以包括与具有异源性信号序列的葡糖淀粉酶的修饰不同的一种或其它遗传修饰。(异源性)修饰可以包括引入外源性核酸序列;上调或下调基因表达的调节元件的变化;基因拷贝数增加;以及消除基因或基因产物的表达、降低其表达或增加其表达或活性的缺失或突变。异源性修饰可以包括以下一种或多种:使用与所期望的基因的天然启动子不同的启动子;使用与所期望的基因的天然终止子不同的终止子;将基因引入基因组中与它的天然位置不同的位置处;引入所期望的基因的多个拷贝。
可以被包括在工程化的酵母中的另外的遗传修饰是改变或引入将低分子量非葡萄糖糖转化成葡萄糖的酶活性。举例来说,一种任选的另外的遗传修饰在工程化的酵母中影响或引入异麦芽糖酶活性。异麦芽糖酶可以通过水解异麦芽糖中的1,6醚键将异麦芽糖转化为葡萄糖。异麦芽糖酶也可以表现出水解麦芽糖中的1,4醚键的交叉活性。所述遗传修饰可以使异麦芽糖酶活性被引入到细胞中、使细胞中的异麦芽糖酶的量增加和/或使异麦芽糖酶活性增加。
在一些实施方案中,除了具有异源性分泌信号序列的葡糖淀粉酶基因之外,所述工程化的细胞还包括异源性异麦芽糖酶基因或在提供增加的在细胞中的表达的异源性启动子控制之下的异麦芽糖酶基因或以多个拷贝存在于细胞中的异麦芽糖酶基因。举例来说,可以将在异源性启动子(如PDC启动子)控制之下的异麦芽糖酶(IMA)基因工程化到酵母中。
可以引入工程化的酵母中的异麦芽糖酶基因的实例包括但不限于酿酒酵母IMA1(P53051)、酿酒酵母IMA2(Q08295)、酿酒酵母IMA3(P0CW40)、酿酒酵母IMA4(P0CW41)、酿酒酵母IMA5(P40884)、枯草芽孢杆菌(Bacillus subtilis)malL(O06994)、蜡状芽孢杆菌(Bacillus cereus)malL(P21332)、凝结芽孢杆菌(Bacillus coagulans)malL(Q45101)、芽孢杆菌属菌种(Bacillus sp.)malL(P29093)等。优选的是,所述异麦芽糖酶基因编码与登录号NP 011803.3的氨基酸序列(酿酒酵母IMA1)具有大于80%、85%、90%、95%、98%或99%序列同一性的多肽。
在一些实施方案中,所述工程化的酵母还可以包括遗传修饰,所述遗传修饰提供与具有异源性信号序列的葡糖淀粉酶不同的淀粉降解多肽。举例来说,所述遗传修饰可以是引入编码不同的多糖降解酶的核酸的遗传修饰,如外源性或修饰的α-淀粉酶、β-淀粉酶、支链淀粉酶或异淀粉酶。所述遗传修饰也可以是增加细胞中内源性或外源性淀粉降解多肽的量的遗传修饰,如通过使基因处于强启动子的控制之下或在细胞中提供多个拷贝的基因,如整合到基因组中的多个拷贝的基因或存在于非染色体构建体(例如质粒)上的多个拷贝。
在一些实施方案中,所述工程化的酵母还可以包括遗传修饰,所述遗传修饰提供外源性或修饰的糖转运蛋白基因(如异麦芽糖转运蛋白);参见例如2015年12月17日提交的名称为“Sugar Transporter-Modified Yeast Strains and Methods for BioproductProduction”的共同转让的美国申请序列号62/268,932。
可以将各种宿主细胞用包括异源性信号序列-葡糖淀粉酶基因的核酸转化。在一些方面,包括异源性信号序列-葡糖淀粉酶基因的核酸存在于细菌细胞中。可以使用所述细菌细胞例如繁殖核酸序列或产生一定量的多肽。
在其它方面,所述宿主细胞是真核细胞,如真菌细胞。
在一些方面,所述宿主细胞对发酵培养基中更高量的生物衍生产物,如乙醇具有耐受性。在一些方面,所述宿主细胞是“工业酵母”,其指的是常规用于乙醇发酵的任何酵母。实例包括清酒酵母、烧酒酵母、葡萄酒酵母、啤酒酵母、面包酵母等。清酒酵母显示出高乙醇发酵能力和高乙醇抗性以及遗传稳定性。通常,工业酵母具有高乙醇抗性并且优选地在10%或更大的乙醇浓度下存活。
在示例性方面,包括异源性信号序列-葡糖淀粉酶基因的酵母是酿酒酵母。一些酿酒酵母菌株对乙醇具有高耐受性。乙醇耐受性酵母的各种菌株是可商购获得的,如RED和ETHANOL酵母(美国的Fermentis/Lesaffre公司)、FALI(美国的Fleischmann's Yeast公司)、SUPERSTART和酵母(美国威斯康星州的Ethanol Technology公司(Ethanol Technology,Wis.,USA))、BIOFERM AFT和XR(美国乔治亚州的NABC-North American Bioproducts公司(NABC-North American BioproductsCorporation,GA,USA))、GERT STRAND(瑞典的Gert Strand公司(Gert Strand AB,Sweden))以及FERMIOL(DSM Specialties公司)。
工业酵母通常是原养型的并且因此没有适用于选择转化体的营养缺陷型标记。如果所述酵母没有将在转化后另外促进异源性信号序列-葡糖淀粉酶基因在细胞内的保留的遗传背景,那么可以将宿主细胞进行工程化以引入一个或多个遗传突变以建立标记基因的使用,所述标记基因与细胞中的异源性信号序列-葡糖淀粉酶基因缔合并且维持所述异源性信号序列-葡糖淀粉酶基因。举例来说,可以在将异源性信号序列-葡糖淀粉酶基因引入细胞中之前对可商购获得的乙醇耐受性酵母细胞进行遗传修饰。
用于不同营养缺陷型的标记可以通过破坏控制营养缺陷型的基因来提供。在一个实施方式中,乙醇耐受性酵母菌株被工程化以破坏控制营养缺陷型的一个或多个基因的拷贝,如LEU2、HIS3、URA3、URA5以及TRP1。在提供尿嘧啶营养缺陷型的情况下,例如,可以用从尿嘧啶营养缺陷型突变株(例如酿酒酵母MT-8菌株)中获得的ura3-片段置换乙醇耐受性酵母的正常ura3基因以破坏正常的ura3基因。在ura3基因被破坏的菌株的情况下,标记的存在/不存在可以容易地通过利用以下事实来鉴定或选择:ura3基因被破坏的菌株能够在含有5-氟乳清酸(5-FOA)的培养基中生长,而正常的ura3菌株(野生型酵母或通常的工业酵母)不能生长。在lys2基因被破坏的菌株的情况下,标记的存在/不存在可以容易地通过利用以下事实来鉴定或选择:lys2基因被破坏的菌株能够在含有α-氨基己二酸的培养基中生长,而正常的lys2菌株(野生型酵母或通常的工业酵母)不能生长。可以使用用于破坏营养缺陷型控制基因和选择性地分离营养缺陷型控制基因突变体的方法,这取决于所使用的营养缺陷型。或者,可以使用显性选择标记,如来自构巢曲霉的amdS(美国专利号5,876,988),其允许依靠乙酰胺作为唯一的氮源生长;或ARO4-OFP,其允许在氟苯丙氨酸存在下生长(Fukuda等)。这些标记可以使用可再循环的cre-loxP系统重复使用,或可选地可以用于产生允许利用另外的标记的营养缺陷型菌株。
在已经将宿主细胞进行工程化以提供用于引入异源性信号序列-葡糖淀粉酶基因的所期望的遗传背景之后,将基因构建体引入细胞中以允许表达。用于将基因构建体引入宿主细胞中的方法包括转化、转导、转染、共转染、电穿孔。具体来说,可以使用乙酸锂方法、原生质体方法等进行酵母转化。待引入的基因构建体可以质粒的形式或通过插入到宿主的基因中或经由与宿主的基因进行同源重组掺入染色体中。可以使用选择标记(例如如上所述的营养缺陷型标记)选择当中已经引入有基因构建体的转化的酵母。可以通过测量表达的蛋白质的活性来进行进一步确认。
可以使用本领域公知的方法来确认包括异源性信号序列-葡糖淀粉酶基因的外源性核酸序列的转化。这些方法包括例如核酸分析,如mRNA的RNA印迹或聚合酶链反应(PCR)扩增或基因产物表达的免疫印迹法或其它合适的分析方法以测试所引入的核酸序列或它的相应基因产物的表达。本领域技术人员应当了解的是,外源性核酸以足够的量表达以产生所期望的产物,并且进一步了解的是,表达水平可以使用本领域公知和如本文所公开的方法来优化以获得足够的表达。
本公开的工程化的酵母可以任何合适的形式提供。在一些方面,将非天然酵母脱水以形成干酵母组合物。所述干酵母组合物相对于湿组合物可以具有延长的保存期。
使用表达异源性信号序列-葡糖淀粉酶基因的宿主细胞的发酵可以在含有淀粉和/或糖的植物材料存在下进行,所述植物材料指的是可从任何植物和植物部分,如块茎、根、茎、叶以及种子获得的含有淀粉和/或糖的植物材料。包含淀粉和/或糖的植物材料可以从谷物获得,如大麦、小麦、玉米、黑麦、高粱、粟、大麦、马铃薯、木薯或水稻以及其任何组合。包含淀粉和/或糖的植物材料可以经过加工,如通过诸如碾磨、制麦芽或部分制麦芽的方法。在一些方面,所述淀粉材料来自玉米粉、碾磨的玉米胚乳、高粱粉、大豆粉、小麦粉、生物质衍生淀粉、大麦粉以及其组合。
在一些方面,所述发酵培养基包括处理过的淀粉。举例来说,所述发酵培养基可以包括部分水解淀粉。部分水解淀粉可以包括高分子量糊精和高分子量麦芽糊精。在一些实施方式中,在发酵培养基中使用具有约5至约95或更优选地约45至约65范围内的右旋糖当量(“DE”)的部分水解淀粉产品。部分水解淀粉和其制备是本领域公知的。部分水解淀粉可以通过将淀粉与诸如盐酸或硫酸的酸一起在高温下加热,然后用诸如碳酸钠的合适的碱中和水解混合物来制备。或者,部分水解淀粉可以通过酶促方法来制备,如通过将α-淀粉酶添加到淀粉制品中。α淀粉酶可以使含有三个或更多个(1→4)-α-连接的D-葡萄糖单元的多糖中的(1→4)-α-D-糖苷键发生内切水解。可以使用部分水解淀粉产品,其具有所期望的范围内的量的淀粉和淀粉降解产物。
如本文所用的“液化物(liquifact)”是已经经历液化的具有约10至约15范围内的右旋糖当量的玉米淀粉。可以使用玉米湿磨工艺来提供玉米浆,其可以用于发酵。可以将玉米粒浸泡,然后碾磨,并且分离成它们的主要成分级分。淡玉米浆是浸泡过程的副产物,并且含有可溶性蛋白质、氨基酸、有机酸、碳水化合物、维生素以及矿物质的混合物。
在本公开的方面,鉴于葡糖淀粉酶从工程化的酵母产生和分泌到发酵培养基中,发酵方法可以省去将纯化或富集的市售葡糖淀粉酶添加到培养基中,或至少允许在发酵方法中使用显著更少的市售葡糖淀粉酶。举例来说,本公开的工程化的酵母可以允许消除市售葡糖淀粉酶的添加或将市售葡糖淀粉酶的添加至少减少约50%、60%、70%、80%、90%或95%。通常,在不使用分泌葡糖淀粉酶的工程化的酵母的发酵方法中将使用每升约7个单位至约50个单位范围内的量的葡糖淀粉酶。
发酵培养基包括水并且优选地包括营养素,如氮源(如蛋白质)、维生素以及盐。在发酵培养基中还可以存在缓冲剂。在发酵一段时间之后在发酵液中还可以存在其它组分,如可以随着发酵进展而积聚的发酵产物和其它代谢物。任选地,发酵液可以用碱,如氢氧化钙或碳酸钙、氨或氢氧化铵、氢氧化钠或氢氧化钾缓冲以维持生物体良好运作的pH值。
本公开的工程化的酵母还可以用工程化的酵母的比生长速率来描述。酵母的生长速率可以由L=log(数量)定义,其中数量是相对于T(时间),每单位体积(mL)形成的酵母细胞的数量。
在使得发酵可以发生的条件下进行发酵。尽管条件可以根据特定的生物体和所期望的发酵产物而变化,但是典型的条件包括约20℃或更高,并且更通常在约30℃至约50℃范围内的温度。在发酵期间,可以混合或搅拌反应混合物。在一些实施方式中,混合或搅拌可以通过将气体喷射到发酵液中的机械作用而发生。或者,在发酵期间可以使用直接机械搅拌,如通过叶轮或通过其它手段。
本公开还提供了非天然酵母,所述非天然酵母能够在比其中在发酵过程中通常使用酵母,如酿酒酵母的温度更高的温度下生长和/或可以产生发酵产物。举例来说,酿酒酵母通常在约30℃的温度下具有最佳的生长。在与本公开相关的实验中,已经鉴定出对升高的温度具有更大的耐受性的酵母,如32℃或更高,如在高于32℃至约40℃的范围内。升高的温度的示例性范围是T1至T2,其中T1选自32.2℃、32.4℃、32.6℃、32.8℃、33℃、33.2℃、33.3℃、33.4℃、33.6℃、33.8℃、34℃、34.2℃、34.4℃、34.6℃、34.8℃、35℃以及36℃;并且T2选自36℃、37℃、38℃、39℃以及40℃。出于本公开的目的,如果酵母在暴露于具有升高的温度的发酵培养基期间或之后可以继续生长、繁殖和/或产生发酵产物,那么所述酵母被认为是“耐热的”。
在发酵过程期间,发酵培养基可以在发酵过程期间一段或多段时间期间达到32℃或更高的升高的温度。温度可以在发酵期的一部分期间或在整个发酵期期间升高。温度可以升高5分钟或更长时间、10分钟或更长时间、30分钟或更长时间、1小时或更长时间、2小时或更长时间、5小时或更长时间或10小时或更长时间。升高温度的时间也可以表示为占总发酵期的总和,如占发酵期的约0.1%至100%、约0.1%至约75%、约0.1%至约50%、约0.1%至约25%、约0.1%至约10%、约0.1%至约5%、约0.1%至约2.5%、约0.1%至约1%或约0.1%至约0.5%。
所述工程化的酵母还可以在升高温度的时间段期间或之后提供商业上相关的乙醇滴度。举例来说,在升高温度(例如对应于T1至T2的范围中的任一个)的时间段期间或之后,乙醇滴度可以在约110g/L至约170g/L的范围内、在约125g/L至约170g/L的范围内或在约140g/L至约170g/L的范围内。因此,本文所述的工程化的酵母可以在高温时间段期间或之后以商业上有用的滴度产生乙醇,这在用于产生乙醇的发酵过程中的其它目前可用的酵母菌株中通常将会导致问题。这些问题包括但不限于:显著百分比的酵母细胞死亡;对酵母繁殖能力的有害影响;和/或降低或消除酵母产生发酵产物的能力。
可以将具有异源性信号序列-葡糖淀粉酶基因的工程化的酿酒酵母置于温度选择压力下以选择对在更高温度下生长表现出增加的耐受性的菌株。在施加更高温度选择之前可以对工程化的酵母进行随机诱变(例如UV、化学)以产生可以赋予提高的对在这些更高温度下生长的耐受性的一个或多个突变。举例来说,本公开的工程化的酵母在32℃或更高的范围内的温度下可以具有比参考酵母的生长速率大10%、20%、30%、40%或50%的比生长速率。
在一些情况下,在工业容量发酵罐中进行发酵以实现商业规模的经济效益和控制。在一个方面,在具有约10,000升或更大容量的发酵罐中进行发酵。
可以调节发酵培养基的pH值以为葡糖淀粉酶活性、细胞生长以及发酵活性提供最佳条件以提供所期望的产物,如乙醇。举例来说,可以将溶液的pH值调节到3至5.5的范围内。在一个实施方式中,发酵培养基的pH值在4至4.5的范围内。
如上所述,本发明的发酵方法使用表达异源性信号序列-葡糖淀粉酶基因并且能够将所产生的酶分泌到发酵培养基中的遗传修饰的微生物。这些酶因此直接暴露于发酵液条件并且影响发酵培养基中的碳水化合物组成。在发酵培养基中,葡糖淀粉酶可以通过切割α-(1,4)糖苷键和α-(1,6)糖苷键来引起D-葡萄糖从淀粉或相关低聚糖和多糖分子的非还原末端水解和释放。
淀粉也可能受到发酵培养基中存在的一种或多种其它淀粉酶(例如α-淀粉酶)的作用。举例来说,如果在发酵培养基中存在α-淀粉酶,那么它可以通过水解内部α-(1,4)-键来引起前体淀粉的部分水解并且引起淀粉分子的部分分解。
在一些实施方式中,发酵作为单批进行直到完成为止。在其它实施方式中,发酵作为补料分批发酵过程进行。在该实施方式中,将待发酵的淀粉材料的总量的第一部分添加到发酵培养基中,其中葡糖淀粉酶作用于淀粉以引起用作发酵底物的葡萄糖形成。另外的淀粉材料可以分一个或多个部分添加以为培养基中的葡糖淀粉酶提供更多的底物。可以调节淀粉的添加并且可以监测葡萄糖的形成以提供高效的发酵。
优选的是,发酵以连续操作模式进行。在该模式中,多个发酵罐串联操作,其中在第一发酵罐中供应淀粉水解产物,将其供给到第二发酵罐中,依此类推,直到所述淀粉水解产物被转化成乙醇为止。可以使用2-7个发酵罐操作连续操作。
在一些实施方式中,使用可变速率添加系统将淀粉材料的总量的一部分添加到发酵液中。这样的系统的实例包括变速泵或与泵可操作地连接的计量阀(如节流阀),所述泵或阀门可以用于随着时间推移改变引入发酵液中的淀粉材料的量。在一些实施方式中,在添加一部分淀粉材料期间,通过实时监测系统监测葡萄糖浓度。
实时监测系统包括直接监测葡萄糖浓度的系统和间接监测葡萄糖浓度的系统。通常直接监测葡萄糖浓度的实时监测系统的实例包括基于红外(IR)光谱法的系统、近红外(NIR)光谱系统、傅里叶变换红外(Fourier transform infrared,FTIR)系统、基于折射率的系统、自动化基于酶的测量系统,如由YSI Life Sciences公司出售的YSI 2950生物化学分析仪系统;基于高效液相色谱法(HPLC)的系统、基于气相色谱法(GC)的系统以及本领域技术人员已知的其它实时监测系统。此外,间接监测/测量发酵过程的葡萄糖浓度的实时监测系统可以通过确定特定发酵过程中典型的碳分布并且将发酵液中存在的葡萄糖浓度与由发酵表现出的另一参数相关联,例如像发酵液中存在的葡萄糖水平与二氧化碳析出速率的测量值和来自发酵容器的废气流中存在的二氧化碳的量的相关性来开发。二氧化碳可以容易地经由使用质谱仪或用于测量废气流的组分的其它合适的仪器技术来测量。在一个优选的方面,通过使用红外光谱法的实时监测系统监测葡萄糖浓度。在另一个方面,通过使用近红外光谱法的实时监测系统监测葡萄糖浓度。实时监测系统与控制淀粉材料向发酵液中的引入以将发酵液中葡萄糖的形成调节到所期望的浓度的设备进行交互。
在发酵过程期间,可以采集发酵培养基的样品以确定培养基中葡糖淀粉酶活性的量。培养基中葡糖淀粉酶活性的量可以被称为细胞外葡糖淀粉酶活性,这是因为它对应于从工程化的酵母中分泌的葡糖淀粉酶。在一些测量模式中,培养基中葡糖淀粉酶活性的量可以通过每单位体积的培养基每单位量的生物质的葡糖淀粉酶活性的量来确定。
如本文所用的“生物质”指的是工程化的酵母的重量,它可以每升培养基的干细胞重量(DCW/L)的克数来测量。
GA活性的单位(U)可以被定义为催化从淀粉中释放1mg葡萄糖/分钟的酶的量。葡糖淀粉酶活性可以在浓缩发酵液中通过在两步终点测定中将淀粉水解与HXK/G6PDH反应混合物(Sigma G3293)相关联来测量。可以将发酵液从使用非葡萄糖碳源(即棉子糖)生长的预定量的细胞浓缩以避免干扰测定。
比活性等于给定体积的发酵液中的活性除以相同体积的发酵液中细胞的湿重。比活性具有以下单位,即每克生物质的GA活性的单位数(U/g生物质)。用于测定的生物质的量可以通过在通过过滤或离心去除发酵液之后确定湿细胞重量来测量。
通过将1.1g的玉米淀粉(S4126,Sigma公司)溶解在50mL近沸水中,然后添加1mL的3M乙酸钠(pH 5.2)来制备淀粉溶液。将通常在1ul-20ul范围内的体积的浓缩发酵液(Vb)(通过使用10Kb Kd截止柱制备,Millipore#UFC901008)添加到淀粉浆(Vs)中,总体积是200ul,并且允许在37℃孵育特定的时间(T),通常是5分钟-60分钟。选择参数以使得葡萄糖形成在所期望的时间内是线性的。将20μL的每一个样品添加到2μL的0.6N NaOH中并且充分混合。然后添加200μL的HXK/G6PDH混合物并且在30℃孵育30分钟。使用分光光度计(SpectraMaxTM M2)测量在340nm的吸光度。利用使用已知的葡萄糖标准的回归分析来计算每一个样品中释放的葡萄糖的量。每克生物质的比酶活性(U/g生物质)可以通过获得在浓缩之前使用的样品的重量(以克为单位)来计算。活性的单位数=(葡萄糖的毫克数/T)×((Vb+Vs)/(Vb))×(222/20)。比活性=活性的单位数/生物质的克数。
在一些方面,在发酵方法中,培养基具有每克生物质2.25U或更大量的葡糖淀粉酶活性。在一些方面,培养基具有以下量的葡糖淀粉酶活性:每克生物质约2.3U或更大、约2.35U或更大、约2.4U或更大、约2.45U或更大、约2.5U或更大、约2.6U或更大、约2.7U或更大、约2.8U或更大、约2.9U或更大、约3U或更大、约3.5U或更大、约4U或更大、约4.5U或更大、约5U或更大、约5.5U或更大、约6U或更大、约6.5U或更大、约7U或更大、约7.5U或更大或约8U或更大。在一些方面,培养基具有以下范围内的量的葡糖淀粉酶活性:每克生物质约2.3U至约15U、约2.4U至约15U、约2.5U至约15U、约3U至约15U、约3.5U至约15U、约4U至约15U、约4.5U至约15U、约5U至约15U、约5.5U至约15U、约6U至约15U、约6.5U至约15U、约7U至约15U、约7.5U至约15U或约8U至约15U。
在其它方面,发酵培养基中由本公开的非天然酵母提供的葡糖淀粉酶活性的量可以相对于参考酵母来描述。举例来说,可以将表达具有异源性信号序列的外源性葡糖淀粉酶(例如与SEQ ID NO:52具有90%或更大的同一性)的非天然酵母的葡糖淀粉酶活性的量与表达具有天然信号序列的外源性葡糖淀粉酶的在其它方面相同的酵母进行比较。
在一些方面,表达具有异源性信号序列的外源性葡糖淀粉酶的非天然酵母在发酵培养基中提供的葡糖淀粉酶活性的量是参考酵母的至少1.125倍(比参考酵母大12.5%)。在一些方面,非天然酵母中葡糖淀粉酶活性的量是参考酵母的至少1.15倍、至少1.175倍、至少1.225倍、至少1.25倍、至少1.3倍、至少1.35倍、至少1.4倍、至少1.45倍、至少1.5倍、至少1.75倍、至少2倍、至少2.25倍、至少2.5倍、至少2.75倍、至少3倍、至少3.25倍、至少3.5倍、至少3.75倍或至少4倍。在一些方面,相对于参考酵母,由非天然酵母提供的葡糖淀粉酶活性的量在以下范围内:所述非天然酵母中葡糖淀粉酶活性的量是参考酵母的约1.15倍至约7.5倍、约1.175倍至约7.5倍、约1.225倍至约7.5倍、约1.25倍至约7.5倍、约1.3倍至约7.5倍、约1.35倍至约7.5倍、约1.4倍至约7.5倍、约1.45倍至约7.5倍、约1.5倍至约7.5倍、约1.75倍至约7.5倍、约2倍至约7.5倍、约2.25倍至约7.5倍、约2.5倍至约7.5倍、约2.75倍至约7.5倍、约3倍至约7.5倍、约3.25倍至约7.5倍、约3.5倍至约7.5倍、约3.75倍至约7.5倍或约4倍至约7.5倍。
可以在发酵期间所期望的时间点对发酵培养基中的葡糖淀粉酶活性进行测量。举例来说,可以在发酵过程的中途约1/10、约2/10、约3/10、约4/10、约5/10、约6/10、约7/10、约8/10、约9/10时或在发酵过程结束时从发酵培养基中采集样品,并且可以测试样品的葡糖淀粉酶活性。
在一些实施方式中,发酵期是约30小时或更长时间、约40小时或更长时间、约50小时或更长时间或约60小时或更长时间,如约40小时至约120小时或50小时至约110小时范围内的时间段。
发酵产物(在本文也被称为“生物衍生产物”或“生物产物”)可以是可以通过葡糖淀粉酶对淀粉材料进行酶促降解,形成葡萄糖并且进行葡萄糖发酵所制备的任何产物。在各方面,在一个实施方案中,发酵产物选自由以下组成的组:氨基酸、有机酸、醇、二醇、多元醇、脂肪酸、脂肪酸烷基酯(如脂肪酸甲酯或乙酯(例如C6至C12脂肪酸甲酯(优选的是C8至C10脂肪酸甲酯)))、单酰基甘油酯、二酰基甘油酯、三酰基甘油酯以及其混合物。优选的发酵产物是有机酸、氨基酸、脂肪酸烷基酯(如脂肪酸甲酯(例如C8至C12脂肪酸甲酯(优选的是C8至C10脂肪酸甲酯)))以及它们的盐,并且特别是其中所述有机酸选自由以下组成的组:羟基羧酸(包括单羟基和二羟基一元羧酸、二元羧酸以及三元羧酸)、一元羧酸、二元羧酸和三元羧酸以及其混合物。通过本发明的方法制备的发酵产物的实例是有机酸或氨基酸,如乳酸、柠檬酸、丙二酸、羟基丁酸、己二酸、赖氨酸、酮戊二酸、戊二酸、3-羟基-丙酸、丁二酸、苹果酸、反丁烯二酸、衣康酸、粘康酸、甲基丙烯酸、乙酸、己酸甲酯、辛酸甲酯、壬酸甲酯、癸酸甲酯、十二烷酸甲酯、己酸乙酯、辛酸乙酯、壬酸乙酯、癸酸乙酯、十二烷酸乙酯以及其混合物和其衍生物和其盐。在一个优选的方面,本公开的发酵方法产生乙醇作为生物产物。
从发酵液中回收发酵产物。实现此目的的方式将取决于特定产物。然而,在一些实施方式中,通常经由过滤步骤或离心步骤将生物体与液相分离,并且经由例如蒸馏、萃取、结晶、膜分离、渗透、反渗透或其它合适的技术来回收产物。
本发明的方法提供了在生产规模水平上以优异的收率和纯度制备发酵产物的能力。在一个方面,在至少25,000加仑的发酵液量中进行所述方法。在一个方面,进行分批方法以产生至少25,000加仑的最终发酵液的批次。在一些方面,所述方法是连续方法,在至少200,000加仑的容器中进行。
包含异源性分泌信号-葡糖淀粉酶的组合物可以任选地与以下与葡糖淀粉酶不同的酶中的任一种或任何组合一起组合使用。示例性的其它酶包括α淀粉酶、β-淀粉酶、肽酶(蛋白酶、朊酶、内肽酶、外肽酶)、支链淀粉酶、异淀粉酶、纤维素酶、半纤维素酶、葡聚糖内切酶和相关的β-葡聚糖水解辅助酶、木聚糖酶和木聚糖酶辅助酶、乙酰乳酸脱羧酶、环糊精糖基转移酶、脂肪酶、植酸酶、漆酶、氧化酶、酯酶、角质酶、颗粒淀粉水解酶以及其它葡糖淀粉酶。
在一些方面,异源性分泌信号-葡糖淀粉酶可以用于淀粉转化过程,诸如以产生用于果糖浆的右旋糖、特种糖以及醇和其它终产物(例如有机酸、抗坏血酸以及氨基酸)。通过使用本公开的葡糖淀粉酶对淀粉底物进行发酵来生产醇可以包括生产燃料酒精或饮用酒精。
与亲本或野生型葡糖淀粉酶相比,当在相同的条件下使用异源性分泌信号-葡糖淀粉酶时,醇的产量可以是更大的。举例来说,使用本公开的葡糖淀粉酶,醇产量可以增加到野生型菌株中的醇产量的1.1倍或更大、1.2倍或更大、1.3倍或更大、1.4倍或更大、1.5倍或更大、1.6倍或更大、1.7倍或更大、1.7倍或更大、1.8倍或更大、1.9倍或更大、2.0倍或更大、2.1倍或更大、2.2倍或更大、2.3倍或更大、2.4倍或更大或2.5倍或更大。
在一些方面,本公开提供了一种用于通过发酵生产乙醇的方法,其中乙醇以90g/L或更大的浓度存在于发酵培养基中。在所述方法中,将包含淀粉材料和包含编码多肽的外源性核酸的非天然酵母的液体培养基发酵,所述多肽包含葡糖淀粉酶部分和与所述葡糖淀粉酶异源的信号序列。发酵可以在液体培养基中提供约90g/L或更大的乙醇浓度,如在约90g/L至约170g/L的范围内、在约110g/L至约170g/L的范围内、在约125g/L至约170g/L的范围内或在约140g/L至约170g/L的范围内。
所述方法包括将包含淀粉材料和包含编码多肽的外源性核酸的非天然酵母的液体培养基发酵,所述多肽包含葡糖淀粉酶部分和与所述葡糖淀粉酶异源的信号序列,其中所述发酵在液体培养基中提供90g/L或更大的乙醇浓度。
乙醇质量收率可以通过用乙醇浓度除以消耗的总葡萄糖来计算。由于葡萄糖可以作为游离葡萄糖存在或被束缚在低聚物中,因此需要考虑这两者。为了确定在发酵开始和结束时存在的总葡萄糖,确定总葡萄糖当量测量值。总葡萄糖当量测量如下。使用HPLC,使用RI检测来测量葡萄糖。使用Bio Rad 87H柱,使用10mM H2SO4流动相完成分离。对于每一个样品,按一式三份测量葡萄糖。在6%(v/v)三氟乙酸中,在121℃按一式三份进行酸水解,持续15分钟。通过相同的HPLC方法测量在水解之后所得的葡萄糖。每一个样品中存在的总葡萄糖当量是在酸水解之后测量的葡萄糖的量。消耗的总葡萄糖是通过将在发酵结束时存在的总葡萄糖当量从在发酵开始时存在的总葡萄糖当量中减去来计算的。
使用本公开的非天然酵母还可以在以下方面提供益处:滴度增加、挥发性有机酸(VOC)减少以及杂醇油化合物(挥发性有机酸、高级醇、醛、酮、脂肪酸以及酯)减少。
可以首先用处理系统的一种或多种试剂处理发酵产物。然后可以将经过处理的发酵产物送到蒸馏系统中。在蒸馏系统中,可以将发酵产物蒸馏并且脱水成乙醇。在一些方面,从发酵培养基中去除的组分包括水、可溶性组分、油以及未发酵的固体。这些组分中的一些可以用于其它目的,如用于动物饲料产品。可以从釜馏物中回收其它副产品,例如糖浆。
本公开还提供了一种用于生产食品、饲料或饮料产品的方法,如酒精饮料或非酒精饮料,如基于谷物或麦芽的饮料,如啤酒或威士忌,如葡萄酒、苹果酒、醋、米酒、酱油或果汁,所述方法包括以下步骤:用如本文所述的组合物处理含有淀粉和/或糖的植物材料。在另一个方面,本发明还涉及一种试剂盒,所述试剂盒包括本公开的葡糖淀粉酶或如本文所考虑的组合物;以及所述葡糖淀粉酶或组合物的使用说明书。本发明还涉及一种通过使用葡糖淀粉酶的方法生产的发酵饮料。
在发酵过程完成后,发酵培养基中存在的物质可以是有用的。在一些方面,在发酵过程已经完成后,或在发酵过程正在进行时,可以从发酵培养基中去除一些或所有生物产物以提供包含非生物产物固体的精制组合物。所述非生物产物固体是非天然酵母、培养基中未被酵母利用的原料材料以及发酵副产物。这些材料可以提供可用作补充剂以提高饲料组合物的营养含量的碳水化合物和蛋白质的来源。所述饲料材料可以是来自发酵过程的副产品,如釜馏物(全釜馏物、稀釜馏物等)或由其制备的组合物,包括干酒糟(DDG)、含有可溶物的干酒糟(DDGS)、湿酒糟(DWG)以及酒糟可溶物(DS)。
可以进一步处理发酵培养基(任选地去除了一些或所有目标生物产物),诸如以去除水或使得非生物产物固体从培养基中沉淀或分离。在一些情况下,通过冷冻干燥或烘箱干燥处理培养基。在处理之后,精制组合物可以呈例如液体浓缩物、半湿饼或干燥固体的形式。精制组合物可以用作饲料组合物本身或制备饲料组合物的成分。在优选的制品中,饲料组合物是牲畜饲料组合物,如用于羊、牛、猪等。
发酵培养基中的固体可以提供一种或多种氨基酸的来源。通过引入动物饲料中,所述发酵副产品可以提供提高的与一种或多种必需氨基酸相关的氨基酸含量。必需氨基酸可以包括组氨酸、异亮氨酸、赖氨酸、甲硫氨酸、苯丙氨酸、苏氨酸以及色氨酸。这些氨基酸可以作为游离氨基酸存在于饲料组合物中或可以源自于富含所述氨基酸的蛋白质或肽。发酵培养基中的固体可以提供一种益生元的来源,其是不易消化的食物物质,如不易消化的低聚糖,选择性地刺激肠道中有利的细菌菌种的生长,从而有益于宿主。发酵培养基中的固体可以提供植酸酶、β-葡聚糖酶、蛋白酶以及木聚糖酶的来源。
所述饲料组合物可以用于水产养殖,所述水产养殖是养殖水生生物体,如鱼类、贝类或植物。水产养殖包括养殖海洋物种和淡水物种这两者并且可以从基于陆地的生产到开放式海洋生产不等。
除了从发酵培养基中获得的材料之外,饲料组合物还可以包括一种或多种饲料添加剂。饲料添加剂可以用于例如有助于提供均衡膳食(例如维生素和/或痕量矿物质)、保护动物防止疾病和/或胁迫(例如抗生素、益生菌)和/或刺激或控制生长和行为(例如激素)。添加剂产品成分可以包括例如:生长促进剂、药用物质、缓冲剂、抗氧化剂、酶、防腐剂、球团粘结剂、直接饲喂微生物等。添加剂产品成分还可以包括例如离子载体(例如莫能星(monesin)、拉沙里菌素(lasalocid)、莱特洛霉素(laidlomycin)等)、β-激动剂(齐帕特罗(zilpaterol)、莱克多巴胺(ractompamine)等)、抗生素(例如氯四环素(chlortetracycline,CTC)、氧四环素(oxytetracycline)、杆菌肽(bacitrain)、泰乐菌素(tylosin)、金霉素(aureomycin))、益生菌和酵母培养物、抗球虫剂(例如氨丙啉(amprollium)、癸氧喹酯(decoquinate)、拉沙里菌素、莫能菌素(monensin))以及激素(例如生长激素或抑制发情和/或排卵的激素,如醋酸美伦孕酮(melengestrol acetate))、信息素、营养药物、药物、类黄烷、营养和非营养补充剂、解毒剂等。一些可商购获得的添加剂是以商品名以及出售的。
实施例1:产生酿酒酵母基础菌株
将菌株1用SEQ ID NO 1转化。SEQ ID NO 1含有以下元件:与整合基因座A的5'同源区(1bp-436bp)、loxP重组位点(445bp-478bp)、来自酿酒酵母的3-脱氧-D-阿拉伯-庚酮糖酸-7-磷酸(DAHP)合酶基因的突变型式(ARO4-OFP)的表达盒(479bp-20647bp)、loxP重组位点(2648bp-2681bp)以及与整合基因座A的3'同源区(2691bp-3182bp)。在含有3.5g/L的对氟苯丙氨酸和1g/L的L-酪氨酸的合成完全培养基(ScD-PFP)上选择转化体。在ScD-PFP上将所得的转化体划线接种以用于单菌落分离。选择单菌落。在单菌落中通过PCR验证SEQ IDNO 1正确整合到基因座A的一个等位基因中。PCR验证的分离株被命名为菌株1-1。
将菌株1-1用SEQ ID NO 2转化。SEQ ID NO 2含有以下元件:与整合基因座A的5'同源区(1bp-435bp)、loxP重组位点(444bp-477bp)、来自构巢曲霉的乙酰胺酶(amdS)基因的表达盒(478bp-2740bp)、loxP重组位点(2741bp-2774bp)、与整合基因座A的3'同源区(2783bp-3275bp)。在含有80mg/L尿嘧啶和作为唯一氮源的1g/L乙酰胺的酵母氮源基础(Yeast Nitrogen Base)(不含硫酸铵或氨基酸)(YNB+乙酰胺+尿嘧啶)上选择转化体。在YNB+乙酰胺+尿嘧啶平板上将所得的转化体划线接种以用于单菌落分离。选择单菌落。在单菌落中通过PCR验证SEQ ID NO 2正确整合到基因座A的第二等位基因中。PCR验证的分离株被命名为菌株1-2。
将菌株1-2用SEQ ID NO 3和SEQ ID NO 4共转化。SEQ ID NO 3含有以下元件:来自P1噬菌体的cre重组酶的开放阅读框(52bp-1083bp)以及与SEQ ID NO 4同源的侧接DNA。SEQ ID NO 4含有以下元件:来自酿酒酵母的CYC1终止子(10bp-199bp)、2μ复制起点(2195bp-3350bp)、来自酿酒酵母的URA3选择标记(3785bp-4901bp)以及来自酿酒酵母的PGK启动子(5791bp-6376bp)。在缺乏尿嘧啶的合成缺陷型培养基(ScD-Ura)上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。筛选分离的菌落以在ScD-PFP和YNB+乙酰胺+尿嘧啶上生长。通过PCR验证ARO4-OFP和amdS基因的丧失。将PCR验证的分离株划线接种到含有1g/L 5-氟乳清酸的YNB上以针对2μ质粒的丧失进行选择。PCR验证的分离株被命名为菌株1-3。
实施例2:构建表达修饰的真菌葡糖淀粉酶的菌株。
将菌株1-3用SEQ ID NO 5和SEQ ID NO 6共转化。SEQ ID NO 5含有以下元件:含有ScCYC1终止子的线性化质粒(4-227)、ScURA3表达盒(952bp-2049bp)、用于稳定复制的CEN6着丝粒(2308bp-2826bp)、β-内酰胺酶(2958bp-3815bp)以及ScTDH3启动子(5052bp-5734bp)。SEQ ID NO 6含有以下元件:与SEQ ID NO 5的同源区(1bp-44bp)和表达来自白宇佐美曲霉的密码子优化的葡糖淀粉酶的开放阅读框(50bp-1969bp)以及与SEQ ID NO 5的同源区(1975bp-2017bp)。在ScD-Ura上选择转化体。通过将菌落从ScD-Ura转化平板划线接种到相似的培养基上并且在30℃将平板孵育1天-2天直到出现单菌落为止来获得来自每一次单独ScD-Ura转化的单菌落分离株。一个分离株被保存为菌株1-4。
将菌株1-3用SEQ ID NO 5和SEQ ID NO 7共转化。SEQ ID NO 7含有以下元件:与SEQ ID NO 5的同源区(1bp-44bp)和表达具有修饰的分泌信号的来自白宇佐美曲霉的密码子优化的葡糖淀粉酶的开放阅读框(50bp-1966bp)和与SEQ ID NO 5的同源区(1972bp-2014bp)。在ScD-Ura上选择转化体。通过将菌落从ScD-Ura转化平板划线接种到相似的培养基上并且在30℃将平板孵育1天-2天直到出现单菌落为止来获得来自每一次单独ScD-Ura转化的单菌落分离株。一个分离株被保存为菌株1-5。
将菌株1-3用SEQ ID NO 5和SEQ ID NO 8共转化。SEQ ID NO 8含有以下元件:与SEQ ID NO 5的同源区(1bp-44bp)和表达具有修饰的分泌信号的来自白宇佐美曲霉的密码子优化的葡糖淀粉酶的开放阅读框(50bp-1972bp)和与SEQ ID NO 5的同源区(1978bp-2020bp)。在ScD-Ura上选择转化体。通过将菌落从ScD-Ura转化平板划线接种到相似的培养基上并且在30℃将平板孵育1天-2天直到出现单菌落为止来获得来自每一次单独ScD-Ura转化的单菌落分离株。一个分离株被保存为菌株1-6。
将菌株1-3用SEQ ID NO 5和SEQ ID NO 9共转化。SEQ ID NO 9含有以下元件:与SEQ ID NO 5的同源区(1bp-44bp)和表达来自土曲霉的密码子优化的葡糖淀粉酶的开放阅读框(50bp-1960bp)和与SEQ ID NO 5的同源区(1966bp-2008bp)。在ScD-Ura上选择转化体。通过将菌落从ScD-Ura转化平板划线接种到相似的培养基上并且在30℃将平板孵育1天-2天直到出现单菌落为止来获得来自每一次单独ScD-Ura转化的单菌落分离株。一个分离株被保存为菌株1-7。
将菌株1-3用SEQ ID NO 5和SEQ ID NO 10共转化。SEQ ID NO 10含有以下元件:与SEQ ID NO 5的同源区(1bp-44bp)和表达具有修饰的分泌信号的来自土曲霉的密码子优化的葡糖淀粉酶的开放阅读框(50bp-1951bp)和与SEQ ID NO 5的同源区(1957bp-1999bp)。在ScD-Ura上选择转化体。通过将菌落从ScD-Ura转化平板划线接种到相似的培养基上并且在30℃将平板孵育1天-2天直到出现单菌落为止来获得来自每一次单独ScD-Ura转化的单菌落分离株。一个分离株被保存为菌株1-8。
将菌株1-3用SEQ ID NO 5和SEQ ID NO 11共转化。SEQ ID NO 11含有以下元件:与SEQ ID NO 5的同源区(1bp-44bp)和表达具有修饰的分泌信号的来自土曲霉的密码子优化的葡糖淀粉酶的开放阅读框(50bp-1957bp)和与SEQ ID NO 5的同源区(1963bp-2005bp)。在ScD-Ura上选择转化体。通过将菌落从ScD-Ura转化平板划线接种到相似的培养基上并且在30℃将平板孵育1天-2天直到出现单菌落为止来获得来自每一次单独ScD-Ura转化的单菌落分离株。一个分离株被保存为菌株1-9。
将菌株1-3用SEQ ID NO 12共转化。SEQ ID NO 12含有与SEQ ID NO 5中的元件相同的元件,不同的是DNA呈环状形式。在ScD-Ura上选择转化体。通过将菌落从ScD-Ura转化平板划线接种到相似的培养基上并且在30℃将平板孵育1天-2天直到出现单菌落为止来获得来自每一次单独ScD-Ura转化的单菌落分离株。一个分离株被保存为菌株1-10。
实施例3:表达修饰的真菌葡糖淀粉酶的酵母菌株的小规模发酵
将菌株1-4至菌株1-10接种到ScD-Ura平板上并且在30℃孵育直到单菌落可见为止(1天-2天)。将单菌落接种到15ml falcon培养管中所容纳的2ml培养基(由850g液化物、150g过滤灭菌的淡玉米浆、25g葡萄糖、1g尿素组成)中。将每一个管放置在旋转振荡器中,以100rpm搅拌并且温度是30℃。在48小时之后,采集样品并且通过HPLC分析以定量乙醇产量(表3)。表3表明所测试的这两种真菌葡糖淀粉酶都能够比空载体对照发酵更多的乙醇,并且白宇佐美曲霉葡糖淀粉酶和土曲霉葡糖淀粉酶这两者都受益于前导序列修饰。
表3.示出了来自小规模管发酵的乙醇滴度的表。该数据证实了前导序列修饰对乙醇滴度的有益作用。
菌株 基因描述 信号序列 乙醇滴度(g/l)
1-4 白宇佐美曲霉葡糖淀粉酶 天然 58.953
1-5 白宇佐美曲霉葡糖淀粉酶 酿酒酵母Pho5 89.401
1-6 白宇佐美曲霉葡糖淀粉酶 酿酒酵母Mfα2 73.811
1-7 土曲霉葡糖淀粉酶 天然 40.357
1-8 土曲霉葡糖淀粉酶 酿酒酵母Pho5 46.082
1-9 土曲霉葡糖淀粉酶 酿酒酵母Mfα2 73.530
1-10 无葡糖淀粉酶 NA 20.990
实施例4:用表达野生型和信号序列修饰的米根霉葡糖淀粉酶的质粒转化菌株1-3
将菌株1-3用SEQ ID NO 5和SEQ ID NO 13共转化。SEQ ID NO 5含有以下元件:ScCYC1终止子(4-227)、含有ScURA3表达盒的线性化质粒(952bp-2049bp)、用于稳定复制的CEN6着丝粒(2308bp-2826bp)、β-内酰胺酶(2958bp-3815bp)以及ScTDH3启动子(5052bp-5734bp)。SEQ ID NO 13含有以下元件:与SEQ ID NO 5的同源区和表达来自米根霉的密码子优化的葡糖淀粉酶的开放阅读框和与SEQ ID NO 5的同源区。在ScD-Ura上选择转化体并且复制平板接种到ScD-Ura和Sc-Ura 1%淀粉(w/v)上。所得的平板示于图1行A中。
将菌株1-3用SEQ ID NO 5和SEQ ID NO 14共转化。SEQ ID NO 14含有以下元件:与SEQ ID NO 5的同源区、和表达具有修饰的分泌信号的来自米根霉的密码子优化的葡糖淀粉酶的开放阅读框和与SEQ ID NO 5的同源区。在ScD-Ura上选择转化体并且复制平板接种到ScD-Ura和Sc-Ura 1%淀粉(w/v)上。所得的平板示于图1行B中。
将菌株1-3用SEQ ID NO 5和SEQ ID NO 15共转化。SEQ ID NO 15含有以下元件:与SEQ ID NO 5的同源区、和表达具有修饰的分泌信号的来自米根霉的密码子优化的葡糖淀粉酶的开放阅读框和与SEQ ID NO 5的同源区。在ScD-Ura上选择转化体并且复制平板接种到ScD-Ura和Sc-Ura 1%淀粉(w/v)上。所得的平板示于图1行C中。
将菌株1-3用SEQ ID NO 5和SEQ ID NO 16共转化。SEQ ID NO 16含有以下元件:与SEQ ID NO 5的同源区、和表达具有修饰的分泌信号的来自米根霉的密码子优化的葡糖淀粉酶的开放阅读框和与SEQ ID NO 5的同源区。在ScD-Ura上选择转化体并且复制平板接种到ScD-Ura和Sc-Ura 1%淀粉(w/v)上。所得的平板示于图1行D中。
结果示于图1中。该结果表明由前导序列修饰引起生长的改良。
实施例5:表达野生型或修饰的米根霉葡糖淀粉酶的菌株
将菌株1-3用SEQ ID NO 17转化。SEQ ID NO 17含有:ScTDH3启动子(6bp-688bp)、具有修饰的分泌信号的来自米根霉的密码子优化的葡糖淀粉酶(695bp-2491bp)、ScCYC1终止子(2500bp-2723bp)、ScURA3表达盒(3448bp-4545bp)、用于稳定复制的着丝粒CEN6(4804bp-5322bp)以及β-内酰胺酶(5454bp-6311bp)。在缺乏尿嘧啶的合成缺陷型培养基(ScD-Ura)上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。PCR验证的分离株被保存为菌株1-4。
将菌株1-3用SEQ ID NO 18转化。SEQ ID NO 18含有:ScTDH3启动子(1bp-683bp)、具有天然分泌信号的来自米根霉的密码子优化的葡糖淀粉酶(685bp-2509bp)、ScCYC1终止子(2513bp-2736bp)、ScURA3表达盒(3461bp-4558bp)、用于稳定复制的着丝粒CEN6(4817bp-5335bp)以及β-内酰胺酶(5467bp-6324bp)。在缺乏尿嘧啶的合成缺陷型培养基(ScD-Ura)上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。PCR验证的分离株被保存为菌株1-5。
将菌株1-3用SEQ ID NO 12转化。在缺乏尿嘧啶的合成缺陷型培养基(ScD-Ura)上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。PCR验证的分离株被保存为菌株1-6。
实施例6:比较表达野生型或修饰的米根霉葡糖淀粉酶的菌株1-4和菌株1-5的SSF发酵
将菌株1-4、菌株1-5以及菌株1-6接种到ScD-Ura平板上并且在30℃孵育直到单菌落可见为止(1天-2天)。将来自ScD-Ura平板的细胞刮到无菌摇瓶培养基中并且测量光密度(OD600)。使用型号Genesys20分光光度计(Thermo Scientific公司)用1cm光程长度在600nm的波长下测量光密度。将摇瓶用细胞浆液接种以达到0.1-0.3的初始OD600。在即将接种之前,将50mL摇瓶培养基添加到250mL无挡板的摇瓶(Corning 4995-250)中,所述摇瓶装有含有透气密封件的螺旋盖(corning 1395-45LTMC)。摇瓶培养基由725g呈液化物形式的部分水解玉米淀粉、150g过滤的淡玉米浆、50g水、25g葡萄糖以及1g尿素组成。将用于每一种菌株的一式两份烧瓶在30℃在轨道式振荡器中以100rpm振荡下孵育69小时。在发酵期间采集样品并且通过HPLC分析发酵液中的相关代谢物浓度。乙醇产生曲线示于图2中。这表明野生型米根霉葡糖淀粉酶不具功能,这是因为它与缺乏葡糖淀粉酶的对照菌株产生相当的乙醇。这还表明分泌信号修饰显著提高乙醇产生。
实施例7:评价对米根霉葡糖淀粉酶的另外的前导序列修饰
将菌株1-3用SEQ ID NO 19转化。SEQ ID NO 19含有以下元件:对应于核苷酸695-2701的含有来自α交配因子(FAKS)的N末端分泌前导序列的来自米根霉的修饰的葡糖淀粉酶基因的表达盒,包括对应于核苷酸6-668的TDH3启动子和对应于核苷酸2710-2933的CYC1终止子;对应于核苷酸5014-5532的允许稳定复制的着丝粒(CEN6)以及对应于核苷酸4755-3739的乳清酸核苷-5'-磷酸脱羧酶(URA3)的表达盒。SEQ ID NO 19还包括对应于核苷酸5664-6521的氨苄西林抗性基因。在ScD-Ura上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。通过PCR验证质粒的存在。PCR验证的分离株被命名为菌株1-7。
将菌株1-3用SEQ ID NO 20转化。SEQ ID NO 20含有以下元件:对应于核苷酸695-2617的含有来自α交配因子(AKS)的N末端分泌前导序列的来自米根霉的修饰的葡糖淀粉酶基因的表达盒,包括对应于核苷酸6-668的TDH3启动子和对应于核苷酸2626-2849的CYC1终止子;对应于核苷酸4930-5448的允许稳定复制的着丝粒(CEN6)以及对应于核苷酸4671-3655的乳清酸核苷-5'-磷酸脱羧酶(URA3)的表达盒。SEQ ID NO 20还包括对应于核苷酸5580-6437的氨苄西林抗性基因。在ScD-Ura上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。通过PCR验证质粒的存在。PCR验证的分离株被命名为菌株1-8。
将菌株1-3用SEQ ID NO 21转化。SEQ ID NO 21含有以下元件:对应于核苷酸695-2605的含有来自α交配因子(AK)的N末端分泌前导序列的来自米根霉的修饰的葡糖淀粉酶基因的表达盒,包括对应于核苷酸6-668的TDH3启动子和对应于核苷酸2614-2837的CYC1终止子;对应于核苷酸4918-5436的允许稳定复制的着丝粒(CEN6)以及对应于核苷酸4659-3643的乳清酸核苷-5'-磷酸脱羧酶(URA3)的表达盒。SEQ ID NO 21还包括对应于核苷酸5568-6425的氨苄西林抗性基因。在ScD-Ura上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。通过PCR验证质粒的存在。PCR验证的分离株被命名为菌株1-9。
将菌株1-3用SEQ ID NO 22转化。SEQ ID NO 22含有以下元件:对应于核苷酸695-2491的含有来自α因子T(AT)的N末端分泌前导序列的来自米根霉的修饰的葡糖淀粉酶基因的表达盒,包括对应于核苷酸6-668的TDH3启动子和对应于核苷酸2500-2723的CYC1终止子;对应于核苷酸4804-3522的允许稳定复制的着丝粒(CEN6)以及对应于核苷酸4545-3529的乳清酸核苷-5'-磷酸脱羧酶(URA3)的表达盒。SEQ ID NO 22还包括对应于核苷酸5454-6311的氨苄西林抗性基因。在ScD-Ura上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。通过PCR验证质粒的存在。PCR验证的分离株被命名为菌株1-10。
将菌株1-3用SEQ ID NO 23转化。SEQ ID NO 23含有以下元件:对应于核苷酸695-2494的含有来自α淀粉酶(AA)的N末端分泌前导序列的来自米根霉的修饰的葡糖淀粉酶基因的表达盒,包括对应于核苷酸6-668的TDH3启动子和对应于核苷酸2503-2726的CYC1终止子;对应于核苷酸4807-5325的允许稳定复制的着丝粒(CEN6)以及对应于核苷酸4548-3532的乳清酸核苷-5'-磷酸脱羧酶(URA3)的表达盒。SEQ ID NO 23还包括对应于核苷酸5457-6314的氨苄西林抗性基因。在ScD-Ura上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。通过PCR验证质粒的存在。PCR验证的分离株被命名为菌株1-11。
将菌株1-3用SEQ ID NO 24转化。SEQ ID NO 24含有以下元件:对应于核苷酸695-2488的含有来自泡盛曲霉葡糖淀粉酶(GA)的N末端分泌前导序列的来自米根霉的修饰的葡糖淀粉酶基因的表达盒,包括对应于核苷酸6-668的TDH3启动子和对应于核苷酸2497-3720的CYC1终止子;对应于核苷酸4801-5319的允许稳定复制的着丝粒(CEN6)以及对应于核苷酸4542-3526的乳清酸核苷-5'-磷酸脱羧酶(URA3)的表达盒。SEQ ID NO 24还包括对应于核苷酸5451-6308的氨苄西林抗性基因。在ScD-Ura上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。通过PCR验证质粒的存在。PCR验证的分离株被命名为菌株1-12。
将菌株1-3用SEQ ID NO 25转化。SEQ ID NO 25含有以下元件:对应于核苷酸695-2482的含有来自菊粉酶(IN)的N末端分泌前导序列的来自米根霉的修饰的葡糖淀粉酶基因的表达盒,包括对应于核苷酸6-668的TDH3启动子和对应于核苷酸2491-2714的CYC1终止子;对应于核苷酸4795-5313的允许稳定复制的着丝粒(CEN6)以及对应于核苷酸4536-3520的乳清酸核苷-5'-磷酸脱羧酶(URA3)的表达盒。SEQ ID NO 25还包括对应于核苷酸5445-6302的氨苄西林抗性基因。在ScD-Ura上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。通过PCR验证质粒的存在。PCR验证的分离株被命名为菌株1-13。
将菌株1-3用SEQ ID NO 26转化。SEQ ID NO 26含有以下元件:对应于核苷酸695-2491的含有来自转化酶(IV)的N末端分泌前导序列的来自米根霉的修饰的葡糖淀粉酶基因的表达盒,包括对应于核苷酸6-668的TDH3启动子和对应于核苷酸2500-2723的CYC1终止子;对应于核苷酸4804-5322的允许稳定复制的着丝粒(CEN6)以及对应于核苷酸4545-3529的乳清酸核苷-5'-磷酸脱羧酶(URA3)的表达盒。SEQ ID NO 26还包括对应于核苷酸5454-6311的氨苄西林抗性基因。在ScD-Ura上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。通过PCR验证质粒的存在。PCR验证的分离株被命名为菌株1-14。
将菌株1-3用SEQ ID NO 27转化。SEQ ID NO 27含有以下元件:对应于核苷酸695-2512的含有来自溶菌酶(LZ)的N末端分泌前导序列的来自米根霉的修饰的葡糖淀粉酶基因的表达盒,包括对应于核苷酸6-668的TDH3启动子和对应于核苷酸2521-2744的CYC1终止子;对应于核苷酸4825-5343的允许稳定复制的着丝粒(CEN6)以及对应于核苷酸4566-3550的乳清酸核苷-5'-磷酸脱羧酶(URA3)的表达盒。SEQ ID NO 27还包括对应于核苷酸5475-6332的氨苄西林抗性基因。在ScD-Ura上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。通过PCR验证质粒的存在。PCR验证的分离株被命名为菌株1-15。
将菌株1-3用SEQ ID NO 28转化。SEQ ID NO 28含有以下元件:对应于核苷酸695-2488的含有来自白蛋白(SA)的N末端分泌前导序列的来自米根霉的修饰的葡糖淀粉酶基因的表达盒,包括对应于核苷酸6-668的TDH3启动子和对应于核苷酸2497-2720的CYC1终止子;对应于核苷酸4801-5319的允许稳定复制的着丝粒(CEN6)以及对应于核苷酸4542-3526的乳清酸核苷-5'-磷酸脱羧酶(URA3)的表达盒。SEQ ID NO 28还包括对应于核苷酸5451-6308的氨苄西林抗性基因。在ScD-Ura上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。通过PCR验证质粒的存在。PCR验证的分离株被命名为菌株1-16。
实施例8:测试对米根霉葡糖淀粉酶的另外的前导序列修饰的SSF发酵
将菌株1-6至菌株1-16接种到ScD-Ura平板上并且在30℃孵育直到单菌落可见为止(1天-2天)。将来自ScD-Ura平板的细胞刮到无菌摇瓶培养基中并且测量光密度(OD600)。使用型号Genesys20分光光度计(Thermo Scientific公司)用1cm光程长度在600nm的波长下测量光密度。
将摇瓶用细胞浆液接种以达到0.1-0.3的初始OD600。在即将接种之前,将50g摇瓶培养基添加到250mL无挡板的摇瓶(Corning 4995-250)中,所述摇瓶装有含有透气密封件的螺旋盖(corning 1395-45LTMC)。摇瓶培养基由725g呈液化物形式的部分水解玉米淀粉、150g过滤的淡玉米浆、25g葡萄糖以及1g尿素(Sigma U6504)组成。在填充培养基和接种之前和之后将摇瓶称重。将填充/接种前重量从填充/接种后重量中减去以确定起始摇瓶重量。
将接种的烧瓶在33.3℃孵育,同时在轨道式振荡器中以100rpm振荡74小时。采用最终时间点并且通过高效液相色谱法,使用折射率检测器分析发酵液中的乙醇浓度。摇瓶实验的结果示于图3中。这些结果表明了对米根霉葡糖淀粉酶的另外的前导序列修饰的有效性,除了GA前导序列和IN前导序列之外,所有修饰都使得菌株能够在给定的时间范围内产生大于120g/L的乙醇。
实施例9:构建含有内源性异麦芽糖酶的过表达和异源性麦芽糖转运蛋白的菌株
将菌株1-3用SEQ ID NO 29转化。SEQ ID NO 29含有以下元件:与整合基因座B的同源区(1bp-303bp)、ScPGK1启动子(309bp-895bp)、密码子优化的酿酒酵母异麦芽糖(902bp-2671bp)、ScGAL10终止子(2680bp-2935bp)、loxP重组位点(2978bp-3011bp)、ScURA3表达盒(3012bp-4641bp)、loxP重组位点(4642bp-4675bp)、ScADH1启动子(4690bp-5435bp)、米氏酵母(Saccharomyces mikatae)麦芽糖转运蛋白(5436bp-7289bp)、ScCYC1终止子(7299bp-7521bp)以及与整合基因座B的同源区。在缺乏尿嘧啶的合成完全培养基(ScD-Ura)上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。在单菌落中通过PCR验证SEQ ID NO 29正确整合到整合基因座B的一个等位基因中。PCR验证的分离株被命名为菌株1-17。
将菌株1-17用SEQ ID NO 30转化。SEQ ID NO 30含有以下元件:与整合基因座B的同源区(2bp-303bp)、ScPGK1启动子(309bp-895bp)、密码子优化的酿酒酵母异麦芽糖酶(902bp-2671bp)、ScGAL10终止子(2680bp-2935bp)、loxP重组位点(2985bp-3018bp)、ScTEF1终止子(3019bp-3178bp)、构巢曲霉乙酰胺酶(3179bp-4825bp)、ScTEF1启动子(4826bp-5281bp)、loxP重组位点(5282bp-5315bp)、ScCYC1终止子(5324bp-5547bp)、米氏酵母麦芽糖转运蛋白(5556bp-7409bp)、ScADH1启动子(7410bp-8148bp)以及与整合基因座B的同源区(8155bp-8685bp)。在含有20g/L葡萄糖和作为唯一氮源的1g/L乙酰胺的酵母氮源基础(不含硫酸铵或氨基酸)(YNB+乙酰胺)上选择转化体。在YNB+乙酰胺上将所得的转化体划线接种以用于单菌落分离。选择单菌落。在单菌落中通过PCR验证SEQ ID NO 30正确整合到整合基因座B的第二等位基因中。PCR验证的分离株被命名为菌株1-18。
将菌株1-18用SEQ ID NO 31转化。SEQ ID NO 31含有以下元件:1)来自酿酒酵母的3-脱氧-D-阿拉伯-庚酮糖酸-7-磷酸(DAHP)合酶基因的突变型式(ARO4-OFP)的表达盒;2)来自P1噬菌体的cre重组酶的表达盒;3)含有天然URA3的表达盒;以及4)酿酒酵母CEN6着丝粒。在含有3.5g/L的对氟苯丙氨酸和1g/L的L-酪氨酸的合成完全培养基(ScD-PFP)上选择转化体。在ScD-PFP上将所得的转化体划线接种以用于单菌落分离。选择单菌落。PCR验证的分离株被命名为菌株1-19。
实施例10:在菌株1-19中构建含有具有修饰的分泌信号的白宇佐美曲霉葡糖淀粉酶的多个拷贝的菌株
将菌株1-19用SEQ ID NO 32和SEQ ID NO 33转化。SEQ ID NO 32含有以下元件:与整合基因座C的同源区(2bp-1003bp)、ScTDH3启动子(1010bp-1691bp)、含有修饰的信号序列的白宇佐美曲霉葡糖淀粉酶(1698bp-3614bp)、ScCYC1终止子(3623bp-3846bp)、loxP重组位点(3855bp-3888bp)、ScURA3启动子(3889bp-4395bp)以及ScURA3基因的上游部分(4396bp-4999bp)。SEQ ID NO 33含有以下元件:ScURA3基因的下游部分(7bp-606bp)、URA3终止子(607bp-927bp)、loxP重组位点(928bp-961bp)、ADH1启动子(968bp-1714bp)、含有修饰的信号序列的白宇佐美曲霉葡糖淀粉酶(1721bp-3637bp)、GAL10终止子(3646bp-4116bp)以及与整合基因座C的同源区(4125bp-5124bp)。在缺乏尿嘧啶的合成完全培养基(ScD-Ura)上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。在单菌落中通过PCR验证SEQ ID NO 32和SEQ ID NO 33正确整合到整合基因座C的一个等位基因中。PCR验证的分离株被命名为菌株1-20。
将菌株1-20用SEQ ID NO 34和SEQ ID NO 35转化。SEQ ID NO 34含有以下元件:与整合基因座C的同源区(2bp-1003bp)、ScTDH3启动子(1010bp-1691bp)、含有修饰的信号序列的白宇佐美曲霉葡糖淀粉酶(1698bp-3614bp)、ScCYC1终止子(3623bp-3846bp)、loxP重组位点(3855bp-3888bp)、ScTEF1启动子(3889bp-4344bp)以及构巢曲霉乙酰胺酶的上游部分(4345bp-5384bp)。SEQ ID NO 35含有以下元件:构巢曲霉amdS的下游部分(7bp-1032bp)、ScADH1终止子(1033bp-1335bp)、loxP重组位点(1336bp-1369bp)、ScADH1启动子(1376bp-2123bp)、具有修饰的信号序列的白宇佐美曲霉葡糖淀粉酶(2129bp-4045bp)、ScGAL10终止子(4054bp-4524bp)以及与整合基因座C的同源区(4533bp-5532bp)。在含有20g/L葡萄糖和作为唯一氮源的1g/L乙酰胺的酵母氮源基础(不含硫酸铵或氨基酸)上选择转化体。在含有20g/l葡萄糖和作为唯一氮源的1g/L乙酰胺的酵母氮源基础(不含硫酸铵或氨基酸)上将所得的转化体划线接种以用于单菌落分离。选择单菌落。在单菌落中通过PCR验证SEQ ID NO 35和SEQ ID NO 57正确整合到整合基因座C的第二等位基因中。PCR验证的分离株被命名为菌株1-21。
实施例11:在菌株1-19和菌株1-3中构建含有具有修饰的分泌信号的米根霉葡糖淀粉酶的多个拷贝的菌株
将菌株1-3用SEQ ID NO 36和SEQ ID NO 37转化。SEQ ID NO 36含有以下元件:与整合基因座C的同源区(2bp-1003bp)、ScTDH3启动子(1010bp-1691bp)、具有修饰的信号序列的米根霉葡糖淀粉酶(1698bp-3494bp)、ScCYC1终止子(3503bp-3726bp)、loxP重组位点(3735bp-3768bp)、ScURA3启动子(3769bp-4275bp)、ScURA3的上游部分(4276bp-4879bp)。SEQ ID NO 37含有以下元件:ScURA3的下游部分(7bp-606bp)、ScURA3终止子(607bp-927bp)、loxP重组位点(928bp-961bp)、ScADH1启动子(968bp-1714bp)、具有修饰的信号序列的米根霉葡糖淀粉酶(1720bp-3516bp)、ScGAL10终止子(3525bp-3995bp)以及与整合基因座C的同源区。在缺乏尿嘧啶的合成完全培养基(ScD-Ura)上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。在单菌落中通过PCR验证SEQ IDNO 36和SEQ ID NO 37正确整合到整合基因座C的一个等位基因中。PCR验证的分离株被命名为菌株1-22。
将菌株1-22用SEQ ID NO 38和SEQ ID NO 39转化。SEQ ID NO 38含有以下元件:与整合基因座C的同源区(2bp-1003bp)、ScTDH3启动子(1010bp-1691bp)、具有修饰的信号序列的米根霉葡糖淀粉酶(1698bp-3494bp)、ScCYC1终止子(3503bp-3726bp)、loxP重组位点(3735bp-3768bp)、ScTEF1启动子(3769bp-4224bp)以及构巢曲霉乙酰胺酶的上游部分(4225bp-5264bp)。SEQ ID NO 39含有以下元件:构巢曲霉乙酰胺酶的下游部分(7bp-1032bp)、ScADH1终止子(1033bp-1335bp)、loxP重组位点(1336bp-1369bp)、ScADH1启动子(1376bp-2123bp)、具有修饰的信号序列的米根霉葡糖淀粉酶(2129bp-3925bp)、ScGAL10终止子(3934bp-4404bp)以及与整合基因座C的同源区。在含有20g/L葡萄糖和作为唯一氮源的1g/L乙酰胺的酵母氮源基础(不含硫酸铵或氨基酸)上选择转化体。在含有20g/l葡萄糖和作为唯一氮源的1g/L乙酰胺的酵母氮源基础(不含硫酸铵或氨基酸)上将所得的转化体划线接种以用于单菌落分离。选择单菌落。在单菌落中通过PCR验证SEQ ID NO 38和SEQ IDNO 39正确整合到第二等位基因中。PCR验证的分离株被命名为菌株1-23。
将菌株1-19用SEQ ID NO 36和SEQ ID NO 37转化。在缺乏尿嘧啶的合成完全培养基(ScD-Ura)上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。在单菌落中通过PCR验证SEQ ID NO 36和SEQ ID NO 37正确整合到整合基因座C的一个等位基因中。PCR验证的分离株被命名为菌株1-24。将菌株1-24用SEQ ID NO 38和SEQ ID NO 39转化。在YNB+乙酰胺平板上选择转化体。在YNB+乙酰胺上将所得的转化体划线接种以用于单菌落分离。选择单菌落。在单菌落中通过PCR验证SEQ ID NO 38和SEQ IDNO 39正确整合到整合基因座C的第二等位基因中。PCR验证的分离株被命名为菌株1-25。
实施例12:含有Pho5-As GA和Mfα2-Ro GA或Sf GA的多个拷贝的菌株的SSF发酵
将菌株1-25和菌株1-21接种到YPD平板上并且在30℃孵育直到单菌落可见为止(1天-2天)。将来自YPD平板的细胞刮到无菌摇瓶培养基中并且测量光密度(OD600)。使用型号Genesys20分光光度计(Thermo Scientific公司)用1cm光程长度在600nm的波长下测量光密度。将摇瓶用细胞浆液接种以达到0.1-0.3的初始OD600。在即将接种之前,将50mL摇瓶培养基添加到250mL无挡板的摇瓶(Corning 4995-250)中,所述摇瓶装有含有透气密封件的螺旋盖(corning 1395-45LTMC)。摇瓶培养基由800g呈液化物形式的部分水解玉米淀粉、150g过滤的淡玉米浆、50g水、25g葡萄糖以及1g尿素组成。将用于每一种菌株的一式两份烧瓶在30℃在轨道式振荡器中以100rpm振荡下孵育67小时。在发酵期间采集样品并且通过HPLC分析发酵液中的相关代谢物浓度。图4示出了乙醇产生曲线。该结果表明了工程化的表达GA的菌株在不需要补充GA的情况下产生高乙醇滴度的能力。使用补充有市售葡糖淀粉酶的市售野生型菌株的类似发酵将在相似的时间范围内达到类似的滴度。
实施例13:在30℃和33.3℃使用透气膜盖的玉米醪发酵
将菌株1、菌株1-18、菌株1-25以及菌株1-23接种到YPD平板上并且在30℃孵育直到单菌落可见为止(1天-2天)。将来自每一种菌株的单菌落接种到容纳50ml YPD(10g/L酵母提取物、20g/L蛋白胨、100g/L葡萄糖)的250ml接种瓶中并且在30℃和250RPM下培养过夜。将50克液化的玉米醪称取到预称重的250ml带挡板的螺旋盖摇瓶(Corning 4995-250)中,所述摇瓶装有含有透气密封件的螺旋盖(Corning 1395-45LTMC)。将0.190ml的50%(w/v)尿素溶液添加到每一个烧瓶中。将25ul的10mg/ml氨苄西林溶液添加到每一个烧瓶中。将70μl的SpirizymeTM的1:10稀释液添加到容纳菌株1和菌株1-18的烧瓶中。最终,添加来自过夜培养物的适量的种子接种物以达到0.1的初始OD600的目标。记录烧瓶的重量。在100RPM搅拌下将烧瓶在30℃和33.3℃下孵育。按一式两份运行每一种菌株。将烧瓶定期称重以计算重量损失,可以使用本领域已知的方法将所述重量损失转换成乙醇。提交在67.75小时的时间最终样品进行HPLC。图5显示了在30℃下ScIMA1和SmMAL11的益处,菌株1-18比菌株1多产生2.4g/L的乙醇。图5显示表达分泌信号修饰的米根霉葡糖淀粉酶的菌株实现了与补充有市售葡糖淀粉酶的菌株1至少同样多的乙醇。图5还显示在所有背景中将温度升高到33.3℃时乙醇滴度降低。在表达葡糖淀粉酶的菌株中,所述影响是更明显的。在该温度下,表达分泌信号修饰的米根霉GA的菌株比菌株1少产生8.8%(菌株1-25)或10.5%(菌株1-21)的乙醇。
玉米醪(或液化玉米醪)可以如下制备:碾磨预定量的黄色臼齿形2号玉米并且使其通过US 20号筛。将筛渣(在US 20号筛上保留的经过两次研磨的玉米)以X:Y的筛渣与过筛玉米的比率(0.020的筛渣/总玉米质量比)添加回。通过卤素水分平衡法测量水分含量以确定碾磨的玉米的干重。添加水以产生32%的浆液(w/w,以干重为基础)。添加浓硫酸以达到5.7-5.9的pH值。添加二水合氯化钙粉末以实现35ppm的钙浓度。基于玉米干淀粉重量,以每吨干基淀粉剂量2.84kg的剂量比添加淀粉酶(LiquozymeTM Novozymes Liquozyme Supra2.2X),并且将浆液转移到配备有预设在120℃的油浴的Buchi Rotovapor R-220烧瓶中。使反应进行2小时,一旦右旋糖当量(DE)达到30+/-2,就通过将温度降低到34℃-36℃来停止。使用另外的浓硫酸将pH值调节到5.0。可以通过使用渗透压计(AdvancedTM型号3D3和Precion系统型号Osmette XLTM)来确定DE。使用HPLC,使用Aminex HPX-87H柱(300mm×7.8mm)在60C(0.01N硫酸流动相,0.6mL/min流速)确定糖和低聚碳水化合物含量。
实施例14:在30℃和33.3℃使用装有气锁塞的烧瓶的玉米醪发酵
将菌株1、菌株1-18、菌株1-25以及菌株1-23接种到YPD平板上并且在30℃孵育直到单菌落可见为止(1天-2天)。将来自每一种菌株的单菌落接种到容纳50ml YPD(10g/L酵母提取物、20g/L蛋白胨、100g/L葡萄糖)的250ml接种瓶中并且在30℃和250RPM下培养过夜。将50克液化的玉米醪称取到预称重的250ml带挡板的摇瓶中,所述摇瓶装有含有5ml低芥酸菜籽油的气锁和塞。将0.190ml的50%(w/v)尿素溶液添加到每一个烧瓶中。将25ul的10mg/ml氨苄西林溶液添加到每一个烧瓶中。将70μl的SpirizymeTM的1:10稀释液添加到容纳菌株1和菌株1-18的烧瓶中。最终,添加来自过夜培养物的适量的种子接种物以达到0.1的初始OD600的目标。记录烧瓶的重量。在100RPM搅拌下将烧瓶在30℃和33.3C下孵育。按一式两份运行每一种菌株。将烧瓶定期称重以计算重量损失,可以使用本领域已知的方法将所述重量损失转换成乙醇。提交时间最终样品进行HPLC。图6显示在30℃表达分泌信号修饰的米根霉葡糖淀粉酶的菌株实现了与补充有市售葡糖淀粉酶的菌株1相似的乙醇。图6还显示在所有背景中将温度升高到33.3℃时乙醇滴度降低。在该温度下,表达分泌信号修饰的米根霉GA的菌株比菌株1少产生16.9%(菌株1-25)或20.3%(菌株1-21)的乙醇。
实施例15:在30℃和33.3℃使用装有气锁塞的烧瓶的玉米醪发酵
将菌株1、菌株1-25、菌株1-21以及菌株1-23接种到YPD平板上并且在30℃孵育直到单菌落可见为止(1天-2天)。将来自每一种菌株的单菌落接种到容纳50ml YM发酵液(3g/L酵母提取物、3g/L麦芽提取物、5g/l酵母蛋白胨、10g/L葡萄糖)的250ml接种瓶中并且在30℃和250RPM下培养过夜。将50克液化的玉米醪称取到预称重的250ml带挡板的摇瓶中,所述摇瓶装有含有5ml低芥酸菜籽油的气锁和塞。将0.190ml的50%(w/v)尿素溶液添加到每一个烧瓶中。将25ul的10mg/ml氨苄西林溶液添加到每一个烧瓶中。将70μl的SpirizymeTM的1:10稀释液添加到容纳菌株1的烧瓶中。最终,添加来自过夜培养物的适量的种子接种物以达到0.1的初始OD600的目标。记录烧瓶的重量。在100RPM搅拌下将烧瓶在30℃和33.3℃孵育。按一式两份运行每一种菌株。将烧瓶定期称重以计算重量损失,可以使用本领域已知的方法将所述重量损失转换成乙醇。在发酵72小时之后提交时间最终样品进行HPLC。图7显示在30℃表达分泌信号修饰的米根霉葡糖淀粉酶或修饰的白宇佐美曲霉葡糖淀粉酶的菌株实现了与补充有市售葡糖淀粉酶的菌株1相似的乙醇。图7还显示在所有背景中将温度升高到33.3℃时乙醇滴度降低。图8显示了在发酵结束时留下的残余葡萄糖。
实施例16:菌株1-25中URA3的恢复
将菌株1-25用SEQ ID NO 31转化。在含有3.5g/L的对氟苯丙氨酸和1g/L的L-酪氨酸的合成完全培养基(ScD-PFP)上选择转化体。在ScD-PFP上将所得的转化体划线接种以用于单菌落分离。选择单菌落。PCR验证的分离株被命名为菌株1-26。将菌株1-26用SEQ ID NO40转化。SEQ ID NO 40含有与菌株1-3中被破坏的基因座具有同源性的ScURA3表达盒。在缺乏尿嘧啶的合成完全培养基(ScD-Ura)上选择转化体。在ScD-Ura上将所得的转化体划线接种以用于单菌落分离。选择单菌落。在单菌落中通过PCR验证SEQ ID NO 40正确整合到整合基因座A的一个等位基因中。PCR验证的分离株被命名为菌株1-27。
实施例17:菌株1-27的诱变
将菌株1-27接种到YPD平板上并且在30℃孵育过夜。将细胞接种到9ml的巴特菲尔德缓冲液(butterfields buffer)中达到4.0的OD600。将100μl等分并且平铺到YNB 1%淀粉平板(6.7g/L不含氨基酸或硫酸铵的酵母氮源基础、20g/l琼脂、10g/l淀粉)上。将平板在没有盖子的情况下按琼脂尺寸递降的顺序放置到UV交联仪(Stratalinker,Stratagene公司)中并且使用300J/cm2的能量设定进行诱变。将平板在30℃孵育七天。将突变株接种到类似的平板上并且在30℃再孵育3天。将单菌落接种到YPD,并且保存为菌株1-28。
将菌株1-28接种到YPD平板上并且在30℃孵育过夜。将细胞接种到9ml的巴特菲尔德缓冲液中达到4.0的OD600。将100μl等分并且平铺到YNB 1%淀粉平板(6.7g/L不含氨基酸或硫酸铵的酵母氮源基础、20g/l琼脂、10g/l淀粉)上。将平板在没有盖子的情况下按琼脂尺寸递降的顺序放置到UV交联仪(Stratalinker,Stratagene公司)中并且使用200J/cm2的能量设定进行诱变。将平板在37℃孵育七天。将突变株接种到类似的平板上并且在37℃再孵育3天。将单菌落接种到YPD,并且保存为菌株1-29。
实施例18:比较菌株1-25和菌株1-28的玉米醪发酵
将菌株1-25和菌株1-28接种到YPD平板上并且在30℃孵育直到单菌落可见为止(1天-2天)。将来自每一种菌株的单菌落接种到容纳50ml YPD(10g/L酵母提取物、20g/L蛋白胨、100g/L葡萄糖)的250ml接种瓶中并且在30℃和250RPM下培养过夜。将50克液化的玉米醪称取到预称重的250ml带挡板的摇瓶中,所述摇瓶装有含有5ml低芥酸菜籽油的气锁和塞。将0.190ml的50%(w/v)尿素溶液添加到每一个烧瓶中。将25ul的10mg/ml氨苄西林溶液添加到每一个烧瓶中。最终,添加来自过夜培养物的适量的种子接种物以达到0.1的初始OD600的目标。记录烧瓶的重量。在100RPM搅拌下将烧瓶在33.3℃孵育。按一式两份运行每一种菌株。将烧瓶定期称重以计算重量损失,可以使用本领域已知的方法将所述重量损失转换成乙醇。提交在69小时的时间最终样品进行HPLC。图9显示与菌株1-25相比菌株1-28中的乙醇产量提高。在33.3℃,菌株1-28中的乙醇产量与亲本菌株1-25相比高了18.2%。
实施例19:在37℃比较菌株1和菌株1-28的玉米醪发酵
使用Nexcelcom Bioscience细胞计数器测量活细胞计数。将细胞在Nexcelcom酵母缓冲液(产品编号CSO-0110)中1:40稀释;将其在ViaStain酵母活/死AO/PI染色剂(Nexcelom编号CS0-0102-10ML,来自试剂盒编号CSK-0102)中进一步1:1稀释。在稀释之后并且在加入细胞计数器载片中之前将样品涡旋混合5秒-10秒。将样品在暗处孵育2分钟-5分钟,之后对活CFU/ml进行计数。
将菌株1-28和菌株1从原种培养物接种到玉米醪接种瓶中,从而使这两种菌株达到5.0e7至1.e8的活细胞的初始CFU/ml的目标。种子培养基由每个250ml带挡板的烧瓶50g液化的玉米醪组成。将种子在30C、250rpm下孵育15小时-18小时。从这些接种瓶对生产烧瓶进行接种,从而使这两种菌株达到1.95e7的活细胞的初始CFU/ml的目标。在该时间点使用细胞计数器法再次获取细胞计数以正确地将从接种瓶转移到生产烧瓶中的接种物水平标准化。
对于生产烧瓶,将50g液化的玉米醪称取到预称重的250ml带挡板的摇瓶中,所述摇瓶装有含有5ml低芥酸菜籽油的气锁和塞。将0.315ml的50%(w/v)尿素溶液添加到每一个烧瓶中。将25ul的10mg/ml氨苄西林溶液添加到每一个烧瓶中。将100μl的DistillaseTM的1:10稀释液添加到容纳菌株1的烧瓶中,将75uL的DistillaseTM的1:100稀释液添加到菌株1-28烧瓶中(0.075×剂量)。对于这两种菌株,如上文所详述添加种子达到每个烧瓶1.95e7CFU/ml。记录烧瓶的重量。使用Infors Multitron振荡器在37℃和100RPM搅拌下孵育烧瓶,在24小时,将振荡器中的温度降低到32.5℃。按一式两份运行每一种菌株。将烧瓶定期称重以计算重量损失,可以使用本领域已知的方法将所述重量损失转换成乙醇。提交在75小时的样品以使用本领域已知的方法进行HPLC。表4和表5显示当经受高初始温度时,菌株1-28显示出与菌株1至少相当的性能。
表4.玉米醪发酵中的最终乙醇滴度。
表5.玉米醪发酵中乙醇的收率
示例性实施方案
A.一种工程化的多肽,所述工程化的多肽包含:
(a)包含5-8个连续疏水性氨基酸残基的分泌信号氨基酸序列;以及
(b)来自酵母、真菌或细菌葡糖淀粉酶多肽的葡糖淀粉酶氨基酸序列,其中所述分泌信号氨基酸序列与所述葡糖淀粉酶氨基酸序列是异源的,并且所述工程化的多肽具有葡糖淀粉酶活性。
B.实施方案A的工程化的多肽,其中所述分泌信号氨基酸序列包含5个或6个连续疏水性氨基酸残基。
C.实施方案A的工程化的多肽,其中所述5-8个连续疏水性氨基酸残基的氨基酸选自由以下组成的组:丙氨酸、异亮氨酸、亮氨酸、苯丙氨酸以及缬氨酸。
D.实施方案C的工程化的多肽,其中所述5-8个连续的疏水性氨基酸残基包含一个或多个亮氨酸残基。
E.实施方案A的工程化的多肽,其中所述5-8个连续疏水性氨基酸残基紧邻一个或两个极性氨基酸残基。
F.实施方案E的工程化的多肽,其中所述极性氨基酸残基是丝氨酸残基。
G.实施方案A的工程化的多肽,其中所述5-8个连续疏水性氨基酸残基包含选自由以下组成的组的序列:AVLFAA、AFLFLL、LVLVLL、LLFLF以及FILAAV。
H.实施方案A的工程化的多肽,其中所述分泌信号氨基酸序列包含至少15个、16个、17个、18个或19个氨基酸残基。
I.实施方案A的工程化的多肽,所述工程化的多肽包含:
(a)分泌信号氨基酸序列,所述分泌信号氨基酸序列与以下各项具有80%或更大的序列同一性:(i)SEQ ID NO:73的至少AA 1-19的氨基酸序列;(ii)SEQ ID NO:74的至少AA 1-19的氨基酸序列;(iii)SEQ ID NO:77(An aa);(iv)SEQ ID NO:75(Sc IV);(v)SEQID NO:76(Gg LZ);或(vi)SEQ ID NO:78(Hs SA);以及
(b)来自酵母、真菌或细菌葡糖淀粉酶多肽的葡糖淀粉酶氨基酸序列,其中所述多肽具有葡糖淀粉酶活性。
J.实施方案I的工程化的多肽,其中所述(a)分泌信号氨基酸序列与以下各项具有90%或更大的序列同一性:
(i)SEQ ID NO:73的至少AA 1-19的氨基酸序列;(ii)SEQ ID NO:74的至少AA 1-19的氨基酸序列;(iii)SEQ ID NO:77(An aa);(iv)SEQ ID NO:75(Sc IV);(v)SEQ ID NO:76(Gg LZ);或(vi)SEQ ID NO:78(Hs SA)。
K.实施方案J的工程化的多肽,其中所述(a)分泌信号氨基酸序列是:
i)SEQ ID NO:73的至少AA 1-19的氨基酸序列;(ii)SEQ ID NO:74的至少AA 1-19的氨基酸序列;(iii)SEQ ID NO:77(An aa);(iv)SEQ ID NO:75(Sc IV);(v)SEQ ID NO:76(Gg LZ);或(vi)SEQ ID NO:78(Hs SA)。
L.实施方案A-K中任一个的工程化的多肽,其中所述葡糖淀粉酶氨基酸序列来自酵母或真菌葡糖淀粉酶。
M.实施方案A-L中任一个的工程化的多肽,其中所述葡糖淀粉酶氨基酸序列是酵母或真菌葡糖淀粉酶多肽的酶活性部分。
N.实施方案A-M中任一个的工程化的多肽,其中所述葡糖淀粉酶氨基酸序列来自选自由以下组成的组的酵母或真菌生物体:树脂枝孢霉菌、黑曲霉、泡盛曲霉、米曲霉、川地曲霉、白宇佐美曲霉、土曲霉、出芽短梗霉(Aureobasidium pullulans)、食腺嘌呤芽生葡萄孢酵母、布鲁塞尔酒香酵母、白色念珠菌、产朊假丝酵母、草酸青霉(Penicilliumoxalicum)、米根霉、粟酒裂殖酵母、酿酒酵母、扣囊复膜孢酵母、埃默森踝节菌(Talaromyces emersonii)、瓣环栓菌(Trametes cingulate)以及里氏木霉。
O.实施方案N的工程化的多肽,其中所述葡糖淀粉酶氨基酸序列与SEQ ID NO:42(米根霉GA的氨基酸26-604)具有95%或更大、96%或更大、97%或更大、98%或更大或99%或更大的序列同一性。
P.实施方案N的工程化的多肽,其中所述葡糖淀粉酶氨基酸序列与SEQ ID NO:43(白宇佐美曲霉GA的氨基酸19-639)具有95%或更大、96%或更大、97%或更大、98%或更大或99%或更大的序列同一性。
Q.实施方案N的工程化的多肽,其中所述葡糖淀粉酶氨基酸序列与SEQ ID NO:44(土曲霉GA的氨基酸21-636)具有95%或更大、96%或更大、97%或更大、98%或更大或99%或更大的序列同一性。
R.实施方案A的工程化的多肽,其中所述葡糖淀粉酶氨基酸序列与选自由以下各项组成的组的多肽的氨基酸具有95%或更大、96%或更大、97%或更大、98%或更大或99%或更大的序列同一性:
(i)SEQ ID NO:45(Sc-FAKS)-扣囊复膜孢酵母GA;
(ii)SEQ ID NO:46(Sc-AKS)-扣囊复膜孢酵母GA;
(iii)SEQ ID NO:47(An aa)-扣囊复膜孢酵母GA;
(iv)SEQ ID NO:48(Sc IV)-扣囊复膜孢酵母GA;
(v)SEQ ID NO:49(Gg LZ)-扣囊复膜孢酵母GA;
(vi)SEQ ID NO:50(Hs SA)-扣囊复膜孢酵母GA;以及
(vii)SEQ ID NO:51(Sc MFα1)-扣囊复膜孢酵母GA。
S.实施方案A的工程化的多肽,其中所述葡糖淀粉酶氨基酸序列与选自由以下各项组成的组的多肽的氨基酸具有95%或更大、96%或更大、97%或更大、98%或更大或99%或更大的序列同一性:
(i)SEQ ID NO:52(Sc-FAKS)-米根霉GA;
(ii)SEQ ID NO:53(Sc-AKS)-米根霉GA;
(iii)SEQ ID NO:54(An aa)-米根霉GA;
(iv)SEQ ID NO:55(Sc IV)-米根霉GA;
(v)SEQ ID NO:56(Gg LZ)-米根霉GA;
(vi)SEQ ID NO:57(Hs SA)-米根霉GA;以及
(vii)SEQ ID NO:58(Sc MFα1)-米根霉GA。
T.实施方案A的工程化的多肽,其中所述葡糖淀粉酶氨基酸序列与选自由以下各项组成的组的多肽的氨基酸具有95%或更大、96%或更大、97%或更大、98%或更大或99%或更大的序列同一性:
(i)SEQ ID NO:59(Sc-FAKS)-白宇佐美曲霉GA;
(i)SEQ ID NO:60(Sc-AKS)-白宇佐美曲霉GA;
(ii)SEQ ID NO:61(An aa)-白宇佐美曲霉GA;
(iii)SEQ ID NO:62(Sc IV)-白宇佐美曲霉GA;
(iv)SEQ ID NO:63(Gg LZ)-白宇佐美曲霉GA;
(vi)SEQ ID NO:64(Hs SA)-白宇佐美曲霉GA;以及
(vii)SEQ ID NO:65(Sc MFα1)-白宇佐美曲霉GA。
U.实施方案A的工程化的多肽,其中所述葡糖淀粉酶氨基酸序列与选自由以下各项组成的组的多肽的氨基酸具有95%或更大、96%或更大、97%或更大、98%或更大或99%或更大的序列同一性:
(i)SEQ ID NO:66(Sc-FAKS)-土曲霉GA;
(ii)SEQ ID NO:67(Sc-AKS)-土曲霉GA;
(iii)SEQ ID NO:68(An aa)-土曲霉GA;
(iv)SEQ ID NO:69(Sc IV)-土曲霉GA;
(v)SEQ ID NO:70(Gg LZ)-土曲霉GA;
(vi)SEQ ID NO:71(Hs SA)-土曲霉GA;以及
(vii)SEQ ID NO:72(Sc MFα1)-土曲霉GA。
V.实施方案A-U中任一个的工程化的多肽,所述工程化的多肽还包含第三序列,所述第三序列不同于所述(a)分泌信号氨基酸序列和所述(b)葡糖淀粉酶氨基酸,其中所述第三序列被定位于(a)与(b)之间或处于(b)的C末端处。
W.一种核酸,所述核酸包含编码实施方案A-V中任一个的多肽的核酸序列。
X.实施方案W的核酸,所述核酸还包含转录调节序列。
Y.实施方案X的核酸,其中所述转录调节序列包含ADH启动子或TDH3启动子。
Z.一种载体,所述载体包含实施方案W-Y中任一个的核酸。
AA.实施方案Z的载体,所述载体包含用于在酵母中选择的营养缺陷型基因标记。
AB.一种工程化的细胞,所述工程化的细胞表达实施方案A-V中任一个的多肽。
AC.一种工程化的细胞,所述工程化的细胞包含实施方案W-AB中任一个(即W-Z、AA或AB中的任一个)的核酸或载体。
AD.实施方案AC的工程化的宿主细胞,所述工程化的宿主细胞是工程化的酵母。
AE.实施方案AD的工程化的酵母,所述工程化的酵母是酵母菌属的菌种。
AF.实施方案AE的工程化的酵母,所述工程化的酵母是酿酒酵母。
AG.实施方案AE-AF中任一个的工程化的酵母,所述工程化的酵母还包含(i)异源性异麦芽糖酶或以比未修饰的酵母中的水平更高的水平表达的内源性异麦芽糖酶;(ii)异源性糖转运蛋白多肽;(iii)不同于葡糖淀粉酶的异源性淀粉降解酶;或(i)-(iii)中的任一种的组合。
AH.实施方案AG的工程化的酵母,其中所述内源性异麦芽糖酶选自由以下组成的组:IMA1、IMA2、IMA3、IMA4以及IMA5;或所述异源性糖转运蛋白多肽与SEQ ID NO:79(SmMAL11)具有90%或更大的同一性。
AI.实施方案AE-AH中任一个的工程化的酵母,所述工程化的酵母(a)能够以大于90g/L、100g/L、110g/L、120g/L、130g/L或140g/L的滴度产生乙醇;(b)在33℃至40℃、33℃至39℃、33℃至38℃、33℃至37℃、34℃至37℃、35℃至37℃或36℃至38℃范围内的温度下具有耐热性;或(a)和(b)这两者。
AJ.一种发酵培养基,所述发酵培养基包含实施方案A-V中任一个的多肽或实施方案AE-AH中任一个的工程化的酵母。
AK.实施方案A-V中任一个的多肽、实施方案AE-AI中任一个的工程化的酵母或实施方案AJ的发酵培养基用于制备生物产物或饲料组合物的用途。
AL.实施方案AJ的发酵培养基,所述发酵培养基包含约90g/L或更大浓度的乙醇。
AM.实施方案AL的发酵培养基,所述发酵培养基包含90g/L至170g/L范围内的浓度的乙醇。
AN.一种饲料组合物,所述饲料组合物是由实施方案AJ、AL或AM中任一个的发酵培养基制备的。
AO.实施方案AN的饲料组合物,所述饲料组合物是通过包括以下步骤的方法制备的:(a)从所述发酵培养基中去除一些或所有生物产物以提供包含非生物产物固体的精制组合物;以及(b)使用所述精制组合物形成饲料组合物。
AP.一种用于产生发酵产物的发酵方法,所述发酵方法包括以下步骤:
将包含淀粉材料和实施方案A-V中任一个的多肽或实施方案AE-AI中任一个的工程化的酵母的液体培养基发酵以提供发酵产物。
AQ.实施方案AP的发酵方法,其中所述发酵产物是乙醇。
AR.实施方案AQ的发酵方法,其中在所述培养基中产生乙醇达到90g/L或更大的浓度。
AS.实施方案AR的发酵方法,其中所述发酵提供90g/L至170g/L范围内的乙醇。
AT.实施方案AS的发酵方法,其中所述发酵提供110g/L至170g/L范围内的乙醇。
AU.实施方案AT的发酵方法,其中所述发酵提供125g/L至170g/L范围内的乙醇。
AV.实施方案AP-AU中任一个的方法,其中所述发酵培养基在所述发酵期间至少一个时间点期间是至少33℃、至少34℃、至少35℃、至少36℃或至少37℃。
AW.实施方案AP-AV中任一个的方法,其中所述发酵在所述发酵培养基中提供120g/L或更大、130g/L或更大或140g/L或更大量的乙醇。
序列表
<110> 嘉吉公司
克里斯·米勒
<120> 前导序列修饰的葡糖淀粉酶多肽和具有增强的生物产物产生的工程化的酵母菌株
<130> N00485-WO-PCT
<150> 62/371,681
<151> 2016-08-05
<160> 86
<170> PatentIn 版本3.5
<210> 1
<211> 3182
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 1
cctactgcgc caattgatga caatacagac gatgataaca aaccgaagtt atctgatgta 60
gaaaaggatt aaagatgcta agagatagtg atgatatttc ataaataatg taattctata 120
tatgttaatt accttttttg cgaggcatat ttatggtgaa ggataagttt tgaccatcaa 180
agaaggttaa tgtggctgtg gtttcagggt ccataaagct tttcaattca tctttttttt 240
ttttgttctt ttttttgatt ccggtttctt tgaaattttt ttgattcggt aatctccgag 300
cagaaggaag aacgaaggaa ggagcacaga cttagattgg tatatatacg catatgtggt 360
gttgaagaaa catgaaattg cccagtattc ttaacccaac tgcacagaac aaaaacctgc 420
aggaaacgaa gataaagcgg ccgcataact tcgtataatg tatgctatac gaagttatct 480
gccagtatac agctagcctt gaaagtgatg gaaaacattg tcatcggcac ataaataaaa 540
aaattatgaa tcacgtgatc aacagcaaat tatgtactcg tatatatgca agcgcattcc 600
ttatattgac actctttcat tgggcatgag gctgtgtaaa cataagctgt aacggtctca 660
cggaacactg tgtagttgca ttactgtcag gcagttatgt tgcttaatat aaaggcaaag 720
gcatggcaga atcactttaa aacgtggccc cacccgctgc accctgtgca ttttgtacgt 780
tactgcgaaa tgactcaacg atgaaatgaa aaaattttgc ttgaaatttt gaaaaaaaga 840
tgtgcgggac gcattgttag ctcattgaat acatcgtgat cgaatccaat caatgtttaa 900
tttcatatta atacagaaac tttttctcat actttcttct tcttttcatt ggtatattat 960
ctatatatcg tgttaattcc tctttcgtca tttttagcat cgttataaga gtaattaaga 1020
ataactagaa gagtctctct ttatattcgt ttattttata tatttaaccg ctaaatttag 1080
taaacaaaag aatctatcag aaatgagtga atctccaatg ttcgctgcca acggcatgcc 1140
aaaggtaaat caaggtgctg aagaagatgt cagaatttta ggttacgacc cattagcttc 1200
tccagctctc cttcaagtgc aaatcccagc cacaccaact tctttggaaa ctgccaagag 1260
aggtagaaga gaagctatag atattattac cggtaaagac gacagagttc ttgtcattgt 1320
cggtccttgt tccatccatg atctagaagc cgctcaagaa tacgctttga gattaaagaa 1380
attgtcagat gaattaaaag gtgatttatc catcattatg agagcatact tggagaagcc 1440
aagaacaacc gtcggctgga aaggtctaat taatgaccct gatgttaaca acactttcaa 1500
catcaacaag ggtttgcaat ccgctagaca attgtttgtc aacttgacaa atatcggttt 1560
gccaattggt tctgaaatgc ttgataccat ttctcctaaa tacttggctg atttggtctc 1620
cttcggtgcc attggtgcca gaaccaccga atctcaactg cacagagaat tggcctccgg 1680
tttgtctttc ccagttggtt tcaagaacgg taccgatggt accttaaatg ttgctgtgga 1740
tgcttgtcaa gccgctgctc attctcacca tttcatgggt gttactaagc atggtgttgc 1800
tgctatcacc actactaagg gtaacgaaca ctgcttcgtt attctaagag gtggtaaaaa 1860
gggtaccaac tacgacgcta agtccgttgc agaagctaag gctcaattgc ctgccggttc 1920
caacggtcta atgattgact actctcacgg taactccaat aaggatttca gaaaccaacc 1980
aaaggtcaat gacgttgttt gtgagcaaat cgctaacggt gaaaacgcca ttaccggtgt 2040
catgattgaa tcaaacatca acgaaggtaa ccaaggcatc ccagccgaag gtaaagccgg 2100
cttgaaatat ggtgtttcca tcactgatgc ttgtataggt tgggaaacta ctgaagacgt 2160
cttgaggaaa ttggctgctg ctgtcagaca aagaagagaa gttaacaaga aatagatgtt 2220
tttttaatga tatatgtaac gtacattctt tcctctacca ctgccaattc ggtattattt 2280
aattgtgttt agcgctattt actaattaac tagaaactca atttttaaag gcaaagctcg 2340
ctgacctttc actgatttcg tggatgttat actatcagtt actcttctgc aaaaaaaaat 2400
tgagtcatat cgtagctttg ggattatttt tctctctctc cacggctaat taggtgatca 2460
tgaaaaaatg aaaaattcat gagaaaagag tcagacatcg aaacatacat aagttgatat 2520
tcctttgata tcgacgacta ctcaatcagg ttttaaaaga aaagaggcag ctattgaagt 2580
agcagtatcc agtttaggtt ttttaattat ttacaagtaa agaaaaagag aatgccggtc 2640
gttcacgata acttcgtata atgtatgcta tacgaagtta tgcggccgcg agaagatgcg 2700
gccagcaaaa ctaaaaaact gtattataag taaatgcatg tatactaaac tcacaaatta 2760
gagcttcaat ttaattatat cagttattac ccgggaatct cggtcgtaat gatttctata 2820
atgacgaaaa aaaaaaaatt ggaaagaaaa agcttcatgg cctttataaa aaggaactat 2880
ccaatacctc gccagaacca agtaacagta ttttacgggg cacaaatcaa gaacaataag 2940
acaggactgt aaagatggac gcattgaact ccaaagaaca acaagagttc caaaaagtag 3000
tggaacaaaa gcaaatgaag gatttcatgc gtttgtactc taatctggta gaaagatgtt 3060
tcacagactg tgtcaatgac ttcacaacat caaagctaac caataaggaa caaacatgca 3120
tcatgaagtg ctcagaaaag ttcttgaagc atagcgaacg tgtagggcag cgtttccaag 3180
ag 3182
<210> 2
<211> 3275
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 2
cctactgcgc caattgatga caatacagac gatgataaca aaccgaagtt atctgatgta 60
gaaaaggatt aaagatgcta agagatagtg atgatatttc ataaataatg taattctata 120
tatgttaatt accttttttg cgaggcatat ttatggtgaa gaataagttt tgaccatcaa 180
agaaggttaa tgtggctgtg gtttcagggt ccataaagct tttcaattca tcattttttt 240
tttattcttt tttttgattc cggtttcctt gaaatttttt tgattcggta atctccgaac 300
agaaggaaga acgaaggaag gagcacagac ttagattggt atatatacgc atatgtagtg 360
ttgaagaaac atgaaattgc ccagtattct taacccaact gcacagaaca aaaatctgca 420
ggaaacgaag ataaagcggc cgcataactt cgtatagcat acattatacg aagttatcgc 480
ctgttaagat ataactgaaa aaagagggga atttttagat actgaaatga tattttagaa 540
taaccagact atatataagg ataaattaca aaaaattaac taatagataa gatttaaata 600
taaaagatat gcaactagaa aagtcttatc aatctcctta tggagtgacg acgttaccca 660
acaatttacc gacttcttcg gcgatagcca aagttctctc ttcggacaat cttctaccaa 720
taacttgaac agcaacagga gcaccgtgat aagcctctgg gtcgtattct tcttgaacca 780
aagcatccaa ttcggaaaca gctttaaaag attcgttctt cttatcaata ttcttatcag 840
cgaaagtgac tgggacgaca acagaggtga aatccaataa gttaataacg gaggcgtaac 900
cgtagtatct gaattgatcg tgtctgacag cggcggtagg agtaattgga gcgataatag 960
cgtccaattc cttaccagct ttttcttcag cttcacgcca cttttccaag tattccattt 1020
gatagttcca cttttgtaaa tgagtgtccc acaattcgtt catgttaaca gccttaatat 1080
ttgggttcaa caagtcctta atgttaggga tggctggctc accagaggca gaaatgtctc 1140
tcatgacgtc ggcagaacca tcagcagcat agatgtggga aatcaagtca tgaccgaaat 1200
catgcttgta tggagtccat ggagtaacgg tgtgaccagc cttggccaaa gcggcaacgg 1260
tagtttcgac accacgtaaa attggtgggt gtggcaagac gttaccgtcg aaattgtaat 1320
aaccaatgtt caaaccacca ttcttaatct tagaggcaat gatgtcagat tcagattgtc 1380
tccatggcat tgggatgacc ttagagtcgt acttccaagg ttcttgaccc aagacagatt 1440
tggtgaacaa tctcaagtct tcgacggagt gagtgatagg accaacgacg gagtgaacgg 1500
tttcttgacc ttccatagag ttagccattt tagcatatgg caatctaccg tgagatggtc 1560
tcaaaccgta taaaaagttg aaagcagctg ggactctaat ggaaccacca atgtcagtac 1620
cgacaccaat aacaccacct ctaataccaa caatagcacc ttcaccacca gaagaaccac 1680
cacaggacca atttttgttt cttggattga cagttctacc aatgatgttg ttgacggttt 1740
cacagaccat caaggtttgt gggacagagg tcttaacgta gaaaacagca ccagcttttc 1800
tcaacatggt ggttaagacg gaatcacctt catcgtattt gtttaaccag gaaatgtaac 1860
ccatggaggt ttcgtaaccc ttaacacgca attggtcctt taaagagatt ggtaaaccgt 1920
gtaatggacc aactggtctc ttatgcttag cgtagtattc atctaattct ctagcttgag 1980
ctaaagcagc atctgggaag aattcgtgag cacagttggt taattgttga gcaatagcag 2040
ctctcttaca aaaagccaaa gtgacttcaa cagaagtcaa ctcaccagcg gccaacttgg 2100
agaccaaatc agcagcagag gcttcggtaa tcttcaattc agcctcagac aaaataccgg 2160
acttctttgg gaaatcaata acggaatctt cggcaggcaa agtttgaacc ttccattcgt 2220
caggaatggt tttagccaaa cgggcacgtt tgtcggcggc caattcttcc caggattgtg 2280
gcattttgta attaaaactt agattagatt gctatgcttt ctttctaatg agcaagaagt 2340
aaaaaaagtt gtaatagaac aagaaaaacg aaactgaaac ttgagaaatt gaagaccatt 2400
tattaactta aatatcaatg ggaggtcatc gaaagagaaa aaaatcaaaa aaaaaatttt 2460
tcaagaaaaa gaaacgtgat aaaaattttt attgcctttt tcgacgaaga aaaagaaacg 2520
aggcggtctc ttttttcttt tccaaacctt tagtacgggt aattaacgcc accctagagg 2580
aagaaagagg ggaaatttag tatgctgtgc ttgggtgttt tgaagtggta cggcgatgcg 2640
cggagtccga gaaaatctgg aagagtaaaa aaggagtaga aacattttga agctatggtg 2700
tgtgggggat cacttgtggg ggattgggtg tgatgtaagg ataacttcgt atagcataca 2760
ttatacgaag ttatgcggcc gcgagaagat gcggccagca aaactaaaaa actgtattat 2820
aagtaaatgc atgtatacta aactcacaaa ttagagcttc aatttaatta tatcagttat 2880
tacccgggaa tctcggtcgt aatgattttt ataatgacga aaaaaaaaaa attggaaaga 2940
aaaagcttca tggcctttat aaaaaggaac catccaatac ctcgccagaa ccaagtaaca 3000
gtattttacg gggcacaaat caagaacaat aagacaggac tgtaaagatg gacgcattga 3060
actccaaaga acaacaagag ttccaaaaag tagtggaaca aaagcaaatg aaggatttca 3120
tgcgtttgta ctctaatctg gtagaaagat gttttacaga ctgtgtcaat gacttcacaa 3180
catcaaagct aaccaataag gaacaaacat gcatcatgaa gtgctcagaa aagttcttga 3240
agcatagcga acgtgtaggg cagcgtttcc aagag 3275
<210> 3
<211> 1132
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 3
ctctttttta cagatcatca aggaagtaat tatctacttt ttacaagaat tcatgtctaa 60
tttacttact gttcaccaaa acttgcctgc attaccagtt gacgcaacct ccgatgaagt 120
cagaaagaac cttatggata tgtttagaga tagacaagct ttctccgaac atacttggaa 180
aatgttatta tccgtttgta gatcctgggc cgcttggtgt aaacttaaca atagaaaatg 240
gtttcctgct gaaccagaag acgtcagaga ttacttactt tacttacaag ctagaggttt 300
ggctgttaaa actatccaac aacacttagg tcaattgaat atgttacaca gaagatccgg 360
tttaccaaga ccatccgatt ccaacgcagt ttcccttgtt atgagaagaa ttagaaaaga 420
aaatgttgac gctggtgaaa gagctaaaca agcattagca tttgaaagaa ccgatttcga 480
tcaagttaga tccttaatgg aaaattccga tagatgtcaa gatattagaa acttagcttt 540
cttaggtatt gcttacaaca cattattaag aatcgctgaa attgctagaa ttagagttaa 600
agatatttca agaaccgatg gcggtagaat gttaatccac attggcagaa caaaaacctt 660
agtctccaca gcaggcgtcg aaaaagcatt atcattaggt gttactaaat tagttgaacg 720
ttggatttcc gtttccggtg ttgcagatga cccaaacaac tacttattct gtcgtgttag 780
aaaaaatggt gttgccgctc cttccgctac ctcacaatta tccacaagag cattagaagg 840
catttttgaa gctacccaca gacttattta tggtgcaaaa gacgattccg gtcaaagata 900
tttagcttgg tctggtcatt ccgctagagt tggtgccgca agagacatgg caagagctgg 960
tgtttctatt cctgaaatta tgcaagccgg tggttggact aatgttaaca ttgttatgaa 1020
ctatatcaga aacttagatt ccgaaacagg tgctatggtt agattacttg aagacggtga 1080
ttaagctagc taagatccgc tctaaccgaa aaggaaggag ttagacaacc tg 1132
<210> 4
<211> 6376
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 4
ctagctaaga tccgctctaa ccgaaaagga aggagttaga caacctgaag tctaggtccc 60
tatttatttt tttatagtta tgttagtatt aagaacgtta tttatatttc aaatttttct 120
tttttttctg tacagacgcg tgtacgcatg taacattata ctgaaaacct tgcttgagaa 180
ggttttggga cgctcgaaga tccagctgca ttaatgaatc ggccaacgcg cggggagagg 240
cggtttgcgt attgggcgct cttccgcttc ctcgctcact gactcgctgc gctcggtcgt 300
tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat ccacagaatc 360
aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa 420
aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc atcacaaaaa 480
tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc 540
ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg gatacctgtc 600
cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta ggtatctcag 660
ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga 720
ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac acgacttatc 780
gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag gcggtgctac 840
agagttcttg aagtggtggc ctaactacgg ctacactaga aggacagtat ttggtatctg 900
cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat ccggcaaaca 960
aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa 1020
aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt ggaacgaaaa 1080
ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct agatcctttt 1140
aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt ggtctgacag 1200
ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc gttcatccat 1260
agttgcctga ctccccgtcg tgtagataac tacgatacgg gagggcttac catctggccc 1320
cagtgctgca atgataccgc gagacccacg ctcaccggct ccagatttat cagcaataaa 1380
ccagccagcc ggaagggccg agcgcagaag tggtcctgca actttatccg cctccatcca 1440
gtctattaat tgttgccggg aagctagagt aagtagttcg ccagttaata gtttgcgcaa 1500
cgttgttgcc attgctacag gcatcgtggt gtcacgctcg tcgtttggta tggcttcatt 1560
cagctccggt tcccaacgat caaggcgagt tacatgatcc cccatgttgt gcaaaaaagc 1620
ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag tgttatcact 1680
catggttatg gcagcactgc ataattctct tactgtcatg ccatccgtaa gatgcttttc 1740
tgtgactggt gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc gaccgagttg 1800
ctcttgcccg gcgtcaatac gggataatac cgcgccacat agcagaactt taaaagtgct 1860
catcattgga aaacgttctt cggggcgaaa actctcaagg atcttaccgc tgttgagatc 1920
cagttcgatg taacccactc gtgcacccaa ctgatcttca gcatctttta ctttcaccag 1980
cgtttctggg tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa taagggcgac 2040
acggaaatgt tgaatactca tactcttcct ttttcaatat tattgaagca tttatcaggg 2100
ttattgtctc atgagcggat acatatttga atgtatttag aaaaataaac aaataggggt 2160
tccgcgcaca tttccccgaa aagtgccacc tgaacgaagc atctgtgctt cattttgtag 2220
aacaaaaatg caacgcgaga gcgctaattt ttcaaacaaa gaatctgagc tgcattttta 2280
cagaacagaa atgcaacgcg aaagcgctat tttaccaacg aagaatctgt gcttcatttt 2340
tgtaaaacaa aaatgcaacg cgagagcgct aatttttcaa acaaagaatc tgagctgcat 2400
ttttacagaa cagaaatgca acgcgagagc gctattttac caacaaagaa tctatacttc 2460
ttttttgttc tacaaaaatg catcccgaga gcgctatttt tctaacaaag catcttagat 2520
tacttttttt ctcctttgtg cgctctataa tgcagtctct tgataacttt ttgcactgta 2580
ggtccgttaa ggttagaaga aggctacttt ggtgtctatt ttctcttcca taaaaaaagc 2640
ctgactccac ttcccgcgtt tactgattac tagcgaagct gcgggtgcat tttttcaaga 2700
taaaggcatc cccgattata ttctataccg atgtggattg cgcatacttt gtgaacagaa 2760
agtgatagcg ttgatgattc ttcattggtc agaaaattat gaacggtttc ttctattttg 2820
tctctatata ctacgtatag gaaatgttta cattttcgta ttgttttcga ttcactctat 2880
gaatagttct tactacaatt tttttgtcta aagagtaata ctagagataa acataaaaaa 2940
tgtagaggtc gagtttagat gcaagttcaa ggagcgaaag gtggatgggt aggttatata 3000
gggatatagc acagagatat atagcaaaga gatacttttg agcaatgttt gtggaagcgg 3060
tattcgcaat attttagtag ctcgttacag tccggtgcgt ttttggtttt ttgaaagtgc 3120
gtcttcagag cgcttttggt tttcaaaagc gctctgaagt tcctatactt tctagagaat 3180
aggaacttcg gaataggaac ttcaaagcgt ttccgaaaac gagcgcttcc gaaaatgcaa 3240
cgcgagctgc gcacatacag ctcactgttc acgtcgcacc tatatctgcg tgttgcctgt 3300
atatatatat acatgagaag aacggcatag tgcgtgttta tgcttaaatg cgtacttata 3360
tgcgtctatt tatgtaggat gaaaggtagt ctagtacctc ctgtgatatt atcccattcc 3420
atgcggggta tcgtatgctt ccttcagcac taccctttag ctgttctata tgctgccact 3480
cctcaattgg attagtctca tccttcaatg ctatcatttc ctttgatatt ggatcatact 3540
aagaaaccat tattatcatg acattaacct ataaaaatag gcgtatcacg aggccctttc 3600
gtctcgcgcg tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg 3660
tcacagcttg tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg 3720
gtgttggcgg gtgtcggggc tggcttaact atgcggcatc agagcagatt gtactgagag 3780
tgcaccatac cacagctttt caattcaatt catcattttt tttttattct tttttttgat 3840
ttcggtttct ttgaaatttt tttgattcgg taatctccga acagaaggaa gaacgaagga 3900
aggagcacag acttagattg gtatatatac gcatatgtag tgttgaagaa acatgaaatt 3960
gcccagtatt cttaacccaa ctgcacagaa caaaaacctg caggaaacga agataaatca 4020
tgtcgaaagc tacatataag gaacgtgctg ctactcatcc tagtcctgtt gctgccaagc 4080
tatttaatat catgcacgaa aagcaaacaa acttgtgtgc ttcattggat gttcgtacca 4140
ccaaggaatt actggagtta gttgaagcat taggtcccaa aatttgttta ctaaaaacac 4200
atgtggatat cttgactgat ttttccatgg agggcacagt taagccgcta aaggcattat 4260
ccgccaagta caatttttta ctcttcgaag acagaaaatt tgctgacatt ggtaatacag 4320
tcaaattgca gtactctgcg ggtgtataca gaatagcaga atgggcagac attacgaatg 4380
cacacggtgt ggtgggccca ggtattgtta gcggtttgaa gcaggcggca gaagaagtaa 4440
caaaggaacc tagaggcctt ttgatgttag cagaattgtc atgcaagggc tccctatcta 4500
ctggagaata tactaagggt actgttgaca ttgcgaagag cgacaaagat tttgttatcg 4560
gctttattgc tcaaagagac atgggtggaa gagatgaagg ttacgattgg ttgattatga 4620
cacccggtgt gggtttagat gacaagggag acgcattggg tcaacagtat agaaccgtgg 4680
atgatgtggt ctctacagga tctgacatta ttattgttgg aagaggacta tttgcaaagg 4740
gaagggatgc taaggtagag ggtgaacgtt acagaaaagc aggctgggaa gcatatttga 4800
gaagatgcgg ccagcaaaac taaaaaactg tattataagt aaatgcatgt atactaaact 4860
cacaaattag agcttcaatt taattatatc agttattacc ctatgcggtg tgaaataccg 4920
cacagatgcg taaggagaaa ataccgcatc aggaaattgt aaacgttaat attttgttaa 4980
aattcgcgtt aaatttttgt taaatcagct cattttttaa ccaataggcc gaaatcggca 5040
aaatccctta taaatcaaaa gaatagaccg agatagggtt gagtgttgtt ccagtttgga 5100
acaagagtcc actattaaag aacgtggact ccaacgtcaa agggcgaaaa accgtctatc 5160
agggcgatgg cccactacgt gaaccatcac cctaatcaag ttttttgggg tcgaggtgcc 5220
gtaaagcact aaatcggaac cctaaaggga gcccccgatt tagagcttga cggggaaagc 5280
cggcgaacgt ggcgagaaag gaagggaaga aagcgaaagg agcgggcgct agggcgctgg 5340
caagtgtagc ggtcacgctg cgcgtaacca ccacacccgc cgcgcttaat gcgccgctac 5400
agggcgcgtc cattcgccat tcaggctgcg caactgttgg gaagggcgat cggtgcgggc 5460
ctcttcgcta ttacgccagc tgaattggag cgacctcatg ctatacctga gaaagcaacc 5520
tgacctacag gaaagagtta ctcaagaata agaattttcg ttttaaaacc taagagtcac 5580
tttaaaattt gtatacactt atttttttta taacttattt aataataaaa atcataaatc 5640
ataagaaatt cgcttattta gaagtgtcaa caacgtatct accaacgatt tgaccctttt 5700
ccatcttttc gtaaatttct ggcaaggtag acaagccgac aaccttgatt ggagacttga 5760
ccaaacctct ggcgaagaat tgttaattaa gccagaaaaa ggaagtgttt ccctccttct 5820
tgaattgatg ttaccctcat aaagcacgtg gcctcttatc gagaaagaaa ttaccgtcgc 5880
tcgtgatttg tttgcaaaaa gaacaaaact gaaaaaaccc agacacgctc gacttcctgt 5940
cttcctattg attgcagctt ccaatttcgt cacacaacaa ggtcctagcg acggctcaca 6000
ggttttgtaa caagcaatcg aaggttctgg aatggcggga aagggtttag taccacatgc 6060
tatgatgccc actgtgatct ccagagcaaa gttcgttcga tcgtactgtt actctctctc 6120
tttcaaacag aattgtccga atcgtgtgac aacaacagcc tgttctcaca cactcttttc 6180
ttctaaccaa gggggtggtt tagtttagta gaacctcgtg aaacttacat ttacatatat 6240
ataaacttgc ataaattggt caatgcaaga aatacatatt tggtcttttc taattcgtag 6300
tttttcaagt tcttagatgc tttctttttc tcttttttac agatcatcaa ggaagtaatt 6360
atctactttt tacaag 6376
<210> 5
<211> 5735
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 5
taaacaggcc ccttttcctt tgtcgatatc atgtaattag ttatgtcacg cttacattca 60
cgccctcctc ccacatccgc tctaaccgaa aaggaaggag ttagacaacc tgaagtctag 120
gtccctattt atttttttat agttatgtta gtattaagaa cgttatttat atttcaaatt 180
tttctttttt ttctgtacaa acgcgtgtac gcatgtaacg ggcagacgcg gccgccaccg 240
cggtggagct ccaattcgcc ctatagtgag tcgtattaca attcactggc cgtcgtttta 300
caacgtcgtg actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc 360
cccttcgcca gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg 420
cgcagcctga atggcgaatg gcgcgacgcg ccctgtagcg gcgcattaag cgcggcgggt 480
gtggtggtta cgcgcagcgt gaccgctaca cttgccagcg ccctagcgcc cgctcctttc 540
gctttcttcc cttcctttct cgccacgttc gccggctttc cccgtcaagc tctaaatcgg 600
gggctccctt tagggttccg atttagtgct ttacggcacc tcgaccccaa aaaacttgat 660
tagggtgatg gttcacgtag tgggccatcg ccctgataga cggtttttcg ccctttgacg 720
ttggagtcca cgttctttaa tagtggactc ttgttccaaa ctggaacaac actcaaccct 780
atctcggtct attcttttga tttataaggg attttgccga tttcggccta ttggttaaaa 840
aatgagctga tttaacaaaa atttaacgcg aattttaaca aaatattaac gtttacaatt 900
tcctgatgcg gtattttctc cttacgcatc tgtgcggtat ttcacaccgc agggtaataa 960
ctgatataat taaattgaag ctctaatttg tgagtttagt atacatgcat ttacttataa 1020
tacagttttt tagttttgct ggccgcatct tctcaaatat gcttcccagc ctgcttttct 1080
gtaacgttca ccctctacct tagcatccct tccctttgca aatagtcctc ttccaacaat 1140
aataatgtca gatcctgtag agaccacatc atccacggtt ctatactgtt gacccaatgc 1200
gtctcccttg tcatctaaac ccacaccggg tgtcataatc aaccaatcgt aaccttcatc 1260
tcttccaccc atgtctcttt gagcaataaa gccgataaca aaatctttgt cgctcttcgc 1320
aatgtcaaca gtacccttag tatattctcc agtagatagg gagcccttgc atgacaattc 1380
tgctaacatc aaaaggcctc taggttcctt tgttacttct tctgccgcct gcttcaaacc 1440
gctaacaata cctgggccca ccacaccgtg tgcattcgta atgtctgccc attctgctat 1500
tctgtataca cccgcagagt actgcaattt gactgtatta ccaatgtcag caaattttct 1560
gtcttcgaag agtaaaaaat tgtacttggc ggataatgcc tttagcggct taactgtgcc 1620
ctccatggaa aaatcagtca agatatccac atgtgttttt agtaaacaaa ttttgggacc 1680
taatgcttca actaactcca gtaattcctt ggtggtacga acatccaatg aagcacacaa 1740
gtttgtttgc ttttcgtgca tgatattaaa tagcttggca gcaacaggac taggatgagt 1800
agcagcacgt tccttatatg tagctttcga catgatttat cttcgtttcc tgcaggtttt 1860
tgttctgtgc agttgggtta agaatactgg gcaatttcat gtttcttcaa cactacatat 1920
gcgtatatat accaatctaa gtctgtgctc cttccttcgt tcttccttct gttcggagat 1980
taccgaatca aaaaaatttc aaagaaaccg aaatcaaaaa aaagaataaa aaaaaaatga 2040
tgaattgaat tgaaaagcgt ggtgcactct cagtacaatc tgctctgatg ccgcatagtt 2100
aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt gtctgctccc 2160
ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc 2220
accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat ttttataggt 2280
taatgtcatg ataataatgg tttcttagga cggatcgctt gcctgtaact tacacgcgcc 2340
tcgtatcttt taatgatgga ataatttggg aatttactct gtgtttattt atttttatgt 2400
tttgtatttg gattttagaa agtaaataaa gaaggtagaa gagttacgga atgaagaaaa 2460
aaaaataaac aaaggtttaa aaaatttcaa caaaaagcgt actttacata tatatttatt 2520
agacaagaaa agcagattaa atagatatac attcgattaa cgataagtaa aatgtaaaat 2580
cacaggattt tcgtgtgtgg tcttctacac agacaagatg aaacaattcg gcattaatac 2640
ctgagagcag gaagagcaag ataaaaggta gtatttgttg gcgatccccc tagagtcttt 2700
tacatcttcg gaaaacaaaa actatttttt ctttaatttc tttttttact ttctattttt 2760
aatttatata tttatattaa aaaatttaaa ttataattat ttttatagca cgtgatgaaa 2820
aggacccagg tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct 2880
aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat 2940
attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg 3000
cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg 3060
aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc 3120
ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat 3180
gtggcgcggt attatcccgt attgacgccg ggcaagagca actcggtcgc cgcatacact 3240
attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca 3300
tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact 3360
tacttctgac aacgatcgga ggaccgaagg agctaaccgc tttttttcac aacatggggg 3420
atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg 3480
agcgtgacac cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg 3540
aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg 3600
caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag 3660
ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc 3720
gtatcgtagt tatctacacg acgggcagtc aggcaactat ggatgaacga aatagacaga 3780
tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat 3840
atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc 3900
tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag 3960
accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct 4020
gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac 4080
caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc 4140
tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg 4200
ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt 4260
tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt 4320
gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc 4380
attgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca 4440
gggtcggaac aggagagcgc acgagggagc ttccaggggg gaacgcctgg tatctttata 4500
gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg 4560
ggccgagcct atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct 4620
ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta 4680
ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag 4740
tgagcgagga agcggaagag cgcccaatac gcaaaccgcc tctccccgcg cgttggccga 4800
ttcattaatg cagctggcac gacaggtttc ccgactggaa agcgggcagt gagcgcaacg 4860
caattaatgt gagttacctc actcattagg caccccaggc tttacacttt atgcttccgg 4920
ctcctatgtt gtgtggaatt gtgagcggat aacaatttca cacaggaaac agctatgacc 4980
atgattacgc caagctcgga attaaccctc actaaaggga acaaaagctg ggtaccgggc 5040
cccccctcga gatctcccga gtttatcatt atcaatactg ccatttcaaa gaatacgtaa 5100
ataattaata gtagtgattt tcctaacttt atttagtcaa aaaattggcc ttttaattct 5160
gctgtaaccc gtacatgccc aaaatagggg gcgggttaca cagaatatat aacatcatag 5220
gtgtctgggt gaacagttta ttcctggcat ccactaaata taatggagcc cgcttttttt 5280
aagctggcat ccagaaaaaa aaagaatccc agcaccaaaa tattgttttc ttcaccaacc 5340
atcagttcat aggtccattc tcttagcgca actacacaga acaggggcac aaacaggcaa 5400
aaaacgggca caacctcaat ggagtgatgc aacctgcttg gagtaaatga tgacacaagg 5460
caattgacct acgcatgtat ctatctcatt ttcttacacc ttctattacc ttctgctctc 5520
tctgatttgg aaaaagctga aaaaaaaggt tgaaaccagt tccctgaaat tattccccta 5580
tttgactaat aagtatataa agacggtagg tattgattgt aattctgtaa atctatttct 5640
taaacttctt aaattctact tttatagtta gtcttttttt tagttttaaa acactaagaa 5700
cttagtttcg aataaacaca cataaacaaa caaat 5735
<210> 6
<211> 2017
<212> DNA
<213> 白宇佐美曲霉(Aspergillus shirousami)
<400> 6
cactaagaac ttagtttcga ataaacacac ataaacaaac aaatctagaa tgagctttag 60
gtcattgctt gctctgtctg gcttagtttg tagtggtttg gcaagcgtta tctctaagag 120
agcaacgttg gatagttggt tatcaaatga agcaactgtc gctagaaccg caattctaaa 180
caatattgga gctgatggtg catgggttag cggtgcagac tctggtattg tggtagcctc 240
tccatccaca gataatccag attatttcta tacttggact agagattccg gaatagtttt 300
gaaaacgctg gtggatttgt ttcgtaatgg ggacaccgac ttgttatcaa ccattgagca 360
ttatatctcc agtcaagcaa ttattcaagg tgtctcaaat ccatccggcg acttgagcag 420
tggggggctg ggagaaccta agttcaatgt ggacgaaacg gcttacgctg gaagttgggg 480
cagaccacag agagacggac cagctctaag agcaacagcc atgattggat tcggtcagtg 540
gctactagac aatggataca ctagcgccgc gacagaaatt gtttggccac tagtcaggaa 600
cgacctaagt tacgttgctc aatattggaa ccaaaccggg tatgatctgt gggaagaggt 660
taatggatct agtttcttca ccatcgcagt tcagcataga gctttggttg aaggtagcgc 720
cttcgcaacg gcagttggga gttcatgctc ttggtgtgat tcacaggcac cacaaatctt 780
atgttatctt cagagctttt ggaccggttc ctatattcta gccaatttcg acagttccag 840
atccggtaag gatactaaca ctttacttgg ctcaatacat accttcgacc ctgaagctgg 900
gtgtgatgat tctacattcc aaccctgttc tccgagagca ctggccaatc ataaagaagt 960
ggttgattca tttagaagta tttatacact aaatgacgga ttaagtgaca gtgaagccgt 1020
agccgtcgga agatatccag aagattccta ttacaatggt aatccatggt tcttatgtac 1080
acttgctgct gctgaacaat tatatgacgc attgtatcaa tgggataagc aaggctcttt 1140
agaaattacc gacgtaagtt tagacttctt taaagcattg tatagcggtg cagccacggg 1200
tacatactca tcttcttcta gtacgtactc ttctattgtt tctgcggtga aaacttttgc 1260
tgacggcttt gtttctatcg tcgagaccca tgccgccagt aacggttctt tatccgaaca 1320
atttgacaag tccgatggcg atgagttaag cgcaagagat ctaacctggt cttatgccgc 1380
attacttaca gccaacaaca gacgtaattc cgttgtacca ccatcttggg gtgaaacaag 1440
tgcttcttca gttccgggca cctgcgcggc cacaagtgca tcaggaactt attcatcagt 1500
gactgtaaca tcttggccta gtattgtcgc aaccggtggt acaactacca ctgcaactac 1560
gacgggttct ggaggagtca cttccacaag caagactacg actactgcaa gtaaaaccag 1620
tactactacc tcctccacta gctgtacgac acccaccgcc gtagccgtca ctttcgattt 1680
gactgctaca accacctacg gcgagaatat ctacttggtg ggatcaatct cacaactagg 1740
tgactgggag acttccgacg ggatcgcttt gtcagcagat aaatacacat catctaaccc 1800
accatggtat gtgacggtca ctttacctgc cggggagtct ttcgaataca agtttataag 1860
ggtagaatcc gatgacagtg tggaatggga atctgatcct aatagagagt acacagtgcc 1920
acaagcttgt ggggaatcta cagccacagt taccgataca tggaggtagt taattaaaca 1980
ggcccctttt cctttgtcga tatcatgtaa ttagtta 2017
<210> 7
<211> 2014
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 7
cactaagaac ttagtttcga ataaacacac ataaacaaac aaatctagaa tgttcaagtc 60
tgttgtttac tctattttgg ctgcctcttt ggctaacgct agtgttatct ctaagagagc 120
aacgttggat agttggttat caaatgaagc aactgtcgct agaaccgcaa ttctaaacaa 180
tattggagct gatggtgcat gggttagcgg tgcagactct ggtattgtgg tagcctctcc 240
atccacagat aatccagatt atttctatac ttggactaga gattccggaa tagttttgaa 300
aacgctggtg gatttgtttc gtaatgggga caccgacttg ttatcaacca ttgagcatta 360
tatctccagt caagcaatta ttcaaggtgt ctcaaatcca tccggcgact tgagcagtgg 420
ggggctggga gaacctaagt tcaatgtgga cgaaacggct tacgctggaa gttggggcag 480
accacagaga gacggaccag ctctaagagc aacagccatg attggattcg gtcagtggct 540
actagacaat ggatacacta gcgccgcgac agaaattgtt tggccactag tcaggaacga 600
cctaagttac gttgctcaat attggaacca aaccgggtat gatctgtggg aagaggttaa 660
tggatctagt ttcttcacca tcgcagttca gcatagagct ttggttgaag gtagcgcctt 720
cgcaacggca gttgggagtt catgctcttg gtgtgattca caggcaccac aaatcttatg 780
ttatcttcag agcttttgga ccggttccta tattctagcc aatttcgaca gttccagatc 840
cggtaaggat actaacactt tacttggctc aatacatacc ttcgaccctg aagctgggtg 900
tgatgattct acattccaac cctgttctcc gagagcactg gccaatcata aagaagtggt 960
tgattcattt agaagtattt atacactaaa tgacggatta agtgacagtg aagccgtagc 1020
cgtcggaaga tatccagaag attcctatta caatggtaat ccatggttct tatgtacact 1080
tgctgctgct gaacaattat atgacgcatt gtatcaatgg gataagcaag gctctttaga 1140
aattaccgac gtaagtttag acttctttaa agcattgtat agcggtgcag ccacgggtac 1200
atactcatct tcttctagta cgtactcttc tattgtttct gcggtgaaaa cttttgctga 1260
cggctttgtt tctatcgtcg agacccatgc cgccagtaac ggttctttat ccgaacaatt 1320
tgacaagtcc gatggcgatg agttaagcgc aagagatcta acctggtctt atgccgcatt 1380
acttacagcc aacaacagac gtaattccgt tgtaccacca tcttggggtg aaacaagtgc 1440
ttcttcagtt ccgggcacct gcgcggccac aagtgcatca ggaacttatt catcagtgac 1500
tgtaacatct tggcctagta ttgtcgcaac cggtggtaca actaccactg caactacgac 1560
gggttctgga ggagtcactt ccacaagcaa gactacgact actgcaagta aaaccagtac 1620
tactacctcc tccactagct gtacgacacc caccgccgta gccgtcactt tcgatttgac 1680
tgctacaacc acctacggcg agaatatcta cttggtggga tcaatctcac aactaggtga 1740
ctgggagact tccgacggga tcgctttgtc agcagataaa tacacatcat ctaacccacc 1800
atggtatgtg acggtcactt tacctgccgg ggagtctttc gaatacaagt ttataagggt 1860
agaatccgat gacagtgtgg aatgggaatc tgatcctaat agagagtaca cagtgccaca 1920
agcttgtggg gaatctacag ccacagttac cgatacatgg aggtagttaa ttaaacaggc 1980
cccttttcct ttgtcgatat catgtaatta gtta 2014
<210> 8
<211> 2020
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 8
cactaagaac ttagtttcga ataaacacac ataaacaaac aaatctagaa tgaagttcat 60
ttccactttc ttgaccttca ttttggctgc tgtctctgtc accgctagtg ttatctctaa 120
gagagcaacg ttggatagtt ggttatcaaa tgaagcaact gtcgctagaa ccgcaattct 180
aaacaatatt ggagctgatg gtgcatgggt tagcggtgca gactctggta ttgtggtagc 240
ctctccatcc acagataatc cagattattt ctatacttgg actagagatt ccggaatagt 300
tttgaaaacg ctggtggatt tgtttcgtaa tggggacacc gacttgttat caaccattga 360
gcattatatc tccagtcaag caattattca aggtgtctca aatccatccg gcgacttgag 420
cagtgggggg ctgggagaac ctaagttcaa tgtggacgaa acggcttacg ctggaagttg 480
gggcagacca cagagagacg gaccagctct aagagcaaca gccatgattg gattcggtca 540
gtggctacta gacaatggat acactagcgc cgcgacagaa attgtttggc cactagtcag 600
gaacgaccta agttacgttg ctcaatattg gaaccaaacc gggtatgatc tgtgggaaga 660
ggttaatgga tctagtttct tcaccatcgc agttcagcat agagctttgg ttgaaggtag 720
cgccttcgca acggcagttg ggagttcatg ctcttggtgt gattcacagg caccacaaat 780
cttatgttat cttcagagct tttggaccgg ttcctatatt ctagccaatt tcgacagttc 840
cagatccggt aaggatacta acactttact tggctcaata cataccttcg accctgaagc 900
tgggtgtgat gattctacat tccaaccctg ttctccgaga gcactggcca atcataaaga 960
agtggttgat tcatttagaa gtatttatac actaaatgac ggattaagtg acagtgaagc 1020
cgtagccgtc ggaagatatc cagaagattc ctattacaat ggtaatccat ggttcttatg 1080
tacacttgct gctgctgaac aattatatga cgcattgtat caatgggata agcaaggctc 1140
tttagaaatt accgacgtaa gtttagactt ctttaaagca ttgtatagcg gtgcagccac 1200
gggtacatac tcatcttctt ctagtacgta ctcttctatt gtttctgcgg tgaaaacttt 1260
tgctgacggc tttgtttcta tcgtcgagac ccatgccgcc agtaacggtt ctttatccga 1320
acaatttgac aagtccgatg gcgatgagtt aagcgcaaga gatctaacct ggtcttatgc 1380
cgcattactt acagccaaca acagacgtaa ttccgttgta ccaccatctt ggggtgaaac 1440
aagtgcttct tcagttccgg gcacctgcgc ggccacaagt gcatcaggaa cttattcatc 1500
agtgactgta acatcttggc ctagtattgt cgcaaccggt ggtacaacta ccactgcaac 1560
tacgacgggt tctggaggag tcacttccac aagcaagact acgactactg caagtaaaac 1620
cagtactact acctcctcca ctagctgtac gacacccacc gccgtagccg tcactttcga 1680
tttgactgct acaaccacct acggcgagaa tatctacttg gtgggatcaa tctcacaact 1740
aggtgactgg gagacttccg acgggatcgc tttgtcagca gataaataca catcatctaa 1800
cccaccatgg tatgtgacgg tcactttacc tgccggggag tctttcgaat acaagtttat 1860
aagggtagaa tccgatgaca gtgtggaatg ggaatctgat cctaatagag agtacacagt 1920
gccacaagct tgtggggaat ctacagccac agttaccgat acatggaggt agttaattaa 1980
acaggcccct tttcctttgt cgatatcatg taattagtta 2020
<210> 9
<211> 2008
<212> DNA
<213> 土曲霉(Aspergillus terreus)
<400> 9
cactaagaac ttagtttcga ataaacacac ataaacaaac aaatctagaa tgacaagaat 60
cttaacacta gctttgcacg gtttggctct agttcaaagt gtggttggcg ccccccaatt 120
agcacccaga gctacgacta gcttagacgc atggttagcg tccgaaacaa cagttgcttt 180
agatggaata ctagacaatg ttggatcaag cggagcctat gccaagtctg caaaaagcgg 240
tattgtgata gcgtcccctt ctactagtga cccagattat tattatacct ggaccagaga 300
tgctgcgttg accgtaaaag ccttgatcga tttgtttaga aacggagaaa catccttaca 360
aacagtaatt atggaataca tatctagcca agcatattta caaacagttt ctaatccatc 420
cggatcgtta agtaccggtg gtttggctga accaaaatac tacgtagatg aaactgcgta 480
cactggaagc tggggaaggc cacaaagaga tggccctgcc ctaagagcca ctgctatgat 540
cgattttgga aattggctga tcgataatgg ttactctact tacgccagtt ccattgtctg 600
gcctattgtt agaaacgatc tttcttatgt tgcgcagtac tggaatcaaa ccggttacga 660
tctttgggag gaggtaaatg ggagttcatt tttcactata gctgttcagc atagagcttt 720
ggtggaaggt agtacattcg catctaaagt tggtgcttca tgctcctggt gcgattcaca 780
agctcctcaa gtgctttgct tcctacaaag gttttggact ggttcttaca taatggcaaa 840
ctttggaggg ggtagatccg gtaaagatgc taatacagtt ctggggagta ttcatacctt 900
cgaccctaat gcgggttgtg acgacacgac tttccagcca tgctcaccac gtgcgttggc 960
aaaccataaa gtctatactg actcttttag atctatctac agtataaatt ctggcattag 1020
ctctggtaag gctgtggcag ttggaagata ccccgaagat tcttactata acggtaaccc 1080
gtggtttctt accacattgg ctgctgcaga acaactttat gatgccatct atcaatggca 1140
aaaaatcgga tctatcacca ttacagacgt atctttggct tttttcaaag acctttattc 1200
ttcagccgct gtgggtactt acgcctccag ttcctcagca ttcactagta tagtttctgc 1260
ggtaaaaacc tatgctgatg gttatatgtc tatagtccag acacatgcta tgacaaacgg 1320
atcattaagt gagcagtttg gtaaatctga cggtttttct ttgtctgcaa gagatttaac 1380
ctggtcttat gctgctctgt tgactgcaaa tcttaggagg aactccgtcg ttccaccctc 1440
ttggggtgaa actactgcaa catcagtccc cagtgtgtgt tcagccacta gtgctacagg 1500
gacatatagt actgctacta acactgcttg gccgtctaca ttgactagcg gtacaggagc 1560
cacaaccacg acatcaaaag ctacgtcttc atcaactacc actacatctt ctgcgtctag 1620
tacgacagtt gagtgtgtag ttccaacagc tgtggcggtc acttttgatg aggtcgcaac 1680
cactacatac ggtgaaaatg tttacgtcgt cggttcaata tcacagttgg gttcttggga 1740
cacgtctaaa gcagtggcat tatctgcatc caaatatacc tcctccaata acctgtggta 1800
tgtgactgtg acattgccag caggaacaac atttcaatac aaatttatca gagtgagttc 1860
ttctggtagt gtcacctggg agtcagatcc gaaccgttct tacacagtac catcagcctg 1920
tggcaccagc acggctgtag ttaatacaac ttggagatag ttaattaaac aggccccttt 1980
tcctttgtcg atatcatgta attagtta 2008
<210> 10
<211> 1999
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 10
cactaagaac ttagtttcga ataaacacac ataaacaaac aaatctagaa tgttcaagtc 60
tgttgtttac tctattttgg ctgcctcttt ggctaacgct gccccccaat tagcacccag 120
agctacgact agcttagacg catggttagc gtccgaaaca acagttgctt tagatggaat 180
actagacaat gttggatcaa gcggagccta tgccaagtct gcaaaaagcg gtattgtgat 240
agcgtcccct tctactagtg acccagatta ttattatacc tggaccagag atgctgcgtt 300
gaccgtaaaa gccttgatcg atttgtttag aaacggagaa acatccttac aaacagtaat 360
tatggaatac atatctagcc aagcatattt acaaacagtt tctaatccat ccggatcgtt 420
aagtaccggt ggtttggctg aaccaaaata ctacgtagat gaaactgcgt acactggaag 480
ctggggaagg ccacaaagag atggccctgc cctaagagcc actgctatga tcgattttgg 540
aaattggctg atcgataatg gttactctac ttacgccagt tccattgtct ggcctattgt 600
tagaaacgat ctttcttatg ttgcgcagta ctggaatcaa accggttacg atctttggga 660
ggaggtaaat gggagttcat ttttcactat agctgttcag catagagctt tggtggaagg 720
tagtacattc gcatctaaag ttggtgcttc atgctcctgg tgcgattcac aagctcctca 780
agtgctttgc ttcctacaaa ggttttggac tggttcttac ataatggcaa actttggagg 840
gggtagatcc ggtaaagatg ctaatacagt tctggggagt attcatacct tcgaccctaa 900
tgcgggttgt gacgacacga ctttccagcc atgctcacca cgtgcgttgg caaaccataa 960
agtctatact gactctttta gatctatcta cagtataaat tctggcatta gctctggtaa 1020
ggctgtggca gttggaagat accccgaaga ttcttactat aacggtaacc cgtggtttct 1080
taccacattg gctgctgcag aacaacttta tgatgccatc tatcaatggc aaaaaatcgg 1140
atctatcacc attacagacg tatctttggc ttttttcaaa gacctttatt cttcagccgc 1200
tgtgggtact tacgcctcca gttcctcagc attcactagt atagtttctg cggtaaaaac 1260
ctatgctgat ggttatatgt ctatagtcca gacacatgct atgacaaacg gatcattaag 1320
tgagcagttt ggtaaatctg acggtttttc tttgtctgca agagatttaa cctggtctta 1380
tgctgctctg ttgactgcaa atcttaggag gaactccgtc gttccaccct cttggggtga 1440
aactactgca acatcagtcc ccagtgtgtg ttcagccact agtgctacag ggacatatag 1500
tactgctact aacactgctt ggccgtctac attgactagc ggtacaggag ccacaaccac 1560
gacatcaaaa gctacgtctt catcaactac cactacatct tctgcgtcta gtacgacagt 1620
tgagtgtgta gttccaacag ctgtggcggt cacttttgat gaggtcgcaa ccactacata 1680
cggtgaaaat gtttacgtcg tcggttcaat atcacagttg ggttcttggg acacgtctaa 1740
agcagtggca ttatctgcat ccaaatatac ctcctccaat aacctgtggt atgtgactgt 1800
gacattgcca gcaggaacaa catttcaata caaatttatc agagtgagtt cttctggtag 1860
tgtcacctgg gagtcagatc cgaaccgttc ttacacagta ccatcagcct gtggcaccag 1920
cacggctgta gttaatacaa cttggagata gttaattaaa caggcccctt ttcctttgtc 1980
gatatcatgt aattagtta 1999
<210> 11
<211> 2005
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 11
cactaagaac ttagtttcga ataaacacac ataaacaaac aaatctagaa tgaagttcat 60
ttccactttc ttgaccttca ttttggctgc tgtctctgtc accgctgccc cccaattagc 120
acccagagct acgactagct tagacgcatg gttagcgtcc gaaacaacag ttgctttaga 180
tggaatacta gacaatgttg gatcaagcgg agcctatgcc aagtctgcaa aaagcggtat 240
tgtgatagcg tccccttcta ctagtgaccc agattattat tatacctgga ccagagatgc 300
tgcgttgacc gtaaaagcct tgatcgattt gtttagaaac ggagaaacat ccttacaaac 360
agtaattatg gaatacatat ctagccaagc atatttacaa acagtttcta atccatccgg 420
atcgttaagt accggtggtt tggctgaacc aaaatactac gtagatgaaa ctgcgtacac 480
tggaagctgg ggaaggccac aaagagatgg ccctgcccta agagccactg ctatgatcga 540
ttttggaaat tggctgatcg ataatggtta ctctacttac gccagttcca ttgtctggcc 600
tattgttaga aacgatcttt cttatgttgc gcagtactgg aatcaaaccg gttacgatct 660
ttgggaggag gtaaatggga gttcattttt cactatagct gttcagcata gagctttggt 720
ggaaggtagt acattcgcat ctaaagttgg tgcttcatgc tcctggtgcg attcacaagc 780
tcctcaagtg ctttgcttcc tacaaaggtt ttggactggt tcttacataa tggcaaactt 840
tggagggggt agatccggta aagatgctaa tacagttctg gggagtattc ataccttcga 900
ccctaatgcg ggttgtgacg acacgacttt ccagccatgc tcaccacgtg cgttggcaaa 960
ccataaagtc tatactgact cttttagatc tatctacagt ataaattctg gcattagctc 1020
tggtaaggct gtggcagttg gaagataccc cgaagattct tactataacg gtaacccgtg 1080
gtttcttacc acattggctg ctgcagaaca actttatgat gccatctatc aatggcaaaa 1140
aatcggatct atcaccatta cagacgtatc tttggctttt ttcaaagacc tttattcttc 1200
agccgctgtg ggtacttacg cctccagttc ctcagcattc actagtatag tttctgcggt 1260
aaaaacctat gctgatggtt atatgtctat agtccagaca catgctatga caaacggatc 1320
attaagtgag cagtttggta aatctgacgg tttttctttg tctgcaagag atttaacctg 1380
gtcttatgct gctctgttga ctgcaaatct taggaggaac tccgtcgttc caccctcttg 1440
gggtgaaact actgcaacat cagtccccag tgtgtgttca gccactagtg ctacagggac 1500
atatagtact gctactaaca ctgcttggcc gtctacattg actagcggta caggagccac 1560
aaccacgaca tcaaaagcta cgtcttcatc aactaccact acatcttctg cgtctagtac 1620
gacagttgag tgtgtagttc caacagctgt ggcggtcact tttgatgagg tcgcaaccac 1680
tacatacggt gaaaatgttt acgtcgtcgg ttcaatatca cagttgggtt cttgggacac 1740
gtctaaagca gtggcattat ctgcatccaa atatacctcc tccaataacc tgtggtatgt 1800
gactgtgaca ttgccagcag gaacaacatt tcaatacaaa tttatcagag tgagttcttc 1860
tggtagtgtc acctgggagt cagatccgaa ccgttcttac acagtaccat cagcctgtgg 1920
caccagcacg gctgtagtta atacaacttg gagatagtta attaaacagg ccccttttcc 1980
tttgtcgata tcatgtaatt agtta 2005
<210> 12
<211> 5745
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 12
tcgagatctc ccgagtttat cattatcaat actgccattt caaagaatac gtaaataatt 60
aatagtagtg attttcctaa ctttatttag tcaaaaaatt ggccttttaa ttctgctgta 120
acccgtacat gcccaaaata gggggcgggt tacacagaat atataacatc ataggtgtct 180
gggtgaacag tttattcctg gcatccacta aatataatgg agcccgcttt ttttaagctg 240
gcatccagaa aaaaaaagaa tcccagcacc aaaatattgt tttcttcacc aaccatcagt 300
tcataggtcc attctcttag cgcaactaca cagaacaggg gcacaaacag gcaaaaaacg 360
ggcacaacct caatggagtg atgcaacctg cttggagtaa atgatgacac aaggcaattg 420
acctacgcat gtatctatct cattttctta caccttctat taccttctgc tctctctgat 480
ttggaaaaag ctgaaaaaaa aggttgaaac cagttccctg aaattattcc cctatttgac 540
taataagtat ataaagacgg taggtattga ttgtaattct gtaaatctat ttcttaaact 600
tcttaaattc tacttttata gttagtcttt tttttagttt taaaacacta agaacttagt 660
ttcgaataaa cacacataaa caaacaaatc tagattaatt aaacaggccc cttttccttt 720
gtcgatatca tgtaattagt tatgtcacgc ttacattcac gccctcctcc cacatccgct 780
ctaaccgaaa aggaaggagt tagacaacct gaagtctagg tccctattta tttttttata 840
gttatgttag tattaagaac gttatttata tttcaaattt ttcttttttt tctgtacaaa 900
cgcgtgtacg catgtaacgg gcagacgcgg ccgccaccgc ggtggagctc caattcgccc 960
tatagtgagt cgtattacaa ttcactggcc gtcgttttac aacgtcgtga ctgggaaaac 1020
cctggcgtta cccaacttaa tcgccttgca gcacatcccc ccttcgccag ctggcgtaat 1080
agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 1140
cgcgacgcgc cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg 1200
accgctacac ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc 1260
gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga 1320
tttagtgctt tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt 1380
gggccatcgc cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat 1440
agtggactct tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat 1500
ttataaggga ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa 1560
tttaacgcga attttaacaa aatattaacg tttacaattt cctgatgcgg tattttctcc 1620
ttacgcatct gtgcggtatt tcacaccgca gggtaataac tgatataatt aaattgaagc 1680
tctaatttgt gagtttagta tacatgcatt tacttataat acagtttttt agttttgctg 1740
gccgcatctt ctcaaatatg cttcccagcc tgcttttctg taacgttcac cctctacctt 1800
agcatccctt ccctttgcaa atagtcctct tccaacaata ataatgtcag atcctgtaga 1860
gaccacatca tccacggttc tatactgttg acccaatgcg tctcccttgt catctaaacc 1920
cacaccgggt gtcataatca accaatcgta accttcatct cttccaccca tgtctctttg 1980
agcaataaag ccgataacaa aatctttgtc gctcttcgca atgtcaacag tacccttagt 2040
atattctcca gtagataggg agcccttgca tgacaattct gctaacatca aaaggcctct 2100
aggttccttt gttacttctt ctgccgcctg cttcaaaccg ctaacaatac ctgggcccac 2160
cacaccgtgt gcattcgtaa tgtctgccca ttctgctatt ctgtatacac ccgcagagta 2220
ctgcaatttg actgtattac caatgtcagc aaattttctg tcttcgaaga gtaaaaaatt 2280
gtacttggcg gataatgcct ttagcggctt aactgtgccc tccatggaaa aatcagtcaa 2340
gatatccaca tgtgttttta gtaaacaaat tttgggacct aatgcttcaa ctaactccag 2400
taattccttg gtggtacgaa catccaatga agcacacaag tttgtttgct tttcgtgcat 2460
gatattaaat agcttggcag caacaggact aggatgagta gcagcacgtt ccttatatgt 2520
agctttcgac atgatttatc ttcgtttcct gcaggttttt gttctgtgca gttgggttaa 2580
gaatactggg caatttcatg tttcttcaac actacatatg cgtatatata ccaatctaag 2640
tctgtgctcc ttccttcgtt cttccttctg ttcggagatt accgaatcaa aaaaatttca 2700
aagaaaccga aatcaaaaaa aagaataaaa aaaaaatgat gaattgaatt gaaaagcgtg 2760
gtgcactctc agtacaatct gctctgatgc cgcatagtta agccagcccc gacacccgcc 2820
aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc 2880
tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc 2940
gagacgaaag ggcctcgtga tacgcctatt tttataggtt aatgtcatga taataatggt 3000
ttcttaggac ggatcgcttg cctgtaactt acacgcgcct cgtatctttt aatgatggaa 3060
taatttggga atttactctg tgtttattta tttttatgtt ttgtatttgg attttagaaa 3120
gtaaataaag aaggtagaag agttacggaa tgaagaaaaa aaaataaaca aaggtttaaa 3180
aaatttcaac aaaaagcgta ctttacatat atatttatta gacaagaaaa gcagattaaa 3240
tagatataca ttcgattaac gataagtaaa atgtaaaatc acaggatttt cgtgtgtggt 3300
cttctacaca gacaagatga aacaattcgg cattaatacc tgagagcagg aagagcaaga 3360
taaaaggtag tatttgttgg cgatccccct agagtctttt acatcttcgg aaaacaaaaa 3420
ctattttttc tttaatttct ttttttactt tctattttta atttatatat ttatattaaa 3480
aaatttaaat tataattatt tttatagcac gtgatgaaaa ggacccaggt ggcacttttc 3540
ggggaaatgt gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc 3600
cgctcatgag acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga 3660
gtattcaaca tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt 3720
ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag 3780
tgggttacat cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag 3840
aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta 3900
ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg 3960
agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca 4020
gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag 4080
gaccgaagga gctaaccgct ttttttcaca acatggggga tcatgtaact cgccttgatc 4140
gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg 4200
tagcaatggc aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc 4260
ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg 4320
cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg 4380
gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga 4440
cgggcagtca ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac 4500
tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa 4560
aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca 4620
aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag 4680
gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac 4740
cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa 4800
ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc 4860
accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag 4920
tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac 4980
cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc 5040
gaacgaccta caccgaactg agatacctac agcgtgagca ttgagaaagc gccacgcttc 5100
ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca 5160
cgagggagct tccagggggg aacgcctggt atctttatag tcctgtcggg tttcgccacc 5220
tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gccgagccta tggaaaaacg 5280
ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct 5340
ttcctgcgtt atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata 5400
ccgctcgccg cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc 5460
gcccaatacg caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg 5520
acaggtttcc cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttacctca 5580
ctcattaggc accccaggct ttacacttta tgcttccggc tcctatgttg tgtggaattg 5640
tgagcggata acaatttcac acaggaaaca gctatgacca tgattacgcc aagctcggaa 5700
ttaaccctca ctaaagggaa caaaagctgg gtaccgggcc ccccc 5745
<210> 13
<211> 1932
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 13
cactaagaac ttagtttcga ataaacacac ataaacaaac aaatctagaa tgcagttatt 60
caatctacca ctgaaggtga gtttcttctt ggtcttgtct tatttctcat tattggtttc 120
tgccgcatct attccatcta gtgcatctgt acaattggac tcctacaatt acgatggttc 180
cacattttcc ggcaagattt atgtcaaaaa catcgcttac tctaaaaagg ttactgttgt 240
gtacgcagac ggttctgaca actggaacaa taacggcaac actattgctg catcattttc 300
aggcccaatc tctggatcaa attacgaata ctggacattc tcagcatcag tgaagggcat 360
aaaggagttc tacatcaaat acgaagtttc aggtaagaca tattacgaca ataacaactc 420
tgcaaactac caagtctcaa cttctaaacc tactacaact actgcagcta caaccacaac 480
tacagctcca tcaacttcta caacaacccg tccatctagt tcagagcctg ccaccttccc 540
tactggtaat tctaccatca gctcttggat caaaaagcag gaagatattt ccagattcgc 600
tatgcttaga aacatcaacc cacctggttc tgccacaggg tttatcgccg catcactctc 660
taccgctggt ccagattact actacgcgtg gacaagagat gccgctttga catctaacgt 720
tatcgtttac gaatacaaca ccacattgtc tgggaataag acaattctaa acgtacttaa 780
ggattacgtc acattcagtg ttaagacaca gtctacttca acagtttgta attgccttgg 840
tgaaccaaag ttcaatccag acggcagtgg ttacacaggt gcttggggta gacctcaaaa 900
tgatggtcct gcagaaagag cgactacatt tgttctgttt gccgacagct acttgactca 960
aactaaggat gcctcatacg tcactggtac attaaagcca gcaattttca aagatctcga 1020
ttacgttgtt aacgtctgga gtaacggatg tttcgattta tgggaggagg tgaacggagt 1080
tcatttctac acccttatgg ttatgagaaa agggctattg ttgggggctg atttcgcgaa 1140
gagaaacggt gactcaacta gagcctcaac ttactcttct actgcttcca caattgctaa 1200
caagatatca agtttctggg ttagctcaaa caactgggtg caagtatccc aatctgtcac 1260
aggaggtgta agtaaaaagg ggttagacgt tagcaccctg ttagctgcga atctaggatc 1320
agtcgatgat ggatttttca ctccaggttc tgaaaagata ttagctacag ctgtggcagt 1380
cgaagattcc tttgccagtc tatacccaat caacaaaaac cttccatcat acttggggaa 1440
cgctattgga agataccctg aagatacata caacggtaat ggtaactcac aaggcaatcc 1500
ttggtttctg gcggttaccg gctacgcaga gttgtactat agagcaatta aggaatggat 1560
ttctaatgga ggcgttacag tgtcctctat ctcattgcca tttttcaaaa agttcgatag 1620
ctctgcaaca tccggtaaaa agtacaccgt aggtacttct gacttcaaca atttagcaca 1680
aaacattgct cttgctgcag atcgtttcct atctactgta caactccatg caccaaacaa 1740
tggttcatta gcagaggaat ttgatagaac aacaggtttt tctaccggcg ctagagattt 1800
aacatggtcc cacgcctcat tgataacagc atcctatgcc aaagccggtg ctccagctgc 1860
ataattaatt aaacaggccc cttttccttt gtcgatatca tgtaattagt tatgtcacgc 1920
ttacattcac gc 1932
<210> 14
<211> 1908
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 14
cactaagaac ttagtttcga ataaacacac ataaacaaac aaatctagaa tgttcaagtc 60
tgttgtttac tctattttgg ctgcctcttt ggctaacgct gcatctattc catctagtgc 120
atctgtacaa ttggactcct acaattacga tggttccaca ttttccggca agatttatgt 180
caaaaacatc gcttactcta aaaaggttac tgttgtgtac gcagacggtt ctgacaactg 240
gaacaataac ggcaacacta ttgctgcatc attttcaggc ccaatctctg gatcaaatta 300
cgaatactgg acattctcag catcagtgaa gggcataaag gagttctaca tcaaatacga 360
agtttcaggt aagacatatt acgacaataa caactctgca aactaccaag tctcaacttc 420
taaacctact acaactactg cagctacaac cacaactaca gctccatcaa cttctacaac 480
aacccgtcca tctagttcag agcctgccac cttccctact ggtaattcta ccatcagctc 540
ttggatcaaa aagcaggaag atatttccag attcgctatg cttagaaaca tcaacccacc 600
tggttctgcc acagggttta tcgccgcatc actctctacc gctggtccag attactacta 660
cgcgtggaca agagatgccg ctttgacatc taacgttatc gtttacgaat acaacaccac 720
attgtctggg aataagacaa ttctaaacgt acttaaggat tacgtcacat tcagtgttaa 780
gacacagtct acttcaacag tttgtaattg ccttggtgaa ccaaagttca atccagacgg 840
cagtggttac acaggtgctt ggggtagacc tcaaaatgat ggtcctgcag aaagagcgac 900
tacatttgtt ctgtttgccg acagctactt gactcaaact aaggatgcct catacgtcac 960
tggtacatta aagccagcaa ttttcaaaga tctcgattac gttgttaacg tctggagtaa 1020
cggatgtttc gatttatggg aggaggtgaa cggagttcat ttctacaccc ttatggttat 1080
gagaaaaggg ctattgttgg gggctgattt cgcgaagaga aacggtgact caactagagc 1140
ctcaacttac tcttctactg cttccacaat tgctaacaag atatcaagtt tctgggttag 1200
ctcaaacaac tgggtgcaag tatcccaatc tgtcacagga ggtgtaagta aaaaggggtt 1260
agacgttagc accctgttag ctgcgaatct aggatcagtc gatgatggat ttttcactcc 1320
aggttctgaa aagatattag ctacagctgt ggcagtcgaa gattcctttg ccagtctata 1380
cccaatcaac aaaaaccttc catcatactt ggggaacgct attggaagat accctgaaga 1440
tacatacaac ggtaatggta actcacaagg caatccttgg tttctggcgg ttaccggcta 1500
cgcagagttg tactatagag caattaagga atggatttct aatggaggcg ttacagtgtc 1560
ctctatctca ttgccatttt tcaaaaagtt cgatagctct gcaacatccg gtaaaaagta 1620
caccgtaggt acttctgact tcaacaattt agcacaaaac attgctcttg ctgcagatcg 1680
tttcctatct actgtacaac tccatgcacc aaacaatggt tcattagcag aggaatttga 1740
tagaacaaca ggtttttcta ccggcgctag agatttaaca tggtcccacg cctcattgat 1800
aacagcatcc tatgccaaag ccggtgctcc agctgcataa ttaattaaac aggccccttt 1860
tcctttgtcg atatcatgta attagttatg tcacgcttac attcacgc 1908
<210> 15
<211> 1914
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 15
cactaagaac ttagtttcga ataaacacac ataaacaaac aaatctagaa tgaagttcat 60
ttccactttc ttgaccttca ttttggctgc tgtctctgtc accgctgcat ctattccatc 120
tagtgcatct gtacaattgg actcctacaa ttacgatggt tccacatttt ccggcaagat 180
ttatgtcaaa aacatcgctt actctaaaaa ggttactgtt gtgtacgcag acggttctga 240
caactggaac aataacggca acactattgc tgcatcattt tcaggcccaa tctctggatc 300
aaattacgaa tactggacat tctcagcatc agtgaagggc ataaaggagt tctacatcaa 360
atacgaagtt tcaggtaaga catattacga caataacaac tctgcaaact accaagtctc 420
aacttctaaa cctactacaa ctactgcagc tacaaccaca actacagctc catcaacttc 480
tacaacaacc cgtccatcta gttcagagcc tgccaccttc cctactggta attctaccat 540
cagctcttgg atcaaaaagc aggaagatat ttccagattc gctatgctta gaaacatcaa 600
cccacctggt tctgccacag ggtttatcgc cgcatcactc tctaccgctg gtccagatta 660
ctactacgcg tggacaagag atgccgcttt gacatctaac gttatcgttt acgaatacaa 720
caccacattg tctgggaata agacaattct aaacgtactt aaggattacg tcacattcag 780
tgttaagaca cagtctactt caacagtttg taattgcctt ggtgaaccaa agttcaatcc 840
agacggcagt ggttacacag gtgcttgggg tagacctcaa aatgatggtc ctgcagaaag 900
agcgactaca tttgttctgt ttgccgacag ctacttgact caaactaagg atgcctcata 960
cgtcactggt acattaaagc cagcaatttt caaagatctc gattacgttg ttaacgtctg 1020
gagtaacgga tgtttcgatt tatgggagga ggtgaacgga gttcatttct acacccttat 1080
ggttatgaga aaagggctat tgttgggggc tgatttcgcg aagagaaacg gtgactcaac 1140
tagagcctca acttactctt ctactgcttc cacaattgct aacaagatat caagtttctg 1200
ggttagctca aacaactggg tgcaagtatc ccaatctgtc acaggaggtg taagtaaaaa 1260
ggggttagac gttagcaccc tgttagctgc gaatctagga tcagtcgatg atggattttt 1320
cactccaggt tctgaaaaga tattagctac agctgtggca gtcgaagatt cctttgccag 1380
tctataccca atcaacaaaa accttccatc atacttgggg aacgctattg gaagataccc 1440
tgaagataca tacaacggta atggtaactc acaaggcaat ccttggtttc tggcggttac 1500
cggctacgca gagttgtact atagagcaat taaggaatgg atttctaatg gaggcgttac 1560
agtgtcctct atctcattgc catttttcaa aaagttcgat agctctgcaa catccggtaa 1620
aaagtacacc gtaggtactt ctgacttcaa caatttagca caaaacattg ctcttgctgc 1680
agatcgtttc ctatctactg tacaactcca tgcaccaaac aatggttcat tagcagagga 1740
atttgataga acaacaggtt tttctaccgg cgctagagat ttaacatggt cccacgcctc 1800
attgataaca gcatcctatg ccaaagccgg tgctccagct gcataattaa ttaaacaggc 1860
cccttttcct ttgtcgatat catgtaatta gttatgtcac gcttacattc acgc 1914
<210> 16
<211> 1920
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 16
cactaagaac ttagtttcga ataaacacac ataaacaaac aaatctagaa tgaagttcat 60
ttccactttc ttgaccttca ttttggctgc tgtctctgtc accgctgcaa agagatctat 120
tccatctagt gcatctgtac aattggactc ctacaattac gatggttcca cattttccgg 180
caagatttat gtcaaaaaca tcgcttactc taaaaaggtt actgttgtgt acgcagacgg 240
ttctgacaac tggaacaata acggcaacac tattgctgca tcattttcag gcccaatctc 300
tggatcaaat tacgaatact ggacattctc agcatcagtg aagggcataa aggagttcta 360
catcaaatac gaagtttcag gtaagacata ttacgacaat aacaactctg caaactacca 420
agtctcaact tctaaaccta ctacaactac tgcagctaca accacaacta cagctccatc 480
aacttctaca acaacccgtc catctagttc agagcctgcc accttcccta ctggtaattc 540
taccatcagc tcttggatca aaaagcagga agatatttcc agattcgcta tgcttagaaa 600
catcaaccca cctggttctg ccacagggtt tatcgccgca tcactctcta ccgctggtcc 660
agattactac tacgcgtgga caagagatgc cgctttgaca tctaacgtta tcgtttacga 720
atacaacacc acattgtctg ggaataagac aattctaaac gtacttaagg attacgtcac 780
attcagtgtt aagacacagt ctacttcaac agtttgtaat tgccttggtg aaccaaagtt 840
caatccagac ggcagtggtt acacaggtgc ttggggtaga cctcaaaatg atggtcctgc 900
agaaagagcg actacatttg ttctgtttgc cgacagctac ttgactcaaa ctaaggatgc 960
ctcatacgtc actggtacat taaagccagc aattttcaaa gatctcgatt acgttgttaa 1020
cgtctggagt aacggatgtt tcgatttatg ggaggaggtg aacggagttc atttctacac 1080
ccttatggtt atgagaaaag ggctattgtt gggggctgat ttcgcgaaga gaaacggtga 1140
ctcaactaga gcctcaactt actcttctac tgcttccaca attgctaaca agatatcaag 1200
tttctgggtt agctcaaaca actgggtgca agtatcccaa tctgtcacag gaggtgtaag 1260
taaaaagggg ttagacgtta gcaccctgtt agctgcgaat ctaggatcag tcgatgatgg 1320
atttttcact ccaggttctg aaaagatatt agctacagct gtggcagtcg aagattcctt 1380
tgccagtcta tacccaatca acaaaaacct tccatcatac ttggggaacg ctattggaag 1440
ataccctgaa gatacataca acggtaatgg taactcacaa ggcaatcctt ggtttctggc 1500
ggttaccggc tacgcagagt tgtactatag agcaattaag gaatggattt ctaatggagg 1560
cgttacagtg tcctctatct cattgccatt tttcaaaaag ttcgatagct ctgcaacatc 1620
cggtaaaaag tacaccgtag gtacttctga cttcaacaat ttagcacaaa acattgctct 1680
tgctgcagat cgtttcctat ctactgtaca actccatgca ccaaacaatg gttcattagc 1740
agaggaattt gatagaacaa caggtttttc taccggcgct agagatttaa catggtccca 1800
cgcctcattg ataacagcat cctatgccaa agccggtgct ccagctgcat aattaattaa 1860
acaggcccct tttcctttgt cgatatcatg taattagtta tgtcacgctt acattcacgc 1920
<210> 17
<211> 7542
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 17
tcgagatctc ccgagtttat cattatcaat actgccattt caaagaatac gtaaataatt 60
aatagtagtg attttcctaa ctttatttag tcaaaaaatt ggccttttaa ttctgctgta 120
acccgtacat gcccaaaata gggggcgggt tacacagaat atataacatc ataggtgtct 180
gggtgaacag tttattcctg gcatccacta aatataatgg agcccgcttt ttttaagctg 240
gcatccagaa aaaaaaagaa tcccagcacc aaaatattgt tttcttcacc aaccatcagt 300
tcataggtcc attctcttag cgcaactaca cagaacaggg gcacaaacag gcaaaaaacg 360
ggcacaacct caatggagtg atgcaacctg cttggagtaa atgatgacac aaggcaattg 420
acctacgcat gtatctatct cattttctta caccttctat taccttctgc tctctctgat 480
ttggaaaaag ctgaaaaaaa aggttgaaac cagttccctg aaattattcc cctatttgac 540
taataagtat ataaagacgg taggtattga ttgtaattct gtaaatctat ttcttaaact 600
tcttaaattc tacttttata gttagtcttt tttttagttt taaaacacta agaacttagt 660
ttcgaataaa cacacataaa caaacaaatc tagaatgaag ttcatttcca ctttcttgac 720
cttcattttg gctgctgtct ctgtcaccgc tgcatctatt ccatctagtg catctgtaca 780
attggactcc tacaattacg atggttccac attttccggc aagatttatg tcaaaaacat 840
cgcttactct aaaaaggtta ctgttgtgta cgcagacggt tctgacaact ggaacaataa 900
cggcaacact attgctgcat cattttcagg cccaatctct ggatcaaatt acgaatactg 960
gacattctca gcatcagtga agggcataaa ggagttctac atcaaatacg aagtttcagg 1020
taagacatat tacgacaata acaactctgc aaactaccaa gtctcaactt ctaaacctac 1080
tacaactact gcagctacaa ccacaactac agctccatca acttctacaa caacccgtcc 1140
atctagttca gagcctgcca ccttccctac tggtaattct accatcagct cttggatcaa 1200
aaagcaggaa gatatttcca gattcgctat gcttagaaac atcaacccac ctggttctgc 1260
cacagggttt atcgccgcat cactctctac cgctggtcca gattactact acgcgtggac 1320
aagagatgcc gctttgacat ctaacgttat cgtttacgaa tacaacacca cattgtctgg 1380
gaataagaca attctaaacg tacttaagga ttacgtcaca ttcagtgtta agacacagtc 1440
tacttcaaca gtttgtaatt gccttggtga accaaagttc aatccagacg gcagtggtta 1500
cacaggtgct tggggtagac ctcaaaatga tggtcctgca gaaagagcga ctacatttgt 1560
tctgtttgcc gacagctact tgactcaaac taaggatgcc tcatacgtca ctggtacatt 1620
aaagccagca attttcaaag atctcgatta cgttgttaac gtctggagta acggatgttt 1680
cgatttatgg gaggaggtga acggagttca tttctacacc cttatggtta tgagaaaagg 1740
gctattgttg ggggctgatt tcgcgaagag aaacggtgac tcaactagag cctcaactta 1800
ctcttctact gcttccacaa ttgctaacaa gatatcaagt ttctgggtta gctcaaacaa 1860
ctgggtgcaa gtatcccaat ctgtcacagg aggtgtaagt aaaaaggggt tagacgttag 1920
caccctgtta gctgcgaatc taggatcagt cgatgatgga tttttcactc caggttctga 1980
aaagatatta gctacagctg tggcagtcga agattccttt gccagtctat acccaatcaa 2040
caaaaacctt ccatcatact tggggaacgc tattggaaga taccctgaag atacatacaa 2100
cggtaatggt aactcacaag gcaatccttg gtttctggcg gttaccggct acgcagagtt 2160
gtactataga gcaattaagg aatggatttc taatggaggc gttacagtgt cctctatctc 2220
attgccattt ttcaaaaagt tcgatagctc tgcaacatcc ggtaaaaagt acaccgtagg 2280
tacttctgac ttcaacaatt tagcacaaaa cattgctctt gctgcagatc gtttcctatc 2340
tactgtacaa ctccatgcac caaacaatgg ttcattagca gaggaatttg atagaacaac 2400
aggtttttct accggcgcta gagatttaac atggtcccac gcctcattga taacagcatc 2460
ctatgccaaa gccggtgctc cagctgcata attaattaaa caggcccctt ttcctttgtc 2520
gatatcatgt aattagttat gtcacgctta cattcacgcc ctcctcccac atccgctcta 2580
accgaaaagg aaggagttag acaacctgaa gtctaggtcc ctatttattt ttttatagtt 2640
atgttagtat taagaacgtt atttatattt caaatttttc ttttttttct gtacaaacgc 2700
gtgtacgcat gtaacgggca gacgcggccg ccaccgcggt ggagctccaa ttcgccctat 2760
agtgagtcgt attacaattc actggccgtc gttttacaac gtcgtgactg ggaaaaccct 2820
ggcgttaccc aacttaatcg ccttgcagca catcccccct tcgccagctg gcgtaatagc 2880
gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc 2940
gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc 3000
gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc 3060
acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt 3120
agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg 3180
ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt 3240
ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta 3300
taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt 3360
aacgcgaatt ttaacaaaat attaacgttt acaatttcct gatgcggtat tttctcctta 3420
cgcatctgtg cggtatttca caccgcaggg taataactga tataattaaa ttgaagctct 3480
aatttgtgag tttagtatac atgcatttac ttataataca gttttttagt tttgctggcc 3540
gcatcttctc aaatatgctt cccagcctgc ttttctgtaa cgttcaccct ctaccttagc 3600
atcccttccc tttgcaaata gtcctcttcc aacaataata atgtcagatc ctgtagagac 3660
cacatcatcc acggttctat actgttgacc caatgcgtct cccttgtcat ctaaacccac 3720
accgggtgtc ataatcaacc aatcgtaacc ttcatctctt ccacccatgt ctctttgagc 3780
aataaagccg ataacaaaat ctttgtcgct cttcgcaatg tcaacagtac ccttagtata 3840
ttctccagta gatagggagc ccttgcatga caattctgct aacatcaaaa ggcctctagg 3900
ttcctttgtt acttcttctg ccgcctgctt caaaccgcta acaatacctg ggcccaccac 3960
accgtgtgca ttcgtaatgt ctgcccattc tgctattctg tatacacccg cagagtactg 4020
caatttgact gtattaccaa tgtcagcaaa ttttctgtct tcgaagagta aaaaattgta 4080
cttggcggat aatgccttta gcggcttaac tgtgccctcc atggaaaaat cagtcaagat 4140
atccacatgt gtttttagta aacaaatttt gggacctaat gcttcaacta actccagtaa 4200
ttccttggtg gtacgaacat ccaatgaagc acacaagttt gtttgctttt cgtgcatgat 4260
attaaatagc ttggcagcaa caggactagg atgagtagca gcacgttcct tatatgtagc 4320
tttcgacatg atttatcttc gtttcctgca ggtttttgtt ctgtgcagtt gggttaagaa 4380
tactgggcaa tttcatgttt cttcaacact acatatgcgt atatatacca atctaagtct 4440
gtgctccttc cttcgttctt ccttctgttc ggagattacc gaatcaaaaa aatttcaaag 4500
aaaccgaaat caaaaaaaag aataaaaaaa aaatgatgaa ttgaattgaa aagcgtggtg 4560
cactctcagt acaatctgct ctgatgccgc atagttaagc cagccccgac acccgccaac 4620
acccgctgac gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt 4680
gaccgtctcc gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag 4740
acgaaagggc ctcgtgatac gcctattttt ataggttaat gtcatgataa taatggtttc 4800
ttaggacgga tcgcttgcct gtaacttaca cgcgcctcgt atcttttaat gatggaataa 4860
tttgggaatt tactctgtgt ttatttattt ttatgttttg tatttggatt ttagaaagta 4920
aataaagaag gtagaagagt tacggaatga agaaaaaaaa ataaacaaag gtttaaaaaa 4980
tttcaacaaa aagcgtactt tacatatata tttattagac aagaaaagca gattaaatag 5040
atatacattc gattaacgat aagtaaaatg taaaatcaca ggattttcgt gtgtggtctt 5100
ctacacagac aagatgaaac aattcggcat taatacctga gagcaggaag agcaagataa 5160
aaggtagtat ttgttggcga tccccctaga gtcttttaca tcttcggaaa acaaaaacta 5220
ttttttcttt aatttctttt tttactttct atttttaatt tatatattta tattaaaaaa 5280
tttaaattat aattattttt atagcacgtg atgaaaagga cccaggtggc acttttcggg 5340
gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc 5400
tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta 5460
ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg 5520
ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg 5580
gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac 5640
gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg 5700
acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt 5760
actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg 5820
ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac 5880
cgaaggagct aaccgctttt tttcacaaca tgggggatca tgtaactcgc cttgatcgtt 5940
gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag 6000
caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc 6060
aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc 6120
ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta 6180
tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg 6240
gcagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga 6300
ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac 6360
ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa 6420
tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat 6480
cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc 6540
taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg 6600
gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc 6660
acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg 6720
ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg 6780
ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa 6840
cgacctacac cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg 6900
aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga 6960
gggagcttcc aggggggaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct 7020
gacttgagcg tcgatttttg tgatgctcgt caggggggcc gagcctatgg aaaaacgcca 7080
gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc 7140
ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg 7200
ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc 7260
caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca 7320
ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tacctcactc 7380
attaggcacc ccaggcttta cactttatgc ttccggctcc tatgttgtgt ggaattgtga 7440
gcggataaca atttcacaca ggaaacagct atgaccatga ttacgccaag ctcggaatta 7500
accctcacta aagggaacaa aagctgggta ccgggccccc cc 7542
<210> 18
<211> 7560
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 18
atctcccgag tttatcatta tcaatactgc catttcaaag aatacgtaaa taattaatag 60
tagtgatttt cctaacttta tttagtcaaa aaattggcct tttaattctg ctgtaacccg 120
tacatgccca aaataggggg cgggttacac agaatatata acatcatagg tgtctgggtg 180
aacagtttat tcctggcatc cactaaatat aatggagccc gcttttttta agctggcatc 240
cagaaaaaaa aagaatccca gcaccaaaat attgttttct tcaccaacca tcagttcata 300
ggtccattct cttagcgcaa ctacacagaa caggggcaca aacaggcaaa aaacgggcac 360
aacctcaatg gagtgatgca acctgcttgg agtaaatgat gacacaaggc aattgaccta 420
cgcatgtatc tatctcattt tcttacacct tctattacct tctgctctct ctgatttgga 480
aaaagctgaa aaaaaaggtt gaaaccagtt ccctgaaatt attcccctat ttgactaata 540
agtatataaa gacggtaggt attgattgta attctgtaaa tctatttctt aaacttctta 600
aattctactt ttatagttag tctttttttt agttttaaaa cactaagaac ttagtttcga 660
ataaacacac ataaacaaac aaatctagaa tgcagttatt caatctacca ctgaaggtga 720
gtttcttctt ggtcttgtct tatttctcat tattggtttc tgccgcatct attccatcta 780
gtgcatctgt acaattggac tcctacaatt acgatggttc cacattttcc ggcaagattt 840
atgtcaaaaa catcgcttac tctaaaaagg ttactgttgt gtacgcagac ggttctgaca 900
actggaacaa taacggcaac actattgctg catcattttc aggcccaatc tctggatcaa 960
attacgaata ctggacattc tcagcatcag tgaagggcat aaaggagttc tacatcaaat 1020
acgaagtttc aggtaagaca tattacgaca ataacaactc tgcaaactac caagtctcaa 1080
cttctaaacc tactacaact actgcagcta caaccacaac tacagctcca tcaacttcta 1140
caacaacccg tccatctagt tcagagcctg ccaccttccc tactggtaat tctaccatca 1200
gctcttggat caaaaagcag gaagatattt ccagattcgc tatgcttaga aacatcaacc 1260
cacctggttc tgccacaggg tttatcgccg catcactctc taccgctggt ccagattact 1320
actacgcgtg gacaagagat gccgctttga catctaacgt tatcgtttac gaatacaaca 1380
ccacattgtc tgggaataag acaattctaa acgtacttaa ggattacgtc acattcagtg 1440
ttaagacaca gtctacttca acagtttgta attgccttgg tgaaccaaag ttcaatccag 1500
acggcagtgg ttacacaggt gcttggggta gacctcaaaa tgatggtcct gcagaaagag 1560
cgactacatt tgttctgttt gccgacagct acttgactca aactaaggat gcctcatacg 1620
tcactggtac attaaagcca gcaattttca aagatctcga ttacgttgtt aacgtctgga 1680
gtaacggatg tttcgattta tgggaggagg tgaacggagt tcatttctac acccttatgg 1740
ttatgagaaa agggctattg ttgggggctg atttcgcgaa gagaaacggt gactcaacta 1800
gagcctcaac ttactcttct actgcttcca caattgctaa caagatatca agtttctggg 1860
ttagctcaaa caactgggtg caagtatccc aatctgtcac aggaggtgta agtaaaaagg 1920
ggttagacgt tagcaccctg ttagctgcga atctaggatc agtcgatgat ggatttttca 1980
ctccaggttc tgaaaagata ttagctacag ctgtggcagt cgaagattcc tttgccagtc 2040
tatacccaat caacaaaaac cttccatcat acttggggaa cgctattgga agataccctg 2100
aagatacata caacggtaat ggtaactcac aaggcaatcc ttggtttctg gcggttaccg 2160
gctacgcaga gttgtactat agagcaatta aggaatggat ttctaatgga ggcgttacag 2220
tgtcctctat ctcattgcca tttttcaaaa agttcgatag ctctgcaaca tccggtaaaa 2280
agtacaccgt aggtacttct gacttcaaca atttagcaca aaacattgct cttgctgcag 2340
atcgtttcct atctactgta caactccatg caccaaacaa tggttcatta gcagaggaat 2400
ttgatagaac aacaggtttt tctaccggcg ctagagattt aacatggtcc cacgcctcat 2460
tgataacagc atcctatgcc aaagccggtg ctccagctgc ataattaatt aaacaggccc 2520
cttttccttt gtcgatatca tgtaattagt tatgtcacgc ttacattcac gccctcctcc 2580
cacatccgct ctaaccgaaa aggaaggagt tagacaacct gaagtctagg tccctattta 2640
tttttttata gttatgttag tattaagaac gttatttata tttcaaattt ttcttttttt 2700
tctgtacaaa cgcgtgtacg catgtaacgg gcagacgcgg ccgccaccgc ggtggagctc 2760
caattcgccc tatagtgagt cgtattacaa ttcactggcc gtcgttttac aacgtcgtga 2820
ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc ccttcgccag 2880
ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa 2940
tggcgaatgg cgcgacgcgc cctgtagcgg cgcattaagc gcggcgggtg tggtggttac 3000
gcgcagcgtg accgctacac ttgccagcgc cctagcgccc gctcctttcg ctttcttccc 3060
ttcctttctc gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt 3120
agggttccga tttagtgctt tacggcacct cgaccccaaa aaacttgatt agggtgatgg 3180
ttcacgtagt gggccatcgc cctgatagac ggtttttcgc cctttgacgt tggagtccac 3240
gttctttaat agtggactct tgttccaaac tggaacaaca ctcaacccta tctcggtcta 3300
ttcttttgat ttataaggga ttttgccgat ttcggcctat tggttaaaaa atgagctgat 3360
ttaacaaaaa tttaacgcga attttaacaa aatattaacg tttacaattt cctgatgcgg 3420
tattttctcc ttacgcatct gtgcggtatt tcacaccgca gggtaataac tgatataatt 3480
aaattgaagc tctaatttgt gagtttagta tacatgcatt tacttataat acagtttttt 3540
agttttgctg gccgcatctt ctcaaatatg cttcccagcc tgcttttctg taacgttcac 3600
cctctacctt agcatccctt ccctttgcaa atagtcctct tccaacaata ataatgtcag 3660
atcctgtaga gaccacatca tccacggttc tatactgttg acccaatgcg tctcccttgt 3720
catctaaacc cacaccgggt gtcataatca accaatcgta accttcatct cttccaccca 3780
tgtctctttg agcaataaag ccgataacaa aatctttgtc gctcttcgca atgtcaacag 3840
tacccttagt atattctcca gtagataggg agcccttgca tgacaattct gctaacatca 3900
aaaggcctct aggttccttt gttacttctt ctgccgcctg cttcaaaccg ctaacaatac 3960
ctgggcccac cacaccgtgt gcattcgtaa tgtctgccca ttctgctatt ctgtatacac 4020
ccgcagagta ctgcaatttg actgtattac caatgtcagc aaattttctg tcttcgaaga 4080
gtaaaaaatt gtacttggcg gataatgcct ttagcggctt aactgtgccc tccatggaaa 4140
aatcagtcaa gatatccaca tgtgttttta gtaaacaaat tttgggacct aatgcttcaa 4200
ctaactccag taattccttg gtggtacgaa catccaatga agcacacaag tttgtttgct 4260
tttcgtgcat gatattaaat agcttggcag caacaggact aggatgagta gcagcacgtt 4320
ccttatatgt agctttcgac atgatttatc ttcgtttcct gcaggttttt gttctgtgca 4380
gttgggttaa gaatactggg caatttcatg tttcttcaac actacatatg cgtatatata 4440
ccaatctaag tctgtgctcc ttccttcgtt cttccttctg ttcggagatt accgaatcaa 4500
aaaaatttca aagaaaccga aatcaaaaaa aagaataaaa aaaaaatgat gaattgaatt 4560
gaaaagcgtg gtgcactctc agtacaatct gctctgatgc cgcatagtta agccagcccc 4620
gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt 4680
acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac 4740
cgaaacgcgc gagacgaaag ggcctcgtga tacgcctatt tttataggtt aatgtcatga 4800
taataatggt ttcttaggac ggatcgcttg cctgtaactt acacgcgcct cgtatctttt 4860
aatgatggaa taatttggga atttactctg tgtttattta tttttatgtt ttgtatttgg 4920
attttagaaa gtaaataaag aaggtagaag agttacggaa tgaagaaaaa aaaataaaca 4980
aaggtttaaa aaatttcaac aaaaagcgta ctttacatat atatttatta gacaagaaaa 5040
gcagattaaa tagatataca ttcgattaac gataagtaaa atgtaaaatc acaggatttt 5100
cgtgtgtggt cttctacaca gacaagatga aacaattcgg cattaatacc tgagagcagg 5160
aagagcaaga taaaaggtag tatttgttgg cgatccccct agagtctttt acatcttcgg 5220
aaaacaaaaa ctattttttc tttaatttct ttttttactt tctattttta atttatatat 5280
ttatattaaa aaatttaaat tataattatt tttatagcac gtgatgaaaa ggacccaggt 5340
ggcacttttc ggggaaatgt gcgcggaacc cctatttgtt tatttttcta aatacattca 5400
aatatgtatc cgctcatgag acaataaccc tgataaatgc ttcaataata ttgaaaaagg 5460
aagagtatga gtattcaaca tttccgtgtc gcccttattc ccttttttgc ggcattttgc 5520
cttcctgttt ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga agatcagttg 5580
ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct tgagagtttt 5640
cgccccgaag aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta 5700
ttatcccgta ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat 5760
gacttggttg agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga 5820
gaattatgca gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca 5880
acgatcggag gaccgaagga gctaaccgct ttttttcaca acatggggga tcatgtaact 5940
cgccttgatc gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc 6000
acgatgcctg tagcaatggc aacaacgttg cgcaaactat taactggcga actacttact 6060
ctagcttccc ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt 6120
ctgcgctcgg cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt 6180
gggtctcgcg gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt 6240
atctacacga cgggcagtca ggcaactatg gatgaacgaa atagacagat cgctgagata 6300
ggtgcctcac tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag 6360
attgatttaa aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat 6420
ctcatgacca aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa 6480
aagatcaaag gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca 6540
aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt 6600
ccgaaggtaa ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg 6660
tagttaggcc accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc 6720
ctgttaccag tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga 6780
cgatagttac cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc 6840
agcttggagc gaacgaccta caccgaactg agatacctac agcgtgagca ttgagaaagc 6900
gccacgcttc ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca 6960
ggagagcgca cgagggagct tccagggggg aacgcctggt atctttatag tcctgtcggg 7020
tttcgccacc tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gccgagccta 7080
tggaaaaacg ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttttgct 7140
cacatgttct ttcctgcgtt atcccctgat tctgtggata accgtattac cgcctttgag 7200
tgagctgata ccgctcgccg cagccgaacg accgagcgca gcgagtcagt gagcgaggaa 7260
gcggaagagc gcccaatacg caaaccgcct ctccccgcgc gttggccgat tcattaatgc 7320
agctggcacg acaggtttcc cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg 7380
agttacctca ctcattaggc accccaggct ttacacttta tgcttccggc tcctatgttg 7440
tgtggaattg tgagcggata acaatttcac acaggaaaca gctatgacca tgattacgcc 7500
aagctcggaa ttaaccctca ctaaagggaa caaaagctgg gtaccgggcc ccccctcgag 7560
<210> 19
<211> 7752
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 19
tcgagatctc ccgagtttat cattatcaat actgccattt caaagaatac gtaaataatt 60
aatagtagtg attttcctaa ctttatttag tcaaaaaatt ggccttttaa ttctgctgta 120
acccgtacat gcccaaaata gggggcgggt tacacagaat atataacatc ataggtgtct 180
gggtgaacag tttattcctg gcatccacta aatataatgg agcccgcttt ttttaagctg 240
gcatccagaa aaaaaaagaa tcccagcacc aaaatattgt tttcttcacc aaccatcagt 300
tcataggtcc attctcttag cgcaactaca cagaacaggg gcacaaacag gcaaaaaacg 360
ggcacaacct caatggagtg atgcaacctg cttggagtaa atgatgacac aaggcaattg 420
acctacgcat gtatctatct cattttctta caccttctat taccttctgc tctctctgat 480
ttggaaaaag ctgaaaaaaa aggttgaaac cagttccctg aaattattcc cctatttgac 540
taataagtat ataaagacgg taggtattga ttgtaattct gtaaatctat ttcttaaact 600
tcttaaattc tacttttata gttagtcttt tttttagttt taaaacacta agaacttagt 660
ttcgaataaa cacacataaa caaacaaatc tagaatgcgt ttcccaagta tcttcaccgc 720
tgttcttttc gctgcctctt ccgcactggc agctcctgtc aacaccacga ctgaggatga 780
gacagcacaa attcctgcgg aggctgtaat cggttacagt gacctggaag gcgattttga 840
cgttgctgtg ttgccgttct caaactctac taacaacgga cttcttttca taaacacgac 900
catagccagc attgcagcta aggaggaagg cgttagcctg gaaaagaggg aagcagaagc 960
cgcatctatt ccatctagtg catctgtaca attggactcc tacaattacg atggttccac 1020
attttccggc aagatttatg tcaaaaacat cgcttactct aaaaaggtta ctgttgtgta 1080
cgcagacggt tctgacaact ggaacaataa cggcaacact attgctgcat cattttcagg 1140
cccaatctct ggatcaaatt acgaatactg gacattctca gcatcagtga agggcataaa 1200
ggagttctac atcaaatacg aagtttcagg taagacatat tacgacaata acaactctgc 1260
aaactaccaa gtctcaactt ctaaacctac tacaactact gcagctacaa ccacaactac 1320
agctccatca acttctacaa caacccgtcc atctagttca gagcctgcca ccttccctac 1380
tggtaattct accatcagct cttggatcaa aaagcaggaa gatatttcca gattcgctat 1440
gcttagaaac atcaacccac ctggttctgc cacagggttt atcgccgcat cactctctac 1500
cgctggtcca gattactact acgcgtggac aagagatgcc gctttgacat ctaacgttat 1560
cgtttacgaa tacaacacca cattgtctgg gaataagaca attctaaacg tacttaagga 1620
ttacgtcaca ttcagtgtta agacacagtc tacttcaaca gtttgtaatt gccttggtga 1680
accaaagttc aatccagacg gcagtggtta cacaggtgct tggggtagac ctcaaaatga 1740
tggtcctgca gaaagagcga ctacatttgt tctgtttgcc gacagctact tgactcaaac 1800
taaggatgcc tcatacgtca ctggtacatt aaagccagca attttcaaag atctcgatta 1860
cgttgttaac gtctggagta acggatgttt cgatttatgg gaggaggtga acggagttca 1920
tttctacacc cttatggtta tgagaaaagg gctattgttg ggggctgatt tcgcgaagag 1980
aaacggtgac tcaactagag cctcaactta ctcttctact gcttccacaa ttgctaacaa 2040
gatatcaagt ttctgggtta gctcaaacaa ctgggtgcaa gtatcccaat ctgtcacagg 2100
aggtgtaagt aaaaaggggt tagacgttag caccctgtta gctgcgaatc taggatcagt 2160
cgatgatgga tttttcactc caggttctga aaagatatta gctacagctg tggcagtcga 2220
agattccttt gccagtctat acccaatcaa caaaaacctt ccatcatact tggggaacgc 2280
tattggaaga taccctgaag atacatacaa cggtaatggt aactcacaag gcaatccttg 2340
gtttctggcg gttaccggct acgcagagtt gtactataga gcaattaagg aatggatttc 2400
taatggaggc gttacagtgt cctctatctc attgccattt ttcaaaaagt tcgatagctc 2460
tgcaacatcc ggtaaaaagt acaccgtagg tacttctgac ttcaacaatt tagcacaaaa 2520
cattgctctt gctgcagatc gtttcctatc tactgtacaa ctccatgcac caaacaatgg 2580
ttcattagca gaggaatttg atagaacaac aggtttttct accggcgcta gagatttaac 2640
atggtcccac gcctcattga taacagcatc ctatgccaaa gccggtgctc cagctgcata 2700
attaattaaa caggcccctt ttcctttgtc gatatcatgt aattagttat gtcacgctta 2760
cattcacgcc ctcctcccac atccgctcta accgaaaagg aaggagttag acaacctgaa 2820
gtctaggtcc ctatttattt ttttatagtt atgttagtat taagaacgtt atttatattt 2880
caaatttttc ttttttttct gtacaaacgc gtgtacgcat gtaacgggca gacgcggccg 2940
ccaccgcggt ggagctccaa ttcgccctat agtgagtcgt attacaattc actggccgtc 3000
gttttacaac gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca 3060
catcccccct tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa 3120
cagttgcgca gcctgaatgg cgaatggcgc gacgcgccct gtagcggcgc attaagcgcg 3180
gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct 3240
cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta 3300
aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa 3360
cttgattagg gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct 3420
ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc 3480
aaccctatct cggtctattc ttttgattta taagggattt tgccgatttc ggcctattgg 3540
ttaaaaaatg agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgttt 3600
acaatttcct gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcaggg 3660
taataactga tataattaaa ttgaagctct aatttgtgag tttagtatac atgcatttac 3720
ttataataca gttttttagt tttgctggcc gcatcttctc aaatatgctt cccagcctgc 3780
ttttctgtaa cgttcaccct ctaccttagc atcccttccc tttgcaaata gtcctcttcc 3840
aacaataata atgtcagatc ctgtagagac cacatcatcc acggttctat actgttgacc 3900
caatgcgtct cccttgtcat ctaaacccac accgggtgtc ataatcaacc aatcgtaacc 3960
ttcatctctt ccacccatgt ctctttgagc aataaagccg ataacaaaat ctttgtcgct 4020
cttcgcaatg tcaacagtac ccttagtata ttctccagta gatagggagc ccttgcatga 4080
caattctgct aacatcaaaa ggcctctagg ttcctttgtt acttcttctg ccgcctgctt 4140
caaaccgcta acaatacctg ggcccaccac accgtgtgca ttcgtaatgt ctgcccattc 4200
tgctattctg tatacacccg cagagtactg caatttgact gtattaccaa tgtcagcaaa 4260
ttttctgtct tcgaagagta aaaaattgta cttggcggat aatgccttta gcggcttaac 4320
tgtgccctcc atggaaaaat cagtcaagat atccacatgt gtttttagta aacaaatttt 4380
gggacctaat gcttcaacta actccagtaa ttccttggtg gtacgaacat ccaatgaagc 4440
acacaagttt gtttgctttt cgtgcatgat attaaatagc ttggcagcaa caggactagg 4500
atgagtagca gcacgttcct tatatgtagc tttcgacatg atttatcttc gtttcctgca 4560
ggtttttgtt ctgtgcagtt gggttaagaa tactgggcaa tttcatgttt cttcaacact 4620
acatatgcgt atatatacca atctaagtct gtgctccttc cttcgttctt ccttctgttc 4680
ggagattacc gaatcaaaaa aatttcaaag aaaccgaaat caaaaaaaag aataaaaaaa 4740
aaatgatgaa ttgaattgaa aagcgtggtg cactctcagt acaatctgct ctgatgccgc 4800
atagttaagc cagccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 4860
gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 4920
gttttcaccg tcatcaccga aacgcgcgag acgaaagggc ctcgtgatac gcctattttt 4980
ataggttaat gtcatgataa taatggtttc ttaggacgga tcgcttgcct gtaacttaca 5040
cgcgcctcgt atcttttaat gatggaataa tttgggaatt tactctgtgt ttatttattt 5100
ttatgttttg tatttggatt ttagaaagta aataaagaag gtagaagagt tacggaatga 5160
agaaaaaaaa ataaacaaag gtttaaaaaa tttcaacaaa aagcgtactt tacatatata 5220
tttattagac aagaaaagca gattaaatag atatacattc gattaacgat aagtaaaatg 5280
taaaatcaca ggattttcgt gtgtggtctt ctacacagac aagatgaaac aattcggcat 5340
taatacctga gagcaggaag agcaagataa aaggtagtat ttgttggcga tccccctaga 5400
gtcttttaca tcttcggaaa acaaaaacta ttttttcttt aatttctttt tttactttct 5460
atttttaatt tatatattta tattaaaaaa tttaaattat aattattttt atagcacgtg 5520
atgaaaagga cccaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat 5580
ttttctaaat acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc 5640
aataatattg aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct 5700
tttttgcggc attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag 5760
atgctgaaga tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta 5820
agatccttga gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc 5880
tgctatgtgg cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca 5940
tacactattc tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg 6000
atggcatgac agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg 6060
ccaacttact tctgacaacg atcggaggac cgaaggagct aaccgctttt tttcacaaca 6120
tgggggatca tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa 6180
acgacgagcg tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa 6240
ctggcgaact acttactcta gcttcccggc aacaattaat agactggatg gaggcggata 6300
aagttgcagg accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat 6360
ctggagccgg tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc 6420
cctcccgtat cgtagttatc tacacgacgg gcagtcaggc aactatggat gaacgaaata 6480
gacagatcgc tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt 6540
actcatatat actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga 6600
agatcctttt tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag 6660
cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa 6720
tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag 6780
agctaccaac tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg 6840
tccttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat 6900
acctcgctct gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta 6960
ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg 7020
gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc 7080
gtgagcattg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa 7140
gcggcagggt cggaacagga gagcgcacga gggagcttcc aggggggaac gcctggtatc 7200
tttatagtcc tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt 7260
caggggggcc gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct 7320
tttgctggcc ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc 7380
gtattaccgc ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg 7440
agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt 7500
ggccgattca ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc 7560
gcaacgcaat taatgtgagt tacctcactc attaggcacc ccaggcttta cactttatgc 7620
ttccggctcc tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct 7680
atgaccatga ttacgccaag ctcggaatta accctcacta aagggaacaa aagctgggta 7740
ccgggccccc cc 7752
<210> 20
<211> 7668
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 20
tcgagatctc ccgagtttat cattatcaat actgccattt caaagaatac gtaaataatt 60
aatagtagtg attttcctaa ctttatttag tcaaaaaatt ggccttttaa ttctgctgta 120
acccgtacat gcccaaaata gggggcgggt tacacagaat atataacatc ataggtgtct 180
gggtgaacag tttattcctg gcatccacta aatataatgg agcccgcttt ttttaagctg 240
gcatccagaa aaaaaaagaa tcccagcacc aaaatattgt tttcttcacc aaccatcagt 300
tcataggtcc attctcttag cgcaactaca cagaacaggg gcacaaacag gcaaaaaacg 360
ggcacaacct caatggagtg atgcaacctg cttggagtaa atgatgacac aaggcaattg 420
acctacgcat gtatctatct cattttctta caccttctat taccttctgc tctctctgat 480
ttggaaaaag ctgaaaaaaa aggttgaaac cagttccctg aaattattcc cctatttgac 540
taataagtat ataaagacgg taggtattga ttgtaattct gtaaatctat ttcttaaact 600
tcttaaattc tacttttata gttagtcttt tttttagttt taaaacacta agaacttagt 660
ttcgaataaa cacacataaa caaacaaatc tagaatgcgt tttcccagca tctttactgc 720
tgtacttttt gccgcgagta gtgccctggc tgctccggtg aatactacaa ctgaagacga 780
attagagggg gatttcgatg tggccgttct accgttcagc gcgagcatag ctgcaaaaga 840
agaaggagtg agcttggaaa aaagggaggc tgaggcggca tctattccat ctagtgcatc 900
tgtacaattg gactcctaca attacgatgg ttccacattt tccggcaaga tttatgtcaa 960
aaacatcgct tactctaaaa aggttactgt tgtgtacgca gacggttctg acaactggaa 1020
caataacggc aacactattg ctgcatcatt ttcaggccca atctctggat caaattacga 1080
atactggaca ttctcagcat cagtgaaggg cataaaggag ttctacatca aatacgaagt 1140
ttcaggtaag acatattacg acaataacaa ctctgcaaac taccaagtct caacttctaa 1200
acctactaca actactgcag ctacaaccac aactacagct ccatcaactt ctacaacaac 1260
ccgtccatct agttcagagc ctgccacctt ccctactggt aattctacca tcagctcttg 1320
gatcaaaaag caggaagata tttccagatt cgctatgctt agaaacatca acccacctgg 1380
ttctgccaca gggtttatcg ccgcatcact ctctaccgct ggtccagatt actactacgc 1440
gtggacaaga gatgccgctt tgacatctaa cgttatcgtt tacgaataca acaccacatt 1500
gtctgggaat aagacaattc taaacgtact taaggattac gtcacattca gtgttaagac 1560
acagtctact tcaacagttt gtaattgcct tggtgaacca aagttcaatc cagacggcag 1620
tggttacaca ggtgcttggg gtagacctca aaatgatggt cctgcagaaa gagcgactac 1680
atttgttctg tttgccgaca gctacttgac tcaaactaag gatgcctcat acgtcactgg 1740
tacattaaag ccagcaattt tcaaagatct cgattacgtt gttaacgtct ggagtaacgg 1800
atgtttcgat ttatgggagg aggtgaacgg agttcatttc tacaccctta tggttatgag 1860
aaaagggcta ttgttggggg ctgatttcgc gaagagaaac ggtgactcaa ctagagcctc 1920
aacttactct tctactgctt ccacaattgc taacaagata tcaagtttct gggttagctc 1980
aaacaactgg gtgcaagtat cccaatctgt cacaggaggt gtaagtaaaa aggggttaga 2040
cgttagcacc ctgttagctg cgaatctagg atcagtcgat gatggatttt tcactccagg 2100
ttctgaaaag atattagcta cagctgtggc agtcgaagat tcctttgcca gtctataccc 2160
aatcaacaaa aaccttccat catacttggg gaacgctatt ggaagatacc ctgaagatac 2220
atacaacggt aatggtaact cacaaggcaa tccttggttt ctggcggtta ccggctacgc 2280
agagttgtac tatagagcaa ttaaggaatg gatttctaat ggaggcgtta cagtgtcctc 2340
tatctcattg ccatttttca aaaagttcga tagctctgca acatccggta aaaagtacac 2400
cgtaggtact tctgacttca acaatttagc acaaaacatt gctcttgctg cagatcgttt 2460
cctatctact gtacaactcc atgcaccaaa caatggttca ttagcagagg aatttgatag 2520
aacaacaggt ttttctaccg gcgctagaga tttaacatgg tcccacgcct cattgataac 2580
agcatcctat gccaaagccg gtgctccagc tgcataatta attaaacagg ccccttttcc 2640
tttgtcgata tcatgtaatt agttatgtca cgcttacatt cacgccctcc tcccacatcc 2700
gctctaaccg aaaaggaagg agttagacaa cctgaagtct aggtccctat ttattttttt 2760
atagttatgt tagtattaag aacgttattt atatttcaaa tttttctttt ttttctgtac 2820
aaacgcgtgt acgcatgtaa cgggcagacg cggccgccac cgcggtggag ctccaattcg 2880
ccctatagtg agtcgtatta caattcactg gccgtcgttt tacaacgtcg tgactgggaa 2940
aaccctggcg ttacccaact taatcgcctt gcagcacatc cccccttcgc cagctggcgt 3000
aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa 3060
tggcgcgacg cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc 3120
gtgaccgcta cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt 3180
ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc 3240
cgatttagtg ctttacggca cctcgacccc aaaaaacttg attagggtga tggttcacgt 3300
agtgggccat cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt 3360
aatagtggac tcttgttcca aactggaaca acactcaacc ctatctcggt ctattctttt 3420
gatttataag ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa 3480
aaatttaacg cgaattttaa caaaatatta acgtttacaa tttcctgatg cggtattttc 3540
tccttacgca tctgtgcggt atttcacacc gcagggtaat aactgatata attaaattga 3600
agctctaatt tgtgagttta gtatacatgc atttacttat aatacagttt tttagttttg 3660
ctggccgcat cttctcaaat atgcttccca gcctgctttt ctgtaacgtt caccctctac 3720
cttagcatcc cttccctttg caaatagtcc tcttccaaca ataataatgt cagatcctgt 3780
agagaccaca tcatccacgg ttctatactg ttgacccaat gcgtctccct tgtcatctaa 3840
acccacaccg ggtgtcataa tcaaccaatc gtaaccttca tctcttccac ccatgtctct 3900
ttgagcaata aagccgataa caaaatcttt gtcgctcttc gcaatgtcaa cagtaccctt 3960
agtatattct ccagtagata gggagccctt gcatgacaat tctgctaaca tcaaaaggcc 4020
tctaggttcc tttgttactt cttctgccgc ctgcttcaaa ccgctaacaa tacctgggcc 4080
caccacaccg tgtgcattcg taatgtctgc ccattctgct attctgtata cacccgcaga 4140
gtactgcaat ttgactgtat taccaatgtc agcaaatttt ctgtcttcga agagtaaaaa 4200
attgtacttg gcggataatg cctttagcgg cttaactgtg ccctccatgg aaaaatcagt 4260
caagatatcc acatgtgttt ttagtaaaca aattttggga cctaatgctt caactaactc 4320
cagtaattcc ttggtggtac gaacatccaa tgaagcacac aagtttgttt gcttttcgtg 4380
catgatatta aatagcttgg cagcaacagg actaggatga gtagcagcac gttccttata 4440
tgtagctttc gacatgattt atcttcgttt cctgcaggtt tttgttctgt gcagttgggt 4500
taagaatact gggcaatttc atgtttcttc aacactacat atgcgtatat ataccaatct 4560
aagtctgtgc tccttccttc gttcttcctt ctgttcggag attaccgaat caaaaaaatt 4620
tcaaagaaac cgaaatcaaa aaaaagaata aaaaaaaaat gatgaattga attgaaaagc 4680
gtggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagc cccgacaccc 4740
gccaacaccc gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg cttacagaca 4800
agctgtgacc gtctccggga gctgcatgtg tcagaggttt tcaccgtcat caccgaaacg 4860
cgcgagacga aagggcctcg tgatacgcct atttttatag gttaatgtca tgataataat 4920
ggtttcttag gacggatcgc ttgcctgtaa cttacacgcg cctcgtatct tttaatgatg 4980
gaataatttg ggaatttact ctgtgtttat ttatttttat gttttgtatt tggattttag 5040
aaagtaaata aagaaggtag aagagttacg gaatgaagaa aaaaaaataa acaaaggttt 5100
aaaaaatttc aacaaaaagc gtactttaca tatatattta ttagacaaga aaagcagatt 5160
aaatagatat acattcgatt aacgataagt aaaatgtaaa atcacaggat tttcgtgtgt 5220
ggtcttctac acagacaaga tgaaacaatt cggcattaat acctgagagc aggaagagca 5280
agataaaagg tagtatttgt tggcgatccc cctagagtct tttacatctt cggaaaacaa 5340
aaactatttt ttctttaatt tcttttttta ctttctattt ttaatttata tatttatatt 5400
aaaaaattta aattataatt atttttatag cacgtgatga aaaggaccca ggtggcactt 5460
ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt 5520
atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta 5580
tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg 5640
tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac 5700
gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg 5760
aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc 5820
gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg 5880
ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat 5940
gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg 6000
gaggaccgaa ggagctaacc gctttttttc acaacatggg ggatcatgta actcgccttg 6060
atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc 6120
ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt 6180
cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct 6240
cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc 6300
gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca 6360
cgacgggcag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct 6420
cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt 6480
taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 6540
ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca 6600
aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac 6660
caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg 6720
taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag 6780
gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac 6840
cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt 6900
taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg 6960
agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gcattgagaa agcgccacgc 7020
ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc 7080
gcacgaggga gcttccaggg gggaacgcct ggtatcttta tagtcctgtc gggtttcgcc 7140
acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggccgagc ctatggaaaa 7200
acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt 7260
tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg 7320
ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag 7380
agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc 7440
acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttacc 7500
tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcctatg ttgtgtggaa 7560
ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac gccaagctcg 7620
gaattaaccc tcactaaagg gaacaaaagc tgggtaccgg gccccccc 7668
<210> 21
<211> 7656
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 21
tcgagatctc ccgagtttat cattatcaat actgccattt caaagaatac gtaaataatt 60
aatagtagtg attttcctaa ctttatttag tcaaaaaatt ggccttttaa ttctgctgta 120
acccgtacat gcccaaaata gggggcgggt tacacagaat atataacatc ataggtgtct 180
gggtgaacag tttattcctg gcatccacta aatataatgg agcccgcttt ttttaagctg 240
gcatccagaa aaaaaaagaa tcccagcacc aaaatattgt tttcttcacc aaccatcagt 300
tcataggtcc attctcttag cgcaactaca cagaacaggg gcacaaacag gcaaaaaacg 360
ggcacaacct caatggagtg atgcaacctg cttggagtaa atgatgacac aaggcaattg 420
acctacgcat gtatctatct cattttctta caccttctat taccttctgc tctctctgat 480
ttggaaaaag ctgaaaaaaa aggttgaaac cagttccctg aaattattcc cctatttgac 540
taataagtat ataaagacgg taggtattga ttgtaattct gtaaatctat ttcttaaact 600
tcttaaattc tacttttata gttagtcttt tttttagttt taaaacacta agaacttagt 660
ttcgaataaa cacacataaa caaacaaatc tagaatgaga ttcccgtcca tattcacagc 720
cgtcttgttt gcggcatctt ctgcgttagc tgcgccagta aacacgacaa cggaagacga 780
acttgagggg gatttcgatg tggccgtact tcctttctcc gcatcaatcg cggcgaaaga 840
ggagggtgta tctttagaaa agagggcatc tattccatct agtgcatctg tacaattgga 900
ctcctacaat tacgatggtt ccacattttc cggcaagatt tatgtcaaaa acatcgctta 960
ctctaaaaag gttactgttg tgtacgcaga cggttctgac aactggaaca ataacggcaa 1020
cactattgct gcatcatttt caggcccaat ctctggatca aattacgaat actggacatt 1080
ctcagcatca gtgaagggca taaaggagtt ctacatcaaa tacgaagttt caggtaagac 1140
atattacgac aataacaact ctgcaaacta ccaagtctca acttctaaac ctactacaac 1200
tactgcagct acaaccacaa ctacagctcc atcaacttct acaacaaccc gtccatctag 1260
ttcagagcct gccaccttcc ctactggtaa ttctaccatc agctcttgga tcaaaaagca 1320
ggaagatatt tccagattcg ctatgcttag aaacatcaac ccacctggtt ctgccacagg 1380
gtttatcgcc gcatcactct ctaccgctgg tccagattac tactacgcgt ggacaagaga 1440
tgccgctttg acatctaacg ttatcgttta cgaatacaac accacattgt ctgggaataa 1500
gacaattcta aacgtactta aggattacgt cacattcagt gttaagacac agtctacttc 1560
aacagtttgt aattgccttg gtgaaccaaa gttcaatcca gacggcagtg gttacacagg 1620
tgcttggggt agacctcaaa atgatggtcc tgcagaaaga gcgactacat ttgttctgtt 1680
tgccgacagc tacttgactc aaactaagga tgcctcatac gtcactggta cattaaagcc 1740
agcaattttc aaagatctcg attacgttgt taacgtctgg agtaacggat gtttcgattt 1800
atgggaggag gtgaacggag ttcatttcta cacccttatg gttatgagaa aagggctatt 1860
gttgggggct gatttcgcga agagaaacgg tgactcaact agagcctcaa cttactcttc 1920
tactgcttcc acaattgcta acaagatatc aagtttctgg gttagctcaa acaactgggt 1980
gcaagtatcc caatctgtca caggaggtgt aagtaaaaag gggttagacg ttagcaccct 2040
gttagctgcg aatctaggat cagtcgatga tggatttttc actccaggtt ctgaaaagat 2100
attagctaca gctgtggcag tcgaagattc ctttgccagt ctatacccaa tcaacaaaaa 2160
ccttccatca tacttgggga acgctattgg aagataccct gaagatacat acaacggtaa 2220
tggtaactca caaggcaatc cttggtttct ggcggttacc ggctacgcag agttgtacta 2280
tagagcaatt aaggaatgga tttctaatgg aggcgttaca gtgtcctcta tctcattgcc 2340
atttttcaaa aagttcgata gctctgcaac atccggtaaa aagtacaccg taggtacttc 2400
tgacttcaac aatttagcac aaaacattgc tcttgctgca gatcgtttcc tatctactgt 2460
acaactccat gcaccaaaca atggttcatt agcagaggaa tttgatagaa caacaggttt 2520
ttctaccggc gctagagatt taacatggtc ccacgcctca ttgataacag catcctatgc 2580
caaagccggt gctccagctg cataattaat taaacaggcc ccttttcctt tgtcgatatc 2640
atgtaattag ttatgtcacg cttacattca cgccctcctc ccacatccgc tctaaccgaa 2700
aaggaaggag ttagacaacc tgaagtctag gtccctattt atttttttat agttatgtta 2760
gtattaagaa cgttatttat atttcaaatt tttctttttt ttctgtacaa acgcgtgtac 2820
gcatgtaacg ggcagacgcg gccgccaccg cggtggagct ccaattcgcc ctatagtgag 2880
tcgtattaca attcactggc cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt 2940
acccaactta atcgccttgc agcacatccc cccttcgcca gctggcgtaa tagcgaagag 3000
gcccgcaccg atcgcccttc ccaacagttg cgcagcctga atggcgaatg gcgcgacgcg 3060
ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca 3120
cttgccagcg ccctagcgcc cgctcctttc gctttcttcc cttcctttct cgccacgttc 3180
gccggctttc cccgtcaagc tctaaatcgg gggctccctt tagggttccg atttagtgct 3240
ttacggcacc tcgaccccaa aaaacttgat tagggtgatg gttcacgtag tgggccatcg 3300
ccctgataga cggtttttcg ccctttgacg ttggagtcca cgttctttaa tagtggactc 3360
ttgttccaaa ctggaacaac actcaaccct atctcggtct attcttttga tttataaggg 3420
attttgccga tttcggccta ttggttaaaa aatgagctga tttaacaaaa atttaacgcg 3480
aattttaaca aaatattaac gtttacaatt tcctgatgcg gtattttctc cttacgcatc 3540
tgtgcggtat ttcacaccgc agggtaataa ctgatataat taaattgaag ctctaatttg 3600
tgagtttagt atacatgcat ttacttataa tacagttttt tagttttgct ggccgcatct 3660
tctcaaatat gcttcccagc ctgcttttct gtaacgttca ccctctacct tagcatccct 3720
tccctttgca aatagtcctc ttccaacaat aataatgtca gatcctgtag agaccacatc 3780
atccacggtt ctatactgtt gacccaatgc gtctcccttg tcatctaaac ccacaccggg 3840
tgtcataatc aaccaatcgt aaccttcatc tcttccaccc atgtctcttt gagcaataaa 3900
gccgataaca aaatctttgt cgctcttcgc aatgtcaaca gtacccttag tatattctcc 3960
agtagatagg gagcccttgc atgacaattc tgctaacatc aaaaggcctc taggttcctt 4020
tgttacttct tctgccgcct gcttcaaacc gctaacaata cctgggccca ccacaccgtg 4080
tgcattcgta atgtctgccc attctgctat tctgtataca cccgcagagt actgcaattt 4140
gactgtatta ccaatgtcag caaattttct gtcttcgaag agtaaaaaat tgtacttggc 4200
ggataatgcc tttagcggct taactgtgcc ctccatggaa aaatcagtca agatatccac 4260
atgtgttttt agtaaacaaa ttttgggacc taatgcttca actaactcca gtaattcctt 4320
ggtggtacga acatccaatg aagcacacaa gtttgtttgc ttttcgtgca tgatattaaa 4380
tagcttggca gcaacaggac taggatgagt agcagcacgt tccttatatg tagctttcga 4440
catgatttat cttcgtttcc tgcaggtttt tgttctgtgc agttgggtta agaatactgg 4500
gcaatttcat gtttcttcaa cactacatat gcgtatatat accaatctaa gtctgtgctc 4560
cttccttcgt tcttccttct gttcggagat taccgaatca aaaaaatttc aaagaaaccg 4620
aaatcaaaaa aaagaataaa aaaaaaatga tgaattgaat tgaaaagcgt ggtgcactct 4680
cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc 4740
tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt 4800
ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa 4860
gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagga 4920
cggatcgctt gcctgtaact tacacgcgcc tcgtatcttt taatgatgga ataatttggg 4980
aatttactct gtgtttattt atttttatgt tttgtatttg gattttagaa agtaaataaa 5040
gaaggtagaa gagttacgga atgaagaaaa aaaaataaac aaaggtttaa aaaatttcaa 5100
caaaaagcgt actttacata tatatttatt agacaagaaa agcagattaa atagatatac 5160
attcgattaa cgataagtaa aatgtaaaat cacaggattt tcgtgtgtgg tcttctacac 5220
agacaagatg aaacaattcg gcattaatac ctgagagcag gaagagcaag ataaaaggta 5280
gtatttgttg gcgatccccc tagagtcttt tacatcttcg gaaaacaaaa actatttttt 5340
ctttaatttc tttttttact ttctattttt aatttatata tttatattaa aaaatttaaa 5400
ttataattat ttttatagca cgtgatgaaa aggacccagg tggcactttt cggggaaatg 5460
tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga 5520
gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac 5580
atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc 5640
cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca 5700
tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc 5760
caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg 5820
ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac 5880
cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca 5940
taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg 6000
agctaaccgc tttttttcac aacatggggg atcatgtaac tcgccttgat cgttgggaac 6060
cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg 6120
caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat 6180
taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg 6240
ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg 6300
cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acgggcagtc 6360
aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 6420
attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt 6480
tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 6540
aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt 6600
gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 6660
cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca 6720
gcagagcgca gataccaaat actgtccttc tagtgtagcc gtagttaggc caccacttca 6780
agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg 6840
ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg 6900
cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct 6960
acaccgaact gagataccta cagcgtgagc attgagaaag cgccacgctt cccgaaggga 7020
gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc 7080
ttccaggggg gaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg 7140
agcgtcgatt tttgtgatgc tcgtcagggg ggccgagcct atggaaaaac gccagcaacg 7200
cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt 7260
tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc 7320
gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac 7380
gcaaaccgcc tctccccgcg cgttggccga ttcattaatg cagctggcac gacaggtttc 7440
ccgactggaa agcgggcagt gagcgcaacg caattaatgt gagttacctc actcattagg 7500
caccccaggc tttacacttt atgcttccgg ctcctatgtt gtgtggaatt gtgagcggat 7560
aacaatttca cacaggaaac agctatgacc atgattacgc caagctcgga attaaccctc 7620
actaaaggga acaaaagctg ggtaccgggc cccccc 7656
<210> 22
<211> 7542
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 22
tcgagatctc ccgagtttat cattatcaat actgccattt caaagaatac gtaaataatt 60
aatagtagtg attttcctaa ctttatttag tcaaaaaatt ggccttttaa ttctgctgta 120
acccgtacat gcccaaaata gggggcgggt tacacagaat atataacatc ataggtgtct 180
gggtgaacag tttattcctg gcatccacta aatataatgg agcccgcttt ttttaagctg 240
gcatccagaa aaaaaaagaa tcccagcacc aaaatattgt tttcttcacc aaccatcagt 300
tcataggtcc attctcttag cgcaactaca cagaacaggg gcacaaacag gcaaaaaacg 360
ggcacaacct caatggagtg atgcaacctg cttggagtaa atgatgacac aaggcaattg 420
acctacgcat gtatctatct cattttctta caccttctat taccttctgc tctctctgat 480
ttggaaaaag ctgaaaaaaa aggttgaaac cagttccctg aaattattcc cctatttgac 540
taataagtat ataaagacgg taggtattga ttgtaattct gtaaatctat ttcttaaact 600
tcttaaattc tacttttata gttagtcttt tttttagttt taaaacacta agaacttagt 660
ttcgaataaa cacacataaa caaacaaatc tagaatgagg tttccctcca tctttactgc 720
cgttttgttc gcggcttcca gcgcgttggc tgcatctatt ccatctagtg catctgtaca 780
attggactcc tacaattacg atggttccac attttccggc aagatttatg tcaaaaacat 840
cgcttactct aaaaaggtta ctgttgtgta cgcagacggt tctgacaact ggaacaataa 900
cggcaacact attgctgcat cattttcagg cccaatctct ggatcaaatt acgaatactg 960
gacattctca gcatcagtga agggcataaa ggagttctac atcaaatacg aagtttcagg 1020
taagacatat tacgacaata acaactctgc aaactaccaa gtctcaactt ctaaacctac 1080
tacaactact gcagctacaa ccacaactac agctccatca acttctacaa caacccgtcc 1140
atctagttca gagcctgcca ccttccctac tggtaattct accatcagct cttggatcaa 1200
aaagcaggaa gatatttcca gattcgctat gcttagaaac atcaacccac ctggttctgc 1260
cacagggttt atcgccgcat cactctctac cgctggtcca gattactact acgcgtggac 1320
aagagatgcc gctttgacat ctaacgttat cgtttacgaa tacaacacca cattgtctgg 1380
gaataagaca attctaaacg tacttaagga ttacgtcaca ttcagtgtta agacacagtc 1440
tacttcaaca gtttgtaatt gccttggtga accaaagttc aatccagacg gcagtggtta 1500
cacaggtgct tggggtagac ctcaaaatga tggtcctgca gaaagagcga ctacatttgt 1560
tctgtttgcc gacagctact tgactcaaac taaggatgcc tcatacgtca ctggtacatt 1620
aaagccagca attttcaaag atctcgatta cgttgttaac gtctggagta acggatgttt 1680
cgatttatgg gaggaggtga acggagttca tttctacacc cttatggtta tgagaaaagg 1740
gctattgttg ggggctgatt tcgcgaagag aaacggtgac tcaactagag cctcaactta 1800
ctcttctact gcttccacaa ttgctaacaa gatatcaagt ttctgggtta gctcaaacaa 1860
ctgggtgcaa gtatcccaat ctgtcacagg aggtgtaagt aaaaaggggt tagacgttag 1920
caccctgtta gctgcgaatc taggatcagt cgatgatgga tttttcactc caggttctga 1980
aaagatatta gctacagctg tggcagtcga agattccttt gccagtctat acccaatcaa 2040
caaaaacctt ccatcatact tggggaacgc tattggaaga taccctgaag atacatacaa 2100
cggtaatggt aactcacaag gcaatccttg gtttctggcg gttaccggct acgcagagtt 2160
gtactataga gcaattaagg aatggatttc taatggaggc gttacagtgt cctctatctc 2220
attgccattt ttcaaaaagt tcgatagctc tgcaacatcc ggtaaaaagt acaccgtagg 2280
tacttctgac ttcaacaatt tagcacaaaa cattgctctt gctgcagatc gtttcctatc 2340
tactgtacaa ctccatgcac caaacaatgg ttcattagca gaggaatttg atagaacaac 2400
aggtttttct accggcgcta gagatttaac atggtcccac gcctcattga taacagcatc 2460
ctatgccaaa gccggtgctc cagctgcata attaattaaa caggcccctt ttcctttgtc 2520
gatatcatgt aattagttat gtcacgctta cattcacgcc ctcctcccac atccgctcta 2580
accgaaaagg aaggagttag acaacctgaa gtctaggtcc ctatttattt ttttatagtt 2640
atgttagtat taagaacgtt atttatattt caaatttttc ttttttttct gtacaaacgc 2700
gtgtacgcat gtaacgggca gacgcggccg ccaccgcggt ggagctccaa ttcgccctat 2760
agtgagtcgt attacaattc actggccgtc gttttacaac gtcgtgactg ggaaaaccct 2820
ggcgttaccc aacttaatcg ccttgcagca catcccccct tcgccagctg gcgtaatagc 2880
gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc 2940
gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc 3000
gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc 3060
acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt 3120
agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg 3180
ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt 3240
ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta 3300
taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt 3360
aacgcgaatt ttaacaaaat attaacgttt acaatttcct gatgcggtat tttctcctta 3420
cgcatctgtg cggtatttca caccgcaggg taataactga tataattaaa ttgaagctct 3480
aatttgtgag tttagtatac atgcatttac ttataataca gttttttagt tttgctggcc 3540
gcatcttctc aaatatgctt cccagcctgc ttttctgtaa cgttcaccct ctaccttagc 3600
atcccttccc tttgcaaata gtcctcttcc aacaataata atgtcagatc ctgtagagac 3660
cacatcatcc acggttctat actgttgacc caatgcgtct cccttgtcat ctaaacccac 3720
accgggtgtc ataatcaacc aatcgtaacc ttcatctctt ccacccatgt ctctttgagc 3780
aataaagccg ataacaaaat ctttgtcgct cttcgcaatg tcaacagtac ccttagtata 3840
ttctccagta gatagggagc ccttgcatga caattctgct aacatcaaaa ggcctctagg 3900
ttcctttgtt acttcttctg ccgcctgctt caaaccgcta acaatacctg ggcccaccac 3960
accgtgtgca ttcgtaatgt ctgcccattc tgctattctg tatacacccg cagagtactg 4020
caatttgact gtattaccaa tgtcagcaaa ttttctgtct tcgaagagta aaaaattgta 4080
cttggcggat aatgccttta gcggcttaac tgtgccctcc atggaaaaat cagtcaagat 4140
atccacatgt gtttttagta aacaaatttt gggacctaat gcttcaacta actccagtaa 4200
ttccttggtg gtacgaacat ccaatgaagc acacaagttt gtttgctttt cgtgcatgat 4260
attaaatagc ttggcagcaa caggactagg atgagtagca gcacgttcct tatatgtagc 4320
tttcgacatg atttatcttc gtttcctgca ggtttttgtt ctgtgcagtt gggttaagaa 4380
tactgggcaa tttcatgttt cttcaacact acatatgcgt atatatacca atctaagtct 4440
gtgctccttc cttcgttctt ccttctgttc ggagattacc gaatcaaaaa aatttcaaag 4500
aaaccgaaat caaaaaaaag aataaaaaaa aaatgatgaa ttgaattgaa aagcgtggtg 4560
cactctcagt acaatctgct ctgatgccgc atagttaagc cagccccgac acccgccaac 4620
acccgctgac gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt 4680
gaccgtctcc gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag 4740
acgaaagggc ctcgtgatac gcctattttt ataggttaat gtcatgataa taatggtttc 4800
ttaggacgga tcgcttgcct gtaacttaca cgcgcctcgt atcttttaat gatggaataa 4860
tttgggaatt tactctgtgt ttatttattt ttatgttttg tatttggatt ttagaaagta 4920
aataaagaag gtagaagagt tacggaatga agaaaaaaaa ataaacaaag gtttaaaaaa 4980
tttcaacaaa aagcgtactt tacatatata tttattagac aagaaaagca gattaaatag 5040
atatacattc gattaacgat aagtaaaatg taaaatcaca ggattttcgt gtgtggtctt 5100
ctacacagac aagatgaaac aattcggcat taatacctga gagcaggaag agcaagataa 5160
aaggtagtat ttgttggcga tccccctaga gtcttttaca tcttcggaaa acaaaaacta 5220
ttttttcttt aatttctttt tttactttct atttttaatt tatatattta tattaaaaaa 5280
tttaaattat aattattttt atagcacgtg atgaaaagga cccaggtggc acttttcggg 5340
gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc 5400
tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta 5460
ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg 5520
ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg 5580
gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac 5640
gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg 5700
acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt 5760
actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg 5820
ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac 5880
cgaaggagct aaccgctttt tttcacaaca tgggggatca tgtaactcgc cttgatcgtt 5940
gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag 6000
caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc 6060
aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc 6120
ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta 6180
tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg 6240
gcagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga 6300
ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac 6360
ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa 6420
tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat 6480
cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc 6540
taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg 6600
gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc 6660
acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg 6720
ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg 6780
ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa 6840
cgacctacac cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg 6900
aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga 6960
gggagcttcc aggggggaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct 7020
gacttgagcg tcgatttttg tgatgctcgt caggggggcc gagcctatgg aaaaacgcca 7080
gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc 7140
ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg 7200
ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc 7260
caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca 7320
ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tacctcactc 7380
attaggcacc ccaggcttta cactttatgc ttccggctcc tatgttgtgt ggaattgtga 7440
gcggataaca atttcacaca ggaaacagct atgaccatga ttacgccaag ctcggaatta 7500
accctcacta aagggaacaa aagctgggta ccgggccccc cc 7542
<210> 23
<211> 7545
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 23
tcgagatctc ccgagtttat cattatcaat actgccattt caaagaatac gtaaataatt 60
aatagtagtg attttcctaa ctttatttag tcaaaaaatt ggccttttaa ttctgctgta 120
acccgtacat gcccaaaata gggggcgggt tacacagaat atataacatc ataggtgtct 180
gggtgaacag tttattcctg gcatccacta aatataatgg agcccgcttt ttttaagctg 240
gcatccagaa aaaaaaagaa tcccagcacc aaaatattgt tttcttcacc aaccatcagt 300
tcataggtcc attctcttag cgcaactaca cagaacaggg gcacaaacag gcaaaaaacg 360
ggcacaacct caatggagtg atgcaacctg cttggagtaa atgatgacac aaggcaattg 420
acctacgcat gtatctatct cattttctta caccttctat taccttctgc tctctctgat 480
ttggaaaaag ctgaaaaaaa aggttgaaac cagttccctg aaattattcc cctatttgac 540
taataagtat ataaagacgg taggtattga ttgtaattct gtaaatctat ttcttaaact 600
tcttaaattc tacttttata gttagtcttt tttttagttt taaaacacta agaacttagt 660
ttcgaataaa cacacataaa caaacaaatc tagaatggtg gcctggtgga gtctgttcct 720
ttacgggtta caagtggctg cgcccgctct ggcagcatct attccatcta gtgcatctgt 780
acaattggac tcctacaatt acgatggttc cacattttcc ggcaagattt atgtcaaaaa 840
catcgcttac tctaaaaagg ttactgttgt gtacgcagac ggttctgaca actggaacaa 900
taacggcaac actattgctg catcattttc aggcccaatc tctggatcaa attacgaata 960
ctggacattc tcagcatcag tgaagggcat aaaggagttc tacatcaaat acgaagtttc 1020
aggtaagaca tattacgaca ataacaactc tgcaaactac caagtctcaa cttctaaacc 1080
tactacaact actgcagcta caaccacaac tacagctcca tcaacttcta caacaacccg 1140
tccatctagt tcagagcctg ccaccttccc tactggtaat tctaccatca gctcttggat 1200
caaaaagcag gaagatattt ccagattcgc tatgcttaga aacatcaacc cacctggttc 1260
tgccacaggg tttatcgccg catcactctc taccgctggt ccagattact actacgcgtg 1320
gacaagagat gccgctttga catctaacgt tatcgtttac gaatacaaca ccacattgtc 1380
tgggaataag acaattctaa acgtacttaa ggattacgtc acattcagtg ttaagacaca 1440
gtctacttca acagtttgta attgccttgg tgaaccaaag ttcaatccag acggcagtgg 1500
ttacacaggt gcttggggta gacctcaaaa tgatggtcct gcagaaagag cgactacatt 1560
tgttctgttt gccgacagct acttgactca aactaaggat gcctcatacg tcactggtac 1620
attaaagcca gcaattttca aagatctcga ttacgttgtt aacgtctgga gtaacggatg 1680
tttcgattta tgggaggagg tgaacggagt tcatttctac acccttatgg ttatgagaaa 1740
agggctattg ttgggggctg atttcgcgaa gagaaacggt gactcaacta gagcctcaac 1800
ttactcttct actgcttcca caattgctaa caagatatca agtttctggg ttagctcaaa 1860
caactgggtg caagtatccc aatctgtcac aggaggtgta agtaaaaagg ggttagacgt 1920
tagcaccctg ttagctgcga atctaggatc agtcgatgat ggatttttca ctccaggttc 1980
tgaaaagata ttagctacag ctgtggcagt cgaagattcc tttgccagtc tatacccaat 2040
caacaaaaac cttccatcat acttggggaa cgctattgga agataccctg aagatacata 2100
caacggtaat ggtaactcac aaggcaatcc ttggtttctg gcggttaccg gctacgcaga 2160
gttgtactat agagcaatta aggaatggat ttctaatgga ggcgttacag tgtcctctat 2220
ctcattgcca tttttcaaaa agttcgatag ctctgcaaca tccggtaaaa agtacaccgt 2280
aggtacttct gacttcaaca atttagcaca aaacattgct cttgctgcag atcgtttcct 2340
atctactgta caactccatg caccaaacaa tggttcatta gcagaggaat ttgatagaac 2400
aacaggtttt tctaccggcg ctagagattt aacatggtcc cacgcctcat tgataacagc 2460
atcctatgcc aaagccggtg ctccagctgc ataattaatt aaacaggccc cttttccttt 2520
gtcgatatca tgtaattagt tatgtcacgc ttacattcac gccctcctcc cacatccgct 2580
ctaaccgaaa aggaaggagt tagacaacct gaagtctagg tccctattta tttttttata 2640
gttatgttag tattaagaac gttatttata tttcaaattt ttcttttttt tctgtacaaa 2700
cgcgtgtacg catgtaacgg gcagacgcgg ccgccaccgc ggtggagctc caattcgccc 2760
tatagtgagt cgtattacaa ttcactggcc gtcgttttac aacgtcgtga ctgggaaaac 2820
cctggcgtta cccaacttaa tcgccttgca gcacatcccc ccttcgccag ctggcgtaat 2880
agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 2940
cgcgacgcgc cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg 3000
accgctacac ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc 3060
gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga 3120
tttagtgctt tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt 3180
gggccatcgc cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat 3240
agtggactct tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat 3300
ttataaggga ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa 3360
tttaacgcga attttaacaa aatattaacg tttacaattt cctgatgcgg tattttctcc 3420
ttacgcatct gtgcggtatt tcacaccgca gggtaataac tgatataatt aaattgaagc 3480
tctaatttgt gagtttagta tacatgcatt tacttataat acagtttttt agttttgctg 3540
gccgcatctt ctcaaatatg cttcccagcc tgcttttctg taacgttcac cctctacctt 3600
agcatccctt ccctttgcaa atagtcctct tccaacaata ataatgtcag atcctgtaga 3660
gaccacatca tccacggttc tatactgttg acccaatgcg tctcccttgt catctaaacc 3720
cacaccgggt gtcataatca accaatcgta accttcatct cttccaccca tgtctctttg 3780
agcaataaag ccgataacaa aatctttgtc gctcttcgca atgtcaacag tacccttagt 3840
atattctcca gtagataggg agcccttgca tgacaattct gctaacatca aaaggcctct 3900
aggttccttt gttacttctt ctgccgcctg cttcaaaccg ctaacaatac ctgggcccac 3960
cacaccgtgt gcattcgtaa tgtctgccca ttctgctatt ctgtatacac ccgcagagta 4020
ctgcaatttg actgtattac caatgtcagc aaattttctg tcttcgaaga gtaaaaaatt 4080
gtacttggcg gataatgcct ttagcggctt aactgtgccc tccatggaaa aatcagtcaa 4140
gatatccaca tgtgttttta gtaaacaaat tttgggacct aatgcttcaa ctaactccag 4200
taattccttg gtggtacgaa catccaatga agcacacaag tttgtttgct tttcgtgcat 4260
gatattaaat agcttggcag caacaggact aggatgagta gcagcacgtt ccttatatgt 4320
agctttcgac atgatttatc ttcgtttcct gcaggttttt gttctgtgca gttgggttaa 4380
gaatactggg caatttcatg tttcttcaac actacatatg cgtatatata ccaatctaag 4440
tctgtgctcc ttccttcgtt cttccttctg ttcggagatt accgaatcaa aaaaatttca 4500
aagaaaccga aatcaaaaaa aagaataaaa aaaaaatgat gaattgaatt gaaaagcgtg 4560
gtgcactctc agtacaatct gctctgatgc cgcatagtta agccagcccc gacacccgcc 4620
aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc 4680
tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc 4740
gagacgaaag ggcctcgtga tacgcctatt tttataggtt aatgtcatga taataatggt 4800
ttcttaggac ggatcgcttg cctgtaactt acacgcgcct cgtatctttt aatgatggaa 4860
taatttggga atttactctg tgtttattta tttttatgtt ttgtatttgg attttagaaa 4920
gtaaataaag aaggtagaag agttacggaa tgaagaaaaa aaaataaaca aaggtttaaa 4980
aaatttcaac aaaaagcgta ctttacatat atatttatta gacaagaaaa gcagattaaa 5040
tagatataca ttcgattaac gataagtaaa atgtaaaatc acaggatttt cgtgtgtggt 5100
cttctacaca gacaagatga aacaattcgg cattaatacc tgagagcagg aagagcaaga 5160
taaaaggtag tatttgttgg cgatccccct agagtctttt acatcttcgg aaaacaaaaa 5220
ctattttttc tttaatttct ttttttactt tctattttta atttatatat ttatattaaa 5280
aaatttaaat tataattatt tttatagcac gtgatgaaaa ggacccaggt ggcacttttc 5340
ggggaaatgt gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc 5400
cgctcatgag acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga 5460
gtattcaaca tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt 5520
ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag 5580
tgggttacat cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag 5640
aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta 5700
ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg 5760
agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca 5820
gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag 5880
gaccgaagga gctaaccgct ttttttcaca acatggggga tcatgtaact cgccttgatc 5940
gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg 6000
tagcaatggc aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc 6060
ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg 6120
cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg 6180
gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga 6240
cgggcagtca ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac 6300
tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa 6360
aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca 6420
aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag 6480
gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac 6540
cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa 6600
ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc 6660
accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag 6720
tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac 6780
cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc 6840
gaacgaccta caccgaactg agatacctac agcgtgagca ttgagaaagc gccacgcttc 6900
ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca 6960
cgagggagct tccagggggg aacgcctggt atctttatag tcctgtcggg tttcgccacc 7020
tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gccgagccta tggaaaaacg 7080
ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct 7140
ttcctgcgtt atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata 7200
ccgctcgccg cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc 7260
gcccaatacg caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg 7320
acaggtttcc cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttacctca 7380
ctcattaggc accccaggct ttacacttta tgcttccggc tcctatgttg tgtggaattg 7440
tgagcggata acaatttcac acaggaaaca gctatgacca tgattacgcc aagctcggaa 7500
ttaaccctca ctaaagggaa caaaagctgg gtaccgggcc ccccc 7545
<210> 24
<211> 7539
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 24
tcgagatctc ccgagtttat cattatcaat actgccattt caaagaatac gtaaataatt 60
aatagtagtg attttcctaa ctttatttag tcaaaaaatt ggccttttaa ttctgctgta 120
acccgtacat gcccaaaata gggggcgggt tacacagaat atataacatc ataggtgtct 180
gggtgaacag tttattcctg gcatccacta aatataatgg agcccgcttt ttttaagctg 240
gcatccagaa aaaaaaagaa tcccagcacc aaaatattgt tttcttcacc aaccatcagt 300
tcataggtcc attctcttag cgcaactaca cagaacaggg gcacaaacag gcaaaaaacg 360
ggcacaacct caatggagtg atgcaacctg cttggagtaa atgatgacac aaggcaattg 420
acctacgcat gtatctatct cattttctta caccttctat taccttctgc tctctctgat 480
ttggaaaaag ctgaaaaaaa aggttgaaac cagttccctg aaattattcc cctatttgac 540
taataagtat ataaagacgg taggtattga ttgtaattct gtaaatctat ttcttaaact 600
tcttaaattc tacttttata gttagtcttt tttttagttt taaaacacta agaacttagt 660
ttcgaataaa cacacataaa caaacaaatc tagaatgagc tttaggagtc ttctggccct 720
tagtgggctg gtctgttctg gcttggcagc atctattcca tctagtgcat ctgtacaatt 780
ggactcctac aattacgatg gttccacatt ttccggcaag atttatgtca aaaacatcgc 840
ttactctaaa aaggttactg ttgtgtacgc agacggttct gacaactgga acaataacgg 900
caacactatt gctgcatcat tttcaggccc aatctctgga tcaaattacg aatactggac 960
attctcagca tcagtgaagg gcataaagga gttctacatc aaatacgaag tttcaggtaa 1020
gacatattac gacaataaca actctgcaaa ctaccaagtc tcaacttcta aacctactac 1080
aactactgca gctacaacca caactacagc tccatcaact tctacaacaa cccgtccatc 1140
tagttcagag cctgccacct tccctactgg taattctacc atcagctctt ggatcaaaaa 1200
gcaggaagat atttccagat tcgctatgct tagaaacatc aacccacctg gttctgccac 1260
agggtttatc gccgcatcac tctctaccgc tggtccagat tactactacg cgtggacaag 1320
agatgccgct ttgacatcta acgttatcgt ttacgaatac aacaccacat tgtctgggaa 1380
taagacaatt ctaaacgtac ttaaggatta cgtcacattc agtgttaaga cacagtctac 1440
ttcaacagtt tgtaattgcc ttggtgaacc aaagttcaat ccagacggca gtggttacac 1500
aggtgcttgg ggtagacctc aaaatgatgg tcctgcagaa agagcgacta catttgttct 1560
gtttgccgac agctacttga ctcaaactaa ggatgcctca tacgtcactg gtacattaaa 1620
gccagcaatt ttcaaagatc tcgattacgt tgttaacgtc tggagtaacg gatgtttcga 1680
tttatgggag gaggtgaacg gagttcattt ctacaccctt atggttatga gaaaagggct 1740
attgttgggg gctgatttcg cgaagagaaa cggtgactca actagagcct caacttactc 1800
ttctactgct tccacaattg ctaacaagat atcaagtttc tgggttagct caaacaactg 1860
ggtgcaagta tcccaatctg tcacaggagg tgtaagtaaa aaggggttag acgttagcac 1920
cctgttagct gcgaatctag gatcagtcga tgatggattt ttcactccag gttctgaaaa 1980
gatattagct acagctgtgg cagtcgaaga ttcctttgcc agtctatacc caatcaacaa 2040
aaaccttcca tcatacttgg ggaacgctat tggaagatac cctgaagata catacaacgg 2100
taatggtaac tcacaaggca atccttggtt tctggcggtt accggctacg cagagttgta 2160
ctatagagca attaaggaat ggatttctaa tggaggcgtt acagtgtcct ctatctcatt 2220
gccatttttc aaaaagttcg atagctctgc aacatccggt aaaaagtaca ccgtaggtac 2280
ttctgacttc aacaatttag cacaaaacat tgctcttgct gcagatcgtt tcctatctac 2340
tgtacaactc catgcaccaa acaatggttc attagcagag gaatttgata gaacaacagg 2400
tttttctacc ggcgctagag atttaacatg gtcccacgcc tcattgataa cagcatccta 2460
tgccaaagcc ggtgctccag ctgcataatt aattaaacag gccccttttc ctttgtcgat 2520
atcatgtaat tagttatgtc acgcttacat tcacgccctc ctcccacatc cgctctaacc 2580
gaaaaggaag gagttagaca acctgaagtc taggtcccta tttatttttt tatagttatg 2640
ttagtattaa gaacgttatt tatatttcaa atttttcttt tttttctgta caaacgcgtg 2700
tacgcatgta acgggcagac gcggccgcca ccgcggtgga gctccaattc gccctatagt 2760
gagtcgtatt acaattcact ggccgtcgtt ttacaacgtc gtgactggga aaaccctggc 2820
gttacccaac ttaatcgcct tgcagcacat ccccccttcg ccagctggcg taatagcgaa 2880
gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga atggcgcgac 2940
gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct 3000
acacttgcca gcgccctagc gcccgctcct ttcgctttct tcccttcctt tctcgccacg 3060
ttcgccggct ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt 3120
gctttacggc acctcgaccc caaaaaactt gattagggtg atggttcacg tagtgggcca 3180
tcgccctgat agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga 3240
ctcttgttcc aaactggaac aacactcaac cctatctcgg tctattcttt tgatttataa 3300
gggattttgc cgatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac 3360
gcgaatttta acaaaatatt aacgtttaca atttcctgat gcggtatttt ctccttacgc 3420
atctgtgcgg tatttcacac cgcagggtaa taactgatat aattaaattg aagctctaat 3480
ttgtgagttt agtatacatg catttactta taatacagtt ttttagtttt gctggccgca 3540
tcttctcaaa tatgcttccc agcctgcttt tctgtaacgt tcaccctcta ccttagcatc 3600
ccttcccttt gcaaatagtc ctcttccaac aataataatg tcagatcctg tagagaccac 3660
atcatccacg gttctatact gttgacccaa tgcgtctccc ttgtcatcta aacccacacc 3720
gggtgtcata atcaaccaat cgtaaccttc atctcttcca cccatgtctc tttgagcaat 3780
aaagccgata acaaaatctt tgtcgctctt cgcaatgtca acagtaccct tagtatattc 3840
tccagtagat agggagccct tgcatgacaa ttctgctaac atcaaaaggc ctctaggttc 3900
ctttgttact tcttctgccg cctgcttcaa accgctaaca atacctgggc ccaccacacc 3960
gtgtgcattc gtaatgtctg cccattctgc tattctgtat acacccgcag agtactgcaa 4020
tttgactgta ttaccaatgt cagcaaattt tctgtcttcg aagagtaaaa aattgtactt 4080
ggcggataat gcctttagcg gcttaactgt gccctccatg gaaaaatcag tcaagatatc 4140
cacatgtgtt tttagtaaac aaattttggg acctaatgct tcaactaact ccagtaattc 4200
cttggtggta cgaacatcca atgaagcaca caagtttgtt tgcttttcgt gcatgatatt 4260
aaatagcttg gcagcaacag gactaggatg agtagcagca cgttccttat atgtagcttt 4320
cgacatgatt tatcttcgtt tcctgcaggt ttttgttctg tgcagttggg ttaagaatac 4380
tgggcaattt catgtttctt caacactaca tatgcgtata tataccaatc taagtctgtg 4440
ctccttcctt cgttcttcct tctgttcgga gattaccgaa tcaaaaaaat ttcaaagaaa 4500
ccgaaatcaa aaaaaagaat aaaaaaaaaa tgatgaattg aattgaaaag cgtggtgcac 4560
tctcagtaca atctgctctg atgccgcata gttaagccag ccccgacacc cgccaacacc 4620
cgctgacgcg ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac 4680
cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcgagacg 4740
aaagggcctc gtgatacgcc tatttttata ggttaatgtc atgataataa tggtttctta 4800
ggacggatcg cttgcctgta acttacacgc gcctcgtatc ttttaatgat ggaataattt 4860
gggaatttac tctgtgttta tttattttta tgttttgtat ttggatttta gaaagtaaat 4920
aaagaaggta gaagagttac ggaatgaaga aaaaaaaata aacaaaggtt taaaaaattt 4980
caacaaaaag cgtactttac atatatattt attagacaag aaaagcagat taaatagata 5040
tacattcgat taacgataag taaaatgtaa aatcacagga ttttcgtgtg tggtcttcta 5100
cacagacaag atgaaacaat tcggcattaa tacctgagag caggaagagc aagataaaag 5160
gtagtatttg ttggcgatcc ccctagagtc ttttacatct tcggaaaaca aaaactattt 5220
tttctttaat ttcttttttt actttctatt tttaatttat atatttatat taaaaaattt 5280
aaattataat tatttttata gcacgtgatg aaaaggaccc aggtggcact tttcggggaa 5340
atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca 5400
tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc 5460
aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc 5520
acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt 5580
acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt 5640
ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg 5700
ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact 5760
caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg 5820
ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga 5880
aggagctaac cgcttttttt cacaacatgg gggatcatgt aactcgcctt gatcgttggg 5940
aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa 6000
tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac 6060
aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc 6120
cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca 6180
ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggca 6240
gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta 6300
agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc 6360
atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc 6420
cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt 6480
cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac 6540
cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct 6600
tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact 6660
tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg 6720
ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata 6780
aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga 6840
cctacaccga actgagatac ctacagcgtg agcattgaga aagcgccacg cttcccgaag 6900
ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg 6960
agcttccagg ggggaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac 7020
ttgagcgtcg atttttgtga tgctcgtcag gggggccgag cctatggaaa aacgccagca 7080
acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg 7140
cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc 7200
gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa 7260
tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt 7320
ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttac ctcactcatt 7380
aggcacccca ggctttacac tttatgcttc cggctcctat gttgtgtgga attgtgagcg 7440
gataacaatt tcacacagga aacagctatg accatgatta cgccaagctc ggaattaacc 7500
ctcactaaag ggaacaaaag ctgggtaccg ggccccccc 7539
<210> 25
<211> 7533
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 25
tcgagatctc ccgagtttat cattatcaat actgccattt caaagaatac gtaaataatt 60
aatagtagtg attttcctaa ctttatttag tcaaaaaatt ggccttttaa ttctgctgta 120
acccgtacat gcccaaaata gggggcgggt tacacagaat atataacatc ataggtgtct 180
gggtgaacag tttattcctg gcatccacta aatataatgg agcccgcttt ttttaagctg 240
gcatccagaa aaaaaaagaa tcccagcacc aaaatattgt tttcttcacc aaccatcagt 300
tcataggtcc attctcttag cgcaactaca cagaacaggg gcacaaacag gcaaaaaacg 360
ggcacaacct caatggagtg atgcaacctg cttggagtaa atgatgacac aaggcaattg 420
acctacgcat gtatctatct cattttctta caccttctat taccttctgc tctctctgat 480
ttggaaaaag ctgaaaaaaa aggttgaaac cagttccctg aaattattcc cctatttgac 540
taataagtat ataaagacgg taggtattga ttgtaattct gtaaatctat ttcttaaact 600
tcttaaattc tacttttata gttagtcttt tttttagttt taaaacacta agaacttagt 660
ttcgaataaa cacacataaa caaacaaatc tagaatgaag ttggcatatt cattattgct 720
accgctggca ggtgtaagtg cagcatctat tccatctagt gcatctgtac aattggactc 780
ctacaattac gatggttcca cattttccgg caagatttat gtcaaaaaca tcgcttactc 840
taaaaaggtt actgttgtgt acgcagacgg ttctgacaac tggaacaata acggcaacac 900
tattgctgca tcattttcag gcccaatctc tggatcaaat tacgaatact ggacattctc 960
agcatcagtg aagggcataa aggagttcta catcaaatac gaagtttcag gtaagacata 1020
ttacgacaat aacaactctg caaactacca agtctcaact tctaaaccta ctacaactac 1080
tgcagctaca accacaacta cagctccatc aacttctaca acaacccgtc catctagttc 1140
agagcctgcc accttcccta ctggtaattc taccatcagc tcttggatca aaaagcagga 1200
agatatttcc agattcgcta tgcttagaaa catcaaccca cctggttctg ccacagggtt 1260
tatcgccgca tcactctcta ccgctggtcc agattactac tacgcgtgga caagagatgc 1320
cgctttgaca tctaacgtta tcgtttacga atacaacacc acattgtctg ggaataagac 1380
aattctaaac gtacttaagg attacgtcac attcagtgtt aagacacagt ctacttcaac 1440
agtttgtaat tgccttggtg aaccaaagtt caatccagac ggcagtggtt acacaggtgc 1500
ttggggtaga cctcaaaatg atggtcctgc agaaagagcg actacatttg ttctgtttgc 1560
cgacagctac ttgactcaaa ctaaggatgc ctcatacgtc actggtacat taaagccagc 1620
aattttcaaa gatctcgatt acgttgttaa cgtctggagt aacggatgtt tcgatttatg 1680
ggaggaggtg aacggagttc atttctacac ccttatggtt atgagaaaag ggctattgtt 1740
gggggctgat ttcgcgaaga gaaacggtga ctcaactaga gcctcaactt actcttctac 1800
tgcttccaca attgctaaca agatatcaag tttctgggtt agctcaaaca actgggtgca 1860
agtatcccaa tctgtcacag gaggtgtaag taaaaagggg ttagacgtta gcaccctgtt 1920
agctgcgaat ctaggatcag tcgatgatgg atttttcact ccaggttctg aaaagatatt 1980
agctacagct gtggcagtcg aagattcctt tgccagtcta tacccaatca acaaaaacct 2040
tccatcatac ttggggaacg ctattggaag ataccctgaa gatacataca acggtaatgg 2100
taactcacaa ggcaatcctt ggtttctggc ggttaccggc tacgcagagt tgtactatag 2160
agcaattaag gaatggattt ctaatggagg cgttacagtg tcctctatct cattgccatt 2220
tttcaaaaag ttcgatagct ctgcaacatc cggtaaaaag tacaccgtag gtacttctga 2280
cttcaacaat ttagcacaaa acattgctct tgctgcagat cgtttcctat ctactgtaca 2340
actccatgca ccaaacaatg gttcattagc agaggaattt gatagaacaa caggtttttc 2400
taccggcgct agagatttaa catggtccca cgcctcattg ataacagcat cctatgccaa 2460
agccggtgct ccagctgcat aattaattaa acaggcccct tttcctttgt cgatatcatg 2520
taattagtta tgtcacgctt acattcacgc cctcctccca catccgctct aaccgaaaag 2580
gaaggagtta gacaacctga agtctaggtc cctatttatt tttttatagt tatgttagta 2640
ttaagaacgt tatttatatt tcaaattttt cttttttttc tgtacaaacg cgtgtacgca 2700
tgtaacgggc agacgcggcc gccaccgcgg tggagctcca attcgcccta tagtgagtcg 2760
tattacaatt cactggccgt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc 2820
caacttaatc gccttgcagc acatcccccc ttcgccagct ggcgtaatag cgaagaggcc 2880
cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg cgacgcgccc 2940
tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt 3000
gccagcgccc tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc 3060
ggctttcccc gtcaagctct aaatcggggg ctccctttag ggttccgatt tagtgcttta 3120
cggcacctcg accccaaaaa acttgattag ggtgatggtt cacgtagtgg gccatcgccc 3180
tgatagacgg tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg 3240
ttccaaactg gaacaacact caaccctatc tcggtctatt cttttgattt ataagggatt 3300
ttgccgattt cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat 3360
tttaacaaaa tattaacgtt tacaatttcc tgatgcggta ttttctcctt acgcatctgt 3420
gcggtatttc acaccgcagg gtaataactg atataattaa attgaagctc taatttgtga 3480
gtttagtata catgcattta cttataatac agttttttag ttttgctggc cgcatcttct 3540
caaatatgct tcccagcctg cttttctgta acgttcaccc tctaccttag catcccttcc 3600
ctttgcaaat agtcctcttc caacaataat aatgtcagat cctgtagaga ccacatcatc 3660
cacggttcta tactgttgac ccaatgcgtc tcccttgtca tctaaaccca caccgggtgt 3720
cataatcaac caatcgtaac cttcatctct tccacccatg tctctttgag caataaagcc 3780
gataacaaaa tctttgtcgc tcttcgcaat gtcaacagta cccttagtat attctccagt 3840
agatagggag cccttgcatg acaattctgc taacatcaaa aggcctctag gttcctttgt 3900
tacttcttct gccgcctgct tcaaaccgct aacaatacct gggcccacca caccgtgtgc 3960
attcgtaatg tctgcccatt ctgctattct gtatacaccc gcagagtact gcaatttgac 4020
tgtattacca atgtcagcaa attttctgtc ttcgaagagt aaaaaattgt acttggcgga 4080
taatgccttt agcggcttaa ctgtgccctc catggaaaaa tcagtcaaga tatccacatg 4140
tgtttttagt aaacaaattt tgggacctaa tgcttcaact aactccagta attccttggt 4200
ggtacgaaca tccaatgaag cacacaagtt tgtttgcttt tcgtgcatga tattaaatag 4260
cttggcagca acaggactag gatgagtagc agcacgttcc ttatatgtag ctttcgacat 4320
gatttatctt cgtttcctgc aggtttttgt tctgtgcagt tgggttaaga atactgggca 4380
atttcatgtt tcttcaacac tacatatgcg tatatatacc aatctaagtc tgtgctcctt 4440
ccttcgttct tccttctgtt cggagattac cgaatcaaaa aaatttcaaa gaaaccgaaa 4500
tcaaaaaaaa gaataaaaaa aaaatgatga attgaattga aaagcgtggt gcactctcag 4560
tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga 4620
cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc 4680
cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg 4740
cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt cttaggacgg 4800
atcgcttgcc tgtaacttac acgcgcctcg tatcttttaa tgatggaata atttgggaat 4860
ttactctgtg tttatttatt tttatgtttt gtatttggat tttagaaagt aaataaagaa 4920
ggtagaagag ttacggaatg aagaaaaaaa aataaacaaa ggtttaaaaa atttcaacaa 4980
aaagcgtact ttacatatat atttattaga caagaaaagc agattaaata gatatacatt 5040
cgattaacga taagtaaaat gtaaaatcac aggattttcg tgtgtggtct tctacacaga 5100
caagatgaaa caattcggca ttaatacctg agagcaggaa gagcaagata aaaggtagta 5160
tttgttggcg atccccctag agtcttttac atcttcggaa aacaaaaact attttttctt 5220
taatttcttt ttttactttc tatttttaat ttatatattt atattaaaaa atttaaatta 5280
taattatttt tatagcacgt gatgaaaagg acccaggtgg cacttttcgg ggaaatgtgc 5340
gcggaacccc tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac 5400
aataaccctg ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt 5460
tccgtgtcgc ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag 5520
aaacgctggt gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg 5580
aactggatct caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa 5640
tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc 5700
aagagcaact cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag 5760
tcacagaaaa gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa 5820
ccatgagtga taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc 5880
taaccgcttt ttttcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg 5940
agctgaatga agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa 6000
caacgttgcg caaactatta actggcgaac tacttactct agcttcccgg caacaattaa 6060
tagactggat ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg 6120
gctggtttat tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag 6180
cactggggcc agatggtaag ccctcccgta tcgtagttat ctacacgacg ggcagtcagg 6240
caactatgga tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt 6300
ggtaactgtc agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt 6360
aatttaaaag gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac 6420
gtgagttttc gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag 6480
atcctttttt tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg 6540
tggtttgttt gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca 6600
gagcgcagat accaaatact gtccttctag tgtagccgta gttaggccac cacttcaaga 6660
actctgtagc accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca 6720
gtggcgataa gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc 6780
agcggtcggg ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca 6840
ccgaactgag atacctacag cgtgagcatt gagaaagcgc cacgcttccc gaagggagaa 6900
aggcggacag gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc 6960
caggggggaa cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc 7020
gtcgattttt gtgatgctcg tcaggggggc cgagcctatg gaaaaacgcc agcaacgcgg 7080
cctttttacg gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat 7140
cccctgattc tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca 7200
gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca 7260
aaccgcctct ccccgcgcgt tggccgattc attaatgcag ctggcacgac aggtttcccg 7320
actggaaagc gggcagtgag cgcaacgcaa ttaatgtgag ttacctcact cattaggcac 7380
cccaggcttt acactttatg cttccggctc ctatgttgtg tggaattgtg agcggataac 7440
aatttcacac aggaaacagc tatgaccatg attacgccaa gctcggaatt aaccctcact 7500
aaagggaaca aaagctgggt accgggcccc ccc 7533
<210> 26
<211> 7542
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 26
tcgagatctc ccgagtttat cattatcaat actgccattt caaagaatac gtaaataatt 60
aatagtagtg attttcctaa ctttatttag tcaaaaaatt ggccttttaa ttctgctgta 120
acccgtacat gcccaaaata gggggcgggt tacacagaat atataacatc ataggtgtct 180
gggtgaacag tttattcctg gcatccacta aatataatgg agcccgcttt ttttaagctg 240
gcatccagaa aaaaaaagaa tcccagcacc aaaatattgt tttcttcacc aaccatcagt 300
tcataggtcc attctcttag cgcaactaca cagaacaggg gcacaaacag gcaaaaaacg 360
ggcacaacct caatggagtg atgcaacctg cttggagtaa atgatgacac aaggcaattg 420
acctacgcat gtatctatct cattttctta caccttctat taccttctgc tctctctgat 480
ttggaaaaag ctgaaaaaaa aggttgaaac cagttccctg aaattattcc cctatttgac 540
taataagtat ataaagacgg taggtattga ttgtaattct gtaaatctat ttcttaaact 600
tcttaaattc tacttttata gttagtcttt tttttagttt taaaacacta agaacttagt 660
ttcgaataaa cacacataaa caaacaaatc tagaatgctt cttcaagcgt tcttgttttt 720
actggcaggg tttgctgcaa aaatttcagc cgcatctatt ccatctagtg catctgtaca 780
attggactcc tacaattacg atggttccac attttccggc aagatttatg tcaaaaacat 840
cgcttactct aaaaaggtta ctgttgtgta cgcagacggt tctgacaact ggaacaataa 900
cggcaacact attgctgcat cattttcagg cccaatctct ggatcaaatt acgaatactg 960
gacattctca gcatcagtga agggcataaa ggagttctac atcaaatacg aagtttcagg 1020
taagacatat tacgacaata acaactctgc aaactaccaa gtctcaactt ctaaacctac 1080
tacaactact gcagctacaa ccacaactac agctccatca acttctacaa caacccgtcc 1140
atctagttca gagcctgcca ccttccctac tggtaattct accatcagct cttggatcaa 1200
aaagcaggaa gatatttcca gattcgctat gcttagaaac atcaacccac ctggttctgc 1260
cacagggttt atcgccgcat cactctctac cgctggtcca gattactact acgcgtggac 1320
aagagatgcc gctttgacat ctaacgttat cgtttacgaa tacaacacca cattgtctgg 1380
gaataagaca attctaaacg tacttaagga ttacgtcaca ttcagtgtta agacacagtc 1440
tacttcaaca gtttgtaatt gccttggtga accaaagttc aatccagacg gcagtggtta 1500
cacaggtgct tggggtagac ctcaaaatga tggtcctgca gaaagagcga ctacatttgt 1560
tctgtttgcc gacagctact tgactcaaac taaggatgcc tcatacgtca ctggtacatt 1620
aaagccagca attttcaaag atctcgatta cgttgttaac gtctggagta acggatgttt 1680
cgatttatgg gaggaggtga acggagttca tttctacacc cttatggtta tgagaaaagg 1740
gctattgttg ggggctgatt tcgcgaagag aaacggtgac tcaactagag cctcaactta 1800
ctcttctact gcttccacaa ttgctaacaa gatatcaagt ttctgggtta gctcaaacaa 1860
ctgggtgcaa gtatcccaat ctgtcacagg aggtgtaagt aaaaaggggt tagacgttag 1920
caccctgtta gctgcgaatc taggatcagt cgatgatgga tttttcactc caggttctga 1980
aaagatatta gctacagctg tggcagtcga agattccttt gccagtctat acccaatcaa 2040
caaaaacctt ccatcatact tggggaacgc tattggaaga taccctgaag atacatacaa 2100
cggtaatggt aactcacaag gcaatccttg gtttctggcg gttaccggct acgcagagtt 2160
gtactataga gcaattaagg aatggatttc taatggaggc gttacagtgt cctctatctc 2220
attgccattt ttcaaaaagt tcgatagctc tgcaacatcc ggtaaaaagt acaccgtagg 2280
tacttctgac ttcaacaatt tagcacaaaa cattgctctt gctgcagatc gtttcctatc 2340
tactgtacaa ctccatgcac caaacaatgg ttcattagca gaggaatttg atagaacaac 2400
aggtttttct accggcgcta gagatttaac atggtcccac gcctcattga taacagcatc 2460
ctatgccaaa gccggtgctc cagctgcata attaattaaa caggcccctt ttcctttgtc 2520
gatatcatgt aattagttat gtcacgctta cattcacgcc ctcctcccac atccgctcta 2580
accgaaaagg aaggagttag acaacctgaa gtctaggtcc ctatttattt ttttatagtt 2640
atgttagtat taagaacgtt atttatattt caaatttttc ttttttttct gtacaaacgc 2700
gtgtacgcat gtaacgggca gacgcggccg ccaccgcggt ggagctccaa ttcgccctat 2760
agtgagtcgt attacaattc actggccgtc gttttacaac gtcgtgactg ggaaaaccct 2820
ggcgttaccc aacttaatcg ccttgcagca catcccccct tcgccagctg gcgtaatagc 2880
gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc 2940
gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc 3000
gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc 3060
acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt 3120
agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg 3180
ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt 3240
ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta 3300
taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt 3360
aacgcgaatt ttaacaaaat attaacgttt acaatttcct gatgcggtat tttctcctta 3420
cgcatctgtg cggtatttca caccgcaggg taataactga tataattaaa ttgaagctct 3480
aatttgtgag tttagtatac atgcatttac ttataataca gttttttagt tttgctggcc 3540
gcatcttctc aaatatgctt cccagcctgc ttttctgtaa cgttcaccct ctaccttagc 3600
atcccttccc tttgcaaata gtcctcttcc aacaataata atgtcagatc ctgtagagac 3660
cacatcatcc acggttctat actgttgacc caatgcgtct cccttgtcat ctaaacccac 3720
accgggtgtc ataatcaacc aatcgtaacc ttcatctctt ccacccatgt ctctttgagc 3780
aataaagccg ataacaaaat ctttgtcgct cttcgcaatg tcaacagtac ccttagtata 3840
ttctccagta gatagggagc ccttgcatga caattctgct aacatcaaaa ggcctctagg 3900
ttcctttgtt acttcttctg ccgcctgctt caaaccgcta acaatacctg ggcccaccac 3960
accgtgtgca ttcgtaatgt ctgcccattc tgctattctg tatacacccg cagagtactg 4020
caatttgact gtattaccaa tgtcagcaaa ttttctgtct tcgaagagta aaaaattgta 4080
cttggcggat aatgccttta gcggcttaac tgtgccctcc atggaaaaat cagtcaagat 4140
atccacatgt gtttttagta aacaaatttt gggacctaat gcttcaacta actccagtaa 4200
ttccttggtg gtacgaacat ccaatgaagc acacaagttt gtttgctttt cgtgcatgat 4260
attaaatagc ttggcagcaa caggactagg atgagtagca gcacgttcct tatatgtagc 4320
tttcgacatg atttatcttc gtttcctgca ggtttttgtt ctgtgcagtt gggttaagaa 4380
tactgggcaa tttcatgttt cttcaacact acatatgcgt atatatacca atctaagtct 4440
gtgctccttc cttcgttctt ccttctgttc ggagattacc gaatcaaaaa aatttcaaag 4500
aaaccgaaat caaaaaaaag aataaaaaaa aaatgatgaa ttgaattgaa aagcgtggtg 4560
cactctcagt acaatctgct ctgatgccgc atagttaagc cagccccgac acccgccaac 4620
acccgctgac gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt 4680
gaccgtctcc gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag 4740
acgaaagggc ctcgtgatac gcctattttt ataggttaat gtcatgataa taatggtttc 4800
ttaggacgga tcgcttgcct gtaacttaca cgcgcctcgt atcttttaat gatggaataa 4860
tttgggaatt tactctgtgt ttatttattt ttatgttttg tatttggatt ttagaaagta 4920
aataaagaag gtagaagagt tacggaatga agaaaaaaaa ataaacaaag gtttaaaaaa 4980
tttcaacaaa aagcgtactt tacatatata tttattagac aagaaaagca gattaaatag 5040
atatacattc gattaacgat aagtaaaatg taaaatcaca ggattttcgt gtgtggtctt 5100
ctacacagac aagatgaaac aattcggcat taatacctga gagcaggaag agcaagataa 5160
aaggtagtat ttgttggcga tccccctaga gtcttttaca tcttcggaaa acaaaaacta 5220
ttttttcttt aatttctttt tttactttct atttttaatt tatatattta tattaaaaaa 5280
tttaaattat aattattttt atagcacgtg atgaaaagga cccaggtggc acttttcggg 5340
gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc 5400
tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta 5460
ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg 5520
ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg 5580
gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac 5640
gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg 5700
acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt 5760
actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg 5820
ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac 5880
cgaaggagct aaccgctttt tttcacaaca tgggggatca tgtaactcgc cttgatcgtt 5940
gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag 6000
caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc 6060
aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc 6120
ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta 6180
tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg 6240
gcagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga 6300
ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac 6360
ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa 6420
tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat 6480
cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc 6540
taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg 6600
gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc 6660
acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg 6720
ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg 6780
ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa 6840
cgacctacac cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg 6900
aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga 6960
gggagcttcc aggggggaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct 7020
gacttgagcg tcgatttttg tgatgctcgt caggggggcc gagcctatgg aaaaacgcca 7080
gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc 7140
ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg 7200
ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc 7260
caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca 7320
ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tacctcactc 7380
attaggcacc ccaggcttta cactttatgc ttccggctcc tatgttgtgt ggaattgtga 7440
gcggataaca atttcacaca ggaaacagct atgaccatga ttacgccaag ctcggaatta 7500
accctcacta aagggaacaa aagctgggta ccgggccccc cc 7542
<210> 27
<211> 7563
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 27
tcgagatctc ccgagtttat cattatcaat actgccattt caaagaatac gtaaataatt 60
aatagtagtg attttcctaa ctttatttag tcaaaaaatt ggccttttaa ttctgctgta 120
acccgtacat gcccaaaata gggggcgggt tacacagaat atataacatc ataggtgtct 180
gggtgaacag tttattcctg gcatccacta aatataatgg agcccgcttt ttttaagctg 240
gcatccagaa aaaaaaagaa tcccagcacc aaaatattgt tttcttcacc aaccatcagt 300
tcataggtcc attctcttag cgcaactaca cagaacaggg gcacaaacag gcaaaaaacg 360
ggcacaacct caatggagtg atgcaacctg cttggagtaa atgatgacac aaggcaattg 420
acctacgcat gtatctatct cattttctta caccttctat taccttctgc tctctctgat 480
ttggaaaaag ctgaaaaaaa aggttgaaac cagttccctg aaattattcc cctatttgac 540
taataagtat ataaagacgg taggtattga ttgtaattct gtaaatctat ttcttaaact 600
tcttaaattc tacttttata gttagtcttt tttttagttt taaaacacta agaacttagt 660
ttcgaataaa cacacataaa caaacaaatc tagaatgacc aaaccaactc aggtgttggt 720
aaggtcagtg agtattctgt tcttcattac tttgcttcat ctggtcgtag ctgcatctat 780
tccatctagt gcatctgtac aattggactc ctacaattac gatggttcca cattttccgg 840
caagatttat gtcaaaaaca tcgcttactc taaaaaggtt actgttgtgt acgcagacgg 900
ttctgacaac tggaacaata acggcaacac tattgctgca tcattttcag gcccaatctc 960
tggatcaaat tacgaatact ggacattctc agcatcagtg aagggcataa aggagttcta 1020
catcaaatac gaagtttcag gtaagacata ttacgacaat aacaactctg caaactacca 1080
agtctcaact tctaaaccta ctacaactac tgcagctaca accacaacta cagctccatc 1140
aacttctaca acaacccgtc catctagttc agagcctgcc accttcccta ctggtaattc 1200
taccatcagc tcttggatca aaaagcagga agatatttcc agattcgcta tgcttagaaa 1260
catcaaccca cctggttctg ccacagggtt tatcgccgca tcactctcta ccgctggtcc 1320
agattactac tacgcgtgga caagagatgc cgctttgaca tctaacgtta tcgtttacga 1380
atacaacacc acattgtctg ggaataagac aattctaaac gtacttaagg attacgtcac 1440
attcagtgtt aagacacagt ctacttcaac agtttgtaat tgccttggtg aaccaaagtt 1500
caatccagac ggcagtggtt acacaggtgc ttggggtaga cctcaaaatg atggtcctgc 1560
agaaagagcg actacatttg ttctgtttgc cgacagctac ttgactcaaa ctaaggatgc 1620
ctcatacgtc actggtacat taaagccagc aattttcaaa gatctcgatt acgttgttaa 1680
cgtctggagt aacggatgtt tcgatttatg ggaggaggtg aacggagttc atttctacac 1740
ccttatggtt atgagaaaag ggctattgtt gggggctgat ttcgcgaaga gaaacggtga 1800
ctcaactaga gcctcaactt actcttctac tgcttccaca attgctaaca agatatcaag 1860
tttctgggtt agctcaaaca actgggtgca agtatcccaa tctgtcacag gaggtgtaag 1920
taaaaagggg ttagacgtta gcaccctgtt agctgcgaat ctaggatcag tcgatgatgg 1980
atttttcact ccaggttctg aaaagatatt agctacagct gtggcagtcg aagattcctt 2040
tgccagtcta tacccaatca acaaaaacct tccatcatac ttggggaacg ctattggaag 2100
ataccctgaa gatacataca acggtaatgg taactcacaa ggcaatcctt ggtttctggc 2160
ggttaccggc tacgcagagt tgtactatag agcaattaag gaatggattt ctaatggagg 2220
cgttacagtg tcctctatct cattgccatt tttcaaaaag ttcgatagct ctgcaacatc 2280
cggtaaaaag tacaccgtag gtacttctga cttcaacaat ttagcacaaa acattgctct 2340
tgctgcagat cgtttcctat ctactgtaca actccatgca ccaaacaatg gttcattagc 2400
agaggaattt gatagaacaa caggtttttc taccggcgct agagatttaa catggtccca 2460
cgcctcattg ataacagcat cctatgccaa agccggtgct ccagctgcat aattaattaa 2520
acaggcccct tttcctttgt cgatatcatg taattagtta tgtcacgctt acattcacgc 2580
cctcctccca catccgctct aaccgaaaag gaaggagtta gacaacctga agtctaggtc 2640
cctatttatt tttttatagt tatgttagta ttaagaacgt tatttatatt tcaaattttt 2700
cttttttttc tgtacaaacg cgtgtacgca tgtaacgggc agacgcggcc gccaccgcgg 2760
tggagctcca attcgcccta tagtgagtcg tattacaatt cactggccgt cgttttacaa 2820
cgtcgtgact gggaaaaccc tggcgttacc caacttaatc gccttgcagc acatcccccc 2880
ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca acagttgcgc 2940
agcctgaatg gcgaatggcg cgacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg 3000
gtggttacgc gcagcgtgac cgctacactt gccagcgccc tagcgcccgc tcctttcgct 3060
ttcttccctt cctttctcgc cacgttcgcc ggctttcccc gtcaagctct aaatcggggg 3120
ctccctttag ggttccgatt tagtgcttta cggcacctcg accccaaaaa acttgattag 3180
ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg tttttcgccc tttgacgttg 3240
gagtccacgt tctttaatag tggactcttg ttccaaactg gaacaacact caaccctatc 3300
tcggtctatt cttttgattt ataagggatt ttgccgattt cggcctattg gttaaaaaat 3360
gagctgattt aacaaaaatt taacgcgaat tttaacaaaa tattaacgtt tacaatttcc 3420
tgatgcggta ttttctcctt acgcatctgt gcggtatttc acaccgcagg gtaataactg 3480
atataattaa attgaagctc taatttgtga gtttagtata catgcattta cttataatac 3540
agttttttag ttttgctggc cgcatcttct caaatatgct tcccagcctg cttttctgta 3600
acgttcaccc tctaccttag catcccttcc ctttgcaaat agtcctcttc caacaataat 3660
aatgtcagat cctgtagaga ccacatcatc cacggttcta tactgttgac ccaatgcgtc 3720
tcccttgtca tctaaaccca caccgggtgt cataatcaac caatcgtaac cttcatctct 3780
tccacccatg tctctttgag caataaagcc gataacaaaa tctttgtcgc tcttcgcaat 3840
gtcaacagta cccttagtat attctccagt agatagggag cccttgcatg acaattctgc 3900
taacatcaaa aggcctctag gttcctttgt tacttcttct gccgcctgct tcaaaccgct 3960
aacaatacct gggcccacca caccgtgtgc attcgtaatg tctgcccatt ctgctattct 4020
gtatacaccc gcagagtact gcaatttgac tgtattacca atgtcagcaa attttctgtc 4080
ttcgaagagt aaaaaattgt acttggcgga taatgccttt agcggcttaa ctgtgccctc 4140
catggaaaaa tcagtcaaga tatccacatg tgtttttagt aaacaaattt tgggacctaa 4200
tgcttcaact aactccagta attccttggt ggtacgaaca tccaatgaag cacacaagtt 4260
tgtttgcttt tcgtgcatga tattaaatag cttggcagca acaggactag gatgagtagc 4320
agcacgttcc ttatatgtag ctttcgacat gatttatctt cgtttcctgc aggtttttgt 4380
tctgtgcagt tgggttaaga atactgggca atttcatgtt tcttcaacac tacatatgcg 4440
tatatatacc aatctaagtc tgtgctcctt ccttcgttct tccttctgtt cggagattac 4500
cgaatcaaaa aaatttcaaa gaaaccgaaa tcaaaaaaaa gaataaaaaa aaaatgatga 4560
attgaattga aaagcgtggt gcactctcag tacaatctgc tctgatgccg catagttaag 4620
ccagccccga cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc 4680
atccgcttac agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc 4740
gtcatcaccg aaacgcgcga gacgaaaggg cctcgtgata cgcctatttt tataggttaa 4800
tgtcatgata ataatggttt cttaggacgg atcgcttgcc tgtaacttac acgcgcctcg 4860
tatcttttaa tgatggaata atttgggaat ttactctgtg tttatttatt tttatgtttt 4920
gtatttggat tttagaaagt aaataaagaa ggtagaagag ttacggaatg aagaaaaaaa 4980
aataaacaaa ggtttaaaaa atttcaacaa aaagcgtact ttacatatat atttattaga 5040
caagaaaagc agattaaata gatatacatt cgattaacga taagtaaaat gtaaaatcac 5100
aggattttcg tgtgtggtct tctacacaga caagatgaaa caattcggca ttaatacctg 5160
agagcaggaa gagcaagata aaaggtagta tttgttggcg atccccctag agtcttttac 5220
atcttcggaa aacaaaaact attttttctt taatttcttt ttttactttc tatttttaat 5280
ttatatattt atattaaaaa atttaaatta taattatttt tatagcacgt gatgaaaagg 5340
acccaggtgg cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa 5400
tacattcaaa tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt 5460
gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg 5520
cattttgcct tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag 5580
atcagttggg tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg 5640
agagttttcg ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg 5700
gcgcggtatt atcccgtatt gacgccgggc aagagcaact cggtcgccgc atacactatt 5760
ctcagaatga cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga 5820
cagtaagaga attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac 5880
ttctgacaac gatcggagga ccgaaggagc taaccgcttt ttttcacaac atgggggatc 5940
atgtaactcg ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc 6000
gtgacaccac gatgcctgta gcaatggcaa caacgttgcg caaactatta actggcgaac 6060
tacttactct agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag 6120
gaccacttct gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg 6180
gtgagcgtgg gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta 6240
tcgtagttat ctacacgacg ggcagtcagg caactatgga tgaacgaaat agacagatcg 6300
ctgagatagg tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata 6360
tactttagat tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt 6420
ttgataatct catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc 6480
ccgtagaaaa gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct 6540
tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa 6600
ctctttttcc gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag 6660
tgtagccgta gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc 6720
tgctaatcct gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg 6780
actcaagacg atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca 6840
cacagcccag cttggagcga acgacctaca ccgaactgag atacctacag cgtgagcatt 6900
gagaaagcgc cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg 6960
tcggaacagg agagcgcacg agggagcttc caggggggaa cgcctggtat ctttatagtc 7020
ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc 7080
cgagcctatg gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc 7140
cttttgctca catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg 7200
cctttgagtg agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga 7260
gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct ccccgcgcgt tggccgattc 7320
attaatgcag ctggcacgac aggtttcccg actggaaagc gggcagtgag cgcaacgcaa 7380
ttaatgtgag ttacctcact cattaggcac cccaggcttt acactttatg cttccggctc 7440
ctatgttgtg tggaattgtg agcggataac aatttcacac aggaaacagc tatgaccatg 7500
attacgccaa gctcggaatt aaccctcact aaagggaaca aaagctgggt accgggcccc 7560
ccc 7563
<210> 28
<211> 7539
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 28
tcgagatctc ccgagtttat cattatcaat actgccattt caaagaatac gtaaataatt 60
aatagtagtg attttcctaa ctttatttag tcaaaaaatt ggccttttaa ttctgctgta 120
acccgtacat gcccaaaata gggggcgggt tacacagaat atataacatc ataggtgtct 180
gggtgaacag tttattcctg gcatccacta aatataatgg agcccgcttt ttttaagctg 240
gcatccagaa aaaaaaagaa tcccagcacc aaaatattgt tttcttcacc aaccatcagt 300
tcataggtcc attctcttag cgcaactaca cagaacaggg gcacaaacag gcaaaaaacg 360
ggcacaacct caatggagtg atgcaacctg cttggagtaa atgatgacac aaggcaattg 420
acctacgcat gtatctatct cattttctta caccttctat taccttctgc tctctctgat 480
ttggaaaaag ctgaaaaaaa aggttgaaac cagttccctg aaattattcc cctatttgac 540
taataagtat ataaagacgg taggtattga ttgtaattct gtaaatctat ttcttaaact 600
tcttaaattc tacttttata gttagtcttt tttttagttt taaaacacta agaacttagt 660
ttcgaataaa cacacataaa caaacaaatc tagaatgaag tgggtaacat tcatctccct 720
tttgttctta ttttctagtg catacagcgc atctattcca tctagtgcat ctgtacaatt 780
ggactcctac aattacgatg gttccacatt ttccggcaag atttatgtca aaaacatcgc 840
ttactctaaa aaggttactg ttgtgtacgc agacggttct gacaactgga acaataacgg 900
caacactatt gctgcatcat tttcaggccc aatctctgga tcaaattacg aatactggac 960
attctcagca tcagtgaagg gcataaagga gttctacatc aaatacgaag tttcaggtaa 1020
gacatattac gacaataaca actctgcaaa ctaccaagtc tcaacttcta aacctactac 1080
aactactgca gctacaacca caactacagc tccatcaact tctacaacaa cccgtccatc 1140
tagttcagag cctgccacct tccctactgg taattctacc atcagctctt ggatcaaaaa 1200
gcaggaagat atttccagat tcgctatgct tagaaacatc aacccacctg gttctgccac 1260
agggtttatc gccgcatcac tctctaccgc tggtccagat tactactacg cgtggacaag 1320
agatgccgct ttgacatcta acgttatcgt ttacgaatac aacaccacat tgtctgggaa 1380
taagacaatt ctaaacgtac ttaaggatta cgtcacattc agtgttaaga cacagtctac 1440
ttcaacagtt tgtaattgcc ttggtgaacc aaagttcaat ccagacggca gtggttacac 1500
aggtgcttgg ggtagacctc aaaatgatgg tcctgcagaa agagcgacta catttgttct 1560
gtttgccgac agctacttga ctcaaactaa ggatgcctca tacgtcactg gtacattaaa 1620
gccagcaatt ttcaaagatc tcgattacgt tgttaacgtc tggagtaacg gatgtttcga 1680
tttatgggag gaggtgaacg gagttcattt ctacaccctt atggttatga gaaaagggct 1740
attgttgggg gctgatttcg cgaagagaaa cggtgactca actagagcct caacttactc 1800
ttctactgct tccacaattg ctaacaagat atcaagtttc tgggttagct caaacaactg 1860
ggtgcaagta tcccaatctg tcacaggagg tgtaagtaaa aaggggttag acgttagcac 1920
cctgttagct gcgaatctag gatcagtcga tgatggattt ttcactccag gttctgaaaa 1980
gatattagct acagctgtgg cagtcgaaga ttcctttgcc agtctatacc caatcaacaa 2040
aaaccttcca tcatacttgg ggaacgctat tggaagatac cctgaagata catacaacgg 2100
taatggtaac tcacaaggca atccttggtt tctggcggtt accggctacg cagagttgta 2160
ctatagagca attaaggaat ggatttctaa tggaggcgtt acagtgtcct ctatctcatt 2220
gccatttttc aaaaagttcg atagctctgc aacatccggt aaaaagtaca ccgtaggtac 2280
ttctgacttc aacaatttag cacaaaacat tgctcttgct gcagatcgtt tcctatctac 2340
tgtacaactc catgcaccaa acaatggttc attagcagag gaatttgata gaacaacagg 2400
tttttctacc ggcgctagag atttaacatg gtcccacgcc tcattgataa cagcatccta 2460
tgccaaagcc ggtgctccag ctgcataatt aattaaacag gccccttttc ctttgtcgat 2520
atcatgtaat tagttatgtc acgcttacat tcacgccctc ctcccacatc cgctctaacc 2580
gaaaaggaag gagttagaca acctgaagtc taggtcccta tttatttttt tatagttatg 2640
ttagtattaa gaacgttatt tatatttcaa atttttcttt tttttctgta caaacgcgtg 2700
tacgcatgta acgggcagac gcggccgcca ccgcggtgga gctccaattc gccctatagt 2760
gagtcgtatt acaattcact ggccgtcgtt ttacaacgtc gtgactggga aaaccctggc 2820
gttacccaac ttaatcgcct tgcagcacat ccccccttcg ccagctggcg taatagcgaa 2880
gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga atggcgcgac 2940
gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct 3000
acacttgcca gcgccctagc gcccgctcct ttcgctttct tcccttcctt tctcgccacg 3060
ttcgccggct ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt 3120
gctttacggc acctcgaccc caaaaaactt gattagggtg atggttcacg tagtgggcca 3180
tcgccctgat agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga 3240
ctcttgttcc aaactggaac aacactcaac cctatctcgg tctattcttt tgatttataa 3300
gggattttgc cgatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac 3360
gcgaatttta acaaaatatt aacgtttaca atttcctgat gcggtatttt ctccttacgc 3420
atctgtgcgg tatttcacac cgcagggtaa taactgatat aattaaattg aagctctaat 3480
ttgtgagttt agtatacatg catttactta taatacagtt ttttagtttt gctggccgca 3540
tcttctcaaa tatgcttccc agcctgcttt tctgtaacgt tcaccctcta ccttagcatc 3600
ccttcccttt gcaaatagtc ctcttccaac aataataatg tcagatcctg tagagaccac 3660
atcatccacg gttctatact gttgacccaa tgcgtctccc ttgtcatcta aacccacacc 3720
gggtgtcata atcaaccaat cgtaaccttc atctcttcca cccatgtctc tttgagcaat 3780
aaagccgata acaaaatctt tgtcgctctt cgcaatgtca acagtaccct tagtatattc 3840
tccagtagat agggagccct tgcatgacaa ttctgctaac atcaaaaggc ctctaggttc 3900
ctttgttact tcttctgccg cctgcttcaa accgctaaca atacctgggc ccaccacacc 3960
gtgtgcattc gtaatgtctg cccattctgc tattctgtat acacccgcag agtactgcaa 4020
tttgactgta ttaccaatgt cagcaaattt tctgtcttcg aagagtaaaa aattgtactt 4080
ggcggataat gcctttagcg gcttaactgt gccctccatg gaaaaatcag tcaagatatc 4140
cacatgtgtt tttagtaaac aaattttggg acctaatgct tcaactaact ccagtaattc 4200
cttggtggta cgaacatcca atgaagcaca caagtttgtt tgcttttcgt gcatgatatt 4260
aaatagcttg gcagcaacag gactaggatg agtagcagca cgttccttat atgtagcttt 4320
cgacatgatt tatcttcgtt tcctgcaggt ttttgttctg tgcagttggg ttaagaatac 4380
tgggcaattt catgtttctt caacactaca tatgcgtata tataccaatc taagtctgtg 4440
ctccttcctt cgttcttcct tctgttcgga gattaccgaa tcaaaaaaat ttcaaagaaa 4500
ccgaaatcaa aaaaaagaat aaaaaaaaaa tgatgaattg aattgaaaag cgtggtgcac 4560
tctcagtaca atctgctctg atgccgcata gttaagccag ccccgacacc cgccaacacc 4620
cgctgacgcg ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac 4680
cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcgagacg 4740
aaagggcctc gtgatacgcc tatttttata ggttaatgtc atgataataa tggtttctta 4800
ggacggatcg cttgcctgta acttacacgc gcctcgtatc ttttaatgat ggaataattt 4860
gggaatttac tctgtgttta tttattttta tgttttgtat ttggatttta gaaagtaaat 4920
aaagaaggta gaagagttac ggaatgaaga aaaaaaaata aacaaaggtt taaaaaattt 4980
caacaaaaag cgtactttac atatatattt attagacaag aaaagcagat taaatagata 5040
tacattcgat taacgataag taaaatgtaa aatcacagga ttttcgtgtg tggtcttcta 5100
cacagacaag atgaaacaat tcggcattaa tacctgagag caggaagagc aagataaaag 5160
gtagtatttg ttggcgatcc ccctagagtc ttttacatct tcggaaaaca aaaactattt 5220
tttctttaat ttcttttttt actttctatt tttaatttat atatttatat taaaaaattt 5280
aaattataat tatttttata gcacgtgatg aaaaggaccc aggtggcact tttcggggaa 5340
atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca 5400
tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc 5460
aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc 5520
acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt 5580
acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt 5640
ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg 5700
ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact 5760
caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg 5820
ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga 5880
aggagctaac cgcttttttt cacaacatgg gggatcatgt aactcgcctt gatcgttggg 5940
aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa 6000
tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac 6060
aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc 6120
cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca 6180
ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggca 6240
gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta 6300
agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc 6360
atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc 6420
cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt 6480
cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac 6540
cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct 6600
tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact 6660
tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg 6720
ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata 6780
aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga 6840
cctacaccga actgagatac ctacagcgtg agcattgaga aagcgccacg cttcccgaag 6900
ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg 6960
agcttccagg ggggaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac 7020
ttgagcgtcg atttttgtga tgctcgtcag gggggccgag cctatggaaa aacgccagca 7080
acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg 7140
cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc 7200
gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa 7260
tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt 7320
ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttac ctcactcatt 7380
aggcacccca ggctttacac tttatgcttc cggctcctat gttgtgtgga attgtgagcg 7440
gataacaatt tcacacagga aacagctatg accatgatta cgccaagctc ggaattaacc 7500
ctcactaaag ggaacaaaag ctgggtaccg ggccccccc 7539
<210> 29
<211> 8086
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 29
ccgggaaacc atccacttca cgagactgat ctcctctgcc ggaacaccgg gcatctccaa 60
cttataagtt ggagaaataa gagaatttca gattgagaga atgaaaaaaa aaaaaaaaga 120
cagaggagag cataaaaatg gggttcactt tttggtaaag ctatagcatg cctatcacat 180
ataaatagag tgccagtagc gacttttttc acactcgaaa tactcttact actgctctct 240
tgttgttttt atcacttctt gtttcttctt ggtaaataga atatcaagct acaaaaagca 300
tacctcgagg ccagaaaaag gaagtgtttc cctccttctt gaattgatgt taccctcata 360
aagcacgtgg cctcttatcg agaaagaaat taccgtcgct cgtgatttgt ttgcaaaaag 420
aacaaaactg aaaaaaccca gacacgctcg acttcctgtc ttcctgttga ttgcagcttc 480
caatttcgtc acacaacaag gtcctagcga cggctcacag gttttgtaac aagcaatcga 540
aggttctgga atggcgggaa agggtttagt accacatgct atgatgccca ctgtgatctc 600
cagagcaaag ttcgttcgat cgtactgtta ctctctctct ttcaaacaga attgtccgaa 660
tcgtgtgaca acaacagcct gttctcacac actcttttct tctaaccaag ggggtggttt 720
agtttagtag aacctcgtga aacttacatt tacatatata taaacttgca taaattggtc 780
aatgcaagaa atacatattt ggtcttttct aattcgtagt ttttcaagtt cttagatgct 840
ttctttttct cttttttaca gatcatcaag gaagtaatta tctacttttt acaagtctag 900
aatgactatc tcttctgctc acccagaaac tgaaccaaag tggtggaaag aggcaacttt 960
ttaccaaatc tacccagctt cattcaagga ctccaatgat gatggttggg gtgatatgaa 1020
aggtattgct tccaaattag aatacattaa ggaattaggt gccgatgcta tttggatttc 1080
tccattctat gattctccac aagacgatat gggttatgac atcgctaact atgaaaaggt 1140
ttggccaacc tatggcacta atgaggactg ttttgcatta attgagaaaa cccacaagtt 1200
gggcatgaag ttcattactg atcttgtcat taatcattgt tcatccgaac atgaatggtt 1260
caaggaatcc agatcctcca aaactaatcc aaaaagagat tggtttttct ggagaccacc 1320
taagggttat gatgctgaag gtaagccaat tccaccaaac aattggaagt cttactttgg 1380
tggttccgca tggaccttcg acgaaaagac ccaagagttt tacttgagat tattctgctc 1440
cacccaacca gatttgaact gggaaaatga agattgtaga aaagcaatct acgaatctgc 1500
agttggctat tggttagatc acggtgttga tggtttcaga attgatgttg gttcacttta 1560
ctcaaaggtt gttggtttgc cagatgcacc agttgttgat aaaaactcta catggcaatc 1620
ttctgaccca tacactctta atggtcctag aatccatgaa tttcatcaag agatgaacca 1680
gttcattaga aatagagtta aggatggtag agaaattatg accgttggtg aaatgcaaca 1740
tgcatctgat gaaactaaga gattatacac atcagcctcc cgtcacgaat tgtctgaatt 1800
attcaacttt tcacacacag acgttggcac atccccatta ttccgttata acttggttcc 1860
attcgaattg aaggactgga aaatcgcatt ggcagaattg tttagatata tcaatggtac 1920
tgattgttgg tctaccatct acttggaaaa ccacgaccaa ccaagatcca tcactagatt 1980
cggtgatgac tctcctaaaa accgtgtcat ttctggtaag ttactttctg tcttattatc 2040
cgccttaacc ggtactttgt acgtctatca aggccaggaa ttgggtcaaa ttaactttaa 2100
gaattggcca gtcgaaaagt atgaagatgt cgaaatcaga aacaactaca atgcaattaa 2160
ggaggaacat ggtgaaaatt cagaggaaat gaaaaagttt ttggaagcta ttgctcttat 2220
ttccagagat cacgctagaa ccccaatgca atggtcaaga gaggaaccta acgctggttt 2280
ctctggtcct tccgccaagc cttggtttta cttaaacgac tccttcagag aaggtattaa 2340
cgttgaagat gaaattaagg acccaaattc cgtccttaac ttctggaagg aagcattgaa 2400
gtttagaaag gcccataagg atattaccgt ttatggttat gactttgagt ttatcgattt 2460
ggataacaaa aagttattct cattcactaa aaagtataac aacaagacct tattcgctgc 2520
tttaaacttc tcttctgatg ctactgattt caaaattcct aatgacgatt cctctttcaa 2580
gttggagttt ggtaactacc caaagaagga agttgacgca tcttctcgta cattgaagcc 2640
ttgggaaggt agaatctaca tctccgagta acctgcaggt ttgccagctt actatccttc 2700
ttgaaaatat gcactctata tcttttagtt cttaattgca acacatagat ttgctgtata 2760
acgaatttta tgctattttt taaatttgga gttcagtgat aaaagtgtca cagcgaattt 2820
cctcacatgt agggaccgaa ttgtttacaa gttctctgta ccaccatgga gacatcaaaa 2880
attgaaaatc tatggaaaga tatggacggt agcaacaaga atatagcacg agccggcgac 2940
tagtaacggc cgccagtgtg ctggaattcg gccggccata acttcgtata atgtatgcta 3000
tacgaagtta tggcaacggt tcatcatctc atggatctgc acatgaacaa acaccagagt 3060
caaacgacgt tgaaattgag gctactgcgc caattgatga caatacagac gatgataaca 3120
aaccgaagtt atctgatgta gaaaaggatt agagatgcta agagatagtg atgatatttc 3180
ataaataatg taattctata tatgttaatt accttttttg cgaggcatat ttatggtgaa 3240
ggataagttt tgaccatcaa agaaggttaa tgtggctgtg gtttcagggt ccataaagct 3300
tttcaattca tctttttttt tttgttcttt tttttgattc cggtttcttt gaaatttttt 3360
tgattcggta atctccgagc agaaggaaga acgaaggaag gagcacagac ttagattggt 3420
atatatacgc atatgtggtg ttgaagaaac atgaaattgc ccagtattct taacccaact 3480
gcacagaaca aaaacctgca ggaaacgaag ataaatcatg tcgaaagcta catataagga 3540
acgtgctgct actcatccta gtcctgttgc tgccaagcta tttaatatca tgcacgaaaa 3600
gcaaacaaac ttgtgtgctt cattggatgt tcgtaccacc aaggaattac tggagttagt 3660
tgaagcatta ggtcccaaaa tttgtttact aaaaacacat gtggatatct tgactgattt 3720
ttccatggag ggcacagtta agccgctaaa ggcattatcc gccaagtaca attttttact 3780
cttcgaagac agaaaatttg ctgacattgg taatacagtc aaattgcagt actctgcggg 3840
tgtatacaga atagcagaat gggcagacat tacgaatgcg cacggtgtgg tgggcccagg 3900
tattgttagc ggtttgaagc aggcggcgga agaagtaaca aaggaaccta gaggcctttt 3960
gatgttagca gaattgtcat gcaagggctc cctagctact ggagaatata ctaagggtac 4020
tgttgacatt gcgaagagcg acaaagattt tgttatcggc tttattgctc aaagagacat 4080
gggtggaaga gatgaaggtt acgattggtt gattatgaca cccggtgtgg gtttagatga 4140
caagggagac gcattgggtc aacagtatag agccgtggat gatgtggtct ctacaggatc 4200
tgacattatt attgttggaa gaggactatt tgcaaaggga agggatgcta aggtagaggg 4260
tgaacgttac agaaaagcag gctgggaagc atatttgaga agatgcggcc agcaaaacta 4320
aaaaactgta ttataagtaa atgcatgtat actaaactca caaattagag cttcaattta 4380
attatatcag ttattacccg ggaatctcgg tcgtaatgat ttttataatg acgaaaaaaa 4440
aaaattggaa agaaaaagct tcatggcctt tataaaaagg aaccatccaa tacctcgcca 4500
gaaccaagta acagtatttt acggggcaca aatcaagaac aataagacag gactgtaaag 4560
atggacgcat tgaactccaa agaacaacaa gagttccaaa aagtagtgga acaaaagcaa 4620
atgaaggatt tcatgcgttt gataacttcg tataatgtat gctatacgaa gttatgcggc 4680
cgcctcgaga tctcccctaa accgtggaat atttcggata tccttttgtt gtttccgggt 4740
gtacaatatg gacttcctct tttctggcaa ccaaacccat acatcgggat tcctataata 4800
ccttcgttgg tctccctaac atgtaggtgg cggaggggag atatacaata gaacagatac 4860
cagacaagac ataatgggct aaacaagact acaccaatta cactgcctca ttgatggtgg 4920
tacataacga actaatactg tagccctaga cttgatagcc atcatcatat cgaagtttca 4980
ctaccctttt tccatttgcc atctattgaa gtaataatag gcgcatgcaa cttcttttct 5040
ttttttttct tttctctctc ccccgttgtt gtctcaccat atccgcaatg acaaaaaaat 5100
gatggaagac actaaaggaa aaaattaacg acaaagacag caccaacaga tgtcgttgtt 5160
ccagagctga tggggggtat ctcgaagcac acgaaacttt ttccttcctt cattcacgca 5220
cactactctc taatgagcaa cggtatacgg ccttccttcc agttacttga atttgaaata 5280
aaaaaagttt gctgtcttgc tatcaagtat aaatagacct gcaattatta atcttttgtt 5340
tcctcgtcat tgttctcgtt ccctttcttc cttgtttctt tttctgcaca atatttcaag 5400
ctataccaag catacaatca actatctcat atacaatgaa gaacttcata tcactggtga 5460
acaagaaaaa gggtaccctg gatgatagga atagtagcgt tccggaatct tccagtggta 5520
taatacacca acgtggagct ttaaacactg aggattttga agaaggaaag aaagatggtg 5580
cattcgaatt gggtcacctc gaattcacca ccaattcagc ccaattgggt gattcagacg 5640
atgataatga taatgcaatt aagatagcga atgctgccac tgatgaagcc aatgaggcta 5700
atagtgaaga aaaaagcatg accttaaggc aagctttgag aaaatatcca aaggcagccc 5760
tatggtccat cttggtgtct actaccttag tcatggaagg ttatgatact gcgcttttga 5820
gtgcacttta tgcattaccg gttttccaga ggaaattcgg tactatgaat gcggaaggct 5880
cctacgaaat tacctcgcag tggcaaattg gtttgaacat gtgtgtcctt tgtggtgaaa 5940
tgattggttt acagatgacc acttacatgg tcgagttcat gggtaatcgt tacacaatga 6000
ttacggcgct cggcttgttg actgcttata tttttatcct ttactactgc aaaagtttgg 6060
ccatgatcgc tgtagggcaa attctgtctg ctatgccatg gggttgcttc cagagtctgg 6120
ctgttaccta tgcttcggag gtttgccccc tagcgctgag atattacatg accagttact 6180
ccaatatttg ttggttgttt ggtcaaattt tcgcttctgg tatcatgaaa aactcccagg 6240
agaatttggg agactccgat ttaggctaca agttgccatt tgccttacaa tggatctggc 6300
ctgcaccttt gattattggt atcttctttg ctcctgagtc gccttggtgg ctggtgagaa 6360
agaataagat tgcggaggcc aaaaagtcct tgaatagaat cctgagcggc actgctgccg 6420
agagggagat tcaagtggat atcactttaa agcaaattga gatgaccatt gagaaggaga 6480
gacttctggc atctaaatca gggtcgttct tcaactgttt caaaggcgtt gatggaagaa 6540
gaacaaggct tgcgtgtttg acttgggttg ctcaaaacag tagtggtgcc gttttactag 6600
gttactcgac gtatttcttt gaaagggcag ggatggccac tgacaaggcg tttactttct 6660
cgcttatcca gtactgtcta ggtttagcag gcactctttg ttcctgggtg atatctggcc 6720
gtgttggtag atggagtatc ctggcttatg gtcttgcatt tcaaatggtg tgtctattca 6780
tcattggtgg aatggggttt gcatccggaa gcaatgccag taatggtgct ggtggtctac 6840
tgctggcttt atcgttcttt tacaacgctg gtatcggagc tgtcgtttac tgtattgtgg 6900
ctgaaattcc gtctgcagaa ttaaggacca aaactattgt aatggctcgt atttgctata 6960
atttgatggc cgtcatcaat gccattttaa cgccatatat gctgaacgtg agtgactgga 7020
actggggtgc taaaaccggc ctatactggg gtggtttcac tgcagtcact ttggcttggg 7080
ttatcattga tttgcctgag acaactggta gaacatttag cgaaattaat gagcttttca 7140
atcaaggtgt ccctgctaga aaatttgcat ctactgtagt tgatcctttc gggaagggac 7200
agcgtcaaaa tgattcgcaa gtggataacg tcattgacca gtcctcaagc gcaatgcagc 7260
aagagctaaa tgaagctaac gaattctaat taattaaaca ggcccctttt cctttgtcga 7320
tatcatgtaa ttagttatgt cacgcttaca ttcacgccct cctcccacat ccgctctaac 7380
cgaaaaggaa ggagttagac aacctgaagt ctaggtccct atttattttt ttatagttat 7440
gttagtatta agaacgttat ttatatttca aatttttctt ttttttctgt acaaacgcgt 7500
gtacgcatgt aacgggcaga cgaattcgat atcaagctta tcgataccgt cgacgcggat 7560
ctcttatgtc tttacgattt atagttttca ttatcaagta tgcctatatt agtatatagc 7620
atctttagat gacagtgttc gaagtttcac gaataaaaga taatattcta ctttttgctc 7680
ccaccgcgtt tgctagcacg agtgaacacc atccctcgcc tgtgagttgt acccattcct 7740
ctaaactgta gacatggtag cttcagcagt gttcgttatg tacggcatcc tccaacaaac 7800
agtcggttat agtttgtcct gctcctctga atcgtctccc tcgatatttc tcattttcct 7860
tcgcatgcca gcattgaaat gatcgaagtt caatgatgaa acggtaattc ttctgtcatt 7920
tactcatctc atctcatcaa gttatataat tctatacgga tgtaattttt cacttttcgt 7980
cttgacgtcc accctataat ttcaattatt gaaccctcac aaatgatgca ctgcaatgta 8040
cacaccctca tatagtttct cagggcttga tcagggttcc gtagag 8086
<210> 30
<211> 8685
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 30
ccgggaaacc atccacttca cgagactgat ctcctctgcc ggaacaccgg gcatctccaa 60
cttataagtt ggagaaataa gagaatttca gattgagaga atgaaaaaaa aaaaaaaaga 120
cagaggagag cataaaaatg gggttcactt tttggtaaag ctatagcatg cctatcacat 180
ataaatagag tgccagtagc gacttttttc acactcgaaa tactcttact actgctctct 240
tgttgttttt atcacttctt gtttcttctt ggtaaataga atatcaagct acaaaaagca 300
tacctcgagg ccagaaaaag gaagtgtttc cctccttctt gaattgatgt taccctcata 360
aagcacgtgg cctcttatcg agaaagaaat taccgtcgct cgtgatttgt ttgcaaaaag 420
aacaaaactg aaaaaaccca gacacgctcg acttcctgtc ttcctgttga ttgcagcttc 480
caatttcgtc acacaacaag gtcctagcga cggctcacag gttttgtaac aagcaatcga 540
aggttctgga atggcgggaa agggtttagt accacatgct atgatgccca ctgtgatctc 600
cagagcaaag ttcgttcgat cgtactgtta ctctctctct ttcaaacaga attgtccgaa 660
tcgtgtgaca acaacagcct gttctcacac actcttttct tctaaccaag ggggtggttt 720
agtttagtag aacctcgtga aacttacatt tacatatata taaacttgca taaattggtc 780
aatgcaagaa atacatattt ggtcttttct aattcgtagt ttttcaagtt cttagatgct 840
ttctttttct cttttttaca gatcatcaag gaagtaatta tctacttttt acaagtctag 900
aatgactatc tcttctgctc acccagaaac tgaaccaaag tggtggaaag aggcaacttt 960
ttaccaaatc tacccagctt cattcaagga ctccaatgat gatggttggg gtgatatgaa 1020
aggtattgct tccaaattag aatacattaa ggaattaggt gccgatgcta tttggatttc 1080
tccattctat gattctccac aagacgatat gggttatgac atcgctaact atgaaaaggt 1140
ttggccaacc tatggcacta atgaggactg ttttgcatta attgagaaaa cccacaagtt 1200
gggcatgaag ttcattactg atcttgtcat taatcattgt tcatccgaac atgaatggtt 1260
caaggaatcc agatcctcca aaactaatcc aaaaagagat tggtttttct ggagaccacc 1320
taagggttat gatgctgaag gtaagccaat tccaccaaac aattggaagt cttactttgg 1380
tggttccgca tggaccttcg acgaaaagac ccaagagttt tacttgagat tattctgctc 1440
cacccaacca gatttgaact gggaaaatga agattgtaga aaagcaatct acgaatctgc 1500
agttggctat tggttagatc acggtgttga tggtttcaga attgatgttg gttcacttta 1560
ctcaaaggtt gttggtttgc cagatgcacc agttgttgat aaaaactcta catggcaatc 1620
ttctgaccca tacactctta atggtcctag aatccatgaa tttcatcaag agatgaacca 1680
gttcattaga aatagagtta aggatggtag agaaattatg accgttggtg aaatgcaaca 1740
tgcatctgat gaaactaaga gattatacac atcagcctcc cgtcacgaat tgtctgaatt 1800
attcaacttt tcacacacag acgttggcac atccccatta ttccgttata acttggttcc 1860
attcgaattg aaggactgga aaatcgcatt ggcagaattg tttagatata tcaatggtac 1920
tgattgttgg tctaccatct acttggaaaa ccacgaccaa ccaagatcca tcactagatt 1980
cggtgatgac tctcctaaaa accgtgtcat ttctggtaag ttactttctg tcttattatc 2040
cgccttaacc ggtactttgt acgtctatca aggccaggaa ttgggtcaaa ttaactttaa 2100
gaattggcca gtcgaaaagt atgaagatgt cgaaatcaga aacaactaca atgcaattaa 2160
ggaggaacat ggtgaaaatt cagaggaaat gaaaaagttt ttggaagcta ttgctcttat 2220
ttccagagat cacgctagaa ccccaatgca atggtcaaga gaggaaccta acgctggttt 2280
ctctggtcct tccgccaagc cttggtttta cttaaacgac tccttcagag aaggtattaa 2340
cgttgaagat gaaattaagg acccaaattc cgtccttaac ttctggaagg aagcattgaa 2400
gtttagaaag gcccataagg atattaccgt ttatggttat gactttgagt ttatcgattt 2460
ggataacaaa aagttattct cattcactaa aaagtataac aacaagacct tattcgctgc 2520
tttaaacttc tcttctgatg ctactgattt caaaattcct aatgacgatt cctctttcaa 2580
gttggagttt ggtaactacc caaagaagga agttgacgca tcttctcgta cattgaagcc 2640
ttgggaaggt agaatctaca tctccgagta acctgcaggt ttgccagctt actatccttc 2700
ttgaaaatat gcactctata tcttttagtt cttaattgca acacatagat ttgctgtata 2760
acgaatttta tgctattttt taaatttgga gttcagtgat aaaagtgtca cagcgaattt 2820
cctcacatgt agggaccgaa ttgtttacaa gttctctgta ccaccatgga gacatcaaaa 2880
attgaaaatc tatggaaaga tatggacggt agcaacaaga atatagcacg agccggcgac 2940
tagtaacggc cgccagtgtg ctggaattcg gccggccagg ccgcataact tcgtatagca 3000
tacattatac gaagttatcg cctgttaaga tataactgaa aaaagagggg aatttttaga 3060
tactgaaatg atattttaga ataaccagac tatatataag gataaattac aaaaaattaa 3120
ctaatagata agatttaaat ataaaagata tgcaactaga aaagtcttat caatctcctt 3180
atggagtgac gacgttaccc aacaatttac cgacttcttc ggcgatagcc aaagttctct 3240
cttcggacaa tcttctacca ataacttgaa cagcaacagg agcaccgtga taagcctctg 3300
ggtcgtattc ttcttgaacc aaagcatcca attcggaaac agctttaaaa gattcgttct 3360
tcttatcaat attcttatca gcgaaagtga ctgggacgac aacagaggtg aaatccaata 3420
agttaataac ggaggcgtaa ccgtagtatc tgaattgatc gtgtctgaca gcggcggtag 3480
gagtaattgg agcgataata gcgtccaatt ccttaccagc tttttcttca gcttcacgcc 3540
acttttccaa gtattccatt tgatagttcc acttttgtaa atgagtgtcc cacaattcgt 3600
tcatgttaac agccttaata tttgggttca acaagtcctt aatgttaggg atggctggct 3660
caccagaggc agaaatgtct ctcatgacgt cggcagaacc atcagcagca tagatgtggg 3720
aaatcaagtc atgaccgaaa tcatgcttgt atggagtcca tggagtaacg gtgtgaccag 3780
ccttggccaa agcggcaacg gtagtttcga caccacgtaa aattggtggg tgtggcaaga 3840
cgttaccgtc gaaattgtaa taaccaatgt tcaaaccacc attcttaatc ttagaggcaa 3900
tgatgtcaga ttcagattgt ctccatggca ttgggatgac cttagagtcg tacttccaag 3960
gttcttgacc caagacagat ttggtgaaca atctcaagtc ttcgacggag tgagtgatag 4020
gaccaacgac ggagtgaacg gtttcttgac cttccataga gttagccatt ttagcatatg 4080
gcaatctacc gtgagatggt ctcaaaccgt ataaaaagtt gaaagcagct gggactctaa 4140
tggaaccacc aatgtcagta ccgacaccaa taacaccacc tctaatacca acaatagcac 4200
cttcaccacc agaagaacca ccacaggacc aatttttgtt tcttggattg acagttctac 4260
caatgatgtt gttgacggtt tcacagacca tcaaggtttg tgggacagag gtcttaacgt 4320
agaaaacagc accagctttt ctcaacatgg tggttaagac ggaatcacct tcatcgtatt 4380
tgtttaacca ggaaatgtaa cccatggagg tttcgtaacc cttaacacgc aattggtcct 4440
ttaaagagat tggtaaaccg tgtaatggac caactggtct cttatgctta gcgtagtatt 4500
catctaattc tctagcttga gctaaagcag catctgggaa gaattcgtga gcacagttgg 4560
ttaattgttg agcaatagca gctctcttac aaaaagccaa agtgacttca acagaagtca 4620
actcaccagc ggccaacttg gagaccaaat cagcagcaga ggcttcggta atcttcaatt 4680
cagcctcaga caaaataccg gacttctttg ggaaatcaat aacggaatct tcggcaggca 4740
aagtttgaac cttccattcg tcaggaatgg ttttagccaa acgggcacgt ttgtcggcgg 4800
ccaattcttc ccaggattgt ggcattttgt aattaaaact tagattagat tgctatgctt 4860
tctttctaat gagcaagaag taaaaaaagt tgtaatagaa caagaaaaac gaaactgaaa 4920
cttgagaaat tgaagaccat ttattaactt aaatatcaat gggaggtcat cgaaagagaa 4980
aaaaatcaaa aaaaaaattt ttcaagaaaa agaaacgtga taaaaatttt tattgccttt 5040
ttcgacgaag aaaaagaaac gaggcggtct cttttttctt ttccaaacct ttagtacggg 5100
taattaacgc caccctagag gaagaaagag gggaaattta gtatgctgtg cttgggtgtt 5160
ttgaagtggt acggcgatgc gcggagtccg agaaaatctg gaagagtaaa aaaggagtag 5220
aaacattttg aagctatggt gtgtggggga tcacttgtgg gggattgggt gtgatgtaag 5280
gataacttcg tatagcatac attatacgaa gttatgcggc cgcgtctgcc cgttacatgc 5340
gtacacgcgt ttgtacagaa aaaaaagaaa aatttgaaat ataaataacg ttcttaatac 5400
taacataact ataaaaaaat aaatagggac ctagacttca ggttgtctaa ctccttcctt 5460
ttcggttaga gcggatgtgg gaggagggcg tgaatgtaag cgtgacataa ctaattacat 5520
gatatcgaca aaggaaaagg ggcctgttta attaattaga attcgttagc ttcatttagc 5580
tcttgctgca ttgcgcttga ggactggtca atgacgttat ccacttgcga atcattttga 5640
cgctgtccct tcccgaaagg atcaactaca gtagatgcaa attttctagc agggacacct 5700
tgattgaaaa gctcattaat ttcgctaaat gttctaccag ttgtctcagg caaatcaatg 5760
ataacccaag ccaaagtgac tgcagtgaaa ccaccccagt ataggccggt tttagcaccc 5820
cagttccagt cactcacgtt cagcatatat ggcgttaaaa tggcattgat gacggccatc 5880
aaattatagc aaatacgagc cattacaata gttttggtcc ttaattctgc agacggaatt 5940
tcagccacaa tacagtaaac gacagctccg ataccagcgt tgtaaaagaa cgataaagcc 6000
agcagtagac caccagcacc attactggca ttgcttccgg atgcaaaccc cattccacca 6060
atgatgaata gacacaccat ttgaaatgca agaccataag ccaggatact ccatctacca 6120
acacggccag atatcaccca ggaacaaaga gtgcctgcta aacctagaca gtactggata 6180
agcgagaaag taaacgcctt gtcagtggcc atccctgccc tttcaaagaa atacgtcgag 6240
taacctagta aaacggcacc actactgttt tgagcaaccc aagtcaaaca cgcaagcctt 6300
gttcttcttc catcaacgcc tttgaaacag ttgaagaacg accctgattt agatgccaga 6360
agtctctcct tctcaatggt catctcaatt tgctttaaag tgatatccac ttgaatctcc 6420
ctctcggcag cagtgccgct caggattcta ttcaaggact ttttggcctc cgcaatctta 6480
ttctttctca ccagccacca aggcgactca ggagcaaaga agataccaat aatcaaaggt 6540
gcaggccaga tccattgtaa ggcaaatggc aacttgtagc ctaaatcgga gtctcccaaa 6600
ttctcctggg agtttttcat gataccagaa gcgaaaattt gaccaaacaa ccaacaaata 6660
ttggagtaac tggtcatgta atatctcagc gctagggggc aaacctccga agcataggta 6720
acagccagac tctggaagca accccatggc atagcagaca gaatttgccc tacagcgatc 6780
atggccaaac ttttgcagta gtaaaggata aaaatataag cagtcaacaa gccgagcgcc 6840
gtaatcattg tgtaacgatt acccatgaac tcgaccatgt aagtggtcat ctgtaaacca 6900
atcatttcac cacaaaggac acacatgttc aaaccaattt gccactgcga ggtaatttcg 6960
taggagcctt ccgcattcat agtaccgaat ttcctctgga aaaccggtaa tgcataaagt 7020
gcactcaaaa gcgcagtatc ataaccttcc atgactaagg tagtagacac caagatggac 7080
catagggctg cctttggata ttttctcaaa gcttgcctta aggtcatgct tttttcttca 7140
ctattagcct cattggcttc atcagtggca gcattcgcta tcttaattgc attatcatta 7200
tcatcgtctg aatcacccaa ttgggctgaa ttggtggtga attcgaggtg acccaattcg 7260
aatgcaccat ctttctttcc ttcttcaaaa tcctcagtgt ttaaagctcc acgttggtgt 7320
attataccac tggaagattc cggaacgcta ctattcctat catccagggt accctttttc 7380
ttgttcacca gtgatatgaa gttcttcatt gtatatgaga tagttgattg tatgcttggt 7440
atagcttgaa atattgtgca gaaaaagaaa caaggaagaa agggaacgag aacaatgacg 7500
aggaaacaaa agattaataa ttgcaggtct atttatactt gatagcaaga cagcaaactt 7560
tttttatttc aaattcaagt aactggaagg aaggccgtat accgttgctc attagagagt 7620
agtgtgcgtg aatgaaggaa ggaaaaagtt tcgtgtgctt cgagataccc cccatcagct 7680
ctggaacaac gacatctgtt ggtgctgtct ttgtcgttaa ttttttcctt tagtgtcttc 7740
catcattttt ttgtcattgc ggatatggtg agacaacaac gggggagaga gaaaagaaaa 7800
aaaaagaaaa gaagttgcat gcgcctatta ttacttcaat agatggcaaa tggaaaaagg 7860
gtagtgaaac ttcgatatga tgatggctat caagtctagg gctacagtat tagttcgtta 7920
tgtaccacca tcaatgaggc agtgtaattg gtgtagtctt gtttagccca ttatgtcttg 7980
tctggtatct gttctattgt atatctcccc tccgccacct acatgttagg gagaccaacg 8040
aaggtattat aggaatcccg atgtatgggt ttggttgcca gaaaagagga agtccatatt 8100
gtacacccgg aaacaacaaa aggatatccg aaatattcca cggtttaggt cgacgcggat 8160
ctcttatgtc tttacgattt atagttttca ttatcaagta tgcctatatt agtgtatagc 8220
atctttagat gacagtgttc gaagtttcac gaataaaaga taatattcta ctttttgctc 8280
ccaccgcgtt tgctagcacg agtgaacacc atccctcgcc tgtgagttgt acccattcct 8340
ctaaactgta gacatggtag cttcagcagt gttcgttatg tacggcatcc tccaacaaac 8400
agtcggttat agtttgtcct gctcctctga atcgtctccc tcgatatttc tcattttcct 8460
tcgcatgcca gcattgaaat gatcgaagtt caatgatgaa acggtaattc ttctgtcatt 8520
tactcatctc atctcatcaa gttatataat tctatacgga tgtaattttt cacttttcgt 8580
cttgacgtca ccctataatt tcaattgttg aaccctcaca aatgatgcac tgcaatgtac 8640
acaccctcat atagtttctc agggcttgat cagggttccg tagag 8685
<210> 31
<211> 8719
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 31
atcacatagg aagcaacagg cgcgttggac ttttaatttt cgaggaccgc gaatccttac 60
atcacaccca atcccccaca agtgatcccc cacacaccat agcttcaaaa tgtttctact 120
ccttttttac tcttccagat tttctcggac tccgcgcatc gccgtaccac ttcaaaacac 180
ccaagcacag catactaaat ttcccctctt tcttcctcta gggtgtcgtt aattacccgt 240
actaaaggtt tggaaaagaa aaaagagacc gcctcgtttc tttttcttcg tcgaaaaagg 300
caataaaaat ttttatcacg tttctttttc ttgaaaattt ttttttttga tttttttctc 360
tttcgatgac ctcccattga tatttaagtt aataaacggt cttcaatttc tcaagtttca 420
gtttcatttt tcttgttcta ttacaacttt ttttacttct tgctcattag aaagaaagca 480
tagcaatcta atctaagttt taattacaaa tctagaatga gtgaatctcc aatgttcgct 540
gccaacggca tgccaaaggt aaatcaaggt gctgaagaag atgtcagaat tttaggttac 600
gacccattag cttctccagc tctccttcaa gtgcaaatcc cagccacacc aacttctttg 660
gaaactgcca agagaggtag aagagaagct atagatatta ttaccggtaa agacgacaga 720
gttcttgtca ttgtcggtcc ttgttccatc catgatcttg aagccgctca agaatacgct 780
ttgagattaa agaaattgtc agatgaatta aaaggtgatt tatccatcat tatgagagca 840
tacttggaga agccaagaac aaccgtcggc tggaaaggtc taattaatga ccctgatgtt 900
aacaacactt tcaacatcaa caagggtttg caatccgcta gacaattgtt tgtcaacttg 960
acaaatatcg gtttgccaat tggttctgaa atgcttgata ccatttctcc taaatacttg 1020
gctgatttgg tctccttcgg tgccattggt gccagaacca ccgaatctca actgcacaga 1080
gaattggcct ccggtttgtc tttcccagtt ggtttcaaga acggtaccga tggtacctta 1140
aatgttgctg tggatgcttg tcaagccgct gctcattctc accatttcat gggtgttact 1200
aagcatggtg ttgctgctat caccactact aagggtaacg aacactgctt cgttattcta 1260
agaggtggta aaaagggtac caactacgac gctaagtccg ttgcagaagc taaggctcaa 1320
ttgcctgccg gttccaacgg tctaatgatt gactactctc acggtaactc caataaggat 1380
ttcagaaacc aaccaaaggt caatgacgtt gtttgtgagc aaatcgctaa cggtgaaaac 1440
gccattaccg gtgtcatgat tgaatcaaac atcaacgaag gtaaccaagg catcccagcc 1500
gaaggtaaag ccggcttgaa atatggtgtt tccatcactg atgcttgtat aggttgggaa 1560
actactgaag acgtcttgag gaaattggct gctgctgtca gacaaagaag agaagttaac 1620
aagaaataga tgttttttta atgatatatg taacgtacat tctttcctct accactgcca 1680
attcggtatt atttaattgt gtttagcgct atttactaat taactagaaa ctcaattttt 1740
aaaggcaaag ctcgctgacc tttcactgat ttcgtggatg ttatactatc agttactctt 1800
ctgcaaaaaa aaattgagtc atatcgtagc tttgggatta tttttctctc tctccacggc 1860
taattaggtg atcatgaaaa aatgaaaaat tcatgagaaa agagtcagac atcgaaacat 1920
acataagttg atattccttt gatatcgacg actactcaat caggttttaa aagaaaagag 1980
gcagctattg aagtagcagt atccagttta ggttttttaa ttatttacaa gtaaagaaaa 2040
agagaatgcc ggtcgttcac ggcggccgcg ccagaaaaag gaagtgtttc cctccttctt 2100
gaattgatgt taccctcata aagcacgtgg cctcttatcg agaaagaaat taccgtcgct 2160
cgtgatttgt ttgcaaaaag aacaaaactg aaaaaaccca gacacgctcg acttcctgtc 2220
ttcctattga ttgcagcttc caatttcgtc acacaacaag gtcctagcga cggctcacag 2280
gttttgtaac aagcaatcga aggttctgga atggcgggaa agggtttagt accacatgct 2340
atgatgccca ctgtgatctc cagagcaaag ttcgttcgat cgtactgtta ctctctctct 2400
ttcaaacaga attgtccgaa tcgtgtgaca acaacagcct gttctcacac actcttttct 2460
tctaaccaag ggggtggttt agtttagtag aacctcgtga aacttacatt tacatatata 2520
taaacttgca taaattggtc aatgcaagaa atacatattt ggtcttttct aattcgtagt 2580
ttttcaagtt cttagatgct ttctttttct cttttttaca gatcatcaac tcttttttac 2640
agatcatcaa ggaagtaatt atctactttt tacaagaatt catgtctaat ttacttactg 2700
ttcaccaaaa cttgcctgca ttaccagttg acgcaacctc cgatgaagtc agaaagaacc 2760
ttatggatat gtttagagat agacaagctt tctccgaaca tacttggaaa atgttattat 2820
ccgtttgtag atcctgggcc gcttggtgta aacttaacaa tagaaaatgg tttcctgctg 2880
aaccagaaga cgtcagagat tacttacttt acttacaagc tagaggtttg gctgttaaaa 2940
ctatccaaca acacttaggt caattgaata tgttacacag aagatccggt ttaccaagac 3000
catccgattc caacgcagtt tcccttgtta tgagaagaat tagaaaagaa aatgttgacg 3060
ctggtgaaag agctaaacaa gcattagcat ttgaaagaac cgatttcgat caagttagat 3120
ccttaatgga aaattccgat agatgtcaag atattagaaa cttagctttc ttaggtattg 3180
cttacaacac attattaaga atcgctgaaa ttgctagaat tagagttaaa gatatttcaa 3240
gaaccgatgg cggtagaatg ttaatccaca ttggcagaac aaaaacctta gtctccacag 3300
caggcgtcga aaaagcatta tcattaggtg ttactaaatt agttgaacgt tggatttccg 3360
tttccggtgt tgcagatgac ccaaacaact acttattctg tcgtgttaga aaaaatggtg 3420
ttgccgctcc ttccgctacc tcacaattat ccacaagagc attagaaggc atttttgaag 3480
ctacccacag acttatttat ggtgcaaaag acgattccgg tcaaagatat ttagcttggt 3540
ctggtcattc cgctagagtt ggtgccgcaa gagacatggc aagagctggt gtttctattc 3600
ctgaaattat gcaagccggt ggttggacta atgttaacat tgttatgaac tatatcagaa 3660
acttagattc cgaaacaggt gctatggtta gattacttga agacggtgat taagctagct 3720
aagatccgct ctaaccgaaa aggaaggagt tagacaacct gaagtctagg tccctattta 3780
tttttttata gttatgttag tattaagaac gttatttata tttcaaattt ttcttttttt 3840
tctgtacaga cgcgtgtacg catgtaacat tatactgaaa accttgcttg agaaggtttt 3900
gggacgctcg aaggagctcc aattcgccct atagtgagtc gtattacaat tcactggccg 3960
tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag 4020
cacatccccc cttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc 4080
aacagttgcg cagcctgaat ggcgaatggc gcgacgcgcc ctgtagcggc gcattaagcg 4140
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg 4200
ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 4260
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 4320
aacttgatta gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 4380
ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 4440
tcaaccctat ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt 4500
ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt 4560
ttacaatttc ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcag 4620
ggtaataact gatataatta aattgaagct ctaatttgtg agtttagtat acatgcattt 4680
acttataata cagtttttta gttttgctgg ccgcatcttc tcaaatatgc ttcccagcct 4740
gcttttctgt aacgttcacc ctctacctta gcatcccttc cctttgcaaa tagtcctctt 4800
ccaacaataa taatgtcaga tcctgtagag accacatcat ccacggttct atactgttga 4860
cccaatgcgt ctcccttgtc atctaaaccc acaccgggtg tcataatcaa ccaatcgtaa 4920
ccttcatctc ttccacccat gtctctttga gcaataaagc cgataacaaa atctttgtcg 4980
ctcttcgcaa tgtcaacagt acccttagta tattctccag tagataggga gcccttgcat 5040
gacaattctg ctaacatcaa aaggcctcta ggttcctttg ttacttcttc tgccgcctgc 5100
ttcaaaccgc taacaatacc tgggcccacc acaccgtgtg cattcgtaat gtctgcccat 5160
tctgctattc tgtatacacc cgcagagtac tgcaatttga ctgtattacc aatgtcagca 5220
aattttctgt cttcgaagag taaaaaattg tacttggcgg ataatgcctt tagcggctta 5280
actgtgccct ccatggaaaa atcagtcaag atatccacat gtgtttttag taaacaaatt 5340
ttgggaccta atgcttcaac taactccagt aattccttgg tggtacgaac atccaatgaa 5400
gcacacaagt ttgtttgctt ttcgtgcatg atattaaata gcttggcagc aacaggacta 5460
ggatgagtag cagcacgttc cttatatgta gctttcgaca tgatttatct tcgtttcctg 5520
caggtttttg ttctgtgcag ttgggttaag aatactgggc aatttcatgt ttcttcaaca 5580
ctacatatgc gtatatatac caatctaagt ctgtgctcct tccttcgttc ttccttctgt 5640
tcggagatta ccgaatcaaa aaaatttcaa agaaaccgaa atcaaaaaaa agaataaaaa 5700
aaaaatgatg aattgaattg aaaagcgtgg tgcactctca gtacaatctg ctctgatgcc 5760
gcatagttaa gccagccccg acacccgcca acacccgctg acgcgccctg acgggcttgt 5820
ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag 5880
aggttttcac cgtcatcacc gaaacgcgcg agacgaaagg gcctcgtgat acgcctattt 5940
ttataggtta atgtcatgat aataatggtt tcttaggacg gatcgcttgc ctgtaactta 6000
cacgcgcctc gtatctttta atgatggaat aatttgggaa tttactctgt gtttatttat 6060
ttttatgttt tgtatttgga ttttagaaag taaataaaga aggtagaaga gttacggaat 6120
gaagaaaaaa aaataaacaa aggtttaaaa aatttcaaca aaaagcgtac tttacatata 6180
tatttattag acaagaaaag cagattaaat agatatacat tcgattaacg ataagtaaaa 6240
tgtaaaatca caggattttc gtgtgtggtc ttctacacag acaagatgaa acaattcggc 6300
attaatacct gagagcagga agagcaagat aaaaggtagt atttgttggc gatcccccta 6360
gagtctttta catcttcgga aaacaaaaac tattttttct ttaatttctt tttttacttt 6420
ctatttttaa tttatatatt tatattaaaa aatttaaatt ataattattt ttatagcacg 6480
tgatgaaaag gacccaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt 6540
atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct gataaatgct 6600
tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg cccttattcc 6660
cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa 6720
agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc tcaacagcgg 6780
taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca cttttaaagt 6840
tctgctatgt ggcgcggtat tatcccgtat tgacgccggg caagagcaac tcggtcgccg 6900
catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa agcatcttac 6960
ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg ataacactgc 7020
ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt tttttcacaa 7080
catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg aagccatacc 7140
aaacgacgag cgtgacacca cgatgcctgt agcaatggca acaacgttgc gcaaactatt 7200
aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga tggaggcgga 7260
taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta ttgctgataa 7320
atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc cagatggtaa 7380
gccctcccgt atcgtagtta tctacacgac gggcagtcag gcaactatgg atgaacgaaa 7440
tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt cagaccaagt 7500
ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa ggatctaggt 7560
gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt cgttccactg 7620
agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt ttctgcgcgt 7680
aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt tgccggatca 7740
agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga taccaaatac 7800
tgtccttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag caccgcctac 7860
atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata agtcgtgtct 7920
taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg gctgaacggg 7980
gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga gatacctaca 8040
gcgtgagcat tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca ggtatccggt 8100
aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggga acgcctggta 8160
tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc 8220
gtcagggggg ccgagcctat ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc 8280
cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt ctgtggataa 8340
ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga ccgagcgcag 8400
cgagtcagtg agcgaggaag cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg 8460
ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag cgggcagtga 8520
gcgcaacgca attaatgtga gttacctcac tcattaggca ccccaggctt tacactttat 8580
gcttccggct cctatgttgt gtggaattgt gagcggataa caatttcaca caggaaacag 8640
ctatgaccat gattacgcca agctcggaat taaccctcac taaagggaac aaaagctggg 8700
taccgggccc cccctcgag 8719
<210> 32
<211> 5001
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 32
cagagcctct tatattcact ctgttcctcc atcgcctatt gagaaacgtt ggaataaaac 60
tctaaaaata tcatctagtt ggttagtttt tattttacca gtacattgtc acttgcggag 120
ggaggatgac ataaagattg agacgcagtc atttaatgaa gtttaaacgc aggtatttga 180
taaagtaata cgatattgaa tcatgacgta taaagtgaaa tgaacaaatg attacgtaaa 240
aaatgtcgat tttctcttga gagactccca tagcctctaa gaggccttct actacgttcc 300
atatatctaa gaatggggcc atatccagtg gaatcccagc aattatttaa ggatcaccta 360
tttctcagcc gatattttag caaaatcact accaatatca gggggcaata gttgatcgcc 420
tactttaaca aaaaatgttg ctcacgtatt aacacaggca acaaaaagga tattacgcaa 480
gaacgtagta tccacatgcc atcctccttg ttgcatcttt ttttttccga aatgattccc 540
tttcctgcac aacacgagat ctttcacgca tacatcggaa ggatcacccc ccactcaagt 600
cgttgcattg ctaacatgtg gcattctgcc catttttttc acgaaaattc tctctctata 660
atgaagaccc ttgtgccctg gactctgtaa tacttgaaac tacttcctca ataatcgctt 720
ggagacctac ccccacgctt ttcaaacaag gcgctagcaa aaagcctgcc gatatctcct 780
tgccccctcc ttctgttcga gagaactacg acccgaccaa taataatgtc atacaagaac 840
cgccaagaac caactgctga accttagatc tccaatactt cagttggagt atgtgaatat 900
ataagtacct ggtcgactaa tcttcttgca tcttttcgta ttcttacatc ctatgtcgct 960
aatacagttc ccgcatagag aagaaagcaa acaaaagtag tcactcgaga tctcccgagt 1020
ttatcattat caatactgcc atttcaaaga atacgtaaat aattaatagt agtgattttc 1080
ctaactttat ttagtcaaaa aattggcctt ttaattctgc tgtaacccgt acatgcccaa 1140
aatagggggc gggttacaca gaatatataa catcataggt gtctgggtga acagtttatt 1200
cctggcatcc actaaatata atggagcccg ctttttttaa gctggcatcc agaaaaaaaa 1260
agaatcccag caccaaaata ttgttttctt caccaaccat cagttcatag gtccattctc 1320
ttagcgcaac tacacagaac aggggcacaa acaggcaaaa aacgggcaca acctcaatgg 1380
agtgatgcaa cctgcttgga gtaaatgatg acacaaggca attgacctac gcatgtatct 1440
atctcatttt cttacacctt ctattacctt ctgctctctc tgatttggaa aaagctgaaa 1500
aaaaaggttg aaaccagttc cctgaaatta ttcccctatt tgactaataa gtatataaag 1560
acggtaggta ttgattgtaa ttctgtaaat ctatttctta aacttcttaa attctacttt 1620
tatagttagt ctttttttta gtttaaaaca ccaagaactt agtttcgaat aaacacacat 1680
aaacaaacaa atctagaatg ttcaagtctg ttgtttactc tattttggct gcctctttgg 1740
ctaacgctag tgttatctct aagagagcaa cgttggatag ttggttatca aatgaagcaa 1800
ctgtcgctag aaccgcaatt ctaaacaata ttggagctga tggtgcatgg gttagcggtg 1860
cagactctgg tattgtggta gcctctccat ccacagataa tccagattat ttctatactt 1920
ggactagaga ttccggaata gttttgaaaa cgctggtgga tttgtttcgt aatggggaca 1980
ccgacttgtt atcaaccatt gagcattata tctccagtca agcaattatt caaggtgtct 2040
caaatccatc cggcgacttg agcagtgggg ggctgggaga acctaagttc aatgtggacg 2100
aaacggctta cgctggaagt tggggcagac cacagagaga cggaccagct ctaagagcaa 2160
cagccatgat tggattcggt cagtggctac tagacaatgg atacactagc gccgcgacag 2220
aaattgtttg gccactagtc aggaacgacc taagttacgt tgctcaatat tggaaccaaa 2280
ccgggtatga tctgtgggaa gaggttaatg gatctagttt cttcaccatc gcagttcagc 2340
atagagcttt ggttgaaggt agcgccttcg caacggcagt tgggagttca tgctcttggt 2400
gtgattcaca ggcaccacaa atcttatgtt atcttcagag cttttggacc ggttcctata 2460
ttctagccaa tttcgacagt tccagatccg gtaaggatac taacacttta cttggctcaa 2520
tacatacctt cgaccctgaa gctgggtgtg atgattctac attccaaccc tgttctccga 2580
gagcactggc caatcataaa gaagtggttg attcatttag aagtatttat acactaaatg 2640
acggattaag tgacagtgaa gccgtagccg tcggaagata tccagaagat tcctattaca 2700
atggtaatcc atggttctta tgtacacttg ctgctgctga acaattatat gacgcattgt 2760
atcaatggga taagcaaggc tctttagaaa ttaccgacgt aagtttagac ttctttaaag 2820
cattgtatag cggtgcagcc acgggtacat actcatcttc ttctagtacg tactcttcta 2880
ttgtttctgc ggtgaaaact tttgctgacg gctttgtttc tatcgtcgag acccatgccg 2940
ccagtaacgg ttctttatcc gaacaatttg acaagtccga tggcgatgag ttaagcgcaa 3000
gagatctaac ctggtcttat gccgcattac ttacagccaa caacagacgt aattccgttg 3060
taccaccatc ttggggtgaa acaagtgctt cttcagttcc gggcacctgc gcggccacaa 3120
gtgcatcagg aacttattca tcagtgactg taacatcttg gcctagtatt gtcgcaaccg 3180
gtggtacaac taccactgca actacgacgg gttctggagg agtcacttcc acaagcaaga 3240
ctacgactac tgcaagtaaa accagtacta ctacctcctc cactagctgt acgacaccca 3300
ccgccgtagc cgtcactttc gatttgactg ctacaaccac ctacggcgag aatatctact 3360
tggtgggatc aatctcacaa ctaggtgact gggagacttc cgacgggatc gctttgtcag 3420
cagataaata cacatcatct aacccaccat ggtatgtgac ggtcacttta cctgccgggg 3480
agtctttcga atacaagttt ataagggtag aatccgatga cagtgtggaa tgggaatctg 3540
atcctaatag agagtacaca gtgccacaag cttgtgggga atctacagcc acagttaccg 3600
atacatggag gtagttaatt aaacaggccc cttttccttt gtcgatatca tgtaattagt 3660
tatgtcacgc ttacattcac gccctcctcc cacatccgct ctaaccgaaa aggaaggagt 3720
tagacaacct gaagtctagg tccctattta tttttttata gttatgttag tattaagaac 3780
gttatttata tttcaaattt ttcttttttt tctgtacaaa cgcgtgtacg catgtaacgg 3840
gcagacggcc ggccataact tcgtataatg tatgctatac gaagttatgg caacggttca 3900
tcatctcatg gatctgcaca tgaacaaaca ccagagtcaa acgacgttga aattgaggct 3960
actgcgccaa ttgatgacaa tacagacgat gataacaaac cgaagttatc tgatgtagaa 4020
aaggattaga gatgctaaga gatagtgatg atatttcata aataatgtaa ttctatatat 4080
gttaattacc ttttttgcga ggcatattta tggtgaagga taagttttga ccatcaaaga 4140
aggttaatgt ggctgtggtt tcagggtcca taaagctttt caattcatct tttttttttt 4200
tgttcttttt tttgattccg gtttctttga aatttttttg attcggtaat ctccgagcag 4260
aaggaagaac gaaggaagga gcacagactt agattggtat atatacgcat atgtggtgtt 4320
gaagaaacat gaaattgccc agtattctta acccaactgc acagaacaaa aacctgcagg 4380
aaacgaagat aaatcatgtc gaaagctaca tataaggaac gtgctgctac tcatcctagt 4440
cctgttgctg ccaagctatt taatatcatg cacgaaaagc aaacaaactt gtgtgcttca 4500
ttggatgttc gtaccaccaa ggaattactg gagttagttg aagcattagg tcccaaaatt 4560
tgtttactaa aaacacatgt ggatatcttg actgattttt ccatggaggg cacagttaag 4620
ccgctaaagg cattatccgc caagtacaat tttttactct tcgaagacag aaaatttgct 4680
gacattggta atacagtcaa attgcagtac tctgcgggtg tatacagaat agcagaatgg 4740
gcagacatta cgaatgcaca cggtgtggtg ggcccaggta ttgttagcgg tttgaagcag 4800
gcggcggaag aagtaacaaa ggaacctaga ggccttttga tgttagcaga attgtcatgc 4860
aagggctccc tagctactgg agaatatact aagggtactg ttgacattgc gaagagcgac 4920
aaagattttg ttatcggctt tattgctcaa agagacatgg gtggaagaga tgaaggttac 4980
gattggttga ttatgacacg c 5001
<210> 33
<211> 5125
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 33
ggccgctcca tggagggcac agttaagccg ctaaaggcat tatccgccaa gtacaatttt 60
ttactcttcg aagacagaaa atttgctgac attggtaata cagtcaaatt gcagtactct 120
gcgggtgtat acagaatagc agaatgggca gacattacga atgcacacgg tgtggtgggc 180
ccaggtattg ttagcggttt gaagcaggcg gcggaagaag taacaaagga acctagaggc 240
cttttgatgt tagcagaatt gtcatgcaag ggctccctag ctactggaga atatactaag 300
ggtactgttg acattgcgaa gagcgacaaa gattttgtta tcggctttat tgctcaaaga 360
gacatgggtg gaagagatga aggttacgat tggttgatta tgacacccgg tgtgggttta 420
gatgacaagg gagacgcatt gggtcaacag tatagaaccg tggatgatgt ggtctctaca 480
ggatctgaca ttattattgt tggaagagga ctatttgcaa agggaaggga tgctaaggta 540
gagggtgaac gttacagaaa agcaggctgg gaagcatatt tgagaagatg cggccagcaa 600
aactaaaaaa ctgtattata agtaaatgca tgtatactaa actcacaaat tagagcttca 660
atttaattat atcagttatt acccgggaat ctcggtcgta atgattttta taatgacgaa 720
aaaaaaaaaa ttggaaagaa aaagcttcat ggcctttata aaaaggaacc atccaatacc 780
tcgccagaac caagtaacag tattttacgg ggcacaaatc aagaacaata agacaggact 840
gtaaagatgg acgcattgaa ctccaaagaa caacaagagt tccaaaaagt agtggaacaa 900
aagcaaatga aggatttcat gcgtttgata acttcgtata atgtatgcta tacgaagtta 960
tctcgagatc tcccctaaac cgtggaatat ttcggatatc cttttgttgt ttccgggtgt 1020
acaatatgga cttcctcttt tctggcaacc aaacccatac atcgggattc ctataatacc 1080
ttcgttggtc tccctaacat gtaggtggcg gaggggagat atacaataga acagatacca 1140
gacaagacat aatgggctaa acaagactac accaattaca ctgcctcatt gatggtggta 1200
cataacgaac taatactgta gccctagact tgatagccat catcatatcg aagtttcact 1260
accctttttc catttgccat ctattgaagt aataataggc gcatgcaact tcttttcttt 1320
ttttttcttt tctctctccc ccgttgttgt ctcaccatat ccgcaatgac aaaaaaatga 1380
tggaagacac taaaggaaaa aattaacgac aaagacagca ccaacagatg tcgttgttcc 1440
agagctgatg aggggtatct cgaagcacac gaaacttttt ccttccttca ttcacgcaca 1500
ctactctcta atgagcaacg gtatacggcc ttccttccag ttacttgaat ttgaaataaa 1560
aaaaagtttg ctgtcttgct atcaagtata aatagacctg caattattaa tcttttgttt 1620
cctcgtcatt gttctcgttc cctttcttcc ttgtttcttt ttctgcacaa tatttcaagc 1680
tataccaagc atacaatcaa ctatctcata tacatctaga atgttcaaaa gcgtggttta 1740
ctccattttg gctgcatctt tggcaaatgc ctcagttatt tccaagcgtg ctacgttaga 1800
tagttggtta agtaatgagg caaccgtagc cagaaccgca atattgaaca atattggtgc 1860
cgatggggcc tgggttagcg gcgcagattc cggtatcgta gttgcaagcc cctccactga 1920
taaccccgat tacttctaca cttggactag ggactccggc atcgttttga agactctggt 1980
tgacttattt agaaatggcg atacggatct acttagcacg atagaacact atatcagttc 2040
ccaagctatt atacagggtg tttctaaccc cagtggagat ttatcaagcg gaggccttgg 2100
tgagcctaaa ttcaacgttg atgagactgc atatgcagga tcttggggca gaccacaaag 2160
agatggtcca gctttaagag ctactgccat gataggtttc gggcaatggt tgttggacaa 2220
tggatacact tcagcagcaa ctgaaatcgt ctggccgttg gtcagaaatg atttaagcta 2280
tgtcgctcag tattggaacc aaacaggcta cgatttgtgg gaagaggtca atggttcttc 2340
tttctttacc attgccgttc agcacagagc cctagtggaa ggttctgctt tcgcaacagc 2400
tgtggggtct tcttgctcat ggtgtgattc tcaagctcct cagatactgt gttatttaca 2460
gtcattctgg actggttcct acattcttgc taacttcgat tcttccagaa gcggcaaaga 2520
tactaacact ttgcttggca gcattcacac ttttgatccc gaagctgggt gcgacgattc 2580
tactttccaa ccatgttcac caagggcgct ggctaatcat aaggaagttg tcgattcttt 2640
tagatctatc tataccttga atgacggtct atcagattcc gaagccgtgg ctgtgggaag 2700
gtatcccgag gattcatact acaatggtaa cccatggttt ctttgcacat tggctgcagc 2760
ggaacaactg tatgatgctc tttaccaatg ggacaagcaa ggctctctgg agatcacaga 2820
tgttagtctg gatttcttta aggctttgta ctcaggtgca gccaccggta catatagctc 2880
ttcaagttca acctatagct ctatagtatc cgccgtgaag acctttgcag atggtttcgt 2940
tagcattgtt gaaacacacg ctgcatcaaa tggttcactg agtgaacagt ttgataaatc 3000
cgacggcgat gaattgagtg cgagagattt gacttggtct tatgcggctc ttcttactgc 3060
taacaataga cgtaatagtg ttgttccacc ctcttggggt gaaactagcg ctagtagtgt 3120
cccagggact tgcgcagcta caagtgcatc tggcacctac agctcagtga ctgttacatc 3180
ctggccaagt atagttgcta ccggtggcac tacaaccact gcaaccacta ctggtagcgg 3240
aggtgttact tcaacttcta aaaccacgac aactgcttct aagacgtcaa cgaccacatc 3300
ctcaacaagc tgtactacac ctacagcggt tgcagttaca ttcgacctaa ccgccacgac 3360
gacctacggg gaaaatatat atttggttgg aagtatctct caattagggg attgggaaac 3420
gtctgatgga attgccctaa gtgcagataa atatacatct tctaacccgc cttggtatgt 3480
taccgttaca ttgccagcag gcgaatcctt tgaatataaa ttcataagag tcgaatctga 3540
tgattctgtt gaatgggagt cagacccaaa tcgtgagtat actgtacctc aggcctgcgg 3600
tgaaagcaca gctactgtga ccgatacttg gaggtagtta attaatttac cagcttacta 3660
tccttcttga aaatatgcac tctatatctt ttagttctta attgcaacac atagatttgc 3720
tgtataacga attttatgct atttttttaa tttggagttc ggtgatgaaa gtgtcacagc 3780
gaatttcctc acatgtaggg accgaattgt ttacaagttc tctgtaccac catggagaca 3840
tcaaagattg aaaatctatg gaaagatatg gacggtagca acaagaatat agcacgagcc 3900
gcggagttca tttcgttact tttgatatcg ctcacaacta ttgcgaagcg cttcagtgaa 3960
aaaatcataa ggaaaagttg taaatattat tggtagtatt cgtttggtaa agtagagggg 4020
gtaatttttc ccctttattt tgttcataca ttcttaaatt gctttgcctc tccttttgga 4080
aagctatact tcggagcact gttgagcgaa ggctcaggcc ggccttatag cctagcttta 4140
aggctacttt aaaaactttt tatttattca tacacatata ttatcgaaca ttcgtataac 4200
ttaatatcat tcaaaaaaaa aaaaaaaaaa aaaagaaaac atatacacat atatatttat 4260
gtttatagag agagagagag aaaatttgaa tttttgaatc atttgcaaag ttatatgttt 4320
tatacattat ttattcattt tttttggtgt cgaggacatt gtgctgttca gagaaccact 4380
taaaatacgc atcgttctgt aaatatccac tttcattaaa aaccttattc acttctaact 4440
ttgccttcaa ctccttcttg gagttttctc cctttttttt ctgaacaagc tcaaccagat 4500
ataatggttc gttcttttcg aactttgtct ttacatatat ttcctccttt gtacctcttc 4560
tctttcccac ataaacagtc cccttttcaa taaaacgaga gaaataccag aaaagtagcg 4620
agagaacaaa atatgcgcct accaaaagct tttgatacgt aacaatctga tctctctcaa 4680
attttttatc caagaagaaa ctcaaaccag ctacaacagc tatggaataa cctatgtaca 4740
atttagcatc gagtaaagcg tatgatctct cgtaatttaa tctcgcgaaa acagaaggta 4800
gggcttcatc taaagcttgg ttcaactccg ggattgaata tacattaata ggtttagcag 4860
aactcatctt gaacaggcgt ctcttttcct tacaataact tgtgcttttc cttctataat 4920
tccgtttcaa cgtgtacaat tgtcattttt tgtctggtat gattttgcag aactgaaaaa 4980
atctcttaaa tgttccgcct catcaagaag gcatattcct ttacaaaagt acattgatct 5040
tacaagaagc tagctaatgg tactatttaa aaaacaacta cactccatca atacataaaa 5100
ttgttatgat agacttgagg gacgg 5125
<210> 34
<211> 5384
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 34
cagagcctct tatattcact ctgttcctcc atcgcctatt gagaaacgtt ggaataaaac 60
tctaaaaata tcatctagtt ggttagtttt tattttacca gtacattgtc acttgcggag 120
ggaggatgac ataaagattg agacgcagtc atttaatgaa gtttaaacgc aggtatttga 180
taaagtaata cgatattgaa tcatgacgta taaagtgaaa tgaacaaatg attacgtaaa 240
aaatgtcgat tttctcttga gagactccca tagcctctaa gaggccttct actacgttcc 300
atatatctaa gaatggggcc atatccagtg gaatcccagc aattatttaa ggatcaccta 360
tttctcagcc gatattttag caaaatcact accaatatca gggggcaata gttgatcgcc 420
tactttaaca aaaaatgttg ctcacgtatt aacacaggca acaaaaagga tattacgcaa 480
gaacgtagta tccacatgcc atcctccttg ttgcatcttt ttttttccga aatgattccc 540
tttcctgcac aacacgagat ctttcacgca tacatcggaa ggatcacccc ccactcaagt 600
cgttgcattg ctaacatgtg gcattctgcc catttttttc acgaaaattc tctctctata 660
atgaagaccc ttgtgccctg gactctgtaa tacttgaaac tacttcctca ataatcgctt 720
ggagacctac ccccacgctt ttcaaacaag gcgctagcaa aaagcctgcc gatatctcct 780
tgccccctcc ttctgttcga gagaactacg acccgaccaa taataatgtc atacaagaac 840
cgccaagaac caactgctga accttagatc tccaatactt cagttggagt atgtgaatat 900
ataagtacct ggtcgactaa tcttcttgca tcttttcgta ttcttacatc ctatgtcgct 960
aatacagttc ccgcatagag aagaaagcaa acaaaagtag tcactcgaga tctcccgagt 1020
ttatcattat caatactgcc atttcaaaga atacgtaaat aattaatagt agtgattttc 1080
ctaactttat ttagtcaaaa aattggcctt ttaattctgc tgtaacccgt acatgcccaa 1140
aatagggggc gggttacaca gaatatataa catcataggt gtctgggtga acagtttatt 1200
cctggcatcc actaaatata atggagcccg ctttttttaa gctggcatcc agaaaaaaaa 1260
agaatcccag caccaaaata ttgttttctt caccaaccat cagttcatag gtccattctc 1320
ttagcgcaac tacacagaac aggggcacaa acaggcaaaa aacgggcaca acctcaatgg 1380
agtgatgcaa cctgcttgga gtaaatgatg acacaaggca attgacctac gcatgtatct 1440
atctcatttt cttacacctt ctattacctt ctgctctctc tgatttggaa aaagctgaaa 1500
aaaaaggttg aaaccagttc cctgaaatta ttcccctatt tgactaataa gtatataaag 1560
acggtaggta ttgattgtaa ttctgtaaat ctatttctta aacttcttaa attctacttt 1620
tatagttagt ctttttttta gtttaaaaca ccaagaactt agtttcgaat aaacacacat 1680
aaacaaacaa atctagaatg ttcaagtctg ttgtttactc tattttggct gcctctttgg 1740
ctaacgctag tgttatctct aagagagcaa cgttggatag ttggttatca aatgaagcaa 1800
ctgtcgctag aaccgcaatt ctaaacaata ttggagctga tggtgcatgg gttagcggtg 1860
cagactctgg tattgtggta gcctctccat ccacagataa tccagattat ttctatactt 1920
ggactagaga ttccggaata gttttgaaaa cgctggtgga tttgtttcgt aatggggaca 1980
ccgacttgtt atcaaccatt gagcattata tctccagtca agcaattatt caaggtgtct 2040
caaatccatc cggcgacttg agcagtgggg ggctgggaga acctaagttc aatgtggacg 2100
aaacggctta cgctggaagt tggggcagac cacagagaga cggaccagct ctaagagcaa 2160
cagccatgat tggattcggt cagtggctac tagacaatgg atacactagc gccgcgacag 2220
aaattgtttg gccactagtc aggaacgacc taagttacgt tgctcaatat tggaaccaaa 2280
ccgggtatga tctgtgggaa gaggttaatg gatctagttt cttcaccatc gcagttcagc 2340
atagagcttt ggttgaaggt agcgccttcg caacggcagt tgggagttca tgctcttggt 2400
gtgattcaca ggcaccacaa atcttatgtt atcttcagag cttttggacc ggttcctata 2460
ttctagccaa tttcgacagt tccagatccg gtaaggatac taacacttta cttggctcaa 2520
tacatacctt cgaccctgaa gctgggtgtg atgattctac attccaaccc tgttctccga 2580
gagcactggc caatcataaa gaagtggttg attcatttag aagtatttat acactaaatg 2640
acggattaag tgacagtgaa gccgtagccg tcggaagata tccagaagat tcctattaca 2700
atggtaatcc atggttctta tgtacacttg ctgctgctga acaattatat gacgcattgt 2760
atcaatggga taagcaaggc tctttagaaa ttaccgacgt aagtttagac ttctttaaag 2820
cattgtatag cggtgcagcc acgggtacat actcatcttc ttctagtacg tactcttcta 2880
ttgtttctgc ggtgaaaact tttgctgacg gctttgtttc tatcgtcgag acccatgccg 2940
ccagtaacgg ttctttatcc gaacaatttg acaagtccga tggcgatgag ttaagcgcaa 3000
gagatctaac ctggtcttat gccgcattac ttacagccaa caacagacgt aattccgttg 3060
taccaccatc ttggggtgaa acaagtgctt cttcagttcc gggcacctgc gcggccacaa 3120
gtgcatcagg aacttattca tcagtgactg taacatcttg gcctagtatt gtcgcaaccg 3180
gtggtacaac taccactgca actacgacgg gttctggagg agtcacttcc acaagcaaga 3240
ctacgactac tgcaagtaaa accagtacta ctacctcctc cactagctgt acgacaccca 3300
ccgccgtagc cgtcactttc gatttgactg ctacaaccac ctacggcgag aatatctact 3360
tggtgggatc aatctcacaa ctaggtgact gggagacttc cgacgggatc gctttgtcag 3420
cagataaata cacatcatct aacccaccat ggtatgtgac ggtcacttta cctgccgggg 3480
agtctttcga atacaagttt ataagggtag aatccgatga cagtgtggaa tgggaatctg 3540
atcctaatag agagtacaca gtgccacaag cttgtgggga atctacagcc acagttaccg 3600
atacatggag gtagttaatt aaacaggccc cttttccttt gtcgatatca tgtaattagt 3660
tatgtcacgc ttacattcac gccctcctcc cacatccgct ctaaccgaaa aggaaggagt 3720
tagacaacct gaagtctagg tccctattta tttttttata gttatgttag tattaagaac 3780
gttatttata tttcaaattt ttcttttttt tctgtacaaa cgcgtgtacg catgtaacgg 3840
gcagacggcc ggccataact tcgtataatg tatgctatac gaagttatcc ttacatcaca 3900
cccaatcccc cacaagtgat cccccacaca ccatagcttc aaaatgtttc tactcctttt 3960
ttactcttcc agattttctc ggactccgcg catcgccgta ccacttcaaa acacccaagc 4020
acagcatact aaatttcccc tctttcttcc tctagggtgg cgttaattac ccgtactaaa 4080
ggtttggaaa agaaaaaaga gaccgcctcg tttctttttc ttcgtcgaaa aaggcaataa 4140
aaatttttat cacgtttctt tttcttgaaa aatttttttt ttgatttttt tctctttcga 4200
tgacctccca ttgatattta agttaataaa tggtcttcaa tttctcaagt ttcagtttcg 4260
tttttcttgt tctattacaa ctttttttac ttcttgctca ttagaaagaa agcatagcaa 4320
tctaatctaa gttttaatta caaaatgcca caatcctggg aagaattggc cgccgacaaa 4380
cgtgcccgtt tggctaaaac cattcctgac gaatggaagg ttcaaacttt gcctgccgaa 4440
gattccgtta ttgatttccc aaagaagtcc ggtattttgt ctgaggctga attgaagatt 4500
accgaagcct ctgctgctga tttggtctcc aagttggccg ctggtgagtt gacttctgtt 4560
gaagtcactt tggctttttg taagagagct gctattgctc aacaattaac caactgtgct 4620
cacgaattct tcccagatgc tgctttagct caagctagag aattagatga atactacgct 4680
aagcataaga gaccagttgg tccattacac ggtttaccaa tctctttaaa ggaccaattg 4740
cgtgttaagg gttacgaaac ctccatgggt tacatttcct ggttaaacaa atacgatgaa 4800
ggtgattccg tcttaaccac catgttgaga aaagctggtg ctgttttcta cgttaagacc 4860
tctgtcccac aaaccttgat ggtctgtgaa accgtcaaca acatcattgg tagaactgtc 4920
aatccaagaa acaaaaattg gtcctgtggt ggttcttctg gtggtgaagg tgctattgtt 4980
ggtattagag gtggtgttat tggtgtcggt actgacattg gtggttccat tagagtccca 5040
gctgctttca actttttata cggtttgaga ccatctcacg gtagattgcc atatgctaaa 5100
atggctaact ctatggaagg tcaagaaacc gttcactccg tcgttggtcc tatcactcac 5160
tccgtcgaag acttgagatt gttcaccaaa tctgtcttgg gtcaagaacc ttggaagtac 5220
gactctaagg tcatccccat gccatggaga caatctgaat ctgacatcat tgcctctaag 5280
attaagaatg gtggtttgaa cattggttat tacaatttcg acggtaacgt cttgccacac 5340
ccaccaattt tacgtggtgt cgaaactacc gttgccgctt tggc 5384
<210> 35
<211> 5533
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 35
ggccgcgaag gtgctattgt tggtattaga ggtggtgtta ttggtgtcgg tactgacatt 60
ggtggttcca ttagagtccc agctgctttc aactttttat acggtttgag accatctcac 120
ggtagattgc catatgctaa aatggctaac tctatggaag gtcaagaaac cgttcactcc 180
gtcgttggtc ctatcactca ctccgtcgaa gacttgagat tgttcaccaa atctgtcttg 240
ggtcaagaac cttggaagta cgactctaag gtcatcccaa tgccatggag acaatctgaa 300
tctgacatca ttgcctctaa gattaagaat ggtggtttga acattggtta ttacaatttc 360
gacggtaacg tcttgccaca cccaccaatt ttacgtggtg tcgaaactac cgttgccgct 420
ttggccaagg ctggtcacac cgttactcca tggactccat acaagcatga tttcggtcat 480
gacttgattt cccacatcta tgctgctgat ggttctgccg acgtcatgag agacatttct 540
gcctctggtg agccagccat ccctaacatt aaggacttgt tgaacccaaa tattaaggct 600
gttaacatga acgaattgtg ggacactcat ttacaaaagt ggaactatca aatggaatac 660
ttggaaaagt ggcgtgaagc tgaagaaaaa gctggtaagg aattggacgc tattatcgct 720
ccaattactc ctaccgccgc tgtcagacac gatcaattca gatactacgg ttacgcctcc 780
gttattaact tattggattt cacctctgtt gtcgtcccag tcactttcgc tgataagaat 840
attgataaga agaacgaatc ttttaaagct gtttccgaat tggatgcttt ggttcaagaa 900
gaatacgacc cagaggctta tcacggtgct cctgttgctg ttcaagttat tggtagaaga 960
ttgtccgaag agagaacttt ggctatcgcc gaagaagtcg gtaaattgtt gggtaacgtc 1020
gtcactccat aagcgaattt cttatgattt atgattttta ttattaaata agttataaaa 1080
aaaataagtg tatacaaatt ttaaagtgac tcttaggttt taaaacgaaa attcttattc 1140
ttgagtaact ctttcctgta ggtcaggttg ctttctcagg tatagcatga ggtcgctctt 1200
attgaccaca cctctaccgg catgccgagc aaatgcctgc aaatcgctcc ccatttcacc 1260
caattgtaga tatgctaact ccagcaatga gttgatgaat ctcggtgtgt attttatgtc 1320
ctcagaggac aacacataac ttcgtataat gtatgctata cgaagttatc tcgagatctc 1380
ccctaaaccg tggaatattt cggatatcct tttgttgttt ccgggtgtac aatatggact 1440
tcctcttttc tggcaaccaa acccatacat cgggattcct ataatacctt cgttggtctc 1500
cctaacatgt aggtggcgga ggggagatat acaatagaac agataccaga caagacataa 1560
tgggctaaac aagactacac caattacact gcctcattga tggtggtaca taacgaacta 1620
atactgtagc cctagacttg atagccatca tcatatcgaa gtttcactac cctttttcca 1680
tttgccatct attgaagtaa taataggcgc atgcaacttc ttttcttttt ttttcttttc 1740
tctctccccc gttgttgtct caccatatcc gcaatgacaa aaaaatgatg gaagacacta 1800
aaggaaaaaa ttaacgacaa agacagcacc aacagatgtc gttgttccag agctgatgag 1860
gggtatctcg aagcacacga aactttttcc ttccttcatt cacgcacact actctctaat 1920
gagcaacggt atacggcctt ccttccagtt acttgaattt gaaataaaaa aaagtttgct 1980
gtcttgctat caagtataaa tagacctgca attattaatc ttttgtttcc tcgtcattgt 2040
tctcgttccc tttcttcctt gtttcttttt ctgcacaata tttcaagcta taccaagcat 2100
acaatcaact atctcatata catctagaat gttcaaaagc gtggtttact ccattttggc 2160
tgcatctttg gcaaatgcct cagttatttc caagcgtgct acgttagata gttggttaag 2220
taatgaggca accgtagcca gaaccgcaat attgaacaat attggtgccg atggggcctg 2280
ggttagcggc gcagattccg gtatcgtagt tgcaagcccc tccactgata accccgatta 2340
cttctacact tggactaggg actccggcat cgttttgaag actctggttg acttatttag 2400
aaatggcgat acggatctac ttagcacgat agaacactat atcagttccc aagctattat 2460
acagggtgtt tctaacccca gtggagattt atcaagcgga ggccttggtg agcctaaatt 2520
caacgttgat gagactgcat atgcaggatc ttggggcaga ccacaaagag atggtccagc 2580
tttaagagct actgccatga taggtttcgg gcaatggttg ttggacaatg gatacacttc 2640
agcagcaact gaaatcgtct ggccgttggt cagaaatgat ttaagctatg tcgctcagta 2700
ttggaaccaa acaggctacg atttgtggga agaggtcaat ggttcttctt tctttaccat 2760
tgccgttcag cacagagccc tagtggaagg ttctgctttc gcaacagctg tggggtcttc 2820
ttgctcatgg tgtgattctc aagctcctca gatactgtgt tatttacagt cattctggac 2880
tggttcctac attcttgcta acttcgattc ttccagaagc ggcaaagata ctaacacttt 2940
gcttggcagc attcacactt ttgatcccga agctgggtgc gacgattcta ctttccaacc 3000
atgttcacca agggcgctgg ctaatcataa ggaagttgtc gattctttta gatctatcta 3060
taccttgaat gacggtctat cagattccga agccgtggct gtgggaaggt atcccgagga 3120
ttcatactac aatggtaacc catggtttct ttgcacattg gctgcagcgg aacaactgta 3180
tgatgctctt taccaatggg acaagcaagg ctctctggag atcacagatg ttagtctgga 3240
tttctttaag gctttgtact caggtgcagc caccggtaca tatagctctt caagttcaac 3300
ctatagctct atagtatccg ccgtgaagac ctttgcagat ggtttcgtta gcattgttga 3360
aacacacgct gcatcaaatg gttcactgag tgaacagttt gataaatccg acggcgatga 3420
attgagtgcg agagatttga cttggtctta tgcggctctt cttactgcta acaatagacg 3480
taatagtgtt gttccaccct cttggggtga aactagcgct agtagtgtcc cagggacttg 3540
cgcagctaca agtgcatctg gcacctacag ctcagtgact gttacatcct ggccaagtat 3600
agttgctacc ggtggcacta caaccactgc aaccactact ggtagcggag gtgttacttc 3660
aacttctaaa accacgacaa ctgcttctaa gacgtcaacg accacatcct caacaagctg 3720
tactacacct acagcggttg cagttacatt cgacctaacc gccacgacga cctacgggga 3780
aaatatatat ttggttggaa gtatctctca attaggggat tgggaaacgt ctgatggaat 3840
tgccctaagt gcagataaat atacatcttc taacccgcct tggtatgtta ccgttacatt 3900
gccagcaggc gaatcctttg aatataaatt cataagagtc gaatctgatg attctgttga 3960
atgggagtca gacccaaatc gtgagtatac tgtacctcag gcctgcggtg aaagcacagc 4020
tactgtgacc gatacttgga ggtagttaat taatttacca gcttactatc cttcttgaaa 4080
atatgcactc tatatctttt agttcttaat tgcaacacat agatttgctg tataacgaat 4140
tttatgctat ttttttaatt tggagttcgg tgatgaaagt gtcacagcga atttcctcac 4200
atgtagggac cgaattgttt acaagttctc tgtaccacca tggagacatc aaagattgaa 4260
aatctatgga aagatatgga cggtagcaac aagaatatag cacgagccgc ggagttcatt 4320
tcgttacttt tgatatcgct cacaactatt gcgaagcgct tcagtgaaaa aatcataagg 4380
aaaagttgta aatattattg gtagtattcg tttggtaaag tagagggggt aatttttccc 4440
ctttattttg ttcatacatt cttaaattgc tttgcctctc cttttggaaa gctatacttc 4500
ggagcactgt tgagcgaagg ctcaggccgg ccttatagcc tagctttaag gctactttaa 4560
aaacttttta tttattcata cacatatatt atcgaacatt cgtataactt aatatcattc 4620
aaaaaaaaaa aaaaaaaaaa aagaaaacat atacacatat atatttatgt ttatagagag 4680
agagagagaa aatttgaatt tttgaatcat ttgcaaagtt atatgtttta tacattattt 4740
attcattttt tttggtgtcg aggacattgt gctgttcaga gaaccactta aaatacgcat 4800
cgttctgtaa atatccactt tcattaaaaa ccttattcac ttctaacttt gccttcaact 4860
ccttcttgga gttttctccc ttttttttct gaacaagctc aaccagatat aatggttcgt 4920
tcttttcgaa ctttgtcttt acatatattt cctcctttgt acctcttctc tttcccacat 4980
aaacagtccc cttttcaata aaacgagaga aataccagaa aagtagcgag agaacaaaat 5040
atgcgcctac caaaagcttt tgatacgtaa caatctgatc tctctcaaat tttttatcca 5100
agaagaaact caaaccagct acaacagcta tggaataacc tatgtacaat ttagcatcga 5160
gtaaagcgta tgatctctcg taatttaatc tcgcgaaaac agaaggtagg gcttcatcta 5220
aagcttggtt caactccggg attgaatata cattaatagg tttagcagaa ctcatcttga 5280
acaggcgtct cttttcctta caataacttg tgcttttcct tctataattc cgtttcaacg 5340
tgtacaattg tcattttttg tctggtatga ttttgcagaa ctgaaaaaat ctcttaaatg 5400
ttccgcctca tcaagaaggc atattccttt acaaaagtac attgatctta caagaagcta 5460
gctaatggta ctatttaaaa aacaactaca ctccatcaat acataaaatt gttatgatag 5520
acttgaggga cgg 5533
<210> 36
<211> 4881
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 36
cagagcctct tatattcact ctgttcctcc atcgcctatt gagaaacgtt ggaataaaac 60
tctaaaaata tcatctagtt ggttagtttt tattttacca gtacattgtc acttgcggag 120
ggaggatgac ataaagattg agacgcagtc atttaatgaa gtttaaacgc aggtatttga 180
taaagtaata cgatattgaa tcatgacgta taaagtgaaa tgaacaaatg attacgtaaa 240
aaatgtcgat tttctcttga gagactccca tagcctctaa gaggccttct actacgttcc 300
atatatctaa gaatggggcc atatccagtg gaatcccagc aattatttaa ggatcaccta 360
tttctcagcc gatattttag caaaatcact accaatatca gggggcaata gttgatcgcc 420
tactttaaca aaaaatgttg ctcacgtatt aacacaggca acaaaaagga tattacgcaa 480
gaacgtagta tccacatgcc atcctccttg ttgcatcttt ttttttccga aatgattccc 540
tttcctgcac aacacgagat ctttcacgca tacatcggaa ggatcacccc ccactcaagt 600
cgttgcattg ctaacatgtg gcattctgcc catttttttc acgaaaattc tctctctata 660
atgaagaccc ttgtgccctg gactctgtaa tacttgaaac tacttcctca ataatcgctt 720
ggagacctac ccccacgctt ttcaaacaag gcgctagcaa aaagcctgcc gatatctcct 780
tgccccctcc ttctgttcga gagaactacg acccgaccaa taataatgtc atacaagaac 840
cgccaagaac caactgctga accttagatc tccaatactt cagttggagt atgtgaatat 900
ataagtacct ggtcgactaa tcttcttgca tcttttcgta ttcttacatc ctatgtcgct 960
aatacagttc ccgcatagag aagaaagcaa acaaaagtag tcactcgaga tctcccgagt 1020
ttatcattat caatactgcc atttcaaaga atacgtaaat aattaatagt agtgattttc 1080
ctaactttat ttagtcaaaa aattggcctt ttaattctgc tgtaacccgt acatgcccaa 1140
aatagggggc gggttacaca gaatatataa catcataggt gtctgggtga acagtttatt 1200
cctggcatcc actaaatata atggagcccg ctttttttaa gctggcatcc agaaaaaaaa 1260
agaatcccag caccaaaata ttgttttctt caccaaccat cagttcatag gtccattctc 1320
ttagcgcaac tacacagaac aggggcacaa acaggcaaaa aacgggcaca acctcaatgg 1380
agtgatgcaa cctgcttgga gtaaatgatg acacaaggca attgacctac gcatgtatct 1440
atctcatttt cttacacctt ctattacctt ctgctctctc tgatttggaa aaagctgaaa 1500
aaaaaggttg aaaccagttc cctgaaatta ttcccctatt tgactaataa gtatataaag 1560
acggtaggta ttgattgtaa ttctgtaaat ctatttctta aacttcttaa attctacttt 1620
tatagttagt ctttttttta gtttaaaaca ccaagaactt agtttcgaat aaacacacat 1680
aaacaaacaa atctagaatg aagttcattt ccactttctt gaccttcatt ttggctgctg 1740
tctctgtcac cgctgcatct attccatcta gtgcatctgt acaattggac tcctacaatt 1800
acgatggttc cacattttcc ggcaagattt atgtcaaaaa catcgcttac tctaaaaagg 1860
ttactgttgt gtacgcagac ggttctgaca actggaacaa taacggcaac actattgctg 1920
catcattttc aggcccaatc tctggatcaa attacgaata ctggacattc tcagcatcag 1980
tgaagggcat aaaggagttc tacatcaaat acgaagtttc aggtaagaca tattacgaca 2040
ataacaactc tgcaaactac caagtctcaa cttctaaacc tactacaact actgcagcta 2100
caaccacaac tacagctcca tcaacttcta caacaacccg tccatctagt tcagagcctg 2160
ccaccttccc tactggtaat tctaccatca gctcttggat caaaaagcag gaagatattt 2220
ccagattcgc tatgcttaga aacatcaacc cacctggttc tgccacaggg tttatcgccg 2280
catcactctc taccgctggt ccagattact actacgcgtg gacaagagat gccgctttga 2340
catctaacgt tatcgtttac gaatacaaca ccacattgtc tgggaataag acaattctaa 2400
acgtacttaa ggattacgtc acattcagtg ttaagacaca gtctacttca acagtttgta 2460
attgccttgg tgaaccaaag ttcaatccag acggcagtgg ttacacaggt gcttggggta 2520
gacctcaaaa tgatggtcct gcagaaagag cgactacatt tgttctgttt gccgacagct 2580
acttgactca aactaaggat gcctcatacg tcactggtac attaaagcca gcaattttca 2640
aagatctcga ttacgttgtt aacgtctgga gtaacggatg tttcgattta tgggaggagg 2700
tgaacggagt tcatttctac acccttatgg ttatgagaaa agggctattg ttgggggctg 2760
atttcgcgaa gagaaacggt gactcaacta gagcctcaac ttactcttct actgcttcca 2820
caattgctaa caagatatca agtttctggg ttagctcaaa caactgggtg caagtatccc 2880
aatctgtcac aggaggtgta agtaaaaagg ggttagacgt tagcaccctg ttagctgcga 2940
atctaggatc agtcgatgat ggatttttca ctccaggttc tgaaaagata ttagctacag 3000
ctgtggcagt cgaagattcc tttgccagtc tatacccaat caacaaaaac cttccatcat 3060
acttggggaa cgctattgga agataccctg aagatacata caacggtaat ggtaactcac 3120
aaggcaatcc ttggtttctg gcggttaccg gctacgcaga gttgtactat agagcaatta 3180
aggaatggat ttctaatgga ggcgttacag tgtcctctat ctcattgcca tttttcaaaa 3240
agttcgatag ctctgcaaca tccggtaaaa agtacaccgt aggtacttct gacttcaaca 3300
atttagcaca aaacattgct cttgctgcag atcgtttcct atctactgta caactccatg 3360
caccaaacaa tggttcatta gcagaggaat ttgatagaac aacaggtttt tctaccggcg 3420
ctagagattt aacatggtcc cacgcctcat tgataacagc atcctatgcc aaagccggtg 3480
ctccagctgc ataattaatt aaacaggccc cttttccttt gtcgatatca tgtaattagt 3540
tatgtcacgc ttacattcac gccctcctcc cacatccgct ctaaccgaaa aggaaggagt 3600
tagacaacct gaagtctagg tccctattta tttttttata gttatgttag tattaagaac 3660
gttatttata tttcaaattt ttcttttttt tctgtacaaa cgcgtgtacg catgtaacgg 3720
gcagacggcc ggccataact tcgtataatg tatgctatac gaagttatgg caacggttca 3780
tcatctcatg gatctgcaca tgaacaaaca ccagagtcaa acgacgttga aattgaggct 3840
actgcgccaa ttgatgacaa tacagacgat gataacaaac cgaagttatc tgatgtagaa 3900
aaggattaga gatgctaaga gatagtgatg atatttcata aataatgtaa ttctatatat 3960
gttaattacc ttttttgcga ggcatattta tggtgaagga taagttttga ccatcaaaga 4020
aggttaatgt ggctgtggtt tcagggtcca taaagctttt caattcatct tttttttttt 4080
tgttcttttt tttgattccg gtttctttga aatttttttg attcggtaat ctccgagcag 4140
aaggaagaac gaaggaagga gcacagactt agattggtat atatacgcat atgtggtgtt 4200
gaagaaacat gaaattgccc agtattctta acccaactgc acagaacaaa aacctgcagg 4260
aaacgaagat aaatcatgtc gaaagctaca tataaggaac gtgctgctac tcatcctagt 4320
cctgttgctg ccaagctatt taatatcatg cacgaaaagc aaacaaactt gtgtgcttca 4380
ttggatgttc gtaccaccaa ggaattactg gagttagttg aagcattagg tcccaaaatt 4440
tgtttactaa aaacacatgt ggatatcttg actgattttt ccatggaggg cacagttaag 4500
ccgctaaagg cattatccgc caagtacaat tttttactct tcgaagacag aaaatttgct 4560
gacattggta atacagtcaa attgcagtac tctgcgggtg tatacagaat agcagaatgg 4620
gcagacatta cgaatgcaca cggtgtggtg ggcccaggta ttgttagcgg tttgaagcag 4680
gcggcggaag aagtaacaaa ggaacctaga ggccttttga tgttagcaga attgtcatgc 4740
aagggctccc tagctactgg agaatatact aagggtactg ttgacattgc gaagagcgac 4800
aaagattttg ttatcggctt tattgctcaa agagacatgg gtggaagaga tgaaggttac 4860
gattggttga ttatgacacg c 4881
<210> 37
<211> 4921
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 37
ggccgctcca tggagggcac agttaagccg ctaaaggcat tatccgccaa gtacaatttt 60
ttactcttcg aagacagaaa atttgctgac attggtaata cagtcaaatt gcagtactct 120
gcgggtgtat acagaatagc agaatgggca gacattacga atgcacacgg tgtggtgggc 180
ccaggtattg ttagcggttt gaagcaggcg gcggaagaag taacaaagga acctagaggc 240
cttttgatgt tagcagaatt gtcatgcaag ggctccctag ctactggaga atatactaag 300
ggtactgttg acattgcgaa gagcgacaaa gattttgtta tcggctttat tgctcaaaga 360
gacatgggtg gaagagatga aggttacgat tggttgatta tgacacccgg tgtgggttta 420
gatgacaagg gagacgcatt gggtcaacag tatagaaccg tggatgatgt ggtctctaca 480
ggatctgaca ttattattgt tggaagagga ctatttgcaa agggaaggga tgctaaggta 540
gagggtgaac gttacagaaa agcaggctgg gaagcatatt tgagaagatg cggccagcaa 600
aactaaaaaa ctgtattata agtaaatgca tgtatactaa actcacaaat tagagcttca 660
atttaattat atcagttatt acccgggaat ctcggtcgta atgattttta taatgacgaa 720
aaaaaaaaaa ttggaaagaa aaagcttcat ggcctttata aaaaggaacc atccaatacc 780
tcgccagaac caagtaacag tattttacgg ggcacaaatc aagaacaata agacaggact 840
gtaaagatgg acgcattgaa ctccaaagaa caacaagagt tccaaaaagt agtggaacaa 900
aagcaaatga aggatttcat gcgtttgata acttcgtata atgtatgcta tacgaagtta 960
tctcgagatc tcccctaaac cgtggaatat ttcggatatc cttttgttgt ttccgggtgt 1020
acaatatgga cttcctcttt tctggcaacc aaacccatac atcgggattc ctataatacc 1080
ttcgttggtc tccctaacat gtaggtggcg gaggggagat atacaataga acagatacca 1140
gacaagacat aatgggctaa acaagactac accatttaca ctgcctcatt gatggtggta 1200
cataacgaac taatactgta gccctagact tgatagccat catcatatcg aagtttcact 1260
accctttttc catttgccat ctattgaagt aataataggc gcatgcaact tctttttttt 1320
tttttttttt ctctctcccc cgttgttgtc tcaccatatc cgcaatgaca aaaaaatgat 1380
ggaagacact aaaggaaaaa attaacgaca aagacagcac caacagatgt cgttgttcca 1440
gagctgatga ggggtatctc gaagcacacg aaactttttc cttccttcat tcacgcacac 1500
tactctctaa tgagcaacgg tatacggcct tccttccagt tacttgaatt tgaaataaaa 1560
aaaagtttgc tgtcttgcta tcaagtataa atagacctgc aattattaat cttttgtttc 1620
ctcgtcattg ttctcgttcc ctttcttcct tgtttctttt tctgcacaat atttcaagct 1680
ataccaagca tacaatcaac tatctcatat acatctagaa tgaagtttat ctccacgttt 1740
ttaaccttta tcctagcagc tgtcagcgtc accgccgcat caattccgag ttcagcatct 1800
gtacaacttg actcttacaa ttacgatggc agcactttct cagggaaaat ttatgtgaaa 1860
aacatagcat atagtaagaa ggttaccgtg gtatatgcag acggttctga taattggaat 1920
aataatggaa acactattgc cgccagtttt tccggcccaa tttctggttc caattacgag 1980
tattggacct tttctgcatc agtaaaaggc atcaaggaat tctatattaa gtacgaagtt 2040
tcaggtaaga catattacga taacaataac tcagcaaatt atcaagtctc tacatctaag 2100
cccacaacaa caactgctgc taccaccact acaaccgctc cttctaccag caccactacc 2160
agaccaagct ctagtgaacc ggctaccttt cctaccggaa acagtaccat ctcaagctgg 2220
atcaaaaagc aagaggacat aagtcgtttt gctatgttga ggaacattaa tcctccagga 2280
tccgcgaccg gtttcattgc agcatcacta agtactgccg ggcctgatta ttattatgct 2340
tggactagag acgctgcatt aacatcaaac gtgattgttt atgaatataa tacgaccctt 2400
tccggtaata aaacgatctt gaacgtatta aaagactatg tgacctttag tgtgaagacc 2460
caatctacat ctacagtgtg taattgtttg ggagaaccta aattcaatcc agacggttct 2520
gggtacactg gtgcctgggg tagacctcaa aacgacggtc cagcagaaag agcaacaacc 2580
tttgttctat ttgctgactc ttatttaacg caaacaaagg acgcctcata tgttacaggg 2640
accctaaaac cagcaatttt caaagacttg gattatgttg ttaatgtttg gagcaacgga 2700
tgttttgact tgtgggagga ggttaacggt gtacactttt atacattgat ggtgatgaga 2760
aaagggttgc tattgggagc agatttcgct aaaagaaatg gtgattctac aagagcgagc 2820
acatatagta gcaccgcttc aacaatcgcc aataaaatct catctttctg ggtatctagc 2880
aacaactggg tacaagtttc ccaaagtgtt accggcggtg tgtccaaaaa gggtttagac 2940
gttagcacac ttctagctgc taatttgggt agcgttgatg acgggttttt tactccaggt 3000
agtgagaaga tactggcaac cgcggtggcg gttgaagaca gctttgcttc attgtatcct 3060
ataaataaaa atctgccctc ttatctgggt aatgcaattg gcagataccc agaagatacc 3120
tacaatggta atggtaattc ccaggggaac ccatggtttt tggctgttac aggctacgca 3180
gaactttatt accgtgcaat caaggaatgg atttcaaatg gcggcgtcac tgtcagtagt 3240
ataagtttgc ccttttttaa gaaatttgat tcctcagcaa cgtctggtaa aaaatacacc 3300
gtaggtacta gtgatttcaa taatttggcc caaaatattg cgcttgctgc tgacaggttt 3360
cttagtaccg ttcagttgca cgctccaaat aatggctcat tggctgaaga atttgatcgt 3420
acgacaggtt tctccactgg tgctagggat ttgacttgga gtcatgcctc cttaatcaca 3480
gcaagctatg ctaaagctgg tgcacctgct gcttagttaa ttaatttacc agcttactat 3540
ccttcttgaa aatatgcact ctatatcttt tagttcttaa ttgcaacaca tagatttgct 3600
gtataacgaa ttttatgcta tttttttaat ttggagttcg gtgatgaaag tgtcacagcg 3660
aatttcctca catgtaggga ccgaattgtt tacaagttct ctgtaccacc atggagacat 3720
caaagattga aaatctatgg aaagatatgg acggtagcaa caagaatata gcacgagccg 3780
cggagttcat ttcgttactt ttgatatcgc tcacaactat tgcgaagcgc ttcagtgaaa 3840
aaatcataag gaaaagttgt aaatattatt ggtagtattc gtttggtaaa gtagaggggg 3900
taatttttcc cctttatttt gttcatacat tcttaaattg ctttgcctct ccttttggaa 3960
agctatactt cggagcactg ttgagcgaag gctcaggccg gccttatagc ctagctttaa 4020
ggctacttta aaaacttttt atttattcat acacatatat tatcgaacat tcgtataact 4080
taatatcatt caaaaaaaaa aaaaaaaaaa gaaaacatat acacatatat atttatgttt 4140
atagagagag agaaaatttg aatttttgaa tcatttgcaa agttatatgt tttatacatt 4200
atttattcat tttttttggt gtcgaggaca ttgtgctgtt cagagaacca cttaaaatac 4260
gcatcgttct gtaaatatcc actttcatta aaaaccttat tcacttctaa ctttgccttc 4320
aactccttct tggagttttc tccctttttt ttctgaacaa gctcaaccag atataatggt 4380
tcgttctttt cgaactttgt ctttacatat atttcctcct ttgtacctct tctctttccc 4440
acataaacag tccccttttc aataaaacga gagaaatacc agaaaagtag cgagagaaca 4500
aaatatgcgc ctaccaaaag cttttgatac gtaacaatct gatctctctc aaatttttta 4560
tccaagaaga aactcaaacc agctacaaca gctatggaat aacctatgta caatttagca 4620
tcgagtaaag cgtatgatct ctcgtaattt aatctcgcga aaacagaagg tagggcttca 4680
tctaaagctt ggttcaactc cgggattgaa tatacattaa taggtttagc agaactcatc 4740
ttgaacaggc gtctcttttc cttacaataa cttgtgcttt tccttctata attccgtttc 4800
aacgtgtaca attgtcattt tttgtctggt atgattttgc agaactgaaa aaatctctta 4860
aatgttccgc ctcatcaaga aggcatattc ctttacaaaa gtacattgat cttacaagaa 4920
g 4921
<210> 38
<211> 5264
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 38
cagagcctct tatattcact ctgttcctcc atcgcctatt gagaaacgtt ggaataaaac 60
tctaaaaata tcatctagtt ggttagtttt tattttacca gtacattgtc acttgcggag 120
ggaggatgac ataaagattg agacgcagtc atttaatgaa gtttaaacgc aggtatttga 180
taaagtaata cgatattgaa tcatgacgta taaagtgaaa tgaacaaatg attacgtaaa 240
aaatgtcgat tttctcttga gagactccca tagcctctaa gaggccttct actacgttcc 300
atatatctaa gaatggggcc atatccagtg gaatcccagc aattatttaa ggatcaccta 360
tttctcagcc gatattttag caaaatcact accaatatca gggggcaata gttgatcgcc 420
tactttaaca aaaaatgttg ctcacgtatt aacacaggca acaaaaagga tattacgcaa 480
gaacgtagta tccacatgcc atcctccttg ttgcatcttt ttttttccga aatgattccc 540
tttcctgcac aacacgagat ctttcacgca tacatcggaa ggatcacccc ccactcaagt 600
cgttgcattg ctaacatgtg gcattctgcc catttttttc acgaaaattc tctctctata 660
atgaagaccc ttgtgccctg gactctgtaa tacttgaaac tacttcctca ataatcgctt 720
ggagacctac ccccacgctt ttcaaacaag gcgctagcaa aaagcctgcc gatatctcct 780
tgccccctcc ttctgttcga gagaactacg acccgaccaa taataatgtc atacaagaac 840
cgccaagaac caactgctga accttagatc tccaatactt cagttggagt atgtgaatat 900
ataagtacct ggtcgactaa tcttcttgca tcttttcgta ttcttacatc ctatgtcgct 960
aatacagttc ccgcatagag aagaaagcaa acaaaagtag tcactcgaga tctcccgagt 1020
ttatcattat caatactgcc atttcaaaga atacgtaaat aattaatagt agtgattttc 1080
ctaactttat ttagtcaaaa aattggcctt ttaattctgc tgtaacccgt acatgcccaa 1140
aatagggggc gggttacaca gaatatataa catcataggt gtctgggtga acagtttatt 1200
cctggcatcc actaaatata atggagcccg ctttttttaa gctggcatcc agaaaaaaaa 1260
agaatcccag caccaaaata ttgttttctt caccaaccat cagttcatag gtccattctc 1320
ttagcgcaac tacacagaac aggggcacaa acaggcaaaa aacgggcaca acctcaatgg 1380
agtgatgcaa cctgcttgga gtaaatgatg acacaaggca attgacctac gcatgtatct 1440
atctcatttt cttacacctt ctattacctt ctgctctctc tgatttggaa aaagctgaaa 1500
aaaaaggttg aaaccagttc cctgaaatta ttcccctatt tgactaataa gtatataaag 1560
acggtaggta ttgattgtaa ttctgtaaat ctatttctta aacttcttaa attctacttt 1620
tatagttagt ctttttttta gtttaaaaca ccaagaactt agtttcgaat aaacacacat 1680
aaacaaacaa atctagaatg aagttcattt ccactttctt gaccttcatt ttggctgctg 1740
tctctgtcac cgctgcatct attccatcta gtgcatctgt acaattggac tcctacaatt 1800
acgatggttc cacattttcc ggcaagattt atgtcaaaaa catcgcttac tctaaaaagg 1860
ttactgttgt gtacgcagac ggttctgaca actggaacaa taacggcaac actattgctg 1920
catcattttc aggcccaatc tctggatcaa attacgaata ctggacattc tcagcatcag 1980
tgaagggcat aaaggagttc tacatcaaat acgaagtttc aggtaagaca tattacgaca 2040
ataacaactc tgcaaactac caagtctcaa cttctaaacc tactacaact actgcagcta 2100
caaccacaac tacagctcca tcaacttcta caacaacccg tccatctagt tcagagcctg 2160
ccaccttccc tactggtaat tctaccatca gctcttggat caaaaagcag gaagatattt 2220
ccagattcgc tatgcttaga aacatcaacc cacctggttc tgccacaggg tttatcgccg 2280
catcactctc taccgctggt ccagattact actacgcgtg gacaagagat gccgctttga 2340
catctaacgt tatcgtttac gaatacaaca ccacattgtc tgggaataag acaattctaa 2400
acgtacttaa ggattacgtc acattcagtg ttaagacaca gtctacttca acagtttgta 2460
attgccttgg tgaaccaaag ttcaatccag acggcagtgg ttacacaggt gcttggggta 2520
gacctcaaaa tgatggtcct gcagaaagag cgactacatt tgttctgttt gccgacagct 2580
acttgactca aactaaggat gcctcatacg tcactggtac attaaagcca gcaattttca 2640
aagatctcga ttacgttgtt aacgtctgga gtaacggatg tttcgattta tgggaggagg 2700
tgaacggagt tcatttctac acccttatgg ttatgagaaa agggctattg ttgggggctg 2760
atttcgcgaa gagaaacggt gactcaacta gagcctcaac ttactcttct actgcttcca 2820
caattgctaa caagatatca agtttctggg ttagctcaaa caactgggtg caagtatccc 2880
aatctgtcac aggaggtgta agtaaaaagg ggttagacgt tagcaccctg ttagctgcga 2940
atctaggatc agtcgatgat ggatttttca ctccaggttc tgaaaagata ttagctacag 3000
ctgtggcagt cgaagattcc tttgccagtc tatacccaat caacaaaaac cttccatcat 3060
acttggggaa cgctattgga agataccctg aagatacata caacggtaat ggtaactcac 3120
aaggcaatcc ttggtttctg gcggttaccg gctacgcaga gttgtactat agagcaatta 3180
aggaatggat ttctaatgga ggcgttacag tgtcctctat ctcattgcca tttttcaaaa 3240
agttcgatag ctctgcaaca tccggtaaaa agtacaccgt aggtacttct gacttcaaca 3300
atttagcaca aaacattgct cttgctgcag atcgtttcct atctactgta caactccatg 3360
caccaaacaa tggttcatta gcagaggaat ttgatagaac aacaggtttt tctaccggcg 3420
ctagagattt aacatggtcc cacgcctcat tgataacagc atcctatgcc aaagccggtg 3480
ctccagctgc ataattaatt aaacaggccc cttttccttt gtcgatatca tgtaattagt 3540
tatgtcacgc ttacattcac gccctcctcc cacatccgct ctaaccgaaa aggaaggagt 3600
tagacaacct gaagtctagg tccctattta tttttttata gttatgttag tattaagaac 3660
gttatttata tttcaaattt ttcttttttt tctgtacaaa cgcgtgtacg catgtaacgg 3720
gcagacggcc ggccataact tcgtataatg tatgctatac gaagttatcc ttacatcaca 3780
cccaatcccc cacaagtgat cccccacaca ccatagcttc aaaatgtttc tactcctttt 3840
ttactcttcc agattttctc ggactccgcg catcgccgta ccacttcaaa acacccaagc 3900
acagcatact aaatttcccc tctttcttcc tctagggtgg cgttaattac ccgtactaaa 3960
ggtttggaaa agaaaaaaga gaccgcctcg tttctttttc ttcgtcgaaa aaggcaataa 4020
aaatttttat cacgtttctt tttcttgaaa aatttttttt ttgatttttt tctctttcga 4080
tgacctccca ttgatattta agttaataaa tggtcttcaa tttctcaagt ttcagtttcg 4140
tttttcttgt tctattacaa ctttttttac ttcttgctca ttagaaagaa agcatagcaa 4200
tctaatctaa gttttaatta caaaatgcca caatcctggg aagaattggc cgccgacaaa 4260
cgtgcccgtt tggctaaaac cattcctgac gaatggaagg ttcaaacttt gcctgccgaa 4320
gattccgtta ttgatttccc aaagaagtcc ggtattttgt ctgaggctga attgaagatt 4380
accgaagcct ctgctgctga tttggtctcc aagttggccg ctggtgagtt gacttctgtt 4440
gaagtcactt tggctttttg taagagagct gctattgctc aacaattaac caactgtgct 4500
cacgaattct tcccagatgc tgctttagct caagctagag aattagatga atactacgct 4560
aagcataaga gaccagttgg tccattacac ggtttaccaa tctctttaaa ggaccaattg 4620
cgtgttaagg gttacgaaac ctccatgggt tacatttcct ggttaaacaa atacgatgaa 4680
ggtgattccg tcttaaccac catgttgaga aaagctggtg ctgttttcta cgttaagacc 4740
tctgtcccac aaaccttgat ggtctgtgaa accgtcaaca acatcattgg tagaactgtc 4800
aatccaagaa acaaaaattg gtcctgtggt ggttcttctg gtggtgaagg tgctattgtt 4860
ggtattagag gtggtgttat tggtgtcggt actgacattg gtggttccat tagagtccca 4920
gctgctttca actttttata cggtttgaga ccatctcacg gtagattgcc atatgctaaa 4980
atggctaact ctatggaagg tcaagaaacc gttcactccg tcgttggtcc tatcactcac 5040
tccgtcgaag acttgagatt gttcaccaaa tctgtcttgg gtcaagaacc ttggaagtac 5100
gactctaagg tcatccccat gccatggaga caatctgaat ctgacatcat tgcctctaag 5160
attaagaatg gtggtttgaa cattggttat tacaatttcg acggtaacgt cttgccacac 5220
ccaccaattt tacgtggtgt cgaaactacc gttgccgctt tggc 5264
<210> 39
<211> 5337
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 39
ggccgcgaag gtgctattgt tggtattaga ggtggtgtta ttggtgtcgg tactgacatt 60
ggtggttcca ttagagtccc agctgctttc aactttttat acggtttgag accatctcac 120
ggtagattgc catatgctaa aatggctaac tctatggaag gtcaagaaac cgttcactcc 180
gtcgttggtc ctatcactca ctccgtcgaa gacttgagat tgttcaccaa atctgtcttg 240
ggtcaagaac cttggaagta cgactctaag gtcatcccaa tgccatggag acaatctgaa 300
tctgacatca ttgcctctaa gattaagaat ggtggtttga acattggtta ttacaatttc 360
gacggtaacg tcttgccaca cccaccaatt ttacgtggtg tcgaaactac cgttgccgct 420
ttggccaagg ctggtcacac cgttactcca tggactccat acaagcatga tttcggtcat 480
gacttgattt cccacatcta tgctgctgat ggttctgccg acgtcatgag agacatttct 540
gcctctggtg agccagccat ccctaacatt aaggacttgt tgaacccaaa tattaaggct 600
gttaacatga acgaattgtg ggacactcat ttacaaaagt ggaactatca aatggaatac 660
ttggaaaagt ggcgtgaagc tgaagaaaaa gctggtaagg aattggacgc tattatcgct 720
ccaattactc ctaccgccgc tgtcagacac gatcaattca gatactacgg ttacgcctcc 780
gttattaact tattggattt cacctctgtt gtcgtcccag tcactttcgc tgataagaat 840
attgataaga agaacgaatc ttttaaagct gtttccgaat tggatgcttt ggttcaagaa 900
gaatacgacc cagaggctta tcacggtgct cctgttgctg ttcaagttat tggtagaaga 960
ttgtccgaag agagaacttt ggctatcgcc gaagaagtcg gtaaattgtt gggtaacgtc 1020
gtcactccat aagcgaattt cttatgattt atgattttta ttattaaata agttataaaa 1080
aaaataagtg tatacaaatt ttaaagtgac tcttaggttt taaaacgaaa attcttattc 1140
ttgagtaact ctttcctgta ggtcaggttg ctttctcagg tatagcatga ggtcgctctt 1200
attgaccaca cctctaccgg catgccgagc aaatgcctgc aaatcgctcc ccatttcacc 1260
caattgtaga tatgctaact ccagcaatga gttgatgaat ctcggtgtgt attttatgtc 1320
ctcagaggac aacacataac ttcgtataat gtatgctata cgaagttatc tcgagatctc 1380
ccctaaaccg tggaatattt cggatatcct tttgttgttt ccgggtgtac aatatggact 1440
tcctcttttc tggcaaccaa acccatacat cgggattcct ataatacctt cgttggtctc 1500
cctaacatgt aggtggcgga ggggagatat acaatagaac agataccaga caagacataa 1560
tgggctaaac aagactacac caattacact gcctcattga tggtggtaca taacgaacta 1620
atactgtagc cctagacttg atagccatca tcatatcgaa gtttcactac cctttttcca 1680
tttgccatct attgaagtaa taataggcgc atgcaacttc ttttcttttt ttttcttttc 1740
tctctccccc gttgttgtct caccatatcc gcaatgacaa aaaaatgatg gaagacacta 1800
aaggaaaaaa ttaacgacaa agacagcacc aacagatgtc gttgttccag agctgatgag 1860
gggtatctcg aagcacacga aactttttcc ttccttcatt cacgcacact actctctaat 1920
gagcaacggt atacggcctt ccttccagtt acttgaattt gaaataaaaa aaagtttgct 1980
gtcttgctat caagtataaa tagacctgca attattaatc ttttgtttcc tcgtcattgt 2040
tctcgttccc tttcttcctt gtttcttttt ctgcacaata tttcaagcta taccaagcat 2100
acaatcaact atctcatata catctagaat gaagtttatc tccacgtttt taacctttat 2160
cctagcagct gtcagcgtca ccgccgcatc aattccgagt tcagcatctg tacaacttga 2220
ctcttacaat tacgatggca gcactttctc agggaaaatt tatgtgaaaa acatagcata 2280
tagtaagaag gttaccgtgg tatatgcaga cggttctgat aattggaata ataatggaaa 2340
cactattgcc gccagttttt ccggcccaat ttctggttcc aattacgagt attggacctt 2400
ttctgcatca gtaaaaggca tcaaggaatt ctatattaag tacgaagttt caggtaagac 2460
atattacgat aacaataact cagcaaatta tcaagtctct acatctaagc ccacaacaac 2520
aactgctgct accaccacta caaccgctcc ttctaccagc accactacca gaccaagctc 2580
tagtgaaccg gctacctttc ctaccggaaa cagtaccatc tcaagctgga tcaaaaagca 2640
agaggacata agtcgttttg ctatgttgag gaacattaat cctccaggat ccgcgaccgg 2700
tttcattgca gcatcactaa gtactgccgg gcctgattat tattatgctt ggactagaga 2760
cgctgcatta acatcaaacg tgattgttta tgaatataat acgacccttt ccggtaataa 2820
aacgatcttg aacgtattaa aagactatgt gacctttagt gtgaagaccc aatctacatc 2880
tacagtgtgt aattgtttgg gagaacctaa attcaatcca gacggttctg ggtacactgg 2940
tgcctggggt agacctcaaa acgacggtcc agcagaaaga gcaacaacct ttgttctatt 3000
tgctgactct tatttaacgc aaacaaagga cgcctcatat gttacaggga ccctaaaacc 3060
agcaattttc aaagacttgg attatgttgt taatgtttgg agcaacggat gttttgactt 3120
gtgggaggag gttaacggtg tacactttta tacattgatg gtgatgagaa aagggttgct 3180
attgggagca gatttcgcta aaagaaatgg tgattctaca agagcgagca catatagtag 3240
caccgcttca acaatcgcca ataaaatctc atctttctgg gtatctagca acaactgggt 3300
acaagtttcc caaagtgtta ccggcggtgt gtccaaaaag ggtttagacg ttagcacact 3360
tctagctgct aatttgggta gcgttgatga cgggtttttt actccaggta gtgagaagat 3420
actggcaacc gcggtggcgg ttgaagacag ctttgcttca ttgtatccta taaataaaaa 3480
tctgccctct tatctgggta atgcaattgg cagataccca gaagatacct acaatggtaa 3540
tggtaattcc caggggaacc catggttttt ggctgttaca ggctacgcag aactttatta 3600
ccgtgcaatc aaggaatgga tttcaaatgg cggcgtcact gtcagtagta taagtttgcc 3660
cttttttaag aaatttgatt cctcagcaac gtctggtaaa aaatacaccg taggtactag 3720
tgatttcaat aatttggccc aaaatattgc gcttgctgct gacaggtttc ttagtaccgt 3780
tcagttgcac gctccaaata atggctcatt ggctgaagaa tttgatcgta cgacaggttt 3840
ctccactggt gctagggatt tgacttggag tcatgcctcc ttaatcacag caagctatgc 3900
taaagctggt gcacctgctg cttagttaat taatttacca gcttactatc cttcttgaaa 3960
atatgcactc tatatctttt agttcttaat tgcaacacat agatttgctg tataacgaat 4020
tttatgctat ttttttaatt tggagttcgg tgatgaaagt gtcacagcga atttcctcac 4080
atgtagggac cgaattgttt acaagttctc tgtaccacca tggagacatc aaagattgaa 4140
aatctatgga aagatatgga cggtagcaac aagaatatag cacgagccgc ggagttcatt 4200
tcgttacttt tgatatcgct cacaactatt gcgaagcgct tcagtgaaaa aatcataagg 4260
aaaagttgta aatattattg gtagtattcg tttggtaaag tagagggggt aatttttccc 4320
ctttattttg ttcatacatt cttaaattgc tttgcctctc cttttggaaa gctatacttc 4380
ggagcactgt tgagcgaagg ctcaggccgg ccttatagcc tagctttaag gctactttaa 4440
aaacttttta tttattcata cacatatatt atcgaacatt cgtataactt aatatcattc 4500
aaaaaaaaaa aaaaaaaaaa aagaaaacat atacacatat atatttatgt ttatagagag 4560
agagagagaa aatttgaatt tttgaatcat ttgcaaagtt atatgtttta tacattattt 4620
attcattttt tttggtgtcg aggacattgt gctgttcaga gaaccactta aaatacgcat 4680
cgttctgtaa atatccactt tcattaaaaa ccttattcac ttctaacttt gccttcaact 4740
ccttcttgga gttttctccc ttttttttct gaacaagctc aaccagatat aatggttcgt 4800
tcttttcgaa ctttgtcttt acatatattt cctcctttgt acctcttctc tttcccacat 4860
aaacagtccc cttttcaata aaacgagaga aataccagaa aagtagcgag agaacaaaat 4920
atgcgcctac caaaagcttt tgatacgtaa caatctgatc tctctcaaat tttttatcca 4980
agaagaaact caaaccagct acaacagcta tggaataacc tatgtacaat ttagcatcga 5040
gtaaagcgta tgatctctcg taatttaatc tcgcgaaaac agaaggtagg gcttcatcta 5100
aagcttggtt caactccggg attgaatata cattaatagg tttagcagaa ctcatcttga 5160
acaggcgtct cttttcctta caataacttg tgcttttcct tctataattc cgtttcaacg 5220
tgtacaattg tcattttttg tctggtatga ttttgcagaa ctgaaaaaat ctcttaaatg 5280
ttccgcctca tcaagaaggc atattccttt acaaaagtac attgatctta caagaag 5337
<210> 40
<211> 1684
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 40
ggccgccagt gtgatggata tctgcagaat tcgcccttgc tagcggcaac ggttcatcat 60
ctcatggatc tgcacatgaa caaacaccag agtcaaacga cgttgaaatt gaggctactg 120
cgccaattga tgacaataca gacgatgata acaaaccgaa gttatctgat gtagaaaagg 180
attagagatg ctaagagata gtgatgatat ttcataaata atgtaattct atatatgtta 240
attacctttt ttgcgaggca tatttatggt gaaggataag ttttgaccat caaagaaggt 300
taatgtggct gtggtttcag ggtccataaa gcttttcaat tcatcttttt tttttttgtt 360
cttttttttg attccggttt ctttgaaatt tttttgattc ggtaatctcc gagcagaagg 420
aagaacgaag gaaggagcac agacttagat tggtatatat acgcatatgt ggtgttgaag 480
aaacatgaaa ttgcccagta ttcttaaccc aactgcacag aacaaaaacc tgcaggaaac 540
gaagataaat catgtcgaaa gctacatata aggaacgtgc tgctactcat cctagtcctg 600
ttgctgccaa gctatttaat atcatgcacg aaaagcaaac aaacttgtgt gcttcattgg 660
atgttcgtac caccaaggaa ttactggagt tagttgaagc attaggtccc aaaatttgtt 720
tactaaaaac acatgtggat atcttgactg atttttccat ggagggcaca gttaagccgc 780
taaaggcatt atccgccaag tacaattttt tactcttcga agacagaaaa tttgctgaca 840
ttggtaatac agtcaaattg cagtactctg cgggtgtata cagaatagca gaatgggcag 900
acattacgaa tgcacacggt gtggtgggcc caggtattgt tagcggtttg aagcaggcgg 960
cggaagaagt aacaaaggaa cctagaggcc ttttgatgtt agcagaattg tcatgcaagg 1020
gctccctagc tactggagaa tatactaagg gtactgttga cattgcgaag agcgacaaag 1080
attttgttat cggctttatt gctcaaagag acatgggtgg aagagatgaa ggttacgatt 1140
ggttgattat gacacccggt gtgggtttag atgacaaggg agacgcattg ggtcaacagt 1200
atagaaccgt ggatgatgtg gtctctacag gatctgacat tattattgtt ggaagaggac 1260
tatttgcaaa gggaagggat gctaaggtag agggtgaacg ttacagaaaa gcaggctggg 1320
aagcatattt gagaagatgc ggccagcaaa actaaaaaac tgtattataa gtaaatgcat 1380
gtatactaaa ctcacaaatt agagcttcaa tttaattata tcagttatta cccgggaatc 1440
tcggtcgtaa tgatttttat aatgacgaaa aaaaaaaaat tggaaagaaa aagcttcatg 1500
gcctttataa aaaggaacca tccaatacct cgccagaacc aagtaacagt attttacggg 1560
gcacaaatca agaacaataa gacaggactg taaagatgga cgcattgaac tccaaagaac 1620
aacaagagtt ccaaaaagta gtggaacaaa agcaaatgaa ggatttcatg cgtttgccgc 1680
gggc 1684
<210> 41
<211> 497
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 41
Val Pro Val Glu Leu Asp Lys Arg Asn Thr Gly His Phe Gln Ala Tyr
1 5 10 15
Ser Gly Tyr Thr Val Ala Arg Ser Asn Phe Thr Gln Trp Ile His Glu
20 25 30
Gln Pro Ala Val Ser Trp Tyr Tyr Leu Leu Gln Asn Ile Asp Tyr Pro
35 40 45
Glu Gly Gln Phe Lys Ser Ala Lys Pro Gly Val Val Val Ala Ser Pro
50 55 60
Ser Thr Ser Glu Pro Asp Tyr Phe Tyr Gln Trp Thr Arg Asp Thr Ala
65 70 75 80
Ile Thr Phe Leu Ser Leu Ile Ala Glu Val Glu Asp His Ser Phe Ser
85 90 95
Asn Thr Thr Leu Ala Lys Val Val Glu Tyr Tyr Ile Ser Asn Thr Tyr
100 105 110
Thr Leu Gln Arg Val Ser Asn Pro Ser Gly Asn Phe Asp Ser Pro Asn
115 120 125
His Asp Gly Leu Gly Glu Pro Lys Phe Asn Val Asp Asp Thr Ala Tyr
130 135 140
Thr Ala Ser Trp Gly Arg Pro Gln Asn Asp Gly Pro Ala Leu Arg Ala
145 150 155 160
Tyr Ala Ile Ser Arg Tyr Leu Asn Ala Val Ala Lys His Asn Asn Gly
165 170 175
Lys Leu Leu Leu Ala Gly Gln Asn Gly Ile Pro Tyr Ser Ser Ala Ser
180 185 190
Asp Ile Tyr Trp Lys Ile Ile Lys Pro Asp Leu Gln His Val Ser Thr
195 200 205
His Trp Ser Thr Ser Gly Phe Asp Leu Trp Glu Glu Asn Gln Gly Thr
210 215 220
His Phe Phe Thr Ala Leu Val Gln Leu Lys Ala Leu Ser Tyr Gly Ile
225 230 235 240
Pro Leu Ser Lys Thr Tyr Asn Asp Pro Gly Phe Thr Ser Trp Leu Glu
245 250 255
Lys Gln Lys Asp Ala Leu Asn Ser Tyr Ile Asn Ser Ser Gly Phe Val
260 265 270
Asn Ser Gly Lys Lys His Ile Val Glu Ser Pro Gln Leu Ser Ser Arg
275 280 285
Gly Gly Leu Asp Ser Ala Thr Tyr Ile Ala Ala Leu Ile Thr His Asp
290 295 300
Ile Gly Asp Asp Asp Thr Tyr Thr Pro Phe Asn Val Asp Asn Ser Tyr
305 310 315 320
Val Leu Asn Ser Leu Tyr Tyr Leu Leu Val Asp Asn Lys Asn Arg Tyr
325 330 335
Lys Ile Asn Gly Asn Tyr Lys Ala Gly Ala Ala Val Gly Arg Tyr Pro
340 345 350
Glu Asp Val Tyr Asn Gly Val Gly Thr Ser Glu Gly Asn Pro Trp Gln
355 360 365
Leu Ala Thr Ala Tyr Ala Gly Gln Thr Phe Tyr Thr Leu Ala Tyr Asn
370 375 380
Ser Leu Lys Asn Lys Lys Asn Leu Val Ile Glu Lys Leu Asn Tyr Asp
385 390 395 400
Leu Tyr Asn Ser Phe Ile Ala Asp Leu Ser Lys Ile Asp Ser Ser Tyr
405 410 415
Ala Ser Lys Asp Ser Leu Thr Leu Thr Tyr Gly Ser Asp Asn Tyr Lys
420 425 430
Asn Val Ile Lys Ser Leu Leu Gln Phe Gly Asp Ser Phe Leu Lys Val
435 440 445
Leu Leu Asp His Ile Asp Asp Asn Gly Gln Leu Thr Glu Glu Ile Asn
450 455 460
Arg Tyr Thr Gly Phe Gln Ala Gly Ala Val Ser Leu Thr Trp Ser Ser
465 470 475 480
Gly Ser Leu Leu Ser Ala Asn Arg Ala Arg Asn Lys Leu Ile Glu Leu
485 490 495
Leu
<210> 42
<211> 579
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 42
Ala Ser Ile Pro Ser Ser Ala Ser Val Gln Leu Asp Ser Tyr Asn Tyr
1 5 10 15
Asp Gly Ser Thr Phe Ser Gly Lys Ile Tyr Val Lys Asn Ile Ala Tyr
20 25 30
Ser Lys Lys Val Thr Val Val Tyr Ala Asp Gly Ser Asp Asn Trp Asn
35 40 45
Asn Asn Gly Asn Thr Ile Ala Ala Ser Phe Ser Gly Pro Ile Ser Gly
50 55 60
Ser Asn Tyr Glu Tyr Trp Thr Phe Ser Ala Ser Val Lys Gly Ile Lys
65 70 75 80
Glu Phe Tyr Ile Lys Tyr Glu Val Ser Gly Lys Thr Tyr Tyr Asp Asn
85 90 95
Asn Asn Ser Ala Asn Tyr Gln Val Ser Thr Ser Lys Pro Thr Thr Thr
100 105 110
Thr Ala Ala Thr Thr Thr Thr Thr Ala Pro Ser Thr Ser Thr Thr Thr
115 120 125
Arg Pro Ser Ser Ser Glu Pro Ala Thr Phe Pro Thr Gly Asn Ser Thr
130 135 140
Ile Ser Ser Trp Ile Lys Lys Gln Glu Asp Ile Ser Arg Phe Ala Met
145 150 155 160
Leu Arg Asn Ile Asn Pro Pro Gly Ser Ala Thr Gly Phe Ile Ala Ala
165 170 175
Ser Leu Ser Thr Ala Gly Pro Asp Tyr Tyr Tyr Ala Trp Thr Arg Asp
180 185 190
Ala Ala Leu Thr Ser Asn Val Ile Val Tyr Glu Tyr Asn Thr Thr Leu
195 200 205
Ser Gly Asn Lys Thr Ile Leu Asn Val Leu Lys Asp Tyr Val Thr Phe
210 215 220
Ser Val Lys Thr Gln Ser Thr Ser Thr Val Cys Asn Cys Leu Gly Glu
225 230 235 240
Pro Lys Phe Asn Pro Asp Gly Ser Gly Tyr Thr Gly Ala Trp Gly Arg
245 250 255
Pro Gln Asn Asp Gly Pro Ala Glu Arg Ala Thr Thr Phe Val Leu Phe
260 265 270
Ala Asp Ser Tyr Leu Thr Gln Thr Lys Asp Ala Ser Tyr Val Thr Gly
275 280 285
Thr Leu Lys Pro Ala Ile Phe Lys Asp Leu Asp Tyr Val Val Asn Val
290 295 300
Trp Ser Asn Gly Cys Phe Asp Leu Trp Glu Glu Val Asn Gly Val His
305 310 315 320
Phe Tyr Thr Leu Met Val Met Arg Lys Gly Leu Leu Leu Gly Ala Asp
325 330 335
Phe Ala Lys Arg Asn Gly Asp Ser Thr Arg Ala Ser Thr Tyr Ser Ser
340 345 350
Thr Ala Ser Thr Ile Ala Asn Lys Ile Ser Ser Phe Trp Val Ser Ser
355 360 365
Asn Asn Trp Val Gln Val Ser Gln Ser Val Thr Gly Gly Val Ser Lys
370 375 380
Lys Gly Leu Asp Val Ser Thr Leu Leu Ala Ala Asn Leu Gly Ser Val
385 390 395 400
Asp Asp Gly Phe Phe Thr Pro Gly Ser Glu Lys Ile Leu Ala Thr Ala
405 410 415
Val Ala Val Glu Asp Ser Phe Ala Ser Leu Tyr Pro Ile Asn Lys Asn
420 425 430
Leu Pro Ser Tyr Leu Gly Asn Ala Ile Gly Arg Tyr Pro Glu Asp Thr
435 440 445
Tyr Asn Gly Asn Gly Asn Ser Gln Gly Asn Pro Trp Phe Leu Ala Val
450 455 460
Thr Gly Tyr Ala Glu Leu Tyr Tyr Arg Ala Ile Lys Glu Trp Ile Ser
465 470 475 480
Asn Gly Gly Val Thr Val Ser Ser Ile Ser Leu Pro Phe Phe Lys Lys
485 490 495
Phe Asp Ser Ser Ala Thr Ser Gly Lys Lys Tyr Thr Val Gly Thr Ser
500 505 510
Asp Phe Asn Asn Leu Ala Gln Asn Ile Ala Leu Ala Ala Asp Arg Phe
515 520 525
Leu Ser Thr Val Gln Leu His Ala Pro Asn Asn Gly Ser Leu Ala Glu
530 535 540
Glu Phe Asp Arg Thr Thr Gly Phe Ser Thr Gly Ala Arg Asp Leu Thr
545 550 555 560
Trp Ser His Ala Ser Leu Ile Thr Ala Ser Tyr Ala Lys Ala Gly Ala
565 570 575
Pro Ala Ala
<210> 43
<211> 621
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 43
Ser Val Ile Ser Lys Arg Ala Thr Leu Asp Ser Trp Leu Ser Asn Glu
1 5 10 15
Ala Thr Val Ala Arg Thr Ala Ile Leu Asn Asn Ile Gly Ala Asp Gly
20 25 30
Ala Trp Val Ser Gly Ala Asp Ser Gly Ile Val Val Ala Ser Pro Ser
35 40 45
Thr Asp Asn Pro Asp Tyr Phe Tyr Thr Trp Thr Arg Asp Ser Gly Ile
50 55 60
Val Leu Lys Thr Leu Val Asp Leu Phe Arg Asn Gly Asp Thr Asp Leu
65 70 75 80
Leu Ser Thr Ile Glu His Tyr Ile Ser Ser Gln Ala Ile Ile Gln Gly
85 90 95
Val Ser Asn Pro Ser Gly Asp Leu Ser Ser Gly Gly Leu Gly Glu Pro
100 105 110
Lys Phe Asn Val Asp Glu Thr Ala Tyr Ala Gly Ser Trp Gly Arg Pro
115 120 125
Gln Arg Asp Gly Pro Ala Leu Arg Ala Thr Ala Met Ile Gly Phe Gly
130 135 140
Gln Trp Leu Leu Asp Asn Gly Tyr Thr Ser Ala Ala Thr Glu Ile Val
145 150 155 160
Trp Pro Leu Val Arg Asn Asp Leu Ser Tyr Val Ala Gln Tyr Trp Asn
165 170 175
Gln Thr Gly Tyr Asp Leu Trp Glu Glu Val Asn Gly Ser Ser Phe Phe
180 185 190
Thr Ile Ala Val Gln His Arg Ala Leu Val Glu Gly Ser Ala Phe Ala
195 200 205
Thr Ala Val Gly Ser Ser Cys Ser Trp Cys Asp Ser Gln Ala Pro Gln
210 215 220
Ile Leu Cys Tyr Leu Gln Ser Phe Trp Thr Gly Ser Tyr Ile Leu Ala
225 230 235 240
Asn Phe Asp Ser Ser Arg Ser Gly Lys Asp Thr Asn Thr Leu Leu Gly
245 250 255
Ser Ile His Thr Phe Asp Pro Glu Ala Gly Cys Asp Asp Ser Thr Phe
260 265 270
Gln Pro Cys Ser Pro Arg Ala Leu Ala Asn His Lys Glu Val Val Asp
275 280 285
Ser Phe Arg Ser Ile Tyr Thr Leu Asn Asp Gly Leu Ser Asp Ser Glu
290 295 300
Ala Val Ala Val Gly Arg Tyr Pro Glu Asp Ser Tyr Tyr Asn Gly Asn
305 310 315 320
Pro Trp Phe Leu Cys Thr Leu Ala Ala Ala Glu Gln Leu Tyr Asp Ala
325 330 335
Leu Tyr Gln Trp Asp Lys Gln Gly Ser Leu Glu Ile Thr Asp Val Ser
340 345 350
Leu Asp Phe Phe Lys Ala Leu Tyr Ser Gly Ala Ala Thr Gly Thr Tyr
355 360 365
Ser Ser Ser Ser Ser Thr Tyr Ser Ser Ile Val Ser Ala Val Lys Thr
370 375 380
Phe Ala Asp Gly Phe Val Ser Ile Val Glu Thr His Ala Ala Ser Asn
385 390 395 400
Gly Ser Leu Ser Glu Gln Phe Asp Lys Ser Asp Gly Asp Glu Leu Ser
405 410 415
Ala Arg Asp Leu Thr Trp Ser Tyr Ala Ala Leu Leu Thr Ala Asn Asn
420 425 430
Arg Arg Asn Ser Val Val Pro Pro Ser Trp Gly Glu Thr Ser Ala Ser
435 440 445
Ser Val Pro Gly Thr Cys Ala Ala Thr Ser Ala Ser Gly Thr Tyr Ser
450 455 460
Ser Val Thr Val Thr Ser Trp Pro Ser Ile Val Ala Thr Gly Gly Thr
465 470 475 480
Thr Thr Thr Ala Thr Thr Thr Gly Ser Gly Gly Val Thr Ser Thr Ser
485 490 495
Lys Thr Thr Thr Thr Ala Ser Lys Thr Ser Thr Thr Thr Ser Ser Thr
500 505 510
Ser Cys Thr Thr Pro Thr Ala Val Ala Val Thr Phe Asp Leu Thr Ala
515 520 525
Thr Thr Thr Tyr Gly Glu Asn Ile Tyr Leu Val Gly Ser Ile Ser Gln
530 535 540
Leu Gly Asp Trp Glu Thr Ser Asp Gly Ile Ala Leu Ser Ala Asp Lys
545 550 555 560
Tyr Thr Ser Ser Asn Pro Pro Trp Tyr Val Thr Val Thr Leu Pro Ala
565 570 575
Gly Glu Ser Phe Glu Tyr Lys Phe Ile Arg Val Glu Ser Asp Asp Ser
580 585 590
Val Glu Trp Glu Ser Asp Pro Asn Arg Glu Tyr Thr Val Pro Gln Ala
595 600 605
Cys Gly Glu Ser Thr Ala Thr Val Thr Asp Thr Trp Arg
610 615 620
<210> 44
<211> 616
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 44
Ala Pro Gln Leu Ala Pro Arg Ala Thr Thr Ser Leu Asp Ala Trp Leu
1 5 10 15
Ala Ser Glu Thr Thr Val Ala Leu Asp Gly Ile Leu Asp Asn Val Gly
20 25 30
Ser Ser Gly Ala Tyr Ala Lys Ser Ala Lys Ser Gly Ile Val Ile Ala
35 40 45
Ser Pro Ser Thr Ser Asp Pro Asp Tyr Tyr Tyr Thr Trp Thr Arg Asp
50 55 60
Ala Ala Leu Thr Val Lys Ala Leu Ile Asp Leu Phe Arg Asn Gly Glu
65 70 75 80
Thr Ser Leu Gln Thr Val Ile Met Glu Tyr Ile Ser Ser Gln Ala Tyr
85 90 95
Leu Gln Thr Val Ser Asn Pro Ser Gly Ser Leu Ser Thr Gly Gly Leu
100 105 110
Ala Glu Pro Lys Tyr Tyr Val Asp Glu Thr Ala Tyr Thr Gly Ser Trp
115 120 125
Gly Arg Pro Gln Arg Asp Gly Pro Ala Leu Arg Ala Thr Ala Met Ile
130 135 140
Asp Phe Gly Asn Trp Leu Ile Asp Asn Gly Tyr Ser Thr Tyr Ala Ser
145 150 155 160
Ser Ile Val Trp Pro Ile Val Arg Asn Asp Leu Ser Tyr Val Ala Gln
165 170 175
Tyr Trp Asn Gln Thr Gly Tyr Asp Leu Trp Glu Glu Val Asn Gly Ser
180 185 190
Ser Phe Phe Thr Ile Ala Val Gln His Arg Ala Leu Val Glu Gly Ser
195 200 205
Thr Phe Ala Ser Lys Val Gly Ala Ser Cys Ser Trp Cys Asp Ser Gln
210 215 220
Ala Pro Gln Val Leu Cys Phe Leu Gln Arg Phe Trp Thr Gly Ser Tyr
225 230 235 240
Ile Met Ala Asn Phe Gly Gly Gly Arg Ser Gly Lys Asp Ala Asn Thr
245 250 255
Val Leu Gly Ser Ile His Thr Phe Asp Pro Asn Ala Gly Cys Asp Asp
260 265 270
Thr Thr Phe Gln Pro Cys Ser Pro Arg Ala Leu Ala Asn His Lys Val
275 280 285
Tyr Thr Asp Ser Phe Arg Ser Ile Tyr Ser Ile Asn Ser Gly Ile Ser
290 295 300
Ser Gly Lys Ala Val Ala Val Gly Arg Tyr Pro Glu Asp Ser Tyr Tyr
305 310 315 320
Asn Gly Asn Pro Trp Phe Leu Thr Thr Leu Ala Ala Ala Glu Gln Leu
325 330 335
Tyr Asp Ala Ile Tyr Gln Trp Gln Lys Ile Gly Ser Ile Thr Ile Thr
340 345 350
Asp Val Ser Leu Ala Phe Phe Lys Asp Leu Tyr Ser Ser Ala Ala Val
355 360 365
Gly Thr Tyr Ala Ser Ser Ser Ser Ala Phe Thr Ser Ile Val Ser Ala
370 375 380
Val Lys Thr Tyr Ala Asp Gly Tyr Met Ser Ile Val Gln Thr His Ala
385 390 395 400
Met Thr Asn Gly Ser Leu Ser Glu Gln Phe Gly Lys Ser Asp Gly Phe
405 410 415
Ser Leu Ser Ala Arg Asp Leu Thr Trp Ser Tyr Ala Ala Leu Leu Thr
420 425 430
Ala Asn Leu Arg Arg Asn Ser Val Val Pro Pro Ser Trp Gly Glu Thr
435 440 445
Thr Ala Thr Ser Val Pro Ser Val Cys Ser Ala Thr Ser Ala Thr Gly
450 455 460
Thr Tyr Ser Thr Ala Thr Asn Thr Ala Trp Pro Ser Thr Leu Thr Ser
465 470 475 480
Gly Thr Gly Ala Thr Thr Thr Thr Ser Lys Ala Thr Ser Ser Ser Thr
485 490 495
Thr Thr Thr Ser Ser Ala Ser Ser Thr Thr Val Glu Cys Val Val Pro
500 505 510
Thr Ala Val Ala Val Thr Phe Asp Glu Val Ala Thr Thr Thr Tyr Gly
515 520 525
Glu Asn Val Tyr Val Val Gly Ser Ile Ser Gln Leu Gly Ser Trp Asp
530 535 540
Thr Ser Lys Ala Val Ala Leu Ser Ala Ser Lys Tyr Thr Ser Ser Asn
545 550 555 560
Asn Leu Trp Tyr Val Thr Val Thr Leu Pro Ala Gly Thr Thr Phe Gln
565 570 575
Tyr Lys Phe Ile Arg Val Ser Ser Ser Gly Ser Val Thr Trp Glu Ser
580 585 590
Asp Pro Asn Arg Ser Tyr Thr Val Pro Ser Ala Cys Gly Thr Ser Thr
595 600 605
Ala Val Val Asn Thr Thr Trp Arg
610 615
<210> 45
<211> 586
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 45
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Val Pro Val Glu Leu Asp Lys
85 90 95
Arg Asn Thr Gly His Phe Gln Ala Tyr Ser Gly Tyr Thr Val Ala Arg
100 105 110
Ser Asn Phe Thr Gln Trp Ile His Glu Gln Pro Ala Val Ser Trp Tyr
115 120 125
Tyr Leu Leu Gln Asn Ile Asp Tyr Pro Glu Gly Gln Phe Lys Ser Ala
130 135 140
Lys Pro Gly Val Val Val Ala Ser Pro Ser Thr Ser Glu Pro Asp Tyr
145 150 155 160
Phe Tyr Gln Trp Thr Arg Asp Thr Ala Ile Thr Phe Leu Ser Leu Ile
165 170 175
Ala Glu Val Glu Asp His Ser Phe Ser Asn Thr Thr Leu Ala Lys Val
180 185 190
Val Glu Tyr Tyr Ile Ser Asn Thr Tyr Thr Leu Gln Arg Val Ser Asn
195 200 205
Pro Ser Gly Asn Phe Asp Ser Pro Asn His Asp Gly Leu Gly Glu Pro
210 215 220
Lys Phe Asn Val Asp Asp Thr Ala Tyr Thr Ala Ser Trp Gly Arg Pro
225 230 235 240
Gln Asn Asp Gly Pro Ala Leu Arg Ala Tyr Ala Ile Ser Arg Tyr Leu
245 250 255
Asn Ala Val Ala Lys His Asn Asn Gly Lys Leu Leu Leu Ala Gly Gln
260 265 270
Asn Gly Ile Pro Tyr Ser Ser Ala Ser Asp Ile Tyr Trp Lys Ile Ile
275 280 285
Lys Pro Asp Leu Gln His Val Ser Thr His Trp Ser Thr Ser Gly Phe
290 295 300
Asp Leu Trp Glu Glu Asn Gln Gly Thr His Phe Phe Thr Ala Leu Val
305 310 315 320
Gln Leu Lys Ala Leu Ser Tyr Gly Ile Pro Leu Ser Lys Thr Tyr Asn
325 330 335
Asp Pro Gly Phe Thr Ser Trp Leu Glu Lys Gln Lys Asp Ala Leu Asn
340 345 350
Ser Tyr Ile Asn Ser Ser Gly Phe Val Asn Ser Gly Lys Lys His Ile
355 360 365
Val Glu Ser Pro Gln Leu Ser Ser Arg Gly Gly Leu Asp Ser Ala Thr
370 375 380
Tyr Ile Ala Ala Leu Ile Thr His Asp Ile Gly Asp Asp Asp Thr Tyr
385 390 395 400
Thr Pro Phe Asn Val Asp Asn Ser Tyr Val Leu Asn Ser Leu Tyr Tyr
405 410 415
Leu Leu Val Asp Asn Lys Asn Arg Tyr Lys Ile Asn Gly Asn Tyr Lys
420 425 430
Ala Gly Ala Ala Val Gly Arg Tyr Pro Glu Asp Val Tyr Asn Gly Val
435 440 445
Gly Thr Ser Glu Gly Asn Pro Trp Gln Leu Ala Thr Ala Tyr Ala Gly
450 455 460
Gln Thr Phe Tyr Thr Leu Ala Tyr Asn Ser Leu Lys Asn Lys Lys Asn
465 470 475 480
Leu Val Ile Glu Lys Leu Asn Tyr Asp Leu Tyr Asn Ser Phe Ile Ala
485 490 495
Asp Leu Ser Lys Ile Asp Ser Ser Tyr Ala Ser Lys Asp Ser Leu Thr
500 505 510
Leu Thr Tyr Gly Ser Asp Asn Tyr Lys Asn Val Ile Lys Ser Leu Leu
515 520 525
Gln Phe Gly Asp Ser Phe Leu Lys Val Leu Leu Asp His Ile Asp Asp
530 535 540
Asn Gly Gln Leu Thr Glu Glu Ile Asn Arg Tyr Thr Gly Phe Gln Ala
545 550 555 560
Gly Ala Val Ser Leu Thr Trp Ser Ser Gly Ser Leu Leu Ser Ala Asn
565 570 575
Arg Ala Arg Asn Lys Leu Ile Glu Leu Leu
580 585
<210> 46
<211> 558
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 46
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Leu Glu Gly
20 25 30
Asp Phe Asp Val Ala Val Leu Pro Phe Ser Ala Ser Ile Ala Ala Lys
35 40 45
Glu Glu Gly Val Ser Leu Glu Lys Arg Glu Ala Glu Ala Val Pro Val
50 55 60
Glu Leu Asp Lys Arg Asn Thr Gly His Phe Gln Ala Tyr Ser Gly Tyr
65 70 75 80
Thr Val Ala Arg Ser Asn Phe Thr Gln Trp Ile His Glu Gln Pro Ala
85 90 95
Val Ser Trp Tyr Tyr Leu Leu Gln Asn Ile Asp Tyr Pro Glu Gly Gln
100 105 110
Phe Lys Ser Ala Lys Pro Gly Val Val Val Ala Ser Pro Ser Thr Ser
115 120 125
Glu Pro Asp Tyr Phe Tyr Gln Trp Thr Arg Asp Thr Ala Ile Thr Phe
130 135 140
Leu Ser Leu Ile Ala Glu Val Glu Asp His Ser Phe Ser Asn Thr Thr
145 150 155 160
Leu Ala Lys Val Val Glu Tyr Tyr Ile Ser Asn Thr Tyr Thr Leu Gln
165 170 175
Arg Val Ser Asn Pro Ser Gly Asn Phe Asp Ser Pro Asn His Asp Gly
180 185 190
Leu Gly Glu Pro Lys Phe Asn Val Asp Asp Thr Ala Tyr Thr Ala Ser
195 200 205
Trp Gly Arg Pro Gln Asn Asp Gly Pro Ala Leu Arg Ala Tyr Ala Ile
210 215 220
Ser Arg Tyr Leu Asn Ala Val Ala Lys His Asn Asn Gly Lys Leu Leu
225 230 235 240
Leu Ala Gly Gln Asn Gly Ile Pro Tyr Ser Ser Ala Ser Asp Ile Tyr
245 250 255
Trp Lys Ile Ile Lys Pro Asp Leu Gln His Val Ser Thr His Trp Ser
260 265 270
Thr Ser Gly Phe Asp Leu Trp Glu Glu Asn Gln Gly Thr His Phe Phe
275 280 285
Thr Ala Leu Val Gln Leu Lys Ala Leu Ser Tyr Gly Ile Pro Leu Ser
290 295 300
Lys Thr Tyr Asn Asp Pro Gly Phe Thr Ser Trp Leu Glu Lys Gln Lys
305 310 315 320
Asp Ala Leu Asn Ser Tyr Ile Asn Ser Ser Gly Phe Val Asn Ser Gly
325 330 335
Lys Lys His Ile Val Glu Ser Pro Gln Leu Ser Ser Arg Gly Gly Leu
340 345 350
Asp Ser Ala Thr Tyr Ile Ala Ala Leu Ile Thr His Asp Ile Gly Asp
355 360 365
Asp Asp Thr Tyr Thr Pro Phe Asn Val Asp Asn Ser Tyr Val Leu Asn
370 375 380
Ser Leu Tyr Tyr Leu Leu Val Asp Asn Lys Asn Arg Tyr Lys Ile Asn
385 390 395 400
Gly Asn Tyr Lys Ala Gly Ala Ala Val Gly Arg Tyr Pro Glu Asp Val
405 410 415
Tyr Asn Gly Val Gly Thr Ser Glu Gly Asn Pro Trp Gln Leu Ala Thr
420 425 430
Ala Tyr Ala Gly Gln Thr Phe Tyr Thr Leu Ala Tyr Asn Ser Leu Lys
435 440 445
Asn Lys Lys Asn Leu Val Ile Glu Lys Leu Asn Tyr Asp Leu Tyr Asn
450 455 460
Ser Phe Ile Ala Asp Leu Ser Lys Ile Asp Ser Ser Tyr Ala Ser Lys
465 470 475 480
Asp Ser Leu Thr Leu Thr Tyr Gly Ser Asp Asn Tyr Lys Asn Val Ile
485 490 495
Lys Ser Leu Leu Gln Phe Gly Asp Ser Phe Leu Lys Val Leu Leu Asp
500 505 510
His Ile Asp Asp Asn Gly Gln Leu Thr Glu Glu Ile Asn Arg Tyr Thr
515 520 525
Gly Phe Gln Ala Gly Ala Val Ser Leu Thr Trp Ser Ser Gly Ser Leu
530 535 540
Leu Ser Ala Asn Arg Ala Arg Asn Lys Leu Ile Glu Leu Leu
545 550 555
<210> 47
<211> 516
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 47
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Val Pro Val Glu Leu Asp Lys Arg Asn Thr Gly His Phe
20 25 30
Gln Ala Tyr Ser Gly Tyr Thr Val Ala Arg Ser Asn Phe Thr Gln Trp
35 40 45
Ile His Glu Gln Pro Ala Val Ser Trp Tyr Tyr Leu Leu Gln Asn Ile
50 55 60
Asp Tyr Pro Glu Gly Gln Phe Lys Ser Ala Lys Pro Gly Val Val Val
65 70 75 80
Ala Ser Pro Ser Thr Ser Glu Pro Asp Tyr Phe Tyr Gln Trp Thr Arg
85 90 95
Asp Thr Ala Ile Thr Phe Leu Ser Leu Ile Ala Glu Val Glu Asp His
100 105 110
Ser Phe Ser Asn Thr Thr Leu Ala Lys Val Val Glu Tyr Tyr Ile Ser
115 120 125
Asn Thr Tyr Thr Leu Gln Arg Val Ser Asn Pro Ser Gly Asn Phe Asp
130 135 140
Ser Pro Asn His Asp Gly Leu Gly Glu Pro Lys Phe Asn Val Asp Asp
145 150 155 160
Thr Ala Tyr Thr Ala Ser Trp Gly Arg Pro Gln Asn Asp Gly Pro Ala
165 170 175
Leu Arg Ala Tyr Ala Ile Ser Arg Tyr Leu Asn Ala Val Ala Lys His
180 185 190
Asn Asn Gly Lys Leu Leu Leu Ala Gly Gln Asn Gly Ile Pro Tyr Ser
195 200 205
Ser Ala Ser Asp Ile Tyr Trp Lys Ile Ile Lys Pro Asp Leu Gln His
210 215 220
Val Ser Thr His Trp Ser Thr Ser Gly Phe Asp Leu Trp Glu Glu Asn
225 230 235 240
Gln Gly Thr His Phe Phe Thr Ala Leu Val Gln Leu Lys Ala Leu Ser
245 250 255
Tyr Gly Ile Pro Leu Ser Lys Thr Tyr Asn Asp Pro Gly Phe Thr Ser
260 265 270
Trp Leu Glu Lys Gln Lys Asp Ala Leu Asn Ser Tyr Ile Asn Ser Ser
275 280 285
Gly Phe Val Asn Ser Gly Lys Lys His Ile Val Glu Ser Pro Gln Leu
290 295 300
Ser Ser Arg Gly Gly Leu Asp Ser Ala Thr Tyr Ile Ala Ala Leu Ile
305 310 315 320
Thr His Asp Ile Gly Asp Asp Asp Thr Tyr Thr Pro Phe Asn Val Asp
325 330 335
Asn Ser Tyr Val Leu Asn Ser Leu Tyr Tyr Leu Leu Val Asp Asn Lys
340 345 350
Asn Arg Tyr Lys Ile Asn Gly Asn Tyr Lys Ala Gly Ala Ala Val Gly
355 360 365
Arg Tyr Pro Glu Asp Val Tyr Asn Gly Val Gly Thr Ser Glu Gly Asn
370 375 380
Pro Trp Gln Leu Ala Thr Ala Tyr Ala Gly Gln Thr Phe Tyr Thr Leu
385 390 395 400
Ala Tyr Asn Ser Leu Lys Asn Lys Lys Asn Leu Val Ile Glu Lys Leu
405 410 415
Asn Tyr Asp Leu Tyr Asn Ser Phe Ile Ala Asp Leu Ser Lys Ile Asp
420 425 430
Ser Ser Tyr Ala Ser Lys Asp Ser Leu Thr Leu Thr Tyr Gly Ser Asp
435 440 445
Asn Tyr Lys Asn Val Ile Lys Ser Leu Leu Gln Phe Gly Asp Ser Phe
450 455 460
Leu Lys Val Leu Leu Asp His Ile Asp Asp Asn Gly Gln Leu Thr Glu
465 470 475 480
Glu Ile Asn Arg Tyr Thr Gly Phe Gln Ala Gly Ala Val Ser Leu Thr
485 490 495
Trp Ser Ser Gly Ser Leu Leu Ser Ala Asn Arg Ala Arg Asn Lys Leu
500 505 510
Ile Glu Leu Leu
515
<210> 48
<211> 516
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 48
Met Leu Leu Gln Ala Phe Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys
1 5 10 15
Ile Ser Ala Val Pro Val Glu Leu Asp Lys Arg Asn Thr Gly His Phe
20 25 30
Gln Ala Tyr Ser Gly Tyr Thr Val Ala Arg Ser Asn Phe Thr Gln Trp
35 40 45
Ile His Glu Gln Pro Ala Val Ser Trp Tyr Tyr Leu Leu Gln Asn Ile
50 55 60
Asp Tyr Pro Glu Gly Gln Phe Lys Ser Ala Lys Pro Gly Val Val Val
65 70 75 80
Ala Ser Pro Ser Thr Ser Glu Pro Asp Tyr Phe Tyr Gln Trp Thr Arg
85 90 95
Asp Thr Ala Ile Thr Phe Leu Ser Leu Ile Ala Glu Val Glu Asp His
100 105 110
Ser Phe Ser Asn Thr Thr Leu Ala Lys Val Val Glu Tyr Tyr Ile Ser
115 120 125
Asn Thr Tyr Thr Leu Gln Arg Val Ser Asn Pro Ser Gly Asn Phe Asp
130 135 140
Ser Pro Asn His Asp Gly Leu Gly Glu Pro Lys Phe Asn Val Asp Asp
145 150 155 160
Thr Ala Tyr Thr Ala Ser Trp Gly Arg Pro Gln Asn Asp Gly Pro Ala
165 170 175
Leu Arg Ala Tyr Ala Ile Ser Arg Tyr Leu Asn Ala Val Ala Lys His
180 185 190
Asn Asn Gly Lys Leu Leu Leu Ala Gly Gln Asn Gly Ile Pro Tyr Ser
195 200 205
Ser Ala Ser Asp Ile Tyr Trp Lys Ile Ile Lys Pro Asp Leu Gln His
210 215 220
Val Ser Thr His Trp Ser Thr Ser Gly Phe Asp Leu Trp Glu Glu Asn
225 230 235 240
Gln Gly Thr His Phe Phe Thr Ala Leu Val Gln Leu Lys Ala Leu Ser
245 250 255
Tyr Gly Ile Pro Leu Ser Lys Thr Tyr Asn Asp Pro Gly Phe Thr Ser
260 265 270
Trp Leu Glu Lys Gln Lys Asp Ala Leu Asn Ser Tyr Ile Asn Ser Ser
275 280 285
Gly Phe Val Asn Ser Gly Lys Lys His Ile Val Glu Ser Pro Gln Leu
290 295 300
Ser Ser Arg Gly Gly Leu Asp Ser Ala Thr Tyr Ile Ala Ala Leu Ile
305 310 315 320
Thr His Asp Ile Gly Asp Asp Asp Thr Tyr Thr Pro Phe Asn Val Asp
325 330 335
Asn Ser Tyr Val Leu Asn Ser Leu Tyr Tyr Leu Leu Val Asp Asn Lys
340 345 350
Asn Arg Tyr Lys Ile Asn Gly Asn Tyr Lys Ala Gly Ala Ala Val Gly
355 360 365
Arg Tyr Pro Glu Asp Val Tyr Asn Gly Val Gly Thr Ser Glu Gly Asn
370 375 380
Pro Trp Gln Leu Ala Thr Ala Tyr Ala Gly Gln Thr Phe Tyr Thr Leu
385 390 395 400
Ala Tyr Asn Ser Leu Lys Asn Lys Lys Asn Leu Val Ile Glu Lys Leu
405 410 415
Asn Tyr Asp Leu Tyr Asn Ser Phe Ile Ala Asp Leu Ser Lys Ile Asp
420 425 430
Ser Ser Tyr Ala Ser Lys Asp Ser Leu Thr Leu Thr Tyr Gly Ser Asp
435 440 445
Asn Tyr Lys Asn Val Ile Lys Ser Leu Leu Gln Phe Gly Asp Ser Phe
450 455 460
Leu Lys Val Leu Leu Asp His Ile Asp Asp Asn Gly Gln Leu Thr Glu
465 470 475 480
Glu Ile Asn Arg Tyr Thr Gly Phe Gln Ala Gly Ala Val Ser Leu Thr
485 490 495
Trp Ser Ser Gly Ser Leu Leu Ser Ala Asn Arg Ala Arg Asn Lys Leu
500 505 510
Ile Glu Leu Leu
515
<210> 49
<211> 523
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 49
Met Leu Gly Lys Asn Asp Pro Met Cys Leu Val Leu Val Leu Leu Gly
1 5 10 15
Leu Thr Ala Leu Leu Gly Ile Cys Gln Gly Val Pro Val Glu Leu Asp
20 25 30
Lys Arg Asn Thr Gly His Phe Gln Ala Tyr Ser Gly Tyr Thr Val Ala
35 40 45
Arg Ser Asn Phe Thr Gln Trp Ile His Glu Gln Pro Ala Val Ser Trp
50 55 60
Tyr Tyr Leu Leu Gln Asn Ile Asp Tyr Pro Glu Gly Gln Phe Lys Ser
65 70 75 80
Ala Lys Pro Gly Val Val Val Ala Ser Pro Ser Thr Ser Glu Pro Asp
85 90 95
Tyr Phe Tyr Gln Trp Thr Arg Asp Thr Ala Ile Thr Phe Leu Ser Leu
100 105 110
Ile Ala Glu Val Glu Asp His Ser Phe Ser Asn Thr Thr Leu Ala Lys
115 120 125
Val Val Glu Tyr Tyr Ile Ser Asn Thr Tyr Thr Leu Gln Arg Val Ser
130 135 140
Asn Pro Ser Gly Asn Phe Asp Ser Pro Asn His Asp Gly Leu Gly Glu
145 150 155 160
Pro Lys Phe Asn Val Asp Asp Thr Ala Tyr Thr Ala Ser Trp Gly Arg
165 170 175
Pro Gln Asn Asp Gly Pro Ala Leu Arg Ala Tyr Ala Ile Ser Arg Tyr
180 185 190
Leu Asn Ala Val Ala Lys His Asn Asn Gly Lys Leu Leu Leu Ala Gly
195 200 205
Gln Asn Gly Ile Pro Tyr Ser Ser Ala Ser Asp Ile Tyr Trp Lys Ile
210 215 220
Ile Lys Pro Asp Leu Gln His Val Ser Thr His Trp Ser Thr Ser Gly
225 230 235 240
Phe Asp Leu Trp Glu Glu Asn Gln Gly Thr His Phe Phe Thr Ala Leu
245 250 255
Val Gln Leu Lys Ala Leu Ser Tyr Gly Ile Pro Leu Ser Lys Thr Tyr
260 265 270
Asn Asp Pro Gly Phe Thr Ser Trp Leu Glu Lys Gln Lys Asp Ala Leu
275 280 285
Asn Ser Tyr Ile Asn Ser Ser Gly Phe Val Asn Ser Gly Lys Lys His
290 295 300
Ile Val Glu Ser Pro Gln Leu Ser Ser Arg Gly Gly Leu Asp Ser Ala
305 310 315 320
Thr Tyr Ile Ala Ala Leu Ile Thr His Asp Ile Gly Asp Asp Asp Thr
325 330 335
Tyr Thr Pro Phe Asn Val Asp Asn Ser Tyr Val Leu Asn Ser Leu Tyr
340 345 350
Tyr Leu Leu Val Asp Asn Lys Asn Arg Tyr Lys Ile Asn Gly Asn Tyr
355 360 365
Lys Ala Gly Ala Ala Val Gly Arg Tyr Pro Glu Asp Val Tyr Asn Gly
370 375 380
Val Gly Thr Ser Glu Gly Asn Pro Trp Gln Leu Ala Thr Ala Tyr Ala
385 390 395 400
Gly Gln Thr Phe Tyr Thr Leu Ala Tyr Asn Ser Leu Lys Asn Lys Lys
405 410 415
Asn Leu Val Ile Glu Lys Leu Asn Tyr Asp Leu Tyr Asn Ser Phe Ile
420 425 430
Ala Asp Leu Ser Lys Ile Asp Ser Ser Tyr Ala Ser Lys Asp Ser Leu
435 440 445
Thr Leu Thr Tyr Gly Ser Asp Asn Tyr Lys Asn Val Ile Lys Ser Leu
450 455 460
Leu Gln Phe Gly Asp Ser Phe Leu Lys Val Leu Leu Asp His Ile Asp
465 470 475 480
Asp Asn Gly Gln Leu Thr Glu Glu Ile Asn Arg Tyr Thr Gly Phe Gln
485 490 495
Ala Gly Ala Val Ser Leu Thr Trp Ser Ser Gly Ser Leu Leu Ser Ala
500 505 510
Asn Arg Ala Arg Asn Lys Leu Ile Glu Leu Leu
515 520
<210> 50
<211> 515
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 50
Met Lys Trp Val Thr Phe Ile Ser Leu Leu Phe Leu Phe Ser Ser Ala
1 5 10 15
Tyr Ser Val Pro Val Glu Leu Asp Lys Arg Asn Thr Gly His Phe Gln
20 25 30
Ala Tyr Ser Gly Tyr Thr Val Ala Arg Ser Asn Phe Thr Gln Trp Ile
35 40 45
His Glu Gln Pro Ala Val Ser Trp Tyr Tyr Leu Leu Gln Asn Ile Asp
50 55 60
Tyr Pro Glu Gly Gln Phe Lys Ser Ala Lys Pro Gly Val Val Val Ala
65 70 75 80
Ser Pro Ser Thr Ser Glu Pro Asp Tyr Phe Tyr Gln Trp Thr Arg Asp
85 90 95
Thr Ala Ile Thr Phe Leu Ser Leu Ile Ala Glu Val Glu Asp His Ser
100 105 110
Phe Ser Asn Thr Thr Leu Ala Lys Val Val Glu Tyr Tyr Ile Ser Asn
115 120 125
Thr Tyr Thr Leu Gln Arg Val Ser Asn Pro Ser Gly Asn Phe Asp Ser
130 135 140
Pro Asn His Asp Gly Leu Gly Glu Pro Lys Phe Asn Val Asp Asp Thr
145 150 155 160
Ala Tyr Thr Ala Ser Trp Gly Arg Pro Gln Asn Asp Gly Pro Ala Leu
165 170 175
Arg Ala Tyr Ala Ile Ser Arg Tyr Leu Asn Ala Val Ala Lys His Asn
180 185 190
Asn Gly Lys Leu Leu Leu Ala Gly Gln Asn Gly Ile Pro Tyr Ser Ser
195 200 205
Ala Ser Asp Ile Tyr Trp Lys Ile Ile Lys Pro Asp Leu Gln His Val
210 215 220
Ser Thr His Trp Ser Thr Ser Gly Phe Asp Leu Trp Glu Glu Asn Gln
225 230 235 240
Gly Thr His Phe Phe Thr Ala Leu Val Gln Leu Lys Ala Leu Ser Tyr
245 250 255
Gly Ile Pro Leu Ser Lys Thr Tyr Asn Asp Pro Gly Phe Thr Ser Trp
260 265 270
Leu Glu Lys Gln Lys Asp Ala Leu Asn Ser Tyr Ile Asn Ser Ser Gly
275 280 285
Phe Val Asn Ser Gly Lys Lys His Ile Val Glu Ser Pro Gln Leu Ser
290 295 300
Ser Arg Gly Gly Leu Asp Ser Ala Thr Tyr Ile Ala Ala Leu Ile Thr
305 310 315 320
His Asp Ile Gly Asp Asp Asp Thr Tyr Thr Pro Phe Asn Val Asp Asn
325 330 335
Ser Tyr Val Leu Asn Ser Leu Tyr Tyr Leu Leu Val Asp Asn Lys Asn
340 345 350
Arg Tyr Lys Ile Asn Gly Asn Tyr Lys Ala Gly Ala Ala Val Gly Arg
355 360 365
Tyr Pro Glu Asp Val Tyr Asn Gly Val Gly Thr Ser Glu Gly Asn Pro
370 375 380
Trp Gln Leu Ala Thr Ala Tyr Ala Gly Gln Thr Phe Tyr Thr Leu Ala
385 390 395 400
Tyr Asn Ser Leu Lys Asn Lys Lys Asn Leu Val Ile Glu Lys Leu Asn
405 410 415
Tyr Asp Leu Tyr Asn Ser Phe Ile Ala Asp Leu Ser Lys Ile Asp Ser
420 425 430
Ser Tyr Ala Ser Lys Asp Ser Leu Thr Leu Thr Tyr Gly Ser Asp Asn
435 440 445
Tyr Lys Asn Val Ile Lys Ser Leu Leu Gln Phe Gly Asp Ser Phe Leu
450 455 460
Lys Val Leu Leu Asp His Ile Asp Asp Asn Gly Gln Leu Thr Glu Glu
465 470 475 480
Ile Asn Arg Tyr Thr Gly Phe Gln Ala Gly Ala Val Ser Leu Thr Trp
485 490 495
Ser Ser Gly Ser Leu Leu Ser Ala Asn Arg Ala Arg Asn Lys Leu Ile
500 505 510
Glu Leu Leu
515
<210> 51
<211> 516
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 51
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Val Pro Val Glu Leu Asp Lys Arg Asn Thr Gly His Phe
20 25 30
Gln Ala Tyr Ser Gly Tyr Thr Val Ala Arg Ser Asn Phe Thr Gln Trp
35 40 45
Ile His Glu Gln Pro Ala Val Ser Trp Tyr Tyr Leu Leu Gln Asn Ile
50 55 60
Asp Tyr Pro Glu Gly Gln Phe Lys Ser Ala Lys Pro Gly Val Val Val
65 70 75 80
Ala Ser Pro Ser Thr Ser Glu Pro Asp Tyr Phe Tyr Gln Trp Thr Arg
85 90 95
Asp Thr Ala Ile Thr Phe Leu Ser Leu Ile Ala Glu Val Glu Asp His
100 105 110
Ser Phe Ser Asn Thr Thr Leu Ala Lys Val Val Glu Tyr Tyr Ile Ser
115 120 125
Asn Thr Tyr Thr Leu Gln Arg Val Ser Asn Pro Ser Gly Asn Phe Asp
130 135 140
Ser Pro Asn His Asp Gly Leu Gly Glu Pro Lys Phe Asn Val Asp Asp
145 150 155 160
Thr Ala Tyr Thr Ala Ser Trp Gly Arg Pro Gln Asn Asp Gly Pro Ala
165 170 175
Leu Arg Ala Tyr Ala Ile Ser Arg Tyr Leu Asn Ala Val Ala Lys His
180 185 190
Asn Asn Gly Lys Leu Leu Leu Ala Gly Gln Asn Gly Ile Pro Tyr Ser
195 200 205
Ser Ala Ser Asp Ile Tyr Trp Lys Ile Ile Lys Pro Asp Leu Gln His
210 215 220
Val Ser Thr His Trp Ser Thr Ser Gly Phe Asp Leu Trp Glu Glu Asn
225 230 235 240
Gln Gly Thr His Phe Phe Thr Ala Leu Val Gln Leu Lys Ala Leu Ser
245 250 255
Tyr Gly Ile Pro Leu Ser Lys Thr Tyr Asn Asp Pro Gly Phe Thr Ser
260 265 270
Trp Leu Glu Lys Gln Lys Asp Ala Leu Asn Ser Tyr Ile Asn Ser Ser
275 280 285
Gly Phe Val Asn Ser Gly Lys Lys His Ile Val Glu Ser Pro Gln Leu
290 295 300
Ser Ser Arg Gly Gly Leu Asp Ser Ala Thr Tyr Ile Ala Ala Leu Ile
305 310 315 320
Thr His Asp Ile Gly Asp Asp Asp Thr Tyr Thr Pro Phe Asn Val Asp
325 330 335
Asn Ser Tyr Val Leu Asn Ser Leu Tyr Tyr Leu Leu Val Asp Asn Lys
340 345 350
Asn Arg Tyr Lys Ile Asn Gly Asn Tyr Lys Ala Gly Ala Ala Val Gly
355 360 365
Arg Tyr Pro Glu Asp Val Tyr Asn Gly Val Gly Thr Ser Glu Gly Asn
370 375 380
Pro Trp Gln Leu Ala Thr Ala Tyr Ala Gly Gln Thr Phe Tyr Thr Leu
385 390 395 400
Ala Tyr Asn Ser Leu Lys Asn Lys Lys Asn Leu Val Ile Glu Lys Leu
405 410 415
Asn Tyr Asp Leu Tyr Asn Ser Phe Ile Ala Asp Leu Ser Lys Ile Asp
420 425 430
Ser Ser Tyr Ala Ser Lys Asp Ser Leu Thr Leu Thr Tyr Gly Ser Asp
435 440 445
Asn Tyr Lys Asn Val Ile Lys Ser Leu Leu Gln Phe Gly Asp Ser Phe
450 455 460
Leu Lys Val Leu Leu Asp His Ile Asp Asp Asn Gly Gln Leu Thr Glu
465 470 475 480
Glu Ile Asn Arg Tyr Thr Gly Phe Gln Ala Gly Ala Val Ser Leu Thr
485 490 495
Trp Ser Ser Gly Ser Leu Leu Ser Ala Asn Arg Ala Arg Asn Lys Leu
500 505 510
Ile Glu Leu Leu
515
<210> 52
<211> 668
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 52
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Ala Ser Ile Pro Ser Ser Ala
85 90 95
Ser Val Gln Leu Asp Ser Tyr Asn Tyr Asp Gly Ser Thr Phe Ser Gly
100 105 110
Lys Ile Tyr Val Lys Asn Ile Ala Tyr Ser Lys Lys Val Thr Val Val
115 120 125
Tyr Ala Asp Gly Ser Asp Asn Trp Asn Asn Asn Gly Asn Thr Ile Ala
130 135 140
Ala Ser Phe Ser Gly Pro Ile Ser Gly Ser Asn Tyr Glu Tyr Trp Thr
145 150 155 160
Phe Ser Ala Ser Val Lys Gly Ile Lys Glu Phe Tyr Ile Lys Tyr Glu
165 170 175
Val Ser Gly Lys Thr Tyr Tyr Asp Asn Asn Asn Ser Ala Asn Tyr Gln
180 185 190
Val Ser Thr Ser Lys Pro Thr Thr Thr Thr Ala Ala Thr Thr Thr Thr
195 200 205
Thr Ala Pro Ser Thr Ser Thr Thr Thr Arg Pro Ser Ser Ser Glu Pro
210 215 220
Ala Thr Phe Pro Thr Gly Asn Ser Thr Ile Ser Ser Trp Ile Lys Lys
225 230 235 240
Gln Glu Asp Ile Ser Arg Phe Ala Met Leu Arg Asn Ile Asn Pro Pro
245 250 255
Gly Ser Ala Thr Gly Phe Ile Ala Ala Ser Leu Ser Thr Ala Gly Pro
260 265 270
Asp Tyr Tyr Tyr Ala Trp Thr Arg Asp Ala Ala Leu Thr Ser Asn Val
275 280 285
Ile Val Tyr Glu Tyr Asn Thr Thr Leu Ser Gly Asn Lys Thr Ile Leu
290 295 300
Asn Val Leu Lys Asp Tyr Val Thr Phe Ser Val Lys Thr Gln Ser Thr
305 310 315 320
Ser Thr Val Cys Asn Cys Leu Gly Glu Pro Lys Phe Asn Pro Asp Gly
325 330 335
Ser Gly Tyr Thr Gly Ala Trp Gly Arg Pro Gln Asn Asp Gly Pro Ala
340 345 350
Glu Arg Ala Thr Thr Phe Val Leu Phe Ala Asp Ser Tyr Leu Thr Gln
355 360 365
Thr Lys Asp Ala Ser Tyr Val Thr Gly Thr Leu Lys Pro Ala Ile Phe
370 375 380
Lys Asp Leu Asp Tyr Val Val Asn Val Trp Ser Asn Gly Cys Phe Asp
385 390 395 400
Leu Trp Glu Glu Val Asn Gly Val His Phe Tyr Thr Leu Met Val Met
405 410 415
Arg Lys Gly Leu Leu Leu Gly Ala Asp Phe Ala Lys Arg Asn Gly Asp
420 425 430
Ser Thr Arg Ala Ser Thr Tyr Ser Ser Thr Ala Ser Thr Ile Ala Asn
435 440 445
Lys Ile Ser Ser Phe Trp Val Ser Ser Asn Asn Trp Val Gln Val Ser
450 455 460
Gln Ser Val Thr Gly Gly Val Ser Lys Lys Gly Leu Asp Val Ser Thr
465 470 475 480
Leu Leu Ala Ala Asn Leu Gly Ser Val Asp Asp Gly Phe Phe Thr Pro
485 490 495
Gly Ser Glu Lys Ile Leu Ala Thr Ala Val Ala Val Glu Asp Ser Phe
500 505 510
Ala Ser Leu Tyr Pro Ile Asn Lys Asn Leu Pro Ser Tyr Leu Gly Asn
515 520 525
Ala Ile Gly Arg Tyr Pro Glu Asp Thr Tyr Asn Gly Asn Gly Asn Ser
530 535 540
Gln Gly Asn Pro Trp Phe Leu Ala Val Thr Gly Tyr Ala Glu Leu Tyr
545 550 555 560
Tyr Arg Ala Ile Lys Glu Trp Ile Ser Asn Gly Gly Val Thr Val Ser
565 570 575
Ser Ile Ser Leu Pro Phe Phe Lys Lys Phe Asp Ser Ser Ala Thr Ser
580 585 590
Gly Lys Lys Tyr Thr Val Gly Thr Ser Asp Phe Asn Asn Leu Ala Gln
595 600 605
Asn Ile Ala Leu Ala Ala Asp Arg Phe Leu Ser Thr Val Gln Leu His
610 615 620
Ala Pro Asn Asn Gly Ser Leu Ala Glu Glu Phe Asp Arg Thr Thr Gly
625 630 635 640
Phe Ser Thr Gly Ala Arg Asp Leu Thr Trp Ser His Ala Ser Leu Ile
645 650 655
Thr Ala Ser Tyr Ala Lys Ala Gly Ala Pro Ala Ala
660 665
<210> 53
<211> 640
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 53
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Leu Glu Gly
20 25 30
Asp Phe Asp Val Ala Val Leu Pro Phe Ser Ala Ser Ile Ala Ala Lys
35 40 45
Glu Glu Gly Val Ser Leu Glu Lys Arg Glu Ala Glu Ala Ala Ser Ile
50 55 60
Pro Ser Ser Ala Ser Val Gln Leu Asp Ser Tyr Asn Tyr Asp Gly Ser
65 70 75 80
Thr Phe Ser Gly Lys Ile Tyr Val Lys Asn Ile Ala Tyr Ser Lys Lys
85 90 95
Val Thr Val Val Tyr Ala Asp Gly Ser Asp Asn Trp Asn Asn Asn Gly
100 105 110
Asn Thr Ile Ala Ala Ser Phe Ser Gly Pro Ile Ser Gly Ser Asn Tyr
115 120 125
Glu Tyr Trp Thr Phe Ser Ala Ser Val Lys Gly Ile Lys Glu Phe Tyr
130 135 140
Ile Lys Tyr Glu Val Ser Gly Lys Thr Tyr Tyr Asp Asn Asn Asn Ser
145 150 155 160
Ala Asn Tyr Gln Val Ser Thr Ser Lys Pro Thr Thr Thr Thr Ala Ala
165 170 175
Thr Thr Thr Thr Thr Ala Pro Ser Thr Ser Thr Thr Thr Arg Pro Ser
180 185 190
Ser Ser Glu Pro Ala Thr Phe Pro Thr Gly Asn Ser Thr Ile Ser Ser
195 200 205
Trp Ile Lys Lys Gln Glu Asp Ile Ser Arg Phe Ala Met Leu Arg Asn
210 215 220
Ile Asn Pro Pro Gly Ser Ala Thr Gly Phe Ile Ala Ala Ser Leu Ser
225 230 235 240
Thr Ala Gly Pro Asp Tyr Tyr Tyr Ala Trp Thr Arg Asp Ala Ala Leu
245 250 255
Thr Ser Asn Val Ile Val Tyr Glu Tyr Asn Thr Thr Leu Ser Gly Asn
260 265 270
Lys Thr Ile Leu Asn Val Leu Lys Asp Tyr Val Thr Phe Ser Val Lys
275 280 285
Thr Gln Ser Thr Ser Thr Val Cys Asn Cys Leu Gly Glu Pro Lys Phe
290 295 300
Asn Pro Asp Gly Ser Gly Tyr Thr Gly Ala Trp Gly Arg Pro Gln Asn
305 310 315 320
Asp Gly Pro Ala Glu Arg Ala Thr Thr Phe Val Leu Phe Ala Asp Ser
325 330 335
Tyr Leu Thr Gln Thr Lys Asp Ala Ser Tyr Val Thr Gly Thr Leu Lys
340 345 350
Pro Ala Ile Phe Lys Asp Leu Asp Tyr Val Val Asn Val Trp Ser Asn
355 360 365
Gly Cys Phe Asp Leu Trp Glu Glu Val Asn Gly Val His Phe Tyr Thr
370 375 380
Leu Met Val Met Arg Lys Gly Leu Leu Leu Gly Ala Asp Phe Ala Lys
385 390 395 400
Arg Asn Gly Asp Ser Thr Arg Ala Ser Thr Tyr Ser Ser Thr Ala Ser
405 410 415
Thr Ile Ala Asn Lys Ile Ser Ser Phe Trp Val Ser Ser Asn Asn Trp
420 425 430
Val Gln Val Ser Gln Ser Val Thr Gly Gly Val Ser Lys Lys Gly Leu
435 440 445
Asp Val Ser Thr Leu Leu Ala Ala Asn Leu Gly Ser Val Asp Asp Gly
450 455 460
Phe Phe Thr Pro Gly Ser Glu Lys Ile Leu Ala Thr Ala Val Ala Val
465 470 475 480
Glu Asp Ser Phe Ala Ser Leu Tyr Pro Ile Asn Lys Asn Leu Pro Ser
485 490 495
Tyr Leu Gly Asn Ala Ile Gly Arg Tyr Pro Glu Asp Thr Tyr Asn Gly
500 505 510
Asn Gly Asn Ser Gln Gly Asn Pro Trp Phe Leu Ala Val Thr Gly Tyr
515 520 525
Ala Glu Leu Tyr Tyr Arg Ala Ile Lys Glu Trp Ile Ser Asn Gly Gly
530 535 540
Val Thr Val Ser Ser Ile Ser Leu Pro Phe Phe Lys Lys Phe Asp Ser
545 550 555 560
Ser Ala Thr Ser Gly Lys Lys Tyr Thr Val Gly Thr Ser Asp Phe Asn
565 570 575
Asn Leu Ala Gln Asn Ile Ala Leu Ala Ala Asp Arg Phe Leu Ser Thr
580 585 590
Val Gln Leu His Ala Pro Asn Asn Gly Ser Leu Ala Glu Glu Phe Asp
595 600 605
Arg Thr Thr Gly Phe Ser Thr Gly Ala Arg Asp Leu Thr Trp Ser His
610 615 620
Ala Ser Leu Ile Thr Ala Ser Tyr Ala Lys Ala Gly Ala Pro Ala Ala
625 630 635 640
<210> 54
<211> 598
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 54
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Ser Ile Pro Ser Ser Ala Ser Val Gln Leu Asp Ser
20 25 30
Tyr Asn Tyr Asp Gly Ser Thr Phe Ser Gly Lys Ile Tyr Val Lys Asn
35 40 45
Ile Ala Tyr Ser Lys Lys Val Thr Val Val Tyr Ala Asp Gly Ser Asp
50 55 60
Asn Trp Asn Asn Asn Gly Asn Thr Ile Ala Ala Ser Phe Ser Gly Pro
65 70 75 80
Ile Ser Gly Ser Asn Tyr Glu Tyr Trp Thr Phe Ser Ala Ser Val Lys
85 90 95
Gly Ile Lys Glu Phe Tyr Ile Lys Tyr Glu Val Ser Gly Lys Thr Tyr
100 105 110
Tyr Asp Asn Asn Asn Ser Ala Asn Tyr Gln Val Ser Thr Ser Lys Pro
115 120 125
Thr Thr Thr Thr Ala Ala Thr Thr Thr Thr Thr Ala Pro Ser Thr Ser
130 135 140
Thr Thr Thr Arg Pro Ser Ser Ser Glu Pro Ala Thr Phe Pro Thr Gly
145 150 155 160
Asn Ser Thr Ile Ser Ser Trp Ile Lys Lys Gln Glu Asp Ile Ser Arg
165 170 175
Phe Ala Met Leu Arg Asn Ile Asn Pro Pro Gly Ser Ala Thr Gly Phe
180 185 190
Ile Ala Ala Ser Leu Ser Thr Ala Gly Pro Asp Tyr Tyr Tyr Ala Trp
195 200 205
Thr Arg Asp Ala Ala Leu Thr Ser Asn Val Ile Val Tyr Glu Tyr Asn
210 215 220
Thr Thr Leu Ser Gly Asn Lys Thr Ile Leu Asn Val Leu Lys Asp Tyr
225 230 235 240
Val Thr Phe Ser Val Lys Thr Gln Ser Thr Ser Thr Val Cys Asn Cys
245 250 255
Leu Gly Glu Pro Lys Phe Asn Pro Asp Gly Ser Gly Tyr Thr Gly Ala
260 265 270
Trp Gly Arg Pro Gln Asn Asp Gly Pro Ala Glu Arg Ala Thr Thr Phe
275 280 285
Val Leu Phe Ala Asp Ser Tyr Leu Thr Gln Thr Lys Asp Ala Ser Tyr
290 295 300
Val Thr Gly Thr Leu Lys Pro Ala Ile Phe Lys Asp Leu Asp Tyr Val
305 310 315 320
Val Asn Val Trp Ser Asn Gly Cys Phe Asp Leu Trp Glu Glu Val Asn
325 330 335
Gly Val His Phe Tyr Thr Leu Met Val Met Arg Lys Gly Leu Leu Leu
340 345 350
Gly Ala Asp Phe Ala Lys Arg Asn Gly Asp Ser Thr Arg Ala Ser Thr
355 360 365
Tyr Ser Ser Thr Ala Ser Thr Ile Ala Asn Lys Ile Ser Ser Phe Trp
370 375 380
Val Ser Ser Asn Asn Trp Val Gln Val Ser Gln Ser Val Thr Gly Gly
385 390 395 400
Val Ser Lys Lys Gly Leu Asp Val Ser Thr Leu Leu Ala Ala Asn Leu
405 410 415
Gly Ser Val Asp Asp Gly Phe Phe Thr Pro Gly Ser Glu Lys Ile Leu
420 425 430
Ala Thr Ala Val Ala Val Glu Asp Ser Phe Ala Ser Leu Tyr Pro Ile
435 440 445
Asn Lys Asn Leu Pro Ser Tyr Leu Gly Asn Ala Ile Gly Arg Tyr Pro
450 455 460
Glu Asp Thr Tyr Asn Gly Asn Gly Asn Ser Gln Gly Asn Pro Trp Phe
465 470 475 480
Leu Ala Val Thr Gly Tyr Ala Glu Leu Tyr Tyr Arg Ala Ile Lys Glu
485 490 495
Trp Ile Ser Asn Gly Gly Val Thr Val Ser Ser Ile Ser Leu Pro Phe
500 505 510
Phe Lys Lys Phe Asp Ser Ser Ala Thr Ser Gly Lys Lys Tyr Thr Val
515 520 525
Gly Thr Ser Asp Phe Asn Asn Leu Ala Gln Asn Ile Ala Leu Ala Ala
530 535 540
Asp Arg Phe Leu Ser Thr Val Gln Leu His Ala Pro Asn Asn Gly Ser
545 550 555 560
Leu Ala Glu Glu Phe Asp Arg Thr Thr Gly Phe Ser Thr Gly Ala Arg
565 570 575
Asp Leu Thr Trp Ser His Ala Ser Leu Ile Thr Ala Ser Tyr Ala Lys
580 585 590
Ala Gly Ala Pro Ala Ala
595
<210> 55
<211> 598
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 55
Met Leu Leu Gln Ala Phe Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys
1 5 10 15
Ile Ser Ala Ala Ser Ile Pro Ser Ser Ala Ser Val Gln Leu Asp Ser
20 25 30
Tyr Asn Tyr Asp Gly Ser Thr Phe Ser Gly Lys Ile Tyr Val Lys Asn
35 40 45
Ile Ala Tyr Ser Lys Lys Val Thr Val Val Tyr Ala Asp Gly Ser Asp
50 55 60
Asn Trp Asn Asn Asn Gly Asn Thr Ile Ala Ala Ser Phe Ser Gly Pro
65 70 75 80
Ile Ser Gly Ser Asn Tyr Glu Tyr Trp Thr Phe Ser Ala Ser Val Lys
85 90 95
Gly Ile Lys Glu Phe Tyr Ile Lys Tyr Glu Val Ser Gly Lys Thr Tyr
100 105 110
Tyr Asp Asn Asn Asn Ser Ala Asn Tyr Gln Val Ser Thr Ser Lys Pro
115 120 125
Thr Thr Thr Thr Ala Ala Thr Thr Thr Thr Thr Ala Pro Ser Thr Ser
130 135 140
Thr Thr Thr Arg Pro Ser Ser Ser Glu Pro Ala Thr Phe Pro Thr Gly
145 150 155 160
Asn Ser Thr Ile Ser Ser Trp Ile Lys Lys Gln Glu Asp Ile Ser Arg
165 170 175
Phe Ala Met Leu Arg Asn Ile Asn Pro Pro Gly Ser Ala Thr Gly Phe
180 185 190
Ile Ala Ala Ser Leu Ser Thr Ala Gly Pro Asp Tyr Tyr Tyr Ala Trp
195 200 205
Thr Arg Asp Ala Ala Leu Thr Ser Asn Val Ile Val Tyr Glu Tyr Asn
210 215 220
Thr Thr Leu Ser Gly Asn Lys Thr Ile Leu Asn Val Leu Lys Asp Tyr
225 230 235 240
Val Thr Phe Ser Val Lys Thr Gln Ser Thr Ser Thr Val Cys Asn Cys
245 250 255
Leu Gly Glu Pro Lys Phe Asn Pro Asp Gly Ser Gly Tyr Thr Gly Ala
260 265 270
Trp Gly Arg Pro Gln Asn Asp Gly Pro Ala Glu Arg Ala Thr Thr Phe
275 280 285
Val Leu Phe Ala Asp Ser Tyr Leu Thr Gln Thr Lys Asp Ala Ser Tyr
290 295 300
Val Thr Gly Thr Leu Lys Pro Ala Ile Phe Lys Asp Leu Asp Tyr Val
305 310 315 320
Val Asn Val Trp Ser Asn Gly Cys Phe Asp Leu Trp Glu Glu Val Asn
325 330 335
Gly Val His Phe Tyr Thr Leu Met Val Met Arg Lys Gly Leu Leu Leu
340 345 350
Gly Ala Asp Phe Ala Lys Arg Asn Gly Asp Ser Thr Arg Ala Ser Thr
355 360 365
Tyr Ser Ser Thr Ala Ser Thr Ile Ala Asn Lys Ile Ser Ser Phe Trp
370 375 380
Val Ser Ser Asn Asn Trp Val Gln Val Ser Gln Ser Val Thr Gly Gly
385 390 395 400
Val Ser Lys Lys Gly Leu Asp Val Ser Thr Leu Leu Ala Ala Asn Leu
405 410 415
Gly Ser Val Asp Asp Gly Phe Phe Thr Pro Gly Ser Glu Lys Ile Leu
420 425 430
Ala Thr Ala Val Ala Val Glu Asp Ser Phe Ala Ser Leu Tyr Pro Ile
435 440 445
Asn Lys Asn Leu Pro Ser Tyr Leu Gly Asn Ala Ile Gly Arg Tyr Pro
450 455 460
Glu Asp Thr Tyr Asn Gly Asn Gly Asn Ser Gln Gly Asn Pro Trp Phe
465 470 475 480
Leu Ala Val Thr Gly Tyr Ala Glu Leu Tyr Tyr Arg Ala Ile Lys Glu
485 490 495
Trp Ile Ser Asn Gly Gly Val Thr Val Ser Ser Ile Ser Leu Pro Phe
500 505 510
Phe Lys Lys Phe Asp Ser Ser Ala Thr Ser Gly Lys Lys Tyr Thr Val
515 520 525
Gly Thr Ser Asp Phe Asn Asn Leu Ala Gln Asn Ile Ala Leu Ala Ala
530 535 540
Asp Arg Phe Leu Ser Thr Val Gln Leu His Ala Pro Asn Asn Gly Ser
545 550 555 560
Leu Ala Glu Glu Phe Asp Arg Thr Thr Gly Phe Ser Thr Gly Ala Arg
565 570 575
Asp Leu Thr Trp Ser His Ala Ser Leu Ile Thr Ala Ser Tyr Ala Lys
580 585 590
Ala Gly Ala Pro Ala Ala
595
<210> 56
<211> 605
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 56
Met Leu Gly Lys Asn Asp Pro Met Cys Leu Val Leu Val Leu Leu Gly
1 5 10 15
Leu Thr Ala Leu Leu Gly Ile Cys Gln Gly Ala Ser Ile Pro Ser Ser
20 25 30
Ala Ser Val Gln Leu Asp Ser Tyr Asn Tyr Asp Gly Ser Thr Phe Ser
35 40 45
Gly Lys Ile Tyr Val Lys Asn Ile Ala Tyr Ser Lys Lys Val Thr Val
50 55 60
Val Tyr Ala Asp Gly Ser Asp Asn Trp Asn Asn Asn Gly Asn Thr Ile
65 70 75 80
Ala Ala Ser Phe Ser Gly Pro Ile Ser Gly Ser Asn Tyr Glu Tyr Trp
85 90 95
Thr Phe Ser Ala Ser Val Lys Gly Ile Lys Glu Phe Tyr Ile Lys Tyr
100 105 110
Glu Val Ser Gly Lys Thr Tyr Tyr Asp Asn Asn Asn Ser Ala Asn Tyr
115 120 125
Gln Val Ser Thr Ser Lys Pro Thr Thr Thr Thr Ala Ala Thr Thr Thr
130 135 140
Thr Thr Ala Pro Ser Thr Ser Thr Thr Thr Arg Pro Ser Ser Ser Glu
145 150 155 160
Pro Ala Thr Phe Pro Thr Gly Asn Ser Thr Ile Ser Ser Trp Ile Lys
165 170 175
Lys Gln Glu Asp Ile Ser Arg Phe Ala Met Leu Arg Asn Ile Asn Pro
180 185 190
Pro Gly Ser Ala Thr Gly Phe Ile Ala Ala Ser Leu Ser Thr Ala Gly
195 200 205
Pro Asp Tyr Tyr Tyr Ala Trp Thr Arg Asp Ala Ala Leu Thr Ser Asn
210 215 220
Val Ile Val Tyr Glu Tyr Asn Thr Thr Leu Ser Gly Asn Lys Thr Ile
225 230 235 240
Leu Asn Val Leu Lys Asp Tyr Val Thr Phe Ser Val Lys Thr Gln Ser
245 250 255
Thr Ser Thr Val Cys Asn Cys Leu Gly Glu Pro Lys Phe Asn Pro Asp
260 265 270
Gly Ser Gly Tyr Thr Gly Ala Trp Gly Arg Pro Gln Asn Asp Gly Pro
275 280 285
Ala Glu Arg Ala Thr Thr Phe Val Leu Phe Ala Asp Ser Tyr Leu Thr
290 295 300
Gln Thr Lys Asp Ala Ser Tyr Val Thr Gly Thr Leu Lys Pro Ala Ile
305 310 315 320
Phe Lys Asp Leu Asp Tyr Val Val Asn Val Trp Ser Asn Gly Cys Phe
325 330 335
Asp Leu Trp Glu Glu Val Asn Gly Val His Phe Tyr Thr Leu Met Val
340 345 350
Met Arg Lys Gly Leu Leu Leu Gly Ala Asp Phe Ala Lys Arg Asn Gly
355 360 365
Asp Ser Thr Arg Ala Ser Thr Tyr Ser Ser Thr Ala Ser Thr Ile Ala
370 375 380
Asn Lys Ile Ser Ser Phe Trp Val Ser Ser Asn Asn Trp Val Gln Val
385 390 395 400
Ser Gln Ser Val Thr Gly Gly Val Ser Lys Lys Gly Leu Asp Val Ser
405 410 415
Thr Leu Leu Ala Ala Asn Leu Gly Ser Val Asp Asp Gly Phe Phe Thr
420 425 430
Pro Gly Ser Glu Lys Ile Leu Ala Thr Ala Val Ala Val Glu Asp Ser
435 440 445
Phe Ala Ser Leu Tyr Pro Ile Asn Lys Asn Leu Pro Ser Tyr Leu Gly
450 455 460
Asn Ala Ile Gly Arg Tyr Pro Glu Asp Thr Tyr Asn Gly Asn Gly Asn
465 470 475 480
Ser Gln Gly Asn Pro Trp Phe Leu Ala Val Thr Gly Tyr Ala Glu Leu
485 490 495
Tyr Tyr Arg Ala Ile Lys Glu Trp Ile Ser Asn Gly Gly Val Thr Val
500 505 510
Ser Ser Ile Ser Leu Pro Phe Phe Lys Lys Phe Asp Ser Ser Ala Thr
515 520 525
Ser Gly Lys Lys Tyr Thr Val Gly Thr Ser Asp Phe Asn Asn Leu Ala
530 535 540
Gln Asn Ile Ala Leu Ala Ala Asp Arg Phe Leu Ser Thr Val Gln Leu
545 550 555 560
His Ala Pro Asn Asn Gly Ser Leu Ala Glu Glu Phe Asp Arg Thr Thr
565 570 575
Gly Phe Ser Thr Gly Ala Arg Asp Leu Thr Trp Ser His Ala Ser Leu
580 585 590
Ile Thr Ala Ser Tyr Ala Lys Ala Gly Ala Pro Ala Ala
595 600 605
<210> 57
<211> 597
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 57
Met Lys Trp Val Thr Phe Ile Ser Leu Leu Phe Leu Phe Ser Ser Ala
1 5 10 15
Tyr Ser Ala Ser Ile Pro Ser Ser Ala Ser Val Gln Leu Asp Ser Tyr
20 25 30
Asn Tyr Asp Gly Ser Thr Phe Ser Gly Lys Ile Tyr Val Lys Asn Ile
35 40 45
Ala Tyr Ser Lys Lys Val Thr Val Val Tyr Ala Asp Gly Ser Asp Asn
50 55 60
Trp Asn Asn Asn Gly Asn Thr Ile Ala Ala Ser Phe Ser Gly Pro Ile
65 70 75 80
Ser Gly Ser Asn Tyr Glu Tyr Trp Thr Phe Ser Ala Ser Val Lys Gly
85 90 95
Ile Lys Glu Phe Tyr Ile Lys Tyr Glu Val Ser Gly Lys Thr Tyr Tyr
100 105 110
Asp Asn Asn Asn Ser Ala Asn Tyr Gln Val Ser Thr Ser Lys Pro Thr
115 120 125
Thr Thr Thr Ala Ala Thr Thr Thr Thr Thr Ala Pro Ser Thr Ser Thr
130 135 140
Thr Thr Arg Pro Ser Ser Ser Glu Pro Ala Thr Phe Pro Thr Gly Asn
145 150 155 160
Ser Thr Ile Ser Ser Trp Ile Lys Lys Gln Glu Asp Ile Ser Arg Phe
165 170 175
Ala Met Leu Arg Asn Ile Asn Pro Pro Gly Ser Ala Thr Gly Phe Ile
180 185 190
Ala Ala Ser Leu Ser Thr Ala Gly Pro Asp Tyr Tyr Tyr Ala Trp Thr
195 200 205
Arg Asp Ala Ala Leu Thr Ser Asn Val Ile Val Tyr Glu Tyr Asn Thr
210 215 220
Thr Leu Ser Gly Asn Lys Thr Ile Leu Asn Val Leu Lys Asp Tyr Val
225 230 235 240
Thr Phe Ser Val Lys Thr Gln Ser Thr Ser Thr Val Cys Asn Cys Leu
245 250 255
Gly Glu Pro Lys Phe Asn Pro Asp Gly Ser Gly Tyr Thr Gly Ala Trp
260 265 270
Gly Arg Pro Gln Asn Asp Gly Pro Ala Glu Arg Ala Thr Thr Phe Val
275 280 285
Leu Phe Ala Asp Ser Tyr Leu Thr Gln Thr Lys Asp Ala Ser Tyr Val
290 295 300
Thr Gly Thr Leu Lys Pro Ala Ile Phe Lys Asp Leu Asp Tyr Val Val
305 310 315 320
Asn Val Trp Ser Asn Gly Cys Phe Asp Leu Trp Glu Glu Val Asn Gly
325 330 335
Val His Phe Tyr Thr Leu Met Val Met Arg Lys Gly Leu Leu Leu Gly
340 345 350
Ala Asp Phe Ala Lys Arg Asn Gly Asp Ser Thr Arg Ala Ser Thr Tyr
355 360 365
Ser Ser Thr Ala Ser Thr Ile Ala Asn Lys Ile Ser Ser Phe Trp Val
370 375 380
Ser Ser Asn Asn Trp Val Gln Val Ser Gln Ser Val Thr Gly Gly Val
385 390 395 400
Ser Lys Lys Gly Leu Asp Val Ser Thr Leu Leu Ala Ala Asn Leu Gly
405 410 415
Ser Val Asp Asp Gly Phe Phe Thr Pro Gly Ser Glu Lys Ile Leu Ala
420 425 430
Thr Ala Val Ala Val Glu Asp Ser Phe Ala Ser Leu Tyr Pro Ile Asn
435 440 445
Lys Asn Leu Pro Ser Tyr Leu Gly Asn Ala Ile Gly Arg Tyr Pro Glu
450 455 460
Asp Thr Tyr Asn Gly Asn Gly Asn Ser Gln Gly Asn Pro Trp Phe Leu
465 470 475 480
Ala Val Thr Gly Tyr Ala Glu Leu Tyr Tyr Arg Ala Ile Lys Glu Trp
485 490 495
Ile Ser Asn Gly Gly Val Thr Val Ser Ser Ile Ser Leu Pro Phe Phe
500 505 510
Lys Lys Phe Asp Ser Ser Ala Thr Ser Gly Lys Lys Tyr Thr Val Gly
515 520 525
Thr Ser Asp Phe Asn Asn Leu Ala Gln Asn Ile Ala Leu Ala Ala Asp
530 535 540
Arg Phe Leu Ser Thr Val Gln Leu His Ala Pro Asn Asn Gly Ser Leu
545 550 555 560
Ala Glu Glu Phe Asp Arg Thr Thr Gly Phe Ser Thr Gly Ala Arg Asp
565 570 575
Leu Thr Trp Ser His Ala Ser Leu Ile Thr Ala Ser Tyr Ala Lys Ala
580 585 590
Gly Ala Pro Ala Ala
595
<210> 58
<211> 598
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 58
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Ser Ile Pro Ser Ser Ala Ser Val Gln Leu Asp Ser
20 25 30
Tyr Asn Tyr Asp Gly Ser Thr Phe Ser Gly Lys Ile Tyr Val Lys Asn
35 40 45
Ile Ala Tyr Ser Lys Lys Val Thr Val Val Tyr Ala Asp Gly Ser Asp
50 55 60
Asn Trp Asn Asn Asn Gly Asn Thr Ile Ala Ala Ser Phe Ser Gly Pro
65 70 75 80
Ile Ser Gly Ser Asn Tyr Glu Tyr Trp Thr Phe Ser Ala Ser Val Lys
85 90 95
Gly Ile Lys Glu Phe Tyr Ile Lys Tyr Glu Val Ser Gly Lys Thr Tyr
100 105 110
Tyr Asp Asn Asn Asn Ser Ala Asn Tyr Gln Val Ser Thr Ser Lys Pro
115 120 125
Thr Thr Thr Thr Ala Ala Thr Thr Thr Thr Thr Ala Pro Ser Thr Ser
130 135 140
Thr Thr Thr Arg Pro Ser Ser Ser Glu Pro Ala Thr Phe Pro Thr Gly
145 150 155 160
Asn Ser Thr Ile Ser Ser Trp Ile Lys Lys Gln Glu Asp Ile Ser Arg
165 170 175
Phe Ala Met Leu Arg Asn Ile Asn Pro Pro Gly Ser Ala Thr Gly Phe
180 185 190
Ile Ala Ala Ser Leu Ser Thr Ala Gly Pro Asp Tyr Tyr Tyr Ala Trp
195 200 205
Thr Arg Asp Ala Ala Leu Thr Ser Asn Val Ile Val Tyr Glu Tyr Asn
210 215 220
Thr Thr Leu Ser Gly Asn Lys Thr Ile Leu Asn Val Leu Lys Asp Tyr
225 230 235 240
Val Thr Phe Ser Val Lys Thr Gln Ser Thr Ser Thr Val Cys Asn Cys
245 250 255
Leu Gly Glu Pro Lys Phe Asn Pro Asp Gly Ser Gly Tyr Thr Gly Ala
260 265 270
Trp Gly Arg Pro Gln Asn Asp Gly Pro Ala Glu Arg Ala Thr Thr Phe
275 280 285
Val Leu Phe Ala Asp Ser Tyr Leu Thr Gln Thr Lys Asp Ala Ser Tyr
290 295 300
Val Thr Gly Thr Leu Lys Pro Ala Ile Phe Lys Asp Leu Asp Tyr Val
305 310 315 320
Val Asn Val Trp Ser Asn Gly Cys Phe Asp Leu Trp Glu Glu Val Asn
325 330 335
Gly Val His Phe Tyr Thr Leu Met Val Met Arg Lys Gly Leu Leu Leu
340 345 350
Gly Ala Asp Phe Ala Lys Arg Asn Gly Asp Ser Thr Arg Ala Ser Thr
355 360 365
Tyr Ser Ser Thr Ala Ser Thr Ile Ala Asn Lys Ile Ser Ser Phe Trp
370 375 380
Val Ser Ser Asn Asn Trp Val Gln Val Ser Gln Ser Val Thr Gly Gly
385 390 395 400
Val Ser Lys Lys Gly Leu Asp Val Ser Thr Leu Leu Ala Ala Asn Leu
405 410 415
Gly Ser Val Asp Asp Gly Phe Phe Thr Pro Gly Ser Glu Lys Ile Leu
420 425 430
Ala Thr Ala Val Ala Val Glu Asp Ser Phe Ala Ser Leu Tyr Pro Ile
435 440 445
Asn Lys Asn Leu Pro Ser Tyr Leu Gly Asn Ala Ile Gly Arg Tyr Pro
450 455 460
Glu Asp Thr Tyr Asn Gly Asn Gly Asn Ser Gln Gly Asn Pro Trp Phe
465 470 475 480
Leu Ala Val Thr Gly Tyr Ala Glu Leu Tyr Tyr Arg Ala Ile Lys Glu
485 490 495
Trp Ile Ser Asn Gly Gly Val Thr Val Ser Ser Ile Ser Leu Pro Phe
500 505 510
Phe Lys Lys Phe Asp Ser Ser Ala Thr Ser Gly Lys Lys Tyr Thr Val
515 520 525
Gly Thr Ser Asp Phe Asn Asn Leu Ala Gln Asn Ile Ala Leu Ala Ala
530 535 540
Asp Arg Phe Leu Ser Thr Val Gln Leu His Ala Pro Asn Asn Gly Ser
545 550 555 560
Leu Ala Glu Glu Phe Asp Arg Thr Thr Gly Phe Ser Thr Gly Ala Arg
565 570 575
Asp Leu Thr Trp Ser His Ala Ser Leu Ile Thr Ala Ser Tyr Ala Lys
580 585 590
Ala Gly Ala Pro Ala Ala
595
<210> 59
<211> 710
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 59
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Ser Val Ile Ser Lys Arg Ala
85 90 95
Thr Leu Asp Ser Trp Leu Ser Asn Glu Ala Thr Val Ala Arg Thr Ala
100 105 110
Ile Leu Asn Asn Ile Gly Ala Asp Gly Ala Trp Val Ser Gly Ala Asp
115 120 125
Ser Gly Ile Val Val Ala Ser Pro Ser Thr Asp Asn Pro Asp Tyr Phe
130 135 140
Tyr Thr Trp Thr Arg Asp Ser Gly Ile Val Leu Lys Thr Leu Val Asp
145 150 155 160
Leu Phe Arg Asn Gly Asp Thr Asp Leu Leu Ser Thr Ile Glu His Tyr
165 170 175
Ile Ser Ser Gln Ala Ile Ile Gln Gly Val Ser Asn Pro Ser Gly Asp
180 185 190
Leu Ser Ser Gly Gly Leu Gly Glu Pro Lys Phe Asn Val Asp Glu Thr
195 200 205
Ala Tyr Ala Gly Ser Trp Gly Arg Pro Gln Arg Asp Gly Pro Ala Leu
210 215 220
Arg Ala Thr Ala Met Ile Gly Phe Gly Gln Trp Leu Leu Asp Asn Gly
225 230 235 240
Tyr Thr Ser Ala Ala Thr Glu Ile Val Trp Pro Leu Val Arg Asn Asp
245 250 255
Leu Ser Tyr Val Ala Gln Tyr Trp Asn Gln Thr Gly Tyr Asp Leu Trp
260 265 270
Glu Glu Val Asn Gly Ser Ser Phe Phe Thr Ile Ala Val Gln His Arg
275 280 285
Ala Leu Val Glu Gly Ser Ala Phe Ala Thr Ala Val Gly Ser Ser Cys
290 295 300
Ser Trp Cys Asp Ser Gln Ala Pro Gln Ile Leu Cys Tyr Leu Gln Ser
305 310 315 320
Phe Trp Thr Gly Ser Tyr Ile Leu Ala Asn Phe Asp Ser Ser Arg Ser
325 330 335
Gly Lys Asp Thr Asn Thr Leu Leu Gly Ser Ile His Thr Phe Asp Pro
340 345 350
Glu Ala Gly Cys Asp Asp Ser Thr Phe Gln Pro Cys Ser Pro Arg Ala
355 360 365
Leu Ala Asn His Lys Glu Val Val Asp Ser Phe Arg Ser Ile Tyr Thr
370 375 380
Leu Asn Asp Gly Leu Ser Asp Ser Glu Ala Val Ala Val Gly Arg Tyr
385 390 395 400
Pro Glu Asp Ser Tyr Tyr Asn Gly Asn Pro Trp Phe Leu Cys Thr Leu
405 410 415
Ala Ala Ala Glu Gln Leu Tyr Asp Ala Leu Tyr Gln Trp Asp Lys Gln
420 425 430
Gly Ser Leu Glu Ile Thr Asp Val Ser Leu Asp Phe Phe Lys Ala Leu
435 440 445
Tyr Ser Gly Ala Ala Thr Gly Thr Tyr Ser Ser Ser Ser Ser Thr Tyr
450 455 460
Ser Ser Ile Val Ser Ala Val Lys Thr Phe Ala Asp Gly Phe Val Ser
465 470 475 480
Ile Val Glu Thr His Ala Ala Ser Asn Gly Ser Leu Ser Glu Gln Phe
485 490 495
Asp Lys Ser Asp Gly Asp Glu Leu Ser Ala Arg Asp Leu Thr Trp Ser
500 505 510
Tyr Ala Ala Leu Leu Thr Ala Asn Asn Arg Arg Asn Ser Val Val Pro
515 520 525
Pro Ser Trp Gly Glu Thr Ser Ala Ser Ser Val Pro Gly Thr Cys Ala
530 535 540
Ala Thr Ser Ala Ser Gly Thr Tyr Ser Ser Val Thr Val Thr Ser Trp
545 550 555 560
Pro Ser Ile Val Ala Thr Gly Gly Thr Thr Thr Thr Ala Thr Thr Thr
565 570 575
Gly Ser Gly Gly Val Thr Ser Thr Ser Lys Thr Thr Thr Thr Ala Ser
580 585 590
Lys Thr Ser Thr Thr Thr Ser Ser Thr Ser Cys Thr Thr Pro Thr Ala
595 600 605
Val Ala Val Thr Phe Asp Leu Thr Ala Thr Thr Thr Tyr Gly Glu Asn
610 615 620
Ile Tyr Leu Val Gly Ser Ile Ser Gln Leu Gly Asp Trp Glu Thr Ser
625 630 635 640
Asp Gly Ile Ala Leu Ser Ala Asp Lys Tyr Thr Ser Ser Asn Pro Pro
645 650 655
Trp Tyr Val Thr Val Thr Leu Pro Ala Gly Glu Ser Phe Glu Tyr Lys
660 665 670
Phe Ile Arg Val Glu Ser Asp Asp Ser Val Glu Trp Glu Ser Asp Pro
675 680 685
Asn Arg Glu Tyr Thr Val Pro Gln Ala Cys Gly Glu Ser Thr Ala Thr
690 695 700
Val Thr Asp Thr Trp Arg
705 710
<210> 60
<211> 682
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 60
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Leu Glu Gly
20 25 30
Asp Phe Asp Val Ala Val Leu Pro Phe Ser Ala Ser Ile Ala Ala Lys
35 40 45
Glu Glu Gly Val Ser Leu Glu Lys Arg Glu Ala Glu Ala Ser Val Ile
50 55 60
Ser Lys Arg Ala Thr Leu Asp Ser Trp Leu Ser Asn Glu Ala Thr Val
65 70 75 80
Ala Arg Thr Ala Ile Leu Asn Asn Ile Gly Ala Asp Gly Ala Trp Val
85 90 95
Ser Gly Ala Asp Ser Gly Ile Val Val Ala Ser Pro Ser Thr Asp Asn
100 105 110
Pro Asp Tyr Phe Tyr Thr Trp Thr Arg Asp Ser Gly Ile Val Leu Lys
115 120 125
Thr Leu Val Asp Leu Phe Arg Asn Gly Asp Thr Asp Leu Leu Ser Thr
130 135 140
Ile Glu His Tyr Ile Ser Ser Gln Ala Ile Ile Gln Gly Val Ser Asn
145 150 155 160
Pro Ser Gly Asp Leu Ser Ser Gly Gly Leu Gly Glu Pro Lys Phe Asn
165 170 175
Val Asp Glu Thr Ala Tyr Ala Gly Ser Trp Gly Arg Pro Gln Arg Asp
180 185 190
Gly Pro Ala Leu Arg Ala Thr Ala Met Ile Gly Phe Gly Gln Trp Leu
195 200 205
Leu Asp Asn Gly Tyr Thr Ser Ala Ala Thr Glu Ile Val Trp Pro Leu
210 215 220
Val Arg Asn Asp Leu Ser Tyr Val Ala Gln Tyr Trp Asn Gln Thr Gly
225 230 235 240
Tyr Asp Leu Trp Glu Glu Val Asn Gly Ser Ser Phe Phe Thr Ile Ala
245 250 255
Val Gln His Arg Ala Leu Val Glu Gly Ser Ala Phe Ala Thr Ala Val
260 265 270
Gly Ser Ser Cys Ser Trp Cys Asp Ser Gln Ala Pro Gln Ile Leu Cys
275 280 285
Tyr Leu Gln Ser Phe Trp Thr Gly Ser Tyr Ile Leu Ala Asn Phe Asp
290 295 300
Ser Ser Arg Ser Gly Lys Asp Thr Asn Thr Leu Leu Gly Ser Ile His
305 310 315 320
Thr Phe Asp Pro Glu Ala Gly Cys Asp Asp Ser Thr Phe Gln Pro Cys
325 330 335
Ser Pro Arg Ala Leu Ala Asn His Lys Glu Val Val Asp Ser Phe Arg
340 345 350
Ser Ile Tyr Thr Leu Asn Asp Gly Leu Ser Asp Ser Glu Ala Val Ala
355 360 365
Val Gly Arg Tyr Pro Glu Asp Ser Tyr Tyr Asn Gly Asn Pro Trp Phe
370 375 380
Leu Cys Thr Leu Ala Ala Ala Glu Gln Leu Tyr Asp Ala Leu Tyr Gln
385 390 395 400
Trp Asp Lys Gln Gly Ser Leu Glu Ile Thr Asp Val Ser Leu Asp Phe
405 410 415
Phe Lys Ala Leu Tyr Ser Gly Ala Ala Thr Gly Thr Tyr Ser Ser Ser
420 425 430
Ser Ser Thr Tyr Ser Ser Ile Val Ser Ala Val Lys Thr Phe Ala Asp
435 440 445
Gly Phe Val Ser Ile Val Glu Thr His Ala Ala Ser Asn Gly Ser Leu
450 455 460
Ser Glu Gln Phe Asp Lys Ser Asp Gly Asp Glu Leu Ser Ala Arg Asp
465 470 475 480
Leu Thr Trp Ser Tyr Ala Ala Leu Leu Thr Ala Asn Asn Arg Arg Asn
485 490 495
Ser Val Val Pro Pro Ser Trp Gly Glu Thr Ser Ala Ser Ser Val Pro
500 505 510
Gly Thr Cys Ala Ala Thr Ser Ala Ser Gly Thr Tyr Ser Ser Val Thr
515 520 525
Val Thr Ser Trp Pro Ser Ile Val Ala Thr Gly Gly Thr Thr Thr Thr
530 535 540
Ala Thr Thr Thr Gly Ser Gly Gly Val Thr Ser Thr Ser Lys Thr Thr
545 550 555 560
Thr Thr Ala Ser Lys Thr Ser Thr Thr Thr Ser Ser Thr Ser Cys Thr
565 570 575
Thr Pro Thr Ala Val Ala Val Thr Phe Asp Leu Thr Ala Thr Thr Thr
580 585 590
Tyr Gly Glu Asn Ile Tyr Leu Val Gly Ser Ile Ser Gln Leu Gly Asp
595 600 605
Trp Glu Thr Ser Asp Gly Ile Ala Leu Ser Ala Asp Lys Tyr Thr Ser
610 615 620
Ser Asn Pro Pro Trp Tyr Val Thr Val Thr Leu Pro Ala Gly Glu Ser
625 630 635 640
Phe Glu Tyr Lys Phe Ile Arg Val Glu Ser Asp Asp Ser Val Glu Trp
645 650 655
Glu Ser Asp Pro Asn Arg Glu Tyr Thr Val Pro Gln Ala Cys Gly Glu
660 665 670
Ser Thr Ala Thr Val Thr Asp Thr Trp Arg
675 680
<210> 61
<211> 640
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 61
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ser Val Ile Ser Lys Arg Ala Thr Leu Asp Ser Trp Leu
20 25 30
Ser Asn Glu Ala Thr Val Ala Arg Thr Ala Ile Leu Asn Asn Ile Gly
35 40 45
Ala Asp Gly Ala Trp Val Ser Gly Ala Asp Ser Gly Ile Val Val Ala
50 55 60
Ser Pro Ser Thr Asp Asn Pro Asp Tyr Phe Tyr Thr Trp Thr Arg Asp
65 70 75 80
Ser Gly Ile Val Leu Lys Thr Leu Val Asp Leu Phe Arg Asn Gly Asp
85 90 95
Thr Asp Leu Leu Ser Thr Ile Glu His Tyr Ile Ser Ser Gln Ala Ile
100 105 110
Ile Gln Gly Val Ser Asn Pro Ser Gly Asp Leu Ser Ser Gly Gly Leu
115 120 125
Gly Glu Pro Lys Phe Asn Val Asp Glu Thr Ala Tyr Ala Gly Ser Trp
130 135 140
Gly Arg Pro Gln Arg Asp Gly Pro Ala Leu Arg Ala Thr Ala Met Ile
145 150 155 160
Gly Phe Gly Gln Trp Leu Leu Asp Asn Gly Tyr Thr Ser Ala Ala Thr
165 170 175
Glu Ile Val Trp Pro Leu Val Arg Asn Asp Leu Ser Tyr Val Ala Gln
180 185 190
Tyr Trp Asn Gln Thr Gly Tyr Asp Leu Trp Glu Glu Val Asn Gly Ser
195 200 205
Ser Phe Phe Thr Ile Ala Val Gln His Arg Ala Leu Val Glu Gly Ser
210 215 220
Ala Phe Ala Thr Ala Val Gly Ser Ser Cys Ser Trp Cys Asp Ser Gln
225 230 235 240
Ala Pro Gln Ile Leu Cys Tyr Leu Gln Ser Phe Trp Thr Gly Ser Tyr
245 250 255
Ile Leu Ala Asn Phe Asp Ser Ser Arg Ser Gly Lys Asp Thr Asn Thr
260 265 270
Leu Leu Gly Ser Ile His Thr Phe Asp Pro Glu Ala Gly Cys Asp Asp
275 280 285
Ser Thr Phe Gln Pro Cys Ser Pro Arg Ala Leu Ala Asn His Lys Glu
290 295 300
Val Val Asp Ser Phe Arg Ser Ile Tyr Thr Leu Asn Asp Gly Leu Ser
305 310 315 320
Asp Ser Glu Ala Val Ala Val Gly Arg Tyr Pro Glu Asp Ser Tyr Tyr
325 330 335
Asn Gly Asn Pro Trp Phe Leu Cys Thr Leu Ala Ala Ala Glu Gln Leu
340 345 350
Tyr Asp Ala Leu Tyr Gln Trp Asp Lys Gln Gly Ser Leu Glu Ile Thr
355 360 365
Asp Val Ser Leu Asp Phe Phe Lys Ala Leu Tyr Ser Gly Ala Ala Thr
370 375 380
Gly Thr Tyr Ser Ser Ser Ser Ser Thr Tyr Ser Ser Ile Val Ser Ala
385 390 395 400
Val Lys Thr Phe Ala Asp Gly Phe Val Ser Ile Val Glu Thr His Ala
405 410 415
Ala Ser Asn Gly Ser Leu Ser Glu Gln Phe Asp Lys Ser Asp Gly Asp
420 425 430
Glu Leu Ser Ala Arg Asp Leu Thr Trp Ser Tyr Ala Ala Leu Leu Thr
435 440 445
Ala Asn Asn Arg Arg Asn Ser Val Val Pro Pro Ser Trp Gly Glu Thr
450 455 460
Ser Ala Ser Ser Val Pro Gly Thr Cys Ala Ala Thr Ser Ala Ser Gly
465 470 475 480
Thr Tyr Ser Ser Val Thr Val Thr Ser Trp Pro Ser Ile Val Ala Thr
485 490 495
Gly Gly Thr Thr Thr Thr Ala Thr Thr Thr Gly Ser Gly Gly Val Thr
500 505 510
Ser Thr Ser Lys Thr Thr Thr Thr Ala Ser Lys Thr Ser Thr Thr Thr
515 520 525
Ser Ser Thr Ser Cys Thr Thr Pro Thr Ala Val Ala Val Thr Phe Asp
530 535 540
Leu Thr Ala Thr Thr Thr Tyr Gly Glu Asn Ile Tyr Leu Val Gly Ser
545 550 555 560
Ile Ser Gln Leu Gly Asp Trp Glu Thr Ser Asp Gly Ile Ala Leu Ser
565 570 575
Ala Asp Lys Tyr Thr Ser Ser Asn Pro Pro Trp Tyr Val Thr Val Thr
580 585 590
Leu Pro Ala Gly Glu Ser Phe Glu Tyr Lys Phe Ile Arg Val Glu Ser
595 600 605
Asp Asp Ser Val Glu Trp Glu Ser Asp Pro Asn Arg Glu Tyr Thr Val
610 615 620
Pro Gln Ala Cys Gly Glu Ser Thr Ala Thr Val Thr Asp Thr Trp Arg
625 630 635 640
<210> 62
<211> 640
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 62
Met Leu Leu Gln Ala Phe Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys
1 5 10 15
Ile Ser Ala Ser Val Ile Ser Lys Arg Ala Thr Leu Asp Ser Trp Leu
20 25 30
Ser Asn Glu Ala Thr Val Ala Arg Thr Ala Ile Leu Asn Asn Ile Gly
35 40 45
Ala Asp Gly Ala Trp Val Ser Gly Ala Asp Ser Gly Ile Val Val Ala
50 55 60
Ser Pro Ser Thr Asp Asn Pro Asp Tyr Phe Tyr Thr Trp Thr Arg Asp
65 70 75 80
Ser Gly Ile Val Leu Lys Thr Leu Val Asp Leu Phe Arg Asn Gly Asp
85 90 95
Thr Asp Leu Leu Ser Thr Ile Glu His Tyr Ile Ser Ser Gln Ala Ile
100 105 110
Ile Gln Gly Val Ser Asn Pro Ser Gly Asp Leu Ser Ser Gly Gly Leu
115 120 125
Gly Glu Pro Lys Phe Asn Val Asp Glu Thr Ala Tyr Ala Gly Ser Trp
130 135 140
Gly Arg Pro Gln Arg Asp Gly Pro Ala Leu Arg Ala Thr Ala Met Ile
145 150 155 160
Gly Phe Gly Gln Trp Leu Leu Asp Asn Gly Tyr Thr Ser Ala Ala Thr
165 170 175
Glu Ile Val Trp Pro Leu Val Arg Asn Asp Leu Ser Tyr Val Ala Gln
180 185 190
Tyr Trp Asn Gln Thr Gly Tyr Asp Leu Trp Glu Glu Val Asn Gly Ser
195 200 205
Ser Phe Phe Thr Ile Ala Val Gln His Arg Ala Leu Val Glu Gly Ser
210 215 220
Ala Phe Ala Thr Ala Val Gly Ser Ser Cys Ser Trp Cys Asp Ser Gln
225 230 235 240
Ala Pro Gln Ile Leu Cys Tyr Leu Gln Ser Phe Trp Thr Gly Ser Tyr
245 250 255
Ile Leu Ala Asn Phe Asp Ser Ser Arg Ser Gly Lys Asp Thr Asn Thr
260 265 270
Leu Leu Gly Ser Ile His Thr Phe Asp Pro Glu Ala Gly Cys Asp Asp
275 280 285
Ser Thr Phe Gln Pro Cys Ser Pro Arg Ala Leu Ala Asn His Lys Glu
290 295 300
Val Val Asp Ser Phe Arg Ser Ile Tyr Thr Leu Asn Asp Gly Leu Ser
305 310 315 320
Asp Ser Glu Ala Val Ala Val Gly Arg Tyr Pro Glu Asp Ser Tyr Tyr
325 330 335
Asn Gly Asn Pro Trp Phe Leu Cys Thr Leu Ala Ala Ala Glu Gln Leu
340 345 350
Tyr Asp Ala Leu Tyr Gln Trp Asp Lys Gln Gly Ser Leu Glu Ile Thr
355 360 365
Asp Val Ser Leu Asp Phe Phe Lys Ala Leu Tyr Ser Gly Ala Ala Thr
370 375 380
Gly Thr Tyr Ser Ser Ser Ser Ser Thr Tyr Ser Ser Ile Val Ser Ala
385 390 395 400
Val Lys Thr Phe Ala Asp Gly Phe Val Ser Ile Val Glu Thr His Ala
405 410 415
Ala Ser Asn Gly Ser Leu Ser Glu Gln Phe Asp Lys Ser Asp Gly Asp
420 425 430
Glu Leu Ser Ala Arg Asp Leu Thr Trp Ser Tyr Ala Ala Leu Leu Thr
435 440 445
Ala Asn Asn Arg Arg Asn Ser Val Val Pro Pro Ser Trp Gly Glu Thr
450 455 460
Ser Ala Ser Ser Val Pro Gly Thr Cys Ala Ala Thr Ser Ala Ser Gly
465 470 475 480
Thr Tyr Ser Ser Val Thr Val Thr Ser Trp Pro Ser Ile Val Ala Thr
485 490 495
Gly Gly Thr Thr Thr Thr Ala Thr Thr Thr Gly Ser Gly Gly Val Thr
500 505 510
Ser Thr Ser Lys Thr Thr Thr Thr Ala Ser Lys Thr Ser Thr Thr Thr
515 520 525
Ser Ser Thr Ser Cys Thr Thr Pro Thr Ala Val Ala Val Thr Phe Asp
530 535 540
Leu Thr Ala Thr Thr Thr Tyr Gly Glu Asn Ile Tyr Leu Val Gly Ser
545 550 555 560
Ile Ser Gln Leu Gly Asp Trp Glu Thr Ser Asp Gly Ile Ala Leu Ser
565 570 575
Ala Asp Lys Tyr Thr Ser Ser Asn Pro Pro Trp Tyr Val Thr Val Thr
580 585 590
Leu Pro Ala Gly Glu Ser Phe Glu Tyr Lys Phe Ile Arg Val Glu Ser
595 600 605
Asp Asp Ser Val Glu Trp Glu Ser Asp Pro Asn Arg Glu Tyr Thr Val
610 615 620
Pro Gln Ala Cys Gly Glu Ser Thr Ala Thr Val Thr Asp Thr Trp Arg
625 630 635 640
<210> 63
<211> 647
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 63
Met Leu Gly Lys Asn Asp Pro Met Cys Leu Val Leu Val Leu Leu Gly
1 5 10 15
Leu Thr Ala Leu Leu Gly Ile Cys Gln Gly Ser Val Ile Ser Lys Arg
20 25 30
Ala Thr Leu Asp Ser Trp Leu Ser Asn Glu Ala Thr Val Ala Arg Thr
35 40 45
Ala Ile Leu Asn Asn Ile Gly Ala Asp Gly Ala Trp Val Ser Gly Ala
50 55 60
Asp Ser Gly Ile Val Val Ala Ser Pro Ser Thr Asp Asn Pro Asp Tyr
65 70 75 80
Phe Tyr Thr Trp Thr Arg Asp Ser Gly Ile Val Leu Lys Thr Leu Val
85 90 95
Asp Leu Phe Arg Asn Gly Asp Thr Asp Leu Leu Ser Thr Ile Glu His
100 105 110
Tyr Ile Ser Ser Gln Ala Ile Ile Gln Gly Val Ser Asn Pro Ser Gly
115 120 125
Asp Leu Ser Ser Gly Gly Leu Gly Glu Pro Lys Phe Asn Val Asp Glu
130 135 140
Thr Ala Tyr Ala Gly Ser Trp Gly Arg Pro Gln Arg Asp Gly Pro Ala
145 150 155 160
Leu Arg Ala Thr Ala Met Ile Gly Phe Gly Gln Trp Leu Leu Asp Asn
165 170 175
Gly Tyr Thr Ser Ala Ala Thr Glu Ile Val Trp Pro Leu Val Arg Asn
180 185 190
Asp Leu Ser Tyr Val Ala Gln Tyr Trp Asn Gln Thr Gly Tyr Asp Leu
195 200 205
Trp Glu Glu Val Asn Gly Ser Ser Phe Phe Thr Ile Ala Val Gln His
210 215 220
Arg Ala Leu Val Glu Gly Ser Ala Phe Ala Thr Ala Val Gly Ser Ser
225 230 235 240
Cys Ser Trp Cys Asp Ser Gln Ala Pro Gln Ile Leu Cys Tyr Leu Gln
245 250 255
Ser Phe Trp Thr Gly Ser Tyr Ile Leu Ala Asn Phe Asp Ser Ser Arg
260 265 270
Ser Gly Lys Asp Thr Asn Thr Leu Leu Gly Ser Ile His Thr Phe Asp
275 280 285
Pro Glu Ala Gly Cys Asp Asp Ser Thr Phe Gln Pro Cys Ser Pro Arg
290 295 300
Ala Leu Ala Asn His Lys Glu Val Val Asp Ser Phe Arg Ser Ile Tyr
305 310 315 320
Thr Leu Asn Asp Gly Leu Ser Asp Ser Glu Ala Val Ala Val Gly Arg
325 330 335
Tyr Pro Glu Asp Ser Tyr Tyr Asn Gly Asn Pro Trp Phe Leu Cys Thr
340 345 350
Leu Ala Ala Ala Glu Gln Leu Tyr Asp Ala Leu Tyr Gln Trp Asp Lys
355 360 365
Gln Gly Ser Leu Glu Ile Thr Asp Val Ser Leu Asp Phe Phe Lys Ala
370 375 380
Leu Tyr Ser Gly Ala Ala Thr Gly Thr Tyr Ser Ser Ser Ser Ser Thr
385 390 395 400
Tyr Ser Ser Ile Val Ser Ala Val Lys Thr Phe Ala Asp Gly Phe Val
405 410 415
Ser Ile Val Glu Thr His Ala Ala Ser Asn Gly Ser Leu Ser Glu Gln
420 425 430
Phe Asp Lys Ser Asp Gly Asp Glu Leu Ser Ala Arg Asp Leu Thr Trp
435 440 445
Ser Tyr Ala Ala Leu Leu Thr Ala Asn Asn Arg Arg Asn Ser Val Val
450 455 460
Pro Pro Ser Trp Gly Glu Thr Ser Ala Ser Ser Val Pro Gly Thr Cys
465 470 475 480
Ala Ala Thr Ser Ala Ser Gly Thr Tyr Ser Ser Val Thr Val Thr Ser
485 490 495
Trp Pro Ser Ile Val Ala Thr Gly Gly Thr Thr Thr Thr Ala Thr Thr
500 505 510
Thr Gly Ser Gly Gly Val Thr Ser Thr Ser Lys Thr Thr Thr Thr Ala
515 520 525
Ser Lys Thr Ser Thr Thr Thr Ser Ser Thr Ser Cys Thr Thr Pro Thr
530 535 540
Ala Val Ala Val Thr Phe Asp Leu Thr Ala Thr Thr Thr Tyr Gly Glu
545 550 555 560
Asn Ile Tyr Leu Val Gly Ser Ile Ser Gln Leu Gly Asp Trp Glu Thr
565 570 575
Ser Asp Gly Ile Ala Leu Ser Ala Asp Lys Tyr Thr Ser Ser Asn Pro
580 585 590
Pro Trp Tyr Val Thr Val Thr Leu Pro Ala Gly Glu Ser Phe Glu Tyr
595 600 605
Lys Phe Ile Arg Val Glu Ser Asp Asp Ser Val Glu Trp Glu Ser Asp
610 615 620
Pro Asn Arg Glu Tyr Thr Val Pro Gln Ala Cys Gly Glu Ser Thr Ala
625 630 635 640
Thr Val Thr Asp Thr Trp Arg
645
<210> 64
<211> 639
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 64
Met Lys Trp Val Thr Phe Ile Ser Leu Leu Phe Leu Phe Ser Ser Ala
1 5 10 15
Tyr Ser Ser Val Ile Ser Lys Arg Ala Thr Leu Asp Ser Trp Leu Ser
20 25 30
Asn Glu Ala Thr Val Ala Arg Thr Ala Ile Leu Asn Asn Ile Gly Ala
35 40 45
Asp Gly Ala Trp Val Ser Gly Ala Asp Ser Gly Ile Val Val Ala Ser
50 55 60
Pro Ser Thr Asp Asn Pro Asp Tyr Phe Tyr Thr Trp Thr Arg Asp Ser
65 70 75 80
Gly Ile Val Leu Lys Thr Leu Val Asp Leu Phe Arg Asn Gly Asp Thr
85 90 95
Asp Leu Leu Ser Thr Ile Glu His Tyr Ile Ser Ser Gln Ala Ile Ile
100 105 110
Gln Gly Val Ser Asn Pro Ser Gly Asp Leu Ser Ser Gly Gly Leu Gly
115 120 125
Glu Pro Lys Phe Asn Val Asp Glu Thr Ala Tyr Ala Gly Ser Trp Gly
130 135 140
Arg Pro Gln Arg Asp Gly Pro Ala Leu Arg Ala Thr Ala Met Ile Gly
145 150 155 160
Phe Gly Gln Trp Leu Leu Asp Asn Gly Tyr Thr Ser Ala Ala Thr Glu
165 170 175
Ile Val Trp Pro Leu Val Arg Asn Asp Leu Ser Tyr Val Ala Gln Tyr
180 185 190
Trp Asn Gln Thr Gly Tyr Asp Leu Trp Glu Glu Val Asn Gly Ser Ser
195 200 205
Phe Phe Thr Ile Ala Val Gln His Arg Ala Leu Val Glu Gly Ser Ala
210 215 220
Phe Ala Thr Ala Val Gly Ser Ser Cys Ser Trp Cys Asp Ser Gln Ala
225 230 235 240
Pro Gln Ile Leu Cys Tyr Leu Gln Ser Phe Trp Thr Gly Ser Tyr Ile
245 250 255
Leu Ala Asn Phe Asp Ser Ser Arg Ser Gly Lys Asp Thr Asn Thr Leu
260 265 270
Leu Gly Ser Ile His Thr Phe Asp Pro Glu Ala Gly Cys Asp Asp Ser
275 280 285
Thr Phe Gln Pro Cys Ser Pro Arg Ala Leu Ala Asn His Lys Glu Val
290 295 300
Val Asp Ser Phe Arg Ser Ile Tyr Thr Leu Asn Asp Gly Leu Ser Asp
305 310 315 320
Ser Glu Ala Val Ala Val Gly Arg Tyr Pro Glu Asp Ser Tyr Tyr Asn
325 330 335
Gly Asn Pro Trp Phe Leu Cys Thr Leu Ala Ala Ala Glu Gln Leu Tyr
340 345 350
Asp Ala Leu Tyr Gln Trp Asp Lys Gln Gly Ser Leu Glu Ile Thr Asp
355 360 365
Val Ser Leu Asp Phe Phe Lys Ala Leu Tyr Ser Gly Ala Ala Thr Gly
370 375 380
Thr Tyr Ser Ser Ser Ser Ser Thr Tyr Ser Ser Ile Val Ser Ala Val
385 390 395 400
Lys Thr Phe Ala Asp Gly Phe Val Ser Ile Val Glu Thr His Ala Ala
405 410 415
Ser Asn Gly Ser Leu Ser Glu Gln Phe Asp Lys Ser Asp Gly Asp Glu
420 425 430
Leu Ser Ala Arg Asp Leu Thr Trp Ser Tyr Ala Ala Leu Leu Thr Ala
435 440 445
Asn Asn Arg Arg Asn Ser Val Val Pro Pro Ser Trp Gly Glu Thr Ser
450 455 460
Ala Ser Ser Val Pro Gly Thr Cys Ala Ala Thr Ser Ala Ser Gly Thr
465 470 475 480
Tyr Ser Ser Val Thr Val Thr Ser Trp Pro Ser Ile Val Ala Thr Gly
485 490 495
Gly Thr Thr Thr Thr Ala Thr Thr Thr Gly Ser Gly Gly Val Thr Ser
500 505 510
Thr Ser Lys Thr Thr Thr Thr Ala Ser Lys Thr Ser Thr Thr Thr Ser
515 520 525
Ser Thr Ser Cys Thr Thr Pro Thr Ala Val Ala Val Thr Phe Asp Leu
530 535 540
Thr Ala Thr Thr Thr Tyr Gly Glu Asn Ile Tyr Leu Val Gly Ser Ile
545 550 555 560
Ser Gln Leu Gly Asp Trp Glu Thr Ser Asp Gly Ile Ala Leu Ser Ala
565 570 575
Asp Lys Tyr Thr Ser Ser Asn Pro Pro Trp Tyr Val Thr Val Thr Leu
580 585 590
Pro Ala Gly Glu Ser Phe Glu Tyr Lys Phe Ile Arg Val Glu Ser Asp
595 600 605
Asp Ser Val Glu Trp Glu Ser Asp Pro Asn Arg Glu Tyr Thr Val Pro
610 615 620
Gln Ala Cys Gly Glu Ser Thr Ala Thr Val Thr Asp Thr Trp Arg
625 630 635
<210> 65
<211> 640
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 65
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ser Val Ile Ser Lys Arg Ala Thr Leu Asp Ser Trp Leu
20 25 30
Ser Asn Glu Ala Thr Val Ala Arg Thr Ala Ile Leu Asn Asn Ile Gly
35 40 45
Ala Asp Gly Ala Trp Val Ser Gly Ala Asp Ser Gly Ile Val Val Ala
50 55 60
Ser Pro Ser Thr Asp Asn Pro Asp Tyr Phe Tyr Thr Trp Thr Arg Asp
65 70 75 80
Ser Gly Ile Val Leu Lys Thr Leu Val Asp Leu Phe Arg Asn Gly Asp
85 90 95
Thr Asp Leu Leu Ser Thr Ile Glu His Tyr Ile Ser Ser Gln Ala Ile
100 105 110
Ile Gln Gly Val Ser Asn Pro Ser Gly Asp Leu Ser Ser Gly Gly Leu
115 120 125
Gly Glu Pro Lys Phe Asn Val Asp Glu Thr Ala Tyr Ala Gly Ser Trp
130 135 140
Gly Arg Pro Gln Arg Asp Gly Pro Ala Leu Arg Ala Thr Ala Met Ile
145 150 155 160
Gly Phe Gly Gln Trp Leu Leu Asp Asn Gly Tyr Thr Ser Ala Ala Thr
165 170 175
Glu Ile Val Trp Pro Leu Val Arg Asn Asp Leu Ser Tyr Val Ala Gln
180 185 190
Tyr Trp Asn Gln Thr Gly Tyr Asp Leu Trp Glu Glu Val Asn Gly Ser
195 200 205
Ser Phe Phe Thr Ile Ala Val Gln His Arg Ala Leu Val Glu Gly Ser
210 215 220
Ala Phe Ala Thr Ala Val Gly Ser Ser Cys Ser Trp Cys Asp Ser Gln
225 230 235 240
Ala Pro Gln Ile Leu Cys Tyr Leu Gln Ser Phe Trp Thr Gly Ser Tyr
245 250 255
Ile Leu Ala Asn Phe Asp Ser Ser Arg Ser Gly Lys Asp Thr Asn Thr
260 265 270
Leu Leu Gly Ser Ile His Thr Phe Asp Pro Glu Ala Gly Cys Asp Asp
275 280 285
Ser Thr Phe Gln Pro Cys Ser Pro Arg Ala Leu Ala Asn His Lys Glu
290 295 300
Val Val Asp Ser Phe Arg Ser Ile Tyr Thr Leu Asn Asp Gly Leu Ser
305 310 315 320
Asp Ser Glu Ala Val Ala Val Gly Arg Tyr Pro Glu Asp Ser Tyr Tyr
325 330 335
Asn Gly Asn Pro Trp Phe Leu Cys Thr Leu Ala Ala Ala Glu Gln Leu
340 345 350
Tyr Asp Ala Leu Tyr Gln Trp Asp Lys Gln Gly Ser Leu Glu Ile Thr
355 360 365
Asp Val Ser Leu Asp Phe Phe Lys Ala Leu Tyr Ser Gly Ala Ala Thr
370 375 380
Gly Thr Tyr Ser Ser Ser Ser Ser Thr Tyr Ser Ser Ile Val Ser Ala
385 390 395 400
Val Lys Thr Phe Ala Asp Gly Phe Val Ser Ile Val Glu Thr His Ala
405 410 415
Ala Ser Asn Gly Ser Leu Ser Glu Gln Phe Asp Lys Ser Asp Gly Asp
420 425 430
Glu Leu Ser Ala Arg Asp Leu Thr Trp Ser Tyr Ala Ala Leu Leu Thr
435 440 445
Ala Asn Asn Arg Arg Asn Ser Val Val Pro Pro Ser Trp Gly Glu Thr
450 455 460
Ser Ala Ser Ser Val Pro Gly Thr Cys Ala Ala Thr Ser Ala Ser Gly
465 470 475 480
Thr Tyr Ser Ser Val Thr Val Thr Ser Trp Pro Ser Ile Val Ala Thr
485 490 495
Gly Gly Thr Thr Thr Thr Ala Thr Thr Thr Gly Ser Gly Gly Val Thr
500 505 510
Ser Thr Ser Lys Thr Thr Thr Thr Ala Ser Lys Thr Ser Thr Thr Thr
515 520 525
Ser Ser Thr Ser Cys Thr Thr Pro Thr Ala Val Ala Val Thr Phe Asp
530 535 540
Leu Thr Ala Thr Thr Thr Tyr Gly Glu Asn Ile Tyr Leu Val Gly Ser
545 550 555 560
Ile Ser Gln Leu Gly Asp Trp Glu Thr Ser Asp Gly Ile Ala Leu Ser
565 570 575
Ala Asp Lys Tyr Thr Ser Ser Asn Pro Pro Trp Tyr Val Thr Val Thr
580 585 590
Leu Pro Ala Gly Glu Ser Phe Glu Tyr Lys Phe Ile Arg Val Glu Ser
595 600 605
Asp Asp Ser Val Glu Trp Glu Ser Asp Pro Asn Arg Glu Tyr Thr Val
610 615 620
Pro Gln Ala Cys Gly Glu Ser Thr Ala Thr Val Thr Asp Thr Trp Arg
625 630 635 640
<210> 66
<211> 705
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 66
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Ala Pro Gln Leu Ala Pro Arg
85 90 95
Ala Thr Thr Ser Leu Asp Ala Trp Leu Ala Ser Glu Thr Thr Val Ala
100 105 110
Leu Asp Gly Ile Leu Asp Asn Val Gly Ser Ser Gly Ala Tyr Ala Lys
115 120 125
Ser Ala Lys Ser Gly Ile Val Ile Ala Ser Pro Ser Thr Ser Asp Pro
130 135 140
Asp Tyr Tyr Tyr Thr Trp Thr Arg Asp Ala Ala Leu Thr Val Lys Ala
145 150 155 160
Leu Ile Asp Leu Phe Arg Asn Gly Glu Thr Ser Leu Gln Thr Val Ile
165 170 175
Met Glu Tyr Ile Ser Ser Gln Ala Tyr Leu Gln Thr Val Ser Asn Pro
180 185 190
Ser Gly Ser Leu Ser Thr Gly Gly Leu Ala Glu Pro Lys Tyr Tyr Val
195 200 205
Asp Glu Thr Ala Tyr Thr Gly Ser Trp Gly Arg Pro Gln Arg Asp Gly
210 215 220
Pro Ala Leu Arg Ala Thr Ala Met Ile Asp Phe Gly Asn Trp Leu Ile
225 230 235 240
Asp Asn Gly Tyr Ser Thr Tyr Ala Ser Ser Ile Val Trp Pro Ile Val
245 250 255
Arg Asn Asp Leu Ser Tyr Val Ala Gln Tyr Trp Asn Gln Thr Gly Tyr
260 265 270
Asp Leu Trp Glu Glu Val Asn Gly Ser Ser Phe Phe Thr Ile Ala Val
275 280 285
Gln His Arg Ala Leu Val Glu Gly Ser Thr Phe Ala Ser Lys Val Gly
290 295 300
Ala Ser Cys Ser Trp Cys Asp Ser Gln Ala Pro Gln Val Leu Cys Phe
305 310 315 320
Leu Gln Arg Phe Trp Thr Gly Ser Tyr Ile Met Ala Asn Phe Gly Gly
325 330 335
Gly Arg Ser Gly Lys Asp Ala Asn Thr Val Leu Gly Ser Ile His Thr
340 345 350
Phe Asp Pro Asn Ala Gly Cys Asp Asp Thr Thr Phe Gln Pro Cys Ser
355 360 365
Pro Arg Ala Leu Ala Asn His Lys Val Tyr Thr Asp Ser Phe Arg Ser
370 375 380
Ile Tyr Ser Ile Asn Ser Gly Ile Ser Ser Gly Lys Ala Val Ala Val
385 390 395 400
Gly Arg Tyr Pro Glu Asp Ser Tyr Tyr Asn Gly Asn Pro Trp Phe Leu
405 410 415
Thr Thr Leu Ala Ala Ala Glu Gln Leu Tyr Asp Ala Ile Tyr Gln Trp
420 425 430
Gln Lys Ile Gly Ser Ile Thr Ile Thr Asp Val Ser Leu Ala Phe Phe
435 440 445
Lys Asp Leu Tyr Ser Ser Ala Ala Val Gly Thr Tyr Ala Ser Ser Ser
450 455 460
Ser Ala Phe Thr Ser Ile Val Ser Ala Val Lys Thr Tyr Ala Asp Gly
465 470 475 480
Tyr Met Ser Ile Val Gln Thr His Ala Met Thr Asn Gly Ser Leu Ser
485 490 495
Glu Gln Phe Gly Lys Ser Asp Gly Phe Ser Leu Ser Ala Arg Asp Leu
500 505 510
Thr Trp Ser Tyr Ala Ala Leu Leu Thr Ala Asn Leu Arg Arg Asn Ser
515 520 525
Val Val Pro Pro Ser Trp Gly Glu Thr Thr Ala Thr Ser Val Pro Ser
530 535 540
Val Cys Ser Ala Thr Ser Ala Thr Gly Thr Tyr Ser Thr Ala Thr Asn
545 550 555 560
Thr Ala Trp Pro Ser Thr Leu Thr Ser Gly Thr Gly Ala Thr Thr Thr
565 570 575
Thr Ser Lys Ala Thr Ser Ser Ser Thr Thr Thr Thr Ser Ser Ala Ser
580 585 590
Ser Thr Thr Val Glu Cys Val Val Pro Thr Ala Val Ala Val Thr Phe
595 600 605
Asp Glu Val Ala Thr Thr Thr Tyr Gly Glu Asn Val Tyr Val Val Gly
610 615 620
Ser Ile Ser Gln Leu Gly Ser Trp Asp Thr Ser Lys Ala Val Ala Leu
625 630 635 640
Ser Ala Ser Lys Tyr Thr Ser Ser Asn Asn Leu Trp Tyr Val Thr Val
645 650 655
Thr Leu Pro Ala Gly Thr Thr Phe Gln Tyr Lys Phe Ile Arg Val Ser
660 665 670
Ser Ser Gly Ser Val Thr Trp Glu Ser Asp Pro Asn Arg Ser Tyr Thr
675 680 685
Val Pro Ser Ala Cys Gly Thr Ser Thr Ala Val Val Asn Thr Thr Trp
690 695 700
Arg
705
<210> 67
<211> 677
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 67
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Leu Glu Gly
20 25 30
Asp Phe Asp Val Ala Val Leu Pro Phe Ser Ala Ser Ile Ala Ala Lys
35 40 45
Glu Glu Gly Val Ser Leu Glu Lys Arg Glu Ala Glu Ala Ala Pro Gln
50 55 60
Leu Ala Pro Arg Ala Thr Thr Ser Leu Asp Ala Trp Leu Ala Ser Glu
65 70 75 80
Thr Thr Val Ala Leu Asp Gly Ile Leu Asp Asn Val Gly Ser Ser Gly
85 90 95
Ala Tyr Ala Lys Ser Ala Lys Ser Gly Ile Val Ile Ala Ser Pro Ser
100 105 110
Thr Ser Asp Pro Asp Tyr Tyr Tyr Thr Trp Thr Arg Asp Ala Ala Leu
115 120 125
Thr Val Lys Ala Leu Ile Asp Leu Phe Arg Asn Gly Glu Thr Ser Leu
130 135 140
Gln Thr Val Ile Met Glu Tyr Ile Ser Ser Gln Ala Tyr Leu Gln Thr
145 150 155 160
Val Ser Asn Pro Ser Gly Ser Leu Ser Thr Gly Gly Leu Ala Glu Pro
165 170 175
Lys Tyr Tyr Val Asp Glu Thr Ala Tyr Thr Gly Ser Trp Gly Arg Pro
180 185 190
Gln Arg Asp Gly Pro Ala Leu Arg Ala Thr Ala Met Ile Asp Phe Gly
195 200 205
Asn Trp Leu Ile Asp Asn Gly Tyr Ser Thr Tyr Ala Ser Ser Ile Val
210 215 220
Trp Pro Ile Val Arg Asn Asp Leu Ser Tyr Val Ala Gln Tyr Trp Asn
225 230 235 240
Gln Thr Gly Tyr Asp Leu Trp Glu Glu Val Asn Gly Ser Ser Phe Phe
245 250 255
Thr Ile Ala Val Gln His Arg Ala Leu Val Glu Gly Ser Thr Phe Ala
260 265 270
Ser Lys Val Gly Ala Ser Cys Ser Trp Cys Asp Ser Gln Ala Pro Gln
275 280 285
Val Leu Cys Phe Leu Gln Arg Phe Trp Thr Gly Ser Tyr Ile Met Ala
290 295 300
Asn Phe Gly Gly Gly Arg Ser Gly Lys Asp Ala Asn Thr Val Leu Gly
305 310 315 320
Ser Ile His Thr Phe Asp Pro Asn Ala Gly Cys Asp Asp Thr Thr Phe
325 330 335
Gln Pro Cys Ser Pro Arg Ala Leu Ala Asn His Lys Val Tyr Thr Asp
340 345 350
Ser Phe Arg Ser Ile Tyr Ser Ile Asn Ser Gly Ile Ser Ser Gly Lys
355 360 365
Ala Val Ala Val Gly Arg Tyr Pro Glu Asp Ser Tyr Tyr Asn Gly Asn
370 375 380
Pro Trp Phe Leu Thr Thr Leu Ala Ala Ala Glu Gln Leu Tyr Asp Ala
385 390 395 400
Ile Tyr Gln Trp Gln Lys Ile Gly Ser Ile Thr Ile Thr Asp Val Ser
405 410 415
Leu Ala Phe Phe Lys Asp Leu Tyr Ser Ser Ala Ala Val Gly Thr Tyr
420 425 430
Ala Ser Ser Ser Ser Ala Phe Thr Ser Ile Val Ser Ala Val Lys Thr
435 440 445
Tyr Ala Asp Gly Tyr Met Ser Ile Val Gln Thr His Ala Met Thr Asn
450 455 460
Gly Ser Leu Ser Glu Gln Phe Gly Lys Ser Asp Gly Phe Ser Leu Ser
465 470 475 480
Ala Arg Asp Leu Thr Trp Ser Tyr Ala Ala Leu Leu Thr Ala Asn Leu
485 490 495
Arg Arg Asn Ser Val Val Pro Pro Ser Trp Gly Glu Thr Thr Ala Thr
500 505 510
Ser Val Pro Ser Val Cys Ser Ala Thr Ser Ala Thr Gly Thr Tyr Ser
515 520 525
Thr Ala Thr Asn Thr Ala Trp Pro Ser Thr Leu Thr Ser Gly Thr Gly
530 535 540
Ala Thr Thr Thr Thr Ser Lys Ala Thr Ser Ser Ser Thr Thr Thr Thr
545 550 555 560
Ser Ser Ala Ser Ser Thr Thr Val Glu Cys Val Val Pro Thr Ala Val
565 570 575
Ala Val Thr Phe Asp Glu Val Ala Thr Thr Thr Tyr Gly Glu Asn Val
580 585 590
Tyr Val Val Gly Ser Ile Ser Gln Leu Gly Ser Trp Asp Thr Ser Lys
595 600 605
Ala Val Ala Leu Ser Ala Ser Lys Tyr Thr Ser Ser Asn Asn Leu Trp
610 615 620
Tyr Val Thr Val Thr Leu Pro Ala Gly Thr Thr Phe Gln Tyr Lys Phe
625 630 635 640
Ile Arg Val Ser Ser Ser Gly Ser Val Thr Trp Glu Ser Asp Pro Asn
645 650 655
Arg Ser Tyr Thr Val Pro Ser Ala Cys Gly Thr Ser Thr Ala Val Val
660 665 670
Asn Thr Thr Trp Arg
675
<210> 68
<211> 635
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 68
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Gln Leu Ala Pro Arg Ala Thr Thr Ser Leu Asp
20 25 30
Ala Trp Leu Ala Ser Glu Thr Thr Val Ala Leu Asp Gly Ile Leu Asp
35 40 45
Asn Val Gly Ser Ser Gly Ala Tyr Ala Lys Ser Ala Lys Ser Gly Ile
50 55 60
Val Ile Ala Ser Pro Ser Thr Ser Asp Pro Asp Tyr Tyr Tyr Thr Trp
65 70 75 80
Thr Arg Asp Ala Ala Leu Thr Val Lys Ala Leu Ile Asp Leu Phe Arg
85 90 95
Asn Gly Glu Thr Ser Leu Gln Thr Val Ile Met Glu Tyr Ile Ser Ser
100 105 110
Gln Ala Tyr Leu Gln Thr Val Ser Asn Pro Ser Gly Ser Leu Ser Thr
115 120 125
Gly Gly Leu Ala Glu Pro Lys Tyr Tyr Val Asp Glu Thr Ala Tyr Thr
130 135 140
Gly Ser Trp Gly Arg Pro Gln Arg Asp Gly Pro Ala Leu Arg Ala Thr
145 150 155 160
Ala Met Ile Asp Phe Gly Asn Trp Leu Ile Asp Asn Gly Tyr Ser Thr
165 170 175
Tyr Ala Ser Ser Ile Val Trp Pro Ile Val Arg Asn Asp Leu Ser Tyr
180 185 190
Val Ala Gln Tyr Trp Asn Gln Thr Gly Tyr Asp Leu Trp Glu Glu Val
195 200 205
Asn Gly Ser Ser Phe Phe Thr Ile Ala Val Gln His Arg Ala Leu Val
210 215 220
Glu Gly Ser Thr Phe Ala Ser Lys Val Gly Ala Ser Cys Ser Trp Cys
225 230 235 240
Asp Ser Gln Ala Pro Gln Val Leu Cys Phe Leu Gln Arg Phe Trp Thr
245 250 255
Gly Ser Tyr Ile Met Ala Asn Phe Gly Gly Gly Arg Ser Gly Lys Asp
260 265 270
Ala Asn Thr Val Leu Gly Ser Ile His Thr Phe Asp Pro Asn Ala Gly
275 280 285
Cys Asp Asp Thr Thr Phe Gln Pro Cys Ser Pro Arg Ala Leu Ala Asn
290 295 300
His Lys Val Tyr Thr Asp Ser Phe Arg Ser Ile Tyr Ser Ile Asn Ser
305 310 315 320
Gly Ile Ser Ser Gly Lys Ala Val Ala Val Gly Arg Tyr Pro Glu Asp
325 330 335
Ser Tyr Tyr Asn Gly Asn Pro Trp Phe Leu Thr Thr Leu Ala Ala Ala
340 345 350
Glu Gln Leu Tyr Asp Ala Ile Tyr Gln Trp Gln Lys Ile Gly Ser Ile
355 360 365
Thr Ile Thr Asp Val Ser Leu Ala Phe Phe Lys Asp Leu Tyr Ser Ser
370 375 380
Ala Ala Val Gly Thr Tyr Ala Ser Ser Ser Ser Ala Phe Thr Ser Ile
385 390 395 400
Val Ser Ala Val Lys Thr Tyr Ala Asp Gly Tyr Met Ser Ile Val Gln
405 410 415
Thr His Ala Met Thr Asn Gly Ser Leu Ser Glu Gln Phe Gly Lys Ser
420 425 430
Asp Gly Phe Ser Leu Ser Ala Arg Asp Leu Thr Trp Ser Tyr Ala Ala
435 440 445
Leu Leu Thr Ala Asn Leu Arg Arg Asn Ser Val Val Pro Pro Ser Trp
450 455 460
Gly Glu Thr Thr Ala Thr Ser Val Pro Ser Val Cys Ser Ala Thr Ser
465 470 475 480
Ala Thr Gly Thr Tyr Ser Thr Ala Thr Asn Thr Ala Trp Pro Ser Thr
485 490 495
Leu Thr Ser Gly Thr Gly Ala Thr Thr Thr Thr Ser Lys Ala Thr Ser
500 505 510
Ser Ser Thr Thr Thr Thr Ser Ser Ala Ser Ser Thr Thr Val Glu Cys
515 520 525
Val Val Pro Thr Ala Val Ala Val Thr Phe Asp Glu Val Ala Thr Thr
530 535 540
Thr Tyr Gly Glu Asn Val Tyr Val Val Gly Ser Ile Ser Gln Leu Gly
545 550 555 560
Ser Trp Asp Thr Ser Lys Ala Val Ala Leu Ser Ala Ser Lys Tyr Thr
565 570 575
Ser Ser Asn Asn Leu Trp Tyr Val Thr Val Thr Leu Pro Ala Gly Thr
580 585 590
Thr Phe Gln Tyr Lys Phe Ile Arg Val Ser Ser Ser Gly Ser Val Thr
595 600 605
Trp Glu Ser Asp Pro Asn Arg Ser Tyr Thr Val Pro Ser Ala Cys Gly
610 615 620
Thr Ser Thr Ala Val Val Asn Thr Thr Trp Arg
625 630 635
<210> 69
<211> 635
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 69
Met Leu Leu Gln Ala Phe Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys
1 5 10 15
Ile Ser Ala Ala Pro Gln Leu Ala Pro Arg Ala Thr Thr Ser Leu Asp
20 25 30
Ala Trp Leu Ala Ser Glu Thr Thr Val Ala Leu Asp Gly Ile Leu Asp
35 40 45
Asn Val Gly Ser Ser Gly Ala Tyr Ala Lys Ser Ala Lys Ser Gly Ile
50 55 60
Val Ile Ala Ser Pro Ser Thr Ser Asp Pro Asp Tyr Tyr Tyr Thr Trp
65 70 75 80
Thr Arg Asp Ala Ala Leu Thr Val Lys Ala Leu Ile Asp Leu Phe Arg
85 90 95
Asn Gly Glu Thr Ser Leu Gln Thr Val Ile Met Glu Tyr Ile Ser Ser
100 105 110
Gln Ala Tyr Leu Gln Thr Val Ser Asn Pro Ser Gly Ser Leu Ser Thr
115 120 125
Gly Gly Leu Ala Glu Pro Lys Tyr Tyr Val Asp Glu Thr Ala Tyr Thr
130 135 140
Gly Ser Trp Gly Arg Pro Gln Arg Asp Gly Pro Ala Leu Arg Ala Thr
145 150 155 160
Ala Met Ile Asp Phe Gly Asn Trp Leu Ile Asp Asn Gly Tyr Ser Thr
165 170 175
Tyr Ala Ser Ser Ile Val Trp Pro Ile Val Arg Asn Asp Leu Ser Tyr
180 185 190
Val Ala Gln Tyr Trp Asn Gln Thr Gly Tyr Asp Leu Trp Glu Glu Val
195 200 205
Asn Gly Ser Ser Phe Phe Thr Ile Ala Val Gln His Arg Ala Leu Val
210 215 220
Glu Gly Ser Thr Phe Ala Ser Lys Val Gly Ala Ser Cys Ser Trp Cys
225 230 235 240
Asp Ser Gln Ala Pro Gln Val Leu Cys Phe Leu Gln Arg Phe Trp Thr
245 250 255
Gly Ser Tyr Ile Met Ala Asn Phe Gly Gly Gly Arg Ser Gly Lys Asp
260 265 270
Ala Asn Thr Val Leu Gly Ser Ile His Thr Phe Asp Pro Asn Ala Gly
275 280 285
Cys Asp Asp Thr Thr Phe Gln Pro Cys Ser Pro Arg Ala Leu Ala Asn
290 295 300
His Lys Val Tyr Thr Asp Ser Phe Arg Ser Ile Tyr Ser Ile Asn Ser
305 310 315 320
Gly Ile Ser Ser Gly Lys Ala Val Ala Val Gly Arg Tyr Pro Glu Asp
325 330 335
Ser Tyr Tyr Asn Gly Asn Pro Trp Phe Leu Thr Thr Leu Ala Ala Ala
340 345 350
Glu Gln Leu Tyr Asp Ala Ile Tyr Gln Trp Gln Lys Ile Gly Ser Ile
355 360 365
Thr Ile Thr Asp Val Ser Leu Ala Phe Phe Lys Asp Leu Tyr Ser Ser
370 375 380
Ala Ala Val Gly Thr Tyr Ala Ser Ser Ser Ser Ala Phe Thr Ser Ile
385 390 395 400
Val Ser Ala Val Lys Thr Tyr Ala Asp Gly Tyr Met Ser Ile Val Gln
405 410 415
Thr His Ala Met Thr Asn Gly Ser Leu Ser Glu Gln Phe Gly Lys Ser
420 425 430
Asp Gly Phe Ser Leu Ser Ala Arg Asp Leu Thr Trp Ser Tyr Ala Ala
435 440 445
Leu Leu Thr Ala Asn Leu Arg Arg Asn Ser Val Val Pro Pro Ser Trp
450 455 460
Gly Glu Thr Thr Ala Thr Ser Val Pro Ser Val Cys Ser Ala Thr Ser
465 470 475 480
Ala Thr Gly Thr Tyr Ser Thr Ala Thr Asn Thr Ala Trp Pro Ser Thr
485 490 495
Leu Thr Ser Gly Thr Gly Ala Thr Thr Thr Thr Ser Lys Ala Thr Ser
500 505 510
Ser Ser Thr Thr Thr Thr Ser Ser Ala Ser Ser Thr Thr Val Glu Cys
515 520 525
Val Val Pro Thr Ala Val Ala Val Thr Phe Asp Glu Val Ala Thr Thr
530 535 540
Thr Tyr Gly Glu Asn Val Tyr Val Val Gly Ser Ile Ser Gln Leu Gly
545 550 555 560
Ser Trp Asp Thr Ser Lys Ala Val Ala Leu Ser Ala Ser Lys Tyr Thr
565 570 575
Ser Ser Asn Asn Leu Trp Tyr Val Thr Val Thr Leu Pro Ala Gly Thr
580 585 590
Thr Phe Gln Tyr Lys Phe Ile Arg Val Ser Ser Ser Gly Ser Val Thr
595 600 605
Trp Glu Ser Asp Pro Asn Arg Ser Tyr Thr Val Pro Ser Ala Cys Gly
610 615 620
Thr Ser Thr Ala Val Val Asn Thr Thr Trp Arg
625 630 635
<210> 70
<211> 642
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 70
Met Leu Gly Lys Asn Asp Pro Met Cys Leu Val Leu Val Leu Leu Gly
1 5 10 15
Leu Thr Ala Leu Leu Gly Ile Cys Gln Gly Ala Pro Gln Leu Ala Pro
20 25 30
Arg Ala Thr Thr Ser Leu Asp Ala Trp Leu Ala Ser Glu Thr Thr Val
35 40 45
Ala Leu Asp Gly Ile Leu Asp Asn Val Gly Ser Ser Gly Ala Tyr Ala
50 55 60
Lys Ser Ala Lys Ser Gly Ile Val Ile Ala Ser Pro Ser Thr Ser Asp
65 70 75 80
Pro Asp Tyr Tyr Tyr Thr Trp Thr Arg Asp Ala Ala Leu Thr Val Lys
85 90 95
Ala Leu Ile Asp Leu Phe Arg Asn Gly Glu Thr Ser Leu Gln Thr Val
100 105 110
Ile Met Glu Tyr Ile Ser Ser Gln Ala Tyr Leu Gln Thr Val Ser Asn
115 120 125
Pro Ser Gly Ser Leu Ser Thr Gly Gly Leu Ala Glu Pro Lys Tyr Tyr
130 135 140
Val Asp Glu Thr Ala Tyr Thr Gly Ser Trp Gly Arg Pro Gln Arg Asp
145 150 155 160
Gly Pro Ala Leu Arg Ala Thr Ala Met Ile Asp Phe Gly Asn Trp Leu
165 170 175
Ile Asp Asn Gly Tyr Ser Thr Tyr Ala Ser Ser Ile Val Trp Pro Ile
180 185 190
Val Arg Asn Asp Leu Ser Tyr Val Ala Gln Tyr Trp Asn Gln Thr Gly
195 200 205
Tyr Asp Leu Trp Glu Glu Val Asn Gly Ser Ser Phe Phe Thr Ile Ala
210 215 220
Val Gln His Arg Ala Leu Val Glu Gly Ser Thr Phe Ala Ser Lys Val
225 230 235 240
Gly Ala Ser Cys Ser Trp Cys Asp Ser Gln Ala Pro Gln Val Leu Cys
245 250 255
Phe Leu Gln Arg Phe Trp Thr Gly Ser Tyr Ile Met Ala Asn Phe Gly
260 265 270
Gly Gly Arg Ser Gly Lys Asp Ala Asn Thr Val Leu Gly Ser Ile His
275 280 285
Thr Phe Asp Pro Asn Ala Gly Cys Asp Asp Thr Thr Phe Gln Pro Cys
290 295 300
Ser Pro Arg Ala Leu Ala Asn His Lys Val Tyr Thr Asp Ser Phe Arg
305 310 315 320
Ser Ile Tyr Ser Ile Asn Ser Gly Ile Ser Ser Gly Lys Ala Val Ala
325 330 335
Val Gly Arg Tyr Pro Glu Asp Ser Tyr Tyr Asn Gly Asn Pro Trp Phe
340 345 350
Leu Thr Thr Leu Ala Ala Ala Glu Gln Leu Tyr Asp Ala Ile Tyr Gln
355 360 365
Trp Gln Lys Ile Gly Ser Ile Thr Ile Thr Asp Val Ser Leu Ala Phe
370 375 380
Phe Lys Asp Leu Tyr Ser Ser Ala Ala Val Gly Thr Tyr Ala Ser Ser
385 390 395 400
Ser Ser Ala Phe Thr Ser Ile Val Ser Ala Val Lys Thr Tyr Ala Asp
405 410 415
Gly Tyr Met Ser Ile Val Gln Thr His Ala Met Thr Asn Gly Ser Leu
420 425 430
Ser Glu Gln Phe Gly Lys Ser Asp Gly Phe Ser Leu Ser Ala Arg Asp
435 440 445
Leu Thr Trp Ser Tyr Ala Ala Leu Leu Thr Ala Asn Leu Arg Arg Asn
450 455 460
Ser Val Val Pro Pro Ser Trp Gly Glu Thr Thr Ala Thr Ser Val Pro
465 470 475 480
Ser Val Cys Ser Ala Thr Ser Ala Thr Gly Thr Tyr Ser Thr Ala Thr
485 490 495
Asn Thr Ala Trp Pro Ser Thr Leu Thr Ser Gly Thr Gly Ala Thr Thr
500 505 510
Thr Thr Ser Lys Ala Thr Ser Ser Ser Thr Thr Thr Thr Ser Ser Ala
515 520 525
Ser Ser Thr Thr Val Glu Cys Val Val Pro Thr Ala Val Ala Val Thr
530 535 540
Phe Asp Glu Val Ala Thr Thr Thr Tyr Gly Glu Asn Val Tyr Val Val
545 550 555 560
Gly Ser Ile Ser Gln Leu Gly Ser Trp Asp Thr Ser Lys Ala Val Ala
565 570 575
Leu Ser Ala Ser Lys Tyr Thr Ser Ser Asn Asn Leu Trp Tyr Val Thr
580 585 590
Val Thr Leu Pro Ala Gly Thr Thr Phe Gln Tyr Lys Phe Ile Arg Val
595 600 605
Ser Ser Ser Gly Ser Val Thr Trp Glu Ser Asp Pro Asn Arg Ser Tyr
610 615 620
Thr Val Pro Ser Ala Cys Gly Thr Ser Thr Ala Val Val Asn Thr Thr
625 630 635 640
Trp Arg
<210> 71
<211> 634
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 71
Met Lys Trp Val Thr Phe Ile Ser Leu Leu Phe Leu Phe Ser Ser Ala
1 5 10 15
Tyr Ser Ala Pro Gln Leu Ala Pro Arg Ala Thr Thr Ser Leu Asp Ala
20 25 30
Trp Leu Ala Ser Glu Thr Thr Val Ala Leu Asp Gly Ile Leu Asp Asn
35 40 45
Val Gly Ser Ser Gly Ala Tyr Ala Lys Ser Ala Lys Ser Gly Ile Val
50 55 60
Ile Ala Ser Pro Ser Thr Ser Asp Pro Asp Tyr Tyr Tyr Thr Trp Thr
65 70 75 80
Arg Asp Ala Ala Leu Thr Val Lys Ala Leu Ile Asp Leu Phe Arg Asn
85 90 95
Gly Glu Thr Ser Leu Gln Thr Val Ile Met Glu Tyr Ile Ser Ser Gln
100 105 110
Ala Tyr Leu Gln Thr Val Ser Asn Pro Ser Gly Ser Leu Ser Thr Gly
115 120 125
Gly Leu Ala Glu Pro Lys Tyr Tyr Val Asp Glu Thr Ala Tyr Thr Gly
130 135 140
Ser Trp Gly Arg Pro Gln Arg Asp Gly Pro Ala Leu Arg Ala Thr Ala
145 150 155 160
Met Ile Asp Phe Gly Asn Trp Leu Ile Asp Asn Gly Tyr Ser Thr Tyr
165 170 175
Ala Ser Ser Ile Val Trp Pro Ile Val Arg Asn Asp Leu Ser Tyr Val
180 185 190
Ala Gln Tyr Trp Asn Gln Thr Gly Tyr Asp Leu Trp Glu Glu Val Asn
195 200 205
Gly Ser Ser Phe Phe Thr Ile Ala Val Gln His Arg Ala Leu Val Glu
210 215 220
Gly Ser Thr Phe Ala Ser Lys Val Gly Ala Ser Cys Ser Trp Cys Asp
225 230 235 240
Ser Gln Ala Pro Gln Val Leu Cys Phe Leu Gln Arg Phe Trp Thr Gly
245 250 255
Ser Tyr Ile Met Ala Asn Phe Gly Gly Gly Arg Ser Gly Lys Asp Ala
260 265 270
Asn Thr Val Leu Gly Ser Ile His Thr Phe Asp Pro Asn Ala Gly Cys
275 280 285
Asp Asp Thr Thr Phe Gln Pro Cys Ser Pro Arg Ala Leu Ala Asn His
290 295 300
Lys Val Tyr Thr Asp Ser Phe Arg Ser Ile Tyr Ser Ile Asn Ser Gly
305 310 315 320
Ile Ser Ser Gly Lys Ala Val Ala Val Gly Arg Tyr Pro Glu Asp Ser
325 330 335
Tyr Tyr Asn Gly Asn Pro Trp Phe Leu Thr Thr Leu Ala Ala Ala Glu
340 345 350
Gln Leu Tyr Asp Ala Ile Tyr Gln Trp Gln Lys Ile Gly Ser Ile Thr
355 360 365
Ile Thr Asp Val Ser Leu Ala Phe Phe Lys Asp Leu Tyr Ser Ser Ala
370 375 380
Ala Val Gly Thr Tyr Ala Ser Ser Ser Ser Ala Phe Thr Ser Ile Val
385 390 395 400
Ser Ala Val Lys Thr Tyr Ala Asp Gly Tyr Met Ser Ile Val Gln Thr
405 410 415
His Ala Met Thr Asn Gly Ser Leu Ser Glu Gln Phe Gly Lys Ser Asp
420 425 430
Gly Phe Ser Leu Ser Ala Arg Asp Leu Thr Trp Ser Tyr Ala Ala Leu
435 440 445
Leu Thr Ala Asn Leu Arg Arg Asn Ser Val Val Pro Pro Ser Trp Gly
450 455 460
Glu Thr Thr Ala Thr Ser Val Pro Ser Val Cys Ser Ala Thr Ser Ala
465 470 475 480
Thr Gly Thr Tyr Ser Thr Ala Thr Asn Thr Ala Trp Pro Ser Thr Leu
485 490 495
Thr Ser Gly Thr Gly Ala Thr Thr Thr Thr Ser Lys Ala Thr Ser Ser
500 505 510
Ser Thr Thr Thr Thr Ser Ser Ala Ser Ser Thr Thr Val Glu Cys Val
515 520 525
Val Pro Thr Ala Val Ala Val Thr Phe Asp Glu Val Ala Thr Thr Thr
530 535 540
Tyr Gly Glu Asn Val Tyr Val Val Gly Ser Ile Ser Gln Leu Gly Ser
545 550 555 560
Trp Asp Thr Ser Lys Ala Val Ala Leu Ser Ala Ser Lys Tyr Thr Ser
565 570 575
Ser Asn Asn Leu Trp Tyr Val Thr Val Thr Leu Pro Ala Gly Thr Thr
580 585 590
Phe Gln Tyr Lys Phe Ile Arg Val Ser Ser Ser Gly Ser Val Thr Trp
595 600 605
Glu Ser Asp Pro Asn Arg Ser Tyr Thr Val Pro Ser Ala Cys Gly Thr
610 615 620
Ser Thr Ala Val Val Asn Thr Thr Trp Arg
625 630
<210> 72
<211> 635
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 72
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Gln Leu Ala Pro Arg Ala Thr Thr Ser Leu Asp
20 25 30
Ala Trp Leu Ala Ser Glu Thr Thr Val Ala Leu Asp Gly Ile Leu Asp
35 40 45
Asn Val Gly Ser Ser Gly Ala Tyr Ala Lys Ser Ala Lys Ser Gly Ile
50 55 60
Val Ile Ala Ser Pro Ser Thr Ser Asp Pro Asp Tyr Tyr Tyr Thr Trp
65 70 75 80
Thr Arg Asp Ala Ala Leu Thr Val Lys Ala Leu Ile Asp Leu Phe Arg
85 90 95
Asn Gly Glu Thr Ser Leu Gln Thr Val Ile Met Glu Tyr Ile Ser Ser
100 105 110
Gln Ala Tyr Leu Gln Thr Val Ser Asn Pro Ser Gly Ser Leu Ser Thr
115 120 125
Gly Gly Leu Ala Glu Pro Lys Tyr Tyr Val Asp Glu Thr Ala Tyr Thr
130 135 140
Gly Ser Trp Gly Arg Pro Gln Arg Asp Gly Pro Ala Leu Arg Ala Thr
145 150 155 160
Ala Met Ile Asp Phe Gly Asn Trp Leu Ile Asp Asn Gly Tyr Ser Thr
165 170 175
Tyr Ala Ser Ser Ile Val Trp Pro Ile Val Arg Asn Asp Leu Ser Tyr
180 185 190
Val Ala Gln Tyr Trp Asn Gln Thr Gly Tyr Asp Leu Trp Glu Glu Val
195 200 205
Asn Gly Ser Ser Phe Phe Thr Ile Ala Val Gln His Arg Ala Leu Val
210 215 220
Glu Gly Ser Thr Phe Ala Ser Lys Val Gly Ala Ser Cys Ser Trp Cys
225 230 235 240
Asp Ser Gln Ala Pro Gln Val Leu Cys Phe Leu Gln Arg Phe Trp Thr
245 250 255
Gly Ser Tyr Ile Met Ala Asn Phe Gly Gly Gly Arg Ser Gly Lys Asp
260 265 270
Ala Asn Thr Val Leu Gly Ser Ile His Thr Phe Asp Pro Asn Ala Gly
275 280 285
Cys Asp Asp Thr Thr Phe Gln Pro Cys Ser Pro Arg Ala Leu Ala Asn
290 295 300
His Lys Val Tyr Thr Asp Ser Phe Arg Ser Ile Tyr Ser Ile Asn Ser
305 310 315 320
Gly Ile Ser Ser Gly Lys Ala Val Ala Val Gly Arg Tyr Pro Glu Asp
325 330 335
Ser Tyr Tyr Asn Gly Asn Pro Trp Phe Leu Thr Thr Leu Ala Ala Ala
340 345 350
Glu Gln Leu Tyr Asp Ala Ile Tyr Gln Trp Gln Lys Ile Gly Ser Ile
355 360 365
Thr Ile Thr Asp Val Ser Leu Ala Phe Phe Lys Asp Leu Tyr Ser Ser
370 375 380
Ala Ala Val Gly Thr Tyr Ala Ser Ser Ser Ser Ala Phe Thr Ser Ile
385 390 395 400
Val Ser Ala Val Lys Thr Tyr Ala Asp Gly Tyr Met Ser Ile Val Gln
405 410 415
Thr His Ala Met Thr Asn Gly Ser Leu Ser Glu Gln Phe Gly Lys Ser
420 425 430
Asp Gly Phe Ser Leu Ser Ala Arg Asp Leu Thr Trp Ser Tyr Ala Ala
435 440 445
Leu Leu Thr Ala Asn Leu Arg Arg Asn Ser Val Val Pro Pro Ser Trp
450 455 460
Gly Glu Thr Thr Ala Thr Ser Val Pro Ser Val Cys Ser Ala Thr Ser
465 470 475 480
Ala Thr Gly Thr Tyr Ser Thr Ala Thr Asn Thr Ala Trp Pro Ser Thr
485 490 495
Leu Thr Ser Gly Thr Gly Ala Thr Thr Thr Thr Ser Lys Ala Thr Ser
500 505 510
Ser Ser Thr Thr Thr Thr Ser Ser Ala Ser Ser Thr Thr Val Glu Cys
515 520 525
Val Val Pro Thr Ala Val Ala Val Thr Phe Asp Glu Val Ala Thr Thr
530 535 540
Thr Tyr Gly Glu Asn Val Tyr Val Val Gly Ser Ile Ser Gln Leu Gly
545 550 555 560
Ser Trp Asp Thr Ser Lys Ala Val Ala Leu Ser Ala Ser Lys Tyr Thr
565 570 575
Ser Ser Asn Asn Leu Trp Tyr Val Thr Val Thr Leu Pro Ala Gly Thr
580 585 590
Thr Phe Gln Tyr Lys Phe Ile Arg Val Ser Ser Ser Gly Ser Val Thr
595 600 605
Trp Glu Ser Asp Pro Asn Arg Ser Tyr Thr Val Pro Ser Ala Cys Gly
610 615 620
Thr Ser Thr Ala Val Val Asn Thr Thr Trp Arg
625 630 635
<210> 73
<211> 90
<212> PRT
<213> 酿酒酵母(Saccharomyces cerevisiae)
<400> 73
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
20 25 30
Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe
35 40 45
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
50 55 60
Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65 70 75 80
Ser Leu Glu Lys Arg Glu Ala Glu Ala Ser
85 90
<210> 74
<211> 61
<212> PRT
<213> 酿酒酵母(Saccharomyces cerevisiae)
<400> 74
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1 5 10 15
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Leu Glu Gly
20 25 30
Asp Phe Asp Val Ala Val Leu Pro Phe Ser Ala Ser Ile Ala Ala Lys
35 40 45
Glu Glu Gly Val Ser Leu Glu Lys Arg Glu Ala Glu Ala
50 55 60
<210> 75
<211> 19
<212> PRT
<213> 酿酒酵母(Saccharomyces cerevisiae)
<400> 75
Met Leu Leu Gln Ala Phe Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys
1 5 10 15
Ile Ser Ala
<210> 76
<211> 26
<212> PRT
<213> 原鸡(Gallus gallus)
<400> 76
Met Leu Gly Lys Asn Asp Pro Met Cys Leu Val Leu Val Leu Leu Gly
1 5 10 15
Leu Thr Ala Leu Leu Gly Ile Cys Gln Gly
20 25
<210> 77
<211> 20
<212> PRT
<213> 酿酒酵母(Saccharomyces cerevisiae)
<400> 77
Met Val Ala Trp Trp Ser Leu Phe Leu Tyr Gly Leu Gln Val Ala Ala
1 5 10 15
Pro Ala Leu Ala
20
<210> 78
<211> 18
<212> PRT
<213> 智人(Homo sapiens)
<400> 78
Met Lys Trp Val Thr Phe Ile Ser Leu Leu Phe Leu Phe Ser Ser Ala
1 5 10 15
Tyr Ser
<210> 79
<211> 617
<212> PRT
<213> 米氏酵母(Saccharomyces mikatae)
<400> 79
Met Lys Asn Phe Ile Ser Leu Val Asn Lys Lys Lys Gly Thr Leu Asp
1 5 10 15
Asp Arg Asn Ser Ser Val Pro Glu Ser Ser Ser Gly Ile Ile His Gln
20 25 30
Arg Gly Ala Leu Asn Thr Glu Asp Phe Glu Glu Gly Lys Lys Asp Gly
35 40 45
Ala Phe Glu Leu Gly His Leu Glu Phe Thr Thr Asn Ser Ala Gln Leu
50 55 60
Gly Asp Ser Asp Asp Asp Asn Asp Asn Ala Ile Lys Ile Ala Asn Ala
65 70 75 80
Ala Thr Asp Glu Ala Asn Glu Ala Asn Ser Glu Glu Lys Ser Met Thr
85 90 95
Leu Arg Gln Ala Leu Arg Lys Tyr Pro Lys Ala Ala Leu Trp Ser Ile
100 105 110
Leu Val Ser Thr Thr Leu Val Met Glu Gly Tyr Asp Thr Ala Leu Leu
115 120 125
Ser Ala Leu Tyr Ala Leu Pro Val Phe Gln Arg Lys Phe Gly Thr Met
130 135 140
Asn Ala Glu Gly Ser Tyr Glu Ile Thr Ser Gln Trp Gln Ile Gly Leu
145 150 155 160
Asn Met Cys Val Leu Cys Gly Glu Met Ile Gly Leu Gln Met Thr Thr
165 170 175
Tyr Met Val Glu Phe Met Gly Asn Arg Tyr Thr Met Ile Thr Ala Leu
180 185 190
Gly Leu Leu Thr Ala Tyr Ile Phe Ile Leu Tyr Tyr Cys Lys Ser Leu
195 200 205
Ala Met Ile Ala Val Gly Gln Ile Leu Ser Ala Met Pro Trp Gly Cys
210 215 220
Phe Gln Ser Leu Ala Val Thr Tyr Ala Ser Glu Val Cys Pro Leu Ala
225 230 235 240
Leu Arg Tyr Tyr Met Thr Ser Tyr Ser Asn Ile Cys Trp Leu Phe Gly
245 250 255
Gln Ile Phe Ala Ser Gly Ile Met Lys Asn Ser Gln Glu Asn Leu Gly
260 265 270
Asp Ser Asp Leu Gly Tyr Lys Leu Pro Phe Ala Leu Gln Trp Ile Trp
275 280 285
Pro Ala Pro Leu Ile Ile Gly Ile Phe Phe Ala Pro Glu Ser Pro Trp
290 295 300
Trp Leu Val Arg Lys Asn Lys Ile Ala Glu Ala Lys Lys Ser Leu Asn
305 310 315 320
Arg Ile Leu Ser Gly Thr Ala Ala Glu Arg Glu Ile Gln Val Asp Ile
325 330 335
Thr Leu Lys Gln Ile Glu Met Thr Ile Glu Lys Glu Arg Leu Leu Ala
340 345 350
Ser Lys Ser Gly Ser Phe Phe Asn Cys Phe Lys Gly Val Asp Gly Arg
355 360 365
Arg Thr Arg Leu Ala Cys Leu Thr Trp Val Ala Gln Asn Ser Ser Gly
370 375 380
Ala Val Leu Leu Gly Tyr Ser Thr Tyr Phe Phe Glu Arg Ala Gly Met
385 390 395 400
Ala Thr Asp Lys Ala Phe Thr Phe Ser Leu Ile Gln Tyr Cys Leu Gly
405 410 415
Leu Ala Gly Thr Leu Cys Ser Trp Val Ile Ser Gly Arg Val Gly Arg
420 425 430
Trp Ser Ile Leu Ala Tyr Gly Leu Ala Phe Gln Met Val Cys Leu Phe
435 440 445
Ile Ile Gly Gly Met Gly Phe Ala Ser Gly Ser Asn Ala Ser Asn Gly
450 455 460
Ala Gly Gly Leu Leu Leu Ala Leu Ser Phe Phe Tyr Asn Ala Gly Ile
465 470 475 480
Gly Ala Val Val Tyr Cys Ile Val Ala Glu Ile Pro Ser Ala Glu Leu
485 490 495
Arg Thr Lys Thr Ile Val Met Ala Arg Ile Cys Tyr Asn Leu Met Ala
500 505 510
Val Ile Asn Ala Ile Leu Thr Pro Tyr Met Leu Asn Val Ser Asp Trp
515 520 525
Asn Trp Gly Ala Lys Thr Gly Leu Tyr Trp Gly Gly Phe Thr Ala Val
530 535 540
Thr Leu Ala Trp Val Ile Ile Asp Leu Pro Glu Thr Thr Gly Arg Thr
545 550 555 560
Phe Ser Glu Ile Asn Glu Leu Phe Asn Gln Gly Val Pro Ala Arg Lys
565 570 575
Phe Ala Ser Thr Val Val Asp Pro Phe Gly Lys Gly Gln Arg Gln Asn
580 585 590
Asp Ser Gln Val Asp Asn Val Ile Asp Gln Ser Ser Ser Ala Met Gln
595 600 605
Gln Glu Leu Asn Glu Ala Asn Glu Phe
610 615
<210> 80
<211> 19
<212> PRT
<213> 酿酒酵母(Saccharomyces cerevisiae)
<400> 80
Met Lys Phe Ile Ser Thr Phe Leu Thr Phe Ile Leu Ala Ala Val Ser
1 5 10 15
Val Thr Ala
<210> 81
<211> 17
<212> PRT
<213> 酿酒酵母(Saccharomyces cerevisiae)
<400> 81
Met Phe Lys Ser Val Val Tyr Ser Ile Leu Ala Ala Ser Leu Ala Asn
1 5 10 15
Ala
<210> 82
<211> 6
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 82
Ala Val Leu Phe Ala Ala
1 5
<210> 83
<211> 6
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 83
Ala Phe Leu Phe Leu Leu
1 5
<210> 84
<211> 6
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 84
Leu Val Leu Val Leu Leu
1 5
<210> 85
<211> 5
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 85
Leu Leu Phe Leu Phe
1 5
<210> 86
<211> 6
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> *
<400> 86
Phe Ile Leu Ala Ala Val
1 5

Claims (15)

1.一种工程化的多肽,所述工程化的多肽包含:
(a)包含5-8个连续疏水性氨基酸残基的分泌信号氨基酸序列;以及
(b)来自酵母、真菌或细菌葡糖淀粉酶多肽的葡糖淀粉酶氨基酸序列,其中所述分泌信号氨基酸序列与所述葡糖淀粉酶氨基酸序列是异源的,并且
其中所述工程化的多肽具有葡糖淀粉酶活性。
2.根据权利要求1所述的工程化的多肽,其中所述5-8个连续疏水性氨基酸残基的氨基酸选自丙氨酸、异亮氨酸、亮氨酸、苯丙氨酸和/或缬氨酸。
3.根据权利要求1所述的工程化的多肽,其中所述5-8个连续疏水性氨基酸残基包含选自以下的序列:AVLFAA、AFLFLL、LVLVLL、LLFLF或FILAAV。
4.根据权利要求1-3中任一项所述的工程化的多肽,所述工程化的多肽包含:
(a)分泌信号氨基酸序列,所述分泌信号氨基酸序列与以下各项具有80%或更大的序列同一性:(i)SEQ ID NO:73的至少AA 1-19的氨基酸序列;(ii)SEQ ID NO:74的至少AA 1-19的氨基酸序列;(iii)SEQ ID NO:77;(iv)SEQ ID NO:75;(v)SEQ ID NO:76;或(vi)SEQID NO:78;以及
(b)来自酵母、真菌或细菌葡糖淀粉酶多肽的葡糖淀粉酶氨基酸序列,其中所述多肽具有葡糖淀粉酶活性。
5.根据权利要求1所述的工程化的多肽,其中所述(a)分泌信号氨基酸序列与以下各项具有90%或更大的序列同一性:
(i)SEQ ID NO:73的至少AA1-19的氨基酸序列;(ii)SEQ ID NO:74的至少AA1-19的氨基酸序列;(iii)SEQ ID NO:77;(iv)SEQ ID NO:75;(v)SEQ ID NO:76;或(vi)SEQ ID NO:78。
6.根据前述权利要求中任一项所述的工程化的多肽,其中所述葡糖淀粉酶氨基酸序列来自选自由以下组成的组的酵母或真菌生物体:树脂枝孢霉菌、黑曲霉、泡盛曲霉、米曲霉、川地曲霉、白宇佐美曲霉、土曲霉、出芽短梗霉、食腺嘌呤芽生葡萄孢酵母、布鲁塞尔酒香酵母、白色念珠菌、产朊假丝酵母、草酸青霉、米根霉、粟酒裂殖酵母、酿酒酵母、扣囊复膜孢酵母、埃默森踝节菌、瓣环栓菌以及里氏木霉。
7.根据权利要求6所述的工程化的多肽,其中所述葡糖淀粉酶氨基酸序列与SEQ IDNO:42具有95%或更大、96%或更大、97%或更大、98%或更大或99%或更大的序列同一性。
8.根据权利要求6所述的工程化的多肽,其中所述葡糖淀粉酶氨基酸序列与SEQ IDNO:43具有95%或更大、96%或更大、97%或更大、98%或更大或99%或更大的序列同一性。
9.根据权利要求6所述的工程化的多肽,其中所述葡糖淀粉酶氨基酸序列与SEQ IDNO:44具有95%或更大、96%或更大、97%或更大、98%或更大或99%或更大的序列同一性。
10.根据权利要求1所述的工程化的多肽,其中所述葡糖淀粉酶氨基酸序列与来自选自以下的多肽的氨基酸具有95%或更大、96%或更大、97%或更大、98%或更大、或99%或更大的序列同一性:
(i)SEQ ID NO:45;(ii)SEQ ID NO:46;(iii)SEQ ID NO:47;(iv)SEQ ID NO:48;(v)SEQ ID NO:49;(vi)SEQ ID NO:50;或(vii)SEQ ID NO:51。
11.一种工程化的细胞,所述工程化的细胞表达根据权利要求1-10中任一项所述的工程化的多肽。
12.根据权利要求11所述的工程化的细胞,其中所述工程化的细胞是从菌种酿酒酵母的宿主细胞工程化的。
13.根据权利要求11-12中任一项所述的工程化的细胞,所述工程化的细胞(a)能够以大于90g/L、100g/L、110g/L、120g/L、130g/L或140g/L的滴度产生乙醇;(b)在33℃至40℃、33℃至39℃、33℃至38℃、33℃至37℃、34℃至37℃、35℃至37℃或36℃至38℃范围内的温度下具有耐热性;或(a)和(b)这两者。
14.一种用于产生发酵产物的发酵方法,所述发酵方法包括以下步骤:
将包含淀粉材料和根据权利要求11-13中任一项所述的工程化的细胞的液体培养基发酵以提供发酵产物。
15.根据权利要求14所述的发酵方法,其中所述发酵提供90g/L至170g/L范围内的滴度的乙醇。
CN201780048781.3A 2016-08-05 2017-08-04 前导序列修饰的葡糖淀粉酶多肽和具有增强的生物产物产生的工程化的酵母菌株 Pending CN109661403A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201662371681P 2016-08-05 2016-08-05
US62/371,681 2016-08-05
PCT/US2017/045493 WO2018027131A1 (en) 2016-08-05 2017-08-04 Leader-modified glucoamylase polypeptides and engineered yeast strains having enhanced bioproduct production

Publications (1)

Publication Number Publication Date
CN109661403A true CN109661403A (zh) 2019-04-19

Family

ID=61073959

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201780048781.3A Pending CN109661403A (zh) 2016-08-05 2017-08-04 前导序列修饰的葡糖淀粉酶多肽和具有增强的生物产物产生的工程化的酵母菌株

Country Status (6)

Country Link
US (1) US11421212B2 (zh)
EP (1) EP3494129A1 (zh)
CN (1) CN109661403A (zh)
BR (1) BR112019002238A2 (zh)
CA (1) CA3032736A1 (zh)
WO (1) WO2018027131A1 (zh)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3516051A4 (en) 2016-09-16 2020-04-29 Cargill, Incorporated GENETICALLY MODIFIED YEASTS CONSUMING LACTATE AND FERMENTATION PROCESSES USING THE GENETICALLY MODIFIED YEASTS
WO2018141872A1 (en) * 2017-02-02 2018-08-09 Lallemand Hungary Liquidity Management Llc Heterologous protease expression for improving alcoholic fermentation
KR102618002B1 (ko) * 2017-03-10 2023-12-27 볼트 쓰레즈, 인크. 재조합 단백질을 고분비 수율로 생산하기 위한 조성물 및 방법
BR112019023085A2 (pt) 2017-05-04 2020-06-09 Cargill Inc levedura geneticamente modificada, e, processo para fabricar etanol.
WO2019168962A1 (en) * 2018-02-28 2019-09-06 Cargill, Incorporated Glucoamylase engineered yeast and fermentation methods
MX2021015818A (es) 2019-08-06 2022-02-03 Novozymes As Proteinas de fusion para mejorar la expresion de enzimas.
US20210163995A1 (en) * 2019-11-29 2021-06-03 Lallemand Hungary Liquidity Management Llc Process for displacing an exogenous enzyme
BR112022010434A2 (pt) * 2019-11-29 2022-10-11 Lallemand Hungary Liquidity Man Llc Levedura que expressa glucoamilase heteróloga
WO2021133658A1 (en) * 2019-12-23 2021-07-01 Cargill, Incorporated Fermentation method and uses thereof
US11814629B2 (en) 2020-03-19 2023-11-14 Lallemand Hungary Liquidity Management Llc Yeast expressing glucoamylase with enhanced starch hydrolysis
CN115867651A (zh) * 2020-04-17 2023-03-28 丹尼斯科美国公司 葡糖淀粉酶及其使用方法
WO2022005732A1 (en) * 2020-06-30 2022-01-06 Arris Enterprises Llc Virtual elastic queue
CN112662575B (zh) * 2021-01-28 2022-10-21 武汉轻工大学 高蛋白酶活性及高出酒率的扣囊复膜酵母菌、其组合物及应用
WO2023225459A2 (en) 2022-05-14 2023-11-23 Novozymes A/S Compositions and methods for preventing, treating, supressing and/or eliminating phytopathogenic infestations and infections
WO2024040001A1 (en) 2022-08-17 2024-02-22 Cargill, Incorporated Genetically modified yeast and fermentation processes for the production of ethanol

Family Cites Families (58)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4546082A (en) 1982-06-17 1985-10-08 Regents Of The Univ. Of California E. coli/Saccharomyces cerevisiae plasmid cloning vector containing the alpha-factor gene for secretion and processing of hybrid proteins
NZ207926A (en) 1983-04-25 1988-04-29 Genentech Inc Use of yeast #a#-factor to assist in expression of proteins heterologus to yeast
US4870008A (en) 1983-08-12 1989-09-26 Chiron Corporation Secretory expression in eukaryotes
US5422267A (en) 1984-05-22 1995-06-06 Robert R. Yocum Industrial yeast comprising an integrated glucoamylase gene
US6214577B1 (en) 1984-05-22 2001-04-10 Robert Rogers Yocum Yeast vectors conferring antibiotic resistance
CA1293460C (en) 1985-10-07 1991-12-24 Brian Lee Sauer Site-specific recombination of dna in yeast
US5024941A (en) 1985-12-18 1991-06-18 Biotechnica International, Inc. Expression and secretion vector for yeast containing a glucoamylase signal sequence
US5231017A (en) 1991-05-17 1993-07-27 Solvay Enzymes, Inc. Process for producing ethanol
DE69432543T2 (de) 1993-07-23 2003-12-24 Dsm Nv Selektionmarker-genfreie rekombinante Stämme: Verfahren zur ihrer Herstellung und die Verwendung dieser Stämme
US5521086A (en) 1993-09-16 1996-05-28 Cephalon, Inc. Secretion sequence for the production of a heterologous protein in yeast
AU4013295A (en) 1994-10-27 1996-05-23 Genencor International, Inc. A method for improved raw material utilization in fermentation processes
US5587290A (en) 1995-06-26 1996-12-24 The Regents Of The University Of California Stress tolerant yeast mutants
IT1294728B1 (it) 1997-09-12 1999-04-12 Biopolo S C A R L Ceppi di lievito per la riproduzione di acido lattico
US8735544B1 (en) 1999-01-13 2014-05-27 Little Sioux Corn Processors, Llc Value added whole stillage by-products from an ethanol production process
EP1183385B1 (en) 1999-05-21 2006-07-19 Cargill Dow LLC Methods and materials for the synthesis of organic products
US7109010B2 (en) 2000-11-22 2006-09-19 Nature Works Llc Methods and materials for the synthesis of organic products
EP1339740A2 (en) 2000-12-01 2003-09-03 Wyeth Methods and cells for detecting modulators of rgs proteins
WO2004007664A2 (en) 2002-05-28 2004-01-22 Maxygen, Inc. Nucleic acid vectors
US20040023349A1 (en) 2002-06-13 2004-02-05 Novozymes A/S Processes for making ethanol
DE10252245A1 (de) * 2002-11-07 2004-05-27 Prof. Dr. Danilo Porro Università degli Studi di Milano-Bicocca Dipartimento die Biotechnologie e Bioscienze Verfahren zur Expression und Sekretion von Proteinen mittels der nicht-konventionellen Hefe Zygosaccharomyces bailii
US7413887B2 (en) 2004-05-27 2008-08-19 Genecor International, Inc. Trichoderma reesei glucoamylase and homologs thereof
US7314033B2 (en) 2004-11-18 2008-01-01 Massachusetts Institute Of Technology Fuel management system for variable ethanol octane enhancement of gasoline engines
US7785872B2 (en) 2004-12-08 2010-08-31 Simpson Biotech Co., Ltd. Nucleic acids for enhancing gene expression and use thereof
CN101128580B (zh) 2004-12-22 2016-08-24 诺维信公司 用于淀粉加工的酶
US8097448B2 (en) 2005-06-02 2012-01-17 Cargill Inc. Genetically modified yeast of the species Issatchenkia orientalis and closely relates species, and fermentation processes using same
CN101384621A (zh) * 2005-11-10 2009-03-11 受体生物公司 产生受体和配体同种型的方法
JP5354559B2 (ja) 2005-11-24 2013-11-27 独立行政法人産業技術総合研究所 高効率分泌シグナルペプチド及びそれらを利用したタンパク質発現系
CA2673525C (en) 2006-12-21 2017-12-05 Verenium Corporation Amylases and glucoamylases, nucleic acids encoding them and methods for making and using them
US20100317078A1 (en) 2007-08-27 2010-12-16 Cornell Research Foundation Inc Methods to improve alcohol tolerance of microorganisms
AU2008300579B2 (en) 2007-09-18 2014-11-13 Basf Plant Science Gmbh Plants with increased yield
US8592194B2 (en) 2007-10-09 2013-11-26 Danisco Us Inc. Glucoamylase variants with altered properties
ES2563040T3 (es) 2007-12-23 2016-03-10 Gevo, Inc. Organismo de levadura que produce isobutanol a un alto rendimiento
JP2011067095A (ja) 2008-01-10 2011-04-07 Ajinomoto Co Inc 発酵法による目的物質の製造法
US8067339B2 (en) * 2008-07-09 2011-11-29 Merck Sharp & Dohme Corp. Surface display of whole antibodies in eukaryotes
AU2008297025B2 (en) 2008-09-09 2012-03-15 Suntory Holdings Limited Glucose-induced inactivation/degradation-resistant transporter gene and use thereof
KR101178205B1 (ko) 2008-12-04 2012-08-29 게란티제약 주식회사 원형질융합에 의한 에탄올 저항성 균주, 이의 제조방법, 소디움메타게르마네이트를 이용한 고함량 바이오유기게르마늄을 함유한 효모의 제조방법 및 이에 의해 생산된 효모
CA2760876A1 (en) 2009-05-04 2010-11-11 Carnegie Institution Of Washington Novel sugar transporters
US8394622B2 (en) 2009-08-10 2013-03-12 Pioneer Hi Bred International Inc Yeast strains for improved ethanol production
EP2467474A1 (en) 2009-08-19 2012-06-27 Danisco A/S Variants of glucoamylase
CA2782154C (en) 2009-11-30 2018-10-16 Novozymes A/S Polypeptides having glucoamylase activity and polynucleotides encoding same
WO2011091107A2 (en) 2010-01-20 2011-07-28 Arizona Board Of Regents, A Body Corporate Of The State Of Arizona Acting For And On Behalf Of Arizona State University Film bulk acoustic wave resonator-based ethanol and acetone sensors and methods using the same
US8809060B2 (en) 2010-04-23 2014-08-19 Ewha University-Industry Collaboration Foundation Ethanol-resistant yeast gene, and use thereof
CN103124783A (zh) 2010-06-03 2013-05-29 马斯科马公司 表达用于使用淀粉和纤维素进行联合生物加工的糖分解酶的酵母
US8178331B2 (en) 2010-09-15 2012-05-15 Wisconsin Alumni Research Foundation Recombinant yeast with improved ethanol tolerance and related methods of use
US20140162335A1 (en) 2011-06-21 2014-06-12 Pedro Esteban Bortiri Recombinant Yeast Expressing AGT1
FR2978146B1 (fr) 2011-07-21 2013-08-30 IFP Energies Nouvelles Procede de deshydratation de l'ethanol en ethylene a basse consommation energetique
US8697412B2 (en) 2011-11-28 2014-04-15 Novozymes A/S Polypeptides having glucoamylase activity and polynucleotides encoding same
DK2794641T3 (en) 2011-12-22 2017-03-13 Dupont Nutrition Biosci Aps POLYPEPTIDES WITH GLUCOAMYLASE ACTIVITY AND PROCEDURES FOR PREPARING IT
WO2013178674A1 (en) 2012-05-31 2013-12-05 Novozymes A/S Improved selection in fungi
MX2015002099A (es) 2012-08-22 2015-05-11 Dupont Nutrition Biosci Aps Variantes que tienen actividad glucoamilasa.
US9988650B2 (en) 2012-11-20 2018-06-05 Lallenmend Hungary Liquidity Management LLC Electron consuming ethanol production pathway to displace glycerol formation in S. cerevisiae
BR102012029839A8 (pt) 2012-11-23 2018-05-22 Mendes De Oliveira Jadyr utilização de biocida natural no processo de produção de etanol de diversas fontes
DE102012221460A1 (de) 2012-11-23 2014-06-12 Beiersdorf Ag Verwendung von Triethylcitrat als Vergällungsmittel für Ethanol
CA2920114A1 (en) 2013-08-15 2015-02-19 Lallemand Hungary Liquidity Management Llc Methods for the improvement of product yield and production in a microorganism through glycerol recycling
CN106559996A (zh) 2014-06-18 2017-04-05 嘉吉公司 一种利用遗传工程改造的酵母发酵生产糖的方法
WO2016127083A1 (en) 2015-02-06 2016-08-11 Cargill, Incorporated Modified glucoamylase enzymes and yeast strains having enhanced bioproduct production
EP3274461A1 (en) * 2015-03-27 2018-01-31 Cargill, Incorporated Glucoamylase-modified yeast strains and methods for bioproduct production
BR112018011902A2 (pt) 2015-12-17 2018-12-04 Cargill Inc método de fermentação, levedura geneticamente modificada, construto de ácido nucleico, vetor, célula hospedeira, meio de fermentação e utilização da levedura geneticamente modificada

Also Published As

Publication number Publication date
US20190345471A1 (en) 2019-11-14
WO2018027131A1 (en) 2018-02-08
BR112019002238A2 (pt) 2019-05-14
CA3032736A1 (en) 2018-02-08
US11421212B2 (en) 2022-08-23
EP3494129A1 (en) 2019-06-12

Similar Documents

Publication Publication Date Title
CN109661403A (zh) 前导序列修饰的葡糖淀粉酶多肽和具有增强的生物产物产生的工程化的酵母菌株
AU2020289750B2 (en) Engineered meganucleases with recognition sequences found in the human T cell receptor alpha constant region gene
CN110582567B (zh) 经遗传修饰的表达海藻糖酶的酵母及使用此类经遗传修饰的酵母的发酵方法
CN101835898B (zh) 用于生物活性肽表达和纯化的可溶性标记
AU2021200863A1 (en) Genetically-modified cells comprising a modified human t cell receptor alpha constant region gene
CN101365788B (zh) Δ-9延伸酶及其在制备多不饱和脂肪酸中的用途
CN101939434B (zh) 用于在大豆中提高种子贮藏油脂的生成和改变脂肪酸谱的来自解脂耶氏酵母的dgat基因
KR20210149060A (ko) Tn7-유사 트랜스포존을 사용한 rna-유도된 dna 통합
KR20140092759A (ko) 숙주 세포 및 아이소부탄올의 제조 방법
BRPI0806354A2 (pt) plantas oleaginosas transgências, sementes, óleos, produtos alimentìcios ou análogos a alimento, produtos alimentìcios medicinais ou análogos alimentìcios medicinais, produtos farmacêuticos, bebidas fórmulas para bebês, suplementos nutricionais, rações para animais domésticos, alimentos para aquacultura, rações animais, produtos de sementes inteiras, produtos de óleos misturados, produtos, subprodutos e subprodutos parcialmente processados
DK2324120T3 (en) Manipulating SNF1 protein kinase OF REVISION OF OIL CONTENT IN OLEAGINOUS ORGANISMS
CN108884467A (zh) 包含产能发酵途径的基因工程菌
KR20110122672A (ko) 이소프렌 및 공-산물을 제조하는 방법
DK2768848T3 (en) METHODS AND PROCEDURES FOR EXPRESSION AND SECRETARY OF PEPTIDES AND PROTEINS
US20030024009A1 (en) Manipulation of the phenolic acid content and digestibility of plant cell walls by targeted expression of genes encoding cell wall degrading enzymes
CN109906270A (zh) 经遗传修饰的乳酸消耗酵母以及使用此类经遗传修饰的酵母的发酵工艺
CN107429220A (zh) 经葡糖淀粉酶修饰的酵母菌株和用于产生生物产物的方法
CN111757890A (zh) 发酵工艺
CN108431229A (zh) 糖转运蛋白修饰的酵母菌株和用于生物制品生产的方法
CN112166188A (zh) 用于使用经工程化的酵母产生乙醇的方法
KR20210151916A (ko) 뒤시엔느 근육 이영양증의 치료를 위한 aav 벡터-매개된 큰 돌연변이 핫스팟의 결실
CN108138198A (zh) 为提高的产率而改造的微生物
CN113302303A (zh) 经修饰的丝状真菌宿主细胞
KR20140043890A (ko) 조절된 유전자 발현 시스템 및 그의 작제물
KR102409420B1 (ko) 형질전환 생물체 선별용 마커 조성물, 형질전환 생물체 및 형질전환 방법

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20190419

WD01 Invention patent application deemed withdrawn after publication