CN113832043A - 具有提高的乳酸产生能力的重组耐酸酵母 - Google Patents

具有提高的乳酸产生能力的重组耐酸酵母 Download PDF

Info

Publication number
CN113832043A
CN113832043A CN202110702592.1A CN202110702592A CN113832043A CN 113832043 A CN113832043 A CN 113832043A CN 202110702592 A CN202110702592 A CN 202110702592A CN 113832043 A CN113832043 A CN 113832043A
Authority
CN
China
Prior art keywords
strain
lactic acid
gene encoding
gene
recombinant
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110702592.1A
Other languages
English (en)
Inventor
朴宰演
李泰荣
李气成
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SK Innovation Co Ltd
Original Assignee
SK Innovation Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SK Innovation Co Ltd filed Critical SK Innovation Co Ltd
Publication of CN113832043A publication Critical patent/CN113832043A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/80Vectors or expression systems specially adapted for eukaryotic hosts for fungi
    • C12N15/81Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/52Genes encoding for enzymes or proenzymes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N1/00Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
    • C12N1/14Fungi; Culture media therefor
    • C12N1/16Yeasts; Culture media therefor
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N1/00Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
    • C12N1/14Fungi; Culture media therefor
    • C12N1/16Yeasts; Culture media therefor
    • C12N1/165Yeast isolates
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N1/00Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
    • C12N1/36Adaptation or attenuation of cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0006Oxidoreductases (1.) acting on CH-OH groups as donors (1.1)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0008Oxidoreductases (1.) acting on the aldehyde or oxo group of donors (1.2)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/88Lyases (4.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P7/00Preparation of oxygen-containing organic compounds
    • C12P7/40Preparation of oxygen-containing organic compounds containing a carboxyl group including Peroxycarboxylic acids
    • C12P7/56Lactic acid
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y101/00Oxidoreductases acting on the CH-OH group of donors (1.1)
    • C12Y101/01Oxidoreductases acting on the CH-OH group of donors (1.1) with NAD+ or NADP+ as acceptor (1.1.1)
    • C12Y101/01001Alcohol dehydrogenase (1.1.1.1)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y101/00Oxidoreductases acting on the CH-OH group of donors (1.1)
    • C12Y101/01Oxidoreductases acting on the CH-OH group of donors (1.1) with NAD+ or NADP+ as acceptor (1.1.1)
    • C12Y101/01008Glycerol-3-phosphate dehydrogenase (NAD+) (1.1.1.8)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y101/00Oxidoreductases acting on the CH-OH group of donors (1.1)
    • C12Y101/01Oxidoreductases acting on the CH-OH group of donors (1.1) with NAD+ or NADP+ as acceptor (1.1.1)
    • C12Y101/01027L-Lactate dehydrogenase (1.1.1.27)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y101/00Oxidoreductases acting on the CH-OH group of donors (1.1)
    • C12Y101/01Oxidoreductases acting on the CH-OH group of donors (1.1) with NAD+ or NADP+ as acceptor (1.1.1)
    • C12Y101/01028D-Lactate dehydrogenase (1.1.1.28)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y101/00Oxidoreductases acting on the CH-OH group of donors (1.1)
    • C12Y101/02Oxidoreductases acting on the CH-OH group of donors (1.1) with a cytochrome as acceptor (1.1.2)
    • C12Y101/02003L-Lactate dehydrogenase (cytochrome) (1.1.2.3)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y401/00Carbon-carbon lyases (4.1)
    • C12Y401/01Carboxy-lyases (4.1.1)
    • C12Y401/01001Pyruvate decarboxylase (4.1.1.1)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12RINDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
    • C12R2001/00Microorganisms ; Processes using microorganisms
    • C12R2001/645Fungi ; Processes using fungi

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Biotechnology (AREA)
  • Biomedical Technology (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Medicinal Chemistry (AREA)
  • Mycology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Tropical Medicine & Parasitology (AREA)
  • Virology (AREA)
  • Botany (AREA)
  • Cell Biology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)

Abstract

本发明公开了一种具有提高的乳酸产生能力的重组耐酸酵母及使用其制备乳酸的方法。当使用根据本发明的重组耐酸酵母生产乳酸时,不仅可以使用比常规细菌发酵情况下明显更少量的中和剂以与细菌发酵相似的乳酸产生能力进行乳酸发酵,而且可以减少副产物乙醇和甘油的产生。因此,发酵成本可以大大降低,后续纯化过程的成本也可以降低。

Description

具有提高的乳酸产生能力的重组耐酸酵母
技术领域
本发明涉及一种具有提高的乳酸产生能力的重组耐酸酵母及使用其制备乳酸的方法。更具体地,本发明涉及其中在编码丙酮酸脱羧酶的基因已被缺失的位点上引入特定的、细菌来源的乳酸脱氢酶基因的重组耐酸酵母,以及使用该重组耐酸酵母制备乳酸的方法。
背景技术
聚乳酸(PLA)是可生物降解的聚合物,其通过将乳酸转化为丙交酯并在其上进行开环聚合反应来制备。聚乳酸的原料,乳酸,是通过发酵生产的。PLA广泛用于一次性食品容器中,其优点在于,它能够被单独使用或以塑料中的组合物或共聚物的形式用于各种行业,包括汽车行业和纤维行业。此外,它是近年来用于3D打印的聚合物的代表性实例,并且是用于3D打印机时产生较低量的有害气体和气味的环保聚合物。
传统的乳酸生产工艺是使用乳酸细菌(lactic acid bacteria)进行的,并包括在使用各种形式的Ca盐/Ma盐或中和剂例如氨来保持6至8的中性pH 的同时进行发酵,以防止由于乳酸细菌产生和积累的乳酸而导致的细菌死亡或其生长减慢。当发酵完成时,分离微生物,并添加硫酸以将乳酸盐(lactate)转化为乳酸,同时由于难以从水中分离盐并将乳酸转化为丙交酯,以CaSO4的形式除去Ca盐。在该方法中,产生的副产物CaSO4的量大于乳酸的量,从而降低了工艺效率。
通常,PLA通过发酵产生乳酸,然后通过纯化工艺将产生的乳酸转化为丙交酯。为了转化为丙交酯,需要将乳酸转化为氢化形式的工艺,而中性发酵的pH通常为6至7,因此使用大量的硫酸将中性pH改变为酸性 pH。在该过程中,产生大量的中和盐,并且由于中和盐的价值低以及去除该中和盐的工艺的投资成本,经济可行性降低。
同时,乳酸具有L型光学异构体和D型光学异构体。存在各种各样的微生物种群。例如,主要产生L型光学异构体的乳酸细菌通常也产生约 5-10%的D型光学异构体,主要产生D型光学异构体的菌株包括产生D型光学异构体和L型光学异构体两者的菌株,产生D型光学异构体和乙醇两者的菌株等(Ellen I.Garvie,Microbiological Reviews,106-139,1980)。
同时,就天然产生乳酸的乳酸杆菌(Lactobacillus)而言,为了商业化地生产乳酸,必须使用大量昂贵的营养物作为培养基。这些过多的营养成分极大地阻碍了下游聚合工艺或当丙交酯用作中间体时进行的丙交酯转化工艺,为了获得高产量(yield)和高纯度的聚合物或其前体,产生了例如吸附、蒸馏和离子交换的纯化工艺的成本,从而进一步增加了生产成本。为了解决这些问题,已经提出了使用酵母的研究。众所周知,即使使用廉价的营养物,酵母也可以进行生长/发酵,并且还具有高的耐酸性。
当使用在酸中生长良好的酵母(以下称为“耐酸酵母”)生产乳酸时,在发酵期间无需使用中和剂将培养基的pH维持在6至7,因此简化了发酵工艺,并且不需要用于去除中和剂的下游纯化工艺。此外,酵母本身会产生代谢所需的许多成分,因此与细菌(尤其是乳酸杆菌)相比,可以在营养水平相对较低的培养基中进行培养,从而避免了下游的纯化工艺,并大大降低了生产成本。
然而,使用酵母生产乳酸对技术是有要求的。要求是,作为菌株发酵性能指标的乳酸的产量(yield)、生产率(productivity)和浓度必须保持在与乳酸细菌的性能相似的高水平,才能使该技术能够被商业化应用。
尽管已开发出使用耐酸酵母生产乳酸的技术,但在实践中,在很多情况下,通过在发酵期间进行中和反应,只有当pH值保持为至少3.7(不低于乳酸的pKa值)的情况下进行发酵,才能表现出较高的发酵性能,因此,将该技术确定为实现耐酸性的实用方法是不合理的,并且很难预期降低工艺中的生产成本的效果(Michael Sauer et al.,Biotechnologyand Genetic Engineering Reviews,27:229-256,2010)。
因此,能够降低工艺成本的耐酸酵母只有在其能够在发酵溶液的pH 值不超过pKa值的情况下完成发酵而不使用中和剂或以最小量使用中和剂,并且三个主要发酵指标达到了与乳酸细菌相似的水平时,才能被商业应用。
通常,当发酵葡萄糖时,酵母产生的乙醇为主要产物,副产物为甘油,几乎不产生任何乳酸。另外,由于从具有高耐酸性的微生物中选择产生乳酸的菌株的可能性非常低,因此,本发明人选择了具有优异的耐酸性的酵母菌株,并试图通过基因工程方法构建一种被赋予乳酸产生能力并且乙醇和甘油产生能力受到抑制的菌株。
因此,作为大力生产具有强耐酸性同时表现出与细菌菌株相似的乳酸产生能力(乳酸产生速率和浓度)以及产生副产物乙醇和甘油的能力降低的酵母菌株的结果,本发明人发现,由于通过在耐酸酵母中编码丙酮酸转化酶(pyruvate conversion enzyme)的基因位置引入源自表皮葡萄球菌 (S.epidermidis)的乳酸脱氢酶基因而增加了乳酸脱氢酶的活性,乳酸产量得到了提高,基于这一发现,完成了本发明。
发明内容
因此,鉴于上述问题而提出了本发明,并且本发明的一个目的是提供一种具有增加的乳酸产生速率(production rate)和增加的乳酸浓度的重组菌株,该重组菌株是通过将增强的对高浓度乳酸的耐受性赋予具有乳酸产生能力的重组耐酸酵母菌株构建而成的。
本发明的另一个目的是使用适应性进化的方法提供一种生产具有乳酸产生能力和增强的乳酸耐受性(lactic-acid resistance)的重组酵母菌株的方法。
本发明的另一个目的是提供一种通过该方法生产的、在高浓度的乳酸培养基中具有提高的乳酸产生能力的重组酵母菌株。
根据本发明的一个方面,上述和其他目的可以通过提供一种具有乳酸产生能力的重组菌株来实现,该重组菌株通过从耐酸酵母YBC菌株 (KCTC13508BP)中缺失编码丙酮酸脱羧酶的基因,并在编码丙酮酸脱羧酶的基因的位置引入编码源自表皮葡萄球菌(Staphylococcus epidermidis) 的乳酸脱氢酶的基因构建而成。
根据本发明的另一个方面,提供了一种具有乳酸产生能力的重组菌株,该重组菌株通过从耐酸酵母YBC菌株(KCTC13508BP)中缺失编码将磷酸二羟丙酮转化为甘油-3-磷酸的酶的GPD1基因、编码将乳酸盐(lactate) 转化为丙酮酸盐(pyruvate)的酶的CYB2基因、编码醇脱氢酶的ADH基因和编码丙酮酸脱羧酶的PDC基因,并将编码乳酸脱氢酶的基因引入至耐酸酵母YBC菌株构建而成;
其中该编码乳酸脱氢酶的基因被引入缺失的ADH基因的位置、缺失的PDC基因的位置和缺失的GPD1基因的位置处,并且
在PDC基因的位置处引入的编码乳酸脱氢酶的基因是编码源自表皮葡萄球菌的乳酸脱氢酶的基因。
根据本发明的另一个方面,提供了一种生产具有提高的乳酸耐受性和提高的乳酸产生能力的重组菌株的方法,该方法包括:(a)通过从在低浓度乳酸培养基中到在高浓度乳酸培养基中循序地培养具有乳酸产生能力的重组酵母菌株,来诱导该重组酵母菌株向高乳酸浓度的适应性进化; (b)选择在该高浓度乳酸培养基中具有提高的乳酸产生能力的重组酵母菌株;以及(c)在所选菌株的基因组的PDC基因的位置处引入编码源自表皮葡萄球菌的乳酸脱氢酶的基因。
根据本发明的另一个方面,提供了重组菌株#26-5(登记号:KCTC 14215BP),其通过具有乳酸产生能力的重组菌株在高乳酸浓度下的适应性进化构建而成,该具有乳酸产生能力的重组菌株通过从耐酸酵母YBC 菌株(KCTC13508BP)中缺失GPD1基因(其编码将磷酸二羟丙酮 (dihydroxyacetone phosphate)转化为甘油-3-磷酸(glycerol-3-phosphate)的酶)、CYB2基因(其编码将乳酸盐转化为丙酮酸盐的酶)、ADH基因(其编码醇脱氢酶(alcohol dehydrogenase))和PDC基因(其编码丙酮酸脱羧酶),并将编码乳酸脱氢酶的基因引入耐酸酵母YBC菌株构建而成。
根据本发明的另一个方面,提供了一种重组酵母YBC6菌株,该重组酵母YBC6菌株通过在重组菌株#26-5(登记号:KCTC 14215BP)的基因组的PDC基因的位置处引入编码源自表皮葡萄球菌的乳酸脱氢酶的基因构建而成,其中与YBC菌株(KCTC13508BP)或YBC5菌株相比,该重组酵母 YBC6菌株具有在高乳酸浓度下提高的乳酸产生能力和减少的乙醇和甘油的产生能力。
根据本发明的另一个方面,提供了一种生产乳酸的方法,其包括:(a) 培养该菌株以产生乳酸;和(b)收集产生的乳酸。
本发明的效果
当使用根据本发明的重组耐酸酵母生产乳酸时,可以使用比常规细菌发酵情况下显著更少量的中和剂以与细菌发酵相似的乳酸产生能力进行乳酸发酵,因此发酵成本可以大大降低。此外,可以减少副产物乙醇和甘油的产生,因此可以降低后续纯化工艺的成本。
附图说明
从以下结合附图的详细描述中,将更清楚地理解本发明的上述和其他目的、特征和其他优点,其中:
图1为说明根据本发明的YBC5菌株在各种乳酸浓度下强制诱导适应以增加对乳酸的耐受性的过程的示意图,具体地,图1(a)示出了在第二轮适应性进化中传代培养期间细胞的生长,以确保在乳酸浓度适应过程中菌株#26-5被选择,图1(b)示出了在第三轮适应性进化中传代培养期间,细胞在70g/L至80g/L乳酸浓度下的生长,以进一步提高对乳酸的耐受性。
图2示出了菌株#26-1和菌株#26-5之间的乳酸生产曲线的比较结果,它们是在根据本发明的第二轮适应性进化中选择的两个菌株,#26菌株是在第一轮适应性进化中选择的。
图3示出了菌株#26-5的发酵曲线,其选自通过根据本发明的YBC5菌株的适应性进化而被赋予提高的乳酸耐受性的菌落(colony);
图4示出了根据本发明的菌株#26-5与菌株YBC5之间的发酵曲线的比较结果;
图5示出了用于从菌株#26-5的基因组中缺失PDC1(g3002-1)基因的盒(cassete)的实例,该菌株是YBC菌株、YBC1菌株和YBC5菌株的改良变体,用于在相应基因被缺失的位置处插入LDH基因,或者用于将已经插入的LDH基因与另一个LDH基因交换;
图6示出了YBC6的发酵曲线,其为通过将源自表皮葡萄球菌的LDH 引入#26-5菌株构建而成的菌株,该菌株选自通过根据本发明的YBC5菌株的适应性进化而被赋予提高的乳酸耐受性的菌群(strain flora);
图7示出了根据本发明的YBC5菌株、#26-5菌株和YBC6菌株在发酵过程中的乳酸产生能力的比较结果。
具体实施方式
除非另外定义,否则本说明书中使用的所有技术术语和科学术语具有与本发明所属领域的技术人员通常理解的相同的含义。通常,本说明书中使用的术语是本领域公知的并且是本领域中常用的。
耐酸酵母的特征在于,即使在酸性pH下也能以高速率消耗糖,显示出高的生长速率,并且在发酵条件下将消耗的葡萄糖转化为所需的产物。在本发明人先前的研究中,从几个酵母文库中具有这些特征的酵母中选择耐酸酵母菌株(KCTC13508BP),耐酸酵母菌株(KCTC13508BP)即使在 40g/L至80g/L的乳酸浓度下也具有高生长率和高糖消耗率(韩国专利申请第10-2018-0044509号)。
在本发明人先前的专利申请中,通过控制该耐酸酵母YBC菌株的代谢回路来提高其乳酸产生能力并抑制其乙醇产生能力,本发明人通过从如下获得的菌株中缺失编码将乳酸盐转化为丙酮酸盐的细胞色素b2酶的基因获得重组菌株:从该YBC菌株中缺失编码醇脱氢酶的基因和编码丙酮酸脱羧酶的基因,并将乳酸脱氢酶基因引入该YBC菌株。
此外,为了抑制构建的菌株中甘油的产生,本发明人通过从该菌株中缺失编码甘油-3-磷酸脱氢酶(其将磷酸羟基丙酮转化为甘油3-磷酸)的基因来构建重组菌株。
在本发明中,为了恢复重组菌株的乳酸耐受性,将重组菌株在含有最高为80g/L的各种浓度的乳酸的培养基中传代培养,从中选择具有优异乳酸耐受性的菌株,并用源自表皮葡萄球菌的乳酸脱氢酶基因替换在所选菌株的PDC基因组位置处取代后的外源乳酸脱氢酶基因,以构建新的重组菌株,并发现该重组菌株具有高乳酸耐受性、高乳酸产生能力和被抑制的乙醇和甘油的产生能力。
因此,一方面,本发明涉及一种具有乳酸产生能力的重组菌株,该重组菌株通过从耐酸酵母YBC菌株(KCTC13508BP)中缺失编码丙酮酸脱羧酶的基因并在该编码丙酮酸脱羧酶的基因的位置引入编码源自表皮葡萄球菌的乳酸脱氢酶的基因构建而成。
在本发明中,编码源自表皮葡萄球菌的乳酸脱氢酶的基因可以由 SEQ ID NO:1表示。
在本发明中,该重组菌株特征在于进一步缺失或失活了编码醇脱氢酶的基因,以及在于进一步缺失或失活了编码将磷酸二羟丙酮转化为甘油-3-磷酸的酶的基因。
在本发明中,该重组菌株特征在于进一步缺失或失活了编码将乳酸盐转化为丙酮酸盐的酶的基因。
在另一方面,本发明涉及具有乳酸产生能力的重组菌株,该重组菌株通过从耐酸酵母YBC菌株(KCTC13508BP)中缺失GPD1基因(其编码将磷酸二羟丙酮转化为甘油-3-磷酸的酶)、CYB2基因(其编码将乳酸盐转化为丙酮酸盐的酶)、ADH基因(其编码醇脱氢酶)以及PDC基因(其编码丙酮酸脱羧酶),并将编码乳酸脱氢酶的基因引入该耐酸酵母YBC菌株构建而成,
其中该编码乳酸脱氢酶的基因被引入缺失的ADH基因的位置、缺失的PDC基因的位置和缺失的GPD1基因的位置处,并且
在缺失的PDC基因的位置处引入的该编码乳酸脱氢酶的基因是编码源自表皮葡萄球菌的乳酸脱氢酶的基因。
在本发明中,该编码源自表皮葡萄球菌的乳酸脱氢酶的基因由SEQ ID NO:1表示,其蛋白质序列(protein sequence)由SEQ ID NO:2表示,其中密码子的使用经调整以在耐酸YBC菌株中基因表达。
在本发明中,在缺失的ADH基因的位置和缺失的GPD1基因的位置处引入的该编码乳酸脱氢酶的基因可以源自表皮葡萄球菌或植物乳杆菌 (Lactobacillus plantarum)。
具有乳酸产生能力的重组菌株表现出高乳酸产量。然而,细胞内大量产生的乳酸和细胞内碳通量的巨大变化影响氧化还原平衡和细胞生长和调节机制,导致细胞生长速率和糖消耗速率(以及最终地,乳酸产生速率)降低的变化。基因工程导致乳酸耐受性降低的原因如下。传统的野生型微生物是即使在细胞外乳酸浓度高(40~80g/L)且pH为2至3(低于pKa)的环境下也能生长良好的菌株。该菌株在基因工程(genetic engineering)后在细胞内主动产生乳酸,因此乳酸的产生受到细胞内产生的乳酸和从细胞外穿透细胞膜(通过传质(mass transfer))的乳酸的抑制。细胞内增加的乳酸浓度降低了细胞的pH,从而显示出抑制各种细胞内活性的效果,包括基因复制和蛋白质产生,导致降低的乳酸耐受性。
此外,当细胞外乳酸浓度增加或外部pH酸性更高,因此总乳酸的大部分以水合形式存在时,这种效果变得更强。为了解决重组菌株带来的问题,将菌株在目标环境中连续培养,例如适应性进化/强制进化,不断选择在适应环境的同时被修饰的细胞以提高性能(Zhengming Zhu et al., Applied Microbiology and Biotechnology,102:4615-4627,2018;Eugene Fletcher et al.,Metabolic Engineering 39(2017)19-28,2017;Christopher P Long,Current Opinions in Chemical Engineering,22:209-215,2018)。导致突变的化合物或导致突变的物理因素(例如紫外线)可用于这种强制进化 (ZhengmingZhu et al.,Applied Microbiology and Biotechnology(2018) 102:4615-4627)。最初,试图应用其它方法,而不是本发明中上述的适应性进化。然而,同时增加期望的性能方面(例如,酸耐受性)和其他的性能方面(例如,生产率)的随机突变非常罕见。尽管单独监测了约100种菌株,但很难从中选择有用的菌株。考虑到需要开发能够选择优良菌落的自动化高通量系统和能够从培养并突变至108菌落/mL或更高的细胞组中检测优良菌落的基因系统(例如,与LDH表达水平成比例的荧光报告子),很难扩大进行选择的组。
因此,使用适应性进化方法,而不是使用突变体的方法,在高糖和高乳酸浓度下连续培养重组酵母菌株的细胞,当细胞生长良好时,乳酸浓度增加。此外,重复包括在培养的中间阶段将相应的细胞接种在含有乳酸的固体培养基上,从固体培养基中选择具有高生长速率(具有大尺寸) 的菌落,并分别用烧瓶测试菌落的乳酸产生能力的过程。选择的菌株直接与培养物的亲本菌株进行比较。当重复该操作时,能够选择具有提高的乳酸产生能力和所需的乳酸耐受性的菌株。通过基于发酵罐的培养检测到发酵性能的提高(见图7)。
在另一方面,本发明涉及一种生产具有提高的乳酸耐受性和提高的乳酸产生能力的重组酵母菌株的方法。该方法包括:(a)通过从在低浓度乳酸培养基中到在高浓度乳酸培养基中循序地培养具有乳酸产生能力的重组酵母菌株,来诱导该重组酵母菌株向高乳酸浓度的适应性进化;(b) 选择在该高浓度乳酸培养基中具有提高的乳酸产生能力的重组酵母菌株;以及(c)在所选菌株的基因组的PDC基因的位置处引入编码源自表皮葡萄球菌的乳酸脱氢酶的基因。
在本发明中,步骤(a)中具有乳酸产生能力的重组酵母菌株是通过从耐酸酵母YBC菌株(KCTC13508BP)中缺失GPD1基因(其编码将磷酸二羟丙酮转化为甘油-3-磷酸的酶)、CYB2基因(其编码将乳酸盐转化为丙酮酸盐的酶)、ADH基因(其编码醇脱氢酶)和PDC基因(其编码丙酮酸脱羧酶),并将编码乳酸脱氢酶的基因引入耐酸酵母YBC菌株构建而成的YBC5菌株,其中编码乳酸脱氢酶的基因被引入缺失的ADH基因的位置、缺失的 PDC基因的位置和缺失的GPD1基因的位置处。
在另一方面,本发明涉及重组菌株#26-5(登记号:KCTC 14215BP),其通过具有乳酸产生能力的重组菌株在高乳酸浓度下适应性进化构建而成,该具有乳酸产生能力的重组菌株是通过从耐酸酵母YBC菌株(KCTC13508BP)中缺失GPD1基因(其编码将磷酸二羟丙酮转化为甘油-3- 磷酸的酶)、CYB2基因(其编码将乳酸盐转化为丙酮酸盐的酶)、ADH基因 (其编码醇脱氢酶)和PDC基因(其编码丙酮酸脱羧酶),并将编码乳酸脱氢酶的基因引入至耐酸酵母YBC菌株构建而成。
在另一方面,本发明涉及一种重组酵母YBC6菌株,该重组酵母YBC6 菌株通过在重组菌株#26-5(登记号:KCTC 14215BP)的基因组的PDC基因的位置处引入编码源自表皮葡萄球菌的乳酸脱氢酶的基因构建而成,其中该重组酵母YBC6菌株与YBC菌株(KCTC13508BP)或YBC5菌株相比,具有在高乳酸浓度下提高的乳酸产生能力和被抑制的乙醇和甘油的产生能力。
适应性进化(adaptive evolution)是一个非常强大的工具,但是会产生副作用。在本发明中进行的适应性进化是选择适于在高浓度乳酸的存在下生长良好的菌株。在该过程中选择的适应性微生物即使在游离乳酸的浓度高时也能生长,并且获得了即使在乳酸存在下也具有针对乳酸浓度的防御机制并促进糖代谢的菌株,因为具有高乳酸产生能力的菌株在选择过程中被选中。即使当发酵培养过程中反应器中乳酸的浓度增加时,所选菌株的这一特征也可以增加反应器中微生物的浓度,并且这种高浓度的微生物可以增加总乳酸产生速率(production rate)。特别地,本发明使用的亲本菌株是在高浓度乳酸下以非常快的速度生长并且具有高葡萄糖消耗速率的菌株,这是遗传操作之前的特征。因此,适应性进化可以在通过遗传操作将碳通量从乙醇转化为乳酸的过程中,在乳酸存在的情况下,使降低的乳酸耐受性和随之降低的生长速率得以恢复。然而,从微生物的角度考虑这种适应性进化过程,微生物必然会选择几个方向。从微生物的角度来看,在低于pKa的酸性pH下,当乳酸的外部浓度增加时,未结合的水合乳酸被转移并渗透到细胞中,这降低了细胞内的pH。在这种乳酸的作用很大的状态下,在选择能够产生ATP和NADH同时转化作为生长底物的糖的发酵产物(碳流)时,在进行DNA复制和蛋白质生产的同时,促进生长和加速糖代谢同时提高乳酸产生性能(例如,产生速率)的进化方向,其具有与外部胁迫相同的作用,不能是自然选择的方向,但是对于生物体来说,选择一个方向是很自然的,在该方向上诱导对当前应激物的影响小于对乳酸的影响的发酵产物。此外,在本发明中使用的亲本菌株中的乳酸产生途径是从外部引入的途径,并且野生型微生物最初是在产生作为主要副产物的乙醇的同时生长的微生物,因此被加强以促进生长和增加乳酸耐受性的碳途径显然是乙醇,并且已经确定增加了乳酸产生过程的副产物乙醇。
为了分析适应性进化诱导的促进乳酸耐受性增加和副产物增加的遗传因素,对适应性进化前后的微生物进行了qPCR分析(转录组分析)。除了qPCR分析之外,可以对整个基因组进行分析,以检测它们之间的差异。然而,基因组中有许多基因已经被突变修饰,但事实上,只有其中一些基因是通过蛋白质表达或其调节机制作为表型出现的因子。为此,人们认为qPCR分析更适合于发现表型遗传因素,例如耐受性增加和生长速率增加。
为了鉴定在转录组分析中具有表达差异的基因,在所有分析的RNA 序列中筛选其中与野生型菌株中的表达率相比,适应性进化菌株中表达的倍数变化(fold change)减少或增加两倍以下,或两倍以上的基因,然后使用注释分析相应的基因(见表6)。
在表达减少的基因库(gene pool)中,一个特定的基因是编码乳酸脱氢酶的LDH基因。如上所述,选择增加乳酸浓度、最快生长速率和乳酸产生速率(在某些情况下,最高乳酸产生浓度)的微生物,但是能够响应由乳酸引起的氧化压力或酸性pH的抑制的微生物的优选响应是减少内部产生的乳酸量。在适应性进化过程中,表达的减少在微生物中积累。当然,所选菌株也具有优异的乳酸产生能力,但这主要被认为是由于快速生长速率和细胞中糖消耗速率或糖转移速率的增加导致的微生物浓度的快速增加。认为,由于这种表达的减少,每个细胞的乳酸产生能力降低。
作为分析表达增加的基因库的结果,观察到执行相同功能的几个基因组(group)的表达一起增加。这些基因的分类如下。
A类与发酵产物有关,是一组与乙醇产生有关的基因,除了乳酸脱氢酶表达的减少之外,其目的是增加碳通量以实现生长率。特别地,当检测常规的PDC(编码丙酮酸脱羧酶的基因)和ADH(醇脱氢酶)活性时,发现作为候选基因被检测但无法检测到所预期的活性的基因在目前的适应性进化中被增强。E类与己糖转运蛋白有关,是一组与最初作用于C6糖类(即,例如葡萄糖和甘露糖的糖类)并将其转运至细胞中的蛋白质相关的基因。B类是一组锌指蛋白。有研究报道,乳酸诱导的氧化应激影响与锌指蛋白相关的功能基团,或者与锌指蛋白相关的增强涉及活性氧(ROS)的减少(Derek A.Abbott et al.,Applied andEnvironmental Microbiology, 2320-2325,2009),包括乳酸在内的氧化应激与锌指蛋白之间的关系非常高(Xixi Zhou et al,The journal of biological chemistry,290:18361-18369, 2015;B Gao et al.,Cell Death and Disease,5:e1334,2014;AnandaS.Prasad and Bin Bao,Antioxidants 8:164,2019),预测所选菌株的耐酸性与这些锌指蛋白部分相关。D类是一组与硫酸盐/亚硫酸盐相关的基因,特别地,观察到硫酸盐/亚硫酸盐还原酶的表达增加,这被认为是由于通过乳酸介导的氧化应激来减轻硫酸盐/亚硫酸盐介导的氧化应激的机制。此外,支持抵抗外部胁迫的基因结构和细胞结构的基因表达被鉴定,并被统称为D 类应激反应。
通过转录组分析,确定了本发明菌株的各种特征,并且当在将来包括逆向工程(reverse engineering)的进一步研究中研究每个基因的特征时,可以设计另外的开发方法。
在转录组分析中,没有检测到与甘油产生相关的基因表达的增加,但是从通过发酵培养鉴定的表型中检测到,通过适应性进化选择的菌株与亲本菌株相比也表现出增加的甘油产生能力。因此,认为甘油的增加是由NADH通过甘油产生途径再生(regeneration)引起的,这是由于LDH失活,而不是相关基因的表达引起的NADH再生不足,因此当LDH被激活(增强)时,甘油将再次减少。
总之,由于乳酸耐受性的增加,在相同的时间段内发酵罐中可以获得更多的微生物,并且由于糖代谢速率的增加,发酵速率增加。然而,与此同时,由于乳酸耐受性增加,观察到乳酸过程的副产物增加。这种现象是由于LDH的表达减少,从而甘油和乙醇产生相关基因的表达增加,导致耐受性增加。作为解决这个问题的方法,可以考虑增强LDH和除去乙醇产生相关基因。在本发明中,首先,通过进行LDH的进一步表达,每个细胞中的产生速率(production rate)增加,以进一步增加总产生速率,通过用LDH进行辅酶NADH的还原,负责甘油途径的NADH的还原减少,因此,甘油增加的水平降低,并且通过提高乳酸产生能力,其直接与使用丙酮酸作为代谢途径中的前体节点的乙醇产生能力竞争,乙醇的产生被减少。此后,如果需要,可以除去额外表达的产乙醇基因。
在本发明的一个方面,经历适应性进化的菌株是YBC5菌株,其中在 YBC5菌株基因组的g4423(ADH)、g3002-1(PDC)和g2947(CYB2)位点处取代有源自植物乳杆菌的LDH基因,由于YBC菌株的二倍体特征,总共插入了6个基因拷贝。在许多情况下,当插入相同基因的多个拷贝时,由于细胞的反馈抑制,相同基因的表达被抑制,因此没有发挥与拷贝数成比例增加的效果,并且相同基因的存在可能影响基因组的稳定性。为此,本发明发现了一种增强LDH的新方法。如本发明人的在先专利申请(韩国专利申请第2018-0044509号和韩国专利申请第2019-0124701号)中所述,插入YBC5菌株的g4423(ADH)位点处的LDH表现出非常高的活性,并且由于g2947启动子的影响,插入g2947(CYB2)位点处的LDH在发酵后期也表现出持续的活性。然而,在g3002-1(PDC)位点处插入的LDH在表型上表现出相对较低的增加乳酸产生能力的作用。特别是,qPCR显示YBC固有的PDC通过g3002-1(PDC)的高活性启动子表现出高表达率。因此,对相关现象进行了进一步的研究。
首先,通过从野生型YBC菌株中除去g3002-1并引入源自植物乳杆菌的LDH来测定乳酸产生能力。观察到该菌株通过除去PDC很好地显示了表型,但是产生非常少量的乳酸。考虑到由于g4423,源自植物乳杆菌的 LDH被强烈表达,相同基因在同一菌株内不同基因组位置处表达的极端差异不是普遍现象。当LDH在RNA中表达时,预计RAN翻译成蛋白质将顺利进行,总之,假设相同的基因没有在相应的PDC位点转录。为了解决这一现象,本发明人试图建立各种假设,但没有找到解决这一问题的假设。因此,本发明人假设,顺利表达被抑制取决于源自植物乳杆菌的 LDH的基因组位点和基因结构问题。要解决这一问题,引入源自另一个菌株的LDH来改变基因组结构,从而促进LDH的表达。
对于要引入的目标LDH,选择在酵母中具有最佳酸性pH和优异表达以适合耐酸菌株特征的基因。考虑到必要的资源,单独使用筛选从自然界存在的众多基因中找到这些基因几乎是不可能的,并且是一项需要基因组挖掘和实验验证以及许多假设的任务。本发明人首先试图从文献中选择具有相似特征的基因,而不是进行这样的基因组挖掘,将其注入到相应耐酸菌株的g3002-1位点,然后测定其活性。本发明人试图通过文献检索找到以前在类似条件下验证过的基因,并靶标到源自表皮葡萄球菌的LDH和源自家牛(Bos taurus)的LDH,其被发现在主题文献中测试的基因中是有效的(Jae Won Lee et al,J.Biotecho241,2017)。
在将三个目标基因导入野生型YBC菌株(KCTC13508BP)的g3002-1 位点后,比较它们之间的乳酸产生能力。结果,发现了一个非常有趣的事实。三个基因之间的活性差异非常大,其中与源于植物乳杆菌的LDH (LpLDH)相比,源于表皮葡萄球菌的LDH(SeLDH)基因显示出基于乳酸产量的39倍的活性增加,结果,SeLDH基因单独在g3002-1位点获得0.39g/g 的产量,其活性与先前插入g4423位点处的LpLDH相当。结果,本发明人获得了能够确保g3002-1位点的高活性并恢复适应性进化菌株的LDH活性的方法。
然而,在各种LDH中,表皮葡萄球菌的LDH基因是以FDP-激活的LDH 的形式存在的需要FDP作为辅酶(E.I Garvie,Bacterial Lactate Dehydrogenases,MicrobiologicalReviews,1980),并且这种FDP是糖酵解的中间产物。因此,与使用普通糖类例如葡萄糖、果糖和蔗糖作为底物相比,当使用其他糖类时,LpLDH基因的活性很可能受到FDP的可用性的影响,并且其活性很可能因此受到影响。然而,因为在商业过程中主要使用的底物是玉米淀粉,其主要由葡萄糖、果糖和蔗糖及它们的糖化产物、甘蔗汁及其副产物组成,这种限制被最小化。
在本发明的一个方面,将SeLDH基因插入到通过适应性进化选择的菌株中,结果显示该菌株显示出从0.67g/g到0.75g/g的进一步增加的产量,并在获得从2.54g/L/小时到2.56g/L/小时的进一步增加的生产率和从 123g/L到130g/L的增加的浓度的同时显示出如预期的副产物的大量减少。
在本发明获得的菌株的培养物中发现了耐酸菌株发酵的ATP相关需求,如现有文献中所述(Antonius J.A.van Maris et al.,Appl.Environ. Microbiol.,70;2898,2004)。换句话说,随着细胞外乳酸浓度的增加,需要能量将细胞内乳酸转移到外部,并且由于消耗ATP,发酵需要ATP供应。 ATP的消耗是由于当添加2ATP/葡萄糖时产生耐酸性乳酸(acid-resistant lactic acid),这可以通过在一般发酵中阻断其中的氧气的同时迫使生产菌株产生乳酸来保证。因此,在一般发酵的厌氧条件下,细胞维持所需的能量是不足的。需要氧气供应来补充能量,供应的氧气完全氧化TCA途径中的底物来供应能量ATP,但同时,一些底物被转化为CO2,而不是乳酸,导致乳酸产量降低。因此,有必要设置最佳的通气条件,使乳酸产量的损失最小化,同时保持细胞活性。
此外,本发明的菌株既是酵母又是革兰氏阳性菌株,因此这应在设定发酵方法时给予考虑。众所周知,在高糖浓度下,即使在有氧条件下,革兰氏阳性菌株也会经历厌氧发酵反应,例如乙醇发酵或乳酸发酵,而不是TCA发酵。相比之下,革兰氏阴性菌株在有氧条件下可以抑制发酵产物的产生,因此只能增加细胞,细胞生长期和发酵产物产生期可以分开进行。然而,这些革兰氏阴性菌株在需氧条件下导致细胞生长,并且可以确保发酵罐中的高细胞浓度,但是细胞不能无限增加以提高发酵速率,因为在需氧条件下消耗的许多底物随着细胞生长转化为CO2(此外,细胞浓度根据培养基的营养物和限制性底物(limitsubstrate)而受到限制是自然的)。另一方面,革兰氏阳性菌株在有氧条件下生长细胞,也产生发酵产物。因此,当发酵产物是乳酸时,糖酵解过程所需的NAD可以由LDH 消耗的NADH提供,革兰氏阳性菌株在乳酸产量方面相对有利,因为与通过呼吸将底物转化(氧化)为CO2来提供NAD的革兰氏阴性菌株相比,减少碳损失为CO2的同时乳酸可以增加。然而,发酵产物在反应器中的快速积累可能快速达到发生生长抑制的乳酸浓度,并且在增殖至所需细胞浓度方面可能存在限制。因此,为了克服这一点,需要调节最佳氧气供应速率,以最大化发酵的最佳种子浓度和初始生长率,并防止底物由于过量氧气而过度转化为CO2。此外,一旦达到生长停止时的乳酸浓度,就需要通过降低氧气供应速率来调节氧气供应速率,以保持不会发生过量CO2损失的微氧状态,同时供应ATP,ATP是如上所述将乳酸排放到细胞外部的能量。因此,对于添加用于中和的化合物和混合化合物而言,本发明的发酵不需要在发酵罐中保持高的混合速率,但是有必要控制最小的氧气供应速率,在该最小的氧气供应速率下,在发酵的早期阶段发生足够的细胞生长,并且在发酵的后期阶段发生足够的ATP供应,同时减少由于过量氧气而导致的CO2损失,这是一个重要的放大(scale-up)因素。基于氧气供应速率的优化,可以通过许多实验找到合适的曝气速率和混合速率,并且也可以使用本领域已知的参数例如OUR和OTR找到最佳值。
在本发明的一个方面,与亲本菌株YBC菌株(KCTC13508BP)或源自亲本菌株的突变菌株(YBC1、YBC2/YBC3/YBC4/YBC5)相比,由于对乳酸的耐受性增加,通过适应性进化选择的菌株可以获得高细胞浓度和较快的乳酸产生速率。在本发明的另一方面,通过适应性进化选择的菌株中的LDH被增强,以实现乙醇和甘油产量的减少。
在本发明的另一方面,与通过在亲本菌株YBC菌株基因组的PDC (g3002-1)基因位置处引入源自植物乳杆菌的LDH基因获得的乳酸产生能力相比,发现通过在该位置处引入源自表皮葡萄球菌的LDH基因获得的乳酸产生能力以产量计增加了30倍以上。
在本发明中,引入的编码乳酸脱氢酶的基因优选是源自瑞士乳杆菌(L.helveticus)的LDH基因、源自米根霉(R.oryzae)的LDH基因或源自植物乳杆菌的LDH基因、源自家牛(B.taurus)的LDH基因和源自表皮葡萄球菌的LDH基因。更优选地,源自植物乳杆菌的LDH基因被引入到g4423(ADH) 位点处,并且源自表皮葡萄球菌的LDH基因被引入到g3002-1(PDC)位点处。
在本发明的一个方面,通过YBC5菌株(Δ g4423::ldh/Δg3002-1::ldh/Δg2947::ldh/Δg1544)的适应性进化获得的具有显著增加的乳酸耐受性的#26-5菌株表现出高乳酸生产率和高乳酸产生浓度,因此显著提高了该方法的经济效率。此外,用源自表皮葡萄球菌的 LDH取代#26-5菌株g3002-1的LDH构建的YBC6菌株(Δ g4423::LpLDH/Δg3002-1::SeLDH/Δg2947::LpLDH/Δg1544)与YBC5和 #26-5相比,表现出增加的乳酸产生速率和产生浓度,以及受抑制的乙醇和甘油的产生,从而增加产量。YBC6的发酵特性显示产量、产生速率和产生浓度已经达到可作为耐酸菌株商业化的水平。
因此,在另一方面,本发明涉及一种生产乳酸的方法,其包括(a)培养重组菌株以产生乳酸,和(b)收集产生的乳酸。
本发明能够实现一种优异的耐酸菌株,其具有达到商业化水平的大大增加的乳酸生产率、浓度和产量,大大降低的乙醇产量,及大大降低的甘油副产物。
如本文所用,术语“耐酸酵母”被定义为当培养基含有浓度至少为 1M的有机酸(特别是乳酸)时,与培养基不含有机酸时相比,在低于有机酸的pKa值的pH下,能够保持至少10%的生物量消耗速率(例如糖消耗速率)或至少10%的比生长速率的酵母。更具体地,术语“耐酸酵母”被定义为与pH 5以上相比,在pH 2至4时能够保持至少10%的生物量消耗率(例如糖消耗速率)或至少10%的比生长速率的酵母。
根据本发明的重组酵母可以通过根据常规方法将该基因插入宿主酵母的染色体中生产,或者通过将包含该基因的载体引入宿主酵母中来生产。
作为宿主酵母,通常使用具有高DNA导入效率和导入DNA的高表达效率的宿主细胞。在本发明的一个实施方案中,使用了耐酸酵母,但是本发明不限于此,可以使用任何类型的酵母,只要其能够充分表达靶 DNA。
可以根据任何转化方法制备重组酵母。术语“转化”指将DNA引入宿主体内,使DNA作为染色体的一个因子或通过染色体整合进行复制的现象,意指通过将外源DNA引入细胞而人为诱导遗传变化的现象。一般的转化方法包括电穿孔、乙酸锂-聚乙二醇等。
此外,在本发明中,任何公知的基因工程方法都可以用作将基因插入宿主微生物染色体的方法。例如,有使用逆转录病毒载体、腺病毒载体、腺相关病毒载体、单纯疱疹病毒载体、痘病毒载体、慢病毒载体、非病毒载体等的方法。“载体”意指含有可操作地连接至能够在合适的宿主中表达该DNA的合适的调节序列的DNA序列的DNA产物。载体可以是质粒、噬菌体颗粒或简单的潜在基因组插入物。当转化到合适的宿主中时,载体可以被复制或执行独立于宿主基因组的功能,或者其中一些可以与基因组整合。质粒是目前最常用的载体形式,但线性DNA也是酵母基因组整合的常用形式。
典型的质粒载体包括(a)有效进行复制的复制起点,使得在每个宿主细胞中包含预定数量的质粒载体;(b)抗生素耐受性基因或营养缺陷型标记基因,以筛选用质粒载体转化的宿主细胞;和(c)插入外源DNA片段的限制性酶切位点。即使不存在合适的限制性酶切位点,也可以按照常规方法(吉布森组装)使用合成的寡核苷酸衔接子或接头容易地将载体和外源DNA连接。如果需要,也通常使用合成整个所需序列并使用的方法。
此外,当一个核酸序列基于它们之间的功能关系与另一个核酸序列对齐时,它被称为与其“可操作地连接”。这可以是基因和控制序列以这样的方式连接,使得当合适的分子(例如,转录激活蛋白)连接到控制序列时能够进行基因表达。例如,当作为参与多肽分泌的前蛋白表达时,用于前序列或分泌前导序列的DNA可操作地连接至多肽的DNA;当影响序列的转录时,启动子或增强子可操作地连接至编码序列;当影响序列的转录时,核糖体结合位点可操作地连接至编码序列;或者定位以促进翻译时,核糖体结合位点可操作地连接至编码序列。
通常,术语“可操作地连接”意指连接的DNA序列与其接触,或者分泌前导序列与其接触并存在于阅读框中。然而,增强子不需要与之接触。这些序列的连接(linkage)是通过在方便的限制性酶切位点进行结合 (ligation)(连接)来实现的。当不存在这样的位点时,使用根据常规方法的合成寡核苷酸衔接子或接头。
应当理解的是,在表达本发明的DNA序列时,并非所有载体的功能都相同。同样,对于同一表达系统,并非所有宿主的功能都相同。然而,本领域技术人员将能够从各种载体、表达控制序列和宿主中进行适当的选择,而不需要过多的实验负担,并且不背离本发明的范围。例如,应该考虑宿主来进行载体的选择,因为载体应该在其中复制。还应考虑载体复制的次数、控制载体复制次数的能力以及由相应载体编码的其他蛋白质的表达,例如抗生素标记的表达。
在本发明中,碳源可以包括但不限于选自葡萄糖、木糖、阿拉伯糖、蔗糖、果糖、纤维素、半乳糖、葡萄糖低聚物和甘油中的一种或多种。
在本发明中,可以在使得微生物,例如大肠杆菌(E.coli)等不再起作用(例如,不能产生代谢物)的条件下进行培养。例如,可以在1.0至6.5的 pH,优选1.0至6.0的pH,更优选2.6至4.0的pH下进行培养,但不限于此。
在下文中,将参考实施例更详细地描述本发明。然而,对于本领域技术人员来说,显而易见的是,这些实施例仅用于说明本发明,并且不应被解释为限制本发明的范围。
实施例1:耐酸菌株YBC的适应性进化#1
在先前的研究中,本发明人通过对各种酵母菌株进行测试来选择具有耐酸性的菌株,并通过在酵母菌株培养开始时向培养基中添加乳酸并监测微生物的生长和糖消耗速率来确定具有最佳耐酸性的菌株,即,YBC 菌株,并将该菌株以保藏号KCTC13508BP保藏在韩国典型培养物保藏中心。
系统发育分析(Phylogenetic analysis)表明,YBC菌株(KCTC13508BP) 是一种与酿酒酵母(S.cerevisiae)类似的菌株,是二倍体,并且为Crabtree 阳性。
通过抑制乳酸消耗和抑制甘油产生同时最小化乙醇产生的抑制,从相应的YBC菌株遗传修饰的YBC5菌株获得了可商业化的产量(韩国专利申请第10-2020-0046779号)。
YBC5菌株如下获得:从YBC菌株中缺失ADH(醇脱氢酶)并将LDH基因引入该菌株以构建YBC1菌株,从YBC1菌株中除去g3002-1基因(PDC 基因)并在其中表达LDH以构建能够高效产生乳酸并具有被抑制的乙醇产生的YBC2菌株,将LDH基因引入YBC2菌株并除去g2947(其为消耗乳酸的基因),以构建乳酸消耗能力被去除的YBC4菌株,并从YBC4菌株中除去GPD1(g1544)基因(除去等位基因1和等位基因2,作为二倍体菌株),以构建YBC5菌株。
构建该菌株的方法如下:
YBC1菌株是通过从YBC菌株中除去作为YBC菌株的主要ADH基因的g4423基因,并在g4423的位置处引入源自植物乳杆菌的SEQ ID NO.3 的LDH基因而获得的菌株。基于g4423及其UTR的信息构建已除去每个基因的ORF并且包含5’UTR和3’UTR基因盒(genecassette),并将其用作供体DNA。对于g4423的每个等位基因,相应的5’UTR由SEQ ID NO.4和 SEQ ID NO.5表示,3’UTR由SEQ ID NO.6和SEQ ID NO.7表示。采用如上所述的使用限制性酶的克隆方法、吉布森组装和使用基因合成的方法产生供体DNA。合成SEQ ID NO.3的LDH,然后将其引入g4423的ORF位点,以产生供体DNA,并将供体DNA引入YBC,以构建重组菌株YBC1。
此外,g3002-1基因是在YBC菌株的基因组测序中位于骨架72处的基因,并充当PDC基因。从YBC1菌株中除去g3002-1基因(位于骨架72处的基因),并将SEQ ID NO:3的LDH基因引入其中以构建重组菌株YBC2。
使用相应的UTR作为重组位点来构建用于替代g3002基因的盒。类似于上述将LDH引入YBC1的g4423基因(ADH)位点的方法,使用g3002-1的UTR构建该盒。然而,为了简化基因替换的过程,在不考虑等位基因变异的情况下制备针对一个等位基因的供体盒(cassette),但是也可以为每个等位基因制备一个供体盒(cassette)。此外,对于用于基因替换的引物,除了用于产生缺失菌株的引物之外,分别使用能够检测LDH和g3002-1的 UTR的如下一对引物,以增加基因替换验证的准确性。
g3002-1 UTR-LDH-正向引物:GCAGGATATCAGTTGTTTG(SEQ ID NO:8)
g3002-1 UTR-LDH-反向引物:AATACCTTGTTGAGCCATAG(SEQ ID NO:9)
此外,YBC4菌株是通过从YBC2菌株中缺失YBC2菌株的主要CYB2 基因,即g2947基因,并在g2947基因的位置处引入源自植物乳杆菌的SEQ ID NO:3的LDH基因构建而成的菌株。在YBC菌株的基因组测序中g2947 基因是位于骨架41处的基因。基于g2947及其UTR的信息,构建已除去每个基因的ORF并且包含5’UTR和3’UTR基因盒(gene cassette),并将其用作供体DNA。对于g2947的每个等位基因,相应的5’UTR由SEQ ID NO:10 和SEQ ID NO:11表示,3’UTR由SEQ ID NO:12和SEQ ID NO:13表示。采用如上所述的使用限制性酶的克隆方法、吉布森组装和使用基因合成的方法来产生供体DNA。
然而,为了简化基因替换的过程,在不考虑等位基因变异的情况下制备针对一个等位基因的供体盒,但是也可以为每个等位基因制备一个供体盒。
YBC5菌株是通过从YBC4菌株中缺失作为YBC4菌株的GPD1基因的 g1544基因构建而成的菌株。在YBC菌株的基因组测序中g1544基因是位于骨架19处的基因。基于g1544及其UTR的信息,构建除去每个基因的ORF并且包含5’UTR和3’UTR和抗生素标记的基因盒,并将其用作供体 DNA。对于g1544的每个等位基因,相应的5’UTR由SEQ ID NO:14和SEQ ID NO:15表示,而3’UTR由SEQ ID NO:16和SEQ ID NO:17表示。采用如上所述的使用限制性酶的克隆方法、吉布森组装和使用基因合成的方法制备供体DNA。
为了简化基因替换的过程,在不考虑等位基因变异的情况下制备了针对一个等位基因的供体盒(cassette),但是也可以为每个等位基因制备一个供体盒(cassette)。另外,当使用目前商业化的基因工程技术(CRISPR) 时,可以在不使用抗生素标记的情况下制备和应用供体盒。
制备的重组菌株的基因型如下:
YBC2:Δg4423::ldh/Δg3002-1::ldh
YBC4:Δg4423::ldh/Δg3002-1::ldh/Δg2947::ldh
YBC5:Δg4423::ldh/Δg3002-1::ldh/Δg2947::ldh/Δg1544
然而,为了确保商业化的经济可行性,重组菌株必须在3.7以下的pH 下达到2.5g/L/小时以上的产生速率和120g/L以上的乳酸浓度。因此,在以下实施例中,为了提高YBC5菌株的乳酸产生速率,进行了提高乳酸耐受性的处理。
如表1所示,YBC5菌株被传代培养,同时乳酸浓度从10g/L逐渐增加至80g/L。在传代培养期间,通过自然突变在细胞中出现突变体,并且高度适应高浓度乳酸的菌株生长相对较快,并逐渐成为整个菌株菌群中的优势种。在增加乳酸浓度的同时重复该过程,并检测整个菌株菌群的生长率。此外,在适当的时间点将菌株菌群置于含有乳酸的琼脂平板上,并分离产生的菌落。此时,所选择的菌落是在含有乳酸的固体培养基上因快速生长而最大的菌落。通过该过程,从在40g/L、50g/L、60g/L、70g/L 和80g/L的液体浓度下生长的菌落中选择了42个菌落。
[表1]
Figure RE-GDA0003208354950000221
在该过程中,由菌群在每种含乳酸的培养基中产生的乳酸浓度的变化如表2所示。
[表2]
LA0 培养时间(小时) OD(A600) pH 葡萄糖 乳酸 乙醇 甘油
10/1,14:00 0 0.20 4.34 54.5 1.0 0.0 0.0
10/2,15:00 25 6.72 2.52 9.2 38.0 0.8 0.0
LA10 培养时间(小时) OD(A600) pH 葡萄糖 乳酸 乙醇 甘油
10/1,14:00 0 0.20 3.10 54.2 11.4 0.0 0.6
10/2,15:00 25 5.72 2.56 13.9 42.5 0.8 0.6
LA20 培养时间(小时) OD(A600) pH 葡萄糖 乳酸 乙醇 甘油
10/1,14:00 0 0.17 2.94 55.2 21.7 0.0 0.9
10/2,15:00 42 6.50 2.53 6.8 58.0 0.0 1.3
LA30 培养时间(小时) OD(A600) pH 葡萄糖 乳酸 乙醇 甘油
10/4,9:00 0 0.21 2.83 52.0 32.6 0.0 1.5
10/5,10:00 25 4.06 2.75 43.4 37.3 0.0 1.6
LA40 培养时间(小时) OD(A600) pH 葡萄糖 乳酸 乙醇 甘油
10/5,10:00 0 0.10 2.78 54.5 41.9 0.0 2.6
10/7,9:00 47 5.40 2.56 21.6 64.4 1.1 3.4
LA50 培养时间(小时) OD(A600) pH 葡萄糖 乳酸 乙醇 甘油
10/7,11:00 0 0.22 2.75 54.6 51.9 0.0 2.5
10/9,8:00 45 1.34 2.75 53.6 55.4 0.0 2.8
10/10,9:00 70 5.92 2.64 29.5 65.7 0.0 3.3
LA60 培养时间(小时) OD(A600) pH 葡萄糖 乳酸 乙醇 甘油
10/9,8:00 0 0.07 2.73 54.6 61.6 0.0 3.1
3d 72 0.25 2.72 51.0 60.6 0.0 3.4
5d 120 6.23 3.43 18.1 81.2 0.0 3.7
LA70 培养时间(小时) OD(A600) pH 葡萄糖 乳酸 乙醇 甘油
10/12,09:00 0 0.05 2.73 51.7 68.7 0.0 3.8
6d 144 4.70 3.52 30.9 83.1 0.0 5.5
LA80_1 培养时间(小时) OD(A600) pH 葡萄糖 乳酸 乙醇 甘油
10/18,09:00 0 NA 3.55 53.7 79.2 0.0 5.9
10/22,15:00 102 6.14 3.37 5.4 105.6 1.0 8.5
将42个选定的菌落接种到5ml锥形管中。因为是小规模培养,所以接种从菌落中获得的最均匀的数量(most uniform amount)以获得均匀的接种 OD。本文使用的培养基是补充6%(初级)葡萄糖或12%(二级)葡萄糖的mYP培养基(5g/L蛋白胨、4g/L酵母提取物、5g/LKH2PO4、2g/L MgSO4·7H2O、0.15g/L尿嘧啶),并在30℃和150rpm下培养96小时。
表3示出了5ml培养物的结果。在该培养物中,选择具有高乳酸产生浓度、高细胞浓度或高乳酸产生量的15个菌落,并对其进行以下烧瓶培养评价。
选择的菌落如下:3、5、6、8、10、22、24、26、27、31、32、35、 37、38、41。
[表3]
Figure RE-GDA0003208354950000241
Figure RE-GDA0003208354950000251
对选择的菌落进行烧瓶培养评价,培养条件如下。将10%葡萄糖(初级)添加至m-YP培养基(5g/L蛋白胨、4g/L酵母提取物、5g/L KH2PO4、2g/L MgSO4·7H2O、0.15g/L尿嘧啶)中,调节总体积为50ml。将微生物接种至培养基中,并在30℃和150rpm下培养72小时。此外,接种1天后,以糖注入浓度的20%的量添加CaCO3溶液。
烧瓶培养物的分析结果如表4所示。
基于为综合判断引入的评价逻辑来分析烧瓶培养物的结果。首先,为产生速率、乳酸产量、生长速率、乙醇浓度(按低级数(in low order))和甘油浓度(按低级数)的每一项选择前五个菌落,并将每一项中的得分和权重分配给所选择的菌落并进行汇总。至于权重,出于适应性进化的目的,优先考虑乳酸产生速率,排除生长速率较快但乳酸产生能力较低的菌落。表5示出了评价过程及其结果。
[表4]
Figure RE-GDA0003208354950000261
[表5]
Figure RE-GDA0003208354950000262
如表5所示,选择了26号菌落(以下称为“#26菌株”),选择的主要原因是与乳酸产生能力的增加相比,它是其中副产物的增加最小化的菌落。当仅考虑乳酸产生能力和乳酸产量而不考虑副产物的增加时,3号菌落显示出更好的结果,但是在第一轮中首先考虑整体性能。在随后的适应性进化中也观察到了这些副产物的增加趋势。
实施例2:耐酸酵母YBC菌株的适应性进化#2
在实施例1的第一轮适应性进化中选择的#26菌株与YBC5相比性能有所提高(参见表4中与YBC5的比较结果),但是没有达到适合商业化的性能,因此进行了进一步的改进操作。
由于#26菌株具有增强的对乳酸浓度的耐受性,所以在高乳酸浓度下开始培养,并且在每个浓度下的传代培养次数增加。为了提高极端条件下的生长特性,在不添加中和剂CaCO3的情况下进行培养。
使用的乳酸如下制备:用0.2μm过滤器从实际发酵产生的培养基中除去杂质,然后进行浓缩以制备40-50%的溶液,然后根据所需的乳酸浓度与YP培养基(20g/L蛋白胨,10g/L酵母提取物)混合。糖浓度为10%,以新鲜培养基总体积的10%的量接种传代培养基,并进行培养。
图1(a)示出了#26菌株的传代培养结果。即使在60g/L的乳酸浓度下,也观察到菌株菌群的平稳生长。在用第23天的培养基稀释的YP培养基中培养菌株菌群,然后分离出12个菌落。菌落的选择基于尺寸进行。
在与实施例1相同的条件下,对选择的菌落进行烧瓶培养。此时的参考性能是#26菌株的性能。通过类似于实施例1的选择过程选择5号菌落,并将其命名为#26-5菌株以区别于第一轮结果。#26-5的培养结果如图2所示。
为了与菌株#26-5相比进一步增加耐受性,进行了第三轮适应性进化。本文使用的目标菌株是#26-5菌株和第二轮培养的菌株菌群。在第三轮开始时比较两个烧瓶中的生长。结果,从第二轮开始连续生长的菌株菌群的生长更加优异。由于在菌株菌群中存在比所选菌株#26-5具有更强耐受性的突变体的可能性很高,使用菌株#26-5作为起始培养菌株的适应性进化被停止。第三轮的结果如图1(b)所示。乳酸浓度增加到80g/L,观察到细胞的生长,但是观察到生长速率与60g/L的乳酸浓度相比显著降低。因此,在70g/L的降低的乳酸浓度下进行传代培养,其提供了平稳的生长速率。第三轮完成后,将相应的菌株菌群置于琼脂平板上,然后选择菌落。此时,制备含有浓度为45g/L的乳酸的YPDU琼脂平板和乳酸浓度分别为50g/L和60g/L的平板,并在其上接种菌株菌群,在含有乳酸的琼脂平板上也产生菌落。
尽管与#26-5相比,在附加的第3轮中产生的菌落具有比#26-5更高的乳酸耐受性,并且因此在高乳酸浓度下能够生长更多,但是发酵产物中乳酸的比例进一步降低,并且作为副产物的乙醇和甘油的产量进一步增加。这里,应选择#26-5菌株和第三轮分离的菌株中的任何一个作为商业化的目标菌株,其将在未来进一步开发。有必要做出决定,选择在中长期内对具有高乳酸耐受性的菌株进行进一步开发(关于恢复乳酸产生能力和副产物减少幅度高的研究),或者选择具有相对较低的乳酸耐受性但仍具有较好的乳酸耐受性的#26-5。在本发明中,对#26-5进行了进一步的研究,预期将在更短的时间内得到开发。
实施例3:适应性进化前后基因表达的比较
在这个实施例中,基于qPCR观察到由于适应性进化引起的YBC5 和#26-5基因表达的变化。从每种YPDU培养基中在30℃和200rpm培养24小时的样品中提取总RNA,通过NGS分析RNA,然后分析相同基因表达水平的变化,如表6所示。
[表6]
YBC5和#26-5菌株基因表达的变化及相应基因,A类:产物相关, B类:锌指蛋白,C类:硫酸盐/亚硫酸盐相关蛋白,D类:应激反应,E 类:己糖转运蛋白
Figure RE-GDA0003208354950000281
Figure RE-GDA0003208354950000291
Figure RE-GDA0003208354950000301
Figure RE-GDA0003208354950000311
*进化菌株/原始菌株倍数变化-表示表达减少,进化菌株/原始菌株倍数变化+表示表达增加
通过适应性进化,各种基因的表达水平在所选菌落的菌株中增加或减少,但最显著的差异是LpLDH(源自植物乳杆菌的LDH)的减弱部分,其负责乳酸的产生。已经清楚地表明,LDH增强研究是必要的。然而,作为本发明人仅选择快速产生乳酸的菌株的结果,尽管LDH减弱,发现在菌株#26-5中,将糖从外部转运到细胞中的几个转运蛋白基因的表达增强。这被认为是获得较快的乳酸产生能力的主要原因,尽管LpLDH减弱。
重组耐酸菌株#26-5于2020年6月15日保藏于KCTC(登记号KCTC 14215BP)。
实施例4:使用选定的适应性进化菌株的发酵罐操作
在该实施例中,在生物反应器中培养在实施例3中作为适应性进化菌株选择的#26-5,并测定其乳酸发酵性能。
将#26-5菌株分别在40ml mYP培养基(10g/L蛋白胨、5g/L酵母提取物、5g/LKH2PO4、2g/L MgSO4·7H2O、0.3g/L尿嘧啶)中进行初次接种,并在380ml mYP培养基中进行二次接种,在30℃下,以200rpm培养2 天,并收获所有细胞。将细胞接种在1.18L mYP培养基中,然后在30℃下培养。此时,相对于包括额外的糖溶液和CaCO3溶液的1.7L的体积,调节mYP培养基中每种组分的浓度。在接种OD为1.73时开始培养,将 100ml 42.33%的CaCO3与450ml 62.5%的糖溶液在单独的进料瓶中混合,并将糖和CaCO3的混合物注入生物反应器中,同时用磁力搅拌器以400 rpm的速度在瓶中连续混合,使得CaCO3在溶液中均匀。除了这种注射 CaCO3与糖溶液的混合物的方法之外,在一些发酵中,可以与糖溶液分开,每2小时以预定量(5-10ml)注射一次CaCO3。然而,考虑到当尽可能均匀地注入少量时,可以最小化由于CaCO3的引入而导致的CO2浓度的增加,从而提高发酵性能这一事实,注入CaCO3与糖溶液的混合物。在商业发酵中,可以不与水混合而直接注入CaCO3,因此可以避免由于额外的水而导致的乳酸浓度的降低。然而,在实验室规模中,灭菌的CaCO3作为溶液相被注入。此外,在整个发酵过程中,可以以糖和CaCO3的始终如一的混合比例注入CaCO3。在某些情况下,大部分CaCO3在接种后 24小时内添加,此时初始菌株生长活跃,此后仅注入糖溶液(基于发酵ID 60)。糖和CaCO3混合物的注入速率和通气速率在各批次中不同,但是基于表7中的发酵ID F60,混合溶液的注入速率在最初2小时期间为13.5ml/ 小时,此后为15.3ml/小时。基于F60,在细胞生长阶段以0.7lpm的通气速率和700rpm的搅拌速率进行培养。20小时后,由此逐渐改变培养条件至通气速率为0.35lpm,搅拌速率为600rpm。
使用菌株#26-5的发酵培养结果示于图3中。
#26-5菌株发酵的结果示出,乳酸的产生速率为2.54g/L/小时,产量为0.67g/g,乳酸浓度为123g/L。即,乳酸的产生速率和浓度是优异的,但是与YBC5相比存在产量降低的问题。该问题是由发酵过程中各为7g/L 的乙醇和甘油的产生造成的。该问题是由于作为适应性进化(其具有增加发酵速率和增加对乳酸的耐受性的效果)的副作用发生的LDH的减弱和产乙醇基因的表达。抵消这些副作用的方法是增强LDH并除去相应的产乙醇基因,其中增强LDH的方法在实施例5至实施例7中进行。图3中所示出的糖浓度是在以补料分批方式注入糖和CaCO3的混合物的过程中反应器中的糖浓度。可以以分批方式进行商业发酵,其中在初始阶段注入全部量的糖,而单独注入CaCO3。以同样的方式,以补料分批或半补料分批的方式操作和优化发酵是可能的,其中在发酵过程中注入一部分糖作为与补料分批的适当组合。
图4示出了发酵期间菌株#26-5和菌株YBC5之间仅乳酸浓度的比较结果。与YBC5菌株相比,菌株#26-5显示出发酵速率和乳酸产生浓度的增加,这被认为是由于乳酸耐受性的增加造成的。然而,如上所述,YBC5 菌株的乳酸产量为0.81g/g至0.83g/g,而#26-5菌株的乳酸产量为0.63g/g 至0.72g/g,并且在pH 3附近的产量为0.67g/g至0.68g/g(参见表6中的 F59和F60)。
已经在上文描述了通气在耐酸发酵中的作用和重要性,通气应最大化至24小时,这是细胞生长期,并且此后保持在最小值。然而,即使在细胞生长阶段,过度通气也会导致乳酸产量降低(参见表7中的F57发酵)。 24小时后维持细胞活性的通气速率对产量的影响非常敏感,因此在从 0.35lpm的最佳通气速率降低至0.3lpm以下的情况下,基于2L培养,乳酸产生速率显著降低,或者在严重的情况下,乳酸产生停止,并且当通气速率为0.4lpm以上时,产量与通气速率成反比地降低。通气速率 (aeration rate)可以表示为氧转移速率或氧流入细胞的速率。然而,通气速率受反应器的结构、搅拌器的形状和分布器的空气排放形式的影响,因此当这些因素改变时,应该重新优化。对于那些具有微生物细胞培养相关知识的人来说,进行重新优化并不是很难。
各种条件变化的发酵结果如下表7所示。
[表7]
使用#26-5菌株优化培养条件(包括糖注入和通气)的结果
Figure RE-GDA0003208354950000341
实施例5:在YBC菌株的g-3002位点引入LDH的效果对比
在YBC菌株的PDC(g3002-1)位点处引入源自植物乳杆菌(SEQ ID NO:3)的LpLDH、或源自家牛的LDH(BtLDH)(SEQ ID NO:58)、或源自表皮葡萄球菌的LDH(SeLDH)(SEQ IDNO:1)的每种的2个拷贝。
构建该菌株的方法如下:
除去YBC菌株的主要PDC基因,即g3002-1基因,并将源自植物乳杆菌的SEQ ID NO:3的酵母密码子优化的LDH基因引入g3002-1的位点处以得到一株菌株。基于g3002-1及其UTR的信息,除去每个基因的 ORF,并在其位点处引入LpLDH,产生包含g3002-1的5’UTR和3’UTR 的基因盒,并用作供体DNA(参见图5中基因盒的例子)。此外,g3002-1 基因是在YBC菌株的基因组测序中位于骨架72处的基因,并作为PDC 基因。为了简化基因替换过程,供体盒是针对一个等位基因产生的,不考虑等位基因变异,但可以针对每个等位基因产生。对于g3002-1的每个等位基因,相应的5’UTR显示在SEQ ID NO:59和SEQ ID NO:60中,3’ UTR显示在SEQ ID NO:61和SEQ ID NO:62中。如上所述,使用限制性酶的克隆方法和使用吉布森组装的方法可以用于生产供体DNA,但是可以合成和使用整个基因序列。该重组菌株被命名为“YBClp”。
为了验证遗传操作的正确执行,使用以下引物鉴定转化体,并且如果需要,通过基因组部分的测序鉴定正确的转化体。
同样,引入BtLDH基因的菌株被命名为“YBCbt”,引入SeLDH 基因的菌株被命名为“YBCse”,用于鉴定基因组的引物如下。
用于鉴定3002-1ORF的正向引物:GCAGGATATCAGTTGTTTG (SEQ ID NO:63)
用于鉴定3002-1ORF的反向引物:ATAGAGAAGCTGGAACAG (SEQ ID NO:64)
用于鉴定3002-1UTR的正向引物:GCAGGATATCAGTTGTTTG (SEQ ID NO:65)
用于鉴定3002-1UTR的反向引物:CAGAATCTTAGAAAGGAGG (SEQ ID NO:66)
用于鉴定LpLDH、BtLDH和SeLDH的引入的正向引物: GCAGGATATCAGTTGTTTG(SEQID NO:67)
用于鉴定LpLDH的引入的反向引物:AATACCTTGTTGAGCCATAG (SEQ ID NO:68)
用于鉴定BtLDH的引入的正向引物:ACCTTCTTGTTGTCTAGC (SEQ ID NO:69)
用于鉴定SeLDH的引入的反向引物:ATAACTCTTTCAGCTGGC (SEQ ID NO:70)
通过在30℃和150rpm下的50ml烧瓶培养物对转化体进行实验,鉴定其基因型。接种量为0.1OD,本文使用的培养基为YP培养基(20g/L 蛋白胨,10g/L酵母提取物),使用6%的葡萄糖并添加150mg/L尿嘧啶。
结果如表8所示。
[表8]
在g3002-1(PDC)位点处引入LDH的效果比较
Figure RE-GDA0003208354950000361
如表8所示,由于同一基因组中LDH的变化,乳酸产量有显著差异。特别地,在YBC的g4423基因组位点处取代的LpLDH显示出非常强的表达,对应于0.5g/g以上的产量,但在g3002-1位点处几乎不表达;并且通过将LDH的来源改变为SeLDH在g3002-1位点处获得的LDH活性与在g4423位点的情况中的LDH活性相当,这两个结果是迄今为止尚未报道的新现象。
实施例6:YBC1和YBC5菌株的g3002-1位点的SeLDH取代效果
为了验证在上述实施例中鉴定的g3002-1位点处的SeLDH的高活性,对YBC1菌株和#26-5菌株(来自YBC5)进行了相同的遗传操作。
目标菌株YBC1和YBC5的基因型如下:
YBC1:Δg4423::LpLDH
#26-5(来自YBC5):Δg4423::LpLDH,Δg3002-72::LpLDH, Δg2947::LpLDH,Δg1544
此处使用的盒(cassette)和方法类似于实施例5。在YBC5的情况下,目标位置的LpLDH应该用SeLDH取代,但是为了扩增两个LDH序列之间低相似性的部分,并基于此鉴定正确的转化体,引物改变如下。
用于鉴定LpLDH的存在的正向引物:GCAGGATATCAGTTGTTTG (SEQ ID NO:71)
用于鉴定LpLDH的存在的反向引物:TTTCAAACCAGTACCACCA (SEQ ID NO:72)
用于鉴定SeLDH的存在的正向引物1:GCAGGATATCAGTTGTTTG (SEQ ID NO:73)
用于鉴定SeLDH的存在的反向引物1:GAAGAAGAA TACAAAGCACC(SEQ ID NO:74)
用于鉴定SeLDH的存在的正向引物2:GCAGGATATCAGTTGTTTG (SEQ ID NO:75)
用于鉴定SeLDH的存在的反向引物2:CACCAGCTTTAACAGTAAC (SEQ ID NO:76)
在YBC1菌株的g3002位置处引入SeLDH构建的菌株命名为“YBC2se”,在YBC5菌株的g3002位置处引入SeLDH构建的菌株命名为“YBC6”,它们的基因型如下。
YBC2se:Δg4423::LpLDH,Δg3002-72::SeLDH
YBC6:Δg4423::LpLDH,Δg3002-72::SeLDH,Δg2947::LpLDH, Δg1544
对转化体进行实验,通过在30℃和150rpm下的50ml烧瓶培养鉴定其基因型。此时,接种OD为0.1,此处使用的培养基为YP培养基(20g/L 蛋白胨,10g/L酵母提取物),YBC2和YBC2se使用5%的葡萄糖,YBC5 和YBC6使用10%的葡萄糖,并加入150mg/L尿嘧啶。
结果示于下表9和表10。
[表9]
YBC1中SeLDH的效果鉴定(w/o pH控制)
Figure RE-GDA0003208354950000371
Figure RE-GDA0003208354950000381
[表10]
YBC5中SeLDH的效果鉴定(w/o pH控制)
ID 产量(yield)(g/g)
#26-5 0.55
YBC6 0.76
如表9和表10所示,与相同位置处取代有LpLDH的YBC2相比,位于YBC1菌株的PDC基因处的YBC2se在类似条件下表现出高产量。考虑到在耐酸条件下将乳酸运输到细胞外所需的ATP产量的降低,除了 PDC阻断之外,SeLDH的强LDH表达导致乳酸产生与乙醇产生相比有很大提高,导致产量基本上类似于理论产量。此外,通过常规适应性进化而被赋予提高的生产率但降低的产量的菌株#26-5,在相同条件下也表现出产量的大幅增加。这意味着通过适应性进化降低的LDH的活性通过在g3002-1位点处强表达的SeLDH而大大提高。
实施例7:使用YBC6菌株的发酵罐操作
在该实施例中,在生物反应器中培养YBC6菌株,并测定其乳酸发酵性能。
将YBC6菌株分别在40ml mYP培养基(10g/L蛋白胨、5g/L酵母提取物、5g/L KH2PO4、2g/L MgSO4·7H2O、0.3g/L尿嘧啶)中进行初次接种,并在380ml mYP培养基中进行二次接种,在30℃下,以200rpm培养2 天,并收获所有细胞。将细胞接种在1.18L mYP培养基中,然后在30℃下培养。此时,相对于包括额外的糖溶液和CaCO3溶液的1.7L的体积,调节mYP培养基中每种组分的浓度。在接种OD为1.74时开始培养,将 100ml 42.33%的CaCO3与450ml62.5%的糖溶液在单独的进料瓶中混合,并将糖和CaCO3的混合物注入生物反应器中,同时用磁力搅拌器以400 rpm的速度在瓶中连续混合,使得CaCO3在溶液中均匀。然而,考虑到当尽可能均匀地注入时,可以最小化由于CaCO3的引入而导致的CO2浓度的增加(氧转移的阻碍),从而提高发酵性能这一事实,注入CaCO3与糖溶液的混合物。在商业发酵中,可以不与水混合而直接注入CaCO3。然而,在实验室规模中,灭菌的CaCO3作为溶液相被注入。此外,在整个发酵过程中,可以以糖和CaCO3的均匀混合比例注入CaCO3。然而,在该发酵中,大部分CaCO3在接种后24小时内添加,此时初始菌株生长活跃,此后仅注入糖溶液。在最初的8小时期间,糖和CaCO3混合物的注入速率从4.5ml/小时提高至18ml/小时,此后为22.5ml/小时。在细胞生长阶段,以0.5lpm的通气速率和600rpm的搅拌速率进行培养。12小时后,以0.35lpm的通气速率和600rpm的搅拌速率进行培养,并由此逐渐改变。33小时后,以0.4lpm的通气速率和600rpm的搅拌速率进行培养。
使用YBC6菌株的发酵培养结果示于图6中。
图6中示出的糖浓度是在以补料分批方式注入糖和CaCO3的混合物的过程中反应器中的糖浓度。可以以分批方式进行商业发酵,其中在初始阶段注入总糖量,而单独注入CaCO3。以同样的方式,可以在补料分批或半补料分批中操作和优化发酵,其中在发酵过程中注入一部分糖作为与补料分批的适当组合。
培养结果示出pH 3.16时乳酸产量为0.75g/g、发酵速率为2.56g/L/ 小时、乳酸浓度为130g/L,这是迄今为止发表的耐酸菌株性能中的最佳结果。美国嘉吉公司关于耐酸菌株培养的专利(美国专利第7,232,664号) 建议将0.75g/g的产量、2.5g产物/L/小时的发酵速率和120g/L的乳酸浓度作为耐酸乳酸菌株商业化性能的标准。该实施例的结果在所有指标上都达到了上述标准。美国第7,232,664号专利的实施例公开了0.67g/g的总产量,0.8g乳酸/g细胞/小时的平均发酵速率和114g/L的浓度。本发明的发酵结果示出优于该实施例的产量和性能。
图7示出了YBC5菌株和#26-5菌株以及YBC6菌株之间仅乳酸生产能力的比较结果,并且证明了YBC6的乳酸产生的优异性,这是由于适应性进化与基因组中PDC位置处的LDH增强的组合效应。
保藏机构名称:韩国典型培养物保藏中心
保藏编号:KCTC14215BP
保藏日期:2020年6月15日
地址:韩国生物科学与生物技术研究所(KRIBB)韩国全罗北道井邑市 Ipsin街181,56212
尽管已经详细描述了本发明的具体配置,但是本领域技术人员将理解,提供该描述是出于说明性目的以提出优选实施方案,并不应解释为限制本发明的范围。因此,本发明的实质范围由所附权利要求及其等同物来限定。
序列表
<110> SK新技术株式会社
<120> 具有提高的乳酸产生能力的重组耐酸酵母
<130> KHP212110806.1
<150> 10-2020-0077331
<151> 2020-06-24
<160> 76
<170> PatentIn 3.5版
<210> 1
<211> 948
<212> DNA
<213> 表皮葡萄球菌(Staphylococcus epidermidis)
<400> 1
atgaaaaaat ttggtaaaaa agttgttttg gttggtgatg gttctgttgg ttcttcttat 60
gcttttgcta tggttactca aggtattgct gatgaatttg ttattattga tattgctaaa 120
gataaagttg aagctgatgt taaagatttg aatcatggtg ctttgtattc ttcttctcca 180
gttactgtta aagctggtga atatgaagat tgtaaagatg ctgatttggt tgttattact 240
gctggtgctc cacaaaaacc aggtgaaact agattgcaat tggttgaaaa aaatactaaa 300
attatgaaat ctattgttac ttctgttatg gattctggtt ttgatggttt ttttttgatt 360
gctgctaatc cagttgatat tttgactaga tatgttaaag aagttactgg tttgccagct 420
gaaagagtta ttggttctgg tactgttttg gattctgcta gatttagata tttgatttct 480
aaagaattgg gtgttacttc ttcttctgtt catgcttcta ttattggtga acatggtgat 540
tctgaattgg ctgtttggtc tcaagctaat gttggtggta tttctgttta tgatactttg 600
aaagaagaaa ctggttctga tgctaaagct aatgaaattt atattaatac tagagatgct 660
gcttatgata ttattcaagc taaaggttct acttattatg gtattgcttt ggctttgttg 720
agaatttcta aagctttgtt gaataatgaa aattctattt tgactgtttc ttctcaattg 780
aatggtcaat atggttttaa tgatgtttat ttgggtttgc caactttgat taatcaaaat 840
ggtgctgtta aaatttatga aactccattg aatgataatg aattgcaatt gttggaaaaa 900
tctgttaaaa ctttggaaga tacttatgat tctattaaac atttggtt 948
<210> 2
<211> 316
<212> PRT
<213> 表皮葡萄球菌(Staphylococcus epidermidis)
<400> 2
Met Lys Lys Phe Gly Lys Lys Val Val Leu Val Gly Asp Gly Ser Val
1 5 10 15
Gly Ser Ser Tyr Ala Phe Ala Met Val Thr Gln Gly Ile Ala Asp Glu
20 25 30
Phe Val Ile Ile Asp Ile Ala Lys Asp Lys Val Glu Ala Asp Val Lys
35 40 45
Asp Leu Asn His Gly Ala Leu Tyr Ser Ser Ser Pro Val Thr Val Lys
50 55 60
Ala Gly Glu Tyr Glu Asp Cys Lys Asp Ala Asp Leu Val Val Ile Thr
65 70 75 80
Ala Gly Ala Pro Gln Lys Pro Gly Glu Thr Arg Leu Gln Leu Val Glu
85 90 95
Lys Asn Thr Lys Ile Met Lys Ser Ile Val Thr Ser Val Met Asp Ser
100 105 110
Gly Phe Asp Gly Phe Phe Leu Ile Ala Ala Asn Pro Val Asp Ile Leu
115 120 125
Thr Arg Tyr Val Lys Glu Val Thr Gly Leu Pro Ala Glu Arg Val Ile
130 135 140
Gly Ser Gly Thr Val Leu Asp Ser Ala Arg Phe Arg Tyr Leu Ile Ser
145 150 155 160
Lys Glu Leu Gly Val Thr Ser Ser Ser Val His Ala Ser Ile Ile Gly
165 170 175
Glu His Gly Asp Ser Glu Leu Ala Val Trp Ser Gln Ala Asn Val Gly
180 185 190
Gly Ile Ser Val Tyr Asp Thr Leu Lys Glu Glu Thr Gly Ser Asp Ala
195 200 205
Lys Ala Asn Glu Ile Tyr Ile Asn Thr Arg Asp Ala Ala Tyr Asp Ile
210 215 220
Ile Gln Ala Lys Gly Ser Thr Tyr Tyr Gly Ile Ala Leu Ala Leu Leu
225 230 235 240
Arg Ile Ser Lys Ala Leu Leu Asn Asn Glu Asn Ser Ile Leu Thr Val
245 250 255
Ser Ser Gln Leu Asn Gly Gln Tyr Gly Phe Asn Asp Val Tyr Leu Gly
260 265 270
Leu Pro Thr Leu Ile Asn Gln Asn Gly Ala Val Lys Ile Tyr Glu Thr
275 280 285
Pro Leu Asn Asp Asn Glu Leu Gln Leu Leu Glu Lys Ser Val Lys Thr
290 295 300
Leu Glu Asp Thr Tyr Asp Ser Ile Lys His Leu Val
305 310 315
<210> 3
<211> 963
<212> DNA
<213> 植物乳杆菌(Lactobacillus plantarum)
<400> 3
atgtcttcta tgccaaatca tcaaaaagtt gttttggttg gtgatggtgc tgttggttct 60
tcttatgctt ttgctatggc tcaacaaggt attgctgaag aatttgttat tgttgatgtt 120
gttaaagata gaactaaagg tgatgctttg gatttggaag atgctcaagc ttttactgct 180
ccaaaaaaaa tttattctgg tgaatattct gattgtaaag atgctgattt ggttgttatt 240
actgctggtg ctccacaaaa accaggtgaa tctagattgg atttggttaa taaaaatttg 300
aatattttgt cttctattgt taaaccagtt gttgattctg gttttgatgg tatttttttg 360
gttgctgcta atccagttga tattttgact tatgctactt ggaaattttc tggttttcca 420
aaagaaagag ttattggttc tggtacttct ttggattctt ctagattgag agttgctttg 480
ggtaaacaat ttaatgttga tccaagatct gttgatgctt atattatggg tgaacatggt 540
gattctgaat ttgctgctta ttctactgct actattggta ctagaccagt tagagatgtt 600
gctaaagaac aaggtgtttc tgatgatgat ttggctaaat tggaagatgg tgttagaaat 660
aaagcttatg atattattaa tttgaaaggt gctacttttt atggtattgg tactgctttg 720
atgagaattt ctaaagctat tttgagagat gaaaatgctg ttttgccagt tggtgcttat 780
atggatggtc aatatggttt gaatgatatt tatattggta ctccagctat tattggtggt 840
actggtttga aacaaattat tgaatctcca ttgtctgctg atgaattgaa aaaaatgcaa 900
gattctgctg ctactttgaa aaaagttttg aatgatggtt tggctgaatt ggaaaataaa 960
taa 963
<210> 4
<211> 988
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g4423 等位基因1的5' UTR
<400> 4
gttaactcag ttttctctct ttccctccac cccacgttac tctgcgaaca aaaatacgca 60
cagaatgaac atctgattga ttaatattta tatattactt agtggcaccc ctacaaacaa 120
accaattttg aatatttctc accatcatga tatttattta gggcaagaat ttcatgtaca 180
tacgtgcgtg tactgcatag ttttgttata tgtaaataac cagcaatata tcaccaatga 240
taaatgctca gtaatttatt tggaaccaaa atagtttcag taatcaaata atacaataac 300
taacaagtgc tgattataca acagctgtta acaacacaaa cacgctctct tctattctct 360
tccctgcttg ttcgtgtggt atattcccga atttgcaatt tagaaattat attttttaaa 420
agaattgttc tccattttct ggtagtcgta agtggcaaat tggatcataa gacacaatct 480
tgttagttcg actgctaaca ccagacaaga ccgaacgaaa acagaaaaaa aagataattt 540
tgttattctg ttcaattctc tctctctttt taaggtatct ttacattaca ttacatatcc 600
caaattacaa caagagcaag aaatgaagca caacaacacg ccatctttcg tgattatttt 660
atcatttcta tatcgtaact aaattaacaa atgctatgtt tcttaatttt taatgataaa 720
tctaactgct accttaattt ctcatggaaa gtggcaaata cagaaattat atattcttat 780
tcattttctt ataattttta tcaattacca aatatatata aatgcaatta attgattgtt 840
cctgtcacat aatttttttt gtttgttacc tttattcttt atccatttag tttagttctt 900
atatctttct tttctatttc tctttttcgt ttaatctcac cgtacacata tatatccata 960
tatcaataca aataaaaatc atttaaaa 988
<210> 5
<211> 961
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g4423 等位基因2的5' UTR
<400> 5
gttaactcag ttttctctct ttccctccac cccacgttac tctgcgaaca aaaaatacgc 60
acagaatgaa catctgattg attaatattt atatattact cagtggcacc cctacaaaca 120
aaccaatttt gaatattgtt caccatcatg atatttattt agggcaagaa tttcatgtac 180
atacgtgcgt gtactgcata gttttgttat atgaaaataa ccagcaatat atcaccaatg 240
aataaattct caataattta tttggaacca aataatgcaa taactagcaa actaagtggt 300
gattatacaa cagctgttaa caacacaaac atacgctctc ttctattatc tcttccctgc 360
ttgttcgtgt ggtatattca cgaatttgca atttagaaat tatatttttt aaaagaattg 420
ttctccattt tctggtagtc gtaagtggca aattggatca taagacacaa tcttgttagt 480
tcgactgcta acaccagaca acaccgaacg aaaacaagaa aaaataatta ttctctctct 540
ttttaaggta tcttacatta catatcccaa attacaacaa gagcaagaaa tgaggcacaa 600
caacacacca tcatctttcg tgattatttt tatcatttct atcatgtaat taaattaaca 660
aatgttaagt ttattaattt ttaatgataa atctagttgc taccttaatt tctcatggaa 720
agtggcaaat actgaaatta tttaattcta ctttcatttt cttataattt ttatcaatta 780
ccaaatatat ataaatgcaa ttaattgatt gttcctgtca cataattttt tttgtttgtt 840
acctttattc tttatccatt taatttattt cttgtatctt tcttttctat ttctcttttc 900
tgtttaatct caccgtacac atatatatcc atatatcaat acaaataaaa atcatttaaa 960
a 961
<210> 6
<211> 257
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g4423 等位基因1的3' UTR
<400> 6
taagtcattt aatttattct tttagaatat atttattttg tctttatttt tgaaatgtta 60
atagtctttt ttttttactt tgaacaaaaa aaagtaaaat taaaacttat cttatatacg 120
cttttaaaca ttaaactcgt taacgaatta tataatgatt ttatcgaact actttatgtt 180
tttttaatag aataatcttc tttattaata taacttacta cttcttaatc ttgttgtcct 240
ccattcgaaa ctcgagt 257
<210> 7
<211> 255
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g4423 等位基因2的3' UTR
<400> 7
taagtcattt aatttattct tttagaatat atttattttg tctttatttt tgaaatgtta 60
atagtctttt ttttactttg aaaaaaaaaa aaagtaaaat taaacttatc ttatatacgc 120
ttttaaacat taaactcgtt aacgaattat ataatgattt tatcgaacta ctttatgttt 180
ttttaataga ataatcttct ttattaatat aacttactac ttcttaatct tgttgtcctc 240
cattcgaaac tcgag 255
<210> 8
<211> 19
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g3002-1 UTR-LDH-正向引物
<400> 8
gcaggatatc agttgtttg 19
<210> 9
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g3002-1 UTR-LDH-反向 引物
<400> 9
aataccttgt tgagccatag 20
<210> 10
<211> 375
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g2947 等位基因1的5'UTR
<400> 10
atatattttg gctgacattg taattagatg agatccacaa tttttctttt gtttgactgt 60
tcgatatgga gaaggtggga tgcactatta ttatattcag aagtttattt gtacagttta 120
aagaacaaat agtggctaat cctatcctcg gactaaaaaa aatcgttcac ttctatccta 180
ctgtaaatct tatgaaaatg atgtaattca tatagttact atattttctt tcttttagaa 240
actttatgat atatatatat atataaaagg actaatcacc caactctcaa attcattaaa 300
aagaaatatg tttctatcat cttcttttct tattatacct cgtctaataa taaaaccaaa 360
caattttctg taaag 375
<210> 11
<211> 375
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g2947 等位基因2的5'UTR
<400> 11
atatattttg gctgacattg taattagatg agatccacaa tttttctttt gtttgactgt 60
tcgatatgga gaaggtggga tgcactatta ttatattcag aagtttattt gtacagcttg 120
aagaacaaat agtggctaat cctatcctcg gactaaaaaa aattgttcac ttttatccta 180
ctgtaaatct tatgaaaatg atgtaattca tatagttact atattttctt tcttttagaa 240
acttcatgat atatatatat atataaaagg actaatcacc caactctcaa atttattaaa 300
aagaaatatg tttctatcat cttcttttct tattatacct tctctaataa taaaaataaa 360
caactttctg taaag 375
<210> 12
<211> 997
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g2947 等位基因1的3'UTR
<400> 12
ttgtgactct atggagttta cctattttat ataccactat atcacaaaaa gtaataacaa 60
cttttcaaat ataatacaat attcaataaa tatatttata tattctaaaa tctacgtttt 120
tctctttctt aaaaaaataa acaaactgac cctttcaatc ttcaatgtga tactttactt 180
attttatttc attacacaga aaggtataaa tatatacata acttaatggt ttattcattt 240
cttcttatta gacaacgtgg ttagttgttg tttaacccat tccaataata aatcagtttg 300
taaataacct tcactgttaa atactttatt aatctctaat gaactagtta aagttttctt 360
cttattatct atcaaagtca tattgtaaat tggtttattt tcttcaaatt ctgtctttaa 420
tttaattatt tcagtaccat tcttaccact atatacgata gatttttcaa catatttctt 480
aaagaaccaa aatattacag atagtacaaa atatgtaccg actaaaattt gttgatattt 540
aacgatatta tcatgaacaa attttttatc aatgatgaaa ctgattgctg caacgatggc 600
agttgaataa ccaattaata atttctgatc aactaattca aaggtttctt catagcctaa 660
tcttttcatg acatcaggta gactttcatt tatagtttgt gatacttcag agatggaata 720
aacgttaacg ggcttactca ttgtgcttta aaggagaatg cggaattaat gagctcttta 780
ctatgtatca gaactcgaac taatgcaaag acaaatggaa taaactagtt acaatatata 840
tgaattttgt ctgttctttt ataatatatt ataatggatt tcccaaattg atgattattg 900
gttcactaag aaagctagaa agaagatgag atttctcgaa tagtaaaata ttacgttaac 960
atatctgaga ttaaaccgat agtcaatttg tacgtta 997
<210> 13
<211> 997
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g2947 等位基因2的3' UTR
<400> 13
ttgtgactct atggagttta cctattttat ataccactgt atcacaaaaa gtaataacaa 60
cttctcaaat ataatacaat atttaataaa tatatttata tattctaaaa tctacgtttt 120
tctctttctt aaaaaaataa acaaactgac cctttcaatc ttcaatgtga tactttactt 180
attttatttc attacacaga aaggtataaa tatatacata acttaatggt ttattcattt 240
cttcttatta gacagagtgg ttagttgttg tttaacccat tccaataata aatcagtttg 300
taaataacct tcactgttaa atactttatt aatctctaat gaactagtta aagttttctt 360
cttattatct atcaaagtca tattgtaaat tggtttattt tcttcaaatt ctgtctttaa 420
tttaattatt tcagtaccat tcttaccact atatacgata gatttttcaa catatttctt 480
aaagaaccaa aatattacag atagtacaaa atatgtaccg actaaaattt gttgatattt 540
aacgatatta tcatgaacaa attttttatc aatgatgaaa ctgattgctg caacgatggc 600
agttgaataa ccaattaata atttctgatc aactaattca aaggtttctt cataacctaa 660
tcttttcata acatcaggta gactttcatt tatagtttgt gatacttcag agatggaata 720
aacgttaaca ggtttactca ttgtgcttta aaggagaatg cggaattaat gagctcttta 780
ctatgtatca gaactcgaac taatgcaaag aaaaatggaa taaacttgtt acaatatgta 840
tgaattttgt ctattctttt ataataaatt ataatagatt tcccaaattg atgattattg 900
gttcactaag aaagctagaa agaagatgag atttctcgaa tagtaaaata ttaccttaac 960
atatctgaga ttaaaccgat agtcaatttg tacgtta 997
<210> 14
<211> 1328
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g1544 等位基因1的5' UTR
<400> 14
agaaaatagt ttctccgatt aaattttttt ttcaaatcaa atctttattt aagaattggt 60
agtgtatagt agtataatat tgcctaagaa attggagtag tccgtaaaaa atgggacaaa 120
attgttgaaa ttgagcaacc tgaaaatttt atgctggtct caagtagaga aacagacgta 180
gaaccaaaat tgacccaatt tcttgttgcc tttaattggg tcattcataa gaattcaaaa 240
tattttcttt tcccactcac gcgagagata tgcgcacacg atatagttaa taccgcttgt 300
aacaatacgt agatggccaa aaatgaacaa aaggggacac tcctcaaaag aaaaaattgc 360
ttgtttggct gtcttctcca attgaaatat acacacacac cgcggtaaaa aaaaaattga 420
aatggaaatc gcggtgggac aaaagtagca accacaacaa gggaattttc cttactgctg 480
cggcagatcc ttactcatct ctcgaatata tatagcctct tgggtccacg ggcaaaaaag 540
aaataaaaaa aagagaagca acagaaccgc acgcaacgta cgcagtgatc catccatttt 600
ccacaaaatt tatctatttt cttgtctata ttttttacgt acaactaact gatcttcttg 660
tccccctccc cccatttacc cgttaaaatg aaagctgaac aacagaaaat aataattcgc 720
tctggtggac aaaaaataca agaacaagag agtatcataa ttatgtgggt cacaaatgac 780
cctacaactg tcacctagtt ggtacaaaat ttgaccctca ttctcaaata attactacat 840
ttgggtctgt attaatgcta atatttcaat atatctctat ctatcagtca catacaaatt 900
tatcttcatc ttaaagggac tcacttactc aataatggtc tatctttata tttttttcat 960
acgtatgtat gtacgtagta aagggccatc aatgatccat cttactatta ttattcttta 1020
gttatttcta agcaacaaaa ggtctgtacc acagtttcag tgtcgtcata cctcttcttt 1080
taatttcttt tcggggaggg atgtcttaat gctaacttct gtctcactat taacggtaaa 1140
tcgtattaat ctcaatatat atataaaggg ttgatatttt ccaccgtttt aaaaattatt 1200
cccttgtttc tctattatta attttagact acttatttta attatttttc ccttttttac 1260
ttattatata tatataacta tatattacca ataataatat aagcaatcac atatatttat 1320
cccattaa 1328
<210> 15
<211> 1328
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g1544 等位基因2的5'UTR
<400> 15
agaaaatagt ttctccgatt aaattttttt ttcaaatcaa atctttattt aagaattggt 60
agtgtatagt agtataatat tgcctaagaa attggagtag tccgtaaaaa atgggacaaa 120
attgttgaaa ttgagcaacc tgaaaatttt atgctggtca caagtagaga aataggcgta 180
gaaccaaaat tgacccaatt tcttgttgcc tttaattggg tcattcataa gaattcaaaa 240
tattttcttt tcccactcac gcgagagata tgcgcacacg atataattaa taccgtttgt 300
aacaatacgt agatggccaa aaatgaacaa aatgggacac tcctcaaaag gaaaaattgc 360
ttgtttggct gtcttctcca attgaaatat acacacacac cgcggtaaaa aaaaaattga 420
aattgaaatc gcggtgggac aaaagtagca accacaacaa gggaattttc cttactgctg 480
cggcagatcc ttactcatct cttgaatata tatagcctct tgggtccacg ggcaaaaaag 540
aaaaaaaaaa aagagaagca acagaaccgc acacaacgta cgcagtgatc catccatttt 600
ccacaaaatt tatttatttt cttgtctgta ttatttacgt acaactaact gatcttcttg 660
tccccccccc cccatttacc cgttaaaatg aaagctgaac aacagaaaat aataattcgc 720
tctgatggac aaaaaataca agaacaagag agtatcatca ctatgtgggt cacaaatgac 780
cctacaactg taatctagtt gatacaaaat ttgaccctca ttctcaaata attactacat 840
ttgggtctgt attaatacta atatctgtat atctctctat ctatcagtca catacaaatt 900
tatcttcatc ttaaagggac tcacttactc aataatggtc tatctttata tttttatcat 960
acgtatgtat gtacgtagta aagggccatc aatgatccat attattatta ttattcttta 1020
gttatttcta agcaacaaaa ggtctgtacc acagtttcag tgtcgtcata tctcttattt 1080
taatttcttt tcggggaggg atgtcttaat gctaacttct gtctcactat taacggtaaa 1140
tcttattaat ctcaatatat atataaaggg ttgatatttt ccaacgtttt aaaacttatt 1200
cccttgtttc tatattacta atttaacatt acttatttta attatttttc ccttttttac 1260
ttattatata tatataagta catattacca ataataatat aagcaatcac atatatttat 1320
cccattaa 1328
<210> 16
<211> 402
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g1544 等位基因1的3'UTR
<400> 16
tccatcatca agaatatata tatataataa agccatccct tttacgaacc tgcctgcatt 60
tgcttaagac cgagcaaaaa aaataaatta caacataacg aaaaaaacaa acaaacttaa 120
gggggagaaa aaaaaataat atcccataac ttacatacac aacatacata aaattaaaaa 180
aataaacatt ttatcaataa ttttttttta aagtatatag agctactaat attatagaaa 240
tacagacgca acttaaagaa ctttgttcaa tcttttcaat cttctcagtc ttttctagtc 300
ataataaatt atcaaatgcg aatatttaaa tcaaaattat ataaggggta tatcgtatat 360
atataaattt atcaaatgtg tatatgtatt ttattatgtt ta 402
<210> 17
<211> 402
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g1544 等位基因2的3'UTR
<400> 17
tccatcatca aaaatatata tatataataa agccatccct tttacgaacc tgcctgcatt 60
tgcttaagac cgagcaaaaa aaataaatta caatataacg aaaaaaacaa acaaacttaa 120
gggggagaaa aaaaaataat atcccataac ttacatacac aacatacata aaattaaaaa 180
aataaacatt ttatcaataa ttttttttta aagtatatat agctactaat attatagaaa 240
tacaaatgca acttaaagaa ctttgttcaa tcttttcaat cttctcaatc ttttctagtc 300
ataataaatt atcaaatgcg aatatttaaa ttaaaattat ataaagggta tatcatatat 360
atataaattt atcaattgtg tatatgtatt ttattatgtt ta 402
<210> 18
<211> 4032
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> C2862_g1_i1
<400> 18
tctaatattt taatcttttg accaaatatg tttttgtcgc ctattgataa tagaaaaatg 60
taaccttcac aaacaaccct aataccaaga gaacgaaaga tagggtatat atatatcatg 120
aatgaatatc actaacaacg aaatataata ctcactttct cgaggcggcg tccatccata 180
caccgcatac ccattacaag aagccaagtc tgcctgcatt ttttttcttt ttcaataaag 240
aaaagaaaac cggggttttt gcctatttca attatagtta attctccgta gcttaatatc 300
atgttctctc gaaaatgtct tttgtttgca aatacctgca ataagtacaa ataatccggt 360
atgttgaaaa gaacaataaa aaataataag ggccaccgtt acactgtatg gccacacaca 420
ataccgtttg tggtatttcc cgcgtggaac aacaacaact gatttgtttc aaggttgctc 480
tccctccatt ttcacagaat ccaggttctt ggtgggtggc gtgttctggg attcctgtaa 540
tgacaacgcg agacaaagcc aaggagacag aaaggggacg gcttctcatc ccatcagtcg 600
cagcaaccgc ggcttcctct agcacgttcc acgcttttta tagtggttaa ctcagttttc 660
tctctttccc tccaccccac gttactctgc gaacaaaaaa tacgcacaga atgaacatct 720
gattgattaa tatttatata ttactcagtg gcacccctac aaacaaacca attttgaata 780
ttgttcacca tcatgatatt tatttagggc aagaatttca tgtacatacg tgcgtgtact 840
gcatagtttt gttatatgaa aataaccagc aatatatcac caatgaataa attctcaata 900
atttatttgg aaccaaataa tgcaataact agcaaactaa gtggtgatta tacaacagct 960
gttaacaaca caaacatacg ctctcttcta ttatctcttc cctgcttgtt cgtgtggtat 1020
attcacgaat ttgcaattta gaaattatat tttttaaaag aattgttctc cattttctgg 1080
tagtcgtaag tggcaaattg gatcataaga cacaatcttg ttagttcgac tgctaacacc 1140
agacaacacc gaacgaaaac aagaaaaaat aattattctc tctcttttta aggtatcttt 1200
acattacatt acatatccca aattacaaca agagcaagaa atgaagcaca acaacacgcc 1260
atctttcgtg attattttat catttctata tcgtaactaa attaacaaat gctatgtttc 1320
ttaattttta atgataaatc taactgctac cttaatttct catggaaagt ggcaaataca 1380
gaaattatat attcttattc attttcttat aatttttatc aattaccaaa tatatataaa 1440
tgcaattaat tgattgttcc tgtcacataa ttttttttgt ttgttacctt tattctttat 1500
ccatttagtt tagttcttat atctttcttt tctatttctc tttttcgttt aatctcaccg 1560
tacacatata tatccatata tcaatacaaa taaaaatcat ttaaaagggc ccaacaaaat 1620
gtcttctatg ccaaatcatc aaaaagttgt tttggttggt gatggtgctg ttggttcttc 1680
ttatgctttt gctatggctc aacaaggtat tgctgaagaa tttgttattg ttgatgttgt 1740
taaagataga actaaaggtg atgctttgga tttggaagat gctcaagctt ttactgctcc 1800
aaaaaaaatt tattctggtg aatattctga ttgtaaagat gctgatttgg ttgttattac 1860
tgctggtgct ccacaaaaac caggtgaatc tagattggat ttggttaata aaaatttgaa 1920
tattttgtct tctattgtta aaccagttgt tgattctggt tttgatggta tttttttggt 1980
tgctgctaat ccagttgata ttttgactta tgctacttgg aaattttctg gttttccaaa 2040
agaaagagtt attggttctg gtacttcttt ggattcttct agattgagag ttgctttggg 2100
taaacaattt aatgttgatc caagatctgt tgatgcttat attatgggtg aacatggtga 2160
ttctgaattt gctgcttatt ctactgctac tattggtact agaccagtta gagatgttgc 2220
taaagaacaa ggtgtttctg atgatgattt ggctaaattg gaagatggtg ttagaaataa 2280
agcttatgat attattaatt tgaaaggtgc tactttttat ggtattggta ctgctttgat 2340
gagaatttct aaagctattt tgagagatga aaatgctgtt ttgccagttg gtgcttatat 2400
ggatggtcaa tatggtttga atgatattta tattggtact ccagctatta ttggtggtac 2460
tggtttgaaa caaattattg aatctccatt gtctgctgat gaattgaaaa aaatgcaaga 2520
ttctgctgct actttgaaaa aagttttgaa tgatggtttg gctgaattgg aaaataaata 2580
agagctctac cgttcgtata atgtatgcta tacgaacggt agcgatcgct ttgtctttat 2640
ttttgaaatg ttaatagtct ttttttttta ctttgaacaa aaaaaagtaa aattaaaact 2700
tatcttatat acgcttttaa acattaaact cgttaacgaa ttatataatg attttatcga 2760
actactttat gtttttttaa tagaataatc ttctttatta atataactta ctacttctta 2820
atcttgttgt cctccattcg aaactcgaga ggaacaattt ctgagtctct ctcgcaccct 2880
ttcgtacgta ccgtttttcc aatttctttc gggaaacgga actggacgca ttttatttga 2940
ctgttgaaag ggagatttaa tatttatata gagagatata acaactaact tataagttta 3000
tacaggctgt tatcacatat atatatatat caacagagga ctagctcaat agaataacat 3060
tagatatgtc gatgctgaac cgtttgtttg gtgttagatc catttcacaa tgtgctactc 3120
gtttacaacg ttctacaggg acaaatatat cagaaggtcc actaagaatt attccacaat 3180
tacaaacttt ctattctgct aatccaatgc atgataacaa tatcgacaag ctagaaaatc 3240
ttctacgtaa atatatcaag ttaccaagta caaataactt attgaagaca catgggaata 3300
catctacaga aatcgatcca acaaaattat tacaatcaca aaattcttca cgtcctttat 3360
ggttatcatt caaggattat acagtgattg gaggtggttc acgtttaaaa cctactcaat 3420
acacagaact tttatttcta ttgaataaac tacatagtat cgatccacaa ttaatgaatg 3480
atgatattaa gaacgaatta gctcattatt ataagaatac ttcacaggaa actaataaag 3540
tcaccatccc taaattggat gaattcggta gaagtattgg aatcggtaga aggaaatccg 3600
caactgcaaa agtctatgta gttagaggtg agggccaagt tcttgtaaat aatagacaaa 3660
ttaacgacta ttttgtcaaa ttaaaggata gagaatctgt aatgtatcca ttacaagtaa 3720
tcaatgggat tgctaattat aatgtattta ttactacatc aggtggtggt tcaactggtc 3780
aagctgacgc cgcaggatta gctattggta aagctttaat tgcattcaat ccattgttaa 3840
agacaagact acatagagcc ggatgtttga ctaccgatta cagacgtgtc gaaagaaaga 3900
aacctggtaa agttaaagct agaaaatcac caacttgggt caaaagatag acgcacacga 3960
tttctttcgt tacatattct tacatatttt aaacatatac attcgtacca tgtaaatatt 4020
aatatcaaca ta 4032
<210> 19
<211> 3067
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c4821_g17_i3
<400> 19
caattaactg atggtccatt ctttaacaaa taattttttt tttgttagaa gttattttaa 60
aaggaattaa cagaaaagca atgactgggg tcattaggat tgtcaatata aaagcaacta 120
aactcctaag agtttactgt acgaacggcg ataaggtagt tcatcatact tacttaataa 180
ttacaggaag tgacaataca aaaagaaatc tagttctgga gagaaaacca agggtaagga 240
atgaaacaga actgggaaga gaaggattct tcctgctgtc ctgcgctttt tcctagtgga 300
aatacatgca caaatttttt tttctgatgt atatttcctc tgtgtgaacc acgtagctct 360
gtgaaaaagt atcgtaggct agtttgaatg tggaaaatta gcgggggtgg ggccccgata 420
gaggctaaag ttatgttaaa attgtctacg ctagattcac tgaaattaca cgttgaactg 480
gaaaaataat ttccccgggt gaatgaaatt tgtcatgcag ctgtaaaacg ggacacagaa 540
aacggcgcat ggtgaaaatt tttcagttgc ttttttggtg gctagtattc aaataatttc 600
tccttgcagc cacatagatg aaaatgaaga agttaaagaa caaaaagatc ccctacaata 660
tagatttgca actacatgca accataatca tggtaacaat tgaacaaaat gcagcagcta 720
aaggtgcaaa ttagtttctt ttgtgcatta atttcgcctg aaataatttt cctttttttt 780
ttttttttta ttttttctgg aatcaacatt caaattatct aaagaacctc tgcagaattg 840
tttttatttt cttaaagatc aaccaactta aggaaatttt tttcaaagtt ttgctagtgt 900
tttctctcct ttaacccact tcatccaatg gttattcttg tcgttatgct acgatatttt 960
ccaggcggaa ttgctttttc tgccttgttt tgattattaa atagtttctc cctttattaa 1020
taattattcc atgaacaaaa tctccccttc atttgattca gaaatcactg cagattaaag 1080
acactcatgc aagttgaaat tgaattaata aattactttt atttcatgca aagctcaaca 1140
acaaggacaa catgaatgat gaaaattcca aaaagtaact ctttcagaaa taggaaaaaa 1200
aaagatataa aaggtcaacg aatattccaa cttttacaga aataatttcc tttacaactt 1260
ttcctatttc atatttcatt tcttttgttt attttaaaaa taaaaaacca tacaactaaa 1320
gatttatatt atatctcttt aacaataaca attcagtaaa tatatacttc aatatgtctg 1380
ctgctcctgt tgaagaaaac attaataacg agtctcaaca attgactcca actgcctctg 1440
gctccaactc tgttctatct actccatcta acaaagctga cagagatgaa ctaaaagatg 1500
aagctgaaaa cgctgaagat aatgtcgctg cttttgacga tatgccatta aagccagctt 1560
ccgcttacgt caccgtctcc atcatgtgtg ttatgattgc tttcggtggt ttcgttttcg 1620
gttgggatac tggtaccatt tctggtttcg ttaaccaaac tgattttatt aacagattag 1680
gtcaaaagcg tcacgatggt tctcactact tatccaaggt cagaactggt ttaattgtct 1740
ctattttcaa cattggttgt gctatcggtg gtgttatctt atctaagatc ggtgatgtct 1800
acggtagaag aatcggttta attactgttg ttaccattta cgtcgtcggt ttaattattt 1860
ccattgctac ccaacatgct tggtaccaat atttcattgg tagaattatc tctggtctag 1920
gtgttggtgg tatttctgtt ttatccccaa tgttgatttc tgaagtttct ccaaagcatc 1980
taagaggtcc attagtttcc tgttatcaat tgatgattac tctaggtatt ttcttaggtt 2040
actgtactaa ctacggtacc aagaactact ctaacactgt ccaatggaga gttccattag 2100
gtctaggttt cgcttgggct ttattcatga ttggtggtat gatgtttgtt ccagaatctc 2160
cacgtttctt agtcgaagtt ggtagaaatg aagatgctaa gagatctatt gctgtctcta 2220
ataaggtttc catcgacgat ccatctgtac aagctgaatt agaattatta atggctgctt 2280
ccgaagctga aagattagct ggtaatgctt cctggggtga attattcgct accaagaaca 2340
agattttcca acgtttaatc atggcttgtg ttatccaatc tctacaacaa ttgactggtg 2400
ataactattt cttctactat ggtaccacta ttttcaacgc tgtcggtatg aatgattctt 2460
tcgaaacttc tattgtttta ggtattgtta actttgcttc cactttcgtc ggtatctggg 2520
ctgtttctag attcggtaga agaactctat tattatgggg ttccgcttcc atgactgctt 2580
gtatggttgt tttcgcttct gtcggtgtta ctagattatg gccagatggt gctaaccaca 2640
aggaaaactc ttctaagggt gctggtaact gtatgattgt tttcacatgt ttcttcattt 2700
tctgtttcgc tccaacctgg gctccattag ttttcgttgt ctgttctgaa tctttcccat 2760
tgagagttag atctaagtgt atggctttag ctcaagcttg taactggatc tggggtttct 2820
taattggttt cttcactcct ttcattactg gtgctattaa cttttactat ggttacgttt 2880
tcatgggttg tctatgtttc tcctggttct acgttttctt ctttatccca gaaaccaagg 2940
gtctatctct agaagaagtc gatcaaatgt ggctagaagg tgtcttacca tggaagtctg 3000
ctcaatgggt tccaccatct aagagaggtg ccgaatacga tgccgaagct atggctcatg 3060
atgataa 3067
<210> 20
<211> 5390
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c4795_g1_i1
<400> 20
gacaataata aaaaataaag gtgttgaact gtcaacaaaa tacagttaat tgtacggtat 60
gtaattttca tcattcacat cgacttatgt ttatgctgct cctcttcata atctgctaca 120
attaaattgc tctttttttt ttgttatcaa cagaatatat atttcctgag gggggaaaaa 180
agaggtaaga cagttcaatc ttttaacaag ttataagtaa caaggacaaa tgcgtttttt 240
ttgtcaactt ttgattgtat cgcataaaat atttactcca ttgcaaatag gaacttattg 300
actataacag taatttcctt tattaataat gaatttattt ctctttatgt atttgcataa 360
taactgggac atttttgctc ttgttcagcg gtaaatcgtc tagacgaagc ctatgtatct 420
attaatctat tatagaggtg atgtcccttc gtagtcaata aattctaagt acacatatac 480
atacgtaggg gcactcacac tattatattt attttctttc tttcttttat ctggcaatac 540
gatacggaga ccggagaaag aatgttcgtg ggaaaaaaaa actttttttt tttgttctag 600
aaggtttcat tttcaaccag agtaactccg gacaaaaaag ggataccgta aaaccccgtg 660
cgagtgagat ttgaattcct atcatatcgc aatttgtcgc aattcatatg gttctttcat 720
tttttcactt acataataaa catttctctt tccccagaat ttttttttac ttttattttg 780
acaatagcaa tgaatttaaa ccctgaaata attatattat tgattgatgg agttttcaat 840
acaaagagag agagaaagaa aaagttagaa ttgatatcgt agatggcttt atccacctat 900
tcattcaagt ctgtcagcac ttcatcaggt tagagataag acctaatacg ctggttccac 960
aataattgaa ctaataaatt acactatatt cctttttgtc tctggataaa agatgtatta 1020
tagtttcaag atacatattt gaaacgtaca agtaatacaa agttgttaaa accataattt 1080
aataaaaaaa ttctttatgc aacattgatt ggaacgacta caattaaggt tctatatccg 1140
atgcttcaat aatgcgagat tttaagaaag caatatgcct acaagagaat gttaagtaat 1200
ataatcaaga catttttttc ttacaaaagc aaaaaaaagt gaaaagtcgg aaatgtctta 1260
agacccgaga atccaggaac cgatgtgaaa aaagtttaat tatcaaatga tattaacagt 1320
tattcatacg tacaattaat tcccaatcta atatatattc atatgagtgt agtgtatata 1380
atcttaacta atgcatactt cacttttaat gattacaaaa tgaaacagca ttttaaatct 1440
tattattagc atacccaatc atttaaagta attttatatt tcgaggatag atagtatttc 1500
ttgtcgacat aataacataa gcaaaattct tgtatctcta attaggtaac tcccgctccc 1560
cccccaaaga aaaaacaaac cacttctgca agtttcagtt attaaataat atgggaaagc 1620
gacacattcg gagttttata ttattatcac acatataacg tcatatttat ctataagtgg 1680
taactaatat gccaattttt tacaagaaac aaacgaacac ccatatgtta cggtaatggg 1740
aaacaacata atttgtcaaa tatatggcat atatctatca agttttacct gatatcttca 1800
attcggaaag ttacttgtta tggtaaaaat gaattagctg gcccttaatt tatgacaaga 1860
atgagctatt atctggggta cgtttattta tcgtatacct acttataaga atgtaagaat 1920
aataagtttt gaaagatttg attaagactt tgggaatggt aaaattgtta ataatgattt 1980
attaattact gtcagatatt aaaactccat cgttaccaga agttcattat aattttcaca 2040
tgcttctact aaaatatttt tgttgagctg ttatgcgtgt catttgtgac actgcgatta 2100
tgagtatgtc attcatttaa catcagtttc tccaagttat ttaatttttt tagtgtcata 2160
ttgttattac cccaatattg tcatacattt atccccaact aattataata ttctcaataa 2220
ttatagcttg gcgagtaaat ctttcaataa tctgttgaga aaaacctgtc ataaaatatt 2280
acgaatttct tttcaacagg tacaagcaca tgaataatct taactttatt ctctattggt 2340
ttaattaaac acttttaaac tgtggaaaca tactaatatg gtttatacag acatgtacgt 2400
atactccaat ttttatttga aatacatacc ctaatttcag cccttcattt tacgcgtatc 2460
atcttgaaca gatacaagtt acctaattag gaaatgtaat atcttgaagc caaaaatcta 2520
ttttttttct ctcttcttcg gaaaacgcgc gatcaatctc tttaccgaat gaggtaatct 2580
taattacacg aaaaattttc agatattttt ctctctttct cgaacagtgt ttggttaatc 2640
gaaacataat cgtaaaataa acacataaac cttccgtttg caataccttg ccgtcaattt 2700
aacacccttt tcatactttt tcaaataatt atattcaact aaaagttaaa aatcagttaa 2760
ctaacgtatt tttacaacat ttgttaaggg aataatagaa gctatcaaac gttaagttat 2820
cacacagtta tatcatcaaa caacaatgtc atttgataga ccagaaattt atagtgcgcc 2880
agttttacaa ggtgttacac caaacgatga tgataacaca gaaattatca aatcctttag 2940
aaattttatc cttgaattta gaatcgattc acaatttatt tacagagaac aattaagaaa 3000
tgcattatta gttaagaatt attcattaag tgttaatatg gaacacttaa ttggttataa 3060
tgaggatctt ttcaaaaaat tgtctgatga accatctgat attatcccat tatttgagaa 3120
tgcaatcact caagttgcta aaagaatcac tatcctaaat agatctcagg agtctaccac 3180
aggtaatgga caagcaacag gtgaggatat cgcatctttg attccaccat ttcaattaat 3240
cctaaattcg aaagctaatc aaattccaat gagagaatta ggttctgaac atgtctccaa 3300
agttgttaga ttatcaggta ttgttatctc tgcatcagta ttaacatcca gagctacaca 3360
tttacgtcta atgtgtaaga attgtagaca tacaacatcg atcactgtaa atacattcaa 3420
ttccattact ggtactcaag tttctttacc acattcctgt ttatctaatg ttcaaactga 3480
atcaggtcaa gtaagttcca tggaggcaag tgctccacca aaaaattgtg gacctgatcc 3540
atatatgatt atccatgaag cctctacatt tattgatcaa caatttttga aattacaaga 3600
aatcccagaa atggtaccag ttggtgagat gccacgtcat ttaagattat catgtgatag 3660
atatttgaca aataaagttg ttccagggtc tcgtgttaca gtagtcggta tttattccat 3720
ctataccgct aaaggtgcag gaccaagttc aggtaacgaa ggtggtgtct ctattagaaa 3780
tccgtatatt aaagtattag gtttacaaac tgatatcgat acaaatactt tctataattc 3840
tgtttccatg ttttccgaag aagaagaaga agagttttta caactaagta gaaatccaaa 3900
tatttatgat cttgtcgcta aatctatcgc tccttcaatt ttcggtaatg aggacattaa 3960
gaaagccatt gtttgtttat tgatgggtgg ttccaagaaa ttattgcccg atgggatgag 4020
attaagaggt gatatcaacg ttttactact gggtgatcca ggtactgcaa agtctcaatt 4080
attgaaattc gttgagaaag tctctccaat ctctgtttat acatcaggta agggttcttc 4140
tgcagcaggt ttaactgcga gtgttcaaag agatccaaca acaagagaat tttatttaga 4200
aggtggtgct atggttcttg cagatggtgg tgttgtttgt attgatgaat tcgataaaat 4260
gagagatgaa gatcgtgttg cgatccatga agcgatggaa caacaaacca tttctattgc 4320
aaaagcaggt attactacag ttttgaattc aagaacaagt gttcttgcgg cagcaaatcc 4380
aatctatggt cgttatgatg aaatgaaatc tccaggtgaa aatattgatt tccaaacaac 4440
aattttgtct cgttttgata tgattttcat tgttaaggat gaacatgatg aagcccgtga 4500
tatctctatc gctaaccacg ttattaatat tcatacaggt cgtgtctcgc aagaacaaga 4560
agaaatggaa aacaatggtg aagaaataag tatggataaa ttgaagcgtt acattactta 4620
ttgtagaaga aaatgtgcac caagattatc tgttcaagct gctgaaagat tatcatccca 4680
attcgttacc attagaaaag aattattaat aaatgaattg aattctactg aacgttcttc 4740
aattccaatc actgttcgtc aattagaagc tattattcgt atcactgaat ctttagccaa 4800
attagagtta agtcccgtag ctcacgagag acacgttcaa gaagcaatca gattgtttca 4860
agcatctacc atggatgcag catctcagga tccaatcggt ggtatggatc caacagggaa 4920
ttcaagatct atacttgcag agattcgtga aattgaacaa gaactaaaga gaagattgcc 4980
gattggttgg tcaacatcaa tacaaacatt aagaagagag tttgttgaat ccaatagatt 5040
ttcacaacct gcattagata aagcattata tgcgctagag aaacatgata ctattcaatt 5100
aagacatcaa ggtcaaaatg tttatagaag tagtatatga tctggtgata acacatgggg 5160
taaagtaatc ttgaatcaaa agtttgtaat attaatacat caatttgtct agattgaaac 5220
atatatatac gtacatacaa ctaatgaata tattctaggc aagagcaaaa aataatacat 5280
agtatatgga tccgattctt atggagattg aagtaaactc aaccagccct atactcttca 5340
atttattctt gagttgtaag ttaaccgaag gacggtcgga acgaacgcag 5390
<210> 21
<211> 5128
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c4321_g4_i2
<400> 21
attgaaataa ctctttcttt gatacatcac ctaaagacta tcaaacacat ttattaatat 60
taagatttta aaatattaag aaacttttcg tcaaaagtat atttcaaagt tttttttaat 120
atattcattc agtttaaaaa tctcattaat tcattcatga taatataaga gattttctaa 180
atattttctc acatcatcgt tctacaaaaa taatattatt gcaaaactaa attaaagaag 240
gaattttata attttaagtt ctaactttca actgtttttt attgttcaat tttattttca 300
ttaacttttt cacaaaattc tacaagtatt tccttaatga atttaaccgc ttatcatcca 360
tatccaattt ctaaccaatc aacacaatat cataaaactc taatggaaaa tccatcgata 420
ccaaattcaa gtgtagcaac ctcatctatt acaactacgc ccacaaatga tttcaataat 480
gatttacctt caaactctag gagggaatct gaaacaggct ccacaaaggt aacattacca 540
ccaatctcta gtatcatcaa tgctccacaa gaacaacaaa ataaaagtat tagatctgaa 600
atagttgaac ccgaaaactc tacttcttta aggacatctc ctttaacgca gacggatatt 660
caaaattcac agtcaatgaa taatagtact ataactcctg ctggtccagg tattagcact 720
tacgttcaac caatggtaaa ttccagaaga ccttccgcaa cacagcagca aatgaatatt 780
aattatgcta caccaagaaa tggtatggca gtactaggtg gggtttcttc ccaagcaact 840
ccaattggta ccccagggaa tagcccaaac ggtaattatt tagctaatca agcgattata 900
cagcaagaaa atgaagctgc tgcttatgca attcaacaga aacaacaatt gcaacagata 960
cagtatcaac aacaagttca agctcaagct caagctccac aacaaggcta ttattatgtt 1020
gttgcaccat ttcaacaaca acaacaagtt caaacgcaag cacaacaaat gccacaaatg 1080
atgccaatgt ctatcactca acaacaaatt gctttccaaa aggctcaagc acaacaggtt 1140
gacgaacaac aggctcagca gcagcatatg cttcaacttg cacaacagca acaacaacaa 1200
gctatttcta catatcctgt tgttgttaat atgccacatc caaatgagat tcaacaacag 1260
caaccgcaac agcaacaggt acggagtccg gaatttgaga ataacgttgt ttatcaagtt 1320
ccaagacaac cagaaggtat aatgaatcaa ggtcacccaa taatggttcc aaccactgct 1380
attccaacaa gttctcaacc agttcaaaaa ccaacaatga ctgctggtta tgttacatca 1440
gaaggtttaa ttcctgttcc aacaagtatt caatctaatc taagcttagc tgttagatta 1500
cgtaaacaat gcccagtatg tggtaagatt tgttctagac catctacttt gaaaactcat 1560
tatttgatcc acactggtga tacaccattt aaatgtccat ggaagacatg taagaaatct 1620
tttaacgtta agagcaacat gctaagacat ttaaaatgcc atcaaaagaa atcaccaaag 1680
gttactaaag gtggttctaa ttctggtgat gaaaaaaact ctatagacaa tgaaaagaca 1740
attaaggcta ttgaaggggc agtatcatca tctgaaaaac aatcgaaagc tactgatgac 1800
gatgctaaag ctgattcgtt gtcgacagaa attccaaaag aaactaaata actttgctaa 1860
tttgatatta tgcgaactct atattattgc tcaattccga taaacaaaat taatagaagg 1920
aagcaaaaag ggctatctat ataatatttt actacatata aaaatgaata ttcataacta 1980
ttaaacacaa atagaattgt aaagtttcga aaacaggttt ccattaactg ggacaaggat 2040
acgtttctcg gatctgtttg gctgctatat tattaacaat ctatccagtt tccaaaaact 2100
gcacttccct attcataaac tagcatctga ttatttttga aagccgattt gagtttcaaa 2160
tcttacttaa tgaaacttta ataatccctt ctgtctctat ctttaagagg ttttgatacc 2220
ataataatct taacaaaagt ccctcatttt caattccgtc aaaattgatt ggttaataat 2280
attcaatgat tgatttgcat atatcgtctc gaatttgagt cttcaagctc actattgtga 2340
catagacaaa atattatctc aaaggataaa gcaaaaaatt aaattgaaat ttcagtaatt 2400
aatcgaatgg tttgttagta attaataatt atgaggtcaa atgaaaacag atcgatactc 2460
gtttcggaaa tgctaagagt aaaccaaaat aaggttttat ttccaaaaaa aggaaagtaa 2520
aaagaaacta tacattgcct attgtggaag gtttagtaaa tctccgaaga acctgcgggc 2580
gagcggatga attttgtttc ctgagaaata aaattttttg atatatctct gtaaatatcc 2640
gtagtactgc tgttgtttcc tagaatattt agaaacatcg aagagaaagg aacgcgggac 2700
aaagaaataa gatttctatg tttagcgtgg gtagtaaggt cacttgtacg tattgttctg 2760
acatcgcata gatatctaca aaattaagtc aatttagaaa agtgatcagc aggtgagaag 2820
atccagtagc caattcatta tttgtgcaca aatttactgc aaaaggttat gtatcttgca 2880
tatccatatc gagacctaat ttaggaattt taatatttta actgtcctca aacttattca 2940
attcatttac ccttctctga ctatttcaaa caaggcactg gaatttctta gataaagaaa 3000
aatataattg caacatttgt ccacattatt gctctgttta acagcgaaaa tcgtgtaaat 3060
tttagtggga aaacataata ttaactacta ccaaattctt tcatgggtac tatcagctta 3120
aggtggaaaa taccaagtgt tcttctatta tctaatagtc cctaagatat ggagggaccc 3180
aagtaagaga tattgaaatg ttccccaaag ctatgcccca cttgaatatg ctttcccttt 3240
caagcttcct aaacatgtaa cattcttagt attggataag tgctgactta tataacaagg 3300
tttttctttt aaatccagga tatataaaca aactctaagt aaaaaggtta gagcaccgaa 3360
ctaaacgaaa tcaagaaaga cttcgtttga agcgaattgt atagctcaac caatcaggaa 3420
caatagatca aatatttagt atgatgtgat atattgatgc tactaaaagt taacggaaag 3480
acgaaaatga tatccgatta atcgattatc tcgaatacag taaattatta gaagatgaaa 3540
gataattaaa tttgtgaaac atacctaaat ccaaacacag atcaaatatt agtatgccct 3600
taccacctct taactctttg ttagggtgtt tttagaacaa gctataaatc ataagaggtc 3660
aggtgataag agccactgtg ccgaacctga accgtaagat aatagttgaa gtagtgactg 3720
gaaatagcct ggcccgctat gatctatgta acaagaccag tacacactca caaataggaa 3780
gttagatacg cacgtgataa gagtgtatgc cttagaagct taagtagata acattgagcg 3840
acaatacaat agtatgtccg ataagagttc cgataagaga acacatgtcc tagttgttct 3900
aataatacaa tagcatatac atgctcgtca acgaggaggt ccctacgtta ggattccttc 3960
cgagtgatta tcaacagtgt ggtatgagac acacgaccag caaataataa agaaagaata 4020
ataataaata aaatataaaa ggacgttaag caagtgctta acgaaagatc caaacagata 4080
tataaaaata agcaatcaag cttaaagata ataattacaa caatcaacac taagattgta 4140
agtgtacata agatacataa tataatataa gaatctacta taaatactct tatcacagaa 4200
aagcgtctaa cacattaact ataactaaga tgattgaatt aactaaacac aaaagacatc 4260
taagacttcg aaagacatcc aagggaatcg aagacatcta cttaatgtag aatctgagaa 4320
tctaggaatc tgagaatcta agaatctgta gttaacatca acaagataca gcttggcact 4380
atcaccaact gaacccatct tagctatgca cacaacatag tctactgtca actctcttaa 4440
ttggaacgaa gaagaactaa gaactaagaa ctacttatga taacttcaga catcttatca 4500
actaagaatc tcgaattatt atactcatga tacagcatag cgtaatgagc aactggatcc 4560
ttatcgtaca agaactatct atcatagaca acgtctcttg taagttccct cgattaatga 4620
aaggaactaa ccaatgactc tttcattcaa taggacaact caataacgac tctcgtgtat 4680
aatctatttc caaaacataa attttaacca ccgtaaaaca ccagagccgt tacaaaaaat 4740
tcgaaagttt gattatttgc tattgttaaa tttggaaggt tttctacggc tgtttctagg 4800
cagggagtat taaagataag attatcactt cctaaaacat ttcaaatgag agtaatgtat 4860
tgatatgcat ttgagaatgt acggataata ataatacatc ccatataagt ctaatatatc 4920
aaagaatgat accgtaataa ttcaatcaga ccacatacta aagcttattc tcggtttatt 4980
acgaaaagat atttgtctaa aataagtttt ctaagtattt tgattccatg ttattggcaa 5040
tatatatctc atgaatcaaa tcaagatgtt gcaaaatcca ttatgaggtt gttgcaaagt 5100
ccaatatgag gttctaagtt tctatatc 5128
<210> 22
<211> 2759
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c144_g1_i1
<400> 22
ccattgtcca acactaccaa caataacact aacaccgaag gttcaggggt tgagtccggt 60
gaacacactt tatcgagtat cccacctcaa cataatgcca atacgttctg tttttttggt 120
ttttattata ttatgtaaat tataaaatct taaatacatg aacgaactga aactcaaatg 180
tgagtccaga tttacccttt atacacgttg cttttactaa cttaagcttt gatcatcaaa 240
tgaattgtga tttcctaata ctttgtgttg tatttttggt accatatttc tgtcacttat 300
gtaaaatgga atatacaata ctactattca attagtgatc agtcagaact ctttgagaac 360
ttgaagtatt ctttccaatc tttcttttca ttctcactct cttctacagg ttttcctctc 420
cttgatgggg gaacccaaga tgcagatttc catggttgta caccttcttc atatagtaat 480
tggatttctt ccaaagacaa accaatcgtt tccggtagaa agaagaaaac atatagaaac 540
atagctacca aacaaccgac aaatacgtaa ccatagtaga agtggataga accggtaatg 600
aatggtgtga aaaaaccaat caaaaattgc cataaccagt tacatgcggt agaaatcgac 660
atggctctgg acttgaacct cgaggggaat gattcggcaa ccacaatata agcaacagga 720
gcccatgatg ttgcgaagca aaataggtag aaacaggtaa atacaatcat cacattacca 780
gcacctttcg aggatggggc actgtcacca tgaggataaa gacatttgac tccgacactt 840
gcgaatatga ccatacaggc catcatgcca gctgctccaa ataatagaca tttacgacgg 900
ccgattttgt ccacaactat aacagcaata atagtggaga agaaattcac cgtacccaga 960
atgatagaag tctcaaatcc gtcagtaaga cccactgatt tgaaaatagt tgtaccgtaa 1020
aagaaaaagt agttttcacc agtaagttgt aaaaacgttt gcactagaat acctgtaatc 1080
aaacgctgaa ggatatttga ctcaggtgaa aaaagttcct tccatgaagc ttcaccttgt 1140
tccctttggg caagcacacc ggcgataatt tcttctactt ctccatgtac ccatgcatcc 1200
tctggtgaaa tcttgttgat cttggcaata gaagcgcgtg cctcatcatg tctttcctgt 1260
tcaaccaagt atcttgggga ttctggaacc aatagcatac caatgatgat aattagggcc 1320
cacaaaaagc aaagtccaac agggatcctc cattgtgcag tattattata ctttctggtg 1380
ccataaacac tacaataacc taggaaaata ccaaacgtca tgttcaattg atacaatgaa 1440
acaagcccac ctctcatgtc tttaggagct atttcagaca aaagcattgg acacaacacc 1500
gaacatccac cagcaccgag accataaatg atcttaccga taaagtattg gtaccacttg 1560
tgatttgaac taatctgaat aattgcacca atcatatata ccaataccac gatgacaatt 1620
gctaaccttc tacctaaagt atctgcaaaa cgggcaaaaa gaagacctcc tatagcacaa 1680
ccaacactga acattgccac tagaagaccc atacgcacat tactcaagta atattctcca 1740
gtactgtgtt tgtaagaacc gaaattcatt ttaaagttgt ccatgttaat gaacccagcc 1800
gtaataccac tatcccaacc aggtaggaac cccccaaagg agataggaat acaaagcaga 1860
tagatagtaa gataacctag atatcccctc tttggtggtt caatggaatt tccgtttatg 1920
acttcattgt cataaacccc atccgaccac tctttttcta caggtggcga gacataaact 1980
tcaatattag aggcatcttg aatgtctata ttactatgaa aagatgattg tgaactagac 2040
atttttttta ttttaatttt taaagccttt ctttttcttt tttttagtta tattattata 2100
gaataaaagt ataagaataa atgaatgatg cccattgacc ttgcccttta tatatgatgg 2160
tcgaattgca attctatgat agatgatgat aaattcaata gcttatatca tctctttaac 2220
ataggttcaa gggaagtaca ctcagggata tgatctgcat tcaaacggtt cggacggtcc 2280
agacaattca gacggttcag agtattcgtg gaaagtgaac cggctgctat cattagctaa 2340
tactatcctc caaaataata cgtcattact ttgggtgaaa tgcattatta gctaaactgc 2400
attcattgta cgtcactctc tgtggtacta acaaaccact gtaaaaaaga aaaaaaaacg 2460
ccgaattctt tgtgcctgac agattgttgg gatctccacg gatcattttc agcgccccat 2520
gttttgccga atgcataccc cgcccttacg ttgagctttc ataactttaa atttcagtcc 2580
ggaggtattt gttttgagag tgggcctcgc agtgaaaatt ttggaaaagt ttcacacata 2640
gttaagttac atttggacct attttcacaa tacaatggaa ctttttttcc aacttttaat 2700
atcccttccc ctatacaagg gtaaaatatt ttcatttttt actttccctt ccctttcac 2759
<210> 23
<211> 1217
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c2309_g1_i1
<400> 23
cgcaaatttg acaaatggtc attttactcc aatttttttt cttattttga aaaatttccc 60
tgataaacaa aaaaaaaaat tgaattactt ctaaaaacgt taatcattta tcctcattgg 120
aagttacttt ttttttcctt ctctgaaacg tgcggagatg atgaggggta aatattttga 180
atttctcttg tttttctttc ttgctgctga catctcacgt ttgacgaaat ggagaacatc 240
agttgccgcg gatccgaaaa gacgattaac taaaaacgcc tcttcattta gttaatcttc 300
ttgctggttt cgcgtctcct tattaccggt tcagctgatt gatattatct cggagatgag 360
caacaaacac cagttgagtt catgattcta tatttgtaaa ctagttttac aatgacatca 420
taaattaaag ggaagaagac aagagttact taagaatctc gagttctgtt tgtttgtttg 480
tttttttaat taagtaatat cgctgagaga tatagttaag attacataaa aaacaactga 540
tacaagaata attaacatta acgacctctc aaccataaag tgaacgtagt ccacttatat 600
tttcatattg cttaactgga ttttcattta gaaattgtac agctcattga atagtcgagt 660
caataattca aattccgatt atttaattac cacacaccct tatatttgat caactgacaa 720
aggacatttc cttagtgaaa cagacataat agccagcaaa tcattccatt gcattttgat 780
taagactcat tttttatcat atattgttac tgtttcagaa aatgtcaact tttccatcaa 840
ttttaatgat ggatattaca gattctaata tatcgatacc tgattcaaac gaccccacaa 900
gaaaggaagt tggtcttaat aaagacattt atgactcgtt tgacaatgaa ccatggttac 960
ataattcatc acaagatatc acacaattga agatgatgaa agttgataga gatctattat 1020
ttcaaattat tatgattgaa gatatttcaa aatctaagca atctcaattc gatgaattaa 1080
gtacacgtat agatcctaaa aatcaacgtg ttgatacttt aagaaattca aagggtccta 1140
aaaaatttga tattgttact caagttgatt tagataatga taatgatact tcaacaacaa 1200
caacaaataa taataat 1217
<210> 24
<211> 2127
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c3558_g1_i1
<400> 24
gttttattta ataataataa tatttaatta ttattcactt ctacttcaaa taatattcaa 60
ctagtttcat ttttttaatt tattgaaaac atactttttc cctttggtac aggtacattg 120
caataataat taataaatta acccttagaa tttttatttt gtcgatctaa ataaaaaaag 180
aaattaaatt atacgatata tataaatcga ctatcaattt agatcatttt taattcgctg 240
ttaatttatt aaaaaaatcc ttcaacctgt tgtaaaatta gacaaacata tcatactaat 300
caaaaaatat ttctattaaa atggaagaat tgtattacta tataccaact aatcaacgtc 360
caaataatgg tcaacaaata tcacatgtac aacaacaaca ggtacaacaa gtacaacctc 420
aattagttgc atattatcct tctacaaata ttataattcc acaacaagaa ttacaacaac 480
aacaactaca gaaacagcga caaagaaatt cacaatcgca tcctcaatta tatccatacc 540
ctcaattaat gtatactaat caattggtaa atccaaggta tactacacta tacagtccaa 600
ttatttctca accaggtaca tcgacagcaa ttcctattac aagtacttct gctacatcaa 660
tatataatga acgttctaat caaataaata cacctattcc tacaatgaca agtaatcaaa 720
taactggttc aattcattat aataacgaaa taaatgtcag tcctacagca gtgacacata 780
ataatccacc aggtgtacaa ttaccaccat tgtcaagctt ggtgtcacag attaaatcaa 840
catcccaatc atgtcctgat atttctacat tgacatcaca atcgagttct tcaattaata 900
caatgaatgc aaacaacgca agagaattta agtctgcagc tacatctatc tcatcagctt 960
caagtttcaa tgacaataaa aataacactg ctactactac tactaataac catagaaata 1020
gtatacctta tatacttacc ttttcatcgc agaaagaaga caatcttcac tcggtcacta 1080
atcataatca atctacacaa ttaccaatta cgaatttccc agtaagagaa ataactccac 1140
caatctatac aatttcacca aaacaaaata atgcaaatat taataaatta atagttacaa 1200
atagaaactc aaatgaaaat ttaaatcatt tactaactag aaatgatatc actgtaattg 1260
aaccagtaag aatgaataat acaaatatta atgatatgaa attaaataag ggcaaaaatg 1320
gtaaaattcc atcacaacaa agaaaacaat gtccaatttg tggtaaaatt tgttcaagac 1380
cttctacttt aaagactcat tttttaattc atacaggtga taatccattt aaatgttcct 1440
gggttggttg taagaaaagt tttaatgtta agagtaatat gttaagacat ttaaaatcac 1500
atcaaagaaa actagaaaaa ttagctaaga aacaagctga tttattaaaa caagaaaaat 1560
tgaaacaaac aaccaacaat aatgacaaga agaaatagat agaaaacagt aatggctaaa 1620
agatttcatt taaatcaaat caaatatttt cattttactg tcttctactt cttactccat 1680
agaataaagc atacctaaca aataataatt ttgataatat ccttgataat aataatcaaa 1740
agaaaataac gaaaatattc ttcattgctc taatagcatt cattattctt tattcacttc 1800
atctatagat attctaaatt taaataatat gataatctct tttttttttt tttgcctttt 1860
cagatctctt tatgtaaacg acacactcgc catctttcaa caagacaacg cggccagccc 1920
aaaattttta ttgtcctatt taagcaaagt gtaaactttt cagaagtgac aatgttgaat 1980
taaataaaaa aaaaggaatg caatttctta aatgaataat ttacacatta attagaaaaa 2040
aatcttaaat attttacaaa aaccgaaata aaacttagtt tcaggaataa aagcatagaa 2100
caatgtaaaa aaatcgtggt tttaatg 2127
<210> 25
<211> 5413
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c1715_g1_i1
<400> 25
ggcgtattca aagtatagtc aagaacataa tatattgagg gtgtgtcaat accacattat 60
agatttagga tatactcgat taagaataat aacaatgggt gcgagttgta aggatcaaaa 120
gaaagctgtt gctatctgtc ttcagagatc tccttgtgta atgatagaaa gaaatagtcc 180
tcaaaaatgc atagacgatc caaacttaag caaggatttg ccagaacttt gtatagcaca 240
aatgaaagca tttctagatt gtaaacgagg gatggttgat atgaccaaaa gaatgagagg 300
taacgctcct ttatctacag gtaagtacga tgaacaatat gataatttat gcaaagggaa 360
gttcgatccc agggaagaaa tgcataaatt acaagtttta aattctcaag agaaagaata 420
aaaaggagaa tgaatttttg taaataaaat gacaacaacg aaagagatta atattccata 480
cgtttcataa ttcaaaaatt aatttaagaa tatggttttt atgttataca agatgtatat 540
aagaatgtgt acataagtac cggtataaat cacaatatct ataaacattg atgtaggatt 600
atcctgagtt ttaagaagtt aacttctgtg taaaaatatg ggcctagagt ataggatgat 660
tataccggtt aatcaatata tccttctaaa ttattaatat gcaactgtat aagtctgtaa 720
tgaccttggt gattaatatt attttcattt gttatagccg attgcgtaat agtgaaaaaa 780
aaatcttaat agcggatcta tgtcttcctt gctgtcagct caagttctta aagaactagg 840
acaaatacac ttgaaaatcc aactttaatc aaattagatg gaagtaatag ttatatcaat 900
agctagctat agttatattc aaaccaactc tataaaaaag cactaatatt gcaaaatgga 960
tgacgatctg caaaataact taaatgtggt gaatattgtc tctcaacaaa gtctggagca 1020
aaaaattggt gataatgttg aacaactaac aaaacagaaa ttactcgaac aagaaactac 1080
aagattagaa agagctcaga atctatttga caaactgaag gggcagctat cctcattaag 1140
aagaaggctt aataatacga accgaatatc gataaagatc aaattaagga aagagatcca 1200
acagttacag gataaagata ttatcgaggc acggaaagat atcagggaga tatatgatcg 1260
acttagagat ctgaaaaagt cagatgatac aaataccaga gagcaagatg atggtggtag 1320
aagagaaggg gagtcagaaa gagattactt agtgaggaca ggtaagctaa ctgcatttgg 1380
ttcaaaatca ggatttataa tcgatgataa agtaaattct ccagctacaa aaagaataaa 1440
agtggaagat gacgcaatac tcgaatctcc cactactgat gaatatgaga tggcgaatga 1500
acaaatggtt gaaaacataa ctgataactc ttcagaaagc gattacaaac cagataataa 1560
tggggatata tccgaaaatg aggattataa tgaaagcgac ataaatactg aggatgagga 1620
aataataata gaggaaggaa aagttaaagt taatgaagct aatgatgatg gtgatgagtt 1680
aacatatcag aaaagattaa aaaagtggat agcccaaaga tctaagggga gaaaaaataa 1740
caatgaagct ccattacctg agtggcgtaa atcacatcct gaaattcctg atgcaagact 1800
tgatgatatt tttaaaattc ctggtgatat acacccttta ttattcaact atcagaaaac 1860
ttgcgtacaa tggttatacg aattatacca acaaggtgca ggtggaataa tcggagatga 1920
gatgggtctg gggaaaacaa ttcaagtgat agcatttctt gcagcgctac accattctgg 1980
gctattaaat ggcccagttt taattgtttg ccccgcaaca gtcatgaaac aatgggtcaa 2040
tgaactccac cattggtggc ctccattccg ttctgtcatt ttgcattcaa tagggtcggg 2100
tatgtcagat aaaagcaaaa tgaaagaaac agaattcgaa gaattgatga tgaattcaaa 2160
cccggatgaa ttttcctacg acgatttcaa gaattctaaa aaggcaaaat ctgccttgga 2220
atcgtctctg catttagaca atttaatacg aagagtggtt gaaaagggtc atattctaat 2280
tacaacatat gttggtctca ggatacattc agaaaagctg ttaaaagtag actgggatta 2340
tgttgtctta gacgaaggcc ataagattag aaatccggat tctgaaatat cattaaccac 2400
aaagaaatta agaactccaa ataggataat tttatcaggt actccaattc aaaacaatct 2460
gaatgaatta tggtctctgt ttgacttcat atatccaggt aagctaggaa cattaccagt 2520
atttcaacaa cagtttgtta tcccaataaa taccggtggc tatgcaaatg ccaccaatat 2580
tcaagttcag actgggtata aatgtgctgt tgcgttgagg gatctaattt ccccatatct 2640
actgcgaagg gtcaaaagtg acgtagcaaa ggacttacct cagaagaaag aaatggtact 2700
attttgtaaa ttgacacagt atcaaagaaa taagtaccta gaattcctga actcaaacga 2760
attgaaacaa attaaaggtg gaagaagaca tgttctatac ggtatcgaca tcttgaggaa 2820
aatatgtaat caccctgata ttctggagag agaggagaag caaaacgaac tcgactatgg 2880
taatcccagt agatctggta agatgcaagt tgttaaacaa ctattattgc tatggaagaa 2940
agatgggaac aaaaccttgc ttttcaccca atctagacaa atgttggata ttctggaaaa 3000
atttgtagca agtggagatc ctgatttgag taatatcagt tatctaagaa tggatggtac 3060
aactaatatt tcaaagcgac aagctttagt agacaggttt aacaatgagg atattgacct 3120
gtttttatta actaccaagg ttggtggcct ggggataaat ttaacaggtg caaacaggat 3180
tatcattttt gatccagact ggaatccatc tacagattta caagctcgtg aacgtgcgtg 3240
gagaattggc caaaagagag aagtttcaat ttatagatta atggtgtcag gctcgataga 3300
ggagaagata tatcacaggc aaatctttaa acaattctta actaataaaa tcttaactga 3360
tccaaaacag aagagattct tcaaaatgaa tgaactacaa gatttattta gtttaggagg 3420
agatgatgga ttagcttcag aagagcttgc gaacgaggtc gagagacata cgcagacact 3480
gaaggaatct aaaactaaac aaagtgatga ctttgaacag gttgccaata tagcaggtgt 3540
ctcgaaatta gaaggtttct tttcaaaaga agaaaaagaa gccagtaaaa atgaagatga 3600
aagattaata gcagggttaa tcagcgaaag tggtaactta gaaaatgcca gtactcatga 3660
acaggttgtt ggatctcata tgacatctaa acattctacc aaattaattg caagagaagc 3720
tgaaaaaatt gctgggcagg ctgtcaatgc tattcgtgaa tctagaagaa agacccagaa 3780
atatgatatt ggtaccccaa catggacagg taaatttggt caagctggta aggtcataaa 3840
gaaaaaaata aagccgtcaa agaaaaatgc tctggcatca tcagatatct tgaagactat 3900
tcgtgatcgt caaatagaat cgaaaaagaa tgaatcgttg aatgacttgg ctgatccaaa 3960
ccgtaaatta atgatgaaga tcgtaaatct tttaaatgaa tcatctcagt ataccttacc 4020
atctgcttct atcattgagg atcttaacat agatgtaaag gataaaaatg ttattatcaa 4080
tgtcagagct ttactaagag ctgttgctaa atttgataaa gtgaaaaaaa tgtggacatt 4140
gaacaatgaa tttgttaata attgagcaaa cttttttccc caaggacaat taaatactag 4200
aggaagaaaa gttagccaca gaaagagaat atatatggat ttgcattatg aatatataaa 4260
tatttaaacc ttaacggaaa ccaaaccctt cagattcggt tacagctaac atcaatttat 4320
tctctagctt ttcttttgag gaatattccc atatacacag ttcgttaaaa catgtgtgtg 4380
caattggcag atcattacta tccctagctc ccaatctgct tatcttaaaa gtcaaggttg 4440
atatacctgt agctggtact ctattagaac tagtaatgaa ttgtaacact ttaccttgca 4500
ttttataatc ccaattttcc aaaatctccc aaaaccaatt tacaacagaa gtttcattgg 4560
taaaaccgcc ttgatattta gtaacagaac gtaacatttg aaaatcatat tttgtatgct 4620
catcatcccc acataataaa cgttccaatt cttcagaatt aaataggcct atggatttac 4680
aatttgaaaa gactctgctg aatccatcca taaatctttc aaaagacgct gctaccgatt 4740
ttgttagata gaaatcaatc cacaatttta cataatcgga tttatttgac tgagttaccg 4800
ggatattaga gccattcttg caaagttcta ccgtcacagt attcgaacct ttggattttt 4860
tactaccttt attattcgct gttagtttat tatgatatgt ggtttcaaat gtaaggcaga 4920
aaacgtcatt aaaatcatcc ttcgaatatt ctaacatctt taataaattt gaagctgtct 4980
ctggatatag ctctgtgtaa tctgcaaaag ttaacgtttc attgcacatc tttttataaa 5040
gtgcctttgg gaatgataaa tctaggatat ttccgttaaa cattgccaat gctatgacaa 5100
cacccaacag gtaataatat tcttcttgtg attgaatttt ttcttttgat ttggaagatg 5160
gtacaattgg aaaccaacat aatctactat cctttatatg atcaaacaaa ccagttgtcg 5220
gactgaataa agattttgtt aaaaggataa accattcctt tctcaaacca cccgcatcga 5280
taccaggttc tttaataaat tcaattctta gagatttcaa taaatcacct tgatgctctt 5340
tgataacctt taatgaatca tgggtaatat gatctcttcg tattttaatc ttaaaataaa 5400
cttctatagt ctt 5413
<210> 26
<211> 2985
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c4733_g1_i1
<400> 26
tcagctacta atactggtgc cacctctgtt cataaaccag taggtatcgt cactccaggt 60
tatggtttgc cttattttgc ctcttcttta acagagacta aaaataactt cttattcaat 120
gttgctgctt tatcttatca agataaaaaa aatagattag gtagcgatta cattactcca 180
ttatcaatcg ctaaacaatt aggctttaac gttattacgc ctgtttcaaa gaaagaatta 240
gagttaactt ctttattatc tgttgcttta gctactttat ccaattccaa ttccactatt 300
catttattcg atggtttaac ttctactcgt tcattttcaa ctttaaatag taacattgtt 360
aactccgaat ctttaattgc taatttagct aagactctag gtaatgaacc atcctttgat 420
gctatcttaa agggtttcaa tgaacaaata ggtagtcaat tgacaaagtt ccaatattct 480
ggtccttcaa atccagaggt tttattcgtt acttatggta ctaccgaatc tgaactattt 540
agttccgttg taccaacttt gtcagttaga gttccattac catttgacac taacgaattc 600
gttaattcaa ttccatcaag tgttaagaag attgtcatca tcggtcaatc attgaatgaa 660
aatcatgctg tcccatcttc cctaagatta gatgtctctt cagctttatt cttccatggt 720
cgtaaaaata tttcaattca agaacatatc tatcaacctg attttgcttg gactactcgt 780
gaagtatcca acatcgcaaa ccaattcgat gtcaagacaa tcagtaccgc tgctcaaact 840
ggtaagcatg ctttatttta tctaccagat gattccaaat ttattaatat cccagctact 900
ttagtcaaga ctttagcttc tactactaac gatattcaat tctctactaa attcaataac 960
tctgttcata gtggtgcatt tgaagctgat attgcagttg gtaatgttga aacaggtact 1020
gcttctgccg atttcatctt ggttcaagat atcaatctat taaaccattt agatatcgtt 1080
aatgctatta agggaaatgg taccattgtt tatttagcta atcgtgatat tactaaatat 1140
ccacaacaat tcatcgctga tttaatcacc aagaaaatta ctttagttat tgttgaccct 1200
actgaatacg aagatgatat cgattcctta gttgctttga ttcaaggtca attctatcaa 1260
tctggtttac aattagctaa taaccaaatt caatcaaaaa ttgtatctaa tttatctcaa 1320
gaacaaattc atgatatttt gaacgctaat gaggattcag aagaatatca attctcaatc 1380
tttactgttt caaacttacc tgaacctgaa ttctccgaag aagttcgtga acagttacct 1440
tctttcttcc aagctgattc attcaaacca aataatatta aacaacaaca agctattgtt 1500
aatgacccac cttcaattac ttcaacaatt actgaattga ctaaaagatt agctttcaag 1560
gaagcatacc acgttgaaaa gaaattaaga ccagatttac cactaattaa gaaccacata 1620
atcaaggtta aagaaaacag acgtttgact ccagcagact acgatagaaa cattttccat 1680
atcgaattcg atatctctgg tactgattta acttacgata ttggtgaagc tcttggtatc 1740
catgcaagaa ataacgaaca acaagttctg gaattcttac aatcttatgg tgtagatcca 1800
gaacaaatcg ttcaagtacc aaacaaggat caaccacaat atattgaatc aagaactgta 1860
ttacaagtat ttgttgaaaa tctagatcta tttggtaaac cacctaagaa attctacgaa 1920
tccctaatcc cattcgctga agatgaagat gaaaagaaat ttttgcagga tttaattact 1980
ccaggtggtg cattggaatt gaaaaatttc caagaagtcg aattttattc atatgctgac 2040
atctttgctc gtttcccatc agtgagacca gaattagctg atttgattaa tatcattgct 2100
ccattgaaga gaagagaata ttctattgca tcctcacaaa agatgcatcc aaatgaaatt 2160
catttgttaa tcgtcgttgt tgattgggtc gacaaacagg gtagaaagag atatggtcaa 2220
gcctctaaat atatctctga tttacaaatc ggtcaagaat tagtcgtcag tgttaaacca 2280
tcagttatga aattacctgc tgatccaaag gctcctgtca ttatgagtgg tctaggtact 2340
ggtttggcac catttaaagc aattgtcgaa gaaaaattat ggcaaaagca acaaggttac 2400
gagattggtg atatcttctt atacttgggt tccagacatt gtagacaaga atacttatat 2460
ggtgaagttt gggaagctta taaagatgct ggtatcatta gtcatatcgg ggctgctttc 2520
tcaagagatc aaactcaaaa gatttatatc caagatcgta tcagagagaa tttagacgat 2580
ttgaaggtcg ctatgattga tcaaaacggt tctttcttct tatgtggtcc aacttggcca 2640
gtaccagata ttacttctgc tttggaagat atcattgcag ctgatgctaa ggaaagaaac 2700
gttaaggttg acttgaatga agccatcgaa gaattgaagg aaacttcaag atatatctta 2760
gaagtttact aattcgttac atatatattt atttatgata catttattta ataaactttt 2820
ttttagtaaa tatttctttt tttttgttgt taaaatatag tacgaatatt ttttttttac 2880
ataagactga ctacagatgt accatcttgg aatcttgttc tgaacactct gttggcatta 2940
gtgaacatac cttccttcaa agaaacaaca ataaactgag cccca 2985
<210> 27
<211> 2512
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c4679_g6_i1
<400> 27
taataataat aataataatg atgaaattcc aatattgatc gattcaaata acaatgaaac 60
taggaatatc gatacttcta aacaattacc tttaacacat catgaaattg attttgaaaa 120
tgatctaact ttggaagata gtgatacaga tatcgatatg atgatggatg atgaagatga 180
tgatgacatt gaacaaaata atacaactta caacaacaca gtgtttcaag gtaataattc 240
ttccaggaga agaatgcgtg attattttaa attcaattta ttcaattctt caaatcaacc 300
aaaggttaca aacgaatctg aaattttatt gaagtcagag gaaaaggata caacagcaat 360
actcccacaa ttcaatgatc aattcaattt aaagaaaaaa tcctcatttt ggaaccctaa 420
gacttcatct ttcttgaaaa ggtacaatag taaaaataaa gatggcatta ataacactaa 480
tattcctgat atagatgaca tctcaactag agatttcgaa ttacctgatt tattcgatat 540
tgaaaatcat ttagttcagg actcatcttc atcctcatta ttattaccaa tacaatccgt 600
agaaccaatc ttcaaaaata cgttgaatcc aatggtcact atgaaccaag ttactacaca 660
attacagcaa caatcacaaa ttaattcaac tgcaacaaat acaactatgt catcactcgt 720
ttcaccttct tctccagtaa tgacaattgc tccctcattg gtaactacag atgatatttc 780
aaataacttc gtcgaaccaa ttaaatcaat gaatccattt gaacaagata tgaatttgag 840
ttatttcgat attaattcaa ataatgttct taacgacaac acagttgaag aggaaccact 900
taaatcagtc cttgaggaac caccattagt agaaaacgag actccaagaa aagtcacccc 960
aactacgccg gctccttcct tgactccttc cttgactcct caacaaccta tcaagagaag 1020
aggttcaaat actctgccaa agacaagagg acgtaaacca tctttaatcc cagatgccag 1080
caaacaattc tgttgtgact actgtgatag aagattcaaa agacaagagc atctaaagag 1140
acacattaga tcgctgcata tctgcgagaa accattcacc tgtcacatat gtcagaaaaa 1200
tttcagtagg agtgataatt tgaatcagca tatcaagact cattctcacg atgaaacaaa 1260
ttgatctggt tcctctaagc tccttttctg cgctcatata tagagataca tacatataga 1320
tagatatata gactcgtgtt ttaactgata taatgaataa tgataaatca acctttttaa 1380
atttaatgtt tctgaataga gtcatctaaa cgtggttgtg acttctggtc tctgatagtc 1440
tgccgatttt cgctgcaaca gaacaatgag ctaccaaaaa aagagaaagt atgggcgtta 1500
ttattagaat aagaacatgt cgatatctca agtacaacat aatgtgggaa gatctaatat 1560
atccaataac aagaccaagt gctatcaaaa ttgcacattt tatcccaacc caatgtctgg 1620
ttacaacagc aataccgctt atcaattgca actagataga tttctgaaat ctccatcaca 1680
taataatcaa tgttcagatt gtaagaattc aaatccaaca tggtgttcta catcctttaa 1740
tgtatttcta tgttccagat gtgcatcctt gcataagcaa ctactaaata aggaccctta 1800
ttattccaat atcaaatcga tcaaattgga tacgtggtct gatgatgaat tgttcaattt 1860
catacataaa ccaaatcaat caatcaatag agacatatat actacttcag acaatgcata 1920
tgacttggaa caattaatta aaagaaaata tatggatcct ggactagaag tcggattagc 1980
taaaagaaga gccaatagaa aatatcctct attaacaaat aggagaccaa gagattatga 2040
attaagcaaa tattgtagac atatcaggga aatcgaatca tatgatagga gattcactaa 2100
tgaagataat attgtggaag cattatccat ggctcatggt aatattgata acgccattga 2160
aatcttaaga tataatgatg agtaccttaa ctcaagagac tacagagatg attatgatag 2220
tcgaaacagt agtagatcat cgctatcaga aaacagatat cgcaaccgtc cggacagcaa 2280
ccgcacccca agtttgccaa gaagaccaga taatagtgga ccaaaagatg ctgtatttga 2340
tgggtcattt ggtaacgcta ctacaactac aacaaaagct cccaaggcag ctgtattcga 2400
tggtttatca cctgatgcct tatctaatct ccaagcatct gaatatcaag tacaacagaa 2460
tgaattgatg aaacaacaaa tgttgcaaca acagcaacag caacagcaac aa 2512
<210> 28
<211> 3243
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c4955_g2_i1
<400> 28
taataataat aataataata atacaacaat cttaccatct actcatatac ccaattcagc 60
cattaatgat attaatagaa ctgctaccgc tgctacaaca acaacaatta ctaccgctaa 120
gattactact tcaaaatatg atcaatcaaa aattcataat ttaccaagtc caacttgttc 180
cgtatcaggt aataacaaca atgctaatat caaaagcaat agtaataaca acagtaatac 240
taatagtgga gtctctactc cacctgaaga tgtagaacca atgaatttag tttgtaaatg 300
ggataattgt aacaaaatct tcgttcaacc ggaattatta tatcatcatt tatgtcaaga 360
tcatgttggt agaaaatctc aaagaaattt acaattagat tgtcattggg ataaatgtca 420
aacaaagaca gagaaaagag atcatatcac atcacatatt agagttcata tcccattgaa 480
accatttgct tgttcatctt gttcaaaaaa atttaaaaga ccacaggatt taaagaaaca 540
tttaaagatt catttagatt ctggtaacat tatgaaaagg aaaaggggtc caaaagtcgg 600
ttctaaaaga attaataaga atggtattaa atcaatagat aataaacata ttcaaacagg 660
tattgatcaa agatcaagaa gtttaccttc aacaagcttc actaatttac ctcatttaag 720
taatggtttc agaaaattca ttactaatga tattcaatct tatcaacctg tattgactca 780
tagattagat acaagattac aaaatataat gggtcaaact cttactgctg ctcaattaca 840
agaacaacca catttatatc atccaatcga taaaaattta caaggtccta acatgtctag 900
tgaaagagta tctgtttcat cagttatgga tacattacca cgtcatgtag cagctaacgc 960
agcaggtttc ttttcagaat tatctaacaa catggctaat aacacagcat tatatcaaca 1020
tcatcaacaa cagcaacaac aacaacaagc acactcaagt attcaattgc aatcacatcc 1080
acagacttta atcggtaact attctaaatt gccaccattg aatggtgtta catcataccc 1140
aacacaacaa catacaacaa tgattgaaag ttcaatgaat gtcccaacta acaaaatgac 1200
tatgttacca tcaatggcag aagttactgg tctacaacct agataccaac aacaacagca 1260
gcaagcacaa caacctagaa gtagtccaaa tgcaactatc atctcatcat atccaactat 1320
ccaaaacatg cctcaacaac aagttccatt acagggacaa gttatggcaa gacctatgcc 1380
aggtactcaa ttgccatata atttggttgt taatgcaatg cctgttgctg gtagtacaat 1440
gaatatggtt gagaatagat atagtacatt acaaagatca actggtcatt ctagtggttc 1500
tgatgattca gaatctgatt cagaatctga aagagattat gaagaagaag atttcgaaga 1560
aagcttggat tttgttaatg ttattagaga ttatttgatg tgtacattat tagaagaaga 1620
atatgatgaa tccgtcgatg ataaaattga ggatttgatt aatgataaat tttggaagga 1680
atcaaaaggt ttgatatcta aatatccaac tattagagtt tgaaagaaat atattaagaa 1740
ttataaataa aaattaattt atgtaatgta tatcaataaa taaatacatg aaataaataa 1800
atataaatct gccacaatat ggaatggaat atgatgtgat gtgtgtacag atgtatatcg 1860
cacaaatgaa ttaatgctaa tgttgtaaaa ttgtgacttt tttactagtg ctttttttta 1920
ttgcgggaag gaatgaatat tgttttatgg ttgatgatag aatgtaatgc ttgagttcaa 1980
ctatgaaccg aatgattcat tagactctta ggtaaagaag atgtaactga tctagaattt 2040
gtagcggaac ttgaagatct tacaattttt tcatttgtat tatttgcttc atcattattt 2100
gatgctattg atggaattcc tgtattagaa tcacagtctg attgttcatg ttcatgttct 2160
tctaatggag gtccaacata caattgagat ggtacttgtt caacattatt tggcattgaa 2220
aaattgacga tatgtgtgga atcatttgta ttaactttaa atataaaact attatctctg 2280
ataactcttt catatattct atttcttact acgattactg caactgcaca tggagtagta 2340
ataaaaccaa caactgcacc atatgaaagt ttcattaatt gtggctctgg ataattatta 2400
aaatacctat catgattatt tattttatta ccaattgatg caaagatacc catacaaatt 2460
ggccaaacga agataaataa tgcaacagcg aataacattg ctctaataac tttcctaatt 2520
aaccattcaa taatattaaa atataatttt tcatttggat atttaacaac atgatttctc 2580
caaaatttta aaagggacat cttaggtaaa tcaatatctt tattattact atatttatta 2640
tcatctctat gtctcattac acctgtatta atttcaaaat accattgcat atatcttgaa 2700
aatttatttc ttttaacaat gattgaatta taccaaaaat taacaaacca tgaaggaaat 2760
tcaaatgtat tgtttaaata atcatcataa cctaccatta attcttcaac aacccaagta 2820
acaccaactt gaatgaataa agttaatgca caatcacctg ataatgtatt agggaatgcc 2880
cataatgtaa ctaaatgagg agatttctga tacataccat aagcgatacc aaattcagaa 2940
ccaccaccta taatggcaga acctaatcct tgatataaga atagataaac tattgaaaat 3000
atcaacgggt attttataaa tagtgtaatc atctatcgta agttatgtgt gtctgtgtgt 3060
gagtgtaatt ttgtgatgat ctgggtttga attaaatata attgtaattt gtttagttta 3120
tagtagtacg tatatgtgag cttatctttt taccaaatca atgaacagtt ttataacgta 3180
tcttctttcc ttctctctta aataaataaa taataataat aatagaattt aaatatgcag 3240
cag 3243
<210> 29
<211> 5008
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c8855_g1_i1
<400> 29
cttaagaatt agatatataa ctaaaattaa gttgaaaagt tacgttacat gatgcggata 60
ctacttcaag ctagtgaata attgaattca caatacttgg gggggtacta gaatgggata 120
atacattata gtgtgtggta gttcaaaaat gaattatacc agacggtatg gcatgctata 180
ataggttatc tataaactat ataaatcaaa aaaagagatt tttacatgat gatacaaaat 240
acattgctcg aaggtcttgt atcgatgtga gcaatacgat ttctcaatag actacaacgg 300
tatctgattt ttatcttttt tatgtaataa gtacttgttt tacatacttt tttcacttga 360
tattctataa tgaaagttag atgtctattt aaattggcca ttttcgttat tttaaagtac 420
aactgagttt tgattcatag tcattatagt tgtgaatcgg tttccatttc ttgaatgtcc 480
cgatgagctt cttcatattt gcacacagat gcattgtaaa agtacgtttt tatattatag 540
taattacgaa acttttgtga ctagtattta attattttta aaaaaaaaca gtaagaagct 600
gatatcaagt cgtacaataa gaaattctta acaacaagga actatgtcta aaagagttag 660
caatactgta cggtataaac acccactacc aatccatcca ctagatctac ctcaattatt 720
tgttcataat ccgatatcat ggacctactg gttgtatagt tacatcacaa gttataatgc 780
atcatcgaca aagatacatg tagatatcac aaaacacaac aacttcatac atatagaagt 840
tagagaacta accgatatga aatatttgtg ggataatgga ttctttggca caggccaatt 900
gtccagaagt gaacctacgt ggtatgaaca aacaagcaag aagttccaaa gcagtggcga 960
tgaaaagagc aatggtatca gtttagagcg agtcactaaa ttaagaagac aacaaagagt 1020
tgagttcaag aaacaacgtg aaatagtaga ggagaaattg ttacaactta ggagagaagg 1080
taatttgaca cctgaacaag aagcagagat tcttgaacaa gaaagagaca aactacgtaa 1140
atttaaggat gatcaagttt cattagacag aatggatcaa gatgaagata ttactgaaca 1200
agagactaaa cgactgctat tacagcaatc tgaaatattt gatgaaaatg ataatttatt 1260
gaatttggaa tcactcgaat taatgcctgt agagacgata ttcttaagtt ttgcattacc 1320
tattttagat atatcaccag tagatttcat tttgaagtgt tgctttactg atatatctcg 1380
gtattcagag gaattacaca cattattgat ccaatatgct gcatatcatc attacagatc 1440
tcatggatgg tgtgtacgct ctggtataaa atttggtagt gattatattt tatacaagag 1500
aggtccacca tttcaacacg cagacttttg tataatggta ttagattcta attgttcgaa 1560
accttataca tggtattcta ccattgctag agtatgcggc acggcaaata agacattagt 1620
cctttgctat gtagaacgtc tagagactga agaacaaata ttagaatggt tacaaggggg 1680
acagttaaca aaagttttta acagctttaa ggtcggtgag gttatataca gaagatgggt 1740
agcaggaaga aaccgtgact aataatgtca gatcgatttg aacgactgaa tgaagagata 1800
atttattact acttggtatt tactatgaat gtttaattat ataagttcaa aggtatataa 1860
ttgttttttt ttccattata tgaatgcatg catgtgactg tttattctag tctataataa 1920
cctatttacc attgtatgta tctagaaccc aaccacaatc tctcattccc ttgatgtgtg 1980
cttttggtcc ttctaccaga tcgtcataaa ttttgtttat cttctgatca tgggtagctg 2040
gttttgatat acaaacaacc tcttcgtttg gtaggatgca ttttagttgt gattttggaa 2100
atggacatgg acaatcggac ttcgataata cacattctag cgtaacaggg cataagtagt 2160
ttttacaatc actgtctaat actctgtctt catatggttc taattttgga ggttgttgct 2220
gttgctgttg ttggttgaag ttaaacataa acccttgaac atccagcaat aaaaaaaatg 2280
ctgctactag tatgacttta ttaaatgatc ttaaatagga catattattt ccttctctct 2340
cctggactat tcgataacga cctaatgttc atatatctac ccgtatatac gataaatatg 2400
ataccttgca atcataataa actgtgttat ataaatatat atgtttgtat cctggaacta 2460
gatggttggt tagttaacac agtggcatct cttcgctcca tgttgacatc aaagtggtga 2520
agttcgtggt tagtgatatt ctacgcgtta attttttcga tttcaacaaa cgcgaaattc 2580
tgttttgatg aaacttctct tttcaataac aacaacaaca aaggttgaaa gtctggtcat 2640
tccatctttg tttcgttttg acattgtata tatcaatatg tgtcaagtcg tgttgtgcag 2700
aggaagaaac aaacgacgat ttagggtaat ttcagtagct gcagaagacg ttgaggagga 2760
tgtattacag gatcgatcaa tataccaacg catataataa gattctcaga gaatcttcaa 2820
atccgtcgac atgccaacta gtcatttttg tgtcatgttt gaatatcgat gcactgtgtg 2880
caactaggat gttatccacc cttttcaaga aacaactagt ccaattacaa attgtacctg 2940
tgtttggtta ttctgaatta aaaacacatt ataagaaatt agatgaaaac attaatagca 3000
tagttttggt cggttttggt agttacattg atattgagac atttctagaa attgaccctc 3060
aagaatatgt gttggatacc tcatatagtg aatcgttaat tcaaaaacca gagaataata 3120
catacaaaag atacatttac gtattggata gtcataggcc gtggaatcta gataatttat 3180
ttggttctga cattgtacaa tgtttcgatg atggcacagt ggaagattca ttaggggaac 3240
agaaagaggc atattttaaa ttgatagggc tagaaacagc agcaggagat gataactcgg 3300
aagaagaatc agatgatgag gaaaacacag atgacgatga taacgacgat gatgaggatg 3360
ataatgactc cttagaaaat ggtaagagac tacaccctga tagtataaaa tataagaaac 3420
aagctcgaaa acaaaggagg aaagaaataa gtcgatatga aaatgtactg gaagaatact 3480
actcccaagg tactacagtt gttaattcaa tatcatctca agtctattca ttgatttccg 3540
ctattggtga aactaattta actcaattat ggctagccat cctaggtgca acttcattag 3600
atactacata ctcctcagtt tacaataact tatacccaat tatgcaggac gaagttaaaa 3660
ggttatcacc tgggaatagt tttctcgtat ctgcaacacg ttcaaatggt acagggtctt 3720
cttcaaaaac accagatact ttatctcttg aagttcagcc agattactat ctatttttat 3780
tgagacactc ttcattatac gacagtttct attattctaa ttttgttaac gctaaattat 3840
cactatggaa tgaaaatggt aggaaacgac tgcataaaat gtttgcaaga atgggtatac 3900
cattaagtac tgcacatgaa acgtggcttt atatggataa ctccattaaa agagaattag 3960
gaaatatttt ccataaaaat ttagatagat acgggttaca agacatcata agagatggtt 4020
ttgttcgaac atttgggtac aggggatcta taagtgcaag tgaatatgtt gaatcattag 4080
cggcattatt agaagctgga tcaacggtga acagctcaaa tcatagtaat acatctaatt 4140
cccctgggaa atctagtagt aatgataata atagcaatga taatgacgat gatgataatg 4200
gtgcacaaga agaggatgat gagcaggatg tagcagcagt taaccgtaag aaagcgttgt 4260
cttccatgga aaatatcaga aaacaatggg tttctaattt ctggttaagt tgggatgcat 4320
tagacgaaaa gaatatagat atattatctc gaggtattaa gcatgcacaa tttcttcaaa 4380
aggcaatatt taacaccggt gttactgtcc ttgaaaagaa aatgattaaa catttaagaa 4440
tatacagatt atgtgtctta caagatggtc cagatctgtc aatttatcag aacccattaa 4500
cattattgag attagggaac tggttaatag agtgttgtgc cgaggcagaa gataagcaat 4560
tattacccat ggtattggct tgtttagatg aggatactga cacatactta gttgctgggc 4620
tttcaccaag atatccaaga ggtttagata atttgaagaa gaaagaacct atattaaata 4680
actttagtat ggcatttcaa caaatcactg cccaaactgg tgcaaaggtt aaaattgaca 4740
acttcgaaag ttctataatc gaaatcagaa aagatgattt gtcaccgttc ttagagaggt 4800
taacattaag tggattgtta tgagatagtt ctgattataa catgaatagt agaatgaaaa 4860
gagggaaggt tattaaaata atataaaaaa ttaaataaga tagtcatatt cacattacat 4920
agtcatatac atatttacaa taatttaata tgttagatat tagaacacat cattggagtt 4980
ttgatgatta taaatattct tttgttaa 5008
<210> 30
<211> 3479
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c7636_g1_i1
<400> 30
gggaaagaag gataatgatg agttccgcca agagaccact tcaagaagtg gataatgagt 60
tgcttgattt tgcagctcag aatgaggcta atattgaaca tgataaagaa caagccccta 120
agagaaggaa acgtatttat gaagcgatta cccaatactc catgaacact caagatgagg 180
caggttctaa ttcaaattta tcttatcctg gttatatcaa gaaggtgaaa ttacgtaact 240
tcatgtcgca tgaaaatttt gaattagaat tgggtccaca actaaatttc attgtaggta 300
ataacggtag tggtaagagt gccattctta cggccattac aattggatta ggtggtaaaa 360
ccagtgatac aaacagaggt actaaattaa cggatttaat aagagaagga accgcgtcaa 420
ctaagattac attgtattta gataaccgtg gtccaggatc ctacgatcct gagaaatttg 480
gtgataccat tattattgag agaacaatta gacgtgatag ttccaatgtg tttagtgtta 540
agaccgaaaa tggtaacgaa gttgggaaca aaaagaaaga tgtccagctt attgttgatt 600
ttttctccat cccaattata aacccaatgt gtttcttatc tcaggatgct gcaagaagat 660
ttctgacagc cagtacctca caagataaat accatcattt tatgaaaggt actcttttag 720
aagatactaa aataaattta gataacgcaa gttctattgt tagtaaagct caagagaata 780
tgaggttaca tgccggttca ttacaagtac ttaaacagga atataaggac tccaagaaac 840
tagcgcgtga gttcaataaa acaagtgatc tgaacgaaaa gaaaatgcta ctatgtgcca 900
agatcttatc tctcgatatc gaggctaata ctaaatccag taatgccgtg gaacaggaaa 960
ttgtcaataa tgggaaccag ataaaaaatt ttgacaaaag aatcgagaaa cggaaagcag 1020
atattgaaag atttgtttcg gatcaaaaga aagctgaaga gggtattgaa aatcaaatga 1080
atataattaa tacaaaagac caagatttta gagctaaaaa agatgaagtt gcaaaattga 1140
gggctttgta taatgctgaa gaacgtaatc aaactcaaac aaaacaaagt attactgatt 1200
gtaaaaacag aattcagctt ttcaataaga aaattgcaaa gttcgaacag aaaatcaatg 1260
aagaaatggg tggggataga gaagcaatga aggagcaact aaagacactg gaaaaggaaa 1320
gagacgtagc ccaaggaaat ctttcagcta tgcaaactac attaagggat ctgcaaaata 1380
gggaaaagag cgaatgtgac caacgtaatg ttgaagttcg cactttagaa gatggtattg 1440
cggcaaaaac gtctgaatac aataaaatca agactggtaa taacgatttt ctacttaact 1500
tcgacagaaa aattaatcag ttatttgctg aaattgagcg caataaaaat catttccact 1560
ctatgccaat aggtccactt gggaggtttg taagtataaa acgtgagtat aatcaatgga 1620
cccaaaatat ccagaaattt ctgtcatcga cagtcagctc tttccttgtt acagacttaa 1680
atgacgatcg attattgaga aggataatga aaaaatgtaa tattagaaat attggtgtat 1740
taatttacaa aatgaaaagg ctcgatgttt cttcgtttct agttcgagca tcatatccaa 1800
ccatttacga tgccctcgtc tttgataccc cagaaatgga aagtttattc attgatgtaa 1860
catatttgga aaaagtggtt ttgatagaaa attataaaga ggctaggaat ttcctacaag 1920
gaaatcctgg gagaattcga attgcattat ctttgaggga tcgtaatggt ggttatcagc 1980
tacgtggtgc aaaccagtta gattctgtca aatatgaatc ccagataaaa attaaagttg 2040
gttcttcaaa cgaggataat cttgcatatc tgaaacaaac tattgatgaa gaaaggaaag 2100
agatagaaaa aatcaagaat aagtacgaaa cagttatatt taatacaaga caagagatga 2160
atacgaccaa ccaaacaatg aaacgtttgt cggaagatat taagagaaag ggacatgaga 2220
tcacgcaatt aactgttaaa gctaatgcaa ttgtggatac tgggttactg acatcgatga 2280
acgaagaaag agataagcaa gaaggggcag ttgcagtata tgaggccaca gtaagggaga 2340
tagatgctaa attggatgcg ctccgtgaaa agatacagcc aataaagata agctatgaca 2400
atgcaaaaca cagtcttcgt gaagcaaata aaacattaga tgaacttaaa gcagctgtta 2460
atagccgttc agacaaagta gaaagatata atgctgatat ccagaattgt gagcatgaaa 2520
tagaaaagct atctcaaaag aataaatcgc tggaacaaaa taaagaagtt cttgttaatg 2580
ggatcactaa gcagaagctg agtttggaac aaatttgttc tatggaagaa ttacagaagg 2640
ctaatctacc tgataaaaaa gatgagttga aacgtgagat tgataagatt agtaaagaca 2700
taagaagggc agaaaattct attggtatat cagaagaaaa ggtggtccag ctatttaacg 2760
aaagtagggc aaagtataaa gatgcataca gcagattgga gagtctagaa actaccttga 2820
tacaacttca acaatccatc aaagagcgtg taatcaaata taacttaaat gtcaacgaaa 2880
cgttcttaaa ggccaatttg gacttcatcg gatctctgaa aatgagaaag ttaaccggga 2940
aattattctt taagaaggaa gaacatagtt tagagatata tgtttcaaca ccgggtgata 3000
caacggaaag aagtgtagat accttatcag gtggtgaaaa atcgtattca cagatggcac 3060
ttctattagc tacttgggaa ccaatgcgtt cgagaattat tgcccttgat gagttcgatg 3120
tttatatgga tcaagttaat agaaaaattg gtaccggttt aattgtaaat aaattgaaag 3180
acaaggttag aactcaaact atcatcatca cacctcagga tattggtaaa attacggaca 3240
tcaacgattc tggtgttatg attcatagga taaaagatcc aaaaagacaa aataattccg 3300
ataatcgtgc aaattagtgt ctttttttag atatttttat gacacggtcg ccaatatggc 3360
gtgcatttac ttgtttgttg atgcgacttt ttcgttggaa tttgaatgta atattaaact 3420
atataaattt tataaacggt aaaatatata tatatatcaa attatgagat aaaacaaat 3479
<210> 31
<211> 3024
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c4682_g2_i1
<400> 31
aattaactat ttatcaggtt attattatta ttatttaacc cgaataactt tattgaacaa 60
tataaaagat agtatattca actccagtta gtatctcctc tgacatttaa atcattacgg 120
agttttcttc tcttttacct ttttttttac ttttgctttt cctgaccata ttaaacataa 180
cggtgaaaac atacttttac aatattaaac attccttttt gatccaagga gaatataaat 240
aaatacaaaa aaaataccga aaagaaataa atcatttcag agttaaagta aatacattat 300
accgattaaa ttgtaagaac atatagatta ataaaacaaa ataaactact aataattata 360
atattccaaa ctccgtcgtt ttgaaatata cactttcttt ctttttttta cacagcctac 420
tttgacagta taattaaaga atattaaaat taatcattat ctttttattg ttattatatt 480
tttgcacctt atttcccaac catcccccaa aattatttaa acagcatact ctattaaaag 540
ttttgatata aattaaaaaa aaaaatagtc atttgatctt ttattaataa taataaaaga 600
ataatacccc aaatacataa caaatatgca agctaataaa gaacctgtaa ttgatattac 660
tatacaaagt ggtaacgatg aagtaataat attaaaggga ccacctgaaa ctgctccacc 720
ggtattatta tcaggtatca taacattatc tacatgtgaa actgtaaaag ttaaatctgt 780
ttcattaaga ttaacaggta gaatgactta taatgtaccc attataaata aagataaatc 840
gaaaaaggat aataatgaaa ctatagatca aacaaaagtt aaacgtttat ctgctgatag 900
atggttatat catcataaat gggatgattt tacaatagat aattatttta aaggtttata 960
taaaaattat caaacaaaga cacctattat ggattctaaa aatgttagac atactgtcgt 1020
accgtcacat ccaatagcac aaggtacttc agctccattt aagaatggtt tagcaagacc 1080
aagatcaact acttctttat tatcattaaa gacaaataca aatatttctt cacctttcca 1140
aagaagaaaa tctcatactt tattaaaggg gaaatatgaa ttccctttta catcaatatt 1200
accaggtgat attaacgaaa ctattgatgg tttacctgat acaaatgtaa attattattt 1260
agaagcaatt attgaaagaa ctaatggtaa atctgatctt tattgtagaa aatatgttag 1320
aattgtaaga acaattactc cagatatcgc agaaatttct gaaacggtaa atgttacaga 1380
tacatggatt gatagaattt tttattcgat ttctgttggt gcaaagactt tagccattgg 1440
ttcaaaagtt ccaataaata tttccgtgat tcctttacaa tcagggatta gactaggaac 1500
gattcgaata tcattatatg aaactgcaga atattgcttt aaaggtacaa gaacaaaaat 1560
tgatcgtgtc gtatcaagat tgaaaattga aaatccggaa aaattattag ttaaattaat 1620
taaagacgat aaatttcaag agaaatggga attagatttg ccatttagaa tcccagcaag 1680
tttatctaaa tgtactcaag attgtcaagt tatcaaagaa attagagtaa cacataaatt 1740
taaatgttca atcaattttt ataatgcaga tggacacgta tcaaaattaa aggctaattt 1800
acctgtttgt ttattcattt cagaatttgt accattgaaa gttagacgaa tggaatctac 1860
aacagatttc acttgtatca caaaggatat atcaaaccag attcgtaacg aagatgcaaa 1920
ggaaacaatt tttgaagcag gtcatacagg attagtatct ccacaatatg ataatgaatc 1980
aatgttaccc gtaagagcat taaatgaatt attggctcct ccagaatatg aaaaccatgt 2040
ctttgataga agattttgta atgacatgga tgtggaatcc tcaatgttac ctcctccatc 2100
agatgtcgcc ccagatttac taccatacga accaaatgaa gagttgtctg catcaaagat 2160
cttaaaggat attgatctag ataattctac tcatagaaga atgagtgatg catcatgtca 2220
aacaatgcct gaatttgcat ttgctagtat tgatacggca gtagaggaag gactacctaa 2280
tgaaacaact aatgaacaac aacttattcc caacctacaa aattatacat ttggccaaag 2340
tcaaacaaac aatagtaatt atagggattc tcgtcataat agtaacgcaa gtaataacat 2400
tcaaacaaat gatatagata tggaaccatt aaatggtagt gaaatgccac ctccatccta 2460
tatggaagat gatcatgatc acataagcaa cacaattcca ccgatttttg gttctattca 2520
agacagacct agacggttta gatctacttc actaacaaac acattcacat cagctgattt 2580
aggaccaatt gctattccag tgccagctca tacaataaat tcatcccaaa atattataag 2640
atcggtacca caagtctcca gaatgagaaa ttcttctgtt tcttcaaaca atccttttct 2700
aaatgatgtc attgtatcgg acgcatttgc aactatggta tcagagacaa ttcaaaggtt 2760
ttcaccaatt gctaactcaa gtagcaattc aagtgaatcg agagatattt ttaactctcc 2820
aacaaacaat caattatacc acacattttc agagccatat gaaaagttaa ctgcacaaaa 2880
cgaacgcaga tattcagtca cagagacaat gaagaataat aacaacgatt ttatatcaag 2940
aaaggatatt ctaactattg aatccagaat ggatcaagta gctatgggtc cagaaaacta 3000
attaaattca tatttggcaa ttat 3024
<210> 32
<211> 2232
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c4154_g1_i1
<400> 32
ctgacaataa taaacctttg ttttaaactc accgaagtat ctctttcctt tggttcccgc 60
cttcctttaa ttgatgtcaa tacgaatgtt atacaagagc ataatttcac gtgcagcaga 120
caatttcaag aaagtccgtt tataaaaaga aacagaacac agggcactga ttaaagcatt 180
tgcacaaaat gaaattgtgt gaatctttct tagatggaca gttgcaaaca ctctcaactc 240
aacgtataac tacatttcag atcattactg gataaaatat cataagttta tgaaaccaag 300
atattgtcta aatataatga ctaagcagac taccgtcatt cagaaagact tatatatcta 360
tatataaaga tatacatatc gattcatcca agatttcccc caaatattga attctattaa 420
cactttcctc aatagcatac cgaactataa cacaaaatgt ctaaagtata tgaattaaac 480
aacaatatca gtgtaacaca acaacagtta gaccactgga ataaatttct tggcacatta 540
tcaacaccac aagagattct aagatgggcc atagttacat tcccaggtct atttcaaact 600
actgcttttg gtttaacagg tttagctacc attgatatgt tatctaagat tcattctcag 660
gttgaacaat atccgttagt tccattgatt ttcatcgata cattacataa tttcccacaa 720
actttagatc ttttacaagt tgtacaagat aaatattata aaccattgaa tcaatccatt 780
aatgtgttca aaccagtaaa ttgttccgat gaaacagagt ttgctaataa gtatggcgat 840
ttattatggg aaaccgatga ggataaatat gatttcttag caaaagttga acctgctagt 900
agagcatata aagaattagg tgttaccgct gtgttcacag gtagaagaaa atcacaaggt 960
gcagctagat ctgaattaaa atttgttgaa attgatgagt tgaataaaat tattaagatc 1020
aatccattag ctaattggac attcaatgaa gtccagtcat atattcaaga aaataatgtt 1080
ccgactaatg aattgttgaa actaggttat aagtctattg gtgactacca ctctacccag 1140
ccagttaagg aaggtgaaga cgaaagaagt ggtagatgga agggcaagac aaaaaccgaa 1200
tgtggtatcc atgaaacaag taggtttgct caatttttaa aagataagaa tgaatcaact 1260
aacgaatcaa ctaccactaa agcctagcga agatgacatc ttaaacagag caatgtttat 1320
ataaacctta catattttat agaataattc atgttacaat tactcaagta aactttctgc 1380
caatctgtat acttcccagg atatatatat ttaagtttgt actagggttc atattgacaa 1440
tccgttaggt ctaattttat tttcttcatt tatatccgct ccgtccttag cggaaggtcc 1500
tatctcgctg aagcaacaaa ttaatagaat ataaagaaat atattggaag aagatcttga 1560
accacgagga tgatatatac tgccataccc tggttaagag aaggtgttaa caacaaatct 1620
atcacaacta cgttatattt aattgctact ataccacaat gggtaacttt agatttccag 1680
taaagacgaa actaccacct ggtttcttaa atgcaagaat cattagggat aattttaaaa 1740
gacaacaagc agcagagaat gaagtcacca ttaaagcatt aaaatatatt gctagaaata 1800
ctgtacttcc accaaaggca cgtttgcaag cacaattgca acttaatatc atgcctaact 1860
atactaaaat gacacaagtt aagaatagat gtattgcttc tggtacagcc agagctgtaa 1920
taagtgattt tagattatgc agaacacaat tcagagaaaa ggcaagggct ggtgagctac 1980
caggtgtgaa gaaaggtgtg tggtaaatcc atattatact gtctagaaat attattttac 2040
accaacattc catgaaaaga aaagtatacc agattgattt tcttatattt taatctaaaa 2100
ttttatatag gacatctggt gtaccatgta catacatata aatacctgtc ataaattttt 2160
gtatattcta ataaacaaac attcgtaaat aacaggattg caatagcagg ctcaagaaat 2220
tgagagataa cg 2232
<210> 33
<211> 7626
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c422_g1_i1
<400> 33
ataagacata acagtgtagg aaaatgttag gaacataatg aagttctatt tattcataag 60
gttcttcaaa atgttggtat gtagaataga actgtttcgg tcgcctaaga tagacctcta 120
gaaaacgtat gaacttttct gttaaaattg gttgctaatg caccccccac attcattttc 180
ttcattagtc aattgacaga aattgtgtat agccacataa atatgatatt gctataacac 240
agaattacgc cgatatttat cgattcctac gttaaatgag tgactcggaa aatatcgtta 300
cttttcagat tcgtttagtt cagtcattcc gagaactttt tatataccag ataaaaaggt 360
gggatgtcta gatcattaga aattcagaaa aagtagattt attgttggct caacaagtgg 420
ctgtaaagaa tatgatatcg acgattatat aatgagttac tgttaaatgt gaccgtatct 480
cttttcttgt ttgactccgg taattgggct ataatttaat tataatatat ttagtcggca 540
ttgtattatg gtatagaatg gagggttaag agaaaaattg tgaacatata tagctaagcc 600
aacagaattt aaatgatata gaaaaatgaa tattctcgtc aacgacataa attgtagcag 660
tcttccaccc caagtcattg ccaaatgtga aatgtatttc tttttacttt caaaattaca 720
gtataaaaag gcaatttaat ataacttaag tttaattgat gaaatcatca gctaacattt 780
ggctagtttg aaacctgaaa ataaccaatc tttcaaagat atataatggc attgattctg 840
aaaaacgact atatcacttt atggtgttga ctttcaaata tttttgggaa aagaattttc 900
tgagtaacgc aaagtagtgt caaattttga gagtgtatta ccaactacga ttagtctctg 960
aaaatggata taaattgtga agaagtcggt tatttacgcc taagtgaagt cgataacatt 1020
tcgaagtcgt gtctttcttg gttattggtt atattaacac gatttctgtc ctctatggta 1080
caagcccata ttatacgtat taagtatttc gatttatcag gtccaagaag atttcagcag 1140
agcttgtctg atataatagt tgttgttaac catgttactt aaccatatcg gtctcttcta 1200
tcttaaaaat aatagtctta aatacattat agcgttccaa acatatccat atcttattat 1260
gaattattgt ttacatatta tttagtacaa caataatatg taattcggga ttttcagaag 1320
taaacatatt tgtaaaaatt atgataaagt cacagctaaa aaaaaaaaac atttatttca 1380
cttcgaggaa aacaacgaat agcgctcgct caactttgtt tatgaaattc atatggtaaa 1440
cgtacgcaat gtctttcatc tctttgataa ggtatatctt taattaactt ccaacaatca 1500
aacagttgtg ataattatac ttttttgaca atatatataa acctaggtca tagtcattgt 1560
tgttgtataa cactttgtaa cataggtagg gtcgaacatt cctattttgt tgtcacattc 1620
gcatagttca tgagaaaatg aaaactgacc aatcagacga gagataatta aactcacaaa 1680
aatgtattac taagcacagc aacgatcgga aaaattaaga aaaaatattc cagttgtgta 1740
attcttgcta gcgcacctct ttttgttgat gtcatgcata ttgcaataaa gaatgaaaat 1800
attttccagc ggcgttcctg tttttttaaa tatatatata taaaggcgat ctttttggtt 1860
tactgctaac tactattcaa ttcattcctt cagttatgag tttagaaaaa tttgctattt 1920
aatttcagaa caagttataa tcatagcacc atatttagat aaacttttct tctgatattt 1980
taccttctga catgatatag ttctaatagg tagaaggttt cacaagatag taatatttga 2040
gatatatttt atttatacaa gcaaggtgaa gtcggtatgc tatttataca atttggtgga 2100
tttatattag acatttattg caacaatgaa gaatcataat ccgatctgct ttataccttt 2160
agaatcgatg caaatctata gttgtggttt accgatgaat gttttggtct cttttagtta 2220
attcatatgt tcgtaatagt gaacatctca taacgataaa atgctcgaac catacaaata 2280
ttaagatagt acgagtatga accatatatt tcgagagtac attatctacc atgtatccca 2340
ttaggctatt ttgtcgatta ttgttatgct gattttgcct gttaatagcg attcttcact 2400
tttcactatg taaaatccgt aaaatcacca tatcaaaaga ttagcaatca ggcacgcggt 2460
atgattgctc agtaaatcca attattattc agtataacaa gagcgatctc gttccttcca 2520
ataggatgat attgtcctct ctaaaattaa tgaatacctc gatgattgtg ctgtcatgtg 2580
ccatttcaat tttgtctttc gaggcattat aatttccgac aacaagttag gaaggatcgt 2640
tgttagtcgc accgggaaaa aagtgtgggt aaaacagatg gccaaaaaaa aaaaaattgg 2700
taagcacttc gtagtaagac ctttgcggaa ctcagtgtat atgtatacgt caacagatgt 2760
aaatagctat gaacgaaaga taatggcaac tgggaatact ggacggcaag taaccgaaat 2820
ttgctcacgt tatgagaaat taagtgtaat caacttacac actaaaacgt atatgctaag 2880
actgaaatca agaattcaca atgatagctg ttacagtatc agtcctttac acaaccacac 2940
tttcatgtta ttgaagagtt cgaatcgcaa tgattgtaac ccagaatgat gactgttgca 3000
aataagagta cggcatgtta aaattaacga agttttataa caatatatat atataccgaa 3060
aagaaagaca gttttgttca actaaatacg cttatgcatg ctctcattca tatttcaata 3120
ccaagttcgt gttttttttt ggttgactaa taattttctg agtataacca gaagtaatca 3180
tcaactcaat gtcgaataat ctattccaat tgctcttgaa gcatacagag tctgatatag 3240
attgcaacgt ttttaccaca gctgacctta aggatgactt accacaaaca gtttctcaat 3300
tactgaatag aaatgatcct ttcgctcaag ttaaagaaaa tatcaaacca aataaactga 3360
atgtcgtatt cacagatgaa ttgacattat taagggcatt gcctcatttg accagtttaa 3420
aagaacaacg tttagtcatc aatgttaaca tcggtcataa cgattactca gtcgtgtcta 3480
cattaaaaga tctaaatatt gttactttga tatctaacga ttataactct gctctcaaga 3540
atattaacgt cgctaattct gttgccttta actcttctac tacagtattg cactttatca 3600
actacagaaa gtgcaccaat gatttgcagg atatcaaaga aaatgaatta ctaccggtta 3660
gtttaatcaa taatggccaa ttacaatctg aagacaatat gctatccgaa ttaaacaatt 3720
tttcattgtc tcctacttca tcagaagatg catctgttgc tgttatcaac ttatctcctt 3780
atggaaagga attttctaga tatttgccat ctcaagtctc acttattgat atcaacattt 3840
atagaccatg ggatattgat caacttttaa ctctttttgt cccttctatc agaaagattg 3900
tgattgttca aggcgctagt gcagatgatg atggtgaaca atctcatgct tttgatccat 3960
tgttgttaga ttttttcagt gattttaata aattggtcga aagaaagatt gaccaattga 4020
ttttatcaaa agtaggctta atttccatct gtgatataaa agactcctta gaaattattg 4080
tttccaacgc tgttaaggat gctccaaatg ctcatctatt tgtaggtaaa ccggtcgatg 4140
gtattaatgg taaatattcc agttctattc tttcttcgat tgatcaccaa cgtactttcg 4200
aaagttctta tatcagggtt ctgcaacaat tattttcttc aaatttgaat attttaaacg 4260
aatttcagag tgattctatt gtggctaatt ctccggaata cggctttggt tatctgttga 4320
attcagataa cattcgtgaa aagttggttg aaaatgctag aagcttactt gacttcactt 4380
ccttcaaaga tataccagct gctgatgcca ctaatttggt taagcttttg tcaaaatgga 4440
ttgattgcaa ccgctcttca aagagtacca ccgaagagtc gaacgaaatt tctactgcta 4500
tttttaatat ttttaagagt tatccggaat gtcaatcaat caagacattc ttagaaatat 4560
ctgatgatat cgaagattac ttatttaaat caaactggtt aattggttct gatgcttggt 4620
catatgatgt aggcaactcg ggtgttcatc aagttttaag ttcaaagaag aatataaata 4680
tgttaatcat tgattcagaa acttcctcca ctataaaacg aaacaagtct cactcaaaga 4740
aaaatattgg tttgtatgct atgaatttcc acactgtata tgtcgcttcc gttgctgtct 4800
attcatctta tactcaactg ttaacttcat tattagaagc tgcgaaattc aatggtcctt 4860
ctgttgtcgt tgcctatcta ccttatgaaa ctgaaaagta caccccagtc gatattttaa 4920
aggaaaccaa aattgccgtt aattctggat attggccatt gtacagatac gatccatcta 4980
ttgaagatga taatgaagct ttccaacttg actcttctgt tattagaaag gaactccaag 5040
acttcctaga ccgggaaaac aagttgacat tattaacgaa gaaagaacct ggaattgaga 5100
ccactgttga acaatctgtt tcagatgcta ttgctaaaaa aatggaatta agaaacaagg 5160
ctgccttaca tcaactactt aatggtttgt caggacctcc gttacatatt tattatgcat 5220
ccgatggtgg taatgcttca tctcttgcca atcgtttggc taatagagct actgcaagaa 5280
acttaaacgc tacgtctcta tcaatggaca ctattgttat ggatgaatta tcaggcgaag 5340
aaaatgttgt ttttattacc tccacggctg gtcaaggtga atttccacag gatggtaaaa 5400
cattctggca agaactaaaa atggcaggac aagctgatct atcaagtatt aggttctcgg 5460
tatttggttt aggtgattcg aaatattggc caagaaagga agattctcgt tacttcaata 5520
aaccatcaaa agatttattt tccaaattac aatcactcgg tgccgatcca tttgttccac 5580
tgggcctagg tgatgaccaa gaggataatg gttatgaaac tgcgtattcc atatgggagc 5640
aacaattgtg ggttgaactt ggtgtggata agattgaagt tgccgacgaa ccaagagaac 5700
tgactgccga agatgtcaaa ttacaatcca atttcttacg tggtacatta gcagccgatt 5760
tagtaaatga agaaactggc aacattacta atgaaaatac acaaattgcg aagttccatg 5820
gtttgtacat gcaagatgat agagatatta gagccactcg caaagaacaa ggtttagaac 5880
cattatatgc attcatggct agagttagaa cccctcatgg tactgcatct cctgagcagt 5940
ggttattact tgataaatta tctgacgaaa ctggtactgg tactattaaa ttaactaaca 6000
gggctacttt ccaactgcat ggtgtattaa agaaagatat taagcataca atcagatcga 6060
tgaactcgtt actaatggat actctagctg gttctggtga tgttaataga gatgttatga 6120
tatctgcaat tccggaaaat aagaaagtac atgatcaatt ggtttctatt ggtaaacaga 6180
tatctgagta ttttttgcca aagacgactg cttatcacga aatttggtta catggtgttg 6240
acgaacgtga cgatgaccca acctggccta ccatttacga gaatagaaaa gaaggtccaa 6300
gaaagaagaa gacaatggtt agtggtaatg ctctggtaga cgttgaacca atatattccc 6360
cggtttatct tccaagaaaa ttcaaagtca atattgccgc acctccatat aacgatgtcg 6420
atgtttggtc aagtgatgtt ggtttaatat caattattaa tcaagatact caagaaattg 6480
aaggtttcaa tctattagcg ggtggtggta tgggtacaac tcacaacaac ataaagacat 6540
ggcctgatac tggtaaaatg ctaggttttg ttactccaga caatgttatt aaagccattg 6600
agagtgtatt aatttttcaa agagataatg gtgaccgtac aaaccgtaaa cacgcacgtt 6660
taagatacac tatcgacaca gttgggtttg aaaactttaa aaatattgtt gaagaaagac 6720
tagtttttga tttcaaacca ccaagagatt atactatcga ttccaatgtt gacaaatttg 6780
gttgggtcaa agacgaaagt ggtctgaacc atttcacaac ctttattgaa aatggtagag 6840
ttctagatgc tcctggtatg aatcaaaaga caggtctgag ggaaattgcg aagtatatgc 6900
aattgaagaa atgcggtgag tttagattaa ccggtaacca acatatcatt attagtaaga 6960
tacaagataa ctatctacca gaattaaaag aattgctaaa acagtttcaa ttggataacc 7020
ttcaattgtc aggattgaag ttatcatcct catcgtgtgt cggtttccca acttgtggtt 7080
tagctatggc tgaatcagaa agatttttcc ccattattat tacagaaata gaagaaacat 7140
tggaagattt tggtttacgt cacgattctg ttgtattaag aataactggt tgtggtaatg 7200
ggtgttcacg cccatggtta gctgaagttg ctttgatagg taaagcccca aatgtttaca 7260
atattatgct aggtggtggt tatcacggta acagattgaa caaactgtac agatcaaatg 7320
tcaatgataa agacatctgt ggtattctaa agcctttgtt taagcagtgg gcacttgaaa 7380
gatatgaagc tgaacacttt ggtgactttc taattagaaa agatataatc aaggaaacta 7440
cagaaggaaa atatttccat gataatgttg cccaggaagc ctattaatta ttttttttat 7500
ctatttgtat atttatattt atttatttat ctatttacta aaaagaatat tattattcta 7560
aagtacgtga tcttcggtat atagtttaaa atcagttgaa tgactacatc attttttttt 7620
aagagg 7626
<210> 34
<211> 1418
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c3775_g2_i1
<400> 34
ttatcagtat atgctgttaa taagcaaaga tatatcggtt gtgtgtgaac tacacataat 60
tatacgttga ttatcaatgg aaatagaatg agacttatcc gacatcggtt ttccagtgaa 120
aatatattgc cccttagtgc ctctccgata tcatagatat gttgcctaaa gccagtgatt 180
gttgactcac atacagacac accctattct tgaattgcct atcgcttatc atttacaata 240
tacccctgga cctcatttcc tatccttttt tgccccacta ttagccagga aaaaatgatt 300
tactgtttga acaatcccgt cttcgcaaat catatattca cccacattcc cgaaaactcg 360
gaaaactatt tcttgcggta aggcaatgaa aatgaggaac agaaccactt cggacttcga 420
cggccgttcc ttcggccggt ttggcaccaa atttttcgag aagcaaaaaa gaaacgaagg 480
aaacagaaag agaggggcag agacgaaaaa tggaagaaaa aaaagcaatt ccacgcacat 540
tctacgaggg ggtctctgtg aaggagtgtc cgatgatttc cgattactta cataagtggc 600
attatgagat tcctactgat tttaattcaa taagcaaaaa aaaattttcg tcgagaataa 660
tgaggaggga tagatatgtg ccaaatttgt tgttttgaag ataagtattt gagacatata 720
taaaccgagt taaaattggt atatgattag aatagttcct gctgttcctg ttcctttctg 780
gtaatgaggc gaaccaaaga atcatagctt ttaaaaataa aaacaaacaa aatgtacagc 840
gagggaactg ataagacttc cgaaatccgt gaaattttat tattgaacgg tttctaaaaa 900
taatgactaa ggtggactga caccaaattt tgcataagta ataatattat taaacaagtt 960
ttagaagaca tggtatatct cataactaca taacatttac acatatataa tctaatacat 1020
ataacatata ctggaatatt cgtgtttctt taattatttt tttatccttt attaatttat 1080
tatgtatggt ctctataaaa tatatatcat gatgatgcat taattaatgt tattcatttc 1140
cccgcgttta gttgttcaga taacttgcct tgagcgatta gtgccgcagg agcatccatt 1200
acaggtaaca tgatttcaat cattctaatg gtagagtttt tctggaaatt tttatcgtta 1260
gttaactttc tccattctcc tactgtggat actctataag tttcataatc tgtggccccg 1320
aagacaggca atagttgtaa atgatcccaa ctttggactg cattatatgc agcatcgggt 1380
ccgtggatta atctttcaat agtataccca tcattatt 1418
<210> 35
<211> 2172
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c4997_g1_i1
<400> 35
agacagacag tcacatttgg taactgttct ttaaccgttt gagatatgta cacatgtatg 60
tgaacattat taagattcct gtcctcttag atacataata tataaatagg gatacgtata 120
ctctgtaaag tgggatctta caagagagtg agagagagag caacagaata aaaaaaaaag 180
gtacacacac agtatcagtt gagcataacg atgggggcta tcacacagaa accaagagac 240
atatatgaga agaaagatgg tgatgatcat ttaaatacga tgaaccatga gcgttatgag 300
tgtccgcatc cggattgtaa taaaagtttc tcaaggcagg aacatttagg tagacataaa 360
ttaaatcatt ggccaaaaga aattttcact tgtaattatt tatttcctga gactaaatta 420
ttatgtggta aaacatttgt taggaaagat ttattagtaa ggcatgaaaa gagacatact 480
aaggagaaaa atagattaca tgcaaaggag attacaccaa ctgctacaaa tgtaactacg 540
gaaaagaaag ttgtcaagaa aagacaatcc aagaagaaac agcaacagca agggaaagat 600
attacaaatg ttattccaga gtcaaatgcg gatgattcag gtatgggaac atcaaaaata 660
aagaataaga aaaaaaatat aactaaaaga ttaagcacta aaattgatag agtcgcttcc 720
gttccaataa gtttgaaaaa tgatgataat actacaacaa caagtaatgg aatgaattat 780
ccaacaagta gtaatttgaa tcaaaataaa ttaaattcaa agagtttatt aaatatgact 840
agtttgaatg ataataactt aggcacagct acatttttcg gtaatggacc aggacaatct 900
tcaaatatct tcgattggtt atgggctcct gaacctaaaa gcaaattaaa cgatacaaca 960
agtatgaata ttcagaacaa ttctggtttg aatcaaccat ttaatgttat gtcgcagcaa 1020
cagatgcctc aacagcaaca acaacaacaa cagatattgc ttccgaatgg tactttacca 1080
ttacatttac aacagtattc tccattagat actacgaata ataatcataa taataataac 1140
aatttaccat taagtccaga aatgaataaa acatataatg ttggtaacaa ttccaatatt 1200
gttgtattcc cagtacaaga ggtacaaatg gattattcag atgaaagaag attcccaata 1260
acacaatcaa tgaataataa tagtagtatt gtacatactg ttacaatgag tccaactcaa 1320
caatattatg atacaagtca tacagggact ttaccaacac atcattcaac tgtcaataac 1380
aacaacggta taactactat tggaactaaa aaatttggtc gtagacgtaa attaacaaca 1440
atcgataaaa attcaccacc aaaggattta tccactgtga ttaacgccga aaagaaatta 1500
ttattaccat tgacacaaac aacattagca aattctaaaa taaatactgt aattaataat 1560
cctttaccaa ataaagatcg taggatatca atagatgcag aaattgttaa tggtaatgaa 1620
aatattaaaa atgacgacac taggggtaat aataataata gtacaaacga taataacgag 1680
gaaacaaaga atgttactga tcgtctaagt gtcaattata ttcttctatg aatgttgtgt 1740
attggaagat ctgcattatt atccatctgt aaaaaatttg gatttaagct tccctttttt 1800
tttcacatct gccataccat catcatcatc atcatcatcc cctttgtaac atatttattt 1860
caatatatag atatatcaaa tttttttcat aaaacgaaaa gcaaaatgat aactacatcg 1920
ctatttacat tgattcaact caacaatttt tggattaaac tttcacgaca tcattacatc 1980
attgttttgc atatgtcacc ttctcacttg cttgttgaca ccattaaacg gtaactcaac 2040
attgatgaaa tgaaatttgg accaacagaa ctagcctcgc ctcattattc aaaggaacag 2100
ggaaaggcat ggaaggagcg gaggggaaca gcaagagaga actcctcttc ctcttcctct 2160
tcctcttcct ct 2172
<210> 36
<211> 4720
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c7526_g1_i1
<400> 36
aatttttgac ttgtaattaa acatggtcag atctctggat gttatgtcat cttttttcac 60
cttgagtata tatattatat aaagaaaaca taaaacgtat caaggacttt agtagtttgt 120
ctcttttcac tggtttctat atttgaaaca ctttttattt cacttaatat tgccctcatt 180
ttattttttt attgtgtgct ttgagttaaa gtaaataaga aaccatttca acggaatctc 240
cgaactttcg aaaaacaaag acagaagaag taacaatgac tgcaacttct gatgcatcat 300
tgaaacaaac tctctatggg tttgctgcaa gagatccatt aaataaatta tactacacta 360
cattaaacaa aggtcaaaat actaaggatt caattccaga tgttgctgtt caattattaa 420
atgataacga tccatttgct acaattttgg aaaatgtctc tgaaacattg actacagtat 480
tcactgacga acaaactttg ttgaaaagtt tacctcatct atttcaattg gaacagaaac 540
caatcatcat taatatcgat ttatccttac aagattattc tattatttct gctatcaagg 600
atttgaatat tgttacttta gtatcaaatg acggttcttc tgctattgct catgctcaat 660
tagctagtaa cattgcttta caaagacaaa tcccagtatt ccatttcatt aactattcaa 720
taatcgataa atcaggtgaa atcattccag atttccaaga aatagaacaa tcccaagaaa 780
taaccaaaga cgacaacgaa gaagaggaag aagaagaaat ttccttggaa caatatttaa 840
ctgaacaaaa aatacaatct tttgatctat tagcacaagg ttctaatcca tctgtcgcaa 900
ttgtcaatct ttctcaatac tcaaaggaca ttgcctcagt attaccaaat accgcctctt 960
taattgatgt taagatttac aggccatgga acattcaaga actgttacaa ttgattgcac 1020
cttccgtctc taagattgtt gttattcaag gttcctacaa ggataattat actacagttt 1080
ctcaatcatt cgatcctttc ttattagatt tcttctccga ttttcaaaaa ctggtcgaaa 1140
gaaacatcga tcatgtaatc ttaactaaag taggtgaatt accaattgat gctattcatg 1200
actcattaga tatcatcatt aataacgctc ataaagaaaa tccagatcaa aacttatatc 1260
taggtaagcc ataccacgaa caaattcaaa ataaagaata catcgatttg attcattcct 1320
ctgttaagaa tgttctaaaa ttagaagaag cttatttgaa ggttctaaga caattattct 1380
ctagtaattt acaaatttta aatgaatatt caaatgatac tgttaatggt aacactcctg 1440
aatatggttt tggttactat ttaaaacaag atcaaactcg tgaacgatta attaatttaa 1500
tcaaatcttc attagatgtc tctcttttcg ctggtgtttc taatggatca gcagtagttg 1560
aaaacttatc caaatggtta aaattcaatg aatctcttga tgatcaacaa gttgaagaag 1620
ctaacgttat tgctcatgat atctttgaaa ctttactagc taataaatct aacgacacaa 1680
ttgctaaatt cttatccgtt gcttctactg aggacgcttt cactttcaaa tcacattggt 1740
tagtcggttc ggatgcttgg tcttatgatt taggcaactc tggtgtacat aacgttttat 1800
catcaaagaa aaacattaac atgttattaa ttgattctga accatacact gctaagaaca 1860
aaattgctca taagaaaaat gttggtctat atgctatgaa ctatcataac gtctatgttg 1920
cttctgttgc tgtctattcc tcttacactc agttattaac tgcaatgctt gaagctaaca 1980
aattcaatgg tccttcttta attctagcct atttaccata ttcagaagaa tcaaatacac 2040
cattagatgt tctaaaggaa actaaagttg ctgttgaatc tggttattgg ccattataca 2100
gatatgatcc aagtaaagag gatgaagatg atgaaactca tggtttcaca ctagattctt 2160
ccgtcattaa gaaagaattg caagacttct tagaccgtga aaacaaattg actctattga 2220
ttaagaaata cccaatcgtt gctgacaata ttaagaattc tgcaagtgat accattacaa 2280
gaaaacatga tgctagaaat aaagctgctt tagatgaatt gcttgatggt ttatctggtc 2340
cgccattaca catctattat tcttcagatg gtagtaattc tatcaattta gctactcgtc 2400
tatgcaaacg tgccgtcgct agaggtttaa aagctaccgt attatcaatg gaacaagtta 2460
tcgtcgacga attaccaggt gaagaaaatg ttatcttctt tacatctacc gctggtcaag 2520
gtgaattccc acaagacggt aaatcattct gggatgaatt aaaggcttct accatagatt 2580
tggctggttt gaatgtatct gtttttggtt taggtgactc caaatattgg ccacgtaagg 2640
aagatgctcg ttactacaac aaaccttcta aggatttagc tgctaaatta gaggttcttg 2700
gtgctaactt tattgtccct ctaggtttag gtgatgatca agatgctgat ggtttccaag 2760
aaggttatca agcttgggaa ccaaaattat gggaggctct aggtgttgac aacgtcgatg 2820
ttccagatga accaagacca tggaacaatg aagatatgaa actcaactca gatttcttaa 2880
gaggtaccat tgttgaaggt ttaaacgacg agtccacttt agcaattcat ccatacgatc 2940
aacaattgac taaattccat ggttgttata tgcaagatga tcgtgatatc agagatatcc 3000
gtaaggctca aggtttagaa cctttattta gtttcatgtc aagagttaga ttaccaggtg 3060
gtaaagccac tccagaacaa tggttggctt tagataaaat tgcaagtgaa gtcggtaatg 3120
gtactatgaa gatttctaca agagcaactt tccaattaca tggtattcta aagaaggatc 3180
tgaaacatgc tatcagaggt atgaattcta ctttaatgga cactttagct gcctgtggtg 3240
atgttaacag aaacgttgtg gttactgctc ttccaaccaa tgctaaggtt ttcaaccaag 3300
tatctcagat gggtactgat atttctgaat atttcttacc aaagacaact gcttatcatg 3360
aaatttggtt acaaggtacc gacgaacgtg atgatgatct aaactggcca caaattttcg 3420
agaatagaaa ggaaggtcca accaagaaga agactttagt aagtggtaat gcattagtcg 3480
acgtcgaacc aatttacagt aatgtttatt taccaagaaa gtttaaggtt aatattgcag 3540
ttccaccata caacgacgtt gatgttttct ctattgattt aggtttaatt gctattgtta 3600
atccagatac acagattatt gaaggttaca acttatatgc tggtggtggt atgggttcta 3660
ctcacaacaa tactaagaca tatccaagaa ctggttctga ttttggtttt gttaaaccag 3720
aagatgttat tcctgctatt caagctgtta tgattatgca aagagataat ggtgatcgtc 3780
aagatcgtaa acatgcccgt ttaaagtata ctattgatga tattggcgtt cctcaattca 3840
aggctatggt tgaagaagaa tggggtaaga agtttgaacc atctagacca tacgaacaat 3900
ttatttctaa ccacgattac ttcgggtggg ctaaggatga gactggtcta aaccattata 3960
cttgtttcat tgaaaatggt agagttgttg atactcctga attacctcaa aagactggtt 4020
tagttaagat tgctaaatta ctacaaaaga ataaatctgg tcattttaga ttaactgcta 4080
ctcaacatgt tttgatttct gatattgaag ataaggactt ggatgaagtc aagaagatct 4140
taaagcaata caaattagat attacagaat tgagtggtat tagaattgct tcttcatcat 4200
gtgtcggttt accaacttgt ggtttagcta tggcagaatc tgaacgttac ttacctgtct 4260
taattgatga aatcgaagag gtcctagaag aatttggtct acgtcatgat tctattgtta 4320
tgagaatgac aggttgtccg aatggttgtt ctcgtccatg gttagcagaa attgctttaa 4380
tcggtaaagc tccacatact tacaacttaa tgctaggtgg tggttactat ggtcaaagat 4440
tgaacaaatt gtacagagca tcagtcaagg atgatgatgt tattggtatc ttaaagccaa 4500
tatttaagag atgggcttta gaaagagaag aaggtgagca tttcggtgat tttgttgttc 4560
gtgttggtat tatcaagcca actttagaag gtaaatattt ccacgatgat atcgctgaag 4620
acgcttatta gaggaacggc gtctattatt ttcatgtata tgtatattta taaatattta 4680
tttcacaact tatttatttt aactattaat tctttaaaat 4720
<210> 37
<211> 953
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c10116_g1_i1
<400> 37
caaatatttt gaaaagtgtt agttaatact tattcaaact aactcatata catccctaac 60
aatcaaagac tcacttacat aatgaagatc ctaacatccg aagaaattaa tgctcatagt 120
gcctatactt taaaaggtgg tgcattaggt gccgttatag gtttagctgg ttcagctgca 180
ttatttaaat tcttaccaaa aagattccca ggttttaaac caagtcaaat ggcatggtct 240
gctaagactg cattatttat tactcctcca actttattta cagctatttg tgcagaagaa 300
gcatctaata gatttgatgc tttgaaatat tccggttcat atatgtcaga tgaagctcta 360
gagagacaag cagcttggga taaattatca aagaaggagc aaatggttga aactttaaat 420
aataataaat ataaaattat tacaggttta tgggctgctt cattatatgc atcatgggaa 480
attattaata gagataaaat tatgaatgct actcaaaaag ctgttcaagc aagaatgtat 540
gcacaattta ttactgtaat attattatta tgttcagttg gtttaagtac ttatgaaaag 600
aaattaaatc cagataaagc taaacattta gagagtcaac gttgggctaa tgctttaaaa 660
gctgctgcag aacaagaaaa gatggcagat gcacaaacta ctttctctaa tgaagaaaga 720
agagatgcaa agattttcaa atatgattaa tctgttttgt ttgtttgttt gcataatatt 780
attattacca ttttactatc acgcaatgcc atattttata cattttatac acaatccaac 840
tttcctccca aatttatttg aacatctata tttatgaact tctatttttt ttaattcctg 900
tttcacttac tcaatcttat ttaatgactt tattctcgta taaaaaaata caa 953
<210> 38
<211> 5517
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c2453_g4_i1
<400> 38
cacagctttg gataatgtta ccagtgatga tctgtataaa aatacgtaat ttgaaaagat 60
gagcttacat gttgtgagaa gagctgctct gcccttaaac taagaaacaa caattgtcga 120
caacagtatt cctttgatcg aaacaagaaa atgtcagttg aaactttgat atactggtca 180
attgaattaa taatataaca caaagaagag tattcctatc gtaaaccatt agttagatcc 240
caaaggaaga ttgattatca tgcccaactt taagatgtac atttagtttc aattaagtta 300
tattaagagt tgtaatttca agatattcac cctatgccga attaacatat tcggagtagt 360
cgttaagatc aatactactc tgtaggtgac taaataaatc tgttctacgt aattaactct 420
cccagaaatt ccggagtaat cacaaagaag gaagatatgg ttatcggaat ttcaattgac 480
ctgtcagatg gttgaaatat tagattagta aaccaaaaca atctcacaca tacatcaatt 540
aaatctcaat tgagtattaa taagttaatc aatagataaa ttcaaaaata tgtcccttat 600
aatcaaactt ttcgagatca agtttttcat caacctttga atacgactcg gcgagcggct 660
aatttctatt cgcggatcgg taaggaacgg aattacttca agttcacata aaagaaataa 720
gaaagtgtct tgtttacaga acgatcagtt ataatacaac taaatggttg caaaaaatac 780
tagattaaaa ccaaaaaaaa aagaagaaat aactggggca agccgggaat cgaacccggg 840
acctcccgca ccccgagcgg aaatcatacc tctagaccac gtgcccttga aatttcaaat 900
cttgaaattt tgttgggtca ttatatacgg gagctgaaat tcgggtaata ccgaaatatt 960
ttaaatggta ttatccgatt tattcattag tatcttaatc tgataaatgg tctagtattt 1020
tctcgagcta ttattgtatc ttagtattca aaagatgatt agatatatga ggaaattatg 1080
gaacaaaaag gaaaatttca aacgtttcac taatatgtaa aataatttgt gacaccgttt 1140
cttctaataa ttttaaataa agaaacatct gataatgttt gatgcaaaag tagtggacat 1200
cgtcctttga caacactgtg aaactgatat tcttaaaaat tgtcaaatag acccatatca 1260
aatttatcat gattaaatcg ctggatcaat acacagggtc atataattag tctgatttga 1320
atatcaatct gcttcatgtc ttatcttcgc aaaccaaaat ttagttatta atggattaaa 1380
actgtatatt tataccatct aacaacagaa tgtttggttt aaactatagg gtatcattaa 1440
actcaattat taataataaa aaaaagagaa tttagtatct taaatgtatt aataataatc 1500
ttagatcggt aaacattgtc aataaagagt gaattgcatt aataaattag gtcaacaact 1560
tagctattat taataggtcg ataggtgaag aaggacaata atagtttaat attttagtat 1620
attttgaatt gaaaatggga aactggatgg gaatgatatt gaatgtgggt acatcctaca 1680
gtaatggtaa tattatcaga attaccacaa gttgaacaaa tcacgtagac tatttctagc 1740
gctatatacc tcacaatctg taatataagt ctacaagtgt tcgtctatgt catgattcga 1800
taccacggaa aagaatagat caatagaaag tggtaacaat tatctgggac tgtcttggag 1860
atatcagacg tcccattcat cttgaatttg aaacatatat ctctccgtaa caattcgaaa 1920
ttgtcactaa tggaagcaga aactcagata ccaaacacat cttttgagta acacaattat 1980
ggttctttga cggaagattt attattccaa aattaagagg ggattttgat ttaaaaatgt 2040
caacattatt taataagaac atatccaatt caaatcctta acaatagcta atagaaactg 2100
ctctaagacg tgtattatat aaacccagag aaagttaaaa tatgtccttt tacaacttag 2160
aagttctcga aacgctaata attacctgtt cttctgaagc tgctaaacga atctagaatt 2220
gatttgttaa ccaaaaaagg aacacaaaca cttgtgaaac ttgtataata tgtgtaatta 2280
tccggaattt tacagtttga attagctctt tatgttaaga acaacattaa ccctttttat 2340
tctgtattcc ccactaattc cccagacaat aaggttagtt ctccattggc ctgcatctcc 2400
cacagaatat aacacaggag attaaataat ccgcgtaata tatcaagata tccatatgga 2460
aaaatatgac aagacagtct cttcagcgtg ctttcttggt tgataaatgt caacaggctt 2520
ttgttcaaat aaggaaaccg gagtaaacaa cacgaattag gtattgaact aaaacaggaa 2580
cttcaaaata aaaccgaacc ccttcatcta cccacgttaa tgaacagagt tgaacgatca 2640
tatattcatt tggaataaat ttctccacat ctattaagtt acggattaaa taaatggaac 2700
atctccaatt aataacattt ccttaattaa tgccagggag tcggaattat gtttacttat 2760
ttcaggagtt aacagaatta tttttcaatc gtggaaataa gaaagctcgg aacatttatt 2820
tctcaattgc atattagcta agaaaaagta atggcgaaaa gaaaaaaacg gagaaatttt 2880
tttattgaag tcttggatag gcggaatgat ttgtaaagtt gtcatagaaa actaattaag 2940
ttttgagaag gtatctagca ttgctcaact tatagggtac gctttggaag aattaaaaca 3000
aattaacttt acgtagtagc tccgacattg ttctggcatt gactttatca aaatcgcata 3060
ttggaacaat caattagggt cttctgtgcg ctttagacat taccaacaaa ttccaacgga 3120
accctacaaa gtgtggggga agaagagtaa ttgttgcctt ttcctctctc aagcaagtgt 3180
gtaatcaaac acacctttta gctaattcac tctccgcgaa gtttaaatat gaagtttttc 3240
cacctatttt caaattcaat tcccaacatc caaaatttat tcctttacaa gaatcacaca 3300
ggttattaca ttaggaattg cttttttttt ttttttttct tctttccccc ccaagattgg 3360
tgatttgcct tccagattgg aaattattta cgcaaggaat agctgcagaa gcaaggagta 3420
aaatgtcgca gtcaatagtt ttcccgcccc gcgttttttc tccggcgatt ttatctccgc 3480
ttgtactatg ttatctttgg agaatgctat ttccaagagt ttcggaaaaa catttatgaa 3540
aagaaataat atcaaaaact gtatgcgaag ataacgttgt aggatatatt tctacgatgg 3600
aacactgtgc cccgcgaatt ttaaagatgc aaaacaataa ctacagatct atctgagaat 3660
atatcatggc acgaccccca ccccataaca ttgcttctgc attagtttct cttttccccg 3720
tctcggaaaa atatattcgt ttttccgaac aaacaaaaca tttcatgtac gtatattcct 3780
tcatgttcaa tagtagctta tgtaacaatt tgttcggtat cctattgata ctttagcgaa 3840
atatattcaa ttgtgtgtat ggtatgacag atgtcaagat atatatatat atatataaaa 3900
ggaagaaact ttcccattct agactaaaga cattcattta atggtttggt tctttcgttt 3960
tccacctttc cccttcgttt gtcaatctgt cattgaaatt ttaaacaatc tcaattagta 4020
acactagtat acaatcgttc acaataatta ttgtactgta caattattat tattattttt 4080
ttaagaaagg tcaccaagat taaatataat ttcagttttg aaaggtactc aattgtaaga 4140
aaaagtaata taatataata accaaaatga gtgttaatcc ccaaactaaa tttccagctg 4200
ataacaatga tagaccattt agatgtgagc tttgtcatcg cggctttcac agactcgaac 4260
ataaaaaaag acacgttaga acacataccg gagaaaaacc gcacggatgt cagttccccg 4320
gatgtaacaa atttttcagt agaactgatg aattaaagag acattcgaga acacatattg 4380
gtacatctca aagaaagact aggaagataa ttccaaagaa taatagtgag actcaaattt 4440
catcaaaacc aattactatc gctgcttcaa agaaagtgat taaaaaggag ataagtacac 4500
ctccaaagac atatactgtt ccatcattaa catcattatt acatcatgag actacaccaa 4560
tgtcttcacg taaattatta gctggttcca catcacttga aagaccgatt tcaagaacaa 4620
tgtttcctcc aagtatacag aaagtttccc ctatgagtga aaatagttct gctgaatcat 4680
ccataccgaa ttctccaatt tctcaaaata attctatatc gacatctagt agttcattgt 4740
ccttaaattc attacttaac agtaatgtta acaataacaa taatcaaatg tcatctgtat 4800
catctgtatc atcatattca gatggtagtt tcaattattt agatacatca ttaaaattat 4860
caagtaaaag acgtgcagat ttccaaattg tttcagaaga aaatgatgca gatagcactg 4920
gtagtagtaa tataagattt aatagtaatc aaccgaattc tattcaatta ccaccaatta 4980
aatctatatt agctaatatc aataatttca ataatggaat gatatcatcc caacaatata 5040
ctcgtgcatc tacataccaa caacaagcat aatttcatcg ataggtaacg atgaaatcaa 5100
caatcataac cacatactag tatatacttc tatacataca cacatacata cattcataca 5160
ttcattcata actaatcaac gaacattaaa cattcatcat ttgacaaaaa tcacttgtcc 5220
tcttattcaa atagttccca gatcctgctt acataataat atctgtcggc aatctttatt 5280
atgaaattca cttaccgaca tggaaaccgc ccaccaacaa acaatactgg aaaagtccgt 5340
ccttattata aaataaaaaa attgtttgcc gtttcattta ttaatttatg cgcatctttt 5400
ttttcttttt tttttgtttc gtacgtaaag aatttcggga agatcctacc attccgtccg 5460
acgtgatgat ccgggtaacg tgtgttttgt tctattcgcc cgcgcgcccc ccacttc 5517
<210> 39
<211> 2495
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c8818_g1_i1
<400> 39
gaaaaattta ttaaatatta ataagtcttc tgtgtttgtt tttttttggc ttttctaata 60
cttatagttt tatttgcctc atgattgatt gattccgtac ttttctctat ttctcctgag 120
gtatagtatt atattaatac tgatatatat atataaggtg ataaattttc ttgattctag 180
aaatattcat tgtgtcaaaa tcagatcact aacagcatcg atttccaatg atgagtattt 240
ctgatatacc gccagcttta tcaccgattt caagtactac ctcgattcaa ggtttacaaa 300
cttcaatgaa taataataat aatagtaata atcatattaa taattcagga acaaattctg 360
tatcaacttc accacatgct tattcattgg atagatatca tgaaccatct tcgaatgaaa 420
aatttaaacg taatgggaat tcttcttcat caaataataa taataataat aacggtttag 480
gtaatacaag taaaaatatg tcattaccac cgatttcatc atttgataat ttgattcgtg 540
ccgctgagaa acaatatgct tctacaagta ataataatac tgctgcaatg gatcagtcac 600
aggtatcatt atctgctact gcaagtatga cttcattacc attaagtaac aatactaaca 660
atgttctatt acatccgcta caacaaggtg taatgactcc agtgggttca agaactaata 720
tgttaagtta tcaactatct actgaacaaa gacatagagc tcctatcact agaagtatac 780
tacaacatcc accaagtgca actactacag atgctcgttc cgaatcaaat cgttctttac 840
ttgcatcccc atcagattca atgtcaagaa caagtgtaag tagtagcagt agcagtacaa 900
gcactagtac tacagccagt aaatcaatag taggtgatta taaactaggt ggtccacaat 960
ctcaggaacc accactgagt ctgacaactt caactacaac aaaggtaaca aaaccaagaa 1020
agaagaaaca atgtccaatt tgtcacaatt attacgctaa tttatcaact cataaatcaa 1080
ctcatttgac ccccgaggat agacctcata gatgtcccgt atgtgaacga ggcttcgctc 1140
gtaataatga tctaattaga catagaaaga gacattggaa ggacgaattg atgtcgccgg 1200
cgacatctac aaataatgga tcctctaagg ataaatcaaa tatcaattct caagcgttat 1260
cgaaacaatc acaattgaga tcattacatc aaattaaagg aacttttaaa tgtccattca 1320
attcaaattt gattaaattg gatatggagg tatatccaca taagaataag atattgccgt 1380
tcgagacatc gaattgtcat caaactggtg tattttcaag atgcgataca tataagaatc 1440
atttgaaagc gttacatttt gaatatccac caggtacaaa aaaaaaggat agaggaatcg 1500
ttcctgggaa atgtaaacat tgtggtgcaa aatttgaaaa tgttgatacg tggttaaata 1560
atcatgttgg taaaaattgt ggttacatat atcattgatt tgtacatata tatatttttt 1620
ttttgtaaag ttatataaga tgcattattt tttttttaat tttttaattt ttaattaatt 1680
ttgaatgagt ggctgagtct cgaaccatta caatattcat atcattgaga taattcgatg 1740
ggaacgcggt ataatttgta taaggtacaa taggtttcgg gtagaaacta tgttcaatac 1800
cttgagaaaa ttcaagattc agactagtac ttatatacca acaaccaaat gctttctgta 1860
tatttgagat catattttta aggatattaa aggatttgaa atcatctgaa gtatttagat 1920
acgatagatt atcaattatt ataccgtcca atgaacaatc ggcagcattt gttccatacc 1980
gtatcggaca ttctttattt aatgtcacta ttggattagc gttaactcta ccaaggaaac 2040
aaattaaccc ttcaattgtc gaaatttctt tactgttaac gtaccaatat ttgatatttg 2100
caaatcgggt ctcctccaat gtttgtctcc atatagatat gatatcaata ataagaattg 2160
caccggtgtt ccctgtttca taatttcgta gtaataattg tctcttatat tcattcattt 2220
tgaaatccgg ttcaattaaa tgcattgaat acttcggagt agtagttgca tcggacacat 2280
cttgaagatt taatgaatgt ggtactgtat ggtagtattg ggataacgga tattctttac 2340
aatgtttcaa tatttccatc tttattttga gttgttatac tgttcaatat tatcactttc 2400
actgatgttt gaactcacgg ctcttatatt ggaaaacaaa tcttgcgcat cgatcgcgac 2460
atactgaaaa acaaaagatt ttaaataatc atgag 2495
<210> 40
<211> 1779
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c4613_g5_i1
<400> 40
tgtgaaaatg attatcgggg ggcagaaagt aacgaagaga taaacaggaa tgtctgattt 60
caattgggaa aatattggaa aagtgtattg agacaaaaga gagagagaga aggaatatat 120
atataaggga atcaatatct tttttgttgt tgctaatgtg tgtattggaa tttatttttt 180
ctatatttat tgaattttga tattttattt atgtatatat gtcgtttgag gaaatccatt 240
cctaaacaag gaccgtttac tgaaagaacc taaggttggt ttttttttgc ttttaattac 300
atctagttgt taacttttct tactgataaa gttaatcttt ccgatggttc ataccaacca 360
ccgttagcag aaggacattt agcaacagtt gcataattat aatcactggt gaaagtatat 420
gaccaacctg gtgtaatagt tgtcttttgt tgtgtagatt ttctaaaagt tgaataatat 480
gttactgcac tagttggata tttcatagac caagttgtcc ttgtgacagt tgtatttggt 540
tgcatttgca ttggagcaaa tctacttacg cctgtttgta aagtatatgg tacagtgaat 600
gaagcactat caattgaatc tgaaccacct tgatcaggta aattatattg tgcagttgga 660
gcttctgtac ctgaataagt attagtacca acggtacctg ccatattttt caattcaaat 720
ctaggactat aatgaattgc ataattatta ctattaacta aaccattaat ttgaaggaaa 780
aattgacctg agccaacaac agaacttgaa aaggtaacat tataattata taatgttgta 840
tcttcaccat cattactagt actagtttta aattctgcag gacttaattt attttcaata 900
gtatatggac aattcattgc agtatttgga cctgtacata atttaatttt gaataatgta 960
acatctgtga ttgttggata atctgtatta tcagaaaatt gaatcggaat ggtgacttca 1020
ccagaacttg gtttataact ggtaccttct tccggtccaa caatagagat atcacccatt 1080
gtaaagagac ataattgtgc taacacaaca ataactgatt ggaataacat tttttttgct 1140
ttctattgta aaatacgtga ttttgatatt aagctaacga aagaaataaa gattcgatac 1200
tcgttctgaa ctgttttatc ttatatacta gacctgttaa ctgatacgac gtaccttgta 1260
gagtttactt gtattaaacg tcaacaaatt tcttctcact tatattcagt ttgctgagcc 1320
tacatttaca tctatgttta ctccctcagc atataaaatt tctcactgca ggaaagcgag 1380
aaacagtttt ttcctcttga cagatcgcgt aaaaatatct gatttgtgac ttggcacagt 1440
gtggtgtacg gttttctccc accacgagaa gttctcaacg atcagctcag ctggaactga 1500
cactcttgat taaagcaacc catatgttat ctacatcgtg ctactattat atatacggtc 1560
atacatagct atttatagtt ggactttgtc gtaacatcga tattgatgtt ccgtatgtat 1620
gtacatagta aaatttgcaa cttttgctct atatttttat catcttgtac gtagtagcaa 1680
ttaccgttca agagcgcgtt cggaccaaca aaaatttggg acaccgggct ggaacccaaa 1740
gagaaaaaac aacgagatta ttgattggcc ttctgagca 1779
<210> 41
<211> 3714
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c4552_g7_i1
<400> 41
tatatatata tatatatata agagcttgca atattgtatt gtgtgaaaat attgaattta 60
tatttatatt tctttacctt tactcaactc tgtcaaatac atacacacac acacatatac 120
atacatatat atataaatat acttacagaa atatggtgga acaatatagt gttattgttg 180
gtaaagcaga aaatgaacat gaaactgcac caagaagaaa tagtagattt aagaaggctc 240
ccttagtgag accaattggt atgaaatgta atacagttta tgaattctta gttgaaattt 300
ttaataaaaa taaatcaggt caagcaatgg gttggagaga tacaattgat attcatgaag 360
agattaaaat tattaataaa gttattgatg gtaaaaatat accaactgag aaaacttggt 420
tatattatga aatgtcacct tataattata atagttataa tgaattaatg gatattatgc 480
atgatttagg tagaggttta attaaaatgg gtttaaaacc tgaatctgaa gataaattac 540
atattttcgc ttcaacttca cataaatgga tgaaaatgtt tttaggtgca caatctcaag 600
ctattccaat tgttaccgct tatgatactt taggtgaatc tggtttaact cattcaatgg 660
ttcaaactgg tacaaatgca gtatttacag ataataatct tttatcaaaa ttaatcaatc 720
ccttgaagaa agccacagaa attaaataca ttattcattc tgaaaagatt aatccaaagg 780
ataaaagaca aaatggtaaa atgtttaaag ttgctaatga cgcaattgaa aagattaaag 840
aaattagacc agatattaaa attttcactt ttgatgaaat tattaaaatg ggtcaagatt 900
caaaacatga aatcgatatt catcctccaa ctcctgaaga tttatgttgt atcatgtata 960
cttcaggttc aacaggtgat ccaaagggtg tcgtattaaa acattcaact gtaacagcag 1020
gtatcggtgg tgtcggtagt acagtttatg gtttcatggg tccagaagat agtatcattg 1080
cattcttacc tttagcacat attttcgaat tagtcttcga attagaatgt ttctattggg 1140
gtgctaccat tgggtatggt accgttaaga cactttcagc tcaatcaatg cgtaattgtc 1200
aaggtgattt acaagaattt aaaccaactt taatggtcgg tgtcgcagct gtatgggaaa 1260
ctatcagaaa gggtattctt gcacaattaa gtcaacaacc agctattgta caaaaaattt 1320
tctggacagc ctataataca aagactactt tgaaaaaatt ccatttacca ggtggtgacg 1380
ccattggtag attaatcttt aaaaaagtta aagaagctac aggtggtcgt ttgaaattta 1440
tgtgtaatgg tggttctcca attagtttag acgctcaagt cttcttatct aatattttat 1500
gtccaatgtt aattggttac ggtttaactg aaactgtcgc taatactact gttactcaac 1560
ctgatagatt tgaatttggt gtagcaggtg atttagctgg taccattacc gctaaattag 1620
tcgacgttga agaattaggt tatttcgcca aaaataatca aggtgaatta tggttaaagg 1680
gtgcttgtgt cttaccagaa tattacaaga atcctgaaga aacagaaaag gctttaacta 1740
aagatggttg gttcaagact ggtgatatcg cagaatggac cgctaatggt catttaaaga 1800
ttattgatag aaagaagaat ctagttaaga caatgaatgg tgaatacatt gctttggaaa 1860
aattagaatc tatctataga tcaaacaaat atgttatgaa catttgttgt tacgctgatc 1920
aaactaaagt taaagctgtt ggtattgtcg tgcctgtatt cccacaatta gctaaattag 1980
ctgtatctct aggtataatg aagcaaggtg aagatgtgga acaatatgtt gacaatccaa 2040
agttggccaa tgctgtatta gccgatatgt taaaaactgg tagagatcaa ggtttagcag 2100
gtattgaact attacaaggt gttgttctat ttgatgatga gtggacccca gagaatggtt 2160
atgttacttc cgcacaaaaa ttaaagagaa aggatatctt acacgctgtt caaaagagag 2220
ttgataaggt ttatctaaca aaataaattt aatccgtcac ttacatcttc gcataattaa 2280
tcactaccca tatattccta aacacatttc ttttattaac ctccccattt aactcattca 2340
tatcatgtta acgatttctc gttaggaata acatttttaa gatacatttg tttatccatt 2400
tgtaaaaaca aaagaaacat tttaaaaagt ttcaattcat attagcattc acttatattt 2460
ataataagtt tttcaagttt tttttttact ttacgtactt atatagaaac aagaataaaa 2520
aaaagtaata tcaataataa aaaaagattt atcccaattt tgaaactaca aactattatt 2580
atttcaatag atgagcgatt aaatgggaca atgactctat gacttcttgg cccgtatacc 2640
attcatgttc caaaataatt catactttcc actcgaacca atacatatca ataccaacac 2700
aaaagagagg tcctccaaag cacataaaac attaaatcat ttcaacatgt aactatccaa 2760
aaaagtaatc aatttcacca aatgatagct ttcaaataca tatccatcac atatcaacat 2820
tcaaacaagt atcaagtcat agcatattca ataaataaaa acaaccaaaa ttcaaagaat 2880
gataccttaa atttgaatca ttaatattgc aaagttatgg aaatgaaagc tttgaaacaa 2940
tataacgacg cacagtaacc ttctataagc ttacaatcat aaggatgaaa aactcataga 3000
tatgaatcgt aagtgaatct tgtctgaata cgaaacctat gactctttct tgtttgaaac 3060
acgtatgata cggacagtat gcattgagaa aaatccaggt tgattgacta tttctgaata 3120
atcacaattg aatgttgaaa ctataatgca aacgactagg agaataaaaa tctcagttgt 3180
ggatacagta tagcagtatt tgattacgag acattctttc gtgagatcaa atcttatatg 3240
gtttggaata caaaccatat acagcagtcc atcaataaat aattagttac cccttttaaa 3300
agaaagaaca agataatcca agagttttaa aatgaagtta ttgacatgga ttggtctttt 3360
atagttggcg tctcattgat gacttcaaaa ttttggaagc ttcccgcttc ccgctgaaaa 3420
gttgaaaaat ttttcgagat gtaagggtaa cgttcaaaat gattagaacg ttaggttata 3480
acgaatatcg aacaacaaag taacaagtga tgtcagaaca tttggagggt gaaccatata 3540
taccgattgt tttgctattt attagacatt acaaattgaa cgtgtctcta aataagtact 3600
atatacagaa ttccataatg cacattgttc agtatccttc tctacatttt ttgaatgttt 3660
tggattgact ttacattcga acgctgtaat agaagcgtaa ttccaaggtt tggc 3714
<210> 42
<211> 6997
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c5170_g1_i1
<400> 42
tcgggtgaat atatgttcca acacattatg acactgcgga acaattgatg cccaaattat 60
gtggggataa gcctatatag tggtcttgga ataggacatt caagaatatg acatattaat 120
taaatcatcc aattcggaat cgtatgaaac agagtcttga gaaatcgaac ttctgcataa 180
atattttctt ccgagggaga ggaagaacaa acatctgcat aacaatatta cattttaata 240
aaaccgacca aaacaacact gagactaaat aatgatcaac tagctagaca aagaagttct 300
gttagttacg aactgtttga aacaattctt gctgcgtggt ctctataata cagttcgacg 360
aatatatttg gttccgccat tgtattgtac agcgttccca tacccgatca acatgcaaaa 420
aagatattca aactacagtc ttacaaaccg accacacctt cacaaagcat tgtttatatt 480
tttagacatt ccaaaactgt tgacatttta taactaatag atcaattaag cttggaagga 540
ctgtcattca atattcaaca gattgttcca atataaagaa gtgaatatac attaaaagaa 600
aaaaaaagaa ttataattac aaaagatatt tagttattaa tctatattag gtggtggcta 660
atttatataa taaaaattag tcataaatag aaatataaaa cattatagaa cgagatcatt 720
cactcgaatt aaaatgacag gaaaaatgtc gatcgttctt tatgttggaa aggaaaaacg 780
gaatcataaa gattagttgt tttggattac attagcaatt gaattagatt ccgaatagtt 840
cattgctcca aattaaatga aaaaaagaag tcttttagga aagacaaata ttacttctta 900
aatgagaaca tcctcttata cattggctta tcatccgtta aggctgcatc aacatcataa 960
ttagcgtctc ttcttgcagc tgggacccag gagcctgact tccatggtag aacaccttct 1020
aaccacattt cgttaacttc atctaaggta agacccttag tttctggaac gaagaagaaa 1080
acaaaaaggg ctgcaaaaat catacatccc ataaagacgt acccgtaaca gaaaccaata 1140
gcaccagcaa tgaaagaggt aaagaatgaa attaagaaat tccataacca attggaaccg 1200
gcagaaagag ccataccctt ggctttgact ctcagtggaa aattttcggc tgcaacaatc 1260
caagcaattg gagcccaagt acaaccgaag gagaaaatga agaaacagga gaagacaatc 1320
atacagttac cagcaccctt tgatgatggt tcactcttac cgttagggta taatctcttg 1380
acaccaacag atgcgaacac aacgaaacaa caaaccatgc agatagcacc ccataataaa 1440
caggtacgac gtccaaatct atcgacaata tacaatgctg ggaatgtgga gaagaaagcg 1500
acgacaccaa agacaattgc tgtttggtat gtatcatcta aaccgacagc tttaaaaata 1560
gtagtaccat aatagaagaa atagttacaa ccacttaatt gttgcaaaga gttaatgata 1620
cagcacatga ttaaacgatg cagaatctta ccctttggtg aaaacatttc tggccaactg 1680
gcggtaccac tagctctttc agcctcaata ctagctgcaa taatatcgac ttctctttga 1740
acaccaggat cttcagcact aactttattg gaaatggcaa tggatttttt agcttcttct 1800
actctaccaa catccattag ataacgagga gattcgggga caaaaagcat agcacatgcc 1860
ataaataatg cccaagcaaa tgataaacct aaaggaactc tccattgaac agaatttgaa 1920
taacccttag taccataatt agtacaatca ccaaggaaaa tacccatcgt acacattaat 1980
tgaaaaaagg aacctagagt accacgtaaa tgttttggag cgacctcagt tagcagcata 2040
ggggagaaaa ttgacatccc accgacacca gtacctgcga caattctacc aataaagtat 2100
tggtaccact tatcaataga agcgatctgg atcacaatac caataatata aataaaagcc 2160
gtgactgcca aggccttctt acgaccaact ttattggcga tgtcacctaa agtaatacaa 2220
ccgattaaac ccccaatgtt gaatatagaa acgattaaac ctgttctaac attggaaaag 2280
tagaaactcc catctttacg tctagaggca aacctcttaa cataatcagg gtgggctaag 2340
aaaccaccaa tcgtacctgt atcccatcct gagatgaaac caccaaatgc aattaaaata 2400
caccagaatg tgataccaat atatgcactc ccaggtttaa ttggaatttc gacctccagg 2460
ttcccaatat cttctaaact attatcggta gcaccgttat cggtttctaa tttatgactg 2520
tctgctgtta gtcttgagtg tgcatctccc tccccagatt gtgatggatt cgtgcttgta 2580
aacgtggagt cggctggact tccgtctata atattttgtt cagacattat taaatattaa 2640
ttttctagat gtcagacaat tagtaaagca aatgaaaaaa aaaaattgtt ttgttatttt 2700
ttttgattaa ctatatttta tgttatatgt tataataaac tgaaattttg caataaaaaa 2760
tgattaaaaa tacgaagtga agtaaaagag aaaataataa tgatagtgat agtttcaaat 2820
caatttatga atgagaatat aatttctttt tatattaaag ttatattaga aagttatgaa 2880
aaagaaaata aaaattgaag aaataaataa taatattgat gtctaagacg tttcatgata 2940
gacatattct aggatgatca tttcaacaga attacatcct actatattca aagacaatta 3000
gattacttat ttttgcatga aataaagtat cttgctgcag tatcaacatt ttcttcgcat 3060
ttttgtggaa agagaacatg ataactaata tttcaaagac ccacatatgt tcgagtgtca 3120
agatgtttca caagtttaga tgagaagata agatgacaat atcctttgaa tatattagca 3180
tgaaataagt tattactgta caaaagcaag gattacaggg taaaccttaa ttgttggttg 3240
agtgacgtta tattttcata aggttgacga ggttttctat acttctaccg gaataggaac 3300
tattataacg tgttcgtata ataacctttc taagacagta attttaaaaa taaataaaaa 3360
agagcagaag cgaaaggcaa ctgtattttc acaaaaacaa tattcacaaa tcccccctct 3420
tgaacccctt tcactactat tgaaagcaac ttcatattgt gatggttggt agatattttc 3480
tgaattatat ttcttagaaa tacaattgcg ctattatcat ctaaggagta ccaatgtggg 3540
gcatgcatta tgaatgcggg gtttacgctt tcatgtgaaa tagcagggtg aataataaaa 3600
catagcttcc tttttctcct atatgcaact gtaaatgaac gcaatttgca ctatatcgat 3660
ttcctaatta ttacaattgt ttttgtagct gtaccttatc tgcatcagtg gagcttaaaa 3720
agaaaaaaaa aagttaaatc gtcaatccat aacacgactg tgctacgcta tgatcagtgg 3780
gagaaatttc atgaaatgat tatttcatct acaacaatat accacgttat ggaacgttat 3840
gcagaaatcc gataaaatgc acattaattt attatctaat gtctctattg ttttagaaca 3900
actgaatcat aatattgttg aaagcggttg aagagtacac agtatcatac atccatcttt 3960
ttttagcact attgcatagt tatactgtta ttaccaaaat atctgaataa cttacgtcaa 4020
agatatttca aaaaagaaag gaaaaaaaaa aaaaagacat gaaaggagca tttggaacac 4080
gcaattttga atctaaataa catagaggca gctggtagca ccacttactc gaaaatatta 4140
gcacaatatt taaaataccg tgcactactt ggaatataca gtaaaatgag atacaaactc 4200
gtataaacta gtcatgtttt gtacttcata ttaactacaa acaaatattt gataataatt 4260
atacatccgt acagatatga cagcccagaa tacatgtgat tgttagttat cgctgaaaaa 4320
tctaactgtt gaacttggtg atcatggtcc gtaagattat tacttaatct ggagagggga 4380
aggtgaaata agaatatggt tgtaccgtat tttgtattag ttactgtaaa ttattgcggg 4440
gataataacc ttaatgctaa agttttctta gagattcctc caatgaacga aaagagaaaa 4500
catgaaggaa ctcctctgtc gtatggtaaa agcaaggata gttttcttac attgttaatt 4560
ttaacatcga acccatctta aacgtgaaaa ttttgtgcct agtaaatact cttaatagta 4620
aatagagctt aaataacccc tatagatttt tctatcacat ttatgattta gggtagtaga 4680
aaattaacct agaaagaaca tgtgttgtta caactggaac ctgaaacata tattacaagg 4740
ataatatcag gtatctcatt gacgttattc tcctaaattt actgcacaat aaagaatatt 4800
aacttacaga atcaaatgtc cagaacttga ggagatgtcc cgttatatca aacttaaaaa 4860
aattctaact taaggctctt agcgtatcct gataattaaa aagtttattt aatcaaacga 4920
acgcaaataa ttaccatcct aaataggaaa cgcacgtcag tatcgtgatg tgcgctcttg 4980
ttttcttacg tacgtacact attctcctac atgctaagac ccgtacgctg ttcttctcat 5040
atttgtttat ttcgataaaa tagggagaca catctttttg cacaggaatg ccgcacgtag 5100
atcataattt ccccgggaaa tgccgaaatg gcaaaaaagc aacgacgatg tgcagtgagt 5160
gataaacgtc gtttcttttt tttcagcaaa gaaaagcaac tgccgtaaat ctttagttcg 5220
agataccaca gagagaactg caatgcattt aacgcagata tcctgtgcga gcgtacgttc 5280
aagacgctat tatgcatgtt tgctccgaat ggtggggtag cgtgaagaat gttttccttt 5340
aaataaagat tcttaaacat gttagaggta attttcttac gacagggcca gaacaagatt 5400
ttgctgtctc gatgacaagg cagtgaaata agagacggaa caaaaatttc atacagaaca 5460
gtaagccaag actgctctcg tagtcacttt tttctatttt ctataatgtt tatcagaatt 5520
tgaggatgaa aacctacctt tttgaaaagt ttaaggataa taccacatgc tgacgtatat 5580
attccccaca acacttgata atttcttatt gtttattaag gggtaccatt aaatcacaga 5640
taaagaacag cgctatccgt tatttggtaa taagtataac atcgccatct gtgaggtggg 5700
atgctcaaaa cgactattag ataaattttt aggtacttca tatttggaaa gtggtcccat 5760
attatcatct gcttcagtgg actcactttg atcccatcat agagtgaagg acaaagtatt 5820
atcgagtata ttcacgactg tcacgaactt aaatcacata cattgtacta ttatttcaag 5880
taatatagta caacaactga ttggaataaa tgaattgcat cacatcagtt gagtgcgttt 5940
tttatgtttt tatataatcg ctattttcat atattacttt aagaaataaa aaaaatataa 6000
aagggctcct aatttcaaca tattcatgat tttaatttac ctgttgccat ttatttagtt 6060
tacatctttt agaaagttat ttctatataa acaaaaaaat tataatatca ataattactt 6120
ttttaaaaca ttaacaaaca acaaacaact gcattatgag tgctccaaag aaatttctac 6180
tttttggtga ttctattact gaatttacgt atgaaccaga tcaatggtgt ttaggtcctg 6240
ctttgcaaaa cgtttatgcc agaaagatgg atattatcca aagaggttac attggttaca 6300
attcacgttg ggctctacat atccttcctg aattattaga tggtattgga aaggtcgatt 6360
tagcttatgt tttcttcggt actaatgatt gtatgccaag tggtcaattc gctgttcctc 6420
ttgaagaata cttagacaat atgaagaaga ttgtcacatt aatgactgct agaggtatta 6480
aggtcattgt tattggtacc tctttgattg agttagatag atggaacgaa ttaaatccat 6540
ctgaaaatgc cactggttta attcgtaaca ctgaatccca aaagttcttc ggtgacaaat 6600
taagagagtt atgtaaacaa gaaaactacg tctttgttga cttatacaag aagtttaccg 6660
aacttggtgg tgcagaatgg aagagtttat taaaggatgg tttacatttt aatagtttcg 6720
gttacaaaat tttctatgat gaattattgc gtgtaattaa agaaaattac ccagaattcc 6780
atccttctaa catgcactat caatatccag aatgttacat tattgacagg gacatgaaga 6840
acttggtcat ttaatgagat tacatagtaa accaaactat ccatgacgca aaattattat 6900
tatttactac cctttttact tatcctttta cttaatgatt tatactctgg tttcgatact 6960
agatattttt ttcccaatcc taattttact tctttgc 6997
<210> 43
<211> 2575
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c2749_g2_i1
<400> 43
gtaaaatatg tcatattttc aacgtcccca aatggtgcag gaactgctcc cagtgtattg 60
ttcaatttcg gaaatatatt atcagttctt tgaccatttc ggaaataaca atttgatcat 120
ttgctctggt acacgtaata atcttaagaa ttatcgttga aagcctcaaa cttaactttc 180
ggccgaaatg cctcacttgg aatggtgatt aagtgatgag aaagaagaaa cgataacaat 240
aatcaaatat ctccgaactc ttcggccgaa caacacacag agagatgatc gtacaggaac 300
catacagatg agaaactgtt tccatgcaga tcttttcgat ttacttaatt aacaaaagat 360
tttcataaat tcttcattcc gcataaaaaa gggagagaac acttagatat tcaggttatc 420
ttattgaaaa attttatata aagacgacga atattttcaa atatttcaag tttcaattaa 480
ttctcttgct tctcttttct gtataaaata atataataat ttcgacaact tataagggta 540
aaacactcat ataaaacaaa aactgcaatt attttaacta tttcattata aaaaaaatgg 600
cttatccaga aactttttca ggtatcgcaa tcctagataa caaggattac actcatccga 660
agaaggttga cttcgaacca aaggtgtttg gcgatcacga tattgactta aaggtcgaat 720
gttgtggtgt ctgtggttca gatcatcata tggcctgtgg tgcttggggt gaatccgtta 780
agccaactgt tctaggtcac gaagttattg gtaccgtcgt taaattaggt ccaaaatgta 840
acacaggtct aaagatcggt gaccgtgttg gtgttggtgc tcaagctttt gcttgtttgg 900
aatgtgagcg ttgtaagtct gacaacgaac aatattgtag aaagggtgtt tggactatcg 960
gtgctcctta tgctgatgga tattccagta aaggtggttt tggtaactat gttagattac 1020
atgaacattt tgctgttcca attccagaag gtttagattc cgctacaatt gctccattat 1080
tgtgtggtgg tatcactgtt tactccccat tattgcgtaa tggttgtggt ccaggtaaga 1140
aagtcggtat catgggtatc ggtggtattg gtcacatggg tatcatgtta gcaaaagcta 1200
tgggtgcaga agtgtatgca atctctagat ctaacgcaaa gaaggatgat tccttcaaat 1260
taggtgcaga tcattatatc gctaccaagg aagagccaga ttgggccact aaatatgatg 1320
acactctgga tttagttgtc atttgtgccg gttcattgac agatattgat tttaatgttt 1380
taccaaaggt tatgaaaatc ggaggtaaga ttatttctat agctgcacca gatgcctctg 1440
aaaagttaga aatgagtccg tttggtttgt taggtgtctc tattgccaat tctggtattg 1500
gttccgtcaa agaaatcaag caattactac aattagccaa ggataaggat atcaaaccat 1560
gggttgaaca agttccaatg ggtgaagatg ctttaggtca agtttttgct agaatggata 1620
agggtgatgt cagatacagg tttaccatgg ttgactatga caaggtcttt taatttaaaa 1680
caatagttta ccattgatca gtcatgactc tttatgaaac gctacatcct taaataaata 1740
gctaaactaa ttatttatat acatatatat atatattgac atacttataa tgtacattac 1800
atattaccat tccatatagg aatgtcattc gaagtattgt tttgcaaagt gatacattgt 1860
aaactgagct aataaaatgt attttgaaaa tccctatcat ttttcaaagg tctgtaaacg 1920
atgaaaagat aaacagaagt acgaataacc gaacatttgg aattctaatt atgtcgtact 1980
gtttttaata atgcattata ttttgtacac catatgatgt tatatttcaa gaaggaaacg 2040
gcgtgacata aagtttgaag ggacggtatg ttatgtatgt gcttaaagtt taattaatta 2100
cctatggaaa tgaatttaat tttggtagtg atcaagcatg atagctccat tggtgacaac 2160
agaactttat gtcttccgca actgctaaaa tgtatctaaa ggtcgaatcg ttcaatcttg 2220
ataacattta aactgtgttc agcaactgca atgaattcgg tatgtcatct gattcagttt 2280
aatggtcgtt caatatgggg tcaacgaatt tttaacttga cagacaaaca tatgataata 2340
tacgaactag ttacgtctct tgtaagggtt ataaagagga gcaggtatca tctactgttt 2400
ttaatcacta ttgttctttt gtttacacgg tttttacatt ttgtcttctc ctggacatct 2460
tacaaggcga taaaaatatt atatttcact tgtataaaaa aaaaaatatt tccaaaaaaa 2520
aaggaataat atttttatat ttacatcttg tagtttatat acatgtaatt taaac 2575
<210> 44
<211> 412
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c2564_g1_i1
<400> 44
tttaattaat ttattttctt ataattatta ttctttttct ttttttgcta ataaaccaaa 60
gagaattaca tttactcata attattttca tcaactatgt cagaatattt agatttagtt 120
aaaagaggtg gtaacgaagc cttaaaggtt aacggtccag ccaaggctga tttccacatc 180
acagacaggg gttcagattg gttattcacc gtcttctgta tctacacttt tgcctgtatt 240
gttgctattc tattaatgtt tagaaaacca gcaaatgaaa gatttgttta ctatactgta 300
atccttccat atgcttgtat ggctgtcaac tacttcacaa tggcttccaa tttaggttgg 360
gctccagtcg ttgctttata caaccgtcac agagtctcta ctcaaactac tc 412
<210> 45
<211> 677
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c2992_g1_i1
<400> 45
ggtttatctg aaggtggtaa cgttatcgtc cctgattctg aacatatttt ctacggtatc 60
attgatttaa tttacttatg tttcttacct gccgtttggt tagttttcgt ctcctatgtc 120
ggtttagaca agatgggtct agacaccctt ggtgccccat ctgatttgga accattacca 180
actgtcgctt ccactactag tgttgcctca aagaaatctg aatcatcctc cgctggtgaa 240
ggtgaagaaa agaagaagtc caagtcacca ttgaacaagt taaagaagtc caagaaggct 300
gacgaagaat aaatttaatt tagctttctt tgcctaattt tttttttttt caataaatga 360
aatacgatca cccttttttt aaatccctta atgtgtttat atttttttgt tactattatt 420
attcaactct aatttccttc tataagaaga aacaaaaata tcatatatcc cccacactca 480
agtagaatct tttgaaaaga ttcactacaa acgtatatcc attaccccct aattattatt 540
tcttaatgtt taatattctc aatttttttt ctctcacaca ctatactaaa ggaaactttt 600
taattaaaat ttccaattca tcattaatat ccaataaaaa aaattgttta atttttctat 660
ttaaagtttc ttctata 677
<210> 46
<211> 2141
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c4074_g1_i1
<400> 46
tctctaagta ctacgtacat agtgataaac gggcacgtgc gtcacgtgat attgtgatgc 60
agggcacgtg acatggcgtc ataaaaatcc gaacttcaaa gggaaaaggc gtttttaacg 120
accaaaagca gaagcaacaa tttggcaaag taaaccagaa gttatatatg cttgtgttat 180
tcgttactgc taccgttatt atagagcgct ggtgctatcc tcatttgtat atgtatgtat 240
gtacatactg ctttgtgcct ttcttttggg tttggcccac ttcataaaca cagtcagatt 300
ttgcactgtc actcaccacc acagtcagca gtagcaccgt ctgttgttct tattattgcg 360
caaacaataa gtcgatgaat ggaaataata tcgcatttta tattcaagta tataaggccc 420
gttaaacgat aacagatcaa tcttgatgag aaaggagcca ggtttcttct attttcaata 480
ttgttctttc ttccaactga tataactcat caatcataca attcataatg cctgcaccac 540
atggaggaca attacaagat cttgttgcaa gagactacga aaagcgtgac aatcttttac 600
aagaagctac caatgccact ttaaaacaat ggatcctaac tgaacgtcaa ttatgtgata 660
tcgaactgat cctaaatggt gggttctctc cattaactgg gttcctatcc caaaaggatt 720
atgactcagt tgtactaacc agtcgtttat ccgatggtac cttatggcca atgccaatta 780
ctctagatat taatgattcc aacttcacag attccattaa atccggtgac agaattgtcc 840
tatcacaaaa cggtgagatt ccaattgcga tcctaaccgt ttcagatatc tggcaaccag 900
ataaatccat cgaagccaag aatgtcttcc gtggtgatcc agaacatcca gctattaaat 960
atttgtttga aaccgctggt gatcactacg tcggtggttc attagaatgt attcaattac 1020
ctatccatta cgactaccca ggtcaaagac gtactcctgc acaactaaga gcagaattcg 1080
attctcgtca ttgggataga atcgttgcat tccaaactag aaatccaatg catagagcac 1140
acagagaatt aacagttaga gccgctagag aaactaacgc taagatcttg attcaccctg 1200
ttgtaggtct aactaaacct ggtgatatcg atcatcacac cagagttcgt gcttataacg 1260
aaattgttaa aagataccct gctggtatgg ccctgctatc attattacca ttagccatga 1320
gaatggctgg tgatcgtgaa gctttatggc atgcaatcat tagaaagaat tatggtgcta 1380
atcatttcat tgtcggtaga gatcatgcag gtccaggtaa aaactcaaag ggtgtagatt 1440
tctacggtcc ttatgatgct caagagttag tcgaatctca tcgtgatgaa ttacaaatca 1500
ctgtggttcc attcagaatg gttacttatt taccagatga ggacagatac gctccaatcg 1560
atacagtcga tactaccacc acaagaactt taaacatcag tggtactgaa ttaagacgtc 1620
gtttaagagt aggtgcctct atcccagaat ggttctccta tccagaagtt gttaagatcc 1680
taagagaatc caacccacca agaccaaaac aaggttttgc catttcttta ttatctgacg 1740
acatacctgt ttcaactaat caattatcca ttgcattgtt atctatcttc ttacaatttg 1800
gtggtggtag atactataag atccttgaac gtgacgatta cactgatgat ctattagttg 1860
aattgattaa cgatttcgtt aaagccggta ctggtctaat catcaaaaga gacgtcccag 1920
taaagaacct aaccaacgtt tacactgttg gtaagaccga tgacgcagac attaagatca 1980
atgaaactga tggtacagta tttgatatcg ttcaaagagt tgtcttattc ctagaagaca 2040
acggttacat caatttctaa tccctcatat ataatataca tataaaaaaa taatgataaa 2100
aatgtttctt tctgtaaaat aaatcaataa ttcaagtttc t 2141
<210> 47
<211> 5350
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c4925_g1_i1
<400> 47
attaaacaaa aagaaaagag aaaatagtag ataaaagtat ttccatcatt tcacgaataa 60
ttcccctttt taaaaattta taaaacacct tctccctttt ataaccttct ttaactaaca 120
aacaaacaaa taaacaagaa aagatggact taactccaag aattgtttat aagaattacg 180
tattcaatga gagtgcaaac tattcccaat acggtaataa tatgattgcc caacatcaac 240
caggtatgca accggtaaat gtaaattatt tacctttacc aaacgttcca ataataactt 300
atcgtaataa tgatggttcg attaatgcta cagctgctgg ttcagctgtt tcttcaactg 360
caccaggtgt gtatccaagt tattatcaac cagctattgc tgcttctgct tcccctcctt 420
atcaatttaa tgctaatatt cctcaaacta ccttttatga cactcaacaa caacaacaaa 480
ctgtcaataa taataataat aacacacttc cacgtgttaa ttctggctca aattttagat 540
tgccatctat ctccagtatt atggggtcaa ataacaacat taacaatacc agtactaaag 600
ctacaaacaa tgatgtcatg attgatcatg catttgaaaa taataaggat aattattctt 660
caagtaaatc ttcaccaatt acacctagaa ataacgttat ggttatggtt gctaatccat 720
tagttctatc aaatgtagta acaccgattt atcaacaaaa gattcctgca ggtcctttga 780
caccaccaat gtcagttcct cattctccaa tgtcttctga aggtattgaa aataaagttt 840
ccgccgatgc tgagaatact actacgcctt cggctggtag acaatcggct ttaaagatta 900
ccgaagttaa gccagaaaga aaatctagaa gaaagaagtt tatttgtgat ggtattggta 960
aacatttaag tccagaagtt agacaaaaaa aagaatgtcc aatttgtggt aagaaatgtt 1020
ccagaccttc aactttaaag actcattatt tgattcatac aggtgataat cctttctgtt 1080
gtactagacc tggttgtaat aaaagtttta atgttaagag taatttgcaa agacacatta 1140
gaagtcacga taagaaacta tcgaaaactt taaaacaatc aacacaaatt ccgattcagt 1200
taccatgtcc acatatgtac taaggaaggg tgctccaaat gaggaaacat aaaaagagag 1260
gatgaaaaaa aaaagaggat tgaaatagga aattgagaaa caaacgtaca aattggcaat 1320
aatcaagaaa aaattgaaaa tgaggagaac atgcatacta atgcgcctaa aaaggaatga 1380
cgctactgga aaccaaaatc ggatgaatat aaaaatggat ccttaaaagt ttttagccgt 1440
aaagaatatt ttgttgaata tactttaaga aagtttttaa agtttagaca ttagaggttt 1500
gaagtatcat gataaagtgg tcattgaatt agtcagtgca aagtcaaata tatttataat 1560
tgtgttaatc aactatattc tgtattgttc cgaaaggagt atcatataag gctgtgcact 1620
aagatgtcta aaatttaagt ttcaagtttg agattttcct tttcattgtt ctggtttcca 1680
taattaaagt ttcaggaatc acatttctct actttcgttt ttctgttctt atcattaaaa 1740
tccgtttttt gtttatttca tttttataat aaagtttgtt atacttcaat atatcctttc 1800
aacccctccc tttcatttga atttgctaca taaataacaa taaataatag aatatatacg 1860
taattgaaat tcaattcaaa taaattcaca ttcaaaattt ttaataatct gaaaatcaag 1920
gataagtaaa tgaggtactg taccttacat taaaagttgc tagatggact gaaggattgt 1980
tgttgaagat cctattgttt tgggatactg tattgttccc gatgatacta ttgttttgca 2040
ccattgtatt gttatccata ttaccttttt ggtaattgtt gagcttgcca cactttcgat 2100
aaccatgcaa attaaacttt gttttagcag gcatttttga tcaggggcag tacgtactcg 2160
aagtgacaaa ctttattcgt tatcaagtaa ggtatcatat ttaatgataa cagactagaa 2220
taaatgtttt ttgaataata catcccacta accttaagtt ctcacttgca atagagtcga 2280
tgagatactt cgcgtatgtt acgaatactt caagataatc ctgataattt tttccctttt 2340
tttttgagga gactccaatg aaggtcattg gaaattcttt ttactgccag ttcccttttt 2400
attttttaca catttatgtg ccctttcgag actgattatt cctcattttt ttggattttt 2460
gagttctaag ttataacctc cttgcttagg gacatttaag ttgaagactt tctatcattt 2520
aacaattaaa tggttttcaa tcacttataa agatgagtaa gctaagttat atggctagac 2580
ttttcaaacg accccacctc atcgataaag tcaatcgaat cttcttttct tctttgatgc 2640
aaaagtgatg atgtcagatt atacttttct ttataattct tctgtcattt taatcattct 2700
actgcttttt attgaatcta gttctcagtt ttctctacgt acaggcttta tgcagatatt 2760
aatccttcta catatcttcg gagactattg tgaattgaaa gcaaaggaaa cacatagaga 2820
ctattttgcc acttgctttt gcgagacatc ggtatctaac ggttttcatt tttttcggac 2880
ccaggatgac agcgtaggaa atgcacctgg acccttttca cgcacgaagc acataacgga 2940
tattgttttt ttcttcaact tcttaaaagc cgatgtaatt attatagaaa taaatatttt 3000
actatggggt ttttactcgg aattttaaat attgcttaat cgcagatctt aaatttttat 3060
gtccattttt acaacaagtg ggtctaaaaa ttttcaccca atacgtattt tttcctgctt 3120
ggtattatcg gagattttta cgttttgaat tagaagcaaa aaaaattttt ttcgatcaga 3180
ggaatacaga ggtaacagta ctttcttgta aaatgaatct ggagaatttg tttggaattt 3240
atcacaaaat ttacgtaccc cactttattt acgtagctga atctaaagtc tttgaaatga 3300
tactgtgcat atatgatgta gtttctatct catcgaagaa gtattattat tattagtaat 3360
gatctgatat gtggctagat ttacggagtt aaaagacttt aaaattaata tagtaaacta 3420
ttactattca atactagaag taataatatc catgcacatg cttcattttt atgctatatc 3480
ttaatgatat cattataaca tatctcaatg atatcattat aatgctaacc ctcaaaactt 3540
taaactcgaa ttgagagtag aaagttagcg aataactatg tagtatgtta taacctttat 3600
agtttatcaa aacaatgacg aaatatttag taatttattt ccttgaagga aattaattat 3660
gattaataaa tccaaaaata atattgatca acagtaacca gggaccttcc taaagcaaaa 3720
aaaagaaaac aacagctagg caaacatcga attttcgtca ttagttcgtc aataaggatt 3780
attgaccgaa caagaaaaaa aaagaaatta tagttgatga accaataatg accaaatatt 3840
aagaaactgt catcggttac aatttccccc aagtaaattg aaaggtgttt cttgccgaaa 3900
catcaaaaag caaaaaaaaa actacgtact gaaagatcga taggctggaa ttcaatttaa 3960
ttaaaaaaaa aaaaaaagca aacaactgtt gcgtcactat aaatcacacg tacaaagaac 4020
aatggtattt ttctaagttt aaagaacaac gaaaaaaaaa ttgtttgagg aggagcggca 4080
gtctttttca cagaacaata aagacataat attatgaatt atgcacaccg tactttgtac 4140
attgtgtcca tatgtgaaat aaaatagatt tactccagaa ttcttgggta gtagtgtttg 4200
atattacgac tatacttttt ccttcttaaa actattggaa acatcgaaaa taattgtatc 4260
acaatatcaa attggtaaca aatcgttgag gaaagatatc ttaaagtttt agtaaaaagg 4320
tgcaacttga ttgaaaaaga aataagacag aatatttttg ccaagtttag gagacattta 4380
gaaaagaaat tcttttttca agactcaacc aaaagaaaga agaacaggtt ctgcgaaagc 4440
atcatgtttt aatttcatta gttgtgtacg gtaatttgtc ttttcaaatt ttttttacgc 4500
tcaaccccaa taaaataaaa aaagatcatc ttagatagga ggatgtgaca cttgtccagc 4560
tttgtgtgga tttttactca agaattattt taactaaaat aaatatattt tagatgcaaa 4620
tttttactgc cgtttctttt ttgttgtctt gttcgctctt tctattcttc tattacagta 4680
aatcttggta gtcacacttt taaaggaaca aataaacttt aatagaatca tgtttaattc 4740
agaaagatgc gctgcaacat tggtctccga aatttcattg aggaaattcc gatgatcttt 4800
aattcctctg tcagaaattt gaaagaacta cgtaaatttc ttgtaggccg cttttttttt 4860
gttacaataa gtaaaatgga gaaaactatg gagaattaca gaagcgaaaa aaaaacccaa 4920
aatctttatt ctaaatttat gagtatgtat agtttttttt cctgatactg cttttagttc 4980
tttctcccat gatcagttag gaacgacata catttacgtc tattttaccc ttttaagtga 5040
cttacagcga aaaatgcgta ataggatttg tacaataaat atatcgcttt tttcagttaa 5100
ttaatttgtt cttcctgcat taaataaaag ggtaattacc gtagcattaa attgattttt 5160
tattttttta cagataatat gtcgaagtta tattacgtat ttacatataa cgataattcc 5220
tcgtgtttcc aaaggaagtc cgcaagtcga gtttaccatt tagtaaaaaa tttatataat 5280
gattggaaat ttaataatca agttcgtttt ctactacctt ttaaccacta tattataata 5340
tcatattgta 5350
<210> 48
<211> 3493
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c4569_g1_i1
<400> 48
atctatcaat tctcttaaag aacccatgta cagtacgtaa gttgacggta cttgacttgt 60
tcattccttg gattaacttg ttttccattg ctataataac aacaacttca tcattatgta 120
tcttttaatc ccttcttgct tcttttagtg aacaagttat attatgaaga agataatgat 180
gattgattga ttgattactt tgtggcgtta acaagagaaa atctactatt tgatgatgat 240
gatataataa actacctgtc atcgcaaatt ttaattcaac attccataat aacctctaag 300
aagccaaaaa aacaaagttt aaaatgtttt ccgggaaacg taaaaggtct attaagagag 360
agagatggca gtaatgataa tttatatgat caaattgatt aaatttaact tatatacaca 420
tcgcgtcatt taaattttgt ttcttctttt taatcttctt aacattctcg tttacatcta 480
tccatatata tatatataac aaatatttta ttgtaaatta aaataaattt gttgatccct 540
tgaaatatca tttctattta gtttacattt ttatttattt tattcattcg acatctcaat 600
atatttaaag atcttctttt agtttgataa ataaaaataa ttagaagcaa tcataattat 660
tatattacca aggatcatat cgttctatat tgaagaagcg acaggaaagg agagaagaaa 720
aaaaaagaaa aaaagaaaat acttttttag ataatctatt gtgtgcgtga cttctaccgg 780
aaaattctta tttgataata ataataaata caatcatcat catatcccaa cgtaaggaga 840
tacaaaagga aaacacacta ttattatata aagaagtgtc atattatttt agcttatttt 900
tttcacatta tatcccttgg aagactatac ttcatttaat ttaattaatt aaaaactagc 960
caaatcaaag gaattacaca tatacataac actgccaaag taaaaaaaaa aatgcatgaa 1020
acgttagggc aggtcatttg gatcgctgtt aaacctatta ttaaaattta tttaattatt 1080
ggtgtcggtt tcggtctttg taaaatgggt atcttaacag ctgatgcaac aagaagtata 1140
tcggatattg ttttaactgt tcttttacca tctttatcat ttaataaaat tgttggtaat 1200
atcgaagata atgatattaa attcgttggt atcatttgtt taacgtctgt tttaattttc 1260
ggtacaggtt tattttttgc ctacgtaatt aagaaaactt tacctgtacc aaaagcttgg 1320
ggtggtggta tcctagctgg tggtatgttc ccaaatattt cagatttacc aattgcttat 1380
ttacaaactt tagatcaaag ttcaatgttt acaacagaag aaggtaataa aggtgttgct 1440
aatgttatta ttttcttagc aatgttctta ttctgtgtct tcaatttagg tgggttccgt 1500
cttattgaaa atgatttcaa ttacaaagat gaagaaagtg gtgttagaga gaatgaatta 1560
caagataatg ataattcatc aaatgtttcc ccattagatt ctatcccaga agaagcagat 1620
gaagagaaaa atggtttaca tacttcttct tcgtcatctg gtttaagtaa aaaatctcaa 1680
tctgttgtca atggtgaaaa aaataatatt agtagtaatt cagctactcc atctgcacat 1740
aatattccaa ttaacaataa taaagctgtt aataatggtt cagaagataa catggaaaac 1800
tcaactgata atgatgattt aggtgacatg catatggaag atgacttagg taatgaagaa 1860
caatccgttc aatcatctat tgctacctct attaactctc aagtttctgc aggtgattat 1920
aaccgtgaat taggtattcc agctgcaaga agaactttaa gtcaaccagt tgcttacacg 1980
gaggaagaac atagttcctt aggtcgtcgt caaacttaca gtcaatacag tgtaaattca 2040
aatctaaatt taactcctgt tagatcatta gataagcgtg atttaccatc tgaaggttta 2100
gacgatattg ttagagaata ttctaatgtt gatcaatatg gtggtagaag acaatctgtt 2160
gttggttctt tacaaaatga tgatgggtca atcaatgatc aagcttctca tatgtcaagt 2220
ttacaaaaaa ttagatcatc taatttaact aagattttaa cctcagatgc tactgttagt 2280
aagaaggata ttgaagaatc tggtggttct ttaccaaaat gtattcaaaa gttcccatta 2340
actccattca ttgttttctt cttaaagaat tgtttaagac cctgttctat ggctgttatt 2400
gctgctttaa caatcgcatt tatcccttgg gttaaagctt tattcgttac atcaagtcat 2460
actccacata tcagacaagc tccagatgat caacctgcat taagtttctt catggatttc 2520
accagttatg tcggtgcagc ttctgttcca ttcggtttaa ttctattagg tgctacttta 2580
ggtagattaa agattaagaa attatatcca ggtttctgga aatcagctgt cttattagtt 2640
ttcttgagac aatgtattat gccaatcttc ggtgtcctat gggctgatcg tttagttaaa 2700
gcaggttggt tagatagaca aaaggatgaa atgttattat tcgttatgac cattaactgg 2760
gctttaccaa caatgactac tttaatttat ttcaccgcaa gttatactcc attagactgt 2820
gaagatccga ttcaaatgga atgtacggct ttcttcttaa tgttgcaata tccattattg 2880
gttgtcagtc taccattcgt tgtcacatac tacttaaagg tctacttgaa gaagtaggaa 2940
aaataataat cagtaatttt aaaattaata cacaaacgtt acaaaacgta acctcaagtc 3000
aatatattcc cattttatgt ttctttcttt aatcatacgg tcgtttttta attaagatcg 3060
catttcattc aatctcataa ttcataatca tcatattagt taattcatac cttttttttt 3120
attatgtcat taaaaaaaaa ttataatgat gcttatttga attgaataaa tttgaaaact 3180
ccagcacctt tttttatatc cctaatggat aatccacaat catcagttat attttgatgc 3240
atgcataccg atctttatta aatcttaaca aaaaattcca gaaaagaagt atctctacta 3300
acctgatatt tcaagtgaaa agttataata acaatagttt aaagaataaa caaatatata 3360
ataattcata atttcatcat atataaaaac gcctccaatg agcacttaat cgttgggact 3420
tttaatattt aaaaaaaata ataataaaaa aaatatataa taaaatataa taaaaaatat 3480
tttaaataag tta 3493
<210> 49
<211> 5210
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c1919_g1_i1
<400> 49
attcaagaaa agaaacaaaa acgaatgtcc ctcttttatt ttatcctgcg gaattaagtt 60
agagaaatta aattcccctt actctaacgg tattttgtgt gtaggaaatc aataagtgaa 120
aaaataattg ataaaatatt ttgcatatct cagataatct gaaggcatgg caatttttat 180
ttttgttaca taagattact aaaccaagga aagacaaatc aaagttttag attaacaaca 240
aatgaaaact gggtagctct cagcgaaccg agaatggaaa gttacctgtg ttcactttta 300
actcgcacaa ttaattgttt tgcggctttt gaattgttgc atttttggaa gacaagaatt 360
tcagtcccag taaattctct gtaccaaaaa tctaaatttc ttgtcaaagc gaggaaaata 420
aagaaaaatt ccgcgtttgt gttccatata tttactaagt aactcccgta ttgaggttcg 480
gaaatgttag atcttataat attagaagac aaggggaaac aatgatatca tcaggcttct 540
ctctaagttc tttagagaaa tttatcagaa ggtgcttatt atttatattc cttatattac 600
taggatacga aattcccaca tgtaaccttg aaaaaagcat agatgcagtg agtttcttaa 660
ttattcatat ggtgggggag gtctcataat ccactttaaa ataactttgt ttccctatca 720
aattttcaac aatttttgtt catttattat gctatatcat tcaaccttta ttttttgtta 780
caatgaagcc tctctattca ttaagacgtt ttcccaaaga gtccagatat tgaagcagat 840
accttttaaa ataaacattg ctttcctgct ttatcaagga acacctattg tcttgttatt 900
ttctgtgtta ggtattcatg taccagatca tttaattaaa ctataaagtt ttaaatatat 960
tccgtgcggt gattctcctc gagattttca aagatacgca aaataaattc atgtcttaag 1020
tgattaaatt ttttagtgca aagaaaatat agggtaaaat taaagttctg attattttgt 1080
cgtagtacct gctttaactg gcgggactcc cctgatcttg caagacaata gtacgcacta 1140
ttgtctatga aatctttgaa atttccgcag aaattgctta ttaaactatg ttcttctcga 1200
tgtttttcaa tattttcatt accttctttg ctgcgcagta gtgttaagga agaatatatc 1260
tttttttgca ctaacattta ttactacaat tcaccagctc atcgtgcccg ctgaatttat 1320
gctacatgaa gtgctatttt caatctctac caaacattga tatgcctgtg gtggtgttgt 1380
tgatgcttct acagtttaat tttaatctac atgatcagct gagagtacag caggatcatt 1440
ttttgttttt ctaaataatt gcgtatgaat tttaatttta atttgatgca ttaaagaatt 1500
catcacaatt atatatagtt gttcttagca tcacccactc tcacgggaat attgtgttaa 1560
ctagttggaa tacaattata cttttcagtt aaaatttact gctttctata tgtttctaca 1620
gatgatgaag aaaaaaatac aacacagaaa aaagaaaagc acagcatcaa caataacatg 1680
aagagaaaat tctaccaatt catctgattt atattatata aatatatctg ttcttaagtg 1740
ccacaattta ttatgcttat tttaataatc tagaatactg aactttcctt gtatttgaat 1800
aaagggctta atcaacttcc ttactgatat aataatatat tcttagaaca attatacaag 1860
cagaccattt cacctaacaa tctcatctct caaacattcg tgaaataatc attgcaataa 1920
tcaccacatt ggaaacatat aagagattcc ttcttttttc ttcgtttatc agagacaaca 1980
tattcttcat taataaaaat tattaaggtc gttgaattca ttacgactaa aaatcatcaa 2040
ccaatattcg ggttagcata atccataatc atagtcattt agttcaattc atatacaatc 2100
actgaataat attcctagtt gattgatttc attcccaatt ttaacattta tttttctcta 2160
tacatatact accttgttat aaaaatttag agatagattg gaaaatcaat agcatttgtc 2220
agtaattcaa tctacgtttt taaatttcca aataacgtta aattttagaa gttaatggca 2280
aacagtccaa cattattgca ttactaaagc attgttaatt aattgtgaac aacaataaaa 2340
tatatattta tattaacaac cgtcaattat tacactcgat ttgcagttaa acgaaaagga 2400
aaaattaatc tggaatacaa acctcaagat ttcaaaataa gtactttcaa aaaaagaaaa 2460
cagaatcaat tgtaatagtt tgtaaacata tcataatttt atcataaatc gtatgcattt 2520
tcaaacagat atcaggacca attcttttta agatatctaa gcaactttga acttaatata 2580
atttctcaca cacatacacg agaaaacaca cctacaagac aaacaagaaa cccttttaga 2640
agaacacctc acataaaaca tgtcaaaaaa tgttaaagct aaaagcaaaa taaagaataa 2700
ggacaaaaag acacggagta ataacaatac tacatgtaca agtaaagatg atgaattgga 2760
aaataaaatt cacataaaga atttcagatg ggaccctaag gaaagtgtag aattccctgt 2820
atcatatcta agtccatcta tcgtcaaact aacaaatagt ccattagacg attatcagag 2880
atccttcttc agttacgcac tgttggatga taaggaactt gatctgaaca tcgagtatac 2940
gacttaccga acaagtatcg ctgaacaatt catttcaccc atttatcaaa caaaacaaaa 3000
gcgttccaga agaaatggaa gacattctgg tggacatagg agatcaaaac atttactaga 3060
atgttttgaa taccaacttc caaatctaag acagtcattt actgaagaag atggaatcat 3120
aagtccagga aacggtacgc cgtctcccga atcattaata gaaatctaca ggaaaaattt 3180
atgtcttgat agaccaaatg tttatgtcct tgacggaata atcatcaata gcatagaaga 3240
agaatctaaa acttcatcag caggctctga aacaaatagt gacgataaag atacggcgag 3300
caactcaact gaaactagtg acgataaagc attacaaagt tctgacagcc gtgacgcaaa 3360
ccttgaaaat gtagtggaac ccgaaatccc attgttaaaa gataataaat cagacagtat 3420
cagttattca ttaacaaaaa atcaaaaatt taggctacaa aagatggatc acaattctga 3480
gaaaaatcaa aagattataa acccaaacaa ttgcattata tggacattcg aaggtgggta 3540
tgttttttta actggtatat ggagactata ccaagatgtg atgaaaggac tgataacaat 3600
accacgcaaa aattgccatg ataataaaat attacaggaa ctatgtgctg ttgaattcaa 3660
aaatgtttta tcgcatacag ttttcaatat cacaattgat tcagatggta aggtacaaca 3720
ttcgaaaaag aggagttacc cagaatcact aggaaccaat agtcagatag atggtcttga 3780
aacagagcct aatgataaca cggctgatac aacattcgat attttagaag agttcaataa 3840
gttattttcg caatctaaat ccaaatatac tgatctccat tggaattctt taccaagcac 3900
actgagacat gaattgtttg aaagtttcaa agtacatttg atcaaagaaa agaatgttcc 3960
ggctaatttc tttaatggct tcgatatgac tcaattgatt caccgtattc gtggtggtta 4020
tattaaaata caaggaacgt ggattccaat ggaaattgcc aaatctcttt gtattaaatt 4080
ctgtttccca atcagatatt ttttagttcc aatatttggt cctgattttc cagatcagtg 4140
tgctaattgg tttctagata agcaagaaga aaatgttaat gtggtaaaca gtgacacatc 4200
gcacgacatc tcttatagtc ctataactaa gaagagactc cgttctttac caaagttatc 4260
ccagacgtct ttgacagatg gtgatcactt attccatcaa caatatcctt accggtatgg 4320
aaaacaggaa aacagttcac atgttccaat gaacgaccaa ggtttgaatc aacaggcata 4380
tactcaatat ccaaatatta cagatggaaa ttatacgcaa caagcagtga cgtattcaca 4440
aaatattaaa caatatcccg atagaaataa tatagataac gtgcagaaga agctggatgt 4500
ccagaacaga ccacatcttc cacatatcac taacttaatt aactcattaa atggatcgcc 4560
aactcctaga gagaatccgc catcaaggga ggcttttccg caaggaccac aattacagaa 4620
cacaccaaat atcgaaccaa gtaccaatgt gtcaataaat caaaccccac acgaaaatat 4680
gacaagagct attagtcaac cagtttatgc gatgtctgga cattttgata atgctccaag 4740
tcctgcttat catcaccctg tccaatatgg tggcccaact gtattcgagt cctatagtaa 4800
ggagaatgtg aataaccaat atggtggaca tattgtatct gtacccgcta cttcatatcc 4860
acaagttaat cctattatgg gtcacgatcc aaatatgtat gttagaaatc aaatggttcc 4920
acagcaagtt agacaaattc cgacacctgg ccaggttgta tatattaata aaaccccaca 4980
gataatgcat cagcaggtac tacctacaac tattgaacaa ccaaatcaaa taccgataca 5040
ccaactaggt ccacaatatg cagtattaga ggatggaagt agagtacctg ttaattctaa 5100
tagtcaagtt gttatggtgc aacctcagat gcaacctcaa attttgccgc aatatgccaa 5160
caatcctgtg attatcgctc ctgctaataa taataacaat aacaataaca 5210
<210> 50
<211> 2404
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c2966_g3_i1
<400> 50
gtccaatgga gagttccatt aggtttatcc ttcgcatggg ctctacttat gattggtgcc 60
atgttcttcg tcccagaatc tccacgttat ttaatggaag tcggtaagac tgaagaagcc 120
aagagatcca tctctacttc taacaagatt tctgttgatg atccagctgt tcaaagagaa 180
gctgatacca ttgccgctaa cattgaggcc gaaagagctg caggtagcgc tacttgggct 240
gatatgttct ccaccagagg taaggttgtt caacgtctat taatgtgttg tatagttcaa 300
tctttacaac aactgaccgg ctgtaactat ttcttctatt acggtactat tgtcttcaag 360
gctgtcggtt taaacgattc ttatcaaact gctattgttt tcggtattgt caattttgca 420
tctagttttg tttcactgta tgtcgttgat agattcggtc gtcgtgcttg tctaatgtgg 480
ggtgccgccg ctatggtctg ttgttacgtt gtttacgctt ctgtcggtgt tactagacta 540
tatccaaatg gtaagaacga ggcaacttcg aaaggtgctg gtaattgtat gattgtcttc 600
tcatgtttct tcattttttg ttttgcttgt acttgggctc ctatctgttg gattgttgtc 660
tctgaaactt tcccactgaa gattaagcca aagggtatgg ctttagctaa cggttgtaac 720
tggttatgga atttcttaat ttctttcttc accccattca tcactggtgc tattaacttc 780
tattatggtt acgtttttat gggttgtatg gtctttgcag tattttacgt tttcttctgt 840
gtcccagaaa ccaagggttt aactttagaa gaagttaacg aaatgtggga agatggtgtc 900
ttaccatgga aatctacatc ttgggttcca gctgccaaga gaggtgccga ctacgatgcc 960
gatgctgcta aggtcgataa caagccaatg tacaagaaat tcttctaaac aggagtactt 1020
taatgaaaca ttgttagatt ctttaaattc ataccacatg caatatctct ttcctatttt 1080
atccatcttt tttttatttc cagtttagct gttttttcaa ttcatgactt tttggtattg 1140
taagaattct tcaaagaact aacaattaaa tatctaataa cactttgtct tttcataatc 1200
attcgaattt taatttactt tttaacctaa cttaatctat atatacacaa tacaaaattt 1260
attttatcat aaacacgaac atagttatat agtattatct actggaaaca aaacaagtct 1320
cttagtctga cgaagattcc tcctggaggt aactagaggg ctttctgtat tacactaaca 1380
caatctattt ctgagttttc aacgtgttta accaactgta gacaatcctg atgtattgtt 1440
tgtcaaatat taatattaat ccaataggta atttcgggaa aacaaattta gttattcccg 1500
acgtattgtt tgttaattac aatagtagta taatagacaa tctcgggaaa ataaccttag 1560
ataattttga cttattgctt gtttaaataa tacaataggt cgtatcctgg aaaacatcta 1620
ctttaagtta ttccgcacgt tatcgtgaca ttaagataag cggatagatc atttctatcc 1680
tcctgattcc attaacggtt tgatcaagcc actttgataa ttccctttcc agattttgtt 1740
aaaatgagtg tttttgtaaa gtatttttat taattatgtg ctatatatat ttgcacttta 1800
tataagatat atattaaatt taaaggattt aggtcataaa aacgtaaaat aaggaaaaaa 1860
aggtaaagag agctcattga aggcattaaa taaaaagtag gatgattctg cctatcttgt 1920
tatttgtttt gaatctattc tagaataccc tttggaccag tactaccgta tgggtaaatt 1980
tctggttctg gagcatttgg accttcgacg tagtttagta atggagtaaa gatatcccaa 2040
ctgacatcta attcatcatc tctgacataa ttagctctgt tacctaggaa agcatctcta 2100
attaatcttt cgtaagcttc tggaacccag tccttagcgt acttcttgga gtaagtcata 2160
tctaaagaga cttgatgaac gttatcaacg aaacctggag cagtagtgtt gaaagtcatg 2220
tagatcttac gttctggatg gaattgaatg acgaattcat ttggagtaca accagcaaac 2280
ataccactta ctttcttctt gtatttcatt ctaatttcaa ctctatcttc atctaaaccc 2340
ttaccagctc tcatgacaat tggaacgcct tcccaacggt cattgtgaat atccatggtg 2400
attt 2404
<210> 51
<211> 6125
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c6180_g1_i1
<400> 51
cacagagaca gtattagttc tggtctgagt agtttacact tacctagtta cgttatcttt 60
acagaaagat ggagaacgac gattcaaaat tatctgcgat tggtaccaca gaggcccaaa 120
ataccacact gaaaagtaaa tcagaagctc aaaaagttaa gaaaccaaaa tcaaagagac 180
ataaaccaat aagatcatgt tccttttgca ggaaacgtaa gttgaaatgt gaccaaaaga 240
aaccgatatg ctcgagttgt aaatcaagag aactttcaga atgtatctat gcagagaatt 300
caaatagcgg gaatagccca gctaatagca tatcaaatga ttcaagatct agacaaagta 360
gtaatatgaa aagagattct cctggtagtg ccatgtcagg agttgcagaa tattctacag 420
cccctttgag aaacttattg tcaaccatgc aattcccacc aatggaaatt tgtaataata 480
ataattcacg agattcttct gaaaagcaat caagcccaac tacaagtcac ttctcctttt 540
atgatgatcg tctaacgaca gctgttccaa atattacaca tagtatcgct gctagtaatg 600
ctcaaaatac tacaggaaat acaaaatctg atgtagacga acaaatacca aatccattca 660
gaaattatta ttttatacaa tgcaaggata ccggtagaac catatcatat ggtcctacct 720
cgttacgtac ttttatcatg agaaataact ggggttttaa ggataaatat attcaattat 780
ggaaaaagat taaattagaa agaaataatt ggaagaaaaa atatatgact aacaaaaata 840
atgaacttga cttaattgag cttgatctag gtaactcggt atcgatttta aatgatgtgt 900
tgccatgtct acctgattat gattctataa agagctatat caatgatttt tttgatgaca 960
agaattcaaa cctttatgag tgtaatacat tcttagacaa aaggaaaatc ctctatgatt 1020
tagagtttag ttttattcaa aatagagtag gtaatattat ccagctgaaa ccaacagaca 1080
agaaaaatta ttacaagatt gccgttattt taatgatttt agtatttaca aaattcagac 1140
aaaatattcc ggttcaaata ttgagattga tgacatattt gacaggttta gtatccccaa 1200
agactagtta tattgaaaaa tcgcagtttt tattacaaca ggttttttat atatcatatt 1260
ttgcgcaaaa gggtgatgaa acaagtctga taggaataat gtctcagctg actacaagca 1320
caatgacatt aggtttacat ttaaatatta gagagattta taagaatagg gagataatgg 1380
taggaagttg tgaatctatt gaaaatttat ggacctgggt attatatttt gattttgaat 1440
tatcattgag aataggaaaa ccattagata tcccccttga ggtattcaat gaaattaatt 1500
tccaagatga taatagtttg gcattatatg gtgatgtaat gaataatgaa ttcggtttaa 1560
acagggatga tacgagacca gagttttgta tgatgccgaa atcattatca tctttactaa 1620
ataatagccc tggtacaacg aaccaaagtg gaccaaaatt tccatcttca acgaaaccca 1680
gaataagaga ttttacaaag gagaaatcct tctttggtaa aatgagaagg ttccttttct 1740
tggtcagacc aatgcttggt gaattttata agaagacggg tacccctaag cttgttgaac 1800
atgggcaagt tcttttaaaa tttttggaag atgaattgaa accaatcaaa tatgctacag 1860
atccggattt gatatctgaa ttaacctttg gtgatctccg tttaacatta acgattttag 1920
atatcataac tattttttac tcggttggtt ttgtactatt gaatcatcga tctctaatat 1980
tgaaaaatat ttcaattcaa actcacctat tgacatttgc aattttcaaa aatttcgtca 2040
accactgttt caaattagat gaaaaatatt tcccagagat gatccatcca tcttataata 2100
atctaacccc ttatttgact gcctgtttag gcatttcatt acacccggta ttgcaatcac 2160
ttggtgtatt ctatgcgttc ttctttttaa aggcaacatt atttgaaaac ggtatatttg 2220
tttcatatga tatgaccgag gttgaatggg atatgtcttc attcaatgtt ccaactgaca 2280
aatccatctc attgatcact actttcaata tgtacaagaa gatatttgaa gattggataa 2340
gttatgataa gcaaaataaa agaggcttcc aattaaaaaa tcttattcta agatcatatt 2400
caggattaat cttaattaca ctagaaaaaa cctatagaat cattgtcgaa aaggctctag 2460
agtatagaaa aaagattgaa acctcattaa tgactgaagg tggatctaat aaaagagaaa 2520
ataaaaaggg tgtagaaatg tgtgacgagt accgctactc atctaatcct ccaagattag 2580
aagtggactc agttgatggc tcagttggta gtcccggacc cattggcgca agacaagcat 2640
atcatagtta tttacagtac caagtacaac ttcaagaaca aaatcagata ttgggacttc 2700
gtaaacacat cgaatcatta aaagagaagg atagaaatag attgaatctg acaaagggac 2760
ctatgaggaa tattggcaat ggtctcaaca ttgatccaaa ttataatatg gcaggcgtta 2820
acggttatgc tgcggcagct gactctaatg atgcaaatga tggtcatacg atttcctctg 2880
agaatggtgg accaacaatc atctccagca caggtgaaaa gatatcagct tcagaaacag 2940
aaatggcaca aaaattagtt gatgatttct ggtcaagtta taatactggt tgggaaaagt 3000
tgttaaatga ttcggactca cttttccaaa attttgaaga tgaattaaag aatggtatgg 3060
accatccttt tgatttgcaa taaccgaaaa caaaacagtc catagagttg gtgcctcaca 3120
tatataagat ggattttcct atctgcaagg tgtaccctta tgctctacta cccctttgaa 3180
acaataacat tgtaacttct ttcaaaagtg tgtcattcgg atcattttct catgagacga 3240
ttgtccctta ttgtttgcgt aaaggggatt taagggtaca acaggaacat ctcttttata 3300
aatttataga gagcacatta tgtaaataca aaaagtggaa gaagatgctt ctatatatat 3360
ataaataagt agataatttt atgttaaatg ataagtgtct gatgttcttt ggtttttgtt 3420
cctttaaact gtgatgaaac caaactgggc atatctgata gtaatttaat tttgaagatg 3480
tctccacagg ttcaccacaa ggagtacacg ttacaagacc tactagataa gtttcaatac 3540
gtcagagacc ttgattctaa tcctgaaaca aagattatct ctctacttgg gacagttgat 3600
tcacagtccg ctatattgac tgtagagaag acacatttca ttcataatga gaccattaga 3660
aagcagtcta ttcatccccc ttggactaga attaactctt acagaaattc aaactcaaat 3720
aacgaatatc atcctgtaaa gaagcatgat aatgagatta tgatggtatc agatgaggag 3780
ctggggataa gaaagccttc tgctacggag ttttatcttc tgaacggagt tgtcgatttg 3840
aaggaactaa cttcaaatgg gaactattat tgggcgttag cattaatcaa agagaatatt 3900
gatgagaatc ctactgccaa gatcagcttt atatggccag caacagacgt gcatataaga 3960
aggtatgatc aacagaagtt gcaccttgtg aaggaaactc cggatatgta tcaaaggatt 4020
gtcaaaccat tcattactga aatgacttct ggtcacaaat tggattgggt gtacaagatg 4080
ctatacgaaa acacagagga cagcagggtc atatacaagc aatacaatga attacagaag 4140
gatgatgcat tcatcctatt accagacacg agatgggatg gtcagacttt ggaatccctg 4200
tatcttgtag ctttaatgta tagagacgat ataaaatcta ttagagactt tagacctgaa 4260
catagagatt ggttaatccg gataaataag ttattaaaat cggtcatccc accttgctac 4320
aattatgcgg tgcatgccga tgagttacgc atctttattc attatcagcc gtcatactat 4380
catttgcata tccatgtagt tcatatcaaa catccaggac taagtggagg actccatgat 4440
gggaaagcaa ttcagataga tgacgctatt gagcatctga cattcctagg tgcaaatggt 4500
tggatggatg catgcattac atacactatc ggtgaaaatc atccactatg gttaaagggg 4560
ttaaaagatg aagttcaaaa gcaactaaaa gaggcaaatg ttcaggaacc accaccgata 4620
attaacagtc tgtattccct agagaaaact acacgaattg gggcacgagt ctcattatga 4680
tgactatccc ttatccattt ttacaggctg ttatttgatg ttcttatgta aacgtatatc 4740
tatctctggt acctctcaga atttttttgt tgccatcaga taaggattag aaacggtacc 4800
tttatgggtt catatctata gtctatattc taagtattta taaggatagt tttcatgata 4860
ttatcacatt gttttgattc agccaatgta atcattcagg tacatttaaa ttctggtcta 4920
agtatagatg aatggactga acttcttgtt gatttttccg ccaagttcct ctaaagcgag 4980
agcacttctt gaagtatcta tcactaaaaa aaaatatttg aactttaaga cacagttcaa 5040
aagaacaact tgagaagagg aggaaaaggt tcaaaggttg accaacaaaa taacaagaag 5100
atatacagat tactaaacaa agtcaaaata tgtcgacaca ggaatttgga atgtcgcacg 5160
ttagatcttc atctgtatca ttattagcgg aagcaacctc cggagctggc tctgcaggaa 5220
tcaattcgat agaggataaa ctaactagaa ttgaaatata taagaacctt accgattatg 5280
aggatacttt ggcaaaattg attgaatcag ttgataagtt ccatccaaat atgaaatatg 5340
cccaagattt gatacaggca gattttgatt tatttacctc tctggagaca tttgccaaat 5400
atgatgaaat tgataataaa ttgaatttgc tagaagataa gcgtacagct attggtgacc 5460
aaacgaagga tatacttgag attttgaatg aatgtcatga tgatctaaat aatttaccaa 5520
gtttagagca agttgaattc gaaaagaaaa caatattgga acaaagacaa aaagttaact 5580
cgacgatatt gttagattat gccactaaat tatccaaatt tacaaagata ccgcctacat 5640
ttgataaagg tactattgga ccgaataatt ttgtttggcc aggtgatgat gccttaagaa 5700
gaggtatgct agcaatggca tctttgaaca gtgataaatt gaccagaata gcaggagaat 5760
cagatacaaa tgcaaacaca aatcctacgt tagaaaccat ggaaattagc actgaagaga 5820
ataatgaaaa taccagtgat tcagcagatc aaaagcaaga caatactgat gatagaagag 5880
gttcgtttgt atttaatggt aacgataaac cgcatacaag cgaaagaaag gaacagagtg 5940
ataatgaaga catagattta gatctagatt tgttcaatcc tgatgaattt taaaagaaga 6000
aaatatccag gcatttgagg gtatcttgat tctttatttc ttatatatgt gtatatatat 6060
tttataacgt caattttttt tttcataaac cattagaatg ctgtaaacag gggtatcggt 6120
ccctg 6125
<210> 52
<211> 4808
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c494_g1_i1
<400> 52
aactgccggc ctgtcaaatc gagtgaaaaa aaaagagatg tataatggaa ataagaagta 60
aacagtaaca cgcacaccta attgaagata acttgatccg gttaacagga acatttctaa 120
ctttataact agtgaatatc aacgacagtt taaactaaaa aacgacacaa ctctttgaat 180
acccaacatg tctgtgttaa aagcattaag ggggcttcca ttacaccctg atacaattac 240
attaattgaa cgtaacattc tatcaaaccc gaaaacgaaa cctgagtatc aattacaact 300
tcatcaatta ttagaaaggt atgagagtgc taggaaaata acaacaaaaa acaaaaccat 360
cgaacaaata atttacacat catatttcca atggtttaat actgtcccac cttatttgaa 420
agtgtttgag acacgatatg atgatttaca taattattgg cccattgata aggactcaga 480
tcacattcat gatagaaaag taccaatgtt aagagacctt tggttaaaga acgatgaccg 540
cgcagtagat tatacgttgg aatacatgct aaaacaagac tcatgttgcc caacagatat 600
ttttgaacct atatttgggc aatttcaatt tataatgaaa aatccacaaa tacagagacg 660
taagattggt aagacttcta gaattccaat attactgctg ccgttaaatg ttctaggtga 720
agatatcgcc gcgtgtcgat ccaacaactt attgcgaaga caaatcaatg aagtaagaag 780
aatattagtt gttgataatc cgatattaga tgctaatgtg gctgaacaat tgctacaatt 840
atcgaataat tatgacaggt caatggagag aaaggtgtcc agaaggtatt atcttacatc 900
aaaacttgga tatacttgga tcaaaccaaa tagtaatgag tcaaacgaaa atgacaataa 960
tggtaagcct cagttatcga ccctaactgg tgatgcattc acatcagggt tggagttgcc 1020
tcaatttgca tccttcacgt gaatttctga gtaaaaacgt gaaaatcacg tgccattgtg 1080
ctgtttttca aacttttatc tccgactttt ttattgccca attcccacca tctcggccga 1140
aacaatgcaa ctcccacatt tctccggaga gggtgaaaca agcgcagaga ctgctattat 1200
tcacaaggaa cagcgcgtta tttaacggcg gtggtggcgg tggtggtagt ggtgtgttcc 1260
gcgtgtgcca cagttcttgt cgttgcattt agataagcat aatgaggaag aatcttctta 1320
atgcggcgac agtaaatcca ttaacatttt tcacaatgac acacatgaac cacattatca 1380
ttctgcttag ggggaccagt cctgtttgcc agcttaatac attttgaagc cactactgtc 1440
tgagacaaat aattctagat ttgaaggatt gtcatggaaa taaattttat caagaacgat 1500
tgttagtaat acgaatgaca aatcttaaag gaaactaata tataaagaat gatcaaacga 1560
tttgaaaatc tcaattcgaa tgtatttctt atctttaaat tttacttccc acttgttaat 1620
aataaagcca ttcatacgaa ttacatattc ataatcaaaa aacaaaattg agatcatata 1680
ttatttcact tatctgaact atctctaata tatatttata tattaccgct tttttttttt 1740
tacaaaaaga gactgtcacc acatatttca acataaaagg cttggcaagt atagtttatc 1800
tacctttaag tttttgttcg atttatctct aattgttcgc tattcatttt aaaagtcaat 1860
ttcaacatga ctaactatcc agaagacaat caacatcaat tcgatgaaat cgaagaaact 1920
ttagaattac cagattacgg tagtaacaac agttttaacg gagatgcaga tttagacgat 1980
ttagaacaag aatataatca atacaaagat gaagaattcg ctaatccatc ctccaataat 2040
aataataata atattcaaaa tactagtaca agtaacgaca acaataaata ttcaaaaaac 2100
tttgacgaat ccaaattaaa tgctaaaatt agtcaaattt ctttcttaaa cgattctggt 2160
gtcgaaagta ataataccaa taatattaat attccatcat ttcacgaaca tagtctatct 2220
ctacgtgaat attaccgcca tgatttaaaa gaatatttca gttggaaatc agcaggtaat 2280
tactgtcttt ccatcttccc tgttgttaaa tggttaccac attacaacta tatatggttc 2340
attcaagatt taatcgcagg tatcacaatc ggttgcgttc tagtgccaca atctatgtca 2400
tatgctcaaa ttgctacgtt accaccacaa tatggtttat actcttcgtt catcggtgca 2460
ttcgtttatt cactttttgc aacatcaaag gatgtctgta ttggtccagt tgccgtcatg 2520
tctctagaga cagctaaggt cgtcgctaga gtcactgaga aattgtctag tgatactgat 2580
attaccgctc caattatcgc tactacacta gcgttcttat gtggtgtaat tgctttaggt 2640
ggtgggttat taagattagg tttccttgtt gaattaattt cattaaatgc tgtttcaggg 2700
tttatgactg gttctgcatt aaatatcatt tgtggtcaag ttccatcatt gatgggttac 2760
agttcaaaat taaacactag acaatctact tacaaagtta tcattgctgc attgaaacat 2820
ttaccagata ccaaattgga tgctgtattt ggtttaatcc cattattcat tttatatact 2880
tggaaatggt ggtgtaacaa catgggtcct aagcttgctg aaagacattt cggtagaact 2940
aaaccaagat taaatttcta tcttcaaaaa ttttatttct acgcacaagc ttgtagaaac 3000
gctatcgtta ttatagtttt cacttgtatt tcatggtcta ttactagagg taagactaaa 3060
gctgaaagaa agatcaaaat attaggtgct gtgccttctg gattgaaaga tgttggtgtc 3120
tttgaattac gtgatgattt aatgtcaaaa attgctcctg aattaccagc ttctgtcatt 3180
gtcttattat tggaacatat ctcgattgct aagtcatttg gtagagttaa tgactataag 3240
attgttccag atcaagaact aattgctatt ggtgttacta atctattggg tactttcttc 3300
atggcttatc ctgctacagg ttcattttca agatctgcat taaaggctaa atgtgatgtc 3360
aagactccat tctctggtgt tattagtggt gcttgtgtgt tattggcatt gtactgttta 3420
acaagtgcct tctttttcat tccatcagct actttatctg ccgttattat tcatgctgtc 3480
tctgatttaa ttgcttctta tcacacaaca tggaatttct ggaaaatgaa tccattagat 3540
tgtttatgtt tcattgttac agttttcata actgtctttt cgtctattga aaatggtatc 3600
tatttcgcca tgtgttggtc agctgcctta ttgattttga aggtaacgtt cccagccggt 3660
aaattcttag gttacattca aatcgctgaa gttgttaatg gtaatattgt caatgatcca 3720
tcaatcactg tatctgagcc ggtctctgaa aatgaggaag atccagaagt caataagaaa 3780
tcttcaacat ttaataaatt taaaggtaaa atgttttcat catctggtaa atctgcttct 3840
acaaaagaat tcaacagtga tgaatacaaa aagaatttat atgaaacaat gaatgaaagt 3900
gaaactaatg acagtactgc taaattgaac tactacacca aatgggttcc attcgatcat 3960
gcatatacca aagaattaaa cccggattgt aatattattg ctcctccacc tggcgtcatt 4020
gtttacagat taactgacag ttatacatat ttgaattgtt caagacattt tgacatcatc 4080
tttgatgaag ttaagaaaca aactaagaga ggtaaattaa ttcaacattc gaagaaaacc 4140
gatcgtcctt ggaatgatcc aggtccatgg gaacctccaa cgttccacag agctctaatt 4200
aagaaaggta aacaattttt ctcaagaaat aaatctggta atggtgagga tactactgaa 4260
gtagatatcg atgcaaatgt atctttggat aacaatacaa ttgtggatga acgtccatta 4320
ttaaagatca tttgtcttga tttttcacag gtttcacagg tcgatgcaac tgcattacaa 4380
tcactagttg atctaagaaa agccatcaat aaatatgctg atagacaagt tgaattccat 4440
tttgcaggta ttacatctcc atgggtcaaa agaggtttag aaaatatgaa attcggtaaa 4500
gtcaatgaag aattcagcga tgaatctatt attactggtc atactagtta tcatctagcc 4560
agatctcaag aaaagtcaga tgattcaagt aatcttgatg attcatttga agatttagaa 4620
agtagaaaga cttatcaaat taatgttgct acaggtacta atttaccgtt cttccacata 4680
gatattcctg atttctcaca atggaatatt taaaacgtta atttagtcat acatatgttt 4740
tacatatatc actatataca tattctataa taatatcaat atcaagataa aataatacaa 4800
ttacaaag 4808
<210> 53
<211> 3437
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c2966_g2_i1
<400> 53
gtccaatgga gagttccatt aggtttatcc ttcgcatggg ctctacttat gattggtgcc 60
atgttcttcg tcccagaatc tccacgttat ttaatggaag tcggtaagac tgaagaagcc 120
aagagatcta ttgctacttc taacaaggtt tccattgacg atccagctgt caccgctgaa 180
gctgaattga ttgctgccgg tatcgaagcc gaaagagctg caggtagtgc ttcttgggct 240
gatatgttct caagtaacgg taaggttgtt caaagattaa ttatgtgttg tatgctacaa 300
tgtctacaac aattgactgg ttgtaactat ttcttctact acggtaccat tattttccaa 360
gctgtcggtt taaacgattc ttatcaaact gctattgttt tcggtattgt taatttcgcc 420
tctacattcg ttgcattcta cgtcgttgat agattcggtc gtcgtgcttg tctaatgtgg 480
ggtgccgccg ctatggtctg ttgttacgtt gtttacgctt ctgtcggtgt tactagacta 540
tatccaaatg gtaagaacga ggcaacttcg aaaggtgctg gtaattgtat gattgtcttc 600
tcatgtttct tcattttttg ttttgcttgt acttgggctc ctatctgttg gattgttgtc 660
tctgaaactt tcccactgaa gattaagcca aagggtatgg ctttagctaa cggttgtaac 720
tggttatgga atttcttaat ttctttcttc accccattca ttactggtgc tattaacttc 780
tattacggtt acgtcttcat gggctgtatg gttttcgctt acttctacgt cttcttctgt 840
gtcccagaaa ccaagggttt aactttagaa gaagttaacg aaatgtggga agatggtgtc 900
ttaccatgga aatctacatc ttgggttcca gctgccaaga gaggtgctga ctacgatgct 960
gaagctacca aggtcgatga taagccaatg tacaagagaa tgttctctag aaagtaaaca 1020
gatagataat taattatttt ttatccttct gtctttttac aattttaatg aatagaaata 1080
taaactcaaa cacatatttc aaatagtatt atgattttat gatttttttc ccttctttta 1140
caaaaatgat agattgcatg ttttaacaaa aattctttct attgtttcgt tgttttgaca 1200
aaaaaaatcc tttcttttat gatacacgca attttctaac ttaattttaa tatcaacttt 1260
tttaataata ataataattt ctttttaact tataatatcg ttttcttctc aataataatt 1320
atgtaaataa tatatatatt atttgtttcg aacctgctat tatattttaa aaatagatgg 1380
atagcaccgt gccattatca tcacaacttt gaacaataaa accaatatcg aatcttcctg 1440
agttgaataa atccctcatt ttcacatagc aaaaaattat tcaaaattaa tccacataat 1500
actttctttg tctcccccgg attatctcca ttttatgcgt cataccacca aagcaaacag 1560
aactccccgc cataatgcat ccggaaaaaa tgataaattg cttattatct tttctttttc 1620
tttgcgatgt ctcggaattt ctctttcggg accggattaa ttttcaaatt tctttttttt 1680
tgagaaccag aatcacggaa caagataatg gcatgtcgat gatcatttaa tcgaaaatct 1740
gttactaaaa accgactgca gtctctccaa caagaatact tgtatctcct aaaatgttct 1800
ggatgattcc aatatcgata tccacacata ttaaacaaat ttatttgtgt ctttctaaat 1860
ctccatattt gtacagagag ttaaataatg gggttacatg agaatactgt ggggatggac 1920
attatggtac gtattgattc ataaaagttg ctaaattaat cgacttatca ctcagtgatt 1980
ggaaagaatt ctgtttttaa ccgattcact ccttcattaa ggtgcaggtc aatcatcaac 2040
taatttctgg agaaattaaa ctcaagcacc gagaatttcc aaaatttacc acggttagta 2100
tcgacgattg gttcgttgaa aaggtacagt tcaaacttaa aaaggaaccc ggtgaacacc 2160
tatcatgtag tccggtaatg gtcttctaat gctaggtact gagattgttc tttctagatg 2220
cggtatattg tacattttta cgatatttga tttcaatgtt aatctccttt tattgttact 2280
taaagattac aattgggtct ttaatttcat cacaataccg ccgtcgtcac tcctaggttc 2340
cttactgtgc cccacgacgg tctcaaaatg gggaaactgc aaaatactta ttattaaaat 2400
cttcattcat acgactgcca tctaatttaa attcgcttac attaaattat tctgacaaat 2460
gatgcaacag acgccttcat ccaccccgga ttaatgtttc ctaacttatc tttaaacaaa 2520
aattatgaga tatttaattt taaaatctgg ggtatataaa ggtaagaaat aacgaatcat 2580
attgataatg tcaaattatt attttccaaa tcatttttcc ctttaaagtc tctttatatc 2640
agaagttcaa tatactaata aataattaca aatatttatt aacacaaaca aaaagagcat 2700
ttcaaaaatg actgaaacaa attctgttca tgagttggaa aacacaaacg ctttgcctat 2760
taactctgac agtaatacag atactcaatc aaacagtgct tcactaacag attcaaggaa 2820
acaagaattc ggtaatcaag agctagaagg taccgatgga aatcaagaag atttagatat 2880
tccaatcaaa gctgcctctg cttatgtcac catctctatc ttctgtgtta tgatcggttt 2940
cggtggtttc atttcaggtt gggatactgg taccattggt ggtttcttag cccatcctga 3000
ttatttgaaa agatttggtt ccaaacataa ggatggtact tactacttct ctaacgtcag 3060
aactggttta gtcgtctcta ttttcaacat tggtggttta atgggttgtt taatccttgg 3120
tggtctagct aacagaatcg gtcgtaagat ggctctagtc gctgtcactg ccatttacat 3180
ggttggtatc gttattcaaa ttgcttccat taacaaatgg taccaatact tcatcggtag 3240
aattatctca ggtatgggtg tcggttctat ctctatgttt tccccaatgt tactatctga 3300
agttgctcca aagcatttaa gaggtacttt aggttctatt tatcaattaa tgtgtacctt 3360
cggtattttc ttaggtgatt gtactaatta cggtactaaa gcttactcta attctgtcca 3420
atggagagtt ccattag 3437
<210> 54
<211> 2198
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c849_g1_i1
<400> 54
gtttatcatc tttctgatat ttagaaagaa ggaataaaaa gaaaaatgta acatatgtta 60
aaggttgttt tcttattcca cgatcagaag ttaaaaactt taaatatatt gtacctccct 120
cttactacat catcaactat ggagatacta gaaagtttgt tcacgtaatc attttcttaa 180
gaatctgaac gcgttgttct ttttttttct ttttctttct tttggtattg aacgttgcga 240
acaatatact gcagggagta gttagtattc ttttttccat atcttacttt cttccttatt 300
tataactgca gtgctatgct gcattgccag agaaaattac cgaccattca tgtgaggtac 360
cgaatagctt gaaaagtcag aaaccttgaa gaccggatgt caaagtatgt tgttcttatt 420
ttttttaaga ttttttcttt ttatttttaa tctgttgcaa agaattgaag aaactagagg 480
tttccttctt gattatcaca tatttattcc atttaaaatt atgcaatatc ctttggaaaa 540
acccagagta catgtttgtg catatccttt tgtgggtatt tttcacccga aaatatatat 600
ccttaataat aaagttataa ctataatgtt gcatgcaggt tttattgtta acatgtacat 660
tctattttac aaatagcaat taaatccagc ttgtttacgt gcctcaaaaa caaatacagc 720
aaagaaaaaa aatagtattt gactaatgca gtttctgagt gaaaatgatg aaaaatatca 780
tatatataaa cagaacaaga tcttaaatat ttcaataatg gggatgccgg tttatcgaat 840
agataatctt atccgtagta attaatttta cttatataat atcccgtaaa ctagataaaa 900
gataattttt ggttaatatt ataagtgaca atcatcatcc gcttacacat tagaaaagtt 960
attccaaaat tcaaactata agaggacaac aaaaacaatt atatgtattt gtcaaacgtt 1020
ttagtagtta tatatctgta atactatttt attcaaaaga aattcgggga aatataatca 1080
cattatgaaa ggtaataaca aaattacaaa agatagaaac acatcaaagg gtcaatcgag 1140
ttacttagag actttgttgt catcggaagt gacgacaagt ttagacacaa caccattttt 1200
ttgtaatatt ttgacaaatt tatatgaaaa tgttcaactt gatggttcaa ctcacaaatc 1260
tggagttgtt tctgatacaa gaacggttct attacatcaa catcaatttg atggtatctg 1320
cgggtctaat gaagatgaag ttcctaatac ggagcaatct ataacaagcg aattaaaaaa 1380
ttctactcca gattctcgag gccgtgagag tatatcttct tggaaccatt ctactttaac 1440
aaattattca ccactttttt tggaaccaag atcatattcg agcatgagtt cacaatcgtc 1500
attatatgaa gaaaatacac aacaagaaat gcattccaaa aatgaaaccc agatcaatgc 1560
aaactccaaa agtgtacaag attatatatc ttcatatatg gacctaacaa cagatactag 1620
aatttatgat tcaatatcac catcgtattc atcattatta gaaatttctg ataatattac 1680
aaattctagt actaaaaaag acagcgctga aaattcatct accaaaagaa atgggagcat 1740
atctaaatca aatttcccaa aaggcgtcat acatgaatgt aatctatgtg gtaaaagatt 1800
ccaaaggccg tctacattgg agactcacat gaacgttcac tcaggtgaaa agccattttc 1860
atgtcccttt ttagattgta aaaaattatt taatgcaaga tcaaatatgc taagacattt 1920
aaagatgcat ttcaagttag gaaaagggaa atatttgtta ccaaatggcg agatatcatc 1980
tgagaaacct acagctaagc aattagtatg ctttactaac cctgcagcta gcaaagttac 2040
atgagaaact gtccacaaat tttcgaagtc ataatgcatt tcatccatga gtttgatata 2100
tgtgcaacat atctagtttg tcacaaaaca tccagtactt atattcatat atatatatat 2160
aacaacacca ataaatatat tctaattttt attcttca 2198
<210> 55
<211> 2430
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c2966_g1_i1
<400> 55
cccaaatact tatataatat tttctccatt gtcacaagaa ctttcaattt gtattctcat 60
ttagaaatat aacaaaggac acaaattgta taagtattca ttttgacaac aaaagagcta 120
tgctactgga cattttcttt cttttttctt tttgattcaa aagatatcgg cacctaatca 180
ggagacattc atcgtctcct tgttagttca agacaataaa tacttactag ttaccccaga 240
tatatatgtt cattacccca gatataattt acaaccatca tgcatatttc acactagaac 300
ttcatttaac aaatccttga acatacgtaa ggaagtaaaa aaaaaagtta ccccttaaag 360
gacggtccta tcttcttttc tttttaaata acaatacgta aaaaggactt cacgtacatg 420
tattaacccc gcaaaataac attcaataac gattgttcaa ggagttataa cagacaccag 480
attacaatat ccattaattg ttcgaagtga cacgagccgg tataaacaaa taaattattt 540
actctgccga attgtcattg accatcctct ggtggtccca gatgctgatg ctgttgttgg 600
ccccccctcc atagaacaac taataattag ataccgactt ccccacactc atgcgtcacc 660
cactgattta acggaccgaa gttgaatagg aaagaggaag aaattctttc gaccgttcag 720
tggcgtgaca ccgacagcca aaccgttcaa tcaaaaaatg aagccgttaa ttcgattcta 780
ctaacaatcc ccagtacaat cccaccggta gcgtgggggt gagtaatacg gtgtgtctat 840
cagtaacagc ggggcgatct tttttttttt tttcactttt ctttcatcgg tgaatatttc 900
ctgtcctgat gtcccgatta ggacattccg caccgcctgg cccgtacgca atattgttac 960
cgcgtacgca tggtactacg tggcgttgtt acagcgggtg ccccgccccc gcgttgctgc 1020
ttttttcggg ggcgagctta atagcttaaa gtttcttttt gtgcgaggct gcttaccaaa 1080
tgagggcaga aagaaagaaa agaaatcaaa aaaaaagcaa ccacccacga tcacatgccc 1140
gttctcgagg agtttgttag agaattgact tatggaatac ttcaacaaga atacgaatat 1200
catcaattct atgaaaaatg tatctcacgg tgtctcggtg aaattgttat gtcctgcata 1260
taacagtgtc agcagaaacc gagggaaaca tatcgttact cctactactt cgtcgtaacg 1320
tgtccttgaa agaaattaac aaaaaacaag gaaataattc accgacttga tctttctttc 1380
tctctgttcc tctggtgaaa ttatggagaa ttctttgttt ttgttatgtt ggaagtgaaa 1440
agaaattctt tcatatcaat gcagttcagt tgactagtga aacaagtata ttcaaaattg 1500
taacgtttgc tacttttttg atttagttat tttaggaatg tttatttggt ttctggaaac 1560
atataaatac agcaacgata tcaactcaat tttaaaattc aattcaatct tgtctctctc 1620
tttttttaat tcatacttat tttttttctt ttaactataa aaaacccatc aataataact 1680
aataacatta tttaataaat atattcaata tgtctgaagc tcaggttgat cctcaaaacg 1740
agcatccaga aactaatgca atgccttctt catctgacaa caactctgtc ttaactgccg 1800
actccaacaa agtcgacaat gacatgaaga tggaaggtga aaactctagt caagatcaaa 1860
tggttgttga tattccaatc aaaccagctt ccgcttacgt caccatttct atcttctgtg 1920
ttatgatcgg tttcggtggt ttcatcgccg gttgggatac tggtaccatt ggtggtttct 1980
tagcccatcc tgattattta aagagatttg gttccaaaca taaggatggt acttactact 2040
tctctaacgt cagaactggt ttagtcgtct ctattttcaa cattggtggt ttaatgggtt 2100
gtttaatcct tggtggtcta gctaacagaa tcggtcgtaa gatggctcta gtcgctgtca 2160
ctgccattta catggttggt atcgttattc aaattgcttc cattaacaaa tggtaccaat 2220
acttcatcgg tagaattatc tcaggtatgg gtgtcggttc tatctctatg ttttctccta 2280
tgttactatc tgaagttgct ccaaaacatt taagaggtac tttaggttcc atgtatcaat 2340
taatgtgtac cttcggtatt ttcttaggtg attgtactaa ttacggtact aaagcttact 2400
ctaattctgt ccaatggaga gttccattag 2430
<210> 56
<211> 2873
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c2749_g1_i1
<400> 56
tttttttttt tttttttttt tgaatatatt aaatagaata tttcgatgtt attagaattg 60
ataaaccttt cctcaagata tctttaatgc aataacttct tagtttctga tatctgaaac 120
tactagttac aaaaatgcat atctgaaact actagttaca aaaatgcata tctgttgtta 180
gaaatatttg acttagccac attgcactta gcttatatat gtcacataga aagtatcctt 240
gttttagaag tatttgcatt ttagatacat tgatatagtg attattttga agctaggtca 300
tcactaatag gtgattaatt aatgaaatga ttgtttcggc cgaaactttt ttgtagcctg 360
tcacgtgcac ttcggattta acaaagaaaa aaggatttac tacgtacaat atagtgttgg 420
gagttgcctt tattgaagtt actttgacta aattatattt ttatttctga ctattttccg 480
aagtagctgc tattttcatc cattagaaac aagagcaaca aacaattata gctggtggag 540
gtaatgctat tcctgtctac gtacatccat ttcacagaat tatgcttcaa ttgcgattct 600
tcttaatgaa gaccaagtaa atatcgtgac ttgaatagtt gcacgtttaa gctctacata 660
ccatatagca gtatctttgt tagcttggcg gtgtcatatt atatgcgatg acaatacata 720
cttggcaact gggggatact attaaagatc aaggaggcgt aaaatacatc cctactaata 780
ttattactat tatgttgttt ttgtgataac ctattcagaa gtgattttag ggatgctgct 840
ttgacgctcc gtacagttga aggttaacag tctaagttga catgtggcta tacaatatcc 900
ttgtttaaga gattaattca acgaactctt gatgaaataa tcaatcattc gagatttggt 960
ataaaactgt cggccgaata atttccaacc aaagcctctt cttctccgtc ggagatatct 1020
tctactacgg ccaatttaca gtagaaaaaa aaagaaacgg aatcacattt tatatttacg 1080
taggttccag aatatctgtt tccggcacag aaacaactgt acgacagaaa atagcaaaga 1140
tgcctttctc cgagaagaat cctccttcgc ttaaataatt gaaaaattta ttaaggattt 1200
gcttgataca aagccaaggt tctctctgtt ctgttatgtg attgtcttgg aagtataagg 1260
agattcaatg agctttttca aggatgaaaa tgattaatat ataaaggcaa cgaattccta 1320
tgaaatgttt cgatgttatc tagatgtttt cccagttttc tttttgtttt tcgctaaagg 1380
gtcaacgata aaataatatc acaattataa caaatatggc ttatccagaa aatttctcag 1440
gtatcgcaat cgtagataac aaagattata ctcatccaaa gaaagttgat ttcgaaccaa 1500
aggtctttgg cgatcacgac attgatttaa aggtcgaatg ttgcggtgtc tgtggttcag 1560
atcatcataa ggcctgtggt gcgtggggtg aaaccgttaa acctactgtt ttaggtcacg 1620
aaattattgg taccgttgtt aaattgggcc caaaatgtaa ttccggtcta aaaattggtg 1680
accgtgttgg tgtaggtgct caagcattcg cttatttgga ttgtgaccgt tgtaaatctg 1740
ataacgaaca atattgtaga aagtgtgttt gggccatcga ttcagcatat gccgatggtt 1800
accgtagtaa aggtggtttt ggtaactatg ttagattaca tgaacatttt gctgtcccaa 1860
ttccagaagg tttagattct gctaccattg caccattatt atgtggtggt gtcactgttt 1920
actccccatt attacgtaat ggttgtggcc caggtaagaa agttggtatc atgggtattg 1980
gtggtatcgg tcacatgggt atcttgttag caaaagcaat gggtggtgaa gtatacgcaa 2040
tctccagatc caacgcaaag aaggaagatg cctttaagtt aggtgccgat cattatattg 2100
caaccaagga agaaccagat tggactacta aatatgatga taccttagat ttagtcgtca 2160
tctgttcagg ttctttaact gatattgatt taaatgtttt accaaagaca atgaaaattg 2220
gcggtaagat tgtttccatt gctattcctg aagcttccga gaaattagac atgagcccat 2280
ttggtttgtt aggtgtctct attgctaact ctaatattgg ttccgttaag gagatcaaac 2340
aattactaca attagctaag gataagaata tcaaaccttg ggttgagcaa gttccaatgg 2400
gcgaagattc cttaggtcaa gtctttgcta gaatggataa aggtgacgtc agatacagat 2460
ttactatggt cgattatgac aaggtctttt aaaggatgat ttagtatatt cttctgaaaa 2520
aattatcact tttcatatta tatatatata acgttcataa tctactatta ttcaaaaaaa 2580
aatattcttc aataattatt atgagtttat ttacactaaa tacaaaatga aacgtgtata 2640
cagaaattct aaagactttc aagatataat atggccaagt agaattgtgt ccaattactg 2700
gaaactgaaa ttacactgtg tgctatcctt aagtgtataa ttcttcatag taaaaggaaa 2760
taatttcaat atgcaatagt aagtttaata aaccctacac tatatattgt gacttatata 2820
taatatatat aacatttcta ttactatggt tatgtaccta atataactaa aac 2873
<210> 57
<211> 1837
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> c7728_g1_i1
<400> 57
agagaatata gtgggtatga cttaagtgtt gtgaagcgtg ttacatgact tttacaatgt 60
aagcaggtaa tgttttcaga ttttggatat tatctacaga atttcttgat cattattgca 120
tattatcctg agcaatattg ataatttctt aaatcagctg gtaatccagc actttaatag 180
ttgggataca gggacatagg aagtagaaat ataatgcggt tgtttctacc cctataatta 240
gactgcacgt cagtactctt gtctcttttt gttgagaaac aagtcattaa ccccgaaagt 300
tttagtcaac agatgcggga gccctaagaa actgtcaagt tataaacaga tgatcggtaa 360
cgaagtagaa agggaaatta caatatttaa gaagataaca ttatcttaaa ggttcctttt 420
ttggtttaag ttgtcacttg taaacgagag agctagttgt ccttttatat tgactgtaac 480
acttcaaccc ttcagaaaga tgagtaccaa aacaaagatg cagaaaagat cagttaataa 540
aaattcgaac aacattgaaa tcaacgatga cgataaaaca ttgaagaagg ttgtaactca 600
taccacatca accaaacgta agctttacac atggcacgaa atacctgatt ggcagaaaga 660
taatgaatat atccatggtg ggtacgtgaa agaaactaat agtttcactg aatgcatcaa 720
tagtttattc tatattcaca atgagactgt caatatttac agtcacttaa ttcctggttt 780
gatctcctta ggactagtta ctattgacaa atactgtgtt cctaaattca atactacagc 840
aataacagat tatcttttca ttgatttatt cttccttggt gcatttgctt gtctgacgat 900
gagtagtaca ttccattgtc ttaagagtca ttctccaggt gttgccaaat ttggtaataa 960
attagattat ttaggcattg ttgtattgat ttcaacttca atggtaagta ttctttatta 1020
cggcttctat gataattctt ttatgtttta tttgttctca ggaattacat tgatgttcgg 1080
tagcgcatgt gctatcgtta gtctagatga gaaatttcgt acgagggaat ggcgtcctta 1140
tagagccgcc atgtttgtta tgtttggact ttcagctttt ctgccaatag gagcaggtct 1200
catttattac ggttcccatg aaacttggac tagagttcaa ttaaaatgga ttattttaga 1260
aggtgtattt tatatatttg gtgcatttct gtacggaggg agacttcctg aaaagtaccg 1320
tcctggtcac tatgatattt ggggtcattc tcatcaaata ttccatgtct tagttgttgt 1380
tgctgcattg tgccacttaa cgggtcttat tgaaagttac agatatgtcc acacatatat 1440
gattccatta atgatgcaag catgaatttc tattcttcaa agacagtttc catatttttg 1500
tattttctgt ctgtaattgc taatgggaca ctactatata ttaccataat gttagcgtat 1560
ctagcgtatt atttataact tatagaacta gttattgatt tatacgataa acaatatatc 1620
tattaatatt actgaaatac tatttgtgta tcttctaagt acaatcattt agcatttttg 1680
gtgccctaaa atttgtaact ttggtattct caccctacat ggagttaatc cgttgatcaa 1740
atcagctata gatgtcttaa tgaataagtt aatcacaaat acatttaata acaaattgat 1800
ttcaatcagg gttaaaatta aaaaaaaaaa aaaaaaa 1837
<210> 58
<211> 996
<212> DNA
<213> 家牛(Bos taurus)
<400> 58
atggctactt tgaaagatca attgattcaa aatttgttga aagaagaaca tgttccacaa 60
aataaaatta ctattgttgg tgttggtgct gttggtatgg cttgtgctat ttctattttg 120
atgaaagatt tggctgatga agttgctttg gttgatgtta tggaagataa attgaaaggt 180
gaaatgatgg atttgcaaca tggttctttg tttttgagaa ctccaaaaat tgtttctggt 240
aaagattata atgttactgc taattctaga ttggttatta ttactgctgg tgctagacaa 300
caagaaggtg aatctagatt gaatttggtt caaagaaatg ttaatatttt taaatttatt 360
attccaaata ttgttaaata ttctccaaat tgtaaattgt tggttgtttc taatccagtt 420
gatattttga cttatgttgc ttggaaaatt tctggttttc caaaaaatag agttattggt 480
tctggttgta atttggattc tgctagattt agatatttga tgggtgaaag attgggtgtt 540
catccattgt cttgtcatgg ttggattttg ggtgaacatg gtgattcttc tgttccagtt 600
tggtctggtg ttaatgttgc tggtgtttct ttgaaaaatt tgcatccaga attgggtact 660
gatgctgata aagaacaatg gaaagctgtt cataaacaag ttgttgattc tgcttatgaa 720
gttattaaat tgaaaggtta tacttcttgg gctattggtt tgtctgttgc tgatttggct 780
gaatctatta tgaaaaattt gagaagagtt catccaattt ctactatgat taaaggtttg 840
tatggtatta aagaagatgt ttttttgtct gttccatgta ttttgggtca aaatggtatt 900
tctgatgttg ttaaagttac tttgactcat gaagaagaag cttgtttgaa aaaatctgct 960
gatactttgt ggggtattca aaaagaattg caattt 996
<210> 59
<211> 2850
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g3002-1等位基因1的5'UTR
<400> 59
tcaaaaaaaa ttctttggaa gaagaaggat cattgcaata ataatgaaat aaaaagttca 60
acagttgtag tttgtacttg agctattgga caatcattac cactttcata aaaagtcacc 120
aaattaaaga gttaacaaaa atcgtgaacg aactggatta agtttctaaa ttacgttgaa 180
gatagattta ccgaaatgaa aattatgtca tctttctcta caaatttcgg atgtaattac 240
agttttactt tgattaagaa caaaatatac atgttagggt tggtaatcac tttaatgggt 300
ctttgattgt tatctattag ataatacagt aattaatttt caggccacac tttttatcat 360
cagcatacca agaaaactca agagatatgt gacctgttgc atgaggaaga tttcggatat 420
gagcacgcaa ttcatttaat gagatgtatg tattaagaag gtaaaattca atagagaatg 480
cagtttatac ttttagaagg ggtcaccaat tcaaccccaa attaaatgat ttagaaattc 540
aatattattt cactagaaca attcaagcaa tgagtttctt ttatttctag attctattta 600
ttctttcctt tcattaataa ataacaaata aatgctttta ctgttcaaaa aatatgtttc 660
cattacagtg atatattgaa ttagaaatat gacctatgct aattgaggtt tacttaattc 720
aaacatgaat aattcaaatt aagggagtac agttgatgaa aggaatattc tcagtttaga 780
attgttatta acaatatata gttaatagct gcacacccct tttctttatg catatatatt 840
ttcaataaaa gtaattacat aaccacccct taatgagttt ctcgaaattg ttataattta 900
aatttatttg tatataaaag agacgaaccg aattatagta gaaaaaactg aaagttgttc 960
aaaaagtgtc cctgctaaaa aattagcata caaatttgta aattcaaatg gataaataac 1020
aaataaatcg atgtaccgat agaatgcaca tgagtgacat gtctcagtat tttagaatag 1080
aagatagttg attaactaaa taaaggggat aataattttt ttaccatttc acatcagaaa 1140
aataggaaaa aatgataata ttccttgatt tgatttttct cgagaatcga attatgaccc 1200
cactataata tagtgacagc ttcgacatga cttccgaaaa agaaaatatt tcgtaaattc 1260
cctgtatagt gagtgaatct aaagcaacca actagcaaaa ccaacgtcaa gaaataccat 1320
gaaaagttta agaatacaag acctgctcca accctttttt cgtttgtttg ttcagccgtt 1380
cggtacaatt tacgagtttt cataaattat gcaaattaac agtttaccca tcgtgtcttt 1440
gtataactct cactcatcct tcggactctt accgctgctc taattacgta gtaatggatc 1500
aatttctatt gacctttatt taattagaat gattttgtga cgtttttttt tcttagctta 1560
aaaaatacta cgtgcatttt gctaagagcg acggtagaaa cttcaccata gaaaaatatc 1620
tattttagct gtaagaaggg tatttcttct catggttgac aagaaagtaa ttgactggct 1680
ctgtgaaacg ccggttaaga gtatttggtg agccctccag ttatattctt tcaacgtgca 1740
tcagacggtt catacaaaca tggccaaaga aatcgtgtag tagtagcatt catttatctg 1800
tgccttgggc ttcttctttg atgaaataat ggaaaaaaaa gaaatgtgcg cttgctgtgc 1860
ctgacttctt agcttccacg aaaaaatacc cagcgtccac aattaatttc ttttttttat 1920
ttatttatct ggagaacatc tgagtaaaaa aaaaagcggg aagagccaga aatatcgtat 1980
ctctttgaac aggaaattca taaattatgc atttattcat tttctcaaag atttaataaa 2040
aaaacaaaca aacttgaact atgtatattc ttcgcgctgt tttagttccg catatatccc 2100
actcacatta tttttttttt ctctcgtcgc ttcatccaaa ttcgctctgt gtattttatt 2160
atctctttcg gttatttcaa ttttttgcat aatttattca aaactcttaa atttcgaaaa 2220
aatttccacc cataaaaatt attttaattg atccagtaaa attgttgcac agatcgtaaa 2280
tgaaaaatta ttcacatcga tatcgtcttt gttgtaattt tgaatcgtta acaagaaatt 2340
tgttacttga cagcaggata tcagttgttt gttagagaaa ttataagcaa aaaaaaattc 2400
tcaaatttca cgaaagtcaa acagagttag atcaaattta ataatcatca acaggaaaac 2460
aactattttc tgcggataat ttacagtatt cacaatttgc tctcaaagga agtttgtggg 2520
caaatatttc tctttgtgat tgtttaaggg cagaaaaaag taagttgata gaataaaaat 2580
attaacaatt gatgatgttg atgtttgttt gatgtcagtt tggttgtttt actgcataaa 2640
gattgagagg actaagatca tcaaaatgag aaaatttttt ctttttcagt ttacgtatct 2700
gaattaatct ttttttttta atatataagg aacagattgt tttcctattt gaaatgaatt 2760
ctccgtttgt aaattttctc tgttaattgt ttttctctat ttcttgtcaa ttctaagata 2820
accatcctat tcaattatac acatccaatc 2850
<210> 60
<211> 2850
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g3002-1 等位基因2的5'UTR
<400> 60
tcaaaaaaaa ttctttggaa gaagaagaat cattacaata ataataaaat aaaaagttta 60
acagttgtgg tttgtacttg agctattgga caatcattgc cactttcata aaaagtcacc 120
aaattaaaga gttaacaaaa atcatgaacg aagtggataa agtttctaaa ttacgttgaa 180
gataaattta ccgacattga aattatgtca tctttctctt caaatttcgg atgtaattac 240
agttttactt tgattaagaa caaaatatac atgttagggt tggtaatcac tttaatgggt 300
ctttgattgt tatctaatag ataatacagt aattaatttt caggccacac tttttatcat 360
cagcatacca agaatactca agagatatgt gacctgttgc atgagaaaga ttttggatat 420
gaacatgcaa ttcatttaat gagttgtatg tattaagaag ttaaaattca atagagaata 480
cagtttatac ttttagaagg ggtcaccaat tcaaccccac attaaatgat ttagaaattc 540
aatattattt cactagaaca attgaagcaa tgagtttctt ttatttctag attttattta 600
ttctttcctt tcattaataa ataacaaata aatgctttta ctgttcaaaa actatgtttc 660
cattacagtg atatattgaa ttagaaatat gacctatgct aattgaggtt tacttagttc 720
aaacatgaat aattcaaatt aagggagtac agttgatgaa aggaatattc tcagtttaga 780
attgttatta atgaaatatc gttattagtt gcacacccct tttctttatg catatatatt 840
tccaataaat gtaattacat aaccatccct taatgagttt ctcaaaatgc ttatgatcta 900
aatttatttg tatataaaag agacgaaacg aattatagta gaaaaaactg aaagttattc 960
aaaaatcgtc cctgctaaaa attcagcata caaatttgta aattcaaaag gatatatcac 1020
aaataaatcg atgtaccaat agaatacata tgggtggcat ttctcagtat tcgagaatag 1080
aagataattg attatttaaa taaaggggat aattattttt ttaccatttt acaacagaaa 1140
aataggaaaa aatgataata ttccttaatt tgatttttct cgagaatcga attatgaccc 1200
cactataata tagtgacaac ttcgacatga cttccgaaaa agaaaatatt tcataaattc 1260
cctgtatagt gagtgaatct atagcaccca actagcaaaa ccaacgtcaa gaaataccat 1320
gaaaagttta agaatacagg atctgctcca tccttttttt cgtttgtttg ttcagccgtt 1380
cggtacaatt tacgagtttt cataaattat gcaaattaac agtttaccta tcgtgtcttt 1440
gtataactct cactcatcct tcggactctt accgccgctt taattacgta gtaatggatc 1500
aatttctatt gacctttatt taattagaat gattttgtga cgtttttttt tcttagctta 1560
aaaaattcta cgtgcatttt gctaagagcg acggtagaaa cttcaccata gaaaaatatc 1620
tattttagct gtaagaaggg tatttcttct catggttgac aagaaagtaa ttgactggtt 1680
ctgtgaaacg ccggttaaga gtatttggtg agccttccag ttatattctt tcaaaatgca 1740
tcagacggtt cataccaata tggccaaaga aatcgtgtag tagtagcatt catttatctg 1800
tgccttgggc ttcttctttg atgaaataat ggaaaaaaaa gaaatgtgcg cttgttgtgc 1860
ctgacttctt agcttccacg aagaaatacc cagcgtccac aattaatttc ttttttttat 1920
ttatttatct ggagaacatc tgagcaaaaa aaaaagcaag aagagccaga aatatcgtat 1980
ctctttgaac aggaaattca taaattatgc atttattcat tttctcaaag attcaataaa 2040
aaaacaaaca aacttgaact atgtatattg ttcgcgctgt tttagttccg catatatccc 2100
actcacatta tttttttttt ctctcgtcgc ttcatccaaa ttcgctctgt gtattttatt 2160
atcactttcg gttatttcaa ttttttgcat aatttattca tttctgttaa atttcgaaaa 2220
aatatccacc cacaaaaatt attttaactg atccagtaaa attgttgcac agatcgtaaa 2280
tgaaaaatta ttcacatcaa tatcgtcttt gttatatttt tgaatcgtta acaagaaatt 2340
tgttacttga cagcaggata tcagttgttt gttagagaaa ttataagcaa aaaaaaattc 2400
tcaaatttca cgaaagtcaa acagagctag atcaaattta ataataatca acaggaaaac 2460
aattaatttc tgcggataat ttacagtatt cacaatttgc tctcaaagga aatttgtggg 2520
caaatatttc tctttgtgat tgtttaaggg cagaaaagag taagttgata gaataaaaat 2580
attaacaatt gatgatgttg atgtttgttt gatgtcagtt tggttgtttt actgcataaa 2640
gattgagagg actaagatca tcaaaatgag aaaatttttt ctttttcagt ttaaatatct 2700
gaattaatct ttttttttta atatataagg aacagattgt tttcctattt gaaatgaatt 2760
ctccgtttgt aaattttttc tgttaattgt ttttctttat ttcttctcaa ttctaaaata 2820
accatcccat tcaattatac acatccaatc 2850
<210> 61
<211> 922
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g3002-1 等位基因1的3'UTR
<400> 61
taaaaaagct gaaaaagatc ttttaattaa tattttttac gttccatttt ttcatcatac 60
atatcataca tacatcatca ctttataaaa tttaatgaaa tagaatattt attcttttaa 120
tatttatttt tcggctatat taattaaatg ttatgcattt tgttacgtat ttattttatt 180
tacatggtat ttatttaatt gagagcattt gccttatttg ccaaatttaa agaatcatcg 240
atcgaatcat taccatacct cctttctaag attctggtcg cgatttgttg aattgcccat 300
acttcttcct catccaattc caatggtttc ttttcatcgt tgataatttg cagctcctta 360
acgcgattaa gtaatttagc ttttaaacga agaagcagcc ttcttcttaa gaaattttca 420
tctctcttaa cagatgctcc acttacatct tctggtactt taagttcttc ggagatactg 480
tttaataatc tcacagcttc caactgtata tatatctcag aggtgtaatc atgcgcagta 540
atagtggaag catcgaattc tcctgcacat aaacttttca aaaattgatc caatacacgt 600
tctttcaatt ccaaagtgaa tccagtcttc ctattaaaat tatgaaccaa ttggaacata 660
tacagccatg atttgcgtga atatacttga catcccaatt gtttagaata tttaccgtcg 720
tattccaata cgaaattttg gcatgtttcc aatagttccg aagtttttat aggatctgca 780
ttccatctgg ccttagtggc tgctttaata aaaccttgat acattaatcc aatcgtatat 840
acggagacaa atttcttgcc attatcttta gcctcttgga gtgccattgt ttgcttgtct 900
aaataatctg acgtacgtga ct 922
<210> 62
<211> 922
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> g3002-1等位基因2的3'UTR
<400> 62
taagaaagct gagaaaaatc ttttaattaa tattttttac gttccatttt ttcatcatac 60
atatcataca tacatcatca ctttataaaa tttaatgaaa tagaatattt attcttttaa 120
tatttatttt tcggctatat taattaaatg ttaagcattt tgttacgtat ttattatatt 180
tacatggtat ttattcaatt gagagccttt gccttattat ccaaatttaa agaatcatcg 240
ctcaaatcat taccgtacct cctttctaag attctggtcg cgatttgttg aattgcccat 300
gcttcttcct catccaattc aaatggcttc ttttcatcgt tgataagttg caactccttc 360
acacggttaa gtaatttagc ttttaaacgg agaagtagtc ttcttcttaa gaaattttca 420
tctcttttaa gagatgctcc acctacatct tctggtactt taagttcttc cgagatactg 480
tttaataatc tcacagcttc caactgtata tatatctcag aggtataatc atgcgcagta 540
atagtggaag catcgaattc tcctgcacat aaacttttca aaaattgatc caatacacgt 600
tctttcaatt ccaaagtgaa tccagtcttc ctattaaaat tatgaaccaa ttggaacata 660
tacagccatg atttgcgtga atatacttga catcccaatt gtttagaata tttaccgtcg 720
tattccaata cgaaattttg gcatgtttcc aatagttccg aagtttttat aggatctgca 780
ttccatctgg ccttagtggc tgctttaata aaaccttgat acattaatcc aatcgtatat 840
acggagacaa atttcttgcc attatcttta gcctcttgga gtgccattgt ttgcttgtct 900
aaataatctg acgtacgtga ct 922
<210> 63
<211> 19
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 引物
<400> 63
gcaggatatc agttgtttg 19
<210> 64
<211> 18
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 引物
<400> 64
atagagaagc tggaacag 18
<210> 65
<211> 19
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 引物
<400> 65
gcaggatatc agttgtttg 19
<210> 66
<211> 19
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 引物
<400> 66
cagaatctta gaaaggagg 19
<210> 67
<211> 19
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 引物
<400> 67
gcaggatatc agttgtttg 19
<210> 68
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 引物
<400> 68
aataccttgt tgagccatag 20
<210> 69
<211> 18
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 引物
<400> 69
accttcttgt tgtctagc 18
<210> 70
<211> 18
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 引物
<400> 70
ataactcttt cagctggc 18
<210> 71
<211> 19
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 引物
<400> 71
gcaggatatc agttgtttg 19
<210> 72
<211> 19
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 引物
<400> 72
tttcaaacca gtaccacca 19
<210> 73
<211> 19
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 引物
<400> 73
gcaggatatc agttgtttg 19
<210> 74
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 引物
<400> 74
gaagaagaat acaaagcacc 20
<210> 75
<211> 19
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 引物
<400> 75
gcaggatatc agttgtttg 19
<210> 76
<211> 19
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 引物
<400> 76
caccagcttt aacagtaac 19

Claims (13)

1.一种具有乳酸产生能力的重组菌株,所述重组菌株是通过从耐酸酵母YBC菌株(KCTC13508BP)中缺失编码丙酮酸脱羧酶的基因,并在编码丙酮酸脱羧酶的基因的位置引入编码源自表皮葡萄球菌(Staphylococcus epidermidis)的乳酸脱氢酶的基因构建而成。
2.根据权利要求1所述的重组菌株,其中所述编码源自表皮葡萄球菌的乳酸脱氢酶的基因由SEQ ID NO:1表示。
3.根据权利要求1所述的重组菌株,其中编码醇脱氢酶的基因被进一步缺失或失活。
4.根据权利要求1所述的重组菌株,其中编码将磷酸二羟丙酮转化为甘油-3-磷酸的酶的基因被进一步缺失或失活。
5.根据权利要求1所述的重组菌株,其中编码将乳酸盐转化为丙酮酸盐的酶的基因被进一步缺失或失活。
6.一种具有乳酸产生能力的重组菌株,所述重组菌株是通过从耐酸酵母YBC菌株(KCTC13508BP)中缺失编码将磷酸二羟丙酮转化为甘油-3-磷酸的酶的GPD1基因、编码将乳酸盐转化为丙酮酸盐的酶的CYB2基因、编码醇脱氢酶的ADH基因以及编码丙酮酸脱羧酶的PDC基因,并将编码乳酸脱氢酶的基因引入至所述耐酸酵母YBC菌株构建而成,
其中所述编码乳酸脱氢酶的基因被引入缺失的ADH基因的位置、缺失的PDC基因的位置和缺失的GPD1基因的位置处,并且
在所述缺失的PDC基因的位置处引入的所述编码乳酸脱氢酶的基因是编码源自表皮葡萄球菌的乳酸脱氢酶的基因。
7.根据权利要求6所述的重组菌株,其中所述编码源自表皮葡萄球菌的乳酸脱氢酶的基因由SEQ ID NO:1表示。
8.根据权利要求6所述的重组菌株,其中在所述缺失的ADH基因和缺失的GPD1基因的位置处引入的所述编码乳酸脱氢酶的基因源自表皮葡萄球菌或植物乳杆菌(Lactobacillusplantarum)。
9.一种生产具有提高的乳酸耐受性和提高的乳酸产生能力的重组酵母菌株的方法,所述方法包括:
(a)通过从在低浓度乳酸培养基中到在高浓度乳酸培养基中循序地培养具有乳酸产生能力的重组酵母菌株,来诱导所述重组酵母菌株向高乳酸浓度的适应性进化;
(b)选择在所述高浓度乳酸培养基中具有提高的乳酸产生能力的重组酵母菌株;以及
(c)在所选菌株的基因组的PDC基因的位置处引入编码源自表皮葡萄球菌的乳酸脱氢酶的基因。
10.根据权利要求9所述的方法,其中步骤(a)中所述具有乳酸产生能力的重组酵母菌株是通过从耐酸酵母YBC菌株(KCTC13508BP)中缺失编码将磷酸二羟丙酮转化为甘油-3-磷酸的酶的GPD1基因、编码将乳酸盐转化为丙酮酸盐的酶的CYB2基因、编码醇脱氢酶的ADH基因和编码丙酮酸脱羧酶的PDC基因,并将编码乳酸脱氢酶的基因引入至所述耐酸酵母YBC菌株构建而成的YBC5菌株,
其中所述编码乳酸脱氢酶的基因被引入缺失的ADH基因的位置、缺失的PDC基因的位置和缺失的GPD1基因的位置处。
11.一种重组菌株#26-5(登记号:KCTC 14215BP),其通过具有乳酸产生能力的重组菌株在高乳酸浓度下的适应性进化构建而成,所述具有乳酸产生能力的重组菌株通过从耐酸酵母YBC菌株(KCTC13508BP)中缺失编码将磷酸二羟丙酮转化为甘油-3-磷酸的酶的GPD1基因、编码将乳酸盐转化为丙酮酸盐的酶的CYB2基因、编码醇脱氢酶的ADH基因和编码丙酮酸脱羧酶的PDC基因,并将编码乳酸脱氢酶的基因引入至所述耐酸酵母YBC菌株构建而成。
12.一种重组酵母YBC6菌株,所述重组酵母YBC6菌株通过在重组菌株#26-5(登记号:KCTC 14215BP)的基因组的PDC基因的位置处引入编码源自表皮葡萄球菌的乳酸脱氢酶的基因构建而成,与YBC菌株(KCTC13508BP)或YBC5菌株相比,所述重组酵母YBC6菌株具有在高乳酸浓度下提高的乳酸产生能力和被抑制的乙醇和甘油的产生能力。
13.一种生产乳酸的方法,所述方法包括:
(a)培养根据权利要求1至6、11和12中任一项所述的菌株以产生乳酸;和
(b)收集产生的乳酸。
CN202110702592.1A 2020-06-24 2021-06-24 具有提高的乳酸产生能力的重组耐酸酵母 Pending CN113832043A (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR10-2020-0077331 2020-06-24
KR1020200077331A KR20210158676A (ko) 2020-06-24 2020-06-24 젖산 생산능이 증가된 재조합 내산성 효모

Publications (1)

Publication Number Publication Date
CN113832043A true CN113832043A (zh) 2021-12-24

Family

ID=76859416

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110702592.1A Pending CN113832043A (zh) 2020-06-24 2021-06-24 具有提高的乳酸产生能力的重组耐酸酵母

Country Status (5)

Country Link
US (1) US11898173B2 (zh)
EP (1) EP3929282A3 (zh)
JP (1) JP2022008224A (zh)
KR (1) KR20210158676A (zh)
CN (1) CN113832043A (zh)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102140596B1 (ko) 2018-04-17 2020-08-04 에스케이이노베이션 주식회사 유기산 내성 효모 유래 신규 프로모터 및 이를 이용한 목적유전자의 발현방법
KR20210041903A (ko) 2019-10-08 2021-04-16 에스케이이노베이션 주식회사 락테이트 대사 및 알코올 생성이 억제된 재조합 내산성 효모 및 이를 이용한 젖산의 제조방법
KR20210158676A (ko) * 2020-06-24 2021-12-31 에스케이이노베이션 주식회사 젖산 생산능이 증가된 재조합 내산성 효모
KR20220064647A (ko) 2020-11-12 2022-05-19 에스케이이노베이션 주식회사 내산성 효모 유전자 기반 합성 프로모터

Family Cites Families (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IT1294728B1 (it) 1997-09-12 1999-04-12 Biopolo S C A R L Ceppi di lievito per la riproduzione di acido lattico
JP2001204464A (ja) 2000-01-27 2001-07-31 Toyota Motor Corp 乳酸の製造方法
US7141410B2 (en) 2000-11-22 2006-11-28 Natureworks Llc Methods and materials for the production of organic products in cells of Candida species
AU2003240481B8 (en) 2002-05-30 2008-03-20 Natureworks Llc Methods and materials for the production of D-lactic acid in yeast
JP4095889B2 (ja) 2002-12-13 2008-06-04 トヨタ自動車株式会社 高光学純度な乳酸の製造方法
JP4460876B2 (ja) 2003-11-07 2010-05-12 株式会社豊田中央研究所 有機酸存在下におけるプロモーター及びその利用
US20050112737A1 (en) 2003-11-20 2005-05-26 A. E. Staley Manufacturing Co. Lactic acid producing yeast
JP4700395B2 (ja) 2005-04-13 2011-06-15 株式会社豊田中央研究所 酸性条件下で使用するためのプロモーター及びその利用
JP4692173B2 (ja) 2005-09-13 2011-06-01 東レ株式会社 D−乳酸デヒドロゲナーゼ活性を有するポリペプチド、これをコードする遺伝子およびd−乳酸の製造方法
WO2007117282A2 (en) 2005-11-23 2007-10-18 Natureworks Llc Lactic acid-producing yeast cells having nonfunctional l-or d-lactate: ferricytochrome c oxidoreductase gene
PL2586313T3 (pl) 2006-03-13 2017-06-30 Cargill, Incorporated Sposób fermentacji przy użyciu komórek drożdży mających zaburzony szlak od fosforanu dihydroksyacetonu do glicerolu
EP2060632A1 (en) 2007-10-29 2009-05-20 Technische Universität Berlin Method of modifying a yeast cell for the production of ethanol
JP2010075171A (ja) 2008-08-25 2010-04-08 Kirin Holdings Co Ltd キャンディダ・ユティリスによる高効率乳酸製造法
IN2012DN01521A (zh) 2009-08-21 2015-06-05 Asahi Glass Co Ltd
EP2480673B1 (en) 2009-09-27 2018-05-23 OPX Biotechnologies, Inc. Method for producing 3-hydroxypropionic acid and other products
JP2012061006A (ja) 2011-12-22 2012-03-29 Toyota Motor Corp 耐酸性微生物を用いた有機酸及びアルコールの製造方法
KR101576186B1 (ko) 2012-06-26 2015-12-10 한국생명공학연구원 에탄올 생산 경로가 봉쇄된 클루이베로마이세스 막시아누스 균주 및 이의 용도
KR102155697B1 (ko) 2013-02-05 2020-09-14 삼성전자주식회사 젖산트랜스포터 유전자가 과발현되고, 젖산 분해가 억제된 미생물 및 이를 이용한 젖산 생산 방법
KR102144998B1 (ko) 2013-08-30 2020-08-14 삼성전자주식회사 효모에 내산성을 부여하는 폴리펩티드, 그를 코딩하는 폴리뉴클레오티드, 그 양이 증가되어 있는 효모 세포, 상기 효모 세포를 이용한 산물의 생산 방법 및 내산성 효모 세포를 생산하는 방법
EP2873725B1 (en) 2013-11-15 2016-09-07 Samsung Electronics Co., Ltd Genetically Engineered Yeast Cell Producing Lactate Including Acetaldehyde Dehydrogenase, Method of Producing Yeast Cell, and Method of Producing Lactate Using the Same
KR20150064802A (ko) 2013-12-03 2015-06-12 삼성전자주식회사 글리세롤-3-포스페이트 데히드로게나제가 불활성화되고 글리세르알데히드-3-포스페이트 데히드로게나제가 활성화된 효모 세포 및 그를 이용한 락테이트를 생산하는 방법
KR102163724B1 (ko) 2014-02-13 2020-10-08 삼성전자주식회사 내산성을 갖는 효모 세포 및 이의 용도
KR101577134B1 (ko) 2014-05-09 2015-12-14 씨제이제일제당 (주) 젖산 생산이 향상된 미생물 및 이를 이용하여 젖산을 생산하는 방법
CN107709540B (zh) 2014-06-20 2021-04-09 韩国生命工学研究院 库德里阿兹威氏毕赤酵母ng7微生物及其用途
KR102277898B1 (ko) 2014-07-03 2021-07-15 삼성전자주식회사 산물 생산능이 향상된 효모 및 그를 이용한 산물을 생산하는 방법
KR102227975B1 (ko) 2014-07-24 2021-03-15 삼성전자주식회사 방사선 감수성 보완 키나아제의 활성이 증가되도록 유전적으로 조작된, 내산성을 갖는 효모 세포 및 그를 이용하여 락테이트를 생산하는 방법
KR20160012561A (ko) 2014-07-24 2016-02-03 삼성전자주식회사 Erg5의 활성이 증가되도록 유전적으로 조작된, 내산성을 갖는 효모 세포 및 그를 이용하여 락테이트를 생산하는 방법
US10704064B2 (en) 2014-08-29 2020-07-07 Sk Innovation Co., Ltd. Recombinant yeast producing 3-hydroxypropionic acid and method for producing 3-hydroxypropionic acid using the same
WO2016056566A1 (ja) 2014-10-10 2016-04-14 旭硝子株式会社 形質転換体およびその製造方法、ならびに乳酸の製造方法
KR102303832B1 (ko) 2015-05-12 2021-09-17 삼성전자주식회사 내산성을 갖는 효모 세포, 상기 효모 세포를 제조하는 방법 및 이의 용도
KR101704212B1 (ko) 2015-06-12 2017-02-08 씨제이제일제당 (주) 젖산을 생산하는 미생물 및 이를 이용한 젖산 제조 방법
KR20170008151A (ko) 2015-07-13 2017-01-23 에스케이이노베이션 주식회사 메틸말로닐-CoA 리덕테이즈 코딩 유전자를 함유하는 미생물 변이체 및 이의 용도
KR101759673B1 (ko) 2015-12-28 2017-07-31 서울대학교산학협력단 생장 속도가 증대된 유전적으로 조작된 효모 세포 및 그를 사용하여 목적 물질을 생산하는 방법
KR101965364B1 (ko) 2016-08-03 2019-04-03 한국생명공학연구원 Upc2를 발현하는 재조합 균주 및 이의 용도
KR101903382B1 (ko) 2016-10-24 2018-10-02 와토스코리아 주식회사 앵글밸브
JP6779435B2 (ja) 2017-02-27 2020-11-04 株式会社白石中央研究所 炭酸カルシウム多孔質焼結体の製造方法
KR102140596B1 (ko) 2018-04-17 2020-08-04 에스케이이노베이션 주식회사 유기산 내성 효모 유래 신규 프로모터 및 이를 이용한 목적유전자의 발현방법
KR102140597B1 (ko) 2018-04-17 2020-08-03 에스케이이노베이션 주식회사 에탄올 생산 경로가 억제된 내산성 효모 및 이를 이용한 젖산의 제조방법
KR20200040017A (ko) 2018-10-08 2020-04-17 에스케이이노베이션 주식회사 알코올 생성이 억제된 재조합 내산성 효모 및 이를 이용한 젖산의 제조방법
KR102185317B1 (ko) 2018-10-25 2020-12-01 주식회사 은명건설 파일근입 확인장치
KR20210041903A (ko) 2019-10-08 2021-04-16 에스케이이노베이션 주식회사 락테이트 대사 및 알코올 생성이 억제된 재조합 내산성 효모 및 이를 이용한 젖산의 제조방법
KR20210128742A (ko) 2020-04-17 2021-10-27 에스케이이노베이션 주식회사 글리세롤 생성이 억제된 재조합 내산성 효모 및 이를 이용한 젖산의 제조방법
KR20210158676A (ko) * 2020-06-24 2021-12-31 에스케이이노베이션 주식회사 젖산 생산능이 증가된 재조합 내산성 효모

Also Published As

Publication number Publication date
EP3929282A2 (en) 2021-12-29
EP3929282A3 (en) 2022-03-16
KR20210158676A (ko) 2021-12-31
US20210403882A1 (en) 2021-12-30
JP2022008224A (ja) 2022-01-13
US11898173B2 (en) 2024-02-13

Similar Documents

Publication Publication Date Title
CN108138122B (zh) 免疫调控
DK3209381T3 (en) COMPOSITIONS COMPREHENSIVE BAKERY STUES
AU2021221448B2 (en) Modified plant
AU2021225224B2 (en) Plant regulatory elements and uses thereof
CN113832043A (zh) 具有提高的乳酸产生能力的重组耐酸酵母
AU2017376780A1 (en) Compositions and methods for modulating growth of a genetically modified gut bacterial cell
JPH09322781A (ja) Staphylococcus aureusポリヌクレオチドおよび配列
KR101679548B1 (ko) 신규한 락토바실러스 브레비스 박테리오파지 Lac-BRP-1 및 이의 락토바실러스 브레비스 균 증식 억제 용도
AU2022256122A1 (en) Novel Proteins From Anaerobic Fungi And Uses Thereof
JPH09252787A (ja) マイコプラズマ・ジェニタリウムゲノムまたはその断片のヌクレオチド配列およびその使用
KR102064765B1 (ko) 병원성 대장균의 증식을 억제하는 신규 박테리오파지 및 이의 용도
KR20220024508A (ko) 생물학적으로 봉쇄된 박테리아 및 그의 용도
AU2021203084B2 (en) Modified plant
KR100676218B1 (ko) 효모 및 진균에서 세포 사멸 관련 약물 표적
AU2008200749B2 (en) Promoters for regulation of plant gene expression
KR20230136600A (ko) 안정적인 세포주에서 효율적인 성장을 가능하게 하는아프리카 돼지 열병 백신의 게놈 결실
AU2017322445B2 (en) Use of MCM7 to obtain acetic acid-resistant yeast strains
KR101635497B1 (ko) 종자수가 감소한 신품종 수박 및 이의 육종 방법
KR20230079107A (ko) 개선된 특성을 갖는 유전자 변형된 메틸로바실러스 세균
KR101975797B1 (ko) 고추의 웅성불임 회복과 관련된 핵산 분자, 프라이머 세트, 및 그의 용도
KR20240021274A (ko) 반코마이신 내성 장구균에 대한 박테리오파지
KR20220135919A (ko) 대량 고효율의 상추 품종 식별을 위한 snp 마커, 프라이머 세트, 및 이의 용도
KR20120096684A (ko) 고광학순도의 젖산 생산용 형질전환체 및 이를 이용한 젖산 생산 방법

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination