CN112442507B - 马度米星化合物的生物合成基因簇及其应用 - Google Patents

马度米星化合物的生物合成基因簇及其应用 Download PDF

Info

Publication number
CN112442507B
CN112442507B CN201910838457.2A CN201910838457A CN112442507B CN 112442507 B CN112442507 B CN 112442507B CN 201910838457 A CN201910838457 A CN 201910838457A CN 112442507 B CN112442507 B CN 112442507B
Authority
CN
China
Prior art keywords
gene
ala
gly
leu
val
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910838457.2A
Other languages
English (en)
Other versions
CN112442507A (zh
Inventor
刘然
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan Hesheng Technology Co ltd
Original Assignee
Wuhan Hesheng Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan Hesheng Technology Co ltd filed Critical Wuhan Hesheng Technology Co ltd
Priority to CN201910838457.2A priority Critical patent/CN112442507B/zh
Publication of CN112442507A publication Critical patent/CN112442507A/zh
Application granted granted Critical
Publication of CN112442507B publication Critical patent/CN112442507B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/52Genes encoding for enzymes or proenzymes
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/195Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/74Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0071Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1003Transferases (2.) transferring one-carbon groups (2.1)
    • C12N9/1007Methyltransferases (general) (2.1.1.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1025Acyltransferases (2.3)
    • C12N9/1029Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1048Glycosyltransferases (2.4)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/12Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
    • C12N9/1241Nucleotidyltransferases (2.7.7)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/88Lyases (4.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/90Isomerases (5.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P17/00Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
    • C12P17/18Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms containing at least two hetero rings condensed among themselves or condensed with a common carbocyclic ring system, e.g. rifamycin
    • C12P17/181Heterocyclic compounds containing oxygen atoms as the only ring heteroatoms in the condensed system, e.g. Salinomycin, Septamycin
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y203/00Acyltransferases (2.3)
    • C12Y203/01Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
    • C12Y203/01187Acetyl-S-ACP:malonate ACP transferase (2.3.1.187)
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A50/00TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
    • Y02A50/30Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change

Landscapes

  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Biotechnology (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Medicinal Chemistry (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)

Abstract

本发明公开了一种马度米星化合物的生物合成基因簇及其应用。其中,马度米星化合物的生物合成基因簇包括:Ⅰ型线性聚酮合成酶基因模块,所述Ⅰ型线性聚酮合成酶基因模块包括:madA1基因、madA2基因、madA3基因、madA4基因、madA5基因、madA6基因、madA7基因、madA8基因、madA9基因、madA10基因和madA11基因;糖基合成相关基因模块,所述糖基合成相关基因模块包括:madG1基因、madG2基因、madG3基因、madG4基因、madG5基因、madG6基因、madG7基因和madG8基因;以及修饰基因模块,所述修饰基因模块包括:madO1基因、madO2基因、madI基因、madE基因、madP1基因、madP2基因、madM1基因、madM2基因、madG9基因、madG10基因和madTE基因;以及调节基因模块,所述调节基因模板包括:madR1基因、madR2基因、madR3基因和madR4基因。

Description

马度米星化合物的生物合成基因簇及其应用
技术领域
本发明涉及生物工程领域,具体地,涉及马度米星化合物的生物合成基因簇及其应用,更具体地,涉及马度米星化合物的生物合成基因簇、微生物和制备马度米星化合物的方法。
背景技术
球虫病是由球虫引起的野生动物和家畜的肠道传染病,其特征是感染动物生长不良,表现受损甚至死亡。全球家禽业受影响最大,每年造成的损失估计超过30亿美元。聚醚离子载体是亲脂性螯合剂,它具有将阳离子转运穿过细胞膜来破坏细胞的作用。而显示具有有效的抗球虫活性的马度米星是由细菌Actinomadura yumaensis通过有氧发酵生产的,最初于1983年鉴定,有效剂量为5mg/kg,这是目前用于预防艾美球虫感染的抗球虫药物的最低剂量,并且它还具有抗孢子虫和抗疟疾活性。因此,由于马度米星抗球虫的效率高,并且比其他离子载体抗生素具有相对更高的安全性,它在抗球虫病聚醚抗生素中占有最大的市场份额。
然而,迄今为止,还没有马度米星化合物生物合成基因簇相关研究的报道。
发明内容
本发明旨在至少解决现有技术中存在的技术问题之一。为此,本发明的一个目的在于提出一种具有抗球虫活性的抗生素-马度米星化合物的生物合成基因簇。
在本发明的第一方面,本发明提出了madTE基因作为修复功能基因在马度米星化合物的生物合成中的作用。发明人发现,在马度米星化合物的生物合成中,madTE基因作为修复功能基因可以将马度米星化合物合成过程中的错误进行修复或清除,实现马度米星化合物的正确合成。
在本发明的第二方面,本发明提供了一种马度米星化合物的生物合成基因簇,所述马度米星化合物结构如(I)所示,
Figure BDA0002192926430000011
根据本发明的实施例,所述生物合成基因簇包括:
Ⅰ型线性聚酮合成酶基因模块,所述Ⅰ型线性聚酮合成酶基因模块包括:madA1基因、madA2基因、madA3基因、madA4基因、madA5基因、madA6基因、madA7基因、madA8基因、madA9基因、madA10基因和madA11基因;
糖基合成基因模块,所述糖基合成基因模块包括:madG1基因、madG2基因、madG3基因、madG4基因、madG5基因、madG6基因、madG7基因和madG8基因;以及
修饰基因模块,所述修饰基因模块包括:madO1基因、madO2基因、madI基因、madE基因、madP1基因、madP2基因、madM1基因、madM2基因、madG9基因、madG10基因和madTE基因。
发明人惊奇的发现,本发明实施例的马度米星化合物的生物合成基因簇可以控制马度米星化合物的生物合成。
根据本发明的实施例,所述madA1基因编码SEQ ID NO:1所示的氨基酸序列;所述madA2基因编码SEQ ID NO:2所示的氨基酸序列;所述madA3基因编码SEQ ID NO:3所示的氨基酸序列;所述madA4基因编码SEQ ID NO:4所示的氨基酸序列;所述madA5基因编码SEQ IDNO:5所示的氨基酸序列;所述madA6基因编码SEQ ID NO:6所示的氨基酸序列;所述madA7基因编码SEQ ID NO:7所示的氨基酸序列;所述madA8基因编码SEQ ID NO:8所示的氨基酸序列;所述madA9基因编码SEQ ID NO:9所示的氨基酸序列;所述madA10基因编码SEQ ID NO:10所示的氨基酸序列;所述madA11基因编码SEQ ID NO:11所示的氨基酸序列。
根据本发明的实施例,所述madG1基因编码SEQ ID NO:12所示的氨基酸序列;所述madG2基因编码SEQ ID NO:13所示的氨基酸序列;所述madG3基因编码SEQ ID NO:14所示的氨基酸序列;所述madG4基因编码SEQ ID NO:15所示的氨基酸序列;所述madG5基因编码SEQID NO:16所示的氨基酸序列;所述madG6基因编码SEQ ID NO:17所示的氨基酸序列;所述madG7基因编码SEQ ID NO:18所示的氨基酸序列;所述madG8基因编码SEQ ID NO:19所示的氨基酸序列。
根据本发明的实施例,所述madO1基因编码SEQ ID NO:20所示的氨基酸序列;所述madO2基因编码SEQ ID NO:21所示的氨基酸序列;所述madI基因编码SEQ ID NO:22所示的氨基酸序列;所述madE基因编码SEQ ID NO:23所示的氨基酸序列;所述madP1基因编码SEQID NO:24所示的氨基酸序列;所述madP2基因编码SEQ ID NO:25所示的氨基酸序列;所述madM1基因编码SEQ ID NO:26所示的氨基酸序列;所述madM2基因编码SEQ ID NO:27所示的氨基酸序列;所述madG9基因编码SEQ ID NO:28所示的氨基酸序列;所述madG10基因编码SEQ ID NO:29所示的氨基酸序列;所述madTE基因编码SEQ ID NO:30所示的氨基酸序列。
根据本发明的实施例,该生物合成基因簇进一步包括:调节基因模块,所述调节基因模块包括:madR1基因、madR2基因、madR3基因、madR4基因、madT1基因、madT2基因、madT3基因和madU基因。
根据本发明的实施例,所述madR1基因编码SEQ ID NO:31所示的氨基酸序列;所述madR2基因编码SEQ ID NO:32所示的氨基酸序列;所述madR3基因编码SEQ ID NO:33所示的氨基酸序列;所述madR4基因编码SEQ ID NO:34所示的氨基酸序列;所述madT1基因编码SEQID NO:35所示的氨基酸序列;所述madT2基因编码SEQ ID NO:36所示的氨基酸序列;所述madT3基因编码SEQ ID NO:37所示的氨基酸序列;所述madU基因编码SEQ ID NO:38所示的氨基酸序列。
根据本发明的实施例,沿着基因的3’-5’端,所述madR3基因、所述madR2基因、所述madG1基因、所述madG2基因、所述madR1基因、所述madT3基因、所述madA1基因、所述madA2基因、所述madA3基因、所述madA4基因、所述madA5基因、所述madA6基因、所述madE基因、所述madG10基因、所述madM2基因、所述madP2基因、所述madA11基因、所述madM1基因、所述madP1基因、所述madA10基因、所述madO2基因、所述madI基因、所述madO1基因、所述madA9基因、所述madA8基因、所述madA7基因、所述madU基因、所述madG9基因、所述madTE基因、所述madT2基因、所述madT1基因、所述madR4基因、所述madG8基因、所述madG7基因、所述madG6基因、所述madG5基因、所述madG3基因和所述madG4基因依次串联。
根据本发明的实施例,所述生物合成基因簇具有SEQ ID NO:39所示的核苷酸序列。
根据本发明的另一方面,本发明提供了一种微生物。根据本发明的实施例,所述微生物具有前述的生物合成基因簇。发明人惊人地发现,利用该微生物进行发酵处理,可以高效、简单和低成本地合成马度米星化合物。
根据本发明的实施例,所述微生物为放线菌。
根据本发明的实施例,所述微生物是通过如下方式获得的:将前面所述的生物合成基因簇插入带有基因组整合位点的质粒载体中;将携带所述生物合成基因簇的质粒载体通过三亲本方法导入受体微生物中。发明人发现,采用三亲本方法(将供体菌、协助菌与受体菌混合起来,使菌体细胞紧密接触,供体中的质粒可以在协助菌的帮助下通过接合转移方式导入受体菌中的方法)和pIB139可成功获得结合子,在Actinomadura sp.J1-007中实现生物合成基因簇的过表达。
根据本发明的实施例,所述带有基因组整合位点的质粒载体为含有attP整合位点和整合酶基因int,例如pIB139。
根据本发明的实施例,进一步包括将被导入质粒载体的受体微生物进行扩增培养。
根据本发明的实施例,所述扩增培养是在ABB13培养基中进行的。发明人发现,利用ABB13培养基进行扩增培养,菌株生长更优。
根据本发明的实施例,所述ABB13培养基中含有15mg/L阿泊拉霉素。发明人发现,15mg/L阿泊拉霉素足以抑制Actinomadura sp.J1-007的生长而有不会影响扩增菌株的生长状态。
根据本发明的又一方面,本发明提供了一种制备马度米星化合物的方法。根据本发明的实施例,该方法包括:利用前述的微生物进行发酵处理,以便得到发酵后的微生物;将所述发酵后的微生物进行破壁处理,以便得到马度米星化合物。根据本发明实施例的制备马度米星化合物的方法,利用前述的微生物进行发酵处理,可以高效地合成马度米星化合物。
根据本发明的实施例,所述破壁处理后进一步包括将破壁处理产物进行纯化处理。将破壁处理产物进行纯化处理,可进一步提高马度米星化合物的纯度。
根据本发明的实施例,进一步包括将所述马度米星化合物进行化学修饰处理,以便获得马度米星化合物衍生物。
根据本发明的实施例,所述发酵处理的时间为7-8天。
本发明的附加方面和优点将在下面的描述中部分给出,部分将从下面的描述中变得明显,或通过本发明的实践了解到。
附图说明
本发明的上述和/或附加的方面和优点从结合下面附图对实施例的描述中将变得明显和容易理解,其中:
图1显示了根据本发明一个实施例的马度米星化合物的生物合成基因簇序列结构示意图;
图2显示了根据本发明一个实施例的制备马度米星化合物的方法的流程示意图;
图3显示了根据本发明一个实施例的马度米星化合物的生物合成途径的流程示意图;
图4显示了根据本发明一个实施例的马度米星的化合物鉴定结果;
图5显示了根据本发明一个实施例的马度米星菌株在不同培养基上的生长状况;
图6显示了根据本发明一个实施例的马度米星菌株的发酵产量。
具体实施方式
下面详细描述本发明的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,仅用于解释本发明,而不能理解为对本发明的限制。
在本发明的描述中,术语“纵向”、“横向”、“上”、“下”、“前”、“后”、“左”、“右”、“竖直”、“水平”、“顶”、“底”等指示的方位或位置关系为基于附图所示的方位或位置关系,仅是为了便于描述本发明而不是要求本发明必须以特定的方位构造和操作,因此不能理解为对本发明的限制。
需要说明的是,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括一个或者更多个该特征。进一步地,在本发明的描述中,除非另有说明,“多个”的含义是两个或两个以上。
根据本发明的一个方面,本发明提出了madTE基因作为修复功能基因在马度米星化合物的生物合成中的作用。发明人发现,在马度米星化合物的生物合成中,madTE基因作为修复功能基因可以将马度米星化合物合成过程中的从ACP上去除异常中间体以纠正聚酮化合物链延伸,从而加速PKS的合成效率,实现马度米星化合物的正确合成,换用其他具有修复功能的基因无法实现马度米星化合物合成过程中错误的修复或清除。
根据本发明的一个方面,本发明提供了一种马度米星化合物的生物合成基因簇。根据本发明的实施例,该生物合成基因簇包括:
Ⅰ型线性聚酮合成酶基因模块,所述Ⅰ型线性聚酮合成酶基因模块包括:madA1基因、madA2基因、madA3基因、madA4基因、madA5基因、madA6基因、madA7基因、madA8基因、madA9基因、madA10基因和madA11基因;
糖基合成相关基因模块,所述糖基合成相关基因模块包括:madG1基因、madG2基因、madG3基因、madG4基因、madG5基因、madG6基因、madG7基因和madG8基因;以及
修饰基因模块,所述修饰基因模块包括:madO1基因、madO2基因、madI基因、madE基因、madP1基因、madP2基因、madM1基因、madM2基因、madG9基因、madG10基因和madTE基因;
发明人惊奇的发现,本发明实施例的马度米星化合物的生物合成基因簇可以控制马度米星化合物的生物合成。
根据本发明的实施例,该生物合成基因簇(在本发明中,也简称为“基因簇”)为链霉菌的生物合成基因簇,具有SEQ ID NO:39所示的核苷酸序列,具体如下:
CCGGGGCGGCCCGGAACACGGCGGCCCGGAACACGGCGGGCCTGGACGCGGCGGGCCAGACGCGGCGCGGGCCTGGCCGGGGCGGGCCGCGCTACTCGGCGATCTTGGTGCGCATCGTCCAGTGGAACGTGACGGGGGCGGCGGGCTCGTCCACGAGCTCGAAGTGCTCGTAGCCGCCCCGGTAGGGGATCTTGAGCTTGCGGGCGGCGAGGGTGGAGCGCTCGGCGCGGATCACGGCGGGCACGCCGTCGGGGCCGCCTTCGAGCGTGATGTCGATCGTCTGTAAGCGGTCGTTCCTCTGTAAGCGGTCGTTCCTCATGTTCCGGTTCCTCCGGGTCGGGGGCGGCCGGGGCCGGCCGCCGGGTCTCGGACGGGGGACGCGGCGTCGCGCCGCAGCGCTGACCGAAGGTCGAAGCGTCGCACGTTGAGCTTGCGGTACACCTTCGTGAGGTGCTGCTCCACCGTGCTGACCGTGATGTAGAGCTTGCGGGAGATCTCCCGGTTGGTGTAGCCGTCGGCGGCGAGCGCGGCGACCCTGCGCTCGGCGTCGCTCAGTTCGAGCGTGCTCGGGCAGGGCGGCTCCTCGGTGGTGCGCTCCAGAATTCCGCACTTGCGCGCTAGGGAAAGGGACCTGCGCGACAGCAGCGAGGCGCGTTCGGGCTCGCCGAGGATCTCGTAGGCGCGGCTCTGGTCGGCGAGCGCCCGCGACAGTTCGAGCTGGTCGCCGCAGCTCTGGAGCATCTCCACGGCCTGGCCGAGCAGGGCGGGGCGCGACTCCGGGGGGCTCGTCGCCGCCTGCGCGCGCAGCGACATCCCGCGCGCCCGCAGGTGCTCGGGCTCCAGGAGCGCGAGCTGCTCGTCGACGAGCTCCCTCGCCTGGGCGGCGTCGCCCAGGCGCAGGTGGGCCTCGGCGGCGCCGATCCGCCAGGGCTCCACGGCCGGCAGGTCGATGCCCCAGTCGTTCATCAGCTTGCCGCAGGCGCGAAAGTCGCAGGCGGCGGCGTAGAAGCGGCCCTCCGCCAGGTGGTGCCGCCCCCGCGCGTACAGGTAGTGCAGCCCCATCCGCGTCTGGAACATCGTCTGCGGGACGGGGATGTCGAGCAGGGTGGCCGCGCGCTCGACCTTGCCCATGGCCGTCGTGGCGAGCACCATCGCGGCGAGCGGAAGCCCGACCGCGACGCCCCAGCTCTTCGGCGGGATGAGGCTCAGCGCGGCGTCGGCGTGCCGTTCGGCCGCGATCACGTTGCCCTGCCGGACGTGGATCAGGGCGCGTTCGGCCGCGAACAGCGCGTGCCACGTCGGGCTGCGCCGGGCGACCGCCTCGACCAGGAGGGAGTCGCACCAGGAGGCCGCCTGCTCCAGCCGGTCGTTGTACATCAGCACCGCGAGCGCCGTCGTCAGCGGCGCCAGCGTGCGGTTGCTGAGGCGCGAGCGCTGGAGGATGCCCTCGGCCGTGGTGAGGGCGTCGCTCTCGTTCTCGTCGACCAGCTCGGCCGCCAGCATGGTGGCCTCCTGGAGGTCGGGGAGCGCTCCGGTCGGCTCGGGCTCGTGGTGCGCGGGCACCGGCTCGGCCTTGAGGACGCCCGGGTAGAGGTAGGAGAGCCACAGCCGCGGGGTGTCGAGGTCGACGGACGCGGCGGTGCCGGACTCGGCGGCCTCGTCCTGCGTCCGGACGAGGTCGTCCACGATGCCGAGCGCCTCGCCCACCCGCCCGTGCCAGAGCAGGTAGGTGATCGGGGTGAACGTGTCGCACAGCTCGGGCCGTCCGGTGCGGGCGGGCGCGGCGAAGTCGGGGAGGTGGCGCAGGACGGCCGAGGGGTCCACCCGCCATTCGGCGTCGGCCAGGGCGGACGCGATCTCCGGCCGGAGCCGCTCGTCGTCGCACAGCCGGTAGGCGGCGCGCAGGTGGCGGATCGCGGGGCGGACGTCGTCGTTGGCGAGCGCGTGCTCGGCGGCCTCGCGCAGCAGCGGGACGACCCACGGCGCCTCGATCCGTCCGGCGGCGACGATGTGCGCGGCGACGTCGGTGGGGGCGGCGCCCTCGTTGTGCAGCAGGACGGCGGCGCTGCCGTGCATGGCCATCCGCTCGCCGGACGGGGTGGCGTCGAGGATCGCGGCCCGTACGATCTCGCTCCGGAAGTCGCCGTTCTCCAGGATCCGCGCCTTGGTGAGCGCGTTGACCGCCTGGGCGGTGGACTCGGCCGCGAGGTCCAGCAGCCGGCCGAGCAGCGTGGTCGAGGCCGCCTTCCCGAGGATCGCGATGGCGCGGGCGACCTCGCAGACCGGCGGCTCGTACCGGTAGAGGAACGTCATCACCGCGCGGCTGAACTCCATGCCGGAGCCGAGCTCGCCGCGCCCCGCGGCCGCCGCGGCGCGGTGGTCCTCGATCAGCGCCTGCACGAGCGCCGGATATCCGGAGGTCGCCTCGTGGAACGCGGGCGTCAGGTCCTGCGCCGCCGCGTCGCCGAGGCTCCTGCGCAGCAGCCGCGCGACGCCCGCCTTCGTCAACGGCTCGACGCGGATCTGGCGGAAGAACGGCTCGCGGAGGAACTCGCTGCGCAGCAGCGCGTTCGCCGCCTCCAGGTCGCGGCACTCGCTGAAGATCACGTGGAGGCGCGACGAGCGCAGCCTGCGCAGCAGGTACAGCAGGCAGCGCAGCGACGTCTCGTCGGCGTAGTGCACGTCGTCCACGGCGATCACGACCGGCTGCCGTTCGGCGAGGCCGCGCAGCACCTCGCACAGCCCGCGCAGGACGACCGAGGTCACCTGGCCCGAGCCGCCGGGACCGGCGGCGGAGGTGTTGAGGGCGCCGGCGTCGAGCCAGCGCATGGCGCGCTCGGCGTCGCCGGAGGGCAGGTCGGGGTTGCGGAAGAGCTGTTCGAGGACGCCCATCGGCAGGCCGCGCTCGATGCGCGACGCGACGGCGCTCAGGAACAGGGCGCCGTGCTCGACGGCCCGTTCCCCGAACGCGCGCAGCAGGGAGGTCTTGCCGGTCGCCGTGCCCCCGCTGACCACCACCACCCTGCTTCGCGCGCGTGGGAGCTCGGCGAACAGGCCGTCGAGAGCGGCGAGTTCCTCGTCTCGTTCGACCAGGCTCACAGGCACCCCTATGATGATCGCTCAATTGCGCCGACCCGCGTGGCGGTCGCGGGTGTGATCGAGACAGGCGGCGCCCCGGGCGCCGGATTCGAACGAATACTTGCACCCGTTCCGAACAAGTGCAACGAACGGCGTGTTGATGGTTCATTTCGGACGGTGATGATCGCGCGCCTGACCTGCGGGGCCGTGGCCGCCCGGGCGGCCGGTGATCCATCTCTGCACGCACGGGGACTTGCCAGGCCGCGGGCGATCGGTTATATGGCGGGAATGGTCTTGATCGATCACGCGATAAGCCCGCCCCGCGCGGGCGGATTAGGGGTGGGCGGGCACAGGTTAGGAGGGGTAAGGGGTTGCGCCCGGACGGCCCGGAGTATGTAAATCGTGCTTAAAGACAATGTTAACGGAACGTATGGGGCGGGCGGCGTCCGCCGTCACCCTCGGCGCAATTGCCCGGAGCCGCCGCCACCGGGCCGCGGGCACCCCTAGAAGGCTGCGCCGCAGAACCGAAAAGGGCGTGACCGTTATTCGGCGCCGGCCGCCGGCGGGTCCGGACCGTCCGGCCGACCGCTGGTCAGCGGGCTTCCAGGGCATCGCGGAGACGCCCCGTTCGCGCGCGGGCGGCCGGCGGCGGGTCCGCCCGCCGGTCGGTGATCTTCGGAATGTAGGGGAGCCGCCCGGCCGTTAGGGGATGACCCGCGCGACGCGGTCCGCTTAGATTGTGCATTGAACGGGCGGGTGCTCCCCGCCCCGGGCCCCGGTAGCCCAATTCTGGTAGCCAGGCGACGGACTTAAAATCCGTAGAGTGTGGGTTCGAGTCCCACCCGGGGCACGGCGGTGAACGGCCGCGAACGGCCGGAAGGCACCGGGCCGAAGGCTACCGGTCTCCGGCCGCCTGCCTTAGCGGGCCGCTTCCGCAGCAATCCAGAACGGCCTCCGAAGCGGGCAGCACCACCTCGATGTGCTGGAGCGCGGATTTCAGGTCGAAGCGTTTCACGTTGAGCTTCCGGTAGACCTTCGTGAGGTGCTGCTCCACCGTGCTGATGGTGATGTAAAGCTTCTTGGATATTTCCCGGTTGGTGTATCCGCTGGCGGCCAGAACGGCCACCCTGCGCTCGGCGTCGCTCAGCTTCAGGAACCGTTCGGGGTCGGCCGCCGCCACGCCTCCGGCGTCCCGTCCGGCGCTCCCGGCGAGACCGGCGGCGTGCCCGGCCCCCATCCGGTGGCGGCCGGCCAGGTCGTCCGGCCGGACGACGCCGCCCGGGACGTCCGGGCGCTCCCCTTCGCGGGCCGCGGGCCGCCCGCACTGCCGCGCGAGCAGCTCGGCCTGGTGCGCGAGCTCCCGCGCGCAGTCGTGCTCGCCGACCCATCGGCGCACCTCGGCGAGGTCGGCGAGCGCGTACGACAGCTCCAGCCGGTCGCCGCAGCCCTGGAGCAGCTCGATCGCCTCGCCCAGCAGGTCCGGCCGTTCCGGCACGGGATGTGTCGCGGCCTTGACGCGCAGGGCGACGCCGCGCGTCCGGACGTGCTCGGGCGCGGACCTCAGCAGCTGCTCCTCCATCAGCGCCCTCGCCTCCGCCAGGTCGCCGAGCGCGATCAGGGCCTGCGCGGCGCCGAGCCGCCACGGCTCGACCTCGGGCAGGACGACGGCCCAGCCCTCGCGCATCGTTCCGCAGGTGCGGAAGTCGGACAGCGCGGCGTGGAGCCGGTTGGCGGCCAGGTGGTGCCGGCCGCGCGCGTAGAGGTAGTGCGGGCCGAGCGGCGTGCGGAACATCGCCTCGGGCACCGTGATGCTGAGCAGGTCGGCGGCCTCGCGCGGACGTCCCATGGCCGTCTTCGCCAGGATCATGCACGACAGCGGCAGCCCGACGAGGACGCCCCAGCTCTTCGGGGAGATGAGGTTGAGCGCGGTGTCGGCGTGCCGTTCGGCCGCCGGGAGGTCGCCCTGCCGCAGATGGATCAGCGCGCGCAGCGCGGTGAACAGCGCGAGCCAGGTCGGGCTGCGCCGCGCCGTCGCCTCGGCCAGCAGCGCGTCGCACCAGACCGCCGCCCGGCCCGGCCGGTCGTTGTAGATCAGGGTGGCCAGCGCGATCGCGAGCGAGGCGAACGTCCGATGGTTCAAGCGGGAGCGTTCCATGACCCGCTCCGCCGCGACCAGCGCCCCGCGCACGTCCCTGCGGTGCAGCTCGGCCACGAGCATCGCCGCCGCGTCCTGCCCGGGACCGGGCGCCGCCGCCGGGACCGGGCCGAGGGCGGGGGCGGTGTCCGCCGCCGGGCCGTCCTTCGCCAGCTCGGGATACAGGTAGGCCAGGCAGAACCGCGGGGGGATGACGTCCATCGCCGTCCCGGCGTCGCCCGGCAGCGCGGCGGAGGGCCGGCCGCGCCCGAGCGCGTCGAGCATCCGGACGGCCTCGGAGACCCGCCCGTGCCAGAGCAGCGACATGAACGGCGTGCCGGCGTCCCGCCCGTCGAGCAGGCCGTCCTGGATCGCCAGGCTGAACTCGGGCACCCACCGGAGCGCGACCGAGGGATCGACCCGCCACTCCAGGTCGGCGAGCCGCGCGGCGATCGCCGAGCGCTCCCGCCGGTCGCGGCAGATCTGGTGGGCGACGCGCAGGTAGCGGATGCCCGTTCCGAGGTCGTCGCTCGTCAGCGCCTGCTCGGCGGCCTCCCGCAGGACGGGGATCGCCCACGGCGCGCCGTGCCGGTTGGACGCGACGATGTGGGCGGCGACGTCGGTCGCGGCGGCGCCGTCGGAGTGCAGCTGCTCGGCGGCGCGCTCGTGCAGGACGCCGGGGGCGTCGGGCGGGGCGCTGTCGACGACCGCCGCCTGGGCGGCGCCGTGCCGGAAGCTGCCGGCGTCGAGGATCCCCGCGGAGGTCAGCGTGCCGACGGCGCGGGCGGCGGTCTCGGCGTCGATGTCGAGCAGCCGGCCGATCAGGCTCGGCGGGACGGGCTCGTTGAGGATGCCGATGGCGCGGGCGACCTCCAGGACCGGGAACTCGTGCCGCTGGAGCAGCCCGACGACCGCGCGCCCGAACGCGCCGCCCGCGACCAGCTCGGGCGCGATGAACTCCGGCCCCGCGAGGTAGTCGTCGACGAGCGCGTGCACCAGCGCCGGGTTGCCCGCGCTCGCCTCGTGGAACGCGGGCGCGAGGAGCCGCGCGGTCTGGGCGTCGAGGTCCAGCCGGATCAGCGCTTCCACGCCGTCCGGGGAAAGGGGGCCGAGCCTGATCTGGCGGGAATGCGGTTCGCGGAGGAATTCGCTTTGGAACGGCGCGTTCGGCGCCTGGGAACGGGGGCTCACGGTGAGCACCGTGAGCAGGCGCGCCGAACGCAGCCGGCGCAGCAGGTACAGCAGGCACTGCAACGAGATCTCGTCGGCGTGGTGCACGTCGTCGACGGAGATCACCACGGGCCGGCGTTCGGTCAGGTCGCGCAGGACGTCCCAGAGGCCGCGAAAGCGCATGGGATAGGTCAACGCACGCGCGTTGAGCAGCTTCATCGCGCGCTCGGCGATCGCCTCGGGAACCTCCGGGCTGCGAAAGAGCTGTTCGAGCACGCCCATCGGCAGCGCCGTCTCCACCGACGAGGCGACCGCGTTGAGAACGACCGCGCCGGACTCGGCGGCCTGCCTTTCAAAGGCGTCCAGAAGGGAGCTCTTACCGGTGGCGACGTCCCCGCTGATGATCGCCAGTCTGCTGCGCGCCTCCAGGCAGTCCCCGAACAGCCCGTGGAGGACGGCGACTTCCGCGTTCCGTTCTACCAGGCTCACGAACTCCCCCTGAGCTGCACACTCTCTTCCCCGTTTGCGGCACCCCGCCGCATTGAGTCCCAGGTCACCCGTACCCCGTGGTCGTCTACCGGGCAGAATCTAGTCGCGGTCCTGTCGGCGGGCAACGGGCATTGCGTAATCGGACCGTTTCACAGGGTCGCTAATCACGTCCTGACCTGCGCGTGCAGCACCCTTGTGAGATAGGTTATACATCTGTATACTTCGGGGAAGCGAGGGGAGATGTGCGTCCCGCAGAAAGATTTCGTTAATTTTCTGTGATGGGACTGGAGTATCATCTCCGGAACCGCGATGCTCGAAATTCGCCACACCGCAGACCGGCCGCCGCGAGGCGGGCCCGAACGCCGCGAACCCCTCCCGAAGCCCCTCTGGAAGGCCCTCGGCGCGTGCCGCGCGCCGGCCTCGCAGTCGGTGATCTTTGCTCGGCCGTTCGCGGGCCCGCGACCATTTTAGGAGACCCGCCTAAAGGCCCGCCAACGCATAGGGCATGGCCCTTGCATTCCGCCCGAGTGGGCCGGATGATCACTCTGCCGTTCACCGTGCTGAGGGTCGCGAGAATACGTTCCGAACGGTGCGTACTGCACTCTAGGGGCGGTCGGAGAAGACGGCGTTCGGTCGATTAGGGGCCCCTTAACAACCGGCCAGTCATAAGGTCGAAAAGGCGGTTGACCAGGAGGCATTACGTTCCGCATCGTTCCGGATACGCATCGGGCCTTGCTGGTGATCCGGCCGCGGAGCACCCTTGGATCGGCCTCCTTTGGCCCGCCGGCGCACCGGCGACGACCCCGATGGAACGGAACGGAGAGACGGTGGCCGAACTGAAATCCGGGAGGGCGCGCGAGGTGGCCGCATTCCTCCGCCCCCGGCCGACCACGGCGTTATCCCGCGAAACCTCGACATCCCGCGAACCCTCGACGGCGGGTGCGCGATGAAGGGCGTCATCCTCGCGGGCGGAAAGGGCACGCGGCTCCAGCCCGTCACCGCCGTCGGCTCCAAGCAGCTCATGCCCATCTACGACAAGCCGATGGTCTACTACCCGCTGTCCATCCTCATGTTCGCCGGCATCCGCGACGTGCTCGTCATCTGCCGCCCGTCCGAGGTGGCCGTCTTCCGCGGCCTGTTCGGCGACGGGCGGCGGCTCGGGATGAACATCGACTACGCCGCCCAGGACGAGCCGCGCGGGATCCCCGACGCGTTCCTCGTCGGCGCCGACCACATCGGCGGCGACGACTGCGCGCTGATCCTCGGCGACAACCTCTTCCACGGCTCGGGCCTGCCGTCCCTGCTCCGCCAGAGCGCCGAGCGGATGGACGGCTGCGTCCTGTTCGGCCACCAGGTCTCCGATCCCGAGCGCTACGGCGTCGCCGAGATCGACGAGCGGGGCCGGCTGGTGTCCATCGAGGAGAAGCCGTCCGAGCCCCGCTCCAACCTGGCGATCCCCGGCCTGTACTTCTTCGACAACAAGGTCGTCGAGATCGCCCGCGGCCTCGCCCCGTCCGCCCGCGGCGAGCTGGAGATCACCGACGTGCTCCGCGCCTACCTGGAGGCGGGCCGCGCCGACCTGGTCTGGCTCGGCCGCGGCGTCACCTGGCTCGACACCGGCACGCACGAGACGCTGCTCGAGGCGGGCTGCTTCGTGCGCGACGTCCACCAGCGGCAGGGCGCGCGGATCGGCTGCGTCGAGGAGATCGCGCTCTACATGGGGTTCATCGGCCCGGACGAGTGCTACGAGCTGGGCGCGGAGATGGGCAACTCGCCCTACGGCCGGTACGTGATGGACCAGGCCCGCTTCTACGACCGCGTCCGCGACCTGCTGGACTGGCGCGCCACGTTCCTGGAGGGCGCATGAACCTGCTGGTCACCGGCGCCGCCGGGTTCATCGGCTCCACCTACGTGCGGATGCTGCTCGCTCCCGCCGCCCCGGACCCCGCCGCCCCGGACCCGGTTGCGCCCGACCCGGATGCGGCCGACCCGGACGCGTCCGAGGAGATCCGGGTGACGGTCCTGGACAAGCTCACCTACGCCGGCACGCTCGACAACCTGACGCTCAGCCATCCGCGGCTCCGCTTCGTCTACGGCGACATCTGCGACGAGGACCTCGTCGGCAAGCTGATGTCCGAGGCCGACCAGGTCGTGCACTTCGCCGCCGAGTCCCACGTGGACAACTCGATCGCCGCCGCCGGCGACTTCGTCCGCACCAACGTGGTCGGCACCCACACGCTGCTGGAGCAGGCGCTCGCTCATGGCGTGCGGCGGTTCGTCCACGTCTCGACGGACGAGGTGTACGGCTCGGTCGAGACGGGCCGCTCGCACGAGACCGACGCGCTCGACCCCAACTCCCCGTACGCGGCGTCCAAGGCCGCCTCCGACCTGCTCGCGCTCGCCTACCACCGCACCCACGGGCTCGACGTGCGCGTCACCCGGTGCTGCAACAACTACGGGCCCCGGCAGTTCCCCGAGAAGCTCATCCCGCTGTTCATCACGCGCCTGCTCGACGGCCAGAAGGTCCCCCTGTACGGCGACGGGCAGCACAAGCGCGACTGGCTCCACGTCGACGACCACTGCCGCGCGATCGAGCTCGTCCGCACCGCGGGCGCGCCGGGGGAGGTCTACAACATCTCCGGCCACACCGAGCTGACCAACCACCAGATCACCGGCCTGCTCCTGGAGGCGGTCGGCGGCGACTGGGACCAGGTGGCCTACGTGACCGACCGCAAGGCCCACGACCGCCGCTACGCCGTCCACGACGACAAGGCCCGCACCCGCCTCGGCTACCGCCCGCGCCGCGACTTCCCGCGCGGCCTGGCCGAGACCGTCGCCTGGTACCGCCACAACCGCTCCTGGTGGGAGCCCCTGAAACGCCGAGCGGGCCTCTGACCCCGCCGAAGCGCCGCCCGCCCCGCCCCGTGCCCCATCCGAGCGATGGGGCACACCCATGTGCGGGGGGAGGGGGGCGTCGCGGACGGACGGACGAACGAACGGCCGAGCCGGGGCGGCGGGGGCGCCCGTGGCATCGGCCGTGTCGTCGTTCGTGGCGGTAGGCGGGTAGGCGGGTAGGCGGGGTAGGTGGGGACGGGAGTACGTGGAGCGGGCGGGTCAGGCGGGGATGGGCAGGTCCAGCCGGTGGTCCGAGGTGAGGATGGACCGCTGGAGGTTCTGGAGGTCGCGGCACGGTTCGAGCCCGAGCTCCTCGTTGAGCGTCCTGCGCGCGGACTGGTAGGCGTGGAGCGCGTCCGCGCGGCGGCCCGAGCGGTAGAGCGCCAGCATGAGCTGGCGGTGGAGGACCTCCCGCAGCGGGTGCTCGGTGGTGAGCTGGTAGAGGTCGCCGACGAGCTCGCGGTGCCGGCCGAGCATCATGCGCGACTCCATGAGCATCTCCACGCACTCCAGCCGGGTCTCCCGGAGCCAGGTGTGGAACCCCTCCAGGACGGGCCCGTTGCCGAGGTCGTCGGGCAGGTGGTCGGCGGAGAGGTCGAGCGCCTCCTCGAACGCGGCCGACGCCTGGTCGTGGCGGCCCTGCCTGAAGTCGTTGCGGCCCTGGTTCACCAGCAGCTCCAGGCGGCGCAGGTCCAGCTCGTCGGCCCCGAGCCGCAGCAGGTAGCCGGGCGGCCGGGTGAGGACGGGGTCCTCGGCGCCGGGCCGGTGCAGGAACTTGCGGATCTGGGAGATGTAGACGTGCAGCCCGGCGGTGGCCCGGCGGGGCGGCGCCTCCCCCCAGATCTCGGTGATGAGCTGGTCGACCGGCACCACCTGGTCCGCGCGGACCAGCAGCACCGTCAGCAGGACCCGGATCTTCCGCGCCCGGATGGTGTGCTTCCCGTCCTCGTCGACCACCTGGAACGGGCCCAAGATCTCATAGCGCACGGTTCACCCCCGTCGATAAGCGTGCGAACGGACGGCGTCCTAACGGACGCCGGCTTCCTTCCACTGACGCCATGGCTCCCCCCTTTCGTGCGGGCGGACGCGCCGCCGAGGCGGCGGACGGACGGGCCGGGGTCCCGGCGGATCACCGCCGGGCCGGGCGCGTCCGATCACTACCCACTCAACTTAAGAACCGGCCAAGGCGCGCTGAACCCCTGTCCCGTGCCCGCCGACCCCAGTCGAGACCACCGTTCCCCCTACGGGCTCGCGGGCCGGGCCGCGCGGGCGCGCGACGCTCAGGGCGTCCGCCGGGGCGGTGCGGGCGCGCCGTACGGTCCGGCGGGCGCCCGCCCCAGGGGGATCTAGGGGTGGGGGTGAGGTGCGGGGCGCCTCCGGCAGCGGCGCTGGTCGTGCGCCGCTGCCGGAGGTAGCGGCCCGCCCGGGGCGGCGGTCCGCTCTCGGGGGGATCGGGTGGTCCGCGTTGACGGGGGTGAGGCGCGGCCCATCCGGGTCCCCGGGTTCAAGGGGTGAACGGTTCCGGCGGCCGTCGGGGGCGCGGCCGCGGTCGGTGGTCGGGGTCAGCGGGCGGGGTGGGTGGTCGGGGTCAGTGGGCGGGGGCTCTGTCGGGGCGCGGGCGCGGGCGCCGGGGGCGGTGCGCGCCGTAGTGGCGGAGGAACAGCTCCTCAAGGGTCGGTTCGCGGCTGACGAGGTCGCGGATGCCGACGGACGCGAGGTAGCTCATGAGCTTCTCGATGCTGTCGTGGTCCACATCGCATTCGACGTGGTTGTTCCGGACGGTCACGTTGTGTACGCCGGTCATGTGGGCCAGGGCGAACGGCCGGCCGAGGAGCTGGGCGGAGATCGAGGTGCGGGCGCCGAGCCGCATCTCGGCGAGGACGCCGCTCTCGACCGTGCGGCCGTCGCGGATGATGCTGACCCCGTCGCACAGGGCGTCCACCTCCGACAGGATGTGGCTCGACAGCAGGAACGTGCGGTTGGCGCGGGCCTGCTCCTCCAGGACGACGCGCCGGAACGTCCTCTCCATCACCGGGTCGAGGCCGAGGGCCGGCTCGTCGAGCACGAGCAGCTCGGCGTCGGAGGCCAGCGCGGCGACCAGCGCGACCTTCCGCCGCTCGGCGGGCGGGTAGTTGCGGCACCGCTCCCTCGGGTCCAGGTCGAAGCGCCCGATCAGCTCGTTCCTGCGGTCGCCGTCGAGGCCGCCGCGCAGCGCGCCGATCACGTCGATCACCTCGCCGCCGGTGAGGCCGGGCCACAGGGTGACGTCGCCGGGCACGTACGCGACGTGCCGGTGCAGCGCGGGCGCGTCCGCCCAGGGGTCGCCGCCGAGGAGCCGGGCCGTTCCGCCGTCGGCGCGGGTCAGGCCGAGCAGGATTCGGATGAGCGTCGTCTTGCCCGCCCCGTTCGGCCCGAGGAGGCCGTGCACCTCGCCCCGGTGCACGGTCAGGTCGAGGCCGTCCAGCACGCGCGCGCCGCCGAACGACTTCACCAGCCCGGTCGCCGCGATGGCTCTGGTCACTCTACTGTCCACGTGTCTCGTTTGCTATTCATCTGTATAGCCGAGAATACGTGCGTATTGTCCCGGGGGCAAGCGGGCCTGCCCAACGTATTGGCGGGGCTTTAGAGCGGCGACCCACGATGGGCGGCGTGCTCACAACCCAAGCGGGGATGCCGTCCGACCGCCTCCGTTCCCCGGGAACGGACGCGGCCGCGGACGGCGGCGCGGGCGACGGCGCGCGGCGGTGGTTCGGCGGGTTCCTTGAGGCGGTGGACGGGTTCGACGCCGGTTTCTTCGGCATCGCGCCGCGCGAGGCCGTCGCGATGGACCCGCAGCAGCGCCTGATGCTCGAAGTGGCGTGGGAGGCGCTGGAGGACGCGGGGATCCCGCCGGAGACGCTCGCGGGCAGCCGTACCGGCGTGTTCGTGGGGGCGATCTGGGACGAGTACGCGGCGATCCTCCGACGGTCCGGGCCGGGCACGGGCGAACGGCACGCCATGACCGGGTCCCACCGCAGCATCATCGCCAACCGCGTGTCCTACGTCCACGGGCTGCGCGGCCCGAGCATGACCATCGACACCGCCCAGTCGTCCTCGCTGGTCGCCGTGCACGCCGCGTGCGAGAGCATCCGGAACGGGGAGTGCTCGCTGGCGCTCGCCGGCGGCGTCAACCTCATCCTCGGCGAGGACGGCATGGAGGCCGCCGCCGCGCAGTTCGGCGGGCTGTCCCCGGACGGCCGCTGCCACACCTTCGACGCCCGCGCGAACGGCTTCGTCCGCGGCGAGGGCGCCGCGACGGTCGTCCTGAAGCCGCTGGACGCGGCGCTGATCGACGGCGACCCGGTGTACTGCGTGATCCGCGGGAGCGCCGTCAACCACGACGGCGCCGCGGACGGCCTGACGGTGCCGAGCGCCGCCGCGCAGGAGGACGTGCTGCGCCTGGCGTGCGAGCGGGCCCGCGTGGCGCCGGGGGACGTCCAGTACGTCGAGCTGCACGGCACCGGCACCCCGGTCGGCGACCCGGTCGAGGCGGCGGCGGTCGGCGCGGCGCTGGGCGCGTCCCGCGCGCACGGCGCCCCGCTGCGGGTCGGCTCCGCCAAGACGAACGTCGGCCACCTCGAAGGCGCCGCGGGGATCGTCGGGCTGCTCAAGGCGGTCCTCAGCGTCGCGCACCGCGAGCTCCCGCCGAGCCTCAACTTCACCGCGCCGAACCCGCGGATCCCGCTCGCGGAGCTGGGCCTGCGCGTCCAGGACGAGCTGACCGCGTGGCCGTGCCCGGACCGGCCGCTGATCGCGGGCGTCAGCTCGTTCGGCATGGGCGGGACCAACTGCCACGTGGTCCTCACCGAACCGCCCGCGACCGCCGCGGAGGTACCCGCGACCGGCGGCGCGACGCGCGCGGACGAGCCCGGCACCGCCGCCGGGAGGCCGGCGGCGCCCGTACCGTCGAACGAGACGCCCCCGATCGTTCCCTGGACGGTTTCGGGGAAGAGCGCGGCGGCGCTGCGAGCCCAGGCGGACCGGCTGCGCGAGCACGTGGAGGACCGTCCCGGCCTGGACCCGGCGGACGTCGGGCACGCGCTGGGCGCGACCCGTTCCGCGTTCGCCCACCGCGCGGTGGTCCTCGGCGGCGGCGCCGATGAGCTGCTCGGACGGCTGGGCGCGCTGGGACGGGGCGAGGACGCGCCCGGCGTCGTCCACGGGACCGTGAACGGTGGCGGCGTCGCGGTGCTGTTCAGCGGCCAGGGCAGCCAGCGCCCCCGCATGGGCCGCGAACTGTACGACGCGCACCCGGTGTTCGCCGCGGCGCTGGACGAGGCGTTCGCCTGCCTGGACCCGATGCTGGACCGGCCCCTGCGCGAGGTGGTGTGGAGCGGCGACGCCGCGCCGCTGGACCGGACCGCCTACACGCAGCCGGCGCTGTTCGCCGTCGAGGTCGCCCTGTACCGGCTGGCCGAGTCGTGGGGCCTGCGCCCCGGCCACCTGATCGGCCACTCGGTCGGCGAGATCGCCGCCGCGCACGTCGCCGGGGCGCTGTCGCTGCCGGACGCGTGCACGCTGGTCGCCGCGCGCGGACGGCTCATGCAGGCCGTCACCGCGCCCGGCGCGATGGCGGCCCTGCGCGCGACGCCGGACGAGGCGGCCGAGCTCGTCGCGGGCCGCGGCGGCCTCGGCGTCGCCGCCGTCAACGGCCCCGCCTCCGTCGTGATCTCCGGCGACCGCGACGCCGTCCGCGACGCCGCCGCGGCCTGGCGCGACCGCGGCGGGAAGGCCACGCTCCTGCGGGTCAGCCACGCCTTCCACTCGCCCCACATGGACGCGATGCTCGACGAGCTGCGCGCCGTCGCGGCGGGCCTGGCGTTCGCCGCGCCCGCCATCCCGGTCGTGTCCAACGTGACCGGCCTCCCCGCGACCGCCGAACGGCTCGCGTCCCCGGACTACTGGGCCGAGCACGCGCGCCGCGCCGTCCGTTTCATGCCGGGCGTGCGGTACCTGTGCGGCGAGGGCGTCACCACGTTCGTGGAGCTGGGCCCGGACGGGCAACTGGCGGCCATGGCCCGCGAGTCGTTCCCCGGCCCGTCCACCGAGGGACCGGACGGCCGGCCGCGTCCGGCGGCCCGCCCGGTGGCGCTGGCGGCGCTGCGCCGCGACCGCCCCGAGACCGCCACGTTCGCGACGGCGATGGCCGAGGCGTACGTCCGCGGCGGCGTGGACCTCGACTGGGACCGGATCTTCGCGGGACGCGCGCGCACGCGGGTGGACCTGCCGACGTACGCGTTCCAGCGCGAACGCTACTGGCCGGGCTCCCTGCCCCGCCCCGAAGCCGTCCCGGCCGGGCAGCCGGGGCCGGGGCCGGAGGCGGGTCCGGTTCTGGAAGCCGAGCCGCCTTCGCCGCTGGAACGCCTCGCGGGGCTCTCCGAGCCCGAACGGCGGCGCGCCCTGCTGGACCTGGTCCGGACGAACGCGGCCGTCGTCCTCGGGCACGCGTCGGGCACGGCGATCGACGCGGACCTGACGTTCAAGCAGCTCGGCTTCGACTCGATGGCGGCCGGGGAGCTCAGCGAACGGCTCGGCGCGGCCACCGGCCTGCCGCTGCCCGCCACGCTCACGTTCGACCACCCGACCCCGCTGGACGTGGCCGCGCACCTGAACCGGCGCGCCACCGCGACTCCGGACGGGACGTCCCGGCAGGACGCGCCGGCGGCCGGCCGGAGCGCGGCGGACGACCCGATCGCCGTCGTCGCCATGAGCTGCCGCTACCCCGGCGGCGCCACGACGCCCGAAGCCCTGTGGGACCTCGTCGCCGCGGGACGCGACGTGATCGGCGGCTTCCCGGACGACCGCGGCTGGGACCTGGCCGCCCTGTTCGACCCGGACACGGACCGGGCGGGCACCACCTACGCCCGGGAGGGCGGCTTCCTGTACGACGCGGCGGAGTTCGACGCGGAGTTCTTCGGGATCAGCCCCCGCGAGGCGCAGGCCATGGACCCGCAGCAACGCCTGCTGCTGGAGACGTCCTGGGAGGCGTTCGAACGGGCGGGCATCCGCCTCGGCGCACTGAAGGGCAGCCCGACCGGCGTTTTCGTCGGCGCGACCGCCCAGGACTACGGCCCGCGGCTCCATGAGGCCCCCAAGGGACTCGACGGGCACCTGCTGACGGGCGCGACGCCGAGCGTGGCGTCCGGCCGGGTGGCCTTCACCTTCGGCCTGGTCGGACCCGCGGTGACGGTGGACACGGCATGCTCGTCGTCCCTGGTCGCGATGCACCTGGCCGCGCAGGCGCTGCGCCAGGGCGAATGCGACCTGGCCCTGGCCGGCGGAGCGACCGTGCTCGCCGCCCCCGGCATGTTCACGGGTTTCTCGCGTCAGCGCGGCCTGGCTCCGGACGGCCGCTGCAAGCCGTTCGCCGCGGCGGCCGACGGAACGGGCTGGGCGGAGGGCGTCGGCCTCGTACTGCTGGAACGCCTGTCGGACGCCTTGCGAAACGGCCGCCGCGTCCTGGCCGTGATCCGCGGCTCGGCGGTCAATCAGGATGGTGCGAGTAATGGTTTGACGGCTCCGAATGGTCCGTCGCAGGAGCGTGTGATTCGTCAGGCGTTGGCGTCTGCGGGGTTGTCGTCGTCGGATGTGGATGCGGTGGAGGCGCATGGGACGGGGACGACGCTGGGTGATCCGATCGAGGCGGGCGCGTTGTTGGCGACGTATGGGCGGGGTCGTTCGGGTGGTCGGCCGTTGTGGTTGGGGTCGGTGAAGTCGAATATCGGTCATACGCAGGCGGCTGCTGGTGTGGCTGGTGTGATCAAGACGGTGATGGCGATGCGGAACGGTTCGCTTCCGGCGTCCCTGCATATCGATGAGCCGTCCCCGCACGTGGATTGGAAGTCGGGCGCGGTACGTCTGCTGACCGAGCCGATGGAATGGCCGACCGAAGATGACCGTCCGCGTCGGGCCGCGGTGTCGTCATTCGGCATTTCTGGAACGAACGCCCACCTGATTTTGGAAGCACCTCCCGAGCCGGACAGCGAGCCCGAACGACCTGCGCCGATGGACGTCGGCCCGCTCCCGTGGGTGCTGTCAGCGAGAAGCGAAGAGGCGTTGCGGGGGCAGGCGGCGGCGCTGGCGTCCCAGGTGCGCGTGTCGGATGACGCGCCGTTGGATGTGGGTTGGTCGCTGGCGGCGTCCCGTTCGAATTTCGAGCGTCGTGCGGTGGTCGTCGGGACCGGACGCGATGAATTGCTGGCCGGTTTGGAAGCTGTGGCGAGCGGTGAAGTGCGTTCTGTTGCGCCTTCCAGAGTCGGCGCGGGCCCGGTGTTGGTGTTCCCGGGTCAGGGTTCGCAGTGGGCCGGTATGGGTGCGGGGTTGCTGGATTCCTCGCCGGTGTTCGCGGAGCGTGTTGCTGAGTGTGAGCGTGCGTTGGCTCCGTTTGTTGATTGGTCGCTGTCGGACGTCCTGCGTGGCGACGGGGCGGAGTTGGCGCGGGTTGATGTCGTTCAGCCGGTGCTGTGGGCGACGATGGTTTCGCTGGCGGCTGTGTGGGCGTCTTATGGTGTCGTTCCGTCTGCGGTGATTGGTCATTCGCAGGGGGAGATCGCTGCCGCGTGTGTGGCTGGGGCGCTGTCGCTGGAGGCCGGCGCCAGGATCGTGGCGTTGCGGAGTCGGGCGTTGCGGCGGTTGGCCGGTGGCGGTGCGATGGCTTCGCTTGGTGTCGGTGTGGACGCCGCGCAGGAGTTCCTGTCTGCGGATGTAGCGGTGGCGGCGGTGAACGGACCGTCGTCCACGGTGGTGTCGGGCCCGCCTGAGCAGGTCACCGCAGCGGTGGCCCGTGTGGAAGCTGCCGGGTATCGGGCGCGGCTCATCGACGTTGATTACGCCTCGCATGGGCCGCAGGTCGATGAGATCACCGACGAATTGCATGAAGTCCTCGCGGGCATCGAGCCCGCACCGACAACAGAAGTGGCGTTCTATTCGACCGTGTCGGGTGGGCGGATCGACACCACGAGCATGGACACCGGTTATTGGGTGACCAATCTTCGGGAACGGGTGCGGTTCGCGGATGCCGTTCGGGAGTTGGCGGGCGACGGGCATCGGGTGTTCATCGAGGCGAGCCCGCATCCCGTTCTGACGCTCGGCACGCAGGAGACGCTGGAAGAGGCCGGGACCGTTGCCACGGTGGTGCCGACGCTGCGCCGTGACCACGGTGGTCGGCGGCAGCTCGCGGACGCGCTGGGGCAGGCGTTCGCGGCGGGGGTGGAGGTCGACTGGTCGCGGTGGTTCACCGGTGATCGGGCTCCGCGGCTTGTCGAGTTGCCTACGTACGCGTTCCAGCGCAGGCGCTACTGGCTGGACGGCCGGAGCGGGCCCGGCGGCGATCCGGAGAGCCTGGGGCTGGCGTCGGCCGGGCATCGGCTGCTGGGCGCGGCCGTCGAGCACGCGGGCGCGGACGCGTGGCTGCTCACGGGACGGCTCTCGACGCGGACGCAGCCGTGGCTGGCGCAGCACTCGGTCGCCGGGACGGTCGTCGTGCCCGGTGCGGCGCTGGTCGAGTGGGCTTTGCGGGCGGCGGACGAAGCGGGTTGCGGCGGAGTGGACGAGCTGACCCCGCGGGTTCCGCTGGTGCTGGCCGGTCCGGAGCCCGTACGGGTCCAGGTGGCCGTGGACGCGGCGGCGGACGACGGGACTCGCGACGTACGGGTGTACTCCCGGCCCGACGGCGACGAGGAAGCGGAATGGACCTGTCATGCGGAAGGCGTCCTCTCAGCGGGGTCGCCTGAGCCCGCGCCCGAGCCCGAGCCGGGTGGCGGCGGGGCGTGGCCTCCGCCGGGCGCCGAGCCGCTGGCCGCCGATGACGTCTACGAACGGTTGGCGGAGGCGGGATACGACTACGGGCAGGCGTTCCAGGGCGTGCGGGCGCTGTGGCGGCAGGGCGCGGACCTGCTCGCGGACCTCGTGCTGCCCGAGGCGGCCGGGGGGCCGGGAGGGTTCGGGATCCATCCCGCGCTGCTGGACGCCGCGTTGCATCCGTCGCTGCTCGACGCCGGTCCGGCCGGGGACGGCCGGGTGTGGCAGCCGTTCGCTTGGAGCGGGGTGTCGCTGTGGGCGGCGGAGGCGACCGCGGTACGGGTCCGGCTGACCCCGCACCACGGGGAGCAGGGCGTGCAGGTGGCCGTGGCGGTGACGGACGCGCTCGGCGGGCCCGTCCTGGACGTCGCGTCGGTGGTGATGCGCCCGGCGGACGTTCGGTGGCTCCGCGAGGGCGGTCACCGCGCCGGGCGGCGGGCGTCCACGGCCGCCGGGGGCACTACGCGTGTGCGGCGTGCGGTGGCCGCCGGCGGCGGGAACGGTTCGTCCGTCGACTGGGCCGACCGGCTGGCCGGGCTCGCCGAGGACGAACGGCGCCGGCTGGTGCTCGACCTGGTGCGTACACATGCCGCGACCGTGCTGGGCCATGCCGAGCCCGACGCGGTTCCGACCGGGACCAGCTTCAAGGAGATGGGGTTCGAGTCGCTGACGGCCGTCGAGCTGCGCGACCGGCTCGCCGCCGCGACCGGGCTGCGCCTCCCGGCCGCGCTGGTTTTCCGTTATCCGACGCCGGACGGCATCGCCGATCACCTGGTGGAACGCCTCGTCGCGAAGGAGCAGACACCGCCCAACGGACGCAACGGCGGACGCGCGGGCGAAGGCGGAAACGGCGGCGAGCGCGGAAACGGGAACGGGCACCGCTCCGTGACGGGCGTCGGTTCGGTCCTCGGCGAGCTGGACAGGCTGGAGGACACCCTGGCCGGGTTCGTTCCGGAGAACGGCGACTCCGGCGCGGTCACGGCCCGGCTCGAACGGCTGCTGACCAAGTGGAAGGCGGCGGCGCGGAACCCGGCGGGCGAGCCCAGCGCCGCCGAGCGGCTGGAGTCCGCGTCCACGGATCAGGTCCTTGACTTCATCGAGAACGAGCTTGGGGTGTCCTGAATGGCGAACGAAGAGAAGCTGGTCGACTATCTCAAGCGCGTCTCCGCGGAGCTGCACGACACGCGCCGTCGCCTGAGCGAGGTCGAGGACCGGGCCAACGAGCCGATCGCCGTCGTCGGCATGGCGTGCCGGTTCCCCGGCGAGGTCCGTTCGCCCGAGGACCTGTGGGACCTGCTCGTCTCCGGAAAGGACGCCATCGGCGACTTCCCCGACGACCGCGGCTGGGACCTCGACGGCCTCTACCACCCCGATCCCGCGCATCCCGGCACCAGCTCCACGCGGCAGGGCGGATTCCTGTACGACATCGACCGGTTCGACGCGGGATTCTTCGGGATCAGCCCGCGTGAGGCGCTCGCCATGGACCCGCAGCAGCGCGTCCTGCTGGAGACCGCCTGGGAGACGTTCGAACGGGCCGGAATCGACCCGTTCTCCCTGAAGGGCACCTCGACCGGCGTGTACGCGGGGGTGTCCAGCCAGGACTACCTGGCGCGGGCTCCGCGGATCCCCGAGGGCTTCGAGGGGTACGCGACGACGGGCAGCCTGACGAGCGTCGTGTCCGGCCGGGTGGCGTTCACGCTCGGGCTCCAGGGCCCGGCCGTCACCGTCGACACGGCGTGCTCGGCGTCGCTCGTCGCGATCCATCTGGCGTGCCAGGCGCTGCGGCAGGGCGAGTGCACGCTCGCGCTGGGCGGCGGCGTCTGCGGGCTGGCGACCCCGGTCATGTTCACCGAGTTCTCGCGGCAGCGGGCGCTGTCCCCGGACGGGCGGTGCAAGTCGTTCGCGGCGGACGCGGACGGCACCGGGTTCTCCGAGGGCGTCGGGCTCGTCCTGCTGGAACGGCTGTCGGACGCGCGCCGCAACGGCCACCGCGTCCTCGCGGTGGTCCGCGGCTCCGCCGTCAACCAGGACGGCGCGAGCAACGGCCTCACCGCGCCCAACGACGTCGCGCAGGAACAGGTCATCGAGCGGGCGCTCACGGCCGCGCGGCTCGCGCCGTCCGAGGTGGACGCGGTGGAGGCGCACGGGACCGGCACCCGCCTCGGCGACCCGATCGAGGCCGAGGCGCTCATCGCCACCTACGGCCGGGACCGGCCCGCGGAGCGCCCCCTCTGGCTCGGCTCCGTCAAGTCGAACATCGGGCACACGCACGCCGCGGCGGGCGTGGCCGGCCTGATCAAGATGGTGATGGCCCTGCGGAACGAGACGCTCCCGGCCAACCTGCACCTTTCCGAGCCGACCCCGCACGTGGACTGGGACACCGGCGCGGTGCGCCTCCTCACCGCGCCGGTGCCCTGGCCCCGTGACGACCGTCCGCGCCGCGCGGGCGTGTCGTCGTTCGGCATCTCCGGCACGAACGCCCACCTGATCGTGGAAGAGGCCCCCGAGCCGGCGGATCCGTCCGAGGCGGCTGCCCCGCCCGAGCCAGACTCGGGGGCGGTGGTGCCGTGGGTGTTGTCGGGGCGGAGTGAGGAGGCGTTGCGGGGTCAGGCTGCCGCGTTGGCTGCTCGGGTGGGCGGTTCGGAGGGTTGGGCTGCGGCTGATGTGGGCTGGTCGTTGGTGACGACGCGTTCGGTGTTCGAGCACCGTGCCGTGGTCGTCGGCTCGGAACGCGACGAACTGCTGGACGGCTTGAAGGCGCTGGCGGCGGGCGAGGCGCACCCGAACGTGGTCCATCCGGGGAGCCCGGCCGTGAGTGGCGCGAACGCGCGAGGCCCGGTGCTGGTGTTCCCGGGCCAGGGGTCGCAGTGGGCCGGTATGGGCGCGGAGCTGCTGGATTCCTCGCCGGTGTTCGCGGCCCGGATCGCCGAGTGCGAGTGTGCGCTCGCTCCGTTCGTTGACTGGTCGCTGTCGGACGTCCTGCGGGGCGACGGCGCGGAGCTGACGCGCGTCGATGTCGTTCAGCCGGTGCTGTGGGCGACGATGGTGTCTCTCGCGGCCGTGTGGGCGTCGTACGGTGTCGTTCCGTCCGCCGTGATCGGTCACTCGCAAGGGGAGATCGCCGCCGCGTGTGTGGCCGGGGCGCTGTCGTTGGAGGACGGCGCCAGGATCGTCGCCGTGCGCGGCCAGGAGCTGCGGCGGCTCTCCGGGCGGGGCGCCATGGCCTCGCTCGGGGTGGGGGAGGAGCGCGCCGGGGAACTCGTCGAAGGCGACGTCACGGTCGCGGCGGTGAACGGGCCGTCGTCGACGGTGGTGTCGGGCCCGCCCGAGCAGGTCGCCGCGGTGGTGGCCCGCGCGGAGGCCGCCGGGCACCGGGCGCGGACGATCGACGTCGACTACGCCTCGCACGGCCCGCAGATCGAGCTGATCGGCACGGAGCTGACCGAGAAGCTGGCCGGGGTGCGGCCCGTGCGCGCGGACGTGGGGTTCTACTCGACGGTGACGGGCGGCCGGATCGACACCGCCGGGCTCGACACCGTCTACTGGGTGACCAACCTGCGGCTGCGGGTGCGGTTCGCCGAGACCGTCCGGGCGCTGCTGGACGACGGGCACCGCGTGTTCATCGAGGCGAGCCCCCATCCCGTGCTGACGGTCGGCCTGGAGGAGTGCTTCGAGCAGGCCGGGACCGCGGCCGTCGCCGTCCCGACGCTCCGTCGCGGCGACGGCGGCGCGGGGCAGGTGGCGAAGGCGGCCGGGCAGGCGTTCGCCGCCGGGGCGCGGGTGGACTGGACAGGCTGGTTCCGCGGCCGTCCGGCGGCCCCGCGCGTGGTCGATCTGCCGACGTACGCGTTCCAGCGCGAGCGGTACTGGCTGGAGGGCCTGAACGGCGACGGCGGCGATCCCGCCGGGCTCGGGCTCGTCCCGGCCGGGCATCCGCTGCTGGGCGCGGCGGTCGAGGTCGCGGACGGCGACATCCACCTGCTGACCGGCCGCCTTCCGGGCAACGGGAACGCGGGCTGGCTGGCCGACCACCGGGTGCTCGGGACCGTCCTGGTGCCGGGGACGGCGCTGCTGGAGTGGGCGCTGCGCGCCGCCGACGAGGCCGGCTGCGGCTCCGTCGAGGAACTGGTGCTGCACGAGCCGATGGTCCTGCCCGGCGGGAGCGGCGGGAGCGGCGGCCTGCGCGTGCAGGTGGTCGTCGGCGCGGCGGGCGAGGACGGGCGGCGCGAGGTCCGCGTCAGCTCCCGCCCCGGCCAGGAGGACAGCGCAGGCGAACGCGGAGGCGAGGCGCGCGACTGGGTGTGCCACGCGTCCGGCGTGCTCGCCCCGGAAACGACCGTTCCGGCCCCGGGGCTGGACGGGATCTGGCCGCCTCCGGGCGCGGAGGCCGTTGACGTGAGCGACGTCTACGAACGCGTCGCGGCGTCCGGCTACGAGTACGGCCCCGCGTTCCAGGGCCTGCGGGCGCTGTGGCGGCACGGGACGGACCTCCTCGCGGAGGTCGCGCTGCCCGACGCGGCGGGGAAGCCGGACGGTTTCGGCGTCCATCCCGCGCTCCTCGACGCCGCGCTGCACCCGGCGCTGCTGGAGGACGGCGACGACGACGCGGAGGGCGGCGGCGAGCGCCGGGTGTGGCTGCCGTTCGCCTGGAACGGCGTGTCGCTGCGGGCGGACGGGGCGACCGCCGTCCGGGTCCGACTGTCTCCCGAGGGGGACGGCGCGGACGAGCGGAAGGTACGCGTCACGGTCGCGGACGCCGTCGGCGGTCCCGTCCTGGACGCCGAATCCGTGGTGATGCGCCCGGCCGACATCGACCGCTTGAAGTCGGCGGGCCGTCCCGGCGCGGGAGGAGCGGAAGGGCTCTTCACCGTCGAATGGACGGCCCTGCCCGAGCCCGCCGCGGAACCGTCCGCCGAGGACGACACCGACCACGTCACGCTCGCTCCCACCGCCTCCGCCGAAGACGGCGCCTCGTACGCGGGGCTCGACGGGCTGGCGGCGGCGCTGGACGGCGGCCTTCCCGCGCCGTCGGTCGTCCTCGCCGAGGTCGGCGCCGTCGCGGACGCGCTCGCGCTGGTGCAGCGGTGGCTGGCCGAGCCGAGGCTGTCCGAGGCGCGGCTGGCGGTCGTGACGCGCGGAGCGGTCGCGTGCGAGAACCCCGTACCGGACGGCGCGGGGGTGTGGGGCCTGGTCCGCAGCGCCCAGGCGGAGAACCCTGGACGGTTCGTGCTCCTCGACGTCGGCGCCGAAGCCGACGCCGGAGCCGACGCCGGCTCCCGCGCCGAACAGGCTGAGGCCGTGCGGCGGGCCGTCCGGGCGGGCGAGCCTCAGGTCGCGTGGCGCGACGGGCGGCTGCTCGTGCCGCGCTGGGTGCGCGCGGGCGAGCCGGTCGGGATCGTTCCGCCGCCGGGGGAGCGGGCGTGGCGGCTCGGGCTCTCGGGGACGGCGACGCTGGAGAACGTGTCGGCGACGGCGTGCCCGGAGGCGCTGGAGCCGCTGGAGCCCGGTCAGGTGCGCATCGACGTCCACGCGGCCGGGGTCAACTTCCGCGACGTGCTGATCGCGTTGGGGATGTATCCGGGCGACGCGGCGTTCGGCGGCAGCGAGGGCGCGGGCGTGGTGACGGAGGTCGGCCCGGACGTCGCGGGGCTGGCGGCCGGGGACCGGGTGATGGGCCTGTTCGACGGCGCGTTCGGGTCGGTCGCGGTGGCCGACGCGCGGATGGTCGCGCCCGTCCCGGACGGCTGGGACCTCCGGCAGGCCGCGGCGGCGCCGGTGGCGTTCCTGACCGCCTGGTACGGGCTCGTCCACCTGGGCGCGCTGCGGCGCGGCGACAGCGTGCTGGTGCACGCCGCGACCGGCGGCGTCGGCATGGCGGCGGTGCAGGTCGCGCGGCACGTGGGCGCCGAGGTGTACGCGACGGCGGGACCGGCCAAGCACGGCGTGCTGGACGGCATGGGCATCGACGAGGCGCACCGCGCGTCGTCCCGCGACCTGGACTTCGAGGACGCCGTGCGCCGCGCCACGGGCGGCCGGGGCGTGGACGTCGTCCTCAACAGCCTGTCCGGGCCGTTCACCGACGCGTCGCTGCGGCTCCTCGCGGACGGCGGCCGGTTCGTCGAGATGGGCAAGACGGACGTCCGCGCGCCCGACCCCGCGTCCGACGGGATGGCCGAGGGCGTGGCGTACCGGGCGTTCGACCTGCTCGCCGACGCCGGGCCCGACCGGATCGCGGAGATGCTGTCCGAGCTGGCGGGGCTGTTCTCCAGCGGCGCGCTGCGGCCCCTGCCGGTGCGGGCGTGGCCGCTCGGCCGCGCGCGGGACGCGCTGCGGCACATGAGCCAGGCCCGCCACACCGGGAAGCTGGTGCTGGACGTCCCCGCGACGCCGGACCCGGACGGCACCGTCCTCATCACCGGCGGGACGGGCACGATCGGCGGCGAGGTGGCCGTGCACCTGGCGCGCGACTGGCGCGTCCGGCACCTGCTGCTCCTCAGCCGCAGCGGCCCGTCCGCACCGGGCGCGGACGCGCTCGCCGAACGCATCCGGGAGCTCGGGGCCGAGGCCGCCATCAAGGCCGTCGACGTCGCCGACCCCGTCGCGCTGGACGAGGCGATCGCCGCGATCGACCCGGCGCATCCGCTGACCGGGATCGTCCACGCCGCCGGGACGCTCGACAACGCGCTGGTCACCTCCCTCACGCCGGACGCGCTGGCCGGGGTGTGGGACGTGAAGGCCGGAGCCGCCGCGCACCTGCACGCCGCCACCGCCGGGACGCGGCTCGGGCTGTTCGCCGTGTTCTCCTCGTTCGCGTCCACGCTGGGGACGCCGGGCCAGGCCAACTACGCCGCCGCCAACGCGTACTGCGACGCGCTCGCCGCCGCGCGCCGCGCCGAAGGGCTGCCCGGCCTGTCGGTCGCCTGGGGGCTCTGGGCGACGGCGAGCGGACTCACCGGCGGGCTCACCGAGGCCGACACCGCGCGCATCCACCGCTACGGCATCCGCCCGAACGGCACCGAAGAGGGCCTCGCGCTGCTCGACGCCGCCCAACGGCACGGCGGCCCGGAGGTCCTCGCGCTCAACTTCGACGCCCGCGAACTCGCCGGACGGCCCCCGGACACCCTGCCCGCCCCGCTGCGCGGCCTCGCCGCGACGGCCGCCGCCAAGCCGGGCGGCGTGCCGGGCCGTCCCACCGCCGCGGGCGGCGAACGGCCGACCGACTGGACGGCGCGGCTCGCCGGCCTGCCCCCGTCCGAACGGCACCGCACGCTGCTCAACCTCGTGCGGACCCACGCCGCGGCCGTCCTCGGCCACGCCGACGCGGGGCTCGTCCGCGCGGACGCCTCGTTCAAGGAACTCGGCTTCGACTCGCTCACCGGCGTCGAGCTGCGCAACCGCCTGTCCGCCGCGACCGGGCTGCGCCTGCACGCCGCGCTGGTCTTCGACCACCCGGAGGCGGGCGCGCTGGCCGACCACCTCCTCGAACTGCTCGCCCCGGACGGCGCGCCACCCGCCGGCGCGGACGCCACCGCGCCGGTGCTGGGCGAGGTGGCGCGGCTGGAGAGCACCCTGTCCGCCGTCGCCGCCGGGGCCGTGTCCGCCGACGCGGTCACCGCGCGGCTGGAGGACCTGCTCGCGAACTGGAAGGCGGCGCACGCGCCCGACGGCGGCACCGGCGGCGACGGCGACGCGGACGGCGCGGAGGAGAGGCTCCGGGTCGCCGACGCGGGCCAGATCCTCGACTTCATCGACAACGAGCTGGGCGTGAGGCCGTGAACGAAGAGAAGCTGGTCGACTACCTCCGGCGCGTCACCACCGACCTGCACCACACCCGCAAGCGGCTCAGCGAGGTCGAGGACAGGCAGCGCGAGCCGGTGGCCGTGGTGGACATGGCGTGCCGGTTCCCCGGCGGGGTCCGTTCGCCCGAGGACCTGTGGGAGCTGGTCGCGTCGGGCCGGGACGCGATGGGCGCCTTCCCCGAGGACCGCGGCTGGGACCTGGACCGCCTGTTCCACCCCGACCCCGACCATCCGGGGACCAGCTACGCTCGCGCGGGCGGGTTCCTCTACGACGCCGCCGAGTTCGACCCCGAGTTCTTCGGGATCAGCCCGCGCGAGGCGCTGGTCCTCGACCCGCAGCAGCGCCTGCTGCTGGAGGTGTCGTGGGAGGCGATGGAACGGGCGGGCATCGACCCGTCCACGCTGAAGGGCTCCCGCACCGGCGTGTACGCGGGCGCGGCGCTGCCCGGCTTCGGCACCCCGCACATCGACCCGAGCGCCGAAGGCCACCTGGTCACCGGGAACGCCCCGAGCGTCCTGTCGGGCCGCCTCTCCTACACCTTCGGCCTGGAGGGGCCGGCGGTGACGATCGACACGGCCTGCTCGTCGTCGCTCGTCGCGATCCACCTCGCCTGCCACGCGCTGCGGCAGGGCGAGTGCACGCTCGCGCTCGCGGGCGGCGTGACGGTGATGGCCGTCCCGAACGTGTTCACCGAGTTCGCGCGGCAGCGCGGGCTGGCCGCCGACGGGCGCTGCAAGCCGTTCGCGGAGGCCGCCGACGGCACCGCGTTCGCCGAGGGCGCCGGGATGCTGCTGCTGGAGCGGCTGTCGGACGCCCGCCGCAACGGCCACCCCGTCGCGGCGGTGATCCGCGGCTCGGCCGTCAACCAGGACGGCGCCAGCAACGGCCTGACCGCCCCGAACGGCCCGTCCCAGCAGCGGGTGATCCGGCAGGCGCTGGAGGCCGCGCGGCTCGCCCCGTCCGAGGTGGACGCGGTGGAGGCGCACGGGACCGGCACCCGGCTCGGCGACCCGATCGAGGCGAAGGCGCTGCTGTCGGTCTACGGCGCGGACCGGGACGCGGGCCGCCCGCTCCGGCTCGGCTCCTGCAAGTCCAACCTCGGCCACACCCAGGGCGCCGCGGGCGTCGCCGGCGTGATCAAGATGGTGCTGGCGCTGCGGCACGAGCGGCTCCCCGCGACGCTGCACGTGGACCGCCCGACCCCGCACGTGGACTGGGACTCCGGCGCGGTGCGCCTCCTCACCGAGCCCGCCGGGTGGCCGCGCGGCGGCGACCGCGCGCGCCGGGCCGGGGTGTCGTCGTTCGGCATCTCCGGAACGAACGCGCACCTGATCCTCGAAGAGCCCCCCGCCGAGAACGCCCAGGACGCCGAGCACGCCGAGCACGCCGAGCACGCCGACGACGCCCCGGCCGCCTCCGCCGGGCGCGCCGCGAACGAGGCCCGCGGGCCGATCCCGTGGGTGCTGTCGGGACGCACCGGGCCCGCCCTCGCCGAGCAGGCCGCCGCGCTCGCCGAGCACGTCGCCGCCCGTCCCGAGCTGCCCGCCGCCGACGTGGCCTGGTCGCTCGCGCGGACGCGGACGCCGTTCGAGCACCGCGCGGTCGTCCTCGGCGACGACCGGGACGCGCTGCGCGACGGGCTCGCCGCGCTGGCGGACGGCCGGGACCATCCGGACGTGACGCGCGCCGCGGGCCCTGCGGGGCCGGGCGGGACGGCGTTCATGTTCACCGGGCAGGGCAGCCAGCGGCCCGGCATGGGCCGCGACCTGCACCGGACGTTCGCCGCGTTCGCGGAGGCGTTCGACGAGGCCGCCGCGCACCTGGACCCGCTGCTCGGCCGGTCGCTGCGCGACCTGGTGTTCGCCGACGGCGGCACCGCCGAGGCCGCCGACCTGGACCGCACCGGCGTGACCCAGGCGGCGCTGTTCGCGGTGGAGACCGCGCTGTTCCGGCTGGTGGAGTCGTTCGGCGTCGTCCCCGCCTACCTGACGGGCCACTCGGTGGGGGAGATCGTCGCCGCGCACGTCGCGGGCGTGCTGTCCCTCCCGGACGCCTGCGCGCTGGTCGCCGCGCGGGGCGGGCTGATGCAGGCGCTGCCCGAGGGCGGCGCGATGGTCGCCGTCGAGGCCCCCGAGGACGAGGTGGCGCCGCTGCTCGCCGGGCTGGAGGACCGCGTCTCGCTGGCGGCGGTGAACGGCCCCGCCTCCGTCGTCGTGTCGGGCGACGGGACGGCCGTCGAGGAGGCCGTCCGCCCGCTGCGGGAACGCGGCCTGAGGGTCAGGCGGCTGCGCGTCAGCCACGCGTTCCACTCCCCGCTCCTCGGTCCCATGCTGGACGGCTTCCGCGAGGTCGTGAGCGGTCTGGCGTTCGGGCCGCCGGAGATCCCGCTGATCTCCAACCTCACCGGCGAGCCCGCCGACGCCGGACTGCTCGGCGACCCGGAGTACTGGGTGCGGCACGCCCGGCGGACCGTCCGGTTCCACGACGGCCTGCGCACCCTCCGCGCGGAGGGCGTGACGCGCTACCTGGAGCTCGGCCCGGACCCCGTCCTCACCGCCATGGCGCAGAAGTCCCTGGCGGACGACGGCCCCGCCCCGGCGTTCGCCGCCGCCATGCGCAAGGGCCGCGACGAGCCCCGCACGTTCCTCACCGCCCTCGCCGCGACGCACGTCGACGGCGGCGACGCCGACTTCGCCGCCGCGCTCCCGGCGGGCGCGGGACGCGTACCGCTGCCCACCTACCGGTTCCAGCGCAGGCGGTACTGGCGGCCCGTGCAGGACGCGGCGGCGGAGGTGAGCGCGTCCGGCCTGCGCCCGACCGGCCACCCGCTCCTCCAGGCCGCCGTGGAACCGGCGGACGGCGGGCTGCTGCTCGCCGGACGCCTGTCGCCCCGCACGCACGGCTGGCTCGCCGACCACGCCATCGCGGACGGCGCGCCGCTGCCCGCGACGGCGTTCCTGGAGTTCGCGCTGGTCGCGGCCCGGCGGACCGGCTGCGACGCCGTCGAGGACCTCAGCCTGGAGACGCCGCTGCCGCTGCCCGACGCCGAACCCGTGGACCTCCAGGTCGCGGTCGGCGGCCCGGACGGCGCGGGCCGCCGCCCGGTCACCATCTACTCCCGCCCGGTGGACGACGCGCACGACGACCCCGACGCGCCCGCGGCGGCGCGCCTGGACGACCTCGGACCCGCGAACGGCTGGCGCCGCCACGCCGCCGGAACCCTCGCCGGACCCGACCCCGACCCCGACCCCGCCCCGGCCGCGCCCGCGCAGTCCGCCGCTTGGCCCCCGGAGGGCGCGGACGAGATCGACGCGGCGGAGCTGTACGAGCGGCTGGCCGACCAGGGCTACCGGTACGGGCCCGCGTTCCGCGGGGTGCGCGCCGCGTGGCGGCTCGGCGACGAGCTGTTCGCGGAGGTGCGCCTCGCGCCCGACCAGCACGCCGACGCCGCCGCGTACGCCGTCCACCCGGCCCTGCTCGACAGCGCGCTCCACCCCGTGGACGAGCTCTACGAGCACGACGGGAGCACGGTCCGGCTGCCGTTCTCGTTCGGCGGCGTGCGCCTCCACGGAACGGGCCGCACGCGGCTGCGGGTCCGGATCACGCCCGGCGACGCGGACGTGCTGTCGCTCGCGCTCACCGACGACGACGGCGCCGCCGTCGCGTCCATCGGGTCCCTCGGCCTGCGCAAGGTCCCCGCCGGACGGTGGAACGGCGCCGCGCCCGCCGACGACGCCCTGCACCGGCTGGCCTGGCACCCCCTCCCCGTACCGGACGGCGGGCCCGGAACCGCCCGCCCCGCCGACGACTGGGCCGCCATCGCCGCGGCCGGCCCGGACGCGACCGGTCCGGCTGCGGACGGCCTGAGTGCGGACGGCTTGGGTGTGGTCGCGCCGGACGCCGTCCACCCGGACCTCGCCGCGTTGAAGACCGCGCTGGCCGACGGGCGCCCCACTCCCGCCGTCGTCCTCCTCCCGCTCCCGACCGGCGGGCCCGACGCCGCCGAGCCCGTCGCGCGGGCCCACGACGCCGCGCGGCGCGGGCTCGAACTGCTCCAGGAGTGGCTGGAGGAGGAGCGGTTCGCCGCGGCGCGGCTGATCGCGGTGACGCGGGGCGCGGTCGCCGCCACCGGCGACGACCCGCCCTCCGACCTGGCGGGCGCGCCGCTCTGGGGACTGGTCCGCGCGGCGCAGGCCGAGTACCCCGACCGGATCGTCCTCCTCGACCTCGACGGCGAGGACGCCTCCCGCGACGTTCTGCGGGGCGCGGTCGCCGCCGCGATGGAGAACGGTGAGTCGCAGCTCGCGGTGCGTTCGGGAGACGTCCTGGTCCCGCGGCTCGTTGACGCGGCGGAACGGGACGCGGACGGAACCGCGCCGGCACTCGACCCCGACGGGACCGTCCTGGTCACCGGCGGGACGGGGGCGCTCGGACGGCTCGTCGCGCGGCACCTCGTGACCGCGCACGGAGCGCGGCACCTGCTGCTCACCGGGAGGCGCGGCCCCGCCGCGGACGGCGCGGCCGAGCTGGAGGCGGAGCTCGCCGGGCTCGGCGCGGACGTGCGCGTCGCCGCCTGCGACGCCGCCGACCCGGACGCGCTCGCCGCGCTGCTCGCCTCGGTGCCCGCCGCGCACCCGCTGACGGCCGTCGTCCACGCCGCCGGGATCGTGGACGACGGCGTGATCACCTCGCTGACGCCCGGACGCCTCGCCGCCGTGCTGGCGCCGAAGGCCGACGCGGCCTGGCACCTGCACCGGCTGACCGAGGGGGACGACCTCGCGGCGTTCGTGCTGTTCTCGTCCGCCGCGTCCGTCCTCGGCAACGGCGGCCAGGCGAGCTACGGCGCGGCGAACATGTTCCTCAACGCCCTCGCCGAGCACCGCCGGGCCGGCGGGCGTCCCGCGACGTCCCTGGCGTGGGGCCTCTGGGAGGCGTCCGGCGGGATGGCCGAGGACCTCGCCGCCGCCGACCGCGCCCGCATCACCCGCTCCGGCGTCGCCGCGATGCCGCCCGGCCGCGCCCTCGCCCTGCTCGACGCGGCGCTCGCCGCGCCGCACCCGACGCTCGTGCCGGTCGCGTTCGACCGCGCGGCGCTGCGGGAGCAGGCGGCGGCGGGGACGCTGCCGCCCGTCCTGCGCGGCCGGGTCCGCGTGCCGGCCCGGTCGCCGCGCGGGCCCGCGGCGTCCTCCTTCGCGGCGCGCCTCGCCGGGCTGCCCGCCGAGGAACGGGACCGCGCCGTCGCCTCCGCCGTCCGCGACCAGATCGCGACCGTGCTCGCGCACCCCGAACCCGAGGCCATCGACCTCAACCGGGCGTTCCAGGAGCTCGGGTTCGACTCGCTCACCGCCCTGGAGCTGCGCAACCGGCTGAACGCGGCCACGGGCGCGGCCCTGCCCGCCACCGTCATCTTCGACCACCCGACCCCCGAGGCACTGATCCGCGCCGTCCGCACCCGCCTGGACGGCGCGGCGGGCGGGCGCGCCGTCCGGGCCCCGGTGAACGGCGCGCCGCGCGCAGCGGCCGCGGACGACGGCGAACCGATCGCCATCATCGGCATGGGCTGCCGCTACCCGGGCGGCGTGACGTCCCCGGAGGACCTGTGGGAGCTGGTCGCCTCGGGCGCGGACGCGATCGGGGAGTTCCCGGCCGACCGCGGCTGGGACCTGCCGGGCCTGTTCGACCCCGACCCCGACCGGCTCGGCCACAGCTACACGCGCGAGGGCGGGTTCCTGTACGACGCGGCGCGGTTCGACGCCGGGTTCTTCGGGATCAGCCCGCGCGAGGCCGTGGCGATGGACCCGCAGCAGCGGCTGCTGCTGGAGACGGTCTGGGAGGCGTTCGAACGCGCCGGGCTCGACCCCGCGGACCTGCGGGGGAGCCGCACCGGCGTGATCGCGGGCGTGATGTACGACGACTACGGCAGCCGGTTCCTCGGCCGGACGCCGCGCGCCGTCGAGGGCCGCCTCATGACGGGCAGCACGCCGAGCGTCGCGTCCGGCCGCGTCGCCTACACCTTCGGCCTGGAGGGCCCCACCCTGACGGTGGACACCGCGTGCTCGTCGTCGCTGGTGGCGATGCACCTCGCGGCGCAGGCGCTCCGCCGGGACGAGTGCACGCTGGCCGTGGCGGGCGGCGTCACGGTGATGGCGACGCCCAACACGTTCGTGGAGTTCTCGCGCCAGCGCGGCCTGGCCCCGGACGGCCGCTGCAAGTCGTTCGCCGCGTCGGCGGACGGCGTCGGCTGGGGCGAGGGCGCCGGGATGGTCGTGCTGGAACGGCTGTCGGACGCGCGCCGCAACGGCCACCGCGTCCTCGCGGTGCTGCGCGGCTCGGCGGTCAACCAGGACGGCGCGAGCAACGGCCTCACCGCCCCGAACGGCCCGTCGCAGGAACGCGTCATCCGCGCCGCGCTCGCCGGAGCCGGGCTCGCGCCGTCGGACGTGGACGCCGTCGAGGCGCACGGCACCGGCACGACCCTCGGCGACCCGATCGAGGCGCACGCGCTCCTCGCCGCCTACGGGCAGGACCGCCCGGACGGCCGTCCGCTCTGGCTCGGCTCGGTGAAGTCGAACATCGGCCACACCCAGGCCGCCGCGGGCGTCGCCGGCGTCATCAAGATGGTCATGGCCATGCGCGAGGGTTCGCTCCCCGCCTCCCTCCACATCGCCGAACCGTCCCCGCACGTCGACTGGACGGCGGGCGCCGTGCGCCTCCTCACCGAACCCGTCGAATGGCGGAACGGCGACCGTCCGCGCCGCGCCGGGGTGTCGTCGTTCGGCATCTCCGGGACCAACGCCCACCTCATCCTTGAGCAGGCCCCCGACCCCGCGGAGACTCCCGCGCCCGAGGGCCCGGTCATCCTGAACGGCGCGGCGAGCGCCGAGGCGGAGACAGGAACGGGCGATACGGGCGATACCGGGGTGCCGGACCTCGCCCTGGTCCCGTGGACGGTCTCGGCGCGCGACCCGGACGCCTTGCGCGGCCAGGCCGCCGCGCTCGCCGCCCACGTCTCCGCCCGGCCCGGCCTCCCCGTGGCGGACGTGGCGTGGTCGCTGCTGAAGACCCGGTCGTCGTTCCCCCACAAGGCCGTCGTGGTCGGCGAACGCCCCGCGGAACTCGTCGCGGGACTGGAGGCGCTCGCGGCCGGCGCCGACGCCCACCCGGCCCTGACCGGGCCGGGAGCACCGGCGGCCCCCGGCCGGCTCGTGTGGCTGTTCAGCGGCCAGGGAAGCCAGCGCGTCGGGATGGGCGCCGACCTCTACGTGCGGTTCCCGGCGTTCGCGGAGGCGTTCGACGAGGTGTGCGGCCACTTCGACGAACGGCTCGGACGCCCGCTCGGCGAGGTCGTCTTCACCGGCCCGGCGGACGTCCTGGACCACACCACCTACGCGCAGGCGGGCCTGTTCGCCCTCCAGGTCGCGCTGGCCCGCCTGCTCGGCACGGCCGGGATCCGCCCGGACGCCGTGATCGGCCACTCCATCGGCGAGGTCGCGGCGGCGCACGTCGCGGGGGTCCTCGACCTCCCCGATGCCTGCCGCCTCGTCGCGGCCCGCGCCGGGCTGATGGGACGGCTCCCGTCCGGCGGCGCGATGACCGCCGTCGAGGCCACCCCGGACGAGCTCGCGGACGACCTCGAACGGCTCGGCGGCGGCCTGGTCGCGGTCGCCGCGCTCAACACCCCCGGCAGCACGGTGATCTCCGGCCCGGCCGAGCCCGTCGCCCGGATCGGCGCGCGGTGGGCCGAGCGCGGCCGCAGGACCCGTTCGCTCACCGTCAGCCACGCCTTCCACTCGCCGCTGATGGACCCCGTCCTGGACGAGTTCGCGACGGCCATCGACGGCCTCGCCTACCGCGAACCGGCCATCCCGCTGATCAGCAACCTCACCGGGCTGCCCGCCGACGAGCGCATCACCACGCCCGCCTACTGGGCCGAGCACATCCGCCGCCCCGTACGGTTCCACCCCGCGGTCGCGCACGTCGCCGCCGACGCCGCCGCGTTCCTCGAACTCGGCCCCGACCCCGTCCTCGGCACCGCGACCCGGCGCACCCTCGACACCCTCGATGCCGGCGGGGACCCGGCCGGCGGGCCACCGGTCCTGGCGACCCTCACACGGAAGCAGCCGGAGATCCGCTCCCTCGCGAACGCCCTGGCCGGCCTCCACGTCACCGGCGTCCCGGTGGACTGGGCCGCGGCCTGGTTCCCCGCCGAGCCCGCCCCGCGCGCCGTGGACCTGCCCACGTACGCGTTCCAGCGGGAACGGTTCTGGCTCTCCGCCGACGCCGGAGGCGAGGACGCCGCCGGACTCGGCCTGACCCCCGCCGGGCACCCGCTGCTCGGCGCGGCGATGGACCTGGCGGGCGGCGGCGGCCACGTGCTGACCGGCCGCCTGACGCGCCGGTCCGGGGCGTGGCTCGCGGAGCACGTGGTCGCCGGGACGACCCTCCTGCCCGGCGCGGGCGTCGTCGAATGGGCGCTCCGCGCGGCCGACGAGGTCGGCTGCGGCGGAGTGGACGAGCTGACGCTCCGCGCCCCGTTCGTCCTGCCCGACACAGGCGGACTGCGCGTCCAGGTCGTGATCGGCGCGCCCGGTGAGGACGGCCGCCGCGACCTGCACGTCTACTCCCGCCTCGACCAGGACGGCGAGGCCGCGGAATGGCTGTGCAACGCCGAAGGCGTCCTCACCCCGGAAACCCCCGCCGACCCCGGACCGGAACCGGGCGGCCCCTGGCCCCCGCCGCACGCGGAACCCGTTGACGTCGAGGGCTTCTACGAACGCGCCGCGTCCGGCGGCTACGACTACGGACCGTCGTTCCAGGGCCTGCGGGCGCTGTGGCGCGCGGGCGACGACCTGTTCGCCGAAGTGACCCTCCCCGAAGCGGCCGGCGACGTCGCCGGATACGGCATCCACCCCGCTCTACTGGACGCGGCGCTGCACCCGCTGTTCCTCACGGGCCTGTTCGGCGGAGACCCGTCCGGAAACCGCATCTGGCTGCCGTTCACCTGGAACGACGTGTCGCTGCGGGCCGGCGGGGCGACGGCCGTCCGCGTGCGGCTCTCCCCGAACGAGGACGGCGACACCCTGCGCCTGACCGTCACCGACCCGTCCGCCGCGCCGATCCTCACCGTCGGCTCGCTCGTGATGCGCCCCGCCGACGCCGACCAGCTCCGGGCCGCCGCGCGCCAAGGCGACACACGCGGCCTCTTCACAGTCGACTGGACACCCGTCCCCCTGCCGTCCGCCCCTGCGGACGCGCCCGAACGAGACGACGCAGGCGAATGGGCCGCTCTGGGCCCGGACACGCTACCGCCCGGCCTGGACGCCTACCGCGACCTCCCGACCCTCCTGATGGCCCTCGACGCAGGCCAGCCCCCGCCGTCCCACGTCCTGACCACCATCCCCCCGGAGCCCCCGCCCGGCCGCACCGAAACCGACGTCGCGGCGGACACGCTGCTGCTCCTGCAAGCCTGGCTGGACGAGCCGCGCCTGGCGGATGCGCGCCTGGTGGTGGTGACCCGGGGCGCGGTCGCGTCCGACGTCCCCGATCTGGCCGGTGCAGGGGTGTGGGGCTTGGTACGCAGCGCGCAGGCGGAGAACCCCGGTCGCTTCGTCCTCCTCGACCTCGAAACGGCCCCGGACGGCGATGTCCTCGAACTCGTCCGACGCGCCCTGCACACCGACGAACCGCAACTGGCCGTACGCGGCGGCCAAGCGCTCGTACCCCGGCTGGCCAGAGCGGACCGCCCGGCCGAACTGACCGCCCCCGCGACCCGCGCGGGCTGGCGCCTCGGCGCCGAACGCCCGGGAACCGTGGAGAGCGTCACGATCGTCCCCGCCCCCGAGGCCCTCGAACCGCTCGCCCCCGGACAGGTGAGGATCTCCGTCCGCTCCGCCGGACTCAACTTCCGCGACACCCTCATCGCGCTCGGCATGTACCCAGGCGGCGGCGTCTTCCAAGGCAGCGAGGGCGCCGGCGTGGTGACCGAGGTGGGTCCGGGCGTCGCCTCGGGCCTGTCGGTCGGTGATCGTGTGATGGGTCTGTTCGAGGGAGCGTTCGGCCCGGTGGCGGTGGCGGACGAACGAATGGTCGTTCGTCTTCCGGATGGTTGGGATTTCCGTTCCGGTGCCGCTGTGCCGGTGGTGTTCTTGACCGCTTGGTACGGCTTGGTGGAACTGGCCGGGTTGAAGTCGGGTGAGTCGGTGTTGGTTCATGCGGCGACGGGTGGTGTGGGTTCGGCGGCGGTGCAGATCGCCCGCCATCTGGGTGCGGAGGTCTTCGCCACCGCCAGCCCCGGCAAGCACGCCGTCCTTGAGTCGATGGGTATCGATGCGGCCCACCGCGCCAGTTCCCGCGACTTGGACTTCGAGCGGACGTTCCGCGACGCGACGGGTGGTCGTGGTGTTGATGTGGTGCTCAATAGTTTGGCGGGGGAGTTCACCGATGCGTCGTTGAGGTTGGTGGCTGAGGGTGGTCGGTTCCTGGAGATGGGCAAGACCGACATCCGCGAACCCGAACACATCGCCGCGGAGTACCCTGGCCTCTCCTACCGGGCCTTCGACCTCGTCACCGACGCCGGACCCGACCACGTCGGGGAGATGCTGGCACGGCTCACCGAACTGTTCAGCGCCGACACCCTACGGCCGCCGCACGTTGAGGCATGGCCACTGGCCCGCGCTCGCGAGGCTTTGCGGGTGATGAGTCAGGCCAAGCACACCGGCAAGCTCGTTCTGGACATCCCTCCGACTTTGGACCCGGACGGCACCGTGCTGATCACCGGCGGCACGGGAACACTGGGCGGTCTGGTCGCCGAACATGTGGTCGGGACGTGGGGTGTCAAGCGGTTGGTGCTGGCCAGTCGTCGTGGCCTGCAAGCCCCGGGCGCCGACGAACTGGTCGAGCGCCTCACCCGAGAGTTGGGCGCGCAGGTACGGGTCGTCGCGGCCGATGTGTCCGACGCCGACGCCGTCACCGACCTCATCACCGCGACCGACGCCGACCCCGCGCACCCCCTCACCGGCCTCATACACACCGCCGGACTCATCGACGACGCGGTGATCGGCTCCCAGACACCCCAACGCCTCCACCACGTCTGGAACGCCAAAGCCACCGCCGCACACAACCTCCACACCGCCACCCGACACCTCCCCCTCGCCGCGTTCGTCATCTTCTCCTCGGCCGCCGGGACGATCGGGAGCGCCGGGCAGGCCAACTACGCCGCCGCCAACGCGTACTGCGACGCGCTCGCGGTTCGGCGGCGTTCGCTCGGGCTGCCGGGGGTGTCGGTGGCGTGGGGGCTGTGGGAGTCCAGCAGCGGGATCACCGGCCACCTGGGCGACGCCGACCTCGCGCGGATGGGCCGCTCCGGCTTCCGGCCGCTCACGGACCGGCACGGCCTGGCCCTGCTGGACGCGGCGAGCAGGCACGGCGCCGCTCATCTCGTCGCGGTGGACCTCAGGACGAGCGCGCTGGCGGGTCAGCCCGCGGAGAACGTTCCGCCGATGCTGCGGACGCTCGCGGCGACCGCCTCGGCCGGGGGCGCGGCGGCGCGGCCGACGGCGGGCGGTGTGCGGCGGCCCGCCGACCTGGCCGCCCGGCTGGCGGCGCAGTCCCGGACGGAGCAGGAACGCACGCTCCTCGGCCTCGTCCGGACGCAGGCCGCCACCGTCCTCGGCCATCCGAACCCCGACGCCATCCGTCCCGACAACTCGTTCAAGGAGCTGGGCTTTGACTCCCTCACCGCCGTCGAACTGCGCAATCGGCTCGCCGCGAGCACGGGGCTGCGCCTGCCCGCCTCGCTCGTCTTCGACTATCCGGAGGCGGCGGTCCTCGCCGACCATCTGCGACGCAGGCTCGCGCCGAACGGCGAGAGCGCGCCGGCGCAGGACGCCGTCGATCCGCTGCTCGGGGAGCTGGGGCGGATCGAGAGCGCGCTGGCCGCGCTCGCCCTGGACGACGAGGCGCGGAGCAGGGTCGCCAGGCGGCTCAACGGGCTGCTTTCGGCGGTGAACGGCGGTTCGAACGCTTCGGCGGAGACGGGCGGGTTCGACGACGTCGAGTCCGCGTCCGACGACGAGATCTTCGAGCTGATCGACAGGGAGCTCTGAGCGTGCGCATGGAGGGCGGGAACTGATGTCGGAGACCGAGGAGAAGCTCCGCCAATACCTGAAGCGGGTCACGGTCGACCTCGGGCAGACCCGCCAGCGCCTCCGCGAGGCGGAGGAGCGGGGCCGGGAGCCGGTGGCCGTGGTCGGCATGGCGTGCCGCTTCCCGGGCGGCGTCTCGTCCCCCGAAGAGCTGTGGGAGCTGGTCGACTCGGGCGGCGACGCGATCGGCGCGTTCCCGGACGACCGCGGCTGGAACCTGGAAGCCCTTTACAACCCCGACCCCGATCAGCCCGGCGGCAGCTACGTCCGCGAGGGCGGTTTCCTGTACGACGCGGACCTCTTCGACGCGGGGTTCTTCGGCATCAGCCCGCGCGAGGCGCTGGCGATGGAACCGCAGCAGCGGCTGCTGCTGGAGACGTCATGGGAACTGCTGGAACGGGCCGGCATCGAACCGGGCTCGTTGAAGGGCAGCGCCACCGGCGTGTACGCCGGAGCGGGCCTTCCCGGTTTCGGCACCCCGCACATCGAGGAGAGCGCCGAAGGGCACCTGGTGACCGGCAACGCGCTCAGCGTCCTGTCGGGACGCGTCGCGTTCACGCTCGGCCTGGAGGGCCCGGCGGTGTCGGTGGACACCGCGTGCTCGTCGTCGCTCGTCGCCATCCACCTCGCCTGCCAGGCACTGCGACAGGGCGACTGCTCCATGGCGCTCGCGGGCGGCGTGACGGTGATGGCCGTACCGGCCATGTTCACCGAGTTCTCGCGCCAGCGCGGCCTGGCCCCGGACGGCCGATGCAAGCCTTTCGCCGAAGCCGCCGACGGCACCGTCTTCTCCGAAGGCGTGGGCCTGGTGCTGCTGGAACGCTTGTCGGACGCGCTGCGCAATGGCCGCCGGATCCTGGGTGTGGTGCGGGGCTCGGCGGTCAACCAGGACGGCGCGAGCAACGGTCTCACGGCCCCGAACGGCCCGTCGCAGGAGCGGGTGATCCGGCAGGCGCTGGCGTACGCGGGTTTGTCGGGCGTCGAGGTGGACGCGGTGGAAGCCCACGGCACCGGGACCACGCTGGGCGACCCGATCGAGGCGGGCGCGCTGCTCGCGACCTACGGACAGGGCCGCCCGGACGACCGGCCGCTGTGGCTGGGTTCGGTGAAGTCGAACATCGGGCACGCGCAGGGCGCGGCGGGCGTGGCCGGCGTCATCAAGATGCTCATGGCCATGCGGAACGGCCGGCTCCCCGCCACCCTCCGCGTGGACCGCCCCTCGTCGCACGTCGACTGGGAATCCGGCGCGGTGAGCCTGCTGACCGAACCCGTCGAATGGCCGCGCGAGGACCGTCCCCGCCGCGCGGGCGTATCGGCGTTCGGCATGTCCGGCACGAACGCCCACCTGATCCTCGAAGAACCCCCCGAGCCGACCGAACCCGACGACGTGCCCGAACGACCTGCGCCGATGGACGTCGGCCCGCTCCCGTGGGTGCTGTCGGCGAGAAGCGAAGAGGCGTTGCGGGGGCAGGCGGCGGCGCTGGCGTCCCAGGTGCGTGTGTCGGATGACGCGCCGTTGGATGTGGGTTGGTCGCTGGCGGCGTCCCGTTCGAATTTCGAGCGTCGTGCGGTGGTCGTCGGGACCGGACGCGATGAATTGCTGGCCGGTTTGGAAGCTGTGGCGAGCGGTGAAGTGCGTTCTGTTGCGCCTTCTGGAGTAGGTGCGGGTCCGGTGTTGGTTTTTCCGGGTCAGGGTTCGCAGTGGGCCGGTATGGGCGCGGAGCTGTTGGATTCCTCGCCGGTGTTCGCGGAGCGTATTGCTGAGTGCGAGCGTGCCCTCGCTCCGTTCGTGGACTGGTCCCTGTCGGACGTCCTGCGTGGCGATGGCGCGGAGTTGGCGCGCGTTGATGTCGTTCAGCCCGTGCTGTGGGCGACGATGGTGTCTCTGGCGGCTGTGTGGGCGTCGTACGGTGTCGTTCCGTCCGCGGTGATCGGTCATTCGCAAGGGGAGATCGCTGCCGCGTGTGTGGCCGGGGCGCTGTCGTTGGAGGACGGCGCCAGGATCGTTGCGTTGCGGAGTCGGGCGTTGCGGCGGTTGGCCGGTGGCGGTGCGATGGCTTCGCTTGGTGTCGGTGTGGACGCCGCGCAGGAGTTTCTGCCTGAGGGTGTTGCGGTGGCGGCGGTGAACGGGCCGTCGTCGACGGTGGTGTCGGGCCCGCCCGAGCAGGTCACCGCAGCGGTGGCCCGTACCGAGGCTGCTGGGCATCGGGCGCGGTTGATCGATGTTGATTACGCCTCGCATGGTCCGCAGGTCGATGAGATCACCGACGAACTGCATGAAGTCCTCGCAGGCATCGAACCCGTACCGACAACAGAAGTGGCGTTCTATTCCACGGTCCGTGCCGCTGCGATCGACACCGCGACCCTGGACACCGGTTATTGGGTGACCAATCTCCGGGAGCGCGTGCGGTTCGCCGAGACCATTGACGCGCTTCTCGAAGACGGGCACCGGGTCTTCATCGAGGCCAGTCCCCATCCCGTCCTGACGGTCGGGATGGAGGAGACCTTTGAACAGGCCGGGGTCGATGGCACGACCGTTCCCACGCTTCGTCGTGATCACGGTGATCGCAAGCAGCTCGCGGATGCGCTCGGGCAGGCATTCTCGGCGGGTGTGGAGGTCGATTGGACCGGGTGGTTCCCGGCGGAGTGGGCTCCGCGGCTTGTCGAACTGCCTACGTATGCCTTCCAGCGGCAGCGGTACTGGATCGACGCGCAGGGTGGCCGGGGAGGCGATGCCGCGAGTCTCGGGCTGGCGTCCGCCGGGCATCGGCTGCTGGGCGCGGCGATCGAGCAGGCGGAAGGGGACGTCCGCCTTCTGACCGGACGGCTGTCCGCGACGGGCGACGCGAGCTGGCTGGCCGATCACCAGGTCGCGGGAACGGTCGTCGTGCCCGGCGCGGCGCTGCTGGAGTGGGCGTTGCGCGCGGCGGACGAGGCCGGGTGCACGGGTGTGGAGGACCTGGTCCTCCGTTCGCCGCTGGTCCTTCCCGCGTCCGGGAGCGTACGGGTGCAGGTGGTCGTGGGCGCGGCCGGGGAGGACGGGCGCCGCGACGTGCGGGTCTACTCCAAGCCCGGCCAGGACGGCGACTTCGATGACGACGCCGGGTGGACGTGCCACGCGGAAGGAACGCTCGGGCCCGAGACCGGGGACGCCGCCGAGGGCTTGGCCGGGGCCTGGCCTCCGCCGGGCGCGCAGCCCGTGGACGTGAGCGGGTTCTACGACTTCGCGGCGGCGTCCGGGTACGAGTACGGTCCGGCGTTCCAGGGCCTCCGGGCGGCCTGGCGGAACGGGGCGGACGTGCTCGCGGAAGTGGAGTTGCCGGAGGCCGCCGGTGAGCCGGGCGGGTTCGGTGTCCACCCCGCGCTGCTGGACGCCGCCCTTCACGCGGGCCTGCTGGCCGACGGACACGGCGACGACGGGGACGTGTGGCTGCCGTTCGCGTGGAGCGGGGTTTCGTTGTGGGCCACTGACGCGACGGCGGTCCGCGTTCTGCTGTCTCCGCTTCCGGAGGGCGAACGCGGTGTGCGTGTCGTCATGGCCGACGCCGTTGGCGCACCGGTTCTGACGGCCGACGCGGTGGCGTTGCGGCCGATCGACACCGAACGGCTGCGCGCCGAGGACGGTCGCGGTCCTGACGGCCTGTTCGTGCTCGACTGGGTCGCGGTGCAGGAGTCCGCGCCGTCCGCCGAGGCCAACGAACCGGGCGCGCTGTGCGGGCTCGGCGACTTCGATGACTCCGTGCTGCCGTCCGTCCCCGATGTGGACGCGCTGCTGGCACGCCTCGACACCGAGGATCTTCCGACTCCGTTGACGGTGCTGACTGTTCTCCCGCCCGAGGGCGAGGGCGGTGGCCTCGACCTGACCGGGCGCGTGCTCGCTCTGCTGCAAAGGTGGCTGAACGAACCGCGCCTGGACGACGTCCGGCTGGCGCTGGTGACGCAGGGCGCCGTCGCGGTCGAGAACCCGGACGTGGCGGGTGCGGCGGTGTGGGGGCTCGTCCGCAGCGCGCAATCCGAGGAACCGGGCAGGTTCGTTCTGGTCGATGCCGGTCTCCCGGATCCCGGCGCCGCGTTCGCCCAGGACGCCGCCGACGCGGTACGGCGCGCGGTGGAGGCGAACGAGCCGCAGGCGGCCGTGCGTGCGGGGCAGTTGCTCGCGCCACGTCTCGCGAGGACCGACACCCGCGCCCCCGACCCTGCCTACGCGGGCGATCGCGAGGGCGGTCATGGGCTGCCGGTCCTCGATCCTGACGGGGCCGTGCTGGTCACCGGGGGGACGGGCAAGGTCGGCGGCCTGGTCGCCGAGCATCTCGTTCGCGCGTGGGGGGTCACGCACCTCGTGCTGGCCAGCCGCCGGGGCTCCGACGCGCCCGGCGCATCGGAGCTGGAGAGCCGCCTGGCCTCGCTGGGCGCGCACGTTCAGGTGGTCGCGGCCGACCTGGCGGACGCGTCGGCCGTTGACGAACTGGTCGGACGGATCGCCGCCGAACGCCCGCTCACCGGCGTCGTGCACGCGGCCGGCGTCCTGGACGACGCGGTGATCGCCTCGCAGTCGCCGGAGCGGCTGGCGGGGGTGTGGGCGGCCAAGGCCGCCGCCGCGTACAACCTGCACGTGGCGACGGCGGACCTGCCGCTGGGCCTGTTCGTGCTGTTCTCCTCGGCCGCGTCGTGCCTGGGCAGCCCGGGACAGGGCAACTACGCCGCCGCCAACGCGTTCTGCGACGCTCTCGCCGCGCACCGCCGGGCGTCCGGCCTGCCGGGCGTGTCCGTCGCGTGGGGACTGTGGGCGGACGCCAGCGGCATGACCGGGCACATGACCGAAACCGACCTCGCCCGGGTGAGCCGTTCGGGGAACGTCCCGCTCACCAGCGACGAGGCACTGGAACTCTTCGATGCCGCCTGCGGGCACGGCGGCGCCTACCTGCTCGCCCTCCACCTGAACGCGAGCGCCCTCGCCTCCGCCGACCTGCTTCCTCCTCCGCTGCGGGCCTTTGGCGCCCGCCGTGATCGCGGCGGGCGCGTCCGGCGCGCCGCCGCCGCAGGAGGCCGCCAAGGATCGGAGCTGCTCTCGCGGCTCGCCGGACTCGACGCGAACGGCCGGGCCGACGCGGCGCTGGAGGTCGTCTCCGAGTGCGCCGCCGGAGTCCTCGGGCACCGTTCGCCCGACGGCGTACGGGCCGAGGCGAACTTCAAGGACCTCGGGTTCGACTCGTTGATGGCGGTCGAGCTGCGGAACCGCCTGTCGGTGATCTCCGGCGTCCGGCTGCCGGCGACACTGGTGTTCGACTACCCGACCCCGCACGCCCTGGCCGGATACCTGTGTGCGCGCCTGGACGGGCCGGCGACGTCCGTGGCCGGTCCGTCCGCCGGTCCGACGACGGCCACGGCGGACGACCCGGTCGCGATCGTCGCGATGGCGTGCCGGTTCCCCGGCGGCGTCGTCTCCGCCGAGTCCCTCTGGGAGATGGTGGCCGCGGGCGGCGACGCGATCGGCGCGTTCCCCCAGGACCGCGGCTGGGATCTGGACGGGTTGTTCCACCCGGACCCCGACCACCCCGGCACCAGCTACGCCCGCGAGGGCGGCTTCGTCCACGACGCGGCGGAATTCGACGCGGAGTTCTTCGGCGTCAATCCGCGTGAGGCGTTGGCGATGGATCCGCAGCAGCGTTTGTTGTTGGAGGTGTCGTGGGAGTTGTTGGAGCGTGCGGGGCTTGATCCGGTGTCGTTGAAGGGCAGCCGGACGGGCGTGTACGCGGGTGTGATGTACCACGACTACGGGGCCGGGATCGCGGCCGGCGACTCCCGCCTTGAGGGCTACGCGCTGATGTCCAGCTCGGGCAGCGTCGTCTCCGGCCGCGTCGCGTACACCCTGGGCTTCGAGGGCCCGGCGGTGACGGTGGACACGGCGTGCTCGTCGTCCCTGGTCGCGATGCACCTGGCCGCGCAGGCGCTGCGCCAGGGCGAATGCGACCTGGCTTTGGCCGGTGGGGTGACCGTGATGGCCACCCCGGACGTGTTCACGGGTTTCTCGCGTCAGCGCGGCCTGGCTCCGGACGGCCGCTGCAAGCCGTTCGCCGCGGCAGCGGACGGGACGGGTTGGGGCGAGGGCGTTGGGCTGGTGCTGTTGGAGCGCTTGTCGGACGCGCGCCGCCGTGGGCATGAGGTGTTGGCGGTGCTGCGTGGTTCGGCGGTCAATCAGGATGGTGCGAGTAATGGTTTGACGGCTCCGAATGGCCCGTCGCAGGAGCGTGTGATCCGTCAGGCGTTGGCGTCTGCGGGGTTGTCGTCGTCGGATGTGGATGCGGTGGAGGCGCATGGGACGGGGACGACGCTGGGTGATCCGATCGAGGCGGGCGCGTTGCTGGCGACCTATGGGCGGGATCGTTCGGGCGATCGGCCGTTGTGGTTGGGGTCGGTGAAGTCGAATATCGGTCATACGCAGGCGGCTGCTGGTGTGGCTGGTGTGATCAAGACGGTCATGGCGATGCGGAACGGTTCGCTTCCGGCGTCCCTGCATATCGATGAACCAAGCCCGCACGTGGATTGGGAGTCGGGCGCGGTACGTCTGTTGACCGAGCCGATGGAATGGCCGACCGAAGATGACCGTCCGCGTCGGGCTGGGGTGTCGTCATTCGGTGCTTCGGGAACGAACGCACACCTCATCCTGGAACAAGCGCCCGAACCGAAGAACGAACCGGAGGACGAATCCGAGCCGGTGCCGTTCGACGGTGTCCTGCCGTGGGTTCTGTCGGCGCACAATGCCGACGCCTTGCGCGGTCAGGCCGCCGCGCTGGCGGCGCACGTGTCCGCCACGGATGACACGTCCGTCGAAGTGGGCTGGTCGCTGCTGAAGAGCAGGTCGCTGCTCGACCAGCGGGCCGTTGTCCTTGGAGAGGACCGTGCTCAACTGCTCGCGGGTGTCGAGGCATTGGCGGCGGGCGAGCCGCATCCGGCGCTGGTGCGAACCGGTCCCGCCGCTACTGCGAGTGGAAAGTCCGTGTTCCTGTTCAGTGGGCAGGGGAGCCAGCGGGTTGGGATGGGTGCCGGGTTGTATGAGCGGTTCCCTGTGTTCGCGGGTGCGTTCGATGAGGTGTGCGGGCTGCTGGATGAGCATTTGGAGCGTCCGTTGAAGGACGTGGTGTTCACTGGATTGCCCGATCAGCCTGAGCTTTTGGATCACACCACGTATGCGCAGACGGGTTTGTTTGCGTTGCAGGTGGGTTTGGCGCGGTTGTTGGCGTCGTTGGGTGTGGTGCCTGATGTGGTTGTCGGGCATTCGGTGGGTGAGATCGCCGCCGCGCATGTGGCCGGAGTATTCGGTCTGGAGGATGCGTGCCGTTTGGTGGCGGCGCGTGCGACGTTGATGGGTGGGTTGCCGTCGGGTGGGGCCATGGCCGCCATTCAGGCGACCGCCGACGAACTCACCGACGACCTCGCACGTTTCGCCGGACGAGTGGCCATCGCCGCGCTCAACACCCCGGACAGCACGGTCGTCTCCGGACCGGCCGACCTGGTCTCCGAGGTGGCCGCCGCGTGGAAGGAACGGGGCCGCAAGACCAAGGCGCTCGCGGTCAGCCATGCGTTCCATTCGCCGATGATGGACCCGATCCTGGACGAGTTCACCGACGCGATCAGCGATCTGACCTACCGGCGGCCGAACGTGCCGCTGATCAGCAACCTCAGCGGTCTCCCGGCGGACGACCAGATCACCACGCCGGAGTACTGGGCCACGCACATCCGCCGGCCGGTCCGCTTCGCGCCCGCCATCGCTCACCTCGCCCCGCAGGCGGGCGTGTTCCTGGAACTGGGGCCTGATCCGGTACTGGCGACCGCCGCCCAGCACACCCTTCACCACGCCATGGACGAGCACGAGGACGGCGGGACGGATGAGGCCGGTCGGCCCGTTCCCCCGACGCTCTCCACGCTCAATGGCAAGCAGGACGATCCGCACGCGTTCGCGCACGCCATGGCGAGGCTCCACACTTCGGGCGCGGGCGTGGACTGGACGGACTGGTTCCCCGCCAAGCCCGCGCCTCGGGTCGTCGCGCTGCCCACCTATGCGTTCCAGCGCGAGCGGTACTGGCTGGCGCCCGGTCGCGGAGCCGGGGACGTGGCGGCGGTCGGCCTGCGGCGGGTCGAGCACGCGGTGCTGCCCGCGGCCGTCCCGCTCGCGGACGGCGGTCTGCTGCTGACCGGCCGGATGTCGGCGTCCGGCGGCTGGGCGTCCGACCACCGCATGTTGGGAACGACGCTGGTGCCGGGAGCGGCATTGGCGGAGTGGGCGTTGCGGGCCGCCGACGAGACCGGCTGCGGCGCGGTCGAGGAACTGGTGCTGCAAGCGCCGCTCGTACTCCCGCCGACCGGCGCCGCGCAGATCCAGGTCGTGGCCGACGCGCCCGGGTCCGATGGACGCCGCGAGCTGCGTGTCTACTCCCGTCCGGAACACGACGACGGTTCCGTCACCGAGGCGCCGTGGGTCTGCCATGCGATGGGCGTCCTCACGCCCGAGGCGCGGGACGTTCCGGAAGGGCTCGACGGTGCGTGGCCACCTCCCGGTGCGGAACCGGTGGACGTGACCGCCTTCTACGAACGAATGGCCGCGACGGGCTACGAGTACGGACCGTCGTTCCGCGGACTGCGCGCCGCGTGGCGCGACGGGCCCGACATCCTGGCGGAGGTGCAGTTGCCGGAGGCCGCCGGAGACCGCGCCGGCTTCGGCGTCCATCCCGCGCTTCTCGACGCCGTCCTGCACGCCGGCCTGTTGCCGGACGGAGAGGGCGAAGGGGAGCAGGACGGCCGGGTCTGGCTGCCGTTCGCCTGGAACGGGATCTCGCTGTGGGCGAGTGAGGCTGCCACGGTCCGCGTCCGGCTCGCGCCCGTCCACGACGGCGACGGCGAGGGCCGGGACGAACGCGGGGTGCGCGTCACCGTCGCCGACAGCGCGGGCGCGGCGGTCCTGACCGTTGACTCGGTCGTCACACGCCCTGCGGACATCGACAAGCTGAAGGCCGCCGGTGAACCCGACGCGGACGGCCTGTTCGCGCTCGAATGGACGCCCCTGCCCGCTCCGGCGCTCAGTGACCCGACGCTCGGCGACGACGACTGGGCCGTCCTCGGAACGGACGCGTTCGGCCCGGCCGATTGCGCCCGGTTCCCGGACCTGGCGGCACTGTCGGCGGCGCTCGATGCGGACCGGCCGTCCCCGACGCTCGTGCTGGCCGACGCCTCGTACGGCGAGAGCGACGACCTCGACGCGGTCGGGAACGTCCTGTCGCTTCTCCAGGCGTGGCTGGACGAGCCGCGTTTGGCGGATGCGCGCCTGGTGGTGGTGACCCGGGGCGCGGTCGCGTCGGACGTTCCTGATCTGGCCGGTGCAGGGGTGTGGGGCCTGGTCCGCAGCGCGCAGGCGGAGAACCCCGGCCGTTTCGTTCTCCTCGACCTCGAAGCGGACACGTTCCTCTCGACCGGTCTGGCAGACGCCGTGGCGCGTGCCGCCGAGGCCCGCGAGCCGCAGCTCGCCGTCCGGTCGGGCGAACTGCTGATCCCCCGGCTGGTGCGCGCGAGCGAACGGCAAGGCATCGCGGCCCCCACCGGCGAACGCGCCTGGCGGCTGGCCGTCGCCACCCCCGGCACCGTCGAGAACGTGGCACCGGAACCGTTCCCGAAAGCCCTCGAACCGCTGGCCGCCGGCCAGGTCCGCATCGAGGTGCGCGCGGGCGGCATCAACTTCCGCGACGTCCTCGTGACCCTGGGCATGGTGCCCGGTCAGACCGGCCTCGGCGGCGAGGGCGCCGGCGTGGTGACCGAGGTGGGGCCGGGCGTCTTCTCGGGCCTGTCGGTCGGTGATCGTGTGATGGGTCTGTTCGAGGGCGCGTTCGGCCCGGTGACCGTGGCGGACGAACGAATGGTGTCCCCGATCCCCGAGGACTGGGACTTCCGCAGCGCGGCAGCCGTCCCGATCGTCTTCCTGACCGCTTGGTACGGCCTGGTGGAGTTGGCCGGGTTGAAGTCGGGTGAGTCGGTGTTGGTTCATGCGGCGACGGGTGGTGTGGGTTCGGCGGCGGTGCAGATCGCTCGCCATTTGGGTGCGGAGGTTTTCGCCACGGCCAGCCCTGGTAAGCACGCTGTTCTGGAGGCGATGGGTATCGACGCGGCCCATCGCGCCAGTTCCCGCGATCTGGAGTTCGAGCGGACGTTCCGTGATGCGACGGGTGGTCGTGGTGTTGATGTGGTGCTCAATAGTTTGGCGGGGGAGTTCACCGATGCGTCGTTGAGGTTGGTGGCTGAGGGTGGCCGGTTCCTGGAGATGGGCAAGACCGACATCCGCGAACCCGAACACGTCGCCGCAGAGCATCCGGCGGTCGGTTATCAGGTGTTCGATCTGGTGACGGACGCCGGGCCGGTTCGGATCAGTGAAATGTTCCGGGCGTTGGCGGAGCTGTTCGCCGAACAGGTATTGCGGCCTGCTCCGACTCAAGCATGGCCGCTGGCTCGCGCGCGTGAGGCGTTGCGGGTGATGAGTCAGGCCAAGCACACCGGCAAGCTCGTTCTGGACATTCCCCCGGCCCTGGACCCGGACGGGACGGTGCTGATCACCGGCGGCACGGGAACGCTCGGCGGTCTGGTCGCTGAACATGTGGTCGGGACGTGGGGTGTGAAGCGGCTGGTGCTGGCCAGTCGTCGTGGCCTGCAAGCCCCGGGCGCGGATGAGCTGGTCGAGCGTCTCACCCGAGAGTTGGGTGCTGAGGTGCGGGTGGTCGCGGCGGATGTGTCCGACGCCGACGCCGTCACCGACCTGATCACCGCGACCGACGCAGACCCCGCGCACCCCCTCACCGGCCTCGTGCACACCGCCGGACTCATCGACGACGCGGTGATCGGCTCCCAGACACCGGAACGCCTCCACCACGTGTGGGACGCCAAGGCCACCGCCGCACACAACCTCCACACCGCCACTCGACACCTCCCCCTCGCCGCGTTCGTCATCTTCTCTTCGAGTGCGGCGATGCTCGGGAGTCCGGGGCAGGGCAACTATGCCGCCGCCAACGCGTACTGCGACGCGCTCGCCGCACGTCGTCAGGCCGCAGGGCTTCCGGCCGTGTCCGTCGGATGGGGGCTGTGGGAGGCCACCAGCGCGATGACCGGGAGTCTCACCGAAACCGATCGCGCCCGGATGAGCCGTTCGGGTGTCGGCGCGCTGCCGAATGAGCACGGCTTGGCGTTGCTGGACGCGGCGCTCAGGCACGGCGACGCGTCGCTGCTCGCCGCGAACCTCGATCTGCGCGCGCTCGCGGTCCAGCCCGCCGAGTCCCTGCCGGGTGCGCTGCGCGCCCTCGTGTCCGGCGGCGGCGAACGTGTCAGGCGTACCGCGGCCACGGGAAGCGCGCGGCCGGACGACTGGGCCGGGCGGCTGGCCGGAATGTCCGCGGCCGAGCAGCAGCGCGCCATGCTCAACCTGGTGCGCGGTCACGTCGCGACCGTGCTGGGACGTACCGACGCCGACTCCGTACGGGGCGACGCGTCGTTCAAGGAACTCGGGTTCGACTCGCTGACCGGGGTGGAACTGCGGAACCGGCTGGCGGACGCGACGGGCCTGCGCCTGCCTCCCGCCCTGGTGTTCGACTACCCGCAGGCCGGGGTCCTCGCCGAGCACCTGCGCACGCGGCTCGTACCGGAGGGCGCTGACGCCTCGGCCGCCGGCGCGGGCGTCGAGCCCGTCCTGGACGACCTGGCCAGGCTCGAAAGCACCCTGAACGCCGCCGCCGCTCTGGAGGACGGCGATTCCGACGCGGTCACGGCCCGGCTGGAAAGCCTCCTCACCCACTGGAGAGCCGTACGGAGCGGCCGGGCGGCGAACGGACACCACGGGCGCAACGGCCACGCCGAGCCGAGTGGGAACGACGGGCAGAACGGCCACGACGGGCAGGACGGGGAGGACGGCACGGCCGTTGACCGGCTGGAGTCGGCGTCCGCGGACCAGGTTCTCGACTTCATCGACAACGAATTGGGCGTGTCCTGGAACGGGCCGGACGCGACCACGACGCGATAGGGCTGGGTGACTCCGCATGGCGACCGAAGAACGGCTGGTTGAGTACCTCAAGCGCGTCACCGCCGACCTGCACGACACGCGCCTGCGCCTGCGCCGGATGGAGGAGCGCTCCGAGGAGCCCGTGGCCGTGGTGGGCATGGCCTGCCGGTTCCCCGGCGGCGTGGACTCGCCGGAGTCCCTGTGGGACCTGGTCGCCGCGGGCGCCGACGCGATCGGCGACTTCCCGGACGACCGGGGCTGGGACCTCGACGGCCTGTACCACCCCGACCCCGACCACCCCGGTACGACGTACACGCGCAAGGGCGGCTTCGTCGACGACGCCGACCGGTTCGACGCGGCGTTCTTCGGGATCAGCCCGCGCGAGGCGATGGCGACCAGCCCGCAGCAGCGGCTCCTGCTGGAGACGTCCTGGGAGGCGTTCGAACGGGCGGGCATCGACCCCGTCTCGCTGCGCGGCACCCGGACCGGCGTCTACGCGGGGACGGCGACCACGGGCGCGGCGGCGCACGGCGGCTCGTCCCGCAAGGAGGCGGAGGGGTACGCGGGGAACGCCCCGAGCCTGCTGTCCGGCCGCGTCGCGTACACGTTCGGGCTGGAAGGGCCGGCGGTCACGATCGAGACGGCGTGCTCGTCGGCACTGGTCGCGATGCACTTGGCGGGGCAGGCCCTGCGCCAGGGCGAATGCACCCTCGCCTTAGCGGGCGGCGTGACGGTGATGGCGACGCCAGAGGTGTTCACGGGCTTCTCCCGGCAGCGCGGCCTGTCGCCCGACGGCCGCTGCAAGGCGTTCTCCGCGGCGGCCGACGGCACCGGATGGGGCGAGGGCGTCGGCGTCGTCGTGCTGGAACGACTGTCGGACGCGCGCCGCAACGGCCACCGCGTCCTCGCGGTCATCCGCGGCTCCGCCGTCAACCAGGACGGCGCGAGCAACGGATTCACCGCCCCGAACGGCCCGTCCCAGCAGCGCGTGATCCGCCAGGCCCTCACGCAGGGACGGCTGTCGGCCGCCGAGGTGGACGCGGTCGAGGCGCACGGGACCGGCACGGCCCTCGGCGACCCCATCGAGGCGGACGCGCTCATCGCCGCCTACGGCCGCAACCGCCCGCCCGAACGTCCCCTGTGGCTCGGCTCGGTCAAATCGAACATCGGCCACACCCAGGGCGCGGCCGGCGTCGCGGGCGTCATCAAAATGATCATGGCGATGCGGCGCGGCGTACTCCCCGCCTCCCTGCACATCGACCAGCCGACGCCCCACGTCGAATGGGGCGACGGCGCGGTACGCCTCCTCACCGAACCCGTTCCCTGGCCGGACACCGACCGCCCGCGCCGCGCGGGCGTGTCGTCGTTCGGCATCTCCGGCACGAACGCCCACCTGATCCTGGAACAGGCCCCCGAACCCGTCGAGGCGCCCGAGCCGGAGCCAGAGTCGGGGGCGGTGGTGCCGTGGGTGCTGTCGGGGCGAAGTGAAGAGGCGTTGCGTGGCCAGGCTGCCGCGTTGGCCGCTCGGATGGACGGTTCGGAGGGTTGGGCACCCGTCGATGTGGGTTGGTCGTTGGCGACGACGCGTTCGGTATTCGAGCACCGCGCCGTGGTCGTGGGCTCGGAACGCGACGAACTGCTGGACGGCTTGAAGGCGCTGGCAACGGGAGAAATGCATCCCGGACTCGTCGGCCCCGGACTCGCCGCCCACCCCGACGCCTCCGCCCCCGTTTTCGTTTTTCCGGGTCAGGGTTCGCAGTGGGCCGGTATGGGCGCGGAGCTGCTGGATTCCTCGCCGGTGTTCGCGGAGCGTATTGCTGAGTGTGAGCGTGCGCTGGCTCCGTTCGTTGATTGGTCCCTGTCGGATGTGCTGCGTGGTGATGGCGCGGAGTTGGCGCGCGTTGATGTCGTTCAGCCGGTGCTGTGGGCGACGATGGTCTCTCTCGCTGCTGTGTGGGCGTCGTACGGTGTGGTTCCGTCCGCGGTGATCGGTCATTCGCAAGGGGAGATCGCCGCCGCGTGCGTCGCCGGCGCCCTGTCGCTGGAAGACGGCGCGCGGATCGTGGCGCTGCGGAGTCGCGCGTTGCGGCGGCTCGCCGGTGGCGGCGCGATGGCCTCGCTGGGCGTTGGTGTGGACGCCGCGCAGGAATTCCTGCCTGAGGGTGTTGCGGTGGCGGCGGTGAACGGACCGTCGTCCACGGTGGTGTCCGGCCCGCCCGAGCAGGTCACCGCAGCGGTGGCCCGTACCGAGGCTGCCGGGTATCGGGCGCGGCTCATCGACGTCGATTACGCCTCGCATGGGCCGCAAGTCGATGAGATCACCGACGAACTCCACCAAGTCCTCGCAGGCATCGAACCCGCACCGACAACAGAAGTGGCGTTCTATTCCACGGTCCGTGCCGCAGCGATCGACACCGCCACCCTGGACACCGGCTATTGGGTGACCAATCTCCGGGAACGCGTGCGGTTCGCCGACACCATCGACGCGCTTCTCGAAGACGGACACCGGATCTTCATCGAGGCCAGTCCCCATCCCGTCCTCACGGTCGGGATGGAGGAGACCTTTGAACAGGCCGGAACGGGGGCGGTGGCCGTACCGACGCTGCGCCGCGATCACGGTGGTCGGGAGCAGTTCGTACGCGTCGTCGGGCAGGCGTTCGTTGCCGGGGCGCACGCTGATTGGACGGCGTCGTTCGCGCAAGACCCTGCGCCGCGCACGGTGGACCTGCCGACTTACGCGTTCCAGCGGCAGCGGTTCTGGGCGGACGACGACCCCACCGTTCATTCCGGTGATCCGGCCGGGCTCGGGCTGGTGTCGGTCGGGCACCCGTTGCTGGGTGCCGCCGTGGACGTCGCGGACGAGGACACGCGTCTCCTGACCGGGCGCCTGTCGGCATCCGGCGGCGGTACCGGCTGGCTGAACGATCACCGGATAGGCGACACGGTGCTGGTTCCGGGCGCGGCGCTGGTCGAGTGGGCGTTGCGGGCGGCGGACGACGTCGGGTGCGGGGGAGTGGAGGAGCTGGCGCTCCAGGCGCCGCTGGTCCTGCCCGCGTCCGGGAGCGTACGGGTGCAGGTGGTCGTGGGCGCGGCCGGCGAGGACGGACGGCGTGAGGTGCGCGTCCACTCCGTTCCCGATCTCGGCGGCGACGGGGCGTCCGAGGATGTGCGGTGGGAATGCCATGCCGTCGGCGTTCTCAGCCCGGAGCCGACGGCCACGCCGTCCAACGGGCAGAGCGATGACCTCGGTGGTGTGTGGCCTCCTGCGGGTGCGCAGCCGATCGACCTCGAAGGCTTCTACGAACGCGCCGCGGCCGCGGGCTACGGGTACGGTCCCGCCTTCCAAGGGGTACGGGCCGTGTGGCGCGACGGTTCGGATCTGCTGGCGCACGTCGAGTTGCCCGAGGACGCGGCGGACCCGGCCGGGTTCGGTGTCCATCCGGCCCTGCTGGACGCGGCGCTGCACCCCGCCTTGCTCGGCGGCAGCCCGGAAACCGACTCCGAAGCGGGCGTGCTGCTGCCGTTCGTTTGGAACGGGGTCGCGCTGTGGGCGGCCGAGGCGACCAGCGTCCGGGTCAGGCTCTCGCCGCGCCGTGACGGGGAGCGCGGAGTCCGGGTGGTGATGGCCGATGCCGTGGGCGCGCCCGTTCTGAGCGTTGACGAGGTCGTCATGCGTCCTGCGGACACCGAACGGCTCCGGGCCGCGGGTCGGCGGGGCGCGGGAAACCTGTTCGCCCTGGAATGGGCACCGCTTCCCGCGTCCACCCCGGACGCCGCCGCCGGTCGCGGCGTCGGCGAGGGCGAGTGGGCGGTCGTGGGGGCGGAGCCGGTGGCGACGGATCTCGTTCACCACCCGGACCTGGACGGGTTGGTGGCGGCGCTCGACGGCGGCGCTCCCGTTCCTCCGGTCGTGCTCGCGGCCGTGCCGTCCTCGCCGGACGCGCTGGACATGGTGACGGACACGTTGGCGCTGGTGCAGGCGTGGCTGGCCGAACCGCGGCTGGCCGACGCCCGGTTGGCGCTCGTGACGCGGGGCGCGGTAGCCGCGGGGACGGAGCCCGACCTCGGCGGCGCCGCCGTGTGGGGGCTCGTCCGCAGCGCCCAGACGGAGAACCCCGACCGCTTCGTTCTCGTGGACGTCGATGACCTGGACGACGCCGACGCTGACGCCAGCGCCAACGTTGACCGCGATGCCGCGTGCGGCGTCGCCGAGGCCGTGCGGCGTGCGGTGGAGGCGGATGAGCCGCAGGTCGCGGTGCGCGGCGGACGGCTCCTCGTCCCGCGCCTGGCACGGAACGACGACCTCGGCGAACCCTCCGATCAGAGCGGCGACGGTTCACCGCCGCTCGACCCCGAGGGCACGGTCCTGATCACCGGTGGGACGGGCATGGCGGGGTCGCTGGTCGCCGAGCATCTCGTCCGGGCGTGGGGCGTCAGGCGGCTGCTGCTCGCCGGCCGCCGCGGCCCGGACGCGCCGGGCGCGGCGGAACTGGCCGAACGCGTCCGCGAGCTGGGCGGCGAGGTGCGGTTCGCGGCGGCCGACGTGTCGGACGCGGGCGCGGTCGCAGACCTGGTCGCCGGTGTCGATCCGGCGCATCCGCTGACGGGTGTCGTGCACGCGGCCGGCGTCCTGGACGACGGGGTGATCACTTCGCAGTCTGCGGAGCGGCTGGCGGGGGTGTGGGCGGCCAAGGCCGCCGCCGCGTACAACCTGCACGTGGCGACGGCGGCCCTGCCGCTGGGCCTGTTCGTGCTGTTCTCCTCGGCGGCCGGGGTGGTCGGCAACGCCGGGCAGGCGGGTTACGCCGCGGCCAACGCGTTCGTGGACGCGCTCGCGGAACAACGCCGGACCGCGGGCCTGCCGGGGCTGTCGGTGGCGTGGGGCCTGTGGGAGCAGTCCAGCGCCATGACCGGGCACCTCGCCGAAGCGGACCACGCCCGCCTGGCCCGCTCGGGGTTCTCCGCCCTCCGCACCGAACGCGCGCTCGAACTCCTGGACGCCGCGTGCCGTGCGGAACGTTCACCGGTCGTGGCCGCGGGCATCGACGTGCGCGGCCGGTCGGCCGCCGAGGTCCCGGCCCTGCTGCGGGGCCTCGCCGGGCGGCCGGGCCGCCGAACCGCCGCCAGCGCCGAGGCCGGGACCGGGGTGGACGGCGCGCGAGGCCGCGAGCTGGCGGTGCGCCTCGCCCGGATGGACGAGCCCGAACGGCTCGACGCCGTGCTGGAGACGGTCCGGGGATGCGTCGCGGTCGTTCTCGGGTACGCCTCGCCCGCCGACGTGCGGACCGAGGCGCCGTTCAAGGAACTCGGCCTCGACTCGTTGACCGCCGTGGAACTCCGCAACCGCCTGTCGAGGACGACCGGGCTGAGGCTGCCGGCGACGCTGGTGTTCGACTACCCGACGCCGCGGGCGCTGGCCGACCACCTGTGCGCCAAGCTCGTCGGGGACGCCGCGGCGGCCACCGCGACCGCACCCGTGGCGACGGCCATGACGGACGAGCCGGTGGCGATCGTCGCGATGGCCTGCCGGTTCCCCGGCGGAGCCGCCTCACCGGAAGGACTGTGGGATCTCGTCGCGCGGGGCGTGGACGCGATGGCGGAGTTCCCGGAGGACCGCGGCTGGGATCTGGACGGCCTCTTCCACCCGGACCCCGACCACCCCGGCACCAGCTACGCCCGCGAAGGCGCGTTCCTGGACGACGCGGCCGGATTCGACGCGGGCTTCTTCGGGATCAATCCGCGTGAGGCGTTGGCGATGGATCCGCAGCAGCGTTTGTTGCTGGAGGTGTCGTGGGAGTTGTTGGAGCGTGCGGGGCTTGATCCGGTGTCGTTGAAGGGCAGTCGGACGGGTGTGTACGCGGGTGTGATGTATCACGACTACGCGTCCGGCCTGTCCGACAGCGGCGACGCCAAGCTGGAGGGCTACTCCATGCTGGCCAGCGCGGGCAGCGCGATCTGCGGCCGCGTCGCGTACACCCTGGGCCTTGAGGGCCCGGCGGTGACGGTGGACACGGCGTGCTCGTCGTCGTTGGTGGCGATGCACCTGGCCGCGCAGGCGCTACGCCAGGGCGAATGCTCGCTCGCGCTCGCGGGCGGCGTCACGGTCATGGCCACCCCAGAGGTCTTTACCGGCTTCTCCCGTCAGCGCGGCCTGGCTCCGGACGGCCGCTGCAAGCCGTTCGCCGCGGCAGCGGACGGAACGGGCTGGGGCGAAGGCGCCGGCCTGGTCCTGCTGGAGCGCCTCTCGGACGCCCGCCGTAACGGCCACGAGGTGTTGGCGGTGCTGCGGGGTTCGGCGGTCAATCAGGACGGCGCGAGTAATGGGCTGACCGCTCCGAATGGCCCCTCGCAGGAGCGTGTGATTCGTCAGGCGTTGGCGTCTGCGGGGTTGTCGTCGTCGGATGTGGATGCGGTGGAGGCGCATGGGACGGGGACGACGCTGGGTGATCCGATCGAGGCGGGCGCGTTGTTGGCGACGTATGGGCGGGGTCGTTCGGGTGGTCGGCCGTTGTGGTTGGGGTCGGTGAAGTCGAATATCGGTCATACGCAGGCGGCTGCTGGTGTGGCTGGTGTGATCAAGACGGTGATGGCGATGCGGAACGGTTCGCTTCCGGCGTCTCTGCATATCGATGAGCCGTCCCCGCACGTGGACTGGGAGTCGGGCGCGGTTCGGCTGTTGACCGAGCCGATGGAATGGCCGACCGATGATGACCGTCCGCGTCGGGCTGGGGTGTCGTCGTTCGGCGCTTCGGGAACGAATGCTCACCTCATCCTGGAACAAGCGCCCGAACCGAAGAACGAACCGGAGGACGAGTCGGCGCCCGCTTTGGACGGCGTTGTGGTGCCGTGGGTGCTGTCAGCACGCAGCGAAGAGGCGTTGCGCGGACAGGCCGCCGCGCTGGCGGCGCACGTGTCCGGCACCGATGACGCTCCCGCCGATGTGGGCTGGTCGCTGGTCACGTCCCGGTCCGTGTTCGACCATCGTGCAGTTGTGACCGGCACGGATCGGCAGGAACTGCTGAACGGCCTGGGCGTCCTGGCGAACGGCCACGAAAACCCGGAAGTGGTCCGTGCAGTCGCGAATTCCGCATCCGTCTCCGGAAGCACGGTATTTCTGTTCAGTGGGCAGGGGAGTCAGCGGGTTGGGATGGGGGCCGGGTTGTATGAGCGGTTCCCTGTGTTCGCGGGTGCGTTCGATGAGGTGTGCGCGCTACTGGAACCGCGCTTGGAGCATCCGTTGAAGGACGTGGTGTTCACCGGGATTCCGGAGCGAGATGGCCTGCTCGATCACACCACGTATGCGCAGGCGGGTTTGTTTGCGTTGCAGGTGGGTTTGGCGCGGTTGTTGGCGTCGTTGGGTGTGGTGCCTGACGCGGTTGTCGGGCATTCGGTGGGTGAGATCGCCGCCGCGCATGTGGCCGGAGTATTCGGTCTGGAGGACGCGTGCCGTTTGGTGGCGGCGCGTGCGACGTTGATGGGCGGGCTGCCGTCGGGTGGGGCCATGGCCGCCATTCAGGCGACCGCCGACGAACTCACCGACGACCTCGAACGCTTCGCCGGACGAGTCGCGGTCGCCGCGCTCAACACCCCGGACAGCACGGTCGTCTCCGGGCCCGCCGATGAGGTGGACCAGCTCGTCGGGGTCTGGACCGGCAGGGGTCGCAAGTCGAAGCGGCTCGCGGTCAGCCATGCGTTCCATTCGCCGCTGATGGACCCGATCCTGGACGAGTTCACCGACGCGATCAGCGATCTGACCTACCGGCGGCCGGACGTGCCGCTGATCAGCAACCTCACCGGGGAGCCCGCCGAACCCGAGCACATCGCGACGCCCGGCTACTGGGCGGACCACATCCGGCGGCCGGTGCGCTTCGCGCCCGCCATCGCCCACATCGCCCCCCAGGCGGCGACCTTCCTCGAACTCGGGCCCGACCCCGTACTCGCCACCGCCGCCCAGCACACCATCGGCGACGGGTCCGCCGACCGGCCGCAGCCGCTGGTCACGGCCACCCTCACGCAGAAGCGGTCCGACGAACGAGCCTTCGCGCACGCCCTCGCCCAACTCCACACCCACGGTGCGGACGTTGACTGGAAGGCCCTGTTCGACCTCGATCCCGATTCGCGCCGCCGGCCACGCAACGTGGCGCTGCCCACCTACGCGTTCCAGCACCGGCGGTTCTGGCTGGACGCGTTCAACGGGCGCGGCGGCGATCCGGCCGGACTGGGACTGGCCTCGGCGGATCATGCGCTGCTCGGAGCGGCGGTCGAGCTGGCCGACAGCGGCACCTGGCTACTGACCGGGCGGATCTCGGCGCGTTCGCATCGGTGGCTGGGCGAGCACGTCATGGCCGCGACCCCGCTGGCGCCGGGAGCGATGCTGGTGGAGTGGGCGCTCCACGCGGCCGACGCGGTGGGCTGCGGAGGCGTCGAGGAACTCATGCTGCAAGCACCGCTCGTACTGCCGGAGACGGGCGGCGTCCGCGTCCAGGTCGTGGTGGGGCCCGCCGACCCGGACGGACGGCGCGAACTCCAGATCTACTCGCGGCCCGACGAGAACGACGACGACGGCCGTGCGAACGCGGCGTGGGTCTGCCACGCGCAAGGCGTGCTGAGCCCCGCCGAACCCGCCCCCGGGCAGCTGAACGGGTCGCACGGGCCGGACGTGGTGGGCGGGGCATGGCCGCCGCCCGGCGCGCGGCCCGTCGCGATCGACGGCCTCTACGAACGGGCCGCGTCGGCGGGCTACGAGTACGGCCCGGCATTCCAAGGCGTTCGGGCCGTTTGGCAGCACGGCGCGGACGTATTGGCCGAGGTGGAACTGCCCGAGGCCGCCGGAGAGAAGAAGGGCTTCGGCATCCACCCGATCCTTCTGGACGCCGCTCTCCACCCGCTCCTGCTCAGCATGGACGACGCGGGGAGCGGTGCGGCCTGGCTGCCGTTCGCATGGAACGGAGTGACGCTGTGGGCGAGCGAGGCCGGAACGGTGCGCGTGCGGCTGACCCCGCAGGACGGAGCCGAGGGCGACCGCACCGTACGGGTCGCGATCGGTGACACGGTGGGCGCACCGGTCCTCACCGCGGACTCGGTCGTGATGCGCGAGGCCAGCACGGAACACCTCCGCGCGGCCGCCGGAACGAACCGGGACGGCAGCGACACCGACGGACTGTTCACCACACGCTGGACTCCGCTGGACACCGCCGCCACTGGTGACACCGGCAACGGCGACGACGGCGACGACGGCGCGTGGGCGTTCCTCGGCAGCGAGCACATGCGCCCGCAGCAGAACGGCGAGACCCGTCCCGAATGGCTGTGGCACGACGATCTGGCGGCGCTCGTAACGGCGATCGAGGCCGGCGCGCCGGTTCCGCCCGTCGTGTTCGCCGAGGTGTCAGCCGGTGCCGGAGACGGTGGCAACAGTGGTGAGAACGCCGCTGCGCGCGGGCTGGCCACGGCGCGGCGGACCCTGGCGCTGCTCCAGGACTGGCTCGCCGAACCGCACCTGGCCGACGCCCGCCTCGTTCTCGTCACCCGAGGCGCGGTCACCACCGGGGAACTGGACGGACCCGCCGCCGGCGGCGACGCCGAAGCGCCGATCGCCCTGAGCGACGCGGGCGTGTGGGGGCTGGTGCGCACCGCCCAGTCGGAGAACCCCGACCGCTTCGTCCTGCTCGACCTCGATTCCGGTACGGCGGACGCCACCGGCGACGGAGCCGTCCGCGCCGTGTGGGCGGCGATCGCGGCGGACGAACCGCAACTCGCGCTCCGTTCAGGACGCGCGTTCGTCCCCCGCCTGACGCGTACGGGCACCGACGACGGAACGCTGCGGCTACCGCAGCCGGAACGGGCCTGGCACCTGGACGCGGGAGGCACCGGCACCCTCGACGGCATCGCCGCCGTCCCCTGCCCGGACGTCCTCGAACCGCTGGCCGCCGGCCAGGTCCGCATCGAGGTACGTACGGCCGGGATCAACTTCCGCGACGTTCTCGTCAGCCTCGGCATGGCCCCGGGCCAGACCGGCCTGGGCAGCGAAGGCGCCGGAACCGTCGTTGAAGTCGGCCCGGGTGTCTCGGGCCTGTCGGTCGGTGATCGTGTGATGGGTCTGTTCGAGGGCGCGTTCGGCCCGGTGACCGTGGCGGACGAACGAATGGTGGTCCGTCTTCCGGATGGTTGGGATTTCCGTTCAGGTGCCGCTGTTCCGGTGGTGTTCTTGACGGCTTGGTACGGCTTGGTGGAACTGGCTGGGTTGAAGTCGGGTGAGTCGGTGTTGGTTCATGCGGCGACGGGTGGTGTGGGTTCGGCTGCGGTGCAGATCGCCCGCCATTTGGGTGCGGAGGTTTTCGCCACGGCCAGTCCCGGTAAGCACGCTGTTCTGGAGGCGATGGGTATCGACGCGGCCCATCGCGCCAGTTCCCGCGACTTGGAGTTCGAGCGGACGTTCCGCGCCGCGACCGGTGGTCGTGGTGTTGATGTGGTGCTCAATAGTTTGGCGGGGGAGTTCACCGATGCGTCGTTGAGGTTGGTGGCTGAGGGTGGTCGGTTCCTGGAGATGGGCAAGACCGACATCCGCGAACCCGAACACATCGCCGCAGAGCATCCAGCGGTCGGTTATCAGGTGTTCGATCTGGTGACGGACGCCGGGCCGGATCGAATCAGTGAAATGTTCCGAGCGTTGGCGGAGCTGTTCGCCGAAAAGGTATTGCGGCCTGCGCCGACTCAAGCATGGCCGCTGACCCGCGCCCGCGAAGCGCTGCGGGTGATGAGTCAGGCCAAGCACACCGGCAAACTCGTTCTGGACATCCCTCCGGCTCTGGACCCGGACGGCACGGTGCTGATCACCGGCGGCACGGGAACACTCGGCGGCCTGGTCGCCGAGCATCTGGTCCGTTCCGAGCAGGTCCGGTCGTTGGTGCTGGCCAGCCGCAGCGGCCCGGACGCCGCCGGAGCGGCGGAGTTGAAGGCCCGGCTGGAGGAACTGGGCGCGGAGGTACGGATCGTCGCGGCCGACGTGTCGGACGCCGACGCCGTGACCCGTCTGGTCGCGGAGGCGGACGCGGACACCGGACGTCCGCTCACCGGGATCGTGCACACGGCAGGGGTGATCGACGACGCGGTGCTCGCTTCGCAGACGCCTGAACGGCTGGCACGCGTGTGGGACGCCAAAGCCTCCGCCGCGCACCATCTGCACGCCGCCACCGAGCATCTTCGGCTCGGCGCGTTCGTCGTGTTCTCCTCGGCCGCGGGGACGTTCGGGAGCGCGGGACAGGCGAACTACGCCGCCGCCAACGCGTTCTGCGACGCGCTCGCCGCGCATCGCCACGCCAAGGGGCTGCCCGCGGTCTCGGTGGCGTGGGGGCTGTGGGAGGCCACCAGCGGAATGACCGGGGACCTGGCCGACACCGACGTGGCCCGGATGAGCCGCGCCGGGATCAGGCCCCTGAGCGCCCGGCACGGGCTCGCGCTGCTCGACGCCGCATCGCGGCACGGCGCGGCGAACCTGCTCGCCGCCGACCTCGACGTTCGCGCTCTCAACGGTGGATCCGCCGAGGCCGGGCAGGGGGCGGTGCCCGCCTTCCTGCGCGCACTCGCCGGGGGCGGCACCGCGACCCGCCGGACGGCCGCCAACGGGCAGGCCCCCGCCGACTTCGCCGCACGGCTCGCCGCGATGCCCCTGGAGGAACGGCAGCGCGTCCTGCTCGACCTGGTCCGGTCGCACGCGGCGGCGACGCTCGGGCACTCCGACGTGGGCGCCGTCCAGACGGACGCACCGTTCAAGGAACTGGGATTCGACTCGCTCACCGCCGTCGAACTGCGCAACAAACTGTCGGCGAGCACGGGCCTGCGGCTTCCCGCGACGTTCGTGTTCAGCCATCCGACCCCATCCGCCATCGCCACCGAACTGCACGACAAGCTCTGCCCGGACGCGCCGGAGCCCTCCGCGCCCGTCTTCGCCGAGCTGGAACGGCTTGAGGCGGCGATGGCCGGGCTGACGCTCGACGCCGAGACGCGCGGCGGGCTCGTCAAACGCATGCAGACCCTGATGTGGAGGCTGGGCGACGGCGCCGCGGAGGCCGAGCAGGCCACCGCGGACGAGGACCTCGACTCCGCCTCCGACGAAGAGATATTCGCGTTCATCGACAGGGAACTGTGAGTTCCGCCGGAAAGCGAGGACCGATGCCGGGCACGGAAGAGAAGCTGCGCCAGTACCTGAAGCGGGTGACAGTGGACCTCGGGCAGGCACGCCAGCGTCTGCGCGACGTGGAGGAACGGAACCAGGAGCCCATCGCCGTCGTCGCCATGGCGTGCCGCTACCCGGGCGGCGTGCGCGGACCCGAAGACCTCTGGGACCTGGTGGCCTCACGCGGCGACGGCATCACGGAGTTCCCCGACGATCGCGGCTGGGACCTCGACGGCCTCTACCACCCGGACCCCGACCACTCCGGCACCAGCTACGTCCGCGAGGGCGGGTTCCTCGACGACACCGACCGGTTCGACGCGGCGTTCTTCGGCATCAGCCCGCGTGAGGCGATGGCGATGGACCCGCAGCAGCGCATGCTGCTGGAGGTGTCCTGGGAGCTGTTCGAACGCGCGGGCATCGCGCCCGCCGCGTTGAAGGGCACCCCGACCGGCGTGTACGCGGGGATGTCCAGCCAGGACTACATGACCCGTACGCCCCGCATCCCCGAAGGGTTCGAGGGCTACGCCACCACCGGCAGCCTCACCAGCGTCATCTCCGGCCGCGTCGCCTACACGTTCGGGCTGGAGGGCCCGGCCGTGACGGTGGACACCGCCTGCTCGTCCTCCCTGGTCGCCATCCACCTCGCGTGCCAGGCCCTGCGGCAGGACGAGTGCACGCTCGCGCTGGCCGGCGGCGTCACCGCCCTCACCACCCCGACGGCGTTCGCCGAGTTCTCCCGCCAGCGCGGCCTCGCCCCGGACGGCCGCTGCAAAGCCTTCGCCGCCGCCGCCGACGGCACCGGGTTCTCCGAAGGCGTCGGGCTCGTCCTGCTGGAACGGCTGTCGGACGCACGCCGCAACGGACACCACATCCTCGCGGTGCTGCGCGGCTCCGCCGTCAACCAGGACGGGGCGAGCAACGGCCTCACCGCCCCCAACGACGCCTCGCAGGAACGCGTCATCCGCCAGGCCCTCGCCAACGCCCGCCTCGCCCCCGACCAGGTCGACGCGGTCGAGGCGCACGGCACCGGAACGACCCTCGGCGACCCCATCGAAGCGCAGGCGTTACAGGCGACGTACGGGAAGGACCGCCCGGCGGACCGCCCGCTCTGGCTCGGCTCGGTCAAGTCGAACATCGGCCACACCCAGGCCGCCGCGGGCGTCGCCGGCGTCATCAAGATGATCATGGCGATGCGGAACGGCGTCCTGCCCCCCTCGCTGCACATCGACGAGCCGACCTCGCATGTGGACTGGGACTCCGGCGCGGTGCGTCTTCTCACCGAGCCGGTGGAGTGGGTGTGGGGTGGGCGTCCGCGTCGTGCGGGTGTGTCGTCGTTCGGTATTTCGGGGACGAATGCGCATCTGATTGTGGAGCAGGCCCCCGAGTCCGAGGCTGAGGTCGAGGTGTTGGAAGGGGCTGCCGGTCCTGACACGGAGGTGGATGAGTCGGTGCCGGGTGGGGTGGTGCCGTGGGTGGTGTCGGCGCGGTCGGTGGCGGGGTTGCGGGATCAGGCGGCGGTGTTGTCGGCGCATCTGGCCGGGCGGGAGGTGTCGCCGGTGCGGGTGGGGTGGTCGCTGGCGGCCACGCGTTCGGTGTTCGAGCACCGCGCCGTCATCACCGCCCACCACCGCGAGGAGTTCCTGGAGGGGCTCGATGCGCTGGCCGCCGGAGGCCAGCACCCCGGTCTGATCACCTCGCCTGCTGGTGGAGCGGGTGGTGGTGGGGTGGTGTGGATGTTCAGTGGGCAGGGCAGTCAGCGGCCCGGGATGGGCGCGGGACTGTACGCACGGTTCCCGGTGTTCGCCGCCGCCTTCGACGAGATCTGCGGCCTGCTCGACCCGCACCTGCCGCACCCGCTCCGCGAGGTGGTGTTCAACCCCGACCCCCACCAAGACCAGCAGAGCGGGCTGCTGGATCACACCCTGTACACCCAGACCGCGCTGTTCGCCCTGCACATCTCCCTGGCCCGCCTGCTGCACCACCACGACCACACCCCCCACACCCTCATCGGACACTCCATCGGAGAAATCGCCGCCGCCCACATCGCCGGCATCCTCGACCTCCCCGACGCCTGCCACCTCATCACCGCACGCGCCACCCTCATGGCCCAACTCCCCACCGGCGGCACCATGACCGCCATCCAGGCGCGGGAGGACGAGCTCGCGGCCGAGGTCGAGGAGTCGGGCGGGACGGTCGCCATCGCCGCGCTCAACACGCCGGACAGCACGGTGATCTCCGGGCAGGCCGATGAGGTGGCGCGGATCGCCGCCGGGTGGGAGGAGCGGGGGCGCAAGACGAAGGCGCTGGCGGTCAGCCACGCGTTCCACTCGCCGTTGATGGACCCGATGCTGGACGAGTTCCGCGAGGCGATCGAGGGGTTGAGCTTCCGGGCGCCGTCCGTTCCGTTGATCAGCACCTTGACGGGGGAACTGGCGGGTGACGAGATCGCCAGCCCCGCGTACTGGGTGGATCAGGTGCGGCGGGCGGTGCGGTTCGCGCCCGCCGTTGCCCGCGCCGCCGAACGGGGCGGGGCGTTCCTCGAAATCGGGCCCGATCCCGTGCTGGCCACCGCCGCGCAGCACACGCTGTCGGAGCCGGCGTCCGGCCGGGAGCGGCCCGAGCCGCTCGTCACCGCCGTGCTCGACCGGCACCGTCCGGACGACGAGGCGTTCGTGCGCGCGCTCGCCGAGGTGCACACGCACGTGCGGAGCAGCGCGATCGGCTGGGCGCGGCTCTTCCCGGACGACCCGGCGCCTCGCGCGGTCGACCTTCCGACGTACGCGTTCCAGCGGCAGCGGTACTGGATGGCGGACGTGACGCCGCCGGAGTCCGCCGCCGGAGGGGACGCGTCCGAGGAGCGGTTCTGGGGCGCGGTCGAGCGCGGGGACCTGGAGGCGCTGTCCGACACGCTGCGGCTGCCGGACGGCGACGGGCAGCGGGCGTCGCTGGGCGAGATCCTGCCGGTGCTGTCCGGGTGGCGCAGGGAGCGCCGGGAGCGTTCCGCGGTGGACTCCTGGCGTTACCGGGTGACGTGGAAGCGGCTGACCGGCCTCGACGGCGGTACGCCGTCCGGGCCCTGGCTCGTCGTCGCGGCGGGCGGGGCCGACGGGTGGGCGGACGCCTGCGGGCGCGCCCTGGGCGAGGAGGCGCACCGCCTCGACGTGGACGGCGCCGTTGACAGGGAAGGGCTGGCCGAATGGCTGCGGTCCCGGTACGCCGACGGCGCGCCGCCCGCCGGTGTGCTGTCCCTCCTGGCCCTGCGGGACGGGCCGGAAACGGGTGCCGGAGGGGCGGCGGGCACGCTGGCCCTGCTCCAGGCGATGGTGGACACGGGGATCGGCGCGCCGCTGTGGTGCGCCACGCGCGGCGCGGTCTCGGTCGGCGACTCCGACCTGCTGGAGTCCCCTGCGCAGGCGCGGGTGTGGGGGCTCGGCCGGGTGGCCGCGCTGGAGCATCCCGACCTGTGGGGCGGGCTGGTGGACCTTCCGCCGGGTCCGGACGGGTTGGACGCCGCGCGGCTGCGCGCCGTTCTGACCGGGACGGCGGGGGAGGACCAGGTGGCGCTGCGCGCGAACGGCGCGTTCGGGTGCCGGGTGGTGCCGTCGCCCGCCGGGGACGGCGAACCGGGGCGGGAGTGGTCGCCCGAGGGGACCGTCCTCGTCACCGGCGGCCTCGGCGAGCCGGTCGCGCGCATCGCGAGGTGGCTCGCGGAGGGCGGCGCGGAACGCGTGGTCGTTCCGGACCCCGGCGGGACGGATGCGCCCGGTGCTGGAGAGTTGGCGGACGAGCTGTCCGGGCTCGGCGCCGAGCTGATCGTCGCGCGCTGCGGGCCGGACGACCCGGCGGCCGTGAAGGAGCTCGCCGGACGGCTGTCGGCGGCGGGCGCGCGGATCGGGATGATCGTGCACGCGCCCGCTCCGGCCGAGCCGGGCCCGCTGGCGGAGCTGGCGCCCGCCGCGCTGGAGGCGTTCCCGGCCGAGGACGTCGGCGCCTGGCGGCGCGTCGGGGAGCTGTGCGGGATGGAGCCGGACGAGCCGGCGGTGTGCTTCACGTCCGTCGCGGCGCTGTGGGGGAGCACGGGCCAGGGTGCGCACGCCGCCGCGGGCGCCCACCTCGACGCGCTCGCCGGGCCGGGCCGTCCCGAGGGCGTCGTGTCCGTCGCGTGGGGCGCGTGGGACGTGCCCGCCGACACGGACGAACGGGGCGCCCGCGGCGCGGACGGCCTGCGCAGGCAGGGCCTCCAGCCGCTCGACCCGAGGCTGGCGCTGACCGCGCTGGAACGCGTCCTCGGGAACGGCGACCGGCGGATCGCCGTCGCCGACGTCGCGTGGGACCGGTTCGCGCCGCTGTTCACGCTGGCGCGGCCGAGCCGCCTGTTCGAGGACGTCCCGGAGGCGCGGCGGGCGATCGAGGCCGCGCGGGGCCCCAGCGACGACGAGGCCGCGGACCGTACGGCGGAGCTGCGCAGGGAGCTGGCGGCCGTCACCGCGGACGAGCGGGCCGCCCGGCTGCTGGCGATGGTCCGCACGGACGCGGCGGCCGTGCTGCGGTACGAGGCGGCGGACGCCGTGGACCCGGACCTGCCGTTCAAGGACCTCGGGTTCGACTCGATCGCGGCGGTCGGCCTCCGGAACCGGCTGCGCGCCTCGACCGGCCTGCGGCTGCCCGCGACGGTCGGGTTCGACTACCCGACGCCGAGGGCGCTCGCCGGGTACCTGCTGGGGCGCGTCCTGCCGGAGGAGTCCGGGACCGGGCACGCCGCGTTCGGGCACGTCGAGGAGCTGGACGCGGCGCTCGCCGAGCTGCCGCTGGAGGACCCGCGCCGCGCCGGCCTGATGAACCGGATGCGGGCCCTGCTGTGGAAGTACGAGCCCGACAGCGCCGCGAGCGAGGCGGGGGACGGAGACGGCGAAGAGGACCTCGCCGCGGCGTCCGCCGACGACATGTTCGCGCTCATCGACCGCGAACTGGGCACCTGAACGCATGAGAAGGCCCCGGCCTCGCGCGGTGCGCGGGGCCGGGGCCGGTGCGTCGGCGGGACGGCTCAGTTGGCGGTGCGGTCGCCCAGCGCGGCGCGGCGCTGGACGTTGCGGGCCAGGCGGAGGAGGTCCTCCGCGACCTCGTCGGGGTGCTGGGCGTGCAGGTCGTGGTCGCCGCCGACGTACTCGCGGACGGTCGCGTCGGGCAGCGCGGCCACGGCCGGCTCGACGTACGAGCGGACGCGTTCGAAGAGCGGCTCCCACCGCCGGTTCGGCTTCGGGACGGCCGGCATCAGCAGGGTCGGGACGGTGATGGCGGGATACCACTGCGCGGGCGGGTCGTCCCAGATGCTGCGCAGGATCGCGTTGCGCTGCGTCGGGGACAGGCGCGGCGTCAGCGCGCCGTCCTCGTGCACCCGCAGCGAGTGCAGCGACGCCTCGACGGCCTCTGCCGACCAGTCGGGGTAGACGGCGCGCTGGTAGTCGCGCATGGCGTCGAGGGTCGCCCCGTTCAGGTCGACCTCCGACATCGTCAGCAGCCCGGCGAACTGCTCCCACGAGTCGCACACGACGCTCGCGTGCGCCCACCCGCCCTCGATCAGCGCCAGGCCCGCGACCAGGCCCGGACGCCGGGCGGCCAGCCGCAGCGACACCAGGGCGCCCCACGAGTGCCCGGCGACGACCGCGCGGGTCACGCCGAGGGCGTCCGCGACCGCCGCGATGTCGGCGGCGGCGGTCTCGTTGCCGTAGTCGTCGTCGGCGGGGTCGGAGTCGCCGTGGCCGCGCTGGTCCACCGCGTACACCGGGTGGCCCTCGGCGGCGAGCCGGTCCGCGACCAGGTCCCACATCCGGGCGTTGGAGTCCAGGCCGTGCAGGAGCAGGAACGGCGGTCCGCCCGTGCCCTCCCGGTGCCTGACGTGCAGCTTCACCGCGTCGCCCAGCGGTACGGAAATGTCCATGTGAAAGTCGTCCCGAGAGTAGTCCGCCGGAGTGCGGCGCGGAGGAAACGATGTGCCGGAGGGCCCCGCGGCGCGGGGGCGCCCGCGTCAGGCGGGGCCGTGCCGGGCCGCCAGCCTCTCCAGCTCCGGAACCAGCCCGTTCGGGGTCGGCTGGTCGAGCACCTCCCTCCTGATCCGCTCGGCGCCCGCCCGGAACGACGGGTCGTCCAGCAGCCGCGCCAGGCTCTCCCGGACCTTCGGCCCGGTCACCTCGTCCGGCGGGATCGACAGCCCGGCGCCGGTCTCGGCGAGGTAGGCGCGCTTGTGGAGGGCGTCGGGCACCTTGGTGATGAGGAGCTGCGGCGTCCCGGCCAGCAGCGCGTTGTAGAACGTCCCGGCGCCGCCGTGGTGGATCATCGCCGCGCAGGTCGGCAGGACCGCGTGGAACGGCGCGAACTCCACGATCCGCGTGTTCCGGGGAACGCGGTCCAGCTTCTCGCGGAACGCCGTGGGCAGCGTCAGCACCAGCTCCATGTCGAGGCCGGCGACCGAGTCGAGCACCTCCTGCACCCGCTCGATCGACATGACCTGGAGCTCCTGCCAGTCGCTCATCGACACGCCGAGCGTCATCAGGACGCGGGGCGCGGGCGGCGGCTCGTTCAGCCAGTCCGGGATCACGGCGGGGCCGTTGTACGGGACGTAGCGCAGGTTGAGGTGGCTCGCGCCGTAGTCCCCGCCGAACGAGGGCGGGAACTGGTCGATCGTGGCGTGCGCGTTGACGAGCGTCTCGGAGTACTCGTAGCCGAACTTCTCCGTCCACGAGCCGAGCCACTCCGCCAGCGGGTCGCGCCGCCGTTCCGGCGGCTGCTGCGCCATCGTCCAGAGGAAGCCGCTGCGGAGCTGGAAGCTCAGATCCACGGTGAACGTCACGCGCGCGTGCGCAGCGCCGACCGCGCCCGCGGCGACGGGCCCGCGAGGGTGAGGGCGTCCCAGACGACGAGCCGGGGCCGCCACGACCGGCTGAACGCCACCAGGTCGTCGAACAGCGTGTCGTTCGCGATCCTGGCCCCTCGGCACGGCGACGGTCTCGTAGAGCCAGACCATGTCCTCCCAGGAGAGCTTCGCCCGCGGGTCGCCGATGGCACCGCCCGCCTGCCCGCCGCTGCCGGAGGGCTTCGGCGCGGTGGACGTCGCGCCGCCGAGGGCGCCCGCCAGCCGCCCCATGTCGAGCGGCGGCAGCGTCCCGTCCTCCTCCGCGCGCCGCGCCCGGTCCCGGATCGGCTCGTCCGAGCCGACCGGCACGGCCGTCAGCCCGGACCGGGTGACGGTGTCGACCAGCTCGGGCTCGCAGGCGACGCGCACGTCGTGCCCGGCGGTCTGGAGCGACCAGGCCAGCGGCACGAGGTTGAAGAAATGCGTGTGCCAGGTCGGCGTGACGAACAGGACGCGCACGGCCACCGTCTCCTTCGGGGGGAATGGGCTGTCACGGTCACCGGCCGGGCTTGCGCGCCGACGTGATGACGTAGTCCCGCGCCGCCGCCCGCGCCTCCACCTGCCTCAGGTGGTAGCGGATGGTGTTCCTCGGCTGGAACGACCGGTAGCGGGGGTCGCGGAGGAGCCGCGCGCAGTGCGCCCGCGCGAACACGGCGTTGGCCGCGATCGTCCTGTCGGTGACGTCCCGCACGTCGACGTCCACGAACCCGGCCTCGGCGAGCCGTGCGGCGTAGACCGAGCGCGGGTACCAGTTCGCGTCCGGGATGATGCGCCTGCGCCGCACCTCGTCCAGCCGCAGCACCAGGCCGCCCTTGCCGCCCGGCGCCTCGCGCGGCAGCGGGTCGGTCGTCGCGAGGACGCCGCCGGGCCGCAGCACCCGGAACGCCTCCCGGAAGAACACCTGCCGGGTGTTGAAGTGCGCGGACGACTCCAGCGCCACGACCCGGTCGAACGAGCCGTCCTCGAACGGCAGCGAGGTCGCCGACCCGACGCGCAGGTCGAGCCGGTCGTCCAGGTCCAGCTCCCGGGTGCGGCGCCGCGCGGCGTCCACCTGGCCCGGGGTGATGTTCAGGCCGACGATCTCCTTCGGCTCGCGGGTCCGCAGCCAGTGGAAGTCCTGCTCGCCGAAGCCGAAGCCGACGTCCAGGACGCGGTCCCCGGCGGTGATGCCGGCCGCCTCGCCGAGCCGCTCCGCGAGCGCGACGCACGCGGCGTCGTGGTCGGTGCAGCCCGGCTCCCAGTAGCCGTAGTTGAGGTAGCCGTTCTCGATGCCGAGCAGGTTCGGCGCGCCGCCGCTGGAGAGCCGTTCGAGGAGCCGGTACATCGAGCGCACCCACTTCTCGGTGAACGGCGAGGAGCCCCGCGTCCCGGCCGGGCCCGGCCCGGTCCGGGGGCCGTCGCCGCGGCCCTCACTGATCGTTCCGCCGGTCATCGTCAGCCTTCCGCTCTGTCGGGTCCGCCGGGGCCGTGCTCCGCGCGACCGCCCGCGGACGGCGCGTCCGCGGACGGCGGCCCGAGCGTCACGGGCAGCCGGGCGAGCTGGTGGGTGATGGGGATCTGGTCCCGTTCCAGCCGGTCCGGCGGCACGGCGAGCGCCAGCTCCGGGAACCGGGTGAACAGCGCCGAGAACGCGACCTCGGCCTCCTGGTTCGCGAGCGCCGCGCCCAGGCAGTAGTGCGGCCCGCGCGCGTACGCGAGGTGCTGCGCGTCGGCGGCGCCGCACGGCCGCTCGACGTCCAGCCGGTCCCCGTCCGGGAACCGGCGCGGGTCGCGGTTGGCCGAGCCCAGCACGAGCTGGACGCGTTCCCCCTGCCGGATCAGCGTGTCGCCGATCGTGACGTCCTGCGTCGCGTACCGCAGCTTCGCGATGACGACGGGGCTGCCCCAGCGCAGCAGCTCCTGCACCGCGCCGGGCATCAGCGCCGGGTCCTCGCGCAGCCGCGCGAGCTGGCCGGGGTGCGCGAGCAGCGCCGCGATGCCGTTGCCGAGCATGTGCGACGTCGTGGTGTGCCCGGCGATCATCAGGGTGAGCACCATGGTCACCAGCTCGGCGTCCGACAGCCTGCCGCCGCCGTCGTGCGCGCCGGTGAGGGCGGTGAGCAGGTCGTCGGCGGGCTCGGCGCGGCGCCGCTCGACCAGCTCGCGGATGTGCGCGTTCATCTCCGTGACGATCTCGCCGAGCCGCCGCGGCGTGGAGTCGCCGAACCGGCGGCTCCAGGCGTGCCACCGCGGCCGGTCCTCCTCGGGGACGCCGAGCAGCTCGCAGATGACGCTGACCGGCAGCGGGTAGGCGAAGTGCTCGATCAGGTCGACGACGCCGTCCACCGCCCGGTCCGGCAGGGTGTCCAGCAGCTCGCCGGTGATGGCCTCCACCCGCGGCCTCAGCCGCGCGATCCGCCGCGGGGTGAACGCCCGCGCCACCAGCCCGCGCAGCCGCGTGTGGTCGGGCGGGTCGAGGTAGACCAGGTTCCCGGCGAGGTAGGGGACGACGTCCTCGGGGATCCCCACCGTCCGCGCCAGGAACTCGGCGTGCTCGTCGGTCCGGCCCGGCAGCGACCCGGCGTTGCTGGCGAACCGCCGGTCCAGCAGGACGGCGGACACGTCGTCGTGCCGGGTGACGATCCACACCGGGCCGCCGTCCCACAGCCGCCCCCGGGCGACCGGGGCCTCCTCCCTGATGCGGCCGAACGCGCCGTAGGGGTCGGCCACCAGTTCGGGGTCGCTCAGCACGTGCTCGCCGGACGTGCGCGGGCCGGTCGTGCGCTCGGCGGTCATCCAGGGTCTCCTTCGGACGTCGTTGCCGGACCGGGGCGCGCGGCGACCCGGCGTGCCGCTACCGCGCGGCGCCGTCCCCGGACTCGGCGAGCAGCTCGGCCATGAACCGGCCGAGGTGCACCGGCGTCGGGTGCTCGATCACCGCGACGAGGGGGATCTCGATGCCGGTGAGCGACATCAGGTTGCGGGTCAGCTCCAGCGCCTTCAGCGACGTGAGCCCCTTTTCCAGGAAGTTGCTGTCGAGGTCGATGACGGACTGGTCGAGAACGGCGGCGGCCTGCTCGCGGACCGTTTCTCCCAAGATCCGCTCGCGTTCCGGCGGACCGGCGGCGGCAAGCTTCTCCTGGAGTTCGGTGCCGTCGAGAACGGCCTGTCCGTCGCTGTCGTTGGTCGCTTGCTGGTGCATGCGTGCTGTCCCGTCCTGTCGTTGTCGAAGCCGGCCCGCGGCGCCTCGTACCGCCGTCGGCGGGCGGCGCCGCCGTTCCGGCCCGGGTCGTCGCGAGCCTCCCGACCCACTGTGGGGCCGCCCTCTAAAGGCACGCCAAATAGGCCGCCCGCCAGCGCGCACACCCCATTCCGCGGACCCGCGAAGGGCTTTAGCCCCGCATTAACGGAACGGCGGAACATTGAGCGGCGGGGTGAAGTGAGCCTGGAGCCCCGCCGGACGCCGAAACGCTCAGGAGGAGTGTCATGACATCCCAATCGGCTGCGCCGGCGACGGCCGGCCAGATGTCCGACTACTACAGCGCGATGGGCGCGCTGCTACAGATGGCGTGGGGGGACAACTTCCATTTCGGCTACTGGGACGGGCCCTCCGACACGGCCACGATCGACGAGGCCACCGACCGGTTCACCGACATGCTCACCGAACGGCTCCGCGTCGGCCCCGGCGACCGGGTGCTGGACCTCGGCTGCGGCGTCGGCAAGCCCGCGCTGCGCATCGCGGCGAGGACCGGGGCGAGCGTCGTCGGCGTCACCATCAACGCCCGGCACGTCGAGCTGGCCACCGAGCGCGCCCGCGCCGAGGGCCGCGCCGACCAGGTGAGCTTCCGGCACGCCGACGCCATGGACCTGCCGTTCGAGGACGGCTCGTTCGACGCCGTGCTGGCGTTCGAGTCGATCATCCACATGGACCGGCCGACCGTGCTGCGGGAGGTCGAGCGGGTCCTGGTCCCCGGCGGGCGGCTCGCGCTCACCGACCTCACCCCGCTCACCGACGAGTCGAACACCCCGAAGTCGTTCCGCAGCCTCATGGGCGCCTCGCCCGGGGACGAGAGCCCCTCCGGGCAGGGCGGCGACCCCGACATCGCCACGCTCATCTCCGCCGGCGCCTGGCCCGGCCTGCTCAGCGACGCCTCCCTCACGCTCGACGAGCTGAACGACGTCACCGAGCACACGCAGGGCACGTTCATCCGGCTGTTCGAGAACTTCTTCAAGTACCGCCGCGAGTTCGAGCGCGAGCACGGGATCACCGTGGAGCAGGTCCTCGACTCGGCCAAGTCGGGGGCGCCCACGGACGGCGTCGGCTGCCTCGTCGTCGCCGCGCACAAGCCCTGACGCGTCGTCCCGCCCGGCCGCACGCGCGCGGAGGCCGGAGGCCGGTCATCACCGGCCTCCGGCCTCCGCGCGTCCGGCCCGTTCGGCCCTTCACGGGGCGGGCGCGCGAAGGCCCCGCGGCCCTTGGGGCACGCGGGGCGAACGGAGCGGGCGGAGCGCCGGTGTCAGGTGCGGGACGCCGCCCTCAGGCGGGCTCGCCGAGGACCACGGGGACGCGGACGAGCTGCCGGGTCAGCGGGATCGGCCTCCACTCCAGCTCCTCCGGCGCGACGGCGAGCGCGAGGTCCGGGTAGCGGCCGAACAGCGCCGACAGCGCCACCTCGATCTCCTGGTTCGCCAGCCCCGCGCCGAGGCAGTAGTGCGCGCCGCGCGAGTAGCCGAGGTGCTGCACGCCGCCGTCCACGGGCCGGGTGACGTCCAGCAGGTCCGGGGCGGGGAACCGGCGCGGGTCCCGGTTGGCCGACCCCAGCACGAGCTGGACGCGGTCCCCCTGCCTGATCACGGTGTCGCCGAACGCGATGTCCTCCGCCGCGTACCGCAGCATGGCGACGATGCCGGGCCCGCACAGGCGGACCAGCTCGTGCACCGCGCCCGGCCACAGGCCCGGGTCGGCGCGCAGCAGCGCGAGCTGGTCCGGGCGGGCCGCGAGCGCCACGGCCCCGTTCGCGACGAGCTGCGGGGTCGGGAGCTGCGAGGCGATCATCAGGGTGAGCACCATGGCGACCAGCTCGGTGTGCGAGAGCAGGCCGCCGTCGTCGTCGTGGGTGCGGATGAGCGCGCTCACCAGGTCGTCGGCGGGCTCGGCGCGGCGGCGGCCGACCAGCTCGCGCAGGTGCGCGCTCATGTCGGCGAGCATGCGGTTCAGCCGCGCCGGGTCCATGCTGACGTAGTCGTCGCTCCACCCGCGCCACAGCGGCCGGTCCTCCTCGGGCACGCCGAGCAGCTCGCAGATCACGGTGATCGGCAGCGGGTACGCGAAATGCTCGATCAGCTCGACCGCGCCGCCCTCCGCGCGGCCGGGCAGCGCGTCCAGCAGCTCGCGGGCGATGTCCTCCACGCGCGGCCTGAGCCGCGCCACCCGCCGCGCGGTGAACGCCCGCGAGAGCAGCTTGCGCAGCCGCGCGTGGCTCTCCGGGTCCGTCCGCACGAGGTCCCCGGCGAGGTACGGGACGAGCTCCTCGGCGACGCCGATCGCGCGCAGCGCGGCACGGTAGTCGTCGGGGCAGCCCGGCACCGCCGAGGAGTTCGTGGCGAGGCGCCGGTCCATCAGCAGCGCGCTGACGTCCTCGTGCCGCGTCACGATCCAGACCGGCCGGCCGTCCCATGAGCCCTGGACGATCGGGGCCTGCTCGCGGATGCGGCCGAACCCGCTGTACGGGTCGACCATCAGCTCGTGCCAGGTCAGCGCGTAGTCGTCCGTGGTCATCGGGGTCCTCCTTCGGATGCGTACGCCGCGACCGGCGCGCCGTCCGCTGGGGGCGGCTCCTGGAGCCAGAAGCGCCGGCGCTGGAAGGCGTACGTGGGCAGGGGAACGGCGCGCGGGCGCGGCCCGCCGCCGAAGAGGGCGGCCCATCCGACCGTCGCGCCGGAGGTGTGGAGCCGAGCGAGGGCCTCGACCAGGCCGCGGCCGACGGCACGTCCGTCGCCGGGAACGAGCCGTTCGGCAGAACGCGGCGCGTCCGGGAAGGGCTCGGGGCCGAAGTCCAGGAACGCGCCCGCCTCAGCGAACGGCGCGGAGCCCGCGTCGAGCACCGGAACGGTCGGCTTCCCGCCCGTCAGCGCGAGGGCGTCGGAGCCGGGGGCGTCGCGCTGGGCGACGAGGCGGCAGGCGTCCGGCAGGTCGAGCGCGCCCGCGAGGTGCGCGGCGGAGACCTCCCCGACGCCGCGGCCGACCACCGCGCCCGGCCGGAGCTCCGCCGCGAGCAGCAGGCGCGCCAGGGCGACGTGCAGCGCGAAGGTCGCGGCGTCCCCGCCGAGCGCCCCGCCGAACCGTTCGCTCACCTCGTCGTGAGCGGCGGCGAACGCGGGGAAACGTTCGCGGAGGGCGGTGGTCGCCTGCGCGGCCCGTTCGGCGGAGCCGCCGAACGCCCACGTACCGGCGTCGGAGTCCGCGTCGGGACCGGCGGTCGCTCCGGTGCCGCCGATCACGTTCTCGTGGGGCGTGCCGTTCGCGAGGGCGTGCAGCCCGGCGAGCAGCTCCGCGCGGTGCGCGCCCACGACGACCGCGCGGTCCTCGAATACCGAACGGGTCGCGGCGAGCGACCACGCGACGTCCTCCGGCGCGAGGTCCGGGTCGTTCCCGACGTGCGCGGCCAGCGCCGCCGCCTGCGCCCGCAGCGCCGCCGCGCCGCGCGCCGACACCGGCCAGGCCAGCGTCCCGCCCGGAGGCGGGGAGGGGGGCCGGGGCTCGGGCGGCTCGGGCGGCGGCTCCTCCAGGATCACGTGCGCGTTCGTCCCGGAGTTGCCGAACGCCGACACCCCGGCGCGGCGCGGGCGACCGCCGCGCGGCCACGGCGTCGGCTCGGTGACCAGCCGCACCGCGCCCGCCCGCCAGCTCACCAGCGGGGTCGGGCGGTCGATGTTCAGGGTGGCGGGCAGCGTCGCGTGCCGCATCGCCAGCACCGCCTTGATCACGCTCGCCATCCCGGCCGCGGCCTGGGAGTGGCCGATGTTCGGCTTGACCGTGCCGAGCCACAGCGGCCGGTCCGCCGGGCGGTCCTGGCCGTACACCGACAGGACGGCCTGCGCCTCGATCGCGTCGCCGATCGCGGTGCCCGTGCCGTGCGCCTCGACCACGTCCACGTCCGCCGGGGACAGCCGCGCGTCGGCGAGGGCCTCGCGCATGAGCCGCTGCCGGCCGGGGCCGTGCGGCGCGGCCATGCCGCTCACCGCGCCGTCCTGGTTGACGGCCGTGCCGCGCACGAGCGCGAGGACGCGGTGGCCGTTGCGCCGCGCGTCCGCCAGCCGCTCCAGCAGGACGAGCCCGGCGCCCTCGCCCCACACCATGCCGTCGGACGCCGCCGCGAACGGCTTGCAGCGCGCGTCGGGCGCGAGCTGCCGCTGGCTGGACGACAGCAGGAACGTGCTCGGCGTGTACATCACGGCGGCGCCGCCCGCGAGCGCGAGGTCGCACTCGCCGTCGCGGAGCGCCCGCGACGCCAGGTGCATCGCGGTGAGCGAGGAGGAGCACGCGCTGTCCAGCGCGACCGCGGGCCCGACCAGGCCGAGCGTGAACGCCACGCGGCCCGGCGCGCCGCCGCCCGCGTTGCCGGTGCCGAGGTAGGGCATCAGCGCGGCGGGGACCCGCCTCAGCCGGGTGCCGTAGTCGTGCAGCGCGACCCCCGCGTACACGGCGGTGGCGCTGCCGCGCAGCGTCCGCGGGTCGATGCCCGCGCTCTCCACGGCCTCCCACGCGGTCTCCAGCAGCAGCCGCTGCTGCGGCTCGATCGCCTCGGCCTCCTCGTCCCCGATCCCGAAGAACGCCGCGTCGAAGTCGGCGGCGTCGTCGAGGAACCCGCCGCCGCGCGCGTAGGTCGTGCCGTGGTGGTCCGGGTCGGGGTGGAACAGGGCGTCCAGGTCCCAGCCGCGGTCGGCGGGGAAGCCGGTGACGGCCTCGCGCCCCTCCGCGAGCAGGTCCCAGAACCGGTCCGGCGTCCGCGCGCCGCCGGGGAAGCGGCACGCCATCCCGACGACGGCGACCGGCTCGCGCCCGGCGTCCTCGGCCCGCCGCAGCCGCCGCCTCGCCTCATGCAGCTCGGCGGTCGTCCACCTCAGGTACTCGACGAGCTCCGCCTCCCGCCCCGCGCCGCCGCCCGCGCTACCGCCCGCGCCGTTGCCCGCGCCGTTGCCCGTGCGGGCGTCCGCGCCGCCGTTCGCGGGGGAGCGCGGCGCGCCGCCGGAGGTGGGGTCGGTCACCGGGGGACGCTCCTCACGCCCGCGACGGCGCGCCGGTCCAGGTTCACGAGCGCGAGCTCGTGCCGCCGCAGCGGCGGCTCGCCCAGCTCGGGGAGCAGCCGCGCCCCCTGCATCAGCGACATGAACCGCGCCGTGCCGAGCTCCGACTGCGGGACGGTCATGCTCATCACGTCCGTCACCGCGTCGCTCACGTCGGCGGAGCGGATCGCCATCGTCGCGACCATGTCGGCGAAGCCCCGCATGACCGCCGCGCCGCCGTTCAGCCGCGGGTCCTCGGCGTGGTTGCGGCAGTGCACGAACTCGATGTCCTTGGACGTCGCCATGATCCACGGGTCGTCGGCGGCGGCGCTGATCCGCTCCTGCGCCGCCCGCGCCGCGCCCGGCCCGAACGGCTCGGCGAGCCGTTCGTCGAGGACGCGGGCGGCGCGCGCGGCGGCGCTCATCCCGTGCCCGTAGATCGGGTTGAACGCGGCGAGCGAGTCGCCGAGGACGACCAGCCCGTCCGGCCACGTCGCCGCCCGCTCCGGGTACAGCCGCCGGTTCGCGCCGATGTGCGAGGCGAACACCGGCGTCAGCGGCTCGGCCCCGGCGACGAGGTCGGCCACGATCGGGTCGCGCAGCGTCCGCGCGTACGGCAGGAAGTCCTCGTCGCGGGTGGGCAGCTCCCCGCCGCGCGTGCACGACAGCGTCACCATCCACCGGCCGCCCTCCTGCGGGTACACCACGCCGAACCGGCCCGGCCTGCGGACCCGGTGGTCGGCGGCGACGTTGATCGCGGGGAACCCGGCGGTCGCGCCGTCCGGCGCCCGGAACGTCCGCGTCGCGTAGGCGATGCCCGCGTCCACGACGTCCTCGGCGAGCGGCGGCAGGCCGAGCGCGGCGAGCCAGTGCTTCAGGCGCGAGCCCCGGCCGGTGGCGTCCACGACGAGGTCGGCTTCGAGGAGCTCGCGCGCGCCGTCGCGGGCGCCGCGCACGTGGACGCCGGTGACGCGCTTCGCGCCGCCCGCCAGCTCGACGGCCTCGGTGCCCTCCCGCAGCGTGATCCGCTCGTCCGCGAGGATCCGGTCGCGGACGATCCAGTCCAGCAGCGGGCGCCCGCACATGATGGCGTACTGCCGCGGCGGGAACCGGTGCTGCCACCCGTGCGAGGTCAGCGTCACCAGGTCCTCGTGGAACCCGATCCTGCGGGCGCCGGCGGCGAGCAGCCGTTCGGCGGTCCCCGGCAGCAGCTCCTCGACGATGCGGGCGCCGCCCGACCACAGCACGTGGACGTGCCGCGCCTGGGGCAGCCCCCTGCGGTGCTCCGGCCCGCGTGGCAGCACGTCGCGTTCCACCACGGTGACGGCTTCCAGGTGCCGGGCCAGCACGTGGGCGGCGAGCATCCCCGCCCAGCTTCCCCCGAGCACGATCGCGCGTCCGGATCTCGTCACGTTCCCGTCCCTCCATGCACGGCCGGCCGCCCGGCCGGCGCCGCTGCGCCCGCCGCTCAGGAGGAGCCGGCGCGGGCCTCGCTCAACCGCACGTTCTCGATGAGGCGCTCCGTCACGGCCAGCGAGTGCGCCACCCCGGTGAGCCGCGGCGCGTCGGGCAGCGAGACGTCGGTGATGCCCCAGTACGCCTTGAGGCTGCGGGTCAGGCCGTTCCCGTCCACCTCCGAGACGCCGATGACCTGGTACCGCATGTTCGACGGGTACGAGGTCGTCACGACGGACGGCGCCACGACCGTCCGGCCGTCCAGCGCCAGGACGGGCCGCCCCGGCACCTCGTGCACCTTGCACTCGATGGCCCACGCGATGTTCTGGCGGATCGCCGCCTTGCCCCGCACCGGCGGCGAGCCCACCGGGTCCTCGAAGACGATGTCGTCGGTGAACAGCTCCATCACGCCGTCCAGGTCGCCGGCGTTGATGCGCCGCGCGTACTCCAGGGGCGCGTTCTTCCTGACCGCGTCATCCGTCCCGGCCTCGGCCGCCCCCCAGAACGCCCTGAGCTCCTCGATGAGCCCGCCGGCTCCGACGCGCAGCATCAGAACGTACTCGCGCTCGATGGAACGGACGCCGCCCGGCCCCGCCGCCGGGCCGTCCAGGGTGGCGCTGACGGGCACCAGCGCGTGCACGCCGTCCTGGGAGGCGACCGGCTCGCCGGGCACCTCGCGGGCGTTCGCCTCCACGGCCCGCGCGAAGTGCGCCCGCAGCTCCTCGCGGCCGGCGCGCTCGCCCCCGCCGACCGGGTCCTCGAACCGGACGTCCTCCGCGTACAGGTCGAGGAGCCCGCCGACGTCCCCGGCGTTGAGCCGCCGGCAGTGCTCCAGGATGATGCGCTTGCGCGCGCTCTCGTCGGTCAGCATCGGCTGCCCCTTCCGGCCGTGGCCGTCGTCGGTTCCCGGTCCCGGACGCTCATCCGGAGGCGGTCTCGGCGCCCACGGCCGCGCGCGGGCTCAGGTTCACGAGGGCGAGCTCGTCCGGGCTCAGCGGCGGCTCGGTGAGCTCCGGGCGCATCGGGTCGCTGCCGAGCAGCGCCATGAACCGGCTGGAGCCGAGCTCGGACTGCGGGACGGCCAGGCTGGCGGCGTCGGTCACCATCTCGCTCACGCCCGGCGACCGCAGCGAGCGGACCGTGATGACGTCGGCGAAGCGGCGCCGCGCGTCGGCCTCGCCCAGCAGCCGCGGGTCGGTCGCCGTGCTGCGGCAGCCGACGTACTCGATGTCGCGGGCCGCCGCGATGATCCACGGGTCGTCCACCGCCGCGCACACGTCCCGCTGCGCCGCCGCGACCGCCCCGGCGCCGAGGCCCTCCTCCGCGAGGCGCTCGTCCAGCGCCGCGATGCCGCGCGCGGCGGCGCTCATCCCGTGCCCGTAGATCGGGTTGAACACCGCGAGCGCGTCGCCGAGGACGACCAGCCGGTCCGGCCAGCCCGCGAGCCGTTCGGGGTACAGCCGCCGGTTCGCGCCGAAGTGCGAGACGACCAGCGGGGTCAGCGGCTCGGCGCGGGCGATGAGGTCGGCGACGAGCGGGTCGCGCAGCGTCCGCGCGTACTCGTTGAACTCCGCGTCGTCGGTCGGCAGCCGCACGCCCCGCGTCCCGGCGAGCGTGACCATCCACGTGCCGTCCTCCTGCGGGTGGACGACCCCGAAGCGGCCCGGCTCCCGCACCCGCGGGTCGGGCGCGAGGTTGACCGGCGGGAAGCCCGCCGCCGCGCCGGGCGGCGCCTGGTAGATCCGCGAGCAGTACGCCATGCCCGCGTCGACGACGTCCTCCTCCAGCGGCGGCAGCCCGAGCGCCCCGAGCCAGTGCTTGAGCCGGGACCCGCGGCCGGTCGCGTCCACCACCAGGTCCGCCGTCAGCGACGACGCGGCCCCGGTCGCCACGTCGCGGACGTGCACGCCGGTGATCCGGTCGCGGTCGCCCGCGAGGTCGAGCGCCTCGGTCCGCTGCCGCACCTCGACCCGTCCCGACGCCGCCACCTGCTCGCGCACCGCCCAGTCCATCAGCGGGCGGCTGCACATCACGCAGTACTGCTTCGACGGGAACCGGTGCTGCCAGCCGTGCGAGGTCAGCGTCACCAGGTCCCGCTGGAACATGATCCGGCGCGCGCCCAGCGAGAGCAGCCGGTCGATCGTCCCCGGCAGCAGCGCGTCCACGACCCGCGCGCCGCTCGACCACAGGATGTGCCCGTGGCGGGCCTGCGGCTGCCCCTTGCGCTGCCGGGGCCCCTCGGGCAGGACGTCGCGCTCCAGGACGGTGACGGCGTCGAAGCGGCGGGAGAGCGCCTGCGCGGCGAGCATCCCCGCCCAGCCGCCGCCCAGGACGATCGCGTGCTCGGATCTGGTCATGGTCACCCTCTCGTCGTCGCGCCGGAGCGACGCGGCGTCGTCCCTGTGATGATTGCCGTGCCTGCTTAAGCGGCCCTAGCGCCGGTTCCGGCCGAGGTCGTGGTCGATGAAGGCGAGCAGCTCGTCCACCGTGGCGGTCCCGATCCGCTCGGCGCCCATGCCGGACGGCTCGGCGGACGGCTCGCCGGACGGGCCGTCGGGCGCGCCGCCGGGGTGGTCGTCCAGCCTGGCGAGCATGGTCTGGAGGCGTCGCCGCAGTGCCGTCCGCACGCTCGCGTCGCCGTTCACGGCCGCGCCGGCGGGGGAGGCGGCGGCCATCGCGTTGTCCATGCGTTCGAGGCGGGCGAGCAGCGAGGCCGGCGTGCCGTCCTCGTCCGGGTCCGGTCCGCGCGGGGCCAGCTCAGCGTGCAGGTGCTCCGCGAGCGCGGCCGGGCTCGGGTGGTCGAAGACGGCGGTGGTGGGGACGGCCAGCCCGGTCGCGCCGGCGAGCCGTCCCGCGAGGTCGGTCGCGGTGATCGAGTCGAAGCCCATCTCCGAGAACCCGCGCCCGACCGGGACGAGCCCCGGATCGGCGTGGCCGAGGACCGCGGCGGCGTTCGTGCGGACCAGGTCGAGGAGCAGGGGGCCGCGCTGGTCGGCGGGCGCCGCCGCGAGGCGTTCGGCGAGGGACGCCGCGCCCCGGCCGGTGGCCGCGCTCGGCCGCGCCGGGGTCCGTCCGGCGAGGGCGCGCAGAGGAGCGGGCGGCGCGTCGGCTTCGGTGGCGTGCGCGGACGCGAGTGCCCGGACGTCCAGATTCGCGGCGACCAGTCGGGGGGTGGGGCTGTGGCGGGCCGTTTCCAGCAGGGCAAGCGCATGCTCGGCGGTCAGGGCTCCGACGCCCGGCGCGGCGGCGGACGGTCGCGCGGTGTCGTCCAGCCAAGGCCCCCAGGCGATCGACAGCGCCGTTCGCCCAGCCGCTCGGCGGTGGGCGGCCAACGCGTCGTTGAACGCCCCGGCCGCATCGCCGCCGGACGCGCCGAGCGTGGCGGCGGCCGAGGAGAACAGGACGAACGCCTCCAGCGGCAGATCGGCGGTCGCCGCGTGCAGGGCGTACGCCGTCGCGATCCCGTCCGTACGTGCCTCGTCCGGCGGGCACGCGGCGTGGACGACGGCGGCCAGCGGGCGCGCCTCGTCGATCGCGGCGACCACGTCGGCGACGGCGGACGCGTCGGTGAGGTCGCACGCGGCCACGCGGACGTCCGCGCCCAGCGCGGCGGCGCGCTCCGCCGCGTCCCGCACGCCCGGCGACTCGTGGCCGGGGCCGGGGCCGTCCGGGCGGAGCAGCAGCAGGTGCTTGGCCTGCCACGCGCGGACGGCGTGCTCGGCGACCAGCGCCCCGCGCGGACCGCCGTCGAGGGTGATCAGCACGGTCCCGTCCACGGCGAACGGACCGTCCGCCCTGGTGGCTCGCGGCCCGGGGGACGCCGCGCGCGCCAGGCGCGGCACGAGGACCTGCGCCGAACGGATCGCGACCTGCGGTTCGTGCTGCGCGAGCGCGGCCGCCACCGCCTCGCGGAGCCGCGCCTCCTCGGGGTCACTGTCGGTGTCGAGCAGAACGAACCGGTCCGGGCGCTCGGCCTGCGCGCTGTGCACCACGCCCCACACGGCGGCGTCGGCGAGGCTCGGAACGTCGCGTTCGCCGCCGTCCACGCTGACGGCTCCTCGGGTGACCACGACCAGGCGGGAGCCGGACAGGCGCGGCTCGGCCAGCCAGTCCCGCACCAGCGTGGAGGCGGCCTCCGCGGCCCGCCGCGCATCGCCACCGGAGGCGGTGGGCGCGCGGGTGAGAACGACCGGCGGGGCGGGAGCGCCGGGATCGGCGTCGAGCGCGGTGCGCAGCGCGTCCAGCTCGATGTAACGCCCTGAGGCGTCCTCGGTCGGCGCGGGCGCGGGCGTCCAGTCCAGCGTGAACAGGCCGTCCGTCCGCCCGCCGCCGGACGCACGGAGGTCTTCGACGGCGACCGGCTCCAGCGTCACCGAGCCGACCGTCAGCACGGGCGCTCCGGCCGGATCGGTCAGCGTCACGCGTACGCGCCGCGCGTTCCGGCCAGGGTCCCCGGTGGGGGACAGGCGTACTCGTACCGTGGTCGCCTCCGACGCCCACAGCGCCACGTCGTTCCATGCACTGGGCAGCCACACGTCGTCTTCGCGGGCCCGCGCGTCGTCCGGGACGTCCAGGAGCGCCGGGCGGAGCGCCGCTTCGAGCAGTACCGGATGGAGGCCGTAGCCGTCGTGACCACCGGCCTCGTCCGGCAGGGCCACCTCGGCGAGCAGGTCGGAGCCGTCGCGCCAGACCGCGCGGATCCCGTCGAACGCGGGGCCGTCCTCGTAGCCCGCCGCGACGGTGCGCTGACGGAGGCCGTCGAGGTCCACTGGTACGGCCTCGGGTGGTGGCCACGCGCGTGCCCGTTCGTCCTCGGGAGCCGGGGCGGCCGGGCCGGGTGCCGGCGGGGCGAGCGTGCCGTGCGCGTGGCACGTCCAGGCGGGATCATCGGCGGTACGGGGGCCGTGGTCGGGGCGGGAATGCACGCGTACGTCGCGGCGCCCGTCCTCCCGGGCCGCCCCCACGGTCACCTGCACGCGCGGGCCCCGGCCCCGCGCGGGCGGGACGAGCGGATCGCGGACCACCAGCTCTTCCACGCCGCCGCACCCCGCCTCGTCGGCCGCCAGCAGAGCCCACTCCACCAGCGCGGACGAGGGAACGATCGGCAGCCCCGCGACGGCATGCCCTGTCGTCCATGCCGCGCCGTTCGCGGAGATGGTGCCGTTCAGCGCCAGCCCGCCATCGGCCAGCTCGACCGCCGCCGCGAAGAACGGGTGACGGACGCGCCGCGCCCCGGCGGCGGACACGTCGCCCGCGCCGCCGCCCGCGTCGCCCAGCCAGTAGCGCTCGCGCTGGAACGCGTAGGTGGGGAGGCGGACGGTCTCCGGCGCGGAGCCGTCGGGGAAGCAGGCCGTCCAGTCGACGTCGGTCCCGGCGATGTGGAGCCGGGCCAGGGCCCGAACGAACGCGTGCGCGTCGGGCTGCCTGGAGCTGAGGGTGGCGACCGTGCGCGGGGCGGGGCCGTCCGGGCGGACGTGTTCCAGCGTGCGGCGCGCGGCGGCGGTGAGGGTCGAGTCGGGGCCGAGCTCCACGAACGTCCCGGCGTCCGGCGCGACGTGCGCGACGGCGGGGTGGAAGCGCACCGGCCGGCGGATGTGCTCGGCCCAGTACCCCGGGGTGGCGATGCGCTCGTCGGCGGGCTCGCCGGTGAGGTTGCTGATCAGCGGAACGGCCGGCGGCCGGTACGCGAGGTCGCGGACGGCCTCCTCGAACGGGGCGAGCACGGGCTCCATCAGCGGCGAGTGGAAGGCGTGGCTGACGGTCAGCGCCCGGGTCTTCCGCCCGCGCTCCGCCCACGCCTCCGCGATCTCCGCGACGGGTCCGGCGGCGCCCGAGATGACGGTGTTGCCGGGCGTGTTGAGGGCCGCGACGGCGACCCGCCCGCCGTGCGCGTCCAGGTCGGGGGACAGCTCCTCCGGGGTGGCGGCGACGGTCGCCATGGCCCCGCCCTCGGGCAGCCGCCCCATGAGCGTCGCGCGCGCGGCGACCAGCCGGCACGCGTCCGGCAGGTCGAACACCCCGGCGACGTGGGCGGCGGCGATCTCGCCGATCGAGTGGCCGATCACGACGTCGGGCCGCACCCCGGCCGCGCCGAGCAGCCGGGCCAGCGCGACGTGCAGCGCGAACAGCCCGGCCTGGGCGTAGGTGGTGTGGTCGAGCGCGCCGGACCGGTCGGCGAACACCACGTCCCGCACCGGACGGTCGAGATGCTCGTCCAGCAGCGCGCAGACCTCGTCGAACGCGGCGGCGAACGCCGGGAACCGCTCGTACAGCCCGGCGCCCATGCCGGGCCGCTGGCTGCCCTGCCCGCTGAACAGGAACACCGTCTTCCCGGCGGCCGTGCCCGTACCGGCCTCGCCCGCGCCCGCGACGACGACGCCGGGGTGGGCGTCGCCGCGCGCCAGCGCCTCCGTCGCGGCGAGCAGCTCGCCGCGGCCCTCGCCGACGACGACGGCGCGGTGCTCGAACGCCGTACGGGTCGTGGCGAGCGCCAGGCCGACCGCGGCCGGAGACTCCAGCGGCGCACCGGCCGCGTGGTCGGCCAGCGCGCTGGCCTGCGCCCGCAACGCGTCCTCGCCGCGCGCGGACAGCACCCACGGGAGCACCCCCTCCCCGGCGTCGGGCGGCGCGGTCGTCGCGGGCTCCGGGTCTTCCGGGGGCGGGGCCTCCTCCAGGACGAGGTGGGCGTTGGTGCCGGAGATGCCGAAGGACGACACCCCGGCGCGGCGGGGCCGTTCGCCGCGCGGCCACGGGACCGGCTCGGTGAGCAGGCTGACCGCGCCCGCCTCCCAGTCGACGTGCGGGTTCGGCTCGTCGATGTGCAGGGACGCGGGCAGCAGGTCGTTCCGCATCGCCATCACCATCTTGATGACCCCGGCGACGCCCGCCGCCGCCTGGGTGTGGCCGATGTTCGACTTGAGCGACCCGAGCCTCAGCGGCCGTTCCGCCGGTCGGTTCCGCCCGTACGCGGCGAGAACCGCCTGCGCCTCGATCGGGTCGCCGAGGGTCGTCCCAGTGCCGTGCGCCTCGACCGCGTCCACGTCGGCGGGCGACAGCCCGGCGGCGGCGAGCGCCCGCCGGATGACGCGCTGCTGGGACGGCCCGCTCGGCGCGGTCAGTCCGTTGCTGGCGCCGTCCTGGTTGACGGCAGAGCCGCGGATCACCGCGAGGACGCGGCGGCCGTTGCGGCGGGCGTCCGACAGCCGTTCCAGCAGCAGCAGGCCCGCGCCCTCGGCGAGACTCGTCCCGTCGGCCGAGGCCGCGAACGTCTTGCAGCGGCCGTCGGCGGCGAGGCCCCGCTGCCGCGTGAACTCCAGGAACAGCCCCGGCGTCGCCATCACCGTCACGCCGCCCGCCAGCGCGAGGGAGCATTCGCCCTGCCGCAGCGCGCGCGCCGCGAGGTGGGCGGCGACCAGCGACGACGAGCAGGCCGTGTCGACCGTGACGGCCGGCCCCTCCAGCCCGAACGTGTACGCCAGCCGCCCCGACACCGCGCTGCCCGCGTTGCCCGTGGCGAGATGCCCCTCCACCTCGGCGGCCGTCCCGCCCGCGAGGGCCAGGTAGTCCTGCGAGGCGATGCCCGCGAACACGCCGGTGTCGCTGCCGCGCAGGGCGTCGCGCCGCAGCCCGGCGTCCTCGAACGTCTCCCACGCCGTCTCCAGCAGCAGCCGCTGCTGCGGGTCCATGCCGAGCGCCTCGCGCGGGCTGATCCCGAAGAACGCGGCGTCGAAGCGCGGCGCGTCGTGCAGGAACCCGCCCTCCAGGGTGTGGCACGTCCCCGGGCCGCCGCCGGGCGCGCCGAGCTCCTCCAGCGGCCACCCGCGGTCCTCGGGGAACGCGGAGATCGCGTCCGCGCCGTCCGCGACGAGCCGCCACAGGTCGTCCGCGCCGCGCACGCCGCCGGGGAAGCGGCACGCCATCCCGACGATCGCGATCGGCTCCCGCGCCCGGTCCCGTTCCTCCCGCAGCTCCCGCCGGACCTGCCTCAGGTCGGCGGTGGCGTGCTTGAGGTAGCGGCGCAGCTTCTCGTCGTTCGTCATCGCCTCAGCCCCGCTCGGACGGGGCGTCGTGGTCGGCGGGACGGTCCCGTTCGAGCCCGGTCAGCTCGTTGTCGAGCAGGTCGAACAGCTCGTCGTCGGTGGCCGAGTCCAGGTCGTCGCGGCCCGGCCCGTCGTCCGGCGCGGCGCCGTCCGGGACGCCGGTGACGCTGCGCAGCAGCGCGTCCAGGCGGGCCGCGACCAGGCCGCGGCCCTCGTCGCCGTCCGGCAGCGCGGCCACGGCCGCCTCGATCCGGTCCAGCTCGGCGACCAGCGCCGCGGGCTCCGGTCCGGCGCCGTCCGGCGCGACCTCCTCGGCCAGCAGCCCGGCGACCGCGGCGGGCGTCGGATGGTCGAAGATCAGCGTTGCGGGGAGGCGGACGCCGGTCGCGGCGCTGAGCCGGTTGCGCAGCTCCACGCCGGTCAGCGAGTCGAAGCCGAGCTCGCGGAAGTTCTGCTCGGCGGGGATCGACCCCGCCGACTCGTGCGCGACGACCGTCGCCGCCTGCGTGCGCACCAGGTCGAGGAGCGCCTCGCGCCGGTCCGCGTCGTTCAGCGCGGCCAGCGTCCGCGCCAGCGACGGCCCGGCCGCCGTCCCGCCCGGCCGCGGCGCGGAGCCCCGCAGCAGGACCCGCAGGAACGCCGGGACCTCGCCCCCCGCGGTCCGGCGCCGCAGCCCCGGCAGGTCCAGGCGGACGGGCGCGAGCAGCGGCCGGCCGAGGCCGAGCGCCGCGTCGAACAGCGCGAGCGCCCGCTCCGCGCTCAGCCCGAGGTCGCCCGAACGGGCGTTCCGCGCGAGGTCGATGTCGGTGAACCGGCCCGACATCCCGGTGTCCATCTCCCAGAACCCCCACGCGAGGGACGTCGCGGGCAGCCCGCGCGCGTGCCGGTGCTGCGCGAGCGCGTCGAGGAACACGTTCGCCGCCGAGTAGCTGCCCTGCCCCGGGCCGCCCGCGAGACCGGTCACGGACGAGAACAGCACCAGCGCCGCGAGGTTCAGGTCGCGCGTCAGCTCGTGCAGGTGCCACGCCCCGTCCGCCTTGACGCGCAGGACGGCGTCGAGCTGGTCGGGCGTCGCCGTGCGGATCGTCGCGTCCTGGACGATCCCGGCGGTGTGCACGACCGCCGTCAGCGGATGCCGCGCCGGAACGGACGCCAGCAGGTCCGCGAGGGCCGCCCGGTCGCCGGTGTCGCACGCGGCGACGGCGACCTCCGCGCCGAGCGCGGCCAGCTCGGCGGCCAGCGCGTCCGCGCCGGGCGCCTCGGGCCCGCGGCGGCTCGCGAGCAGCAGCCGCCGGACGCCGCGTTCCGCGACGAGGTGCCGGGCCGTCGCCGCGCCGAGAGCCCCGGTGCCGCCGGTGATCAGCACGGTCCCGTCCGGGTCGAGGGGCGGCGGCAGCGACAGGACGGTCCGCCCGGCCGCGCCGTCCGCCCGGCGGAGCGCCTCGCGGACGTCGTACACGTCCCGGACGGTGACGGGCGCCGGAGGCAGCGTCCCGTCCTCGAACAGCGCCGCCAGATCGCTGAGCGTCGAACGGACGCGCTCCACCCGCCCGCCTTCCCCGCGGCCGGTGCCGTCGAGGTCGGCGGGGGTCGGATGGTAGGCGGGCAGGTAGGTGGAGACCGCGCCGGCGGCCTCCGCGTACGACCACCCGGCGGGCATCGGCACGACCAGGCGGTGGTCGGTGAGGGCCACCGGCCCGACGGCGTCGAACGCCCCCATCACCCGGTCGCCCACGGCGAACCCGTCCACGTCCGCGCCGACCCCGGCGACGACGCCCGCGCCGTCGCCCCGCACGTCGCGGACCTCGTCCGGCGCGCCCGGCCCGGCGGCCTGGGCGCTGACGGCGACGGCCCGCAACGCCACGCGCACCTCTCCGGGCGCGAGCGGCCGGCCGTGCTCGGGATCGGGAACGAGCGCGGCCTCGCCGTCCGCGCCGTACACCAGCCGCCACGCCTCCGCGCCCGGCGGCGGGGACAGCCGGTCGCCGGGATCGTAGCGGACGAGCCGCGGCGCGTACGCGCGGCCGTCGCGCAGCGCGATCTGCGGCTCACCCGAGGCGAGCGCGGCGGGCACCGCGTGCGACGAAGCGTCCGGCCCGTCGAGATCGAGCAGGACGATCCGTCCGGGGTTCTCGGACTGCGCGCTGCGCACCAGCCCCCACACCGCCGCCGCGGGCAGGTCGTCCACGCCGTCGCCGGGGCCGGTCGCGACCGCGCCGCGCGTGACGACGACCAGCCGGGACGTGCTCTCGTCGGACGGCCACTCCTGGAGCAGGGCCAGGACGCCCGAGGTGACGGCGTGGAGCCGTTCCTCCACGGGACCGGCGGCGCCGGAGGCGGTGCAGACGGCGAGCACGGTGTCCGGCAGCGGCGTGCCGTCAACGGCCGCGGTCAGGAGCGCGGAGAGGTCCGGATGCCGTTCCGTCCCCGGCAGCGCGCCGCCGAGCGCGTCGGCCAGCTCCGCGTCCGAGCCGGACGCCGCGTCGAGCGCGACGACCGCGAGCCGGGCCGGGGGTGTCCCTTCGGGGTCGGGCAGCGGCGTCCAGTCCATCGCGAACAGCGAGTTCCTGGCCGCCAGGCGGGCCTGCGCGGCGGCGTCGGCCTCGACCGAACGCAGCGTCAGGTCGTCCAGCGACAGGACCGGCGCTCCGGACGCGTCGGCGGCGGCGATCGTGAGCCGGTCCGCCGCCGTCGGCGTGACCCGCACCCGCAGCGACGCCGCGCCGGTGGCGTGGACCCGCAGGCCGCTCCAGGCGAACGGCAGCATGACCGCTCCGGAGTCGCCGTCCGCGCCGTCCGCGCCCAGCGTGCACGCCTGTAACGCGGCGTCGAGGAGAGCGGGATGGATGCCGTATCCGGCGGTTCCGGAGCCGTCCGCCCCGGCGCGGTCCGTTCCGTCCTCGGGCAGGACCACCTCCGCGTAGAGGTGCTCGTCGAGCCGCCACGCGGCGGTGAGCCCCTGGAAGGCCGTCCCGTACGCGGAGCCGCGGTCCGCGAGGTCGTCGTAGAGGTGTTCGGTGGCGAGCGGCTCCGCTCCGGGCGGCGGCCAGGCGCCGTCCAGCGGGTGGGGCTCGGACGGCGTGGCCGTGCTCGCCCCGGGTCCGTTCGCGAGGAGCCCGGTGGCGTGCCGCGTCCACGCGGACTCGCCGTCGTCGGCGTCCGCCGCCGGTCTGGAGTGGATGGCGACGGGACGCCCGCCGCCCTCGCCGTCGGCGGGACCGACCGCGACCTGGAGATCGACCGCGCCGTCCTCCGGGAGCCGCAGCGGCTCGTGCAGGACGAGCTCCGCGACGTGGTCGCAGCCCGTACGGGCCGCCGCGTGCACGGCGAGGTCGGCGAACGCGGCCCCCGGCAGCAGCACCGCCCCGAGCACCTTGTGATCGGCCAGCCACGGCTCCGTCCCCTGGGAGATCCGTCCGGTGAGCAGGTAGGCGTCCCCCTCCGCCTGCTCGATCGCGGCCCCCAGCAGCGGATGCCCCGCCGGAGCGAGCCCGAGCGTCCCCGGTTCGCCGCCGAGCCAGCCGCGCGTGTCCAGCCAGAACCGCTGGTGCTGGAACGGATAGGTGGGCAGGTCGGTCACGGCGGGCGGCGGGTCGGCGGGGTAGTGGCGCGTCCAGTCGGCGGGCGCGGCCGCGGTGTGGAGGCGGGCGAGCGCGTGGACGAGCTGCGTGTGGTCCGCGTGGTCGCGCCGGAGCGTGGAGGTCACGGTGGTACGGGTGTCGGCGTCCTCGGTGATCTGCTCCAGGGCGAGGGTCAGAATCGGATGCGGGCTGGCCTCGATGAAGTCGCGGTGCCCGTCGGCGAGGAGCGCGCGGACGGTGTCGGCGAACCGGACGGGACGGCGCAGGTTGGTCACCCAGTACTCGGTGTCCAGCTCGGCGGCGTCGATCCGCGCGGCCGTCACCGTCGAGTAGAACGGGATCCGCGCCTCGGAGGGACGGACCCCCGCGAGGATCTCGCGCAGTTCGCCGGTGATCCGGTCGATCTGCGGGCCGTGCGAGGCGTAGTCCACGTCGATCATCCGGGCCCGCTGCCCGTTCGCCTCGACCTTGGCGACGATGGCGGCGACCGCGTCCGGCCCCCCGGAGACCACCACGGACCTGGGCCCGTTCACCGCGGCGACCGTCACATCGCCTTCCAGCAGTTCCCCGGCCTCCTCGGGCGTGACGCCGAGGGAGGCCATGGCGCCGTGTCCGGCGAGGTTGCGGAGGGCTTTGCTGCGGAGGGCGGTGATGGTGGCGGCGTCGGTGAGGGTGAGTGCTCCGGCGATGCAGGCGGCGGCGATTTCGCCTTGGCTGTGTCCGATGACGGCTGCGGGTGTGATTCCGTGGTCGGTCCAGACGGCGGCGAGGGAGACCATGGTGGCCCAGAGGACGGGTTGGACGACGTCGACGCGGTTGAGGTCGGCGGCGCCTTCGGTTCCGCGGAGGACGTCGGTGAGGGACCAGTCGATGTGTGGGGCGAGGGCTTGTTCGCATTCGGCGATGCGTGCGGCGAAGACGGGTGAGGTGTCGAGCAGGTCGGCGCCCATGCCGGGCCACTGCGAACCCTGCCCCGGGAACACCAGCACGGGACCCGTCTGCGATACCGCTGCGGGCGTGCCCGGATTGACCAGGCACGGATGCGTCCGGCCGTCGGCCAAGGCTCTCAGGCCGTCCATGAGCTGGTCGCGGTCCTGGCCGATGACGACGGCGCGGTGTTCGAACGTCGAGCGTGTGGTGGCCAGCGACCATCCGACCTCGACCGGTGAGAAGTCCTCGCGTTGTTCGAGGTGTGTCACGAGTGCGGTGGCCTGGTCGCGCAGTGCGGGCGCGTTGCGCGCCGACAGCGCCCACGGCACCACCCCGTCCACGACTTCCGCACGTGGGGCCTGTCCCGCAGGGACCTCGTCCGGCTGCGGTGGCTCCTCCAGGATCAGATGGGCGTTCGTCCCTGAGATCCCGAACGACGACACGCCCGCGCGGCGGGGACGATCACCGCGCGGCCACTCGACCGGGTCGGTGAGGAGGCGGACGGCGCCCGCGTCCCAGTCCACGTGCGGGGACGGCTCGTCGATGTGCAGCGACGCGGGCAGCAGCCCGTTCCGCATCGCCATGAGCATCTTGATCACTCCGGCCACGCCCGCCGCGCCCTGCGAGTGCCCGATGTTCGACTTCACCGACCCCAGCCACAGCGGACGGTCGTCCGGACGGCCCTGCCCGTACGTCGCCAGCAGCGCCTTCGCCTCGATCGGGTCGCCCAGCGTCGTGCCCGTGCCGTGCGCCTCCACCGCGTCCACGTCCGAAGCCGCCAGTCCCGCGCGGACGAGCGCCTGCCGGATCACCCGCTCCTGCGACGGGCCGTTCGGCGCGGTCAGGCCGTTGCTCGCGCCGTCCTGGTTGACCGCCGAGCCCCGCACCACACCCAGGATCCGGCGGCCGTTGCGCTGGGCGTCCGACAGCGGCTCCAGCAGCACCAGCCCGACGCCCTCGCTCCAGCCCGTGCCGTCGGCCGCCGCCGCGAACGACTTGCACCGGCCGTCGGGCGCGTTCGCCCGCTGCCGCGAGAACTCGATGAACGCGTCGGGCGTCGCCAGCACCGACACCCCGCCCGCGAGCGCCATCGTGCACTCGCGCTGCCGGAGCGACTGGCAGGCGAGGTGGATCGCGGTCAGCGACGACGAGCACGCTGTGTCCACCGACACCGCCGGGCCCTCCAGGCCCCAGGTGTAGGCGATCCGGCCCGACGCGACGCTCCCCACGTTGCCGGTGCCCGCGTAGCCGCCGACGTCGCTGGTCGTCCGGCCGATGAGGGAGAGGTAGTCGTGCGAGCTGACCCCGGCGAACACCCCGGTCTCGCTGCCGTGCAGGGCGTCCCGGTCGAGTCCGGCCCGTTCGAACGTCTCCCACGCCGTCTCCAGCAGCAGCCGCTGCTGCGGGTCCATGGCGAGCGCCTCGCGCGGGCTGATGTCGAAGAACGCCGCGTCGAACCCGGGGACGTCGTCGCAGAAGCCGCCCTCCGTCACGTACGTCCTGCCGGGCTTGTCGGGGTCGGGATCGTAGATCCGCTCGACGTCCCACTCCCGGTCGGCGGGGAACGGGGTGATCGCGTCGCGGCCCTCCGCGACGAGCCGCCACAGGTCCTCGGGCGACCGGACGCCGCCGGGGTAGCGGCAGGCGAACCCCACGACCGCGATCGGCTCCGGCTCGGCGGACTCGGCCTCCCGCAGCCGCCGGCGGGCCTGCCGCAGCTCGGCGGTGACCCACTTGAGGTGGTCGAGCAGCTTTTCCTCGTTCGACATCTAGGGCGTACTCCTTGCCGCGACGCGAGGGATCGGGGTCATCGGGCTCACGACTTCACGACTTCCCGAACTCGTCGGCGATCAGGTCGAAGATGTCCTCCGCCGTCGCCGTCTCCAGTTCGTGGTCGGGCGCGGAACGCCCGTCCTCGTCTCCGGTGGCGTTCCACTTGGCCAGCAGCGACTCCAGCCGCTGGGTGACGCGCCGCCGCGCCGCCCCGTCCACCGCGGCCGGGTCGCAGGACGAGTCCCACTGGTCCAGGCCGGACAGGACGCGGCCCTCGTCCACCACGTCCACGTCGAGGAGCTGGACGCGGAGCAGGTCGGCGAGCGCCCCGCACGACGGATGGTCGAAGATCAGGGTGGGCGGCAGCGGGAGCCCGGTCGTCGCCGCGAGCCGGTTGCGGAGCTCGACGGCGCTCAGCGAGTCGAAGCCGAGCTCCTGGAACGGCTGCGCCGCCGGGATCGCGTCCGGGTCCGGATGGCCGAGGACGGCGGCGACGCGGGCCTGGACGTGCCGCAGCAGGATGTGCCGCTGCTCCTCGGGCGTACCGCCCGCGAGCTGCTCGCGGAGCGGATGCCCGCCGCCCTCGGCGGCCTCCTCGACGTCCGCCGCGGCGTCGGGCCGGGCGGGGGCCAGGTCCGCGATGAGCGGGCTGGGGCGCTGCGTGGTGAACGTGGTGGTGAACGCCTCCCATTCGATGTCCGCGACGGTGAGCGTCGTCTCGCCGCGCGTCAGCGCCCGGTGGAGCGCCTTGAGGGCCAGGTCGGGATCGAGCGGCGTCAGCCCGCGGCGGCCGATGTAGGTGATGACGTCGTCCACGGCCGCCATCCCCGCCTGCGCCCAGGGCCCCCAGGCGATGCTGGTCGCGGGGAGCCCGAGGGTGTGCCGGTGTTCGGCGAGGGCGTCGAGGTAGGCGTTGGCGGCGGCATAGGAGGCTTGCTGCCCGCTGCCCCAGGCCGCGGCCCCGGACGAGAACAGCACGAACGCCGTCAGGTCCAGATCCCGGGTCAGCTCGTGCAGATGATCGGCCGCCAGCGCCTTCGACCGCAGCACCTCACCGATATGGGGAACGTCGAGATCGGCGAACGCGGTGAGCTCCGGAATCCCCGCCACGTGGAACACCGCGTTCAGCGGCTGGTCGCCGGGAACGCCGTCGATGACGTTCTTCAGTGCGGAGCGGTCGGAGACGTCGCAGGCGGTGAGGGTGACGGCCGTTCCGAGCGTTTGGAGCTCTTCGGCGAGTTCGCGTGCGCCGGGTGCGTCGGGGCCGCGTCGGCTGGTGAGCAGGAGGTGGGGTGCGCCGTTGTGGGCGAGCCAGCGGGCCAGGTGTGCGCCCAGGCCGCCTGTTCCTCCGGTGATCAGCGTCGTCCCGGACGGCCGCCACGGCGCGTCGCCGTTGGTGCGGTCGGGCGCGCGGCGCATACGGCGCGCGAGGACGGCGGTCGCGCGGATCGCGGCCTGGTCCTCGGGCTGCCCGGGAGCGAGCAGTGCGGCAAGGCGCGCGGGAGTGTGCTGGTCGGGTGTGGTCGGCAGGTCGATGAGACCGCCCCAGCGGGCCGGGTATTCGAGCGCCGCGACGCGTCCGAGCCCCCACGTCTCGGCTTGGAGCGGGTTGAGGAGCGGATCGCTGGGCGACACCGACACCGCTCCCTGCGTCAGGCACCACAGGCGTGCCGGGACGCCGCTGTCATCGAGCGCCTGGACGAGTGCCGTCGTCGCCGCGAGCCCGGCCGGGACCGCCGGATGCGACGGGTGCGGCGTCTCGTCCAAGGCAAGCAGGCTGAGGACGCCTTCCAGCGGAGCGTCCGCGTCCGCATCGGCGAGCTGCCGCGCGAACTCCTCCCGCCCGGCCGTGGTCGCCTCCACGCGCAGAACGTTCGGAACGGCGCCGTGCGCGCGCAGCGCCTGAACGGCGAGGTCCGCAGCGGGCCCCTCAGCGTCGTTCGCGGGAACGAGCACGAGCCACGTGCCCGACAGCACCGGCGCGCTCTCCTCTGGGAGATGCGTCCAGGCGACCTGGTAGCGGTACGAGTCCAGTACGCCCTGCTCGCGGTGCCTGCGCCGCCACCCCGACAGGATCGGCAGAACCTCGCCCAGCGAGGTCTCCTGCTCGGCCGGTGAACCCAGCGTCCGCGCCAGCGACTCCACGTCCTCGCGTTCCACCGCGTCCCAGAACTCGGAGTCGAGCGCGTTCTCCGTCGCCGCCGCCTTCTCCGGCTCGTTCACCAGCCAGTACGACCGGCGCTGGAACGCGTAGTTGGGCAGGTCGGGAGCGGAGCCCGCGCGCCGGCTCGGGAAGTAGGGGGACCAGTCGATCGGCGCGACGTGCGTGTGCAACTGGGCGAGCGCGTCCGTCAGCGCGTGGACGTCCGAGTGGCGGTGGTTGAGCGTCGCCGCGACGACGGGCTCGGGCCGGCCGTCGCCCCCGACGCTCGCGAGAGTGTGCCGCGCCGCGGCGGCCAGCACGGGGTCCGGGCCGAGTTCGAGGTAGGCGGCCGGCGGCGTCGCGCTCCGGGCCAGATACTCGATCGCCGGCTGGAACCGTACCGGCTGCCGGACCTGCTGGATCCAGTAGTCGGGCCCCAGATCCTCCACCGGCCCGCCCGTGAGGTTGGTGATGAGCGGAATGGTCGCCGGACGGTGCGTCAGCCCCCGGACGACCTCGGCGAACGGTTCGAGGACCGGGTCCATGTGCGGCGAGTGGAACGCGTGGCTGACCTTGAGCCGCCGCGTCTTGCGCCCCTTCCCGGCCCAGACGTCCGCGACCTGCGCGACGAGATCCGGATCACCGGAGACCACCGTGCTGTCCGGGGTGTTGAGCGCGGCGATGGCGATGCGCCGCCCGTCGGGATCGACGGTTTGGAGATGGTCGGTGAGTTCGTCGTAGGTGGCCTGGATGGCGGTCATGGTGCCGTCGATGGGGATTTGGTCCATGAGGGTGGCGCGTGCGGTGAGCAGGCGGCAGGCGTCGGGCAGGTCGAGGATCCCGGCGATGTGGGCGGCGGCGATCTCTCCGATGGAGTGCCCGATGAGGGTGTGCGGGGTGTGGCCGTGCTGGTTGAGGAGCCGGGCCAGGGAGACCTGCAACGCGAACAGCGCGGTCTGGGTGTAGAGGGTGTGGTCCAGCAGGTCTGGTTGGTCGGGGTCGGGGTTGAAGACCACGTCGCGTAGCGGGTGGGGGAGGTGGGGGTCGAGCAGATCGCAGATCTGGTCGAAGGTGGTGGCGAACACCGGGAACCGCTCATACAGCCCCGCGCCCATCCCGGGCCGTTGGCTGCCCTGCCCACTGAACATCCACACCACCTCGCCACCCGTGGCACCGGCACTGGGTGAAGCGGTCAGGCACGGGTGGTCGTCTCCAGCGGCCAGCGCGTCCAGCCCTTGGAGGAACTCCTCAGAGTGGTGGCCGGTGATGACGGCGCGGTGCTCGAACACCGAACGCCTCGTAGCCAGCGACCACCCCACCTGAACCGGTGAGACGTCCCGGCCGGACAGGTGCGCCGACAACGCCGCCGCCTGATCCCGCAACCCCGCCACCGACCGCGCCGACACCACCCACGGCACCACCCCACCCGACCCGACCCGAGCCTCGGACTCGGACTCAGCCTCGGGGACGACCTCGGGGTCGGGTTCGGGGGCCTGCTCCACGATCAGATGCGCATTCGTCCCCGAAATACCGAACGACGACACACCCGCACGACGCGGACGCCCACCCCACACCCACTCCACCGGCTCACTCAGCAGCCGCACCGCACCCGCCTCCCAATCCACATGCGGAGACGGCTCGTCAATGTGCAGGGATTCGGGGAGAAGGCCATTTCGCATGGCCATCACCATTTTGATGACGCCGGCGACGCCTGCGGCCATCTGGGTGTGGCCGATGTTGGATTTGATCGAGCCGAGCCACAGGGGACGGTCTTCGGGGCGGTTGCGTCCGTAGGTGGCGAGGAGGGCTTGGGCTTCGATGGGGTCGCCGAGGGTCGTTCCGGTGCCGTGCGCTTCCACCGCGTCCACGTCGGCGGGCTCCAGGCGGGCGTTGGCGAGGGCTTGGCGGATGACGCGTTGCTGGGATGGGCCGTTGGGGGCGGTGAGGCCGTTGCTGGTGCCGTCCTGGTTGACGGCCGAGCCGCGGATCGTGGCGAGGATGTGGTGGCCGTTGCGTTCGGCGTCCGACAGCCGTTCCAGCAGGAGGAGACCGGCGCCTTCGCCCCAGCCGGTTCCGTCGGCAGCCGCCGCGAACGGCTTGCAGCGGCCGTTGGGCGCGAGGCCGCGCTGTCGGGAGAAGCCGATGAACGTGGCGGGCGTCGCCATGACGGTGACGCCGCCGGCGAGCGCGAGGTCGCATTCCCCCTGCGCGAGGGCCTGGCTTGCCAGGTGGATGGCCACCAGCGACGACGAGCACGCGGTGTCGACGGTCACCGCGGGGCCTTCCAGTCCGAATGTGTAGGACACCCGGCCCGACATGACGCTCGCCGTGTTCCCGACGAGGATGTAGCCCTCGATTTCCTTCGCGGGTGTTCCTGTGGCGCTGTAGCCCTGGAAGGTTCCGCCGGTGAAGACGCCGACGTTGCTTCCGCTGAGGGCTTCTTGGGTGAGTCCTGCGTTCTCGAACGTCTCCCATGCGGTTTCCAGGAGGAGACGTTGTTGCGGGTCCATGGCCAACGCTTCACGCGGGCTGATGCCGAAGAATCCGGCATCGAATTCCGGGGCGTCGTAGAGGAACGCGCCCTCGCGCACGTAACTGGTCCCGGGGTGCTCGGGGTCGGGGTGGTAGAGGTTGTCCAGGTCCCAGCTCCGGTTGGTGGGCATTTCCGCGATGGCGTCGCCGCCGGACGCCACCAGGTCCCAGAGCTGCTGCGGGTTCCGCACGCCGCCGGGGAAGCGGCACGCCATCGCCACGATCGCGATCGGCTCGTCCGTCGCGGCGGACGTGATCGACGCGGGGGACCGTACCGCGTCGGCCTGCTGCCCGGTCAGCTCCGCGCGCAGGTACTTGGCGATCGCGTTGGGCGTGGGGTGGTCGAAGACGAGTGTCCGGGGGAGCGAGAGGCCGGTGGCCGAGCCGAGGCGCTTGCTGAGCTCGACGGCCGTGAGGGAGTCGAAGCCCAGCTCCTGGAGCGGCTGCCCGGCCGAGACCGCGTCGGCCCCGGAGTGCCCGAGGATCGTCGCGGCGTGCTCCTGGACGTGCCGCACCAGCAGTTCGAGCTGCTGCTTCGGTGTGCTCCCGGCCAGTTCCCCGCGCAGCGGGTGGTCGTCCGTCCGCCCCTCTTCGGCTCCGGCACCGCCTCCGGTCCCGTCGCCCGTGTCGGACCGCGAGGGCGTGAGGTCCGCGACGAGTCGGCTGGGCCGCTGCGTCGTGAAGGTGGCGGTGAACGTCTCCCAGTGGATGTCGGCGACGGTCAGGGTGGTGTCGCCGTGGGAGACGGCCTGGTGGAGCGACTTGACGGCCAGGTCCGGGTCCATGATCGCCAGGCCGCGGCGGGCGAAGAACGCGATGACGTTCTCGTCCGCCGCCATCCCGGCCTCGCCCCACGGTCCCCAGGCGATGCTGGTCGCGGGGAGTCCGAGGGTGCGGCGGTGTTCGGCGAGGGCGTCGAGGTAGGTGTTGGCGGCGGCGTAGGAGGCTTGCTGTCCGCTGCCCCAGGCGGCGGCCCCGGAGGAGAACAGCACGAACGCCGTCAGGTCGAGGTGCCGGGTCAGCTCGTGCAGGTGTGCGGCGGCCTGTGCCTTGGGAAGCAGGACGTCGCTGATGTGGGGGATGTCGAGTTCGGCGAAGGGATGGAGCTCCGGAATCCCGGCCGCGTGGAAGACCGCGTTCAGCGGCTGGTCGTCGGGCACGCCGTCGATGACGCCCTTCAGCGCGGAACGGTCGGAGACGTCGCAGGCGGTGAGGGTGACGGCCGTTCCGAGCGCGCGTAGTTCGTCGGCGAGTTCGCGCGCGCCGGGTGCGTCGGGGCCGCGTCGGCTGGTGAGCAGGAGGTGCGGTGCGCCGTTGTGGGCGAGCCAGCGGGCCAGGTGCGCGCCCAGACCGCCCGTTCCTCCGGTGATCAACGTCGTCCCGGACGGCGTCCAGGGAGCCCGGCCGTTGCCGGAGGGCGCGGGGGCGCGCCGCATCCGGCGGGCGAGCGTCGCCGTGGCGCGGACCGCGACCTGGTCTTCGGGCTGGCCCGGTGTGAGCAGCGCGGCGAGGCGCGCGGGAGTGTGCTGGTCGGGTGTGGTCGGCAGGTCGATGAGACCGCCCCAGCGAGCGGGATACTCCAGCGCCGCGACGCGTCCGAGCCCCCACGTCTCGGCCTGAACGGGGTTGAGGAGCGGATCGCTGGGCGACACCGCCACGGCCCCTTGCGTGACGCACCACAGGCGGACCGCGATGCCGCTGGCGTCGAGCGCTTGGACGAGTGCCGTCGTCGCCGCGAGCCCGGCCGGGACCGCCGGATGCGACGGATGCGGCGTCTCGTCCAAGGCAAGCAGGCTGAGGATGCCGGACGGGGGCGTCTCCGCCTCCGTGAGGTGCTGGACGAGCTGGTCGCGTTCGACGGCGGCGCAGTCGACGGGCAGCACGGACGCCGTGGCGCCGTGGCTCCGGATCGCCTGCACCGCGGTGTCGATGGCGGAACGTTCGGCCGCGCCCGCGCCGCCGTCCGCGGGAACGAGCACCAGCCACGTGCCCGACAGCACCGGGGCGGCGATCTCCGGGAGATGCGTCCAGCCGATCTGGTAGCGCCAGGAGTCCAGCGTGCTCTGCTCGCGGTGCCGCCGCCGCCACTCCGACAGGATCGGCAGGGCGGGACGCAGATCGTCGAGGGCCGGGCCGTCGCCGTCCAGGTGCAGGGCGGCGCCGAGCGCGTCGACGTCGAGTTCCTCGATCGCGTTCCACAGCTCGGTCTCGGCCTGGTCGTGGCCGGGTCCGCCACCGCCGGACGGGACGGCCGGAGCCAGCCAGTAGGGCCGCCGCTGGAACGCGTACGTGGGCAGGTCCACCGCGCGCCGCCGCGCCGTCCGCCCGAACAGCGCCGTCCAGTCGACCTGGACGCCCGCCGTGAACGCCTCCGCCACCGACCGCGCCACCTGCGCCCGGTCGCCGCGGTCGCGGTGCAGCGTCGGCACCGCGTACGCGTCGGCGCCCGCCTCCTCGAAGATCTCCTCCAGGCCGACCGTCAGCACCGGGTGGGCGCTCGCCTCGATGAACACCCGGTGCCCGTCGTTCAGCAGGGCCTCCACGGTGTCGGCGAACCGCACTTGCTCGCGCAGGTTCGTCACCCAGTACGCGGCGTCCAGGCCGGTCGTGTCGATCCGCCCGGCGTGCACGGTCGAGTAGAACGGGATCTCCGCGTCGAACGGCCGGATACCGGCGAGGACCTCGTGCAGCTCGCCGGTGATCCGGTCGATCTGCGGGCCGTGCGAGGCGTAGTCCACGTCGATCATCCGCGCGCGCAGGCCGGCGGCCTTGGCCGCCGCCACGACGTCCGCGACGGCCTCGGGCGGCCCGGAGACCACCGTGGAGGACGGTCCGTTGAACGCCGCCACGACCACGTCGCCGCCCCGTTCCTCCAGCAGCCGTTCCGCCCCGTCGCGTCCGGTGCCGAGGGAGGCCATGGCGCCGTGTCCGGCGAGGTTGCGGAGGGCTTTGCTGCGGAGGGCGGTGATGGTGGCGGCGTCGGTGAGGGTGAGTGCTCCGGCGATGCAGGCGGCGGCGATTTCGCCTTGGCTGTGTCCGATGACGGCGGCGGGTGTGACGCCGTGGTCGGCCCAGACGGCGGCCAACGAAACCATGGTGGCCCAGAGGACGGGTTGGACGACGTCCACCCGGTTCAGGTCGGCCGCGCCCTCGGTTCCGCGCAGTACGTCGGTGAGGGACCAGTCGATGTGCGGGGCGAGGGCTTGTTCGCATTCGGCGATGCGTGCGGCGAACACGGGTGAGGTGTCGAGCAGCTTCGCGCCCATCCCGGGCCACTGCGAACCCTGCCCCGGGAACACTAGCACCGGACCCGGTTCGGAGGCCGCCGCGGGCGTGCCGGGGGCGACCAGCCCGGGGTGCTGCCCTGCGGTGGCGAGGGCGTCCAGCCCGGCCAGCAGCTCGTCGCGGTCCCGGCCGAGGACCACGGCGCGGTGCTCGAACGTCGAACGCCTCGTCGCCAGCGCCCAGCCGACGTCGCCGGGGGAGTCGTCGCCGCCCGCGAGGTGGGCGGACAGCGCGGCGGCCTGGTCGCGCAACGCGGTGGCGCTGCGGGCGGAGACCACCCACGGCACCGGGCCGCCGGACGCGTCGGGCTCGGCGTCCGCCGCGTCCGGGGCCTCGGAGGCGGTGTCGGGCACGGACGGCTCGGGCGCCTCCTCCAGGATCACGTGCGCGTTCGTCCCGGAGATCCCGAACGCCGACACCCCGGCCCGGCGCGGGCGTTCGCCCTCGGGCCACTCGACGGGTTCGGCGAGGAGGCGGACGGCGCCCGCGTCCCAGTCCACGTGCGGGGACGGCTCGTCGATGTGCAGGGATTCGGGGAGCAGGTCGTTCCGCATCGCCATCACCATCTTGATGACGCCGGCGACGCCCGCGGCCATCTGGGTGTGCCCGATGTTGGACTTGACCGACCCGAGCCACAGCGGCCGGTCCGCTGGCCGGTCCTGCCCGTACGTGGCGAGGAGGGCTTGGGCTTCGATGGGGTCGCCGAGGGTCGTCCCGGTGCCGTGCGCTTCCACCGCGTCGATGTCGGCGGGCTCTAGGCGGGCGTTGGCGAGTGCCTGGCGGATGACGCGTTGCTGGGACGGGCCGTTGGGGGCGGTGAGGCCGTTGCTGGTGCCGTCCTGGTTGACCGCCGAGCCGCGGATCGTCGCGAGCACGTGGTGGCCGTTGCGTTCGGCGTCCGACAGCCGTTCCAGCAGGAGCAGGCCGGCGCCCTCGCCCCAGTTGGTGCCGTCGGCCGCCGCGGCGAACGGCTTGGCGCGGCCGTTGGGGGCGAGGCCGCGCTGGCGGGAGAAGCCGACGAACGTCGCGGGCGTCGCCATGATGGCCACCCCGCCCGCGAGCGCGAGGCTGCACTCGCCCTGCGCGAGGGCCCGGCTCGCCAGGTGGATCGCGACGAGCGACGACGAGCACGCGGTGTCGACGGTGACGGCGGGGCCTTCCAACCCGAACGCGTACGAGACGCGCCCGGAGATGACGCTCGCGGTCGTTCCCGCCAGGGCGTAGCCCTCGCTCTCTTGGGCCGGGTTGCCTGTGGCGCTGTAGCCCTGGAAGGTTCCGCCGGTGAAGACGCCGACGTTGCTTCCGCTGAGGGCTTCTTGGGTGAGTCCGGCGTTCTCGAACGTCTCCCATGCGGTTTCCAGGAGCAGACGTTGCTGCGGGTCCATGGCCAACGCTTCACGCGGGCTGATGCCGAAGAATCCCGCGTCGAATTCCGGGGCGTCGTAGAGGAACGCGCCCTCGCGCACATAGCTGGTGCCGGGGTGCTCGGGATCGGGGTGGTAGAGGTTGTTCAGATCCCAGTAACGGTTGGTGGGCATACCGGCGATCGCGTCGCCGCCGGACGCCACCAGGTCCCAAAGCTGCTGCGGGTTCCGCACGCCGCCCGGATAGCGGCACGCCATCGCCACGATCGCGATCGGCTCGTCGCTCGCCGCCGCCGTGGCGGTACGGGCGGAGCCGCGGGACGCGGCGGCGGGCTGGTCGTCCGCCAGCTCGTTGTGCAGGTACTTGGCGAGCGCGTTGGGCGTCGGCTGGTCGAAGACCAGTGTCCGGGGCAGCGAGAGCCCCGTGCTCGTGCTGAGCTGCTTGCTCAGCTCGACGGCCGTCAGCGAGTCGAACCCCAGCTCCTGGAGCGGCTGCCCCGCCGGGACGGCGTCGATGCTCGTGTGGCCGAGGATCGACGCCGCGTGGGTCCGGACGTGCCGCAGCAGCAGGTCGAGCTGCTGCTTCGGCGCGCTCCCCGCCAGCTCCTGCCGCAGCGGGCTGGCGCCGGGCGAGGCGTCCCGCCGGTCGGCGTCGTCCGGGTCGGTCTCGGCGAGGTCGCTCAGGAACGGGCTCGGGCGCTGCGTCGTCAACGCCGCCGGGAACTTCCGCCAGTGGATGTCGGCGACCGTGAGGGACGTCTCGCCCTGCGTGATCGCCTCGTGGAGCGTCTTCACGGCCAGGCCCGGGTCCATGGGCGTCAGGCCGCGCCGGGCGAAGAACGCGATGACGTTCTCGTCGGCGGACATGCCGGCCTCGCCCCACGGCCCCCACGCCAGGCTGGTGGAGGGCAGCCCGAGACGGCGGCGGTGCTCGGCGAGCGCGTCGAGGTAGACGTTCGCGGCGGCGTAGGCGCCCTGCTGCCCGCTGCCCCAGGCCGCGGCGCCGGAGGAGAACAGCACGAACGCCGTCAGGTCGAGGTGCCGCGTCAGGTCGTGCAGATGCTCCGCCGCCTGCGACTTCGACCGCAGGACCTCGCCCGCGCGGGCCAGGTCCAGGTCGGTGATCGGGGTGAGGTCCGACATCCCGGCGGCGTGGAACACGGCGGTGAGCGGATGCTCGGCGGGCACGGCGTCGATGACGCCCTCCAGCGCGGAACGGTCGGACGCGTCGCAGGCGGCGAGGGTGACGCGGGTCCCGAGCGCTTCGAGCTCCTCGGCTAGCTCCGCGGCGCCGGGGGCGTCCGGGCCGCGGCGGCTGGTGAGCACGAGGTGCGGCGCGCCGTTCCGCGCGAGCCAGCGGGCCAGGTGCGTGCCGAGCCCGCCCGTGCCGCCGGTGACCAGCGTCGTCCCCGTCGGCCGCCAGGGCGCGTCGCCGCCGTCGCGGGCGGGGGCGCGGTCCATGCGGCGCGCGAACGACGCGGTCGTCCGGACGGCCACCTGGTCCTCGGGCCCGCCGGACGCCAGCAGCCCGGCGAGCCGGGCGGCGGTGTTCCGGTCGGGCGCGGCGGGCAGGTCGATCAGGCCGCCCCACCGGGACGGGTACTCCAGGGCGGCGACGCGTCCGAGCCCCCACACCTGCGCCTGCCGGGGGCTGGGCAGCGGATCGGTCGGCGACACCGCGACCGCGCCCTGCGTGAGGCACCACAGCGGCGCGCCCGCGCCGCCGTCCCCGTGCGCCTGGATCAGCGCGATCGTGGCGGCCAGCCCGGCCGGAACCGCGGGTTCGTCCGGATGCGGCGACGCGTCCAGCGCCAGCAGGCTGAGGACGCCCGCCGGCGGCGTCTCGGCGCTCGCCTCGGCGAGGTCCCGCACGAACCGTTCGCGCGTGGCGCCCGCGGTGTCGACGGGCAGCAGCCGCACGTCGCCGCCGTGGCCGCGGATCGCCTGCACGGCGGTGTGGACGGCGGGGCTCCCGCCGTCCTCGTCCGCGGGCGTGATGACGAGCCACGTGCCGGACAGCTCCGGGGCGCGCTCCTCCCGCAGGTGCTTCCACGTGATCTTGTAGCGCCAGGAGTCCAGGACGGTCCGCTCGCGGTGGCGGCGGCGCCACTGCGCCAGGATCGGCAGCGCCGGACGCAGGTCGTCCACGGCCGGGCTGCCGTCCTCCAGCCGCAGGGTGGCGCTCAGCGCGTCGACGTCCAGCTCCTCGATCGCGTGCCACAGCTCGCTCTCGGCCGGGTCGTGCCCGTCCCCGCCCACGCCGGAACGCGCGGCGGGCGCCAGCCAGAACCGCTCGCGCTGGAACGGGTAGGTGGGCAGGTCGGTGGTCGCGGGCGGCGGGGACGCGGGGAAGCAACGGGTCCAGTCGACGGGCGCGGCGAGGGTGTGGGCGTGCGCGAGCGCGCGGGTGAGCTGGGCGTGGTCCCCGTGGTCGCGCCGCAGCGTGGGCACGATCCCGGCCGCGGCGCCGGCGTCCTCGGCGATCTGCTCCAGCGCGAGGGTGAGGATCGGGTGCGGGCTGGCCTCGACGAACGTGCCGTGCCCGTCGTCCAGCAGCGCGTGGACGGCGTCGGCGAACCGGACGGGCCGGCACAGGTTGGCGACCCAGTAGTCCGCGTCCAGCCCGGACGTGTCGATGCGTGCCGCGGTCACCGTCGAGTAGAACGCCACGGACGAGGCGGCGGGCTCGATCCCGGCCAGCGCCTCGCGCAGCTCGTCCGCGACCTCTCCGACGTGCGGGCCGTGCGAGGCGTAGTCCACGTCGATCAGCCGGGCGCGCAGGCCGCGCGCCTTCACCCGGTCGACCACGCCGGCGACCTCGCCGGGCGGACCCGAGACCACGACCGAGGTCGGGCCGTTCACCGCCGCGATCGTCACGCCGCCGTCGAGGAGGCCCTCCGCCTCCTCCGGGCCGGTTTCGAGCGAGGCCATGGCGCCCGTGCCCGCGAGGCGGCGCAGGGCCCGGCTGCGCAGCGCCACGATCTTCGCGCCGTCCTCCAGGCCGAGCGCGCCCGCGACGCACGCGGCGCCGATCTCGCCCTGGCTGTGCCCGACGACCGCCGCGGGCGTGACGCCGTGCTCGGCCCACACGGCGGCCAGCGACACCATGACCGCCCACAGCACGGGCTGGACGACGTCCACCCGTTCGAGGTCGGCGGCGCCGTCGGCCCCGCGGAGCACGTCGGTGAGCGACCAGTCGACGTACGGCGCGAGCGCCCGCTCGCACTCGGCCATCCGCGCGGCGAACACGGGGGAGGAGTCGAGCAGCTCCGCGCCCATGCCGGGCCACTGCGAACCCTGGCCGGGGAACACCATCACCGGCCCGTTCTCGGAATGCGCCGCCGCCACTCCGGGATTGACCAGGTTGGGGTGCTGGGCGCCTGAGGCCAGCGCCTTCAGGCCGTCCATGAGCTGGTCGCGGTCTTGGCCGATGACGACGGCGCGGTGTTCGAACGTCGAGCGTGTGGTGGCCAGTGACCAGCCGACTTCGACCGGTGAGAAGTCGTCGCGTTCTTCGAGGTGCGTTACGAGTGCGGCGGCCTGGTCGCGCAGTGCGGGCGCGTTGCGCGCCGACAGCGCCCACGGCACCACCCCGCCCGCGACCTCTGCGTCCGGGGCCGGGGCCGGTGCCGCCGGAGCCGTGTCGTCCGGCTCCGGTGCCTCCTCCAGGATCACGTGGGCGTTCGTCCCGGAGATCCCGAACGACGACACGCCCGCGCGGCGGGGACGGTCGCCGTTCCGCCACTCGACCGGGTCGGTGAGGAGGCGGACGGCGCCCGCGTCCCAGTCCACGTGCGGGGACGGCTCGTCGATGTGCAGCGACGCGGGGAGGCGGCCGTTCCGCAGCGCCATCACCATCTTGATGACGCCCGCGACGCCCGCCGCCATCTGGGTGTGCCCGATGTTGGACTTGATCGACCCGAGCCACAGCGGCCGGTCGTCCGGGCGGCCTTGCCCGTACGTGGCGAGGAGCGCCTGCGCCTCGATCGGGTCGCCGAGCGCCGTGCCGGTGCCGTGCGCCTCCAGCGCGTCCACGTCGGCGGGGGAGAGGCGGGCGTCGGCGAGCGCGTCCGCGATCACGCGCTGCTGGGACGGGCCGTTCGGCGCGGTCAGGCCGTTGCTCGCGCCGTCCTGGTTCATCGCCGAGCCGCGGATCACGGCCAGCACCCGGTGGCCCGCGCGGCGCGCGTCCGACAGGCGCTCCAGCAGCACCATCCCGGCGCCCTCGGCCATGGTCATGCCGTCCGCGCCCGCCGCGAACGACTTGGACCGGCCGTCCGGGGCGAGCGCGCGGAGCTGGCTGAAGCCGACGAGCGGCGCGGACGACGACATCACGAACACCCCGCCCGCCAGCGCCATGTCGCACTCGCGCTGCCGCAGCGCGCGCGCCGCGAGGTGCAGGGTGACCAGCGACGAGGAGCACGCGGTGTCCACCGTCACCGCCGCGCCCTCCAGGCCGAGCGCGTACGCGACGCGGCCGGAGACGACGCTCGCGGTGTTGCCGATCGTGAAGTAGCCCGCCGACCCCTCGGGCACCTGCGACGGGTCGGAGTAGTAGTCGAGGCGGTCGCAGCCGATGAACGTGCTGGTCGGGCTCGACCGCAGCGACTCGGGGTCGATGCCCGCGTGCTCGACCGCCTCCCACGACGTCTCCAGCGCCAGCCGCTGCTGCGGCGCCATCGCGAGCGCCTCGCGGGGGCCGATGCCGAAGAACGAGGCGTCGAAGCCGGCGACGTCGTCGATGAAGCCGCCCTCGCGCACGTAGCTCGTGCCGGGGTGCTCGGGATCGGGGTCGAAGAGGCCGTCCAGGTCCCAGTTGCGGTCGGCGGGGAAGCCGCCGACGGCGTCCCGCCCCTCGGCGACCAGGTCCCACAGCTCCCCGGGCGACCGCACGCCGCCGGGGAAGCGGCAGGCCATCCCGACGATCGCGATCGGCTCGTGCCCGACCGACTCGACGTCCTTCAGCCGCCGCTGCGCCTGGCGCAGGTCCGTCGTCACGCGCTTGAGGTAGTCGAGGAGCTTCTCTTCGTTACTCGCCATTGCCGGCGCACCCATTTTCCTCCGCGGCACCCGGCGGCCCGCCCGGTGCCTCCGCTGACAGGACAGCTGCTCGACGCGAGCCCGGAAGGCGCGGACCCGCTATTTCCGGCAGTCGTAGAAGTGCTGAACGGAGATCCCGCCGGGCACGCTCGTCGTCTGCGTGTTGGTGTAGTGGTTCCTGCTGAGGATGACCATGTCGTCCTTGACGTTCACCTCGACGCCCTCGGCGTTGCCCGTCGGGCTGGGCTTCTTCACCAGTTTCACCGTTCCGGTGAAGATCAGATGGCCGTTCTTCCAGCCCTCCGAATAGGAGTTGCCCTGCACTCCCCAGTCGTCGTGGTATTCGCTCATGAACCGCCGGTCGACCGGGTTCCAGCCGAAGACCCTGCGGCCCACGACGAGGTCGGGCGTGACCATCGCCGAGTCGAGGTAGTGCCCGCCGATGTCGCGCTTGGTGGTGATGTAGTTCTCGATCTGCTCGCCGCCGCCGGGCGGGGTGTCCATGCACTTGTAGGAGCCCAGCATGAAGTCGAGGGCGCGCATCTGCGGCGGAGCGGGGAGTTTCGTGCCGTGGCCCGGCGGCCTGCCCGCGCCGTCCGCGCGCCCGCCGTCCGCCCGGCCGCCGCCGGTGTCGGCGACCGCCGCGACCGCGCCGGGCACCAGCACGACCAGCGCCGCTCCCGTGGCGGCGAGCGCCCACCGCGCCCGCGGCCCCCGTGCGGCCCGGCCCTGCTGCGCGGCGTTCCGGCGTGATTCCCGCATGTTCATCCTCGCTCCATTGATGGGACGTTCGTCGTCGGGCCGCCTCATCATCGGAACCGCCGCTAAAGCTGGACTAACGCCCGCCCCCGCGATAGGGGCCCCTAAACGCGGCGCGCCGCCCGTGCCGGACTTAGGGGTACGGGGCGGCGGCGGGGCCGCACTATTCTGGCGCCCGAACGGCAGACCGGATGCCGCCTGCCCGGTTTCAGGAAACCCCGAGACGTGAATTCCCCGACCGAAGGAGACGGTGGCCGTGCGGGTCCTGTTCGTGACGCTCCCGTGGAAGACGCACCTTTTCAATTTCGTCCCGCTGGCCTGGGCGTTGCAGACCGCGGGCCACGAGGTGCACGTGGCCGCCGAGCCCGACCTGACCGGCGCCGTCACCGGCGCGGGCCTCACCGCGGTCCCCGTCGGCTCGGGTGAGACCATGGGCGAACGGGTCCGGCGCGCCTGGCGGGAGGGCACCCTGCCGACGCCCGAGGAGGCGCCCCCGCCGGGCCACCCCGTCGAGCTGTACGACCTCGGCCCGGACCGGCAGCGGCTGAGCTGGCAGGAGCTCAACCGCATCCACGACACGCTCGTGGTGCCGAGGGCGTGGCTGTCCAACGACACGATGTTCGACGACCTGGTGGCGTACTGCCGGTCCTGGCGCCCCGACCTCGTCATCTGGAACGCGGTGACCTTCGCCGGCTCCGTCGCCGCCGCCGCGGTGGGCGCGGCGCACGCGCGGTTCCTGTTCTCCGTCGACCTCTACAGCCGGCTGCGCGACGACCAGCTCACCGTGATGGCGCGCCAGCCGCCGCAGGACCGCCGGGACGGGCTGAAGGACTGGTTCGCGCCGTGGGCCGCCAAGTACGGGGTCGAGTTCTCCGAGGAGCTGGTCAACGGGCACTTCTCGATCGACCAGATGCCCGCCTCGTTCCGCCTGGACTTCCCGCACCGGACCGTGTCGATGCGGCACGTCCCCTACAACGGGCCCGCCGTCATCCCCGCGTGGCTGGCCGAGCCGCCGCGCGCGCCGCGCGTCCTCATGACGTTCGGCGTCTCGGTGCGGTCGTGGCCCGAGCTCCAGGTCGTCCCCGTCGAACGGCTCCGGGAGTCCCTCGACTCGCTCGCCGACCTGGACGTCGAGGTCGTCGTGACCCTGCCCGACGACGTGCGCGCCAGCCTGGGCCGCGTCCCGGCGAACGTCCGCGTGGTCGACTTCGTGCCGCTGCACGCCGTCCTGCCCACCTGCTCGGCGGTGGTCCACCACGGGGGAGCGGGCTCGTTCAACGGCTCGCTGCTGAGCGGCGTCCCGCAGCTGCTGGTCAGCACGGCCCTGGACGCGCCGTTCAAGGACCACCACCTCCGCGCGGCCGGGGCCGGGCTGGCGATCACCCCCGACCGGTTCAGCGGCCCGCTGATCAGGGAGAACGTCGTCCGGATGCTGGAGGACCCCGCGTTCCGCGCGGGCGCCGAGACCCTGCGCCGGGAGATCATGTCCTACCCCGCGCCGAACGCCCTCGTGCCCGAGCTCGAAAGGCTCACCGCCGAGCACCGCGCGTGACGCGCCTGAGCGGCGGCCGGGAGGAACGCCCGGCCGCCGCCCGCGAGATCCGTCAGGTGGCGGCGGCGTCCGTACGGCCCGCGCCGCCGGCCACCCCGCCCGCGCCGCTCGGCACCCGGACGGCCGCGAGGTCGTCGGCGAGAGCCTGGATCACGCGCGGGCCGTGCTCGACGAGGTAGAAGTGCCCTCCGGGGAACACCCGCAGGTCGAACGGCCCGGAGCTGTGCTCCGCCCAGGCCAGCGCCTCGTCCCGCGTCACCTGCGGGTCGCGGTCGCCGACGAGCGCGACGACCGGGCACCTCAGCGCGGCGCCGGGAACGTGCCGGTAGGTCTCGACGGCGTGGTAGTCGCTGCGGACGGCGGGCAGGATCATGTCCCGCGCCTCGGGGTCGCCGAGCAGGCTGGAGTCGGTGCCGCTGAGCCGTTGCAGCTCGGCGACGAGCCCCGCGTCGTCCAGCTCGTGCACGTGCTCGGACCGGTGCCTGGACGGCGCGCGCCGCCCCGAGACGTACAGGCGCGCGAGCGGCTCCGCGCCGTCGCGCTCCAGCCGCCGCGCGACCTCGTAGGCGAGCACCGCGCCCATGCTGTGGCCGAAGAACGCCAGCGGCCGGTCGGCCAGCGCCCGCACGGCCGGGAGGACCGCGTCCGCCATCTCCGGGATGGTGCCGATGTTGGCCTCCGCGCGCCGGTCCTGGCGGCCCGGGTACTGCACGGCGGCGACGTCCACGGCCGGGGAGAGCCGTGCCGAGACGGGGAAGAAGAAGCTCGCCGAGCCGCCCGCGTGCGGGAAGCAGACGAGGCGCACGCCGCTGCTCGGCGCCTGGTGGTAGCGGCGGATCCACGCTTCGAAGTCGGTGGACGGCTGTGTCATGACGCCCTTTCGCTCTCGCACGGGTCGTTCCCTCGGCGGGCGGCCGGCGTGCTCGCCGGGCGTCCCGCCGGTGCGTACGGAGCCAGCTTTCATGACCGGGAACGCGTTGGGTACCCGTACCGATCCCCTACGCCCCCTCATCGGGGGACGCAGGGGACGGGGTGCCGGGAGGCCGCCGGACCTCCGCCGGACGGAGCAGGTTCGACGGGGTGAACGCGGTCGCGGGCGGCCTCCCGGCCGTTCTGGGGGGTCAGGCCGTTCCGATGTCCCTGCGCCGGGCGCCGACGAAGCCGACGAGCGTCAGCGCGAGGGAGATGGCCAGCAGCCAGACCACGGGCATCGCCGAGAAGTGCCCGCCGGGGATGCGCGGCGTGTGCGAGAACGGCGACAGGTCCATCAGCGCCTGGTCGAGCTGCATGGCCTCGCCGAGCTGCCCGAGCAGGAAGAACAGCGCGAGCGCGCCCCACACCGCCAGCGACAGCCGCGGCAGGATCCCGAACAGCGCGAGCGCCAGGCCCGTCAGCGTCCACACCGCCGGGAGCTGCACCATGGCGCCGCCGAGGATGCGCGGCACCTGGCCGCCGACGTCCCCGATGTCCGCGCCGTGCACGAGCCCGGTCGTGACGCCCGCCGCCGCGAGCGCGATGGCCGGGCCGACGGCGGCGAACAGCAGGTGGCTCGCCACCCACTTCGCCCGGCTCACCGAGGTCGCCAGCACGTACTCGCCGTGCATGGCCTGCTCCTCCGTGCGCAGCCGCAGCGCCGCCTGGATCCCGTACGCGGACGCGACCAGCCCGACCGTCCCCATGATGGAGGCGAGGTAGGCGTCGATCAGGCCCTGCTGGCCGCCGATCCGCAGGAACATGTCCTTCAGCGTGTCGTTGTCCTCCAGCAGGTCCCCGACGCCCTGGGCGATGCCGCCGAACACCCCGCCGAGCACGATCATCGCGACGGTCCAGGCGAGCAGCAGGCTCCGCTGGAGCCGCCACGCGAGGACGATCGGGTTGCTCAGCGACGGCGCGGCGTCCGCCGGGCCGAGCCGCGCCGCGAGGATCCCCGCGCCGATGTCCCGCCGCGACGAGAACGCGACCGCGACCGCGACCGCCGCGGCGGCGAACGCGACGGCGAGCGCGAGGACCCACCAGCGCTCGTCGGCGAACGGGCGCAGCAGCGTCACCCAGCCGAGCGGCGACAGCCACGACAGCCACGTCAGCCCGCTGTTGTCGCCGCCGGTGTCGGCGGCCAGCCGGACCAGGAAGAACCCGCCGAGCGCGGCGATCGAGAGGCCGCGCGCGCTGCCCGCGTACTCGGTGAGCTGCGCCGCGACGCCCGCGACCGCCGCGAACGTGCAGCCCGCGACCGCGAAGCCCAGCCCGATCGCGAACGCCCCTCCGGCCGGGAGCTGCCCGCTCAGGCCGAGCCCGAGCAGGACGGCGACCACCAGGTTGGCGGTCATCGTCACGATCAGCGCGGCGGCGAGCTGCGCGTGCCGCCCGACCACGGTCGCGGCGAGCAGCTCGCGGCGCCCGGCCTGCTCGTCGGTCCGGGTGTGCCGGATGACGGTCAGGATGCTGACCAGCCCGACGAAGATCGGGATGAAGCCCGCGCGCTGCGCGACGATGCCGCCGAGGCTCACGTCGGGCAGCGGGCCGTACAGCGCGAGGAACGTCGGGTTGCTGCCGCTCGTCGCGGCGTACTCGGCGCGGTCGGCGGCCGTCGGGTACAGGTCCTTGTAGGTGGACACGTAGCTGAGCGGGATCACCACCACCCAGAACATCCACGCCGGCAGCAGCCACCTGTCCCTGCGCAGGATCAGCCTGACCAGCTTGCCGGTGCCCGTGAGCGGGTTCACCGCGCCACCCCCGCGTTCGCGTCGCGCGCGCCGCCGTCCGCGGTGTAGTGCCGCAGGAACAGCTCCTCCAGCGTCGGCGGGCGGCTGACCAGGTTGCGCACGCCCGCGCCGGTGAGCTGCCGCAGCGCCTCGTCCAGGCTCGCCGTCTCGACCTCGCAGCTCACGTGCATGCCGTCGATCTTGAGCTCGTGCACGCCCGGGACGTCCGACAGGCCCGACGGCGGGCGCATCAGCTCGGCGTCGATCGAGGTGCGGGTCAGGTGCCGCATCTCGTCGAGGGTGCCGGTCTCCACGACCCGCCCCTCCCGGATGATGCTCACCCGGTCGCACAGCGCCTCGACCTCGGACAGGATGTGGCTCGACAGCAGGACGGTGCGGCCCGCGCTGCGCTCCTCGGCGATGGCCTCGCGGAACGTCTCCTCCATCAGCGGGTCGAGGCCGGAGGTCGGCTCGTCCAGCAGCAGCAGCTCGACGTCGGAGGCGAGCGCCGCCACCAGCGCGACCTTCTGCCGGTTGCCCTTGGAGTAGGTTCTGGTCTTCTTCCTCGGGTCGAGCTGGAACCGCTCCAGCAGCTCGGTCTTGCGCCGCTTGTCGACCCCGCCGCGCAGCCGGCCGAGCAGGTCGATCACCTCGCCGCCGGACAGGCCCGGCCACAGGTTGACGTCGCCCGGCACGTACGCCAGGCGGCGGTGCAGCGCGGTCGCGTCCTGCCACGGGTCGCCGTCCAGCAGCCCGACCTCGCCCGCGTCCGCGTGCAGGATGCCGAGCAGGACCCGGATGGTCGTCGACTTGCCCGCGCCGTTCGGCCCGAGGAAGCCGTGCACCTCCCCGGTGCGCACGGTGAGGTCGAGGCCGTCCAGCGCGCGCGTCGGACCGAACTTCTTGACCAGTCCCTTCGCCGAGATGGCGTCACTCATGGCGGTGCTCCTCTTGTGAATGGGTGAAGGGTGCGCGGTTGCCGGACCCGCTGCCGGGCTCGTCCAGCCCGTCCCGGAGCGAGGCGGCCGTCTCCGGGCTGATGACGCTGTGGGAGTGGACGTCGAGCAGGGCGAGGGCGACCCGCCGGTCACCCTCGTCGGAGAAGATGTCGGCGCCCACGGCGCGGGAGATGTGCTGGTGGAACGTCGGGATGCCGAGCGCCATGGCCACCATCAGCGCGGCCCGGCTCGCCGCGTCCGCGATGGGCGGGTCGGGCCGCTGCCGGTCGGCCTGGATCAGCGACCGCTCGGTCATCGCCACCAGGTCGTCGAAGATCCGCGCCGCCGCCTCGGACTCGTCGATCAGCGCCCGCGCCAGGTAGCGCTGGAACGGGTGCAGCGTGTGCCGGGCGTCCATGGTGGAGCGCGGATGGCTCAGCCGGTCGCCGTTCAGCGTCTGGTCGTAGAAGCCGTGCAGCGCCTCCAGCGCGTAGGCGTCGCACGCCTGGCGCAGCGCCTCCTTGGAACCGAAGTGGTGGCGGACCAGCCCCGGTGACACGCCGGCGGCGCGGGCTATCTCCCGGATCGTCGCCTGCGTGAACCCGTACTCCGTGAAGTACCGCAGCGCCGCGTCTCTGATCCGGGCGCGGGCGGTCAGATCCTCGAAGGCCGCTCCGCCACGTTCTGCCAGCATCAGGAAGCCCCCCGCTTGTATCGTCCGCTATCTCTCTGTGCAGAGTACAGTACACGTGTATAGGAAGCTATAGATGGCGTACAGCCGAAGCCCAGACGAACGTCCGCCCGGCGGGCGCGCGGCAGGCGCGGGCGGACGCGGGGGACGGGAAAAGGGGAAGGCGCCCGGTCGGGTGGAGGCGGCGCCCTGGGCGGGCGCGGCGTCCTGACCGGGGCGCCTTCGGGCGGTCAACGAACGTTCCGGCTCACCGGCCGGCGGCCGGGGCCGAACCGTTCGCGGCGGGGGACCGCCGGTAGAGGGCCTGCCAGAAGTAGGTCCAGGGCACGGGCCAGCCGTCCGCCGCGCAGGTCCACGCCCCGCCCGGCCGGTACGCCTCGATGAAGTCGGGGACGGCCGCCTCCACGGCGCGGCTGTCGTTGATGTCGACCACCTTGCGGAGCGGGCCGGTCTCCATCGCCATCCGCACGACCTCGTCGCCCTGCGTGTGGCCGTCCAGGCTCTTGTCCATGTACTTGCCGACGGGGTTCTTGACGTAGAACGCGTCGCAGCGGCGGTCGATCAGCGCGAGGTAGTCCCGCACGGTGCCCGGGGGCATCTCGGTGAACGAGTTGATGTTGACGGCCAGGTCGAAGCGCCGCGCGCCGAGCGCGTCCTCGACCTCGTCGACCGTCACGAACGTGATCCGGTCGAACCGCTCCTCGTCCAGCACGGCGCGCAGGTAGCTGCGGCACAGGCCGAGCGTGTTGGGCAGGTCCACGATGTGGTATGAGGCGAGGTCGTGGTTGGACAGCAGCGTGTGGCACGTCCGCCCGTAGCCGGCGCCGATCTCCAGGACGGCGGCGCCCGCGAGGTCGAGCCCCTTCTCGACGAACCCGAGCTCCAGCACGGCCTGGAGGTAGTCCAGGCACACGCTCTCGCCGTCGTAGGTCACGGTGAACGGGTCGCCCACCTCGCGGTTGCGGATCCTGCGCAGCTTCTCCCACTCGGCGGGACCGAGCCCCGACGCGAGGTTGTAGACCAGGGTCTTGAGGTACCTGATCCCGTTCGCCTCCGGATTCCAGATGGCGATCTTGTGGTTGGGCCGCCCGGATTTGAAATCCGCCAGCTCGGCGGACACATCCTCGGTGACCCAGTGTTTGCTAATGTTCTCCCACTGGAGGCTGCTCTGATAACTGGGCCCCATATCGTCCTCCCGGCCGTACCTTCGGATTCAGGGAAAGCGCCAAGAATATCTCGCGGCCGATGCTAATCGAGCGTCCGGCGCCGTCTCAAGGCCCTATTCCGCCGAGCCGCCCCGCTAGGGGCCCCTATGGCCCGCGGCCGTTCCCGGCGGGCGGCTCTTAGGGGTATGGGGCGGGAGGCGCAGGCATTACGTTGGTGCCGCAGCGCTCGGACGGTTATGCGGAAAGAATCGGTGCATTGCGGATTTTTTTGTGGCTTCCAGACCTGTCGGGCGCGGCCCGCCGCGCGGCGCGCGTCCCGGCCCCGGCGGGGACGGGCGGACGGTGAGCGGCGACGGGCTTCACGTCCGGCTCGGCCGCACGGGCCTGACCGTGAGCCGGCTCGGCCTCGGCACGGTCAACTTCGGCGGGCGCGTCGAGGAGCCCGAGGCGCACCGGCTGATGGACCACGCGCTCGCGTGCGGCCTCAACCTCCTCGACACCGCCGACTCCTACGGCTGGCGGGTCCGCAAGGGCCACACCGAGGAGGTCATCGGCCGCTGGCTGGCCCGCGGGAGCCGCCGCGACGACGTCGTCCTCGCCACCAAGGTCGGGTCGGCGATGGGGCCCGGCCCGAACGACCGCGGCCTGTCCGCCCGGCACATCGTCGCGGCGTGCGAGGACTCGCTGCGCAGGCTCGGGACCGACTGGATCGACCTCTACCAGCTCCACAGCATGGACCGGGCCGTCCCGTGGGACGAGATCTGGCGCGCCATGGAGCTGCTGGTGGCCCAGGGGAAGGTCCGGTACGTCGGCTCGTCCAACGCCGCCGGCTGGGACCTCGCCGCCGCGCAGGAGACCGCCGCCCGCCGCGGCGCGCCGGGCCTGGCCTCCGAGCAGTGCCTCTACAACCTGGTCGCGCGGCACCCCGAACTGGAGATCATCCCGGCGGCGCGCGCCTACGGCGTCGCGGTGCTGGTCTGGTCGCCGCTGCACGGCGGGCTGCTCGGCGGGACGCTGCGCAAGGCCGCCGAGGGCACGGCGGTCAAGTCCGCGCAGGGCCGCGCCGCCGCCGCGCTGGACGCCCACCGCGAGACGATCGCCGAGTACGAGCGGTTCTGCGGCGGCCTCGGCCGGGACCCGGCCGAGGTCGGCCTGGCGTGGGCGCTGCGCCGTCCGGGCGTGACCTCGCTGATCATCGGCCCGCGGACGCCCGAGCACGTGGACGGCGCGCTGCGGGCGCTGGACAAGCCCCTGTCCGATCCGGAGCTCGACCGGCTCGACGCGCTGTTCCCGCCCGTCGGGCGGGGCGGGCCCGCGCCGGACGCGTGGATCGGCTGACCGCCCCGACCCCGGAGGAGAGAGCACGCATGTCCATCAGCCAGTACACCGATCCGCGGATCGTCCCGCACGAGTCGGACGGCGAGCGCGAGGCGCGCGAGCGCCTGACCCGGCTGCTGTCCAAGACCCCGATCCAGCCCGAGTACCTGATCGACAACCTCGCGGTCTACCTGCGCCGGCACCAGCTCAGCGACCTGCTGTCGATGGACGCGCTCTACCGCGAGGTCCGCGATGTGCCCGGCGTGATCATGGAGTTCGGGGTGCTGCACGGACGGCACCTGGCGACGCTCACCGCGCTGCGCGGGATCTACGAGCCGTACAACTCGCTGCGCCGGATCATCGGGTTCGACACCTTCACCGGGTTCCCCGACATCAGCCCCGTGGACGAGGTCAGCCCGAGCGCGGTCGTCGGCAGGTTCGAGACGGCGCGGGACTACCCCGACCACCTGCGCGAGGTGCTCGCGGCGCACGAGCTGGGCGAGCCGATGGGCCACGTACAGCGCACGATCGTCATGCAGGGCGACGTGCGCGAGACCGTGCCGCGCTACCTGGAGGAGAACCCGCAGACGGTCATCGCGCTCGCCTACTTCGACATGGACCTGTACGCGCCGACCCGCGAGACGCTGGAGACGATCCGCCCGTACCTGACCAGGGGGAGCATCGTGGCGTTCGACGAGCTCGCCCACCCGAAGTGGCCGGGCGAGACCGCCGCGTTCCGCGAGGTCTGCGGCCAGGACGCCACCCTCCACCAGATCCCCGGCCGCGAGCCCCCCGTCATCTACCTGCGCTGGAACGAGTGACGCCCGCCGGCGGCGGGGCCCGCCGCGCCGCCGCCGGCCGGGCCGTTCCGCGCGTACGACCGCACGCACTGCTCGTAGGTGGGGAGCAGGCCGGCCGCGGCGGCCTCGGACAGGGACGGGGCCGCCGCGTCCTTCGCCGAGCGGATCGGCGGCCCGGTCAGCTTCCACGGCAGGGCCAGTTCGGGGTCCAGCGCGTCGATGTCGATCATCGTGCCGGGGACGTACTCGACCGAGCACAGGTAGCACATGCAGGTGTCGTCGGTCAGCGCCAGGAACGCGTGCCCGATCCCGTCGGGCAGGTAGACGGCGGTGCCCGACTCCGCGTCCTGGCGGGTGACCGCGTAGTGCCCGAACGTCGGCGAGCCCACGCGGATGTCCACGGCGATGTCGAGCGCCGCCCCGCGCACGCACGTGACGAGCTTCGCCTGGCCGGGGGGCAGCGCGGTGCCGTGGATCCCGCGCACGGTGTTCCGCTTGGAGACCGAGTAGTTGACCTGCCGGATCTCCAGCTCGTGGCCCGTGATGCTCCGCAGGGCCTCCGCTCTGATCGCCTCGTAGAAGAAGCCGCGCGAATCGGCGATGCGATCCGGCTCGATCCGGTAGGCGTCGGGGACCGTCATCTCTGAGATGCGCACAGCAAACCTCCTCGCGGAACGCCGCATTTTCCGCGGCCGCGCTTAAGCGGGACTAAAGGGCTCGGCGGCGCGCGGCCGCACTTCCAGGCCGTTCCGGGAAACGATGCCAAATGCGCGGCTGACCGTGCGTGCCTTAGGGGCCCCTGTTTTCCTCATAAACGGTTCGCGGCCTCGGGCGGGGCCGCGGTAAAGCGATACCGTGGTGAACCGGATATGCCCTACAAGGGAGGCAGTCGTGCCCGATCCCGCCCGTCCAGCGCCCTTATCCCCGCGGTCCCCGCAATCCCCGAGAGCCCCCCGGTCCCCGCGCGGACGCCCGGCGCCGGGCGGGCGCTCGCCGCGCGCCCGCGAGGACACCGCCGCGCAGCGCGCGATCGCCGCCTCGGCCGCCGCGCTCGAAGGCCGCGTGCCGGGCGTCGCCGACGTCCGCGCGTGGCTGGGCGAGCGCCGCGACGCCACCCGCTTCCAGGTCGGCCGCATCCCGTTCGCCGAGCTGGACGGCTGGTCGTTCGACGGCCCCACCGGCAACCTCGTCCACGACAGCGGGCGCTTCTTCACCGTGGAGGGGCTGCGCGTCAGCATCGACGGCGACCGGCCCCGGGAGTGGCAGCAGCCGATCATCTGCCAGTTCGAGGTCGGCATCCTCGGCATCCTCGCCAAGCGGTTCGACGGCGTCCTGCACTTCCTCATGCAGGCCAAGATGGAGCCCGGCAACCCCAACCTGCTCCAGCTCTCGCCGACCGTGCAGGCCACCCGCAGCAACTACACCAAGGTGCACCGCGGCGCGTCGGTCCGCTACCTGGAGTACTTCACCGGCGGGGAGCCGGTGCGGGTGCTCGCCGACGTGCTCCAGTCCGAGCACGGCGCCTGGTTCTGCCGCAAGCGCAACCGCAACATGATCGTCGAGGCGACCGGCGACGTCCCCGAGCACGAGGACTTCCGCTGGCTGACCCTCGGCCAGCTCGCCGGGCTCCTGCACCTCGACAACGTCGTCAACATGGACGCCCGGTCCGTGCTCGCGTGCATGCCCGGCGCGGGAGGCCCGACCGCCCTGCACTCCGACGTCGAGCTGCGGTCCTGGATCGCCGCCGAACGGTCCCGGCACATCGTCACGACCGAGCGCGTCCCCCGCGCCGGGCTCCGGCACTGGGTGCAGGGGCCCTGGTCGATCGACCACGAGGAGCGCCGGCACTTCCGCGTCGTGGCCGTGTCCGTGCGGGCGGGCAACCGCGAGGTGACGCGGTGGACCCAGCCGCTGTTCGAGCCGTACGAGACCGGGATCGTCGCCTACGTGACCCGCACGTTCGAGGGCGTGCCGCACGTCCTCGCGCGCGCCCGCGTCGAGGGCGGCTTCCTCGACACCGTCGAGCTCGGCCCGACCGTCCAGGCGATCCCCGGCAACTACGCGCACGTGCCCGCGAGCGAGCGGCCCGCGTTCATCGACCTGGTCTCCTCCGCCGACCCGTCCCGCGTCCGCTACTCGGCCGTCCACTCCGAGGAGGGCGGCCGGTTCCTCAACGCCGAGAGCCGCTACATGGTGGTCGAGGCCGACGAGGACGAGGCCCCGATCGACCCGCCCGCCGGGTTCCGCTGGGTCACCCCGGCCAGCTCACCATGCTCGCCCAGCACAGCCACTACCTCAACGTGCAGGCGCGCAGCCTGCTCGCCTGCCTGAACGCCATGGCCGCCCGCGGCGCGGCCGGCGCGGCGCTCACACCGTGATCCGCCGCGCCCGCTCCTCGATCTCGGACACCAGCCGCGCCTGATGCAGGCTCCCCGCCCCGAACGCCTCCGGCTCCCCGCCCGTACCCGCCTCCGCGCCCGCGCCCGGGCCCGGGGCGGTGCGGGCCGCGCCGGCCGCGCCGGCCACGCGCGCCGCGCCCACCGCGCGGGCGAAGAAGCGCCCCAGGTTGGCCCACTGGTGGTCCGGGGGCAGCGTGATCTCCTCGCGGTGGTCCTGCCGTTCGATCCGTACGACCGGCTGGTACGCCGCGGGAGGCGTGAACGCCCGGTCGGTCAGCAGCCGCCCCGCGCTCCCCGCGAACTCGCAGCCCGTCCGGTAGGAGTGCCGCACCCCGAACGCGAGGTCGGCCGTCACGCCGCCCGGCGTGCAGGCGAGGGCGCGCCCGCCGACCACCATGCCCGTCCCGCCGTCCACGTGCAGCACGGCGGCGGTCACGTCGAGGTCGTTGCCGAGGAAGCGCAGCGCCGCGCGGATCGGGTAGACGCCCTGTTCGAGCAGCGCCCCACCGCCGAGCCCCGGTAGGAACTGCGTGTTCCCGGCGGGCAGCGGCGGGATCGTGAACGCACTCGAAAAGTCCCGCAACTCACCGATCGCGCCGTCCGCGAGCATCTTCTGTACGGCGGTGTGCTGCGAGTGGTGCAGGAACGTCACGTTCTCCAGCAGGACGAGCCCGCGCGTGCGCGCGAGGCGCAGCAGCCGCGCGGTGGTCTCGTAGTCGGTGGTGAGGGGCTTCTCCGCGAGGACGTGCTTCCCCGCGTCCAGGGCCTTCTCCGTCCATTCGGCGTGCAGCGCCGAAGGCAGCGGGATGTAGACGGCGTCGATGTCGTCGGAGGCGAGCAGCGCGCCGTACCCTCCCACGGCGGCGCAGCCGACCTCGGCGGCGAAGCTCTCGGCCCTGCCGGGGTCCCTGCTCGCCACGGCGGTCACGTCCACGCCGTCCGCGGCGAGGAGGGCGGGGACCGTGCGGCGGCGCGCGATGTCCGCGCAGCCCAGCACGCCCAGGCGCAGCGGCGGTTCCTGTCGTTCCACGCCGTCAACGTGCCGCACGTCCTCGCGCCCGGCAAGCCCCTACCGCCGTGCGGGCCGTCGCGGAGGGACGTCCGCGCTGGCCGTGGGCGCGCGACGCCCGCGAGGGCTGGGGCCCCTACCCCGGCGGGCGCCCTCCGGCGGCGGCGCGAAAATGACGTGCGTTCACCAGGGCGCCGTCAGTACCGTTCCAGA(SEQ ID NO:39)
以该链霉菌的生物合成基因簇为例,对生物合成基因簇进行解释说明,该生物合成基因簇包括Ⅰ型线性聚酮合成酶基因模块、糖基合成相关基因模块和修饰基因模块。下面对该链霉菌的生物合成基因簇各模块进行详细描述,具体如下:
其中,Ⅰ型线性聚酮合成酶基因模块包括madA1,madA2,madA3,madA4,madA5,madA6,madA7,madA8,madA9,madA10和madA11共11个基因,具体如下:
根据本发明的实施例,madA1基因编码SEQ ID NO:1所示的氨基酸序列。具体如下:
MPSDRLRSPGTDAAADGGAGDGARRWFGGFLEAVDGFDAGFFGIAPREAVAMDPQQRLMLEVAWEALEDAGIPPETLAGSRTGVFVGAIWDEYAAILRRSGPGTGERHAMTGSHRSIIANRVSYVHGLRGPSMTIDTAQSSSLVAVHAACESIRNGECSLALAGGVNLILGEDGMEAAAAQFGGLSPDGRCHTFDARANGFVRGEGAATVVLKPLDAALIDGDPVYCVIRGSAVNHDGAADGLTVPSAAAQEDVLRLACERARVAPGDVQYVELHGTGTPVGDPVEAAAVGAALGASRAHGAPLRVGSAKTNVGHLEGAAGIVGLLKAVLSVAHRELPPSLNFTAPNPRIPLAELGLRVQDELTAWPCPDRPLIAGVSSFGMGGTNCHVVLTEPPATAAEVPATGGATRADEPGTAAGRPAAPVPSNETPPIVPWTVSGKSAAALRAQADRLREHVEDRPGLDPADVGHALGATRSAFAHRAVVLGGGADELLGRLGALGRGEDAPGVVHGTVNGGGVAVLFSGQGSQRPRMGRELYDAHPVFAAALDEAFACLDPMLDRPLREVVWSGDAAPLDRTAYTQPALFAVEVALYRLAESWGLRPGHLIGHSVGEIAAAHVAGALSLPDACTLVAARGRLMQAVTAPGAMAALRATPDEAAELVAGRGGLGVAAVNGPASVVISGDRDAVRDAAAAWRDRGGKATLLRVSHAFHSPHMDAMLDELRAVAAGLAFAAPAIPVVSNVTGLPATAERLASPDYWAEHARRAVRFMPGVRYLCGEGVTTFVELGPDGQLAAMARESFPGPSTEGPDGRPRPAARPVALAALRRDRPETATFATAMAEAYVRGGVDLDWDRIFAGRARTRVDLPTYAFQRERYWPGSLPRPEAVPAGQPGPGPEAGPVLEAEPPSPLERLAGLSEPERRRALLDLVRTNAAVVLGHASGTAIDADLTFKQLGFDSMAAGELSERLGAATGLPLPATLTFDHPTPLDVAAHLNRRATATPDGTSRQDAPAAGRSAADDPIAVVAMSCRYPGGATTPEALWDLVAAGRDVIGGFPDDRGWDLAALFDPDTDRAGTTYAREGGFLYDAAEFDAEFFGISPREAQAMDPQQRLLLETSWEAFERAGIRLGALKGSPTGVFVGATAQDYGPRLHEAPKGLDGHLLTGATPSVASGRVAFTFGLVGPAVTVDTACSSSLVAMHLAAQALRQGECDLALAGGATVLAAPGMFTGFSRQRGLAPDGRCKPFAAAADGTGWAEGVGLVLLERLSDALRNGRRVLAVIRGSAVNQDGASNGLTAPNGPSQERVIRQALASAGLSSSDVDAVEAHGTGTTLGDPIEAGALLATYGRGRSGGRPLWLGSVKSNIGHTQAAAGVAGVIKTVMAMRNGSLPASLHIDEPSPHVDWKSGAVRLLTEPMEWPTEDDRPRRAAVSSFGISGTNAHLILEAPPEPDSEPERPAPMDVGPLPWVLSARSEEALRGQAAALASQVRVSDDAPLDVGWSLAASRSNFERRAVVVGTGRDELLAGLEAVASGEVRSVAPSRVGAGPVLVFPGQGSQWAGMGAGLLDSSPVFAERVAECERALAPFVDWSLSDVLRGDGAELARVDVVQPVLWATMVSLAAVWASYGVVPSAVIGHSQGEIAAACVAGALSLEAGARIVALRSRALRRLAGGGAMASLGVGVDAAQEFLSADVAVAAVNGPSSTVVSGPPEQVTAAVARVEAAGYRARLIDVDYASHGPQVDEITDELHEVLAGIEPAPTTEVAFYSTVSGGRIDTTSMDTGYWVTNLRERVRFADAVRELAGDGHRVFIEASPHPVLTLGTQETLEEAGTVATVVPTLRRDHGGRRQLADALGQAFAAGVEVDWSRWFTGDRAPRLVELPTYAFQRRRYWLDGRSGPGGDPESLGLASAGHRLLGAAVEHAGADAWLLTGRLSTRTQPWLAQHSVAGTVVVPGAALVEWALRAADEAGCGGVDELTPRVPLVLAGPEPVRVQVAVDAAADDGTRDVRVYSRPDGDEEAEWTCHAEGVLSAGSPEPAPEPEPGGGGAWPPPGAEPLAADDVYERLAEAGYDYGQAFQGVRALWRQGADLLADLVLPEAAGGPGGFGIHPALLDAALHPSLLDAGPAGDGRVWQPFAWSGVSLWAAEATAVRVRLTPHHGEQGVQVAVAVTDALGGPVLDVASVVMRPADVRWLREGGHRAGRRASTAAGGTTRVRRAVAAGGGNGSSVDWADRLAGLAEDERRRLVLDLVRTHAATVLGHAEPDAVPTGTSFKEMGFESLTAVELRDRLAAATGLRLPAALVFRYPTPDGIADHLVERLVAKEQTPPNGRNGGRAGEGGNGGERGNGNGHRSVTGVGSVLGELDRLEDTLAGFVPENGDSGAVTARLERLLTKWKAAARNPAGEPSAAERLESASTDQVLDFIENELGVS(SEQ ID NO:1)
根据本发明的实施例,madA1基因位于该链霉菌的基因簇核苷酸序列第12437-19726个碱基处,长度为7290个碱基对,编码聚酮合成酶,长度为2429个氨基酸。
根据本发明的实施例,madA2基因编码SEQ ID NO:2所示的氨基酸序列,具体如下:
MANEEKLVDYLKRVSAELHDTRRRLSEVEDRANEPIAVVGMACRFPGEVRSPEDLWDLLVSGKDAIGDFPDDRGWDLDGLYHPDPAHPGTSSTRQGGFLYDIDRFDAGFFGISPREALAMDPQQRVLLETAWETFERAGIDPFSLKGTSTGVYAGVSSQDYLARAPRIPEGFEGYATTGSLTSVVSGRVAFTLGLQGPAVTVDTACSASLVAIHLACQALRQGECTLALGGGVCGLATPVMFTEFSRQRALSPDGRCKSFAADADGTGFSEGVGLVLLERLSDARRNGHRVLAVVRGSAVNQDGASNGLTAPNDVAQEQVIERALTAARLAPSEVDAVEAHGTGTRLGDPIEAEALIATYGRDRPAERPLWLGSVKSNIGHTHAAAGVAGLIKMVMALRNETLPANLHLSEPTPHVDWDTGAVRLLTAPVPWPRDDRPRRAGVSSFGISGTNAHLIVEEAPEPADPSEAAAPPEPDSGAVVPWVLSGRSEEALRGQAAALAARVGGSEGWAAADVGWSLVTTRSVFEHRAVVVGSERDELLDGLKALAAGEAHPNVVHPGSPAVSGANARGPVLVFPGQGSQWAGMGAELLDSSPVFAARIAECECALAPFVDWSLSDVLRGDGAELTRVDVVQPVLWATMVSLAAVWASYGVVPSAVIGHSQGEIAAACVAGALSLEDGARIVAVRGQELRRLSGRGAMASLGVGEERAGELVEGDVTVAAVNGPSSTVVSGPPEQVAAVVARAEAAGHRARTIDVDYASHGPQIELIGTELTEKLAGVRPVRADVGFYSTVTGGRIDTAGLDTVYWVTNLRLRVRFAETVRALLDDGHRVFIEASPHPVLTVGLEECFEQAGTAAVAVPTLRRGDGGAGQVAKAAGQAFAAGARVDWTGWFRGRPAAPRVVDLPTYAFQRERYWLEGLNGDGGDPAGLGLVPAGHPLLGAAVEVADGDIHLLTGRLPGNGNAGWLADHRVLGTVLVPGTALLEWALRAADEAGCGSVEELVLHEPMVLPGGSGGSGGLRVQVVVGAAGEDGRREVRVSSRPGQEDSAGERGGEARDWVCHASGVLAPETTVPAPGLDGIWPPPGAEAVDVSDVYERVAASGYEYGPAFQGLRALWRHGTDLLAEVALPDAAGKPDGFGVHPALLDAALHPALLEDGDDDAEGGGERRVWLPFAWNGVSLRADGATAVRVRLSPEGDGADERKVRVTVADAVGGPVLDAESVVMRPADIDRLKSAGRPGAGGAEGLFTVEWTALPEPAAEPSAEDDTDHVTLAPTASAEDGASYAGLDGLAAALDGGLPAPSVVLAEVGAVADALALVQRWLAEPRLSEARLAVVTRGAVACENPVPDGAGVWGLVRSAQAENPGRFVLLDVGAEADAGADAGSRAEQAEAVRRAVRAGEPQVAWRDGRLLVPRWVRAGEPVGIVPPPGERAWRLGLSGTATLENVSATACPEALEPLEPGQVRIDVHAAGVNFRDVLIALGMYPGDAAFGGSEGAGVVTEVGPDVAGLAAGDRVMGLFDGAFGSVAVADARMVAPVPDGWDLRQAAAAPVAFLTAWYGLVHLGALRRGDSVLVHAATGGVGMAAVQVARHVGAEVYATAGPAKHGVLDGMGIDEAHRASSRDLDFEDAVRRATGGRGVDVVLNSLSGPFTDASLRLLADGGRFVEMGKTDVRAPDPASDGMAEGVAYRAFDLLADAGPDRIAEMLSELAGLFSSGALRPLPVRAWPLGRARDALRHMSQARHTGKLVLDVPATPDPDGTVLITGGTGTIGGEVAVHLARDWRVRHLLLLSRSGPSAPGADALAERIRELGAEAAIKAVDVADPVALDEAIAAIDPAHPLTGIVHAAGTLDNALVTSLTPDALAGVWDVKAGAAAHLHAATAGTRLGLFAVFSSFASTLGTPGQANYAAANAYCDALAAARRAEGLPGLSVAWGLWATASGLTGGLTEADTARIHRYGIRPNGTEEGLALLDAAQRHGGPEVLALNFDARELAGRPPDTLPAPLRGLAATAAAKPGGVPGRPTAAGGERPTDWTARLAGLPPSERHRTLLNLVRTHAAAVLGHADAGLVRADASFKELGFDSLTGVELRNRLSAATGLRLHAALVFDHPEAGALADHLLELLAPDGAPPAGADATAPVLGEVARLESTLSAVAAGAVSADAVTARLEDLLANWKAAHAPDGGTGGDGDADGAEERLRVADAGQILDFIDNELGVRP(SEQ ID NO:2)
根据本发明的实施例,madA2基因位于该链霉菌的基因簇序列第19727-26380个碱基处,长度为6654个碱基对,编码聚酮合成酶,长度为2217个氨基酸。
根据本发明的实施例,madA3基因编码SEQ ID NO:3所示的氨基酸序列,具体如下:
VNEEKLVDYLRRVTTDLHHTRKRLSEVEDRQREPVAVVDMACRFPGGVRSPEDLWELVASGRDAMGAFPEDRGWDLDRLFHPDPDHPGTSYARAGGFLYDAAEFDPEFFGISPREALVLDPQQRLLLEVSWEAMERAGIDPSTLKGSRTGVYAGAALPGFGTPHIDPSAEGHLVTGNAPSVLSGRLSYTFGLEGPAVTIDTACSSSLVAIHLACHALRQGECTLALAGGVTVMAVPNVFTEFARQRGLAADGRCKPFAEAADGTAFAEGAGMLLLERLSDARRNGHPVAAVIRGSAVNQDGASNGLTAPNGPSQQRVIRQALEAARLAPSEVDAVEAHGTGTRLGDPIEAKALLSVYGADRDAGRPLRLGSCKSNLGHTQGAAGVAGVIKMVLALRHERLPATLHVDRPTPHVDWDSGAVRLLTEPAGWPRGGDRARRAGVSSFGISGTNAHLILEEPPAENAQDAEHAEHAEHADDAPAASAGRAANEARGPIPWVLSGRTGPALAEQAAALAEHVAARPELPAADVAWSLARTRTPFEHRAVVLGDDRDALRDGLAALADGRDHPDVTRAAGPAGPGGTAFMFTGQGSQRPGMGRDLHRTFAAFAEAFDEAAAHLDPLLGRSLRDLVFADGGTAEAADLDRTGVTQAALFAVETALFRLVESFGVVPAYLTGHSVGEIVAAHVAGVLSLPDACALVAARGGLMQALPEGGAMVAVEAPEDEVAPLLAGLEDRVSLAAVNGPASVVVSGDGTAVEEAVRPLRERGLRVRRLRVSHAFHSPLLGPMLDGFREVVSGLAFGPPEIPLISNLTGEPADAGLLGDPEYWVRHARRTVRFHDGLRTLRAEGVTRYLELGPDPVLTAMAQKSLADDGPAPAFAAAMRKGRDEPRTFLTALAATHVDGGDADFAAALPAGAGRVPLPTYRFQRRRYWRPVQDAAAEVSASGLRPTGHPLLQAAVEPADGGLLLAGRLSPRTHGWLADHAIADGAPLPATAFLEFALVAARRTGCDAVEDLSLETPLPLPDAEPVDLQVAVGGPDGAGRRPVTIYSRPVDDAHDDPDAPAAARLDDLGPANGWRRHAAGTLAGPDPDPDPAPAAPAQSAAWPPEGADEIDAAELYERLADQGYRYGPAFRGVRAAWRLGDELFAEVRLAPDQHADAAAYAVHPALLDSALHPVDELYEHDGSTVRLPFSFGGVRLHGTGRTRLRVRITPGDADVLSLALTDDDGAAVASIGSLGLRKVPAGRWNGAAPADDALHRLAWHPLPVPDGGPGTARPADDWAAIAAAGPDATGPAADGLSADGLGVVAPDAVHPDLAALKTALADGRPTPAVVLLPLPTGGPDAAEPVARAHDAARRGLELLQEWLEEERFAAARLIAVTRGAVAATGDDPPSDLAGAPLWGLVRAAQAEYPDRIVLLDLDGEDASRDVLRGAVAAAMENGESQLAVRSGDVLVPRLVDAAERDADGTAPALDPDGTVLVTGGTGALGRLVARHLVTAHGARHLLLTGRRGPAADGAAELEAELAGLGADVRVAACDAADPDALAALLASVPAAHPLTAVVHAAGIVDDGVITSLTPGRLAAVLAPKADAAWHLHRLTEGDDLAAFVLFSSAASVLGNGGQASYGAANMFLNALAEHRRAGGRPATSLAWGLWEASGGMAEDLAAADRARITRSGVAAMPPGRALALLDAALAAPHPTLVPVAFDRAALREQAAAGTLPPVLRGRVRVPARSPRGPAASSFAARLAGLPAEERDRAVASAVRDQIATVLAHPEPEAIDLNRAFQELGFDSLTALELRNRLNAATGAALPATVIFDHPTPEALIRAVRTRLDGAAGGRAVRAPVNGAPRAAAADDGEPIAIIGMGCRYPGGVTSPEDLWELVASGADAIGEFPADRGWDLPGLFDPDPDRLGHSYTREGGFLYDAARFDAGFFGISPREAVAMDPQQRLLLETVWEAFERAGLDPADLRGSRTGVIAGVMYDDYGSRFLGRTPRAVEGRLMTGSTPSVASGRVAYTFGLEGPTLTVDTACSSSLVAMHLAAQALRRDECTLAVAGGVTVMATPNTFVEFSRQRGLAPDGRCKSFAASADGVGWGEGAGMVVLERLSDARRNGHRVLAVLRGSAVNQDGASNGLTAPNGPSQERVIRAALAGAGLAPSDVDAVEAHGTGTTLGDPIEAHALLAAYGQDRPDGRPLWLGSVKSNIGHTQAAAGVAGVIKMVMAMREGSLPASLHIAEPSPHVDWTAGAVRLLTEPVEWRNGDRPRRAGVSSFGISGTNAHLILEQAPDPAETPAPEGPVILNGAASAEAETGTGDTGDTGVPDLALVPWTVSARDPDALRGQAAALAAHVSARPGLPVADVAWSLLKTRSSFPHKAVVVGERPAELVAGLEALAAGADAHPALTGPGAPAAPGRLVWLFSGQGSQRVGMGADLYVRFPAFAEAFDEVCGHFDERLGRPLGEVVFTGPADVLDHTTYAQAGLFALQVALARLLGTAGIRPDAVIGHSIGEVAAAHVAGVLDLPDACRLVAARAGLMGRLPSGGAMTAVEATPDELADDLERLGGGLVAVAALNTPGSTVISGPAEPVARIGARWAERGRRTRSLTVSHAFHSPLMDPVLDEFATAIDGLAYREPAIPLISNLTGLPADERITTPAYWAEHIRRPVRFHPAVAHVAADAAAFLELGPDPVLGTATRRTLDTLDAGGDPAGGPPVLATLTRKQPEIRSLANALAGLHVTGVPVDWAAAWFPAEPAPRAVDLPTYAFQRERFWLSADAGGEDAAGLGLTPAGHPLLGAAMDLAGGGGHVLTGRLTRRSGAWLAEHVVAGTTLLPGAGVVEWALRAADEVGCGGVDELTLRAPFVLPDTGGLRVQVVIGAPGEDGRRDLHVYSRLDQDGEAAEWLCNAEGVLTPETPADPGPEPGGPWPPPHAEPVDVEGFYERAASGGYDYGPSFQGLRALWRAGDDLFAEVTLPEAAGDVAGYGIHPALLDAALHPLFLTGLFGGDPSGNRIWLPFTWNDVSLRAGGATAVRVRLSPNEDGDTLRLTVTDPSAAPILTVGSLVMRPADADQLRAAARQGDTRGLFTVDWTPVPLPSAPADAPERDDAGEWAALGPDTLPPGLDAYRDLPTLLMALDAGQPPPSHVLTTIPPEPPPGRTETDVAADTLLLLQAWLDEPRLADARLVVVTRGAVASDVPDLAGAGVWGLVRSAQAENPGRFVLLDLETAPDGDVLELVRRALHTDEPQLAVRGGQALVPRLARADRPAELTAPATRAGWRLGAERPGTVESVTIVPAPEALEPLAPGQVRISVRSAGLNFRDTLIALGMYPGGGVFQGSEGAGVVTEVGPGVASGLSVGDRVMGLFEGAFGPVAVADERMVVRLPDGWDFRSGAAVPVVFLTAWYGLVELAGLKSGESVLVHAATGGVGSAAVQIARHLGAEVFATASPGKHAVLESMGIDAAHRASSRDLDFERTFRDATGGRGVDVVLNSLAGEFTDASLRLVAEGGRFLEMGKTDIREPEHIAAEYPGLSYRAFDLVTDAGPDHVGEMLARLTELFSADTLRPPHVEAWPLARAREALRVMSQAKHTGKLVLDIPPTLDPDGTVLITGGTGTLGGLVAEHVVGTWGVKRLVLASRRGLQAPGADELVERLTRELGAQVRVVAADVSDADAVTDLITATDADPAHPLTGLIHTAGLIDDAVIGSQTPQRLHHVWNAKATAAHNLHTATRHLPLAAFVIFSSAAGTIGSAGQANYAAANAYCDALAVRRRSLGLPGVSVAWGLWESSSGITGHLGDADLARMGRSGFRPLTDRHGLALLDAASRHGAAHLVAVDLRTSALAGQPAENVPPMLRTLAATASAGGAAARPTAGGVRRPADLAARLAAQSRTEQERTLLGLVRTQAATVLGHPNPDAIRPDNSFKELGFDSLTAVELRNRLAASTGLRLPASLVFDYPEAAVLADHLRRRLAPNGESAPAQDAVDPLLGELGRIESALAALALDDEARSRVARRLNGLLSAVNGGSNASAETGGFDDVESASDDEIFELIDREL(SEQ ID NO:3)
根据本发明的实施例,madA3基因位于该链霉菌的基因簇核苷酸序列第26377-38508个碱基处,长度为12132个碱基对,编码聚酮合成酶,长度为4043个氨基酸。
根据本发明的实施例,madA4基因编码SEQ ID NO:4所示的氨基酸序列,具体如下:
MSETEEKLRQYLKRVTVDLGQTRQRLREAEERGREPVAVVGMACRFPGGVSSPEELWELVDSGGDAIGAFPDDRGWNLEALYNPDPDQPGGSYVREGGFLYDADLFDAGFFGISPREALAMEPQQRLLLETSWELLERAGIEPGSLKGSATGVYAGAGLPGFGTPHIEESAEGHLVTGNALSVLSGRVAFTLGLEGPAVSVDTACSSSLVAIHLACQALRQGDCSMALAGGVTVMAVPAMFTEFSRQRGLAPDGRCKPFAEAADGTVFSEGVGLVLLERLSDALRNGRRILGVVRGSAVNQDGASNGLTAPNGPSQERVIRQALAYAGLSGVEVDAVEAHGTGTTLGDPIEAGALLATYGQGRPDDRPLWLGSVKSNIGHAQGAAGVAGVIKMLMAMRNGRLPATLRVDRPSSHVDWESGAVSLLTEPVEWPREDRPRRAGVSAFGMSGTNAHLILEEPPEPTEPDDVPERPAPMDVGPLPWVLSARSEEALRGQAAALASQVRVSDDAPLDVGWSLAASRSNFERRAVVVGTGRDELLAGLEAVASGEVRSVAPSGVGAGPVLVFPGQGSQWAGMGAELLDSSPVFAERIAECERALAPFVDWSLSDVLRGDGAELARVDVVQPVLWATMVSLAAVWASYGVVPSAVIGHSQGEIAAACVAGALSLEDGARIVALRSRALRRLAGGGAMASLGVGVDAAQEFLPEGVAVAAVNGPSSTVVSGPPEQVTAAVARTEAAGHRARLIDVDYASHGPQVDEITDELHEVLAGIEPVPTTEVAFYSTVRAAAIDTATLDTGYWVTNLRERVRFAETIDALLEDGHRVFIEASPHPVLTVGMEETFEQAGVDGTTVPTLRRDHGDRKQLADALGQAFSAGVEVDWTGWFPAEWAPRLVELPTYAFQRQRYWIDAQGGRGGDAASLGLASAGHRLLGAAIEQAEGDVRLLTGRLSATGDASWLADHQVAGTVVVPGAALLEWALRAADEAGCTGVEDLVLRSPLVLPASGSVRVQVVVGAAGEDGRRDVRVYSKPGQDGDFDDDAGWTCHAEGTLGPETGDAAEGLAGAWPPPGAQPVDVSGFYDFAAASGYEYGPAFQGLRAAWRNGADVLAEVELPEAAGEPGGFGVHPALLDAALHAGLLADGHGDDGDVWLPFAWSGVSLWATDATAVRVLLSPLPEGERGVRVVMADAVGAPVLTADAVALRPIDTERLRAEDGRGPDGLFVLDWVAVQESAPSAEANEPGALCGLGDFDDSVLPSVPDVDALLARLDTEDLPTPLTVLTVLPPEGEGGGLDLTGRVLALLQRWLNEPRLDDVRLALVTQGAVAVENPDVAGAAVWGLVRSAQSEEPGRFVLVDAGLPDPGAAFAQDAADAVRRAVEANEPQAAVRAGQLLAPRLARTDTRAPDPAYAGDREGGHGLPVLDPDGAVLVTGGTGKVGGLVAEHLVRAWGVTHLVLASRRGSDAPGASELESRLASLGAHVQVVAADLADASAVDELVGRIAAERPLTGVVHAAGVLDDAVIASQSPERLAGVWAAKAAAAYNLHVATADLPLGLFVLFSSAASCLGSPGQGNYAAANAFCDALAAHRRASGLPGVSVAWGLWADASGMTGHMTETDLARVSRSGNVPLTSDEALELFDAACGHGGAYLLALHLNASALASADLLPPPLRAFGARRDRGGRVRRAAAAGGRQGSELLSRLAGLDANGRADAALEVVSECAAGVLGHRSPDGVRAEANFKDLGFDSLMAVELRNRLSVISGVRLPATLVFDYPTPHALAGYLCARLDGPATSVAGPSAGPTTATADDPVAIVAMACRFPGGVVSAESLWEMVAAGGDAIGAFPQDRGWDLDGLFHPDPDHPGTSYAREGGFVHDAAEFDAEFFGVNPREALAMDPQQRLLLEVSWELLERAGLDPVSLKGSRTGVYAGVMYHDYGAGIAAGDSRLEGYALMSSSGSVVSGRVAYTLGFEGPAVTVDTACSSSLVAMHLAAQALRQGECDLALAGGVTVMATPDVFTGFSRQRGLAPDGRCKPFAAAADGTGWGEGVGLVLLERLSDARRRGHEVLAVLRGSAVNQDGASNGLTAPNGPSQERVIRQALASAGLSSSDVDAVEAHGTGTTLGDPIEAGALLATYGRDRSGDRPLWLGSVKSNIGHTQAAAGVAGVIKTVMAMRNGSLPASLHIDEPSPHVDWESGAVRLLTEPMEWPTEDDRPRRAGVSSFGASGTNAHLILEQAPEPKNEPEDESEPVPFDGVLPWVLSAHNADALRGQAAALAAHVSATDDTSVEVGWSLLKSRSLLDQRAVVLGEDRAQLLAGVEALAAGEPHPALVRTGPAATASGKSVFLFSGQGSQRVGMGAGLYERFPVFAGAFDEVCGLLDEHLERPLKDVVFTGLPDQPELLDHTTYAQTGLFALQVGLARLLASLGVVPDVVVGHSVGEIAAAHVAGVFGLEDACRLVAARATLMGGLPSGGAMAAIQATADELTDDLARFAGRVAIAALNTPDSTVVSGPADLVSEVAAAWKERGRKTKALAVSHAFHSPMMDPILDEFTDAISDLTYRRPNVPLISNLSGLPADDQITTPEYWATHIRRPVRFAPAIAHLAPQAGVFLELGPDPVLATAAQHTLHHAMDEHEDGGTDEAGRPVPPTLSTLNGKQDDPHAFAHAMARLHTSGAGVDWTDWFPAKPAPRVVALPTYAFQRERYWLAPGRGAGDVAAVGLRRVEHAVLPAAVPLADGGLLLTGRMSASGGWASDHRMLGTTLVPGAALAEWALRAADETGCGAVEELVLQAPLVLPPTGAAQIQVVADAPGSDGRRELRVYSRPEHDDGSVTEAPWVCHAMGVLTPEARDVPEGLDGAWPPPGAEPVDVTAFYERMAATGYEYGPSFRGLRAAWRDGPDILAEVQLPEAAGDRAGFGVHPALLDAVLHAGLLPDGEGEGEQDGRVWLPFAWNGISLWASEAATVRVRLAPVHDGDGEGRDERGVRVTVADSAGAAVLTVDSVVTRPADIDKLKAAGEPDADGLFALEWTPLPAPALSDPTLGDDDWAVLGTDAFGPADCARFPDLAALSAALDADRPSPTLVLADASYGESDDLDAVGNVLSLLQAWLDEPRLADARLVVVTRGAVASDVPDLAGAGVWGLVRSAQAENPGRFVLLDLEADTFLSTGLADAVARAAEAREPQLAVRSGELLIPRLVRASERQGIAAPTGERAWRLAVATPGTVENVAPEPFPKALEPLAAGQVRIEVRAGGINFRDVLVTLGMVPGQTGLGGEGAGVVTEVGPGVFSGLSVGDRVMGLFEGAFGPVTVADERMVSPIPEDWDFRSAAAVPIVFLTAWYGLVELAGLKSGESVLVHAATGGVGSAAVQIARHLGAEVFATASPGKHAVLEAMGIDAAHRASSRDLEFERTFRDATGGRGVDVVLNSLAGEFTDASLRLVAEGGRFLEMGKTDIREPEHVAAEHPAVGYQVFDLVTDAGPVRISEMFRALAELFAEQVLRPAPTQAWPLARAREALRVMSQAKHTGKLVLDIPPALDPDGTVLITGGTGTLGGLVAEHVVGTWGVKRLVLASRRGLQAPGADELVERLTRELGAEVRVVAADVSDADAVTDLITATDADPAHPLTGLVHTAGLIDDAVIGSQTPERLHHVWDAKATAAHNLHTATRHLPLAAFVIFSSSAAMLGSPGQGNYAAANAYCDALAARRQAAGLPAVSVGWGLWEATSAMTGSLTETDRARMSRSGVGALPNEHGLALLDAALRHGDASLLAANLDLRALAVQPAESLPGALRALVSGGGERVRRTAATGSARPDDWAGRLAGMSAAEQQRAMLNLVRGHVATVLGRTDADSVRGDASFKELGFDSLTGVELRNRLADATGLRLPPALVFDYPQAGVLAEHLRTRLVPEGADASAAGAGVEPVLDDLARLESTLNAAAALEDGDSDAVTARLESLLTHWRAVRSGRAANGHHGRNGHAEPSGNDGQNGHDGQDGEDGTAVDRLESASADQVLDFIDNELGVSWNGPDATTTR(SEQ ID NO:4)
根据本发明的实施例,madA4基因位于该链霉菌的基因簇核苷酸序列第38534-50596个碱基处,长度为12063个碱基对,编码聚酮合成酶,长度为4020个氨基酸。
根据本发明的实施例,madA5基因编码SEQ ID NO:5所示的氨基酸序列,具体如下:
MATEERLVEYLKRVTADLHDTRLRLRRMEERSEEPVAVVGMACRFPGGVDSPESLWDLVAAGADAIGDFPDDRGWDLDGLYHPDPDHPGTTYTRKGGFVDDADRFDAAFFGISPREAMATSPQQRLLLETSWEAFERAGIDPVSLRGTRTGVYAGTATTGAAAHGGSSRKEAEGYAGNAPSLLSGRVAYTFGLEGPAVTIETACSSALVAMHLAGQALRQGECTLALAGGVTVMATPEVFTGFSRQRGLSPDGRCKAFSAAADGTGWGEGVGVVVLERLSDARRNGHRVLAVIRGSAVNQDGASNGFTAPNGPSQQRVIRQALTQGRLSAAEVDAVEAHGTGTALGDPIEADALIAAYGRNRPPERPLWLGSVKSNIGHTQGAAGVAGVIKMIMAMRRGVLPASLHIDQPTPHVEWGDGAVRLLTEPVPWPDTDRPRRAGVSSFGISGTNAHLILEQAPEPVEAPEPEPESGAVVPWVLSGRSEEALRGQAAALAARMDGSEGWAPVDVGWSLATTRSVFEHRAVVVGSERDELLDGLKALATGEMHPGLVGPGLAAHPDASAPVFVFPGQGSQWAGMGAELLDSSPVFAERIAECERALAPFVDWSLSDVLRGDGAELARVDVVQPVLWATMVSLAAVWASYGVVPSAVIGHSQGEIAAACVAGALSLEDGARIVALRSRALRRLAGGGAMASLGVGVDAAQEFLPEGVAVAAVNGPSSTVVSGPPEQVTAAVARTEAAGYRARLIDVDYASHGPQVDEITDELHQVLAGIEPAPTTEVAFYSTVRAAAIDTATLDTGYWVTNLRERVRFADTIDALLEDGHRIFIEASPHPVLTVGMEETFEQAGTGAVAVPTLRRDHGGREQFVRVVGQAFVAGAHADWTASFAQDPAPRTVDLPTYAFQRQRFWADDDPTVHSGDPAGLGLVSVGHPLLGAAVDVADEDTRLLTGRLSASGGGTGWLNDHRIGDTVLVPGAALVEWALRAADDVGCGGVEELALQAPLVLPASGSVRVQVVVGAAGEDGRREVRVHSVPDLGGDGASEDVRWECHAVGVLSPEPTATPSNGQSDDLGGVWPPAGAQPIDLEGFYERAAAAGYGYGPAFQGVRAVWRDGSDLLAHVELPEDAADPAGFGVHPALLDAALHPALLGGSPETDSEAGVLLPFVWNGVALWAAEATSVRVRLSPRRDGERGVRVVMADAVGAPVLSVDEVVMRPADTERLRAAGRRGAGNLFALEWAPLPASTPDAAAGRGVGEGEWAVVGAEPVATDLVHHPDLDGLVAALDGGAPVPPVVLAAVPSSPDALDMVTDTLALVQAWLAEPRLADARLALVTRGAVAAGTEPDLGGAAVWGLVRSAQTENPDRFVLVDVDDLDDADADASANVDRDAACGVAEAVRRAVEADEPQVAVRGGRLLVPRLARNDDLGEPSDQSGDGSPPLDPEGTVLITGGTGMAGSLVAEHLVRAWGVRRLLLAGRRGPDAPGAAELAERVRELGGEVRFAAADVSDAGAVADLVAGVDPAHPLTGVVHAAGVLDDGVITSQSAERLAGVWAAKAAAAYNLHVATAALPLGLFVLFSSAAGVVGNAGQAGYAAANAFVDALAEQRRTAGLPGLSVAWGLWEQSSAMTGHLAEADHARLARSGFSALRTERALELLDAACRAERSPVVAAGIDVRGRSAAEVPALLRGLAGRPGRRTAASAEAGTGVDGARGRELAVRLARMDEPERLDAVLETVRGCVAVVLGYASPADVRTEAPFKELGLDSLTAVELRNRLSRTTGLRLPATLVFDYPTPRALADHLCAKLVGDAAAATATAPVATAMTDEPVAIVAMACRFPGGAASPEGLWDLVARGVDAMAEFPEDRGWDLDGLFHPDPDHPGTSYAREGAFLDDAAGFDAGFFGINPREALAMDPQQRLLLEVSWELLERAGLDPVSLKGSRTGVYAGVMYHDYASGLSDSGDAKLEGYSMLASAGSAICGRVAYTLGLEGPAVTVDTACSSSLVAMHLAAQALRQGECSLALAGGVTVMATPEVFTGFSRQRGLAPDGRCKPFAAAADGTGWGEGAGLVLLERLSDARRNGHEVLAVLRGSAVNQDGASNGLTAPNGPSQERVIRQALASAGLSSSDVDAVEAHGTGTTLGDPIEAGALLATYGRGRSGGRPLWLGSVKSNIGHTQAAAGVAGVIKTVMAMRNGSLPASLHIDEPSPHVDWESGAVRLLTEPMEWPTDDDRPRRAGVSSFGASGTNAHLILEQAPEPKNEPEDESAPALDGVVVPWVLSARSEEALRGQAAALAAHVSGTDDAPADVGWSLVTSRSVFDHRAVVTGTDRQELLNGLGVLANGHENPEVVRAVANSASVSGSTVFLFSGQGSQRVGMGAGLYERFPVFAGAFDEVCALLEPRLEHPLKDVVFTGIPERDGLLDHTTYAQAGLFALQVGLARLLASLGVVPDAVVGHSVGEIAAAHVAGVFGLEDACRLVAARATLMGGLPSGGAMAAIQATADELTDDLERFAGRVAVAALNTPDSTVVSGPADEVDQLVGVWTGRGRKSKRLAVSHAFHSPLMDPILDEFTDAISDLTYRRPDVPLISNLTGEPAEPEHIATPGYWADHIRRPVRFAPAIAHIAPQAATFLELGPDPVLATAAQHTIGDGSADRPQPLVTATLTQKRSDERAFAHALAQLHTHGADVDWKALFDLDPDSRRRPRNVALPTYAFQHRRFWLDAFNGRGGDPAGLGLASADHALLGAAVELADSGTWLLTGRISARSHRWLGEHVMAATPLAPGAMLVEWALHAADAVGCGGVEELMLQAPLVLPETGGVRVQVVVGPADPDGRRELQIYSRPDENDDDGRANAAWVCHAQGVLSPAEPAPGQLNGSHGPDVVGGAWPPPGARPVAIDGLYERAASAGYEYGPAFQGVRAVWQHGADVLAEVELPEAAGEKKGFGIHPILLDAALHPLLLSMDDAGSGAAWLPFAWNGVTLWASEAGTVRVRLTPQDGAEGDRTVRVAIGDTVGAPVLTADSVVMREASTEHLRAAAGTNRDGSDTDGLFTTRWTPLDTAATGDTGNGDDGDDGAWAFLGSEHMRPQQNGETRPEWLWHDDLAALVTAIEAGAPVPPVVFAEVSAGAGDGGNSGENAAARGLATARRTLALLQDWLAEPHLADARLVLVTRGAVTTGELDGPAAGGDAEAPIALSDAGVWGLVRTAQSENPDRFVLLDLDSGTADATGDGAVRAVWAAIAADEPQLALRSGRAFVPRLTRTGTDDGTLRLPQPERAWHLDAGGTGTLDGIAAVPCPDVLEPLAAGQVRIEVRTAGINFRDVLVSLGMAPGQTGLGSEGAGTVVEVGPGVSGLSVGDRVMGLFEGAFGPVTVADERMVVRLPDGWDFRSGAAVPVVFLTAWYGLVELAGLKSGESVLVHAATGGVGSAAVQIARHLGAEVFATASPGKHAVLEAMGIDAAHRASSRDLEFERTFRAATGGRGVDVVLNSLAGEFTDASLRLVAEGGRFLEMGKTDIREPEHIAAEHPAVGYQVFDLVTDAGPDRISEMFRALAELFAEKVLRPAPTQAWPLTRAREALRVMSQAKHTGKLVLDIPPALDPDGTVLITGGTGTLGGLVAEHLVRSEQVRSLVLASRSGPDAAGAAELKARLEELGAEVRIVAADVSDADAVTRLVAEADADTGRPLTGIVHTAGVIDDAVLASQTPERLARVWDAKASAAHHLHAATEHLRLGAFVVFSSAAGTFGSAGQANYAAANAFCDALAAHRHAKGLPAVSVAWGLWEATSGMTGDLADTDVARMSRAGIRPLSARHGLALLDAASRHGAANLLAADLDVRALNGGSAEAGQGAVPAFLRALAGGGTATRRTAANGQAPADFAARLAAMPLEERQRVLLDLVRSHAAATLGHSDVGAVQTDAPFKELGFDSLTAVELRNKLSASTGLRLPATFVFSHPTPSAIATELHDKLCPDAPEPSAPVFAELERLEAAMAGLTLDAETRGGLVKRMQTLMWRLGDGAAEAEQATADEDLDSASDEEIFAFIDREL(SEQ ID NO:5)
根据本发明的实施例,madA5基因簇位于该链霉菌的基因簇核苷酸序列第50613-62720个碱基处,长度为12108个碱基对,编码聚酮合成酶,长度为4035个氨基酸。
根据本发明的实施例,madA6基因编码SEQ ID NO:6所示的氨基酸序列,具体如下:
VSSAGKRGPMPGTEEKLRQYLKRVTVDLGQARQRLRDVEERNQEPIAVVAMACRYPGGVRGPEDLWDLVASRGDGITEFPDDRGWDLDGLYHPDPDHSGTSYVREGGFLDDTDRFDAAFFGISPREAMAMDPQQRMLLEVSWELFERAGIAPAALKGTPTGVYAGMSSQDYMTRTPRIPEGFEGYATTGSLTSVISGRVAYTFGLEGPAVTVDTACSSSLVAIHLACQALRQDECTLALAGGVTALTTPTAFAEFSRQRGLAPDGRCKAFAAAADGTGFSEGVGLVLLERLSDARRNGHHILAVLRGSAVNQDGASNGLTAPNDASQERVIRQALANARLAPDQVDAVEAHGTGTTLGDPIEAQALQATYGKDRPADRPLWLGSVKSNIGHTQAAAGVAGVIKMIMAMRNGVLPPSLHIDEPTSHVDWDSGAVRLLTEPVEWVWGGRPRRAGVSSFGISGTNAHLIVEQAPESEAEVEVLEGAAGPDTEVDESVPGGVVPWVVSARSVAGLRDQAAVLSAHLAGREVSPVRVGWSLAATRSVFEHRAVITAHHREEFLEGLDALAAGGQHPGLITSPAGGAGGGGVVWMFSGQGSQRPGMGAGLYARFPVFAAAFDEICGLLDPHLPHPLREVVFNPDPHQDQQSGLLDHTLYTQTALFALHISLARLLHHHDHTPHTLIGHSIGEIAAAHIAGILDLPDACHLITARATLMAQLPTGGTMTAIQAREDELAAEVEESGGTVAIAALNTPDSTVISGQADEVARIAAGWEERGRKTKALAVSHAFHSPLMDPMLDEFREAIEGLSFRAPSVPLISTLTGELAGDEIASPAYWVDQVRRAVRFAPAVARAAERGGAFLEIGPDPVLATAAQHTLSEPASGRERPEPLVTAVLDRHRPDDEAFVRALAEVHTHVRSSAIGWARLFPDDPAPRAVDLPTYAFQRQRYWMADVTPPESAAGGDASEERFWGAVERGDLEALSDTLRLPDGDGQRASLGEILPVLSGWRRERRERSAVDSWRYRVTWKRLTGLDGGTPSGPWLVVAAGGADGWADACGRALGEEAHRLDVDGAVDREGLAEWLRSRYADGAPPAGVLSLLALRDGPETGAGGAAGTLALLQAMVDTGIGAPLWCATRGAVSVGDSDLLESPAQARVWGLGRVAALEHPDLWGGLVDLPPGPDGLDAARLRAVLTGTAGEDQVALRANGAFGCRVVPSPAGDGEPGREWSPEGTVLVTGGLGEPVARIARWLAEGGAERVVVPDPGGTDAPGAGELADELSGLGAELIVARCGPDDPAAVKELAGRLSAAGARIGMIVHAPAPAEPGPLAELAPAALEAFPAEDVGAWRRVGELCGMEPDEPAVCFTSVAALWGSTGQGAHAAAGAHLDALAGPGRPEGVVSVAWGAWDVPADTDERGARGADGLRRQGLQPLDPRLALTALERVLGNGDRRIAVADVAWDRFAPLFTLARPSRLFEDVPEARRAIEAARGPSDDEAADRTAELRRELAAVTADERAARLLAMVRTDAAAVLRYEAADAVDPDLPFKDLGFDSIAAVGLRNRLRASTGLRLPATVGFDYPTPRALAGYLLGRVLPEESGTGHAAFGHVEELDAALAELPLEDPRRAGLMNRMRALLWKYEPDSAASEAGDGDGEEDLAAASADDMFALIDRELGT(SEQ ID NO:6)
根据本发明的实施例,madA6基因位于该链霉菌的基因簇核苷酸序列第62717-67726个碱基处,长度为5010个碱基对,编码聚酮合成酶,长度为1669个氨基酸。
根据本发明的实施例,madA7基因编码SEQ ID NO:7所示的氨基酸序列,具体如下:
MASNEEKLLDYLKRVTTDLRQAQRRLKDVESVGHEPIAIVGMACRFPGGVRSPGELWDLVAEGRDAVGGFPADRNWDLDGLFDPDPEHPGTSYVREGGFIDDVAGFDASFFGIGPREALAMAPQQRLALETSWEAVEHAGIDPESLRSSPTSTFIGCDRLDYYSDPSQVPEGSAGYFTIGNTASVVSGRVAYALGLEGAAVTVDTACSSSLVTLHLAARALRQRECDMALAGGVFVMSSSAPLVGFSQLRALAPDGRSKSFAAGADGMTMAEGAGMVLLERLSDARRAGHRVLAVIRGSAMNQDGASNGLTAPNGPSQQRVIADALADARLSPADVDALEAHGTGTALGDPIEAQALLATYGQGRPDDRPLWLGSIKSNIGHTQMAAGVAGVIKMVMALRNGRLPASLHIDEPSPHVDWDAGAVRLLTDPVEWRNGDRPRRAGVSSFGISGTNAHVILEEAPEPDDTAPAAPAPAPDAEVAGGVVPWALSARNAPALRDQAAALVTHLEERDDFSPVEVGWSLATTRSTFEHRAVVIGQDRDQLMDGLKALASGAQHPNLVNPGVAAAHSENGPVMVFPGQGSQWPGMGAELLDSSPVFAARMAECERALAPYVDWSLTDVLRGADGAADLERVDVVQPVLWAVMVSLAAVWAEHGVTPAAVVGHSQGEIGAACVAGALGLEDGAKIVALRSRALRRLAGTGAMASLETGPEEAEGLLDGGVTIAAVNGPTSVVVSGPPGEVAGVVDRVKARGLRARLIDVDYASHGPHVGEVADELREALAGIEPAASSVAFYSTVTAARIDTSGLDADYWVANLCRPVRFADAVHALLDDGHGTFVEASPHPILTLALEQIAEDAGAAAGIVPTLRRDHGDHAQLTRALAHAHTLAAPVDWTRCFPASPPPATTDLPTYPFQRERFWLAPAARSGVGGDGHDPAESELWHAIEELDVDALSATLRLEDGSPAVDDLRPALPILAQWRRRHRERTVLDSWRYKITWKHLREERAPELSGTWLVITPADEDGGSPAVHTAVQAIRGHGGDVRLLPVDTAGATRERFVRDLAEASAETPPAGVLSLLALDASPHPDEPAVPAGLAATIALIQAHGDGGAGAPLWCLTQGAVAVSPTDPLPSPRQAQVWGLGRVAALEYPSRWGGLIDLPAAPDRNTAARLAGLLASGGPEDQVAVRTTASFARRMDRAPARDGGDAPWRPTGTTLVTGGTGGLGTHLARWLARNGAPHLVLTSRRGPDAPGAAELAEELEALGTRVTLAACDASDRSALEGVIDAVPAEHPLTAVFHAAGMSDLTPITDLDLARAGEVLRSKSQAAEHLHDLTRHLDLTAFVLFSSGAAAWGSGQQGAYAAANVYLDALAEHRRRLGLPSTSLAWGPWGEAGMSADENVIAFFARRGLTPMDPGLAVKTLHEAITQGETSLTVADIHWRKFPAALTTQRPSPFLSDLAETDPDDADRRDASPGASPLRQELAGSAPKQQLDLLLRHVRTHAASILGHTSIDAVPAGQPLQELGFDSLTAVELSKQLSTSTGLSLPRTLVFDQPTPNALAKYLHNELADDQPAAASRGSARTATAAASDEPIAIVAMACRYPGGVRNPQQLWDLVASGGDAIAGMPTNRYWDLNNLYHPDPEHPGTSYVREGAFLYDAPEFDAGFFGISPREALAMDPQQRLLLETAWETFENAGLTQEALSGSNVGVFTGGTFQGYSATGNPAQESEGYALAGTTASVISGRVSYAFGLEGPAVTVDTACSSSLVAIHLASRALAQGECSLALAGGVAIMATPATFVGFSRQRGLAPNGRAKPFAAAADGTNWGEGAGLLLLERLSDAERNGHHVLATIRGSAVNQDGTSNGLTAPNGPSQQRVIRQALANARLEPADIDAVEAHGTGTTLGDPIEAQALLATYGQDRPADRPLWLGSVKSNIGHTQMAAGVAGVIKMVMAMRNDLLPESLHIDEPSPHVDWDAGAVRLLAEPVEWPEGERPRRAGVSAFGISGTNAHVILEEAPEPSVPDTASEAPDAADAEPDASGGPVPWVVSARSATALRDQAAALSAHLAGGDDSPGDVGWALATRRSTFEHRAVVLGRDRDELLAGLDALATAGQHPGLVAPGTPAAASEPGPVLVFPGQGSQWPGMGAKLLDTSPVFAARIAECEQALAPHIDWSLTDVLRGTEGAADLNRVDVVQPVLWATMVSLAAVWADHGVTPAAVIGHSQGEIAAACIAGALTLTDAATITALRSKALRNLAGHGAMASLGTGRDGAERLLEERGGDVVVAAFNGPSSTVVSGPPEAVADVVAAAKAAGLRARMIDVDYASHGPQIDRITGELHEVLAGIRPFDAEIPFYSTVHAGRIDTTGLDAAYWVTNLREQVRFADTVEALLNDGHRVFIEASAHPVLTVGLEEIFEEAGADAYAVPTLHRDRGDRAQVARSVAEAFTAGVQVDWTALFGRTARRRAVDLPTYAFQRRPYWLAPAVPSGGGGPGHDQAETELWNAIEELDVDALGAALHLDGDGPALDDLRPALPILSEWRRRHREQSTLDSWRYQIGWTHLPEIAAPVLSGTWLVLVPADGGAGAAERSAIDTAVQAIRSHGATASVLPVDCAAVERDQLVQHLTEAETPPSGILSLLALDETPHPSHPAVPAGLAATTALVQALDASGIAVRLWCVTQGAVAVSPSDPLLNPVQAETWGLGRVAALEYPARWGGLIDLPTTPDQHTPARLAALLTPGQPEDQVAVRATATLARRMRRAPAPSGNGRAPWTPSGTTLITGGTGGLGAHLARWLAHNGAPHLLLTSRRGPDAPGARELADELRALGTAVTLTACDVSDRSALKGVIDGVPDDQPLNAVFHAAGIPELHPFAELDIPHISDVLLPKAQAAAHLHELTRHLDLTAFVLFSSGAAAWGSGQQASYAAANTYLDALAEHRRTLGLPATSIAWGPWGEAGMAADENVIAFFARRGLAIMDPDLAVKSLHQAVSHGDTTLTVADIHWETFTATFTTQRPSRLVADLTPSRSDTGDGTGGGAGAEEGRTDDHPLRGELAGSTPKQQLELLVRHVQEHAATILGHSGADAVSAGQPLQELGFDSLTAVELSKRLGSATGLSLPRTLVFDHPTPNAIAKYLRAELTGQQADAVRSPASITSAATDEPIAIVAMACRFPGGVRNPQQLWDLVASGGDAIAEMPTNRSWDLDNLYHPDPEHPGTSYVREGAFLYDAPEFDAGFFGISPREALAMDPQQRLLLETAWETFENAGLTQEALSGSNVGVFTGGTFQGYSATGTPAKEIEGYILVGNTASVMSGRVSYTFGLEGPAVTVDTACSSSLVAIHLASQALAQGECDLALAGGVTVMATPATFIGFSRQRGLAPNGRCKPFAAAADGTGWGEGAGLLLLERLSDAERNGHHILATIRGSAVNQDGTSNGLTAPNGPSQQRVIRQALANARLEPADVDAVEAHGTGTTLGDPIEAQALLATYGRNRPEDRPLWLGSIKSNIGHTQMAAGVAGVIKMVMAMRNGLLPESLHIDEPSPHVDWEAGAVRLLSEPVEWVWGGRPRRAGVSSFGISGTNAHLIVEQAPEPDPEVVPEAESESEARVGSGGVVPWVVSARSVAGLRDQAAALSAHLSGRDVSPVQVGWSLATRRSVFEHRAVITGHHSEEFLQGLDALAAGDDHPCLTASPSAGATGGEVVWMFSGQGSQRPGMGAGLYERFPVFATTFDQICDLLDPHLPHPLRDVVFNPDPDQPDLLDHTLYTQTALFALQVSLARLLNQHGHTPHTLIGHSIGEIAAAHIAGILDLPDACRLLTARATLMDQIPIDGTMTAIQATYDELTDHLQTVDPDGRRIAIAALNTPDSTVVSGDPDLVAQVADVWAGKGRKTRRLKVSHAFHSPHMDPVLEPFAEVVRGLTHRPATIPLITNLTGGPVEDLGPDYWIQQVRQPVRFQPAIEYLARSATPPAAYLELGPDPVLAAAARHTLASVGGDGRPEPVVAATLNHRHSDVHALTDALAQLHTHVAPIDWSPYFPSRRAGSAPDLPNYAFQRRSYWLVNEPEKAAATENALDSEFWDAVEREDVESLARTLGSPAEQETSLGEVLPILSGWRRRHREQGVLDSYRYQVAWTHLPEESAPVLSGTWLVLVPANDAEGPAADLAVQALRAHGAVPNVLRVEATTAGREEFARQLADADADAPLEGVLSLLALDETPHPSHPAVPAGLAATTALVQALDDSGVPARLWCLTQGAVSVSPSDPLLNPLQAETWGLGRVAALEYPARWGGLIDLPTTPDQHTPARLAALLAPGQPEDQAAIRATAVLARRMRRAPDRTNGDAPWRPSGTTLITGGTGGLGAHLARWLAHNGAPHLLLTSRRGPDAPGARELAEELQTLGTAVTLTACDVSDRSALKNVIDGVPGDQPLNAVFHVAGIPELTAFADLDVPHIGEVLRSKALAADHLHELTRDLDLTAFVLFSSGAAAWGSGQQASYAAANAYLDALAEHRHTLGLPATSIAWGPWAQAGMAAVDDVITYIGRRGLTPLDPDLALKALHRALTRGETTLTVADIEWEAFTTTFTTQRPSPLIADLAPARPDAAADVEEAAEGGGHPLREQLAGGTPEEQRHILLRHVQARVAAVLGHPDPDAIPAAQPFQELGFDSLSAVELRNRLAATTGLPLPPTLIFDHPSCGALADLLRVQLLDVDVVDEGRVLSGLDQWDSSCDPAAVDGAARRRVTQRLESLLAKWNATGDEDGRSAPDHELETATAEDIFDLIADEFGKS(SEQ ID NO:7)
根据本发明的实施例,madA7基因位于该链霉菌的基因簇核苷酸序列第93752-108064个碱基处,长度为14313个碱基对,编码聚酮合成酶,长度为4770个氨基酸。
根据本发明的实施例,madA8基因编码SEQ ID NO:8所示的氨基酸序列,具体如下:
MSNEEKLLDHLKWVTAELRQARRRLREAESAEPEPIAVVGFACRYPGGVRSPEDLWRLVAEGRDAITPFPADREWDVERIYDPDPDKPGRTYVTEGGFCDDVPGFDAAFFDISPREALAMDPQQRLLLETAWETFERAGLDRDALHGSETGVFAGVSSHDYLSLIGRTTSDVGGYAGTGNVGSVASGRIAYTWGLEGPAVSVDTACSSSLTAIHLACQSLRQRECTMALAGGVSVLATPDAFIEFSRQRANAPDGRCKSFAAAADGTGWSEGVGLVLLEPLSDAQRNGRRILGVVRGSAVNQDGASNGLTAPNGPSQERVIRQALVRAGLAASDVDAVEAHGTGTTLGDPIEAKALLATYGQGRPDDRPLWLGSVKSNIGHSQGAAGVAGVIKMLMAMRNGLLPASLHIDEPSPHVDWDAGAVRLLTDPVEWPRGDRPRRAGVSSFGISGTNAHLILEEPPQPDEVPAGQAPRAEVVDGVVPWALSARNAPALRDQATALVTHLEQREDFSPVEVGWSLATTRSTFEHRAVVIGQDRDQLMDGLRALADGRTHPCLVNPGTPAAVSQTGPVLVFPGQGSQWPGMGADLLDTSPVFAARIAECEQALAPHIDWSLTDVLRGTEGAADLNRVDVVQPVLWATMVSLAAVWTDHGITPAAVIGHSQGEIAAACIAGALTLTDAATITALRSKALRNLAGHGAMASLGVTPEEAGELLEGDVTVAAVNGPRSVVVSGGPDAVAAIVAKVEANGQRARMIDVDYASHGPQIDRITGELREILAGVRPSEARIPFYSTVTAARIDAAELDTEYWVTNLRRPVRFADTVRALLADGHRDFIEASPHPILTLALEQITEDADTRTTVTSTLRRDHADHTQLVHALARLHTAAAPADWTRHYPADPPPAVTDLPTYPFQHQRFWLDTRGWLGGEPGTLGLAPAGHPLLGAAIEQAEGDAYLLTGRISQGTEPWLADHKVLGAVLLPGAAFADLAVHAAARTGCDHVAELVLHEPLRLPEDGAVDLQVAVGPADGEGGGRPVAIHSRPAADADDGESAWTRHATGLLANGPGASTATPSEPHPLDGAWPPPGAEPLATEHLYDDLADRGSAYGTAFQGLTAAWRLDEHLYAEVVLPEDGTDRAGADGSGTAGYGIHPALLDAALQACTLGADGADGDSGAVMLPFAWSGLRVHATGAASLRVRVTPTAADRLTIAAADASGAPVLSLDDLTLRSVEADAAAQARLAARNSLFAMDWTPLPDPEGTPPARLAVVALDAASGSDAELADALGGALPGTERHPDLSALLTAAVDGTPLPDTVLAVCTASGAAGPVEERLHAVTSGVLALLQEWPSDESTSRLVVVTRGAVATGPGDGVDDLPAAAVWGLVRSAQSENPGRIVLLDLDGPDASSHAVPAALASGEPQIALRDGRAYAPRLVRYDPGDRLSPPPGAEAWRLVYGADGEAALVPDPEHGRPLAPGEVRVALRAVAVSAQAAGPGAPDEVRDVRGDGAGVVAGVGADVDGFAVGDRVMGAFDAVGPVALTDHRLVVPMPAGWSYAEAAGAVSTYLPAYHPTPADLDGTGRGEGGRVERVRSTLSDLAALFEDGTLPPAPVTVRDVYDVREALRRADGAAGRTVLSLPPPLDPDGTVLITGGTGALGAATARHLVAERGVRRLLLASRRGPEAPGADALAAELAALGAEVAVAACDTGDRAALADLLASVPARHPLTAVVHTAGIVQDATIRTATPDQLDAVLRVKADGAWHLHELTRDLNLAALVLFSSVTGLAGGPGQGSYSAANVFLDALAQHRHARGLPATSLAWGFWEMDTGMSGRFTDIDLARNARSGDLGLSAERALALFDAALGLGRPLLAPVRLDLPGLRRRTAGGEVPAFLRVLLRGSAPRPGGTAAGPSLARTLAALNDADRREALLDLVRTQAATVVAHESAGSIPAEQNFRELGFDSLTGVELRNRLSAATGVRLPATLIFDHPTPAAVAGLLAEEVAPDGAGPEPAALVAELDRIEAAVAALPDGDEGRGLVAARLDALLRSVTGVPDGAAPDDGPGRDDLDSATDDELFDLLDNELTGLERDRPADHDAPSERG(SEQ ID NO:8)
根据本发明的实施例,madA8基因位于该链霉菌的基因簇核苷酸序列第87421-93696个碱基处,长度为6276个碱基对,编码聚酮合成酶,长度为2091个氨基酸。
根据本发明的实施例,madA9基因编码SEQ ID NO:9所示的氨基酸序列,具体如下:
MTNDEKLRRYLKHATADLRQVRRELREERDRAREPIAIVGMACRFPGGVRGADDLWRLVADGADAISAFPEDRGWPLEELGAPGGGPGTCHTLEGGFLHDAPRFDAAFFGISPREALGMDPQQRLLLETAWETFEDAGLRRDALRGSDTGVFAGIASQDYLALAGGTAAEVEGHLATGNAGSAVSGRLAYTFGLEGPAVTVDTACSSSLVAAHLAARALRQGECSLALAGGVTVMATPGLFLEFTRQRGLAADGRCKTFAASADGTSLAEGAGLLLLERLSDARRNGRRVLAVIRGSAVNQDGASNGLTAPSGPSQQRVIRRALAAAGLSPADVDAVEAHGTGTTLGDPIEAQAVLAAYGRNRPAERPLRLGSLKSNIGHTQAAAGVAGVIKMVMAMRNDLLPASLHIDEPNPHVDWEAGAVSLLTEPVPWPRGERPRRAGVSSFGISGTNAHLVLEEAPPPEDPEPATTAPPDAGEGVLPWVLSARGEDALRAQASALADHAAGAPLESPAAVGLALATTRTAFEHRAVVVGEGRGELLAATEALARGDAHPGVVVAGAGEAGTGTAAGKTVFLFSGQGSQRPGMGAGLYERFPAFAAAFDEVCALLDEHLDRPVRDVVFADRSGALDHTTYAQAGLFALHVALARLLGAAGVRPDVVIGHSIGEIAAAHVAGVFDLPDACRLVAARATLMGRLPEGGAMATVAATPEELSPDLDAHGGRVAVAALNTPGNTVISGAAGPVAEIAEAWAERGRKTRALTVSHAFHSPLMEPVLAPFEEAVRDLAYRPPAVPLISNLTGEPADERIATPGYWAEHIRRPVRFHPAVAHVAPDAGTFVELGPDSTLTAAARRTLEHVRPDGPAPRTVATLSSRQPDAHAFVRALARLHIAGTDVDWTACFPDGSAPETVRLPTYAFQRERYWLGDAGGGAGDVSAAGARRVRHPFFAAAVELADGGLALNGTISANGAAWTTGHAVAGLPIVPSSALVEWALLAADEAGCGGVEELVVRDPLVPPARGRGPRVQVTVGAAREDGRRDVRVHSRPDHGPRTADDPAWTCHAHGTLAPPAPGPAAPAPEDERARAWPPPEAVPVDLDGLRQRTVAAGYEDGPAFDGIRAVWRDGSDLLAEVALPDEAGGHDGYGLHPVLLEAALRPALLDVPDDARAREDDVWLPSAWNDVALWASEATTVRVRLSPTGDPGRNARRVRVTLTDPAGAPVLTVGSVTLEPVAVEDLRASGGGRTDGLFTLDWTPAPAPTEDASGRYIELDALRTALDADPGAPAPPVVLTRAPTASGGDARRAAEAASTLVRDWLAEPRLSGSRLVVVTRGAVSVDGGERDVPSLADAAVWGVVHSAQAERPDRFVLLDTDSDPEEARLREAVAAALAQHEPQVAIRSAQVLVPRLARAASPGPRATRADGPFAVDGTVLITLDGGPRGALVAEHAVRAWQAKHLLLLRPDGPGPGHESPGVRDAAERAAALGADVRVAACDLTDASAVADVVAAIDEARPLAAVVHAACPPDEARTDGIATAYALHAATADLPLEAFVLFSSAAATLGASGGDAAGAFNDALAAHRRAAGRTALSIAWGPWLDDTARPSAAAPGVGALTAEHALALLETARHSPTPRLVAANLDVRALASAHATEADAPPAPLRALAGRTPARPSAATGRGAASLAERLAAAPADQRGPLLLDLVRTNAAAVLGHADPGLVPVGRGFSEMGFDSITATDLAGRLAGATGLAVPTTAVFDHPSPAALAEHLHAELAPRGPDPDEDGTPASLLARLERMDNAMAAASPAGAAVNGDASVRTALRRRLQTMLARLDDHPGGAPDGPSGEPSAEPSGMGAERIGTATVDELLAFIDHDLGRNRR(SEQ ID NO:9)
根据本发明的实施例,madA9基因位于该链霉菌的基因簇核苷酸序列第81810-87416个碱基处,长度为5607个碱基对,编码聚酮合成酶,长度为1868个氨基酸。
根据本发明的实施例,madA10基因编码SEQ ID NO:10所示的氨基酸序列,具体如下:
VTDPTSGGAPRSPANGGADARTGNGAGNGAGGSAGGGAGREAELVEYLRWTTAELHEARRRLRRAEDAGREPVAVVGMACRFPGGARTPDRFWDLLAEGREAVTGFPADRGWDLDALFHPDPDHHGTTYARGGGFLDDAADFDAAFFGIGDEEAEAIEPQQRLLLETAWEAVESAGIDPRTLRGSATAVYAGVALHDYGTRLRRVPAALMPYLGTGNAGGGAPGRVAFTLGLVGPAVALDSACSSSLTAMHLASRALRDGECDLALAGGAAVMYTPSTFLLSSSQRQLAPDARCKPFAAASDGMVWGEGAGLVLLERLADARRNGHRVLALVRGTAVNQDGAVSGMAAPHGPGRQRLMREALADARLSPADVDVVEAHGTGTAIGDAIEAQAVLSVYGQDRPADRPLWLGTVKPNIGHSQAAAGMASVIKAVLAMRHATLPATLNIDRPTPLVSWRAGAVRLVTEPTPWPRGGRPRRAGVSAFGNSGTNAHVILEEPPPEPPEPRPPSPPPGGTLAWPVSARGAAALRAQAAALAAHVGNDPDLAPEDVAWSLAATRSVFEDRAVVVGAHRAELLAGLHALANGTPHENVIGGTGATAGPDADSDAGTWAFGGSAERAAQATTALRERFPAFAAAHDEVSERFGGALGGDAATFALHVALARLLLAAELRPGAVVGRGVGEVSAAHLAGALDLPDACRLVAQRDAPGSDALALTGGKPTVPVLDAGSAPFAEAGAFLDFGPEPFPDAPRSAERLVPGDGRAVGRGLVEALARLHTSGATVGWAALFGGGPRPRAVPLPTYAFQRRRFWLQEPPPADGAPVAAYASEGGPR(SEQ ID NO:10)
根据本发明的实施例,madA10基因位于该链霉菌的基因簇核苷酸序列第75412-77904个碱基处,长度为2493个碱基对,编码聚酮合成酶,长度为830个氨基酸。
根据本发明的实施例,madA11基因编码SEQ ID NO:11所示的氨基酸序列,具体如下:
MHQQATNDSDGQAVLDGTELQEKLAAAGPPERERILGETVREQAAAVLDQSVIDLDSNFLEKGLTSLKALELTRNLMSLTGIEIPLVAVIEHPTPVHLGRFMAELLAESGDGAAR(SEQ ID NO:11)
根据本发明的实施例,madA11基因位于该链霉菌的基因簇核苷酸序列第72491-72838个碱基处,长度为348个碱基对,编码聚酮合成酶的ACP蛋白,长度为115个氨基酸。
其中,糖基合成相关基因模块包括madG1、madG2、madG3、madG4、madG5、madG6、madG7和madG8共8个基因,具体如下:
根据本发明的实施例,madG1基因编码SEQ ID NO:12所示的氨基酸序列,具体如下:
MKGVILAGGKGTRLQPVTAVGSKQLMPIYDKPMVYYPLSILMFAGIRDVLVICRPSEVAVFRGLFGDGRRLGMNIDYAAQDEPRGIPDAFLVGADHIGGDDCALILGDNLFHGSGLPSLLRQSAERMDGCVLFGHQVSDPERYGVAEIDERGRLVSIEEKPSEPRSNLAIPGLYFFDNKVVEIARGLAPSARGELEITDVLRAYLEAGRADLVWLGRGVTWLDTGTHETLLEAGCFVRDVHQRQGARIGCVEEIALYMGFIGPDECYELGAEMGNSPYGRYVMDQARFYDRVRDLLDWRATFLEGA(SEQ ID NO:12)
根据本发明的实施例,madG1基因位于该链霉菌的基因簇核苷酸序列第7818-8738个碱基处,长度为921个碱基对,编码葡萄糖-1-磷酸胸苷酰转移酶,长度为306个氨基酸。
根据本发明的实施例,madG2基因编码SEQ ID NO:13所示的氨基酸序列,具体如下:
MNLLVTGAAGFIGSTYVRMLLAPAAPDPAAPDPVAPDPDAADPDASEEIRVTVLDKLTYAGTLDNLTLSHPRLRFVYGDICDEDLVGKLMSEADQVVHFAAESHVDNSIAAAGDFVRTNVVGTHTLLEQALAHGVRRFVHVSTDEVYGSVETGRSHETDALDPNSPYAASKAASDLLALAYHRTHGLDVRVTRCCNNYGPRQFPEKLIPLFITRLLDGQKVPLYGDGQHKRDWLHVDDHCRAIELVRTAGAPGEVYNISGHTELTNHQITGLLLEAVGGDWDQVAYVTDRKAHDRRYAVHDDKARTRLGYRPRRDFPRGLAETVAWYRHNRSWWEPLKRRAGL(SEQ ID NO:13)
根据本发明的实施例,madG2基因位于该链霉菌的基因簇核苷酸序列第8735-9766个碱基处,长度为1032个碱基对,编码dTDP-葡萄糖4,6-脱水酶,长度为343个氨基酸。
根据本发明的实施例,madG3基因编码SEQ ID NO:14所示的氨基酸序列,具体如下:
VPDPARPAPLSPRSPQSPRAPRSPRGRPAPGGRSPRAREDTAAQRAIAASAAALEGRVPGVADVRAWLGERRDATRFQVGRIPFAELDGWSFDGPTGNLVHDSGRFFTVEGLRVSIDGDRPREWQQPIICQFEVGILGILAKRFDGVLHFLMQAKMEPGNPNLLQLSPTVQATRSNYTKVHRGASVRYLEYFTGGEPVRVLADVLQSEHGAWFCRKRNRNMIVEATGDVPEHEDFRWLTLGQLAGLLHLDNVVNMDARSVLACMPGAGGPTALHSDVELRSWIAAERSRHIVTTERVPRAGLRHWVQGPWSIDHEERRHFRVVAVSVRAGNREVTRWTQPLFEPYETGIVAYVTRTFEGVPHVLARARVEGGFLDTVELGPTVQAIPGNYAHVPASERPAFIDLVSSADPSRVRYSAVHSEEGGRFLNAESRYMVVEADEDEAPIDPPAGFRWVTPASSPCSPSTATTSTCRRAACSPA(SEQ ID NO:14)
根据本发明的实施例,madG3基因位于该链霉菌的基因簇核苷酸序列第118881-120320个碱基处,长度为1440个碱基对,编码NDP-己糖-2,3-脱水酶,长度为479个氨基酸。
根据本发明的实施例,madG4基因编码SEQ ID NO:15所示的氨基酸序列,具体如下:
VRHVDGVERQEPPLRLGVLGCADIARRRTVPALLAADGVDVTAVASRDPGRAESFAAEVGCAAVGGYGALLASDDIDAVYIPLPSALHAEWTEKALDAGKHVLAEKPLTTDYETTARLLRLARTRGLVLLENVTFLHHSQHTAVQKMLADGAIGELRDFSSAFTIPPLPAGNTQFLPGLGGGALLEQGVYPIRAALRFLGNDLDVTAAVLHVDGGTGMVVGGRALACTPGGVTADLAFGVRHSYRTGCEFAGSAGRLLTDRAFTPPAAYQPVVRIERQDHREEITLPPDHQWANLGRFFARAVGAARVAGAAGAARTAPGPGAGAEAGTGGEPEAFGAGSLHQARLVSEIEERARRITV(SEQ ID NO:15)
根据本发明的实施例,madG4基因位于该链霉菌的基因簇核苷酸序列第120357-121436个碱基处,长度为1080个碱基对,编码氧化还原酶长度为359个氨基酸。
根据本发明的实施例,madG5基因编码SEQ ID NO:16所示的氨基酸序列,具体如下:
MRRSARRFAVRISEMTVPDAYRIEPDRIADSRGFFYEAIRAEALRSITGHELEIRQVNYSVSKRNTVRGIHGTALPPGQAKLVTCVRGAALDIAVDIRVGSPTFGHYAVTRQDAESGTAVYLPDGIGHAFLALTDDTCMCYLCSVEYVPGTMIDIDALDPELALPWKLTGPPIRSAKDAAAPSLSEAAAAGLLPTYEQCVRSYARNGPAGGGAAGPAAGGRHSFQRR(SEQ ID NO:16)
根据本发明的实施例,madG5基因位于该链霉菌的基因簇核苷酸序列第117989-118672个碱基处,长度为684个碱基对,编码dTDP-4-酮-6-脱氧-D-葡萄糖差向异构酶,长度为227个氨基酸。
根据本发明的实施例,madG6基因编码SEQ ID NO:17所示的氨基酸序列,具体如下:
MSISQYTDPRIVPHESDGEREARERLTRLLSKTPIQPEYLIDNLAVYLRRHQLSDLLSMDALYREVRDVPGVIMEFGVLHGRHLATLTALRGIYEPYNSLRRIIGFDTFTGFPDISPVDEVSPSAVVGRFETARDYPDHLREVLAAHELGEPMGHVQRTIVMQGDVRETVPRYLEENPQTVIALAYFDMDLYAPTRETLETIRPYLTRGSIVAFDELAHPKWPGETAAFREVCGQDATLHQIPGREPPVIYLRWNE(SEQ ID NO:17)
根据本发明的实施例,madG6基因位于该链霉菌的基因簇核苷酸序列第117240-118010个碱基处,长度为771个碱基对,编码dTDP-6-脱氧-L-己糖3-O-甲基转移酶,长度为256个氨基酸。
根据本发明的实施例,madG7基因编码SEQ ID NO:18所示的氨基酸序列,具体如下:
VSGDGLHVRLGRTGLTVSRLGLGTVNFGGRVEEPEAHRLMDHALACGLNLLDTADSYGWRVRKGHTEEVIGRWLARGSRRDDVVLATKVGSAMGPGPNDRGLSARHIVAACEDSLRRLGTDWIDLYQLHSMDRAVPWDEIWRAMELLVAQGKVRYVGSSNAAGWDLAAAQETAARRGAPGLASEQCLYNLVARHPELEIIPAARAYGVAVLVWSPLHGGLLGGTLRKAAEGTAVKSAQGRAAAALDAHRETIAEYERFCGGLGRDPAEVGLAWALRRPGVTSLIIGPRTPEHVDGALRALDKPLSDPELDRLDALFPPVGRGGPAPDAWIG(SEQ ID NO:18)
根据本发明的实施例,madG7基因位于该链霉菌的基因簇核苷酸序列第116215-117210个碱基处,长度为996个碱基对,编码还原酶,长度为331个氨基酸。
根据本发明的实施例,madG8基因编码SEQ ID NO:19所示的氨基酸序列,具体如下:
MGPSYQSSLQWENISKHWVTEDVSAELADFKSGRPNHKIAIWNPEANGIRYLKTLVYNLASGLGPAEWEKLRRIRNREVGDPFTVTYDGESVCLDYLQAVLELGFVEKGLDLAGAAVLEIGAGYGRTCHTLLSNHDLASYHIVDLPNTLGLCRSYLRAVLDEERFDRITFVTVDEVEDALGARRFDLAVNINSFTEMPPGTVRDYLALIDRRCDAFYVKNPVGKYMDKSLDGHTQGDEVVRMAMETGPLRKVVDINDSRAVEAAVPDFIEAYRPGGAWTCAADGWPVPWTYFWQALYRRSPAANGSAPAAGR(SEQ ID NO:19)
根据本发明的实施例,madG8基因位于该链霉菌的基因簇核苷酸序列第114954-115892个碱基处,长度为939个碱基对,编码糖基氧甲基酶,长度为312个氨基酸。
其中,修饰基因,madO1基因、madO2基因、madI基因、madE基因、madP1基因、madP2基因、madM1基因、madM2基因、madG9基因、madG10基因和madTE基因共11个基因,具体如下:
根据本发明的实施例,madO1基因编码SEQ ID NO:20所示的氨基酸序列,具体如下:
VTMTRSEHAIVLGGGWAGMLAAQALSRRFDAVTVLERDVLPEGPRQRKGQPQARHGHILWSSGARVVDALLPGTIDRLLSLGARRIMFQRDLVTLTSHGWQHRFPSKQYCVMCSRPLMDWAVREQVAASGRVEVRQRTEALDLAGDRDRITGVHVRDVATGAASSLTADLVVDATGRGSRLKHWLGALGLPPLEEDVVDAGMAYCSRIYQAPPGAAAGFPPVNLAPDPRVREPGRFGVVHPQEDGTWMVTLAGTRGVRLPTDDAEFNEYARTLRDPLVADLIARAEPLTPLVVSHFGANRRLYPERLAGWPDRLVVLGDALAVFNPIYGHGMSAAARGIAALDERLAEEGLGAGAVAAAQRDVCAAVDDPWIIAAARDIEYVGCRSTATDPRLLGEADARRRFADVITVRSLRSPGVSEMVTDAASLAVPQSELGSSRFMALLGSDPMRPELTEPPLSPDELALVNLSPRAAVGAETASG(SEQ ID NO:20)
根据本发明的实施例,madO1基因位于该链霉菌的基因簇核苷酸序列第80298-81740个碱基处,长度为1443个碱基对,编码环氧酶,长度为480个氨基酸。
根据本发明的实施例,madO2基因编码SEQ ID NO:21所示的氨基酸序列,具体如下:
VTRSGRAIVLGGSWAGMLAAHVLARHLEAVTVVERDVLPRGPEHRRGLPQARHVHVLWSGGARIVEELLPGTAERLLAAGARRIGFHEDLVTLTSHGWQHRFPPRQYAIMCGRPLLDWIVRDRILADERITLREGTEAVELAGGAKRVTGVHVRGARDGARELLEADLVVDATGRGSRLKHWLAALGLPPLAEDVVDAGIAYATRTFRAPDGATAGFPAINVAADHRVRRPGRFGVVYPQEGGRWMVTLSCTRGGELPTRDEDFLPYARTLRDPIVADLVAGAEPLTPVFASHIGANRRLYPERAATWPDGLVVLGDSLAAFNPIYGHGMSAAARAARVLDERLAEPFGPGAARAAQERISAAADDPWIMATSKDIEFVHCRNHAEDPRLNGGAAVMRGFADMVATMAIRSADVSDAVTDVMSMTVPQSELGTARFMSLMQGARLLPELGEPPLRRHELALVNLDRRAVAGVRSVPR(SEQ ID NO:21)
根据本发明的实施例,madO2基因位于该链霉菌的基因簇核苷酸序列第77901-79334个碱基处,长度为1434个碱基对,编码环氧酶,长度为477个氨基酸。
根据本发明的实施例,madI基因编码SEQ ID NO:22所示的氨基酸序列,具体如下:
MLTDESARKRIILEHCRRLNAGDVGGLLDLYAEDVRFEDPVGGGERAGREELRAHFARAVEANAREVPGEPVASQDGVHALVPVSATLDGPAAGPGGVRSIEREYVLMLRVGAGGLIEELRAFWGAAEAGTDDAVRKNAPLEYARRINAGDLDGVMELFTDDIVFEDPVGSPPVRGKAAIRQNIAWAIECKVHEVPGRPVLALDGRTVVAPSVVTTSYPSNMRYQVIGVSEVDGNGLTRSLKAYWGITDVSLPDAPRLTGVAHSLAVTERLIENVRLSEARAGSS(SEQ ID NO:22)
根据本发明的实施例,madI基因位于该链霉菌的基因簇核苷酸序列第79391-80248个碱基处,长度为858个碱基对,编码环氧化物水解酶,长度为285个氨基酸。
根据本发明的实施例,madE基因编码SEQ ID NO:23所示的氨基酸序列,具体如下:
MDISVPLGDAVKLHVRHREGTGGPPFLLLHGLDSNARMWDLVADRLAAEGHPVYAVDQRGHGDSDPADDDYGNETAAADIAAVADALGVTRAVVAGHSWGALVSLRLAARRPGLVAGLALIEGGWAHASVVCDSWEQFAGLLTMSEVDLNGATLDAMRDYQRAVYPDWSAEAVEASLHSLRVHEDGALTPRLSPTQRNAILRSIWDDPPAQWYPAITVPTLLMPAVPKPNRRWEPLFERVRSYVEPAVAALPDATVREYVGGDHDLHAQHPDEVAEDLLRLARNVQRRAALGDRTAN(SEQ ID NO:23)
根据本发明的实施例,madE基因位于该链霉菌的基因簇核苷酸序列第67792-68685个碱基处,长度为894个碱基对,编码聚酮硫酯酶,长度为297个氨基酸。
根据本发明的实施例,madP1基因具有SEQ ID NO:24所示的氨基酸序列,具体如下:
MTTDDYALTWHELMVDPYSGFGRIREQAPIVQGSWDGRPVWIVTRHEDVSALLMDRRLATNSSAVPGCPDDYRAALRAIGVAEELVPYLAGDLVRTDPESHARLRKLLSRAFTARRVARLRPRVEDIARELLDALPGRAEGGAVELIEHFAYPLPITVICELLGVPEEDRPLWRGWSDDYVSMDPARLNRMLADMSAHLRELVGRRRAEPADDLVSALIRTHDDDGGLLSHTELVAMVLTLMIASQLPTPQLVANGAVALAARPDQLALLRADPGLWPGAVHELVRLCGPGIVAMLRYAAEDIAFGDTVIRQGDRVQLVLGSANRDPRRFPAPDLLDVTRPVDGGVQHLGYSRGAHYCLGAGLANQEIEVALSALFGRYPDLALAVAPEELEWRPIPLTRQLVRVPVVLGEPA(SEQ ID NO:24)
根据本发明的实施例,madP1基因位于该链霉菌的基因簇氨基酸序列第74174-75415个碱基处,长度为1242个碱基对,编码细胞色素P450酶,长度为413个氨基酸。
根据本发明的实施例,madP2基因编码SEQ ID NO:25所示的氨基酸序列,具体如下:
MTAERTTGPRTSGEHVLSDPELVADPYGAFGRIREEAPVARGRLWDGGPVWIVTRHDDVSAVLLDRRFASNAGSLPGRTDEHAEFLARTVGIPEDVVPYLAGNLVYLDPPDHTRLRGLVARAFTPRRIARLRPRVEAITGELLDTLPDRAVDGVVDLIEHFAYPLPVSVICELLGVPEEDRPRWHAWSRRFGDSTPRRLGEIVTEMNAHIRELVERRRAEPADDLLTALTGAHDGGGRLSDAELVTMVLTLMIAGHTTTSHMLGNGIAALLAHPGQLARLREDPALMPGAVQELLRWGSPVVIAKLRYATQDVTIGDTLIRQGERVQLVLGSANRDPRRFPDGDRLDVERPCGAADAQHLAYARGPHYCLGAALANQEAEVAFSALFTRFPELALAVPPDRLERDQIPITHQLARLPVTLGPPSADAPSAGGRAEHGPGGPDRAEG(SEQ ID NO:25)
根据本发明的实施例,madP2基因位于该链霉菌的基因簇核苷酸序列第71092-72432个碱基处,长度为1341个碱基对,编码细胞色素P450酶,长度为446个氨基酸。
根据本发明的实施例,madM1基因编码SEQ ID NO:26所示的氨基酸序列,具体如下:
MSDYYSAMGALLQMAWGDNFHFGYWDGPSDTATIDEATDRFTDMLTERLRVGPGDRVLDLGCGVGKPALRIAARTGASVVGVTINARHVELATERARAEGRADQVSFRHADAMDLPFEDGSFDAVLAFESIIHMDRPTVLREVERVLVPGGRLALTDLTPLTDESNTPKSFRSLMGASPGDESPSGQGGDPDIATLISAGAWPGLLSDASLTLDELNDVTEHTQGTFIRLFENFFKYRREFEREHGITVEQVLDSAKSGAPTDGVGCLVVAAHKP(SEQ ID NO:26)
根据本发明的实施例,madM1基因位于该链霉菌的基因簇核苷酸序列第73162-73989个碱基处,长度为828个碱基对,编码甲基转移酶,长度为275个氨基酸。
根据本发明的实施例,madM2基因编码SEQ ID NO:27所示的氨基酸序列,具体如下:
MTGGTISEGRGDGPRTGPGPAGTRGSSPFTEKWVRSMYRLLERLSSGGAPNLLGIENGYLNYGYWEPGCTDHDAACVALAERLGEAAGITAGDRVLDVGFGFGEQDFHWLRTREPKEIVGLNITPGQVDAARRRTRELDLDDRLDLRVGSATSLPFEDGSFDRVVALESSAHFNTRQVFFREAFRVLRPGGVLATTDPLPREAPGGKGGLVLRLDEVRRRRIIPDANWYPRSVYAARLAEAGFVDVDVRDVTDRTIAANAVFARAHCARLLRDPRYRSFQPRNTIRYHLRQVEARAAARDYVITSARKPGR(SEQ ID NO:27)
根据本发明的实施例,madM2基因位于该链霉菌的基因簇核苷酸序列第70154-71089个碱基处,长度为936个碱基对,编码甲基转移酶,长度为311个氨基酸。
根据本发明的实施例,madG9基因编码SEQ ID NO:28所示的氨基酸序列,具体如下:
VAVRVLFVTLPWKTHLFNFVPLAWALQTAGHEVHVAAEPDLTGAVTGAGLTAVPVGSGETMGERVRRAWREGTLPTPEEAPPPGHPVELYDLGPDRQRLSWQELNRIHDTLVVPRAWLSNDTMFDDLVAYCRSWRPDLVIWNAVTFAGSVAAAAVGAAHARFLFSVDLYSRLRDDQLTVMARQPPQDRRDGLKDWFAPWAAKYGVEFSEELVNGHFSIDQMPASFRLDFPHRTVSMRHVPYNGPAVIPAWLAEPPRAPRVLMTFGVSVRSWPELQVVPVERLRESLDSLADLDVEVVVTLPDDVRASLGRVPANVRVVDFVPLHAVLPTCSAVVHHGGAGSFNGSLLSGVPQLLVSTALDAPFKDHHLRAAGAGLAITPDRFSGPLIRENVVRMLEDPAFRAGAETLRREIMSYPAPNALVPELERLTAEHRA(SEQ ID NO:28)
根据本发明的实施例,madG9基因位于该链霉菌的基因簇核苷酸序列第109074-110375个碱基处,长度为1302个碱基对,编码糖基转移酶,长度为433个氨基酸。
根据本发明的实施例,madG10基因编码SEQ ID NO:29所示的氨基酸序列,具体如下:
VAVRVLFVTPTWHTHFFNLVPLAWSLQTAGHDVRVACEPELVDTVTRSGLTAVPVGSDEPIRDRARRAEEDGTLPPLDMGRLAGALGGATSTAPKPSGSGGQAGGAIGDPRAKLSWEDMVWLYETVAVPRGQDRERHAVRRPGGVQPVVAAPARRLGRPHPRGPVAAGAVGAAHARVTFTVDLSFQLRSGFLWTMAQQPPERRRDPLAEWLGSWTEKFGYEYSETLVNAHATIDQFPPSFGGDYGASHLNLRYVPYNGPAVIPDWLNEPPPAPRVLMTLGVSMSDWQELQVMSIERVQEVLDSVAGLDMELVLTLPTAFREKLDRVPRNTRIVEFAPFHAVLPTCAAMIHHGGAGTFYNALLAGTPQLLITKVPDALHKRAYLAETGAGLSIPPDEVTGPKVRESLARLLDDPSFRAGAERIRREVLDQPTPNGLVPELERLAARHGPA(SEQ ID NO:29)
根据本发明的实施例,madG10基因位于该链霉菌的基因簇核苷酸序列第68773-70122个碱基处,长度为1350个碱基对,编码糖基转移酶,长度为449个氨基酸。
根据本发明的实施例,madTE基因编码SEQ ID NO:30所示的氨基酸序列,具体如下:
MKAGSVRTGGTPGEHAGRPPRERPVRERKGVMTQPSTDFEAWIRRYHQAPSSGVRLVCFPHAGGSASFFFPVSARLSPAVDVAAVQYPGRQDRRAEANIGTIPEMADAVLPAVRALADRPLAFFGHSMGAVLAYEVARRLERDGAEPLARLYVSGRRAPSRHRSEHVHELDDAGLVAELQRLSGTDSSLLGDPEARDMILPAVRSDYHAVETYRHVPGAALRCPVVALVGDRDPQVTRDEALAWAEHSSGPFDLRVFPGGHFYLVEHGPRVIQALADDLAAVRVPSGAGGVAGGAGRTDAAAT(SEQ ID NO:30)
根据本发明的实施例,madTE基因位于该链霉菌的基因簇核苷酸序列第110428-111339个碱基处,长度为912个碱基对,编码II型硫酯酶,长度为303个氨基酸。
根据本发明的实施例,该生物合成基因簇进一步包括:调节基因模块,该调节基因模块包括:madR1基因、madR2基因、madR3基因、madR4基因、madT1基因、madT2基因、madT3基因和madU基因,上述基因的序列信息具体如下:
根据本发明的实施例,madR1基因编码SEQ ID NO:31所示的氨基酸序列,具体如下:
VRYEILGPFQVVDEDGKHTIRARKIRVLLTVLLVRADQVVPVDQLITEIWGEAPPRRATAGLHVYISQIRKFLHRPGAEDPVLTRPPGYLLRLGADELDLRRLELLVNQGRNDFRQGRHDQASAAFEEALDLSADHLPDDLGNGPVLEGFHTWLRETRLECVEMLMESRMMLGRHRELVGDLYQLTTEHPLREVLHRQLMLALYRSGRRADALHAYQSARRTLNEELGLEPCRDLQNLQRSILTSDHRLDLPIPA(SEQ ID NO:31)
根据本发明的实施例,madR1基因位于该链霉菌的基因簇核苷酸序列第9986-10753个碱基处,长度为768个碱基对,编码转录调节因子,长度为255个氨基酸。
根据本发明的实施例,madR2基因编码SEQ ID NO:32所示的氨基酸序列,具体如下:
VSLVERNAEVAVLHGLFGDCLEARSRLAIISGDVATGKSSLLDAFERQAAESGAVVLNAVASSVETALPMGVLEQLFRSPEVPEAIAERAMKLLNARALTYPMRFRGLWDVLRDLTERRPVVISVDDVHHADEISLQCLLYLLRRLRSARLLTVLTVSPRSQAPNAPFQSEFLREPHSRQIRLGPLSPDGVEALIRLDLDAQTARLLAPAFHEASAGNPALVHALVDDYLAGPEFIAPELVAGGAFGRAVVGLLQRHEFPVLEVARAIGILNEPVPPSLIGRLLDIDAETAARAVGTLTSAGILDAGSFRHGAAQAAVVDSAPPDAPGVLHERAAEQLHSDGAAATDVAAHIVASNRHGAPWAIPVLREAAEQALTSDDLGTGIRYLRVAHQICRDRRERSAIAARLADLEWRVDPSVALRWVPEFSLAIQDGLLDGRDAGTPFMSLLWHGRVSEAVRMLDALGRGRPSAALPGDAGTAMDVIPPRFCLAYLYPELAKDGPAADTAPALGPVPAAAPGPGQDAAAMLVAELHRRDVRGALVAAERVMERSRLNHRTFASLAIALATLIYNDRPGRAAVWCDALLAEATARRSPTWLALFTALRALIHLRQGDLPAAERHADTALNLISPKSWGVLVGLPLSCMILAKTAMGRPREAADLLSITVPEAMFRTPLGPHYLYARGRHHLAANRLHAALSDFRTCGTMREGWAVVLPEVEPWRLGAAQALIALGDLAEARALMEEQLLRSAPEHVRTRGVALRVKAATHPVPERPDLLGEAIELLQGCGDRLELSYALADLAEVRRWVGEHDCARELAHQAELLARQCGRPAAREGERPDVPGGVVRPDDLAGRHRMGAGHAAGLAGSAGRDAGGVAAADPERFLKLSDAERRVAVLAASGYTNREISKKLYITISTVEQHLTKVYRKLNVKRFDLKSALQHIEVVLPASEAVLDCCGSGPLRQAAGDR(SEQ ID NO:32)
根据本发明的实施例,madR2基因位于该链霉菌的基因簇核苷酸序列第3974-6871个碱基处,长度为2898个碱基对,编码转录调节因子,长度为965个氨基酸。
根据本发明的实施例,madR3基因编码SEQ ID NO:33所示的氨基酸序列,具体如下:
VPVSLVERDEELAALDGLFAELPRARSRVVVVSGGTATGKTSLLRAFGERAVEHGALFLSAVASRIERGLPMGVLEQLFRNPDLPSGDAERAMRWLDAGALNTSAAGPGGSGQVTSVVLRGLCEVLRGLAERQPVVIAVDDVHYADETSLRCLLYLLRRLRSSRLHVIFSECRDLEAANALLRSEFLREPFFRQIRVEPLTKAGVARLLRRSLGDAAAQDLTPAFHEATSGYPALVQALIEDHRAAAAAGRGELGSGMEFSRAVMTFLYRYEPPVCEVARAIAILGKAASTTLLGRLLDLAAESTAQAVNALTKARILENGDFRSEIVRAAILDATPSGERMAMHGSAAVLLHNEGAAPTDVAAHIVAAGRIEAPWVVPLLREAAEHALANDDVRPAIRHLRAAYRLCDDERLRPEIASALADAEWRVDPSAVLRHLPDFAAPARTGRPELCDTFTPITYLLWHGRVGEALGIVDDLVRTQDEAAESGTAASVDLDTPRLWLSYLYPGVLKAEPVPAHHEPEPTGALPDLQEATMLAAELVDENESDALTTAEGILQRSRLSNRTLAPLTTALAVLMYNDRLEQAASWCDSLLVEAVARRSPTWHALFAAERALIHVRQGNVIAAERHADAALSLIPPKSWGVAVGLPLAAMVLATTAMGKVERAATLLDIPVPQTMFQTRMGLHYLYARGRHHLAEGRFYAAACDFRACGKLMNDWGIDLPAVEPWRIGAAEAHLRLGDAAQARELVDEQLALLEPEHLRARGMSLRAQAATSPPESRPALLGQAVEMLQSCGDQLELSRALADQSRAYEILGEPERASLLSRRSLSLARKCGILERTTEEPPCPSTLELSDAERRVAALAADGYTNREISRKLYITVSTVEQHLTKVYRKLNVRRFDLRSALRRDAASPVRDPAAGPGRPRPGGTGT(SEQ ID NO:33)
根据本发明的实施例,madR3基因位于该链霉菌的基因簇核苷酸序列第316-3105个碱基处,长度为2790个碱基对,编码转录调节因子,长度为929个氨基酸
根据本发明的实施例,madR4基因编码SEQ ID NO:34所示的氨基酸序列,具体如下:
MLAERGGAAFEDLTARARIRDAALRYFTEYGFTQATIREIARAAGVSPGLVRHHFGSKEALRQACDAYALEALHGFYDQTLNGDRLSHPRSTMDARHTLHPFQRYLARALIDESEAAARIFDDLVAMTERSLIQADRQRPDPPIADAASRAALMVAMALGIPTFHQHISRAVGADIFSDEGDRRVALALLDVHSHSVISPETAASLRDGLDEPGSGSGNRAPFTHSQEEHRHE*(SEQ ID NO:34)
根据本发明的实施例,madR4基因位于该链霉菌的基因簇核苷酸序列第114008-114709个碱基处,长度为702个碱基对,编码转录调节因子,长度为233个氨基酸。
根据本发明的实施例,madT1基因编码SEQ ID NO:35所示的氨基酸序列,具体如下:
MSDAISAKGLVKKFGPTRALDGLDLTVRTGEVHGFLGPNGAGKSTTIRVLLGILHADAGEVGLLDGDPWQDATALHRRLAYVPGDVNLWPGLSGGEVIDLLGRLRGGVDKRRKTELLERFQLDPRKKTRTYSKGNRQKVALVAALASDVELLLLDEPTSGLDPLMEETFREAIAEERSAGRTVLLSSHILSEVEALCDRVSIIREGRVVETGTLDEMRHLTRTSIDAELMRPPSGLSDVPGVHELKIDGMHVSCEVETASLDEALRQLTGAGVRNLVSRPPTLEELFLRHYTADGGARDANAGVAR(SEQ ID NO:35)
根据本发明的实施例,madT1基因位于该链霉菌的基因簇核苷酸序列第113095-114015个碱基处,长度为921个碱基对,编码ABC转运蛋白ATP结合蛋白,长度为306个氨基酸。
根据本发明的实施例,madT2基因具有SEQ ID NO:36所示的氨基酸序列,具体如下:
VNPLTGTGKLVRLILRRDRWLLPAWMFWVVVIPLSYVSTYKDLYPTAADRAEYAATSGSNPTFLALYGPLPDVSLGGIVAQRAGFIPIFVGLVSILTVIRHTRTDEQAGRRELLAATVVGRHAQLAAALIVTMTANLVVAVLLGLGLSGQLPAGGAFAIGLGFAVAGCTFAAVAGVAAQLTEYAGSARGLSIAALGGFFLVRLAADTGGDNSGLTWLSWLSPLGWVTLLRPFADERWWVLALAVAFAAAAVAVAVAFSSRRDIGAGILAARLGPADAAPSLSNPIVLAWRLQRSLLLAWTVAMIVLGGVFGGIAQGVGDLLEDNDTLKDMFLRIGGQQGLIDAYLASIMGTVGLVASAYGIQAALRLRTEEQAMHGEYVLATSVSRAKWVASHLLFAAVGPAIALAAAGVTTGLVHGADIGDVGGQVPRILGGAMVQLPAVWTLTGLALALFGILPRLSLAVWGALALFFLLGQLGEAMQLDQALMDLSPFSHTPRIPGGHFSAMPVVWLLAISLALTLVGFVGARRRDIGTA(SEQ ID NO:36)
根据本发明的实施例,madT2基因位于该链霉菌的基因簇核苷酸序列第111497-113098个碱基处,长度为1602个碱基对,编码ABC转运蛋白通透酶,长度为533个氨基酸。
根据本发明的实施例,madT3基因编码SEQ ID NO:37所示的氨基酸序列,具体如下:
VTRAIAATGLVKSFGGARVLDGLDLTVHRGEVHGLLGPNGAGKTTLIRILLGLTRADGGTARLLGGDPWADAPALHRHVAYVPGDVTLWPGLTGGEVIDVIGALRGGLDGDRRNELIGRFDLDPRERCRNYPPAERRKVALVAALASDAELLVLDEPALGLDPVMERTFRRVVLEEQARANRTFLLSSHILSEVDALCDGVSIIRDGRTVESGVLAEMRLGARTSISAQLLGRPFALAHMTGVHNVTVRNNHVECDVDHDSIEKLMSYLASVGIRDLVSREPTLEELFLRHYGAHRPRRPRPRPDRAPAH(SEQ ID NO:37)
根据本发明的实施例,madT3基因位于该链霉菌的基因簇核苷酸序列第11355-12287个碱基处,长度为933个碱基对,编码ABC转运蛋白ATP结合蛋白,长度为310个氨基酸。
根据本发明的实施例,madU基因编码SEQ ID NO:38所示的氨基酸序列,具体如下:
MNMRESRRNAAQQGRAARGPRARWALAATGAALVVLVPGAVAAVADTGGGRADGGRADGAGRPPGHGTKLPAPPQMRALDFMLGSYKCMDTPPGGGEQIENYITTKRDIGGHYLDSAMVTPDLVVGRRVFGWNPVDRRFMSEYHDDWGVQGNSYSEGWKNGHLIFTGTVKLVKKPSPTGNAEGVEVNVKDDMVILSRNHYTNTQTTSVPGGISVQHFYDCRK(SEQ ID NO:38)
根据本发明的实施例,madU基因位于该链霉菌的基因簇核苷酸序列第108164-108832个碱基处,长度为669个碱基对,编码未知蛋白,长度为222个氨基酸。
参考图1,根据本发明的实施例,沿着基因的5’-3’端,所述madR3基因、所述madR2基因、所述madG1基因、所述madG2基因、所述madR1基因、所述madT3基因、所述madA1基因、所述madA2基因、所述madA3基因、所述madA4基因、所述madA5基因、所述madA6基因、所述madE基因、所述madG10基因、所述madM2基因、所述madP2基因、所述madA11基因、所述madM1基因、所述madP1基因、所述madA10基因、所述madO2基因、所述madI基因、所述madO1基因、所述madA9基因、所述madA8基因、所述madA7基因、所述madU基因、所述madG9基因、所述madTE基因、所述madT2基因、所述madT1基因、所述madR4基因、所述madG8基因、所述madG7基因、所述madG6基因、所述madG5基因、所述madG3基因和所述madG4基因依次串联。
此外,需要说明的是,本发明中,“串联”仅表示基因的连接顺序,相邻基因间即可以是直接连接,也可以是存在其它基因片段或者连接处的部分序列存在相互重叠。
根据本发明的一些实施例,madR3基因、madR2基因、madR1基因、madT3基因、madE基因、madG10基因、madM2基因、madP2基因、madA11基因、madP1基因、madA10基因、madO2基因、madI基因、madO1基因、madA9基因、madA8基因、madA7基因、madU基因、madTE基因、madT2基因、madT1基因、madR4基因、madG8基因、madG5基因和madG4基因的表达方向相同。
根据本发明的一些实施例,madG1基因、madG2基因、madA1基因、madA2基因、madA3基因、madA4基因、madA5基因、madA6基因、madM1基因、madG9基因、madG7基因、madG6基因和madG3基因的表达方向相同。
微生物
根据本发明的另一方面,本发明提供了一种微生物。根据本发明的实施例,该微生物具有前述的生物合成基因簇。发明人惊人地发现,利用该微生物进行发酵处理,可以高效、简单和低成本地合成马度米星化合物。
根据本发明的实施例,所述微生物为链霉菌。根据本发明的具体实施例,微生物为Actinomadura sp.J1-007放线菌。
根据本发明的又一方面,本发明提供了一种制备马度米星化合物的方法。参考图2,根据本发明的实施例,对该方法进行解释说明,该方法包括:
S100发酵处理
根据本发明的实施例,利用前述的微生物进行发酵处理,得到发酵后的微生物。
根据本发明的具体实施例,发酵培养基条件:0.4%[w/v]玉米糖浆,6%[w/v]葡萄糖,2.4%[w/v]羽毛粉,0.005%[w/v]Fe2(SO4)3,0.3%[w/v]NaCl,0.015%[w/v]K2HPO4,0.1%[w/v]CaCO3,pH 7.2。由此,发酵效果好,马度米星化合物产量高。
根据本发明的具体实施例,发酵处理的时间为7-8天。
S200破壁处理
根据本发明的实施例,将发酵后的微生物进行破壁处理,得到含有马度米星化合物的溶液。
S300纯化
根据本发明的实施例,从含有所述马度米星化合物的溶液中纯化所述马度米星化合物,得到马度米星化合物产品。
根据本发明的一些实施例,将含有马度米星化合物的溶液后旋干,采用甲醇萃取,浓缩后,用硅胶柱粗分离,后用半制备HPLC进一步纯化得到目标产物。
根据本发明实施例的制备马度米星化合物的方法,利用前述的微生物进行发酵处理,可以高效地合成马度米星化合物。然后,通过微生物破壁处理和分离纯化处理,可以高效、简单和低成本地获得马度米星化合物纯品。
根据本发明的一些实施例,通过基因预测,推测出马度米星化合物的生物合成途径如图3。一方面由聚酮合成基因簇催化丙酰辅酶A和甲基丙二酰辅酶A的延伸合成最终形成长链脂肪酸链,在通过madO1/madO2、madI、madP1、madP2、madM1和madM2基因的修饰并得到马度米星的中间产物,另一方面通过madG1、madG2、madG3、madG4、madG5、madG6、madG7和madG8等基因催化得到糖基,再由糖基转移酶madG9和madG10将糖基转移到脂肪酸链上,最后由硫酯酶madE水解释放得到马度米星。
下面参考具体实施例,对本发明进行说明,需要说明的是,这些实施例仅仅是说明性的,而不能理解为对本发明的限制。
下面将结合实施例对本发明的方案进行解释。本领域技术人员将会理解,下面的实施例仅用于说明本发明,而不应视为限定本发明的范围。实施例中未注明具体技术或条件的,按照本领域内的文献所描述的技术或条件(例如参考J.萨姆布鲁克等著,黄培堂等译的《分子克隆实验指南》,第三版,科学出版社)或者按照产品说明书进行。所用试剂或仪器未注明生产厂商者,均为可以通过市购获得的常规产品,例如可以采购自Illumina公司。
实施例1
将Actinomadura sp.J1-007放线菌接种于SFM平板,培养三天到四天左右。待SFM培养基中形成菌落时,挑单菌落至种子培养基中培养三天到四天左右,按照1%的量接菌至发酵培养基(发酵培养基的成份为0.4%[w/v]玉米糖浆,6%[w/v]葡萄糖,2.4%[w/v]羽毛粉,0.005%[w/v]Fe2(SO4)3,0.3%[w/v]NaCl,0.015%[w/v]K2HPO4,0.1%[w/v]CaCO3,pH7.2。)中。培养条件:30度,220转速,培养7到8天时间。
待培养到第7或8天时,用甲醇提取培养物,通过LC-MS检测提取物,并使用商业化的马杜霉素标准品作为阳性对照。如图4所示,Actinomadura sp.J1-007具有与马杜米星标准品相同的保留时间(铵盐化合物:m/z=934.16)。通过其特征性片段化(m/z=871.83,804.85,740.58,646.67和629.00)进一步证实了该化合物为马度米星化合物。
实施例2
以Actinomadura sp.J1-007放线菌为研究对象,研究马度米星化合物生物合成基因簇的序列信息,具体方法如下:
1、提取DNA
(1)将Actinomadura sp.J1-007亚型链霉菌接种于TSBY培养基中培养2天后,7000rpm×5min收集菌体。
(2)将收集的菌体弃上清,加30mL SET buffer,打散,离心7000rpm,5分钟。再次弃上清,加10mL SET buffer,打散菌体,加300μL溶菌酶(100mg/ml),37℃水浴30-60分钟,再加100μL蛋白酶K(100mg/ml),10μL RNase(10mg/ml),37℃水浴30分钟。
(3)向步骤(2)得到的混合物中加入1.2mL 10%SDS,55℃水浴2小时,每隔15分钟轻摇至液体逐渐澄清。
(4)向步骤(3)得到的液体中加入4mL 5M Nacl,混匀,冷却至37℃。加10mL氯仿,轻轻混匀30分钟,离心12000rpm,15分钟。
(5)取步骤(4)离心的上清液,再次加10mL氯仿,轻轻混匀30分钟,离心12000rpm,15分钟。
(6)取步骤(5)离心的上清液,加入0.8倍体积的异丙醇,轻轻混旋至出现丝状DNA。
(7)挑出DNA于EP管中,加70%的乙醇洗,然后倒掉乙醇,自然风干,溶于一定量的双蒸水中,提取获得链霉菌的DNA。
2、DNA序列分析
将该基因组送至测序公司进行基因组测序,得到其基因组信息,对组装后的基因组,使用GeneMarkS进行编码蛋白基因的预测。随后利用antiSMASH软件预测出与聚醚化合物合成基因簇相似的区域,并综合聚醚化合物合成基因簇中蛋白在该菌编码基因数据集中比对结果,挖掘出马度米星合成相关基因。基因簇的结构如图2所示,负责化合物马度米星的生物合成,各个基因的功能分析见表1。
Figure BDA0002192926430000531
Figure BDA0002192926430000541
Figure BDA0002192926430000551
实施例3
在鉴定用于合成马度米星的基因簇后,我们接下来寻求开发用于Actinomadurasp.J1-007的遗传操作系统。首先,为了提高结合效率,我们选择了两种常用的培养基(SFM和ABB13)来选择Actinomadura sp.J1-007的最佳培养条件。在培养5天后,J1-007在ABB13上表现出比SFM更好的生长,并且15mg/L阿泊拉霉素足以抑制Actinomadura sp的生长(图5)。因此,具有15mg/L阿泊拉霉素的ABB13培养基被认为是接合子的最佳筛选条件。
我们测试了将常用质粒引入Actinomadura sp.J1-007的两种方法(两亲本和三亲本的方法)。并测试了几种质粒:包括位点特异性整合质粒pIB139,遗传不稳定质粒pJTU1278和温度敏感质粒pKC1139用于基因敲除或基因置换同源重组。结果显示只能在三亲本方法和pIB139获得成功的结合子,这表明可以在Actinomadura sp.J1-007中实现基因过表达,而基因敲除不可以。因此,我们在pIB139质粒上的多克隆位点NdeI和KpnI位点上插入了madTE基因,构建了位点整合的质粒pRL701,以在ermE*启动子下过表达madTE(图6A),预期其将II型硫酯酶的额外拷贝引入Actinomadura sp.J1-007的染色体中,通过II型硫酯酶在合成过程中作为修复功能基因可以将马度米星化合物合成过程中的从ACP上去除异常中间体以纠正聚酮化合物链延伸,从而加速PKS的合成效率,实现马度米星化合物的正确合成,用于增加马杜霉素的产量。通过PCR验证(图6B)我们获得了成功的接合子,并将其命名为RL701。
将菌株RL701和Actinomadura sp.J1-007按照实施例1的方法发酵7天。结果如图6C所示,在Actinomadura sp.J1-007中检测到5,498.4±267.2mg/L的马度米星。而菌株RL701表现出马度米星产量增加30%(7,159.6±63.2mg/L)。这是迄今为止报道的最高产量。
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不一定指的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任何的一个或多个实施例或示例中以合适的方式结合。
尽管已经示出和描述了本发明的实施例,本领域的普通技术人员可以理解:在不脱离本发明的原理和宗旨的情况下可以对这些实施例进行多种变化、修改、替换和变型,本发明的范围由权利要求及其等同物限定。
SEQUENCE LISTING
<110> 武汉臻智生物科技有限公司
<120> 马度米星化合物的生物合成基因簇及其应用
<130> PIDC3194374
<160> 39
<170> PatentIn version 3.3
<210> 1
<211> 2429
<212> PRT
<213> Artificial
<220>
<223> madA1基因编码的氨基酸序列
<400> 1
Met Pro Ser Asp Arg Leu Arg Ser Pro Gly Thr Asp Ala Ala Ala Asp
1 5 10 15
Gly Gly Ala Gly Asp Gly Ala Arg Arg Trp Phe Gly Gly Phe Leu Glu
20 25 30
Ala Val Asp Gly Phe Asp Ala Gly Phe Phe Gly Ile Ala Pro Arg Glu
35 40 45
Ala Val Ala Met Asp Pro Gln Gln Arg Leu Met Leu Glu Val Ala Trp
50 55 60
Glu Ala Leu Glu Asp Ala Gly Ile Pro Pro Glu Thr Leu Ala Gly Ser
65 70 75 80
Arg Thr Gly Val Phe Val Gly Ala Ile Trp Asp Glu Tyr Ala Ala Ile
85 90 95
Leu Arg Arg Ser Gly Pro Gly Thr Gly Glu Arg His Ala Met Thr Gly
100 105 110
Ser His Arg Ser Ile Ile Ala Asn Arg Val Ser Tyr Val His Gly Leu
115 120 125
Arg Gly Pro Ser Met Thr Ile Asp Thr Ala Gln Ser Ser Ser Leu Val
130 135 140
Ala Val His Ala Ala Cys Glu Ser Ile Arg Asn Gly Glu Cys Ser Leu
145 150 155 160
Ala Leu Ala Gly Gly Val Asn Leu Ile Leu Gly Glu Asp Gly Met Glu
165 170 175
Ala Ala Ala Ala Gln Phe Gly Gly Leu Ser Pro Asp Gly Arg Cys His
180 185 190
Thr Phe Asp Ala Arg Ala Asn Gly Phe Val Arg Gly Glu Gly Ala Ala
195 200 205
Thr Val Val Leu Lys Pro Leu Asp Ala Ala Leu Ile Asp Gly Asp Pro
210 215 220
Val Tyr Cys Val Ile Arg Gly Ser Ala Val Asn His Asp Gly Ala Ala
225 230 235 240
Asp Gly Leu Thr Val Pro Ser Ala Ala Ala Gln Glu Asp Val Leu Arg
245 250 255
Leu Ala Cys Glu Arg Ala Arg Val Ala Pro Gly Asp Val Gln Tyr Val
260 265 270
Glu Leu His Gly Thr Gly Thr Pro Val Gly Asp Pro Val Glu Ala Ala
275 280 285
Ala Val Gly Ala Ala Leu Gly Ala Ser Arg Ala His Gly Ala Pro Leu
290 295 300
Arg Val Gly Ser Ala Lys Thr Asn Val Gly His Leu Glu Gly Ala Ala
305 310 315 320
Gly Ile Val Gly Leu Leu Lys Ala Val Leu Ser Val Ala His Arg Glu
325 330 335
Leu Pro Pro Ser Leu Asn Phe Thr Ala Pro Asn Pro Arg Ile Pro Leu
340 345 350
Ala Glu Leu Gly Leu Arg Val Gln Asp Glu Leu Thr Ala Trp Pro Cys
355 360 365
Pro Asp Arg Pro Leu Ile Ala Gly Val Ser Ser Phe Gly Met Gly Gly
370 375 380
Thr Asn Cys His Val Val Leu Thr Glu Pro Pro Ala Thr Ala Ala Glu
385 390 395 400
Val Pro Ala Thr Gly Gly Ala Thr Arg Ala Asp Glu Pro Gly Thr Ala
405 410 415
Ala Gly Arg Pro Ala Ala Pro Val Pro Ser Asn Glu Thr Pro Pro Ile
420 425 430
Val Pro Trp Thr Val Ser Gly Lys Ser Ala Ala Ala Leu Arg Ala Gln
435 440 445
Ala Asp Arg Leu Arg Glu His Val Glu Asp Arg Pro Gly Leu Asp Pro
450 455 460
Ala Asp Val Gly His Ala Leu Gly Ala Thr Arg Ser Ala Phe Ala His
465 470 475 480
Arg Ala Val Val Leu Gly Gly Gly Ala Asp Glu Leu Leu Gly Arg Leu
485 490 495
Gly Ala Leu Gly Arg Gly Glu Asp Ala Pro Gly Val Val His Gly Thr
500 505 510
Val Asn Gly Gly Gly Val Ala Val Leu Phe Ser Gly Gln Gly Ser Gln
515 520 525
Arg Pro Arg Met Gly Arg Glu Leu Tyr Asp Ala His Pro Val Phe Ala
530 535 540
Ala Ala Leu Asp Glu Ala Phe Ala Cys Leu Asp Pro Met Leu Asp Arg
545 550 555 560
Pro Leu Arg Glu Val Val Trp Ser Gly Asp Ala Ala Pro Leu Asp Arg
565 570 575
Thr Ala Tyr Thr Gln Pro Ala Leu Phe Ala Val Glu Val Ala Leu Tyr
580 585 590
Arg Leu Ala Glu Ser Trp Gly Leu Arg Pro Gly His Leu Ile Gly His
595 600 605
Ser Val Gly Glu Ile Ala Ala Ala His Val Ala Gly Ala Leu Ser Leu
610 615 620
Pro Asp Ala Cys Thr Leu Val Ala Ala Arg Gly Arg Leu Met Gln Ala
625 630 635 640
Val Thr Ala Pro Gly Ala Met Ala Ala Leu Arg Ala Thr Pro Asp Glu
645 650 655
Ala Ala Glu Leu Val Ala Gly Arg Gly Gly Leu Gly Val Ala Ala Val
660 665 670
Asn Gly Pro Ala Ser Val Val Ile Ser Gly Asp Arg Asp Ala Val Arg
675 680 685
Asp Ala Ala Ala Ala Trp Arg Asp Arg Gly Gly Lys Ala Thr Leu Leu
690 695 700
Arg Val Ser His Ala Phe His Ser Pro His Met Asp Ala Met Leu Asp
705 710 715 720
Glu Leu Arg Ala Val Ala Ala Gly Leu Ala Phe Ala Ala Pro Ala Ile
725 730 735
Pro Val Val Ser Asn Val Thr Gly Leu Pro Ala Thr Ala Glu Arg Leu
740 745 750
Ala Ser Pro Asp Tyr Trp Ala Glu His Ala Arg Arg Ala Val Arg Phe
755 760 765
Met Pro Gly Val Arg Tyr Leu Cys Gly Glu Gly Val Thr Thr Phe Val
770 775 780
Glu Leu Gly Pro Asp Gly Gln Leu Ala Ala Met Ala Arg Glu Ser Phe
785 790 795 800
Pro Gly Pro Ser Thr Glu Gly Pro Asp Gly Arg Pro Arg Pro Ala Ala
805 810 815
Arg Pro Val Ala Leu Ala Ala Leu Arg Arg Asp Arg Pro Glu Thr Ala
820 825 830
Thr Phe Ala Thr Ala Met Ala Glu Ala Tyr Val Arg Gly Gly Val Asp
835 840 845
Leu Asp Trp Asp Arg Ile Phe Ala Gly Arg Ala Arg Thr Arg Val Asp
850 855 860
Leu Pro Thr Tyr Ala Phe Gln Arg Glu Arg Tyr Trp Pro Gly Ser Leu
865 870 875 880
Pro Arg Pro Glu Ala Val Pro Ala Gly Gln Pro Gly Pro Gly Pro Glu
885 890 895
Ala Gly Pro Val Leu Glu Ala Glu Pro Pro Ser Pro Leu Glu Arg Leu
900 905 910
Ala Gly Leu Ser Glu Pro Glu Arg Arg Arg Ala Leu Leu Asp Leu Val
915 920 925
Arg Thr Asn Ala Ala Val Val Leu Gly His Ala Ser Gly Thr Ala Ile
930 935 940
Asp Ala Asp Leu Thr Phe Lys Gln Leu Gly Phe Asp Ser Met Ala Ala
945 950 955 960
Gly Glu Leu Ser Glu Arg Leu Gly Ala Ala Thr Gly Leu Pro Leu Pro
965 970 975
Ala Thr Leu Thr Phe Asp His Pro Thr Pro Leu Asp Val Ala Ala His
980 985 990
Leu Asn Arg Arg Ala Thr Ala Thr Pro Asp Gly Thr Ser Arg Gln Asp
995 1000 1005
Ala Pro Ala Ala Gly Arg Ser Ala Ala Asp Asp Pro Ile Ala Val
1010 1015 1020
Val Ala Met Ser Cys Arg Tyr Pro Gly Gly Ala Thr Thr Pro Glu
1025 1030 1035
Ala Leu Trp Asp Leu Val Ala Ala Gly Arg Asp Val Ile Gly Gly
1040 1045 1050
Phe Pro Asp Asp Arg Gly Trp Asp Leu Ala Ala Leu Phe Asp Pro
1055 1060 1065
Asp Thr Asp Arg Ala Gly Thr Thr Tyr Ala Arg Glu Gly Gly Phe
1070 1075 1080
Leu Tyr Asp Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly Ile Ser
1085 1090 1095
Pro Arg Glu Ala Gln Ala Met Asp Pro Gln Gln Arg Leu Leu Leu
1100 1105 1110
Glu Thr Ser Trp Glu Ala Phe Glu Arg Ala Gly Ile Arg Leu Gly
1115 1120 1125
Ala Leu Lys Gly Ser Pro Thr Gly Val Phe Val Gly Ala Thr Ala
1130 1135 1140
Gln Asp Tyr Gly Pro Arg Leu His Glu Ala Pro Lys Gly Leu Asp
1145 1150 1155
Gly His Leu Leu Thr Gly Ala Thr Pro Ser Val Ala Ser Gly Arg
1160 1165 1170
Val Ala Phe Thr Phe Gly Leu Val Gly Pro Ala Val Thr Val Asp
1175 1180 1185
Thr Ala Cys Ser Ser Ser Leu Val Ala Met His Leu Ala Ala Gln
1190 1195 1200
Ala Leu Arg Gln Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Ala
1205 1210 1215
Thr Val Leu Ala Ala Pro Gly Met Phe Thr Gly Phe Ser Arg Gln
1220 1225 1230
Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Pro Phe Ala Ala Ala
1235 1240 1245
Ala Asp Gly Thr Gly Trp Ala Glu Gly Val Gly Leu Val Leu Leu
1250 1255 1260
Glu Arg Leu Ser Asp Ala Leu Arg Asn Gly Arg Arg Val Leu Ala
1265 1270 1275
Val Ile Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn Gly
1280 1285 1290
Leu Thr Ala Pro Asn Gly Pro Ser Gln Glu Arg Val Ile Arg Gln
1295 1300 1305
Ala Leu Ala Ser Ala Gly Leu Ser Ser Ser Asp Val Asp Ala Val
1310 1315 1320
Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala
1325 1330 1335
Gly Ala Leu Leu Ala Thr Tyr Gly Arg Gly Arg Ser Gly Gly Arg
1340 1345 1350
Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr Gln
1355 1360 1365
Ala Ala Ala Gly Val Ala Gly Val Ile Lys Thr Val Met Ala Met
1370 1375 1380
Arg Asn Gly Ser Leu Pro Ala Ser Leu His Ile Asp Glu Pro Ser
1385 1390 1395
Pro His Val Asp Trp Lys Ser Gly Ala Val Arg Leu Leu Thr Glu
1400 1405 1410
Pro Met Glu Trp Pro Thr Glu Asp Asp Arg Pro Arg Arg Ala Ala
1415 1420 1425
Val Ser Ser Phe Gly Ile Ser Gly Thr Asn Ala His Leu Ile Leu
1430 1435 1440
Glu Ala Pro Pro Glu Pro Asp Ser Glu Pro Glu Arg Pro Ala Pro
1445 1450 1455
Met Asp Val Gly Pro Leu Pro Trp Val Leu Ser Ala Arg Ser Glu
1460 1465 1470
Glu Ala Leu Arg Gly Gln Ala Ala Ala Leu Ala Ser Gln Val Arg
1475 1480 1485
Val Ser Asp Asp Ala Pro Leu Asp Val Gly Trp Ser Leu Ala Ala
1490 1495 1500
Ser Arg Ser Asn Phe Glu Arg Arg Ala Val Val Val Gly Thr Gly
1505 1510 1515
Arg Asp Glu Leu Leu Ala Gly Leu Glu Ala Val Ala Ser Gly Glu
1520 1525 1530
Val Arg Ser Val Ala Pro Ser Arg Val Gly Ala Gly Pro Val Leu
1535 1540 1545
Val Phe Pro Gly Gln Gly Ser Gln Trp Ala Gly Met Gly Ala Gly
1550 1555 1560
Leu Leu Asp Ser Ser Pro Val Phe Ala Glu Arg Val Ala Glu Cys
1565 1570 1575
Glu Arg Ala Leu Ala Pro Phe Val Asp Trp Ser Leu Ser Asp Val
1580 1585 1590
Leu Arg Gly Asp Gly Ala Glu Leu Ala Arg Val Asp Val Val Gln
1595 1600 1605
Pro Val Leu Trp Ala Thr Met Val Ser Leu Ala Ala Val Trp Ala
1610 1615 1620
Ser Tyr Gly Val Val Pro Ser Ala Val Ile Gly His Ser Gln Gly
1625 1630 1635
Glu Ile Ala Ala Ala Cys Val Ala Gly Ala Leu Ser Leu Glu Ala
1640 1645 1650
Gly Ala Arg Ile Val Ala Leu Arg Ser Arg Ala Leu Arg Arg Leu
1655 1660 1665
Ala Gly Gly Gly Ala Met Ala Ser Leu Gly Val Gly Val Asp Ala
1670 1675 1680
Ala Gln Glu Phe Leu Ser Ala Asp Val Ala Val Ala Ala Val Asn
1685 1690 1695
Gly Pro Ser Ser Thr Val Val Ser Gly Pro Pro Glu Gln Val Thr
1700 1705 1710
Ala Ala Val Ala Arg Val Glu Ala Ala Gly Tyr Arg Ala Arg Leu
1715 1720 1725
Ile Asp Val Asp Tyr Ala Ser His Gly Pro Gln Val Asp Glu Ile
1730 1735 1740
Thr Asp Glu Leu His Glu Val Leu Ala Gly Ile Glu Pro Ala Pro
1745 1750 1755
Thr Thr Glu Val Ala Phe Tyr Ser Thr Val Ser Gly Gly Arg Ile
1760 1765 1770
Asp Thr Thr Ser Met Asp Thr Gly Tyr Trp Val Thr Asn Leu Arg
1775 1780 1785
Glu Arg Val Arg Phe Ala Asp Ala Val Arg Glu Leu Ala Gly Asp
1790 1795 1800
Gly His Arg Val Phe Ile Glu Ala Ser Pro His Pro Val Leu Thr
1805 1810 1815
Leu Gly Thr Gln Glu Thr Leu Glu Glu Ala Gly Thr Val Ala Thr
1820 1825 1830
Val Val Pro Thr Leu Arg Arg Asp His Gly Gly Arg Arg Gln Leu
1835 1840 1845
Ala Asp Ala Leu Gly Gln Ala Phe Ala Ala Gly Val Glu Val Asp
1850 1855 1860
Trp Ser Arg Trp Phe Thr Gly Asp Arg Ala Pro Arg Leu Val Glu
1865 1870 1875
Leu Pro Thr Tyr Ala Phe Gln Arg Arg Arg Tyr Trp Leu Asp Gly
1880 1885 1890
Arg Ser Gly Pro Gly Gly Asp Pro Glu Ser Leu Gly Leu Ala Ser
1895 1900 1905
Ala Gly His Arg Leu Leu Gly Ala Ala Val Glu His Ala Gly Ala
1910 1915 1920
Asp Ala Trp Leu Leu Thr Gly Arg Leu Ser Thr Arg Thr Gln Pro
1925 1930 1935
Trp Leu Ala Gln His Ser Val Ala Gly Thr Val Val Val Pro Gly
1940 1945 1950
Ala Ala Leu Val Glu Trp Ala Leu Arg Ala Ala Asp Glu Ala Gly
1955 1960 1965
Cys Gly Gly Val Asp Glu Leu Thr Pro Arg Val Pro Leu Val Leu
1970 1975 1980
Ala Gly Pro Glu Pro Val Arg Val Gln Val Ala Val Asp Ala Ala
1985 1990 1995
Ala Asp Asp Gly Thr Arg Asp Val Arg Val Tyr Ser Arg Pro Asp
2000 2005 2010
Gly Asp Glu Glu Ala Glu Trp Thr Cys His Ala Glu Gly Val Leu
2015 2020 2025
Ser Ala Gly Ser Pro Glu Pro Ala Pro Glu Pro Glu Pro Gly Gly
2030 2035 2040
Gly Gly Ala Trp Pro Pro Pro Gly Ala Glu Pro Leu Ala Ala Asp
2045 2050 2055
Asp Val Tyr Glu Arg Leu Ala Glu Ala Gly Tyr Asp Tyr Gly Gln
2060 2065 2070
Ala Phe Gln Gly Val Arg Ala Leu Trp Arg Gln Gly Ala Asp Leu
2075 2080 2085
Leu Ala Asp Leu Val Leu Pro Glu Ala Ala Gly Gly Pro Gly Gly
2090 2095 2100
Phe Gly Ile His Pro Ala Leu Leu Asp Ala Ala Leu His Pro Ser
2105 2110 2115
Leu Leu Asp Ala Gly Pro Ala Gly Asp Gly Arg Val Trp Gln Pro
2120 2125 2130
Phe Ala Trp Ser Gly Val Ser Leu Trp Ala Ala Glu Ala Thr Ala
2135 2140 2145
Val Arg Val Arg Leu Thr Pro His His Gly Glu Gln Gly Val Gln
2150 2155 2160
Val Ala Val Ala Val Thr Asp Ala Leu Gly Gly Pro Val Leu Asp
2165 2170 2175
Val Ala Ser Val Val Met Arg Pro Ala Asp Val Arg Trp Leu Arg
2180 2185 2190
Glu Gly Gly His Arg Ala Gly Arg Arg Ala Ser Thr Ala Ala Gly
2195 2200 2205
Gly Thr Thr Arg Val Arg Arg Ala Val Ala Ala Gly Gly Gly Asn
2210 2215 2220
Gly Ser Ser Val Asp Trp Ala Asp Arg Leu Ala Gly Leu Ala Glu
2225 2230 2235
Asp Glu Arg Arg Arg Leu Val Leu Asp Leu Val Arg Thr His Ala
2240 2245 2250
Ala Thr Val Leu Gly His Ala Glu Pro Asp Ala Val Pro Thr Gly
2255 2260 2265
Thr Ser Phe Lys Glu Met Gly Phe Glu Ser Leu Thr Ala Val Glu
2270 2275 2280
Leu Arg Asp Arg Leu Ala Ala Ala Thr Gly Leu Arg Leu Pro Ala
2285 2290 2295
Ala Leu Val Phe Arg Tyr Pro Thr Pro Asp Gly Ile Ala Asp His
2300 2305 2310
Leu Val Glu Arg Leu Val Ala Lys Glu Gln Thr Pro Pro Asn Gly
2315 2320 2325
Arg Asn Gly Gly Arg Ala Gly Glu Gly Gly Asn Gly Gly Glu Arg
2330 2335 2340
Gly Asn Gly Asn Gly His Arg Ser Val Thr Gly Val Gly Ser Val
2345 2350 2355
Leu Gly Glu Leu Asp Arg Leu Glu Asp Thr Leu Ala Gly Phe Val
2360 2365 2370
Pro Glu Asn Gly Asp Ser Gly Ala Val Thr Ala Arg Leu Glu Arg
2375 2380 2385
Leu Leu Thr Lys Trp Lys Ala Ala Ala Arg Asn Pro Ala Gly Glu
2390 2395 2400
Pro Ser Ala Ala Glu Arg Leu Glu Ser Ala Ser Thr Asp Gln Val
2405 2410 2415
Leu Asp Phe Ile Glu Asn Glu Leu Gly Val Ser
2420 2425
<210> 2
<211> 2217
<212> PRT
<213> Artificial
<220>
<223> madA2基因编码的氨基酸序列
<400> 2
Met Ala Asn Glu Glu Lys Leu Val Asp Tyr Leu Lys Arg Val Ser Ala
1 5 10 15
Glu Leu His Asp Thr Arg Arg Arg Leu Ser Glu Val Glu Asp Arg Ala
20 25 30
Asn Glu Pro Ile Ala Val Val Gly Met Ala Cys Arg Phe Pro Gly Glu
35 40 45
Val Arg Ser Pro Glu Asp Leu Trp Asp Leu Leu Val Ser Gly Lys Asp
50 55 60
Ala Ile Gly Asp Phe Pro Asp Asp Arg Gly Trp Asp Leu Asp Gly Leu
65 70 75 80
Tyr His Pro Asp Pro Ala His Pro Gly Thr Ser Ser Thr Arg Gln Gly
85 90 95
Gly Phe Leu Tyr Asp Ile Asp Arg Phe Asp Ala Gly Phe Phe Gly Ile
100 105 110
Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Val Leu Leu
115 120 125
Glu Thr Ala Trp Glu Thr Phe Glu Arg Ala Gly Ile Asp Pro Phe Ser
130 135 140
Leu Lys Gly Thr Ser Thr Gly Val Tyr Ala Gly Val Ser Ser Gln Asp
145 150 155 160
Tyr Leu Ala Arg Ala Pro Arg Ile Pro Glu Gly Phe Glu Gly Tyr Ala
165 170 175
Thr Thr Gly Ser Leu Thr Ser Val Val Ser Gly Arg Val Ala Phe Thr
180 185 190
Leu Gly Leu Gln Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ala
195 200 205
Ser Leu Val Ala Ile His Leu Ala Cys Gln Ala Leu Arg Gln Gly Glu
210 215 220
Cys Thr Leu Ala Leu Gly Gly Gly Val Cys Gly Leu Ala Thr Pro Val
225 230 235 240
Met Phe Thr Glu Phe Ser Arg Gln Arg Ala Leu Ser Pro Asp Gly Arg
245 250 255
Cys Lys Ser Phe Ala Ala Asp Ala Asp Gly Thr Gly Phe Ser Glu Gly
260 265 270
Val Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly
275 280 285
His Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly
290 295 300
Ala Ser Asn Gly Leu Thr Ala Pro Asn Asp Val Ala Gln Glu Gln Val
305 310 315 320
Ile Glu Arg Ala Leu Thr Ala Ala Arg Leu Ala Pro Ser Glu Val Asp
325 330 335
Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro Ile Glu
340 345 350
Ala Glu Ala Leu Ile Ala Thr Tyr Gly Arg Asp Arg Pro Ala Glu Arg
355 360 365
Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr His Ala
370 375 380
Ala Ala Gly Val Ala Gly Leu Ile Lys Met Val Met Ala Leu Arg Asn
385 390 395 400
Glu Thr Leu Pro Ala Asn Leu His Leu Ser Glu Pro Thr Pro His Val
405 410 415
Asp Trp Asp Thr Gly Ala Val Arg Leu Leu Thr Ala Pro Val Pro Trp
420 425 430
Pro Arg Asp Asp Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Ile
435 440 445
Ser Gly Thr Asn Ala His Leu Ile Val Glu Glu Ala Pro Glu Pro Ala
450 455 460
Asp Pro Ser Glu Ala Ala Ala Pro Pro Glu Pro Asp Ser Gly Ala Val
465 470 475 480
Val Pro Trp Val Leu Ser Gly Arg Ser Glu Glu Ala Leu Arg Gly Gln
485 490 495
Ala Ala Ala Leu Ala Ala Arg Val Gly Gly Ser Glu Gly Trp Ala Ala
500 505 510
Ala Asp Val Gly Trp Ser Leu Val Thr Thr Arg Ser Val Phe Glu His
515 520 525
Arg Ala Val Val Val Gly Ser Glu Arg Asp Glu Leu Leu Asp Gly Leu
530 535 540
Lys Ala Leu Ala Ala Gly Glu Ala His Pro Asn Val Val His Pro Gly
545 550 555 560
Ser Pro Ala Val Ser Gly Ala Asn Ala Arg Gly Pro Val Leu Val Phe
565 570 575
Pro Gly Gln Gly Ser Gln Trp Ala Gly Met Gly Ala Glu Leu Leu Asp
580 585 590
Ser Ser Pro Val Phe Ala Ala Arg Ile Ala Glu Cys Glu Cys Ala Leu
595 600 605
Ala Pro Phe Val Asp Trp Ser Leu Ser Asp Val Leu Arg Gly Asp Gly
610 615 620
Ala Glu Leu Thr Arg Val Asp Val Val Gln Pro Val Leu Trp Ala Thr
625 630 635 640
Met Val Ser Leu Ala Ala Val Trp Ala Ser Tyr Gly Val Val Pro Ser
645 650 655
Ala Val Ile Gly His Ser Gln Gly Glu Ile Ala Ala Ala Cys Val Ala
660 665 670
Gly Ala Leu Ser Leu Glu Asp Gly Ala Arg Ile Val Ala Val Arg Gly
675 680 685
Gln Glu Leu Arg Arg Leu Ser Gly Arg Gly Ala Met Ala Ser Leu Gly
690 695 700
Val Gly Glu Glu Arg Ala Gly Glu Leu Val Glu Gly Asp Val Thr Val
705 710 715 720
Ala Ala Val Asn Gly Pro Ser Ser Thr Val Val Ser Gly Pro Pro Glu
725 730 735
Gln Val Ala Ala Val Val Ala Arg Ala Glu Ala Ala Gly His Arg Ala
740 745 750
Arg Thr Ile Asp Val Asp Tyr Ala Ser His Gly Pro Gln Ile Glu Leu
755 760 765
Ile Gly Thr Glu Leu Thr Glu Lys Leu Ala Gly Val Arg Pro Val Arg
770 775 780
Ala Asp Val Gly Phe Tyr Ser Thr Val Thr Gly Gly Arg Ile Asp Thr
785 790 795 800
Ala Gly Leu Asp Thr Val Tyr Trp Val Thr Asn Leu Arg Leu Arg Val
805 810 815
Arg Phe Ala Glu Thr Val Arg Ala Leu Leu Asp Asp Gly His Arg Val
820 825 830
Phe Ile Glu Ala Ser Pro His Pro Val Leu Thr Val Gly Leu Glu Glu
835 840 845
Cys Phe Glu Gln Ala Gly Thr Ala Ala Val Ala Val Pro Thr Leu Arg
850 855 860
Arg Gly Asp Gly Gly Ala Gly Gln Val Ala Lys Ala Ala Gly Gln Ala
865 870 875 880
Phe Ala Ala Gly Ala Arg Val Asp Trp Thr Gly Trp Phe Arg Gly Arg
885 890 895
Pro Ala Ala Pro Arg Val Val Asp Leu Pro Thr Tyr Ala Phe Gln Arg
900 905 910
Glu Arg Tyr Trp Leu Glu Gly Leu Asn Gly Asp Gly Gly Asp Pro Ala
915 920 925
Gly Leu Gly Leu Val Pro Ala Gly His Pro Leu Leu Gly Ala Ala Val
930 935 940
Glu Val Ala Asp Gly Asp Ile His Leu Leu Thr Gly Arg Leu Pro Gly
945 950 955 960
Asn Gly Asn Ala Gly Trp Leu Ala Asp His Arg Val Leu Gly Thr Val
965 970 975
Leu Val Pro Gly Thr Ala Leu Leu Glu Trp Ala Leu Arg Ala Ala Asp
980 985 990
Glu Ala Gly Cys Gly Ser Val Glu Glu Leu Val Leu His Glu Pro Met
995 1000 1005
Val Leu Pro Gly Gly Ser Gly Gly Ser Gly Gly Leu Arg Val Gln
1010 1015 1020
Val Val Val Gly Ala Ala Gly Glu Asp Gly Arg Arg Glu Val Arg
1025 1030 1035
Val Ser Ser Arg Pro Gly Gln Glu Asp Ser Ala Gly Glu Arg Gly
1040 1045 1050
Gly Glu Ala Arg Asp Trp Val Cys His Ala Ser Gly Val Leu Ala
1055 1060 1065
Pro Glu Thr Thr Val Pro Ala Pro Gly Leu Asp Gly Ile Trp Pro
1070 1075 1080
Pro Pro Gly Ala Glu Ala Val Asp Val Ser Asp Val Tyr Glu Arg
1085 1090 1095
Val Ala Ala Ser Gly Tyr Glu Tyr Gly Pro Ala Phe Gln Gly Leu
1100 1105 1110
Arg Ala Leu Trp Arg His Gly Thr Asp Leu Leu Ala Glu Val Ala
1115 1120 1125
Leu Pro Asp Ala Ala Gly Lys Pro Asp Gly Phe Gly Val His Pro
1130 1135 1140
Ala Leu Leu Asp Ala Ala Leu His Pro Ala Leu Leu Glu Asp Gly
1145 1150 1155
Asp Asp Asp Ala Glu Gly Gly Gly Glu Arg Arg Val Trp Leu Pro
1160 1165 1170
Phe Ala Trp Asn Gly Val Ser Leu Arg Ala Asp Gly Ala Thr Ala
1175 1180 1185
Val Arg Val Arg Leu Ser Pro Glu Gly Asp Gly Ala Asp Glu Arg
1190 1195 1200
Lys Val Arg Val Thr Val Ala Asp Ala Val Gly Gly Pro Val Leu
1205 1210 1215
Asp Ala Glu Ser Val Val Met Arg Pro Ala Asp Ile Asp Arg Leu
1220 1225 1230
Lys Ser Ala Gly Arg Pro Gly Ala Gly Gly Ala Glu Gly Leu Phe
1235 1240 1245
Thr Val Glu Trp Thr Ala Leu Pro Glu Pro Ala Ala Glu Pro Ser
1250 1255 1260
Ala Glu Asp Asp Thr Asp His Val Thr Leu Ala Pro Thr Ala Ser
1265 1270 1275
Ala Glu Asp Gly Ala Ser Tyr Ala Gly Leu Asp Gly Leu Ala Ala
1280 1285 1290
Ala Leu Asp Gly Gly Leu Pro Ala Pro Ser Val Val Leu Ala Glu
1295 1300 1305
Val Gly Ala Val Ala Asp Ala Leu Ala Leu Val Gln Arg Trp Leu
1310 1315 1320
Ala Glu Pro Arg Leu Ser Glu Ala Arg Leu Ala Val Val Thr Arg
1325 1330 1335
Gly Ala Val Ala Cys Glu Asn Pro Val Pro Asp Gly Ala Gly Val
1340 1345 1350
Trp Gly Leu Val Arg Ser Ala Gln Ala Glu Asn Pro Gly Arg Phe
1355 1360 1365
Val Leu Leu Asp Val Gly Ala Glu Ala Asp Ala Gly Ala Asp Ala
1370 1375 1380
Gly Ser Arg Ala Glu Gln Ala Glu Ala Val Arg Arg Ala Val Arg
1385 1390 1395
Ala Gly Glu Pro Gln Val Ala Trp Arg Asp Gly Arg Leu Leu Val
1400 1405 1410
Pro Arg Trp Val Arg Ala Gly Glu Pro Val Gly Ile Val Pro Pro
1415 1420 1425
Pro Gly Glu Arg Ala Trp Arg Leu Gly Leu Ser Gly Thr Ala Thr
1430 1435 1440
Leu Glu Asn Val Ser Ala Thr Ala Cys Pro Glu Ala Leu Glu Pro
1445 1450 1455
Leu Glu Pro Gly Gln Val Arg Ile Asp Val His Ala Ala Gly Val
1460 1465 1470
Asn Phe Arg Asp Val Leu Ile Ala Leu Gly Met Tyr Pro Gly Asp
1475 1480 1485
Ala Ala Phe Gly Gly Ser Glu Gly Ala Gly Val Val Thr Glu Val
1490 1495 1500
Gly Pro Asp Val Ala Gly Leu Ala Ala Gly Asp Arg Val Met Gly
1505 1510 1515
Leu Phe Asp Gly Ala Phe Gly Ser Val Ala Val Ala Asp Ala Arg
1520 1525 1530
Met Val Ala Pro Val Pro Asp Gly Trp Asp Leu Arg Gln Ala Ala
1535 1540 1545
Ala Ala Pro Val Ala Phe Leu Thr Ala Trp Tyr Gly Leu Val His
1550 1555 1560
Leu Gly Ala Leu Arg Arg Gly Asp Ser Val Leu Val His Ala Ala
1565 1570 1575
Thr Gly Gly Val Gly Met Ala Ala Val Gln Val Ala Arg His Val
1580 1585 1590
Gly Ala Glu Val Tyr Ala Thr Ala Gly Pro Ala Lys His Gly Val
1595 1600 1605
Leu Asp Gly Met Gly Ile Asp Glu Ala His Arg Ala Ser Ser Arg
1610 1615 1620
Asp Leu Asp Phe Glu Asp Ala Val Arg Arg Ala Thr Gly Gly Arg
1625 1630 1635
Gly Val Asp Val Val Leu Asn Ser Leu Ser Gly Pro Phe Thr Asp
1640 1645 1650
Ala Ser Leu Arg Leu Leu Ala Asp Gly Gly Arg Phe Val Glu Met
1655 1660 1665
Gly Lys Thr Asp Val Arg Ala Pro Asp Pro Ala Ser Asp Gly Met
1670 1675 1680
Ala Glu Gly Val Ala Tyr Arg Ala Phe Asp Leu Leu Ala Asp Ala
1685 1690 1695
Gly Pro Asp Arg Ile Ala Glu Met Leu Ser Glu Leu Ala Gly Leu
1700 1705 1710
Phe Ser Ser Gly Ala Leu Arg Pro Leu Pro Val Arg Ala Trp Pro
1715 1720 1725
Leu Gly Arg Ala Arg Asp Ala Leu Arg His Met Ser Gln Ala Arg
1730 1735 1740
His Thr Gly Lys Leu Val Leu Asp Val Pro Ala Thr Pro Asp Pro
1745 1750 1755
Asp Gly Thr Val Leu Ile Thr Gly Gly Thr Gly Thr Ile Gly Gly
1760 1765 1770
Glu Val Ala Val His Leu Ala Arg Asp Trp Arg Val Arg His Leu
1775 1780 1785
Leu Leu Leu Ser Arg Ser Gly Pro Ser Ala Pro Gly Ala Asp Ala
1790 1795 1800
Leu Ala Glu Arg Ile Arg Glu Leu Gly Ala Glu Ala Ala Ile Lys
1805 1810 1815
Ala Val Asp Val Ala Asp Pro Val Ala Leu Asp Glu Ala Ile Ala
1820 1825 1830
Ala Ile Asp Pro Ala His Pro Leu Thr Gly Ile Val His Ala Ala
1835 1840 1845
Gly Thr Leu Asp Asn Ala Leu Val Thr Ser Leu Thr Pro Asp Ala
1850 1855 1860
Leu Ala Gly Val Trp Asp Val Lys Ala Gly Ala Ala Ala His Leu
1865 1870 1875
His Ala Ala Thr Ala Gly Thr Arg Leu Gly Leu Phe Ala Val Phe
1880 1885 1890
Ser Ser Phe Ala Ser Thr Leu Gly Thr Pro Gly Gln Ala Asn Tyr
1895 1900 1905
Ala Ala Ala Asn Ala Tyr Cys Asp Ala Leu Ala Ala Ala Arg Arg
1910 1915 1920
Ala Glu Gly Leu Pro Gly Leu Ser Val Ala Trp Gly Leu Trp Ala
1925 1930 1935
Thr Ala Ser Gly Leu Thr Gly Gly Leu Thr Glu Ala Asp Thr Ala
1940 1945 1950
Arg Ile His Arg Tyr Gly Ile Arg Pro Asn Gly Thr Glu Glu Gly
1955 1960 1965
Leu Ala Leu Leu Asp Ala Ala Gln Arg His Gly Gly Pro Glu Val
1970 1975 1980
Leu Ala Leu Asn Phe Asp Ala Arg Glu Leu Ala Gly Arg Pro Pro
1985 1990 1995
Asp Thr Leu Pro Ala Pro Leu Arg Gly Leu Ala Ala Thr Ala Ala
2000 2005 2010
Ala Lys Pro Gly Gly Val Pro Gly Arg Pro Thr Ala Ala Gly Gly
2015 2020 2025
Glu Arg Pro Thr Asp Trp Thr Ala Arg Leu Ala Gly Leu Pro Pro
2030 2035 2040
Ser Glu Arg His Arg Thr Leu Leu Asn Leu Val Arg Thr His Ala
2045 2050 2055
Ala Ala Val Leu Gly His Ala Asp Ala Gly Leu Val Arg Ala Asp
2060 2065 2070
Ala Ser Phe Lys Glu Leu Gly Phe Asp Ser Leu Thr Gly Val Glu
2075 2080 2085
Leu Arg Asn Arg Leu Ser Ala Ala Thr Gly Leu Arg Leu His Ala
2090 2095 2100
Ala Leu Val Phe Asp His Pro Glu Ala Gly Ala Leu Ala Asp His
2105 2110 2115
Leu Leu Glu Leu Leu Ala Pro Asp Gly Ala Pro Pro Ala Gly Ala
2120 2125 2130
Asp Ala Thr Ala Pro Val Leu Gly Glu Val Ala Arg Leu Glu Ser
2135 2140 2145
Thr Leu Ser Ala Val Ala Ala Gly Ala Val Ser Ala Asp Ala Val
2150 2155 2160
Thr Ala Arg Leu Glu Asp Leu Leu Ala Asn Trp Lys Ala Ala His
2165 2170 2175
Ala Pro Asp Gly Gly Thr Gly Gly Asp Gly Asp Ala Asp Gly Ala
2180 2185 2190
Glu Glu Arg Leu Arg Val Ala Asp Ala Gly Gln Ile Leu Asp Phe
2195 2200 2205
Ile Asp Asn Glu Leu Gly Val Arg Pro
2210 2215
<210> 3
<211> 4043
<212> PRT
<213> Artificial
<220>
<223> madA3基因编码的氨基酸序列
<400> 3
Val Asn Glu Glu Lys Leu Val Asp Tyr Leu Arg Arg Val Thr Thr Asp
1 5 10 15
Leu His His Thr Arg Lys Arg Leu Ser Glu Val Glu Asp Arg Gln Arg
20 25 30
Glu Pro Val Ala Val Val Asp Met Ala Cys Arg Phe Pro Gly Gly Val
35 40 45
Arg Ser Pro Glu Asp Leu Trp Glu Leu Val Ala Ser Gly Arg Asp Ala
50 55 60
Met Gly Ala Phe Pro Glu Asp Arg Gly Trp Asp Leu Asp Arg Leu Phe
65 70 75 80
His Pro Asp Pro Asp His Pro Gly Thr Ser Tyr Ala Arg Ala Gly Gly
85 90 95
Phe Leu Tyr Asp Ala Ala Glu Phe Asp Pro Glu Phe Phe Gly Ile Ser
100 105 110
Pro Arg Glu Ala Leu Val Leu Asp Pro Gln Gln Arg Leu Leu Leu Glu
115 120 125
Val Ser Trp Glu Ala Met Glu Arg Ala Gly Ile Asp Pro Ser Thr Leu
130 135 140
Lys Gly Ser Arg Thr Gly Val Tyr Ala Gly Ala Ala Leu Pro Gly Phe
145 150 155 160
Gly Thr Pro His Ile Asp Pro Ser Ala Glu Gly His Leu Val Thr Gly
165 170 175
Asn Ala Pro Ser Val Leu Ser Gly Arg Leu Ser Tyr Thr Phe Gly Leu
180 185 190
Glu Gly Pro Ala Val Thr Ile Asp Thr Ala Cys Ser Ser Ser Leu Val
195 200 205
Ala Ile His Leu Ala Cys His Ala Leu Arg Gln Gly Glu Cys Thr Leu
210 215 220
Ala Leu Ala Gly Gly Val Thr Val Met Ala Val Pro Asn Val Phe Thr
225 230 235 240
Glu Phe Ala Arg Gln Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Pro
245 250 255
Phe Ala Glu Ala Ala Asp Gly Thr Ala Phe Ala Glu Gly Ala Gly Met
260 265 270
Leu Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Pro Val
275 280 285
Ala Ala Val Ile Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn
290 295 300
Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Arg Gln
305 310 315 320
Ala Leu Glu Ala Ala Arg Leu Ala Pro Ser Glu Val Asp Ala Val Glu
325 330 335
Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro Ile Glu Ala Lys Ala
340 345 350
Leu Leu Ser Val Tyr Gly Ala Asp Arg Asp Ala Gly Arg Pro Leu Arg
355 360 365
Leu Gly Ser Cys Lys Ser Asn Leu Gly His Thr Gln Gly Ala Ala Gly
370 375 380
Val Ala Gly Val Ile Lys Met Val Leu Ala Leu Arg His Glu Arg Leu
385 390 395 400
Pro Ala Thr Leu His Val Asp Arg Pro Thr Pro His Val Asp Trp Asp
405 410 415
Ser Gly Ala Val Arg Leu Leu Thr Glu Pro Ala Gly Trp Pro Arg Gly
420 425 430
Gly Asp Arg Ala Arg Arg Ala Gly Val Ser Ser Phe Gly Ile Ser Gly
435 440 445
Thr Asn Ala His Leu Ile Leu Glu Glu Pro Pro Ala Glu Asn Ala Gln
450 455 460
Asp Ala Glu His Ala Glu His Ala Glu His Ala Asp Asp Ala Pro Ala
465 470 475 480
Ala Ser Ala Gly Arg Ala Ala Asn Glu Ala Arg Gly Pro Ile Pro Trp
485 490 495
Val Leu Ser Gly Arg Thr Gly Pro Ala Leu Ala Glu Gln Ala Ala Ala
500 505 510
Leu Ala Glu His Val Ala Ala Arg Pro Glu Leu Pro Ala Ala Asp Val
515 520 525
Ala Trp Ser Leu Ala Arg Thr Arg Thr Pro Phe Glu His Arg Ala Val
530 535 540
Val Leu Gly Asp Asp Arg Asp Ala Leu Arg Asp Gly Leu Ala Ala Leu
545 550 555 560
Ala Asp Gly Arg Asp His Pro Asp Val Thr Arg Ala Ala Gly Pro Ala
565 570 575
Gly Pro Gly Gly Thr Ala Phe Met Phe Thr Gly Gln Gly Ser Gln Arg
580 585 590
Pro Gly Met Gly Arg Asp Leu His Arg Thr Phe Ala Ala Phe Ala Glu
595 600 605
Ala Phe Asp Glu Ala Ala Ala His Leu Asp Pro Leu Leu Gly Arg Ser
610 615 620
Leu Arg Asp Leu Val Phe Ala Asp Gly Gly Thr Ala Glu Ala Ala Asp
625 630 635 640
Leu Asp Arg Thr Gly Val Thr Gln Ala Ala Leu Phe Ala Val Glu Thr
645 650 655
Ala Leu Phe Arg Leu Val Glu Ser Phe Gly Val Val Pro Ala Tyr Leu
660 665 670
Thr Gly His Ser Val Gly Glu Ile Val Ala Ala His Val Ala Gly Val
675 680 685
Leu Ser Leu Pro Asp Ala Cys Ala Leu Val Ala Ala Arg Gly Gly Leu
690 695 700
Met Gln Ala Leu Pro Glu Gly Gly Ala Met Val Ala Val Glu Ala Pro
705 710 715 720
Glu Asp Glu Val Ala Pro Leu Leu Ala Gly Leu Glu Asp Arg Val Ser
725 730 735
Leu Ala Ala Val Asn Gly Pro Ala Ser Val Val Val Ser Gly Asp Gly
740 745 750
Thr Ala Val Glu Glu Ala Val Arg Pro Leu Arg Glu Arg Gly Leu Arg
755 760 765
Val Arg Arg Leu Arg Val Ser His Ala Phe His Ser Pro Leu Leu Gly
770 775 780
Pro Met Leu Asp Gly Phe Arg Glu Val Val Ser Gly Leu Ala Phe Gly
785 790 795 800
Pro Pro Glu Ile Pro Leu Ile Ser Asn Leu Thr Gly Glu Pro Ala Asp
805 810 815
Ala Gly Leu Leu Gly Asp Pro Glu Tyr Trp Val Arg His Ala Arg Arg
820 825 830
Thr Val Arg Phe His Asp Gly Leu Arg Thr Leu Arg Ala Glu Gly Val
835 840 845
Thr Arg Tyr Leu Glu Leu Gly Pro Asp Pro Val Leu Thr Ala Met Ala
850 855 860
Gln Lys Ser Leu Ala Asp Asp Gly Pro Ala Pro Ala Phe Ala Ala Ala
865 870 875 880
Met Arg Lys Gly Arg Asp Glu Pro Arg Thr Phe Leu Thr Ala Leu Ala
885 890 895
Ala Thr His Val Asp Gly Gly Asp Ala Asp Phe Ala Ala Ala Leu Pro
900 905 910
Ala Gly Ala Gly Arg Val Pro Leu Pro Thr Tyr Arg Phe Gln Arg Arg
915 920 925
Arg Tyr Trp Arg Pro Val Gln Asp Ala Ala Ala Glu Val Ser Ala Ser
930 935 940
Gly Leu Arg Pro Thr Gly His Pro Leu Leu Gln Ala Ala Val Glu Pro
945 950 955 960
Ala Asp Gly Gly Leu Leu Leu Ala Gly Arg Leu Ser Pro Arg Thr His
965 970 975
Gly Trp Leu Ala Asp His Ala Ile Ala Asp Gly Ala Pro Leu Pro Ala
980 985 990
Thr Ala Phe Leu Glu Phe Ala Leu Val Ala Ala Arg Arg Thr Gly Cys
995 1000 1005
Asp Ala Val Glu Asp Leu Ser Leu Glu Thr Pro Leu Pro Leu Pro
1010 1015 1020
Asp Ala Glu Pro Val Asp Leu Gln Val Ala Val Gly Gly Pro Asp
1025 1030 1035
Gly Ala Gly Arg Arg Pro Val Thr Ile Tyr Ser Arg Pro Val Asp
1040 1045 1050
Asp Ala His Asp Asp Pro Asp Ala Pro Ala Ala Ala Arg Leu Asp
1055 1060 1065
Asp Leu Gly Pro Ala Asn Gly Trp Arg Arg His Ala Ala Gly Thr
1070 1075 1080
Leu Ala Gly Pro Asp Pro Asp Pro Asp Pro Ala Pro Ala Ala Pro
1085 1090 1095
Ala Gln Ser Ala Ala Trp Pro Pro Glu Gly Ala Asp Glu Ile Asp
1100 1105 1110
Ala Ala Glu Leu Tyr Glu Arg Leu Ala Asp Gln Gly Tyr Arg Tyr
1115 1120 1125
Gly Pro Ala Phe Arg Gly Val Arg Ala Ala Trp Arg Leu Gly Asp
1130 1135 1140
Glu Leu Phe Ala Glu Val Arg Leu Ala Pro Asp Gln His Ala Asp
1145 1150 1155
Ala Ala Ala Tyr Ala Val His Pro Ala Leu Leu Asp Ser Ala Leu
1160 1165 1170
His Pro Val Asp Glu Leu Tyr Glu His Asp Gly Ser Thr Val Arg
1175 1180 1185
Leu Pro Phe Ser Phe Gly Gly Val Arg Leu His Gly Thr Gly Arg
1190 1195 1200
Thr Arg Leu Arg Val Arg Ile Thr Pro Gly Asp Ala Asp Val Leu
1205 1210 1215
Ser Leu Ala Leu Thr Asp Asp Asp Gly Ala Ala Val Ala Ser Ile
1220 1225 1230
Gly Ser Leu Gly Leu Arg Lys Val Pro Ala Gly Arg Trp Asn Gly
1235 1240 1245
Ala Ala Pro Ala Asp Asp Ala Leu His Arg Leu Ala Trp His Pro
1250 1255 1260
Leu Pro Val Pro Asp Gly Gly Pro Gly Thr Ala Arg Pro Ala Asp
1265 1270 1275
Asp Trp Ala Ala Ile Ala Ala Ala Gly Pro Asp Ala Thr Gly Pro
1280 1285 1290
Ala Ala Asp Gly Leu Ser Ala Asp Gly Leu Gly Val Val Ala Pro
1295 1300 1305
Asp Ala Val His Pro Asp Leu Ala Ala Leu Lys Thr Ala Leu Ala
1310 1315 1320
Asp Gly Arg Pro Thr Pro Ala Val Val Leu Leu Pro Leu Pro Thr
1325 1330 1335
Gly Gly Pro Asp Ala Ala Glu Pro Val Ala Arg Ala His Asp Ala
1340 1345 1350
Ala Arg Arg Gly Leu Glu Leu Leu Gln Glu Trp Leu Glu Glu Glu
1355 1360 1365
Arg Phe Ala Ala Ala Arg Leu Ile Ala Val Thr Arg Gly Ala Val
1370 1375 1380
Ala Ala Thr Gly Asp Asp Pro Pro Ser Asp Leu Ala Gly Ala Pro
1385 1390 1395
Leu Trp Gly Leu Val Arg Ala Ala Gln Ala Glu Tyr Pro Asp Arg
1400 1405 1410
Ile Val Leu Leu Asp Leu Asp Gly Glu Asp Ala Ser Arg Asp Val
1415 1420 1425
Leu Arg Gly Ala Val Ala Ala Ala Met Glu Asn Gly Glu Ser Gln
1430 1435 1440
Leu Ala Val Arg Ser Gly Asp Val Leu Val Pro Arg Leu Val Asp
1445 1450 1455
Ala Ala Glu Arg Asp Ala Asp Gly Thr Ala Pro Ala Leu Asp Pro
1460 1465 1470
Asp Gly Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly Arg
1475 1480 1485
Leu Val Ala Arg His Leu Val Thr Ala His Gly Ala Arg His Leu
1490 1495 1500
Leu Leu Thr Gly Arg Arg Gly Pro Ala Ala Asp Gly Ala Ala Glu
1505 1510 1515
Leu Glu Ala Glu Leu Ala Gly Leu Gly Ala Asp Val Arg Val Ala
1520 1525 1530
Ala Cys Asp Ala Ala Asp Pro Asp Ala Leu Ala Ala Leu Leu Ala
1535 1540 1545
Ser Val Pro Ala Ala His Pro Leu Thr Ala Val Val His Ala Ala
1550 1555 1560
Gly Ile Val Asp Asp Gly Val Ile Thr Ser Leu Thr Pro Gly Arg
1565 1570 1575
Leu Ala Ala Val Leu Ala Pro Lys Ala Asp Ala Ala Trp His Leu
1580 1585 1590
His Arg Leu Thr Glu Gly Asp Asp Leu Ala Ala Phe Val Leu Phe
1595 1600 1605
Ser Ser Ala Ala Ser Val Leu Gly Asn Gly Gly Gln Ala Ser Tyr
1610 1615 1620
Gly Ala Ala Asn Met Phe Leu Asn Ala Leu Ala Glu His Arg Arg
1625 1630 1635
Ala Gly Gly Arg Pro Ala Thr Ser Leu Ala Trp Gly Leu Trp Glu
1640 1645 1650
Ala Ser Gly Gly Met Ala Glu Asp Leu Ala Ala Ala Asp Arg Ala
1655 1660 1665
Arg Ile Thr Arg Ser Gly Val Ala Ala Met Pro Pro Gly Arg Ala
1670 1675 1680
Leu Ala Leu Leu Asp Ala Ala Leu Ala Ala Pro His Pro Thr Leu
1685 1690 1695
Val Pro Val Ala Phe Asp Arg Ala Ala Leu Arg Glu Gln Ala Ala
1700 1705 1710
Ala Gly Thr Leu Pro Pro Val Leu Arg Gly Arg Val Arg Val Pro
1715 1720 1725
Ala Arg Ser Pro Arg Gly Pro Ala Ala Ser Ser Phe Ala Ala Arg
1730 1735 1740
Leu Ala Gly Leu Pro Ala Glu Glu Arg Asp Arg Ala Val Ala Ser
1745 1750 1755
Ala Val Arg Asp Gln Ile Ala Thr Val Leu Ala His Pro Glu Pro
1760 1765 1770
Glu Ala Ile Asp Leu Asn Arg Ala Phe Gln Glu Leu Gly Phe Asp
1775 1780 1785
Ser Leu Thr Ala Leu Glu Leu Arg Asn Arg Leu Asn Ala Ala Thr
1790 1795 1800
Gly Ala Ala Leu Pro Ala Thr Val Ile Phe Asp His Pro Thr Pro
1805 1810 1815
Glu Ala Leu Ile Arg Ala Val Arg Thr Arg Leu Asp Gly Ala Ala
1820 1825 1830
Gly Gly Arg Ala Val Arg Ala Pro Val Asn Gly Ala Pro Arg Ala
1835 1840 1845
Ala Ala Ala Asp Asp Gly Glu Pro Ile Ala Ile Ile Gly Met Gly
1850 1855 1860
Cys Arg Tyr Pro Gly Gly Val Thr Ser Pro Glu Asp Leu Trp Glu
1865 1870 1875
Leu Val Ala Ser Gly Ala Asp Ala Ile Gly Glu Phe Pro Ala Asp
1880 1885 1890
Arg Gly Trp Asp Leu Pro Gly Leu Phe Asp Pro Asp Pro Asp Arg
1895 1900 1905
Leu Gly His Ser Tyr Thr Arg Glu Gly Gly Phe Leu Tyr Asp Ala
1910 1915 1920
Ala Arg Phe Asp Ala Gly Phe Phe Gly Ile Ser Pro Arg Glu Ala
1925 1930 1935
Val Ala Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Thr Val Trp
1940 1945 1950
Glu Ala Phe Glu Arg Ala Gly Leu Asp Pro Ala Asp Leu Arg Gly
1955 1960 1965
Ser Arg Thr Gly Val Ile Ala Gly Val Met Tyr Asp Asp Tyr Gly
1970 1975 1980
Ser Arg Phe Leu Gly Arg Thr Pro Arg Ala Val Glu Gly Arg Leu
1985 1990 1995
Met Thr Gly Ser Thr Pro Ser Val Ala Ser Gly Arg Val Ala Tyr
2000 2005 2010
Thr Phe Gly Leu Glu Gly Pro Thr Leu Thr Val Asp Thr Ala Cys
2015 2020 2025
Ser Ser Ser Leu Val Ala Met His Leu Ala Ala Gln Ala Leu Arg
2030 2035 2040
Arg Asp Glu Cys Thr Leu Ala Val Ala Gly Gly Val Thr Val Met
2045 2050 2055
Ala Thr Pro Asn Thr Phe Val Glu Phe Ser Arg Gln Arg Gly Leu
2060 2065 2070
Ala Pro Asp Gly Arg Cys Lys Ser Phe Ala Ala Ser Ala Asp Gly
2075 2080 2085
Val Gly Trp Gly Glu Gly Ala Gly Met Val Val Leu Glu Arg Leu
2090 2095 2100
Ser Asp Ala Arg Arg Asn Gly His Arg Val Leu Ala Val Leu Arg
2105 2110 2115
Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala
2120 2125 2130
Pro Asn Gly Pro Ser Gln Glu Arg Val Ile Arg Ala Ala Leu Ala
2135 2140 2145
Gly Ala Gly Leu Ala Pro Ser Asp Val Asp Ala Val Glu Ala His
2150 2155 2160
Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala His Ala Leu
2165 2170 2175
Leu Ala Ala Tyr Gly Gln Asp Arg Pro Asp Gly Arg Pro Leu Trp
2180 2185 2190
Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr Gln Ala Ala Ala
2195 2200 2205
Gly Val Ala Gly Val Ile Lys Met Val Met Ala Met Arg Glu Gly
2210 2215 2220
Ser Leu Pro Ala Ser Leu His Ile Ala Glu Pro Ser Pro His Val
2225 2230 2235
Asp Trp Thr Ala Gly Ala Val Arg Leu Leu Thr Glu Pro Val Glu
2240 2245 2250
Trp Arg Asn Gly Asp Arg Pro Arg Arg Ala Gly Val Ser Ser Phe
2255 2260 2265
Gly Ile Ser Gly Thr Asn Ala His Leu Ile Leu Glu Gln Ala Pro
2270 2275 2280
Asp Pro Ala Glu Thr Pro Ala Pro Glu Gly Pro Val Ile Leu Asn
2285 2290 2295
Gly Ala Ala Ser Ala Glu Ala Glu Thr Gly Thr Gly Asp Thr Gly
2300 2305 2310
Asp Thr Gly Val Pro Asp Leu Ala Leu Val Pro Trp Thr Val Ser
2315 2320 2325
Ala Arg Asp Pro Asp Ala Leu Arg Gly Gln Ala Ala Ala Leu Ala
2330 2335 2340
Ala His Val Ser Ala Arg Pro Gly Leu Pro Val Ala Asp Val Ala
2345 2350 2355
Trp Ser Leu Leu Lys Thr Arg Ser Ser Phe Pro His Lys Ala Val
2360 2365 2370
Val Val Gly Glu Arg Pro Ala Glu Leu Val Ala Gly Leu Glu Ala
2375 2380 2385
Leu Ala Ala Gly Ala Asp Ala His Pro Ala Leu Thr Gly Pro Gly
2390 2395 2400
Ala Pro Ala Ala Pro Gly Arg Leu Val Trp Leu Phe Ser Gly Gln
2405 2410 2415
Gly Ser Gln Arg Val Gly Met Gly Ala Asp Leu Tyr Val Arg Phe
2420 2425 2430
Pro Ala Phe Ala Glu Ala Phe Asp Glu Val Cys Gly His Phe Asp
2435 2440 2445
Glu Arg Leu Gly Arg Pro Leu Gly Glu Val Val Phe Thr Gly Pro
2450 2455 2460
Ala Asp Val Leu Asp His Thr Thr Tyr Ala Gln Ala Gly Leu Phe
2465 2470 2475
Ala Leu Gln Val Ala Leu Ala Arg Leu Leu Gly Thr Ala Gly Ile
2480 2485 2490
Arg Pro Asp Ala Val Ile Gly His Ser Ile Gly Glu Val Ala Ala
2495 2500 2505
Ala His Val Ala Gly Val Leu Asp Leu Pro Asp Ala Cys Arg Leu
2510 2515 2520
Val Ala Ala Arg Ala Gly Leu Met Gly Arg Leu Pro Ser Gly Gly
2525 2530 2535
Ala Met Thr Ala Val Glu Ala Thr Pro Asp Glu Leu Ala Asp Asp
2540 2545 2550
Leu Glu Arg Leu Gly Gly Gly Leu Val Ala Val Ala Ala Leu Asn
2555 2560 2565
Thr Pro Gly Ser Thr Val Ile Ser Gly Pro Ala Glu Pro Val Ala
2570 2575 2580
Arg Ile Gly Ala Arg Trp Ala Glu Arg Gly Arg Arg Thr Arg Ser
2585 2590 2595
Leu Thr Val Ser His Ala Phe His Ser Pro Leu Met Asp Pro Val
2600 2605 2610
Leu Asp Glu Phe Ala Thr Ala Ile Asp Gly Leu Ala Tyr Arg Glu
2615 2620 2625
Pro Ala Ile Pro Leu Ile Ser Asn Leu Thr Gly Leu Pro Ala Asp
2630 2635 2640
Glu Arg Ile Thr Thr Pro Ala Tyr Trp Ala Glu His Ile Arg Arg
2645 2650 2655
Pro Val Arg Phe His Pro Ala Val Ala His Val Ala Ala Asp Ala
2660 2665 2670
Ala Ala Phe Leu Glu Leu Gly Pro Asp Pro Val Leu Gly Thr Ala
2675 2680 2685
Thr Arg Arg Thr Leu Asp Thr Leu Asp Ala Gly Gly Asp Pro Ala
2690 2695 2700
Gly Gly Pro Pro Val Leu Ala Thr Leu Thr Arg Lys Gln Pro Glu
2705 2710 2715
Ile Arg Ser Leu Ala Asn Ala Leu Ala Gly Leu His Val Thr Gly
2720 2725 2730
Val Pro Val Asp Trp Ala Ala Ala Trp Phe Pro Ala Glu Pro Ala
2735 2740 2745
Pro Arg Ala Val Asp Leu Pro Thr Tyr Ala Phe Gln Arg Glu Arg
2750 2755 2760
Phe Trp Leu Ser Ala Asp Ala Gly Gly Glu Asp Ala Ala Gly Leu
2765 2770 2775
Gly Leu Thr Pro Ala Gly His Pro Leu Leu Gly Ala Ala Met Asp
2780 2785 2790
Leu Ala Gly Gly Gly Gly His Val Leu Thr Gly Arg Leu Thr Arg
2795 2800 2805
Arg Ser Gly Ala Trp Leu Ala Glu His Val Val Ala Gly Thr Thr
2810 2815 2820
Leu Leu Pro Gly Ala Gly Val Val Glu Trp Ala Leu Arg Ala Ala
2825 2830 2835
Asp Glu Val Gly Cys Gly Gly Val Asp Glu Leu Thr Leu Arg Ala
2840 2845 2850
Pro Phe Val Leu Pro Asp Thr Gly Gly Leu Arg Val Gln Val Val
2855 2860 2865
Ile Gly Ala Pro Gly Glu Asp Gly Arg Arg Asp Leu His Val Tyr
2870 2875 2880
Ser Arg Leu Asp Gln Asp Gly Glu Ala Ala Glu Trp Leu Cys Asn
2885 2890 2895
Ala Glu Gly Val Leu Thr Pro Glu Thr Pro Ala Asp Pro Gly Pro
2900 2905 2910
Glu Pro Gly Gly Pro Trp Pro Pro Pro His Ala Glu Pro Val Asp
2915 2920 2925
Val Glu Gly Phe Tyr Glu Arg Ala Ala Ser Gly Gly Tyr Asp Tyr
2930 2935 2940
Gly Pro Ser Phe Gln Gly Leu Arg Ala Leu Trp Arg Ala Gly Asp
2945 2950 2955
Asp Leu Phe Ala Glu Val Thr Leu Pro Glu Ala Ala Gly Asp Val
2960 2965 2970
Ala Gly Tyr Gly Ile His Pro Ala Leu Leu Asp Ala Ala Leu His
2975 2980 2985
Pro Leu Phe Leu Thr Gly Leu Phe Gly Gly Asp Pro Ser Gly Asn
2990 2995 3000
Arg Ile Trp Leu Pro Phe Thr Trp Asn Asp Val Ser Leu Arg Ala
3005 3010 3015
Gly Gly Ala Thr Ala Val Arg Val Arg Leu Ser Pro Asn Glu Asp
3020 3025 3030
Gly Asp Thr Leu Arg Leu Thr Val Thr Asp Pro Ser Ala Ala Pro
3035 3040 3045
Ile Leu Thr Val Gly Ser Leu Val Met Arg Pro Ala Asp Ala Asp
3050 3055 3060
Gln Leu Arg Ala Ala Ala Arg Gln Gly Asp Thr Arg Gly Leu Phe
3065 3070 3075
Thr Val Asp Trp Thr Pro Val Pro Leu Pro Ser Ala Pro Ala Asp
3080 3085 3090
Ala Pro Glu Arg Asp Asp Ala Gly Glu Trp Ala Ala Leu Gly Pro
3095 3100 3105
Asp Thr Leu Pro Pro Gly Leu Asp Ala Tyr Arg Asp Leu Pro Thr
3110 3115 3120
Leu Leu Met Ala Leu Asp Ala Gly Gln Pro Pro Pro Ser His Val
3125 3130 3135
Leu Thr Thr Ile Pro Pro Glu Pro Pro Pro Gly Arg Thr Glu Thr
3140 3145 3150
Asp Val Ala Ala Asp Thr Leu Leu Leu Leu Gln Ala Trp Leu Asp
3155 3160 3165
Glu Pro Arg Leu Ala Asp Ala Arg Leu Val Val Val Thr Arg Gly
3170 3175 3180
Ala Val Ala Ser Asp Val Pro Asp Leu Ala Gly Ala Gly Val Trp
3185 3190 3195
Gly Leu Val Arg Ser Ala Gln Ala Glu Asn Pro Gly Arg Phe Val
3200 3205 3210
Leu Leu Asp Leu Glu Thr Ala Pro Asp Gly Asp Val Leu Glu Leu
3215 3220 3225
Val Arg Arg Ala Leu His Thr Asp Glu Pro Gln Leu Ala Val Arg
3230 3235 3240
Gly Gly Gln Ala Leu Val Pro Arg Leu Ala Arg Ala Asp Arg Pro
3245 3250 3255
Ala Glu Leu Thr Ala Pro Ala Thr Arg Ala Gly Trp Arg Leu Gly
3260 3265 3270
Ala Glu Arg Pro Gly Thr Val Glu Ser Val Thr Ile Val Pro Ala
3275 3280 3285
Pro Glu Ala Leu Glu Pro Leu Ala Pro Gly Gln Val Arg Ile Ser
3290 3295 3300
Val Arg Ser Ala Gly Leu Asn Phe Arg Asp Thr Leu Ile Ala Leu
3305 3310 3315
Gly Met Tyr Pro Gly Gly Gly Val Phe Gln Gly Ser Glu Gly Ala
3320 3325 3330
Gly Val Val Thr Glu Val Gly Pro Gly Val Ala Ser Gly Leu Ser
3335 3340 3345
Val Gly Asp Arg Val Met Gly Leu Phe Glu Gly Ala Phe Gly Pro
3350 3355 3360
Val Ala Val Ala Asp Glu Arg Met Val Val Arg Leu Pro Asp Gly
3365 3370 3375
Trp Asp Phe Arg Ser Gly Ala Ala Val Pro Val Val Phe Leu Thr
3380 3385 3390
Ala Trp Tyr Gly Leu Val Glu Leu Ala Gly Leu Lys Ser Gly Glu
3395 3400 3405
Ser Val Leu Val His Ala Ala Thr Gly Gly Val Gly Ser Ala Ala
3410 3415 3420
Val Gln Ile Ala Arg His Leu Gly Ala Glu Val Phe Ala Thr Ala
3425 3430 3435
Ser Pro Gly Lys His Ala Val Leu Glu Ser Met Gly Ile Asp Ala
3440 3445 3450
Ala His Arg Ala Ser Ser Arg Asp Leu Asp Phe Glu Arg Thr Phe
3455 3460 3465
Arg Asp Ala Thr Gly Gly Arg Gly Val Asp Val Val Leu Asn Ser
3470 3475 3480
Leu Ala Gly Glu Phe Thr Asp Ala Ser Leu Arg Leu Val Ala Glu
3485 3490 3495
Gly Gly Arg Phe Leu Glu Met Gly Lys Thr Asp Ile Arg Glu Pro
3500 3505 3510
Glu His Ile Ala Ala Glu Tyr Pro Gly Leu Ser Tyr Arg Ala Phe
3515 3520 3525
Asp Leu Val Thr Asp Ala Gly Pro Asp His Val Gly Glu Met Leu
3530 3535 3540
Ala Arg Leu Thr Glu Leu Phe Ser Ala Asp Thr Leu Arg Pro Pro
3545 3550 3555
His Val Glu Ala Trp Pro Leu Ala Arg Ala Arg Glu Ala Leu Arg
3560 3565 3570
Val Met Ser Gln Ala Lys His Thr Gly Lys Leu Val Leu Asp Ile
3575 3580 3585
Pro Pro Thr Leu Asp Pro Asp Gly Thr Val Leu Ile Thr Gly Gly
3590 3595 3600
Thr Gly Thr Leu Gly Gly Leu Val Ala Glu His Val Val Gly Thr
3605 3610 3615
Trp Gly Val Lys Arg Leu Val Leu Ala Ser Arg Arg Gly Leu Gln
3620 3625 3630
Ala Pro Gly Ala Asp Glu Leu Val Glu Arg Leu Thr Arg Glu Leu
3635 3640 3645
Gly Ala Gln Val Arg Val Val Ala Ala Asp Val Ser Asp Ala Asp
3650 3655 3660
Ala Val Thr Asp Leu Ile Thr Ala Thr Asp Ala Asp Pro Ala His
3665 3670 3675
Pro Leu Thr Gly Leu Ile His Thr Ala Gly Leu Ile Asp Asp Ala
3680 3685 3690
Val Ile Gly Ser Gln Thr Pro Gln Arg Leu His His Val Trp Asn
3695 3700 3705
Ala Lys Ala Thr Ala Ala His Asn Leu His Thr Ala Thr Arg His
3710 3715 3720
Leu Pro Leu Ala Ala Phe Val Ile Phe Ser Ser Ala Ala Gly Thr
3725 3730 3735
Ile Gly Ser Ala Gly Gln Ala Asn Tyr Ala Ala Ala Asn Ala Tyr
3740 3745 3750
Cys Asp Ala Leu Ala Val Arg Arg Arg Ser Leu Gly Leu Pro Gly
3755 3760 3765
Val Ser Val Ala Trp Gly Leu Trp Glu Ser Ser Ser Gly Ile Thr
3770 3775 3780
Gly His Leu Gly Asp Ala Asp Leu Ala Arg Met Gly Arg Ser Gly
3785 3790 3795
Phe Arg Pro Leu Thr Asp Arg His Gly Leu Ala Leu Leu Asp Ala
3800 3805 3810
Ala Ser Arg His Gly Ala Ala His Leu Val Ala Val Asp Leu Arg
3815 3820 3825
Thr Ser Ala Leu Ala Gly Gln Pro Ala Glu Asn Val Pro Pro Met
3830 3835 3840
Leu Arg Thr Leu Ala Ala Thr Ala Ser Ala Gly Gly Ala Ala Ala
3845 3850 3855
Arg Pro Thr Ala Gly Gly Val Arg Arg Pro Ala Asp Leu Ala Ala
3860 3865 3870
Arg Leu Ala Ala Gln Ser Arg Thr Glu Gln Glu Arg Thr Leu Leu
3875 3880 3885
Gly Leu Val Arg Thr Gln Ala Ala Thr Val Leu Gly His Pro Asn
3890 3895 3900
Pro Asp Ala Ile Arg Pro Asp Asn Ser Phe Lys Glu Leu Gly Phe
3905 3910 3915
Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Ala Ala Ser
3920 3925 3930
Thr Gly Leu Arg Leu Pro Ala Ser Leu Val Phe Asp Tyr Pro Glu
3935 3940 3945
Ala Ala Val Leu Ala Asp His Leu Arg Arg Arg Leu Ala Pro Asn
3950 3955 3960
Gly Glu Ser Ala Pro Ala Gln Asp Ala Val Asp Pro Leu Leu Gly
3965 3970 3975
Glu Leu Gly Arg Ile Glu Ser Ala Leu Ala Ala Leu Ala Leu Asp
3980 3985 3990
Asp Glu Ala Arg Ser Arg Val Ala Arg Arg Leu Asn Gly Leu Leu
3995 4000 4005
Ser Ala Val Asn Gly Gly Ser Asn Ala Ser Ala Glu Thr Gly Gly
4010 4015 4020
Phe Asp Asp Val Glu Ser Ala Ser Asp Asp Glu Ile Phe Glu Leu
4025 4030 4035
Ile Asp Arg Glu Leu
4040
<210> 4
<211> 4020
<212> PRT
<213> Artificial
<220>
<223> madA4基因编码的氨基酸序列
<400> 4
Met Ser Glu Thr Glu Glu Lys Leu Arg Gln Tyr Leu Lys Arg Val Thr
1 5 10 15
Val Asp Leu Gly Gln Thr Arg Gln Arg Leu Arg Glu Ala Glu Glu Arg
20 25 30
Gly Arg Glu Pro Val Ala Val Val Gly Met Ala Cys Arg Phe Pro Gly
35 40 45
Gly Val Ser Ser Pro Glu Glu Leu Trp Glu Leu Val Asp Ser Gly Gly
50 55 60
Asp Ala Ile Gly Ala Phe Pro Asp Asp Arg Gly Trp Asn Leu Glu Ala
65 70 75 80
Leu Tyr Asn Pro Asp Pro Asp Gln Pro Gly Gly Ser Tyr Val Arg Glu
85 90 95
Gly Gly Phe Leu Tyr Asp Ala Asp Leu Phe Asp Ala Gly Phe Phe Gly
100 105 110
Ile Ser Pro Arg Glu Ala Leu Ala Met Glu Pro Gln Gln Arg Leu Leu
115 120 125
Leu Glu Thr Ser Trp Glu Leu Leu Glu Arg Ala Gly Ile Glu Pro Gly
130 135 140
Ser Leu Lys Gly Ser Ala Thr Gly Val Tyr Ala Gly Ala Gly Leu Pro
145 150 155 160
Gly Phe Gly Thr Pro His Ile Glu Glu Ser Ala Glu Gly His Leu Val
165 170 175
Thr Gly Asn Ala Leu Ser Val Leu Ser Gly Arg Val Ala Phe Thr Leu
180 185 190
Gly Leu Glu Gly Pro Ala Val Ser Val Asp Thr Ala Cys Ser Ser Ser
195 200 205
Leu Val Ala Ile His Leu Ala Cys Gln Ala Leu Arg Gln Gly Asp Cys
210 215 220
Ser Met Ala Leu Ala Gly Gly Val Thr Val Met Ala Val Pro Ala Met
225 230 235 240
Phe Thr Glu Phe Ser Arg Gln Arg Gly Leu Ala Pro Asp Gly Arg Cys
245 250 255
Lys Pro Phe Ala Glu Ala Ala Asp Gly Thr Val Phe Ser Glu Gly Val
260 265 270
Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Leu Arg Asn Gly Arg
275 280 285
Arg Ile Leu Gly Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala
290 295 300
Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Glu Arg Val Ile
305 310 315 320
Arg Gln Ala Leu Ala Tyr Ala Gly Leu Ser Gly Val Glu Val Asp Ala
325 330 335
Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala
340 345 350
Gly Ala Leu Leu Ala Thr Tyr Gly Gln Gly Arg Pro Asp Asp Arg Pro
355 360 365
Leu Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Ala Gln Gly Ala
370 375 380
Ala Gly Val Ala Gly Val Ile Lys Met Leu Met Ala Met Arg Asn Gly
385 390 395 400
Arg Leu Pro Ala Thr Leu Arg Val Asp Arg Pro Ser Ser His Val Asp
405 410 415
Trp Glu Ser Gly Ala Val Ser Leu Leu Thr Glu Pro Val Glu Trp Pro
420 425 430
Arg Glu Asp Arg Pro Arg Arg Ala Gly Val Ser Ala Phe Gly Met Ser
435 440 445
Gly Thr Asn Ala His Leu Ile Leu Glu Glu Pro Pro Glu Pro Thr Glu
450 455 460
Pro Asp Asp Val Pro Glu Arg Pro Ala Pro Met Asp Val Gly Pro Leu
465 470 475 480
Pro Trp Val Leu Ser Ala Arg Ser Glu Glu Ala Leu Arg Gly Gln Ala
485 490 495
Ala Ala Leu Ala Ser Gln Val Arg Val Ser Asp Asp Ala Pro Leu Asp
500 505 510
Val Gly Trp Ser Leu Ala Ala Ser Arg Ser Asn Phe Glu Arg Arg Ala
515 520 525
Val Val Val Gly Thr Gly Arg Asp Glu Leu Leu Ala Gly Leu Glu Ala
530 535 540
Val Ala Ser Gly Glu Val Arg Ser Val Ala Pro Ser Gly Val Gly Ala
545 550 555 560
Gly Pro Val Leu Val Phe Pro Gly Gln Gly Ser Gln Trp Ala Gly Met
565 570 575
Gly Ala Glu Leu Leu Asp Ser Ser Pro Val Phe Ala Glu Arg Ile Ala
580 585 590
Glu Cys Glu Arg Ala Leu Ala Pro Phe Val Asp Trp Ser Leu Ser Asp
595 600 605
Val Leu Arg Gly Asp Gly Ala Glu Leu Ala Arg Val Asp Val Val Gln
610 615 620
Pro Val Leu Trp Ala Thr Met Val Ser Leu Ala Ala Val Trp Ala Ser
625 630 635 640
Tyr Gly Val Val Pro Ser Ala Val Ile Gly His Ser Gln Gly Glu Ile
645 650 655
Ala Ala Ala Cys Val Ala Gly Ala Leu Ser Leu Glu Asp Gly Ala Arg
660 665 670
Ile Val Ala Leu Arg Ser Arg Ala Leu Arg Arg Leu Ala Gly Gly Gly
675 680 685
Ala Met Ala Ser Leu Gly Val Gly Val Asp Ala Ala Gln Glu Phe Leu
690 695 700
Pro Glu Gly Val Ala Val Ala Ala Val Asn Gly Pro Ser Ser Thr Val
705 710 715 720
Val Ser Gly Pro Pro Glu Gln Val Thr Ala Ala Val Ala Arg Thr Glu
725 730 735
Ala Ala Gly His Arg Ala Arg Leu Ile Asp Val Asp Tyr Ala Ser His
740 745 750
Gly Pro Gln Val Asp Glu Ile Thr Asp Glu Leu His Glu Val Leu Ala
755 760 765
Gly Ile Glu Pro Val Pro Thr Thr Glu Val Ala Phe Tyr Ser Thr Val
770 775 780
Arg Ala Ala Ala Ile Asp Thr Ala Thr Leu Asp Thr Gly Tyr Trp Val
785 790 795 800
Thr Asn Leu Arg Glu Arg Val Arg Phe Ala Glu Thr Ile Asp Ala Leu
805 810 815
Leu Glu Asp Gly His Arg Val Phe Ile Glu Ala Ser Pro His Pro Val
820 825 830
Leu Thr Val Gly Met Glu Glu Thr Phe Glu Gln Ala Gly Val Asp Gly
835 840 845
Thr Thr Val Pro Thr Leu Arg Arg Asp His Gly Asp Arg Lys Gln Leu
850 855 860
Ala Asp Ala Leu Gly Gln Ala Phe Ser Ala Gly Val Glu Val Asp Trp
865 870 875 880
Thr Gly Trp Phe Pro Ala Glu Trp Ala Pro Arg Leu Val Glu Leu Pro
885 890 895
Thr Tyr Ala Phe Gln Arg Gln Arg Tyr Trp Ile Asp Ala Gln Gly Gly
900 905 910
Arg Gly Gly Asp Ala Ala Ser Leu Gly Leu Ala Ser Ala Gly His Arg
915 920 925
Leu Leu Gly Ala Ala Ile Glu Gln Ala Glu Gly Asp Val Arg Leu Leu
930 935 940
Thr Gly Arg Leu Ser Ala Thr Gly Asp Ala Ser Trp Leu Ala Asp His
945 950 955 960
Gln Val Ala Gly Thr Val Val Val Pro Gly Ala Ala Leu Leu Glu Trp
965 970 975
Ala Leu Arg Ala Ala Asp Glu Ala Gly Cys Thr Gly Val Glu Asp Leu
980 985 990
Val Leu Arg Ser Pro Leu Val Leu Pro Ala Ser Gly Ser Val Arg Val
995 1000 1005
Gln Val Val Val Gly Ala Ala Gly Glu Asp Gly Arg Arg Asp Val
1010 1015 1020
Arg Val Tyr Ser Lys Pro Gly Gln Asp Gly Asp Phe Asp Asp Asp
1025 1030 1035
Ala Gly Trp Thr Cys His Ala Glu Gly Thr Leu Gly Pro Glu Thr
1040 1045 1050
Gly Asp Ala Ala Glu Gly Leu Ala Gly Ala Trp Pro Pro Pro Gly
1055 1060 1065
Ala Gln Pro Val Asp Val Ser Gly Phe Tyr Asp Phe Ala Ala Ala
1070 1075 1080
Ser Gly Tyr Glu Tyr Gly Pro Ala Phe Gln Gly Leu Arg Ala Ala
1085 1090 1095
Trp Arg Asn Gly Ala Asp Val Leu Ala Glu Val Glu Leu Pro Glu
1100 1105 1110
Ala Ala Gly Glu Pro Gly Gly Phe Gly Val His Pro Ala Leu Leu
1115 1120 1125
Asp Ala Ala Leu His Ala Gly Leu Leu Ala Asp Gly His Gly Asp
1130 1135 1140
Asp Gly Asp Val Trp Leu Pro Phe Ala Trp Ser Gly Val Ser Leu
1145 1150 1155
Trp Ala Thr Asp Ala Thr Ala Val Arg Val Leu Leu Ser Pro Leu
1160 1165 1170
Pro Glu Gly Glu Arg Gly Val Arg Val Val Met Ala Asp Ala Val
1175 1180 1185
Gly Ala Pro Val Leu Thr Ala Asp Ala Val Ala Leu Arg Pro Ile
1190 1195 1200
Asp Thr Glu Arg Leu Arg Ala Glu Asp Gly Arg Gly Pro Asp Gly
1205 1210 1215
Leu Phe Val Leu Asp Trp Val Ala Val Gln Glu Ser Ala Pro Ser
1220 1225 1230
Ala Glu Ala Asn Glu Pro Gly Ala Leu Cys Gly Leu Gly Asp Phe
1235 1240 1245
Asp Asp Ser Val Leu Pro Ser Val Pro Asp Val Asp Ala Leu Leu
1250 1255 1260
Ala Arg Leu Asp Thr Glu Asp Leu Pro Thr Pro Leu Thr Val Leu
1265 1270 1275
Thr Val Leu Pro Pro Glu Gly Glu Gly Gly Gly Leu Asp Leu Thr
1280 1285 1290
Gly Arg Val Leu Ala Leu Leu Gln Arg Trp Leu Asn Glu Pro Arg
1295 1300 1305
Leu Asp Asp Val Arg Leu Ala Leu Val Thr Gln Gly Ala Val Ala
1310 1315 1320
Val Glu Asn Pro Asp Val Ala Gly Ala Ala Val Trp Gly Leu Val
1325 1330 1335
Arg Ser Ala Gln Ser Glu Glu Pro Gly Arg Phe Val Leu Val Asp
1340 1345 1350
Ala Gly Leu Pro Asp Pro Gly Ala Ala Phe Ala Gln Asp Ala Ala
1355 1360 1365
Asp Ala Val Arg Arg Ala Val Glu Ala Asn Glu Pro Gln Ala Ala
1370 1375 1380
Val Arg Ala Gly Gln Leu Leu Ala Pro Arg Leu Ala Arg Thr Asp
1385 1390 1395
Thr Arg Ala Pro Asp Pro Ala Tyr Ala Gly Asp Arg Glu Gly Gly
1400 1405 1410
His Gly Leu Pro Val Leu Asp Pro Asp Gly Ala Val Leu Val Thr
1415 1420 1425
Gly Gly Thr Gly Lys Val Gly Gly Leu Val Ala Glu His Leu Val
1430 1435 1440
Arg Ala Trp Gly Val Thr His Leu Val Leu Ala Ser Arg Arg Gly
1445 1450 1455
Ser Asp Ala Pro Gly Ala Ser Glu Leu Glu Ser Arg Leu Ala Ser
1460 1465 1470
Leu Gly Ala His Val Gln Val Val Ala Ala Asp Leu Ala Asp Ala
1475 1480 1485
Ser Ala Val Asp Glu Leu Val Gly Arg Ile Ala Ala Glu Arg Pro
1490 1495 1500
Leu Thr Gly Val Val His Ala Ala Gly Val Leu Asp Asp Ala Val
1505 1510 1515
Ile Ala Ser Gln Ser Pro Glu Arg Leu Ala Gly Val Trp Ala Ala
1520 1525 1530
Lys Ala Ala Ala Ala Tyr Asn Leu His Val Ala Thr Ala Asp Leu
1535 1540 1545
Pro Leu Gly Leu Phe Val Leu Phe Ser Ser Ala Ala Ser Cys Leu
1550 1555 1560
Gly Ser Pro Gly Gln Gly Asn Tyr Ala Ala Ala Asn Ala Phe Cys
1565 1570 1575
Asp Ala Leu Ala Ala His Arg Arg Ala Ser Gly Leu Pro Gly Val
1580 1585 1590
Ser Val Ala Trp Gly Leu Trp Ala Asp Ala Ser Gly Met Thr Gly
1595 1600 1605
His Met Thr Glu Thr Asp Leu Ala Arg Val Ser Arg Ser Gly Asn
1610 1615 1620
Val Pro Leu Thr Ser Asp Glu Ala Leu Glu Leu Phe Asp Ala Ala
1625 1630 1635
Cys Gly His Gly Gly Ala Tyr Leu Leu Ala Leu His Leu Asn Ala
1640 1645 1650
Ser Ala Leu Ala Ser Ala Asp Leu Leu Pro Pro Pro Leu Arg Ala
1655 1660 1665
Phe Gly Ala Arg Arg Asp Arg Gly Gly Arg Val Arg Arg Ala Ala
1670 1675 1680
Ala Ala Gly Gly Arg Gln Gly Ser Glu Leu Leu Ser Arg Leu Ala
1685 1690 1695
Gly Leu Asp Ala Asn Gly Arg Ala Asp Ala Ala Leu Glu Val Val
1700 1705 1710
Ser Glu Cys Ala Ala Gly Val Leu Gly His Arg Ser Pro Asp Gly
1715 1720 1725
Val Arg Ala Glu Ala Asn Phe Lys Asp Leu Gly Phe Asp Ser Leu
1730 1735 1740
Met Ala Val Glu Leu Arg Asn Arg Leu Ser Val Ile Ser Gly Val
1745 1750 1755
Arg Leu Pro Ala Thr Leu Val Phe Asp Tyr Pro Thr Pro His Ala
1760 1765 1770
Leu Ala Gly Tyr Leu Cys Ala Arg Leu Asp Gly Pro Ala Thr Ser
1775 1780 1785
Val Ala Gly Pro Ser Ala Gly Pro Thr Thr Ala Thr Ala Asp Asp
1790 1795 1800
Pro Val Ala Ile Val Ala Met Ala Cys Arg Phe Pro Gly Gly Val
1805 1810 1815
Val Ser Ala Glu Ser Leu Trp Glu Met Val Ala Ala Gly Gly Asp
1820 1825 1830
Ala Ile Gly Ala Phe Pro Gln Asp Arg Gly Trp Asp Leu Asp Gly
1835 1840 1845
Leu Phe His Pro Asp Pro Asp His Pro Gly Thr Ser Tyr Ala Arg
1850 1855 1860
Glu Gly Gly Phe Val His Asp Ala Ala Glu Phe Asp Ala Glu Phe
1865 1870 1875
Phe Gly Val Asn Pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln
1880 1885 1890
Arg Leu Leu Leu Glu Val Ser Trp Glu Leu Leu Glu Arg Ala Gly
1895 1900 1905
Leu Asp Pro Val Ser Leu Lys Gly Ser Arg Thr Gly Val Tyr Ala
1910 1915 1920
Gly Val Met Tyr His Asp Tyr Gly Ala Gly Ile Ala Ala Gly Asp
1925 1930 1935
Ser Arg Leu Glu Gly Tyr Ala Leu Met Ser Ser Ser Gly Ser Val
1940 1945 1950
Val Ser Gly Arg Val Ala Tyr Thr Leu Gly Phe Glu Gly Pro Ala
1955 1960 1965
Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Met His
1970 1975 1980
Leu Ala Ala Gln Ala Leu Arg Gln Gly Glu Cys Asp Leu Ala Leu
1985 1990 1995
Ala Gly Gly Val Thr Val Met Ala Thr Pro Asp Val Phe Thr Gly
2000 2005 2010
Phe Ser Arg Gln Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Pro
2015 2020 2025
Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly Val Gly
2030 2035 2040
Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Arg Gly His
2045 2050 2055
Glu Val Leu Ala Val Leu Arg Gly Ser Ala Val Asn Gln Asp Gly
2060 2065 2070
Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Glu Arg
2075 2080 2085
Val Ile Arg Gln Ala Leu Ala Ser Ala Gly Leu Ser Ser Ser Asp
2090 2095 2100
Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp
2105 2110 2115
Pro Ile Glu Ala Gly Ala Leu Leu Ala Thr Tyr Gly Arg Asp Arg
2120 2125 2130
Ser Gly Asp Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Ile
2135 2140 2145
Gly His Thr Gln Ala Ala Ala Gly Val Ala Gly Val Ile Lys Thr
2150 2155 2160
Val Met Ala Met Arg Asn Gly Ser Leu Pro Ala Ser Leu His Ile
2165 2170 2175
Asp Glu Pro Ser Pro His Val Asp Trp Glu Ser Gly Ala Val Arg
2180 2185 2190
Leu Leu Thr Glu Pro Met Glu Trp Pro Thr Glu Asp Asp Arg Pro
2195 2200 2205
Arg Arg Ala Gly Val Ser Ser Phe Gly Ala Ser Gly Thr Asn Ala
2210 2215 2220
His Leu Ile Leu Glu Gln Ala Pro Glu Pro Lys Asn Glu Pro Glu
2225 2230 2235
Asp Glu Ser Glu Pro Val Pro Phe Asp Gly Val Leu Pro Trp Val
2240 2245 2250
Leu Ser Ala His Asn Ala Asp Ala Leu Arg Gly Gln Ala Ala Ala
2255 2260 2265
Leu Ala Ala His Val Ser Ala Thr Asp Asp Thr Ser Val Glu Val
2270 2275 2280
Gly Trp Ser Leu Leu Lys Ser Arg Ser Leu Leu Asp Gln Arg Ala
2285 2290 2295
Val Val Leu Gly Glu Asp Arg Ala Gln Leu Leu Ala Gly Val Glu
2300 2305 2310
Ala Leu Ala Ala Gly Glu Pro His Pro Ala Leu Val Arg Thr Gly
2315 2320 2325
Pro Ala Ala Thr Ala Ser Gly Lys Ser Val Phe Leu Phe Ser Gly
2330 2335 2340
Gln Gly Ser Gln Arg Val Gly Met Gly Ala Gly Leu Tyr Glu Arg
2345 2350 2355
Phe Pro Val Phe Ala Gly Ala Phe Asp Glu Val Cys Gly Leu Leu
2360 2365 2370
Asp Glu His Leu Glu Arg Pro Leu Lys Asp Val Val Phe Thr Gly
2375 2380 2385
Leu Pro Asp Gln Pro Glu Leu Leu Asp His Thr Thr Tyr Ala Gln
2390 2395 2400
Thr Gly Leu Phe Ala Leu Gln Val Gly Leu Ala Arg Leu Leu Ala
2405 2410 2415
Ser Leu Gly Val Val Pro Asp Val Val Val Gly His Ser Val Gly
2420 2425 2430
Glu Ile Ala Ala Ala His Val Ala Gly Val Phe Gly Leu Glu Asp
2435 2440 2445
Ala Cys Arg Leu Val Ala Ala Arg Ala Thr Leu Met Gly Gly Leu
2450 2455 2460
Pro Ser Gly Gly Ala Met Ala Ala Ile Gln Ala Thr Ala Asp Glu
2465 2470 2475
Leu Thr Asp Asp Leu Ala Arg Phe Ala Gly Arg Val Ala Ile Ala
2480 2485 2490
Ala Leu Asn Thr Pro Asp Ser Thr Val Val Ser Gly Pro Ala Asp
2495 2500 2505
Leu Val Ser Glu Val Ala Ala Ala Trp Lys Glu Arg Gly Arg Lys
2510 2515 2520
Thr Lys Ala Leu Ala Val Ser His Ala Phe His Ser Pro Met Met
2525 2530 2535
Asp Pro Ile Leu Asp Glu Phe Thr Asp Ala Ile Ser Asp Leu Thr
2540 2545 2550
Tyr Arg Arg Pro Asn Val Pro Leu Ile Ser Asn Leu Ser Gly Leu
2555 2560 2565
Pro Ala Asp Asp Gln Ile Thr Thr Pro Glu Tyr Trp Ala Thr His
2570 2575 2580
Ile Arg Arg Pro Val Arg Phe Ala Pro Ala Ile Ala His Leu Ala
2585 2590 2595
Pro Gln Ala Gly Val Phe Leu Glu Leu Gly Pro Asp Pro Val Leu
2600 2605 2610
Ala Thr Ala Ala Gln His Thr Leu His His Ala Met Asp Glu His
2615 2620 2625
Glu Asp Gly Gly Thr Asp Glu Ala Gly Arg Pro Val Pro Pro Thr
2630 2635 2640
Leu Ser Thr Leu Asn Gly Lys Gln Asp Asp Pro His Ala Phe Ala
2645 2650 2655
His Ala Met Ala Arg Leu His Thr Ser Gly Ala Gly Val Asp Trp
2660 2665 2670
Thr Asp Trp Phe Pro Ala Lys Pro Ala Pro Arg Val Val Ala Leu
2675 2680 2685
Pro Thr Tyr Ala Phe Gln Arg Glu Arg Tyr Trp Leu Ala Pro Gly
2690 2695 2700
Arg Gly Ala Gly Asp Val Ala Ala Val Gly Leu Arg Arg Val Glu
2705 2710 2715
His Ala Val Leu Pro Ala Ala Val Pro Leu Ala Asp Gly Gly Leu
2720 2725 2730
Leu Leu Thr Gly Arg Met Ser Ala Ser Gly Gly Trp Ala Ser Asp
2735 2740 2745
His Arg Met Leu Gly Thr Thr Leu Val Pro Gly Ala Ala Leu Ala
2750 2755 2760
Glu Trp Ala Leu Arg Ala Ala Asp Glu Thr Gly Cys Gly Ala Val
2765 2770 2775
Glu Glu Leu Val Leu Gln Ala Pro Leu Val Leu Pro Pro Thr Gly
2780 2785 2790
Ala Ala Gln Ile Gln Val Val Ala Asp Ala Pro Gly Ser Asp Gly
2795 2800 2805
Arg Arg Glu Leu Arg Val Tyr Ser Arg Pro Glu His Asp Asp Gly
2810 2815 2820
Ser Val Thr Glu Ala Pro Trp Val Cys His Ala Met Gly Val Leu
2825 2830 2835
Thr Pro Glu Ala Arg Asp Val Pro Glu Gly Leu Asp Gly Ala Trp
2840 2845 2850
Pro Pro Pro Gly Ala Glu Pro Val Asp Val Thr Ala Phe Tyr Glu
2855 2860 2865
Arg Met Ala Ala Thr Gly Tyr Glu Tyr Gly Pro Ser Phe Arg Gly
2870 2875 2880
Leu Arg Ala Ala Trp Arg Asp Gly Pro Asp Ile Leu Ala Glu Val
2885 2890 2895
Gln Leu Pro Glu Ala Ala Gly Asp Arg Ala Gly Phe Gly Val His
2900 2905 2910
Pro Ala Leu Leu Asp Ala Val Leu His Ala Gly Leu Leu Pro Asp
2915 2920 2925
Gly Glu Gly Glu Gly Glu Gln Asp Gly Arg Val Trp Leu Pro Phe
2930 2935 2940
Ala Trp Asn Gly Ile Ser Leu Trp Ala Ser Glu Ala Ala Thr Val
2945 2950 2955
Arg Val Arg Leu Ala Pro Val His Asp Gly Asp Gly Glu Gly Arg
2960 2965 2970
Asp Glu Arg Gly Val Arg Val Thr Val Ala Asp Ser Ala Gly Ala
2975 2980 2985
Ala Val Leu Thr Val Asp Ser Val Val Thr Arg Pro Ala Asp Ile
2990 2995 3000
Asp Lys Leu Lys Ala Ala Gly Glu Pro Asp Ala Asp Gly Leu Phe
3005 3010 3015
Ala Leu Glu Trp Thr Pro Leu Pro Ala Pro Ala Leu Ser Asp Pro
3020 3025 3030
Thr Leu Gly Asp Asp Asp Trp Ala Val Leu Gly Thr Asp Ala Phe
3035 3040 3045
Gly Pro Ala Asp Cys Ala Arg Phe Pro Asp Leu Ala Ala Leu Ser
3050 3055 3060
Ala Ala Leu Asp Ala Asp Arg Pro Ser Pro Thr Leu Val Leu Ala
3065 3070 3075
Asp Ala Ser Tyr Gly Glu Ser Asp Asp Leu Asp Ala Val Gly Asn
3080 3085 3090
Val Leu Ser Leu Leu Gln Ala Trp Leu Asp Glu Pro Arg Leu Ala
3095 3100 3105
Asp Ala Arg Leu Val Val Val Thr Arg Gly Ala Val Ala Ser Asp
3110 3115 3120
Val Pro Asp Leu Ala Gly Ala Gly Val Trp Gly Leu Val Arg Ser
3125 3130 3135
Ala Gln Ala Glu Asn Pro Gly Arg Phe Val Leu Leu Asp Leu Glu
3140 3145 3150
Ala Asp Thr Phe Leu Ser Thr Gly Leu Ala Asp Ala Val Ala Arg
3155 3160 3165
Ala Ala Glu Ala Arg Glu Pro Gln Leu Ala Val Arg Ser Gly Glu
3170 3175 3180
Leu Leu Ile Pro Arg Leu Val Arg Ala Ser Glu Arg Gln Gly Ile
3185 3190 3195
Ala Ala Pro Thr Gly Glu Arg Ala Trp Arg Leu Ala Val Ala Thr
3200 3205 3210
Pro Gly Thr Val Glu Asn Val Ala Pro Glu Pro Phe Pro Lys Ala
3215 3220 3225
Leu Glu Pro Leu Ala Ala Gly Gln Val Arg Ile Glu Val Arg Ala
3230 3235 3240
Gly Gly Ile Asn Phe Arg Asp Val Leu Val Thr Leu Gly Met Val
3245 3250 3255
Pro Gly Gln Thr Gly Leu Gly Gly Glu Gly Ala Gly Val Val Thr
3260 3265 3270
Glu Val Gly Pro Gly Val Phe Ser Gly Leu Ser Val Gly Asp Arg
3275 3280 3285
Val Met Gly Leu Phe Glu Gly Ala Phe Gly Pro Val Thr Val Ala
3290 3295 3300
Asp Glu Arg Met Val Ser Pro Ile Pro Glu Asp Trp Asp Phe Arg
3305 3310 3315
Ser Ala Ala Ala Val Pro Ile Val Phe Leu Thr Ala Trp Tyr Gly
3320 3325 3330
Leu Val Glu Leu Ala Gly Leu Lys Ser Gly Glu Ser Val Leu Val
3335 3340 3345
His Ala Ala Thr Gly Gly Val Gly Ser Ala Ala Val Gln Ile Ala
3350 3355 3360
Arg His Leu Gly Ala Glu Val Phe Ala Thr Ala Ser Pro Gly Lys
3365 3370 3375
His Ala Val Leu Glu Ala Met Gly Ile Asp Ala Ala His Arg Ala
3380 3385 3390
Ser Ser Arg Asp Leu Glu Phe Glu Arg Thr Phe Arg Asp Ala Thr
3395 3400 3405
Gly Gly Arg Gly Val Asp Val Val Leu Asn Ser Leu Ala Gly Glu
3410 3415 3420
Phe Thr Asp Ala Ser Leu Arg Leu Val Ala Glu Gly Gly Arg Phe
3425 3430 3435
Leu Glu Met Gly Lys Thr Asp Ile Arg Glu Pro Glu His Val Ala
3440 3445 3450
Ala Glu His Pro Ala Val Gly Tyr Gln Val Phe Asp Leu Val Thr
3455 3460 3465
Asp Ala Gly Pro Val Arg Ile Ser Glu Met Phe Arg Ala Leu Ala
3470 3475 3480
Glu Leu Phe Ala Glu Gln Val Leu Arg Pro Ala Pro Thr Gln Ala
3485 3490 3495
Trp Pro Leu Ala Arg Ala Arg Glu Ala Leu Arg Val Met Ser Gln
3500 3505 3510
Ala Lys His Thr Gly Lys Leu Val Leu Asp Ile Pro Pro Ala Leu
3515 3520 3525
Asp Pro Asp Gly Thr Val Leu Ile Thr Gly Gly Thr Gly Thr Leu
3530 3535 3540
Gly Gly Leu Val Ala Glu His Val Val Gly Thr Trp Gly Val Lys
3545 3550 3555
Arg Leu Val Leu Ala Ser Arg Arg Gly Leu Gln Ala Pro Gly Ala
3560 3565 3570
Asp Glu Leu Val Glu Arg Leu Thr Arg Glu Leu Gly Ala Glu Val
3575 3580 3585
Arg Val Val Ala Ala Asp Val Ser Asp Ala Asp Ala Val Thr Asp
3590 3595 3600
Leu Ile Thr Ala Thr Asp Ala Asp Pro Ala His Pro Leu Thr Gly
3605 3610 3615
Leu Val His Thr Ala Gly Leu Ile Asp Asp Ala Val Ile Gly Ser
3620 3625 3630
Gln Thr Pro Glu Arg Leu His His Val Trp Asp Ala Lys Ala Thr
3635 3640 3645
Ala Ala His Asn Leu His Thr Ala Thr Arg His Leu Pro Leu Ala
3650 3655 3660
Ala Phe Val Ile Phe Ser Ser Ser Ala Ala Met Leu Gly Ser Pro
3665 3670 3675
Gly Gln Gly Asn Tyr Ala Ala Ala Asn Ala Tyr Cys Asp Ala Leu
3680 3685 3690
Ala Ala Arg Arg Gln Ala Ala Gly Leu Pro Ala Val Ser Val Gly
3695 3700 3705
Trp Gly Leu Trp Glu Ala Thr Ser Ala Met Thr Gly Ser Leu Thr
3710 3715 3720
Glu Thr Asp Arg Ala Arg Met Ser Arg Ser Gly Val Gly Ala Leu
3725 3730 3735
Pro Asn Glu His Gly Leu Ala Leu Leu Asp Ala Ala Leu Arg His
3740 3745 3750
Gly Asp Ala Ser Leu Leu Ala Ala Asn Leu Asp Leu Arg Ala Leu
3755 3760 3765
Ala Val Gln Pro Ala Glu Ser Leu Pro Gly Ala Leu Arg Ala Leu
3770 3775 3780
Val Ser Gly Gly Gly Glu Arg Val Arg Arg Thr Ala Ala Thr Gly
3785 3790 3795
Ser Ala Arg Pro Asp Asp Trp Ala Gly Arg Leu Ala Gly Met Ser
3800 3805 3810
Ala Ala Glu Gln Gln Arg Ala Met Leu Asn Leu Val Arg Gly His
3815 3820 3825
Val Ala Thr Val Leu Gly Arg Thr Asp Ala Asp Ser Val Arg Gly
3830 3835 3840
Asp Ala Ser Phe Lys Glu Leu Gly Phe Asp Ser Leu Thr Gly Val
3845 3850 3855
Glu Leu Arg Asn Arg Leu Ala Asp Ala Thr Gly Leu Arg Leu Pro
3860 3865 3870
Pro Ala Leu Val Phe Asp Tyr Pro Gln Ala Gly Val Leu Ala Glu
3875 3880 3885
His Leu Arg Thr Arg Leu Val Pro Glu Gly Ala Asp Ala Ser Ala
3890 3895 3900
Ala Gly Ala Gly Val Glu Pro Val Leu Asp Asp Leu Ala Arg Leu
3905 3910 3915
Glu Ser Thr Leu Asn Ala Ala Ala Ala Leu Glu Asp Gly Asp Ser
3920 3925 3930
Asp Ala Val Thr Ala Arg Leu Glu Ser Leu Leu Thr His Trp Arg
3935 3940 3945
Ala Val Arg Ser Gly Arg Ala Ala Asn Gly His His Gly Arg Asn
3950 3955 3960
Gly His Ala Glu Pro Ser Gly Asn Asp Gly Gln Asn Gly His Asp
3965 3970 3975
Gly Gln Asp Gly Glu Asp Gly Thr Ala Val Asp Arg Leu Glu Ser
3980 3985 3990
Ala Ser Ala Asp Gln Val Leu Asp Phe Ile Asp Asn Glu Leu Gly
3995 4000 4005
Val Ser Trp Asn Gly Pro Asp Ala Thr Thr Thr Arg
4010 4015 4020
<210> 5
<211> 4035
<212> PRT
<213> Artificial
<220>
<223> madA5基因编码的氨基酸序列
<400> 5
Met Ala Thr Glu Glu Arg Leu Val Glu Tyr Leu Lys Arg Val Thr Ala
1 5 10 15
Asp Leu His Asp Thr Arg Leu Arg Leu Arg Arg Met Glu Glu Arg Ser
20 25 30
Glu Glu Pro Val Ala Val Val Gly Met Ala Cys Arg Phe Pro Gly Gly
35 40 45
Val Asp Ser Pro Glu Ser Leu Trp Asp Leu Val Ala Ala Gly Ala Asp
50 55 60
Ala Ile Gly Asp Phe Pro Asp Asp Arg Gly Trp Asp Leu Asp Gly Leu
65 70 75 80
Tyr His Pro Asp Pro Asp His Pro Gly Thr Thr Tyr Thr Arg Lys Gly
85 90 95
Gly Phe Val Asp Asp Ala Asp Arg Phe Asp Ala Ala Phe Phe Gly Ile
100 105 110
Ser Pro Arg Glu Ala Met Ala Thr Ser Pro Gln Gln Arg Leu Leu Leu
115 120 125
Glu Thr Ser Trp Glu Ala Phe Glu Arg Ala Gly Ile Asp Pro Val Ser
130 135 140
Leu Arg Gly Thr Arg Thr Gly Val Tyr Ala Gly Thr Ala Thr Thr Gly
145 150 155 160
Ala Ala Ala His Gly Gly Ser Ser Arg Lys Glu Ala Glu Gly Tyr Ala
165 170 175
Gly Asn Ala Pro Ser Leu Leu Ser Gly Arg Val Ala Tyr Thr Phe Gly
180 185 190
Leu Glu Gly Pro Ala Val Thr Ile Glu Thr Ala Cys Ser Ser Ala Leu
195 200 205
Val Ala Met His Leu Ala Gly Gln Ala Leu Arg Gln Gly Glu Cys Thr
210 215 220
Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Glu Val Phe
225 230 235 240
Thr Gly Phe Ser Arg Gln Arg Gly Leu Ser Pro Asp Gly Arg Cys Lys
245 250 255
Ala Phe Ser Ala Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly Val Gly
260 265 270
Val Val Val Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Arg
275 280 285
Val Leu Ala Val Ile Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser
290 295 300
Asn Gly Phe Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Arg
305 310 315 320
Gln Ala Leu Thr Gln Gly Arg Leu Ser Ala Ala Glu Val Asp Ala Val
325 330 335
Glu Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro Ile Glu Ala Asp
340 345 350
Ala Leu Ile Ala Ala Tyr Gly Arg Asn Arg Pro Pro Glu Arg Pro Leu
355 360 365
Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr Gln Gly Ala Ala
370 375 380
Gly Val Ala Gly Val Ile Lys Met Ile Met Ala Met Arg Arg Gly Val
385 390 395 400
Leu Pro Ala Ser Leu His Ile Asp Gln Pro Thr Pro His Val Glu Trp
405 410 415
Gly Asp Gly Ala Val Arg Leu Leu Thr Glu Pro Val Pro Trp Pro Asp
420 425 430
Thr Asp Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Ile Ser Gly
435 440 445
Thr Asn Ala His Leu Ile Leu Glu Gln Ala Pro Glu Pro Val Glu Ala
450 455 460
Pro Glu Pro Glu Pro Glu Ser Gly Ala Val Val Pro Trp Val Leu Ser
465 470 475 480
Gly Arg Ser Glu Glu Ala Leu Arg Gly Gln Ala Ala Ala Leu Ala Ala
485 490 495
Arg Met Asp Gly Ser Glu Gly Trp Ala Pro Val Asp Val Gly Trp Ser
500 505 510
Leu Ala Thr Thr Arg Ser Val Phe Glu His Arg Ala Val Val Val Gly
515 520 525
Ser Glu Arg Asp Glu Leu Leu Asp Gly Leu Lys Ala Leu Ala Thr Gly
530 535 540
Glu Met His Pro Gly Leu Val Gly Pro Gly Leu Ala Ala His Pro Asp
545 550 555 560
Ala Ser Ala Pro Val Phe Val Phe Pro Gly Gln Gly Ser Gln Trp Ala
565 570 575
Gly Met Gly Ala Glu Leu Leu Asp Ser Ser Pro Val Phe Ala Glu Arg
580 585 590
Ile Ala Glu Cys Glu Arg Ala Leu Ala Pro Phe Val Asp Trp Ser Leu
595 600 605
Ser Asp Val Leu Arg Gly Asp Gly Ala Glu Leu Ala Arg Val Asp Val
610 615 620
Val Gln Pro Val Leu Trp Ala Thr Met Val Ser Leu Ala Ala Val Trp
625 630 635 640
Ala Ser Tyr Gly Val Val Pro Ser Ala Val Ile Gly His Ser Gln Gly
645 650 655
Glu Ile Ala Ala Ala Cys Val Ala Gly Ala Leu Ser Leu Glu Asp Gly
660 665 670
Ala Arg Ile Val Ala Leu Arg Ser Arg Ala Leu Arg Arg Leu Ala Gly
675 680 685
Gly Gly Ala Met Ala Ser Leu Gly Val Gly Val Asp Ala Ala Gln Glu
690 695 700
Phe Leu Pro Glu Gly Val Ala Val Ala Ala Val Asn Gly Pro Ser Ser
705 710 715 720
Thr Val Val Ser Gly Pro Pro Glu Gln Val Thr Ala Ala Val Ala Arg
725 730 735
Thr Glu Ala Ala Gly Tyr Arg Ala Arg Leu Ile Asp Val Asp Tyr Ala
740 745 750
Ser His Gly Pro Gln Val Asp Glu Ile Thr Asp Glu Leu His Gln Val
755 760 765
Leu Ala Gly Ile Glu Pro Ala Pro Thr Thr Glu Val Ala Phe Tyr Ser
770 775 780
Thr Val Arg Ala Ala Ala Ile Asp Thr Ala Thr Leu Asp Thr Gly Tyr
785 790 795 800
Trp Val Thr Asn Leu Arg Glu Arg Val Arg Phe Ala Asp Thr Ile Asp
805 810 815
Ala Leu Leu Glu Asp Gly His Arg Ile Phe Ile Glu Ala Ser Pro His
820 825 830
Pro Val Leu Thr Val Gly Met Glu Glu Thr Phe Glu Gln Ala Gly Thr
835 840 845
Gly Ala Val Ala Val Pro Thr Leu Arg Arg Asp His Gly Gly Arg Glu
850 855 860
Gln Phe Val Arg Val Val Gly Gln Ala Phe Val Ala Gly Ala His Ala
865 870 875 880
Asp Trp Thr Ala Ser Phe Ala Gln Asp Pro Ala Pro Arg Thr Val Asp
885 890 895
Leu Pro Thr Tyr Ala Phe Gln Arg Gln Arg Phe Trp Ala Asp Asp Asp
900 905 910
Pro Thr Val His Ser Gly Asp Pro Ala Gly Leu Gly Leu Val Ser Val
915 920 925
Gly His Pro Leu Leu Gly Ala Ala Val Asp Val Ala Asp Glu Asp Thr
930 935 940
Arg Leu Leu Thr Gly Arg Leu Ser Ala Ser Gly Gly Gly Thr Gly Trp
945 950 955 960
Leu Asn Asp His Arg Ile Gly Asp Thr Val Leu Val Pro Gly Ala Ala
965 970 975
Leu Val Glu Trp Ala Leu Arg Ala Ala Asp Asp Val Gly Cys Gly Gly
980 985 990
Val Glu Glu Leu Ala Leu Gln Ala Pro Leu Val Leu Pro Ala Ser Gly
995 1000 1005
Ser Val Arg Val Gln Val Val Val Gly Ala Ala Gly Glu Asp Gly
1010 1015 1020
Arg Arg Glu Val Arg Val His Ser Val Pro Asp Leu Gly Gly Asp
1025 1030 1035
Gly Ala Ser Glu Asp Val Arg Trp Glu Cys His Ala Val Gly Val
1040 1045 1050
Leu Ser Pro Glu Pro Thr Ala Thr Pro Ser Asn Gly Gln Ser Asp
1055 1060 1065
Asp Leu Gly Gly Val Trp Pro Pro Ala Gly Ala Gln Pro Ile Asp
1070 1075 1080
Leu Glu Gly Phe Tyr Glu Arg Ala Ala Ala Ala Gly Tyr Gly Tyr
1085 1090 1095
Gly Pro Ala Phe Gln Gly Val Arg Ala Val Trp Arg Asp Gly Ser
1100 1105 1110
Asp Leu Leu Ala His Val Glu Leu Pro Glu Asp Ala Ala Asp Pro
1115 1120 1125
Ala Gly Phe Gly Val His Pro Ala Leu Leu Asp Ala Ala Leu His
1130 1135 1140
Pro Ala Leu Leu Gly Gly Ser Pro Glu Thr Asp Ser Glu Ala Gly
1145 1150 1155
Val Leu Leu Pro Phe Val Trp Asn Gly Val Ala Leu Trp Ala Ala
1160 1165 1170
Glu Ala Thr Ser Val Arg Val Arg Leu Ser Pro Arg Arg Asp Gly
1175 1180 1185
Glu Arg Gly Val Arg Val Val Met Ala Asp Ala Val Gly Ala Pro
1190 1195 1200
Val Leu Ser Val Asp Glu Val Val Met Arg Pro Ala Asp Thr Glu
1205 1210 1215
Arg Leu Arg Ala Ala Gly Arg Arg Gly Ala Gly Asn Leu Phe Ala
1220 1225 1230
Leu Glu Trp Ala Pro Leu Pro Ala Ser Thr Pro Asp Ala Ala Ala
1235 1240 1245
Gly Arg Gly Val Gly Glu Gly Glu Trp Ala Val Val Gly Ala Glu
1250 1255 1260
Pro Val Ala Thr Asp Leu Val His His Pro Asp Leu Asp Gly Leu
1265 1270 1275
Val Ala Ala Leu Asp Gly Gly Ala Pro Val Pro Pro Val Val Leu
1280 1285 1290
Ala Ala Val Pro Ser Ser Pro Asp Ala Leu Asp Met Val Thr Asp
1295 1300 1305
Thr Leu Ala Leu Val Gln Ala Trp Leu Ala Glu Pro Arg Leu Ala
1310 1315 1320
Asp Ala Arg Leu Ala Leu Val Thr Arg Gly Ala Val Ala Ala Gly
1325 1330 1335
Thr Glu Pro Asp Leu Gly Gly Ala Ala Val Trp Gly Leu Val Arg
1340 1345 1350
Ser Ala Gln Thr Glu Asn Pro Asp Arg Phe Val Leu Val Asp Val
1355 1360 1365
Asp Asp Leu Asp Asp Ala Asp Ala Asp Ala Ser Ala Asn Val Asp
1370 1375 1380
Arg Asp Ala Ala Cys Gly Val Ala Glu Ala Val Arg Arg Ala Val
1385 1390 1395
Glu Ala Asp Glu Pro Gln Val Ala Val Arg Gly Gly Arg Leu Leu
1400 1405 1410
Val Pro Arg Leu Ala Arg Asn Asp Asp Leu Gly Glu Pro Ser Asp
1415 1420 1425
Gln Ser Gly Asp Gly Ser Pro Pro Leu Asp Pro Glu Gly Thr Val
1430 1435 1440
Leu Ile Thr Gly Gly Thr Gly Met Ala Gly Ser Leu Val Ala Glu
1445 1450 1455
His Leu Val Arg Ala Trp Gly Val Arg Arg Leu Leu Leu Ala Gly
1460 1465 1470
Arg Arg Gly Pro Asp Ala Pro Gly Ala Ala Glu Leu Ala Glu Arg
1475 1480 1485
Val Arg Glu Leu Gly Gly Glu Val Arg Phe Ala Ala Ala Asp Val
1490 1495 1500
Ser Asp Ala Gly Ala Val Ala Asp Leu Val Ala Gly Val Asp Pro
1505 1510 1515
Ala His Pro Leu Thr Gly Val Val His Ala Ala Gly Val Leu Asp
1520 1525 1530
Asp Gly Val Ile Thr Ser Gln Ser Ala Glu Arg Leu Ala Gly Val
1535 1540 1545
Trp Ala Ala Lys Ala Ala Ala Ala Tyr Asn Leu His Val Ala Thr
1550 1555 1560
Ala Ala Leu Pro Leu Gly Leu Phe Val Leu Phe Ser Ser Ala Ala
1565 1570 1575
Gly Val Val Gly Asn Ala Gly Gln Ala Gly Tyr Ala Ala Ala Asn
1580 1585 1590
Ala Phe Val Asp Ala Leu Ala Glu Gln Arg Arg Thr Ala Gly Leu
1595 1600 1605
Pro Gly Leu Ser Val Ala Trp Gly Leu Trp Glu Gln Ser Ser Ala
1610 1615 1620
Met Thr Gly His Leu Ala Glu Ala Asp His Ala Arg Leu Ala Arg
1625 1630 1635
Ser Gly Phe Ser Ala Leu Arg Thr Glu Arg Ala Leu Glu Leu Leu
1640 1645 1650
Asp Ala Ala Cys Arg Ala Glu Arg Ser Pro Val Val Ala Ala Gly
1655 1660 1665
Ile Asp Val Arg Gly Arg Ser Ala Ala Glu Val Pro Ala Leu Leu
1670 1675 1680
Arg Gly Leu Ala Gly Arg Pro Gly Arg Arg Thr Ala Ala Ser Ala
1685 1690 1695
Glu Ala Gly Thr Gly Val Asp Gly Ala Arg Gly Arg Glu Leu Ala
1700 1705 1710
Val Arg Leu Ala Arg Met Asp Glu Pro Glu Arg Leu Asp Ala Val
1715 1720 1725
Leu Glu Thr Val Arg Gly Cys Val Ala Val Val Leu Gly Tyr Ala
1730 1735 1740
Ser Pro Ala Asp Val Arg Thr Glu Ala Pro Phe Lys Glu Leu Gly
1745 1750 1755
Leu Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Ser Arg
1760 1765 1770
Thr Thr Gly Leu Arg Leu Pro Ala Thr Leu Val Phe Asp Tyr Pro
1775 1780 1785
Thr Pro Arg Ala Leu Ala Asp His Leu Cys Ala Lys Leu Val Gly
1790 1795 1800
Asp Ala Ala Ala Ala Thr Ala Thr Ala Pro Val Ala Thr Ala Met
1805 1810 1815
Thr Asp Glu Pro Val Ala Ile Val Ala Met Ala Cys Arg Phe Pro
1820 1825 1830
Gly Gly Ala Ala Ser Pro Glu Gly Leu Trp Asp Leu Val Ala Arg
1835 1840 1845
Gly Val Asp Ala Met Ala Glu Phe Pro Glu Asp Arg Gly Trp Asp
1850 1855 1860
Leu Asp Gly Leu Phe His Pro Asp Pro Asp His Pro Gly Thr Ser
1865 1870 1875
Tyr Ala Arg Glu Gly Ala Phe Leu Asp Asp Ala Ala Gly Phe Asp
1880 1885 1890
Ala Gly Phe Phe Gly Ile Asn Pro Arg Glu Ala Leu Ala Met Asp
1895 1900 1905
Pro Gln Gln Arg Leu Leu Leu Glu Val Ser Trp Glu Leu Leu Glu
1910 1915 1920
Arg Ala Gly Leu Asp Pro Val Ser Leu Lys Gly Ser Arg Thr Gly
1925 1930 1935
Val Tyr Ala Gly Val Met Tyr His Asp Tyr Ala Ser Gly Leu Ser
1940 1945 1950
Asp Ser Gly Asp Ala Lys Leu Glu Gly Tyr Ser Met Leu Ala Ser
1955 1960 1965
Ala Gly Ser Ala Ile Cys Gly Arg Val Ala Tyr Thr Leu Gly Leu
1970 1975 1980
Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu
1985 1990 1995
Val Ala Met His Leu Ala Ala Gln Ala Leu Arg Gln Gly Glu Cys
2000 2005 2010
Ser Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Glu
2015 2020 2025
Val Phe Thr Gly Phe Ser Arg Gln Arg Gly Leu Ala Pro Asp Gly
2030 2035 2040
Arg Cys Lys Pro Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Gly
2045 2050 2055
Glu Gly Ala Gly Leu Val Leu Leu Glu Arg Leu Ser Asp Ala Arg
2060 2065 2070
Arg Asn Gly His Glu Val Leu Ala Val Leu Arg Gly Ser Ala Val
2075 2080 2085
Asn Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro
2090 2095 2100
Ser Gln Glu Arg Val Ile Arg Gln Ala Leu Ala Ser Ala Gly Leu
2105 2110 2115
Ser Ser Ser Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr
2120 2125 2130
Thr Leu Gly Asp Pro Ile Glu Ala Gly Ala Leu Leu Ala Thr Tyr
2135 2140 2145
Gly Arg Gly Arg Ser Gly Gly Arg Pro Leu Trp Leu Gly Ser Val
2150 2155 2160
Lys Ser Asn Ile Gly His Thr Gln Ala Ala Ala Gly Val Ala Gly
2165 2170 2175
Val Ile Lys Thr Val Met Ala Met Arg Asn Gly Ser Leu Pro Ala
2180 2185 2190
Ser Leu His Ile Asp Glu Pro Ser Pro His Val Asp Trp Glu Ser
2195 2200 2205
Gly Ala Val Arg Leu Leu Thr Glu Pro Met Glu Trp Pro Thr Asp
2210 2215 2220
Asp Asp Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Ala Ser
2225 2230 2235
Gly Thr Asn Ala His Leu Ile Leu Glu Gln Ala Pro Glu Pro Lys
2240 2245 2250
Asn Glu Pro Glu Asp Glu Ser Ala Pro Ala Leu Asp Gly Val Val
2255 2260 2265
Val Pro Trp Val Leu Ser Ala Arg Ser Glu Glu Ala Leu Arg Gly
2270 2275 2280
Gln Ala Ala Ala Leu Ala Ala His Val Ser Gly Thr Asp Asp Ala
2285 2290 2295
Pro Ala Asp Val Gly Trp Ser Leu Val Thr Ser Arg Ser Val Phe
2300 2305 2310
Asp His Arg Ala Val Val Thr Gly Thr Asp Arg Gln Glu Leu Leu
2315 2320 2325
Asn Gly Leu Gly Val Leu Ala Asn Gly His Glu Asn Pro Glu Val
2330 2335 2340
Val Arg Ala Val Ala Asn Ser Ala Ser Val Ser Gly Ser Thr Val
2345 2350 2355
Phe Leu Phe Ser Gly Gln Gly Ser Gln Arg Val Gly Met Gly Ala
2360 2365 2370
Gly Leu Tyr Glu Arg Phe Pro Val Phe Ala Gly Ala Phe Asp Glu
2375 2380 2385
Val Cys Ala Leu Leu Glu Pro Arg Leu Glu His Pro Leu Lys Asp
2390 2395 2400
Val Val Phe Thr Gly Ile Pro Glu Arg Asp Gly Leu Leu Asp His
2405 2410 2415
Thr Thr Tyr Ala Gln Ala Gly Leu Phe Ala Leu Gln Val Gly Leu
2420 2425 2430
Ala Arg Leu Leu Ala Ser Leu Gly Val Val Pro Asp Ala Val Val
2435 2440 2445
Gly His Ser Val Gly Glu Ile Ala Ala Ala His Val Ala Gly Val
2450 2455 2460
Phe Gly Leu Glu Asp Ala Cys Arg Leu Val Ala Ala Arg Ala Thr
2465 2470 2475
Leu Met Gly Gly Leu Pro Ser Gly Gly Ala Met Ala Ala Ile Gln
2480 2485 2490
Ala Thr Ala Asp Glu Leu Thr Asp Asp Leu Glu Arg Phe Ala Gly
2495 2500 2505
Arg Val Ala Val Ala Ala Leu Asn Thr Pro Asp Ser Thr Val Val
2510 2515 2520
Ser Gly Pro Ala Asp Glu Val Asp Gln Leu Val Gly Val Trp Thr
2525 2530 2535
Gly Arg Gly Arg Lys Ser Lys Arg Leu Ala Val Ser His Ala Phe
2540 2545 2550
His Ser Pro Leu Met Asp Pro Ile Leu Asp Glu Phe Thr Asp Ala
2555 2560 2565
Ile Ser Asp Leu Thr Tyr Arg Arg Pro Asp Val Pro Leu Ile Ser
2570 2575 2580
Asn Leu Thr Gly Glu Pro Ala Glu Pro Glu His Ile Ala Thr Pro
2585 2590 2595
Gly Tyr Trp Ala Asp His Ile Arg Arg Pro Val Arg Phe Ala Pro
2600 2605 2610
Ala Ile Ala His Ile Ala Pro Gln Ala Ala Thr Phe Leu Glu Leu
2615 2620 2625
Gly Pro Asp Pro Val Leu Ala Thr Ala Ala Gln His Thr Ile Gly
2630 2635 2640
Asp Gly Ser Ala Asp Arg Pro Gln Pro Leu Val Thr Ala Thr Leu
2645 2650 2655
Thr Gln Lys Arg Ser Asp Glu Arg Ala Phe Ala His Ala Leu Ala
2660 2665 2670
Gln Leu His Thr His Gly Ala Asp Val Asp Trp Lys Ala Leu Phe
2675 2680 2685
Asp Leu Asp Pro Asp Ser Arg Arg Arg Pro Arg Asn Val Ala Leu
2690 2695 2700
Pro Thr Tyr Ala Phe Gln His Arg Arg Phe Trp Leu Asp Ala Phe
2705 2710 2715
Asn Gly Arg Gly Gly Asp Pro Ala Gly Leu Gly Leu Ala Ser Ala
2720 2725 2730
Asp His Ala Leu Leu Gly Ala Ala Val Glu Leu Ala Asp Ser Gly
2735 2740 2745
Thr Trp Leu Leu Thr Gly Arg Ile Ser Ala Arg Ser His Arg Trp
2750 2755 2760
Leu Gly Glu His Val Met Ala Ala Thr Pro Leu Ala Pro Gly Ala
2765 2770 2775
Met Leu Val Glu Trp Ala Leu His Ala Ala Asp Ala Val Gly Cys
2780 2785 2790
Gly Gly Val Glu Glu Leu Met Leu Gln Ala Pro Leu Val Leu Pro
2795 2800 2805
Glu Thr Gly Gly Val Arg Val Gln Val Val Val Gly Pro Ala Asp
2810 2815 2820
Pro Asp Gly Arg Arg Glu Leu Gln Ile Tyr Ser Arg Pro Asp Glu
2825 2830 2835
Asn Asp Asp Asp Gly Arg Ala Asn Ala Ala Trp Val Cys His Ala
2840 2845 2850
Gln Gly Val Leu Ser Pro Ala Glu Pro Ala Pro Gly Gln Leu Asn
2855 2860 2865
Gly Ser His Gly Pro Asp Val Val Gly Gly Ala Trp Pro Pro Pro
2870 2875 2880
Gly Ala Arg Pro Val Ala Ile Asp Gly Leu Tyr Glu Arg Ala Ala
2885 2890 2895
Ser Ala Gly Tyr Glu Tyr Gly Pro Ala Phe Gln Gly Val Arg Ala
2900 2905 2910
Val Trp Gln His Gly Ala Asp Val Leu Ala Glu Val Glu Leu Pro
2915 2920 2925
Glu Ala Ala Gly Glu Lys Lys Gly Phe Gly Ile His Pro Ile Leu
2930 2935 2940
Leu Asp Ala Ala Leu His Pro Leu Leu Leu Ser Met Asp Asp Ala
2945 2950 2955
Gly Ser Gly Ala Ala Trp Leu Pro Phe Ala Trp Asn Gly Val Thr
2960 2965 2970
Leu Trp Ala Ser Glu Ala Gly Thr Val Arg Val Arg Leu Thr Pro
2975 2980 2985
Gln Asp Gly Ala Glu Gly Asp Arg Thr Val Arg Val Ala Ile Gly
2990 2995 3000
Asp Thr Val Gly Ala Pro Val Leu Thr Ala Asp Ser Val Val Met
3005 3010 3015
Arg Glu Ala Ser Thr Glu His Leu Arg Ala Ala Ala Gly Thr Asn
3020 3025 3030
Arg Asp Gly Ser Asp Thr Asp Gly Leu Phe Thr Thr Arg Trp Thr
3035 3040 3045
Pro Leu Asp Thr Ala Ala Thr Gly Asp Thr Gly Asn Gly Asp Asp
3050 3055 3060
Gly Asp Asp Gly Ala Trp Ala Phe Leu Gly Ser Glu His Met Arg
3065 3070 3075
Pro Gln Gln Asn Gly Glu Thr Arg Pro Glu Trp Leu Trp His Asp
3080 3085 3090
Asp Leu Ala Ala Leu Val Thr Ala Ile Glu Ala Gly Ala Pro Val
3095 3100 3105
Pro Pro Val Val Phe Ala Glu Val Ser Ala Gly Ala Gly Asp Gly
3110 3115 3120
Gly Asn Ser Gly Glu Asn Ala Ala Ala Arg Gly Leu Ala Thr Ala
3125 3130 3135
Arg Arg Thr Leu Ala Leu Leu Gln Asp Trp Leu Ala Glu Pro His
3140 3145 3150
Leu Ala Asp Ala Arg Leu Val Leu Val Thr Arg Gly Ala Val Thr
3155 3160 3165
Thr Gly Glu Leu Asp Gly Pro Ala Ala Gly Gly Asp Ala Glu Ala
3170 3175 3180
Pro Ile Ala Leu Ser Asp Ala Gly Val Trp Gly Leu Val Arg Thr
3185 3190 3195
Ala Gln Ser Glu Asn Pro Asp Arg Phe Val Leu Leu Asp Leu Asp
3200 3205 3210
Ser Gly Thr Ala Asp Ala Thr Gly Asp Gly Ala Val Arg Ala Val
3215 3220 3225
Trp Ala Ala Ile Ala Ala Asp Glu Pro Gln Leu Ala Leu Arg Ser
3230 3235 3240
Gly Arg Ala Phe Val Pro Arg Leu Thr Arg Thr Gly Thr Asp Asp
3245 3250 3255
Gly Thr Leu Arg Leu Pro Gln Pro Glu Arg Ala Trp His Leu Asp
3260 3265 3270
Ala Gly Gly Thr Gly Thr Leu Asp Gly Ile Ala Ala Val Pro Cys
3275 3280 3285
Pro Asp Val Leu Glu Pro Leu Ala Ala Gly Gln Val Arg Ile Glu
3290 3295 3300
Val Arg Thr Ala Gly Ile Asn Phe Arg Asp Val Leu Val Ser Leu
3305 3310 3315
Gly Met Ala Pro Gly Gln Thr Gly Leu Gly Ser Glu Gly Ala Gly
3320 3325 3330
Thr Val Val Glu Val Gly Pro Gly Val Ser Gly Leu Ser Val Gly
3335 3340 3345
Asp Arg Val Met Gly Leu Phe Glu Gly Ala Phe Gly Pro Val Thr
3350 3355 3360
Val Ala Asp Glu Arg Met Val Val Arg Leu Pro Asp Gly Trp Asp
3365 3370 3375
Phe Arg Ser Gly Ala Ala Val Pro Val Val Phe Leu Thr Ala Trp
3380 3385 3390
Tyr Gly Leu Val Glu Leu Ala Gly Leu Lys Ser Gly Glu Ser Val
3395 3400 3405
Leu Val His Ala Ala Thr Gly Gly Val Gly Ser Ala Ala Val Gln
3410 3415 3420
Ile Ala Arg His Leu Gly Ala Glu Val Phe Ala Thr Ala Ser Pro
3425 3430 3435
Gly Lys His Ala Val Leu Glu Ala Met Gly Ile Asp Ala Ala His
3440 3445 3450
Arg Ala Ser Ser Arg Asp Leu Glu Phe Glu Arg Thr Phe Arg Ala
3455 3460 3465
Ala Thr Gly Gly Arg Gly Val Asp Val Val Leu Asn Ser Leu Ala
3470 3475 3480
Gly Glu Phe Thr Asp Ala Ser Leu Arg Leu Val Ala Glu Gly Gly
3485 3490 3495
Arg Phe Leu Glu Met Gly Lys Thr Asp Ile Arg Glu Pro Glu His
3500 3505 3510
Ile Ala Ala Glu His Pro Ala Val Gly Tyr Gln Val Phe Asp Leu
3515 3520 3525
Val Thr Asp Ala Gly Pro Asp Arg Ile Ser Glu Met Phe Arg Ala
3530 3535 3540
Leu Ala Glu Leu Phe Ala Glu Lys Val Leu Arg Pro Ala Pro Thr
3545 3550 3555
Gln Ala Trp Pro Leu Thr Arg Ala Arg Glu Ala Leu Arg Val Met
3560 3565 3570
Ser Gln Ala Lys His Thr Gly Lys Leu Val Leu Asp Ile Pro Pro
3575 3580 3585
Ala Leu Asp Pro Asp Gly Thr Val Leu Ile Thr Gly Gly Thr Gly
3590 3595 3600
Thr Leu Gly Gly Leu Val Ala Glu His Leu Val Arg Ser Glu Gln
3605 3610 3615
Val Arg Ser Leu Val Leu Ala Ser Arg Ser Gly Pro Asp Ala Ala
3620 3625 3630
Gly Ala Ala Glu Leu Lys Ala Arg Leu Glu Glu Leu Gly Ala Glu
3635 3640 3645
Val Arg Ile Val Ala Ala Asp Val Ser Asp Ala Asp Ala Val Thr
3650 3655 3660
Arg Leu Val Ala Glu Ala Asp Ala Asp Thr Gly Arg Pro Leu Thr
3665 3670 3675
Gly Ile Val His Thr Ala Gly Val Ile Asp Asp Ala Val Leu Ala
3680 3685 3690
Ser Gln Thr Pro Glu Arg Leu Ala Arg Val Trp Asp Ala Lys Ala
3695 3700 3705
Ser Ala Ala His His Leu His Ala Ala Thr Glu His Leu Arg Leu
3710 3715 3720
Gly Ala Phe Val Val Phe Ser Ser Ala Ala Gly Thr Phe Gly Ser
3725 3730 3735
Ala Gly Gln Ala Asn Tyr Ala Ala Ala Asn Ala Phe Cys Asp Ala
3740 3745 3750
Leu Ala Ala His Arg His Ala Lys Gly Leu Pro Ala Val Ser Val
3755 3760 3765
Ala Trp Gly Leu Trp Glu Ala Thr Ser Gly Met Thr Gly Asp Leu
3770 3775 3780
Ala Asp Thr Asp Val Ala Arg Met Ser Arg Ala Gly Ile Arg Pro
3785 3790 3795
Leu Ser Ala Arg His Gly Leu Ala Leu Leu Asp Ala Ala Ser Arg
3800 3805 3810
His Gly Ala Ala Asn Leu Leu Ala Ala Asp Leu Asp Val Arg Ala
3815 3820 3825
Leu Asn Gly Gly Ser Ala Glu Ala Gly Gln Gly Ala Val Pro Ala
3830 3835 3840
Phe Leu Arg Ala Leu Ala Gly Gly Gly Thr Ala Thr Arg Arg Thr
3845 3850 3855
Ala Ala Asn Gly Gln Ala Pro Ala Asp Phe Ala Ala Arg Leu Ala
3860 3865 3870
Ala Met Pro Leu Glu Glu Arg Gln Arg Val Leu Leu Asp Leu Val
3875 3880 3885
Arg Ser His Ala Ala Ala Thr Leu Gly His Ser Asp Val Gly Ala
3890 3895 3900
Val Gln Thr Asp Ala Pro Phe Lys Glu Leu Gly Phe Asp Ser Leu
3905 3910 3915
Thr Ala Val Glu Leu Arg Asn Lys Leu Ser Ala Ser Thr Gly Leu
3920 3925 3930
Arg Leu Pro Ala Thr Phe Val Phe Ser His Pro Thr Pro Ser Ala
3935 3940 3945
Ile Ala Thr Glu Leu His Asp Lys Leu Cys Pro Asp Ala Pro Glu
3950 3955 3960
Pro Ser Ala Pro Val Phe Ala Glu Leu Glu Arg Leu Glu Ala Ala
3965 3970 3975
Met Ala Gly Leu Thr Leu Asp Ala Glu Thr Arg Gly Gly Leu Val
3980 3985 3990
Lys Arg Met Gln Thr Leu Met Trp Arg Leu Gly Asp Gly Ala Ala
3995 4000 4005
Glu Ala Glu Gln Ala Thr Ala Asp Glu Asp Leu Asp Ser Ala Ser
4010 4015 4020
Asp Glu Glu Ile Phe Ala Phe Ile Asp Arg Glu Leu
4025 4030 4035
<210> 6
<211> 1669
<212> PRT
<213> Artificial
<220>
<223> madA6基因编码的氨基酸序列
<400> 6
Val Ser Ser Ala Gly Lys Arg Gly Pro Met Pro Gly Thr Glu Glu Lys
1 5 10 15
Leu Arg Gln Tyr Leu Lys Arg Val Thr Val Asp Leu Gly Gln Ala Arg
20 25 30
Gln Arg Leu Arg Asp Val Glu Glu Arg Asn Gln Glu Pro Ile Ala Val
35 40 45
Val Ala Met Ala Cys Arg Tyr Pro Gly Gly Val Arg Gly Pro Glu Asp
50 55 60
Leu Trp Asp Leu Val Ala Ser Arg Gly Asp Gly Ile Thr Glu Phe Pro
65 70 75 80
Asp Asp Arg Gly Trp Asp Leu Asp Gly Leu Tyr His Pro Asp Pro Asp
85 90 95
His Ser Gly Thr Ser Tyr Val Arg Glu Gly Gly Phe Leu Asp Asp Thr
100 105 110
Asp Arg Phe Asp Ala Ala Phe Phe Gly Ile Ser Pro Arg Glu Ala Met
115 120 125
Ala Met Asp Pro Gln Gln Arg Met Leu Leu Glu Val Ser Trp Glu Leu
130 135 140
Phe Glu Arg Ala Gly Ile Ala Pro Ala Ala Leu Lys Gly Thr Pro Thr
145 150 155 160
Gly Val Tyr Ala Gly Met Ser Ser Gln Asp Tyr Met Thr Arg Thr Pro
165 170 175
Arg Ile Pro Glu Gly Phe Glu Gly Tyr Ala Thr Thr Gly Ser Leu Thr
180 185 190
Ser Val Ile Ser Gly Arg Val Ala Tyr Thr Phe Gly Leu Glu Gly Pro
195 200 205
Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Ile His
210 215 220
Leu Ala Cys Gln Ala Leu Arg Gln Asp Glu Cys Thr Leu Ala Leu Ala
225 230 235 240
Gly Gly Val Thr Ala Leu Thr Thr Pro Thr Ala Phe Ala Glu Phe Ser
245 250 255
Arg Gln Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ala Phe Ala Ala
260 265 270
Ala Ala Asp Gly Thr Gly Phe Ser Glu Gly Val Gly Leu Val Leu Leu
275 280 285
Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His His Ile Leu Ala Val
290 295 300
Leu Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Thr
305 310 315 320
Ala Pro Asn Asp Ala Ser Gln Glu Arg Val Ile Arg Gln Ala Leu Ala
325 330 335
Asn Ala Arg Leu Ala Pro Asp Gln Val Asp Ala Val Glu Ala His Gly
340 345 350
Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Gln Ala
355 360 365
Thr Tyr Gly Lys Asp Arg Pro Ala Asp Arg Pro Leu Trp Leu Gly Ser
370 375 380
Val Lys Ser Asn Ile Gly His Thr Gln Ala Ala Ala Gly Val Ala Gly
385 390 395 400
Val Ile Lys Met Ile Met Ala Met Arg Asn Gly Val Leu Pro Pro Ser
405 410 415
Leu His Ile Asp Glu Pro Thr Ser His Val Asp Trp Asp Ser Gly Ala
420 425 430
Val Arg Leu Leu Thr Glu Pro Val Glu Trp Val Trp Gly Gly Arg Pro
435 440 445
Arg Arg Ala Gly Val Ser Ser Phe Gly Ile Ser Gly Thr Asn Ala His
450 455 460
Leu Ile Val Glu Gln Ala Pro Glu Ser Glu Ala Glu Val Glu Val Leu
465 470 475 480
Glu Gly Ala Ala Gly Pro Asp Thr Glu Val Asp Glu Ser Val Pro Gly
485 490 495
Gly Val Val Pro Trp Val Val Ser Ala Arg Ser Val Ala Gly Leu Arg
500 505 510
Asp Gln Ala Ala Val Leu Ser Ala His Leu Ala Gly Arg Glu Val Ser
515 520 525
Pro Val Arg Val Gly Trp Ser Leu Ala Ala Thr Arg Ser Val Phe Glu
530 535 540
His Arg Ala Val Ile Thr Ala His His Arg Glu Glu Phe Leu Glu Gly
545 550 555 560
Leu Asp Ala Leu Ala Ala Gly Gly Gln His Pro Gly Leu Ile Thr Ser
565 570 575
Pro Ala Gly Gly Ala Gly Gly Gly Gly Val Val Trp Met Phe Ser Gly
580 585 590
Gln Gly Ser Gln Arg Pro Gly Met Gly Ala Gly Leu Tyr Ala Arg Phe
595 600 605
Pro Val Phe Ala Ala Ala Phe Asp Glu Ile Cys Gly Leu Leu Asp Pro
610 615 620
His Leu Pro His Pro Leu Arg Glu Val Val Phe Asn Pro Asp Pro His
625 630 635 640
Gln Asp Gln Gln Ser Gly Leu Leu Asp His Thr Leu Tyr Thr Gln Thr
645 650 655
Ala Leu Phe Ala Leu His Ile Ser Leu Ala Arg Leu Leu His His His
660 665 670
Asp His Thr Pro His Thr Leu Ile Gly His Ser Ile Gly Glu Ile Ala
675 680 685
Ala Ala His Ile Ala Gly Ile Leu Asp Leu Pro Asp Ala Cys His Leu
690 695 700
Ile Thr Ala Arg Ala Thr Leu Met Ala Gln Leu Pro Thr Gly Gly Thr
705 710 715 720
Met Thr Ala Ile Gln Ala Arg Glu Asp Glu Leu Ala Ala Glu Val Glu
725 730 735
Glu Ser Gly Gly Thr Val Ala Ile Ala Ala Leu Asn Thr Pro Asp Ser
740 745 750
Thr Val Ile Ser Gly Gln Ala Asp Glu Val Ala Arg Ile Ala Ala Gly
755 760 765
Trp Glu Glu Arg Gly Arg Lys Thr Lys Ala Leu Ala Val Ser His Ala
770 775 780
Phe His Ser Pro Leu Met Asp Pro Met Leu Asp Glu Phe Arg Glu Ala
785 790 795 800
Ile Glu Gly Leu Ser Phe Arg Ala Pro Ser Val Pro Leu Ile Ser Thr
805 810 815
Leu Thr Gly Glu Leu Ala Gly Asp Glu Ile Ala Ser Pro Ala Tyr Trp
820 825 830
Val Asp Gln Val Arg Arg Ala Val Arg Phe Ala Pro Ala Val Ala Arg
835 840 845
Ala Ala Glu Arg Gly Gly Ala Phe Leu Glu Ile Gly Pro Asp Pro Val
850 855 860
Leu Ala Thr Ala Ala Gln His Thr Leu Ser Glu Pro Ala Ser Gly Arg
865 870 875 880
Glu Arg Pro Glu Pro Leu Val Thr Ala Val Leu Asp Arg His Arg Pro
885 890 895
Asp Asp Glu Ala Phe Val Arg Ala Leu Ala Glu Val His Thr His Val
900 905 910
Arg Ser Ser Ala Ile Gly Trp Ala Arg Leu Phe Pro Asp Asp Pro Ala
915 920 925
Pro Arg Ala Val Asp Leu Pro Thr Tyr Ala Phe Gln Arg Gln Arg Tyr
930 935 940
Trp Met Ala Asp Val Thr Pro Pro Glu Ser Ala Ala Gly Gly Asp Ala
945 950 955 960
Ser Glu Glu Arg Phe Trp Gly Ala Val Glu Arg Gly Asp Leu Glu Ala
965 970 975
Leu Ser Asp Thr Leu Arg Leu Pro Asp Gly Asp Gly Gln Arg Ala Ser
980 985 990
Leu Gly Glu Ile Leu Pro Val Leu Ser Gly Trp Arg Arg Glu Arg Arg
995 1000 1005
Glu Arg Ser Ala Val Asp Ser Trp Arg Tyr Arg Val Thr Trp Lys
1010 1015 1020
Arg Leu Thr Gly Leu Asp Gly Gly Thr Pro Ser Gly Pro Trp Leu
1025 1030 1035
Val Val Ala Ala Gly Gly Ala Asp Gly Trp Ala Asp Ala Cys Gly
1040 1045 1050
Arg Ala Leu Gly Glu Glu Ala His Arg Leu Asp Val Asp Gly Ala
1055 1060 1065
Val Asp Arg Glu Gly Leu Ala Glu Trp Leu Arg Ser Arg Tyr Ala
1070 1075 1080
Asp Gly Ala Pro Pro Ala Gly Val Leu Ser Leu Leu Ala Leu Arg
1085 1090 1095
Asp Gly Pro Glu Thr Gly Ala Gly Gly Ala Ala Gly Thr Leu Ala
1100 1105 1110
Leu Leu Gln Ala Met Val Asp Thr Gly Ile Gly Ala Pro Leu Trp
1115 1120 1125
Cys Ala Thr Arg Gly Ala Val Ser Val Gly Asp Ser Asp Leu Leu
1130 1135 1140
Glu Ser Pro Ala Gln Ala Arg Val Trp Gly Leu Gly Arg Val Ala
1145 1150 1155
Ala Leu Glu His Pro Asp Leu Trp Gly Gly Leu Val Asp Leu Pro
1160 1165 1170
Pro Gly Pro Asp Gly Leu Asp Ala Ala Arg Leu Arg Ala Val Leu
1175 1180 1185
Thr Gly Thr Ala Gly Glu Asp Gln Val Ala Leu Arg Ala Asn Gly
1190 1195 1200
Ala Phe Gly Cys Arg Val Val Pro Ser Pro Ala Gly Asp Gly Glu
1205 1210 1215
Pro Gly Arg Glu Trp Ser Pro Glu Gly Thr Val Leu Val Thr Gly
1220 1225 1230
Gly Leu Gly Glu Pro Val Ala Arg Ile Ala Arg Trp Leu Ala Glu
1235 1240 1245
Gly Gly Ala Glu Arg Val Val Val Pro Asp Pro Gly Gly Thr Asp
1250 1255 1260
Ala Pro Gly Ala Gly Glu Leu Ala Asp Glu Leu Ser Gly Leu Gly
1265 1270 1275
Ala Glu Leu Ile Val Ala Arg Cys Gly Pro Asp Asp Pro Ala Ala
1280 1285 1290
Val Lys Glu Leu Ala Gly Arg Leu Ser Ala Ala Gly Ala Arg Ile
1295 1300 1305
Gly Met Ile Val His Ala Pro Ala Pro Ala Glu Pro Gly Pro Leu
1310 1315 1320
Ala Glu Leu Ala Pro Ala Ala Leu Glu Ala Phe Pro Ala Glu Asp
1325 1330 1335
Val Gly Ala Trp Arg Arg Val Gly Glu Leu Cys Gly Met Glu Pro
1340 1345 1350
Asp Glu Pro Ala Val Cys Phe Thr Ser Val Ala Ala Leu Trp Gly
1355 1360 1365
Ser Thr Gly Gln Gly Ala His Ala Ala Ala Gly Ala His Leu Asp
1370 1375 1380
Ala Leu Ala Gly Pro Gly Arg Pro Glu Gly Val Val Ser Val Ala
1385 1390 1395
Trp Gly Ala Trp Asp Val Pro Ala Asp Thr Asp Glu Arg Gly Ala
1400 1405 1410
Arg Gly Ala Asp Gly Leu Arg Arg Gln Gly Leu Gln Pro Leu Asp
1415 1420 1425
Pro Arg Leu Ala Leu Thr Ala Leu Glu Arg Val Leu Gly Asn Gly
1430 1435 1440
Asp Arg Arg Ile Ala Val Ala Asp Val Ala Trp Asp Arg Phe Ala
1445 1450 1455
Pro Leu Phe Thr Leu Ala Arg Pro Ser Arg Leu Phe Glu Asp Val
1460 1465 1470
Pro Glu Ala Arg Arg Ala Ile Glu Ala Ala Arg Gly Pro Ser Asp
1475 1480 1485
Asp Glu Ala Ala Asp Arg Thr Ala Glu Leu Arg Arg Glu Leu Ala
1490 1495 1500
Ala Val Thr Ala Asp Glu Arg Ala Ala Arg Leu Leu Ala Met Val
1505 1510 1515
Arg Thr Asp Ala Ala Ala Val Leu Arg Tyr Glu Ala Ala Asp Ala
1520 1525 1530
Val Asp Pro Asp Leu Pro Phe Lys Asp Leu Gly Phe Asp Ser Ile
1535 1540 1545
Ala Ala Val Gly Leu Arg Asn Arg Leu Arg Ala Ser Thr Gly Leu
1550 1555 1560
Arg Leu Pro Ala Thr Val Gly Phe Asp Tyr Pro Thr Pro Arg Ala
1565 1570 1575
Leu Ala Gly Tyr Leu Leu Gly Arg Val Leu Pro Glu Glu Ser Gly
1580 1585 1590
Thr Gly His Ala Ala Phe Gly His Val Glu Glu Leu Asp Ala Ala
1595 1600 1605
Leu Ala Glu Leu Pro Leu Glu Asp Pro Arg Arg Ala Gly Leu Met
1610 1615 1620
Asn Arg Met Arg Ala Leu Leu Trp Lys Tyr Glu Pro Asp Ser Ala
1625 1630 1635
Ala Ser Glu Ala Gly Asp Gly Asp Gly Glu Glu Asp Leu Ala Ala
1640 1645 1650
Ala Ser Ala Asp Asp Met Phe Ala Leu Ile Asp Arg Glu Leu Gly
1655 1660 1665
Thr
<210> 7
<211> 4770
<212> PRT
<213> Artificial
<220>
<223> madA7基因编码的氨基酸序列
<400> 7
Met Ala Ser Asn Glu Glu Lys Leu Leu Asp Tyr Leu Lys Arg Val Thr
1 5 10 15
Thr Asp Leu Arg Gln Ala Gln Arg Arg Leu Lys Asp Val Glu Ser Val
20 25 30
Gly His Glu Pro Ile Ala Ile Val Gly Met Ala Cys Arg Phe Pro Gly
35 40 45
Gly Val Arg Ser Pro Gly Glu Leu Trp Asp Leu Val Ala Glu Gly Arg
50 55 60
Asp Ala Val Gly Gly Phe Pro Ala Asp Arg Asn Trp Asp Leu Asp Gly
65 70 75 80
Leu Phe Asp Pro Asp Pro Glu His Pro Gly Thr Ser Tyr Val Arg Glu
85 90 95
Gly Gly Phe Ile Asp Asp Val Ala Gly Phe Asp Ala Ser Phe Phe Gly
100 105 110
Ile Gly Pro Arg Glu Ala Leu Ala Met Ala Pro Gln Gln Arg Leu Ala
115 120 125
Leu Glu Thr Ser Trp Glu Ala Val Glu His Ala Gly Ile Asp Pro Glu
130 135 140
Ser Leu Arg Ser Ser Pro Thr Ser Thr Phe Ile Gly Cys Asp Arg Leu
145 150 155 160
Asp Tyr Tyr Ser Asp Pro Ser Gln Val Pro Glu Gly Ser Ala Gly Tyr
165 170 175
Phe Thr Ile Gly Asn Thr Ala Ser Val Val Ser Gly Arg Val Ala Tyr
180 185 190
Ala Leu Gly Leu Glu Gly Ala Ala Val Thr Val Asp Thr Ala Cys Ser
195 200 205
Ser Ser Leu Val Thr Leu His Leu Ala Ala Arg Ala Leu Arg Gln Arg
210 215 220
Glu Cys Asp Met Ala Leu Ala Gly Gly Val Phe Val Met Ser Ser Ser
225 230 235 240
Ala Pro Leu Val Gly Phe Ser Gln Leu Arg Ala Leu Ala Pro Asp Gly
245 250 255
Arg Ser Lys Ser Phe Ala Ala Gly Ala Asp Gly Met Thr Met Ala Glu
260 265 270
Gly Ala Gly Met Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Ala
275 280 285
Gly His Arg Val Leu Ala Val Ile Arg Gly Ser Ala Met Asn Gln Asp
290 295 300
Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg
305 310 315 320
Val Ile Ala Asp Ala Leu Ala Asp Ala Arg Leu Ser Pro Ala Asp Val
325 330 335
Asp Ala Leu Glu Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro Ile
340 345 350
Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Gln Gly Arg Pro Asp Asp
355 360 365
Arg Pro Leu Trp Leu Gly Ser Ile Lys Ser Asn Ile Gly His Thr Gln
370 375 380
Met Ala Ala Gly Val Ala Gly Val Ile Lys Met Val Met Ala Leu Arg
385 390 395 400
Asn Gly Arg Leu Pro Ala Ser Leu His Ile Asp Glu Pro Ser Pro His
405 410 415
Val Asp Trp Asp Ala Gly Ala Val Arg Leu Leu Thr Asp Pro Val Glu
420 425 430
Trp Arg Asn Gly Asp Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly
435 440 445
Ile Ser Gly Thr Asn Ala His Val Ile Leu Glu Glu Ala Pro Glu Pro
450 455 460
Asp Asp Thr Ala Pro Ala Ala Pro Ala Pro Ala Pro Asp Ala Glu Val
465 470 475 480
Ala Gly Gly Val Val Pro Trp Ala Leu Ser Ala Arg Asn Ala Pro Ala
485 490 495
Leu Arg Asp Gln Ala Ala Ala Leu Val Thr His Leu Glu Glu Arg Asp
500 505 510
Asp Phe Ser Pro Val Glu Val Gly Trp Ser Leu Ala Thr Thr Arg Ser
515 520 525
Thr Phe Glu His Arg Ala Val Val Ile Gly Gln Asp Arg Asp Gln Leu
530 535 540
Met Asp Gly Leu Lys Ala Leu Ala Ser Gly Ala Gln His Pro Asn Leu
545 550 555 560
Val Asn Pro Gly Val Ala Ala Ala His Ser Glu Asn Gly Pro Val Met
565 570 575
Val Phe Pro Gly Gln Gly Ser Gln Trp Pro Gly Met Gly Ala Glu Leu
580 585 590
Leu Asp Ser Ser Pro Val Phe Ala Ala Arg Met Ala Glu Cys Glu Arg
595 600 605
Ala Leu Ala Pro Tyr Val Asp Trp Ser Leu Thr Asp Val Leu Arg Gly
610 615 620
Ala Asp Gly Ala Ala Asp Leu Glu Arg Val Asp Val Val Gln Pro Val
625 630 635 640
Leu Trp Ala Val Met Val Ser Leu Ala Ala Val Trp Ala Glu His Gly
645 650 655
Val Thr Pro Ala Ala Val Val Gly His Ser Gln Gly Glu Ile Gly Ala
660 665 670
Ala Cys Val Ala Gly Ala Leu Gly Leu Glu Asp Gly Ala Lys Ile Val
675 680 685
Ala Leu Arg Ser Arg Ala Leu Arg Arg Leu Ala Gly Thr Gly Ala Met
690 695 700
Ala Ser Leu Glu Thr Gly Pro Glu Glu Ala Glu Gly Leu Leu Asp Gly
705 710 715 720
Gly Val Thr Ile Ala Ala Val Asn Gly Pro Thr Ser Val Val Val Ser
725 730 735
Gly Pro Pro Gly Glu Val Ala Gly Val Val Asp Arg Val Lys Ala Arg
740 745 750
Gly Leu Arg Ala Arg Leu Ile Asp Val Asp Tyr Ala Ser His Gly Pro
755 760 765
His Val Gly Glu Val Ala Asp Glu Leu Arg Glu Ala Leu Ala Gly Ile
770 775 780
Glu Pro Ala Ala Ser Ser Val Ala Phe Tyr Ser Thr Val Thr Ala Ala
785 790 795 800
Arg Ile Asp Thr Ser Gly Leu Asp Ala Asp Tyr Trp Val Ala Asn Leu
805 810 815
Cys Arg Pro Val Arg Phe Ala Asp Ala Val His Ala Leu Leu Asp Asp
820 825 830
Gly His Gly Thr Phe Val Glu Ala Ser Pro His Pro Ile Leu Thr Leu
835 840 845
Ala Leu Glu Gln Ile Ala Glu Asp Ala Gly Ala Ala Ala Gly Ile Val
850 855 860
Pro Thr Leu Arg Arg Asp His Gly Asp His Ala Gln Leu Thr Arg Ala
865 870 875 880
Leu Ala His Ala His Thr Leu Ala Ala Pro Val Asp Trp Thr Arg Cys
885 890 895
Phe Pro Ala Ser Pro Pro Pro Ala Thr Thr Asp Leu Pro Thr Tyr Pro
900 905 910
Phe Gln Arg Glu Arg Phe Trp Leu Ala Pro Ala Ala Arg Ser Gly Val
915 920 925
Gly Gly Asp Gly His Asp Pro Ala Glu Ser Glu Leu Trp His Ala Ile
930 935 940
Glu Glu Leu Asp Val Asp Ala Leu Ser Ala Thr Leu Arg Leu Glu Asp
945 950 955 960
Gly Ser Pro Ala Val Asp Asp Leu Arg Pro Ala Leu Pro Ile Leu Ala
965 970 975
Gln Trp Arg Arg Arg His Arg Glu Arg Thr Val Leu Asp Ser Trp Arg
980 985 990
Tyr Lys Ile Thr Trp Lys His Leu Arg Glu Glu Arg Ala Pro Glu Leu
995 1000 1005
Ser Gly Thr Trp Leu Val Ile Thr Pro Ala Asp Glu Asp Gly Gly
1010 1015 1020
Ser Pro Ala Val His Thr Ala Val Gln Ala Ile Arg Gly His Gly
1025 1030 1035
Gly Asp Val Arg Leu Leu Pro Val Asp Thr Ala Gly Ala Thr Arg
1040 1045 1050
Glu Arg Phe Val Arg Asp Leu Ala Glu Ala Ser Ala Glu Thr Pro
1055 1060 1065
Pro Ala Gly Val Leu Ser Leu Leu Ala Leu Asp Ala Ser Pro His
1070 1075 1080
Pro Asp Glu Pro Ala Val Pro Ala Gly Leu Ala Ala Thr Ile Ala
1085 1090 1095
Leu Ile Gln Ala His Gly Asp Gly Gly Ala Gly Ala Pro Leu Trp
1100 1105 1110
Cys Leu Thr Gln Gly Ala Val Ala Val Ser Pro Thr Asp Pro Leu
1115 1120 1125
Pro Ser Pro Arg Gln Ala Gln Val Trp Gly Leu Gly Arg Val Ala
1130 1135 1140
Ala Leu Glu Tyr Pro Ser Arg Trp Gly Gly Leu Ile Asp Leu Pro
1145 1150 1155
Ala Ala Pro Asp Arg Asn Thr Ala Ala Arg Leu Ala Gly Leu Leu
1160 1165 1170
Ala Ser Gly Gly Pro Glu Asp Gln Val Ala Val Arg Thr Thr Ala
1175 1180 1185
Ser Phe Ala Arg Arg Met Asp Arg Ala Pro Ala Arg Asp Gly Gly
1190 1195 1200
Asp Ala Pro Trp Arg Pro Thr Gly Thr Thr Leu Val Thr Gly Gly
1205 1210 1215
Thr Gly Gly Leu Gly Thr His Leu Ala Arg Trp Leu Ala Arg Asn
1220 1225 1230
Gly Ala Pro His Leu Val Leu Thr Ser Arg Arg Gly Pro Asp Ala
1235 1240 1245
Pro Gly Ala Ala Glu Leu Ala Glu Glu Leu Glu Ala Leu Gly Thr
1250 1255 1260
Arg Val Thr Leu Ala Ala Cys Asp Ala Ser Asp Arg Ser Ala Leu
1265 1270 1275
Glu Gly Val Ile Asp Ala Val Pro Ala Glu His Pro Leu Thr Ala
1280 1285 1290
Val Phe His Ala Ala Gly Met Ser Asp Leu Thr Pro Ile Thr Asp
1295 1300 1305
Leu Asp Leu Ala Arg Ala Gly Glu Val Leu Arg Ser Lys Ser Gln
1310 1315 1320
Ala Ala Glu His Leu His Asp Leu Thr Arg His Leu Asp Leu Thr
1325 1330 1335
Ala Phe Val Leu Phe Ser Ser Gly Ala Ala Ala Trp Gly Ser Gly
1340 1345 1350
Gln Gln Gly Ala Tyr Ala Ala Ala Asn Val Tyr Leu Asp Ala Leu
1355 1360 1365
Ala Glu His Arg Arg Arg Leu Gly Leu Pro Ser Thr Ser Leu Ala
1370 1375 1380
Trp Gly Pro Trp Gly Glu Ala Gly Met Ser Ala Asp Glu Asn Val
1385 1390 1395
Ile Ala Phe Phe Ala Arg Arg Gly Leu Thr Pro Met Asp Pro Gly
1400 1405 1410
Leu Ala Val Lys Thr Leu His Glu Ala Ile Thr Gln Gly Glu Thr
1415 1420 1425
Ser Leu Thr Val Ala Asp Ile His Trp Arg Lys Phe Pro Ala Ala
1430 1435 1440
Leu Thr Thr Gln Arg Pro Ser Pro Phe Leu Ser Asp Leu Ala Glu
1445 1450 1455
Thr Asp Pro Asp Asp Ala Asp Arg Arg Asp Ala Ser Pro Gly Ala
1460 1465 1470
Ser Pro Leu Arg Gln Glu Leu Ala Gly Ser Ala Pro Lys Gln Gln
1475 1480 1485
Leu Asp Leu Leu Leu Arg His Val Arg Thr His Ala Ala Ser Ile
1490 1495 1500
Leu Gly His Thr Ser Ile Asp Ala Val Pro Ala Gly Gln Pro Leu
1505 1510 1515
Gln Glu Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Ser Lys
1520 1525 1530
Gln Leu Ser Thr Ser Thr Gly Leu Ser Leu Pro Arg Thr Leu Val
1535 1540 1545
Phe Asp Gln Pro Thr Pro Asn Ala Leu Ala Lys Tyr Leu His Asn
1550 1555 1560
Glu Leu Ala Asp Asp Gln Pro Ala Ala Ala Ser Arg Gly Ser Ala
1565 1570 1575
Arg Thr Ala Thr Ala Ala Ala Ser Asp Glu Pro Ile Ala Ile Val
1580 1585 1590
Ala Met Ala Cys Arg Tyr Pro Gly Gly Val Arg Asn Pro Gln Gln
1595 1600 1605
Leu Trp Asp Leu Val Ala Ser Gly Gly Asp Ala Ile Ala Gly Met
1610 1615 1620
Pro Thr Asn Arg Tyr Trp Asp Leu Asn Asn Leu Tyr His Pro Asp
1625 1630 1635
Pro Glu His Pro Gly Thr Ser Tyr Val Arg Glu Gly Ala Phe Leu
1640 1645 1650
Tyr Asp Ala Pro Glu Phe Asp Ala Gly Phe Phe Gly Ile Ser Pro
1655 1660 1665
Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Leu Leu Glu
1670 1675 1680
Thr Ala Trp Glu Thr Phe Glu Asn Ala Gly Leu Thr Gln Glu Ala
1685 1690 1695
Leu Ser Gly Ser Asn Val Gly Val Phe Thr Gly Gly Thr Phe Gln
1700 1705 1710
Gly Tyr Ser Ala Thr Gly Asn Pro Ala Gln Glu Ser Glu Gly Tyr
1715 1720 1725
Ala Leu Ala Gly Thr Thr Ala Ser Val Ile Ser Gly Arg Val Ser
1730 1735 1740
Tyr Ala Phe Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala
1745 1750 1755
Cys Ser Ser Ser Leu Val Ala Ile His Leu Ala Ser Arg Ala Leu
1760 1765 1770
Ala Gln Gly Glu Cys Ser Leu Ala Leu Ala Gly Gly Val Ala Ile
1775 1780 1785
Met Ala Thr Pro Ala Thr Phe Val Gly Phe Ser Arg Gln Arg Gly
1790 1795 1800
Leu Ala Pro Asn Gly Arg Ala Lys Pro Phe Ala Ala Ala Ala Asp
1805 1810 1815
Gly Thr Asn Trp Gly Glu Gly Ala Gly Leu Leu Leu Leu Glu Arg
1820 1825 1830
Leu Ser Asp Ala Glu Arg Asn Gly His His Val Leu Ala Thr Ile
1835 1840 1845
Arg Gly Ser Ala Val Asn Gln Asp Gly Thr Ser Asn Gly Leu Thr
1850 1855 1860
Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Arg Gln Ala Leu
1865 1870 1875
Ala Asn Ala Arg Leu Glu Pro Ala Asp Ile Asp Ala Val Glu Ala
1880 1885 1890
His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala Gln Ala
1895 1900 1905
Leu Leu Ala Thr Tyr Gly Gln Asp Arg Pro Ala Asp Arg Pro Leu
1910 1915 1920
Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr Gln Met Ala
1925 1930 1935
Ala Gly Val Ala Gly Val Ile Lys Met Val Met Ala Met Arg Asn
1940 1945 1950
Asp Leu Leu Pro Glu Ser Leu His Ile Asp Glu Pro Ser Pro His
1955 1960 1965
Val Asp Trp Asp Ala Gly Ala Val Arg Leu Leu Ala Glu Pro Val
1970 1975 1980
Glu Trp Pro Glu Gly Glu Arg Pro Arg Arg Ala Gly Val Ser Ala
1985 1990 1995
Phe Gly Ile Ser Gly Thr Asn Ala His Val Ile Leu Glu Glu Ala
2000 2005 2010
Pro Glu Pro Ser Val Pro Asp Thr Ala Ser Glu Ala Pro Asp Ala
2015 2020 2025
Ala Asp Ala Glu Pro Asp Ala Ser Gly Gly Pro Val Pro Trp Val
2030 2035 2040
Val Ser Ala Arg Ser Ala Thr Ala Leu Arg Asp Gln Ala Ala Ala
2045 2050 2055
Leu Ser Ala His Leu Ala Gly Gly Asp Asp Ser Pro Gly Asp Val
2060 2065 2070
Gly Trp Ala Leu Ala Thr Arg Arg Ser Thr Phe Glu His Arg Ala
2075 2080 2085
Val Val Leu Gly Arg Asp Arg Asp Glu Leu Leu Ala Gly Leu Asp
2090 2095 2100
Ala Leu Ala Thr Ala Gly Gln His Pro Gly Leu Val Ala Pro Gly
2105 2110 2115
Thr Pro Ala Ala Ala Ser Glu Pro Gly Pro Val Leu Val Phe Pro
2120 2125 2130
Gly Gln Gly Ser Gln Trp Pro Gly Met Gly Ala Lys Leu Leu Asp
2135 2140 2145
Thr Ser Pro Val Phe Ala Ala Arg Ile Ala Glu Cys Glu Gln Ala
2150 2155 2160
Leu Ala Pro His Ile Asp Trp Ser Leu Thr Asp Val Leu Arg Gly
2165 2170 2175
Thr Glu Gly Ala Ala Asp Leu Asn Arg Val Asp Val Val Gln Pro
2180 2185 2190
Val Leu Trp Ala Thr Met Val Ser Leu Ala Ala Val Trp Ala Asp
2195 2200 2205
His Gly Val Thr Pro Ala Ala Val Ile Gly His Ser Gln Gly Glu
2210 2215 2220
Ile Ala Ala Ala Cys Ile Ala Gly Ala Leu Thr Leu Thr Asp Ala
2225 2230 2235
Ala Thr Ile Thr Ala Leu Arg Ser Lys Ala Leu Arg Asn Leu Ala
2240 2245 2250
Gly His Gly Ala Met Ala Ser Leu Gly Thr Gly Arg Asp Gly Ala
2255 2260 2265
Glu Arg Leu Leu Glu Glu Arg Gly Gly Asp Val Val Val Ala Ala
2270 2275 2280
Phe Asn Gly Pro Ser Ser Thr Val Val Ser Gly Pro Pro Glu Ala
2285 2290 2295
Val Ala Asp Val Val Ala Ala Ala Lys Ala Ala Gly Leu Arg Ala
2300 2305 2310
Arg Met Ile Asp Val Asp Tyr Ala Ser His Gly Pro Gln Ile Asp
2315 2320 2325
Arg Ile Thr Gly Glu Leu His Glu Val Leu Ala Gly Ile Arg Pro
2330 2335 2340
Phe Asp Ala Glu Ile Pro Phe Tyr Ser Thr Val His Ala Gly Arg
2345 2350 2355
Ile Asp Thr Thr Gly Leu Asp Ala Ala Tyr Trp Val Thr Asn Leu
2360 2365 2370
Arg Glu Gln Val Arg Phe Ala Asp Thr Val Glu Ala Leu Leu Asn
2375 2380 2385
Asp Gly His Arg Val Phe Ile Glu Ala Ser Ala His Pro Val Leu
2390 2395 2400
Thr Val Gly Leu Glu Glu Ile Phe Glu Glu Ala Gly Ala Asp Ala
2405 2410 2415
Tyr Ala Val Pro Thr Leu His Arg Asp Arg Gly Asp Arg Ala Gln
2420 2425 2430
Val Ala Arg Ser Val Ala Glu Ala Phe Thr Ala Gly Val Gln Val
2435 2440 2445
Asp Trp Thr Ala Leu Phe Gly Arg Thr Ala Arg Arg Arg Ala Val
2450 2455 2460
Asp Leu Pro Thr Tyr Ala Phe Gln Arg Arg Pro Tyr Trp Leu Ala
2465 2470 2475
Pro Ala Val Pro Ser Gly Gly Gly Gly Pro Gly His Asp Gln Ala
2480 2485 2490
Glu Thr Glu Leu Trp Asn Ala Ile Glu Glu Leu Asp Val Asp Ala
2495 2500 2505
Leu Gly Ala Ala Leu His Leu Asp Gly Asp Gly Pro Ala Leu Asp
2510 2515 2520
Asp Leu Arg Pro Ala Leu Pro Ile Leu Ser Glu Trp Arg Arg Arg
2525 2530 2535
His Arg Glu Gln Ser Thr Leu Asp Ser Trp Arg Tyr Gln Ile Gly
2540 2545 2550
Trp Thr His Leu Pro Glu Ile Ala Ala Pro Val Leu Ser Gly Thr
2555 2560 2565
Trp Leu Val Leu Val Pro Ala Asp Gly Gly Ala Gly Ala Ala Glu
2570 2575 2580
Arg Ser Ala Ile Asp Thr Ala Val Gln Ala Ile Arg Ser His Gly
2585 2590 2595
Ala Thr Ala Ser Val Leu Pro Val Asp Cys Ala Ala Val Glu Arg
2600 2605 2610
Asp Gln Leu Val Gln His Leu Thr Glu Ala Glu Thr Pro Pro Ser
2615 2620 2625
Gly Ile Leu Ser Leu Leu Ala Leu Asp Glu Thr Pro His Pro Ser
2630 2635 2640
His Pro Ala Val Pro Ala Gly Leu Ala Ala Thr Thr Ala Leu Val
2645 2650 2655
Gln Ala Leu Asp Ala Ser Gly Ile Ala Val Arg Leu Trp Cys Val
2660 2665 2670
Thr Gln Gly Ala Val Ala Val Ser Pro Ser Asp Pro Leu Leu Asn
2675 2680 2685
Pro Val Gln Ala Glu Thr Trp Gly Leu Gly Arg Val Ala Ala Leu
2690 2695 2700
Glu Tyr Pro Ala Arg Trp Gly Gly Leu Ile Asp Leu Pro Thr Thr
2705 2710 2715
Pro Asp Gln His Thr Pro Ala Arg Leu Ala Ala Leu Leu Thr Pro
2720 2725 2730
Gly Gln Pro Glu Asp Gln Val Ala Val Arg Ala Thr Ala Thr Leu
2735 2740 2745
Ala Arg Arg Met Arg Arg Ala Pro Ala Pro Ser Gly Asn Gly Arg
2750 2755 2760
Ala Pro Trp Thr Pro Ser Gly Thr Thr Leu Ile Thr Gly Gly Thr
2765 2770 2775
Gly Gly Leu Gly Ala His Leu Ala Arg Trp Leu Ala His Asn Gly
2780 2785 2790
Ala Pro His Leu Leu Leu Thr Ser Arg Arg Gly Pro Asp Ala Pro
2795 2800 2805
Gly Ala Arg Glu Leu Ala Asp Glu Leu Arg Ala Leu Gly Thr Ala
2810 2815 2820
Val Thr Leu Thr Ala Cys Asp Val Ser Asp Arg Ser Ala Leu Lys
2825 2830 2835
Gly Val Ile Asp Gly Val Pro Asp Asp Gln Pro Leu Asn Ala Val
2840 2845 2850
Phe His Ala Ala Gly Ile Pro Glu Leu His Pro Phe Ala Glu Leu
2855 2860 2865
Asp Ile Pro His Ile Ser Asp Val Leu Leu Pro Lys Ala Gln Ala
2870 2875 2880
Ala Ala His Leu His Glu Leu Thr Arg His Leu Asp Leu Thr Ala
2885 2890 2895
Phe Val Leu Phe Ser Ser Gly Ala Ala Ala Trp Gly Ser Gly Gln
2900 2905 2910
Gln Ala Ser Tyr Ala Ala Ala Asn Thr Tyr Leu Asp Ala Leu Ala
2915 2920 2925
Glu His Arg Arg Thr Leu Gly Leu Pro Ala Thr Ser Ile Ala Trp
2930 2935 2940
Gly Pro Trp Gly Glu Ala Gly Met Ala Ala Asp Glu Asn Val Ile
2945 2950 2955
Ala Phe Phe Ala Arg Arg Gly Leu Ala Ile Met Asp Pro Asp Leu
2960 2965 2970
Ala Val Lys Ser Leu His Gln Ala Val Ser His Gly Asp Thr Thr
2975 2980 2985
Leu Thr Val Ala Asp Ile His Trp Glu Thr Phe Thr Ala Thr Phe
2990 2995 3000
Thr Thr Gln Arg Pro Ser Arg Leu Val Ala Asp Leu Thr Pro Ser
3005 3010 3015
Arg Ser Asp Thr Gly Asp Gly Thr Gly Gly Gly Ala Gly Ala Glu
3020 3025 3030
Glu Gly Arg Thr Asp Asp His Pro Leu Arg Gly Glu Leu Ala Gly
3035 3040 3045
Ser Thr Pro Lys Gln Gln Leu Glu Leu Leu Val Arg His Val Gln
3050 3055 3060
Glu His Ala Ala Thr Ile Leu Gly His Ser Gly Ala Asp Ala Val
3065 3070 3075
Ser Ala Gly Gln Pro Leu Gln Glu Leu Gly Phe Asp Ser Leu Thr
3080 3085 3090
Ala Val Glu Leu Ser Lys Arg Leu Gly Ser Ala Thr Gly Leu Ser
3095 3100 3105
Leu Pro Arg Thr Leu Val Phe Asp His Pro Thr Pro Asn Ala Ile
3110 3115 3120
Ala Lys Tyr Leu Arg Ala Glu Leu Thr Gly Gln Gln Ala Asp Ala
3125 3130 3135
Val Arg Ser Pro Ala Ser Ile Thr Ser Ala Ala Thr Asp Glu Pro
3140 3145 3150
Ile Ala Ile Val Ala Met Ala Cys Arg Phe Pro Gly Gly Val Arg
3155 3160 3165
Asn Pro Gln Gln Leu Trp Asp Leu Val Ala Ser Gly Gly Asp Ala
3170 3175 3180
Ile Ala Glu Met Pro Thr Asn Arg Ser Trp Asp Leu Asp Asn Leu
3185 3190 3195
Tyr His Pro Asp Pro Glu His Pro Gly Thr Ser Tyr Val Arg Glu
3200 3205 3210
Gly Ala Phe Leu Tyr Asp Ala Pro Glu Phe Asp Ala Gly Phe Phe
3215 3220 3225
Gly Ile Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg
3230 3235 3240
Leu Leu Leu Glu Thr Ala Trp Glu Thr Phe Glu Asn Ala Gly Leu
3245 3250 3255
Thr Gln Glu Ala Leu Ser Gly Ser Asn Val Gly Val Phe Thr Gly
3260 3265 3270
Gly Thr Phe Gln Gly Tyr Ser Ala Thr Gly Thr Pro Ala Lys Glu
3275 3280 3285
Ile Glu Gly Tyr Ile Leu Val Gly Asn Thr Ala Ser Val Met Ser
3290 3295 3300
Gly Arg Val Ser Tyr Thr Phe Gly Leu Glu Gly Pro Ala Val Thr
3305 3310 3315
Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Ile His Leu Ala
3320 3325 3330
Ser Gln Ala Leu Ala Gln Gly Glu Cys Asp Leu Ala Leu Ala Gly
3335 3340 3345
Gly Val Thr Val Met Ala Thr Pro Ala Thr Phe Ile Gly Phe Ser
3350 3355 3360
Arg Gln Arg Gly Leu Ala Pro Asn Gly Arg Cys Lys Pro Phe Ala
3365 3370 3375
Ala Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly Ala Gly Leu Leu
3380 3385 3390
Leu Leu Glu Arg Leu Ser Asp Ala Glu Arg Asn Gly His His Ile
3395 3400 3405
Leu Ala Thr Ile Arg Gly Ser Ala Val Asn Gln Asp Gly Thr Ser
3410 3415 3420
Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile
3425 3430 3435
Arg Gln Ala Leu Ala Asn Ala Arg Leu Glu Pro Ala Asp Val Asp
3440 3445 3450
Ala Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile
3455 3460 3465
Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Arg Asn Arg Pro Glu
3470 3475 3480
Asp Arg Pro Leu Trp Leu Gly Ser Ile Lys Ser Asn Ile Gly His
3485 3490 3495
Thr Gln Met Ala Ala Gly Val Ala Gly Val Ile Lys Met Val Met
3500 3505 3510
Ala Met Arg Asn Gly Leu Leu Pro Glu Ser Leu His Ile Asp Glu
3515 3520 3525
Pro Ser Pro His Val Asp Trp Glu Ala Gly Ala Val Arg Leu Leu
3530 3535 3540
Ser Glu Pro Val Glu Trp Val Trp Gly Gly Arg Pro Arg Arg Ala
3545 3550 3555
Gly Val Ser Ser Phe Gly Ile Ser Gly Thr Asn Ala His Leu Ile
3560 3565 3570
Val Glu Gln Ala Pro Glu Pro Asp Pro Glu Val Val Pro Glu Ala
3575 3580 3585
Glu Ser Glu Ser Glu Ala Arg Val Gly Ser Gly Gly Val Val Pro
3590 3595 3600
Trp Val Val Ser Ala Arg Ser Val Ala Gly Leu Arg Asp Gln Ala
3605 3610 3615
Ala Ala Leu Ser Ala His Leu Ser Gly Arg Asp Val Ser Pro Val
3620 3625 3630
Gln Val Gly Trp Ser Leu Ala Thr Arg Arg Ser Val Phe Glu His
3635 3640 3645
Arg Ala Val Ile Thr Gly His His Ser Glu Glu Phe Leu Gln Gly
3650 3655 3660
Leu Asp Ala Leu Ala Ala Gly Asp Asp His Pro Cys Leu Thr Ala
3665 3670 3675
Ser Pro Ser Ala Gly Ala Thr Gly Gly Glu Val Val Trp Met Phe
3680 3685 3690
Ser Gly Gln Gly Ser Gln Arg Pro Gly Met Gly Ala Gly Leu Tyr
3695 3700 3705
Glu Arg Phe Pro Val Phe Ala Thr Thr Phe Asp Gln Ile Cys Asp
3710 3715 3720
Leu Leu Asp Pro His Leu Pro His Pro Leu Arg Asp Val Val Phe
3725 3730 3735
Asn Pro Asp Pro Asp Gln Pro Asp Leu Leu Asp His Thr Leu Tyr
3740 3745 3750
Thr Gln Thr Ala Leu Phe Ala Leu Gln Val Ser Leu Ala Arg Leu
3755 3760 3765
Leu Asn Gln His Gly His Thr Pro His Thr Leu Ile Gly His Ser
3770 3775 3780
Ile Gly Glu Ile Ala Ala Ala His Ile Ala Gly Ile Leu Asp Leu
3785 3790 3795
Pro Asp Ala Cys Arg Leu Leu Thr Ala Arg Ala Thr Leu Met Asp
3800 3805 3810
Gln Ile Pro Ile Asp Gly Thr Met Thr Ala Ile Gln Ala Thr Tyr
3815 3820 3825
Asp Glu Leu Thr Asp His Leu Gln Thr Val Asp Pro Asp Gly Arg
3830 3835 3840
Arg Ile Ala Ile Ala Ala Leu Asn Thr Pro Asp Ser Thr Val Val
3845 3850 3855
Ser Gly Asp Pro Asp Leu Val Ala Gln Val Ala Asp Val Trp Ala
3860 3865 3870
Gly Lys Gly Arg Lys Thr Arg Arg Leu Lys Val Ser His Ala Phe
3875 3880 3885
His Ser Pro His Met Asp Pro Val Leu Glu Pro Phe Ala Glu Val
3890 3895 3900
Val Arg Gly Leu Thr His Arg Pro Ala Thr Ile Pro Leu Ile Thr
3905 3910 3915
Asn Leu Thr Gly Gly Pro Val Glu Asp Leu Gly Pro Asp Tyr Trp
3920 3925 3930
Ile Gln Gln Val Arg Gln Pro Val Arg Phe Gln Pro Ala Ile Glu
3935 3940 3945
Tyr Leu Ala Arg Ser Ala Thr Pro Pro Ala Ala Tyr Leu Glu Leu
3950 3955 3960
Gly Pro Asp Pro Val Leu Ala Ala Ala Ala Arg His Thr Leu Ala
3965 3970 3975
Ser Val Gly Gly Asp Gly Arg Pro Glu Pro Val Val Ala Ala Thr
3980 3985 3990
Leu Asn His Arg His Ser Asp Val His Ala Leu Thr Asp Ala Leu
3995 4000 4005
Ala Gln Leu His Thr His Val Ala Pro Ile Asp Trp Ser Pro Tyr
4010 4015 4020
Phe Pro Ser Arg Arg Ala Gly Ser Ala Pro Asp Leu Pro Asn Tyr
4025 4030 4035
Ala Phe Gln Arg Arg Ser Tyr Trp Leu Val Asn Glu Pro Glu Lys
4040 4045 4050
Ala Ala Ala Thr Glu Asn Ala Leu Asp Ser Glu Phe Trp Asp Ala
4055 4060 4065
Val Glu Arg Glu Asp Val Glu Ser Leu Ala Arg Thr Leu Gly Ser
4070 4075 4080
Pro Ala Glu Gln Glu Thr Ser Leu Gly Glu Val Leu Pro Ile Leu
4085 4090 4095
Ser Gly Trp Arg Arg Arg His Arg Glu Gln Gly Val Leu Asp Ser
4100 4105 4110
Tyr Arg Tyr Gln Val Ala Trp Thr His Leu Pro Glu Glu Ser Ala
4115 4120 4125
Pro Val Leu Ser Gly Thr Trp Leu Val Leu Val Pro Ala Asn Asp
4130 4135 4140
Ala Glu Gly Pro Ala Ala Asp Leu Ala Val Gln Ala Leu Arg Ala
4145 4150 4155
His Gly Ala Val Pro Asn Val Leu Arg Val Glu Ala Thr Thr Ala
4160 4165 4170
Gly Arg Glu Glu Phe Ala Arg Gln Leu Ala Asp Ala Asp Ala Asp
4175 4180 4185
Ala Pro Leu Glu Gly Val Leu Ser Leu Leu Ala Leu Asp Glu Thr
4190 4195 4200
Pro His Pro Ser His Pro Ala Val Pro Ala Gly Leu Ala Ala Thr
4205 4210 4215
Thr Ala Leu Val Gln Ala Leu Asp Asp Ser Gly Val Pro Ala Arg
4220 4225 4230
Leu Trp Cys Leu Thr Gln Gly Ala Val Ser Val Ser Pro Ser Asp
4235 4240 4245
Pro Leu Leu Asn Pro Leu Gln Ala Glu Thr Trp Gly Leu Gly Arg
4250 4255 4260
Val Ala Ala Leu Glu Tyr Pro Ala Arg Trp Gly Gly Leu Ile Asp
4265 4270 4275
Leu Pro Thr Thr Pro Asp Gln His Thr Pro Ala Arg Leu Ala Ala
4280 4285 4290
Leu Leu Ala Pro Gly Gln Pro Glu Asp Gln Ala Ala Ile Arg Ala
4295 4300 4305
Thr Ala Val Leu Ala Arg Arg Met Arg Arg Ala Pro Asp Arg Thr
4310 4315 4320
Asn Gly Asp Ala Pro Trp Arg Pro Ser Gly Thr Thr Leu Ile Thr
4325 4330 4335
Gly Gly Thr Gly Gly Leu Gly Ala His Leu Ala Arg Trp Leu Ala
4340 4345 4350
His Asn Gly Ala Pro His Leu Leu Leu Thr Ser Arg Arg Gly Pro
4355 4360 4365
Asp Ala Pro Gly Ala Arg Glu Leu Ala Glu Glu Leu Gln Thr Leu
4370 4375 4380
Gly Thr Ala Val Thr Leu Thr Ala Cys Asp Val Ser Asp Arg Ser
4385 4390 4395
Ala Leu Lys Asn Val Ile Asp Gly Val Pro Gly Asp Gln Pro Leu
4400 4405 4410
Asn Ala Val Phe His Val Ala Gly Ile Pro Glu Leu Thr Ala Phe
4415 4420 4425
Ala Asp Leu Asp Val Pro His Ile Gly Glu Val Leu Arg Ser Lys
4430 4435 4440
Ala Leu Ala Ala Asp His Leu His Glu Leu Thr Arg Asp Leu Asp
4445 4450 4455
Leu Thr Ala Phe Val Leu Phe Ser Ser Gly Ala Ala Ala Trp Gly
4460 4465 4470
Ser Gly Gln Gln Ala Ser Tyr Ala Ala Ala Asn Ala Tyr Leu Asp
4475 4480 4485
Ala Leu Ala Glu His Arg His Thr Leu Gly Leu Pro Ala Thr Ser
4490 4495 4500
Ile Ala Trp Gly Pro Trp Ala Gln Ala Gly Met Ala Ala Val Asp
4505 4510 4515
Asp Val Ile Thr Tyr Ile Gly Arg Arg Gly Leu Thr Pro Leu Asp
4520 4525 4530
Pro Asp Leu Ala Leu Lys Ala Leu His Arg Ala Leu Thr Arg Gly
4535 4540 4545
Glu Thr Thr Leu Thr Val Ala Asp Ile Glu Trp Glu Ala Phe Thr
4550 4555 4560
Thr Thr Phe Thr Thr Gln Arg Pro Ser Pro Leu Ile Ala Asp Leu
4565 4570 4575
Ala Pro Ala Arg Pro Asp Ala Ala Ala Asp Val Glu Glu Ala Ala
4580 4585 4590
Glu Gly Gly Gly His Pro Leu Arg Glu Gln Leu Ala Gly Gly Thr
4595 4600 4605
Pro Glu Glu Gln Arg His Ile Leu Leu Arg His Val Gln Ala Arg
4610 4615 4620
Val Ala Ala Val Leu Gly His Pro Asp Pro Asp Ala Ile Pro Ala
4625 4630 4635
Ala Gln Pro Phe Gln Glu Leu Gly Phe Asp Ser Leu Ser Ala Val
4640 4645 4650
Glu Leu Arg Asn Arg Leu Ala Ala Thr Thr Gly Leu Pro Leu Pro
4655 4660 4665
Pro Thr Leu Ile Phe Asp His Pro Ser Cys Gly Ala Leu Ala Asp
4670 4675 4680
Leu Leu Arg Val Gln Leu Leu Asp Val Asp Val Val Asp Glu Gly
4685 4690 4695
Arg Val Leu Ser Gly Leu Asp Gln Trp Asp Ser Ser Cys Asp Pro
4700 4705 4710
Ala Ala Val Asp Gly Ala Ala Arg Arg Arg Val Thr Gln Arg Leu
4715 4720 4725
Glu Ser Leu Leu Ala Lys Trp Asn Ala Thr Gly Asp Glu Asp Gly
4730 4735 4740
Arg Ser Ala Pro Asp His Glu Leu Glu Thr Ala Thr Ala Glu Asp
4745 4750 4755
Ile Phe Asp Leu Ile Ala Asp Glu Phe Gly Lys Ser
4760 4765 4770
<210> 8
<211> 2091
<212> PRT
<213> Artificial
<220>
<223> madA8基因编码的氨基酸序列
<400> 8
Met Ser Asn Glu Glu Lys Leu Leu Asp His Leu Lys Trp Val Thr Ala
1 5 10 15
Glu Leu Arg Gln Ala Arg Arg Arg Leu Arg Glu Ala Glu Ser Ala Glu
20 25 30
Pro Glu Pro Ile Ala Val Val Gly Phe Ala Cys Arg Tyr Pro Gly Gly
35 40 45
Val Arg Ser Pro Glu Asp Leu Trp Arg Leu Val Ala Glu Gly Arg Asp
50 55 60
Ala Ile Thr Pro Phe Pro Ala Asp Arg Glu Trp Asp Val Glu Arg Ile
65 70 75 80
Tyr Asp Pro Asp Pro Asp Lys Pro Gly Arg Thr Tyr Val Thr Glu Gly
85 90 95
Gly Phe Cys Asp Asp Val Pro Gly Phe Asp Ala Ala Phe Phe Asp Ile
100 105 110
Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Leu Leu
115 120 125
Glu Thr Ala Trp Glu Thr Phe Glu Arg Ala Gly Leu Asp Arg Asp Ala
130 135 140
Leu His Gly Ser Glu Thr Gly Val Phe Ala Gly Val Ser Ser His Asp
145 150 155 160
Tyr Leu Ser Leu Ile Gly Arg Thr Thr Ser Asp Val Gly Gly Tyr Ala
165 170 175
Gly Thr Gly Asn Val Gly Ser Val Ala Ser Gly Arg Ile Ala Tyr Thr
180 185 190
Trp Gly Leu Glu Gly Pro Ala Val Ser Val Asp Thr Ala Cys Ser Ser
195 200 205
Ser Leu Thr Ala Ile His Leu Ala Cys Gln Ser Leu Arg Gln Arg Glu
210 215 220
Cys Thr Met Ala Leu Ala Gly Gly Val Ser Val Leu Ala Thr Pro Asp
225 230 235 240
Ala Phe Ile Glu Phe Ser Arg Gln Arg Ala Asn Ala Pro Asp Gly Arg
245 250 255
Cys Lys Ser Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Ser Glu Gly
260 265 270
Val Gly Leu Val Leu Leu Glu Pro Leu Ser Asp Ala Gln Arg Asn Gly
275 280 285
Arg Arg Ile Leu Gly Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly
290 295 300
Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Glu Arg Val
305 310 315 320
Ile Arg Gln Ala Leu Val Arg Ala Gly Leu Ala Ala Ser Asp Val Asp
325 330 335
Ala Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu
340 345 350
Ala Lys Ala Leu Leu Ala Thr Tyr Gly Gln Gly Arg Pro Asp Asp Arg
355 360 365
Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Ser Gln Gly
370 375 380
Ala Ala Gly Val Ala Gly Val Ile Lys Met Leu Met Ala Met Arg Asn
385 390 395 400
Gly Leu Leu Pro Ala Ser Leu His Ile Asp Glu Pro Ser Pro His Val
405 410 415
Asp Trp Asp Ala Gly Ala Val Arg Leu Leu Thr Asp Pro Val Glu Trp
420 425 430
Pro Arg Gly Asp Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Ile
435 440 445
Ser Gly Thr Asn Ala His Leu Ile Leu Glu Glu Pro Pro Gln Pro Asp
450 455 460
Glu Val Pro Ala Gly Gln Ala Pro Arg Ala Glu Val Val Asp Gly Val
465 470 475 480
Val Pro Trp Ala Leu Ser Ala Arg Asn Ala Pro Ala Leu Arg Asp Gln
485 490 495
Ala Thr Ala Leu Val Thr His Leu Glu Gln Arg Glu Asp Phe Ser Pro
500 505 510
Val Glu Val Gly Trp Ser Leu Ala Thr Thr Arg Ser Thr Phe Glu His
515 520 525
Arg Ala Val Val Ile Gly Gln Asp Arg Asp Gln Leu Met Asp Gly Leu
530 535 540
Arg Ala Leu Ala Asp Gly Arg Thr His Pro Cys Leu Val Asn Pro Gly
545 550 555 560
Thr Pro Ala Ala Val Ser Gln Thr Gly Pro Val Leu Val Phe Pro Gly
565 570 575
Gln Gly Ser Gln Trp Pro Gly Met Gly Ala Asp Leu Leu Asp Thr Ser
580 585 590
Pro Val Phe Ala Ala Arg Ile Ala Glu Cys Glu Gln Ala Leu Ala Pro
595 600 605
His Ile Asp Trp Ser Leu Thr Asp Val Leu Arg Gly Thr Glu Gly Ala
610 615 620
Ala Asp Leu Asn Arg Val Asp Val Val Gln Pro Val Leu Trp Ala Thr
625 630 635 640
Met Val Ser Leu Ala Ala Val Trp Thr Asp His Gly Ile Thr Pro Ala
645 650 655
Ala Val Ile Gly His Ser Gln Gly Glu Ile Ala Ala Ala Cys Ile Ala
660 665 670
Gly Ala Leu Thr Leu Thr Asp Ala Ala Thr Ile Thr Ala Leu Arg Ser
675 680 685
Lys Ala Leu Arg Asn Leu Ala Gly His Gly Ala Met Ala Ser Leu Gly
690 695 700
Val Thr Pro Glu Glu Ala Gly Glu Leu Leu Glu Gly Asp Val Thr Val
705 710 715 720
Ala Ala Val Asn Gly Pro Arg Ser Val Val Val Ser Gly Gly Pro Asp
725 730 735
Ala Val Ala Ala Ile Val Ala Lys Val Glu Ala Asn Gly Gln Arg Ala
740 745 750
Arg Met Ile Asp Val Asp Tyr Ala Ser His Gly Pro Gln Ile Asp Arg
755 760 765
Ile Thr Gly Glu Leu Arg Glu Ile Leu Ala Gly Val Arg Pro Ser Glu
770 775 780
Ala Arg Ile Pro Phe Tyr Ser Thr Val Thr Ala Ala Arg Ile Asp Ala
785 790 795 800
Ala Glu Leu Asp Thr Glu Tyr Trp Val Thr Asn Leu Arg Arg Pro Val
805 810 815
Arg Phe Ala Asp Thr Val Arg Ala Leu Leu Ala Asp Gly His Arg Asp
820 825 830
Phe Ile Glu Ala Ser Pro His Pro Ile Leu Thr Leu Ala Leu Glu Gln
835 840 845
Ile Thr Glu Asp Ala Asp Thr Arg Thr Thr Val Thr Ser Thr Leu Arg
850 855 860
Arg Asp His Ala Asp His Thr Gln Leu Val His Ala Leu Ala Arg Leu
865 870 875 880
His Thr Ala Ala Ala Pro Ala Asp Trp Thr Arg His Tyr Pro Ala Asp
885 890 895
Pro Pro Pro Ala Val Thr Asp Leu Pro Thr Tyr Pro Phe Gln His Gln
900 905 910
Arg Phe Trp Leu Asp Thr Arg Gly Trp Leu Gly Gly Glu Pro Gly Thr
915 920 925
Leu Gly Leu Ala Pro Ala Gly His Pro Leu Leu Gly Ala Ala Ile Glu
930 935 940
Gln Ala Glu Gly Asp Ala Tyr Leu Leu Thr Gly Arg Ile Ser Gln Gly
945 950 955 960
Thr Glu Pro Trp Leu Ala Asp His Lys Val Leu Gly Ala Val Leu Leu
965 970 975
Pro Gly Ala Ala Phe Ala Asp Leu Ala Val His Ala Ala Ala Arg Thr
980 985 990
Gly Cys Asp His Val Ala Glu Leu Val Leu His Glu Pro Leu Arg Leu
995 1000 1005
Pro Glu Asp Gly Ala Val Asp Leu Gln Val Ala Val Gly Pro Ala
1010 1015 1020
Asp Gly Glu Gly Gly Gly Arg Pro Val Ala Ile His Ser Arg Pro
1025 1030 1035
Ala Ala Asp Ala Asp Asp Gly Glu Ser Ala Trp Thr Arg His Ala
1040 1045 1050
Thr Gly Leu Leu Ala Asn Gly Pro Gly Ala Ser Thr Ala Thr Pro
1055 1060 1065
Ser Glu Pro His Pro Leu Asp Gly Ala Trp Pro Pro Pro Gly Ala
1070 1075 1080
Glu Pro Leu Ala Thr Glu His Leu Tyr Asp Asp Leu Ala Asp Arg
1085 1090 1095
Gly Ser Ala Tyr Gly Thr Ala Phe Gln Gly Leu Thr Ala Ala Trp
1100 1105 1110
Arg Leu Asp Glu His Leu Tyr Ala Glu Val Val Leu Pro Glu Asp
1115 1120 1125
Gly Thr Asp Arg Ala Gly Ala Asp Gly Ser Gly Thr Ala Gly Tyr
1130 1135 1140
Gly Ile His Pro Ala Leu Leu Asp Ala Ala Leu Gln Ala Cys Thr
1145 1150 1155
Leu Gly Ala Asp Gly Ala Asp Gly Asp Ser Gly Ala Val Met Leu
1160 1165 1170
Pro Phe Ala Trp Ser Gly Leu Arg Val His Ala Thr Gly Ala Ala
1175 1180 1185
Ser Leu Arg Val Arg Val Thr Pro Thr Ala Ala Asp Arg Leu Thr
1190 1195 1200
Ile Ala Ala Ala Asp Ala Ser Gly Ala Pro Val Leu Ser Leu Asp
1205 1210 1215
Asp Leu Thr Leu Arg Ser Val Glu Ala Asp Ala Ala Ala Gln Ala
1220 1225 1230
Arg Leu Ala Ala Arg Asn Ser Leu Phe Ala Met Asp Trp Thr Pro
1235 1240 1245
Leu Pro Asp Pro Glu Gly Thr Pro Pro Ala Arg Leu Ala Val Val
1250 1255 1260
Ala Leu Asp Ala Ala Ser Gly Ser Asp Ala Glu Leu Ala Asp Ala
1265 1270 1275
Leu Gly Gly Ala Leu Pro Gly Thr Glu Arg His Pro Asp Leu Ser
1280 1285 1290
Ala Leu Leu Thr Ala Ala Val Asp Gly Thr Pro Leu Pro Asp Thr
1295 1300 1305
Val Leu Ala Val Cys Thr Ala Ser Gly Ala Ala Gly Pro Val Glu
1310 1315 1320
Glu Arg Leu His Ala Val Thr Ser Gly Val Leu Ala Leu Leu Gln
1325 1330 1335
Glu Trp Pro Ser Asp Glu Ser Thr Ser Arg Leu Val Val Val Thr
1340 1345 1350
Arg Gly Ala Val Ala Thr Gly Pro Gly Asp Gly Val Asp Asp Leu
1355 1360 1365
Pro Ala Ala Ala Val Trp Gly Leu Val Arg Ser Ala Gln Ser Glu
1370 1375 1380
Asn Pro Gly Arg Ile Val Leu Leu Asp Leu Asp Gly Pro Asp Ala
1385 1390 1395
Ser Ser His Ala Val Pro Ala Ala Leu Ala Ser Gly Glu Pro Gln
1400 1405 1410
Ile Ala Leu Arg Asp Gly Arg Ala Tyr Ala Pro Arg Leu Val Arg
1415 1420 1425
Tyr Asp Pro Gly Asp Arg Leu Ser Pro Pro Pro Gly Ala Glu Ala
1430 1435 1440
Trp Arg Leu Val Tyr Gly Ala Asp Gly Glu Ala Ala Leu Val Pro
1445 1450 1455
Asp Pro Glu His Gly Arg Pro Leu Ala Pro Gly Glu Val Arg Val
1460 1465 1470
Ala Leu Arg Ala Val Ala Val Ser Ala Gln Ala Ala Gly Pro Gly
1475 1480 1485
Ala Pro Asp Glu Val Arg Asp Val Arg Gly Asp Gly Ala Gly Val
1490 1495 1500
Val Ala Gly Val Gly Ala Asp Val Asp Gly Phe Ala Val Gly Asp
1505 1510 1515
Arg Val Met Gly Ala Phe Asp Ala Val Gly Pro Val Ala Leu Thr
1520 1525 1530
Asp His Arg Leu Val Val Pro Met Pro Ala Gly Trp Ser Tyr Ala
1535 1540 1545
Glu Ala Ala Gly Ala Val Ser Thr Tyr Leu Pro Ala Tyr His Pro
1550 1555 1560
Thr Pro Ala Asp Leu Asp Gly Thr Gly Arg Gly Glu Gly Gly Arg
1565 1570 1575
Val Glu Arg Val Arg Ser Thr Leu Ser Asp Leu Ala Ala Leu Phe
1580 1585 1590
Glu Asp Gly Thr Leu Pro Pro Ala Pro Val Thr Val Arg Asp Val
1595 1600 1605
Tyr Asp Val Arg Glu Ala Leu Arg Arg Ala Asp Gly Ala Ala Gly
1610 1615 1620
Arg Thr Val Leu Ser Leu Pro Pro Pro Leu Asp Pro Asp Gly Thr
1625 1630 1635
Val Leu Ile Thr Gly Gly Thr Gly Ala Leu Gly Ala Ala Thr Ala
1640 1645 1650
Arg His Leu Val Ala Glu Arg Gly Val Arg Arg Leu Leu Leu Ala
1655 1660 1665
Ser Arg Arg Gly Pro Glu Ala Pro Gly Ala Asp Ala Leu Ala Ala
1670 1675 1680
Glu Leu Ala Ala Leu Gly Ala Glu Val Ala Val Ala Ala Cys Asp
1685 1690 1695
Thr Gly Asp Arg Ala Ala Leu Ala Asp Leu Leu Ala Ser Val Pro
1700 1705 1710
Ala Arg His Pro Leu Thr Ala Val Val His Thr Ala Gly Ile Val
1715 1720 1725
Gln Asp Ala Thr Ile Arg Thr Ala Thr Pro Asp Gln Leu Asp Ala
1730 1735 1740
Val Leu Arg Val Lys Ala Asp Gly Ala Trp His Leu His Glu Leu
1745 1750 1755
Thr Arg Asp Leu Asn Leu Ala Ala Leu Val Leu Phe Ser Ser Val
1760 1765 1770
Thr Gly Leu Ala Gly Gly Pro Gly Gln Gly Ser Tyr Ser Ala Ala
1775 1780 1785
Asn Val Phe Leu Asp Ala Leu Ala Gln His Arg His Ala Arg Gly
1790 1795 1800
Leu Pro Ala Thr Ser Leu Ala Trp Gly Phe Trp Glu Met Asp Thr
1805 1810 1815
Gly Met Ser Gly Arg Phe Thr Asp Ile Asp Leu Ala Arg Asn Ala
1820 1825 1830
Arg Ser Gly Asp Leu Gly Leu Ser Ala Glu Arg Ala Leu Ala Leu
1835 1840 1845
Phe Asp Ala Ala Leu Gly Leu Gly Arg Pro Leu Leu Ala Pro Val
1850 1855 1860
Arg Leu Asp Leu Pro Gly Leu Arg Arg Arg Thr Ala Gly Gly Glu
1865 1870 1875
Val Pro Ala Phe Leu Arg Val Leu Leu Arg Gly Ser Ala Pro Arg
1880 1885 1890
Pro Gly Gly Thr Ala Ala Gly Pro Ser Leu Ala Arg Thr Leu Ala
1895 1900 1905
Ala Leu Asn Asp Ala Asp Arg Arg Glu Ala Leu Leu Asp Leu Val
1910 1915 1920
Arg Thr Gln Ala Ala Thr Val Val Ala His Glu Ser Ala Gly Ser
1925 1930 1935
Ile Pro Ala Glu Gln Asn Phe Arg Glu Leu Gly Phe Asp Ser Leu
1940 1945 1950
Thr Gly Val Glu Leu Arg Asn Arg Leu Ser Ala Ala Thr Gly Val
1955 1960 1965
Arg Leu Pro Ala Thr Leu Ile Phe Asp His Pro Thr Pro Ala Ala
1970 1975 1980
Val Ala Gly Leu Leu Ala Glu Glu Val Ala Pro Asp Gly Ala Gly
1985 1990 1995
Pro Glu Pro Ala Ala Leu Val Ala Glu Leu Asp Arg Ile Glu Ala
2000 2005 2010
Ala Val Ala Ala Leu Pro Asp Gly Asp Glu Gly Arg Gly Leu Val
2015 2020 2025
Ala Ala Arg Leu Asp Ala Leu Leu Arg Ser Val Thr Gly Val Pro
2030 2035 2040
Asp Gly Ala Ala Pro Asp Asp Gly Pro Gly Arg Asp Asp Leu Asp
2045 2050 2055
Ser Ala Thr Asp Asp Glu Leu Phe Asp Leu Leu Asp Asn Glu Leu
2060 2065 2070
Thr Gly Leu Glu Arg Asp Arg Pro Ala Asp His Asp Ala Pro Ser
2075 2080 2085
Glu Arg Gly
2090
<210> 9
<211> 1868
<212> PRT
<213> Artificial
<220>
<223> madA9基因编码的氨基酸序列
<400> 9
Met Thr Asn Asp Glu Lys Leu Arg Arg Tyr Leu Lys His Ala Thr Ala
1 5 10 15
Asp Leu Arg Gln Val Arg Arg Glu Leu Arg Glu Glu Arg Asp Arg Ala
20 25 30
Arg Glu Pro Ile Ala Ile Val Gly Met Ala Cys Arg Phe Pro Gly Gly
35 40 45
Val Arg Gly Ala Asp Asp Leu Trp Arg Leu Val Ala Asp Gly Ala Asp
50 55 60
Ala Ile Ser Ala Phe Pro Glu Asp Arg Gly Trp Pro Leu Glu Glu Leu
65 70 75 80
Gly Ala Pro Gly Gly Gly Pro Gly Thr Cys His Thr Leu Glu Gly Gly
85 90 95
Phe Leu His Asp Ala Pro Arg Phe Asp Ala Ala Phe Phe Gly Ile Ser
100 105 110
Pro Arg Glu Ala Leu Gly Met Asp Pro Gln Gln Arg Leu Leu Leu Glu
115 120 125
Thr Ala Trp Glu Thr Phe Glu Asp Ala Gly Leu Arg Arg Asp Ala Leu
130 135 140
Arg Gly Ser Asp Thr Gly Val Phe Ala Gly Ile Ala Ser Gln Asp Tyr
145 150 155 160
Leu Ala Leu Ala Gly Gly Thr Ala Ala Glu Val Glu Gly His Leu Ala
165 170 175
Thr Gly Asn Ala Gly Ser Ala Val Ser Gly Arg Leu Ala Tyr Thr Phe
180 185 190
Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser
195 200 205
Leu Val Ala Ala His Leu Ala Ala Arg Ala Leu Arg Gln Gly Glu Cys
210 215 220
Ser Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Gly Leu
225 230 235 240
Phe Leu Glu Phe Thr Arg Gln Arg Gly Leu Ala Ala Asp Gly Arg Cys
245 250 255
Lys Thr Phe Ala Ala Ser Ala Asp Gly Thr Ser Leu Ala Glu Gly Ala
260 265 270
Gly Leu Leu Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly Arg
275 280 285
Arg Val Leu Ala Val Ile Arg Gly Ser Ala Val Asn Gln Asp Gly Ala
290 295 300
Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ser Gln Gln Arg Val Ile
305 310 315 320
Arg Arg Ala Leu Ala Ala Ala Gly Leu Ser Pro Ala Asp Val Asp Ala
325 330 335
Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala
340 345 350
Gln Ala Val Leu Ala Ala Tyr Gly Arg Asn Arg Pro Ala Glu Arg Pro
355 360 365
Leu Arg Leu Gly Ser Leu Lys Ser Asn Ile Gly His Thr Gln Ala Ala
370 375 380
Ala Gly Val Ala Gly Val Ile Lys Met Val Met Ala Met Arg Asn Asp
385 390 395 400
Leu Leu Pro Ala Ser Leu His Ile Asp Glu Pro Asn Pro His Val Asp
405 410 415
Trp Glu Ala Gly Ala Val Ser Leu Leu Thr Glu Pro Val Pro Trp Pro
420 425 430
Arg Gly Glu Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Ile Ser
435 440 445
Gly Thr Asn Ala His Leu Val Leu Glu Glu Ala Pro Pro Pro Glu Asp
450 455 460
Pro Glu Pro Ala Thr Thr Ala Pro Pro Asp Ala Gly Glu Gly Val Leu
465 470 475 480
Pro Trp Val Leu Ser Ala Arg Gly Glu Asp Ala Leu Arg Ala Gln Ala
485 490 495
Ser Ala Leu Ala Asp His Ala Ala Gly Ala Pro Leu Glu Ser Pro Ala
500 505 510
Ala Val Gly Leu Ala Leu Ala Thr Thr Arg Thr Ala Phe Glu His Arg
515 520 525
Ala Val Val Val Gly Glu Gly Arg Gly Glu Leu Leu Ala Ala Thr Glu
530 535 540
Ala Leu Ala Arg Gly Asp Ala His Pro Gly Val Val Val Ala Gly Ala
545 550 555 560
Gly Glu Ala Gly Thr Gly Thr Ala Ala Gly Lys Thr Val Phe Leu Phe
565 570 575
Ser Gly Gln Gly Ser Gln Arg Pro Gly Met Gly Ala Gly Leu Tyr Glu
580 585 590
Arg Phe Pro Ala Phe Ala Ala Ala Phe Asp Glu Val Cys Ala Leu Leu
595 600 605
Asp Glu His Leu Asp Arg Pro Val Arg Asp Val Val Phe Ala Asp Arg
610 615 620
Ser Gly Ala Leu Asp His Thr Thr Tyr Ala Gln Ala Gly Leu Phe Ala
625 630 635 640
Leu His Val Ala Leu Ala Arg Leu Leu Gly Ala Ala Gly Val Arg Pro
645 650 655
Asp Val Val Ile Gly His Ser Ile Gly Glu Ile Ala Ala Ala His Val
660 665 670
Ala Gly Val Phe Asp Leu Pro Asp Ala Cys Arg Leu Val Ala Ala Arg
675 680 685
Ala Thr Leu Met Gly Arg Leu Pro Glu Gly Gly Ala Met Ala Thr Val
690 695 700
Ala Ala Thr Pro Glu Glu Leu Ser Pro Asp Leu Asp Ala His Gly Gly
705 710 715 720
Arg Val Ala Val Ala Ala Leu Asn Thr Pro Gly Asn Thr Val Ile Ser
725 730 735
Gly Ala Ala Gly Pro Val Ala Glu Ile Ala Glu Ala Trp Ala Glu Arg
740 745 750
Gly Arg Lys Thr Arg Ala Leu Thr Val Ser His Ala Phe His Ser Pro
755 760 765
Leu Met Glu Pro Val Leu Ala Pro Phe Glu Glu Ala Val Arg Asp Leu
770 775 780
Ala Tyr Arg Pro Pro Ala Val Pro Leu Ile Ser Asn Leu Thr Gly Glu
785 790 795 800
Pro Ala Asp Glu Arg Ile Ala Thr Pro Gly Tyr Trp Ala Glu His Ile
805 810 815
Arg Arg Pro Val Arg Phe His Pro Ala Val Ala His Val Ala Pro Asp
820 825 830
Ala Gly Thr Phe Val Glu Leu Gly Pro Asp Ser Thr Leu Thr Ala Ala
835 840 845
Ala Arg Arg Thr Leu Glu His Val Arg Pro Asp Gly Pro Ala Pro Arg
850 855 860
Thr Val Ala Thr Leu Ser Ser Arg Gln Pro Asp Ala His Ala Phe Val
865 870 875 880
Arg Ala Leu Ala Arg Leu His Ile Ala Gly Thr Asp Val Asp Trp Thr
885 890 895
Ala Cys Phe Pro Asp Gly Ser Ala Pro Glu Thr Val Arg Leu Pro Thr
900 905 910
Tyr Ala Phe Gln Arg Glu Arg Tyr Trp Leu Gly Asp Ala Gly Gly Gly
915 920 925
Ala Gly Asp Val Ser Ala Ala Gly Ala Arg Arg Val Arg His Pro Phe
930 935 940
Phe Ala Ala Ala Val Glu Leu Ala Asp Gly Gly Leu Ala Leu Asn Gly
945 950 955 960
Thr Ile Ser Ala Asn Gly Ala Ala Trp Thr Thr Gly His Ala Val Ala
965 970 975
Gly Leu Pro Ile Val Pro Ser Ser Ala Leu Val Glu Trp Ala Leu Leu
980 985 990
Ala Ala Asp Glu Ala Gly Cys Gly Gly Val Glu Glu Leu Val Val Arg
995 1000 1005
Asp Pro Leu Val Pro Pro Ala Arg Gly Arg Gly Pro Arg Val Gln
1010 1015 1020
Val Thr Val Gly Ala Ala Arg Glu Asp Gly Arg Arg Asp Val Arg
1025 1030 1035
Val His Ser Arg Pro Asp His Gly Pro Arg Thr Ala Asp Asp Pro
1040 1045 1050
Ala Trp Thr Cys His Ala His Gly Thr Leu Ala Pro Pro Ala Pro
1055 1060 1065
Gly Pro Ala Ala Pro Ala Pro Glu Asp Glu Arg Ala Arg Ala Trp
1070 1075 1080
Pro Pro Pro Glu Ala Val Pro Val Asp Leu Asp Gly Leu Arg Gln
1085 1090 1095
Arg Thr Val Ala Ala Gly Tyr Glu Asp Gly Pro Ala Phe Asp Gly
1100 1105 1110
Ile Arg Ala Val Trp Arg Asp Gly Ser Asp Leu Leu Ala Glu Val
1115 1120 1125
Ala Leu Pro Asp Glu Ala Gly Gly His Asp Gly Tyr Gly Leu His
1130 1135 1140
Pro Val Leu Leu Glu Ala Ala Leu Arg Pro Ala Leu Leu Asp Val
1145 1150 1155
Pro Asp Asp Ala Arg Ala Arg Glu Asp Asp Val Trp Leu Pro Ser
1160 1165 1170
Ala Trp Asn Asp Val Ala Leu Trp Ala Ser Glu Ala Thr Thr Val
1175 1180 1185
Arg Val Arg Leu Ser Pro Thr Gly Asp Pro Gly Arg Asn Ala Arg
1190 1195 1200
Arg Val Arg Val Thr Leu Thr Asp Pro Ala Gly Ala Pro Val Leu
1205 1210 1215
Thr Val Gly Ser Val Thr Leu Glu Pro Val Ala Val Glu Asp Leu
1220 1225 1230
Arg Ala Ser Gly Gly Gly Arg Thr Asp Gly Leu Phe Thr Leu Asp
1235 1240 1245
Trp Thr Pro Ala Pro Ala Pro Thr Glu Asp Ala Ser Gly Arg Tyr
1250 1255 1260
Ile Glu Leu Asp Ala Leu Arg Thr Ala Leu Asp Ala Asp Pro Gly
1265 1270 1275
Ala Pro Ala Pro Pro Val Val Leu Thr Arg Ala Pro Thr Ala Ser
1280 1285 1290
Gly Gly Asp Ala Arg Arg Ala Ala Glu Ala Ala Ser Thr Leu Val
1295 1300 1305
Arg Asp Trp Leu Ala Glu Pro Arg Leu Ser Gly Ser Arg Leu Val
1310 1315 1320
Val Val Thr Arg Gly Ala Val Ser Val Asp Gly Gly Glu Arg Asp
1325 1330 1335
Val Pro Ser Leu Ala Asp Ala Ala Val Trp Gly Val Val His Ser
1340 1345 1350
Ala Gln Ala Glu Arg Pro Asp Arg Phe Val Leu Leu Asp Thr Asp
1355 1360 1365
Ser Asp Pro Glu Glu Ala Arg Leu Arg Glu Ala Val Ala Ala Ala
1370 1375 1380
Leu Ala Gln His Glu Pro Gln Val Ala Ile Arg Ser Ala Gln Val
1385 1390 1395
Leu Val Pro Arg Leu Ala Arg Ala Ala Ser Pro Gly Pro Arg Ala
1400 1405 1410
Thr Arg Ala Asp Gly Pro Phe Ala Val Asp Gly Thr Val Leu Ile
1415 1420 1425
Thr Leu Asp Gly Gly Pro Arg Gly Ala Leu Val Ala Glu His Ala
1430 1435 1440
Val Arg Ala Trp Gln Ala Lys His Leu Leu Leu Leu Arg Pro Asp
1445 1450 1455
Gly Pro Gly Pro Gly His Glu Ser Pro Gly Val Arg Asp Ala Ala
1460 1465 1470
Glu Arg Ala Ala Ala Leu Gly Ala Asp Val Arg Val Ala Ala Cys
1475 1480 1485
Asp Leu Thr Asp Ala Ser Ala Val Ala Asp Val Val Ala Ala Ile
1490 1495 1500
Asp Glu Ala Arg Pro Leu Ala Ala Val Val His Ala Ala Cys Pro
1505 1510 1515
Pro Asp Glu Ala Arg Thr Asp Gly Ile Ala Thr Ala Tyr Ala Leu
1520 1525 1530
His Ala Ala Thr Ala Asp Leu Pro Leu Glu Ala Phe Val Leu Phe
1535 1540 1545
Ser Ser Ala Ala Ala Thr Leu Gly Ala Ser Gly Gly Asp Ala Ala
1550 1555 1560
Gly Ala Phe Asn Asp Ala Leu Ala Ala His Arg Arg Ala Ala Gly
1565 1570 1575
Arg Thr Ala Leu Ser Ile Ala Trp Gly Pro Trp Leu Asp Asp Thr
1580 1585 1590
Ala Arg Pro Ser Ala Ala Ala Pro Gly Val Gly Ala Leu Thr Ala
1595 1600 1605
Glu His Ala Leu Ala Leu Leu Glu Thr Ala Arg His Ser Pro Thr
1610 1615 1620
Pro Arg Leu Val Ala Ala Asn Leu Asp Val Arg Ala Leu Ala Ser
1625 1630 1635
Ala His Ala Thr Glu Ala Asp Ala Pro Pro Ala Pro Leu Arg Ala
1640 1645 1650
Leu Ala Gly Arg Thr Pro Ala Arg Pro Ser Ala Ala Thr Gly Arg
1655 1660 1665
Gly Ala Ala Ser Leu Ala Glu Arg Leu Ala Ala Ala Pro Ala Asp
1670 1675 1680
Gln Arg Gly Pro Leu Leu Leu Asp Leu Val Arg Thr Asn Ala Ala
1685 1690 1695
Ala Val Leu Gly His Ala Asp Pro Gly Leu Val Pro Val Gly Arg
1700 1705 1710
Gly Phe Ser Glu Met Gly Phe Asp Ser Ile Thr Ala Thr Asp Leu
1715 1720 1725
Ala Gly Arg Leu Ala Gly Ala Thr Gly Leu Ala Val Pro Thr Thr
1730 1735 1740
Ala Val Phe Asp His Pro Ser Pro Ala Ala Leu Ala Glu His Leu
1745 1750 1755
His Ala Glu Leu Ala Pro Arg Gly Pro Asp Pro Asp Glu Asp Gly
1760 1765 1770
Thr Pro Ala Ser Leu Leu Ala Arg Leu Glu Arg Met Asp Asn Ala
1775 1780 1785
Met Ala Ala Ala Ser Pro Ala Gly Ala Ala Val Asn Gly Asp Ala
1790 1795 1800
Ser Val Arg Thr Ala Leu Arg Arg Arg Leu Gln Thr Met Leu Ala
1805 1810 1815
Arg Leu Asp Asp His Pro Gly Gly Ala Pro Asp Gly Pro Ser Gly
1820 1825 1830
Glu Pro Ser Ala Glu Pro Ser Gly Met Gly Ala Glu Arg Ile Gly
1835 1840 1845
Thr Ala Thr Val Asp Glu Leu Leu Ala Phe Ile Asp His Asp Leu
1850 1855 1860
Gly Arg Asn Arg Arg
1865
<210> 10
<211> 830
<212> PRT
<213> Artificial
<220>
<223> madA10基因编码的氨基酸序列
<400> 10
Val Thr Asp Pro Thr Ser Gly Gly Ala Pro Arg Ser Pro Ala Asn Gly
1 5 10 15
Gly Ala Asp Ala Arg Thr Gly Asn Gly Ala Gly Asn Gly Ala Gly Gly
20 25 30
Ser Ala Gly Gly Gly Ala Gly Arg Glu Ala Glu Leu Val Glu Tyr Leu
35 40 45
Arg Trp Thr Thr Ala Glu Leu His Glu Ala Arg Arg Arg Leu Arg Arg
50 55 60
Ala Glu Asp Ala Gly Arg Glu Pro Val Ala Val Val Gly Met Ala Cys
65 70 75 80
Arg Phe Pro Gly Gly Ala Arg Thr Pro Asp Arg Phe Trp Asp Leu Leu
85 90 95
Ala Glu Gly Arg Glu Ala Val Thr Gly Phe Pro Ala Asp Arg Gly Trp
100 105 110
Asp Leu Asp Ala Leu Phe His Pro Asp Pro Asp His His Gly Thr Thr
115 120 125
Tyr Ala Arg Gly Gly Gly Phe Leu Asp Asp Ala Ala Asp Phe Asp Ala
130 135 140
Ala Phe Phe Gly Ile Gly Asp Glu Glu Ala Glu Ala Ile Glu Pro Gln
145 150 155 160
Gln Arg Leu Leu Leu Glu Thr Ala Trp Glu Ala Val Glu Ser Ala Gly
165 170 175
Ile Asp Pro Arg Thr Leu Arg Gly Ser Ala Thr Ala Val Tyr Ala Gly
180 185 190
Val Ala Leu His Asp Tyr Gly Thr Arg Leu Arg Arg Val Pro Ala Ala
195 200 205
Leu Met Pro Tyr Leu Gly Thr Gly Asn Ala Gly Gly Gly Ala Pro Gly
210 215 220
Arg Val Ala Phe Thr Leu Gly Leu Val Gly Pro Ala Val Ala Leu Asp
225 230 235 240
Ser Ala Cys Ser Ser Ser Leu Thr Ala Met His Leu Ala Ser Arg Ala
245 250 255
Leu Arg Asp Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Ala Ala Val
260 265 270
Met Tyr Thr Pro Ser Thr Phe Leu Leu Ser Ser Ser Gln Arg Gln Leu
275 280 285
Ala Pro Asp Ala Arg Cys Lys Pro Phe Ala Ala Ala Ser Asp Gly Met
290 295 300
Val Trp Gly Glu Gly Ala Gly Leu Val Leu Leu Glu Arg Leu Ala Asp
305 310 315 320
Ala Arg Arg Asn Gly His Arg Val Leu Ala Leu Val Arg Gly Thr Ala
325 330 335
Val Asn Gln Asp Gly Ala Val Ser Gly Met Ala Ala Pro His Gly Pro
340 345 350
Gly Arg Gln Arg Leu Met Arg Glu Ala Leu Ala Asp Ala Arg Leu Ser
355 360 365
Pro Ala Asp Val Asp Val Val Glu Ala His Gly Thr Gly Thr Ala Ile
370 375 380
Gly Asp Ala Ile Glu Ala Gln Ala Val Leu Ser Val Tyr Gly Gln Asp
385 390 395 400
Arg Pro Ala Asp Arg Pro Leu Trp Leu Gly Thr Val Lys Pro Asn Ile
405 410 415
Gly His Ser Gln Ala Ala Ala Gly Met Ala Ser Val Ile Lys Ala Val
420 425 430
Leu Ala Met Arg His Ala Thr Leu Pro Ala Thr Leu Asn Ile Asp Arg
435 440 445
Pro Thr Pro Leu Val Ser Trp Arg Ala Gly Ala Val Arg Leu Val Thr
450 455 460
Glu Pro Thr Pro Trp Pro Arg Gly Gly Arg Pro Arg Arg Ala Gly Val
465 470 475 480
Ser Ala Phe Gly Asn Ser Gly Thr Asn Ala His Val Ile Leu Glu Glu
485 490 495
Pro Pro Pro Glu Pro Pro Glu Pro Arg Pro Pro Ser Pro Pro Pro Gly
500 505 510
Gly Thr Leu Ala Trp Pro Val Ser Ala Arg Gly Ala Ala Ala Leu Arg
515 520 525
Ala Gln Ala Ala Ala Leu Ala Ala His Val Gly Asn Asp Pro Asp Leu
530 535 540
Ala Pro Glu Asp Val Ala Trp Ser Leu Ala Ala Thr Arg Ser Val Phe
545 550 555 560
Glu Asp Arg Ala Val Val Val Gly Ala His Arg Ala Glu Leu Leu Ala
565 570 575
Gly Leu His Ala Leu Ala Asn Gly Thr Pro His Glu Asn Val Ile Gly
580 585 590
Gly Thr Gly Ala Thr Ala Gly Pro Asp Ala Asp Ser Asp Ala Gly Thr
595 600 605
Trp Ala Phe Gly Gly Ser Ala Glu Arg Ala Ala Gln Ala Thr Thr Ala
610 615 620
Leu Arg Glu Arg Phe Pro Ala Phe Ala Ala Ala His Asp Glu Val Ser
625 630 635 640
Glu Arg Phe Gly Gly Ala Leu Gly Gly Asp Ala Ala Thr Phe Ala Leu
645 650 655
His Val Ala Leu Ala Arg Leu Leu Leu Ala Ala Glu Leu Arg Pro Gly
660 665 670
Ala Val Val Gly Arg Gly Val Gly Glu Val Ser Ala Ala His Leu Ala
675 680 685
Gly Ala Leu Asp Leu Pro Asp Ala Cys Arg Leu Val Ala Gln Arg Asp
690 695 700
Ala Pro Gly Ser Asp Ala Leu Ala Leu Thr Gly Gly Lys Pro Thr Val
705 710 715 720
Pro Val Leu Asp Ala Gly Ser Ala Pro Phe Ala Glu Ala Gly Ala Phe
725 730 735
Leu Asp Phe Gly Pro Glu Pro Phe Pro Asp Ala Pro Arg Ser Ala Glu
740 745 750
Arg Leu Val Pro Gly Asp Gly Arg Ala Val Gly Arg Gly Leu Val Glu
755 760 765
Ala Leu Ala Arg Leu His Thr Ser Gly Ala Thr Val Gly Trp Ala Ala
770 775 780
Leu Phe Gly Gly Gly Pro Arg Pro Arg Ala Val Pro Leu Pro Thr Tyr
785 790 795 800
Ala Phe Gln Arg Arg Arg Phe Trp Leu Gln Glu Pro Pro Pro Ala Asp
805 810 815
Gly Ala Pro Val Ala Ala Tyr Ala Ser Glu Gly Gly Pro Arg
820 825 830
<210> 11
<211> 115
<212> PRT
<213> Artificial
<220>
<223> madA11基因编码的氨基酸序列
<400> 11
Met His Gln Gln Ala Thr Asn Asp Ser Asp Gly Gln Ala Val Leu Asp
1 5 10 15
Gly Thr Glu Leu Gln Glu Lys Leu Ala Ala Ala Gly Pro Pro Glu Arg
20 25 30
Glu Arg Ile Leu Gly Glu Thr Val Arg Glu Gln Ala Ala Ala Val Leu
35 40 45
Asp Gln Ser Val Ile Asp Leu Asp Ser Asn Phe Leu Glu Lys Gly Leu
50 55 60
Thr Ser Leu Lys Ala Leu Glu Leu Thr Arg Asn Leu Met Ser Leu Thr
65 70 75 80
Gly Ile Glu Ile Pro Leu Val Ala Val Ile Glu His Pro Thr Pro Val
85 90 95
His Leu Gly Arg Phe Met Ala Glu Leu Leu Ala Glu Ser Gly Asp Gly
100 105 110
Ala Ala Arg
115
<210> 12
<211> 306
<212> PRT
<213> Artificial
<220>
<223> madG1基因编码的氨基酸序列
<400> 12
Met Lys Gly Val Ile Leu Ala Gly Gly Lys Gly Thr Arg Leu Gln Pro
1 5 10 15
Val Thr Ala Val Gly Ser Lys Gln Leu Met Pro Ile Tyr Asp Lys Pro
20 25 30
Met Val Tyr Tyr Pro Leu Ser Ile Leu Met Phe Ala Gly Ile Arg Asp
35 40 45
Val Leu Val Ile Cys Arg Pro Ser Glu Val Ala Val Phe Arg Gly Leu
50 55 60
Phe Gly Asp Gly Arg Arg Leu Gly Met Asn Ile Asp Tyr Ala Ala Gln
65 70 75 80
Asp Glu Pro Arg Gly Ile Pro Asp Ala Phe Leu Val Gly Ala Asp His
85 90 95
Ile Gly Gly Asp Asp Cys Ala Leu Ile Leu Gly Asp Asn Leu Phe His
100 105 110
Gly Ser Gly Leu Pro Ser Leu Leu Arg Gln Ser Ala Glu Arg Met Asp
115 120 125
Gly Cys Val Leu Phe Gly His Gln Val Ser Asp Pro Glu Arg Tyr Gly
130 135 140
Val Ala Glu Ile Asp Glu Arg Gly Arg Leu Val Ser Ile Glu Glu Lys
145 150 155 160
Pro Ser Glu Pro Arg Ser Asn Leu Ala Ile Pro Gly Leu Tyr Phe Phe
165 170 175
Asp Asn Lys Val Val Glu Ile Ala Arg Gly Leu Ala Pro Ser Ala Arg
180 185 190
Gly Glu Leu Glu Ile Thr Asp Val Leu Arg Ala Tyr Leu Glu Ala Gly
195 200 205
Arg Ala Asp Leu Val Trp Leu Gly Arg Gly Val Thr Trp Leu Asp Thr
210 215 220
Gly Thr His Glu Thr Leu Leu Glu Ala Gly Cys Phe Val Arg Asp Val
225 230 235 240
His Gln Arg Gln Gly Ala Arg Ile Gly Cys Val Glu Glu Ile Ala Leu
245 250 255
Tyr Met Gly Phe Ile Gly Pro Asp Glu Cys Tyr Glu Leu Gly Ala Glu
260 265 270
Met Gly Asn Ser Pro Tyr Gly Arg Tyr Val Met Asp Gln Ala Arg Phe
275 280 285
Tyr Asp Arg Val Arg Asp Leu Leu Asp Trp Arg Ala Thr Phe Leu Glu
290 295 300
Gly Ala
305
<210> 13
<211> 343
<212> PRT
<213> Artificial
<220>
<223> madG2基因编码的氨基酸序列
<400> 13
Met Asn Leu Leu Val Thr Gly Ala Ala Gly Phe Ile Gly Ser Thr Tyr
1 5 10 15
Val Arg Met Leu Leu Ala Pro Ala Ala Pro Asp Pro Ala Ala Pro Asp
20 25 30
Pro Val Ala Pro Asp Pro Asp Ala Ala Asp Pro Asp Ala Ser Glu Glu
35 40 45
Ile Arg Val Thr Val Leu Asp Lys Leu Thr Tyr Ala Gly Thr Leu Asp
50 55 60
Asn Leu Thr Leu Ser His Pro Arg Leu Arg Phe Val Tyr Gly Asp Ile
65 70 75 80
Cys Asp Glu Asp Leu Val Gly Lys Leu Met Ser Glu Ala Asp Gln Val
85 90 95
Val His Phe Ala Ala Glu Ser His Val Asp Asn Ser Ile Ala Ala Ala
100 105 110
Gly Asp Phe Val Arg Thr Asn Val Val Gly Thr His Thr Leu Leu Glu
115 120 125
Gln Ala Leu Ala His Gly Val Arg Arg Phe Val His Val Ser Thr Asp
130 135 140
Glu Val Tyr Gly Ser Val Glu Thr Gly Arg Ser His Glu Thr Asp Ala
145 150 155 160
Leu Asp Pro Asn Ser Pro Tyr Ala Ala Ser Lys Ala Ala Ser Asp Leu
165 170 175
Leu Ala Leu Ala Tyr His Arg Thr His Gly Leu Asp Val Arg Val Thr
180 185 190
Arg Cys Cys Asn Asn Tyr Gly Pro Arg Gln Phe Pro Glu Lys Leu Ile
195 200 205
Pro Leu Phe Ile Thr Arg Leu Leu Asp Gly Gln Lys Val Pro Leu Tyr
210 215 220
Gly Asp Gly Gln His Lys Arg Asp Trp Leu His Val Asp Asp His Cys
225 230 235 240
Arg Ala Ile Glu Leu Val Arg Thr Ala Gly Ala Pro Gly Glu Val Tyr
245 250 255
Asn Ile Ser Gly His Thr Glu Leu Thr Asn His Gln Ile Thr Gly Leu
260 265 270
Leu Leu Glu Ala Val Gly Gly Asp Trp Asp Gln Val Ala Tyr Val Thr
275 280 285
Asp Arg Lys Ala His Asp Arg Arg Tyr Ala Val His Asp Asp Lys Ala
290 295 300
Arg Thr Arg Leu Gly Tyr Arg Pro Arg Arg Asp Phe Pro Arg Gly Leu
305 310 315 320
Ala Glu Thr Val Ala Trp Tyr Arg His Asn Arg Ser Trp Trp Glu Pro
325 330 335
Leu Lys Arg Arg Ala Gly Leu
340
<210> 14
<211> 479
<212> PRT
<213> Artificial
<220>
<223> madG3基因编码的氨基酸序列
<400> 14
Val Pro Asp Pro Ala Arg Pro Ala Pro Leu Ser Pro Arg Ser Pro Gln
1 5 10 15
Ser Pro Arg Ala Pro Arg Ser Pro Arg Gly Arg Pro Ala Pro Gly Gly
20 25 30
Arg Ser Pro Arg Ala Arg Glu Asp Thr Ala Ala Gln Arg Ala Ile Ala
35 40 45
Ala Ser Ala Ala Ala Leu Glu Gly Arg Val Pro Gly Val Ala Asp Val
50 55 60
Arg Ala Trp Leu Gly Glu Arg Arg Asp Ala Thr Arg Phe Gln Val Gly
65 70 75 80
Arg Ile Pro Phe Ala Glu Leu Asp Gly Trp Ser Phe Asp Gly Pro Thr
85 90 95
Gly Asn Leu Val His Asp Ser Gly Arg Phe Phe Thr Val Glu Gly Leu
100 105 110
Arg Val Ser Ile Asp Gly Asp Arg Pro Arg Glu Trp Gln Gln Pro Ile
115 120 125
Ile Cys Gln Phe Glu Val Gly Ile Leu Gly Ile Leu Ala Lys Arg Phe
130 135 140
Asp Gly Val Leu His Phe Leu Met Gln Ala Lys Met Glu Pro Gly Asn
145 150 155 160
Pro Asn Leu Leu Gln Leu Ser Pro Thr Val Gln Ala Thr Arg Ser Asn
165 170 175
Tyr Thr Lys Val His Arg Gly Ala Ser Val Arg Tyr Leu Glu Tyr Phe
180 185 190
Thr Gly Gly Glu Pro Val Arg Val Leu Ala Asp Val Leu Gln Ser Glu
195 200 205
His Gly Ala Trp Phe Cys Arg Lys Arg Asn Arg Asn Met Ile Val Glu
210 215 220
Ala Thr Gly Asp Val Pro Glu His Glu Asp Phe Arg Trp Leu Thr Leu
225 230 235 240
Gly Gln Leu Ala Gly Leu Leu His Leu Asp Asn Val Val Asn Met Asp
245 250 255
Ala Arg Ser Val Leu Ala Cys Met Pro Gly Ala Gly Gly Pro Thr Ala
260 265 270
Leu His Ser Asp Val Glu Leu Arg Ser Trp Ile Ala Ala Glu Arg Ser
275 280 285
Arg His Ile Val Thr Thr Glu Arg Val Pro Arg Ala Gly Leu Arg His
290 295 300
Trp Val Gln Gly Pro Trp Ser Ile Asp His Glu Glu Arg Arg His Phe
305 310 315 320
Arg Val Val Ala Val Ser Val Arg Ala Gly Asn Arg Glu Val Thr Arg
325 330 335
Trp Thr Gln Pro Leu Phe Glu Pro Tyr Glu Thr Gly Ile Val Ala Tyr
340 345 350
Val Thr Arg Thr Phe Glu Gly Val Pro His Val Leu Ala Arg Ala Arg
355 360 365
Val Glu Gly Gly Phe Leu Asp Thr Val Glu Leu Gly Pro Thr Val Gln
370 375 380
Ala Ile Pro Gly Asn Tyr Ala His Val Pro Ala Ser Glu Arg Pro Ala
385 390 395 400
Phe Ile Asp Leu Val Ser Ser Ala Asp Pro Ser Arg Val Arg Tyr Ser
405 410 415
Ala Val His Ser Glu Glu Gly Gly Arg Phe Leu Asn Ala Glu Ser Arg
420 425 430
Tyr Met Val Val Glu Ala Asp Glu Asp Glu Ala Pro Ile Asp Pro Pro
435 440 445
Ala Gly Phe Arg Trp Val Thr Pro Ala Ser Ser Pro Cys Ser Pro Ser
450 455 460
Thr Ala Thr Thr Ser Thr Cys Arg Arg Ala Ala Cys Ser Pro Ala
465 470 475
<210> 15
<211> 359
<212> PRT
<213> Artificial
<220>
<223> madG4基因编码的氨基酸序列
<400> 15
Val Arg His Val Asp Gly Val Glu Arg Gln Glu Pro Pro Leu Arg Leu
1 5 10 15
Gly Val Leu Gly Cys Ala Asp Ile Ala Arg Arg Arg Thr Val Pro Ala
20 25 30
Leu Leu Ala Ala Asp Gly Val Asp Val Thr Ala Val Ala Ser Arg Asp
35 40 45
Pro Gly Arg Ala Glu Ser Phe Ala Ala Glu Val Gly Cys Ala Ala Val
50 55 60
Gly Gly Tyr Gly Ala Leu Leu Ala Ser Asp Asp Ile Asp Ala Val Tyr
65 70 75 80
Ile Pro Leu Pro Ser Ala Leu His Ala Glu Trp Thr Glu Lys Ala Leu
85 90 95
Asp Ala Gly Lys His Val Leu Ala Glu Lys Pro Leu Thr Thr Asp Tyr
100 105 110
Glu Thr Thr Ala Arg Leu Leu Arg Leu Ala Arg Thr Arg Gly Leu Val
115 120 125
Leu Leu Glu Asn Val Thr Phe Leu His His Ser Gln His Thr Ala Val
130 135 140
Gln Lys Met Leu Ala Asp Gly Ala Ile Gly Glu Leu Arg Asp Phe Ser
145 150 155 160
Ser Ala Phe Thr Ile Pro Pro Leu Pro Ala Gly Asn Thr Gln Phe Leu
165 170 175
Pro Gly Leu Gly Gly Gly Ala Leu Leu Glu Gln Gly Val Tyr Pro Ile
180 185 190
Arg Ala Ala Leu Arg Phe Leu Gly Asn Asp Leu Asp Val Thr Ala Ala
195 200 205
Val Leu His Val Asp Gly Gly Thr Gly Met Val Val Gly Gly Arg Ala
210 215 220
Leu Ala Cys Thr Pro Gly Gly Val Thr Ala Asp Leu Ala Phe Gly Val
225 230 235 240
Arg His Ser Tyr Arg Thr Gly Cys Glu Phe Ala Gly Ser Ala Gly Arg
245 250 255
Leu Leu Thr Asp Arg Ala Phe Thr Pro Pro Ala Ala Tyr Gln Pro Val
260 265 270
Val Arg Ile Glu Arg Gln Asp His Arg Glu Glu Ile Thr Leu Pro Pro
275 280 285
Asp His Gln Trp Ala Asn Leu Gly Arg Phe Phe Ala Arg Ala Val Gly
290 295 300
Ala Ala Arg Val Ala Gly Ala Ala Gly Ala Ala Arg Thr Ala Pro Gly
305 310 315 320
Pro Gly Ala Gly Ala Glu Ala Gly Thr Gly Gly Glu Pro Glu Ala Phe
325 330 335
Gly Ala Gly Ser Leu His Gln Ala Arg Leu Val Ser Glu Ile Glu Glu
340 345 350
Arg Ala Arg Arg Ile Thr Val
355
<210> 16
<211> 227
<212> PRT
<213> Artificial
<220>
<223> madG5基因编码的氨基酸序列
<400> 16
Met Arg Arg Ser Ala Arg Arg Phe Ala Val Arg Ile Ser Glu Met Thr
1 5 10 15
Val Pro Asp Ala Tyr Arg Ile Glu Pro Asp Arg Ile Ala Asp Ser Arg
20 25 30
Gly Phe Phe Tyr Glu Ala Ile Arg Ala Glu Ala Leu Arg Ser Ile Thr
35 40 45
Gly His Glu Leu Glu Ile Arg Gln Val Asn Tyr Ser Val Ser Lys Arg
50 55 60
Asn Thr Val Arg Gly Ile His Gly Thr Ala Leu Pro Pro Gly Gln Ala
65 70 75 80
Lys Leu Val Thr Cys Val Arg Gly Ala Ala Leu Asp Ile Ala Val Asp
85 90 95
Ile Arg Val Gly Ser Pro Thr Phe Gly His Tyr Ala Val Thr Arg Gln
100 105 110
Asp Ala Glu Ser Gly Thr Ala Val Tyr Leu Pro Asp Gly Ile Gly His
115 120 125
Ala Phe Leu Ala Leu Thr Asp Asp Thr Cys Met Cys Tyr Leu Cys Ser
130 135 140
Val Glu Tyr Val Pro Gly Thr Met Ile Asp Ile Asp Ala Leu Asp Pro
145 150 155 160
Glu Leu Ala Leu Pro Trp Lys Leu Thr Gly Pro Pro Ile Arg Ser Ala
165 170 175
Lys Asp Ala Ala Ala Pro Ser Leu Ser Glu Ala Ala Ala Ala Gly Leu
180 185 190
Leu Pro Thr Tyr Glu Gln Cys Val Arg Ser Tyr Ala Arg Asn Gly Pro
195 200 205
Ala Gly Gly Gly Ala Ala Gly Pro Ala Ala Gly Gly Arg His Ser Phe
210 215 220
Gln Arg Arg
225
<210> 17
<211> 256
<212> PRT
<213> Artificial
<220>
<223> madG6基因编码的氨基酸序列
<400> 17
Met Ser Ile Ser Gln Tyr Thr Asp Pro Arg Ile Val Pro His Glu Ser
1 5 10 15
Asp Gly Glu Arg Glu Ala Arg Glu Arg Leu Thr Arg Leu Leu Ser Lys
20 25 30
Thr Pro Ile Gln Pro Glu Tyr Leu Ile Asp Asn Leu Ala Val Tyr Leu
35 40 45
Arg Arg His Gln Leu Ser Asp Leu Leu Ser Met Asp Ala Leu Tyr Arg
50 55 60
Glu Val Arg Asp Val Pro Gly Val Ile Met Glu Phe Gly Val Leu His
65 70 75 80
Gly Arg His Leu Ala Thr Leu Thr Ala Leu Arg Gly Ile Tyr Glu Pro
85 90 95
Tyr Asn Ser Leu Arg Arg Ile Ile Gly Phe Asp Thr Phe Thr Gly Phe
100 105 110
Pro Asp Ile Ser Pro Val Asp Glu Val Ser Pro Ser Ala Val Val Gly
115 120 125
Arg Phe Glu Thr Ala Arg Asp Tyr Pro Asp His Leu Arg Glu Val Leu
130 135 140
Ala Ala His Glu Leu Gly Glu Pro Met Gly His Val Gln Arg Thr Ile
145 150 155 160
Val Met Gln Gly Asp Val Arg Glu Thr Val Pro Arg Tyr Leu Glu Glu
165 170 175
Asn Pro Gln Thr Val Ile Ala Leu Ala Tyr Phe Asp Met Asp Leu Tyr
180 185 190
Ala Pro Thr Arg Glu Thr Leu Glu Thr Ile Arg Pro Tyr Leu Thr Arg
195 200 205
Gly Ser Ile Val Ala Phe Asp Glu Leu Ala His Pro Lys Trp Pro Gly
210 215 220
Glu Thr Ala Ala Phe Arg Glu Val Cys Gly Gln Asp Ala Thr Leu His
225 230 235 240
Gln Ile Pro Gly Arg Glu Pro Pro Val Ile Tyr Leu Arg Trp Asn Glu
245 250 255
<210> 18
<211> 331
<212> PRT
<213> Artificial
<220>
<223> madG7基因编码的氨基酸序列
<400> 18
Val Ser Gly Asp Gly Leu His Val Arg Leu Gly Arg Thr Gly Leu Thr
1 5 10 15
Val Ser Arg Leu Gly Leu Gly Thr Val Asn Phe Gly Gly Arg Val Glu
20 25 30
Glu Pro Glu Ala His Arg Leu Met Asp His Ala Leu Ala Cys Gly Leu
35 40 45
Asn Leu Leu Asp Thr Ala Asp Ser Tyr Gly Trp Arg Val Arg Lys Gly
50 55 60
His Thr Glu Glu Val Ile Gly Arg Trp Leu Ala Arg Gly Ser Arg Arg
65 70 75 80
Asp Asp Val Val Leu Ala Thr Lys Val Gly Ser Ala Met Gly Pro Gly
85 90 95
Pro Asn Asp Arg Gly Leu Ser Ala Arg His Ile Val Ala Ala Cys Glu
100 105 110
Asp Ser Leu Arg Arg Leu Gly Thr Asp Trp Ile Asp Leu Tyr Gln Leu
115 120 125
His Ser Met Asp Arg Ala Val Pro Trp Asp Glu Ile Trp Arg Ala Met
130 135 140
Glu Leu Leu Val Ala Gln Gly Lys Val Arg Tyr Val Gly Ser Ser Asn
145 150 155 160
Ala Ala Gly Trp Asp Leu Ala Ala Ala Gln Glu Thr Ala Ala Arg Arg
165 170 175
Gly Ala Pro Gly Leu Ala Ser Glu Gln Cys Leu Tyr Asn Leu Val Ala
180 185 190
Arg His Pro Glu Leu Glu Ile Ile Pro Ala Ala Arg Ala Tyr Gly Val
195 200 205
Ala Val Leu Val Trp Ser Pro Leu His Gly Gly Leu Leu Gly Gly Thr
210 215 220
Leu Arg Lys Ala Ala Glu Gly Thr Ala Val Lys Ser Ala Gln Gly Arg
225 230 235 240
Ala Ala Ala Ala Leu Asp Ala His Arg Glu Thr Ile Ala Glu Tyr Glu
245 250 255
Arg Phe Cys Gly Gly Leu Gly Arg Asp Pro Ala Glu Val Gly Leu Ala
260 265 270
Trp Ala Leu Arg Arg Pro Gly Val Thr Ser Leu Ile Ile Gly Pro Arg
275 280 285
Thr Pro Glu His Val Asp Gly Ala Leu Arg Ala Leu Asp Lys Pro Leu
290 295 300
Ser Asp Pro Glu Leu Asp Arg Leu Asp Ala Leu Phe Pro Pro Val Gly
305 310 315 320
Arg Gly Gly Pro Ala Pro Asp Ala Trp Ile Gly
325 330
<210> 19
<211> 312
<212> PRT
<213> Artificial
<220>
<223> madG8基因编码的氨基酸序列
<400> 19
Met Gly Pro Ser Tyr Gln Ser Ser Leu Gln Trp Glu Asn Ile Ser Lys
1 5 10 15
His Trp Val Thr Glu Asp Val Ser Ala Glu Leu Ala Asp Phe Lys Ser
20 25 30
Gly Arg Pro Asn His Lys Ile Ala Ile Trp Asn Pro Glu Ala Asn Gly
35 40 45
Ile Arg Tyr Leu Lys Thr Leu Val Tyr Asn Leu Ala Ser Gly Leu Gly
50 55 60
Pro Ala Glu Trp Glu Lys Leu Arg Arg Ile Arg Asn Arg Glu Val Gly
65 70 75 80
Asp Pro Phe Thr Val Thr Tyr Asp Gly Glu Ser Val Cys Leu Asp Tyr
85 90 95
Leu Gln Ala Val Leu Glu Leu Gly Phe Val Glu Lys Gly Leu Asp Leu
100 105 110
Ala Gly Ala Ala Val Leu Glu Ile Gly Ala Gly Tyr Gly Arg Thr Cys
115 120 125
His Thr Leu Leu Ser Asn His Asp Leu Ala Ser Tyr His Ile Val Asp
130 135 140
Leu Pro Asn Thr Leu Gly Leu Cys Arg Ser Tyr Leu Arg Ala Val Leu
145 150 155 160
Asp Glu Glu Arg Phe Asp Arg Ile Thr Phe Val Thr Val Asp Glu Val
165 170 175
Glu Asp Ala Leu Gly Ala Arg Arg Phe Asp Leu Ala Val Asn Ile Asn
180 185 190
Ser Phe Thr Glu Met Pro Pro Gly Thr Val Arg Asp Tyr Leu Ala Leu
195 200 205
Ile Asp Arg Arg Cys Asp Ala Phe Tyr Val Lys Asn Pro Val Gly Lys
210 215 220
Tyr Met Asp Lys Ser Leu Asp Gly His Thr Gln Gly Asp Glu Val Val
225 230 235 240
Arg Met Ala Met Glu Thr Gly Pro Leu Arg Lys Val Val Asp Ile Asn
245 250 255
Asp Ser Arg Ala Val Glu Ala Ala Val Pro Asp Phe Ile Glu Ala Tyr
260 265 270
Arg Pro Gly Gly Ala Trp Thr Cys Ala Ala Asp Gly Trp Pro Val Pro
275 280 285
Trp Thr Tyr Phe Trp Gln Ala Leu Tyr Arg Arg Ser Pro Ala Ala Asn
290 295 300
Gly Ser Ala Pro Ala Ala Gly Arg
305 310
<210> 20
<211> 480
<212> PRT
<213> Artificial
<220>
<223> madO1基因编码的氨基酸序列
<400> 20
Val Thr Met Thr Arg Ser Glu His Ala Ile Val Leu Gly Gly Gly Trp
1 5 10 15
Ala Gly Met Leu Ala Ala Gln Ala Leu Ser Arg Arg Phe Asp Ala Val
20 25 30
Thr Val Leu Glu Arg Asp Val Leu Pro Glu Gly Pro Arg Gln Arg Lys
35 40 45
Gly Gln Pro Gln Ala Arg His Gly His Ile Leu Trp Ser Ser Gly Ala
50 55 60
Arg Val Val Asp Ala Leu Leu Pro Gly Thr Ile Asp Arg Leu Leu Ser
65 70 75 80
Leu Gly Ala Arg Arg Ile Met Phe Gln Arg Asp Leu Val Thr Leu Thr
85 90 95
Ser His Gly Trp Gln His Arg Phe Pro Ser Lys Gln Tyr Cys Val Met
100 105 110
Cys Ser Arg Pro Leu Met Asp Trp Ala Val Arg Glu Gln Val Ala Ala
115 120 125
Ser Gly Arg Val Glu Val Arg Gln Arg Thr Glu Ala Leu Asp Leu Ala
130 135 140
Gly Asp Arg Asp Arg Ile Thr Gly Val His Val Arg Asp Val Ala Thr
145 150 155 160
Gly Ala Ala Ser Ser Leu Thr Ala Asp Leu Val Val Asp Ala Thr Gly
165 170 175
Arg Gly Ser Arg Leu Lys His Trp Leu Gly Ala Leu Gly Leu Pro Pro
180 185 190
Leu Glu Glu Asp Val Val Asp Ala Gly Met Ala Tyr Cys Ser Arg Ile
195 200 205
Tyr Gln Ala Pro Pro Gly Ala Ala Ala Gly Phe Pro Pro Val Asn Leu
210 215 220
Ala Pro Asp Pro Arg Val Arg Glu Pro Gly Arg Phe Gly Val Val His
225 230 235 240
Pro Gln Glu Asp Gly Thr Trp Met Val Thr Leu Ala Gly Thr Arg Gly
245 250 255
Val Arg Leu Pro Thr Asp Asp Ala Glu Phe Asn Glu Tyr Ala Arg Thr
260 265 270
Leu Arg Asp Pro Leu Val Ala Asp Leu Ile Ala Arg Ala Glu Pro Leu
275 280 285
Thr Pro Leu Val Val Ser His Phe Gly Ala Asn Arg Arg Leu Tyr Pro
290 295 300
Glu Arg Leu Ala Gly Trp Pro Asp Arg Leu Val Val Leu Gly Asp Ala
305 310 315 320
Leu Ala Val Phe Asn Pro Ile Tyr Gly His Gly Met Ser Ala Ala Ala
325 330 335
Arg Gly Ile Ala Ala Leu Asp Glu Arg Leu Ala Glu Glu Gly Leu Gly
340 345 350
Ala Gly Ala Val Ala Ala Ala Gln Arg Asp Val Cys Ala Ala Val Asp
355 360 365
Asp Pro Trp Ile Ile Ala Ala Ala Arg Asp Ile Glu Tyr Val Gly Cys
370 375 380
Arg Ser Thr Ala Thr Asp Pro Arg Leu Leu Gly Glu Ala Asp Ala Arg
385 390 395 400
Arg Arg Phe Ala Asp Val Ile Thr Val Arg Ser Leu Arg Ser Pro Gly
405 410 415
Val Ser Glu Met Val Thr Asp Ala Ala Ser Leu Ala Val Pro Gln Ser
420 425 430
Glu Leu Gly Ser Ser Arg Phe Met Ala Leu Leu Gly Ser Asp Pro Met
435 440 445
Arg Pro Glu Leu Thr Glu Pro Pro Leu Ser Pro Asp Glu Leu Ala Leu
450 455 460
Val Asn Leu Ser Pro Arg Ala Ala Val Gly Ala Glu Thr Ala Ser Gly
465 470 475 480
<210> 21
<211> 477
<212> PRT
<213> Artificial
<220>
<223> madO2基因编码的氨基酸序列
<400> 21
Val Thr Arg Ser Gly Arg Ala Ile Val Leu Gly Gly Ser Trp Ala Gly
1 5 10 15
Met Leu Ala Ala His Val Leu Ala Arg His Leu Glu Ala Val Thr Val
20 25 30
Val Glu Arg Asp Val Leu Pro Arg Gly Pro Glu His Arg Arg Gly Leu
35 40 45
Pro Gln Ala Arg His Val His Val Leu Trp Ser Gly Gly Ala Arg Ile
50 55 60
Val Glu Glu Leu Leu Pro Gly Thr Ala Glu Arg Leu Leu Ala Ala Gly
65 70 75 80
Ala Arg Arg Ile Gly Phe His Glu Asp Leu Val Thr Leu Thr Ser His
85 90 95
Gly Trp Gln His Arg Phe Pro Pro Arg Gln Tyr Ala Ile Met Cys Gly
100 105 110
Arg Pro Leu Leu Asp Trp Ile Val Arg Asp Arg Ile Leu Ala Asp Glu
115 120 125
Arg Ile Thr Leu Arg Glu Gly Thr Glu Ala Val Glu Leu Ala Gly Gly
130 135 140
Ala Lys Arg Val Thr Gly Val His Val Arg Gly Ala Arg Asp Gly Ala
145 150 155 160
Arg Glu Leu Leu Glu Ala Asp Leu Val Val Asp Ala Thr Gly Arg Gly
165 170 175
Ser Arg Leu Lys His Trp Leu Ala Ala Leu Gly Leu Pro Pro Leu Ala
180 185 190
Glu Asp Val Val Asp Ala Gly Ile Ala Tyr Ala Thr Arg Thr Phe Arg
195 200 205
Ala Pro Asp Gly Ala Thr Ala Gly Phe Pro Ala Ile Asn Val Ala Ala
210 215 220
Asp His Arg Val Arg Arg Pro Gly Arg Phe Gly Val Val Tyr Pro Gln
225 230 235 240
Glu Gly Gly Arg Trp Met Val Thr Leu Ser Cys Thr Arg Gly Gly Glu
245 250 255
Leu Pro Thr Arg Asp Glu Asp Phe Leu Pro Tyr Ala Arg Thr Leu Arg
260 265 270
Asp Pro Ile Val Ala Asp Leu Val Ala Gly Ala Glu Pro Leu Thr Pro
275 280 285
Val Phe Ala Ser His Ile Gly Ala Asn Arg Arg Leu Tyr Pro Glu Arg
290 295 300
Ala Ala Thr Trp Pro Asp Gly Leu Val Val Leu Gly Asp Ser Leu Ala
305 310 315 320
Ala Phe Asn Pro Ile Tyr Gly His Gly Met Ser Ala Ala Ala Arg Ala
325 330 335
Ala Arg Val Leu Asp Glu Arg Leu Ala Glu Pro Phe Gly Pro Gly Ala
340 345 350
Ala Arg Ala Ala Gln Glu Arg Ile Ser Ala Ala Ala Asp Asp Pro Trp
355 360 365
Ile Met Ala Thr Ser Lys Asp Ile Glu Phe Val His Cys Arg Asn His
370 375 380
Ala Glu Asp Pro Arg Leu Asn Gly Gly Ala Ala Val Met Arg Gly Phe
385 390 395 400
Ala Asp Met Val Ala Thr Met Ala Ile Arg Ser Ala Asp Val Ser Asp
405 410 415
Ala Val Thr Asp Val Met Ser Met Thr Val Pro Gln Ser Glu Leu Gly
420 425 430
Thr Ala Arg Phe Met Ser Leu Met Gln Gly Ala Arg Leu Leu Pro Glu
435 440 445
Leu Gly Glu Pro Pro Leu Arg Arg His Glu Leu Ala Leu Val Asn Leu
450 455 460
Asp Arg Arg Ala Val Ala Gly Val Arg Ser Val Pro Arg
465 470 475
<210> 22
<211> 285
<212> PRT
<213> Artificial
<220>
<223> madI基因编码的氨基酸序列
<400> 22
Met Leu Thr Asp Glu Ser Ala Arg Lys Arg Ile Ile Leu Glu His Cys
1 5 10 15
Arg Arg Leu Asn Ala Gly Asp Val Gly Gly Leu Leu Asp Leu Tyr Ala
20 25 30
Glu Asp Val Arg Phe Glu Asp Pro Val Gly Gly Gly Glu Arg Ala Gly
35 40 45
Arg Glu Glu Leu Arg Ala His Phe Ala Arg Ala Val Glu Ala Asn Ala
50 55 60
Arg Glu Val Pro Gly Glu Pro Val Ala Ser Gln Asp Gly Val His Ala
65 70 75 80
Leu Val Pro Val Ser Ala Thr Leu Asp Gly Pro Ala Ala Gly Pro Gly
85 90 95
Gly Val Arg Ser Ile Glu Arg Glu Tyr Val Leu Met Leu Arg Val Gly
100 105 110
Ala Gly Gly Leu Ile Glu Glu Leu Arg Ala Phe Trp Gly Ala Ala Glu
115 120 125
Ala Gly Thr Asp Asp Ala Val Arg Lys Asn Ala Pro Leu Glu Tyr Ala
130 135 140
Arg Arg Ile Asn Ala Gly Asp Leu Asp Gly Val Met Glu Leu Phe Thr
145 150 155 160
Asp Asp Ile Val Phe Glu Asp Pro Val Gly Ser Pro Pro Val Arg Gly
165 170 175
Lys Ala Ala Ile Arg Gln Asn Ile Ala Trp Ala Ile Glu Cys Lys Val
180 185 190
His Glu Val Pro Gly Arg Pro Val Leu Ala Leu Asp Gly Arg Thr Val
195 200 205
Val Ala Pro Ser Val Val Thr Thr Ser Tyr Pro Ser Asn Met Arg Tyr
210 215 220
Gln Val Ile Gly Val Ser Glu Val Asp Gly Asn Gly Leu Thr Arg Ser
225 230 235 240
Leu Lys Ala Tyr Trp Gly Ile Thr Asp Val Ser Leu Pro Asp Ala Pro
245 250 255
Arg Leu Thr Gly Val Ala His Ser Leu Ala Val Thr Glu Arg Leu Ile
260 265 270
Glu Asn Val Arg Leu Ser Glu Ala Arg Ala Gly Ser Ser
275 280 285
<210> 23
<211> 297
<212> PRT
<213> Artificial
<220>
<223> madE基因编码的氨基酸序列
<400> 23
Met Asp Ile Ser Val Pro Leu Gly Asp Ala Val Lys Leu His Val Arg
1 5 10 15
His Arg Glu Gly Thr Gly Gly Pro Pro Phe Leu Leu Leu His Gly Leu
20 25 30
Asp Ser Asn Ala Arg Met Trp Asp Leu Val Ala Asp Arg Leu Ala Ala
35 40 45
Glu Gly His Pro Val Tyr Ala Val Asp Gln Arg Gly His Gly Asp Ser
50 55 60
Asp Pro Ala Asp Asp Asp Tyr Gly Asn Glu Thr Ala Ala Ala Asp Ile
65 70 75 80
Ala Ala Val Ala Asp Ala Leu Gly Val Thr Arg Ala Val Val Ala Gly
85 90 95
His Ser Trp Gly Ala Leu Val Ser Leu Arg Leu Ala Ala Arg Arg Pro
100 105 110
Gly Leu Val Ala Gly Leu Ala Leu Ile Glu Gly Gly Trp Ala His Ala
115 120 125
Ser Val Val Cys Asp Ser Trp Glu Gln Phe Ala Gly Leu Leu Thr Met
130 135 140
Ser Glu Val Asp Leu Asn Gly Ala Thr Leu Asp Ala Met Arg Asp Tyr
145 150 155 160
Gln Arg Ala Val Tyr Pro Asp Trp Ser Ala Glu Ala Val Glu Ala Ser
165 170 175
Leu His Ser Leu Arg Val His Glu Asp Gly Ala Leu Thr Pro Arg Leu
180 185 190
Ser Pro Thr Gln Arg Asn Ala Ile Leu Arg Ser Ile Trp Asp Asp Pro
195 200 205
Pro Ala Gln Trp Tyr Pro Ala Ile Thr Val Pro Thr Leu Leu Met Pro
210 215 220
Ala Val Pro Lys Pro Asn Arg Arg Trp Glu Pro Leu Phe Glu Arg Val
225 230 235 240
Arg Ser Tyr Val Glu Pro Ala Val Ala Ala Leu Pro Asp Ala Thr Val
245 250 255
Arg Glu Tyr Val Gly Gly Asp His Asp Leu His Ala Gln His Pro Asp
260 265 270
Glu Val Ala Glu Asp Leu Leu Arg Leu Ala Arg Asn Val Gln Arg Arg
275 280 285
Ala Ala Leu Gly Asp Arg Thr Ala Asn
290 295
<210> 24
<211> 413
<212> PRT
<213> Artificial
<220>
<223> madP1基因编码的氨基酸序列
<400> 24
Met Thr Thr Asp Asp Tyr Ala Leu Thr Trp His Glu Leu Met Val Asp
1 5 10 15
Pro Tyr Ser Gly Phe Gly Arg Ile Arg Glu Gln Ala Pro Ile Val Gln
20 25 30
Gly Ser Trp Asp Gly Arg Pro Val Trp Ile Val Thr Arg His Glu Asp
35 40 45
Val Ser Ala Leu Leu Met Asp Arg Arg Leu Ala Thr Asn Ser Ser Ala
50 55 60
Val Pro Gly Cys Pro Asp Asp Tyr Arg Ala Ala Leu Arg Ala Ile Gly
65 70 75 80
Val Ala Glu Glu Leu Val Pro Tyr Leu Ala Gly Asp Leu Val Arg Thr
85 90 95
Asp Pro Glu Ser His Ala Arg Leu Arg Lys Leu Leu Ser Arg Ala Phe
100 105 110
Thr Ala Arg Arg Val Ala Arg Leu Arg Pro Arg Val Glu Asp Ile Ala
115 120 125
Arg Glu Leu Leu Asp Ala Leu Pro Gly Arg Ala Glu Gly Gly Ala Val
130 135 140
Glu Leu Ile Glu His Phe Ala Tyr Pro Leu Pro Ile Thr Val Ile Cys
145 150 155 160
Glu Leu Leu Gly Val Pro Glu Glu Asp Arg Pro Leu Trp Arg Gly Trp
165 170 175
Ser Asp Asp Tyr Val Ser Met Asp Pro Ala Arg Leu Asn Arg Met Leu
180 185 190
Ala Asp Met Ser Ala His Leu Arg Glu Leu Val Gly Arg Arg Arg Ala
195 200 205
Glu Pro Ala Asp Asp Leu Val Ser Ala Leu Ile Arg Thr His Asp Asp
210 215 220
Asp Gly Gly Leu Leu Ser His Thr Glu Leu Val Ala Met Val Leu Thr
225 230 235 240
Leu Met Ile Ala Ser Gln Leu Pro Thr Pro Gln Leu Val Ala Asn Gly
245 250 255
Ala Val Ala Leu Ala Ala Arg Pro Asp Gln Leu Ala Leu Leu Arg Ala
260 265 270
Asp Pro Gly Leu Trp Pro Gly Ala Val His Glu Leu Val Arg Leu Cys
275 280 285
Gly Pro Gly Ile Val Ala Met Leu Arg Tyr Ala Ala Glu Asp Ile Ala
290 295 300
Phe Gly Asp Thr Val Ile Arg Gln Gly Asp Arg Val Gln Leu Val Leu
305 310 315 320
Gly Ser Ala Asn Arg Asp Pro Arg Arg Phe Pro Ala Pro Asp Leu Leu
325 330 335
Asp Val Thr Arg Pro Val Asp Gly Gly Val Gln His Leu Gly Tyr Ser
340 345 350
Arg Gly Ala His Tyr Cys Leu Gly Ala Gly Leu Ala Asn Gln Glu Ile
355 360 365
Glu Val Ala Leu Ser Ala Leu Phe Gly Arg Tyr Pro Asp Leu Ala Leu
370 375 380
Ala Val Ala Pro Glu Glu Leu Glu Trp Arg Pro Ile Pro Leu Thr Arg
385 390 395 400
Gln Leu Val Arg Val Pro Val Val Leu Gly Glu Pro Ala
405 410
<210> 25
<211> 446
<212> PRT
<213> Artificial
<220>
<223> madP2基因编码的氨基酸序列
<400> 25
Met Thr Ala Glu Arg Thr Thr Gly Pro Arg Thr Ser Gly Glu His Val
1 5 10 15
Leu Ser Asp Pro Glu Leu Val Ala Asp Pro Tyr Gly Ala Phe Gly Arg
20 25 30
Ile Arg Glu Glu Ala Pro Val Ala Arg Gly Arg Leu Trp Asp Gly Gly
35 40 45
Pro Val Trp Ile Val Thr Arg His Asp Asp Val Ser Ala Val Leu Leu
50 55 60
Asp Arg Arg Phe Ala Ser Asn Ala Gly Ser Leu Pro Gly Arg Thr Asp
65 70 75 80
Glu His Ala Glu Phe Leu Ala Arg Thr Val Gly Ile Pro Glu Asp Val
85 90 95
Val Pro Tyr Leu Ala Gly Asn Leu Val Tyr Leu Asp Pro Pro Asp His
100 105 110
Thr Arg Leu Arg Gly Leu Val Ala Arg Ala Phe Thr Pro Arg Arg Ile
115 120 125
Ala Arg Leu Arg Pro Arg Val Glu Ala Ile Thr Gly Glu Leu Leu Asp
130 135 140
Thr Leu Pro Asp Arg Ala Val Asp Gly Val Val Asp Leu Ile Glu His
145 150 155 160
Phe Ala Tyr Pro Leu Pro Val Ser Val Ile Cys Glu Leu Leu Gly Val
165 170 175
Pro Glu Glu Asp Arg Pro Arg Trp His Ala Trp Ser Arg Arg Phe Gly
180 185 190
Asp Ser Thr Pro Arg Arg Leu Gly Glu Ile Val Thr Glu Met Asn Ala
195 200 205
His Ile Arg Glu Leu Val Glu Arg Arg Arg Ala Glu Pro Ala Asp Asp
210 215 220
Leu Leu Thr Ala Leu Thr Gly Ala His Asp Gly Gly Gly Arg Leu Ser
225 230 235 240
Asp Ala Glu Leu Val Thr Met Val Leu Thr Leu Met Ile Ala Gly His
245 250 255
Thr Thr Thr Ser His Met Leu Gly Asn Gly Ile Ala Ala Leu Leu Ala
260 265 270
His Pro Gly Gln Leu Ala Arg Leu Arg Glu Asp Pro Ala Leu Met Pro
275 280 285
Gly Ala Val Gln Glu Leu Leu Arg Trp Gly Ser Pro Val Val Ile Ala
290 295 300
Lys Leu Arg Tyr Ala Thr Gln Asp Val Thr Ile Gly Asp Thr Leu Ile
305 310 315 320
Arg Gln Gly Glu Arg Val Gln Leu Val Leu Gly Ser Ala Asn Arg Asp
325 330 335
Pro Arg Arg Phe Pro Asp Gly Asp Arg Leu Asp Val Glu Arg Pro Cys
340 345 350
Gly Ala Ala Asp Ala Gln His Leu Ala Tyr Ala Arg Gly Pro His Tyr
355 360 365
Cys Leu Gly Ala Ala Leu Ala Asn Gln Glu Ala Glu Val Ala Phe Ser
370 375 380
Ala Leu Phe Thr Arg Phe Pro Glu Leu Ala Leu Ala Val Pro Pro Asp
385 390 395 400
Arg Leu Glu Arg Asp Gln Ile Pro Ile Thr His Gln Leu Ala Arg Leu
405 410 415
Pro Val Thr Leu Gly Pro Pro Ser Ala Asp Ala Pro Ser Ala Gly Gly
420 425 430
Arg Ala Glu His Gly Pro Gly Gly Pro Asp Arg Ala Glu Gly
435 440 445
<210> 26
<211> 275
<212> PRT
<213> Artificial
<220>
<223> madM1基因编码的氨基酸序列
<400> 26
Met Ser Asp Tyr Tyr Ser Ala Met Gly Ala Leu Leu Gln Met Ala Trp
1 5 10 15
Gly Asp Asn Phe His Phe Gly Tyr Trp Asp Gly Pro Ser Asp Thr Ala
20 25 30
Thr Ile Asp Glu Ala Thr Asp Arg Phe Thr Asp Met Leu Thr Glu Arg
35 40 45
Leu Arg Val Gly Pro Gly Asp Arg Val Leu Asp Leu Gly Cys Gly Val
50 55 60
Gly Lys Pro Ala Leu Arg Ile Ala Ala Arg Thr Gly Ala Ser Val Val
65 70 75 80
Gly Val Thr Ile Asn Ala Arg His Val Glu Leu Ala Thr Glu Arg Ala
85 90 95
Arg Ala Glu Gly Arg Ala Asp Gln Val Ser Phe Arg His Ala Asp Ala
100 105 110
Met Asp Leu Pro Phe Glu Asp Gly Ser Phe Asp Ala Val Leu Ala Phe
115 120 125
Glu Ser Ile Ile His Met Asp Arg Pro Thr Val Leu Arg Glu Val Glu
130 135 140
Arg Val Leu Val Pro Gly Gly Arg Leu Ala Leu Thr Asp Leu Thr Pro
145 150 155 160
Leu Thr Asp Glu Ser Asn Thr Pro Lys Ser Phe Arg Ser Leu Met Gly
165 170 175
Ala Ser Pro Gly Asp Glu Ser Pro Ser Gly Gln Gly Gly Asp Pro Asp
180 185 190
Ile Ala Thr Leu Ile Ser Ala Gly Ala Trp Pro Gly Leu Leu Ser Asp
195 200 205
Ala Ser Leu Thr Leu Asp Glu Leu Asn Asp Val Thr Glu His Thr Gln
210 215 220
Gly Thr Phe Ile Arg Leu Phe Glu Asn Phe Phe Lys Tyr Arg Arg Glu
225 230 235 240
Phe Glu Arg Glu His Gly Ile Thr Val Glu Gln Val Leu Asp Ser Ala
245 250 255
Lys Ser Gly Ala Pro Thr Asp Gly Val Gly Cys Leu Val Val Ala Ala
260 265 270
His Lys Pro
275
<210> 27
<211> 311
<212> PRT
<213> Artificial
<220>
<223> madM2基因编码的氨基酸序列
<400> 27
Met Thr Gly Gly Thr Ile Ser Glu Gly Arg Gly Asp Gly Pro Arg Thr
1 5 10 15
Gly Pro Gly Pro Ala Gly Thr Arg Gly Ser Ser Pro Phe Thr Glu Lys
20 25 30
Trp Val Arg Ser Met Tyr Arg Leu Leu Glu Arg Leu Ser Ser Gly Gly
35 40 45
Ala Pro Asn Leu Leu Gly Ile Glu Asn Gly Tyr Leu Asn Tyr Gly Tyr
50 55 60
Trp Glu Pro Gly Cys Thr Asp His Asp Ala Ala Cys Val Ala Leu Ala
65 70 75 80
Glu Arg Leu Gly Glu Ala Ala Gly Ile Thr Ala Gly Asp Arg Val Leu
85 90 95
Asp Val Gly Phe Gly Phe Gly Glu Gln Asp Phe His Trp Leu Arg Thr
100 105 110
Arg Glu Pro Lys Glu Ile Val Gly Leu Asn Ile Thr Pro Gly Gln Val
115 120 125
Asp Ala Ala Arg Arg Arg Thr Arg Glu Leu Asp Leu Asp Asp Arg Leu
130 135 140
Asp Leu Arg Val Gly Ser Ala Thr Ser Leu Pro Phe Glu Asp Gly Ser
145 150 155 160
Phe Asp Arg Val Val Ala Leu Glu Ser Ser Ala His Phe Asn Thr Arg
165 170 175
Gln Val Phe Phe Arg Glu Ala Phe Arg Val Leu Arg Pro Gly Gly Val
180 185 190
Leu Ala Thr Thr Asp Pro Leu Pro Arg Glu Ala Pro Gly Gly Lys Gly
195 200 205
Gly Leu Val Leu Arg Leu Asp Glu Val Arg Arg Arg Arg Ile Ile Pro
210 215 220
Asp Ala Asn Trp Tyr Pro Arg Ser Val Tyr Ala Ala Arg Leu Ala Glu
225 230 235 240
Ala Gly Phe Val Asp Val Asp Val Arg Asp Val Thr Asp Arg Thr Ile
245 250 255
Ala Ala Asn Ala Val Phe Ala Arg Ala His Cys Ala Arg Leu Leu Arg
260 265 270
Asp Pro Arg Tyr Arg Ser Phe Gln Pro Arg Asn Thr Ile Arg Tyr His
275 280 285
Leu Arg Gln Val Glu Ala Arg Ala Ala Ala Arg Asp Tyr Val Ile Thr
290 295 300
Ser Ala Arg Lys Pro Gly Arg
305 310
<210> 28
<211> 433
<212> PRT
<213> Artificial
<220>
<223> madG9基因编码的氨基酸序列
<400> 28
Val Ala Val Arg Val Leu Phe Val Thr Leu Pro Trp Lys Thr His Leu
1 5 10 15
Phe Asn Phe Val Pro Leu Ala Trp Ala Leu Gln Thr Ala Gly His Glu
20 25 30
Val His Val Ala Ala Glu Pro Asp Leu Thr Gly Ala Val Thr Gly Ala
35 40 45
Gly Leu Thr Ala Val Pro Val Gly Ser Gly Glu Thr Met Gly Glu Arg
50 55 60
Val Arg Arg Ala Trp Arg Glu Gly Thr Leu Pro Thr Pro Glu Glu Ala
65 70 75 80
Pro Pro Pro Gly His Pro Val Glu Leu Tyr Asp Leu Gly Pro Asp Arg
85 90 95
Gln Arg Leu Ser Trp Gln Glu Leu Asn Arg Ile His Asp Thr Leu Val
100 105 110
Val Pro Arg Ala Trp Leu Ser Asn Asp Thr Met Phe Asp Asp Leu Val
115 120 125
Ala Tyr Cys Arg Ser Trp Arg Pro Asp Leu Val Ile Trp Asn Ala Val
130 135 140
Thr Phe Ala Gly Ser Val Ala Ala Ala Ala Val Gly Ala Ala His Ala
145 150 155 160
Arg Phe Leu Phe Ser Val Asp Leu Tyr Ser Arg Leu Arg Asp Asp Gln
165 170 175
Leu Thr Val Met Ala Arg Gln Pro Pro Gln Asp Arg Arg Asp Gly Leu
180 185 190
Lys Asp Trp Phe Ala Pro Trp Ala Ala Lys Tyr Gly Val Glu Phe Ser
195 200 205
Glu Glu Leu Val Asn Gly His Phe Ser Ile Asp Gln Met Pro Ala Ser
210 215 220
Phe Arg Leu Asp Phe Pro His Arg Thr Val Ser Met Arg His Val Pro
225 230 235 240
Tyr Asn Gly Pro Ala Val Ile Pro Ala Trp Leu Ala Glu Pro Pro Arg
245 250 255
Ala Pro Arg Val Leu Met Thr Phe Gly Val Ser Val Arg Ser Trp Pro
260 265 270
Glu Leu Gln Val Val Pro Val Glu Arg Leu Arg Glu Ser Leu Asp Ser
275 280 285
Leu Ala Asp Leu Asp Val Glu Val Val Val Thr Leu Pro Asp Asp Val
290 295 300
Arg Ala Ser Leu Gly Arg Val Pro Ala Asn Val Arg Val Val Asp Phe
305 310 315 320
Val Pro Leu His Ala Val Leu Pro Thr Cys Ser Ala Val Val His His
325 330 335
Gly Gly Ala Gly Ser Phe Asn Gly Ser Leu Leu Ser Gly Val Pro Gln
340 345 350
Leu Leu Val Ser Thr Ala Leu Asp Ala Pro Phe Lys Asp His His Leu
355 360 365
Arg Ala Ala Gly Ala Gly Leu Ala Ile Thr Pro Asp Arg Phe Ser Gly
370 375 380
Pro Leu Ile Arg Glu Asn Val Val Arg Met Leu Glu Asp Pro Ala Phe
385 390 395 400
Arg Ala Gly Ala Glu Thr Leu Arg Arg Glu Ile Met Ser Tyr Pro Ala
405 410 415
Pro Asn Ala Leu Val Pro Glu Leu Glu Arg Leu Thr Ala Glu His Arg
420 425 430
Ala
<210> 29
<211> 449
<212> PRT
<213> Artificial
<220>
<223> madG10基因编码的氨基酸序列
<400> 29
Val Ala Val Arg Val Leu Phe Val Thr Pro Thr Trp His Thr His Phe
1 5 10 15
Phe Asn Leu Val Pro Leu Ala Trp Ser Leu Gln Thr Ala Gly His Asp
20 25 30
Val Arg Val Ala Cys Glu Pro Glu Leu Val Asp Thr Val Thr Arg Ser
35 40 45
Gly Leu Thr Ala Val Pro Val Gly Ser Asp Glu Pro Ile Arg Asp Arg
50 55 60
Ala Arg Arg Ala Glu Glu Asp Gly Thr Leu Pro Pro Leu Asp Met Gly
65 70 75 80
Arg Leu Ala Gly Ala Leu Gly Gly Ala Thr Ser Thr Ala Pro Lys Pro
85 90 95
Ser Gly Ser Gly Gly Gln Ala Gly Gly Ala Ile Gly Asp Pro Arg Ala
100 105 110
Lys Leu Ser Trp Glu Asp Met Val Trp Leu Tyr Glu Thr Val Ala Val
115 120 125
Pro Arg Gly Gln Asp Arg Glu Arg His Ala Val Arg Arg Pro Gly Gly
130 135 140
Val Gln Pro Val Val Ala Ala Pro Ala Arg Arg Leu Gly Arg Pro His
145 150 155 160
Pro Arg Gly Pro Val Ala Ala Gly Ala Val Gly Ala Ala His Ala Arg
165 170 175
Val Thr Phe Thr Val Asp Leu Ser Phe Gln Leu Arg Ser Gly Phe Leu
180 185 190
Trp Thr Met Ala Gln Gln Pro Pro Glu Arg Arg Arg Asp Pro Leu Ala
195 200 205
Glu Trp Leu Gly Ser Trp Thr Glu Lys Phe Gly Tyr Glu Tyr Ser Glu
210 215 220
Thr Leu Val Asn Ala His Ala Thr Ile Asp Gln Phe Pro Pro Ser Phe
225 230 235 240
Gly Gly Asp Tyr Gly Ala Ser His Leu Asn Leu Arg Tyr Val Pro Tyr
245 250 255
Asn Gly Pro Ala Val Ile Pro Asp Trp Leu Asn Glu Pro Pro Pro Ala
260 265 270
Pro Arg Val Leu Met Thr Leu Gly Val Ser Met Ser Asp Trp Gln Glu
275 280 285
Leu Gln Val Met Ser Ile Glu Arg Val Gln Glu Val Leu Asp Ser Val
290 295 300
Ala Gly Leu Asp Met Glu Leu Val Leu Thr Leu Pro Thr Ala Phe Arg
305 310 315 320
Glu Lys Leu Asp Arg Val Pro Arg Asn Thr Arg Ile Val Glu Phe Ala
325 330 335
Pro Phe His Ala Val Leu Pro Thr Cys Ala Ala Met Ile His His Gly
340 345 350
Gly Ala Gly Thr Phe Tyr Asn Ala Leu Leu Ala Gly Thr Pro Gln Leu
355 360 365
Leu Ile Thr Lys Val Pro Asp Ala Leu His Lys Arg Ala Tyr Leu Ala
370 375 380
Glu Thr Gly Ala Gly Leu Ser Ile Pro Pro Asp Glu Val Thr Gly Pro
385 390 395 400
Lys Val Arg Glu Ser Leu Ala Arg Leu Leu Asp Asp Pro Ser Phe Arg
405 410 415
Ala Gly Ala Glu Arg Ile Arg Arg Glu Val Leu Asp Gln Pro Thr Pro
420 425 430
Asn Gly Leu Val Pro Glu Leu Glu Arg Leu Ala Ala Arg His Gly Pro
435 440 445
Ala
<210> 30
<211> 303
<212> PRT
<213> Artificial
<220>
<223> madTE基因编码的氨基酸序列
<400> 30
Met Lys Ala Gly Ser Val Arg Thr Gly Gly Thr Pro Gly Glu His Ala
1 5 10 15
Gly Arg Pro Pro Arg Glu Arg Pro Val Arg Glu Arg Lys Gly Val Met
20 25 30
Thr Gln Pro Ser Thr Asp Phe Glu Ala Trp Ile Arg Arg Tyr His Gln
35 40 45
Ala Pro Ser Ser Gly Val Arg Leu Val Cys Phe Pro His Ala Gly Gly
50 55 60
Ser Ala Ser Phe Phe Phe Pro Val Ser Ala Arg Leu Ser Pro Ala Val
65 70 75 80
Asp Val Ala Ala Val Gln Tyr Pro Gly Arg Gln Asp Arg Arg Ala Glu
85 90 95
Ala Asn Ile Gly Thr Ile Pro Glu Met Ala Asp Ala Val Leu Pro Ala
100 105 110
Val Arg Ala Leu Ala Asp Arg Pro Leu Ala Phe Phe Gly His Ser Met
115 120 125
Gly Ala Val Leu Ala Tyr Glu Val Ala Arg Arg Leu Glu Arg Asp Gly
130 135 140
Ala Glu Pro Leu Ala Arg Leu Tyr Val Ser Gly Arg Arg Ala Pro Ser
145 150 155 160
Arg His Arg Ser Glu His Val His Glu Leu Asp Asp Ala Gly Leu Val
165 170 175
Ala Glu Leu Gln Arg Leu Ser Gly Thr Asp Ser Ser Leu Leu Gly Asp
180 185 190
Pro Glu Ala Arg Asp Met Ile Leu Pro Ala Val Arg Ser Asp Tyr His
195 200 205
Ala Val Glu Thr Tyr Arg His Val Pro Gly Ala Ala Leu Arg Cys Pro
210 215 220
Val Val Ala Leu Val Gly Asp Arg Asp Pro Gln Val Thr Arg Asp Glu
225 230 235 240
Ala Leu Ala Trp Ala Glu His Ser Ser Gly Pro Phe Asp Leu Arg Val
245 250 255
Phe Pro Gly Gly His Phe Tyr Leu Val Glu His Gly Pro Arg Val Ile
260 265 270
Gln Ala Leu Ala Asp Asp Leu Ala Ala Val Arg Val Pro Ser Gly Ala
275 280 285
Gly Gly Val Ala Gly Gly Ala Gly Arg Thr Asp Ala Ala Ala Thr
290 295 300
<210> 31
<211> 255
<212> PRT
<213> Artificial
<220>
<223> madR1基因编码的氨基酸序列
<400> 31
Val Arg Tyr Glu Ile Leu Gly Pro Phe Gln Val Val Asp Glu Asp Gly
1 5 10 15
Lys His Thr Ile Arg Ala Arg Lys Ile Arg Val Leu Leu Thr Val Leu
20 25 30
Leu Val Arg Ala Asp Gln Val Val Pro Val Asp Gln Leu Ile Thr Glu
35 40 45
Ile Trp Gly Glu Ala Pro Pro Arg Arg Ala Thr Ala Gly Leu His Val
50 55 60
Tyr Ile Ser Gln Ile Arg Lys Phe Leu His Arg Pro Gly Ala Glu Asp
65 70 75 80
Pro Val Leu Thr Arg Pro Pro Gly Tyr Leu Leu Arg Leu Gly Ala Asp
85 90 95
Glu Leu Asp Leu Arg Arg Leu Glu Leu Leu Val Asn Gln Gly Arg Asn
100 105 110
Asp Phe Arg Gln Gly Arg His Asp Gln Ala Ser Ala Ala Phe Glu Glu
115 120 125
Ala Leu Asp Leu Ser Ala Asp His Leu Pro Asp Asp Leu Gly Asn Gly
130 135 140
Pro Val Leu Glu Gly Phe His Thr Trp Leu Arg Glu Thr Arg Leu Glu
145 150 155 160
Cys Val Glu Met Leu Met Glu Ser Arg Met Met Leu Gly Arg His Arg
165 170 175
Glu Leu Val Gly Asp Leu Tyr Gln Leu Thr Thr Glu His Pro Leu Arg
180 185 190
Glu Val Leu His Arg Gln Leu Met Leu Ala Leu Tyr Arg Ser Gly Arg
195 200 205
Arg Ala Asp Ala Leu His Ala Tyr Gln Ser Ala Arg Arg Thr Leu Asn
210 215 220
Glu Glu Leu Gly Leu Glu Pro Cys Arg Asp Leu Gln Asn Leu Gln Arg
225 230 235 240
Ser Ile Leu Thr Ser Asp His Arg Leu Asp Leu Pro Ile Pro Ala
245 250 255
<210> 32
<211> 965
<212> PRT
<213> Artificial
<220>
<223> madR2基因编码的氨基酸序列
<400> 32
Val Ser Leu Val Glu Arg Asn Ala Glu Val Ala Val Leu His Gly Leu
1 5 10 15
Phe Gly Asp Cys Leu Glu Ala Arg Ser Arg Leu Ala Ile Ile Ser Gly
20 25 30
Asp Val Ala Thr Gly Lys Ser Ser Leu Leu Asp Ala Phe Glu Arg Gln
35 40 45
Ala Ala Glu Ser Gly Ala Val Val Leu Asn Ala Val Ala Ser Ser Val
50 55 60
Glu Thr Ala Leu Pro Met Gly Val Leu Glu Gln Leu Phe Arg Ser Pro
65 70 75 80
Glu Val Pro Glu Ala Ile Ala Glu Arg Ala Met Lys Leu Leu Asn Ala
85 90 95
Arg Ala Leu Thr Tyr Pro Met Arg Phe Arg Gly Leu Trp Asp Val Leu
100 105 110
Arg Asp Leu Thr Glu Arg Arg Pro Val Val Ile Ser Val Asp Asp Val
115 120 125
His His Ala Asp Glu Ile Ser Leu Gln Cys Leu Leu Tyr Leu Leu Arg
130 135 140
Arg Leu Arg Ser Ala Arg Leu Leu Thr Val Leu Thr Val Ser Pro Arg
145 150 155 160
Ser Gln Ala Pro Asn Ala Pro Phe Gln Ser Glu Phe Leu Arg Glu Pro
165 170 175
His Ser Arg Gln Ile Arg Leu Gly Pro Leu Ser Pro Asp Gly Val Glu
180 185 190
Ala Leu Ile Arg Leu Asp Leu Asp Ala Gln Thr Ala Arg Leu Leu Ala
195 200 205
Pro Ala Phe His Glu Ala Ser Ala Gly Asn Pro Ala Leu Val His Ala
210 215 220
Leu Val Asp Asp Tyr Leu Ala Gly Pro Glu Phe Ile Ala Pro Glu Leu
225 230 235 240
Val Ala Gly Gly Ala Phe Gly Arg Ala Val Val Gly Leu Leu Gln Arg
245 250 255
His Glu Phe Pro Val Leu Glu Val Ala Arg Ala Ile Gly Ile Leu Asn
260 265 270
Glu Pro Val Pro Pro Ser Leu Ile Gly Arg Leu Leu Asp Ile Asp Ala
275 280 285
Glu Thr Ala Ala Arg Ala Val Gly Thr Leu Thr Ser Ala Gly Ile Leu
290 295 300
Asp Ala Gly Ser Phe Arg His Gly Ala Ala Gln Ala Ala Val Val Asp
305 310 315 320
Ser Ala Pro Pro Asp Ala Pro Gly Val Leu His Glu Arg Ala Ala Glu
325 330 335
Gln Leu His Ser Asp Gly Ala Ala Ala Thr Asp Val Ala Ala His Ile
340 345 350
Val Ala Ser Asn Arg His Gly Ala Pro Trp Ala Ile Pro Val Leu Arg
355 360 365
Glu Ala Ala Glu Gln Ala Leu Thr Ser Asp Asp Leu Gly Thr Gly Ile
370 375 380
Arg Tyr Leu Arg Val Ala His Gln Ile Cys Arg Asp Arg Arg Glu Arg
385 390 395 400
Ser Ala Ile Ala Ala Arg Leu Ala Asp Leu Glu Trp Arg Val Asp Pro
405 410 415
Ser Val Ala Leu Arg Trp Val Pro Glu Phe Ser Leu Ala Ile Gln Asp
420 425 430
Gly Leu Leu Asp Gly Arg Asp Ala Gly Thr Pro Phe Met Ser Leu Leu
435 440 445
Trp His Gly Arg Val Ser Glu Ala Val Arg Met Leu Asp Ala Leu Gly
450 455 460
Arg Gly Arg Pro Ser Ala Ala Leu Pro Gly Asp Ala Gly Thr Ala Met
465 470 475 480
Asp Val Ile Pro Pro Arg Phe Cys Leu Ala Tyr Leu Tyr Pro Glu Leu
485 490 495
Ala Lys Asp Gly Pro Ala Ala Asp Thr Ala Pro Ala Leu Gly Pro Val
500 505 510
Pro Ala Ala Ala Pro Gly Pro Gly Gln Asp Ala Ala Ala Met Leu Val
515 520 525
Ala Glu Leu His Arg Arg Asp Val Arg Gly Ala Leu Val Ala Ala Glu
530 535 540
Arg Val Met Glu Arg Ser Arg Leu Asn His Arg Thr Phe Ala Ser Leu
545 550 555 560
Ala Ile Ala Leu Ala Thr Leu Ile Tyr Asn Asp Arg Pro Gly Arg Ala
565 570 575
Ala Val Trp Cys Asp Ala Leu Leu Ala Glu Ala Thr Ala Arg Arg Ser
580 585 590
Pro Thr Trp Leu Ala Leu Phe Thr Ala Leu Arg Ala Leu Ile His Leu
595 600 605
Arg Gln Gly Asp Leu Pro Ala Ala Glu Arg His Ala Asp Thr Ala Leu
610 615 620
Asn Leu Ile Ser Pro Lys Ser Trp Gly Val Leu Val Gly Leu Pro Leu
625 630 635 640
Ser Cys Met Ile Leu Ala Lys Thr Ala Met Gly Arg Pro Arg Glu Ala
645 650 655
Ala Asp Leu Leu Ser Ile Thr Val Pro Glu Ala Met Phe Arg Thr Pro
660 665 670
Leu Gly Pro His Tyr Leu Tyr Ala Arg Gly Arg His His Leu Ala Ala
675 680 685
Asn Arg Leu His Ala Ala Leu Ser Asp Phe Arg Thr Cys Gly Thr Met
690 695 700
Arg Glu Gly Trp Ala Val Val Leu Pro Glu Val Glu Pro Trp Arg Leu
705 710 715 720
Gly Ala Ala Gln Ala Leu Ile Ala Leu Gly Asp Leu Ala Glu Ala Arg
725 730 735
Ala Leu Met Glu Glu Gln Leu Leu Arg Ser Ala Pro Glu His Val Arg
740 745 750
Thr Arg Gly Val Ala Leu Arg Val Lys Ala Ala Thr His Pro Val Pro
755 760 765
Glu Arg Pro Asp Leu Leu Gly Glu Ala Ile Glu Leu Leu Gln Gly Cys
770 775 780
Gly Asp Arg Leu Glu Leu Ser Tyr Ala Leu Ala Asp Leu Ala Glu Val
785 790 795 800
Arg Arg Trp Val Gly Glu His Asp Cys Ala Arg Glu Leu Ala His Gln
805 810 815
Ala Glu Leu Leu Ala Arg Gln Cys Gly Arg Pro Ala Ala Arg Glu Gly
820 825 830
Glu Arg Pro Asp Val Pro Gly Gly Val Val Arg Pro Asp Asp Leu Ala
835 840 845
Gly Arg His Arg Met Gly Ala Gly His Ala Ala Gly Leu Ala Gly Ser
850 855 860
Ala Gly Arg Asp Ala Gly Gly Val Ala Ala Ala Asp Pro Glu Arg Phe
865 870 875 880
Leu Lys Leu Ser Asp Ala Glu Arg Arg Val Ala Val Leu Ala Ala Ser
885 890 895
Gly Tyr Thr Asn Arg Glu Ile Ser Lys Lys Leu Tyr Ile Thr Ile Ser
900 905 910
Thr Val Glu Gln His Leu Thr Lys Val Tyr Arg Lys Leu Asn Val Lys
915 920 925
Arg Phe Asp Leu Lys Ser Ala Leu Gln His Ile Glu Val Val Leu Pro
930 935 940
Ala Ser Glu Ala Val Leu Asp Cys Cys Gly Ser Gly Pro Leu Arg Gln
945 950 955 960
Ala Ala Gly Asp Arg
965
<210> 33
<211> 929
<212> PRT
<213> Artificial
<220>
<223> madR3基因编码的氨基酸序列
<400> 33
Val Pro Val Ser Leu Val Glu Arg Asp Glu Glu Leu Ala Ala Leu Asp
1 5 10 15
Gly Leu Phe Ala Glu Leu Pro Arg Ala Arg Ser Arg Val Val Val Val
20 25 30
Ser Gly Gly Thr Ala Thr Gly Lys Thr Ser Leu Leu Arg Ala Phe Gly
35 40 45
Glu Arg Ala Val Glu His Gly Ala Leu Phe Leu Ser Ala Val Ala Ser
50 55 60
Arg Ile Glu Arg Gly Leu Pro Met Gly Val Leu Glu Gln Leu Phe Arg
65 70 75 80
Asn Pro Asp Leu Pro Ser Gly Asp Ala Glu Arg Ala Met Arg Trp Leu
85 90 95
Asp Ala Gly Ala Leu Asn Thr Ser Ala Ala Gly Pro Gly Gly Ser Gly
100 105 110
Gln Val Thr Ser Val Val Leu Arg Gly Leu Cys Glu Val Leu Arg Gly
115 120 125
Leu Ala Glu Arg Gln Pro Val Val Ile Ala Val Asp Asp Val His Tyr
130 135 140
Ala Asp Glu Thr Ser Leu Arg Cys Leu Leu Tyr Leu Leu Arg Arg Leu
145 150 155 160
Arg Ser Ser Arg Leu His Val Ile Phe Ser Glu Cys Arg Asp Leu Glu
165 170 175
Ala Ala Asn Ala Leu Leu Arg Ser Glu Phe Leu Arg Glu Pro Phe Phe
180 185 190
Arg Gln Ile Arg Val Glu Pro Leu Thr Lys Ala Gly Val Ala Arg Leu
195 200 205
Leu Arg Arg Ser Leu Gly Asp Ala Ala Ala Gln Asp Leu Thr Pro Ala
210 215 220
Phe His Glu Ala Thr Ser Gly Tyr Pro Ala Leu Val Gln Ala Leu Ile
225 230 235 240
Glu Asp His Arg Ala Ala Ala Ala Ala Gly Arg Gly Glu Leu Gly Ser
245 250 255
Gly Met Glu Phe Ser Arg Ala Val Met Thr Phe Leu Tyr Arg Tyr Glu
260 265 270
Pro Pro Val Cys Glu Val Ala Arg Ala Ile Ala Ile Leu Gly Lys Ala
275 280 285
Ala Ser Thr Thr Leu Leu Gly Arg Leu Leu Asp Leu Ala Ala Glu Ser
290 295 300
Thr Ala Gln Ala Val Asn Ala Leu Thr Lys Ala Arg Ile Leu Glu Asn
305 310 315 320
Gly Asp Phe Arg Ser Glu Ile Val Arg Ala Ala Ile Leu Asp Ala Thr
325 330 335
Pro Ser Gly Glu Arg Met Ala Met His Gly Ser Ala Ala Val Leu Leu
340 345 350
His Asn Glu Gly Ala Ala Pro Thr Asp Val Ala Ala His Ile Val Ala
355 360 365
Ala Gly Arg Ile Glu Ala Pro Trp Val Val Pro Leu Leu Arg Glu Ala
370 375 380
Ala Glu His Ala Leu Ala Asn Asp Asp Val Arg Pro Ala Ile Arg His
385 390 395 400
Leu Arg Ala Ala Tyr Arg Leu Cys Asp Asp Glu Arg Leu Arg Pro Glu
405 410 415
Ile Ala Ser Ala Leu Ala Asp Ala Glu Trp Arg Val Asp Pro Ser Ala
420 425 430
Val Leu Arg His Leu Pro Asp Phe Ala Ala Pro Ala Arg Thr Gly Arg
435 440 445
Pro Glu Leu Cys Asp Thr Phe Thr Pro Ile Thr Tyr Leu Leu Trp His
450 455 460
Gly Arg Val Gly Glu Ala Leu Gly Ile Val Asp Asp Leu Val Arg Thr
465 470 475 480
Gln Asp Glu Ala Ala Glu Ser Gly Thr Ala Ala Ser Val Asp Leu Asp
485 490 495
Thr Pro Arg Leu Trp Leu Ser Tyr Leu Tyr Pro Gly Val Leu Lys Ala
500 505 510
Glu Pro Val Pro Ala His His Glu Pro Glu Pro Thr Gly Ala Leu Pro
515 520 525
Asp Leu Gln Glu Ala Thr Met Leu Ala Ala Glu Leu Val Asp Glu Asn
530 535 540
Glu Ser Asp Ala Leu Thr Thr Ala Glu Gly Ile Leu Gln Arg Ser Arg
545 550 555 560
Leu Ser Asn Arg Thr Leu Ala Pro Leu Thr Thr Ala Leu Ala Val Leu
565 570 575
Met Tyr Asn Asp Arg Leu Glu Gln Ala Ala Ser Trp Cys Asp Ser Leu
580 585 590
Leu Val Glu Ala Val Ala Arg Arg Ser Pro Thr Trp His Ala Leu Phe
595 600 605
Ala Ala Glu Arg Ala Leu Ile His Val Arg Gln Gly Asn Val Ile Ala
610 615 620
Ala Glu Arg His Ala Asp Ala Ala Leu Ser Leu Ile Pro Pro Lys Ser
625 630 635 640
Trp Gly Val Ala Val Gly Leu Pro Leu Ala Ala Met Val Leu Ala Thr
645 650 655
Thr Ala Met Gly Lys Val Glu Arg Ala Ala Thr Leu Leu Asp Ile Pro
660 665 670
Val Pro Gln Thr Met Phe Gln Thr Arg Met Gly Leu His Tyr Leu Tyr
675 680 685
Ala Arg Gly Arg His His Leu Ala Glu Gly Arg Phe Tyr Ala Ala Ala
690 695 700
Cys Asp Phe Arg Ala Cys Gly Lys Leu Met Asn Asp Trp Gly Ile Asp
705 710 715 720
Leu Pro Ala Val Glu Pro Trp Arg Ile Gly Ala Ala Glu Ala His Leu
725 730 735
Arg Leu Gly Asp Ala Ala Gln Ala Arg Glu Leu Val Asp Glu Gln Leu
740 745 750
Ala Leu Leu Glu Pro Glu His Leu Arg Ala Arg Gly Met Ser Leu Arg
755 760 765
Ala Gln Ala Ala Thr Ser Pro Pro Glu Ser Arg Pro Ala Leu Leu Gly
770 775 780
Gln Ala Val Glu Met Leu Gln Ser Cys Gly Asp Gln Leu Glu Leu Ser
785 790 795 800
Arg Ala Leu Ala Asp Gln Ser Arg Ala Tyr Glu Ile Leu Gly Glu Pro
805 810 815
Glu Arg Ala Ser Leu Leu Ser Arg Arg Ser Leu Ser Leu Ala Arg Lys
820 825 830
Cys Gly Ile Leu Glu Arg Thr Thr Glu Glu Pro Pro Cys Pro Ser Thr
835 840 845
Leu Glu Leu Ser Asp Ala Glu Arg Arg Val Ala Ala Leu Ala Ala Asp
850 855 860
Gly Tyr Thr Asn Arg Glu Ile Ser Arg Lys Leu Tyr Ile Thr Val Ser
865 870 875 880
Thr Val Glu Gln His Leu Thr Lys Val Tyr Arg Lys Leu Asn Val Arg
885 890 895
Arg Phe Asp Leu Arg Ser Ala Leu Arg Arg Asp Ala Ala Ser Pro Val
900 905 910
Arg Asp Pro Ala Ala Gly Pro Gly Arg Pro Arg Pro Gly Gly Thr Gly
915 920 925
Thr
<210> 34
<211> 233
<212> PRT
<213> Artificial
<220>
<223> madR4基因编码的氨基酸序列
<400> 34
Met Leu Ala Glu Arg Gly Gly Ala Ala Phe Glu Asp Leu Thr Ala Arg
1 5 10 15
Ala Arg Ile Arg Asp Ala Ala Leu Arg Tyr Phe Thr Glu Tyr Gly Phe
20 25 30
Thr Gln Ala Thr Ile Arg Glu Ile Ala Arg Ala Ala Gly Val Ser Pro
35 40 45
Gly Leu Val Arg His His Phe Gly Ser Lys Glu Ala Leu Arg Gln Ala
50 55 60
Cys Asp Ala Tyr Ala Leu Glu Ala Leu His Gly Phe Tyr Asp Gln Thr
65 70 75 80
Leu Asn Gly Asp Arg Leu Ser His Pro Arg Ser Thr Met Asp Ala Arg
85 90 95
His Thr Leu His Pro Phe Gln Arg Tyr Leu Ala Arg Ala Leu Ile Asp
100 105 110
Glu Ser Glu Ala Ala Ala Arg Ile Phe Asp Asp Leu Val Ala Met Thr
115 120 125
Glu Arg Ser Leu Ile Gln Ala Asp Arg Gln Arg Pro Asp Pro Pro Ile
130 135 140
Ala Asp Ala Ala Ser Arg Ala Ala Leu Met Val Ala Met Ala Leu Gly
145 150 155 160
Ile Pro Thr Phe His Gln His Ile Ser Arg Ala Val Gly Ala Asp Ile
165 170 175
Phe Ser Asp Glu Gly Asp Arg Arg Val Ala Leu Ala Leu Leu Asp Val
180 185 190
His Ser His Ser Val Ile Ser Pro Glu Thr Ala Ala Ser Leu Arg Asp
195 200 205
Gly Leu Asp Glu Pro Gly Ser Gly Ser Gly Asn Arg Ala Pro Phe Thr
210 215 220
His Ser Gln Glu Glu His Arg His Glu
225 230
<210> 35
<211> 306
<212> PRT
<213> Artificial
<220>
<223> madT1基因编码的氨基酸序列
<400> 35
Met Ser Asp Ala Ile Ser Ala Lys Gly Leu Val Lys Lys Phe Gly Pro
1 5 10 15
Thr Arg Ala Leu Asp Gly Leu Asp Leu Thr Val Arg Thr Gly Glu Val
20 25 30
His Gly Phe Leu Gly Pro Asn Gly Ala Gly Lys Ser Thr Thr Ile Arg
35 40 45
Val Leu Leu Gly Ile Leu His Ala Asp Ala Gly Glu Val Gly Leu Leu
50 55 60
Asp Gly Asp Pro Trp Gln Asp Ala Thr Ala Leu His Arg Arg Leu Ala
65 70 75 80
Tyr Val Pro Gly Asp Val Asn Leu Trp Pro Gly Leu Ser Gly Gly Glu
85 90 95
Val Ile Asp Leu Leu Gly Arg Leu Arg Gly Gly Val Asp Lys Arg Arg
100 105 110
Lys Thr Glu Leu Leu Glu Arg Phe Gln Leu Asp Pro Arg Lys Lys Thr
115 120 125
Arg Thr Tyr Ser Lys Gly Asn Arg Gln Lys Val Ala Leu Val Ala Ala
130 135 140
Leu Ala Ser Asp Val Glu Leu Leu Leu Leu Asp Glu Pro Thr Ser Gly
145 150 155 160
Leu Asp Pro Leu Met Glu Glu Thr Phe Arg Glu Ala Ile Ala Glu Glu
165 170 175
Arg Ser Ala Gly Arg Thr Val Leu Leu Ser Ser His Ile Leu Ser Glu
180 185 190
Val Glu Ala Leu Cys Asp Arg Val Ser Ile Ile Arg Glu Gly Arg Val
195 200 205
Val Glu Thr Gly Thr Leu Asp Glu Met Arg His Leu Thr Arg Thr Ser
210 215 220
Ile Asp Ala Glu Leu Met Arg Pro Pro Ser Gly Leu Ser Asp Val Pro
225 230 235 240
Gly Val His Glu Leu Lys Ile Asp Gly Met His Val Ser Cys Glu Val
245 250 255
Glu Thr Ala Ser Leu Asp Glu Ala Leu Arg Gln Leu Thr Gly Ala Gly
260 265 270
Val Arg Asn Leu Val Ser Arg Pro Pro Thr Leu Glu Glu Leu Phe Leu
275 280 285
Arg His Tyr Thr Ala Asp Gly Gly Ala Arg Asp Ala Asn Ala Gly Val
290 295 300
Ala Arg
305
<210> 36
<211> 533
<212> PRT
<213> Artificial
<220>
<223> madT2基因编码的氨基酸序列
<400> 36
Val Asn Pro Leu Thr Gly Thr Gly Lys Leu Val Arg Leu Ile Leu Arg
1 5 10 15
Arg Asp Arg Trp Leu Leu Pro Ala Trp Met Phe Trp Val Val Val Ile
20 25 30
Pro Leu Ser Tyr Val Ser Thr Tyr Lys Asp Leu Tyr Pro Thr Ala Ala
35 40 45
Asp Arg Ala Glu Tyr Ala Ala Thr Ser Gly Ser Asn Pro Thr Phe Leu
50 55 60
Ala Leu Tyr Gly Pro Leu Pro Asp Val Ser Leu Gly Gly Ile Val Ala
65 70 75 80
Gln Arg Ala Gly Phe Ile Pro Ile Phe Val Gly Leu Val Ser Ile Leu
85 90 95
Thr Val Ile Arg His Thr Arg Thr Asp Glu Gln Ala Gly Arg Arg Glu
100 105 110
Leu Leu Ala Ala Thr Val Val Gly Arg His Ala Gln Leu Ala Ala Ala
115 120 125
Leu Ile Val Thr Met Thr Ala Asn Leu Val Val Ala Val Leu Leu Gly
130 135 140
Leu Gly Leu Ser Gly Gln Leu Pro Ala Gly Gly Ala Phe Ala Ile Gly
145 150 155 160
Leu Gly Phe Ala Val Ala Gly Cys Thr Phe Ala Ala Val Ala Gly Val
165 170 175
Ala Ala Gln Leu Thr Glu Tyr Ala Gly Ser Ala Arg Gly Leu Ser Ile
180 185 190
Ala Ala Leu Gly Gly Phe Phe Leu Val Arg Leu Ala Ala Asp Thr Gly
195 200 205
Gly Asp Asn Ser Gly Leu Thr Trp Leu Ser Trp Leu Ser Pro Leu Gly
210 215 220
Trp Val Thr Leu Leu Arg Pro Phe Ala Asp Glu Arg Trp Trp Val Leu
225 230 235 240
Ala Leu Ala Val Ala Phe Ala Ala Ala Ala Val Ala Val Ala Val Ala
245 250 255
Phe Ser Ser Arg Arg Asp Ile Gly Ala Gly Ile Leu Ala Ala Arg Leu
260 265 270
Gly Pro Ala Asp Ala Ala Pro Ser Leu Ser Asn Pro Ile Val Leu Ala
275 280 285
Trp Arg Leu Gln Arg Ser Leu Leu Leu Ala Trp Thr Val Ala Met Ile
290 295 300
Val Leu Gly Gly Val Phe Gly Gly Ile Ala Gln Gly Val Gly Asp Leu
305 310 315 320
Leu Glu Asp Asn Asp Thr Leu Lys Asp Met Phe Leu Arg Ile Gly Gly
325 330 335
Gln Gln Gly Leu Ile Asp Ala Tyr Leu Ala Ser Ile Met Gly Thr Val
340 345 350
Gly Leu Val Ala Ser Ala Tyr Gly Ile Gln Ala Ala Leu Arg Leu Arg
355 360 365
Thr Glu Glu Gln Ala Met His Gly Glu Tyr Val Leu Ala Thr Ser Val
370 375 380
Ser Arg Ala Lys Trp Val Ala Ser His Leu Leu Phe Ala Ala Val Gly
385 390 395 400
Pro Ala Ile Ala Leu Ala Ala Ala Gly Val Thr Thr Gly Leu Val His
405 410 415
Gly Ala Asp Ile Gly Asp Val Gly Gly Gln Val Pro Arg Ile Leu Gly
420 425 430
Gly Ala Met Val Gln Leu Pro Ala Val Trp Thr Leu Thr Gly Leu Ala
435 440 445
Leu Ala Leu Phe Gly Ile Leu Pro Arg Leu Ser Leu Ala Val Trp Gly
450 455 460
Ala Leu Ala Leu Phe Phe Leu Leu Gly Gln Leu Gly Glu Ala Met Gln
465 470 475 480
Leu Asp Gln Ala Leu Met Asp Leu Ser Pro Phe Ser His Thr Pro Arg
485 490 495
Ile Pro Gly Gly His Phe Ser Ala Met Pro Val Val Trp Leu Leu Ala
500 505 510
Ile Ser Leu Ala Leu Thr Leu Val Gly Phe Val Gly Ala Arg Arg Arg
515 520 525
Asp Ile Gly Thr Ala
530
<210> 37
<211> 310
<212> PRT
<213> Artificial
<220>
<223> madT3基因编码的氨基酸序列
<400> 37
Val Thr Arg Ala Ile Ala Ala Thr Gly Leu Val Lys Ser Phe Gly Gly
1 5 10 15
Ala Arg Val Leu Asp Gly Leu Asp Leu Thr Val His Arg Gly Glu Val
20 25 30
His Gly Leu Leu Gly Pro Asn Gly Ala Gly Lys Thr Thr Leu Ile Arg
35 40 45
Ile Leu Leu Gly Leu Thr Arg Ala Asp Gly Gly Thr Ala Arg Leu Leu
50 55 60
Gly Gly Asp Pro Trp Ala Asp Ala Pro Ala Leu His Arg His Val Ala
65 70 75 80
Tyr Val Pro Gly Asp Val Thr Leu Trp Pro Gly Leu Thr Gly Gly Glu
85 90 95
Val Ile Asp Val Ile Gly Ala Leu Arg Gly Gly Leu Asp Gly Asp Arg
100 105 110
Arg Asn Glu Leu Ile Gly Arg Phe Asp Leu Asp Pro Arg Glu Arg Cys
115 120 125
Arg Asn Tyr Pro Pro Ala Glu Arg Arg Lys Val Ala Leu Val Ala Ala
130 135 140
Leu Ala Ser Asp Ala Glu Leu Leu Val Leu Asp Glu Pro Ala Leu Gly
145 150 155 160
Leu Asp Pro Val Met Glu Arg Thr Phe Arg Arg Val Val Leu Glu Glu
165 170 175
Gln Ala Arg Ala Asn Arg Thr Phe Leu Leu Ser Ser His Ile Leu Ser
180 185 190
Glu Val Asp Ala Leu Cys Asp Gly Val Ser Ile Ile Arg Asp Gly Arg
195 200 205
Thr Val Glu Ser Gly Val Leu Ala Glu Met Arg Leu Gly Ala Arg Thr
210 215 220
Ser Ile Ser Ala Gln Leu Leu Gly Arg Pro Phe Ala Leu Ala His Met
225 230 235 240
Thr Gly Val His Asn Val Thr Val Arg Asn Asn His Val Glu Cys Asp
245 250 255
Val Asp His Asp Ser Ile Glu Lys Leu Met Ser Tyr Leu Ala Ser Val
260 265 270
Gly Ile Arg Asp Leu Val Ser Arg Glu Pro Thr Leu Glu Glu Leu Phe
275 280 285
Leu Arg His Tyr Gly Ala His Arg Pro Arg Arg Pro Arg Pro Arg Pro
290 295 300
Asp Arg Ala Pro Ala His
305 310
<210> 38
<211> 222
<212> PRT
<213> Artificial
<220>
<223> madU基因编码的氨基酸序列
<400> 38
Met Asn Met Arg Glu Ser Arg Arg Asn Ala Ala Gln Gln Gly Arg Ala
1 5 10 15
Ala Arg Gly Pro Arg Ala Arg Trp Ala Leu Ala Ala Thr Gly Ala Ala
20 25 30
Leu Val Val Leu Val Pro Gly Ala Val Ala Ala Val Ala Asp Thr Gly
35 40 45
Gly Gly Arg Ala Asp Gly Gly Arg Ala Asp Gly Ala Gly Arg Pro Pro
50 55 60
Gly His Gly Thr Lys Leu Pro Ala Pro Pro Gln Met Arg Ala Leu Asp
65 70 75 80
Phe Met Leu Gly Ser Tyr Lys Cys Met Asp Thr Pro Pro Gly Gly Gly
85 90 95
Glu Gln Ile Glu Asn Tyr Ile Thr Thr Lys Arg Asp Ile Gly Gly His
100 105 110
Tyr Leu Asp Ser Ala Met Val Thr Pro Asp Leu Val Val Gly Arg Arg
115 120 125
Val Phe Gly Trp Asn Pro Val Asp Arg Arg Phe Met Ser Glu Tyr His
130 135 140
Asp Asp Trp Gly Val Gln Gly Asn Ser Tyr Ser Glu Gly Trp Lys Asn
145 150 155 160
Gly His Leu Ile Phe Thr Gly Thr Val Lys Leu Val Lys Lys Pro Ser
165 170 175
Pro Thr Gly Asn Ala Glu Gly Val Glu Val Asn Val Lys Asp Asp Met
180 185 190
Val Ile Leu Ser Arg Asn His Tyr Thr Asn Thr Gln Thr Thr Ser Val
195 200 205
Pro Gly Gly Ile Ser Val Gln His Phe Tyr Asp Cys Arg Lys
210 215 220
<210> 39
<211> 121610
<212> DNA
<213> Artificial
<220>
<223> 链霉菌的生物合成基因簇的核苷酸序列
<400> 39
ccggggcggc ccggaacacg gcggcccgga acacggcggg cctggacgcg gcgggccaga 60
cgcggcgcgg gcctggccgg ggcgggccgc gctactcggc gatcttggtg cgcatcgtcc 120
agtggaacgt gacgggggcg gcgggctcgt ccacgagctc gaagtgctcg tagccgcccc 180
ggtaggggat cttgagcttg cgggcggcga gggtggagcg ctcggcgcgg atcacggcgg 240
gcacgccgtc ggggccgcct tcgagcgtga tgtcgatcgt ctgtaagcgg tcgttcctct 300
gtaagcggtc gttcctcatg ttccggttcc tccgggtcgg gggcggccgg ggccggccgc 360
cgggtctcgg acgggggacg cggcgtcgcg ccgcagcgct gaccgaaggt cgaagcgtcg 420
cacgttgagc ttgcggtaca ccttcgtgag gtgctgctcc accgtgctga ccgtgatgta 480
gagcttgcgg gagatctccc ggttggtgta gccgtcggcg gcgagcgcgg cgaccctgcg 540
ctcggcgtcg ctcagttcga gcgtgctcgg gcagggcggc tcctcggtgg tgcgctccag 600
aattccgcac ttgcgcgcta gggaaaggga cctgcgcgac agcagcgagg cgcgttcggg 660
ctcgccgagg atctcgtagg cgcggctctg gtcggcgagc gcccgcgaca gttcgagctg 720
gtcgccgcag ctctggagca tctccacggc ctggccgagc agggcggggc gcgactccgg 780
ggggctcgtc gccgcctgcg cgcgcagcga catcccgcgc gcccgcaggt gctcgggctc 840
caggagcgcg agctgctcgt cgacgagctc cctcgcctgg gcggcgtcgc ccaggcgcag 900
gtgggcctcg gcggcgccga tccgccaggg ctccacggcc ggcaggtcga tgccccagtc 960
gttcatcagc ttgccgcagg cgcgaaagtc gcaggcggcg gcgtagaagc ggccctccgc 1020
caggtggtgc cgcccccgcg cgtacaggta gtgcagcccc atccgcgtct ggaacatcgt 1080
ctgcgggacg gggatgtcga gcagggtggc cgcgcgctcg accttgccca tggccgtcgt 1140
ggcgagcacc atcgcggcga gcggaagccc gaccgcgacg ccccagctct tcggcgggat 1200
gaggctcagc gcggcgtcgg cgtgccgttc ggccgcgatc acgttgccct gccggacgtg 1260
gatcagggcg cgttcggccg cgaacagcgc gtgccacgtc gggctgcgcc gggcgaccgc 1320
ctcgaccagg agggagtcgc accaggaggc cgcctgctcc agccggtcgt tgtacatcag 1380
caccgcgagc gccgtcgtca gcggcgccag cgtgcggttg ctgaggcgcg agcgctggag 1440
gatgccctcg gccgtggtga gggcgtcgct ctcgttctcg tcgaccagct cggccgccag 1500
catggtggcc tcctggaggt cggggagcgc tccggtcggc tcgggctcgt ggtgcgcggg 1560
caccggctcg gccttgagga cgcccgggta gaggtaggag agccacagcc gcggggtgtc 1620
gaggtcgacg gacgcggcgg tgccggactc ggcggcctcg tcctgcgtcc ggacgaggtc 1680
gtccacgatg ccgagcgcct cgcccacccg cccgtgccag agcaggtagg tgatcggggt 1740
gaacgtgtcg cacagctcgg gccgtccggt gcgggcgggc gcggcgaagt cggggaggtg 1800
gcgcaggacg gccgaggggt ccacccgcca ttcggcgtcg gccagggcgg acgcgatctc 1860
cggccggagc cgctcgtcgt cgcacagccg gtaggcggcg cgcaggtggc ggatcgcggg 1920
gcggacgtcg tcgttggcga gcgcgtgctc ggcggcctcg cgcagcagcg ggacgaccca 1980
cggcgcctcg atccgtccgg cggcgacgat gtgcgcggcg acgtcggtgg gggcggcgcc 2040
ctcgttgtgc agcaggacgg cggcgctgcc gtgcatggcc atccgctcgc cggacggggt 2100
ggcgtcgagg atcgcggccc gtacgatctc gctccggaag tcgccgttct ccaggatccg 2160
cgccttggtg agcgcgttga ccgcctgggc ggtggactcg gccgcgaggt ccagcagccg 2220
gccgagcagc gtggtcgagg ccgccttccc gaggatcgcg atggcgcggg cgacctcgca 2280
gaccggcggc tcgtaccggt agaggaacgt catcaccgcg cggctgaact ccatgccgga 2340
gccgagctcg ccgcgccccg cggccgccgc ggcgcggtgg tcctcgatca gcgcctgcac 2400
gagcgccgga tatccggagg tcgcctcgtg gaacgcgggc gtcaggtcct gcgccgccgc 2460
gtcgccgagg ctcctgcgca gcagccgcgc gacgcccgcc ttcgtcaacg gctcgacgcg 2520
gatctggcgg aagaacggct cgcggaggaa ctcgctgcgc agcagcgcgt tcgccgcctc 2580
caggtcgcgg cactcgctga agatcacgtg gaggcgcgac gagcgcagcc tgcgcagcag 2640
gtacagcagg cagcgcagcg acgtctcgtc ggcgtagtgc acgtcgtcca cggcgatcac 2700
gaccggctgc cgttcggcga ggccgcgcag cacctcgcac agcccgcgca ggacgaccga 2760
ggtcacctgg cccgagccgc cgggaccggc ggcggaggtg ttgagggcgc cggcgtcgag 2820
ccagcgcatg gcgcgctcgg cgtcgccgga gggcaggtcg gggttgcgga agagctgttc 2880
gaggacgccc atcggcaggc cgcgctcgat gcgcgacgcg acggcgctca ggaacagggc 2940
gccgtgctcg acggcccgtt ccccgaacgc gcgcagcagg gaggtcttgc cggtcgccgt 3000
gcccccgctg accaccacca ccctgcttcg cgcgcgtggg agctcggcga acaggccgtc 3060
gagagcggcg agttcctcgt ctcgttcgac caggctcaca ggcaccccta tgatgatcgc 3120
tcaattgcgc cgacccgcgt ggcggtcgcg ggtgtgatcg agacaggcgg cgccccgggc 3180
gccggattcg aacgaatact tgcacccgtt ccgaacaagt gcaacgaacg gcgtgttgat 3240
ggttcatttc ggacggtgat gatcgcgcgc ctgacctgcg gggccgtggc cgcccgggcg 3300
gccggtgatc catctctgca cgcacgggga cttgccaggc cgcgggcgat cggttatatg 3360
gcgggaatgg tcttgatcga tcacgcgata agcccgcccc gcgcgggcgg attaggggtg 3420
ggcgggcaca ggttaggagg ggtaaggggt tgcgcccgga cggcccggag tatgtaaatc 3480
gtgcttaaag acaatgttaa cggaacgtat ggggcgggcg gcgtccgccg tcaccctcgg 3540
cgcaattgcc cggagccgcc gccaccgggc cgcgggcacc cctagaaggc tgcgccgcag 3600
aaccgaaaag ggcgtgaccg ttattcggcg ccggccgccg gcgggtccgg accgtccggc 3660
cgaccgctgg tcagcgggct tccagggcat cgcggagacg ccccgttcgc gcgcgggcgg 3720
ccggcggcgg gtccgcccgc cggtcggtga tcttcggaat gtaggggagc cgcccggccg 3780
ttaggggatg acccgcgcga cgcggtccgc ttagattgtg cattgaacgg gcgggtgctc 3840
cccgccccgg gccccggtag cccaattctg gtagccaggc gacggactta aaatccgtag 3900
agtgtgggtt cgagtcccac ccggggcacg gcggtgaacg gccgcgaacg gccggaaggc 3960
accgggccga aggctaccgg tctccggccg cctgccttag cgggccgctt ccgcagcaat 4020
ccagaacggc ctccgaagcg ggcagcacca cctcgatgtg ctggagcgcg gatttcaggt 4080
cgaagcgttt cacgttgagc ttccggtaga ccttcgtgag gtgctgctcc accgtgctga 4140
tggtgatgta aagcttcttg gatatttccc ggttggtgta tccgctggcg gccagaacgg 4200
ccaccctgcg ctcggcgtcg ctcagcttca ggaaccgttc ggggtcggcc gccgccacgc 4260
ctccggcgtc ccgtccggcg ctcccggcga gaccggcggc gtgcccggcc cccatccggt 4320
ggcggccggc caggtcgtcc ggccggacga cgccgcccgg gacgtccggg cgctcccctt 4380
cgcgggccgc gggccgcccg cactgccgcg cgagcagctc ggcctggtgc gcgagctccc 4440
gcgcgcagtc gtgctcgccg acccatcggc gcacctcggc gaggtcggcg agcgcgtacg 4500
acagctccag ccggtcgccg cagccctgga gcagctcgat cgcctcgccc agcaggtccg 4560
gccgttccgg cacgggatgt gtcgcggcct tgacgcgcag ggcgacgccg cgcgtccgga 4620
cgtgctcggg cgcggacctc agcagctgct cctccatcag cgccctcgcc tccgccaggt 4680
cgccgagcgc gatcagggcc tgcgcggcgc cgagccgcca cggctcgacc tcgggcagga 4740
cgacggccca gccctcgcgc atcgttccgc aggtgcggaa gtcggacagc gcggcgtgga 4800
gccggttggc ggccaggtgg tgccggccgc gcgcgtagag gtagtgcggg ccgagcggcg 4860
tgcggaacat cgcctcgggc accgtgatgc tgagcaggtc ggcggcctcg cgcggacgtc 4920
ccatggccgt cttcgccagg atcatgcacg acagcggcag cccgacgagg acgccccagc 4980
tcttcgggga gatgaggttg agcgcggtgt cggcgtgccg ttcggccgcc gggaggtcgc 5040
cctgccgcag atggatcagc gcgcgcagcg cggtgaacag cgcgagccag gtcgggctgc 5100
gccgcgccgt cgcctcggcc agcagcgcgt cgcaccagac cgccgcccgg cccggccggt 5160
cgttgtagat cagggtggcc agcgcgatcg cgagcgaggc gaacgtccga tggttcaagc 5220
gggagcgttc catgacccgc tccgccgcga ccagcgcccc gcgcacgtcc ctgcggtgca 5280
gctcggccac gagcatcgcc gccgcgtcct gcccgggacc gggcgccgcc gccgggaccg 5340
ggccgagggc gggggcggtg tccgccgccg ggccgtcctt cgccagctcg ggatacaggt 5400
aggccaggca gaaccgcggg gggatgacgt ccatcgccgt cccggcgtcg cccggcagcg 5460
cggcggaggg ccggccgcgc ccgagcgcgt cgagcatccg gacggcctcg gagacccgcc 5520
cgtgccagag cagcgacatg aacggcgtgc cggcgtcccg cccgtcgagc aggccgtcct 5580
ggatcgccag gctgaactcg ggcacccacc ggagcgcgac cgagggatcg acccgccact 5640
ccaggtcggc gagccgcgcg gcgatcgccg agcgctcccg ccggtcgcgg cagatctggt 5700
gggcgacgcg caggtagcgg atgcccgttc cgaggtcgtc gctcgtcagc gcctgctcgg 5760
cggcctcccg caggacgggg atcgcccacg gcgcgccgtg ccggttggac gcgacgatgt 5820
gggcggcgac gtcggtcgcg gcggcgccgt cggagtgcag ctgctcggcg gcgcgctcgt 5880
gcaggacgcc gggggcgtcg ggcggggcgc tgtcgacgac cgccgcctgg gcggcgccgt 5940
gccggaagct gccggcgtcg aggatccccg cggaggtcag cgtgccgacg gcgcgggcgg 6000
cggtctcggc gtcgatgtcg agcagccggc cgatcaggct cggcgggacg ggctcgttga 6060
ggatgccgat ggcgcgggcg acctccagga ccgggaactc gtgccgctgg agcagcccga 6120
cgaccgcgcg cccgaacgcg ccgcccgcga ccagctcggg cgcgatgaac tccggccccg 6180
cgaggtagtc gtcgacgagc gcgtgcacca gcgccgggtt gcccgcgctc gcctcgtgga 6240
acgcgggcgc gaggagccgc gcggtctggg cgtcgaggtc cagccggatc agcgcttcca 6300
cgccgtccgg ggaaaggggg ccgagcctga tctggcggga atgcggttcg cggaggaatt 6360
cgctttggaa cggcgcgttc ggcgcctggg aacgggggct cacggtgagc accgtgagca 6420
ggcgcgccga acgcagccgg cgcagcaggt acagcaggca ctgcaacgag atctcgtcgg 6480
cgtggtgcac gtcgtcgacg gagatcacca cgggccggcg ttcggtcagg tcgcgcagga 6540
cgtcccagag gccgcgaaag cgcatgggat aggtcaacgc acgcgcgttg agcagcttca 6600
tcgcgcgctc ggcgatcgcc tcgggaacct ccgggctgcg aaagagctgt tcgagcacgc 6660
ccatcggcag cgccgtctcc accgacgagg cgaccgcgtt gagaacgacc gcgccggact 6720
cggcggcctg cctttcaaag gcgtccagaa gggagctctt accggtggcg acgtccccgc 6780
tgatgatcgc cagtctgctg cgcgcctcca ggcagtcccc gaacagcccg tggaggacgg 6840
cgacttccgc gttccgttct accaggctca cgaactcccc ctgagctgca cactctcttc 6900
cccgtttgcg gcaccccgcc gcattgagtc ccaggtcacc cgtaccccgt ggtcgtctac 6960
cgggcagaat ctagtcgcgg tcctgtcggc gggcaacggg cattgcgtaa tcggaccgtt 7020
tcacagggtc gctaatcacg tcctgacctg cgcgtgcagc acccttgtga gataggttat 7080
acatctgtat acttcgggga agcgagggga gatgtgcgtc ccgcagaaag atttcgttaa 7140
ttttctgtga tgggactgga gtatcatctc cggaaccgcg atgctcgaaa ttcgccacac 7200
cgcagaccgg ccgccgcgag gcgggcccga acgccgcgaa cccctcccga agcccctctg 7260
gaaggccctc ggcgcgtgcc gcgcgccggc ctcgcagtcg gtgatctttg ctcggccgtt 7320
cgcgggcccg cgaccatttt aggagacccg cctaaaggcc cgccaacgca tagggcatgg 7380
cccttgcatt ccgcccgagt gggccggatg atcactctgc cgttcaccgt gctgagggtc 7440
gcgagaatac gttccgaacg gtgcgtactg cactctaggg gcggtcggag aagacggcgt 7500
tcggtcgatt aggggcccct taacaaccgg ccagtcataa ggtcgaaaag gcggttgacc 7560
aggaggcatt acgttccgca tcgttccgga tacgcatcgg gccttgctgg tgatccggcc 7620
gcggagcacc cttggatcgg cctcctttgg cccgccggcg caccggcgac gaccccgatg 7680
gaacggaacg gagagacggt ggccgaactg aaatccggga gggcgcgcga ggtggccgca 7740
ttcctccgcc cccggccgac cacggcgtta tcccgcgaaa cctcgacatc ccgcgaaccc 7800
tcgacggcgg gtgcgcgatg aagggcgtca tcctcgcggg cggaaagggc acgcggctcc 7860
agcccgtcac cgccgtcggc tccaagcagc tcatgcccat ctacgacaag ccgatggtct 7920
actacccgct gtccatcctc atgttcgccg gcatccgcga cgtgctcgtc atctgccgcc 7980
cgtccgaggt ggccgtcttc cgcggcctgt tcggcgacgg gcggcggctc gggatgaaca 8040
tcgactacgc cgcccaggac gagccgcgcg ggatccccga cgcgttcctc gtcggcgccg 8100
accacatcgg cggcgacgac tgcgcgctga tcctcggcga caacctcttc cacggctcgg 8160
gcctgccgtc cctgctccgc cagagcgccg agcggatgga cggctgcgtc ctgttcggcc 8220
accaggtctc cgatcccgag cgctacggcg tcgccgagat cgacgagcgg ggccggctgg 8280
tgtccatcga ggagaagccg tccgagcccc gctccaacct ggcgatcccc ggcctgtact 8340
tcttcgacaa caaggtcgtc gagatcgccc gcggcctcgc cccgtccgcc cgcggcgagc 8400
tggagatcac cgacgtgctc cgcgcctacc tggaggcggg ccgcgccgac ctggtctggc 8460
tcggccgcgg cgtcacctgg ctcgacaccg gcacgcacga gacgctgctc gaggcgggct 8520
gcttcgtgcg cgacgtccac cagcggcagg gcgcgcggat cggctgcgtc gaggagatcg 8580
cgctctacat ggggttcatc ggcccggacg agtgctacga gctgggcgcg gagatgggca 8640
actcgcccta cggccggtac gtgatggacc aggcccgctt ctacgaccgc gtccgcgacc 8700
tgctggactg gcgcgccacg ttcctggagg gcgcatgaac ctgctggtca ccggcgccgc 8760
cgggttcatc ggctccacct acgtgcggat gctgctcgct cccgccgccc cggaccccgc 8820
cgccccggac ccggttgcgc ccgacccgga tgcggccgac ccggacgcgt ccgaggagat 8880
ccgggtgacg gtcctggaca agctcaccta cgccggcacg ctcgacaacc tgacgctcag 8940
ccatccgcgg ctccgcttcg tctacggcga catctgcgac gaggacctcg tcggcaagct 9000
gatgtccgag gccgaccagg tcgtgcactt cgccgccgag tcccacgtgg acaactcgat 9060
cgccgccgcc ggcgacttcg tccgcaccaa cgtggtcggc acccacacgc tgctggagca 9120
ggcgctcgct catggcgtgc ggcggttcgt ccacgtctcg acggacgagg tgtacggctc 9180
ggtcgagacg ggccgctcgc acgagaccga cgcgctcgac cccaactccc cgtacgcggc 9240
gtccaaggcc gcctccgacc tgctcgcgct cgcctaccac cgcacccacg ggctcgacgt 9300
gcgcgtcacc cggtgctgca acaactacgg gccccggcag ttccccgaga agctcatccc 9360
gctgttcatc acgcgcctgc tcgacggcca gaaggtcccc ctgtacggcg acgggcagca 9420
caagcgcgac tggctccacg tcgacgacca ctgccgcgcg atcgagctcg tccgcaccgc 9480
gggcgcgccg ggggaggtct acaacatctc cggccacacc gagctgacca accaccagat 9540
caccggcctg ctcctggagg cggtcggcgg cgactgggac caggtggcct acgtgaccga 9600
ccgcaaggcc cacgaccgcc gctacgccgt ccacgacgac aaggcccgca cccgcctcgg 9660
ctaccgcccg cgccgcgact tcccgcgcgg cctggccgag accgtcgcct ggtaccgcca 9720
caaccgctcc tggtgggagc ccctgaaacg ccgagcgggc ctctgacccc gccgaagcgc 9780
cgcccgcccc gccccgtgcc ccatccgagc gatggggcac acccatgtgc ggggggaggg 9840
gggcgtcgcg gacggacgga cgaacgaacg gccgagccgg ggcggcgggg gcgcccgtgg 9900
catcggccgt gtcgtcgttc gtggcggtag gcgggtaggc gggtaggcgg ggtaggtggg 9960
gacgggagta cgtggagcgg gcgggtcagg cggggatggg caggtccagc cggtggtccg 10020
aggtgaggat ggaccgctgg aggttctgga ggtcgcggca cggttcgagc ccgagctcct 10080
cgttgagcgt cctgcgcgcg gactggtagg cgtggagcgc gtccgcgcgg cggcccgagc 10140
ggtagagcgc cagcatgagc tggcggtgga ggacctcccg cagcgggtgc tcggtggtga 10200
gctggtagag gtcgccgacg agctcgcggt gccggccgag catcatgcgc gactccatga 10260
gcatctccac gcactccagc cgggtctccc ggagccaggt gtggaacccc tccaggacgg 10320
gcccgttgcc gaggtcgtcg ggcaggtggt cggcggagag gtcgagcgcc tcctcgaacg 10380
cggccgacgc ctggtcgtgg cggccctgcc tgaagtcgtt gcggccctgg ttcaccagca 10440
gctccaggcg gcgcaggtcc agctcgtcgg ccccgagccg cagcaggtag ccgggcggcc 10500
gggtgaggac ggggtcctcg gcgccgggcc ggtgcaggaa cttgcggatc tgggagatgt 10560
agacgtgcag cccggcggtg gcccggcggg gcggcgcctc cccccagatc tcggtgatga 10620
gctggtcgac cggcaccacc tggtccgcgc ggaccagcag caccgtcagc aggacccgga 10680
tcttccgcgc ccggatggtg tgcttcccgt cctcgtcgac cacctggaac gggcccaaga 10740
tctcatagcg cacggttcac ccccgtcgat aagcgtgcga acggacggcg tcctaacgga 10800
cgccggcttc cttccactga cgccatggct cccccctttc gtgcgggcgg acgcgccgcc 10860
gaggcggcgg acggacgggc cggggtcccg gcggatcacc gccgggccgg gcgcgtccga 10920
tcactaccca ctcaacttaa gaaccggcca aggcgcgctg aacccctgtc ccgtgcccgc 10980
cgaccccagt cgagaccacc gttcccccta cgggctcgcg ggccgggccg cgcgggcgcg 11040
cgacgctcag ggcgtccgcc ggggcggtgc gggcgcgccg tacggtccgg cgggcgcccg 11100
ccccaggggg atctaggggt gggggtgagg tgcggggcgc ctccggcagc ggcgctggtc 11160
gtgcgccgct gccggaggta gcggcccgcc cggggcggcg gtccgctctc ggggggatcg 11220
ggtggtccgc gttgacgggg gtgaggcgcg gcccatccgg gtccccgggt tcaaggggtg 11280
aacggttccg gcggccgtcg ggggcgcggc cgcggtcggt ggtcggggtc agcgggcggg 11340
gtgggtggtc ggggtcagtg ggcgggggct ctgtcggggc gcgggcgcgg gcgccggggg 11400
cggtgcgcgc cgtagtggcg gaggaacagc tcctcaaggg tcggttcgcg gctgacgagg 11460
tcgcggatgc cgacggacgc gaggtagctc atgagcttct cgatgctgtc gtggtccaca 11520
tcgcattcga cgtggttgtt ccggacggtc acgttgtgta cgccggtcat gtgggccagg 11580
gcgaacggcc ggccgaggag ctgggcggag atcgaggtgc gggcgccgag ccgcatctcg 11640
gcgaggacgc cgctctcgac cgtgcggccg tcgcggatga tgctgacccc gtcgcacagg 11700
gcgtccacct ccgacaggat gtggctcgac agcaggaacg tgcggttggc gcgggcctgc 11760
tcctccagga cgacgcgccg gaacgtcctc tccatcaccg ggtcgaggcc gagggccggc 11820
tcgtcgagca cgagcagctc ggcgtcggag gccagcgcgg cgaccagcgc gaccttccgc 11880
cgctcggcgg gcgggtagtt gcggcaccgc tccctcgggt ccaggtcgaa gcgcccgatc 11940
agctcgttcc tgcggtcgcc gtcgaggccg ccgcgcagcg cgccgatcac gtcgatcacc 12000
tcgccgccgg tgaggccggg ccacagggtg acgtcgccgg gcacgtacgc gacgtgccgg 12060
tgcagcgcgg gcgcgtccgc ccaggggtcg ccgccgagga gccgggccgt tccgccgtcg 12120
gcgcgggtca ggccgagcag gattcggatg agcgtcgtct tgcccgcccc gttcggcccg 12180
aggaggccgt gcacctcgcc ccggtgcacg gtcaggtcga ggccgtccag cacgcgcgcg 12240
ccgccgaacg acttcaccag cccggtcgcc gcgatggctc tggtcactct actgtccacg 12300
tgtctcgttt gctattcatc tgtatagccg agaatacgtg cgtattgtcc cgggggcaag 12360
cgggcctgcc caacgtattg gcggggcttt agagcggcga cccacgatgg gcggcgtgct 12420
cacaacccaa gcggggatgc cgtccgaccg cctccgttcc ccgggaacgg acgcggccgc 12480
ggacggcggc gcgggcgacg gcgcgcggcg gtggttcggc gggttccttg aggcggtgga 12540
cgggttcgac gccggtttct tcggcatcgc gccgcgcgag gccgtcgcga tggacccgca 12600
gcagcgcctg atgctcgaag tggcgtggga ggcgctggag gacgcgggga tcccgccgga 12660
gacgctcgcg ggcagccgta ccggcgtgtt cgtgggggcg atctgggacg agtacgcggc 12720
gatcctccga cggtccgggc cgggcacggg cgaacggcac gccatgaccg ggtcccaccg 12780
cagcatcatc gccaaccgcg tgtcctacgt ccacgggctg cgcggcccga gcatgaccat 12840
cgacaccgcc cagtcgtcct cgctggtcgc cgtgcacgcc gcgtgcgaga gcatccggaa 12900
cggggagtgc tcgctggcgc tcgccggcgg cgtcaacctc atcctcggcg aggacggcat 12960
ggaggccgcc gccgcgcagt tcggcgggct gtccccggac ggccgctgcc acaccttcga 13020
cgcccgcgcg aacggcttcg tccgcggcga gggcgccgcg acggtcgtcc tgaagccgct 13080
ggacgcggcg ctgatcgacg gcgacccggt gtactgcgtg atccgcggga gcgccgtcaa 13140
ccacgacggc gccgcggacg gcctgacggt gccgagcgcc gccgcgcagg aggacgtgct 13200
gcgcctggcg tgcgagcggg cccgcgtggc gccgggggac gtccagtacg tcgagctgca 13260
cggcaccggc accccggtcg gcgacccggt cgaggcggcg gcggtcggcg cggcgctggg 13320
cgcgtcccgc gcgcacggcg ccccgctgcg ggtcggctcc gccaagacga acgtcggcca 13380
cctcgaaggc gccgcgggga tcgtcgggct gctcaaggcg gtcctcagcg tcgcgcaccg 13440
cgagctcccg ccgagcctca acttcaccgc gccgaacccg cggatcccgc tcgcggagct 13500
gggcctgcgc gtccaggacg agctgaccgc gtggccgtgc ccggaccggc cgctgatcgc 13560
gggcgtcagc tcgttcggca tgggcgggac caactgccac gtggtcctca ccgaaccgcc 13620
cgcgaccgcc gcggaggtac ccgcgaccgg cggcgcgacg cgcgcggacg agcccggcac 13680
cgccgccggg aggccggcgg cgcccgtacc gtcgaacgag acgcccccga tcgttccctg 13740
gacggtttcg gggaagagcg cggcggcgct gcgagcccag gcggaccggc tgcgcgagca 13800
cgtggaggac cgtcccggcc tggacccggc ggacgtcggg cacgcgctgg gcgcgacccg 13860
ttccgcgttc gcccaccgcg cggtggtcct cggcggcggc gccgatgagc tgctcggacg 13920
gctgggcgcg ctgggacggg gcgaggacgc gcccggcgtc gtccacggga ccgtgaacgg 13980
tggcggcgtc gcggtgctgt tcagcggcca gggcagccag cgcccccgca tgggccgcga 14040
actgtacgac gcgcacccgg tgttcgccgc ggcgctggac gaggcgttcg cctgcctgga 14100
cccgatgctg gaccggcccc tgcgcgaggt ggtgtggagc ggcgacgccg cgccgctgga 14160
ccggaccgcc tacacgcagc cggcgctgtt cgccgtcgag gtcgccctgt accggctggc 14220
cgagtcgtgg ggcctgcgcc ccggccacct gatcggccac tcggtcggcg agatcgccgc 14280
cgcgcacgtc gccggggcgc tgtcgctgcc ggacgcgtgc acgctggtcg ccgcgcgcgg 14340
acggctcatg caggccgtca ccgcgcccgg cgcgatggcg gccctgcgcg cgacgccgga 14400
cgaggcggcc gagctcgtcg cgggccgcgg cggcctcggc gtcgccgccg tcaacggccc 14460
cgcctccgtc gtgatctccg gcgaccgcga cgccgtccgc gacgccgccg cggcctggcg 14520
cgaccgcggc gggaaggcca cgctcctgcg ggtcagccac gccttccact cgccccacat 14580
ggacgcgatg ctcgacgagc tgcgcgccgt cgcggcgggc ctggcgttcg ccgcgcccgc 14640
catcccggtc gtgtccaacg tgaccggcct ccccgcgacc gccgaacggc tcgcgtcccc 14700
ggactactgg gccgagcacg cgcgccgcgc cgtccgtttc atgccgggcg tgcggtacct 14760
gtgcggcgag ggcgtcacca cgttcgtgga gctgggcccg gacgggcaac tggcggccat 14820
ggcccgcgag tcgttccccg gcccgtccac cgagggaccg gacggccggc cgcgtccggc 14880
ggcccgcccg gtggcgctgg cggcgctgcg ccgcgaccgc cccgagaccg ccacgttcgc 14940
gacggcgatg gccgaggcgt acgtccgcgg cggcgtggac ctcgactggg accggatctt 15000
cgcgggacgc gcgcgcacgc gggtggacct gccgacgtac gcgttccagc gcgaacgcta 15060
ctggccgggc tccctgcccc gccccgaagc cgtcccggcc gggcagccgg ggccggggcc 15120
ggaggcgggt ccggttctgg aagccgagcc gccttcgccg ctggaacgcc tcgcggggct 15180
ctccgagccc gaacggcggc gcgccctgct ggacctggtc cggacgaacg cggccgtcgt 15240
cctcgggcac gcgtcgggca cggcgatcga cgcggacctg acgttcaagc agctcggctt 15300
cgactcgatg gcggccgggg agctcagcga acggctcggc gcggccaccg gcctgccgct 15360
gcccgccacg ctcacgttcg accacccgac cccgctggac gtggccgcgc acctgaaccg 15420
gcgcgccacc gcgactccgg acgggacgtc ccggcaggac gcgccggcgg ccggccggag 15480
cgcggcggac gacccgatcg ccgtcgtcgc catgagctgc cgctaccccg gcggcgccac 15540
gacgcccgaa gccctgtggg acctcgtcgc cgcgggacgc gacgtgatcg gcggcttccc 15600
ggacgaccgc ggctgggacc tggccgccct gttcgacccg gacacggacc gggcgggcac 15660
cacctacgcc cgggagggcg gcttcctgta cgacgcggcg gagttcgacg cggagttctt 15720
cgggatcagc ccccgcgagg cgcaggccat ggacccgcag caacgcctgc tgctggagac 15780
gtcctgggag gcgttcgaac gggcgggcat ccgcctcggc gcactgaagg gcagcccgac 15840
cggcgttttc gtcggcgcga ccgcccagga ctacggcccg cggctccatg aggcccccaa 15900
gggactcgac gggcacctgc tgacgggcgc gacgccgagc gtggcgtccg gccgggtggc 15960
cttcaccttc ggcctggtcg gacccgcggt gacggtggac acggcatgct cgtcgtccct 16020
ggtcgcgatg cacctggccg cgcaggcgct gcgccagggc gaatgcgacc tggccctggc 16080
cggcggagcg accgtgctcg ccgcccccgg catgttcacg ggtttctcgc gtcagcgcgg 16140
cctggctccg gacggccgct gcaagccgtt cgccgcggcg gccgacggaa cgggctgggc 16200
ggagggcgtc ggcctcgtac tgctggaacg cctgtcggac gccttgcgaa acggccgccg 16260
cgtcctggcc gtgatccgcg gctcggcggt caatcaggat ggtgcgagta atggtttgac 16320
ggctccgaat ggtccgtcgc aggagcgtgt gattcgtcag gcgttggcgt ctgcggggtt 16380
gtcgtcgtcg gatgtggatg cggtggaggc gcatgggacg gggacgacgc tgggtgatcc 16440
gatcgaggcg ggcgcgttgt tggcgacgta tgggcggggt cgttcgggtg gtcggccgtt 16500
gtggttgggg tcggtgaagt cgaatatcgg tcatacgcag gcggctgctg gtgtggctgg 16560
tgtgatcaag acggtgatgg cgatgcggaa cggttcgctt ccggcgtccc tgcatatcga 16620
tgagccgtcc ccgcacgtgg attggaagtc gggcgcggta cgtctgctga ccgagccgat 16680
ggaatggccg accgaagatg accgtccgcg tcgggccgcg gtgtcgtcat tcggcatttc 16740
tggaacgaac gcccacctga ttttggaagc acctcccgag ccggacagcg agcccgaacg 16800
acctgcgccg atggacgtcg gcccgctccc gtgggtgctg tcagcgagaa gcgaagaggc 16860
gttgcggggg caggcggcgg cgctggcgtc ccaggtgcgc gtgtcggatg acgcgccgtt 16920
ggatgtgggt tggtcgctgg cggcgtcccg ttcgaatttc gagcgtcgtg cggtggtcgt 16980
cgggaccgga cgcgatgaat tgctggccgg tttggaagct gtggcgagcg gtgaagtgcg 17040
ttctgttgcg ccttccagag tcggcgcggg cccggtgttg gtgttcccgg gtcagggttc 17100
gcagtgggcc ggtatgggtg cggggttgct ggattcctcg ccggtgttcg cggagcgtgt 17160
tgctgagtgt gagcgtgcgt tggctccgtt tgttgattgg tcgctgtcgg acgtcctgcg 17220
tggcgacggg gcggagttgg cgcgggttga tgtcgttcag ccggtgctgt gggcgacgat 17280
ggtttcgctg gcggctgtgt gggcgtctta tggtgtcgtt ccgtctgcgg tgattggtca 17340
ttcgcagggg gagatcgctg ccgcgtgtgt ggctggggcg ctgtcgctgg aggccggcgc 17400
caggatcgtg gcgttgcgga gtcgggcgtt gcggcggttg gccggtggcg gtgcgatggc 17460
ttcgcttggt gtcggtgtgg acgccgcgca ggagttcctg tctgcggatg tagcggtggc 17520
ggcggtgaac ggaccgtcgt ccacggtggt gtcgggcccg cctgagcagg tcaccgcagc 17580
ggtggcccgt gtggaagctg ccgggtatcg ggcgcggctc atcgacgttg attacgcctc 17640
gcatgggccg caggtcgatg agatcaccga cgaattgcat gaagtcctcg cgggcatcga 17700
gcccgcaccg acaacagaag tggcgttcta ttcgaccgtg tcgggtgggc ggatcgacac 17760
cacgagcatg gacaccggtt attgggtgac caatcttcgg gaacgggtgc ggttcgcgga 17820
tgccgttcgg gagttggcgg gcgacgggca tcgggtgttc atcgaggcga gcccgcatcc 17880
cgttctgacg ctcggcacgc aggagacgct ggaagaggcc gggaccgttg ccacggtggt 17940
gccgacgctg cgccgtgacc acggtggtcg gcggcagctc gcggacgcgc tggggcaggc 18000
gttcgcggcg ggggtggagg tcgactggtc gcggtggttc accggtgatc gggctccgcg 18060
gcttgtcgag ttgcctacgt acgcgttcca gcgcaggcgc tactggctgg acggccggag 18120
cgggcccggc ggcgatccgg agagcctggg gctggcgtcg gccgggcatc ggctgctggg 18180
cgcggccgtc gagcacgcgg gcgcggacgc gtggctgctc acgggacggc tctcgacgcg 18240
gacgcagccg tggctggcgc agcactcggt cgccgggacg gtcgtcgtgc ccggtgcggc 18300
gctggtcgag tgggctttgc gggcggcgga cgaagcgggt tgcggcggag tggacgagct 18360
gaccccgcgg gttccgctgg tgctggccgg tccggagccc gtacgggtcc aggtggccgt 18420
ggacgcggcg gcggacgacg ggactcgcga cgtacgggtg tactcccggc ccgacggcga 18480
cgaggaagcg gaatggacct gtcatgcgga aggcgtcctc tcagcggggt cgcctgagcc 18540
cgcgcccgag cccgagccgg gtggcggcgg ggcgtggcct ccgccgggcg ccgagccgct 18600
ggccgccgat gacgtctacg aacggttggc ggaggcggga tacgactacg ggcaggcgtt 18660
ccagggcgtg cgggcgctgt ggcggcaggg cgcggacctg ctcgcggacc tcgtgctgcc 18720
cgaggcggcc ggggggccgg gagggttcgg gatccatccc gcgctgctgg acgccgcgtt 18780
gcatccgtcg ctgctcgacg ccggtccggc cggggacggc cgggtgtggc agccgttcgc 18840
ttggagcggg gtgtcgctgt gggcggcgga ggcgaccgcg gtacgggtcc ggctgacccc 18900
gcaccacggg gagcagggcg tgcaggtggc cgtggcggtg acggacgcgc tcggcgggcc 18960
cgtcctggac gtcgcgtcgg tggtgatgcg cccggcggac gttcggtggc tccgcgaggg 19020
cggtcaccgc gccgggcggc gggcgtccac ggccgccggg ggcactacgc gtgtgcggcg 19080
tgcggtggcc gccggcggcg ggaacggttc gtccgtcgac tgggccgacc ggctggccgg 19140
gctcgccgag gacgaacggc gccggctggt gctcgacctg gtgcgtacac atgccgcgac 19200
cgtgctgggc catgccgagc ccgacgcggt tccgaccggg accagcttca aggagatggg 19260
gttcgagtcg ctgacggccg tcgagctgcg cgaccggctc gccgccgcga ccgggctgcg 19320
cctcccggcc gcgctggttt tccgttatcc gacgccggac ggcatcgccg atcacctggt 19380
ggaacgcctc gtcgcgaagg agcagacacc gcccaacgga cgcaacggcg gacgcgcggg 19440
cgaaggcgga aacggcggcg agcgcggaaa cgggaacggg caccgctccg tgacgggcgt 19500
cggttcggtc ctcggcgagc tggacaggct ggaggacacc ctggccgggt tcgttccgga 19560
gaacggcgac tccggcgcgg tcacggcccg gctcgaacgg ctgctgacca agtggaaggc 19620
ggcggcgcgg aacccggcgg gcgagcccag cgccgccgag cggctggagt ccgcgtccac 19680
ggatcaggtc cttgacttca tcgagaacga gcttggggtg tcctgaatgg cgaacgaaga 19740
gaagctggtc gactatctca agcgcgtctc cgcggagctg cacgacacgc gccgtcgcct 19800
gagcgaggtc gaggaccggg ccaacgagcc gatcgccgtc gtcggcatgg cgtgccggtt 19860
ccccggcgag gtccgttcgc ccgaggacct gtgggacctg ctcgtctccg gaaaggacgc 19920
catcggcgac ttccccgacg accgcggctg ggacctcgac ggcctctacc accccgatcc 19980
cgcgcatccc ggcaccagct ccacgcggca gggcggattc ctgtacgaca tcgaccggtt 20040
cgacgcggga ttcttcggga tcagcccgcg tgaggcgctc gccatggacc cgcagcagcg 20100
cgtcctgctg gagaccgcct gggagacgtt cgaacgggcc ggaatcgacc cgttctccct 20160
gaagggcacc tcgaccggcg tgtacgcggg ggtgtccagc caggactacc tggcgcgggc 20220
tccgcggatc cccgagggct tcgaggggta cgcgacgacg ggcagcctga cgagcgtcgt 20280
gtccggccgg gtggcgttca cgctcgggct ccagggcccg gccgtcaccg tcgacacggc 20340
gtgctcggcg tcgctcgtcg cgatccatct ggcgtgccag gcgctgcggc agggcgagtg 20400
cacgctcgcg ctgggcggcg gcgtctgcgg gctggcgacc ccggtcatgt tcaccgagtt 20460
ctcgcggcag cgggcgctgt ccccggacgg gcggtgcaag tcgttcgcgg cggacgcgga 20520
cggcaccggg ttctccgagg gcgtcgggct cgtcctgctg gaacggctgt cggacgcgcg 20580
ccgcaacggc caccgcgtcc tcgcggtggt ccgcggctcc gccgtcaacc aggacggcgc 20640
gagcaacggc ctcaccgcgc ccaacgacgt cgcgcaggaa caggtcatcg agcgggcgct 20700
cacggccgcg cggctcgcgc cgtccgaggt ggacgcggtg gaggcgcacg ggaccggcac 20760
ccgcctcggc gacccgatcg aggccgaggc gctcatcgcc acctacggcc gggaccggcc 20820
cgcggagcgc cccctctggc tcggctccgt caagtcgaac atcgggcaca cgcacgccgc 20880
ggcgggcgtg gccggcctga tcaagatggt gatggccctg cggaacgaga cgctcccggc 20940
caacctgcac ctttccgagc cgaccccgca cgtggactgg gacaccggcg cggtgcgcct 21000
cctcaccgcg ccggtgccct ggccccgtga cgaccgtccg cgccgcgcgg gcgtgtcgtc 21060
gttcggcatc tccggcacga acgcccacct gatcgtggaa gaggcccccg agccggcgga 21120
tccgtccgag gcggctgccc cgcccgagcc agactcgggg gcggtggtgc cgtgggtgtt 21180
gtcggggcgg agtgaggagg cgttgcgggg tcaggctgcc gcgttggctg ctcgggtggg 21240
cggttcggag ggttgggctg cggctgatgt gggctggtcg ttggtgacga cgcgttcggt 21300
gttcgagcac cgtgccgtgg tcgtcggctc ggaacgcgac gaactgctgg acggcttgaa 21360
ggcgctggcg gcgggcgagg cgcacccgaa cgtggtccat ccggggagcc cggccgtgag 21420
tggcgcgaac gcgcgaggcc cggtgctggt gttcccgggc caggggtcgc agtgggccgg 21480
tatgggcgcg gagctgctgg attcctcgcc ggtgttcgcg gcccggatcg ccgagtgcga 21540
gtgtgcgctc gctccgttcg ttgactggtc gctgtcggac gtcctgcggg gcgacggcgc 21600
ggagctgacg cgcgtcgatg tcgttcagcc ggtgctgtgg gcgacgatgg tgtctctcgc 21660
ggccgtgtgg gcgtcgtacg gtgtcgttcc gtccgccgtg atcggtcact cgcaagggga 21720
gatcgccgcc gcgtgtgtgg ccggggcgct gtcgttggag gacggcgcca ggatcgtcgc 21780
cgtgcgcggc caggagctgc ggcggctctc cgggcggggc gccatggcct cgctcggggt 21840
gggggaggag cgcgccgggg aactcgtcga aggcgacgtc acggtcgcgg cggtgaacgg 21900
gccgtcgtcg acggtggtgt cgggcccgcc cgagcaggtc gccgcggtgg tggcccgcgc 21960
ggaggccgcc gggcaccggg cgcggacgat cgacgtcgac tacgcctcgc acggcccgca 22020
gatcgagctg atcggcacgg agctgaccga gaagctggcc ggggtgcggc ccgtgcgcgc 22080
ggacgtgggg ttctactcga cggtgacggg cggccggatc gacaccgccg ggctcgacac 22140
cgtctactgg gtgaccaacc tgcggctgcg ggtgcggttc gccgagaccg tccgggcgct 22200
gctggacgac gggcaccgcg tgttcatcga ggcgagcccc catcccgtgc tgacggtcgg 22260
cctggaggag tgcttcgagc aggccgggac cgcggccgtc gccgtcccga cgctccgtcg 22320
cggcgacggc ggcgcggggc aggtggcgaa ggcggccggg caggcgttcg ccgccggggc 22380
gcgggtggac tggacaggct ggttccgcgg ccgtccggcg gccccgcgcg tggtcgatct 22440
gccgacgtac gcgttccagc gcgagcggta ctggctggag ggcctgaacg gcgacggcgg 22500
cgatcccgcc gggctcgggc tcgtcccggc cgggcatccg ctgctgggcg cggcggtcga 22560
ggtcgcggac ggcgacatcc acctgctgac cggccgcctt ccgggcaacg ggaacgcggg 22620
ctggctggcc gaccaccggg tgctcgggac cgtcctggtg ccggggacgg cgctgctgga 22680
gtgggcgctg cgcgccgccg acgaggccgg ctgcggctcc gtcgaggaac tggtgctgca 22740
cgagccgatg gtcctgcccg gcgggagcgg cgggagcggc ggcctgcgcg tgcaggtggt 22800
cgtcggcgcg gcgggcgagg acgggcggcg cgaggtccgc gtcagctccc gccccggcca 22860
ggaggacagc gcaggcgaac gcggaggcga ggcgcgcgac tgggtgtgcc acgcgtccgg 22920
cgtgctcgcc ccggaaacga ccgttccggc cccggggctg gacgggatct ggccgcctcc 22980
gggcgcggag gccgttgacg tgagcgacgt ctacgaacgc gtcgcggcgt ccggctacga 23040
gtacggcccc gcgttccagg gcctgcgggc gctgtggcgg cacgggacgg acctcctcgc 23100
ggaggtcgcg ctgcccgacg cggcggggaa gccggacggt ttcggcgtcc atcccgcgct 23160
cctcgacgcc gcgctgcacc cggcgctgct ggaggacggc gacgacgacg cggagggcgg 23220
cggcgagcgc cgggtgtggc tgccgttcgc ctggaacggc gtgtcgctgc gggcggacgg 23280
ggcgaccgcc gtccgggtcc gactgtctcc cgagggggac ggcgcggacg agcggaaggt 23340
acgcgtcacg gtcgcggacg ccgtcggcgg tcccgtcctg gacgccgaat ccgtggtgat 23400
gcgcccggcc gacatcgacc gcttgaagtc ggcgggccgt cccggcgcgg gaggagcgga 23460
agggctcttc accgtcgaat ggacggccct gcccgagccc gccgcggaac cgtccgccga 23520
ggacgacacc gaccacgtca cgctcgctcc caccgcctcc gccgaagacg gcgcctcgta 23580
cgcggggctc gacgggctgg cggcggcgct ggacggcggc cttcccgcgc cgtcggtcgt 23640
cctcgccgag gtcggcgccg tcgcggacgc gctcgcgctg gtgcagcggt ggctggccga 23700
gccgaggctg tccgaggcgc ggctggcggt cgtgacgcgc ggagcggtcg cgtgcgagaa 23760
ccccgtaccg gacggcgcgg gggtgtgggg cctggtccgc agcgcccagg cggagaaccc 23820
tggacggttc gtgctcctcg acgtcggcgc cgaagccgac gccggagccg acgccggctc 23880
ccgcgccgaa caggctgagg ccgtgcggcg ggccgtccgg gcgggcgagc ctcaggtcgc 23940
gtggcgcgac gggcggctgc tcgtgccgcg ctgggtgcgc gcgggcgagc cggtcgggat 24000
cgttccgccg ccgggggagc gggcgtggcg gctcgggctc tcggggacgg cgacgctgga 24060
gaacgtgtcg gcgacggcgt gcccggaggc gctggagccg ctggagcccg gtcaggtgcg 24120
catcgacgtc cacgcggccg gggtcaactt ccgcgacgtg ctgatcgcgt tggggatgta 24180
tccgggcgac gcggcgttcg gcggcagcga gggcgcgggc gtggtgacgg aggtcggccc 24240
ggacgtcgcg gggctggcgg ccggggaccg ggtgatgggc ctgttcgacg gcgcgttcgg 24300
gtcggtcgcg gtggccgacg cgcggatggt cgcgcccgtc ccggacggct gggacctccg 24360
gcaggccgcg gcggcgccgg tggcgttcct gaccgcctgg tacgggctcg tccacctggg 24420
cgcgctgcgg cgcggcgaca gcgtgctggt gcacgccgcg accggcggcg tcggcatggc 24480
ggcggtgcag gtcgcgcggc acgtgggcgc cgaggtgtac gcgacggcgg gaccggccaa 24540
gcacggcgtg ctggacggca tgggcatcga cgaggcgcac cgcgcgtcgt cccgcgacct 24600
ggacttcgag gacgccgtgc gccgcgccac gggcggccgg ggcgtggacg tcgtcctcaa 24660
cagcctgtcc gggccgttca ccgacgcgtc gctgcggctc ctcgcggacg gcggccggtt 24720
cgtcgagatg ggcaagacgg acgtccgcgc gcccgacccc gcgtccgacg ggatggccga 24780
gggcgtggcg taccgggcgt tcgacctgct cgccgacgcc gggcccgacc ggatcgcgga 24840
gatgctgtcc gagctggcgg ggctgttctc cagcggcgcg ctgcggcccc tgccggtgcg 24900
ggcgtggccg ctcggccgcg cgcgggacgc gctgcggcac atgagccagg cccgccacac 24960
cgggaagctg gtgctggacg tccccgcgac gccggacccg gacggcaccg tcctcatcac 25020
cggcgggacg ggcacgatcg gcggcgaggt ggccgtgcac ctggcgcgcg actggcgcgt 25080
ccggcacctg ctgctcctca gccgcagcgg cccgtccgca ccgggcgcgg acgcgctcgc 25140
cgaacgcatc cgggagctcg gggccgaggc cgccatcaag gccgtcgacg tcgccgaccc 25200
cgtcgcgctg gacgaggcga tcgccgcgat cgacccggcg catccgctga ccgggatcgt 25260
ccacgccgcc gggacgctcg acaacgcgct ggtcacctcc ctcacgccgg acgcgctggc 25320
cggggtgtgg gacgtgaagg ccggagccgc cgcgcacctg cacgccgcca ccgccgggac 25380
gcggctcggg ctgttcgccg tgttctcctc gttcgcgtcc acgctgggga cgccgggcca 25440
ggccaactac gccgccgcca acgcgtactg cgacgcgctc gccgccgcgc gccgcgccga 25500
agggctgccc ggcctgtcgg tcgcctgggg gctctgggcg acggcgagcg gactcaccgg 25560
cgggctcacc gaggccgaca ccgcgcgcat ccaccgctac ggcatccgcc cgaacggcac 25620
cgaagagggc ctcgcgctgc tcgacgccgc ccaacggcac ggcggcccgg aggtcctcgc 25680
gctcaacttc gacgcccgcg aactcgccgg acggcccccg gacaccctgc ccgccccgct 25740
gcgcggcctc gccgcgacgg ccgccgccaa gccgggcggc gtgccgggcc gtcccaccgc 25800
cgcgggcggc gaacggccga ccgactggac ggcgcggctc gccggcctgc ccccgtccga 25860
acggcaccgc acgctgctca acctcgtgcg gacccacgcc gcggccgtcc tcggccacgc 25920
cgacgcgggg ctcgtccgcg cggacgcctc gttcaaggaa ctcggcttcg actcgctcac 25980
cggcgtcgag ctgcgcaacc gcctgtccgc cgcgaccggg ctgcgcctgc acgccgcgct 26040
ggtcttcgac cacccggagg cgggcgcgct ggccgaccac ctcctcgaac tgctcgcccc 26100
ggacggcgcg ccacccgccg gcgcggacgc caccgcgccg gtgctgggcg aggtggcgcg 26160
gctggagagc accctgtccg ccgtcgccgc cggggccgtg tccgccgacg cggtcaccgc 26220
gcggctggag gacctgctcg cgaactggaa ggcggcgcac gcgcccgacg gcggcaccgg 26280
cggcgacggc gacgcggacg gcgcggagga gaggctccgg gtcgccgacg cgggccagat 26340
cctcgacttc atcgacaacg agctgggcgt gaggccgtga acgaagagaa gctggtcgac 26400
tacctccggc gcgtcaccac cgacctgcac cacacccgca agcggctcag cgaggtcgag 26460
gacaggcagc gcgagccggt ggccgtggtg gacatggcgt gccggttccc cggcggggtc 26520
cgttcgcccg aggacctgtg ggagctggtc gcgtcgggcc gggacgcgat gggcgccttc 26580
cccgaggacc gcggctggga cctggaccgc ctgttccacc ccgaccccga ccatccgggg 26640
accagctacg ctcgcgcggg cgggttcctc tacgacgccg ccgagttcga ccccgagttc 26700
ttcgggatca gcccgcgcga ggcgctggtc ctcgacccgc agcagcgcct gctgctggag 26760
gtgtcgtggg aggcgatgga acgggcgggc atcgacccgt ccacgctgaa gggctcccgc 26820
accggcgtgt acgcgggcgc ggcgctgccc ggcttcggca ccccgcacat cgacccgagc 26880
gccgaaggcc acctggtcac cgggaacgcc ccgagcgtcc tgtcgggccg cctctcctac 26940
accttcggcc tggaggggcc ggcggtgacg atcgacacgg cctgctcgtc gtcgctcgtc 27000
gcgatccacc tcgcctgcca cgcgctgcgg cagggcgagt gcacgctcgc gctcgcgggc 27060
ggcgtgacgg tgatggccgt cccgaacgtg ttcaccgagt tcgcgcggca gcgcgggctg 27120
gccgccgacg ggcgctgcaa gccgttcgcg gaggccgccg acggcaccgc gttcgccgag 27180
ggcgccggga tgctgctgct ggagcggctg tcggacgccc gccgcaacgg ccaccccgtc 27240
gcggcggtga tccgcggctc ggccgtcaac caggacggcg ccagcaacgg cctgaccgcc 27300
ccgaacggcc cgtcccagca gcgggtgatc cggcaggcgc tggaggccgc gcggctcgcc 27360
ccgtccgagg tggacgcggt ggaggcgcac gggaccggca cccggctcgg cgacccgatc 27420
gaggcgaagg cgctgctgtc ggtctacggc gcggaccggg acgcgggccg cccgctccgg 27480
ctcggctcct gcaagtccaa cctcggccac acccagggcg ccgcgggcgt cgccggcgtg 27540
atcaagatgg tgctggcgct gcggcacgag cggctccccg cgacgctgca cgtggaccgc 27600
ccgaccccgc acgtggactg ggactccggc gcggtgcgcc tcctcaccga gcccgccggg 27660
tggccgcgcg gcggcgaccg cgcgcgccgg gccggggtgt cgtcgttcgg catctccgga 27720
acgaacgcgc acctgatcct cgaagagccc cccgccgaga acgcccagga cgccgagcac 27780
gccgagcacg ccgagcacgc cgacgacgcc ccggccgcct ccgccgggcg cgccgcgaac 27840
gaggcccgcg ggccgatccc gtgggtgctg tcgggacgca ccgggcccgc cctcgccgag 27900
caggccgccg cgctcgccga gcacgtcgcc gcccgtcccg agctgcccgc cgccgacgtg 27960
gcctggtcgc tcgcgcggac gcggacgccg ttcgagcacc gcgcggtcgt cctcggcgac 28020
gaccgggacg cgctgcgcga cgggctcgcc gcgctggcgg acggccggga ccatccggac 28080
gtgacgcgcg ccgcgggccc tgcggggccg ggcgggacgg cgttcatgtt caccgggcag 28140
ggcagccagc ggcccggcat gggccgcgac ctgcaccgga cgttcgccgc gttcgcggag 28200
gcgttcgacg aggccgccgc gcacctggac ccgctgctcg gccggtcgct gcgcgacctg 28260
gtgttcgccg acggcggcac cgccgaggcc gccgacctgg accgcaccgg cgtgacccag 28320
gcggcgctgt tcgcggtgga gaccgcgctg ttccggctgg tggagtcgtt cggcgtcgtc 28380
cccgcctacc tgacgggcca ctcggtgggg gagatcgtcg ccgcgcacgt cgcgggcgtg 28440
ctgtccctcc cggacgcctg cgcgctggtc gccgcgcggg gcgggctgat gcaggcgctg 28500
cccgagggcg gcgcgatggt cgccgtcgag gcccccgagg acgaggtggc gccgctgctc 28560
gccgggctgg aggaccgcgt ctcgctggcg gcggtgaacg gccccgcctc cgtcgtcgtg 28620
tcgggcgacg ggacggccgt cgaggaggcc gtccgcccgc tgcgggaacg cggcctgagg 28680
gtcaggcggc tgcgcgtcag ccacgcgttc cactccccgc tcctcggtcc catgctggac 28740
ggcttccgcg aggtcgtgag cggtctggcg ttcgggccgc cggagatccc gctgatctcc 28800
aacctcaccg gcgagcccgc cgacgccgga ctgctcggcg acccggagta ctgggtgcgg 28860
cacgcccggc ggaccgtccg gttccacgac ggcctgcgca ccctccgcgc ggagggcgtg 28920
acgcgctacc tggagctcgg cccggacccc gtcctcaccg ccatggcgca gaagtccctg 28980
gcggacgacg gccccgcccc ggcgttcgcc gccgccatgc gcaagggccg cgacgagccc 29040
cgcacgttcc tcaccgccct cgccgcgacg cacgtcgacg gcggcgacgc cgacttcgcc 29100
gccgcgctcc cggcgggcgc gggacgcgta ccgctgccca cctaccggtt ccagcgcagg 29160
cggtactggc ggcccgtgca ggacgcggcg gcggaggtga gcgcgtccgg cctgcgcccg 29220
accggccacc cgctcctcca ggccgccgtg gaaccggcgg acggcgggct gctgctcgcc 29280
ggacgcctgt cgccccgcac gcacggctgg ctcgccgacc acgccatcgc ggacggcgcg 29340
ccgctgcccg cgacggcgtt cctggagttc gcgctggtcg cggcccggcg gaccggctgc 29400
gacgccgtcg aggacctcag cctggagacg ccgctgccgc tgcccgacgc cgaacccgtg 29460
gacctccagg tcgcggtcgg cggcccggac ggcgcgggcc gccgcccggt caccatctac 29520
tcccgcccgg tggacgacgc gcacgacgac cccgacgcgc ccgcggcggc gcgcctggac 29580
gacctcggac ccgcgaacgg ctggcgccgc cacgccgccg gaaccctcgc cggacccgac 29640
cccgaccccg accccgcccc ggccgcgccc gcgcagtccg ccgcttggcc cccggagggc 29700
gcggacgaga tcgacgcggc ggagctgtac gagcggctgg ccgaccaggg ctaccggtac 29760
gggcccgcgt tccgcggggt gcgcgccgcg tggcggctcg gcgacgagct gttcgcggag 29820
gtgcgcctcg cgcccgacca gcacgccgac gccgccgcgt acgccgtcca cccggccctg 29880
ctcgacagcg cgctccaccc cgtggacgag ctctacgagc acgacgggag cacggtccgg 29940
ctgccgttct cgttcggcgg cgtgcgcctc cacggaacgg gccgcacgcg gctgcgggtc 30000
cggatcacgc ccggcgacgc ggacgtgctg tcgctcgcgc tcaccgacga cgacggcgcc 30060
gccgtcgcgt ccatcgggtc cctcggcctg cgcaaggtcc ccgccggacg gtggaacggc 30120
gccgcgcccg ccgacgacgc cctgcaccgg ctggcctggc accccctccc cgtaccggac 30180
ggcgggcccg gaaccgcccg ccccgccgac gactgggccg ccatcgccgc ggccggcccg 30240
gacgcgaccg gtccggctgc ggacggcctg agtgcggacg gcttgggtgt ggtcgcgccg 30300
gacgccgtcc acccggacct cgccgcgttg aagaccgcgc tggccgacgg gcgccccact 30360
cccgccgtcg tcctcctccc gctcccgacc ggcgggcccg acgccgccga gcccgtcgcg 30420
cgggcccacg acgccgcgcg gcgcgggctc gaactgctcc aggagtggct ggaggaggag 30480
cggttcgccg cggcgcggct gatcgcggtg acgcggggcg cggtcgccgc caccggcgac 30540
gacccgccct ccgacctggc gggcgcgccg ctctggggac tggtccgcgc ggcgcaggcc 30600
gagtaccccg accggatcgt cctcctcgac ctcgacggcg aggacgcctc ccgcgacgtt 30660
ctgcggggcg cggtcgccgc cgcgatggag aacggtgagt cgcagctcgc ggtgcgttcg 30720
ggagacgtcc tggtcccgcg gctcgttgac gcggcggaac gggacgcgga cggaaccgcg 30780
ccggcactcg accccgacgg gaccgtcctg gtcaccggcg ggacgggggc gctcggacgg 30840
ctcgtcgcgc ggcacctcgt gaccgcgcac ggagcgcggc acctgctgct caccgggagg 30900
cgcggccccg ccgcggacgg cgcggccgag ctggaggcgg agctcgccgg gctcggcgcg 30960
gacgtgcgcg tcgccgcctg cgacgccgcc gacccggacg cgctcgccgc gctgctcgcc 31020
tcggtgcccg ccgcgcaccc gctgacggcc gtcgtccacg ccgccgggat cgtggacgac 31080
ggcgtgatca cctcgctgac gcccggacgc ctcgccgccg tgctggcgcc gaaggccgac 31140
gcggcctggc acctgcaccg gctgaccgag ggggacgacc tcgcggcgtt cgtgctgttc 31200
tcgtccgccg cgtccgtcct cggcaacggc ggccaggcga gctacggcgc ggcgaacatg 31260
ttcctcaacg ccctcgccga gcaccgccgg gccggcgggc gtcccgcgac gtccctggcg 31320
tggggcctct gggaggcgtc cggcgggatg gccgaggacc tcgccgccgc cgaccgcgcc 31380
cgcatcaccc gctccggcgt cgccgcgatg ccgcccggcc gcgccctcgc cctgctcgac 31440
gcggcgctcg ccgcgccgca cccgacgctc gtgccggtcg cgttcgaccg cgcggcgctg 31500
cgggagcagg cggcggcggg gacgctgccg cccgtcctgc gcggccgggt ccgcgtgccg 31560
gcccggtcgc cgcgcgggcc cgcggcgtcc tccttcgcgg cgcgcctcgc cgggctgccc 31620
gccgaggaac gggaccgcgc cgtcgcctcc gccgtccgcg accagatcgc gaccgtgctc 31680
gcgcaccccg aacccgaggc catcgacctc aaccgggcgt tccaggagct cgggttcgac 31740
tcgctcaccg ccctggagct gcgcaaccgg ctgaacgcgg ccacgggcgc ggccctgccc 31800
gccaccgtca tcttcgacca cccgaccccc gaggcactga tccgcgccgt ccgcacccgc 31860
ctggacggcg cggcgggcgg gcgcgccgtc cgggccccgg tgaacggcgc gccgcgcgca 31920
gcggccgcgg acgacggcga accgatcgcc atcatcggca tgggctgccg ctacccgggc 31980
ggcgtgacgt ccccggagga cctgtgggag ctggtcgcct cgggcgcgga cgcgatcggg 32040
gagttcccgg ccgaccgcgg ctgggacctg ccgggcctgt tcgaccccga ccccgaccgg 32100
ctcggccaca gctacacgcg cgagggcggg ttcctgtacg acgcggcgcg gttcgacgcc 32160
gggttcttcg ggatcagccc gcgcgaggcc gtggcgatgg acccgcagca gcggctgctg 32220
ctggagacgg tctgggaggc gttcgaacgc gccgggctcg accccgcgga cctgcggggg 32280
agccgcaccg gcgtgatcgc gggcgtgatg tacgacgact acggcagccg gttcctcggc 32340
cggacgccgc gcgccgtcga gggccgcctc atgacgggca gcacgccgag cgtcgcgtcc 32400
ggccgcgtcg cctacacctt cggcctggag ggccccaccc tgacggtgga caccgcgtgc 32460
tcgtcgtcgc tggtggcgat gcacctcgcg gcgcaggcgc tccgccggga cgagtgcacg 32520
ctggccgtgg cgggcggcgt cacggtgatg gcgacgccca acacgttcgt ggagttctcg 32580
cgccagcgcg gcctggcccc ggacggccgc tgcaagtcgt tcgccgcgtc ggcggacggc 32640
gtcggctggg gcgagggcgc cgggatggtc gtgctggaac ggctgtcgga cgcgcgccgc 32700
aacggccacc gcgtcctcgc ggtgctgcgc ggctcggcgg tcaaccagga cggcgcgagc 32760
aacggcctca ccgccccgaa cggcccgtcg caggaacgcg tcatccgcgc cgcgctcgcc 32820
ggagccgggc tcgcgccgtc ggacgtggac gccgtcgagg cgcacggcac cggcacgacc 32880
ctcggcgacc cgatcgaggc gcacgcgctc ctcgccgcct acgggcagga ccgcccggac 32940
ggccgtccgc tctggctcgg ctcggtgaag tcgaacatcg gccacaccca ggccgccgcg 33000
ggcgtcgccg gcgtcatcaa gatggtcatg gccatgcgcg agggttcgct ccccgcctcc 33060
ctccacatcg ccgaaccgtc cccgcacgtc gactggacgg cgggcgccgt gcgcctcctc 33120
accgaacccg tcgaatggcg gaacggcgac cgtccgcgcc gcgccggggt gtcgtcgttc 33180
ggcatctccg ggaccaacgc ccacctcatc cttgagcagg cccccgaccc cgcggagact 33240
cccgcgcccg agggcccggt catcctgaac ggcgcggcga gcgccgaggc ggagacagga 33300
acgggcgata cgggcgatac cggggtgccg gacctcgccc tggtcccgtg gacggtctcg 33360
gcgcgcgacc cggacgcctt gcgcggccag gccgccgcgc tcgccgccca cgtctccgcc 33420
cggcccggcc tccccgtggc ggacgtggcg tggtcgctgc tgaagacccg gtcgtcgttc 33480
ccccacaagg ccgtcgtggt cggcgaacgc cccgcggaac tcgtcgcggg actggaggcg 33540
ctcgcggccg gcgccgacgc ccacccggcc ctgaccgggc cgggagcacc ggcggccccc 33600
ggccggctcg tgtggctgtt cagcggccag ggaagccagc gcgtcgggat gggcgccgac 33660
ctctacgtgc ggttcccggc gttcgcggag gcgttcgacg aggtgtgcgg ccacttcgac 33720
gaacggctcg gacgcccgct cggcgaggtc gtcttcaccg gcccggcgga cgtcctggac 33780
cacaccacct acgcgcaggc gggcctgttc gccctccagg tcgcgctggc ccgcctgctc 33840
ggcacggccg ggatccgccc ggacgccgtg atcggccact ccatcggcga ggtcgcggcg 33900
gcgcacgtcg cgggggtcct cgacctcccc gatgcctgcc gcctcgtcgc ggcccgcgcc 33960
gggctgatgg gacggctccc gtccggcggc gcgatgaccg ccgtcgaggc caccccggac 34020
gagctcgcgg acgacctcga acggctcggc ggcggcctgg tcgcggtcgc cgcgctcaac 34080
acccccggca gcacggtgat ctccggcccg gccgagcccg tcgcccggat cggcgcgcgg 34140
tgggccgagc gcggccgcag gacccgttcg ctcaccgtca gccacgcctt ccactcgccg 34200
ctgatggacc ccgtcctgga cgagttcgcg acggccatcg acggcctcgc ctaccgcgaa 34260
ccggccatcc cgctgatcag caacctcacc gggctgcccg ccgacgagcg catcaccacg 34320
cccgcctact gggccgagca catccgccgc cccgtacggt tccaccccgc ggtcgcgcac 34380
gtcgccgccg acgccgccgc gttcctcgaa ctcggccccg accccgtcct cggcaccgcg 34440
acccggcgca ccctcgacac cctcgatgcc ggcggggacc cggccggcgg gccaccggtc 34500
ctggcgaccc tcacacggaa gcagccggag atccgctccc tcgcgaacgc cctggccggc 34560
ctccacgtca ccggcgtccc ggtggactgg gccgcggcct ggttccccgc cgagcccgcc 34620
ccgcgcgccg tggacctgcc cacgtacgcg ttccagcggg aacggttctg gctctccgcc 34680
gacgccggag gcgaggacgc cgccggactc ggcctgaccc ccgccgggca cccgctgctc 34740
ggcgcggcga tggacctggc gggcggcggc ggccacgtgc tgaccggccg cctgacgcgc 34800
cggtccgggg cgtggctcgc ggagcacgtg gtcgccggga cgaccctcct gcccggcgcg 34860
ggcgtcgtcg aatgggcgct ccgcgcggcc gacgaggtcg gctgcggcgg agtggacgag 34920
ctgacgctcc gcgccccgtt cgtcctgccc gacacaggcg gactgcgcgt ccaggtcgtg 34980
atcggcgcgc ccggtgagga cggccgccgc gacctgcacg tctactcccg cctcgaccag 35040
gacggcgagg ccgcggaatg gctgtgcaac gccgaaggcg tcctcacccc ggaaaccccc 35100
gccgaccccg gaccggaacc gggcggcccc tggcccccgc cgcacgcgga acccgttgac 35160
gtcgagggct tctacgaacg cgccgcgtcc ggcggctacg actacggacc gtcgttccag 35220
ggcctgcggg cgctgtggcg cgcgggcgac gacctgttcg ccgaagtgac cctccccgaa 35280
gcggccggcg acgtcgccgg atacggcatc caccccgctc tactggacgc ggcgctgcac 35340
ccgctgttcc tcacgggcct gttcggcgga gacccgtccg gaaaccgcat ctggctgccg 35400
ttcacctgga acgacgtgtc gctgcgggcc ggcggggcga cggccgtccg cgtgcggctc 35460
tccccgaacg aggacggcga caccctgcgc ctgaccgtca ccgacccgtc cgccgcgccg 35520
atcctcaccg tcggctcgct cgtgatgcgc cccgccgacg ccgaccagct ccgggccgcc 35580
gcgcgccaag gcgacacacg cggcctcttc acagtcgact ggacacccgt ccccctgccg 35640
tccgcccctg cggacgcgcc cgaacgagac gacgcaggcg aatgggccgc tctgggcccg 35700
gacacgctac cgcccggcct ggacgcctac cgcgacctcc cgaccctcct gatggccctc 35760
gacgcaggcc agcccccgcc gtcccacgtc ctgaccacca tccccccgga gcccccgccc 35820
ggccgcaccg aaaccgacgt cgcggcggac acgctgctgc tcctgcaagc ctggctggac 35880
gagccgcgcc tggcggatgc gcgcctggtg gtggtgaccc ggggcgcggt cgcgtccgac 35940
gtccccgatc tggccggtgc aggggtgtgg ggcttggtac gcagcgcgca ggcggagaac 36000
cccggtcgct tcgtcctcct cgacctcgaa acggccccgg acggcgatgt cctcgaactc 36060
gtccgacgcg ccctgcacac cgacgaaccg caactggccg tacgcggcgg ccaagcgctc 36120
gtaccccggc tggccagagc ggaccgcccg gccgaactga ccgcccccgc gacccgcgcg 36180
ggctggcgcc tcggcgccga acgcccggga accgtggaga gcgtcacgat cgtccccgcc 36240
cccgaggccc tcgaaccgct cgcccccgga caggtgagga tctccgtccg ctccgccgga 36300
ctcaacttcc gcgacaccct catcgcgctc ggcatgtacc caggcggcgg cgtcttccaa 36360
ggcagcgagg gcgccggcgt ggtgaccgag gtgggtccgg gcgtcgcctc gggcctgtcg 36420
gtcggtgatc gtgtgatggg tctgttcgag ggagcgttcg gcccggtggc ggtggcggac 36480
gaacgaatgg tcgttcgtct tccggatggt tgggatttcc gttccggtgc cgctgtgccg 36540
gtggtgttct tgaccgcttg gtacggcttg gtggaactgg ccgggttgaa gtcgggtgag 36600
tcggtgttgg ttcatgcggc gacgggtggt gtgggttcgg cggcggtgca gatcgcccgc 36660
catctgggtg cggaggtctt cgccaccgcc agccccggca agcacgccgt ccttgagtcg 36720
atgggtatcg atgcggccca ccgcgccagt tcccgcgact tggacttcga gcggacgttc 36780
cgcgacgcga cgggtggtcg tggtgttgat gtggtgctca atagtttggc gggggagttc 36840
accgatgcgt cgttgaggtt ggtggctgag ggtggtcggt tcctggagat gggcaagacc 36900
gacatccgcg aacccgaaca catcgccgcg gagtaccctg gcctctccta ccgggccttc 36960
gacctcgtca ccgacgccgg acccgaccac gtcggggaga tgctggcacg gctcaccgaa 37020
ctgttcagcg ccgacaccct acggccgccg cacgttgagg catggccact ggcccgcgct 37080
cgcgaggctt tgcgggtgat gagtcaggcc aagcacaccg gcaagctcgt tctggacatc 37140
cctccgactt tggacccgga cggcaccgtg ctgatcaccg gcggcacggg aacactgggc 37200
ggtctggtcg ccgaacatgt ggtcgggacg tggggtgtca agcggttggt gctggccagt 37260
cgtcgtggcc tgcaagcccc gggcgccgac gaactggtcg agcgcctcac ccgagagttg 37320
ggcgcgcagg tacgggtcgt cgcggccgat gtgtccgacg ccgacgccgt caccgacctc 37380
atcaccgcga ccgacgccga ccccgcgcac cccctcaccg gcctcataca caccgccgga 37440
ctcatcgacg acgcggtgat cggctcccag acaccccaac gcctccacca cgtctggaac 37500
gccaaagcca ccgccgcaca caacctccac accgccaccc gacacctccc cctcgccgcg 37560
ttcgtcatct tctcctcggc cgccgggacg atcgggagcg ccgggcaggc caactacgcc 37620
gccgccaacg cgtactgcga cgcgctcgcg gttcggcggc gttcgctcgg gctgccgggg 37680
gtgtcggtgg cgtgggggct gtgggagtcc agcagcggga tcaccggcca cctgggcgac 37740
gccgacctcg cgcggatggg ccgctccggc ttccggccgc tcacggaccg gcacggcctg 37800
gccctgctgg acgcggcgag caggcacggc gccgctcatc tcgtcgcggt ggacctcagg 37860
acgagcgcgc tggcgggtca gcccgcggag aacgttccgc cgatgctgcg gacgctcgcg 37920
gcgaccgcct cggccggggg cgcggcggcg cggccgacgg cgggcggtgt gcggcggccc 37980
gccgacctgg ccgcccggct ggcggcgcag tcccggacgg agcaggaacg cacgctcctc 38040
ggcctcgtcc ggacgcaggc cgccaccgtc ctcggccatc cgaaccccga cgccatccgt 38100
cccgacaact cgttcaagga gctgggcttt gactccctca ccgccgtcga actgcgcaat 38160
cggctcgccg cgagcacggg gctgcgcctg cccgcctcgc tcgtcttcga ctatccggag 38220
gcggcggtcc tcgccgacca tctgcgacgc aggctcgcgc cgaacggcga gagcgcgccg 38280
gcgcaggacg ccgtcgatcc gctgctcggg gagctggggc ggatcgagag cgcgctggcc 38340
gcgctcgccc tggacgacga ggcgcggagc agggtcgcca ggcggctcaa cgggctgctt 38400
tcggcggtga acggcggttc gaacgcttcg gcggagacgg gcgggttcga cgacgtcgag 38460
tccgcgtccg acgacgagat cttcgagctg atcgacaggg agctctgagc gtgcgcatgg 38520
agggcgggaa ctgatgtcgg agaccgagga gaagctccgc caatacctga agcgggtcac 38580
ggtcgacctc gggcagaccc gccagcgcct ccgcgaggcg gaggagcggg gccgggagcc 38640
ggtggccgtg gtcggcatgg cgtgccgctt cccgggcggc gtctcgtccc ccgaagagct 38700
gtgggagctg gtcgactcgg gcggcgacgc gatcggcgcg ttcccggacg accgcggctg 38760
gaacctggaa gccctttaca accccgaccc cgatcagccc ggcggcagct acgtccgcga 38820
gggcggtttc ctgtacgacg cggacctctt cgacgcgggg ttcttcggca tcagcccgcg 38880
cgaggcgctg gcgatggaac cgcagcagcg gctgctgctg gagacgtcat gggaactgct 38940
ggaacgggcc ggcatcgaac cgggctcgtt gaagggcagc gccaccggcg tgtacgccgg 39000
agcgggcctt cccggtttcg gcaccccgca catcgaggag agcgccgaag ggcacctggt 39060
gaccggcaac gcgctcagcg tcctgtcggg acgcgtcgcg ttcacgctcg gcctggaggg 39120
cccggcggtg tcggtggaca ccgcgtgctc gtcgtcgctc gtcgccatcc acctcgcctg 39180
ccaggcactg cgacagggcg actgctccat ggcgctcgcg ggcggcgtga cggtgatggc 39240
cgtaccggcc atgttcaccg agttctcgcg ccagcgcggc ctggccccgg acggccgatg 39300
caagcctttc gccgaagccg ccgacggcac cgtcttctcc gaaggcgtgg gcctggtgct 39360
gctggaacgc ttgtcggacg cgctgcgcaa tggccgccgg atcctgggtg tggtgcgggg 39420
ctcggcggtc aaccaggacg gcgcgagcaa cggtctcacg gccccgaacg gcccgtcgca 39480
ggagcgggtg atccggcagg cgctggcgta cgcgggtttg tcgggcgtcg aggtggacgc 39540
ggtggaagcc cacggcaccg ggaccacgct gggcgacccg atcgaggcgg gcgcgctgct 39600
cgcgacctac ggacagggcc gcccggacga ccggccgctg tggctgggtt cggtgaagtc 39660
gaacatcggg cacgcgcagg gcgcggcggg cgtggccggc gtcatcaaga tgctcatggc 39720
catgcggaac ggccggctcc ccgccaccct ccgcgtggac cgcccctcgt cgcacgtcga 39780
ctgggaatcc ggcgcggtga gcctgctgac cgaacccgtc gaatggccgc gcgaggaccg 39840
tccccgccgc gcgggcgtat cggcgttcgg catgtccggc acgaacgccc acctgatcct 39900
cgaagaaccc cccgagccga ccgaacccga cgacgtgccc gaacgacctg cgccgatgga 39960
cgtcggcccg ctcccgtggg tgctgtcggc gagaagcgaa gaggcgttgc gggggcaggc 40020
ggcggcgctg gcgtcccagg tgcgtgtgtc ggatgacgcg ccgttggatg tgggttggtc 40080
gctggcggcg tcccgttcga atttcgagcg tcgtgcggtg gtcgtcggga ccggacgcga 40140
tgaattgctg gccggtttgg aagctgtggc gagcggtgaa gtgcgttctg ttgcgccttc 40200
tggagtaggt gcgggtccgg tgttggtttt tccgggtcag ggttcgcagt gggccggtat 40260
gggcgcggag ctgttggatt cctcgccggt gttcgcggag cgtattgctg agtgcgagcg 40320
tgccctcgct ccgttcgtgg actggtccct gtcggacgtc ctgcgtggcg atggcgcgga 40380
gttggcgcgc gttgatgtcg ttcagcccgt gctgtgggcg acgatggtgt ctctggcggc 40440
tgtgtgggcg tcgtacggtg tcgttccgtc cgcggtgatc ggtcattcgc aaggggagat 40500
cgctgccgcg tgtgtggccg gggcgctgtc gttggaggac ggcgccagga tcgttgcgtt 40560
gcggagtcgg gcgttgcggc ggttggccgg tggcggtgcg atggcttcgc ttggtgtcgg 40620
tgtggacgcc gcgcaggagt ttctgcctga gggtgttgcg gtggcggcgg tgaacgggcc 40680
gtcgtcgacg gtggtgtcgg gcccgcccga gcaggtcacc gcagcggtgg cccgtaccga 40740
ggctgctggg catcgggcgc ggttgatcga tgttgattac gcctcgcatg gtccgcaggt 40800
cgatgagatc accgacgaac tgcatgaagt cctcgcaggc atcgaacccg taccgacaac 40860
agaagtggcg ttctattcca cggtccgtgc cgctgcgatc gacaccgcga ccctggacac 40920
cggttattgg gtgaccaatc tccgggagcg cgtgcggttc gccgagacca ttgacgcgct 40980
tctcgaagac gggcaccggg tcttcatcga ggccagtccc catcccgtcc tgacggtcgg 41040
gatggaggag acctttgaac aggccggggt cgatggcacg accgttccca cgcttcgtcg 41100
tgatcacggt gatcgcaagc agctcgcgga tgcgctcggg caggcattct cggcgggtgt 41160
ggaggtcgat tggaccgggt ggttcccggc ggagtgggct ccgcggcttg tcgaactgcc 41220
tacgtatgcc ttccagcggc agcggtactg gatcgacgcg cagggtggcc ggggaggcga 41280
tgccgcgagt ctcgggctgg cgtccgccgg gcatcggctg ctgggcgcgg cgatcgagca 41340
ggcggaaggg gacgtccgcc ttctgaccgg acggctgtcc gcgacgggcg acgcgagctg 41400
gctggccgat caccaggtcg cgggaacggt cgtcgtgccc ggcgcggcgc tgctggagtg 41460
ggcgttgcgc gcggcggacg aggccgggtg cacgggtgtg gaggacctgg tcctccgttc 41520
gccgctggtc cttcccgcgt ccgggagcgt acgggtgcag gtggtcgtgg gcgcggccgg 41580
ggaggacggg cgccgcgacg tgcgggtcta ctccaagccc ggccaggacg gcgacttcga 41640
tgacgacgcc gggtggacgt gccacgcgga aggaacgctc gggcccgaga ccggggacgc 41700
cgccgagggc ttggccgggg cctggcctcc gccgggcgcg cagcccgtgg acgtgagcgg 41760
gttctacgac ttcgcggcgg cgtccgggta cgagtacggt ccggcgttcc agggcctccg 41820
ggcggcctgg cggaacgggg cggacgtgct cgcggaagtg gagttgccgg aggccgccgg 41880
tgagccgggc gggttcggtg tccaccccgc gctgctggac gccgcccttc acgcgggcct 41940
gctggccgac ggacacggcg acgacgggga cgtgtggctg ccgttcgcgt ggagcggggt 42000
ttcgttgtgg gccactgacg cgacggcggt ccgcgttctg ctgtctccgc ttccggaggg 42060
cgaacgcggt gtgcgtgtcg tcatggccga cgccgttggc gcaccggttc tgacggccga 42120
cgcggtggcg ttgcggccga tcgacaccga acggctgcgc gccgaggacg gtcgcggtcc 42180
tgacggcctg ttcgtgctcg actgggtcgc ggtgcaggag tccgcgccgt ccgccgaggc 42240
caacgaaccg ggcgcgctgt gcgggctcgg cgacttcgat gactccgtgc tgccgtccgt 42300
ccccgatgtg gacgcgctgc tggcacgcct cgacaccgag gatcttccga ctccgttgac 42360
ggtgctgact gttctcccgc ccgagggcga gggcggtggc ctcgacctga ccgggcgcgt 42420
gctcgctctg ctgcaaaggt ggctgaacga accgcgcctg gacgacgtcc ggctggcgct 42480
ggtgacgcag ggcgccgtcg cggtcgagaa cccggacgtg gcgggtgcgg cggtgtgggg 42540
gctcgtccgc agcgcgcaat ccgaggaacc gggcaggttc gttctggtcg atgccggtct 42600
cccggatccc ggcgccgcgt tcgcccagga cgccgccgac gcggtacggc gcgcggtgga 42660
ggcgaacgag ccgcaggcgg ccgtgcgtgc ggggcagttg ctcgcgccac gtctcgcgag 42720
gaccgacacc cgcgcccccg accctgccta cgcgggcgat cgcgagggcg gtcatgggct 42780
gccggtcctc gatcctgacg gggccgtgct ggtcaccggg gggacgggca aggtcggcgg 42840
cctggtcgcc gagcatctcg ttcgcgcgtg gggggtcacg cacctcgtgc tggccagccg 42900
ccggggctcc gacgcgcccg gcgcatcgga gctggagagc cgcctggcct cgctgggcgc 42960
gcacgttcag gtggtcgcgg ccgacctggc ggacgcgtcg gccgttgacg aactggtcgg 43020
acggatcgcc gccgaacgcc cgctcaccgg cgtcgtgcac gcggccggcg tcctggacga 43080
cgcggtgatc gcctcgcagt cgccggagcg gctggcgggg gtgtgggcgg ccaaggccgc 43140
cgccgcgtac aacctgcacg tggcgacggc ggacctgccg ctgggcctgt tcgtgctgtt 43200
ctcctcggcc gcgtcgtgcc tgggcagccc gggacagggc aactacgccg ccgccaacgc 43260
gttctgcgac gctctcgccg cgcaccgccg ggcgtccggc ctgccgggcg tgtccgtcgc 43320
gtggggactg tgggcggacg ccagcggcat gaccgggcac atgaccgaaa ccgacctcgc 43380
ccgggtgagc cgttcgggga acgtcccgct caccagcgac gaggcactgg aactcttcga 43440
tgccgcctgc gggcacggcg gcgcctacct gctcgccctc cacctgaacg cgagcgccct 43500
cgcctccgcc gacctgcttc ctcctccgct gcgggccttt ggcgcccgcc gtgatcgcgg 43560
cgggcgcgtc cggcgcgccg ccgccgcagg aggccgccaa ggatcggagc tgctctcgcg 43620
gctcgccgga ctcgacgcga acggccgggc cgacgcggcg ctggaggtcg tctccgagtg 43680
cgccgccgga gtcctcgggc accgttcgcc cgacggcgta cgggccgagg cgaacttcaa 43740
ggacctcggg ttcgactcgt tgatggcggt cgagctgcgg aaccgcctgt cggtgatctc 43800
cggcgtccgg ctgccggcga cactggtgtt cgactacccg accccgcacg ccctggccgg 43860
atacctgtgt gcgcgcctgg acgggccggc gacgtccgtg gccggtccgt ccgccggtcc 43920
gacgacggcc acggcggacg acccggtcgc gatcgtcgcg atggcgtgcc ggttccccgg 43980
cggcgtcgtc tccgccgagt ccctctggga gatggtggcc gcgggcggcg acgcgatcgg 44040
cgcgttcccc caggaccgcg gctgggatct ggacgggttg ttccacccgg accccgacca 44100
ccccggcacc agctacgccc gcgagggcgg cttcgtccac gacgcggcgg aattcgacgc 44160
ggagttcttc ggcgtcaatc cgcgtgaggc gttggcgatg gatccgcagc agcgtttgtt 44220
gttggaggtg tcgtgggagt tgttggagcg tgcggggctt gatccggtgt cgttgaaggg 44280
cagccggacg ggcgtgtacg cgggtgtgat gtaccacgac tacggggccg ggatcgcggc 44340
cggcgactcc cgccttgagg gctacgcgct gatgtccagc tcgggcagcg tcgtctccgg 44400
ccgcgtcgcg tacaccctgg gcttcgaggg cccggcggtg acggtggaca cggcgtgctc 44460
gtcgtccctg gtcgcgatgc acctggccgc gcaggcgctg cgccagggcg aatgcgacct 44520
ggctttggcc ggtggggtga ccgtgatggc caccccggac gtgttcacgg gtttctcgcg 44580
tcagcgcggc ctggctccgg acggccgctg caagccgttc gccgcggcag cggacgggac 44640
gggttggggc gagggcgttg ggctggtgct gttggagcgc ttgtcggacg cgcgccgccg 44700
tgggcatgag gtgttggcgg tgctgcgtgg ttcggcggtc aatcaggatg gtgcgagtaa 44760
tggtttgacg gctccgaatg gcccgtcgca ggagcgtgtg atccgtcagg cgttggcgtc 44820
tgcggggttg tcgtcgtcgg atgtggatgc ggtggaggcg catgggacgg ggacgacgct 44880
gggtgatccg atcgaggcgg gcgcgttgct ggcgacctat gggcgggatc gttcgggcga 44940
tcggccgttg tggttggggt cggtgaagtc gaatatcggt catacgcagg cggctgctgg 45000
tgtggctggt gtgatcaaga cggtcatggc gatgcggaac ggttcgcttc cggcgtccct 45060
gcatatcgat gaaccaagcc cgcacgtgga ttgggagtcg ggcgcggtac gtctgttgac 45120
cgagccgatg gaatggccga ccgaagatga ccgtccgcgt cgggctgggg tgtcgtcatt 45180
cggtgcttcg ggaacgaacg cacacctcat cctggaacaa gcgcccgaac cgaagaacga 45240
accggaggac gaatccgagc cggtgccgtt cgacggtgtc ctgccgtggg ttctgtcggc 45300
gcacaatgcc gacgccttgc gcggtcaggc cgccgcgctg gcggcgcacg tgtccgccac 45360
ggatgacacg tccgtcgaag tgggctggtc gctgctgaag agcaggtcgc tgctcgacca 45420
gcgggccgtt gtccttggag aggaccgtgc tcaactgctc gcgggtgtcg aggcattggc 45480
ggcgggcgag ccgcatccgg cgctggtgcg aaccggtccc gccgctactg cgagtggaaa 45540
gtccgtgttc ctgttcagtg ggcaggggag ccagcgggtt gggatgggtg ccgggttgta 45600
tgagcggttc cctgtgttcg cgggtgcgtt cgatgaggtg tgcgggctgc tggatgagca 45660
tttggagcgt ccgttgaagg acgtggtgtt cactggattg cccgatcagc ctgagctttt 45720
ggatcacacc acgtatgcgc agacgggttt gtttgcgttg caggtgggtt tggcgcggtt 45780
gttggcgtcg ttgggtgtgg tgcctgatgt ggttgtcggg cattcggtgg gtgagatcgc 45840
cgccgcgcat gtggccggag tattcggtct ggaggatgcg tgccgtttgg tggcggcgcg 45900
tgcgacgttg atgggtgggt tgccgtcggg tggggccatg gccgccattc aggcgaccgc 45960
cgacgaactc accgacgacc tcgcacgttt cgccggacga gtggccatcg ccgcgctcaa 46020
caccccggac agcacggtcg tctccggacc ggccgacctg gtctccgagg tggccgccgc 46080
gtggaaggaa cggggccgca agaccaaggc gctcgcggtc agccatgcgt tccattcgcc 46140
gatgatggac ccgatcctgg acgagttcac cgacgcgatc agcgatctga cctaccggcg 46200
gccgaacgtg ccgctgatca gcaacctcag cggtctcccg gcggacgacc agatcaccac 46260
gccggagtac tgggccacgc acatccgccg gccggtccgc ttcgcgcccg ccatcgctca 46320
cctcgccccg caggcgggcg tgttcctgga actggggcct gatccggtac tggcgaccgc 46380
cgcccagcac acccttcacc acgccatgga cgagcacgag gacggcggga cggatgaggc 46440
cggtcggccc gttcccccga cgctctccac gctcaatggc aagcaggacg atccgcacgc 46500
gttcgcgcac gccatggcga ggctccacac ttcgggcgcg ggcgtggact ggacggactg 46560
gttccccgcc aagcccgcgc ctcgggtcgt cgcgctgccc acctatgcgt tccagcgcga 46620
gcggtactgg ctggcgcccg gtcgcggagc cggggacgtg gcggcggtcg gcctgcggcg 46680
ggtcgagcac gcggtgctgc ccgcggccgt cccgctcgcg gacggcggtc tgctgctgac 46740
cggccggatg tcggcgtccg gcggctgggc gtccgaccac cgcatgttgg gaacgacgct 46800
ggtgccggga gcggcattgg cggagtgggc gttgcgggcc gccgacgaga ccggctgcgg 46860
cgcggtcgag gaactggtgc tgcaagcgcc gctcgtactc ccgccgaccg gcgccgcgca 46920
gatccaggtc gtggccgacg cgcccgggtc cgatggacgc cgcgagctgc gtgtctactc 46980
ccgtccggaa cacgacgacg gttccgtcac cgaggcgccg tgggtctgcc atgcgatggg 47040
cgtcctcacg cccgaggcgc gggacgttcc ggaagggctc gacggtgcgt ggccacctcc 47100
cggtgcggaa ccggtggacg tgaccgcctt ctacgaacga atggccgcga cgggctacga 47160
gtacggaccg tcgttccgcg gactgcgcgc cgcgtggcgc gacgggcccg acatcctggc 47220
ggaggtgcag ttgccggagg ccgccggaga ccgcgccggc ttcggcgtcc atcccgcgct 47280
tctcgacgcc gtcctgcacg ccggcctgtt gccggacgga gagggcgaag gggagcagga 47340
cggccgggtc tggctgccgt tcgcctggaa cgggatctcg ctgtgggcga gtgaggctgc 47400
cacggtccgc gtccggctcg cgcccgtcca cgacggcgac ggcgagggcc gggacgaacg 47460
cggggtgcgc gtcaccgtcg ccgacagcgc gggcgcggcg gtcctgaccg ttgactcggt 47520
cgtcacacgc cctgcggaca tcgacaagct gaaggccgcc ggtgaacccg acgcggacgg 47580
cctgttcgcg ctcgaatgga cgcccctgcc cgctccggcg ctcagtgacc cgacgctcgg 47640
cgacgacgac tgggccgtcc tcggaacgga cgcgttcggc ccggccgatt gcgcccggtt 47700
cccggacctg gcggcactgt cggcggcgct cgatgcggac cggccgtccc cgacgctcgt 47760
gctggccgac gcctcgtacg gcgagagcga cgacctcgac gcggtcggga acgtcctgtc 47820
gcttctccag gcgtggctgg acgagccgcg tttggcggat gcgcgcctgg tggtggtgac 47880
ccggggcgcg gtcgcgtcgg acgttcctga tctggccggt gcaggggtgt ggggcctggt 47940
ccgcagcgcg caggcggaga accccggccg tttcgttctc ctcgacctcg aagcggacac 48000
gttcctctcg accggtctgg cagacgccgt ggcgcgtgcc gccgaggccc gcgagccgca 48060
gctcgccgtc cggtcgggcg aactgctgat cccccggctg gtgcgcgcga gcgaacggca 48120
aggcatcgcg gcccccaccg gcgaacgcgc ctggcggctg gccgtcgcca cccccggcac 48180
cgtcgagaac gtggcaccgg aaccgttccc gaaagccctc gaaccgctgg ccgccggcca 48240
ggtccgcatc gaggtgcgcg cgggcggcat caacttccgc gacgtcctcg tgaccctggg 48300
catggtgccc ggtcagaccg gcctcggcgg cgagggcgcc ggcgtggtga ccgaggtggg 48360
gccgggcgtc ttctcgggcc tgtcggtcgg tgatcgtgtg atgggtctgt tcgagggcgc 48420
gttcggcccg gtgaccgtgg cggacgaacg aatggtgtcc ccgatccccg aggactggga 48480
cttccgcagc gcggcagccg tcccgatcgt cttcctgacc gcttggtacg gcctggtgga 48540
gttggccggg ttgaagtcgg gtgagtcggt gttggttcat gcggcgacgg gtggtgtggg 48600
ttcggcggcg gtgcagatcg ctcgccattt gggtgcggag gttttcgcca cggccagccc 48660
tggtaagcac gctgttctgg aggcgatggg tatcgacgcg gcccatcgcg ccagttcccg 48720
cgatctggag ttcgagcgga cgttccgtga tgcgacgggt ggtcgtggtg ttgatgtggt 48780
gctcaatagt ttggcggggg agttcaccga tgcgtcgttg aggttggtgg ctgagggtgg 48840
ccggttcctg gagatgggca agaccgacat ccgcgaaccc gaacacgtcg ccgcagagca 48900
tccggcggtc ggttatcagg tgttcgatct ggtgacggac gccgggccgg ttcggatcag 48960
tgaaatgttc cgggcgttgg cggagctgtt cgccgaacag gtattgcggc ctgctccgac 49020
tcaagcatgg ccgctggctc gcgcgcgtga ggcgttgcgg gtgatgagtc aggccaagca 49080
caccggcaag ctcgttctgg acattccccc ggccctggac ccggacggga cggtgctgat 49140
caccggcggc acgggaacgc tcggcggtct ggtcgctgaa catgtggtcg ggacgtgggg 49200
tgtgaagcgg ctggtgctgg ccagtcgtcg tggcctgcaa gccccgggcg cggatgagct 49260
ggtcgagcgt ctcacccgag agttgggtgc tgaggtgcgg gtggtcgcgg cggatgtgtc 49320
cgacgccgac gccgtcaccg acctgatcac cgcgaccgac gcagaccccg cgcaccccct 49380
caccggcctc gtgcacaccg ccggactcat cgacgacgcg gtgatcggct cccagacacc 49440
ggaacgcctc caccacgtgt gggacgccaa ggccaccgcc gcacacaacc tccacaccgc 49500
cactcgacac ctccccctcg ccgcgttcgt catcttctct tcgagtgcgg cgatgctcgg 49560
gagtccgggg cagggcaact atgccgccgc caacgcgtac tgcgacgcgc tcgccgcacg 49620
tcgtcaggcc gcagggcttc cggccgtgtc cgtcggatgg gggctgtggg aggccaccag 49680
cgcgatgacc gggagtctca ccgaaaccga tcgcgcccgg atgagccgtt cgggtgtcgg 49740
cgcgctgccg aatgagcacg gcttggcgtt gctggacgcg gcgctcaggc acggcgacgc 49800
gtcgctgctc gccgcgaacc tcgatctgcg cgcgctcgcg gtccagcccg ccgagtccct 49860
gccgggtgcg ctgcgcgccc tcgtgtccgg cggcggcgaa cgtgtcaggc gtaccgcggc 49920
cacgggaagc gcgcggccgg acgactgggc cgggcggctg gccggaatgt ccgcggccga 49980
gcagcagcgc gccatgctca acctggtgcg cggtcacgtc gcgaccgtgc tgggacgtac 50040
cgacgccgac tccgtacggg gcgacgcgtc gttcaaggaa ctcgggttcg actcgctgac 50100
cggggtggaa ctgcggaacc ggctggcgga cgcgacgggc ctgcgcctgc ctcccgccct 50160
ggtgttcgac tacccgcagg ccggggtcct cgccgagcac ctgcgcacgc ggctcgtacc 50220
ggagggcgct gacgcctcgg ccgccggcgc gggcgtcgag cccgtcctgg acgacctggc 50280
caggctcgaa agcaccctga acgccgccgc cgctctggag gacggcgatt ccgacgcggt 50340
cacggcccgg ctggaaagcc tcctcaccca ctggagagcc gtacggagcg gccgggcggc 50400
gaacggacac cacgggcgca acggccacgc cgagccgagt gggaacgacg ggcagaacgg 50460
ccacgacggg caggacgggg aggacggcac ggccgttgac cggctggagt cggcgtccgc 50520
ggaccaggtt ctcgacttca tcgacaacga attgggcgtg tcctggaacg ggccggacgc 50580
gaccacgacg cgatagggct gggtgactcc gcatggcgac cgaagaacgg ctggttgagt 50640
acctcaagcg cgtcaccgcc gacctgcacg acacgcgcct gcgcctgcgc cggatggagg 50700
agcgctccga ggagcccgtg gccgtggtgg gcatggcctg ccggttcccc ggcggcgtgg 50760
actcgccgga gtccctgtgg gacctggtcg ccgcgggcgc cgacgcgatc ggcgacttcc 50820
cggacgaccg gggctgggac ctcgacggcc tgtaccaccc cgaccccgac caccccggta 50880
cgacgtacac gcgcaagggc ggcttcgtcg acgacgccga ccggttcgac gcggcgttct 50940
tcgggatcag cccgcgcgag gcgatggcga ccagcccgca gcagcggctc ctgctggaga 51000
cgtcctggga ggcgttcgaa cgggcgggca tcgaccccgt ctcgctgcgc ggcacccgga 51060
ccggcgtcta cgcggggacg gcgaccacgg gcgcggcggc gcacggcggc tcgtcccgca 51120
aggaggcgga ggggtacgcg gggaacgccc cgagcctgct gtccggccgc gtcgcgtaca 51180
cgttcgggct ggaagggccg gcggtcacga tcgagacggc gtgctcgtcg gcactggtcg 51240
cgatgcactt ggcggggcag gccctgcgcc agggcgaatg caccctcgcc ttagcgggcg 51300
gcgtgacggt gatggcgacg ccagaggtgt tcacgggctt ctcccggcag cgcggcctgt 51360
cgcccgacgg ccgctgcaag gcgttctccg cggcggccga cggcaccgga tggggcgagg 51420
gcgtcggcgt cgtcgtgctg gaacgactgt cggacgcgcg ccgcaacggc caccgcgtcc 51480
tcgcggtcat ccgcggctcc gccgtcaacc aggacggcgc gagcaacgga ttcaccgccc 51540
cgaacggccc gtcccagcag cgcgtgatcc gccaggccct cacgcaggga cggctgtcgg 51600
ccgccgaggt ggacgcggtc gaggcgcacg ggaccggcac ggccctcggc gaccccatcg 51660
aggcggacgc gctcatcgcc gcctacggcc gcaaccgccc gcccgaacgt cccctgtggc 51720
tcggctcggt caaatcgaac atcggccaca cccagggcgc ggccggcgtc gcgggcgtca 51780
tcaaaatgat catggcgatg cggcgcggcg tactccccgc ctccctgcac atcgaccagc 51840
cgacgcccca cgtcgaatgg ggcgacggcg cggtacgcct cctcaccgaa cccgttccct 51900
ggccggacac cgaccgcccg cgccgcgcgg gcgtgtcgtc gttcggcatc tccggcacga 51960
acgcccacct gatcctggaa caggcccccg aacccgtcga ggcgcccgag ccggagccag 52020
agtcgggggc ggtggtgccg tgggtgctgt cggggcgaag tgaagaggcg ttgcgtggcc 52080
aggctgccgc gttggccgct cggatggacg gttcggaggg ttgggcaccc gtcgatgtgg 52140
gttggtcgtt ggcgacgacg cgttcggtat tcgagcaccg cgccgtggtc gtgggctcgg 52200
aacgcgacga actgctggac ggcttgaagg cgctggcaac gggagaaatg catcccggac 52260
tcgtcggccc cggactcgcc gcccaccccg acgcctccgc ccccgttttc gtttttccgg 52320
gtcagggttc gcagtgggcc ggtatgggcg cggagctgct ggattcctcg ccggtgttcg 52380
cggagcgtat tgctgagtgt gagcgtgcgc tggctccgtt cgttgattgg tccctgtcgg 52440
atgtgctgcg tggtgatggc gcggagttgg cgcgcgttga tgtcgttcag ccggtgctgt 52500
gggcgacgat ggtctctctc gctgctgtgt gggcgtcgta cggtgtggtt ccgtccgcgg 52560
tgatcggtca ttcgcaaggg gagatcgccg ccgcgtgcgt cgccggcgcc ctgtcgctgg 52620
aagacggcgc gcggatcgtg gcgctgcgga gtcgcgcgtt gcggcggctc gccggtggcg 52680
gcgcgatggc ctcgctgggc gttggtgtgg acgccgcgca ggaattcctg cctgagggtg 52740
ttgcggtggc ggcggtgaac ggaccgtcgt ccacggtggt gtccggcccg cccgagcagg 52800
tcaccgcagc ggtggcccgt accgaggctg ccgggtatcg ggcgcggctc atcgacgtcg 52860
attacgcctc gcatgggccg caagtcgatg agatcaccga cgaactccac caagtcctcg 52920
caggcatcga acccgcaccg acaacagaag tggcgttcta ttccacggtc cgtgccgcag 52980
cgatcgacac cgccaccctg gacaccggct attgggtgac caatctccgg gaacgcgtgc 53040
ggttcgccga caccatcgac gcgcttctcg aagacggaca ccggatcttc atcgaggcca 53100
gtccccatcc cgtcctcacg gtcgggatgg aggagacctt tgaacaggcc ggaacggggg 53160
cggtggccgt accgacgctg cgccgcgatc acggtggtcg ggagcagttc gtacgcgtcg 53220
tcgggcaggc gttcgttgcc ggggcgcacg ctgattggac ggcgtcgttc gcgcaagacc 53280
ctgcgccgcg cacggtggac ctgccgactt acgcgttcca gcggcagcgg ttctgggcgg 53340
acgacgaccc caccgttcat tccggtgatc cggccgggct cgggctggtg tcggtcgggc 53400
acccgttgct gggtgccgcc gtggacgtcg cggacgagga cacgcgtctc ctgaccgggc 53460
gcctgtcggc atccggcggc ggtaccggct ggctgaacga tcaccggata ggcgacacgg 53520
tgctggttcc gggcgcggcg ctggtcgagt gggcgttgcg ggcggcggac gacgtcgggt 53580
gcgggggagt ggaggagctg gcgctccagg cgccgctggt cctgcccgcg tccgggagcg 53640
tacgggtgca ggtggtcgtg ggcgcggccg gcgaggacgg acggcgtgag gtgcgcgtcc 53700
actccgttcc cgatctcggc ggcgacgggg cgtccgagga tgtgcggtgg gaatgccatg 53760
ccgtcggcgt tctcagcccg gagccgacgg ccacgccgtc caacgggcag agcgatgacc 53820
tcggtggtgt gtggcctcct gcgggtgcgc agccgatcga cctcgaaggc ttctacgaac 53880
gcgccgcggc cgcgggctac gggtacggtc ccgccttcca aggggtacgg gccgtgtggc 53940
gcgacggttc ggatctgctg gcgcacgtcg agttgcccga ggacgcggcg gacccggccg 54000
ggttcggtgt ccatccggcc ctgctggacg cggcgctgca ccccgccttg ctcggcggca 54060
gcccggaaac cgactccgaa gcgggcgtgc tgctgccgtt cgtttggaac ggggtcgcgc 54120
tgtgggcggc cgaggcgacc agcgtccggg tcaggctctc gccgcgccgt gacggggagc 54180
gcggagtccg ggtggtgatg gccgatgccg tgggcgcgcc cgttctgagc gttgacgagg 54240
tcgtcatgcg tcctgcggac accgaacggc tccgggccgc gggtcggcgg ggcgcgggaa 54300
acctgttcgc cctggaatgg gcaccgcttc ccgcgtccac cccggacgcc gccgccggtc 54360
gcggcgtcgg cgagggcgag tgggcggtcg tgggggcgga gccggtggcg acggatctcg 54420
ttcaccaccc ggacctggac gggttggtgg cggcgctcga cggcggcgct cccgttcctc 54480
cggtcgtgct cgcggccgtg ccgtcctcgc cggacgcgct ggacatggtg acggacacgt 54540
tggcgctggt gcaggcgtgg ctggccgaac cgcggctggc cgacgcccgg ttggcgctcg 54600
tgacgcgggg cgcggtagcc gcggggacgg agcccgacct cggcggcgcc gccgtgtggg 54660
ggctcgtccg cagcgcccag acggagaacc ccgaccgctt cgttctcgtg gacgtcgatg 54720
acctggacga cgccgacgct gacgccagcg ccaacgttga ccgcgatgcc gcgtgcggcg 54780
tcgccgaggc cgtgcggcgt gcggtggagg cggatgagcc gcaggtcgcg gtgcgcggcg 54840
gacggctcct cgtcccgcgc ctggcacgga acgacgacct cggcgaaccc tccgatcaga 54900
gcggcgacgg ttcaccgccg ctcgaccccg agggcacggt cctgatcacc ggtgggacgg 54960
gcatggcggg gtcgctggtc gccgagcatc tcgtccgggc gtggggcgtc aggcggctgc 55020
tgctcgccgg ccgccgcggc ccggacgcgc cgggcgcggc ggaactggcc gaacgcgtcc 55080
gcgagctggg cggcgaggtg cggttcgcgg cggccgacgt gtcggacgcg ggcgcggtcg 55140
cagacctggt cgccggtgtc gatccggcgc atccgctgac gggtgtcgtg cacgcggccg 55200
gcgtcctgga cgacggggtg atcacttcgc agtctgcgga gcggctggcg ggggtgtggg 55260
cggccaaggc cgccgccgcg tacaacctgc acgtggcgac ggcggccctg ccgctgggcc 55320
tgttcgtgct gttctcctcg gcggccgggg tggtcggcaa cgccgggcag gcgggttacg 55380
ccgcggccaa cgcgttcgtg gacgcgctcg cggaacaacg ccggaccgcg ggcctgccgg 55440
ggctgtcggt ggcgtggggc ctgtgggagc agtccagcgc catgaccggg cacctcgccg 55500
aagcggacca cgcccgcctg gcccgctcgg ggttctccgc cctccgcacc gaacgcgcgc 55560
tcgaactcct ggacgccgcg tgccgtgcgg aacgttcacc ggtcgtggcc gcgggcatcg 55620
acgtgcgcgg ccggtcggcc gccgaggtcc cggccctgct gcggggcctc gccgggcggc 55680
cgggccgccg aaccgccgcc agcgccgagg ccgggaccgg ggtggacggc gcgcgaggcc 55740
gcgagctggc ggtgcgcctc gcccggatgg acgagcccga acggctcgac gccgtgctgg 55800
agacggtccg gggatgcgtc gcggtcgttc tcgggtacgc ctcgcccgcc gacgtgcgga 55860
ccgaggcgcc gttcaaggaa ctcggcctcg actcgttgac cgccgtggaa ctccgcaacc 55920
gcctgtcgag gacgaccggg ctgaggctgc cggcgacgct ggtgttcgac tacccgacgc 55980
cgcgggcgct ggccgaccac ctgtgcgcca agctcgtcgg ggacgccgcg gcggccaccg 56040
cgaccgcacc cgtggcgacg gccatgacgg acgagccggt ggcgatcgtc gcgatggcct 56100
gccggttccc cggcggagcc gcctcaccgg aaggactgtg ggatctcgtc gcgcggggcg 56160
tggacgcgat ggcggagttc ccggaggacc gcggctggga tctggacggc ctcttccacc 56220
cggaccccga ccaccccggc accagctacg cccgcgaagg cgcgttcctg gacgacgcgg 56280
ccggattcga cgcgggcttc ttcgggatca atccgcgtga ggcgttggcg atggatccgc 56340
agcagcgttt gttgctggag gtgtcgtggg agttgttgga gcgtgcgggg cttgatccgg 56400
tgtcgttgaa gggcagtcgg acgggtgtgt acgcgggtgt gatgtatcac gactacgcgt 56460
ccggcctgtc cgacagcggc gacgccaagc tggagggcta ctccatgctg gccagcgcgg 56520
gcagcgcgat ctgcggccgc gtcgcgtaca ccctgggcct tgagggcccg gcggtgacgg 56580
tggacacggc gtgctcgtcg tcgttggtgg cgatgcacct ggccgcgcag gcgctacgcc 56640
agggcgaatg ctcgctcgcg ctcgcgggcg gcgtcacggt catggccacc ccagaggtct 56700
ttaccggctt ctcccgtcag cgcggcctgg ctccggacgg ccgctgcaag ccgttcgccg 56760
cggcagcgga cggaacgggc tggggcgaag gcgccggcct ggtcctgctg gagcgcctct 56820
cggacgcccg ccgtaacggc cacgaggtgt tggcggtgct gcggggttcg gcggtcaatc 56880
aggacggcgc gagtaatggg ctgaccgctc cgaatggccc ctcgcaggag cgtgtgattc 56940
gtcaggcgtt ggcgtctgcg gggttgtcgt cgtcggatgt ggatgcggtg gaggcgcatg 57000
ggacggggac gacgctgggt gatccgatcg aggcgggcgc gttgttggcg acgtatgggc 57060
ggggtcgttc gggtggtcgg ccgttgtggt tggggtcggt gaagtcgaat atcggtcata 57120
cgcaggcggc tgctggtgtg gctggtgtga tcaagacggt gatggcgatg cggaacggtt 57180
cgcttccggc gtctctgcat atcgatgagc cgtccccgca cgtggactgg gagtcgggcg 57240
cggttcggct gttgaccgag ccgatggaat ggccgaccga tgatgaccgt ccgcgtcggg 57300
ctggggtgtc gtcgttcggc gcttcgggaa cgaatgctca cctcatcctg gaacaagcgc 57360
ccgaaccgaa gaacgaaccg gaggacgagt cggcgcccgc tttggacggc gttgtggtgc 57420
cgtgggtgct gtcagcacgc agcgaagagg cgttgcgcgg acaggccgcc gcgctggcgg 57480
cgcacgtgtc cggcaccgat gacgctcccg ccgatgtggg ctggtcgctg gtcacgtccc 57540
ggtccgtgtt cgaccatcgt gcagttgtga ccggcacgga tcggcaggaa ctgctgaacg 57600
gcctgggcgt cctggcgaac ggccacgaaa acccggaagt ggtccgtgca gtcgcgaatt 57660
ccgcatccgt ctccggaagc acggtatttc tgttcagtgg gcaggggagt cagcgggttg 57720
ggatgggggc cgggttgtat gagcggttcc ctgtgttcgc gggtgcgttc gatgaggtgt 57780
gcgcgctact ggaaccgcgc ttggagcatc cgttgaagga cgtggtgttc accgggattc 57840
cggagcgaga tggcctgctc gatcacacca cgtatgcgca ggcgggtttg tttgcgttgc 57900
aggtgggttt ggcgcggttg ttggcgtcgt tgggtgtggt gcctgacgcg gttgtcgggc 57960
attcggtggg tgagatcgcc gccgcgcatg tggccggagt attcggtctg gaggacgcgt 58020
gccgtttggt ggcggcgcgt gcgacgttga tgggcgggct gccgtcgggt ggggccatgg 58080
ccgccattca ggcgaccgcc gacgaactca ccgacgacct cgaacgcttc gccggacgag 58140
tcgcggtcgc cgcgctcaac accccggaca gcacggtcgt ctccgggccc gccgatgagg 58200
tggaccagct cgtcggggtc tggaccggca ggggtcgcaa gtcgaagcgg ctcgcggtca 58260
gccatgcgtt ccattcgccg ctgatggacc cgatcctgga cgagttcacc gacgcgatca 58320
gcgatctgac ctaccggcgg ccggacgtgc cgctgatcag caacctcacc ggggagcccg 58380
ccgaacccga gcacatcgcg acgcccggct actgggcgga ccacatccgg cggccggtgc 58440
gcttcgcgcc cgccatcgcc cacatcgccc cccaggcggc gaccttcctc gaactcgggc 58500
ccgaccccgt actcgccacc gccgcccagc acaccatcgg cgacgggtcc gccgaccggc 58560
cgcagccgct ggtcacggcc accctcacgc agaagcggtc cgacgaacga gccttcgcgc 58620
acgccctcgc ccaactccac acccacggtg cggacgttga ctggaaggcc ctgttcgacc 58680
tcgatcccga ttcgcgccgc cggccacgca acgtggcgct gcccacctac gcgttccagc 58740
accggcggtt ctggctggac gcgttcaacg ggcgcggcgg cgatccggcc ggactgggac 58800
tggcctcggc ggatcatgcg ctgctcggag cggcggtcga gctggccgac agcggcacct 58860
ggctactgac cgggcggatc tcggcgcgtt cgcatcggtg gctgggcgag cacgtcatgg 58920
ccgcgacccc gctggcgccg ggagcgatgc tggtggagtg ggcgctccac gcggccgacg 58980
cggtgggctg cggaggcgtc gaggaactca tgctgcaagc accgctcgta ctgccggaga 59040
cgggcggcgt ccgcgtccag gtcgtggtgg ggcccgccga cccggacgga cggcgcgaac 59100
tccagatcta ctcgcggccc gacgagaacg acgacgacgg ccgtgcgaac gcggcgtggg 59160
tctgccacgc gcaaggcgtg ctgagccccg ccgaacccgc ccccgggcag ctgaacgggt 59220
cgcacgggcc ggacgtggtg ggcggggcat ggccgccgcc cggcgcgcgg cccgtcgcga 59280
tcgacggcct ctacgaacgg gccgcgtcgg cgggctacga gtacggcccg gcattccaag 59340
gcgttcgggc cgtttggcag cacggcgcgg acgtattggc cgaggtggaa ctgcccgagg 59400
ccgccggaga gaagaagggc ttcggcatcc acccgatcct tctggacgcc gctctccacc 59460
cgctcctgct cagcatggac gacgcgggga gcggtgcggc ctggctgccg ttcgcatgga 59520
acggagtgac gctgtgggcg agcgaggccg gaacggtgcg cgtgcggctg accccgcagg 59580
acggagccga gggcgaccgc accgtacggg tcgcgatcgg tgacacggtg ggcgcaccgg 59640
tcctcaccgc ggactcggtc gtgatgcgcg aggccagcac ggaacacctc cgcgcggccg 59700
ccggaacgaa ccgggacggc agcgacaccg acggactgtt caccacacgc tggactccgc 59760
tggacaccgc cgccactggt gacaccggca acggcgacga cggcgacgac ggcgcgtggg 59820
cgttcctcgg cagcgagcac atgcgcccgc agcagaacgg cgagacccgt cccgaatggc 59880
tgtggcacga cgatctggcg gcgctcgtaa cggcgatcga ggccggcgcg ccggttccgc 59940
ccgtcgtgtt cgccgaggtg tcagccggtg ccggagacgg tggcaacagt ggtgagaacg 60000
ccgctgcgcg cgggctggcc acggcgcggc ggaccctggc gctgctccag gactggctcg 60060
ccgaaccgca cctggccgac gcccgcctcg ttctcgtcac ccgaggcgcg gtcaccaccg 60120
gggaactgga cggacccgcc gccggcggcg acgccgaagc gccgatcgcc ctgagcgacg 60180
cgggcgtgtg ggggctggtg cgcaccgccc agtcggagaa ccccgaccgc ttcgtcctgc 60240
tcgacctcga ttccggtacg gcggacgcca ccggcgacgg agccgtccgc gccgtgtggg 60300
cggcgatcgc ggcggacgaa ccgcaactcg cgctccgttc aggacgcgcg ttcgtccccc 60360
gcctgacgcg tacgggcacc gacgacggaa cgctgcggct accgcagccg gaacgggcct 60420
ggcacctgga cgcgggaggc accggcaccc tcgacggcat cgccgccgtc ccctgcccgg 60480
acgtcctcga accgctggcc gccggccagg tccgcatcga ggtacgtacg gccgggatca 60540
acttccgcga cgttctcgtc agcctcggca tggccccggg ccagaccggc ctgggcagcg 60600
aaggcgccgg aaccgtcgtt gaagtcggcc cgggtgtctc gggcctgtcg gtcggtgatc 60660
gtgtgatggg tctgttcgag ggcgcgttcg gcccggtgac cgtggcggac gaacgaatgg 60720
tggtccgtct tccggatggt tgggatttcc gttcaggtgc cgctgttccg gtggtgttct 60780
tgacggcttg gtacggcttg gtggaactgg ctgggttgaa gtcgggtgag tcggtgttgg 60840
ttcatgcggc gacgggtggt gtgggttcgg ctgcggtgca gatcgcccgc catttgggtg 60900
cggaggtttt cgccacggcc agtcccggta agcacgctgt tctggaggcg atgggtatcg 60960
acgcggccca tcgcgccagt tcccgcgact tggagttcga gcggacgttc cgcgccgcga 61020
ccggtggtcg tggtgttgat gtggtgctca atagtttggc gggggagttc accgatgcgt 61080
cgttgaggtt ggtggctgag ggtggtcggt tcctggagat gggcaagacc gacatccgcg 61140
aacccgaaca catcgccgca gagcatccag cggtcggtta tcaggtgttc gatctggtga 61200
cggacgccgg gccggatcga atcagtgaaa tgttccgagc gttggcggag ctgttcgccg 61260
aaaaggtatt gcggcctgcg ccgactcaag catggccgct gacccgcgcc cgcgaagcgc 61320
tgcgggtgat gagtcaggcc aagcacaccg gcaaactcgt tctggacatc cctccggctc 61380
tggacccgga cggcacggtg ctgatcaccg gcggcacggg aacactcggc ggcctggtcg 61440
ccgagcatct ggtccgttcc gagcaggtcc ggtcgttggt gctggccagc cgcagcggcc 61500
cggacgccgc cggagcggcg gagttgaagg cccggctgga ggaactgggc gcggaggtac 61560
ggatcgtcgc ggccgacgtg tcggacgccg acgccgtgac ccgtctggtc gcggaggcgg 61620
acgcggacac cggacgtccg ctcaccggga tcgtgcacac ggcaggggtg atcgacgacg 61680
cggtgctcgc ttcgcagacg cctgaacggc tggcacgcgt gtgggacgcc aaagcctccg 61740
ccgcgcacca tctgcacgcc gccaccgagc atcttcggct cggcgcgttc gtcgtgttct 61800
cctcggccgc ggggacgttc gggagcgcgg gacaggcgaa ctacgccgcc gccaacgcgt 61860
tctgcgacgc gctcgccgcg catcgccacg ccaaggggct gcccgcggtc tcggtggcgt 61920
gggggctgtg ggaggccacc agcggaatga ccggggacct ggccgacacc gacgtggccc 61980
ggatgagccg cgccgggatc aggcccctga gcgcccggca cgggctcgcg ctgctcgacg 62040
ccgcatcgcg gcacggcgcg gcgaacctgc tcgccgccga cctcgacgtt cgcgctctca 62100
acggtggatc cgccgaggcc gggcaggggg cggtgcccgc cttcctgcgc gcactcgccg 62160
ggggcggcac cgcgacccgc cggacggccg ccaacgggca ggcccccgcc gacttcgccg 62220
cacggctcgc cgcgatgccc ctggaggaac ggcagcgcgt cctgctcgac ctggtccggt 62280
cgcacgcggc ggcgacgctc gggcactccg acgtgggcgc cgtccagacg gacgcaccgt 62340
tcaaggaact gggattcgac tcgctcaccg ccgtcgaact gcgcaacaaa ctgtcggcga 62400
gcacgggcct gcggcttccc gcgacgttcg tgttcagcca tccgacccca tccgccatcg 62460
ccaccgaact gcacgacaag ctctgcccgg acgcgccgga gccctccgcg cccgtcttcg 62520
ccgagctgga acggcttgag gcggcgatgg ccgggctgac gctcgacgcc gagacgcgcg 62580
gcgggctcgt caaacgcatg cagaccctga tgtggaggct gggcgacggc gccgcggagg 62640
ccgagcaggc caccgcggac gaggacctcg actccgcctc cgacgaagag atattcgcgt 62700
tcatcgacag ggaactgtga gttccgccgg aaagcgagga ccgatgccgg gcacggaaga 62760
gaagctgcgc cagtacctga agcgggtgac agtggacctc gggcaggcac gccagcgtct 62820
gcgcgacgtg gaggaacgga accaggagcc catcgccgtc gtcgccatgg cgtgccgcta 62880
cccgggcggc gtgcgcggac ccgaagacct ctgggacctg gtggcctcac gcggcgacgg 62940
catcacggag ttccccgacg atcgcggctg ggacctcgac ggcctctacc acccggaccc 63000
cgaccactcc ggcaccagct acgtccgcga gggcgggttc ctcgacgaca ccgaccggtt 63060
cgacgcggcg ttcttcggca tcagcccgcg tgaggcgatg gcgatggacc cgcagcagcg 63120
catgctgctg gaggtgtcct gggagctgtt cgaacgcgcg ggcatcgcgc ccgccgcgtt 63180
gaagggcacc ccgaccggcg tgtacgcggg gatgtccagc caggactaca tgacccgtac 63240
gccccgcatc cccgaagggt tcgagggcta cgccaccacc ggcagcctca ccagcgtcat 63300
ctccggccgc gtcgcctaca cgttcgggct ggagggcccg gccgtgacgg tggacaccgc 63360
ctgctcgtcc tccctggtcg ccatccacct cgcgtgccag gccctgcggc aggacgagtg 63420
cacgctcgcg ctggccggcg gcgtcaccgc cctcaccacc ccgacggcgt tcgccgagtt 63480
ctcccgccag cgcggcctcg ccccggacgg ccgctgcaaa gccttcgccg ccgccgccga 63540
cggcaccggg ttctccgaag gcgtcgggct cgtcctgctg gaacggctgt cggacgcacg 63600
ccgcaacgga caccacatcc tcgcggtgct gcgcggctcc gccgtcaacc aggacggggc 63660
gagcaacggc ctcaccgccc ccaacgacgc ctcgcaggaa cgcgtcatcc gccaggccct 63720
cgccaacgcc cgcctcgccc ccgaccaggt cgacgcggtc gaggcgcacg gcaccggaac 63780
gaccctcggc gaccccatcg aagcgcaggc gttacaggcg acgtacggga aggaccgccc 63840
ggcggaccgc ccgctctggc tcggctcggt caagtcgaac atcggccaca cccaggccgc 63900
cgcgggcgtc gccggcgtca tcaagatgat catggcgatg cggaacggcg tcctgccccc 63960
ctcgctgcac atcgacgagc cgacctcgca tgtggactgg gactccggcg cggtgcgtct 64020
tctcaccgag ccggtggagt gggtgtgggg tgggcgtccg cgtcgtgcgg gtgtgtcgtc 64080
gttcggtatt tcggggacga atgcgcatct gattgtggag caggcccccg agtccgaggc 64140
tgaggtcgag gtgttggaag gggctgccgg tcctgacacg gaggtggatg agtcggtgcc 64200
gggtggggtg gtgccgtggg tggtgtcggc gcggtcggtg gcggggttgc gggatcaggc 64260
ggcggtgttg tcggcgcatc tggccgggcg ggaggtgtcg ccggtgcggg tggggtggtc 64320
gctggcggcc acgcgttcgg tgttcgagca ccgcgccgtc atcaccgccc accaccgcga 64380
ggagttcctg gaggggctcg atgcgctggc cgccggaggc cagcaccccg gtctgatcac 64440
ctcgcctgct ggtggagcgg gtggtggtgg ggtggtgtgg atgttcagtg ggcagggcag 64500
tcagcggccc gggatgggcg cgggactgta cgcacggttc ccggtgttcg ccgccgcctt 64560
cgacgagatc tgcggcctgc tcgacccgca cctgccgcac ccgctccgcg aggtggtgtt 64620
caaccccgac ccccaccaag accagcagag cgggctgctg gatcacaccc tgtacaccca 64680
gaccgcgctg ttcgccctgc acatctccct ggcccgcctg ctgcaccacc acgaccacac 64740
cccccacacc ctcatcggac actccatcgg agaaatcgcc gccgcccaca tcgccggcat 64800
cctcgacctc cccgacgcct gccacctcat caccgcacgc gccaccctca tggcccaact 64860
ccccaccggc ggcaccatga ccgccatcca ggcgcgggag gacgagctcg cggccgaggt 64920
cgaggagtcg ggcgggacgg tcgccatcgc cgcgctcaac acgccggaca gcacggtgat 64980
ctccgggcag gccgatgagg tggcgcggat cgccgccggg tgggaggagc gggggcgcaa 65040
gacgaaggcg ctggcggtca gccacgcgtt ccactcgccg ttgatggacc cgatgctgga 65100
cgagttccgc gaggcgatcg aggggttgag cttccgggcg ccgtccgttc cgttgatcag 65160
caccttgacg ggggaactgg cgggtgacga gatcgccagc cccgcgtact gggtggatca 65220
ggtgcggcgg gcggtgcggt tcgcgcccgc cgttgcccgc gccgccgaac ggggcggggc 65280
gttcctcgaa atcgggcccg atcccgtgct ggccaccgcc gcgcagcaca cgctgtcgga 65340
gccggcgtcc ggccgggagc ggcccgagcc gctcgtcacc gccgtgctcg accggcaccg 65400
tccggacgac gaggcgttcg tgcgcgcgct cgccgaggtg cacacgcacg tgcggagcag 65460
cgcgatcggc tgggcgcggc tcttcccgga cgacccggcg cctcgcgcgg tcgaccttcc 65520
gacgtacgcg ttccagcggc agcggtactg gatggcggac gtgacgccgc cggagtccgc 65580
cgccggaggg gacgcgtccg aggagcggtt ctggggcgcg gtcgagcgcg gggacctgga 65640
ggcgctgtcc gacacgctgc ggctgccgga cggcgacggg cagcgggcgt cgctgggcga 65700
gatcctgccg gtgctgtccg ggtggcgcag ggagcgccgg gagcgttccg cggtggactc 65760
ctggcgttac cgggtgacgt ggaagcggct gaccggcctc gacggcggta cgccgtccgg 65820
gccctggctc gtcgtcgcgg cgggcggggc cgacgggtgg gcggacgcct gcgggcgcgc 65880
cctgggcgag gaggcgcacc gcctcgacgt ggacggcgcc gttgacaggg aagggctggc 65940
cgaatggctg cggtcccggt acgccgacgg cgcgccgccc gccggtgtgc tgtccctcct 66000
ggccctgcgg gacgggccgg aaacgggtgc cggaggggcg gcgggcacgc tggccctgct 66060
ccaggcgatg gtggacacgg ggatcggcgc gccgctgtgg tgcgccacgc gcggcgcggt 66120
ctcggtcggc gactccgacc tgctggagtc ccctgcgcag gcgcgggtgt gggggctcgg 66180
ccgggtggcc gcgctggagc atcccgacct gtggggcggg ctggtggacc ttccgccggg 66240
tccggacggg ttggacgccg cgcggctgcg cgccgttctg accgggacgg cgggggagga 66300
ccaggtggcg ctgcgcgcga acggcgcgtt cgggtgccgg gtggtgccgt cgcccgccgg 66360
ggacggcgaa ccggggcggg agtggtcgcc cgaggggacc gtcctcgtca ccggcggcct 66420
cggcgagccg gtcgcgcgca tcgcgaggtg gctcgcggag ggcggcgcgg aacgcgtggt 66480
cgttccggac cccggcggga cggatgcgcc cggtgctgga gagttggcgg acgagctgtc 66540
cgggctcggc gccgagctga tcgtcgcgcg ctgcgggccg gacgacccgg cggccgtgaa 66600
ggagctcgcc ggacggctgt cggcggcggg cgcgcggatc gggatgatcg tgcacgcgcc 66660
cgctccggcc gagccgggcc cgctggcgga gctggcgccc gccgcgctgg aggcgttccc 66720
ggccgaggac gtcggcgcct ggcggcgcgt cggggagctg tgcgggatgg agccggacga 66780
gccggcggtg tgcttcacgt ccgtcgcggc gctgtggggg agcacgggcc agggtgcgca 66840
cgccgccgcg ggcgcccacc tcgacgcgct cgccgggccg ggccgtcccg agggcgtcgt 66900
gtccgtcgcg tggggcgcgt gggacgtgcc cgccgacacg gacgaacggg gcgcccgcgg 66960
cgcggacggc ctgcgcaggc agggcctcca gccgctcgac ccgaggctgg cgctgaccgc 67020
gctggaacgc gtcctcggga acggcgaccg gcggatcgcc gtcgccgacg tcgcgtggga 67080
ccggttcgcg ccgctgttca cgctggcgcg gccgagccgc ctgttcgagg acgtcccgga 67140
ggcgcggcgg gcgatcgagg ccgcgcgggg ccccagcgac gacgaggccg cggaccgtac 67200
ggcggagctg cgcagggagc tggcggccgt caccgcggac gagcgggccg cccggctgct 67260
ggcgatggtc cgcacggacg cggcggccgt gctgcggtac gaggcggcgg acgccgtgga 67320
cccggacctg ccgttcaagg acctcgggtt cgactcgatc gcggcggtcg gcctccggaa 67380
ccggctgcgc gcctcgaccg gcctgcggct gcccgcgacg gtcgggttcg actacccgac 67440
gccgagggcg ctcgccgggt acctgctggg gcgcgtcctg ccggaggagt ccgggaccgg 67500
gcacgccgcg ttcgggcacg tcgaggagct ggacgcggcg ctcgccgagc tgccgctgga 67560
ggacccgcgc cgcgccggcc tgatgaaccg gatgcgggcc ctgctgtgga agtacgagcc 67620
cgacagcgcc gcgagcgagg cgggggacgg agacggcgaa gaggacctcg ccgcggcgtc 67680
cgccgacgac atgttcgcgc tcatcgaccg cgaactgggc acctgaacgc atgagaaggc 67740
cccggcctcg cgcggtgcgc ggggccgggg ccggtgcgtc ggcgggacgg ctcagttggc 67800
ggtgcggtcg cccagcgcgg cgcggcgctg gacgttgcgg gccaggcgga ggaggtcctc 67860
cgcgacctcg tcggggtgct gggcgtgcag gtcgtggtcg ccgccgacgt actcgcggac 67920
ggtcgcgtcg ggcagcgcgg ccacggccgg ctcgacgtac gagcggacgc gttcgaagag 67980
cggctcccac cgccggttcg gcttcgggac ggccggcatc agcagggtcg ggacggtgat 68040
ggcgggatac cactgcgcgg gcgggtcgtc ccagatgctg cgcaggatcg cgttgcgctg 68100
cgtcggggac aggcgcggcg tcagcgcgcc gtcctcgtgc acccgcagcg agtgcagcga 68160
cgcctcgacg gcctctgccg accagtcggg gtagacggcg cgctggtagt cgcgcatggc 68220
gtcgagggtc gccccgttca ggtcgacctc cgacatcgtc agcagcccgg cgaactgctc 68280
ccacgagtcg cacacgacgc tcgcgtgcgc ccacccgccc tcgatcagcg ccaggcccgc 68340
gaccaggccc ggacgccggg cggccagccg cagcgacacc agggcgcccc acgagtgccc 68400
ggcgacgacc gcgcgggtca cgccgagggc gtccgcgacc gccgcgatgt cggcggcggc 68460
ggtctcgttg ccgtagtcgt cgtcggcggg gtcggagtcg ccgtggccgc gctggtccac 68520
cgcgtacacc gggtggccct cggcggcgag ccggtccgcg accaggtccc acatccgggc 68580
gttggagtcc aggccgtgca ggagcaggaa cggcggtccg cccgtgccct cccggtgcct 68640
gacgtgcagc ttcaccgcgt cgcccagcgg tacggaaatg tccatgtgaa agtcgtcccg 68700
agagtagtcc gccggagtgc ggcgcggagg aaacgatgtg ccggagggcc ccgcggcgcg 68760
ggggcgcccg cgtcaggcgg ggccgtgccg ggccgccagc ctctccagct ccggaaccag 68820
cccgttcggg gtcggctggt cgagcacctc cctcctgatc cgctcggcgc ccgcccggaa 68880
cgacgggtcg tccagcagcc gcgccaggct ctcccggacc ttcggcccgg tcacctcgtc 68940
cggcgggatc gacagcccgg cgccggtctc ggcgaggtag gcgcgcttgt ggagggcgtc 69000
gggcaccttg gtgatgagga gctgcggcgt cccggccagc agcgcgttgt agaacgtccc 69060
ggcgccgccg tggtggatca tcgccgcgca ggtcggcagg accgcgtgga acggcgcgaa 69120
ctccacgatc cgcgtgttcc ggggaacgcg gtccagcttc tcgcggaacg ccgtgggcag 69180
cgtcagcacc agctccatgt cgaggccggc gaccgagtcg agcacctcct gcacccgctc 69240
gatcgacatg acctggagct cctgccagtc gctcatcgac acgccgagcg tcatcaggac 69300
gcggggcgcg ggcggcggct cgttcagcca gtccgggatc acggcggggc cgttgtacgg 69360
gacgtagcgc aggttgaggt ggctcgcgcc gtagtccccg ccgaacgagg gcgggaactg 69420
gtcgatcgtg gcgtgcgcgt tgacgagcgt ctcggagtac tcgtagccga acttctccgt 69480
ccacgagccg agccactccg ccagcgggtc gcgccgccgt tccggcggct gctgcgccat 69540
cgtccagagg aagccgctgc ggagctggaa gctcagatcc acggtgaacg tcacgcgcgc 69600
gtgcgcagcg ccgaccgcgc ccgcggcgac gggcccgcga gggtgagggc gtcccagacg 69660
acgagccggg gccgccacga ccggctgaac gccaccaggt cgtcgaacag cgtgtcgttc 69720
gcgatcctgg cccctcggca cggcgacggt ctcgtagagc cagaccatgt cctcccagga 69780
gagcttcgcc cgcgggtcgc cgatggcacc gcccgcctgc ccgccgctgc cggagggctt 69840
cggcgcggtg gacgtcgcgc cgccgagggc gcccgccagc cgccccatgt cgagcggcgg 69900
cagcgtcccg tcctcctccg cgcgccgcgc ccggtcccgg atcggctcgt ccgagccgac 69960
cggcacggcc gtcagcccgg accgggtgac ggtgtcgacc agctcgggct cgcaggcgac 70020
gcgcacgtcg tgcccggcgg tctggagcga ccaggccagc ggcacgaggt tgaagaaatg 70080
cgtgtgccag gtcggcgtga cgaacaggac gcgcacggcc accgtctcct tcggggggaa 70140
tgggctgtca cggtcaccgg ccgggcttgc gcgccgacgt gatgacgtag tcccgcgccg 70200
ccgcccgcgc ctccacctgc ctcaggtggt agcggatggt gttcctcggc tggaacgacc 70260
ggtagcgggg gtcgcggagg agccgcgcgc agtgcgcccg cgcgaacacg gcgttggccg 70320
cgatcgtcct gtcggtgacg tcccgcacgt cgacgtccac gaacccggcc tcggcgagcc 70380
gtgcggcgta gaccgagcgc gggtaccagt tcgcgtccgg gatgatgcgc ctgcgccgca 70440
cctcgtccag ccgcagcacc aggccgccct tgccgcccgg cgcctcgcgc ggcagcgggt 70500
cggtcgtcgc gaggacgccg ccgggccgca gcacccggaa cgcctcccgg aagaacacct 70560
gccgggtgtt gaagtgcgcg gacgactcca gcgccacgac ccggtcgaac gagccgtcct 70620
cgaacggcag cgaggtcgcc gacccgacgc gcaggtcgag ccggtcgtcc aggtccagct 70680
cccgggtgcg gcgccgcgcg gcgtccacct ggcccggggt gatgttcagg ccgacgatct 70740
ccttcggctc gcgggtccgc agccagtgga agtcctgctc gccgaagccg aagccgacgt 70800
ccaggacgcg gtccccggcg gtgatgccgg ccgcctcgcc gagccgctcc gcgagcgcga 70860
cgcacgcggc gtcgtggtcg gtgcagcccg gctcccagta gccgtagttg aggtagccgt 70920
tctcgatgcc gagcaggttc ggcgcgccgc cgctggagag ccgttcgagg agccggtaca 70980
tcgagcgcac ccacttctcg gtgaacggcg aggagccccg cgtcccggcc gggcccggcc 71040
cggtccgggg gccgtcgccg cggccctcac tgatcgttcc gccggtcatc gtcagccttc 71100
cgctctgtcg ggtccgccgg ggccgtgctc cgcgcgaccg cccgcggacg gcgcgtccgc 71160
ggacggcggc ccgagcgtca cgggcagccg ggcgagctgg tgggtgatgg ggatctggtc 71220
ccgttccagc cggtccggcg gcacggcgag cgccagctcc gggaaccggg tgaacagcgc 71280
cgagaacgcg acctcggcct cctggttcgc gagcgccgcg cccaggcagt agtgcggccc 71340
gcgcgcgtac gcgaggtgct gcgcgtcggc ggcgccgcac ggccgctcga cgtccagccg 71400
gtccccgtcc gggaaccggc gcgggtcgcg gttggccgag cccagcacga gctggacgcg 71460
ttccccctgc cggatcagcg tgtcgccgat cgtgacgtcc tgcgtcgcgt accgcagctt 71520
cgcgatgacg acggggctgc cccagcgcag cagctcctgc accgcgccgg gcatcagcgc 71580
cgggtcctcg cgcagccgcg cgagctggcc ggggtgcgcg agcagcgccg cgatgccgtt 71640
gccgagcatg tgcgacgtcg tggtgtgccc ggcgatcatc agggtgagca ccatggtcac 71700
cagctcggcg tccgacagcc tgccgccgcc gtcgtgcgcg ccggtgaggg cggtgagcag 71760
gtcgtcggcg ggctcggcgc ggcgccgctc gaccagctcg cggatgtgcg cgttcatctc 71820
cgtgacgatc tcgccgagcc gccgcggcgt ggagtcgccg aaccggcggc tccaggcgtg 71880
ccaccgcggc cggtcctcct cggggacgcc gagcagctcg cagatgacgc tgaccggcag 71940
cgggtaggcg aagtgctcga tcaggtcgac gacgccgtcc accgcccggt ccggcagggt 72000
gtccagcagc tcgccggtga tggcctccac ccgcggcctc agccgcgcga tccgccgcgg 72060
ggtgaacgcc cgcgccacca gcccgcgcag ccgcgtgtgg tcgggcgggt cgaggtagac 72120
caggttcccg gcgaggtagg ggacgacgtc ctcggggatc cccaccgtcc gcgccaggaa 72180
ctcggcgtgc tcgtcggtcc ggcccggcag cgacccggcg ttgctggcga accgccggtc 72240
cagcaggacg gcggacacgt cgtcgtgccg ggtgacgatc cacaccgggc cgccgtccca 72300
cagccgcccc cgggcgaccg gggcctcctc cctgatgcgg ccgaacgcgc cgtaggggtc 72360
ggccaccagt tcggggtcgc tcagcacgtg ctcgccggac gtgcgcgggc cggtcgtgcg 72420
ctcggcggtc atccagggtc tccttcggac gtcgttgccg gaccggggcg cgcggcgacc 72480
cggcgtgccg ctaccgcgcg gcgccgtccc cggactcggc gagcagctcg gccatgaacc 72540
ggccgaggtg caccggcgtc gggtgctcga tcaccgcgac gagggggatc tcgatgccgg 72600
tgagcgacat caggttgcgg gtcagctcca gcgccttcag cgacgtgagc cccttttcca 72660
ggaagttgct gtcgaggtcg atgacggact ggtcgagaac ggcggcggcc tgctcgcgga 72720
ccgtttctcc caagatccgc tcgcgttccg gcggaccggc ggcggcaagc ttctcctgga 72780
gttcggtgcc gtcgagaacg gcctgtccgt cgctgtcgtt ggtcgcttgc tggtgcatgc 72840
gtgctgtccc gtcctgtcgt tgtcgaagcc ggcccgcggc gcctcgtacc gccgtcggcg 72900
ggcggcgccg ccgttccggc ccgggtcgtc gcgagcctcc cgacccactg tggggccgcc 72960
ctctaaaggc acgccaaata ggccgcccgc cagcgcgcac accccattcc gcggacccgc 73020
gaagggcttt agccccgcat taacggaacg gcggaacatt gagcggcggg gtgaagtgag 73080
cctggagccc cgccggacgc cgaaacgctc aggaggagtg tcatgacatc ccaatcggct 73140
gcgccggcga cggccggcca gatgtccgac tactacagcg cgatgggcgc gctgctacag 73200
atggcgtggg gggacaactt ccatttcggc tactgggacg ggccctccga cacggccacg 73260
atcgacgagg ccaccgaccg gttcaccgac atgctcaccg aacggctccg cgtcggcccc 73320
ggcgaccggg tgctggacct cggctgcggc gtcggcaagc ccgcgctgcg catcgcggcg 73380
aggaccgggg cgagcgtcgt cggcgtcacc atcaacgccc ggcacgtcga gctggccacc 73440
gagcgcgccc gcgccgaggg ccgcgccgac caggtgagct tccggcacgc cgacgccatg 73500
gacctgccgt tcgaggacgg ctcgttcgac gccgtgctgg cgttcgagtc gatcatccac 73560
atggaccggc cgaccgtgct gcgggaggtc gagcgggtcc tggtccccgg cgggcggctc 73620
gcgctcaccg acctcacccc gctcaccgac gagtcgaaca ccccgaagtc gttccgcagc 73680
ctcatgggcg cctcgcccgg ggacgagagc ccctccgggc agggcggcga ccccgacatc 73740
gccacgctca tctccgccgg cgcctggccc ggcctgctca gcgacgcctc cctcacgctc 73800
gacgagctga acgacgtcac cgagcacacg cagggcacgt tcatccggct gttcgagaac 73860
ttcttcaagt accgccgcga gttcgagcgc gagcacggga tcaccgtgga gcaggtcctc 73920
gactcggcca agtcgggggc gcccacggac ggcgtcggct gcctcgtcgt cgccgcgcac 73980
aagccctgac gcgtcgtccc gcccggccgc acgcgcgcgg aggccggagg ccggtcatca 74040
ccggcctccg gcctccgcgc gtccggcccg ttcggccctt cacggggcgg gcgcgcgaag 74100
gccccgcggc ccttggggca cgcggggcga acggagcggg cggagcgccg gtgtcaggtg 74160
cgggacgccg ccctcaggcg ggctcgccga ggaccacggg gacgcggacg agctgccggg 74220
tcagcgggat cggcctccac tccagctcct ccggcgcgac ggcgagcgcg aggtccgggt 74280
agcggccgaa cagcgccgac agcgccacct cgatctcctg gttcgccagc cccgcgccga 74340
ggcagtagtg cgcgccgcgc gagtagccga ggtgctgcac gccgccgtcc acgggccggg 74400
tgacgtccag caggtccggg gcggggaacc ggcgcgggtc ccggttggcc gaccccagca 74460
cgagctggac gcggtccccc tgcctgatca cggtgtcgcc gaacgcgatg tcctccgccg 74520
cgtaccgcag catggcgacg atgccgggcc cgcacaggcg gaccagctcg tgcaccgcgc 74580
ccggccacag gcccgggtcg gcgcgcagca gcgcgagctg gtccgggcgg gccgcgagcg 74640
ccacggcccc gttcgcgacg agctgcgggg tcgggagctg cgaggcgatc atcagggtga 74700
gcaccatggc gaccagctcg gtgtgcgaga gcaggccgcc gtcgtcgtcg tgggtgcgga 74760
tgagcgcgct caccaggtcg tcggcgggct cggcgcggcg gcggccgacc agctcgcgca 74820
ggtgcgcgct catgtcggcg agcatgcggt tcagccgcgc cgggtccatg ctgacgtagt 74880
cgtcgctcca cccgcgccac agcggccggt cctcctcggg cacgccgagc agctcgcaga 74940
tcacggtgat cggcagcggg tacgcgaaat gctcgatcag ctcgaccgcg ccgccctccg 75000
cgcggccggg cagcgcgtcc agcagctcgc gggcgatgtc ctccacgcgc ggcctgagcc 75060
gcgccacccg ccgcgcggtg aacgcccgcg agagcagctt gcgcagccgc gcgtggctct 75120
ccgggtccgt ccgcacgagg tccccggcga ggtacgggac gagctcctcg gcgacgccga 75180
tcgcgcgcag cgcggcacgg tagtcgtcgg ggcagcccgg caccgccgag gagttcgtgg 75240
cgaggcgccg gtccatcagc agcgcgctga cgtcctcgtg ccgcgtcacg atccagaccg 75300
gccggccgtc ccatgagccc tggacgatcg gggcctgctc gcggatgcgg ccgaacccgc 75360
tgtacgggtc gaccatcagc tcgtgccagg tcagcgcgta gtcgtccgtg gtcatcgggg 75420
tcctccttcg gatgcgtacg ccgcgaccgg cgcgccgtcc gctgggggcg gctcctggag 75480
ccagaagcgc cggcgctgga aggcgtacgt gggcagggga acggcgcgcg ggcgcggccc 75540
gccgccgaag agggcggccc atccgaccgt cgcgccggag gtgtggagcc gagcgagggc 75600
ctcgaccagg ccgcggccga cggcacgtcc gtcgccggga acgagccgtt cggcagaacg 75660
cggcgcgtcc gggaagggct cggggccgaa gtccaggaac gcgcccgcct cagcgaacgg 75720
cgcggagccc gcgtcgagca ccggaacggt cggcttcccg cccgtcagcg cgagggcgtc 75780
ggagccgggg gcgtcgcgct gggcgacgag gcggcaggcg tccggcaggt cgagcgcgcc 75840
cgcgaggtgc gcggcggaga cctccccgac gccgcggccg accaccgcgc ccggccggag 75900
ctccgccgcg agcagcaggc gcgccagggc gacgtgcagc gcgaaggtcg cggcgtcccc 75960
gccgagcgcc ccgccgaacc gttcgctcac ctcgtcgtga gcggcggcga acgcggggaa 76020
acgttcgcgg agggcggtgg tcgcctgcgc ggcccgttcg gcggagccgc cgaacgccca 76080
cgtaccggcg tcggagtccg cgtcgggacc ggcggtcgct ccggtgccgc cgatcacgtt 76140
ctcgtggggc gtgccgttcg cgagggcgtg cagcccggcg agcagctccg cgcggtgcgc 76200
gcccacgacg accgcgcggt cctcgaatac cgaacgggtc gcggcgagcg accacgcgac 76260
gtcctccggc gcgaggtccg ggtcgttccc gacgtgcgcg gccagcgccg ccgcctgcgc 76320
ccgcagcgcc gccgcgccgc gcgccgacac cggccaggcc agcgtcccgc ccggaggcgg 76380
ggaggggggc cggggctcgg gcggctcggg cggcggctcc tccaggatca cgtgcgcgtt 76440
cgtcccggag ttgccgaacg ccgacacccc ggcgcggcgc gggcgaccgc cgcgcggcca 76500
cggcgtcggc tcggtgacca gccgcaccgc gcccgcccgc cagctcacca gcggggtcgg 76560
gcggtcgatg ttcagggtgg cgggcagcgt cgcgtgccgc atcgccagca ccgccttgat 76620
cacgctcgcc atcccggccg cggcctggga gtggccgatg ttcggcttga ccgtgccgag 76680
ccacagcggc cggtccgccg ggcggtcctg gccgtacacc gacaggacgg cctgcgcctc 76740
gatcgcgtcg ccgatcgcgg tgcccgtgcc gtgcgcctcg accacgtcca cgtccgccgg 76800
ggacagccgc gcgtcggcga gggcctcgcg catgagccgc tgccggccgg ggccgtgcgg 76860
cgcggccatg ccgctcaccg cgccgtcctg gttgacggcc gtgccgcgca cgagcgcgag 76920
gacgcggtgg ccgttgcgcc gcgcgtccgc cagccgctcc agcaggacga gcccggcgcc 76980
ctcgccccac accatgccgt cggacgccgc cgcgaacggc ttgcagcgcg cgtcgggcgc 77040
gagctgccgc tggctggacg acagcaggaa cgtgctcggc gtgtacatca cggcggcgcc 77100
gcccgcgagc gcgaggtcgc actcgccgtc gcggagcgcc cgcgacgcca ggtgcatcgc 77160
ggtgagcgag gaggagcacg cgctgtccag cgcgaccgcg ggcccgacca ggccgagcgt 77220
gaacgccacg cggcccggcg cgccgccgcc cgcgttgccg gtgccgaggt agggcatcag 77280
cgcggcgggg acccgcctca gccgggtgcc gtagtcgtgc agcgcgaccc ccgcgtacac 77340
ggcggtggcg ctgccgcgca gcgtccgcgg gtcgatgccc gcgctctcca cggcctccca 77400
cgcggtctcc agcagcagcc gctgctgcgg ctcgatcgcc tcggcctcct cgtccccgat 77460
cccgaagaac gccgcgtcga agtcggcggc gtcgtcgagg aacccgccgc cgcgcgcgta 77520
ggtcgtgccg tggtggtccg ggtcggggtg gaacagggcg tccaggtccc agccgcggtc 77580
ggcggggaag ccggtgacgg cctcgcgccc ctccgcgagc aggtcccaga accggtccgg 77640
cgtccgcgcg ccgccgggga agcggcacgc catcccgacg acggcgaccg gctcgcgccc 77700
ggcgtcctcg gcccgccgca gccgccgcct cgcctcatgc agctcggcgg tcgtccacct 77760
caggtactcg acgagctccg cctcccgccc cgcgccgccg cccgcgctac cgcccgcgcc 77820
gttgcccgcg ccgttgcccg tgcgggcgtc cgcgccgccg ttcgcggggg agcgcggcgc 77880
gccgccggag gtggggtcgg tcaccggggg acgctcctca cgcccgcgac ggcgcgccgg 77940
tccaggttca cgagcgcgag ctcgtgccgc cgcagcggcg gctcgcccag ctcggggagc 78000
agccgcgccc cctgcatcag cgacatgaac cgcgccgtgc cgagctccga ctgcgggacg 78060
gtcatgctca tcacgtccgt caccgcgtcg ctcacgtcgg cggagcggat cgccatcgtc 78120
gcgaccatgt cggcgaagcc ccgcatgacc gccgcgccgc cgttcagccg cgggtcctcg 78180
gcgtggttgc ggcagtgcac gaactcgatg tccttggacg tcgccatgat ccacgggtcg 78240
tcggcggcgg cgctgatccg ctcctgcgcc gcccgcgccg cgcccggccc gaacggctcg 78300
gcgagccgtt cgtcgaggac gcgggcggcg cgcgcggcgg cgctcatccc gtgcccgtag 78360
atcgggttga acgcggcgag cgagtcgccg aggacgacca gcccgtccgg ccacgtcgcc 78420
gcccgctccg ggtacagccg ccggttcgcg ccgatgtgcg aggcgaacac cggcgtcagc 78480
ggctcggccc cggcgacgag gtcggccacg atcgggtcgc gcagcgtccg cgcgtacggc 78540
aggaagtcct cgtcgcgggt gggcagctcc ccgccgcgcg tgcacgacag cgtcaccatc 78600
caccggccgc cctcctgcgg gtacaccacg ccgaaccggc ccggcctgcg gacccggtgg 78660
tcggcggcga cgttgatcgc ggggaacccg gcggtcgcgc cgtccggcgc ccggaacgtc 78720
cgcgtcgcgt aggcgatgcc cgcgtccacg acgtcctcgg cgagcggcgg caggccgagc 78780
gcggcgagcc agtgcttcag gcgcgagccc cggccggtgg cgtccacgac gaggtcggct 78840
tcgaggagct cgcgcgcgcc gtcgcgggcg ccgcgcacgt ggacgccggt gacgcgcttc 78900
gcgccgcccg ccagctcgac ggcctcggtg ccctcccgca gcgtgatccg ctcgtccgcg 78960
aggatccggt cgcggacgat ccagtccagc agcgggcgcc cgcacatgat ggcgtactgc 79020
cgcggcggga accggtgctg ccacccgtgc gaggtcagcg tcaccaggtc ctcgtggaac 79080
ccgatcctgc gggcgccggc ggcgagcagc cgttcggcgg tccccggcag cagctcctcg 79140
acgatgcggg cgccgcccga ccacagcacg tggacgtgcc gcgcctgggg cagccccctg 79200
cggtgctccg gcccgcgtgg cagcacgtcg cgttccacca cggtgacggc ttccaggtgc 79260
cgggccagca cgtgggcggc gagcatcccc gcccagcttc ccccgagcac gatcgcgcgt 79320
ccggatctcg tcacgttccc gtccctccat gcacggccgg ccgcccggcc ggcgccgctg 79380
cgcccgccgc tcaggaggag ccggcgcggg cctcgctcaa ccgcacgttc tcgatgaggc 79440
gctccgtcac ggccagcgag tgcgccaccc cggtgagccg cggcgcgtcg ggcagcgaga 79500
cgtcggtgat gccccagtac gccttgaggc tgcgggtcag gccgttcccg tccacctccg 79560
agacgccgat gacctggtac cgcatgttcg acgggtacga ggtcgtcacg acggacggcg 79620
ccacgaccgt ccggccgtcc agcgccagga cgggccgccc cggcacctcg tgcaccttgc 79680
actcgatggc ccacgcgatg ttctggcgga tcgccgcctt gccccgcacc ggcggcgagc 79740
ccaccgggtc ctcgaagacg atgtcgtcgg tgaacagctc catcacgccg tccaggtcgc 79800
cggcgttgat gcgccgcgcg tactccaggg gcgcgttctt cctgaccgcg tcatccgtcc 79860
cggcctcggc cgccccccag aacgccctga gctcctcgat gagcccgccg gctccgacgc 79920
gcagcatcag aacgtactcg cgctcgatgg aacggacgcc gcccggcccc gccgccgggc 79980
cgtccagggt ggcgctgacg ggcaccagcg cgtgcacgcc gtcctgggag gcgaccggct 80040
cgccgggcac ctcgcgggcg ttcgcctcca cggcccgcgc gaagtgcgcc cgcagctcct 80100
cgcggccggc gcgctcgccc ccgccgaccg ggtcctcgaa ccggacgtcc tccgcgtaca 80160
ggtcgaggag cccgccgacg tccccggcgt tgagccgccg gcagtgctcc aggatgatgc 80220
gcttgcgcgc gctctcgtcg gtcagcatcg gctgcccctt ccggccgtgg ccgtcgtcgg 80280
ttcccggtcc cggacgctca tccggaggcg gtctcggcgc ccacggccgc gcgcgggctc 80340
aggttcacga gggcgagctc gtccgggctc agcggcggct cggtgagctc cgggcgcatc 80400
gggtcgctgc cgagcagcgc catgaaccgg ctggagccga gctcggactg cgggacggcc 80460
aggctggcgg cgtcggtcac catctcgctc acgcccggcg accgcagcga gcggaccgtg 80520
atgacgtcgg cgaagcggcg ccgcgcgtcg gcctcgccca gcagccgcgg gtcggtcgcc 80580
gtgctgcggc agccgacgta ctcgatgtcg cgggccgccg cgatgatcca cgggtcgtcc 80640
accgccgcgc acacgtcccg ctgcgccgcc gcgaccgccc cggcgccgag gccctcctcc 80700
gcgaggcgct cgtccagcgc cgcgatgccg cgcgcggcgg cgctcatccc gtgcccgtag 80760
atcgggttga acaccgcgag cgcgtcgccg aggacgacca gccggtccgg ccagcccgcg 80820
agccgttcgg ggtacagccg ccggttcgcg ccgaagtgcg agacgaccag cggggtcagc 80880
ggctcggcgc gggcgatgag gtcggcgacg agcgggtcgc gcagcgtccg cgcgtactcg 80940
ttgaactccg cgtcgtcggt cggcagccgc acgccccgcg tcccggcgag cgtgaccatc 81000
cacgtgccgt cctcctgcgg gtggacgacc ccgaagcggc ccggctcccg cacccgcggg 81060
tcgggcgcga ggttgaccgg cgggaagccc gccgccgcgc cgggcggcgc ctggtagatc 81120
cgcgagcagt acgccatgcc cgcgtcgacg acgtcctcct ccagcggcgg cagcccgagc 81180
gccccgagcc agtgcttgag ccgggacccg cggccggtcg cgtccaccac caggtccgcc 81240
gtcagcgacg acgcggcccc ggtcgccacg tcgcggacgt gcacgccggt gatccggtcg 81300
cggtcgcccg cgaggtcgag cgcctcggtc cgctgccgca cctcgacccg tcccgacgcc 81360
gccacctgct cgcgcaccgc ccagtccatc agcgggcggc tgcacatcac gcagtactgc 81420
ttcgacggga accggtgctg ccagccgtgc gaggtcagcg tcaccaggtc ccgctggaac 81480
atgatccggc gcgcgcccag cgagagcagc cggtcgatcg tccccggcag cagcgcgtcc 81540
acgacccgcg cgccgctcga ccacaggatg tgcccgtggc gggcctgcgg ctgccccttg 81600
cgctgccggg gcccctcggg caggacgtcg cgctccagga cggtgacggc gtcgaagcgg 81660
cgggagagcg cctgcgcggc gagcatcccc gcccagccgc cgcccaggac gatcgcgtgc 81720
tcggatctgg tcatggtcac cctctcgtcg tcgcgccgga gcgacgcggc gtcgtccctg 81780
tgatgattgc cgtgcctgct taagcggccc tagcgccggt tccggccgag gtcgtggtcg 81840
atgaaggcga gcagctcgtc caccgtggcg gtcccgatcc gctcggcgcc catgccggac 81900
ggctcggcgg acggctcgcc ggacgggccg tcgggcgcgc cgccggggtg gtcgtccagc 81960
ctggcgagca tggtctggag gcgtcgccgc agtgccgtcc gcacgctcgc gtcgccgttc 82020
acggccgcgc cggcggggga ggcggcggcc atcgcgttgt ccatgcgttc gaggcgggcg 82080
agcagcgagg ccggcgtgcc gtcctcgtcc gggtccggtc cgcgcggggc cagctcagcg 82140
tgcaggtgct ccgcgagcgc ggccgggctc gggtggtcga agacggcggt ggtggggacg 82200
gccagcccgg tcgcgccggc gagccgtccc gcgaggtcgg tcgcggtgat cgagtcgaag 82260
cccatctccg agaacccgcg cccgaccggg acgagccccg gatcggcgtg gccgaggacc 82320
gcggcggcgt tcgtgcggac caggtcgagg agcagggggc cgcgctggtc ggcgggcgcc 82380
gccgcgaggc gttcggcgag ggacgccgcg ccccggccgg tggccgcgct cggccgcgcc 82440
ggggtccgtc cggcgagggc gcgcagagga gcgggcggcg cgtcggcttc ggtggcgtgc 82500
gcggacgcga gtgcccggac gtccagattc gcggcgacca gtcggggggt ggggctgtgg 82560
cgggccgttt ccagcagggc aagcgcatgc tcggcggtca gggctccgac gcccggcgcg 82620
gcggcggacg gtcgcgcggt gtcgtccagc caaggccccc aggcgatcga cagcgccgtt 82680
cgcccagccg ctcggcggtg ggcggccaac gcgtcgttga acgccccggc cgcatcgccg 82740
ccggacgcgc cgagcgtggc ggcggccgag gagaacagga cgaacgcctc cagcggcaga 82800
tcggcggtcg ccgcgtgcag ggcgtacgcc gtcgcgatcc cgtccgtacg tgcctcgtcc 82860
ggcgggcacg cggcgtggac gacggcggcc agcgggcgcg cctcgtcgat cgcggcgacc 82920
acgtcggcga cggcggacgc gtcggtgagg tcgcacgcgg ccacgcggac gtccgcgccc 82980
agcgcggcgg cgcgctccgc cgcgtcccgc acgcccggcg actcgtggcc ggggccgggg 83040
ccgtccgggc ggagcagcag caggtgcttg gcctgccacg cgcggacggc gtgctcggcg 83100
accagcgccc cgcgcggacc gccgtcgagg gtgatcagca cggtcccgtc cacggcgaac 83160
ggaccgtccg ccctggtggc tcgcggcccg ggggacgccg cgcgcgccag gcgcggcacg 83220
aggacctgcg ccgaacggat cgcgacctgc ggttcgtgct gcgcgagcgc ggccgccacc 83280
gcctcgcgga gccgcgcctc ctcggggtca ctgtcggtgt cgagcagaac gaaccggtcc 83340
gggcgctcgg cctgcgcgct gtgcaccacg ccccacacgg cggcgtcggc gaggctcgga 83400
acgtcgcgtt cgccgccgtc cacgctgacg gctcctcggg tgaccacgac caggcgggag 83460
ccggacaggc gcggctcggc cagccagtcc cgcaccagcg tggaggcggc ctccgcggcc 83520
cgccgcgcat cgccaccgga ggcggtgggc gcgcgggtga gaacgaccgg cggggcggga 83580
gcgccgggat cggcgtcgag cgcggtgcgc agcgcgtcca gctcgatgta acgccctgag 83640
gcgtcctcgg tcggcgcggg cgcgggcgtc cagtccagcg tgaacaggcc gtccgtccgc 83700
ccgccgccgg acgcacggag gtcttcgacg gcgaccggct ccagcgtcac cgagccgacc 83760
gtcagcacgg gcgctccggc cggatcggtc agcgtcacgc gtacgcgccg cgcgttccgg 83820
ccagggtccc cggtggggga caggcgtact cgtaccgtgg tcgcctccga cgcccacagc 83880
gccacgtcgt tccatgcact gggcagccac acgtcgtctt cgcgggcccg cgcgtcgtcc 83940
gggacgtcca ggagcgccgg gcggagcgcc gcttcgagca gtaccggatg gaggccgtag 84000
ccgtcgtgac caccggcctc gtccggcagg gccacctcgg cgagcaggtc ggagccgtcg 84060
cgccagaccg cgcggatccc gtcgaacgcg gggccgtcct cgtagcccgc cgcgacggtg 84120
cgctgacgga ggccgtcgag gtccactggt acggcctcgg gtggtggcca cgcgcgtgcc 84180
cgttcgtcct cgggagccgg ggcggccggg ccgggtgccg gcggggcgag cgtgccgtgc 84240
gcgtggcacg tccaggcggg atcatcggcg gtacgggggc cgtggtcggg gcgggaatgc 84300
acgcgtacgt cgcggcgccc gtcctcccgg gccgccccca cggtcacctg cacgcgcggg 84360
ccccggcccc gcgcgggcgg gacgagcgga tcgcggacca ccagctcttc cacgccgccg 84420
caccccgcct cgtcggccgc cagcagagcc cactccacca gcgcggacga gggaacgatc 84480
ggcagccccg cgacggcatg ccctgtcgtc catgccgcgc cgttcgcgga gatggtgccg 84540
ttcagcgcca gcccgccatc ggccagctcg accgccgccg cgaagaacgg gtgacggacg 84600
cgccgcgccc cggcggcgga cacgtcgccc gcgccgccgc ccgcgtcgcc cagccagtag 84660
cgctcgcgct ggaacgcgta ggtggggagg cggacggtct ccggcgcgga gccgtcgggg 84720
aagcaggccg tccagtcgac gtcggtcccg gcgatgtgga gccgggccag ggcccgaacg 84780
aacgcgtgcg cgtcgggctg cctggagctg agggtggcga ccgtgcgcgg ggcggggccg 84840
tccgggcgga cgtgttccag cgtgcggcgc gcggcggcgg tgagggtcga gtcggggccg 84900
agctccacga acgtcccggc gtccggcgcg acgtgcgcga cggcggggtg gaagcgcacc 84960
ggccggcgga tgtgctcggc ccagtacccc ggggtggcga tgcgctcgtc ggcgggctcg 85020
ccggtgaggt tgctgatcag cggaacggcc ggcggccggt acgcgaggtc gcggacggcc 85080
tcctcgaacg gggcgagcac gggctccatc agcggcgagt ggaaggcgtg gctgacggtc 85140
agcgcccggg tcttccgccc gcgctccgcc cacgcctccg cgatctccgc gacgggtccg 85200
gcggcgcccg agatgacggt gttgccgggc gtgttgaggg ccgcgacggc gacccgcccg 85260
ccgtgcgcgt ccaggtcggg ggacagctcc tccggggtgg cggcgacggt cgccatggcc 85320
ccgccctcgg gcagccgccc catgagcgtc gcgcgcgcgg cgaccagccg gcacgcgtcc 85380
ggcaggtcga acaccccggc gacgtgggcg gcggcgatct cgccgatcga gtggccgatc 85440
acgacgtcgg gccgcacccc ggccgcgccg agcagccggg ccagcgcgac gtgcagcgcg 85500
aacagcccgg cctgggcgta ggtggtgtgg tcgagcgcgc cggaccggtc ggcgaacacc 85560
acgtcccgca ccggacggtc gagatgctcg tccagcagcg cgcagacctc gtcgaacgcg 85620
gcggcgaacg ccgggaaccg ctcgtacagc ccggcgccca tgccgggccg ctggctgccc 85680
tgcccgctga acaggaacac cgtcttcccg gcggccgtgc ccgtaccggc ctcgcccgcg 85740
cccgcgacga cgacgccggg gtgggcgtcg ccgcgcgcca gcgcctccgt cgcggcgagc 85800
agctcgccgc ggccctcgcc gacgacgacg gcgcggtgct cgaacgccgt acgggtcgtg 85860
gcgagcgcca ggccgaccgc ggccggagac tccagcggcg caccggccgc gtggtcggcc 85920
agcgcgctgg cctgcgcccg caacgcgtcc tcgccgcgcg cggacagcac ccacgggagc 85980
accccctccc cggcgtcggg cggcgcggtc gtcgcgggct ccgggtcttc cgggggcggg 86040
gcctcctcca ggacgaggtg ggcgttggtg ccggagatgc cgaaggacga caccccggcg 86100
cggcggggcc gttcgccgcg cggccacggg accggctcgg tgagcaggct gaccgcgccc 86160
gcctcccagt cgacgtgcgg gttcggctcg tcgatgtgca gggacgcggg cagcaggtcg 86220
ttccgcatcg ccatcaccat cttgatgacc ccggcgacgc ccgccgccgc ctgggtgtgg 86280
ccgatgttcg acttgagcga cccgagcctc agcggccgtt ccgccggtcg gttccgcccg 86340
tacgcggcga gaaccgcctg cgcctcgatc gggtcgccga gggtcgtccc agtgccgtgc 86400
gcctcgaccg cgtccacgtc ggcgggcgac agcccggcgg cggcgagcgc ccgccggatg 86460
acgcgctgct gggacggccc gctcggcgcg gtcagtccgt tgctggcgcc gtcctggttg 86520
acggcagagc cgcggatcac cgcgaggacg cggcggccgt tgcggcgggc gtccgacagc 86580
cgttccagca gcagcaggcc cgcgccctcg gcgagactcg tcccgtcggc cgaggccgcg 86640
aacgtcttgc agcggccgtc ggcggcgagg ccccgctgcc gcgtgaactc caggaacagc 86700
cccggcgtcg ccatcaccgt cacgccgccc gccagcgcga gggagcattc gccctgccgc 86760
agcgcgcgcg ccgcgaggtg ggcggcgacc agcgacgacg agcaggccgt gtcgaccgtg 86820
acggccggcc cctccagccc gaacgtgtac gccagccgcc ccgacaccgc gctgcccgcg 86880
ttgcccgtgg cgagatgccc ctccacctcg gcggccgtcc cgcccgcgag ggccaggtag 86940
tcctgcgagg cgatgcccgc gaacacgccg gtgtcgctgc cgcgcagggc gtcgcgccgc 87000
agcccggcgt cctcgaacgt ctcccacgcc gtctccagca gcagccgctg ctgcgggtcc 87060
atgccgagcg cctcgcgcgg gctgatcccg aagaacgcgg cgtcgaagcg cggcgcgtcg 87120
tgcaggaacc cgccctccag ggtgtggcac gtccccgggc cgccgccggg cgcgccgagc 87180
tcctccagcg gccacccgcg gtcctcgggg aacgcggaga tcgcgtccgc gccgtccgcg 87240
acgagccgcc acaggtcgtc cgcgccgcgc acgccgccgg ggaagcggca cgccatcccg 87300
acgatcgcga tcggctcccg cgcccggtcc cgttcctccc gcagctcccg ccggacctgc 87360
ctcaggtcgg cggtggcgtg cttgaggtag cggcgcagct tctcgtcgtt cgtcatcgcc 87420
tcagccccgc tcggacgggg cgtcgtggtc ggcgggacgg tcccgttcga gcccggtcag 87480
ctcgttgtcg agcaggtcga acagctcgtc gtcggtggcc gagtccaggt cgtcgcggcc 87540
cggcccgtcg tccggcgcgg cgccgtccgg gacgccggtg acgctgcgca gcagcgcgtc 87600
caggcgggcc gcgaccaggc cgcggccctc gtcgccgtcc ggcagcgcgg ccacggccgc 87660
ctcgatccgg tccagctcgg cgaccagcgc cgcgggctcc ggtccggcgc cgtccggcgc 87720
gacctcctcg gccagcagcc cggcgaccgc ggcgggcgtc ggatggtcga agatcagcgt 87780
tgcggggagg cggacgccgg tcgcggcgct gagccggttg cgcagctcca cgccggtcag 87840
cgagtcgaag ccgagctcgc ggaagttctg ctcggcgggg atcgaccccg ccgactcgtg 87900
cgcgacgacc gtcgccgcct gcgtgcgcac caggtcgagg agcgcctcgc gccggtccgc 87960
gtcgttcagc gcggccagcg tccgcgccag cgacggcccg gccgccgtcc cgcccggccg 88020
cggcgcggag ccccgcagca ggacccgcag gaacgccggg acctcgcccc ccgcggtccg 88080
gcgccgcagc cccggcaggt ccaggcggac gggcgcgagc agcggccggc cgaggccgag 88140
cgccgcgtcg aacagcgcga gcgcccgctc cgcgctcagc ccgaggtcgc ccgaacgggc 88200
gttccgcgcg aggtcgatgt cggtgaaccg gcccgacatc ccggtgtcca tctcccagaa 88260
cccccacgcg agggacgtcg cgggcagccc gcgcgcgtgc cggtgctgcg cgagcgcgtc 88320
gaggaacacg ttcgccgccg agtagctgcc ctgccccggg ccgcccgcga gaccggtcac 88380
ggacgagaac agcaccagcg ccgcgaggtt caggtcgcgc gtcagctcgt gcaggtgcca 88440
cgccccgtcc gccttgacgc gcaggacggc gtcgagctgg tcgggcgtcg ccgtgcggat 88500
cgtcgcgtcc tggacgatcc cggcggtgtg cacgaccgcc gtcagcggat gccgcgccgg 88560
aacggacgcc agcaggtccg cgagggccgc ccggtcgccg gtgtcgcacg cggcgacggc 88620
gacctccgcg ccgagcgcgg ccagctcggc ggccagcgcg tccgcgccgg gcgcctcggg 88680
cccgcggcgg ctcgcgagca gcagccgccg gacgccgcgt tccgcgacga ggtgccgggc 88740
cgtcgccgcg ccgagagccc cggtgccgcc ggtgatcagc acggtcccgt ccgggtcgag 88800
gggcggcggc agcgacagga cggtccgccc ggccgcgccg tccgcccggc ggagcgcctc 88860
gcggacgtcg tacacgtccc ggacggtgac gggcgccgga ggcagcgtcc cgtcctcgaa 88920
cagcgccgcc agatcgctga gcgtcgaacg gacgcgctcc acccgcccgc cttccccgcg 88980
gccggtgccg tcgaggtcgg cgggggtcgg atggtaggcg ggcaggtagg tggagaccgc 89040
gccggcggcc tccgcgtacg accacccggc gggcatcggc acgaccaggc ggtggtcggt 89100
gagggccacc ggcccgacgg cgtcgaacgc ccccatcacc cggtcgccca cggcgaaccc 89160
gtccacgtcc gcgccgaccc cggcgacgac gcccgcgccg tcgccccgca cgtcgcggac 89220
ctcgtccggc gcgcccggcc cggcggcctg ggcgctgacg gcgacggccc gcaacgccac 89280
gcgcacctct ccgggcgcga gcggccggcc gtgctcggga tcgggaacga gcgcggcctc 89340
gccgtccgcg ccgtacacca gccgccacgc ctccgcgccc ggcggcgggg acagccggtc 89400
gccgggatcg tagcggacga gccgcggcgc gtacgcgcgg ccgtcgcgca gcgcgatctg 89460
cggctcaccc gaggcgagcg cggcgggcac cgcgtgcgac gaagcgtccg gcccgtcgag 89520
atcgagcagg acgatccgtc cggggttctc ggactgcgcg ctgcgcacca gcccccacac 89580
cgccgccgcg ggcaggtcgt ccacgccgtc gccggggccg gtcgcgaccg cgccgcgcgt 89640
gacgacgacc agccgggacg tgctctcgtc ggacggccac tcctggagca gggccaggac 89700
gcccgaggtg acggcgtgga gccgttcctc cacgggaccg gcggcgccgg aggcggtgca 89760
gacggcgagc acggtgtccg gcagcggcgt gccgtcaacg gccgcggtca ggagcgcgga 89820
gaggtccgga tgccgttccg tccccggcag cgcgccgccg agcgcgtcgg ccagctccgc 89880
gtccgagccg gacgccgcgt cgagcgcgac gaccgcgagc cgggccgggg gtgtcccttc 89940
ggggtcgggc agcggcgtcc agtccatcgc gaacagcgag ttcctggccg ccaggcgggc 90000
ctgcgcggcg gcgtcggcct cgaccgaacg cagcgtcagg tcgtccagcg acaggaccgg 90060
cgctccggac gcgtcggcgg cggcgatcgt gagccggtcc gccgccgtcg gcgtgacccg 90120
cacccgcagc gacgccgcgc cggtggcgtg gacccgcagg ccgctccagg cgaacggcag 90180
catgaccgct ccggagtcgc cgtccgcgcc gtccgcgccc agcgtgcacg cctgtaacgc 90240
ggcgtcgagg agagcgggat ggatgccgta tccggcggtt ccggagccgt ccgccccggc 90300
gcggtccgtt ccgtcctcgg gcaggaccac ctccgcgtag aggtgctcgt cgagccgcca 90360
cgcggcggtg agcccctgga aggccgtccc gtacgcggag ccgcggtccg cgaggtcgtc 90420
gtagaggtgt tcggtggcga gcggctccgc tccgggcggc ggccaggcgc cgtccagcgg 90480
gtggggctcg gacggcgtgg ccgtgctcgc cccgggtccg ttcgcgagga gcccggtggc 90540
gtgccgcgtc cacgcggact cgccgtcgtc ggcgtccgcc gccggtctgg agtggatggc 90600
gacgggacgc ccgccgccct cgccgtcggc gggaccgacc gcgacctgga gatcgaccgc 90660
gccgtcctcc gggagccgca gcggctcgtg caggacgagc tccgcgacgt ggtcgcagcc 90720
cgtacgggcc gccgcgtgca cggcgaggtc ggcgaacgcg gcccccggca gcagcaccgc 90780
cccgagcacc ttgtgatcgg ccagccacgg ctccgtcccc tgggagatcc gtccggtgag 90840
caggtaggcg tccccctccg cctgctcgat cgcggccccc agcagcggat gccccgccgg 90900
agcgagcccg agcgtccccg gttcgccgcc gagccagccg cgcgtgtcca gccagaaccg 90960
ctggtgctgg aacggatagg tgggcaggtc ggtcacggcg ggcggcgggt cggcggggta 91020
gtggcgcgtc cagtcggcgg gcgcggccgc ggtgtggagg cgggcgagcg cgtggacgag 91080
ctgcgtgtgg tccgcgtggt cgcgccggag cgtggaggtc acggtggtac gggtgtcggc 91140
gtcctcggtg atctgctcca gggcgagggt cagaatcgga tgcgggctgg cctcgatgaa 91200
gtcgcggtgc ccgtcggcga ggagcgcgcg gacggtgtcg gcgaaccgga cgggacggcg 91260
caggttggtc acccagtact cggtgtccag ctcggcggcg tcgatccgcg cggccgtcac 91320
cgtcgagtag aacgggatcc gcgcctcgga gggacggacc cccgcgagga tctcgcgcag 91380
ttcgccggtg atccggtcga tctgcgggcc gtgcgaggcg tagtccacgt cgatcatccg 91440
ggcccgctgc ccgttcgcct cgaccttggc gacgatggcg gcgaccgcgt ccggcccccc 91500
ggagaccacc acggacctgg gcccgttcac cgcggcgacc gtcacatcgc cttccagcag 91560
ttccccggcc tcctcgggcg tgacgccgag ggaggccatg gcgccgtgtc cggcgaggtt 91620
gcggagggct ttgctgcgga gggcggtgat ggtggcggcg tcggtgaggg tgagtgctcc 91680
ggcgatgcag gcggcggcga tttcgccttg gctgtgtccg atgacggctg cgggtgtgat 91740
tccgtggtcg gtccagacgg cggcgaggga gaccatggtg gcccagagga cgggttggac 91800
gacgtcgacg cggttgaggt cggcggcgcc ttcggttccg cggaggacgt cggtgaggga 91860
ccagtcgatg tgtggggcga gggcttgttc gcattcggcg atgcgtgcgg cgaagacggg 91920
tgaggtgtcg agcaggtcgg cgcccatgcc gggccactgc gaaccctgcc ccgggaacac 91980
cagcacggga cccgtctgcg ataccgctgc gggcgtgccc ggattgacca ggcacggatg 92040
cgtccggccg tcggccaagg ctctcaggcc gtccatgagc tggtcgcggt cctggccgat 92100
gacgacggcg cggtgttcga acgtcgagcg tgtggtggcc agcgaccatc cgacctcgac 92160
cggtgagaag tcctcgcgtt gttcgaggtg tgtcacgagt gcggtggcct ggtcgcgcag 92220
tgcgggcgcg ttgcgcgccg acagcgccca cggcaccacc ccgtccacga cttccgcacg 92280
tggggcctgt cccgcaggga cctcgtccgg ctgcggtggc tcctccagga tcagatgggc 92340
gttcgtccct gagatcccga acgacgacac gcccgcgcgg cggggacgat caccgcgcgg 92400
ccactcgacc gggtcggtga ggaggcggac ggcgcccgcg tcccagtcca cgtgcgggga 92460
cggctcgtcg atgtgcagcg acgcgggcag cagcccgttc cgcatcgcca tgagcatctt 92520
gatcactccg gccacgcccg ccgcgccctg cgagtgcccg atgttcgact tcaccgaccc 92580
cagccacagc ggacggtcgt ccggacggcc ctgcccgtac gtcgccagca gcgccttcgc 92640
ctcgatcggg tcgcccagcg tcgtgcccgt gccgtgcgcc tccaccgcgt ccacgtccga 92700
agccgccagt cccgcgcgga cgagcgcctg ccggatcacc cgctcctgcg acgggccgtt 92760
cggcgcggtc aggccgttgc tcgcgccgtc ctggttgacc gccgagcccc gcaccacacc 92820
caggatccgg cggccgttgc gctgggcgtc cgacagcggc tccagcagca ccagcccgac 92880
gccctcgctc cagcccgtgc cgtcggccgc cgccgcgaac gacttgcacc ggccgtcggg 92940
cgcgttcgcc cgctgccgcg agaactcgat gaacgcgtcg ggcgtcgcca gcaccgacac 93000
cccgcccgcg agcgccatcg tgcactcgcg ctgccggagc gactggcagg cgaggtggat 93060
cgcggtcagc gacgacgagc acgctgtgtc caccgacacc gccgggccct ccaggcccca 93120
ggtgtaggcg atccggcccg acgcgacgct ccccacgttg ccggtgcccg cgtagccgcc 93180
gacgtcgctg gtcgtccggc cgatgaggga gaggtagtcg tgcgagctga ccccggcgaa 93240
caccccggtc tcgctgccgt gcagggcgtc ccggtcgagt ccggcccgtt cgaacgtctc 93300
ccacgccgtc tccagcagca gccgctgctg cgggtccatg gcgagcgcct cgcgcgggct 93360
gatgtcgaag aacgccgcgt cgaacccggg gacgtcgtcg cagaagccgc cctccgtcac 93420
gtacgtcctg ccgggcttgt cggggtcggg atcgtagatc cgctcgacgt cccactcccg 93480
gtcggcgggg aacggggtga tcgcgtcgcg gccctccgcg acgagccgcc acaggtcctc 93540
gggcgaccgg acgccgccgg ggtagcggca ggcgaacccc acgaccgcga tcggctccgg 93600
ctcggcggac tcggcctccc gcagccgccg gcgggcctgc cgcagctcgg cggtgaccca 93660
cttgaggtgg tcgagcagct tttcctcgtt cgacatctag ggcgtactcc ttgccgcgac 93720
gcgagggatc ggggtcatcg ggctcacgac ttcacgactt cccgaactcg tcggcgatca 93780
ggtcgaagat gtcctccgcc gtcgccgtct ccagttcgtg gtcgggcgcg gaacgcccgt 93840
cctcgtctcc ggtggcgttc cacttggcca gcagcgactc cagccgctgg gtgacgcgcc 93900
gccgcgccgc cccgtccacc gcggccgggt cgcaggacga gtcccactgg tccaggccgg 93960
acaggacgcg gccctcgtcc accacgtcca cgtcgaggag ctggacgcgg agcaggtcgg 94020
cgagcgcccc gcacgacgga tggtcgaaga tcagggtggg cggcagcggg agcccggtcg 94080
tcgccgcgag ccggttgcgg agctcgacgg cgctcagcga gtcgaagccg agctcctgga 94140
acggctgcgc cgccgggatc gcgtccgggt ccggatggcc gaggacggcg gcgacgcggg 94200
cctggacgtg ccgcagcagg atgtgccgct gctcctcggg cgtaccgccc gcgagctgct 94260
cgcggagcgg atgcccgccg ccctcggcgg cctcctcgac gtccgccgcg gcgtcgggcc 94320
gggcgggggc caggtccgcg atgagcgggc tggggcgctg cgtggtgaac gtggtggtga 94380
acgcctccca ttcgatgtcc gcgacggtga gcgtcgtctc gccgcgcgtc agcgcccggt 94440
ggagcgcctt gagggccagg tcgggatcga gcggcgtcag cccgcggcgg ccgatgtagg 94500
tgatgacgtc gtccacggcc gccatccccg cctgcgccca gggcccccag gcgatgctgg 94560
tcgcggggag cccgagggtg tgccggtgtt cggcgagggc gtcgaggtag gcgttggcgg 94620
cggcatagga ggcttgctgc ccgctgcccc aggccgcggc cccggacgag aacagcacga 94680
acgccgtcag gtccagatcc cgggtcagct cgtgcagatg atcggccgcc agcgccttcg 94740
accgcagcac ctcaccgata tggggaacgt cgagatcggc gaacgcggtg agctccggaa 94800
tccccgccac gtggaacacc gcgttcagcg gctggtcgcc gggaacgccg tcgatgacgt 94860
tcttcagtgc ggagcggtcg gagacgtcgc aggcggtgag ggtgacggcc gttccgagcg 94920
tttggagctc ttcggcgagt tcgcgtgcgc cgggtgcgtc ggggccgcgt cggctggtga 94980
gcaggaggtg gggtgcgccg ttgtgggcga gccagcgggc caggtgtgcg cccaggccgc 95040
ctgttcctcc ggtgatcagc gtcgtcccgg acggccgcca cggcgcgtcg ccgttggtgc 95100
ggtcgggcgc gcggcgcata cggcgcgcga ggacggcggt cgcgcggatc gcggcctggt 95160
cctcgggctg cccgggagcg agcagtgcgg caaggcgcgc gggagtgtgc tggtcgggtg 95220
tggtcggcag gtcgatgaga ccgccccagc gggccgggta ttcgagcgcc gcgacgcgtc 95280
cgagccccca cgtctcggct tggagcgggt tgaggagcgg atcgctgggc gacaccgaca 95340
ccgctccctg cgtcaggcac cacaggcgtg ccgggacgcc gctgtcatcg agcgcctgga 95400
cgagtgccgt cgtcgccgcg agcccggccg ggaccgccgg atgcgacggg tgcggcgtct 95460
cgtccaaggc aagcaggctg aggacgcctt ccagcggagc gtccgcgtcc gcatcggcga 95520
gctgccgcgc gaactcctcc cgcccggccg tggtcgcctc cacgcgcaga acgttcggaa 95580
cggcgccgtg cgcgcgcagc gcctgaacgg cgaggtccgc agcgggcccc tcagcgtcgt 95640
tcgcgggaac gagcacgagc cacgtgcccg acagcaccgg cgcgctctcc tctgggagat 95700
gcgtccaggc gacctggtag cggtacgagt ccagtacgcc ctgctcgcgg tgcctgcgcc 95760
gccaccccga caggatcggc agaacctcgc ccagcgaggt ctcctgctcg gccggtgaac 95820
ccagcgtccg cgccagcgac tccacgtcct cgcgttccac cgcgtcccag aactcggagt 95880
cgagcgcgtt ctccgtcgcc gccgccttct ccggctcgtt caccagccag tacgaccggc 95940
gctggaacgc gtagttgggc aggtcgggag cggagcccgc gcgccggctc gggaagtagg 96000
gggaccagtc gatcggcgcg acgtgcgtgt gcaactgggc gagcgcgtcc gtcagcgcgt 96060
ggacgtccga gtggcggtgg ttgagcgtcg ccgcgacgac gggctcgggc cggccgtcgc 96120
ccccgacgct cgcgagagtg tgccgcgccg cggcggccag cacggggtcc gggccgagtt 96180
cgaggtaggc ggccggcggc gtcgcgctcc gggccagata ctcgatcgcc ggctggaacc 96240
gtaccggctg ccggacctgc tggatccagt agtcgggccc cagatcctcc accggcccgc 96300
ccgtgaggtt ggtgatgagc ggaatggtcg ccggacggtg cgtcagcccc cggacgacct 96360
cggcgaacgg ttcgaggacc gggtccatgt gcggcgagtg gaacgcgtgg ctgaccttga 96420
gccgccgcgt cttgcgcccc ttcccggccc agacgtccgc gacctgcgcg acgagatccg 96480
gatcaccgga gaccaccgtg ctgtccgggg tgttgagcgc ggcgatggcg atgcgccgcc 96540
cgtcgggatc gacggtttgg agatggtcgg tgagttcgtc gtaggtggcc tggatggcgg 96600
tcatggtgcc gtcgatgggg atttggtcca tgagggtggc gcgtgcggtg agcaggcggc 96660
aggcgtcggg caggtcgagg atcccggcga tgtgggcggc ggcgatctct ccgatggagt 96720
gcccgatgag ggtgtgcggg gtgtggccgt gctggttgag gagccgggcc agggagacct 96780
gcaacgcgaa cagcgcggtc tgggtgtaga gggtgtggtc cagcaggtct ggttggtcgg 96840
ggtcggggtt gaagaccacg tcgcgtagcg ggtgggggag gtgggggtcg agcagatcgc 96900
agatctggtc gaaggtggtg gcgaacaccg ggaaccgctc atacagcccc gcgcccatcc 96960
cgggccgttg gctgccctgc ccactgaaca tccacaccac ctcgccaccc gtggcaccgg 97020
cactgggtga agcggtcagg cacgggtggt cgtctccagc ggccagcgcg tccagccctt 97080
ggaggaactc ctcagagtgg tggccggtga tgacggcgcg gtgctcgaac accgaacgcc 97140
tcgtagccag cgaccacccc acctgaaccg gtgagacgtc ccggccggac aggtgcgccg 97200
acaacgccgc cgcctgatcc cgcaaccccg ccaccgaccg cgccgacacc acccacggca 97260
ccaccccacc cgacccgacc cgagcctcgg actcggactc agcctcgggg acgacctcgg 97320
ggtcgggttc gggggcctgc tccacgatca gatgcgcatt cgtccccgaa ataccgaacg 97380
acgacacacc cgcacgacgc ggacgcccac cccacaccca ctccaccggc tcactcagca 97440
gccgcaccgc acccgcctcc caatccacat gcggagacgg ctcgtcaatg tgcagggatt 97500
cggggagaag gccatttcgc atggccatca ccattttgat gacgccggcg acgcctgcgg 97560
ccatctgggt gtggccgatg ttggatttga tcgagccgag ccacagggga cggtcttcgg 97620
ggcggttgcg tccgtaggtg gcgaggaggg cttgggcttc gatggggtcg ccgagggtcg 97680
ttccggtgcc gtgcgcttcc accgcgtcca cgtcggcggg ctccaggcgg gcgttggcga 97740
gggcttggcg gatgacgcgt tgctgggatg ggccgttggg ggcggtgagg ccgttgctgg 97800
tgccgtcctg gttgacggcc gagccgcgga tcgtggcgag gatgtggtgg ccgttgcgtt 97860
cggcgtccga cagccgttcc agcaggagga gaccggcgcc ttcgccccag ccggttccgt 97920
cggcagccgc cgcgaacggc ttgcagcggc cgttgggcgc gaggccgcgc tgtcgggaga 97980
agccgatgaa cgtggcgggc gtcgccatga cggtgacgcc gccggcgagc gcgaggtcgc 98040
attccccctg cgcgagggcc tggcttgcca ggtggatggc caccagcgac gacgagcacg 98100
cggtgtcgac ggtcaccgcg gggccttcca gtccgaatgt gtaggacacc cggcccgaca 98160
tgacgctcgc cgtgttcccg acgaggatgt agccctcgat ttccttcgcg ggtgttcctg 98220
tggcgctgta gccctggaag gttccgccgg tgaagacgcc gacgttgctt ccgctgaggg 98280
cttcttgggt gagtcctgcg ttctcgaacg tctcccatgc ggtttccagg aggagacgtt 98340
gttgcgggtc catggccaac gcttcacgcg ggctgatgcc gaagaatccg gcatcgaatt 98400
ccggggcgtc gtagaggaac gcgccctcgc gcacgtaact ggtcccgggg tgctcggggt 98460
cggggtggta gaggttgtcc aggtcccagc tccggttggt gggcatttcc gcgatggcgt 98520
cgccgccgga cgccaccagg tcccagagct gctgcgggtt ccgcacgccg ccggggaagc 98580
ggcacgccat cgccacgatc gcgatcggct cgtccgtcgc ggcggacgtg atcgacgcgg 98640
gggaccgtac cgcgtcggcc tgctgcccgg tcagctccgc gcgcaggtac ttggcgatcg 98700
cgttgggcgt ggggtggtcg aagacgagtg tccgggggag cgagaggccg gtggccgagc 98760
cgaggcgctt gctgagctcg acggccgtga gggagtcgaa gcccagctcc tggagcggct 98820
gcccggccga gaccgcgtcg gccccggagt gcccgaggat cgtcgcggcg tgctcctgga 98880
cgtgccgcac cagcagttcg agctgctgct tcggtgtgct cccggccagt tccccgcgca 98940
gcgggtggtc gtccgtccgc ccctcttcgg ctccggcacc gcctccggtc ccgtcgcccg 99000
tgtcggaccg cgagggcgtg aggtccgcga cgagtcggct gggccgctgc gtcgtgaagg 99060
tggcggtgaa cgtctcccag tggatgtcgg cgacggtcag ggtggtgtcg ccgtgggaga 99120
cggcctggtg gagcgacttg acggccaggt ccgggtccat gatcgccagg ccgcggcggg 99180
cgaagaacgc gatgacgttc tcgtccgccg ccatcccggc ctcgccccac ggtccccagg 99240
cgatgctggt cgcggggagt ccgagggtgc ggcggtgttc ggcgagggcg tcgaggtagg 99300
tgttggcggc ggcgtaggag gcttgctgtc cgctgcccca ggcggcggcc ccggaggaga 99360
acagcacgaa cgccgtcagg tcgaggtgcc gggtcagctc gtgcaggtgt gcggcggcct 99420
gtgccttggg aagcaggacg tcgctgatgt gggggatgtc gagttcggcg aagggatgga 99480
gctccggaat cccggccgcg tggaagaccg cgttcagcgg ctggtcgtcg ggcacgccgt 99540
cgatgacgcc cttcagcgcg gaacggtcgg agacgtcgca ggcggtgagg gtgacggccg 99600
ttccgagcgc gcgtagttcg tcggcgagtt cgcgcgcgcc gggtgcgtcg gggccgcgtc 99660
ggctggtgag caggaggtgc ggtgcgccgt tgtgggcgag ccagcgggcc aggtgcgcgc 99720
ccagaccgcc cgttcctccg gtgatcaacg tcgtcccgga cggcgtccag ggagcccggc 99780
cgttgccgga gggcgcgggg gcgcgccgca tccggcgggc gagcgtcgcc gtggcgcgga 99840
ccgcgacctg gtcttcgggc tggcccggtg tgagcagcgc ggcgaggcgc gcgggagtgt 99900
gctggtcggg tgtggtcggc aggtcgatga gaccgcccca gcgagcggga tactccagcg 99960
ccgcgacgcg tccgagcccc cacgtctcgg cctgaacggg gttgaggagc ggatcgctgg 100020
gcgacaccgc cacggcccct tgcgtgacgc accacaggcg gaccgcgatg ccgctggcgt 100080
cgagcgcttg gacgagtgcc gtcgtcgccg cgagcccggc cgggaccgcc ggatgcgacg 100140
gatgcggcgt ctcgtccaag gcaagcaggc tgaggatgcc ggacgggggc gtctccgcct 100200
ccgtgaggtg ctggacgagc tggtcgcgtt cgacggcggc gcagtcgacg ggcagcacgg 100260
acgccgtggc gccgtggctc cggatcgcct gcaccgcggt gtcgatggcg gaacgttcgg 100320
ccgcgcccgc gccgccgtcc gcgggaacga gcaccagcca cgtgcccgac agcaccgggg 100380
cggcgatctc cgggagatgc gtccagccga tctggtagcg ccaggagtcc agcgtgctct 100440
gctcgcggtg ccgccgccgc cactccgaca ggatcggcag ggcgggacgc agatcgtcga 100500
gggccgggcc gtcgccgtcc aggtgcaggg cggcgccgag cgcgtcgacg tcgagttcct 100560
cgatcgcgtt ccacagctcg gtctcggcct ggtcgtggcc gggtccgcca ccgccggacg 100620
ggacggccgg agccagccag tagggccgcc gctggaacgc gtacgtgggc aggtccaccg 100680
cgcgccgccg cgccgtccgc ccgaacagcg ccgtccagtc gacctggacg cccgccgtga 100740
acgcctccgc caccgaccgc gccacctgcg cccggtcgcc gcggtcgcgg tgcagcgtcg 100800
gcaccgcgta cgcgtcggcg cccgcctcct cgaagatctc ctccaggccg accgtcagca 100860
ccgggtgggc gctcgcctcg atgaacaccc ggtgcccgtc gttcagcagg gcctccacgg 100920
tgtcggcgaa ccgcacttgc tcgcgcaggt tcgtcaccca gtacgcggcg tccaggccgg 100980
tcgtgtcgat ccgcccggcg tgcacggtcg agtagaacgg gatctccgcg tcgaacggcc 101040
ggataccggc gaggacctcg tgcagctcgc cggtgatccg gtcgatctgc gggccgtgcg 101100
aggcgtagtc cacgtcgatc atccgcgcgc gcaggccggc ggccttggcc gccgccacga 101160
cgtccgcgac ggcctcgggc ggcccggaga ccaccgtgga ggacggtccg ttgaacgccg 101220
ccacgaccac gtcgccgccc cgttcctcca gcagccgttc cgccccgtcg cgtccggtgc 101280
cgagggaggc catggcgccg tgtccggcga ggttgcggag ggctttgctg cggagggcgg 101340
tgatggtggc ggcgtcggtg agggtgagtg ctccggcgat gcaggcggcg gcgatttcgc 101400
cttggctgtg tccgatgacg gcggcgggtg tgacgccgtg gtcggcccag acggcggcca 101460
acgaaaccat ggtggcccag aggacgggtt ggacgacgtc cacccggttc aggtcggccg 101520
cgccctcggt tccgcgcagt acgtcggtga gggaccagtc gatgtgcggg gcgagggctt 101580
gttcgcattc ggcgatgcgt gcggcgaaca cgggtgaggt gtcgagcagc ttcgcgccca 101640
tcccgggcca ctgcgaaccc tgccccggga acactagcac cggacccggt tcggaggccg 101700
ccgcgggcgt gccgggggcg accagcccgg ggtgctgccc tgcggtggcg agggcgtcca 101760
gcccggccag cagctcgtcg cggtcccggc cgaggaccac ggcgcggtgc tcgaacgtcg 101820
aacgcctcgt cgccagcgcc cagccgacgt cgccggggga gtcgtcgccg cccgcgaggt 101880
gggcggacag cgcggcggcc tggtcgcgca acgcggtggc gctgcgggcg gagaccaccc 101940
acggcaccgg gccgccggac gcgtcgggct cggcgtccgc cgcgtccggg gcctcggagg 102000
cggtgtcggg cacggacggc tcgggcgcct cctccaggat cacgtgcgcg ttcgtcccgg 102060
agatcccgaa cgccgacacc ccggcccggc gcgggcgttc gccctcgggc cactcgacgg 102120
gttcggcgag gaggcggacg gcgcccgcgt cccagtccac gtgcggggac ggctcgtcga 102180
tgtgcaggga ttcggggagc aggtcgttcc gcatcgccat caccatcttg atgacgccgg 102240
cgacgcccgc ggccatctgg gtgtgcccga tgttggactt gaccgacccg agccacagcg 102300
gccggtccgc tggccggtcc tgcccgtacg tggcgaggag ggcttgggct tcgatggggt 102360
cgccgagggt cgtcccggtg ccgtgcgctt ccaccgcgtc gatgtcggcg ggctctaggc 102420
gggcgttggc gagtgcctgg cggatgacgc gttgctggga cgggccgttg ggggcggtga 102480
ggccgttgct ggtgccgtcc tggttgaccg ccgagccgcg gatcgtcgcg agcacgtggt 102540
ggccgttgcg ttcggcgtcc gacagccgtt ccagcaggag caggccggcg ccctcgcccc 102600
agttggtgcc gtcggccgcc gcggcgaacg gcttggcgcg gccgttgggg gcgaggccgc 102660
gctggcggga gaagccgacg aacgtcgcgg gcgtcgccat gatggccacc ccgcccgcga 102720
gcgcgaggct gcactcgccc tgcgcgaggg cccggctcgc caggtggatc gcgacgagcg 102780
acgacgagca cgcggtgtcg acggtgacgg cggggccttc caacccgaac gcgtacgaga 102840
cgcgcccgga gatgacgctc gcggtcgttc ccgccagggc gtagccctcg ctctcttggg 102900
ccgggttgcc tgtggcgctg tagccctgga aggttccgcc ggtgaagacg ccgacgttgc 102960
ttccgctgag ggcttcttgg gtgagtccgg cgttctcgaa cgtctcccat gcggtttcca 103020
ggagcagacg ttgctgcggg tccatggcca acgcttcacg cgggctgatg ccgaagaatc 103080
ccgcgtcgaa ttccggggcg tcgtagagga acgcgccctc gcgcacatag ctggtgccgg 103140
ggtgctcggg atcggggtgg tagaggttgt tcagatccca gtaacggttg gtgggcatac 103200
cggcgatcgc gtcgccgccg gacgccacca ggtcccaaag ctgctgcggg ttccgcacgc 103260
cgcccggata gcggcacgcc atcgccacga tcgcgatcgg ctcgtcgctc gccgccgccg 103320
tggcggtacg ggcggagccg cgggacgcgg cggcgggctg gtcgtccgcc agctcgttgt 103380
gcaggtactt ggcgagcgcg ttgggcgtcg gctggtcgaa gaccagtgtc cggggcagcg 103440
agagccccgt gctcgtgctg agctgcttgc tcagctcgac ggccgtcagc gagtcgaacc 103500
ccagctcctg gagcggctgc cccgccggga cggcgtcgat gctcgtgtgg ccgaggatcg 103560
acgccgcgtg ggtccggacg tgccgcagca gcaggtcgag ctgctgcttc ggcgcgctcc 103620
ccgccagctc ctgccgcagc gggctggcgc cgggcgaggc gtcccgccgg tcggcgtcgt 103680
ccgggtcggt ctcggcgagg tcgctcagga acgggctcgg gcgctgcgtc gtcaacgccg 103740
ccgggaactt ccgccagtgg atgtcggcga ccgtgaggga cgtctcgccc tgcgtgatcg 103800
cctcgtggag cgtcttcacg gccaggcccg ggtccatggg cgtcaggccg cgccgggcga 103860
agaacgcgat gacgttctcg tcggcggaca tgccggcctc gccccacggc ccccacgcca 103920
ggctggtgga gggcagcccg agacggcggc ggtgctcggc gagcgcgtcg aggtagacgt 103980
tcgcggcggc gtaggcgccc tgctgcccgc tgccccaggc cgcggcgccg gaggagaaca 104040
gcacgaacgc cgtcaggtcg aggtgccgcg tcaggtcgtg cagatgctcc gccgcctgcg 104100
acttcgaccg caggacctcg cccgcgcggg ccaggtccag gtcggtgatc ggggtgaggt 104160
ccgacatccc ggcggcgtgg aacacggcgg tgagcggatg ctcggcgggc acggcgtcga 104220
tgacgccctc cagcgcggaa cggtcggacg cgtcgcaggc ggcgagggtg acgcgggtcc 104280
cgagcgcttc gagctcctcg gctagctccg cggcgccggg ggcgtccggg ccgcggcggc 104340
tggtgagcac gaggtgcggc gcgccgttcc gcgcgagcca gcgggccagg tgcgtgccga 104400
gcccgcccgt gccgccggtg accagcgtcg tccccgtcgg ccgccagggc gcgtcgccgc 104460
cgtcgcgggc gggggcgcgg tccatgcggc gcgcgaacga cgcggtcgtc cggacggcca 104520
cctggtcctc gggcccgccg gacgccagca gcccggcgag ccgggcggcg gtgttccggt 104580
cgggcgcggc gggcaggtcg atcaggccgc cccaccggga cgggtactcc agggcggcga 104640
cgcgtccgag cccccacacc tgcgcctgcc gggggctggg cagcggatcg gtcggcgaca 104700
ccgcgaccgc gccctgcgtg aggcaccaca gcggcgcgcc cgcgccgccg tccccgtgcg 104760
cctggatcag cgcgatcgtg gcggccagcc cggccggaac cgcgggttcg tccggatgcg 104820
gcgacgcgtc cagcgccagc aggctgagga cgcccgccgg cggcgtctcg gcgctcgcct 104880
cggcgaggtc ccgcacgaac cgttcgcgcg tggcgcccgc ggtgtcgacg ggcagcagcc 104940
gcacgtcgcc gccgtggccg cggatcgcct gcacggcggt gtggacggcg gggctcccgc 105000
cgtcctcgtc cgcgggcgtg atgacgagcc acgtgccgga cagctccggg gcgcgctcct 105060
cccgcaggtg cttccacgtg atcttgtagc gccaggagtc caggacggtc cgctcgcggt 105120
ggcggcggcg ccactgcgcc aggatcggca gcgccggacg caggtcgtcc acggccgggc 105180
tgccgtcctc cagccgcagg gtggcgctca gcgcgtcgac gtccagctcc tcgatcgcgt 105240
gccacagctc gctctcggcc gggtcgtgcc cgtccccgcc cacgccggaa cgcgcggcgg 105300
gcgccagcca gaaccgctcg cgctggaacg ggtaggtggg caggtcggtg gtcgcgggcg 105360
gcggggacgc ggggaagcaa cgggtccagt cgacgggcgc ggcgagggtg tgggcgtgcg 105420
cgagcgcgcg ggtgagctgg gcgtggtccc cgtggtcgcg ccgcagcgtg ggcacgatcc 105480
cggccgcggc gccggcgtcc tcggcgatct gctccagcgc gagggtgagg atcgggtgcg 105540
ggctggcctc gacgaacgtg ccgtgcccgt cgtccagcag cgcgtggacg gcgtcggcga 105600
accggacggg ccggcacagg ttggcgaccc agtagtccgc gtccagcccg gacgtgtcga 105660
tgcgtgccgc ggtcaccgtc gagtagaacg ccacggacga ggcggcgggc tcgatcccgg 105720
ccagcgcctc gcgcagctcg tccgcgacct ctccgacgtg cgggccgtgc gaggcgtagt 105780
ccacgtcgat cagccgggcg cgcaggccgc gcgccttcac ccggtcgacc acgccggcga 105840
cctcgccggg cggacccgag accacgaccg aggtcgggcc gttcaccgcc gcgatcgtca 105900
cgccgccgtc gaggaggccc tccgcctcct ccgggccggt ttcgagcgag gccatggcgc 105960
ccgtgcccgc gaggcggcgc agggcccggc tgcgcagcgc cacgatcttc gcgccgtcct 106020
ccaggccgag cgcgcccgcg acgcacgcgg cgccgatctc gccctggctg tgcccgacga 106080
ccgccgcggg cgtgacgccg tgctcggccc acacggcggc cagcgacacc atgaccgccc 106140
acagcacggg ctggacgacg tccacccgtt cgaggtcggc ggcgccgtcg gccccgcgga 106200
gcacgtcggt gagcgaccag tcgacgtacg gcgcgagcgc ccgctcgcac tcggccatcc 106260
gcgcggcgaa cacgggggag gagtcgagca gctccgcgcc catgccgggc cactgcgaac 106320
cctggccggg gaacaccatc accggcccgt tctcggaatg cgccgccgcc actccgggat 106380
tgaccaggtt ggggtgctgg gcgcctgagg ccagcgcctt caggccgtcc atgagctggt 106440
cgcggtcttg gccgatgacg acggcgcggt gttcgaacgt cgagcgtgtg gtggccagtg 106500
accagccgac ttcgaccggt gagaagtcgt cgcgttcttc gaggtgcgtt acgagtgcgg 106560
cggcctggtc gcgcagtgcg ggcgcgttgc gcgccgacag cgcccacggc accaccccgc 106620
ccgcgacctc tgcgtccggg gccggggccg gtgccgccgg agccgtgtcg tccggctccg 106680
gtgcctcctc caggatcacg tgggcgttcg tcccggagat cccgaacgac gacacgcccg 106740
cgcggcgggg acggtcgccg ttccgccact cgaccgggtc ggtgaggagg cggacggcgc 106800
ccgcgtccca gtccacgtgc ggggacggct cgtcgatgtg cagcgacgcg gggaggcggc 106860
cgttccgcag cgccatcacc atcttgatga cgcccgcgac gcccgccgcc atctgggtgt 106920
gcccgatgtt ggacttgatc gacccgagcc acagcggccg gtcgtccggg cggccttgcc 106980
cgtacgtggc gaggagcgcc tgcgcctcga tcgggtcgcc gagcgccgtg ccggtgccgt 107040
gcgcctccag cgcgtccacg tcggcggggg agaggcgggc gtcggcgagc gcgtccgcga 107100
tcacgcgctg ctgggacggg ccgttcggcg cggtcaggcc gttgctcgcg ccgtcctggt 107160
tcatcgccga gccgcggatc acggccagca cccggtggcc cgcgcggcgc gcgtccgaca 107220
ggcgctccag cagcaccatc ccggcgccct cggccatggt catgccgtcc gcgcccgccg 107280
cgaacgactt ggaccggccg tccggggcga gcgcgcggag ctggctgaag ccgacgagcg 107340
gcgcggacga cgacatcacg aacaccccgc ccgccagcgc catgtcgcac tcgcgctgcc 107400
gcagcgcgcg cgccgcgagg tgcagggtga ccagcgacga ggagcacgcg gtgtccaccg 107460
tcaccgccgc gccctccagg ccgagcgcgt acgcgacgcg gccggagacg acgctcgcgg 107520
tgttgccgat cgtgaagtag cccgccgacc cctcgggcac ctgcgacggg tcggagtagt 107580
agtcgaggcg gtcgcagccg atgaacgtgc tggtcgggct cgaccgcagc gactcggggt 107640
cgatgcccgc gtgctcgacc gcctcccacg acgtctccag cgccagccgc tgctgcggcg 107700
ccatcgcgag cgcctcgcgg gggccgatgc cgaagaacga ggcgtcgaag ccggcgacgt 107760
cgtcgatgaa gccgccctcg cgcacgtagc tcgtgccggg gtgctcggga tcggggtcga 107820
agaggccgtc caggtcccag ttgcggtcgg cggggaagcc gccgacggcg tcccgcccct 107880
cggcgaccag gtcccacagc tccccgggcg accgcacgcc gccggggaag cggcaggcca 107940
tcccgacgat cgcgatcggc tcgtgcccga ccgactcgac gtccttcagc cgccgctgcg 108000
cctggcgcag gtccgtcgtc acgcgcttga ggtagtcgag gagcttctct tcgttactcg 108060
ccattgccgg cgcacccatt ttcctccgcg gcacccggcg gcccgcccgg tgcctccgct 108120
gacaggacag ctgctcgacg cgagcccgga aggcgcggac ccgctatttc cggcagtcgt 108180
agaagtgctg aacggagatc ccgccgggca cgctcgtcgt ctgcgtgttg gtgtagtggt 108240
tcctgctgag gatgaccatg tcgtccttga cgttcacctc gacgccctcg gcgttgcccg 108300
tcgggctggg cttcttcacc agtttcaccg ttccggtgaa gatcagatgg ccgttcttcc 108360
agccctccga ataggagttg ccctgcactc cccagtcgtc gtggtattcg ctcatgaacc 108420
gccggtcgac cgggttccag ccgaagaccc tgcggcccac gacgaggtcg ggcgtgacca 108480
tcgccgagtc gaggtagtgc ccgccgatgt cgcgcttggt ggtgatgtag ttctcgatct 108540
gctcgccgcc gccgggcggg gtgtccatgc acttgtagga gcccagcatg aagtcgaggg 108600
cgcgcatctg cggcggagcg gggagtttcg tgccgtggcc cggcggcctg cccgcgccgt 108660
ccgcgcgccc gccgtccgcc cggccgccgc cggtgtcggc gaccgccgcg accgcgccgg 108720
gcaccagcac gaccagcgcc gctcccgtgg cggcgagcgc ccaccgcgcc cgcggccccc 108780
gtgcggcccg gccctgctgc gcggcgttcc ggcgtgattc ccgcatgttc atcctcgctc 108840
cattgatggg acgttcgtcg tcgggccgcc tcatcatcgg aaccgccgct aaagctggac 108900
taacgcccgc ccccgcgata ggggccccta aacgcggcgc gccgcccgtg ccggacttag 108960
gggtacgggg cggcggcggg gccgcactat tctggcgccc gaacggcaga ccggatgccg 109020
cctgcccggt ttcaggaaac cccgagacgt gaattccccg accgaaggag acggtggccg 109080
tgcgggtcct gttcgtgacg ctcccgtgga agacgcacct tttcaatttc gtcccgctgg 109140
cctgggcgtt gcagaccgcg ggccacgagg tgcacgtggc cgccgagccc gacctgaccg 109200
gcgccgtcac cggcgcgggc ctcaccgcgg tccccgtcgg ctcgggtgag accatgggcg 109260
aacgggtccg gcgcgcctgg cgggagggca ccctgccgac gcccgaggag gcgcccccgc 109320
cgggccaccc cgtcgagctg tacgacctcg gcccggaccg gcagcggctg agctggcagg 109380
agctcaaccg catccacgac acgctcgtgg tgccgagggc gtggctgtcc aacgacacga 109440
tgttcgacga cctggtggcg tactgccggt cctggcgccc cgacctcgtc atctggaacg 109500
cggtgacctt cgccggctcc gtcgccgccg ccgcggtggg cgcggcgcac gcgcggttcc 109560
tgttctccgt cgacctctac agccggctgc gcgacgacca gctcaccgtg atggcgcgcc 109620
agccgccgca ggaccgccgg gacgggctga aggactggtt cgcgccgtgg gccgccaagt 109680
acggggtcga gttctccgag gagctggtca acgggcactt ctcgatcgac cagatgcccg 109740
cctcgttccg cctggacttc ccgcaccgga ccgtgtcgat gcggcacgtc ccctacaacg 109800
ggcccgccgt catccccgcg tggctggccg agccgccgcg cgcgccgcgc gtcctcatga 109860
cgttcggcgt ctcggtgcgg tcgtggcccg agctccaggt cgtccccgtc gaacggctcc 109920
gggagtccct cgactcgctc gccgacctgg acgtcgaggt cgtcgtgacc ctgcccgacg 109980
acgtgcgcgc cagcctgggc cgcgtcccgg cgaacgtccg cgtggtcgac ttcgtgccgc 110040
tgcacgccgt cctgcccacc tgctcggcgg tggtccacca cgggggagcg ggctcgttca 110100
acggctcgct gctgagcggc gtcccgcagc tgctggtcag cacggccctg gacgcgccgt 110160
tcaaggacca ccacctccgc gcggccgggg ccgggctggc gatcaccccc gaccggttca 110220
gcggcccgct gatcagggag aacgtcgtcc ggatgctgga ggaccccgcg ttccgcgcgg 110280
gcgccgagac cctgcgccgg gagatcatgt cctaccccgc gccgaacgcc ctcgtgcccg 110340
agctcgaaag gctcaccgcc gagcaccgcg cgtgacgcgc ctgagcggcg gccgggagga 110400
acgcccggcc gccgcccgcg agatccgtca ggtggcggcg gcgtccgtac ggcccgcgcc 110460
gccggccacc ccgcccgcgc cgctcggcac ccggacggcc gcgaggtcgt cggcgagagc 110520
ctggatcacg cgcgggccgt gctcgacgag gtagaagtgc cctccgggga acacccgcag 110580
gtcgaacggc ccggagctgt gctccgccca ggccagcgcc tcgtcccgcg tcacctgcgg 110640
gtcgcggtcg ccgacgagcg cgacgaccgg gcacctcagc gcggcgccgg gaacgtgccg 110700
gtaggtctcg acggcgtggt agtcgctgcg gacggcgggc aggatcatgt cccgcgcctc 110760
ggggtcgccg agcaggctgg agtcggtgcc gctgagccgt tgcagctcgg cgacgagccc 110820
cgcgtcgtcc agctcgtgca cgtgctcgga ccggtgcctg gacggcgcgc gccgccccga 110880
gacgtacagg cgcgcgagcg gctccgcgcc gtcgcgctcc agccgccgcg cgacctcgta 110940
ggcgagcacc gcgcccatgc tgtggccgaa gaacgccagc ggccggtcgg ccagcgcccg 111000
cacggccggg aggaccgcgt ccgccatctc cgggatggtg ccgatgttgg cctccgcgcg 111060
ccggtcctgg cggcccgggt actgcacggc ggcgacgtcc acggccgggg agagccgtgc 111120
cgagacgggg aagaagaagc tcgccgagcc gcccgcgtgc gggaagcaga cgaggcgcac 111180
gccgctgctc ggcgcctggt ggtagcggcg gatccacgct tcgaagtcgg tggacggctg 111240
tgtcatgacg ccctttcgct ctcgcacggg tcgttccctc ggcgggcggc cggcgtgctc 111300
gccgggcgtc ccgccggtgc gtacggagcc agctttcatg accgggaacg cgttgggtac 111360
ccgtaccgat cccctacgcc ccctcatcgg gggacgcagg ggacggggtg ccgggaggcc 111420
gccggacctc cgccggacgg agcaggttcg acggggtgaa cgcggtcgcg ggcggcctcc 111480
cggccgttct ggggggtcag gccgttccga tgtccctgcg ccgggcgccg acgaagccga 111540
cgagcgtcag cgcgagggag atggccagca gccagaccac gggcatcgcc gagaagtgcc 111600
cgccggggat gcgcggcgtg tgcgagaacg gcgacaggtc catcagcgcc tggtcgagct 111660
gcatggcctc gccgagctgc ccgagcagga agaacagcgc gagcgcgccc cacaccgcca 111720
gcgacagccg cggcaggatc ccgaacagcg cgagcgccag gcccgtcagc gtccacaccg 111780
ccgggagctg caccatggcg ccgccgagga tgcgcggcac ctggccgccg acgtccccga 111840
tgtccgcgcc gtgcacgagc ccggtcgtga cgcccgccgc cgcgagcgcg atggccgggc 111900
cgacggcggc gaacagcagg tggctcgcca cccacttcgc ccggctcacc gaggtcgcca 111960
gcacgtactc gccgtgcatg gcctgctcct ccgtgcgcag ccgcagcgcc gcctggatcc 112020
cgtacgcgga cgcgaccagc ccgaccgtcc ccatgatgga ggcgaggtag gcgtcgatca 112080
ggccctgctg gccgccgatc cgcaggaaca tgtccttcag cgtgtcgttg tcctccagca 112140
ggtccccgac gccctgggcg atgccgccga acaccccgcc gagcacgatc atcgcgacgg 112200
tccaggcgag cagcaggctc cgctggagcc gccacgcgag gacgatcggg ttgctcagcg 112260
acggcgcggc gtccgccggg ccgagccgcg ccgcgaggat ccccgcgccg atgtcccgcc 112320
gcgacgagaa cgcgaccgcg accgcgaccg ccgcggcggc gaacgcgacg gcgagcgcga 112380
ggacccacca gcgctcgtcg gcgaacgggc gcagcagcgt cacccagccg agcggcgaca 112440
gccacgacag ccacgtcagc ccgctgttgt cgccgccggt gtcggcggcc agccggacca 112500
ggaagaaccc gccgagcgcg gcgatcgaga ggccgcgcgc gctgcccgcg tactcggtga 112560
gctgcgccgc gacgcccgcg accgccgcga acgtgcagcc cgcgaccgcg aagcccagcc 112620
cgatcgcgaa cgcccctccg gccgggagct gcccgctcag gccgagcccg agcaggacgg 112680
cgaccaccag gttggcggtc atcgtcacga tcagcgcggc ggcgagctgc gcgtgccgcc 112740
cgaccacggt cgcggcgagc agctcgcggc gcccggcctg ctcgtcggtc cgggtgtgcc 112800
ggatgacggt caggatgctg accagcccga cgaagatcgg gatgaagccc gcgcgctgcg 112860
cgacgatgcc gccgaggctc acgtcgggca gcgggccgta cagcgcgagg aacgtcgggt 112920
tgctgccgct cgtcgcggcg tactcggcgc ggtcggcggc cgtcgggtac aggtccttgt 112980
aggtggacac gtagctgagc gggatcacca ccacccagaa catccacgcc ggcagcagcc 113040
acctgtccct gcgcaggatc agcctgacca gcttgccggt gcccgtgagc gggttcaccg 113100
cgccaccccc gcgttcgcgt cgcgcgcgcc gccgtccgcg gtgtagtgcc gcaggaacag 113160
ctcctccagc gtcggcgggc ggctgaccag gttgcgcacg cccgcgccgg tgagctgccg 113220
cagcgcctcg tccaggctcg ccgtctcgac ctcgcagctc acgtgcatgc cgtcgatctt 113280
gagctcgtgc acgcccggga cgtccgacag gcccgacggc gggcgcatca gctcggcgtc 113340
gatcgaggtg cgggtcaggt gccgcatctc gtcgagggtg ccggtctcca cgacccgccc 113400
ctcccggatg atgctcaccc ggtcgcacag cgcctcgacc tcggacagga tgtggctcga 113460
cagcaggacg gtgcggcccg cgctgcgctc ctcggcgatg gcctcgcgga acgtctcctc 113520
catcagcggg tcgaggccgg aggtcggctc gtccagcagc agcagctcga cgtcggaggc 113580
gagcgccgcc accagcgcga ccttctgccg gttgcccttg gagtaggttc tggtcttctt 113640
cctcgggtcg agctggaacc gctccagcag ctcggtcttg cgccgcttgt cgaccccgcc 113700
gcgcagccgg ccgagcaggt cgatcacctc gccgccggac aggcccggcc acaggttgac 113760
gtcgcccggc acgtacgcca ggcggcggtg cagcgcggtc gcgtcctgcc acgggtcgcc 113820
gtccagcagc ccgacctcgc ccgcgtccgc gtgcaggatg ccgagcagga cccggatggt 113880
cgtcgacttg cccgcgccgt tcggcccgag gaagccgtgc acctccccgg tgcgcacggt 113940
gaggtcgagg ccgtccagcg cgcgcgtcgg accgaacttc ttgaccagtc ccttcgccga 114000
gatggcgtca ctcatggcgg tgctcctctt gtgaatgggt gaagggtgcg cggttgccgg 114060
acccgctgcc gggctcgtcc agcccgtccc ggagcgaggc ggccgtctcc gggctgatga 114120
cgctgtggga gtggacgtcg agcagggcga gggcgacccg ccggtcaccc tcgtcggaga 114180
agatgtcggc gcccacggcg cgggagatgt gctggtggaa cgtcgggatg ccgagcgcca 114240
tggccaccat cagcgcggcc cggctcgccg cgtccgcgat gggcgggtcg ggccgctgcc 114300
ggtcggcctg gatcagcgac cgctcggtca tcgccaccag gtcgtcgaag atccgcgccg 114360
ccgcctcgga ctcgtcgatc agcgcccgcg ccaggtagcg ctggaacggg tgcagcgtgt 114420
gccgggcgtc catggtggag cgcggatggc tcagccggtc gccgttcagc gtctggtcgt 114480
agaagccgtg cagcgcctcc agcgcgtagg cgtcgcacgc ctggcgcagc gcctccttgg 114540
aaccgaagtg gtggcggacc agccccggtg acacgccggc ggcgcgggct atctcccgga 114600
tcgtcgcctg cgtgaacccg tactccgtga agtaccgcag cgccgcgtct ctgatccggg 114660
cgcgggcggt cagatcctcg aaggccgctc cgccacgttc tgccagcatc aggaagcccc 114720
ccgcttgtat cgtccgctat ctctctgtgc agagtacagt acacgtgtat aggaagctat 114780
agatggcgta cagccgaagc ccagacgaac gtccgcccgg cgggcgcgcg gcaggcgcgg 114840
gcggacgcgg gggacgggaa aaggggaagg cgcccggtcg ggtggaggcg gcgccctggg 114900
cgggcgcggc gtcctgaccg gggcgccttc gggcggtcaa cgaacgttcc ggctcaccgg 114960
ccggcggccg gggccgaacc gttcgcggcg ggggaccgcc ggtagagggc ctgccagaag 115020
taggtccagg gcacgggcca gccgtccgcc gcgcaggtcc acgccccgcc cggccggtac 115080
gcctcgatga agtcggggac ggccgcctcc acggcgcggc tgtcgttgat gtcgaccacc 115140
ttgcggagcg ggccggtctc catcgccatc cgcacgacct cgtcgccctg cgtgtggccg 115200
tccaggctct tgtccatgta cttgccgacg gggttcttga cgtagaacgc gtcgcagcgg 115260
cggtcgatca gcgcgaggta gtcccgcacg gtgcccgggg gcatctcggt gaacgagttg 115320
atgttgacgg ccaggtcgaa gcgccgcgcg ccgagcgcgt cctcgacctc gtcgaccgtc 115380
acgaacgtga tccggtcgaa ccgctcctcg tccagcacgg cgcgcaggta gctgcggcac 115440
aggccgagcg tgttgggcag gtccacgatg tggtatgagg cgaggtcgtg gttggacagc 115500
agcgtgtggc acgtccgccc gtagccggcg ccgatctcca ggacggcggc gcccgcgagg 115560
tcgagcccct tctcgacgaa cccgagctcc agcacggcct ggaggtagtc caggcacacg 115620
ctctcgccgt cgtaggtcac ggtgaacggg tcgcccacct cgcggttgcg gatcctgcgc 115680
agcttctccc actcggcggg accgagcccc gacgcgaggt tgtagaccag ggtcttgagg 115740
tacctgatcc cgttcgcctc cggattccag atggcgatct tgtggttggg ccgcccggat 115800
ttgaaatccg ccagctcggc ggacacatcc tcggtgaccc agtgtttgct aatgttctcc 115860
cactggaggc tgctctgata actgggcccc atatcgtcct cccggccgta ccttcggatt 115920
cagggaaagc gccaagaata tctcgcggcc gatgctaatc gagcgtccgg cgccgtctca 115980
aggccctatt ccgccgagcc gccccgctag gggcccctat ggcccgcggc cgttcccggc 116040
gggcggctct taggggtatg gggcgggagg cgcaggcatt acgttggtgc cgcagcgctc 116100
ggacggttat gcggaaagaa tcggtgcatt gcggattttt ttgtggcttc cagacctgtc 116160
gggcgcggcc cgccgcgcgg cgcgcgtccc ggccccggcg gggacgggcg gacggtgagc 116220
ggcgacgggc ttcacgtccg gctcggccgc acgggcctga ccgtgagccg gctcggcctc 116280
ggcacggtca acttcggcgg gcgcgtcgag gagcccgagg cgcaccggct gatggaccac 116340
gcgctcgcgt gcggcctcaa cctcctcgac accgccgact cctacggctg gcgggtccgc 116400
aagggccaca ccgaggaggt catcggccgc tggctggccc gcgggagccg ccgcgacgac 116460
gtcgtcctcg ccaccaaggt cgggtcggcg atggggcccg gcccgaacga ccgcggcctg 116520
tccgcccggc acatcgtcgc ggcgtgcgag gactcgctgc gcaggctcgg gaccgactgg 116580
atcgacctct accagctcca cagcatggac cgggccgtcc cgtgggacga gatctggcgc 116640
gccatggagc tgctggtggc ccaggggaag gtccggtacg tcggctcgtc caacgccgcc 116700
ggctgggacc tcgccgccgc gcaggagacc gccgcccgcc gcggcgcgcc gggcctggcc 116760
tccgagcagt gcctctacaa cctggtcgcg cggcaccccg aactggagat catcccggcg 116820
gcgcgcgcct acggcgtcgc ggtgctggtc tggtcgccgc tgcacggcgg gctgctcggc 116880
gggacgctgc gcaaggccgc cgagggcacg gcggtcaagt ccgcgcaggg ccgcgccgcc 116940
gccgcgctgg acgcccaccg cgagacgatc gccgagtacg agcggttctg cggcggcctc 117000
ggccgggacc cggccgaggt cggcctggcg tgggcgctgc gccgtccggg cgtgacctcg 117060
ctgatcatcg gcccgcggac gcccgagcac gtggacggcg cgctgcgggc gctggacaag 117120
cccctgtccg atccggagct cgaccggctc gacgcgctgt tcccgcccgt cgggcggggc 117180
gggcccgcgc cggacgcgtg gatcggctga ccgccccgac cccggaggag agagcacgca 117240
tgtccatcag ccagtacacc gatccgcgga tcgtcccgca cgagtcggac ggcgagcgcg 117300
aggcgcgcga gcgcctgacc cggctgctgt ccaagacccc gatccagccc gagtacctga 117360
tcgacaacct cgcggtctac ctgcgccggc accagctcag cgacctgctg tcgatggacg 117420
cgctctaccg cgaggtccgc gatgtgcccg gcgtgatcat ggagttcggg gtgctgcacg 117480
gacggcacct ggcgacgctc accgcgctgc gcgggatcta cgagccgtac aactcgctgc 117540
gccggatcat cgggttcgac accttcaccg ggttccccga catcagcccc gtggacgagg 117600
tcagcccgag cgcggtcgtc ggcaggttcg agacggcgcg ggactacccc gaccacctgc 117660
gcgaggtgct cgcggcgcac gagctgggcg agccgatggg ccacgtacag cgcacgatcg 117720
tcatgcaggg cgacgtgcgc gagaccgtgc cgcgctacct ggaggagaac ccgcagacgg 117780
tcatcgcgct cgcctacttc gacatggacc tgtacgcgcc gacccgcgag acgctggaga 117840
cgatccgccc gtacctgacc agggggagca tcgtggcgtt cgacgagctc gcccacccga 117900
agtggccggg cgagaccgcc gcgttccgcg aggtctgcgg ccaggacgcc accctccacc 117960
agatccccgg ccgcgagccc cccgtcatct acctgcgctg gaacgagtga cgcccgccgg 118020
cggcggggcc cgccgcgccg ccgccggccg ggccgttccg cgcgtacgac cgcacgcact 118080
gctcgtaggt ggggagcagg ccggccgcgg cggcctcgga cagggacggg gccgccgcgt 118140
ccttcgccga gcggatcggc ggcccggtca gcttccacgg cagggccagt tcggggtcca 118200
gcgcgtcgat gtcgatcatc gtgccgggga cgtactcgac cgagcacagg tagcacatgc 118260
aggtgtcgtc ggtcagcgcc aggaacgcgt gcccgatccc gtcgggcagg tagacggcgg 118320
tgcccgactc cgcgtcctgg cgggtgaccg cgtagtgccc gaacgtcggc gagcccacgc 118380
ggatgtccac ggcgatgtcg agcgccgccc cgcgcacgca cgtgacgagc ttcgcctggc 118440
cggggggcag cgcggtgccg tggatcccgc gcacggtgtt ccgcttggag accgagtagt 118500
tgacctgccg gatctccagc tcgtggcccg tgatgctccg cagggcctcc gctctgatcg 118560
cctcgtagaa gaagccgcgc gaatcggcga tgcgatccgg ctcgatccgg taggcgtcgg 118620
ggaccgtcat ctctgagatg cgcacagcaa acctcctcgc ggaacgccgc attttccgcg 118680
gccgcgctta agcgggacta aagggctcgg cggcgcgcgg ccgcacttcc aggccgttcc 118740
gggaaacgat gccaaatgcg cggctgaccg tgcgtgcctt aggggcccct gttttcctca 118800
taaacggttc gcggcctcgg gcggggccgc ggtaaagcga taccgtggtg aaccggatat 118860
gccctacaag ggaggcagtc gtgcccgatc ccgcccgtcc agcgccctta tccccgcggt 118920
ccccgcaatc cccgagagcc ccccggtccc cgcgcggacg cccggcgccg ggcgggcgct 118980
cgccgcgcgc ccgcgaggac accgccgcgc agcgcgcgat cgccgcctcg gccgccgcgc 119040
tcgaaggccg cgtgccgggc gtcgccgacg tccgcgcgtg gctgggcgag cgccgcgacg 119100
ccacccgctt ccaggtcggc cgcatcccgt tcgccgagct ggacggctgg tcgttcgacg 119160
gccccaccgg caacctcgtc cacgacagcg ggcgcttctt caccgtggag gggctgcgcg 119220
tcagcatcga cggcgaccgg ccccgggagt ggcagcagcc gatcatctgc cagttcgagg 119280
tcggcatcct cggcatcctc gccaagcggt tcgacggcgt cctgcacttc ctcatgcagg 119340
ccaagatgga gcccggcaac cccaacctgc tccagctctc gccgaccgtg caggccaccc 119400
gcagcaacta caccaaggtg caccgcggcg cgtcggtccg ctacctggag tacttcaccg 119460
gcggggagcc ggtgcgggtg ctcgccgacg tgctccagtc cgagcacggc gcctggttct 119520
gccgcaagcg caaccgcaac atgatcgtcg aggcgaccgg cgacgtcccc gagcacgagg 119580
acttccgctg gctgaccctc ggccagctcg ccgggctcct gcacctcgac aacgtcgtca 119640
acatggacgc ccggtccgtg ctcgcgtgca tgcccggcgc gggaggcccg accgccctgc 119700
actccgacgt cgagctgcgg tcctggatcg ccgccgaacg gtcccggcac atcgtcacga 119760
ccgagcgcgt cccccgcgcc gggctccggc actgggtgca ggggccctgg tcgatcgacc 119820
acgaggagcg ccggcacttc cgcgtcgtgg ccgtgtccgt gcgggcgggc aaccgcgagg 119880
tgacgcggtg gacccagccg ctgttcgagc cgtacgagac cgggatcgtc gcctacgtga 119940
cccgcacgtt cgagggcgtg ccgcacgtcc tcgcgcgcgc ccgcgtcgag ggcggcttcc 120000
tcgacaccgt cgagctcggc ccgaccgtcc aggcgatccc cggcaactac gcgcacgtgc 120060
ccgcgagcga gcggcccgcg ttcatcgacc tggtctcctc cgccgacccg tcccgcgtcc 120120
gctactcggc cgtccactcc gaggagggcg gccggttcct caacgccgag agccgctaca 120180
tggtggtcga ggccgacgag gacgaggccc cgatcgaccc gcccgccggg ttccgctggg 120240
tcaccccggc cagctcacca tgctcgccca gcacagccac tacctcaacg tgcaggcgcg 120300
cagcctgctc gcctgcctga acgccatggc cgcccgcggc gcggccggcg cggcgctcac 120360
accgtgatcc gccgcgcccg ctcctcgatc tcggacacca gccgcgcctg atgcaggctc 120420
cccgccccga acgcctccgg ctccccgccc gtacccgcct ccgcgcccgc gcccgggccc 120480
ggggcggtgc gggccgcgcc ggccgcgccg gccacgcgcg ccgcgcccac cgcgcgggcg 120540
aagaagcgcc ccaggttggc ccactggtgg tccgggggca gcgtgatctc ctcgcggtgg 120600
tcctgccgtt cgatccgtac gaccggctgg tacgccgcgg gaggcgtgaa cgcccggtcg 120660
gtcagcagcc gccccgcgct ccccgcgaac tcgcagcccg tccggtagga gtgccgcacc 120720
ccgaacgcga ggtcggccgt cacgccgccc ggcgtgcagg cgagggcgcg cccgccgacc 120780
accatgcccg tcccgccgtc cacgtgcagc acggcggcgg tcacgtcgag gtcgttgccg 120840
aggaagcgca gcgccgcgcg gatcgggtag acgccctgtt cgagcagcgc cccaccgccg 120900
agccccggta ggaactgcgt gttcccggcg ggcagcggcg ggatcgtgaa cgcactcgaa 120960
aagtcccgca actcaccgat cgcgccgtcc gcgagcatct tctgtacggc ggtgtgctgc 121020
gagtggtgca ggaacgtcac gttctccagc aggacgagcc cgcgcgtgcg cgcgaggcgc 121080
agcagccgcg cggtggtctc gtagtcggtg gtgaggggct tctccgcgag gacgtgcttc 121140
cccgcgtcca gggccttctc cgtccattcg gcgtgcagcg ccgaaggcag cgggatgtag 121200
acggcgtcga tgtcgtcgga ggcgagcagc gcgccgtacc ctcccacggc ggcgcagccg 121260
acctcggcgg cgaagctctc ggccctgccg gggtccctgc tcgccacggc ggtcacgtcc 121320
acgccgtccg cggcgaggag ggcggggacc gtgcggcggc gcgcgatgtc cgcgcagccc 121380
agcacgccca ggcgcagcgg cggttcctgt cgttccacgc cgtcaacgtg ccgcacgtcc 121440
tcgcgcccgg caagccccta ccgccgtgcg ggccgtcgcg gagggacgtc cgcgctggcc 121500
gtgggcgcgc gacgcccgcg agggctgggg cccctacccc ggcgggcgcc ctccggcggc 121560
ggcgcgaaaa tgacgtgcgt tcaccagggc gccgtcagta ccgttccaga 121610

Claims (9)

1.一种马度米星化合物的生物合成基因簇,所述马度米星化合物结构如式(I)所示,
Figure 985378DEST_PATH_IMAGE001
式(I),
其特征在于,所述生物合成基因簇包括:
Ⅰ型线性聚酮合成酶基因模块,所述Ⅰ型线性聚酮合成酶基因模块包括:madA1基因、madA2基因、madA3基因、madA4基因、madA5基因、madA6基因、madA7基因、madA8基因、madA9基因、madA10基因和madA11基因;
糖基合成基因模块,所述糖基合成基因模块包括:madG1基因、madG2基因、madG3基因、madG4基因、madG5基因、madG6基因、madG7基因和madG8基因;
修饰基因模块,所述修饰基因模块包括:madO1基因、madO2基因、madI基因、madE基因、madP1基因、madP2基因、madM1基因、madM2基因、madG9基因、madG10基因和madTE基因;以及
调节基因模块,所述调节基因模块包括:madR1基因、madR2基因、madR3基因、madR4基因、madT1基因、madT2基因、madT3基因和madU基因;
所述madA1基因编码SEQ ID NO:1所示的氨基酸序列;
所述madA2基因编码SEQ ID NO:2所示的氨基酸序列;
所述madA3基因编码SEQ ID NO:3所示的氨基酸序列;
所述madA4基因编码SEQ ID NO:4所示的氨基酸序列;
所述madA5基因编码SEQ ID NO:5所示的氨基酸序列;
所述madA6基因编码SEQ ID NO:6所示的氨基酸序列;
所述madA7基因编码SEQ ID NO:7所示的氨基酸序列;
所述madA8基因编码SEQ ID NO:8所示的氨基酸序列;
所述madA9基因编码SEQ ID NO:9所示的氨基酸序列;
所述madA10基因编码SEQ ID NO:10所示的氨基酸序列;
所述madA11基因编码SEQ ID NO:11所示的氨基酸序列;
所述madG1基因编码SEQ ID NO:12所示的氨基酸序列;
所述madG2基因编码SEQ ID NO:13所示的氨基酸序列;
所述madG3基因编码SEQ ID NO:14所示的氨基酸序列;
所述madG4基因编码SEQ ID NO:15所示的氨基酸序列;
所述madG5基因编码SEQ ID NO:16所示的氨基酸序列;
所述madG6基因编码SEQ ID NO:17所示的氨基酸序列;
所述madG7基因编码SEQ ID NO:18所示的氨基酸序列;
所述madG8基因编码SEQ ID NO:19所示的氨基酸序列;
所述madO1基因编码SEQ ID NO:20所示的氨基酸序列;
所述madO2基因编码SEQ ID NO:21所示的氨基酸序列;
所述madI基因编码SEQ ID NO:22所示的氨基酸序列;
所述madE基因编码SEQ ID NO:23所示的氨基酸序列;
所述madP1基因编码SEQ ID NO:24所示的氨基酸序列;
所述madP2基因编码SEQ ID NO:25所示的氨基酸序列;
所述madM1基因编码SEQ ID NO:26所示的氨基酸序列;
所述madM2基因编码SEQ ID NO:27所示的氨基酸序列;
所述madG9基因编码SEQ ID NO:28所示的氨基酸序列;
所述madG10基因编码SEQ ID NO:29所示的氨基酸序列;
所述madTE基因编码SEQ ID NO:30所示的氨基酸序列;
所述madR1基因编码SEQ ID NO:31所示的氨基酸序列;
所述madR2基因编码SEQ ID NO:32所示的氨基酸序列;
所述madR3基因编码SEQ ID NO:33所示的氨基酸序列;
所述madR4基因编码SEQ ID NO:34所示的氨基酸序列;
所述madT1基因编码SEQ ID NO:35所示的氨基酸序列;
所述madT2基因编码SEQ ID NO:36所示的氨基酸序列;
所述madT3基因编码SEQ ID NO:37所示的氨基酸序列;
所述madU基因编码SEQ ID NO:38所示的氨基酸序列。
2.根据权利要求1所述的生物合成基因簇,其特征在于,沿着基因的5’-3’端,所述madR3基因、所述madR2基因、所述madG1基因、所述madG2基因、所述madR1基因、所述madT3基因、所述madA1基因、所述madA2基因、所述madA3基因、所述madA4基因、所述madA5基因、所述madA6基因、所述madE基因、所述madG10基因、所述madM2基因、所述madP2基因、所述madA11基因、所述madM1基因、所述madP1基因、所述madA10基因、所述madO2基因、所述madI基因、所述madO1基因、所述madA9基因、所述madA8基因、所述madA7基因、所述madU基因、所述madG9基因、所述madTE基因、所述madT2基因、所述madT1基因、所述madR4基因、所述madG8基因、所述madG7基因、所述madG6基因、所述madG5基因、所述madG3基因和所述madG4基因依次串联。
3.根据权利要求1-2任一项所述的生物合成基因簇,其特征在于,所述生物合成基因簇的核苷酸序列如SEQ ID NO:39所示。
4.一种微生物,其特征在于,所述微生物具有权利要求1-3任一项所述的生物合成基因簇,所述微生物为放线菌。
5.根据权利要求4所述的微生物,其特征在于,所述微生物是通过如下方式获得的:
将权利要求1-3任一项所述的生物合成基因簇插入带有基因组整合位点的质粒载体中;
将携带所述生物合成基因簇的质粒载体通过三亲本方法导入受体微生物中;
任选地,所述带有基因组整合位点的质粒载体含有attP整合位点和整合酶基因int
任选地,进一步包括将被导入质粒载体的受体微生物进行扩增培养。
6.根据权利要求5所述的微生物,其特征在于,所述扩增培养是在ABB13培养基中进行的。
7.根据权利要求6所述的微生物,其特征在于,所述ABB13培养基中含有15 mg/L阿泊拉霉素。
8.一种制备马度米星化合物的方法,其特征在于,包括:
利用权利要求4~7任一项所述的微生物进行发酵处理,以便得到发酵后的微生物,所述微生物为放线菌;
将所述发酵后的微生物进行破壁处理,以便得到马度米星化合物;
任选地,所述破壁处理后进一步包括将破壁处理产物进行纯化处理;
任选地,所述发酵处理的时间为7-8天。
9.madTE基因作为修复功能基因在马度米星化合物的生物合成中的用途,所述马度米星化合物是通过微生物合成的,所述微生物过表达madTE基因,所述微生物具有权利要求1~3任一项所述的生物合成基因簇,所述微生物为放线菌,所述madTE基因编码SEQ ID NO:30所示的氨基酸序列。
CN201910838457.2A 2019-09-05 2019-09-05 马度米星化合物的生物合成基因簇及其应用 Active CN112442507B (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910838457.2A CN112442507B (zh) 2019-09-05 2019-09-05 马度米星化合物的生物合成基因簇及其应用

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910838457.2A CN112442507B (zh) 2019-09-05 2019-09-05 马度米星化合物的生物合成基因簇及其应用

Publications (2)

Publication Number Publication Date
CN112442507A CN112442507A (zh) 2021-03-05
CN112442507B true CN112442507B (zh) 2022-09-30

Family

ID=74733186

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910838457.2A Active CN112442507B (zh) 2019-09-05 2019-09-05 马度米星化合物的生物合成基因簇及其应用

Country Status (1)

Country Link
CN (1) CN112442507B (zh)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1996001901A1 (fr) * 1994-07-08 1996-01-25 Rhone-Poulenc Rorer S.A. Streptogramines et procede de preparation de streptogramines par mutasynthese
CN106916836A (zh) * 2015-12-24 2017-07-04 武汉臻智生物科技有限公司 化合物的生物合成基因簇及其应用
CN106916834A (zh) * 2015-12-24 2017-07-04 武汉臻智生物科技有限公司 化合物的生物合成基因簇及其应用
CN106916835A (zh) * 2015-12-24 2017-07-04 武汉臻智生物科技有限公司 化合物的生物合成基因簇及其应用
CN107523554A (zh) * 2017-10-24 2017-12-29 江南大学 一株分泌马杜霉素特异性单克隆抗体的杂交瘤细胞株ss0708及其应用

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1996001901A1 (fr) * 1994-07-08 1996-01-25 Rhone-Poulenc Rorer S.A. Streptogramines et procede de preparation de streptogramines par mutasynthese
CN106916836A (zh) * 2015-12-24 2017-07-04 武汉臻智生物科技有限公司 化合物的生物合成基因簇及其应用
CN106916834A (zh) * 2015-12-24 2017-07-04 武汉臻智生物科技有限公司 化合物的生物合成基因簇及其应用
CN106916835A (zh) * 2015-12-24 2017-07-04 武汉臻智生物科技有限公司 化合物的生物合成基因簇及其应用
CN107523554A (zh) * 2017-10-24 2017-12-29 江南大学 一株分泌马杜霉素特异性单克隆抗体的杂交瘤细胞株ss0708及其应用

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
GenBank登录号:MWK40690.1;NCBI;《NCBI GenBank》;20191223;第1-310位 *
GenBank登录号:WP_175251316.1;NCBI;《NCBI GenBank》;20200619;第1-301位 *
Ran Liu等.Genomics‑driven discovery of the biosynthetic gene cluster of maduramicin and its overproduction in Actinomadura sp. J1‑007.《Journal of Industrial Microbiology & Biotechnology》.2019,第47卷第275-285页. *

Also Published As

Publication number Publication date
CN112442507A (zh) 2021-03-05

Similar Documents

Publication Publication Date Title
DK2271666T3 (da) Nrps-pks-gengruppe og dens manipulation og anvendelighed
JPH09224686A (ja) プラテノリドシンターゼ遺伝子
CN107794286B (zh) 一种环脂肽类化合物生物合成基因簇及其激活方法与应用
CN101691575B (zh) 一种萨菲菌素的生物合成基因簇
CN111607603B (zh) Hangtaimycin生物合成基因簇及其应用
CN107540682B (zh) 曲张链丝菌素衍生物及其制备方法和应用
CN111378008B (zh) 脂肽类化合物Totopotensamides及其制备方法和应用
EP0929681A1 (en) Rifamycin biosynthesis gene cluster
CN110857447B (zh) 提高米尔贝霉素a3/a4或其衍生物产量的方法
CN112442507B (zh) 马度米星化合物的生物合成基因簇及其应用
CN114517175B (zh) 基因工程菌及其应用
KR101189475B1 (ko) 삼원환 화합물의 생합성을 담당하는 유전자와 단백질
CN101578268A (zh) 新的化合物及其生产方法
US20030175888A1 (en) Discrete acyltransferases associated with type I polyketide synthases and methods of use
KR102159415B1 (ko) Uk-2 생합성 유전자 및 그것을 사용한 uk-2 생산성을 향상시키기 위한 방법
US20030171562A1 (en) Genes and proteins for the biosynthesis of polyketides
CN106676115A (zh) 2’‑氯代喷司他丁和2’‑氨基‑2’‑脱氧腺苷生物合成基因簇及其应用
US20040091975A1 (en) Midecamycin biosynthetic genes
CN107164394B (zh) 一种非典型角环素类化合物nenestatin A的生物合成基因簇及其应用
CN107541523B (zh) 曲张链丝菌素生物合成基因簇及其应用
US20030113874A1 (en) Genes and proteins for the biosynthesis of rosaramicin
CN112359048B (zh) 一种吕宋肽菌素c的制备方法
CN101027395A (zh) 用于制备复合聚酮化合物的生物合成基因簇
CN1190444C (zh) 南昌霉素生物合成基因簇
KR20100089370A (ko) 슈도노카디아 오토트로피카의 신규한 폴리엔 생합성 유전자

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20220722

Address after: 430075 room d102, building B5, building b4-b8, Wuhan National Biological Industry (Jiufeng innovation) base, No. 666, Gaoxin Avenue, East Lake New Technology Development Zone, Wuhan, Hubei

Applicant after: Wuhan Hesheng Technology Co.,Ltd.

Address before: Building B5, R & D building, Wuhan Institute of biotechnology, 666 Gaoxin Avenue, Donghu Development Zone, Wuhan City, Hubei Province, 430075

Applicant before: WUHAN J1 BIOTECH Co.,Ltd.

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant