CN113677697A - 使用重叠的aav载体生产大型拟抗肌萎缩蛋白 - Google Patents

使用重叠的aav载体生产大型拟抗肌萎缩蛋白 Download PDF

Info

Publication number
CN113677697A
CN113677697A CN202080025191.0A CN202080025191A CN113677697A CN 113677697 A CN113677697 A CN 113677697A CN 202080025191 A CN202080025191 A CN 202080025191A CN 113677697 A CN113677697 A CN 113677697A
Authority
CN
China
Prior art keywords
leu
glu
gln
lys
ser
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202080025191.0A
Other languages
English (en)
Inventor
I·理查德
W·洛斯塔尔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Evry Wald Esson University
Institut National de la Sante et de la Recherche Medicale INSERM
Genethon
Universite D'Evry Val D'Essonne
Original Assignee
Evry Wald Esson University
Institut National de la Sante et de la Recherche Medicale INSERM
Genethon
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Evry Wald Esson University, Institut National de la Sante et de la Recherche Medicale INSERM, Genethon filed Critical Evry Wald Esson University
Publication of CN113677697A publication Critical patent/CN113677697A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • C07K14/4701Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
    • C07K14/4707Muscular dystrophy
    • C07K14/4708Duchenne dystrophy
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K35/00Medicinal preparations containing materials or reaction products thereof with undetermined constitution
    • A61K35/66Microorganisms or materials therefrom
    • A61K35/76Viruses; Subviral particles; Bacteriophages
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P21/00Drugs for disorders of the muscular or neuromuscular system
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • C12N15/86Viral vectors
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • C12N5/06Animal cells or tissues; Human cells or tissues
    • C12N5/0602Vertebrate cells
    • C12N5/0652Cells of skeletal and connective tissues; Mesenchyme
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2750/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
    • C12N2750/00011Details
    • C12N2750/14011Parvoviridae
    • C12N2750/14111Dependovirus, e.g. adenoassociated viruses
    • C12N2750/14141Use of virus, viral particle or viral elements as a vector
    • C12N2750/14143Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2750/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
    • C12N2750/00011Details
    • C12N2750/14011Parvoviridae
    • C12N2750/14111Dependovirus, e.g. adenoassociated viruses
    • C12N2750/14141Use of virus, viral particle or viral elements as a vector
    • C12N2750/14145Special targeting system for viral vectors
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2750/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
    • C12N2750/00011Details
    • C12N2750/14011Parvoviridae
    • C12N2750/14111Dependovirus, e.g. adenoassociated viruses
    • C12N2750/14151Methods of production or purification of viral material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2750/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
    • C12N2750/00011Details
    • C12N2750/14011Parvoviridae
    • C12N2750/14111Dependovirus, e.g. adenoassociated viruses
    • C12N2750/14171Demonstrated in vivo effect

Abstract

本发明涉及拟抗肌萎缩蛋白(QD)以及能够生产该蛋白的双AAV载体系统,所述拟抗肌萎缩蛋白在其N‑端部分具有CH1CH2H1R1R2R3H2R8R9的结构,优选地进一步包括R16和R17杆状区。

Description

使用重叠的AAV载体生产大型拟抗肌萎缩蛋白
本发明涉及基因治疗载体,其用于治疗或预防肌营养不良症,尤其是杜氏肌营养不良症(DMD)或贝克型肌营养不良症(BMD)。
本申请公开了利用重叠腺相关病毒(AAV)载体能够产生大量显示出重要功能域的活性截短抗肌萎缩蛋白。
背景技术
杜氏肌营养不良症(DMD)是最常见的进行性肌肉退行性疾病,大约每3500-5000名男婴就有一名受此影响。DMD是由X染色体上编码抗肌萎缩蛋白的基因缺失或突变引起的。抗肌萎缩蛋白(Dystrophin)是组装抗肌萎缩蛋白-糖蛋白复合物所必需的,并在肌肉纤维的细胞骨架和细胞外基质之间提供机械和功能联结。抗肌萎缩蛋白的缺失会引起纤维退化、感染、坏死以及肌肉被瘢痕和脂肪组织替代,导致进行性肌无力及呼衰和心衰引起的20岁到40岁间的过早死亡(Moser,H.,Hum Genet,1984.66(1):17-40)。
这种疾病的一种较轻型称为贝克型肌营养不良症(BMD),它与DMD的区别在于起病较迟、对轮椅支撑的依赖较晚和生存期更长。BMD通常对应于保留了阅读框的突变(MuntoniF等,Lancet Neurol,2003.2(12):731-40)。
对于DMD尚无治愈或治疗有效的方式(Rodino-Klapac,L.R.等,Curr NeurolNeurosci Rep,2013.13(3):332),对BMD也是如此。常规疗法仅限于支持性治疗,可部分缓解体征和症状,但不直接针对发病机制治疗,也无法逆转表型。
目前针对DMD正在开发多种治疗策略,包括体外基因疗法、细胞移植疗法、DMD无义突变的药物治疗和修复抗肌萎缩蛋白基因阅读框的外显子跳跃或基因编辑策略。以上所有治疗策略均存在需要克服的问题,包括有效率、不同肌肉群的针对性、传递优化、转基因的长期表达和潜在的免疫应答(Jarmin等,Expert Opin Biol Ther,2014.14(2):209-30)。
DMD的不同基因转移疗法旨在弥补抗肌萎缩蛋白的功能丧失,并提供使用单一药物治疗所有患者的可能性。为了防止肌肉退化,可能需要正常水平的大约30%的抗肌萎缩蛋白。
抗肌萎缩蛋白基因是人类基因组中已知最大的基因,跨越2.5Mb,约占人类整个X染色体的2%。它由79个外显子(全长cDNA:11,1kb)组成,编码3685个氨基酸、427kD的抗肌萎缩蛋白。抗肌萎缩蛋白由四个结构区定义(图1A)。分别是NH2末端的肌动蛋白结合域(外显子1-8)、中央杆状区(24个血影样蛋白的重复序列R1-24和4个铰链区H1-4;外显子9-62)、半胱氨酸(CR)富集区(外显子63-69)和羧基端(CT)结构域(外显子70至79)。
cDNA因过大而无法装配在已知的基因治疗载体系统中,尤其是在腺相关病毒(AAV)载体中,而AAV载体是根据AAV血清型的向性将基因有效转移至各个肌群中的一种有潜力的选择。AAV载体具有在分裂的(肌纤维和心肌细胞)和非分裂的(成熟肌管)肌肉细胞中表现出长期基因转导的潜力。
事实上,AAV的主要限制是它的装载能力被认为仅限大约5kb,即亲本病毒基因组的大小(Wu Z.等,Mol Ther.,2010,18(1):80-86;Lai Y.等,Mol Ther.,2010,18(1):75-79;Wang Y.等,Hum Gene Ther Methods,2012,23(4):225-33)。而较大的载体基因组意味着截短的包装基因组、大小分布范围广泛的异质基因组群和较低的表达效率(Wu Z.等,MolTher.,2010,18(1):80-86)。
为了克服AAV的DNA包装限制(<5kb),一些研究小组试图设计合成的截短但具备功能的抗肌萎缩蛋白(MD,也即“微抗肌萎缩蛋白”或“迷你抗肌萎缩蛋白”)。一系列微抗肌萎缩蛋白已被设计用于编码截短的抗肌萎缩蛋白,这些抗肌萎缩蛋白经过优化以包含对临床更重要的蛋白质区域。通常认为这些区域位于抗肌萎缩蛋白的N端和半胱氨酸富集区.
微抗肌萎缩蛋白包含24个血影样蛋白的重复序列中前3个和最后一个,不含C端结构域(ΔR4-R23/ΔCT),名为MD1(见图1B),显示出修复抗肌萎缩蛋白以及与互生蛋白(syntrophin)和肌营养蛋白(dystrobrevin)共同定位的高功能活性,但它未能在mdx小鼠(肌营养不良实验鼠)模型的肌膜处招募nNOS(Yue等,Mol Ther,2006.14(1):79-87)。
使用AAV2/8载体对营养不良的CXMDj犬进行试验,该载体编码序列优化的犬MD1微抗肌萎缩蛋白,肌肉特异性spc512启动子(AAV8-spc512-cMD1)驱动表达,(Koo等J.GeneMed.2011.13:497)实验取得了激励性的成果。局部给药可诱导肢体肌肉组织中高水平的微抗肌萎缩蛋白表达,并显著改善组织学和功能参数。不伴有免疫抑制的全身静脉给药引导骨骼肌中显著和持续水平的微抗肌萎缩蛋白,并在2年内减少营养不良症状(Le Guiner等,Nat Commun.2017.8:16105)。未观察到载体给药的毒性或免疫副反应。
然而,在肌肉功能中,缺失区域,例如R16-R17 nNOS结合位点和/或R8-R9 Par1b结合位点和/或R10-R17结合位点与F-肌动蛋白的相关性仍然存疑。
作为替代策略,已经提出依靠重组项目和使用双AAV载体系统来生产拟抗肌萎缩蛋白。如本领域已知的,双AAV系统的两个载体可以是重叠载体、反式剪接AAV载体或混合反式剪接AAV载体(参见例如Pryadkina等,Molecular Therapy,2015,2,15009)。
基于该策略,如图1C所示,Kodippili等(Human Gene Therapy,2018,29(3),299-311)报道了在杜氏肌营养不良症(DMD)犬模型中使用一对双AAV9载体表达犬ΔH2-R15迷你抗肌萎缩蛋白(SEQ ID NO:1)。更准确地说,第一个载体包含巨细胞病毒(CMV)启动子、融合到抗肌萎缩蛋白基因N端的Flag标签和ΔH2-R15迷你抗肌萎缩蛋白基因的5’部分(包括N端结构域、铰链1和3、以及血影蛋白样重复序列R1-R3、R16-R20和R21的一部分)。第二个载体包含ΔH2-R15迷你抗肌萎缩蛋白基因的3’部分(包括铰链3的一部分、血影蛋白样重复序列R20-R24、铰链4、半胱氨酸富集区和C端结构域)、融合到C端的GFP标签和SV40多聚腺苷酸化信号。两个载体共享一个375个核苷酸的抗肌萎缩蛋白基因片段(从铰链3的后部到R21的第一部分)。
无论如何,本领域仍然需要生产高水平的活性拟抗肌萎缩蛋白。
发明概述
本发明的目的在于,通过使用双AAV载体系统表达较短但具备功能性的抗肌萎缩蛋白多肽,也称为拟抗肌萎缩蛋白,来缓解或治愈毁灭性的杜氏肌营养不良症(DMD)以及贝克型肌营养不良症(BMD)。
据申请人所知,本发明首次报道了遗传工具,即双AAV载体系统,能够产生大量的表现出R16-R17 nNOS结合位点和R8-R9 Par1b结合位点的大型活性拟抗肌萎缩蛋白。这为治疗肌营养不良性疾病提供了新的治疗手段。
在过去的几年中,确实地发现了DMD病理是由肌纤维脆性和肌肉干细胞功能障碍引起的,这些病理机制损害肌肉再生并导致肌肉萎缩。卫星细胞功能障碍对DMD相关肌肉萎缩的影响是一个相对较新的发现,可以通过保留抗肌萎缩蛋白的关键结合位点和关键分子功能来改善目前基于AAV抗肌萎缩蛋白递送的策略。
Par1b,又称Marks,是一种丝氨酸-苏氨酸激酶,可与抗肌萎缩蛋白相关联并调节卫星细胞的极性,从而确保不对称分裂(Dumont等,Nat Med,2015,21(12):1455-63)。抗肌萎缩蛋白的缺乏会导致Par1b下调和不对称分裂受损,而这正是肌源性祖细胞正确生成所需的。因此,维持拟抗肌萎缩蛋白中的Par1b结合位点(R8-R9)对于保证卫星细胞极性和不对称分裂以产生肌源性祖细胞和有效的肌肉再生至关重要。
保留nNOS(神经一氧化氮合酶)的结合位点也很重要,因为众所周知抗肌萎缩蛋白介导的胞质跨膜蛋白nNOS的组装对肌膜中NO的产生是必需的,而NO的产生是控制骨骼肌血流调节的重要因素。(Ervasti,Biochim Biophys Acta,2007,1772(2):108-17)。
定义
冠词“一”和“一个”在本文中用于指代该冠词的一个或多个(即至少一个)描述对象。例如,“一个元件”是指一个元件或一个以上的元件。
如本文所用,“大约”或“近似”在提及诸如数量、时间段等可测量值时,意为涵盖指定值的±20%或±10%,优选为±5%,更优选为±1%,并且进一步优选地为±0.1%,这样的变动程度适合于执行本发明的方法。
范围:在本说明书全篇中,本发明的各个方面均可以以范围形式呈现。应理解,范围形式的描述仅仅是出于方便和简洁考虑,不应理解为对本发明范围界限的硬性限制。因此,应该认为对范围的描述已经具体公开了所有可能的子范围以及该范围内的各个数值。例如,1到6的范围描述应被理解为具体地公开了以下子范围:1到3、1到4、1到5、2到4、2到6、3到6等,以及公开了包含在该范围中的个体数值,例如1、2、2.7、3、4、5、5.3和6。对任意广度的范围均适用。
“分离的”是指改变或脱离自然状态。例如,天然存在于活体动物的核酸或肽不是“分离的”,但相同的核酸或肽被部分或整体地从与其天然状态共存的物质中分割出后即是“分离的”。分离的核酸或肽可以以基本纯化的形式存在,也可存在于非天然环境中,例如宿主细胞中。
在本发明的上下文中,常见的核酸碱基使用以下缩写。“A”指腺苷,“C”指胞嘧啶,“G”指鸟苷,“T”指胸苷,“U”指尿苷。
除非另有说明,否则“编码氨基酸序列的核苷酸序列”包括所有互为简并形式且编码同一氨基酸序列的核苷酸序列。鉴于某些形式的编码蛋白质的核苷酸序列可能包含内含子,术语编码蛋白质或RNA或cDNA的核苷酸序列也可以包括内含子。
“编码”是指诸如基因、cDNA或mRNA的多核苷酸中特定核苷酸序列的固有特性,用作生物过程中合成其他聚合物和大分子的模板,这些聚合物和大分子具有确定的核苷酸序列(即rRNA、tRNA和mRNA)或确定的氨基酸序列以及由此产生的生物学特性。因此,如果某基因对应的mRNA的转录和翻译在细胞或其他生物系统中产生某蛋白质,则该基因编码该蛋白质。编码链的核苷酸序列与mRNA序列相同,通常在序列表中提供,非编码链作为基因或cDNA转录的模板,此二者均可被描述为编码该基因或cDNA的蛋白质或其他产物。
本文所用的术语“多核苷酸”指核苷酸链。进一步地,核酸是核苷酸的聚合物。因此,本文所用的核酸和多核苷酸可以互换使用。本领域技术人员均知晓核酸是多核苷酸,其可以水解成单体“核苷酸”。单体核苷酸可以水解成核苷。本文所用的多核苷酸包括但不限于通过本领域可用的任何方式获得的所有核酸序列,所述方式包括但不限于重组方式,即用普通克隆技术和PCR等手段从重组文库或细胞基因组中克隆核酸序列,和合成方式。
如本文所用,术语“肽”、“多肽”和“蛋白质”可互换使用,是指由通过肽键共价连接的氨基酸残基组成的化合物。蛋白质或肽必须包含至少两个氨基酸,构成蛋白质序列或肽序列的最大氨基酸数量无限制。多肽包括任何含有两个或以上由肽键相连的氨基酸的肽或蛋白质。如本文所用,该术语既指短链也指较长的链,短链在本领域中通常也被称为,例如,肽、寡肽和寡聚体,较长的链本领域中通常被称为蛋白质,有许多种类。“多肽”包括,例如生物活性片段、基本同源的多肽、寡肽、同型二聚体、异二聚体、多肽的变体、修饰的多肽、衍生物、类似物、融合蛋白等。所述多肽包括天然肽、重组肽、合成肽、或其组合。
“相同”是指两个多肽或两个核酸分子之间的序列相似性或序列同一性。当两个比较序列中的某一位点被相同的碱基或氨基酸单体亚基占据时,例如,如果两个DNA分子中的某一位点都被腺嘌呤占据,则这两个分子在该位点是同源或相同的。两个序列之间的同源性/同一性百分比是两个序列共有的匹配位点数除以比较位点数×100的函数。举例而言,两条序列的10个位点中有6个匹配,则这两条序列60%相同。通常,比较时取两条序列比能获得最大同源性/同一性的匹配。
“载体”是包含分离的核酸并且可用于将分离的核酸递送至细胞内部的物质组合物。许多载体是本领域已知的,包括但不限于线性多核苷酸、与离子或两亲化合物相关的多核苷酸、质粒和病毒。因此,术语“载体”包括自主复制的质粒或病毒。该术语还应解释为包括促进核酸转移到细胞中的非质粒和非病毒化合物,例如聚赖氨酸化合物、脂质体等。病毒载体的示例包括但不限于腺病毒载体、腺相关病毒载体、逆转录病毒载体等。
“表达载体”是指包含重组多核苷酸的载体,该重组多核苷酸包含与要表达的核苷酸序列可操作地连接的表达控制序列。表达载体包含足够的用于表达的顺式作用元件;其他表达元件可以由宿主细胞提供或在体外表达系统中提供。表达载体包括本领域已知的所有表达载体,例如加入了重组多核苷酸的粘粒、质粒(例如裸露的或包含在脂质体中的)和病毒(例如慢病毒、逆转录病毒、腺病毒和腺相关病毒)。
如本文所用,术语“启动子”定义为启动多核苷酸序列的特异性转录所需要的,能被细胞的合成机制或引入的合成机制识别的DNA序列。
如本文所用,术语“启动子/调节序列”是指表达与启动子/调节序列可操作连接的基因产物所需的核酸序列。在某些情况下,该序列可以是核心启动子序列,而在其他情况下,该序列还可以包括增强子序列和基因产物表达所需的其他调控元件。例如,启动子/调节序列可以是以组织特异性方式表达基因产物的序列。
“组成型”启动子是一种核苷酸序列,当该序列与编码或指定某一基因产物的多核苷酸可操作地连接时,在细胞内的大部分或所有生理条件下都能使该基因产物在细胞中生成。
“诱导型”启动子是一种核苷酸序列,当该序列与编码或指定某一基因产物的多核苷酸可操作地连接时,基本上仅当细胞中存在对应于启动子的诱导物时才使该基因产物在细胞中生成。
“组织特异性”启动子是一种核苷酸序列,当该序列与编码某一基因或由某一基因指定的多核苷酸可操作连接时,如果细胞是与启动子对应的组织类型的细胞,则使基因产物优先在该细胞中产生。
术语“患者”、“受试者”、“个体”等在本文中可互换使用,其指代任何适用于本文所述的方法的动物或其体外和原位的细胞。在某些非限制性实施方式中,所述患者、受试者或个体是人。
“疾病”是动物的健康状态,所述动物不能维持体内状态平衡,如果疾病没有改善,则动物的健康状况持续恶化。相比之下,动物的“紊乱”是一种健康状态,在这种状态下,动物能够维持体内状态平衡,但动物的健康状况不如没有疾病时的健康状况。如果不加以治疗,紊乱不一定会导致动物健康状况进一步下降。
如果疾病或紊乱的症状的严重性、患者经历这种症状的频率或两者都降低了,则该疾病或紊乱被“减轻”或“改善”。“减轻”或“改善”这还包括阻止疾病或障碍的进展。如果疾病或紊乱的症状的严重性、患者经历这种症状的频率或两者都被消除了,则该疾病或紊乱被“治愈”。
“治疗性”治疗是对表现出病理征象的受试者进行的处理,目的是减少或消除这些征象。
如本文所用,“治疗疾病或病症”是指降低受试者经历的疾病或病症的至少一种体征或症状的频率或严重性。疾病和紊乱在本文对治疗的描述中可互换使用。
化合物的“有效量”是足以向施用该化合物的受试者提供有益效果的化合物的量。如本文所用,术语“治疗有效量”指足以或有效地预防或治疗(延迟或预防其发生、预防其进展、抑制、减轻或逆转)疾病或病症,包括减轻这些疾病的症状的量。运输载体的“有效量”是指足以有效地结合或运送化合物的量。
发明详述
在本发明的第一方面,提供了一种拟抗肌萎缩蛋白(QD),较佳地为功能性拟抗肌萎缩蛋白,更佳地为人源性拟抗肌萎缩蛋白。
本申请所涉及的拟抗肌萎缩蛋白是指较天然或野生型抗肌萎缩蛋白短的肽或蛋白质。本描述本发明时,“微抗肌萎缩蛋白”、“迷你抗肌萎缩蛋白”和“拟抗肌萎缩蛋白”意义相同。在本申请的其余部分,将使用术语“拟抗肌萎缩蛋白”,因为本发明的蛋白大小比先前公布的(例如由Yue等人(MD1)公布的)微抗肌萎缩蛋白大。
在具体的实施方式中,本发明的拟抗肌萎缩蛋白大小为全长抗肌萎缩蛋白大小(例如人抗肌萎缩蛋白全长3685氨基酸)的大于35%、40%、45%、50%、55%、或甚至大于60%。在优选的实施方式中,其具有全长抗肌萎缩蛋白大小的大于61.1%、62%、63%、64%或甚至65%。对人抗肌萎缩蛋白而言,这意味着本发明的拟抗肌萎缩蛋白优选的包含大于2361个氨基酸,更优选地包含大于2400个氨基酸,例如2406aa(WL2)或2427aa(WL1)。
抗肌萎缩蛋白的结构已有记载(见图1A),其活性片段也已披露。如本领域所应理解的,活性片段是指全长序列的一个部分或多个部分,其至少保留了全长序列的部分生物功能。
“功能性”截短的抗肌萎缩蛋白或拟抗肌萎缩蛋白是指其对应的肽或蛋白质能够至少表现出野生型抗肌萎缩蛋白的某些功能,并能够至少部分缓解与缺乏天然抗肌萎缩蛋白的一种或多种症状,尤其是纤维退化、感染、坏死以及肌肉被瘢痕和脂肪组织替代、肌无力、呼衰和心衰、以及过早死亡。
优选地,本发明的拟抗肌萎缩蛋白(或多或少地)表现出与已有技术中的微抗肌萎缩蛋白相关的至少一种特性,特别是Yue等人(Mol Ther,2006.14(1):79-87)或Kodippili等人(Human Gene Therapy,2018,29(3),299-311)披露的特性。
其中优选的特性包括:
-与至少一种DAP(“抗肌萎缩蛋白相关蛋白”)结合,特别是与互生蛋白、肌营养蛋白、nNOS和/或PAR-1b蛋白结合;
-在肌纤维膜上募集DAP复合体;
-修复微管网络;
-保护肌肉免受损伤;
-保持蛋白整体结构和血影样重复(R)域组织;
-恢复肌肉结构和功能;尤其适合于骨骼肌,同时也适合于心肌和膈膜;
-更通常地,改善肌肉功能、步态、心功能、呼吸功能、生存期、生存质量和/或期望寿命。
如本领域所知的,所述特性可以通过对多种表达抗肌萎缩蛋白的细胞进行体外实验来验证,如对iPS衍生的人DMD致病细胞的实验、从各种动物模型中分离出来的肌肉纤维的体外组织实验、或基于患DMD或BMD的动物模型甚至患者的体内实验。所述动物模型例如mdx小鼠(Foster H.等.,Mol Ther,2008.16(11):第1825-32页),mdx4cv小鼠(Decrouy等.,Gen Ther,1997.4(5):401-8),D2.B10-mdx/J小鼠(Coley等.,Human Molecular genetics,2016.25(1):130-45),CXMDj犬(Koo等.,J Gene Med,2011.13(9):497-506)或GRMD犬(LeGuiner等.,Mol Ther.,2014.22(11):1923-35)。小鼠模型通常用于测试编码微抗肌萎缩蛋白的新结构。然而,由于小鼠表现出的疾病不太严重,没有免疫反应,因此这一模型存在缺陷。另一种动物模型,犬,被认为在预测基因治疗产品在人类中的治疗潜力方面更加可靠。Larcher等人(Plos One,2014,9(10),e110371)披露的大鼠模型能表现出心肌病,因此也非常合适。
如上所述,全长抗肌萎缩蛋白(图1A)的特征在于各个结构域:
-结合肌动蛋白的N端结构域(CH1CH2);
-4个铰链区(H1到H4);
-24个血影样蛋白重复区或杆状区(R1到R24);
-半胱氨酸富集区;
-C端结构域。
在一实施方式中,本发明的拟抗肌萎缩蛋白相较全长抗肌萎缩蛋白至少有一个域缺失,优选至少一个血影样重复区(R)缺失。
在另一方面,本发明的拟抗肌萎缩蛋白包含至少一个关键蛋白质结合点位,尤其是F-肌动蛋白、nNOS蛋白和PAR-1b蛋白的结合位点。优选地,适合的拟抗肌萎缩蛋白包含nNOS和/或PAR1b的结合位点。nNOS结合位点表现在杆状区的重复序列16和17(R16、R17)(Lai等.,J.Clin.Invest.,2009.119:624-635),而体外实验证明了抗肌萎缩蛋白重复序列8和9(R8、R9)有结合PAR1b的位点(Yamashita等.,Biochem.Biophys.Res.Commun.,2010.391:812-817)。在优选的实施方式中,适合的拟抗肌萎缩蛋白包含所述R16-R17和/或R8-9的杆状区。
在该方面中,本发明涉及一种包含R8、R9、R16和R17杆状区的拟抗肌萎缩蛋白。
在另一方面,本发明的拟抗肌萎缩蛋白的特征在于缺乏或去除了R4、R5、R6和R7杆状区(ΔR4-R7)的N端部分。
换言之,本发明的拟抗肌萎缩蛋白具有如下的N端部分,顺序为从N端起按下列顺序:
-结合肌动蛋白的N端结构域(CH1CH2);
-H1铰链区;
-R1、R2和R3杆状区;
-H2铰链区;
-R8和R9杆状区。
这类拟抗肌萎缩蛋白在其N端部分显示结构CH1CH2H1R1R2R3H2R8R9。
在优选的实施方式中,这类拟抗肌萎缩蛋白进一步包含R16和R17杆状区。
在特定的实施方式中,本发明的拟抗肌萎缩蛋白至少包含现有技术中MD1微抗肌萎缩蛋白的血影样重复,即R1-R2-R3和R24。
优选地,本发明的拟抗肌萎缩蛋白较MD1微抗肌萎缩蛋白还包含杆状(R)区,优选选自下组的至少一个:R14、R15、R20、R21、R22和R23。在优选的实施方式中,所述拟抗肌萎缩蛋白还包含R22-R23,并可能包含R14-15或R20-21。
在具体的实施方式中,本发明的拟抗肌萎缩蛋白缺少下述杆状区:
R4到R7、R10到R15和R18-R19。换言之,这类拟抗肌萎缩蛋白包含R1到R3、R8-R9、R16-R17和R20和R24;或
R4到R7、R10到R13和R18到R21。换言之,这类拟抗肌萎缩蛋白包含R1到R3、R8-R9、R14到R17和R22到R24;
优选地,其进一步包含:
-完整的N端结构域,对应序列SEQ ID NO:2或3的1到252位氨基酸
-完整的半胱氨酸富集(CR)区,分别对应序列SEQ ID NO:2或SEQ ID NO:20的1822到2102位氨基酸,或SEQ ID NO:3的1801到2081位氨基酸;
-部分或全长C端结构域,优选全长C端结构域,分别对应序列SEQ ID NO:2或SEQID NO:20的2103到2427位氨基酸,或SEQ ID NO:3的2082到2406位氨基酸。可能的部分C端结构域为WO2016/177911中公开的MD1、MD2、MD3或MD4的截短C端结构域;
-可能地至少一个铰链(H)区,选自H1、H2、H3和H4构成的组,优选地至少H1、H2和H4,可能地为H1、H2、H3和H4。
在具体的实施方式中,本发明的拟抗肌萎缩蛋白为:
-ΔR4-R7ΔR10-R15ΔR18-R19拟抗肌萎缩蛋白(也称作WL1),优选地具有SEQ IDNO:2所示的序列;或
-ΔR4-R7ΔR10-R13ΔR18-R21拟抗肌萎缩蛋白(也称作WL2),优选地具有SEQ IDNO:3所示的序列;
在优选的实施方式中,本发明的拟抗肌萎缩蛋白为:ΔR4-R7ΔR10-R15ΔR18-R19或ΔR4-R7ΔR10-R13ΔR18-R21拟抗肌萎缩蛋白。在另一优选的实施方式中,本发明的拟抗肌萎缩蛋白包含或由SEQ ID NO:2或SEQ ID NO:3所示的序列构成。
在一实施方式中,所述拟抗肌萎缩蛋白与公开的拟抗肌萎缩蛋白,特别是序列如SEQ ID NO:2或SEQ ID NO:3所示的是“基本相同”的,即大约60%相同,优选地大约70%相同,更优选地大约80%相同,更进一步优选地大约90%、91%、92%、93%、94%、95%、96%、97%、98%或再进一步优选地大约99%相同。
在特定的实施方式中,本发明的ΔR4-R7ΔR10-R15ΔR18-R19拟抗肌萎缩蛋白在其编码序列中至少存在以下两种突变之一(与SEQ ID NO:2所示序列相比):
-第49位丝氨酸取代精氨酸(R→S);和/或
-第748位丝氨酸取代苯丙氨酸(F→S)。
在具体的实施方式中,本发明的ΔR4-R7ΔR10-R15ΔR18-R19拟抗肌萎缩蛋白包含或由SEQ ID NO:20所示的序列组成。
本发明的另一方面涉及一种核酸序列,所述核酸序列编码上述的拟抗肌萎缩蛋白。
在一实施方式中,所述编码本发明的拟抗肌萎缩蛋白的核酸序列包含或由SEQ IDNO:4所示的序列构成。
在一实施方式中,所述编码本发明的拟抗肌萎缩蛋白的核酸序列与SEQ ID NO:4所示的序列是“基本相同”的,即大约60%相同,优选地大约70%相同,更优选地大约80%相同,更进一步优选地大约90%、91%、92%、93%、94%、95%、96%、97%、98%或再进一步优选地大约99%相同。
在优选的实施方式中,本发明的核酸序列包含或由SEQ ID NO:5或SEQ ID NO:22所示的序列构成。
在另一实施方式中,所述编码本发明的拟抗肌萎缩蛋白的核酸序列包含或由SEQID NO:21所示的序列构成。
在一实施方式中,所述编码本发明的拟抗肌萎缩蛋白的核酸序列与SEQ ID NO:21所示的序列是“基本相同”的,即大约60%相同,优选地大约70%相同,更优选地大约80%相同,更进一步优选地大约90%、91%、92%、93%、94%、95%、96%、97%、98%或再进一步优选地大约99%相同。
在优选的实施方式中,本发明的拟抗肌萎缩蛋白是使用所谓的双AAV载体系统产生的。
本发明的表达系统通常由2个AAV载体组成。在基于2个AAV载体共有的重叠区域进行体内重组后,能够产生大量拟抗肌萎缩蛋白。因此,编码拟抗肌萎缩蛋白的核酸序列可分为两部分,即编码其N端部分的5’序列和编码其C端的3’序列。5’序列的3’端和3’序列的5’端是同源的,甚至是相同的,以便进行同源重组。
在本发明的另一方面,涉及包含2个AAV载体的双AAV载体系统,其中
第一AAV载体包含在5’和3’AAV ITR之间的第一核酸序列,该序列编码拟抗肌萎缩蛋白的N端部分,
第二AAV载体包含在5’和3’AAV ITR之间的第二核酸序列,该序列编码拟抗肌萎缩蛋白的C端部分,
其中所述第一和第二核酸序列包含重叠区,所述重叠区允许通过重组生产本发明的拟抗肌萎缩蛋白。
如下文所述,在本发明的框架内,术语“重叠区”(或“重叠序列”)和“同源区”(或“同源序列区”)具有相同含义,可互换使用。
换言之,本发明涉及一种组合物,所述组合物包含重组的腺相关病毒(AAV)载体,优选两个载体,所述载体携带有互补结构,允许表达本发明的拟抗肌萎缩蛋白。
在本发明的框架内,术语“组合物”可以被“结合物”、“组合”或“表达载体”替代。其意指两个AAV载体共同工作且必须互相接触,以形成同源重组然后产生活性蛋白。不过,他们可以处于一个单独的组合物中,也可能处在两个分开的组合物中并在使用前混合。
在特定的实施例中,第一腺相关病毒(AAV)载体包括:
i)AAV 5’ITR(反向末端重复)序列;
ii)由启动子控制的基因部分;
iii)AAV 3’ITR序列。
此外,第二腺相关病毒(AAV)载体包括:
iv)AAV 5’ITR(反向末端重复)序列;
v)优选地连有多聚腺苷酸化信号的基因部分;
vi)AAV 3’ITR序列。
这两个AAV载体具有互补序列,该互补序列在重组时将形成一个功能单元。如技术人员所知,重组是由于细胞DNA修复途径,通过识别每条AAV载体上的同源序列发生的。
因此,所述的两个AAV载体的基因部分需要满足下列条件:
-所述第一和第二AAV载体的基因部分共同含有编码本发明拟抗肌萎缩蛋白(优选为人源)的开放阅读框;
-所述第一和第二AAV载体的基因部分都含有同源区,所述同源区在同源重组后能够允许所述开放阅读框的重建。
“双AAV系统”意指由两个AAV载体构成的载体系统,所述系统中每个载体都携带有编码需要运送到细胞中的本发明拟抗肌萎缩蛋白的序列的一部分,编码所述拟抗肌萎缩蛋白的开放阅读框(ORF)由第一和第二核酸序列间的相互作用重组入细胞中。本发明的双载体系统使用包含允许(同源)重组的序列的载体,即重叠载体。因此,通过在导入每个AAV载体的抗肌萎缩蛋白基因上加入一段合适的重叠区来进行同源重组,以重组拟抗肌萎缩蛋白。
在本发明中,系统中每个腺相关病毒(AAV)载体都包含一表达结构,也称为“表达盒”或“插入物”。本申请中所述的“插入物”优选地定义为位于AAV基因组5’和3’ITR(“反向末端重复”)序列间的核酸序列。
如本领域所公知,所述插入物的大小不应太多地超过野生型AAV基因组长度。例如,AAV2包含2个各145bp的ITR序列,并含有4682bp(包括ITR序列)的基因组。
在特定的实施方式中,导入每个AAV载体的编码部分拟抗肌萎缩蛋白的核酸序列的长度小于5kb,例如小于4.9、4.8、4.7、4.6或4.5kb。
为了获益和限制AAV包装的大小,所述核酸序列对应于外显子。换言之,所述核酸序列优选为cDNA片段。
为获益,由两AAV结合得来的阅读框编码拟抗肌萎缩蛋白(MD),优选地编码功能性的拟抗肌萎缩蛋白,更优选地编码人源性拟抗肌萎缩蛋白,如前所述。
在优选的实施方式中,由所述双AAV载体系统生产的拟抗肌萎缩蛋白包含至少2000个氨基酸(aa),优选至少2100aa、2200aa、2300aa、2400aa或2500aa。在另一实施方式中,由所述AAV载体生产的微抗肌萎缩蛋白包含不超过3000个氨基酸(aa),优选不超过2900aa、2800aa、2700aa、2600aa、2550aa或2500aa。在优选的实施方式中,由所述双AAV载体系统生产的拟抗肌萎缩蛋白包含介于2400个到2500个aa,即其大小对应于全长抗肌萎缩蛋白大小的约70%。
在本发明的双载体系统中,所述第一和第二核酸序列具有一序列同源区来促进分子间同源重组,由此通过重组双AAV系统中的两个载体来生产大型拟抗肌萎缩蛋白转基因,也即分别地编码拟抗肌萎缩蛋白N端部分和C端部分的核酸序列。在实行所述重叠系统的特定实施方式中,序列同源区的长度可以在较大范围内变化,只要生成的插入物(包括5’-和3’-ITR序列、和表达控制序列)的大小满足封装入AAV载体的大小限制。本领域技术人员很清楚该大小限制,并且能够基于该知识调整所述第一和第二核酸序列的大小,从而调整序列同源区的大小。因此,在特定的实施方式中,所述序列同源区是抗肌萎缩蛋白基因的多核苷酸序列,其长度为小于4599个核苷酸,例如小于4500、4000、3500、3000、2500、2000、1500或1000个核苷酸。在另一特定的实施方式中,所述序列同源区是拟抗肌萎缩蛋白基因的多核苷酸序列,其长度为至少100个核苷酸,例如至少100、200、300、400、500、600、700或800个核苷酸。在另外的特定实施方式中,所述序列同源区是拟抗肌萎缩蛋白基因的多核苷酸序列,其长度包括在100到1000个核苷酸之间,特定地在500到1000个核苷酸之间,例如在700到900个核苷酸之间,优选地在750到850个核苷酸之间。
在另一实施方式中,所述同源区包含至少一个(1)完整血影样区域(R),优选两个(2)完整区域,可能还包括另外两个(2)截短的区域,和可选的一个(1)铰链区(H)。它们可能对应于天然抗肌萎缩蛋白上的相邻区域。优选地,它们位于天然抗肌萎缩蛋白的中心区,例如位于R14和R21之间。
在具体的实施方式中,所述重叠区对应于编码下述部分的序列:
-截短的R16、R17、H3和截短的R20,或
-截短的R14、R15、R16和截短的R17;或
-截短的H3、R20和R21。
在特定的实施方式中,所述第一核酸序列可编码抗肌萎缩蛋白的N端结构域、H1、R1到R3、H2、R8、R9、R16、R17、H3和截短的R20,所述第二核酸序列可编码抗肌萎缩蛋白的截短的R16、R17、H3、R20到R24、H4、CR区和C端结构域(所述抗肌萎缩蛋白为如前所述的全长的或截短的,优选全长的)。
在具体的实施方式中,所述第一核酸序列包括或由SEQ ID NO:6或SEQ ID NO:23所示的序列组成,所述第二核酸序列包括或由SEQ ID NO:7所示的序列组成。序列SEQ IDNO:12的800bp的重叠区对应SEQ ID NO:6(或SEQ ID NO:23)所示序列的3779到4578位核苷酸和SEQ ID NO:7所示序列的173到972位核苷酸。
在另一实施方式中,所述第一核酸序列包含SEQ ID NO:6(或SEQ ID NO:23)所示序列的646到4578位核苷酸,所述第二核酸序列包含SEQ ID NO:7所示序列的173到4323位核苷酸。
在具体的实施方式中,所述第一核酸序列包括或由SEQ ID NO:8所示的序列组成,所述第二核酸序列包括或由SEQ ID NO:9所示的序列组成。800bp的重叠区对应SEQ ID NO:8所示序列的3798到4597位核苷酸和SEQ ID NO:9所示序列的193到992位核苷酸。在另一实施方式中,所述800bp的重叠区序列如SEQ ID NO:10、SEQ ID NO:11或SEQ ID NO:12所示。
在另一实施方式中,所述第一核酸序列包含SEQ ID NO:8所示序列的665到4597位核苷酸,所述第二核酸序列包含SEQ ID NO:9所示序列的193到4343位核苷酸。
在另一实施方式中,所述第一核酸序列可编码抗肌萎缩蛋白的N端结构域、H1、R1到R3、H2、R8、R9、R14、R15、R16和截短的R17、所述第二核酸序列可编码抗肌萎缩蛋白的截短的R14、R15、R16、R22到R24、H4、CR区和C端结构域(所述抗肌萎缩蛋白为如前所述的全长的或截短的,优选全长的)。
在具体的实施方式中,所述第一核酸序列包括或由SEQ ID NO:13所示的序列组成,所述第二核酸序列包括或由SEQ ID NO:14所示的序列组成。800bp的重叠区对应SEQ IDNO:13所示序列的3701到4500位核苷酸和SEQ ID NO:14所示序列的182到981位核苷酸。在另一实施方式中,所述800bp的重叠区序列如SEQ ID NO:15、SEQ ID NO:16或SEQ ID NO:17所示。
在另一实施方式中,所述第一核酸序列包含SEQ ID NO:13所示序列的646到4500位核苷酸,所述第二核酸序列包含SEQ ID NO:14所示序列的182到4347位核苷酸。
在另一特定的实施方式中,所述第一核酸序列可编码抗肌萎缩蛋白的N端结构域、H1、R1到R3、R16、R17、R18、R19、H3、R20和R21、所述第二核酸序列可编码抗肌萎缩蛋白的截短的H3、R20到R24、H4、CR区和C端结构域(所述抗肌萎缩蛋白为如前所述的全长的或截短的,优选全长的)。
在具体的实施方式中,所述第一核酸序列包括或由SEQ ID NO:18所示的序列组成,所述第二核酸序列包括或由SEQ ID NO:19所示的序列组成。375bp的重叠区对应SEQ IDNO:18所示序列的4091到4465位核苷酸和SEQ ID NO:19所示序列的193到567位核苷酸。
编码拟抗肌萎缩蛋白的核酸序列优选为人源的,但也可以是犬的、大鼠的、小鼠的或非人灵长类动物的序列。在一实施方式中,所述核酸序列来源于将要被施用于的生物,优选地为人源序列以便施用于人体。
在本发明中,所述序列同源区,即重叠区,可以是经过优化以进行重组的序列。此类序列包括如SEQ ID NO:10、SEQ ID NO:11或SEQ ID NO:12所示的序列,是从对应的野生型序列(SEQ ID NO:9所示序列的193到992位核苷酸)优化而来的,或如SEQ ID NO:15、SEQID NO:16或SEQ ID NO:17所示的序列,是从对应的野生型序列(SEQ ID NO:14所示序列的182到981位核苷酸)优化而来的。
根据已知的方法,存在不同的方法优化编码蛋白质的序列,以增加mRNA水平(重组和转录)和/或蛋白质水平(翻译)。在本发明中,本发明的双AAV载体系统中载体的序列优选地经过优化以促进重组,并可能增加拟抗肌萎缩蛋白多肽的体内表达。
在实施中,可优化下述序列:
-编码抗肌萎缩蛋白N端部分的第一核酸序列;
-编码抗肌萎缩蛋白C端部分的第二核酸序列;和/或
-位于第一和第二核酸序列间的重叠区。
在特定的实施方式中,抗肌萎缩蛋白的非重叠序列和重组区都经过了优化。序列优化可包含核酸序列的许多变化,包括密码子的优化、GC含量的增加、CpG岛数量的减少、选择性开放阅读框(ARF)数量的减少和/或剪接供体和剪接受体位点数量的减少。由于遗传密码的简并性,不同的核酸分子可能编码同一蛋白质。众所周知,不同生物的遗传密码往往倾向于使用编码相同氨基酸的几个密码子中的一个,而不是其他密码子。通过密码子优化,向核苷酸序列中引入变化,该变化利用了给定细胞环境中存在的密码子偏性,使所得到的密码子优化核苷酸序列与非密码子优化序列相比,更有可能在该给定细胞环境中以相对较高的水平表达。在本发明优选的实施方式中,此类编码拟抗肌萎缩蛋白且经过序列优化的核酸序列和编码同一蛋白的非密码子优化的核酸序列相比,经密码子优化的方式(例如利用人类特定的密码子使用偏性)促进了它在人细胞中的表达和稳定性。在特定的实施方式中,拟抗肌萎缩蛋白的完整序列,即非重叠和重叠区,都经过了优化以促进其在目的或宿主细胞中的产生,尤其是在人体中。
在本发明的另一方面,涉及一种AAV载体,所述AAV载体对应本发明双AAV载体系统的第一AAV载体或第二AAV载体。
每个AAV载体都包含编码抗肌萎缩蛋白基因的相关部分(分别为N端和C端部分)的核酸序列,所述序列优选地如上文所定义,但也包括在整个基因重组后正确表达所述拟抗肌萎缩蛋白所需的所有序列。
在一实施方式中,所述第一和第二AAV载体的第一和第二核酸序列受到调控序列的控制。优选地,所述第一核酸序列前是启动子,后面是可选的内含子,第二核酸序列后面是一段多聚腺苷酸化信号。
该启动子可以是天然的或合成的(人工的)启动子,可以是诱导型或组成型启动子。
在一实施方式中,所述启动子是普遍存在的启动子,或具有低组织特异性。举例而言,所述表达载体可含有磷酸甘油酸激酶1(PGK)、EF1、ACTA1、β-肌动蛋白、肌间线蛋白(Desmin)、所有MCK变体、心肌肌钙蛋白和CMV启动子。
在优选的实施方式中,启动子序列被筛选以充分调控置于其控制下的核酸序列的表达,包括调控表达水平和组织特异性。
在一实施方式中,所述表达载体包含肌肉组织特异性启动子。该启动子能够在骨骼肌、膈肌、可能的心肌和卫星细胞中强表达。技术人员所知的合适启动子如肌间线蛋白启动子、肌酸激酶(MCK)启动子、截短型肌酸激酶启动子,例如CK6、CK7或CK8启动子、Syn启动子、MyoD、Myf5、Vcam、Pax3和Pax7卫星细胞启动子。另一种启动子是合成启动子C5-12(spC5-12)。也可以使用包含来自两个或更多转录调控元件的序列的杂交启动子(参见例如PCT/EP2019/053061)。
优选地,将第一核酸序列置于肌肉特异性启动子的控制下。换言之,第一AAV载体进一步包含肌肉特异性启动子,该启动子可操作地连接到编码抗肌萎缩蛋白N端部分的核酸序列上。
如本领域所知,有待引入一个或另一个AAV载体中的其他可能的调控序列包括但不限于:
-多聚腺苷酸化信号,优选地位于编码功能性微抗肌萎缩蛋白的序列的3’端;
-转录稳定序列,例如内含子;
-增强子序列;
-miRNA靶序列,其可抑制编码功能性抗肌萎缩蛋白的序列在非靶组织中的表达,其中所述表达是非期望性的,例如在其可能有毒的情况下。
在一方面,所述内含子选自下组:一种人β珠蛋白b2(或HBB2)内含子、FIX内含子和鸡β-珠蛋白内含子,其中所述内含子可选地为修饰的内含子,例如修饰的HBB2内含子、修饰的FIX内含子或修饰的鸡β-珠蛋白内含子。
在另一实施方式中,所述多聚腺苷酸化信号选自人β珠蛋白多聚腺苷酸化信号、牛生长激素多聚腺苷酸化信号、SV40多聚腺苷酸化(pA)信号或另一自然发生或人工多聚腺苷酸化信号中选择。
在特定的实施方式中,本发明的双AAV载体系统包括下列元件中的至少一个,优选地包括全部:
-spC5-12启动子,优选地对应SEQ ID NO:6(或SEQ ID No:23)所示序列的173到506位核苷酸;
-嵌合内含子,优选地对应SEQ ID NO:6(或SEQ ID No:23)所示序列的507到639位核苷酸;和
-SV40 pA信号,优选地对应SEQ ID NO:7所示序列的4324到4545位核苷酸。
为了克隆和生产病毒颗粒,可以将表达构造插入适合于选择、复制和生产拟抗肌萎缩蛋白的质粒中。
本发明所述的包含表达构建物的病毒载体是腺相关病毒(AAV)载体。
腺相关病毒(AAV)载体已成为治疗各种疾病的强有力的基因传递工具。AAV载体具有许多特性,包括低致病性、较轻的免疫原性以及以稳定有效的方式转导有丝分裂后细胞和组织的能力,这些特性使其非常适合基因治疗。通过选取合适的AAV血清型、启动子和传递方法,AAV载体中包含的特定基因的表达可以被特定地靶向一种或多种类型细胞。
在一实施方式中,编码序列被包含在AAV载体中。目前已知的天然存在AAV血清型超过100种。AAV衣壳存在许多天然变种,使我们能够识别和选自拥有特定的适合于抗肌萎缩蛋白疾病的特性的AAV。可以用常规的分子生物技术改造AAV病毒,使这些病毒得到优化,以向特定细胞输送核酸序列、最小化其免疫原性、调整稳定性和病毒寿命、使其有效降解、准确输送到细胞核。
根据上文提到的,使用AAV是DNA外源传递的常见模式,因为它相对无毒性、能高效地传递基因,并且可以方便地针对特定目的进行优化。在分离自人或非人灵长类(NHP)的特征良好的AAV血清型中,人血清型2是第一个被开发用作基因转移载体的AAV。目前使用的其他AAV血清型包括AAV1、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10、AAV11和AAV12。另外,也可以使用非天然的工程化变种和嵌合或杂交AAV。
用于组装载体的理想的AAV片段包括衣壳蛋白,所述衣壳蛋白包括vp1、vp2、vp3和可变区,病毒复制相关蛋白,所述病毒复制蛋白包括rep78、rep68、rep52和rep40,以及编码上述蛋白的序列。这些片段可以方便地用在各种载体系统和宿主细胞中。
这些片段可以单独应用、和其他AAV血清型序列或片段联合应用、或和源自其他AAV或非AAV病毒序列的元件联合应用。如本文所用,人工AAV血清型包括但不限于含有非天然存在的衣壳蛋白的AAV。该人工衣壳可以用任何合适的技术生成,如使用选定的AAV序列(例如一段vp1衣壳蛋白的片段)联合异质序列,所述异质序列可取自另一个选定的AAV血清型、同一AAV血清型上的非连续部分、取自非AAV病毒来源、或取自非病毒来源。人工AAV血清型可以是但不限于嵌合AAV衣壳、重组AAV衣壳或“人源化的”AAV衣壳。其中示例性的AAV或人工AAV包括AAV2/8(US7,282,199)、AAV2/5(可从美国国立卫生研究院获得)、AAV2/9(WO2005/033321)、AAV2/6(US 6,156,303)、AAVrh8(WO2003/042397)、和rh74-AAV9(EP18305399.0)。在一实施方式中,本发明的组合物和方法中可用的载体至少包括编码选定的AAV血清型衣壳的序列,例如AAV8衣壳或其片段。
在另一实施方式中,可用的载体至少包括编码选定的AAV血清型病毒复制相关蛋白,例如AAV8病毒复制相关蛋白或其片段。可选地,这些载体可同时包含AAV衣壳蛋白和病毒复制相关蛋白。在同时包含AAV衣壳蛋白和病毒复制相关蛋白的载体中,所述AAV病毒复制相关序列和衣壳序列可以是来自同一血清型的,例如都来自于AAV8.或者,所用的载体的病毒相关复制序列和衣壳序列来自不同的AAV血清型。在一实施方式中,所述病毒复制相关序列和衣壳序列可以由不同来源表达(例如,不同载体,或宿主细胞和载体)。在另一实施方式中,所述病毒复制相关序列在框架中与另一AAV血清型的衣壳序列相融合,以构建嵌合AAV载体,例如AAV2/8(US 7,282,199)。
在一实施方式中,所述第一和第二AAV载体各自是具有AAV来源衣壳的AAV载体,例如AAV1、AAV2、AAV2变体、AAV3、AAV3变体、AAV3B、AAV3B变体、AAV4、AAV5、AAV6、AAV6变体、AAV7、AAV8、AAV9、AAV2G9、AAV10(例如AAVcy10和AAVrh10)、AAVrh74、AAVdj、AAV-Anc80、AAV-LK03、AAV2i8、和猪AAV,例如AAVpo4和AAVpo6蛋白衣壳,或是具有嵌合衣壳的AAV载体,优选地具有AAV8、AAV9、AAVrh74、AAV2i8衣壳或AAV9-rh74嵌合衣壳,更优选地具有AAV8、AAV9或AAVrh74衣壳,进一步优选地具有AAV8或AAV9衣壳。
在本发明使用的AAV载体中,AAV基因组优选为单链核酸。
如本领域所知的,重组病毒颗粒可以通过,例如293HEK细胞三重转染、单纯疱疹病毒系统和通过巴库洛病毒系统或使用特定的细胞系得到。优选地,所述病毒颗粒通过293HEK细胞三重转染得到。
载体滴度通常表示为每毫升病毒基因组数(vg/ml)。
本发明还涉及如上文所述的使用双AAV载体系统转导的细胞,尤其是肌肉细胞。
在另一实施方式中,本发明涉及一种组合物,优选为治疗组合物或药物,包括上文所述的双AAV载体系统和可能的其他活性分子(其他基因治疗产品、化学分子、肽、蛋白、……),用于治疗同一疾病或其他疾病。
本发明还提供了药物组合物,所述药物组合物包括双AAV载体系统或该系统的第一或第二AAV载体。所述药物组合物包括治疗有效量的治疗剂(本发明的核酸或载体),和药学上可接受的载体。在特定的实施方式中,术语“药学上可接受的”指经联邦或州政府监管机构批准,或在美国或欧洲药典或其他公认药典中列出的用于动物和人类的药物。术语“载体”指与药剂一同施用的稀释剂、佐剂、敷料、或赋形剂。所述药学载体可以是无菌液体,如水和油,包括石油、动物、植物或合成来源的油,例如花生油、豆油、矿物油、芝麻油等。当所述药物组合物用于静脉注射时,优选的载体为水。盐水溶液和葡萄糖和甘油溶液也可以用作液体载体,尤其是用于注射溶液中。合适的药用辅料包括淀粉、葡萄糖、乳糖、蔗糖、硬脂酸钠、单硬脂酸甘油酯、滑石、氯化钠、脱脂奶粉、甘油、丙二醇、水、乙醇等。
如有需要,所述组合物还可含有少量润湿剂或乳化剂,或pH缓冲剂。所述组合物可以为溶液、悬浮液、乳液、缓释制剂等剂型。E.W.Martin的“雷明顿药学技术(Remington'sPharmaceutical Sciences)”中描述了合适的药物载体的示例。所述组合物将包含治疗有效量的治疗用药物,优选为纯化形式,以及合适的量的载体,以便提供适合向受试者给药的形式。
在优选的实施方式中,根据常规方法将该组合物配制成适用于人体静脉注射的药物组合物。通常,用于静脉给药的组合物是无菌等渗水缓冲液中的溶液。必要时,该组合物还可包括溶解剂和局部麻醉剂(例如利多卡因),以缓解注射部位的疼痛。
在一实施方式中,本发明的组合物适合被施用于人。
所述组合物优选为液体剂型,优选为盐水组合物,更优选为磷酸盐缓冲盐水(PBS)组合物或乳酸林格溶液。
可通过标准临床技术确定本发明的治疗剂(即核酸或载体)在治疗肌营养不良疾病中的有效量。此外,体内和/或体外分析可以选择性地用于辅助预测最佳剂量范围。配方中的精确使用量还将取决于给药途径、体重和疾病的严重程度,并应根据医师的判断和每位患者的情况来决定。
上述的AAV载体或组合物可以用作药物,尤其是用作基因治疗产品,用于向有需要的受试者施用。本发明的另一方面涉及上述公开的AAV载体或组合物用于制备药物的用途。
合适的给药应能够将治疗有效量的基因治疗产品输送到靶组织,尤其是骨骼肌,可能包括膈肌和心脏。在本发明中,当基因治疗产品是包含编码拟抗肌萎缩蛋白的核酸序列的病毒载体时,治疗剂量定义为受试者每千克体重(kg)施用的含有拟抗肌萎缩蛋白序列的病毒颗粒(vg意为病毒基因组)的量。
可用的给药途径有局部给药、肠内给药(产生系统性作用,但通过胃肠道给药)或肠外给药(产生系统性作用,但通过胃肠道以外的途径给药)。本文公开的组合物的优选给药途径为肠外给药,包括肌肉给药(即进入肌肉)和全身给药(即进入循环系统)。基于此,术语“注射”(或“灌注”或“输注”)包括血管内,尤其是静脉内(IV)和肌肉内(IM)给药。注射通常使用注射器或输液管进行。
在一实施方式中,所述组合物的系统递送包括在局部治疗部位附近施用该组合物,即在衰弱的肌肉附近的静脉或动脉中施用。在某些实施方式中,本发明包括局部递送组合物,所述组合物能产生系统性效应。这种给药途径通常被称为“局部输注”、“孤立肢体灌注给药”或“经静脉高压肢体灌注给药”,已成功地用作肌营养不良症的基因递送方法(Zheng Fan等(2012,Molecular Therapy 20(2),456-461)。
本发明的优选给药方法是全身给药。全身注射开辟了通向全身的注射通路,能够到达受试者全身的肌肉,包括心脏和膈膜,然后对这些全身性的和仍然无法治愈的疾病产生实际的治疗。在某些实施方式中,系统性递送包括将组合物输送给受试者,使得组合物可以到达受试者全身。
在优选的实施方式中,全身给药,包括子宫内给药,是通过在血管内注射组合物实现的,即血管内(静脉内或动脉内)给药。
在一实施方式中,经由外周静脉通过静脉注射施用该组合物。或者,通过肌肉注射进行全身给药。
当进行系统性递送时,含有本发明双AAV载体系统的组合物优选以小于等于1015vg/kg或甚至1014vg/kg的剂量施用,优选地在1012vg/kg到1014vg/kg之间,更优选地在5.1012vg/kg到1014vg/kg之间,例如1、2、3、4、5、6、7、8或9.1013vg/kg。为了避免潜在毒性和/或免疫反应,剂量还可以为例如1、2、3、4、5、6、7、8或9.1012vg/kg或更低。如技术人员所知,在保证获得满意效率的前提下,优选尽可能低的剂量。
在特定的实施方式中,治疗包括单次施用该组合物。
在一实施方式中,会在1个月,或3个月或6个月,或甚至1年、2年、5年、10年,或甚至更长时间内观察AAV载体的存在和/或拟抗肌萎缩蛋白的表达以及相关的治疗获益。
在本发明中,受试者优选为人类,但也可以是小鼠、大鼠、非人灵长类或狗。
本发明中的“肌营养不良疾病”指与抗肌萎缩蛋白基因缺陷相关的疾病。该缺陷可能是基因缺失或突变导致表达水平降低或表达缺失、在开放阅读框中引入过早的终止密码子,或产生失活的蛋白。肌营养不良疾病优选为由抗肌萎缩蛋白基因突变引起的杜氏和贝克型肌营养不良症(DMD/BMD)。所述突变可导致抗肌萎缩蛋白表达缺失或低水平表达,或产生部分或完全失活的、可能被截短的蛋白质。
可从本发明的组合物获益的受试者包括被诊断患有肌营养不良症或有可能发展为肌营养不良症的所有患者。可通过本领域技术人员已知的任何方法,基于对抗肌萎缩蛋白基因中的突变或缺失的识别,选择待治疗的受试者,包括例如对抗肌萎缩蛋白基因进行测序,和/或通过本领域技术人员已知的任何方法评估抗肌萎缩蛋白的表达水平或活性。因此,所述受试者既包括已经表现出营养不良疾病症状的受试者,也包括具有发展为所述疾病风险的受试者。在一实施方式中,所述受试者包括已经表现出肌营养不良疾病症状的受试者。在另一实施方式中,所述受试者为门诊患者和早期非门诊患者。
在一实施方式中,本发明涉及如上所述的双AAV载体系统或包含所述AAV载体的用于治疗肌营养不良疾病的组合物。在另一实施方式中,本发明涉及上述的AAV载体或包含所述AAV载体的组合物的用途,所述用途为用于制备用于治疗营养不良疾病的药物。
换言之,本发明提供了一种治疗受试者肌营养不良疾病的方法,包括向受试者施用上述的双AAV载体系统或包含所述系统的组合物。
此类双AAV载体系统和包含所述系统的组合物主要用于基因治疗,特别是用于治疗患有杜氏肌营养不良症(DMD)或贝克型肌营养不良症(BMD)的受试者。首要治疗目的是提供安全(无毒)的治疗。进一步的目的是提供有效的治疗方法,以延迟、减缓或预防疾病的发展,并可能改善在临床上易于观测的患者表型。本发明的AAV载体和组合物可用于受试者的以下方面:
-改善肌肉功能,尤其是骨骼肌,同时也包括心肌和膈膜;
-改善步态;
-改善心功能;
-改善呼吸功能;
-延长生存期,更广义地为提高生活质量和预期寿命。
除非另有说明,否则本发明的操作采用分子生物学(包括重组技术)、微生物学、细胞生物学、生物化学和免疫学的常规技术,这些技术完全在本领域技术人员的权限内。这些技术在文献中已充分介绍,例如《分子克隆:实验室手册(Molecular Cloning:ALaboratoryManual)》第四版(Sambrook,2012);《寡核苷酸合成(Oligonucleotide Synthesis)》(Gait,1984);《动物细胞培养(Culture of Animal Cells)》(Freshney,2010);《酶学实验方法(Methods in Enzymology)》;《实验免疫学手册(Handbook of ExperimentalImmunology)》(Weir,1997);《哺乳动物细胞的基因转移载体(Gene Transfer Vectors forMammalian Cells)》(Miller and Calos,1987);《精编分子生物学实验指南(ShortProtocols in Molecular Biology)》(Ausubel,2002);《聚合酶链反应:原理、应用和故障排除(Polymerase Chain Reaction:Principles,Applications and Troubleshooting)》(Babar,2011);《现行免疫学指南(Current Protocols in Immunology)》(Coligan,2002)。这些技术适用于生产本发明的多核苷酸和多肽,因此,在制造和实施本发明时可以使用这些技术。下面将讨论在特定实施例中尤其适用的技术。
本文引用的每项专利、专利申请和出版物的公开内容整体通过引用并入本文。
在不作进一步描述的情况下,相信本领域的普通技术人员可以根据上文的描述和下面的示例性实施例来制备和利用本发明的化合物并实施所要求保护的方法。
实施例
下面通过参考示例性实施例和附图进一步详细描述本发明。
这些示例仅供说明之用,并非构成对本发明的限制。
附图简述
图1:不同形式抗肌萎缩蛋白图谱:
A/全长抗肌萎缩蛋白图谱;
B/Yue等公开的MD1微抗肌萎缩蛋白图谱;
C/Kodippili公开的ΔH2-R15拟抗肌萎缩蛋白(SEQ ID NO:1)(DD);
D/本发明的ΔR4-R7ΔR10-R15ΔR18-R19拟抗肌萎缩蛋白(SEQ ID NO:2)(WL1);
E/本发明的ΔR4-R7ΔR10-R13ΔR18-R21拟抗肌萎缩蛋白(SEQ ID NO:3)(WL2);
图2:不同形式抗肌萎缩蛋白的生产水平分析:
A/抗-抗肌萎缩蛋白抗体蛋白质印迹分析
给3月龄mdx小鼠的胫骨前肌(TA肌)注射1x1010vg AAV。Ta肌在注射后30天恢复,并进行蛋白质印迹分析检测抗肌萎缩蛋白抗体(DysB)。6至8通道(Opt)显示了使用本发明的双AAV载体系统获得的蛋白质WL1水平(280kDa),与之相比的有分别使用AAV2/9-hMD1(通道9:138kDa)、人类对照样品(FL:通道2:427kDa)和mdx(KO:通道3和通道4)获得的蛋白质WL1水平。来自匹配年龄段的mdx小鼠和C57BL/10小鼠的的对比肌肉只注射了盐水。
B和C/给1-2月龄WT小鼠的TA肌注射1-3x1010vg AAV。在注射后30天收集TA肌,使用抗抗肌萎缩蛋白抗体(DysB和Dys2)和抗α-辅肌动蛋白水平(α-辅肌动蛋白水平用作标准参照)进行蛋白质印迹分析。不同结构表达的蛋白水平比较:
-WL1 wt(由SEQ ID NO:8+9所示序列表达):上半印迹中通道1到3;
-WL2 wt(由SEQ ID NO:13+14所示序列表达):上半印迹中通道4到6;
-WL1 Opt(由SEQ ID NO:6+7所示序列表达):上半印迹中通道7到9;
-PBS:下半印迹中通道10;
-DD(由SEQ ID NO:18+19所示序列表达):下半印迹中通道1到3;
-WL1 Opt(由SEQ ID NO:6+7所示序列表达):下半印迹中通道4到6;
-PBS:下半印迹中通道7。
图3:1月龄的Dba2_Mdx小鼠静脉注射以下药物后,评估TA肌中抗肌萎缩蛋白的表达和分布:
-含有SEQ ID NO:6所示序列的5’AAV载体和含有SEQ ID NO:7所示序列的3’AAV载体(注射7只小鼠;n=3只WB中的代表性小鼠);
-仅含有SEQ ID NO:6所示序列的5’AAV载体(注射2只小鼠;n=1只WB中的代表性小鼠);
-仅含有SEQ ID NO:7所示序列的3’AAV载体(注射2只小鼠;n=1只WB中的代表性小鼠);
-无载体,即PBS(注射5只小鼠;n=2只WB中的代表性小鼠)
Ta肌在注射后70天恢复,并进行蛋白质印迹分析和冰冻切片。
A/使用抗抗肌萎缩蛋白抗体(Dys N-ter和DysC-ter)和α-肌动蛋白抗体进行蛋白质印迹(WB)分析。预期尺寸=280kDa;全长抗肌萎缩蛋白=427kDa。
B/用Dys N-ter抗体在冰冻切片中进行免疫标记。
每张图片上都列出了阳性纤维的百分比。
图4:抗肌萎缩蛋白对TA肌肉的疗效评价如图3所示
A/切片用苏木精、荧光桃红、藏红花色素(HPS)染色
B/TMEM8C(左上)、CD11b(右上)和纤维连接蛋白(下)的表达水平
图5:使用抗抗肌萎缩蛋白抗体(Dys N-ter和Dys C-ter)和α-肌动蛋白抗体进行蛋白质印迹检测,评估如图3中治疗的小鼠心脏中抗肌萎缩蛋白的表达
图6:评估TA(胫骨前肌)和DIA(膈肌)中抗肌萎缩蛋白的表达Dba2_Mdx小鼠静脉注射下列之一:含有SEQ ID NO:6所示序列的5’AAV载体和含有SEQ ID NO:7所示序列的3’AAV载体(5’mut+3’或双Dys mut);注射5只小鼠;
含有SEQ ID NO:23所示序列的5’AAV载体和含有SEQ ID NO:7所示序列的3’AAV载体(5’cor+3’或双Dys cor);注射5只小鼠
实验中包括未受注射(NI)的Dba2_Mdx小鼠(n=3)和Dba2_WT小鼠用作对照。
Ta和DA肌在注射后3周恢复,并进行蛋白质印迹分析和冰冻切片。
A/使用抗抗肌萎缩蛋白抗体(Dys2)和α-肌动蛋白抗体(用于使样品正常化)进行蛋白质印迹(WB)分析。预期尺寸=280kDa;全长抗肌萎缩蛋白=427kDa,L=ladder;HL=高分子量ladder,T+=阳性对照。
B/用Dys2抗体在冰冻切片中进行免疫标记。每张图片上都列出了阳性纤维的百分比。
图7:抗肌萎缩蛋白对肌肉的疗效评价如图6所示:TMEM8C(左上)、CD11b(右上)和纤维连接蛋白(下)的表达水平。
材料和方法
体内基因转移本研究采用不同的小鼠模型:野生型C57BL/10和C57BL/6J;mdx和Dba_2Mdx(D2.B10 mdx/J)。所有小鼠实验均按照伦理委员会批准的方案在埃夫里动物研究所CERFE内在适当的生物学控制下进行。
采用三种质粒构建方案制备腺相关病毒载体。
处死小鼠后,收集组织(TA肌肉和心脏),在液氮冷却的异戊烷中快速冰冻,并储存在-80℃。
蛋白质印迹实验从组织样品中提取总蛋白。蛋白质提取物通过NuPAGETM凝胶分离,然后转移到硝酸纤维素膜上。然后用奥德赛封闭缓冲液和PBS封闭膜,然后与适当的抗体(抗抗肌萎缩蛋白DysB或Dys2抗体、兔α-肌动蛋白(Life Technologies))和二级抗小鼠或抗兔结合(680或800)抗体杂交。
免疫组化使用Mouse on Mouse(M.O.M)试剂盒(Vector Labs)对TA肌肉冰冻切片进行染色。
一级抗体在4℃下培养过夜,然后用PBS-0.1%吐温洗3次,并与山羊抗小鼠或山羊抗兔二级抗体Alexa 594(Life Technologies)一起培养。
使用抗抗肌萎缩蛋白抗体(Dys2,1:100,小鼠单克隆,Novocastra)。
HPS染色使用苏木精(核染色)、福禄考(细胞质染色)和藏红花(胶原)对TA肌肉冰冻切片进行染色。
抗肌萎缩蛋白基因的PCR定量分析通过RT-定量PCR(RTqPCR)对收集自冰冻切片的cDNA进行定量检测,检测TMEM8C(表示纤维再生/TGE Mm00481256_m1,ThermoFisher)、CD11b(表示炎症/TGE Mm00434455_m1)和纤维连接蛋白(表示纤维化/TGE Mm01256744_m1)的表达。
结果:
1)AAV载体的构建
构建不同的重组AAV2/8或AAV2/9载体以评估其产生抗肌萎缩蛋白的相对效率:
表1检测的抗肌萎缩蛋白
Figure BDA0003282572610000231
2)肌内(IM)给药后抗肌萎缩蛋白基因分析
为了评估本发明的双载体系统产生的蛋白质的肌肉转导和表达水平,对载体进行体内分析。
1月龄的抗肌萎缩蛋白缺陷(mdx)小鼠接受1e10 vg AAV载体肌肉注射。胫骨前肌(Tibialis anterior,TA)肌肉在注射后30天恢复,并进行蛋白质印迹分析检测抗肌萎缩蛋白抗体(DysB)(图2A)6至8通道(Opt)显示了使用本发明的双AAV载体系统获得的蛋白质WL1OPT(280kDa)水平,与之相比的有分别使用AAV2/9-hMD1(通道9:138kDa)、人抗肌萎缩蛋白(FL:通道2:427kDa)和mdx(KO:通道3和通道4)获得的蛋白质WL1水平。来自匹配年龄段的mdx小鼠和C57BL/10小鼠的的对比肌肉只注射了盐水。如上所示,这些数据证实基于双AAV载体系统可以有效地产生本发明的截短型抗肌萎缩蛋白。
在进一步的实验中评估了不同的双AAV载体系统。1月龄的wt小鼠接受1-3e10 vgAAV载体肌肉注射。胫骨前肌(TA)在注射后30天恢复,并用抗肌营养不良蛋白(Dys2和DysB)和-肌动蛋白抗体进行蛋白质印迹检测。图2B和C显示了使用StudioLight软件用-肌动蛋白标准化后获得的蛋白质定量水平。
图2B所示的结果表明,本发明的拟抗肌萎缩蛋白(WL1 Opt)产生的水平显著高于WL1 wt和WL2 wt,甚至优于根据现有技术(DD)的构建体产生的拟抗肌萎缩蛋白。
因此,使用被认为最有希望用于功能评估的所述构建物进行了进一步的实验。
3)血管内(IV)注射后抗肌萎缩蛋白基因分析
A/使用WL1 Opt
在进行了IM实验后,选择WL1 Opt进行血管内(IV)注射实验。4周龄Dba_2Mdx小鼠接受血管内注射(5e11vg/小鼠)给药(5'+3'组n=7;5'组n=2;3'组n=2;PBS组n=5)。在基因转移后70天进行终点分析。
3-1肌肉内
在静脉注射后检测基于本发明的双AAV载体系统的拟抗肌萎缩蛋白生成(图3A)。
TA肌肉切片的免疫组化分析(图3B)显示了膜上的抗肌萎缩蛋白标记,并确认了本发明的拟抗肌萎缩蛋白的正确表达,尽管小鼠之间在表达水平上存在一定的异质性。
为了评估通过静脉注射给药的本发明的拟抗肌萎缩蛋白的治疗效果,用HPS对TA切片进行染色,这能够揭示组织的大体状态,尤其是炎症、纤维再生、纤维化。与wt和mdx小鼠相比,表达本发明拟抗肌萎缩蛋白的缺陷小鼠表现的形态居于两者之间(图4A)。这一观察结果通过相关标记物的测量得到证实(见图4B)。对于每项指标,可以观察到受治疗动物的水平有所改善,位于wt和mdx小鼠中观察到的水平之间。
3-2心肌内
除了如上所述对肌肉的有益作用外,本发明的转基因拟抗肌萎缩蛋白也在接受注射的小鼠心脏中表达(见图5)。
B/使用WL1 Cor
已经注意到WL1密码子优化导致WL1 ORF中的两个突变(R49S和F748S)。因此,5’AAV载体(序列ID编号6)中的相应序列已被校正(790位置的AGC转换为AGG,2887位置的TCT转换为TTC)。
然后使用两种结构重复实验:对应于WL1 cor(SEQ ID NO:22)的新(cor)双Dys载体系统(突变校正;5'cor+3')和对应于WL1 Opt(SEQ ID NO:5)的旧(mut)双Dys载体系统(5’mut+3',n=5)。
如图6A/6B所示,即使在静脉注射后的早期(3周),在胫骨前肌和膈肌中,双AAV载体系统也能观察到一定量的拟抗肌萎缩蛋白生成。
就表达水平而言,新的(cor)双Dys载体系统(突变校正;5'cor+3'),即编码天然拟抗肌萎缩蛋白的优化版本似乎是有前景的。
关于生成的拟抗肌萎缩蛋白的治疗效果,图7证实了这两种结构产生了受治疗小鼠的相似特征,即再生、炎症和纤维化相关标记物的改善。
序列表
<110> 吉尼松公司
INSERM(国家健康和医学研究院)
埃夫里-瓦尔德艾松大学
<120> 使用重叠的AAV载体生产大型拟抗肌萎缩蛋白
<130> G143-B-53376 PCT
<150> EP19305377
<151> 2019-03-25
<160> 23
<170> BiSSAP 1.3.6
<210> 1
<211> 2361
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> DD
<400> 1
Met Leu Trp Trp Glu Glu Val Glu Asp Cys Tyr Glu Arg Glu Asp Val
1 5 10 15
Gln Lys Lys Thr Phe Thr Lys Trp Val Asn Ala Gln Phe Ser Lys Phe
20 25 30
Gly Lys Gln His Ile Glu Asn Leu Phe Ser Asp Leu Gln Asp Gly Arg
35 40 45
Arg Leu Leu Asp Leu Leu Glu Gly Leu Thr Gly Gln Lys Leu Pro Lys
50 55 60
Glu Lys Gly Ser Thr Arg Val His Ala Leu Asn Asn Val Asn Lys Ala
65 70 75 80
Leu Arg Val Leu Gln Asn Asn Asn Val Asp Leu Val Asn Ile Gly Ser
85 90 95
Thr Asp Ile Val Asp Gly Asn His Lys Leu Thr Leu Gly Leu Ile Trp
100 105 110
Asn Ile Ile Leu His Trp Gln Val Lys Asn Val Met Lys Asn Ile Met
115 120 125
Ala Gly Leu Gln Gln Thr Asn Ser Glu Lys Ile Leu Leu Ser Trp Val
130 135 140
Arg Gln Ser Thr Arg Asn Tyr Pro Gln Val Asn Val Ile Asn Phe Thr
145 150 155 160
Thr Ser Trp Ser Asp Gly Leu Ala Leu Asn Ala Leu Ile His Ser His
165 170 175
Arg Pro Asp Leu Phe Asp Trp Asn Ser Val Val Cys Gln Gln Ser Ala
180 185 190
Thr Gln Arg Leu Glu His Ala Phe Asn Ile Ala Arg Tyr Gln Leu Gly
195 200 205
Ile Glu Lys Leu Leu Asp Pro Glu Asp Val Asp Thr Thr Tyr Pro Asp
210 215 220
Lys Lys Ser Ile Leu Met Tyr Ile Thr Ser Leu Phe Gln Val Leu Pro
225 230 235 240
Gln Gln Val Ser Ile Glu Ala Ile Gln Glu Val Glu Met Leu Pro Arg
245 250 255
Pro Pro Lys Val Thr Lys Glu Glu His Phe Gln Leu His His Gln Met
260 265 270
His Tyr Ser Gln Gln Ile Thr Val Ser Leu Ala Gln Gly Tyr Glu Arg
275 280 285
Thr Ser Ser Pro Lys Pro Arg Phe Lys Ser Tyr Ala Tyr Thr Gln Ala
290 295 300
Ala Tyr Val Thr Thr Ser Asp Pro Thr Arg Ser Pro Phe Pro Ser Gln
305 310 315 320
His Leu Glu Ala Pro Glu Asp Lys Ser Phe Gly Ser Ser Leu Met Glu
325 330 335
Ser Glu Val Asn Leu Asp Arg Tyr Gln Thr Ala Leu Glu Glu Val Leu
340 345 350
Ser Trp Leu Leu Ser Ala Glu Asp Thr Leu Gln Ala Gln Gly Glu Ile
355 360 365
Ser Asn Asp Val Glu Val Val Lys Asp Gln Phe His Thr His Glu Gly
370 375 380
Tyr Met Met Asp Leu Thr Ala His Gln Gly Arg Val Gly Asn Ile Leu
385 390 395 400
Gln Leu Gly Ser Lys Leu Ile Gly Thr Gly Lys Leu Ser Glu Asp Glu
405 410 415
Glu Thr Glu Val Gln Glu Gln Met Asn Leu Leu Asn Ser Arg Trp Glu
420 425 430
Cys Leu Arg Val Ala Ser Met Glu Lys Gln Ser Asn Leu His Arg Val
435 440 445
Leu Met Asp Leu Gln Asn Gln Lys Leu Lys Glu Leu Asn Asp Trp Leu
450 455 460
Thr Lys Thr Glu Glu Arg Thr Arg Lys Met Glu Glu Glu Pro Leu Gly
465 470 475 480
Pro Asp Leu Glu Asp Leu Lys Arg Gln Val Gln Gln His Lys Val Leu
485 490 495
Gln Glu Asp Leu Glu Gln Glu Gln Val Arg Val Asn Ser Leu Thr His
500 505 510
Met Val Val Val Val Asp Glu Ser Ser Gly Asp His Ala Thr Ala Ala
515 520 525
Leu Glu Glu Gln Leu Lys Val Leu Gly Asp Arg Trp Ala Asn Ile Cys
530 535 540
Arg Trp Thr Glu Asp Arg Trp Val Leu Leu Gln Asp Ile Leu Leu Lys
545 550 555 560
Trp Gln Arg Leu Thr Glu Glu Gln Cys Leu Phe Ser Ala Trp Leu Ser
565 570 575
Glu Lys Glu Asp Ala Val Asn Lys Ile His Thr Thr Gly Phe Lys Asp
580 585 590
Gln Asn Glu Met Leu Ser Ser Leu Gln Lys Leu Ala Val Leu Lys Ala
595 600 605
Asp Leu Glu Lys Lys Lys Gln Ser Met Gly Lys Leu Tyr Ser Leu Lys
610 615 620
Gln Asp Leu Leu Ser Thr Leu Lys Asn Lys Ser Val Thr Gln Lys Thr
625 630 635 640
Glu Ala Trp Leu Asp Asn Phe Ala Arg Cys Trp Asp Asn Leu Val Gln
645 650 655
Lys Leu Glu Lys Ser Thr Ala Gln Ile Ser Gln Glu Ile Ser Tyr Val
660 665 670
Pro Ser Thr Tyr Leu Thr Glu Ile Thr His Val Ser Gln Ala Leu Leu
675 680 685
Glu Val Glu Gln Leu Leu Asn Ala Pro Asp Leu Cys Ala Lys Asp Phe
690 695 700
Glu Asp Leu Phe Lys Gln Glu Glu Ser Leu Lys Asn Ile Lys Asp Ser
705 710 715 720
Leu Gln Gln Ser Ser Gly Arg Ile Asp Ile Ile His Ser Lys Lys Thr
725 730 735
Ala Ala Leu Gln Ser Ala Thr Pro Val Glu Arg Val Lys Leu Gln Glu
740 745 750
Ala Leu Ser Gln Leu Asp Phe Gln Trp Glu Lys Val Asn Lys Met Tyr
755 760 765
Lys Asp Arg Gln Gly Arg Phe Asp Arg Ser Val Glu Lys Trp Arg Arg
770 775 780
Phe His Tyr Asp Ile Lys Ile Phe Asn Gln Trp Leu Thr Glu Ala Glu
785 790 795 800
Gln Phe Leu Arg Lys Thr Gln Ile Pro Glu Asn Trp Glu His Ala Lys
805 810 815
Tyr Lys Trp Tyr Leu Lys Glu Leu Gln Asp Gly Ile Gly Gln Arg Gln
820 825 830
Thr Val Val Arg Thr Leu Asn Ala Thr Gly Glu Glu Ile Ile Gln Gln
835 840 845
Ser Ser Lys Thr Asp Ala Ser Ile Leu Gln Glu Lys Leu Gly Ser Leu
850 855 860
Asn Leu Arg Trp Gln Glu Val Cys Lys Gln Leu Ser Asp Arg Lys Lys
865 870 875 880
Arg Leu Glu Glu Gln Lys Asn Ile Leu Ser Glu Phe Gln Arg Asp Leu
885 890 895
Asn Glu Phe Val Leu Trp Leu Glu Glu Ala Asp Asn Ile Ala Ser Ile
900 905 910
Pro Leu Glu Pro Gly Lys Glu Gln Gln Leu Lys Glu Lys Leu Glu Gln
915 920 925
Val Lys Leu Leu Val Glu Glu Leu Pro Leu Arg Gln Gly Ile Leu Lys
930 935 940
Gln Leu Asn Glu Thr Gly Gly Pro Val Leu Val Ser Ala Pro Ile Ser
945 950 955 960
Pro Glu Glu Gln Asp Lys Leu Glu Asn Lys Leu Lys Gln Thr Asn Leu
965 970 975
Gln Trp Ile Lys Val Ser Arg Ala Leu Pro Glu Lys Gln Gly Glu Ile
980 985 990
Glu Ala Gln Ile Lys Asp Leu Gly Gln Leu Glu Lys Lys Leu Glu Asp
995 1000 1005
Leu Glu Glu Gln Leu Asn His Leu Leu Leu Trp Leu Ser Pro Ile Arg
1010 1015 1020
Asn Gln Leu Glu Ile Tyr Asn Gln Pro Asn Gln Glu Gly Pro Phe Asp
1025 1030 1035 1040
Val Gln Glu Thr Glu Ile Ala Val Gln Ala Lys Gln Pro Asp Val Glu
1045 1050 1055
Glu Ile Leu Ser Lys Gly Gln His Leu Tyr Lys Glu Lys Pro Ala Thr
1060 1065 1070
Gln Pro Val Lys Arg Lys Leu Glu Asp Leu Ser Ser Glu Trp Lys Ala
1075 1080 1085
Val Asn Arg Leu Leu Gln Glu Leu Arg Ala Lys Gln Pro Asp Leu Ala
1090 1095 1100
Pro Gly Leu Thr Thr Ile Gly Ala Ser Pro Thr Gln Thr Val Thr Leu
1105 1110 1115 1120
Val Thr Gln Pro Val Val Thr Lys Glu Thr Ala Ile Ser Lys Leu Glu
1125 1130 1135
Met Pro Ser Ser Leu Met Leu Glu Val Pro Ala Leu Ala Asp Phe Asn
1140 1145 1150
Arg Ala Trp Thr Glu Leu Thr Asp Trp Leu Ser Leu Leu Asp Gln Val
1155 1160 1165
Ile Lys Ser Gln Arg Val Met Val Gly Asp Leu Glu Asp Ile Asn Glu
1170 1175 1180
Met Ile Ile Lys Gln Lys Ala Thr Met Gln Asp Leu Glu Gln Arg Arg
1185 1190 1195 1200
Pro Gln Leu Glu Glu Leu Ile Thr Ala Ala Gln Asn Leu Lys Asn Lys
1205 1210 1215
Thr Ser Asn Gln Glu Ala Arg Thr Ile Ile Thr Asp Arg Ile Glu Arg
1220 1225 1230
Ile Gln Asn Gln Trp Asp Glu Val Gln Glu His Leu Gln Asn Arg Arg
1235 1240 1245
Gln Gln Leu Asn Glu Met Leu Lys Asp Ser Thr Gln Trp Leu Glu Ala
1250 1255 1260
Lys Glu Glu Ala Glu Gln Val Leu Gly Gln Ala Arg Ala Lys Leu Glu
1265 1270 1275 1280
Ser Trp Lys Glu Gly Pro Tyr Thr Val Asp Ala Ile Gln Lys Lys Ile
1285 1290 1295
Thr Glu Thr Lys Gln Leu Ala Lys Asp Leu Arg Gln Trp Gln Thr Asn
1300 1305 1310
Val Asp Val Ala Asn Asp Leu Ala Leu Lys Leu Leu Arg Asp Tyr Ser
1315 1320 1325
Ala Asp Asp Thr Arg Lys Val His Met Ile Thr Glu Asn Ile Asn Ala
1330 1335 1340
Ser Trp Arg Ser Ile His Lys Arg Val Ser Glu Arg Glu Ala Ala Leu
1345 1350 1355 1360
Glu Glu Thr His Arg Leu Leu Gln Gln Phe Pro Leu Asp Leu Glu Lys
1365 1370 1375
Phe Leu Ala Trp Leu Thr Glu Ala Glu Thr Thr Ala Asn Val Leu Gln
1380 1385 1390
Asp Ala Thr Arg Lys Glu Arg Leu Leu Glu Asp Ser Lys Gly Val Lys
1395 1400 1405
Glu Leu Met Lys Gln Trp Gln Asp Leu Gln Gly Glu Ile Glu Ala His
1410 1415 1420
Thr Asp Val Tyr His Asn Leu Asp Glu Asn Ser Gln Lys Ile Leu Arg
1425 1430 1435 1440
Ser Leu Glu Gly Ser Asp Asp Ala Val Leu Leu Gln Arg Arg Leu Asp
1445 1450 1455
Asn Met Asn Phe Lys Trp Ser Glu Leu Arg Lys Lys Ser Leu Asn Ile
1460 1465 1470
Arg Ser His Leu Glu Ala Ser Ser Asp Gln Trp Lys Arg Leu His Leu
1475 1480 1485
Ser Leu Gln Glu Leu Leu Val Trp Leu Gln Leu Lys Asp Asp Glu Leu
1490 1495 1500
Ser Arg Gln Ala Pro Ile Gly Gly Asp Phe Pro Ala Val Gln Lys Gln
1505 1510 1515 1520
Asn Asp Val His Arg Ala Phe Lys Arg Glu Leu Lys Thr Lys Glu Pro
1525 1530 1535
Val Ile Met Ser Thr Leu Glu Thr Val Arg Ile Phe Leu Thr Glu Gln
1540 1545 1550
Pro Leu Glu Gly Leu Glu Lys Leu Tyr Gln Glu Pro Arg Glu Leu Pro
1555 1560 1565
Pro Glu Glu Arg Ala Gln Asn Val Thr Arg Leu Leu Arg Lys Gln Ala
1570 1575 1580
Glu Glu Val Asn Thr Glu Trp Glu Lys Leu Asn Leu His Ser Ala Asp
1585 1590 1595 1600
Trp Gln Arg Lys Ile Asp Glu Thr Leu Glu Arg Leu Gln Glu Leu Gln
1605 1610 1615
Glu Ala Thr Asp Glu Leu Asp Leu Lys Leu Arg Gln Ala Glu Val Ile
1620 1625 1630
Lys Gly Ser Trp Gln Pro Val Gly Asp Leu Leu Ile Asp Ser Leu Gln
1635 1640 1645
Asp His Leu Glu Lys Val Lys Ala Leu Arg Gly Glu Ile Ala Pro Leu
1650 1655 1660
Lys Glu Asn Val Ser His Val Asn Asp Leu Ala Arg Gln Leu Thr Thr
1665 1670 1675 1680
Leu Gly Ile Gln Leu Ser Pro Tyr Asn Leu Ser Thr Leu Glu Asp Leu
1685 1690 1695
Asn Thr Arg Trp Lys Leu Leu Gln Val Ala Val Glu Asp Arg Val Arg
1700 1705 1710
Gln Leu His Glu Ala His Arg Asp Phe Gly Pro Ala Ser Gln His Phe
1715 1720 1725
Leu Ser Thr Ser Val Gln Gly Pro Trp Glu Arg Ala Ile Ser Pro Asn
1730 1735 1740
Lys Val Pro Tyr Tyr Ile Asn His Glu Thr Gln Thr Thr Cys Trp Asp
1745 1750 1755 1760
His Pro Lys Met Thr Glu Leu Tyr Gln Ser Leu Ala Asp Leu Asn Asn
1765 1770 1775
Val Arg Phe Ser Ala Tyr Arg Thr Ala Met Lys Leu Arg Arg Leu Gln
1780 1785 1790
Lys Ala Leu Cys Leu Asp Leu Leu Ser Leu Ser Ala Ala Cys Asp Ala
1795 1800 1805
Leu Asp Gln His Asn Leu Lys Gln Asn Asp Gln Pro Met Asp Ile Leu
1810 1815 1820
Gln Ile Ile Asn Cys Leu Thr Thr Ile Tyr Asp Arg Leu Glu Gln Glu
1825 1830 1835 1840
His Asn Asn Leu Val Asn Val Pro Leu Cys Val Asp Met Cys Leu Asn
1845 1850 1855
Trp Leu Leu Asn Val Tyr Asp Thr Gly Arg Thr Gly Arg Ile Arg Val
1860 1865 1870
Leu Ser Phe Lys Thr Gly Ile Ile Ser Leu Cys Lys Ala His Leu Glu
1875 1880 1885
Asp Lys Tyr Arg Tyr Leu Phe Lys Gln Val Ala Ser Ser Thr Gly Phe
1890 1895 1900
Cys Asp Gln Arg Arg Leu Gly Leu Leu Leu His Asp Ser Ile Gln Ile
1905 1910 1915 1920
Pro Arg Gln Leu Gly Glu Val Ala Ser Phe Gly Gly Ser Asn Ile Glu
1925 1930 1935
Pro Ser Val Arg Ser Cys Phe Gln Phe Ala Asn Asn Lys Pro Glu Ile
1940 1945 1950
Glu Ala Ala Leu Phe Leu Asp Trp Met Arg Leu Glu Pro Gln Ser Met
1955 1960 1965
Val Trp Leu Pro Val Leu His Arg Val Ala Ala Ala Glu Thr Ala Lys
1970 1975 1980
His Gln Ala Lys Cys Asn Ile Cys Lys Glu Cys Pro Ile Ile Gly Phe
1985 1990 1995 2000
Arg Tyr Arg Ser Leu Lys His Phe Asn Tyr Asp Ile Cys Gln Ser Cys
2005 2010 2015
Phe Phe Ser Gly Arg Val Ala Lys Gly His Lys Met His Tyr Pro Met
2020 2025 2030
Val Glu Tyr Cys Thr Pro Thr Thr Ser Gly Glu Asp Val Arg Asp Phe
2035 2040 2045
Ala Lys Val Leu Lys Asn Lys Phe Arg Thr Lys Arg Tyr Phe Ala Lys
2050 2055 2060
His Pro Arg Met Gly Tyr Leu Pro Val Gln Thr Val Leu Glu Gly Asp
2065 2070 2075 2080
Asn Met Glu Thr Pro Val Thr Leu Ile Asn Phe Trp Pro Val Asp Ser
2085 2090 2095
Ala Pro Ala Ser Ser Pro Gln Leu Ser His Asp Asp Thr His Ser Arg
2100 2105 2110
Ile Glu His Tyr Ala Ser Arg Leu Ala Glu Met Glu Asn Ser Asn Gly
2115 2120 2125
Ser Tyr Leu Asn Asp Ser Ile Ser Pro Asn Glu Ser Ile Asp Asp Glu
2130 2135 2140
His Leu Leu Ile Gln His Tyr Cys Gln Ser Leu Asn Gln Asp Ser Pro
2145 2150 2155 2160
Leu Ser Gln Pro Arg Ser Pro Ala Gln Ile Leu Ile Ser Leu Glu Ser
2165 2170 2175
Glu Glu Arg Gly Glu Leu Glu Arg Ile Leu Ala Asp Leu Glu Glu Glu
2180 2185 2190
Asn Arg Asn Leu Gln Ala Glu Tyr Asp Arg Leu Lys Gln Gln His Glu
2195 2200 2205
His Lys Gly Leu Ser Pro Leu Pro Ser Pro Pro Glu Met Met Pro Thr
2210 2215 2220
Ser Pro Gln Ser Pro Arg Asp Ala Glu Leu Ile Ala Glu Ala Lys Leu
2225 2230 2235 2240
Leu Arg Gln His Lys Gly Arg Leu Glu Ala Arg Met Gln Ile Leu Glu
2245 2250 2255
Asp His Asn Lys Gln Leu Glu Ser Gln Leu His Arg Leu Arg Gln Leu
2260 2265 2270
Leu Glu Gln Pro Gln Ala Glu Ala Lys Val Asn Gly Thr Thr Val Ser
2275 2280 2285
Ser Pro Ser Thr Ser Leu Gln Arg Ser Asp Ser Ser Gln Pro Met Leu
2290 2295 2300
Leu Arg Val Val Gly Ser Gln Thr Ser Asp Ser Met Gly Glu Glu Asp
2305 2310 2315 2320
Leu Leu Ser Pro Pro Gln Asp Thr Ser Thr Gly Leu Glu Glu Val Met
2325 2330 2335
Glu Gln Leu Asn Asn Ser Phe Pro Ser Ser Arg Gly Arg Asn Thr Pro
2340 2345 2350
Gly Lys Pro Met Arg Glu Asp Thr Met
2355 2360
<210> 2
<211> 2427
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> WL1
<400> 2
Met Leu Trp Trp Glu Glu Val Glu Asp Cys Tyr Glu Arg Glu Asp Val
1 5 10 15
Gln Lys Lys Thr Phe Thr Lys Trp Val Asn Ala Gln Phe Ser Lys Phe
20 25 30
Gly Lys Gln His Ile Glu Asn Leu Phe Ser Asp Leu Gln Asp Gly Arg
35 40 45
Arg Leu Leu Asp Leu Leu Glu Gly Leu Thr Gly Gln Lys Leu Pro Lys
50 55 60
Glu Lys Gly Ser Thr Arg Val His Ala Leu Asn Asn Val Asn Lys Ala
65 70 75 80
Leu Arg Val Leu Gln Asn Asn Asn Val Asp Leu Val Asn Ile Gly Ser
85 90 95
Thr Asp Ile Val Asp Gly Asn His Lys Leu Thr Leu Gly Leu Ile Trp
100 105 110
Asn Ile Ile Leu His Trp Gln Val Lys Asn Val Met Lys Asn Ile Met
115 120 125
Ala Gly Leu Gln Gln Thr Asn Ser Glu Lys Ile Leu Leu Ser Trp Val
130 135 140
Arg Gln Ser Thr Arg Asn Tyr Pro Gln Val Asn Val Ile Asn Phe Thr
145 150 155 160
Thr Ser Trp Ser Asp Gly Leu Ala Leu Asn Ala Leu Ile His Ser His
165 170 175
Arg Pro Asp Leu Phe Asp Trp Asn Ser Val Val Cys Gln Gln Ser Ala
180 185 190
Thr Gln Arg Leu Glu His Ala Phe Asn Ile Ala Arg Tyr Gln Leu Gly
195 200 205
Ile Glu Lys Leu Leu Asp Pro Glu Asp Val Asp Thr Thr Tyr Pro Asp
210 215 220
Lys Lys Ser Ile Leu Met Tyr Ile Thr Ser Leu Phe Gln Val Leu Pro
225 230 235 240
Gln Gln Val Ser Ile Glu Ala Ile Gln Glu Val Glu Met Leu Pro Arg
245 250 255
Pro Pro Lys Val Thr Lys Glu Glu His Phe Gln Leu His His Gln Met
260 265 270
His Tyr Ser Gln Gln Ile Thr Val Ser Leu Ala Gln Gly Tyr Glu Arg
275 280 285
Thr Ser Ser Pro Lys Pro Arg Phe Lys Ser Tyr Ala Tyr Thr Gln Ala
290 295 300
Ala Tyr Val Thr Thr Ser Asp Pro Thr Arg Ser Pro Phe Pro Ser Gln
305 310 315 320
His Leu Glu Ala Pro Glu Asp Lys Ser Phe Gly Ser Ser Leu Met Glu
325 330 335
Ser Glu Val Asn Leu Asp Arg Tyr Gln Thr Ala Leu Glu Glu Val Leu
340 345 350
Ser Trp Leu Leu Ser Ala Glu Asp Thr Leu Gln Ala Gln Gly Glu Ile
355 360 365
Ser Asn Asp Val Glu Val Val Lys Asp Gln Phe His Thr His Glu Gly
370 375 380
Tyr Met Met Asp Leu Thr Ala His Gln Gly Arg Val Gly Asn Ile Leu
385 390 395 400
Gln Leu Gly Ser Lys Leu Ile Gly Thr Gly Lys Leu Ser Glu Asp Glu
405 410 415
Glu Thr Glu Val Gln Glu Gln Met Asn Leu Leu Asn Ser Arg Trp Glu
420 425 430
Cys Leu Arg Val Ala Ser Met Glu Lys Gln Ser Asn Leu His Arg Val
435 440 445
Leu Met Asp Leu Gln Asn Gln Lys Leu Lys Glu Leu Asn Asp Trp Leu
450 455 460
Thr Lys Thr Glu Glu Arg Thr Arg Lys Met Glu Glu Glu Pro Leu Gly
465 470 475 480
Pro Asp Leu Glu Asp Leu Lys Arg Gln Val Gln Gln His Lys Val Leu
485 490 495
Gln Glu Asp Leu Glu Gln Glu Gln Val Arg Val Asn Ser Leu Thr His
500 505 510
Met Val Val Val Val Asp Glu Ser Ser Gly Asp His Ala Thr Ala Ala
515 520 525
Leu Glu Glu Gln Leu Lys Val Leu Gly Asp Arg Trp Ala Asn Ile Cys
530 535 540
Arg Trp Thr Glu Asp Arg Trp Val Leu Leu Gln Asp Ile Leu Leu Lys
545 550 555 560
Trp Gln Arg Leu Thr Glu Glu Gln Cys Leu Phe Ser Ala Trp Leu Ser
565 570 575
Glu Lys Glu Asp Ala Val Asn Lys Ile His Thr Thr Gly Phe Lys Asp
580 585 590
Gln Asn Glu Met Leu Ser Ser Leu Gln Lys Leu Ala Val Leu Lys Ala
595 600 605
Asp Leu Glu Lys Lys Lys Gln Ser Met Gly Lys Leu Tyr Ser Leu Lys
610 615 620
Gln Asp Leu Leu Ser Thr Leu Lys Asn Lys Ser Val Thr Gln Lys Thr
625 630 635 640
Glu Ala Trp Leu Asp Asn Phe Ala Arg Cys Trp Asp Asn Leu Val Gln
645 650 655
Lys Leu Glu Lys Ser Thr Ala Gln Ile Ser Gln Ala Val Thr Thr Thr
660 665 670
Gln Pro Ser Leu Thr Gln Thr Thr Val Met Glu Thr Val Thr Thr Val
675 680 685
Thr Thr Arg Glu Gln Ile Leu Val Lys His Ala Gln Glu Glu Leu Pro
690 695 700
Pro Pro Pro Pro Gln Lys Lys Arg Gln Ile Thr Val Asp Gly Leu Glu
705 710 715 720
Lys Thr Val Ser Leu Gln Lys Asp Leu Ser Glu Met His Glu Trp Met
725 730 735
Thr Gln Ala Glu Glu Glu Tyr Leu Glu Arg Asp Phe Glu Tyr Lys Thr
740 745 750
Pro Asp Glu Leu Gln Lys Ala Val Glu Glu Met Lys Arg Ala Lys Glu
755 760 765
Glu Ala Gln Gln Lys Glu Ala Lys Val Lys Leu Leu Thr Glu Ser Val
770 775 780
Asn Ser Val Ile Ala Gln Ala Pro Pro Val Ala Gln Glu Ala Leu Lys
785 790 795 800
Lys Glu Leu Glu Thr Leu Thr Thr Asn Tyr Gln Trp Leu Cys Thr Arg
805 810 815
Leu Asn Gly Lys Cys Lys Thr Leu Glu Glu Val Trp Ala Cys Trp His
820 825 830
Glu Leu Leu Ser Tyr Leu Glu Lys Ala Asn Lys Trp Leu Asn Glu Val
835 840 845
Glu Phe Lys Leu Lys Thr Thr Glu Asn Ile Pro Gly Gly Ala Glu Glu
850 855 860
Ile Ser Glu Val Leu Asp Ser Leu Glu Asn Leu Met Arg His Ser Glu
865 870 875 880
Asp Asn Pro Asn Gln Ile Arg Ile Leu Ala Gln Thr Leu Thr Asp Gly
885 890 895
Gly Val Met Asp Glu Leu Ile Asn Glu Glu Leu Glu Thr Phe Asn Ser
900 905 910
Arg Trp Arg Glu Leu His Glu Glu Ala Val Arg Arg Gln Lys Leu Leu
915 920 925
Glu Gln Ile His Thr Val Arg Glu Glu Thr Met Met Val Met Thr Glu
930 935 940
Asp Met Pro Leu Glu Ile Ser Tyr Val Pro Ser Thr Tyr Leu Thr Glu
945 950 955 960
Ile Thr His Val Ser Gln Ala Leu Leu Glu Val Glu Gln Leu Leu Asn
965 970 975
Ala Pro Asp Leu Cys Ala Lys Asp Phe Glu Asp Leu Phe Lys Gln Glu
980 985 990
Glu Ser Leu Lys Asn Ile Lys Asp Ser Leu Gln Gln Ser Ser Gly Arg
995 1000 1005
Ile Asp Ile Ile His Ser Lys Lys Thr Ala Ala Leu Gln Ser Ala Thr
1010 1015 1020
Pro Val Glu Arg Val Lys Leu Gln Glu Ala Leu Ser Gln Leu Asp Phe
1025 1030 1035 1040
Gln Trp Glu Lys Val Asn Lys Met Tyr Lys Asp Arg Gln Gly Arg Phe
1045 1050 1055
Asp Arg Ser Val Glu Lys Trp Arg Arg Phe His Tyr Asp Ile Lys Ile
1060 1065 1070
Phe Asn Gln Trp Leu Thr Glu Ala Glu Gln Phe Leu Arg Lys Thr Gln
1075 1080 1085
Ile Pro Glu Asn Trp Glu His Ala Lys Tyr Lys Trp Tyr Leu Lys Glu
1090 1095 1100
Leu Gln Asp Gly Ile Gly Gln Arg Gln Thr Val Val Arg Thr Leu Asn
1105 1110 1115 1120
Ala Thr Gly Glu Glu Ile Ile Gln Gln Ser Ser Lys Thr Asp Ala Ser
1125 1130 1135
Ile Leu Gln Glu Lys Leu Gly Ser Leu Asn Leu Arg Trp Gln Glu Val
1140 1145 1150
Cys Lys Gln Leu Ser Asp Arg Lys Lys Arg Leu Glu Glu Gln Pro Asp
1155 1160 1165
Leu Ala Pro Gly Leu Thr Thr Ile Gly Ala Ser Pro Thr Gln Thr Val
1170 1175 1180
Thr Leu Val Thr Gln Pro Val Val Thr Lys Glu Thr Ala Ile Ser Lys
1185 1190 1195 1200
Leu Glu Met Pro Ser Ser Leu Met Leu Glu Val Pro Ala Leu Ala Asp
1205 1210 1215
Phe Asn Arg Ala Trp Thr Glu Leu Thr Asp Trp Leu Ser Leu Leu Asp
1220 1225 1230
Gln Val Ile Lys Ser Gln Arg Val Met Val Gly Asp Leu Glu Asp Ile
1235 1240 1245
Asn Glu Met Ile Ile Lys Gln Lys Ala Thr Met Gln Asp Leu Glu Gln
1250 1255 1260
Arg Arg Pro Gln Leu Glu Glu Leu Ile Thr Ala Ala Gln Asn Leu Lys
1265 1270 1275 1280
Asn Lys Thr Ser Asn Gln Glu Ala Arg Thr Ile Ile Thr Asp Arg Ile
1285 1290 1295
Glu Arg Ile Gln Asn Gln Trp Asp Glu Val Gln Glu His Leu Gln Asn
1300 1305 1310
Arg Arg Gln Gln Leu Asn Glu Met Leu Lys Asp Ser Thr Gln Trp Leu
1315 1320 1325
Glu Ala Lys Glu Glu Ala Glu Gln Val Leu Gly Gln Ala Arg Ala Lys
1330 1335 1340
Leu Glu Ser Trp Lys Glu Gly Pro Tyr Thr Val Asp Ala Ile Gln Lys
1345 1350 1355 1360
Lys Ile Thr Glu Thr Lys Gln Leu Ala Lys Asp Leu Arg Gln Trp Gln
1365 1370 1375
Thr Asn Val Asp Val Ala Asn Asp Leu Ala Leu Lys Leu Leu Arg Asp
1380 1385 1390
Tyr Ser Ala Asp Asp Thr Arg Lys Val His Met Ile Thr Glu Asn Ile
1395 1400 1405
Asn Ala Ser Trp Arg Ser Ile His Lys Arg Val Ser Glu Arg Glu Ala
1410 1415 1420
Ala Leu Glu Glu Thr His Arg Leu Leu Gln Gln Phe Pro Leu Asp Leu
1425 1430 1435 1440
Glu Lys Phe Leu Ala Trp Leu Thr Glu Ala Glu Thr Thr Ala Asn Val
1445 1450 1455
Leu Gln Asp Ala Thr Arg Lys Glu Arg Leu Leu Glu Asp Ser Lys Gly
1460 1465 1470
Val Lys Glu Leu Met Lys Gln Trp Gln Asp Leu Gln Gly Glu Ile Glu
1475 1480 1485
Ala His Thr Asp Val Tyr His Asn Leu Asp Glu Asn Ser Gln Lys Ile
1490 1495 1500
Leu Arg Ser Leu Glu Gly Ser Asp Asp Ala Val Leu Leu Gln Arg Arg
1505 1510 1515 1520
Leu Asp Asn Met Asn Phe Lys Trp Ser Glu Leu Arg Lys Lys Ser Leu
1525 1530 1535
Asn Ile Arg Ser His Leu Glu Ala Ser Ser Asp Gln Trp Lys Arg Leu
1540 1545 1550
His Leu Ser Leu Gln Glu Leu Leu Val Trp Leu Gln Leu Lys Asp Asp
1555 1560 1565
Glu Leu Ser Arg Gln Ala Pro Ile Gly Gly Asp Phe Pro Ala Val Gln
1570 1575 1580
Lys Gln Asn Asp Val His Arg Ala Phe Lys Arg Glu Leu Lys Thr Lys
1585 1590 1595 1600
Glu Pro Val Ile Met Ser Thr Leu Glu Thr Val Arg Ile Phe Leu Thr
1605 1610 1615
Glu Gln Pro Leu Glu Gly Leu Glu Lys Leu Tyr Gln Glu Pro Arg Glu
1620 1625 1630
Leu Pro Pro Glu Glu Arg Ala Gln Asn Val Thr Arg Leu Leu Arg Lys
1635 1640 1645
Gln Ala Glu Glu Val Asn Thr Glu Trp Glu Lys Leu Asn Leu His Ser
1650 1655 1660
Ala Asp Trp Gln Arg Lys Ile Asp Glu Thr Leu Glu Arg Leu Gln Glu
1665 1670 1675 1680
Leu Gln Glu Ala Thr Asp Glu Leu Asp Leu Lys Leu Arg Gln Ala Glu
1685 1690 1695
Val Ile Lys Gly Ser Trp Gln Pro Val Gly Asp Leu Leu Ile Asp Ser
1700 1705 1710
Leu Gln Asp His Leu Glu Lys Val Lys Ala Leu Arg Gly Glu Ile Ala
1715 1720 1725
Pro Leu Lys Glu Asn Val Ser His Val Asn Asp Leu Ala Arg Gln Leu
1730 1735 1740
Thr Thr Leu Gly Ile Gln Leu Ser Pro Tyr Asn Leu Ser Thr Leu Glu
1745 1750 1755 1760
Asp Leu Asn Thr Arg Trp Lys Leu Leu Gln Val Ala Val Glu Asp Arg
1765 1770 1775
Val Arg Gln Leu His Glu Ala His Arg Asp Phe Gly Pro Ala Ser Gln
1780 1785 1790
His Phe Leu Ser Thr Ser Val Gln Gly Pro Trp Glu Arg Ala Ile Ser
1795 1800 1805
Pro Asn Lys Val Pro Tyr Tyr Ile Asn His Glu Thr Gln Thr Thr Cys
1810 1815 1820
Trp Asp His Pro Lys Met Thr Glu Leu Tyr Gln Ser Leu Ala Asp Leu
1825 1830 1835 1840
Asn Asn Val Arg Phe Ser Ala Tyr Arg Thr Ala Met Lys Leu Arg Arg
1845 1850 1855
Leu Gln Lys Ala Leu Cys Leu Asp Leu Leu Ser Leu Ser Ala Ala Cys
1860 1865 1870
Asp Ala Leu Asp Gln His Asn Leu Lys Gln Asn Asp Gln Pro Met Asp
1875 1880 1885
Ile Leu Gln Ile Ile Asn Cys Leu Thr Thr Ile Tyr Asp Arg Leu Glu
1890 1895 1900
Gln Glu His Asn Asn Leu Val Asn Val Pro Leu Cys Val Asp Met Cys
1905 1910 1915 1920
Leu Asn Trp Leu Leu Asn Val Tyr Asp Thr Gly Arg Thr Gly Arg Ile
1925 1930 1935
Arg Val Leu Ser Phe Lys Thr Gly Ile Ile Ser Leu Cys Lys Ala His
1940 1945 1950
Leu Glu Asp Lys Tyr Arg Tyr Leu Phe Lys Gln Val Ala Ser Ser Thr
1955 1960 1965
Gly Phe Cys Asp Gln Arg Arg Leu Gly Leu Leu Leu His Asp Ser Ile
1970 1975 1980
Gln Ile Pro Arg Gln Leu Gly Glu Val Ala Ser Phe Gly Gly Ser Asn
1985 1990 1995 2000
Ile Glu Pro Ser Val Arg Ser Cys Phe Gln Phe Ala Asn Asn Lys Pro
2005 2010 2015
Glu Ile Glu Ala Ala Leu Phe Leu Asp Trp Met Arg Leu Glu Pro Gln
2020 2025 2030
Ser Met Val Trp Leu Pro Val Leu His Arg Val Ala Ala Ala Glu Thr
2035 2040 2045
Ala Lys His Gln Ala Lys Cys Asn Ile Cys Lys Glu Cys Pro Ile Ile
2050 2055 2060
Gly Phe Arg Tyr Arg Ser Leu Lys His Phe Asn Tyr Asp Ile Cys Gln
2065 2070 2075 2080
Ser Cys Phe Phe Ser Gly Arg Val Ala Lys Gly His Lys Met His Tyr
2085 2090 2095
Pro Met Val Glu Tyr Cys Thr Pro Thr Thr Ser Gly Glu Asp Val Arg
2100 2105 2110
Asp Phe Ala Lys Val Leu Lys Asn Lys Phe Arg Thr Lys Arg Tyr Phe
2115 2120 2125
Ala Lys His Pro Arg Met Gly Tyr Leu Pro Val Gln Thr Val Leu Glu
2130 2135 2140
Gly Asp Asn Met Glu Thr Pro Val Thr Leu Ile Asn Phe Trp Pro Val
2145 2150 2155 2160
Asp Ser Ala Pro Ala Ser Ser Pro Gln Leu Ser His Asp Asp Thr His
2165 2170 2175
Ser Arg Ile Glu His Tyr Ala Ser Arg Leu Ala Glu Met Glu Asn Ser
2180 2185 2190
Asn Gly Ser Tyr Leu Asn Asp Ser Ile Ser Pro Asn Glu Ser Ile Asp
2195 2200 2205
Asp Glu His Leu Leu Ile Gln His Tyr Cys Gln Ser Leu Asn Gln Asp
2210 2215 2220
Ser Pro Leu Ser Gln Pro Arg Ser Pro Ala Gln Ile Leu Ile Ser Leu
2225 2230 2235 2240
Glu Ser Glu Glu Arg Gly Glu Leu Glu Arg Ile Leu Ala Asp Leu Glu
2245 2250 2255
Glu Glu Asn Arg Asn Leu Gln Ala Glu Tyr Asp Arg Leu Lys Gln Gln
2260 2265 2270
His Glu His Lys Gly Leu Ser Pro Leu Pro Ser Pro Pro Glu Met Met
2275 2280 2285
Pro Thr Ser Pro Gln Ser Pro Arg Asp Ala Glu Leu Ile Ala Glu Ala
2290 2295 2300
Lys Leu Leu Arg Gln His Lys Gly Arg Leu Glu Ala Arg Met Gln Ile
2305 2310 2315 2320
Leu Glu Asp His Asn Lys Gln Leu Glu Ser Gln Leu His Arg Leu Arg
2325 2330 2335
Gln Leu Leu Glu Gln Pro Gln Ala Glu Ala Lys Val Asn Gly Thr Thr
2340 2345 2350
Val Ser Ser Pro Ser Thr Ser Leu Gln Arg Ser Asp Ser Ser Gln Pro
2355 2360 2365
Met Leu Leu Arg Val Val Gly Ser Gln Thr Ser Asp Ser Met Gly Glu
2370 2375 2380
Glu Asp Leu Leu Ser Pro Pro Gln Asp Thr Ser Thr Gly Leu Glu Glu
2385 2390 2395 2400
Val Met Glu Gln Leu Asn Asn Ser Phe Pro Ser Ser Arg Gly Arg Asn
2405 2410 2415
Thr Pro Gly Lys Pro Met Arg Glu Asp Thr Met
2420 2425
<210> 3
<211> 2406
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> WL2
<400> 3
Met Leu Trp Trp Glu Glu Val Glu Asp Cys Tyr Glu Arg Glu Asp Val
1 5 10 15
Gln Lys Lys Thr Phe Thr Lys Trp Val Asn Ala Gln Phe Ser Lys Phe
20 25 30
Gly Lys Gln His Ile Glu Asn Leu Phe Ser Asp Leu Gln Asp Gly Arg
35 40 45
Arg Leu Leu Asp Leu Leu Glu Gly Leu Thr Gly Gln Lys Leu Pro Lys
50 55 60
Glu Lys Gly Ser Thr Arg Val His Ala Leu Asn Asn Val Asn Lys Ala
65 70 75 80
Leu Arg Val Leu Gln Asn Asn Asn Val Asp Leu Val Asn Ile Gly Ser
85 90 95
Thr Asp Ile Val Asp Gly Asn His Lys Leu Thr Leu Gly Leu Ile Trp
100 105 110
Asn Ile Ile Leu His Trp Gln Val Lys Asn Val Met Lys Asn Ile Met
115 120 125
Ala Gly Leu Gln Gln Thr Asn Ser Glu Lys Ile Leu Leu Ser Trp Val
130 135 140
Arg Gln Ser Thr Arg Asn Tyr Pro Gln Val Asn Val Ile Asn Phe Thr
145 150 155 160
Thr Ser Trp Ser Asp Gly Leu Ala Leu Asn Ala Leu Ile His Ser His
165 170 175
Arg Pro Asp Leu Phe Asp Trp Asn Ser Val Val Cys Gln Gln Ser Ala
180 185 190
Thr Gln Arg Leu Glu His Ala Phe Asn Ile Ala Arg Tyr Gln Leu Gly
195 200 205
Ile Glu Lys Leu Leu Asp Pro Glu Asp Val Asp Thr Thr Tyr Pro Asp
210 215 220
Lys Lys Ser Ile Leu Met Tyr Ile Thr Ser Leu Phe Gln Val Leu Pro
225 230 235 240
Gln Gln Val Ser Ile Glu Ala Ile Gln Glu Val Glu Met Leu Pro Arg
245 250 255
Pro Pro Lys Val Thr Lys Glu Glu His Phe Gln Leu His His Gln Met
260 265 270
His Tyr Ser Gln Gln Ile Thr Val Ser Leu Ala Gln Gly Tyr Glu Arg
275 280 285
Thr Ser Ser Pro Lys Pro Arg Phe Lys Ser Tyr Ala Tyr Thr Gln Ala
290 295 300
Ala Tyr Val Thr Thr Ser Asp Pro Thr Arg Ser Pro Phe Pro Ser Gln
305 310 315 320
His Leu Glu Ala Pro Glu Asp Lys Ser Phe Gly Ser Ser Leu Met Glu
325 330 335
Ser Glu Val Asn Leu Asp Arg Tyr Gln Thr Ala Leu Glu Glu Val Leu
340 345 350
Ser Trp Leu Leu Ser Ala Glu Asp Thr Leu Gln Ala Gln Gly Glu Ile
355 360 365
Ser Asn Asp Val Glu Val Val Lys Asp Gln Phe His Thr His Glu Gly
370 375 380
Tyr Met Met Asp Leu Thr Ala His Gln Gly Arg Val Gly Asn Ile Leu
385 390 395 400
Gln Leu Gly Ser Lys Leu Ile Gly Thr Gly Lys Leu Ser Glu Asp Glu
405 410 415
Glu Thr Glu Val Gln Glu Gln Met Asn Leu Leu Asn Ser Arg Trp Glu
420 425 430
Cys Leu Arg Val Ala Ser Met Glu Lys Gln Ser Asn Leu His Arg Val
435 440 445
Leu Met Asp Leu Gln Asn Gln Lys Leu Lys Glu Leu Asn Asp Trp Leu
450 455 460
Thr Lys Thr Glu Glu Arg Thr Arg Lys Met Glu Glu Glu Pro Leu Gly
465 470 475 480
Pro Asp Leu Glu Asp Leu Lys Arg Gln Val Gln Gln His Lys Val Leu
485 490 495
Gln Glu Asp Leu Glu Gln Glu Gln Val Arg Val Asn Ser Leu Thr His
500 505 510
Met Val Val Val Val Asp Glu Ser Ser Gly Asp His Ala Thr Ala Ala
515 520 525
Leu Glu Glu Gln Leu Lys Val Leu Gly Asp Arg Trp Ala Asn Ile Cys
530 535 540
Arg Trp Thr Glu Asp Arg Trp Val Leu Leu Gln Asp Ile Leu Leu Lys
545 550 555 560
Trp Gln Arg Leu Thr Glu Glu Gln Cys Leu Phe Ser Ala Trp Leu Ser
565 570 575
Glu Lys Glu Asp Ala Val Asn Lys Ile His Thr Thr Gly Phe Lys Asp
580 585 590
Gln Asn Glu Met Leu Ser Ser Leu Gln Lys Leu Ala Val Leu Lys Ala
595 600 605
Asp Leu Glu Lys Lys Lys Gln Ser Met Gly Lys Leu Tyr Ser Leu Lys
610 615 620
Gln Asp Leu Leu Ser Thr Leu Lys Asn Lys Ser Val Thr Gln Lys Thr
625 630 635 640
Glu Ala Trp Leu Asp Asn Phe Ala Arg Cys Trp Asp Asn Leu Val Gln
645 650 655
Lys Leu Glu Lys Ser Thr Ala Gln Ile Ser Gln Ala Val Thr Thr Thr
660 665 670
Gln Pro Ser Leu Thr Gln Thr Thr Val Met Glu Thr Val Thr Thr Val
675 680 685
Thr Thr Arg Glu Gln Ile Leu Val Lys His Ala Gln Glu Glu Leu Pro
690 695 700
Pro Pro Pro Pro Gln Lys Lys Arg Gln Ile Thr Val Asp Gly Leu Glu
705 710 715 720
Lys Thr Val Ser Leu Gln Lys Asp Leu Ser Glu Met His Glu Trp Met
725 730 735
Thr Gln Ala Glu Glu Glu Tyr Leu Glu Arg Asp Phe Glu Tyr Lys Thr
740 745 750
Pro Asp Glu Leu Gln Lys Ala Val Glu Glu Met Lys Arg Ala Lys Glu
755 760 765
Glu Ala Gln Gln Lys Glu Ala Lys Val Lys Leu Leu Thr Glu Ser Val
770 775 780
Asn Ser Val Ile Ala Gln Ala Pro Pro Val Ala Gln Glu Ala Leu Lys
785 790 795 800
Lys Glu Leu Glu Thr Leu Thr Thr Asn Tyr Gln Trp Leu Cys Thr Arg
805 810 815
Leu Asn Gly Lys Cys Lys Thr Leu Glu Glu Val Trp Ala Cys Trp His
820 825 830
Glu Leu Leu Ser Tyr Leu Glu Lys Ala Asn Lys Trp Leu Asn Glu Val
835 840 845
Glu Phe Lys Leu Lys Thr Thr Glu Asn Ile Pro Gly Gly Ala Glu Glu
850 855 860
Ile Ser Glu Val Leu Asp Ser Leu Glu Asn Leu Met Arg His Ser Glu
865 870 875 880
Asp Asn Pro Asn Gln Ile Arg Ile Leu Ala Gln Thr Leu Thr Asp Gly
885 890 895
Gly Val Met Asp Glu Leu Ile Asn Glu Glu Leu Glu Thr Phe Asn Ser
900 905 910
Arg Trp Arg Glu Leu His Glu Glu Ala Val Arg Arg Gln Lys Leu Leu
915 920 925
Glu Gln Pro Leu Lys Glu Leu Glu Gln Phe Asn Ser Asp Ile Gln Lys
930 935 940
Leu Leu Glu Pro Leu Glu Ala Glu Ile Gln Gln Gly Val Asn Leu Lys
945 950 955 960
Glu Glu Asp Phe Asn Lys Asp Met Asn Glu Asp Asn Glu Gly Thr Val
965 970 975
Lys Glu Leu Leu Gln Arg Gly Asp Asn Leu Gln Gln Arg Ile Thr Asp
980 985 990
Glu Arg Lys Arg Glu Glu Ile Lys Ile Lys Gln Gln Leu Leu Gln Thr
995 1000 1005
Lys His Asn Ala Phe Lys Asp Leu Arg Ser Gln Arg Arg Lys Lys Ala
1010 1015 1020
Leu Glu Ile Ser His Gln Trp Tyr Gln Tyr Lys Arg Gln Ala Asp Asp
1025 1030 1035 1040
Leu Leu Lys Cys Leu Asp Asp Ile Glu Lys Lys Leu Ala Ser Leu Pro
1045 1050 1055
Glu Pro Arg Asp Glu Arg Lys Ile Lys Glu Ile Asp Arg Glu Leu Gln
1060 1065 1070
Lys Lys Lys Glu Glu Leu Asn Ala Val Arg Arg Gln Ala Glu Gly Leu
1075 1080 1085
Ser Glu Asp Gly Ala Ala Met Ala Val Glu Pro Thr Gln Ile Gln Leu
1090 1095 1100
Ser Lys Arg Trp Arg Glu Ile Glu Ser Lys Phe Ala Gln Phe Arg Arg
1105 1110 1115 1120
Leu Asn Phe Ala Gln Ile His Thr Val Arg Glu Glu Thr Met Met Val
1125 1130 1135
Met Thr Glu Asp Met Pro Leu Glu Ile Ser Tyr Val Pro Ser Thr Tyr
1140 1145 1150
Leu Thr Glu Ile Thr His Val Ser Gln Ala Leu Leu Glu Val Glu Gln
1155 1160 1165
Leu Leu Asn Ala Pro Asp Leu Cys Ala Lys Asp Phe Glu Asp Leu Phe
1170 1175 1180
Lys Gln Glu Glu Ser Leu Lys Asn Ile Lys Asp Ser Leu Gln Gln Ser
1185 1190 1195 1200
Ser Gly Arg Ile Asp Ile Ile His Ser Lys Lys Thr Ala Ala Leu Gln
1205 1210 1215
Ser Ala Thr Pro Val Glu Arg Val Lys Leu Gln Glu Ala Leu Ser Gln
1220 1225 1230
Leu Asp Phe Gln Trp Glu Lys Val Asn Lys Met Tyr Lys Asp Arg Gln
1235 1240 1245
Gly Arg Phe Asp Arg Ser Val Glu Lys Trp Arg Arg Phe His Tyr Asp
1250 1255 1260
Ile Lys Ile Phe Asn Gln Trp Leu Thr Glu Ala Glu Gln Phe Leu Arg
1265 1270 1275 1280
Lys Thr Gln Ile Pro Glu Asn Trp Glu His Ala Lys Tyr Lys Trp Tyr
1285 1290 1295
Leu Lys Glu Leu Gln Asp Gly Ile Gly Gln Arg Gln Thr Val Val Arg
1300 1305 1310
Thr Leu Asn Ala Thr Gly Glu Glu Ile Ile Gln Gln Ser Ser Lys Thr
1315 1320 1325
Asp Ala Ser Ile Leu Gln Glu Lys Leu Gly Ser Leu Asn Leu Arg Trp
1330 1335 1340
Gln Glu Val Cys Lys Gln Leu Ser Asp Arg Lys Lys Arg Leu Glu Glu
1345 1350 1355 1360
Gln Pro Asp Leu Ala Pro Gly Leu Thr Thr Ile Gly Ala Ser Pro Thr
1365 1370 1375
Gln Thr Val Thr Leu Val Thr Gln Pro Val Val Thr Lys Glu Thr Ala
1380 1385 1390
Ile Ser Lys Leu Glu Met Pro Ser Ser Leu Met Leu Glu Val Pro Thr
1395 1400 1405
His Arg Leu Leu Gln Gln Phe Pro Leu Asp Leu Glu Lys Phe Leu Ala
1410 1415 1420
Trp Leu Thr Glu Ala Glu Thr Thr Ala Asn Val Leu Gln Asp Ala Thr
1425 1430 1435 1440
Arg Lys Glu Arg Leu Leu Glu Asp Ser Lys Gly Val Lys Glu Leu Met
1445 1450 1455
Lys Gln Trp Gln Asp Leu Gln Gly Glu Ile Glu Ala His Thr Asp Val
1460 1465 1470
Tyr His Asn Leu Asp Glu Asn Ser Gln Lys Ile Leu Arg Ser Leu Glu
1475 1480 1485
Gly Ser Asp Asp Ala Val Leu Leu Gln Arg Arg Leu Asp Asn Met Asn
1490 1495 1500
Phe Lys Trp Ser Glu Leu Arg Lys Lys Ser Leu Asn Ile Arg Ser His
1505 1510 1515 1520
Leu Glu Ala Ser Ser Asp Gln Trp Lys Arg Leu His Leu Ser Leu Gln
1525 1530 1535
Glu Leu Leu Val Trp Leu Gln Leu Lys Asp Asp Glu Leu Ser Arg Gln
1540 1545 1550
Ala Pro Ile Gly Gly Asp Phe Pro Ala Val Gln Lys Gln Asn Asp Val
1555 1560 1565
His Arg Ala Phe Lys Arg Glu Leu Lys Thr Lys Glu Pro Val Ile Met
1570 1575 1580
Ser Thr Leu Glu Thr Val Arg Ile Phe Leu Thr Glu Gln Pro Leu Glu
1585 1590 1595 1600
Gly Leu Glu Lys Leu Tyr Gln Glu Pro Arg Glu Leu Pro Pro Glu Glu
1605 1610 1615
Arg Ala Gln Asn Val Thr Arg Leu Leu Arg Lys Gln Ala Glu Glu Val
1620 1625 1630
Asn Thr Glu Trp Glu Lys Leu Asn Leu His Ser Ala Asp Trp Gln Arg
1635 1640 1645
Lys Ile Asp Glu Thr Leu Glu Arg Leu Gln Glu Leu Gln Glu Ala Thr
1650 1655 1660
Asp Glu Leu Asp Leu Lys Leu Arg Gln Ala Glu Val Ile Lys Gly Ser
1665 1670 1675 1680
Trp Gln Pro Val Gly Asp Leu Leu Ile Asp Ser Leu Gln Asp His Leu
1685 1690 1695
Glu Lys Val Lys Ala Leu Arg Gly Glu Ile Ala Pro Leu Lys Glu Asn
1700 1705 1710
Val Ser His Val Asn Asp Leu Ala Arg Gln Leu Thr Thr Leu Gly Ile
1715 1720 1725
Gln Leu Ser Pro Tyr Asn Leu Ser Thr Leu Glu Asp Leu Asn Thr Arg
1730 1735 1740
Trp Lys Leu Leu Gln Val Ala Val Glu Asp Arg Val Arg Gln Leu His
1745 1750 1755 1760
Glu Ala His Arg Asp Phe Gly Pro Ala Ser Gln His Phe Leu Ser Thr
1765 1770 1775
Ser Val Gln Gly Pro Trp Glu Arg Ala Ile Ser Pro Asn Lys Val Pro
1780 1785 1790
Tyr Tyr Ile Asn His Glu Thr Gln Thr Thr Cys Trp Asp His Pro Lys
1795 1800 1805
Met Thr Glu Leu Tyr Gln Ser Leu Ala Asp Leu Asn Asn Val Arg Phe
1810 1815 1820
Ser Ala Tyr Arg Thr Ala Met Lys Leu Arg Arg Leu Gln Lys Ala Leu
1825 1830 1835 1840
Cys Leu Asp Leu Leu Ser Leu Ser Ala Ala Cys Asp Ala Leu Asp Gln
1845 1850 1855
His Asn Leu Lys Gln Asn Asp Gln Pro Met Asp Ile Leu Gln Ile Ile
1860 1865 1870
Asn Cys Leu Thr Thr Ile Tyr Asp Arg Leu Glu Gln Glu His Asn Asn
1875 1880 1885
Leu Val Asn Val Pro Leu Cys Val Asp Met Cys Leu Asn Trp Leu Leu
1890 1895 1900
Asn Val Tyr Asp Thr Gly Arg Thr Gly Arg Ile Arg Val Leu Ser Phe
1905 1910 1915 1920
Lys Thr Gly Ile Ile Ser Leu Cys Lys Ala His Leu Glu Asp Lys Tyr
1925 1930 1935
Arg Tyr Leu Phe Lys Gln Val Ala Ser Ser Thr Gly Phe Cys Asp Gln
1940 1945 1950
Arg Arg Leu Gly Leu Leu Leu His Asp Ser Ile Gln Ile Pro Arg Gln
1955 1960 1965
Leu Gly Glu Val Ala Ser Phe Gly Gly Ser Asn Ile Glu Pro Ser Val
1970 1975 1980
Arg Ser Cys Phe Gln Phe Ala Asn Asn Lys Pro Glu Ile Glu Ala Ala
1985 1990 1995 2000
Leu Phe Leu Asp Trp Met Arg Leu Glu Pro Gln Ser Met Val Trp Leu
2005 2010 2015
Pro Val Leu His Arg Val Ala Ala Ala Glu Thr Ala Lys His Gln Ala
2020 2025 2030
Lys Cys Asn Ile Cys Lys Glu Cys Pro Ile Ile Gly Phe Arg Tyr Arg
2035 2040 2045
Ser Leu Lys His Phe Asn Tyr Asp Ile Cys Gln Ser Cys Phe Phe Ser
2050 2055 2060
Gly Arg Val Ala Lys Gly His Lys Met His Tyr Pro Met Val Glu Tyr
2065 2070 2075 2080
Cys Thr Pro Thr Thr Ser Gly Glu Asp Val Arg Asp Phe Ala Lys Val
2085 2090 2095
Leu Lys Asn Lys Phe Arg Thr Lys Arg Tyr Phe Ala Lys His Pro Arg
2100 2105 2110
Met Gly Tyr Leu Pro Val Gln Thr Val Leu Glu Gly Asp Asn Met Glu
2115 2120 2125
Thr Pro Val Thr Leu Ile Asn Phe Trp Pro Val Asp Ser Ala Pro Ala
2130 2135 2140
Ser Ser Pro Gln Leu Ser His Asp Asp Thr His Ser Arg Ile Glu His
2145 2150 2155 2160
Tyr Ala Ser Arg Leu Ala Glu Met Glu Asn Ser Asn Gly Ser Tyr Leu
2165 2170 2175
Asn Asp Ser Ile Ser Pro Asn Glu Ser Ile Asp Asp Glu His Leu Leu
2180 2185 2190
Ile Gln His Tyr Cys Gln Ser Leu Asn Gln Asp Ser Pro Leu Ser Gln
2195 2200 2205
Pro Arg Ser Pro Ala Gln Ile Leu Ile Ser Leu Glu Ser Glu Glu Arg
2210 2215 2220
Gly Glu Leu Glu Arg Ile Leu Ala Asp Leu Glu Glu Glu Asn Arg Asn
2225 2230 2235 2240
Leu Gln Ala Glu Tyr Asp Arg Leu Lys Gln Gln His Glu His Lys Gly
2245 2250 2255
Leu Ser Pro Leu Pro Ser Pro Pro Glu Met Met Pro Thr Ser Pro Gln
2260 2265 2270
Ser Pro Arg Asp Ala Glu Leu Ile Ala Glu Ala Lys Leu Leu Arg Gln
2275 2280 2285
His Lys Gly Arg Leu Glu Ala Arg Met Gln Ile Leu Glu Asp His Asn
2290 2295 2300
Lys Gln Leu Glu Ser Gln Leu His Arg Leu Arg Gln Leu Leu Glu Gln
2305 2310 2315 2320
Pro Gln Ala Glu Ala Lys Val Asn Gly Thr Thr Val Ser Ser Pro Ser
2325 2330 2335
Thr Ser Leu Gln Arg Ser Asp Ser Ser Gln Pro Met Leu Leu Arg Val
2340 2345 2350
Val Gly Ser Gln Thr Ser Asp Ser Met Gly Glu Glu Asp Leu Leu Ser
2355 2360 2365
Pro Pro Gln Asp Thr Ser Thr Gly Leu Glu Glu Val Met Glu Gln Leu
2370 2375 2380
Asn Asn Ser Phe Pro Ser Ser Arg Gly Arg Asn Thr Pro Gly Lys Pro
2385 2390 2395 2400
Met Arg Glu Asp Thr Met
2405
<210> 4
<211> 7284
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> WL1 wt
<400> 4
atgctttggt gggaagaagt agaggactgt tatgaaagag aagatgttca aaagaaaaca 60
ttcacaaaat gggtaaatgc acaattttct aagtttggga agcagcatat tgagaacctc 120
ttcagtgacc tacaggatgg gaggcgcctc ctagacctcc tcgaaggcct gacagggcaa 180
aaactgccaa aagaaaaagg atccacaaga gttcatgccc tgaacaatgt caacaaggca 240
ctgcgggttt tgcagaacaa taatgttgat ttagtgaata ttggaagtac tgacatcgta 300
gatggaaatc ataaactgac tcttggtttg atttggaata taatcctcca ctggcaggtc 360
aaaaatgtaa tgaaaaatat catggctgga ttgcaacaaa ccaacagtga aaagattctc 420
ctgagctggg tccgacaatc aactcgtaat tatccacagg ttaatgtaat caacttcacc 480
accagctggt ctgatggcct ggctttgaat gctctcatcc atagtcatag gccagaccta 540
tttgactgga atagtgtggt ttgccagcag tcagccacac aacgactgga acatgcattc 600
aacatcgcca gatatcaatt aggcatagag aaactactcg atcctgaaga tgttgatacc 660
acctatccag ataagaagtc catcttaatg tacatcacat cactcttcca agttttgcct 720
caacaagtga gcattgaagc catccaggaa gtggaaatgt tgccaaggcc acctaaagtg 780
actaaagaag aacattttca gttacatcat caaatgcact attctcaaca gatcacggtc 840
agtctagcac agggatatga gagaacttct tcccctaagc ctcgattcaa gagctatgcc 900
tacacacagg ctgcttatgt caccacctct gaccctacac ggagcccatt tccttcacag 960
catttggaag ctcctgaaga caagtcattt ggcagttcat tgatggagag tgaagtaaac 1020
ctggaccgtt atcaaacagc tttagaagaa gtattatcgt ggcttctttc tgctgaggac 1080
acattgcaag cacaaggaga gatttctaat gatgtggaag tggtgaaaga ccagtttcat 1140
actcatgagg ggtacatgat ggatttgaca gcccatcagg gccgggttgg taatattcta 1200
caattgggaa gtaagctgat tggaacagga aaattatcag aagatgaaga aactgaagta 1260
caagagcaga tgaatctcct aaattcaaga tgggaatgcc tcagggtagc tagcatggaa 1320
aaacaaagca atttacatag agttttaatg gatctccaga atcagaaact gaaagagttg 1380
aatgactggc taacaaaaac agaagaaaga acaaggaaaa tggaggaaga gcctcttgga 1440
cctgatcttg aagacctaaa acgccaagta caacaacata aggtgcttca agaagatcta 1500
gaacaagaac aagtcagggt caattctctc actcacatgg tggtggtagt tgatgaatct 1560
agtggagatc acgcaactgc tgctttggaa gaacaactta aggtattggg agatcgatgg 1620
gcaaacatct gtagatggac agaagaccgc tgggttcttt tacaagacat ccttctcaaa 1680
tggcaacgtc ttactgaaga acagtgcctt tttagtgcat ggctttcaga aaaagaagat 1740
gcagtgaaca agattcacac aactggcttt aaagatcaaa atgaaatgtt atcaagtctt 1800
caaaaactgg ccgttttaaa agcggatcta gaaaagaaaa agcaatccat gggcaaactg 1860
tattcactca aacaagatct tctttcaaca ctgaagaata agtcagtgac ccagaagacg 1920
gaagcatggc tggataactt tgcccggtgt tgggataatt tagtccaaaa acttgaaaag 1980
agtacagcac agatttcaca ggctgtcacc accactcagc catcactaac acagacaact 2040
gtaatggaaa cagtaactac ggtgaccaca agggaacaga tcctggtaaa gcatgctcaa 2100
gaggaacttc caccaccacc tccccaaaag aagaggcaga ttactgtgga tggtttggag 2160
aaaactgtaa gcctccagaa agatctatca gagatgcacg aatggatgac acaagctgaa 2220
gaagagtatc ttgagagaga ttttgaatat aaaactccag atgaattaca gaaagcagtt 2280
gaagagatga agagagctaa agaagaggcc caacaaaaag aagcgaaagt gaaactcctt 2340
actgagtctg taaatagtgt catagctcaa gctccacctg tagcacaaga ggccttaaaa 2400
aaggaacttg aaactctaac caccaactac cagtggctct gcactaggct gaatgggaaa 2460
tgcaagactt tggaagaagt ttgggcatgt tggcatgagt tattgtcata cttggagaaa 2520
gcaaacaagt ggctaaatga agtagaattt aaacttaaaa ccactgaaaa cattcctggc 2580
ggagctgagg aaatctctga ggtgctagat tcacttgaaa atttgatgcg acattcagag 2640
gataacccaa atcagattcg catattggca cagaccctaa cagatggcgg agtcatggat 2700
gagctaatca atgaggaact tgagacattt aattctcgtt ggagggaact acatgaagag 2760
gctgtaagga ggcaaaagtt gcttgaacag attcacactg tccgtgaaga aacgatgatg 2820
gtgatgactg aagacatgcc tttggaaatt tcttatgtgc cttctactta tttgactgaa 2880
atcactcatg tctcacaagc cctattagaa gtggaacaac ttctcaatgc tcctgacctc 2940
tgtgctaagg actttgaaga tctctttaag caagaggagt ctctgaagaa tataaaagat 3000
agtctacaac aaagctcagg tcggattgac attattcata gcaagaagac agcagcattg 3060
caaagtgcaa cgcctgtgga aagggtgaag ctacaggaag ctctctccca gcttgatttc 3120
caatgggaaa aagttaacaa aatgtacaag gaccgacaag ggcgatttga cagatctgtt 3180
gagaaatggc ggcgttttca ttatgatata aagatattta atcagtggct aacagaagct 3240
gaacagtttc tcagaaagac acaaattcct gagaattggg aacatgctaa atacaaatgg 3300
tatcttaagg aactccagga tggcattggg cagcggcaaa ctgttgtcag aacattgaat 3360
gcaactgggg aagaaataat tcagcaatcc tcaaaaacag atgccagtat tctacaggaa 3420
aaattgggaa gcctgaatct gcggtggcag gaggtctgca aacagctgtc agacagaaaa 3480
aagaggctag aagagcagcc tgacctagct cctggactga ccactattgg agcctctcct 3540
actcagactg ttactctggt gacacaacct gtggttacta aggaaactgc catctccaaa 3600
ctagaaatgc catcttcctt gatgttggag gtacctgctc tggcagattt caaccgggct 3660
tggacagaac ttaccgactg gctttctctg cttgatcaag ttataaaatc acagagggtg 3720
atggtgggtg accttgagga tatcaacgag atgatcatca agcagaaggc aacaatgcag 3780
gatttggaac agaggcgtcc ccagttggaa gaactcatta ccgctgccca aaatttgaaa 3840
aacaagacca gcaatcaaga ggctagaaca atcattacgg atcgaattga aagaattcag 3900
aatcagtggg atgaagtaca agaacacctt cagaaccgga ggcaacagtt gaatgaaatg 3960
ttaaaggatt caacacaatg gctggaagct aaggaagaag ctgagcaggt cttaggacag 4020
gccagagcca agcttgagtc atggaaggag ggtccctata cagtagatgc aatccaaaag 4080
aaaatcacag aaaccaagca gttggccaaa gacctccgcc agtggcagac aaatgtagat 4140
gtggcaaatg acttggccct gaaacttctc cgggattatt ctgcagatga taccagaaaa 4200
gtccacatga taacagagaa tatcaatgcc tcttggagaa gcattcataa aagggtgagt 4260
gagcgagagg ctgctttgga agaaactcat agattactgc aacagttccc cctggacctg 4320
gaaaagtttc ttgcctggct tacagaagct gaaacaactg ccaatgtcct acaggatgct 4380
acccgtaagg aaaggctcct agaagactcc aagggagtaa aagagctgat gaaacaatgg 4440
caagacctcc aaggtgaaat tgaagctcac acagatgttt atcacaacct ggatgaaaac 4500
agccaaaaaa tcctgagatc cctggaaggt tccgatgatg cagtcctgtt acaaagacgt 4560
ttggataaca tgaacttcaa gtggagtgaa cttcggaaaa agtctctcaa cattaggtcc 4620
catttggaag ccagttctga ccagtggaag cgtctgcacc tttctctgca ggaacttctg 4680
gtgtggctac agctgaaaga tgatgaatta agccggcagg cacctattgg aggcgacttt 4740
ccagcagttc agaagcagaa cgatgtacat agggccttca agagggaatt gaaaactaaa 4800
gaacctgtaa tcatgagtac tcttgagact gtacgaatat ttctgacaga gcagcctttg 4860
gaaggactag agaaactcta ccaggagccc agagagctgc ctcctgagga gagagcccag 4920
aatgtcactc ggcttctacg aaagcaggct gaggaggtca atactgagtg ggaaaaattg 4980
aacctgcact ccgctgactg gcagagaaaa atagatgaga cccttgaaag actccaggaa 5040
cttcaagagg ccacggatga gctggacctc aagctgcgcc aagctgaggt gatcaaggga 5100
tcctggcagc ccgtgggcga tctcctcatt gactctctcc aagatcacct cgagaaagtc 5160
aaggcacttc gaggagaaat tgcgcctctg aaagagaacg tgagccacgt caatgacctt 5220
gctcgccagc ttaccacttt gggcattcag ctctcaccgt ataacctcag cactctggaa 5280
gacctgaaca ccagatggaa gcttctgcag gtggccgtcg aggaccgagt caggcagctg 5340
catgaagccc acagggactt tggtccagca tctcagcact ttctttccac gtctgtccag 5400
ggtccctggg agagagccat ctcgccaaac aaagtgccct actatatcaa ccacgagact 5460
caaacaactt gctgggacca tcccaaaatg acagagctct accagtcttt agctgacctg 5520
aataatgtca gattctcagc ttataggact gccatgaaac tccgaagact gcagaaggcc 5580
ctttgcttgg atctcttgag cctgtcagct gcatgtgatg ccttggacca gcacaacctc 5640
aagcaaaatg accagcccat ggatatcctg cagattatta attgtttgac cactatttat 5700
gaccgcctgg agcaagagca caacaatttg gtcaacgtcc ctctctgcgt ggatatgtgt 5760
ctgaactggc tgctgaatgt ttatgatacg ggacgaacag ggaggatccg tgtcctgtct 5820
tttaaaactg gcatcatttc cctgtgtaaa gcacatttgg aagacaagta cagatacctt 5880
ttcaagcaag tggcaagttc aacaggattt tgtgaccagc gcaggctggg cctccttctg 5940
catgattcta tccaaattcc aagacagttg ggtgaagttg catcctttgg gggcagtaac 6000
attgagccaa gtgtccggag ctgcttccaa tttgctaata ataagccaga gatcgaagcg 6060
gccctcttcc tagactggat gagactggaa ccccagtcca tggtgtggct gcccgtcctg 6120
cacagagtgg ctgctgcaga aactgccaag catcaggcca aatgtaacat ctgcaaagag 6180
tgtccaatca ttggattcag gtacaggagt ctaaagcact ttaattatga catctgccaa 6240
agctgctttt tttctggtcg agttgcaaaa ggccataaaa tgcactatcc catggtggaa 6300
tattgcactc cgactacatc aggagaagat gttcgagact ttgccaaggt actaaaaaac 6360
aaatttcgaa ccaaaaggta ttttgcgaag catccccgaa tgggctacct gccagtgcag 6420
actgtcttag agggggacaa catggaaact cccgttactc tgatcaactt ctggccagta 6480
gattctgcgc ctgcctcgtc ccctcagctt tcacacgatg atactcattc acgcattgaa 6540
cattatgcta gcaggctagc agaaatggaa aacagcaatg gatcttatct aaatgatagc 6600
atctctccta atgagagcat agatgatgaa catttgttaa tccagcatta ctgccaaagt 6660
ttgaaccagg actcccccct gagccagcct cgtagtcctg cccagatctt gatttcctta 6720
gagagtgagg aaagagggga gctagagaga atcctagcag atcttgagga agaaaacagg 6780
aatctgcaag cagaatatga ccgtctaaag cagcagcacg aacataaagg cctgtcccca 6840
ctgccgtccc ctcctgaaat gatgcccacc tctccccaga gtccccggga tgctgagctc 6900
attgctgagg ccaagctact gcgtcaacac aaaggccgcc tggaagccag gatgcaaatc 6960
ctggaagacc acaataaaca gctggagtca cagttacaca ggctaaggca gctgctggag 7020
caaccccagg cagaggccaa agtgaatggc acaacggtgt cctctccttc tacctctcta 7080
cagaggtccg acagcagtca gcctatgctg ctccgagtgg ttggcagtca aacttcggac 7140
tccatgggtg aggaagatct tctcagtcct ccccaggaca caagcacagg gttagaggag 7200
gtgatggagc aactcaacaa ctccttccct agttcaagag gaagaaatac ccctggaaag 7260
ccaatgagag aggacacaat gtag 7284
<210> 5
<211> 7284
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> WL1 opt
<400> 5
atgctgtggt gggaggaggt ggaggactgc tatgagagag aggatgtgca gaagaagacc 60
ttcaccaagt gggtgaatgc ccagttcagc aagtttggca agcagcacat tgagaacctg 120
ttctctgacc tgcaggatgg caggagcctg ctggacctgc tggagggcct aacaggccag 180
aagctgccca aggagaaggg cagtaccagg gtgcatgccc tgaacaatgt gaacaaggcc 240
ctgagggtgc tgcagaacaa caatgtggac ctagtgaata ttggcagtac agatattgtg 300
gatggcaacc acaagctgac cctgggcctg atctggaata ttatcctaca ctggcaggtg 360
aagaatgtga tgaagaatat tatggctggc ctgcagcaga ccaactctga gaagatcctg 420
ctgagctggg tgaggcagag taccaggaac tacccccagg tgaatgtgat caacttcacc 480
accagctggt ctgatggcct ggccctgaat gccctaatcc acagccacag gcctgacctg 540
tttgactgga actctgtggt gtgccagcag tctgccaccc agaggctaga gcatgccttc 600
aatattgcca ggtaccagct ggggattgag aagctgctgg accctgagga tgtggataca 660
acctaccctg acaagaagag catcctgatg tacatcacca gcctgttcca ggtgctgccc 720
cagcaggtga gcattgaggc catccaggag gtggagatgc tacccaggcc ccccaaggtg 780
accaaggagg agcacttcca gctacaccac cagatgcact acagccagca gatcacagtg 840
agcctggccc agggctatga gaggaccagc agccccaagc ccaggttcaa gagctatgcc 900
tacacccagg ctgcctatgt gaccacctca gaccccacca ggagcccctt ccccagccag 960
cacctggagg cccctgagga caagagcttt ggcagcagcc tgatggagtc tgaggtgaac 1020
ctggacaggt accagacagc cctggaggag gtgctgagct ggctgctgtc tgctgaggat 1080
accctacagg cccagggaga gatcagcaat gatgtggagg tggtgaagga ccagttccat 1140
acccatgagg gctacatgat ggacctaaca gcccaccagg gcagggtggg caacatcctg 1200
cagctgggca gcaagctgat tggcacaggc aagctgtctg aggatgagga gacagaggtg 1260
caggagcaga tgaacctgct gaacagcagg tgggagtgcc tgagggtggc cagtatggag 1320
aagcagagca acctacacag ggtgctgatg gacctgcaga accagaagct gaaggagctg 1380
aatgactggc tgaccaagac agaggagagg accaggaaga tggaggagga gcccctgggc 1440
cctgacctgg aggacctaaa gaggcaggtg cagcagcaca aggtgctgca ggaggacctg 1500
gagcaggagc aggtgagggt gaacagccta acccatatgg tggtggtggt ggatgagagc 1560
tcaggagacc atgctacagc tgccctggag gagcagctga aggtgctggg agacaggtgg 1620
gctaatatct gcaggtggac agaggacagg tgggtgctgc tgcaggatat cctgctgaag 1680
tggcagaggc tgacagagga gcagtgcctg ttctctgcct ggctgtcaga gaaggaggat 1740
gctgtgaaca agatccatac aacaggcttc aaggaccaga atgagatgct gagcagcctg 1800
cagaagctgg ctgtgctgaa ggctgacctg gagaagaaga agcagagtat gggcaagctg 1860
tacagcctga agcaggacct gctgagtacc ctgaagaaca agtctgtgac ccagaagaca 1920
gaggcctggc tggacaactt tgccaggtgc tgggataacc tggtgcagaa gctggagaag 1980
agtacagccc agatcagcca ggctgtgaca acaacccagc ccagcctgac ccagacaaca 2040
gtgatggaga cagtgacaac agtgacaacc agggagcaga tcctagtgaa gcatgcccag 2100
gaggagctgc cccccccccc cccccagaag aagaggcaga ttacagtgga tggcctggag 2160
aagacagtga gcctccagaa ggacctgtca gagatgcatg agtggatgac ccaggctgag 2220
gaggagtacc tggagaggga ctctgagtac aagacccctg atgagctgca gaaggctgtg 2280
gaggagatga agagggccaa ggaggaggcc cagcagaagg aggccaaggt gaagctgctg 2340
acagagtctg tgaactctgt gattgcccag gccccccctg tggcccagga ggccctgaag 2400
aaggagctgg agaccctgac aaccaactac cagtggctgt gcaccaggct gaatggcaag 2460
tgcaagaccc tagaggaggt gtgggcctgc tggcatgagc tgctgagcta cctggagaag 2520
gccaacaagt ggctgaatga ggtggagttc aagctgaaga ccacagagaa catccctgga 2580
ggagctgagg agatctctga ggtgctggac agcctggaga acctaatgag gcactctgag 2640
gacaacccca accagatcag gatcctggcc cagaccctga cagatggagg agtgatggat 2700
gagctgatca atgaggagct ggagaccttc aacagcaggt ggagggagct acatgaggag 2760
gctgtgagga ggcagaagct gctagagcag atccatacag tgagggagga gacaatgatg 2820
gtgatgacag aggatatgcc cctggagatc agctatgtgc ccagcaccta cctaacagag 2880
atcacccatg tgagccaggc cctgctggag gtggagcagc tgctgaatgc ccctgacctg 2940
tgtgccaagg actttgagga cctgttcaag caggaggaga gcctgaagaa catcaaggac 3000
agcctgcagc agagctcagg caggattgat attatccaca gcaagaagac agctgccctg 3060
cagtctgcta cccctgtgga gagggtgaag ctgcaggagg ccctgagcca gctggacttc 3120
cagtgggaga aggtgaacaa gatgtacaag gacaggcagg gcaggtttga caggtcagtg 3180
gagaagtgga ggaggttcca ctatgacatc aagatcttca accagtggct gacagaggct 3240
gagcagttcc taaggaagac ccagatccct gagaactggg agcatgccaa gtacaagtgg 3300
taccttaagg agctgcagga tggcattggc cagaggcaga cagtggtgag gaccctgaat 3360
gctacagggg aggagattat ccagcagagc agcaagacag atgccagtat cctgcaggag 3420
aagctgggca gcctgaacct gaggtggcag gaggtgtgca agcagctgtc tgacaggaag 3480
aagaggctgg aggagcagcc tgacctggcc cctggcctga caacaattgg agccagccct 3540
acccagacag tgaccctggt gacccagcct gtggtgacca aggagacagc tatcagcaag 3600
ctggagatgc ccagcagcct gatgctggag gtgcctgccc tggctgactt caacagggcc 3660
tggacagagc tgacagactg gctgagcctg ctggaccagg tgatcaagag ccagagggtg 3720
atggtggggg acctggagga tatcaatgag atgattatca agcagaaggc tacaatgcag 3780
gacctggagc agaggaggcc ccagctggag gagctgatta cagctgccca gaacctaaag 3840
aacaagacca gcaaccagga ggccaggaca attattacag acaggattga gaggatccag 3900
aaccagtggg atgaggtgca ggagcacctg cagaacagga ggcagcagct gaatgagatg 3960
ctgaaggaca gtacccagtg gctggaggcc aaggaggagg ctgagcaggt gctgggccag 4020
gccagggcca agctggagag ctggaaggag ggcccctaca cagtggatgc catccagaag 4080
aagatcacag agaccaagca gctggccaag gacctgaggc agtggcagac caatgtggat 4140
gtggccaatg acctggccct gaagctgctg agggactact cagctgatga caccaggaag 4200
gtgcacatga tcacagagaa catcaatgcc agctggagga gcatccacaa gagggtgtct 4260
gagagggagg ctgccctaga ggagacccac aggctgctgc agcagttccc cctggacctg 4320
gagaagttcc tggcctggct gacagaggct gagacaacag ccaatgtgct gcaggatgct 4380
accaggaagg agaggctgct ggaggacagc aagggagtga aggagctgat gaagcagtgg 4440
caggacctac agggggagat tgaggcccat acagatgtgt accacaacct ggatgagaac 4500
agccagaaga tcctgaggag cctggagggc tctgatgatg ctgtgctgct gcagaggagg 4560
ctggacaata tgaacttcaa gtggtctgag ctgaggaaga agagcctaaa tatcaggagc 4620
cacctggagg ccagctctga ccagtggaag aggctacacc tgagcctgca ggagctgctg 4680
gtgtggctgc agctgaagga tgatgagcta agcaggcagg cccctattgg gggagacttc 4740
cctgctgtgc agaagcagaa tgatgtgcac agggccttca agagggagct gaagaccaag 4800
gagcctgtga ttatgagtac cctggagaca gtgaggatct tcctgacaga gcagcccctg 4860
gagggcctgg agaagctgta ccaggagccc agggagctgc cccctgagga gagggcccag 4920
aatgtgacca ggctgctgag gaagcaggct gaggaggtga atacagagtg ggagaagctg 4980
aacctacact ctgctgactg gcagaggaag attgatgaga ccctggagag gctgcaggag 5040
ctgcaggagg ctacagatga gctggacctg aagctgaggc aggctgaggt gatcaagggc 5100
agctggcagc ctgtggggga cctgctgatt gacagcctac aggaccacct ggagaaggtg 5160
aaggccctga ggggagagat tgccccccta aaggagaatg tgagtcatgt gaatgacctg 5220
gccaggcagc tgaccaccct gggcatccag ctgagcccct acaacctgag caccctggag 5280
gacctgaaca ccaggtggaa gctgctgcag gtggctgtgg aggacagggt gaggcagcta 5340
catgaggccc acagggactt tggccctgcc agccagcact tcctaagcac ctctgttcag 5400
ggcccctggg agagggccat cagccccaac aaggtgccct actacatcaa ccatgagacc 5460
cagaccacct gctgggacca ccccaagatg acagagctgt accagagcct ggctgacctg 5520
aacaatgtga ggttctctgc ctacaggaca gccatgaagc tgaggaggct gcagaaggcc 5580
ctgtgcctgg acctgctgag cctgtctgct gcctgtgatg ccctagacca gcacaacctg 5640
aagcagaatg accagcccat ggacatcctt cagatcatca actgcctgac caccatctat 5700
gacaggctag agcaggagca caacaacctg gtgaatgtgc ccctgtgtgt ggacatgtgc 5760
ctgaactggc tgctgaatgt gtatgacaca ggcaggacag gcaggatcag ggtgctgagc 5820
ttcaagacag gcatcatcag cctatgcaag gctcacctgg aggacaagta caggtacctg 5880
ttcaagcagg tggccagcag cacaggcttc tgtgaccaga ggaggctggg cctgctactt 5940
catgacagca ttcagatccc caggcagctg ggagaggtgg ccagctttgg aggcagcaac 6000
attgagccct cagtgaggag ctgcttccag tttgccaaca acaagcctga gattgaggct 6060
gccctgttcc tggactggat gaggctggag cctcagagca tggtgtggct gcctgtgctg 6120
cacagggtgg ctgctgctga gacagccaag catcaggcca agtgcaacat ctgcaaggag 6180
tgccccatca ttggcttcag gtacaggagc ctaaagcact tcaactatga catctgccag 6240
agctgcttct tctcaggcag ggtggccaag ggccacaaga tgcactaccc catggtggag 6300
tactgcaccc ccaccacctc aggagaggat gtgagggact ttgccaaggt gctgaagaac 6360
aagttcagga ccaagaggta ctttgccaag caccccagga tgggctacct gcctgtgcag 6420
acagtgctgg agggggacaa catggagacc cctgtgaccc tgatcaactt ctggcctgtg 6480
gactcagccc ctgccagcag cccccagctg agccatgatg acacccacag caggattgag 6540
cactatgcca gcaggctggc tgagatggag aacagcaatg gcagctacct aaatgacagc 6600
atcagcccca atgagagcat tgatgatgag cacctgctaa tccagcacta ctgccagagc 6660
ctgaaccagg acagccccct gagccagccc aggagccctg cccagatcct gatcagcctg 6720
gagtctgagg agagggggga gctggagagg atcctggctg acctggagga ggagaacagg 6780
aaccttcagg ctgagtatga caggctaaag cagcagcatg agcacaaggg cctgagcccc 6840
ctacccagcc cccctgagat gatgcccacc agcccccaga gccccaggga tgctgagctg 6900
attgctgagg ccaagctgct gaggcagcac aagggcaggc tggaggccag gatgcagatc 6960
ctagaggacc acaacaagca gctggagagc cagctacaca ggctgaggca gctgctggag 7020
cagcctcagg ctgaggccaa ggtgaatggc accacagtga gcagccccag caccagcctg 7080
cagaggtctg acagcagtca gcccatgctg ctcagggtgg tgggcagtca gacctctgac 7140
agcatgggag aggaggacct gctgagcccc cctcaggaca ccagcacagg cctggaggag 7200
gtgatggagc agctgaacaa cagcttcccc agcagcaggg gcaggaacac ccctggcaag 7260
cccatgaggg aggacaccat gtag 7284
<210> 6
<211> 4768
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 5' WL1 opt
<400> 6
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt 60
ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact 120
aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta ttggccgtcc 180
gccctcggca ccatcctcac gacacccaaa tatggcgacg ggtgaggaat ggtggggagt 240
tatttttaga gcggtgagga aggtgggcag gcagcaggtg ttggcgctct aaaaataact 300
cccgggagtt atttttagag cggaggaatg gtggacaccc aaatatggcg acggttcctc 360
acccgtcgcc atatttgggt gtccgccctc ggccggggcc gcattcctgg gggccgggcg 420
gtgctcccgc ccgcctcgat aaaaggctcc ggggccggcg gcggcccacg agctacccgg 480
aggagcggga ggcgccaagc tctagagtaa gtatcaaggt tacaagacag gtttaaggag 540
accaatagaa actgggcttg tcgagacaga gaagactctt gcgtttctga taggcaccta 600
ttggtcttac tgacatccac tttgcctttc tctccacagg ccaccatgct gtggtgggag 660
gaggtggagg actgctatga gagagaggat gtgcagaaga agaccttcac caagtgggtg 720
aatgcccagt tcagcaagtt tggcaagcag cacattgaga acctgttctc tgacctgcag 780
gatggcagga gcctgctgga cctgctggag ggcctaacag gccagaagct gcccaaggag 840
aagggcagta ccagggtgca tgccctgaac aatgtgaaca aggccctgag ggtgctgcag 900
aacaacaatg tggacctagt gaatattggc agtacagata ttgtggatgg caaccacaag 960
ctgaccctgg gcctgatctg gaatattatc ctacactggc aggtgaagaa tgtgatgaag 1020
aatattatgg ctggcctgca gcagaccaac tctgagaaga tcctgctgag ctgggtgagg 1080
cagagtacca ggaactaccc ccaggtgaat gtgatcaact tcaccaccag ctggtctgat 1140
ggcctggccc tgaatgccct aatccacagc cacaggcctg acctgtttga ctggaactct 1200
gtggtgtgcc agcagtctgc cacccagagg ctagagcatg ccttcaatat tgccaggtac 1260
cagctgggga ttgagaagct gctggaccct gaggatgtgg atacaaccta ccctgacaag 1320
aagagcatcc tgatgtacat caccagcctg ttccaggtgc tgccccagca ggtgagcatt 1380
gaggccatcc aggaggtgga gatgctaccc aggcccccca aggtgaccaa ggaggagcac 1440
ttccagctac accaccagat gcactacagc cagcagatca cagtgagcct ggcccagggc 1500
tatgagagga ccagcagccc caagcccagg ttcaagagct atgcctacac ccaggctgcc 1560
tatgtgacca cctcagaccc caccaggagc cccttcccca gccagcacct ggaggcccct 1620
gaggacaaga gctttggcag cagcctgatg gagtctgagg tgaacctgga caggtaccag 1680
acagccctgg aggaggtgct gagctggctg ctgtctgctg aggataccct acaggcccag 1740
ggagagatca gcaatgatgt ggaggtggtg aaggaccagt tccataccca tgagggctac 1800
atgatggacc taacagccca ccagggcagg gtgggcaaca tcctgcagct gggcagcaag 1860
ctgattggca caggcaagct gtctgaggat gaggagacag aggtgcagga gcagatgaac 1920
ctgctgaaca gcaggtggga gtgcctgagg gtggccagta tggagaagca gagcaaccta 1980
cacagggtgc tgatggacct gcagaaccag aagctgaagg agctgaatga ctggctgacc 2040
aagacagagg agaggaccag gaagatggag gaggagcccc tgggccctga cctggaggac 2100
ctaaagaggc aggtgcagca gcacaaggtg ctgcaggagg acctggagca ggagcaggtg 2160
agggtgaaca gcctaaccca tatggtggtg gtggtggatg agagctcagg agaccatgct 2220
acagctgccc tggaggagca gctgaaggtg ctgggagaca ggtgggctaa tatctgcagg 2280
tggacagagg acaggtgggt gctgctgcag gatatcctgc tgaagtggca gaggctgaca 2340
gaggagcagt gcctgttctc tgcctggctg tcagagaagg aggatgctgt gaacaagatc 2400
catacaacag gcttcaagga ccagaatgag atgctgagca gcctgcagaa gctggctgtg 2460
ctgaaggctg acctggagaa gaagaagcag agtatgggca agctgtacag cctgaagcag 2520
gacctgctga gtaccctgaa gaacaagtct gtgacccaga agacagaggc ctggctggac 2580
aactttgcca ggtgctggga taacctggtg cagaagctgg agaagagtac agcccagatc 2640
agccaggctg tgacaacaac ccagcccagc ctgacccaga caacagtgat ggagacagtg 2700
acaacagtga caaccaggga gcagatccta gtgaagcatg cccaggagga gctgcccccc 2760
cccccccccc agaagaagag gcagattaca gtggatggcc tggagaagac agtgagcctc 2820
cagaaggacc tgtcagagat gcatgagtgg atgacccagg ctgaggagga gtacctggag 2880
agggactctg agtacaagac ccctgatgag ctgcagaagg ctgtggagga gatgaagagg 2940
gccaaggagg aggcccagca gaaggaggcc aaggtgaagc tgctgacaga gtctgtgaac 3000
tctgtgattg cccaggcccc ccctgtggcc caggaggccc tgaagaagga gctggagacc 3060
ctgacaacca actaccagtg gctgtgcacc aggctgaatg gcaagtgcaa gaccctagag 3120
gaggtgtggg cctgctggca tgagctgctg agctacctgg agaaggccaa caagtggctg 3180
aatgaggtgg agttcaagct gaagaccaca gagaacatcc ctggaggagc tgaggagatc 3240
tctgaggtgc tggacagcct ggagaaccta atgaggcact ctgaggacaa ccccaaccag 3300
atcaggatcc tggcccagac cctgacagat ggaggagtga tggatgagct gatcaatgag 3360
gagctggaga ccttcaacag caggtggagg gagctacatg aggaggctgt gaggaggcag 3420
aagctgctag agcagatcca tacagtgagg gaggagacaa tgatggtgat gacagaggat 3480
atgcccctgg agatcagcta tgtgcccagc acctacctaa cagagatcac ccatgtgagc 3540
caggccctgc tggaggtgga gcagctgctg aatgcccctg acctgtgtgc caaggacttt 3600
gaggacctgt tcaagcagga ggagagcctg aagaacatca aggacagcct gcagcagagc 3660
tcaggcagga ttgatattat ccacagcaag aagacagctg ccctgcagtc tgctacccct 3720
gtggagaggg tgaagctgca ggaggccctg agccagctgg acttccagtg ggagaaggtt 3780
aacaagatgt acaaggacag gcagggcagg tttgacaggt cagtggagaa gtggaggagg 3840
ttccactatg acatcaagat cttcaaccag tggctgacag aggctgagca gttcctaagg 3900
aagacccaga tccctgagaa ctgggagcat gccaagtaca agtggtacct taaggagctg 3960
caggatggca ttggccagag gcagacagtg gtgaggaccc tgaatgctac aggggaggag 4020
attatccagc agagcagcaa gacagatgcc agtatcctgc aggagaagct gggcagcctg 4080
aacctgaggt ggcaggaggt gtgcaagcag ctgtctgaca ggaagaagag gctggaggag 4140
cagcctgacc tggcccctgg cctgacaaca attggagcca gccctaccca gacagtgacc 4200
ctggtgaccc agcctgtggt gaccaaggag acagctatca gcaagctgga gatgcccagc 4260
agcctgatgc tggaggtgcc tgccctggct gacttcaaca gggcctggac agagctgaca 4320
gactggctga gcctgctgga ccaggtgatc aagagccaga gggtgatggt gggggacctg 4380
gaggatatca atgagatgat tatcaagcag aaggctacaa tgcaggacct ggagcagagg 4440
aggccccagc tggaggagct gattacagct gcccagaacc taaagaacaa gaccagcaac 4500
caggaggcca ggacaattat tacagacagg attgagagga tccagaacca gtgggatgag 4560
gtgcaggagc acctgcagtt tacgcgtttt gcggccgctt tacgtagata agtagcatgg 4620
cgggttaatc attaactaca aggaacccct agtgatggag ttggccactc cctctctgcg 4680
cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg 4740
ggcggcctca gtgagcgagc gagcgcgc 4768
<210> 7
<211> 4715
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 3' WL1 opt
<400> 7
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt 60
ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact 120
aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta tttgaacaag 180
atgtacaagg acaggcaggg caggtttgac aggtcagtgg agaagtggag gaggttccac 240
tatgacatca agatcttcaa ccagtggctg acagaggctg agcagttcct aaggaagacc 300
cagatccctg agaactggga gcatgccaag tacaagtggt accttaagga gctgcaggat 360
ggcattggcc agaggcagac agtggtgagg accctgaatg ctacagggga ggagattatc 420
cagcagagca gcaagacaga tgccagtatc ctgcaggaga agctgggcag cctgaacctg 480
aggtggcagg aggtgtgcaa gcagctgtct gacaggaaga agaggctgga ggagcagcct 540
gacctggccc ctggcctgac aacaattgga gccagcccta cccagacagt gaccctggtg 600
acccagcctg tggtgaccaa ggagacagct atcagcaagc tggagatgcc cagcagcctg 660
atgctggagg tgcctgccct ggctgacttc aacagggcct ggacagagct gacagactgg 720
ctgagcctgc tggaccaggt gatcaagagc cagagggtga tggtggggga cctggaggat 780
atcaatgaga tgattatcaa gcagaaggct acaatgcagg acctggagca gaggaggccc 840
cagctggagg agctgattac agctgcccag aacctaaaga acaagaccag caaccaggag 900
gccaggacaa ttattacaga caggattgag aggatccaga accagtggga tgaggtgcag 960
gagcacctgc agaacaggag gcagcagctg aatgagatgc tgaaggacag tacccagtgg 1020
ctggaggcca aggaggaggc tgagcaggtg ctgggccagg ccagggccaa gctggagagc 1080
tggaaggagg gcccctacac agtggatgcc atccagaaga agatcacaga gaccaagcag 1140
ctggccaagg acctgaggca gtggcagacc aatgtggatg tggccaatga cctggccctg 1200
aagctgctga gggactactc agctgatgac accaggaagg tgcacatgat cacagagaac 1260
atcaatgcca gctggaggag catccacaag agggtgtctg agagggaggc tgccctagag 1320
gagacccaca ggctgctgca gcagttcccc ctggacctgg agaagttcct ggcctggctg 1380
acagaggctg agacaacagc caatgtgctg caggatgcta ccaggaagga gaggctgctg 1440
gaggacagca agggagtgaa ggagctgatg aagcagtggc aggacctaca gggggagatt 1500
gaggcccata cagatgtgta ccacaacctg gatgagaaca gccagaagat cctgaggagc 1560
ctggagggct ctgatgatgc tgtgctgctg cagaggaggc tggacaatat gaacttcaag 1620
tggtctgagc tgaggaagaa gagcctaaat atcaggagcc acctggaggc cagctctgac 1680
cagtggaaga ggctacacct gagcctgcag gagctgctgg tgtggctgca gctgaaggat 1740
gatgagctaa gcaggcaggc ccctattggg ggagacttcc ctgctgtgca gaagcagaat 1800
gatgtgcaca gggccttcaa gagggagctg aagaccaagg agcctgtgat tatgagtacc 1860
ctggagacag tgaggatctt cctgacagag cagcccctgg agggcctgga gaagctgtac 1920
caggagccca gggagctgcc ccctgaggag agggcccaga atgtgaccag gctgctgagg 1980
aagcaggctg aggaggtgaa tacagagtgg gagaagctga acctacactc tgctgactgg 2040
cagaggaaga ttgatgagac cctggagagg ctgcaggagc tgcaggaggc tacagatgag 2100
ctggacctga agctgaggca ggctgaggtg atcaagggca gctggcagcc tgtgggggac 2160
ctgctgattg acagcctaca ggaccacctg gagaaggtga aggccctgag gggagagatt 2220
gcccccctaa aggagaatgt gagtcatgtg aatgacctgg ccaggcagct gaccaccctg 2280
ggcatccagc tgagccccta caacctgagc accctggagg acctgaacac caggtggaag 2340
ctgctgcagg tggctgtgga ggacagggtg aggcagctac atgaggccca cagggacttt 2400
ggccctgcca gccagcactt cctaagcacc tctgttcagg gcccctggga gagggccatc 2460
agccccaaca aggtgcccta ctacatcaac catgagaccc agaccacctg ctgggaccac 2520
cccaagatga cagagctgta ccagagcctg gctgacctga acaatgtgag gttctctgcc 2580
tacaggacag ccatgaagct gaggaggctg cagaaggccc tgtgcctgga cctgctgagc 2640
ctgtctgctg cctgtgatgc cctagaccag cacaacctga agcagaatga ccagcccatg 2700
gacatccttc agatcatcaa ctgcctgacc accatctatg acaggctaga gcaggagcac 2760
aacaacctgg tgaatgtgcc cctgtgtgtg gacatgtgcc tgaactggct gctgaatgtg 2820
tatgacacag gcaggacagg caggatcagg gtgctgagct tcaagacagg catcatcagc 2880
ctatgcaagg ctcacctgga ggacaagtac aggtacctgt tcaagcaggt ggccagcagc 2940
acaggcttct gtgaccagag gaggctgggc ctgctacttc atgacagcat tcagatcccc 3000
aggcagctgg gagaggtggc cagctttgga ggcagcaaca ttgagccctc agtgaggagc 3060
tgcttccagt ttgccaacaa caagcctgag attgaggctg ccctgttcct ggactggatg 3120
aggctggagc ctcagagcat ggtgtggctg cctgtgctgc acagggtggc tgctgctgag 3180
acagccaagc atcaggccaa gtgcaacatc tgcaaggagt gccccatcat tggcttcagg 3240
tacaggagcc taaagcactt caactatgac atctgccaga gctgcttctt ctcaggcagg 3300
gtggccaagg gccacaagat gcactacccc atggtggagt actgcacccc caccacctca 3360
ggagaggatg tgagggactt tgccaaggtg ctgaagaaca agttcaggac caagaggtac 3420
tttgccaagc accccaggat gggctacctg cctgtgcaga cagtgctgga gggggacaac 3480
atggagaccc ctgtgaccct gatcaacttc tggcctgtgg actcagcccc tgccagcagc 3540
ccccagctga gccatgatga cacccacagc aggattgagc actatgccag caggctggct 3600
gagatggaga acagcaatgg cagctaccta aatgacagca tcagccccaa tgagagcatt 3660
gatgatgagc acctgctaat ccagcactac tgccagagcc tgaaccagga cagccccctg 3720
agccagccca ggagccctgc ccagatcctg atcagcctgg agtctgagga gaggggggag 3780
ctggagagga tcctggctga cctggaggag gagaacagga accttcaggc tgagtatgac 3840
aggctaaagc agcagcatga gcacaagggc ctgagccccc tacccagccc ccctgagatg 3900
atgcccacca gcccccagag ccccagggat gctgagctga ttgctgaggc caagctgctg 3960
aggcagcaca agggcaggct ggaggccagg atgcagatcc tagaggacca caacaagcag 4020
ctggagagcc agctacacag gctgaggcag ctgctggagc agcctcaggc tgaggccaag 4080
gtgaatggca ccacagtgag cagccccagc accagcctgc agaggtctga cagcagtcag 4140
cccatgctgc tcagggtggt gggcagtcag acctctgaca gcatgggaga ggaggacctg 4200
ctgagccccc ctcaggacac cagcacaggc ctggaggagg tgatggagca gctgaacaac 4260
agcttcccca gcagcagggg caggaacacc cctggcaagc ccatgaggga ggacaccatg 4320
tagcagacat gataagatac attgatgagt ttggacaaac cacaactaga atgcagtgaa 4380
aaaaatgctt tatttgtgaa atttgtgatg ctattgcttt atttgtaacc attataagct 4440
gcaataaaca agttaacaac aacaattgca ttcattttat gtttcaggtt cagggggaga 4500
tgtgggaggt tttttaaagc aagtaaaacc tctacaaatg tggtatttac gtagataagt 4560
agcatggcgg gttaatcatt aactacaagg aacccctagt gatggagttg gccactccct 4620
ctctgcgcgc tcgctcgctc actgaggccg ggcgaccaaa ggtcgcccga cgcccgggct 4680
ttgcccgggc ggcctcagtg agcgagcgag cgcgc 4715
<210> 8
<211> 4787
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> WL1 wt 5'AAV
<400> 8
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt 60
ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact 120
aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta tttactagtt 180
ttctcgagtt tggccgtccg ccctcggcac catcctcacg acacccaaat atggcgacgg 240
gtgaggaatg gtggggagtt atttttagag cggtgaggaa ggtgggcagg cagcaggtgt 300
tggcgctcta aaaataactc ccgggagtta tttttagagc ggaggaatgg tggacaccca 360
aatatggcga cggttcctca cccgtcgcca tatttgggtg tccgccctcg gccggggccg 420
cattcctggg ggccgggcgg tgctcccgcc cgcctcgata aaaggctccg gggccggcgg 480
cggcccacga gctacccgga ggagcgggag gcgccaagct ctagagtaag tatcaaggtt 540
acaagacagg tttaaggaga ccaatagaaa ctgggcttgt cgagacagag aagactcttg 600
cgtttctgat aggcacctat tggtcttact gacatccact ttgcctttct ctccacaggc 660
caccatgctt tggtgggaag aagtagagga ctgttatgaa agagaagatg ttcaaaagaa 720
aacattcaca aaatgggtaa atgcacaatt ttctaagttt gggaagcagc atattgagaa 780
cctcttcagt gacctacagg atgggaggcg cctcctagac ctcctcgaag gcctgacagg 840
gcaaaaactg ccaaaagaaa aaggatccac aagagttcat gccctgaaca atgtcaacaa 900
ggcactgcgg gttttgcaga acaataatgt tgatttagtg aatattggaa gtactgacat 960
cgtagatgga aatcataaac tgactcttgg tttgatttgg aatataatcc tccactggca 1020
ggtcaaaaat gtaatgaaaa atatcatggc tggattgcaa caaaccaaca gtgaaaagat 1080
tctcctgagc tgggtccgac aatcaactcg taattatcca caggttaatg taatcaactt 1140
caccaccagc tggtctgatg gcctggcttt gaatgctctc atccatagtc ataggccaga 1200
cctatttgac tggaatagtg tggtttgcca gcagtcagcc acacaacgac tggaacatgc 1260
attcaacatc gccagatatc aattaggcat agagaaacta ctcgatcctg aagatgttga 1320
taccacctat ccagataaga agtccatctt aatgtacatc acatcactct tccaagtttt 1380
gcctcaacaa gtgagcattg aagccatcca ggaagtggaa atgttgccaa ggccacctaa 1440
agtgactaaa gaagaacatt ttcagttaca tcatcaaatg cactattctc aacagatcac 1500
ggtcagtcta gcacagggat atgagagaac ttcttcccct aagcctcgat tcaagagcta 1560
tgcctacaca caggctgctt atgtcaccac ctctgaccct acacggagcc catttccttc 1620
acagcatttg gaagctcctg aagacaagtc atttggcagt tcattgatgg agagtgaagt 1680
aaacctggac cgttatcaaa cagctttaga agaagtatta tcgtggcttc tttctgctga 1740
ggacacattg caagcacaag gagagatttc taatgatgtg gaagtggtga aagaccagtt 1800
tcatactcat gaggggtaca tgatggattt gacagcccat cagggccggg ttggtaatat 1860
tctacaattg ggaagtaagc tgattggaac aggaaaatta tcagaagatg aagaaactga 1920
agtacaagag cagatgaatc tcctaaattc aagatgggaa tgcctcaggg tagctagcat 1980
ggaaaaacaa agcaatttac atagagtttt aatggatctc cagaatcaga aactgaaaga 2040
gttgaatgac tggctaacaa aaacagaaga aagaacaagg aaaatggagg aagagcctct 2100
tggacctgat cttgaagacc taaaacgcca agtacaacaa cataaggtgc ttcaagaaga 2160
tctagaacaa gaacaagtca gggtcaattc tctcactcac atggtggtgg tagttgatga 2220
atctagtgga gatcacgcaa ctgctgcttt ggaagaacaa cttaaggtat tgggagatcg 2280
atgggcaaac atctgtagat ggacagaaga ccgctgggtt cttttacaag acatccttct 2340
caaatggcaa cgtcttactg aagaacagtg cctttttagt gcatggcttt cagaaaaaga 2400
agatgcagtg aacaagattc acacaactgg ctttaaagat caaaatgaaa tgttatcaag 2460
tcttcaaaaa ctggccgttt taaaagcgga tctagaaaag aaaaagcaat ccatgggcaa 2520
actgtattca ctcaaacaag atcttctttc aacactgaag aataagtcag tgacccagaa 2580
gacggaagca tggctggata actttgcccg gtgttgggat aatttagtcc aaaaacttga 2640
aaagagtaca gcacagattt cacaggctgt caccaccact cagccatcac taacacagac 2700
aactgtaatg gaaacagtaa ctacggtgac cacaagggaa cagatcctgg taaagcatgc 2760
tcaagaggaa cttccaccac cacctcccca aaagaagagg cagattactg tggatggttt 2820
ggagaaaact gtaagcctcc agaaagatct atcagagatg cacgaatgga tgacacaagc 2880
tgaagaagag tatcttgaga gagattttga atataaaact ccagatgaat tacagaaagc 2940
agttgaagag atgaagagag ctaaagaaga ggcccaacaa aaagaagcga aagtgaaact 3000
ccttactgag tctgtaaata gtgtcatagc tcaagctcca cctgtagcac aagaggcctt 3060
aaaaaaggaa cttgaaactc taaccaccaa ctaccagtgg ctctgcacta ggctgaatgg 3120
gaaatgcaag actttggaag aagtttgggc atgttggcat gagttattgt catacttgga 3180
gaaagcaaac aagtggctaa atgaagtaga atttaaactt aaaaccactg aaaacattcc 3240
tggcggagct gaggaaatct ctgaggtgct agattcactt gaaaatttga tgcgacattc 3300
agaggataac ccaaatcaga ttcgcatatt ggcacagacc ctaacagatg gcggagtcat 3360
ggatgagcta atcaatgagg aacttgagac atttaattct cgttggaggg aactacatga 3420
agaggctgta aggaggcaaa agttgcttga acagattcac actgtccgtg aagaaacgat 3480
gatggtgatg actgaagaca tgcctttgga aatttcttat gtgccttcta cttatttgac 3540
tgaaatcact catgtctcac aagccctatt agaagtggaa caacttctca atgctcctga 3600
cctctgtgct aaggactttg aagatctctt taagcaagag gagtctctga agaatataaa 3660
agatagtcta caacaaagct caggtcggat tgacattatt catagcaaga agacagcagc 3720
attgcaaagt gcaacgcctg tggaaagggt gaagctacag gaagctctct cccagcttga 3780
tttccaatgg gaaaaagtta acaaaatgta caaggaccga caagggcgat ttgacagatc 3840
tgttgagaaa tggcggcgtt ttcattatga tataaagata tttaatcagt ggctaacaga 3900
agctgaacag tttctcagaa agacacaaat tcctgagaat tgggaacatg ctaaatacaa 3960
atggtatctt aaggaactcc aggatggcat tgggcagcgg caaactgttg tcagaacatt 4020
gaatgcaact ggggaagaaa taattcagca atcctcaaaa acagatgcca gtattctaca 4080
ggaaaaattg ggaagcctga atctgcggtg gcaggaggtc tgcaaacagc tgtcagacag 4140
aaaaaagagg ctagaagagc agcctgacct agctcctgga ctgaccacta ttggagcctc 4200
tcctactcag actgttactc tggtgacaca acctgtggtt actaaggaaa ctgccatctc 4260
caaactagaa atgccatctt ccttgatgtt ggaggtacct gctctggcag atttcaaccg 4320
ggcttggaca gaacttaccg actggctttc tctgcttgat caagttataa aatcacagag 4380
ggtgatggtg ggtgaccttg aggatatcaa cgagatgatc atcaagcaga aggcaacaat 4440
gcaggatttg gaacagaggc gtccccagtt ggaagaactc attaccgctg cccaaaattt 4500
gaaaaacaag accagcaatc aagaggctag aacaatcatt acggatcgaa ttgaaagaat 4560
tcagaatcag tgggatgaag tacaagaaca ccttcagttt acgcgttttg cggccgcttt 4620
acgtagataa gtagcatggc gggttaatca ttaactacaa ggaaccccta gtgatggagt 4680
tggccactcc ctctctgcgc gctcgctcgc tcactgaggc cgggcgacca aaggtcgccc 4740
gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg agcgcgc 4787
<210> 9
<211> 4753
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> WL1 wt 3'AAV
<400> 9
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt 60
ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact 120
aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta tttgcggccg 180
ctttacgcgt ttttaacaaa atgtacaagg accgacaagg gcgatttgac agatctgttg 240
agaaatggcg gcgttttcat tatgatataa agatatttaa tcagtggcta acagaagctg 300
aacagtttct cagaaagaca caaattcctg agaattggga acatgctaaa tacaaatggt 360
atcttaagga actccaggat ggcattgggc agcggcaaac tgttgtcaga acattgaatg 420
caactgggga agaaataatt cagcaatcct caaaaacaga tgccagtatt ctacaggaaa 480
aattgggaag cctgaatctg cggtggcagg aggtctgcaa acagctgtca gacagaaaaa 540
agaggctaga agagcagcct gacctagctc ctggactgac cactattgga gcctctccta 600
ctcagactgt tactctggtg acacaacctg tggttactaa ggaaactgcc atctccaaac 660
tagaaatgcc atcttccttg atgttggagg tacctgctct ggcagatttc aaccgggctt 720
ggacagaact taccgactgg ctttctctgc ttgatcaagt tataaaatca cagagggtga 780
tggtgggtga ccttgaggat atcaacgaga tgatcatcaa gcagaaggca acaatgcagg 840
atttggaaca gaggcgtccc cagttggaag aactcattac cgctgcccaa aatttgaaaa 900
acaagaccag caatcaagag gctagaacaa tcattacgga tcgaattgaa agaattcaga 960
atcagtggga tgaagtacaa gaacaccttc agaaccggcg ccaacagttg aatgaaatgt 1020
taaaggattc aacacaatgg ctggaagcta aggaagaagc tgagcaggtc ttaggacagg 1080
ccagagccaa gcttgagtca tggaaggagg gtccctatac agtagatgca atccaaaaga 1140
aaatcacaga aaccaagcag ttggccaaag acctccgcca gtggcagaca aatgtagatg 1200
tggcaaatga cttggccctg aaacttctcc gggattattc tgcagatgat accagaaaag 1260
tccacatgat aacagagaat atcaatgcct cttggagaag cattcataaa agggtgagtg 1320
agcgagaggc tgctttggaa gaaactcata gattactgca acagttcccc ctggacctgg 1380
aaaagtttct tgcctggctt acagaagctg aaacaactgc caatgtccta caggatgcta 1440
cccgtaagga aaggctccta gaagactcca agggagtaaa agagctgatg aaacaatggc 1500
aagacctcca aggtgaaatt gaagctcaca cagatgttta tcacaacctg gatgaaaaca 1560
gccaaaaaat cctgagatcc ctggaaggtt ccgatgatgc agtcctgtta caaagacgtt 1620
tggataacat gaacttcaag tggagtgaac ttcggaaaaa gtctctcaac attaggtccc 1680
atttggaagc cagttctgac cagtggaagc gtctgcacct ttctctgcag gaacttctgg 1740
tgtggctaca gctgaaagat gatgaattaa gccggcaggc acctattgga ggcgactttc 1800
cagcagttca gaagcagaac gatgtacata gggccttcaa gagggaattg aaaactaaag 1860
aacctgtaat catgagtact cttgagactg tacgaatatt tctgacagag cagcctttgg 1920
aaggactaga gaaactctac caggagccca gagagctgcc tcctgaggag agagcccaga 1980
atgtcactcg gcttctacga aagcaggctg aggaggtcaa tactgagtgg gaaaaattga 2040
acctgcactc cgctgactgg cagagaaaaa tagatgagac ccttgaaaga ctccaggaac 2100
ttcaagaggc cacggatgag ctggacctca agctgcgcca agctgaggtg atcaagggat 2160
cctggcagcc cgtgggcgat ctcctcattg actctctcca agatcacctc gagaaagtca 2220
aggcacttcg aggagaaatt gcgcctctga aagagaacgt gagccacgtc aatgaccttg 2280
ctcgccagct taccactttg ggcattcagc tctcaccgta taacctcagc actctggaag 2340
acctgaacac cagatggaag cttctgcagg tggccgtcga ggaccgagtc aggcagctgc 2400
atgaagccca cagggacttt ggtccagcat ctcagcactt tctttccacg tctgtccagg 2460
gtccctggga gagagccatc tcgccaaaca aagtgcccta ctatatcaac cacgagactc 2520
aaacaacttg ctgggaccat cccaaaatga cagagctcta ccagtcttta gctgacctga 2580
ataatgtcag attctcagct tataggactg ccatgaaact ccgaagactg cagaaggccc 2640
tttgcttgga tctcttgagc ctgtcagctg catgtgatgc cttggaccag cacaacctca 2700
agcaaaatga ccagcccatg gatatcctgc agattattaa ttgtttgacc actatttatg 2760
accgcctgga gcaagagcac aacaatttgg tcaacgtccc tctctgcgtg gatatgtgtc 2820
tgaactggct gctgaatgtt tatgatacgg gacgaacagg gaggatccgt gtcctgtctt 2880
ttaaaactgg catcatttcc ctgtgtaaag cacatttgga agacaagtac agataccttt 2940
tcaagcaagt ggcaagttca acaggatttt gtgaccagcg caggctgggc ctccttctgc 3000
atgattctat ccaaattcca agacagttgg gtgaagttgc atcctttggg ggcagtaaca 3060
ttgagccaag tgtccggagc tgcttccaat ttgctaataa taagccagag atcgaagcgg 3120
ccctcttcct agactggatg agactggaac cccagtccat ggtgtggctg cccgtcctgc 3180
acagagtggc tgctgcagaa actgccaagc atcaggccaa atgtaacatc tgcaaagagt 3240
gtccaatcat tggattcagg tacaggagtc taaagcactt taattatgac atctgccaaa 3300
gctgcttttt ttctggtcga gttgcaaaag gccataaaat gcactatccc atggtggaat 3360
attgcactcc gactacatca ggagaagatg ttcgagactt tgccaaggta ctaaaaaaca 3420
aatttcgaac caaaaggtat tttgcgaagc atccccgaat gggctacctg ccagtgcaga 3480
ctgtcttaga gggggacaac atggaaactc ccgttactct gatcaacttc tggccagtag 3540
attctgcgcc tgcctcgtcc cctcagcttt cacacgatga tactcattca cgcattgaac 3600
attatgctag caggctagca gaaatggaaa acagcaatgg atcttatcta aatgatagca 3660
tctctcctaa tgagagcata gatgatgaac atttgttaat ccagcattac tgccaaagtt 3720
tgaaccagga ctcccccctg agccagcctc gtagtcctgc ccagatcttg atttccttag 3780
agagtgagga aagaggggag ctagagagaa tcctagcaga tcttgaggaa gaaaacagga 3840
atctgcaagc agaatatgac cgtctaaagc agcagcacga acataaaggc ctgtccccac 3900
tgccgtcccc tcctgaaatg atgcccacct ctccccagag tccccgggat gctgagctca 3960
ttgctgaggc caagctactg cgtcaacaca aaggccgcct ggaagccagg atgcaaatcc 4020
tggaagacca caataaacag ctggagtcac agttacacag gctaaggcag ctgctggagc 4080
aaccccaggc agaggccaaa gtgaatggca caacggtgtc ctctccttct acctctctac 4140
agaggtccga cagcagtcag cctatgctgc tccgagtggt tggcagtcaa acttcggact 4200
ccatgggtga ggaagatctt ctcagtcctc cccaggacac aagcacaggg ttagaggagg 4260
tgatggagca actcaacaac tccttcccta gttcaagagg aagaaatacc cctggaaagc 4320
caatgagaga ggacacaatg tagcagacat gataagatac attgatgagt ttggacaaac 4380
cacaactaga atgcagtgaa aaaaatgctt tatttgtgaa atttgtgatg ctattgcttt 4440
atttgtaacc attataagct gcaataaaca agttaacaac aacaattgca ttcattttat 4500
gtttcaggtt cagggggaga tgtgggaggt tttttaaagc aagtaaaacc tctacaaatg 4560
tggtattgtc gactttacta gtttttacgt agataagtag catggcgggt taatcattaa 4620
ctacaaggaa cccctagtga tggagttggc cactccctct ctgcgcgctc gctcgctcac 4680
tgaggccggg cgaccaaagg tcgcccgacg cccgggcttt gcccgggcgg cctcagtgag 4740
cgagcgagcg cgc 4753
<210> 10
<211> 800
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> HR WL1部分OPT1
<400> 10
ttaacaagat gtacaaggac cggcaggggc ggttcgaccg gtcggtggag aagtggcggc 60
ggttccacta cgacatcaag atcttcaacc agtggctgac ggaggcggag cagttcctgc 120
ggaagacgca gatcccggag aactgggagc acgcgaagta caagtggtac ctgaaggagc 180
tgcaggacgg catcgggcag cggcagacgg tggtccggac gctgaacgcg acgggggagg 240
agatcatcca gcagtcctcg aagacggacg cctcgatcct gcaggagaag ctggggagcc 300
tgaacctgcg gtggcaggag gtctgcaagc agctgtcgga ccggaagaag cggctggagg 360
agcagccgga cctcgcgccg gggctgacca cgatcggggc ctcgccgacg cagacggtga 420
cgctggtgac gcagccggtg gtgacgaagg agacggccat ctccaagctg gagatgccgt 480
cgtccctgat gctggaggtg ccggcgctgg cggacttcaa ccgggcgtgg acggagctga 540
ccgactggct gtcgctgctg gaccaggtga tcaagtcgca gagggtgatg gtgggcgacc 600
tggaggacat caacgagatg atcatcaagc agaaggcgac gatgcaggac ctggagcagc 660
ggcggcccca gctggaggag ctgatcaccg cggcccagaa cctgaagaac aagacctcga 720
accaggaggc gcggacgatc atcacggacc ggatcgagcg gatccagaac cagtgggacg 780
aggtgcagga gcacctgcag 800
<210> 11
<211> 800
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> HR WL1部分OPT2
<400> 11
ttaacaagat gtacaaggac aggcagggga ggtttgacag gtctgtggag aagtggagga 60
ggttccacta tgacatcaag atcttcaacc agtggctgac agaggctgag cagttcctca 120
ggaagaccca gatccctgag aactgggagc atgccaagta caagtggtac ctgaaggagc 180
tccaggatgg cattgggcag aggcagactg tggtcaggac cctgaatgcc actggggagg 240
agatcatcca gcagtcctcc aagacagatg ccagcatcct gcaggagaag ctggggagcc 300
tgaacctgag gtggcaggag gtctgcaagc agctgtcaga caggaagaag aggctggagg 360
agcagcctga cctggcccct ggcctgacca ccattggggc ctcccccacc cagactgtga 420
ccctggtgac ccagcctgtg gtgaccaagg agactgccat ctccaagctg gagatgccct 480
cctccctgat gctggaggtg cctgccctgg cagacttcaa cagggcctgg acagagctga 540
ctgactggct gtccctgctg gaccaggtga tcaagtccca gagggtgatg gtgggggacc 600
tggaggacat caatgagatg atcatcaagc agaaggccac catgcaggac ctggagcaga 660
ggaggcccca gctggaggag ctcatcactg ctgcccagaa cctgaagaac aagaccagca 720
accaggaggc caggaccatc atcactgaca ggattgagag gatccagaac cagtgggatg 780
aggtgcagga gcacctgcag 800
<210> 12
<211> 800
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> HR WL1 OPT
<400> 12
ttaacaagat gtacaaggac aggcagggca ggtttgacag gtcagtggag aagtggagga 60
ggttccacta tgacatcaag atcttcaacc agtggctgac agaggctgag cagttcctaa 120
ggaagaccca gatccctgag aactgggagc atgccaagta caagtggtac cttaaggagc 180
tgcaggatgg cattggccag aggcagacag tggtgaggac cctgaatgct acaggggagg 240
agattatcca gcagagcagc aagacagatg ccagtatcct gcaggagaag ctgggcagcc 300
tgaacctgag gtggcaggag gtgtgcaagc agctgtctga caggaagaag aggctggagg 360
agcagcctga cctggcccct ggcctgacaa caattggagc cagccctacc cagacagtga 420
ccctggtgac ccagcctgtg gtgaccaagg agacagctat cagcaagctg gagatgccca 480
gcagcctgat gctggaggtg cctgccctgg ctgacttcaa cagggcctgg acagagctga 540
cagactggct gagcctgctg gaccaggtga tcaagagcca gagggtgatg gtgggggacc 600
tggaggatat caatgagatg attatcaagc agaaggctac aatgcaggac ctggagcaga 660
ggaggcccca gctggaggag ctgattacag ctgcccagaa cctaaagaac aagaccagca 720
accaggaggc caggacaatt attacagaca ggattgagag gatccagaac cagtgggatg 780
aggtgcagga gcacctgcag 800
<210> 13
<211> 4687
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> WL2 wt 5'AAV
<400> 13
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt 60
ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact 120
aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta ttggccgtcc 180
gccctcggca ccatcctcac gacacccaaa tatggcgacg ggtgaggaat ggtggggagt 240
tatttttaga gcggtgagga aggtgggcag gcagcaggtg ttggcgctct aaaaataact 300
cccgggagtt atttttagag cggaggaatg gtggacaccc aaatatggcg acggttcctc 360
acccgtcgcc atatttgggt gtccgccctc ggccggggcc gcattcctgg gggccgggcg 420
gtgctcccgc ccgcctcgat aaaaggctcc ggggccggcg gcggcccacg agctacccgg 480
aggagcggga ggcgccaagc tctagagtaa gtatcaaggt tacaagacag gtttaaggag 540
accaatagaa actgggcttg tcgagacaga gaagactctt gcgtttctga taggcaccta 600
ttggtcttac tgacatccac tttgcctttc tctccacagg ccaccatgct ttggtgggaa 660
gaagtagagg actgttatga aagagaagat gttcaaaaga aaacattcac aaaatgggta 720
aatgcacaat tttctaagtt tgggaagcag catattgaga acctcttcag tgacctacag 780
gatgggaggc gcctcctaga cctcctcgaa ggcctgacag ggcaaaaact gccaaaagaa 840
aaaggatcca caagagttca tgccctgaac aatgtcaaca aggcactgcg ggttttgcag 900
aacaataatg ttgatttagt gaatattgga agtactgaca tcgtagatgg aaatcataaa 960
ctgactcttg gtttgatttg gaatataatc ctccactggc aggtcaaaaa tgtaatgaaa 1020
aatatcatgg ctggattgca acaaaccaac agtgaaaaga ttctcctgag ctgggtccga 1080
caatcaactc gtaattatcc acaggttaat gtaatcaact tcaccaccag ctggtctgat 1140
ggcctggctt tgaatgctct catccatagt cataggccag acctatttga ctggaatagt 1200
gtggtttgcc agcagtcagc cacacaacga ctggaacatg cattcaacat cgccagatat 1260
caattaggca tagagaaact actcgatcct gaagatgttg ataccaccta tccagataag 1320
aagtccatct taatgtacat cacatcactc ttccaagttt tgcctcaaca agtgagcatt 1380
gaagccatcc aggaagtgga aatgttgcca aggccaccta aagtgactaa agaagaacat 1440
tttcagttac atcatcaaat gcactattct caacagatca cggtcagtct agcacaggga 1500
tatgagagaa cttcttcccc taagcctcga ttcaagagct atgcctacac acaggctgct 1560
tatgtcacca cctctgaccc tacacggagc ccatttcctt cacagcattt ggaagctcct 1620
gaagacaagt catttggcag ttcattgatg gagagtgaag taaacctgga ccgttatcaa 1680
acagctttag aagaagtatt atcgtggctt ctttctgctg aggacacatt gcaagcacaa 1740
ggagagattt ctaatgatgt ggaagtggtg aaagaccagt ttcatactca tgaggggtac 1800
atgatggatt tgacagccca tcagggccgg gttggtaata ttctacaatt gggaagtaag 1860
ctgattggaa caggaaaatt atcagaagat gaagaaactg aagtacaaga gcagatgaat 1920
ctcctaaatt caagatggga atgcctcagg gtagctagca tggaaaaaca aagcaattta 1980
catagagttt taatggatct ccagaatcag aaactgaaag agttgaatga ctggctaaca 2040
aaaacagaag aaagaacaag gaaaatggag gaagagcctc ttggacctga tcttgaagac 2100
ctaaaacgcc aagtacaaca acataaggtg cttcaagaag atctagaaca agaacaagtc 2160
agggtcaatt ctctcactca catggtggtg gtagttgatg aatctagtgg agatcacgca 2220
actgctgctt tggaagaaca acttaaggta ttgggagatc gatgggcaaa catctgtaga 2280
tggacagaag accgctgggt tcttttacaa gacatccttc tcaaatggca acgtcttact 2340
gaagaacagt gcctttttag tgcatggctt tcagaaaaag aagatgcagt gaacaagatt 2400
cacacaactg gctttaaaga tcaaaatgaa atgttatcaa gtcttcaaaa actggccgtt 2460
ttaaaagcgg atctagaaaa gaaaaagcaa tccatgggca aactgtattc actcaaacaa 2520
gatcttcttt caacactgaa gaataagtca gtgacccaga agacggaagc atggctggat 2580
aactttgccc ggtgttggga taatttagtc caaaaacttg aaaagagtac agcacagatt 2640
tcacaggctg tcaccaccac tcagccatca ctaacacaga caactgtaat ggaaacagta 2700
actacggtga ccacaaggga acagatcctg gtaaagcatg ctcaagagga acttccacca 2760
ccacctcccc aaaagaagag gcagattact gtggatggtt tggagaaaac tgtaagcctc 2820
cagaaagatc tatcagagat gcacgaatgg atgacacaag ctgaagaaga gtatcttgag 2880
agagattttg aatataaaac tccagatgaa ttacagaaag cagttgaaga gatgaagaga 2940
gctaaagaag aggcccaaca aaaagaagcg aaagtgaaac tccttactga gtctgtaaat 3000
agtgtcatag ctcaagctcc acctgtagca caagaggcct taaaaaagga acttgaaact 3060
ctaaccacca actaccagtg gctctgcact aggctgaatg ggaaatgcaa gactttggaa 3120
gaagtttggg catgttggca tgagttattg tcatacttgg agaaagcaaa caagtggcta 3180
aatgaagtag aatttaaact taaaaccact gaaaacattc ctggcggagc tgaggaaatc 3240
tctgaggtgc tagattcact tgaaaatttg atgcgacatt cagaggataa cccaaatcag 3300
attcgcatat tggcacagac cctaacagat ggcggagtca tggatgagct aatcaatgag 3360
gaacttgaga catttaattc tcgttggagg gaactacatg aagaggctgt aaggaggcaa 3420
aagttgcttg aacagccttt gaaggaattg gagcagttta actcagatat acaaaaattg 3480
cttgaaccac tggaggctga aattcagcag ggggtgaatc tgaaagagga agacttcaat 3540
aaagatatga atgaagacaa tgagggtact gtaaaagaat tgttgcaaag aggagacaac 3600
ttacaacaaa gaatcacaga tgagagaaag agagaggaaa taaagataaa acagcagctg 3660
ttacagacaa aacataacgc gttcaaggat ttgaggtctc aaagaagaaa aaaggctcta 3720
gaaatttctc atcagtggta tcagtacaag aggcaggctg atgatctcct gaaatgcttg 3780
gatgacattg aaaaaaaatt agccagccta cctgagccca gagatgaaag gaaaataaag 3840
gaaattgatc gggaattgca gaagaagaaa gaggagctga atgcagtgcg taggcaagct 3900
gagggcttgt ctgaggatgg ggccgcaatg gcagtggagc caactcagat ccagctcagc 3960
aagcgctggc gggaaattga gagcaaattt gctcagtttc gaagactcaa ctttgcacaa 4020
attcacactg tccgtgaaga aacgatgatg gtgatgactg aagacatgcc tttggaaatt 4080
tcttatgtgc cttctactta tttgactgaa atcactcatg tctcacaagc cctattagaa 4140
gtggaacaac ttctcaatgc tcctgacctc tgtgctaagg actttgaaga tctctttaag 4200
caagaggagt ctctgaagaa tataaaagat agtctacaac aaagctcagg tcggattgac 4260
attattcata gcaagaagac agcagcattg caaagtgcaa cgcctgtgga aagggtgaag 4320
ctacaggaag ctctctccca gcttgatttc caatgggaaa aagttaacaa aatgtacaag 4380
gaccgacaag ggcgatttga cagatctgtt gagaaatggc ggcgttttca ttatgatata 4440
aagatattta atcagtggct aacagaagct gaacagtttc tcagaaagac acaaattcct 4500
accggttttg cggccgcttt acgtagataa gtagcatggc gggttaatca ttaactacaa 4560
ggaaccccta gtgatggagt tggccactcc ctctctgcgc gctcgctcgc tcactgaggc 4620
cgggcgacca aaggtcgccc gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg 4680
agcgcgc 4687
<210> 14
<211> 4739
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> WL2 wt 3'AAV
<400> 14
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt 60
ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact 120
aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta ttacgcgttt 180
taaagaagaa aaaaggctct agaaatttct catcagtggt atcagtacaa gaggcaggct 240
gatgatctcc tgaaatgctt ggatgacatt gaaaaaaaat tagccagcct acctgagccc 300
agagatgaaa ggaaaataaa ggaaattgat cgggaattgc agaagaagaa agaggagctg 360
aatgcagtgc gtaggcaagc tgagggcttg tctgaggatg gggccgcaat ggcagtggag 420
ccaactcaga tccagctcag caagcgctgg cgggaaattg agagcaaatt tgctcagttt 480
cgaagactca actttgcaca aattcacact gtccgtgaag aaacgatgat ggtgatgact 540
gaagacatgc ctttggaaat ttcttatgtg ccttctactt atttgactga aatcactcat 600
gtctcacaag ccctattaga agtggaacaa cttctcaatg ctcctgacct ctgtgctaag 660
gactttgaag atctctttaa gcaagaggag tctctgaaga atataaaaga tagtctacaa 720
caaagctcag gtcggattga cattattcat agcaagaaga cagcagcatt gcaaagtgca 780
acgcctgtgg aaagggtgaa gctacaggaa gctctctccc agcttgattt ccaatgggaa 840
aaagttaaca aaatgtacaa ggaccgacaa gggcgatttg acagatctgt tgagaaatgg 900
cggcgttttc attatgatat aaagatattt aatcagtggc taacagaagc tgaacagttt 960
ctcagaaaga cacaaattcc tgagaattgg gaacatgcta aatacaaatg gtatcttaag 1020
gaactccagg atggcattgg gcagcggcaa actgttgtca gaacattgaa tgcaactggg 1080
gaagaaataa ttcagcaatc ctcaaaaaca gatgccagta ttctacagga aaaattggga 1140
agcctgaatc tgcggtggca ggaggtctgc aaacagctgt cagacagaaa aaagaggcta 1200
gaagagcagc ctgacctagc tcctggactg accactattg gagcctctcc tactcagact 1260
gttactctgg tgacacaacc tgtggttact aaggaaactg ccatctccaa actagaaatg 1320
ccatcttcct tgatgttgga ggtaccaact catagattac tgcaacagtt ccccctggac 1380
ctggaaaagt ttcttgcctg gcttacagaa gctgaaacaa ctgccaatgt cctacaggat 1440
gctacccgta aggaaaggct cctagaagac tccaagggag taaaagagct gatgaaacaa 1500
tggcaagacc tccaaggtga aattgaagct cacacagatg tttatcacaa cctggatgaa 1560
aacagccaaa aaatcctgag atccctggaa ggttccgatg atgcagtcct gttacaaaga 1620
cgtttggata acatgaactt caagtggagt gaacttcgga aaaagtctct caacattagg 1680
tcccatttgg aagccagttc tgaccagtgg aagcgtctgc acctttctct gcaggaactt 1740
ctggtgtggc tacagctgaa agatgatgaa ttaagccggc aggcacctat tggaggcgac 1800
tttccagcag ttcagaagca gaacgatgta catagggcct tcaagaggga attgaaaact 1860
aaagaacctg taatcatgag tactcttgag actgtacgaa tatttctgac agagcagcct 1920
ttggaaggac tagagaaact ctaccaggag cccagagagc tgcctcctga ggagagagcc 1980
cagaatgtca ctcggcttct acgaaagcag gctgaggagg tcaatactga gtgggaaaaa 2040
ttgaacctgc actccgctga ctggcagaga aaaatagatg agacccttga aagactccag 2100
gaacttcaag aggccacgga tgagctggac ctcaagctgc gccaagctga ggtgatcaag 2160
ggatcctggc agcccgtggg cgatctcctc attgactctc tccaagatca cctcgagaaa 2220
gtcaaggcac ttcgaggaga aattgcgcct ctgaaagaga acgtgagcca cgtcaatgac 2280
cttgctcgcc agcttaccac tttgggcatt cagctctcac cgtataacct cagcactctg 2340
gaagacctga acaccagatg gaagcttctg caggtggccg tcgaggaccg agtcaggcag 2400
ctgcatgaag cccacaggga ctttggtcca gcatctcagc actttctttc cacgtctgtc 2460
cagggtccct gggagagagc catctcgcca aacaaagtgc cctactatat caaccacgag 2520
actcaaacaa cttgctggga ccatcccaaa atgacagagc tctaccagtc tttagctgac 2580
ctgaataatg tcagattctc agcttatagg actgccatga aactccgaag actgcagaag 2640
gccctttgct tggatctctt gagcctgtca gctgcatgtg atgccttgga ccagcacaac 2700
ctcaagcaaa atgaccagcc catggatatc ctgcagatta ttaattgttt gaccactatt 2760
tatgaccgcc tggagcaaga gcacaacaat ttggtcaacg tccctctctg cgtggatatg 2820
tgtctgaact ggctgctgaa tgtttatgat acgggacgaa cagggaggat ccgtgtcctg 2880
tcttttaaaa ctggcatcat ttccctgtgt aaagcacatt tggaagacaa gtacagatac 2940
cttttcaagc aagtggcaag ttcaacagga ttttgtgacc agcgcaggct gggcctcctt 3000
ctgcatgatt ctatccaaat tccaagacag ttgggtgaag ttgcatcctt tgggggcagt 3060
aacattgagc caagtgtccg gagctgcttc caatttgcta ataataagcc agagatcgaa 3120
gcggccctct tcctagactg gatgagactg gaaccccagt ccatggtgtg gctgcccgtc 3180
ctgcacagag tggctgctgc agaaactgcc aagcatcagg ccaaatgtaa catctgcaaa 3240
gagtgtccaa tcattggatt caggtacagg agtctaaagc actttaatta tgacatctgc 3300
caaagctgct ttttttctgg tcgagttgca aaaggccata aaatgcacta tcccatggtg 3360
gaatattgca ctccgactac atcaggagaa gatgttcgag actttgccaa ggtactaaaa 3420
aacaaatttc gaaccaaaag gtattttgcg aagcatcccc gaatgggcta cctgccagtg 3480
cagactgtct tagaggggga caacatggaa actcccgtta ctctgatcaa cttctggcca 3540
gtagattctg cgcctgcctc gtcccctcag ctttcacacg atgatactca ttcacgcatt 3600
gaacattatg ctagcaggct agcagaaatg gaaaacagca atggatctta tctaaatgat 3660
agcatctctc ctaatgagag catagatgat gaacatttgt taatccagca ttactgccaa 3720
agtttgaacc aggactcccc cctgagccag cctcgtagtc ctgcccagat cttgatttcc 3780
ttagagagtg aggaaagagg ggagctagag agaatcctag cagatcttga ggaagaaaac 3840
aggaatctgc aagcagaata tgaccgtcta aagcagcagc acgaacataa aggcctgtcc 3900
ccactgccgt cccctcctga aatgatgccc acctctcccc agagtccccg ggatgctgag 3960
ctcattgctg aggccaagct actgcgtcaa cacaaaggcc gcctggaagc caggatgcaa 4020
atcctggaag accacaataa acagctggag tcacagttac acaggctaag gcagctgctg 4080
gagcaacccc aggcagaggc caaagtgaat ggcacaacgg tgtcctctcc ttctacctct 4140
ctacagaggt ccgacagcag tcagcctatg ctgctccgag tggttggcag tcaaacttcg 4200
gactccatgg gtgaggaaga tcttctcagt cctccccagg acacaagcac agggttagag 4260
gaggtgatgg agcaactcaa caactccttc cctagttcaa gaggaagaaa tacccctgga 4320
aagccaatga gagaggacac aatgtagcag acatgataag atacattgat gagtttggac 4380
aaaccacaac tagaatgcag tgaaaaaaat gctttatttg tgaaatttgt gatgctattg 4440
ctttatttgt aaccattata agctgcaata aacaagttaa caacaacaat tgcattcatt 4500
ttatgtttca ggttcagggg gagatgtggg aggtttttta aagcaagtaa aacctctaca 4560
aatgtggtat ttacgtagat aagtagcatg gcgggttaat cattaactac aaggaacccc 4620
tagtgatgga gttggccact ccctctctgc gcgctcgctc gctcactgag gccgggcgac 4680
caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc agtgagcgag cgagcgcgc 4739
<210> 15
<211> 800
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> HR WL2部分OPT1
<400> 15
agcggcggaa gaaggccctg gagatctccc accagtggta ccagtacaag cggcaggccg 60
acgacctcct gaagtgcctg gacgacatcg agaagaagct ggccagcctg ccggagcccc 120
gggacgagcg caagatcaag gagatcgacc gggagctgca gaagaagaag gaggagctga 180
acgccgtgcg ccggcaggcc gagggcctgt ccgaggacgg ggccgccatg gccgtggagc 240
ccacccagat ccagctcagc aagcgctggc gggagatcga gagcaagttc gcccagttcc 300
ggcgcctcaa cttcgcccag atccacaccg tccgggagga gacgatgatg gtgatgaccg 360
aggacatgcc cctggagatc tcctacgtgc cctccaccta cctgaccgag atcacccacg 420
tctcccaggc cctgctggag gtggagcagc tcctcaacgc ccccgacctc tgcgccaagg 480
acttcgagga cctcttcaag caggaggagt ccctgaagaa catcaaggac agcctgcagc 540
agagctccgg gcggatcgac atcatccaca gcaagaagac cgccgccctg cagagcgcca 600
cgcccgtgga gcgggtgaag ctgcaggagg ccctctccca gctggacttc cagtgggaga 660
aggtgaacaa gatgtacaag gaccggcagg ggcggttcga ccgctccgtg gagaagtggc 720
ggcgcttcca ctacgacatc aagatcttca accagtggct gaccgaggcc gagcagttcc 780
tccggaagac ccagatcccc 800
<210> 16
<211> 800
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> HR WL2部分OPT2
<400> 16
agaggaggaa gaaggccctg gagatcagcc accagtggta ccagtacaag aggcaggctg 60
atgacctcct gaagtgcctg gatgacattg agaagaagct ggccagcctc cctgagccca 120
gggatgagag gaagatcaag gagattgaca gggagctgca gaagaagaag gaggagctga 180
atgcagtgag gaggcaggct gagggcctgt ctgaggatgg ggctgccatg gcagtggagc 240
ccacccagat ccagctcagc aagaggtgga gggagattga gagcaagttt gcccagttca 300
ggaggctcaa ctttgcccag atccacactg tcagggagga gaccatgatg gtgatgactg 360
aggacatgcc cctggagatc agctatgtgc ccagcaccta cctgactgag atcacccatg 420
tcagccaggc cctcctggag gtggagcagc tcctcaatgc ccctgacctc tgtgccaagg 480
actttgagga cctcttcaag caggaggaga gcctgaagaa catcaaggac agcctgcagc 540
agagctcagg caggattgac atcatccaca gcaagaagac agcagccctg cagagtgcca 600
cccctgtgga gagggtgaag ctgcaggagg ccctcagcca gctggacttc cagtgggaga 660
aggtgaacaa gatgtacaag gacaggcagg ggaggtttga caggagtgtg gagaagtgga 720
ggaggttcca ctatgacatc aagatcttca accagtggct gacagaggct gagcagttcc 780
tcaggaagac ccagatccct 800
<210> 17
<211> 800
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> HR WL2 OPT
<400> 17
agaggaggaa gaaggccctg gagatcagcc accagtggta ccagtacaag aggcaggctg 60
atgacctgct gaagtgcctg gatgacattg agaagaagct ggccagcctg cctgagccca 120
gggatgagag gaagatcaag gagattgaca gggagctgca gaagaagaag gaggagctga 180
atgctgtgag gaggcaggct gagggcctgt ctgaggatgg ggctgccatg gctgtggagc 240
ccacccagat ccagctgagc aagaggtgga gggagattga gagcaagttt gcccagttca 300
ggaggctgaa ctttgcccag atccacactg tgagggagga gaccatgatg gtgatgacag 360
aggacatgcc cctggagatc agctatgtgc ccagcaccta cctgactgag atcacccatg 420
tgagccaggc cctgctggag gtggagcagc tgctgaatgc ccctgacctg tgtgccaagg 480
actttgagga cctgttcaag caggaggaga gcctgaagaa catcaaggac agcctgcagc 540
agagctctgg caggattgac atcatccaca gcaagaagac tgctgccctg cagtctgcca 600
cccctgtgga gagggtgaag ctgcaggagg ccctgagcca gctggacttc cagtgggaga 660
aggtgaacaa gatgtacaag gacaggcagg gcaggtttga caggtctgtg gagaagtgga 720
ggaggttcca ctatgacatc aagatcttca accagtggct gactgaggct gagcagttcc 780
tgaggaagac ccagatccct 800
<210> 18
<211> 4655
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> DD 5'AAV
<400> 18
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt 60
ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact 120
aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta tttactagtt 180
ttctcgagtt tggccgtccg ccctcggcac catcctcacg acacccaaat atggcgacgg 240
gtgaggaatg gtggggagtt atttttagag cggtgaggaa ggtgggcagg cagcaggtgt 300
tggcgctcta aaaataactc ccgggagtta tttttagagc ggaggaatgg tggacaccca 360
aatatggcga cggttcctca cccgtcgcca tatttgggtg tccgccctcg gccggggccg 420
cattcctggg ggccgggcgg tgctcccgcc cgcctcgata aaaggctccg gggccggcgg 480
cggcccacga gctacccgga ggagcgggag gcgccaagct ctagagtaag tatcaaggtt 540
acaagacagg tttaaggaga ccaatagaaa ctgggcttgt cgagacagag aagactcttg 600
cgtttctgat aggcacctat tggtcttact gacatccact ttgcctttct ctccacaggc 660
caccatgctt tggtgggaag aagtagagga ctgttatgaa agagaagatg ttcaaaagaa 720
aacattcaca aaatgggtaa atgcacaatt ttctaagttt gggaagcagc atattgagaa 780
cctcttcagt gacctacagg atgggaggcg cctcctagac ctcctcgaag gcctgacagg 840
gcaaaaactg ccaaaagaaa aaggatccac aagagttcat gccctgaaca atgtcaacaa 900
ggcactgcgg gttttgcaga acaataatgt tgatttagtg aatattggaa gtactgacat 960
cgtagatgga aatcataaac tgactcttgg tttgatttgg aatataatcc tccactggca 1020
ggtcaaaaat gtaatgaaaa atatcatggc tggattgcaa caaaccaaca gtgaaaagat 1080
tctcctgagc tgggtccgac aatcaactcg taattatcca caggttaatg taatcaactt 1140
caccaccagc tggtctgatg gcctggcttt gaatgctctc atccatagtc ataggccaga 1200
cctatttgac tggaatagtg tggtttgcca gcagtcagcc acacaacgac tggaacatgc 1260
attcaacatc gccagatatc aattaggcat agagaaacta ctcgatcctg aagatgttga 1320
taccacctat ccagataaga agtccatctt aatgtacatc acatcactct tccaagtttt 1380
gcctcaacaa gtgagcattg aagccatcca ggaagtggaa atgttgccaa ggccacctaa 1440
agtgactaaa gaagaacatt ttcagttaca tcatcaaatg cactattctc aacagatcac 1500
ggtcagtcta gcacagggat atgagagaac ttcttcccct aagcctcgat tcaagagcta 1560
tgcctacaca caggctgctt atgtcaccac ctctgaccct acacggagcc catttccttc 1620
acagcatttg gaagctcctg aagacaagtc atttggcagt tcattgatgg agagtgaagt 1680
aaacctggac cgttatcaaa cagctttaga agaagtatta tcgtggcttc tttctgctga 1740
ggacacattg caagcacaag gagagatttc taatgatgtg gaagtggtga aagaccagtt 1800
tcatactcat gaggggtaca tgatggattt gacagcccat cagggccggg ttggtaatat 1860
tctacaattg ggaagtaagc tgattggaac aggaaaatta tcagaagatg aagaaactga 1920
agtacaagag cagatgaatc tcctaaattc aagatgggaa tgcctcaggg tagctagcat 1980
ggaaaaacaa agcaatttac atagagtttt aatggatctc cagaatcaga aactgaaaga 2040
gttgaatgac tggctaacaa aaacagaaga aagaacaagg aaaatggagg aagagcctct 2100
tggacctgat cttgaagacc taaaacgcca agtacaacaa cataaggtgc ttcaagaaga 2160
tctagaacaa gaacaagtca gggtcaattc tctcactcac atggtggtgg tagttgatga 2220
atctagtgga gatcacgcaa ctgctgcttt ggaagaacaa cttaaggtat tgggagatcg 2280
atgggcaaac atctgtagat ggacagaaga ccgctgggtt cttttacaag acatccttct 2340
caaatggcaa cgtcttactg aagaacagtg cctttttagt gcatggcttt cagaaaaaga 2400
agatgcagtg aacaagattc acacaactgg ctttaaagat caaaatgaaa tgttatcaag 2460
tcttcaaaaa ctggccgttt taaaagcgga tctagaaaag aaaaagcaat ccatgggcaa 2520
actgtattca ctcaaacaag atcttctttc aacactgaag aataagtcag tgacccagaa 2580
gacggaagca tggctggata actttgcccg gtgttgggat aatttagtcc aaaaacttga 2640
aaagagtaca gcacagattt cacaggaaat ttcttatgtg ccttctactt atttgactga 2700
aatcactcat gtctcacaag ccctattaga agtggaacaa cttctcaatg ctcctgacct 2760
ctgtgctaag gactttgaag atctctttaa gcaagaggag tctctgaaga atataaaaga 2820
tagtctacaa caaagctcag gtcggattga cattattcat agcaagaaga cagcagcatt 2880
gcaaagtgca acgcctgtgg aaagggtgaa gctacaggaa gctctctccc agcttgattt 2940
ccaatgggaa aaagttaaca aaatgtacaa ggaccgacaa gggcgatttg acagatctgt 3000
tgagaaatgg cggcgttttc attatgatat aaagatattt aatcagtggc taacagaagc 3060
tgaacagttt ctcagaaaga cacaaattcc tgagaattgg gaacatgcta aatacaaatg 3120
gtatcttaag gaactccagg atggcattgg gcagcggcaa actgttgtca gaacattgaa 3180
tgcaactggg gaagaaataa ttcagcaatc ctcaaaaaca gatgccagta ttctacagga 3240
aaaattggga agcctgaatc tgcggtggca ggaggtctgc aaacagctgt cagacagaaa 3300
aaagaggcta gaagaacaaa agaatatctt gtcagaattt caaagagatt taaatgaatt 3360
tgttttatgg ttggaggaag cagataacat tgctagtatc ccacttgaac ctggaaaaga 3420
gcagcaacta aaagaaaagc ttgagcaagt caagttactg gtggaagagt tgcccctgcg 3480
ccagggaatt ctcaaacaat taaatgaaac tggaggaccc gtgcttgtaa gtgctcccat 3540
aagcccagaa gagcaagata aacttgaaaa taagctcaag cagacaaatc tccagtggat 3600
aaaggtttcc agagctttac ctgagaaaca aggagaaatt gaagctcaaa taaaagacct 3660
tgggcagctt gaaaaaaagc ttgaagacct tgaagagcag ttaaatcatc tgctgctgtg 3720
gttatctcct attaggaatc agttggaaat ttataaccaa ccaaaccaag aaggaccatt 3780
tgacgttcag gaaactgaaa tagcagttca agctaaacaa ccggatgtgg aagagatttt 3840
gtctaaaggg cagcatttgt acaaggaaaa accagccact cagccagtga agaggaagtt 3900
agaagatctg agctctgagt ggaaggcggt aaaccgttta cttcaagagc tgagggcaaa 3960
gcagcctgac ctagctcctg gactgaccac tattggagcc tctcctactc agactgttac 4020
tctggtgaca caacctgtgg ttactaagga aactgccatc tccaaactag aaatgccatc 4080
ttccttgatg ttggaggtac ctgctctggc agatttcaac cgggcttgga cagaacttac 4140
cgactggctt tctctgcttg atcaagttat aaaatcacag agggtgatgg tgggtgacct 4200
tgaggatatc aacgagatga tcatcaagca gaaggcaaca atgcaggatt tggaacagag 4260
gcgtccccag ttggaagaac tcattaccgc tgcccaaaat ttgaaaaaca agaccagcaa 4320
tcaagaggct agaacaatca ttacggatcg aattgaaaga attcagaatc agtgggatga 4380
agtacaagaa caccttcaga accggcgcca acagttgaat gaaatgttaa aggattcaac 4440
acaatggctg gaagctaagg aagaatttac gcgttttgcg gccgctttac gtagataagt 4500
agcatggcgg gttaatcatt aactacaagg aacccctagt gatggagttg gccactccct 4560
ctctgcgcgc tcgctcgctc actgaggccg ggcgaccaaa ggtcgcccga cgcccgggct 4620
ttgcccgggc ggcctcagtg agcgagcgag cgcgc 4655
<210> 19
<211> 4262
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> DD 3'AAV
<400> 19
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt 60
ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact 120
aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta tttgcggccg 180
ctttacgcgt ttttggaggt acctgctctg gcagatttca accgggcttg gacagaactt 240
accgactggc tttctctgct tgatcaagtt ataaaatcac agagggtgat ggtgggtgac 300
cttgaggata tcaacgagat gatcatcaag cagaaggcaa caatgcagga tttggaacag 360
aggcgtcccc agttggaaga actcattacc gctgcccaaa atttgaaaaa caagaccagc 420
aatcaagagg ctagaacaat cattacggat cgaattgaaa gaattcagaa tcagtgggat 480
gaagtacaag aacaccttca gaaccggcgc caacagttga atgaaatgtt aaaggattca 540
acacaatggc tggaagctaa ggaagaagct gagcaggtct taggacaggc cagagccaag 600
cttgagtcat ggaaggaggg tccctataca gtagatgcaa tccaaaagaa aatcacagaa 660
accaagcagt tggccaaaga cctccgccag tggcagacaa atgtagatgt ggcaaatgac 720
ttggccctga aacttctccg ggattattct gcagatgata ccagaaaagt ccacatgata 780
acagagaata tcaatgcctc ttggagaagc attcataaaa gggtgagtga gcgagaggct 840
gctttggaag aaactcatag attactgcaa cagttccccc tggacctgga aaagtttctt 900
gcctggctta cagaagctga aacaactgcc aatgtcctac aggatgctac ccgtaaggaa 960
aggctcctag aagactccaa gggagtaaaa gagctgatga aacaatggca agacctccaa 1020
ggtgaaattg aagctcacac agatgtttat cacaacctgg atgaaaacag ccaaaaaatc 1080
ctgagatccc tggaaggttc cgatgatgca gtcctgttac aaagacgttt ggataacatg 1140
aacttcaagt ggagtgaact tcggaaaaag tctctcaaca ttaggtccca tttggaagcc 1200
agttctgacc agtggaagcg tctgcacctt tctctgcagg aacttctggt gtggctacag 1260
ctgaaagatg atgaattaag ccggcaggca cctattggag gcgactttcc agcagttcag 1320
aagcagaacg atgtacatag ggccttcaag agggaattga aaactaaaga acctgtaatc 1380
atgagtactc ttgagactgt acgaatattt ctgacagagc agcctttgga aggactagag 1440
aaactctacc aggagcccag agagctgcct cctgaggaga gagcccagaa tgtcactcgg 1500
cttctacgaa agcaggctga ggaggtcaat actgagtggg aaaaattgaa cctgcactcc 1560
gctgactggc agagaaaaat agatgagacc cttgaaagac tccaggaact tcaagaggcc 1620
acggatgagc tggacctcaa gctgcgccaa gctgaggtga tcaagggatc ctggcagccc 1680
gtgggcgatc tcctcattga ctctctccaa gatcacctcg agaaagtcaa ggcacttcga 1740
ggagaaattg cgcctctgaa agagaacgtg agccacgtca atgaccttgc tcgccagctt 1800
accactttgg gcattcagct ctcaccgtat aacctcagca ctctggaaga cctgaacacc 1860
agatggaagc ttctgcaggt ggccgtcgag gaccgagtca ggcagctgca tgaagcccac 1920
agggactttg gtccagcatc tcagcacttt ctttccacgt ctgtccaggg tccctgggag 1980
agagccatct cgccaaacaa agtgccctac tatatcaacc acgagactca aacaacttgc 2040
tgggaccatc ccaaaatgac agagctctac cagtctttag ctgacctgaa taatgtcaga 2100
ttctcagctt ataggactgc catgaaactc cgaagactgc agaaggccct ttgcttggat 2160
ctcttgagcc tgtcagctgc atgtgatgcc ttggaccagc acaacctcaa gcaaaatgac 2220
cagcccatgg atatcctgca gattattaat tgtttgacca ctatttatga ccgcctggag 2280
caagagcaca acaatttggt caacgtccct ctctgcgtgg atatgtgtct gaactggctg 2340
ctgaatgttt atgatacggg acgaacaggg aggatccgtg tcctgtcttt taaaactggc 2400
atcatttccc tgtgtaaagc acatttggaa gacaagtaca gatacctttt caagcaagtg 2460
gcaagttcaa caggattttg tgaccagcgc aggctgggcc tccttctgca tgattctatc 2520
caaattccaa gacagttggg tgaagttgca tcctttgggg gcagtaacat tgagccaagt 2580
gtccggagct gcttccaatt tgctaataat aagccagaga tcgaagcggc cctcttccta 2640
gactggatga gactggaacc ccagtccatg gtgtggctgc ccgtcctgca cagagtggct 2700
gctgcagaaa ctgccaagca tcaggccaaa tgtaacatct gcaaagagtg tccaatcatt 2760
ggattcaggt acaggagtct aaagcacttt aattatgaca tctgccaaag ctgctttttt 2820
tctggtcgag ttgcaaaagg ccataaaatg cactatccca tggtggaata ttgcactccg 2880
actacatcag gagaagatgt tcgagacttt gccaaggtac taaaaaacaa atttcgaacc 2940
aaaaggtatt ttgcgaagca tccccgaatg ggctacctgc cagtgcagac tgtcttagag 3000
ggggacaaca tggaaactcc cgttactctg atcaacttct ggccagtaga ttctgcgcct 3060
gcctcgtccc ctcagctttc acacgatgat actcattcac gcattgaaca ttatgctagc 3120
aggctagcag aaatggaaaa cagcaatgga tcttatctaa atgatagcat ctctcctaat 3180
gagagcatag atgatgaaca tttgttaatc cagcattact gccaaagttt gaaccaggac 3240
tcccccctga gccagcctcg tagtcctgcc cagatcttga tttccttaga gagtgaggaa 3300
agaggggagc tagagagaat cctagcagat cttgaggaag aaaacaggaa tctgcaagca 3360
gaatatgacc gtctaaagca gcagcacgaa cataaaggcc tgtccccact gccgtcccct 3420
cctgaaatga tgcccacctc tccccagagt ccccgggatg ctgagctcat tgctgaggcc 3480
aagctactgc gtcaacacaa aggccgcctg gaagccagga tgcaaatcct ggaagaccac 3540
aataaacagc tggagtcaca gttacacagg ctaaggcagc tgctggagca accccaggca 3600
gaggccaaag tgaatggcac aacggtgtcc tctccttcta cctctctaca gaggtccgac 3660
agcagtcagc ctatgctgct ccgagtggtt ggcagtcaaa cttcggactc catgggtgag 3720
gaagatcttc tcagtcctcc ccaggacaca agcacagggt tagaggaggt gatggagcaa 3780
ctcaacaact ccttccctag ttcaagagga agaaataccc ctggaaagcc aatgagagag 3840
gacacaatgt agcagacatg ataagataca ttgatgagtt tggacaaacc acaactagaa 3900
tgcagtgaaa aaaatgcttt atttgtgaaa tttgtgatgc tattgcttta tttgtaacca 3960
ttataagctg caataaacaa gttaacaaca acaattgcat tcattttatg tttcaggttc 4020
agggggagat gtgggaggtt ttttaaagca agtaaaacct ctacaaatgt ggtattgtcg 4080
actttactag tttttacgta gataagtagc atggcgggtt aatcattaac tacaaggaac 4140
ccctagtgat ggagttggcc actccctctc tgcgcgctcg ctcgctcact gaggccgggc 4200
gaccaaaggt cgcccgacgc ccgggctttg cccgggcggc ctcagtgagc gagcgagcgc 4260
gc 4262
<210> 20
<211> 2427
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<400> 20
Met Leu Trp Trp Glu Glu Val Glu Asp Cys Tyr Glu Arg Glu Asp Val
1 5 10 15
Gln Lys Lys Thr Phe Thr Lys Trp Val Asn Ala Gln Phe Ser Lys Phe
20 25 30
Gly Lys Gln His Ile Glu Asn Leu Phe Ser Asp Leu Gln Asp Gly Arg
35 40 45
Ser Leu Leu Asp Leu Leu Glu Gly Leu Thr Gly Gln Lys Leu Pro Lys
50 55 60
Glu Lys Gly Ser Thr Arg Val His Ala Leu Asn Asn Val Asn Lys Ala
65 70 75 80
Leu Arg Val Leu Gln Asn Asn Asn Val Asp Leu Val Asn Ile Gly Ser
85 90 95
Thr Asp Ile Val Asp Gly Asn His Lys Leu Thr Leu Gly Leu Ile Trp
100 105 110
Asn Ile Ile Leu His Trp Gln Val Lys Asn Val Met Lys Asn Ile Met
115 120 125
Ala Gly Leu Gln Gln Thr Asn Ser Glu Lys Ile Leu Leu Ser Trp Val
130 135 140
Arg Gln Ser Thr Arg Asn Tyr Pro Gln Val Asn Val Ile Asn Phe Thr
145 150 155 160
Thr Ser Trp Ser Asp Gly Leu Ala Leu Asn Ala Leu Ile His Ser His
165 170 175
Arg Pro Asp Leu Phe Asp Trp Asn Ser Val Val Cys Gln Gln Ser Ala
180 185 190
Thr Gln Arg Leu Glu His Ala Phe Asn Ile Ala Arg Tyr Gln Leu Gly
195 200 205
Ile Glu Lys Leu Leu Asp Pro Glu Asp Val Asp Thr Thr Tyr Pro Asp
210 215 220
Lys Lys Ser Ile Leu Met Tyr Ile Thr Ser Leu Phe Gln Val Leu Pro
225 230 235 240
Gln Gln Val Ser Ile Glu Ala Ile Gln Glu Val Glu Met Leu Pro Arg
245 250 255
Pro Pro Lys Val Thr Lys Glu Glu His Phe Gln Leu His His Gln Met
260 265 270
His Tyr Ser Gln Gln Ile Thr Val Ser Leu Ala Gln Gly Tyr Glu Arg
275 280 285
Thr Ser Ser Pro Lys Pro Arg Phe Lys Ser Tyr Ala Tyr Thr Gln Ala
290 295 300
Ala Tyr Val Thr Thr Ser Asp Pro Thr Arg Ser Pro Phe Pro Ser Gln
305 310 315 320
His Leu Glu Ala Pro Glu Asp Lys Ser Phe Gly Ser Ser Leu Met Glu
325 330 335
Ser Glu Val Asn Leu Asp Arg Tyr Gln Thr Ala Leu Glu Glu Val Leu
340 345 350
Ser Trp Leu Leu Ser Ala Glu Asp Thr Leu Gln Ala Gln Gly Glu Ile
355 360 365
Ser Asn Asp Val Glu Val Val Lys Asp Gln Phe His Thr His Glu Gly
370 375 380
Tyr Met Met Asp Leu Thr Ala His Gln Gly Arg Val Gly Asn Ile Leu
385 390 395 400
Gln Leu Gly Ser Lys Leu Ile Gly Thr Gly Lys Leu Ser Glu Asp Glu
405 410 415
Glu Thr Glu Val Gln Glu Gln Met Asn Leu Leu Asn Ser Arg Trp Glu
420 425 430
Cys Leu Arg Val Ala Ser Met Glu Lys Gln Ser Asn Leu His Arg Val
435 440 445
Leu Met Asp Leu Gln Asn Gln Lys Leu Lys Glu Leu Asn Asp Trp Leu
450 455 460
Thr Lys Thr Glu Glu Arg Thr Arg Lys Met Glu Glu Glu Pro Leu Gly
465 470 475 480
Pro Asp Leu Glu Asp Leu Lys Arg Gln Val Gln Gln His Lys Val Leu
485 490 495
Gln Glu Asp Leu Glu Gln Glu Gln Val Arg Val Asn Ser Leu Thr His
500 505 510
Met Val Val Val Val Asp Glu Ser Ser Gly Asp His Ala Thr Ala Ala
515 520 525
Leu Glu Glu Gln Leu Lys Val Leu Gly Asp Arg Trp Ala Asn Ile Cys
530 535 540
Arg Trp Thr Glu Asp Arg Trp Val Leu Leu Gln Asp Ile Leu Leu Lys
545 550 555 560
Trp Gln Arg Leu Thr Glu Glu Gln Cys Leu Phe Ser Ala Trp Leu Ser
565 570 575
Glu Lys Glu Asp Ala Val Asn Lys Ile His Thr Thr Gly Phe Lys Asp
580 585 590
Gln Asn Glu Met Leu Ser Ser Leu Gln Lys Leu Ala Val Leu Lys Ala
595 600 605
Asp Leu Glu Lys Lys Lys Gln Ser Met Gly Lys Leu Tyr Ser Leu Lys
610 615 620
Gln Asp Leu Leu Ser Thr Leu Lys Asn Lys Ser Val Thr Gln Lys Thr
625 630 635 640
Glu Ala Trp Leu Asp Asn Phe Ala Arg Cys Trp Asp Asn Leu Val Gln
645 650 655
Lys Leu Glu Lys Ser Thr Ala Gln Ile Ser Gln Ala Val Thr Thr Thr
660 665 670
Gln Pro Ser Leu Thr Gln Thr Thr Val Met Glu Thr Val Thr Thr Val
675 680 685
Thr Thr Arg Glu Gln Ile Leu Val Lys His Ala Gln Glu Glu Leu Pro
690 695 700
Pro Pro Pro Pro Gln Lys Lys Arg Gln Ile Thr Val Asp Gly Leu Glu
705 710 715 720
Lys Thr Val Ser Leu Gln Lys Asp Leu Ser Glu Met His Glu Trp Met
725 730 735
Thr Gln Ala Glu Glu Glu Tyr Leu Glu Arg Asp Ser Glu Tyr Lys Thr
740 745 750
Pro Asp Glu Leu Gln Lys Ala Val Glu Glu Met Lys Arg Ala Lys Glu
755 760 765
Glu Ala Gln Gln Lys Glu Ala Lys Val Lys Leu Leu Thr Glu Ser Val
770 775 780
Asn Ser Val Ile Ala Gln Ala Pro Pro Val Ala Gln Glu Ala Leu Lys
785 790 795 800
Lys Glu Leu Glu Thr Leu Thr Thr Asn Tyr Gln Trp Leu Cys Thr Arg
805 810 815
Leu Asn Gly Lys Cys Lys Thr Leu Glu Glu Val Trp Ala Cys Trp His
820 825 830
Glu Leu Leu Ser Tyr Leu Glu Lys Ala Asn Lys Trp Leu Asn Glu Val
835 840 845
Glu Phe Lys Leu Lys Thr Thr Glu Asn Ile Pro Gly Gly Ala Glu Glu
850 855 860
Ile Ser Glu Val Leu Asp Ser Leu Glu Asn Leu Met Arg His Ser Glu
865 870 875 880
Asp Asn Pro Asn Gln Ile Arg Ile Leu Ala Gln Thr Leu Thr Asp Gly
885 890 895
Gly Val Met Asp Glu Leu Ile Asn Glu Glu Leu Glu Thr Phe Asn Ser
900 905 910
Arg Trp Arg Glu Leu His Glu Glu Ala Val Arg Arg Gln Lys Leu Leu
915 920 925
Glu Gln Ile His Thr Val Arg Glu Glu Thr Met Met Val Met Thr Glu
930 935 940
Asp Met Pro Leu Glu Ile Ser Tyr Val Pro Ser Thr Tyr Leu Thr Glu
945 950 955 960
Ile Thr His Val Ser Gln Ala Leu Leu Glu Val Glu Gln Leu Leu Asn
965 970 975
Ala Pro Asp Leu Cys Ala Lys Asp Phe Glu Asp Leu Phe Lys Gln Glu
980 985 990
Glu Ser Leu Lys Asn Ile Lys Asp Ser Leu Gln Gln Ser Ser Gly Arg
995 1000 1005
Ile Asp Ile Ile His Ser Lys Lys Thr Ala Ala Leu Gln Ser Ala Thr
1010 1015 1020
Pro Val Glu Arg Val Lys Leu Gln Glu Ala Leu Ser Gln Leu Asp Phe
1025 1030 1035 1040
Gln Trp Glu Lys Val Asn Lys Met Tyr Lys Asp Arg Gln Gly Arg Phe
1045 1050 1055
Asp Arg Ser Val Glu Lys Trp Arg Arg Phe His Tyr Asp Ile Lys Ile
1060 1065 1070
Phe Asn Gln Trp Leu Thr Glu Ala Glu Gln Phe Leu Arg Lys Thr Gln
1075 1080 1085
Ile Pro Glu Asn Trp Glu His Ala Lys Tyr Lys Trp Tyr Leu Lys Glu
1090 1095 1100
Leu Gln Asp Gly Ile Gly Gln Arg Gln Thr Val Val Arg Thr Leu Asn
1105 1110 1115 1120
Ala Thr Gly Glu Glu Ile Ile Gln Gln Ser Ser Lys Thr Asp Ala Ser
1125 1130 1135
Ile Leu Gln Glu Lys Leu Gly Ser Leu Asn Leu Arg Trp Gln Glu Val
1140 1145 1150
Cys Lys Gln Leu Ser Asp Arg Lys Lys Arg Leu Glu Glu Gln Pro Asp
1155 1160 1165
Leu Ala Pro Gly Leu Thr Thr Ile Gly Ala Ser Pro Thr Gln Thr Val
1170 1175 1180
Thr Leu Val Thr Gln Pro Val Val Thr Lys Glu Thr Ala Ile Ser Lys
1185 1190 1195 1200
Leu Glu Met Pro Ser Ser Leu Met Leu Glu Val Pro Ala Leu Ala Asp
1205 1210 1215
Phe Asn Arg Ala Trp Thr Glu Leu Thr Asp Trp Leu Ser Leu Leu Asp
1220 1225 1230
Gln Val Ile Lys Ser Gln Arg Val Met Val Gly Asp Leu Glu Asp Ile
1235 1240 1245
Asn Glu Met Ile Ile Lys Gln Lys Ala Thr Met Gln Asp Leu Glu Gln
1250 1255 1260
Arg Arg Pro Gln Leu Glu Glu Leu Ile Thr Ala Ala Gln Asn Leu Lys
1265 1270 1275 1280
Asn Lys Thr Ser Asn Gln Glu Ala Arg Thr Ile Ile Thr Asp Arg Ile
1285 1290 1295
Glu Arg Ile Gln Asn Gln Trp Asp Glu Val Gln Glu His Leu Gln Asn
1300 1305 1310
Arg Arg Gln Gln Leu Asn Glu Met Leu Lys Asp Ser Thr Gln Trp Leu
1315 1320 1325
Glu Ala Lys Glu Glu Ala Glu Gln Val Leu Gly Gln Ala Arg Ala Lys
1330 1335 1340
Leu Glu Ser Trp Lys Glu Gly Pro Tyr Thr Val Asp Ala Ile Gln Lys
1345 1350 1355 1360
Lys Ile Thr Glu Thr Lys Gln Leu Ala Lys Asp Leu Arg Gln Trp Gln
1365 1370 1375
Thr Asn Val Asp Val Ala Asn Asp Leu Ala Leu Lys Leu Leu Arg Asp
1380 1385 1390
Tyr Ser Ala Asp Asp Thr Arg Lys Val His Met Ile Thr Glu Asn Ile
1395 1400 1405
Asn Ala Ser Trp Arg Ser Ile His Lys Arg Val Ser Glu Arg Glu Ala
1410 1415 1420
Ala Leu Glu Glu Thr His Arg Leu Leu Gln Gln Phe Pro Leu Asp Leu
1425 1430 1435 1440
Glu Lys Phe Leu Ala Trp Leu Thr Glu Ala Glu Thr Thr Ala Asn Val
1445 1450 1455
Leu Gln Asp Ala Thr Arg Lys Glu Arg Leu Leu Glu Asp Ser Lys Gly
1460 1465 1470
Val Lys Glu Leu Met Lys Gln Trp Gln Asp Leu Gln Gly Glu Ile Glu
1475 1480 1485
Ala His Thr Asp Val Tyr His Asn Leu Asp Glu Asn Ser Gln Lys Ile
1490 1495 1500
Leu Arg Ser Leu Glu Gly Ser Asp Asp Ala Val Leu Leu Gln Arg Arg
1505 1510 1515 1520
Leu Asp Asn Met Asn Phe Lys Trp Ser Glu Leu Arg Lys Lys Ser Leu
1525 1530 1535
Asn Ile Arg Ser His Leu Glu Ala Ser Ser Asp Gln Trp Lys Arg Leu
1540 1545 1550
His Leu Ser Leu Gln Glu Leu Leu Val Trp Leu Gln Leu Lys Asp Asp
1555 1560 1565
Glu Leu Ser Arg Gln Ala Pro Ile Gly Gly Asp Phe Pro Ala Val Gln
1570 1575 1580
Lys Gln Asn Asp Val His Arg Ala Phe Lys Arg Glu Leu Lys Thr Lys
1585 1590 1595 1600
Glu Pro Val Ile Met Ser Thr Leu Glu Thr Val Arg Ile Phe Leu Thr
1605 1610 1615
Glu Gln Pro Leu Glu Gly Leu Glu Lys Leu Tyr Gln Glu Pro Arg Glu
1620 1625 1630
Leu Pro Pro Glu Glu Arg Ala Gln Asn Val Thr Arg Leu Leu Arg Lys
1635 1640 1645
Gln Ala Glu Glu Val Asn Thr Glu Trp Glu Lys Leu Asn Leu His Ser
1650 1655 1660
Ala Asp Trp Gln Arg Lys Ile Asp Glu Thr Leu Glu Arg Leu Gln Glu
1665 1670 1675 1680
Leu Gln Glu Ala Thr Asp Glu Leu Asp Leu Lys Leu Arg Gln Ala Glu
1685 1690 1695
Val Ile Lys Gly Ser Trp Gln Pro Val Gly Asp Leu Leu Ile Asp Ser
1700 1705 1710
Leu Gln Asp His Leu Glu Lys Val Lys Ala Leu Arg Gly Glu Ile Ala
1715 1720 1725
Pro Leu Lys Glu Asn Val Ser His Val Asn Asp Leu Ala Arg Gln Leu
1730 1735 1740
Thr Thr Leu Gly Ile Gln Leu Ser Pro Tyr Asn Leu Ser Thr Leu Glu
1745 1750 1755 1760
Asp Leu Asn Thr Arg Trp Lys Leu Leu Gln Val Ala Val Glu Asp Arg
1765 1770 1775
Val Arg Gln Leu His Glu Ala His Arg Asp Phe Gly Pro Ala Ser Gln
1780 1785 1790
His Phe Leu Ser Thr Ser Val Gln Gly Pro Trp Glu Arg Ala Ile Ser
1795 1800 1805
Pro Asn Lys Val Pro Tyr Tyr Ile Asn His Glu Thr Gln Thr Thr Cys
1810 1815 1820
Trp Asp His Pro Lys Met Thr Glu Leu Tyr Gln Ser Leu Ala Asp Leu
1825 1830 1835 1840
Asn Asn Val Arg Phe Ser Ala Tyr Arg Thr Ala Met Lys Leu Arg Arg
1845 1850 1855
Leu Gln Lys Ala Leu Cys Leu Asp Leu Leu Ser Leu Ser Ala Ala Cys
1860 1865 1870
Asp Ala Leu Asp Gln His Asn Leu Lys Gln Asn Asp Gln Pro Met Asp
1875 1880 1885
Ile Leu Gln Ile Ile Asn Cys Leu Thr Thr Ile Tyr Asp Arg Leu Glu
1890 1895 1900
Gln Glu His Asn Asn Leu Val Asn Val Pro Leu Cys Val Asp Met Cys
1905 1910 1915 1920
Leu Asn Trp Leu Leu Asn Val Tyr Asp Thr Gly Arg Thr Gly Arg Ile
1925 1930 1935
Arg Val Leu Ser Phe Lys Thr Gly Ile Ile Ser Leu Cys Lys Ala His
1940 1945 1950
Leu Glu Asp Lys Tyr Arg Tyr Leu Phe Lys Gln Val Ala Ser Ser Thr
1955 1960 1965
Gly Phe Cys Asp Gln Arg Arg Leu Gly Leu Leu Leu His Asp Ser Ile
1970 1975 1980
Gln Ile Pro Arg Gln Leu Gly Glu Val Ala Ser Phe Gly Gly Ser Asn
1985 1990 1995 2000
Ile Glu Pro Ser Val Arg Ser Cys Phe Gln Phe Ala Asn Asn Lys Pro
2005 2010 2015
Glu Ile Glu Ala Ala Leu Phe Leu Asp Trp Met Arg Leu Glu Pro Gln
2020 2025 2030
Ser Met Val Trp Leu Pro Val Leu His Arg Val Ala Ala Ala Glu Thr
2035 2040 2045
Ala Lys His Gln Ala Lys Cys Asn Ile Cys Lys Glu Cys Pro Ile Ile
2050 2055 2060
Gly Phe Arg Tyr Arg Ser Leu Lys His Phe Asn Tyr Asp Ile Cys Gln
2065 2070 2075 2080
Ser Cys Phe Phe Ser Gly Arg Val Ala Lys Gly His Lys Met His Tyr
2085 2090 2095
Pro Met Val Glu Tyr Cys Thr Pro Thr Thr Ser Gly Glu Asp Val Arg
2100 2105 2110
Asp Phe Ala Lys Val Leu Lys Asn Lys Phe Arg Thr Lys Arg Tyr Phe
2115 2120 2125
Ala Lys His Pro Arg Met Gly Tyr Leu Pro Val Gln Thr Val Leu Glu
2130 2135 2140
Gly Asp Asn Met Glu Thr Pro Val Thr Leu Ile Asn Phe Trp Pro Val
2145 2150 2155 2160
Asp Ser Ala Pro Ala Ser Ser Pro Gln Leu Ser His Asp Asp Thr His
2165 2170 2175
Ser Arg Ile Glu His Tyr Ala Ser Arg Leu Ala Glu Met Glu Asn Ser
2180 2185 2190
Asn Gly Ser Tyr Leu Asn Asp Ser Ile Ser Pro Asn Glu Ser Ile Asp
2195 2200 2205
Asp Glu His Leu Leu Ile Gln His Tyr Cys Gln Ser Leu Asn Gln Asp
2210 2215 2220
Ser Pro Leu Ser Gln Pro Arg Ser Pro Ala Gln Ile Leu Ile Ser Leu
2225 2230 2235 2240
Glu Ser Glu Glu Arg Gly Glu Leu Glu Arg Ile Leu Ala Asp Leu Glu
2245 2250 2255
Glu Glu Asn Arg Asn Leu Gln Ala Glu Tyr Asp Arg Leu Lys Gln Gln
2260 2265 2270
His Glu His Lys Gly Leu Ser Pro Leu Pro Ser Pro Pro Glu Met Met
2275 2280 2285
Pro Thr Ser Pro Gln Ser Pro Arg Asp Ala Glu Leu Ile Ala Glu Ala
2290 2295 2300
Lys Leu Leu Arg Gln His Lys Gly Arg Leu Glu Ala Arg Met Gln Ile
2305 2310 2315 2320
Leu Glu Asp His Asn Lys Gln Leu Glu Ser Gln Leu His Arg Leu Arg
2325 2330 2335
Gln Leu Leu Glu Gln Pro Gln Ala Glu Ala Lys Val Asn Gly Thr Thr
2340 2345 2350
Val Ser Ser Pro Ser Thr Ser Leu Gln Arg Ser Asp Ser Ser Gln Pro
2355 2360 2365
Met Leu Leu Arg Val Val Gly Ser Gln Thr Ser Asp Ser Met Gly Glu
2370 2375 2380
Glu Asp Leu Leu Ser Pro Pro Gln Asp Thr Ser Thr Gly Leu Glu Glu
2385 2390 2395 2400
Val Met Glu Gln Leu Asn Asn Ser Phe Pro Ser Ser Arg Gly Arg Asn
2405 2410 2415
Thr Pro Gly Lys Pro Met Arg Glu Asp Thr Met
2420 2425
<210> 21
<211> 7221
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> WL2 wt
<400> 21
atgctttggt gggaagaagt agaggactgt tatgaaagag aagatgttca aaagaaaaca 60
ttcacaaaat gggtaaatgc acaattttct aagtttggga agcagcatat tgagaacctc 120
ttcagtgacc tacaggatgg gaggcgcctc ctagacctcc tcgaaggcct gacagggcaa 180
aaactgccaa aagaaaaagg atccacaaga gttcatgccc tgaacaatgt caacaaggca 240
ctgcgggttt tgcagaacaa taatgttgat ttagtgaata ttggaagtac tgacatcgta 300
gatggaaatc ataaactgac tcttggtttg atttggaata taatcctcca ctggcaggtc 360
aaaaatgtaa tgaaaaatat catggctgga ttgcaacaaa ccaacagtga aaagattctc 420
ctgagctggg tccgacaatc aactcgtaat tatccacagg ttaatgtaat caacttcacc 480
accagctggt ctgatggcct ggctttgaat gctctcatcc atagtcatag gccagaccta 540
tttgactgga atagtgtggt ttgccagcag tcagccacac aacgactgga acatgcattc 600
aacatcgcca gatatcaatt aggcatagag aaactactcg atcctgaaga tgttgatacc 660
acctatccag ataagaagtc catcttaatg tacatcacat cactcttcca agttttgcct 720
caacaagtga gcattgaagc catccaggaa gtggaaatgt tgccaaggcc acctaaagtg 780
actaaagaag aacattttca gttacatcat caaatgcact attctcaaca gatcacggtc 840
agtctagcac agggatatga gagaacttct tcccctaagc ctcgattcaa gagctatgcc 900
tacacacagg ctgcttatgt caccacctct gaccctacac ggagcccatt tccttcacag 960
catttggaag ctcctgaaga caagtcattt ggcagttcat tgatggagag tgaagtaaac 1020
ctggaccgtt atcaaacagc tttagaagaa gtattatcgt ggcttctttc tgctgaggac 1080
acattgcaag cacaaggaga gatttctaat gatgtggaag tggtgaaaga ccagtttcat 1140
actcatgagg ggtacatgat ggatttgaca gcccatcagg gccgggttgg taatattcta 1200
caattgggaa gtaagctgat tggaacagga aaattatcag aagatgaaga aactgaagta 1260
caagagcaga tgaatctcct aaattcaaga tgggaatgcc tcagggtagc tagcatggaa 1320
aaacaaagca atttacatag agttttaatg gatctccaga atcagaaact gaaagagttg 1380
aatgactggc taacaaaaac agaagaaaga acaaggaaaa tggaggaaga gcctcttgga 1440
cctgatcttg aagacctaaa acgccaagta caacaacata aggtgcttca agaagatcta 1500
gaacaagaac aagtcagggt caattctctc actcacatgg tggtggtagt tgatgaatct 1560
agtggagatc acgcaactgc tgctttggaa gaacaactta aggtattggg agatcgatgg 1620
gcaaacatct gtagatggac agaagaccgc tgggttcttt tacaagacat ccttctcaaa 1680
tggcaacgtc ttactgaaga acagtgcctt tttagtgcat ggctttcaga aaaagaagat 1740
gcagtgaaca agattcacac aactggcttt aaagatcaaa atgaaatgtt atcaagtctt 1800
caaaaactgg ccgttttaaa agcggatcta gaaaagaaaa agcaatccat gggcaaactg 1860
tattcactca aacaagatct tctttcaaca ctgaagaata agtcagtgac ccagaagacg 1920
gaagcatggc tggataactt tgcccggtgt tgggataatt tagtccaaaa acttgaaaag 1980
agtacagcac agatttcaca ggctgtcacc accactcagc catcactaac acagacaact 2040
gtaatggaaa cagtaactac ggtgaccaca agggaacaga tcctggtaaa gcatgctcaa 2100
gaggaacttc caccaccacc tccccaaaag aagaggcaga ttactgtgga tggtttggag 2160
aaaactgtaa gcctccagaa agatctatca gagatgcacg aatggatgac acaagctgaa 2220
gaagagtatc ttgagagaga ttttgaatat aaaactccag atgaattaca gaaagcagtt 2280
gaagagatga agagagctaa agaagaggcc caacaaaaag aagcgaaagt gaaactcctt 2340
actgagtctg taaatagtgt catagctcaa gctccacctg tagcacaaga ggccttaaaa 2400
aaggaacttg aaactctaac caccaactac cagtggctct gcactaggct gaatgggaaa 2460
tgcaagactt tggaagaagt ttgggcatgt tggcatgagt tattgtcata cttggagaaa 2520
gcaaacaagt ggctaaatga agtagaattt aaacttaaaa ccactgaaaa cattcctggc 2580
ggagctgagg aaatctctga ggtgctagat tcacttgaaa atttgatgcg acattcagag 2640
gataacccaa atcagattcg catattggca cagaccctaa cagatggcgg agtcatggat 2700
gagctaatca atgaggaact tgagacattt aattctcgtt ggagggaact acatgaagag 2760
gctgtaagga ggcaaaagtt gcttgaacag cctttgaagg aattggagca gtttaactca 2820
gatatacaaa aattgcttga accactggag gctgaaattc agcagggggt gaatctgaaa 2880
gaggaagact tcaataaaga tatgaatgaa gacaatgagg gtactgtaaa agaattgttg 2940
caaagaggag acaacttaca acaaagaatc acagatgaga gaaagagaga ggaaataaag 3000
ataaaacagc agctgttaca gacaaaacat aacgcgttca aggatttgag gtctcaaaga 3060
agaaaaaagg ctctagaaat ttctcatcag tggtatcagt acaagaggca ggctgatgat 3120
ctcctgaaat gcttggatga cattgaaaaa aaattagcca gcctacctga gcccagagat 3180
gaaaggaaaa taaaggaaat tgatcgggaa ttgcagaaga agaaagagga gctgaatgca 3240
gtgcgtaggc aagctgaggg cttgtctgag gatggggccg caatggcagt ggagccaact 3300
cagatccagc tcagcaagcg ctggcgggaa attgagagca aatttgctca gtttcgaaga 3360
ctcaactttg cacaaattca cactgtccgt gaagaaacga tgatggtgat gactgaagac 3420
atgcctttgg aaatttctta tgtgccttct acttatttga ctgaaatcac tcatgtctca 3480
caagccctat tagaagtgga acaacttctc aatgctcctg acctctgtgc taaggacttt 3540
gaagatctct ttaagcaaga ggagtctctg aagaatataa aagatagtct acaacaaagc 3600
tcaggtcgga ttgacattat tcatagcaag aagacagcag cattgcaaag tgcaacgcct 3660
gtggaaaggg tgaagctaca ggaagctctc tcccagcttg atttccaatg ggaaaaagtt 3720
aacaaaatgt acaaggaccg acaagggcga tttgacagat ctgttgagaa atggcggcgt 3780
tttcattatg atataaagat atttaatcag tggctaacag aagctgaaca gtttctcaga 3840
aagacacaaa ttcctgagaa ttgggaacat gctaaataca aatggtatct taaggaactc 3900
caggatggca ttgggcagcg gcaaactgtt gtcagaacat tgaatgcaac tggggaagaa 3960
ataattcagc aatcctcaaa aacagatgcc agtattctac aggaaaaatt gggaagcctg 4020
aatctgcggt ggcaggaggt ctgcaaacag ctgtcagaca gaaaaaagag gctagaagag 4080
cagcctgacc tagctcctgg actgaccact attggagcct ctcctactca gactgttact 4140
ctggtgacac aacctgtggt tactaaggaa actgccatct ccaaactaga aatgccatct 4200
tccttgatgt tggaggtacc aactcataga ttactgcaac agttccccct ggacctggaa 4260
aagtttcttg cctggcttac agaagctgaa acaactgcca atgtcctaca ggatgctacc 4320
cgtaaggaaa ggctcctaga agactccaag ggagtaaaag agctgatgaa acaatggcaa 4380
gacctccaag gtgaaattga agctcacaca gatgtttatc acaacctgga tgaaaacagc 4440
caaaaaatcc tgagatccct ggaaggttcc gatgatgcag tcctgttaca aagacgtttg 4500
gataacatga acttcaagtg gagtgaactt cggaaaaagt ctctcaacat taggtcccat 4560
ttggaagcca gttctgacca gtggaagcgt ctgcaccttt ctctgcagga acttctggtg 4620
tggctacagc tgaaagatga tgaattaagc cggcaggcac ctattggagg cgactttcca 4680
gcagttcaga agcagaacga tgtacatagg gccttcaaga gggaattgaa aactaaagaa 4740
cctgtaatca tgagtactct tgagactgta cgaatatttc tgacagagca gcctttggaa 4800
ggactagaga aactctacca ggagcccaga gagctgcctc ctgaggagag agcccagaat 4860
gtcactcggc ttctacgaaa gcaggctgag gaggtcaata ctgagtggga aaaattgaac 4920
ctgcactccg ctgactggca gagaaaaata gatgagaccc ttgaaagact ccaggaactt 4980
caagaggcca cggatgagct ggacctcaag ctgcgccaag ctgaggtgat caagggatcc 5040
tggcagcccg tgggcgatct cctcattgac tctctccaag atcacctcga gaaagtcaag 5100
gcacttcgag gagaaattgc gcctctgaaa gagaacgtga gccacgtcaa tgaccttgct 5160
cgccagctta ccactttggg cattcagctc tcaccgtata acctcagcac tctggaagac 5220
ctgaacacca gatggaagct tctgcaggtg gccgtcgagg accgagtcag gcagctgcat 5280
gaagcccaca gggactttgg tccagcatct cagcactttc tttccacgtc tgtccagggt 5340
ccctgggaga gagccatctc gccaaacaaa gtgccctact atatcaacca cgagactcaa 5400
acaacttgct gggaccatcc caaaatgaca gagctctacc agtctttagc tgacctgaat 5460
aatgtcagat tctcagctta taggactgcc atgaaactcc gaagactgca gaaggccctt 5520
tgcttggatc tcttgagcct gtcagctgca tgtgatgcct tggaccagca caacctcaag 5580
caaaatgacc agcccatgga tatcctgcag attattaatt gtttgaccac tatttatgac 5640
cgcctggagc aagagcacaa caatttggtc aacgtccctc tctgcgtgga tatgtgtctg 5700
aactggctgc tgaatgttta tgatacggga cgaacaggga ggatccgtgt cctgtctttt 5760
aaaactggca tcatttccct gtgtaaagca catttggaag acaagtacag ataccttttc 5820
aagcaagtgg caagttcaac aggattttgt gaccagcgca ggctgggcct ccttctgcat 5880
gattctatcc aaattccaag acagttgggt gaagttgcat cctttggggg cagtaacatt 5940
gagccaagtg tccggagctg cttccaattt gctaataata agccagagat cgaagcggcc 6000
ctcttcctag actggatgag actggaaccc cagtccatgg tgtggctgcc cgtcctgcac 6060
agagtggctg ctgcagaaac tgccaagcat caggccaaat gtaacatctg caaagagtgt 6120
ccaatcattg gattcaggta caggagtcta aagcacttta attatgacat ctgccaaagc 6180
tgcttttttt ctggtcgagt tgcaaaaggc cataaaatgc actatcccat ggtggaatat 6240
tgcactccga ctacatcagg agaagatgtt cgagactttg ccaaggtact aaaaaacaaa 6300
tttcgaacca aaaggtattt tgcgaagcat ccccgaatgg gctacctgcc agtgcagact 6360
gtcttagagg gggacaacat ggaaactccc gttactctga tcaacttctg gccagtagat 6420
tctgcgcctg cctcgtcccc tcagctttca cacgatgata ctcattcacg cattgaacat 6480
tatgctagca ggctagcaga aatggaaaac agcaatggat cttatctaaa tgatagcatc 6540
tctcctaatg agagcataga tgatgaacat ttgttaatcc agcattactg ccaaagtttg 6600
aaccaggact cccccctgag ccagcctcgt agtcctgccc agatcttgat ttccttagag 6660
agtgaggaaa gaggggagct agagagaatc ctagcagatc ttgaggaaga aaacaggaat 6720
ctgcaagcag aatatgaccg tctaaagcag cagcacgaac ataaaggcct gtccccactg 6780
ccgtcccctc ctgaaatgat gcccacctct ccccagagtc cccgggatgc tgagctcatt 6840
gctgaggcca agctactgcg tcaacacaaa ggccgcctgg aagccaggat gcaaatcctg 6900
gaagaccaca ataaacagct ggagtcacag ttacacaggc taaggcagct gctggagcaa 6960
ccccaggcag aggccaaagt gaatggcaca acggtgtcct ctccttctac ctctctacag 7020
aggtccgaca gcagtcagcc tatgctgctc cgagtggttg gcagtcaaac ttcggactcc 7080
atgggtgagg aagatcttct cagtcctccc caggacacaa gcacagggtt agaggaggtg 7140
atggagcaac tcaacaactc cttccctagt tcaagaggaa gaaatacccc tggaaagcca 7200
atgagagagg acacaatgta g 7221
<210> 22
<211> 7284
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> WL1校正
<400> 22
atgctgtggt gggaggaggt ggaggactgc tatgagagag aggatgtgca gaagaagacc 60
ttcaccaagt gggtgaatgc ccagttcagc aagtttggca agcagcacat tgagaacctg 120
ttctctgacc tgcaggatgg caggaggctg ctggacctgc tggagggcct aacaggccag 180
aagctgccca aggagaaggg cagtaccagg gtgcatgccc tgaacaatgt gaacaaggcc 240
ctgagggtgc tgcagaacaa caatgtggac ctagtgaata ttggcagtac agatattgtg 300
gatggcaacc acaagctgac cctgggcctg atctggaata ttatcctaca ctggcaggtg 360
aagaatgtga tgaagaatat tatggctggc ctgcagcaga ccaactctga gaagatcctg 420
ctgagctggg tgaggcagag taccaggaac tacccccagg tgaatgtgat caacttcacc 480
accagctggt ctgatggcct ggccctgaat gccctaatcc acagccacag gcctgacctg 540
tttgactgga actctgtggt gtgccagcag tctgccaccc agaggctaga gcatgccttc 600
aatattgcca ggtaccagct ggggattgag aagctgctgg accctgagga tgtggataca 660
acctaccctg acaagaagag catcctgatg tacatcacca gcctgttcca ggtgctgccc 720
cagcaggtga gcattgaggc catccaggag gtggagatgc tacccaggcc ccccaaggtg 780
accaaggagg agcacttcca gctacaccac cagatgcact acagccagca gatcacagtg 840
agcctggccc agggctatga gaggaccagc agccccaagc ccaggttcaa gagctatgcc 900
tacacccagg ctgcctatgt gaccacctca gaccccacca ggagcccctt ccccagccag 960
cacctggagg cccctgagga caagagcttt ggcagcagcc tgatggagtc tgaggtgaac 1020
ctggacaggt accagacagc cctggaggag gtgctgagct ggctgctgtc tgctgaggat 1080
accctacagg cccagggaga gatcagcaat gatgtggagg tggtgaagga ccagttccat 1140
acccatgagg gctacatgat ggacctaaca gcccaccagg gcagggtggg caacatcctg 1200
cagctgggca gcaagctgat tggcacaggc aagctgtctg aggatgagga gacagaggtg 1260
caggagcaga tgaacctgct gaacagcagg tgggagtgcc tgagggtggc cagtatggag 1320
aagcagagca acctacacag ggtgctgatg gacctgcaga accagaagct gaaggagctg 1380
aatgactggc tgaccaagac agaggagagg accaggaaga tggaggagga gcccctgggc 1440
cctgacctgg aggacctaaa gaggcaggtg cagcagcaca aggtgctgca ggaggacctg 1500
gagcaggagc aggtgagggt gaacagccta acccatatgg tggtggtggt ggatgagagc 1560
tcaggagacc atgctacagc tgccctggag gagcagctga aggtgctggg agacaggtgg 1620
gctaatatct gcaggtggac agaggacagg tgggtgctgc tgcaggatat cctgctgaag 1680
tggcagaggc tgacagagga gcagtgcctg ttctctgcct ggctgtcaga gaaggaggat 1740
gctgtgaaca agatccatac aacaggcttc aaggaccaga atgagatgct gagcagcctg 1800
cagaagctgg ctgtgctgaa ggctgacctg gagaagaaga agcagagtat gggcaagctg 1860
tacagcctga agcaggacct gctgagtacc ctgaagaaca agtctgtgac ccagaagaca 1920
gaggcctggc tggacaactt tgccaggtgc tgggataacc tggtgcagaa gctggagaag 1980
agtacagccc agatcagcca ggctgtgaca acaacccagc ccagcctgac ccagacaaca 2040
gtgatggaga cagtgacaac agtgacaacc agggagcaga tcctagtgaa gcatgcccag 2100
gaggagctgc cccccccccc cccccagaag aagaggcaga ttacagtgga tggcctggag 2160
aagacagtga gcctccagaa ggacctgtca gagatgcatg agtggatgac ccaggctgag 2220
gaggagtacc tggagaggga cttcgagtac aagacccctg atgagctgca gaaggctgtg 2280
gaggagatga agagggccaa ggaggaggcc cagcagaagg aggccaaggt gaagctgctg 2340
acagagtctg tgaactctgt gattgcccag gccccccctg tggcccagga ggccctgaag 2400
aaggagctgg agaccctgac aaccaactac cagtggctgt gcaccaggct gaatggcaag 2460
tgcaagaccc tagaggaggt gtgggcctgc tggcatgagc tgctgagcta cctggagaag 2520
gccaacaagt ggctgaatga ggtggagttc aagctgaaga ccacagagaa catccctgga 2580
ggagctgagg agatctctga ggtgctggac agcctggaga acctaatgag gcactctgag 2640
gacaacccca accagatcag gatcctggcc cagaccctga cagatggagg agtgatggat 2700
gagctgatca atgaggagct ggagaccttc aacagcaggt ggagggagct acatgaggag 2760
gctgtgagga ggcagaagct gctagagcag atccatacag tgagggagga gacaatgatg 2820
gtgatgacag aggatatgcc cctggagatc agctatgtgc ccagcaccta cctaacagag 2880
atcacccatg tgagccaggc cctgctggag gtggagcagc tgctgaatgc ccctgacctg 2940
tgtgccaagg actttgagga cctgttcaag caggaggaga gcctgaagaa catcaaggac 3000
agcctgcagc agagctcagg caggattgat attatccaca gcaagaagac agctgccctg 3060
cagtctgcta cccctgtgga gagggtgaag ctgcaggagg ccctgagcca gctggacttc 3120
cagtgggaga aggtgaacaa gatgtacaag gacaggcagg gcaggtttga caggtcagtg 3180
gagaagtgga ggaggttcca ctatgacatc aagatcttca accagtggct gacagaggct 3240
gagcagttcc taaggaagac ccagatccct gagaactggg agcatgccaa gtacaagtgg 3300
taccttaagg agctgcagga tggcattggc cagaggcaga cagtggtgag gaccctgaat 3360
gctacagggg aggagattat ccagcagagc agcaagacag atgccagtat cctgcaggag 3420
aagctgggca gcctgaacct gaggtggcag gaggtgtgca agcagctgtc tgacaggaag 3480
aagaggctgg aggagcagcc tgacctggcc cctggcctga caacaattgg agccagccct 3540
acccagacag tgaccctggt gacccagcct gtggtgacca aggagacagc tatcagcaag 3600
ctggagatgc ccagcagcct gatgctggag gtgcctgccc tggctgactt caacagggcc 3660
tggacagagc tgacagactg gctgagcctg ctggaccagg tgatcaagag ccagagggtg 3720
atggtggggg acctggagga tatcaatgag atgattatca agcagaaggc tacaatgcag 3780
gacctggagc agaggaggcc ccagctggag gagctgatta cagctgccca gaacctaaag 3840
aacaagacca gcaaccagga ggccaggaca attattacag acaggattga gaggatccag 3900
aaccagtggg atgaggtgca ggagcacctg cagaacagga ggcagcagct gaatgagatg 3960
ctgaaggaca gtacccagtg gctggaggcc aaggaggagg ctgagcaggt gctgggccag 4020
gccagggcca agctggagag ctggaaggag ggcccctaca cagtggatgc catccagaag 4080
aagatcacag agaccaagca gctggccaag gacctgaggc agtggcagac caatgtggat 4140
gtggccaatg acctggccct gaagctgctg agggactact cagctgatga caccaggaag 4200
gtgcacatga tcacagagaa catcaatgcc agctggagga gcatccacaa gagggtgtct 4260
gagagggagg ctgccctaga ggagacccac aggctgctgc agcagttccc cctggacctg 4320
gagaagttcc tggcctggct gacagaggct gagacaacag ccaatgtgct gcaggatgct 4380
accaggaagg agaggctgct ggaggacagc aagggagtga aggagctgat gaagcagtgg 4440
caggacctac agggggagat tgaggcccat acagatgtgt accacaacct ggatgagaac 4500
agccagaaga tcctgaggag cctggagggc tctgatgatg ctgtgctgct gcagaggagg 4560
ctggacaata tgaacttcaa gtggtctgag ctgaggaaga agagcctaaa tatcaggagc 4620
cacctggagg ccagctctga ccagtggaag aggctacacc tgagcctgca ggagctgctg 4680
gtgtggctgc agctgaagga tgatgagcta agcaggcagg cccctattgg gggagacttc 4740
cctgctgtgc agaagcagaa tgatgtgcac agggccttca agagggagct gaagaccaag 4800
gagcctgtga ttatgagtac cctggagaca gtgaggatct tcctgacaga gcagcccctg 4860
gagggcctgg agaagctgta ccaggagccc agggagctgc cccctgagga gagggcccag 4920
aatgtgacca ggctgctgag gaagcaggct gaggaggtga atacagagtg ggagaagctg 4980
aacctacact ctgctgactg gcagaggaag attgatgaga ccctggagag gctgcaggag 5040
ctgcaggagg ctacagatga gctggacctg aagctgaggc aggctgaggt gatcaagggc 5100
agctggcagc ctgtggggga cctgctgatt gacagcctac aggaccacct ggagaaggtg 5160
aaggccctga ggggagagat tgccccccta aaggagaatg tgagtcatgt gaatgacctg 5220
gccaggcagc tgaccaccct gggcatccag ctgagcccct acaacctgag caccctggag 5280
gacctgaaca ccaggtggaa gctgctgcag gtggctgtgg aggacagggt gaggcagcta 5340
catgaggccc acagggactt tggccctgcc agccagcact tcctaagcac ctctgttcag 5400
ggcccctggg agagggccat cagccccaac aaggtgccct actacatcaa ccatgagacc 5460
cagaccacct gctgggacca ccccaagatg acagagctgt accagagcct ggctgacctg 5520
aacaatgtga ggttctctgc ctacaggaca gccatgaagc tgaggaggct gcagaaggcc 5580
ctgtgcctgg acctgctgag cctgtctgct gcctgtgatg ccctagacca gcacaacctg 5640
aagcagaatg accagcccat ggacatcctt cagatcatca actgcctgac caccatctat 5700
gacaggctag agcaggagca caacaacctg gtgaatgtgc ccctgtgtgt ggacatgtgc 5760
ctgaactggc tgctgaatgt gtatgacaca ggcaggacag gcaggatcag ggtgctgagc 5820
ttcaagacag gcatcatcag cctatgcaag gctcacctgg aggacaagta caggtacctg 5880
ttcaagcagg tggccagcag cacaggcttc tgtgaccaga ggaggctggg cctgctactt 5940
catgacagca ttcagatccc caggcagctg ggagaggtgg ccagctttgg aggcagcaac 6000
attgagccct cagtgaggag ctgcttccag tttgccaaca acaagcctga gattgaggct 6060
gccctgttcc tggactggat gaggctggag cctcagagca tggtgtggct gcctgtgctg 6120
cacagggtgg ctgctgctga gacagccaag catcaggcca agtgcaacat ctgcaaggag 6180
tgccccatca ttggcttcag gtacaggagc ctaaagcact tcaactatga catctgccag 6240
agctgcttct tctcaggcag ggtggccaag ggccacaaga tgcactaccc catggtggag 6300
tactgcaccc ccaccacctc aggagaggat gtgagggact ttgccaaggt gctgaagaac 6360
aagttcagga ccaagaggta ctttgccaag caccccagga tgggctacct gcctgtgcag 6420
acagtgctgg agggggacaa catggagacc cctgtgaccc tgatcaactt ctggcctgtg 6480
gactcagccc ctgccagcag cccccagctg agccatgatg acacccacag caggattgag 6540
cactatgcca gcaggctggc tgagatggag aacagcaatg gcagctacct aaatgacagc 6600
atcagcccca atgagagcat tgatgatgag cacctgctaa tccagcacta ctgccagagc 6660
ctgaaccagg acagccccct gagccagccc aggagccctg cccagatcct gatcagcctg 6720
gagtctgagg agagggggga gctggagagg atcctggctg acctggagga ggagaacagg 6780
aaccttcagg ctgagtatga caggctaaag cagcagcatg agcacaaggg cctgagcccc 6840
ctacccagcc cccctgagat gatgcccacc agcccccaga gccccaggga tgctgagctg 6900
attgctgagg ccaagctgct gaggcagcac aagggcaggc tggaggccag gatgcagatc 6960
ctagaggacc acaacaagca gctggagagc cagctacaca ggctgaggca gctgctggag 7020
cagcctcagg ctgaggccaa ggtgaatggc accacagtga gcagccccag caccagcctg 7080
cagaggtctg acagcagtca gcccatgctg ctcagggtgg tgggcagtca gacctctgac 7140
agcatgggag aggaggacct gctgagcccc cctcaggaca ccagcacagg cctggaggag 7200
gtgatggagc agctgaacaa cagcttcccc agcagcaggg gcaggaacac ccctggcaag 7260
cccatgaggg aggacaccat gtag 7284
<210> 23
<211> 4768
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 5' WL1校正
<400> 23
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt 60
ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact 120
aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta ttggccgtcc 180
gccctcggca ccatcctcac gacacccaaa tatggcgacg ggtgaggaat ggtggggagt 240
tatttttaga gcggtgagga aggtgggcag gcagcaggtg ttggcgctct aaaaataact 300
cccgggagtt atttttagag cggaggaatg gtggacaccc aaatatggcg acggttcctc 360
acccgtcgcc atatttgggt gtccgccctc ggccggggcc gcattcctgg gggccgggcg 420
gtgctcccgc ccgcctcgat aaaaggctcc ggggccggcg gcggcccacg agctacccgg 480
aggagcggga ggcgccaagc tctagagtaa gtatcaaggt tacaagacag gtttaaggag 540
accaatagaa actgggcttg tcgagacaga gaagactctt gcgtttctga taggcaccta 600
ttggtcttac tgacatccac tttgcctttc tctccacagg ccaccatgct gtggtgggag 660
gaggtggagg actgctatga gagagaggat gtgcagaaga agaccttcac caagtgggtg 720
aatgcccagt tcagcaagtt tggcaagcag cacattgaga acctgttctc tgacctgcag 780
gatggcagga ggctgctgga cctgctggag ggcctaacag gccagaagct gcccaaggag 840
aagggcagta ccagggtgca tgccctgaac aatgtgaaca aggccctgag ggtgctgcag 900
aacaacaatg tggacctagt gaatattggc agtacagata ttgtggatgg caaccacaag 960
ctgaccctgg gcctgatctg gaatattatc ctacactggc aggtgaagaa tgtgatgaag 1020
aatattatgg ctggcctgca gcagaccaac tctgagaaga tcctgctgag ctgggtgagg 1080
cagagtacca ggaactaccc ccaggtgaat gtgatcaact tcaccaccag ctggtctgat 1140
ggcctggccc tgaatgccct aatccacagc cacaggcctg acctgtttga ctggaactct 1200
gtggtgtgcc agcagtctgc cacccagagg ctagagcatg ccttcaatat tgccaggtac 1260
cagctgggga ttgagaagct gctggaccct gaggatgtgg atacaaccta ccctgacaag 1320
aagagcatcc tgatgtacat caccagcctg ttccaggtgc tgccccagca ggtgagcatt 1380
gaggccatcc aggaggtgga gatgctaccc aggcccccca aggtgaccaa ggaggagcac 1440
ttccagctac accaccagat gcactacagc cagcagatca cagtgagcct ggcccagggc 1500
tatgagagga ccagcagccc caagcccagg ttcaagagct atgcctacac ccaggctgcc 1560
tatgtgacca cctcagaccc caccaggagc cccttcccca gccagcacct ggaggcccct 1620
gaggacaaga gctttggcag cagcctgatg gagtctgagg tgaacctgga caggtaccag 1680
acagccctgg aggaggtgct gagctggctg ctgtctgctg aggataccct acaggcccag 1740
ggagagatca gcaatgatgt ggaggtggtg aaggaccagt tccataccca tgagggctac 1800
atgatggacc taacagccca ccagggcagg gtgggcaaca tcctgcagct gggcagcaag 1860
ctgattggca caggcaagct gtctgaggat gaggagacag aggtgcagga gcagatgaac 1920
ctgctgaaca gcaggtggga gtgcctgagg gtggccagta tggagaagca gagcaaccta 1980
cacagggtgc tgatggacct gcagaaccag aagctgaagg agctgaatga ctggctgacc 2040
aagacagagg agaggaccag gaagatggag gaggagcccc tgggccctga cctggaggac 2100
ctaaagaggc aggtgcagca gcacaaggtg ctgcaggagg acctggagca ggagcaggtg 2160
agggtgaaca gcctaaccca tatggtggtg gtggtggatg agagctcagg agaccatgct 2220
acagctgccc tggaggagca gctgaaggtg ctgggagaca ggtgggctaa tatctgcagg 2280
tggacagagg acaggtgggt gctgctgcag gatatcctgc tgaagtggca gaggctgaca 2340
gaggagcagt gcctgttctc tgcctggctg tcagagaagg aggatgctgt gaacaagatc 2400
catacaacag gcttcaagga ccagaatgag atgctgagca gcctgcagaa gctggctgtg 2460
ctgaaggctg acctggagaa gaagaagcag agtatgggca agctgtacag cctgaagcag 2520
gacctgctga gtaccctgaa gaacaagtct gtgacccaga agacagaggc ctggctggac 2580
aactttgcca ggtgctggga taacctggtg cagaagctgg agaagagtac agcccagatc 2640
agccaggctg tgacaacaac ccagcccagc ctgacccaga caacagtgat ggagacagtg 2700
acaacagtga caaccaggga gcagatccta gtgaagcatg cccaggagga gctgcccccc 2760
cccccccccc agaagaagag gcagattaca gtggatggcc tggagaagac agtgagcctc 2820
cagaaggacc tgtcagagat gcatgagtgg atgacccagg ctgaggagga gtacctggag 2880
agggacttcg agtacaagac ccctgatgag ctgcagaagg ctgtggagga gatgaagagg 2940
gccaaggagg aggcccagca gaaggaggcc aaggtgaagc tgctgacaga gtctgtgaac 3000
tctgtgattg cccaggcccc ccctgtggcc caggaggccc tgaagaagga gctggagacc 3060
ctgacaacca actaccagtg gctgtgcacc aggctgaatg gcaagtgcaa gaccctagag 3120
gaggtgtggg cctgctggca tgagctgctg agctacctgg agaaggccaa caagtggctg 3180
aatgaggtgg agttcaagct gaagaccaca gagaacatcc ctggaggagc tgaggagatc 3240
tctgaggtgc tggacagcct ggagaaccta atgaggcact ctgaggacaa ccccaaccag 3300
atcaggatcc tggcccagac cctgacagat ggaggagtga tggatgagct gatcaatgag 3360
gagctggaga ccttcaacag caggtggagg gagctacatg aggaggctgt gaggaggcag 3420
aagctgctag agcagatcca tacagtgagg gaggagacaa tgatggtgat gacagaggat 3480
atgcccctgg agatcagcta tgtgcccagc acctacctaa cagagatcac ccatgtgagc 3540
caggccctgc tggaggtgga gcagctgctg aatgcccctg acctgtgtgc caaggacttt 3600
gaggacctgt tcaagcagga ggagagcctg aagaacatca aggacagcct gcagcagagc 3660
tcaggcagga ttgatattat ccacagcaag aagacagctg ccctgcagtc tgctacccct 3720
gtggagaggg tgaagctgca ggaggccctg agccagctgg acttccagtg ggagaaggtt 3780
aacaagatgt acaaggacag gcagggcagg tttgacaggt cagtggagaa gtggaggagg 3840
ttccactatg acatcaagat cttcaaccag tggctgacag aggctgagca gttcctaagg 3900
aagacccaga tccctgagaa ctgggagcat gccaagtaca agtggtacct taaggagctg 3960
caggatggca ttggccagag gcagacagtg gtgaggaccc tgaatgctac aggggaggag 4020
attatccagc agagcagcaa gacagatgcc agtatcctgc aggagaagct gggcagcctg 4080
aacctgaggt ggcaggaggt gtgcaagcag ctgtctgaca ggaagaagag gctggaggag 4140
cagcctgacc tggcccctgg cctgacaaca attggagcca gccctaccca gacagtgacc 4200
ctggtgaccc agcctgtggt gaccaaggag acagctatca gcaagctgga gatgcccagc 4260
agcctgatgc tggaggtgcc tgccctggct gacttcaaca gggcctggac agagctgaca 4320
gactggctga gcctgctgga ccaggtgatc aagagccaga gggtgatggt gggggacctg 4380
gaggatatca atgagatgat tatcaagcag aaggctacaa tgcaggacct ggagcagagg 4440
aggccccagc tggaggagct gattacagct gcccagaacc taaagaacaa gaccagcaac 4500
caggaggcca ggacaattat tacagacagg attgagagga tccagaacca gtgggatgag 4560
gtgcaggagc acctgcagtt tacgcgtttt gcggccgctt tacgtagata agtagcatgg 4620
cgggttaatc attaactaca aggaacccct agtgatggag ttggccactc cctctctgcg 4680
cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg 4740
ggcggcctca gtgagcgagc gagcgcgc 4768

Claims (15)

1.一种拟抗肌萎缩蛋白(QD),其特征在于,在其N-端部分具有CH1CH2H1R1R2R3H2R8R9的结构,优选地进一步包括R16和R17杆状区。
2.如权利要求1所述的拟抗肌萎缩蛋白,其缺少R4、R5、R6、R7杆状区(ΔR4-R7)、缺少R10、R11、R12、R13、R14、R15杆状区(ΔR10-R15)、以及缺少R18和R19杆状区(ΔR18-R19)。
3.如权利要求2所述的拟抗肌萎缩蛋白,其具有如SEQ ID NO:2所示的序列,或与之具有90%相似性的序列,优选SEQ ID NO:20所示的序列。
4.如权利要求1所述的拟抗肌萎缩蛋白,其缺少R4、R5、R6、R7杆状区(ΔR4-R7)、缺少R10、R11、R12、R13杆状区(ΔR10-R13)、以及缺少R18、R19、R20和R21杆状区(ΔR18-R21)。
5.如权利要求4所述的拟抗肌萎缩蛋白,其具有如SEQ ID NO:3所示的序列,或与之具有90%相似性的序列。
6.一种核酸序列,其编码权利要求1-5中任一项所述的拟抗肌萎缩蛋白。
7.如权利要求6所述的核酸序列,其包含或由SEQ ID NO:4所示序列组成,或包含或由与之具有70%相似性的序列,优选SEQ ID NO:5或SEQ ID NO:22所示的序列组成。
8.一种包含两个AAV载体的双AAV载体系统,其特征在于,
-第一AAV载体包含在5’和3’AAV ITR之间的第一核酸序列,该序列编码拟抗肌萎缩蛋白的N端部分,
-第二AAV载体包含在5’和3’AAV ITR之间的第二核酸序列,该序列编码拟抗肌萎缩蛋白的C端部分,
所述第一和第二核酸序列包含重叠区,所述重叠区能够通过重组生产如权利要求1-5中任一项所述的拟抗肌萎缩蛋白。
9.如权利要求8所述的双AAV载体系统,其特征在于,所述第一核酸序列具有序列SEQID NO:6或SEQ ID NO:23,且所述第二核酸序列具有序列SEQ ID NO:7。
10.如权利要求8所述的双AAV载体系统,其特征在于,所述第一核酸序列具有序列SEQID NO:8,且所述第二核酸序列具有序列SEQ ID NO:9。
11.一种细胞,其特征在于,所述细胞经过如权利要求8-10中任一项所述的双AAV载体转导,优选为肌细胞。
12.一种组合物,其在药学上可接受的载体中包含有如权利要求8-10中任一项所述的双AAV载体或如权利要求11所述的细胞。
13.如权利要求8-10中任一项所述双AAV载体、如权利要求11所述的细胞或如权利要求12所述的组合物,用于制备药物。
14.如权利要求8-10中任一项所述双AAV载体、如权利要求11所述的细胞或如权利要求12所述的组合物,用于治疗肌营养不良症的疗法,特别是杜氏肌营养不良症(DMD)。
15.一种AAV载体,其特征在于,所述AAV载体是如权利要求8-10所述的双AAV载体系统中的第一AAV载体或第二AAV载体。
CN202080025191.0A 2019-03-25 2020-03-25 使用重叠的aav载体生产大型拟抗肌萎缩蛋白 Pending CN113677697A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP19305377.4 2019-03-25
EP19305377 2019-03-25
PCT/EP2020/058372 WO2020193636A1 (en) 2019-03-25 2020-03-25 Production of large-sized quasidystrophins using overlapping aav vectors

Publications (1)

Publication Number Publication Date
CN113677697A true CN113677697A (zh) 2021-11-19

Family

ID=66218030

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202080025191.0A Pending CN113677697A (zh) 2019-03-25 2020-03-25 使用重叠的aav载体生产大型拟抗肌萎缩蛋白

Country Status (6)

Country Link
US (1) US20220204574A1 (zh)
EP (1) EP3947427A1 (zh)
JP (1) JP2022526526A (zh)
CN (1) CN113677697A (zh)
CA (1) CA3130196A1 (zh)
WO (1) WO2020193636A1 (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2021265103A1 (en) * 2020-04-29 2023-01-19 Bristol-Myers Squibb Company Miniaturized dystrophins having spectrin fusion domains and uses thereof

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002029056A2 (en) * 2000-10-06 2002-04-11 Regents Of The University Of Michigan Mini-dystrophin nucleic acid and peptide sequences
WO2008088895A2 (en) * 2007-01-18 2008-07-24 University Of Missouri-Columbia Synthetic mini/micro-dystrophin genes to restore nnos to the sarcolemma
CN106659804A (zh) * 2014-06-27 2017-05-10 吉尼松公司 营养不良性疾病的高效全身治疗
WO2017223128A1 (en) * 2016-06-21 2017-12-28 The Curators Of The University Of Missouri Modified dystrophin proteins
WO2017221145A1 (en) * 2016-06-21 2017-12-28 Bamboo Therapeutics, Inc. Optimized mini-dystrophin genes and expression cassettes and their use
CN109069672A (zh) * 2016-04-15 2018-12-21 全国儿童医院研究所 腺相关病毒载体传递微肌营养不良蛋白以治疗肌营养不良症

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6156303A (en) 1997-06-11 2000-12-05 University Of Washington Adeno-associated virus (AAV) isolates and AAV vectors derived therefrom
ES2258601T3 (es) 2001-11-13 2006-09-01 The Trustees Of The University Of Pennsylvania Un metodo para la identificacion de las secuencias desconocidas del virus adeno-asociado (vaa) y un kit para el metodo.
EP3517134B1 (en) 2001-12-17 2024-01-17 The Trustees of the University of Pennsylvania Adeno-associated virus (aav) serotype 8 sequences, vectors containing same and uses therefor
CN1856576B (zh) 2003-09-30 2011-05-04 宾夕法尼亚州立大学托管会 腺伴随病毒(aav)进化支、序列、含有这些序列的载体及它们的应用
GB201507842D0 (en) 2015-05-07 2015-06-17 New Royal Holloway & Bedford Production of large-sized microdystrophins in an AAV-based vector configuration

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002029056A2 (en) * 2000-10-06 2002-04-11 Regents Of The University Of Michigan Mini-dystrophin nucleic acid and peptide sequences
WO2008088895A2 (en) * 2007-01-18 2008-07-24 University Of Missouri-Columbia Synthetic mini/micro-dystrophin genes to restore nnos to the sarcolemma
CN106659804A (zh) * 2014-06-27 2017-05-10 吉尼松公司 营养不良性疾病的高效全身治疗
CN109069672A (zh) * 2016-04-15 2018-12-21 全国儿童医院研究所 腺相关病毒载体传递微肌营养不良蛋白以治疗肌营养不良症
WO2017223128A1 (en) * 2016-06-21 2017-12-28 The Curators Of The University Of Missouri Modified dystrophin proteins
WO2017221145A1 (en) * 2016-06-21 2017-12-28 Bamboo Therapeutics, Inc. Optimized mini-dystrophin genes and expression cassettes and their use

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
KASUN KODIPPILI ET AL.: "Dual AAV Gene Therapy for Duchenne Muscular Dystrophy with a 7-kb Mini-Dystrophin Gene in the Canine Model", HUMAN GENE THERAPY, vol. 29, no. 3, pages 299 - 311, XP055618817, DOI: 10.1089/hum.2017.095 *
刘悦;马沛沛;吴士文;: "杜氏肌营养不良症生物学治疗的研究进展", 中华灾害救援医学, no. 04, pages 61 - 64 *

Also Published As

Publication number Publication date
EP3947427A1 (en) 2022-02-09
WO2020193636A1 (en) 2020-10-01
JP2022526526A (ja) 2022-05-25
US20220204574A1 (en) 2022-06-30
CA3130196A1 (en) 2020-10-01

Similar Documents

Publication Publication Date Title
KR102606174B1 (ko) 트리플 가이드 서열을 지닌 crispr/cas9를 사용한 엑손 스키핑 변형을 위한 최적화된 전략
AU2020200948B2 (en) Compositions and methods for enhanced gene expression in cone cells
RU2761564C9 (ru) Композиции и способы деградации неправильно упакованных белков
KR102652994B1 (ko) 조절성 폴리뉴클레오티드
US20210340565A1 (en) Methods and pharmaceutical composition for the treatment and the prevention of cardiomyopathy due to energy failure
KR20200044793A (ko) Aav의 전달을 위한 조성물 및 방법
KR20200116933A (ko) 인간 심근세포에서 디스트로핀 돌연변이를 교정하기 위한 조성물 및 방법
KR20220139956A (ko) 심장 질환을 치료하기 위한 유전자 요법 벡터
KR20210082193A (ko) 소형화 디스트로핀 및 그의 용도
KR20070059058A (ko) 망막 디스트로핀 외래도입유전자 및 이의 이용 방법
KR20230069157A (ko) GJB2를 코딩하는 재조합 아데노 연관 바이러스 (rAAV) 및 그의 용도
CN113677697A (zh) 使用重叠的aav载体生产大型拟抗肌萎缩蛋白
EP3880827A1 (en) Compositions and methods of inducing differentiation of a hair cell
KR20230003557A (ko) 스펙트린 융합 도메인을 갖는 소형화 디스트로핀 및 그의 용도
RU2780329C2 (ru) Варианты кислой альфа-глюкозидазы и их использование
KR20230148824A (ko) 핵산을 전달하기 위한 조성물 및 방법
KR20230066453A (ko) 비바이러스성 dna 벡터 및 fviii 치료제 발현을 위한 이의 용도
KR20240056729A (ko) 조절성 폴리뉴클레오티드
Class et al. Patent application title: Synthetic mini/micro-dystrophin genes to restore nNOS to the sarcolemma Inventors: Dongsheng Duan (Columbia, MO, US) Yi Lai (Columbia, MO, US) Yongping Yue (Columbia, MO, US)
NZ710497B2 (en) Methods and pharmaceutical composition for the treatment and the prevention of cardiomyopathy due to energy failure

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination