CN112725282A - Construction of a stable cell line carrying an orthogonal tRNA/aminoacyl-tRNA synthetase - Google Patents

Construction of a stable cell line carrying an orthogonal tRNA/aminoacyl-tRNA synthetase

Publication number
CN112725282A
Authority
CN
China
Prior art keywords
amino acid
cell line
protein
trna
vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110012018.3A
Other languages
English (en)
Inventor
周德敏
夏青
徐欢
张博
司龙龙
杨琦
姚天卓
张礼和
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Peking University
Original Assignee
Peking University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Peking University
Priority to CN202110012018.3A
Publication of CN112725282A
Legal status: Pending

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • C12N5/06Animal cells or tissues; Human cells or tissues
    • C12N5/0602Vertebrate cells
    • C12N5/0684Cells of the urinary tract or kidneys
    • C12N5/0686Kidney cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/65Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression using markers
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • C12N15/86Viral vectors
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0069Oxidoreductases (1.) acting on single donors with incorporation of molecular oxygen, i.e. oxygenases (1.13)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/93Ligases (6)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y113/00Oxidoreductases acting on single donors with incorporation of molecular oxygen (oxygenases) (1.13)
    • C12Y113/12Oxidoreductases acting on single donors with incorporation of molecular oxygen (oxygenases) (1.13) with incorporation of one atom of oxygen (internal monooxygenases or internal mixed function oxidases)(1.13.12)
    • C12Y113/12007Photinus-luciferin 4-monooxygenase (ATP-hydrolysing) (1.13.12.7), i.e. firefly-luciferase
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y601/00Ligases forming carbon-oxygen bonds (6.1)
    • C12Y601/01Ligases forming aminoacyl-tRNA and related compounds (6.1.1)
    • C12Y601/01026Pyrrolysine-tRNAPyl ligase (6.1.1.26)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2510/00Genetically modified cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2740/00Reverse transcribing RNA viruses
    • C12N2740/00011Details
    • C12N2740/10011Retroviridae
    • C12N2740/15011Lentivirus, not HIV, e.g. FIV, SIV
    • C12N2740/15041Use of virus, viral particle or viral elements as a vector
    • C12N2740/15043Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/10Plasmid DNA
    • C12N2800/106Plasmid DNA for vertebrates
    • C12N2800/107Plasmid DNA for vertebrates for mammalian
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2840/00Vectors comprising a special translation-regulating system
    • C12N2840/20Vectors comprising a special translation-regulating system translation of more than one cistron
    • C12N2840/203Vectors comprising a special translation-regulating system translation of more than one cistron having an IRES

Abstract

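The present invention relates to a method for constructing a stable cell line carrying an orthogonal tRNA/aminoacyl-tRNA synthetase. Based on genetic code expansion technology, a pair of orthogonal tRNA/aminoacyl-tRNA synthetase is used to introduce an unnatural amino acid site-specifically into a protein. The invention also relates to a method for constructing a dual lentiviral vector system, a method for constructing a vector carrying multiple copies of the orthogonal tRNA, and a method for stably integrating the orthogonal tRNA/aminoacyl-tRNA synthetase genes into the cell genome by stable dual-lentivirus transduction and stable plasmid transfection. The invention further relates to uses of the stable cell line, such as expression of a target protein containing an unnatural amino acid.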

Description

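Construction of a stable cell line carrying an orthogonal tRNA/aminoacyl-tRNA synthetase
This application is a divisional application of Chinese patent application No. 2016100555428, filed on January 27, 2016 and entitled "Construction of a stable cell line carrying an orthogonal tRNA/aminoacyl-tRNA synthetase".
Technical Field
The present invention belongs to the field of biopharmaceuticals and relates to a method for constructing a stable cell line carrying an orthogonal tRNA/aminoacyl-tRNA synthetase.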
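Background Art
(1) Genetic code expansion technology
Genetic code expansion technology has developed rapidly in recent years. By using the amber stop codon as a sense codon and introducing the corresponding orthogonal tRNA and aminoacyl-tRNA synthetase, a designed unnatural amino acid can be introduced into a protein. Depending on the properties of the unnatural amino acid, the protein can be given special functions. To date, this technology has been used to express more than a hundred unnatural amino acids site-specifically on protein surfaces. These unnatural amino acids carry diverse functional groups, including azide, alkyne, ketone, aldehyde, alkene, amide, nitro, phosphate and sulfonate groups, and can undergo a variety of bioorthogonal reactions, such as click chemistry, photosensitive reactions, glycosylation and photocrosslinking.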
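(2) Application of genetic code expansion technology in protein drug development
The development of modern biotechnology has made large-scale production of protein drugs a reality, and a growing number of such drugs are in clinical use. Protein drugs comprise peptides and genetically engineered drugs, monoclonal antibodies and genetically engineered antibodies, and recombinant vaccines. Compared with conventional small-molecule drugs, protein drugs offer high activity, high specificity, low toxicity and well-defined biological functions, which favors clinical application. However, protein drugs also suffer from poor stability, poor membrane permeability and short biological half-life, which limit their therapeutic potential and clinical application. Engineering or modifying the structure of a natural protein is an effective way to obtain better pharmacokinetic properties. Most modifications start by altering the properties of the recombinant protein, for example by increasing the relative molecular mass, slowing protease degradation, reducing immunogenicity, or improving biological and chemical stability. They thereby improve in vivo pharmacokinetics, prolong in vivo half-life or accelerate in vivo release, reduce the rate of neutralizing antibody formation, and improve patient compliance and therapeutic efficacy. Given the many advantages of engineered or modified proteins, recombinant re-engineering and in vivo and in vitro modification of proteins are bound to find ever wider use. The shortcoming of first-generation protein modification technology is that the conjugation site cannot be controlled; conventional modification methods are therefore neither site-specific nor quantitative, and are ill-suited to quality control in large-scale manufacturing. Unnatural amino acid modification technology, with its advantage of site controllability, has broad prospects in protein modification. Taking antibody-drug conjugates (ADCs) as an example, compared with conventional ADCs whose modification sites are uncontrolled, site-specific ADCs are more specific, homogeneous in composition and less toxic, and are undoubtedly the direction in which targeted drugs will develop (Tian Feng et al., PNAS, 2014, 111:1766–1771).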
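(3) Bottleneck for industrial application of genetic code expansion technology
According to the type of host cell, protein expression systems can be broadly divided into prokaryotic, yeast, plant, insect and mammalian cell expression systems. Compared with the other systems, mammalian cell expression systems have the advantage of directing correct protein folding and providing a variety of post-translational processing functions such as complex N-glycosylation and accurate O-glycosylation, so the expressed product is closest to the natural protein of higher organisms in molecular structure, physicochemical properties and biological function. In industry, high-yield, large-scale production of protein drugs requires a stable cell line that expresses the target gene at a high level. For applying unnatural amino acids to protein drug development, a pressing problem is how to construct engineered cells with stably integrated orthogonal tRNA/aminoacyl-tRNA synthetase. Because the transcription and processing of tRNA differ from those of proteins, achieving efficient and stable expression of an orthogonal tRNA remains an internationally recognized challenge. Constructing engineered cells with stably integrated orthogonal tRNA/aminoacyl-tRNA synthetase will effectively promote the application of genetic code expansion technology in protein drug development.
The luciferase reporter gene is a reporter system that uses luciferin as the substrate to detect firefly luciferase activity. Luciferase catalyzes the oxidation of luciferin to oxyluciferin, and bioluminescence is emitted during the oxidation. The bioluminescence released can then be measured with a luminometer (also called a chemiluminescence reader) or a liquid scintillation counter. The luciferin/luciferase bioluminescence system detects gene expression with very high sensitivity and efficiency. The gene sequence of the luciferase is shown in SEQ ID NO: 1, and the NCBI accession number of its amino acid sequence is AAP46189.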
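Summary of the Invention:
After reflection on and study of the prior art, the inventors integrated the protein translation system consisting of the tRNA (tRNAPyl) of an archaeal methanogen (its sequence is shown in SEQ ID NO: 8) and pyrrolysyl-tRNA synthetase (PylRS) (its gene sequence is shown in SEQ ID NO: 9) into the genome of mammalian cells (for example HEK293T cells). The inventors first used a dual lentiviral system to integrate the orthogonal pyrrolysyl-tRNA synthetase and a GFP reporter gene bearing an amber codon mutation into the host cells. They then integrated the orthogonal tRNA into the host cells by stable transfection with a linearized plasmid carrying multiple copies of the orthogonal tRNA, thereby obtaining a stable cell line carrying the orthogonal tRNA/aminoacyl-tRNA synthetase. This stable cell line allows an unnatural amino acid to be inserted site-specifically on the surface of a target protein, yielding a site-specifically mutated target protein such as a mutant luciferase protein.
Compared with other methods, the advantages of the present invention may be reflected in one or more of the following:
1. A dual lentiviral system was constructed that allows stable expression of two proteins simultaneously;
2. A plasmid carrying a high copy number of the orthogonal tRNA was constructed, allowing stable expression of the orthogonal tRNA;
3. A stable cell line carrying the orthogonal tRNA/aminoacyl-tRNA synthetase was obtained;
4. Using this stable cell line, an unnatural amino acid can be introduced at any site of a target protein, creating a protein that can be specifically modified at that site only;
5. The reactive group unique to the unnatural amino acid enables efficient and specific modification.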
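Specifically, in one embodiment of the invention, the orthogonal tRNA/aminoacyl-tRNA synthetase genes were integrated into HEK293T host cells in six main steps: (1) construct viral vector No. 1, pSD31-pylRS-IRES-puro, carrying the orthogonal aminoacyl-tRNA synthetase gene; (2) construct viral vector No. 2, pSD31-GFP39TAG-IRES-hygro, carrying a green fluorescent protein reporter gene with an amber codon mutation at a specific site; (3) construct the vector pXH-12tRNA-zeo, carrying 12 copies of the orthogonal tRNA driven by type-3 Pol III promoters; (4) package virus No. 1 from step (1) and virus No. 2 from step (2) and transduce HEK 293T cells to obtain a stable cell line with the orthogonal aminoacyl-tRNA synthetase gene and the mutant GFP reporter gene integrated; (5) linearize the vector pXH-12tRNA-zeo from step (3), transfect it into the stable cell line from step (4), and select using the bleomycin resistance gene it carries; (6) add the unnatural amino acid to the medium, pick single clones showing green fluorescence and expand them, finally obtaining the stable cell line HEK293-PYL.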
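The stable cell line can insert an unnatural amino acid into the reporter gene product because the integrated mutant tRNAPyl/PylRS pair satisfies the following relationships: (1) the mutant tRNAPyl cannot be used by the host cell's lysyl-tRNA synthetase and can only be acylated by the mutant PylRS; (2) the mutant PylRS can only acylate tRNAPyl and cannot acylate any other tRNA. The relationship between the mutant tRNAPyl and PylRS is therefore orthogonal: the mutant PylRS acylates only the mutant tRNAPyl, and the mutant tRNAPyl is acylated only by the mutant PylRS, so the two are strictly specific for each other. This orthogonal enzyme, and only this enzyme, can charge the unnatural amino acid onto the orthogonal tRNA, and it charges only this tRNA and no other. The resulting orthogonal lysyl-tRNA synthetase/tRNA system assigns Lys-azido (also called NAEK), which is not one of the 20 common amino acids, to the amber codon, so that the unnatural amino acid is introduced site-specifically into the GFP reporter protein or another target protein.
The mutant PylRS is integrated into the genome of the stable cell line by the lentivirus pSD31-pylRS-IRES-puro, where IRES is an internal ribosome entry site, commonly used for polycistronic gene expression (Pelletier J. et al., Nature, 1988, 334:320–325). For example, an IRES sequence is inserted after the target gene and followed by a selection marker gene, so that the transcribed mRNA expresses both proteins. Overexpressing a target gene with an IRES system has two advantages: 1. the target gene and the marker gene share one promoter, which avoids false positives; 2. IRES-driven translation is less efficient than conventional translation initiation, so the target gene is expressed at a higher level than the marker gene (Kozak M. et al., Nucleic Acids Res, 2005, 33:6593–6602). With this dual lentiviral system, two proteins can be stably overexpressed in the host cell at the same time.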
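The read-through logic described above can be illustrated with a toy model. The following Python sketch is not part of the invention: it assumes the standard genetic code and a hypothetical function `translate` in which the amber codon TAG is decoded as the unnatural amino acid (written `X`) only when both the orthogonal pair and the unnatural amino acid are available, and otherwise terminates translation. The short ORF is invented for illustration.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(orf: str, orthogonal_pair: bool, ncaa_in_medium: bool) -> str:
    """Translate an ORF; TAG is read as 'X' only if suppression is possible."""
    orf = orf.upper()
    protein = []
    for i in range(0, len(orf) - len(orf) % 3, 3):
        codon = orf[i:i + 3]
        if codon == "TAG" and orthogonal_pair and ncaa_in_medium:
            protein.append("X")   # amber codon decoded as the unnatural amino acid
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":             # ordinary stop, or unsuppressed amber: terminate
            break
        protein.append(aa)
    return "".join(protein)

# Hypothetical toy ORF: Met-Ala-(amber)-Gly-Lys-stop
toy_orf = "ATGGCTTAGGGTAAATAA"
with_ncaa = translate(toy_orf, orthogonal_pair=True, ncaa_in_medium=True)
without_ncaa = translate(toy_orf, orthogonal_pair=True, ncaa_in_medium=False)
print(with_ncaa, without_ncaa)
```

In this model the full-length product appears only when the unnatural amino acid is supplied, which mirrors the GFP read-out used to screen clones.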
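In one embodiment of the invention, a dual lentiviral system, pSD31-IRES-puro and pSD31-IRES-hygro, was constructed. Both viral vectors are derived from the viral vector pSD31 (Zhang Jing et al., RNA, 2007, 13:1375–1383): the SV40-promoter-driven puroR gene on pSD31 was replaced with IRES-puroR or IRES-hygroR, giving two viral vectors with different resistance markers, pSD31-IRES-puro and pSD31-IRES-hygro.
In one embodiment, the invention provides a method for stably integrating an orthogonal tRNA in mammalian cells. The principle is to drive the prokaryotic tRNA with a suitable eukaryotic promoter and express it in tandem to increase the probability of integration. The invention uses type-3 RNA polymerase III (type-3 Pol III) promoters. Transcription from this class of promoter relies on the promoter's own elements and does not require any internal transcription elements (such as the A- and B-boxes) in the downstream coding sequence. Such a promoter can therefore drive, in eukaryotic cells, a prokaryotic tRNA that lacks the internal A- and B-box promoter elements. In the invention, 12 copies of the orthogonal tRNA driven by type-3 Pol III promoters were placed in tandem on the shuttle vector pXH, and a eukaryotic selection marker, the bleomycin resistance gene, was introduced into pXH to give the vector pXH-12tRNA-zeo. The vector is linearized and transfected into cells, which are then selected with bleomycin; single clones are isolated and characterized to obtain cells stably expressing the orthogonal tRNA.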
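To illustrate the tandem design, the sketch below lays out 12 promoter-tRNA cassettes by cycling through four type-3 Pol III promoters (the names 7sk, hu6, H1 and mu6 are taken from Example 2). The cyclic order is an assumption made for illustration; the actual order of the cassettes in pXH-12tRNA-zeo is defined only by SEQ ID NO: 6.

```python
from itertools import cycle, islice

PROMOTERS = ["7sk", "hu6", "H1", "mu6"]   # type-3 Pol III promoters named in Example 2
N_COPIES = 12

def tandem_layout(promoters, n_copies):
    """Return n_copies promoter-tRNA cassettes, cycling through the promoters."""
    return [f"{p}-tRNA" for p in islice(cycle(promoters), n_copies)]

layout = tandem_layout(PROMOTERS, N_COPIES)
# With a cyclic order, identical promoters are never directly adjacent.
adjacent_repeats = sum(a == b for a, b in zip(layout, layout[1:]))
print(layout)
print("adjacent identical cassettes:", adjacent_repeats)
```

Spacing identical promoters apart is one plausible way to limit direct repeats next to each other; whether the inventors used this exact arrangement is not stated.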
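More specifically, the invention provides:
1. The dual viral vectors pSD31-IRES-puro and pSD31-IRES-hygro, which in combination allow simultaneous overexpression of two proteins by means of an internal ribosome entry site (IRES). pSD31-IRES-puro carries a puromycin resistance gene and pSD31-IRES-hygro carries a hygromycin B resistance gene, for selection in eukaryotic cells. The sequence of pSD31-IRES-puro is shown in SEQ ID NO: 2. The sequence of pSD31-IRES-hygro is shown in SEQ ID NO: 3.
2. Viral vector No. 1, pSD31-pylRS-IRES-puro, carrying the orthogonal aminoacyl-tRNA synthetase gene; its sequence is shown in SEQ ID NO: 4. After this vector is packaged into virus and used to transduce cells, puromycin selection allows the orthogonal aminoacyl-tRNA synthetase to be integrated into the host cells.
3. Viral vector No. 2, pSD31-GFP39TAG-IRES-hygro, carrying a green fluorescent protein reporter gene in which Tyr39 is mutated to an amber codon. After this vector is packaged into virus and used to transduce cells, hygromycin B selection allows the reporter gene to be integrated into the host cells. The sequence of pSD31-GFP39TAG-IRES-hygro is shown in SEQ ID NO: 5.
4. The vector pXH-12tRNA-zeo, carrying 12 copies of the orthogonal tRNA driven by type-3 Pol III promoters. After this vector is linearized and transfected into cells, bleomycin selection allows the orthogonal tRNA to be integrated into the host cells. The sequence of pXH-12tRNA-zeo is shown in SEQ ID NO: 6.
5. The stable cell line HEK293-PYL (deposited at the China General Microbiological Culture Collection Center (CGMCC) on November 17, 2015 under accession number CGMCC No: 11592; classified as human HEK293T cells). This cell line was obtained by two rounds of viral transduction and one round of stable plasmid transfection and carries the orthogonal tRNA/aminoacyl-tRNA synthetase genes. Using this stable cell line, an unnatural amino acid can be introduced at any site of a target protein, creating a starting protein that can be specifically modified at that site only.
6. A site-specifically mutated protein, for example the firefly luciferase reporter, in which the amino acid at position F14 is mutated to an unnatural amino acid, the unnatural amino acid being the azide-containing unnatural amino acid Lys-azido (NAEK). The description below uses this unnatural amino acid as the example.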
Figure BDA0002885359960000081
The system is also applicable to DiZPK, an unnatural amino acid containing a photocrosslinking group
Figure BDA0002885359960000082
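By way of example, the mutation site may be one or more amino acids at any position of the luciferase encoded by SEQ ID NO: 1. Preferably, the mutation site is selected from position F14 of the luciferase encoded by SEQ ID NO: 1 or other sites with little effect on activity.
7. A site-specifically mutated target protein whose amino acid sequence differs from that of the protein before mutation in that the amino acid at position N is mutated to NAEK, the mutated amino acid being linked within the protein as shown in the following formula: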
Figure BDA0002885359960000083
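The direction from R1 to R2 is the N-terminal to C-terminal direction of the amino acid sequence; R1 is amino acid residues 1 to N-1 of the protein,
R2 is the amino acid residues from position N+1 to the C-terminus of the protein, and R4 is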
Figure BDA0002885359960000084
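8. A nucleic acid molecule encoding the mutated target protein (for example luciferase) of any one of items 6-7. By way of example, the nucleic acid molecule differs from SEQ ID NO: 1 in that the codon encoding one amino acid at position F14, or at another site with little effect on activity and stability, is mutated to an amber codon.
9. A method for preparing a target protein (for example luciferase) containing an unnatural amino acid, comprising the steps of:
(1) obtaining the stable cell line HEK293-PYL carrying the orthogonal tRNA/aminoacyl-tRNA synthetase genes (the cell line deposited at the China General Microbiological Culture Collection Center (CGMCC) on November 17, 2015 under accession number CGMCC No: 11592; classified as human HEK293T cells);
(2) selection: selecting, in the amino acid sequence of the target protein, one or more specific amino acid sites to be mutated;
(3) gene mutation: mutating, by genetic engineering, the codon encoding the amino acid of the target protein at each site selected in (2) to an amber codon;
(4) expression vector construction: operably linking the coding sequence of the mutated target protein obtained in step (3) to a suitable vector to give an expression vector for the mutant sequence;
(5) expression: transfecting the expression vector obtained in step (4) into the stable cell line HEK293-PYL of step (1), culturing the successfully transfected host cells in medium containing NAEK, and harvesting the cells at an appropriate time;
(6) lysing the cells and measuring the expression level of the target protein (for example luciferase) containing the unnatural amino acid.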
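Step (3) can be sketched in a few lines of Python. The fragment below is the first 60 nucleotides of SEQ ID NO: 1 as given in the sequence listing; the helpers `codon_at` and `to_amber` are hypothetical names introduced here. The sketch only edits a string and says nothing about how the mutagenesis is carried out at the bench.

```python
# First 60 nt (codons 1-20) of the firefly luciferase coding sequence, SEQ ID NO: 1.
LUC_5PRIME = (
    "atggaagacg" "ccaaaaacat" "aaagaaaggc"
    "ccggcgccat" "tctatccgct" "ggaagatgga"
)

def codon_at(cds: str, position: int) -> str:
    """Return the codon at a 1-based amino acid position."""
    start = 3 * (position - 1)
    return cds[start:start + 3]

def to_amber(cds: str, position: int) -> str:
    """Replace the codon at a 1-based position with the amber codon 'tag'."""
    start = 3 * (position - 1)
    if position < 1 or start + 3 > len(cds):
        raise ValueError("position lies outside the sequence")
    return cds[:start] + "tag" + cds[start + 3:]

wild_type_codon = codon_at(LUC_5PRIME, 14)   # codon for F14 in the wild-type sequence
mutant = to_amber(LUC_5PRIME, 14)
print(wild_type_codon, "->", codon_at(mutant, 14))
```

Checking the wild-type codon before replacing it is a cheap guard against off-by-one errors in the position numbering.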
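The stable cell line HEK293-PYL of the invention carries the orthogonal tRNA/aminoacyl-tRNA synthetase genes. By way of example, the stable cell line of the invention is the cell line with accession number CGMCC No: 11592.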
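Brief Description of the Drawings:
Figure 1: Construction of the dual lentiviral vectors
A: Schematic of the lentiviral vector pSD31;
B: Schematic of the dual lentiviral vectors pSD31-IRES-puro and pSD31-IRES-hygro. Starting from pSD31, the SV40 promoter and puromycin resistance gene were replaced with IRES-puro or IRES-hygro by BamHI/XbaI double digestion, giving two viral vectors carrying puromycin resistance and hygromycin B resistance, respectively;
C: The dual viral vectors pSD31-pylRS-IRES-puro and pSD31-GFP39TAG-IRES-hygro. A CMV promoter and the orthogonal aminoacyl-tRNA synthetase gene were introduced into pSD31-IRES-puro by BamHI single digestion, and a CMV promoter and the mutant green fluorescent protein (GFP) gene were introduced into pSD31-IRES-hygro by BamHI single digestion.
Figure 2: Construction of the pXH-12tRNA-zeo vector
A: Schematic of the empty pXH vector;
B: Schematic of the pXH-12tRNA-zeo vector.
Figure 3: Workflow for selecting the stable cell line
The stable cell line HEK293-PYL carrying the orthogonal tRNA/aminoacyl-tRNA synthetase genes was obtained through three rounds of selection. In round 1, pSD31-pylRS-IRES-puro virus was packaged and used to transduce HEK293T cells, which were selected with 0.6 μg/ml puromycin to give stable cell line No. 1, expressing the orthogonal aminoacyl-tRNA synthetase. In round 2, pSD31-GFP39TAG-IRES-hygro virus was packaged and selection was carried out with 200 μg/ml hygromycin to give stable cell line No. 2, expressing both the orthogonal aminoacyl-tRNA synthetase and the mutant GFP reporter. In round 3, the plasmid pXH-12tRNA-zeo was linearized by restriction digestion and transfected into stable cell line No. 2; the cells were selected with 400 μg/ml zeomycin, with the unnatural amino acid NAEK added during culture. GFP-positive clones were isolated and purified and then expanded with zeomycin at half the dose, finally giving the stable cell line HEK293-PYL.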
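Figure 4: Characterization of the stable cell line
A: Structure of the unnatural amino acid Lys-azido (NAEK) added to the stable cell line culture;
B: GFP imaging of the stable cell line with and without the unnatural amino acid; the GFP gene bearing the amber stop codon mutation is read through only when the unnatural amino acid is added;
C: Western blot of orthogonal aminoacyl-tRNA synthetase and GFP expression in the stable cell line with and without the unnatural amino acid; full-length GFP is detected only when the unnatural amino acid is added, consistent with Figure 4B;
D: Firefly luciferase reporter assay testing whether the stable cell line can introduce an unnatural amino acid at a chosen site of a target protein; the luciferase readings show that full-length, active mutant firefly luciferase is obtained after the unnatural amino acid is added.
To aid understanding of the invention, the inventors describe and illustrate specific experiments by way of examples. The examples are for illustration only and do not limit the scope of protection of the invention. Any variant or embodiment equivalent to the invention is included in the invention.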
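Example 1: Construction of the dual lentiviral vectors
(1) Obtaining the vector backbone
The backbone of the dual lentiviral vectors is the lentiviral vector pSD31 (Zhang Jing et al., RNA, 2007, 13:1375–1383), in which the SV40 promoter drives expression of the puromycin resistance protein puroR.
(2) Primer design for SOE PCR
Using SOE PCR, the inventors spliced the internal ribosome entry site (IRES) to the DNA fragment of the puromycin resistance gene or the hygromycin B resistance gene, giving the IRES-puro and IRES-hygro fragments, respectively. The primers are listed in the table below.
Table 1: SOE PCR primers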
Figure BDA0002885359960000111
Figure BDA0002885359960000121
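(3) Modification of the lentiviral vector
Starting from pSD31, the SV40 promoter and puromycin resistance gene fragment were replaced with the IRES-puro or IRES-hygro fragment by BamHI/XbaI double digestion, giving two viral vectors carrying puromycin resistance and hygromycin B resistance, respectively.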
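The BamHI/XbaI replacement can be checked in silico by locating the two recognition sites (GGATCC for BamHI, TCTAGA for XbaI). The sketch below scans two short stretches copied from SEQ ID NO: 2 (pSD31-IRES-puro) in the sequence listing, one just upstream of the IRES and one just after the puromycin resistance gene. `find_sites` is a hypothetical helper, and a real analysis would scan the whole vector on both strands (both sites are palindromic, so one strand suffices here).

```python
SITES = {"BamHI": "ggatcc", "XbaI": "tctaga"}

def find_sites(seq: str, sites: dict) -> dict:
    """Map each enzyme name to the 0-based start positions of its site in seq."""
    seq = seq.lower()
    hits = {}
    for enzyme, site in sites.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start)
            start = seq.find(site, start + 1)
        hits[enzyme] = positions
    return hits

# Stretches copied from SEQ ID NO: 2 (around nt 1451-1490 and nt 2671-2700).
upstream_of_ires = "cagagatccagtttggggatccaattccgcccctctccct"
after_puro = "aagcccggtgcctgatctagaggatcataa"

print(find_sites(upstream_of_ires, SITES))
print(find_sites(after_puro, SITES))
```

Finding BamHI only in the first stretch and XbaI only in the second is consistent with the IRES-puro cassette sitting between the two sites.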
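Example 2: Construction of the pXH-12tRNA-zeo vector
To ensure an adequate tRNA expression level, multiple tandem copies of promoter-tRNA need to be cloned into a suitable vector. The invention uses the empty pXH vector as the backbone; a zeomycin-polyA sequence was introduced downstream of the SV40 promoter via the EcoRI site, conferring bleomycin resistance. Twelve copies of the promoter-tRNA sequence were then cloned into the pXH-zeo vector via the SalI site. To reduce the probability of recombination between repeated sequences, four different tRNA promoters were used: 7sk/hu6/H1/mu6. This finally gave the vector for tRNA selection, bjmu-12t-zeo (the pXH-12tRNA-zeo vector).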
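(1) Obtaining the vector backbone
The backbone of pXH-12tRNA-zeo is the vector pXH, a shuttle vector derived from pUC19, with advantages including the ability to replicate in eukaryotic cells, small size and a built-in multiple cloning site. The sequence of pXH is shown in SEQ ID NO: 7.
(2) Primer design for SOE PCR
Using SOE PCR, the inventors spliced the promoter (type-3 Pol III) sequences to the DNA fragment of the orthogonal tRNA. The promoters used were the human 7sk promoter, the human u6 promoter, the human H1 promoter and the mouse u6 promoter, giving the fragments 7sk-tRNA, hu6-tRNA, H1-tRNA and mu6-tRNA, respectively. The promoter sequences, tRNA sequence and primers are listed in the tables below.
Table 2-1. Promoter and tRNA sequences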
Figure BDA0002885359960000131
Table 2-2. Primers for SOE-PCR gene amplification
Figure BDA0002885359960000132
Figure BDA0002885359960000141
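(3) Modification of the pXH vector
Starting from the pXH vector, a zeomycin-polyA sequence was introduced downstream of the SV40 promoter via the EcoRI site to confer bleomycin resistance, giving the pXH-zeo vector. The vector was then digested with SalI alone, and the promoter-tRNA fragments were each double-digested with the isocaudomers SalI/XhoI; 12 copies of the promoter-tRNA sequence were cloned into the pXH-zeo vector to give the pXH-12tRNA-zeo vector.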
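The reason a SalI/XhoI isocaudomer pair suits tandem cloning can be shown with strings. SalI (G^TCGAC) and XhoI (C^TCGAG) leave the same TCGA overhang, so an XhoI end ligates to a SalI end, but the resulting hybrid junction is recognized by neither enzyme. The sketch below is a simplified model under stated assumptions: each cassette is flanked SalI-payload-XhoI, inserts in a single orientation, and the cassette order is invented; placeholders such as `[7sk-tRNA]` stand in for real sequence.

```python
SALI = "GTCGAC"   # SalI recognition site, cut G^TCGAC
XHOI = "CTCGAG"   # XhoI recognition site, cut C^TCGAG (same TCGA overhang)

def insert_cassette(vector: str, payload: str) -> str:
    """Ligate a SalI/XhoI-digested cassette into the vector's single SalI site."""
    if vector.count(SALI) != 1:
        raise ValueError("vector must contain exactly one SalI site")
    cut = vector.index(SALI) + 1               # SalI cuts after the first G
    left, right = vector[:cut], vector[cut:]   # ...G | TCGAC...
    digested = "TCGAC" + payload + "C"         # ends left by SalI and by XhoI
    return left + digested + right

vector = "nnnn" + SALI + "nnnn"                # placeholder backbone
for name in ["7sk", "hu6", "H1", "mu6"] * 3:   # 12 rounds; order is assumed
    vector = insert_cassette(vector, f"[{name}-tRNA]")

n_cassettes = vector.count("tRNA]")
print(n_cassettes, vector.count(SALI), vector.count(XHOI))
```

In this model each round regenerates exactly one SalI site upstream of the new cassette, so the vector can be reopened with SalI for the next insertion while the earlier junctions stay intact.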
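Example 3: Selection of the stable cell line
(1) Lentivirus packaging and transduction, comprising the following steps:
a. Seeding HEK 293T cells: using medium A (DMEM + 10% FBS, 1× NEAA, without sodium pyruvate), cells were trypsinized and counted, and seeded in six-well plates at 4×10^5 cells per well.
b. Lentivirus packaging: cells were transfected at 70%–80% confluence, with the amounts of plasmid and transfection reagent given in Table 3-1. Six hours after transfection the medium was changed to medium B (DMEM + 3% FBS, 1× NEAA, with sodium pyruvate) and culture was continued. Virus supernatant was harvested 48 h and 72 h after transfection and filtered through a 0.45 μm PVDF syringe filter.
Table 3-1. Plasmid amounts for lentivirus packaging
Plasmid / transfection reagent    Amount per well
Opti-MEM                          200 μl
Transfer vector                   0.72 μg
pRSV                              0.64 μg
VSVG                              0.32 μg
pRRE                              0.32 μg
Megatran 1.0                      6 μl
c. Viral transduction: the cells to be infected were counted and seeded in six-well plates one day in advance; 2 ml of virus supernatant was added per well, and Polybrene was added to a final concentration of 8 μg/ml in the virus supernatant.
d. Virus titer determination: by serial dilution, infecting HT1080 cells to form colonies.
e. Antibiotic selection: antibiotic selection can begin 24 h after transduction. The selection concentration should be determined from the kill curve of the specific cell line; for 293T cells, the puromycin selection concentration was 0.6 μg/ml and the hygromycin B selection concentration was 200 μg/ml. Selection was continued for 10 days, until all cells in the blank group without virus had died and the experimental group with virus had formed single clones; single clones were expanded to give the stable cell line.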
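When several wells are packaged in parallel, the per-well recipe in Table 3-1 is usually scaled into a master mix. The sketch below does that arithmetic; the function name `master_mix` and the 10% pipetting overage are assumptions introduced here, not values from the patent.

```python
# Per-well amounts from Table 3-1 (six-well plate).
PER_WELL = {
    "Opti-MEM (ul)": 200.0,
    "Transfer vector (ug)": 0.72,
    "pRSV (ug)": 0.64,
    "VSVG (ug)": 0.32,
    "pRRE (ug)": 0.32,
    "Megatran 1.0 (ul)": 6.0,
}

def master_mix(n_wells: int, overage: float = 0.1) -> dict:
    """Scale the per-well recipe to n_wells, adding a fractional overage (assumed 10%)."""
    if n_wells < 1:
        raise ValueError("n_wells must be at least 1")
    factor = n_wells * (1.0 + overage)
    return {name: round(amount * factor, 3) for name, amount in PER_WELL.items()}

# Total DNA per well and the implied reagent-to-DNA ratio.
total_dna_ug = sum(v for k, v in PER_WELL.items() if k.endswith("(ug)"))
reagent_ul_per_ug = PER_WELL["Megatran 1.0 (ul)"] / total_dna_ug

for name, amount in master_mix(6).items():
    print(f"{name}: {amount}")
print(f"total DNA per well: {total_dna_ug:.2f} ug, reagent: {reagent_ul_per_ug:.1f} ul per ug DNA")
```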
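(2) Stable transfection of the pXH-12tRNA-zeo vector, comprising the following steps:
a. The pXH-12tRNA-zeo vector was linearized by restriction digestion and transfected into the stable cell line selected in (1).
b. Six hours after transfection the medium was changed and the unnatural amino acid NAEK was added.
c. Forty-eight hours after transfection, green fluorescence was observed, the medium was changed, and 400 μg/ml zeomycin was added.
d. The medium was changed every 3 days until all cells in the blank group had died and the transfected group had formed clones.
e. GFP-positive clones were isolated and purified and then expanded with bleomycin at half the dose, giving the stable cell line HEK293-PYL.
The stable cell line HEK293-PYL can be obtained from the China General Microbiological Culture Collection Center (CGMCC) as the cell line deposited on November 17, 2015 under accession number CGMCC No: 11592.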
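Example 4: Characterization of the stable cell line
The stable cell line HEK293-PYL constructed in the invention contains the tRNA (tRNAPyl) and pyrrolysyl-tRNA synthetase (pylRS) derived from an archaeal methanogen. In the expressing cells, with the amber stop codon (TAG) serving as a sense codon, the unnatural amino acid NAEK can be incorporated into proteins. Below, the inventors tested the feasibility of NAEK incorporation and the production performance for mutant proteins.
1: Synthesis and characterization of the unnatural amino acid NAEK
The chemical synthesis scheme for the unnatural amino acid Lys-azido is as follows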
Figure BDA0002885359960000161
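As shown in the scheme above, starting material 1 (2-bromoethanol, 2.3 mL) was dissolved in a mixture of 90 mL acetone and 15 mL water, NaN3 (3.12 g) was added, and the mixture was heated at reflux in a 60 °C oil bath for 20 h. After cooling to room temperature, the acetone was removed by rotary evaporation. The residue was extracted with anhydrous diethyl ether (30 mL × 8) and dried over anhydrous Na2SO4, and the solvent was removed by rotary evaporation to give 2.62 g of product 2 as a colorless liquid.
Product 2 (500 mg, 5.74 mmol) was added to a solution of triphosgene (1.70 g, 5.74 mmol) in THF (10 ml). The reaction was stirred at 0 °C for 8 h and the solvent was evaporated. The residue was dried under vacuum for 1 h to give product 3 as a colorless oil.
Product 3 was dissolved in 1.5 ml THF and added slowly to a solution of Boc-Lys-OH (1.7 g, 6.88 mmol) in 1 M NaOH (20 ml)/THF (5 ml). The reaction was stirred at 0 °C for 12 h while gradually warming to room temperature. The reaction mixture was cooled again to 0 °C and adjusted to pH 2–3 with 1 M hydrochloric acid at 0 °C. The mixture was extracted with EtOAc (30 mL × 5), and the organic layer was washed with saturated brine (2 × 100 ml). The organic layer was dried over anhydrous Na2SO4 and filtered, and the solvent was removed by rotary evaporation to give 1.65 g of product 4 as a colorless viscous liquid, which was used without further purification.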
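The molar amounts quoted for the reagents above can be cross-checked from the weighed masses. The sketch below assumes that product 2 is 2-azidoethanol (C2H5N3O), which is the expected product of 2-bromoethanol and NaN3 but is not named in the text, and uses the standard formulas of triphosgene (C3Cl6O3) and Boc-Lys-OH (C11H22N2O4) with average atomic masses; `molar_mass` and `mmol` are hypothetical helpers.

```python
import re

# Average atomic masses (g/mol), rounded.
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999, "Cl": 35.45}

def molar_mass(formula: str) -> float:
    """Molar mass of a simple formula such as 'C3Cl6O3' (no brackets)."""
    total = 0.0
    for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += ATOMIC_MASS[element] * (int(count) if count else 1)
    return total

def mmol(mass_mg: float, formula: str) -> float:
    """Convert a mass in mg to mmol."""
    return mass_mg / molar_mass(formula)

azidoethanol = mmol(500, "C2H5N3O")      # product 2 (assumed identity), stated 5.74 mmol
triphosgene = mmol(1700, "C3Cl6O3")      # stated 5.74 mmol
boc_lys = mmol(1700, "C11H22N2O4")       # stated 6.88 mmol

print(f"{azidoethanol:.2f} {triphosgene:.2f} {boc_lys:.2f}")
```

The computed values agree with the stated amounts to within the rounding of the weighed masses.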
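Product 4 was dissolved in 15 mL CH2Cl2, and 15 mL TFA was added slowly dropwise with stirring. After 30 min at room temperature the solvent was evaporated. The remaining liquid was dissolved in 5 mL methanol and 100 mL diethyl ether was added, precipitating a large amount of white solid, which was filtered and dried to give 1.38 g of final product 5 as a white solid. 1H NMR (D2O): δ = 1.22-1.45 (m, 4H), 1.67-1.73 (m, 2H), 2.99 (m, 2H), 3.38 (m, 2H), 3.70 (m, 1H), 4.09 (m, 2H). 13C NMR (D2O): δ = 21.4, 28.4, 29.6, 39.5, 53.4, 56.2, 57.8, 116.0 (TFA), 153.1, 162.3 (TFA), 172.9. HRMS: m/z calcd for C9H17N5O4 [M]+: 259.1281; found: 259.1283, confirming that the Lys-azido obtained has the correct structure.
2: NAEK-incorporating expression of mutant luciferase
Taking the firefly luciferase mutant (luciferase-Phe-14TAG) as an example: a nucleic acid vector carrying the mutant firefly luciferase was transfected into the stable cell line HEK293-PYL of Example 3, NAEK was added at the same time to a final concentration of 1 mM, and after 48 h of expression at 37 °C and 5% CO2 the cells were lysed;
Luciferase substrate was added to the cell lysate and the luminescence reading was measured. The results are shown in Figure 4D. After addition of the unnatural amino acid, full-length, active mutant firefly luciferase protein was obtained.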
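As a consistency check on the HRMS data reported for Lys-azido in part 1 of this example, the monoisotopic mass of C9H17N5O4 can be recomputed and compared with the measured value. The sketch below uses standard monoisotopic atomic masses and, as an assumption, ignores the electron mass for the [M]+ ion.

```python
# Monoisotopic masses (u) of the most abundant isotopes.
MONOISOTOPIC = {
    "C": 12.0,
    "H": 1.00782503207,
    "N": 14.0030740048,
    "O": 15.99491461956,
}
FORMULA = {"C": 9, "H": 17, "N": 5, "O": 4}   # Lys-azido (NAEK), C9H17N5O4

calc_mass = sum(MONOISOTOPIC[el] * n for el, n in FORMULA.items())
found_mass = 259.1283                           # value reported in the text
ppm_error = (found_mass - calc_mass) / calc_mass * 1e6

print(f"calculated {calc_mass:.4f}, found {found_mass:.4f}, error {ppm_error:.2f} ppm")
```

A deviation of a few ppm or less is typical for high-resolution instruments, so a small error here supports the assigned formula.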
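Although the invention has been described with the embodiments above, it should be understood that further modifications and variations can be made without departing from the spirit of the invention, and that all such modifications and variations fall within the scope of protection of the invention. For example, although this application illustrates the use of the stable cell line with luciferase, the invention is clearly not limited to luciferase; a person skilled in the art can apply the invention to insert an unnatural amino acid into any target protein.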
Sequence Listing
<110> Peking University
<120> Construction of a stable cell line carrying an orthogonal tRNA/aminoacyl-tRNA synthetase
<130> 1
<160> 9
<170> SIPOSequenceListing 1.0
<210> 1
<211> 1653
<212> DNA
<213> Firefly (Photinus pyralis)
<400> 1
atggaagacg ccaaaaacat aaagaaaggc ccggcgccat tctatccgct ggaagatgga 60
accgctggag agcaactgca taaggctatg aagagatacg ccctggttcc tggaacaatt 120
gcttttacag atgcacatat cgaggtggac atcacttacg ctgagtactt cgaaatgtcc 180
gttcggttgg cagaagctat gaaacgatat gggctgaata caaatcacag aatcgtcgta 240
tgcagtgaaa actctcttca attctttatg ccggtgttgg gcgcgttatt tatcggagtt 300
gcagttgcgc ccgcgaacga catttataat gaacgtgaat tgctcaacag tatgggcatt 360
tcgcagccta ccgtggtgtt cgtttccaaa aaggggttgc aaaaaatttt gaacgtgcaa 420
aaaaagctcc caatcatcca aaaaattatt atcatggatt ctaaaacgga ttaccaggga 480
tttcagtcga tgtacacgtt cgtcacatct catctacctc ccggttttaa tgaatacgat 540
tttgtgccag agtccttcga tagggacaag acaattgcac tgatcatgaa ctcctctgga 600
tctactggtc tgcctaaagg tgtcgctctg cctcatagaa ctgcctgcgt gagattctcg 660
catgccagag atcctatttt tggcaatcaa atcattccgg atactgcgat tttaagtgtt 720
gttccattcc atcacggttt tggaatgttt actacactcg gatatttgat atgtggattt 780
cgagtcgtct taatgtatag atttgaagaa gagctgtttc tgaggagcct tcaggattac 840
aagattcaaa gtgcgctgct ggtgccaacc ctattctcct tcttcgccaa aagcactctg 900
attgacaaat acgatttatc taatttacac gaaattgctt ctggtggcgc tcccctctct 960
aaggaagtcg gggaagcggt tgccaagagg ttccatctgc caggtatcag gcaaggatat 1020
gggctcactg agactacatc agctattctg attacacccg agggggatga taaaccgggc 1080
gcggtcggta aagttgttcc attttttgaa gcgaaggttg tggatctgga taccgggaaa 1140
acgctgggcg ttaatcaaag aggcgaactg tgtgtgagag gtcctatgat tatgtccggt 1200
tatgtaaaca atccggaagc gaccaacgcc ttgattgaca aggatggatg gctacattct 1260
ggagacatag cttactggga cgaagacgaa cacttcttca tcgttgaccg cctgaagtct 1320
ctgattaagt acaaaggcta tcaggtggct cccgctgaat tggaatccat cttgctccaa 1380
caccccaaca tcttcgacgc aggtgtcgca ggtcttcccg acgatgacgc cggtgaactt 1440
cccgccgccg ttgttgtttt ggagcacgga aagacgatga cggaaaaaga gatcgtggat 1500
tacgtcgcca gtcaagtaac aaccgcgaaa aagttgcgcg gaggagttgt gtttgtggac 1560
gaagtaccga aaggtcttac cggaaaactc gacgcaagaa aaatcagaga gatcctcata 1620
aaggccaaga agggcggaaa gatcgccgtg taa 1653
<210> 2
<211> 7678
<212> DNA
<213> Artificial Sequence
<400> 2
gttagaccag atctgagcct gggagctctc tggctaacta gggaacccac tgcttaagcc 60
tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt gtgactctgg 120
taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca gtggcgcccg 180
aacagggact tgaaagcgaa agggaaacca gaggagctct ctcgacgcag gactcggctt 240
gctgaagcgc gcacggcaag aggcgagggg cggcgactgg tgagtacgcc aaaaattttg 300
actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa gcgggggaga 360
attagatcga tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat ataaattaaa 420
acatatagta tgggcaagca gggagctaga acgattcgca gttaatcctg gcctgttaga 480
aacatcagaa ggctgtagac aaatactggg acagctacaa ccatcccttc agacaggatc 540
agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtgc atcaaaggat 600
agagataaaa gacaccaagg aagctttaga caagatagag gaagagcaaa acaaaagtaa 660
gaaaaaagca cagcaagcag cagctgacac aggacacagc aatcaggtca gccaaaatta 720
ccctatagtg cagaacatcc aggggcaaat ggtacatcag gccatatcac ctagaacttt 780
aaatgcatgg gtaaaagtag tagaagagaa ggctttcagc ccagaagtga tacccatgtt 840
ttcagcatta tcagaaggag ccaccccaca agatttaaac accatgctaa acacagtggg 900
gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgaggaag ctgcaggcaa 960
agagaagagt ggtgcagaga gaaaaaagag cagtgggaat aggagctttg ttccttgggt 1020
tcttgggagc agcaggaagc actatgggcg cagcgtcaat gacgctgacg gtacaggcca 1080
gacaattatt gtctggtata gtgcagcagc agaacaattt gctgagggct attgaggcgc 1140
aacagcatct gttgcaactc acagtctggg gcatcaagca gctccaggca agaatcctgg 1200
ctgtggaaag atacctaaag gatcaacagc tcctggggat ttggggttgc tctggaaaac 1260
tcatttgcac cactgctgtg ccttggatct acaaatggca gtattcatcc acaatttaaa 1320
agaaaggggg gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag 1380
acatacaaac taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt 1440
acagggacag cagagatcca gtttggggat ccaattccgc ccctctccct cccccccccc 1500
taacgttact ggccgaagcc gcttggaata aggccggtgt gcgtttgtct atatgttatt 1560
ttccaccata ttgccgtctt ttggcaatgt gagggcccgg aaacctggcc ctgtcttctt 1620
gacgagcatt cctaggggtc tttcccctct cgccaaagga atgcaaggtc tgttgaatgt 1680
cgtgaaggaa gcagttcctc tggaagcttc ttgaagacaa acaacgtctg tagcgaccct 1740
ttgcaggcag cggaaccccc cacctggcga caggtgcctc tgcggccaaa agccacgtgt 1800
ataagataca cctgcaaagg cggcacaacc ccagtgccac gttgtgagtt ggatagttgt 1860
ggaaagagtc aaatggctct cctcaagcgt attcaacaag gggctgaagg atgcccagaa 1920
ggtaccccat tgtatgggat ctgatctggg gcctcggtgc acatgcttta catgtgttta 1980
gtcgaggtta aaaaaacgtc taggcccccc gaaccacggg gacgtggttt tcctttgaaa 2040
aacacgatga taagcttgcc acaacccaca aggagacgac cttccatgac cgagtacaag 2100
cccacggtgc gcctcgccac ccgcgacgac gtcccccggg ccgtacgcac cctcgccgcc 2160
gcgttcgccg actaccccgc cacgcgccac accgtcgacc cggaccgcca catcgagcgg 2220
gtcaccgagc tgcaagaact cttcctcacg cgcgtcgggc tcgacatcgg caaggtgtgg 2280
gtcgcggacg acggcgccgc ggtggcggtc tggaccacgc cggagagcgt cgaagcgggg 2340
gcggtgttcg ccgagatcgg cccgcgcatg gccgagttga gcggttcccg gctggccgcg 2400
cagcaacaga tggaaggcct cctggcgccg caccggccca aggagcccgc gtggttcctg 2460
gccaccgtcg gcgtctcgcc cgaccaccag ggcaagggtc tgggcagcgc cgtcgtgctc 2520
cccggagtgg aggcggccga gcgcgccggg gtgcccgcct tcctggagac ctccgcgccc 2580
cgcaacctcc ccttctacga gcggctcggc ttcaccgtca ccgccgacgt cgaggtgccc 2640
gaaggaccgc gcacctggtg catgacccgc aagcccggtg cctgatctag aggatcataa 2700
tcagccatac cacatttgta gaggttttac ttgctttaaa aaacctccca cacctccccc 2760
tgaacctgaa acataaaatg aatgcaattg ttgttgttaa cttgtttatt gcagcttata 2820
atggttacaa ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc 2880
attctagttg tggtttgtcc aaactcatca atgtatctta tcatgtctgg atcgggctgc 2940
aggaattcga tatcaagctt atcgataatc aacctctgga ttacaaaatt tgtgaaagat 3000
tgactggtat tcttaactat gttgctcctt ttacgctatg tggatacgct gctttaatgc 3060
ctttgtatca tgctattgct tcccgtatgg ctttcatttt ctcctccttg tataaatcct 3120
ggttgctgtc tctttatgag gagttgtggc ccgttgtcag gcaacgtggc gtggtgtgca 3180
ctgtgtttgc tgacgcaacc cccactggtt ggggcattgc caccacctgt cagctccttt 3240
ccgggacttt cgctttcccc ctccctattg ccacggcgga actcatcgcc gcctgccttg 3300
cccgctgctg gacaggggct cggctgttgg gcactgacaa ttccgtggtg ttgtcgggga 3360
aatcatcgtc ctttccttgg ctgctcgcct gtgttgccac ctggattctg cgcgggacgt 3420
ccttctgcta cgtcccttcg gccctcaatc cagcggacct tccttcccgc ggcctgctgc 3480
cggctctgcg gcctcttccg cgtcttcgcc ttcgccctca gacgagtcgg atctcccttt 3540
gggccgcctc cccgcatcga taccgtcgac tagccgtacc tttaagacca atgacttaca 3600
aggcagctgt agatcttagc cactttttaa aagaaaaggg gggactggaa gggctaattc 3660
actcccaaag aagacaagat ctgctttttg cctgtactgg gtctctctgg ttagaccaga 3720
tctgagcctg ggagctctct ggctaactag ggaacccact gcttaagcct caataaagct 3780
tgccttgagt gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt aactagagat 3840
ccctcagacc cttttagtca gtgtggaaaa tctctagcag aattcgatat caagcttatc 3900
gataccgtcg acctcgaggg ggggcccggt acccaattcg ccctatagtg agtcgtatta 3960
cgctcactgg ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 4020
aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 4080
gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggaaattgta agcgttaata 4140
ttttgttaaa attcgcgtta aatttttgtt aaatcagctc attttttaac caataggccg 4200
aaatcggcaa aatcccttat aaatcaaaag aatagaccga gatagggttg agtgttgttc 4260
cagtttggaa caagagtcca ctattaaaga acgtggactc caacgtcaaa gggcgaaaaa 4320
ccgtctatca gggcgatggc ccactacgtg aaccatcacc ctaatcaagt tttttggggt 4380
cgaggtgccg taaagcacta aatcggaacc ctaaagggag cccccgattt agagcttgac 4440
ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa agcgaaagga gcgggcgcta 4500
gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac cacacccgcc gcgcttaatg 4560
cgccgctaca gggcgcgtca ggtggcactt ttcggggaaa tgtgcgcgga acccctattt 4620
gtttattttt ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa 4680
tgcttcaata atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta 4740
ttcccttttt tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag 4800
taaaagatgc tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca 4860
gcggtaagat ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta 4920
aagttctgct atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc 4980
gccgcataca ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc 5040
ttacggatgg catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca 5100
ctgcggccaa cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc 5160
acaacatggg ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca 5220
taccaaacga cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac 5280
tattaactgg cgaactactt actctagctt cccggcaaca attaatagac tggatggagg 5340
cggataaagt tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg 5400
ataaatctgg agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg 5460
gtaagccctc ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac 5520
gaaatagaca gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc 5580
aagtttactc atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct 5640
aggtgaagat cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc 5700
actgagcgtc agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc 5760
gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg 5820
atcaagagct accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa 5880
atactgttct tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc 5940
ctacatacct cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt 6000
gtcttaccgg gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa 6060
cggggggttc gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc 6120
tacagcgtga gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc 6180
cggtaagcgg cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct 6240
ggtatcttta tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat 6300
gctcgtcagg ggggcggagc ctatggaaaa acgccgcaac cggccttttt acggttcctg 6360
gccttttgct ggccttttgc tcacatgtct ttcctgcgtt acccctgatt ctgtggataa 6420
ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga ccgagcgcag 6480
cgagtcagtg agcgaggaag cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg 6540
ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag cgggcagtga 6600
gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt tacactttat 6660
gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca caggaaacag 6720
ctatgaccat gattacgcca agccgaatta accctcacta aagggaacaa aagctggagc 6780
tccaccgcgg tggcggcctc gaggtcgaga tccggtcgac cagcaaccat agtcccgccc 6840
ctaactccgc ccatcccgcc cctaactccg cccagttccg cccattctcc gccccatggc 6900
tgactaattt tttttattta tgcagaggcc gaggccgcct cggcctctga gctattccag 6960
aagtagtgag gaggcttttt tggaggccta ggcttttgca aaaagcttcg acggtatcga 7020
ttggctcatg tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt 7080
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 7140
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 7200
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt 7260
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta 7320
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg 7380
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt 7440
tttggcagta catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc 7500
accccattga cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat 7560
gtcgtaacaa ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg aattcggagt 7620
ggcgagccct cagatcctgc atataagcag ctgctttttg cctgtatggg tctctctg 7678
<210> 3
<211> 8103
<212> DNA
<213> Artificial Sequence
<400> 3
gttagaccag atctgagcct gggagctctc tggctaacta gggaacccac tgcttaagcc 60
tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt gtgactctgg 120
taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca gtggcgcccg 180
aacagggact tgaaagcgaa agggaaacca gaggagctct ctcgacgcag gactcggctt 240
gctgaagcgc gcacggcaag aggcgagggg cggcgactgg tgagtacgcc aaaaattttg 300
actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa gcgggggaga 360
attagatcga tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat ataaattaaa 420
acatatagta tgggcaagca gggagctaga acgattcgca gttaatcctg gcctgttaga 480
aacatcagaa ggctgtagac aaatactggg acagctacaa ccatcccttc agacaggatc 540
agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtgc atcaaaggat 600
agagataaaa gacaccaagg aagctttaga caagatagag gaagagcaaa acaaaagtaa 660
gaaaaaagca cagcaagcag cagctgacac aggacacagc aatcaggtca gccaaaatta 720
ccctatagtg cagaacatcc aggggcaaat ggtacatcag gccatatcac ctagaacttt 780
aaatgcatgg gtaaaagtag tagaagagaa ggctttcagc ccagaagtga tacccatgtt 840
ttcagcatta tcagaaggag ccaccccaca agatttaaac accatgctaa acacagtggg 900
gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgaggaag ctgcaggcaa 960
agagaagagt ggtgcagaga gaaaaaagag cagtgggaat aggagctttg ttccttgggt 1020
tcttgggagc agcaggaagc actatgggcg cagcgtcaat gacgctgacg gtacaggcca 1080
gacaattatt gtctggtata gtgcagcagc agaacaattt gctgagggct attgaggcgc 1140
aacagcatct gttgcaactc acagtctggg gcatcaagca gctccaggca agaatcctgg 1200
ctgtggaaag atacctaaag gatcaacagc tcctggggat ttggggttgc tctggaaaac 1260
tcatttgcac cactgctgtg ccttggatct acaaatggca gtattcatcc acaatttaaa 1320
agaaaggggg gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag 1380
acatacaaac taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt 1440
acagggacag cagagatcca gtttggggat ccaattccgc ccctctccct cccccccccc 1500
taacgttact ggccgaagcc gcttggaata aggccggtgt gcgtttgtct atatgttatt 1560
ttccaccata ttgccgtctt ttggcaatgt gagggcccgg aaacctggcc ctgtcttctt 1620
gacgagcatt cctaggggtc tttcccctct cgccaaagga atgcaaggtc tgttgaatgt 1680
cgtgaaggaa gcagttcctc tggaagcttc ttgaagacaa acaacgtctg tagcgaccct 1740
ttgcaggcag cggaaccccc cacctggcga caggtgcctc tgcggccaaa agccacgtgt 1800
ataagataca cctgcaaagg cggcacaacc ccagtgccac gttgtgagtt ggatagttgt 1860
ggaaagagtc aaatggctct cctcaagcgt attcaacaag gggctgaagg atgcccagaa 1920
ggtaccccat tgtatgggat ctgatctggg gcctcggtgc acatgcttta catgtgttta 1980
gtcgaggtta aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa 2040
acacgatgat aagcttgcca caacccacaa ggagacgacc ttccatgaaa aagcctgaac 2100
tcaccgcgac gtctgtcgag aagtttctga tcgaaaagtt cgacagcgtc tccgacctga 2160
tgcagctctc ggagggcgaa gaatctcgtg ctttcagctt cgatgtagga gggcgtggat 2220
atgtcctgcg ggtaaatagc tgcgccgatg gtttctacaa agatcgttat gtttatcggc 2280
actttgcatc ggccgcgctc ccgattccgg aagtgcttga cattggggaa ttcagcgaga 2340
gcctgaccta ttgcatctcc cgccgtgcac agggtgtcac gttgcaagac ctgcctgaaa 2400
ccgaactgcc cgctgttctg cagccggtcg cggaggccat ggatgcgatc gctgcggccg 2460
atcttagcca gacgagcggg ttcggcccat tcggaccgca aggaatcggt caatacacta 2520
catggcgtga tttcatatgc gcgattgctg atccccatgt gtatcactgg caaactgtga 2580
tggacgacac cgtcagtgcg tccgtcgcgc aggctctcga tgagctgatg ctttgggccg 2640
aggactgccc cgaagtccgg cacctcgtgc acgcggattt cggctccaac aatgtcctga 2700
cggacaatgg ccgcataaca gcggtcattg actggagcga ggcgatgttc ggggattccc 2760
aatacgaggt cgccaacatc ttcttctgga ggccgtggtt ggcttgtatg gagcagcaga 2820
cgcgctactt cgagcggagg catccggagc ttgcaggatc gccgcggctc cgggcgtata 2880
tgctccgcat tggtcttgac caactctatc agagcttggt tgacggcaat ttcgatgatg 2940
cagcttgggc gcagggtcga tgcgacgcaa tcgtccgatc cggagccggg actgtcgggc 3000
gtacacaaat cgcccgcaga agcgcggccg tctggaccga tggctgtgta gaagtactcg 3060
ccgatagtgg aaaccgacgc cccagcactc gtccgagggc aaaggaatga tctagaggat 3120
cataatcagc cataccacat ttgtagaggt tttacttgct ttaaaaaacc tcccacacct 3180
ccccctgaac ctgaaacata aaatgaatgc aattgttgtt gttaacttgt ttattgcagc 3240
ttataatggt tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc 3300
actgcattct agttgtggtt tgtccaaact catcaatgta tcttatcatg tctggatcgg 3360
gctgcaggaa ttcgatatca agcttatcga taatcaacct ctggattaca aaatttgtga 3420
aagattgact ggtattctta actatgttgc tccttttacg ctatgtggat acgctgcttt 3480
aatgcctttg tatcatgcta ttgcttcccg tatggctttc attttctcct ccttgtataa 3540
atcctggttg ctgtctcttt atgaggagtt gtggcccgtt gtcaggcaac gtggcgtggt 3600
gtgcactgtg tttgctgacg caacccccac tggttggggc attgccacca cctgtcagct 3660
cctttccggg actttcgctt tccccctccc tattgccacg gcggaactca tcgccgcctg 3720
ccttgcccgc tgctggacag gggctcggct gttgggcact gacaattccg tggtgttgtc 3780
ggggaaatca tcgtcctttc cttggctgct cgcctgtgtt gccacctgga ttctgcgcgg 3840
gacgtccttc tgctacgtcc cttcggccct caatccagcg gaccttcctt cccgcggcct 3900
gctgccggct ctgcggcctc ttccgcgtct tcgccttcgc cctcagacga gtcggatctc 3960
cctttgggcc gcctccccgc atcgataccg tcgactagcc gtacctttaa gaccaatgac 4020
ttacaaggca gctgtagatc ttagccactt tttaaaagaa aaggggggac tggaagggct 4080
aattcactcc caaagaagac aagatctgct ttttgcctgt actgggtctc tctggttaga 4140
ccagatctga gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata 4200
aagcttgcct tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta 4260
gagatccctc agaccctttt agtcagtgtg gaaaatctct agcagaattc gatatcaagc 4320
ttatcgatac cgtcgacctc gagggggggc ccggtaccca attcgcccta tagtgagtcg 4380
tattacgctc actggccgtc gttttacaac gtcgtgactg ggaaaaccct ggcgttaccc 4440
aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc gaagaggccc 4500
gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggaaa ttgtaagcgt 4560
taatattttg ttaaaattcg cgttaaattt ttgttaaatc agctcatttt ttaaccaata 4620
ggccgaaatc ggcaaaatcc cttataaatc aaaagaatag accgagatag ggttgagtgt 4680
tgttccagtt tggaacaaga gtccactatt aaagaacgtg gactccaacg tcaaagggcg 4740
aaaaaccgtc tatcagggcg atggcccact acgtgaacca tcaccctaat caagtttttt 4800
ggggtcgagg tgccgtaaag cactaaatcg gaaccctaaa gggagccccc gatttagagc 4860
ttgacgggga aagccggcga acgtggcgag aaaggaaggg aagaaagcga aaggagcggg 4920
cgctagggcg ctggcaagtg tagcggtcac gctgcgcgta accaccacac ccgccgcgct 4980
taatgcgccg ctacagggcg cgtcaggtgg cacttttcgg ggaaatgtgc gcggaacccc 5040
tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg 5100
ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc 5160
ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt 5220
gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 5280
caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac 5340
ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 5400
cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 5460
gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 5520
taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 5580
tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 5640
agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 5700
caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 5760
ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat 5820
tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag cactggggcc 5880
agatggtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg caactatgga 5940
tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc 6000
agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag 6060
gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc 6120
gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt 6180
tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt 6240
gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat 6300
accaaatact gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc 6360
accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa 6420
gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg 6480
ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag 6540
atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag 6600
gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa 6660
cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt 6720
gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc gcaaccggcc tttttacggt 6780
tcctggcctt ttgctggcct tttgctcaca tgtctttcct gcgttacccc tgattctgtg 6840
gataaccgta ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag 6900
cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc 6960
gcgcgttggc cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc 7020
agtgagcgca acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac 7080
tttatgcttc cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga 7140
aacagctatg accatgatta cgccaagccg aattaaccct cactaaaggg aacaaaagct 7200
ggagctccac cgcggtggcg gcctcgaggt cgagatccgg tcgaccagca accatagtcc 7260
cgcccctaac tccgcccatc ccgcccctaa ctccgcccag ttccgcccat tctccgcccc 7320
atggctgact aatttttttt atttatgcag aggccgaggc cgcctcggcc tctgagctat 7380
tccagaagta gtgaggaggc ttttttggag gcctaggctt ttgcaaaaag cttcgacggt 7440
atcgattggc tcatgtccaa cattaccgcc atgttgacat tgattattga ctagttatta 7500
atagtaatca attacggggt cattagttca tagcccatat atggagttcc gcgttacata 7560
acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat tgacgtcaat 7620
aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc aatgggtgga 7680
gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc caagtacgcc 7740
ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt acatgacctt 7800
atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta ccatggtgat 7860
gcggttttgg cagtacatca atgggcgtgg atagcggttt gactcacggg gatttccaag 7920
tctccacccc attgacgtca atgggagttt gttttggcac caaaatcaac gggactttcc 7980
aaaatgtcgt aacaactccg ccccattgac gcaaatgggc ggtaggcgtg tacggaattc 8040
ggagtggcga gccctcagat cctgcatata agcagctgct ttttgcctgt atgggtctct 8100
ctg 8103
<210> 4
<211> 9983
<212> DNA
<213> Artificial Sequence
<400> 4
gttagaccag atctgagcct gggagctctc tggctaacta gggaacccac tgcttaagcc 60
tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt gtgactctgg 120
taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca gtggcgcccg 180
aacagggact tgaaagcgaa agggaaacca gaggagctct ctcgacgcag gactcggctt 240
gctgaagcgc gcacggcaag aggcgagggg cggcgactgg tgagtacgcc aaaaattttg 300
actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa gcgggggaga 360
attagatcga tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat ataaattaaa 420
acatatagta tgggcaagca gggagctaga acgattcgca gttaatcctg gcctgttaga 480
aacatcagaa ggctgtagac aaatactggg acagctacaa ccatcccttc agacaggatc 540
agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtgc atcaaaggat 600
agagataaaa gacaccaagg aagctttaga caagatagag gaagagcaaa acaaaagtaa 660
gaaaaaagca cagcaagcag cagctgacac aggacacagc aatcaggtca gccaaaatta 720
ccctatagtg cagaacatcc aggggcaaat ggtacatcag gccatatcac ctagaacttt 780
aaatgcatgg gtaaaagtag tagaagagaa ggctttcagc ccagaagtga tacccatgtt 840
ttcagcatta tcagaaggag ccaccccaca agatttaaac accatgctaa acacagtggg 900
gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgaggaag ctgcaggcaa 960
agagaagagt ggtgcagaga gaaaaaagag cagtgggaat aggagctttg ttccttgggt 1020
tcttgggagc agcaggaagc actatgggcg cagcgtcaat gacgctgacg gtacaggcca 1080
gacaattatt gtctggtata gtgcagcagc agaacaattt gctgagggct attgaggcgc 1140
aacagcatct gttgcaactc acagtctggg gcatcaagca gctccaggca agaatcctgg 1200
ctgtggaaag atacctaaag gatcaacagc tcctggggat ttggggttgc tctggaaaac 1260
tcatttgcac cactgctgtg ccttggatct acaaatggca gtattcatcc acaatttaaa 1320
agaaaggggg gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag 1380
acatacaaac taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt 1440
acagggacag cagagatcca gtttggggat cccaatattg gccattagcc atattattca 1500
ttggttatat agcataaatc aatattggct attggccatt gcatacgttg tatctatatc 1560
ataatatgta catttatatt ggctcatgtc caatatgacc gccatgttgg cattgattat 1620
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 1680
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 1740
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 1800
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 1860
tgccaagtcc gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 1920
agtacatgac cttacgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 1980
ttaccatggt gatgcggttt tggcagtaca ccaatgggcg tggatagcgg tttgactcac 2040
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 2100
aacgggactt tccaaaatgt cgtaataacc ccgccccgtt gacgcaaatg ggcggtaggc 2160
gtgtacggtg ggaggtctat ataagcagag ctcgtttagt gaaccgtcag atcgcctgga 2220
gacgccatcc acgctgtttt gacctccata gaagacaccg ggaccgatcc agcctccgcg 2280
gccgggaacg gtgcattgga acgcggattc cccgtgccaa gagtgacgta agtaccgcct 2340
atagagtcta taggcccacc cccttggctt cgttagaacg cggctacaat taatacataa 2400
ccttatgtat catacacata cgatttaggt gacactatag aataacatcc actttgcctt 2460
tctctccaca ggtgtccact cccaggtcca actgcacgga agcttgccac catggataaa 2520
aaaccattag atgttttaat atctgcgacc gggctctgga tgtccaggac tggcacgctc 2580
cacaaaatca agcaccatga ggtctcaaga agtaaaatat acattgaaat ggcgtgtgga 2640
gaccatcttg ttgtgaataa ttccaggagt tgtagaacag ccagagcatt cagacatcat 2700
aagtacagaa aaacctgcaa acgatgtagg gtttcggacg aggatatcaa taattttctc 2760
acaagatcaa ccgaaagcaa aaacagtgtg aaagttaggg tagtttctgc tccaaaggtc 2820
aaaaaagcta tgccgaaatc agtttcaagg gctccgaagc ctctggaaaa ttctgtttct 2880
gcaaaggcat cgacgaacac atccagatct gtaccttcgc ctgcaaaatc aactccaaat 2940
tcgtctgttc ccgcatcggc tcctgctcct tcacttacaa gaagccagct tgatagggtt 3000
gaggctctct taagtccaga ggataaaatt tctctaaata tggcaaagcc tttcagggaa 3060
cttgagcctg aacttgtgac aagaagaaaa aacgattttc agcggctcta taccaatgat 3120
agagaagact acctcggtaa actcgaacgt gatattacga aatttttcgt agaccggggt 3180
tttctggaga taaagtctcc tatccttatt ccggcggaat acgtggagag aatgggtatt 3240
aataatgata ctgaactttc aaaacagatc ttccgggtgg ataaaaatct ctgcttgagg 3300
ccaatgcttg ccccgactct gtataactat gcgcgaaaac tcgataggat tttaccaggc 3360
ccaataaaaa ttttcgaagt cggaccttgt taccggaaag agtctgacgg caaagagcac 3420
ctggaagaat ttactatggt gaacttcagt cagatgggtt cgggatgtac tcgggaaaat 3480
cttgaagctc tcatcaaaga gtttctggac tatctggaaa tcgacttcga aatcgtagga 3540
gattcctgta tggtctttgg ggatactctt gatataatgc acggggacct ggagctttct 3600
tcggcagtcg tcgggccagt ttctcttgat agagaatggg gtattgacaa accatggata 3660
ggtgcaggtt ttggtcttga acgcttgctc aaggttatgc acggctttaa aaacattaag 3720
agggcatcaa ggtccgaatc ttactataat gggatttcaa ccaatctata aggatccaat 3780
tccgcccctc tccctccccc ccccctaacg ttactggccg aagccgcttg gaataaggcc 3840
ggtgtgcgtt tgtctatatg ttattttcca ccatattgcc gtcttttggc aatgtgaggg 3900
cccggaaacc tggccctgtc ttcttgacga gcattcctag gggtctttcc cctctcgcca 3960
aaggaatgca aggtctgttg aatgtcgtga aggaagcagt tcctctggaa gcttcttgaa 4020
gacaaacaac gtctgtagcg accctttgca ggcagcggaa ccccccacct ggcgacaggt 4080
gcctctgcgg ccaaaagcca cgtgtataag atacacctgc aaaggcggca caaccccagt 4140
gccacgttgt gagttggata gttgtggaaa gagtcaaatg gctctcctca agcgtattca 4200
acaaggggct gaaggatgcc cagaaggtac cccattgtat gggatctgat ctggggcctc 4260
ggtgcacatg ctttacatgt gtttagtcga ggttaaaaaa acgtctaggc cccccgaacc 4320
acggggacgt ggttttcctt tgaaaaacac gatgataagc ttgccacaac ccacaaggag 4380
acgaccttcc atgaccgagt acaagcccac ggtgcgcctc gccacccgcg acgacgtccc 4440
ccgggccgta cgcaccctcg ccgccgcgtt cgccgactac cccgccacgc gccacaccgt 4500
cgacccggac cgccacatcg agcgggtcac cgagctgcaa gaactcttcc tcacgcgcgt 4560
cgggctcgac atcggcaagg tgtgggtcgc ggacgacggc gccgcggtgg cggtctggac 4620
cacgccggag agcgtcgaag cgggggcggt gttcgccgag atcggcccgc gcatggccga 4680
gttgagcggt tcccggctgg ccgcgcagca acagatggaa ggcctcctgg cgccgcaccg 4740
gcccaaggag cccgcgtggt tcctggccac cgtcggcgtc tcgcccgacc accagggcaa 4800
gggtctgggc agcgccgtcg tgctccccgg agtggaggcg gccgagcgcg ccggggtgcc 4860
cgccttcctg gagacctccg cgccccgcaa cctccccttc tacgagcggc tcggcttcac 4920
cgtcaccgcc gacgtcgagg tgcccgaagg accgcgcacc tggtgcatga cccgcaagcc 4980
cggtgcctga tctagaggat cataatcagc cataccacat ttgtagaggt tttacttgct 5040
ttaaaaaacc tcccacacct ccccctgaac ctgaaacata aaatgaatgc aattgttgtt 5100
gttaacttgt ttattgcagc ttataatggt tacaaataaa gcaatagcat cacaaatttc 5160
acaaataaag catttttttc actgcattct agttgtggtt tgtccaaact catcaatgta 5220
tcttatcatg tctggatcgg gctgcaggaa ttcgatatca agcttatcga taatcaacct 5280
ctggattaca aaatttgtga aagattgact ggtattctta actatgttgc tccttttacg 5340
ctatgtggat acgctgcttt aatgcctttg tatcatgcta ttgcttcccg tatggctttc 5400
attttctcct ccttgtataa atcctggttg ctgtctcttt atgaggagtt gtggcccgtt 5460
gtcaggcaac gtggcgtggt gtgcactgtg tttgctgacg caacccccac tggttggggc 5520
attgccacca cctgtcagct cctttccggg actttcgctt tccccctccc tattgccacg 5580
gcggaactca tcgccgcctg ccttgcccgc tgctggacag gggctcggct gttgggcact 5640
gacaattccg tggtgttgtc ggggaaatca tcgtcctttc cttggctgct cgcctgtgtt 5700
gccacctgga ttctgcgcgg gacgtccttc tgctacgtcc cttcggccct caatccagcg 5760
gaccttcctt cccgcggcct gctgccggct ctgcggcctc ttccgcgtct tcgccttcgc 5820
cctcagacga gtcggatctc cctttgggcc gcctccccgc atcgataccg tcgactagcc 5880
gtacctttaa gaccaatgac ttacaaggca gctgtagatc ttagccactt tttaaaagaa 5940
aaggggggac tggaagggct aattcactcc caaagaagac aagatctgct ttttgcctgt 6000
actgggtctc tctggttaga ccagatctga gcctgggagc tctctggcta actagggaac 6060
ccactgctta agcctcaata aagcttgcct tgagtgcttc aagtagtgtg tgcccgtctg 6120
ttgtgtgact ctggtaacta gagatccctc agaccctttt agtcagtgtg gaaaatctct 6180
agcagaattc gatatcaagc ttatcgatac cgtcgacctc gagggggggc ccggtaccca 6240
attcgcccta tagtgagtcg tattacgctc actggccgtc gttttacaac gtcgtgactg 6300
ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg 6360
gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg 6420
cgaatggaaa ttgtaagcgt taatattttg ttaaaattcg cgttaaattt ttgttaaatc 6480
agctcatttt ttaaccaata ggccgaaatc ggcaaaatcc cttataaatc aaaagaatag 6540
accgagatag ggttgagtgt tgttccagtt tggaacaaga gtccactatt aaagaacgtg 6600
gactccaacg tcaaagggcg aaaaaccgtc tatcagggcg atggcccact acgtgaacca 6660
tcaccctaat caagtttttt ggggtcgagg tgccgtaaag cactaaatcg gaaccctaaa 6720
gggagccccc gatttagagc ttgacgggga aagccggcga acgtggcgag aaaggaaggg 6780
aagaaagcga aaggagcggg cgctagggcg ctggcaagtg tagcggtcac gctgcgcgta 6840
accaccacac ccgccgcgct taatgcgccg ctacagggcg cgtcaggtgg cacttttcgg 6900
ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa tatgtatccg 6960
ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa gagtatgagt 7020
attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct tcctgttttt 7080
gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg 7140
ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg ccccgaagaa 7200
cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt 7260
gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga cttggttgag 7320
tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga attatgcagt 7380
gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac gatcggagga 7440
ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg ccttgatcgt 7500
tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac gatgcctgta 7560
gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct agcttcccgg 7620
caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct gcgctcggcc 7680
cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt 7740
atcattgcag cactggggcc agatggtaag ccctcccgta tcgtagttat ctacacgacg 7800
gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg tgcctcactg 7860
attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat tgatttaaaa 7920
cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct catgaccaaa 7980
atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa gatcaaagga 8040
tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg 8100
ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc gaaggtaact 8160
ggcttcagca gagcgcagat accaaatact gttcttctag tgtagccgta gttaggccac 8220
cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct gttaccagtg 8280
gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg atagttaccg 8340
gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag cttggagcga 8400
acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc cacgcttccc 8460
gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg agagcgcacg 8520
agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc 8580
tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc 8640
gcaaccggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca tgtctttcct 8700
gcgttacccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc 8760
gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa 8820
tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt 8880
ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt 8940
aggcacccca ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg 9000
gataacaatt tcacacagga aacagctatg accatgatta cgccaagccg aattaaccct 9060
cactaaaggg aacaaaagct ggagctccac cgcggtggcg gcctcgaggt cgagatccgg 9120
tcgaccagca accatagtcc cgcccctaac tccgcccatc ccgcccctaa ctccgcccag 9180
ttccgcccat tctccgcccc atggctgact aatttttttt atttatgcag aggccgaggc 9240
cgcctcggcc tctgagctat tccagaagta gtgaggaggc ttttttggag gcctaggctt 9300
ttgcaaaaag cttcgacggt atcgattggc tcatgtccaa cattaccgcc atgttgacat 9360
tgattattga ctagttatta atagtaatca attacggggt cattagttca tagcccatat 9420
atggagttcc gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac 9480
ccccgcccat tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc 9540
cattgacgtc aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg 9600
tatcatatgc caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat 9660
tatgcccagt acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc 9720
atcgctatta ccatggtgat gcggttttgg cagtacatca atgggcgtgg atagcggttt 9780
gactcacggg gatttccaag tctccacccc attgacgtca atgggagttt gttttggcac 9840
caaaatcaac gggactttcc aaaatgtcgt aacaactccg ccccattgac gcaaatgggc 9900
ggtaggcgtg tacggaattc ggagtggcga gccctcagat cctgcatata agcagctgct 9960
ttttgcctgt atgggtctct ctg 9983
<210> 5
<211> 9607
<212> DNA
<213> Artificial Sequence
<400> 5
gttagaccag atctgagcct gggagctctc tggctaacta gggaacccac tgcttaagcc 60
tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt gtgactctgg 120
taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca gtggcgcccg 180
aacagggact tgaaagcgaa agggaaacca gaggagctct ctcgacgcag gactcggctt 240
gctgaagcgc gcacggcaag aggcgagggg cggcgactgg tgagtacgcc aaaaattttg 300
actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa gcgggggaga 360
attagatcga tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat ataaattaaa 420
acatatagta tgggcaagca gggagctaga acgattcgca gttaatcctg gcctgttaga 480
aacatcagaa ggctgtagac aaatactggg acagctacaa ccatcccttc agacaggatc 540
agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtgc atcaaaggat 600
agagataaaa gacaccaagg aagctttaga caagatagag gaagagcaaa acaaaagtaa 660
gaaaaaagca cagcaagcag cagctgacac aggacacagc aatcaggtca gccaaaatta 720
ccctatagtg cagaacatcc aggggcaaat ggtacatcag gccatatcac ctagaacttt 780
aaatgcatgg gtaaaagtag tagaagagaa ggctttcagc ccagaagtga tacccatgtt 840
ttcagcatta tcagaaggag ccaccccaca agatttaaac accatgctaa acacagtggg 900
gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgaggaag ctgcaggcaa 960
agagaagagt ggtgcagaga gaaaaaagag cagtgggaat aggagctttg ttccttgggt 1020
tcttgggagc agcaggaagc actatgggcg cagcgtcaat gacgctgacg gtacaggcca 1080
gacaattatt gtctggtata gtgcagcagc agaacaattt gctgagggct attgaggcgc 1140
aacagcatct gttgcaactc acagtctggg gcatcaagca gctccaggca agaatcctgg 1200
ctgtggaaag atacctaaag gatcaacagc tcctggggat ttggggttgc tctggaaaac 1260
tcatttgcac cactgctgtg ccttggatct acaaatggca gtattcatcc acaatttaaa 1320
agaaaggggg gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag 1380
acatacaaac taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt 1440
acagggacag cagagatcca gtttggggat ccgttgacat tgattattga ctagttatta 1500
atagtaatca attacggggt cattagttca tagcccatat atggagttcc gcgttacata 1560
acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat tgacgtcaat 1620
aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc aatgggtgga 1680
gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc caagtacgcc 1740
ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt acatgacctt 1800
atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta ccatggtgat 1860
gcggttttgg cagtacatca atgggcgtgg atagcggttt gactcacggg gatttccaag 1920
tctccacccc attgacgtca atgggagttt gttttggcac caaaatcaac gggactttcc 1980
aaaatgtcgt aacaactccg ccccattgac gcaaatgggc ggtaggcgtg tacggtggga 2040
ggtctatata agcagagctc tctggctaac tagagaaccc actgcttact ggcttatcga 2100
aattaatacg actcactata gggagaccca agctggctag ttaagcttgc caccatggat 2160
tacaaggatg acgacgataa ggtgagcaag ggcgaggagc tgttcaccgg ggtggtgccc 2220
atcctggtcg agctggacgg cgacgtaaac ggccacaagt tcagcgtgtc cggcgagggc 2280
gagggcgatg ccacctaggg caagctgacc ctgaagttca tctgcaccac cggcaagctg 2340
cccgtgccct ggcccaccct cgtgaccacc ctgacctacg gcgtgcagtg cttcagccgc 2400
taccccgacc acatgaagca gcacgacttc ttcaagtccg ccatgcccga aggctacgtc 2460
caggagcgca ccatcttctt caaggacgac ggcaactaca agacccgcgc cgaggtgaag 2520
ttcgagggcg acaccctggt gaaccgcatc gagctgaagg gcatcgactt caaggaggac 2580
ggcaacatcc tggggcacaa gctggagtac aactacaaca gccacaacgt ctatatcatg 2640
gccgacaagc agaagaacgg catcaaggtg aacttcaaga tccgccacaa catcgaggac 2700
ggcagcgtgc agctcgccga ccactaccag cagaacaccc ccatcggcga cggccccgtg 2760
ctgctgcccg acaaccacta cctgagcacc cagtccgccc tgagcaaaga ccccaacgag 2820
aagcgcgatc acatggtcct gctggagttc gtgaccgccg ccgggatcac tctcggcatg 2880
gacgagctgt acaaggggcc cttcgaacaa aaactcatct cagaagagga tctgaatatg 2940
cataccggtc atcatcacca tcaccattga ggatccaatt ccgcccctct ccctcccccc 3000
cccctaacgt tactggccga agccgcttgg aataaggccg gtgtgcgttt gtctatatgt 3060
tattttccac catattgccg tcttttggca atgtgagggc ccggaaacct ggccctgtct 3120
tcttgacgag cattcctagg ggtctttccc ctctcgccaa aggaatgcaa ggtctgttga 3180
atgtcgtgaa ggaagcagtt cctctggaag cttcttgaag acaaacaacg tctgtagcga 3240
ccctttgcag gcagcggaac cccccacctg gcgacaggtg cctctgcggc caaaagccac 3300
gtgtataaga tacacctgca aaggcggcac aaccccagtg ccacgttgtg agttggatag 3360
ttgtggaaag agtcaaatgg ctctcctcaa gcgtattcaa caaggggctg aaggatgccc 3420
agaaggtacc ccattgtatg ggatctgatc tggggcctcg gtgcacatgc tttacatgtg 3480
tttagtcgag gttaaaaaac gtctaggccc cccgaaccac ggggacgtgg ttttcctttg 3540
aaaaacacga tgataagctt gccacaaccc acaaggagac gaccttccat gaaaaagcct 3600
gaactcaccg cgacgtctgt cgagaagttt ctgatcgaaa agttcgacag cgtctccgac 3660
ctgatgcagc tctcggaggg cgaagaatct cgtgctttca gcttcgatgt aggagggcgt 3720
ggatatgtcc tgcgggtaaa tagctgcgcc gatggtttct acaaagatcg ttatgtttat 3780
cggcactttg catcggccgc gctcccgatt ccggaagtgc ttgacattgg ggaattcagc 3840
gagagcctga cctattgcat ctcccgccgt gcacagggtg tcacgttgca agacctgcct 3900
gaaaccgaac tgcccgctgt tctgcagccg gtcgcggagg ccatggatgc gatcgctgcg 3960
gccgatctta gccagacgag cgggttcggc ccattcggac cgcaaggaat cggtcaatac 4020
actacatggc gtgatttcat atgcgcgatt gctgatcccc atgtgtatca ctggcaaact 4080
gtgatggacg acaccgtcag tgcgtccgtc gcgcaggctc tcgatgagct gatgctttgg 4140
gccgaggact gccccgaagt ccggcacctc gtgcacgcgg atttcggctc caacaatgtc 4200
ctgacggaca atggccgcat aacagcggtc attgactgga gcgaggcgat gttcggggat 4260
tcccaatacg aggtcgccaa catcttcttc tggaggccgt ggttggcttg tatggagcag 4320
cagacgcgct acttcgagcg gaggcatccg gagcttgcag gatcgccgcg gctccgggcg 4380
tatatgctcc gcattggtct tgaccaactc tatcagagct tggttgacgg caatttcgat 4440
gatgcagctt gggcgcaggg tcgatgcgac gcaatcgtcc gatccggagc cgggactgtc 4500
gggcgtacac aaatcgcccg cagaagcgcg gccgtctgga ccgatggctg tgtagaagta 4560
ctcgccgata gtggaaaccg acgccccagc actcgtccga gggcaaagga atgatctaga 4620
ggatcataat cagccatacc acatttgtag aggttttact tgctttaaaa aacctcccac 4680
acctccccct gaacctgaaa cataaaatga atgcaattgt tgttgttaac ttgtttattg 4740
cagcttataa tggttacaaa taaagcaata gcatcacaaa tttcacaaat aaagcatttt 4800
tttcactgca ttctagttgt ggtttgtcca aactcatcaa tgtatcttat catgtctgga 4860
tcgggctgca ggaattcgat atcaagctta tcgataatca acctctggat tacaaaattt 4920
gtgaaagatt gactggtatt cttaactatg ttgctccttt tacgctatgt ggatacgctg 4980
ctttaatgcc tttgtatcat gctattgctt cccgtatggc tttcattttc tcctccttgt 5040
ataaatcctg gttgctgtct ctttatgagg agttgtggcc cgttgtcagg caacgtggcg 5100
tggtgtgcac tgtgtttgct gacgcaaccc ccactggttg gggcattgcc accacctgtc 5160
agctcctttc cgggactttc gctttccccc tccctattgc cacggcggaa ctcatcgccg 5220
cctgccttgc ccgctgctgg acaggggctc ggctgttggg cactgacaat tccgtggtgt 5280
tgtcggggaa atcatcgtcc tttccttggc tgctcgcctg tgttgccacc tggattctgc 5340
gcgggacgtc cttctgctac gtcccttcgg ccctcaatcc agcggacctt ccttcccgcg 5400
gcctgctgcc ggctctgcgg cctcttccgc gtcttcgcct tcgccctcag acgagtcgga 5460
tctccctttg ggccgcctcc ccgcatcgat accgtcgact agccgtacct ttaagaccaa 5520
tgacttacaa ggcagctgta gatcttagcc actttttaaa agaaaagggg ggactggaag 5580
ggctaattca ctcccaaaga agacaagatc tgctttttgc ctgtactggg tctctctggt 5640
tagaccagat ctgagcctgg gagctctctg gctaactagg gaacccactg cttaagcctc 5700
aataaagctt gccttgagtg cttcaagtag tgtgtgcccg tctgttgtgt gactctggta 5760
actagagatc cctcagaccc ttttagtcag tgtggaaaat ctctagcaga attcgatatc 5820
aagcttatcg ataccgtcga cctcgagggg gggcccggta cccaattcgc cctatagtga 5880
gtcgtattac gctcactggc cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt 5940
acccaactta atcgccttgc agcacatccc cctttcgcca gctggcgtaa tagcgaagag 6000
gcccgcaccg atcgcccttc ccaacagttg cgcagcctga atggcgaatg gaaattgtaa 6060
gcgttaatat tttgttaaaa ttcgcgttaa atttttgtta aatcagctca ttttttaacc 6120
aataggccga aatcggcaaa atcccttata aatcaaaaga atagaccgag atagggttga 6180
gtgttgttcc agtttggaac aagagtccac tattaaagaa cgtggactcc aacgtcaaag 6240
ggcgaaaaac cgtctatcag ggcgatggcc cactacgtga accatcaccc taatcaagtt 6300
ttttggggtc gaggtgccgt aaagcactaa atcggaaccc taaagggagc ccccgattta 6360
gagcttgacg gggaaagccg gcgaacgtgg cgagaaagga agggaagaaa gcgaaaggag 6420
cgggcgctag ggcgctggca agtgtagcgg tcacgctgcg cgtaaccacc acacccgccg 6480
cgcttaatgc gccgctacag ggcgcgtcag gtggcacttt tcggggaaat gtgcgcggaa 6540
cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac 6600
cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg 6660
tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc 6720
tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg 6780
atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga 6840
gcacttttaa agttctgcta tgtggcgcgg tattatcccg tattgacgcc gggcaagagc 6900
aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag 6960
aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga 7020
gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg 7080
cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga 7140
atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg gcaacaacgt 7200
tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact 7260
ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt 7320
ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg 7380
ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta 7440
tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac 7500
tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta 7560
aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt 7620
tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt 7680
tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt 7740
gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc 7800
agataccaaa tactgttctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg 7860
tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg 7920
ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt 7980
cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac 8040
tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg 8100
acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg 8160
gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat 8220
ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccgcaacc ggccttttta 8280
cggttcctgg ccttttgctg gccttttgct cacatgtctt tcctgcgtta cccctgattc 8340
tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac 8400
cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct 8460
ccccgcgcgt tggccgattc attaatgcag ctggcacgac aggtttcccg actggaaagc 8520
gggcagtgag cgcaacgcaa ttaatgtgag ttagctcact cattaggcac cccaggcttt 8580
acactttatg cttccggctc gtatgttgtg tggaattgtg agcggataac aatttcacac 8640
aggaaacagc tatgaccatg attacgccaa gccgaattaa ccctcactaa agggaacaaa 8700
agctggagct ccaccgcggt ggcggcctcg aggtcgagat ccggtcgacc agcaaccata 8760
gtcccgcccc taactccgcc catcccgccc ctaactccgc ccagttccgc ccattctccg 8820
ccccatggct gactaatttt ttttatttat gcagaggccg aggccgcctc ggcctctgag 8880
ctattccaga agtagtgagg aggctttttt ggaggcctag gcttttgcaa aaagcttcga 8940
cggtatcgat tggctcatgt ccaacattac cgccatgttg acattgatta ttgactagtt 9000
attaatagta atcaattacg gggtcattag ttcatagccc atatatggag ttccgcgtta 9060
cataacttac ggtaaatggc ccgcctggct gaccgcccaa cgacccccgc ccattgacgt 9120
caataatgac gtatgttccc atagtaacgc caatagggac tttccattga cgtcaatggg 9180
tggagtattt acggtaaact gcccacttgg cagtacatca agtgtatcat atgccaagta 9240
cgccccctat tgacgtcaat gacggtaaat ggcccgcctg gcattatgcc cagtacatga 9300
ccttatggga ctttcctact tggcagtaca tctacgtatt agtcatcgct attaccatgg 9360
tgatgcggtt ttggcagtac atcaatgggc gtggatagcg gtttgactca cggggatttc 9420
caagtctcca ccccattgac gtcaatggga gtttgttttg gcaccaaaat caacgggact 9480
ttccaaaatg tcgtaacaac tccgccccat tgacgcaaat gggcggtagg cgtgtacgga 9540
attcggagtg gcgagccctc agatcctgca tataagcagc tgctttttgc ctgtatgggt 9600
ctctctg 9607
<210> 6
<211> 8483
<212> DNA
<213> Artificial Sequence
<400> 6
gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60
cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120
tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180
aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240
ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300
ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360
tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420
tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480
actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540
gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600
acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660
gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720
acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780
gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840
ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900
gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960
cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020
agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080
catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140
tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200
cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260
gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320
taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc 1380
ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440
tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500
ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560
cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620
agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680
gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740
atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800
gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860
gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920
ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980
cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040
cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100
acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160
cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220
accatgatta cgccaagctt gcatgcctgc aggtcgacga acgctgacgt catcaacccg 2280
ctccaaggaa tcgcgggccc agtgtcacta ggcgggaaca cccagcgcgc gtgcgccctg 2340
gcaggaagat ggctgtgagg gacaggggag tggcgccctg caatatttgc atgtcgctat 2400
gtgttctggg aaatcaccat aaacgtgaaa tgtctttgga tttgggaatc ttataagttc 2460
tgtatgagac cacagatccc cggaaacctg atcatgtaga tcgaatggac tctaaatccg 2520
ttcagccggg ttagattccc ggggtttccg ccatttttct cgacgacgcc gccatctcta 2580
ggcccgcgcc ggccccctcg cacagacttg tgggagaagc tcggctactc ccctgccccg 2640
gttaatttgc atataatatt tcctagtaac tatagaggct taatgtgcga taaaagacag 2700
ataatctgtt ctttttaata ctagctacat tttacatgat aggcttggat ttctataaga 2760
gatacaaata ctaaattatt attttaaaaa acagcacaaa aggaaactca ccctaactgt 2820
aaagtaattg tgtgttttga gactataaat atcccttgga gaaaagcctt gtttggaaac 2880
ctgatcatgt agatcgaatg gactctaaat ccgttcagcc gggttagatt cccggggttt 2940
ccgccatttt tctcgacaag gtcgggcagg aagagggcct atttcccatg attccttcat 3000
atttgcatat acgatacaag gctgttagag agataattag aattaatttg actgtaaaca 3060
caaagatatt agtacaaaat acgtgacgta gaaagtaata atttcttggg tagtttgcag 3120
ttttaaaatt atgttttaaa atggactatc atatgcttac cgtaacttga aagtatttcg 3180
atttcttggc tttatatatc ttgtggaaag gacgaaacac cggaaacctg atcatgtaga 3240
tcgaatggac tctaaatccg ttcagccggg ttagattccc ggggtttccg ccatttttct 3300
cgacgaacgc tgacgtcatc aacccgctcc aaggaatcgc gggcccagtg tcactaggcg 3360
ggaacaccca gcgcgcgtgc gccctggcag gaagatggct gtgagggaca ggggagtggc 3420
gccctgcaat atttgcatgt cgctatgtgt tctgggaaat caccataaac gtgaaatgtc 3480
tttggatttg ggaatcttat aagttctgta tgagaccaca gatccccgga aacctgatca 3540
tgtagatcga atggactcta aatccgttca gccgggttag attcccgggg tttccgccat 3600
ttttctcgac gacgccgcca tctctaggcc cgcgccggcc ccctcgcaca gacttgtggg 3660
agaagctcgg ctactcccct gccccggtta atttgcatat aatatttcct agtaactata 3720
gaggcttaat gtgcgataaa agacagataa tctgttcttt ttaatactag ctacatttta 3780
catgataggc ttggatttct ataagagata caaatactaa attattattt taaaaaacag 3840
cacaaaagga aactcaccct aactgtaaag taattgtgtg ttttgagact ataaatatcc 3900
cttggagaaa agccttgttt ggaaacctga tcatgtagat cgaatggact ctaaatccgt 3960
tcagccgggt tagattcccg gggtttccgc catttttctc gacaaggtcg ggcaggaaga 4020
gggcctattt cccatgattc cttcatattt gcatatacga tacaaggctg ttagagagat 4080
aattagaatt aatttgactg taaacacaaa gatattagta caaaatacgt gacgtagaaa 4140
gtaataattt cttgggtagt ttgcagtttt aaaattatgt tttaaaatgg actatcatat 4200
gcttaccgta acttgaaagt atttcgattt cttggcttta tatatcttgt ggaaaggacg 4260
aaacaccgga aacctgatca tgtagatcga atggactcta aatccgttca gccgggttag 4320
attcccgggg tttccgccat ttttctcgac tctagaggat ccctgcagta tttagcatgc 4380
cccacccatc tgcaaggcat tctggatagt gtcaaaacag ccggaaatca agtccgttta 4440
tctcaaactt tagcattttg ggaataaatg atatttgcta tgctggttaa attagatttt 4500
agttaaattt cctgctgaag ctctagtacg ataagtaact tgacctaagt gtaaagttga 4560
gatttccttc aggtttatat agcttgtgcg ccgcctgggt acctcggaaa cctgatcatg 4620
tagatcgaat ggactctaaa tccgttcagc cgggttagat tcccggggtt tccgccattt 4680
ttggatctaa ggtcgggcag gaagagggcc tatttcccat gattccttca tatttgcata 4740
tacgatacaa ggctgttaga gagataatta gaattaattt gactgtaaac acaaagatat 4800
tagtacaaaa tacgtgacgt agaaagtaat aatttcttgg gtagtttgca gttttaaaat 4860
tatgttttaa aatggactat catatgctta ccgtaacttg aaagtatttc gatttcttgg 4920
ctttatatat cttgtggaaa ggacgaaaca ccggaaacct gatcatgtag atcgaatgga 4980
ctctaaatcc gttcagccgg gttagattcc cggggtttcc gccatttttg gatctgaacg 5040
ctgacgtcat caacccgctc caaggaatcg cgggcccagt gtcactaggc gggaacaccc 5100
agcgcgcgtg cgccctggca ggaagatggc tgtgagggac aggggagtgg cgccctgcaa 5160
tatttgcatg tcgctatgtg ttctgggaaa tcaccataaa cgtgaaatgt ctttggattt 5220
gggaatctta taagttctgt atgagaccac agatccccgg aaacctgatc atgtagatcg 5280
aatggactct aaatccgttc agccgggtta gattcccggg gtttccgcca tttttggatc 5340
tctgcagtat ttagcatgcc ccacccatct gcaaggcatt ctggatagtg tcaaaacagc 5400
cggaaatcaa gtccgtttat ctcaaacttt agcattttgg gaataaatga tatttgctat 5460
gctggttaaa ttagatttta gttaaatttc ctgctgaagc tctagtacga taagtaactt 5520
gacctaagtg taaagttgag atttccttca ggtttatata gcttgtgcgc cgcctgggta 5580
cctcggaaac ctgatcatgt agatcgaatg gactctaaat ccgttcagcc gggttagatt 5640
cccggggttt ccgccatttt tggatctaag gtcgggcagg aagagggcct atttcccatg 5700
attccttcat atttgcatat acgatacaag gctgttagag agataattag aattaatttg 5760
actgtaaaca caaagatatt agtacaaaat acgtgacgta gaaagtaata atttcttggg 5820
tagtttgcag ttttaaaatt atgttttaaa atggactatc atatgcttac cgtaacttga 5880
aagtatttcg atttcttggc tttatatatc ttgtggaaag gacgaaacac cggaaacctg 5940
atcatgtaga tcgaatggac tctaaatccg ttcagccggg ttagattccc ggggtttccg 6000
ccatttttgg atctgaacgc tgacgtcatc aacccgctcc aaggaatcgc gggcccagtg 6060
tcactaggcg ggaacaccca gcgcgcgtgc gccctggcag gaagatggct gtgagggaca 6120
ggggagtggc gccctgcaat atttgcatgt cgctatgtgt tctgggaaat caccataaac 6180
gtgaaatgtc tttggatttg ggaatcttat aagttctgta tgagaccaca gatccccgga 6240
aacctgatca tgtagatcga atggactcta aatccgttca gccgggttag attcccgggg 6300
tttccgccat ttttggatct ccgggtaccc tgtgccttct agttgccagc catctgttgt 6360
ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc actcccactg tcctttccta 6420
ataaaatgag gaaattgcat cgcattgtct gagtaggtgt cattctattc tggggggtgg 6480
ggtggggcag gacagcaagg gggaggattg ggaagacaat agcaggcatg ctggggatgc 6540
ggtgggctct atggcttctg aggcggaaag aaccagctgg ggctctaggg ggtatcccca 6600
cgcgccctgt agcggcgcat taagcgcggc gggtgtggtg gttacgcgca gcgtgaccgc 6660
tacacttgcc agcgccctag cgcccgctcc tttcgctttc ttcccttcct ttctcgccac 6720
gttcgccggc tttccccgtc aagctctaaa tcggggcatc cctttagggt tccgatttag 6780
tgctttacgg cacctcgacc ccaaaaaact tgattagggt gatggttcac gtagtgggcc 6840
atcgccctga tagacggttt ttcgcccttt gacgttggag tccacgttct ttaatagtgg 6900
actcttgttc caaactggaa caacactcaa ccctatctcg gtctattctt ttgatttata 6960
agggattttg gggatttcgg cctattggtt aaaaaatgag ctgatttaac aaaaatttaa 7020
cgcgaattaa ttctgtggaa tgtgtgtcag ttagggtgtg gaaagtcccc aggctcccca 7080
ggcaggcaga agtatgcaaa gcatgcatct caattagtca gcaaccaggt gtggaaagtc 7140
cccaggctcc ccagcaggca gaagtatgca aagcatgcat ctcaattagt cagcaaccat 7200
agtcccgccc ctaactccgc ccatcccgcc cctaactccg cccagttccg cccattctcc 7260
gccccatggc tgactaattt tttttattta tgcagaggcc gaggccgcct ctgcctctga 7320
gctattccag aagtagtgag gaggcttttt tggaggccta ggcttttgca aaaagctccc 7380
gggagcttgt atatccattt tcggaattca tggccaagtt gaccagtgcc gttccggtgc 7440
tcaccgcgcg cgacgtcgcc ggagcggtcg agttctggac cgaccggctc gggttctccc 7500
gggacttcgt ggaggacgac ttcgccggtg tggtccggga cgacgtgacc ctgttcatca 7560
gcgcggtcca ggaccaggtg gtgccggaca acaccctggc ctgggtgtgg gtgcgcggcc 7620
tggacgagct gtacgccgag tggtcggagg tcgtgtccac gaacttccgg gacgcctccg 7680
ggccggccat gaccgagatc ggcgagcagc cgtgggggcg ggagttcgcc ctgcgcgacc 7740
cggccggcaa ctgcgtgcac ttcgtggccg aggagcagga ctgagcggga ctctggggtt 7800
cgaaatgacc gaccaagcga cgcccaacct gccatcacga gatttcgatt ccaccgccgc 7860
cttctatgaa aggttgggct tcggaatcgt tttccgggac gccggctgga tgatcctcca 7920
gcgcggggat ctcatgctgg agttcttcgc ccaccccaac ttgtttattg cagcttataa 7980
tggttacaaa taaagcaata gcatcacaaa tttcacaaat aaagcatttt tttcactgca 8040
ttctagttgt ggtttgtcca aactcatcaa tgtatcttat catgtctgac tggccgtcgt 8100
tttacaacgt cgtgactggg aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca 8160
tccccctttc gccagctggc gtaatagcga agaggcccgc accgatcgcc cttcccaaca 8220
gttgcgcagc ctgaatggcg aatggcgcct gatgcggtat tttctcctta cgcatctgtg 8280
cggtatttca caccgcatat ggtgcactct cagtacaatc tgctctgatg ccgcatagtt 8340
aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt gtctgctccc 8400
ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc 8460
accgtcatca ccgaaacgcg cga 8483
<210> 7
<211> 3754
<212> DNA
<213> Artificial Sequence
<400> 7
gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60
cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120
tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180
aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240
ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300
ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360
tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420
tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480
actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540
gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600
acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660
gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720
acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780
gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840
ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900
gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960
cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020
agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080
catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140
tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200
cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260
gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320
taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc 1380
ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440
tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500
ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560
cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620
agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680
gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740
atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800
gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860
gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920
ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980
cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040
cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100
acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160
cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220
accatgatta cgccaagctt gcatgcctgc aggtcgactc tagaggatcc ccgggtaccc 2280
tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt ccttgaccct 2340
ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat cgcattgtct 2400
gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg gggaggattg 2460
ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg aggcggaaag 2520
aaccagctgg ggctctaggg ggtatcccca cgcgccctgt agcggcgcat taagcgcggc 2580
gggtgtggtg gttacgcgca gcgtgaccgc tacacttgcc agcgccctag cgcccgctcc 2640
tttcgctttc ttcccttcct ttctcgccac gttcgccggc tttccccgtc aagctctaaa 2700
tcggggcatc cctttagggt tccgatttag tgctttacgg cacctcgacc ccaaaaaact 2760
tgattagggt gatggttcac gtagtgggcc atcgccctga tagacggttt ttcgcccttt 2820
gacgttggag tccacgttct ttaatagtgg actcttgttc caaactggaa caacactcaa 2880
ccctatctcg gtctattctt ttgatttata agggattttg gggatttcgg cctattggtt 2940
aaaaaatgag ctgatttaac aaaaatttaa cgcgaattaa ttctgtggaa tgtgtgtcag 3000
ttagggtgtg gaaagtcccc aggctcccca ggcaggcaga agtatgcaaa gcatgcatct 3060
caattagtca gcaaccaggt gtggaaagtc cccaggctcc ccagcaggca gaagtatgca 3120
aagcatgcat ctcaattagt cagcaaccat agtcccgccc ctaactccgc ccatcccgcc 3180
cctaactccg cccagttccg cccattctcc gccccatggc tgactaattt tttttattta 3240
tgcagaggcc gaggccgcct ctgcctctga gctattccag aagtagtgag gaggcttttt 3300
tggaggccta ggcttttgca aaaagctccc gggagcttgt atatccattt tcggaattca 3360
ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc 3420
cttgcagcac atcccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc 3480
ccttcccaac agttgcgcag cctgaatggc gaatggcgcc tgatgcggta ttttctcctt 3540
acgcatctgt gcggtatttc acaccgcata tggtgcactc tcagtacaat ctgctctgat 3600
gccgcatagt taagccagcc ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct 3660
tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt 3720
cagaggtttt caccgtcatc accgaaacgc gcga 3754
<210> 8
<211> 72
<212> DNA
<213> Artificial Sequence
<400> 8
ggaaacctga tcatgtagat cgaatggact ctaaatccgt tcagccgggt tagattcccg 60
gggtttccgc ca 72
<210> 9
<211> 1260
<212> DNA
<213> Artificial Sequence
<400> 9
atggataaaa aaccattaga tgttttaata tctgcgaccg ggctctggat gtccaggact 60
ggcacgctcc acaaaatcaa gcaccatgag gtctcaagaa gtaaaatata cattgaaatg 120
gcgtgtggag accatcttgt tgtgaataat tccaggagtt gtagaacagc cagagcattc 180
agacatcata agtacagaaa aacctgcaaa cgatgtaggg tttcggacga ggatatcaat 240
aattttctca caagatcaac cgaaagcaaa aacagtgtga aagttagggt agtttctgct 300
ccaaaggtca aaaaagctat gccgaaatca gtttcaaggg ctccgaagcc tctggaaaat 360
tctgtttctg caaaggcatc gacgaacaca tccagatctg taccttcgcc tgcaaaatca 420
actccaaatt cgtctgttcc cgcatcggct cctgctcctt cacttacaag aagccagctt 480
gatagggttg aggctctctt aagtccagag gataaaattt ctctaaatat ggcaaagcct 540
ttcagggaac ttgagcctga acttgtgaca agaagaaaaa acgattttca gcggctctat 600
accaatgata gagaagacta cctcggtaaa ctcgaacgtg atattacgaa atttttcgta 660
gaccggggtt ttctggagat aaagtctcct atccttattc cggcggaata cgtggagaga 720
atgggtatta ataatgatac tgaactttca aaacagatct tccgggtgga taaaaatctc 780
tgcttgaggc caatgcttgc cccgactctg tataactatg cgcgaaaact cgataggatt 840
ttaccaggcc caataaaaat tttcgaagtc ggaccttgtt accggaaaga gtctgacggc 900
aaagagcacc tggaagaatt tactatggtg aacttcagtc agatgggttc gggatgtact 960
cgggaaaatc ttgaagctct catcaaagag tttctggact atctggaaat cgacttcgaa 1020
atcgtaggag attcctgtat ggtctttggg gatactcttg atataatgca cggggacctg 1080
gagctttctt cggcagtcgt cgggccagtt tctcttgata gagaatgggg tattgacaaa 1140
ccatggatag gtgcaggttt tggtcttgaa cgcttgctca aggttatgca cggctttaaa 1200
aacattaaga gggcatcaag gtccgaatct tactataatg ggatttcaac caatctataa 1260

Claims (15)

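1. A cell line for introducing an unnatural amino acid at any site of a protein or peptide, characterized in that the cell line carries a pyrrolysyl-tRNA synthetase gene and a tRNA (tRNAPyl) derived from an archaeal methanococcus.
2. The cell line according to claim 1, characterized in that the tRNAPyl is present as multiple copies of promoter-tRNAPyl.
3. The cell line according to claim 2, characterized in that the tRNAPyl is present as 12 copies of tRNAPyl driven by type-3 Pol III promoters.
4. The cell line according to any one of claims 1-3, characterized in that the pyrrolysyl-tRNA synthetase gene is as shown in SEQ ID NO:9.
5. The cell line according to any one of claims 1-3, characterized in that the tRNAPyl is derived from the vector pXH-12tRNA-zeo, whose sequence is shown in SEQ ID NO:6.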
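6. The cell line according to any one of claims 1-5, which is obtained by the following steps:
(1) ligating the pyrrolysyl-tRNA synthetase gene into pSD31-IRES-puro as shown in SEQ ID NO:2 to obtain the viral vector pSD31-pylRS-IRES-puro as shown in SEQ ID NO:4;
(2) ligating a mutated green fluorescent protein gene into pSD31-IRES-hygro as shown in SEQ ID NO:3 to obtain the viral vector pSD31-GFP39TAG-IRES-hygro as shown in SEQ ID NO:5;
(3) packaging the viral vectors pSD31-pylRS-IRES-puro and pSD31-GFP39TAG-IRES-hygro of (1) and (2), transducing HEK 293T cells, and selecting with puromycin and hygromycin B, respectively, to obtain a stable cell line in which the pyrrolysyl-tRNA synthetase gene and the mutant green fluorescent protein reporter gene are integrated;
(4) linearizing the vector pXH-12tRNA-zeo, whose sequence is shown in SEQ ID NO:6, transfecting it into the stable cell line obtained in (3), and selecting by means of the bleomycin resistance gene that the vector carries;
(5) adding the unnatural amino acid to the culture medium, picking single clones that show green fluorescence, and expanding them in culture to finally obtain the stable cell line.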
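Step (4) relies on pXH-12tRNA-zeo (SEQ ID NO:6), which claim 3 describes as carrying 12 promoter-driven copies of tRNAPyl. As a minimal sketch, one way to check a copy number in a vector sequence is to count occurrences of the 72-nt sequence listed as SEQ ID NO:8. The names `clean` and `count_trna_copies` and the toy construct are hypothetical and not part of the patent, and only the given strand is searched.

```python
import re

# 72-nt sequence copied from SEQ ID NO:8 of the sequence listing.
TRNA_PYL = (
    "ggaaacctgatcatgtagatcgaatggactctaaatccgttcagccgggttagattcccg"
    "gggtttccgcca"
)

def clean(seq: str) -> str:
    """Drop spaces, line breaks and position numbers from listing-style text."""
    return re.sub(r"[^acgtACGT]", "", seq).lower()

def count_trna_copies(vector: str, unit: str = TRNA_PYL) -> int:
    """Count non-overlapping occurrences of `unit` on the given strand only."""
    return clean(vector).count(clean(unit))

# Toy construct (hypothetical): three units separated by arbitrary spacers.
toy_vector = "ttttt".join(["acgtacgt" + TRNA_PYL] * 3)
print(len(TRNA_PYL), count_trna_copies(toy_vector))  # prints "72 3"
```

Pasting the full text of a listing entry (spaces and position numbers included) as `vector` works the same way, because `clean` strips everything except the four bases first.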
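7. The stable cell line obtained by the method of claim 6, which is HEK293-PYL, deposited under accession number CGMCC No:11592.
8. The viral vector of claim 6 whose sequence is shown in SEQ ID NO:2,
the viral vector of claim 6 whose sequence is shown in SEQ ID NO:3, or
the vector of claim 6 whose sequence is shown in SEQ ID NO:6.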
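9. A method for preparing a protein or peptide containing an unnatural amino acid using the cell line according to any one of claims 1-7, comprising the steps of:
(1) selecting, in the amino acid sequence of the target protein, one or more amino acid sites at which mutation is desired;
(2) in the nucleic acid molecule encoding the target protein of (1), mutating the codon of the amino acid at each site selected in (1) to the amber codon UAG;
(3) operably linking the mutated nucleic acid obtained in (2) to a suitable vector to obtain an expression vector for the mutated nucleic acid;
(4) transfecting the expression vector obtained in (3) into the cell line according to any one of claims 1-6, culturing the successfully transfected host cells in medium containing NAEK, and harvesting the cells at an appropriate time;
(5) assaying the activity of the target protein containing the unnatural amino acid.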
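10. A protein or peptide obtained by the method according to claim 9, in which the amino acid at at least one site is mutated to an unnatural amino acid, characterized in that the unnatural amino acid is the azide-containing unnatural amino acid Lys-azido (NAEK)
Figure FDA0002885359950000021
or the unnatural amino acid Lys-diazirine (DiZPK), which contains a photo-crosslinking group
Figure FDA0002885359950000022
11. The site-specifically mutated protein or peptide according to claim 10, which is luciferase, the mutation site being the amino acid at any one or more sites of luciferase.
12. The site-specifically mutated protein or peptide according to claim 11, wherein the mutation site is selected from: position F14 of the luciferase encoded by the sequence shown in SEQ ID NO:1.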
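13. The site-specifically mutated protein or peptide according to any one of claims 9-12, wherein the amino acid at position N is mutated to NAEK, and the amino acid at position N is linked within the protein or peptide as shown in the following formula:
Figure FDA0002885359950000031
The direction from R1 to R2 is the N-terminus to C-terminus direction of the amino acid sequence; R1 is the amino acid residues from position 1 to position N-1,
R2 is the amino acid residues from position N+1 to the C-terminus, and R4 is
Figure FDA0002885359950000032
14. A nucleic acid molecule encoding the mutant protein or peptide according to any one of claims 9-12.
15. The nucleic acid molecule of the mutant protein or peptide according to claim 14, characterized in that the codon encoding the unnatural amino acid is the amber codon UAG.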
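Steps (1) and (2) of claim 9, together with claim 15, amount to replacing the codon at a chosen position with the amber codon (UAG in mRNA, TAG in the DNA coding strand). The sketch below illustrates that substitution on a toy open reading frame. The function name `introduce_amber`, the toy sequence, and the 1-based codon numbering that counts the start codon as position 1 and assumes the last codon is the stop are all assumptions made for illustration, not details taken from the patent.

```python
AMBER_DNA = "TAG"  # DNA coding-strand form of the amber stop codon UAG

def introduce_amber(cds: str, position: int) -> str:
    """Return `cds` with the codon at 1-based `position` replaced by TAG."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    n_codons = len(cds) // 3
    # The start codon (position 1) and the final stop codon are left alone.
    if not 2 <= position <= n_codons - 1:
        raise ValueError("position must be an internal codon")
    start = 3 * (position - 1)
    return cds[:start] + AMBER_DNA + cds[start + 3:]

toy_cds = "ATGGCTTTCAAATAA"  # hypothetical ORF: Met-Ala-Phe-Lys-stop
mutant = introduce_amber(toy_cds, 3)  # replace the Phe codon (TTC)
print(mutant)  # prints "ATGGCTTAGAAATAA"
```

In the cell line of the claims, such an internal TAG would be read through only when the unnatural amino acid is supplied; otherwise it acts as a stop codon and truncates the product.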
CN202110012018.3A 2016-01-27 2016-01-27 Construction of stable cell lines carrying orthogonal tRNA/aminoacyl-tRNA synthetase Pending CN112725282A (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110012018.3A CN112725282A (zh) 2016-01-27 2016-01-27 Construction of stable cell lines carrying orthogonal tRNA/aminoacyl-tRNA synthetase

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
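CN202110012018.3A CN112725282A (zh) 2016-01-27 2016-01-27 Construction of stable cell lines carrying orthogonal tRNA/aminoacyl-tRNA synthetase
CN201610055542.8A CN107012121B (zh) 2016-01-27 2016-01-27 Construction of stable cell lines carrying orthogonal tRNA/aminoacyl-tRNA synthetase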

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN201610055542.8A Division CN107012121B (zh) Construction of stable cell lines carrying orthogonal tRNA/aminoacyl-tRNA synthetase 2016-01-27 2016-01-27

Publications (1)

Publication Number Publication Date
CN112725282A true CN112725282A (zh) 2021-04-30

Family

ID=59438847

Family Applications (2)

Application Number Title Priority Date Filing Date
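CN202110012018.3A Pending CN112725282A (zh) 2016-01-27 2016-01-27 Construction of stable cell lines carrying orthogonal tRNA/aminoacyl-tRNA synthetase
CN201610055542.8A Active CN107012121B (zh) 2016-01-27 2016-01-27 Construction of stable cell lines carrying orthogonal tRNA/aminoacyl-tRNA synthetase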

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN201610055542.8A Active CN107012121B (zh) 2016-01-27 2016-01-27 Construction of stable cell lines carrying orthogonal tRNA/aminoacyl-tRNA synthetase

Country Status (1)

Country Link
CN (2) CN112725282A (zh)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
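CN111304234A (zh) * 2020-02-27 2020-06-19 Jiangnan University Unnatural amino acid utilization tool suitable for Bacillus subtilis
CN113481239A (zh) * 2021-07-01 2021-10-08 West China Hospital of Sichuan University Method for introducing an unnatural amino acid encoding system into a cell line via the Rosa26 locus, and cell line thereof
CN114107394A (zh) * 2021-11-05 2022-03-01 Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences Lentiviral transfer vector, cell line expressing PylRS and tRNACUA, and preparation method and use thereof
CN114540308A (zh) * 2021-10-26 2022-05-27 Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences Cell line stably expressing orthogonal aminoacyl-tRNA synthetase/tRNA and construction method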

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
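CN110835633B (zh) * 2018-08-13 2021-10-01 Peking University Preparation of PTC stable cell lines using an optimized genetic codon expansion system, and use thereof
CN110846311A (zh) * 2018-08-20 2020-02-28 Peking University Preparation of PTC stable cell lines using a suppressor tRNA system, and use thereof
CN111850020B (zh) * 2019-04-25 2021-05-07 Suzhou Kunpeng Biotech Co., Ltd. Introduction of unnatural amino acids into proteins using a plasmid system
CN111849929B (zh) * 2019-04-30 2021-05-11 Suzhou Kunpeng Biotech Co., Ltd. Aminoacyl-tRNA synthetase for efficient introduction of lysine derivatives
CN110172467B (zh) * 2019-05-24 2021-03-16 Zhejiang University Construction of an orthogonal aminoacyl-tRNA synthetase/tRNA system using a chimeric design method
CN114908066B (zh) * 2022-05-17 2024-01-23 Hangzhou Qianhua Hesheng Pharmaceutical Technology Co., Ltd. (杭州嵌化合生医药科技有限公司) Orthogonal translation system and its use in reassigning codons to restore functional protein expression in PTC diseases
CN115261344B (zh) * 2022-08-29 2023-07-21 Peking University Ionic liquid based on unnatural amino acids, preparation method and use thereof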

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060166319A1 (en) * 2004-08-13 2006-07-27 Chan Michael K Charging tRNA with pyrrolysine
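CN101535338A (zh) * 2006-10-18 2009-09-16 The Scripps Research Institute Genetic incorporation of unnatural amino acids into proteins in mammalian cells
CN102838671A (zh) * 2011-06-23 2012-12-26 Peking University Site-directed mutated and site-specifically modified growth hormone, preparation method and use thereof
CN102838663A (zh) * 2011-06-23 2012-12-26 Peking University Site-directed mutated and site-specifically modified viral membrane protein, preparation method and use thereof
CN104099360A (zh) * 2013-04-12 2014-10-15 Peking University Preparation of target proteins or peptides labeled with unnatural amino acids
CN105026574A (zh) * 2012-09-24 2015-11-04 MedImmune Limited Cell lines
CN106929482A (zh) * 2015-12-31 2017-07-07 Peking University Site-directed mutated influenza virus, live vaccine thereof, and preparation method and use thereof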

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060166319A1 (en) * 2004-08-13 2006-07-27 Chan Michael K Charging tRNA with pyrrolysine
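CN101535338A (zh) * 2006-10-18 2009-09-16 The Scripps Research Institute Genetic incorporation of unnatural amino acids into proteins in mammalian cells
CN102838671A (zh) * 2011-06-23 2012-12-26 Peking University Site-directed mutated and site-specifically modified growth hormone, preparation method and use thereof
CN102838663A (zh) * 2011-06-23 2012-12-26 Peking University Site-directed mutated and site-specifically modified viral membrane protein, preparation method and use thereof
CN105026574A (zh) * 2012-09-24 2015-11-04 MedImmune Limited Cell lines
CN104099360A (zh) * 2013-04-12 2014-10-15 Peking University Preparation of target proteins or peptides labeled with unnatural amino acids
CN106929482A (zh) * 2015-12-31 2017-07-07 Peking University Site-directed mutated influenza virus, live vaccine thereof, and preparation method and use thereof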

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
NCBI: "Methanosarcina barkeri strain MS tRNA-Pyl gene, complete sequence; and PylS (pylS) gene, complete cds,GenBank:AY273828.1", 《NCBI》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
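CN111304234A (zh) * 2020-02-27 2020-06-19 Jiangnan University Unnatural amino acid utilization tool suitable for Bacillus subtilis
CN113481239A (zh) * 2021-07-01 2021-10-08 West China Hospital of Sichuan University Method for introducing an unnatural amino acid encoding system into a cell line via the Rosa26 locus, and cell line thereof
CN114540308A (zh) * 2021-10-26 2022-05-27 Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences Cell line stably expressing orthogonal aminoacyl-tRNA synthetase/tRNA and construction method
CN114107394A (zh) * 2021-11-05 2022-03-01 Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences Lentiviral transfer vector, cell line expressing PylRS and tRNACUA, and preparation method and use thereof
CN114107394B (zh) * 2021-11-05 2024-01-30 Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences Lentiviral transfer vector, cell line expressing PylRS and tRNACUA, and preparation method and use thereof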

Also Published As

Publication number Publication date
CN107012121B (zh) 2021-01-26
CN107012121A (zh) 2017-08-04

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication (application publication date: 20210430)