CN112575033B - Crispr系统及其在构建scn1a基因突变的癫痫性脑病克隆猪核供体细胞中的应用 - Google Patents

Crispr系统及其在构建scn1a基因突变的癫痫性脑病克隆猪核供体细胞中的应用 Download PDF

Info

Publication number
CN112575033B
CN112575033B CN202011554428.2A CN202011554428A CN112575033B CN 112575033 B CN112575033 B CN 112575033B CN 202011554428 A CN202011554428 A CN 202011554428A CN 112575033 B CN112575033 B CN 112575033B
Authority
CN
China
Prior art keywords
leu
scn1a
ser
phe
ile
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011554428.2A
Other languages
English (en)
Other versions
CN112575033A (zh
Inventor
牛冬
汪滔
陶裴裴
刘瑜
曾为俊
王磊
程锐
赵泽英
马翔
黄彩云
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Qizhen Genetic Engineering Co Ltd
Original Assignee
Nanjing Qizhen Genetic Engineering Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Qizhen Genetic Engineering Co Ltd filed Critical Nanjing Qizhen Genetic Engineering Co Ltd
Priority to CN202011554428.2A priority Critical patent/CN112575033B/zh
Publication of CN112575033A publication Critical patent/CN112575033A/zh
Application granted granted Critical
Publication of CN112575033B publication Critical patent/CN112575033B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • C12N15/8509Vectors or expression systems specially adapted for eukaryotic hosts for animal cells for producing genetically modified animals, e.g. transgenic
    • AHUMAN NECESSITIES
    • A01AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
    • A01KANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
    • A01K67/00Rearing or breeding animals, not otherwise provided for; New breeds of animals
    • A01K67/027New breeds of vertebrates
    • A01K67/0275Genetically modified vertebrates, e.g. transgenic
    • A01K67/0276Knockout animals
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/113Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/65Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression using markers
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • C12N5/06Animal cells or tissues; Human cells or tissues
    • C12N5/0602Vertebrate cells
    • C12N5/0652Cells of skeletal and connective tissues; Mesenchyme
    • C12N5/0656Adult fibroblasts
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22Ribonucleases RNAses, DNAses
    • AHUMAN NECESSITIES
    • A01AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
    • A01KANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
    • A01K2217/00Genetically modified animals
    • A01K2217/07Animals genetically altered by homologous recombination
    • A01K2217/075Animals genetically altered by homologous recombination inducing loss of function, i.e. knock out
    • AHUMAN NECESSITIES
    • A01AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
    • A01KANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
    • A01K2227/00Animals characterised by species
    • A01K2227/10Mammal
    • A01K2227/108Swine
    • AHUMAN NECESSITIES
    • A01AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
    • A01KANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
    • A01K2267/00Animals characterised by purpose
    • A01K2267/03Animal model, e.g. for test or diseases
    • A01K2267/035Animal model for multifactorial diseases
    • A01K2267/0356Animal model for processes and diseases of the central nervous system, e.g. stress, learning, schizophrenia, pain, epilepsy
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/01Fusion polypeptide containing a localisation/targetting motif
    • C07K2319/09Fusion polypeptide containing a localisation/targetting motif containing a nuclear localisation signal
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2310/00Structure or type of the nucleic acid
    • C12N2310/10Type of nucleic acid
    • C12N2310/20Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2510/00Genetically modified cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2517/00Cells related to new breeds of animals
    • C12N2517/02Cells from transgenic animals
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/10Plasmid DNA
    • C12N2800/106Plasmid DNA for vertebrates
    • C12N2800/107Plasmid DNA for vertebrates for mammalian
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2830/00Vector systems having a special element relevant for transcription
    • C12N2830/48Vector systems having a special element relevant for transcription regulating transport or export of RNA, e.g. RRE, PRE, WPRE, CTE
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2830/00Vector systems having a special element relevant for transcription
    • C12N2830/50Vector systems having a special element relevant for transcription regulating RNA stability, not being an intron, e.g. poly A signal
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A50/00TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
    • Y02A50/30Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change

Abstract

本发明公开了一种CRISPR/Cas9系统及其在构建SCN1A基因突变的克隆猪核供体细胞中的应用。一种用于猪SCN1A基因编辑的CRISPR/Cas9系统,包含序列如SEQ ID NO.2所示的Cas9高效表达载体pKG‑GE3和针对猪SCN1A基因的gRNA表达载体,该载体以序列如SEQ ID NO.3所示的pKG‑U6gRNA为载体骨架,表达序列如SEQ ID NO.36所示的gRNA。采用本发明筛选的gRNA联合改造的Cas9高效表达载体进行基因编辑,编辑效率比原载体有了显著的提高。

Description

CRISPR系统及其在构建SCN1A基因突变的癫痫性脑病克隆猪 核供体细胞中的应用
技术领域
本发明属于生物技术领域,具体涉及用于SCN1A基因编辑的CRISPR/Cas9系统及其应用。
背景技术
癫痫(Epilepsy)即俗称的“羊角风”或“羊癫风”,是大脑神经元突发性异常放电导致短暂的大脑功能障碍的一种慢性疾病。由于异常放电的起始部位和传递方式的不同,癫痫发作的临床表现复杂多样,可表现为发作性运动、感觉、自主神经、意识及精神障碍等症状。癫痫患者经过正规的抗癫痫药物治疗,约70%的患者其发作是可以得到控制的,其中50%~60%的患者经2~5年的治疗可以痊愈,患者可以和正常人一样地工作和生活。癫痫的发病机制非常复杂,中枢神经系统兴奋与抑制间的不平衡导致癫痫发作,其主要与离子通道神经递质及神经胶质细胞的改变有关。
癫痫性脑病(Epileptic encephalopathies,EEs)是癫痫性异常导致进展性脑部功能障碍,是一种儿童期常见的脑部疾病,以多种严重的癫痫症候群为特征的神经系统功能紊乱性疾病。EEs是发生在婴儿和儿童的严重痉挛性疾病,首次发病多见儿童、青年时期,男性高于女性,儿童死亡率是成人的10倍,然而病因、发病机制仍不十分明确,年龄依赖性是EEs的共同特征,早发性癫痫性脑病(Early-onset epileptic encephalopathies,EOEE)包括早发性肌阵挛脑病(Early myoclonic epileptic encephalopathy,EMEE)、大田原综合征(Ohtahara syndrome,OS)、婴儿恶性游走性部分性癫痫(Malignant migratingpartial seizures in infancy,MMPSI)、West综合征及Dravet综合征。
Dravet综合征,以前称为严重的婴儿肌阵挛性癫痫(Severe myoclonic epilepsyof infancy,SMEI),由Dravet在1978年首次描述,患儿均于1岁内起病,起病高峰年龄为生后6个月内。最常见的首发症状为发热诱发的较长时间惊厥发作(30分钟以上),表现为单侧肢体阵挛或双侧强直阵挛发作,部分患儿首次发作前有接种疫苗诱因。1岁后逐渐出现无热惊厥,但仍有热敏感特点(遇到发热性疾病或环境温度过热即可出现癫痫发作),癫痫发作可以表现为多种形式,包括局灶性发作、全面强直阵挛发作(癫痫大发作)、肌阵挛发作、不典型失神发作等,易出现癫痫持续状态。患儿起病前发育多正常,发病后通常会经历语言和运动技能发展滞后、多动症和睡眠困难、慢性感染、生长和平衡问题以及与他人相处困难等问题,60%患儿有共济失调表现,且不会随着时间的推移而消失,被诊断为Dravet综合征的儿童需要有极大耐心和密切关注他们生活的看护人。在以后的生活中,Dravet综合征患者会出现无症状的癫痫发作,包括肌阵挛性发作、强直性-阵挛性发作、失神、简单和复杂的局部发作。在70%的Dravet综合征患者中发现了SCN1A基因突变,该基因编码神经元电压门控钠离子通道Na(V)1.1α亚基蛋白(电压门控钠离子通道是可兴奋细胞中动作电位产生和传导所必需的一个离子通道,主要发生在神经和肌肉),特异性定位于神经元胞体。因此,本发明基于SCN1A基因突变开发了克隆猪核移植供体细胞,后期通过体细胞核移植动物克隆技术培育出活体猪模型,可用于进行Dravet综合征发病机制研究,并为相关药物的开发及临床应用提供有效的实验数据,也为成功治疗人类相关疾病奠定基础。
基因编辑是近年来不断取得重大发展的一种生物技术,其包括从基于同源重组的基因编辑到基于核酸酶的ZFN、TALEN、CRISPR/Cas9等编辑技术,其中CRISPR/Cas9技术是当前最先进的基因编辑技术。目前,基因编辑技术被越来越多地应用到动物模型的制作上。猪作为大动物,是人类长期以来主要的肉食供应动物,其体型大小和生理功能与人类近似,易于大规模繁殖饲养,而且在伦理道德及动物保护等方面要求较低,是理想的人类疾病模型动物。
发明内容
本发明的目的是针对现有技术的上述不足,提供一种用于SCN1A基因编辑的CRISPR/Cas9系统。
本发明的另一目的是提供用于SCN1A基因编辑的gRNA及其表达载体。
本发明的又一目的是提供所述的CRISPR/Cas9系统在构建SCN1A基因突变的猪重组细胞中的应用。
本发明的目的可通过以下技术方案实现:
一种用于猪SCN1A编辑的CRISPR/Cas9系统,包含Cas9表达载体和针对猪SCN1A的gRNA表达载体;所述的Cas9表达载体为质粒全序列如SEQ ID NO.2所示的pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO载体。
为了增加Cas9质粒的基因编辑能力,我们在购自addgene(Plasmid#42230,fromZhang Feng lab)pX330-U6-Chimeric_BB-CBh-hSpCas9(简称PX330)载体的基础上进行改造得到pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO(简称粒pKG-GE3)。PX330的图谱如图1,改造方式如下:
1)去除原载体gRNA骨架中多余无效的序列;
2)改造启动子:将原有启动子(chickenβ-actin启动子)改造为具更高表达活性的EF1a启动子,增加Cas9基因的蛋白表达能力;
3)增加核定位信号:在Cas9的N端及C端均增加核定位信号编码序列(NLS),增加Cas9的核定位能力;
4)增加双筛选标记:原载体无任何筛选标记,不利于阳性转化细胞的筛选和富集,在Cas9的C端,插入P2A-EGFP-T2A-PURO,赋予载体荧光和抗性筛选能力;
5)插入WPRE和3’LTR等调控基因表达的序列:在基因读码框最后插入WPRE、3’LTR等序列,可增强Cas9基因的蛋白翻译能力。
改造后载体pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO(简称pKG-GE3)及改造位点如图2,质粒全序列如SEQ ID NO:2所示;pKG-GE3的主要元件有:
1)gRNA表达元件:U6 gRNA scaffold;
2)启动子:EF1a启动子和CMV增强子;
3)含多个NLS的Cas9基因:含N端和C端多核定位信号(NLS)的Cas9基因;
4)筛选标记基因:荧光和抗性双筛选标记元件P2A-EGFP-T2A-PURO;
5)增强翻译的元件:WPRE和3’LTR增强Cas9及筛选标记基因的翻译效率;
6)转录终止信号:bGH polyA signal;
7)载体骨架:包括Amp抗性元件和ori复制子等。
质粒pKG-GE3中,具有特异融合基因;所述特异融合基因编码特异融合蛋白;
所述特异融合蛋白自N端至C端依次包括如下元件:两个核定位信号(NLS)、Cas9蛋白、两个核定位信号、自剪切多肽P2A、荧光报告蛋白、自裂解多肽T2A、抗性筛选标记蛋白;
质粒pKG-GE3中,由EF1a启动子启动所述特异融合基因的表达;
质粒pKG-GE3中,所述特异融合基因下游具有WPRE序列元件、3’LTR序列元件和bGHpoly(A)signal序列元件。
质粒pKG-GE3中,依次具有如下元件:CMV增强子、EF1a启动子、所述特异融合基因、WPRE序列元件、3’LTR序列元件、bGH poly(A)signal序列元件。
所述特异融合蛋白中,Cas9蛋白上游的两个核定位信号为SV40核定位信号,Cas9蛋白下游的两个核定位信号为nucleoplasmin核定位信号。
所述特异融合蛋白中,荧光报告蛋白具体可为EGFP蛋白。
所述特异融合蛋白中,抗性筛选标记蛋白具体可为Puromycin蛋白。
自剪切多肽P2A的氨基酸序列为“ATNFSLLKQAGDVEENPGP”(发生自剪切的断裂位置为C端开始第一个氨基酸残基和第二个氨基酸残基之间)。
自裂解多肽T2A的氨基酸序列为“EGRGSLLTCGDVEENPGP”(发生自裂解的断裂位置为C端开始第一个氨基酸残基和第二个氨基酸残基之间)。
特异融合基因具体如SEQ ID NO:2中第911-6706位核苷酸所示。
CMV增强子如SEQ ID NO:2中第395-680位核苷酸所示。
EF1a启动子如SEQ ID NO:2中第682-890位核苷酸所示。
WPRE序列元件如SEQ ID NO:2第6722-7310位核苷酸所示。
3’LTR序列元件如SEQ ID NO:2中第7382-7615位核苷酸所示。
bGH poly(A)signal序列元件如SEQ ID NO:2中第7647-7871位核苷酸所示。
作为本发明的一种优选,针对猪SCN1A的gRNA表达载体的载体骨架为pKG-U6gRNA,质粒全序列如SEQ ID NO.3所示。
作为本发明的进一步优选,该表达载体表达SEQ ID NO.36所示的gRNA,其靶点如SEQ ID NO.21所示。
作为本发明的一种优选,所述的针对猪SCN1A的gRNA表达载体由SEQ ID NO.26和SEQ ID NO.27所示的单链DNA退火而成的双链插入载体骨架pKG-U6gRNA所得。
作为本发明的更进一步优选,所述的gRNA表达载体和Cas9表达载体的摩尔比为1~3:1,进一步优选3:1。
一种重组细胞,由本发明所述的CRISPR/Cas9系统共转染猪原代成纤维细胞经验证后所得。
本发明所述的重组细胞在构建SCN1A敲除的克隆猪中的应用;优选在构建SCN1A敲除的癫痫性脑病克隆猪中的应用。
针对猪SCN1A的gRNA,序列如SEQ ID NO.22所示。
一种针对猪SCN1A的gRNA表达载体,该表达载体表达SEQ ID NO.36所示的gRNA;且该表达载体的载体骨架为pKG-U6gRNA,质粒全序列如SEQ ID NO.3所示;所述的gRNA表达载体优选由SEQ ID NO.26和SEQ ID NO.27所示的单链DNA退火而成的双链插入载体骨架pKG-U6gRNA所得。
本发明所述的CRISPR/Cas9系统、本发明所述的gRNA表达载体在构建猪SCN1A突变的猪重组细胞中的应用。
与现有技术相比,本发明至少具有如下有益效果:
(1)本发明研究对象(猪)比其他动物(大小鼠、灵长类)具有更好的应用性。
大小鼠等啮齿类动物不论从生理、病理及体型上,均与人相差巨大,无法真实地模拟人类正常的生理、病理状态。灵长类动物扩繁速度慢、数量少、成本高,动物保护及伦理等方面的要求也较高。而猪就没有上述缺点,同时猪的克隆技术非常成熟,饲养及克隆成本较灵长类低得多。因此猪是非常适合作为人类疾病模型的动物。
(2)本发明中经过实验验证改造的pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO载体相对改造前的pX330载体,更换了更强的启动子及添加了增强蛋白翻译的元件,提高了Cas9的表达,并且增加了核定位信号个数,提高了Cas9蛋白的核定位能力,具有更高的基因编辑效率。本发明还在载体中加入了荧光标记及抗性标记,使其更方便运用于载体阳性转化细胞的筛选及富集。采用本发明筛选的gRNA联合改造的Cas9高效表达载体进行基因编辑,编辑效率比原载体提高100%以上。
(3)本发明针对SCN1A基因的不同靶点gRNA设计了相应的表达载体,通过筛选获得具有较高编辑效率的gRNA及其表达载体。配合本发明改造的Cas9高效表达载体进行基因编辑,通过靶标基因PCR产物测序结果可分析出所获细胞的基因型(包括双等位基因相同变异的纯合突变、双等位基因不同变异的纯合突变、杂合突变型或野生型),获得纯合突变的概率为20%~50%,大大优于使用胚胎注射技术的模型制备方法(即受精卵注射基因编辑材料)中获得纯合突变的概率(低于5%)。
(4)利用本发明所得到的纯合突变单细胞克隆株进行体细胞核移植动物克隆可直接得到含靶标基因纯合突变的克隆猪,并且该纯合突变可稳定遗传。
本发明采用技术难度大、挑战性高的原代细胞体外编辑并筛选阳性编辑单细胞克隆的方法,后期再通过体细胞核移植动物克隆技术直接获得相应疾病模型猪,可大大缩短模型猪制作周期并节省人力、物力、财力。
附图说明
图1为质粒pX330的结构示意图。
图2为质粒pU6gRNACas9的结构示意图。
图3为pU6gRNA-eEF1a Cas9载体的结构图谱。
图4为pU6gRNA-eEF1a Cas9+nNLS载体图谱。
图5为质粒pKG-GE3的结构示意图。
图6为质粒pKG-U6gRNA的结构示意图。
图7为将20bp左右的DNA分子(用于转录形成gRNA的靶序列结合区)插入质粒pKG-U6gRNA的示意图。
图8为实施例2中步骤2.3.3的测序峰图。
图9为实施例2中步骤2.4.3的测序峰图。
图10为实施例3中步骤3.2.3以猪的基因组DNA为模板分别采用SCN1A-E22g-F-27/SCN1A-E22g-R639(组1)、SCN1A-E22g-F-27/SCN1A-E22g-R746(组2)、SCN1A-E22g-F-69/SCN1A-E22g-R639(组3)、SCN1A-E22g-F-69/SCN1A-E22g-R746(组4)组成的引物对进行PCR扩增后的电泳图。
图11为实施例3中步骤3.2.4以18只猪的基因组DNA为模板采用SCN1A-E22g-F-27/SCN1A-E22g-R746组成的引物对进行PCR扩增后的电泳图。
图12为实施例4中步骤4.3.4以基因组DNA为模板采用SCN1A-E22g-F-27/SCN1A-E22g-R746组成的引物对进行PCR扩增后的电泳图。
图13为实施例5中步骤5.4.4以基因组DNA为模板采用SCN1A-E22g-F-27/SCN1A-E22g-R746组成的引物对进行PCR扩增后的电泳图。
图14为实施例5中步骤5.4.5判定靶基因为野生型的示例性测序峰图。
图15为实施例5中步骤5.4.5判定靶基因为双等位相同变异的纯合突变型示例性测序峰图。
图16为实施例5中步骤5.4.5判定靶基因为杂合突变型的示例性测序峰图。
具体实施方式
实施例1、质粒的构建
1.1构建质粒pU6gRNA eEF1a-mNLS-hSpCas9-EGFP-PURO(简称质粒pKG-GE3)
原始质粒pX330-U6-Chimeric_BB-CBh-hSpCas9(简称质粒pX330),序列如SEQ IDNO:1所示。质粒pX330的结构示意图见图1。SEQ ID NO:1中,第440-725位核苷酸组成CMV增强子,第727-1208位核苷酸组成chickenβ-actin启动子,第1304-1324位核苷酸编码SV40核定位信号(NLS),第1325-5449位核苷酸编码Cas9蛋白,第5450-5497位核苷酸编码nucleoplasmin核定位信号(NLS)。
质粒pU6gRNA eEF1a-mNLS-hSpCas9-EGFP-PURO(图5),简称质粒pKG-GE3,核苷酸如SEQ ID NO:2所示。与质粒pX330相比,质粒pKG-GE3主要进行了如下改造:①去除残留的gRNA骨架序列(GTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTTTT),降低干扰;②将原有chickenβ-actin启动子改造为具更高表达活性的EF1a启动子,增加Cas9基因的蛋白表达能力;③在Cas9基因的上游和下游均增加核定位信号编码基因(NLS),增加Cas9蛋白的核定位能力;④原质粒无任何真核细胞筛选标记,不利于阳性转化细胞的筛选和富集,依次在Cas9基因的下游插入P2A-EGFP-T2A-PURO编码基因,赋予载体荧光和真核细胞抗性筛选能力;⑤插入WPRE元件和3’LTR序列元件,增强Cas9基因的蛋白翻译能力。
pKG-GE3质粒构建方法如下:
(1)去除gRNA骨架中多余无效的序列
质粒pX330用BbsI、XbaI酶切,回收载体片段(约8313bp左右),利用多片段重组法合成插入片段175bp(SEQ ID NO:4),与回收后载体片段重组得到pU6gRNACas9载体(图2)。
(2)改造启动子及增强子
对构建好的pU6gRNACas9载体,用XbaI和AgeI内切酶去除启动子(chickenβ-actin启动子)及增强子序列(CMV增强子),回收线性载体序列约7650bp,并利用多片段重组法合成554bp的包含CMV增强子及EF1a启动子的序列(SEQ ID NO:5),与酶切后载体pU6gRNACas9重组得到pU6gRNA-eEF1a Cas9载体(图3)。
(3)Cas9基因N端增加NLS序列
将构建好的载体pU6gRNA-eEF1a Cas9使用AgeI、BglII酶切,回收7786bp载体序列,并将增加了NLS的序列补充到酶切位点,即利用多片段重组法合成447bp的包括2个核定位信号及部分切除的Cas9编码序列(SEQ ID NO:6),重组得到pU6gRNA-eEF1a Cas9+nNLS载体(图4)。
(4)Cas9基因C端加入NLS、P2A-EGFP-T2A-PURO、WPRE-3’LTR-bGH polyA signals
以上构建好的载体命名为pU6gRNA-eEF1a Cas9+nNLS,使用FseI、SbfI酶切,回收载体序列7781bp,利用多片段重组法合成2727bp的包括NLS-P2A-EGFP-T2A-PURO-WPRE-3’LTR-bGH polyA signals的片段(SEQ ID NO:7),与载体片段重组得到载体pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO,简称pKG-GE3,质粒图谱如图5,核苷酸序列(SEQ ID NO:2)。
SEQ ID NO:2中,第395-680位核苷酸组成CMV增强子,第682-890位核苷酸组成EF1a启动子,第986-1006位核苷酸编码核定位信号(NLS),第1016-1036位核苷酸编码核定位信号(NLS),第1037-5161位核苷酸编码Cas9蛋白,第5162-5209位核苷酸编码核定位信号(NLS),第5219-5266位核苷酸编码核定位信号(NLS),第5276-5332位核苷酸编码自剪切多肽P2A(自剪切多肽P2A的氨基酸序列为“ATNFSLLKQAGDVEENPGP”,发生自剪切的断裂位置为C端开始第一个氨基酸残基和第二个氨基酸残基之间),第5333-6046位核苷酸编码EGFP蛋白,第6056-6109位核苷酸编码自裂解多肽T2A(自裂解多肽T2A的氨基酸序列为“EGRGSLLTCGDVEENPGP”,发生自裂解的断裂位置为C端开始第一个氨基酸残基和第二个氨基酸残基之间),第6110-6703位核苷酸编码Puromycin蛋白(简称Puro蛋白),第6722-7310位核苷酸组成WPRE序列元件,第7382-7615位核苷酸组成3’LTR序列元件,第7647-7871位核苷酸组成bGH poly(A)signal序列元件。SEQ ID NO:2中,第911-6706形成融合基因,表达融合蛋白。由于自剪切多肽P2A和自裂解多肽T2A的存在,融合蛋白自发形成如下三个蛋白:具有Cas9蛋白的蛋白、具有EGFP蛋白的蛋白和具有Puro蛋白的蛋白。
1.2构建pKG-U6gRNA载体
来源pUC57载体,通过EcoRV酶切位点,连接pKG-U6gRNA插入序列(含U6启动子、BbsI酶切位点和sgRNA骨架序列的DNA片段,序列如SEQ ID NO:8所示),反向插入到pUC57载体,得到pKG-U6gRNA载体全序列(SEQ ID NO:3),SEQ ID NO:3中,第2280-2539位核苷酸组成hU6启动子,第2558-2637位核苷酸用于转录形成gRNA骨架。使用时,将20bp左右的DNA分子(用于转录形成gRNA的靶序列结合区)(图7)插入质粒pKG-U6gRNA(图6),形成重组质粒,在细胞中重组质粒转录得到gRNA。
实施例2质粒配比优化以及质粒pX330和质粒pKG-GE3的效果比较
2.1靶点gRNA设计及构建
2.1.1使用Benchling对RAG1基因进行靶点gRNA设计
RAG1-g4:AGTTATGGCAGAACTCAGTG(SEQ ID NO.9)
合成针对上述RAG1基因靶点的插入序列互补DNA Oligo如下:
RAG1-gRNA4S:caccgAGTTATGGCAGAACTCAGTG(SEQ ID NO.10)
RAG1-gRNA4A:aaacCACTGAGTTCTGCCATAACTc(SEQ ID NO.11)
RAG1-gRNA4S、RAG1-gRNA4A均为单链DNA分子。
2.1.2设计用于扩增和检测包含RAG1 gRNA靶点片段的引物
RAG1-nF126:CCCCATCCAAAGTTTTTAAAGGA(SEQ ID NO.12)
RAG1-nR525:TGTGGCAGATGTCACAGTTTAGG(SEQ ID NO.13)
2.1.3将gRNA序列克隆到pKG-U6gRNA骨架载体上的方法
1)用限制性内切酶BbsI消化1ug pKG-U6gRNA质粒;
2)酶切的pKG-U6gRNA质粒跑琼脂糖凝胶(琼脂糖凝胶浓度1%,即1g琼脂糖凝胶加入到100mL电泳缓冲液中)分离,用胶回收试剂盒(Vazyme)纯化回收酶切产物;
3)将2.1.1所述靶点合成的2条互补DNA Oligo,通过以下退火程序形成与pKG-U6gRNA载体BbsI酶切后粘性末端互补的DNA双链,示意如图7:
95℃,5min然后以5℃/min的速率降至25℃;
4)按照以下体系启动连接反应:室温反应10min
Figure BDA0002858168750000091
5)转化
按照感受态细胞(Vazyme)说明书进行操作。
2.1.4 gRNA载体构建
1)将合成的RAG1-gRNA4S和RAG1-gRNA4A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(RAG1-gRNA4)。质粒pKG-U6gRNA(RAG1-gRNA4)将会表达SEQ ID NO.14所示的RAG1-gRNA4。
2.1.5 gRNA载体鉴定
从LB平板上挑取单克隆置入加有相应抗生素的LB培养液内,37℃恒温摇床内培养12-16h后小提质粒后送通用公司测序,经序列比对确认RAG1-gRNA4载体构建成功。
2.2猪原代成纤维细胞制备
2.2.1取初生从江香猪耳组织0.5g,去除外部组织,75﹪酒精浸泡30-40s;
2.2.2用含5%P/S(Gibco Penicillin-Streptomycin)的PBS洗涤5次,不含P/S的PBS洗一次;
其中5%P/S的PBS配方为:5%P/S(Gibco Penicillin-Streptomycin)+95%PBS,5%、95%为体积百分比。
2.2.3用剪刀将组织剪碎,加入5mL 0.1%的胶原酶(Sigma)溶液,37℃摇床消化1h;
2.2.4 500g离心5min,去上清,将沉淀用1mL完全培养基重悬,铺入含10mL完全培养基并已用0.2%明胶(VWR)封盘的10cm细胞培养皿中。
其中,细胞完全培养基的配方为:15%胎牛血清(Gibco)+83%DMEM培养基
(Gibco)+1%P/S(Gibco Penicillin-Streptomycin)+1%HEPES(Solarbio),15%、83%、1%、1%为体积百分比。
2.2.5置于37℃,5%CO2(体积百分比)、5%O2(体积百分比)的恒温培养箱中进行培养;
2.2.6将细胞培养至长满皿底60%左右时使用0.25%(Gibco)的胰蛋白酶将细胞消化下来,然后加入完全培养基终止消化,将细胞悬液转入15mL离心管中,400g离心4min,弃去上清,使用1mL完全培养基重悬细胞以备下一步核转染实验。
2.3质粒配比优化
2.3.1共转染分组情况
第一组:将质粒pKG-U6gRNA(RAG1-gRNA4)和质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.44μg质粒pKG-U6gRNA(RAG1-gRNA4):1.56μg质粒pKG-GE3。即质粒pKG-U6gRNA(RAG1-gRNA4)和质粒pKG-GE3的摩尔配比为1:1。
第二组:将质粒pKG-U6gRNA(RAG1-gRNA4)和质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.72μg质粒pKG-U6gRNA(RAG1-gRNA4):1.28μg质粒pKG-GE3。即质粒pKG-U6gRNA(RAG1-gRNA4)和质粒pKG-GE3的摩尔配比为2:1。
第三组:将质粒pKG-U6gRNA(RAG1-gRNA4)和质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(RAG1-gRNA4):1.08μg质粒pKG-GE3。即质粒pKG-U6gRNA(RAG1-gRNA4)和质粒pKG-GE3的摩尔配比为3:1。
第四组:将质粒pKG-U6gRNA(RAG1-gRNA4)转染致猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:1μg质粒pKG-U6gRNA(RAG1-gRNA4)。
2.3.2共转染操作方法
使用哺乳动物成纤维细胞核转染试剂盒(Neon)与Neon TM transfection system电转仪进行转染实验。
1)按照上述分组配制电转反应液,混匀过程中注意切勿产生气泡;
2)将步骤一制备得到的细胞悬液使用PBS磷酸缓冲液(Solarbio)洗一遍,600g离心6min,弃去上清,使用11μL电转基本溶液Opti-MEM重悬细胞,重悬过程中要避免气泡的产生;
3)吸取10μL细胞悬液,加入步骤1)中的电转反应液中混匀,混匀过程中注意切勿产生气泡;
4)将试剂盒带有的电转杯放置于Neon TM transfection system电转仪杯槽内,加入3mL E Buffer;
5)用电转枪吸取10μL步骤3)得到的混合液,插入点击杯内,选择电转程序(1450V10ms3pulse),电击转染后立即在超净台内将电转枪中混合液转入到6孔板中,每孔含3mL15%胎牛血清(Gibco)+83%DMEM培养基(Gibco)+1%P/S(Gibco Penicillin-Streptomycin)+1%HEPES(Solarbio)的完全培养液;
6)混匀后放置于37℃,5%CO2、5%O2的恒温培养箱中进行培养;
7)电转后6-12h换液,36-48h用0.25%(Gibco)的胰蛋白酶消化并收集细胞于1.5mL离心管中。
2.3.3基因编辑效率分析
提取2.3.2中收集的细胞基因组DNA,采用RAG1-nF126和RAG1-nR525组成的引物对进行PCR扩增,将产物进行测序。测序结果利用网页版Synthego ICE工具分析测序峰图得出第一组、第二组、第三组的编辑效率依次为9%、53%、66%,测序结果示例性峰图见图8。分析认定第三组的基因编辑效率最高,即确定gRNA质粒与Cas9质粒最适用量为摩尔比3:1,质粒实际用量为0.92μg:1.08μg。
2.4质粒pX330和质粒pKG-GE3的效果比较
2.4.1共转染分组情况
RAG1-330组:将质粒pKG-U6gRNA(RAG1-gRNA4)和质粒pX330共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(RAG1-gRNA4):1.08μg质粒pX330,其中pKG-U6gRNA(RAG1-gRNA4)与pX330摩尔比为3:1。
RAG1-KG组:将质粒pKG-U6gRNA(RAG1-gRNA4)和质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(RAG1-gRNA4):1.08μg质粒pKG-GE3,其中pKG-U6gRNA(RAG1-gRNA4)与pKG-GE3摩尔比为3:1。
RAG1-B组:将质粒pKG-U6gRNA(RAG1-gRNA4)转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(RAG1-gRNA4)。
2.4.2共转染操作方法
同本实施例中2.3.2。
2.4.3基因编辑效率分析
提取2.4.2中收集的细胞基因组DNA,采用RAG1-nF126和RAG1-nR525组成的引物对进行PCR扩增,将产物进行测序。测序结果利用网页版Synthego ICE工具分析测序峰图得出RAG1-330组、RAG1-KG组的编辑效率分别为28%、68%,测序结果示例性峰图见图9,结果表明,与采用质粒pX330相比,采用质粒pKG-GE3使得基因编辑效率显著提高。
实施例3 SCN1A基因靶点gRNA的设计及构建
3.1基因组DNA的提取
使用Vazyme公司FastPure Cell/Tissue DNA Isolation Mini Kit(VazymeCat.DC102-01)分别进行18只猪(雄性A、B、C、D、E、F、G、H雌性1、2、3、4、5、6、7、8、9、10)耳组织的基因组DNA的柱式提取,使用NanoDrop进行定量,-20℃保存备用。
3.2 SCN1A基因敲除预设靶点及邻近基因组序列保守性分析
3.2.1猪SCN1A基因信息
电压门控钠离子通道α亚基1(sodium voltage-gated channel alpha subunit1);位于15号染色体;GeneID为106506270,Sus scrofa。猪SCN1A基因编码的氨基酸序列如SEQ ID NO.15所示。已有研究结果表明70%的Dravet综合征患者中发现SCN1A基因突变,在猪基因组DNA中,SCN1A基因具有29个外显子,其中第22外显子序列(含部分第21内含子和部分第22内含子序列)如SEQ ID NO.16所示。
3.2.2 SCN1A基因敲除预设靶点外显子及邻近基因组序列PCR扩增引物设计
根据查到的猪SCN1A基因组序列
(https://www.ncbi.nlm.nih.gov/nuccore/NC_010457.5?report=genbank& from=72529866&to=72696715&strand=true),设计引物扩增前述18只猪基因组样品SCN1A基因外显子22的位点。
使用Oligo7软件进行引物设计,设计结果如下:
SCN1A-E22g-F-27:GAAAGACTGAAGACTACTAGGGG(SEQ ID NO.17)
SCN1A-E22g-R639:TTATTAATTTATTAATCTACTTG(SEQ ID NO.18)
SCN1A-E22g-F-69:AAATATCCTAGAATAGTCTTGAT(SEQ ID NO.19)
SCN1A-E22g-R746:GTTATAATCCAACCAGGTGTGCT(SEQ ID NO.20)
3.2.3 SCN1A基因组PCR扩增引物筛选
使用猪(雌性1#)耳朵组织提取的基因组为模板,使用设计的两条上游引物和两条下游引物组合,Max酶(Vazyme公司货号:P505)进行PCR,产物进行1%琼脂糖凝胶电泳以筛选好的扩增引物,结果如图10,组1:SCN1A-E22g-F-27/SCN1A-E22g-R639;组2:SCN1A-E22g-F-27/SCN1A-E22g-R746;组3:SCN1A-E22g-F-69/SCN1A-E22g-R639;组4:SCN1A-E22g-F-69/SCN1A-E22g-R746,优选SCN1A-E22g-F-27/SCN1A-E22g-R746引物对进行目的片段扩增。
3.2.4 18只猪SCN1A基因片段PCR扩增
分别以18个基因组模板(雄性A、B、C、D、E、F、G、H雌性1、2、3、4、5、6、7、8、9、10),引物SCN1A-E22g-F-27/SCN1A-E22g-R746,Max酶进行SCN1A基因组片段的扩增,产物(773bp)进行1%琼脂糖凝胶电泳,结果如图11。
3.2.5 SCN1A基因序列保守性分析
以上PCR扩增产物,使用扩增引物进行测序(通用生物公司测序),将测序结果与公共数据库中的SCN1A基因序列进行比对分析。根据比对结果,扩增片段序列相对保守,所设计的引物本身无可能的突变位点。
3.3靶点gRNA设计及构建
3.3.1使用Benchling进行靶点gRNA设计
设计靶点已避开可能的突变位点,使用Benchling进行靶点gRNA设计:
https://benchling.com/
SCN1A基因敲除靶点设计如下:
SCN1A-E22-g1:CAGTTGTGGTGTTAACACAG(SEQ ID NO.21)
SCN1A-E22-g2:AAAGAAATGAGACTGCCCGA(SEQ ID NO.22)
SCN1A-E22-g3:AAAGTAAACTTTGATAATGT(SEQ ID NO.23)
SCN1A-E22-g4:GTCAAACATGTCACCAGTTG(SEQ ID NO.24)
SCN1A-E22-g5:CATTTCTTTCTATTAGTATT(SEQ ID NO.25)
合成的SCN1A基因共4个靶点的插入序列互补DNA Oligo如下:
SCN1A-E22-g1S:caccgCAGTTGTGGTGTTAACACAG(SEQ ID NO.26)
SCN1A-E22-g1A:aaacCTGTGTTAACACCACAACTGc(SEQ ID NO.27)
SCN1A-E22-g2S:caccgAAAGAAATGAGACTGCCCGA(SEQ ID NO.28)
SCN1A-E22-g2A:aaacTCGGGCAGTCTCATTTCTTTc(SEQ ID NO.29)
SCN1A-E22-g3S:caccgAAAGTAAACTTTGATAATGT(SEQ ID NO.30)
SCN1A-E22-g3A:aaacACATTATCAAAGTTTACTTTc(SEQ ID NO.31)
SCN1A-E22-g4S:caccGTCAAACATGTCACCAGTTG(SEQ ID NO.32)
SCN1A-E22-g4A:aaacCAACTGGTGACATGTTTGAC(SEQ ID NO.33)
SCN1A-E22-g5S:caccgCATTTCTTTCTATTAGTATT(SEQ ID NO.34)
SCN1A-E22-g5A:aaacAATACTAATAGAAAGAAATGc(SEQ ID NO.35)
SCN1A-E22-g1S、SCN1A-E22-g1A、SCN1A-E22-g2S、SCN1A-E22-g2A、SCN1A-E22-g3S、SCN1A-E22-g3A、SCN1A-E22-g4S、SCN1A-E22-g4A、SCN1A-E22-g5S、SCN1A-E22-g5A均为单链DNA分子。
3.3.2将gRNA序列克隆到pKG-U6gRNA骨架载体上的方法
同实施例2中2.1.3。
3.3.3 gRNA载体构建
1)将合成的SCN1A-E22-g1S和SCN1A-E22-g1A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(SCN1A-E22-g1)。质粒pKG-U6gRNA(SCN1A-E22-g1)将会表达SEQ ID NO.36所示的SCN1A-E22-gRNA1。
SEQ ID NO.36:
CAGUUGUGGUGUUAACACAGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
2)将合成的SCN1A-E22-g2S和SCN1A-E22-g2A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(SCN1A-E22-g2)。质粒pKG-U6gRNA(SCN1A-E22-g2)将会表达SEQ ID NO.37所示的SCN1A-E22-gRNA2。
SEQ ID NO.37:
AAAGAAAUGAGACUGCCCGAguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
3)将合成的SCN1A-E22-g3S和SCN1A-E22-g3A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(SCN1A-E22-g3)。质粒pKG-U6gRNA(SCN1A-E22-g3)将会表达SEQ ID NO.38所示的SCN1A-E22-gRNA3。
SEQ ID NO.38:
AAAGUAAACUUUGAUAAUGUguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
4)将合成的SCN1A-E22-g4S和SCN1A-E22-g4A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(SCN1A-E22-g4)。质粒pKG-U6gRNA(SCN1A-E22-g4)将会表达SEQ ID NO.39所示的SCN1A-E22-gRNA4。
SEQ ID NO.39:
GUCAAACAUGUCACCAGUUGguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
5)将合成的SCN1A-E22-g5S和SCN1A-E22-g5A混合并进行退火,得到具有粘性末端的双链DNA分子。将具有粘性末端的双链DNA分子和载体骨架连接,得到质粒pKG-U6gRNA(SCN1A-E22-g5)。质粒pKG-U6gRNA(SCN1A-E22-g5)将会表达SEQ ID NO.40所示的SCN1A-E22-gRNA5。
SEQ ID NO.40:
CAUUUCUUUCUAUUAGUAUUguuuuagagcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaaguggcaccgagucggugcuuuu
3.3.3 gRNA载体鉴定
从LB平板上挑取单克隆置入加有相应抗生素的LB培养液内,37℃恒温摇床内培养12-16h后小提质粒后送通用公司测序,经序列比对确认pKG-U6gRNA(SCN1A-E22-g1)、pKG-U6gRNA(SCN1A-E22-g2)、pKG-U6gRNA(SCN1A-E22-g3)、pKG-U6gRNA(SCN1A-E22-g4)、pKG-U6gRNA(SCN1A-E22-g5)载体均构建成功。
实施例4 SCN1A基因不同靶点gRNA的编辑效率比较
4.1猪原代成纤维细胞制备
同实施例2中2.2。
4.2使用构建好的gRNA质粒、Cas9质粒(pKG-GE3)共转染猪原代成纤维细胞
4.2.1共转染分组情况
第一组:将质粒pKG-U6gRNA(SCN1A-E22-g1)、质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(SCN1A-E22-g1):1.08μg质粒pKG-GE3,其中pKG-U6gRNA(SCN1A-E22-g1)与pKG-GE3摩尔比为3:1。
第二组:将质粒pKG-U6gRNA(SCN1A-E22-g2)、质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(SCN1A-E22-g2):1.08μg质粒pKG-GE3,其中pKG-U6gRNA(SCN1A-E22-g2)与pKG-GE3摩尔比为3:1。
第三组:将质粒pKG-U6gRNA(SCN1A-E22-g3)、质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(SCN1A-E22-g3):1.08μg质粒pKG-GE3,其中pKG-U6gRNA(SCN1A-E22-g3)与pKG-GE3摩尔比为3:1。
第四组:将质粒pKG-U6gRNA(SCN1A-E22-g4)、质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(SCN1A-E22-g4):1.08μg质粒pKG-GE3,其中pKG-U6gRNA(SCN1A-E22-g4)与pKG-GE3摩尔比为3:1。
第五组:将质粒pKG-U6gRNA(SCN1A-E22-g5)、质粒pKG-GE3共转染猪原代成纤维细胞。配比:约20万个猪原代成纤维细胞:0.92μg质粒pKG-U6gRNA(SCN1A-E22-g5):1.08μg质粒pKG-GE3,其中pKG-U6gRNA(SCN1A-E22-g5)与pKG-GE3摩尔比为3:1。
第六组:猪原代成纤维细胞,同等电转参数不加质粒进行电转染操作。
4.2.2共转染操作方法
同实施例2中2.3.2。
4.3 SCN1A基因不同靶点gRNA的编辑效率分析
4.3.1分别向步骤4.2中收集在1.5mL离心管中的5组细胞加入10μL KAPA2G裂解液裂解细胞提取细胞基因组DNA。
KAPA2G裂解液配制体系如下:
10X extract Buffer 1μL
Enzyme 0.2μL
ddH2O 8.8μL
75℃15min—95℃5min—4℃,反应结束后基因组DNA于-20℃保存;
4.3.2采用前述针对SCN1A基因E22引物SCN1A-E22g-F-27/SCN1A-E22g-R746进行检测突变,PCR目的产物长度为773bp;
4.3.3使用常规PCR反应扩增SCN1A靶点基因;
4.3.4将PCR反应产物进行1%琼脂糖凝胶电泳,如图12,将目的产物及其附近产物切胶回收后送测序公司进行测序,然后将测序结果利用网页版Synthego ICE工具分析测序峰图得出SCN1A-E22-g1、SCN1A-E22-g2、SCN1A-E22-g3、SCN1A-E22-g4、SCN1A-E22-g5不同靶点的编辑效率依次为32%、15%、0、12%、0。结果表明,SCN1A-E22-g1编辑效率最高,为最优靶点。
实施例5构建SCN1A基因敲除的从江香猪单细胞克隆株
5.1猪原代成纤维细胞制备
同实施例2中2.2。
5.2使用构建好的pKG-U6gRNA(SCN1A-E22-g1)质粒、pKG-GE3质粒共转染猪原代成纤维细胞
同实施例2中2.3.2,但不使用0.25%(Gibco)的胰蛋白酶消化并收集细胞于1.5mL离心管中。
5.3 SCN1A基因敲除单细胞克隆株的筛选
5.3.1将步骤5.2所得电转48h的群体细胞,使用胰蛋白酶进行消化,完全培养基中和,500g离心5min,去上清,将沉淀用200μL完全培养基重悬,并适当稀释,用口吸管挑取单克隆转移到100μL完全培养基的96孔板中;
5.3.2 37℃,5%CO2、5%O2的恒温培养箱中进行培养,每2~3天换一次细胞培养基,期间用显微镜观察每孔细胞生长情况,排除无细胞及非单细胞克隆的孔;
5.3.3待96孔板的孔中细胞长满孔底,使用胰蛋白酶消化并收集细胞,其中2/3细胞接种到含有完全培养基的6孔板中,剩余的1/3的细胞收集在1.5mL离心管中;
5.3.4待6孔板长至80%汇合度时使用0.25%(Gibco)的胰蛋白酶消化并收集细胞,使用细胞冻存液(90%完全培养基+10%DMSO,体积比)将细胞冻存。
5.4 SCN1A基因敲除的重组细胞鉴定
5.4.1将步骤5.3收集在1.5mL离心管中得到的细胞,然后在细胞中加入10μLKAPA2G裂解液裂解细胞提取细胞基因组DNA。
KAPA2G裂解液配制体系如下:
10X extract Buffer 1μL
Enzyme 0.2μL
ddH2O 8.8μL
75℃15min—95℃5min—4℃,反应结束后基因组DNA于-20℃保存;
5.4.2采用前述针对SCN1A基因E22引物SCN1A-E22g-F-27/SCN1A-E22g-R746进行检测突变,PCR目的产物长度为773bp;
5.4.3使用PCR常规反应扩增SCN1A靶点基因;
5.4.4将PCR反应产物进行电泳,电泳结果如图13,泳道编号与单细胞克隆编号一致。回收PCR扩增产物并测序。
5.4.5将测序结果与SCN1A靶点信息进行比对,从而判断该重组细胞是否为SCN1A基因敲除。
编号为2、5、10、15、18的单细胞克隆的基因型为双等位相同变异的纯合突变型。编号为1、4、9、11的单细胞克隆的基因型为杂合突变型。编号为3、6、7、8、12、13、14、16、17、19、20的单细胞克隆的基因型为纯合野生型。得到SCN1A基因编辑单细胞克隆的比率为45%。
示例性的测序比对结果如图14至图16,其中图14是克隆号为SCN1A-2的反向测序与已公布序列的比对结果,判定为野生型;图15是克隆号为SCN1A-5的反向测序与已公布序列的比对结果,判定为双等位相同变异的纯合突变型;图16是克隆号为SCN1A-4的反向测序与已公布序列的比对结果,判定为杂合突变型。
通过具体序列的分析,SCN1A各单细胞克隆基因型如表1:
表1 SCN1A基因敲除从江香猪单细胞克隆基因型鉴定
Figure BDA0002858168750000181
Figure BDA0002858168750000191
以上对本发明进行了详述。对于本领域技术人员来说,在不脱离本发明的宗旨和范围,以及无需进行不必要的实验情况下,可在等同参数、浓度和条件下,在较宽范围内实施本发明。虽然本发明给出了特殊的实施例,应该理解为,可以对本发明作进一步的改进。总之,按本发明的原理,本申请欲包括任何变更、用途或对本发明的改进,包括脱离了本申请中已公开范围,而用本领域已知的常规技术进行的改变。按以下附带的权利要求的范围,可以进行一些基本特征的应用。
序列表
<110> 南京启真基因工程有限公司
<120> CRISPR系统及其在构建SCN1A基因突变的癫痫性脑病克隆猪核供体细胞中的应用
<160> 40
<170> SIPOSequenceListing 1.0
<210> 1
<211> 8484
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 1
gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc tgttagagag 60
ataattggaa ttaatttgac tgtaaacaca aagatattag tacaaaatac gtgacgtaga 120
aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat ggactatcat 180
atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt gtggaaagga 240
cgaaacaccg ggtcttcgag aagacctgtt ttagagctag aaatagcaag ttaaaataag 300
gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttg ttttagagct 360
agaaatagca agttaaaata aggctagtcc gtttttagcg cgtgcgccaa ttctgcagac 420
aaatggctct agaggtaccc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 480
ccaacgaccc ccgcccattg acgtcaatag taacgccaat agggactttc cattgacgtc 540
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 600
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tgtgcccagt 660
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 720
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 780
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 840
ggggggggcg gggcgagggg cggggcgggg cgaggcggag aggtgcggcg gcagccaatc 900
agagcggcgc gctccgaaag tttcctttta tggcgaggcg gcggcggcgg cggccctata 960
aaaagcgaag cgcgcggcgg gcgggagtcg ctgcgcgctg ccttcgcccc gtgccccgct 1020
ccgccgccgc ctcgcgccgc ccgccccggc tctgactgac cgcgttactc ccacaggtga 1080
gcgggcggga cggcccttct cctccgggct gtaattagct gagcaagagg taagggttta 1140
agggatggtt ggttggtggg gtattaatgt ttaattacct ggagcacctg cctgaaatca 1200
ctttttttca ggttggaccg gtgccaccat ggactataag gaccacgacg gagactacaa 1260
ggatcatgat attgattaca aagacgatga cgataagatg gccccaaaga agaagcggaa 1320
ggtcggtatc cacggagtcc cagcagccga caagaagtac agcatcggcc tggacatcgg 1380
caccaactct gtgggctggg ccgtgatcac cgacgagtac aaggtgccca gcaagaaatt 1440
caaggtgctg ggcaacaccg accggcacag catcaagaag aacctgatcg gagccctgct 1500
gttcgacagc ggcgaaacag ccgaggccac ccggctgaag agaaccgcca gaagaagata 1560
caccagacgg aagaaccgga tctgctatct gcaagagatc ttcagcaacg agatggccaa 1620
ggtggacgac agcttcttcc acagactgga agagtccttc ctggtggaag aggataagaa 1680
gcacgagcgg caccccatct tcggcaacat cgtggacgag gtggcctacc acgagaagta 1740
ccccaccatc taccacctga gaaagaaact ggtggacagc accgacaagg ccgacctgcg 1800
gctgatctat ctggccctgg cccacatgat caagttccgg ggccacttcc tgatcgaggg 1860
cgacctgaac cccgacaaca gcgacgtgga caagctgttc atccagctgg tgcagaccta 1920
caaccagctg ttcgaggaaa accccatcaa cgccagcggc gtggacgcca aggccatcct 1980
gtctgccaga ctgagcaaga gcagacggct ggaaaatctg atcgcccagc tgcccggcga 2040
gaagaagaat ggcctgttcg gaaacctgat tgccctgagc ctgggcctga cccccaactt 2100
caagagcaac ttcgacctgg ccgaggatgc caaactgcag ctgagcaagg acacctacga 2160
cgacgacctg gacaacctgc tggcccagat cggcgaccag tacgccgacc tgtttctggc 2220
cgccaagaac ctgtccgacg ccatcctgct gagcgacatc ctgagagtga acaccgagat 2280
caccaaggcc cccctgagcg cctctatgat caagagatac gacgagcacc accaggacct 2340
gaccctgctg aaagctctcg tgcggcagca gctgcctgag aagtacaaag agattttctt 2400
cgaccagagc aagaacggct acgccggcta cattgacggc ggagccagcc aggaagagtt 2460
ctacaagttc atcaagccca tcctggaaaa gatggacggc accgaggaac tgctcgtgaa 2520
gctgaacaga gaggacctgc tgcggaagca gcggaccttc gacaacggca gcatccccca 2580
ccagatccac ctgggagagc tgcacgccat tctgcggcgg caggaagatt tttacccatt 2640
cctgaaggac aaccgggaaa agatcgagaa gatcctgacc ttccgcatcc cctactacgt 2700
gggccctctg gccaggggaa acagcagatt cgcctggatg accagaaaga gcgaggaaac 2760
catcaccccc tggaacttcg aggaagtggt ggacaagggc gcttccgccc agagcttcat 2820
cgagcggatg accaacttcg ataagaacct gcccaacgag aaggtgctgc ccaagcacag 2880
cctgctgtac gagtacttca ccgtgtataa cgagctgacc aaagtgaaat acgtgaccga 2940
gggaatgaga aagcccgcct tcctgagcgg cgagcagaaa aaggccatcg tggacctgct 3000
gttcaagacc aaccggaaag tgaccgtgaa gcagctgaaa gaggactact tcaagaaaat 3060
cgagtgcttc gactccgtgg aaatctccgg cgtggaagat cggttcaacg cctccctggg 3120
cacataccac gatctgctga aaattatcaa ggacaaggac ttcctggaca atgaggaaaa 3180
cgaggacatt ctggaagata tcgtgctgac cctgacactg tttgaggaca gagagatgat 3240
cgaggaacgg ctgaaaacct atgcccacct gttcgacgac aaagtgatga agcagctgaa 3300
gcggcggaga tacaccggct ggggcaggct gagccggaag ctgatcaacg gcatccggga 3360
caagcagtcc ggcaagacaa tcctggattt cctgaagtcc gacggcttcg ccaacagaaa 3420
cttcatgcag ctgatccacg acgacagcct gacctttaaa gaggacatcc agaaagccca 3480
ggtgtccggc cagggcgata gcctgcacga gcacattgcc aatctggccg gcagccccgc 3540
cattaagaag ggcatcctgc agacagtgaa ggtggtggac gagctcgtga aagtgatggg 3600
ccggcacaag cccgagaaca tcgtgatcga aatggccaga gagaaccaga ccacccagaa 3660
gggacagaag aacagccgcg agagaatgaa gcggatcgaa gagggcatca aagagctggg 3720
cagccagatc ctgaaagaac accccgtgga aaacacccag ctgcagaacg agaagctgta 3780
cctgtactac ctgcagaatg ggcgggatat gtacgtggac caggaactgg acatcaaccg 3840
gctgtccgac tacgatgtgg accatatcgt gcctcagagc tttctgaagg acgactccat 3900
cgacaacaag gtgctgacca gaagcgacaa gaaccggggc aagagcgaca acgtgccctc 3960
cgaagaggtc gtgaagaaga tgaagaacta ctggcggcag ctgctgaacg ccaagctgat 4020
tacccagaga aagttcgaca atctgaccaa ggccgagaga ggcggcctga gcgaactgga 4080
taaggccggc ttcatcaaga gacagctggt ggaaacccgg cagatcacaa agcacgtggc 4140
acagatcctg gactcccgga tgaacactaa gtacgacgag aatgacaagc tgatccggga 4200
agtgaaagtg atcaccctga agtccaagct ggtgtccgat ttccggaagg atttccagtt 4260
ttacaaagtg cgcgagatca acaactacca ccacgcccac gacgcctacc tgaacgccgt 4320
cgtgggaacc gccctgatca aaaagtaccc taagctggaa agcgagttcg tgtacggcga 4380
ctacaaggtg tacgacgtgc ggaagatgat cgccaagagc gagcaggaaa tcggcaaggc 4440
taccgccaag tacttcttct acagcaacat catgaacttt ttcaagaccg agattaccct 4500
ggccaacggc gagatccgga agcggcctct gatcgagaca aacggcgaaa ccggggagat 4560
cgtgtgggat aagggccggg attttgccac cgtgcggaaa gtgctgagca tgccccaagt 4620
gaatatcgtg aaaaagaccg aggtgcagac aggcggcttc agcaaagagt ctatcctgcc 4680
caagaggaac agcgataagc tgatcgccag aaagaaggac tgggacccta agaagtacgg 4740
cggcttcgac agccccaccg tggcctattc tgtgctggtg gtggccaaag tggaaaaggg 4800
caagtccaag aaactgaaga gtgtgaaaga gctgctgggg atcaccatca tggaaagaag 4860
cagcttcgag aagaatccca tcgactttct ggaagccaag ggctacaaag aagtgaaaaa 4920
ggacctgatc atcaagctgc ctaagtactc cctgttcgag ctggaaaacg gccggaagag 4980
aatgctggcc tctgccggcg aactgcagaa gggaaacgaa ctggccctgc cctccaaata 5040
tgtgaacttc ctgtacctgg ccagccacta tgagaagctg aagggctccc ccgaggataa 5100
tgagcagaaa cagctgtttg tggaacagca caagcactac ctggacgaga tcatcgagca 5160
gatcagcgag ttctccaaga gagtgatcct ggccgacgct aatctggaca aagtgctgtc 5220
cgcctacaac aagcaccggg ataagcccat cagagagcag gccgagaata tcatccacct 5280
gtttaccctg accaatctgg gagcccctgc cgccttcaag tactttgaca ccaccatcga 5340
ccggaagagg tacaccagca ccaaagaggt gctggacgcc accctgatcc accagagcat 5400
caccggcctg tacgagacac ggatcgacct gtctcagctg ggaggcgaca aaaggccggc 5460
ggccacgaaa aaggccggcc aggcaaaaaa gaaaaagtaa gaattcctag agctcgctga 5520
tcagcctcga ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct 5580
tccttgaccc tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca 5640
tcgcattgtc tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag 5700
ggggaggatt gggaagagaa tagcaggcat gctggggagc ggccgcagga acccctagtg 5760
atggagttgg ccactccctc tctgcgcgct cgctcgctca ctgaggccgg gcgaccaaag 5820
gtcgcccgac gcccgggctt tgcccgggcg gcctcagtga gcgagcgagc gcgcagctgc 5880
ctgcaggggc gcctgatgcg gtattttctc cttacgcatc tgtgcggtat ttcacaccgc 5940
atacgtcaaa gcaaccatag tacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg 6000
tggttacgcg cagcgtgacc gctacacttg ccagcgcctt agcgcccgct cctttcgctt 6060
tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc 6120
tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgatttgg 6180
gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg 6240
agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc aactctatct 6300
cgggctattc ttttgattta taagggattt tgccgatttc ggtctattgg ttaaaaaatg 6360
agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaattttat 6420
ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc 6480
caacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag 6540
ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg 6600
cgagacgaaa gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg 6660
tttcttagac gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat 6720
ttttctaaat acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc 6780
aataatattg aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct 6840
tttttgcggc attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag 6900
atgctgaaga tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta 6960
agatccttga gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc 7020
tgctatgtgg cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca 7080
tacactattc tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg 7140
atggcatgac agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg 7200
ccaacttact tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca 7260
tgggggatca tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa 7320
acgacgagcg tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa 7380
ctggcgaact acttactcta gcttcccggc aacaattaat agactggatg gaggcggata 7440
aagttgcagg accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat 7500
ctggagccgg tgagcgtgga agccgcggta tcattgcagc actggggcca gatggtaagc 7560
cctcccgtat cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata 7620
gacagatcgc tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt 7680
actcatatat actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga 7740
agatcctttt tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag 7800
cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa 7860
tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag 7920
agctaccaac tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg 7980
ttcttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat 8040
acctcgctct gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta 8100
ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg 8160
gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc 8220
gtgagctatg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa 8280
gcggcagggt cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc 8340
tttatagtcc tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt 8400
caggggggcg gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct 8460
tttgctggcc ttttgctcac atgt 8484
<210> 2
<211> 10476
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 2
gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc tgttagagag 60
ataattggaa ttaatttgac tgtaaacaca aagatattag tacaaaatac gtgacgtaga 120
aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat ggactatcat 180
atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt gtggaaagga 240
cgaaacaccg ggtcttcgag aagacctgtt ttagagctag aaatagcaag ttaaaataag 300
gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttc tagcgcgtgc 360
gccaattctg cagacaaatg gctctagagg tacccgttac ataacttacg gtaaatggcc 420
cgcctggctg accgcccaac gacccccgcc cattgacgtc aatagtaacg ccaataggga 480
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 540
aagtgtatca tatgccaagt acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct 600
ggcattgtgc ccagtacatg accttatggg actttcctac ttggcagtac atctacgtat 660
tagtcatcgc tattaccatg ggggcagagc gcacatcgcc cacagtcccc gagaagttgg 720
ggggaggggt cggcaattga tccggtgcct agagaaggtg gcgcggggta aactgggaaa 780
gtgatgtcgt gtactggctc cgcctttttc ccgagggtgg gggagaaccg tatataagtg 840
cagtagtcgc cgtgaacgtt ctttttcgca acgggtttgc cgccagaaca caggttggac 900
cggtgccacc atggactata aggaccacga cggagactac aaggatcatg atattgatta 960
caaagacgat gacgataaga tggcccccaa aaagaaacga aaggtgggtg ggtccccaaa 1020
gaagaagcgg aaggtcggta tccacggagt cccagcagcc gacaagaagt acagcatcgg 1080
cctggacatc ggcaccaact ctgtgggctg ggccgtgatc accgacgagt acaaggtgcc 1140
cagcaagaaa ttcaaggtgc tgggcaacac cgaccggcac agcatcaaga agaacctgat 1200
cggagccctg ctgttcgaca gcggcgaaac agccgaggcc acccggctga agagaaccgc 1260
cagaagaaga tacaccagac ggaagaaccg gatctgctat ctgcaagaga tcttcagcaa 1320
cgagatggcc aaggtggacg acagcttctt ccacagactg gaagagtcct tcctggtgga 1380
agaggataag aagcacgagc ggcaccccat cttcggcaac atcgtggacg aggtggccta 1440
ccacgagaag taccccacca tctaccacct gagaaagaaa ctggtggaca gcaccgacaa 1500
ggccgacctg cggctgatct atctggccct ggcccacatg atcaagttcc ggggccactt 1560
cctgatcgag ggcgacctga accccgacaa cagcgacgtg gacaagctgt tcatccagct 1620
ggtgcagacc tacaaccagc tgttcgagga aaaccccatc aacgccagcg gcgtggacgc 1680
caaggccatc ctgtctgcca gactgagcaa gagcagacgg ctggaaaatc tgatcgccca 1740
gctgcccggc gagaagaaga atggcctgtt cggaaacctg attgccctga gcctgggcct 1800
gacccccaac ttcaagagca acttcgacct ggccgaggat gccaaactgc agctgagcaa 1860
ggacacctac gacgacgacc tggacaacct gctggcccag atcggcgacc agtacgccga 1920
cctgtttctg gccgccaaga acctgtccga cgccatcctg ctgagcgaca tcctgagagt 1980
gaacaccgag atcaccaagg cccccctgag cgcctctatg atcaagagat acgacgagca 2040
ccaccaggac ctgaccctgc tgaaagctct cgtgcggcag cagctgcctg agaagtacaa 2100
agagattttc ttcgaccaga gcaagaacgg ctacgccggc tacattgacg gcggagccag 2160
ccaggaagag ttctacaagt tcatcaagcc catcctggaa aagatggacg gcaccgagga 2220
actgctcgtg aagctgaaca gagaggacct gctgcggaag cagcggacct tcgacaacgg 2280
cagcatcccc caccagatcc acctgggaga gctgcacgcc attctgcggc ggcaggaaga 2340
tttttaccca ttcctgaagg acaaccggga aaagatcgag aagatcctga ccttccgcat 2400
cccctactac gtgggccctc tggccagggg aaacagcaga ttcgcctgga tgaccagaaa 2460
gagcgaggaa accatcaccc cctggaactt cgaggaagtg gtggacaagg gcgcttccgc 2520
ccagagcttc atcgagcgga tgaccaactt cgataagaac ctgcccaacg agaaggtgct 2580
gcccaagcac agcctgctgt acgagtactt caccgtgtat aacgagctga ccaaagtgaa 2640
atacgtgacc gagggaatga gaaagcccgc cttcctgagc ggcgagcaga aaaaggccat 2700
cgtggacctg ctgttcaaga ccaaccggaa agtgaccgtg aagcagctga aagaggacta 2760
cttcaagaaa atcgagtgct tcgactccgt ggaaatctcc ggcgtggaag atcggttcaa 2820
cgcctccctg ggcacatacc acgatctgct gaaaattatc aaggacaagg acttcctgga 2880
caatgaggaa aacgaggaca ttctggaaga tatcgtgctg accctgacac tgtttgagga 2940
cagagagatg atcgaggaac ggctgaaaac ctatgcccac ctgttcgacg acaaagtgat 3000
gaagcagctg aagcggcgga gatacaccgg ctggggcagg ctgagccgga agctgatcaa 3060
cggcatccgg gacaagcagt ccggcaagac aatcctggat ttcctgaagt ccgacggctt 3120
cgccaacaga aacttcatgc agctgatcca cgacgacagc ctgaccttta aagaggacat 3180
ccagaaagcc caggtgtccg gccagggcga tagcctgcac gagcacattg ccaatctggc 3240
cggcagcccc gccattaaga agggcatcct gcagacagtg aaggtggtgg acgagctcgt 3300
gaaagtgatg ggccggcaca agcccgagaa catcgtgatc gaaatggcca gagagaacca 3360
gaccacccag aagggacaga agaacagccg cgagagaatg aagcggatcg aagagggcat 3420
caaagagctg ggcagccaga tcctgaaaga acaccccgtg gaaaacaccc agctgcagaa 3480
cgagaagctg tacctgtact acctgcagaa tgggcgggat atgtacgtgg accaggaact 3540
ggacatcaac cggctgtccg actacgatgt ggaccatatc gtgcctcaga gctttctgaa 3600
ggacgactcc atcgacaaca aggtgctgac cagaagcgac aagaaccggg gcaagagcga 3660
caacgtgccc tccgaagagg tcgtgaagaa gatgaagaac tactggcggc agctgctgaa 3720
cgccaagctg attacccaga gaaagttcga caatctgacc aaggccgaga gaggcggcct 3780
gagcgaactg gataaggccg gcttcatcaa gagacagctg gtggaaaccc ggcagatcac 3840
aaagcacgtg gcacagatcc tggactcccg gatgaacact aagtacgacg agaatgacaa 3900
gctgatccgg gaagtgaaag tgatcaccct gaagtccaag ctggtgtccg atttccggaa 3960
ggatttccag ttttacaaag tgcgcgagat caacaactac caccacgccc acgacgccta 4020
cctgaacgcc gtcgtgggaa ccgccctgat caaaaagtac cctaagctgg aaagcgagtt 4080
cgtgtacggc gactacaagg tgtacgacgt gcggaagatg atcgccaaga gcgagcagga 4140
aatcggcaag gctaccgcca agtacttctt ctacagcaac atcatgaact ttttcaagac 4200
cgagattacc ctggccaacg gcgagatccg gaagcggcct ctgatcgaga caaacggcga 4260
aaccggggag atcgtgtggg ataagggccg ggattttgcc accgtgcgga aagtgctgag 4320
catgccccaa gtgaatatcg tgaaaaagac cgaggtgcag acaggcggct tcagcaaaga 4380
gtctatcctg cccaagagga acagcgataa gctgatcgcc agaaagaagg actgggaccc 4440
taagaagtac ggcggcttcg acagccccac cgtggcctat tctgtgctgg tggtggccaa 4500
agtggaaaag ggcaagtcca agaaactgaa gagtgtgaaa gagctgctgg ggatcaccat 4560
catggaaaga agcagcttcg agaagaatcc catcgacttt ctggaagcca agggctacaa 4620
agaagtgaaa aaggacctga tcatcaagct gcctaagtac tccctgttcg agctggaaaa 4680
cggccggaag agaatgctgg cctctgccgg cgaactgcag aagggaaacg aactggccct 4740
gccctccaaa tatgtgaact tcctgtacct ggccagccac tatgagaagc tgaagggctc 4800
ccccgaggat aatgagcaga aacagctgtt tgtggaacag cacaagcact acctggacga 4860
gatcatcgag cagatcagcg agttctccaa gagagtgatc ctggccgacg ctaatctgga 4920
caaagtgctg tccgcctaca acaagcaccg ggataagccc atcagagagc aggccgagaa 4980
tatcatccac ctgtttaccc tgaccaatct gggagcccct gccgccttca agtactttga 5040
caccaccatc gaccggaaga ggtacaccag caccaaagag gtgctggacg ccaccctgat 5100
ccaccagagc atcaccggcc tgtacgagac acggatcgac ctgtctcagc tgggaggcga 5160
caaaaggccg gcggccacga aaaaggccgg ccaggcaaaa aagaaaaagg gcggctccaa 5220
gcggcctgcc gcgacgaaga aagcgggaca ggccaagaaa aagaaaggat ccggcgcaac 5280
aaacttctct ctgctgaaac aagccggaga tgtcgaagag aatcctggac cggtgagcaa 5340
gggcgaggag ctgttcaccg gggtggtgcc catcctggtc gagctggacg gcgacgtaaa 5400
cggccacaag ttcagcgtgt ccggcgaggg cgagggcgat gccacctacg gcaagctgac 5460
cctgaagttc atctgcacca ccggcaagct gcccgtgccc tggcccaccc tcgtgaccac 5520
cctgacctac ggcgtgcagt gcttcagccg ctaccccgac cacatgaagc agcacgactt 5580
cttcaagtcc gccatgcccg aaggctacgt ccaggagcgc accatcttct tcaaggacga 5640
cggcaactac aagacccgcg ccgaggtgaa gttcgagggc gacaccctgg tgaaccgcat 5700
cgagctgaag ggcatcgact tcaaggagga cggcaacatc ctggggcaca agctggagta 5760
caactacaac agccacaacg tctatatcat ggccgacaag cagaagaacg gcatcaaggt 5820
gaacttcaag atccgccaca acatcgagga cggcagcgtg cagctcgccg accactacca 5880
gcagaacacc cccatcggcg acggccccgt gctgctgccc gacaaccact acctgagcac 5940
ccagtccgcc ctgagcaaag accccaacga gaagcgcgat cacatggtcc tgctggagtt 6000
cgtgaccgcc gccgggatca ctctcggcat ggacgagctg tacaagggct ccggcgaggg 6060
caggggaagt cttctaacat gcggggacgt ggaggaaaat cccggcccaa ccgagtacaa 6120
gcccacggtg cgcctcgcca cccgcgacga cgtccccagg gccgtacgca ccctcgccgc 6180
cgcgttcgcc gactaccccg ccacgcgcca caccgtcgat ccggaccgcc acatcgagcg 6240
ggtcaccgag ctgcaagaac tcttcctcac gcgcgtcggg ctcgacatcg gcaaggtgtg 6300
ggtcgcggac gacggcgccg cggtggcggt ctggaccacg ccggagagcg tcgaagcggg 6360
ggcggtgttc gccgagatcg gcccgcgcat ggccgagttg agcggttccc ggctggccgc 6420
gcagcaacag atggaaggcc tcctggcgcc gcaccggccc aaggagcccg cgtggttcct 6480
ggccaccgtc ggagtctcgc ccgaccacca gggcaagggt ctgggcagcg ccgtcgtgct 6540
ccccggagtg gaggcggccg agcgcgccgg ggtgcccgcc ttcctggaga cctccgcgcc 6600
ccgcaacctc cccttctacg agcggctcgg cttcaccgtc accgccgacg tcgaggtgcc 6660
cgaaggaccg cgcacctggt gcatgacccg caagcccggt gcctgaacgc gttaagtcga 6720
caatcaacct ctggattaca aaatttgtga aagattgact ggtattctta actatgttgc 6780
tccttttacg ctatgtggat acgctgcttt aatgcctttg tatcatgcta ttgcttcccg 6840
tatggctttc attttctcct ccttgtataa atcctggttg ctgtctcttt atgaggagtt 6900
gtggcccgtt gtcaggcaac gtggcgtggt gtgcactgtg tttgctgacg caacccccac 6960
tggttggggc attgccacca cctgtcagct cctttccggg actttcgctt tccccctccc 7020
tattgccacg gcggaactca tcgccgcctg ccttgcccgc tgctggacag gggctcggct 7080
gttgggcact gacaattccg tggtgttgtc ggggaaatca tcgtcctttc cttggctgct 7140
cgcctgtgtt gccacctgga ttctgcgcgg gacgtccttc tgctacgtcc cttcggccct 7200
caatccagcg gaccttcctt cccgcggcct gctgccggct ctgcggcctc ttccgcgtct 7260
tcgccttcgc cctcagacga gtcggatctc cctttgggcc gcctccccgc gtcgacttta 7320
agaccaatga cttacaaggc agctgtagat cttagccact ttttaaaaga aaagggggga 7380
ctggaagggc taattcactc ccaacgaaga caagatctgc tttttgcttg tactgggtct 7440
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 7500
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 7560
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagggcc 7620
cgtttaaacc cgctgatcag cctcgactgt gccttctagt tgccagccat ctgttgtttg 7680
cccctccccc gtgccttcct tgaccctgga aggtgccact cccactgtcc tttcctaata 7740
aaatgaggaa attgcatcgc attgtctgag taggtgtcat tctattctgg ggggtggggt 7800
ggggcaggac agcaaggggg aggattggga agacaatagc aggcatgctg gggatgcggt 7860
gggctctatg gcctgcaggg gcgcctgatg cggtattttc tccttacgca tctgtgcggt 7920
atttcacacc gcatacgtca aagcaaccat agtacgcgcc ctgtagcggc gcattaagcg 7980
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ttagcgcccg 8040
ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 8100
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 8160
aacttgattt gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 8220
ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 8280
tcaactctat ctcgggctat tcttttgatt tataagggat tttgccgatt tcggtctatt 8340
ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt 8400
ttacaatttt atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagc 8460
cccgacaccc gccaacaccc gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg 8520
cttacagaca agctgtgacc gtctccggga gctgcatgtg tcagaggttt tcaccgtcat 8580
caccgaaacg cgcgagacga aagggcctcg tgatacgcct atttttatag gttaatgtca 8640
tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc 8700
ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct 8760
gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg 8820
cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg 8880
tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc 8940
tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca 9000
cttttaaagt tctgctatgt ggcgcggtat tatcccgtat tgacgccggg caagagcaac 9060
tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa 9120
agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg 9180
ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt 9240
ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg 9300
aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca acaacgttgc 9360
gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga 9420
tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta 9480
ttgctgataa atctggagcc ggtgagcgtg gaagccgcgg tatcattgca gcactggggc 9540
cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg 9600
atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt 9660
cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa 9720
ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 9780
cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 9840
ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 9900
tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 9960
taccaaatac tgttcttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 10020
caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 10080
agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 10140
gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 10200
gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 10260
ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 10320
acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 10380
tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 10440
ggttcctggc cttttgctgg ccttttgctc acatgt 10476
<210> 3
<211> 3120
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 3
gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60
cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120
tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180
aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240
ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300
ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360
tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420
tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480
actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540
gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600
acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660
gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720
acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780
gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840
ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900
gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960
cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020
agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080
catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140
tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200
cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260
gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320
taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc 1380
ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440
tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500
ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560
cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620
agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680
gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740
atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800
gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860
gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920
ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980
cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040
cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100
acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160
cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220
accatgatta cgccaagctt gcatgcaggc ctctgcagtc gacgggcccg ggatccgatg 2280
ataaacatgt gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc 2340
tgttagagag ataattggaa ttaatttgac tgtaaacaca aagatattag tacaaaatac 2400
gtgacgtaga aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat 2460
ggactatcat atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt 2520
gtggaaagga cgaaacaccg ggtcttcgag aagacctgtt ttagagctag aaatagcaag 2580
ttaaaataag gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttc 2640
tagcgcgtgc gccaattctg cagacaaatg gctctagagg tacccataga tctagatgca 2700
ttcgcgaggt accgagctcg aattcactgg ccgtcgtttt acaacgtcgt gactgggaaa 2760
accctggcgt tacccaactt aatcgccttg cagcacatcc ccctttcgcc agctggcgta 2820
atagcgaaga ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat 2880
ggcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt 2940
gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa 3000
cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg 3060
tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga 3120
<210> 4
<211> 175
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 4
tgtggaaagg acgaaacacc gggtcttcga gaagacctgt tttagagcta gaaatagcaa 60
gttaaaataa ggctagtccg ttatcaactt gaaaaagtgg caccgagtcg gtgctttttt 120
ctagcgcgtg cgccaattct gcagacaaat ggctctagag gtacccgtta cataa 175
<210> 5
<211> 554
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 5
tctgcagaca aatggctcta gaggtacccg ttacataact tacggtaaat ggcccgcctg 60
gctgaccgcc caacgacccc cgcccattga cgtcaatagt aacgccaata gggactttcc 120
attgacgtca atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt 180
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 240
gtgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 300
tcgctattac catgggggca gagcgcacat cgcccacagt ccccgagaag ttggggggag 360
gggtcggcaa ttgatccggt gcctagagaa ggtggcgcgg ggtaaactgg gaaagtgatg 420
tcgtgtactg gctccgcctt tttcccgagg gtgggggaga accgtatata agtgcagtag 480
tcgccgtgaa cgttcttttt cgcaacgggt ttgccgccag aacacaggtt ggaccggtgc 540
caccatggac tata 554
<210> 6
<211> 447
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 6
ccagaacaca ggttggaccg gtgccaccat ggactataag gaccacgacg gagactacaa 60
ggatcatgat attgattaca aagacgatga cgataagatg gcccccaaaa agaaacgaaa 120
ggtgggtggg tccccaaaga agaagcggaa ggtcggtatc cacggagtcc cagcagccga 180
caagaagtac agcatcggcc tggacatcgg caccaactct gtgggctggg ccgtgatcac 240
cgacgagtac aaggtgccca gcaagaaatt caaggtgctg ggcaacaccg accggcacag 300
catcaagaag aacctgatcg gagccctgct gttcgacagc ggcgaaacag ccgaggccac 360
ccggctgaag agaaccgcca gaagaagata caccagacgg aagaaccgga tctgctatct 420
gcaagagatc ttcagcaacg agatggc 447
<210> 7
<211> 2727
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 7
cggcggccac gaaaaaggcc ggccaggcaa aaaagaaaaa gggcggctcc aagcggcctg 60
ccgcgacgaa gaaagcggga caggccaaga aaaagaaagg atccggcgca acaaacttct 120
ctctgctgaa acaagccgga gatgtcgaag agaatcctgg accggtgagc aagggcgagg 180
agctgttcac cggggtggtg cccatcctgg tcgagctgga cggcgacgta aacggccaca 240
agttcagcgt gtccggcgag ggcgagggcg atgccaccta cggcaagctg accctgaagt 300
tcatctgcac caccggcaag ctgcccgtgc cctggcccac cctcgtgacc accctgacct 360
acggcgtgca gtgcttcagc cgctaccccg accacatgaa gcagcacgac ttcttcaagt 420
ccgccatgcc cgaaggctac gtccaggagc gcaccatctt cttcaaggac gacggcaact 480
acaagacccg cgccgaggtg aagttcgagg gcgacaccct ggtgaaccgc atcgagctga 540
agggcatcga cttcaaggag gacggcaaca tcctggggca caagctggag tacaactaca 600
acagccacaa cgtctatatc atggccgaca agcagaagaa cggcatcaag gtgaacttca 660
agatccgcca caacatcgag gacggcagcg tgcagctcgc cgaccactac cagcagaaca 720
cccccatcgg cgacggcccc gtgctgctgc ccgacaacca ctacctgagc acccagtccg 780
ccctgagcaa agaccccaac gagaagcgcg atcacatggt cctgctggag ttcgtgaccg 840
ccgccgggat cactctcggc atggacgagc tgtacaaggg ctccggcgag ggcaggggaa 900
gtcttctaac atgcggggac gtggaggaaa atcccggccc aaccgagtac aagcccacgg 960
tgcgcctcgc cacccgcgac gacgtcccca gggccgtacg caccctcgcc gccgcgttcg 1020
ccgactaccc cgccacgcgc cacaccgtcg atccggaccg ccacatcgag cgggtcaccg 1080
agctgcaaga actcttcctc acgcgcgtcg ggctcgacat cggcaaggtg tgggtcgcgg 1140
acgacggcgc cgcggtggcg gtctggacca cgccggagag cgtcgaagcg ggggcggtgt 1200
tcgccgagat cggcccgcgc atggccgagt tgagcggttc ccggctggcc gcgcagcaac 1260
agatggaagg cctcctggcg ccgcaccggc ccaaggagcc cgcgtggttc ctggccaccg 1320
tcggagtctc gcccgaccac cagggcaagg gtctgggcag cgccgtcgtg ctccccggag 1380
tggaggcggc cgagcgcgcc ggggtgcccg ccttcctgga gacctccgcg ccccgcaacc 1440
tccccttcta cgagcggctc ggcttcaccg tcaccgccga cgtcgaggtg cccgaaggac 1500
cgcgcacctg gtgcatgacc cgcaagcccg gtgcctgaac gcgttaagtc gacaatcaac 1560
ctctggatta caaaatttgt gaaagattga ctggtattct taactatgtt gctcctttta 1620
cgctatgtgg atacgctgct ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt 1680
tcattttctc ctccttgtat aaatcctggt tgctgtctct ttatgaggag ttgtggcccg 1740
ttgtcaggca acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc actggttggg 1800
gcattgccac cacctgtcag ctcctttccg ggactttcgc tttccccctc cctattgcca 1860
cggcggaact catcgccgcc tgccttgccc gctgctggac aggggctcgg ctgttgggca 1920
ctgacaattc cgtggtgttg tcggggaaat catcgtcctt tccttggctg ctcgcctgtg 1980
ttgccacctg gattctgcgc gggacgtcct tctgctacgt cccttcggcc ctcaatccag 2040
cggaccttcc ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc 2100
gccctcagac gagtcggatc tccctttggg ccgcctcccc gcgtcgactt taagaccaat 2160
gacttacaag gcagctgtag atcttagcca ctttttaaaa gaaaaggggg gactggaagg 2220
gctaattcac tcccaacgaa gacaagatct gctttttgct tgtactgggt ctctctggtt 2280
agaccagatc tgagcctggg agctctctgg ctaactaggg aacccactgc ttaagcctca 2340
ataaagcttg ccttgagtgc ttcaagtagt gtgtgcccgt ctgttgtgtg actctggtaa 2400
ctagagatcc ctcagaccct tttagtcagt gtggaaaatc tctagcaggg cccgtttaaa 2460
cccgctgatc agcctcgact gtgccttcta gttgccagcc atctgttgtt tgcccctccc 2520
ccgtgccttc cttgaccctg gaaggtgcca ctcccactgt cctttcctaa taaaatgagg 2580
aaattgcatc gcattgtctg agtaggtgtc attctattct ggggggtggg gtggggcagg 2640
acagcaaggg ggaggattgg gaagacaata gcaggcatgc tggggatgcg gtgggctcta 2700
tggcctgcag gggcgcctga tgcggta 2727
<210> 8
<211> 410
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 8
gataaacatg tgagggccta tttcccatga ttccttcata tttgcatata cgatacaagg 60
ctgttagaga gataattgga attaatttga ctgtaaacac aaagatatta gtacaaaata 120
cgtgacgtag aaagtaataa tttcttgggt agtttgcagt tttaaaatta tgttttaaaa 180
tggactatca tatgcttacc gtaacttgaa agtatttcga tttcttggct ttatatatct 240
tgtggaaagg acgaaacacc gggtcttcga gaagacctgt tttagagcta gaaatagcaa 300
gttaaaataa ggctagtccg ttatcaactt gaaaaagtgg caccgagtcg gtgctttttt 360
ctagcgcgtg cgccaattct gcagacaaat ggctctagag gtacccatag 410
<210> 9
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 9
agttatggca gaactcagtg 20
<210> 10
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 10
caccgagtta tggcagaact cagtg 25
<210> 11
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 11
aaaccactga gttctgccat aactc 25
<210> 12
<211> 23
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 12
ccccatccaa agtttttaaa gga 23
<210> 13
<211> 23
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 13
tgtggcagat gtcacagttt agg 23
<210> 14
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 14
aguuauggca gaacucagug guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 15
<211> 2009
<212> PRT
<213> 猪(Sus scrofa)
<400> 15
Met Glu Gln Thr Val Leu Val Pro Pro Gly Pro Asp Ser Phe Asn Phe
1 5 10 15
Phe Thr Arg Glu Ser Leu Ala Ala Ile Glu Arg Arg Ile Ala Glu Glu
20 25 30
Lys Ala Lys Asn Pro Lys Pro Asp Lys Lys Asp Asp Asp Glu His Gly
35 40 45
Pro Lys Pro Asn Ser Asp Leu Glu Ala Gly Lys Asn Leu Pro Phe Ile
50 55 60
Tyr Gly Asp Ile Pro Pro Glu Met Val Ser Glu Pro Leu Glu Asp Leu
65 70 75 80
Asp Pro Tyr Tyr Ile Asn Lys Lys Thr Phe Ile Val Leu Asn Lys Gly
85 90 95
Lys Ala Ile Phe Arg Phe Ser Ala Thr Ser Ala Leu Tyr Ile Leu Thr
100 105 110
Pro Phe Asn Pro Leu Arg Lys Ile Ala Ile Lys Ile Leu Val His Ser
115 120 125
Leu Phe Ser Met Leu Ile Met Cys Thr Ile Leu Thr Asn Cys Val Phe
130 135 140
Met Thr Met Ser Asn Pro Pro Asp Trp Thr Lys Asn Val Glu Tyr Thr
145 150 155 160
Phe Thr Gly Ile Tyr Thr Phe Glu Ser Leu Ile Lys Ile Ile Ala Arg
165 170 175
Gly Phe Cys Leu Glu Asp Phe Thr Phe Leu Arg Asp Pro Trp Asn Trp
180 185 190
Leu Asp Phe Thr Val Ile Thr Phe Ala Tyr Val Thr Glu Phe Val Asp
195 200 205
Leu Gly Asn Val Ser Ala Leu Arg Thr Phe Arg Val Leu Arg Ala Leu
210 215 220
Lys Thr Ile Ser Val Ile Pro Gly Leu Lys Thr Ile Val Gly Ala Leu
225 230 235 240
Ile Gln Ser Val Lys Lys Leu Ser Asp Val Met Ile Leu Thr Val Phe
245 250 255
Cys Leu Ser Val Phe Ala Leu Ile Gly Leu Gln Leu Phe Met Gly Asn
260 265 270
Leu Arg Asn Lys Cys Val Gln Trp Pro Pro Thr Asn Ala Ser Leu Glu
275 280 285
Glu His Ser Ile Glu Lys Asn Val Thr Arg Asp Tyr Asn Gly Thr Leu
290 295 300
Val Asn Glu Thr Ser Thr Glu Phe Asp Trp Lys Ser Tyr Ile Gln Asp
305 310 315 320
Ser Arg Tyr His Tyr Phe Leu Glu Gly Phe Pro Asp Ala Leu Leu Cys
325 330 335
Gly Asn Ser Ser Asp Ala Gly Gln Cys Pro Glu Gly Tyr Met Cys Val
340 345 350
Lys Ala Gly Arg Asn Pro Asn Tyr Gly Tyr Thr Ser Phe Asp Thr Phe
355 360 365
Ser Trp Ala Phe Leu Ser Leu Phe Arg Leu Met Thr Gln Asp Phe Trp
370 375 380
Glu Asn Leu Tyr Gln Leu Thr Leu Arg Ala Ala Gly Lys Thr Tyr Met
385 390 395 400
Ile Phe Phe Val Leu Val Ile Phe Leu Gly Ser Phe Tyr Leu Ile Asn
405 410 415
Leu Ile Leu Ala Val Val Ala Met Ala Tyr Glu Glu Gln Asn Gln Ala
420 425 430
Thr Leu Glu Glu Ala Glu Gln Lys Glu Ala Glu Phe Gln Gln Met Leu
435 440 445
Glu Gln Leu Lys Lys Gln Gln Glu Ala Ala Gln Gln Ala Ala Ala Val
450 455 460
Thr Ala Ser Glu His Ser Arg Glu Pro Ser Ala Ala Gly Gly Leu Ser
465 470 475 480
Asp Ser Ser Ser Glu Thr Ser Lys Leu Ser Ser Lys Ser Ala Lys Glu
485 490 495
Arg Arg Asn Arg Arg Lys Lys Arg Lys Gln Lys Glu Gln Ser Gly Gly
500 505 510
Glu Glu Lys Asp Asp Asp Asp Phe His Lys Ser Glu Ser Glu Asp Ser
515 520 525
Ile Arg Arg Lys Gly Phe Arg Phe Ser Ile Glu Gly Asn Arg Leu Thr
530 535 540
Tyr Glu Lys Arg Tyr Ser Ser Pro His Gln Ser Leu Leu Ser Ile Arg
545 550 555 560
Gly Ser Leu Phe Ser Pro Arg Arg Asn Ser Arg Thr Ser Leu Phe Ser
565 570 575
Phe Arg Gly Arg Ala Lys Asp Val Gly Ser Glu Asn Asp Phe Ala Asp
580 585 590
Asp Glu His Ser Thr Phe Glu Asp Asn Glu Ser Arg Arg Asp Ser Leu
595 600 605
Phe Val Pro Arg Arg His Gly Glu Arg Arg Asn Ser Asn Leu Ser Gln
610 615 620
Thr Ser Arg Ser Ser Arg Met Leu Ala Val Phe Pro Ala Asn Gly Lys
625 630 635 640
Met His Ser Thr Val Asp Cys Asn Gly Val Val Ser Leu Val Gly Gly
645 650 655
Pro Ser Val Pro Thr Ser Pro Val Gly Gln Leu Leu Pro Glu Val Ile
660 665 670
Ile Asp Lys Pro Ala Thr Asp Asp Asn Gly Thr Thr Thr Glu Thr Asp
675 680 685
Met Arg Lys Arg Arg Ser Ser Ser Phe His Val Ser Met Asp Phe Leu
690 695 700
Glu Asp Pro Ser Gln Arg Gln Arg Ala Met Ser Ile Ala Ser Ile Leu
705 710 715 720
Thr Asn Thr Val Glu Glu Leu Glu Glu Ser Arg Gln Lys Cys Pro Pro
725 730 735
Cys Trp Tyr Lys Phe Ser Asn Ile Phe Leu Ile Trp Asp Cys Ser Pro
740 745 750
Tyr Trp Leu Lys Val Lys His Ile Val Asn Leu Val Val Met Asp Pro
755 760 765
Phe Val Asp Leu Ala Ile Thr Ile Cys Ile Val Leu Asn Thr Leu Phe
770 775 780
Met Ala Met Glu His Tyr Pro Met Thr Asp His Phe Asn Asn Val Leu
785 790 795 800
Thr Val Gly Asn Leu Val Phe Thr Gly Ile Phe Thr Ala Glu Met Phe
805 810 815
Leu Lys Ile Ile Ala Met Asp Pro Tyr Tyr Tyr Phe Gln Glu Gly Trp
820 825 830
Asn Ile Phe Asp Gly Phe Ile Val Thr Leu Ser Leu Val Glu Leu Gly
835 840 845
Leu Ala Asn Val Glu Gly Leu Ser Val Leu Arg Ser Phe Arg Leu Leu
850 855 860
Arg Val Phe Lys Leu Ala Lys Ser Trp Pro Thr Leu Asn Met Leu Ile
865 870 875 880
Lys Ile Ile Gly Asn Ser Val Gly Ala Leu Gly Asn Leu Thr Leu Val
885 890 895
Leu Ala Ile Ile Val Phe Ile Phe Ala Val Val Gly Met Gln Leu Phe
900 905 910
Gly Lys Ser Tyr Lys Asp Cys Val Cys Lys Ile Asn Ser Glu Cys Lys
915 920 925
Leu Pro Arg Trp His Met Asn Asp Phe Phe His Ser Phe Leu Ile Val
930 935 940
Phe Arg Val Leu Cys Gly Glu Trp Ile Glu Thr Met Trp Asp Cys Met
945 950 955 960
Glu Val Ala Gly Gln Ala Met Cys Leu Thr Val Phe Met Met Val Met
965 970 975
Val Ile Gly Asn Leu Val Val Leu Asn Leu Phe Leu Ala Leu Leu Leu
980 985 990
Ser Ser Phe Ser Ala Asp Asn Leu Ala Ala Thr Asp Asp Asp Asn Glu
995 1000 1005
Met Asn Asn Leu Gln Ile Ala Val Asp Arg Met His Lys Gly Ile Ala
1010 1015 1020
Tyr Val Lys Arg Lys Ile Tyr Glu Phe Ile Gln Gln Ser Phe Val Arg
1025 1030 1035 1040
Lys Gln Lys Ile Leu Asp Glu Ile Lys Pro Leu Asp Asp Leu Asn Asn
1045 1050 1055
Arg Lys Asp Cys Cys Met Ser Asn His Thr Gly Glu Thr Gly Lys Asp
1060 1065 1070
Leu Asp Tyr Leu Lys Asp Val Asn Gly Thr Thr Ser Gly Ile Gly Thr
1075 1080 1085
Gly Ser Ser Val Glu Lys Tyr Ile Ile Asp Glu Ser Asp Tyr Met Ser
1090 1095 1100
Phe Ile Asn Asn Pro Ser Leu Thr Val Thr Val Pro Ile Ala Val Gly
1105 1110 1115 1120
Glu Ser Asp Phe Glu Asn Leu Asn Thr Glu Asp Phe Ser Ser Glu Ser
1125 1130 1135
Asp Leu Glu Glu Ser Lys Glu Lys Leu Asn Glu Ser Ser Ser Ser Ser
1140 1145 1150
Glu Gly Ser Thr Val Asp Ile Gly Ala Pro Ala Glu Glu Gln Pro Val
1155 1160 1165
Val Glu Pro Glu Glu Thr Leu Glu Pro Glu Ala Cys Phe Thr Glu Gly
1170 1175 1180
Cys Val Gln Arg Phe Lys Cys Cys Gln Ile Ser Val Glu Glu Gly Arg
1185 1190 1195 1200
Gly Lys Gln Trp Trp Asn Leu Arg Arg Thr Cys Phe Arg Ile Val Glu
1205 1210 1215
His Asn Trp Phe Glu Thr Phe Ile Val Phe Met Ile Leu Leu Ser Ser
1220 1225 1230
Gly Ala Leu Ala Phe Glu Asp Ile Tyr Ile Asp Gln Arg Lys Thr Ile
1235 1240 1245
Lys Thr Met Leu Glu Tyr Ala Asp Lys Val Phe Thr Tyr Ile Phe Ile
1250 1255 1260
Leu Glu Met Leu Leu Lys Trp Val Ala Tyr Gly Tyr Gln Thr Tyr Phe
1265 1270 1275 1280
Thr Asn Ala Trp Cys Trp Leu Asp Phe Leu Ile Val Asp Val Ser Leu
1285 1290 1295
Val Ser Leu Thr Ala Asn Ala Leu Gly Tyr Ser Glu Leu Gly Ala Ile
1300 1305 1310
Lys Ser Leu Arg Thr Leu Arg Ala Leu Arg Pro Leu Arg Ala Leu Ser
1315 1320 1325
Arg Phe Glu Gly Met Arg Val Val Val Asn Ala Leu Leu Gly Ala Ile
1330 1335 1340
Pro Ser Ile Met Asn Val Leu Leu Val Cys Leu Ile Phe Trp Leu Ile
1345 1350 1355 1360
Phe Ser Ile Met Gly Val Asn Leu Phe Ala Gly Lys Phe Tyr His Cys
1365 1370 1375
Val Asn Thr Thr Thr Gly Asp Met Phe Asp Ile Ser Glu Val Asn Asn
1380 1385 1390
His Ser Asp Cys Leu Ile Leu Ile Glu Arg Asn Glu Thr Ala Arg Trp
1395 1400 1405
Lys Asn Val Lys Val Asn Phe Asp Asn Val Gly Phe Gly Tyr Leu Ser
1410 1415 1420
Leu Leu Gln Val Ala Thr Phe Lys Gly Trp Met Asp Ile Met Tyr Ala
1425 1430 1435 1440
Ala Val Asp Ser Arg Asn Val Glu Leu Gln Pro Lys Tyr Glu Glu Ser
1445 1450 1455
Leu Tyr Met Tyr Leu Tyr Phe Val Ile Phe Ile Ile Phe Gly Ser Phe
1460 1465 1470
Phe Thr Leu Asn Leu Phe Ile Gly Val Ile Ile Asp Asn Phe Asn Gln
1475 1480 1485
Gln Lys Lys Lys Phe Gly Gly Gln Asp Ile Phe Met Thr Glu Glu Gln
1490 1495 1500
Lys Lys Tyr Tyr Asn Ala Met Lys Lys Leu Gly Ser Lys Lys Pro Gln
1505 1510 1515 1520
Lys Pro Ile Pro Arg Pro Gly Asn Lys Phe Gln Gly Met Val Phe Asp
1525 1530 1535
Phe Val Thr Arg Gln Val Phe Asp Ile Ser Ile Met Ile Leu Ile Cys
1540 1545 1550
Leu Asn Met Val Thr Met Met Val Glu Thr Asp Asp Gln Ser Asp Tyr
1555 1560 1565
Val Thr Asn Ile Leu Ser Arg Ile Asn Leu Val Phe Ile Val Leu Phe
1570 1575 1580
Thr Gly Glu Cys Val Leu Lys Leu Ile Ser Leu Arg His Tyr Tyr Phe
1585 1590 1595 1600
Thr Ile Gly Trp Asn Ile Phe Asp Phe Val Val Val Ile Leu Ser Ile
1605 1610 1615
Val Gly Met Phe Leu Ala Glu Leu Ile Glu Lys Tyr Phe Val Ser Pro
1620 1625 1630
Thr Leu Phe Arg Val Ile Arg Leu Ala Arg Ile Gly Arg Ile Leu Arg
1635 1640 1645
Leu Ile Lys Gly Ala Lys Gly Ile Arg Thr Leu Leu Phe Ala Leu Met
1650 1655 1660
Met Ser Leu Pro Ala Leu Phe Asn Ile Gly Leu Leu Leu Phe Leu Val
1665 1670 1675 1680
Met Phe Ile Tyr Ala Ile Phe Gly Met Ser Asn Phe Ala Tyr Val Lys
1685 1690 1695
Arg Glu Val Gly Ile Asp Asp Met Phe Asn Phe Glu Thr Phe Gly Asn
1700 1705 1710
Ser Met Ile Cys Leu Phe Gln Ile Thr Thr Ser Ala Gly Trp Asp Gly
1715 1720 1725
Leu Leu Ala Pro Ile Leu Asn Ser Lys Pro Pro Asp Cys Asp Pro Asn
1730 1735 1740
Lys Val Asn Pro Gly Ser Ser Val Lys Gly Asp Cys Gly Asn Pro Ser
1745 1750 1755 1760
Val Gly Ile Phe Phe Phe Val Ser Tyr Ile Ile Ile Ser Phe Leu Val
1765 1770 1775
Val Val Asn Met Tyr Ile Ala Val Ile Leu Glu Asn Phe Ser Val Ala
1780 1785 1790
Thr Glu Glu Ser Ala Glu Pro Leu Ser Glu Asp Asp Phe Glu Met Phe
1795 1800 1805
Tyr Glu Val Trp Glu Lys Phe Asp Pro Asp Ala Thr Gln Phe Met Glu
1810 1815 1820
Phe Glu Lys Leu Ser Gln Phe Ala Ala Ala Leu Glu Pro Pro Leu Asn
1825 1830 1835 1840
Leu Pro Gln Pro Asn Lys Leu Gln Leu Ile Ala Met Asp Leu Pro Met
1845 1850 1855
Val Ser Gly Asp Arg Ile His Cys Leu Asp Ile Leu Phe Ala Phe Thr
1860 1865 1870
Lys Arg Val Leu Gly Glu Ser Gly Glu Met Asp Ala Leu Arg Ile Gln
1875 1880 1885
Met Glu Glu Arg Phe Met Ala Ser Asn Pro Ser Lys Val Ser Tyr Gln
1890 1895 1900
Pro Ile Thr Thr Thr Leu Lys Arg Lys Gln Glu Glu Val Ser Ala Val
1905 1910 1915 1920
Ile Ile Gln Arg Ala Tyr Arg Arg His Leu Leu Lys Arg Thr Val Lys
1925 1930 1935
Gln Ala Ser Phe Thr Tyr Asn Lys Asn Lys Ile Lys Gly Gly Ala Asn
1940 1945 1950
Leu Leu Val Lys Glu Asp Met Ile Ile Asp Arg Ile Asn Glu Asn Ser
1955 1960 1965
Met Thr Glu Lys Thr Asp Leu Thr Met Ser Thr Ala Ala Cys Pro Pro
1970 1975 1980
Ser Tyr Asp Arg Val Thr Lys Pro Ile Val Glu Lys His Glu Gln Glu
1985 1990 1995 2000
Gly Lys Asp Glu Lys Ala Lys Gly Lys
2005
<210> 16
<211> 1600
<212> DNA
<213> 猪(Sus scrofa)
<400> 16
taaactcagt aaatgaaaca actgatatga ctggagcttg aaataaacga tgtgatgatc 60
taatgaaata cataatgcta aattgtcttg cttcttatgc aaaaatgatt attagtcata 120
gcaatgcatg aataattaag gacaaaacta tattaggtat ttaataatat ttctatattt 180
atacacctga atttttagtt tattaaaata tattggtcaa accaactctt gtcccaatat 240
tttagtttca ttctttaata tattgccttt tttaatgagt taatcttctg ttaggatttc 300
ttacttatct ttttctaata aacaagtttc ctgacttcaa tctggcagca aattcctctg 360
attttgtttt tcctttagcc ttttatactt cctcttctct tttttcaaac ttttttatcg 420
ttgttatatt atttatattt atttccctaa atgtattgct taagaaagaa tatatttcat 480
tttatgactt ttttactttg cttttaaata tcctagaata gtcttgatag taaaaaagaa 540
aaaattagaa agactgaaga ctactagggg aatgtctttt tttttttctt ttaatcaata 600
agaattctga cttttctttt tttccatttg tgtattaggt ggttgtgaat gcccttttag 660
gagcaattcc atccatcatg aatgtgctcc tggtttgtct tatattctgg ctaattttca 720
gcatcatggg cgtaaatttg tttgctggca aattctacca ctgtgttaac accacaactg 780
gtgacatgtt tgacatcagc gaagtgaata atcattctga ttgcctaata ctaatagaaa 840
gaaatgagac tgcccgatgg aaaaatgtga aagtaaactt tgataatgta ggatttgggt 900
atctctcttt gcttcaagtt gtaagtgaat actcttttct ctgaaatgtg tttattgttt 960
ggaatgataa caaagtaatg acatgtagct atgatttagt taccaagaag aaagtttata 1020
atatatttcc ttttttcaaa ttttgattta ttagtacaag tttatgagcc cagatgggtg 1080
aaacaaacgt atatttaagc agtgcagatg tagtcaatag tggccaggta ttttagttcg 1140
ttccaaagat cttctctttc ttctcatatt attctctctt cccatatgaa caagtagatt 1200
aataaattaa taaatgtatt atttttccta ttgcaattgg aaagcttgtt tcactgaggg 1260
atcacactcc caccccctgc tcaaaacaca actgcttagc acacctggtt ggattataac 1320
taattggtag acattaactg ctactaaatt gtatcatttg gactgaagct tggggaatga 1380
tatgtaggct ttaagattag atcctcatag tgaaatggaa taataccagt taacacctag 1440
acacggaaat gaaacacaag tagctgttgt ccccaaatga cctttcagta caattactta 1500
agtctttttt cccccctata tattaatgtt ttcactactg ggtctctaag actgatttac 1560
gcaaaagaaa aagttcataa tttatgaaaa tacaaaagat 1600
<210> 17
<211> 23
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 17
gaaagactga agactactag ggg 23
<210> 18
<211> 23
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 18
ttattaattt attaatctac ttg 23
<210> 19
<211> 23
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 19
aaatatccta gaatagtctt gat 23
<210> 20
<211> 23
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 20
gttataatcc aaccaggtgt gct 23
<210> 21
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 21
cagttgtggt gttaacacag 20
<210> 22
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 22
aaagaaatga gactgcccga 20
<210> 23
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 23
aaagtaaact ttgataatgt 20
<210> 24
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 24
gtcaaacatg tcaccagttg 20
<210> 25
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 25
catttctttc tattagtatt 20
<210> 26
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 26
caccgcagtt gtggtgttaa cacag 25
<210> 27
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 27
aaacctgtgt taacaccaca actgc 25
<210> 28
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 28
caccgaaaga aatgagactg cccga 25
<210> 29
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 29
aaactcgggc agtctcattt ctttc 25
<210> 30
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 30
caccgaaagt aaactttgat aatgt 25
<210> 31
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 31
aaacacatta tcaaagttta ctttc 25
<210> 32
<211> 24
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 32
caccgtcaaa catgtcacca gttg 24
<210> 33
<211> 24
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 33
aaaccaactg gtgacatgtt tgac 24
<210> 34
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 34
caccgcattt ctttctatta gtatt 25
<210> 35
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 35
aaacaatact aatagaaaga aatgc 25
<210> 36
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 36
caguuguggu guuaacacag guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 37
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 37
aaagaaauga gacugcccga guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 38
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 38
aaaguaaacu uugauaaugu guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 39
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 39
gucaaacaug ucaccaguug guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100
<210> 40
<211> 100
<212> RNA
<213> 人工序列(Artificial Sequence)
<400> 40
cauuucuuuc uauuaguauu guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 100

Claims (3)

1.一种用于猪SCN1A编辑的CRISPR/Cas9系统,其特征在于包含Cas9表达载体和针对猪SCN1A的gRNA表达载体;所述的Cas9表达载体为质粒全序列如SEQ ID NO.2所示的pU6gRNA-eEF1a-mNLS-hSpCas9-EGFP-PURO载体;所述的针对猪SCN1A基因的gRNA表达载体由SEQ IDNO.26和SEQ ID NO.27所示的单链DNA退火而成的双链插入载体骨架pKG-U6gRNA所得;该表达载体表达SEQ ID NO.36所示的gRNA,其靶点如SEQ ID NO.21所示;载体骨架pKG-U6gRNA的质粒全序列如SEQ ID NO.3所示;所述的gRNA表达载体和Cas9表达载体的摩尔比为3:1。
2.一种猪SCN1A基因敲除的猪重组细胞,其特征在于由权利要求1所述的CRISPR/Cas9系统共转染猪原代成纤维细胞经验证后所得。
3.权利要求1所述的CRISPR/Cas9系统在构建猪SCN1A基因敲除的猪重组细胞中的应用。
CN202011554428.2A 2020-12-24 2020-12-24 Crispr系统及其在构建scn1a基因突变的癫痫性脑病克隆猪核供体细胞中的应用 Active CN112575033B (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011554428.2A CN112575033B (zh) 2020-12-24 2020-12-24 Crispr系统及其在构建scn1a基因突变的癫痫性脑病克隆猪核供体细胞中的应用

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011554428.2A CN112575033B (zh) 2020-12-24 2020-12-24 Crispr系统及其在构建scn1a基因突变的癫痫性脑病克隆猪核供体细胞中的应用

Publications (2)

Publication Number Publication Date
CN112575033A CN112575033A (zh) 2021-03-30
CN112575033B true CN112575033B (zh) 2023-03-10

Family

ID=75139635

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011554428.2A Active CN112575033B (zh) 2020-12-24 2020-12-24 Crispr系统及其在构建scn1a基因突变的癫痫性脑病克隆猪核供体细胞中的应用

Country Status (1)

Country Link
CN (1) CN112575033B (zh)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101255424A (zh) * 2007-11-26 2008-09-03 广州医学院第二附属医院 与癫痫相关scn1a基因的启动子及其构建方法和临床应用
KR20150056445A (ko) * 2013-11-15 2015-05-26 한국과학기술원 뇌전증 동물모델 및 그의 제조 방법
WO2015187988A1 (en) * 2014-06-04 2015-12-10 Intellimedix Composition and methods of treating epilepsy and/or epilepsy-related disorders
JP2018088888A (ja) * 2016-12-06 2018-06-14 国立研究開発法人理化学研究所 Scn1a遺伝子の発現増強法
WO2019224864A1 (ja) * 2018-05-21 2019-11-28 国立研究開発法人理化学研究所 Scn1a遺伝子の発現増強法とそれによるドラベ症候群の治療法
CN110951781A (zh) * 2019-12-18 2020-04-03 武汉大学 一种癫痫动物模型的构建方法及应用

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101255424A (zh) * 2007-11-26 2008-09-03 广州医学院第二附属医院 与癫痫相关scn1a基因的启动子及其构建方法和临床应用
KR20150056445A (ko) * 2013-11-15 2015-05-26 한국과학기술원 뇌전증 동물모델 및 그의 제조 방법
WO2015187988A1 (en) * 2014-06-04 2015-12-10 Intellimedix Composition and methods of treating epilepsy and/or epilepsy-related disorders
JP2018088888A (ja) * 2016-12-06 2018-06-14 国立研究開発法人理化学研究所 Scn1a遺伝子の発現増強法
WO2019224864A1 (ja) * 2018-05-21 2019-11-28 国立研究開発法人理化学研究所 Scn1a遺伝子の発現増強法とそれによるドラベ症候群の治療法
CN110951781A (zh) * 2019-12-18 2020-04-03 武汉大学 一种癫痫动物模型的构建方法及应用

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
RNA-seq Analysis of the SCN1A-KO Model based on CRISPR/Cas9 Genome Editing Technology;Xiaoguang Shi等;《Neuroscience》;20181208;第1-11页 *

Also Published As

Publication number Publication date
CN112575033A (zh) 2021-03-30

Similar Documents

Publication Publication Date Title
CN112779292B (zh) 构建瘦肉率高、生长快且抗蓝耳病和系列腹泻病的优质猪核移植供体细胞的方法及其应用
CN112779291B (zh) 构建瘦肉率高、生长快、繁殖力高且抗系列流行病的优质猪核移植供体细胞的方法及其应用
CN112522260B (zh) Crispr系统及其在制备ttn基因突变的扩张型心肌病克隆猪核供体细胞中的应用
CN112877362A (zh) 构建高繁殖力、抗蓝耳病和系列腹泻病的优质猪核移植供体细胞的基因编辑系统及其应用
CN112522264B (zh) 一种导致先天性耳聋的CRISPR/Cas9系统及其在制备模型猪核供体细胞中的应用
CN112522313B (zh) 用于构建TPH2基因突变的抑郁症克隆猪核供体细胞的CRISPR/Cas9系统
CN113046388B (zh) 用于构建双基因联合敲除的动脉粥样硬化猪核移植供体细胞的crispr系统及其应用
CN112522261B (zh) 用于制备lmna基因突变的扩张型心肌病克隆猪核供体细胞的crispr系统及其应用
CN114958762B (zh) 一种构建神经组织特异过表达人源snca的帕金森病模型猪的方法及应用
CN112575033B (zh) Crispr系统及其在构建scn1a基因突变的癫痫性脑病克隆猪核供体细胞中的应用
CN112680453B (zh) Crispr系统及其在构建stxbp1突变的癫痫性脑病克隆猪核供体细胞中的应用
CN112813101B (zh) 一种构建瘦肉率高、生长快的优质猪核移植供体细胞的基因编辑系统及其应用
CN112899306B (zh) Crispr系统及其在构建gabrg2基因突变的克隆猪核供体细胞中的应用
CN112795566B (zh) 用于构建骨质疏松症克隆猪核供体细胞系的opg基因编辑系统及其应用
CN112680444B (zh) 用于oca2基因突变的crispr系统及其在构建白化病克隆猪核供体细胞中的应用
CN112608941B (zh) 用于构建mc4r基因突变的肥胖症猪核移植供体细胞的crispr系统及其应用
CN112522311B (zh) 用于adcy3基因编辑的crispr系统及其在构建肥胖症猪核移植供体细胞中应用
CN113584078B (zh) 用于双靶标基因编辑的crispr系统及其在构建抑郁症猪核移植供体细胞中的应用
CN112522258B (zh) Il2rg基因和ada基因联合敲除的重组细胞及其在制备免疫缺陷猪模型中的应用
CN112522309B (zh) 重症免疫缺陷猪源重组细胞及其制备方法和试剂盒
CN112522202B (zh) 制备addi四个基因联合敲除的重症免疫缺陷猪源重组细胞的方法及其专用试剂盒
CN112522257B (zh) 用于制备rrip四个基因联合敲除的重症免疫缺陷猪源重组细胞的系统
CN112522256B (zh) CRISPR/Cas9系统及其在构建抗肌萎缩蛋白基因缺陷的猪源重组细胞中的应用
CN112522255B (zh) CRISPR/Cas9系统及其在构建胰岛素受体底物基因缺陷的猪源重组细胞中的应用
CN112877359A (zh) CRISPR/cas系统及其在构建INHA突变的高繁殖力猪核移植供体细胞中的应用

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant