CN111909914B - 核酸内切酶SpCas9的高PAM兼容性截短型变异体txCas9及其应用 - Google Patents

核酸内切酶SpCas9的高PAM兼容性截短型变异体txCas9及其应用 Download PDF

Info

Publication number
CN111909914B
CN111909914B CN202010695299.2A CN202010695299A CN111909914B CN 111909914 B CN111909914 B CN 111909914B CN 202010695299 A CN202010695299 A CN 202010695299A CN 111909914 B CN111909914 B CN 111909914B
Authority
CN
China
Prior art keywords
lys
leu
glu
asp
ile
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010695299.2A
Other languages
English (en)
Other versions
CN111909914A (zh
Inventor
黄强
朱海霞
薛冬梅
杜文豪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fudan University
Original Assignee
Fudan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fudan University filed Critical Fudan University
Priority to CN202010695299.2A priority Critical patent/CN111909914B/zh
Publication of CN111909914A publication Critical patent/CN111909914A/zh
Application granted granted Critical
Publication of CN111909914B publication Critical patent/CN111909914B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/18Carboxylic ester hydrolases (3.1.1)
    • C12N9/20Triglyceride splitting, e.g. by means of lipase
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/70Vectors or expression systems specially adapted for E. coli

Abstract

本发明属于蛋白质工程技术领域,具体为一种来源于酿脓链球菌的CRISPR核酸酶SpCas9的高PAM兼容性截短型变异体txCas9及其应用。本发明中的高PAM兼容性截短型变异体txCas9核酸酶属于CRISPR‑Cas9系统。txCas9核酸酶是将高PAM兼容性xCas9核酸酶的第180位到299位的氨基酸截掉后重组所得。该高PAM兼容性截短型变异体txCas9不仅具有与野生型CRISPR‑Cas9核酸酶相当的基因编辑活性,而且能够识别NGN、GAA、GAT PAM,有效扩宽了CRISPR‑Cas9核酸酶的靶向范围。重要的是,该小型化Cas9核酸酶适合使用装载容量有限的腺病毒载体进行体内运输,因而在生物医药的体内精准编辑方面具有更大的应用潜力。

Description

核酸内切酶SpCas9的高PAM兼容性截短型变异体txCas9及其 应用
技术领域
本发明属于蛋白质工程技术领域,具体涉及来源于酿脓链球菌的CRISPR-Cas9核酸酶SpCas9的高PAM兼容性截短型变异体txCas9及其应用。
背景技术
CRISPR-Cas9是细菌和古细菌中抵抗外源核酸的免疫系统,目前已发展为一种特异性切割DNA片段的基因编辑技术(1)。其中,来源于酿脓链球菌(Streptococcuspyogenes)的SpCas9,因其具有构建简单、可操作性强、效率高等特点,目前已成为应用最为广泛的CRISPR核酸酶(2)。近年来,该系统已被广泛应用于基因工程领域,比如,利用CRISPR系统进行基因干扰,从而阐明基因功能(3);通过设计靶向致病基因位点的引导RNA,快速建立细胞或动物疾病模型(4);将失活的Cas9(dCas9)与增强子或沉默子融合,使基因转录上调或沉默(5)。值得一提的是,dCas9与碱基修饰酶融合后形成的碱基编辑系统使CRISPR-Cas9进入了精准基因组编辑时代,该系统利用脱氨酶对靶位点上的嘌呤或嘧啶进行脱氨反应,实现精准的碱基替换,最终使导致约2/3人类疾病发生的单核苷酸变异得到修复(6)。
虽然SpCas9是一个强大的基因编辑工具,但仍存在着许多有待解决的问题。比如SpCas9体积过于庞大,其4.2kb的大小对于常用的装载容量只有4.5kb的腺病毒载体而言包装困难,因此难以通过病毒载体将其高效靶向地运载到体内或细胞中,从而限制了其在临床医学治疗上的应用(7);当sgRNA引导SpCas9至特定靶序列时,sgRNA可能会与其他类似靶序列的基因位点发生错误匹配,从而激活Cas9切割非靶DNA,导致脱靶效应(8);SpCas9需要位于靶位点上游的NGG前间隔序列邻近基序(protospace adjacent motif,PAM)才能进行靶标识别,因此限制了其可靶向的基因组位点(9)。
为解决以上问题,人们通过定向进化、同源替换和结构指导等方法设计了一系列SpCas9突变体。其中,Johnny等通过定向进化的方法获得的xCas9,不仅具有与SpCas9相当的编辑活性,而且还具有PAM兼容性更高、脱靶率更低等特点,因此成为目前应用最多的SpCas9突变体之一(10)。虽然,xCas9已成功改善SpCas9存在的两个不足,但是其体积庞大的问题仍有待解决。因此,基于上述研究问题,在xCas9基础上获得小型化Cas9核酸酶,对有效实现体内基因位点的精准编辑是十分必要的。
发明内容
本发明的目的是提供一种小型化、且具有高特异性的来源于酿脓链球菌的CRISPR-Cas9核酸酶SpCas9的高PAM兼容性截短型变异体txCas9及其应用。
为实现上述目的,本发明采用以下技术方案:
本发明提供的来源于酿脓链球菌的CRISPR-Cas9核酸酶SpCas9的高PAM兼容性截短型变异体,其体积比xCas9小,具体是将xCas9第180位到299位氨基酸截掉所得,记为txCas9;属于CRISPR-Cas9系统。
在同等反应条件下,所述txCas9核酸酶与野生型SpCas9核酸酶具有同等的剪切活性。
在同等反应条件下,所述txCas9核酸酶比野生型SpCas9核酸酶的靶向范围更广。
在同等反应条件下,所述txCas9核酸酶比野生型SpCas9核酸酶的脱靶效率更低。
在同等反应条件下,所述txCas9核酸酶比野生型SpCas9核酸酶的体积更小。
所述野生型SpCas9核酸酶的核苷酸序列和氨基酸序列分别为SEQIDNO.1和SEQ IDNO.2所示。
所述高PAM兼容性xCas9核酸酶的核苷酸序列和氨基酸序列分别为SEQ ID NO.3和SEQ ID NO.4所示。
所述高PAM兼容性截短型变异体txCas9核酸酶的核苷酸序列和氨基酸序列分别为SEQ ID NO.5和SEQ ID NO.6所示,与野生型SpCas9的相似度达90%以上。
本发明还提供一种表达载体,其含有上述多核苷酸序列。
本发明还提供一种宿主细胞,可以用于转化上述表达载体。
本发明还提供所述txCas9核酸酶的制备方法,具体步骤包括:首先,构建所述txCas9核酸酶的多核苷酸序列表达载体;然后,将所述表达载体转化至宿主细胞,筛选并挑出单克隆;最后,将所述单克隆诱导表达,并通过亲和层析、离子交换方法从表达产物中分离出所述的txCas9核酸酶。
本发明提供的上述txCas9核酸酶、多核苷酸序列以及表达载体均可作为编辑基因组DNA的编辑工具,用于基因组DNA片段的相关编辑。
本发明中,所述的基因编辑可以是单点编辑,也可以是编辑位点大于等于两个的多点编辑。
所述编辑的手段包括删除、突变、插入、倒位、移位、重复或易位。
本发明中,所述txCas9核酸酶作为编辑工具包括与靶标DNA片段匹配的引导sgRNA。
所述的CRISPR-Cas9核酸酶与能够介导它的sgRNA组合,从而对目的基因进行编辑。
本发明中,所述含有编码CRISPR-Cas9核酸酶多核苷酸序列的载体和与之匹配的引导sgRNA一同转入宿主细胞,对基因进行编辑。
本发明中,所述单位点或多位点基因编辑,包括利用所述CRISPR-Cas9核酸酶对双链DNA进行剪切,并通过宿主细胞的修复系统对断裂的缺口进行修复。
本发明中,所述的单位点或多位点的基因编辑,是改变单位点或多位点编辑时的碱基突变特征。
与现有技术相比,本发明的txCas9工程核酸酶,属于CRISPR-Cas9系统,分别具有SEQ ID NO.5和SEQ ID NO.6的核苷酸序列和氨基酸序列,具有高特异性、高PAM兼容性和小型化等特点,可实现对体内基因组特定位置的精准编辑,具有潜在的生物医学应用价值。
附图说明
图1为pET21-6His-TEV-txCas9质粒构建图示。
图2为pET21-6His-TEV-txCas9质粒筛选与培养。
图3为txCas9目标蛋白的纯化方法。
图4为txCas9目标蛋白纯化获取的过程。
图5为CRISPR-Cas9目标蛋白纯化的电泳鉴定。
图6为sgRNA的在靶和脱靶序列。
图7为野生型SpCas9、xCas9及txCas9体外剪切活性的检测。
图8为野生型SpCas9体外在靶和脱靶效应的检测。
图9为高PAM兼容性xCas9体外在靶和脱靶效应的检测。
图10为高PAM兼容性截短型txCas9体外在靶和脱靶效应的检测。
图11为野生型SpCas9的体外PAM兼容性检测。
图12为高PAM兼容性xCas9的体外PAM兼容性检测。
图13为高PAM兼容性截短型txCas9的体外PAM兼容性检测。
具体实施方式
下面通过具体实施例进一步描述本发明。
下述实施例中所用的实验方法,如无特定说明,均为常规方法。
下述实施例中所用的材料、试剂等,如无特定说明,均为从商业途径获得。
一、CRISPR-Cas9核酸酶
本发明的txCas9核酸酶,是将高PAM兼容性xCas9第180位到299位氨基酸截掉所得,属于CRISPR-Cas9系统,该系统不仅具有与野生型CRISPR-Cas9核酸酶相当的基因编辑活性,而且具有更低的脱靶效率、更高的PAM兼容性、更小的体积,因此能实现更大范围的体内靶向编辑。
二、编码CRISPR-Cas9核酸酶的多核苷酸序列
转录和翻译所述txCas9核酸酶的多核苷酸序列,包括DNA或RNA。DNA还可以细分为质粒DNA、基因组DNA或人工合成的DNA;
编码所述txCas9的多核苷酸序列,可以利用该领域科研或技术人员所熟悉的相关分子生物学技术来制备,其不局限于重组DNA技术和化学合成方法。
三、表达载体
所述表达载体含有编码所述txCas9核酸酶的多核苷酸序列;该表达载体可以通过科研或技术人员所熟悉的分子生物学方法来构建,包括DNA重组技术和DNA合成技术等,主要将txCas9核酸酶的DNA有效连接到载体上的克隆位点中,然后通过转录翻译等过程表达目的蛋白txCas9。
四、宿主细胞
所述宿主细胞是用来转化表达CRISPR-Cas9核酸酶的重组质粒。主要包括原核细胞(如细菌),低等真核细胞(如酵母),高等真核细胞(如哺乳动物细胞)等。常用的宿主细胞如大肠杆菌DH5α、毕赤酵母、HEK293、CHO、Hela细胞等。
五、txCas9核酸酶及其编码该酶的核苷酸序列和所述表达载体的用途
本发明的txCas9核酸酶及其编码该酶的多核苷酸序列和所述的表达载体能够用于基因组DNA片段的编辑或用于制备基因编辑工具。txCas9核酸酶编辑包括单位点和多位点编辑,其编辑手段包括删除、突变、插入、倒位、移位、重复或易位等。
六、基因编辑工具及其方法
本发明的基因编辑工具属于CRISPR-Cas9系统,txCas9核酸酶在特定的sgRNA的引导下可以在目的基因DNA片段PAM(NGN/GAA/GAT)位点上游3到4位碱基间剪切底物DNA。该编辑过程可以在体内或体内进行。当sgRNA是单个的时候可以进行单点编辑,当sgRNA是两个或两个以上时可以进行多位点编辑。
如本发明的一些实施方式中所列举的,txCas9核酸酶在sgRNA的引导下,可以在体外对底物DNA(920bp)进行剪切。
在本发明中,Cas9可作为CRISPR-Cas9核酸酶的简称使用,其含义与CRISPR-Cas9核酸酶相同。本发明中的高PAM兼容性截短型txCas9核酸酶,其是将高PAM兼容性的xCas9第180位到299位氨基酸截掉所得。
在进一步描述本发明具体实施方式之前,应理解,本发明的保护范围并不局限为下述特定的具体实施方案,还应理解为,本发明实施例中的术语是为了描述特定的具体实施方案,而不是为了限制本发明的保护范围。下例实施例中未注明具体条件的试验方法,通常按照常规条件操作,或者按照各生产厂商所建议的条件操作。
在实施例给出的数值范围中,应理解,除非本发明另有说明,每个数值范围的两个端点以及两个端点之间任何一个数值均可选用。除非另外定义,本发明中使用的所有技术和科学术语与本技术领域技术人员通常理解的意义相同。除实施例中使用的具体方法、设备、材料外,根据本技术领域的技术人员对现有技术的掌握及本发明的记载,还可以使用与本发明实施例中所述的方法、设备、材料相似或等同的现有技术的任何方法、设备和材料来实现本发明。
除非另外说明,本发明中所公开的实验方法、检测方法、制备方法均采用本技术领域常规的分子生物学、生物化学、重组DNA技术及相关领域的常规技术。
实施例1,构建txCas9核酸酶的质粒
1.变异体的设计
以xCas9即SEQ ID NO.3为模板,截掉180a.a.~299a.a.,相当于碱基序列SEQ IDNO.5和氨基酸序列SEQ ID NO.6,其余位置的碱基保持不变,把剩下部分重组,将其记为txCas9。其改造设计思路如图1所示,其详细步骤简述如下:
首先,使用限制性内切酶NotI(购于NEB)酶切xCas9,然后,通过AxyPrepTM DNA GelExtraction Kit纯化回收。最后,使用T4ligase(购于Takara)将上一步纯化回收的片段连接,即可获得目的产物txCas9。
(1)质粒抽提试剂盒
所用质粒小提试剂盒TIANprep Mini Plasmid Kit从天根生化科技(北京)有限公司订购。抽提质粒,抽提方法见天根质粒小提试剂盒使用说明。
(2)限制内切酶及T4连接酶
所用限制内切酶NotI和T4连接酶均从NEB公司订购(公司链接)订购。
酶切反应体系:
Figure BDA0002590814610000051
按该反应体系37℃孵育至少2小时,并电泳割胶纯化回收。
连接反应体系:
Figure BDA0002590814610000052
Figure BDA0002590814610000061
该反应体系16℃孵育至少2小时,并转化大肠杆菌DH5α筛选单克隆txCas9,如图2所示,其核苷酸序列为SEQ ID NO.5,培养细菌。
(3)割胶回收试剂盒采购
所用割胶回收试剂盒AxyPrepTM DNA Gel Extraction Kit从Axygen公司(公司链接)订购,割胶回收操作均按其说明书进行。
(4)样品测序
在上海杰李生物技术有限公司通过一代测序对样品质粒进行测序。
实施例2,制备txCas9核酸酶
2.蛋白表达与纯化
2.1蛋白表达
(1)打开超净台,用含75%酒精的棉球擦拭桌面以及各种器具耗材,开紫外灯照射20min,启动风机备用。
(2)移液枪吸取10μl表达txCas9的Rosetta(DE3)(购于TIANGEN)菌液转至6ml含有双抗(Amp与Cm)的LB液体培养基中,37℃,200r/min振荡培养过夜。
(3)将过夜培养的菌液按照体积比为1:100转至500ml含双抗的LB(购于生工)液体培养基中,37℃,200r/min振荡培养。在培养过程中,随时检测菌液的OD值。
(4)当菌液的OD值接近0.4~0.8时,加入蛋白诱导剂IPTG,使其终浓度为0.1mM,然后16℃,200r/min振荡培养20h。
(5)收集菌液,5000r/min离心5min使菌体沉淀,弃上清,并称重txCas9菌体。
2.2蛋白纯化
所述蛋白纯化主要通过镍柱亲和层析技术,如图4所示;其纯化过程包括菌体破碎、蛋白样品离心收集、蛋白样品与镍柱介质共孵育以及目的蛋白的洗脱等,如图5所示。其详细步骤如下:
(1)向菌体中加入预先冰浴且PMSF终浓度为0.1mM的裂解液(20mM HEPES,500mMKCl,pH7.5;1g菌体加入5ml),涡旋仪重悬使菌块分散混匀,细胞超声破碎仪破碎细胞,超声3sec停3sec,一次10min,超声两次,超声过程均在冰浴中进行。
(2)向破碎的菌液中加入终浓度为10μg/ml RNase(生工),5μg/ml DNase I(生工),冰浴处理30min后,4℃10000r/min离心45~60min,收集上清。
(3)将上清与预先用平衡液(20mM HEPES,500mM KCl,1%蔗糖,pH7.5)处理的Qiagen Ni-NTA介质孵育,此过程在冰浴上进行,并加以振荡(150r/min),1.5h后静置,待Qiagen Ni-NTA沉淀。
(4)将Qiagen Ni-NTA装载到重力柱中,BioLogic LP系统的监测下,分别以流速为2ml/min的平衡液和洗脱液(20mM HEPES,500mM KCl,500mM咪唑,1%蔗糖,pH7.5),20、30、40、50、100、250、500Mm洗脱液冲洗Qiagen Ni-NTA,并收集蛋白。
(5)将不同咪唑浓度下的蛋白溶液跑SDS-PAGE(购于EpiZyme Scientific)电泳,考马斯亮蓝染色,脱色剂脱色,观察目的蛋白的表达和挂柱效果。
所述蛋白SpCas9、xCas9、txCas9的纯化结果如图5所示,其显示该目的蛋白txCas9的纯化情况。如图所示,从条带前沿来看,txCas9比SpCas9、xCas9分子量更小。同时,txCas9杂条带较少,蛋白较为纯净。
实施例3,检验txCas9核酸酶剪切活性
3.变异体活性检测
所用底物DNA(SEQ ID NO.7),主要利用引物QG-F和QG-R通过常规PCR扩增,然后割胶回收获取。
(1)采购扩增试剂盒
所用扩增试剂盒Fast HiFidelity PCR Kit从天根生化科技(北京)有限公司订购。
(2)采购引物
所用引物均从上海生工生物工程有限公司订购。它们的序列如下:
QG-F:TAGTCCTGTCGGGTTTCG(SEQ ID NO.8)
QG-R:TTCCATTCGCCATTCAGG(SEQ ID NO.9)
其反应体系和扩增条件如下:
扩增体系如下:
Figure BDA0002590814610000071
Figure BDA0002590814610000081
PCR反应条件:
Figure BDA0002590814610000082
(3)采购割胶回收试剂盒
所用割胶回收试剂盒AxyPrepTM DNA Gel Extraction Kit从Axygen公司订购,割胶回收操作均按其说明书进行,可以获得较纯的底物DNA(SEQ ID NO.7)。
Cas9与sgRNA以等摩尔混合,而根据实验需要,底物DNA可调为Cas9摩尔质量的0.2~1倍。其反应体系如下:
Figure BDA0002590814610000083
将反应体系置37℃孵育,1h后70℃加热10min,最后通过琼脂糖凝胶电泳检测目的蛋白txCas9的体外切割活性。其结果如图7所示,从图7中可以得知,与野生型SpCas9和高PAM兼容性xCas9相比,txCas9在体外能够剪切底物DNA,生成产物1和产物2,且活性与前者相当。该结果表明,xCas9截短后的突变体txCas9仍保留与SpCas9和xCas9相当的剪切活性。
实施例4,txCas9核酸酶体外脱靶检测的评价方法
4.脱靶效应检测
利用不同位置错配的sgRNA,如图6中1到8号所示,检测txCas9的体外脱切割活性,从而评价txCas9-的脱靶效应,其反应体系如下:
Figure BDA0002590814610000091
首先,评价野生型SpCas9在体外的脱靶效应,如图8所示,与0号sgRNA引导的SpCas9剪切活性相比,1到8号sgRNA引导的SpCas9均能够在体外剪切底物DNA,生成产物1和产物2。尽管只有部分sgRNA的引导活性强,即1到4号sgRNA引导的SpCas9体外剪切活性比较强,5到8号sgRNA引导的SpCas9体外剪切活性相对较弱。但是,该结果依然表明野生型SpCas9在体外的脱靶效应比较严重。由此说明,野生型SpCas9有较强的脱靶效应,尤其在1到4号sgRNA的引导下更为显著。
其次,评价高PAM兼容性xCas9在体外的脱靶效应,如图9所示,与0号sgRNA引导的xCas9剪切活性相比,1到5号sgRNA引导的xCas9仍能够在体外剪切底物DNA,生成产物1和产物2,且剪切活性与野生型SpCas9相当。但6到8号sgRNA引导的xCas9对底物DNA的剪切活性较弱,由此说明高PAM兼容性xCas9在一定程度上降低了野生型SpCas9的脱靶效应,但脱靶效应仍较为明显。
最后,评价高PAM兼容性截短型txCas9在体外的脱靶效应,如图10所示,与0号sgRNA引导的TxSpCas9剪切活性相比1到5号sgRNA引导的截短型txCas9体外剪切底物DNA的能力明显降低。6到8号sgRNA引导的txCas9对底物DNA几乎没有剪切活性。与xCas9的体外的脱靶效应相比,1和3号sgRNA引导的截短型txCas9的体外剪切活性进一步降低,由此说明txCas9在xCas9的基础上进一步降低了脱靶效应。
在sgRNA与底物DNA完全互补的情况下,txCas9保留了野生型SpCas9核酸酶的切割活性;同时,在sgRNA与底物DNA存在两个碱基错配的情况下,与野生型SpCas9和高PAM兼容性xCas9相比较,txCas9对底物DNA的容错率更低,体外剪切特异性更高。即txCas9不仅具有野生型CRISPR-Cas9核酸酶基因编辑功能,而且与野生型核酸酶相比,txCas9特异性更高,易于实现精准编辑。
5.体外PAM兼容性检测
以含有AGG PAM的质粒(SEQ ID NO.7)为模板,SEQ ID NO.11-SEQ ID NO.22为引物进行六次点突变,最终得到含有TGG、TGA、TGC、TGT、GAA、GAT六种PAM的质粒。
利用引物QG-F和QG-R通过常规PCR扩增,然后割胶回收获取920bp的底物DNA片段。
(1)采购点突变试剂盒
所用点突变试剂盒Fast Site-Directed Mutagenesis Kit从天根生化科技(北京)有限公司订购。
(2)采购引物
所用引物均从上海生工生物工程有限公司订购。它们的序列如下:
Figure BDA0002590814610000101
点突变体系如下:
Figure BDA0002590814610000102
PCR反应条件:
Figure BDA0002590814610000111
质粒模板消化:
Figure BDA0002590814610000112
该反应体系在37℃孵育1小时。
(4)样品测序
在上海杰李生物技术有限公司通过一代测序对样品质粒进行测序。
反应体系和反应条件见实施例3。
首先,评价野生型SpCas9的体外PAM兼容性,如图11所示,野生型SpCas9高效切割含有TGG、TGAPAM的底物DNA,对含有TGC PAM的底物DNA的剪切活性较弱,对含有TGT、GAA、GAT PAM的底物DNA几乎没有剪切活性。该结果表明野生型SpCas9在体外优先识别TGG、TGAPAM的底物DNA,因此限制了野生型SpCas9的靶向范围。
其次,评价高PAM兼容性xCas9的体外PAM兼容性,如图12所示,xCas9高效切割含有TGG、TGAPAM的底物DNA,对含有TGC、TGT、GAA PAM的底物DNA的剪切活性较弱,对含有GATPAM的底物DNA有微弱的剪切活性。该结果表明xCas9在野生型SpCas9的基础上进一步提高了PAM兼容性,扩宽了靶向范围。
最后,评价高PAM兼容性截短型txCas9的体外PAM兼容性,如图13所示,txCas9高效切割含有TGG、TGAPAM的底物DNA,对含有TGC、TGT、GAA PAM的底物DNA的剪切活性较弱,对含有GAT PAM的底物DNA有微弱的剪切活性。该结果表明txCas9在xCas9的基础上截短后仍保留高PAM兼容性。
综上所述,高PAM兼容性截短型txCas9在保留与xCas9相当的剪切活性和高PAM兼容性的同时,脱靶效率更低,体积更小,更易于使用腺病毒AAV进行体内运输,有望成为生物医学领域中具有潜在应用价值的基因编辑工具。
参考文献
1.P.D.Hsu,E.S.Lander,F.Zhang,Development and Applications of CRISPR-Cas9 for Genome Engineering.Cell157,1262-1278(2014).
2.C.Lier et al.,Analysis of the type II-A CRISPR-Cas system ofStreptococcus agalactiae reveals distinctive features according to geneticlineages.Front.Genet.6,214(2015).
3.O.Shalem et al.,Genome-scale CRISPR-Cas9 knockout screening inhuman cells.Science343,84-87(2014).
4.J.D.Sander,J.K.Joung,CRISPR-Cas systems for editing,regulating andtargeting genomes.Nat.Biotechnol.32,347-355(2014).
5.L.S.Qi et al.,Repurposing CRISPR as an RNA-Guided Platform forSequence-Specific Control of Gene Expression.Cell152,1173-1183(2013).
6.L.Yang et al.,Engineering and optimising deaminase fusions forgenome editing.Nat.Commun.7,13330(2016).
7.L.Swiech et al.,In vivo interrogation of gene function in themammalian brain using CRISPR-Cas9.Nat.Biotechnol.33,102-106(2015).
8.V.Pattanayak et al.,High-throughput profiling of off-target DNAcleavage reveals RNA-programmed Cas9 nuclease specificity.Nat.Biotechnol.31,839-843(2013).
9.S.H.Sternberg,S.Redding,M.Jinek,E.C.Greene,J.A.Doudna,DNAinterrogation by the CRISPR RNA-guided endonuclease Cas9.Nature507,62-67(2014).
10.J.H.Hu et al.,Evolved Cas9 variants with broad PAM compatibilityand high DNA specificity.Nature556,57-63(2018).。
SEQUENCE LISTING
<110> 复旦大学
<120> 核酸内切酶SpCas9的高PAM兼容性截短型变异体txCas9及其应用
<130> 001
<160> 22
<170> PatentIn version 3.3
<210> 1
<211> 4104
<212> DNA
<213> artifical sequence
<400> 1
ggcgacaaga agtactccat tgggctcgat atcggcacaa acagcgtcgg ctgggccgtc 60
attacggacg agtacaaggt gccgagcaaa aaattcaaag ttctgggcaa taccgatcgc 120
cacagcataa agaagaacct cattggcgcc ctcctgttcg actccgggga gacggccgaa 180
gccacgcggc tcaaaagaac agcacggcgc agatataccc gcagaaagaa tcggatctgc 240
tacctgcagg agatctttag taatgagatg gctaaggtgg atgactcttt cttccatagg 300
ctggaggagt cctttttggt ggaggaggat aaaaagcacg agcgccaccc aatctttggc 360
aatatcgtgg acgaggtggc gtaccatgaa aagtacccaa ccatatatca tctgaggaag 420
aagcttgtag acagtactga taaggctgac ttgcggttga tctatctcgc gctggcgcat 480
atgatcaaat ttcggggaca cttcctcatc gagggggacc tgaacccaga caacagcgat 540
gtcgacaaac tctttatcca actggttcag acttacaatc agcttttcga agagaacccg 600
atcaacgcat ccggagttga cgccaaagca atcctgagcg ctaggctgtc caaatcccgg 660
cggctcgaaa acctcatcgc acagctccct ggggagaaga agaacggcct gtttggtaat 720
cttatcgccc tgtcactcgg gctgaccccc aactttaaat ctaacttcga cctggccgaa 780
gatgccaagc ttcaactgag caaagacacc tacgatgatg atctcgacaa tctgctggcc 840
cagatcggcg accagtacgc agaccttttt ttggcggcaa agaacctgtc agacgccatt 900
ctgctgagtg atattctgcg agtgaacacg gagatcacca aagctccgct gagcgctagt 960
atgatcaagc gctatgatga gcaccaccaa gacttgactt tgctgaaggc ccttgtcaga 1020
cagcaactgc ctgagaagta caaggaaatt ttcttcgatc agtctaaaaa tggctacgcc 1080
ggatacattg acggcggagc aagccaggag gaattttaca aatttattaa gcccatcttg 1140
gaaaaaatgg acggcaccga ggagctgctg gtaaagctta acagagaaga tctgttgcgc 1200
aaacagcgca ctttcgacaa tggaagcatc ccccaccaga ttcacctggg cgaactgcac 1260
gctatcctca ggcggcaaga ggatttctac ccctttttga aagataacag ggaaaagatt 1320
gagaaaatcc tcacatttcg gataccctac tatgtaggcc ccctcgcccg gggaaattcc 1380
agattcgcgt ggatgactcg caaatcagaa gagaccatca ctccctggaa cttcgaggaa 1440
gtcgtggata agggggcctc tgcccagtcc ttcatcgaaa ggatgactaa ctttgataaa 1500
aatctgccta acgaaaaggt gcttcctaaa cactctctgc tgtacgagta cttcacagtt 1560
tataacgagc tcaccaaggt caaatacgtc acagaaggga tgagaaagcc agcattcctg 1620
tctggagagc agaagaaagc tatcgtggac ctcctcttca agacgaaccg gaaagttacc 1680
gtgaaacagc tcaaagaaga ctatttcaaa aagattgaat gtttcgactc tgttgaaatc 1740
agcggagtgg aggatcgctt caacgcatcc ctgggaacgt atcacgatct cctgaaaatc 1800
attaaagaca aggacttcct ggacaatgag gagaacgagg acattcttga ggacattgtc 1860
ctcaccctta cgttgtttga agatagggag atgattgaag aacgcttgaa aacttacgct 1920
catctcttcg acgacaaagt catgaaacag ctcaagaggc gccgatatac aggatggggg 1980
cggctgtcaa gaaaactgat caatgggatc cgagacaagc agagtggaaa gacaatcctg 2040
gattttctta agtccgatgg atttgccaac cggaacttca tgcagttgat ccatgatgac 2100
tctctcacct ttaaggagga catccagaaa gcacaagttt ctggccaggg ggacagtctt 2160
cacgagcaca tcgctaatct tgcaggtagc ccagctatca aaaagggaat actgcagacc 2220
gttaaggtcg tggatgaact cgtcaaagta atgggaaggc ataagcccga gaatatcgtt 2280
atcgagatgg cccgagagaa ccaaactacc cagaagggac agaagaacag tagggaaagg 2340
atgaagagga ttgaagaggg tataaaagaa ctggggtccc aaatccttaa ggaacaccca 2400
gttgaaaaca cccagcttca gaatgagaag ctctacctgt actacctgca gaacggcagg 2460
gacatgtacg tggatcagga actggacatc aatcggctct ccgactacga cgtggatcat 2520
atcgtgcccc agtcttttct caaagatgat tctattgata ataaagtgtt gacaagatcc 2580
gataaaaata gagggaagag tgataacgtc ccctcagaag aagttgtcaa gaaaatgaaa 2640
aattattggc ggcagctgct gaacgccaaa ctgatcacac aacggaagtt cgataatctg 2700
actaaggctg aacgaggtgg cctgtctgag ttggataaag caggcttcat caaaaggcag 2760
cttgttgaga cacgccagat caccaagcac gtggcccaaa ttctcgattc acgcatgaac 2820
accaagtacg atgaaaatga caaactgatt cgagaggtga aagttattac tctgaagtct 2880
aagctggtct cagatttcag aaaggacttt cagttttata aggtgagaga gatcaacaat 2940
taccaccatg cgcatgatgc ctacctgaat gcagtggtag gcactgcact tatcaaaaaa 3000
tatcccaagc ttgaatctga atttgtttac ggagactata aagtgtacga tgttaggaaa 3060
atgatcgcaa agtctgagca ggaaataggc aaggccaccg ctaagtactt cttttacagc 3120
aatattatga attttttcaa gaccgagatt acactggcca atggagagat tcggaagcga 3180
ccacttatcg aaacaaacgg agaaacagga gaaatcgtgt gggacaaggg tagggatttc 3240
gcgacagtcc ggaaggtcct gtccatgccg caggtgaaca tcgttaaaaa gaccgaagta 3300
cagaccggag gcttctccaa ggaaagtatc ctcccgaaaa ggaacagcga caagctgatc 3360
gcacgcaaaa aagattggga ccccaagaaa tacggcggat tcgattctcc tacagtcgct 3420
tacagtgtac tggttgtggc caaagtggag aaagggaagt ctaaaaaact caaaagcgtc 3480
aaggaactgc tgggcatcac aatcatggag cgatcaagct tcgaaaaaaa ccccatcgac 3540
tttctcgagg cgaaaggata taaagaggtc aaaaaagacc tcatcattaa gcttcccaag 3600
tactctctct ttgagcttga aaacggccgg aaacgaatgc tcgctagtgc gggcgagctg 3660
cagaaaggta acgagctggc actgccctct aaatacgtta atttcttgta tctggccagc 3720
cactatgaaa agctcaaagg gtctcccgaa gataatgagc agaagcagct gttcgtggaa 3780
caacacaaac actaccttga tgagatcatc gagcaaataa gcgaattctc caaaagagtg 3840
atcctcgccg acgctaacct cgataaggtg ctttctgctt acaataagca cagggataag 3900
cccatcaggg agcaggcaga aaacattatc cacttgttta ctctgaccaa cttgggcgcg 3960
cctgcagcct tcaagtactt cgacaccacc atagacagaa agcggtacac ctctacaaag 4020
gaggtcctgg acgccacact gattcatcag tcaattacgg ggctctatga aacaagaatc 4080
gacctctctc agctcggtgg agac 4104
<210> 2
<211> 1368
<212> PRT
<213> artifical sequence
<400> 2
Gly Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro Lys
1100 1105 1110
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Gly
1205 1210 1215
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala
1310 1315 1320
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr Ser
1325 1330 1335
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
<210> 3
<211> 4104
<212> DNA
<213> artifical sequence
<400> 3
ggcgacaaga agtactccat tgggctcgat atcggcacaa acagcgtcgg ctgggccgtc 60
attacggacg agtacaaggt gccgagcaaa aaattcaaag ttctgggcaa taccgatcgc 120
cacagcataa agaagaacct cattggcgcc ctcctgttcg actccgggga gacggccgaa 180
gccacgcggc tcaaaagaac agcacggcgc agatataccc gcagaaagaa tcggatctgc 240
tacctgcagg agatctttag taatgagatg gctaaggtgg atgactcttt cttccatagg 300
ctggaggagt cctttttggt ggaggaggat aaaaagcacg agcgccaccc aatctttggc 360
aatatcgtgg acgaggtggc gtaccatgaa aagtacccaa ccatatatca tctgaggaag 420
aagcttgtag acagtactga taaggctgac ttgcggttga tctatctcgc gctggcgcat 480
atgatcaaat ttcggggaca cttcctcatc gagggggacc tgaacccaga caacagcgat 540
gtcgacaaac tctttatcca actggttcag acttacaatc agcttttcga agagaacccg 600
atcaacgcat ccggagttga cgccaaagca atcctgagcg ctaggctgtc caaatcccgg 660
cggctcgaaa acctcatcgc acagctccct ggggagaaga agaacggcct gtttggtaat 720
cttatcgccc tgtcactcgg gctgaccccc aactttaaat ctaacttcga cctggccgaa 780
gataccaagc ttcaactgag caaagacacc tacgatgatg atctcgacaa tctgctggcc 840
cagatcggcg accagtacgc agaccttttt ttggcggcaa agaacctgtc agacgccatt 900
ctgctgagtg atattctgcg agtgaacacg gagatcacca aagctccgct gagcgctagt 960
atgatcaagc tgtatgatga gcaccaccaa gacttgactt tgctgaaggc ccttgtcaga 1020
cagcaactgc ctgagaagta caaggaaatt ttcttcgatc agtctaaaaa tggctacgcc 1080
ggatacattg acggcggagc aagccaggag gaattttaca aatttattaa gcccatcttg 1140
gaaaaaatgg acggcaccga ggagctgctg gtaaagctta acagagaaga tctgttgcgc 1200
aaacagcgca ctttcgacaa tggaattatc ccccaccaga ttcacctggg cgaactgcac 1260
gctatcctca ggcggcaaga ggatttctac ccctttttga aagataacag ggaaaagatt 1320
gagaaaatcc tcacatttcg gataccctac tatgtaggcc ccctcgcccg gggaaattcc 1380
agattcgcgt ggatgactcg caaatcagaa gagaccatca ctccctggaa cttcgagaaa 1440
gtcgtggata agggggcctc tgcccagtcc ttcatcgaaa ggatgactaa ctttgataaa 1500
aatctgccta acgaaaaggt gcttcctaaa cactctctgc tgtacgagta cttcacagtt 1560
tataacgagc tcaccaaggt caaatacgtc acagaaggga tgagaaagcc agcattcctg 1620
tctggagatc agaagaaagc tatcgtggac ctcctcttca agacgaaccg gaaagttacc 1680
gtgaaacagc tcaaagaaga ctatttcaaa aagattgaat gtttcgactc tgttgaaatc 1740
agcggagtgg aggatcgctt caacgcatcc ctgggaacgt atcacgatct cctgaaaatc 1800
attaaagaca aggacttcct ggacaatgag gagaacgagg acattcttga ggacattgtc 1860
ctcaccctta cgttgtttga agatagggag atgattgaag aacgcttgaa aacttacgct 1920
catctcttcg acgacaaagt catgaaacag ctcaagaggc gccgatatac aggatggggg 1980
cggctgtcaa gaaaactgat caatgggatc cgagacaagc agagtggaaa gacaatcctg 2040
gattttctta agtccgatgg atttgccaac cggaacttca ttcagttgat ccatgatgac 2100
tctctcacct ttaaggagga catccagaaa gcacaagttt ctggccaggg ggacagtctt 2160
cacgagcaca tcgctaatct tgcaggtagc ccagctatca aaaagggaat actgcagacc 2220
gttaaggtcg tggatgaact cgtcaaagta atgggaaggc ataagcccga gaatatcgtt 2280
atcgagatgg cccgagagaa ccaaactacc cagaagggac agaagaacag tagggaaagg 2340
atgaagagga ttgaagaggg tataaaagaa ctggggtccc aaatccttaa ggaacaccca 2400
gttgaaaaca cccagcttca gaatgagaag ctctacctgt actacctgca gaacggcagg 2460
gacatgtacg tggatcagga actggacatc aatcggctct ccgactacga cgtggatcat 2520
atcgtgcccc agtcttttct caaagatgat tctattgata ataaagtgtt gacaagatcc 2580
gataaaaata gagggaagag tgataacgtc ccctcagaag aagttgtcaa gaaaatgaaa 2640
aattattggc ggcagctgct gaacgccaaa ctgatcacac aacggaagtt cgataatctg 2700
actaaggctg aacgaggtgg cctgtctgag ttggataaag caggcttcat caaaaggcag 2760
cttgttgaga cacgccagat caccaagcac gtggcccaaa ttctcgattc acgcatgaac 2820
accaagtacg atgaaaatga caaactgatt cgagaggtga aagttattac tctgaagtct 2880
aagctggtct cagatttcag aaaggacttt cagttttata aggtgagaga gatcaacaat 2940
taccaccatg cgcatgatgc ctacctgaat gcagtggtag gcactgcact tatcaaaaaa 3000
tatcccaagc ttgaatctga atttgtttac ggagactata aagtgtacga tgttaggaaa 3060
atgatcgcaa agtctgagca ggaaataggc aaggccaccg ctaagtactt cttttacagc 3120
aatattatga attttttcaa gaccgagatt acactggcca atggagagat tcggaagcga 3180
ccacttatcg aaacaaacgg agaaacagga gaaatcgtgt gggacaaggg tagggatttc 3240
gcgacagtcc ggaaggtcct gtccatgccg caggtgaaca tcgttaaaaa gaccgaagta 3300
cagaccggag gcttctccaa ggaaagtatc ctcccgaaaa ggaacagcga caagctgatc 3360
gcacgcaaaa aagattggga ccccaagaaa tacggcggat tcgattctcc tacagtcgct 3420
tacagtgtac tggttgtggc caaagtggag aaagggaagt ctaaaaaact caaaagcgtc 3480
aaggaactgc tgggcatcac aatcatggag cgatcaagct tcgaaaaaaa ccccatcgac 3540
tttctcgagg cgaaaggata taaagaggtc aaaaaagacc tcatcattaa gcttcccaag 3600
tactctctct ttgagcttga aaacggccgg aaacgaatgc tcgctagtgc gggcgttctg 3660
cagaaaggta acgagctggc actgccctct aaatacgtta atttcttgta tctggccagc 3720
cactatgaaa agctcaaagg gtctcccgaa gataatgagc agaagcagct gttcgtggaa 3780
caacacaaac actaccttga tgagatcatc gagcaaataa gcgaattctc caaaagagtg 3840
atcctcgccg acgctaacct cgataaggtg ctttctgctt acaataagca cagggataag 3900
cccatcaggg agcaggcaga aaacattatc cacttgttta ctctgaccaa cttgggcgcg 3960
cctgcagcct tcaagtactt cgacaccacc atagacagaa agcggtacac ctctacaaag 4020
gaggtcctgg acgccacact gattcatcag tcaattacgg ggctctatga aacaagaatc 4080
gacctctctc agctcggtgg agac 4104
<210> 4
<211> 1368
<212> PRT
<213> artifical sequence
<400> 4
Gly Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
Asp Leu Ala Glu Asp Thr Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
Met Ile Lys Leu Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
Lys Gln Arg Thr Phe Asp Asn Gly Arg Ile Pro His Gln Ile His Leu
405 410 415
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Lys
465 470 475 480
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Asp Gln
530 535 540
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
Ala Asn Arg Asn Phe Ile Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro Lys
1100 1105 1110
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Gly
1205 1210 1215
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala
1310 1315 1320
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr Ser
1325 1330 1335
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
<210> 5
<211> 3744
<212> DNA
<213> artifical sequence
<400> 5
ggcgacaaga agtactccat tgggctcgat atcggcacaa acagcgtcgg ctgggccgtc 60
attacggacg agtacaaggt gccgagcaaa aaattcaaag ttctgggcaa taccgatcgc 120
cacagcataa agaagaacct cattggcgcc ctcctgttcg actccgggga gacggccgaa 180
gccacgcggc tcaaaagaac agcacggcgc agatataccc gcagaaagaa tcggatctgc 240
tacctgcagg agatctttag taatgagatg gctaaggtgg atgactcttt cttccatagg 300
ctggaggagt cctttttggt ggaggaggat aaaaagcacg agcgccaccc aatctttggc 360
aatatcgtgg acgaggtggc gtaccatgaa aagtacccaa ccatatatca tctgaggaag 420
aagcttgtag acagtactga taaggctgac ttgcggttga tctatctcgc gctggcgcat 480
atgatcaaat ttcggggaca cttcctcatc gagggggacc tgaacccaga caacagcatt 540
ctgctgagtg atattctgcg agtgaacacg gagatcacca aagctccgct gagcgctagt 600
atgatcaagc tgtatgatga gcaccaccaa gacttgactt tgctgaaggc ccttgtcaga 660
cagcaactgc ctgagaagta caaggaaatt ttcttcgatc agtctaaaaa tggctacgcc 720
ggatacattg acggcggagc aagccaggag gaattttaca aatttattaa gcccatcttg 780
gaaaaaatgg acggcaccga ggagctgctg gtaaagctta acagagaaga tctgttgcgc 840
aaacagcgca ctttcgacaa tggaattatc ccccaccaga ttcacctggg cgaactgcac 900
gctatcctca ggcggcaaga ggatttctac ccctttttga aagataacag ggaaaagatt 960
gagaaaatcc tcacatttcg gataccctac tatgtaggcc ccctcgcccg gggaaattcc 1020
agattcgcgt ggatgactcg caaatcagaa gagaccatca ctccctggaa cttcgagaaa 1080
gtcgtggata agggggcctc tgcccagtcc ttcatcgaaa ggatgactaa ctttgataaa 1140
aatctgccta acgaaaaggt gcttcctaaa cactctctgc tgtacgagta cttcacagtt 1200
tataacgagc tcaccaaggt caaatacgtc acagaaggga tgagaaagcc agcattcctg 1260
tctggagatc agaagaaagc tatcgtggac ctcctcttca agacgaaccg gaaagttacc 1320
gtgaaacagc tcaaagaaga ctatttcaaa aagattgaat gtttcgactc tgttgaaatc 1380
agcggagtgg aggatcgctt caacgcatcc ctgggaacgt atcacgatct cctgaaaatc 1440
attaaagaca aggacttcct ggacaatgag gagaacgagg acattcttga ggacattgtc 1500
ctcaccctta cgttgtttga agatagggag atgattgaag aacgcttgaa aacttacgct 1560
catctcttcg acgacaaagt catgaaacag ctcaagaggc gccgatatac aggatggggg 1620
cggctgtcaa gaaaactgat caatgggatc cgagacaagc agagtggaaa gacaatcctg 1680
gattttctta agtccgatgg atttgccaac cggaacttca ttcagttgat ccatgatgac 1740
tctctcacct ttaaggagga catccagaaa gcacaagttt ctggccaggg ggacagtctt 1800
cacgagcaca tcgctaatct tgcaggtagc ccagctatca aaaagggaat actgcagacc 1860
gttaaggtcg tggatgaact cgtcaaagta atgggaaggc ataagcccga gaatatcgtt 1920
atcgagatgg cccgagagaa ccaaactacc cagaagggac agaagaacag tagggaaagg 1980
atgaagagga ttgaagaggg tataaaagaa ctggggtccc aaatccttaa ggaacaccca 2040
gttgaaaaca cccagcttca gaatgagaag ctctacctgt actacctgca gaacggcagg 2100
gacatgtacg tggatcagga actggacatc aatcggctct ccgactacga cgtggatcat 2160
atcgtgcccc agtcttttct caaagatgat tctattgata ataaagtgtt gacaagatcc 2220
gataaaaata gagggaagag tgataacgtc ccctcagaag aagttgtcaa gaaaatgaaa 2280
aattattggc ggcagctgct gaacgccaaa ctgatcacac aacggaagtt cgataatctg 2340
actaaggctg aacgaggtgg cctgtctgag ttggataaag caggcttcat caaaaggcag 2400
cttgttgaga cacgccagat caccaagcac gtggcccaaa ttctcgattc acgcatgaac 2460
accaagtacg atgaaaatga caaactgatt cgagaggtga aagttattac tctgaagtct 2520
aagctggtct cagatttcag aaaggacttt cagttttata aggtgagaga gatcaacaat 2580
taccaccatg cgcatgatgc ctacctgaat gcagtggtag gcactgcact tatcaaaaaa 2640
tatcccaagc ttgaatctga atttgtttac ggagactata aagtgtacga tgttaggaaa 2700
atgatcgcaa agtctgagca ggaaataggc aaggccaccg ctaagtactt cttttacagc 2760
aatattatga attttttcaa gaccgagatt acactggcca atggagagat tcggaagcga 2820
ccacttatcg aaacaaacgg agaaacagga gaaatcgtgt gggacaaggg tagggatttc 2880
gcgacagtcc ggaaggtcct gtccatgccg caggtgaaca tcgttaaaaa gaccgaagta 2940
cagaccggag gcttctccaa ggaaagtatc ctcccgaaaa ggaacagcga caagctgatc 3000
gcacgcaaaa aagattggga ccccaagaaa tacggcggat tcgattctcc tacagtcgct 3060
tacagtgtac tggttgtggc caaagtggag aaagggaagt ctaaaaaact caaaagcgtc 3120
aaggaactgc tgggcatcac aatcatggag cgatcaagct tcgaaaaaaa ccccatcgac 3180
tttctcgagg cgaaaggata taaagaggtc aaaaaagacc tcatcattaa gcttcccaag 3240
tactctctct ttgagcttga aaacggccgg aaacgaatgc tcgctagtgc gggcgttctg 3300
cagaaaggta acgagctggc actgccctct aaatacgtta atttcttgta tctggccagc 3360
cactatgaaa agctcaaagg gtctcccgaa gataatgagc agaagcagct gttcgtggaa 3420
caacacaaac actaccttga tgagatcatc gagcaaataa gcgaattctc caaaagagtg 3480
atcctcgccg acgctaacct cgataaggtg ctttctgctt acaataagca cagggataag 3540
cccatcaggg agcaggcaga aaacattatc cacttgttta ctctgaccaa cttgggcgcg 3600
cctgcagcct tcaagtactt cgacaccacc atagacagaa agcggtacac ctctacaaag 3660
gaggtcctgg acgccacact gattcatcag tcaattacgg ggctctatga aacaagaatc 3720
gacctctctc agctcggtgg agac 3744
<210> 6
<211> 1248
<212> PRT
<213> artifical sequence
<400> 6
Gly Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
Asp Asn Ser Ile Leu Leu Ser Asp Ile Leu Arg Val Asn Thr Glu Ile
180 185 190
Thr Lys Ala Pro Leu Ser Ala Ser Met Ile Lys Leu Tyr Asp Glu His
195 200 205
His Gln Asp Leu Thr Leu Leu Lys Ala Leu Val Arg Gln Gln Leu Pro
210 215 220
Glu Lys Tyr Lys Glu Ile Phe Phe Asp Gln Ser Lys Asn Gly Tyr Ala
225 230 235 240
Gly Tyr Ile Asp Gly Gly Ala Ser Gln Glu Glu Phe Tyr Lys Phe Ile
245 250 255
Lys Pro Ile Leu Glu Lys Met Asp Gly Thr Glu Glu Leu Leu Val Lys
260 265 270
Leu Asn Arg Glu Asp Leu Leu Arg Lys Gln Arg Thr Phe Asp Asn Gly
275 280 285
Ile Ile Pro His Gln Ile His Leu Gly Glu Leu His Ala Ile Leu Arg
290 295 300
Arg Gln Glu Asp Phe Tyr Pro Phe Leu Lys Asp Asn Arg Glu Lys Ile
305 310 315 320
Glu Lys Ile Leu Thr Phe Arg Ile Pro Tyr Tyr Val Gly Pro Leu Ala
325 330 335
Arg Gly Asn Ser Arg Phe Ala Trp Met Thr Arg Lys Ser Glu Glu Thr
340 345 350
Ile Thr Pro Trp Asn Phe Glu Lys Val Val Asp Lys Gly Ala Ser Ala
355 360 365
Gln Ser Phe Ile Glu Arg Met Thr Asn Phe Asp Lys Asn Leu Pro Asn
370 375 380
Glu Lys Val Leu Pro Lys His Ser Leu Leu Tyr Glu Tyr Phe Thr Val
385 390 395 400
Tyr Asn Glu Leu Thr Lys Val Lys Tyr Val Thr Glu Gly Met Arg Lys
405 410 415
Pro Ala Phe Leu Ser Gly Asp Gln Lys Lys Ala Ile Val Asp Leu Leu
420 425 430
Phe Lys Thr Asn Arg Lys Val Thr Val Lys Gln Leu Lys Glu Asp Tyr
435 440 445
Phe Lys Lys Ile Glu Cys Phe Asp Ser Val Glu Ile Ser Gly Val Glu
450 455 460
Asp Arg Phe Asn Ala Ser Leu Gly Thr Tyr His Asp Leu Leu Lys Ile
465 470 475 480
Ile Lys Asp Lys Asp Phe Leu Asp Asn Glu Glu Asn Glu Asp Ile Leu
485 490 495
Glu Asp Ile Val Leu Thr Leu Thr Leu Phe Glu Asp Arg Glu Met Ile
500 505 510
Glu Glu Arg Leu Lys Thr Tyr Ala His Leu Phe Asp Asp Lys Val Met
515 520 525
Lys Gln Leu Lys Arg Arg Arg Tyr Thr Gly Trp Gly Arg Leu Ser Arg
530 535 540
Lys Leu Ile Asn Gly Ile Arg Asp Lys Gln Ser Gly Lys Thr Ile Leu
545 550 555 560
Asp Phe Leu Lys Ser Asp Gly Phe Ala Asn Arg Asn Phe Ile Gln Leu
565 570 575
Ile His Asp Asp Ser Leu Thr Phe Lys Glu Asp Ile Gln Lys Ala Gln
580 585 590
Val Ser Gly Gln Gly Asp Ser Leu His Glu His Ile Ala Asn Leu Ala
595 600 605
Gly Ser Pro Ala Ile Lys Lys Gly Ile Leu Gln Thr Val Lys Val Val
610 615 620
Asp Glu Leu Val Lys Val Met Gly Arg His Lys Pro Glu Asn Ile Val
625 630 635 640
Ile Glu Met Ala Arg Glu Asn Gln Thr Thr Gln Lys Gly Gln Lys Asn
645 650 655
Ser Arg Glu Arg Met Lys Arg Ile Glu Glu Gly Ile Lys Glu Leu Gly
660 665 670
Ser Gln Ile Leu Lys Glu His Pro Val Glu Asn Thr Gln Leu Gln Asn
675 680 685
Glu Lys Leu Tyr Leu Tyr Tyr Leu Gln Asn Gly Arg Asp Met Tyr Val
690 695 700
Asp Gln Glu Leu Asp Ile Asn Arg Leu Ser Asp Tyr Asp Val Asp His
705 710 715 720
Ile Val Pro Gln Ser Phe Leu Lys Asp Asp Ser Ile Asp Asn Lys Val
725 730 735
Leu Thr Arg Ser Asp Lys Asn Arg Gly Lys Ser Asp Asn Val Pro Ser
740 745 750
Glu Glu Val Val Lys Lys Met Lys Asn Tyr Trp Arg Gln Leu Leu Asn
755 760 765
Ala Lys Leu Ile Thr Gln Arg Lys Phe Asp Asn Leu Thr Lys Ala Glu
770 775 780
Arg Gly Gly Leu Ser Glu Leu Asp Lys Ala Gly Phe Ile Lys Arg Gln
785 790 795 800
Leu Val Glu Thr Arg Gln Ile Thr Lys His Val Ala Gln Ile Leu Asp
805 810 815
Ser Arg Met Asn Thr Lys Tyr Asp Glu Asn Asp Lys Leu Ile Arg Glu
820 825 830
Val Lys Val Ile Thr Leu Lys Ser Lys Leu Val Ser Asp Phe Arg Lys
835 840 845
Asp Phe Gln Phe Tyr Lys Val Arg Glu Ile Asn Asn Tyr His His Ala
850 855 860
His Asp Ala Tyr Leu Asn Ala Val Val Gly Thr Ala Leu Ile Lys Lys
865 870 875 880
Tyr Pro Lys Leu Glu Ser Glu Phe Val Tyr Gly Asp Tyr Lys Val Tyr
885 890 895
Asp Val Arg Lys Met Ile Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala
900 905 910
Thr Ala Lys Tyr Phe Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr
915 920 925
Glu Ile Thr Leu Ala Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu
930 935 940
Thr Asn Gly Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe
945 950 955 960
Ala Thr Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys
965 970 975
Lys Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro
980 985 990
Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
995 1000 1005
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1010 1015 1020
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1025 1030 1035
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1040 1045 1050
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1055 1060 1065
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1070 1075 1080
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Gly
1085 1090 1095
Val Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1100 1105 1110
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1115 1120 1125
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1130 1135 1140
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1145 1150 1155
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1160 1165 1170
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1175 1180 1185
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala
1190 1195 1200
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr Ser
1205 1210 1215
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1220 1225 1230
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1235 1240 1245
<210> 7
<211> 920
<212> DNA
<213> artifical sequence
<400> 7
tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 60
ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 120
ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 180
taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 240
agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc 300
gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 360
cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 420
ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga 480
ccatgattac gccaagctcg aaattaaccc tcactaaagg gaacaaaagc tggagctcca 540
ccgcggtggc ggccgctcta gaactagtgg atcccccggg ctgcaggaat tcgatatcaa 600
gcttatcgat taccgctcca gtcgttcatg aggttagagc tagaaatagc aagttaaaat 660
aaggctagtc cgttatcaac ttgaaaaagt ggcaccgagt cggtgctctc gagggggggc 720
ccggtaccca attcgcccta tagtgagtcg tattacaatt cactggccgt cgttttacaa 780
cgtcgtgact gggaaaaccc tggcgttacc caacttaatc gccttgcagc acatccccct 840
ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca acagttgcgc 900
agcctgaatg gcgaatggaa 920
<210> 8
<211> 18
<212> DNA
<213> artifical sequence
<400> 8
tagtcctgtc gggtttcg 18
<210> 9
<211> 18
<212> DNA
<213> artifical sequence
<400> 9
ttccattcgc cattcagg 18
<210> 10
<211> 3046
<212> DNA
<213> artifical sequence
<400> 10
gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa 60
atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 120
agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 180
ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 240
gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 300
gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 360
tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 420
acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 480
aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 540
cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 600
gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 660
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 720
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 780
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 840
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 900
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 960
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 1020
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 1080
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 1140
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 1200
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 1260
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 1320
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 1380
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 1440
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 1500
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 1560
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 1620
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1680
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1740
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1800
acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 1860
gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 1920
cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 1980
gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 2040
gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 2100
gtggaattgt gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca 2160
agctcgaaat taaccctcac taaagggaac aaaagctgga gctccaccgc ggtggcggcc 2220
gctctagaac tagtggatcc cccgggctgc aggaattcga tatcaagctt atcgattacc 2280
gctccagtcg ttcatgaggt tagagctaga aatagcaagt taaaataagg ctagtccgtt 2340
atcaacttga aaaagtggca ccgagtcggt gctctcgagg gggggcccgg tacccaattc 2400
gccctatagt gagtcgtatt acaattcact ggccgtcgtt ttacaacgtc gtgactggga 2460
aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg 2520
taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga 2580
atggaaattg taagcgttaa tattttgtta aaattcgcgt taaatttttg ttaaatcagc 2640
tcatttttta accaataggc cgaaatcggc aaaatccctt ataaatcaaa agaatagacc 2700
gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa gaacgtggac 2760
tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg tgaaccatca 2820
ccctaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa ccctaaaggg 2880
agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa ggaagggaag 2940
aaagcgaaag gagcgggcgc tagggcgctg gcaagtgtag cggtcacgct gcgcgtaacc 3000
accacacccg ccgcgcttaa tgcgccgcta cagggcgcgt caggtg 3046
<210> 11
<211> 40
<212> DNA
<213> artifical sequence
<400> 11
accgctccag tcgttcatgt ggttagagct agaaatagca 40
<210> 12
<211> 40
<212> DNA
<213> artifical sequence
<400> 12
tgctatttct agctctaacc acatgaacga ctggagcggt 40
<210> 13
<211> 40
<212> DNA
<213> artifical sequence
<400> 13
accgctccag tcgttcatgt gattagagct agaaatagca 40
<210> 14
<211> 40
<212> DNA
<213> artifical sequence
<400> 14
tgctatttct agctctaatc acatgaacga ctggagcggt 40
<210> 15
<211> 40
<212> DNA
<213> artifical sequence
<400> 15
accgctccag tcgttcatgt gcttagagct agaaatagca 40
<210> 16
<211> 40
<212> DNA
<213> artifical sequence
<400> 16
tgctatttct agctctaagc acatgaacga ctggagcggt 40
<210> 17
<211> 40
<212> DNA
<213> artifical sequence
<400> 17
accgctccag tcgttcatgt gtttagagct agaaatagca 40
<210> 18
<211> 40
<212> DNA
<213> artifical sequence
<400> 18
tgctatttct agctctaaac acatgaacga ctggagcggt 40
<210> 19
<211> 40
<212> DNA
<213> artifical sequence
<400> 19
accgctccag tcgttcatgg aattagagct agaaatagca 40
<210> 20
<211> 40
<212> DNA
<213> artifical sequence
<400> 20
tgctatttct agctctaatt ccatgaacga ctggagcggt 40
<210> 21
<211> 40
<212> DNA
<213> artifical sequence
<400> 21
accgctccag tcgttcatgg atttagagct agaaatagca 40
<210> 22
<211> 40
<212> DNA
<213> artifical sequence
<400> 22
tgctatttct agctctaaat ccatgaacga ctggagcggt 40

Claims (9)

1.一种CRISPR-Cas9核酸酶SpCas9的高PAM兼容性截短型变异体txCas9核酸酶,其特征在于,是将xCas9核酸酶的氨基酸序列截掉120个即第180位到299位的氨基酸后重组所得,记为txCas9核酸酶;编码txCas9核酸酶的核苷酸序列如SEQ ID NO.5所示,其氨基酸序列为SEQ ID NO.6所示。
2.一种表达载体,其含有如SEQ ID NO.5所示的多核苷酸。
3.一种宿主细胞,包含如权利要求2所述的表达载体。
4.一种如权利要求1所述的CRISPR-Cas9核酸酶SpCas9的高PAM兼容性截短型变异体txCas9核酸酶的制备方法,具体步骤包括:首先,构建含有如SEQ ID NO.5所示的多核苷酸的表达载体;然后,将所述表达载体转化得到如权利要求3所述的宿主细胞,筛选并挑出单克隆;最后,将所述单克隆诱导表达,并通过亲和层析或离子交换方法从表达产物中分离出所述的txCas9核酸酶。
5.一种如权利要求1所述的txCas9核酸酶、如权利要求2所述的表达载体作为编辑基因组DNA编辑工具的应用,用于基因组DNA片段的编辑,所述应用为非治疗目的。
6.根据权利要求5所述的应用,其特征在于,所述的编辑是单点编辑,或者是编辑位点大于等于两个的多点编辑;所述编辑的手段包括删除、突变、插入、倒位、移位、重复或易位。
7.根据权利要求6所述的应用,其特性在于,所述的编辑工具,还包括与靶标DNA片段匹配的引导sgRNA;所述的txCas9核酸酶与能够介导它的sgRNA组合,对目的基因进行编辑。
8.根据权利要求7所述的应用,其特性在于,所述的表达载体和与之匹配的引导sgRNA一同转入宿主细胞,对基因进行编辑。
9.根据权利要求7所述的应用,其特性在于,所述单点或多点编辑,是利用txCas9核酸酶对双链DNA进行剪切,并通过宿主细胞的修复系统对断裂的缺口进行修复。
CN202010695299.2A 2020-07-19 2020-07-19 核酸内切酶SpCas9的高PAM兼容性截短型变异体txCas9及其应用 Active CN111909914B (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010695299.2A CN111909914B (zh) 2020-07-19 2020-07-19 核酸内切酶SpCas9的高PAM兼容性截短型变异体txCas9及其应用

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010695299.2A CN111909914B (zh) 2020-07-19 2020-07-19 核酸内切酶SpCas9的高PAM兼容性截短型变异体txCas9及其应用

Publications (2)

Publication Number Publication Date
CN111909914A CN111909914A (zh) 2020-11-10
CN111909914B true CN111909914B (zh) 2022-04-12

Family

ID=73281852

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010695299.2A Active CN111909914B (zh) 2020-07-19 2020-07-19 核酸内切酶SpCas9的高PAM兼容性截短型变异体txCas9及其应用

Country Status (1)

Country Link
CN (1) CN111909914B (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115161302B (zh) * 2021-03-25 2023-08-29 山东大学 一种高特异性Taq DNA聚合酶变体及其获得方法和应用

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110241099B (zh) * 2019-06-05 2021-04-30 复旦大学 酿脓链球菌的CRISPR核酸酶SpCas9 的截短变异体及其应用

Also Published As

Publication number Publication date
CN111909914A (zh) 2020-11-10

Similar Documents

Publication Publication Date Title
CN107586779B (zh) 使用crispr-cas系统对间充质干细胞进行casp3基因敲除的方法
KR101106253B1 (ko) 사이코스 3-에피머라제 효소를 코딩하는 폴리뉴클레오티드를 포함하는 대장균 및 그를 이용하여 사이코스를 생산하는 방법
CN104498493B (zh) CRISPR/Cas9特异性敲除乙型肝炎病毒的方法以及用于特异性靶向HBV DNA的gRNA
CN111235080B (zh) 基因重组大肠杆菌及5-羟色胺的生产方法
CN106867952B (zh) 一株大肠杆菌基因工程菌及利用其生产l-苏氨酸的方法
CN111154707B (zh) 基因工程化大肠杆菌及褪黑素的生产方法
CN110241098B (zh) 酿脓链球菌的CRISPR核酸酶SpCas9的截短型高特异性变异体及其应用
CN107988250B (zh) 一种通用型衣藻外源基因表达载体构建方法
CN111909914B (zh) 核酸内切酶SpCas9的高PAM兼容性截短型变异体txCas9及其应用
CN101466833B (zh) 经修饰的软骨素合酶多肽及其晶体
CN110272881B (zh) 核酸内切酶SpCas9高特异性截短变异体TSpCas9-V1/V2及其应用
CN104278031B (zh) 一种受黄嘌呤调控的启动子a及其重组表达载体和应用
CN110499336B (zh) 一种利用小分子化合物提高基因组定点修饰效率的方法
CN110964725A (zh) 特异性识别猪KIT基因的sgRNA及其编码DNA、试剂盒和应用
CN106479928B (zh) 一株耐高盐耐高cod盐水球菌菌株和来源该菌株的内源质粒
CN111909959B (zh) 一种条件性Yap1基因敲入小鼠的构建方法和应用
CN112553237A (zh) 一种新型mariner转座子系统、应用和构建枯草芽孢杆菌插入突变株文库
CN112662697B (zh) 一种莱茵衣藻tctn1表达质粒及其构建方法和应用
CN109136228A (zh) 长链非编码rna-nkila在骨组织损伤修复中的应用
CN106636023B (zh) 一种增强zwf基因启动子表达强度的方法
CN114369593B (zh) 二氧化硅结合肽介导醇脱氢酶和胺脱氢酶共固定化级联反应制备手性胺的方法
CN113444708B (zh) 一种用于药物皮下注射制剂的透明质酸酶突变体
CN106520818B (zh) 一种快速回补鸭疫里默氏杆菌缺失基因的方法
KR20120010850A (ko) 지방산 생합성 경로의 과발현용 형질전환 대장균 및 그의 제조방법
CN110656120A (zh) 一种乙脑病毒sa14-14-2的克隆方法及应用

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant