CN108690845A - 基因组编辑系统和方法 - Google Patents
基因组编辑系统和方法 Download PDFInfo
- Publication number
- CN108690845A CN108690845A CN201710228595.XA CN201710228595A CN108690845A CN 108690845 A CN108690845 A CN 108690845A CN 201710228595 A CN201710228595 A CN 201710228595A CN 108690845 A CN108690845 A CN 108690845A
- Authority
- CN
- China
- Prior art keywords
- sequence
- cpf1
- disease
- seq
- albumen
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/005—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P25/00—Drugs for disorders of the nervous system
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P25/00—Drugs for disorders of the nervous system
- A61P25/14—Drugs for disorders of the nervous system for treating abnormal movements, e.g. chorea, dyskinesia
- A61P25/16—Anti-Parkinson drugs
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P25/00—Drugs for disorders of the nervous system
- A61P25/18—Antipsychotics, i.e. neuroleptics; Drugs for mania or schizophrenia
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P25/00—Drugs for disorders of the nervous system
- A61P25/28—Drugs for disorders of the nervous system for treating neurodegenerative disorders of the central nervous system, e.g. nootropic agents, cognition enhancers, drugs for treating Alzheimer's disease or other forms of dementia
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P25/00—Drugs for disorders of the nervous system
- A61P25/30—Drugs for disorders of the nervous system for treating abuse or dependence
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P27/00—Drugs for disorders of the senses
- A61P27/02—Ophthalmic agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P29/00—Non-central analgesic, antipyretic or antiinflammatory agents, e.g. antirheumatic agents; Non-steroidal antiinflammatory drugs [NSAID]
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P43/00—Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P9/00—Drugs for disorders of the cardiovascular system
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
- C12N15/907—Stable introduction of foreign DNA into chromosome using homologous recombination in mammalian cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/80—Vectors containing sites for inducing double-stranded breaks, e.g. meganuclease restriction sites
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- Medicinal Chemistry (AREA)
- Animal Behavior & Ethology (AREA)
- Veterinary Medicine (AREA)
- Pharmacology & Pharmacy (AREA)
- Public Health (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Neurosurgery (AREA)
- Neurology (AREA)
- Psychiatry (AREA)
- Mycology (AREA)
- Ophthalmology & Optometry (AREA)
- Heart & Thoracic Surgery (AREA)
- Rheumatology (AREA)
- Pain & Pain Management (AREA)
- Hospice & Palliative Care (AREA)
- Psychology (AREA)
- Cardiology (AREA)
- Addiction (AREA)
- Cell Biology (AREA)
Abstract
本发明涉及基因工程领域。具体而言,本发明涉及新的真核生物基因组编辑系统和方法。更具体而言,本发明涉及能够对真核动物细胞基因组进行高效编辑的CRISPR‑Cpf1系统及其用途。
Description
技术领域
本发明涉及基因工程领域。具体而言,本发明涉及新的真核生物基因 组编辑系统和方法。更具体而言,本发明涉及能够对真核生物细胞基因组 进行高效编辑的CRISPR-Cpf1系统及其用途。
背景技术
CRISPR(Clustered regularly interspaced short palindromic repeats,成簇的 规律间隔的短回文重复序列)系统是在进化过程中产生的细菌用于防御外来 基因入侵的免疫系统。其中,II型CRISPR-Cas9系统是通过两个小RNA (crRNA和tracrRNA)或者一个人工合成小RNA(sgRNA)介导一个Cas9蛋白 进行DNA切割的系统,也是最早发现的三种(I、II、III型)CRISPR系统 中最简单的系统。由于该系统简单易操作,在2013年被改造并成功实现了 真核生物基因组的编辑。CRISPR/Cas9系统迅速成为了生命科学领域最热 门的技术。
2015年,Zhang et al.通过序列比对和系统分析的方法又发现了区别于 CRISPR-Cas9系统之外的新的基因编辑系统,即CRISPR-Cpf1系统。该系 统只需要一个小RNA(crRNA)的介导即可实现基因组的编辑。自然界中 存在着成千上万种的CRISPR系统,但是,能够成功实现真核生物基因组 编辑的CRISPR系统并不多。
本领域仍然需要能够实现高效真核生物基因组编辑的CRISPR-Cpf1系 统。
发明简述
在一方面,本发明提供了一种用于对细胞基因组中的靶序列进行定点 修饰的基因组编辑系统,其包含以下i)至v)中至少一项:
i)Cpf1蛋白,和向导RNA(crRNA);
ii)包含编码Cpf1蛋白的核苷酸序列的表达构建体,和向导RNA;
iii)Cpf1蛋白,和包含编码向导RNA的核苷酸序列的表达构建体;
iv)包含编码Cpf1蛋白的核苷酸序列的表达构建体,和包含编码向导 RNA的核苷酸序列的表达构建体;
v)包含编码Cpf1蛋白的核苷酸序列和编码向导RNA的核苷酸序列的 表达构建体;
其中所述Cpf1蛋白包含SEQ ID No:1-12之一的氨基酸序列或与SEQ ID No:1-12之一具有至少80%序列相同性的氨基酸序列,所述向导RNA能 够将所述Cpf1蛋白靶向所述细胞基因组中的靶序列。
在第二方面,本发明提供了一种对细胞基因组中的靶序列进行定点修 饰的方法,包括将本发明的基因组编辑系统导入所述细胞,由此所述向导 RNA将所述Cpf1蛋白靶向所述细胞基因组中的靶序列,导致所述靶序列中 的一或多个核苷酸的取代、缺失和/或添加。
在第三方面,本发明提供了一种治疗有需要的对象中的疾病的方法, 包括向所述对象递送有效量的本发明的基因组编辑系统以修饰所述对象中 与所述疾病相关的基因。
在第四方面,本发明提供了本发明的基因组编辑系统在制备用于治疗 有需要的对象中的疾病的药物组合物中的用途,其中所述基因组编辑系统 用于修饰所述对象中与所述疾病相关的基因。
在第五方面,本发明提供了一种用于治疗有需要的对象中的疾病的药 物组合物,其包含本发明的基因组编辑系统和药学可接受的载体,其中所 述基因组编辑系统用于修饰所述对象中与所述疾病相关的基因。
附图描述
图1.来自25种新的Cpf1家族蛋白的直接重复序列(DR)的比对。其中 具有DR序列的21种成熟crRNA的茎重复(stem duplex)极度保守。
图2.免疫荧光显示Cpf1蛋白定位于细胞核。
图3.示出Cpf1蛋白对人Dnmt1位点3的编辑能力。A:T7EI测定显 示9种选定的Cpf1蛋白中的8种能够在293T细胞中对人Dnmt1位点3实 现插入/缺失(indel);B:测序显示人Dnmt1位点3中的代表性indel。
图4.示出Cpf1蛋白对人Dnmt1位点1的编辑能力。A:T7EI测定显 示4种Cpf1蛋白明显靶向人Dnmt1位点1。B:测序结果显示人Dnmt1位 点1中的代表性indel,其中6种Cpf1蛋白能够产生indel,尽管PsCpf1和 SaCpf1在酶切测定中无明显条带。
图5.示出Cpf1蛋白对人Dnmt1位点2的编辑能力。A:T7EI测定显 示4种Cpf1蛋白明显靶向人Dnmt1位点2。B.:测序结果显示人Dnmt1位 点2中的代表性indel,其中5种Cpf1能够产生indel,尽管PsCpf1在酶切 测定中无明显条带。
图6.体外测定BsCpf1的PAM序列。A:BsCpf1蛋白的大肠杆菌表达 和纯化。B-E:BsCpf1的靶向采用5’-TTN PAM。
图7.体外测定各Cpf1的PAM序列。“+”表示有体外酶切活性。
图8.示出HkCpf1对人AAVS1基因8个位点的编辑能力。A:T7EI 测定结果。B:测序结果显示HkCpf1能够在7个位点产生突变,且PAM序 列是5’-YYN,与图7结果吻合。
图9-11.示出4种Cpf1蛋白在EpiSC细胞中编辑小鼠基因组。
图12.示出所使用的各种载体图谱。
发明详述
一、定义
在本发明中,除非另有说明,否则本文中使用的科学和技术名词具有 本领域技术人员所通常理解的含义。并且,本文中所用的蛋白质和核酸化 学、分子生物学、细胞和组织培养、微生物学、免疫学相关术语和实验室 操作步骤均为相应领域内广泛使用的术语和常规步骤。例如,本发明中使 用的标准重组DNA和分子克隆技术为本领域技术人员熟知,并且在如下文 献中有更全面的描述:Sambrook,J.,Fritsch,E.F.和Maniatis,T.,MolecularCloning:A Laboratory Manual;Cold Spring Harbor Laboratory Press:Cold SpringHarbor,1989(下文称为“Sambrook”)。同时,为了更好地理解本发 明,下面提供相关术语的定义和解释。
“Cpf1核酸酶”、“Cpf1蛋白”和“Cpf1”在本文中可互换使用,指的 是包括Cpf1蛋白或其片段的RNA指导的核酸酶。Cpf1是CRISPR-Cpf1基 因组编辑系统的组分,能在向导RNA(crRNA)的指导下靶向并切割DNA靶 序列形成DNA双链断裂(DSB)。DSB能够激活细胞内固有的修复机制非同 源末端连接(Non-homologous end joining,NHEJ)和同源重组(Homologous recombination,HR)对细胞中的DNA损伤进行修复,在修复过程中,对该特 定的DNA序列进行定点编辑。
“向导RNA”和“gRNA”在本文中可互换使用。CRISPR-Cpf1基因 组编辑系统的向导RNA通常仅由crRNA分子构成,其中crRNA包含与靶 序列具有足够相同性以便与靶序列的互补序列杂交并且指导CRISPR复合 物(Cpf1+crRNA)与该靶序列序列特异性结合的序列。
“基因组”如本文所用不仅涵盖存在于细胞核中的染色体DNA,而且 还包括存在于细胞的亚细胞组分(如线粒体、质体)中的细胞器DNA。
如本文所用,“生物体”包括适于基因组编辑的任何生物体,优选真核 生物。生物体的实例包括但不限于,哺乳动物如人、小鼠、大鼠、猴、犬、 猪、羊、牛、猫;家禽如鸡、鸭、鹅;植物包括单子叶植物和双子叶植物, 例如水稻、玉米、小麦、高粱、大麦、大豆、花生、拟南芥等。
“经遗传修饰的生物体”或“经遗传修饰的细胞”意指在其基因组内 包含外源多核苷酸或修饰的基因或表达调控序列的生物体或细胞。例如外 源多核苷酸能够稳定地整合进生物体或细胞的基因组中,并遗传连续的世 代。外源多核苷酸可单独地或作为重组DNA构建体的部分整合进基因组 中。修饰的基因或表达调控序列为在生物体或细胞基因组中所述序列包含 单个或多个脱氧核苷酸取代、缺失和添加。
针对序列而言的“外源”意指来自外来物种的序列,或者如果来自相 同物种,则指通过蓄意的人为干预而从其天然形式发生了组成和/或基因座 的显著改变的序列。
“多核苷酸”、“核酸序列”、“核苷酸序列”或“核酸片段”可互换使 用并且是单链或双链RNA或DNA聚合物,任选地可含有合成的、非天然 的或改变的核苷酸碱基。核苷酸通过如下它们的单个字母名称来指代:“A” 为腺苷或脱氧腺苷(分别对应RNA或DNA),“C”表示胞苷或脱氧胞苷,“G” 表示鸟苷或脱氧鸟苷,“U”表示尿苷,“T”表示脱氧胸苷,“R”表示嘌呤(A或G),“Y”表示嘧啶(C或T),“K”表示G或T,“H”表示A或C或 T,“I”表示肌苷,并且“N”表示任何核苷酸。
“多肽”、“肽”、和“蛋白质”在本发明中可互换使用,指氨基酸残基 的聚合物。该术语适用于其中一个或多个氨基酸残基是相应的天然存在的 氨基酸的人工化学类似物的氨基酸聚合物,以及适用于天然存在的氨基酸 聚合物。术语“多肽”、“肽”、“氨基酸序列”和“蛋白质”还可包括修饰 形式,包括但不限于糖基化、脂质连接、硫酸盐化、谷氨酸残基的γ羧化、 羟化和ADP-核糖基化。
序列“相同性”具有本领域公认的含义,并且可以利用公开的技术计算两 个核酸或多肽分子或区域之间序列相同性的百分比。可以沿着多核苷酸或 多肽的全长或者沿着该分子的区域测量序列相同性。(参见,例如: Computational Molecular Biology,Lesk,A.M.,ed.,Oxford University Press, New York,1988;Biocomputing:Informatics andGenome Projects,Smith,D.W., ed.,Academic Press,New York,1993;ComputerAnalysis of Sequence Data, Part I,Griffin,A.M.,and Griffin,H.G.,eds.,HumanaPress,New Jersey,1994; Sequence Analysis in Molecular Biology,von Heinje,G.,Academic Press,1987; and Sequence Analysis Primer,Gribskov,M.and Devereux,J.,eds.,M Stockton Press,New York,1991)。虽然存在许多测量两个多核苷酸或多肽之间的相同 性的方法,但是术语“相同性”是技术人员公知的(Carrillo,H.&Lipman,D., SIAM JApplied Math 48:1073(1988))。
在肽或蛋白中,合适的保守型氨基酸取代是本领域技术人员已知的, 并且一般可以进行而不改变所得分子的生物活性。通常,本领域技术人员 认识到多肽的非必需区中的单个氨基酸取代基本上不改变生物活性(参见, 例如,Watson et al.,Molecular Biologyof the Gene,4th Edition,1987,The Benjamin/Cummings Pub.co.,p.224)。
如本发明所用,“表达构建体”是指适于感兴趣的核苷酸序列在生物体 中表达的载体如重组载体。“表达”指功能产物的产生。例如,核苷酸序列 的表达可指核苷酸序列的转录(如转录生成mRNA或功能RNA)和/或RNA 翻译成前体或成熟蛋白质。
本发明的“表达构建体”可以是线性的核酸片段、环状质粒、病毒载 体,或者,在一些实施方式中,可以是能够翻译的RNA(如mRNA)。
本发明的“表达构建体”可包含不同来源的调控序列和感兴趣的核苷 酸序列,或相同来源但以不同于通常天然存在的方式排列的调控序列和感 兴趣的核苷酸序列。
“调控序列”和“调控元件”可互换使用,指位于编码序列的上游(5' 非编码序列)、中间或下游(3'非编码序列),并且影响相关编码序列的转录、RNA加工或稳定性或者翻译的核苷酸序列。调控序列可包括但不限于启动 子、翻译前导序列、内含子和多腺苷酸化识别序列。
“启动子”指能够控制另一核酸片段转录的核酸片段。在本发明的一 些实施方案中,启动子是能够控制细胞中基因转录的启动子,无论其是否 来源于所述细胞。启动子可以是组成型启动子或组织特异性启动子或发育 调控启动子或诱导型启动子。
“组成型启动子”指一般将引起基因在多数细胞类型中在多数情况下 表达的启动子。“组织特异性启动子”和“组织优选启动子”可互换使用, 并且指主要但非必须专一地在一种组织或器官中表达,而且也可在一种特 定细胞或细胞型中表达的启动子。“发育调控启动子”指其活性由发育事件 决定的启动子。“诱导型启动子”响应内源性或外源性刺激(环境、激素、化 学信号等)而选择性表达可操纵连接的DNA序列。
如本文中所用,术语“可操作地连接”指调控元件(例如但不限于,启 动子序列、转录终止序列等)与核酸序列(例如,编码序列或开放读码框)连 接,使得核苷酸序列的转录被所述转录调控元件控制和调节。用于将调控 元件区域可操作地连接于核酸分子的技术为本领域已知的。
将核酸分子(例如质粒、线性核酸片段、RNA等)或蛋白质“导入”生 物体是指用所述核酸或蛋白质转化生物体细胞,使得所述核酸或蛋白质在 细胞中能够发挥功能。本发明所用的“转化”包括稳定转化和瞬时转化。
“稳定转化”指将外源核苷酸序列导入基因组中,导致外源基因稳定 遗传。一旦稳定转化,外源核酸序列稳定地整合进所述生物体和其任何连 续世代的基因组中。
“瞬时转化”指将核酸分子或蛋白质导入细胞中,执行功能而没有外 源基因稳定遗传。瞬时转化中,外源核酸序列不整合进基因组中。
二、高效基因组编辑系统
在一方面,本发明提供包含与SEQ ID NO:1-12之一具有至少80%、至 少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%、 甚至100%序列相同性的氨基酸序列的Cpf1蛋白在真核细胞基因组编辑中 的用途。
在另一方面,本发明提供了一种用于对细胞基因组中的靶序列进行定 点修饰的基因组编辑系统,其包含以下i)至v)中至少一项:
i)Cpf1蛋白,和向导RNA;
ii)包含编码Cpf1蛋白的核苷酸序列的表达构建体,和向导RNA;
iii)Cpf1蛋白,和包含编码向导RNA的核苷酸序列的表达构建体;
iv)包含编码Cpf1蛋白的核苷酸序列的表达构建体,和包含编码向导 RNA的核苷酸序列的表达构建体;
v)包含编码Cpf1蛋白的核苷酸序列和编码向导RNA的核苷酸序列的 表达构建体;
其中所述Cpf1蛋白包含SEQ ID NO:1-12的氨基酸序列或与SEQ ID NO:1-12之一具有至少80%序列相同性的氨基酸序列,所述向导RNA能够 将所述Cpf1蛋白靶向所述细胞基因组中的靶序列。
在本发明的方法的一些实施方式中,所述向导RNA是crRNA。
在本发明的方法的一些实施方式中,所述细胞是真核生物细胞,优选 哺乳动物细胞。
在本发明的方法的一些实施方式中,所述Cpf1蛋白包含与SEQ ID NO:1-12之一具有至少80%、至少85%、至少90%、至少95%、至少96%、 至少97%、至少98%、至少99%、甚至100%序列相同性的氨基酸序列。
在本发明的方法的一些实施方式中,所述Cpf1蛋白包含相对于SEQ ID NO:1-12之一具有一或多个氨基酸残基取代、缺失或添加的氨基酸序列。例 如,所述Cpf1蛋白包含相对于SEQ ID NO:1-12之一具有1个、2个、3个、 4个、5个、6个、7个、8个、9个、10个氨基酸残基取代、缺失或添加的 氨基酸序列。在一些实施方案中,所述氨基酸取代是保守型取代。
本发明的Cpf1蛋白可以来自选自以下的物种:Agathobacter rectalis、Lachnospira pectinoschiza、Sneathia amnii、Helcococcus kunzii、Arcobacterbutzleri、Bacteroidetes oral、Oribacterium sp.、Butyrivibrio sp.、Proteocatellasphenisci、Candidatus Dojkabacteria、Pseudobutyrivibrio xylanivorans、Pseudobutyrivibrio ruminis。
在本发明的一些优选实施方案中,所述Cpf1蛋白来自Agathobacter rectalis(ArCpf1)、Butyrivibrio sp.(BsCpf1)、Helcococcus kunzii(HkCpf1)或Pseudobutyrivibrio xylanivorans(PxCpf1)。
在本发明的一些实施方案中,本发明的Cpf1蛋白还包含核定位序列(NLS)。一般而言,所述Cpf1蛋白中的一个或多个NLS应具有足够的强度, 以便在细胞核中驱动所述Cpf1蛋白以可实现其基因编辑功能的量积聚。一 般而言,核定位活性的强度由所述Cpf1蛋白中NLS的数目、位置、所使用 的一个或多个特定的NLS、或这些因素的组合决定。
在本发明的一些实施方案中,本发明的Cpf1蛋白的NLS可以位于N 端和/或C端。在一些实施方案中,所述Cpf1蛋白包含约1、2、3、4、5、 6、7、8、9、10个或更多个NLS。在一些实施方案中,所述Cpf1蛋白包含 在或接近于N端的约1、2、3、4、5、6、7、8、9、10个或更多个NLS。在一些实施方案中,所述Cpf1蛋白包含在或接近于C端约1、2、3、4、5、 6、7、8、9、10个或更多个NLS。在一些实施方案中,所述Cpf1蛋白包含 这些的组合,如包含在N端的一个或多个NLS以及在C端的一个或多个 NLS。当存在多于一个NLS时,每一个可以被选择为不依赖于其他NLS。 在本发明的一些实施方式中,所述Cpf1蛋白包含2个NLS,例如所述2个 NLS分别位于N端和C端。
一般而言,NLS由暴露于蛋白表面上的带正电的赖氨酸或精氨酸的一 个或多个短序列组成,但其他类型的NLS也是已知的。NLS的非限制性实 例包括:KKRKV(核苷酸序列5’-AAGAAGAGAAAGGTC-3’)、 PKKKRKV(核苷酸序列5’-CCCAAGAAGAAGAGGAAGGTG-3’或CCAAAGAAGAAGAGGAAGGTT),或SGGSPKKKRKV(核苷酸序列5’-TCGGGGGGGAGCCCAAAGAAGAAGCGGAAGGTG-3’)。
此外,根据所需要编辑的DNA位置,本发明的Cpf1蛋白还可以包括 其他的定位序列,例如细胞质定位序列、叶绿体定位序列、线粒体定位序 列等。
为了在靶细胞中获得有效表达,在本发明的一些实施方式中,所述编 码Cpf1蛋白的核苷酸序列针对待进行基因组编辑的细胞所来自的生物体进 行密码子优化。
密码子优化是指通过用在宿主细胞的基因中更频繁地或者最频繁地使 用的密码子代替天然序列的至少一个密码子(例如约或多于约1、2、3、4、 5、10、15、20、25、50个或更多个密码子同时维持该天然氨基酸序列而修 饰核酸序列以便增强在感兴趣宿主细胞中的表达的方法。不同的物种对于 特定氨基酸的某些密码子展示出特定的偏好。密码子偏好性(在生物之间的 密码子使用的差异)经常与信使RNA(mRNA)的翻译效率相关,而该翻译效率则被认为依赖于被翻译的密码子的性质和特定的转运RNA(tRNA)分子的 可用性。细胞内选定的tRNA的优势一般反映了最频繁用于肽合成的密码 子。因此,可以将基因定制为基于密码子优化在给定生物中的最佳基因表 达。密码子利用率表可以容易地获得,例如在www.kazusa.orjp/codon/上可 获得的密码子使用数据库(“Codon Usage Database”)中,并且这些表可以 通过不同的方式调整适用。参见,Nakamura Y.等,“Codon usagetabulated from the international DNA sequence databases:status for theyear2000. Nucl.Acids Res.,28:292(2000)。
可通过本发明的系统进行基因组编辑的细胞所来自的生物体优选是真 核生物,包括但不限于,哺乳动物如人、小鼠、大鼠、猴、犬、猪、羊、 牛、猫;家禽如鸡、鸭、鹅;植物包括单子叶植物和双子叶植物,例如水 稻、玉米、小麦、高粱、大麦、大豆、花生、拟南芥等。
在本发明的一些具体实施方式中,所述编码Cpf1蛋白的核苷酸序列针 对人进行密码子优化。在一些具体实施方式中,所述密码子优化的编码Cpf1 蛋白的核苷酸序列选自SEQ ID NO:13-24。
在本发明一些实施方式中,所述编码Cpf1蛋白的核苷酸序列和/或所述 编码向导RNA的核苷酸序列与表达调控元件如启动子可操作地连接。
本发明可使用的启动子的实例包括但不限于聚合酶(pol)I、pol II或pol III启动子。pol I启动子的实例包括鸡RNA pol I启动子。pol II启动子的实 例包括但不限于巨细胞病毒立即早期(CMV)启动子、劳斯肉瘤病毒长末端 重复(RSV-LTR)启动子和猿猴病毒40(SV40)立即早期启动子。pol III启动子 的实例包括U6和H1启动子。可以使用诱导型启动子如金属硫蛋白启动子。 启动子的其他实例包括T7噬菌体启动子、T3噬菌体启动子、β-半乳糖苷酶 启动子和Sp6噬菌体启动子。当用于植物时,启动子可以是花椰菜花叶病 毒35S启动子、玉米Ubi-1启动子、小麦U6启动子、水稻U3启动子、玉 米U3启动子、水稻肌动蛋白启动子。
三、修饰细胞基因组中靶序列的方法
在另一方面,本发明提供了一种修饰细胞基因组中靶序列的方法,包 括将本发明的基因组编辑系统导入所述细胞,由此所述向导RNA将所述Cpf1蛋白靶向所述细胞基因组中的靶序列,导致所述靶序列中的一或多个 核苷酸的取代、缺失和/或添加。
在另一方面,本发明还提一种产生经遗传修饰的细胞的方法,包括将 本发明的基因组编辑系统导入细胞中,由此所述向导RNA将所述Cpf1蛋 白靶向所述细胞基因组中的靶序列,导致所述靶序列中的一或多个核苷酸 取代、缺失和/或添加。
在另一方面,本发明还提供经遗传修饰的生物体,其包含通过本发明 的方法产生的经遗传修饰的细胞或其后代细胞。
可以被Cpf1蛋白和向导RNA(即crRNA)复合物识别并靶向的靶序列或 crRNA编码序列的设计可以参照例如Zhang et al.,Cell 163,1–13,October 22, 2015。一般而言,本发明的基因组编辑系统靶向的靶序列5’末端需包含前 间区序列邻近基序(protospaceradjacent motif)(PAM)5’-TTTN、5’-TTN、 5’-CCN、5’-TCCN、5’-TCN或5’-CTN,其中N表示A、G、C或T。
例如,在本发明的一些实施方案中,所述靶序列具有以下结构:5’- TYYN-NX-3’或5’-YYN-NX-3’,其中N独立地选自A、G、C和T,Y选自 C和T;X为15≤X≤35的整数;Nx表示X个连续的核苷酸。
在本发明中,待进行修饰的靶序列可以位于基因组的任何位置,例如 位于功能基因如蛋白编码基因内,或者例如可以位于基因表达调控区如启 动子区或增强子区,从而实现对所述基因功能修饰或对基因表达的修饰。
可以通过T7EI、PCR/RE或测序方法检测所述细胞靶序列中的取代、 缺失和/或添加。
在本发明的方法中,所述基因组编辑系统可以通过本领域技术人员熟 知的各种方法导入细胞。
可用于将本发明的基因组编辑系统导入细胞的方法包括但不限于:磷 酸钙转染、原生质融合、电穿孔、脂质体转染、微注射、病毒感染(如杆状 病毒、痘苗病毒、腺病毒、腺相关病毒、慢病毒和其他病毒)、基因枪法、 PEG介导的原生质体转化、土壤农杆菌介导的转化。
可以通过本发明的方法进行基因组编辑的细胞可以来自例如,哺乳动 物如人、小鼠、大鼠、猴、犬、猪、羊、牛、猫;家禽如鸡、鸭、鹅;植 物,包括单子叶植物和双子叶植物,例如水稻、玉米、小麦、高粱、大麦、 大豆、花生、拟南芥等。
在一些实施方式中,本发明的方法在体外进行。例如,所述细胞是分 离的细胞。在一些实施方式中,所述细胞是CAR-T细胞。在一些实施方式 中,所述细胞是诱导的胚胎干细胞。
在另一些实施方式中,本发明的方法还可以在体内进行。例如,所述 细胞是生物体内的细胞,可以通过例如病毒介导的方法将本发明的系统体 内导入所述细胞。例如,所述细胞可以是患者体内的肿瘤细胞。
四、治疗应用
本发明还涵盖本发明的基因组编辑系统在疾病治疗中的应用。
通过本发明的基因组编辑系统对疾病相关基因进行修饰,可以实现疾 病相关基因的上调、下调、失活、激活或者突变纠正等,从而实现疾病的 预防和/或治疗。例如,本发明中靶序列可以位于疾病相关基因的蛋白编码 区内,或者例如可以位于基因表达调控区如启动子区或增强子区,从而可 以实现对所述疾病相关基因功能修饰或对疾病相关基因表达的修饰。
“疾病相关”基因是指与非疾病对照的组织或细胞相比,在来源于疾 病影响的组织的细胞中以异常水平或以异常形式产生转录或翻译产物的任 何基因。在改变的表达与疾病的出现和/或进展相关的情况下,它可以是以 异常高的水平被表达的基因;它可以是以异常低的水平被表达的基因。疾 病相关基因还指具有一个或多个突变或直接负责或与一个或多个负责疾病 的病因学的基因连锁不平衡的遗传变异的基因。转录的或翻译的产物可以 是已知的或未知的,并且可以处于正常或异常水平。
因此,本发明还提供治疗有需要的对象中的疾病的方法,包括向所述 对象递送有效量的本发明的基因组编辑系统以修饰与所述疾病相关的基 因。
本发明还提供本发明的基因组编辑系统在制备用于治疗有需要的对象 中的疾病的药物组合物中的用途,其中所述基因组编辑系统用于修饰与所 述疾病相关的基因。
本发明还提供用于治疗有需要的对象中的疾病的药物组合物,其包含 本发明的基因组编辑系统和药学可接受的载体,其中所述基因组编辑系统 用于修饰与所述疾病相关的基因。
在一些实施方式中,所述对象是哺乳动物,例如人。
所述疾病的实例包括但不限于肿瘤、炎症、帕金森病、心血管疾病、 阿尔茨海默病、自闭症、药物成瘾、年龄相关性黄斑变性、精神分裂症、 遗传性疾病等。
本发明所述的疾病和相应的疾病相关基因的更多实例例如可以从中国 专利申请CN201480045703.4中找到。
五、试剂盒
本发明的范围内还包括用于本发明的方法的试剂盒,该试剂盒包括本 发明的基因组编辑系统,以及使用说明。试剂盒一般包括表明试剂盒内容 物的预期用途和/或使用方法的标签。术语标签包括在试剂盒上或与试剂盒 一起提供的或以其他方式随试剂盒提供的任何书面的或记录的材料。
实施例
材料与方法
细胞培养和转染
HEK293T,HeLa细胞培养在添加10%FBS(Gibco)和100U/ml青霉素 和100ug/ml链霉素的DMEM培养液(Gibco)中,小鼠EpiSC细胞培养在 添加bFGF的N2B27培养液中。转染前12h将细胞的密度以每孔500,000 接种于12孔板中,12h后细胞的密度在60%-70%左右,然后使用 Lipofectamine LTX and PLUS reagents(Invitrogen)将1.5ug(Cpf1:crRNA= 2:1)质粒转入细胞中,在转染前换成无血清的opti-MEM培养液(Gibco), 转染后8-12h换成添加血清的DMED培养液,转染48h后流式分选GFP阳 性的细胞用于基因型鉴定。
基因型分析:T7EI分析和DNA测序
流式分选GFP阳性细胞,用Buffer L+1/100V Protease K(Biotool)55℃ 裂解30min,95℃失活5min,直接用于PCR检测。在Cpf1打靶位点附近 设计合适的引物,扩增产物用DNA Clean&ConcentratorTM-5试剂盒纯化 (ZYMO Research),200ng纯化后PCR产物加入1uL NEBuffer2(NEB)并 用ddH2O稀释到10uL。然后按照之前报道的方法用于再退火形成异源二聚 体(Li,W.,et al.,Simultaneous generation and germline transmission ofmultiple gene mutations in rat using CRISPR-Cas systems.Nat Biotechnol,2013.31(8): p.684-6)。再退火产物加入0.3uL T7EI endnucelease和1/10V NEBuffer2(NEB),37℃酶切1h30min。3%TAE-Gel电泳分析基因型。Indel按照之前 文章中报道的方法进行计算(Cong,L.,et al.,Multiplex genome engineering using CRISPR/Cassystems.Science,2013.339(6121):p.819-23)。
T7EI分析阳性的样品对应的PCR产物连接到pEASY-T1或pEASY-B (Transgen)载体上。转化、涂板,37℃培养过夜。挑取适量的单克隆菌落 用于Sanger测序。通过与野生型基因型比对,确定突变基因型。
免疫荧光染色
Cpf1真核表达载体转染HeLa细胞48h后用4%PFA室温固定10min。 PBST(PBS+0.3%Triton X100)清洗三遍。2%BSA(+0.3%Triton X100)室温 封闭通透15min,一抗Ratanti-HA(Roche,1:1000)孵育过夜,PBST清洗三 遍。Cy3标记荧光二抗(JacksonImmunoResearch,1:1000)室温孵育2h, DAPI(Sigma,1:1000)染核10min,PBST清洗三遍。用Aqueous Mounting Medium(Abcam)封片,固定。Zeiss LSM780用于观察。
Cfp1蛋白原核表达纯化
Cpf1编码基因克隆到BPK2103/2104原核表达载体上。融合的C端 10xHis标签用于纯化蛋白。表达载体转化到BL21(DE3)E.coli感受态细胞 (Transgen)中。挑选IPTG诱导高表达的克隆接种到300mL CmR+LB培 养基中。37℃摇菌培养至OD600~0.4,加入终浓度1mM的IPTG,16℃诱 导表达16h。8000rpm 4℃离心10min,收集菌体沉淀,40mL NPI-10(+1xEDTA-free Protease Inhibitor Cocktail,Roche;+5%甘油)裂解菌体,冰浴, 超声破碎:总时间15min,超声2s,暂停5s,功率100W。破碎后,8000rpm 4℃离心10min,收集上清液。上清液中加入2mL His60Ni Superflow Resin (Takara),4℃摇匀1h,摇匀后经过Polypropylene Colums(Qiagen)纯化。利 用重力流,分别用20mL NPI-20,20mL NPI-40和10mL NPI-100充分洗掉 没有融合His10标签的杂蛋白。最后用6x0.5mL NPI-500将融合His10标签 的Cpf1蛋白从Ni柱中洗脱下来。洗脱后的目的蛋白在透析液(50mM Tris-HCl,300mM KCl,1mM DTT,20%Glycerol)中透析过夜。透析后的 蛋白溶液用100kDa AmiconUltra-4,PLHK Ultracel-PL超滤管(Millipore) 浓缩蛋白。用Micro BCATMProtein AssayKit(Thermo ScientificTM)测定蛋白 浓度。
RNA体外转录
合成带有T7启动子的体外转录crRNA模板(BGI),按照操作手册用HiScribeTMT7Quick High Yield RNA Synthesis Kit(NEB)体外转录crRNA。 转录完成后用Oligo Clean&ConcentratorTM(ZYMO Research)纯化,用 NanoDrop(ThermoScientificTM)测定浓度,-80℃保存。
体外酶切分析
带有不同5’PAM序列的靶序列合成后克隆到pUC19或p11-LacY-wtx1 载体上,用引物扩增后切胶纯化。200ng纯化PCR产物分别与400ng crRNA 以及50nM Cpf1蛋白在NEBuffer3(NEB)反应体系中37℃反应1h。随后在 12%Urea-TBE-PAGE胶或2.5%琼脂糖胶电泳分析。
靶序列
实验中所用靶序列如下表1所示:
表1
实施例1.鉴定新的Cpf1蛋白
CRISPR/Cpf1(Clustered Regularly Interspaced Short Palindromic Repeatsfrom Prevotella and Francisella 1)是在Prevotella和Francisella中发现的获得 性免疫机制,并且被成功改造用于DNA编辑技术(Zetsche,B.,et al.,Cpf1is a single RNA-guided endonuclease of a class 2CRISPR-Cas system.Cell,2015. 163(3):p.759-71.)。与CRISPR/Cas9不同,Cpf1只需要1条crRNA来引导, 不需要tracrRNA,并且Cpf1蛋白使用富含T碱基的PAM序列。
本发明人利用已经报道的AsCpf1、LbCpf1和FnCpf1在NCBI数据库 中进行PSI-Blast搜索,从中挑选出25个未详细报道的Cpf1蛋白,其中有 21个Cpf1蛋白拥有directrepeat(DR)序列。
通过序列比对发现,21条来自不同Cpf1蛋白的DR序列和RNA的二 级结构相当保守(图1)。基于除了crRNA的保守外,所有Cpf1蛋白的PAM 序列也很保守(即都是使用富含T碱基的PAM序列)的假设,从25个Cpf1 蛋白中挑选了12个作为候选。所选12个候选Cpf1蛋白具体信息的如下表 2所示:
表2.候选Cpf1蛋白
实施例2.验证Cpf1蛋白编辑人基因组的能力
针对在人中的表达进行密码子优化并合成12个选定蛋白的编码序列 (BGI),克隆到真核表达载体(pCAG-2AeGFP-SV40和 pCAG-2AeGFP-SV40_v4)和原核表达载体(BPK2103-ccdB和BPK2014-ccdB) 中。
通过HeLa细胞转染实验,我们可以发现,所选定的Cpf1蛋白可以明 显地在细胞核中表达(图2)。
接下来,通过与靶向人Dnmt1基因的crRNA共同转染293T细胞,经 PCR和T7EI酶切鉴定,发现有8种Cpf1蛋白(Ab、Ar、Bo、Bs、Hk、Lp、 Ps、Sa)可以在293T细胞中的Dnmt1基因的3号位点引发突变(图3A)。 而DNA测序也进一步证明该基因位点确实产生了基因突变(图3B)。
为了进一步证明所鉴定的Cpf1蛋白可以在哺乳动物基因组上引发突 变,选取了Dnmt1基因的1号和2号位点重复上述实验。T7EI酶切结果证 明,ArCpf1、BsCpf1、HkCpf1和LpCpf1四个蛋白可以在两个基因位点产 生明显的indel(图4A和5A)。而DNA测序结果证明除了上述4个蛋白, PsCpf1和SaCpf1蛋白也能引发基因的突变(图4B和5B)。
实施例3.鉴定PAM序列
为了确定所鉴定Cpf1蛋白的PAM序列,首先,选择了BsCpf1蛋白进 行详细的研究。
利用大肠杆菌原核表达BsCpf1,通过His标签进行蛋白纯化(图6A)。 通过将纯化的BsCpf1蛋白和对应的crRNA以及dsDNA片段(人Dnmt1靶 位点2)在37℃孵育1h,然后进行TBE变性PAGE胶电泳,证明BsCpf1 的PAM序列为5’(T)TTN-(图6B-E)。
然后,利用类似方法,鉴别其余Cpf1蛋白的PAM序列,结果如图7 所示。实验结果表明,所选的Cpf1蛋白均具有体外酶切活性。其中9种 Cpf1蛋白的PAM序列是5’(T)TTN-。而C6Cpf1、HkCpf1和PsCpf1的PAM 序列是5’(T)YYN-。
为了确认HkCpf1体内编辑的PAM序列是5’(T)YYN-,将HkCpf1载体 与人AAVS1基因上8个位点的靶向crRNA载体分别转染293T细胞。实验 结果表明(图8),HkCpf1可以对其中的7个位点进行编辑并产生突变,其 PAM序列是5’(T)YYN-(图8B下划线部分)。
实施例4.小鼠基因组编辑
为了拓展这些Cpf1蛋白的适用性,在小鼠的EpiSC细胞系中进行了一 些列的转染实验。
首先,我们将BsCpf1载体与小鼠Tet1基因上8个位点的靶向crRNA 载体(pUC19-crRNA)分别转染EpiSC细胞。通过T7EI酶切和DNA测序证 明,BsCpf1在2号,3号和7号位点引发基因突变(图9A)。其中,7号位 点的PAM序列为5’TTA-,该结果与实施例3的结论吻合。说明BsCpf1 蛋白使用5’TTN-PAM序列。
接下来,重复验证了ArCpf1、BsCpf1、HkCpf1和PxCpf1的DNA编 辑能力。通过与小鼠MeCP2基因8个位点的靶向crRNA载体分别共转染。 利用T7EI酶切和DNA测序,证明ArCpf1(图9D、E)、BsCpf1(图9B、C)、 HkCpf1(图9F、G)和PxCpf1(图9H、I)四种蛋白可以靶向编辑小鼠的基因 组。
以上实验结果证明,本发明发现的12个Cpf1蛋白都具有DNA编辑的 能力,其中ArCpf1(SEQ ID NO:1)、BsCpf1(SEQ ID NO:8)、HkCpf1(SEQ ID NO:4)和PxCpf1(SEQ IDNO:11)能够实现高效的哺乳动物基因组编辑,其余 的Cpf1蛋白由于编辑效率太低,实验结果重复性差,可能不能够有效的实 现哺乳动物基因组的编辑。
序列表
>SEQ ID NO:1 ArCpf1
MNNGTNNFQNFIGISSLQKTLRNALTPTETTQQFIVKNGIIKEDELRGENRQILKDIMDDYYRGFISETLSSIDDIDWTSLFEKMEIQLKNGDNKDTLIKEQAEKRKAIYKKFADDDRFKNMFSAKLISDILPEFVIHNNNYSASEKEEKTQVIKLFSRFATSFKDYFKNRANCFSADDISSSSCHRIVNDNAEIFFSNALVYRRIVKNLSNDDINKISGDMKDSLKKMSLEKIYSYEKYGEFITQEGISFYNDICGKVNSFMNLYCQKNKENKNLYKLRKLHKQILCIADTSYEVPYKFESDEEVYQSVNGFLDNISSKHIVERLRKIGDNYNGYNLDKIYIVSKFYESVSQKTYRDWETINTALEIHYNNILPGNGKSKADKVKKAVKNDLQKSITEINELVSNYKLCPDDNIKAETYIHEISHILNNFEAQELKYNPEIHLVESELKASELKNVLDVIMNAFHWCSVFMTEELVDKDNNFYAELEEIYDEIYPVISLYNLVRNYVTQKPYSTKKIKLNFGIPTLADGWSKSKEYSNNAIILMRDNLYYLGIFNAKNKPEKKIIEGNTSENKGDYKKMIYNLLPGPNKMIPKVFLSSKTGVETYKPSAYILEGYKQNKHLKSSKDFDITFCRDLIDYFKNCIAIHPEWKNFGFDFSDTSTYEDISGFYREVELQGYKIDWTYISEKDIDLLQEKGQLYLFQIYNKDFSKKSTGNDNLHTMYLKNLFSEENLKDVVLKLNGEAEIFFRKSSIKNPIIHKKGSILVNRTYEAEEKDQFGNIQIVRKTIPENIYQELYKYFNDKSDKELSDEAAKLKNAVGHHEAATNIVKDYRYTYDKYFLHMPITINFKANKTSFINDRILQYIAKEKDLHVIGIDRGERNLIYVSVIDTCGNIVEQKSFNIVNGYDYQIKLKQQEGARQIARKEWKEIGKIKEIKEGYLSLVIHEISKMVIKYNAIIAMEDLSYGFKKGRFKVERQVYQKFETMLINKLNYLVFKDISITENGGLLKGYQLTYIPEKLKNVGHQCGCIFYVPAAYTSKIDPTTGFVNIFKFKDLTVDAKREFIKKFDSIRYDSDKNLFCFTFDYNNFITQNTVMSKSSWSVYTYGVRIKRRFVNGRFSNESDTIDITKDMEKTLEMTDINWRDGHDLRQDIIDYEIVQHIFEIFKLTVQMRNSLSELEDRNYDRLISPVLNENNIFYDSAKAGDALPKDADANGAYCIALKGLYEIKQITENWKEDGKFSRDKLKISNKDWFDFIQNKRYL
>SEQ ID NO:2 LpCpf1
MFSLDYFSLTLSQRYIDIYNTMIGGNTLADGTKVQGINENINIYRQKNNIDRKNLPTLKPLHKQLLSDRETLSWIPEAFKTKEEVVGAIEDFYKNNIISFKCCDNIVDITKQFIDIFSLNEDYELNKIFIKNDISITSISQDIFKDYRIIKEALWQKHINENPKAAKSKDLTGDKEKYFSRKNSFFSFEEIISSLKLMGRKIDLFSYFKDNVEYRAHSIETTFIKWQKNKNDKKTTKELLDNILNLQRVLKPLYLKAEVEKDILFYSIFDIYFESLNEIVKLYNKVRDFESKKPYSLEKFKLNFQNSTLLSGWDVNKEPDNTSILLKKDGLYYLGIMDKKHNRVFKNLESSKGGYEKIEYKLLSGPNKMLPKVFFSNKSIGYYNPSPALLEKYKSGVHKKGESFDLNFCHELIDFFKASIDKHEDWKNFNFKFSDTSEYADISGFYREVEQQGYKITFKNIDEEFINTLINEGKLYLFQIYNKDFSTFSKGTKNLHTLYWEMIFNEENLKNVVYKLNGEAEIFYRKKSIEYSEDKMKYGHHYEELKDKFNYPIIKDKRFTMDKFQFHVPITMNFKATGRSYINEEVNDFLRQNSKDVKIIGINRGERHLIYLTMINAKGEIIQQYSLNEIVNSYNNKNFTVNYNEKLSKKEGERAIARENWGVVENIKELKEGYLSHAIHTISNLIVENNAIVVLEDLNFEFKRERLKVEKSIYQKFEKMLIDKLNYLVDKKKDINENGGLLKALQLTNKFESFEKIGKQNGFLFFVNAWNITKICPVTGFVSLFDTRYQSVDKAREFFSKFDSIKYNEEKEHYEFVFDYSNFTDKAKDTKTKWTVCSYGTRIKTFRNSEKNNNWDNKTVSPTEDLSKLLKSCDRDIKEFIISQDKKEFFVELLEIFSLIVQMKNSIINSEIDYIISPVANENGEFFDSRFANSSLPKNADANAAYNTARKGLMLLEKIRDSEIGKKIDMKITNTEWLNFVQER
>SEQ ID NO:3 SaCpf1
MNDIEGLKEEFLKISLENFEGIYISNKKLNEISNRKFGDYNSINMMIKQSMNEKGILSKKEINELIPDLENINKPKVKSFNLSFIFENLTKEHKELIIDYIRENICNVIENVKITIEKYRNIDNKIEFKNNAEKVSKIKEMLESINELCKLIKEFNTDEIEKNNEFYNILNKNFEIFESSYKVLNKVRNFVTKKEVIENKMKLNFSNYQLGNGWHKNKEKDCSIILFRKRNNERWIYYLGILKHGTKIKENDYLSSVDTGFYKMDYYAQNSLSKMIPKCSITVKNVKNAPEDESVILNDSKKFNEPLEITPEIRKLYGNNEHIKGDKFKKESLVKWIDFCKEFLLKYKSFEKAKKEILKLKESNLYENLEEFYSDAEEKAYFLEFINIDEDKIKKLVKEKNLYLFQIYNKDFSAYSTGNKNLHTMYFEELFTDENLKKPVFKLNGNTEVFYRIASSKPKIVHNKGEKLVNKTYLDDGIIKTIPDSVYEEISEKVKNNEDYSKLLEENNIKNLEIKVATHEIVKDKRYFENKFLFYLPITLNKKVSNKNTNKNINKNVIDEIKDCNEYNVIGIDRGERNLISLCIINQNGEIILQKEMNIIQSSDKYNVDYNEKLEIKSKERDNAKKNWSEIGKIKDLKSGYLSAVVHEIVKLAIEYNAVIILEDLNNGFKNSRKKVDKQIYQKFERALIEKLQFLIFKNYDKNEKGGLRNAFQLTPELKNITKVASQQGIIIYTNPAYTSKIDPTTGYANIIKKSNNNEESIVKAIDKISYDKEKDMFYFDINLSNSSFNLTVKNVLKKEWRIYTNGERIIYKDRKYITLNITQEMKDILSKCGIDYLNIDNLKQDILKNKLHKKVYYIFELANKMRNENKDVDYIISPVLNKDGKFFMTQEINELTPKDADLNGAYNIALKGKLMIDNLNKKEKFVFLSNEDWLNFIQGR
>SEQ ID NO:4 HkCpf1
MFEKLSNIVSISKTIRFKLIPVGKTLENIEKLGKLEKDFERSDFYPILKNISDDYYRQYIKEKLSDLNLDWQKLYDAHELLDSSKKESQKNLEMIQAQYRKVLFNILSGELDKSGEKNSKDLIKNNKALYGKLFKKQFILEVLPDFVNNNDSYSEEDLEGLNLYSKFTTRLKNFWETRKNVFTDKDIVTAIPFRAVNENFGFYYDNIKIFNKNIEYLENKIPNLENELKEADILDDNRSVKDYFTPNGFNYVITQDGIDVYQAIRGGFTKENGEKVQGINEILNLTQQQLRRKPETKNVKLGVLTKLRKQILEYSESTSFLIDQIEDDNDLVDRINKFNVSFFESTEVSPSLFEQIERLYNALKSIKKEEVYIDARNTQKFSQMLFGQWDVIRRGYTVKITEGSKEEKKKYKEYLELDETSKAKRYLNIREIEELVNLVEGFEEVDVFSVLLEKFKMNNIERSEFEAPIYGSPIKLEAIKEYLEKHLEEYHKWKLLLIGNDDLDTDETFYPLLNEVISDYYIIPLYNLTRNYLTRKHSDKDKIKVNFDFPTLADGWSESKISDNRSIILRKGGYYYLGILIDNKLLINKKNKSKKIYEILIYNQIPEFSKSIPNYPFTKKVKEHFKNNVSDFQLIDGYVSPLIITKEIYDIKKEKKYKKDFYKDNNTNKNYLYTIYKWIEFCKQFLYKYKGPNKESYKEMYDFSTLKDTSLYVNLNDFYADVNSCAYRVLFNKIDENTIDNAVEDGKLLLFQIYNKDFSPESKGKKNLHTLYWLSMFSEENLRTRKLKLNGQAEIFYRKKLEKKPIIHKEGSILLNKIDKEGNTIPENIYHECYRYLNKKIGREDLSDEAIALFNKDVLKYKEARFDIIKDRRYSESQFFFHVPITFNWDIKTNKNVNQIVQGMIKDGEIKHIIGIDRGERHLLYYSVIDLEGNIVEQGSLNTLEQNRFDNSTVKVDYQNKLRTREEDRDRARKNWTNINKIKELKDGYLSHVVHKLSRLIIKYEAIVIMENLNQGFKRGRFKVERQVYQKFELALMNKLSALSFKEKYDERKNLEPSGILNPIQACYPVDAYQELQGQNGIVFYLPAAYTSVIDPVTGFTNLFRLKSINSSKYEEFIKKFKNIYFDNEEEDFKFIFNYKDFAKANLVILNNIKSKDWKISTRGERISYNSKKKEYFYVQPTEFLINKLKELNIDYENIDIIPLIDNLEEKAKRKILKALFDTFKYSVQLRNYDFENDYIISPTADDNGNYYNSNEIDIDKTNLPNNGDANGAFNIARKGLLLKDRIVNSNESKVDLKIKNEDWINFIIS
>SEQ ID NO:5 AbCpf1
MFSLDYFSLTLSQRYIDIYNTMIGGNTLADGTKVQGINENINIYRQKNNIDRKNLPTLKPLHKQLLSDRETLSWIPEAFKTKEEVVGAIEDFYKNNIISFKCCDNIVDITKQFIDIFSLNEDYELNKIFIKNDISITSISQDIFKDYRIIKEALWQKHINENPKAAKSKDLTGDKEKYFSRKNSFFSFEEIISSLKLMGRKIDLFSYFKDNVEYRAHSIETTFIKWQKNKNDKKTTKELLDNILNLQRVLKPLYLKAEVEKDILFYSIFDIYFESLNEIVKLYNKVRDFESKKPYSLEKFKLNFQNSTLLSGWDVNKEPDNTSILLKKDGLYYLGIMDKKHNRVFKNLESSKGGYEKIEYKLLSGPNKMLPKVFFSNKSIGYYNPSPALLEKYKSGVHKKGESFDLNFCHELIDFFKASIDKHEDWKNFNFKFSDTSEYADISGFYREVEQQGYKITFKNIDEEFINTLINEGKLYLFQIYNKDFSTFSKGTKNLHTLYWEMIFNEENLKNVVYKLNGEAEIFYRKKSIEYSEDKMKYGHHYEELKDKFNYPIIKDKRFTMDKFQFHVPITMNFKATGRSYINEEVNDFLRQNSKDVKIIGINRGERHLIYLTMINAKGEIIQQYSLNEIVNSYNNKNFTVNYNEKLSKKEGERAIARENWGVVENIKELKEGYLSHAIHTISNLIVENNAIVVLEDLNFEFKRERLKVEKSIYQKFEKMLIDKLNYLVDKKKDINENGGLLKALQLTNKFESFEKIGKQNGFLFFVNAWNITKICPVTGFVSLFDTRYQSVDKAREFFSKFDSIKYNEEKEHYEFVFDYSNFTDKAKDTKTKWTVCSYGTRIKTFRNSEKNNNWDNKTVSPTEDLSKLLKSCDRDIKEFIISQDKKEFFVELLEIFSLIVQMKNSIINSEIDYIISPVANENGEFFDSRFANSSLPKNADANAAYNTARKGLMLLEKIRDSEIGKKIDMKITNTEWLNFVQER
>SEQ ID NO:6 BoCpf1
MRKFNEFVGLYPISKTLRFELKPIGKTLEHIQRNKLLEHDAVRADDYVKVKKIIDKYHKCLIDEALSGFTFDTEADGRSNNSLSEYYLYYNLKKRNEQEQKTFKTIQNNLRKQIVNKLTQSEKYKRIDKKELITTDLPDFLTNESEKELVEKFKNFTTYFTEFHKNRKNMYSKEEKSTAIAFRLINENLPKFVDNIAAFEKVVSSPLAEKINALYEDFKEYLNVEEISRVFRLDYYDELLTQKQIDLYNAIVGGRTEEDNKIQIKGLNQYINEYNQQQTDRSNRLPKLKPLYKQILSDRESVSWLPPKFDSDKNLLIKIKECYDALSEKEKVFDKLESILKSLSTYDLSKIYISNDSQLSYISQKMFGRWDIISKAIREDCAKRNPQKSRESLEKFAERIDKKLKTIDSISIGDVDECLAQLGETYVKRVEDYFVAMGESEIDDEQTDTTSFKKNIEGAYESVKELLNNADNITDNNLMQDKGNVEKIKTLLDAIKDLQRFIKPLLGKGDEADKDGVFYGEFTSLWTKLDQVTPLYNMVRNYLTSKPYSTKKIKLNFENSTLMDGWDLNKEPDNTTVIFCKDGLYYLGIMGKKYNRVFVDREDLPHDGECYDKMEYKLLPGANKMLPKVFFSETGIQRFLPSEELLGKYERGTHKKGAGFDLGDCRALIDFFKKSIERHDDWKKFDFKFSDTSTYQDISEFYREVEQQGYKMSFRKVSVDYIKSLVEEGKLYLFQIYNKDFSAHSKGTPNMHTLYWKMLFDEENLKDVVYKLNGEAEVFFRKSSITVQSPTHPANSPIKNKNKDNQKKESKFEYDLIKDRRYTVDKFLFHVPITMNFKSVGGSNINQLVKRHIRSATDLHIIGIDRGERHLLYLTVIDSRGNIKEQFSLNEIVNEYNGNTYRTDYHELLDTREGERTEARRNWQTIQNIRELKEGYLSQVIHKISELAIKYNAVIVLEDLNFGFMRSRQKVEKQVYQKFEKMLIDKLNYLVDKKKPVAETGGLLRAYQLTGEFESFKTLGKQSGILFYVPAWNTSKIDPVTGFVNLFDTHYENIEKAKVFFDKFKSIRYNSDKDWFEFVVDDYTRFSPKAEGTRRDWTICTQGKRIQICRNHQRNNEWEGQEIDLTKAFKEHFEAYGVDISKDLREQINTQNKKEFFEELLRLLRLTLQMRNSMPSSDIDYLISPVANDTGCFFDSRKQAELKENAVLPMNADANGAYNIARKGLLAIRKMKQEENDSAKISLAISNKEWLKFAQTKPYLED
>SEQ ID NO:7 OsCpf1
METEILKYDFFEREGKYMYYDGLTKQYALSKTIRNELVPIGKTLDNIKKNRILEADIKRKSDYEHVKKLMDMYHKKIINEALDNFKLSVLEDAADIYFNKQNDERDIDAFLKIQDKLRKEIVEQLKGHTDYSKVGNKDFLGLLKAASTEEDRILIESFDNFYTYFTSYNKVRSNLYSAEDKSSTVAYRLINENLPKFFDNIKAYRTVRNAGVISGDMSIVEQDELFEVDTFNHTLTQYGIDTYNHMIGQLNSAINLYNQKMHGAGSFKKLPKMKELYKQLLTEREEEFIEEYTDDEVLITSVHNYVSYLIDYLNSDKVESFFDTLRKSDGKEVFIKNDVSKTTMSNILFDNWSTIDDLINHEYDSAPENVKKTKDDKYFEKRQKDLKKNKSYSLSKIAALCRDTTILEKYIRRLVDDIEKIYTSNNVFSDIVLSKHDRSKKLSKNTNAVQAIKNMLDSIKDFEHDVMLINGSGQEIKKNLNVYSEQEALAGILRQVDHIYNLTRNYLTKKPFSTEKIKLNFNRPTFLDGWDKNKEEANLGILLIKDNRYYLGIMNTSSNKAFVNPPKAISNDIYKKVDYKLLPGPNKMLPKVFFATKNIAYYAPSEELLSKYRKGTHKKGDSFSIDDCRNLIDFFKSSINKNTDWSTFGFNFSDTNSYNDISDFYREVEKQGYKLSFTDIDACYIKDLVDNNELYLFQIYNKDFSPYSKGKLNLHTLYFKMLFDQRNLDNVVYKLNGEAEVFYRPASIESDEQIIHKSGQNIKNKNQKRSNCKKTSTFDYDIVKDRRYCKDKFMLHLPITVNFGTNESGKFNELVNNAIRADKDVNVIGIDRGERNLLYVVVVDPCGKIIEQISLNTIVDKEYDIETDYHQLLDEKEGSRDKARKDWNTIENIKELKEGYLSQVVNIIAKLVLKYDAIICLEDLNFGFKRGRQKVEKQVYQKFEKMLIDKMNYLVLDKSRKQESPQKPGGALNALQLTSAFKSFKELGKQTGIIYYVPAYLTSKIDPTTGFANLFYIKYESVDKARDFFSKFDFIRYNQMDNYFEFGFDYKSFTERASGCKSKWIACTNGERIVKYRNSDKNNSFDDKTVILTDEYRSLFDKYLQNYIDEDDLKDQILQIDSADFYKNLIKLFQLTLQMRNSSSDGKRDYIISPVKNYREEFFCSEFSDDTFPRDADANGAYNIARKGLWVIKQIRETKSGTKINLAMSNSEWLEYAQCNLL
>SEQ ID NO:8 BsCpf1
MYYQNLTKKYPVSKTIRNELIPIGKTLENIRKNNILESDVKRKQDYEHVKGIMDEYHKQLINEALDNYMLPSLNQAAEIYLKKHVDVEDREEFKKTQDLLRREVTGRLKEHENYTKIGKKDILDLLEKLPSISEEDYNALESFRNFYTYFTSYNKVRENLYSDEEKSSTVAYRLINENLPKFLDNIKSYAFVKAAGVLADCIEEEEQDALFMVETFNMTLTQEGIDMYNYQIGKVNSAINLYNQKNHKVEEFKKIPKMKVLYKQILSDREEVFIGEFKDDETLLSSIGAYGNVLMTYLKSEKINIFFDALRESEGKNVYVKNDLSKTTMSNIVFGSWSAFDELLNQEYDLANENKKKDDKYFEKRQKELKKNKSYTLEQMSNLSKEDISPIENYIERISEDIEKICIYNGEFEKIVVNEHDSSRKLSKNIKAVKVIKDYLDSIKELEHDIKLINGSGQELEKNLVVYVGQEEALEQLRPVDSLYNLTRNYLTKKPFSTEKVKLNFNKSTLLNGWDKNKETDNLGILFFKDGKYYLGIMNTTANKAFVNPPAAKTENVFKKVDYKLLPGSNKMLPKVFFAKSNIGYYNPSTELYSNYKKGTHKKGPSFSIDDCHNLIDFFKESIKKHEDWSKFGFEFSDTADYRDISEFYREVEKQGYKLTFTDIDESYINDLIEKNELYLFQIYNKDFSEYSKGKLNLHTLYFMMLFDQRNLDNVVYKLNGEAEVFYRPASIAENELVIHKAGEGIKNKNPNRAKVKETSTFSYDIVKDKRYSKYKFTLHIPITMNFGVDEVRRFNDVINNALRTDDNVNVIGIDRGERNLLYVVVINSEGKILEQISLNSIINKEYDIETNYHALLDEREDDRNKARKDWNTIENIKELKTGYLSQVVNVVAKLVLKYNAIICLEDLNFGFKRGRQKVEKQVYQKFEKMLIEKLNYLVIDKSREQVSPEKMGGALNALQLTSKFKSFAELGKQSGIIYYVPAYLTSKIDPTTGFVNLFYIKYENIEKAKQFFDGFDFIRFNKKDDMFEFSFDYKSFTQKACGIRSKWIVYTNGERIIKYPNPEKNNLFDEKVINVTDEIKGLFKQYRIPYENGEDIKEIIISKAEADFYKRLFRLLHQTLQMRNSTSDGTRDYIISPVKNDRGEFFCSEFSEGTMPKDADANGAYNIARKGLWVLEQIRQKDEGEKVNLSMTNAEWLKYAQLHLL
>SEQ ID NO:9 PsCpf1
MENFKNLYPINKTLRFELRPYGKTLENFKKSGLLEKDAFKANSRRSMQAIIDEKFKETIEERLKYTEFSECDLGNMTSKDKKITDKAATNLKKQVILSFDDEIFNNYLKPDKNIDALFKNDPSNPVISTFKGFTTYFVNFFEIRKHIFKGESSGSMAYRIIDENLTTYLNNIEKIKKLPEELKSQLEGIDQIDKLNNYNEFITQSGITHYNEIIGGISKSENVKIQGINEGINLYCQKNKVKLPRLTPLYKMILSDRVSNSFVLDTIENDTELIEMISDLINKTEISQDVIMSDIQNIFIKYKQLGNLPGISYSSIVNAICSDYDNNFGDGKRKKSYENDRKKHLETNVYSINYISELLTDTDVSSNIKMRYKELEQNYQVCKENFNATNWMNIKNIKQSEKTNLIKDLLDILKSIQRFYDLFDIVDEDKNPSAEFYTWLSKNAEKLDFEFNSVYNKSRNYLTRKQYSDKKIKLNFDSPTLAKGWDANKEIDNSTIIMRKFNNDRGDYDYFLGIWNKSTPANEKIIPLEDNGLFEKMQYKLYPDPSKMLPKQFLSKIWKAKHPTTPEFDKKYKEGRHKKGPDFEKEFLHELIDCFKHGLVNHDEKYQDVFGFNLRNTEDYNSYTEFLEDVERCNYNLSFNKIADTSNLINDGKLYVFQIWSKDFSIDSKGTKNLNTIYFESLFSEENMIEKMFKLSGEAEIFYRPASLNYCEDIIKKGHHHAELKDKFDYPIIKDKRYSQDKFFFHVPMVINYKSEKLNSKSLNNRTNENLGQFTHIIGIDRGERHLIYLTVVDVSTGEIVEQKHLDEIINTDTKGVEHKTHYLNKLEEKSKTRDNERKSWEAIETIKELKEGYISHVINEIQKLQEKYNALIVMENLNYGFKNSRIKVEKQVYQKFETALIKKFNYIIDKKDPETYIHGYQLTNPITTLDKIGNQSGIVLYIPAWNTSKIDPVTGFVNLLYADDLKYKNQEQAKSFIQKIDNIYFENGEFKFDIDFSKWNNRYSISKTKWTLTSYGTRIQTFRNPQKNNKWDSAEYDLTEEFKLILNIDGTLKSQDVETYKKFMSLFKLMLQLRNSVTGTDIDYMISPVTDKTGTHFDSRENIKNLPADADANGAYNIARKGIMAIENIMNGISDPLKISNEDYLKYIQNQQE
>SEQ ID NO:10 C6Cpf1
MKNVFGGFTNLYSLTKTLRFELKPTSKTQKLMKRNNVIQTDEEIDKLYHDEMKPILDEIHRRFINDALAQKIFISASLDNFLKVVKNYKVESAKKNIKQNQVKLLQKEITIKTLGLRREVVSGFITVSKKWKDKYVGLGIKLKGDGYKVLTEQAVLDILKIEFPNKAKYIDKFRGFWTYFSGFNENRKNYYSEEDKATSIANRIVNENLSRYIDNIIAFEEILQKIPNLKKFKQDLDITSYNYYLNQAGIDKYNKIIGGYIVDKDKKIQGINEKVNLYTQQTKKKLPKLKFLFKQIGSERKGFGIFEIKEGKEWEQLGDLFKLQRTKINSNGREKGLFDSLRTMYREFFDEIKRDSNSQARYSLDKIYFNKASVNTISNSWFTNWNKFAELLNIKEDKKNGEKKIPEQISIEDIKDSLSIIPKENLEELFKLTNREKHDRTRFFGSNAWVTFLNIWQNEIEESFNKLEEKEKDFKKNAAIKFQKNNLVQKNYIKEVCDRMLAIERMAKYHLPKDSNLSREEDFYWIIDNLSEQREIYKYYNAFRNYISKKPYNKSKMKLNFENGNLLGGWSDGQERNKAGVILRNGNKYYLGVLINRGIFRTDKINNEIYRTGSSKWERLILSNLKFQTLAGKGFLGKHGVSYGNMNPEKSVPSLQKFIRENYLKKYPQLTEVSNTKFLSKKDFDAAIKEALKECFTMNFINIAENKLLEAEDKGDLYLFEITNKDFSGKKSGKDNIHTIYWKYLFSESNCKSPIIGLNGGAEIFFREGQKDKLHTKLDKKGKKVFDAKRYSEDKLFFHVSITINYGKPKNIKFRDIINQLITSMNVNIIGIDRGEKHLLYYSVIDSNGIILKQGSLNKIRVGDKEVDFNKKLTERANEMKKARQSWEQIGNIKNFKEGYLSQAIHEIYQLMIKYNAIIVLEDLNTEFKAKRLSKVEKSVYKKFELKLARKLNHLILKDRNTNEIGGVLKAYQLTPTIGGGDVSKFEKAKQWGMMFYVRANYTSTTDPVTGWRKHLYISNFSNNSVIKSFFDPTNRDTGIEIFYSGKYRSWGFRYVQKETGKKWELFATKELERFKYNQTTKLCEKINLYDKFEELFKGIDKSADIYSQLCNVLDFRWKSLVYLWNLLNQIRNVDKNAEGNKNDFIQSPVYPFFDSRKTDGKTEPINGDANGALNIARKGLMLVERIKNNPEKYEQLIRDTEWDAWIQNFNKVN
>SEQ ID NO:11 PxCpf1
MIIGRDFNMYYQNLTKMYPISKTLRNELIPVGKTLENIRKNGILEADIQRKADYEHVKKLMDNYHKQLINEALQGVHLSDLSDAYDLYFNLSKEKNSVDAFSKCQDKLRKEIVSFLKNHENFPKIGNKEIIKLIQSLNDNDADNNALDSFSNFYTYFSSYNEVRKNLYSDEEKSSTVAYRLINENLPKSLDNIKAYAIAKKAGVRAEGLSEEEQDCLFIIETFERTLTQDGIDNYNADIGKLNTAINLYNQQNKKQEGFRKVPQMKCLYKQILSDREEAFIDEFSDDEDLITNIESFAENMNVFLNSEIITDFKNALVESDGSLVYIKNDVSKTLFSNIVFGSWNAIDEKLSDEYDLANSKKKKDEKYYEKRQKELKKNKSYDLETIIGLFDDSIDVIGKYIEKLESDITAIAEAKNDFDEIVLRKHDKNKSLRKNTNAVEAIKSYLDTVKDFERDIKLINGSGQEVEKNLVVYAEQENILAEIKNVDSLYNMSRNYLTQKPFSTEKFKLNFENPTLLNGWDRNKEKDYLGILFEKEGMYYLGIINNNHRKIFENEKLCTGKESCFNKIVYKQISNAAKYLSSKQINPQNPPKEIAEILLKRKADSSSLSRKETELFIDYLKDDFLVNYPMIINSDGENFFNFHFKQAKDYGSLQEFFKEVEHQAYSLKTRPIDDSYIYRMIDEGKLYLFQIHNKDFSPYSKGNLNLHTIYLQMLFDQRNLNNVVYKLNGEAEVFYRPASINDEEVIIHKAGEEIKNKNSKRAVDKPTSKFGYDIIKDRRYSKDKFMLHIPVTMNFGVDETRRFNDVVNDALRNDEKVRVIGIDRGERNLLYVVVVDTDGTILEQISLNSIINNEYSIETDYHKLLDEKEGDRDRARKNWTTIENIKELKEGYLSQVVNVIAKLVLKYNAIICLEDLNFGFKRGRQKVEKQVYQKFEKMLIDKLNYLVIDKSRKQEKPEEFGGALNALQLTSKFTSFKDMGKQTGIIYYVPAYLTSKIDPTTGFANLFYVKYENVEKAKEFFSRFDSISYNNESGYFEFAFDYKKFTDRACGARSQWTVCTYGERIIKYRNADKNNSFDDKTIVLSEEFKELFSIYGISYEDGAELKNKIMSVDEADFFRCLTGLLQKTLQMRNSSNDGTRDYIISPIMNDRGEFFNSEACDASKPKDADANGAFNIARKGLWVLEQIRNTPSGDKLNLAMSNAEWLEYAQRNQI
>SEQ ID NO:12 PrCpf1
MIIGRDFNMYYQNLTKMYPISKTLRNELIPVGKTLENIRKNGILEADIQRKADYEHVKKLMDNYHKQLINEALQGVHLSDLSDAYDLYFNLSKEKNSVDAFSKCQDKLRKEIVSLLKNHENFPKIGNKEIIKLLQSLYDNDTDYKALDSFSNFYTYFSSYNEVRKNLYSDEEKSSTVAYRLINENLPKFLDNIKAYAIAKKAGVRAEGLSEEDQDCLFIIETFERTLTQDGIDNYNAAIGKLNTAINLFNQQNKKQEGFRKVPQMKCLYKQILSDREEAFIDEFSDDEDLITNIESFAENMNVFLNSEIITDFKIALVESDGSLVYIKNDVSKTSFSNIVFGSWNAIDEKLSDEYDLANSKKKKDEKYYEKRQKELKKNKSYDLETIIGLFDDNSDVIGKYIEKLESDITAIAEAKNDFDEIVLRKHDKNKSLRKNTNAVEAIKSYLDTVKDFERDIKLINGSGQEVEKNLVVYAEQENILAEIKNVDSLYNMSRNYLTQKPFSTEKFKLNFNRATLLNGWDKNKETDNLGILFEKDGMYYLGIMNTKANKIFVNIPKATSNDVYHKVNYKLLPGPNKMLPKVFFAQSNLDYYKPSEELLAKYKAGTHKKGDNFSLEDCHALIDFFKASIEKHPDWSSFGFEFSETCTYEDLSGFYREVEKQGYKITYTDVDADYITSLVERDELYLFQIYNKDFSPYSKGNLNLHTIYLQMLFDQRNLNNVVYKLNGEAEVFYRPASINDEEVIIHKAGEEIKNKNSKRAVDKPTSKFGYDIIKDRRYSKDKFMLHIPVTMNFGVDETRRFNDVVNDALRNDEKVRVIGIDRGERNLLYVVVVDTDGTILEQISLNSIINNEYSIETDYHKLLDEKEGDRDRARKNWTTIENIKELKEGYLSQVVNVIAKLVLKYNAIICLEDLNFGFKRGRQKVEKQVYQKFEKMLIDKLNYLVIDKSRKQDKPEEFGGALNALQLTSKFTSFKDMGKQTGIIYYVPAYLTSKIDPTTGFANLFYVKYENVEKAKEFFSRFDSISYNNESGYFEFAFDYKKFTDRACGARSQWTVCTYGERIIKFRNTEKNNSFDDKTIVLSEEFKELFSIYGISYEDGAELKNKIMSVDEADFFRSLTRLFQQTMQMRNSSNDVTRDYIISPIMNDRGEFFNSEACDASKPKDADANGAFNIARKGLWVLEQIRNTPSGDKLNLAMSNAEWLEYAQRNQI
>SEQ ID NO:13 ArCpf1核苷酸序列
AACAACGGCACAAACAATTTCCAGAACTTCATTGGCATTTCATCACTCCAGAAGACCCTGAGGAATGCCCTGACACCTACCGAGACAACCCAGCAATTCATTGTGAAGAACGGAATCATTAAGGAGGACGAACTGAGAGGCGAGAACCGCCAGATCCTGAAAGACATCATGGACGACTACTATCGCGGTTTCATCTCCGAGACACTCAGCAGCATTGATGATATTGATTGGACATCTCTGTTTGAGAAGATGGAAATCCAGCTGAAGAATGGCGACAACAAGGACACTCTCATTAAGGAGCAAGCAGAGAAGAGAAAGGCTATCTACAAGAAGTTTGCAGACGACGATCGCTTCAAGAATATGTTTAGCGCAAAGCTGATAAGTGATATTCTCCCAGAGTTTGTGATTCACAACAACAACTATTCTGCCAGCGAGAAGGAAGAGAAGACACAGGTCATCAAGCTCTTCTCTCGGTTTGCCACCTCATTCAAAGATTACTTCAAGAATCGGGCAAATTGTTTCTCTGCCGATGACATCTCATCTTCCTCCTGTCACAGAATCGTTAATGATAACGCCGAAATCTTCTTCTCAAATGCCCTGGTGTACCGGAGAATCGTGAAGAATCTGAGTAACGACGACATCAATAAGATTAGCGGTGATATGAAGGACTCTCTGAAGAAGATGTCCCTGGAGAAGATATACAGCTATGAGAAGTACGGCGAGTTCATCACTCAAGAGGGAATTAGCTTCTACAACGACATTTGTGGCAAGGTTAACTCATTCATGAATCTCTACTGCCAGAAGAACAAGGAGAATAAGAATCTGTACAAACTGCGCAAACTGCATAAGCAAATCCTGTGTATCGCCGACACCTCTTATGAGGTGCCCTACAAGTTTGAGTCTGACGAAGAGGTGTACCAGTCCGTGAACGGATTCCTGGACAACATCAGTTCTAAACACATCGTCGAGCGGCTGAGGAAGATCGGCGACAACTACAATGGCTATAACCTGGACAAGATATACATCGTCAGTAAGTTCTATGAAAGTGTGAGTCAGAAGACCTACCGGGACTGGGAGACTATCAACACCGCACTCGAAATCCACTACAATAACATCCTGCCTGGCAACGGTAAGAGCAAGGCCGACAAGGTGAAGAAGGCCGTCAAGAACGACCTCCAGAAGAGCATCACCGAGATTAACGAACTGGTGAGTAACTACAAGCTCTGCCCAGACGATAACATTAAGGCAGAAACCTACATTCATGAGATTTCCCATATACTGAACAACTTTGAAGCTCAAGAGCTGAAATACAATCCCGAGATTCATCTGGTGGAGAGCGAGCTGAAAGCATCCGAGCTGAAGAACGTTCTCGACGTCATCATGAACGCCTTCCACTGGTGTAGCGTGTTCATGACTGAGGAGCTGGTCGATAAAGACAACAATTTCTACGCCGAGCTGGAAGAAATCTACGACGAGATATACCCTGTGATTAGCCTCTACAATCTGGTCCGGAACTATGTGACCCAGAAGCCCTATTCAACTAAGAAGATCAAGCTGAACTTTGGGATTCCTACCCTGGCCGACGGCTGGTCCAAGAGCAAAGAGTATTCCAATAACGCAATCATCCTGATGAGAGACAACCTGTATTACCTCGGTATCTTTAACGCTAAGAATAAGCCCGAGAAGAAGATCATCGAAGGAAATACATCCGAGAACAAGGGCGACTACAAGAAGATGATCTATAACCTGCTGCCTGGCCCAAACAAGATGATCCCTAAGGTGTTCCTGAGCAGCAAGACCGGAGTCGAGACTTACAAGCCAAGTGCCTACATACTGGAGGGCTATAAGCAGAACAAGCACCTGAAATCTAGCAAAGATTTCGACATCACTTTCTGTCGCGACCTGATCGACTATTTCAAGAATTGTATTGCCATCCACCCAGAGTGGAAGAATTTCGGATTCGACTTCTCTGACACCTCCACATACGAGGACATCAGTGGCTTCTATAGAGAAGTGGAGCTGCAAGGTTACAAGATCGACTGGACCTACATATCTGAGAAAGACATCGACCTGCTGCAAGAGAAAGGGCAGCTCTACCTCTTCCAAATCTACAACAAGGACTTTAGTAAGAAGTCTACAGGTAATGACAATCTGCACACTATGTACCTGAAGAATCTCTTCTCTGAAGAGAACCTCAAAGACGTGGTGCTGAAACTGAACGGCGAAGCAGAAATCTTCTTTCGCAAATCATCCATTAAGAATCCTATCATACACAAGAAGGGTAGTATCCTGGTGAACAGGACATACGAAGCCGAGGAGAAAGATCAGTTCGGCAACATTCAGATTGTGCGCAAGACTATTCCCGAGAATATCTACCAGGAGCTGTACAAATACTTCAACGATAAGTCTGATAAAGAGCTGTCAGACGAAGCAGCCAAGCTGAAGAATGCTGTGGGACATCATGAAGCAGCTACTAACATCGTGAAAGACTATAGATACACATACGACAAGTATTTCCTGCACATGCCAATTACCATCAACTTCAAAGCCAATAAGACTTCTTTCATTAACGACCGCATCCTCCAGTACATTGCAAAGGAGAAAGACCTGCACGTGATCGGGATTGATCGCGGAGAACGGAACCTCATCTACGTTTCAGTCATCGACACATGCGGTAACATCGTCGAACAGAAGAGCTTCAACATTGTTAATGGGTATGATTACCAGATCAAACTCAAGCAACAGGAAGGCGCACGGCAGATTGCTCGCAAGGAGTGGAAGGAAATTGGCAAGATCAAGGAAATCAAGGAAGGATACCTCAGCCTCGTCATTCATGAAATCAGCAAGATGGTGATCAAGTATAACGCAATCATCGCTATGGAGGACCTGAGTTATGGCTTCAAGAAAGGCAGATTCAAGGTGGAGCGGCAAGTCTACCAGAAATTCGAAACAATGCTGATCAACAAGCTGAACTACCTGGTGTTCAAAGACATCAGCATAACCGAGAACGGAGGACTCCTGAAAGGCTACCAGCTCACATACATCCCAGAGAAACTCAAGAATGTGGGCCACCAATGCGGCTGCATCTTCTACGTCCCTGCCGCTTACACCAGCAAGATAGATCCCACTACAGGATTCGTGAACATATTCAAATTCAAGGATCTGACAGTGGACGCCAAGAGGGAGTTCATCAAGAAGTTTGATAGTATTCGCTATGACAGCGATAAGAATCTGTTCTGTTTCACCTTTGACTACAACAACTTCATTACCCAGAATACCGTTATGTCCAAGTCTAGCTGGAGTGTGTATACCTACGGTGTTCGGATCAAGCGGAGGTTTGTCAATGGTAGATTCTCAAACGAAAGCGACACCATCGACATCACAAAGGACATGGAGAAGACACTGGAAATGACTGACATAAACTGGAGAGATGGACACGACCTGCGGCAAGACATCATTGACTACGAGATCGTTCAGCACATCTTTGAAATCTTCAAGCTGACTGTTCAGATGCGGAATAGTCTGAGCGAGCTGGAGGACCGGAATTACGACCGCCTGATCTCACCAGTCCTGAACGAGAATAACATCTTCTACGATTCTGCCAAAGCAGGAGATGCCCTGCCAAAGGACGCTGATGCAAACGGTGCCTACTGCATCGCCCTCAAAGGTCTGTACGAAATCAAGCAGATTACCGAGAATTGGAAGGAGGACGGGAAGTTCAGCAGAGACAAGCTCAAGATCAGCAATAAGGACTGGTTCGATTTCATTCAGAACAAGCGCTACCTG
>SEQ ID NO:14 LpCpf1核苷酸序列
ATGATCATGAACAACGTGACCGGCGACTTCAGCGAGTTCGTGGCCATCAGCAAGGTGCAGAAGACCCTGCGCAACGAGCTGCGCCCCACCCCCCTGACCATGAAGCACATCAAGCAGAAGGGCATCATCACCGAGGACGAGTACAAGACCCAGCAGAGCCTGGAGCTGAAGCGCATCGCCGACGGCTACTACCGCGACTACATCACCCACAAGCTGAACGACACCAACAACCTGGACTTCCGCAACCTGTTCGAGGCCATCGAGGAGAAGTACAAGAAGAACGACAAGGACAACCGCGACAAGCTGGACCTGGTGGAGAAGAGCAAGCGCGGCGAGATCGCCAAGCTGCTGAGCGCCGACGACAACTTCAAGAGCATGTTCGAGGCCAAGCTGATCACCCAGCTGCTGCCCGTGTACGTGGAGCAGAACTACATCGGCGAGGACAAGGAGAAGGCCCTGGAGACCATCGCCCTGTTCAAGGGCTTCACCACCTACTTCACCGACTACTTCAACATCCGCAAGAACATGTTCAAGGAGAACGGCGGCGCCAGCAGCATCTGCTACCGCATCGTGAACGTGAACGCCAGCATCTTCTACGACAACCTGAAGACCTTCATGTGCATCAAGGAGAAGGCCGAGACCGAGATCGCCCTGATCGAGGAGGAGCTGACCGAGCTGCTGGACAGCTGGCGCCTGGAGCACATCTTCAGCGAGGACTACTACAACGAGCTGCTGGCCCAGAAGGGCATCGACTACTACAACCAGATCTGCGGCGACGTGAACAAGCACATGAACCTGTACTGCCAGCAGAACAAGCTGAAGGCCAACGTGTTCAAGATGACCAAGCTGCAGAAGCAGATCATGGGCATCAGCGAGAAGGCCTTCGAGATCCCCCCCATGTACCAGAACGACGAGGAGGTGTACGCCGCCTTCAACGGCTTCATCAGCCGCCTGGAGGAGGTGAAGCTGATCGACCGCCTGGGCAACGTGCTGCAGAACAGCAACATCTACGACACCGCCAAGATCTACATCAACGCCCGCTGCTACACCAACGTGAGCAGCTACGTGTACGGCGGCTGGGGCGTGATCGAGAGCGCCATCGAGCGCTACTGGTACAACACCATCGCCGGCAAGGGCCAGAGCAAGGCCAAGAAGATCGAGAAGGCCAAGAAGGACAACAAGTTCATGAGCGTGAAGGAGCTGGACAGCATCGTGAGCGACTACGAGCCCGACTACTTCAACGCCAGCAACATGGACGACGACAACAGCGGCCGCGCCTTCAGCGGCCACGGCGTGCTGGGCTACTTCAACAAGATGAGCAAGCTGCTGGCCAACATGAGCCTGCACACCATCACCTACGACAGCGGCGACAGCCTGATCGAGAACAAGGAGACCGCCCTGAACATCAAGAAGGACCTGGACGACATCATGAGCATCTACCACTGGCTGCAGACCTTCATCATCGACGAGGTGGTGGAGAAGGACAACGCCTTCTACGCCGAGCTGGAGGACATCTACTACGAGCTGGAGAACGTGGTGACCCTGTACGACCGCATCCGCAACTACGTGACCCGCAAGCCCTACAGCACCCAGAAGTTCaagcttAACTTCGCCAGCCCCACCCTGGCCAGCGGCTGGAGCCGCAGCAAGGAGTTCGACAACAACGCCATCATCCTGCTGCGCAACAACAAGTACTACATCGCCATCTTCAACGTGAACAACAAGCCCGACAAGCAGATCATCAAGGGCAGCGAGGAGCAGCGCCTGAGCACCGACTACAAGAAGATGGTGTACAACCTGCTGCCCGGCCCCAACAAGATGCTGCCCTGGGTGTTCATCAAGAGCAACACCGGCAAGCGCGACTACAACCCCAGCAGCTACATCCTGGAGGGCTACGAGAAGAACCGCCACATCAAGAGCAGCGGCAACTTCGACATCAACTACTGCCACGACCTGATCGACTACTACAAGGCCTGCATCAACAAGCACCCCGAGTGGAAGAACTACGGCTTCAAGTTCAAGGAGACCACCCAGTACAACGACATCGGCCAGTTCTACAAGGACGTGGAGAAGCAGGGCTACAGCATCAGCTGGGCCTACATCAGCGAGGCCGACATCAACCGCCTGGACGAGGAGGGCAAGATCTACCTGTTCGAGATCTACAACAAGGACCTGAGCAGCCACAGCACCGGCAAGGACAACCTGCACACCATGTACCTGAAGAACATCTTCAGCGAGGACAACCTGAAGAACATCTGCATCGAGCTGAACGGCAACGCCGAGCTGTTCTACCGCAAGAGCAGCATGAAGCGCAACATCACCCACAAGAAGGACACCGTGCTGGTGAACAAGACCTACATCAACGAGGCCGGCGTGCGCGTGAGCCTGACCGACGAGGACTACATCAAGGTGTACAACTACTACAACAACGACTACGTGATCGACGTGGAGAAGGACAAGAAGCTGGTGGAGATCCTGGAGCGCATCGGCCACCGCAAGAACCCCATCGACATCATCAAGGACAAGCGCTACACCGAGGACAAGTACTTCCTGCACTTCCCCATCACCATCAACTACGGCGTGGACGACGAGAACATCAACGCCAAGATGATCGAGTACATCGCCAAGCACAACAACATGAACGTGATCGGCATCGACCGCGGCGAGCGCAACCTGATCTACATCAGCGTGATCAACAACAAGGGCAACATCATCGAGCAGAAGAGCTTCAACCTGGTGAACAACTACGACTACAAGAACAAGCTGAAGAACATGGAGAAGACCCGCGACAACGCCCGCAAGAACTGGCAGGAGATCGGCAAGATCAAGGACGTGAAGAACGGCTACCTGAGCGGCGTGATCAGCAAGATCGCCCGCATGGTGGTGGACTACAACGCCATCATCGTGATGGAGGACCTGAACCGCGGCTTCAAGCGCGGCCGCTTCAAGGTGGAGCGCCAGGTGTACCAGAAGTTCGAGAACATGCTGATCAGCAAGCTGAACTACCTGGTGTTCAAGGAGAAGAAGGCCGACGAGAACGGCGGCATCCTGAAGGGCTACCAGCTGACCTACCTGCCCAAGAGCGCCCTGCAGATCGGCAAGCAGTGCGGCTGCATCTTCTACGTGCCCGCCGCCTACACCAGCAAGATCGACCCCGCCACCGGCTTCATCAACATCTTCGACTTCAAGAAGTACAGCGGCAGCGCCATCAACGCCAAGGTGAAGGACAAGAAGGAGTTCCTGATGAGCATGAACAGCATCCGCTACGTGAACGAGGGCAGCGCCGAGTACGAGAAGATCGGCCACCGCCAGCTGTTCGCCTTCAGCTTCGACTACAACAACTTCAAGACCTACAACGTGAGCATCCCCGTGAACGAGTGGACCACCTACACCTACGGCGAGCGCATCAAGAAGCTGTACAAGGACGGCCGCTGGAGCGGCAGCGAGGTGCTGAACCTGACCGAGGACCTGATCGAGCTGATGGAGCAGTACGGCATCGAGTACAAGGACGGCCACGACATCCGCGAGGACATCAGCCACATGGACGAGATGCGCAACGCCGACTTCATCTGCAACCTGTTCGAGAAGTTCAAGTACACCGTGCAGCTGCGCAACAGCAAGAGCGAGGCCGAGGGCGACGACTACGACCGCCTGGTGAGCCCCGTGCTGAACAGCCACAACGGCTTCTTCGACAGCAGCGACTACAAGGAGAACGAGAAGAGCGACGACATCATCGACGACAAGCAGATCATGCCCAAGGACGCCGACGCCAACGGCGCCTACTGCATCGCCCTGAAGGGCCTGTACGAGATCAACAAGATCAAGGAGAACTGGAGCGACGACAAGAAGCTGAAGGAGAGCGAGCTGTACATCGGCGTGACCGAGTGGCTGGACTACATCCAGAACCGCCGCTTCGAG
>SEQ ID NO:15 SaCpf1核苷酸序列
ATGAACGACATTGAAGGACTGAAGGAGGAATTTCTGAAGATTTCCCTGGAGAACTTTGAGGGTATCTACATCAGCAATAAGAAGCTGAATGAGATTTCTAACCGGAAATTCGGCGACTACAACAGCATTAACATGATGATCAAGCAGAGCATGAATGAGAAGGGTATTCTGTCCAAGAAGGAGATCAACGAACTCATACCCGATCTGGAGAACATCAACAAGCCTAAGGTCAAGTCTTTCAATCTGAGTTTCATCTTCGAGAACCTCACCAAAGAGCATAAGGAACTGATCATCGACTACATCAGGGAGAACATCTGCAACGTGATTGAGAATGTCAAGATTACAATAGAGAAATACAGGAACATTGATAACAAGATCGAGTTTAAGAACAATGCTGAGAAGGTGTCCAAGATTAAGGAAATGCTGGAGAGCATCAACGAGCTGTGTAAACTGATTAAGGAGTTCAACACAGACGAGATTGAGAAGAACAATGAGTTCTATAACATTCTCAATAAGAATTTCGAAATCTTCGAATCCAGTTACAAGGTTCTGAATAAGGTCCGGAACTTTGTGACCAAGAAAGAAGTTATTGAGAATAAGATGAAGCTGAATTTCTCAAACTATCAGCTCGGGAACGGCTGGCACAAGAACAAAGAGAAGGACTGTAGCATTATCCTGTTTCGCAAGAGAAACAATGAGCGCTGGATATACTACCTCGGGATTCTGAAGCATGGTACAAAGATCAAAGAGAACGACTATCTCTCATCAGTGGACACAGGGTTCTACAAGATGGACTATTACGCACAGAATAGTCTGTCAAAGATGATTCCAAAGTGCAGCATTACAGTTAAGAACGTCAAGAACGCTCCAGAGGACGAGTCTGTCATTCTGAACGATTCCAAGAAGTTCAATGAACCTCTGGAGATCACACCCGAGATAAGAAAGCTGTACGGCAATAACGAGCACATCAAGGGCGACAAATTCAAGAAAGAGTCCCTGGTCAAGTGGATAGACTTCTGTAAGGAGTTTCTGCTGAAATACAAGAGTTTCGAGAAGGCCAAGAAGGAAATCCTCAAACTCAAAGAATCAAACCTCTATGAGAACCTGGAGGAGTTCTACAGCGATGCCGAGGAGAAAGCCTACTTCCTGGAGTTCATCAACATTGATGAGGACAAGATCAAGAAGCTGGTCAAGGAGAAGAATCTGTACCTCTTCCAGATATACAACAAAGACTTTAGCGCTTATTCAACCGGCAATAAGAACCTCCATACCATGTATTTCGAGGAGCTGTTCACTGATGAGAATCTCAAGAAGCCTGTGTTCAAGCTCAACGGAAACACAGAAGTGTTCTACAGAATTGCCAGTAGTAAGCCAAAGATCGTGCACAACAAGGGAGAGAAACTGGTCAACAAGACATACCTCGACGATGGCATCATTAAGACAATTCCAGATTCTGTGTACGAAGAGATTTCAGAGAAAGTCAAGAACAACGAGGACTACTCCAAGCTGCTCGAAGAGAATAACATCAAGAATCTGGAAATCAAGGTGGCAACTCATGAAATTGTGAAAGATAAACGCTACTTTGAGAACAAATTCCTGTTCTATCTGCCCATCACACTGAACAAGAAGGTGTCTAACAAGAATACCAATAAGAACATCAACAAGAATGTCATCGACGAGATTAAGGATTGTAACGAGTATAACGTCATTGGGATTGATAGAGGTGAGCGGAACCTGATTAGCCTGTGCATCATCAATCAGAACGGCGAAATCATACTCCAGAAGGAGATGAACATCATTCAATCTAGCGACAAGTATAACGTGGACTATAACGAGAAACTGGAGATCAAGTCTAAGGAGAGGGACAACGCTAAGAAGAACTGGAGCGAAATCGGGAAGATAAAGGACCTGAAGAGCGGATACCTCTCCGCTGTGGTGCACGAGATTGTCAAGCTGGCAATCGAATACAACGCTGTGATCATTCTGGAGGACCTCAACAACGGGTTTAAGAACTCAAGGAAGAAGGTTGATAAACAGATATACCAGAAATTCGAGAGGGCTCTGATTGAGAAGCTGCAATTTCTCATCTTCAAGAACTATGACAAGAATGAGAAAGGAGGACTCAGGAATGCTTTCCAGCTGACTCCCGAACTGAAGAACATCACCAAGGTGGCATCCCAGCAGGGCATCATAATCTACACAAATCCAGCCTATACCAGCAAGATCGACCCTACCACAGGTTATGCAAACATAATCAAGAAGAGCAACAATAACGAGGAGTCTATCGTGAAGGCAATCGACAAGATTTCCTATGACAAAGAGAAAGACATGTTCTACTTCGACATTAACCTGTCAAATAGCTCCTTTAACCTGACCGTTAAGAATGTGCTCAAGAAGGAATGGCGCATCTACACCAATGGCGAGAGAATCATCTATAAGGATCGCAAGTACATTACACTGAATATCACACAAGAGATGAAAGACATCCTGTCAAAGTGCGGCATTGATTATCTGAACATCGACAACCTGAAACAGGACATTCTCAAGAACAAACTGCATAAGAAGGTCTACTATATCTTCGAGCTGGCTAACAAGATGCGCAACGAGAATAAGGACGTGGATTACATAATCAGCCCAGTGCTGAATAAGGATGGAAAGTTCTTCATGACCCAAGAGATCAACGAGCTGACACCAAAGGATGCCGACCTGAACGGAGCCTATAACATAGCTCTGAAGGGCAAGCTGATGATTGACAACCTGAATAAGAAGGAGAAGTTTGTCTTTCTCAGCAATGAAGATTGGCTCAATTTCATCCAGGGTCGG
>SEQ ID NO:16 HkCpf1核苷酸序列
ATGTTCGAGAAGCTGAGCAACATCGTGAGCATCAGCAAGACCATCCGCTTCAAGCTGATCCCCGTGGGCAAGACCCTGGAGAACATCGAGAAGCTGGGCAAGCTGGAGAAGGACTTCGAGCGCAGCGACTTCTACCCCATCCTGAAGAACATCAGCGACGACTACTACCGCCAGTACATCAAGGAGAAGCTGAGCGACCTGAACCTGGACTGGCAGAAGCTGTACGACGCCCACGAGCTGCTGGACAGCAGCAAGAAGGAGAGCCAGAAGAACCTGGAGATGATCCAGGCCCAGTACCGCAAGGTGCTGTTCAACATCCTGAGCGGCGAGCTGGACAAGAGCGGCGAGAAGAACAGCAAGGACCTGATCAAGAACAACAAGGCCCTGTACGGCAAGCTGTTCAAGAAGCAGTTCATCCTGGAGGTGCTGCCCGACTTCGTGAACAACAACGACAGCTACAGCGAGGAGGACCTGGAGGGCCTGAACCTGTACAGCAAGTTCACCACCCGCCTGAAGAACTTCTGGGAGACCCGCAAGAACGTGTTCACCGACAAGGACATCGTGACCGCCATCCCCTTCCGCGCCGTGAACGAGAACTTCGGCTTCTACTACGACAACATCAAGATCTTCAACAAGAACATCGAGTACCTGGAGAACAAGATCCCCAACCTGGAGAACGAGCTGAAGGAGGCCGACATCCTGGACGACAACCGCAGCGTGAAGGACTACTTCACCCCCAACGGCTTCAACTACGTGATCACCCAGGACGGCATCGACGTGTACCAGGCCATCCGCGGCGGCTTCACCAAGGAGAACGGCGAGAAGGTGCAGGGCATCAACGAGATCCTGAACCTGACCCAGCAGCAGCTGCGCCGCAAGCCCGAGACCAAGAACGTGAAGCTGGGCGTGCTGACCAAGCTGCGCAAGCAGATCCTGGAGTACAGCGAGAGCACCAGCTTCCTGATCGACCAGATCGAGGACGACAACGACCTGGTGGACCGCATCAACAAGTTCAACGTGAGCTTCTTCGAGAGCACCGAGGTGAGCCCCAGCCTGTTCGAGCAGATCGAGCGCCTGTACAACGCCCTGAAGAGCATCAAGAAGGAGGAGGTGTACATCGACGCCCGCAACACCCAGAAGTTCAGCCAGATGCTGTTCGGCCAGTGGGACGTGATCCGCCGCGGCTACACCGTGAAGATCACCGAGGGCAGCAAGGAGGAGAAGAAGAAGTACAAGGAGTACCTGGAGCTGGACGAGACCAGCAAGGCCAAGCGCTACCTGAACATCCGCGAGATCGAGGAGCTGGTGAACCTGGTGGAGGGCTTCGAGGAGGTGGACGTGTTCAGCGTGCTGCTGGAGAAGTTCAAGATGAACAACATCGAGCGCAGCGAGTTCGAGGCCCCCATCTACGGCAGCCCCATCAAGCTGGAGGCCATCAAGGAGTACCTGGAGAAGCACCTGGAGGAGTACCACAAGTGGAAGCTGCTGCTGATCGGCAACGACGACCTGGACACCGACGAGACCTTCTACCCCCTGCTGAACGAGGTGATCAGCGACTACTACATCATCCCCCTGTACAACCTGACCCGCAACTACCTGACCCGCAAGCACAGCGACAAGGACAAGATCAAGGTGAACTTCGACTTCCCCACCCTGGCCGACGGCTGGAGCGAGAGCAAGATCAGCGACAACCGCAGCATCATCCTGCGCAAGGGCGGCTACTACTACCTGGGCATCCTGATCGACAACAAGCTGCTGATCAACAAGAAGAACAAGAGCAAGAAGATCTACGAGATCCTGATCTACAACCAGATCCCCGAGTTCAGCAAGAGCATCCCCAACTACCCCTTCACCAAGAAGGTGAAGGAGCACTTCAAGAACAACGTGAGCGACTTCCAGCTGATCGACGGCTACGTGAGCCCCCTGATCATCACCAAGGAGATCTACGACATCAAGAAGGAGAAGAAGTACAAGAAGGACTTCTACAAGGACAACAACACCAACAAGAACTACCTGTACACCATCTACAAGTGGATCGAGTTCTGCAAGCAGTTCCTGTACAAGTACAAgggcccCAACAAGGAGAGCTACAAGGAGATGTACGACTTCAGCACCCTGAAGGACACCAGCCTGTACGTGAACCTGAACGACTTCTACGCCGACGTGAACAGCTGCGCCTACCGCGTGCTGTTCAACAAGATCGACGAGAACACCATCGACAACGCCGTGGAGGACGGCAAGCTGCTGCTGTTCCAGATCTACAACAAGGACTTCAGCCCCGAGAGCAAGGGCAAGAAGAACCTGCACACCCTGTACTGGCTGAGCATGTTCAGCGAGGAGAACCTGCGCACCCGCAAGCTGAAGCTGAACGGCCAGGCCGAGATCTTCTACCGCAAGAAGCTGGAGAAGAAGCCCATCATCCACAAGGAGGGCAGCATCCTGCTGAACAAGATCGACAAGGAGGGCAACACCATCCCCGAGAACATCTACCACGAGTGCTACCGCTACCTGAACAAGAAGATCGGCCGCGAGGACCTGAGCGACGAGGCCATCGCCCTGTTCAACAAGGACGTGCTGAAGTACAAGGAGGCCCGCTTCGACATCATCAAGGACCGCCGCTACAGCGAGAGCCAGTTCTTCTTCCACGTGCCCATCACCTTCAACTGGGACATCAAGACCAACAAGAACGTGAACCAGATCGTGCAGGGCATGATCAAGGACGGCGAGATCAAGCACATCATCGGCATCGACCGCGGCGAGCGCCACCTGCTGTACTACAGCGTGATCGACCTGGAGGGCAACATCGTGGAGCAGGGCAGCCTGAACACCCTGGAGCAGAACCGCTTCGACAACAGCACCGTGAAGGTGGACTACCAGAACAAGCTGCGCACCCGCGAGGAGGACCGCGACCGCGCCCGCAAGAACTGGACCAACATCAACAAGATCAAGGAGCTGAAGGACGGCTACCTGAGCCACGTGGTGCACAAGCTGAGCCGCCTGATCATCAAGTACGAGGCCATCGTGATCATGGAGAACCTGAACCAGGGCTTCAAGCGCGGCCGCTTCAAGGTGGAGCGCCAGGTGTACCAGAAGTTCGAGCTGGCCCTGATGAACAAGCTGAGCGCCCTGAGCTTCAAGGAGAAGTACGACGAGCGCAAGAACCTGGAGCCCAGCGGCATCCTGAACCCCATCCAGGCCTGCTACCCCGTGGACGCCTACCAGGAGCTGCAGGGCCAGAACGGCATCGTGTTCTACCTGCCCGCCGCCTACACCAGCGTGATCGACCCCGTGACCGGCTTCACCAACCTGTTCCGCCTGAAGAGCATCAACAGCAGCAAGTACGAGGAGTTCATCAAGAAGTTCAAGAACATCTACTTCGACAACGAGGAGGAGGACTTCAAGTTCATCTTCAACTACAAGGACTTCGCCAAGGCCAACCTGGTGATCCTGAACAACATCAAGAGCAAGGACTGGAAGATCAGCACCCGCGGCGAGCGCATCAGCTACAACAGCAAGAAGAAGGAGTACTTCTACGTGCAGCCCACCGAGTTCCTGATCAACAAGCTGAAGGAGCTGAACATCGACTACGAGAACATCGACATCATCCCCCTGATCGACAACCTGGAGGAGAAGGCCAAGCGCAAGATCCTGAAGGCCCTGTTCGACACCTTCAAGTACAGCGTGCAGCTGCGCAACTACGACTTCGAGAACGACTACATCATCAGCCCCACCGCCGACGACAACGGCAACTACTACAACAGCAACGAGATCGACATCGACAAGACCAACCTGCCCAACAACGGCGACGCCAACGGCGCCTTCAACATCGCCCGCAAGGGCCTGCTGCTGAAGGACCGCATCGTGAACAGCAACGAGAGCAAGGTGGACCTGAAGATCAAGAACGAGGACTGGATCAACTTCATCATCAGC
>SEQ ID NO:17 AbCpf1核苷酸序列
ATGTTCAGCCTGGACTACTTCAGCCTGACCCTGAGCCAGCGCTACATCGACATCTACAACACCATGATCGGCGGCAACACCCTGGCCGACGGCACCAAGGTGCAGGGCATCAACGAGAACATCAACATCTACCGCCAGAAGAACAACATCGACCGCAAGAACCTGCCCACCCTGAAGCCCCTGCACAAGCAGCTGCTGAGCGACCGCGAGACCCTGAGCTGGATCCCCGAGGCCTTCAAGACCAAGGAGGAGGTGGTGGGCGCCATCGAGGACTTCTACAAGAACAACATCATCAGCTTCAAGTGCTGCGACAACATCGTGGACATCACCAAGCAGTTCATCGACATCTTCAGCCTGAACGAGGACTACGAGCTGAACAAGATCTTCATCAAGAACGACATCAGCATCACCAGCATCAGCCAGGACATCTTCAAGGACTACCGCATCATCAAGGAGGCCCTGTGGCAGAAGCACATCAACGAGAACCCCAAGGCCGCCAAGAGCAAGGACCTGACCGGCGACAAGGAGAAGTACTTCAGCCGCAAGAACAGCTTCTTCAGCTTCGAGGAGATCATCAGCAGCCTGAAGCTGATGGGCCGCAAGATCGACCTGTTCAGCTACTTCAAGGACAACGTGGAGTACCGCGCCCACAGCATCGAGACCACCTTCATCAAGTGGCAGAAGAACAAGAACGACAAGAAGACCACCAAGGAGCTGCTGGACAACATCCTGAACCTGCAGCGCGTGCTGAAGCCCCTGTACCTGAAGGCCGAGGTGGAGAAGGACATCCTGTTCTACAGCATCTTCGACATCTACTTCGAGAGCCTGAACGAGATCGTGAAGCTGTACAACAAGGTGCGCGACTTCGAGAGCAAGAAGCCCTACAGCCTGGAGAAGTTCAAGCTGAACTTCCAGAACAGCACCCTGCTGAGCGGCTGGGACGTGAACAAGGAGCCCGACAACACCAGCATCCTGCTGAAGAAGGACGGCCTGTACTACCTGGGCATCATGGACAAGAAGCACAACCGCGTGTTCAAGAACCTGGAGAGCAGCAAGGGCGGCTACGAGAAGATCGAGTACAAGCTGCTGAGCGGCCCCAACAAGATGCTGCCCAAGGTGTTCTTCAGCAACAAGAGCATCGGCTACTACAACCCCAGCCCCGCCCTGCTGGAGAAGTACAAGAGCGGCGTGCACAAGAAGGGCGAGAGCTTCGACCTGAACTTCTGCCACGAGCTGATCGACTTCTTCAAGGCCAGCATCGACAAGCACGAGGACTGGAAGAACTTCAACTTCAAGTTCAGCGACACCAGCGAGTACGCCGACATCAGCGGCTTCTACCGCGAGGTGGAGCAGCAGGGCTACAAGATCACCTTCAAGAACATCGACGAGGAGTTCATCAACACCCTGATCAACGAGGGCAAGCTGTACCTGTTCCAGATCTACAACAAGGACTTCAGCACCTTCAGCAAGGGCACCAAGAACCTGCACACCCTGTACTGGGAGATGATCTTCAACGAGGAGAACCTGAAGAACGTGGTGTACAAGCTGAACGGCGAGGCCGAGATCTTCTACCGCAAGAAGAGCATCGAGTACAGCGAGGACAAGATGAAGTACGGCCACCACTACGAGGAGCTGAAGGACAAGTTCAACTACCCCATCATCAAGGACAAGCGCTTCAccatggACAAGTTCCAGTTCCACGTGCCCATCACCATGAACTTCAAGGCCACCGGCCGCAGCTACATCAACGAGGAGGTGAACGACTTCCTGCGCCAGAACAGCAAGGACGTGAAGATCATCGGCATCAACCGCGGCGAGCGCCACCTGATCTACCTGACCATGATCAACGCCAAGGGCGAGATCATCCAGCAGTACAGCCTGAACGAGATCGTGAACAGCTACAACAACAAGAACTTCACCGTGAACTACAACGAGAAGCTGAGCAAGAAGGAGGGCGAGCGCGCCATCGCCCGCGAGAACTGGGGCGTGGTGGAGAACATCAAGGAGCTGAAGGAGGGCTACCTGAGCCACGCCATCCACACCATCAGCAACCTGATCGTGGAGAACAACGCCATCGTGGTGCTGGAGGACCTGAACTTCGAGTTCAAGCGCGAGCGCCTGAAGGTGGAGAAGAGCATCTACCAGAAGTTCGAGAAGATGCTGATCGACAAGCTGAACTACCTGGTGGACAAGAAGAAGGACATCAACGAGAACGGCGGCCTGCTGAAGGCCCTGCAGCTGACCAACAAGTTCGAGAGCTTCGAGAAGATCGGCAAGCAGAACGGCTTCCTGTTCTTCGTGAACGCCTGGAACATCACCAAGATCTGCCCCGTGACCGGCTTCGTGAGCCTGTTCGACACCCGCTACCAGAGCGTGGACAAGGCCCGCGAGTTCTTCAGCAAGTTCGACAGCATCAAGTACAACGAGGAGAAGGAGCACTACGAGTTCGTGTTCGACTACAGCAACTTCACCGACAAGGCCAAGGACACCAAGACCAAGTGGACCGTGTGCAGCTACGGCACCCGCATCAAGACCTTCCGCAACAGCGAGAAGAACAACAACTGGGACAACAAGACCGTGAGCCCCACCGAGGACCTGAGCAAGCTGCTGAAGAGCTGCGACCGCGACATCAAGGAGTTCATCATCAGCCAGGACAAGAAGGAGTTCTTCGTGGAGCTGCTGGAGATCTTCAGCCTGATCGTGCAGATGAAGAACAGCATCATCAACAGCGAGATCGACTACATCATCAGCCCCGTGGCCAACGAGAACGGCGAGTTCTTCGACAGCCGCTTCGCCAACAGCAGCCTGCCCAAGAACGCCGACGCCAACGCCGCCTACAACACCGCCCGCAAGGGCCTGATGCTGCTGGAGAAGATCCGCGACAGCGAGATCGGCAAGAAGATCGACATGAAGATCACCAACACCGAGTGGCTGAACTTCGTGCAGGAGCGC
>SEQ ID NO:18 BoCpf1核苷酸序列
AGGAAATTCAATGAGTTCGTGGGTCTGTATCCTATTAGTAAGACCCTCAGGTTCGAGCTGAAACCAATCGGCAAGACACTGGAGCATATCCAGAGAAACAAGCTCCTGGAGCATGATGCCGTTCGCGCTGACGACTATGTCAAAGTGAAGAAGATCATTGACAAATACCATAAGTGTCTGATAGATGAGGCCCTGTCTGGATTCACCTTCGATACAGAAGCCGATGGGAGAAGCAATAACAGCCTGTCTGAGTACTATCTGTACTACAATCTCAAGAAGAGAAATGAGCAGGAACAGAAGACTTTCAAGACAATCCAGAACAATCTGCGGAAGCAGATTGTCAACAAGCTGACCCAGAGTGAGAAGTATAAGAGAATTGATAAGAAAGAACTCATCACCACTGATCTGCCAGACTTCCTGACTAATGAAAGCGAGAAAGAACTGGTGGAGAAGTTCAAGAACTTTACTACCTACTTTACCGAATTTCACAAGAACCGCAAGAATATGTACTCCAAGGAAGAGAAGTCCACCGCAATCGCTTTCCGCCTGATTAACGAGAACCTGCCAAAGTTTGTCGATAACATCGCTGCTTTCGAGAAAGTTGTGTCCTCACCTCTCGCAGAGAAGATCAATGCCCTGTACGAGGACTTTAAGGAGTATCTGAATGTGGAAGAAATCTCACGGGTGTTTAGACTCGACTATTACGATGAACTGCTGACACAGAAACAGATTGATCTGTACAACGCTATCGTCGGTGGTCGGACAGAGGAGGACAACAAGATCCAGATAAAGGGACTGAACCAGTATATCAACGAATACAATCAGCAGCAGACAGATCGGTCTAATCGGCTGCCAAAGCTGAAACCTCTCTATAAGCAAATTCTCTCCGACAGAGAGAGCGTGTCATGGCTGCCTCCCAAGTTTGATAGCGATAAGAATCTGCTGATTAAGATCAAAGAATGCTACGACGCCCTGTCCGAGAAGGAGAAAGTGTTTGACAAGCTGGAAAGTATTCTCAAGAGCCTGTCAACCTATGACCTGTCTAAGATATACATTTCTAACGACTCTCAGCTGTCTTACATTAGCCAGAAGATGTTTGGACGGTGGGACATCATATCTAAGGCCATCAGGGAGGATTGTGCTAAGAGGAATCCTCAGAAATCTCGGGAATCCCTGGAGAAGTTCGCCGAGAGGATAGATAAGAAACTCAAGACCATCGACTCCATCAGCATCGGCGATGTGGATGAGTGCCTGGCCCAGCTGGGTGAAACCTACGTTAAGCGGGTGGAGGATTACTTTGTGGCAATGGGCGAATCCGAGATCGACGATGAGCAGACAGATACCACCTCCTTCAAGAAGAACATAGAGGGAGCATACGAGTCCGTCAAGGAGCTGCTGAACAACGCTGATAACATTACAGACAATAACCTGATGCAGGACAAGGGCAATGTGGAGAAGATCAAGACCCTGCTGGATGCAATCAAGGACCTCCAGCGGTTCATTAAGCCACTCCTGGGTAAAGGTGACGAAGCAGACAAGGACGGCGTGTTCTACGGTGAGTTTACATCCCTGTGGACCAAACTCGATCAGGTTACTCCTCTCTATAACATGGTTCGGAATTACCTCACTTCAAAGCCTTATAGTACAAAGAAGATTAAGCTGAACTTTGAGAACAGCACTCTCATGGATGGATGGGATCTGAATAAGGAGCCAGATAACACTACCGTGATATTCTGCAAAGATGGGCTGTATTACCTGGGCATTATGGGTAAGAAGTACAATAGAGTGTTTGTCGATAGAGAGGACCTGCCTCACGACGGCGAGTGCTACGACAAGATGGAGTACAAACTGCTGCCAGGTGCCAATAAGATGCTCCCTAAAGTGTTCTTCTCCGAAACTGGTATTCAACGGTTCCTCCCATCCGAGGAACTCCTGGGCAAGTACGAAAGAGGCACACATAAGAAAGGAGCTGGGTTTGACCTGGGAGACTGTAGAGCACTGATTGATTTCTTTAAGAAGAGCATTGAAAGGCACGATGATTGGAAGAAGTTTGACTTCAAGTTCAGCGACACAAGCACATACCAGGACATAAGTGAGTTCTATAGAGAAGTGGAGCAGCAGGGCTATAAGATGTCCTTTAGAAAGGTTTCTGTGGACTATATCAAGTCTCTGGTGGAAGAAGGTAAGCTGTATCTGTTCCAGATATACAACAAAGACTTCTCCGCACATTCCAAAGGGACACCTAACATGCACACTCTCTATTGGAAGATGCTGTTCGATGAGGAGAACCTGAAGGACGTGGTGTATAAGCTGAATGGAGAAGCTGAGGTGTTCTTCCGGAAATCTAGCATCACAGTGCAAAGCCCAACACACCCTGCTAATTCACCTATCAAGAACAAGAACAAGGATAATCAGAAGAAGGAATCAAAGTTTGAGTACGATCTCATCAAGGACCGCAGGTATACCGTGGACAAGTTCCTCTTTCACGTGCCTATAACCATGAATTTCAAGTCCGTCGGTGGCTCTAACATCAATCAGCTCGTGAAGCGGCACATTCGGTCCGCAACCGACCTCCACATCATCGGCATAGATAGAGGAGAGCGGCATCTGCTGTACCTGACCGTTATCGACAGCAGAGGTAACATCAAAGAACAGTTCAGTCTGAACGAGATAGTGAACGAGTATAACGGGAACACCTATCGGACCGATTACCACGAGCTGCTCGATACCAGAGAAGGCGAGAGAACAGAAGCTAGACGGAACTGGCAGACTATACAGAACATACGCGAGCTGAAAGAGGGATACCTCTCCCAGGTGATTCACAAGATCAGCGAGCTGGCTATCAAATACAACGCCGTGATCGTGCTGGAGGATCTCAATTTCGGCTTTATGAGGTCACGCCAGAAAGTGGAGAAGCAGGTGTATCAGAAATTCGAGAAGATGCTGATCGACAAGCTGAACTACCTGGTCGATAAGAAGAAACCTGTCGCTGAAACCGGAGGGCTGCTGAGAGCCTACCAGCTGACCGGAGAATTTGAGTCCTTTAAGACCCTGGGAAAGCAGAGCGGCATTCTGTTCTACGTTCCCGCTTGGAACACCAGTAAGATTGATCCTGTGACTGGGTTTGTCAATCTCTTCGATACCCACTATGAGAACATTGAGAAGGCTAAGGTGTTCTTTGACAAATTCAAGAGTATCAGGTACAATTCCGACAAGGATTGGTTCGAATTTGTCGTGGACGACTATACAAGGTTCTCACCTAAGGCAGAGGGCACCAGGAGGGACTGGACTATCTGCACCCAGGGAAAGCGCATTCAGATATGTCGGAACCACCAGCGCAATAACGAGTGGGAGGGTCAAGAGATTGACCTGACCAAAGCATTCAAGGAGCACTTTGAAGCCTATGGCGTTGACATCTCAAAGGACCTGAGGGAGCAGATCAATACTCAGAACAAGAAAGAGTTCTTCGAAGAACTGCTGCGCCTGCTGCGGCTCACCCTGCAAATGAGGAACTCCATGCCAAGTTCTGACATCGACTACCTGATCAGCCCAGTCGCCAACGACACCGGATGCTTCTTCGATTCAAGAAAGCAGGCCGAGCTGAAAGAGAATGCAGTTCTCCCTATGAACGCTGATGCTAATGGTGCATACAACATCGCTAGAAAGGGACTGCTGGCAATCCGCAAGATGAAACAAGAAGAGAACGACAGTGCTAAGATCAGCCTCGCTATATCCAACAAGGAGTGGCTCAAGTTTGCTCAGACTAAGCCATATCTGGAGGAC
>SEQ ID NO:19 OsCpf1核苷酸序列
ATGGAGACCGAGATCCTGAAGTACGACTTCTTCGAGCGCGAGGGCAAGTACATGTACTACGACGGCCTGACCAAGCAGTACGCCCTGAGCAAGACCATCCGCAACGAGCTGGTGCCCATCGGCAAGACCCTGGACAACATCAAGAAGAACCGCATCCTGGAGGCCGACATCAAGCGCAAGAGCGACTACGAGCACGTGAAGAAGCTGATGGACATGTACCACAAGAAGATCATCAACGAGGCCCTGGACAACTTCAAGCTGAGCGTGCTGGAGGACGCCGCCGACATCTACTTCAACAAGCAGAACGACGAGCGCGACATCGACGCCTTCCTGAAGATCCAGGACAAGCTGCGCAAGGAGATCGTGGAGCAGCTGAAGGGCCACACCGACTACAGCAAGGTGGGCAACAAGGACTTCCTGGGCCTGCTGAAGGCCGCCAGCACCGAGGAGGACCGCATCCTGATCGAGAGCTTCGACAACTTCTACACCTACTTCACCAGCTACAACAAGGTGCGCAGCAACCTGTACAGCGCCGAGGACAAGAGCAGCACCGTGGCCTACCGCCTGATCAACGAGAACCTGCCCAAGTTCTTCGACAACATCAAGGCCTACCGCACCGTGCGCAACGCCGGCGTGATCAGCGGCGACATGAGCATCGTGGAGCAGGACGAGCTGTTCGAGGTGGACACCTTCAACCACACCCTGACCCAGTACGGCATCGACACCTACAACCACATGATCGGCCAGCTGAACAGCGCCATCAACCTGTACAACCAGAAGATGCACGGCGCCGGCAGCTTCAAGAAGCTGCCCAAGATGAAGGAGCTGTACAAGCAGCTGCTGACCGAGCGCGAGGAGGAGTTCATCGAGGAGTACACCGACGACGAGGTGCTGATCACCAGCGTGCACAACTACGTGAGCTACCTGATCGACTACCTGAACAGCGACAAGGTGGAGAGCTTCTTCGACACCCTGCGCAAGAGCGACGGCAAGGAGGTGTTCATCAAGAACGACGTGAGCAAGACCACCATGAGCAACATCCTGTTCGACAACTGGAGCACCATCGACGACCTGATCAACCACGAGTACGACAGCGCCCCCGAGAACGTGAAGAAGACCAAGGACGACAagtactTCGAGAAGCGCCAGAAGGACCTGAAGAAGAACAAGAGCTACAGCCTGAGCAAGATCGCCGCCCTGTGCCGCGACACCACCATCCTGGAGAAGTACATCCGCCGCCTGGTGGACGACATCGAGAAGATCTACACCAGCAACAACGTGTTCAGCGACATCGTGCTGAGCAAGCACGACCGCAGCAAGAAGCTGAGCAAGAACACCAACGCCGTGCAGGCCATCAAGAACATGCTGGACAGCATCAAGGACTTCGAGCACGACGTGATGCTGATCAACGGCAGCGGCCAGGAGATCAAGAAGAACCTGAACGTGTACAGCGAGCAGGAGGCCCTGGCCGGCATCCTGCGCCAGGTGGACCACATCTACAACCTGACCCGCAACTACCTGACCAAGAAGCCCTTCAGCACCGAGAAGATCAAGCTGAACTTCAACCGCCCCACCTTCCTGGACGGCTGGGACAAGAACAAGGAGGAGGCCAACCTGGGCATCCTGCTGATCAAGGACAACCGCTACTACCTGGGCATCATGAACACCAGCAGCAACAAGGCCTTCGTGAACCCCCCCAAGGCCATCAGCAACGACATCTACAAGAAGGTGGACTACAAGCTGCTGCCCGGCCCCAACAAGATGCTGCCCAAGGTGTTCTTCGCCACCAAGAACATCGCCTACTACGCCCCCAGCGAGGAGCTGCTGAGCAAGTACCGCAAGGGCACCCACAAGAAGGGCGACAGCTTCAGCATCGACGACTGCCGCAACCTGATCGACTTCTTCAAGAGCAGCATCAACAAGAACACCGACTGGAGCACCTTCGGCTTCAACTTCAGCGACACCAACAGCTACAACGACATCAGCGACTTCTACCGCGAGGTGGAGAAGCAGGGCTACAAGCTGAGCTTCACCGACATCGACGCCTGCTACATCAAGGACCTGGTGGACAACAACGAGCTGTACCTGTTCCAGATCTACAACAAGGACTTCAGCCCCTACAGCAAGGGCAAGCTGAACCTGCACACCCTGTACTTCAAGATGCTGTTCGACCAGCGCAACCTGGACAACGTGGTGTACAAGCTGAACGGCGAGGCCGAGGTGTTCTACCGCCCCGCCAGCATCGAGAGCGACGAGCAGATCATCCACAAGAGCGGCCAGAACATCAAGAACAAGAACCAGAAGCGCAGCAACTGCAAGAAGACCAGCACCTTCGACTACGACATCGTGAAGGACCGCCGCTACTGCAAGGACAAGTTCATGCTGCACCTGCCCATCACCGTGAACTTCGGCACCAACGAGAGCGGCAAGTTCAACGAGCTGGTGAACAACGCCATCCGCGCCGACAAGGACGTGAACGTGATCGGCATCGAccgcggCGAGCGCAACCTGCTGTACGTGGTGGTGGTGGACCCCTGCGGCAAGATCATCGAGCAGATCAGCCTGAACACCATCGTGGACAAGGAGTACGACATCGAGACCGACTACCACCAGCTGCTGGACGAGAAGGAGGGCAGCCGCGACAAGGCCCGCAAGGACTGGAACACCATCGAGAACATCAAGGAGCTGAAGGAGGGCTACCTGAGCCAGGTGGTGAACATCATCGCCAAGCTGGTGCTGAAGTACGACGCCATCATCTGCCTGGAGGACCTGAACTTCGGCTTCAAGCGCGGCCGCCAGAAGGTGGAGAAGCAGGTGTACCAGAAGTTCGAGAAGATGCTGATCGACAAGATGAACTACCTGGTGCTGGACAAGAGCCGCAAGCAGGAGAGCCCCCAGAAGCCCGGCGGCGCCCTGAACGCCCTGCAGCTGACCAGCGCCTTCAAGAGCTTCAAGGAGCTGGGCAAGCAGACCGGCATCATCTACTACGTGCCCGCCTACCTGACCAGCAAGATCGACCCCACCACCGGCTTCGCCAACCTGTTCTACATCAAGTACGAGAGCGTGGACAAGGCCCGCGACTTCTTCAGCAAGTTCGACTTCATCCGCTACAACCAGATGGACAACTACTTCGAGTTCGGCTTCGACTACAAGAGCTTCACCGAGCGCGCCAGCGGCTGCAAGAGCAAGTGGATCGCCTGCACCAACGGCGAGCGCATCGTGAAGTACCGCAACAGCGACAAGAACAACAGCTTCGACGACAAGACCGTGATCCTGACCGACGAGTACCGCAGCCTGTTCGACAAGTACCTGCAGAACTACATCGACGAGGACGACCTGAAGGACCAGATCCTGCAGATCGACAGCGCCGACTTCTACAAGAACCTGATCAAGCTGTTCCAGCTGACCCTGCAGATGCGCAACAGCAGCAGCGACGGCAAGCGCGACTACATCATCAGCCCCGTGAAGAACTACCGCGAGGAGTTCTTCTGCAGCGAGTTCAGCGACGACACCTTCCCCCGCGACGCCGACGCCAACGGCGCCTACAACATCGCCCGCAAGGGCCTGTGGGTGATCAAGCAGATCCGCGAGACCAAGAGCGGCACCAAGATCAACCTGGCCATGAGCAACAGCGAGTGGCTGGAGTACGCCCAGTGCAACCTGCTG
>SEQ ID NO:20 BsCpf1核苷酸序列
ATGTACTACCAGAACCTGACCAAGAAGTACCCCGTGAGCAAGACCATCCGCAACGAGCTGATCCCCATCGGCAAGACCCTGGAGAACATCCGCAAGAACAACATCCTGGAGAGCGACGTGAAGCGCAAGCAGGACTACGAGCACGTGAAGGGCATCATGGACGAGTACCACAAGCAGCTGATCAACGAGGCCCTGGACAACTACATGCTGCCCAGCCTGAACCAGGCCGCCGAGATCTACCTGAAGAAGCACGTGGACGTGGAGGACCGCGAGGAGTTCAAGAAGACCCAGGACCTGCTGCGCCGCGAGGTGACCGGCCGCCTGAAGGAGCACGAGAACTACACCAAGATCGGCAAGAAGGACATCCTGGACCTGCTGGAGAAGCTGCCCAGCATCAGCGAGGAGGACTACAACGCCCTGGAGAGCTTCCGCAACTTCTACACCTACTTCACCAGCTACAACAAGGTGCGCGAGAACCTGTACAGCGACGAGGAGAAGAGCAGCACCGTGGCCTACCGCCTGATCAACGAGAACCTGCCCAAGTTCCTGGACAACATCAAGAGCTACGCCTTCGTGAAGGCCGCCGGCGTGCTGGCCGACTGCATCGAGGAGGAGGAGCAGGACGCCCTGTTCATGGTGGAGACCTTCAACATGACCCTGACCCAGGAGGGCATCGACATGTACAACTACCAGATCGGCAAGGTGAACAGCGCCATCAACCTGTACAACCAGAAGAACCACAAGGTGGAGGAGTTCAAGAAGATCCCCAAGATGAAGGTGCTGTACAAGCAGATCCTGAGCGACCGCGAGGAGGTGTTCATCGGCGAGTTCAAGGACGACGAGACCCTGCTGAGCAGCATCGGCGCCTACGGCAACGTGCTGATGACCTACCTGAAGAGCGAGAAGATCAACATCTTCTTCGACGCCCTGCGCGAGAGCGAGGGCAAGAACGTGTACGTGAAGAACGACCTGAGCAAGACCACCATGAGCAACATCGTGTTCGGCAGCTGGAGCGCCTTCGACGAGCTGCTGAACCAGGAGTACGACCTGGCCAACGAGAACAAGAAGAAGGACGACAAGTACTTCGAGAAGCGCCAGAAGGAGCTGAAGAAGAACAAGAGCTACACCCTGGAGCAGATGAGCAACCTGAGCAAGGAGGACATCAGCCCCATCGAGAACTACATCGAGCGCATCAGCGAGGACATCGAGAAGATCTGCATCTACAACGGCGAGTTCGAGAAGATCGTGGTGAACGAGCACGACAGCAGCCGCAAGCTGAGCAAGAACATCAAGGCCGTGAAGGTGATCAAGGACTACCTGGACAGCATCAAGGAGCTGGAGCACGACATCAAGCTGATCAACGGCAGCGGCCAGGAGCTGGAGAAGAACCTGGTGGTGTACGTGGGCCAGGAGGAGGCCCTGGAGCAGCTGCGCCCCGTGGACAGCCTGTACAACCTGACCCGCAACTACCTGACCAAGAAGCCCTTCAGCACCGAGAAGGTGAAGCTGAACTTCAACAAGAGCACCCTGCTGAACGGCTGGGACAAGAACAAGGAGACCGACAACCTGGGCATCCTGTTCTTCAAGGACGGCAAGTACTACCTGGGCATCATGAACACCACCGCCAACAAGGCCTTCGTGAACCCCCCCGCCGCCAAGACCGAGAACGTGTTCAAGAAGGTGGACTACAAGCTGCTGCCCGGCAGCAACAAGATGCTGCCCAAGGTGTTCTTCGCCAAGAGCAACATCGGCTACTACAACCCCAGCACCGAGCTGTACAGCAACTACAAGAAGGGCACCCACAAGAAgggcccCAGCTTCAGCATCGACGACTGCCACAACCTGATCGACTTCTTCAAGGAGAGCATCAAGAAGCACGAGGACTGGAGCAAGTTCGGCTTCGAGTTCAGCGACACCGCCGACTACCGCGACATCAGCGAGTTCTACCGCGAGGTGGAGAAGCAGGGCTACAAGCTGACCTTCACCGACATCGACGAGAGCTACATCAACGACCTGATCGAGAAGAACGAGCTGTACCTGTTCCAGATCTACAACAAGGACTTCAGCGAGTACAGCAAGGGCAAGCTGAACCTGCACACCCTGTACTTCATGATGCTGTTCGACCAGCGCAACCTGGACAACGTGGTGTACAAGCTGAACGGCGAGGCCGAGGTGTTCTACCGCCCCGCCAGCATCGCCGAGAACGAGCTGGTGATCCACAAGGCCGGCGAGGGCATCAAGAACAAGAACCCCAACCGCGCCAAGGTGAAGGAGACCAGCACCTTCAGCTACGACATCGTGAAGGACAAGCGCTACAGCAAGTACAAGTTCACCCTGCACATCCCCATCACCATGAACTTCGGCGTGGACGAGGTGCGCCGCTTCAACGACGTGATCAACAACGCCCTGCGCACCGACGACAACGTGAACGTGATCGGCATCGACCGCGGCGAGCGCAACCTGCTGTACGTGGTGGTGATCAACAGCGAGGGCAAGATCCTGGAGCAGATCAGCCTGAACAGCATCATCAACAAGGAGTACGACATCGAGACCAACTACCACGCCCTGCTGGACGAGCGCGAGGACGACCGCAACAAGGCCCGCAAGGACTGGAACACCATCGAGAACATCAAGGAGCTGAAGACCGGCTACCTGAGCCAGGTGGTGAACGTGGTGGCCAAGCTGGTGCTGAAGTACAACGCCATCATCTGCCTGGAGGACCTGAACTTCGGCTTCAAGCGCGGCCGCCAGAAGGTGGAGAAGCAGGTGTACCAGAAGTTCGAGAAGATGCTGATCGAGAAGCTGAACTACCTGGTGATCGACAAGAGCCGCGAGCAGGTGAGCCCCGAGAAGATGGGCGGCGCCCTGAACGCCCTGCAGCTGACCAGCAAGTTCAAGAGCTTCGCCGAGCTGGGCAAGCAGAGCGGCATCATCTACTACGTGCCCGCCTACCTGACCAGCAAGATCGACCCCACCACCGGCTTCGTGAACCTGTTCTACATCAAGTACGAGAACATCGAGAAGGCCAAGCAGTTCTTCGACGGCTTCGACTTCATCCGCTTCAACAAGAAGGACGACATGTTCGAGTTCAGCTTCGACTACAAGAGCTTCACCCAGAAGGCCTGCGGCATCCGCAGCAAGTGGATCGTGTACACCAACGGCGAGCGCATCATCAAGTACCCCAACCCCGAGAAGAACAACCTGTTCGACGAGAAGGTGATCAACGTGACCGACGAGATCAAGGGCCTGTTCAAGCAGTACCGCATCCCCTACGAGAACGGCGAGGACATCAAGGAGATCATCATCAGCAAGGCCGAGGCCGACTTCTACAAGCGCCTGTTCCGCCTGCTGCACCAGACCCTGCAGATGCGCAACAGCACCAGCGACGGCACCCGCGACTACATCATCAGCCCCGTGAAGAACGACCGCGGCGAGTTCTTCTGCAGCGAGTTCAGCGAGGGCACCATGCCCAAGGACGCCGACGCCAACGGCGCCTACAACATCGCCCGCAAGGGCCTGTGGGTGCTGGAGCAGATCCGCCAGAAGGACGAGGGCGAGAAGGTGAACCTGAGCATGACCAACGCCGAGTGGCTGAAGTACGCCCAGCTGCACCTGCTG
>SEQ ID NO:21 PsCpf1核苷酸序列
GAGAACTTCAAGAACCTGTACCCCATCAACAAGACCCTGCGCTTCGAGCTGCGCCCCTACGGCAAGACCCTGGAGAACTTCAAGAAGAGCGGCCTGCTGGAGAAGGACGCCTTCAAGGCCAACAGCCGCCGCAGCATGCAGGCCATCATCGACGAGAAGTTCAAGGAGACCATCGAGGAGCGCCTGAAGTACACCGAGTTCAGCGAGTGCGACCTGGGCAACATGACCAGCAAGGACAAGAAGATCACCGACAAGGCCGCCACCAACCTGAAGAAGCAGGTGATCCTGAGCTTCGACGACGAGATCTTCAACAACTACCTGAAGCCCGACAAGAACATCGACGCCCTGTTCAAGAACGACCCCAGCAACCCCGTGATCAGCACCTTCAAGGGCTTCACCACCTACTTCGTGAACTTCTTCGAGATCCGCAAGCACATCTTCAAGGGCGAGAGCAGCGGCAGCATGGCCTACCGCATCATCGACGAGAACCTGACCACCTACCTGAACAACATCGAGAAGATCAAGAAGCTGCCCGAGGAGCTGAAGAGCCAGCTGGAGGGCATCGACCAGATCGACAAGCTGAACAACTACAACGAGTTCATCACCCAGAGCGGCATCACCCACTACAACGAGATCATCGGCGGCATCAGCAAGAGCGAGAACGTGAAGATCCAGGGCATCAACGAGGGCATCAACCTGTACTGCCAGAAGAACAAGGTGAAGCTGCCCCGCCTGACCCCCCTGTACAAGATGATCCTGAGCGACCGCGTGAGCAACAGCTTCGTGCTGGACACCATCGAGAACGACACCGAGCTGATCGAGATGATCAGCGACCTGATCAACAAGACCGAGATCAGCCAGGACGTGATCATGAGCGACATCCAGAACATCTTCATCAAGTACAAGCAGCTGGGCAACCTGCCCGGCATCAGCTACAGCAGCATCGTGAACGCCATCTGCAGCGACTACGACAACAACTTCGGCGACGGCAAGCGCAAGAAGAGCTACGAGAACGACCGCAAGAAGCACCTGGAGACCAACGTGTACAGCATCAACTACATCAGCGAGCTGCTGACCGACACCGACGTGAGCAGCAACATCAAGATGCGCTACAAGGAGCTGGAGCAGAACTACCAGGTGTGCAAGGAGAACTTCAACGCCACCAACTGGATGAACATCAAGAACATCAAGCAGAGCGAGAAGACCAACCTGATCAAGGACCTGCTGGACATCCTGAAGAGCATCCAGCGCTTCTACGACCTGTTCGACATCGTGGACGAGGACAAGAACCCCAGCGCCGAGTTCTACACCTGGCTGAGCAAGAACGCCGAGAAGCTtGACTTCGAGTTCAACAGCGTGTACAACAAGAGCCGCAACTACCTGACCCGCAAGCAGTACAGCGACAAGAAGATCAAGCTGAACTTCGACAGCCCCACCCTGGCCAAGGGCTGGGACGCCAACAAGGAGATCGACAACAGCACCATCATCATGCGCAAGTTCAACAACGACCGCGGCGACTACGACTACTTCCTGGGCATCTGGAACAAGAGCACCCCCGCCAACGAGAAGATCATCCCCCTGGAGGACAACGGCCTGTTCGAGAAGATGCAGTACAAGCTGTACCCCGACCCCAGCAAGATGCTGCCCAAGCAGTTCCTGAGCAAGATCTGGAAGGCCAAGCACCCCACCACCCCCGAGTTCGACAAGAAGTACAAGGAGGGCCGCCACAAGAAGGGCCCCGACTTCGAGAAGGAGTTCCTGCACGAGCTGATCGACTGCTTCAAGCACGGCCTGGTGAACCACGACGAGAAGTACCAGGACGTGTTCGGCTTCAACCTGCGCAACACCGAGGACTACAACAGCTACACCGAGTTCCTGGAGGACGTGGAGCGCTGCAACTACAACCTGAGCTTCAACAAGATCGCCGACACCAGCAACCTGATCAACGACGGCAAGCTGTACGTGTTCCAGATCTGGAGCAAGGACTTCAGCATCGACAGCAAGGGCACCAAGAACCTGAACACCATCTACTTCGAGAGCCTGTTCAGCGAGGAGAACATGATCGAGAAGATGTTCAAGCTGAGCGGCGAGGCCGAGATCTTCTACCGCCCCGCCAGCCTGAACTACTGCGAGGACATCATCAAGAAGGGCCACCACCACGCCGAGCTGAAGGACAAGTTCGACTACCCCATCATCAAGGACAAGCGCTACAGCCAGGACAAGTTCTTCTTCCACGTGCCCATGGTGATCAACTACAAGAGCGAGAAGCTGAACAGCAAGAGCCTGAACAACCGCACCAACGAGAACCTGGGCCAGTTCACCCACATCATCGGCATCGACCGCGGCGAGCGCCACCTGATCTACCTGACCGTGGTGGACGTGAGCACCGGCGAGATCGTGGAGCAGAAGCACCTGGACGAGATCATCAACACCGACACCAAGGGCGTGGAGCACAAGACCCACTACCTGAACAAGCTGGAGGAGAAGAGCAAGACCCGCGACAACGAGCGCAAGAGCTGGGAGGCCATCGAGACCATCAAGGAGCTGAAGGAGGGCTACATCAGCCACGTGATCAACGAGATCCAGAAGCTGCAGGAGAAGTACAACGCCCTGATCGTGATGGAGAACCTGAACTACGGCTTCAAGAACAGCCGCATCAAGGTGGAGAAGCAGGTGTACCAGAAGTTCGAGACCGCCCTGATCAAGAAGTTCAACTACATCATCGACAAGAAGGACCCCGAGACCTACATCCACGGCTACCAGCTGACCAACCCCATCACCACCCTGGACAAGATCGGCAACCAGAGCGGCATCGTGCTGTACATCCCCGCCTGGAACACCAGCAAGATCGACCCCGTGACCGGCTTCGTGAACCTGCTGTACGCCGACGACCTGAAGTACAAGAACCAGGAGCAGGCCAAGAGCTTCATCCAGAAGATCGACAACATCTACTTCGAGAACGGCGAGTTCAAGTTCGACATCGACTTCAGCAAGTGGAACAACCGCTACAGCATCAGCAAGACCAAGTGGACCCTGACCAGCTACGGCACCCGCATCCAGACCTTCCGCAACCCCCAGAAGAACAACAAGTGGGACAGCGCCGAGTACGACCTGACCGAGGAGTTCAAGCTGATCCTGAACATCGACGGCACCCTGAAGAGCCAGGACGTGGAGACCTACAAGAAGTTCATGAGCCTGTTCAAGCTGATGCTGCAGCTGCGCAACAGCGTGACCGGCACCGACATCGACTACATGATCAGCCCCGTGACCGACAAGACCGGCACCCACTTCGACAGCCGCGAGAACATCAAGAACCTGCCCGCCGACGCCGACGCCAACGGCGCCTACAACATCGCCCGCAAGGGCATCATGGCCATCGAGAACATCATGAACGGCATCAGCGACCCCCTGAAGATCAGCAACGAGGACTACCTGAAGTACATCCAGAACCAGCAGGAG
>SEQ ID NO:22 C6Cpf1核苷酸序列
ATGAAGAACGTGTTCGGCGGCTTCACCAACCTGTACAGCCTGACCAAGACCCTGCGCTTCGAGCTGAAGCCCACCAGCAAGACCCAGAAGCTGATGAAGCGCAACAACGTGATCCAGACCGACGAGGAGATCGACAAGCTGTACCACGACGAGATGAAGCCCATCCTGGACGAGATCCACCGCCGCTTCATCAACGACGCCCTGGCCCAGAAGATCTTCATCAGCGCCAGCCTGGACAACTTCCTGAAGGTGGTGAAGAACTACAAGGTGGAGAGCGCCAAGAAGAACATCAAGCAGAACCAGGTGAAGCTGCTGCAGAAGGAGATCACCATCAAGACCCTGGGCCTGCGCCGCGAGGTGGTGAGCGGCTTCATCACCGTGAGCAAGAAGTGGAAGGACAAGTACGTGGGCCTGGGCATCAAGCTGAAGGGCGACGGCTACAAGGTGCTGACCGAGCAGGCCGTGCTGGACATCCTGAAGATCGAGTTCCCCAACAAGGCCAAGTACATCGACAAGTTCCGCGGCTTCTGGACCTACTTCAGCGGCTTCAACGAGAACCGCAAGAACTACTACAGCGAGGAGGACAAGGCCACCAGCATCGCCAACCGCATCGTGAACGAGAACCTGAGCCGCTACATCGACAACATCATCGCCTTCGAGGAGATCCTGCAGAAGATCCCCAACCTGAAGAAGTTCAAGCAGGACCTGGACATCACCAGCTACAACTACTACCTGAACCAGGCCGGCATCGACAAGTACAACAAGATCATCGGCGGCTACATCGTGGACAAGGACAAGAAGATCCAGGGCATCAACGAGAAGGTGAACCTGTACACCCAGCAGACCAAGAAGAAGCTGCCCAAGCTGAAGTTCCTGTTCAAGCAGATCGGCAGCGAGCGCAAGGGCTTCGGCATCTTCGAGATCAAGGAGGGCAAGGAGTGGGAGCAGCTGGGCGACCTGTTCAAGCTGCAGCGCACCAAGATCAACAGCAACGGCCGCGAGAAGGGCCTGTTCGACAGCCTGCGCACCATGTACCGCGAGTTCTTCGACGAGATCAAGCGCGACAGCAACAGCCAGGCCCGCTACAGCCTGGACAAGATCTACTTCAACAAGGCCAGCGTGAACACCATCAGCAACAGCTGGTTCACCAACTGGAACAAGTTCGCCGAGCTGCTGAACATCAAGGAGGACAAGAAGAACGGCGAGAAGAAGATCCCCGAGCAGATCAGCATCGAGGACATCAAGGACAGCCTGAGCATCATCCCCAAGGAGAACCTGGAGGAGCTGTTCAAGCTGACCAACCGCGAGAAGCACGACCGCACCCGCTTCTTCGGCAGCAACGCCTGGGTGACCTTCCTGAACATCTGGCAGAACGAGATCGAGGAGAGCTTCAACAAGCTGGAGGAGAAGGAGAAGGACTTCAAGAAGAACGCCGCCATCAAGTTCCAGAAGAACAACCTGGTGCAGAAGAACTACATCAAGGAGGTGTGCGACCgcatgcTGGCCATCGAGCGCATGGCCAAGTACCACCTGCCCAAGGACAGCAACCTGAGCCGCGAGGAGGACTTCTACTGGATCATCGACAACCTGAGCGAGCAGCGCGAGATCTACAAGTACTACAACGCCTTCCGCAACTACATCAGCAAGAAGCCCTACAACAAGAGCAAGATGAAGCTGAACTTCGAGAACGGCAACCTGCTGGGCGGCTGGAGCGACGGCCAGGAGCGCAACAAGGCCGGCGTGATCCTGCGCAACGGCAACAAGTACTACCTGGGCGTGCTGATCAACCGCGGCATCTTCCGCACCGACAAGATCAACAACGAGATCTACCGCACCGGCAGCAGCAAGTGGGAGCGCCTGATCCTGAGCAACCTGAAGTTCCAGACCCTGGCCGGCAAGGGCTTCCTGGGCAAGCACGGCGTGAGCTACGGCAACATGAACCCCGAGAAGAGCGTGCCCAGCCTGCAGAAGTTCATCCGCGAGAACTACCTGAAGAAGTACCCCCAGCTGACCGAGGTGAGCAACACCAAGTTCCTGAGCAAGAAGGACTTCGACGCCGCCATCAAGGAGGCCCTGAAGGAGTGCTTCACCATGAACTTCATCAACATCGCCGAGAACAAGCTGCTGGAGGCCGAGGACAAGGGCGACCTGTACCTGTTCGAGATCACCAACAAGGACTTCAGCGGCAAGAAGAGCGGCAAGGACAACATCCACACCATCTACTGGAAGTACCTGTTCAGCGAGAGCAACTGCAAGAGCCCCATCATCGGCCTGAACGGCGGCGCCGAGATCTTCTTCCGCGAGGGCCAGAAGGACAAGCTGCACACCAAGCTGGACAAGAAGGGCAAGAAGGTGTTCGACGCCAAGCGCTACAGCGAGGACAAGCTGTTCTTCCACGTGAGCATCACCATCAACTACGGCAAGCCCAAGAACATCAAGTTCCGCGACATCATCAACCAGCTGATCACCAGCATGAACGTGAACATCATCGGCATCGACCGCGGCGAGAAGCACCTGCTGTACTACAGCGTGATCGACAGCAACGGCATCATCCTGAAGCAGGGCAGCCTGAACAAGATCCGCGTGGGCGACAAGGAGGTGGACTTCAACAAGAAGCTGACCGAGCGCGCCAACGAGATGAAGAAGGCCCGCCAGAGCTGGGAGCAGATCGGCAACATCAAGAACTTCAAGGAGGGCTACCTGAGCCAGGCCATCcacgagATCTACCAGCTGATGATCAAGTACAACGCCATCATCGTGCTGGAGGACCTGAACACCGAGTTCAAGGCCAAGCGCCTGAGCAAGGTGGAGAAGAGCGTGTACAAGAAGTTCGAGCTGAAGCTGGCCCGCAAGCTGAACCACCTGATCCTGAAGGACCGCAACACCAACGAGATCGGCGGCGTGCTGAAGGCCTACCAGCTGACCCCCACCATCGGCGGCGGCGACGTGAGCAAGTTCGAGAAGGCCAAGCAGTGGGGCATGATGTTCTACGTGCGCGCCAACTACACCAGCACCACCGACCCCGTGACCGGCTGGCGCAAGCACCTGTACATCAGCAACTTCAGCAACAACAGCGTGATCAAGAGCTTCTTCGACCCCACCAACCGCGACACCGGCATCGAGATCTTCTACAGCGGCAAGTACCGCAGCTGGGGCTTCCGCTACGTGCAGAAGGAGACCGGCAAGAAGTGGGAGCTGTTCGCCACCAAGGAGCTGGAGCGCTTCAAGTACAACCAGACCACCAAGCTGTGCGAGAAGATCAACCTGTACGACAAGTTCGAGGAGCTGTTCAAGGGCATCGACAAGAGCGCCGACATCTACAGCCAGCTGTGCAACGTGCTGGACTTCCGCTGGAAGAGCCTGGTGTACCTGTGGAACCTGCTGAACCAGATCCGCAACGTGGACAAGAACGCCGAGGGCAACAAGAACGACTTCATCCAGAGCCCCGTGTACCCCTTCTTCGACAGCCGCAAGACCGACGGCAAGACCGAGCCCATCAACGGCGACGCCAACGGCGCCCTGAACATCGCCCGCAAGGGCCTGATGCTGGTGGAGCGCATCAAGAACAACCCCGAGAAGTACGAGCAGCTGATCCGCGACACCGAGTGGGACGCCTGGATCCAGAACTTCAACAAGGTGAAC
>SEQ ID NO:23 PxCpf1核苷酸序列
ATGATCATCGGCCGCGACTTCAACATGTACTACCAGAACCTGACCAAGATGTACCCCATCAGCAAGACCCTGCGCAACGAGCTGATCCCCGTGGGCAAGACCCTGGAGAACATCCGCAAGAACGGCATCCTGGAGGCCGACATCCAGCGCAAGGCCGACTACGAGCACGTGAAGAAGCTGATGGACAACTACCACAAGCAGCTGATCAACGAGGCCCTGCAGGGCGTGCACCTGAGCGACCTGAGCGACGCCTACGACCTGTACTTCAACCTGAGCAAGGAGAAGAACAGCGTGGACGCCTTCAGCAAGTGCCAGGACAAGCTGCGCAAGGAGATCGTGAGCTTCCTGAAGAACCACGAGAACTTCCCCAAGATCGGCAACAAGGAGATCATCAAGCTGATCCAGAGCCTGAACGACAACGACGCCGACAACAACGCCCTGGACAGCTTCAGCAACTTCTACACCTACTTCAGCAGCTACAACGAGGTGCGCAAGAACCTGTACAGCGACGAGGAGAAGAGCAGCACCGTGGCCTACCGCCTGATCAACGAGAACCTGCCCAAGAGCCTGGACAACATCAAGGCCTACGCCATCGCCAAGAAGGCCGGCGTGCGCGCCGAGGGCCTGAGCGAGGAGGAGCAGGACTGCCTGTTCATCATCGAGACCTTCGAGCGCACCCTGACCCAGGACGGCATCGACAACTACAACGCCGACATCGGCAAGCTGAACACCGCCATCAACCTGTACAACCAGCAGAACAAGAAGCAGGAGGGCTTCCGCAAGGTGCCCCAGATGAAGTGCCTGTACAAGCAGATCCTGAGCGACCGCGAGGAGGCCTTCATCGACGAGTTCAGCGACGACGAGGACCTGATCACCAACATCGAGAGCTTCGCCGAGAACATGAACGTGTTCCTGAACAGCGAGATCATCACCGACTTCAAGAACGCCCTGGTGGAGAGCGACGGCAGCCTGGTGTACATCAAGAACGACGTGAGCAAGACCCTGTTCAGCAACATCGTGTTCGGCAGCTGGAACGCCATCGACGAGAAGCTGAGCGACGAGTACGACCTGGCCAACAGCAAGAAGAAGAAGGACGAGAagtactACGAGAAGCGCCAGAAGGAGCTGAAGAAGAACAAGAGCTACGACCTGGAGACCATCATCGGCCTGTTCGACGACAGCATCGACGTGATCGGCAAGTACATCGAGAAGCTGGAGAGCGACATCACCGCCATCGCCGAGGCCAAGAACGACTTCGACGAGATCGTGCTGCGCAAGCACGACAAGAACAAGAGCCTGCGCAAGAACACCAACGCCGTGGAGGCCATCAAGAGCTACCTGGACACCGTGAAGGACTTCGAGCGCGACATCAAGCTGATCAACGGCAGCGGCCAGGAGGTGGAGAAGAACCTGGTGGTGTACGCCGAGCAGGAGAACATCCTGGCCGAGATCAAGAACGTGGACAGCCTGTACAACATGAGCCGCAACTACCTGACCCAGAAGCCCTTCAGCACCGAGAAGTTCAAGCTGAACTTCGAGAACCCCACCCTGCTGAACGGCTGGGACCGCAACAAGGAGAAGGACTACCTGGGCATCCTGTTCGAGAAGGAGGGCATGTACTACCTGGGCATCATCAACAACAACCACCGCAAGATCTTCGAGAACGAGAAGCTGTGCACCGGCAAGGAGAGCTGCTTCAACAAGATCGTGTACAAGCAGATCAGCAACGCCGCCAAGTACCTGAGCAGCAAGCAGATCAACCCCCAGAACCCCCCCAAGGAGATCGCCGAGATCCTGCTGAAGCGCAAGGCCGACAGCAGCAGCCTGAGCCGCAAGGAGACCGAGCTGTTCATCGACTACCTGAAGGACGACTTCCTGGTGAACTACCCCATGATCATCAACAGCGACGGCGAGAACTTCTTCAACTTCCACTTCAAGCAGGCCAAGGACTACGGCAGCCTGCAGGAGTTCTTCAAGGAGGTGGAGCACCAGGCCTACAGCCTGAAGACCCGCCCCATCGACGACAGCTACATCTACCGCATGATCGACGAGGGCAAGCTGTACCTGTTCCAGATCCACAACAAGGACTTCAGCCCCTACAGCAAGGGCAACCTGAACCTGCACACCATCTACCTGCAGATGCTGTTCGACCAGCGCAACCTGAACAACGTGGTGTACAAGCTGAACGGCGAGGCCGAGGTGTTCTACCGCCCCGCCAGCATCAACGACGAGGAGGTGATCATCCACAAGGCCGGCGAGGAGATCAAGAACAAGAACAGCAAGCGCGCCGTGGACAAGCCCACCAGCAAGTTCGGCTACGACATCATCAAGGACCGCCGCTACAGCAAGGACAAGTTCATGCTGCACATCCCCGTGACCATGAACTTCGGCGTGGACGAGACCCGCCGCTTCAACGACGTGGTGAACGACGCCCTGCGCAACGACGAGAAGGTGCGCGTGATCGGCATCGACCGCGGCGAGCGCAACCTGCTGTACGTGGTGGTGGTGGACACCGACGGCACCATCCTGGAGCAGATCAGCCTGAACAGCATCATCAACAACGAGTACAGCATCGAGACCGACTACCACAAGCTGCTGGACGAGAAGGAGGGCGACCGCGACCGCGCCCGCAAGAACTGGACCACCATCGAGAACATCAAGGAGCTGAAGGAGGGCTACCTGAGCCAGGTGGTGAACGTGATCGCCAAGCTGGTGCTGAAGTACAACGCCATCATCTGCCTGGAGGACCTGAACTTCGGCTTCAAGCgcggccgcCAGAAGGTGGAGAAGCAGGTGTACCAGAAGTTCGAGAAGATGCTGATCGACAAGCTGAACTACCTGGTGATCGACAAGAGCCGCAAGCAGGAGAAGCCCGAGGAGTTCGGCGGCGCCCTGAACGCCCTGCAGCTGACCAGCAAGTTCACCAGCTTCAAGGACATGGGCAAGCAGACCGGCATCATCTACTACGTGCCCGCCTACCTGACCAGCAAGATCGACCCCACCACCGGCTTCGCCAACCTGTTCTACGTGAAGTACGAGAACGTGGAGAAGGCCAAGGAGTTCTTCAGCCGCTTCGACAGCATCAGCTACAACAACGAGAGCGGCTACTTCGAGTTCGCCTTCGACTACAAGAAGTTCACCGACCGCGCCTGCGGCGCCCGCAGCCAGTGGACCGTGTGCACCTACGGCGAGCGCATCATCAAGTACCGCAACGCCGACAAGAACAACAGCTTCGACGACAAGACCATCGTGCTGAGCGAGGAGTTCAAGGAGCTGTTCAGCATCTACGGCATCAGCTACGAGGACGGCGCCGAGCTGAAGAACAAGATCATGAGCGTGGACGAGGCCGACTTCTTCCGCTGCCTGACCGGCCTGCTGCAGAAGACCCTGCAGATGCGCAACAGCAGCAACGACGGCACCCGCGACTACATCATCAGCCCCATCATGAACGACCGCGGCGAGTTCTTCAACAGCGAGGCCTGCGACGCCAGCAAGCCCAAGGACGCCGACGCCAACGGCGCCTTCAACATCGCCCGCAAGGGCCTGTGGGTGCTGGAGCAGATCCGCAACACCCCCAGCGGCGACAAGCTGAACCTGGCCATGAGCAACGCCGAGTGGCTGGAGTACGCCCAGCGCAACCAGATC
>SEQ ID NO:24 PrCpf1核苷酸序列
ATGATCATCGGCCGCGACTTCAACATGTACTACCAGAACCTGACCAAGATGTACCCCATCAGCAAGACCCTGCGCAACGAGCTGATCCCCGTGGGCAAGACCCTGGAGAACATCCGCAAGAACGGCATCCTGGAGGCCGACATCCAGCGCAAGGCCGACTACGAGCACGTGAAGAAGCTGATGGACAACTACCACAAGCAGCTGATCAACGAGGCCCTGCAGGGCGTGCACCTGAGCGACCTGAGCGACGCCTACGACCTGTACTTCAACCTGAGCAAGGAGAAGAACAGCGTGGACGCCTTCAGCAAGTGCCAGGACAAGCTGCGCAAGGAGATCGTGAGCCTGCTGAAGAACCACGAGAACTTCCCCAAGATCGGCAACAAGGAGATCATCAAGCTGCTGCAGAGCCTGTACGACAACGACACCGACTACAAGGCCCTGGACAGCTTCAGCAACTTCTACACCTACTTCAGCAGCTACAACGAGGTGCGCAAGAACCTGTACAGCGACGAGGAGAAGAGCAGCACCGTGGCCTACCGCCTGATCAACGAGAACCTGCCCAAGTTCCTGGACAACATCAAGGCCTACGCCATCGCCAAGAAGGCCGGCGTGCGCGCCGAGGGCCTGAGCGAGGAGGACCAGGACTGCCTGTTCATCATCGAGACCTTCGAGCGCACCCTGACCCAGGACGGCATCGACAACTACAACGCCGCCATCGGCAAGCTGAACACCGCCATCAACCTGTTCAACCAGCAGAACAAGAAGCAGGAGGGCTTCCGCAAGGTGCCCCAGATGAAGTGCCTGTACAAGCAGATCCTGAGCGACCGCGAGGAGGCCTTCATCGACGAGTTCAGCGACGACGAGGACCTGATCACCAACATCGAGAGCTTCGCCGAGAACATGAACGTGTTCCTGAACAGCGAGATCATCACCGACTTCAAGATCGCCCTGGTGGAGAGCGACGGCAGCCTGGTGTACATCAAGAACGACGTGAGCAAGACCAGCTTCAGCAACATCGTGTTCGGCAGCTGGAACGCCATCGACGAGAAGCTGAGCGACGAGTACGACCTGGCCAACAGCAAGAAGAAGAAGGACGAGAagtactACGAGAAGCGCCAGAAGGAGCTGAAGAAGAACAAGAGCTACGACCTGGAGACCATCATCGGCCTGTTCGACGACAACAGCGACGTGATCGGCAAGTACATCGAGAAGCTGGAGAGCGACATCACCGCCATCGCCGAGGCCAAGAACGACTTCGACGAGATCGTGCTGCGCAAGCACGACAAGAACAAGAGCCTGCGCAAGAACACCAACGCCGTGGAGGCCATCAAGAGCTACCTGGACACCGTGAAGGACTTCGAGCGCGACATCAAGCTGATCAACGGCAGCGGCCAGGAGGTGGAGAAGAACCTGGTGGTGTACGCCGAGCAGGAGAACATCCTGGCCGAGATCAAGAACGTGGACAGCCTGTACAACATGAGCCGCAACTACCTGACCCAGAAGCCCTTCAGCACCGAGAAGTTCAAGCTGAACTTCAACCGCGCCACCCTGCTGAACGGCTGGGACAAGAACAAGGAGACCGACAACCTGGGCATCCTGTTCGAGAAGGACGGCATGTACTACCTGGGCATCATGAACACCAAGGCCAACAAGATCTTCGTGAACATCCCCAAGGCCACCAGCAACGACGTGTACCACAAGGTGAACTACAAGCTGCTGCCCGGCCCCAACAAGATGCTGCCCAAGGTGTTCTTCGCCCAGAGCAACCTGGACTACTACAAGCCCAGCGAGGAGCTGCTGGCCAAGTACAAGGCCGGCACCCACAAGAAGGGCGACAACTTCAGCCTGGAGGACTGCCACGCCCTGATCGACTTCTTCAAGGCCAGCATCGAGAAGCACCCCGACTGGAGCAGCTTCGGCTTCGAGTTCAGCGAGACCTGCACCTACGAGGACCTGAGCGGCTTCTACCGCGAGGTGGAGAAGCAGGGCTACAAGATCACCTACACCGACGTGGACGCCGACTACATCACCAGCCTGGTGGAGCGCGACGAGCTGTACCTGTTCCAGATCTACAACAAGGACTTCAGCCCCTACAGCAAGGGCAACCTGAACCTGCACACCATCTACCTGCAGATGCTGTTCGACCAGCGCAACCTGAACAACGTGGTGTACAAGCTGAACGGCGAGGCCGAGGTGTTCTACCGCCCCGCCAGCATCAACGACGAGGAGGTGATCATCCACAAGGCCGGCGAGGAGATCAAGAACAAGAACAGCAAGCGCGCCGTGGACAAGCCCACCAGCAAGTTCGGCTACGACATCATCAAGGACCGCCGCTACAGCAAGGACAAGTTCATGCTGCACATCCCCGTGACCATGAACTTCGGCGTGGACGAGACCCGCCGCTTCAACGACGTGGTGAACGACGCCCTGCGCAACGACGAGAAGGTGCGCGTGATCGGCATCGACCGCGGCGAGCGCAACCTGCTGTACGTGGTGGTGGTGGACACCGACGGCACCATCCTGGAGCAGATCAGCCTGAACAGCATCATCAACAACGAGTACAGCATCGAGACCGACTACCACAAGCTGCTGGACGAGAAGGAGGGCGACCGCGACCGCGCCCGCAAGAACTGGACCACCATCGAGAACATCAAGGAGCTGAAGGAGGGCTACCTGAGCCAGGTGGTGAACGTGATCGCCAAGCTGGTGCTGAAGTACAACGCCATCATCTGCCTGGAGGACCTGAACTTCGGCTTCAAGCgcggccgcCAGAAGGTGGAGAAGCAGGTGTACCAGAAGTTCGAGAAGATGCTGATCGACAAGCTGAACTACCTGGTGATCGACAAGAGCCGCAAGCAGGACAAGCCCGAGGAGTTCGGCGGCGCCCTGAACGCCCTGCAGCTGACCAGCAAGTTCACCAGCTTCAAGGACATGGGCAAGCAGACCGGCATCATCTACTACGTGCCCGCCTACCTGACCAGCAAGATCGACCCCACCACCGGCTTCGCCAACCTGTTCTACGTGAAGTACGAGAACGTGGAGAAGGCCAAGGAGTTCTTCAGCCGCTTCGACAGCATCAGCTACAACAACGAGAGCGGCTACTTCGAGTTCGCCTTCGACTACAAGAAGTTCACCGACCGCGCCTGCGGCGCCCGCAGCCAGTGGACCGTGTGCACCTACGGCGAGCGCATCATCAAGTTCCGCAACACCGAGAAGAACAACAGCTTCGACGACAAGACCATCGTGCTGAGCGAGGAGTTCAAGGAGCTGTTCAGCATCTACGGCATCAGCTACGAGGACGGCGCCGAGCTGAAGAACAAGATCATGAGCGTGGACGAGGCCGACTTCTTCCGCAGCCTGACCCGCCTGTTCCAGCAGACCATGCAGATGCGCAACAGCAGCAACGACGTGACCCGCGACTACATCATCAGCCCCATCATGAACGACCGCGGCGAGTTCTTCAACAGCGAGGCCTGCGACGCCAGCAAGCCCAAGGACGCCGACGCCAACGGCGCCTTCAACATCGCCCGCAAGGGCCTGTGGGTGCTGGAGCAGATCCGCAACACCCCCAGCGGCGACAAGCTGAACCTGGCCATGAGCAACGCCGAGTGGCTGGAGTACGCCCAGCGCAACCAGATC
Claims (17)
1.一种用于对细胞基因组中的靶序列进行定点修饰的基因组编辑系统,其包含以下i)至v)中至少一项:
i)Cpf1蛋白,和向导RNA;
ii)包含编码Cpf1蛋白的核苷酸序列的表达构建体,和向导RNA;
iii)Cpf1蛋白,和包含编码向导RNA的核苷酸序列的表达构建体;
iv)包含编码Cpf1蛋白的核苷酸序列的表达构建体,和包含编码向导RNA的核苷酸序列的表达构建体;
v)包含编码Cpf1蛋白的核苷酸序列和编码向导RNA的核苷酸序列的表达构建体;
其中所述Cpf1蛋白包含SEQ ID NO:1-12的氨基酸序列或与SEQ ID NO:1-12之一具有至少80%序列相同性的氨基酸序列,所述向导RNA能够将所述Cpf1蛋白靶向所述细胞基因组中的靶序列。
2.权利要求1的系统,其中所述Cpf1蛋白包含与SEQ ID NO:1-12之一具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%、或100%序列相同性的氨基酸序列。
3.权利要求1的系统,其中所述Cpf1蛋白包含相对于SEQ ID NO:1-12之一具有一或多个氨基酸残基取代、缺失或添加的氨基酸序列,例如,所述Cpf1蛋白包含相对于SEQ ID NO:1-12之一具有1个、2个、3个、4个、5个、6个、7个、8个、9个、10个氨基酸残基取代、缺失或添加的氨基酸序列。
4.权利要求1的系统,其中所述Cpf1蛋白来自选自以下的物种:Agathobacterrectalis、Lachnospira pectinoschiza、Sneathia amnii、Helcococcus kunzii、Arcobacter butzleri、Bacteroidetes oral、Oribacterium sp.、Butyrivibrio sp.、Proteocatella sphenisci、Candidatus Dojkabacteria、Pseudobutyrivibrioxylanivorans和Pseudobutyrivibrio ruminis。
5.权利要求1的系统,所述Cpf1蛋白包含选自SEQ ID NO:1、SEQ ID NO:4、SEQ ID NO:8和SEQ ID NO:11所示的氨基酸序列。
6.权利要求1的系统,其中所述编码Cpf1蛋白的核苷酸序列针对待进行定点修饰的细胞所来自的生物体进行密码子优化。
7.权利要求1的系统,其中所述编码Cpf1蛋白的核苷酸序列选自SEQ ID NO:13-24。
8.权利要求1的系统,其中所述向导RNA是crRNA。
9.权利要求1的系统,所述靶序列具有以下结构:5’-TYYN-NX-3’或5’-YYN-NX-3’,其中N独立地选自A、G、C和T;Y选自C和T;X为15≤X≤35的整数;Nx表示x个连续的核苷酸。
10.一种对细胞基因组中的靶序列进行定点修饰的方法,包括将权利要求1-9中任一项的系统导入细胞中,由此所述向导RNA将所述Cpf1蛋白靶向所述细胞基因组中的靶序列,导致所述靶序列中的一或多个核苷酸取代、缺失和/或添加。
11.权利要求10的方法,其中所述细胞来自哺乳动物如人、小鼠、大鼠、猴、犬、猪、羊、牛、猫;家禽如鸡、鸭、鹅;植物,包括单子叶植物和双子叶植物,例如水稻、玉米、小麦、高粱、大麦、大豆、花生、拟南芥等。
12.权利要求10-11中任一项的方法,其中所述系统通过选自以下的方法导入所述细胞:磷酸钙转染、原生质融合、电穿孔、脂质体转染、微注射、病毒感染(如杆状病毒、痘苗病毒、腺病毒、腺相关病毒、慢病毒和其他病毒)、基因枪法、PEG介导的原生质体转化、土壤农杆菌介导的转化。
13.一种治疗有需要的对象中的疾病的方法,包括向所述对象递送有效量的权利要求1-9中任一项的基因组编辑系统以修饰所述对象中与所述疾病相关的基因。
14.权利要求1-9中任一项的基因组编辑系统在制备用于治疗有需要的对象中的疾病的药物组合物中的用途,其中所述基因组编辑系统用于修饰所述对象中与所述疾病相关的基因。
15.一种用于治疗有需要的对象中的疾病的药物组合物,其包含权利要求1-9中任一项的基因组编辑系统和药学可接受的载体,其中所述基因组编辑系统用于修饰所述对象中与所述疾病相关的基因。
16.权利要求13-15的方法、用途或药物组合物,其中所述对象是哺乳动物,例如人。
17.权利要求16的方法、用途或药物组合物,其中所述疾病选自肿瘤、炎症、帕金森病、心血管疾病、阿尔茨海默病、自闭症、药物成瘾、年龄相关性黄斑变性、精神分裂症和遗传性疾病。
Priority Applications (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710228595.XA CN108690845B (zh) | 2017-04-10 | 2017-04-10 | 基因组编辑系统和方法 |
US16/603,975 US11702643B2 (en) | 2017-04-10 | 2018-04-10 | System and method for genome editing |
JP2019555026A JP7100057B2 (ja) | 2017-04-10 | 2018-04-10 | ゲノム編集のためのシステム及び方法 |
PCT/CN2018/082446 WO2018188571A1 (zh) | 2017-04-10 | 2018-04-10 | 基因组编辑系统和方法 |
EP18783675.4A EP3611258A4 (en) | 2017-04-10 | 2018-04-10 | SYSTEM AND METHOD FOR GENOMEDITATION |
JP2022064162A JP2022105015A (ja) | 2017-04-10 | 2022-04-07 | ゲノム編集のためのシステム及び方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710228595.XA CN108690845B (zh) | 2017-04-10 | 2017-04-10 | 基因组编辑系统和方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108690845A true CN108690845A (zh) | 2018-10-23 |
CN108690845B CN108690845B (zh) | 2021-04-27 |
Family
ID=63793114
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710228595.XA Active CN108690845B (zh) | 2017-04-10 | 2017-04-10 | 基因组编辑系统和方法 |
Country Status (5)
Country | Link |
---|---|
US (1) | US11702643B2 (zh) |
EP (1) | EP3611258A4 (zh) |
JP (2) | JP7100057B2 (zh) |
CN (1) | CN108690845B (zh) |
WO (1) | WO2018188571A1 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111235130A (zh) * | 2019-11-15 | 2020-06-05 | 武汉大学 | II类V型CRISPR蛋白CeCas12a及其在基因编辑的应用 |
CN112301018A (zh) * | 2019-05-14 | 2021-02-02 | 深圳华大生命科学研究院 | 新型的Cas蛋白、Crispr-Cas系统及其在基因编辑领域中的用途 |
WO2023029492A1 (zh) * | 2021-09-03 | 2023-03-09 | 中国科学院动物研究所 | 一种外源基因定点整合的系统及方法 |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2963820A1 (en) | 2014-11-07 | 2016-05-12 | Editas Medicine, Inc. | Methods for improving crispr/cas-mediated genome-editing |
EP3294896A1 (en) | 2015-05-11 | 2018-03-21 | Editas Medicine, Inc. | Optimized crispr/cas9 systems and methods for gene editing in stem cells |
JP7396783B2 (ja) | 2015-06-09 | 2023-12-12 | エディタス・メディシン、インコーポレイテッド | 移植を改善するためのcrispr/cas関連方法および組成物 |
US11667911B2 (en) | 2015-09-24 | 2023-06-06 | Editas Medicine, Inc. | Use of exonucleases to improve CRISPR/CAS-mediated genome editing |
US11597924B2 (en) | 2016-03-25 | 2023-03-07 | Editas Medicine, Inc. | Genome editing systems comprising repair-modulating enzyme molecules and methods of their use |
EP4047092A1 (en) | 2016-04-13 | 2022-08-24 | Editas Medicine, Inc. | Cas9 fusion molecules, gene editing systems, and methods of use thereof |
KR102677877B1 (ko) * | 2016-12-22 | 2024-06-25 | 키진 엔.브이. | 이중나선 dna의 표적화된 변형 방법 |
EP4095263A1 (en) | 2017-01-06 | 2022-11-30 | Editas Medicine, Inc. | Methods of assessing nuclease cleavage |
CN108690845B (zh) * | 2017-04-10 | 2021-04-27 | 中国科学院动物研究所 | 基因组编辑系统和方法 |
WO2018201086A1 (en) | 2017-04-28 | 2018-11-01 | Editas Medicine, Inc. | Methods and systems for analyzing guide rna molecules |
WO2018227114A1 (en) | 2017-06-09 | 2018-12-13 | Editas Medicine, Inc. | Engineered cas9 nucleases |
US11866726B2 (en) | 2017-07-14 | 2024-01-09 | Editas Medicine, Inc. | Systems and methods for targeted integration and genome editing and detection thereof using integrated priming sites |
CN116601293A (zh) * | 2020-12-07 | 2023-08-15 | 中国科学院动物研究所 | 工程化的Cas效应蛋白及其使用方法 |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105907785A (zh) * | 2016-05-05 | 2016-08-31 | 苏州吉玛基因股份有限公司 | 化学合成的crRNA用于CRISPR/Cpf1系统在基因编辑中的应用 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
MX2017014561A (es) | 2015-05-15 | 2018-03-02 | Pioneer Hi Bred Int | Nuevos sistemas de arn guia/endonucleasa cas. |
US9790490B2 (en) | 2015-06-18 | 2017-10-17 | The Broad Institute Inc. | CRISPR enzymes and systems |
CA2990699A1 (en) | 2015-06-29 | 2017-01-05 | Ionis Pharmaceuticals, Inc. | Modified crispr rna and modified single crispr rna and uses thereof |
CN108690845B (zh) * | 2017-04-10 | 2021-04-27 | 中国科学院动物研究所 | 基因组编辑系统和方法 |
-
2017
- 2017-04-10 CN CN201710228595.XA patent/CN108690845B/zh active Active
-
2018
- 2018-04-10 WO PCT/CN2018/082446 patent/WO2018188571A1/zh unknown
- 2018-04-10 JP JP2019555026A patent/JP7100057B2/ja active Active
- 2018-04-10 EP EP18783675.4A patent/EP3611258A4/en active Pending
- 2018-04-10 US US16/603,975 patent/US11702643B2/en active Active
-
2022
- 2022-04-07 JP JP2022064162A patent/JP2022105015A/ja active Pending
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105907785A (zh) * | 2016-05-05 | 2016-08-31 | 苏州吉玛基因股份有限公司 | 化学合成的crRNA用于CRISPR/Cpf1系统在基因编辑中的应用 |
Non-Patent Citations (3)
Title |
---|
佚名: "登录号:WP_005398606", 《GENBANK》 * |
佚名: "登录号:WP_052943011", 《GENBANK》 * |
佚名: "登录号:WP_055272206", 《GENBANK》 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112301018A (zh) * | 2019-05-14 | 2021-02-02 | 深圳华大生命科学研究院 | 新型的Cas蛋白、Crispr-Cas系统及其在基因编辑领域中的用途 |
CN112301018B (zh) * | 2019-05-14 | 2023-07-25 | 深圳华大生命科学研究院 | 新型的Cas蛋白、Crispr-Cas系统及其在基因编辑领域中的用途 |
CN111235130A (zh) * | 2019-11-15 | 2020-06-05 | 武汉大学 | II类V型CRISPR蛋白CeCas12a及其在基因编辑的应用 |
WO2023029492A1 (zh) * | 2021-09-03 | 2023-03-09 | 中国科学院动物研究所 | 一种外源基因定点整合的系统及方法 |
Also Published As
Publication number | Publication date |
---|---|
EP3611258A1 (en) | 2020-02-19 |
WO2018188571A1 (zh) | 2018-10-18 |
JP7100057B2 (ja) | 2022-07-12 |
US11702643B2 (en) | 2023-07-18 |
JP2022105015A (ja) | 2022-07-12 |
CN108690845B (zh) | 2021-04-27 |
EP3611258A4 (en) | 2020-05-06 |
US20200149021A1 (en) | 2020-05-14 |
JP2020516255A (ja) | 2020-06-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108690845A (zh) | 基因组编辑系统和方法 | |
CN111094561B (zh) | 靶特异性crispr变体 | |
CN108546716A (zh) | 一种基因组编辑方法 | |
CN106922154A (zh) | 使用空肠弯曲杆菌crispr/cas系统衍生的rna引导的工程化核酸酶的基因编辑 | |
CN107922931A (zh) | 热稳定的Cas9核酸酶 | |
CN105646719A (zh) | 一种高效定点转基因的工具及其应用 | |
US20240209332A1 (en) | Enzymes with ruvc domains | |
CN109689875A (zh) | 基因组编辑系统和方法 | |
CN109136248A (zh) | 多靶点编辑载体及其构建方法和应用 | |
CN108130342A (zh) | 基于Cpf1的植物基因组定点编辑方法 | |
WO2020224611A1 (en) | Improved gene editing system | |
CN106086031B (zh) | 猪肌抑素基因编辑位点及其应用 | |
JP7361109B2 (ja) | C2c1ヌクレアーゼに基づくゲノム編集のためのシステムおよび方法 | |
WO2023169454A1 (zh) | 腺嘌呤脱氨酶及其在碱基编辑中的用途 | |
CA3228222A1 (en) | Class ii, type v crispr systems | |
CN110106173A (zh) | 改进的基因组编辑方法 | |
JP2024532544A (ja) | 改善させたプライム編集システム | |
CN106754949B (zh) | 猪肌抑素基因编辑位点864-883及其应用 | |
WO2024051850A1 (zh) | 基于dna聚合酶的基因组编辑系统和方法 | |
CN114026226B (zh) | 靶特异性crispr突变体 | |
EP4392561A1 (en) | Enzymes with ruvc domains | |
JP2024501892A (ja) | 新規の核酸誘導型ヌクレアーゼ | |
KR20220150363A (ko) | 개선된 사이토신 염기 편집 시스템 | |
CN115052980B (zh) | 衍生自黄杆菌的基因编辑系统 | |
Chapman | Evaluation of CRISPR-Based Approaches for Precision Poultry Gene Modulation and Delivery |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1261024 Country of ref document: HK |
|
GR01 | Patent grant | ||
GR01 | Patent grant |